GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS ECOFABE 1229 bp ds-DNA BCT 22-FEB-1990
DEFINITION E.coli biotin carboxyl carrier protein (fabE) gene, complete cds.
ACCESSION M32214
KEYWORDS biotin carboxyl carrier protein.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1229)
AUTHORS Alix,J.-H.
TITLE A rapid procedure for cloning genes from lambda libraries by
complementation of E.coli defective mutants: Application to the
fabE region of the E.coli chromosome
JOURNAL DNA 8, 779-789 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 269 739 biotin carboxyl carrier protein (fabE)
BASE COUNT 321 a 324 c 318 g 266 t
ORIGIN
1 cgacctcgtc ctccctgacg cagtttttgc gctgcggaaa aggtgacatt ggcgcaacga
61 aggtatattt tgttttttgc cggaggatag cagcagatcg ctgcacaatg tccgtcaagt
121 ctaacattga cactctgggg caaaatagac cggcgtcccg gcctgctgga atttatcgct
181 atgcatacag ctgtcggggc atacgcttta cagacggcgg tgaaacgcct gtcacaatca
241 cactaaacaa agagtacgga acccactcat ggatattcgt aagattaaaa aactgatcga
301 gctggttgaa gaatcaggca tctccgaact ggaaatttct gaaggcgaag agtcagtacg
361 cattagccgt gcagctcctg ccgcaagttt ccctgtgatg caacaagctt acgctgcacc
421 aatgatgcag cagccagctc aatctaacgc agccgctccg gcgaccgttc cttccatgga
481 agcgccagca gcagcggaaa tcagtggtca catcgtacgt tccccgatgg ttggtacttt
541 ctaccgcacc ccaagcccgg acgcaaaagc gttcatcgaa gtgggtcaga aagtcaacgt
601 gggcgatacc ctgtgcatcg ttgaagccat gaaaatgatg aaccagatcg aagcggacaa
661 atccggtacc gtgaaagcaa ttctggtcga aagtggacaa ccggtagaat ttgacgagcc
721 gctggtcgtc atcgagtaac gaggcgaaca tgctggataa aattgttatt gccaaccgcg
781 gcgagattgc attgcgtatt cttcgtgcct gtaaagaact gggcatcaag actgtcgctg
841 tgcactccag cgcggatcgc gatctaaaac acgtattact ggcagatgaa acggtctgta
901 ttggccctgc tccgtcagta aaaagttatc tgaacatccc ggcaatcatc agcgccgctg
961 aaatcaccgg cgcagtagca atccatccgg gttacggctt cctctccgag aacgccaact
1021 ttgccgagca ggttgaacgc tccggcttta tcttcattgg cccgaaagca gaaaccattc
1081 gcctgatggg cgacaaagta tccgcaatcg cggcgatgaa aaaagcgggc gtcccttgcg
1141 taccgggttc tgacgcgccg ctgggcgacg atatggataa aaaccgtgcc attgctaaac
1201 gcattggtta tccggtgatt atcaaaccg
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS VACWRA 2955 bp ds-DNA VRL 13-FEB-1990
DEFINITION Vaccinia virus abortive late protein gene, complete cds.
ACCESSION M32064
KEYWORDS abortive late protein.
SOURCE Vaccinia virus (strain WR) DNA.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 2955)
AUTHORS Pacha,R.F., Meis,R.J. and Condit,R.C.
TITLE Structure and expression of the vaccinia virus gene which prevents
virus-induced breakdown of RNA
JOURNAL Unpublished (1990) D. of Biochem., SUNY-Buffalo, Buffalo, NY 14214
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.C.Condit, 12-FEB-1990.
FEATURES from to/span description
pept 825 < 1 (c) 32 kD protein
pept 1439 828 (c) 23 kD protein
pept 1454 2935 56 kD abortive late protein
mRNA 1028 > 2955 56 kD abortive late protein early mRNA
mRNA < 2955 1442 (c) 23 kD protein late mRNA
BASE COUNT 1019 a 536 c 474 g 926 t
ORIGIN About 120 kb from left end of genome.
1 tctagattcg tcggtgcact catgtaacca acatacccta ggtcctaaat atttatctcc
61 ggaattagat tttggataat tcgcgcacca acaatttcta tttcctttat gatcgttaca
121 aaagacgtat aatgccgtat ccccaaaagt aaaataatca ggacgaataa ttctaataaa
181 ctcagaacaa tatctcgcat ccatatgttt ggagcaaata tcggaataag tagacatagc
241 cggtttccgt tttgcacgta accattctaa acaattgggg tttccaggat cgtttctaca
301 aaatccagtc atgaaatcgt cacaatgttc tgtcttgtaa ttattattaa atatttttgg
361 acagtgtttg gtatttgtct tagaacaaca ttttgccacg ctatcactat cgcccaggag
421 ataatccttt tttataaaat gacatcgttg cccggatgct atataatcag tagcgtgttt
481 taaatcctta atatattcag gagttacctc gttctgataa tagattaatg atccaggacg
541 aaatttgaaa gaactacatg gttctccatg aattaataca tattgtttag caaattcagg
601 aactataaaa ctactacaat gatctatcga cataccatct atcaaacaaa acttgggttt
661 aatttctccc ggagatgttt cataatagta cgtataactt tcttctgcaa acttaacagc
721 tctattatat tcaggataat taaaacctaa ttccatatat ttgtctcgta tatctgctat
781 tcctggtgct attttgattc tattaagagt aacagctgcc cccattctta ataatcgtca
841 gtatttaaac tgttaaatgt tggtatatca acatctacct tatttcccgc agtataaggt
901 ttgttgcagg tatactgttc aggaatggtt acatttatac ttcttctata gtcctgtctt
961 tcgatgttca tcacatatgc aaagaacaga ataaacaaaa taatgtaaga aataatatta
1021 aatatctgtg aattcgtaaa tacattgatt gccataataa ttacagcagc tacaatacac
1081 acaatagaca ttcccacagt gttgccatta cctccacgat acatttgagt tactaagcaa
1141 taggtaataa ctaagctagt aagaggcaat agaaaagatg agataaatat catcaatata
1201 gagattagag gagggctata tagagccaag acgaacaaaa tcaaaccgag taacgttcta
1261 acatcattat ttttgaagat tcccaaataa tcattcattc ctccataatc gttttgcatc
1321 atacctccat ctttaggcat aaacgattgc tgctgttcct ctgtaaataa atctttatca
1381 agcactccag cacccgcaga gaagtcgtca agcatattgt aatatcttaa ataactcatt
1441 tatatattaa aaaatgtcac tattaaagat ggagtataat ctttatgccg aactaaaaaa
1501 aatgacttgt ggtcaacccc taagtctttt taacgaagac ggggatttcg tagaagttga
1561 accgggatca tcctttaagt ttctgatacc taagggattt tacgcctctc cttccgtaaa
1621 gacgagtcta gtatttgaaa cattaacaac gaccgataat aaaatcacta gtatcaatcc
1681 aacaaatgcg ccaaagttat atcctcttca acgcaaagtc gtatctgaag tagtttctaa
1741 tatgaggaaa atgatcgaat caaaacgtcc tctatacatt actcttcact tggcgtgtgg
1801 atttggtaag actattacca cgtgttatct tatggctaca cacggtagaa aaaccgtcat
1861 ttgcgtaccc aataaaatgt taatacatca atggaagaca caggtagagg cagtcggatt
1921 ggaacataag atatccatag atggagtaag tagtctatta aaggaactaa agactcaaag
1981 tccggatgta ttaatagtag tcagtagaca tctgacaaac gatgcctttt gtaaatatat
2041 caataagcat tatgatttgt tcatcttgga tgaatcacat acgtataatc tgatgaacaa
2101 tacagcagtt acaagatttt tagcgtatta tcctccgatg atgtgttatt ttttaactgc
2161 tacacctaga ccagctaacc gaatttattg taacagtatt attaatattg ccaagttatc
2221 cgatctaaaa aaaactatct atgcggtaga tagttttttt gagccatatt ccacagacaa
2281 tattagacat atggtaaaac gactagatgg accatctaat aaatatcata tatataccga
2341 gaagttatta tctgtagacg agcctagaaa tcaacttatt cttaataccc tggtagaaga
2401 attcaagtca ggaactatta atcgcatttt agttattact aaactacgtg aacatatggt
2461 attattctac aaacgattat tagatctttt cggaccagag gttgtattta taggagacgc
2521 ccaaaataga cgtactccag atatggtcaa atcaatcaag gaactaaata gatttatatt
2581 cgtatccacc ttattttatt ccggtactgg tttagatatt cctagtttgg attcgttgtt
2641 catttgctcg gcagtaatca acaatatgca aatagagcaa ttactaggga gggtatgtcg
2701 agaaacagaa ctattagata ggacggtata tgtatttcct aacacatcca tcaaagaaat
2761 aaagtacatg ataggaaatt tcatgcaacg aattattagt ctgtctgtag ataaactagg
2821 atttaaacaa gaaagttatc ggaaacatca agaatccgat cccacttctg tatgtacaac
2881 atcctccaga gaagaacgtg tattaaatag aatatttaac tcgcaaaatc gttaagaagt
2941 ttaagcgacg atccg
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS PSECAMAB 3825 bp ds-DNA BCT 13-FEB-1990
DEFINITION P.putida putidaredoxin reductase (camA) and putidaredoxin (camB)
genes, complete cds.
ACCESSION J05406
KEYWORDS electron transfer protein; flavoprotein reductase;
iron-sulfur protein; putidaredoxin; putidaredoxin reductase.
SOURCE P.putida (strain ATCC 17453) DNA, clone pBS5.
ORGANISM Pseudomonas putida
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Acidiphilium cryptum; Pseudomonadaceae.
REFERENCE 1 (bases 1 to 3825)
AUTHORS Peterson,J.A., Lorence,M.C. and Amarneh,B.
TITLE Putidaredoxin reductase and putidaredoxin: Cloning, sequence
determination, and heterologous expression of the proteins
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Peterson, 07-FEB-1990.
FEATURES from to/span description
pept 47 1315 putidaredoxin reductase
pept 1371 1694 putidaredoxin
mRNA < 1 1842 camA/camB mRNA (3' end +/- 11 bp)
binding 32 36 ribosome binding site
binding 77 106 FAD binding domain (pot.)
binding 511 541 NADH binding domain (pot.)
binding 1358 1363 ribosome binding site
BASE COUNT 855 a 1086 c 1135 g 749 t
ORIGIN
1 tccggcgact accaaagcgg tataaacaca tgggagtgcg tgctaagtga acgcaaacga
61 caacgtggtc atcgtcggta ccggactggc tggcgttgag gtcgccttcg gcctgcgcgc
121 cagcggctgg gaaggcaata tccggttggt gggggatgcg acggtaattc cccatcacct
181 accaccgcta tccaaagctt acttggccgg caaagccaca gcggaaagcc tgtacctgag
241 aaccccagat gcctatgcag cgcagaacat ccaactactc ggaggcacac aggtaacggc
301 tatcaaccgc gaccgacagc aagtaatcct atcggatggc cgggcactgg attacgaccg
361 gctggtattg gctaccggag ggcgtccaag acccctaccg gtggccagtg gcgcagttgg
421 aaaggcgaac aactttcgat acctgcgcac actcgaggac gccgagtgca ttcgccggca
481 gctgattgcg gataaccgtc tggtggtgat tggtggcggc tacattggcc ttgaagtggc
541 tgccaccgcc atcaaggcga acatgcacgt caccctgctt gatacggcag cccgggttct
601 ggagcgggtt accgccccgc cggtatcggc cttttacgag cacctacacc gcgaagccgg
661 cgttgacata cgaaccggca cgcaggtgtg cgggttcgag atgtcgaccg accaacagaa
721 ggttactgcc gtcctctgcg aggacggcac aaggctgcca gcggatctgg taatcgccgg
781 gattggcctg ataccaaact gcgagttggc cagtgcggcc ggcctgcagg ttgataacgg
841 catcgtgatc aacgaacaca tgcagacctc tgatcccttg atcatggccg tcggcgactg
901 tgcccgattt cacagtcagc tctatgaccg ctgggtgcgt atcgaatcgg tgcccaatgc
961 cttggagcag gcacgaaaga tcgccgccat cctctgtggc aaggtgccac gcgatgaggc
1021 ggcgccctgg ttctggtccg atcagtatga gatcggattg aagatggtcg gactgtccga
1081 agggtacgac cggatcattg tccgcggctc tttggcgcaa cccgacttca gcgttttcta
1141 cctgcaggga gaccgggtat tggcggtcga tacagtgaac cgtccagtgg agttcaacca
1201 gtcaaaacaa ataatcacgg atcgtttgcc ggttgaacca aacctactcg gtgacgaaag
1261 cgtgccgtta aaggaaatca tcgccgccgc caaagctgaa ctgagtagtg cctgaaatct
1321 atacccacaa taaatcaccg ttttgcccca tagcgtgtga ggataaacag atgtctaaag
1381 tagtgtatgt gtcacatgat ggaacgcgtc gcgaactgga tgtggcggat ggcgtcagcc
1441 tgatgcaggc tgcagtctcc aatggtatct acgatattgt cggtgattgt ggcggcagcg
1501 ccagctgtgc cacctgccat gtctatgtga acgaagcgtt cacggacaag gtgcccgccg
1561 ccaacgagcg ggaaatcggc atgctggagt gcgtcacggc cgaactgaag ccgaacagca
1621 ggctctgctg ccagatcatc atgacgcccg agctggatgg catcgtggtc gatgttcccg
1681 ataggcaatg gtaaaccaca atggtaaacc actgcgagcc aaaacagccg agcaggagcg
1741 cagtccggca acaccttatt aagcacatgc cgaaccctat ttgcagcgct tcatgcctgc
1801 aaagtcccga ttgatgaaat ccgggctcca agcaaggagc ccggaatctc tcaccgccac
1861 gaaatcaatg gccaatcccg ggacttggtt gagacgtccg tttctccact acgactttgt
1921 cgcaatgatc tcctcatgct cagatccgaa ggatctctat agatcataca gccaggctgt
1981 atctggagaa gtccaaggtt catctggtct gtgatgccaa tgactcgccc tcaagcctcg
2041 ttttgtcagt gagagtagtc acctgatgca acagcgtctc actcaacgcc tgcgccgcgc
2101 tggatagctg atgcccagcc tggtgcagca ggccgacgcg acgcgacacc tgtggctcgc
2161 tcagcggtag gcagcgggcg cccagttctt ccatctgctg ccggcacagt tgcggcacgg
2221 cgctgacgcc gagtgaacgc aaacgacaac gtggtcatcg tcggtaccgg actggctggc
2281 gttgaggtcg ccttcggcct gcgcgccagc ggctgggaag gcaatatccg gttggtgggg
2341 gatgcgacgg taattcccca tcacctacca ccgctatcca aagcttactt ggccggcaaa
2401 gccacagcgg aaagcctgta cctgagaacc ccagatgcct atgcagcgca gaacatccaa
2461 ctactcggag gcacacaggt aacggctatc aaccgcgacc gacagcaagt aatcctatcg
2521 gatggccggg cactggatta cgaccggctg gtattggcta ccggagggcg tccaagaccc
2581 ctaccggtgg ccagtggcgc agttggaaag gcgaacaact ttcgatacct gcgcacactc
2641 gaggacgccg agtgcattcg ccggcagctg attgcggata accgtctggt ggtgattggt
2701 ggcggctaca ttggccttga agtggctgcc accgccatca aggcgaacat gcacgtcacc
2761 ctgcttgata cggcagcccg ggttctggag cgggttaccg ccccgccggt atcggccttt
2821 tacgagcacc tacaccgcga agccggcgtt gacatacgaa ccggcacgca ggtgtgcggg
2881 ttcgagatgt cgaccgacca acagaaggtt actgccgtcc tctgcgagga cggcacaagg
2941 ctgccagcgg atctggtaat cgccgggatt ggcctgatac caaactgcga gttggccagt
3001 gcggccggcc tgcaggttga taacggcatc gtgatcaacg aacacatgca gacctctgat
3061 cccttgatca tggccgtcgg cgactgtgcc cgatttcaca gtcagctcta tgaccgctgg
3121 gtgcgtatcg aatcggtgcc caatgccttg gagcaggcac gaaagatcgc cgccatcctc
3181 tgtggcaagg tgccacgcga tgaggcggcg ccctggttct ggtccgatca gtatgagatc
3241 ggattgaaga tggtcggact gtccgaaggg tacgaccgga tcattgtccg cggctctttg
3301 gcgcaacccg acttcagcgt tttctacctg cagggagacc gggtattggc ggtcgataca
3361 gtgaaccgtc cagtggagtt caaccagtca aaacaaataa tcacggatcg tttgccggtt
3421 gaaccaaacc tactcggtga cgaaagcgtg ccgttaaagg aaatcatcgc cgccgccaaa
3481 gctgaactga gtagtgcctg aatgtctaaa gtagtgtatg tgtcacatga tggaacgcgt
3541 cgcgaactgg atgtggcgga tggcgtcagc ctgatgcagg ctgcagtctc caatggtatc
3601 tacgatattg tcggtgattg tggcggcagc gccagctgtg ccacctgcca tgtctatgtg
3661 aacgaagcgt tcacggacaa ggtgcccgcc gccaacgagc gggaaatcgg catgctggag
3721 tgcgtcacgg ccgaactgaa gccgaacagc aggctctgct gccagatcat catgacgccc
3781 gagctggatg gcatcgtggt cgatgttccc gataggcaat ggtaa
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS CHKAK1 6300 bp ds-DNA VRT 15-MAR-1989
DEFINITION Chicken cytosolic adenylate kinase (AK1) gene, complete cds and
flanks.
ACCESSION D00251
KEYWORDS AK1 gene; adenylate kinase; cytosolic adenylate kinase.
SOURCE Chicken liver DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 6300)
AUTHORS Suminami,Y., Kishi,F., Torigoe,T. and Nakazawa,A.
TITLE Structure and complete nucleotide sequence of the gene encoding
chicken cytosolic adenylate kinase
JOURNAL J. Biochem. 103, 611-617 (1988)
STANDARD full staff_entry
COMMENT The chicken AK1 gene spanned about 6 kbp and consisted of 7 exons.
The transcription initiation site was determined by the S1 nuclease
mapping analysis. Evolution of the AK1 gene is discussed in
comparison with the isozyme AK2. corresponds to DDBJ (rel4.0)
entry <CHKAK1>(D00251) dated 12-SEP-1988
FEATURES from to/span description
pept 2612 2621 adenylate kinase (a part of exon 2)
4056 4091 adenylate kinase (exon 3)
4478 4641 adenylate kinase (exon 4)
4754 4870 adenylate kinase (exon 5)
5382 5573 adenylate kinase (exon 6)
5689 5754 adenylate kinase (exon 7)
pre-msg 1070 5914 AK1 mRNA and introns
IVS 1121 2578 AK1 intron a
IVS 2622 4055 AK1 intron b
IVS 4092 4477 AK1 intron c
IVS 4642 4753 AK1 intron d
IVS 4871 5381 AK1 intron e
IVS 5574 5688 AK1 intron f
signal 973 976 CAT box
signal 1041 1047 TATA box
signal 5896 5901 polyadenylation signal
BASE COUNT 998 a 2011 c 1904 g 1387 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcccgat tgcttcagag tccccttggt ttggggtccg ggctgcactg tgcaccccag
61 ggctgataca gcatcagggc cgtgtgcacc gctgctgccc atgggtctgc acagccacga
121 catggcaggt tttagggtgt ctcatcccct cctgcaggac gtgggaaccc tgcatggtcc
181 caactcagaa cctcccaggc tgcgttccat caccctgaga gcacacgggg ggtaggtgaa
241 tggtcccctt tggctggcat gggggtcacg ttgctttggg ggcaaaattt tagccccacc
301 cccccccccc acctgtcccc cagcacaaat cctcacattt tcagtgtcac catccccaga
361 gcagagacct gagagccacc ccagttagac caaggggatg acactgccaa cgtggtgccg
421 ggagggggga ggtggggagg aggggggggg caataccaca cagcagctgc cggaaccttt
481 tgtaacctgg ccatggggcg tggggggatt ttttccctcc tggtcccggt gtttggattc
541 ccactgtgga gcccacgatg ctcatcactc agcagcccct cacgaaccat tcagaagggt
601 ttggacgctg agccccattg ggtggcagcc ccgtgcctca gtttccccct cggccgtggc
661 agcggggggg gggggcgtcc ggtggggtca caccttgggg ggaggggggg gctcagcacc
721 gtgttttgta tcgatttgta caggaataaa cctcgctgca gtctgttgcc ttggaccgtg
781 ctgcgggatg catctctggg gtccgggggg gagggaaagg gaccccgtag gacccccccc
841 catcccgagc ggggctgagg tgggcggttg ggggctgcag gacctcccag ctctacccgt
901 catccgcggg ctcctcccgt tgcgggggcg cggtaccggg gggctgcggg ggggctgcgg
961 gcgggcgggc ggcattgccc ccccccacag tgccccccct ccccgctgcc cgctcccggc
1021 atgccccgcg ccgccagcat tatatatatc tagcggtacg gcagaagctg ccggccagca
1081 gcagccgccc cccccgtatt ctgcaccgcc ccccccgcag gtcagtgccg ccccccccag
1141 ccctgctgct ggtttgcacc gcggttcgtg tcccaacccc cccccccaga tctcccccct
1201 ggtccccccc ggcacagtct cgcggtgacc ttggggaggg cgttccggca tccggcttcg
1261 gggccccccc cgctgccccc cgcccgacca ttctcgcgtg ggggcgagac gcgatcggtc
1321 cgtccccccc tctccgtgcg ttgagctgtg gggatggggg tgcggggggg gggatgtgaa
1381 catcgctgcg ggtgtgcggg acccgggggg ggagaaggga gaggagcggt ggggagcacc
1441 ggggggcatt gcaggggtgc tgtgctattg cggggggtgc tgagctatcg gggggcgccg
1501 gggtgcggcc cttgggctgg agtccccaca tgggaggtgg gtgcgggcaa agcgtgcggt
1561 tggagctcct ggtgggctgc agttctgggg agcaccctgg ggtgcagcac ggcgtgggct
1621 gtgctgggga gtgctgtgcc gcccgctccg tgggcacagc gccacgctgg atcaggccgt
1681 gagcccacca agggctgctg ccaggtgtgc tcggtgcccc cgcgccaggg acccccaggt
1741 tgtgcccttc ctatggggcg atgcagggct cagcctcaaa acgcaagcag gacgtgaccc
1801 ggtgcagatt tcagcccgtc catacttagc ctctgccctt tcccggctgt aacacgacct
1861 gatggatggc cacggcccct tcggccccgg tctggccggt ttggggacag cggggtgacg
1921 tggtgtggct gagcaggtgg ggtgctgcgg ggctgggtgc gggcgggcgg ttatgggtat
1981 ggccaaaccc acagcgagga actggaaggg cagtggggtg cagcagcccc cccttgggca
2041 ccgagaacag gctgacccct taaaaagcat cccctccagc acatggggag ctggaggtac
2101 gtgggatact gcctccgtcc ctcagtgtcc tgctgccatg tgtcaccccg agccaatttc
2161 cggctgccag gcagcacccc gggcatgggg atgtggtcag ggcaggggag ggtgctggga
2221 ttgtcaccgt gcgtcagccc agtgccggca cagggcgggt ggctgccggc agcgaggtag
2281 gcgacggccg caggaagtga ggtcccagct cctggagcca tggaattcag gcagcattgt
2341 gcctttctcg tgtattttag taattttata tatttaagag ccaagctggg gaacggtgct
2401 gctagcaggg gggaaacccc gtcccccagc gtggccctag agctcaattt gagatctgct
2461 ccatcccatc gtggaagcaa agggagagct gttgtcctgg caacgtcctc cccaggccgc
2521 agtgccgggc cccgtgtgtc cctgccgtgc ctctcactct gctctgcttc ccttgcagcc
2581 tgaagtgctc ctccacccag acagcagcag catgtcgaca ggtgagtgct ttgcatctgt
2641 gcctcctgca tccattccct ccatcccctt tgcattcacc cccccttgca cctgatccct
2701 caatccccct tggcatccat cctctccaac ccctgttgcc tccatccctt ttatctcatt
2761 gtgtgtactc aatttttcct ccttgtgtcc atctcctgcg tcctttttgc atttgtcctc
2821 tccgtccctc ttgcatctaa acccttgggc cccctcgtgt ccatcctctg tgtccatctg
2881 ttccctctgt tcctcttgca tctaatccct tagttccctt tgcatccatc catctgtccc
2941 cccatcctcc tatgttgtcc atttgtgtcc atcacttgtg cctcgacatg tccatccctt
3001 ttctccccct tacatctgat ctctcagccc tctgtgcctg tctcccatgt cctctctgta
3061 tctgtcacct ctgtccctct tgcatctgat ccttaggtcc ccttttgtgt ctgtctcatg
3121 ttctctttcc atctatctcc tctgtttccc ttgtatccaa gtcctgaatc tctcatccaa
3181 tccccagagt cccccccttg tgtccatctt ctatgttctc cagtagtgtc tgtctcctct
3241 gtctccccct cgtgtccatt ccgtgtcctc tttccatccc ttccctccat tccccttacc
3301 tgtccccccg ttgcctctgt cccatttctc ccccttgctt ccctccagcc cccacgtact
3361 ccatgtcttc tgcccacccc ccttccccac cagcctcttg gctccctccc atcccggggc
3421 tcaccggctg ctggaggagc tgaaatttca gaagactgag atctggagat ggcagcacgg
3481 ccccagggtg gctcatgtta tttttagcgt cctccctctg cgctgtcaca ccccaacgcg
3541 tggcagggac agatcctgcg gggggggggg gggaatcctg cagggaaaac ccccccgtcc
3601 tatgggctgg agtgctctga gtggatgggg ctatgccatt tctggggaca gtcagatccc
3661 acagggttgt gccatgccgt gccatgctgc accaagcatg actaggagct gaccatgggg
3721 atggttgtgg ctgtccctgc tggttcctat ctctcagtgg ggtcggagaa atgaccgtct
3781 ccaggatgca gtggagttcc aacccctcag ctcaacccct cgtctgccag gacaccatgg
3841 ggtctcagtg cacgagcatg tgcagcttca atcctggact gtctaggtga caggatgaca
3901 ggatgagtgc ttcctactct gccctccctg gggttggggt tggtccccac cactcctcgt
3961 tcagtcctgc cactggggac agttttctcc ctggtgtaac ttctggcttt tagatttatt
4021 gttgtttgtt ttttttttct ctctctctcc accagaaaaa ctgaagcacc acaagatcat
4081 cttcgtggtg ggtaagtggg tgtgggacag aggtggccgg tgtggggtgt gctgggacca
4141 gcagggtcta gcaaagcttg ggatgccagc atccaatgtg atgtgatggc aaactcctgg
4201 gggtggtgtc atgggtgctt gatggggcgg tgatgtccag ggtaccatca ggagagtgat
4261 gtctgggata ttggtcatgg tgcctctggt gccatcagag caatgatgcc caggtgccaa
4321 tggccaatca gccacaccag cagcccatgc cgccccaaca gctctttcca tccatcctgt
4381 ctcctctcat caggctttga tgggtctggg ctgccgctcc tcctaccccc actagatggg
4441 gcatgggaca tgacggtggc cctggtgtga cttgcaggtg gccccggctc agggaagggg
4501 acgcaatgcg agaagattgt gcacaagtat gggtacactc acctctccac tggggacctg
4561 ctccgggcag aggtcagctc gggctcagag cggggcaaga agctgcaagc catcatggag
4621 aagggcgagc tggttcccct ggtgagtttg tggtcatgaa cacagggtga tgtggggaca
4681 ctgtggtgat gcagcagcag gaatggggca cgtcgtccca catcaccctg acagctgtat
4741 gccccctccc caggacacgg tgctggacat gctgcgggac gccatgttgg ctaaggcaga
4801 cacctccaag ggtttcctca ttgacggcta ccctcgcgag gtgaagcagg gagaggagtt
4861 tgaaaagaag gtgagggccg ttcccctggt gagtgccgtg cggtgttatg ttgtgccacc
4921 ctccgtctgc ggcctatgcg gtgctggcca atggtgaggc cttgtcggct tcactgcagc
4981 ctcagcccag ccctggcgtc agccacagca gggcacggtg gcaggggaca tctatccagg
5041 ctgccctcag gcctggggca gtggcagaga gccaaggcct gtcccctggt gggtgccatg
5101 ccacagtgtg ttgtgtcatc ctccatctgc ggcctacatg gtgatggcac tatgtgaggc
5161 ctagtcaggc ttcactgcgg cctacaggcc tacgccctgg ggacccccta tagtgggatg
5221 cggtggcagg ggacgtctgt ctgggctgca cccagggctg gcagagggtg actggggtga
5281 tggccacgct gtcccctcct tgccggtgtc acacctccag ccccacctgg aggagacgtg
5341 gggacaccca gacccacatg gtgacagctg tgtccccaca gattgccccc cccacgctgc
5401 tgctctacgt ggatgcgggg aaggagacga tggtgaagcg gctgctgaag cggggagaga
5461 ccagcgggcg ggtggacgac aacgaggaga ccatcaagaa gcgcttggag acgtactaca
5521 aggctaccga acccgtcatc gccttctaca agggccgggg catcgtccgc caggtgagcg
5581 gggtacaggg aggggggaag caatacatgg gggtaacaga gctgccagct gccccccctt
5641 ccccccagca ttgctgaggg aaccagcacc ccgtcactgt ccccacagct gaatgccgag
5701 ggcaccgtgg atgaggtttt ccagcaggtc tgctcctacc tcgacaagct gtgacggccc
5761 cgcgcccccc accctgcgct ccccccacac gcgcagggca gagacagcgg aagtggcctt
5821 atcctgtttt cgtggacaga gccgcgcgaa ggaaatttca aggacattgt gtttggctct
5881 ttcccgtctc tccccagtaa agttcacttt aatgagccca gactttatct ttttcttctg
5941 tcgcaggaaa tgagtttttc tttccagaga tttttgtttc tagcctcccc ccaccccccc
6001 tcagccagcc cctctctgaa gctgataaag ggcaggaagc gggtgtttat cccggctgag
6061 atgcagcggg cccgggggat gcttggagga aggcaggaga tcccggctgc ttcgcctggg
6121 gccttggtgt cctggccctg tcctgctccc cccaacccga tctcattcct gtgctcctgt
6181 ctctgcatgc gcctggggca gcatcccacc ccatccctcc tcctcatccc atcagatggt
6241 gatttccacc ccttccacca cttcctcgct gcttttgtcc attggggtgt tttgggatcc
//
LOCUS CHKBGBL3 192 bp ds-DNA VRT 15-MAR-1990
DEFINITION Chick 14K beta-galactoside-binding lectin gene, exon 3.
ACCESSION D00310
KEYWORDS 14K lectin; beta-galactoside-binding lectin;
chick 14K beta-galactoside-binding lectin; lectin.
SEGMENT 3 of 4
SOURCE A library of chick genomic fragments in lambda-EMBL3 provided by J.
D. Engel (Northwestern University).
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 192)
AUTHORS Ohyama,Y. and Kasai,K.
TITLE Isolation and characterization of the chick 14K beta-galactoside-
binding lectin gene
JOURNAL J. Biochem. 104, 173-177 (1988)
STANDARD full staff_entry
COMMENT The mRNA for the lectin is encoded by four exons. The first exon
encodes only three amino acid residues and the other three exons
encode, respectively, one of the three repeating sequences found in
this lectin.
FEATURES from to/span description
pept + 11 + 182 14K beta-galactoside-binding lectin, exon 3
(AA 31 at 12)
IVS < 1 10 14K beta-galactoside-binding lectin, intron 2
IVS 183 > 192 14K beta-galactoside-binding lectin, intron 3
BASE COUNT 49 a 48 c 52 g 43 t
ORIGIN About 1.2 kb after segment 2.
1 ttctctgcag ctttgtgatg aatctgggca aggactcaac ccacctgggg cttcacttca
61 acccccgctt tgatgctcac ggtgatgtga acctcattgt gtgcaactca aagaaaatgg
121 aagagtgggg taccgagcaa agggagacgg tcttcccttt ccagaaggga gccccaatag
181 aggtaaagcc tg
//
LOCUS XELPRL5B 986 bp ss-mRNA VRT 15-MAR-1990
DEFINITION X.laevis ribosomal protein L5b 60 S subunit mRNA, complete cds.
ACCESSION M29033
KEYWORDS 5S ribosomal RNA binding protein; nucleolar protein;
ribosomal protein.
SOURCE X.laevis stage 17 embryo, cDNA to mRNA, clone L5b.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 986)
AUTHORS Wormington,W.M.
TITLE Developmental expression and 5S rRNA-binding activity of Xenopus
laevis ribosomal protein L5
JOURNAL Mol. Cell. Biol. 9, 5281-5288 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Wormington, 14-OCT-1989.
FEATURES from to/span description
pept 53 943 L5b ribosomal protein
mRNA < 1 > 986 L5b mRNA
BASE COUNT 309 a 200 c 252 g 225 t
ORIGIN
1 gttagggctg tgactctccg gtcctagccc cgacggctca aaatagagca aaatggggtt
61 cgtaaaggtc gtcaagaaca aggcttattt taagaggtac caggtcaagt tccgcagaag
121 gagagagggc aagaccgatt actatgctcg caagcgactg gtgatccagg ataagaacaa
181 gtacaatact cccaagtaca ggatgattgt acgtgtcacc aacagagaca tcatctgcca
241 gattgcttat gctaggattg agggagacat gattgtatgt gctgcttatg ctcatgagct
301 tcccaagtat ggtgtcaaag ttgggctgac aaactatgct gcagcatatt gtactggcct
361 gctgctggct cgtaggcttc tcaacaaatt cggtctggac aaagtatacg aaggtcaagt
421 ggaagtaaca ggagatgagt acaatgtaga gagcgttgat ggtgaacctg gtgcctttac
481 atgctacctt gatgccggcc tcaccagaac caccactgga aacaaagttt ttggtgctct
541 taagggagct gtggatggag gtttatctat tccacacagt accaagcgat tccctggcta
601 tgactctgaa agcaaagaat tcaatcctga ggtccaccgc aagcacatct tcgcccagaa
661 tattgcagag tacatgcgtc ttctgatgga agaagatgaa gatgcatata aaaaacagtt
721 ctcccagtac ataaagaatg gtgtcgcagc agatcagttg gaagacatat acaaaaaggc
781 tcatgcaggt attcgggaga atccagtgca cgaaaagaaa cccaagaagg aagtcaaaaa
841 gaagaggtgg aaccgtgcca aattgtctct ggaacagaaa aaggaccgtg ttgctcagaa
901 aaaagccagt ttcctcagag ctcaggagaa ggcagacagc taaatggatt atgttgcgat
961 tttctaagtg ttgtcagcaa aaaaat
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS DRMMTTRND 976 bp ds-DNA ORG 15-JUN-1989
DEFINITION D.mauritania Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase
subunit 2 (3' end) cytochrome oxidase subunit 1 (5' end) genes.
ACCESSION M18073
KEYWORDS NADH dehydrogenase; cytochrome oxidase; transfer RNA-Cys;
transfer RNA-Trp; transfer RNA-Tyr.
SOURCE D.mauritania (adult, from Mauritius Island) mitochondrial DNA.
ORGANISM Mitochondrion Drosophila mauritania
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae;
Drosophila mauritiana.
REFERENCE 1 (bases 1 to 976)
AUTHORS Satta,Y., Ishiwa,H. and Chigusa,S.I.
TITLE Analysis of nucleotide substitutions of mitochondrial DNAs in
Drosophila melanogaster and its sibling species
JOURNAL Mol. Biol. Evol. 4, 638-650 (1987)
STANDARD simple staff_entry
COMMENT Draft entry and printed copy of sequence for [1] kindly provided by
Y.Satta, 04-FEB-1988.
FEATURES from to/span description
ORF < 1 461 NADH dehydrogenase subunit 2 (AA at 1; ta stop
codon)
ORF 668 > 976 cytochrome oxidase subunit 1 (gta start codon)
tRNA 461 526 Trp-tRNA
tRNA 580 519 (c) Cys-tRNA
tRNA 666 601 (c) Tyr-tRNA
BASE COUNT 348 a 105 c 96 g 427 t
ORIGIN Unreported.
1 tctttaataa ttagagaatc aatttgactt atttattttt ttttttattc atttttatca
61 tttgtattaa catttatatt taatattttt aaattatttc atttaaatca attattttct
121 tgatttgtaa acagaaaaat tttaaaattt acattattta taaatttttt atcattagga
181 ggattacctc catttttagg atttttacct aaatgacttg taattcaaca attaacatta
241 tgtaatcaat attttttatt attattaata ataatatcaa ctttaattac attatttttt
301 tatttacgaa tctgttattc tgcttttata ataaattatt tcgaaaataa ttgaatcata
361 aagataaata taattagtag taatactaat atatatttaa ttataacttt tttttcaatt
421 tttggattat ttataatttc tttattttat tttatatttt aaggctttaa gttaataaaa
481 ctaataacct tcaaagctat aaataaagaa atttctttaa gccttagtaa aaattactcc
541 ttcaaaattg cagtttgata tcattattga ctataagacc taataaaatt tgtccttatt
601 tgattaagaa gaataaatct tataaataga tttacaatct atcgcctaaa cttcagccac
661 ttaatcagta accgcgacaa tgattatttt ctacaaatca taaagatatt ggaactttat
721 attttatttt tggagcttga gctggaatag ttggaacatc attaagaatt ttaattcgag
781 ccgaattagg acatcctgga gcattaattg gagatgacca aatttataat gtaattgtaa
841 ctgcacatgc ttttattata atttttttta tagttatacc tattataatt ggtggatttg
901 gaaattgatt agtaccttta atattaggtg ctcctgatat agcattccca cgaataaata
961 atataagatt ttgatt
//
LOCUS DROMTTRND 975 bp ds-DNA ORG 15-JUN-1989
DEFINITION D.melanogaster Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase
subunit 2 (3' end) cytochrome oxidase subunit 1 (5' end) genes.
ACCESSION M18022
KEYWORDS NADH dehydrogenase; cytochrome oxidase; transfer RNA-Cys;
transfer RNA-Trp; transfer RNA-Tyr.
SOURCE D.melanogaster (adult, from Sapporo, Japan) mitochondrial DNA.
ORGANISM Mitochondrion Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae;
Drosophila melanogaster.
REFERENCE 1 (bases 1 to 975)
AUTHORS Satta,Y., Ishiwa,H. and Chigusa,S.I.
TITLE Analysis of nucleotide substitutions of mitochondrial DNAs in
Drosophila melanogaster and its sibling species
JOURNAL Mol. Biol. Evol. 4, 638-650 (1987)
STANDARD simple staff_entry
COMMENT Draft entry and printed copy of sequence for [1] kindly provided by
Y.Satta, 04-FEB-1988.
FEATURES from to/span description
ORF < 1 461 NADH dehydrogenase subunit 2 (AA at 1; ta stop
codon)
ORF 667 > 975 cytochrome oxidase subunit 1 (ata start codon)
tRNA 461 526 Trp-tRNA
tRNA 580 519 (c) Cys-tRNA
tRNA 665 600 (c) Tyr-tRNA
BASE COUNT 345 a 101 c 97 g 432 t
ORIGIN Unreported.
1 tctttaataa ttagagaatc aatttgatta atttattttt ttttttattc atttttatca
61 tttgtattaa catttatatt taatattttt aaattatttc atttaaatca attattttct
121 tgatttgtta atagaaaaat tttgaaattt acattattta taaatttttt atcattagga
181 ggattacctc catttttagg atttttacca aaatgacttg taattcaaca attaacatta
241 tgtaatcaat attttatatt aacaattata ataatatcaa ctttaattac attatttttt
301 tatttacgaa tttgttattc cgcttttata ataaattatt ttgaaaataa ttgaatcata
361 aagataaata taaatagtat taattataat atatatataa ttataacttt tttttcaatt
421 tttggattat ttttaatttc tttattttat tttatatttt aaggctttaa gttaataaaa
481 ctaataacct tcaaagctat aaataaagaa atttctttaa gccttagtaa aacttactcc
541 ttcaaaattg cagtttgata tcattattga ctataagacc taattaattt gtccttattt
601 gattaagaag aataaatctt atatatagat ttacaatcta tcgcctaaac ttcagccact
661 taatcaataa tcgcgacaat gattattttc tacaaatcat aaagatattg gaactttata
721 ttttattttt ggagcttgag ctggaatagt tggaacatct ttaagaattt taattcgagc
781 tgaattagga catcctggag cattaattgg agatgatcaa atttataatg taattgtaac
841 tgcacatgct tttattataa ttttttttat ggttatacct attataattg gtggatttgg
901 aaattgatta gtgcctttaa tattaggtgc tcctgatata gcattcccac gaataaataa
961 tataagattt tgact
//
LOCUS DRSMTTRND 975 bp ds-DNA ORG 15-JUN-1989
DEFINITION D.simulans Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase
subunit 2 (3' end) cytochrome oxidase subunit 1 (5' end) genes.
ACCESSION M18072
KEYWORDS NADH dehydrogenase; cytochrome oxidase; transfer RNA-Cys;
transfer RNA-Trp; transfer RNA-Tyr.
SOURCE D.simulans (adult from Nairobi, Kenya) mitochondrial DNA.
ORGANISM Mitochondrion Drosophila simulans
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae;
Drosophila simulans.
REFERENCE 1 (bases 1 to 975)
AUTHORS Satta,Y., Ishiwa,H. and Chigusa,S.I.
TITLE Analysis of nucleotide substitutions of mitochondrial DNAs in
Drosophila melanogaster and its sibling species
JOURNAL Mol. Biol. Evol. 4, 638-650 (1987)
STANDARD simple staff_entry
COMMENT Draft entry and printed copy of sequence for [1] kindly provided by
Y.Satta, 04-FEB-1988.
FEATURES from to/span description
ORF < 1 461 NADH dehydrogenase subunit 2 (AA at 1; ta stop
codon)
ORF 667 > 975 cytochrome oxidase subunit 1 (gtg start codon)
tRNA 461 526 Trp-tRNA
tRNA 580 519 (c) Cys-tRNA
tRNA 665 600 (c) Tyr-tRNA
BASE COUNT 343 a 107 c 102 g 423 t
ORIGIN Unreported.
1 tctttaataa ttagagaatc agtttggttt atttattttt ttttttattc atttttatca
61 tttgtattaa catttatatt taatattttt aaattatttc atttaaatca attattttct
121 tgatttgtaa atagaaaaat tttaaaattt acattattta taaatttttt atcattggga
181 ggattacccc catttttagg atttttacca aaatgacttg taatccaaca attaacatta
241 tgtaatcaat attttttatt attaataata ataatatcaa ctttaattac attatttttt
301 tatttacgaa tttgttattc tgcttttata ataaattatt tcgaaaataa ttgaatcata
361 aagataaata taattagtat taatactaat atatatttaa ttataacttt tttttcaatt
421 tttggattat ttataatttc tttattttat tttatatttt aaggctttaa gttaataaaa
481 ctaataacct tcaaagctat aaataaagaa atttctttaa gccttagtaa aaattactcc
541 ttcaaaattg cagtttgata tcattattga ctataagacc taataaaatt gtccttattt
601 gattaagaag aataaatctt ataaatagat ttacaatcta tcgcctaaac ttcagccact
661 taatcagtga ccgcgacaat gattattttc tacaaatcat aaagatattg gaactttata
721 ttttatcttt ggagcttgag ctgggatagt cggaacatca ttaagaattt taattcgagc
781 cgaattagga catcctggag cattaatcgg agatgatcaa atttataatg taattgtaac
841 tgcacatgct tttattataa ttttttttat agttatacct attataattg gtggatttgg
901 aaattgatta gtgcctttaa tattaggtgc ccctgatata gcattcccgc gaataaataa
961 tataagattt tgatt
//
LOCUS RICMTB1 2135 bp ds-DNA circular ORG 15-MAR-1989
DEFINITION Oryza sativa (rice) Mitochondrial plasmid-like DNA (B1), complete
sequence.
ACCESSION D00293
KEYWORDS cytoplasmic male sterility; mitochondrial DNA;
mitochondrial plasmid.
SOURCE Cytoplasmic male-sterile rice line (A-58CMS), mitochondrial
plasmid-like DNA (B1).
ORGANISM Mitochondrion Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE 1 (bases 1 to 2135)
AUTHORS Shikanai,T., Yang,Z.-Q. and Yamada,Y.
TITLE Properties of the circular plasmid-like DNA B1 from mitochondria of
cytoplasmic male-sterile rice
JOURNAL Plant Cell Physiol. 28, 1243-1251 (1987)
STANDARD full staff_entry
COMMENT The complete nucleotide sequence of B1 (2,135bp) which is a
plasmid-like minicircular DNA in the mitochondria of Oryza sativa
A-58CMS was found to contain sequences homologous to those in the
1,913bp plasmid-like DNA of maize, and therefore, this sequence may
have originated from the nuclear genome. The number 2014 described
under the result section as the beginning of ORF seems to be wrong
number. According to the sequence map, correct number should be
2019. corresponds to DDBJ (rel4.0) entry <RICMTB1>(D00293) dated
ORG
FEATURES from to/span description
ORF 1793 2035 open reading frame
ORF 2019 1732 (c) open reading frame (comp strand)
BASE COUNT 545 a 494 c 449 g 647 t
ORIGIN 3 bp downstream of EcoRI site.
1 ttcaggattg gctaaggctc tagccattaa atgtgactgg ggttgatcca agggctggtc
61 tcatggtttt gactaggtgt ttctcaaatt tctgatcata tttcctgtcg ttataacccc
121 caaaatacgc taactcgcag ctcgaagttc ccagttacat gggcaaccct actcactata
181 catagggtgt gaagagacgg acaacaatag tctttctctg ctttcactcg agccaaacta
241 agtgttttct aggattggcc ccaaagccat actatggtca caacggtaac cccctcgatt
301 tagtttatcc aatcaatcga gaggcagcat ctctatcaat tactcggatc ggcaacctcg
361 gttctggaat gtggacttca ctgaccagcc gcctcttagt ccacacccaa cttggaagtg
421 ttattgattc gttagtaacc ttcgccatcc taatacccga gactgtagtg ttctaagttg
481 gttaaagccg tcgtctcctg ggatgcccgg ctggtcgggc aagtattagt gaaacggctc
541 cgcgcaatca ataactaccc tgcttaggta tgagcttcca agtcccgaat ccttttgtct
601 caagtctgcc ttggtaatgt gaccacacat cctttgatga tggaagtgtt gtttcccaac
661 aatccacccc ggaatcctag ttggagtaat gtatccctac atactatcgc gctaatgtaa
721 tcgggttctc agatcatact caatgagggg tggctagtat cccgaaactt tgaagaacat
781 gagttaagta gttcattcac ttggtttcgc caccagtcaa aacgccctaa aacccagtag
841 gttttctcga tcaccattct acgattttat ttccacctgt tttagaacac ataaattgtg
901 ctacccttaa gagtttatca ctaaaatcaa gaactttatt tatgacctgt aagttgcctc
961 ataaaaagta gatgttagta cccgactaaa aggagggaat caaataaaag gtaaagctct
1021 caattaaaat aaaaagctat cgtttacttt gttaagggta tattgcccgt agcgaggtta
1081 aggtaaaccc ttgaccgagt gaaggcgagc ccggtaaagc actaagtgta ttggctagct
1141 tgtactttat tagctatcaa gtgtttgatg taccaattac tttattgtgt caactagttt
1201 attgaatgca aaggtaattc cctcgataat tgatttaatt atcaagaaag agaattagtt
1261 accctccttc cctcgagctg agctatttct tgtgtattca atcgtatgca cgagggctgt
1321 cctctcctct gctctatctc ccggacaggt atgaaagtcg taattctttg ggtttttgta
1381 gtcaatggaa tgagcggagg gagacacatc cttccatcca aaggtgtgtg tgtcccgtag
1441 cgagtggaat ggctggttcg gtctagtcgt atagaacggg ttgagttctt tttactgagt
1501 cctttcgagg ctggaagtgt ttcttacagt ttccttttcc gaagggaaag tcaagtagta
1561 taaactagta ctaagtccaa gaagaagggg tgaaaagtga ggagggcgat tagcccgatc
1621 cacttgaggg gaggaaggcc ttccttccct actaatagta ccaattcata attcctattc
1681 gtaataaatt catatttatt tggaaaaata tgaaaactaa ttatctttat tctagttccc
1741 taaaaccctt tccgtgccac tcctttctgt tccactcatc caacattgtt ggatgccaaa
1801 gactttgacg aagactttcg tacctgtatt tctgtccggt atgacgtcgc tttcgtgtcg
1861 gcatagcgtg ccccagttgc cacagttgct aggctgttca gagtcctctc ggaccccggc
1921 taggcatcct cccggtaatg ccaggggctg cttcaagaaa cgggtgatgt gggttccaat
1981 cattcttggt ttggttcctc ggcaccaagc caatcccatt cctttccgct ggtgaccgcc
2041 ttcttgttgc ctaccttttt aacccctgta gtaaccccaa gtccaagtac aaggcatacg
2101 tgctccttta atatgcttgc tcctcttttc ttgaa
//
LOCUS TOBCPNPLR 93 bp ds-DNA ORG 15-MAR-1990
DEFINITION Tobacco chloroplast 23S rDNA and lincomycin resistant mutants.
ACCESSION X13526
KEYWORDS 23S ribosomal RNA; antibiotic resistance; lincomycin resistance;
ribosomal RNA.
SOURCE tobacco (Chloroplast Nicotiana plumbaginifolia).
ORGANISM Chloroplast Nicotiana plumbaginifolia
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae; Nicotiana plumbaginifolia.
REFERENCE 1 (bases 1 to 93)
AUTHORS Cseploe,A., Etzold,T., Schell,J. and Schreier,P.H.
TITLE Point mutations in the 23 S rRNA genes of four lincomycin resistant
Nicotiana plumbaginifolia mutants could provide new selectable
markers for chloroplast transformation
JOURNAL Mol. Gen. Genet. 214, 295-299 (1988)
STANDARD simple automatic
COMMENT EMBL features not translated to GenBank features:
key from to description
MUTANT 22 22 g is a in LR400
MUTANT 48 48 a is g in LR415
MUTANT 49 49 a is g in LR421
MUTANT 49 49 a is g in LR446
FEATURES from to/span description
rRNA < 1 > 93 23S ribosomal RNA
BASE COUNT 23 a 22 c 25 g 23 t
ORIGIN
1 tgaaatagac atgtctgtga agatgcggac tacctgcacc tggacagaaa gaccctatga
61 agcttcactg ttccctggga ttggctttgg gcc
//
LOCUS VFACPTG 1409 bp ds-DNA ORG 15-SEP-1989
DEFINITION Broad bean (V.faba) chloroplast Leu-tRNA-CAA, Leu-tRNA-UAA and
Phe-tRNA genes.
ACCESSION M23771
KEYWORDS transfer RNA-Leu; transfer RNA-Phe.
SOURCE Broad bean (V.faba) chloroplast DNA, clone Bam6a.
ORGANISM Chloroplast Vicia faba
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea; Vicia faba.
REFERENCE 1 (bases 1 to 1409)
AUTHORS Bonnard,G., Weil,J.-H. and Steinmetz,A.
TITLE The intergenic region between the Vicia faba chloroplast
Leu-tRNA-CAA and Leu-tRNA-UAA genes contains a partial copy of the
split Leu-tRNA-UAA gene
JOURNAL Curr. Genet. 9, 417-422 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
tRNA 15 95 Leu-tRNA-CAA
tRNA 539 573 Leu-tRNA-TAA, exon 1
1025 1074 Leu-tRNA-TAA, exon 2
tRNA 1185 1257 Phe-tRNA
BASE COUNT 506 a 240 c 238 g 425 t
ORIGIN 1 bp upstream of BglII site.
1 agatctcaat attatgcctt gaagaggact cgaacctcca cgctgtttag cacgagattt
61 tgagtctcgc gtgtctacca tttcaccacc aaggcatcct taaagggaat catattctct
121 gaatatgata tctatcttat gtcatggttc aatattatta tactccaatt ctaatccaat
181 tgcaacatag aatacaacat ctcaaataga atacaacatc tcaaatagaa tacggtacga
241 tataatacag tatctactat ataatacagc gagaatagaa ttctttctcg actaaataaa
301 tatagcttct agatgaacga aaagctagtc aaagggtcga ttcaaaattg gtagacgcta
361 cggacttaat tgtattgagc tttggtatgg aaacatagta atgttaagtg aaaacataac
421 ataatataat acagcatcta ctatataata caacatgtac tatataatac agcgagaaga
481 gaattctttc tcgactaaat aaatatagct tctagatgaa cgaaaagcaa gtcaaacggg
541 ggatatggcg aaattggtag acgctacgga cttaattgta ttgagccttg gtatggaaac
601 atattaagtg aaaactttca aattcagaga aaccctagaa ttttaaatgg gcaatcctga
661 gccaaatcct tctttccgaa aacaaaaaac taaaagttca gaaaaaaagg ataggtgcag
721 agactcaatg gaagctattc taacaaatgg agttgacaac attcaattga ttaatgaaga
781 tttctaactt ctatttgtaa attgtaacta ttttttttag ttttgattcg atcacaattg
841 aaacgaaaca ttagaatcaa atcaattcca actggaagaa aaaatggtga atattcattg
901 atcaaatcag tcactccacc aaaatatgat ggatcttttg aataactgat taatcagacg
961 agaataaaga tagagtccca ttctacatgt caataccgac atcaatgaaa gttttagtaa
1021 tcggaaaatc cgtcgacttt aaaaatcgtg agggttcaag tccctctatc cccaaaagcc
1081 gattgaattt gctcattttt atggcccttt cccttttttt tattgagttg acatagactc
1141 aagtaatttc cctcaaatta ggatggtgcg tcaagatttt aatggtcggg atagctcagc
1201 cggtagagca aaggactgaa aatccttgtg tcaccagttc aaatctggtt cccggcgact
1261 cattatgtat gagtatctat tcccatattt cttttcaaaa atcaaatatt gggaggaata
1321 gatacatatt gattagtggt cttaataatt atacataatt tatctaggcg tacagagata
1381 tgctctttct agatgaaaaa ggaatcgat
//
LOCUS WHTCPTAREP 781 bp ds-DNA ORG 15-MAR-1990
DEFINITION Wheat chloroplast dispersed repeat (9) DNA for nonfunctional rpl23'
homologue segment (located in large single-copy region LSCR).
ACCESSION X12849
KEYWORDS pseudogene; rbcL gene; repetitive sequence; ribosomal protein;
ribosomal protein L23; ribulose bisphosphate carboxylase.
SOURCE wheat (Chloroplast Triticum aestivum).
ORGANISM Chloroplast Triticum aestivum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Triticum aestivum.
REFERENCE 1 (bases 1 to 781)
AUTHORS Bowman,C.M., Barker,R.F. and Dyer,T.A.
TITLE In wheat ctDNA, segments of ribosomal protein genes are dispersed
repeats, probably conserved by nonreciprocal recombination
JOURNAL Curr. Genet. 14, 127-136 (1988)
STANDARD simple automatic
COMMENT *source: clone=pTacE8. see x12850 for repeat 9 in functional rpl23
gene; see x12848 for maize chloroplast repeat 9 in rpl23' homologue
EMBL features not translated to GenBank features:
key from to description
INVREP 214 228 inverted repeat A
INVREP 234 249 inverted repeat A'
RPT 268 276 direct repeat 1
RPT 297 301 direct repeat 2
RPT 302 306 direct repeat 2
RPT 313 318 direct repeat 3
SITE 377 678 rpl23' nonfunctional segment
INVREP 377 384 inverted repeat B
INVREP 385 392 inverted repeat B'
RPT 606 614 direct repeat 1
RPT 631 635 direct repeat 2
RPT 636 640 direct repeat 2
RPT 645 650 direct repeat 3
FEATURES from to/span description
pept < 1 93 rbcL gene product (AA at 1)
BASE COUNT 302 a 105 c 141 g 233 t
ORIGIN
1 gcttgcaaat ggagtcctga actagccgca gcttgtgaag tatggaaggc gatcaaattc
61 gagttcgagc cggtagatac tattgataag tagataaaac taaagataaa gaagagaagg
121 tataaataaa aaagaaacga aataaaaaga gaaaaaaata agttatgaaa tgcagtaatc
181 ttctttattc ttctaattga ttgcaattaa actcggctca atcttttttt tctaaaaaag
241 attgagccga ataaaaatag atcatgatat gatcatgaga cttgacaaat cgagattcgt
301 ctattctata tatctagaat atatatatta aggtataata caataatgaa atacaaataa
361 aataataaaa tatagtatta tcatatgata attgaatcaa atacgcagta tttactgaaa
421 taagtcttcg tttattgggt ttgaatcaat atacttttaa tgtcgaatgg gattcactaa
481 gactgaaata aagcattggg tcgtactctt ctttggtgtt aaggtggtag ctgtgtatag
541 ccatcgacta cctagaaagg taaaagaata ggacctattc tgggccatac aatgcattac
601 agacgtatga tcattaccct tcaaccgggt tattctattc cacttctaga tagagaaaaa
661 aactaaagga gaatgaatga aaaaagacat agtttggaag ttagaccttt ttataggact
721 ctctttcaat ttcaaaaaag aggacgtttg aaacttttaa caggcgtaat cgtgagtcaa
781 c
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS ACYFIMBA 1850 bp ds-DNA BCT 13-FEB-1990
DEFINITION A.viscosus fimbrial structural protein type 1 subunit gene,
complete cds.
ACCESSION M32067
KEYWORDS fimbrial structural protein.
SOURCE A.viscosus (strain T14V) DNA, clone MY3833.
ORGANISM Actinomyces viscosus
Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE 1 (bases 31 to 1850)
AUTHORS Yeung,M.K. and Cisar,J.O.
TITLE Sequence homology between the subunits of two immunologically and
functionally distinct types of Actinomyces spp
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 30)
AUTHORS Yeung,M.K.
JOURNAL Unpublished (1990) Univ. of Texas, San Antonio, Texas 78284
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.K.Yeung, 12-FEB-1990.
FEATURES from to/span description
pept 124 1725 fimbrial structural protein type 1 subunit
precursor
sigp 124 213 fimbrial structural protein type 1 subunit
signal peptide
matp 214 1722 fimbrial structural protein type 1 subunit
BASE COUNT 395 a 714 c 515 g 226 t
ORIGIN
1 caagcttgca tgcctgcagg tcgactctag aggaacgcga gaagaaccca cgactccgct
61 ccgtcgcgcg cgggtcaccc cttctatccc tcattcatcc caacaacaca ggagagaaca
121 gcaatgcact ccctcaacac gcgccggggc ctcggcctcg ccgctgcgat gacgctcgcc
181 gccggcgcgc tcgtcgcccc gaccggcgcg gcggcacccg ctgacccgaa cggctccacc
241 atcgaccccg acgcggccac cacgctgacc gtccacaagt gcgagcagac cgacaccaac
301 ggcgtcaagg aggggaccgg taacgaggac ccccaggccg agtgcaagcc cgtctccgac
361 gtcgagttca ccatcaccaa gctgaacgtg gacctgacga cctacgacgg ctggaagacc
421 ctcgcggacc tcaagggtga cgtggtcaag gccggtgccc tcaagagcac caccgtccag
481 aagatcacca ccggtgccaa cggtctggcc tccttcaccg acgcccagac cgaggtcggc
541 gcctacctcg tcagcgagac gcgcaccccc gacaaggtca tcccggccga ggacttcgtc
601 gtcaccctgc cgatgaccaa cccgcaggac accgccaagt ggaactacaa cgtccacgtc
661 taccccaaga acaccctctc cggtgtggac aagcaggtga ccgacaagcc ggcccccggc
721 tccgggcgcg acatcaccta caccatcacc acctccatcc cgaaggtgga ctaccccggc
781 ggtgcgcgca tcaagcgcta cgaggtcgtt gaccgcctcg acaagcgcat caagaaggaa
841 gccctgaccc cggtcgtcaa gatcgtcggt cagaacgagg tgaccctggc ggagaccacc
901 gactacaccc tcatcactgc cgagggtaag gaccacaact gggccaccat ccagctcacc
961 gaggagggcc gccgcaaggc ctccgaggcg cgctacaacg gcaacggcga gaccaagctc
1021 caggtgaccc tgaacgccaa gttcgacgcc gccgtcaacc tcgagggcga cctgtccaac
1081 accgcgggcc tcatccccaa cgacagcccc aacttcacct gggacccgaa caaccccggc
1141 accaccacgg acatccccgg catccccacc acccccgtgc tctccaagta cggcaaggtg
1201 gttctcacca agaccggtac ggacgacctg gccgacaaga ccaagtacaa cggcgcccag
1261 ttccaggtct acgagtgcac caagaccgcc agcggtgcca cgctgcgtga ctccgacccc
1321 agcacccaga ccgtcgaccc gctgaccatc ggtggggaga agaccttcac caccgccggc
1381 cagggcaccg tggagatcaa ctacctgcgc gccaacgact acgtcaacgg tgcgaagaag
1441 gaccagctga ccgacgagga ctactactgc ctcgtggaga ccaaggcccc cgagggctac
1501 aacctccagg ccgacccgct ccccttccgg gtcctggctg agaaggccga gaagaaggcc
1561 gcgaccgagg tgaccgtcac cgacatcccg aagaacgccg gcttccgcct gccgctgacc
1621 ggtgccaacg gcgtcatctt cctgaccatc gccggcgccc tgctggtggc cggtggcgcg
1681 gtggtcgcct acgccaacaa gcgccgtcac gttgccaagc actgacagcc tctggcagcc
1741 actggcagcc actgaggtga ccggccccag ccggccgtca tcacgctgac tgcttccttg
1801 tgggggaggc gggagaaccc gtctccccca cacgggcgtc gcagggatcc
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS BOVTCRA19 123 bp ss-mRNA MAM 15-MAR-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90028
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA41.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 123)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Department of Veterinary Public Health Obihiro University
of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
080
FEATURES from to/span description
pept < 1 > 123 T-cell receptor alpha chain, variable joining
region. (AA at 1)
BASE COUNT 23 a 34 c 32 g 34 t
ORIGIN
1 tttttccttt ttgcggacca cagactcagg cacttacttc tgtgctgtgc agctggaagg
61 ggggtgctgc caaccagctc atctttggga ctggcacgct gctttctgtc aagccaagca
121 caa
//
LOCUS CATSIS1 1125 bp ds-DNA MAM 15-SEP-1989
DEFINITION Feline c-sis proto-oncogene, segment 1.
ACCESSION M25353
KEYWORDS proto-oncogene.
SEGMENT 1 of 7
SOURCE Feline placenta DNA, clone pPHS1.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 1125)
AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
TITLE Genetic organization of the c-sis transcription unit
JOURNAL Nucleic Acids Res. 15, 959-970 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1043 + 1105 c-sis proto-oncogene
pre-msg 51 > 1125 c-sis, mRNA and introns
IVS 1106 > 1125 c-sis, intron A
BASE COUNT 171 a 430 c 362 g 162 t
ORIGIN 780 bp upstream of BamHI site.
1 tagcactctc ccttctcctt tataaaggcc ggaacagctg aaagggtggc aacttctcct
61 cctgccgccg ggagcagccc gcctgtctcc ccgcgcgccc gcagcctccc ccgctgcctc
121 cctgagggct cccctctggc cgccagcgcc catctttcat ttccgagatt gcgatatttt
181 gcgcacacac gcatacacac acgcgcaaaa agggggggaa aaaagggccc accctccagc
241 ctcgctgcaa agagaagccg gagcagccgc agctcacagc ccagaggacg cccagagcgg
301 cagcgcgcag acagacggac cgacggactc cgccgcgtcc acctgtctgc ccggcccggc
361 actgcgcgca cgggcacggc cgagcgcgga gcagccgtgc ccgccgcccg ggcccgccaa
421 ggcgcacacg ctccggcccc ccccacccgg cccgggcggg agtttgcacc tctccctgcc
481 cgggtactcg ggccgccgct gcaaagccaa ctttggaaaa agtttttggg gggagacttt
541 ggccttgagg tgcccggctc tgcactttcc gacttcgggg gcctttctag aaaatgttgc
601 aaaaaagcta agccagcggg cagaggaaaa cgcctatagc cggcgagtga agacgaacca
661 tcggctgccg tgttcttttt cctcttggag gttggagtcc cctgggcgcc cccacacggc
721 tagacgcctc ggctggttcg cgacgcagac ccccggccgt ggatgctcgc tcgggctcgg
781 gatccgccca ggtagcggcc ggaccctggt cccgcgccca ggccctcccc agccccccaa
841 cgacggagcc ggggccgggg gcggcggcgc cgggggccat gcgggtgagc cgggcggcgc
901 ctgcagcggc ctgagcgcct gatcgccgca gacccgagcc gagcccacct ccttccccag
961 cccccgcccc ccaccctggc cgcgggggcg gcgcgctcgg tccacgcgtc cggggccccg
1021 cggggccggg cccggagtcg gcatgaatcg ctgctgggcg ctcttcctgt ctctctgctg
1081 ctacctgcgt ctggtcagcg ccgaggtgag ttgcgacggc ggcgg
//
LOCUS CATSIS2 137 bp ds-DNA MAM 15-SEP-1989
DEFINITION Feline c-sis proto-oncogene, segment 2.
ACCESSION M25354
KEYWORDS proto-oncogene.
SEGMENT 2 of 7
SOURCE Feline placenta DNA, clone pPHS1.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 137)
AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
TITLE Genetic organization of the c-sis transcription unit
JOURNAL Nucleic Acids Res. 15, 959-970 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 21 + 117 c-sis proto-oncogene
pre-msg < 1 > 137 c-sis, mRNA and introns
IVS < 1 20 c-sis, intron A
IVS 118 > 137 c-sis, intron B
BASE COUNT 29 a 44 c 34 g 30 t
ORIGIN About 7.5 kb after segment 1.
1 ccatggtgtc tccctggcag ggggacccca ttcccgagga actctacaag atgctgagtg
61 accactcaat ccgctccttc gatgacctcc agcgcctgct acacggagac tccgtaggta
121 aattgaacct tgttcgg
//
LOCUS CATSIS3 142 bp ds-DNA MAM 15-SEP-1989
DEFINITION Feline c-sis proto-oncogene, segment 3.
ACCESSION M25355
KEYWORDS proto-oncogene.
SEGMENT 3 of 7
SOURCE Feline placenta DNA, clone pPHS1.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 142)
AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
TITLE Genetic organization of the c-sis transcription unit
JOURNAL Nucleic Acids Res. 15, 959-970 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 21 + 122 c-sis proto-oncogene
pre-msg < 1 > 142 c-sis, mRNA and introns
IVS < 1 20 c-sis, intron B
IVS 123 > 142 c-sis, intron C
BASE COUNT 31 a 35 c 45 g 31 t
ORIGIN About 2.4 kb after segment 2.
1 tgccttgcct tgccttgcag acgaagacag agccgagttg gacctgaatt cgactcgatc
61 ccattgtgga ggcgagctgg agagcttatc ccgagggaga aggagcctag gtgaggctgc
121 aggtaagaat cctcctttct cg
//
LOCUS CATSIS4 246 bp ds-DNA MAM 15-SEP-1989
DEFINITION Feline c-sis proto-oncogene, segment 4.
ACCESSION M25356
KEYWORDS proto-oncogene.
SEGMENT 4 of 7
SOURCE Feline placenta DNA, clone pPHS1.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 246)
AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
TITLE Genetic organization of the c-sis transcription unit
JOURNAL Nucleic Acids Res. 15, 959-970 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 21 + 226 c-sis proto-oncogene
pre-msg < 1 > 246 c-sis, mRNA and introns
IVS < 1 20 c-sis, intron C
IVS 227 > 246 c-sis, intron D
BASE COUNT 37 a 90 c 80 g 39 t
ORIGIN About 1.5 kb after segment 3.
1 tgaggccccc ctcccttcag gttccccgac ggtcgccgag ccagccatga tcgccgagtg
61 caagacacgc acggaggtgt tcgaggtgtc ccggcgcctc atagaccgca ccaacgccaa
121 cttcctggtg tggccgccct gcgtggaggt gcagcgctgc tccggctgct gtaacaaccg
181 caacgtgcag tgccgtccca cccaggtgca gctgcggctt gtccaggtgc gcgggggggc
241 ccgacc
//
LOCUS CATSIS5 185 bp ds-DNA MAM 15-SEP-1989
DEFINITION Feline c-sis proto-oncogene, segment 5.
ACCESSION M25357
KEYWORDS proto-oncogene.
SEGMENT 5 of 7
SOURCE Feline placenta DNA, clone pPHS1.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 185)
AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
TITLE Genetic organization of the c-sis transcription unit
JOURNAL Nucleic Acids Res. 15, 959-970 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 21 + 165 c-sis proto-oncogene
pre-msg < 1 > 185 c-sis, mRNA and introns
IVS < 1 20 c-sis, intron D
IVS 166 > 185 c-sis, intron E
BASE COUNT 40 a 47 c 67 g 31 t
ORIGIN About 1.4 kb after segment 4.
1 tcgatggttc tgtcctccag gtgagaaaaa tcgagattgt gcggaagagg ccagtcttta
61 agaaggccac ggtgaccctg gaggaccacc tggcgtgcaa gtgtgagacg gtggtggctg
121 cacgacccgt gacccgaagc ccggggagct cccaagagca gcgaggtaag cgctggtccg
181 gggtc
//
LOCUS CATSIS6 187 bp ds-DNA MAM 15-SEP-1989
DEFINITION Feline c-sis proto-oncogene, segment 6.
ACCESSION M25358
KEYWORDS proto-oncogene.
SEGMENT 6 of 7
SOURCE Feline placenta DNA, clone pPHS1.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 187)
AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
TITLE Genetic organization of the c-sis transcription unit
JOURNAL Nucleic Acids Res. 15, 959-970 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 21 40 c-sis proto-oncogene
pre-msg < 1 > 187 c-sis, mRNA and introns
IVS < 1 20 c-sis, intron E
IVS 168 > 187 c-sis, intron F
BASE COUNT 44 a 57 c 64 g 22 t
ORIGIN About 2.1 kb after segment 6.
1 tctttctctc gagcagccag gacaccccag actcgggtga ccattcggac ggtgcgagtc
61 cgccggcccc ccaaggggaa gcaccagaag ttcaagcaca cgcatgacaa gaaggcactg
121 aaggagaccc tcggagccta ggggcatctg caggagcgtg cgggcaggtg aggccaggcg
181 gggaacc
//
LOCUS CATSIS7 1643 bp ds-DNA MAM 15-SEP-1989
DEFINITION Feline c-sis proto-oncogene, segment 7.
ACCESSION M25359
KEYWORDS proto-oncogene.
SEGMENT 7 of 7
SOURCE Feline placenta DNA, clone pPHS1.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 1643)
AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
TITLE Genetic organization of the c-sis transcription unit
JOURNAL Nucleic Acids Res. 15, 959-970 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pre-msg < 1 1623 c-sis proto-oncogene, mRNA and introns
IVS < 1 20 c-sis proto-oncogene, intron F
BASE COUNT 310 a 513 c 437 g 383 t
ORIGIN About 300 bp after segment 6.
1 tccctctggt tggcctacag ggttatttaa tacggtattt gctgtatcgc ccccatgggg
61 tccttggagt gataatattg ttcccctcgt ccgtctgtct cgatgcctga ttcggacggc
121 caatggtgct tccccccctc ccacgcgcct gtccacccac cagcgggtcc cctcattggc
181 ctccagcgcc ttgcccggga gctcgagaag gacacgaagg acctgaactc cactgctgtc
241 ttcctccgct gaccccgaga acgtggcata gaagtgtaag agagacccat gggatcgccg
301 ttcggggacc ggaggagctt cccttcccca gctcctggcc cgggtcacac ctgagcgccg
361 tggactggcc tgaaaccctg cacgtggccc gcccggtccc aggacccctg gcccgctctg
421 aagggagcac ccccaggcag gccagggtgc ctcgtcctcc tgtggctggg accacgcgtt
481 ggaccgtaga ctggagagaa atcctcccac ggcgcccagg cccagtcact tctttctccc
541 cggtcacctc tgcttatagt ggcttccttt cgttttctat atatgaaatc ttcgaagatg
601 tggactcctc tgggtgggcg tggcccgagt acccggcagc caagtgcccc gtcagatggg
661 ttagagatga agtttgctct ggagccacca tggatggtga tctgggcgct cactgcctcc
721 ttccactccc ctgccttcac ctcctctgtt tcatctctct acctccaccc tgcgtcttcc
781 tcttgtcccg gccttcagtc cgctccaccg agggactcct tggaccccgt gctgagaccg
841 taaataaccc cactcactgc tccattaggc cgaagaccag gggtcagggc agcagggggc
901 ctgcgcttcg tatcccagcc cggccgagac tgccgtgtag gttcgtgcag ggtgtgcact
961 gtgcaaggca tgcgtgcagg ggtcgttgtt catgccgtag aacatggccc gctggtgtat
1021 ttgttaggac agttttctgg cagatggagc taaaatgtct tgagcaaagg gatcctgttc
1081 ctaatcgaca tgaaggttcc atgtggactg gctgtgtccc gacccagcca atggcttggg
1141 gtagttggac aggagagtga gcccacgttg ggtggggacg ccacgcgttg tccagctggc
1201 ctctattttc ctgagtcctc agctcaagca gttccccttc caggcgggcg tgaaaaaccc
1261 aagagagaat ctccaaggga gggggcagcc ctcgctctcc ccgggcacgc taccccccag
1321 agtagattct tttgagatcc gcccgtggtg gctcctccca ggaaaccagc tcacgggctg
1381 ggagtggggg agaaaaggga aaagatcccc aggggtccct ggggtggggt ctgagctccc
1441 acctccctcc cctccacccc actgcacttt cccccttacc ccccccctcc aaaatctgct
1501 ctcttcagtt tgtaaagttg gtgattatat ttttgggggc tttcctttta ttttttaaat
1561 gtaaatttat ttatattccg tatttaaagt tgtaaaaaaa aaaaaataac cacaaacaaa
1621 accaaatgaa tgaatccacc ggc
//
LOCUS PIGMDHA 1080 bp ss-mRNA MAM 15-MAR-1990
DEFINITION Pig malate dehydrogenase mRNA, 3' end.
ACCESSION M29463
KEYWORDS cytosolic malate dehydrogenase; malate dehydrogenase.
SOURCE Pig liver, cDNA to mRNA, clone ppcMDH-1.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1080)
AUTHORS Joh,T., Takeshima,H., Tsuzuki,T., Setoyama,C., Shimada,K.,
Tanase,S., Kuramitsu,S., Kagamiyama,H. and Morino,Y.
TITLE Cloning and sequence analysis of cDNAs encoding mammalian cytosolic
malate dehydrogenase
JOURNAL J. Biol. Chem. 262, 15127-15131 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 870 malate dehydrogenase (AA at 1; EC 1.1.1.37)
BASE COUNT 320 a 232 c 240 g 288 t
ORIGIN
1 atgatgggtg tcctggatgg tgtcctgatg gagctgcaag actgtgccct tcccctcctg
61 aaagatgtca ttgcaacaga taaagaagag attgccttca aagacctgga cgttgccatt
121 ctcgtgggtt ccatgccaag aagggatggc atggagagga aagatttact caaagcaaat
181 gtgaaaatct tcaaatgcca gggtgctgcc ttggacaaat atgctaagaa gtcagttaag
241 gttatcgtgg tgggaaaccc agccaatacc aactgcctga ctgcctccaa gtcggctcca
301 tccattccca aggagaactt cagctgcttg actcgtttgg atcacaaccg agctaaagca
361 cagattgctc tcaaacttgg tgtgacttct gatgatgtca agaatgtcat catctgggga
421 aaccattcct caactcagta tccagatgtc aaccatgcca aggtgaaact gcaggcaaag
481 gaagttggtg tttatgaagc tgtgaaggat gacagctggc tcaagggaga gttcatcacg
541 actgtgcagc agcgcggtgc tgctgtcatc aaggctcaga aactgtccag tgcaatgtct
601 gcggcaaaag ccatttgtga ccacgtcaga gacatctggt ttggaacccc agagggagag
661 tttgtgtcca tgggcattat ctctgatggc aactcctatg gtgttcctga tgatctgctc
721 tactcattcc ctgttacaat caaggataag acctggaaaa ttgttgaagg tctccctatt
781 aatgatttct ctcgtgagaa gatggacctc actgcaaagg aactagcaga agaaaaagaa
841 actgcatttg aatttctttc ctctgcctga ttaaacaatc atttcaatgt actaaatgcc
901 ccaaagctga aaaatctaaa tgtccttttt gaccctagta ccaaataata ataatgctaa
961 ccttaaatta ctagtgaaaa ataacacact ttaaagattg tgtgcttctt ggtacaaatt
1021 tgtgacagtt tatcatcatg ctgttagtgc cacattctaa aataaatata tatccaaatg
//
LOCUS PIGUFG 996 bp ds-DNA MAM 15-MAR-1990
DEFINITION Pig uteroferrin gene, 5' end.
ACCESSION M30283
KEYWORDS iron-transport protein; transport protein; uteroferrin.
SOURCE Pig adult liver DNA.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 996)
AUTHORS Simmen,R.C.M., Srinivas,V. and Roberts,R.M.
TITLE cDNA sequence, gene organization, and progesterone induction of
mRNA for uteroferrin, a porcine uterine iron transport protein
JOURNAL DNA 8, 543-554 (1989)
STANDARD simple staff_review
COMMENT See GenBank entry M30284 for sequence of uteroferrin mRNA.
FEATURES from to/span description
pept 887 > 996 uteroferrin precursor, exon 2 (first expressed
exon)
sigp 887 946 uteroferrin signal peptide
matp 947 > 996 uteroferrin
pre-msg 651 > 996 UF mRNA and introns
IVS 745 886 UF intron A
BASE COUNT 239 a 333 c 265 g 159 t
ORIGIN
1 aatgggggtc aggcaggact tcctggaaaa agggagcccc agctgtttgg gggaggggag
61 gtgaccctgg gacactggct gggatgttgt caaacacagc tcacacacag gacacacagc
121 tcagctgaga aacacaacag agacacacaa cttgggcaca aaacaggcac acacaagtcg
181 ccttcccccc tacacacagt tccacttgag gactcaggtg cacgtcacag acatgccatg
241 caggccttgc cagctgagta tacacagcca ctggaacact ggctcccagg ctcacaacca
301 acatgtgact gaggcaggca cgcttggcta cacgcagaaa acagctcaga cacacccaga
361 cacaggcaga caaacaccac ctctgagaac accgacacac aggaggtgcc tggcccgggc
421 attcggacac agctgcctac acagcacaca cgggccccga gacacgaact gctcctgtgg
481 gcagacgcag gggctgtggg gcaggcgcag ctgcactggt gcaccagaca gccacctcaa
541 atcccgtgtc cagcgctggg gaactaactc caaggctcac atgacccgag gggagggctt
601 ttggaacagt tggcggagaa actgcatcat cctcactatg atcatttccg acccagggaa
661 taaaggctca gtgaccgcca gttctgcact ggagctcaac ctgcctctct gagcccccag
721 cgaccggctc aagcctcccc tagggtgagt gctgggccct gcatgaccac cctttcccct
781 aagccccatg gctggaccct atcctcccac taaccctggg gataaggggg ggtgggtctc
841 gggtgcctgc tgcccacatc ccagacactt gtctctcctc tcccagatgg acacgtggac
901 ggtactgctc atcctgcaag cctccctggt gctccccggg gccgtgggca ccagaaccaa
961 caccaggact gcccccaccc ccatcctgcg ttttgt
//
LOCUS PIGUFMR 1424 bp ss-mRNA MAM 15-MAR-1990
DEFINITION Pig uteroferrin mRNA, complete cds.
ACCESSION M30284
KEYWORDS iron-transport protein; transport protein; uteroferrin.
SOURCE Pig (60-day pregnant) endometrial, cDNA to mRNA.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1424)
AUTHORS Simmen,R.C.M., Srinivas,V. and Roberts,R.M.
TITLE cDNA sequence, gene organization, and progesterone induction of
mRNA for uteroferrin, a porcine uterine iron transport protein
JOURNAL DNA 8, 543-554 (1989)
STANDARD simple staff_review
COMMENT See GenBank entry M30283 for sequence of the 5' end of the
uteroferrin gene.
FEATURES from to/span description
pept 95 1111 uteroferrin precursor
sigp 95 154 uteroferrin signal peptide
matp 155 1108 uteroferrin
mRNA 1 1424 UF mRNA
BASE COUNT 299 a 446 c 411 g 268 t
ORIGIN
1 acccagggaa taaaggctca gtgaccgcca gttctgcact ggagctcaac ctgcctctct
61 gagcccccag cgaccggctc aagcctcccc taggatggac acgtggacgg tactgctcat
121 cctgcaagcc tccctggtgc tccccggggc cgtgggcacc agaaccaaca ccaggactgc
181 ccccaccccc atcctgcgtt ttgtggccgt gggggactgg ggaggggtcc ccaatgcccc
241 gttccacaca gcccgggaaa tggccaatgc caaggcgatt gccacaactg tgaagaccct
301 gggcgcagac ttcatcctgt ccctggggga caatttctac ttcactgggg tgcatgatgc
361 caaagacaag aggttccagg agacctttga ggatgtgttt tctgacccct ccctccgcaa
421 tgtgccctgg cacgtgctgg ctggcaacca cgaccatctg gggaacgtct cggcacagat
481 agcctattct aagatctcca agcgctggaa cttccccagc ccttactacc gcctgcgctt
541 caagatccca cggtccaatg tgtctgtggc catcttcatg ctggacacgg tgacgctgtg
601 tggcaactcg gacgactttg tcagccagca accggagcgc aacctggcgc tggcccgcac
661 acagctggcc tggatcaaga agcagctggc ggcagcaaag gaggactatg tgctggtggc
721 cggccactat cctgtgtggt ccatcgccga gcatggaccc acccactgcc tagtcaagca
781 gttgctgcca ctgctgacca cacacaaggt cactgcctac ctgtgcggcc atgaccacaa
841 cctgcagtac cttcaggatg agaatggctt gggctttgtg ctgagcgggg ccgggaactt
901 catggacccc tccaagaagc acctgcgcaa ggtccccaac ggctacctgc gcttccactt
961 tggggctgag aactcactag gtggcttcgc ctacgtggag atcaccccca aagagatgag
1021 cgtcacttac attgaagcct cgggcaagtc tctcttcaag accaagttgc caaggcgagc
1081 caggtctgag caccagcacc gacgggcctg agaagtgcct ccctgctggc gggtgggtgg
1141 gtggcccact gggccatccc ccccagccct taagcaggct ttcctgggta cctgtggtgc
1201 tacggcagag cgggaaggag atggacagat gaggaaatgt gatgccaagt ggcccttgtg
1261 acaaggatgc ccacagaggt gaaagaagca tggacacatg taccggccag agtgccaagg
1321 ccccatggcc gggctcgccc agcctgagct tctgggcaat ggggggtgag gaagaaagct
1381 ctctcctggg tcaggcatcc ttctgtcact gccaaatagt taat
//
LOCUS RABBGLOB 44594 bp ds-DNA MAM 15-MAR-1990
DEFINITION Rabbit beta-like globin gene cluster encoding the epsilon,
gamma, delta (pseudogene) and beta globin polypeptides, complete
cds.
ACCESSION M18818 X07786 X07787 X07788
KEYWORDS beta-globin; delta globin; epsilon globin; gamma globin;
globin gene cluster; hemoglobin alpha-2 beta-2; pseudogene.
SOURCE Rabbit (strain New Zealand White, haplotype short) liver DNA,
clones lambda R beta G[3,5,8].
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 44594)
AUTHORS Margot,J., Demers,G.W. and Hardison,R.
TITLE Complete nucleotide sequence of the rabbit beta-like globin gene
cluster: analysis of intergenic sequences and comparison with the
human beta-like globin gene cluster
JOURNAL J. Mol. Biol. (1988) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by R.Hardison, 02-MAY-1988.
The epsilon and gamma genes are only transcribed in embryonic red
cells derived from the yolk-sac. Both genes encode abundant
amounts of embryonic globin polypeptides. The delta globin is a
pseudogene, and is not transcribed in any erythroid tissue. It is
the product of a gene conversion between delta and beta, so that
the 5' end of the delta globin is derived from the beta gene.
Finally, the beta gene is transcribed in red cells derived from
both fetal liver and adult bone marrow.
FEATURES from to/span description
pept 5999 6090 epsilon globin, exon 1
6212 6434 epsilon globin, exon 2
7237 7365 epsilon globin, exon 3
pre-msg 5942 > 7489 E-G mRNA + introns
IVS 6091 6211 E-G, intron A
IVS 6435 7236 E-G, intron B
pept 15525 15616 gamma globin, exon 1
15741 15963 gamma globin, exon 2
16781 16909 gamma globin, exon 3
pre-msg 15463 > 16997 G-G mRNA + introns
IVS 15617 15740 G-G, intron A
IVS 15964 16780 G-G, intron B
pept.ps 22252 22342 delta pseudo-globin, exon 1
22445 22665 delta pseudo-globin, exon 2
23421 23548 delta pseudo-globin, exon 3
pre-msg 22204 > 23676 D-pG mRNA + introns
IVS 22343 22444 D-pG, intron A
IVS 22666 23420 D-pG, intron B (no splice consensus)
pept 30970 31061 beta globin, exon 1
31188 31410 beta globin, exon 2
31984 32112 beta globin, exon 3
pre-msg 30917 > 32204 B-G mRNA + introns
rpt 1201 2586 direct repeat L10c-1
rpt 2817 3137 direct repeat C1
rpt 3553 3827 direct repeat C2
rpt 8895 9206 direct repeat C3
rpt 10039 10390 direct repeat C4
rpt 10444 10808 direct repeat C5
rpt 10866 11186 direct repeat C6
rpt 11815 12184 direct repeat C7
rpt 18604 18936 direct repeat C8
rpt 18937 19059 direct repeat C9 (approx. 5' and 3' ends)
rpt 19162 19470 direct repeat C10
rpt 20027 20379 direct repeat C11
rpt 20382 20743 direct repeat C12
rpt 24175 24573 direct repeat C13
rpt 25864 26076 direct repeat L10c-2
rpt 26206 26587 direct repeat C14
rpt 28578 29020 direct repeat L10c-3
rpt 32898 33190 direct repeat C15
rpt 33724 38097 direct repeat L10c-4
rpt 38098 44594 direct repeat L10c-5
BASE COUNT 12729 a 8346 c 9035 g 14484 t
ORIGIN
1 agctacggcg ctggcttgct agacatttta atgatagtca catagctgta acacacatac
61 atccatatag atgcagacaa agacatactc tgacacacac agatagatga tatagatggt
121 ccagaatgta ttttcagagg atttctgagc aattaaccac acaggaccct taaatcctaa
181 cccacaccac tttgccttca gtatgctacc tgtgtgggaa ccagggatgg ggacacagaa
241 gtcagaagca ctccaggttt ctgttgttag ccagggccct cccttaccat agcttcgtgt
301 cacactgagc cttatctata cccacagttt ctgaatccac ctggctatca ccagctcagt
361 ccttaggaga agggaaaaca aacactgagt ttgtctctgc ttataaaaac aaagcaaaat
421 aaaattaaat ttcaatccca tcccaaacaa atgaataact aagcaaaaac aaaaaggaga
481 agtggaaggg aagtggagga ggaagaagaa gagggtaaga aaacagttgc agaaagaggc
541 tggagatgtg tgcgtctgaa ggagggaaga gccccagtag cctcacaccg gcctagggac
601 ctgcggaagg agcagcttct ttcctctaca aaatgctcca gatttgataa gctaggaggt
661 ggggtaattt actaagagaa gggacacatt tggggctttt gttattcaag acttaaaaat
721 tttgggtagt gaatggtctt gctaatggat ggggatctat cttcttccac ttaagccctg
781 cacacttgcc attccttaat tttgtttcag ggagatactt tttttgtttt gttttgtttt
841 gttttagatt caggaaactt caatgtcact gaaaacagca tttctaggaa agaaagagtt
901 atttactttt gtaagtcaaa aatgatgaaa tcatggatga ctttctagca gcagaggtag
961 aatattatct tataattgga tattttccta caaattatag tacaaggttt tatgctctat
1021 atatctaaga gtctcatttg aagcaactct gtaaaaattt tccagtattg agtttatgta
1081 tattttagaa gtgctgcatt tgaaggaaac ttgtaaacta ttacttaagt cttgggcaag
1141 agcaatgaga tttgtctcaa tgacctaagg tagtaaactg gagggaagga agattaaaac
1201 atactcattc tgaacaccaa aaaggaaatc tgttgaagtg aaatggacac tatgagaaac
1261 ggtgacttga tcagcagagc cctgactgtt aatgaacaac ttaatacatt atccctctta
1321 ttagtttttt gtctgttcta cttaatatga ctggtttaat tctgtaattt atacacagtt
1381 attcttaagt gttgaaaatc aactaaaatg tgatccctgt taaacataag agtgggaata
1441 agagagggaa gagatgtaca atttgggaca tgctcaagct gacttgcccc aaatggtaga
1501 gttagaaaca taccagggga ttccaattca atcccatcaa ggtggcatgt accaatgcca
1561 tctcctagtc caagtgatca atttcacttc acaattgatc ataatgaaag gactaagagt
1621 caaagggagc acataaacaa gtctagtacc tgctaacact aaccgataga ataaataaag
1681 gggagagtga tccaacatgg gaagtgaggt actcagcaga ctcatagaat ggcagatgtc
1741 ctaaatagca ctctggcctc agaatcagcc ctaaaggcat tccgatctgg ctgaaaagcc
1801 catgagagta tttcaggcat ggaaagccaa gacactctgg caaaagatct ctgtgagtga
1861 gatcccagtg gaaagaacag gtcttcaaag aaggaggtac ctttctctga agggaggaga
1921 gaacctccac tttgactatg accttgtcta aataagataa gagtcggaga actcagaggg
1981 cttccatagc cttggaaact catgactgga gcatagggag attactgatg ccatagacag
2041 gagtgtcaat tggtaaagtc aacaacagga gtcactgtgc acttactcct catgtaggat
2101 ctctgtcctt aatgtgctgt acattgagac ttaatgctat aacgagtact caaacaatat
2161 atttcacttt gtgtttctat gggggtgcaa actgttgaaa tctttactta atgcatacta
2221 aactgatcct ctgtaaaaaa aaaaaaagaa attatcaact cccaacttga ctctcactgg
2281 gattaaacat gacaataggt ctgatctgat ttcatcatca tttaaaaaaa atcatctatt
2341 atttttcact ttatgtttct gtgtgggagc aaactgttga aatccttact taatgtatac
2401 taagctgatc ttctgtatat taagataatc gaaaatgaat cttgatgtga atggaagggg
2461 agagggagtg ggaaagggga gggttgtggg tgggagggac ggtatggggg ggaaaccatt
2521 gtaatccata aatcgtattt ggaaatttat attcattaaa taaaagttaa aaaaaatact
2581 cattctaata aaaatggcat cccttctatt tcctaacatt ttatttctat gttgctttta
2641 attcatggac tatgttagaa aaattattta acacaaacac agaaaaataa gaatttacca
2701 gtattttaat catgaaataa tctaacactt atgtatagga tcagaaaatc ccaaagaaac
2761 caaaaaaagt aaggccatat ttcagataaa aggccatatt tcagataaat ggtaaccaaa
2821 tgcaagaatg ttttaaacat ttaattaacc acttagaaag ggacacacag gcacacacac
2881 atacacagaa tcagagagag ctacctgctt gttcactccc aaatgccccc caaaaaccca
2941 ggataggtta ggcagaaacc aggagcccag actcaatctg gatctcccac aggtgtggca
3001 gggacccccc atacctgagt tctcacttgc tgcctcttag gaagctagaa tggggagcag
3061 agccaggacc caagctcaag ccttctgata tagaatacgg gcatctcagg cagcatcaca
3121 actggtgact tcaaaagttg tccctaattt caacttctaa tttccaagct ttatccttca
3181 gaaaagtgtg acgtgaaatg aatgattctg cctttccact ccttcttccc agtaaattta
3241 atttgcattc acagatctat ggttatgctt tagggccatt gtccacaagg tccaaagtct
3301 gtgaactaag ctactaagag ataaatcagt ctgagaatga gacctaattt tgactcaaat
3361 tcatctacaa cctttgtaag aaatgaggca gacgcctgat gtgttcacag cagcagcatt
3421 gtatgtccac tagtagtggt ttagtctcac tttttgcagc tacaccagaa aatattctta
3481 agacttatga aagcagccac cagaaattct tgttttttgt taccaaagtc attgccagct
3541 aaggcaaaac ataagagaaa cttctgctgt ggccagggag tgcagtggag gatggcctag
3601 gtgcttgggc cctgcacccc atgggagacc aggaaaagca cctggctcct ggctcctgcc
3661 atcggatcac gcggtgcgcc ggccgcagcg cgccggccgc ggcggccatt ggagggtgaa
3721 ccaacggcaa aggaagacct ttctctctgt ctctctctct cactgttcac tctgcctgtc
3781 aaaaaataaa aaaaaataaa aaaaataaaa aaaaaaggaa agagaaaaag agagagagag
3841 gaagagagga tatcggggaa ttcaatctgg gtctcctata ttggtagcag agaaccaagt
3901 actagagcca tcaccagatg tgcccctcca ccccaggttt gcattaacag gaagctggaa
3961 ttgacaacaa agctaagact tgaacctagg aaatctgata ttggatgtgg gtatcccaat
4021 caatcatcaa cttgtcatca tttgacatca actatgtcaa atgcttggtg ttcagatatc
4081 ttgaagtaat atttgtctca atacttggat attaccctca ttgtgaaatc accactagat
4141 ctaatgtatt cataatgaag tatatgtaac actacactac atacttgata tttcatgttt
4201 ggttatttta tctcaataca ttttatcctt agcaatatta cgtatattaa ttcatgcatt
4261 aacacattat ttagtgtttc cattctacct aattagtcca tgatatagaa gaagagtagg
4321 aatttctgac tttatctatc ttcttcacta aaagcaagaa agggaggtag agattcaata
4381 atctggaata atactagttt gaggggagtc atttaatctg atgaagcttc tactattaga
4441 gtataaagga agataaagta gtctcatgtg ctagaaatgt ttgcattaca cctttaggaa
4501 gtaatcataa gtaccagcgt agggttataa acaagagaag agttctgggt aaacttagaa
4561 cattgagatg caaggcactg gactagggag acaaaagcca tagtgcaaaa caaaaaggac
4621 ccagaagcac ccatgtcacc agaatgcaag aaactatagg aaaagaaatt tccatgtata
4681 gcaagagcaa aggaagaact agaggaaaga aaggatagaa acatacgggg aattagccag
4741 atataagaac tagttaagga atactagata atgattatgt agtggaatgt ttctggaatt
4801 aaaaaggctg gaatttctgc agttctcttc cattatttct ctgatcatca actttttcct
4861 tttgacaatt atataacttc tgtgacctct ctttctttat tccattccgt ggataatgcc
4921 aattgtatga ctcagtacag agcagtgaaa taacaaaggg aatatctttc atgaacatca
4981 ataattcact aatacatata aatgtggatt tacagttttg tctccaacat gcaaagaatt
5041 agccagtcat cacatcaggg ggcttcaaaa acaagggcac ttagaagaat ttgaacctct
5101 accctggtga ctacagcaca cactagacaa atcaattaac ttattcctca gtttgattat
5161 aaatcctggg tttaaataat tttagtgtca tctgcctcat ggattaaagt ataagtagag
5221 cagacaagcc aaaaaagaga gtcagggaaa tatccctgga gtgctctctg taagcacagg
5281 agataacctt gctgtctgtc ctgtctagga aatcactcct agactgtact ttggaattga
5341 gtcttagtcc tactgcagat cccttgtcac tctcttcagc ttccaggagt aaccatgcat
5401 ggcccctgtc tcatggaaaa actaagatct caaaagtgtt tgcagttcac tcagggaagc
5461 tcaagtggat gatgttctgc ctcaaccact ttcatttttt gtgtgtgtgt gtgctgtgtt
5521 gctttgtttt ccttgacctt ctttttgtca ccaaccttga cccaaatatg aaactacttt
5581 agccctaaca ctttccagtc ttatgcttag ctgctctcta tgaatctttc ttggaaaagg
5641 ggaatgggaa tgatggatgc catcttggaa agattacatg gaaaaagaag agagtgaaaa
5701 catcgataat gtatgacaaa ttgtgttctg cttggaggag gaaccaacag agtagggatg
5761 tctcacgatc cgaaagccat tatcacaaac tcagtgaacc accactgacc atctggtggc
5821 ctggctccac ccatgaagga cacattccag tcttgaccaa tgacttcaaa gtatgaaaga
5881 gaacaagggg ccagaccatc agcagtgggg aataaaaggc caagccttga agcagcagca
5941 caaagctgct tctgacacat ttgtgatcga tcaccagcaa gctcccagac gtgacaccat
6001 ggtgcacttt actcctgagg aaaagtgcat tatcagtaag cagtggggcc aggtgaacat
6061 cgatgagact ggaggtgaag ctttgggcag gtaagtattg gttctcaata cttggtagag
6121 aaaggcaaat atgaccctgg tagactgacc aggaaattgc tcaaagtttt tgtatctctg
6181 attttctatt tgctgttgtc ccatcatata ggctccttgt tgtctacccc tggacccaaa
6241 gattctttga caactttggc aacttgtcct cttcctctgc catcatggga aaccccaagg
6301 tcaaggcaca tggcaagaag gtgctgacct cctttggaga tgccatcaag aacatggaca
6361 accttaaggg tgcctttgct aagctgagtg agctgcactg tgacaaactg catgtggatc
6421 ctgagaactt caaagtgagt ttaggagatg ctcatgagct ctttctggtt ttataactta
6481 gcacaataat gggatataag actatgattg gagagatcaa caaagatttc agaagtcata
6541 gatatgtttt gttgttagag atctccaatg gtcatcaaca agaccaattt aattcagaaa
6601 tagtaaggta aaaggctatt aacagtctaa tttttcagaa aatgatgtgt ttatcctgag
6661 aataatgtta ttgaattgta gctttgtaca aggaaaaata gaatttgtct gttagtagat
6721 gaaggcctat ctcaatgaag gagaaatgca atgcttacat tatatttttt ctgatgatga
6781 actaggactt gattagggac gatttatgcg agatctcttg aaaaaaaata caacaaattt
6841 tctgaaaaat attaaattct ctcagtattg tgaccaaata taaacttgtt cctgctttgg
6901 ttggagactt taaactccaa gacccttgca acactggcat tctatgttta ctgaaactaa
6961 acaattatta aatgagaaag aaaaaaagaa agaagataaa tatggtgaaa tgtggaagga
7021 ggataagata taggtagtca gaatgttgaa cggaagtctc ataaaaatga aattgaatcc
7081 attcttgtct tggtttaagg gtaggcacaa tccttggagt aatttgaggt cagagttttc
7141 agggaatgta ggtacatagt catttctttc tttgtaaatt cttttgggaa cacctactca
7201 atatgcctat gtatcatttt gtcttttacc taacagctcc tgggaaacgt gctgctgatc
7261 gttctggcca ctcattttgg caaagaattc actccggagg tgcaggctgc ttggcagaag
7321 ctcgtgtctg gtgttgccat tgctctggcc cacaagtatc actgaactct cttcccagtt
7381 tctttagtgc ccctactcat ccccagcgtc ctccttctgt accttgggga ctaggctcag
7441 ccttgatggc acagtttctg tttaataaaa tacattctat tcagtaatca aaaaatgatg
7501 tctcatcttc ttcgtatttt actcatgtgt ttaaaaaaaa aaaaccacac atgttcatga
7561 ttgagacaga gagagacata ggaagaaata agaatctttg agtagtatat taggatttat
7621 caaataaagg gaggtgttat tggagggtac aggagaatcc agagaggaca ccaagatggg
7681 gaagaagttc tcttgggtgg ggcaaaggct tcattaagtg gactgagtgt ggcaaaaaca
7741 tacggcagag ggaagggaat gagggaccaa agtggtatct gcgaaatcag atttatggtg
7801 aatgttgttt cactcaaaag aaagatttgt agtgatcaaa actctagaca tggtaatact
7861 attttaatgt acttgaacat acacattgaa gaaattagct tgatttatat cttgagaaat
7921 tggtttgaat ataaagaaat gagttaaagt atgagttgga aatttccaga aaaaaatgtt
7981 ttcatgaaca aactgcagtt tgttctttcc tgaagggttc tgttgtctca atgggatgcc
8041 aagactgaga catattacat cattgtgcat tatcttctga ttgaatgctg taaaataaag
8101 ctagtttcaa cacaaatgaa aactttcatg ggaaagataa agatttcatg tagaaaataa
8161 aatactagca aatgtgcctg gtcctaaatc agcaaggtaa aacttggtgt catagatgga
8221 actggaattc tggaagtaga ggggagtatt aggaaagatt agatagtgct gagctttgac
8281 tataaaaaat aaaatctact actaggctat tagcgacaat gagttataaa tattcttagg
8341 atgctaagtg actccatgga taagattttg ggggaaataa tcttcttatg catgctggat
8401 gaattcaaat tttcagagac tagacatgac tgattattca aaaggacaga tataatgatt
8461 taaacctgag gggggcagag agatctacaa aagtgaatga tgtacaatgt cgagggagga
8521 agagatggta tctgattgac ggaggtgagt gagaaggaag gaggagctgg acaatgactg
8581 acaggtccca gactggctga atgtaacagg aggaggccag gcactggcct ggagaccctc
8641 ctataggagt gtgattaaat gattcactct tgagggagtg ctggtttgct cagggaatct
8701 tgacattcag gtcctgagtc cacctctcaa atcgggagaa tttacaagtt agaatgctgt
8761 atcttcacaa atgaaacttt ctcagatatt cattgccccc acacattgat cagaatttgt
8821 cctatgaatt atttttctct tttctgaata tattaattgt aaaattgtaa taaaacaagt
8881 aacaaaatgc atgtcagtat acttcttttt atttatttga cagatagagt tagacagtga
8941 gagagacaga gagaaaggtc ttccttccat tggttcaccc cctaaatggc tgccacggcc
9001 ggaactaccc cgatctgaag ccaggagcca ggtgcttttt cctacgcgag tgcaggcaca
9061 caatcacttg ggccatcctc cactgccttc ctaggccaca gcagagagct ggattggaag
9121 aggagcaacc gggactagaa cctggcgccc atatgggatg ccggcaccgc aggtggagga
9181 ttaaccaaga gagccacagt atacttaaca acataataaa tttgaaggca ttctatcttc
9241 catcctcatg aattccttaa aatttagtta attgaataat agtgtctaac agtattagaa
9301 aaagattact taattttgat agaaaccaga tattttgtga atattgttgc tactcaagga
9361 gaaaaaaatg caaaaaacac caaatgaaag ttttcttctt tttcttcaac attttgggaa
9421 tcattgtttt cagagtgagt taaatcaatt attccatgtg actttttgaa ttattgatgt
9481 ttgttgaagt ggaaacaaga atgtcgaatc taagaaattt gaaaggtatt ggctaggttt
9541 tccacctgta tattaaagga tattttcact agatataaaa tgatatgttg gcaatatatt
9601 tttatgaaag ataaaatatc attccattgt tctatgtttt cttctttgag ttttagcagt
9661 ttgactatgg tgtgcattgg atcgtttttc tcttctgttt aaaccttgct taaagtttcc
9721 agtcttggat atgtggaaca tttatgagat tttaaaagat tttgctcaat atttattggg
9781 tttccttctg tcctcctttg ctccaggctc ctctatcagc tgctcagttt tttttttttt
9841 cattctatct gtgcttcaat ttgagtattt caattgagta tttcctattg ctctttgagt
9901 ctgctaatct tttttatgtc atttctgtct gctattagct aatgtactga attcttcatc
9961 ttaaatatta caattttcag cacaggagtt ttatttgatt ttcttctata ctccaattct
10021 atgaaaatat gctgtttgct cattctctac cttttaaatt tatttttaat gattatctac
10081 tttctatgtt taggttttat ttttatgttt gtttttcatt tgaaaggcac agggacagaa
10141 aaagagatag aggtagagat agacccagag agagagaaag agatctttct gctggttcac
10201 tactcaaatt tctgcagtag ccataatagt atcagcctaa aaccaggaac tagggctcaa
10261 tctagatctc ttatgccgca ttccagggta catgttaact ggaaggtgta atagaaaaca
10321 cagttaggac ttgaaccaag cactctgata ggagatgtgg gcatcccaaa tggtgtcata
10381 accactgcat caaacactca cctcattaaa aaagtttttg attaacatat taaacgtaca
10441 tatttatgtt caaatattta tttgtgtatt tgaaagtcag agcaacagag agagagagac
10501 agagacagag acagaaacct tctataggct ggctcactcc ccaaatggca gatatggcca
10561 ggacagtgtc aggccaaagc caggagccaa gaactctatc caggtcattc atgtgggtgg
10621 cagaagttca agtacttggg aaatcttcag atgcttctcc aaatacgtta gcatgcagct
10681 ggatctgaag tggaacagcc aggactcgaa ccagtgctcc aatatgggat gccagtgttg
10741 caagcagtgt cttaaattgc tgcaccacaa cactggcccc aataattgta tattctaatt
10801 atgttaaatg cagtatccaa cacaagtatt tagcataaac tcatcaaata ggtaattagc
10861 ctttccatta tctttccttt tttgaagatt tattttattt atttgaaaga caaagtttca
10921 tagaggacaa gacaaagaga ggtctttcat ctgctggttc acttcccaaa tgactacaac
10981 aactgggtct gccagcccaa gccagaaaca ggagcttctt ctgggtcttc catgagggta
11041 gaggggtgga agcacttgga tctctgctgc tttcccagcc acattagcag agaatagata
11101 gaagtagaca tgaaccggct cccatatggg atgctgatga tgcagatagt ggttttacct
11161 gctacaccac aacaccaact ttatctcttc tttgtgtttg gagcctgtga gatcctctct
11221 ttcagttctt tgcagaatag ataacggtat tatgaactat aatagctcta ttctcctatg
11281 aaacactaga atttatttat ttggtctaac tgttttggta tctgttattc tacctccctc
11341 tatcctgctc cccatacctt ctctatcttc tagtaatcat tatgctaaag ccagtttttt
11401 gagacttcca tatatgagac agaacacata gtagttgtct ttctgtgtct ggtttatttc
11461 acttaacata aagagttcca gttccattca ttttgctcca aatgtcaaaa tttcattctt
11521 tttatggctg aataatattt cattgtatgc acatcacatt ttaattttta ttaatataat
11581 gaaaacagac ttaatgtagc tcatagatac aactctaata atataatgat acttcccttc
11641 ctccctccct ctccttttct ttcttctttt aattattgat ataacatatt tgtaacttac
11701 attaactttg acattatagt caaagactta atgctccatt aaataaagag ttagataagg
11761 ccatagttca tcagacatat aatcaagaac aataaacaat aatcaaaaga aattatgtta
11821 attttactca tatacattaa ttttttaaaa agatttattt atttattcga aggtcagagt
11881 aacacagaga gaaaaggagg ggcagagaga gagagtggtt ttccatccaa tggttcactc
11941 ccaaattgct gcaacagccg gaactgtgtt gatccaaagc caggagccag gagcttcttc
12001 caggtctccc acgtaagtac agggcccaag gacttgggac atcttctact gctttcccag
12061 gccatagcag agagctggat tgtaagtgga gcagccgggt cttgaactgg cacccatatg
12121 ggatggcggc actttgggcc agggtgttaa tccgctgtgc cacagcacca gccccatgta
12181 aattacattt taaaataatc ataggtcatt taaactacag tagtaaatca ttcttaaaca
12241 tttttgtaca aagattcttt tttttacatg acattttctt tatctatcaa ttgggcacct
12301 agatcaattc catattttgg ctgttgtgaa cagccaaatg aatatgacag ggaaggtatt
12361 tttatgatat actgttttta ttccattgga tatatatcca gaagagagat gctggataat
12421 atatggttaa tatattttta gttttttgag aaaactaaat accccatgat gactgcccta
12481 atttccattg tgaatgactt cttaatttta tgttaaatct ctatactatg gataaatgaa
12541 caacgattat tgtagttttg ggttatcagt ggcctctttg agttcagtat gaccctaaac
12601 agagtaaaca ggggttattc atgccactaa tgtttttaag caatcataaa tgagatgaag
12661 gtgaattttt aatcaatcaa ttttttcctc tcttgttccc aactattatt ctggctctgc
12721 tagaaggcat caaggatggt gactatggag ctatagacag gtaacacact cttcatccct
12781 aagagggggg tagatctaaa tctagatagt actttcccaa cataacataa ttggaataaa
12841 tttttaattc acgcatacac aactcagaga caggtgtaaa tgatacacaa aagtgtaagt
12901 aattcatgtc aatgtatagg caataagttt tctgaacaaa atagtgttgc aacaaccaga
12961 atattctgac aacagtaatg taaccaggaa tacagagagc ttagagatta tgccattacg
13021 aagattaaaa tcctgtaacg acctctcact cccaccagcg gaagacaggt tccatctctt
13081 tagacctgtg cctcttctgc aacaatgagt caatgttctc cacttcccag gcccaccatt
13141 tatctctttg gcattccaag agttgctttt ctattgcttc agagctttga atttgctatt
13201 tcatcttcta agttctcttt cctaatatct tcccatgaca gattcctgtt tgtcagtctg
13261 aagcagctca agtacttcct ctttggaaaa agctctttct gacacaatca tgcttcattc
13321 ttaccaagtc cagataataa ttttcttttt agaaagcagt gatatctgaa attattttgt
13381 taatcattta cttattcatt ttctgtatct tctgccccta aattttaaac cctcagagaa
13441 aaatttattt ccttgatcag ttgtcttaaa gccagagttt ctagaacaat tataacacaa
13501 ttattcatgt tattatgtaa atgtttaata gtacatggga tggcatttat gaaaacacaa
13561 ctttataggt gaggtcttag gtatatcata agtaatgacg tggggaaaat ggaaggataa
13621 aacatctatg gacaaggaga aaattggaaa tattacctca aaaagatagt gggggggggg
13681 attttcttgt tagcgatgac tcttgaatca ctgggtttct agaaagatag ggaagttgct
13741 tgcagatgtg agggatcata gatatccctg agtctgagag ggagtgatta aaacaccctc
13801 atttgttaag acttttggag tcttagatat tagcctctga gtcatcctcc aaggtcgagt
13861 tctataaact gaccacaata ggactattaa tgtggtcaaa gtgctttctt tataaatcta
13921 cttcacaggc ttcatttctt aaggagactg taacctgaat tcctttatat cagtagaaat
13981 gtcctctact cacccttagc ttgagaaaag tcataaatta ttttggttat actggaaaat
14041 tactgacatc ttgaacaaat ctgatcttcc tagtttttta atcattatca acagtgaaca
14101 aatacaggtt gtatgtggca tcaagaaaag tagaaggatt ttagctgtat tattttaatt
14161 gctgttgatc tgaaatgtgt aatttcataa cctgtatttt tatatgatca ttgagacttg
14221 ccacagggac aaataataat taagccagac ttccaaaact tcgaatatta tgaactagca
14281 aggtatgaaa ccatggactg ttttcagaga caataaaaca aaatagcata aacttaaaaa
14341 aaggactctt cctgccttat tcaaagtgtg acatgtcaga gacagaagtc aattaaagat
14401 aaaagtgtct aaaagtttag ttatgtccca gctgtgacag tacatgaatg atgagtcaat
14461 ggaaatagct cctgcctttt gagctatttc aaacacacat aagaagagaa ggagatgctc
14521 cctgatgttt tgtcttataa taaaaattgc tgaaaatttt ccctggcatt tatcagagat
14581 aatttactca gagccacact gagtacagaa gagcaacacc aatactctgg gtcaccagtc
14641 agcatacaca cttatccagt gggaagcaca cattatgtgg tgcctacaga gcaagacatt
14701 catgtggatg aaaatatatt ttgctcacgc agtacagaca tacttatttc aaacagatac
14761 aatacatgta agtagaaaat aaagtagata catgtaagta gaaaataaag taagagttca
14821 attctgtatt atgctactta atactaacat atttatgcat ggagagttat gctttatggg
14881 agttgttttc cctacacagt tgtgaattag gaatatatct cctaaagcca caaaggatat
14941 attcacaatc agtataggac ttctcccctg gccattccta actgaatgag atttatactt
15001 gtctcatcag aaatcttgga tctgtgccca aaattatctg aatacttttc atatgtttgc
15061 ttatgcttta gctataattt gcaggcatct atggtgttag tgactaaaat ataggaaatg
15121 ctggcaaaaa aagtcctaga ctatttaatg gtgaggaaac aagaaaaata gctaaataga
15181 aaaataaatg ggagaactct ttaaaaaaat gaattgaaaa gggacaggac taacaaaact
15241 tgaggagcag agcccccttg agggggcctc tctgccacta tctcaatata tagtctgtgt
15301 gaagcttctc tgcctaaacc tcacccctgc gctgaccagc cttgccttga ccaatagtcg
15361 ttacacaaaa acactgacca atagcctcag agaacacggc gaaacaaggg gccagatgtc
15421 cagcgaggaa gaataaaagg acgagcctta gagcagtttc acatacttgc ttctgagaca
15481 tctgagacta tcagcaagct cagcgagctc ctagaccaga catcatggtt catttcacgg
15541 ctgaggaaaa ggctgctatc acaagcacat ggaagctggt ggacgtagaa gatgccggag
15601 cagaagccct cggaaggtag gccctggggt ccaggacaag gcagaggagg aatgaaactg
15661 agcctggcag gactccaggc cctttctcag gacttgtgaa gctctctgat actcccacca
15721 ctattgttct gtctctatag gctcctggtc gtctatcctt ggacccagag attcttcgat
15781 agctttggaa acctgtcctc ttcctctgcc atcatgggaa accccaaagt gaaggcccat
15841 ggcaagaagg tgttgactgc ctttggagat gctgtaaaga atgtggatga tctcaaaaac
15901 acattcgccc acctgagtga gctgcactgt gacaggctgc atgtggatcc cgagaacttc
15961 aaagtaagtc caagacaagc tcaagccctc ttgcatttag tcatgggtga aggtaggaca
16021 ttaagtattg gccttctgca gcataggaga tatttggttg ggagataaga aagctctaga
16081 ggaactgaac tgagactaag tggtaatgca ttagggtcta aacaatactt tcaggagtgt
16141 agagggattc gtctatactc aagtgtattt tagaagaggc ttctgtagag atgaggaaac
16201 agacttttgt taatttgact tttgaagaaa taactccttc atttctcccc caactcactt
16261 tccttaaaaa agtttaaaac cctaatgaag gcagaaaagg agtgacaatg ggaaagataa
16321 atgagggcag ccataaggcg agacttacgt gattttgtgt agagtgccat gcatattcct
16381 ctaatggcta atacttgttt aaaatgctgc aatgcccatt tctaacctaa attatacact
16441 tccagattga ggattatgtg tctctgggag agaatattgg aacatttggt ccgaagtggt
16501 tttgaaaaga taccgtgagg tgttaccttt tcccctcagc ccacaaatgg ggagttgtgt
16561 tagatgatga cagaaatatt taaattatga ccaatgatta gtttggacag gggaccaaac
16621 tacttacatg taacaactgg gaggggggga agatcagcct tggaggtaaa ttaacaaagg
16681 cttggttttg aggggtattc agtgtccaga tagcgggaga ctagcctgaa ctctctatta
16741 actatgtgtt tgtgtttgtc ttctcttttc acctccacag ctcttaggaa atgtgctagt
16801 aattgtcttg gcgaagtatt ttggcaagga attcactccc caggtccaga gtgcctggca
16861 gaaattggtg gctggagtgg ccactgccct ggcccacaag tatcactgaa ccccttcact
16921 gtaggacaga gcttctagga agaagcttta tccctcaaat aataatgaaa ataataaaac
16981 tactctaaga aattatttgt gatggtattg agtttatttt ccttgtactt ttaaatatat
17041 gatcctcaag ggatttattt tttttattgt tgagggatgt gtgtgtgtgt gtgtgtgtgc
17101 ctgtgtacta tgtgtgtata ctcttcattt acttttgctt gagcggtgga tttctgatga
17161 gaaaacagaa gaacaaggca acaaaagggc atgcatgtga gaattctgca cgggaaacaa
17221 ggagtgggga tttcgggtga ggaggaagtg ctttatggga caagaattca gaacaggaca
17281 gtgtagagga ggggatgaag tcttcaggga agtcctcagg ggctgacttg aggcagagta
17341 gtgatggggc tcaatggaaa gaagaaatta cagaccttag taggtctaag tctgggcaag
17401 agttctaatt tcagtctatt tggtgcaaca caattcccac atgcaccata ctttccacaa
17461 catttgtggg gctagatagg aggttggaga tcttgaagaa cacagcaggg ggtagttctt
17521 ttccaagtta accacaaata gctgatcctg aagagattta tgctttaagt gttgaggctt
17581 gctaggatag ttaatttgat cccatgcttc ctgggcagac aatgtccaag ggtacacata
17641 gaatacagta cttcaaggaa aaagtatcat tgtgagtagc tagaagggat gagtatcttt
17701 gtgggggaaa taaatcatca gaaggttaga aataagaatg catgtgatat gtgatcaatg
17761 gagcagcact gggcggtcag agggaaacag gagggatttg tattgattag tttgcagtga
17821 aggtgttgta tgactaaaag ttaatccatg tttataggca ggtgtacaat acaatgacca
17881 tggaattagg aagattagcc tggctgtggg tgctaatcat gctaatgtta ggtgactatc
17941 attgcttttc ttgccttgga attcaatctc atctcaattt tcatctataa tattccaata
18001 atacacctga gcccatttct ctactttcaa tccttttgtt cctcccttaa tacattttac
18061 tctgatgttc taatttattt tttctattat gaaatgccaa aaattcttgg ttcagcttgt
18121 ttttttcttc tgcagtttca gagttttgca ctgaaaattc ctgctgcttc catggactta
18181 tttctctaac ttcttatatc tggctctatc ttgtcattag gttcctaatt caactctgac
18241 ttcttttact aaagctattc tacgttattt ttctatagcc agttgctgct gtatcatttt
18301 atgcattttc ttctgtcata caatacctca taaaattatc ttattcagtt atttgtttag
18361 ttgtgtgcat tcattgcctt ttctattagc tgtgaccttc atgagacttc ataagttttc
18421 ttgtctgttc attcaaaatt tccaacattt agaggtgagc atttagccta ggagttaaga
18481 tgctggttaa aatgtctgtg tctcacacta gagtcacagg gtttaatacc cagctatgcc
18541 cacaactcta gtttattggt aatgaaagcc atgggaagca atgatgattg ttcaaataat
18601 tgcttttctg tggccagcac acggctcact tggttaatcc tccgctgcgg tgccggcatc
18661 ccatatgggc ccagggttct agttccggtt gctcctcttc cagtccagct ctctgctgtg
18721 gcctggaggg caatggagga tggcccaagt gcttggcccc tgcacctgca tgacagacca
18781 ggaagaagca cctggctcct ggctttagat cggcatacct ccagctgtag cagtcatttg
18841 gggagtgaac caatggaagg aaaacctttc tctctgcctc tctctcactg tctattactc
18901 tacctgtcca ataaataaaa aaaaaatgtt ttctgtcacc tctgtgggag accaggattg
18961 agttcctacc ttggctttgg atcctatctc cctcttggcc attgcaggac ttagaggagt
19021 aaaccagtgg atggaaactg tttctctcta cctctcaaat ctatttttta agtttccagc
19081 attttaaaca atgtatagat tatttggcta agtgaatgtt ttgctttaaa aaatgatggg
19141 agcatgaatt tggaagattg gttatgtagc tgcttagaat actcacatcc catattggag
19201 tgcttccttc aaggcccagc tcctctgctt ctgatccaac ttcctgctaa cacataccct
19261 gggaggccac aggtgatagc ttaagtgctt ggttccctgt caccaatgtg ggagacctgg
19321 attgagcttc agactcctgg cttcaatctg gccctgctct ggctgctgta gccatttgaa
19381 gagttaacaa gataaaatat ctctctatct tctctgggtc tctacctttc agataaaaag
19441 taaaataaaa taataaaata aaacttatgt gatgagtcag atgattgtga tgatgttcta
19501 gcactgatgt aaagagggat caaatgtcag tggaggccat gcatgtgaaa taaattggaa
19561 gtgtggaaca ggtatgaaca agaaagaatt aggaagcctt aaaaagtgat ttacaaagag
19621 ggtcaaaaca gcatttggaa gaaactgcag taataaatgc ctgtccagct ggtaaggtgg
19681 ggagaacatt cgctgacctg cagagtgatg aatacccctg aatcagcagg gatggcagtg
19741 tgttcacagt gtttgaggga taagcattac tgccattaga agctagtgcc ctaaaccatt
19801 tcatcccagc ccacaaagga gtttaagtga gagccccttg gcttctatct ttcactaact
19861 tctactctca aaagggctcc tttgggtcga ctcaaactct tccttcaggg ctatataaca
19921 aagcacagaa tctttccttt gcagctctgc accagtccaa aaggcactga gaggtcagag
19981 agaattcagt ttgttgggat gaattagctt cttgaagctt agcaattgtt attcatgttt
20041 tatatatttt aagatttatt tacttgaaaa gcagagttac agagagagat ggagacacac
20101 acacacacac acacacacac ggatagaatc ttccattcac tggttcacta cccaaatgcc
20161 catgacagcc aggtctaggc caaagccagg agccaggaac tccatctggg tcatcccctt
20221 gacggcaagg ccccaaacac ttgggtgatt atctgctgct ttcccaggct tattatcagg
20281 aagctggatt gggagtggag cactagggac tcaaaacggc attcctacat gggacatctg
20341 ttttgcaagc agcaactgaa cctgctgtac catggtattg gcccaatgga tgcattattt
20401 tttttttact tgacagagtt tgacagtgag aaagagagag agagagacat agagaaagga
20461 gaaagttctt ccttctgttg gttcaccccc aaaatggccg ccacggcagg cacgctgcac
20521 tgatctgaag ccaggagcca ggtgcttcct cctgatttcg catgcaggtg caggagccca
20581 agcaattggg ccattctcca ctgcccttct gggccacagc agagagctgg attggaagag
20641 gagcaactgg gactagaacc caacccccat aagggatgcc tgagctgcag ggggatgatt
20701 aaccaagcga gccatggtgc tggctcccaa tggatgcatt attaatagta atcactcaca
20761 atggaacaat gctcagttct tatcaaacat aatcaattct tagttgtgtt tccaaacaga
20821 aggtagatag aaatcctttt aattttaaaa agcaaatcaa aaaccaagaa aaaattttct
20881 gtacccttat cttaaagtgt gaattacaat taggaaggtc aagaattata aatctattga
20941 gcattgggtc ggatgctgct atttttaagc taactttaaa caaaatactc tgataatttt
21001 tgaatgtatt tctgaagttt gttgagctta gactcaagac taggaaaaga gaaataaccc
21061 tcttcaatgg aggttttcac acccagcaat gctgaaatgc ttgcatatac atggtgtgag
21121 gttctttttc ataggagcat atctacacaa tataaaagtt tctagtatct gtggcctaga
21181 ctagatattt tagtctcctt ggattaccaa cccttcttta taaaagaggg cagtctctag
21241 aggagcgaga aaatcaacca cagtgttctg tcatagtgaa ttagaaggtg gtggttaaaa
21301 tgctggttgg gaggcccaga ctccacatta cactgcttgc ctttgagtct tggctctgat
21361 cgtgattcca gcctccacac acagctaatg tgtgtcctgg gaggatgctg gtgatggcac
21421 aaataataga gttcctgtca agtcatattc tttctctgct tccctagaca cttccccagt
21481 tcagttcaga cagaaataat atcaacatgc aatgttgata tcccatttga aagcattgct
21541 tctcccagtt tttcaacttt ggctggtaat tgggttttgg tgtgtttgta aattttacat
21601 ctgaagtctc catgtttcca tttctgcatt gtaaaaatct aataagtatc acacagaaca
21661 aacttgttcc agatttattc tatttttcaa acattattgt ttgaatgaat atgtattttc
21721 ttaccagaag gattaatgca aagaagagag aagacatact tagaaatatg gcagagatat
21781 catccattct ttctaatttt ttttgcatgt tcttgagaca caggaagaga tccatatgta
21841 tacttttaaa ttatccaaat atgcaaacct ctactttgtg tgctttcttt cttcatttgt
21901 gtaataaaaa aagttgtgga agtaatttct cataggcttg tgatatcaga aaatttacat
21961 aaaactgcat aaataaaact gctaaggagg gtgattttgg taacaatgaa tattttctgg
22021 gaggggtcct ggagatacat agagggaaaa gtctgaattc agactcataa gccagtgcca
22081 gagctgacca tgaccagtgt tctgtcactc ttctggcctc accctggcat tggtgactca
22141 ccagggtagg cagtgcaggg gccactactg ggcataaaag gcagagtggg acagctgctg
22201 cttatgcttg tatttgacac aactgtgttt actagcaata ttccaaatag aatggtgcat
22261 ctctctggtg aagagaagga tacattcact gccctgtggg gcaaggttga tgtgaagaag
22321 ttggtgatga ggccctgaga gggttgcttt tcaagttata gggcagtcta ttgaggcaga
22381 tggaagctgg gcttgtgtag acagagctgg ccccagggtt tctgccaggc actgaatccc
22441 ccagtgctga ttgtctaccc aagaacccag aggttcttta taccgtttgg ggacctgtcc
22501 tctgaacctg ctgttatgag taatgtcaag gtgacggccc attgcaagaa ggtgctggct
22561 accttcagtg agggtctgaa tcacctggac aacctcgggg aaaattttgt ccagctgagt
22621 gagctgcact gtcacaagct ccatgtggat cctgagaact tcagtgttag tctaggcaat
22681 gttccacttt tttcttttca cattctcatc ttcttttttt gttgtttgtt tgttaagcaa
22741 cctgtctttg tcaacatttc tatctaattc cctatatttc gtcatttatt gcttttcaaa
22801 aattttcatt tttctttagt atttcgtttt ttcattcctc acagcattgt ttttttaatg
22861 tttctttatt aatatcctat attttccttg ctctccactc cctgcccaat atcctttcct
22921 ctaattgcta tctttgttat gcattctaga ttccatatgc taattctgca cctggaaaaa
22981 accttctaac cctccatgga gataggagag acttctcaaa aaggagaagc acaggatatg
23041 tctataggct ataaatcatt tggaacatga agaataattt gaattttata aattcaagaa
23101 taaatgaaaa taaaaggaag gtaactatct atctgaggac aatagatcaa aaggtcacat
23161 tgtaggtggg gttgcagatg ttgctaagaa gcaaccacta ttacatcaat taagggatta
23221 attagtaata ttattatata tgcatataca tgtacatacc tatgtgatta tacacttata
23281 tatatattgg ttgttttaaa ttgagactct atttagatta agatgtagga agaaaataat
23341 tagtgttggc ccaatttcag aatacaagct tgatttctgt gctaaccata cgtaaaatat
23401 ctccatagct ctttctcaca ctccagggca atgggctggt gattgtgctg gccttctact
23461 tcggcaaaga atttacccca tagctgaagg tgcctatcac agagtggcag ctggtgtggc
23521 caatgcctgg gctcacaagt acccctaaga tcttgacttg ttctctactt accctgttcc
23581 atggaaaatc ttttctatat ttcatctatt gaacatgaag acataatacc caacctaagg
23641 gtgtgagcct aataaaagat catttatctt aacttcatgg ttcatatcat ttattttatt
23701 tttctcacca tggatatgtg gtgggggaga atccctgatg gcttacagat aggaacctct
23761 tgtttcttat aaataaatca agaaaatgag aaaaagaaga gggccatata ttcctactaa
23821 catttgcact tctatctcca aagcatagag attgtcacaa gtgtttcata aagggaggat
23881 tctgtattac agagattcca caagagacct tcaactaaga tatggcctca taaccacttt
23941 caaaagttac tgagaaaata atgtgctcaa gggcataaat ttttttgctc cagactgtgg
24001 tgtttccttt ctttctcttt ccaatattta tacactgtaa ctcgtattac taattgccaa
24061 acatgtattc ctgctctaag aaccttcctg ccttcctgct cccaaaacct cttcagaata
24121 atgcttcttt atccagtcct caatttccag tacagtttag tactctctca tcactgctga
24181 ggtctgtgga gctcagcctt ggggaatttt tatttttaat ttttaagatt tatttattta
24241 atttaaaatt ggtattatag agaaagggag agagagagag ggagagagag agagagagag
24301 agatcttcca ttcactggtt tactctacaa atggccatat tggctaggct gaagctagga
24361 gctttatcca gatcccctat gtgggtggca aggacccaca cattaggcca tcttctgctg
24421 cttttcctgt gccattagca gggagttggg tcagaagcgg agcagctggg acacaaacta
24481 gtgctcatat gagatgtcag catcagaggt ggcagcttta cctcgtacgc cacaatactg
24541 gctccctcag tgaacttttt aatgttgctg aggaaattct ttccaaaccc ttcctgtata
24601 ttcatttttc acaactttac agctcttctt atgtcacaca cacacacaca cacacacacg
24661 agcaaataca tacatacacc ccacatatat tatgcttgat aaatatttct gtctattctg
24721 cactaatttg caggcctact gtgccctgta tcaaattctt ctatagataa taagtggaaa
24781 ttttttcaat ggaataataa gtgaatgaaa gtaattagga ctcctagctc ttgagggaaa
24841 taagaatatg ttagatcatt gcgtggagtt cttttgatgt aaatactgca atatacaatg
24901 aggaagcaag caggaaagta aggtactaaa gataaatctt ttgctttata atacaggctg
24961 ccaaatttat tcttgttgaa agtaattttg ttatgtgtaa gattagaaga atttttagaa
25021 ccctctaaca tatgaatttc ttgaaacaca acacattctg gttaattaac ttggacttac
25081 ttgaatacct aaatagaacc tggtgctttc ttctatatca acatgattct gtaagaactc
25141 tgcatatact gccaggattt accaacatgc cactgtgtga gggtaaaggg ttttttatgt
25201 acattttttt cttctcagat gattgcacat agtgtgacca aggaagtctt tatatacaaa
25261 taaaattttt taaggactac agacttggga tatagatttg acataattgc tatgatgctt
25321 tattgctatg attgccaact gctagatacc tatatcctat agtggagtgc ccgaatttga
25381 ttcccagctc tgtttcctgt ttccagcttt tggtaatgca gaccagagag gcagcagtga
25441 tagttcaaga gattgtgatc agtacatcca tgcgggagaa ttgcattaag ttcccagctc
25501 atagctcctt ttctgttcat ttgtgttgtg aaccaattga tgggaacaaa taacaatgat
25561 tattaaagaa acacagactc acatctcaga aaccaaatta ctccaggaaa aatcacagga
25621 agatcacttt tatcctcctg aacataggaa acccctccta agaatcagat aaaaggataa
25681 gcgatttctg aaggatctga aagagggtaa gagaatgtgg actttatgct cttgaaatag
25741 ggaaacaact gacctttttt tcctcaacat ggtcaagaaa agaacaaaaa tattttatta
25801 tattgtgttc catttttctc catctgaaag tggcaaatca atttaggaag aaatttggcc
25861 tacaacttac tatgaactga tcttctgtaa ataaagagaa ttgaaaatga attttgatgt
25921 gaatagaaga ggagagggag cgggaaaagg ggagggttgt gggtgggagg gaagttatgg
25981 gttgggaagt cactgtaatc cataagctgc actttggaaa tttatattca ttaaataaaa
26041 gttaaaaaaa aaaaagcaaa aaaaaaaaag aaacttgtga caagcataag taattactgt
26101 gggcaagcat tactgccttt ctattgtggt tgaggtggtc aagagccagt agagacttgg
26161 cagcttgtta ttatagtctc atttattggt agacaataat tacttgaaaa tcacagcaga
26221 ggctggtgct gcggcttact aggctaatcc tctgctctgc ggcgccggca ctccaggttc
26281 tagttccggt tggggcaccg gattctgtcc cggttgcccc tcttccaggc cagctctctg
26341 ctgtggcctg ggagtgcagt ggaggatggc ccaagtgctt gggccctgca cccccatggg
26401 agaccaggag gaagctcctg actcctggct tcggatcggc gcagtgcgcc ggccgtagcg
26461 accatctagg gggtgaacca acagaaaagg aagacctttt tctctgtctc tctctgttta
26521 actctgcctg tcaaaaataa gtaaataaat aaatgtgcac ttccctattt acaaaaaaag
26581 aaaatcatag aagaggaatt aattccttgt gatttgactt cacatttatt tgataacttc
26641 ccatacagtt atttctattt tcccccatga tgttgcttta taggtgattt cattacatga
26701 agatgggaaa gatagagatg ggaggaggga gggaagaagg gagggagaga aagagagaga
26761 ttgattgtag gtttgatttc aaaaacattt aaaaataata ttatttaatt aaaccaagtt
26821 gaatctgaca gaaaattatc ctaagtatcc atcttttgag ctcagagacc tactctgtag
26881 attttaactt gtgaatcaat atctacttgt atttataggg tgcccagact taagccaaga
26941 ttctgtctct aaaaaggtga aatctccaat ttaaccttgg aggagtttgc accattgctt
27001 tcaaggaatg gcccactcag tgaggtgaat aaaatctaat tctgagtgaa aatctgagaa
27061 aacatcaaag taaaagtcat acactgaaat accaattgct ctataataat tagaaatata
27121 taagttgtat tcaaaaaagt tcatggaaat gcattttatg aaaagcatac atgaaaatat
27181 gtttttctac caagatcaat ttatctatta atttcatttt tcaatggagt ttttaaagaa
27241 tccctgcact ctagcaaagt ggtatctggg ttaactttac actttttgat gagataaata
27301 aattcttcct aataaataaa gactacaata ttccctaatt cttcaactat tttgtgtatg
27361 cccctctttc tatgagttct gattttctta gcgctatttt attttactta tccttagaag
27421 atgctaagtt gaagtagctg aaaaattgtg cttctgggaa acaaaactac agcaaacagc
27481 tgaactatgt ttagtcaaga aagcagacag tagattgctg aaatgtcaca catgaggaac
27541 ttaatataga acaccaaata tattcagaaa cattcatgct gaaggccatt tttgaactgt
27601 acagcttcac attcacctag atgggtgtca acaggtggat tctgttgacg gagcttcatg
27661 ttcatccact taattatact ctgattagga ccactttcgt atagaacatt aggaaacagg
27721 aaagcctgcc agaatggagt caaggaagga aacttgtgcc tctgcctctt tctgagaaca
27781 catattttgt gtatgtgtat gtgtgtgtgt gtgagtgtgt gtatgtgagt gtatgtgtgt
27841 gtgtatgtgt gttgcagtgc ttccgggagc tggaggcaaa gtagagcaga ctatctgatc
27901 cctacaggaa agaagcaggc tcatgctgtg aggcagcagt gaatctcagt tactttctgg
27961 gaactttgga tggtgtagat ttgagactga ggctctggac ataaccacat cgatcgctat
28021 ttatcctgtc ttgaaactta ctatgtccaa tacatcagaa cttttgatag tcctatgact
28081 tgtaggttat ctccacaggg ttatgggtaa cagacggaaa agtctgtttc acattctaat
28141 cttgctgggg cagcagcaaa ccctcccctg aacccaagat atagcactgg tgctggtgcc
28201 agtacaagac agtgacaaaa tgtaacttag gggatctagt ttgaacttga gaaatggcgg
28261 aggaggtatt tagaaaggtg ccaatttctg ctaaccaaga ctgcaaagtt gtagtcttga
28321 caattctgct ctttacataa aattgaagca ttttctgtgt tgttctttag taagttattt
28381 aatttttcta tgaaaaagaa ttcttgaaaa ggatttaccc aaacttccta gaacaccttc
28441 tttcatatat atacacctga catcagataa aataggtatg aaagaggata tttaaaagtc
28501 tccttgtaaa gtgatgacta aactaatgac tttaaattta ataaaatgaa aaccaaaggc
28561 aaaattgcta aagccaagtt aaaaaaagaa acttaaaact caaaaggctt ccatagtctt
28621 ggcaactcat gacaagagcc tagggtgatt actgatgcca taaacaagag tgtcaatttg
28681 ttaagtcaac aacaggagtc accgtgcact tactcctcat gtaggatctc tgtccttaat
28741 gtgctgtaca ttatgattta atgatataac tagtactcag tgtttttcac tttgtgtttc
28801 tatgtgggtg caaactgttg aaatctttac ttaatatatg ctaaactgat cttctgtata
28861 taaagagaat tgaaaatgaa tcttgatgtg aatggaaggg gagagggagt gggaaagggg
28921 agggttgcgg gtgggaggga agttatggtg ggggggaagc cattgtaatc cataagctgt
28981 actttggaaa tttatattca ttaaataaat gttaaaaaaa aaacatttgg aagacctctt
29041 tccccagtat tcagcatttg aaatgccttc ctaggggtaa attatatgct cattgctcca
29101 gtctttgaaa gaggaataag tataaagtat ttaaattctg tcccttttac cccatgatca
29161 tatcctgaac acataattgt tacatatttc cgtataccca gagaagaatc aggccttccc
29221 ataagtgaaa gaatgagcag aatgcagttt gttcttgagt acctgatggg tgtctgagca
29281 gtttaaaatg gtgagaaata actgatggat ttagcagtga tggggacagt aatttggaga
29341 atagcatgat agaaagggaa ttttgtgaat ttttctggaa aaactggaaa tagagaattt
29401 gttttttttt ttttctggct cactgaaaac acattaatgt aattggagtt tcagagtgaa
29461 atgacaagag aattctattt gacatttgtt gccagtaatg gcaaaggaga gagaaagatg
29521 tttctagaca ttatggtacc tgcattagaa aatggttcta tggtaaggaa aagtatggct
29581 ctttccatac tggaaggaat ggcagtttag gggagggata tactgatgag tcagggatga
29641 tgcaaagagg ggagaatggg cagcagggtc tgtgggaatg ctgagatcct ccactgagct
29701 cagaagcact tcctcctttg cttttttcat tgcactttta acaatgacaa gcagaactcc
29761 acactccaca cttcagaagt ctttcctagc cggagaagtt agcaatttat tatgtgggag
29821 aaaagatgag aaagagaaag acattgggaa aagctataaa gatagaaggg taaactttag
29881 taaaagcaat caagtaaaca aaaaatgtat agagaaaaca gggctgttgg tggagagttt
29941 ggatcaaagg aatgtttcta aaagatagca ttcttagtgg tttagaaggg atgaaacagc
30001 tcaggctgta aatgtcttcc catgctccta ccaccacctt caagatccag aaacatcccc
30061 gtttttccct agacactttc ccagatcacc aaggcagagc taattaggca tccagtgtgg
30121 gtatggcatt gtaagccatt gcttttccta gcgttacagc actggcatgt agctggtttt
30181 gacacatgtg tagttttcac ctcattgaga ggtccctgtt tccattgctt caccccccaa
30241 accataacag agtaaaaatt ccacaaaata aatttgattc aaatttattc tattttaaaa
30301 gtttatttat ttgattatat gataaaatta aatgagtaac aacaacaata acatataatg
30361 aataaattaa taagtgtttt tcttaccagg aagatttact ccaaagcaga aggaagtcgt
30421 acttataact atggcacagg tttcatccat tctgtttttt gtaattattt tggatattct
30481 gaagacataa agagatctct ctctctctct ctctctctct ctacctatct atttatctat
30541 ttaagtggat ttcaacacac aaatcttctc ccttttctgt gccttaaatc ctcatttgta
30601 tgataaataa ttgcagagaa aatttttcat aggcttacca ggctctaata acaaaaatta
30661 tataaataaa tttggcaaga aaggtgtttt cagtagcaat tagtactgct ggtatgggtc
30721 tgggagatac atagaaggaa ggctgagtct gtcagactcc taagccattg ccataactgc
30781 caaggacagg ggtgctgtca tcacccagac ctcaccctgc agagccacac cctggtgttg
30841 gccaatctac acacggggta gggattacat agttcaggac ttgggcataa aaggcagagc
30901 agggcagctg ctgcttacac ttgcttttga cacaactgtg tttacttgca atcccccaaa
30961 acagacagaa tggtgcatct gtccagtgag gagaagtctg cggtcactgc cctgtggggc
31021 aaggtgaatg tggaagaagt tggtggtgag gccctgggca ggttggtatc ctttttacag
31081 cacaacttaa tgagacagat agaaactggt cttgtagaaa cagagtagtc gcctgctttt
31141 ctgccaggtg ctgacttctc tcccctgggc tgttttcatt ttctcaggct gctggttgtc
31201 tacccatgga cccagaggtt cttcgagtcc tttggggacc tgtcctctgc acatgctgtt
31261 atgagcaatc ctaaggtgaa ggctcatggc aagaaggtgc tggctgcctt cagtgagggt
31321 ctgaatcacc tggacaacct caaaggcacc tttgctaagc tgagtgaact gcactgtgac
31381 aagctgcacg tggatcctga gaacttcagg gtgagtttgg ggacccttga ttgttctttc
31441 tttttcgcta ttgtaaaatt catgttatat ggagggggca aagttttcag ggtgttgttt
31501 agaatgggaa gatgtccctt gtatcaccat ggaccctcat gataattttg tttctttcac
31561 tttctactct gttgacaacc attgtctcct cttattttct tttcattttc tgtaactttt
31621 tcgttaaact ttagcttgca tttgtaacga atttttaaat tcacttttgt ttatttgtca
31681 gattgtaagt actttctcta atcacttttt tttcaaggca atcagggtat attatattgt
31741 acttcagcac agttttagag aacaattgtt ataattaaat gataaggtag aatatttctg
31801 catataaatt ctggctggcg tggaaatatt cttattggta gaaacaacta catcctggtc
31861 atcatcctgc ctttctcttt atggttacaa tgatatacac tgtttgagat gaggataaaa
31921 tactctgagt ccaaaccggg cccctctgct aaccatgttc atgccttctt ctttttccta
31981 cagctcctgg gcaacgtgct ggttgttgtg ctgtctcatc attttggcaa agaattcact
32041 cctcaggtgc aggctgccta tcagaaggtg gtggctggtg tggccaatgc cctggctcac
32101 aaataccact gagatctttt tccctctgcc aaaaattatg gggacatcat gaagcccctt
32161 gagcatctga cttctggcta ataaaggaaa tttattttca ttgcaatagt gtgttggaat
32221 tttttgtgtc tctcactcgg aaggacatat gggagggcaa atcatttaaa acatcagaat
32281 gagtatttgg tttagagttt ggcaacatat gccatatgct ggctgccatg aacaaaggtg
32341 gctataaaga ggtcatcagt atatgaaaca gccccctgct gtccattcct tattccatag
32401 aaaagccttg acttgaggtt agattttttt tatattttgt tttgtgttat ttttttcttt
32461 aacatcccta aaattttcct tacatgtttt actagccaga tttttcctcc tctcctgact
32521 actcccagtc atagctgtcc ctcttctctt atgaagatct tattaaagca gctgggacag
32581 ggacagaaaa agggctttga ctgcctttct cttgagccct tttcctgatc tccacaactc
32641 actgatacca ctggtctcat tggaaggggt gggctgttaa cagtgtgaca aatgtaggaa
32701 taaactggat gcaaaagggg gctttgtgca gctttatatt cactgttgtc ttaaaccctt
32761 tttatggact caaatcaaat gacagtccct caggatgtta gcttctgaat tcagaaagtg
32821 attgcagagt tgcccactcc tttatcctgt gtctgatggt tttgctgtct ctgtagtgat
32881 tagcttatgt caccatttcc tcattcaata ggcactaggt ggatgaaagg ttctggttca
32941 ctccccaaat acctgcaaca gtcaggagtg tgtcaggcca aaaccagaaa acaggaattg
33001 ccatggggtc tccatgatgg gtggcaggga ctcaagtaca tgagccatat tcggctgctt
33061 ccaggtacat tagcagaaaa ctagatcaga agtggagctg tggggaccag aataaacact
33121 ttgatatggg atgttggtgt ctcaagtagc aacttaaccc cctgctcact aaaacactct
33181 aatcctcatt acctaggagc aactgagcct gagggctatc taatatagct ggtgacacag
33241 agatcatata ccctggctaa aagcatggct gaatccatga aagaaaatat atgctcaaaa
33301 taggaataga atacacagat ttatgcacag atgcttacaa attttagcca atcctgatga
33361 catggttaac ttggagatct agatcagttc ttgccagcat gcccagagaa tagtacatgg
33421 gaaaatttat agagatgatg agttagagac aaagtgagtg ataatgacat tgcctgggat
33481 tgctgctagg tacactgaaa aatcagggag gaagatccaa taaatgaccc attcaaaatc
33541 tagaaaacct gtcaacagga actttggaaa cttatttcta atgtatctga acatcaaggc
33601 agcaataagt ctttctgtaa aatcattaaa tatgcccaaa tgtcaagttc tatgtgagtc
33661 atgaaggtaa cttgataatg ctctacactt catattttgt tcattgttta atacaaaacg
33721 caatttttat tttatttatt taatttttaa ctgtttattt aataaatata aatttccaaa
33781 ttacagctta tagattacaa tggcttcatc ctcataactt gccttgccaa cctgcaaccc
33841 tcccatctcc tgctccctct cccattccat tcacatcaag attcattttc aattatcttt
33901 atatacagaa gatcaattta gtatatatta agtaaagatt ttaacagttt gcacccacac
33961 agaacataaa gtataaatac tgtttgagta ctagttatag cattaattca cattgaacaa
34021 cacattaagg acagagatcc tacatgagga gtaagtgcac agcgactcct gtcgttgact
34081 taacaaattg acattcttgt ttagggggtc agttatctcc ccaggctcct gtcatgagtt
34141 accaaggcta tggaggcctt ttgagttcac tgacttcgat cttatttaga caaggtcata
34201 gtgaaagtgg aagtccactc ctccctttag agaacggtac ctccttcctc aatggcccat
34261 tctttcaact gggatctcgc tcacagagat ctttcattta gctcatttaa ctcctttttt
34321 tttttttttt tctagagcat cttacctttc cattgcctga aatactttca tgggctcttc
34381 agccagatgt gaatgcctta agggctgatt ctgaggccag agtgctgttt aggacatgtg
34441 ccattctatg agtctgatgt gtatcccatt tcccatgttg gaatgttctc tccattttta
34501 attctgtcag ttagtattag cagacactag tcttgtttat gtgatccctc tgactcttat
34561 gcctatcatt acgatcaatt gtgaacagaa attgatcact gggactagtg agatggcatt
34621 ggaacatggc cacctcaatg ggattgaatt cgaatcccct ggtctgtttc taactctacc
34681 atttgaggta agtcagtttg agcatgtccc gaattgcaca tctcttccct ctcttattcc
34741 cactcttata tttaacaggg attacttttc agttaaattt aaacacctaa gaataattgt
34801 gtgttaatta cagagttcaa ccaatagtat taagtagaac aaccaaaaaa tactaaaagg
34861 gataaagtat tacattgtac atcaacagtc aggacaaggg ctgttcaagt cactgtttct
34921 catagtgttc atttcacttt gacaggtttc ctttttggtg ctgggtcagt tgtcactgat
34981 cagggagaac atatgatatt tgtccctttg ggactggctt atttcactca gcatgatgtg
35041 ttccagattc ctccattttg ttgcaaatga ccggatttca ttgttttttt ttgcttctat
35101 atagtattct atagagtaca tgtcccataa tttcttcctc cagtctactg ttgatgggca
35161 tttgggttgg ttccaggtct tagctattgt gaagtgagcc gcaataaaca ttgaggtgca
35221 gacagcttgt ttgtttgcca atttaatttc ctttgggtaa attccaggag cgggatggct
35281 gggttgtatg gtagggttat attcaggttt ctgaggatct ccagactgac ttccataggg
35341 gcttaaccag tttgcattcc caccaacagt gggttagtgt ccctttctcc ccacatcctt
35401 tccagcatct attgttggta gatttctgta tgtgagccat tctaagcggg gtgaggtgaa
35461 acctcattgt ggttttgatt tgcatttccc tgattgctag cgatcttgaa catttcttca
35521 tgtggatgtt ggccatttgg atttcctctt ttcaaaaatg gcaagtgagg tccttggccc
35581 atctcttaag tgggttgttt gttttgatgc tgtggagttt ctttatgtct ttgtggattc
35641 tagctattaa tgctttatct gttgcttagt ttgcaaatat tttttcccat tctgtcagtt
35701 gcctcttcac ttcctgactg cttcttttgc agtacagaac ttctcaattt gatgtaatct
35761 caatagttaa ttttggcttt gactgcctgt gcctccaggg tcttttccaa gaagtctttg
35821 cggtgccaat atcttgcagg gtttctccaa tgttctctaa taacttcatg gtgtcgggtc
35881 atagatttag gtctttaatc catgttgagt ggatttttgt gtaaggtgta aggtaggggt
35941 ctttcttcat gcttcagcac gtggaaatcc cagcaccatt tattgaatag actgtccttg
36001 ctccaggaat tggttttaga ttcctgatca aatataagta ggctcttgta tcccttcaat
36061 ttctttttct tgcctaacag ctctggctaa agcctccaga aatatactga atagcagtgg
36121 tgagaatgga tatccctgta tggtaccaga tctcagtgga aatgcttcca actttttccc
36181 attcaatagg atgctggtcg tgggtttttc ataaattgct ttgattgtat tgaggaacat
36241 tccttctata cccagtttac ttagagtttt caccatgaaa gggtcttgtg ttttattgaa
36301 tgctttctct gcatctattg agataatcat atggtttttc ttctgcagtc tgttaatgtg
36361 gtgtatcaca tttgcaaaca cttgaaccat ccctgcatac cagggttata tcccacttgg
36421 tctgggtgaa tgatctttct gaaatgttgt tgcactccgt tggccagaat tttattgaga
36481 atttttgagt ctatgttcat taggtatatt gttctgtaat tttctttcaa tgctgcatct
36541 ttttccggct taggaattaa ggtgatgctg gattcataga aagattttgg gaggattccc
36601 tctttttcaa ttgttctgaa tagtttgaga agaattgagt tagttcttct ttaaatttct
36661 ggtagaattc agtagtgaat ccatctggtc ctgggctttt ctttgttggg agggccttta
36721 ttactgtttc aatttctgcc tcagttatgg gtttgtttag gctttcgatg tcttcctggt
36781 tcaatgtagg taggttgcag gtgtccagga atctatgcat ttctgataga tttccctgtt
36841 tgctggcata cagtccttgt agtaatttct gatgattctt ttcatttctg tggtgtctgc
36901 tgttacattt cctatttcat ctctgatttt attgatttgg tctcttcttc ttttagttag
36961 ttgagctaat gcggtatcaa ttttgtttat tttttcaaaa aaccagctcc ccatttggct
37021 gatttttggt aatttttttg gattcaatcc tgttgatttc ttctctgatt ttaattattt
37081 ctcttctcct actagatttg ggtctgcttt gctgcagttt ttctagatcc ttgaggtgat
37141 ttgaaagctc atctatttgg tgcctttcca atttcttgat gtaggcacct attgatataa
37201 acttttctct taacactgct ttcgctgcat ctcatacatt ttggtatgtt gtgctgttat
37261 cctcatttac ttccagaaag tttttgattt ctcttttgat ttctttgatg acctagtgtt
37321 cattcaggag catgttgttc actctccatg tgtttgcata tgctgtaggg attcctgagt
37381 tgctaatttc cgacttcatt ctattatggt ctgagaagct gcatcatatg attctaattc
37441 ttttgaatgt gctgagactt gctttatggc ctagtatgtg gttaatctta gagtaggttc
37501 catgtactgc tgagaagaat gtaaattctt taagtgcagg atgaaaagtt ctgtagatat
37561 gtgtcagatc catctgggct atagtatcct ttgaatgtac tgtttcctgt agtcttctgt
37621 cctgtgatct gtctatttct gagagtggag tattgaagtc ccccagtact attgtattgg
37681 agtctaagtc tccctttaac tctcttaaca aatcttttaa ataaaccggt gccctgtaat
37741 taggtgcata tacattgata atcgttatat tttttctgtt gaattcatcc cttaatcatt
37801 atgtagtgcc cctctttgtc tctcttaaca gtttttgtgc taaagtttat tttgtctgat
37861 attaagatgg ttatgcctgc tcttttttca tttctgttgg catggaatat ctttctccag
37921 cctttcacat tcagtctgga tgcatctttg ttggaaagat gtgtttctgt aagcagcaaa
37981 tagatgggtt ttgttcctta acccaatcag ccaatctatg tcttttaact ggacagttca
38041 ggccattaac gttcagtgtg actattgata agtgtaattt tgccctgcca ttttcccaaa
38101 gatatatttt ttttctccaa aattgtattt attttacatt tattcttttt ttttatcttt
38161 tatttaatga atataaattt ccaaagtacg actcatgggt tacaatggct tcccccccat
38221 accgtccctc ccacccacaa ccctcccctt tcccactccc tctccccttc cattcacatc
38281 aagattcatt tttattatct taatatgcag aagatcagct tagtatacct taagtaagga
38341 tttcaacagt ttgctcccac acagaaacat aaagtgaaaa ataatagatg attttttttt
38401 aaatgatgat gaaatcagat gagacctatt gtcatgttta atcccagtga gagtcaagtt
38461 gggaattgat aatttctttt ttttttcttt tctttttttt tttacagaag atcagtttag
38521 tgtacattaa gtaaagattt caatcgtttg cacccccata gaaacacaaa gtgaaatata
38581 ctgtttgagt actcgttata gcattaagct tcagtgtaca gcacattaag gacagagatc
38641 ctacatgagg agtaagtgca cagtgactcc tgttgttgac tttacaaatt gacactcctg
38701 tttatggcat cagtaatctc cctatgcacc agtcatgagt ttccaaggct atggaagccc
38761 cttgagttct ccgactctta tcttgtttag acacggtcat agtcaaagtg gaggttctct
38821 cctcccttca gagaaaggca cctccctctt tgaagacctg ttctttccac tgggatctca
38881 ttcacagaga tctttttgcc agagtgtctt ggctttccat gcctgaaata ctctcatggg
38941 cttttcagcc agatccgagt gcctttaggg ctgattctga ggccagagtg ctatttagga
39001 cactccgcca ttctatgagt ctgctgagta tctcacttcc catgttggat cactctcccc
39061 tttatttatt ctatcggtta gtgttagcag atactagact tgtttatgtg ctccctttga
39121 ctcttagtcc tttcattatg atcaattgtg aactgaaatt gatcacttgg aatagtgaga
39181 tggcattggc acatgccacc ttgatgggat tgaattggaa tcccctggta tgtttccaac
39241 tctaccaatt ggggcaagtc agcctgagca tgtcccaaat tatacatctc ttccctctct
39301 tattgccact cttatgttta acagggatca catttcagtt aattttcaac acttaagaat
39361 aactgtgtga taattacaga attaaaccag tcatattaag tagaacagac aaaaaaaata
39421 ctatgaggga taatgtatta agttgtccat tagcagtcag ggctatgctg atcaagtcac
39481 catttctcat agtgtccatt tcacttcagg aggtttcctt tttggtgttc agtcagttgt
39541 caccgatcag ggagaacata tggtatttgt ccctttggga ctggcttact tcactcagca
39601 tgatgtgttc cagattcctc cattttgttg caaatgactg gatttcgttg tttcttactg
39661 cggtatagta ttctaaagaa tacatatccc ataatttctt tatccagtct accattgatg
39721 ggcatttagg ttggttccag gtgttggcta ttgtgaattg tgctgcaaca aacattaggg
39781 tgcagaccgc ttttttcttt atcaatttaa actcctttgg gtaaattcca aggagtggga
39841 tggctgggtc gaacggtagg gttatattca ggtttctgag gaatctccag actgatttcc
39901 ataggggctt gaccagtttg cattcccacc aacagtgggt tagtgtccct ttttccccac
39961 atcctcacca gcatctgttg ttggtagatt tctgcatgtg aaccattcta accggggtga
40021 ggtgaaacct cattgtggtt ttgatttgca tttccctgat tgctagtgac cttgaacatt
40081 ttttcatgtg cctgttggcc atttggattt cctcttttga aaaatgtcta ttgaggtcct
40141 tggtccatct cttaagtggg ttgtttgttt tgatgttgtg gagtttcttg atttctttgt
40201 agattctggt tattaaccct ttatctgttg catagtttgc aaatattttt tcccattctg
40261 ttggttgtct cttcactctc ctgactgttt cttttgcagt acagaaactt ctcaatttga
40321 tgcaatccca atagttgatt ttggctttga ctgcctgtgc ctcccgggtc ttttccagaa
40381 attctttgcc tgtgccaata tcttgaaggg tttctccaat gttctctaat aacttaatgg
40441 tgtcaggtcg tggatttagg tctttaatcc atgttgagtg gatttttgtg tgagatgtaa
40501 ggtaggggtc ttgcttcatg cttctgcacg tggaaatcca gttttcccaa caccatttat
40561 tgaatagact gtccttgctc caggaattag ttttagatcc ttgatcaaat aaaagttggc
40621 tgtagatgtt tggattgatt tctggtgttt caattctgtt ccattggtct atccatctgt
40681 ttctgtacca gtaccatgct gttttgataa ctactgccct gtagtatgtc ctgaagtctg
40741 gtattgtgat gcctccggct ttgtttttgt tgtacaagat tgctttagct attcgaggtc
40801 tcttgtgcct ccatatgaat ttcagcatca ttttttctag atcatagaag aatgtctttg
40861 gtatcttgat tggtattgca ttgaatctat aaattgcttt tgggagaatg gacattttga
40921 tgatgttgat cttccaatcc atgagcatgg aagatttttc cattttttgg tatcctcttc
40981 tatttctttc tttaaggttt tgtaattttc atcgtagaga tctttaacgt ccttggttaa
41041 gtttattcca aggtatttga ttgtttttgt agctattgtg aatgggattg atcttagcag
41101 ttctttctca gccatggcat tgcttgtgta tacaaaggct gttgattttt gtgcattgat
41161 tttatatcct gccactttgc caaactcctc tatgagttcc aatagtctct tagtagagtt
41221 ctttggatcc tctaagtaca gaatcatatc gtctgcaaag agggatagtt tgacttcttc
41281 cttcttgatt tgtattcctt tgatttcttt ttcttgtctg atggctctgg ctaaaacatc
41341 cagaactatg ttaaatagca gtggtgagag tgggcatccc tgcctggtgc cagattttag
41401 tggaaatgct tccaactttt ccccattcaa taggatgctg gctgtgggtt ttttataaat
41461 tgctttgatt atattgagga atgttccttc tatacccaat ttgcttagag ttttcatcat
41521 gaaagggtgt tgaattttat caaatgcttt ctctgcatca attgagataa ccatatggtt
41581 tttcttctgc agtctgttaa tgtggtgaat cacattgatt gatttgcgaa cattgaacca
41641 tccctgcata ccagggatga atcccacttg gtctgggtgg atgattttcc tgatgtgttg
41701 ttgtattcta ttggccagaa ttttattgag gatttttgca tctatgttca tcagggatat
41761 tggtctgtaa ttttctttca gtcctgcatc tttctctggc ttagggatta aggtgatgct
41821 ggcttcatag aaagaatttg ggaggattcc ctctttttcg attgttctga atagtttgag
41881 aataatggga ttagttcttc tttaaatgtc tggtagaatt cagcagtgaa tccatctggt
41941 cctgggcttt tctttgttgg gagggccttt attactgttt caatttctgt ttcagttatg
42001 ggtctattta ggttttcgat gtcttcatgg ttcaattttg gtagattgca tgtgtccagg
42061 aatctatcca tttctgatag gttttcctgt ttgctggcat acaggtcctt gtagtaattt
42121 ctgatgattc tttttatttc tgtggtgtct gttgttacgt ttactttttc atctctgatt
42181 ttattgattt gggtcttttc tcttcttttt ttagttagtt gggccaatgg ggtgtcaatt
42241 ttgtttattt tttcaaaaaa ccagcttctc gcttggctga ttttttgtaa tgtttttttt
42301 ttttttgatt caatcctgtt aatttcttct ctggttttaa ttatttctct tctcctacta
42361 gatttgggtt tggtttgctg cagtttttct aggtccttga ggtgcactga aagctcattt
42421 atttggtgcc tttccaattt cttgatatag gcacctattg ctataaattt gcctctcaat
42481 actgcttttg ctgtatccca taagttttga tatgttgtgt tgttgtcttc atttacttcc
42541 agaaagtttt tgatttctct tttgatttct tgaatgaccc agtgttcatt caggagcatg
42601 ttgttcagtc tccatgtgtt tgcatacttt cttgggtttc ctgagttgct aatttccagc
42661 ttcatcccac tgtggtctga gaagctgcat ggtatgattc taattctttt aaatttgctg
42721 agacttgctt tatggcctag tatgtgatca atcctagaga aggtcccatg cgctgctgag
42781 aagaatgtga agtctgtaga tgtagggttg aaagttctgt agatatctgt tagatccatt
42841 tgggcaatag tgtcaattaa atctgctgtt tccttgttga tcttctgtcc ggatgatctg
42901 tctatttctg agagtggagt attgaagtcc cccagtacta ttgtattgga atctaagtct
42961 ccctttaagt cccttaacat atcttttaaa tagaccggtg ccctgtaatt aggtgcatat
43021 acatttataa tagttacatc ttcctgttga attgaaccct taatcattat atagtgtccc
43081 tctttgtctc tcttaacagt ttttgtatta aagtttattt tgtctgatat taatatggct
43141 acacctgctt tttttggttt ctgttggcat ggaatatctt tttccaacct ttcactttca
43201 gtctgcatgc ctctttgtta gagagatgtg tttcttgtag gcaacaaata gttgggttgt
43261 gttctgtgag ccagtcatcc aaacggtgtc ttttaactga agaattcaga ccattaatgt
43321 tcaatgtgac aattgatacg tagtgacttt gccctgccat tttcccggaa atattttcta
43381 gtatatgctt tgagcttccc atgctctttt actggtaggt gttcttcctt tcccttcttt
43441 catattgatg gccgtgtttc tgtgtttctg agtgtagcac atctttaagt atcttttgca
43501 gggccggaca agtggccaca aagtctttca atttctgttt gctatgaaag gtctttattt
43561 caccttcatt cacaaatgag agcttggcag gatataatat tctgggctgg caatttttct
43621 ctcttagcac ctgtgctatg tctcgccatt ccctcctagc ttgtagggtt tctgatgaga
43681 agtctgctgt gagtctgatt ggagatcctc tgagagtaat ctgatgtttc tctcttgcac
43741 attttaggat cttttcttta tgtttcactg tggtaagttt aattaccacg tgtcgtgtga
43801 ggatctcttt tggtcatgtt tattgggggt tctatgagct tcctgtacta ggatatctct
43861 gtccttctcc aaacctggaa agttctctgc tagtatctca ctaaaaaggc cttccaatcc
43921 tttctctctc tccatgcctt caggaactcc tagaactcga atgttggttt ttttaatagt
43981 atcctgtaga ttcccaacaa tattttttag atttctaatt tcctcttctt ttctttggtt
44041 tgactgtata ctttcctgtg ctctgtcttc taagtccaat attctgtctt ctgcttcacc
44101 cattctgttt ttaaggcttt ctaatgtgtt tgccatttga tctattgagc tcttcatttc
44161 attttgattt ctcttcacta ttacactttc ctgttctact agtttctgag tttcattttg
44221 actcttcctt aaaatttcat tttcacgaga gagattttca atcttgtcca ttaaggattt
44281 ctgtagttca aggatttgct tttgaaaact tctaaatgtt cttatcataa attttttgaa
44341 atccgtatct tgcatttctt ctatctcatc atcttcatac tcttggcttg gggtgttttg
44401 cttatttgga ggcatcatag tgtcatcgtt gatcttgctc cctctatttc tgtgtttgtt
44461 actcggcata gttaattctt cttgtgtcac tgtgcgtttt ttttttttct tttttttttt
44521 tttttatact gtgtccatgt taagtggact gcctgctgtt ggaggagcct tggaggcttg
44581 agatgggtgc ggcc
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS YSCPROMA 162 bp ds-DNA PLN 13-FEB-1990
DEFINITION S.cerevisiae promoter region, clone pUT340.
ACCESSION M32023
KEYWORDS promoter.
SOURCE S.cerevisiae (strain FL200) DNA, clone pUT340.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 162)
AUTHORS Gatignol,A., Dassain,M. and Tiraby,G.
TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector
based on phleomycin resistance
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.L.Stassi, 08-FEB-1990.
BASE COUNT 46 a 25 c 29 g 62 t
ORIGIN
1 gatctttttt tttttttttt ttttttttgt gctgcaaagt ttcttaaagc cttcgggctt
61 acgaaatcct ttatcaccga aaggggaccg cttcgaaaag tggatataaa acaaggtatt
121 tattttttat agacaatgac caaatgacag gatagatcca tg
//
LOCUS YSCPROMB 270 bp ds-DNA PLN 13-FEB-1990
DEFINITION S.cerevisiae promoter region, clone pUT341.
ACCESSION M32024
KEYWORDS promoter.
SOURCE S.cerevisiae (strain FL200) DNA, clone pUT341.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 270)
AUTHORS Gatignol,A., Dassain,M. and Tiraby,G.
TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector
based on phleomycin resistance
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.L.Stassi, 08-FEB-1990.
BASE COUNT 74 a 47 c 55 g 94 t
ORIGIN
1 gatatatttt cgcggctcgc gttttgtttg cttcttattt tacactgagt tttcgtgccg
61 caaacgtgga gatgggaaaa agaaaagtcg ggaaaataat gagaaatttc tacttttggt
121 attcctcata cagcctgcgc ggtttattag taaaataccc gataatcctc gaggtttgaa
181 aaacttttcc ctctactact gttgacacgg atttttttat ttaagaggaa aagtcgtggt
241 tgttttcctc gaacaaatta gatatccatg
//
LOCUS YSCPROMC 343 bp ds-DNA PLN 13-FEB-1990
DEFINITION S.cerevisiae promoter region, clone pUT342.
ACCESSION M32025
KEYWORDS promoter.
SOURCE S.cerevisiae (strain FL200) DNA, clone pUT342.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 343)
AUTHORS Gatignol,A., Dassain,M. and Tiraby,G.
TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector
based on phleomycin resistance
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.L.Stassi, 08-FEB-1990
BASE COUNT 87 a 81 c 64 g 111 t
ORIGIN
1 ctaaaataca gcaaacttgc attttggtca tttatggtta gcaagtttaa gtgctactac
61 attacatatt acctgaccct gaaaacgggc ttaaaagccc atctactata gtgccgctgc
121 gggtaaccgt agcttcttgc tttttttccc accgtcattt attttctagt cttctttccg
181 agaaatgccg gcggcgtttg ccgttccgtt cgcaatctct cgagaagttc gagagatgaa
241 aaattttctt gggaaagcca agaaggtacg atcgtaggca tgattcatat ataaactctt
301 tcccaatctt tttggttacg tacatactcc cccctgatcc atg
//
LOCUS YSCPROMD 328 bp ds-DNA PLN 13-FEB-1990
DEFINITION S.cerevisiae promoter region, clone pUT343.
ACCESSION M32026
KEYWORDS promoter.
SOURCE S.cerevisiae (strain FL200) DNA, clone pUT343.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 328)
AUTHORS Gatignol,A., Dassain,M. and Tiraby,G.
TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector
based on phleomycin resistance
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.L.Stassi, 08-FEB-1990.
BASE COUNT 95 a 80 c 63 g 90 t
ORIGIN
1 gtcatatata accataacca agtaatacat attcaaaatg ggtgttgaac aaatcttaaa
61 gagaaagacc ggtgtcatcg ttggtgaaga tgtccacaac ttattcactt acgctaagga
121 acacaagttc gctattccag ctattaacgt cacctcttct tctactgccg tcgctgcttt
181 agaagctgct agagacagca agtccccaat cattttgcaa acctctaacg gtggtgctgc
241 ttacttcgct ggtaagggta tctctaacga aggtcaaaat gcttccatca agggtgctat
301 tgccgctgcc cactacatca gatccatg
//
LOCUS YSCPROME 123 bp ds-DNA PLN 13-FEB-1990
DEFINITION S.cerevisiae promoter region, clone pUT344.
ACCESSION M32027
KEYWORDS promoter.
SOURCE S.cerevisiae (strain FL200) DNA, clone pUT344.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 123)
AUTHORS Gatignol,A., Dassain,M. and Tiraby,G.
TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector
based on phleomycin resistance
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.L.Stassi, 08-FEB-1990.
BASE COUNT 41 a 15 c 21 g 46 t
ORIGIN
1 gatcattatt gatagcttac ttctttgagc ggaatatata gagattttgg actcgaagaa
61 tgctaaatat ttcttagagt taatttttgc aaaaatgaga tatcaatcat ttcatgatcc
121 atg
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS MUSACTMEL 1857 bp ss-mRNA ROD 15-MAR-1989
DEFINITION Mouse A-X actin mRNA, complete cds.
ACCESSION J04181
KEYWORDS actin.
SOURCE Mouse (strain C57BL/6) B16 melanoma F-1, cDNA to mRNA, clone pMA30.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1857)
AUTHORS Sadano,H., Taniguchi,S., Kakunaga,T. and Baba,T.
TITLE cDNA cloning and sequence of a new type of actin in mouse B16
melanoma
JOURNAL J. Biol. Chem. 263, 15868-15871 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by H.Sadano, 08-SEP-1988.
FEATURES from to/span description
pept 63 1190 A-X actin
mRNA < 1 1857 actin mRNA
BASE COUNT 410 a 498 c 453 g 496 t
ORIGIN 794 bp upstream of BglII site.
1 cccgcgagca cagcttcttt gcagctcctt cgttgccggt ccactcccgc caccagttcg
61 ccatggatga cgatatcgct gcgctggtcg tcgacaacgg ctccggcatg tgcaaagccg
121 gcttcgcggg cgacgatgct cccctggctg tattcccctc catcgtgggc cgccctaggc
181 accagggtgt gatggtggga atgggtcaga aggactccta tgtgggtgac gaggcccaga
241 gcaagagagg tatcctgacc ctgaagtacc ccattgaaca tggcattgtt accaactggg
301 acgacatgga gaagatctgg caccacacct tctacaatga gctgcgtgtg gcccctgagg
361 agcaccctgt gctgctcacc gaggcccccc tgaaccctaa ggccaaccgt gaaaagatga
421 cccagatcat gtttgagacc ttcaacaccc cagccatgta cgtagccatc caggctgtgc
481 tgtccctgta tgcctctggt cgtaccacag gcattgtgat ggactccgga gacggggtca
541 cccacactgt gcccatctac gagggctatg ctctccctca cgccatcctg cgtctggacc
601 tggctggccg ggacctgaca gactacctca tgaagatcct gaccgagcgt ggctacagct
661 tcaccaccac agctgagagg gaaatcgtgc gtgacatcaa agagaagctg tgctatgttg
721 ctctagactt cgagcaggag atggccactg ccgcatcctc ttcctccctg gagaagagct
781 atgagctgcc tgacggccag gtcatcacta ttggcaacga gcggttccga tgccctgagg
841 ctcttttcca gccttccttc ttgggtatgg aatcctgtgg catccatgaa actacattca
901 attccatcat gaagtgtgac gttgacatcc gtaaagacct ctatgccaac acagtgctgt
961 ctggtggtac caccatgtac ccaggcattg ctgacaggat gcagaaggag attactgctc
1021 tggctcctag caccatgaag atcaagatca ttgctcctcc tgagcgcaag tactctgtgt
1081 ggatcggtgg ctccatcctg gcctcactgt ccaccttcca gcagatgtgg atcagcaagc
1141 aggagtacga tgagtccggc ccctccatcg tgcaccgcaa gtgcttctag gcggactgtt
1201 actgagctgc gttttacacc ctttctttga caaaacctaa cttgcgcagg aaaaaaaaaa
1261 taagagacaa cattggcatg gctttgtttt tttaaatttt ttttaaagtt tttttttttt
1321 ttttttttta agtttttttg ttttgttttg gcgcttttga ctcaggattt aaaaactgga
1381 acggtgaagg cgacagcagt tggttggagc aaacatcccc caaagttcta caaatgtggc
1441 tgaggacttt gtacattgtt ttgttttttt ttttttggtt ttgtcttttt ttaatagtca
1501 ttccaagtat ccatgaaata agtggttaca ggaagtccct caccctccca aaagccaccc
1561 ccactcctaa gaggaggatg gtcgcgtcca tgccctgagt ccaccccggg gaaggtgaca
1621 gcattgcttc tgtgtaaatt atgtactgca aaaatttttt taaatcttcc gccttaatac
1681 ttcatttttg tttttaattt ctgaatggcc caggtctgag gcctcccttt tttttgtccc
1741 cccaacttga tgtatgaagg ctttggtctc cctgggaggg ggttgaggtg ttgaggcagc
1801 cagggctggc ctgtacactg acttgagacc aataaaagtg cacaccttac cttacac
//
LOCUS MUSIGHMM 369 bp ds-DNA ROD 15-DEC-1989
DEFINITION Mouse active Ig mu-chain VJ3-region mRNA, partial cds, from
anti-alpha(1->6)dextran hybridoma 27.7.2.
ACCESSION M24320
KEYWORDS immunoglobulin heavy chain; joining exon; mu immunoglobulin;
variable region.
SOURCE Mouse (BALB/c) anti-alpha(1->6)dextran hybridoma 27.7.2, cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 369)
AUTHORS Sikder,S.K., Borden,P., Gruezo,F., Akolkar,P.N., Bhattacharya,S.B.,
Morrison,S.L. and Kabat,E.A.
TITLE Amino acid substitutions in V-H CDR2 change the idiotype but noT
the antigen-binding of monoclonal antibodies to alpha(1->6)dextrans
JOURNAL J. Immunol. 142, 888-893 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.A.Kabat, 15-MAR-1989.
FEATURES from to/span description
pept < 1 > 369 Ig mu chain VJ3-region precursor (AA at 1)
matp < 1 > 369 Ig mu chain VJ3-region
recomb 321 322 V-region end/J3-region start
BASE COUNT 92 a 85 c 96 g 84 t 12 others
ORIGIN
1 caggttcagc tgcagcagtc tggagctgag ctgatgaagc ctggggcctc agtgaagata
61 tcctgcaagg ctactggcta cacattcagt agctactgga tagagnnnnn ntgggtaaag
121 cagaggcctg gacatggcct tgagtggatt ggagagattt tacctggaag tggtagtact
181 aactacaatg agaagttcaa gggcaaggcc acattcactg cagatacatc ctccaacaca
241 gcctacatgc aactcagcag cctgacatct gaggactctg ccgtctatta ctgtgcaaga
301 cattactacg gtagtagctc ctttnnnnnn gcttactggg gccaagggac tctggtcact
361 gtctctgca
//
LOCUS MUSIGHMN 369 bp ds-DNA ROD 15-DEC-1989
DEFINITION Mouse active Ig mu-chain VJ3-region mRNA, partial cds, from
anti-alpha(1->6)dextran hybridoma 27.4b.2.
ACCESSION M24321
KEYWORDS immunoglobulin heavy chain; joining exon; mu immunoglobulin;
variable region.
SOURCE Mouse (BALB/c) anti-alpha(1->6)dextran hybridoma 27.4b.2, cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 369)
AUTHORS Sikder,S.K., Borden,P., Gruezo,F., Akolkar,P.N., Bhattacharya,S.B.,
Morrison,S.L. and Kabat,E.A.
TITLE Amino acid substitutions in V-H CDR2 change the idiotype but noT
the antigen-binding of monoclonal antibodies to alpha(1->6)dextrans
JOURNAL J. Immunol. 142, 888-893 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.A.Kabat, 15-MAR-1989.
FEATURES from to/span description
pept < 1 > 369 Ig mu chain VJ3-region precursor (AA at 1)
matp < 1 > 369 Ig mu chain VJ3-region
recomb 321 322 V-region end/J3-region start
BASE COUNT 90 a 82 c 97 g 88 t 12 others
ORIGIN
1 caggttcagc tgcagcagtc tggagctgag ctgatgaagc ctggggcctc agtgaagata
61 tcctgcaagg ctactggcta cacattcagt agttactgga tagagnnnnn ntgggtaaaa
121 cagaggcctg gacatggcct tgagtggatt ggagagattt ttcctggaag tggtagttct
181 aactacagtg agaagttcaa gggtaaggcc acattcactg cagatacatc ctccaacaca
241 gcctacatgc aactcagcag cctgacatct gaggactctg ccgtctatta ctgtgcaaga
301 cattactacg gtagtagctc gtttnnnnnn gcttactggg gccaagggac tctggtcact
361 gtctctgca
//
LOCUS MUSIGHMO 369 bp ds-DNA ROD 15-DEC-1989
DEFINITION Mouse active Ig mu-chain VJ3-region mRNA, partial cds, from
anti-alpha(1->6)dextran hybridoma 27.10.2.
ACCESSION M28529 M24322
KEYWORDS immunoglobulin heavy chain; joining exon; mu immunoglobulin;
variable region.
SOURCE Mouse (BALB/c) anti-alpha(1->6)dextran hybridoma 27.10.2, cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 369)
AUTHORS Sikder,S.K., Borden,P., Gruezo,F., Akolkar,P.N., Bhattacharya,S.B.,
Morrison,S.L. and Kabat,E.A.
TITLE Amino acid substitutions in V-H CDR2 change the idiotype but noT
the antigen-binding of monoclonal antibodies to alpha(1->6)dextrans
JOURNAL J. Immunol. 142, 888-893 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.A.Kabat, 15-MAR-1989.
FEATURES from to/span description
pept < 1 > 369 Ig mu chain VJ3-region precursor (AA at 1)
matp < 1 > 369 Ig mu chain VJ3-region
recomb 321 322 V-region end/J3-region start
BASE COUNT 90 a 84 c 98 g 85 t 12 others
ORIGIN
1 caggttcagc tgcagcagtc tggagctgag ctgatgaagc ctggggcctc agtgaagata
61 tcctgcaagg ctactggcta cacattcagt agctactgga tagagnnnnn ntgggtaaag
121 cagaggcctg gacatggcct tgagtggatt ggagagattt ttcctggaag tggtagtact
181 aagtacaatg agaagttcaa gggcaaggcc acattcactg cagatacatc ctccaacaca
241 gcctacatgc agctcagcag cctgacatct gaggactctg ccgtctatta ctgtgcaaga
301 cattactacg gtagtagctc ctttnnnnnn gcttactggg gccaagggac tctggtcact
361 gtctctgca
//
LOCUS MUSIL71 1926 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse interleukin 7 (IL-7) gene, exon 1.
ACCESSION M29054
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 1 of 4
SOURCE Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1926)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept 1897 + 1906 interleukin 7, exon 1
pre-msg 1350 > 1926 IL7 mRNA and introns
IVS 1907 > 1926 IL7 intron A
BASE COUNT 498 a 478 c 430 g 520 t
ORIGIN 156 bp upstream of XbaI site.
1 aatccctggt ttctttctct ttcctctctt tccttttttt tttttttttt tttttttttg
61 gtggacattt ttccatgttc catcgttgaa atgctttctc accacagtag ctctcttcca
121 tcctgtcata ctgcaacgct ggagcctaat cctattctag atagcccttc tctgtggatg
181 ggagctgtct ggagaccagg gctgcagtga agataacaag tactgtcaaa tctcctgatg
241 ctttgagcag gatgacacca agagagacac cagagaggcc tttaaacacc cctgcaggca
301 ccgggtagga tttattttgg aagtgcacta agacaacagg ttttacattg aatgtggaaa
361 gtgagagtcc ctgcagagtc cacaggccag tgcccagtta gaactcctat ggtccacact
421 ctaaatgcac agacagtacc tctctgcctc ttttcactgt taacagacag aatctgaagc
481 tcgtgtcctg atgtccgatt aggaaccgcc agctccagac ccctttctga ggttccccag
541 agcgttaaga gcacacccac caccctaggg cgtcttctta gtctactgta tcctccatta
601 tttacaaggt tctcttccac aagagattcc aagcagactc tttgaagcca gcctgcaggt
661 gttttttatc caaattcagc cagtgaactt agctctgtta cttaaggaag cctctgttcc
721 catgtgtatc tctcaactgc agataagaaa acagattgcc tcggcagaaa actccagatg
781 ccaaaataca tgtaaattcc tgaccaagag tcctggcata gagcaaatac tagatatatg
841 agagatatgg tcctgatctc agatctctct tgttactggt ttgcacaacc tgccttcacc
901 taagcagtgt ttctcctgat tataagaaac aacactaaat caggaatccc ctcggtaatt
961 caggcctatt attaaaacaa aacaaaacac tttaatggaa acaatccaca atgggtggat
1021 tagctatgtc aggagataac tgtgggagat agttactatg cagaacttct atgtgccccc
1081 aactaatacg agttttggtt ttctgaagct accacttaca tactagactg gatggactca
1141 caagaattat ttgttcttaa agtcctacct gagcaggtgc attgcacaga tgacagggag
1201 gaaaaaaaaa aaaaaacaaa ctagagaacc tagaggggca gagggcgtga ccctcttaat
1261 catccttcac ttcctttttt tttttttttt ttttaagatg acttacatcg gcggcagcag
1321 aagcagcgac gcctcctcag tggaactgac ttgcaatagc gagctttctc tgctgcacat
1381 ttgtggcttc tgtggacata ttagtaacca gcggttttag ctcccagtct cacagagttg
1441 ccagagaggt tagaagtcat ttgaaaagcc tattagccga atctttctga tccagaaggc
1501 cagctggctt ctcctgagct actttcaatt cgcagcaacc actgatcctg gtccaggtga
1561 ctggggaaga cgctgagggt ataaacccaa acattgaacc tgaagaccca gcgcaaagta
1621 gaaactgaaa gtaccctgct tactctgccg gcagatccta cggaagttat ggcaaagcca
1681 gagcgcctgg gtggccggtg atgcatgcgg cccctcttgg gatggatgga ccaggcgtgg
1741 cgtgggtgag aggagtcagc tgcctgaact gccctgccca gcaccggttt gcggccaccc
1801 ggtggatgac cggggtcctg ggagtgatta tgggtggtga gagccggctc ctgctgcagt
1861 cccagtcatc atgactacac ccacctcccg cagaccatgt tccatggtaa gcgctgctct
1921 ctggtg
//
LOCUS MUSIL72 174 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse interleukin 7 (IL-7) gene, exon 2.
ACCESSION M29055
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 2 of 4
SOURCE Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 174)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept + 21 + 154 interleukin 7, exon 2
pre-msg < 1 > 174 IL7 mRNA and introns
IVS < 1 20 IL7 intron A
IVS 155 > 174 IL7 intron B
BASE COUNT 44 a 31 c 35 g 64 t
ORIGIN Undetermined number of bp after segment 1.
1 tcattaactg ctgggttcag tttcttttag atatatcttt ggaattcctc cactgatcct
61 tgttctgctg cctgtcacat catctgagtg ccacattaaa gacaaagaag gtaaagcata
121 tgagagtgta ctgatgatca gcatcgatga attggtacgt gttgttttat tttt
//
LOCUS MUSIL73 121 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse interleukin 7 (IL-7) gene, exon 3.
ACCESSION M29056
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 3 of 4
SOURCE Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept + 21 + 101 interleukin 7, exon 3
pre-msg < 1 > 121 IL7 mRNA and introns
IVS < 1 20 IL7 intron B
IVS 102 > 121 IL7 intron C
BASE COUNT 46 a 13 c 20 g 42 t
ORIGIN Undetermined number of bp after segment 2.
1 attgttttat attattacag gacaaaatga caggaactga tagtaattgc ccgaataatg
61 aaccaaactt ttttagaaaa catgtatgtg atgatacaaa ggtaatggta atttttactt
121 t
//
LOCUS MUSIL74 172 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse interleukin 7 (IL-7) gene, exon 4.
ACCESSION M29057
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 4 of 4
SOURCE Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 172)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept + 21 / 152 interleukin 7, exon 4
pre-msg < 1 > 172 IL7 mRNA and introns
IVS < 1 20 IL7 intron C
IVS 153 > 172 IL7 intron D
BASE COUNT 56 a 34 c 32 g 50 t
ORIGIN About 1.5 kb after segment 3.
1 tgactttttt cttctaatag gaagctgctt ttctaaatcg tgctgctcgc aagttgaagc
61 aatttcttaa aatgaatatc agtgaagaat tcaatgtcca cttactaaca gtatcacaag
121 gcacacaaac actggtgaac tgcacaagta aggtgagagt ggactactca ct
//
LOCUS MUSMFG1A 508 bp ss-mRNA ROD 15-MAR-1990
DEFINITION Mouse zinc finger protein (mfg1) mRNA, partial cds.
ACCESSION M28513
KEYWORDS zinc finger protein.
SOURCE Mouse (strain CD-1) adult male skeletal muscle, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 508)
AUTHORS Passananti,C., Felsani,A., Caruso,M. and Amati,P.
TITLE Mouse genes coding for "zinc-finger"-containing proteins:
Characterization and expression in differentiated cells
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9417-9421 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Felsani, 29-SEP-1989.
FEATURES from to/span description
pept < 1 > 508 zinc finger protein mfg1 mRNA (AA at 1; put.)
BASE COUNT 191 a 99 c 97 g 121 t
ORIGIN
1 attcgtacag gggagaaagc cttcaagtgt gaagaatgtg gtaaggcctt taataatcgt
61 tcaaaccttt ctgaacacaa gagaatacat actggagaaa aaccctacaa atgtgaagaa
121 tgtggtaagg cctttcgtat tcgctcaaaa ctttctacac accagagagt tcatacggga
181 gaaaaaccct acaaatgtga agaatgtggt aaggctttta atagtcattc aaacctttct
241 gaacacaaga gaatccatac tggagagaaa ccttacaaat gtgaagaatg tggtaaggct
301 ttcagtactt catcaaacct ttctgaacac aagagaatcc atactggaga aaaaccgatc
361 aaatgtgaag aatgtggcaa ggccttcagt actcgctcaa catattatag acaccagaaa
421 aaccatactg gaaaaaaacc atacaagtgt gaagaatgcg ccaaagaatt ttcctaccct
481 tcattgctta aggtacatca aagaattc
//
LOCUS MUSMFG2A 1218 bp ss-mRNA ROD 15-MAR-1990
DEFINITION Mouse zinc finger protein (mfg2) mRNA, partial cds.
ACCESSION M28514
KEYWORDS zinc finger protein.
SOURCE Mouse (strain CD-1) adult male skeletal muscle, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1218)
AUTHORS Passananti,C., Felsani,A., Caruso,M. and Amati,P.
TITLE Mouse genes coding for "zinc-finger"-containing proteins:
Characterization and expression in differentiated cells
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9417-9421 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Felsani, 29-SEP-1989.
FEATURES from to/span description
pept < 1 > 1218 zinc finger protein mfg2 mRNA (AA at 1; put.)
BASE COUNT 359 a 259 c 282 g 318 t
ORIGIN
1 tatgaatgta aggagtctgg gaaggccttc agactccccc aacagctcac aaggcatcag
61 aagtgtcaca gtggtgagaa gcctttcagt cataatgaag gcagacaggc ttttcaacat
121 cccaacctac tcaagtaccc taaagccatt catacaggtg ccaaagcatt tgcatgcagg
181 gaatgtggga agtccttcaa ccgtgtctcc agccttgttg aacatgggct tattcatgct
241 gatgtgaaac catatgaatg taatgagtgt gggaaagcct ttaaacgaca cagaagtttt
301 gtgcgccatc agaaaatcca ttctggtgag agaccctttc agtgtaagga ttgtgggaag
361 ggcttcattg ttctggctca cctcactcgg caccagagca gtcatagtga agagaagcca
421 tttgaatgtg aggaatgtgg caagaaattt agaactgcca gacaccttgt taagcaccag
481 cgcattcata gcggcgagaa accctttgaa tgcaatgtat gtgggagtgc tttccgcctt
541 caactatacc tttctgagca tcagaaaact cacatggaag agaaatactt ggaatgcaat
601 gtgtgtggga aggcttttcg gcttcaggat atcctttctg agcacttgaa aactcacact
661 gaagagaatc ctttcaagtg taagctctgt gggtcatcct tcccacataa gtaccaactt
721 aataagcatc tcacagttca cactgatggg aaaccctatc agtgcaagga atgtggcaaa
781 tgctttcgtc aaaggtcaaa actaacggaa catgagagta ttcacactgg caagaaaccc
841 ttccagtgtg aagcatgtgg gaagtcttta gctaacacac ttctcattca tcatcagaag
901 tctcacagtg gtgagagacc ttttgaatgt aaagaatgcg gaaaggcttt ccttcttccc
961 agtcaactta atagccataa aattgttcat acaagtaaaa gaccctttga atgcaaggta
1021 tgtgggaagt ccttcaagcg tgaatccaat ctcattcaac atggcgctgt tcatgctggt
1081 gtgaaatcat atgaatgcag tgagtgtggg aaaggtttta ttgaccgctc aagcctgttc
1141 caccatcgga aaattcattc tgatgagaaa ccatttaagt gtcaggagtg tggaaaggcc
1201 ttcgttgttc tggcgtat
//
LOCUS MUSMFG3A 916 bp ss-mRNA ROD 15-MAR-1990
DEFINITION Mouse zinc finger protein (mfg3) mRNA, partial cds.
ACCESSION M28515
KEYWORDS zinc finger protein.
SOURCE Mouse (strain CD-1) adult male skeletal muscle, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 916)
AUTHORS Passananti,C., Felsani,A., Caruso,M. and Amati,P.
TITLE Mouse genes coding for "zinc-finger"-containing proteins:
Characterization and expression in differentiated cells
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9417-9421 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Felsani, 29-SEP-1989.
FEATURES from to/span description
pept < 1 > 916 zinc finger protein mfg3 mRNA (AA at 1; put.)
BASE COUNT 281 a 186 c 200 g 249 t
ORIGIN
1 cttactgtgc tggctcagct tactcggcac gagaacattc acactgaaga caaatcgttt
61 gaatgtaagc agtgtggtaa gatatttagt aatggctctt acctcttacg acactacgat
121 actcacacca atgagaaacc ctttgagtgt aatatatgtg gcaaggcttt taggcttcat
181 ctataccttt ctgagcatca gaaaactcac accgatgaga agcctttcaa gtgtaagctg
241 tgtgagtcag ccttcagacg taagtaccag ctcagtgaac atcagagaat tcacactgat
301 ggtaaaccct atcagtgcaa ggattgttgg gaattctttc gtcgacgatc aaattttatc
361 gaacatcaga gtattcacac tggaaagaaa ccctttgagt gtaaggactg tgggaaggtc
421 tttagactaa atatacatct cattcgacat cagagatttc atagtgatga gaagcccttt
481 gaatgtaaag aatgtggaaa ggcttttcat ttttccagcc agcttaataa ccataaaacc
541 agtcatacag gtcagacacc ttttgaatgt aaagaatgtg ggaagtcgtt taagcgtgtc
601 tccagccttg tggaacatag gattattcac agtggtgtga aaccatataa atgtaatgcg
661 tgtggaagag ccttcaatcg tcgctcaaac ctcatgcaac atgagaaaat tcattctgat
721 gagagaccct ttgagtgtaa ggactgtgga aaggccttca ctgttctggc tcagctcaca
781 cggcaccaga cgattcataa tggaaagaaa tcatatgaat gtgagcagtg tgggtcagcc
841 ttcagactcc cgtaccaact cactcaacat cagagaattc attatgatgt gaagccattt
901 cagtgcaagg atgtgg
//
LOCUS MUSMHSLPC 5362 bp ss-mRNA ROD 15-SEP-1989
DEFINITION Mouse MHC sex-limited protein (Slp) mRNA, complete cds.
ACCESSION M21576 M12385
KEYWORDS antigen; major histocompatibility complex; sex-limited protein.
SOURCE Mouse (strain B10.WR) liver, cDNA to mRNA, clone pSlp20B.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5362)
AUTHORS Ogata,R.T. and Sepich,D.S.
TITLE Murine sex-limited protein: Complete cDNA sequence and comparison
with murine fourth complement component
JOURNAL J. Immunol. 135, 4239-4244 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept 52 5259 sex-limited protein precursor
sigp 52 108 sex-limited protein signal peptide
matp 109 5256 sex-limited protein
mRNA < 1 5362 Slp mRNA
BASE COUNT 1219 a 1494 c 1462 g 1187 t
ORIGIN 39 bp upstream of BamHI site.
1 cgagcagaca gtcagaccag acaggtctga cctttcctgg atcctccagc catgcggctc
61 ctctgggggc tggcctgggt gttcagcttc tgtgcctcat ccctgcagaa gcccaggttg
121 ctcctgtttt ccccttctgt ggttaatttg gggacccccc tgtcggtggg ggtacagctc
181 ctggatgccc ctccaggaca ggaggtaaaa ggatcagtgt tcctcagaaa cccaaagggt
241 ggttcctgct ccccaaagaa ggactttaag ctgagctcgg gagatgactt tgtgctgctc
301 agccttgagg tcccactgga agatgtgagg agctgtggcc tctttgacct gcgcagagcc
361 ccccacatcc agctggtagc tcagtctccg tggctaagga acacagcttt caaagccaca
421 gagactcagg gtgtcaactt gctcttctct tcccgacgag gccacatctt tgtgcagacc
481 gatcagccta tctataatcc agggcagcgg gttcgttatc gggtctttgc actggatcaa
541 aagatgcgcc catccactga tttcctcacc atcacagtgg agaactccca tggcctcagt
601 gtactcaaga aggagatatt tacttccaca cccatcttac aagatacctt catcattcca
661 gacatctcag agcctgggac ctggaagatc tcagctaggt tctcagatgg actggagtcc
721 aataggagca cccactttga agtgaagagg tatgtccttc ccaacttcga ggtgaagatt
781 actccttgga agccatatat cctgatggtg cccagcaaca gtgatgaaat ccaattagac
841 atccaggcca ggtacatcta tgggaagccc gtgcagggcg tggcatacac acggtttgcg
901 ctcatggatg agcaagggaa gaggactttc cttcagggcc tagagaccca ggccaagttg
961 gtggaaggcc ggacccacat ttccatctca aaggaccagt tccaggctgc cctggataaa
1021 atcaatattg gggtcagaga cctggagggg ctgcgtctct atgctgctac agctgtcatt
1081 gagtctccag gaggagagat ggaggaggca gaactcacgt cctggcgctt tgtatcatct
1141 gccttttcct tggatctcag ccgcactaag cggcatctcg tgcctggagc ccacttcctg
1201 ctgcaggcct tggtcagaga aatatcaggc tctgaagcct ctaacgttat tgtcaaagtt
1261 tctgccacat tggtgtcagg ctctgattcc caagtcctca acgttcaaca gagcaccaat
1321 agaattggcc aagtcagcat ttccttcccc atcccaccaa ccgtcacaga acttcggctc
1381 ttggtgtctg cgggctccct ctacccaacg atagccaggc tcaccgtgca atccccacct
1441 tcaagaggca ctggctttct ttctattgag ccactagacc ctcggtcccc tcgtgtgggg
1501 gacaccttta tcctaaactt acaagctgtg ggcatccctg cacctacctt ctctcattac
1561 tactacatga tcatctccag aggccagatc atggctatga gtcgggaggc ccggaggacc
1621 gtgacctctg tctccgtgtt ggtggaccat cagctggctc cttcattcta cttcatggct
1681 tacttctatc accaaggaca cccggtggcc aactctctgc tcatcaacat ccagcccagg
1741 gactgtgagg gcaagctgca attgaaggta gttggtgcca aggagtatca taatggggac
1801 atgatgaagc tccaaattca aactgactcc aaagccctgg tggcactggg agctgtggac
1861 accgctctgt atgctgtggg cggttggtct cacaaacccc tcgacatgag caaggtcttt
1921 gaagtaatca acagctacaa ccttggctgt ggtcctggag gtggggatga tgcccttcag
1981 gtgttccagg atgctggtct ggccttttct gatggtgatc gactaactca aaccagagag
2041 gacctgagct gtcccaagga gaagaaaagt cggcaaaaga gaaatgttaa cttcctgaag
2101 gctcttagtg agaagttggg ccagtattct tctccagatg ccaagcgctg ctgccaagac
2161 gggatgacga agctgcccat gaagcgtacc tgtgagcagc gggctgcccg tgtgcctcag
2221 caggcctgcc gtgagccctt cttgtcctgt tgcaagtttg ctgaggacct tcgcaggaac
2281 cagaccagga gccaggcacc ccttgcccga aaggtccggg acatggtgaa tttgatagag
2341 gaagatgaca ttcttgtgcg cacctccttc ccagagaact ggctctggag agtggaacct
2401 gtagacagct ccaaactgtt gacagtgcgg cttcctgatt ctatgaccac atgggagatt
2461 catggtgtga gcctgtccaa aagcaaaggt ctgtgtgtag ccaagccaac tcgtgttcga
2521 gtgttcagaa aattccacct tcacctgcgc ctgcccatct ccgtccgccg ctttgagcag
2581 cttgaattac ggcctgttct ctacaactat ctgaatgatg acaagaacgt gagtgtccat
2641 gtgaccccag tggagggcct gtgcatggct ggtggtggaa tgatggccca gtgggtgata
2701 gtgcctgcag gctctgccct gcctgtggcc ttctctgtgg tacccacagc ttctaccaac
2761 gtgcccctga aattggtggc taaagggact ttggattcaa gcgattctgt gtctaagatt
2821 ctccaaattg agaaggaagg agccatccac agagaagaga tagtctacaa cctcgacccc
2881 ctaaataacc taggtcagat gttggagata cctggcagct cggatcccaa catagtccct
2941 gaaggagact tcagcacctt tgtcaaggtt acagcctctg aacccttgga gactctgggc
3001 tctgaaggag ctttgtcccc aggaggcgtg gcctcccttc tgaggcttcc ccggagctgc
3061 gcagagcaaa ccatgatcta tttggctcct accctgactg cttccaacta cctggacagg
3121 acaaaacagt ggagcaaact gtcccctgag accaaggacc atgctgtgca tctgatccag
3181 aaaggccacg ttcggatcca gcagtttcgg aagaaagatg gctcctttgg ggcttggtta
3241 caccgggaca gcagcacctg gctgactgcc tttgtgctga agattctgag tttggcccag
3301 gaacaggtgg gcaactcccc ggagaagctg caggagacgg ctagctggct gctggcccag
3361 cagctgggtg atggctcctt ccacgaccca tgtccagtca tccacagagc aatgcagggg
3421 ggcttggtgg ggtccaatga gacggtggca ctgaccgcct ttgtggtcat tgcccttcac
3481 cacgggctga acgtcttccg ggaggggcat gcgaagcagc tgaagaacag agtggaagcc
3541 tccatcacca aggcaaactc tttcttgggg cagaaggcaa gtgctgggct cctgggtgcc
3601 catgccgccg ccatcacagc ctatgccctt acgctgacca aggcctcgga ggacctgcgg
3661 aatgttgccc acaacagcct gatggccatg gctgaggaaa cgggggaaaa cctctactgg
3721 ggcttagtcc ttggctctca ggacaaagtt gtgttgcgcc ccgcagcccc ccgtagccca
3781 acagaacctg tgccccaggc cccagccttg tggatcgaaa ccacagccta tgccctgctc
3841 cacctgcttc tgcgggaggg aaagggaaaa atggctgaca aggctgcatc ctggctcacc
3901 caccagggaa acttccacgg ggcattccgc agtacccagg acactgtggt caccctggat
3961 gccctgtctg cctactggat cgcttcgcac accactgagg agaaagcact gaacgtgacg
4021 ctcagctcca tgggccgcaa tgggctcaaa acccacgtgc tacacttgaa caaccaccaa
4081 gtcaagggcc tggaggagga gctgaagttc tccctgggca gcacaatcag tgtcaaggtg
4141 gaaggaaaca gcaaaggcac cttgaagatc cttcgtacct acaacgtcct ggacatgaag
4201 aacaccacat gccaggacct tcagatagaa gtgaaggtca cagacgctgt ggaatatgca
4261 tggagcgcct acgaagacta tgaagacgac tataacatgc cagccacaga tgatcccagc
4321 gttcccttgc agcctgtcac gcccctgcag ctatttgagg gtcgttggag ccgccgcagg
4381 agggaggccc ccaaggtggc tgaagagcgg gagtccagag ttcactacac tgtgtgtatc
4441 tggcacaatg gcaagctggg gctgtctggc atggccatcg cagacatcac cctcctgagt
4501 ggattccacg ccctgagggc tgacctggag aagctgacct ctctctctga ccgttatgtg
4561 agtcactttg agactgacgg gccccatgtc ctgttgtact ttgactcggt ccctaccacc
4621 cgggagtgtg tgggctttgg agcctcacag gaggtggttg tgggactggt gcagccagcc
4681 agtgctgtcc tgtatgacta ctacagccct gatcacaagt gctctgtgtt ttatgctgca
4741 cccaccaaga gccagctcct ggccacactg tgctctggag atgtatgcca gtgtgccgag
4801 gggaagtgcc ctcgactgct aaggtcactg gagcgaaggg tggaggacaa ggacggctac
4861 cggatgaggt tcgcctgcta ttatcaccaa gtggagtatg gcttcacggt taaggttctt
4921 cgagaagatg gcagagctgc cttccgtctc tttgagtcca agatcaccca agtcctgcat
4981 ttcagaacgg acaccatggc ctccataggt cagacccgca acttcctgag ccgggcctct
5041 tgccgccttc gtttggagcc taacaaagag tacttgatca tggggatgga cggggaaacc
5101 agtgacaaca agggagaccc ccagtacttg ctggactcaa atacctggat tgaggagatg
5161 ccttcagaac aaatgtgcaa gagcacccgc catcgggcag cctgtttcca gctcaaagat
5221 ttcctgatgg agttcagcag ccgggggtgc caggtgtgag gccttaggac tctggctctc
5281 tgagctcagc tcagggtcag ggcctcgctg gatgaggggc tctgctctac agggtaaata
5341 aaagaaaagc tttttgacag cc
//
LOCUS MUSNCAML1 3783 bp ss-mRNA ROD 15-MAR-1990
DEFINITION Mouse mRNA for neural cell adhesion molecule L1 (N-CAM L1).
ACCESSION X12875
KEYWORDS glycoprotein; neural cell adhesion molecule; transmembrane protein.
SOURCE house mouse (Mus musculus).
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3783)
AUTHORS Moos,M., Tacke,R., Scherer,H., Teplow,D., Frueh,K. and Schachner,M.
TITLE Neural adhesion molecule L1 as a member of the immunoglobulin
superfamily with binding domains similar to fibronectin
JOURNAL Nature 334, 701-703 (1988)
STANDARD simple automatic
COMMENT *source: developmental stage=8-day old; tissue=brain; *source:
library=lambda gt10 and lambda gt11.
FEATURES from to/span description
pept 1 3783 N-CAM L1 (AA 1 - 1260)
BASE COUNT 982 a 1038 c 997 g 766 t
ORIGIN
1 atggtcgtga tgctgcggta cgtgtggcct ctcctcctct gcagcccctg cctgctcata
61 cagattccag acgaatataa aggacaccat gtgctagagc cacctgtcat cacggaacag
121 tctccacggc gcctggttgt cttcccaaca gatgacataa gcctgaaatg tgaagccaga
181 ggcagacccc aagtggagtt ccgctggacg aaagatggca tccacttcaa acccaaggaa
241 gaattgggtg tagtggtgca tgaggcaccc tattctggct ccttcaccat cgaaggcaac
301 aacagctttg cccagaggtt tcagggcatc tatcgctgct atgccagcaa taagctagga
361 actgccatgt cgcatgagat ccagctcgtg gccgagggtg cccccaagtg gccgaaggag
421 actgtaaaac ctgtggaagt ggaggaagga gaatcagtag ttctgccttg caaccctcca
481 cccagtgcag ccccacctag gatctactgg atgaacagca agattttcga catcaaacaa
541 gatgagcggg tgtccatggg ccagaatgga gacctatatt ttgccaatgt gcttacctca
601 gacaatcatt cagactacat ctgcaatgcc cacttccctg gtacccggac catcattcaa
661 aaggaaccta ttgacctccg ggtcaagccc accaacagca tgattgaccg gaagccacgt
721 ctgctctttc ccacaaactc cagcagccgc ctggtagcct tgcagggcca gtcattgatc
781 ctggagtgca ttgctgaggg attccctaca cccaccatca agtggctgca ccccagtgac
841 ccaatgccaa cagaccgtgt tatctaccaa aaccacaaca agaccctgca actactcaat
901 gtgggcgaag aggacgatgg cgagtatacc tgccttgctg agaactcgct gggcagtgcc
961 cggcatgcct actatgttac tgtggaagct gccccatatt ggctgcagaa gccccagagc
1021 catttgtatg gtccaggaga gactgcccgc ctagactgcc aagtccaggg caggccccaa
1081 ccagagatca cttggagaat caacggaatg tctatggaga cggtgaacaa ggaccagaag
1141 taccggattg agcaggggtc tctgatcttg agtaacgtgc agccaactga cacaatggtg
1201 acccagtgtg aagcccgcaa ccagcatggg ctcctgctag ccaatgccta catttatgtt
1261 gtccagctgc cagccaggat cctaacaaaa gacaatcaga catacatggc agttgagggc
1321 agtactgctt acttgctgtg caaagccttt ggagctcctg ttcccagtgt ccagtggctg
1381 gatgaagaag gaaccacagt gcttcaggat gaacgatttt tcccctatgc caatggaacg
1441 ctgagcatca gagacctcca ggccaatgac actggacgct atttctgcca ggctgccaat
1501 gaccagaaca atgtgaccat tttggctaac ctacaggtta aagaagcaac ccagatcaca
1561 caggggcccc ggagcgcaat tgagaagaaa ggtgcaaggg tgacattcac gtgccaggcc
1621 tcctttgacc cctctttgca ggccagcatc acttggcgtg gagatgggag agacctacag
1681 gaacgtgggg acagtgacaa gtatttcata gaagatggga aactagtcat ccagagcctg
1741 gactacagtg accagggcaa ctacagttgt gtggccagca ctgaactgga tgaggtggag
1801 agcagggcac agctcttagt ggtggggagc cctgggccag tgcctcacct ggagctgtcc
1861 gaccgccacc tgctgaagca gagccaggtg cacttgtctt ggagccctgc tgaagaccac
1921 aactctccca ttgagaagta tgacattgaa tttgaggaca aggaaatggc tcctgagaaa
1981 tggttcagtc tgggcaaggt gccaggaaat cagacctcta ctaccctcaa gctgtccccc
2041 tatgtccact acacctttcg ggtcactgcc attaacaaat atggtcctgg agaacccagc
2101 cctgtctctg agagtgtggt cacacctgag gcagccccag agaagaaccc tgtggatgtg
2161 agaggggaag ggaatgagac caacaatatg gtcatcacat ggaagcccct tcggtggatg
2221 gattggaatg ccccccagat tcagtaccgt gtacagtggc gtccacaggg caagcaggag
2281 acctggagga aacagaccgt gagcgaccct ttcctggtgg tgtctaacac ttccacattt
2341 gtgccttatg agatcaaagt ccaggcagtg aacaaccagg gcaagggccc tgagccccag
2401 gtcaccattg gctattcagg ggaagactac ccccaggtga gccctgaact tgaagacatc
2461 acaatcttca actcaagtac tgtgcttgtc aggtggaggc ctgtggactt ggcccaggtt
2521 aagggccacc tcaagggata caatgtaaca tactggtgga agggcagcca gagaaagcac
2581 agcaagaggc atatccacaa aagccacata gtggtacctg caaataccac cagtgccatc
2641 ctcagtggtt tgcgccctta cagctcttac catgtggagg tacaggcctt taatgggcgg
2701 ggcttggggc ctgcgagtga atggaccttc agcaccccag agggagtgcc tggccaccct
2761 gaggcattac acctggagtg tcagtcggac actagtctgc tactgcactg gcagccacca
2821 ctcagccaca atggagtgct cactggctac ctgctctctt accatcccgt ggaaggggaa
2881 agcaaagagc agttgttctt caacctttcg gacccagaac tccggactca taatctgacc
2941 aacctcaacc ctgatctaca gtaccgcttc cagcttcagg ccaccaccca acaggggggt
3001 cctggtgagg ccatcgtgcg tgaaggaggc accatggccc tgtttggcaa gccagatttt
3061 ggcaacatct cagccacagc aggtgaaaac tacagcgtgg tctcctgggt ccctcggaag
3121 ggccagtgca atttcaggtt ccatatcttg ttcaaagcct taccagaagg gaaagtgagc
3181 cctgatcacc agcctcagcc tcagtatgtc agctacaatc agagctccta cacacaatgg
3241 aacctacagc ctgacaccaa atatgagatc cacctgataa aggagaaggt cctcctgcac
3301 catctggatg tgaagactaa tggaactggc cctgtgcgag tttctactac agggagcttt
3361 gcctccgagg gctggttcat cgcctttgtc agcgctatca ttctcttgct cctcatcctg
3421 ctcatcctct gcttcatcaa acgcagcaag ggtggcaaat actcagtgaa ggacaaggag
3481 gacactcagg tagattccga ggcccggccc atgaaagacg agaccttcgg cgagtacagg
3541 tccctggaga gtgacaatga agagaaggcc tttggcagca gccagccatc tctcaacgga
3601 gacatcaaac ccctaggcag tgatgacagc ctggctgatt atgggggcag tgtggacgtc
3661 cagttcaatg aggatggctc tttcatcggc cagtacagtg gcaagaaaga gaaggaggca
3721 gcaggaggca atgacagttc aggggctacc tctcctatca atcctgcagt agccctagaa
3781 tag
//
LOCUS MUSPBGD1 2663 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 1.
ACCESSION M29949 J04981
KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT 1 of 4
SOURCE Mouse (C3H) DNA, clone PBGD.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2663)
AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
TITLE The mouse porphobilinogen deaminase gene
JOURNAL J. Biol. Chem. (1989) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Grandchamp, 06-OCT-1989.
FEATURES from to/span description
pept 505 537 porphobilinogen deaminase (housekeeping) exon
1 (EC 4.3.1.8)
2191 2307 porphobilinogen deaminase (housekeeping) exon
2 (EC 4.3.1.8)
2457 + 2510 porphobilinogen deaminase (housekeeping) exon
3 (EC 4.3.1.8)
pep$ 2475 + 2510 porphobilinogen deaminase (erythroid sp.) exon
1
pre-msg 341 > 2663 PGB deaminase (hk) mRNA and introns
IVS 538 2190 PGB deaminase (hk) intron A (no splice
consensus)
IVS 2308 2456 PGB deaminase (hk) intron B
IVS 2511 > 2663 PGB deaminase (hk) intron C
pre-msg 2192 > 2663 PGB deaminase (ery.sp.) mRNA and introns
IVS 2511 > 2663 PGB deaminase (ery.sp.) intron A
binding 1126 1133 NPE binding site
binding 312 317 Sp1 binding site
binding 329 334 Sp1 binding site
site 2101 2107 CACCC box
site 2118 2124 CACCC box
BASE COUNT 622 a 661 c 707 g 673 t
ORIGIN Chromosome 9
1 ccacccccac cccacacaca cacacaaagt aaatagggct ggagagctta gtggttaaga
61 gcactgactg ctctttcaga ggtcctgagt tcaattccca gaaaccacat ggtgctcaca
121 accatctgca atagggtctg atgccctttt ctggtgtgtc taaagaagag agcaatggtg
181 tactcatata cataaaataa ttttttttaa aaagtaaaag ataataaaaa ttgaaaagga
241 aaaaaatctt tttgagttgt tctgtgcagt ggacttgagc gaaaaggctg gctatgtcgc
301 aatcctaatt cccgcccaga ggaaggcacc gccccgttga gggagggcag cggacgtgac
361 gcagagctca gcaggtcctg cagccggagt gaagtgcggg ctcgggcccc atgtgccttc
421 agtcccggcc ggcccaggtc gtcggcttct gcagacacca ggggaccgca gcggcactgc
481 cgcgcctgcg ccctgggcgg agtcatgtcc ggtaacggcg gcgcggccac aaccgcggtg
541 agttctgagc cggtgaccga tgacccgcac ttctcggggc tttctgggtg caacgattgg
601 ccccgggttg ccatgttctc gtcgtctatt ggtcggaata gttagctgtc atttttcccc
661 ccccacacct caaggttttt tttaaagggc cagtaactag gttgccctaa ggcagggaag
721 gagtgatctc gagcagtggg ggcggggttg tgagtggaaa ggtggtccgc cctgggattc
781 catccctgta ggctctggct ggatctctgt tgttcccgac cagtaaagga ttatgcacag
841 acaagatcct tttcacgaag aaggggctga ggcaaatcca gctatctcgg aatacgatcc
901 acttcattca ggggagagca caccccactt cttaaaactg tatacaaaca tcttggaggt
961 tacacgcctt ctcccgttct ccgttatgaa gtcacccagc cttagccacc cacaaaagtc
1021 ctagtagaga cacacctgaa ttgctattgt gagcggggga acccacccct gggccttgtc
1081 atttctggcc tgcctggaaa gttctgaact tgtgggcagg ctgcctgaga taaggctgag
1141 ctgggaagct tgcttatctc ctgcccaggc agtaagcagt agtcttggct atgaaaacat
1201 ttttagagca ctgggttagg gtaggaaggc ctggatttca gcacccactt tctgtctgtt
1261 catagctgtg agatgtttag acagtaattt gaccactctg catctttgct tctgtgacac
1321 gggtggaagt acctaccctg tctaacctag tagggttgtt gcaaggacaa tatgcagaca
1381 ctgctcaaat gctgttctgg gtcaatcaat taaaaaacaa attgtttgaa cttagcaatt
1441 cctttctatg ggctccctgt tgtccgaaat ttctgtgtta tttcaagccc agctaaattg
1501 caaaggctat ctcagagtcg tttgttggag gaatcttcgc agtggagtag actggagtcc
1561 aagagcaagt tttcaccttc agtgaccaag aacttgagtg tctggttata gaagaacctg
1621 tgagatgagg aacctggtgc agggaagggg gacaatctgt acagtgactc ctgtcccctt
1681 tgtatcagac tgcagaaccc agttctacct gcttggccct agacaccttt atccaaggcg
1741 ccttaacaaa agaaagaggt gtgtcctttt gagctcttgg ctctggctta agacaccaga
1801 ggaaacccgt aggcaatgac tgttaggcag tttattcttg tagtcttctg ggacttcttg
1861 aggcatgagg tggcctttaa tttaacaagc ccttgatggg atgatgttcc caaagtcacc
1921 caccaagggc atgaaagggc tgtacattag cttggttgat ttcagtcctt gttaggagta
1981 catcctggtg tctcacccag ggcttagtga ggccttctca agtgcctgag ttgttgtgga
2041 cagtgagctt gttctctagc aatgggaggc ttcagctgtc ctgccccagc ttctgtaggc
2101 cccaccctcc agcagggccc accctcactg tgccgaggct gatgggcctt atcattttgc
2161 ccacctggct gtgtgcagcc ctcccactca gaacctcctt ggccaggctg ggctttgggg
2221 ctcagtgtcc tgttgctgct gccacaacag atcctattac agcttttctt ctggtcttgc
2281 ttctctggat cccgtagagg gcagaaggta ccaaggaaga ttcaaggacc agtcctggga
2341 gtctctcctt cctagcagcc tcacctgcct aggacccggg agtcctctct cctaagcctg
2401 tgatcctagt tctttgaatg aggaaaagat cgtaacctag ggactttctt ctgcaggaag
2461 aaaacggctc aaagatgagg gtgattcgag tgggcacccg taagagccag gtgagtacag
2521 acatagcgcg ttgcctcaag aattgtaatg ctcacgggtc actagtggga accaaaggct
2581 agcatcgagc aaataagagt gtgtgagagt cgatttcatg ggggatggca gctcacttcc
2641 tctgaaaaga gagtctctgg agc
//
LOCUS MUSPBGD2 2761 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 2.
ACCESSION M29950 J04981
KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT 2 of 4
SOURCE Mouse (C3H) DNA, clone PBGD.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2761)
AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
TITLE The mouse porphobilinogen deaminase gene
JOURNAL J. Biol. Chem. (1989) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Grandchamp, 06-OCT-1989.
FEATURES from to/span description
pept + 50 122 porphobilinogen deaminase (housekeeping), exon
4 (EC 4.3.1.8)
485 534 porphobilinogen deaminase (housekeeping), exon
5 (EC 4.3.1.8)
617 672 porphobilinogen deaminase (housekeeping), exon
6 (EC 4.3.1.8)
981 1058 porphobilinogen deaminase (housekeeping), exon
7 (EC 4.3.1.8)
1969 2046 porphobilinogen deaminase (housekeeping), exon
8 (EC 4.3.1.8)
2141 + 2216 porphobilinogen deaminase (housekeeping), exon
9 (EC 4.3.1.8)
pep$ + 50 122 porphobilinogen deaminase (erythroid sp.) exon
2
485 534 porphobilinogen deaminase (erythroid sp.) exon
3
617 672 porphobilinogen deaminase (erythroid sp.) exon
4
981 1058 porphobilinogen deaminase (erythroid sp.) exon
5
1969 2046 porphobilinogen deaminase (erythroid sp.) exon
6
2141 + 2216 porphobilinogen deaminase (erythroid sp.) exon
7
pre-msg < 1 > 2761 PGB deaminase (hk and ery.sp.) mRNA and
introns
IVS < 1 49 PGB deaminase (hk) intron C; ery.sp. intron A
IVS 123 484 PGB deaminase (hk) intron D; ery.sp. intron B
IVS 535 616 PGB deaminase (hk) intron E; ery.sp. intron C
IVS 673 980 PGB deaminase (hk) intron F; ery.sp. intron D
IVS 1059 1968 PGB deaminase (hk) intron G; ery.sp. intron E
IVS 2047 2140 PGB deaminase (hk) intron H; ery.sp. intron F
IVS 2217 > 2761 PGB deaminase (hk) intron I; ery.sp. intron G
BASE COUNT 703 a 605 c 725 g 728 t
ORIGIN Chromosome 9; approximately 147 bp upstream of segment 1.
1 aaggcagtgg ccaggggagg tgagaaacca tctgactctc tttccccagc tggctcgcat
61 acagaccgag actgtggtgg cgatgctgaa agccttgtac cctggcatac agtttgaaat
121 cagtaagttt tcttgagagg agtgattggt agtgaacggg aagccagtga accggaggac
181 agggcatctc tcgtttgcct gtggtcaaag cctgccttgt aagactattc tggctgcttg
241 tgaagggaaa gaaagattgt ctcctgtgca catctcctcc agctgcccgg gctagcctga
301 catttccata ctttctgctt tgggttcttt tatgagtatg tctgcttttt ctgtcggtgt
361 gtgtatctga gagagttagg ggctgggtct tctatgcctc agactccact gtgaatccag
421 tcaaggcctg aacgaggggt gactcagtag gtgttaatgg gtatctgatt gactctctcc
481 tcagttgcta tgtccaccac gggagacaag attgttgata ctgcactctc taaggtaacg
541 ccagtccttg tcccattctt cttgtccctc tcccacgtgt aaggggttca ctctgaggct
601 ctctcttgcc tggcagattg gagagaagag cctgtttacc aaggagctag aaaacgccct
661 ggaaaaaaac gagtgagtga ggatggagga atgtggtacc ccgagcctag aaccccaaag
721 tggctctcca atattggcag gattgtcggg ttagactgtg gagctcacag gctttcacag
781 agaagagagc cttgccttgg agtagcctaa ctacctgggg aatcagactg ccgggggaaa
841 ggggtagagt agttgagaag agaccaggtc ttagatctta agatgctatc ttcctgaacg
901 gtcaaggatg ctggggtggg tggtggagat aaggtcacct actcaaagcc tctctctgtg
961 cctccccctg ccgtctccag agtggacctg gttgttcact ccctgaagga tgtgcctacc
1021 atactacctc ctggctttac tattggagcc atctgcaagt aagcggggag gacatgcatg
1081 ggacggaggg ccctgggcag gattaatcct actgtgggaa tctttgagtt tttttttttt
1141 ttttttccat ttggaactta accgcttagc cgtctgtttt gaaggttctc agacatagtg
1201 tggcaggaaa gccaattggt tgacttggtt gactatttag agtttgtgga gttgggctca
1261 gtggcacgga cctgaaatcc cagctactgg gaggctaaga caggatcaga gattctgggc
1321 cagcctgggc tacagaggga tttgaaccag cctgaggaac ttagattgtg ccttaggggc
1381 acagaaggct ggcttacagt ggcttaggtg gtaaaggcat tttttgctgt caagccaatg
1441 acctgagttc agtccgtggg gtgcacttgg tgaaagaaga gggttgaatc ccacaagttg
1501 tcatctgact catgcataca tgctgtagaa tgtttatgct cctcatccct caatgaaaat
1561 ggaaacaatc aaggaaatga aatataaaac ctgctgggtg gtggtgcgca cgcctataat
1621 cccagcactt gggaggcaga ggcaggtgaa ttcaacctgg tctacaaagt gagttccagg
1681 actatacaga gaaacccagt cttaaaaaca aaacaaaact aaacaacaac aacaacaaca
1741 acaaaaaaga aaaaacaaag aaagaaatat aaaacctttc caaagaaaat aaaatgaatt
1801 tggcctggtg gctcatgcta taatctcagc attcagagag ctgaggcagg agggttattg
1861 tgagttaaag gctagctggg gtacagagaa aattttaggt cacctgggct agagttaacc
1921 ctatctccaa atgctaatac ctttatttca tcatcatttg ctttgcagac ggcaaaaccc
1981 ttgtgatgct gttgtctttc acccaaagtt tattggaaag accctggaaa ccttgccaga
2041 gaaaaggtga gtgggcctag tgtgcggggg agagaggcct ggacagtgga gaacagttgg
2101 cagcctgggt taagtttaat tctaaactct ctctgagcag tgccgtggga accagctctc
2161 tgaggagagt ggctcagcta cagagaaagt tccccaacct ggaattcaag agtattgtat
2221 cctttcagaa gaaggagggg aaaaagaggg aaagaaggac cttccgaagc aagtggtcca
2281 tgcggtcagg gggtcgtctt tccatctgtc cgtccaccca cccacccacc catccatcca
2341 tccatccaca catccacagt cctttaatgt tttgcttttt tttttttttc ctgagacagg
2401 gtttctctgt gtagcctggc tgtcctggaa ctcactttgt agaccaagct ggcctgcaaa
2461 gtgagaaatc cgcctgcctc tgcctcctga gtgctgggat taaaggtgcg ccaccactgc
2521 ctggcaccct aatgtttttt aaactcaggc ctggcaatga ggacaatttg caaaacaaac
2581 atggttcctt gttctataca gctgacatgt tagacagaca ggcaggcact gcagatactg
2641 accggtgacc actcctggtg cagggacaga ggcgcttctg cttttacttt ctgtgctgct
2701 aagtggtttt ggtttttaca gtgaatatgt gatatgtttc ataaaagtaa tttttttttc
2761 t
//
LOCUS MUSPBGD3 800 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 5.
ACCESSION M29951 J04981
KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT 3 of 4
SOURCE Mouse (C3H) DNA, clone PBGD.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 800)
AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
TITLE The mouse porphobilinogen deaminase gene
JOURNAL J. Biol. Chem. (1989) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Grandchamp, 06-OCT-1989.
FEATURES from to/span description
pept + 402 515 porphobilinogen deaminase (housekeeping), exon
10 (EC 4.3.1.8)
739 + 777 porphobilinogen deaminase (housekeeping), exon
11 (EC 4.3.1.8)
pep$ + 402 515 porphobilinogen deaminase (erythroid sp.) exon
8
739 + 777 porphobilinogen deaminase (erythroid sp.) exon
9
pre-msg < 1 > 790 PGB deaminase (hk and ery.sp.) mRNA and
introns
IVS < 1 401 PGB deaminase (hk) intron I; ery.sp. intron G
IVS 516 738 PGB deaminase (hk) intron J; ery.sp. intron H
IVS 778 > 790 PGB deaminase (hk) intron K; ery.sp. intron I
BASE COUNT 181 a 201 c 182 g 236 t
ORIGIN Chromosome 9; approximately 477 bp upstream of segment 2.
1 ctgtacccca gctagccttt aactcacaat aaccctcctg cctcagctct ctgaatgctg
61 agattatagc catgagccac caggccaaat tcattttata tttctttctt tcttttttct
121 tttttgttgt tgttgttgtt gttgtttagt tttgttttgt ttttaagact gggaaactct
181 gtatagtcct ggaactcact ttgtagacca gatttagcct tgaattcatg gagatctgta
241 tctgcctcca gtgctgggat ttaaaggtgt atacaccacc actcaacaaa aacacaacaa
301 aaacaaaagt tttttaaaag ttagctagag gggggaaaag agactgtggg gcagagggtg
361 cactgggtag gtcttgactt ctccttagca acgctccaca gcggggaaac ctcaacaccc
421 gccttcggaa gctggatgag ctgcaggaat tcagtgccat tgtcctggct gtggctggcc
481 tacagcgcat gggctggcag aaccgggtgg gccaggtagg agctgccctg ttctgcttcc
541 cattgaatct gcctctctcc tgccttgatt tcttggtgac cattctgcca acaacactac
601 aaccagaagc ccaggctagg gatattggga ctcattgctg gatttcctac ctgtgccttc
661 cccaggcttc ctagattgca aaccctagct cactgccttt gaacatcccc tatcccacca
721 tcttgtctct ctccacagat tttgcaccca gaggaatgca tgtatgctgt gggtcaggta
781 ggtaggtttg cctggagaga
//
LOCUS MUSPBGD4 1386 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 4.
ACCESSION M29952 J04981
KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT 4 of 4
SOURCE Mouse (C3H) DNA, clone PBGD.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1386)
AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
TITLE The mouse porphobilinogen deaminase gene
JOURNAL J. Biol. Chem. (1989) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Grandchamp, 06-OCT-1989.
FEATURES from to/span description
pept + 154 273 porphobilinogen deaminase (housekeeping), exon
12 (EC 4.3.1.8)
459 512 porphobilinogen deaminase (housekeeping), exon
13 (EC 4.3.1.8)
618 704 porphobilinogen deaminase (housekeeping), exon
14 (EC 4.3.1.8)
783 956 porphobilinogen deaminase (housekeeping), exon
15 (EC 4.3.1.8)
pep$ + 154 273 porphobilinogen deaminase (erythroid sp.) exon
10
459 512 porphobilinogen deaminase (erythroid sp.) exon
11
618 704 porphobilinogen deaminase (erythroid sp.) exon
12
783 956 porphobilinogen deaminase (erythroid sp.) exon
13
pre-msg < 1 1279 PGB deaminase (hk and ery.sp.) mRNA and
introns
IVS < 1 153 PGB deaminase (hk) intron K; ery.sp. intron I
IVS 274 458 PGB deaminase (hk) intron L; ery.sp. intron J
IVS 513 617 PGB deaminase (hk) intron M; ery.sp. intron K
IVS 705 782 PGB deaminase (hk) intron N; ery.sp. intron L
BASE COUNT 372 a 305 c 343 g 366 t
ORIGIN Chromosome 9; approximately 77 bp upstream of segment 3.
1 atcagtagtt cctgaaacct gttcatacct tgcacctcta tccatcaata atgttaaaga
61 caggtttgtt gttatgcata acccaggaag cagtagaggt gtgtttctca tcttagctct
121 attactagag aagaacagcc tgttgttctt tagggggccc tagccgtgga agtccgagcc
181 aaggaccagg atatcttgga cctagtgagt gtgttgcacg atcctgaaac tctgcttcgc
241 tgcattgctg aaagggcttt tctgaggcac ctggtaagat gggctcctcc catggtgttg
301 tggggaaacc aggaagggca gtagggaggg agatttgtca agtactcagt atgtaatgtt
361 ttgtatgtat ggagaggacc ttgatctggc ctcttgaggt ctgtggtcaa aagtggtgtt
421 aaaggccctt agagctcaaa ggaacaatat cattgcagga aggaggctgc agcgtgcccg
481 tagcagtgca tacagtgata aaggatgggc aagtaagcca gggaaatgga tgaggggagg
541 gactgtcatt tccatgtgca cccaaacatc taagtaactt tctttaaaca tcctggtaca
601 aacattttat ttcctagctg tacctgactg gtggagtatg gagtctagat ggctcagata
661 gcatgcaaga gactatgcag gccaccatcc aggtccctgt tcaggtattg actgggagat
721 gaggaggaat aaatagaact cttgtaatct tcctcttacc aaaattgtaa cctgtcatcc
781 agcaagaaga tggtccagaa gatgacccac aactggttgg aatcactgcc cggaacattc
841 caagaggagc ccagctagct gctgagaacc tgggcatcag cctggccagc ttgctgctca
901 acaaaggagc caagaacatc ctggatgttg cacggcagct taatgatgtg cgctaactgg
961 tctgtagggc acaggaaccc tggctgccac tccagtgcct acttctggct tccaagtgcc
1021 ctgtgctcca tccctagggg tgtgattatc ccaggaaatt gaaccacagg gttgttgaga
1081 cttccacttt ggaagatatg cctcaccttg gggcctccat atctgccttt ccctcagtag
1141 ttgggggctt catctcttta gagaaagtcc atgccaatct ttgaatgtaa ccaataccac
1201 taataaacca gtttagaatg tggttcttct gatagagttg gggaagatat gaataaaccc
1261 aaagcccttt taaacttgaa tgagtctgag acctttctgt tgtaaaacac gctgtgattt
1321 gcctcatgtt ctcaaaaaaa aaaaaaaaaa tcagccttta attcctacag cctgtcttca
1381 gtcgac
//
LOCUS MUSPHKGB 1012 bp ss-mRNA ROD 15-MAR-1990
DEFINITION Mouse skeletal muscle phosphorylase kinase alpha subunit mRNA,
partial cds.
ACCESSION M28867
KEYWORDS phosphorylase kinase; protein kinase.
SOURCE Mouse (strain ICR Swiss White, adult) mixed fiber skeletal muscle
from hind limb, cDNA to mRNA, clone alpha-1.0Kb.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1012)
AUTHORS Bender,P.K. and Lalley,P.A.
TITLE The I/Lyn mouse phosphorylase kinase deficiency: Mutation disrupts
expression of the alpha/alpha' subunit mRNAs
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9996-10000 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Bender, 05-OCT-1989.
FEATURES from to/span description
pept < 1 > 1012 phosphorylase kinase alpha subunit (AA at 1)
(EC 2.7.1.3.8)
BASE COUNT 280 a 236 c 229 g 267 t
ORIGIN Chromosome X.
1 aattcactac tgcccagggc ttcaacatcc aaagaagttg atgccagtct gctctcagtg
61 gtctctttcc cagcctttgc tgtagaggac agccatttgg tggagctcac caaacaggag
121 atcatcacca agcttcaggg tcgttatggt tgctgtcgtt ttctgcgaga tggatataaa
181 actcctaaag aggatcccca acgcctatac tataacccag ctgagctgaa gctatttgaa
241 aacattgagt gcgaatggcc attgttctgg acatacttta tccttgatgg gatcttcagt
301 ggcaacgtag aacaggttca agaatataga gaggctcttg atgcagtcct catcaagggc
361 aaaaatggag tccctcttct tccagagctg tacagtgtcc ctcctgacag ggttgatgaa
421 gagtatcaaa atccccacac tgtggatcga gtccctatgg gaaaattgcc tcacatgtgg
481 ggtcagtctc tatacatttt aggaagcttg atggcagagg gatttttagc tcctggagaa
541 attgatcccc tgaatcgtag gttttctact gtgccaaagc cagatgtggt ggttcaagtc
601 tccattctgg ctgaaacaga agaaatcaag gccattttga aggacaaagg aattgatgtg
661 gagaccattg ctgaagtgta ccccataaga gtacagccag ctcgtattct cagccatatt
721 tattctagtc taggatgcaa cagtagaatg aaactcagtg gacgacccta caggctcatg
781 ggtgtgcttg gaacatcaaa actttatgac attcgcaaaa ctatctttac tttcactcca
841 cagtttatag accagcaaca gttctacctg gctctggaca accagatgat agtagaaatg
901 ctcagaacag acctttccta cctctgtagc cgctggagga tgacaggcca gcccacgatc
961 actttcccta tctcgcacac catgcttgat gaagatggaa ccagcttgaa tt
//
LOCUS MUSTCBVE 389 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse germline T cell receptor V-beta-13g gene V-region, 2 partial
exons.
ACCESSION M25914
SOURCE Mouse T lymphocyte DNA, clone V-beta-13g.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 389)
AUTHORS Morahan,G., Allison,J., Peterson,M.G. and Malcolm,L.
TITLE Sequence of the V-beta-13 gene used by an influenza-specific T cell
JOURNAL Immunogenetics 30, 311-313 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence [1] kindly
provided by Grant Morahan, 05-JUL-1989.
FEATURES from to/span description
pept < 1 23 T cell receptor V-region, exon 1 (AA at 2)
350 > 389 T cell receptor V-region, exon 2
IVS 24 349 TCR intron
BASE COUNT 87 a 91 c 89 g 122 t
ORIGIN
1 cagtgttctg ttccttgaca caggtaagtt tttagaaaaa cagtctccgt tttgcatttc
61 tctggggtgt gtgtgtgtga ttatttcaat tgttcccatt ttgaaacagt ctgatctttt
121 cccacagtac tgtctgaagc tggagtcacc cagtctccca gatatgcagt cctacaggaa
181 gggcaagctg tttccttttg gtgtgaccct atttctggac atgataccct ttactggtat
241 cagcagccca gagaccaggg gccccagctt ctagtttact ttcgggatga ggctgttata
301 gataattcac agttgccctc ggatcgattt tctgctgtga ggcctaaagg aactaactcc
361 actctcaaga tccagtctgc aaagcaggg
//
LOCUS MUSTYR 1838 bp ss-mRNA ROD 15-MAR-1988
DEFINITION Mouse tyrosinase mRNA, complete cds.
ACCESSION D00131
KEYWORDS melanin; melanocyte; tyrosinase.
SOURCE Mouse melanocytes cell line TM10 mRNA, cDNA to mRNA, clones
Tyrs-33, Tyrs-J2, and Tyrs-J19.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1838)
AUTHORS Yamamoto,H., Takeuchi,S., Kudo,T., Makino,K., Nakata,A.,
Shinoda,T. and Takeuchi,T.
TITLE Cloning and sequencing of mouse tyrosinase cDNA
JOURNAL Jpn. J. Genet. 62, 271-274 (1987)
STANDARD simple automatic
COMMENT Tyrosinase is the key enzyme for melanin,synthesis in mammalian
melanocytes and has been considered to be a unique marker for the
study of melanocyte differentiation. A cDNA library was constructed
from poly(A)+ mRNA from mouse melanocytes and screened using anti-
tyrosinase antiserum and oligonucleotide probes corresponding to
amino acid sequence of tyrosinase. sequencing of some cDNA clones
positive in these screenings gave a nucleotide sequence of 1838
nucleotides including a open reading frame of 1344 nucleotides that
was found to correspond exactly to the amino acid sequence of the
cyanogen bromide fragments of tyrosinase.
FEATURES from to/span description
pept 65 1411 mouse tyrosinase
signal 1735 1740 polyadenylation signal
BASE COUNT 505 a 397 c 399 g 537 t
ORIGIN 64 bp upstream of the tyrosinase initiation codon.
1 ggaaaagaag tctgtgacac tcattaacct attggtgcag attttgtatg atctaaagga
61 gaaaatgttc ttggctgttt tgtattgcct tctgtggagt ttccagatct ctgatggcca
121 ttttcctcga gcctgtgcct cctctaagaa cttgttggca aaagaatgct gcccaccatg
181 gatgggtgat gggagtccct gcggccagct ttcaggcaga ggttcctgcc aggatatcct
241 tctgtccagt gcaccatctg gacctcagtt ccccttcaaa ggggtggatg accgtgagtc
301 ctggccctct gtgttttata ataggacctg ccagtgctca ggcaacttca tgggtttcaa
361 ctgcggaaac tgtaagtttg gatttggggg cccaaattgt acagagaagc gagtcttgat
421 tagaagaaac atttttgatt tgagtgtctc cgaaaagaat aagttctttt cttacctcac
481 tttagcaaaa catactatca gctcagtcta tgtcatcccc acaggcacct atggccaaat
541 gaacaatggg tcaacaccca tgtttaatga tatcaacatc tacgacctct ttgtatggat
601 gcattactat gtgtcaaggg acacactgct tgggggctct gaaatatgga gggacattga
661 ttttgcccat gaagcaccag ggtttctgcc ttggcacaga cttttcttgt tattgtggga
721 acaagaaatt cgagaactaa ctggggatga gaacttcact gttccatact gggattggag
781 agatgcagaa aactgtgaca tttgcacaga tgagtacttg ggaggtcgtc accctgaaaa
841 tcctaactta ctcagcccag catccttctt ctcctcctgg cagatcattt gtagcagatc
901 agaagagtat aatagccatc aggttttatg cgatggaaca cctgagggac cactattacg
961 taatcctgga aaccatgaca aagccaaaac ccccaggctc ccatcttcag cagatgtgga
1021 attttgtctg agtttgaccc agtatgaatc tggatcaatg gatagaactg ccaatttcag
1081 ctttagaaac acactggaag tatttttgtt tgaacacaat ggctgcgaag ggcaccgccc
1141 tcttttggaa gtttacccag aagccaatgc acctatcggc cataacagag actcttacat
1201 ggttcctttc ataccgctct atagaaatgg tgatttcttc ataacatcca aggatctggg
1261 atatgactac agctacctcc aagagtcaga tccaggcttt tacagaaatt atattgagcc
1321 ttacttggaa caaggccagt cgtatctggc catggcttct tggggcagca ctggtgggag
1381 ctgttattgc tgcagctctc tctgggctta gcagtaggct atgccttcag aagaagaaga
1441 agaagaagca accccaggag gaaaggcagc cactcctcat ggacaaagac gactaccaca
1501 gcttgctgta tcagagccat ctgtgaacat cctaggaaac agagtggact gaaaggtttt
1561 acctcactcg acctatttgt tggtgtttct acaaatttaa actagcataa aacatagacc
1621 atagctgttt ggcttttttt cagacccatg ttttttccta agtcctagtt tctaagaaat
1681 gactgggatt tgctaaaata tatatatata taaataataa cttactaata gctaaataaa
1741 atttcctctt acaactaatt gagtggtttt tatgaatgtg tcttaattat ttaaacttga
1801 ggcacatttt tgttttcctt acttcattgt gaatttcc
//
LOCUS RATALDCPR 1294 bp ds-DNA ROD 15-MAR-1990
DEFINITION Rat aldolase gene 5'flank promoter region and exon 1
(EC 4.1.2.13).
ACCESSION X07845
KEYWORDS aldolase C; fructose 1,6-bisphosphate aldolase;
fructose bisphosphate aldolase; promoter.
SOURCE rat (Rattus rattus).
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1294)
AUTHORS Skala,H.
TITLE ;
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 1294)
AUTHORS Vibert,M., Henry,A., Kahn,A. and Skala,H.
TITLE ;
JOURNAL Eur. J. Biochem. 181, 33-39 (1989)
STANDARD simple automatic
COMMENT *source: library=Charon 4A; strain=Buffalo; see X06984 for rat
aldolase C cDNA
EMBL features not translated to GenBank features:
key from to description
SITE 1 640 promoter region
SITE 641 855 multiple transcription start sites
MSG 641 901 exon 1
IVS 902 >1294 intron I
[1] Author address Skala H., INSERM U. 129, 24 rue du Faubourg
Saint Jacques, 75014 Paris, France.
Submitted (06-JUN-1988) on tape to the EMBL data library
BASE COUNT 261 a 401 c 284 g 348 t
ORIGIN
1 ggatccctta tcccaataat aaaccaattg gcatggagct gactatgttt tatgctattt
61 aaataaagtg tatttaatgt gtccttccag tccactgaat ctaattttgg gagtaggggt
121 gggcaggtga aatcatggag gcagatcagc cctccagcct aggcagcctt cacttcctgc
181 ggcagctaaa gccagggctg actgcttttc ttcgcttgat tctcttgctt ccactctccc
241 ctccccccca aaaaaagggc aatggactgc ctgttccctg gatcactgcc tgtggggaac
301 agaaaagact tcattcactt actttcccca ttccgcccac ccgcccccgc cccccggggc
361 aggcactggt aaatatgggg cagagcatag aaactcaaat ccattattcc atgccttgaa
421 gtgggcaggc agcctgttag gtggcctctg tcctacctcc catctactga tgtttagatg
481 tctctcctct aaaccaggct taatccccaa cattccgcat aatgtgaaag catttgcctt
541 tcacagcaaa agactagtta tggatagagc aaacacaagt ccatctcttt cctcatgggt
601 tctagtggca gggttccatg atgagggagg ggcatcgtct tttacaaacc ttggggaccc
661 tggcagagaa gtcactccac ctcactactc cacccctgaa gcgtggcctt accctagacc
721 agtcctgggg agagggcggg accaggatgg gaggtgtctg tcacgccccc agggagtcac
781 gtagctctgc ggcatctgct gctgccttat ttactccagc ttggactgag ctactgtctg
841 ttgctaacca gaccaaagtg agctgtgcct gtggctccgg ccgctgactg actgagtata
901 ggtaaggtcc cgggtgaggc ggagatgccc cgtccccgtt tgtcttcccc tgccgtcctt
961 gccctgctcc ctaggaccct agctcctctg ctttctgtgg ctcaggaaaa ccctgcttct
1021 ctccttcccc ctcgctctcc tttccatttc caagtcttcc tctgcctcca gaacttcact
1081 ggggtcggtg tatcccacct agggtatttt aatctcagcc ccttcctggt ccttctaggc
1141 ttgtccccag gtctaggaag agcatcagga ctggttaacc tgtatcacac cactatctag
1201 ctcatctctc ccaggtctcc ttcctgcttt tcctgcatct ctccctcttt cccataatcc
1261 actcactcag ccttaattcc tcacctctgg atcc
//
LOCUS RATCNRAB 2267 bp ss-RNA ROD 15-MAR-1990
DEFINITION Rat calcineurin A beta mRNA.
ACCESSION D90036
KEYWORDS calcineurin; calcineurin A beta; calmodulin-binding protein;
calmodulin-dependent protein phosphatase; isoform.
SOURCE Rat brain, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2267)
AUTHORS Kuno,T., Takeda,T., Hirai,M., Ito,A., Mukai,H. and Tanaka,C.
TITLE Evidence for a Second Isoform of the Catalytic Subunit of
Calmodulin-Dependent Protein Phosphatase (Calcineurin A)
JOURNAL Biochem. Biophys. Res. Commun. (19xx) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Takayoshi
Kuno Department of Pharmacology Kobe University School of Medicine
7-5-1 Kusunoki-cho,Chuo-ku Kobe 650 Japan Phone: 078-341-7451
x3273 Fax: 078-351-6531 Peptides, 87-338 and 401-424, seem to be
putative catalytic domain and calmodulin binding domain,
respectively.
FEATURES from to/span description
ORF 40 1617 ORF (525 AA)
BASE COUNT 639 a 481 c 525 g 622 t
ORIGIN
1 cccgcccggc ccggcaccta gccgagcccc gggcccagca tggccgcccc ggagccggcc
61 cgggccgcgc cgccccctcc cccgcccccg ccgccccccc tcggggccga ccgcgtcgtc
121 aaagccgttc cttttccccc aactcaccgg ttgacatctg aagaagtgtt tgatatggat
181 gggataccca gggttgatgt tctgaagaac catctggtaa aagaaggtcg ggtagatgaa
241 gaaattgcac taagaattat caatgagggt gctgccatac ttaggcggga gaaaaccatg
301 atagaagtag aagctccaat tacagtgtgt ggtgacatcc atggccaatt ttttgatctg
361 atgaaacttt ttgaagtagg aggatcacct gctaatacac gatacctttt tcttggtgat
421 tatgtggaca gaggttattt tagtatagag tgtgtcttgt atttatgggt cttgaagatt
481 ctatacccaa gcacattatt ccttctgaga ggcaaccatg aatgcagaca cctcactgag
541 tattttacct ttaagcagga atgtaaaatt aagtattcag aaagggtcta tgaagcttgt
601 atggaggctt ttgacagctt gccccttgct gcacttctaa accaacaatt tctttgtgtt
661 catggtggac tttcaccaga gatacacaca ctggatgata ttaggagatt agatagattt
721 aaagagccac ctgcatttgg accaatgtgt gacttgctgt ggtctgatcc ttctgaagac
781 tttggaaatg aaaaatcaca agaacatttt agtcataata cggttcgagg atgttcttat
841 ttttataact atccagcagt gtgtgaattt ttgcaaaaca ataatttgtt atcgattatt
901 agagctcatg aagctcaaga tgcaggctat agaatgtaca gaaaaagtca aactacaggg
961 tttccttcat taataacaat tttttcggca cctaattact tagatgtcta caataataaa
1021 gctgctgtac taaagtatga gaacaatgtg atgaacatcc gccagtttaa ctgttctcca
1081 catccttact ggttgcccaa ttttatggat gtctttacat ggtctttacc atttgttgga
1141 gaaaaagtca cagagatgtt ggtaaatgtt ctcagtattt gctctgatga tgaactaatg
1201 acagaagggg aagaccagtt tgatgtaggt tcagctgcag cccggaaaga aatcataaga
1261 aacaagatcc gagcaattgg caagatggca agagtcttct ctgttctcag ggaggagagt
1321 gaaagcgtgc tgacactcaa gggcctgact cccacaggga tgttgcctag tggagtgttg
1381 gctggaggac ggcagacctt gcaaagtgcc acagttgagg ctattgaggc tgaaaaagca
1441 atacgaggat cctctccacc acatagaatc tgcagttttg aagaggcaaa gggtttggat
1501 aggatcaatg agagaatgcc accccggaaa gatgctgtgc agcaagatgg tttcaattcc
1561 ctgaacaccg cacataccac tgagaaccac gggactggca accatagtgc ccagtgacca
1621 gcggcttccc agggatggac agatcactaa ggagccggag gggtcggccg agctgatgat
1681 aaatgtcata atctctctga agaaacaaac cattgtgctt tttgagaccc ttgccccctt
1741 cccggatgga ggcttgaggg ccttgggact tgtgctgtcc ataagattgg ggaatcgctg
1801 ccacagcgga gagcagtgag caaggggctt ggggcaaatt ccagtggagg aagcccaaac
1861 ctccatttat gcttgtggtt cacacattta cgtttacaaa tgagatttcc tttgttttct
1921 ccctcagtag aattagattt ttttcaacca tgactttaaa tgcaatcttt agagttaatg
1981 tggaacctcc ctcaccccca ccccatgaaa tgtctttaag aggatggatt agcatggtct
2041 taaaatatat ttctgaggtt actagatgta ttttgaattg tagacaaaat ctgagaaacc
2101 cagttggtgt ttatataaaa acgctgacct caggtcatag ttcttaaatg tggctaattc
2161 tgtaacatag tcttggtatt tttcaattat gaatgcataa actatttcta agaagactct
2221 tacttgaaca agatccaaaa aaccaattta gatccttttt gcccccg
//
LOCUS RATGSTB 1038 bp ss-mRNA ROD 25-SEP-1987
DEFINITION Rat liver glutathione S-transferase Yb-1 subunit mRNA, complete
cds.
ACCESSION M11719
KEYWORDS glutathione transferase.
SOURCE Rat, cDNA to mRNA, clone pGTA/C44.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1038)
AUTHORS Ding,G.J.-F., Lu,A.Y.H. and Pickett,C.B.
TITLE Rat liver glutathione S-transferases: Nucleotide sequence analysis
of a Yb-1 cDNA clone and prediction of the complete amino acid
sequence of the Yb-1 subunit
JOURNAL J. Biol. Chem. 260, 13268-13271 (1985)
STANDARD full staff_review
FEATURES from to/span description
pept 38 694 glutathione S-transferase Yb-1 subunit (EC
2.5.1.18)
BASE COUNT 243 a 313 c 233 g 249 t
ORIGIN Unreported.
1 gctgaagcca aattgagaag accacagcgc cagaaccatg cctatgatac tgggatactg
61 gaacgtccgc gggctgacac acccgatccg cctgctcctg gaatacacag actcaagcta
121 tgaggagaag agatacgcca tgggcgacgc tcccgactat gacagaagcc agtggctgaa
181 tgagaagttc aaactgggcc tggacttccc caatctgccc tacttaattg atggatcgcg
241 caagattacc cagagcaatg ccataatgcg ctaccttgcc cgcaagcacc acctgtgtgg
301 agagacagag gaggagcgga ttcgtgccga cattgtggag aaccaggtca tggacaaccg
361 catgcagctc atcatgcttt gttacaaccc cgactttgag aagcagaagc cagagttctt
421 gaagaccatc cctgagaaga tgaagctcta ctctgagttc ctgggcaagc gaccatggtt
481 tgcaggggac aaggtcacct atgtggattt ccttgcttat gacattcttg accagtacca
541 catttttgag cccaagtgcc tggacgcctt cccaaacctg aaggacttcc tggcccgctt
601 tgagggcctg aagaagatct ctgcctacat gaattgcagc cgctacctct caacacctat
661 attttcgaag ttggcccaat ggagtaacaa gtaggccctt gctacactgg cactcacaga
721 gaggacctgt ccacattgga tcctgcaggc accctggcct tctgcactgt ggttctctct
781 ccttcctgct cccttctcca gctttgtcag ccccatctcc tcaacctcac cccagtcatg
841 cccacatagt cttcattctc cccactttct ttcatagtgg tccccttctt tattgacacc
901 ttaacacaac ctcacagtcc ttttctgtga tttgaggtct gccctgaact cagtctccct
961 agacttaccc caaatgtaac actgtctcag tgccagcctg ttcctggtgg gggagctgcc
1021 ccaggcctgt ctcatctt
//
LOCUS RATRHLA 1116 bp ss-mRNA ROD 15-MAR-1989
DEFINITION Rat asialoglycoprotein receptor (RHL2) mRNA, complete cds.
ACCESSION M16347
KEYWORDS alternative splicing; asialoglycoprotein receptor.
SOURCE Rat (Sprague Dawley) liver RNA, clone 13D.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1116)
AUTHORS McPhaul,M. and Berg,P.
TITLE Identification and characterization of cDNA clones encoding two
homologous proteins that are part of the asialoglycoprotein
receptor
JOURNAL Mol. Cell. Biol. 7, 1841-1847 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 154 1059 asialoglycoprotein receptor (RHL2)
BASE COUNT 266 a 305 c 288 g 257 t
ORIGIN
1 ggtgcctaga ttagccccct cctccttctc gcctgctgtc ctgctgtccc aggtttaacc
61 ccctttttct ccttggactc aggctgcctc cggaagcaga gtagctctct atacatttaa
121 cagtcccaga tctgtctcca gcctagggcc atcatggaga aggactttca agatatccag
181 cagctggact ctgaggaaaa cgaccatcag ctcattggcg atgaggaaca aggctctcat
241 gtgcagaatc ttaggaccga aaatccacgt tggggaggac agcctccttc caggcccttt
301 ccacagcgcc tctgctccaa gttccgcctc agtctgctcg ccctggcctt caacattctc
361 ctgctggtgg tcatctgtgt ggtttcatcc caaagcatgc agctgcaaaa ggagttctgg
421 accctgaaag aaaccttgag caacttctcc accaccaccc tgatggagtt caaggctctg
481 gactcccacg gaggtagcag gaatgacaac ttgacttctt gggaaacaat actggagaaa
541 aagcagaagg acataaaagc agatcactcc acgctgctct tccacctgaa gcacttcccc
601 ctggatctgg caaccctgac ctgtcagctg gcgttcttcc tgagcaacgg cacagaatgc
661 tgccccgtta actgggtgga gtttggtgga agctgctact ggttttctcg ggatgggctc
721 acctgggctg aggctgacca gtactgccaa atggagattg cccatctgct ggtcatcaac
781 tcaagggagg agcaggaatt cgttgtaaag cacaggggcg cgtttcacat ttggataggt
841 ctcaccgaca aggatggctc ctggaaatgg gtggatggga cggaatatag aagtaacttc
901 aagaattggg ctttcactca gccagataac tgccaggggc atgaagaggg gggaagtgaa
961 gactgtgctg aaatcctgtc agatggcctc tggaatgaca acttctgcca gcaggtgaac
1021 cgctgggctt gtgaaaggaa acgggacatc acctactagg agtctgctct actatgtctt
1081 tgtcaccctc cgggaacccc gcatcactca ttagga
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS ECOPHNAQ 15611 bp ds-DNA BCT 02-APR-1990
DEFINITION E.coli psiD locus containing alkylphosphonate uptake (phn) genes A
through Q, complete cds.
ACCESSION J05260
KEYWORDS C-P lyase; HisM-like integral membrane protein;
HisP-like nucleotide binding protein; inverted repeats.
SOURCE E.coli (strains B) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 15611)
AUTHORS Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T.
TITLE Molecular biology of carbon-phosphorus bond cleavage: Cloning and
sequencing of the phn (psiD) genes involved in alkylphosphonate
uptake and C-P lyase activity in Escherichia coli B
JOURNAL J. Biol. Chem. 265, 4461-4471 (1990)
STANDARD full staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by C.-M.Chen,
01-MAR-1990, for release after publication.
FEATURES from to/span description
pept 2874 3209 phnA protein
pept 3767 4210 phnB protein
pept 4343 5131 HisP-like nucleotide binding protein (phnC)
pept 5156 6172 phnD protein
pept 6227 7057 phnE protein
pept 7078 7803 phnF protein
pept 7804 8256 phnG protein
pept 8253 8837 phnH protein
pept 8837 9901 phnI protein
pept 9894 10739 phnJ protein
pept 10736 11494 HisP-like nucleotide binding protein (phnK)
pept 11605 12285 HisP-like nucleotide binding protein (phnL)
pept 12282 13418 HisM-like integral membrane protein (phnM)
pept 13418 13975 HisP-like nucleotide binding protein (phnN)
pept 13962 14396 phnO protein
pept 14398 15156 phnP protein
pept 15131 15496 phnQ protein
pept 2473 245 (c) ORF742
pept 7470 7030 (c) ORF146
pept 9298 8489 (c) ORF269
pept 12264 11920 (c) ORF114
pept 14760 14380 (c) ORF126
rpt 3224 3257 inverted repeat a
rpt 3271 3306 inverted repeat b
rpt 3324 3357 inverted repeat c
rpt 3371 3406 inverted repeat d
rpt 3424 3457 inverted repeat e
rpt 3471 3506 inverted repeat f
rpt 3524 3557 inverted repeat g
rpt 3571 3606 inverted repeat h
rpt 3624 3657 inverted repeat i
rpt 3671 3706 inverted repeat j
rpt 6176 6211 inverted repeat k
rpt 6229 6262 inverted repeat l
rpt 11501 11534 inverted repeat m
rpt 11548 11583 inverted repeat n
signal 2622 2627 -35 region
signal 2645 2650 -10 region
signal 2738 2743 -35 region
signal 2771 2776 -10 region
signal 4074 4079 -35 region
signal 4096 4101 -10 region
signal 2084 2079 (c) -10 region
signal 2108 2103 (c) -35 region
signal 2720 2715 (c) -10 region
signal 2742 2737 (c) -35 region
signal 12056 12051 (c) -10 region
signal 12080 12075 (c) -35 region
signal 2737 2754 phosphate (pho) box
BASE COUNT 3466 a 4625 c 4306 g 3214 t
ORIGIN
1 ggatccagca tcgacgccag tttttccacc attgtcagtc gcaggctaag cggcgcattt
61 aacatgccgc cgttcgtcca tgtctgaagc tgcacacgcg aaagaagttc ctgcatcagt
121 cgttcacgaa actgctgctg atgggcttgt ggaaggcggg catcatcgcc ctgcgccaga
181 tccactaaaa agcggggata aaccgactcc agcacgcgac cggggccgtc cagtaacgtc
241 ttggtcaata tcgttctgcc gtgaaaagtg tttgaatatc atcgcgtaac agctgggcgt
301 cggtgtaaat ccagccgtga gtcatcacag tctgctgcaa ttgctgctgc atcagcctga
361 ccaccgattc attttgttga cgcagagcca ggctttcgcg taaacgcgtc tgtaattccg
421 tcaaacatga agcgaactca gcgaaaaaag tattcatgcc tgccgtaaca gattcatcga
481 cctgctctgc cagaacttta gccatttgtt ggcaataaag atcgacttct gcgcttaatg
541 ctcgttgcaa cacactgtaa tcaaccgttt ctgtcgggga tttctcattt ccccgtcccc
601 agtcgggctg attcaaccag cgcgaaaaag tctcacgcac aacgcctaaa cgcgtgctct
661 gctcgtccgt tgcatcctgg cgcgaaatga ctgcactgaa cagctggcga gtgttgaagt
721 ggggaactac gccgtgaaaa acaggaaaat gaaacccagg acgaaaccct gactcgctca
781 attccatttt gacttgttgc tcaatggggc gaataacatc ggttaacact cggcaaaggg
841 tggattccag ctcggcaaaa cgcagcgtaa agtcgcgact gatggtgttc tgcgccgtct
901 gtaacagtgt ctcacagcgg gtacgcatct cgcttaacgg ctccgaatca tcctgaaaca
961 aggcggctaa ctgcgcattc agcgcatctt gttgttgacg cagaaagtgg ttggcggagg
1021 tcagggccag ctcgatttca tgtttaatct cgccgctcac ctgcgcctga ttgagttgca
1081 atagctgcaa actttcttcg acctgatgga tattttgccg caattgttca caagcgacgt
1141 ttaacccgtg cgcacgaaaa tccaggtatt cccgcgcctg ctgcgcgtaa ttcaacagtt
1201 tatgcgcagc agatcgcaaa gcatacaacg aggcgttagc gtaagcggca tgaagcaacg
1261 cctgaattgg ctgggcgaac agcgaatctt cccacaactg atcggcagca tgacgaatat
1321 gttcgaggtc cgccagatcg gcatgacgcc agcgcctgcc gagcgcggca tgggcaaaat
1381 cttccaccca gcgttgttgc tctggcgctg gtaacttacc gttgttggct aactcatggc
1441 gcgcccgatt cgccaggtag ccccacatcg acgacaccgg aaatatctgc tgtggcgtaa
1501 tacagccttt catcagcgtc ccggaaatca gtgcccgcac ctggtcggcg tcgtcactgt
1561 tacgatcctg ttgatcgaac ttattgacca gcacatacag cggcaccgat tgccccaccg
1621 ccaaaatcgc ctcacggacc tcttcatcgg agatcgattt cagttgcgta taatccagca
1681 ccgccagtac cgccgaggcg cgtgccagct gctggttaag cattttttgc agatgcggtt
1741 gcccggcttc atttggcccg ggggtatcca gtaacgtcaa ctgaccggga taactctcca
1801 gccccgccag atggacaaac tccacttcaa tcacgggaat atgctcaatg gcggcgtaag
1861 cagaaaaagg aaaatcgacg tccagcgcct tcgccagtcg cactaaatca ttcaaacttt
1921 tcagacaatg aaaaataggc tgggcaccca gataatattt ttcgaaagcg acgccatttt
1981 cgatccgctg cataagcgca cgcatatctt tatctatttc cagcacatcg gtcagatgct
2041 taatatcgca atcacgcagg cgctgttgta attgttgaat taaacaatcg attggcgcga
2101 catgtgaaaa atgcagtacc ggttcctttt gcccgggcgt atggcgaata agcgtcggca
2161 gcgcagtcat tgggcgatta cgattaggca gaacctccgt accaacaatg gcattaatgg
2221 tggttgattt ccctgctttc atggtaccga caattgcaag caccatttcc agtcgggaaa
2281 ttttacgcaa ctcattattc agcatcgcgt gacgttcggc gatattaggc tgactccagg
2341 gtaaagccag ttgtggcgcg tcgtctccgg gtacagagag aggcattttt tccagtaact
2401 gcaactgttg gcgagaaagc tgtaacaggc gttcagcctc ctgacttaac tcatacaggg
2461 tctgtgtgta catagaaaat tcttccttaa agcaaatttt gttattttat ttagccagat
2521 tgtttttgag ttctgttttc ggcttttata attactgcaa gaaataattt tatatttagt
2581 gtgttgtttt ttatcagaat aaataacgtc ttctgatacg tttaaaacgt cagaaagata
2641 aaaatatcat gtgaattaaa aaaagaacaa gtagagcatt aacattatct taaataataa
2701 atagaggcaa aaagattatt ttctttttgc gtttcctttc aaatgaaaac gatcgtcgtc
2761 taaaatcagc agtacccccg acaaactcag ggattttgtg tataattgcg gcctttttcg
2821 gcaatctgcc gttttttggc gcttttgccc tgctgacttt tgaggaaatc cacatgtcat
2881 taccacactg cccaaaatgc aactccgaat acacttacga agataacggc atgtacatct
2941 gcccggaatg tgcctacgaa tggaacgacg cagaacctgc acaggaaagc gacgagctga
3001 tcgttaaaga tgctaacggc aatctgctgg ctgacggcga cagcgttacc atcattaaag
3061 atctgaaggt gaaaggtagc tcttcgatgc tgaaaattgg caccaaagtg aaaaacatcc
3121 gcctggttga aggcgaccat aacatcgatt gcaaaatcga cggttttggt ccgatgaaac
3181 tgaaatctga gtttgtgaaa aagaactgat tgtattgtga tcggtaagcc ggataaggcg
3241 ctcgcgccgc atccggcaac ggtgccagat gcctgatgcg acgcttgcgc gtcttatcag
3301 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac
3361 ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc
3421 tccgtaggcc ggataaggcg tttacgccgc atccggcaac agtgccaact gcctgatgcg
3481 acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg
3541 tttacgccgc atccggcaat ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag
3601 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac
3661 agtgccgact gcctgatgcg acgctcgcgc gtcttatcag gccgcctctc atctgtataa
3721 atttcgaact acacttaact ggcttctctt aactgaggtc accatcatgc cgttaagtcc
3781 ctacctctct tttgccggta actgttccga cgcgattgcc tattatcaac gtacgttggg
3841 cgcggaactg ctctataaaa tcagcttcgg cgaaatgcca aaatcagcgc aggacagcgc
3901 cgagaactgc ccttccggaa tgcaatttcc cgataccgcc atcgctcatg ccaacgtgcg
3961 cattgccgga agcgacatca tgatgagcga tgccatgccg tcaggaaaag ccagctactc
4021 cggctttacg ctggtgctcg attcgcaaca ggtcgaagaa ggaaaacgct ggtttgacaa
4081 tcttgccgct aacggaaaaa tcgaaatggc ctggcaggaa actttctggg cgcatggctt
4141 tggcaaagtc accgataaat ttggcgtacc gtggatgatt aatgtcgtca aacaacaacc
4201 aacgcaataa cccgccggga ggcccgccct cccgcactgt catcgaattc ccgttaactc
4261 ttcatctgtt agtcactttt aattaaccaa atcgtcacaa taatccgcca cgatggagcc
4321 acttttttag ggaggctgca tcatgcaaac gattatccgt gtcgagaagc tcgccaaaac
4381 cttcaatcag catcaggcgc tgcatgcggt tgatctgaac attcatcacg gtgaaatggt
4441 ggctctgctt gggccgtcgg gttccggcaa atccaccctt ttacgtcact taagcggttt
4501 gattaccggc gataaatccg ccggcagcca tatcgagctg ctgggccgca cagtccagcg
4561 cgaaggccgt ctggcgcgcg atatccgcaa aagccgcgcc aacaccggct acatcttcca
4621 acaattcaac ctggtgaacc gcctgagcgt actggagaac gtgctgattg gcgcgctcgg
4681 cagcacgccg ttctggcgca cctgttttag ctggtttacc cgcgagcaga aacaacgcgc
4741 gttacaggcg ctgacccgcg ttggcatggt gcattttgcc catcaacgcg tttccaccct
4801 ctccggcgga cagcagcagc gtgtggcgat tgcccgcgcg ctgatgcagc aggcgaaggt
4861 gattctggcc gatgaaccca tcgcctcgct ggacccggaa tccgcccgca tcgtgatgga
4921 caccctgcgc gacatcaatc agaacgacgg catcaccgtg gtcgtcacgc tgcatcaggt
4981 ggattacgcc ctgcgctact gcgaacgcat cgtcgccctg cgccaggggc acgttttcta
5041 cgacggcagc agccaacagt ttgataacga acgttttgac catctctacc gcagcattaa
5101 tcgcatcgaa gagaacgcga aagctgcctg acatccccat cattgaggaa aacgaatgaa
5161 cgctaagata attgcctcgc tggccttcac cagcatgttc agcctcagca ccctgttaag
5221 cccggcacac gccgaagagc aggaaaaggc gctgaatttc ggcattattt caacggaatc
5281 acagcaaaac ctgaaaccgc aatggacgcc attcttacag gatatggaga agaagctggg
5341 cgtgaaggtg aacgccttct ttgccccaga ctacgcaggc attatccagg gaatgcgctt
5401 caataaagtg gatatcgcct ggtacggcaa cctgtcggca atggaagcgg tggatcgcgc
5461 caacggccag gtcttcgccc agacggtcgc ggcggatgga tcgccaggtt actggagcgt
5521 gttgatcgtc aacaaagata gtccgatcaa caacctgaac gatctgctgg cgaagcggaa
5581 agatctcacc ttcggcaatg gcgatcctaa ctccacctct ggcttcctcg tccccggtta
5641 ctacgtcttc gccaaaaaca atatctccgc cagcgacttc aagcgcaccg tcaacgccgg
5701 gcatgaaacc aacgcgctgg ccgtcgccaa caagcaggtg gatgtggcga ccaacaacac
5761 cgaaaacctc gacaagctga aaacctccgc gccggagaag ctgaaagaac tgaaagtgat
5821 ctggaaatcg ccgctgatcc caggcgatcc gatcgtctgg cgtaaaaatc tttccgaaac
5881 caccaaagac aagatctacg acttctttat gaattacggc aaaacgccgg aagagaaagc
5941 ggtgctggaa cgcctgggct gggcgccgtt ccgcgcctcc agcgacctgc aactggtgcc
6001 gattcgccag ctcgcactgt ttaaagagat gcagggcgtg aaaagcaata aaggactgaa
6061 tgagcaggac aagctggcaa aaaccaccgc gattcaggcg caactggatg acctggaccg
6121 cctgaacaac gcgctaagcg cgatgagttc ggtgagtaaa gcggtgcagt aaatcgtagg
6181 tcggataaga cgccccggcg tcgcatccga caatgtgcag gcgttgatgc cggatgcggt
6241 gcaagcacct tatccggcct acagaccgga gccaaacatg caaaccatca ccatcgcccc
6301 acccaagcgc agctggttct cgcttctgag ctgggccgtt gttctcgccg tgctggtcgt
6361 ctcgtggcag ggcgcggaaa tggccccgct cacgctgatt aaagacggcg gcaacatggc
6421 aaccttcgct gccgacttct tcccgcccga tttcagccag tggcaggatt acctcaccga
6481 aatggccgtc acgctgcaaa tcgccgtctg gggcaccgcg ctggcggtgg ttctctccat
6541 cccctttggc ctgatgagcg ccgaaaacct ggtgccgtgg tgggtttacc agcccgttcg
6601 ccgcctgatg gacgcctgcc gcgccattaa cgaaatggtc ttcgccatgc tgttcgtggt
6661 cgccgtcggt ctcggaccgt tcgctggcgt gctggcgcta tttatccaca ccaccggcgt
6721 gctctccaag ctgctttccg aagcggtaga agcaattgaa cctggcccgg tggaaggcat
6781 tcgcgccacc ggtgccaaca agctcgaaga gatcctctac ggcgtgctgc cgcaggtgat
6841 gccgctgctg atctcctact ccctctatcg cttcgaatcc aacgtccgct cggcgaccgt
6901 cgtcggcatg gtcggcgcgg gcgggatcgg cgtcaccctg tgggaagcga ttcgcggttt
6961 ccagttccaa caaacctgcg ccctgatggt gcttatcatc gtcacggtca gcctgctgga
7021 tttcctctct caacggttgc gtaagcactt tatctgataa gcgaggcatt gatatctatg
7081 cacttgtcta cacatccgac cagctaccca acacgctatc aagagatagc cgcaaaactt
7141 gagcaggagc ttcgtcaaca ctaccgctgc ggcgactatc ttcccgccga gcagcaactg
7201 gcagcgcgct ttgaggtgaa tcgccacacc ctgcgccgcg ccatcgacca actggtggaa
7261 aaaggctggg tacagcgccg tcagggcgtc ggcgtgctgg tgctgatgcg cccgttcgat
7321 tacccgctca acgcccaggc gcgttttagc cagaatctgc tggatcaggg cagccatccc
7381 accagcgaaa aactgctttc ggtattgcgc cccgcgtccg gccacgtcgc tgacgcactg
7441 gggattaccg agggggagaa cgtcatccac ctgcgcaccc tgcgtcgggt caacggcgtc
7501 gcgctctgtt taatcgacca ctacttcgcg gacctcaccc tctggccgac gctgcaacgc
7561 ttcgacagcg gctcgctgca cgattttctg cgcgagcaaa ccggaattgc gctgcgccgc
7621 agccagacgc ggatcagcgc ccgccgcgcc caggccaaag agtgccagcg tcttgaaatc
7681 ccgaatatgt cgccgctgct gtgcgtgcgc acccttaacc accgtgacgg tgaaagcagc
7741 ccggcggagt actccgtcag cctgacgcgc gccgacatga ttgaattcac tatggagcac
7801 tgaatgcacg cagataccgc gacccgccag cactggatgt ccgtgctggc gcacagccaa
7861 ccggctgaac tggcagcacg cctgaacgcg ctaaacatca ccgccgacta tgaggtgatc
7921 cgcgccgctg aaactggcct ggtacagatt caggcgcgga tgggcggcac cggcgaacgt
7981 ttttttgccg gcgacgccac gctgacccgc gccgccgtgc gcctgactga cggcacgctc
8041 ggctacagct gggtgctggg gcgtgataaa cagcacgccg aacgctgcgc gctgattgac
8101 gcgctgatgc agcaatctcg ccactttcaa aacttatcag aaacccttat tgccccgctg
8161 gacgctgacc gtatggcacg cattgccgca cgccaggccg aagtgaacgc cagccgggtc
8221 gacttcttta cgatggttcg cggagacaac gcatgaccct ggaaaccgct tttatgcttc
8281 ccgtgcagga tgcccagcac agttttcgtc gcctgttaaa ggccatgagc gagccgggcg
8341 tgattgtcgc cctgcatcag ctcaaacgcg gctggcaacc gctgaatatc gccaccacca
8401 gcgtgctgct gacgctggcc gataacgaca cgccggtgtg gctttctacc ccattaaata
8461 acgatatcgt caaccagagc ctgcgttttc ataccaacgc gccgctggtc agccagccgg
8521 aacaggcgac cttcgcggtg acggatgagg cgatttccag cgaacagctc aacgcccttt
8581 ccaccggcac cgccgttgcg ccggaagcgg gcgcgacgct gattttacag gtcgccagcc
8641 tgagcggcgg gcgcatgttg cgtctcaccg gcgcgggtat tgccgaagaa cgaatgatcg
8701 ctccgcagct gccggagtgc attctgcacg aactcaccga gcgcccgcac ccgttcccgc
8761 tcggcatcga cctgatcctg acctgcggcg aacgcctgct ggctattccg cgaaccacgc
8821 atgtggaggt gtgctgatgt acgttgccgt aaaagggggc gaaaaggcga tcgacgccgc
8881 ccacgccctg caagagagcc gacgccgggg cgataccgat ttgcctgaac tgagcgtcgc
8941 ccagattgaa cagcagctta acctcgcggt agatcgcgtg atgaccgaag gcggcattgc
9001 cgaccgcgaa ctggcggcgc tggcgctgaa acaggccagc ggcgataacg ttgaagcgat
9061 tttcctgctg cgcgcctacc gcaccacgtt ggcgaagctg gcggtaagcg agccgctcga
9121 caccaccggg atgcgtctcg aacgccgtat ctccgccgtt tataaagaca ttcccggcgg
9181 ccagctgctt ggcccaacct acgactacac ccatcgcctg ctcgatttta ccctgctggc
9241 aaacggcgaa gcgccgacgc tgaccaccgc cgacagcgaa caacagccgt cgccgcacgt
9301 tttcagcctg ctggcgcgtc aggggctggc gaagtttgaa gaggatagcg gcgcacagcc
9361 ggatgacatc acccgcacgc cgccggttta cccctgctca cgttcttccc gtttgcagca
9421 gttgatgcgc ggcgacgaag gctatttgct ggcgctggcc tactccaccc agcgtggtta
9481 cggacgcaat cacccgttcg cgggcgagat ccgcagtggt tacatcgacg tgtcgattgt
9541 gccggaagag ctgggatttg cggtaaacgt cggcgaacta ctgatgaccg agtgtgaaat
9601 ggtcaacggt tttatcgacc cgccggatga gccgccgcac ttcacgcgcg gctacgggct
9661 ggtattcggc atgagcgagc gcaaagcgat ggcaatggcg ctggtcgatc gtgcgttgca
9721 ggctccggaa tacggcgagc acgcgacagg cccggcgcag gatgaagagt ttgtgctggc
9781 acatgccgac aacgtcgaag ccgcaggctt tgtctcgcac ctcaaactcc cccactacgt
9841 cgatttccag gccgaactgg agctactcaa acgtctgcaa caggagaaga accatggcta
9901 atctgagcgg ctacaacttt gcctacctcg acgagcagac caaacgcatg atccgccgcg
9961 ccatcttaaa agcggtggcg atccccggtt atcaggtgcc gtttggcggg cgcgagatgc
10021 cgatgccata cggctgggga accggcggca tacagctcac cgccagcgtg attggcgaaa
10081 gcgacgtgct aaaggtgatt gaccagggtg cggatgacac caccaacgcc gtgtcgattc
10141 gcaacttctt taagcgcgtg accggggtaa acaccactga acgtacggac gatgcgacgc
10201 ttatccagac gcgtcaccgc atccccgaaa cgccgctgac cgaagatcag atcattatct
10261 tccaggtgcc aatcccggaa ccgctgcgct ttatcgagcc gcgcgaaacg gaaacccgca
10321 ccatgcacgc gctggaagag tacggcgtga tgcaggtgaa actgtatgaa gatatcgccc
10381 gcttcggtca tatcgccact acctacgcct atccggtgaa ggtgaacggg cgctacgtaa
10441 tggacccgtc gccgatcccg aaattcgata acccaaaaat ggacatgatg cccgccctgc
10501 aactgttcgg cgcggggcgc gagaagcgca tctatgcggt gccgccgttt acccgcgtgg
10561 aaagtctcga tttcgacgat cacccgttca ccgttcagca gtgggatgag ccatgcgcca
10621 tctgcggatc gacccacagc tatcttgatg aagtggtgct ggatgacgcc ggaaaccgca
10681 tgtttgtctg ctccgatacc gattattgcc gccaacagag cgaggcaaaa aaccaatgaa
10741 tcaaccgtta ctttcggtca ataacctgac ccacctttac gcgccgggca aaggctttag
10801 cgatgtctct tttgatttat ggccggggga agtgctgggc attgtcgggg aatccggctc
10861 cgggaagacc acgctgctga agtcgatctc cgcgcgcctg acgccgcagc agggggaaat
10921 tcactacgag aaccgttcgc tgtatgcaat gagcgaggcc gaccgccgtc gcctgctgcg
10981 taccgaatgg ggcgtggtgc atcagcatcc actcgacggc ctgcgccgcc aggtgtcggc
11041 aggcggcaat atcggcgagc ggctgatggc gaccggggca cgtcattacg gcgatattcg
11101 tgccaccgcg cagaagtggc tggaagaggt ggagattccc gccaaccgga tcgacgacct
11161 gccgaccacc ttttccggcg gtatgcagca gcgtttgcag attgcccgca acctggtgac
11221 gcatccgaag ctggtgttta tggatgaacc gaccggcggg ctggatgtgt cggtgcaggc
11281 ccgcctgctc gacctgctgc gcggcctggt ggtggagctg aacctcgcgg tggtgattgt
11341 cacccatgat ttaggcgtcg cccgcctgct ggcggaccgt ttgctggtga tgaagcaggg
11401 gcaagtggtg gagagtgggt taaccgaccg cgtgctcgac gacccgcatc atccgtatac
11461 acagctgctg gtgtcatcgg ttttgcagaa ttgagccggt gccggatgcg gcgtaaacgc
11521 cttatccggc ctacaaatgc gctccccgta ggtcggataa gacgcgtcag cgtcgcatcc
11581 gacacccgaa ccacgaggcg aaaaatgatt aacgtacaaa acgtcagtaa aaccttcatc
11641 ctgcaccagc aaaacggcgt gcgcctgccc gtcctcaatc gcgcctcgct caccgtcaac
11701 gcgggcgaat gcgtggtgct ccacggccat tccggcagcg gcaaatcaac tctgctacgc
11761 tcgctgtacg ccaactatct acccgacgaa ggtcaaatcc agatcaaaca cggtgacgag
11821 tgggtagacc tggtcaccgc gccagcgcgc aaagtggtgg aaatccgcaa aaccaccgtc
11881 ggctgggtga gccagtttct gcgcgtcatc ccgcgtatct cagcactgga agtggtgatg
11941 cagccgctgc tcgataccgg cgttccgcgt gaagcctgcg ccgctaaagc cgcgcgtctt
12001 ctcacccgcc tgaacgtgcc ggaacgcctg tggcacctgg caccatcgac attttccggt
12061 ggcgaacagc agcgcgtcaa catcgcccgc ggctttatcg tcgactaccc cattctgctg
12121 cttgacgaac ctaccgcctc gctggacgcc aaaaacagcg ccgcggtggt ggaactgatt
12181 cgcgaagcca aaacccgtgg cgcagccatc gtaggcatct tccatgacga agctgtacgt
12241 aatgacgtcg ccgaccgcct gcacccaatg ggagcctctt catgattatc aataacgtta
12301 agctggtgct ggaaaacgag gtggtaagcg gttcgctgga ggtgcagaac ggcgaaatcc
12361 gcgcctttgc cgaaagccag agccgcctgc cggaggcgat ggacggcgaa ggcggctggc
12421 tgctgccggg gctgattgag ctgcataccg ataatctgga taaattcttc accccgcgcc
12481 cgaaagttga ctggcctgcc cactcggcga tgagcagcca cgacgcgctg atggtggcga
12541 gcggcatcac caccgtactg gatgccgtgg caattggcga cgtgcgcgac ggcggcgatc
12601 ggctggagaa tctggagaag atgatcaacg ccatcgaaga gacgcagaaa cgcggcgtca
12661 accgcgccga gcaccgtctg catctgcgct gcgaactgcc gcatcacacc acgctgccgc
12721 tgtttgaaaa actggtgcag cgcgagccgg tgacgctggt gtcgctgatg gaccactcgc
12781 cgggccagcg ccagttcgcc aaccgcgaga agtatcgcga atattatcag ggcaaatact
12841 ccctcactga tgcgcagatg cagcagtacg aagaagagca actggcgctc gccgcacgct
12901 ggtcgcagcc gaatcgcgaa tccatcgccg ccctgtgccg cgcgcgaaaa attgcgcttg
12961 ccagccacga tgacgccacc cacgcccacg ttgctgaatc tcaccagctt ggcagcgtga
13021 tcgccgaatt tcccaccacg ttcgaagcgg cggaagcctc gcgcaagcat ggcatgaacg
13081 tgctgatggg cgcgccgaat attgtgcgcg gcggctcgca ctccggcaac gtggcggcca
13141 gtgaactggc gcagcttggc ctgctggata tcctctcttc cgactactac cccgccagcc
13201 tgctcgatgc ggcatttcgc gtcgccgatg acgagagcaa ccgctttacg ctgccgcagg
13261 cggtgaagct ggtgactaaa aatccagcgc aggcgcttaa tctccaggat cgcggggtga
13321 ttggcgaggg caaacgcgcc gacctggtgc tggcgcatcg caaggacaat catattcata
13381 tcgaccacgt ctggcgtcag ggtaaaaggg tgttctgatg atgggaaaac tgatttggtt
13441 aatggggccg tccggctccg ggaaagacag cctgctggcg gaactccgcc tgcgggaaca
13501 aactcagtta ctggtggcgc atcgctacat cacgcgcgat gccagcgccg gaagtgaaaa
13561 ccatatcgcc ctgagcgagc aggagttttt tacccgcgcg gggcaaaatc tgttggcctt
13621 aagctggcac gctaacggtc tgtattatgg cgtcggcgtc gagattgatc tctggctgca
13681 cgccggattc gacgtgctgg tcaacggctc acgcgcccat ctgccgcagg cgcgggcgcg
13741 ctatcaatcg gcgctgctgc ccgtctgttt acaggtttcg ccggagatcc tccgccagcg
13801 cctggaaaac cgtggccgtg aaaacgccag tgaaattaac gcccgcctgg cgcgcgccgc
13861 ccgctatact ccacaggatt gccatacgct caacaatgac ggcagcctgc gccagtcggt
13921 cgacacgctg ctgacgctga tccatcagaa ggagaaacac catgcctgct tgtgagcttc
13981 gcccggccac gcagtacgac accgacgcgg tttacgcgct gatttgtgag ctaaaacagg
14041 cggagtttga ccaccacgcg tttcgcgtgg gttttaacgc caatctgcgc gacccaaaca
14101 tgcgctacca tctggcgctg cttgatggcg aagttgtcgg catgatcggc ctgcatttgc
14161 agtttcatct gcatcatgtc aactggatcg gcgaaattca ggagttggtg gtaatgccgc
14221 aggcgcgcgg tctgaacgtc ggcagtaagt tactggcgtg ggcagaagaa gaagcccgcc
14281 aggccggggc cgaaatgacc gaactttcga ccaacgtgaa gcgccacgac gcgcaccgtt
14341 tctatctgcg cgaaggctac gagcagagcc acttccgctt caccaaggcg ctgtaacatg
14401 agcctgaccc tcacgctcac cggcaccggc ggcgcacagg gcgttccggc atggggctgc
14461 gagtgtgcgg cctgcgccag agcgcggcgc tcgccgcagt atcgccgcca accgtgcagc
14521 ggcgtagtga agtttaacga cgcaatcacc ctgatcgacg ccgggctgca cgatctcgcc
14581 gatcgctggt cgcccggatc gttccagcag tttttgctga cgcattatca tatggatcac
14641 gtccaggggc tgtttccgct gcgctggggc gttggcgatc cgatcccggt ttacggcccg
14701 ccggatgaac agggctgcga cgatctgttt aaacatccgg gcctgcttga tttcagccac
14761 acggtggaac cgtttgtggt gtttgatttg caggggttac aggtcacgcc cctgccgctc
14821 aaccactcaa aactgacctt cggttatctg ctggaaacgg cacacagccg ggtggcgtgg
14881 ctgtctgaca ccgcaggctt gccggaaaaa acgctgaaat ttttacgcaa taatcagccg
14941 caggtaatgg tgatggattg cagtcacccg ccgcgcgcgg atgcaccgcg taatcactgt
15001 gatttaaata ccgtgcttgc gctgaatcag gttatccgct cgccacgggt gattctgacc
15061 catatcagcc accagtttga tgcgtggctg atggaaaacg cactaccgtc agggtttgag
15121 gtggggtttg atgggatgga gattggggtg gcgtgatgag agggaatgtg cgcgctggcc
15181 ccctcaccct aaccctctcc ccagaggggc gaggggaccg attgtgctcg atattgaata
15241 ttgcgctcgt tttctccctc tccccattgg ggtgaggggc gatgcctgct ccatacccaa
15301 cctcatcgcc catactcatc ttccattctc cgctcttcat cctccagttg ccgacgctcc
15361 tgatcaagct ggcgctggcg atcgtccagc tgcctgcggc gatcttcaaa ctggcggcgg
15421 cggtcgtcat attgtctgcg ccgatcgtcg ctcacttcac gctgccagcc gtggtcgcgc
15481 gaatcttcat agttgaagcg gcgcacgaaa aacgcgaaag cgtttcacga taaatgcgaa
15541 aactttagct ttcgcgcttc aaatgaaaca gatgtattaa ttactgcttt ttattcatta
15601 catggggatc c
//GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS ECOPHNAQ 15611 bp ds-DNA BCT 02-APR-1990
DEFINITION E.coli psiD locus containing alkylphosphonate uptake (phn) genes A
through Q, complete cds.
ACCESSION J05260
KEYWORDS C-P lyase; HisM-like integral membrane protein;
HisP-like nucleotide binding protein; inverted repeats.
SOURCE E.coli (strain B) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 15611)
AUTHORS Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T.
TITLE Molecular biology of carbon-phosphorus bond cleavage: Cloning and
sequencing of the phn (psiD) genes involved in alkylphosphonate
uptake and C-P lyase activity in Escherichia coli B
JOURNAL J. Biol. Chem. 265, 4461-4471 (1990)
STANDARD full staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by C.-M.Chen,
01-MAR-1990, for release after publication.
FEATURES from to/span description
pept 2874 3209 phnA protein
pept 3767 4210 phnB protein
pept 4343 5131 HisP-like nucleotide binding protein (phnC)
pept 5156 6172 phnD protein
pept 6227 7057 phnE protein
pept 7078 7803 phnF protein
pept 7804 8256 phnG protein
pept 8253 8837 phnH protein
pept 8837 9901 phnI protein
pept 9894 10739 phnJ protein
pept 10736 11494 HisP-like nucleotide binding protein (phnK)
pept 11605 12285 HisP-like nucleotide binding protein (phnL)
pept 12282 13418 HisM-like integral membrane protein (phnM)
pept 13418 13975 HisP-like nucleotide binding protein (phnN)
pept 13962 14396 phnO protein
pept 14398 15156 phnP protein
pept 15131 15496 phnQ protein
pept 2473 245 (c) ORF742
pept 7470 7030 (c) ORF146
pept 9298 8489 (c) ORF269
pept 12264 11920 (c) ORF114
pept 14760 14380 (c) ORF126
rpt 3224 3257 inverted repeat a
rpt 3271 3306 inverted repeat b
rpt 3324 3357 inverted repeat c
rpt 3371 3406 inverted repeat d
rpt 3424 3457 inverted repeat e
rpt 3471 3506 inverted repeat f
rpt 3524 3557 inverted repeat g
rpt 3571 3606 inverted repeat h
rpt 3624 3657 inverted repeat i
rpt 3671 3706 inverted repeat j
rpt 6176 6211 inverted repeat k
rpt 6229 6262 inverted repeat l
rpt 11501 11534 inverted repeat m
rpt 11548 11583 inverted repeat n
signal 2622 2627 -35 region
signal 2645 2650 -10 region
signal 2738 2743 -35 region
signal 2771 2776 -10 region
signal 4074 4079 -35 region
signal 4096 4101 -10 region
signal 2084 2079 (c) -10 region
signal 2108 2103 (c) -35 region
signal 2720 2715 (c) -10 region
signal 2742 2737 (c) -35 region
signal 12056 12051 (c) -10 region
signal 12080 12075 (c) -35 region
signal 2737 2754 phosphate (pho) box
BASE COUNT 3466 a 4625 c 4306 g 3214 t
ORIGIN
1 ggatccagca tcgacgccag tttttccacc attgtcagtc gcaggctaag cggcgcattt
61 aacatgccgc cgttcgtcca tgtctgaagc tgcacacgcg aaagaagttc ctgcatcagt
121 cgttcacgaa actgctgctg atgggcttgt ggaaggcggg catcatcgcc ctgcgccaga
181 tccactaaaa agcggggata aaccgactcc agcacgcgac cggggccgtc cagtaacgtc
241 ttggtcaata tcgttctgcc gtgaaaagtg tttgaatatc atcgcgtaac agctgggcgt
301 cggtgtaaat ccagccgtga gtcatcacag tctgctgcaa ttgctgctgc atcagcctga
361 ccaccgattc attttgttga cgcagagcca ggctttcgcg taaacgcgtc tgtaattccg
421 tcaaacatga agcgaactca gcgaaaaaag tattcatgcc tgccgtaaca gattcatcga
481 cctgctctgc cagaacttta gccatttgtt ggcaataaag atcgacttct gcgcttaatg
541 ctcgttgcaa cacactgtaa tcaaccgttt ctgtcgggga tttctcattt ccccgtcccc
601 agtcgggctg attcaaccag cgcgaaaaag tctcacgcac aacgcctaaa cgcgtgctct
661 gctcgtccgt tgcatcctgg cgcgaaatga ctgcactgaa cagctggcga gtgttgaagt
721 ggggaactac gccgtgaaaa acaggaaaat gaaacccagg acgaaaccct gactcgctca
781 attccatttt gacttgttgc tcaatggggc gaataacatc ggttaacact cggcaaaggg
841 tggattccag ctcggcaaaa cgcagcgtaa agtcgcgact gatggtgttc tgcgccgtct
901 gtaacagtgt ctcacagcgg gtacgcatct cgcttaacgg ctccgaatca tcctgaaaca
961 aggcggctaa ctgcgcattc agcgcatctt gttgttgacg cagaaagtgg ttggcggagg
1021 tcagggccag ctcgatttca tgtttaatct cgccgctcac ctgcgcctga ttgagttgca
1081 atagctgcaa actttcttcg acctgatgga tattttgccg caattgttca caagcgacgt
1141 ttaacccgtg cgcacgaaaa tccaggtatt cccgcgcctg ctgcgcgtaa ttcaacagtt
1201 tatgcgcagc agatcgcaaa gcatacaacg aggcgttagc gtaagcggca tgaagcaacg
1261 cctgaattgg ctgggcgaac agcgaatctt cccacaactg atcggcagca tgacgaatat
1321 gttcgaggtc cgccagatcg gcatgacgcc agcgcctgcc gagcgcggca tgggcaaaat
1381 cttccaccca gcgttgttgc tctggcgctg gtaacttacc gttgttggct aactcatggc
1441 gcgcccgatt cgccaggtag ccccacatcg acgacaccgg aaatatctgc tgtggcgtaa
1501 tacagccttt catcagcgtc ccggaaatca gtgcccgcac ctggtcggcg tcgtcactgt
1561 tacgatcctg ttgatcgaac ttattgacca gcacatacag cggcaccgat tgccccaccg
1621 ccaaaatcgc ctcacggacc tcttcatcgg agatcgattt cagttgcgta taatccagca
1681 ccgccagtac cgccgaggcg cgtgccagct gctggttaag cattttttgc agatgcggtt
1741 gcccggcttc atttggcccg ggggtatcca gtaacgtcaa ctgaccggga taactctcca
1801 gccccgccag atggacaaac tccacttcaa tcacgggaat atgctcaatg gcggcgtaag
1861 cagaaaaagg aaaatcgacg tccagcgcct tcgccagtcg cactaaatca ttcaaacttt
1921 tcagacaatg aaaaataggc tgggcaccca gataatattt ttcgaaagcg acgccatttt
1981 cgatccgctg cataagcgca cgcatatctt tatctatttc cagcacatcg gtcagatgct
2041 taatatcgca atcacgcagg cgctgttgta attgttgaat taaacaatcg attggcgcga
2101 catgtgaaaa atgcagtacc ggttcctttt gcccgggcgt atggcgaata agcgtcggca
2161 gcgcagtcat tgggcgatta cgattaggca gaacctccgt accaacaatg gcattaatgg
2221 tggttgattt ccctgctttc atggtaccga caattgcaag caccatttcc agtcgggaaa
2281 ttttacgcaa ctcattattc agcatcgcgt gacgttcggc gatattaggc tgactccagg
2341 gtaaagccag ttgtggcgcg tcgtctccgg gtacagagag aggcattttt tccagtaact
2401 gcaactgttg gcgagaaagc tgtaacaggc gttcagcctc ctgacttaac tcatacaggg
2461 tctgtgtgta catagaaaat tcttccttaa agcaaatttt gttattttat ttagccagat
2521 tgtttttgag ttctgttttc ggcttttata attactgcaa gaaataattt tatatttagt
2581 gtgttgtttt ttatcagaat aaataacgtc ttctgatacg tttaaaacgt cagaaagata
2641 aaaatatcat gtgaattaaa aaaagaacaa gtagagcatt aacattatct taaataataa
2701 atagaggcaa aaagattatt ttctttttgc gtttcctttc aaatgaaaac gatcgtcgtc
2761 taaaatcagc agtacccccg acaaactcag ggattttgtg tataattgcg gcctttttcg
2821 gcaatctgcc gttttttggc gcttttgccc tgctgacttt tgaggaaatc cacatgtcat
2881 taccacactg cccaaaatgc aactccgaat acacttacga agataacggc atgtacatct
2941 gcccggaatg tgcctacgaa tggaacgacg cagaacctgc acaggaaagc gacgagctga
3001 tcgttaaaga tgctaacggc aatctgctgg ctgacggcga cagcgttacc atcattaaag
3061 atctgaaggt gaaaggtagc tcttcgatgc tgaaaattgg caccaaagtg aaaaacatcc
3121 gcctggttga aggcgaccat aacatcgatt gcaaaatcga cggttttggt ccgatgaaac
3181 tgaaatctga gtttgtgaaa aagaactgat tgtattgtga tcggtaagcc ggataaggcg
3241 ctcgcgccgc atccggcaac ggtgccagat gcctgatgcg acgcttgcgc gtcttatcag
3301 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac
3361 ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc
3421 tccgtaggcc ggataaggcg tttacgccgc atccggcaac agtgccaact gcctgatgcg
3481 acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg
3541 tttacgccgc atccggcaat ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag
3601 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac
3661 agtgccgact gcctgatgcg acgctcgcgc gtcttatcag gccgcctctc atctgtataa
3721 atttcgaact acacttaact ggcttctctt aactgaggtc accatcatgc cgttaagtcc
3781 ctacctctct tttgccggta actgttccga cgcgattgcc tattatcaac gtacgttggg
3841 cgcggaactg ctctataaaa tcagcttcgg cgaaatgcca aaatcagcgc aggacagcgc
3901 cgagaactgc ccttccggaa tgcaatttcc cgataccgcc atcgctcatg ccaacgtgcg
3961 cattgccgga agcgacatca tgatgagcga tgccatgccg tcaggaaaag ccagctactc
4021 cggctttacg ctggtgctcg attcgcaaca ggtcgaagaa ggaaaacgct ggtttgacaa
4081 tcttgccgct aacggaaaaa tcgaaatggc ctggcaggaa actttctggg cgcatggctt
4141 tggcaaagtc accgataaat ttggcgtacc gtggatgatt aatgtcgtca aacaacaacc
4201 aacgcaataa cccgccggga ggcccgccct cccgcactgt catcgaattc ccgttaactc
4261 ttcatctgtt agtcactttt aattaaccaa atcgtcacaa taatccgcca cgatggagcc
4321 acttttttag ggaggctgca tcatgcaaac gattatccgt gtcgagaagc tcgccaaaac
4381 cttcaatcag catcaggcgc tgcatgcggt tgatctgaac attcatcacg gtgaaatggt
4441 ggctctgctt gggccgtcgg gttccggcaa atccaccctt ttacgtcact taagcggttt
4501 gattaccggc gataaatccg ccggcagcca tatcgagctg ctgggccgca cagtccagcg
4561 cgaaggccgt ctggcgcgcg atatccgcaa aagccgcgcc aacaccggct acatcttcca
4621 acaattcaac ctggtgaacc gcctgagcgt actggagaac gtgctgattg gcgcgctcgg
4681 cagcacgccg ttctggcgca cctgttttag ctggtttacc cgcgagcaga aacaacgcgc
4741 gttacaggcg ctgacccgcg ttggcatggt gcattttgcc catcaacgcg tttccaccct
4801 ctccggcgga cagcagcagc gtgtggcgat tgcccgcgcg ctgatgcagc aggcgaaggt
4861 gattctggcc gatgaaccca tcgcctcgct ggacccggaa tccgcccgca tcgtgatgga
4921 caccctgcgc gacatcaatc agaacgacgg catcaccgtg gtcgtcacgc tgcatcaggt
4981 ggattacgcc ctgcgctact gcgaacgcat cgtcgccctg cgccaggggc acgttttcta
5041 cgacggcagc agccaacagt ttgataacga acgttttgac catctctacc gcagcattaa
5101 tcgcatcgaa gagaacgcga aagctgcctg acatccccat cattgaggaa aacgaatgaa
5161 cgctaagata attgcctcgc tggccttcac cagcatgttc agcctcagca ccctgttaag
5221 cccggcacac gccgaagagc aggaaaaggc gctgaatttc ggcattattt caacggaatc
5281 acagcaaaac ctgaaaccgc aatggacgcc attcttacag gatatggaga agaagctggg
5341 cgtgaaggtg aacgccttct ttgccccaga ctacgcaggc attatccagg gaatgcgctt
5401 caataaagtg gatatcgcct ggtacggcaa cctgtcggca atggaagcgg tggatcgcgc
5461 caacggccag gtcttcgccc agacggtcgc ggcggatgga tcgccaggtt actggagcgt
5521 gttgatcgtc aacaaagata gtccgatcaa caacctgaac gatctgctgg cgaagcggaa
5581 agatctcacc ttcggcaatg gcgatcctaa ctccacctct ggcttcctcg tccccggtta
5641 ctacgtcttc gccaaaaaca atatctccgc cagcgacttc aagcgcaccg tcaacgccgg
5701 gcatgaaacc aacgcgctgg ccgtcgccaa caagcaggtg gatgtggcga ccaacaacac
5761 cgaaaacctc gacaagctga aaacctccgc gccggagaag ctgaaagaac tgaaagtgat
5821 ctggaaatcg ccgctgatcc caggcgatcc gatcgtctgg cgtaaaaatc tttccgaaac
5881 caccaaagac aagatctacg acttctttat gaattacggc aaaacgccgg aagagaaagc
5941 ggtgctggaa cgcctgggct gggcgccgtt ccgcgcctcc agcgacctgc aactggtgcc
6001 gattcgccag ctcgcactgt ttaaagagat gcagggcgtg aaaagcaata aaggactgaa
6061 tgagcaggac aagctggcaa aaaccaccgc gattcaggcg caactggatg acctggaccg
6121 cctgaacaac gcgctaagcg cgatgagttc ggtgagtaaa gcggtgcagt aaatcgtagg
6181 tcggataaga cgccccggcg tcgcatccga caatgtgcag gcgttgatgc cggatgcggt
6241 gcaagcacct tatccggcct acagaccgga gccaaacatg caaaccatca ccatcgcccc
6301 acccaagcgc agctggttct cgcttctgag ctgggccgtt gttctcgccg tgctggtcgt
6361 ctcgtggcag ggcgcggaaa tggccccgct cacgctgatt aaagacggcg gcaacatggc
6421 aaccttcgct gccgacttct tcccgcccga tttcagccag tggcaggatt acctcaccga
6481 aatggccgtc acgctgcaaa tcgccgtctg gggcaccgcg ctggcggtgg ttctctccat
6541 cccctttggc ctgatgagcg ccgaaaacct ggtgccgtgg tgggtttacc agcccgttcg
6601 ccgcctgatg gacgcctgcc gcgccattaa cgaaatggtc ttcgccatgc tgttcgtggt
6661 cgccgtcggt ctcggaccgt tcgctggcgt gctggcgcta tttatccaca ccaccggcgt
6721 gctctccaag ctgctttccg aagcggtaga agcaattgaa cctggcccgg tggaaggcat
6781 tcgcgccacc ggtgccaaca agctcgaaga gatcctctac ggcgtgctgc cgcaggtgat
6841 gccgctgctg atctcctact ccctctatcg cttcgaatcc aacgtccgct cggcgaccgt
6901 cgtcggcatg gtcggcgcgg gcgggatcgg cgtcaccctg tgggaagcga ttcgcggttt
6961 ccagttccaa caaacctgcg ccctgatggt gcttatcatc gtcacggtca gcctgctgga
7021 tttcctctct caacggttgc gtaagcactt tatctgataa gcgaggcatt gatatctatg
7081 cacttgtcta cacatccgac cagctaccca acacgctatc aagagatagc cgcaaaactt
7141 gagcaggagc ttcgtcaaca ctaccgctgc ggcgactatc ttcccgccga gcagcaactg
7201 gcagcgcgct ttgaggtgaa tcgccacacc ctgcgccgcg ccatcgacca actggtggaa
7261 aaaggctggg tacagcgccg tcagggcgtc ggcgtgctgg tgctgatgcg cccgttcgat
7321 tacccgctca acgcccaggc gcgttttagc cagaatctgc tggatcaggg cagccatccc
7381 accagcgaaa aactgctttc ggtattgcgc cccgcgtccg gccacgtcgc tgacgcactg
7441 gggattaccg agggggagaa cgtcatccac ctgcgcaccc tgcgtcgggt caacggcgtc
7501 gcgctctgtt taatcgacca ctacttcgcg gacctcaccc tctggccgac gctgcaacgc
7561 ttcgacagcg gctcgctgca cgattttctg cgcgagcaaa ccggaattgc gctgcgccgc
7621 agccagacgc ggatcagcgc ccgccgcgcc caggccaaag agtgccagcg tcttgaaatc
7681 ccgaatatgt cgccgctgct gtgcgtgcgc acccttaacc accgtgacgg tgaaagcagc
7741 ccggcggagt actccgtcag cctgacgcgc gccgacatga ttgaattcac tatggagcac
7801 tgaatgcacg cagataccgc gacccgccag cactggatgt ccgtgctggc gcacagccaa
7861 ccggctgaac tggcagcacg cctgaacgcg ctaaacatca ccgccgacta tgaggtgatc
7921 cgcgccgctg aaactggcct ggtacagatt caggcgcgga tgggcggcac cggcgaacgt
7981 ttttttgccg gcgacgccac gctgacccgc gccgccgtgc gcctgactga cggcacgctc
8041 ggctacagct gggtgctggg gcgtgataaa cagcacgccg aacgctgcgc gctgattgac
8101 gcgctgatgc agcaatctcg ccactttcaa aacttatcag aaacccttat tgccccgctg
8161 gacgctgacc gtatggcacg cattgccgca cgccaggccg aagtgaacgc cagccgggtc
8221 gacttcttta cgatggttcg cggagacaac gcatgaccct ggaaaccgct tttatgcttc
8281 ccgtgcagga tgcccagcac agttttcgtc gcctgttaaa ggccatgagc gagccgggcg
8341 tgattgtcgc cctgcatcag ctcaaacgcg gctggcaacc gctgaatatc gccaccacca
8401 gcgtgctgct gacgctggcc gataacgaca cgccggtgtg gctttctacc ccattaaata
8461 acgatatcgt caaccagagc ctgcgttttc ataccaacgc gccgctggtc agccagccgg
8521 aacaggcgac cttcgcggtg acggatgagg cgatttccag cgaacagctc aacgcccttt
8581 ccaccggcac cgccgttgcg ccggaagcgg gcgcgacgct gattttacag gtcgccagcc
8641 tgagcggcgg gcgcatgttg cgtctcaccg gcgcgggtat tgccgaagaa cgaatgatcg
8701 ctccgcagct gccggagtgc attctgcacg aactcaccga gcgcccgcac ccgttcccgc
8761 tcggcatcga cctgatcctg acctgcggcg aacgcctgct ggctattccg cgaaccacgc
8821 atgtggaggt gtgctgatgt acgttgccgt aaaagggggc gaaaaggcga tcgacgccgc
8881 ccacgccctg caagagagcc gacgccgggg cgataccgat ttgcctgaac tgagcgtcgc
8941 ccagattgaa cagcagctta acctcgcggt agatcgcgtg atgaccgaag gcggcattgc
9001 cgaccgcgaa ctggcggcgc tggcgctgaa acaggccagc ggcgataacg ttgaagcgat
9061 tttcctgctg cgcgcctacc gcaccacgtt ggcgaagctg gcggtaagcg agccgctcga
9121 caccaccggg atgcgtctcg aacgccgtat ctccgccgtt tataaagaca ttcccggcgg
9181 ccagctgctt ggcccaacct acgactacac ccatcgcctg ctcgatttta ccctgctggc
9241 aaacggcgaa gcgccgacgc tgaccaccgc cgacagcgaa caacagccgt cgccgcacgt
9301 tttcagcctg ctggcgcgtc aggggctggc gaagtttgaa gaggatagcg gcgcacagcc
9361 ggatgacatc acccgcacgc cgccggttta cccctgctca cgttcttccc gtttgcagca
9421 gttgatgcgc ggcgacgaag gctatttgct ggcgctggcc tactccaccc agcgtggtta
9481 cggacgcaat cacccgttcg cgggcgagat ccgcagtggt tacatcgacg tgtcgattgt
9541 gccggaagag ctgggatttg cggtaaacgt cggcgaacta ctgatgaccg agtgtgaaat
9601 ggtcaacggt tttatcgacc cgccggatga gccgccgcac ttcacgcgcg gctacgggct
9661 ggtattcggc atgagcgagc gcaaagcgat ggcaatggcg ctggtcgatc gtgcgttgca
9721 ggctccggaa tacggcgagc acgcgacagg cccggcgcag gatgaagagt ttgtgctggc
9781 acatgccgac aacgtcgaag ccgcaggctt tgtctcgcac ctcaaactcc cccactacgt
9841 cgatttccag gccgaactgg agctactcaa acgtctgcaa caggagaaga accatggcta
9901 atctgagcgg ctacaacttt gcctacctcg acgagcagac caaacgcatg atccgccgcg
9961 ccatcttaaa agcggtggcg atccccggtt atcaggtgcc gtttggcggg cgcgagatgc
10021 cgatgccata cggctgggga accggcggca tacagctcac cgccagcgtg attggcgaaa
10081 gcgacgtgct aaaggtgatt gaccagggtg cggatgacac caccaacgcc gtgtcgattc
10141 gcaacttctt taagcgcgtg accggggtaa acaccactga acgtacggac gatgcgacgc
10201 ttatccagac gcgtcaccgc atccccgaaa cgccgctgac cgaagatcag atcattatct
10261 tccaggtgcc aatcccggaa ccgctgcgct ttatcgagcc gcgcgaaacg gaaacccgca
10321 ccatgcacgc gctggaagag tacggcgtga tgcaggtgaa actgtatgaa gatatcgccc
10381 gcttcggtca tatcgccact acctacgcct atccggtgaa ggtgaacggg cgctacgtaa
10441 tggacccgtc gccgatcccg aaattcgata acccaaaaat ggacatgatg cccgccctgc
10501 aactgttcgg cgcggggcgc gagaagcgca tctatgcggt gccgccgttt acccgcgtgg
10561 aaagtctcga tttcgacgat cacccgttca ccgttcagca gtgggatgag ccatgcgcca
10621 tctgcggatc gacccacagc tatcttgatg aagtggtgct ggatgacgcc ggaaaccgca
10681 tgtttgtctg ctccgatacc gattattgcc gccaacagag cgaggcaaaa aaccaatgaa
10741 tcaaccgtta ctttcggtca ataacctgac ccacctttac gcgccgggca aaggctttag
10801 cgatgtctct tttgatttat ggccggggga agtgctgggc attgtcgggg aatccggctc
10861 cgggaagacc acgctgctga agtcgatctc cgcgcgcctg acgccgcagc agggggaaat
10921 tcactacgag aaccgttcgc tgtatgcaat gagcgaggcc gaccgccgtc gcctgctgcg
10981 taccgaatgg ggcgtggtgc atcagcatcc actcgacggc ctgcgccgcc aggtgtcggc
11041 aggcggcaat atcggcgagc ggctgatggc gaccggggca cgtcattacg gcgatattcg
11101 tgccaccgcg cagaagtggc tggaagaggt ggagattccc gccaaccgga tcgacgacct
11161 gccgaccacc ttttccggcg gtatgcagca gcgtttgcag attgcccgca acctggtgac
11221 gcatccgaag ctggtgttta tggatgaacc gaccggcggg ctggatgtgt cggtgcaggc
11281 ccgcctgctc gacctgctgc gcggcctggt ggtggagctg aacctcgcgg tggtgattgt
11341 cacccatgat ttaggcgtcg cccgcctgct ggcggaccgt ttgctggtga tgaagcaggg
11401 gcaagtggtg gagagtgggt taaccgaccg cgtgctcgac gacccgcatc atccgtatac
11461 acagctgctg gtgtcatcgg ttttgcagaa ttgagccggt gccggatgcg gcgtaaacgc
11521 cttatccggc ctacaaatgc gctccccgta ggtcggataa gacgcgtcag cgtcgcatcc
11581 gacacccgaa ccacgaggcg aaaaatgatt aacgtacaaa acgtcagtaa aaccttcatc
11641 ctgcaccagc aaaacggcgt gcgcctgccc gtcctcaatc gcgcctcgct caccgtcaac
11701 gcgggcgaat gcgtggtgct ccacggccat tccggcagcg gcaaatcaac tctgctacgc
11761 tcgctgtacg ccaactatct acccgacgaa ggtcaaatcc agatcaaaca cggtgacgag
11821 tgggtagacc tggtcaccgc gccagcgcgc aaagtggtgg aaatccgcaa aaccaccgtc
11881 ggctgggtga gccagtttct gcgcgtcatc ccgcgtatct cagcactgga agtggtgatg
11941 cagccgctgc tcgataccgg cgttccgcgt gaagcctgcg ccgctaaagc cgcgcgtctt
12001 ctcacccgcc tgaacgtgcc ggaacgcctg tggcacctgg caccatcgac attttccggt
12061 ggcgaacagc agcgcgtcaa catcgcccgc ggctttatcg tcgactaccc cattctgctg
12121 cttgacgaac ctaccgcctc gctggacgcc aaaaacagcg ccgcggtggt ggaactgatt
12181 cgcgaagcca aaacccgtgg cgcagccatc gtaggcatct tccatgacga agctgtacgt
12241 aatgacgtcg ccgaccgcct gcacccaatg ggagcctctt catgattatc aataacgtta
12301 agctggtgct ggaaaacgag gtggtaagcg gttcgctgga ggtgcagaac ggcgaaatcc
12361 gcgcctttgc cgaaagccag agccgcctgc cggaggcgat ggacggcgaa ggcggctggc
12421 tgctgccggg gctgattgag ctgcataccg ataatctgga taaattcttc accccgcgcc
12481 cgaaagttga ctggcctgcc cactcggcga tgagcagcca cgacgcgctg atggtggcga
12541 gcggcatcac caccgtactg gatgccgtgg caattggcga cgtgcgcgac ggcggcgatc
12601 ggctggagaa tctggagaag atgatcaacg ccatcgaaga gacgcagaaa cgcggcgtca
12661 accgcgccga gcaccgtctg catctgcgct gcgaactgcc gcatcacacc acgctgccgc
12721 tgtttgaaaa actggtgcag cgcgagccgg tgacgctggt gtcgctgatg gaccactcgc
12781 cgggccagcg ccagttcgcc aaccgcgaga agtatcgcga atattatcag ggcaaatact
12841 ccctcactga tgcgcagatg cagcagtacg aagaagagca actggcgctc gccgcacgct
12901 ggtcgcagcc gaatcgcgaa tccatcgccg ccctgtgccg cgcgcgaaaa attgcgcttg
12961 ccagccacga tgacgccacc cacgcccacg ttgctgaatc tcaccagctt ggcagcgtga
13021 tcgccgaatt tcccaccacg ttcgaagcgg cggaagcctc gcgcaagcat ggcatgaacg
13081 tgctgatggg cgcgccgaat attgtgcgcg gcggctcgca ctccggcaac gtggcggcca
13141 gtgaactggc gcagcttggc ctgctggata tcctctcttc cgactactac cccgccagcc
13201 tgctcgatgc ggcatttcgc gtcgccgatg acgagagcaa ccgctttacg ctgccgcagg
13261 cggtgaagct ggtgactaaa aatccagcgc aggcgcttaa tctccaggat cgcggggtga
13321 ttggcgaggg caaacgcgcc gacctggtgc tggcgcatcg caaggacaat catattcata
13381 tcgaccacgt ctggcgtcag ggtaaaaggg tgttctgatg atgggaaaac tgatttggtt
13441 aatggggccg tccggctccg ggaaagacag cctgctggcg gaactccgcc tgcgggaaca
13501 aactcagtta ctggtggcgc atcgctacat cacgcgcgat gccagcgccg gaagtgaaaa
13561 ccatatcgcc ctgagcgagc aggagttttt tacccgcgcg gggcaaaatc tgttggcctt
13621 aagctggcac gctaacggtc tgtattatgg cgtcggcgtc gagattgatc tctggctgca
13681 cgccggattc gacgtgctgg tcaacggctc acgcgcccat ctgccgcagg cgcgggcgcg
13741 ctatcaatcg gcgctgctgc ccgtctgttt acaggtttcg ccggagatcc tccgccagcg
13801 cctggaaaac cgtggccgtg aaaacgccag tgaaattaac gcccgcctgg cgcgcgccgc
13861 ccgctatact ccacaggatt gccatacgct caacaatgac ggcagcctgc gccagtcggt
13921 cgacacgctg ctgacgctga tccatcagaa ggagaaacac catgcctgct tgtgagcttc
13981 gcccggccac gcagtacgac accgacgcgg tttacgcgct gatttgtgag ctaaaacagg
14041 cggagtttga ccaccacgcg tttcgcgtgg gttttaacgc caatctgcgc gacccaaaca
14101 tgcgctacca tctggcgctg cttgatggcg aagttgtcgg catgatcggc ctgcatttgc
14161 agtttcatct gcatcatgtc aactggatcg gcgaaattca ggagttggtg gtaatgccgc
14221 aggcgcgcgg tctgaacgtc ggcagtaagt tactggcgtg ggcagaagaa gaagcccgcc
14281 aggccggggc cgaaatgacc gaactttcga ccaacgtgaa gcgccacgac gcgcaccgtt
14341 tctatctgcg cgaaggctac gagcagagcc acttccgctt caccaaggcg ctgtaacatg
14401 agcctgaccc tcacgctcac cggcaccggc ggcgcacagg gcgttccggc atggggctgc
14461 gagtgtgcgg cctgcgccag agcgcggcgc tcgccgcagt atcgccgcca accgtgcagc
14521 ggcgtagtga agtttaacga cgcaatcacc ctgatcgacg ccgggctgca cgatctcgcc
14581 gatcgctggt cgcccggatc gttccagcag tttttgctga cgcattatca tatggatcac
14641 gtccaggggc tgtttccgct gcgctggggc gttggcgatc cgatcccggt ttacggcccg
14701 ccggatgaac agggctgcga cgatctgttt aaacatccgg gcctgcttga tttcagccac
14761 acggtggaac cgtttgtggt gtttgatttg caggggttac aggtcacgcc cctgccgctc
14821 aaccactcaa aactgacctt cggttatctg ctggaaacgg cacacagccg ggtggcgtgg
14881 ctgtctgaca ccgcaggctt gccggaaaaa acgctgaaat ttttacgcaa taatcagccg
14941 caggtaatgg tgatggattg cagtcacccg ccgcgcgcgg atgcaccgcg taatcactgt
15001 gatttaaata ccgtgcttgc gctgaatcag gttatccgct cgccacgggt gattctgacc
15061 catatcagcc accagtttga tgcgtggctg atggaaaacg cactaccgtc agggtttgag
15121 gtggggtttg atgggatgga gattggggtg gcgtgatgag agggaatgtg cgcgctggcc
15181 ccctcaccct aaccctctcc ccagaggggc gaggggaccg attgtgctcg atattgaata
15241 ttgcgctcgt tttctccctc tccccattgg ggtgaggggc gatgcctgct ccatacccaa
15301 cctcatcgcc catactcatc ttccattctc cgctcttcat cctccagttg ccgacgctcc
15361 tgatcaagct ggcgctggcg atcgtccagc tgcctgcggc gatcttcaaa ctggcggcgg
15421 cggtcgtcat attgtctgcg ccgatcgtcg ctcacttcac gctgccagcc gtggtcgcgc
15481 gaatcttcat agttgaagcg gcgcacgaaa aacgcgaaag cgtttcacga taaatgcgaa
15541 aactttagct ttcgcgcttc aaatgaaaca gatgtattaa ttactgcttt ttattcatta
15601 catggggatc c
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMJNLTRA 765 bp ss-mRNA PRI 22-FEB-1990
DEFINITION Human DNA/endogenous retroviral long terminal repeat (LTR) junction
mRNA, clone lambda-LTR8.
ACCESSION M32219
KEYWORDS Alu repetitive sequence; long terminal repeat.
SOURCE Human, cDNA to mRNA, clone lambda-LTR8.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 765)
AUTHORS Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and
Matsubara,K.
TITLE Transcription of human endogenous retroviral long terminal repeat
(LTR) sequence in a lung cancer cell line
JOURNAL Biochem. Biophys. Res. Commun. 166, 1-10 (1990)
STANDARD simple staff_review
FEATURES from to/span description
recomb 223 224 human DNA end; endogenous retrovirus DNA start
rpt 224 342 Alu repeat
LTR 343 765 long terminal repeat
BASE COUNT 252 a 158 c 148 g 207 t
ORIGIN
1 ggaaccccaa ccttacatgt aatacaaact taactcaaaa tggatcatat atctaaatgt
61 aaaatggaaa gctataaaac tgaaaacaga ctatctttac aacctaggcg taggtatagt
121 ttttagacat tacaccaaaa gcacatgccg taaaagaaaa aatagataaa ttggtggatt
181 tcattaaaat taaaaaactt tttctctctg aaaaatcctg ttaagctggg cgctgtggtt
241 catgcctgta atcccagcac tttgggaggc tgagttggga agaaattaat agcttgaggc
301 caggagttca agatcatcct gggcagcaaa gtcatacact cttgagggaa gagagagacc
361 ttctcatatt gttttatatt gttttatact cagtacctgt tttaagaaaa aaacaaggaa
421 gtgaaatcaa agacaggcag cccggcacca ggcctgaaac cagccctggg cctgcctggc
481 ctaaacctag tagttaaaaa tcaacttacg acttagaacc tgatgttatc cgtagattcc
541 aagcattgta taaaaaaatt gtgaaactcc ctgttgtgtt ctgtaccagt gcatgaaacc
601 cctgtcacat atcccctaga ttgctcaatc aatcacgacc ctttcatgtg aaatctttag
661 tgttgtgagc ccttaaaagg gacagaaatt gtgcacttga ggagctcaga ttttaaggct
721 gtagcttgcc gatgctccca gctgaataaa gcccttcctt ctaca
//
LOCUS HUMJNLTRB 551 bp ss-mRNA PRI 22-FEB-1990
DEFINITION Human DNA/endogenous retroviral long terminal repeat (LTR) junction
mRNA, clone lambda-LTR22.
ACCESSION M32220
KEYWORDS long terminal repeat.
SOURCE Human, cDNA to mRNA, clone lambda-LTR22.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 551)
AUTHORS Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and
Matsubara,K.
TITLE Transcription of human endogenous retroviral long terminal repeat
(LTR) sequence in a lung cancer cell line
JOURNAL Biochem. Biophys. Res. Commun. 166, 1-10 (1990)
STANDARD simple staff_review
FEATURES from to/span description
recomb 120 121 human DNA end; endogenous retrovirus DNA start
LTR 121 551 long terminal repeat
BASE COUNT 146 a 146 c 129 g 130 t
ORIGIN
1 ggcttgagct gcagtcacag gccttggctg gaccagggat ggcccccagc tcccaggagg
61 gcccactgac cctgcagctc cagccttctc catacttcaa caaagaatga gttgtggcaa
121 tgagggaaga gagaccctct catagtgttt tatactcagt acctgtttta agaaaaaaca
181 acaaggaagt aaaaccaaag acaggcaggc agcctggcgc taggcccgaa accaggcctg
241 cgcctgcctg gcctaaaccc agtagttgaa aatcaattca taacttagaa accgatgtta
301 ttcatagatt ccagacattg tatagaagaa catttgtgaa actccctgcc gtgttctgtt
361 tctctctgac cgccggtgca tgcagcgcct gtcacgtacc gcctgcttgc tcaaatcaat
421 gacgaccctt tcatgtgaaa tcttcgtgtt gtgagccctt aaaagggaca gaaattgtgc
481 acttggggag ctcggatttt aaggcagtag cttgccgatg ctcccagctg aataaagccc
541 ttccttctac a
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMJNLTRA 765 bp ss-mRNA PRI 22-FEB-1990
DEFINITION Human DNA/endogenous retroviral long terminal repeat (LTR) junction
mRNA, clone lambda-LTR8.
ACCESSION M32219
KEYWORDS Alu repetitive sequence; long terminal repeat.
SOURCE Human lung cancer cell line, cDNA to mRNA, clone lambda-LTR8.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 765)
AUTHORS Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and
Matsubara,K.
TITLE Transcription of human endogenous retroviral long terminal repeat
(LTR) sequence in a lung cancer cell line
JOURNAL Biochem. Biophys. Res. Commun. 166, 1-10 (1990)
STANDARD simple staff_review
FEATURES from to/span description
recomb 223 224 human DNA end; endogenous retrovirus DNA start
rpt 224 342 Alu repeat
LTR 343 765 long terminal repeat
BASE COUNT 252 a 158 c 148 g 207 t
ORIGIN
1 ggaaccccaa ccttacatgt aatacaaact taactcaaaa tggatcatat atctaaatgt
61 aaaatggaaa gctataaaac tgaaaacaga ctatctttac aacctaggcg taggtatagt
121 ttttagacat tacaccaaaa gcacatgccg taaaagaaaa aatagataaa ttggtggatt
181 tcattaaaat taaaaaactt tttctctctg aaaaatcctg ttaagctggg cgctgtggtt
241 catgcctgta atcccagcac tttgggaggc tgagttggga agaaattaat agcttgaggc
301 caggagttca agatcatcct gggcagcaaa gtcatacact cttgagggaa gagagagacc
361 ttctcatatt gttttatatt gttttatact cagtacctgt tttaagaaaa aaacaaggaa
421 gtgaaatcaa agacaggcag cccggcacca ggcctgaaac cagccctggg cctgcctggc
481 ctaaacctag tagttaaaaa tcaacttacg acttagaacc tgatgttatc cgtagattcc
541 aagcattgta taaaaaaatt gtgaaactcc ctgttgtgtt ctgtaccagt gcatgaaacc
601 cctgtcacat atcccctaga ttgctcaatc aatcacgacc ctttcatgtg aaatctttag
661 tgttgtgagc ccttaaaagg gacagaaatt gtgcacttga ggagctcaga ttttaaggct
721 gtagcttgcc gatgctccca gctgaataaa gcccttcctt ctaca
//
LOCUS HUMJNLTRB 551 bp ss-mRNA PRI 22-FEB-1990
DEFINITION Human DNA/endogenous retroviral long terminal repeat (LTR) junction
mRNA, clone lambda-LTR22.
ACCESSION M32220
KEYWORDS long terminal repeat.
SOURCE Human lung cancer cell line, cDNA to mRNA, clone lambda-LTR22.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 551)
AUTHORS Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and
Matsubara,K.
TITLE Transcription of human endogenous retroviral long terminal repeat
(LTR) sequence in a lung cancer cell line
JOURNAL Biochem. Biophys. Res. Commun. 166, 1-10 (1990)
STANDARD simple staff_review
FEATURES from to/span description
recomb 120 121 human DNA end; endogenous retrovirus DNA start
LTR 121 551 long terminal repeat
BASE COUNT 146 a 146 c 129 g 130 t
ORIGIN
1 ggcttgagct gcagtcacag gccttggctg gaccagggat ggcccccagc tcccaggagg
61 gcccactgac cctgcagctc cagccttctc catacttcaa caaagaatga gttgtggcaa
121 tgagggaaga gagaccctct catagtgttt tatactcagt acctgtttta agaaaaaaca
181 acaaggaagt aaaaccaaag acaggcaggc agcctggcgc taggcccgaa accaggcctg
241 cgcctgcctg gcctaaaccc agtagttgaa aatcaattca taacttagaa accgatgtta
301 ttcatagatt ccagacattg tatagaagaa catttgtgaa actccctgcc gtgttctgtt
361 tctctctgac cgccggtgca tgcagcgcct gtcacgtacc gcctgcttgc tcaaatcaat
421 gacgaccctt tcatgtgaaa tcttcgtgtt gtgagccctt aaaagggaca gaaattgtgc
481 acttggggag ctcggatttt aaggcagtag cttgccgatg ctcccagctg aataaagccc
541 ttccttctac a
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS SHPPACAP 1763 bp ss-mRNA MAM 23-FEB-1990
DEFINITION Ovine pituitary adenylate cyclase activating polypeptide (PACAP38)
mRNA, complete cds.
ACCESSION M32216
KEYWORDS pituitary adenylate cyclase activating polypeptide.
SOURCE Ovine hypothalamus, cDNA to mRNA, clone pOH38P7.
ORGANISM Ovis sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1763)
AUTHORS Kimura,C., Ohkubo,S., Ogi,K., Hosoya,M., Itoh,Y., Onda,H.,
Miyata,A., Jiang,L., Dahl,R.R., Stibbs,H.H., Arimura,A. and
Fujino,M.
TITLE A novel peptide which stimulates adenylate cyclase: Molecular
cloning and characterization of the ovine and human cDNAs
JOURNAL Biochem. Biophys. Res. Commun. 166, 81-89 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 577 1107 pituitary adenylate cyclase activating
polypeptide precursor
sigp 577 648 pituitary adenylate cyclase activating signal
peptide
matp 970 1083 pituitary adenylate cyclase activating peptide
BASE COUNT 411 a 490 c 418 g 444 t
ORIGIN
1 ctgctaactg cccagataaa taggagcaga gggctggtca cctctgtaat aaccaccggc
61 agcagtagaa gaaaccgcag cttcagaagc agccagagag acttctgagc agcgaaggcg
121 ctgcctgctc gagctgcctg gccgggcggc tgccccagac gccgacttcg ccgaggccct
181 ctctctctct ctctctctct ctctctctct ctctctctct ctctgcttct ttccttatca
241 ctcctttctt ctcagtggac ttcaggccac tttgtctccc acccccactc agctcgtcgc
301 ctcctccgtc ttccttctcc atctctcctc tcgcccccct tctctcagtg tcacgctccg
361 tcctagttcc gagcgtcgtc aaacttttga acagaataac aggactcagc aaacaagtcc
421 tccagctcct cccgcggctc cggctcgttc ctgcggctcc tgctcagaca ctaacgccag
481 acggcgatgc ctcttgggtt gtgactacag cgcacaaact tggagaagct ctttgcccgc
541 cgtcctactt ggcagcaaat cctctcctgg cagcgaatga ccatgtgtag cggagcgagg
601 ctggccctgc tcgtttacgg gatactgatg cacagcagcg tctacggctc acctgccgcc
661 tccggactcc ggttcccggg gatcaggccg gagaacgagg cgtacgacga ggacggaaac
721 ccgcagcagg acttctacga ctcggagccg ccaggcgtgg ggagccccgc ctccgcgctg
781 cgcgatgcct acgcgctcta ctacccggcg gaggaaagag atgtcgccca cgggatcctt
841 gataaggcct accgcaaagt gctggaccag ctgtccgcca ggagatacct gcagacgctc
901 atggccaagg gcttgggtgg gaccccgggc ggcggcgcgg acgacgactc ggagccgctc
961 tccaagcgcc actcggacgg catcttcact gacagctaca gccgctaccg gaagcaaatg
1021 gctgttaaga aatacttggc ggctgtccta gggaaaaggt ataaacaaag ggttaagaac
1081 aaaggacggc gaataccgta cttgtagcga cgagttacca gctatcctgt gtatacagcc
1141 ctgacacaat gagaagtcgt ttttcccaac tgactgaact gtcatcgctg ctgtgttctg
1201 tcccacatgt atttatgtat gaagtcaagc cattaaatga atattttgat aataatattg
1261 tttttctttt tacgaagcac tggagaatgc acagatatac tttgtggacc aattattgat
1321 attgacatat atattacgaa tatataaaga gtatatatat atatatataa gtataataga
1381 gagccgttca tacagtgtgc acaaggactg aagattcgcc tgagctgttt gtttttatat
1441 aaaataaata gaaaaataga caatcattgt tttgaatatt actcctattt ttgtaaactg
1501 gaattaaaag gatagtattt ttatccacaa taggcctgaa gatattaatc ctgaccattt
1561 gctactgtac ataaacagtg atgccctgct ccagggagac tttgaggtaa tgatttggga
1621 ggattgctga aggtctctct ttcccaggga gtctctgggg caggctgctt caatcccagc
1681 tgaactcgac tgaggctctg tctacccctt gctgggtggc aatgccaata cttccgcttt
1741 ctttgattct atttttatgt gta
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMZFX 3000 bp ss-mRNA PRI 04-DEC-1989
DEFINITION Human zinc finger protein X-linked (ZFX) mRNA, complete cds.
ACCESSION M30608
KEYWORDS zinc finger protein.
SOURCE Human HeLa cell line, cDNA to mRNA, clone pCD5.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3000)
AUTHORS Palmer,M.S., Berta,P., Sinclair,A.H., Pym,B. and Goodfellow,P.N.
TITLE Comparison of human ZFY and ZFX transcripts
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 1681-1685 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.S.Palmer, 08-DEC-1989.
FEATURES from to/span description
pept 259 2676 zinc finger protein /nomgen="ZFX"
/hgml_locus_uid="LS0126V" /map="Xp22.1-p21.3"
BASE COUNT 918 a 588 c 693 g 801 t
ORIGIN
1 gtccgtccgg ttcttgctat attgccccag gctggtctcg aactcctggg ctcaagccgt
61 tctcccgcct cccactgccc gcctgtcact gccgtctgtt ccctgagctg tgctttacgc
121 tgggaaagac atagaagaac tattaagaag atagaattgt tttgctgcgc agtacagcaa
181 cagtggatgt tcaagattaa gattagagtc aagttgtgtg attaagacag gagctgtgac
241 tgatgagaat taaaggccat ggatgaagat gggcttgaat tacaacaaga gccaaactca
301 ttttttgatg caacaggagc tgatggtaca cacatggatg gtgatcaaat tgttgtggaa
361 gtacaagaaa ctgtttttgt ttcagatgtt gtggattcag acataactgt gcataacttt
421 gttcctgatg acccagattc agttgtaatc caagatgtta ttgaggacgt tgttatagaa
481 gatgttcagt gcccagatat catggaagaa gcagatgtgt ctgaaacggt catcattcct
541 gagcaagtgc tggactcaga tgtaactgaa gaagtttctt tagcacattg cacagtccca
601 gatgatgttt tagcttctga cattacttca gcctcaatgt ctatgccaga acacgtcttg
661 acgggtgatt ctatacatgt gtctgacgtt ggacatgttg gacatgttgg acatgttgaa
721 catgtggttc atgatagtgt agtggaagca gaaattgtca ctgatcctct gactaccgac
781 gtagtttcag aagaagtatt ggtagcagac tgtgcctctg aagcagtcat agatgccaat
841 gggatccctg tggaccagca ggatgatgac aaaggcaact gtgaggacta ccttatgatt
901 tccttggatg atgctggcaa aatagaacac gatggttctt ctggaatgac catggacaca
961 gagtcggaaa ttgatccttg taaagtggat ggcacttgcc ctgaggtcat caaggtgtac
1021 atttttaaag ctgaccctgg agaagatgac ttaggtggaa ctgtagacat tgtggagagt
1081 gagcctgaga atgatcatgg agttgaactg cttgatcaga acagcagtat tcgtgttccc
1141 agggaaaaga tggtttatat gactgtcaat gactctcagc cagaagatga agatttaaat
1201 gttgctgaaa tcgctgacga agtttatatg gaagtgatcg taggagagga ggatgctgca
1261 gcagcacggg cagccgccgc cgtgcacgag cagcaaatgg atgacaatga aatcaaaacc
1321 ttcatgccga ttgcatgggc agcagcttat ggtaataatt ctgatggaat tgaaaaccgg
1381 aatggcactg caagtgccct cttgcacata gatgagtctg ctggcctcgg cagactggct
1441 aaacaaaaac caaagaaaag gagaagacct gattccaggc agtaccaaac agcaataatt
1501 attggccctg atggacatcc tttgactgtc tatccttgca tgatttgtgg gaagaagttt
1561 aagtcgagag gttttttgaa aaggcacatg aaaaaccatc ccgaacacct tgccaagaag
1621 aaataccgct gtactgactg tgattacact accaacaaga agataagttt acacaaccac
1681 ctggagagcc acaagctgac cagcaaggca gagaaggcca ttgaatgcga tgagtgtggg
1741 aagcatttct ctcatgcagg ggctttgttt actcacaaaa tggtgcataa ggaaaaagga
1801 gccaacaaaa tgcacaagtg taaattctgt gaatacgaga cagctgaaca agggttattg
1861 aatcgccacc tcttggcagt ccacagcaag aactttcctc atatttgtgt ggagtgtggt
1921 aagggttttc gtcacccgtc agagctcaaa aagcacatga gaatccatac tggggagaag
1981 ccgtaccaat gccagtactg cgaatatagg tctgcagact cttctaactt gaaaacgcat
2041 gtcaaaacta agcatagtaa agagatgcca ttcaagtgtg acatttgtct tctgactttc
2101 tcggatacca aagaggtgca gcaacatgct cttatccacc aagaaagcaa aacacaccag
2161 tgtttgcatt gcgaccacaa gagttcgaac tcaagtgatt tgaaacgaca cataatttca
2221 gttcacacga aagactaccc ccataagtgt gacatgtgtg ataaaggctt tcacaggcct
2281 tcagaactca agaaacacgt ggctgcccac aagggcaaaa aaatgcacca gtgtagacat
2341 tgtgacttta agattgcaga tccatttgtt ctaagtcgcc atattctctc agttcacaca
2401 aaggatcttc catttaggtg caagagatgt agaaagggat ttaggcaaca gagtgagctt
2461 aaaaagcata tgaagacaca cagtggcagg aaagtgtatc agtgtgagta ctgtgagtat
2521 agcactacag atgcctcagg ctttaaacgg cacgttattt ccattcacac gaaagactat
2581 cctcaccggt gtgagtactg caagaaaggc ttccgaagac cttcagaaaa gaaccagcac
2641 ataatgcgac atcataaaga agttggcctg ccctaacaat acttctacag aacgtttgta
2701 gagatattgg ccttgaagca gaaaattcat tttaaagcca atcagtctca ttcacataca
2761 atactgtata ttgatttatg ctgtgtacaa atagaattat tacttctagt tgactttttt
2821 ttaaatatac attttgctca gtagtgtgtt ctgaattcta ttcagtttgt ttaataaata
2881 gggaaaactg gcaacatgct agttactttt aataaagtaa tccctgattc tataccgaag
2941 ttttatatct tagaatttta tatttattta aatatttacc ttgcttacct tgatggtact
//
LOCUS HUMZFY 2659 bp ss-mRNA PRI 04-DEC-1989
DEFINITION Human zinc finger protein Y-linked (ZFY) mRNA, complete cds.
ACCESSION M30607
KEYWORDS zinc finger protein.
SOURCE Human foreskin fibroblast, cDNA to mRNA, (pCD2Bassing library of
H.Okayama), clones pMF-1 and p5.8.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2659)
AUTHORS Palmer,M.S., Berta,P., Sinclair,A.H., Pym,B. and Goodfellow,P.N.
TITLE Comparison of human ZFY and ZFX transcripts
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 1681-1685 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.S.Palmer, 08-DEC-1989.
FEATURES from to/span description
pept 1 2406 zinc finger protein /nomgen="ZFY"
/hgml_locus_uid="LU0028V" /map="Yp11.3"
mRNA < 259 2630 ZFY mRNA (pot.)
BASE COUNT 861 a 488 c 597 g 713 t
ORIGIN
1 atggatgaag atgaatttga attgcagcca caagagccaa actcattttt tgatggaata
61 ggagctgatg ctacacacat ggatggtgat cagattgttg tggaaataca agaagcagtt
121 tttgtttcta atattgtgga ttctgacata actgtgcata actttgttcc tgatgaccca
181 gactcagttg taatccaaga tgttgttgaa gatgttgtca tagaggagga tgttcagtgc
241 tcagatatct tagaagaggc agatgtatct gaaaatgtca tcattcctga gcaagtgctg
301 gactcagatg taactgaaga agtttcttta ccacactgca cagtcccaga tgatgtttta
361 gcttctgaca ttacttcaac ctcaatgtct atgccagaac atgttttaac gagtgaatcc
421 atgcatgtgt gtgacattgg acatgttgaa catatggtgc atgatagtgt agtggaagca
481 gaaatcatta ctgatcctct gacgagtgac atagtttcag aagaagtatt ggtagcagac
541 tgtgcccctg aagcagtcat agatgccagc gggatctcag tggaccagca agataatgac
601 aaagccagct gtgaggacta cctaatgatt tcgttggatg atgctggcaa aatagaacat
661 gatggttcca ctggagtgac catcgatgca gaatcagaaa tggatccttg taaagtggat
721 agcacttgtc ctgaagtcat caaggtgtac atttttaaag ctgaccctgg agaagatgac
781 ttaggtggaa ctgtagacat tgtggagagt gaacctgaaa atgatcatgg agttgaacta
841 cttgatcaga acagcagtat tcgtgttccc agggaaaaga tggtttatat gactgtcaat
901 gactctcaac aagaagatga agatttaaat gttgctgaaa ttgctgatga agtttatatg
961 gaagtgatcg taggagagga ggatgctgct gttgcagcag cagcagctgc tgtgcatgag
1021 cagcaaattg atgaggatga aatgaaaacc ttcgtaccaa ttgcatgggc agcagcttat
1081 ggtaataatt ctgatggaat tgaaaaccgg aatggcactg caagtgccct cttgcacata
1141 gatgagtctg ctggccttgg cagactggct aaacagaaac caaagaaaaa gagaagacct
1201 gattccaggc agtaccaaac agcaataatt attggccctg atggtcatcc tttgactgtc
1261 tatccttgca tgatttgtgg gaagaagttt aagtcgaggg gttttttgaa aagacacatg
1321 aaaaaccatc ctgaacacct tgccaagaag aagtaccact gtactgactg tgattacact
1381 accaataaga agataagttt acataaccac ctggagagcc acaagctgac cagcaaggca
1441 gagaaggcca ttgaatgtga tgagtgtggg aagcattttt ctcatgcagg ggctttgttt
1501 actcacaaaa tggtgcataa ggaaaaaggg gccaacaaaa tgcacaagtg taaattctgt
1561 gaatatgaga cagctgaaca ggggttattg aatcgccacc tcttggcagt ccacagcaag
1621 aactttcctc atatttgtgt ggagtgtggt aaaggtttcc gatacccgtc ggaactgaga
1681 aagcacatgc gaatccatac cggcgagaag ccataccaat gccagtactg tgaatatagg
1741 tctgcagact cttctaactt gaaaacacat ataaaaacaa agcatagtaa agagatgcca
1801 ttcaagtgtg acatttgtct tctgactttc tcagatacca aagaagtgca gcaacatact
1861 cttgtccacc aagaaagcaa aacacatcag tgtttgcatt gcgaccacaa gagttcaaac
1921 tcaagtgatt tgaaacgaca tgtaatttca gttcatacga aagactatcc tcataagtgt
1981 gagatgtgcg agaaaggctt tcacaggcct tcagaactta agaaacatgt ggctgtccac
2041 aaaggtaaaa aaatgcacca atgtagacat tgtgacttta agattgcaga cccatttgtt
2101 ctaagtcgcc atattctctc agttcacaca aaggatcttc catttaggtg taagagatgt
2161 agaaagggat ttaggcaaca aaatgagctt aaaaagcata tgaagacaca cagtggcagg
2221 aaagtatatc agtgtgagta ctgtgagtat agcactacag atgcctcagg ctttaaacgg
2281 cacgttattt ccattcatac aaaagactat cctcatcggt gtgagtactg caagaaaggc
2341 ttccgaagac cttcagaaaa gaaccagcac ataatgagac accataaaga agttggtctg
2401 ccctaacagt gtgtctacaa gcttgtaaag atgttggcct tgaagcagaa aattcatttt
2461 ttaaagccag tcttgttcac atccattact atacattgaa ttatgctgtg taaaaataga
2521 attattgctt ctagtccact tttctttaca ttttattcaa tacgctgtcc tgaatcctat
2581 tcagtttctt taatagatga gtaaaaatag caacaagcaa gttgcttata ataaaataat
2641 ttgtgattct cccgaattc
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMANTCD36 1870 bp ss-mRNA PRI 15-DEC-1989
DEFINITION Human CD36 antigen mRNA, complete cds.
ACCESSION M24795
KEYWORDS cell surface antigen; cell surface receptor; erythrocyte antigen;
monocyte antigen; platelet antigen.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1870)
AUTHORS Oquendo,P., Hundt,E., Lawler,J. and Seed,B.
TITLE CD36 directly mediates cytoadherence of Plasmodium falciparum
parasitized erythrocytes
JOURNAL Cell 58, 95-101 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Seed, 12-MAY-1989.
FEATURES from to/span description
pept 211 1629 CD36 antigen
BASE COUNT 599 a 343 c 356 g 572 t
ORIGIN Unreported.
1 gaaaaatcct tcttagccat tttaaagata gctttccaat gattagacga attgattctt
61 tctgtgactc atcagttcct ttcctgtaaa attcatgtct tgctgttgat ttgtgaataa
121 gaaccagagc ttgtagaaac cactttaatc atatccagga gtttgcaaga aacaggtgct
181 taacactaat tcacctcctg aacaagaaaa atgggctgtg accggaactg tgggctcatc
241 gctggggctg tcattggtgc tgtcctggct gtgtttggag gtattctaat gccagttgga
301 gacctgctta tccagaagac aattaaaaag caagttgtcc tcgaagaagg tacaattgct
361 tttaaaaatt gggttaaaac aggcacagaa gtttacagac agttttggat ctttgatgtg
421 caaaatccac aggaagtgat gatgaacagc agcaacattc aagttaagca aagaggtcct
481 tatacgtaca gagttcgttt tctagccaag gaaaatgtaa cccaggacgc tgaggacaac
541 acagtctctt tcctgcagcc caatggtgcc atcttcgaac cttcactatc agttggaaca
601 gaggctgaca acttcacagt tctcaatctg gctgtggcag ctgcatccca tatctatcaa
661 aatcaatttg ttcaaatgat cctcaattca cttattaaca agtcaaaatc ttctatgttc
721 caagtcagaa ctttgagaga actgttatgg ggctataggg atccattttt gagtttggtt
781 ccgtaccctg ttactaccac agttggtctg ttttatcctt acaacaatac tgcagatgga
841 gtttataaag ttttcaatgg aaaagataac ataagtaaag ttgccataat cgacacatat
901 aaaggtaaaa ggaatctgtc ctattgggaa agtcactgcg acatgattaa tggtacagat
961 gcagcctcat ttccaccttt tgttgagaaa agccaggtat tgcagttctt ttcttctgat
1021 atttgcaggt caatctatgc tgtatttgaa tccgacgtta atctgaaagg aatccctgtg
1081 tatagatttg ttcttccatc caaggccttt gcctctccag ttgaaaaccc agacaactat
1141 tgtttctgca cagaaaaaat tatctcaaaa aattgtacat catatggtgt gctagacatc
1201 agcaaatgca aagaagggag acctgtgtac atttcacttc ctcattttct gtatgcaagt
1261 cctgatgttt cagaacctat tgatggatta aacccaaatg aagaagaaca taggacatac
1321 ttggatattg aacctataac tggattcact ttacaatttg caaaacggct gcaggtcaac
1381 ctattggtca agccatcaga aaaaattcaa gtattaaaga atctgaagag gaactatatt
1441 gtgcctattc tttggcttaa tgagactggg accattggtg atgagaaggc aaacatgttc
1501 agaagtcaag taactggaaa aataaacctc cttggcctga tagaaatgat cttactcagt
1561 gttggtgtgg tgatgtttgt tgcttttatg atttcatatt gtgcatgcag atcgaaaaca
1621 ataaaataag tatgtaccaa aaaatattgc ttcaataata ttagcttata tattacttgt
1681 tttcacttta tcaaagagaa gttacatatt aggccatata tatttctaga catgtctagc
1741 cactgatcat ttttaaatat aggtaaataa acctataaat attatcacgc agatcactaa
1801 agtatatctt taattctggg agaaatgaga taaaagatgt acttgtgacc attgtaacaa
1861 tagcacaaat
//
LOCUS HUMARB3A 1270 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human beta-3-adrenergic receptor gene.
ACCESSION M29932
KEYWORDS adrenergic receptor; beta-3-adrenergic receptor.
SOURCE Human placenta DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1270)
AUTHORS Emorine,L.J., Marullo,S., Briend-Sutren,M.-M., Patey,G., Tate,K.,
Delavier-Klutchko,C. and Strosberg,A.D.
TITLE Molecular characterization of the human beta-3-adrenergic receptor
JOURNAL Science 245, 1118-1121 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.J.Emorine 17-NOV-1989.
FEATURES from to/span description
pept 38 1246 beta-3-adrenergic receptor
BASE COUNT 132 a 484 c 405 g 249 t
ORIGIN 33 bp upstream of SmaI site.
1 ttccttcttt ccctaccgcc ccacgcgcga cccggggatg gctccgtggc ctcacgagaa
61 cagctctctt gccccatggc cggacctccc caccctggcg cccaataccg ccaacaccag
121 tgggctgcca ggggttccgt gggaggcggc cctagccggg gccctgctgg cgctggcggt
181 gctggccacc gtgggaggca acctgctggt catcgtggcc atcgcctgga ctccgagact
241 ccagaccatg accaacgtgt tcgtgacttc gctggccgca gccgacctgg tgatgggact
301 cctggtggtg ccgccggcgg ccaccttggc gctgactggc cactggccgt tgggcgccac
361 tggctgcgag ctgtggacct cggtggacgt gctgtgtgtg accgccagca tcgaaaccct
421 gtgcgccctg gccgtggacc gctacctggc tgtgaccaac ccgctgcgtt acggcgcact
481 ggtcaccaag cgctgcgccc ggacagctgt ggtcctggtg tgggtcgtgt cggccgcggt
541 gtcgtttgcg cccatcatga gccagtggtg gcgcgtaggg gccgacgccg aggcgcagcg
601 ctgccactcc aacccgcgct gctgtgcctt cgcctccaac atgccctacg tgctgctgtc
661 ctcctccgtc tccttctacc ttcctcttct cgtgatgctc ttcgtctacg cgcgggtttt
721 cgtggtggct acgcgccagc tgcgcttgct gcgcggggag ctgggccgct ttccgcccga
781 ggagtctccg ccggcgccgt cgcgctctct ggccccggcc ccggtgggga cgtgcgctcc
841 gcccgaaggg gtgcccgcct gcggccggcg gcccgcgcgc ctcctgcctc tccgggaaca
901 ccgggccctg tgcaccttgg gtctcatcat gggcaccttc actctctgct ggttgccctt
961 ctttctggcc aacgtgctgc gcgccctggg gggcccctct ctagtcccgg gcccggcttt
1021 ccttgccctg aactggctag gttatgccaa ttctgccttc aacccgctca tctactgccg
1081 cagcccggac tttcgcagcg ccttccgccg tcttctgtgc cgctgcggcc gtcgcctgcc
1141 tccggagccc tgcgccgccg cccgcccggc cctcttcccc tcgggcgttc ctgcggcccg
1201 gagcagccca gcgcagccca ggctttgcca acggctcgac gggtaggtaa ccggggcaga
1261 gggaccggcg
//
LOCUS HUMCY4ARO 2736 bp ss-mRNA PRI 15-SEP-1989
DEFINITION Human aromatase system cytochrome P-450 (P450XIX) mRNA, complete
cds.
ACCESSION J04127
KEYWORDS aromatase cytochrome P-450; microsomal protein.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2736)
AUTHORS Corbin,C.J., Graham-Lorence,S., McPhaul,M., Mason,J.I.,
Mendelson,C.R. and Simpson,E.R.
TITLE Isolation of a full-length cDNA insert encoding human aromatase
system cytochrome P-450 and its expression in nonsteroidogenic
cells
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 8948-8952 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by E.R.Simpson, 03-MAR-1989.
FEATURES from to/span description
pept 28 1539 cytochrome P-450-Arom /nomgen="CYP19"
/map="15q21" /hgml_locus_uid="LL0035S"
mRNA < 1 2385 CYP19 mRNA (alt.)
mRNA < 1 2736 CYP19 mRNA (alt.)
BASE COUNT 801 a 599 c 591 g 745 t
ORIGIN Chromosome 15q21.
1 gccccctctg aggtcaagga acacaagatg gttttggaaa tgctgaaccc gatacattat
61 aacatcacca gcatcgtgcc tgaagccatg cctgctgcca ccatgccagt cctgctcctc
121 actggccttt ttctcttggt gtggaattat gagggcacat cctcaatacc aggtcctggc
181 tactgcatgg gaattggacc cctcatctcc cacggcagat tcctgtggat ggggatcggc
241 agtgcctgca actactacaa ccgggtgtat ggagaattca tgcgagtctg gatctctgga
301 gaggaaacac tcattatcag caagtcctca agtatgttcc acataatgaa gcacaatcat
361 tacagctctc gattcggcag caaacttggg ctgcagtgca tcggtatgca tgagaaaggc
421 atcatattta acaacaatcc agagctctgg aaaacaactc gacccttctt tatgaaagct
481 ctgtcaggcc ccggccttgt tcgtatggtc acagtctgtg ctgaatccct caaaacacat
541 ctggacaggt tggaggaggt gaccaatgaa tcgggctatg tggacgtgtt gacccttctg
601 cgtcgtgtca tgctggacac ctctaacacg ctcttcttga ggatcccttt ggacgaaagt
661 gctatcgtgg ttaaaatcca aggttatttt gatgcatggc aagctctcct catcaaacca
721 gacatcttct ttaagatttc ttggctatac aaaaagtatg agaagtctgt caaggatttg
781 aaagatgcca tagaagttct gatagcagaa aaaagatgca ggatttccac agaagagaaa
841 ctggaagaat gtatggactt tgccactgag ttgattttag cagagaaacg tggtgacctg
901 acaagagaga atgtgaacca gtgcatattg gaaatgctga tcgcagctcc tgacaccatg
961 tctgtctctt tgttcttcat gctatttctc attgcaaagc accctaatgt tgaagaggca
1021 ataataaagg aaatccagac tgttattggt gagagagaca taaagattga tgatatacaa
1081 aaattaaaag tgatggaaaa cttcatttat gagagcatgc ggtaccagcc tgtcgtggac
1141 ttggtcatgc gcaaagcctt agaagatgat gtaatcgatg gctacccagt gaaaaagggg
1201 acaaacatta tcctgaatat tggaaggatg cacagactcg agtttttccc caaacccaat
1261 gaatttactc ttgaaaattt tgcaaagaat gttccttata ggtactttca gccatttggc
1321 tttgggcccc gtggctgtgc aggaaagtac atcgccatgg tgatgatgaa agccatcctc
1381 gttacacttc tgagacgatt ccacgtgaag acattgcaag gacagtgtgt tgagagcata
1441 cagaagatac acgacttgtc cttgcaccca gatgagacta aaaacatgct ggaaatgatc
1501 tttaccccaa gaaactcaga caggtgtctg gaacactaga gaaggctggt cagtacccac
1561 tctggagcat tttctcatca gtagttcaca tacaaatcat ccatccttgc caatagtgtc
1621 atcctcacag tgaacactca gtggcccatg gaattttata ggcatacctc ctatggttgt
1681 caccaagcta ggtgctattg gtcatctgct cctgttcaca ccagagaacc aggctacaag
1741 agaaaaagca gaggccaaga gtttgagggg agaaatagtc ggtgaagaaa ccgtatccat
1801 aaagacccga ttccaccaaa tgtgctttga gaaggatagg ccttcattaa caaaatgtat
1861 gtctggttcc ccagtagagc tctactgcct caaccaaggg gatttttatg tctggggcag
1921 aaacactcaa gttgattaga aagaccaggc caatgtcagg gtacctgggg ccaaacccac
1981 ctgctagtgt gaattaaagt actttaattt tgttttctgt ggaggtggaa aagcaacatt
2041 catagtcttt ggagaaatgc ttagaaattc agcatttgac ccttgctgtg aattaagccc
2101 aattaattcc tgtttgtcta catatgatct gtctgtggca aaagtttaat cagaggaaat
2161 tctttcccag tctgtcgatt tatgcctcag ccacttgcct gtgctacaat tcattgtgtt
2221 acctgtagat tcaggtaata caaaccatat ataatcatca agtaatacaa actaatttag
2281 taatagcctg ggttaagtat tattagggcc ctgtgtctgc atgtagaaaa aaaaattcac
2341 atgatgcact tcaaattcaa ataaaaatcc ttttggcatg ttcccatttt tgcttagctc
2401 aattagtgtg gctaaccaag agataactgt aaatgtgaca ttgatttgct cttactacag
2461 ctacagtgat tgggggagga aaagtcccaa cccaatgggc tcaaacttct aaggggtact
2521 cctctcatcc ccttatcctt ctccctcgac attttctccc tctttcttcc catgacccca
2581 aagccaaggg caacagatca gtaaagaacg tggtcagagt agaacccctg aagtattttt
2641 taatcctacc tcaaaattta acagttacct gagagattta acattatcta gttcattgaa
2701 tcattgtatg tggtcatgga taaattgcac acttgg
//
LOCUS HUMFCRII 1403 bp ss-mRNA PRI 30-SEP-1988
DEFINITION Human FcRII mRNA for immunoglobulin G receptor.
ACCESSION Y00644
KEYWORDS IgG receptor; cell surface glycoprotein; immunoglobulin receptor.
SOURCE human (Homo sapiens).
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1403)
AUTHORS Moore,K.W. and Martens,C.L.
JOURNAL Unpublished (1987) DNAX, Palo Alto, CA, 94304 USA.
STANDARD simple automatic
REFERENCE 2 (bases 1 to 1403)
AUTHORS Stuart,S.G.
TITLE Isolation and Expression of cDNA clones encoding a
JOURNAL J. Exp. Med. 166, 1668-1684 (1987)
STANDARD simple automatic
COMMENT *source: cell line=K937; library=pcD vector; clone=16.2;
EMBL features not translated to GenBank features:
key from to description
SITE 308 316 N-glycosylation site
SITE 551 559 N-glycosylation site
SITE 656 742 transmembrane domain
FEATURES from to/span description
pept 20 973 precursor polypeptide (AA -34 to 287)
sigp 20 121 signal peptide (AA -34 to -1)
matp 122 970 mature IgG receptor (AA 1-287)
BASE COUNT 410 a 372 c 305 g 316 t
ORIGIN
1 ggggggggac agtgctggga tgactatgga gacccaaatg tctcagaatg tatgtcccag
61 aaacctgtgg ctgcttcaac cattgacagt tttgctgctg ctggcttctg cagacagtca
121 agctgcagct cccccaaagg ctgtgctgaa acttgagccc ccgtggatca acgtgctcca
181 ggaggactct gtgactctga catgccaggg ggctcgcagc cctgagagcg actccattca
241 gtggttccac aatgggaatc tcattcccac ccacacgcag cccagctaca ggttcaaggc
301 caacaacaat gacagcgggg agtacacgtg ccagactggc cagaccagcc tcagcgaccc
361 tgtgcatctg actgtgcttt ccgaatggct ggtgctccag acccctcacc tggagttcca
421 ggagggagaa accatcatgc tgaggtgcca cagctggaag gacaagcctc tggtcaaggt
481 cacattcttc cagaatggaa aatcccagaa attctcccgt ttggatccca ccttctccat
541 cccacaagca aaccacagtc acagtggtga ttaccactgc acaggaaaca taggctacac
601 gctgttctca tccaagcctg tgaccatcac tgtccaagtg cccagcatgg gcagctcttc
661 accaatgggg atcattgtgg ctgtggtcat tgcgactgct gtagcagcca ttgttgctgc
721 tgtagtggcc ttgatctact gcaggaaaaa gcggatttca gccaattcca ctgatcctgt
781 gaaggctgcc caatttgagc cacctggacg tcaaatgatt gccatcagaa agagacaact
841 tgaagaaacc aacaatgact atgaaacagc tgacggcggc tacatgactc tgaaccccag
901 ggcacctact gacgatgata aaaacatcta cctgactctt cctcccaacg accatgtcaa
961 cagtaataac taaagagtaa cgttatgcca tgtggtcata ctctcagctt gcgtatggat
1021 gcaaaaaaga ggggaattgt taaaggaaaa tttaaatgga gactggaaaa atcctgagca
1081 aacaaaacca cctggccctt agaaatagct ttaactttgc ttaaactaca aacacaagca
1141 aaacttcacg gggtcatact acatacaagc ataagcaaaa cttaacttgg atcatttctg
1201 gtaaatgctt atgttagaaa taagacaacc ccagccaatc acaagcagcc tactaacata
1261 taattaggtg actagggact ttctaagaag atacctaccc ccaaaaaaca acttatgtaa
1321 ttgaaaacca accgattgcc tttattttgc ttccacattt tcccaataaa tacttgcctg
1381 tgacattttg ccactggaac act
//
LOCUS HUMGNAT1 1292 bp ss-mRNA PRI 15-MAR-1990
DEFINITION Human GNAT1 mRNA for transducin alpha-chain.
ACCESSION X15088
KEYWORDS guanine nucleotide-binding protein; transducin;
transducin alpha-chain.
SOURCE human (Homo sapiens).
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1292; enum. 1 to 1292)
AUTHORS Van Dop,C.
TITLE ;
JOURNAL Unpublished (1989) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 1292)
AUTHORS Van Dop,C., Medynski,D.C. and Apone,L.M.
TITLE Nucleotide sequence for a cDNA encoding the alpha subunit of
retinal transducin (GNAT1) isolated from the human eye
JOURNAL Nucleic Acids Res. 17, 4887-4887 (1989)
STANDARD simple automatic
COMMENT *source: tissue=eye; clone=UO50;
EMBL features not translated to GenBank features:
key from to description
SITE 1253 1258 pot. polyA site
[1] Author address Van Dop C., Howard Hughes Medical Institute,
Children's Hospital Enders 1107, 300 Longwood Avenue, Boston MA
02115, U S A.
Submitted (21-APR-1989) on tape to the EMBL data library
FEATURES from to/span description
pept < 1 1156 transducin alpha-chain (AA at 2)
/hgml_locus_uid="LE0116J" /map="3"
/nomgen="GNAT1"
BASE COUNT 311 a 387 c 362 g 232 t
ORIGIN
1 aggtcctcct gggccagaag ggttcctggg agccaggttc tgggatcccc tccatccaga
61 agaaccacct gctcactctg tcccttcgcc tgctgctggg accatggggg ctggggccag
121 tgctgaggag aagcactcca gggagctgga aaagaagctg aaagaggacg ctgagaagga
181 tgctcgaacc gtgaagctgc tgcttctggg tgccggtgag tccgggaaga gcaccatcgt
241 caagcagatg aagattatcc accaggacgg gtactcgctg gaagagtgcc tcgagtttat
301 cgccatcatc tacggcaaca cgttgcagtc catcctggcc atcgtacgcg ccatgaccac
361 actcaacatc cagtacggag actctgcacg ccaggacgac gcccggaagc tgatgcacat
421 ggcagacact atcgaggagg gcacgatgcc caaggagatg tcggacatca tccagcggct
481 gtggaaggac tccggtatcc aggcctgttt tgagcgcgcc tcggagtacc agctcaacga
541 ctcggcgggc tactacctct ccgacctgga gcgcctggta accccgggct acgtgcccac
601 cgagcaggac gtgctgcgct cgcgagtcaa gaccactggc atcatcgaga cgcagttctc
661 cttcaaggat ctcaacttcc ggatgttcga tgtgggcggg cagcgctcgg agccgaagaa
721 gtggatccac tgcttcgagg gcgtgacctg catcatcttc atcgcggcgc tgaccgcgta
781 cgacatggtg ctagtggagg acgacgaagt gaaccgcatg cacgagagcc tgcacctgtt
841 caacagcatc tgcaaccacc gctacttcgc cacgacgtcc atcgtgctct tccttaacaa
901 gaaggacgtc ttcttcgaga aggtcaagaa ggcgcacctc agcatctgtt tcccggacta
961 cgatggaccc aacacctacg aggacgccgg caactacatc aaggtgcagt tcctcgagct
1021 caacatgcgg cgcgacgtga aggagatcta ttcccacatg acgtgcgcca ccgacacgca
1081 gaacgtcaaa ttctgcttcg acgctgtcac cgacatcatc atcaaggaga acctcaaaga
1141 ctgtggcctc ttctgagcca gggcctgtgc tgcagtcggg gacaaggagc ttccgtctgg
1201 caaggccggg gcacaatttg cactcccctc agctagacgc agcagactca gcaataaacc
1261 tttgcatcag gcaaaaaaaa aaaaacaaaa aa
//
LOCUS HUMGSHPXG 1733 bp ds-DNA PRI 15-MAR-1989
DEFINITION Human glutathione peroxidase gene, complete cds.
ACCESSION Y00483
KEYWORDS glutathione peroxidase.
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1733)
AUTHORS Ishida,K.
JOURNAL Unpublished (1987) Nippon Kayaku Co.,Ltd, Kita-ku, Tokyo 115, Japan
STANDARD simple automatic
REFERENCE 2 (bases 1 to 1733)
AUTHORS Ishida,K., Morino,T., Takagi,K. and Sukenaga,Y.
TITLE Nucleotide sequence of a human gene for glutathione peroxidase
JOURNAL Nucleic Acids Res. 15, 10051-10051 (1987)
STANDARD simple automatic
COMMENT An in-phase "tga" codon is located at positions 612-614 and encodes
selenocysteine.
FEATURES from to/span description
pept 474 611 glutathione peroxidase, exon 1 (EC 1.11.1.9).
/nomgen="GPX1" /map="3q11-q12"
/hgml-locus_uid="LE0062E"
615 719 glutathione peroxidase, exon 1
998 1357 glutathione peroxidase, exon 2
pre-msg 155 1566 GTP mRNA and intron
IVS 720 997 GTP intron
BASE COUNT 277 a 529 c 526 g 401 t
ORIGIN
1 aacctagatc cctctgctgt cccctgcact gccggtaaca tggcacagca gagcagggtt
61 gtttgtgcac gggcagctcc tgcagctgct gccgtcgccc accagcctcc tatgccaaac
121 cccacatcct aactcaggaa cctctgagaa aaaacggagc cctcgagggg cccagccttg
181 gaagggtaac tggaccgctg ccgcctggtt gcctgggcca gaccagacat gcctgctgct
241 ccttccggct taggaggagc acgcgtcccg ctcgcgcgca ctctccagcc ttttcctggc
301 tgaggagggg ccgagcctcc ggtagggcgg gggccggatg aggcgggacc tcaggcccgg
361 aaaactgcct gtgccacgtg acccgccgcc ggccagttaa aaggaggcgc ctgctggcct
421 ccccttacag tgcttgttcg gggcgctccg ctggcttctt ggacaattgc gccatgtgtg
481 ctgctcggct agcggcggcg gcggcccagt cggtgtatgc cttctcggcg cgcccgttgg
541 ccggcgggga gcctgtgagc ctgggctccc tgcggggcaa ggtactactt atcgagaatg
601 tggcgtccct ctgaggcacc acggtccggg actacaccca gatgaacgag ctgcagcggc
661 gcctcggacc ccggggcctg gtggtgctcg gcttcccgtg caaccagttt gggcatcagg
721 tgcgccgggc ggagcgggac gggacggggg cggacgtgca gtagtggctg ggggcgccgg
781 cggtgtggtg gtgggtgcgt cggctccatg cgcggagagt ctggctactc tctcgtttcc
841 tttctgttgc tcgtagctgc tgaaattcct ctccgccctt gggattgcgc atggagggaa
901 aaatcccggt gactcataga aaatctcccc tgtttgtggt tagaacgttt ctctcctcct
961 cttgaccccg ggttctagct gcccttctct cctgtaggag aacgccaaga acgaagagat
1021 tctgaattcc ctcaagtacg tccggcctgg tggtgggttc gagcccaact tcatgctctt
1081 cgagaagtgc gaggtgaacg gtgcgggggc gcaccctctc ttcgccttcc tgcgggaggc
1141 cctgccagct cccagcgacg acgccaccgc gcttatgacc gaccccaagc tcatcacctg
1201 gtctccggtg tgtcgcaacg atgttgcctg gaactttgag aagttcctgg tgggccctga
1261 cggtgtgccc ctacgcaggt acagccgccg cttccagacc attgacatcg agcctgacat
1321 cgaagccctg ctgtctcaag ggcccagctg tgcctagggc gcccctccta ccccggctgc
1381 ttggcagttg cagtgctgct gtctcggggg ggttttcatc tatgagggtg tttcctctaa
1441 acctacgagg gaggaacacc ttgatcttac agaaaatacc acctcgagat gggtgctggt
1501 cctgttgatc ccagtctctg ccagaccaag gctagtttcc ccactaataa agtgccgggt
1561 gtcagcagac tgtgtgtatg tcctgtgtca ttgtcatttg ggaattcttt ttcttttctt
1621 tttttttttt tttttttgag acggagtttt ttgctctatt gcccaggctt gagtgcagtg
1681 gcgcaatcta ggctcactgc aagctccgcc tcccgggttc agcatttctg cta
//
LOCUS HUMHBB3E 213 bp ss-mRNA PRI 15-MAR-1990
DEFINITION Human beta-globin mRNA, 3' end.
ACCESSION M11428
KEYWORDS beta-globin; globin.
SOURCE Human cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 213)
AUTHORS Lang,K.M. and Spritz,R.A.
TITLE Cloning specific complete polyadenylylated 3'-terminal cDNA
segments
JOURNAL Gene 33, 191-196 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 81 beta globin /nomgen="HBB" /map="11p11.5"
/hgml_locus_uid="LQ0007V"
BASE COUNT 53 a 50 c 45 g 65 t
ORIGIN
1 gaattcaccc caccagtgca ggctgcctat cagaaagtgg tggctggtgt ggctaatgcc
61 ctggcccaca agtatcacta agctcgcttt cttgctgtcc aatttctatt aaaggttcct
121 ttgttccgta agtccaacta ctaaactggg ggatattatg aagggccttg agcatctgga
181 ttctgcctaa taaaaaacat ttattttcat tgc
//
LOCUS HUMHK1A 4134 bp ss-mRNA PRI 15-DEC-1989
DEFINITION Human calcium-ATPase (HK1) mRNA, complete cds.
ACCESSION M23114 J04025
KEYWORDS ATPase; Ca2+ ATPase; alternative splicing; calcium-ATPase.
SOURCE Human kidney cortex, cDNA to mRNA, clones lambda HK[1a,1b].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4134)
AUTHORS Lytton,J. and MacLennan,D.H.
TITLE Molecular cloning of cDNAs from human kidney coding for two
alternatively spliced products of the cardiac Ca2+-ATPase gene
JOURNAL J. Biol. Chem. 263, 15024-15031 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Lytton, 14-MAR-1989. Two alternative splicing products, HK1
and HK2, are realized in human kidney cDNAs. HK2 codes for a
protein identical to rabbit cardiac muscle Ca2+ ATPase, with the
exception of 6 scattered amino acid replacements, whereas HK1 codes
for a protein identical to that encoded by HK2, but with the
carboxyl-terminal 4 amino acids replaced by an extended sequence of
49 amino acids. See accession M23115 and J04703.
FEATURES from to/span description
pept 164 3292 calcium-ATPase (EC 3.6.1.3) /nomgen="ATP2B"
/map="12" /hgml_locus_uid="LP0123P"
mRNA 1 4134 calcium-ATPase mRNA
site 3143 3144 alternative splice site
BASE COUNT 1056 a 938 c 1012 g 1128 t
ORIGIN 254 bp upstream of HindIII site.
1 gggtgattca gcgcccggcg aggcggaacg ggccgcaaga ggaggagggg agagcccgtc
61 cgcgcctggg ctcccggggt ggcacgagcc cgcggccgga gtgcgaggcg gaggcgagga
121 ggccgcgggg acgggaggcg aggccggccg ggcccccgaa gccatggaga acgcgcacac
181 caagacggtg gaggaggtgc tgggccactt cggcgtcaac gagagtacgg ggctgagcct
241 ggaacaggtc aagaagctta aggagagatg gggctccaac gagttaccgg ctgaagaagg
301 aaaaaccttg ctggaacttg tgattgagca gtttgaagac ttgctagtta ggattttatt
361 actggcagca tgtatatctt ttgttttggc ttggtttgaa gaaggtgaag aaacaattac
421 agcctttgta gaaccttttg taattttact catattagta gccaatgcaa ttgtgggtgt
481 atggcaggaa agaaatgctg aaaatgccat cgaagccctt aaggaatatg agcctgaaat
541 gggcaaagtg tatcgacagg acagaaagag tgtgcagcgg attaaagcta aagacatagt
601 tcctggtgat attgtagaaa ttgctgttgg tgacaaagtt cctgctgata taaggttaac
661 ttccatcaaa tctaccacac taagagttga ccagtcaatt ctcacaggtg aatctgtctc
721 tgtcatcaag cacactgatc ccgtccctga cccacgagct gtcaaccaag ataaaaagaa
781 catgctgttt tctggtacaa acattgctgc tgggaaagct atgggagtgg tggtagcaac
841 tggagttaac accgaaattg gcaagatccg ggatgaaatg gtggcaacag aacaggagag
901 aacacccctt cagcaaaaac tagatgaatt tggggaacag ctttccaaag tcatctccct
961 tatttgcatt gcagtctgga tcataaatat tgggcacttc aatgacccgg ttcatggagg
1021 gtcctggatc agaggtgcta tttactactt taaaattgca gtggccctgg ctgtagcagc
1081 cattcctgaa ggtctgcctg cagtcatcac cacctgcctg gctcttggaa ctcgcagaat
1141 ggcaaagaaa aatgccattg ttcgaagcct cccgtctgtg gaaacccttg gttgtacttc
1201 tgttatctgc tcagacaaga ctggtacact tacaacaaac cagatgtcag tctgcaggat
1261 gttcattctg gacagagtgg aaggtgatac ttgttccctt aatgagttta ccataactgg
1321 atcaacttat gcacctattg gagaagtgca taaagatgat aaaccagtga attgtcacca
1381 gtatgatggt ctggtagaat tagcaacaat ttgtgctctt tgtaatgact ctgctttgga
1441 ttacaatgag gcaaagggtg tgtatgaaaa agttggagaa gctacagaga ctgctctcac
1501 ttgcctagta gagaagatga atgtatttga taccgaattg aagggtcttt ctaaaataga
1561 acgtgcaaat gcctgcaact cagtcattaa acagctgatg aaaaaggaat tcactctaga
1621 gttttcacgt gacagaaagt caatgtcggt ttactgtaca ccaaataaac caagcaggac
1681 atcaatgagc aagatgtttg tgaagggtgc tcctgaaggt gtcattgaca ggtgcaccca
1741 cattcgagtt ggaagtacta aggttcctat gacctctgga gtcaaacaga agatcatgtc
1801 tgtcattcga gagtggggta gtggcagcga cacactgcga tgcctggccc tggccactca
1861 tgacaaccca ctgagaagag aagaaatgca ccttgaggac tctgccaact ttattaaata
1921 tgagaccaat ctgaccttcg ttggctgcgt gggcatgctg gatcctccga gaatcgaggt
1981 ggcctcctcc gtgaagctgt gccggcaagc aggcatccgg gtcatcatga tcactgggga
2041 caacaagggc actgctgtgg ccatctgtcg ccgcatcggc atcttcgggc aggatgagga
2101 cgtgacgtca aaagctttca caggccggga gtttgatgaa ctcaacccct ccgcccagcg
2161 agacgcctgc ctgaacgccc gctgttttgc tcgagttgaa ccctcccaca agtctaaaat
2221 cgtagaattt cttcagtctt ttgatgagat tacagctatg actggcgatg gcgtgaacga
2281 tgctcctgct ctgaagaaag ccgagattgg cattgctatg ggctctggca ctgcggtggc
2341 taaaaccgcc tctgagatgg tcctggcgga tgacaacttc tccaccattg tggctgccgt
2401 tgaggagggg cgggcaatct acaacaacat gaaacagttc atccgctacc tcatctcgtc
2461 caacgtcggg gaagttgtct gtattttcct gacagcagcc cttggatttc ccgaggcttt
2521 gattcctgtt cagctgctct gggtcaatct ggtgacagat ggcctgcctg ccactgcact
2581 ggggttcaac cctcctgatc tggacatcat gaataaacct ccccggaacc caaaggaacc
2641 attgatcagc gggtggctct ttttccgtta cttggctatt ggctgttacg tcggcgctgc
2701 taccgtgggt gctgctgcat ggtggttcat tgctgctgac ggtggtccaa gagtgtcctt
2761 ctaccagctg agtcatttcc tacagtgtaa agaggacaac ccggactttg aaggcgtgga
2821 ttgtgcaatc tttgaatccc catacccgat gacaatggcg ctctctgttc tagtaactat
2881 agaaatgtgt aacgccctca acagcttgtc cgaaaaccag tccttgctga ggatgccccc
2941 ctgggagaac atctggctcg tgggctccat ctgcctgtcc atgtcactcc acttcctgat
3001 cctctatgtc gaacccttgc cactcatctt ccagatcaca ccgctgaacg tgacccagtg
3061 gctgatggtg ctgaaaatct ccttgcccgt gattctcatg gatgagacgc tcaagtttgt
3121 ggcccgcaac tacctggaac ctggtaaaga gtgtgtgcag cctgccacca aatcctgctc
3181 gttctcggca tgcaccgatg ggatttcctg gccgtttgtg ctgctcataa tgcccctggt
3241 gatctgggtc tatagcacag acactaactt tagcgatatg ttctggtctt gactgacagt
3301 tttccataaa gaagatgttt aacttaatca attaattttt ttattgttta aagcaactgt
3361 ctatttctgc tgaattttca catgaacata ctggctggtg atggaggttt catactctag
3421 attttgtttt gctttttctg actccagtgg ggcaagattt tcctttttta tacacataat
3481 taaagtgtcc attgacatgt acagagaact aacactattt tatgcaaata tttttttgta
3541 gatgaaaaag catgtacagt gttctgttta atactcatcc ttgtataaaa aaaatagttg
3601 agccagcaga cattgtcagc aaattaattg gcagcagatt ttaggaaatg aatgtgtgtg
3661 gttttttttc taaaactaaa tagcatgtat tgtgtctttt gcatgatgat ccggatttaa
3721 tttgatatca cagtctaatt tttattcata agccaatttt tctgcactga gcagagtctt
3781 gctacctcag tcagtattgt tttggtttgc tacttccctc acccactttg gcctccgttc
3841 accccacccc accccacctc tccccacctt acccccgccc cgcttggctt cttctttagg
3901 attgtgatgg ttcgttctgt ttacatcagt tttaacgaga ggtatgcctg tactcgcttg
3961 tgcagaaaac attgttccag attcaatcga ctgggtttat gtcccttcac atagttttta
4021 aggttattta tttaaatgtc taatgtattt tattgtaaca gacattgttt tgccaacatt
4081 gcctatttca gtggcacgtc atctagtttt aaaaaaataa aacattttaa aaag
//
LOCUS HUMHK2A 3781 bp ss-mRNA PRI 15-DEC-1989
DEFINITION Human calcium-ATPase (HK2) mRNA, complete cds.
ACCESSION M23115 J04025
KEYWORDS ATPase; Ca2+ ATPase; alternative splicing; calcium-ATPase.
SOURCE Human kidney cortex, cDNA to mRNA, clones lambda HK2[a,b].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3781)
AUTHORS Lytton,J. and MacLennan,D.H.
TITLE Molecular cloning of cDNAs from human kidney coding for two
alternatively spliced products of the cardiac Ca2+-ATPase gene
JOURNAL J. Biol. Chem. 263, 15024-15031 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Lytton, 14-MAR-1989. Two alternative splicing products, HK1
and HK2, are realized in human kidney cDNAs. HK2 codes for a
protein identical to rabbit cardiac muscle Ca2+ ATPase, with the
exception of 6 scattered amino acid replacements, whereas HK1 codes
for a protein identical to that encoded by HK2, but with the
carboxyl-terminal 4 amino acids replaced by an extended sequence of
49 amino acids. See accession M23114 and J04703.
FEATURES from to/span description
pept 4 2997 calcium-ATPase (EC 3.6.1.3) /nomgen="ATP2B"
/map="12" /hgml_locus_uid="LP0123P"
mRNA 1 3781 calcium-ATPase mRNA
site 2983 2984 alternative splice site
BASE COUNT 1018 a 829 c 913 g 1021 t
ORIGIN 94 bp upstream of HindIII site.
1 gccatggaga acgcgcacac caagacggtg gaggaggtgc tgggccactt cggcgtcaac
61 gagagtacgg ggctgagcct ggaacaggtc aagaagctta aggagagatg gggctccaac
121 gagttaccgg ctgaagaagg aaaaaccttg ctggaacttg tgattgagca gtttgaagac
181 ttgctagtta ggattttatt actggcagca tgtatatctt ttgttttggc ttggtttgaa
241 gaaggtgaag aaacaattac agcctttgta gaaccttttg taattttact catattagta
301 gccaatgcaa ttgtgggtgt atggcaggaa agaaatgctg aaaatgccat cgaagccctt
361 aaggaatatg agcctgaaat gggcaaagtg tatcgacagg acagaaagag tgtgcagcgg
421 attaaagcta aagacatagt tcctggtgat attgtagaaa ttgctgttgg tgacaaagtt
481 cctgctgata taaggttaac ttccatcaaa tctaccacac taagagttga ccagtcaatt
541 ctcacaggtg aatctgtctc tgtcatcaag cacactgatc ccgtccctga cccacgagct
601 gtcaaccaag ataaaaagaa catgctgttt tctggtacaa acattgctgc tgggaaagct
661 atgggagtgg tggtagcaac tggagttaac accgaaattg gcaagatccg ggatgaaatg
721 gtggcaacag aacaggagag aacacccctt cagcaaaaac tagatgaatt tggggaacag
781 ctttccaaag tcatctccct tatttgcatt gcagtctgga tcataaatat tgggcacttc
841 aatgacccgg ttcatggagg gtcctggatc agaggtgcta tttactactt taaaattgca
901 gtggccctgg ctgtagcagc cattcctgaa ggtctgcctg cagtcatcac cacctgcctg
961 gctcttggaa ctcgcagaat ggcaaagaaa aatgccattg ttcgaagcct cccgtctgtg
1021 gaaacccttg gttgtacttc tgttatctgc tcagacaaga ctggtacact tacaacaaac
1081 cagatgtcag tctgcaggat gttcattctg gacagagtgg aaggtgatac ttgttccctt
1141 aatgagttta ccataactgg atcaacttat gcacctattg gagaagtgca taaagatgat
1201 aaaccagtga attgtcacca gtatgatggt ctggtagaat tagcaacaat ttgtgctctt
1261 tgtaatgact ctgctttgga ttacaatgag gcaaagggtg tgtatgaaaa agttggagaa
1321 gctacagaga ctgctctcac ttgcctagta gagaagatga atgtatttga taccgaattg
1381 aagggtcttt ctaaaataga acgtgcaaat gcctgcaact cagtcattaa acagctgatg
1441 aaaaaggaat tcactctaga gttttcacgt gacagaaagt caatgtcggt ttactgtaca
1501 ccaaataaac caagcaggac atcaatgagc aagatgtttg tgaagggtgc tcctgaaggt
1561 gtcattgaca ggtgcaccca cattcgagtt ggaagtacta aggttcctat gacctctgga
1621 gtcaaacaga agatcatgtc tgtcattcga gagtggggta gtggcagcga cacactgcga
1681 tgcctggccc tggccactca tgacaaccca ctgagaagag aagaaatgca ccttgaggac
1741 tctgccaact ttattaaata tgagaccaat ctgaccttcg ttggctgcgt gggcatgctg
1801 gatcctccga gaatcgaggt ggcctcctcc gtgaagctgt gccggcaagc aggcatccgg
1861 gtcatcatga tcactgggga caacaagggc actgctgtgg ccatctgtcg ccgcatcggc
1921 atcttcgggc aggatgagga cgtgacgtca aaagctttca caggccggga gtttgatgaa
1981 ctcaacccct ccgcccagcg agacgcctgc ctgaacgccc gctgttttgc tcgagttgaa
2041 ccctcccaca agtctaaaat cgtagaattt cttcagtctt ttgatgagat tacagctatg
2101 actggcgatg gcgtgaacga tgctcctgct ctgaagaaag ccgagattgg cattgctatg
2161 ggctctggca ctgcggtggc taaaaccgcc tctgagatgg tcctggcgga tgacaacttc
2221 tccaccattg tggctgccgt tgaggagggg cgggcaatct acaacaacat gaaacagttc
2281 atccgctacc tcatctcgtc caacgtcggg gaagttgtct gtattttcct gacagcagcc
2341 cttggatttc ccgaggcttt gattcctgtt cagctgctct gggtcaatct ggtgacagat
2401 ggcctgcctg ccactgcact ggggttcaac cctcctgatc tggacatcat gaataaacct
2461 ccccggaacc caaaggaacc attgatcagc gggtggctct ttttccgtta cttggctatt
2521 ggctgttacg tcggcgctgc taccgtgggt gctgctgcat ggtggttcat tgctgctgac
2581 ggtggtccaa gagtgtcctt ctaccagctg agtcatttcc tacagtgtaa agaggacaac
2641 ccggactttg aaggcgtgga ttgtgcaatc tttgaatccc catacccgat gacaatggcg
2701 ctctctgttc tagtaactat agaaatgtgt aacgccctca acagcttgtc cgaaaaccag
2761 tccttgctga ggatgccccc ctgggagaac atctggctcg tgggctccat ctgcctgtcc
2821 atgtcactcc acttcctgat cctctatgtc gaacccttgc cactcatctt ccagatcaca
2881 ccgctgaacg tgacccagtg gctgatggtg ctgaaaatct ccttgcccgt gattctcatg
2941 gatgagacgc tcaagtttgt ggcccgcaac tacctggaac ctgcaatact ggagtaaccg
3001 cttcctaaac cattttgcag aaatgtaagg gtgttcggtt gcgtgcatgt gcgtttttag
3061 caacacatct accaaccctg tgcatgactg atgttgggga aaaagaaaag taaaaaactt
3121 cccaactcac tttgtgttat gtggaggaaa tgtgtattac caatggggtt gttagctttt
3181 aaatcaaaat actgattaca gatgtacaat ttagcttaat cagaaagcct ctccagagaa
3241 gtttggtttc tttgctgcaa gaggaatgag gctctgtaac cttatctaag aacttggaag
3301 ccgtcagcca agtcgccaca tttctctgca aaatgtcata gcttatataa atgtacagta
3361 ttcaattgta atgcatgctt cggttgtaag tagccagatc cctctccagt gacattggaa
3421 catgctactt tttaattggc cctgtacagt ttgcttattt ataaattcat taaaaacact
3481 acaggtgttg aatggttaaa atgtaggcct ccagttcatt ttcagttatt ttctgagtgt
3541 gcagacagct atttcgcact gtattaaatg taacttattt aatgaaatca gaagcagtag
3601 acagatgttg gtgcaataca aatattgtga tgcatttatc ttaataaaat gctaaatgtc
3661 aatttatcac tgcgcatgtt tgactttaga ctgtaaatag agatcagttt gtttctttct
3721 gtgctggtaa caatgagcgt cgcacagaca tggtttcagg taaataaatc tattctatga
3781 t
//
LOCUS HUMHMG17G 7195 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human HMG-17 gene for non histone chromosomal protein HMG-17.
ACCESSION X13546
KEYWORDS HMG-17 chromosomal protein; HMG-17 gene; non histone protein.
SOURCE human (Homo sapiens).
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 7195; enum. 1 to 7195)
AUTHORS Landsman,D.
TITLE ;
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (sites; for [1])
AUTHORS Landsman,D., McBride,O.W. and Bustin,M.
TITLE Human non-histone chromosomal protein HMG-17: identification,
characterization, chromosome localization and RFLP's of a
functional gene from the large multigene family
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
COMMENT *source: cell line=MCF-7; library=EMBL3; clone=pH17g; **map:
chromosomal location=1p12-34.
See also HSHMG17 (M12623) for the cDNA sequence and HSHMG17P
(X06444) for a retropseudogene HMG-17, published by Landsman D.
and others.
EMBL features not translated to GenBank features:
key from to description
PRM 2315 2319 CAAT-box
PRM 2372 2377 TATA-box
TRANSCR 2402 5851 primary transcript
MSG 2402 2523 exon 1
IVS 2524 3369 intron I
MSG 3370 3414 exon 2
IVS 3415 3589 intron II
MSG 3590 3619 exon 3
IVS 3620 3969 intron III
MSG 3970 4020 exon 4
IVS 4021 4456 intron IV
MSG 4457 4552 exon 5
IVS 4553 4997 intron V
MSG 4998 5851 exon 6
[1] Author address Landsman D., Laboratory of Molecular
Carcinogenesis, National Institutes of Health, Building 37, Room
3D20, Bethesda, MD 20892, U.S.A__
Submitted (12-NOV-1988) on tape to the EMBL data library by:
FEATURES from to/span description
pept 2509 2523 put. HMG-17 protein (AA 1-5)
/hgml_locus_uid="LY0033Z" /map="1p36.1-p35"
/nomgen="HMG17"
3370 3414 put. HMG-17 protein (AA 6-20)
3590 3619 put. HMG-17 protein (AA 21-30)
3970 4020 put. HMG-17 protein (AA 31-47)
4457 4552 put. HMG-17 protein (AA 48-79)
4998 5033 put. HMG-17 protein (AA 80-90)
BASE COUNT 1664 a 1754 c 1809 g 1968 t
ORIGIN
1 aagcttgttc taggtcctct gcttgtgagg gtcaaagctg tgtcctttcc cttacctccc
61 tctgccagga cttgctgcag agctgctgag aggattagtg cctttgaaga gctgtctgcc
121 tgagcaactc tatttcaggt gccccacacc ggcaagtacc agccagcaac accaaccaaa
181 tgctactctc tttaaagtcc attttccttc ttcttttttt tttttttttt ttggagacag
241 catctcaccc tgtcctggct ggtctcgaac tcctgacctc aggtgatccg ccggcctcac
301 cctcccaaag tgctgggatt acaggcatga gccaccatgc cagacttccc attttacttt
361 ctgcaagctg tttccctagc agctccctct aggggagagg tgaaatcttg caagttgtag
421 caagagcaca caggaaaccc ctaactttcc tataccccac ccgcctcttc ccctttctgt
481 cccgggatac tggcggcaag agacttcttg gctattgtcc atgctcccag aatcaagcat
541 aaatgccaga cacggcgatt gagaagccaa tcagtgaacc ctttgcaaag ccccatccac
601 acctggcact cccctctacc aatccctggc acagggttcc tggagagcag gtgctgtaca
661 ttttacagct ttacaatggg gctgttgaca gccataatta gggaggcatg aattatgcgg
721 ctataatgca gagccctaca attaaggcgg gaatgagggg ctggaggcag caaacggaat
781 ctgccctatg agcgtggctg ttgagtcctg tctcctgggt ctgactttcc gtaatatgat
841 tggggtacag tagaggtgat taatggggct ggcatctctc tttggcctga ggttctgtat
901 tctgggaaag gtatacaggg tggagtaggg agaagctgcc ccaggaggcg atgtagtggt
961 ggaaagaaga ggcagagagg tcgtcgtcgt cgcccagcag caagggctgc aaaatagtag
1021 aactcgtggt tgctttggac aggtgtgatt tgtgcaagcc aggttcaacc cttgcctcaa
1081 gaaatcagat gggaccaatt tagtgtcctt ccacctgtga gccaagcccc catttgagga
1141 catctatcgt attcttgtgt gctgggtctc aaatagaatt tttaaagatt cttagatgta
1201 aaacttgttt gctaactgca acggaagaac accttctttt gctgctgcaa ttgcctaccc
1261 tccctcaggt gcacttggct agctccaaga ccctggcttt ggactagggg gcaccggaag
1321 agcccttttg ccatcacacc acctgttcct ctacttgttt cctaatcccc gggcgctttc
1381 ctccaccctc ctttcctcgc gcaagtgaaa ggcagccagc ttgacagtgt ggaaggaggg
1441 atttcgccag ctgcggctgc agtgcccagg cttggtggcc gcgtcgggtt gggcagctgg
1501 gcaaaatgtc tccgagaggg gtagagtgta gggggaagtg agtgctggta gacctcgacg
1561 gccgctctcc accatactgt ggcggggtct ggaggtttgg gaatacaggg accttaaggg
1621 agccacccga gaggagttgg gggtgggggc ttcgatagca tttggggagg cggctgcgca
1681 acgggtgctg aaaggacagc tcctacctga cccacggcac ccaaaaggca cctctgcatg
1741 gaaagtctcc ccctgaaacc cctagcccac ctcgagtggg accgtctttc aggtgaggtg
1801 cacgaaaggc ttctccgaag tggcagctcg gaaggatgca cgtccggctg cgccaagctg
1861 gtttccccgc ctttttccca cttcggcgtc agcctgggct ctgggaactg cagcctgtgc
1921 ctccagccgc gcgcttcctc ctgcttccag aggagcaggg cagtgcagcc aggcctcgca
1981 aacccctctt ctggggcccc cagcgggagc aggaaggaaa gccgctgaga tgcgccgtga
2041 gtgtcgcacg gcttggagaa gtgggttgct gggttgcata aggtggaagg aacttgctag
2101 tttgcaaaag acagctcttg gcgggagccg gaacccggag cccgccgact cgcgcacccc
2161 tcctcattta ctgccttcga cgcgcgccac cccgactgcg cccccacccc ctcggcgcgc
2221 cgccgtctcc gcgcactacc ccccccacac acccccctcc tcactccctc cagaggaggt
2281 gagtttaaac cccgcccacg tgaccccagc tgggccaatg aacggcggcg ggaggtgaaa
2341 tcggttctaa ccggtccggg gctcccagcg ctataaaaac tttataaacc ccccggagcc
2401 cgagcagtgt gaagaagagg cgagaacgac ccccggaccg accaaagccc gcgcgccgct
2461 gcatcccgcg tccagcacct acgtcccgct gccgtcgccg ccgccaccat gcccaagaga
2521 aaggtacgtg gcgcgagggc cccaggcgcc gggccaccac tgccgccacc gccgccgccg
2581 cctccctggt gcagggagcg agaatcggcg ccgagcagga gccagcgcac cgtccccgcg
2641 cgggggctgg agacggtgtc gggcagctcg gggctaaccc tgagcggctc ggctgcccgc
2701 gggcgccaga ggccatattg gaggagcggc ggccgcggcg ggaggagcca tgttggcggc
2761 tgtttatccc gctctcctcg ctcgccgccc ccgccccgtg ccccctcccc catcgcgagg
2821 gccccttccc tccctctgcg ggcgggcaat tcaaacccga aagggcggga aggcggcgct
2881 cggggttggc gggcggggga aagcgtgccg ccaaaaaacc gccgccgtga ggcggggctt
2941 gtgcggtatg gccccgcccc ctcgcccacg ttccccgcgc acgagacgcg cgctgtcgcc
3001 gcccacgagt tccccgggct gcgcgcgcct ccctctcccg ccctcgacgg ctgccatagc
3061 aacggcgctg ggctccgctc cggagggggt ttgtttgcgc catctgcagc tgttgctcct
3121 gcctgtcgcg gtggtgcggg ctccgctgcc ctccccggct gcgctccggt ccagccctcg
3181 cttctcgggg tcggcgagcc ggagctcctg cgcgcgcttc gttcttatac gaacgtcggg
3241 ctcactcatt tatgtcctag aaaagttgtg tggacgactg ctttaatttt catttttagc
3301 actctaaagt ttgggaagta attaagaacc acctcaaaat ctgcagtttt ttgttcttgt
3361 ttcttatagg ctgaagggga tgctaaggga gataaagcaa aggtgaagga cgaagtaagt
3421 cattctctct tcaagggtca aagccttgga ctagcagagg ccactggact cggtgattaa
3481 ccgtaacctg tgtcctgaat ttacactcct ataatctaga gcaaattgat accaaacttt
3541 caaagcgact tacctgtcct atttctaact ttctcgttgt ctttaatagc cacagagaag
3601 atccgcgagg ttgtctgctg taagtgtatg cttttgaatt ttcgtgcttg tccctgaaac
3661 taaaaaacat caaaaaacaa ttccctttgc ttccatgaat tatggttagt gcctggtttt
3721 gaatcattgc ctctacttgg gactcttgcc ctttgggttt tgctggttct gaaattctga
3781 tgcctgtagc caaagtggga catttgagtg ggcttctgga aatccaacat tctagaagaa
3841 agccaaccac aaaaactttg aggaggagga gaaacttctc tacccttggt acttttggtt
3901 ggttgtgggt ggttttcttc agtccattgt actgatgttc actttttcct ctcttcctgc
3961 caaaaaaaga aacctgctcc tccaaagcca gagcccaagc ctaaaaaggc ccctgcaaag
4021 gtaagtgcta acattggaac tgatcatttt cacagaatga ggactgtcct tagtgcctta
4081 acttaattag cataatggtg cctccattaa tggaggttat aaactgtgtg gatagcttac
4141 ctcgtccgtg tcattcataa cgttggtttc ctgatcaaga attctgttgt tagtttatag
4201 gaaaatctgc atttaaatat taaatacccc agggtttaaa ctgatttatt cttctgttac
4261 ttaggaaatt ctaagtacta aagattatag caaattacag ataatttagc ctagttttga
4321 tcagttgttc ttgtcaaatg ggtgagggtt tgtgttttat cttgagtaaa caatcctacc
4381 ttgtgcagaa ctttgcagac cataccttgg taatacgaaa tggggagaaa cagttctatt
4441 ttttcccctt tttcagaagg gagagaaggt acccaaaggg aaaaagggaa aagctgatgc
4501 tggcaaggag gggaataacc ctgcagaaaa tggagatgcc aaaacagacc aggtataact
4561 gctgtttcac cctttgttag atttgttcat tcagttagtt gctgatatca aaaatttaat
4621 tgcccttttc ttgtatcact ccaaatgtac catttggtcc agtgtgcttg tggctttcct
4681 gttaacttaa atcctggatt cttgaaatct ctactgtcag ctgaagggca ttgtgtttta
4741 tactacatga agttttcaag ctaggagtaa cctcaggtct cttattctgg tgttcttcct
4801 acctcaattt tacagtgagg cccagaggag aggtgactta gcaaagttac ctagtaagtc
4861 attctcagaa gaaatactga gtctcagtac ctgaaacctg aactttggcc tgggagcaaa
4921 gtgatgctgt aggtggaatg tgaacacaga tagttttgaa atctacgcat tgcattaatt
4981 tgtctgtttt cttttaggca cagaaagctg aaggtgctgg agatgccaag tgaagtgtgt
5041 gcatttttga taactgtgta cttctggtga ctgtacagtt tgaaatacta ttttttatca
5101 agttttataa aaatgcagaa ttttgtttta cttttttttt ttttttaaaa gctatgttgt
5161 tagcacacag aacacttcat tgttgttttt gggggaaggg gcatatgtca ctaatagaat
5221 gtctccaaag ctggattgat gtggagaaaa cacctttccc ttctagtttt gagagacttc
5281 ctcttggctc ccaggaggag ggattccctg actttgacac acatggccac cttggcacaa
5341 aagccttgtg gtatagaaaa acaaatttgt ttttatgtcc tcttctccct ttccatcttt
5401 cagcatagac ttaactccct taagcccaga catctgttga gacctgaccc ctagtcattg
5461 gttaccagtg tgtcaggcaa tctggacttt ccagtgatgc cactgagatg gcacctgtca
5521 aaagagcagt ggttccattt ctagattgtg gatcttcaga taaattctgc cattttcatt
5581 tcacttcctg aaagtcaggg tcggcttgtg aaaagttgtt aaacaacatg ctaaatgtga
5641 aatgtcaacc ctcactctaa actttccctg ttcagagcat cagatgaaga cttcattggg
5701 ttttatagtg gctttctgat ttttggtagt ccattgaaga agggagtttg aaagttgttg
5761 tatactgtta acgattgtct gcccatgtcc tgcctgaaat accatgattg tttatggaaa
5821 gtatctttaa taaagctgga tacagtttgg cttggaatgc tgcctctgat cttttccaca
5881 agtggggtaa cctggtttat ccaagtctct tggaacaggg tacgttctgc tttgaggtac
5941 tccatgtaca gtctaatgat gatctttcac tgatttatag tcagcttcca aaacacacag
6001 tgctaggtgc agttaggaag gattccagga gtagacttag ctgatggtga aaggctggga
6061 gtatggagtg atttctgtac ttggttatgc agcattccct gcagtgtttg ttttttgcat
6121 gactccgaat acatgaagtg tattaaatgc agatttttgg tagatctgcg gttttggctg
6181 ggtagctgac aggagcctct ctacttccat ctgcatttgc aacttctata aaattagcaa
6241 attaatagac tcatgtcaga gagtgtctct gtgaatgtgt acacagttgt ccatttgtaa
6301 ggtagatgac actttgctcc taagtagacc tgcaaacaaa gacaatgggg tctttcctga
6361 ttcctttagt ttggagatgc agagaacccc acgtttctct aagtgaaatg gcaagaagtt
6421 atttgttaga atatggaact ggtaacaaat tcatgtttgc accttgaatt tgctttcaca
6481 gacctggctg tattgttgga caggaataaa gtaaattgga gactaattgc tttgtccatt
6541 cccttagtca aaaaagtatc gctgttaagt tcttgaggtt ttttgtgtat tatctctcaa
6601 gtgttgagat gtttgtgtaa tatatgtatc acactcatgg atgaggaatc aggcacagat
6661 gaaacttgtt gaaggttatg tgtacattta atgagcagca gagtcaggat tcagaccctc
6721 aactagacaa aagtcatctc tgacagactt ttattcagga taaaatacca tttgtccatt
6781 tagccagtta ccagccattt actttcatta ttgctgaagt cattcattct aggttgtaga
6841 cgggtaattt ttattctaaa agcggtgtcc tcttccagag gaaacttaag agacatctta
6901 gaatggtcag gggtagtggg ctggtatttg agcagaggtt gcagaatttg agaaagttac
6961 agctgaggtt ctttgttagt ggcaagtctg ctaagccctg ggctatgcta acagctctgc
7021 ctcatgctct ttaatctttg agtcagacct atgagataca ggtgtagccc cattttgtag
7081 tcgagagtga ggctcagatg tgaagcgact ggctgaggtt aaacttgata gtctgtttac
7141 ttattttctg gccctcttct tgttgccata taattaggat gagtagggta agctt
//
LOCUS HUMIGLVB 464 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human Ig germline lambda-chain (autoantibody Humv117) gene,
V-region, partial cds.
ACCESSION M30446
KEYWORDS V-region; anti-DNA antibody; autoantibody; germline;
immunoglobulin light chain; lambda-immunoglobulin.
SOURCE Human retinoblastoma cell line Y79 DNA, (library of W.-H.Lee).
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 464)
AUTHORS Siminovitch,K.A., Misener,V., Kwong,P.C., Song,Q.-L. and Chen,P.P.
TITLE A natural autoantibody is encoded by germline heavy and lambda
light chain variable region genes without somatic mutation
JOURNAL J. Clin. Invest. 84, 1675-1678 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 44 / 349 Ig lambda chain V-region precursor (AA at 46)
sigp / 44 54 Ig lambda chain signal peptide (AA at 46)
matp 55 / 349 Ig lambda chain
IVS < 1 43 Ig lambda-chain intron
iDNA 350 > 464 intervening DNA (5' end approx.)
signal 351 357 7 mer recombination signal
signal 381 389 9 mer recombination signal
BASE COUNT 99 a 150 c 118 g 97 t
ORIGIN
1 tctagaccaa gaatcaccgt gtctgtgtct ctcctgcttc cagggtcctg ggcccagtct
61 gtgttgacgc agccgccctc agtgtctgcg gccccaggac agaaggtcac catctcctgc
121 tctggaagca gctccaacat tgggaataat tatgtatcct ggtaccagca gctcccagga
181 acagccccca aactcctcat ctatgaaaat aataagcgac cctcagggat tcctgaccga
241 ttctctggct ccaagtctgg cacgtcagcc accctgggca tcaccggact ccagactggg
301 gacgaggccg attattactg cggaacatgg gatagcagcc tgagtgctgg cacagtgctc
361 cagcccaatg gggaactgag acaagaaccc ccttcttcct cccccaggag ggtgagtgcc
421 gccagctgct gctcacgcct gacctgtagc ttctgctgct gcag
//
LOCUS HUMIL71 2195 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human interleukin 7 (IL7) gene, exon 1.
ACCESSION M29048
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 1 of 6
SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
to mRNA, clone 3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2195)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept 2167 + 2176 interleukin 7, exon 1
pre-msg < 1783 > 2195 IL7 mRNA and introns
IVS 2177 > 2195 IL7 intron A
BASE COUNT 570 a 543 c 498 g 584 t
ORIGIN 1783 bp upstream of EcoRI site.
1 ctaggcgtgt ttcattatag aaatcatttt taaaactcac tccagtgtca tgtctgtcct
61 cctccccacc tcccactcta ggttccagtc aaatggaaat atgtggcttt tctcactcca
121 ttctgtctct cttttccatt tgcttacccc cttcgcttcc tctttcattc tatttccttc
181 tgctttcatg aactttctct ttcacacata ttcactgatg cattcccact cacctgttct
241 ctcttactct gtcacccccc agtacttact gaatgccagc catgggctaa agaccatagt
301 aagtggacgc tgggcaggaa gagaaaggcc attgtggctg gaggatagtg atgtggtacc
361 acgaaaacgt cagagagggc agtggggccc agatcaccag ggcactgagg gcctggtacc
421 cagtgggaag tgaccagaag gtggggcaca aaatagaatc agatatcctg ctgctctgag
481 aagaatgaag gggttgtggg caaaagtgaa agctggaaga tcccttagac agtctctata
541 gtcaccttag ggttaatttt ggaggtgggg cagacaggac ttgtctatgg atttgatgtg
601 ggaaatgaga ccaaaaaagg aatcagagac ccaagtttca ggcttgagca actctgttct
661 ccctagttca ttgccatcct tttttttcct ttctccttcc ctcctcttcc ttgttcacgc
721 tatagcagaa tgtgacatac tccttaccag ccccaagttc attcctcttc aagtgtcagt
781 ttagaagtca cagactccag gaaactttcc tgaagtcccc acactggggc aaagccacac
841 acgtactgtc atctccctcc cggcatcagt gtgcacatta ttactttatt cacaggagtt
901 ccaagctgga gtttctacag cccaaatgca gcaagtctgt gtctaagctc agttagctaa
961 gtgaaatttg gaaagttact taaggaagtc tcagtcccca catctgtaaa gtgcagataa
1021 gaaaatggtc tacctcacac aaagttttgc atgagatgat atacataaaa tgcttagcac
1081 aaaactggca gagagcaaat gctagataaa tgatagtcgt tattattatc gctgttgtta
1141 ctggtttaca ttatccacct tcatctaagc accctttctg cagaatagca gaaaccaaac
1201 taatgtagca aataagctac ataattcaag cccaggaaaa agttaacatt tcagtggcat
1261 gcattcaaga cgaatagttt gatttattag ccaattcaga taaatgtgca cgtggaagtc
1321 atagttaaat attatcgtca gtttccacgt cctgcgttta atttggggtt tgattttcca
1381 aatacaacac ttaccagatt aggtggaccc acaggattat ttttccttga ggtctcacct
1441 gagcaggtgc atgtacagca gacggagcag aaagagactg attagagagg ttggagtggt
1501 agagggcgtg accctcttaa tcattcttca cttccttttt taaaagacga cttggcatcg
1561 tccaccacat ccgcggcaac gcctccttgg tgtcgtccgc ttccaataac ccagcttgcg
1621 tcctgcacac ttgtggcttc cgtgcacaca ttaacaactc atgggtctag ctcccagtcg
1681 ccaagcgttg ccaaggcgtt gagagatcat ctgggaagtc ttttacccag aattgctttg
1741 attcaggcca gctggttttt ctgcggtgat tcggaaattc gcgaattcct ctggtcctca
1801 tccaggtgcg cgggaagcag gtgcccagga gagaggggat aatgaagatt ccatgctgat
1861 gatcccaaag attgaacctg cagaccaagc gcaaagtaga aactgaaagt acactgctgg
1921 cggatcctac ggaagttatg gaaaaggcaa agcgcagagc cacgccgtag tgtgtgccgc
1981 cccccttggg atggatgaaa ctgcagtcgc ggcgtgggta agaggaacca gctgcagaga
2041 tcaccctgcc caacacagac tcggcaactc cgcggaagac cagggtcctg ggagtgacta
2101 tgggcggtga gagcttgctc ctgctccagt tgcggtcatc atgactacgc ccgcctcccg
2161 cagaccatgt tccatggtaa gcgctcttct ccctt
//
LOCUS HUMIL72 177 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human interleukin 7 (IL7) gene, exon 2.
ACCESSION M29049
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 2 of 6
SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
to mRNA, clone 3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 177)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept + 21 + 157 interleukin 7, exon 2
pre-msg < 1 > 177 IL7 mRNA and introns
IVS < 1 20 IL7 intron A
IVS 158 > 177 IL7 intron B
BASE COUNT 43 a 26 c 34 g 74 t
ORIGIN About 3 kb after segment 1.
1 ttatttactg tcaaatttag tttcttttag gtatatcttt ggacttcctc ccctgatcct
61 tgttctgttg ccagtagcat catctgattg tgatattgaa ggtaaagatg gcaaacaata
121 tgagagtgtt ctaatggtca gcatcgatca attattggta tgtgattatt ttgtttt
//
LOCUS HUMIL73 121 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human interleukin 7 (IL7) gene, exon 3.
ACCESSION M29050
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 3 of 6
SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
to mRNA, clone 3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept + 21 + 101 interleukin 7, exon 3
pre-msg < 1 > 121 IL7 mRNA and introns
IVS < 1 20 IL7 intron B
IVS 102 > 121 IL7 intron C
BASE COUNT 44 a 10 c 22 g 45 t
ORIGIN Undetermined number of bp after segment 2.
1 ttttatgtta tttattacag gacagcatga aagaaattgg tagcaattgc ctgaataatg
61 aatttaactt ttttaaaaga catatctgtg atgctaataa ggtaatgata attatttgga
121 g
//
LOCUS HUMIL74 157 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human interleukin 7 (IL7) gene, exon 4.
ACCESSION M29051
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 4 of 6
SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
to mRNA, clone 3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 157)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept + 21 + 152 interleukin 7, exon 4
pre-msg < 1 > 157 IL7 mRNA and introns
IVS < 1 20 IL7 intron C
IVS 153 > 157 IL7 intron D
BASE COUNT 45 a 28 c 32 g 52 t
ORIGIN About 1.5 kb after segment 3.
1 ctgacttttt cctataatag gaaggtatgt ttttattccg tgctgctcgc aagttgaggc
61 aatttcttaa aatgaatagc actggtgatt ttgatctcca cttattaaaa gtttcagaag
121 gcacaacaat actgttgaac tgcactggcc aggtaag
//
LOCUS HUMIL75 94 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human interleukin 7 (IL7) gene, exon 5.
ACCESSION M29052
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 5 of 6
SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
to mRNA, clone 3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 94)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept + 21 + 74 interleukin 7, exon 5
pre-msg < 1 > 94 IL7 mRNA and introns
IVS < 1 20 IL7 intron D
IVS 75 > 94 IL7 intron E
BASE COUNT 33 a 13 c 22 g 26 t
ORIGIN About 1 kb after segment 4.
1 aatgtgactt tgtttttaag gttaaaggaa gaaaaccagc tgccctgggt gaagcccaac
61 caacaaagag tttggtgaga ataattgtat aatt
//
LOCUS HUMIL76 1672 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human interleukin 7 (IL7) gene, exon 6.
ACCESSION M29053
KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin;
pre-B cell growth factor.
SEGMENT 6 of 6
SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
to mRNA, clone 3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1672)
AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
Cosman,D. and Goodwin,R.G.
TITLE Characterization of the human and murine interleukin-7 genes
JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Lupton, 26-SEP-1989.
FEATURES from to/span description
pept + 21 140 interleukin 7, exon 6
pre-msg < 1 811 IL7 mRNA and introns
IVS < 1 20 IL7 intron E
signal 753 758 polyA signal
signal 1060 1065 polyA signal
BASE COUNT 586 a 266 c 258 g 562 t
ORIGIN About 0.2 kb after segment 5.
1 tttaaaactc tattctctag gaagaaaata aatctttaaa ggaacagaaa aaactgaatg
61 acttgtgttt cctaaagaga ctattacaag agataaaaac ttgttggaat aaaattttga
121 tgggcactaa agaacactga aaaatatgga gtggcaatat agaaacacga actttagctg
181 catcctccaa gaatctatct gcttatgcag tttttcagag tggaatgctt cctagaagtt
241 actgaatgca ccatggtcaa aacggattag ggcatttgag aaatgcatat tgtattacta
301 gaagatgaat acaaacaatg gaaactgaat gctccagtca acaaactatt tcttatatat
361 gtgaacattt atcaatcagt ataattctgt actgattttt gtaagacaat ccatgtaagg
421 tatcagttgc aataatactt ctcaaacctg tttaaatatt tcaagacatt aaatctatga
481 agtatataat ggtttcaaag attcaaaatt gacattgctt tactgtcaaa ataattttat
541 ggctcactat gaatctatta tactgtatta agagtgaaaa ttgtcttctt ctgtgctgga
601 gatgttttag agttaacaat gatatatgga taatgccggt gagaataaga gagtcataaa
661 ccttaagtaa gcaacagcat aacaaggtcc aagataccta aaagagattt caagagattt
721 aattaatcat gaatgtgtaa cacagtgcct tcaataaatg gtatagcaaa tgttttgaca
781 tgaaaaaagg acaatttcaa aaaaataaaa taaaataaaa ataaattcac ctagtctaag
841 gatgctaaac cttagtactg agttacattg tcatttatat agattataac tgtctaaata
901 agtttgcaat ttgggagata tatttttaag ataataatat atgtttacct tttaattaat
961 gaaatatctg tatttaattt tgacactata tctgtatata aaatattttc atacagcatt
1021 acaaattgct tactttggaa tacatttctc ctttgataaa ataaatgagc tatgtattaa
1081 cactgccaga ttcagttaat aaatctcaac agaattttta aggtgagatt tttaatactt
1141 cactgctctt taattttcta ctttcattga atatatgacc tttaatagcc tattaccaac
1201 aatgaatatg agacattacc ttctatgtta aattttagtc ttttaaaaga aacttgtctg
1261 gaaatgtctg cagtcttcat tcactgataa agggaaatgg aaacactatt ctaagcaggc
1321 cagaatctac aaatagctgt cagtttgcat tgcactgaac tcacatcctt catgctctct
1381 cagcctcatc accatctagt tatgtgctga tgggctttat ttgatctaca gaagatgctc
1441 tggtgacacc ctcagtgtgt gttggtaaca ccttcctgcc ttctctcact ttccttgacc
1501 catatgacat ttcaccttca ctggtgcgat agctgtatag aattgtatac acttccatac
1561 aatttactga actgccccat taacaggact ttgagtcaag tatttttttc tgtttggtta
1621 tcactgttaa ctccagaaag atgagttgac taacattttt tagaagcaaa tg
//
LOCUS HUMINSRMUT 349 bp ds-DNA PRI 15-DEC-1989
DEFINITION Human insulin receptor gene allele C, exon 17, with an Alu
insertion.
ACCESSION M27196
KEYWORDS Alu repetitive element; insulin receptor.
SOURCE Human (patient 1, haplotype C) white bloodcell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Taira,M., Taira,M., Hashimoto,N., Shimada,F., Suzuki,Y.,
Kanatsuka,A., Nakamura,F., Ebina,Y., Tatibana,M., Makino,H. and
Yoshida,S.
TITLE Human diabetes associated with a deletion of the tyrosine kinase
domain of the insulin receptor
JOURNAL Science 245, 63-66 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by M.Taira, 02-SEP-1989.
FEATURES from to/span description
pept.ps / 40 341 pseudo-insulin receptor, exon 17 (AA at 42)
/nomgen="INSR" /map="19p13.1-p13"
/hgml_locus_uid="LG0007M"
IVS < 1 39 INSR intron P
rpt 145 231 Alu repeat (partial)
rpt 233 349 Alu repeat (partial)
BASE COUNT 99 a 81 c 89 g 80 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcctcca aggatgctgt gtagataagt aagaagtagt gtttccatgc tctgtgtacg
61 tgccggacca gtgggaggtg tctcgagaga agatcaccct ccttcgagag ctggggcagg
121 gctccttcgg catggtgtat gaggctggtc tcgaactccc cacctcaggt gatccgcgtg
181 ccttggcctc ccaaagtggt gggattacag gcgtgagcca ccacgcccag cccctgcact
241 gtggttttta tcaccacagt gaagggaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaagct
301 cttttaactc tatgctttgt gtccactttc caaaagaatg aaaatagat
//
LOCUS HUMPDGFRB 400 bp ss-mRNA PRI 15-MAR-1990
DEFINITION Human platelet-derived growth factor beta mRNA fragment.
ACCESSION M30493
KEYWORDS platelet-derived growth factor.
SOURCE Human embryonic lung fibroblast, cDNA to mRNA, clone HPR5.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 400)
AUTHORS Miki,T., Matsui,T., Heidaran,M.A. and Aaronson,S.A.
TITLE An efficient directional cloning system to construct cDNA libraries
containing full-length inserts at high frequency
JOURNAL Gene 83, 137-146 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
mRNA < 1 > 400 platelet-derived growth factor beta mRNA
/nomgen="PDGFRB" /map="5q33-q35"
/hgml_locus_uid="LW0046Z"
BASE COUNT 70 a 144 c 112 g 74 t
ORIGIN Chromosome 5q33-q35.
1 gccggcccct cagccctgct gcccagcacg agcctgtgct cgccctgccc aacgcagaca
61 gccagaccca gggcggcccc tctggcggct ctgctcctcc cgaaggatgc ttggggagtg
121 aggcgaagct gggcgctcct ctcccctaca gcagccccct tcctccatcc ctctgttctc
181 ctgagccttc aggagcctgc accagtcctg cctgtccttc tactcagctg ttaccaactc
241 tgggaccagc agtctttctg ataactggga gagggcagta aggaggactt cctggagggg
301 gtgactgtcc agagcctgga actgtgccca caccagaagc catcagcagc aaggacacca
361 tgcggcttcc gggtgcgatg ccagctctgg ccctcaaagg
//
LOCUS HUMPDGFRC 150 bp ss-mRNA PRI 15-MAR-1990
DEFINITION Human platelet-derived growth factor alpha mRNA fragment.
ACCESSION M30494
KEYWORDS platelet-derived growth factor.
SOURCE Human embryonic lung fibroblast, cDNA to mRNA, clone TR4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 150)
AUTHORS Miki,T., Matsui,T., Heidaran,M.A. and Aaronson,S.A.
TITLE An efficient directional cloning system to construct cDNA libraries
containing full-length inserts at high frequency
JOURNAL Gene 83, 137-146 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
mRNA < 1 > 150 platelet-derived growth factor alpha mRNA
/nomgen="PDGFRA" /map="4q11-q13"
/hgml_locus_uid="LR0223Z"
BASE COUNT 40 a 30 c 47 g 33 t
ORIGIN Chromosome 4q11-q13.
1 ccattactgt tggagctaca gggagagaaa caggaggaga ctgcaagaga tcatttggga
61 aggccgtggg cacgctcttt actccatgtg tgggacattc attgcggaat aacatcggag
121 gagaagtttc ccagagctat ggggacttcc
//
LOCUS HUMPSAA 7130 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human prostate specific antigen gene, complete cds.
ACCESSION M27274
KEYWORDS Alu repetitive element; kallikrein; prostate specific antigen.
SOURCE Human peripheral leukocyte DNA, (library of R.Wydro), clones
lambda-PSA.[24A,25A,25B].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 7130)
AUTHORS Lundwall,A.
TITLE Characterization of the gene for prostate-specific antigen, a human
glandular kallikrein
JOURNAL Biochem. Biophys. Res. Commun. 161, 1151-1159 (1989)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Lundwall, 03-NOV-1989.
FEATURES from to/span description
pept 675 720 prostate specific antigen, exon 1
/nomgen="APS" /map="19q13.3-qter"
/hgml_locus_uid="LN0098S"
1959 2118 prostate specific antigen, exon 2
3756 4042 prostate specific antigen, exon 3
4186 4322 prostate specific antigen, exon 4
5699 5854 prostate specific antigen, exon 5
pre-msg 634 6491 PSA mRNA and introns
IVS 721 1958 PSA intron A
IVS 2119 3755 PSA intron B
IVS 4043 4185 PSA intron C
IVS 4323 5698 PSA intron D
rpt 2583 2935 Alu repeat
signal 503 512 CACCC box
signal 605 611 TATA box
binding 581 586 SpI binding site
BASE COUNT 1530 a 2024 c 1867 g 1709 t
ORIGIN 1 bp upstream of EcoRI site; chromosome 19q13.3-qter.
1 gaattccaca ttgtttgctg cacgttggat tttgaaatgc tagggaactt tgggagactc
61 atatttctgg gctagaggat ctgtggacca caagatcttt ttatgatgac agtagcaatg
121 tatctgtgga gctggattct gggttgggag tgcaaggaaa agaatgtact aaatgccaag
181 acatctattt caggagcatg aggaataaaa gttctagttt ctggtctcag agtggtgcag
241 ggatcaggga gtctcacaat ctcctgagtg ctggtgtctt agggcacact gggtcttgga
301 gtgcaaagga tctaggcacg tgaggctttg tatgaagaat cggggatcgt acccaccccc
361 tgtttctgtt tcatcctggg catgtctcct ctgcctttgt cccctagatg aagtctccat
421 gagctacaag ggcctggtgc atccagggtg atctagtaat tgcagaacag caagtgctag
481 ctctccctcc ccttccacag ctctgggtgt gggagggggt tgtccagcct ccagcagcat
541 ggggagggcc ttggtcagcc tctgggtgcc agcagggcag gggcggagtc ctggggaatg
601 aaggttttat agggctcctg ggggaggctc cccagcccca agcttaccac ctgcacccgg
661 agagctgtgt caccatgtgg gtcccggttg tcttcctcac cctgtccgtg acgtggattg
721 gtgagagggg ccatggttgg ggggatgcag gagagggagc cagccctgac tgtcaagctg
781 aggctctttc ccccccaacc cagcacccca gcccagacag ggagctgggc tcttttctgt
841 ctctcccagc cccacttcaa gcccataccc ccagcccctc catattgcaa cagtcctcac
901 tcccacacca ggtccccgct ccctcccact taccccagaa ctttctcccc attgcccagc
961 cagctccctg ctcccagctg ctttactaaa ggggaagttc ctgggcatct ccgtgtttct
1021 ctttgtgggg ctcaaaacct ccaaggacct ctctcaatgc cattggttcc ttggaccgta
1081 tcactggtcc atctcctgag cccctcaatc ctatcacagt ctactgactt ttcccattca
1141 gctgtgagtg tccaacccta tcccagagac cttgatgctt ggcctcccaa tcttgcccta
1201 ggatacccag atgccaacca gacacctcct tcttcctagc caggctatct ggcctgagac
1261 aacaaatggg tccctcagtc tggcaatggg actctgagaa ctcctcattc cctgactctt
1321 agccccagac tcttcattca gtggcccaca ttttccttag gaaaaacatg agcatcccca
1381 gccacaactg ccagctctct gattccccaa atctgcatcc ttttcaaaac ctaaaaacaa
1441 aaagaaaaac aaataaaaca aaaccaactc agaccagaac tgttttctca acctgggact
1501 tcctaaactt tccaaaacct tcctcttcca gcaactgaac ctggccataa ggcacttatc
1561 cctggttcct agcacccctt atcccctcag aatccacaac ttgtaccaag tttcccttct
1621 cccagtccaa gaccccaaat caccacaaag gacccaatcc ccagactcaa gatatggtct
1681 gggcgctgtc ttgtgtctcc taccctgatc cctgggttca actctgctcc cagagcatga
1741 agcctctcca ccagcaccag ccaccaacct gcaaacctag ggaagattga cagaattccc
1801 agcctttccc agctccccct gcccatgtcc caggactccc agccttggtt ctctgccccc
1861 gtgtcttttc aaacccacat cctaaatcca tctcctatcc gagtccccca gttccccctg
1921 tcaaccctga ttcccctgat ctagcacccc ctctgcaggc gctgcgcccc tcatcctgtc
1981 tcggattgtg ggaggctggg agtgcgagaa gcattcccaa ccctggcagg tgcttgtggc
2041 ctctcgtggc agggcagtct gcggcggtgt tctggtgcac ccccagtggg tcctcacagc
2101 tgcccactgc atcaggaagt gagtaggggc ctggggtctg gggagcaggt gtctgtgtcc
2161 cagaggaata acagctgggc attttcccca ggataacctc taaggccagc cttgggactg
2221 ggggagagag ggaaagttct ggttcaggtc acatggggag gcagggttgg ggctggacca
2281 ccctccccat ggctgcctgg gtctccatct gtgtccctct atgtctcttt gtgtcgcttt
2341 cattatgtct cttggtaact ggcttcggtt gtgtctctcc gtgtgactat tttgttctct
2401 ctctccctct cttctctgtc ttcagtctcc atatctcccc ctctctctgt ccttctctgg
2461 tccctctcta gccagtgtgt ctcaccctgt atctctctgc caggctctgt ctctcggtct
2521 ctgtctcacc tgtgccttct ccctactgaa cacacgcacg ggatgggcct ggggggaccc
2581 tgagaaaagg aagggctttg gctgggcgcg gtggctcaca cctgtaatcc cagcactttg
2641 ggaggccaag gcaggtagat cacctgaggt caggagttcg agaccagcct ggccaactgg
2701 tgaaacccca tctctactaa aaatacaaaa aattagccag gcgtggtggc gcatgcctgt
2761 agtcccagct actcaggagg ctgagggagg agaattgctt gaacctggga ggttgaggtt
2821 gcagtgagcc gagaccgtgc cactgcactc cagcctgggt gacagagtga gactccgcct
2881 caaaaaaaaa aaaaaaaaaa aaaaaaaaaa agaaaagaaa agaaaagaaa aggaatcttt
2941 tatccctgat gtgtgtgggt atgagggtat gagagggccc ctctcactcc attccttctc
3001 caggacatcc ctccactctt gggagacaca gagaagggct ggttccagct ggagctggga
3061 ggggcaattg agggaggagg aaggagaagg gggaaggaaa acagggtatg ggggaaagga
3121 ccctggggag cgaagtggag gatacaacct tgggcctgca ggccaggcta cctacccact
3181 tggaaaccca cgccaaagcc gcatctacag ctgagccact ctgaggcctc ccctccccgg
3241 cggtccccac tcagctccaa agtctctctc ccttttctct cccacacttt atcatccccc
3301 ggattcctct ctacttggtt ctcattcttc ctttgacttc ctgcttccct ttctcattca
3361 tctgtttctc actttctgcc tggttttgtt cttctctctc tctttctctg gcccatgtct
3421 gtttctctat gtttctgtct tttctttctc atcctgtgta ttttcggctc accttgtttg
3481 tcactgttct cccctctgcc ctttcattct ctctgtcctt ttaccctctt cctttttccc
3541 ttggtttctc tcagtttctg tatctgccct tcaccctctc acactgctgt ttcccaactc
3601 gttgtctgta tttttggcct gaactgtgtc ttccccaacc ctgtgttttt ctcactgttt
3661 ctttttctct tttggagcct cctccttgct cctctgtccc ttctctcttt ccttatcatc
3721 ctcgctcctc attcctgcgt ctgcttcctc cccagcaaaa gcgtgatctt gctgggtcgg
3781 cacagcctgt ttcatcctga agacacaggc caggtatttc aggtcagcca cagcttccca
3841 cacccgctct acgatatgag cctcctgaag aatcgattcc tcaggccagg tgatgactcc
3901 agccacgacc tcatgctgct ccgcctgtca gagcctgccg agctcacgga tgctgtgaag
3961 gtcatggacc tgcccaccca ggagccagca ctggggacca cctgctacgc ctcaggctgg
4021 ggcagcattg aaccagagga gtgtacgcct gggccagatg gtgcagccgg gagcccagat
4081 gcctgggtct gagggaggag gggacaggac tcctgggtct gagggaggag ggccaaggaa
4141 ccaggtgggg tccagcccac aacagtgttt ttgcctggcc cgtagtcttg accccaaaga
4201 aacttcagtg tgtggacctc catgttattt ccaatgacgt gtgtgcgcaa gttcaccctc
4261 agaaggtgac caagttcatg ctgtgtgctg gacgctggac agggggcaaa agcacctgct
4321 cggtgagtca tccctactcc caagatcttg aggggaaagg tgagtgggga ccttaattct
4381 gggctggggt ctagaagcca acaaggcgtc tgcctcccct gctccccagc tgtagccatg
4441 ccacctcccc gtgtctcatc tcattccctc cttccctctt ctttgactcc ctcaaggcaa
4501 taggttattc ttacagcaca actcatctgt tcctgcgttc agcacacggt tactaggcac
4561 ctgctatgca cccagcactg ccctagagcc tgggacatag cagtgaacag acagagagca
4621 gcccctccct tctgtagccc ccaagccagt gaggggcaca ggcaggaaca gggaccacaa
4681 cacagaaaag ctggagggtg tcaggaggtg atcaggctct cggggaggga gaaggggtgg
4741 ggagtgtgac tgggaggaga catcctgcag aaggtgggag tgagcaaaca cctgccgcag
4801 gggaggggag ggccctgcgg cacctggggg agcagaggga acagcatctg gccaggcctg
4861 ggaggagggg cctagagggc gtcaggagca gagaggaggt tgcctggctg gagtgaagga
4921 tcggggcagg gtgcgagagg gaagaaagga cccctcctgc agggcctcac ctgggccaca
4981 ggaggacact gcttttcctc tgaggagtca ggaactgtgg atggtgctgg acagaagcag
5041 gacagggcct ggctcaggtg tccagaggct gccgctggcc tccctatggg atcagactgc
5101 agggagggag ggcagcaggg atgtggaggg agtgatgatg gggctgacct gggggtggct
5161 ccaggcattg tccccacctg ggcccttacc cagcctccct cacaggctcc tggccctcag
5221 tctctcccct ccactccatt ctccacctac ccacagtggg tcattctgat caccgaactg
5281 accatgccag ccctgccgat ggtcctccat ggctccctag tgccctggag aggaggtgtc
5341 tagtcagaga gtagtcctgg aaggtggcct ctgtgaggag ccacggggac agcatcctgc
5401 agatggtcct ggcccttgtc ccaccgacct gtctacaagg actgtcctcg tggaccctcc
5461 cctctgcaca ggagctggac cctgaagtcc cttccctacc ggccaggact ggagccccta
5521 cccctctgtt ggaatccctg cccaccttct tctggaagtc ggctctggag acatttctct
5581 cttcttccaa agctgggaac tgctatctgt tatctgcctg tccaggtctg aaagatagga
5641 ttgcccaggc agaaactggg actgacctat ctcactctct ccctgctttt acccttaggg
5701 tgattctggg ggcccacttg tctgtaatgg tgtgcttcaa ggtatcacgt catggggcag
5761 tgaaccatgt gccctgcccg aaaggccttc cctgtacacc aaggtggtgc attaccggaa
5821 gtggatcaag gacaccatcg tggccaaccc ctgagcaccc ctatcaactc cctattgtag
5881 taaacttgga accttggaaa tgaccaggcc aagactcaag cctccccagt tctactgacc
5941 tttgtcctta ggtgtgaggt ccagggttgc taggaaaaga aatcagcaga cacaggtgta
6001 gaccagagtg tttcttaaat ggtgtaattt tgtcctctct gtgtcctggg gaatactggc
6061 catgcctgga gacatatcac tcaatttctc tgaggacaca gataggatgg ggtgtctgtg
6121 ttatttgtgg gatacagaga tgaaagaggg gtgggatcca cactgagaga gtggagagtg
6181 acatgtgctg gacactgtcc atgaagcact gagcagaagc tggaggcaca acgcaccaga
6241 cactcacagc aaggatggag ctgaaaacat aacccactct gtcctggagg cactgggaag
6301 cctagagaag gctgtgagcc aaggagggag ggtcttcctt tggcatggga tggggatgaa
6361 gtaaggagag ggactggacc ccctggaagc tgattcacta tggggggagg tgtattgaag
6421 tcctccagac aaccctcaga tttgatgatt tcctagtaga actcacagaa ataaagagct
6481 cttatactgt ggtttattct ggtttgttac attgacagga gacacactga aatcagcaaa
6541 ggaaacaggc atctaagtgg ggatgtgaag aaaacaggga aaatctttca gttgttttct
6601 cccagtgggg tgttgtggac agcacttaaa tcacacagaa gtgatgtgtg accttgtgta
6661 tgaagtattt ccaactaagg aagctcacct gagccttagt gtccagagtt cttattgggg
6721 gtctgtagga taggcatggg gtactggaat agctgacctt aacttctcag acctgaggtt
6781 cccaagagtt caagcagata cagcatggcc tagagcctca gatgtacaaa aacaggcatt
6841 catcatgaat cgcactgtta gcatgaatca tctggcacgg cccaaggccc caggtatacc
6901 aaggcacttg ggccgaatgt tccaagggat taaatgtcat ctcccaggag ttattcaagg
6961 gtgagccctg tacttggaac gttcaggctt tgagcagtgc agggctgctg agtcaacctt
7021 ttactgtaca ggggggtgag ggaaagggag aagatgagga aaccgcctag ggatctggtt
7081 ctgtcttgtg gccgagtgga ccatggggct atcccaagaa ggaggaattc
//
LOCUS HUMRPS24A 659 bp ss-mRNA PRI 15-MAR-1990
DEFINITION Human ribosomal protein S24 mRNA.
ACCESSION M31520
KEYWORDS ribosomal protein S24.
SOURCE Human male lymphoblast from lymphoid tumor cell line HT1080 (ATCC
121) cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 133 to 641)
AUTHORS Brown,S.J., Jewell,A., Maki,C.G. and Roufa,D.J.
TITLE A cDNA encoding human ribosomal protein S24
JOURNAL Unpublished (1990) Kansas St. Univ. Manhattan, KS 66506
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 659)
AUTHORS Brown,S.J. and Roufa,D.J.
JOURNAL Unpublished (1990) Kansas St. Univ. Manhattan, KS 66506
STANDARD full staff_entry
COMMENT Authorin Submission [2] kindly submitted by Roufa,D.J.,
22-JAN-1990.
FEATURES from to/span description
pept / 132 51 (c) unknown protein (AA at 131)
/evidence=EXPERIMENTAL
pept 143 544 ribosomal protein S24 /function="component of
human 40S ribosomal subunit"
/evidence=EXPERIMENTAL /gene="HUMRPS24"
/standard_name="rpS24"
mRNA 132 1 (c) unknown mRNA /evidence=EXPERIMENTAL /note="800
nt. anonymous human transcript detected on
Northern blots of HT1080 and HeLa cell
cytoplasmic mRNAs" [2]
signal 25 20 (c) poly-A signal /note="detected by pattern only"
[2]
signal 598 603 poly-A signal /evidence=EXPERIMENTAL [1]
recomb 133 142 misc_recomb site /note="site of artificial
recombination between 2 cDNAs which occurred
during reverse transcription or recombinant
DNA cloning operations" /evidence=EXPERIMENTAL
[2]
BASE COUNT 218 a 144 c 144 g 153 t
ORIGIN
1 gggtttatcg gaaaatgtgt ttattgagat ggtttcccac tcatcttgac tcagagtgct
61 tttagtgctg cttcctcctg aaggaacatc cttctgtaag ccttgctttt cctccttggc
121 tgtctgaaga tagatcgcca tcatgaacga caccgtaact atccgcacta gaaagttcat
181 gaccaaccga ctacttcaga ggaaacaaat ggtcattgat gtccttcacc ccgggaaggc
241 gacagtgcct aagacagaaa ttcgggaaaa actagccaaa atgtacaaga ccacaccgga
301 tgtcatcttt gtatttggat tcagaactca ttttggtggt ggcaagacaa ctggctttgg
361 catgatttat gattccctgg attatgcaaa gaaaaatgaa cccaaacata gacttgcaag
421 acatggcctg tatgagaaga aaaagacctc aagaaagcaa cgaaaggaac gcaagaacag
481 aatgaagaaa gtcaggggga ctgcaaaggc caatgttggt gctggcaaaa agccgaagga
541 gtaaaggtgc tgcaatgatg ttagctgtgg ccactgtgga tttttcgcaa gaacattaat
601 aaactaaaaa cttcatgtgt aaaaaaaaaa aaaaaaaaaa accccccccc ccccccccc
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS MUSMK1A 2222 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse intronless potassium channel gene MK1.
ACCESSION M30439
KEYWORDS potassium channel protein.
SOURCE Mouse (strain AKR) cell line L47.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (sites; for [2])
AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
Ghanshani,S., Tempel,B.L. and Gutman,G.A.
TITLE A family of three mouse potassium channel genes with intronless
coding regions
JOURNAL Science 247, 943-975 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2222)
AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
Ghanshani,S., Tempel,B.L. and Gutman,G.A.
JOURNAL Unpublished (1990) Univ. of Calif. Irvine, CA 92717
STANDARD full staff_entry
COMMENT Authorin Submission [2] kindly submitted by Gutman,G.A.,
12-DEC-1989
MK1, MK2 and MK3 represent three members of a family of mouse genes
encoding potassium channel proteins, related to the Drosophila
shaker locus. Each mouse protein is encoded by a single,
uninterrupted exon, although one (or more) introns may be present
in the 5' untranslated region (MK1, MK2). The sequence of MK1 is
included within, and identical to, the published sequence of MBK1
cDNA (Tempel, Jan & Jan, Nature 232:837-9, 1988) with which it
overlaps, except for two additional G's, one each at MK1 positions
162 and 442. MK1 closely resemble the published rat cDNA sequence
RCK1 (Baumann et al., EMBO J. 7:2457, 1988).
FEATURES from to/span description
pept 584 2071 potassium channel protein
IVS < 1 52 intron
BASE COUNT 517 a 613 c 598 g 494 t
ORIGIN
1 atcagaatgc ttctgatttt ctaccctcgt atcacttttt atttctctgc agcgtccatc
61 gatcgccccg gtgggagctt agaaggcggc aggcgaagag gggtaggagg gggggagagc
121 caaggagcag caaagtgggt gacaggcctg gggagctgct ggagttggca ctgcaccgtg
181 tcctagaagg gctgcaggag gggaggggag ccggggcgag ccccggaagc gatggcaggt
241 gtgcattaag gtgatcagag caggaagaaa aacccattca ccaacacaaa aacagagcac
301 ccccccccaa aaaatacccc aagagtagca aaaaagcttc actgggggcc ctcaggcgag
361 aggggtttcc aaattgcgta aaaggcagag tggaggaggg gaggtgataa ctagcaaagt
421 tgcagacttc tgaaccttct ggctctgaag cccctccctg tgagcgtggg gaagactcac
481 tttccggtgg gggggccgct tgggtccccc ccacccctac tccctcgctc cttcacaacc
541 cgggctctct cctggcctcc tacccctgca ccctgctttc atcatgacgg tgatgtcggg
601 ggagaatgcg gacgaggctt cgaccgctcc aggtcacccc caggatggca gctacccgag
661 gcaggcggac cacgacgacc acgaatgctg cgagcgcgta gtaatcaaca tctccgggct
721 gcgcttcgaa acgcagctca agactctggc acagttcccc aacacgctgc tgggcaaccc
781 gaagaaacgc atgcgctact ttgaccccct gaggaacgag tacttctttg accgcaaccg
841 gcccagcttc gatgccatcc tttattacta ccagtccggg ggccgcctgc gcaggccggt
901 caacgtgccc ctggacatgt tctccgagga gattaaattt tacgagttgg gcgaggaagc
961 catggagaag ttccgggaag atgagggctt catcaaggaa gaggagcgcc ccctacccga
1021 gaaggagtac cagcgccagg tgtggctgct ctttgagtat ccggagagct caggacctgc
1081 ccgggttatt gccattgtgt cggtcatggt catcctcatc tccatagtca tcttttgcct
1141 ggagactctc cctgagctga aggacgacaa ggacttcacg ggcaccatcc accgcatcga
1201 caacaccaca gtcatctata cttccaacat cttcacagac cctttcttca ttgtggaaac
1261 cttgtgtatc atctggttct cttttgagct ggtggtgcgc ttcttcgcct gccccagcaa
1321 gacagacttc tttaagaaca tcatgaactt catcgacatt gtggccatca tcccttattt
1381 cattaccctg ggcacggaga tagctgagca ggagggaaat cagaagggcg agcaggccac
1441 ttccctggcc atcctcaggg tcatccgctt ggtaagggtg ttcagaatct tcaaactctc
1501 ccgccactcc aagggccttc agatcctggg ccagaccctc aaagctagta tgagggagtt
1561 agggctgctc atctttttcc tcttcattgg ggtcatactg ttttctagcg cagtgtactt
1621 tgcggaggcg gaagaagctg agtcgcactt ctccagtatc cccgatgctt tctggtgggc
1681 ggtggtgtcc atgaccactg tgggatacgg tgacatgtac cctgtgacaa ttggaggcaa
1741 gatcgtgggc tccttgtgtg ccatcgctgg tgtgctgaca attgccctgc ccgtacctgt
1801 cattgtgtcc aatttcaact atttctacca ccgagaaact gagggggaag agcaggctca
1861 gttgctccat gttagttctc ctaacttagc ctctgacagt gacctcagcc gccgcagctc
1921 ctctactatc agcaagtctg agtacatgga gatcgaagag gatatgaaca atagcatagc
1981 ccattacaga caggctaata tcagaactgg taactgcacc acagctgatc aaaactgcgt
2041 taataagagc aagctcctga ccgatgttta aaaagcaaca ggcaagccaa caaaagcccc
2101 aaacaaaccc cttggcgact cctgtcccgc tctgtagata ctttactaag cagtagtctt
2161 tgaatgcttt attttgctgg caatgcattg ttgcattgtg aatttgggga gtggcgaacc
2221 tg
//
LOCUS MUSMK2A 3383 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse intronless potassium channel gene MK2.
ACCESSION M30440
KEYWORDS potassium channel protein.
SOURCE Mouse (strain AKR) cell line L47.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (sites; for [2])
AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
Ghanshani,S., Tempel,B.L. and Gutman,G.A.
TITLE A family of three mouse potassium channel genes with intronless
coding regions
JOURNAL Science 247, 943-975 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3383)
AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
Ghanshani,S., Tempel,B.L. and Gutman,G.A.
JOURNAL Unpublished (1990) Univ. of Calif. Irvine, CA 92717
STANDARD full staff_entry
COMMENT Authorin Submission [2] kindly submitted by Gutman,G.A.,
05-DEC-1989
MK1, MK2 and MK3 represent three members of a family of mouse genes
encoding potassium channel proteins, related to the Drosophila
Shaker locus. Each mouse protein is encoded by a single,
uninterrupted exon, although one (or more) introns may be present
in the 5' untranslated region (MK1, MK2). MK2 closely resembles
the published rat cDNA sequence RBK2 (McKinnon, J. Biol. Chem.
264:8230, 1989), and the deduced amino acid sequence of the rat
protein RCK5 (Stuhmer et al., EMBO J. 8:3235, 1989).
FEATURES from to/span description
pept 1118 2617 potassium channel protein
IVS 527 959 intron /note="putative intron by comparison
with RBK2 cDNAs"
BASE COUNT 818 a 881 c 791 g 881 t 12 others
ORIGIN
1 cctgcaggca tcgaagctta tggttatcca ggagaggtta ggaaaaggct cctgtggtgg
61 atctcattca caagtgtcat ggaaagtaga tgtgggccnn nntnnacttc ttgccctgag
121 gggatctccc agagccctta aaagtcttng tctcagctgc ctggaancct tggtcccacc
181 ttgctgctgt gttttccagg caacgtcaca cctccggagg acagccagga ggactcaggc
241 ttttgctgag ctgtgcatct tgcctccttc cttccaaagg cttcagaccc aagctccact
301 ctcctcctca agtcgtgcgc tgtgcccctg gctaagcatc tcactgtccc tcgagcccca
361 tccagtccta gagcacacct ccaggacctg ccctcctgct cagaatgact caccattatt
421 tccagctcaa gtgagaagat gtgacgggga ctgagctgcc tatttgtgta tctgtgccat
481 cagatctgct cttcgaagaa gagaacctca gctcctgcct ggccaggtaa atcaaacaat
541 cacttagtag cagtcatata gaggcaatgg gggtggcatc agtgaccaag gnncaggtcc
601 caaacaccct accatgaata cntcctttgg tgcaggtatt gccaaaacat tagcacagtg
661 cctggcagag ggctcccttc tccactcctc ctgtgcagct atgtccccag catgtgtgtg
721 gtacccaaga tcctgtctca gccncctgcc cctagcgcag gccattctat agcttccttg
781 aaagattgtt tgttccctga accctactga cccctagaca agtgacaagg tgataatagt
841 tttagtggca gaagtgatgg tacagtcatg tttctgaact cttggggaca tcaagtgcac
901 ctcctgtcag gttgccaatg actggtgatc tcatgtcctt acctgcctct gtctttcagc
961 cgagtggggc tgtgtctcca actcacaaag ggctctgggg ctttcctgca gaagctcaga
1021 cttatgcttc tgtgtttcct gggccccagt tatcagcacc gagttccctg ccgcagctgg
1081 aaggcttaaa gtaaagcaca cttctcaagc cccaattatg acagtggcta ccggagaccc
1141 agtggacgag gctgctgccc tccctgggca cccacaagac acctatgacc cagaggcaga
1201 ccatgaatgc tgtgagagag tggtcatcaa catctcaggc ctgcggttcg aaactcagct
1261 aaagacctta gcccagttcc cagagaccct cttaggggac cccaagaaac ggatgaggta
1321 cttcgatccc ctccgaaatg agtacttttt tgatcgcaac cgccctagct ttgatgccat
1381 tttgtactac tatcagtctg ggggcaggtt gaggcgacct gtgaacgtgc ccttagatat
1441 cttctcggaa gaaatccggt tttatgagct aggagaagaa gcaatggaga tgtttcggga
1501 ggatgaaggc tacatcaagg aagaagagcg tcctctgcct gaaaatgagt ttcagagaca
1561 ggtgtggctt ctctttgaat accctgagag ctcagggcct gccaggatta tagccattgt
1621 atctgtgatg gtcattctga tctccatcgt cagcttctgt ctggaaacct tgcccatctt
1681 ccgggatgag aatgaggaca tgcatggtgg cggggtgacc ttccacacct attccaacag
1741 caccatcggg taccagcagt ccacctcctt caccgaccct ttcttcattg tagagactct
1801 ctgcatcatc tggttctcct ttgagtttct ggttagattc tttgcctgtc ccagcaaagc
1861 tggcttcttc accaacatca tgaacatcat tgacattgtg gctatcatcc cttactttat
1921 caccctgggg acagagttag ctgagaagcc agaggatgcc cagcaaggcc agcaggccat
1981 gtcactggcc attctccgtg tcatccggtt ggtaagagtc tttaggattt tcaagttgtc
2041 cagacactcc aaaggtctac agattctagg tcagaccctc aaagctagca tgagggaatt
2101 gggcctcctg atattcttcc tcttcattgg ggtcatcctc ttctctagtg ctgtctattt
2161 tgcagaagct gatgagagag attcccagtt ccccagcatc ccggatgctt tctggtgggc
2221 agtcgtctcc atgacaactg taggctatgg agacatggtt ccaactacca ttggggggaa
2281 gatagtgggt tctctgtgtg caattgcagg tgtgttaacc attgccttac cagtccctgt
2341 catagtgtct aatttcaact acttctacca ccgggagaca gagggagagg agcaggccca
2401 gtacttgcaa gtgacaagct gtccaaagat cccgtcctcc cctgacctaa agaaaagtag
2461 aagtgcctct accataagta agtctgatta catggagata caggagggag ttaacaacag
2521 caatgaggac tttagagagg agaacttaaa aacagccaac tgtaccttgg ctaacacaaa
2581 ctatgtgaat attaccaaaa tgttaactga tgtctgattg aagcctacta atgtactcac
2641 agctcaacag gactgatgca gatgttgcat aatagcctgc attgtagtca gtgttctaca
2701 gtgttctaca gcatctggtt ctgcatggaa agcaatagtt cgtgcaagtg acttttgatc
2761 ttttgatttt tgatttagaa cacagaatat ttatcatggc tttcatgaaa tcttcatcac
2821 cgacttacag gtttccaaag aggagagtca cccatggagc cagcatctca gaaaggcaca
2881 atgcagcccc ctcacaccca cacacaacct accatagcaa cactatccta tcttcctacc
2941 taaacatgag cacacctggg aggtgcattc cctccacccc acccctacca ccctatttca
3001 gcccatctgc cccttcacag aggaacatac cattgtggct tagttctaaa gtctggtgac
3061 tactcaaagg tcattctcgg ttttgtgttg agaagaacac acagtcctgt gtgttggaac
3121 tactttctgt gtcacaggct ggggtttgtg aattgcagtt gccaactaga tgctccggag
3181 gcttatgttt cgtaacggaa aaatgctgca ttctgctttt tctctgcagt gtcgatgtga
3241 gggaagccca ggggggtggg ggagggacag ttaatatgac caaaatatga attgccaagt
3301 ttcacatttg ttcccttagg cctatgggga gaagctacct aattcaggga cttctcagct
3361 tcagatacca ttttgcaagc ttg
//
LOCUS MUSMK3A 1994 bp ds-DNA ROD 15-MAR-1990
DEFINITION Mouse intronless potassium channel gene MK3.
ACCESSION M30441
KEYWORDS potassium channel protein.
SOURCE Mouse (strain AKR) cell line L47.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (sites; for [2])
AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
Ghanshani,S., Tempel,B. and Gutman,G.A.
TITLE A family of three mouse potassium channel genes with intronless
coding regions
JOURNAL Science 247, 943-975 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1994)
AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
Ghanshani,S., Tempel,B.L. and Gutman,G.A.
JOURNAL Unpublished (1990) Univ. of Calif. Irvine, CA 92717
STANDARD full staff_entry
COMMENT Authorin Submission [2] kindly submitted by Gutman,G.A.,
05-DEC-1989
MK1, MK2 and MK3 represent three members of a family of mouse genes
encoding potassium channel proteins, related to the Drosophila
shaker locus. Each mouse protein is encoded by a single,
uninterrupted exon, although one (or more) introns may be present
in the 5' untranslated region (MK1, MK2). MK3 closely resembles
the published rat cDNA sequence RCK3 (Stuhmer et al., EMBO J.
8:3235, 1989).
FEATURES from to/span description
pept 150 1736 potassium channel protein
BASE COUNT 394 a 612 c 534 g 454 t
ORIGIN
1 agccgccgct agggaaggaa agcaccgccg cctcccgcgc tcgaccgccg cagccctcca
61 cccatcaccg cgcccaccct gcaccggacc ccgcaggagg cggcgcgcgc atcctgcaga
121 gccccggcca cgccgagctg ccgccagaca tgaccgtggt gcccggggac cacctgctgg
181 agccagaggc ggcgggaggc ggtggcgggg acccgcctca gggaggctgt ggcagtggcg
241 gcggcggtgg cggctgcgac cgctacgagc cactgccacc cgcgctgccc gccgcgggcg
301 agcaagattg ctgcggcgag cgtgtggtca tcaacatctc cgggctgcgc ttcgagacgc
361 agctcaagac cctctgccag ttccccgaga cactgctggg cgaccccaag cggcgcatgc
421 ggtactttga cccactccgc aatgagtact tcttcgaccg caaccgaccc agcttcgacg
481 ccatcctcta ctactaccag tccgggggcc gcattcgccg gccggtcaac gtgcccatcg
541 acatcttctc cgaggagatc cgcttttacc agctgggtga ggaggccatg gaaaagttcc
601 gtgaggatga gggcttcctg cgggaggagg agcgacccct gccccgccgt gacttccagc
661 gccaggtgtg gctgctcttc gaatatccgg agagctccgg gccggcccgg ggcattgcca
721 ttgtgtcagt gctggtcatt ctcatctcca ttgtcatctt ctgcttggag acgcttcccg
781 agtttcgcga tgagaaagac tatcccgcct ccccgtcgca ggacgtgttt gaggctgcca
841 acaacagcac gtcgggggcc ccttctggag cctccagctt ctcggacccc ttcttcgtgg
901 tggagacctt gtgcatcatc tggttctcct ttgagcttct ggtgcggttc tttgcttgcc
961 ccagtaaagc caccttctcc agaaatatca tgaacttgat agacattgtg gccatcattc
1021 cttattttat cactctgggc actgagctgg ctgaacgaca aggtaatggg cagcaggcca
1081 tgtcgctggc catcctaaga gtcatccgcc tagtaagggt tttccgcatc ttcaagctct
1141 cccgccattc taaggggctg cagatcctag gacagacgct gaaggcttcc atgcgggagc
1201 tggggctgct catattcttc ctcttcattg gggtcatcct tttctccagt gcagcttact
1261 ttgctgaggc agacgaccct tcttcgggtt ttaacagtat cccggatgcc ttctggtggg
1321 cagtagtaac catgacaact gttggttatg gtgatatgca cccagtgacc ataggaggca
1381 agattgtggg ctctctttgt gccatcgcag gtgtcttgac cattgcattg ccagttcctg
1441 tgattgtttc caacttcaac tacttctacc accgggagac agaaggggaa gagcaagccc
1501 agtacatgca cgtgggcagt tgccagcacc tctcctcttc agccgaggag ctccgaaaag
1561 cccggagtaa ctccactctg agtaagtcgg agtatatggt gatcgaagag gggggtatga
1621 accagagcgc cttcccgcag acccccttca aaacgggcaa ctccacagcc acttgcacca
1681 cgaacaataa ccccaactcc tgtgtcaaca tcaagaagat attcactgat gtctaatata
1741 tgatacggtt gccaattctg tgcccagtat tgtgtggaac atgccccctt ggtctgtgta
1801 tgcccttgat ttatacattt ccagaccact catcaaggaa agtacaagaa gtgaggaagc
1861 acacttcatt ctccctattg cttcatactg aaacaggtgc ctgtttttgc aagtgggctg
1921 cattctctca gctctttttt tctctctctc cctgtctctt aattttgtga ccaacaaact
1981 tacattaagc gtgg
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS YSCMTSPR1A 81 bp ds-DNA ORG 21-FEB-1990
DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel
junction.
ACCESSION M31839
KEYWORDS .
SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 81)
AUTHORS Clark-Walker,G.D.
TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces
cerevisiae
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
STANDARD full staff_review
FEATURES from to/span description
rpt 21 60 40 bp direct repeat
BASE COUNT 30 a 13 c 25 g 13 t
ORIGIN
1 gaaaccggga cctcggagac gtaatagggg gagggggtgg gtgataataa ccagaatatt
61 caataaatac agagcacaca t
//
LOCUS YSCMTSPR1B 81 bp ds-DNA ORG 21-FEB-1990
DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel
junction.
ACCESSION M31840
KEYWORDS .
SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 81)
AUTHORS Clark-Walker,G.D.
TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces
cerevisiae
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
STANDARD full staff_review
FEATURES from to/span description
rpt 21 60 40 bp direct repeat
BASE COUNT 35 a 7 c 18 g 21 t
ORIGIN
1 tatatatttc aaatatataa gtaatagggg gagggggtgg gtgataataa ccagaatatt
61 aaataaatac agagcacaca t
//
LOCUS YSCMTSPR1C 80 bp ds-DNA ORG 21-FEB-1990
DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel
junction.
ACCESSION M31841
KEYWORDS .
SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 80)
AUTHORS Clark-Walker,G.D.
TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces
cerevisiae
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
STANDARD full staff_review
FEATURES from to/span description
rpt 16 66 51 bp direct repeat
BASE COUNT 25 a 15 c 17 g 23 t
ORIGIN
1 atttattata atatttatat taatagttcc ggggcccggc cacgggagcc ggaaccccga
61 aaggagtatt attaaacatt
//
LOCUS YSCMTSPR1D 80 bp ds-DNA ORG 21-FEB-1990
DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel
junction.
ACCESSION M31842
KEYWORDS .
SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 80)
AUTHORS Clark-Walker,G.D.
TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces
cerevisiae
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
STANDARD full staff_review
FEATURES from to/span description
rpt 16 66 51 bp direct repeat
BASE COUNT 28 a 15 c 19 g 18 t
ORIGIN
1 ttataattca atttatatat taatagttcc ggggcccggc cacgggagcc ggaaccccga
61 aaggaggaat aagataaata
//
LOCUS YSCMTSPR1E 80 bp ds-DNA ORG 21-FEB-1990
DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel
junction.
ACCESSION M31843
KEYWORDS .
SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 80)
AUTHORS Clark-Walker,G.D.
TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces
cerevisiae
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
STANDARD full staff_review
FEATURES from to/span description
rpt 17 65 49 bp direct repeat
BASE COUNT 29 a 14 c 17 g 20 t
ORIGIN
1 tattattaaa tattatatag ttccggggcc cggccacggg agccggaacc ccgaaaggag
61 aaatattaat atattataaa
//
LOCUS YSCMTSPR1F 80 bp ds-DNA ORG 21-FEB-1990
DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel
junction.
ACCESSION M31844
KEYWORDS .
SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 80)
AUTHORS Clark-Walker,G.D.
TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces
cerevisiae
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
STANDARD full staff_review
FEATURES from to/span description
rpt 17 65 49 bp direct repeat
BASE COUNT 32 a 14 c 17 g 17 t
ORIGIN
1 atattattta ttattaatag ttccggggcc cggccacggg agccggaacc ccgaaaggag
61 aaataaataa aataaaaaat
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMBAT2A 6704 bp ss-mRNA PRI 18-JAN-1990
DEFINITION Human HLA-B-associated transcript 2 (BAT2) mRNA, complete cds.
ACCESSION M33509 M31293
KEYWORDS class III gene; major histocompatibility complex;
proline-rich protein.
SOURCE Human T-cell line HPB-All, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 6704)
AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
TITLE A gene pair from the human major histocompatibility complex encodes
large proline-rich proteins with multiple repeated motifs and a
single ubiquitin-like domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES from to/span description
pept 102 6530 HLA-B-associated transcript 2 (BAT2)
mRNA 1 6704 BAT2 mRNA
signal 6692 6697 poly-A signal
BASE COUNT 1435 a 2224 c 1897 g 1148 t
ORIGIN Chromosome 6p21.3.
1 cctaggcccg ggtcccggat ccccgcgcac ccggccaggc tctggcacgt tttgggggag
61 gtgcctgcag gacccaacat actcaatgag cttccagcgc aatgtccgat cgctcggggc
121 cgactgccaa gggaaaggat ggaaagaagt attcctcgct caacctgttt gatacgtata
181 agggcaagtc cttagagatc cagaaacccg cctgttgccc ctcgccatgg cctgcagagt
241 ctcgggaaag ttgccattgc ccggcgtatc gacctccagc caaccttcca agcctgaaag
301 ccgagaacaa aggcaatgac cccaatgtct cactagtgcc aaaagacgga acaggatggg
361 caagcaaaca ggagcagtcc gaccccaaga gttccgatgc ctcaaccgct cagccgccgg
421 aatcgcagcc actgccggct tcacagacgc ctgcctccaa ccagccgaaa cgacccccag
481 cagcccccga gaacactcct ttggttccaa gcggggtaaa gtcctgggca caagccagcg
541 tcacccatgg agcacatgga gatggtggaa gggcatcaag cctactgtca cgattctctc
601 gagaggaatt tccgaccctg caggcggctg gcgaccagga caaggctgcc aaggaaaggg
661 agtctgccga acagtcgtct gggcccggac caagcctccg cccccaaaat tctacaactt
721 ggagggacgg aggtgggcgt ggccctgatg agctggaggg cccggactcc aaacttcatc
781 atggtcatga tccccggggt gggctacagc cttcaggccc accccagttc cctccctacc
841 gcggaatgat gccgcctttc atgtatcccc catatctccc gttccctccg ccctatggac
901 cccaggggcc ttaccgatac cccactcctg atgggcccag ccgttttccc cgtgtggcgg
961 gcccccgagg ctcagggcca ccaatgcgct tagtagagcc tgtgggtcgt ccctctattc
1021 tcaaagagga taatctcaaa gagtttgatc agttggatca ggagaatgat gatggttggg
1081 caggggccca tgaagaggtt gactacactg aaaagctcaa gttcagcgat gaggaagatg
1141 ggcgagactc tgatgaggag ggagctgagg gccacaggga ttcccaatca gcttctggtg
1201 aggaacggcc ccctgaagca gatggcaaaa agggcaactc ccccaacagc gaaccgccca
1261 ctcctaagac ggcctgggca gaaacctctc ggcctccaga gacagagccg ggacctcctg
1321 ccccaaagcc tcccctaccc cctggggact acccagatcg tgggggtcct ccctgcaagc
1381 ccccagcacc tgaagatgag gatgaggcat ggcggcagcg acgaaagcag tcgtcatctg
1441 agatttccct ggcagtggag cgggcccggc gacggcgaga agaagaggag cggcgcatgc
1501 aagaagagcg ccgggcagcc tgtgctgaga agctcaagcg actcgatgaa aagtttgggg
1561 cacctgacaa gcggctcaaa gcagagcctg ctgccccacc tgctgcccct tctaccccag
1621 ccccaccacc tgcagtccct aaagaactcc ctgcacctcc agctccacct ccagcatcag
1681 ccccaacacc agagacagaa cctgaagagc cagcacaggc ccctcctgcc caatctactc
1741 ctactccagg tgtggctgcg gctcccactc tggtgagtgg tggtggcagt accagtagca
1801 ccagcagtgg cagcttcgaa gccagcccag tggaaccaca actgccctca aaagagggtc
1861 ctgaaccacc agaagaggtt cctcctccta ccacaccccc agttccaaag gtggaaccca
1921 agggtgatgg gattggtccc acccgccagc cccctagtca gggcttgggc taccccaaat
1981 atcagaagtc gttgcctcct cgtttccagc ggcagcagca ggagcagctc ctgaagcagc
2041 agcagcagca ccagtggcag cagcatcaac agggctctgc ccctcctacc ccagtgcccc
2101 catcaccacc acagcctgtg accctggggg ctgtgccagc tccacaggct ccacccccgc
2161 cccccaaggc cctgtaccca ggtgctctgg gccggccccc acccatgccc ccaatgaact
2221 ttgatccccg atggatgatg attcctcctt atgtggaccc ccggctcctc cagggtcgtc
2281 cccctctaga gttctaccct cctggtgtgc atccctctgg cctagttccc cgagagcgtt
2341 cagacagtct ggggctcagc tcagagccat ttgaccgtca tgcacctgct atgttacggg
2401 aacggggcac tccaccggtg gatccaaagt tggcctgggt aggagatgtc ttcaccgcca
2461 cacccgctga accccgccca cttacctcac ctctgcgcca ggctgcggat gaggatgaca
2521 aggggatgag gagcgagact cctccagtac ctcccccacc accctatctg gccagttatc
2581 caggctttcc tgagaatgga gcccctgggc ccccaatctc tcgctttcct ctggaggaac
2641 cagggccccg tccactcccc tggcccccag gcagtgatga agtggccaag atacaaactc
2701 caccacccaa gaaggagccc cctaaggagg agactgcaca gctgacgggg ccagaagcag
2761 gccgaaagct gcccgcgagt cggagtggag caggcccccc accaccacgc agagagagtc
2821 gcacagagac ccgctggggc cctcgtccag ggagcagtcg tcgtggaatc cctccagagg
2881 agccaggggc cccaccccgc cgggctgggc ctataaagaa acctccacca cctacaaaag
2941 tagaagagct gcctcccaag cccctcgaac agggggatga aacccccaaa cccccaaagc
3001 cagacccact caagataacc aaggggaagc tagggggccc caaggagacc ccacccaatg
3061 gaaatctttc ccctgcccca aggcttcgga gggactattc gtatgaaaga gtgggtccta
3121 cctcttgccg gggtcggggc cgaggcgagt attttgccag agggaggggt tttcggggga
3181 cctatggggg acgagggcgg ggaggccaag cgaattccgc agttaccgag agtttcgagg
3241 agatgatggg cgtggaggtg ggacaggggg accaaaccac cctcctgctc cccgaggccg
3301 ccatgccagc gagacacgga gcgagggttc agagtatgag gaaatcccca agcggtgccg
3361 gcagcggggc tcagaaacag gcagcgagac ccatgagagt gatctggctc cttcagacaa
3421 ggaggctccc acacccaagg agggaacact cacccaggtc ctctcgctcc cccaccacca
3481 ggagccccac ccttcaccga gcgccagccc gcttcacgtg cccgggggtc ggcgagtctt
3541 cactcccaga gggtgccatc tcgccggggc cgaggaggag ggaggcccct cctcaagttt
3601 gcccaggctg gagccctcca gccaagtctc tggctcccaa gaaacctccc acaggccctt
3661 tgccaccaag taaggagcct ttgaaagaga agttgatccc agggcctctg tcccctgtgg
3721 cgcgcggagg cagcaatgga ggtagcaatg tgggcatgga agatggggag cgaccccgaa
3781 ggaggcgaca tgggagggct cagcagcagg ataaaccgcc tcgtttccgg aggctgaagc
3841 aggaacggga gaatgccgca agggggtctg agggcaagcc ctccctaacc cttccagcct
3901 ccgctcctgg acctgaggag gccctcacaa cagtcacagt ggccccagca cctccgcggg
3961 cagctgccaa gtctcctgat ctgtcaaacc agaactcaga ccaagccaat gaggaatggg
4021 agactgcatc agagagcagt gacttcacca gtgagcgccg aggggacaaa gaggcacccc
4081 caccagtact gctgacaccc aaggctgtgg gaactcctgg gggaggtgga ggtggagccg
4141 taccaggtat ttcagccatg tcccgcggag atctgagcca gagagccaag gatttgagta
4201 aacggagctt ctcaagtcag cggccaggca tggaacggca gaatcggcgc cctggcccag
4261 ggggcaaggc tggcagcagt ggcagcagca gtggaggagg cggtgggggt cctggaggaa
4321 ggaccgggcc aggacgaggc gacaagagga gctggccctc tcccaagaac cgaagtcgtc
4381 ctccagagga gcgtcccccg gggcttcccc tgcctccccc acctcccagc agttctgctg
4441 tcttccgcct ggaccaagtt atccacagca accctgctgg catccaacag gctctggccc
4501 agcttagtag ccgtcaaggg agtgtaactg caccaggggg tcatccaagg cacaagcctg
4561 ggcctcccca agcccctcag ggcccctctc ctaggccccc aacccgatac gagccccaga
4621 gggtcaacag cggcctcagt tctgaccccc actttgagga gccggggcca atggtgagag
4681 gggtgggtgg gactcctcgg gactctgccg gggttagtcc ctttccccct aaacgtcggg
4741 agcggcctcc cagaaaacca gagctgctac aggaggaatc tttgccacct cctcatagct
4801 ctggattctt gggctctaag cctgagggcc caggccctca ggcagagtcc agagatacag
4861 gcacagaggc cctgacccct cacatctgga accgtttaca tactgccact agccgaaaga
4921 gttaccggcc cacgtccatg gagccttgga tggagcccct gagtcctttt gaggatgtgg
4981 ctggcacaga aatgagtcag tctgacagtg gggtggacct gagtggggat tctcaggtgt
5041 catcaggtcc ctgcagccag cgaagttccc ctgatggagg actcaagggg gcagcagagg
5101 gaccccccaa gaggcctgga ggctcctcac ccctgaatgc tgttccttgt gagggtccac
5161 ctggctctga acctcctagg agaccaccac ctgcccccca cgatggggac agaaaggagc
5221 tgccccggga gcagcctctg ccccctggcc ccattggcac agaacgatca cagcgtacag
5281 accgaggcac agagcctggc cccattcggc catcccatcg acctggtccc ccagtccagt
5341 ttggcactag tgacaaggac tcagacttac gcctagtggt aggagacagc ttgaaagcag
5401 agaaggagct aacagcatca gtcactgagg ccattcctgt atcacgagac tgggagctgc
5461 ttcccagtgc tgctgcctct gctgagccac aatccaagaa cctggattct gggcactgtg
5521 tcccggagcc cagctcctca ggccagcgcc tgtatcctga ggttttctat ggcagtgctg
5581 ggccttccag ttctcagatc tctgggggga gccatggact ctcaattaca tccaaacagt
5641 ggaggcttcg ccctgggaca ccctcactgc acccttacag atcacagccc ctatacctac
5701 ccccgggccc agcccctccc tcagcactgc tctctggggt agctctcaag ggccagtttc
5761 tggatttctc cacaatgcaa gctacagagc tggggaagtt gccggctgga ggagttctct
5821 accctccacc ttccttcctc tactctccgg ctttctgccc cagtcctttg cctgacacat
5881 cgttgcttca ggtacgccag gatctgccat ccccttcgga tttttattct actcctctgc
5941 agcctggtgg ccaaagtggc tttctccctt caggggctcc tgcccagcag atgcttctac
6001 ccatggtaga ctcacagctg cctgtggtga actttggctc cctgccgcca gcaccacctc
6061 ctgccccacc tcccctttct ctgttacctg tgggccctgc tctgcagccc cccagcctgg
6121 ctgtgcggcc cccacctgct cctgctactc gggtgctgcc ttcacctgcc aggcccttcc
6181 ccgctagctt ggggcgagca gagctgcatc cagtggaact aaagccgttc caggattatc
6241 aaaaactgag cagcaacctt gggggacctg gatcatcacg gactccccca actggaaggt
6301 ccttctctgg cctcaattcc cgtctcaagg ccacgccttc cacctacagt ggagtcttcc
6361 gcacccagcg cgtcgacctt taccagcagg cctccccacc agatgccctg cgctggatac
6421 ctaagccttg ggagcggaca gggccgccac ctcgagaagg gccctcccga cgggcagagg
6481 agcctgggtc ccgaggggac aaggagcctg ggttgccccc accccgctga gggagttcct
6541 cttgccccct acccccgggg cttgtatata gattataaat atataagggg gaaaggggtg
6601 ggcggggagg ggttgtgggg ctggggcctc acttcccctc ctcccccttc ccctggtccc
6661 ctgtccctgg ggctgtttgt taaaaaagag taataaaagg attt
//
LOCUS HUMBAT2B1 336 bp ds-DNA PRI 18-JAN-1990
DEFINITION Human HLA-B-associated transcript 2 (BAT2) gene, 5' flank.
ACCESSION M33510 M31293
KEYWORDS class III gene; major histocompatibility complex;
proline-rich protein.
SEGMENT 1 of 4
SOURCE Human T-cell line MANN DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites; for [2])
AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
TITLE A gene pair from the human major histocompatibility complex encodes
large proline-rich proteins with multiple repeated motifs and a
single ubiquitin-like domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 336)
AUTHORS Banerji,J.
JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Banerji, 11-JAN-1990, for release after publication.
BASE COUNT 108 a 60 c 65 g 101 t 2 others
ORIGIN Chromosome 6p21.3.
1 tctagaatcg ggtagtaaga gacaaaggag ggtaacagta ctgcatttca caaaatgaaa
61 cccattgtta agaaattaca aattcccaat aatttcaaat ataaaaattt attcatgaaa
121 attataggtt ataaaattaa atgtccgtct tagtcgatgg ttgcccatat tttgatgaac
181 gagtcattcc tagcctatct ttgttcaaat gatttgcata cttatgcaaa taggtagaac
241 tgcccgaaga atgcctacnt gcgtggtgcg gacgaaacgc ttgccgggsc ctttggattg
301 gtctgtctag ccacctcatt tgcatgacgt aatata
//
LOCUS HUMBAT2B2 188 bp ds-DNA PRI 18-JAN-1990
DEFINITION Human HLA-B-associated transcript 2 (BAT2) gene, 5' end.
ACCESSION M33511 M31293
KEYWORDS class III gene; major histocompatibility complex;
proline-rich protein.
SEGMENT 2 of 4
SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 154 to 188)
AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
TITLE A gene pair from the human major histocompatibility complex encodes
large proline-rich proteins with multiple repeated motifs and a
single ubiquitin-like domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 188)
AUTHORS Banerji,J.
JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES from to/span description
pre-msg 154 188 BAT2 mRNA
BASE COUNT 20 a 64 c 59 g 18 t 27 others
ORIGIN About 500 bp after segment 1; chromosome 6p21.3.
1 gtgcthhhng gggcggcggt tccgcggatg ggccgttagt cgggstcagc cgcggagtga
61 gngagggaga cgnnaggasg aacccggcca tccgccgcca tcctcccccg ccccaccgcc
121 atccgtcccg gggacnnnnn nnnnnnnnnn nnncctaggc ccgggtcccg gatccccgcg
181 cacccggc
//
LOCUS HUMBAT2B3 3090 bp ds-DNA PRI 18-JAN-1990
DEFINITION Human HLA-B-associated transcript 2 (BAT2) gene, exons 2 through 4.
ACCESSION M33512 M31293
KEYWORDS class III gene; major histocompatibility complex;
proline-rich protein.
SEGMENT 3 of 4
SOURCE Human T-cell line MANN DNA (introns), and T-cell line HPB-All, cDNA
to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1460 to 1572; 2396 to 2572; and 2919 to 3090)
AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
TITLE A gene pair from the human major histocompatibility complex encodes
large proline-rich proteins with multiple repeated motifs and a
single ubiquitin-like domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 3090)
AUTHORS Banerji,J.
JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES from to/span description
pept 1460 1572 HLA-B-associated transcript 2 (BAT2), exon 2
(first expressed exon)
2396 2572 HLA-B-associated transcript 2, exon 3
2919 + 3090 HLA-B-associated transcript 2, exon 4
pre-msg < 1 > 3090 BAT2 mRNA and introns
IVS < 1 1459 BAT2 intron A (no splice consensus)
IVS 1573 2395 BAT2 intron B
IVS 2573 2918 BAT2 intron C
BASE COUNT 747 a 709 c 730 g 901 t 3 others
ORIGIN About 500 bp after segment 2; chromosome 6p21.3.
1 tctagaatcc tgcttttatc ccagcatctt tgctttctat gttgctcagt cgccctatgt
61 ctgctttttc atttttcctg ttcctcgtct cctttctccc ccaaccccgt ttttcttctt
121 gggcctctgc cccttacttc gttgtctaca tccttttttt ttttgccatt cctgtttcca
181 tatattttcc acctgctttc gtattcatta ttttctgtta gttttggact attcgctaca
241 tgactcttgt attcgttttc ccttcatata tttatcttca cagattggcc tcctcaaaca
301 cctacgaagc aacatccatc ttatgtgtag cttgtcataa agttctttct ccccaatttt
361 agctttcatt ctgggcctgt ctggatttcc ctgctttctt ccccactatt tctcatctct
421 ttacactgtt cccgaccata aacgaatgcc tggtcactct ggaatggact gagagacctg
481 tcgtccggct tgcttaggga gctggaggta tcgagtaaag aaacactggt gatggacatt
541 tttaatcagg ataggaaaac gaagatggct ctgccttggc cctctgtttt ctggcccatg
601 gttacagggt gctaaggtgg ctccataatg ctttttctca gttcttcata tggtaaaaca
661 gtatttcatc tggaggcgat tttttccagg agccaataca ggagcaagtt taccaaaaga
721 tgggatattt caaatacttg aggttcctat agcctgggag tatgtacagc cctagttgtt
781 ctatgaggat ttctctggta ccaaccccca ttccngctga gcaagctcat aaaatcctta
841 aactcccagc ataccttnct gcaaaccttc ccagatggac acgaggctgc tgggctggga
901 gctggggtac agggccctgg gggcatgatt agggagcttg tgtccaataa acagggaatc
961 taaagtgttg tttcttcttc tctgatggaa ttgtatgctt cttttttagt tttctcttag
1021 cttgaatttg tcctgttgta agtctctgaa acgattttgg tggagagaga agagattatt
1081 acttgtaggg aattactctt tngtagacag gcacaaaggg cagagtgttt atactaggag
1141 gatgctggat ttttacttag atttccttgt aacaaaggtc gtctggggcc aaggagggaa
1201 catggcattt gagctatgag ggagctaagt agatcatggt tggactttaa gaagagtggg
1261 cagtttacat agactggagg aaaagacacc agagggactc atatctgagt ccctaatgat
1321 aatgcaatgg agtttttaag tttctgttat ggtctgtaca gggacagaga ctgagacact
1381 tgcgtctggc ccacaggctc tggcacgttt tgggggaggt gcctgcagga cccaacatac
1441 tcaatgagct tccagcgcaa tgtccgatcg ctcggggccg actgccaagg gaaaggatgg
1501 aaagaagtat tcctcgctca acctgtttga tacgtataag ggcaagtcct tagagatcca
1561 gaaacccgcc tggtgagagt cctgcaaaga tgcttctgat ggttgaaaag ctaggcatgc
1621 atggggcata cgttttagag ctctaaagga agtggctgta gtagaaatac caaaagacta
1681 gaggagattt cccaacttac actgggtcct ttaaaggggg tgtgggctct gggtgaacac
1741 cagttatcct cctacaaagg cgtgtctgtg gttccctgtc tttggacacg taagaattgg
1801 aggaaataaa tgtggatttg ggaaactttg aggccagctt gcttcttgca ggctcatgat
1861 caaccaatct cacataaaag tattgaatgt tacatatctc agccttcttg atagggattt
1921 actagatttt tttttttttt tttttttttt ttttttgaga ccaagtttag ctcctgttgc
1981 ccaggctgga gtgcaatggt gtgatcttga cttaccacaa cctccaccgc ctgggtttaa
2041 gcgattatcc tgcctcagcc tcctgagtag ctgggattac aggcatgcac cccggctaat
2101 tttgtgtttt tagtagagac agggtttctc cattttggtc aagctggtct tgaactcctg
2161 acctcaggtg atccgcctcc ctcggcctgc caaagtgctg ggattgcaaa gtgtgagcca
2221 ccacaatcag cgcgatttca gagattatta aggcagggga aggaatccct tctaagagaa
2281 gtttggagga agtaggtaat aaaatattca acatgtataa atgtgtccca ggataggagg
2341 ccatcagatc tcccacatga ggcattttcg accctctctc cgtcttgttc tccagttgcc
2401 cctcgccatg gcctgcagag tctcgggaaa gttgccattg cccggcgtat cgacctccag
2461 ccaaccttcc aagcctgaaa gccgagaaca aaggcaatga ccccaatgtc tcactagtgc
2521 caaaagacgg aacaggatgg gcaagcaaac aggagcagtc cgaccccaag aggtagacag
2581 aggcttgggg gacctagagt gatgggtatt ttaacttgaa cttcagggag cattggggct
2641 tggtttagtc cagccacgtc tgaagagacg aagaggtccc tttcttacct attgcaggtt
2701 ccttgttaaa tgactaagga atggtactaa actttagctt tttgtcttgg agagagagca
2761 tgaaaaaata gacaacaggt acaaggatga caaaattaat ttgtccttat atttgtaaat
2821 ggtagcaatg ggcatgattt cagtcctgag tctccaccag ttggagaagt cagggaggca
2881 tctcaggtgt gaataacctt cccattctgt cccctcagtt ccgatgcctc aaccgctcag
2941 ccgccggaat cgcagccact gccggcttca cagacgcctg cctccaacca gccgaaacga
3001 cccccagcag cccccgagaa cactcctttg gttccaagcg gggtaaagtc ctgggcacaa
3061 gccagcgtca cccatggagc acatggagat
//
LOCUS HUMBAT2B4 6349 bp ds-DNA PRI 18-JAN-1990
DEFINITION Human HLA-B-associated transcript 2 (BAT2) gene, 3' end.
ACCESSION M33518 M31293
KEYWORDS class III gene; major histocompatibility complex;
proline-rich protein.
SEGMENT 4 of 4
SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 209 to 6349)
AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
TITLE A gene pair from the human major histocompatibility complex encodes
large proline-rich proteins with multiple repeated motifs and a
single ubiquitin-like domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 6349)
AUTHORS Banerji,J.
JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES from to/span description
pept + 209 6175 HLA-B-associated transcript 2 (BAT2), exon 5
pre-msg < 1 6349 BAT2 mRNA and introns
IVS < 1 208 BAT2 intron D (no splice consensus)
signal 6337 6342 poly-A signal
BASE COUNT 1334 a 2095 c 1798 g 1121 t 1 others
ORIGIN About 370 bp after segment 3; chromosome 6p21.3.
1 agctaatttg tgtgtgttta gtagagatgg gttcacatgt tggcagatgg tctcgatctc
61 ttgacctctg tgatccgccc gcctcagccg gtcccagagt gctgggatta caggcgtgag
121 ccaccgcgcc cagccagagt cttccacttt tatnagcatg tcctcaggaa atgtcttctg
181 tctcctgttc tgcatcccca tcctaatagg tggaagggca tcaagcctac tgtcacgatt
241 ctctcgagag gaatttccga ccctgcaggc ggctggcgac caggacaagg ctgccaagga
301 aagggagtct gccgaacagt cgtctgggcc cggaccaagc ctccgccccc aaaattctac
361 aacttggagg gacggaggtg ggcgtggccc tgatgagctg gagggcccgg actccaaact
421 tcatcatggt catgatcccc ggggtgggct acagccttca ggcccacccc agttccctcc
481 ctaccgcgga atgatgccgc ctttcatgta tcccccatat ctcccgttcc ctccgcccta
541 tggaccccag gggccttacc gataccccac tcctgatggg cccagccgtt ttccccgtgt
601 ggcgggcccc cgaggctcag ggccaccaat gcgcttagta gagcctgtgg gtcgtccctc
661 tattctcaaa gaggataatc tcaaagagtt tgatcagttg gatcaggaga atgatgatgg
721 ttgggcaggg gcccatgaag aggttgacta cactgaaaag ctcaagttca gcgatgagga
781 agatgggcga gactctgatg aggagggagc tgagggccac agggattccc aatcagcttc
841 tggtgaggaa cggccccctg aagcagatgg caaaaagggc aactccccca acagcgaacc
901 gcccactcct aagacggcct gggcagaaac ctctcggcct ccagagacag agccgggacc
961 tcctgcccca aagcctcccc taccccctgg ggactaccca gatcgtgggg gtcctccctg
1021 caagccccca gcacctgaag atgaggatga ggcatggcgg cagcgacgaa agcagtcgtc
1081 atctgagatt tccctggcag tggagcgggc ccggcgacgg cgagaagaag aggagcggcg
1141 catgcaagaa gagcgccggg cagcctgtgc tgagaagctc aagcgactcg atgaaaagtt
1201 tggggcacct gacaagcggc tcaaagcaga gcctgctgcc ccacctgctg ccccttctac
1261 cccagcccca ccacctgcag tccctaaaga actccctgca cctccagctc cacctccagc
1321 atcagcccca acaccagaga cagaacctga agagccagca caggcccctc ctgcccaatc
1381 tactcctact ccaggtgtgg ctgcggctcc cactctggtg agtggtggtg gcagtaccag
1441 tagcaccagc agtggcagct tcgaagccag cccagtggaa ccacaactgc cctcaaaaga
1501 gggtcctgaa ccaccagaag aggttcctcc tcctaccaca cccccagttc caaaggtgga
1561 acccaagggt gatgggattg gtcccacccg ccagccccct agtcagggct tgggctaccc
1621 caaatatcag aagtcgttgc ctcctcgttt ccagcggcag cagcaggagc agctcctgaa
1681 gcagcagcag cagcaccagt ggcagcagca tcaacagggc tctgcccctc ctaccccagt
1741 gcccccatca ccaccacagc ctgtgaccct gggggctgtg ccagctccac aggctccacc
1801 cccgcccccc aaggccctgt acccaggtgc tctgggccgg cccccaccca tgcccccaat
1861 gaactttgat ccccgatgga tgatgattcc tccttatgtg gacccccggc tcctccaggg
1921 tcgtccccct ctagagttct accctcctgg tgtgcatccc tctggcctag ttccccgaga
1981 gcgttcagac agtctggggc tcagctcaga gccatttgac cgtcatgcac ctgctatgtt
2041 acgggaacgg ggcactccac cggtggatcc aaagttggcc tgggtaggag atgtcttcac
2101 cgccacaccc gctgaacccc gcccacttac ctcacctctg cgccaggctg cggatgagga
2161 tgacaagggg atgaggagcg agactcctcc agtacctccc ccaccaccct atctggccag
2221 ttatccaggc tttcctgaga atggagcccc tgggccccca atctctcgct ttcctctgga
2281 ggaaccaggg ccccgtccac tcccctggcc cccaggcagt gatgaagtgg ccaagataca
2341 aactccacca cccaagaagg agccccctaa ggaggagact gcacagctga cggggccaga
2401 agcaggccga aagctgcccg cgagtcggag tggagcaggc cccccaccac cacgcagaga
2461 gagtcgcaca gagacccgct ggggccctcg tccagggagc agtcgtcgtg gaatccctcc
2521 agaggagcca ggggccccac cccgccgggc tgggcctata aagaaacctc caccacctac
2581 aaaagtagaa gagctgcctc ccaagcccct cgaacagggg gatgaaaccc ccaaaccccc
2641 aaagccagac ccactcaaga taaccaaggg gaagctaggg ggccccaagg agaccccacc
2701 caatggaaat ctttcccctg ccccaaggct tcggagggac tattcgtatg aaagagtggg
2761 tcctacctct tgccggggtc ggggccgagg cgagtatttt gccagaggga ggggttttcg
2821 ggggacctat gggggacgag ggcggggagg ccaagcgaat tccgcagtta ccgagagttt
2881 cgaggagatg atgggcgtgg aggtgggaca gggggaccaa accaccctcc tgctccccga
2941 ggccgccatg ccagcgagac acggagcgag ggttcagagt atgaggaaat ccccaagcgg
3001 tgccggcagc ggggctcaga aacaggcagc gagacccatg agagtgatct ggctccttca
3061 gacaaggagg ctcccacacc caaggaggga acactcaccc aggtcctctc gctcccccac
3121 caccaggagc cccacccttc accgagcgcc agcccgcttc acgtgcccgg gggtcggcga
3181 gtcttcactc ccagagggtg ccatctcgcc ggggccgagg aggagggagg cccctcctca
3241 agtttgccca ggctggagcc ctccagccaa gtctctggct cccaagaaac ctcccacagg
3301 ccctttgcca ccaagtaagg agcctttgaa agagaagttg atcccagggc ctctgtcccc
3361 tgtggcgcgc ggaggcagca atggaggtag caatgtgggc atggaagatg gggagcgacc
3421 ccgaaggagg cgacatggga gggctcagca gcaggataaa ccgcctcgtt tccggaggct
3481 gaagcaggaa cgggagaatg ccgcaagggg gtctgagggc aagccctccc taacccttcc
3541 agcctccgct cctggacctg aggaggccct cacaacagtc acagtggccc cagcacctcc
3601 gcgggcagct gccaagtctc ctgatctgtc aaaccagaac tcagaccaag ccaatgagga
3661 atgggagact gcatcagaga gcagtgactt caccagtgag cgccgagggg acaaagaggc
3721 acccccacca gtactgctga cacccaaggc tgtgggaact cctgggggag gtggaggtgg
3781 agccgtacca ggtatttcag ccatgtcccg cggagatctg agccagagag ccaaggattt
3841 gagtaaacgg agcttctcaa gtcagcggcc aggcatggaa cggcagaatc ggcgccctgg
3901 cccagggggc aaggctggca gcagtggcag cagcagtgga ggaggcggtg ggggtcctgg
3961 aggaaggacc gggccaggac gaggcgacaa gaggagctgg ccctctccca agaaccgaag
4021 tcgtcctcca gaggagcgtc ccccggggct tcccctgcct cccccacctc ccagcagttc
4081 tgctgtcttc cgcctggacc aagttatcca cagcaaccct gctggcatcc aacaggctct
4141 ggcccagctt agtagccgtc aagggagtgt aactgcacca gggggtcatc caaggcacaa
4201 gcctgggcct ccccaagccc ctcagggccc ctctcctagg cccccaaccc gatacgagcc
4261 ccagagggtc aacagcggcc tcagttctga cccccacttt gaggagccgg ggccaatggt
4321 gagaggggtg ggtgggactc ctcgggactc tgccggggtt agtccctttc cccctaaacg
4381 tcgggagcgg cctcccagaa aaccagagct gctacaggag gaatctttgc cacctcctca
4441 tagctctgga ttcttgggct ctaagcctga gggcccaggc cctcaggcag agtccagaga
4501 tacaggcaca gaggccctga cccctcacat ctggaaccgt ttacatactg ccactagccg
4561 aaagagttac cggcccacgt ccatggagcc ttggatggag cccctgagtc cttttgagga
4621 tgtggctggc acagaaatga gtcagtctga cagtggggtg gacctgagtg gggattctca
4681 ggtgtcatca ggtccctgca gccagcgaag ttcccctgat ggaggactca agggggcagc
4741 agagggaccc cccaagaggc ctggaggctc ctcacccctg aatgctgttc cttgtgaggg
4801 tccacctggc tctgaacctc ctaggagacc accacctgcc ccccacgatg gggacagaaa
4861 ggagctgccc cgggagcagc ctctgccccc tggccccatt ggcacagaac gatcacagcg
4921 tacagaccga ggcacagagc ctggccccat tcggccatcc catcgacctg gtcccccagt
4981 ccagtttggc actagtgaca aggactcaga cttacgccta gtggtaggag acagcttgaa
5041 agcagagaag gagctaacag catcagtcac tgaggccatt cctgtatcac gagactggga
5101 gctgcttccc agtgctgctg cctctgctga gccacaatcc aagaacctgg attctgggca
5161 ctgtgtcccg gagcccagct cctcaggcca gcgcctgtat cctgaggttt tctatggcag
5221 tgctgggcct tccagttctc agatctctgg ggggagccat ggactctcaa ttacatccaa
5281 acagtggagg cttcgccctg ggacaccctc actgcaccct tacagatcac agcccctata
5341 cctacccccg ggcccagccc ctccctcagc actgctctct ggggtagctc tcaagggcca
5401 gtttctggat ttctccacaa tgcaagctac agagctgggg aagttgccgg ctggaggagt
5461 tctctaccct ccaccttcct tcctctactc tccggctttc tgccccagtc ctttgcctga
5521 cacatcgttg cttcaggtac gccaggatct gccatcccct tcggattttt attctactcc
5581 tctgcagcct ggtggccaaa gtggctttct cccttcaggg gctcctgccc agcagatgct
5641 tctacccatg gtagactcac agctgcctgt ggtgaacttt ggctccctgc cgccagcacc
5701 acctcctgcc ccacctcccc tttctctgtt acctgtgggc cctgctctgc agccccccag
5761 cctggctgtg cggcccccac ctgctcctgc tactcgggtg ctgccttcac ctgccaggcc
5821 cttccccgct agcttggggc gagcagagct gcatccagtg gaactaaagc cgttccagga
5881 ttatcaaaaa ctgagcagca accttggggg acctggatca tcacggactc ccccaactgg
5941 aaggtccttc tctggcctca attcccgtct caaggccacg ccttccacct acagtggagt
6001 cttccgcacc cagcgcgtcg acctttacca gcaggcctcc ccaccagatg ccctgcgctg
6061 gatacctaag ccttgggagc ggacagggcc gccacctcga gaagggccct cccgacgggc
6121 agaggagcct gggtcccgag gggacaagga gcctgggttg cccccacccc gctgagggag
6181 ttcctcttgc cccctacccc cggggcttgt atatagatta taaatatata agggggaaag
6241 gggtgggcgg ggaggggttg tggggctggg gcctcacttc ccctcctccc ccttcccctg
6301 gtcccctgtc cctggggctg tttgttaaaa aagagtaata aaaggattt
//
LOCUS HUMBAT3A 3740 bp ss-mRNA PRI 18-JAN-1990
DEFINITION Human HLA-B-associated transcript 3 (BAT3) mRNA, complete cds.
ACCESSION M33519 M31294
KEYWORDS class III gene; major histocompatibility complex;
proline-rich protein.
SOURCE Human T-cell line HPB-All, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3740)
AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
TITLE A gene pair from the human major histocompatibility complex encodes
large proline-rich proteins with multiple repeated motifs and a
single ubiquitin-like domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES from to/span description
pept 250 3648 HLA-B-associated transcript 3 (BAT3)
mRNA 1 3740 BAT3 mRNA
BASE COUNT 744 a 1182 c 1057 g 757 t
ORIGIN Chromosome 6p21.3.
1 ggcgacagcg gtggcggctc ctcggggtgc tcggctccct cccacctagg ccggccccgg
61 cccgactcgc cctcagaaac tcactgtttg gggctgcgga ctttctcgtc gtgccccaca
121 aaagtaaagc ttggggacct ggggggagcc ggaagtatcg cttcgagatc cccaaatact
181 atcggggaaa cggaagtggc cgtcggtggc aggtttgggg gagaccggaa gtgacgagac
241 ctgtcggcca tggagcctaa tgatagtacc agtaccgctg tggaggagcc tgacagcttg
301 gaggtgttgg tgaagacctt ggactctcaa actcgtacct ttattgtggg ggcccagatg
361 aatgtaaaag agtttaagga gcacattcgt gcctctgtca gcatcccatc tgaaaaacaa
421 cggctcattt accagggacg agttctgcaa gatgataaga agcttcagga atacaatgtt
481 gggggaaagg ttatccacct ggtggaacgg gctcctcctc agactcacct cccttctggg
541 gcatcttctg ggacggggtc tgcctcagcc actcatggtg ggggatcccc ccctggtact
601 cgggggcctg gggcctctgt tcatgaccgg aatgccaaca gctatgtcat ggttggaacc
661 ttcaatcttc ctagtgacgg ctctgctgtg gatgttcaca tcaacatgga acaggccccg
721 attcagagtg agccccgggt acggctggtg atggctcagc acatgatcag ggatatacag
781 accttactat cccggatgga gactctcccc taccttcagt gtcgaggagg gccccaaccg
841 cagcacagtc agccgccccc gcagccaccg gctgtgaccc cggagccagt agccttgagc
901 tctcaaacat cagaaccagt tgaaagtgaa gcacctcccc gggagcccat ggaggcagaa
961 gaagtggagg agcgtgcccc agcccagaac ccggagctca ctcctggccc agccccagcg
1021 ggcccaacac ctgccccgga aacaaatgca cccaaccatc cttcccctgc ggagtatgtc
1081 gaggtgctcc aggagctaca gcggctggag agtcgcctcc agcccttctt gcagcgctac
1141 tacgaggttc tgggtgctgc tgccaccacg gactacaata acaatcacga gggccgggag
1201 gaggatcagc ggttgatcaa cttggtaggg gagagcctgc gactgctggg caacaccttt
1261 gttgcactgt ctgacctgcg ctgcaatctg gcctgcacgc ccccacgaca cctgcatgtg
1321 gtccggccta tgtctcacta caccaccccc atggtgctcc agcaggcagc cattcccata
1381 cagatcaatg tgggaaccac tgtgaccatg acaggaaatg ggactcggcc ccccccaact
1441 cccaatgcag aggcacctcc ccctggtcct gggcaggcct catccgtggc tccgtcttct
1501 accaatgtcg agtcctcagc tgagggggct cccccgccag gtccagctcc cccgccagcc
1561 accagccacc cgagggtcat ccggatttcc caccagagtg tggaacccgt ggtcatgatg
1621 cacatgaaca ttcaagattc tggcacacag cctggtggtg ttccgagtgc tcccactggc
1681 cccctgggac cccctggtca tggccaaacc ctgggacagc aggtgccagg cttcccaaca
1741 gctccaaccc gggtggtgat tgcccggccc actcctccac aggctcggcc ttcccatcct
1801 ggagggcccc cagtctctgg gacactgcag ggcgccggtc tgggtaccaa tgcctcgttg
1861 gcccagatgg tgagcggcct tgtggggcag cttcttatgc agccagtcct tgtggctcag
1921 gggaccccag gtatggctcc accgccagcc cctgccactg cttctgccag tgctggcacc
1981 accaacacag ctaccacagc tggccccgct cctggggggc ctgcccagcc tccacccacc
2041 cctcaaccct ccatggctga tcttcagttc tctcagcttc tggggaacct gctagggcct
2101 gcagggccag gggctggagg gcctggtgtg gcttctccca ccatcactgt ggcgatgcct
2161 ggtgtccctg cctttctcca aggcatgact gacttcttgc aggcaacaca gacagcccct
2221 ccaccacccc cacctcctcc acccccacca cctgccccag agcagcagac catgccccca
2281 ccaggctccc cttctggtgg cgcagggagt cctggaggcc tgggtcttga gagcctgtca
2341 ccggagtttt ttacctcagt ggtgcagggt gtgctcagct ccctgctggg ctccctgggg
2401 gctcgggctg gcagcagtga aagtattgct gccttcatac aacgcctcag tggatccagc
2461 aacatctttg agcctggagc tgatggggcc cttggattct ttggggcctt gctttctctt
2521 ctgtgccaga acttctctat ggtggacgta gtgatgcttc tccatgggca tttccagcca
2581 ctacaacggc tccagcccca gctgcgatcc ttcttccacc agcactacct gggtggtcag
2641 gagcccacac ccagtaacat ccggatggca acccacacat tgatcacggg gctagaagag
2701 tatgtgcggg agagtttttc cttggtgcag gttcagccag gtgtggacat catccggaca
2761 aacctggaat ttctccaaga gcagtttaat agcattgctg cgcatgtgct gcattgcaca
2821 gatagtggat ttggggcccg gttgctggag ttgtgtaacc aaggcctgtt tgaatgcctg
2881 gccctaaacc tgcactgctt ggggggacag cagatggagc ttgctgctgt tatcaatggc
2941 cgaattcgtc gtatgtctcg tggggtgaat ccctccttgg tgagctggct gaccactatg
3001 atgggactga ggcttcaggt ggtactggag cacatgcctg taggccctga tgccattctc
3061 agatacgttc gcagggttgg tgatcccccc cagccacttc ctgaggagcc aatggaagtt
3121 cagggagcag aaagagcttc ccctgagcct cagcgggaga atgcttcccc agcccctgga
3181 acaacagcag aagaggccat gtcccgaggt ccacctcctg ctcctgaggg gggctcccgg
3241 gatgaacagg atggagcttc agctgagaca gaaccttggg cagctgcagt ccccccagaa
3301 tgggtcccta ttatccagca ggacattcag agccagcgga aggtgaaacc gcagccccct
3361 ctgagtgatg cctacctcag tggtatgcct gccaagagac gcaagacgat gcagggtgag
3421 ggcccccagc tgcttctctc agaggctgtg agccgggcag ctaaggcagc cggagctcgg
3481 cccctgacga gccccgagag cctgagccgg gacctggagg caccagaggt tcaggagagc
3541 tacaggcagc agctccggtc tgatatacaa aaacgactgc aggaagaccc caactacagt
3601 ccccagcgct tccccaatgc ccagcgggcc tttgctgatg atccttagct ctttgctcta
3661 tggcccttcc tcatcagggg accgtttccc ccctcttcct tcacagtatt taagaaataa
3721 aagtcggatt ttttctggcc
//
LOCUS HUMBAT3B1 785 bp ds-DNA PRI 18-JAN-1990
DEFINITION Human HLA-B-associated transcript 3 (BAT3) gene, 5' end.
ACCESSION M33520 M31294
KEYWORDS class III gene; major histocompatibility complex;
proline-rich protein.
SEGMENT 1 of 2
SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 333 to 689)
AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
TITLE A gene pair from the human major histocompatibility complex encodes
large proline-rich proteins with multiple repeated motifs and a
single ubiquitin-like domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 785)
AUTHORS Banerji,J.
JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES from to/span description
pept 582 + 689 HLA-B-associated transcript 3 (BAT3), exon 1
pre-msg 333 > 785 BAT3 mRNA and introns
IVS 690 > 785 BAT3 intron A
BASE COUNT 170 a 210 c 239 g 165 t 1 others
ORIGIN Chromosome 6p21.3.
1 aaggcgcagc gagggcaata gggtggagaa gagttttagc tgctagacag tgccgcctga
61 aattatcagc ctgccaagat ttaaacatag atgaatgtgg cataatcccc catctccaaa
121 gtccaagtcc atacgaccgt ccatagcctc tcgaggcagt ggtagagtcc cagctggtga
181 ctgtttttca ggcatttacg gtagccacct caatcttcta gcgctcaacg cgcgcacaga
241 cgtgaacgcc gccagagggg ggagggggtg gggcgatgct taagtgtcca cgcatcccgt
301 agtgcgacgg cacagcgtag taggtncccc cgggcgacag cggtggcggc tcctcggggt
361 gctcggctcc ctcccaccta ggccggcccc ggcccgactc gccctcagaa actcactgtt
421 tggggctgcg gactttctcg tcgtgcccca caaaagtaaa gcttggggac ctggggggag
481 ccggaagtat cgcttcgaga tccccaaata ctatcgggga aacggaagtg gccgtcggtg
541 gcaggtttgg gggagaccgg aagtgacgag acctgtcggc catggagcct aatgatagta
601 ccagtaccgc tgtggaggag cctgacagct tggaggtgtt ggtgaagacc ttggactctc
661 aaactcgtac ctttattgtg ggggcccagg tgagacacct cactagttct ggaagacacc
721 tttagctttt ccacgtttag gccccttagc ctgagagatg agcttgattt ttctaggtca
781 ccaga
//
LOCUS HUMBAT3B2 4227 bp ds-DNA PRI 18-JAN-1990
DEFINITION Human HLA-B-associated transcript 3 (BAT3) gene, 3' end.
ACCESSION M33521 M31294
KEYWORDS class III gene; major histocompatibility complex;
proline-rich protein.
SEGMENT 2 of 2
SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 731 to 848; 963 to 4227)
AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
TITLE A gene pair from the human major histocompatibility complex encodes
large proline-rich proteins with multiple repeated motifs and a
single ubiquitin-like domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 4227)
AUTHORS Banerji,J.
JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES from to/span description
pept + 731 848 HLA-B-associated transcript 3 (BAT3), exon 2
963 4135 HLA-B-associated transcript 3, exon 3
pre-msg < 1 4227 BAT3 mRNA and introns
IVS < 1 730 BAT3 intron A
IVS 849 962 BAT3 intron B
BASE COUNT 842 a 1276 c 1116 g 987 t 6 others
ORIGIN About 1.1 kb after segment 1; chromosome 6p21.3.
1 ttatcttntt agatcatttc cttccacctt aacctatacc agacccactc cttctttgcc
61 attttttaat cttggaaatc acaggagngt ctgtaaatna ctggatcatc ttgtgtttgg
121 aaggggtact gatgtctcta gacacatacn cccttggatg ccagacagat aatataattt
181 ccatgtgttt tttttttgtt tttcatccgt gttatttttc ctggatctat aacctgagct
241 tcattaagtt tatttattta attttttcga gatggagtcc cacnctttca cccaggctag
301 agtgtagtga tgcgatctcg gctcactgca acctccgcct cccgaattca agtgattctc
361 ttgcttcagc ctccctagta gctgggatta caggcgacca ccatgcctgg cttatttttt
421 gtatttttgg taaaaagggg ttttacatgt tggccaggct ggtctcgaac tctgacctaa
481 gtgatctgcc tgccttggcc tcccaagtgc tggattacag tgtgagacca ccgctccagc
541 caatatgtct gtatttttga cacgtgttac tttagttaag ggtttgcaca gtaatgatct
601 cacggtcaag acaaacgggt agtgattdct gtggtggttt ttacccctca cctccacaac
661 tcggttgtct gtctttgttc ttcctctttc ctccattctt tccattcctg tgcatgcctc
721 ttcttttcag atgaatgtaa aagagtttaa ggagcacatt cgtgcctctg tcagcatccc
781 atctgaaaaa caacggctca tttaccaggg acgagttctg caagatgata agaagcttca
841 ggaatacagt aagggggctg gggaggcagt tcagaggttg gggctactgt ctggagggat
901 gaactgaggc catgggttta cctgttcata ctatgttttg gtgtgtgtct atttttctgc
961 agatgttggg ggaaaggtta tccacctggt ggaacgggct cctcctcaga ctcacctccc
1021 ttctggggca tcttctggga cggggtctgc ctcagccact catggtgggg gatccccccc
1081 tggtactcgg gggcctgggg cctctgttca tgaccggaat gccaacagct atgtcatggt
1141 tggaaccttc aatcttccta gtgacggctc tgctgtggat gttcacatca acatggaaca
1201 ggccccgatt cagagtgagc cccgggtacg gctggtgatg gctcagcaca tgatcaggga
1261 tatacagacc ttactatccc ggatggagac tctcccctac cttcagtgtc gaggagggcc
1321 ccaaccgcag cacagtcagc cgcccccgca gccaccggct gtgaccccgg agccagtagc
1381 cttgagctct caaacatcag aaccagttga aagtgaagca cctccccggg agcccatgga
1441 ggcagaagaa gtggaggagc gtgccccagc ccagaacccg gagctcactc ctggcccagc
1501 cccagcgggc ccaacacctg ccccggaaac aaatgcaccc aaccatcctt cccctgcgga
1561 gtatgtcgag gtgctccagg agctacagcg gctggagagt cgcctccagc ccttcttgca
1621 gcgctactac gaggttctgg gtgctgctgc caccacggac tacaataaca atcacgaggg
1681 ccgggaggag gatcagcggt tgatcaactt ggtaggggag agcctgcgac tgctgggcaa
1741 cacctttgtt gcactgtctg acctgcgctg caatctggcc tgcacgcccc cacgacacct
1801 gcatgtggtc cggcctatgt ctcactacac cacccccatg gtgctccagc aggcagccat
1861 tcccatacag atcaatgtgg gaaccactgt gaccatgaca ggaaatggga ctcggccccc
1921 cccaactccc aatgcagagg cacctccccc tggtcctggg caggcctcat ccgtggctcc
1981 gtcttctacc aatgtcgagt cctcagctga gggggctccc ccgccaggtc cagctccccc
2041 gccagccacc agccacccga gggtcatccg gatttcccac cagagtgtgg aacccgtggt
2101 catgatgcac atgaacattc aagattctgg cacacagcct ggtggtgttc cgagtgctcc
2161 cactggcccc ctgggacccc ctggtcatgg ccaaaccctg ggacagcagg tgccaggctt
2221 cccaacagct ccaacccggg tggtgattgc ccggcccact cctccacagg ctcggccttc
2281 ccatcctgga gggcccccag tctctgggac actgcagggc gccggtctgg gtaccaatgc
2341 ctcgttggcc cagatggtga gcggccttgt ggggcagctt cttatgcagc cagtccttgt
2401 ggctcagggg accccaggta tggctccacc gccagcccct gccactgctt ctgccagtgc
2461 tggcaccacc aacacagcta ccacagctgg ccccgctcct ggggggcctg cccagcctcc
2521 acccacccct caaccctcca tggctgatct tcagttctct cagcttctgg ggaacctgct
2581 agggcctgca gggccagggg ctggagggcc tggtgtggct tctcccacca tcactgtggc
2641 gatgcctggt gtccctgcct ttctccaagg catgactgac ttcttgcagg caacacagac
2701 agcccctcca ccacccccac ctcctccacc cccaccacct gccccagagc agcagaccat
2761 gcccccacca ggctcccctt ctggtggcgc agggagtcct ggaggcctgg gtcttgagag
2821 cctgtcaccg gagtttttta cctcagtggt gcagggtgtg ctcagctccc tgctgggctc
2881 cctgggggct cgggctggca gcagtgaaag tattgctgcc ttcatacaac gcctcagtgg
2941 atccagcaac atctttgagc ctggagctga tggggccctt ggattctttg gggccttgct
3001 ttctcttctg tgccagaact tctctatggt ggacgtagtg atgcttctcc atgggcattt
3061 ccagccacta caacggctcc agccccagct gcgatccttc ttccaccagc actacctggg
3121 tggtcaggag cccacaccca gtaacatccg gatggcaacc cacacattga tcacggggct
3181 agaagagtat gtgcgggaga gtttttcctt ggtgcaggtt cagccaggtg tggacatcat
3241 ccggacaaac ctggaatttc tccaagagca gtttaatagc attgctgcgc atgtgctgca
3301 ttgcacagat agtggatttg gggcccggtt gctggagttg tgtaaccaag gcctgtttga
3361 atgcctggcc ctaaacctgc actgcttggg gggacagcag atggagcttg ctgctgttat
3421 caatggccga attcgtcgta tgtctcgtgg ggtgaatccc tccttggtga gctggctgac
3481 cactatgatg ggactgaggc ttcaggtggt actggagcac atgcctgtag gccctgatgc
3541 cattctcaga tacgttcgca gggttggtga tcccccccag ccacttcctg aggagccaat
3601 ggaagttcag ggagcagaaa gagcttcccc tgagcctcag cgggagaatg cttccccagc
3661 ccctggaaca acagcagaag aggccatgtc ccgaggtcca cctcctgctc ctgagggggg
3721 ctcccgggat gaacaggatg gagcttcagc tgagacagaa ccttgggcag ctgcagtccc
3781 cccagaatgg gtccctatta tccagcagga cattcagagc cagcggaagg tgaaaccgca
3841 gccccctctg agtgatgcct acctcagtgg tatgcctgcc aagagacgca agacgatgca
3901 gggtgagggc ccccagctgc ttctctcaga ggctgtgagc cgggcagcta aggcagccgg
3961 agctcggccc ctgacgagcc ccgagagcct gagccgggac ctggaggcac cagaggttca
4021 ggagagctac aggcagcagc tccggtctga tatacaaaaa cgactgcagg aagaccccaa
4081 ctacagtccc cagcgcttcc ccaatgccca gcgggccttt gctgatgatc cttagctctt
4141 tgctctatgg cccttcctca tcaggggacc gtttcccccc tcttccttca cagtatttaa
4201 gaaataaaag tcggattttt tctggcc
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS CHBSSRNA 1504 bp ss-RNA RNA 01-FEB-1990
DEFINITION C.limicola ribosomal RNA small subunit.
ACCESSION M31769
KEYWORDS ribosomal RNA; ribosomal RNA small subunit.
SOURCE C.limicola (strain ATCC 8327) RNA.
ORGANISM Chlorobium limicola
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Green bacteria; Green sulfur bacteria.
REFERENCE 1 (bases 1 to 1504)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990) Microbiol. Dept. U. of IL., Urbana, IL. 61801
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 01-FEB-1990.
FEATURES from to/span description
rRNA 1 1504 ribosomal RNA small subunit
BASE COUNT 396 a 334 c 437 g 311 t 26 others
ORIGIN
1 nnncaacgga gagtttgatc ctggctcagg acgaacgctg gcggcgtgcc taacacatgc
61 aagtcaaagg aaagtttctt cggaaataag tacttggcgc aagggtgagt aaggcatagg
121 taatctgccc tttggactgg cataaccccg agaaatcggg gacaatacca gatgatgcag
181 cggaaccgca tggttatgtt gttaaatgat ttatcgccaa aggatgagcc tatgttccat
241 caggtagttg gtagggtaac ggcctaccaa gccaacgacg gatagctggt ctgagaggat
301 gatcagccac attggaactg agacacggtc cagactccta cgggaggcag cagtgaggaa
361 tattgcgcaa tgggcgaaag cctgacgcag caacgccgcg tggatgatga agttcttcgg
421 aatgtaaaat ccttttgctg gggacgaata ggtcggcttg ccgactctga cggtacccag
481 cgaataagcc acggctaact ctgtgccagc agccgcggtg atacaggggt ggcaagcgtt
541 gtccggattt actgggtgta aagggtgcgc aggcggactg ataagtcggg ggttaaatcc
601 atgtgctcaa cacatgcacg gcttccgata ctgtcagtct tgagtctcga agaggaagat
661 ggaatttccg gtgtaacggt ggaatgtgta gatatcggaa agaacaccag tggcgaaggc
721 agtcttctgg tcgagtactg acgctcaggc acgaaagcgt ggggagcaaa caggattaga
781 taccctggta gtccacgccg taaacgatga atactagatg ttggtcatat tgatcagtgt
841 cgcagctaac gcattaagta ttccacctgg gaagtacgcc cgcaagggtg aaactcaaag
901 gaattgacgn nnnccngcac aagcggtgga tcatgtggtt taattcgatg caacgcgaag
961 aaccttacct aggcttgaaa tgttagctaa agctcctgaa agggagcgtc cttcggggag
1021 ctagcacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc
1081 gcaacgagcg caacccctac aattagttac taacaggtta agctgaggac tctaattgaa
1141 ctgcctacgc aagtagagag gaagganggg atgacgtcaa gtcctcatgg cccttacgcc
1201 tagggccaca cacgtgatac aatggcgact acagagggca aagccgcgag gcagaggaaa
1261 tcccttaaaa gtcgtctcag tccggatcgg agtctgcaac tcgactccgt gaagttggaa
1321 tcgctagtaa tcgcggatca gcatgccgcg gtgaatgtgt tcccgggcct tgtacacacc
1381 gcccgtcaag tcatggaagt caggagtacc caaagacgtc cgcgcgttta aggtaagact
1441 ggtaactggg actaagnngt aacaagnnnn nngtaccgga agnnnnnnnn ngatcacctc
1501 cttt
//
LOCUS ECOSPEA 3236 bp ds-DNA BCT 01-FEB-1990
DEFINITION E.coli arginine decarboxylase (speA) gene, complete cds, agmatinase
(speB) and methionine adenosyltransferase (metK) genes, 5' end.
ACCESSION M31770
KEYWORDS agmatinase; arginine decarboxylase; metK gene;
methionine adenosyltransferase; speA gene; speB gene.
SOURCE E.coli (strain K12) DNA, clones pLC2-5 and lambda-[1H10,23G45].
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 3236)
AUTHORS Moore,R.C. and Boyle,S.M.
TITLE Nucleotide sequence and analysis of the speA gene encoding arginine
decarboxylase of Escherichia coli
JOURNAL Unpublished (1990) Virginia S. Uni. Vet. Med., Blacksburg, VA 24061
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
R.C.Moore, 02-FEB-1990.
FEATURES from to/span description
pept 192 < 1 (c) methionine adenosyltransferase (metK)
pept 987 2963 arginine decarboxylase (speA) (EC 4.1.1.19)
pept 3101 > 3236 agmatinase (speB)
signal 3009 3022 rho-independent transcription terminator
signal 811 816 -35 signal
signal 839 844 -10 signal
binding 977 980 ribosome binding site
site 1878 1886 pyridoxal phosphate binding site (put.)
BASE COUNT 743 a 784 c 860 g 849 t
ORIGIN 62.9 min on K12 map.
1 tacccaaggt cgctggtggt gatttcgccg ccaactaaaa ccaatgccgg tttttacgta
61 ggtttcgcaa gcaacgcgtg ctttcggatc ctgttcgagg atcgcgtcta aaacggcatc
121 agaaatttgg tcagcaattt tgtcaggatg cccttcagag acggactcgg acgtaaaaag
181 gtgttttgcc atatttaata tcacctaaag agaatttggt tagctcaaac tgttgtgtgg
241 attttctgtg gtagcggatc ctaccacgac tctgcaggtt aaaaacactg gcagtctgag
301 tgttaatcgg tatggatgga ttaacatctg gatggctatt ttaggtcaat tcttcaccct
361 atttccactt ttttttgaat cgtgtctcat tctgttaaaa acgtggctgg aaatttttcc
421 tgacaatgcc ggcattctgc gtatttatct tttgcaattt tctgccattg tggggtataa
481 aacgcggcgc gcggcttaaa taaaaagcac acgacgtttc tttcgtgttg ccacttccag
541 ccgggttcaa atcagagttt tggcttgtgg gttcgtctta acaggcggcc gtggaggtga
601 tacgaaataa tgaaccgttg tctgctgctt aacctgtctc accgttctgg tgaagattcg
661 ttccccgcac tctgcatctc tgctttgcat acctgccgat gttataccca tctcggcgct
721 tctcaggatt caagagctgg ttacagttac tgaggactga acaagggcgc tcttgtaaaa
781 acaagagttt tctcgtggtt tcgccgaact ttcacactta cgttcggtta tgtgcttaat
841 aatgttatga aaaagaaacc ggttgcgcag ttggagcgtc agcattcact gctggaaaat
901 ccatgtgctt atgggttgtt atcgcagttc caggctgcga tagtcgttaa ctgttttaca
961 cttaataaaa taatttgagg ttcgctatgt ctgacgacat gtctatgggt ttgccttcgt
1021 cagcgggcga acacggtgta ctacgctcca tgcaggaggt tgcaatgagc tcccaggaag
1081 ccagcaagat gctgcgtact tacaatattg cctggtgggg caataactac tatgacgtta
1141 acgagctggg ccacattagc gtgtgcccgg acccggacgt cccggaagct cgcgtcgatc
1201 tcgcgcagtt agtgaaaact cgtgaagcac agggccagcg tctgcctgca ctgttctgtt
1261 tcccacagat cctgcagcac cgtttgcgtt ccattaacgc cgcgttcaaa cgtgcgaggg
1321 aatcctacgg ctataacggc gattacttcc ttgtttatcc gatcaaagtt aaccagcacc
1381 gccgcgtgat tgagtccctg attcattcgg gcgaaccgct gggtctggaa gccggttcca
1441 aagccgagtt gatggcagta ctggcacatg ctggcatgac ccgtagcgtc atcgtctgca
1501 acggttataa agaccgcgaa tatatccgcc tggcattaat tggcgagaag atggggcaca
1561 aggtctatct ggtcattgag aagatgtcag aaatcgccat tgtgctggat gaagcagaac
1621 gtctgaatgt cgttcctcgt ctgggcgtgc gtgcacgtct gcgttcgcag ggttcgggta
1681 aatggcagtc ctccggcggg gaaaaatcga agttcggcct ggctgcgact caggtactgc
1741 aactggttga aaccctgcgt gaagccgggc gtctcgacag cctgcaacta ctgcacttcc
1801 acctcggttc gcagatggcg aatattcgcg atatcgcgac aggcgttcgt gaatccgcgc
1861 gtttctatgt ggaactgcac aagctgggcg tcaatattca gtgcttcgac gtcggcggcg
1921 gtctgggcgt ggattatgaa ggtactcgtt cgcagtccga ctgttcggtg aactacggcc
1981 tcaatgaata cgccaacaac attatctggg cgattggcga tgcgtgtgaa gaaaacggtc
2041 tgccgcatcc gacggtaatc accgaatcgg gtcgtgcggt gactgcgcat cacaccgtgc
2101 tggtgtctaa tatcatcggc gtggaacgta acgaatacac ggtgccgacc gcgcctgcag
2161 aagatgcgcc gcgcgcgctg caaagcatgt gggaaacctg gcaggagatg cacgaaccgg
2221 gaactcgccg ttctctgcgt gaatggttac acgacagtca gatggatctg cacgacattc
2281 atatcggcta ctcttccggc atctttagcc tgcaagaacg tgcatgggct gagcagcttt
2341 atttgagcat gtgccatgaa gtgcaaaagc agctggatcc gcaaaaccgt gctcatcgtc
2401 cgattatcga cgagctgcag gaacgtatgg cggacaaaat gtacgtcaac ttctcgctgt
2461 tccagtcgat gccggacgca tgggggatcg accagttgtt cccggttctg ccgctggaag
2521 ggctggatca agtgccggaa cgtcgcgctg tgctgctgga tattacctgt gactctgacg
2581 gtgctatcga ccactatatt gatggtgacg gtattgccac gacaatgcca atgccggagt
2641 acgatccaga gaatccgccg atgctcggtt tctttatggt cggcgcatat caggagatcc
2701 tcggcaacat gcacaacctg ttcggtgata ccgaagcggt tgacgtgttc gtcttccctg
2761 acggtagcgt agaagtagaa ctgtctgacg aaggcgatac cgtggcggac atgctgcaat
2821 atgtacagct cgatccgaaa acgctgttaa cccagttccg cgatcaagtg aagaaaaccg
2881 atcttgatgc tgaactgcaa caacagttcc ttgaagagtt cgaggcaggt ttgtacggtt
2941 atacttatct tgaagatgag taagtcctgt gttacttgaa tccgcttaat ttagcggtga
3001 taatccgcca caatttattg tgacaaatcc aacccttcct cgtcgggcct aacgacgcgg
3061 aagggttttt ttatatcgac tttgtaatag gagtccatcc atgagcacct taggtcatca
3121 atacgataac tcactggttt ccaatgcctt tggtttttta cgcctgccga tgaacttcca
3181 gccgtatgac agcgatgcag actgggtgat tactggcgtg ccgttcgata tggcca
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS MUSIGHRT 314 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
68.2D8, partial cds.
ACCESSION M32044
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 68.2D8.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 314)
AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
TITLE Mouse antibody response to group a streptococcal carbohydrate
JOURNAL J. Immunol. 143, 4213-4220 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13)
IVS < 1 11 Ig H-chain V-region intron
BASE COUNT 88 a 58 c 86 g 82 t
ORIGIN Chromosome 12.
1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg
121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
301 tactgcacag atct
//
LOCUS MUSIGHRU 314 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
68.3D3, partial cds.
ACCESSION M32045
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 68.3D3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 314)
AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
TITLE Mouse antibody response to group a streptococcal carbohydrate
JOURNAL J. Immunol. 143, 4213-4220 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13)
IVS < 1 11 Ig H-chain V-region intron
BASE COUNT 88 a 58 c 85 g 82 t 1 others
ORIGIN Chromosome 12.
1 agggtccaga gtnaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg
121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
301 tactgcacag atct
//
LOCUS MUSIGHRV 306 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
80.1B5, partial cds.
ACCESSION M32046
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain 80.1B5), cDNA to mRNA, anti-GAC hybridoma 80.1B5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 306)
AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
TITLE Mouse antibody response to group a streptococcal carbohydrate
JOURNAL J. Immunol. 143, 4213-4220 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept / 12 > 306 Ig H-chain V-region, exon X (AA at 13)
IVS < 1 11 Ig H-chain V-region intron
BASE COUNT 85 a 56 c 85 g 80 t
ORIGIN Chromosome 12.
1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg
121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
301 tactgc
//
LOCUS MUSIGHRW 314 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
68.2C4, partial cds.
ACCESSION M32047
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 68.2C4.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 314)
AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
TITLE Mouse antibody response to group a streptococcal carbohydrate
JOURNAL J. Immunol. 143, 4213-4220 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13)
IVS < 1 11 Ig H-chain V-region intron
BASE COUNT 89 a 57 c 84 g 84 t
ORIGIN Chromosome 12.
1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg
121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
241 tccaaaagta atgtctacct tcaaatgaac agcttaaggg ctgaagacac tggaatttat
301 tactgcacag attt
//
LOCUS MUSIGHRX 314 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
65.1B3, partial cds.
ACCESSION M32048
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain A/J),cDNA to mRNA, anti-GAC hybridoma 65.1B3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 314)
AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
TITLE Mouse antibody response to group a streptococcal carbohydrate
JOURNAL J. Immunol. 143, 4213-4220 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13)
IVS < 1 11 Ig H-chain V-region intron
BASE COUNT 91 a 58 c 84 g 81 t
ORIGIN Chromosome 12.
1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgaactgg
121 gtccgccagt ctccagagaa gggacttgaa tgggttactg aaattagatt gaaatctgat
181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
301 tactgcacag atct
//
LOCUS MUSIGHRY 313 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
66.4A10, partial cds.
ACCESSION M32049
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 66.4A10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 313)
AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
TITLE Mouse antibody response to group a streptococcal carbohydrate
JOURNAL J. Immunol. 143, 4213-4220 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept / 12 > 313 Ig H-chain V-region, exon X (AA at 13)
IVS < 1 11 Ig H-chain V-region intron
BASE COUNT 86 a 59 c 84 g 83 t 1 others
ORIGIN Chromosome 12.
1 agggtncaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtcctgg
121 gtccgccagt ctccagagaa gggactttat tgggttgctg aaattagatt gaaatctgat
181 aattctgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
301 tactgcacag atc
//
LOCUS MUSIGHRZ 314 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
66.5A3, partial cds.
ACCESSION M32050
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 66.5A3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 314)
AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
TITLE Mouse antibody response to group a streptococcal carbohydrate
JOURNAL J. Immunol. 143, 4213-4220 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13)
IVS < 1 11 Ig H-chain V-region intron
BASE COUNT 91 a 58 c 84 g 81 t
ORIGIN Chromosome 12.
1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaaccactg gatgtactgg
121 gtccgccaat ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
181 aattatgtaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
241 tccaaaagta gtgtctacct gcaaatgaac aacttaaggg ctgaagacac tggaatttat
301 tacagcacag atct
//
LOCUS MUSIGHSA 305 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
66.3E3, partial cds.
ACCESSION M32051
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 66.3E3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 305)
AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
TITLE Mouse antibody response to group a streptococcal carbohydrate
JOURNAL J. Immunol. 143, 4213-4220 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept / 12 > 305 Ig H-chain V-region, exon X (AA at 13)
IVS < 1 11 Ig H-chain V-region intron
BASE COUNT 85 a 55 c 85 g 80 t
ORIGIN Chromosome 12.
1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtgctgg
121 gtccgccagt ctccagagaa ggggcttgag tgggttgctg aaattagatt gaaatctaat
181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
241 tccaaaagta gtgtctacct gcaaatgaac aacttagagg ctgaagacac tggaatttat
301 tactg
//
LOCUS MUSIGHSB 305 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
65.1D2, partial cds.
ACCESSION M32052
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 65.1D2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 305)
AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
TITLE Mouse antibody response to group a streptococcal carbohydrate
JOURNAL J. Immunol. 143, 4213-4220 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept / 12 > 305 Ig H-chain V-region, exon X (AA at 13)
IVS < 1 11 Ig H-chain V-region intron
BASE COUNT 85 a 55 c 86 g 79 t
ORIGIN Chromosome 12.
1 agggtccaga gtgaagtgaa acttgaggag tctggaggag gcttggtgca acctggagga
61 tccatgaaac tctcctgtgc tgcctctgga ttcactttca ggaactcctg gatatactgg
121 gtccgccagt ctccagagaa ggggcttgag tgggttggtg aaattagatt gaagtcagat
181 aattttgcaa cacaatatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
241 tccaaaagta gtgtctactt gcaaatgaac agcttaaggg ttgaagacac tgcaatttat
301 tactg
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS BSPCGTD 2648 bp ds-DNA BCT 15-DEC-1989
DEFINITION Bacillus sp. cyclomaltodextrin glucanotransferase gene, complete
cds.
ACCESSION M28053
KEYWORDS cyclomaltodextrin glucanotransferase.
SOURCE Bacillus sp. (strain 17-1; ATCC 31007) DNA.
ORGANISM Bacillus sp.
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2648)
AUTHORS Kaneko,T., Song,K.-B., Hamamoto,T., Kudo,T. and Horikoshi,K.
TITLE Construction of a chimeric series of Bacillus cyclomaltodextrin
glucanotransferases and analysis of the thermal stabilities and pH
optima of the enzymes
JOURNAL J. Gen. Microbiol. 135, 3447-3457 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by T.Kaneko, 18-SEP-1989.
FEATURES from to/span description
pept 224 2365 cyclomaltodextrin glucanotransferase precursor
(EC 2.4.1.19)
sigp 224 304 cyclomaltodextrin glucanotransferase signal
peptide
matp 305 2362 cyclomaltodextrin glucanotransferase
BASE COUNT 692 a 777 c 652 g 527 t
ORIGIN
1 gaattcgctt tcatttacaa tattgttgaa atcaaacact cgcataattt aagggccatg
61 cattccgtca ccgcacaccc ggtatggaac aaccccggtt tctcttcagg agatgccggg
121 gttttttgtt gccctatatt tcgtacagga ggtgattccc cgcagcttcg gtttgatcat
181 tgaaccaacc cccattctat aactgctaca taggaggaat acgatgaaaa aaatctccaa
241 acttacaacc gctcttgccc ttagcttgtc cctggcgcta agcttgctcg gcccagccca
301 tgcagccccg gatacctcgg tatccaacaa gcaaaatttc agcacggatg tcatttatca
361 aatctttacg gaccgtttct ccgacggcaa tcctgccaac aacccgaccg gcccggcctt
421 tgacggtacc tgcaccaacc tgaggcttta ctgcggcggc gactggcaag ggatcatcaa
481 caaaatcaac gacggctacc tgaccggcat gggcgttacc gccatctgga tctcccagcc
541 ggtcgagaac atctacagcg tcatcaatta ttccggcgtg aataacaccg cataccacgg
601 ctattgggct agagacttca agaaaacgaa cccggcttac ggcacgattg ccgatttcca
661 gaatctcatc gctgccgccc atgccaagaa catcaaagtc atcatcgact tcgcaccgaa
721 ccacacgtct cccgcttcct tggaccagcc gtcgtttgcg gagaacggca agctgtacaa
781 caacggcagg gacgaaggcg gatacacgaa cgatacgcac aatttattcc atcataacgg
841 gggcaccgat ttctcgacga ccgagaacgg gatttacaaa aacctgtacg atcttgccga
901 tctgaaccac aacaacagca cagtggatac ctacctgaag gatgccatta aaatgtggct
961 cgatctgggc attgacggca ttcgaatgga cgcggtaaaa catatgccgt tcggctggca
1021 gaagagcttc atggccaccg tcaacaatta caagccggtc ttcaccttcg gcgaatggtt
1081 cctcggggta aacgaagtga gtgccgagaa tcataagttc gccaacgttt ccggcatgag
1141 cctgctggac ttccgcttcg cgcagaaggt tcgccaggta ttcaaggaca acaccgacaa
1201 tatgtacgga ctgaaatcca tgctagaggg ctctgcgacc gattacgcgc agatggagga
1261 tcaggtcacg ttcatcgata accacgacat ggaacgtttc cacaataaca gcgccaaccg
1321 ccggaagctg gagcaagcgc tcgccttcac gttaacttcc cgcggcgtgc cagccatcta
1381 ttatggaacc gagcaataca tgtccggcgg gaacgatccc gacaaccggg ctcgaattcc
1441 ttctttctct accacgacta cggcttatca ggtcagtaaa aagctggcgc ctctgcgtaa
1501 atccaatccg gccatcgctt atgggacgac gcaggagcgc tggataaaca acgatgtgtt
1561 gatctatgag cgcaagttcg gcaacaacgt agccgtcatc gccgtcaacc gcaacgtgaa
1621 cacgtcggcc tccattacgg gactggtaac ctcactgccg gccggcagct acaccgacgt
1681 gctgggcggc ctcttgaacg gtaacaattt gaccgtcggc tccggcggat ccgcttccat
1741 cttcacgctg gcagcgggcg gtacggccgt gtggcaatac accacggccg tgactgcgcc
1801 aaccatcggg cacgtaggac cgatgatggc caaaccaggc gcagccgtca ccattgacgg
1861 ccgcggtttc ggtgcaacca aaggcacggt atacttcggc acgacggcag ttaccggcgc
1921 taatattacg gcttgggaag acacgcagat caaagtgaaa atccctgccg ttgccggagg
1981 cgtatacaat atcaaaatcg ccaacagtgc cggaacctca agcaatgtgc atgacaactt
2041 cgaagtgctg agcggggatc aggtcagcgt gcgctttgtg gtgaacaacg ccaccaccgc
2101 gctcggccag aacgtatatc tggcaggcag cgtaagcgag ctcggcaact gggacccggc
2161 caaagccatc ggtccgttgt acaaccaggt catctaccaa tacccaacct ggtactatga
2221 cgtcaccgtt cccgccggca aaacgatcga atttaaattc ctgaaaaaac agggctcgac
2281 ggtaacgtgg gaaggcggct cgaaccatac cttcaccgca ccaacgagcg gcacggccac
2341 cataaacgtg aactggcaac cgtaagttca agaacgacta gcttctgccg aagaacggcc
2401 cctaaacaaa accagctccg gatatgatcc cggagctggt tttgttcatg caggtcataa
2461 ttccagtagc aatagcccgt tacgcgaagc atctcacagt tcctctgaac tgcatgagcg
2521 attcccaaag agagagcgtg ctcagcaggc agtaactcag cattcctctg cattgcatga
2581 gcgattccca aagggagagc gtctcaacaa gtagtgactc agcaagcaga cgggacaact
2641 caacgatc
//
LOCUS BSUDIV 1403 bp ds-DNA BCT 14-FEB-1990
DEFINITION B.subtilis division initiation protein (divIB) gene, 5' end.
ACCESSION M31800
KEYWORDS division initiation protein.
SOURCE B.subtilis (strain 168; prototroph SB19) DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1403)
AUTHORS Harry,E.J. and Wake,R.G.
TITLE Cloning and expression of a Bacillus subtilis division initiation
gene for which a homolog has not been identified in another
organism
JOURNAL J. Bacteriol. 171, 6835-6839 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 165 ORF1 (AA at 1)
pept 295 1086 division initiation protein
pept 1083 > 1403 ORF3 (ttg start codon)
signal 213 240 transcription terminator
BASE COUNT 521 a 229 c 313 g 340 t
ORIGIN
1 ggcggtgcaa agatatcgga gatgcacgga aacttcatcg tcaatgcggg gggagcatca
61 gcaaaagatg tgcttgatct cattgaccat gtgaaaaaga caatccgtga aaaatacgag
121 attgatatgc acacagaggt tgaaatcatc ggcggaaatc gctgattcaa gttctgactg
181 aagctgttca tatgatatac tgtaagcaaa cgacaaacgg catcatagta tgccgtttgt
241 tttggaatag acagactttt aacagctgtt tcatttgaat gaggtgaaca ggcaatgaac
301 ccgggtcaag accgagaaaa aatcgttaac attgaagagc ggatccctaa aattaaagag
361 cagaggaaac aaaaggcaaa ccgccgtctc atttcattta ttatgctctt ttttattatg
421 gtgctgatca ttgtgtacct gcaaacacca atcagtaaag tatcaacaat ctctgttaca
481 gggaatgaaa atgtatcaaa aaaggaaatt atcgaccttt ccgatattaa cagcggagat
541 actgagtttt ggagtttgga caagcaaaaa acagaaaaga agattcagca aaataaatta
601 gtgaaaaaag cggagatcag caaatcgctt cctaataaaa tcaacattgc aatcgaagaa
661 tataaagcga ttgcttattt agaaaaagac gatgtgtact atgaagtgct tgaaaacggc
721 tcagttctgc caaacgaagt aacgccggat gatgccggac cgattttagt gaattggaca
781 aacgcgaaga agcggtctca aatggccaag cagctcgatg cgttgtcaaa ttcgttaaaa
841 cagtcgattt cagaaattta ttatacgcct gtaaaaatgg atgaaaaccg tattaagttg
901 tacatgaacg acggttatgt tgttacggca tctattaaaa cgtttgcaga ccgaatgaag
961 acgtatcctt ctatcatctc acagctaagc agcaataaaa aaggaatcat tcatttagaa
1021 gttgccacct attttgaaga gtttgggaaa aacgataaag ctgctaaaaa ggaagatgaa
1081 aattgagggg caaatcagca gtccttcttt ccttgattat gcttatcgca ggttttttga
1141 tttcattttc ttttcaaatg acaaaagaaa acaacaaaag cgcggctgag acggaagagt
1201 ggaaaaaaga atatgcgctg agagatgaac tgctgaaaca agagaaagaa aataagaagt
1261 ttgaaaaaga attatatcaa aagcaaaaca aagtccgcca ggcggaaaat aagctgaaga
1321 aagagaagtc ggaatactat aatgtcctcg aagatacaga aaaatacagg atgtatattg
1381 gggaagtcgg tgtgcaggga gaa
//
LOCUS HUMDF3AA 572 bp ss-mRNA PRI 16-FEB-1990
DEFINITION Human breast carcinoma-associated antigen DF3 mRNA, 5' end.
ACCESSION M31823
KEYWORDS antigen DF3; glycoprotein.
SOURCE Human breast carcinoma cell line MCF-7, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 572)
AUTHORS Abe,M., Siddiqui,J. and Kufe,D.
TITLE Sequence analysis of the 5' region of the human DF3 breast
carcinoma-associated antigen gene
JOURNAL Biochem. Biophys. Res. Commun. 165, 644-649 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 67 > 572 DF3 antigen
mRNA 1 > 572 DF3 mRNA
BASE COUNT 114 a 236 c 129 g 93 t
ORIGIN
1 acctctcaag cagccagcgc ctgcctgaat ctgttctgcc ccctccccac ccatttcacc
61 accaccatga caccgggcac ccagtctcct ttcttcctgc tgctgctcct cacagtgctt
121 acagctacca cagcccctaa acccgcaaca gttgttacgg gttctggtca tgcaagctct
181 accccaggtg gagaaaagga gacttcggct acccagagaa gttcagtgcc cagctctact
241 gagaagaatg ctgtgagtat gaccagcagc gtactctcca gccacagccc cggttcaggc
301 tcctccacca ctcagggaca ggatgtcact ctggccccgg ccacggaacc agcttcaggt
361 tcagctgcca cctggggaca ggatgtcacc tcggtcccag tcaccaggcc agccctgggc
421 tccaccaccc cgccagccca cgatgtcacc tcagccccgg acaacaagcc agccccgggc
481 tccaccgccc cccaagccca cggtgtcacc tcggccccgg acaccaggcc ggccccgggc
541 tccaccgccc cccaagccca cggtgtcacc tc
//
LOCUS MUSH2LINK 2252 bp ss-mRNA ROD 14-FEB-1990
DEFINITION Mouse MHC H-2K/t-w5-linked open reading frame mRNA, complete cds.
ACCESSION M32010
KEYWORDS .
SOURCE Mouse embryo carcinoma cell line F9, cDNA to mRNA, clone 10.2a.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2252)
AUTHORS Han,St.-Jacques.B., MacMurray,T.-H. and Shin,A.H.-S.
TITLE A putative transmembrane protein with histidine-rich charge
clusters encoded in the H-2K/t-w5 region of mice
JOURNAL Mol. Cell. Biol. 10, 138-145 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 301 1611 MHC H-2K/t-w5-linked ORF precursor
sigp 301 396 MHC H-2K/t-w5-linked ORF signal peptide (put.)
matp 397 1608 MHC H-2K/t-w5-linked ORF
mRNA < 1 2252 MHC H-2K/t-w5-linked ORF mRNA
BASE COUNT 491 a 586 c 702 g 472 t 1 others
ORIGIN
1 tcccggagcc ggtgagaggt ccctgctgct cccttacggc gctttctagg cctttacccc
61 aacgagtggg ccatagagac gcgggcccag agagaccgta aagttgctga tcaaaggcta
121 gagcggtgtc gggggtgggg ggctgcatcc aggaagggtg ttggggatga ggtggaccgg
181 ccttggggac aatgtaagag cggagcaagt agtatagagg aagggcttca agggacgcgg
241 atcccgaata ggtagattga gagtcaagtc gagtcgtctc ttgttcctcc ggtcagcgtg
301 atgaccatgg gcctgcgggc cccccactgg gtggctgtgg gactgctgac ctgggcggct
361 ttggggctgc tggtggccgg acacgagggt catggtgacc tgcacaaaga tgtggaagag
421 gacttccatg gccacagtca cgggcactca catgaagatt tccaccatgg ccacagccac
481 gggcactcgc atgaagattt ccaccatggc cacggccaca ctcatgaaag catctggcat
541 gggcatgccc acagccacga ccatggacac tcacgtgagg aattacacca tggccatagc
601 catggccact cccacgatag cctccaccac ggaggacatg gacatgccca tcgtgaacat
661 agccatggga cttctaggga ggctggggct ccaggcatca aacaccacct ggacactgtc
721 accctctggg cctacgcact gggggccaca gtgctgatct ccgcagctcc gttcttcgtg
781 ctgttcctca tcccagtaga atctaactct cccaggcacc gctctctgct ccagatcctg
841 ctcagttttg cttccggggg gctcctgggt gatgcgttcc tccacctcat cccgcatgcc
901 ttggaacctc attctcacca cgctccagag cagcctganc atggacactc ccacagtggc
961 cagggcccca ttctctctgt ggggctgtgg gttctcagtg ggattgtcgc cttcctcgtg
1021 gtggagaaat ttgtgagaca cgtgaaagga ggacatggac acagtcacgg acacggagac
1081 aggcatgcgc atggagacag tcacacccat ggagatagac atgagtgttc ttcaaaggaa
1141 aagcccagca cagaggaaga gaaggaagtg ggcgggttgc ggaaaaggag aggaggaaac
1201 actgggccca gagatggccc ggtgaaacct cagagccctg aagaagaaaa agcaggctca
1261 gacctgcgtg tgtctgggta cctgaatctg gctgctgact tggcacacaa cttcacagac
1321 ggtctggcca ttggtgcttc ctttcgtggg ggccgagggc tagggatcct gaccacaatg
1381 acagtcctgc tgcacgaact gcctcatgag gttggggatt ttgccatcct ggtccagtct
1441 ggctgcagca agaagcaggc gatgcgtctg caactcgtga ctgcaattgg agcattggca
1501 ggcactcgtg tgcccttctc accgagggag gggcagtgga cagtgacgtg gcaggtggtg
1561 caggtcctgg ctgggtcctg ccattcactg caggcggatt tatctacgta gcaacagtgt
1621 ctgtgctgcc tgagctattg agagaccgat ctccactgca gtcactgttg gaggtgctgg
1681 ggctgctggg gggtgttgcc atgatggtac tgattgccca tcttgagtga ggggtgaggt
1741 gacctgtcct tcccctaact cctaataaag gcagttggta gtcctggcca gtgctaatgc
1801 cagaaggagt gtcagccttg gagattagca gagcctctgt ttgaggcctt agaggtatga
1861 ggatcatacg gagcatgaga ggccagaagg accacggagt gggagactgc ccagcagcgt
1921 tgttgctttt ggaaaaatca atgggaccat gaagaagact gggaagacgg tagcctaccc
1981 gatggtccct aatctacgat ttctcagcgc cacgtgccgc aggggtccct ccaaggccca
2041 tctccctctg aactagttag tgatggcttc agggaagacc tggcagaagg actggggtag
2101 acatcaatcg tgtgtcctga tttgaagggg gaggggggct ccttgggaag atgtctcagc
2161 ctgatttttt gtctctactc attttatacc actgtttgaa tgggtgcgga ggaacggtga
2221 ccaggaataa aagaccttgg atcttcggcc cc
//
LOCUS PSEMVAAB 1117 bp ds-DNA BCT 16-FEB-1990
DEFINITION P.mevalonii HMG-CoA-reductase (mvaA) gene, 3' end, and
HMG-CoA-lyase (mvbB) gene, complete cds.
ACCESSION M31807
KEYWORDS 3-hydroxy-3-methyl glutaryl CoA-lyase;
3-hydroxy-3-methyl glutaryl CoA-reductase; HMG-CoA-lyase;
HMG-CoA-reductase.
SOURCE P.mevalonii DNA.
ORGANISM Pseudomonas mevalonii
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Acidiphilium cryptum; Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1117)
AUTHORS Anderson,D.H. and Rodwell,V.W.
TITLE Nucleotide sequence and expression in Esherichia coli of the
3-hydroxy-3-methylglutaryl coenzyme A lyase gene of Pseudomonas
mevalonii
JOURNAL J. Bacteriol. 171, 6468-6472 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 135 HMG-CoA-reductase (mvaA; AA at 1)
pept 147 1052 HMG-CoA-lyase (mvaB)
BASE COUNT 188 a 362 c 380 g 187 t
ORIGIN
1 catgcgcgca atattgccgt ggtggcgggc gcccgaggcg atgaggtgga ctgggttgcc
61 cggcagttgg tggaatacca cgacgtgcgc gccgaccgcg ccgtagcagt gctgaaacaa
121 aagcgcggcc aatgagcgtg gtcccgatgc aagcggtaaa ggtctttgaa gtcggccccc
181 gtgacggcct gcagaacgaa cgccagccgc tgtcggtggc cgcccgtgtg ggcttgatcg
241 gcgaactggc tggcaccggc ctgcggcata tcgaagccgg cgccttcgtg tcgccgcgct
301 gggtgccgca gatggccggc agcgacgagg tgttgcgcca gttgcccagc aacgacgggg
361 tcagttacac ggccctggtg cccaaccggc aaggcttcga ggccgcgcaa cgggctggct
421 gccgcgaggt agcggtgttc gccgccgcct ccgaggcgtt ttcgcgcaac aacatcaatt
481 gctccatcga tgaaagcttc gagcgcttca ccccggtgtt gcgcgccgcc aacgaagcct
541 ctatccgggt gcgcggttat gtatcctgcg tgctcggttg cccgttcagt ggggccgttg
601 cgccggaggc tgtggccaag gtcgcacgcc gcctgtacga actgggctgc tacgaaatca
661 gcctgggtga caccattggc gccggccgcc cggatgaaac ggctcaattg ttcgagctct
721 gcgcacggca actgccggtc gcggcactgg ccggccactt ccacgatacc tggggcatgg
781 ccatcgccaa tgtgcatgcc gcactcgcgc agggtgtacg caccttcgac agctcggtcg
841 cgggcctcgg cggctgcccc tactcgccgg gtgccagcgg taacgtggcc acggaagatc
901 tgttgtacct gctgcacggc ctgggctaca gcaccggtgt cgacctggag gcggtggcac
961 aggttggtgt gcgcatcagc gcgcagctgg gcaccgccaa ccgctcccgt gccggccttg
1021 ccctggcagc aaggagcgcc cgcgaacact gatagcgcct gtggctcccg aacaaaaaca
1081 acaagcccgc aagggcagga gaatgccaat gaccttt
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMFAPS 1148 bp ss-mRNA PRI 12-JAN-1990
DEFINITION Human farnesyl pyrophosphate synthetase mRNA, complete cds.
ACCESSION J05262
KEYWORDS farnesyl pyrophosphate synthetase.
SOURCE Human hepatoma cell line HepG2, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1138)
AUTHORS Wilkin,D.J., Kutsunai,S.Y. and Edwards,P.A.
TITLE Isolation and sequence of the human farnesyl pyrophosphate
synthetase cDNA: Coordinate regulation of the mRNAs for farnesyl
pyrophosphate synthetase, 3-hydroxy-3-methylglutaryl coenzyme A
reductase, and 3-hydroxy-3-methylglutaryl coenzyme A synthase by
phorbol ester
JOURNAL J. Biol. Chem. 265, 4607-4614 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1139 to 1148)
AUTHORS Wilkin,D.J.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.J.Wilkin, 12-JAN-1990, for release after publication.
FEATURES from to/span description
pept 7 1068 farnesyl pyrophosphate synthetase (EC 2.5.1.1)
BASE COUNT 302 a 272 c 324 g 250 t
ORIGIN
1 cacagaatga acggagacca gaattcagat gtttatgccc aagaaaagca ggatttcgtt
61 cagcacttct cccagatcgt tagggtgctg actgaggatg agatggggca cccagagata
121 ggagatgcta ttgcccggct caaggaggtc ctggagtaca atgccattgg aggcaagtat
181 aaccggggtt tgacggtggt agtagcattc cgggagctgg tggagccaag gaaacaggat
241 gctgatagtc tccagcgggc ctggactgtg ggctggtgtg tggaactgct gcaagctttc
301 ttcctggtgg cagatgacat catggattca tcccttaccc gccggggaca gacctgctgg
361 tatcagaagc cgggcgtggg tttggatgcc atcaatgatg ctaacctcct ggaagcatgt
421 atctaccgcc tgctgaagct ctattgccgg gagcagccct attacctgaa cctgatcgag
481 ctcttcctgc agagttccta tcagactgag attgggcaga ccctggacct cctcacagcc
541 ccccagggca atgtggatct tgtcagattc actgaaaaga ggtacaaatc tattgtcaag
601 tacaagacag ctttctactc cttctacctt cctatagctg cagccatgta catggcagga
661 attgatggcg agaaggagca cgccaatgcc aagaagatcc tgctggagat gggggagttc
721 tttcagattc aggatgatta ccttgacctc tttggggacc ccagtgtgac cggcaaaatt
781 ggcactgaca tccaggacaa caaatgcagc tggctggtgg ttcagtgtct gcaacgggcc
841 actccagaac agtaccagat cctgaaggaa aattacgggc agaaggaggc tgagaaagtg
901 gcccgggtga aggcgctata tgaggagctg gatctgccag cagtgttctt gcaatatgag
961 gaagacagtt acagccacat tatggctctc attgaacagt acgcagcacc cctgccccca
1021 gccgtctttc tggggcttgc gcgcaaaatc tacaagcgga gaaagtgacc tagagattgc
1081 aagggcgggg agaggaggct ctcaataaat aatcgtgtaa ccttaaaaaa aaaaaaaacc
1141 tcgacgat
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS MUSPROA 1987 bp ss-mRNA ROD 23-JAN-1990
DEFINITION Mouse protective protein (Mo54) mRNA, complete cds.
ACCESSION J05261
KEYWORDS glycoprotein; protective protein; serine carboxypeptidase.
SOURCE Mouse (strain BCBA) testis and brain, cDNA to mRNA, clones
lambda-B1 and lambda-T2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1987)
AUTHORS Galjart,N.J., Gillemans,N., Meijer,D. and d'Azzo,A.
TITLE Mouse "protective protein": cDNA cloning, sequence comparison, and
expression
JOURNAL J. Biol. Chem. 265, 4678-4684 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.d'Azzo, 22-JAN-1990, for release after publication.
FEATURES from to/span description
pept 214 1638 protective protein precursor
sigp 214 282 protective protein signal peptide
matp 283 1635 protective protein
signal 1966 1971 poly-A signal
BASE COUNT 453 a 580 c 522 g 432 t
ORIGIN
1 ggaactgctc gatgcgcaga tagggttcaa ccagtagggc ttggggacca ctggggaccg
61 ctgcctccgc ttctggttga ccttcccggg gcacacggtt gtggtggcgt gtgatagcga
121 agacccaact gtggccaagg ctgaccaagt caggcagtga aaactcgggt acagcggcca
181 gactggcggg atccagagcg gtcaggccaa gacatgcccg gaaccgcgct gtctccactg
241 ctcttgttgc tgctcctgtc ctgggcgtcc cggaacgaag cagctccgga ccaggatgaa
301 atcgattgtc tccccggcct ggccaagcag ccctctttcc ggcaatactc cggctacctc
361 agagcatcgg actccaagca cttccactac tggtttgtgg agtcgcagaa cgacccaaag
421 aacagccccg tggtgctttg gcttaacggg ggtcccggct gcagctcgct cgatgggctg
481 cttacagagc acggcccctt tctgatccag ccagatggtg tcaccctgga gtacaacccc
541 tatgcttgga acctgattgc caacgtgctg tatatcgagt ccccagctgg ggtgggcttc
601 tcctactcgg atgacaagat gtacgtgacc aatgacacag aggtggcgga gaacaattat
661 gaagccctta aagacttctt ccgcctcttt ccggaataca aggacaacaa acttttcctg
721 acaggagaga gctatgctgg catctacatc cccaccttgg ctgtactggt catgcaggat
781 cctagcatga atcttcaggg gctggctgtg ggcaatggac ttgcctccta tgagcagaac
841 gacaactccc tggtctactt tgcctactac catggccttc tggggaacag actttggact
901 tcactgcaga cccactgctg cgctcagaac aagtgtaact tctatgacaa caaagaccca
961 gagtgtgtaa acaatctcct ggaagtgtct cgaattgtgg gcaaatctgg cctcaacatc
1021 tacaatctct atgctccgtg tgctggtggg gtgcccggca gacatagata tgaggacaca
1081 cttgtagtcc aggattttgg caacatcttc actcgcctgc cacttaagcg gagatttcct
1141 gaggcactga tgcgttctgg ggacaaggta cgcttggatc ctccctgcac caacaccaca
1201 gccccttcca actacctcaa caacccctat gttcggaagg ctctccacat ccccgagtcg
1261 ctgccccgct gggacatgtg caacttcttg gtgaatttac agtaccgccg cctctaccaa
1321 agcatgaact cccagtacct gaagctgctc agttcacaga aataccagat cctgctctac
1381 aacggagatg tggacatggc ctgcaacttc atgggcgatg agtggtttgt ggattcgctc
1441 aaccagaaga tggaggtgca gcgccggccc tggctagtgg actacgggga gagcggagaa
1501 caggtagctg gtttcgtgaa ggagtgttca cacatcacct tcctcaccat caagggtgcc
1561 ggacacatgg tccccacgga caagcctcga gctgctttta ccatgttctc gaggttcctg
1621 aacaaagagc cttactgata gcttccgctg caacccatgg aaatcctcag tacctgatca
1681 gcccctctcg gcctctctca gcagagagga gtcctccgct aatgctcccg tggggcaagt
1741 cctgtactaa aactgcccag acttctcaca gcctggtgca tcccagcctg ggcctcaggg
1801 cgtcacagac agcctaggac caagtgagca ctttgttccg gccatcgtgg ggtggcctga
1861 cctccccgac tgcataaaaa ctgaccaccc caggacccca acagagctag agaaacccaa
1921 ggaggggggt ggatggattc taattgatgg ttgactaatt ctggaattaa atcgggtata
1981 gcttcca
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS RATCAA 1150 bp ds-DNA ROD 07-FEB-1990
DEFINITION Rat cytosolic aspartate aminotransferase gene, exon 1.
ACCESSION J05263
KEYWORDS cytosolic aspartate aminotransferase.
SOURCE Rat (strain Wistar) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1150)
AUTHORS Pave-Preux,M., Aggerbeck,M., Veyssier,C., Bousquet-Lemercier,B.,
Hanoune,J. and Barouki,R.
TITLE Hormonal discrimination among transcription start sites of
aspartate aminotransferase
JOURNAL J. Biol. Chem. 265, 4444-4448 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Barouki, 05-FEB-1990, for release after publication.
FEATURES from to/span description
pept 684 / 801 cytosolic aspartate aminotransferase, exon 1
(EC 2.6.1.1)
pre-msg 592 > 1150 cytosolic aspartate aminotransferase mRNA and
intron (alt.)
pre-msg 614 > 1150 cytosolic aspartate aminotransferase mRNA and
intron (alt.)
pre-msg 618 > 1150 cytosolic aspartate aminotransferase mRNA and
intron (alt.)
pre-msg 620 > 1150 cytosolic aspartate aminotransferase mRNA and
intron (alt.)
pre-msg 628 > 1150 cytosolic aspartate aminotransferase mRNA and
intron (alt.)
pre-msg 630 > 1150 cytosolic aspartate aminotransferase mRNA and
intron (alt.)
pre-msg 635 > 1150 cytosolic aspartate aminotransferase mRNA and
intron (alt.)
pre-msg 649 > 1150 cytosolic aspartate aminotransferase mRNA and
intron (alt.)
IVS 802 > 1150 cytosolic aspartate aminotransferase intron A
BASE COUNT 232 a 334 c 338 g 246 t
ORIGIN
1 ctgcagggtc tctcctgcca gcgatcaagg tagacaacac ccccaggttt ctttctgcgg
61 aagcactgag tggagatgga gaaggtcacc cgcgatcggg tacagcgtgt ccaggcggac
121 cactagagtc cagtgtcggg gacttggctg tcagcccggc ctggctaacg caggaacgtg
181 cgcaggaacg gagcctggcg gggacagcca cctgctcagc tggtacagaa agacctgttc
241 tctactattc ccgagggcgg gccaccgccc caggtctcca ggacccagaa ggtcaggagg
301 ctgtcctcta gggactcccg ccacccgcat catcctcatc accctaagga ataaaagcct
361 cagcgattgg aacacgctat gccaatcatc ctagtctttg cccacgcctc ctccggtgtc
421 cactgcattg gttgcatcat acaagcctcc gataagatcc tcctcttgaa ttggctaata
481 gagccttgtc ccgcccaacc cttctttgtc ggaaggttgt gattggttaa gcgcgttgcc
541 aagctccggg cgcggctcgg tcattggaag ccgcgggctg ccccgcccgc cggtgaaggt
601 gagagtctcc tccagtcaca ggtccccgca ccgcacggag cagctcccag ctcgtgctcg
661 aggtcccctc agattccatc gcgatggccc ctccatcatt ctttgcccag gttccacagg
721 ccccgccggt tctggtcttt aagctcattg cggacttccg ggatgatccc gatccccgca
781 aggttaacct cggcgtggga ggtaaggatg cagtgccctg gaatgcgagc agctttgtgg
841 ggggaggtgg gcttaggagg aactgaggct tagtttggtg cctccctgag aggggactgg
901 ggagagaagg ctgggaccac gagggcatca ccagcctacc catttcacag atggcgatac
961 agagtccttg gaaagaggcg gtgttcaagg tcacacagga caggtgtaac atctgggaca
1021 catgccgttt ttgtttcttc caccctgtct tctatatagt acttgttcta aagcaaccat
1081 ggggtggtaa ggagagagaa gtctggtagc tttggcgtca tggtcagctg ttagcctaat
1141 ggagctgcag
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS RATSKR 2979 bp ss-mRNA ROD 16-FEB-1990
DEFINITION Rat substance K receptor mRNA, complete cds.
ACCESSION M31838
KEYWORDS substance K receptor.
SOURCE Rat stomach, cDNA to mRNA, clone prTKR1-1.
ORGANISM Rattus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2979)
AUTHORS Sasai,Y. and Nakanishi,S.
TITLE Molecular characterization of rat substance K receptor and its
mRNAs
JOURNAL Biochem. Biophys. Res. Commun. 165, 695-702 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 449 1621 substance K receptor
signal 2957 2962 poly-A signal
BASE COUNT 721 a 859 c 729 g 670 t
ORIGIN
1 ggcccacaga cttggcacac tgtgcacctc cgtcatctgg gatgtaccct gaatagcatt
61 aagagagatc tgaattcaaa ttcgggtttg tcaccaactt ccggaatgcc tgttgctgga
121 cgcgtctttt ctctgactct cagcgttctg actctggtgt gaaagagagg agccgatcca
181 acgcgctatc aggatgcctc ctgctctgag actggagttt gagttctcca tctccaagag
241 atgcttgaac agtctgattg ggaccttgct tcctgctcag cctgactcat ttctgcccga
301 gctgtaggcg tggccctctg gctacacgtg tctgcttttg aatcgtcctt ccccccttaa
361 atctacacac cgaagaggat ttctcttctg tggcaattcg agacggaact gcagggccag
421 accctgtgtt ccaggcccag aatcagtcat ggggacccgt gccattgttt ctgacgccaa
481 catcttgtct ggcctcgaga gcaatgccac gggtgttaca gccttctcca tgcctggctg
541 gcagctggcg ttatgggcca cagcctacct ggccctggtg ctggtggctg tgacaggcaa
601 tgccacagtc atctggatca ttctggccca cgagaggatg cgcacagtca ccaactattt
661 catcatcaac ctggccttgg cggacctctg catggcagcc ttcaacgcga ccttcaactt
721 catctacgcc agtcacaaca tctggtactt cggccgtgcc ttctgctatt tccagaacct
781 ctttcccatc acagccatgt tcgtcagcat ctactccatg accgccattg ctgctgacag
841 gtacatggcc atcgttcacc ccttccagcc acggctctcg gcccccagca ccaaggcgat
901 tatcgctggc atctggctgg tagccctggc tctcgcctcc ccgcaatgct tctactccac
961 catcactgtg gacgaggggg ccaccaagtg tgtggtggcc tggcccaatg acaacggagg
1021 caagatgctc ctactgtatc atctggtcgt gtttgtcctc atctacttcc tgcctctcct
1081 ggtgatgttc ggggcttaca gtgtcatcgg cctcacactg tggaagcgcg cggtacccag
1141 acaccaggct cacggagcca acttacgcca tctacaggcc aagaagaagt ttgtgaaggc
1201 catggtactg gtggtgctga catttgccat ctgctggctg ccctaccacc tctacttcat
1261 cctggggacc ttccaagagg acatctacta ccacaagttt atccagcagg tctacctggc
1321 gctcttctgg ctggccatga gctccacaat gtacaaccct atcatttatt gctgccttaa
1381 ccacaggttt cgctctggat tccggcttgc tttccggtgc tgcccctggg tgacaccaac
1441 tgaggaagac aggctggagc tgactcacac gccatccctc tccaggagag tcaaccggtg
1501 tcataccaag gagactttgt tcatgacggg ggacatgacc cactctgagg ctaccaatgg
1561 acaggttggg agtccccagg atggggagcc tgctggaccc atctgcaagg cccaggctta
1621 gcccctttga gaagcagctg acggaaaagc ccctacataa tagcttgtcc cacaccccat
1681 tcttcaacac accaacagaa aaaacagaat ggtgctcgga atgccacagg caatacaggt
1741 gtcaacaaag aactctgagc acagacacct gtcacaggca atccagtgat ggaggaaggc
1801 gctgggagga ctgggaagcc acacttggcc aacctagaag ctgcaaagtg agatgtctcc
1861 caggagtata gagagagaag gaaaactgga tcccagcgaa gaaaactgca gttgtaatga
1921 gaagggcagc tgaaggacag agagatggct tggcgaggtg gaaaagcctc tgctaagtaa
1981 gcagtggaaa agcctctgct aagccttgtg acctaagtgc agtacctgca acccacaggg
2041 aaggtaagaa ccaaagccta caagttgtcc tctgacttcc tccaatgtat gcatgcatca
2101 tgtatgcatc tgtgtgtgca catacatctg cacatgcacg cacatacaaa atatgcgcat
2161 gcacgtgcac acttacaatg ttaaagagga agaaggcgac agctcccaca ggactgtaac
2221 agattgcaga ggacaagcgc ccaaagggaa acccagctgc ttcgctcgct ctggaacaca
2281 attgccaggt tttccgaata aagaaagcct aagatccaaa tatatatgta tgggagctcc
2341 gggcttctaa atttggttca tgaaaacgct tagagacact ggctaaccca gagcggtgca
2401 catccacacc accacctcct cgccaaagac aagccacaaa ggaagtcacc tatttggctg
2461 cgaatggatc catggttttt ctaccttagg atctaaaagc tcccatccag cccagagggg
2521 agggcagtta acagtccaga ggagcttcag ctgagatgta agatactctg accccctgct
2581 tgatggcagg attgccgtga ggcttgttgg aaacgatgcc ttcaaaccag caggccacat
2641 ccagcgaaat gctcaataaa tagtaaccac gattcttcgc acggcgtcag cttttaagac
2701 acaaaagcca cctacaagtt aggagaatgg ctccccctct cacttctgca gtgaggagct
2761 ggcaggcaga gcacaccgat ggtcccagtt ctaagaacgc ctggcctagg ccgctccagg
2821 ccagtccgtc tccctgcttg gtggtcactc atttgctgtc taccccgtgc cctctgaaag
2881 ccaaagacgc tgtaaacacg tgttcaaagt ttaatgtcac ctgagctttc ccagagttca
2941 ctgcgccagt acccccaata aagggttctg atgtcactt
//GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS RATIGFBP3 2352 bp ss-mRNA ROD 16-FEB-1990
DEFINITION Rat insulin-like growth factor-binding protein (IGF-BP3) mRNA,
complete cds.
ACCESSION M31837
KEYWORDS insulin-like growth factor-binding protein.
SOURCE Rat ovary, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2352)
AUTHORS Shimasaki,S., Koba,A., Mercado,M., Shimonaka,M. and Ling,N.
TITLE Complementary DNA structure of the high molecular weight rat
insulin-like growth factor binding protein (IGF-BP3) and tissue
distribution of its mRNA
JOURNAL Biochem. Biophys. Res. Commun. 165, 907-912 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 57 935 insulin-like growth factor binding protein
precursor
sigp 57 137 insulin-like growth factor binding protein
signal peptide
matp 138 932 insulin-like growth factor binding protein
mRNA < 1 2352 IGF-BP3 mRNA
BASE COUNT 573 a 623 c 606 g 550 t
ORIGIN
1 gggagcagcg agcaagcagg tcctcagcgt ccagtcaccg ctctaagcca ggcgccatgc
61 atcccgcgcg ccccgcgctc tgggcggctg cgctcaccgc cctcactctg ctccgcggac
121 cgccagtggc gcgggccggc gcgggcgcgg tgggcgcggg ccccgtggtg cgctgcgaac
181 cgtgcgacgc gcgtgcgctg gcccagtgcg cgcctccgcc caccgcgccc gcgtgcacgg
241 agctggtgcg agaacccggc tgcggctgct gcctgacttg cgcgctgcgc gaaggcgacg
301 cgtgcggcgt ctacacggag cgctgtggca ccggcctccg ctgccagccg cgaccggccg
361 agcagtatcc cctgaaggcg ctgctgaatg gccgcgggtt ctgcgccaac gccagcgccg
421 ccagcaacct gagtgcctac ctcccctccc agccgtctcc tggaaacacc actgagtctg
481 aggaggacca caatgctggg agtgtggaaa gccaggttgt ccccagcaca catcgcgtga
541 ctgattccaa gttccatcca ctccattcaa agatggaggt catcataaaa ggccaggcta
601 gggacagcca gcgctacaaa gttgactatg agtcccagag cacagacacc cagaacttct
661 cctccgagtc taagcgggag acagaatatg gtccctgccg cagagaaatg gaggacacac
721 tgaatcatct gaagttcctc aatgtgctga gtcccagggg cgtccacatc ccaaactgtg
781 acaagaaggg gttctataag aagaaacagt gtcgcccttc caaaggcaga aagcggggct
841 tctgctggtg cgtggacaag tacgggcagc cattgccagg ctatgacacc aaggggaaag
901 acgacgtgca ttgcctcagc gtgcagagcc agtagatacc gctgtgccac ttaacgtgga
961 gctcaaatac gccttatttt gcacaaaaga ctgccaacaa cgtgatcagc agctggctat
1021 accttgattt atatttctct ctctctctct ctctctctct ctctctctct ctctctcttt
1081 tgtggtgaac tgaataaaaa caaacaaaac acatacaaaa acaaaaacaa aaaaaaaagc
1141 caagtttaga cagatttctg aaatgcctct ggttgtttaa atagtgaact tggtcatctt
1201 tgtatctcgc agtagtcaac caaaagcagt ttgaattttc ttgttgcttc ctatgaaaac
1261 cacacgtgta ctccaggcca cggatgccgt cgccccctaa ctcacccacc cactgtgggc
1321 ttcagtgctg ctggccctct gccttcttga tttcagaggc tctgttgctg atagagaaaa
1381 accctctttc catcccctgt aagtaagtgc aggcactgtg gagaatgggg aagcctggaa
1441 cccagtgacc cggacgtctg gaagcatcct cctgaggcct ctggtcctta ttgtgccatc
1501 tctgaatcaa gggcctggcc ctgtatctgc aagtggcctg acctacttgg gaactgtggg
1561 agagaaaaat gtgttgtctc tcttactaaa aatgactaag aatgttctag ggcgctccga
1621 gagcccataa agacaaggac aaggaccttc ctttgtcagg cagcttcctg atgacttggc
1681 ccagcagaaa tatcaaactc catgtgcaga gatgtcgcaa ataacggtgc gcttagttct
1741 ccggatgact tcaagaaaac agtgttttct ggcccagcct ctcaaaataa aatttgttgt
1801 ggggtggggc tgaggggagg cagctttcaa aagagagaag gttttcatct tccttgttgg
1861 agaccctggt aagaacatgg agagaatcac ctgtttgttg atcttggggt ccttctcaaa
1921 ctttctttat aattcatgcg tatatgcaga caaaatatgt tcttaattgt taacattgta
1981 tacaacatag cccaaatata ttagaatctg tactagataa tcctagataa aaggttagag
2041 atgctaggtg atgtaaccac agacacgccc gaggaaagga gcctgtgtct ggaggctggg
2101 ccgctttccc cgaggccaag gccatggtgg tctggcaata cagggtgtga ggagactgta
2161 ctgcatccca cggggtggac atgcgctgta cagagctttc cttgagagca caaaggaatc
2221 ttgagacatt ctgcctgcct gtcagctttt ctttattttt ttaattaagt ttttggggga
2281 aaaatgtatt tttgaaaagt ttgtcttgca atgtatttat aaatagtaaa taaagttttt
2341 ttactattta ag
//GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS HUMTSHRX 2415 bp ss-mRNA PRI 22-FEB-1990
DEFINITION Human thyroid stimulatory hormone receptor (hTSHR) mRNA,
complete cds.
ACCESSION M32215
KEYWORDS thyroid stimulatory hormone receptor.
SOURCE Human thyroid, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2415)
AUTHORS Misrahi,M., Loosfelt,H., Atger,M., Sar,S., Guiochon-Mantel,A. and
Milgrom,E.
TITLE Cloning, sequencing and expression of human TSH receptor
JOURNAL Biochem. Biophys. Res. Commun. 166, 394-403 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 45 2339 thyroid stimulatory hormone receptor
/hgml_locus_uid="TSHR" /map="22q11-q13"
/nomgen="LY0058G"
BASE COUNT 638 a 624 c 527 g 626 t
ORIGIN
1 ggcgatttcg gaggatggag aaatagcccc gagtcccgtg gaaaatgagg ccggcggact
61 tgctgcagct ggtgctgctg ctcgacctgc ccagggacct gggcggaatg gggtgttcgt
121 ctccaccctg cgagtgccat caggaggagg acttcagagt cacctgcaag gatattcaac
181 gcatccccag cttaccgccc agtacgcaga ctctgaagct tattgagact cacctgagaa
241 ctattccaag tcatgcattt tctaatctgc ccaatatttc cagaatctac gtatctatag
301 atgtgactct gcagcagctg gaatcacact ccttctacaa tttgagtaaa gtgactcaca
361 tagaaattcg gaataccagg aacttaactt acatagaccc tgatgccctc aaagagctcc
421 ccctcctaaa gttccttggc attttcaaca ctggacttaa aatgttccct gacctgacca
481 aagtttattc cactgatata ttctttatac ttgaaattac agacaaccct tacatgacgt
541 caatccctgt gaatgctttt cagggactat gcaatgaaac cttgacactg aagctgtaca
601 acaacggctt tacttcagtc caaggatatg ctttcaatgg gacaaagctg gatgctgttt
661 acctaaacaa gaataaatac ctgacagtta ttgacaaaga tgcatttgga ggagtataca
721 gtggaccaag cttgctggac gtgtctcaaa ccagtgtcac tgcccttcca tccaaaggcc
781 tggagcacct gaaggaactg atagcaagaa acacctggac tcttaagaaa cttccacttt
841 ccttgagttt ccttcacctc acacgggctg acctttctta cccaagccac tgctgtgcct
901 ttaagaatca gaagaaaatc agaggaatcc ttgagtcctt gatgtgtaat gagagcagta
961 tgcagagctt gcgccagaga aaatctgtga atgccttgaa tagccccctc caccaggaat
1021 atgaagagaa tctgggtgac agcattgttg ggtacaagga aaagtccaag ttccaggata
1081 ctcataacaa cgctcattat tacgtcttct ttgaagaaca agaggatgag atcattggtt
1141 ttggccagga gctcaaaaac ccccaggaag agactctaca agcttttgac agccattatg
1201 actacaccat atgtggggac agtgaagaca tggtgtgtac ccccaagtcc gatgagttca
1261 acccgtgtga agacataatg ggctacaagt tcctgagaat tgtggtgtgg ttcgttagtc
1321 tgctggctct cctgggcaat gtctttgtcc tgcttattct cctcaccagc cactacaaac
1381 tgaacgtccc ccgctttctc atgtgcaacc tggcctttgc ggatttctgc atggggatgt
1441 acctgctcct catcgcctct gtagacctct acactcactc tgagtactac aaccatgcca
1501 tcgactggca gacaggccct gggtgcaaca cggctggttt cttcactgtc tttgcaagcg
1561 agttatcggt gtatacgctg acggtcatca ccctggagcg ctggtatgcc atcaccttcg
1621 ccatgcgcct ggaccggaag atccgcctca ggcacgcatg tgccatcatg gttgggggct
1681 gggtttgctg cttccttctc gccctgcttc ctttggtggg aataagtagc tatgccaaag
1741 tcagtatctg cctgcccatg gacaccgaga cccctcttgc tctggcatat attgtttttg
1801 ttctgacgct caacatagtt gccttcgtca tcgtctgctg ctgttatgtg aagatctaca
1861 tcacagtccg aaatccgcag tacaacccag gggacaaaga taccaaaatt gccaagagga
1921 tggctgtgtt gatcttcacc gacttcatat gcatggcccc aatctcattc tatgctctgt
1981 cagcaattct gaacaagcct ctcatcactg ttagcaactc caaaatcttg ctggtactct
2041 tctatccact taactcctgt gccaatccat tcctctatgc tattttcacc aaggccttcc
2101 agagggatgt gttcatccta ctcagcaagt ttggcatctg taaacgccag gctcaggcat
2161 accgggggca gagggttcct ccaaagaaca gcactgatat tcaggttcaa aaggttaccc
2221 acgagatgag gcagggtctc cacaacatgg aagatgtcta tgaactgatt gaaaagtccc
2281 atctaacccc aaagaagcaa ggccaaatct cagaagagta tatgcaaacg gttttgtaag
2341 ttaacactac actactcaca atggtagggg aacttacaaa ataatagttt cttgaatatg
2401 cattccaatc ccatg
//GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS MUSHSP70 2249 bp ss-mRNA ROD 23-FEB-1990
DEFINITION Mouse heat shock-related protein (hcs70t) mRNA, complete cds.
ACCESSION M32218
KEYWORDS heat shock-related protein.
SOURCE Mouse male testicular germ cell, cDNA to mRNA, clones GE6 and HS2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2249)
AUTHORS Matsumoto,M. and Fujimoto,H.
TITLE Cloning of a hsp70-related gene expressed in mouse spermatids
JOURNAL Biochem. Biophys. Res. Commun. 166, 43-49 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 1893 heat shock-related protein
BASE COUNT 612 a 569 c 583 g 484 t 1 others
ORIGIN
1 atggctgcta ataaaggaat ggcgatcggc atcgacctgg gcaccaccta ctcgtgcgtg
61 ggcgtgttcc aacacggcaa ggtggagatc atcgccgact accaggtcaa ccgcaccacc
121 cctgactacg tggccttcac cgacaccagc cgcctcatcg agcgctccaa gaaccaggtg
181 gccatgaatc cccagaacac tgtttttgat gccaaacgtc taattggcag gaagtttaat
241 gatcctgttg tgcagtcaaa tatggaagct ttggccattt caatgatcaa tgaagccggc
301 aaacccaagg tgatggtgtc ctataaagga gagaagaaag ccttctaccc agaggagatc
361 tcatccatgg tactgacgaa gatgaaggag actgcagagg ttttttgggc accaatgtca
421 caacgtgtga tcacggtgcc agcctatttc aatgactctc agcggcaagc caccaaggat
481 gcaggtgtca tcgcaggact caatgtgctg agaataatca atgagcccac ggcggtcgca
541 tgggctaggt tggataaagg aagtcacgca gagcggcacg tgctcatctt cgacctgggg
601 ggtggcacgt tcgacttgtc catcctgacg atcgacgacg catcttcagt gaagccacgc
661 ggcgacacgc acctgggagg ggaggacttc gacaaccggc tggtgagcca cttcgtgaga
721 gtcaagagga agcacaagaa ggacaatcaa gccaagaaca agcgcgcggt gcggcggctg
781 cgcacggcgt gtgagagggc caagaggacg ctgtcgtcca gcacccaggv aaacctggag
841 atcgactctt tatatgaggg catcgacttc tacacgtcca tcactagagc acggtttgaa
901 gagctgtgtg cagacctatt tagaggcaca cttgagcccg tggaaaagtc tcttcgggat
961 gccaagatgg ataaggctaa aatccatgac attgttctag tagggggctc cacccgcatc
1021 ccaaaggtgc aaaaaatgct tcaggactac tttaatggac gggatctcaa caagagtatc
1081 aatcccgatg aggcagtcgc ctacggagct gcagtccagg gagctatttt aatgggcgac
1141 aaatctgaaa aagtacagga tttgcttttg ttggacgtag ctcccctgtc tctaggattg
1201 gagacagctg ggggtgtgat gactgtattg atcaagcgga actccaccat ccccaccaag
1261 cagacgaaga tcttcaccac ctactcggac aaccagcccg gggtattgat ccagtgtacg
1321 aggcaaacca tacccgacaa caacctggtg gggccctttg acttgactgg aatacctcct
1381 gcacctagcg tgccacagat cgaggtgacc ttcgacatcg cgcgcaacgg tatcctgaac
1441 gttacggcca tggacaagag caccggcaag gccaacaaga tcaccatcac caacgacaag
1501 ggtcgcctga gcaaggagga gattgactcg atcgagagcc ggactgctta caaacgtgag
1561 gatgagggtc agagggagaa aatcgctgcc aaaaatgcct tagaatcgta cgcctttaat
1621 atgaagagcg cttgtggtga tgagggtctg aaggacaaga tcacggagtc cgataaaaag
1681 aaaatactgg ataaatgcaa tgaggttcct ttcctgctga gtcaaaccca gctggctgag
1741 aaagtagagt ttgatcataa aagaaaagaa ctggaaaata tgtgtaatcc gatcatcaca
1801 aaactgtacc agagcggatg caccgggccc actgtacgcc agggtatact ccggcagggc
1861 tccacaggcc ctaccatcga ggaagtagat tagccttttc cagaattgca gggtgctagg
1921 gtgcctctag gcgaatttta ttcatcttca aacatcaata tgattcttga actgactgga
1981 ctcagcctac gtcaccagtc ctttggatcc cgatggagaa gctcgaacgc accttttcac
2041 acccccaccc gtcgtcagtc tatgatcctg aactggacct ttaggaaaac caggcccctc
2101 tttagagcca tttgagaatt tgatgtctgt atttattatc cacaccccac ctttctcctt
2161 cctgtgtgtt ggttatttgt ctctcaataa atttgttccc aaaggaaaca actctgcact
2221 tttattttag caggccttac tgatatgag
//GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS MUSGAS3A 1799 bp ss-mRNA ROD 26-FEB-1990
DEFINITION Mouse growth arrest specific (gas3) mRNA, complete cds.
ACCESSION M32240
KEYWORDS growth arrest specific protein.
SOURCE Mouse fibroblast cell line NIH-3T3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1799)
AUTHORS Manfioletti,G., Ruaro,M.E., Del Sal,G., Philipson,L. and
Schneider,C.
TITLE A growth arrest specific (gas) gene codes for a membrane protein
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Philipson, 15-FEB-1990.
FEATURES from to/span description
pept 187 621 growth arrest specific protein (gas3)
mRNA < 1 1799 gas3 mRNA
BASE COUNT 444 a 487 c 388 g 480 t
ORIGIN
1 ccggagcctc ccactgcccc cttgctttgc gcgcgcgtga cccgcagcac agctgtcttt
61 ggggacgcca gcaacccagt ggacgcaccg gagtttgtgc ctgaggctaa tctgctctga
121 gatagctgtc cctttgaact gaaacaggca ccgctcctct gatcccgagc ccaactccca
181 gccaccatgc tcctactctt gttggggatc ctgttcctgc acatcgcggt gctagtgttg
241 ctcttcgtct ccaccatcgt cagccaatgg ctcgtgggta atggacacac gactgatctc
301 tggcagaact gtaccacatc cgccttggga gccgtccaac actgctactc ctcatcagtg
361 agcgaatggc tgcagtctgt ccaggccacc atgatcctgt ctgtcatctt cagcgtcctg
421 gctctgttcc tgttcttctg ccagctcttc actctcacca aaggcggccg gttttacatc
481 actggattct tccaaatcct tgctggtctg tgcgtgatga gtgcagcggc catctacaca
541 gtgaggcaca gtgagtggca tgtcaacact gactactcct atggcttcgc tacatcctgg
601 cctgggtggc ctttccccta gccctcctca gtggtatcat ctatgtgatc ctgcggaaac
661 gcgaatgagg cgcccgacga cgcaccgtcc gtctaggctc tgagcgcgca tagggtccac
721 agggagggag gaaggaaacc agagaacaaa accaaccaac caaaaaagag ctagccccaa
781 acccaaacgc aagccaaacc aaacagaacg cagttgagtg gggattgctg ttgattgaag
841 atgtatataa tatctatggt ttataaaacc tatttataac actttttaca tatatgtaca
901 taggattgtt ttgcttttta tgttgaccgt cagcctcgtg ttgaatctta aacaacttta
961 catcctaaca ctataaccaa gctcagtatc tttgttttgt ttcgtttttt tttttaatct
1021 ttttgttttg ctcagacata aaaactccac gtggccccct ttcatctgaa agcagatacc
1081 tccctcccac tcaacctcat aggataacca aagtgtgggg acaaacccca gacagttgaa
1141 gacctttaca ctatgggtga cccagtgcat ttagcaggag tatccactgc ccgaatccat
1201 gtgtgaagcc ctaagcactc acagacgaaa agccctgacc ggaaccctct gcaaaaacag
1261 taatagctgg tggctcctga acacttgacc ctgtagacgg agtactgggg ccacacgttt
1321 aaatgagaag tcagagacaa gcaatctgtg aaatggtgct atagatttac cattccttgt
1381 tattactaat cgtttaaacc actcactgga aactcaatta acagttttat gcgatacagc
1441 agaatggaga cccgatacaa acggttcata actgctttca tacctagcta ggctgttgtt
1501 attactacaa taaataaatc tcaaagcctt cgtcagtccc acagttttct cacggtcgga
1561 gcatcaggac gagcatctag acccttggga ctagcgagtt ccctggcttt ctgggtctag
1621 agtgttctgt gcctccaagg actgtctggc gatgacttgt attggccacc aactgtagat
1681 gtatatacgg tgtccttctg atgctaagac tccagacctt tcttgttttt gcttgctttc
1741 tctgatttta taccaactgt gtggactaag atgcatcaaa ataaacatca gagtaactc
//GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS RATGFGDVA 630 bp ss-mRNA ROD 26-FEB-1990
DEFINITION Rat glioma-derived vascular endothelial cell growth factor mRNA,
complete cds.
ACCESSION M32167
KEYWORDS endothelial cell growth factor protein.
SOURCE Rat cell line GS-9L, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 630)
AUTHORS Conn,G., Bayne,M., Soderman,D., Kwok,P., Sullivan,K., Palisi,T.,
Hope,D. and Thomas,K.
TITLE Amino acid and cDNA sequences of a vascular endothelial cell
mitogen that is homologous to platelet-derived growth factor
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.L.Bayne, 16-FEB-1900.
FEATURES from to/span description
pept 5 577 endothelial cell growth factor protein
precursor
sigp 5 82 endothelial cell growth factor protein signal
peptide
matp 83 574 endothelial cell growth factor protein
BASE COUNT 169 a 163 c 172 g 126 t
ORIGIN
1 aaccatgaac tttctgctct cttgggtgca ctggaccctg gctttactgc tgtacctcca
61 ccatgccaag tggtcccagg ctgcacccac gacagaaggg gagcagaaag cccatgaagt
121 ggtgaagttc atggacgtct accagcgcag ctattgccgt ccgattgaga ccctggtgga
181 catcttccag gagtaccccg atgagataga gtatatcttc aagccgtcct gtgtgcccct
241 aatgcggtgt gcgggctgct gcaatgatga agccctggag tgcgtgccca cgtcggagag
301 caacgtcact atgcagatca tgcggatcaa acctcaccaa agccagcaca taggagagat
361 gagcttcctg cagcatagca gatgtgaatg cagaccaaag aaagatagaa caaagccaga
421 aaatcactgt gagccttgtt cagagcggag aaagcatttg tttgtccaag atccgcagac
481 gtgtaaatgt tcctgcaaaa acacagactc gcgttgcaag gcgaggcagc ttgagttaaa
541 cgaacgtact tgcagatgtg acaagccaag gcggtgagcc aggctgcagg aaggagcctc
601 ctcagggttt cgggaactag acctctcacc
//GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS HUMSHBGA 6087 bp ds-DNA PRI 31-JAN-1990
DEFINITION Human human sex hormone-binding globulin (SHBG) gene, complete cds.
ACCESSION M31651
KEYWORDS human sex hormone-binding globulin.
SOURCE Human adult testis DNA, clone hgSH!BG-6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 6087)
AUTHORS Hammond,G.L., Underhill,D.A., Rykse,H.M. and Smith,C.L.
TITLE The human sex hormone-binding globulin gene contains exons for
androgen-binding protein and two other testicular messenger RNAs
JOURNAL Mol. Endocrinol. 3, 1869-1876 (1989)
STANDARD simple staff_review
COMMENT A region of SHBGr-3 mRNA contains an inverted exon. However, the
paper does not state the boundaries. This exon is related to a
palindromic sequence "atcttggctcagtctccacctccaagat" located at
positions 4455-4482.
FEATURES from to/span description
pept 2643 2753 sex hormone-binding globulin, exon 1
2886 2977 sex hormone-binding globulin, exon 2
3150 3339 sex hormone-binding globulin, exon 3
3670 3831 sex hormone-binding globulin, exon 4
4059 4218 sex hormone-binding globulin, exon 5
4349 4485 sex hormone-binding globulin, exon 6
5218 5425 sex hormone-binding globulin, exon 7
5670 5818 sex hormone-binding globulin, exon 8
sigp 2643 2732 sex hormone-binding globulin signal peptide
matp 2733 2753 sex hormone-binding globulin
2886 2977 sex hormone-binding globulin
3150 3339 sex hormone-binding globulin
3670 3831 sex hormone-binding globulin
4059 4218 sex hormone-binding globulin
4349 4485 sex hormone-binding globulin
5218 5425 sex hormone-binding globulin
5670 5815 sex hormone-binding globulin
IVS 2754 2885 SHBG intron A
IVS 2978 3149 SHBG intron B
IVS 3340 3669 SHBG intron C
IVS 3832 4058 SHBG intron D
IVS 4219 4348 SHBG intron E
IVS 4486 5217 SHBG intron F
IVS 5426 5669 SHBG intron G
mRNA 656 751 SHBG-2 mRNA, exon 1 (5' end unsure)
< 2643 2753 SHBG-2 mRNA, exon 2
2886 2977 SHBG-2 mRNA, exon 3
3150 3339 SHBG-2 mRNA, exon 4
3670 3831 SHBG-2 mRNA, exon 5
4059 4218 SHBG-2 mRNA, exon 6
4349 4485 SHBG-2 mRNA, exon 7
5670 > 5818 SHBG-2 mRNA, exon 8
mRNA 540 596 SHBG-3 mRNA, exon 1
rpt 835 1223 Alu repeat
rpt 1359 2002 Alu repeat
rpt 4619 4928 Alu repeat
BASE COUNT 1400 a 1632 c 1640 g 1415 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcggct agctcctaag gcgtgggtac ggaagctaga ttagagcaga agggccccgc
61 tgctccccga gcaggttccc aaggcgagcc cctccccctg cccccgcctc ctacgacccc
121 gctctggccg cgccactctg acccccgggt taccggcctg cagtcttcac ccgaatcagc
181 ctcaggatat ctccacagtc tccctccttg gcctctcgga tccgcacgga agccatccgg
241 atccccgctg tctgggacca aagtcccagg gcctcgcaaa cggcaactag accccttaaa
301 gggcctacgg acttggatcc tgaagagcct gagagagcgg ggtggcggga gtcggggggg
361 acggcggggt agccgcggcc tggtaagtgg agctgggatt ccggcgccgt acgggaggag
421 agagtaggcc agcgaggcga tcctctgtcc ggcatagccc caccccctcg aattctgtcg
481 cagcaggggg cacaactgtc agccaatcag cttggagaac aggcacggcc gcgtcccccc
541 caagccccac ccccgacagc tggatcttgt gactgggctc ctgggtagag ttcaaggttg
601 gagtgaagcg gcttccttgc ggttgtgtgg gtgtcccaac ctgggtcgag ataccccgcg
661 gttcaaaggc tcccccgcag tgctttttaa attgacatat gcagtgataa cctgctttag
721 cctcaggctc actcacccgc ccagaccctg ggtaagcctt aagaccctca gctctgaaag
781 ctgtttcctg cagctcttga gtagcatgaa gtgttacctc ttgggggcat ttgcattttt
841 aaatgtttta ttttatattt atttatttat ttatttttgg agatggtgta ttgctttgtc
901 gcccaggctg gggtgtagtg gcgcgatctc tgctcactgc agcctccacc tcccgagttc
961 aagcgatcct cctgtctcag cctccggagt agctggaact acagtcgcgc accggcacgc
1021 ccggctaatt tttttttctt ttttcttctc tttttttttg ttttaacgga ttctcactct
1081 gtcactcagg ctggagtgca gtggcgcgat ctcggctcac tgcaacctct ccctcctggg
1141 ttcaagcgat tctcctgcct caccctggag atagctggga ttacaggcat gggcaaccat
1201 acctggctaa tttttgtatt tttagtagag acggggtttc actatgttgg ccaggctggt
1261 ctggaactcc tgacctcaag tgatctgccc gcctcagcct tctaaagtgc ggggattaca
1321 gatgtgaccc accaagcccg gtctgtcatt tgcattttaa aatgggtcat ggggtgggca
1381 cagtggctca cacctgtaat cccagcattt tggggaggca gaggcaggcg gatcactgag
1441 atcaggaatt tgagaccagc ctgaccaaca tggtgaaacc cgtctctact aaaatacaaa
1501 attagacagg tgtggtggcg catgccttta ctcccagcta cacgggaggc tgagacagga
1561 gaatcgcttg aacctgggag gtggaggtta cagtgagccg agatcgtgcc attgcactcc
1621 agcctgggca acaaaagcga aactccatct caaaataaat aaataaaata aaatgcgtca
1681 gggagggtcg ggccttgtgg ctaatgcctg taatccaggt actttgggag gctgaggtgg
1741 gcggatcatt tcaggtcaga ggttcgagat cagcctgggc aacgtggtga aatccccgtc
1801 tctactaaaa acacaaaaaa atttgctggg cgtggtggtg cgtgcacctg tagtcccaac
1861 tactaaggag gctgagacag gaggatcgct tgaactcgag aggcagaggc agcagtgagc
1921 cgagatcact ccactgcact ccagcctggg tgatagagca agactctgtc taaaataaaa
1981 taaaataaaa taaaataaaa ttgggtcagg gagtgggtga tttctactgc tagactgttt
2041 aggccctgta ataaatggat aagggaagat aactgagagg cggggggcag gtcccttctt
2101 aatattcact gaatcataca cacagacaat accttcttgg gagacaggcc tcagaggctg
2161 ggaaaagact gggggaggag ttcagaccag atgccaggca ctgtgcctgc attttctcaa
2221 tgaaccctct ttcacagtca ccccgtaaag tattatttcc tcattttaca gcaaggacac
2281 tgaagcacaa aggtgaagtg acttggccca aggtcactca gggacagaaa tcttggagga
2341 cctagatcag gccctagagg aggagagggg agatggaata tcctctccca gttcagaaac
2401 tttctcggca gtggaggatg atagtggagg gactctgtcc ttcaccccat tgatccccag
2461 aggggtgata gctgagtctt gtgactgggc ccctgggcag gggtcaaggg tcagtgcccc
2521 tgtttccttt accccctcct ccccgggcaa cctttaaccc tccaccgccc acacgcaagg
2581 ctgcctgcct ctacacattc tcccaagagt tgtctgagcc gccgagtgga cagtggctga
2641 ttatggagag cagaggccca ctggctacct cgcgcctgct gctgttgctg ctgttgctac
2701 tactgcgtca cacccgccag ggatgggccc tgagacctgt tctccccacc caggtgcagg
2761 agcgggacag ggcactcagc tcatgcagtc ttcccttctc tcctctggcc ctgtagcagg
2821 gcctctccct ctgtctgtct ctgacatgtc cctactcagc tttgtttgtt ttctctttct
2881 gatagagtgc ccacgaccct ccggctgtcc acctcagcaa tggcccagga caagagccta
2941 tcgctgtcat gacctttgac ctcaccaaga tcacaaagta tggggttggc ctagcccttg
3001 acccagtccc ctggttctgc cctctctcca tcagctcttc tcttttccct gtcttccttt
3061 ccttatctgt gaacaccatc tcccccaaac ccacactggt tctcaaagga cacatgacat
3121 acacaatctt tccttctgtg tccttccaga acctcctcct cctttgaggt tcgaacctgg
3181 gacccagagg gagtgatttt ttatggggat accaacccta aggatgactg gtttatgctg
3241 ggacttcgag acggcaggcc tgagatccaa ctgcacaatc actgggccca gcttacggtg
3301 ggtgctggac cacggctgga tgatgggaga tggcaccagg taagctagct ctggtcctca
3361 ggggagggat gtctggagct ggtctgagga aagggaacaa aaccaagtta ttgggcatcc
3421 ctttaccact gtcatctcgt ttaatccaca cgaaccccca caaagtagct attcttggcc
3481 ccatcttttc tgatgggaat tctaaggctc agtcagtata taagtgacaa gagctgagtg
3541 acccaaggcc aaggatgcta gctgcttctt taaggcatgt tctttccact atagtactag
3601 gctgcctcac aggaaggtgg cagaaacaga tcccaggggc ctctgatttt gcttcccacc
3661 ttcctgcagg tggaagtcaa gatggagggg gactctgtgc tgctggaggt ggatggggag
3721 gaggtgctgc gcctgagaca ggtctctggg cccctgacca gcaaacgcca tcccatcatg
3781 aggattgcgc ttggggggct gctcttcccc gcttccaacc ttcggttgcc ggtaactaca
3841 ccccaggggt ggaaccctag ccaagacttg gtaaagcact gctgggtggc tggccgtggg
3901 aatctaagtc cacactttta gggagaaggg aagggttgag agctgcaagg gggaggccaa
3961 atgctcagag gggagtcaac tgagggcagg gaggtcggga ctgcgcctcc gatgccctga
4021 tttctacatc cccgtatctt atctctgtca cactccagct ggttcctgcc ctggatggct
4081 gcctgcgccg ggattcctgg ctggacaaac aggccgagat ctcagcatct gcccccacta
4141 gcctcagaag ctgtgatgta gaatcaaatc ccgggatatt tctccctcca gggactcagg
4201 cagaattcaa tctccgaggt agatttcctc ggagtctatt tttcccaccc tggccagctc
4261 agcctgcctc tgtccccctc taccactggc ccctttcctc cttgagaccc cagctttgag
4321 gcctcaggat aatcatttct ccccacagac attccccagc ctcatgcaga gccctgggcc
4381 ttctctttgg acctgggact caagcaggca gcaggctcag gccacctcct tgctcttggg
4441 acaccagaga acccatcttg gctcagtctc cacctccaag atcaagtaag ggacagtggg
4501 cattgcctgt attcagtgga gcctggagca atgaggaaga gggagtccaa catgtcaata
4561 ttaggaaggt ttccagccca gggaacataa caagactggc tccacagaat tgtttttcat
4621 taataattag ccaggcatgg tggtggtgct tgcctgtaat cccaggtgct ggaggccaag
4681 accagaggat cacttgaggc caggagtttg acaccagcct gggcaacata gcagagacct
4741 ctgtctaaaa aaaaaaaaaa attagccagg catggtagca catgtctgct gccctagcta
4801 tttaggagcc tgaggcagga ggttcacttg agcccaggag tttgaagctg cagtgagcta
4861 tgatgtgcca ctgcactctg acctgggcca cagtgagacc ctgtctcaaa aaataaaaat
4921 aaaaataagg cttatggatg gcactcaggt gggtggtagg ggcgagggac atatcttgaa
4981 gctccccaca gcaagcaaac agttttgact tagactgcat atttacttgg ggcaggtgtg
5041 gtttcaaaaa gggtcaagcc aaaaaaaatt ggggcaggat ttaagtggtg agaatggcca
5101 gtaggtggag gcatagcgaa gaggcagaat taaggcagct aggggtgagg ccacaggcag
5161 taggcccggc tcattcttcc ctctctctct accgtccctt tcccacacac tctgcagaag
5221 gtggtgttgt cttctgggtc ggggccaggg ctggatctgc ccctggtctt gggactccct
5281 cttcagctga agctgagtat gtccagggtg gtcttgagcc aagggtcgaa gatgaaggcc
5341 cttgccctgc ctcccttagg cctgctcccc tcccttaacc tctgggccaa gcctcaaggg
5401 cgtctcttcc tgggggcttt accaggtaag agagaatgat gttcaagttc atgagcacaa
5461 cattggaaac agctcaaggg aggcggcaca ttttgagggg aaggaaacct ctgggaggga
5521 agaagaatag gccacaagaa gaagatatgg gggcagtgga aggtagtgct tttgcaaact
5581 caggttggag gagtggaaaa gtggggagaa gattctggat ccgagccacc ttaatgctct
5641 aatgccacct ttgcactacc tccctctagg agaagactct tccacctctt tttgcctgaa
5701 tggcctttgg gcacaaggtc agaggctgga tgtggaccag gccctgaaca gaagccatga
5761 gatctggact cacagctgcc cccagagccc aggcaatggc actgacgctt cccattaaag
5821 ctccacctaa gaaccccctt tgaaagttac tgattattca tttattcaac aaatattcac
5881 tgtgcactag caatgtacca ggcactgtgc caagtattga gttgtcttaa tgagcaaaaa
5941 cactctggtt cctaccctct tggtgcccac agtcccatag ggaagcagac atccatcaaa
6001 ggctaactaa taagtggata gttggaagca ctgataaaga agaattggag agttgtgaaa
6061 acatggagac tggcgggcgt gtggctc
//GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS OWEPH18 1548 bp ds-DNA INV 23-FEB-1990
DEFINITION O.fusiformis CCPurine repeat sequence.
ACCESSION M32217
KEYWORDS .
SOURCE O.fusiformis DNA, clone PH18.
ORGANISM Owenia fusiformis
Eukaryota; Animalia; Metazoa; Annelida; Polychaeta; Oweniida;
Oweniidae.
REFERENCE 1 (bases 1 to 1548)
AUTHORS Bakalara,N., Collet,J., Planells,R., Thouveny,Y. and Fontes,M.
TITLE Presence in invertebrate genomes of sequences characterized by the
repetition of the triplet CCpurine
JOURNAL Biochem. Biophys. Res. Commun. 166, 66-73 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt 515 663 CCPu repeat sequence
BASE COUNT 424 a 395 c 230 g 499 t
ORIGIN
1 aaagcatttt tccaatcttc acgctcgagt gattttggat gatttatgga gtttgagact
61 ttttttcacc aaacggtcac tttattctta ctcctgagtc aatttagaga ctttttccaa
121 tcttcacgct cgagtgattt tggatgattt ctggagtttg agacattctt tcaccaaacg
181 atcacttgat tcttactccc aagtcattta agagtatttt ttcagtcgtc atggtttagt
241 gatttcggtt acgcttgatc aatttctgct aatttttggg agtttaatgc gttttcacaa
301 aatagacacc cacccactat acaaaataac atgttttaat aggcatctgt taatttaaca
361 taccactgat aaaaatataa aaccacacaa gataaaaaaa atgctaacat taacagtttt
421 acaagcgaaa cattttcgac caactaaagt gtatacttaa atgagcagct gaaccatttc
481 aatcaatcag attagattat tgcactcttt gaccccgccg ccaccaccac cgccgccgcc
541 accaccgcca ccaccaccgc cgccaccacc accaccgcca ccaccgccgc caccaccacc
601 accgccacca ccgccgccac caccaccacc gccaccacca ccaccgccgc caccaccgcc
661 gcctcggtgc cacaactatc gacaacaaca accccaacaa caataaaaat gtattttagt
721 gctatagtag tttattaata cacatatttt catttataca aattatttca tagtcccttt
781 tggatgaggg attcactgtc tgtgttgacg ttttccagga gggccagaat acaccataac
841 atccctcttt ttcttaggtt ttttaaaaag tcgtattcat cacatttcca ttggaggtgt
901 ggtcaaagga atcattgtca ctcatttgtc tgtaaacgtc tcttggtggc ctatcccgtt
961 cgccattttc tttctgctgc ttgtcaattt ctgccttggc tctctataaa cagcttcagt
1021 gggactgaga tgttgaacaa tcggtttcat ggactgatca cactgtgatc cacttgagtt
1081 ttcctcgttt accatatcca gcatgttcga tgactctgcc tttcacttgg gctcgtataa
1141 acttcttcca cacttctggg ttgtccacat aaggcactat cttaggcatt gttgcttctt
1201 ccttttctat taattatctg atgctaatgt ccaggtttcc ctttatatac tcatcaaaca
1261 ttcattcatg aaatctaact acgttcgtct tcttcttcca tctcacatag tgtatctgca
1321 tcttctaaat cacttccgtc aatagactct tcatcgttaa acatttcatc agcgatataa
1381 gaatatttgg ccaaagtttt tctgatggct ctcgacagag atatgtcttg atcaactaag
1441 aattcaattt cttctacaat ttctcgatgt aaatcagcat ctttgacttc aatgaataac
1501 ctcagaattt gaccataatc atttaaaaac tcagattgct gggataag
//GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS MUSIGHRH 345 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65D6-5,
partial cds.
ACCESSION M32034
KEYWORDS N-region; diversity exon; gamma-immunoglobulin;
immunoglobulin heavy chain; joining exon; processed gene;
variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 345)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 345 Ig H-chain V-D-J region (AA at 1)
recomb 279 280 V-region DNA end/N-region DNA start
recomb 282 283 N-region DNA end/D-region DNA start
recomb 300 301 D-region DNA end/N-region DNA start
recomb 303 304 N-region DNA end/J-region DNA start
BASE COUNT 93 a 78 c 90 g 84 t
ORIGIN Chromosome 12.
1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagccctg
241 acatctgagg actctgcagt ctatttctgt gcaagatctc attactatgg tggtagctac
301 gactttgact actggggcca aggcaccact ctcacagtct cctca
//
LOCUS MUSIGHRI 363 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65D6-7,
partial cds.
ACCESSION M32035
KEYWORDS N-region; diversity exon; gamma-immunoglobulin;
immunoglobulin heavy chain; joining exon; processed gene;
variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-7.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 363)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 363 Ig H-chain V-D-J region (AA at 1)
recomb 297 298 V-region DNA end/N-region DNA start
recomb 300 301 N-region DNA end/D-region DNA start
recomb 318 319 D-region DNA end/N-region DNA start
recomb 321 322 N-region DNA end/J-region DNA start
BASE COUNT 99 a 83 c 94 g 87 t
ORIGIN Chromosome 12.
1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac
241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatctcat
301 tactatggtg gtagctacga ctttgactac tggggccaag gcaccactct cacagtctcc
361 tca
//
LOCUS MUSIGHRJ 363 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65D6-3,
partial cds.
ACCESSION M32036
KEYWORDS N-region; diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 363)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 363 Ig H-chain V-D-J region (AA at 1)
recomb 297 298 V-region DNA end/N-region DNA start
recomb 300 301 N-region DNA end/D-region DNA start
recomb 318 319 D-region DNA end/N-region DNA start
recomb 321 322 N-region DNA end/J-region DNA start
BASE COUNT 99 a 84 c 93 g 87 t
ORIGIN Chromosome 12.
1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac
241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcccat
301 tactatggtg gtagctacta ctttgactac tggggccaag gcaccactct cacagtctcc
361 tca
//
LOCUS MUSIGHRK 363 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65I4-2,
partial cds.
ACCESSION M32037
KEYWORDS N-region; diversity exon; gamma-immunoglobulin;
immunoglobulin heavy chain; joining exon; processed gene;
variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65I4-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 363)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 363 Ig H-chain V-D-J region (AA at 1)
recomb 297 298 V-region DNA end/N-region DNA start
recomb 300 301 N-region DNA end/D-region DNA start
recomb 318 319 D-region DNA end/N-region DNA start
recomb 321 322 N-region DNA end/J-region DNA start
BASE COUNT 100 a 82 c 93 g 88 t
ORIGIN Chromosome 12.
1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac
241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatctaat
301 tactatggtg gtagctacta ctttgactac tggggccaag gcaccactct cacagtctcc
361 tca
//
LOCUS MUSIGHRL 363 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65J4-1,
partial cds.
ACCESSION M32038
KEYWORDS N-region; diversity exon; gamma-immunoglobulin;
immunoglobulin heavy chain; joining exon; processed gene;
variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65J4-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 363)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 363 Ig H-chain V-D-J region (AA at 1)
recomb 297 298 V-region DNA end/N-region DNA start
recomb 300 301 N-region DNA end/D-region DNA start
recomb 318 319 D-region DNA end/N-region DNA start
recomb 321 322 N-region DNA end/J-region DNA start
BASE COUNT 100 a 82 c 93 g 88 t
ORIGIN Chromosome 12.
1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac
241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatctaat
301 tactatggtg gtagctacta ctttgactac tggggccaag gcaccactct cacagtctcc
361 tca
//
LOCUS MUSIGKCRP 324 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65D6-5,
partial cds.
ACCESSION M32039
KEYWORDS N-region; immunoglobulin light chain; joining exon;
kappa-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 324 Ig kappa-chain V-J region (AA at 1)
recomb 285 286 V-region DNA end/N-region DNA start
recomb 288 289 N-region DNA end/J-region DNA start
BASE COUNT 96 a 80 c 72 g 76 t
ORIGIN Chromosome 6.
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
301 ggcaccaagc tggaaatcaa acgg
//
LOCUS MUSIGKCRQ 324 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65D6-7,
partial cds.
ACCESSION M32040
KEYWORDS N-region; immunoglobulin light chain; joining exon;
kappa-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-7.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 324 Ig kappa-chain V-J region (AA at 1)
recomb 285 286 V-region DNA end/N-region DNA start
recomb 288 289 N-region DNA end/J-region DNA start
BASE COUNT 96 a 80 c 71 g 76 t 1 others
ORIGIN Chromosome 6.
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
241 gaagatattn ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
301 ggcaccaagc tggaaatcaa acgg
//
LOCUS MUSIGKCRR 324 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65D6-3,
partial cds.
ACCESSION M32041
KEYWORDS N-region; immunoglobulin light chain; joining exon;
kappa-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 324 Ig kappa-chain V-J region (AA at 1)
recomb 285 286 V-region DNA end/N-region DNA start
recomb 288 289 N-region DNA end/J-region DNA start
BASE COUNT 96 a 80 c 72 g 76 t
ORIGIN Chromosome 6.
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
301 ggcaccaagc tggaaatcaa acgg
//
LOCUS MUSIGKCRS 324 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65I4-2,
partial cds.
ACCESSION M32042
KEYWORDS N-region; immunoglobulin light chain; joining exon;
kappa-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65I4-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 324 Ig kappa-chain V-J region (AA at 1)
recomb 285 286 V-region DNA end/N-region DNA start
recomb 288 289 N-region DNA end/J-region DNA start
BASE COUNT 96 a 80 c 72 g 76 t
ORIGIN Chromosome 6.
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
301 ggcaccaagc tggaaatcaa acgg
//
LOCUS MUSIGKCRT 324 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65J4-1,
partial cds.
ACCESSION M32043
KEYWORDS N-region; immunoglobulin light chain; joining exon;
kappa-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65J4-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
TITLE The amino acid residues at the V-H-D-J-H junctions affect the
affinity of anti-p-azophenylarsonate antibodies
JOURNAL J. Immunol. 143, 4090-4097 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 324 Ig kappa-chain V-J region (AA at 1)
recomb 285 286 V-region DNA end/N-region DNA start
recomb 288 289 N-region DNA end/J-region DNA start
BASE COUNT 95 a 80 c 72 g 76 t 1 others
ORIGIN Chromosome 6.
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
241 gaagatattg ccacttactt ttgccnacag ggtaatacgc ttcctcggac gttcggtgga
301 ggcaccaagc tggaaatcaa acgg
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS MUSIGHRM 374 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma BrM4,
partial cds.
ACCESSION M31953
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM4, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 374)
AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B.
TITLE Diversity in the available repertoire of murine antibodies reactive
with bromelain-treated isologous erythrocytes
JOURNAL J. Immunol. 143, 4044-4052 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 374 Ig mu-chain precursor V-D-J1-region (AA at 3)
sigp < 1 20 Ig mu-chain signal peptide
sigp 21 > 374 Ig mu-chain
recomb 314 315 V-region DNA end/D-region DNA start
recomb 323 324 D-region DNA end/J1-region DNA start
BASE COUNT 92 a 80 c 109 g 91 t 2 others
ORIGIN Chromosome 12.
1 ttttaaaaga tgtccagtgt gaagtgcagc tgttggagac tggaggaggc ttggtgcaac
61 ctggggggtc acggggactc tcttgtgaag gctcagggtt cacttttagt ggcttctgga
121 tgngctgggt tcgacagaca cctgggaaga ccctggagtg gattggagac attnattctg
181 atggcagtgc aataaactac gcaccatcca taaaggatcg attcactatc ttcagagaca
241 atgacaagag caccctgtac ctgcagatga gcaatgtgcg atcggaggac acagccacgt
301 atttctgtat gagatatggt aactactggt acttcgatgt ctggggcaca gggaccacgg
361 tcaccgtctc ctca
//
LOCUS MUSIGHRN 287 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma S12M1,
partial cds.
ACCESSION M31954
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma S12M1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 287)
AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B.
TITLE Diversity in the available repertoire of murine antibodies reactive
with bromelain-treated isologous erythrocytes
JOURNAL J. Immunol. 143, 4044-4052 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 287 Ig mu-chain V-D-J1-region (AA at 3)
recomb 227 228 V-region DNA end/D-region DNA start
recomb 236 237 D-region DNA end/J1-region DNA start
BASE COUNT 77 a 64 c 75 g 64 t 7 others
ORIGIN Chromosome 12.
1 aaggcnnagg gnncactttt agtggcttct ggatgagctg ggntcgacag acacctggga
61 agaccctgga gtggattgga gacattaatt ctgatggcag tgcaataaac tacgcaccat
121 ccataaagga tcgattcact atcttcagag acaatgacaa gagcaccctg tacctgcaga
181 tgagcaatgt gcgatcggag gacacagcca cgtatttctg tatgagatat ggtaactact
241 ngtacttcga tgtctggggc acagggacca cggtcaccgt ctcctna
//
LOCUS MUSIGHRO 366 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig active mu-chain mRNA V-D-J2 region, from hybridoma CH12,
partial cds.
ACCESSION M31956
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma CH12, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 366)
AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B.
TITLE Diversity in the available repertoire of murine antibodies reactive
with bromelain-treated isologous erythrocytes
JOURNAL J. Immunol. 143, 4044-4052 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 366 Ig mu-chain precursor V-D-J2-region (AA at 1)
matp 1 > 366 Ig mu-chain
recomb 294 295 V-region DNA end/D-region DNA start
recomb 321 322 D-region DNA end/J2-region DNA start
BASE COUNT 95 a 93 c 97 g 81 t
ORIGIN Chromosome 12.
1 caggtccaac tgcagcagcc tgggactgaa ctggtgaagc ctggggcttc agtgaagctg
61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg
121 cctggacaag gccttgagtg gattggaaat attaatccta gcaatggtgg tactaactac
181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aatcctccag cacagcctac
241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagagattac
301 tacggtagta gctgggggta ctactttgac tactggggcc aaggcaccac tctcacagtc
361 tcctca
//
LOCUS MUSIGHRP 366 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig active mu-chain mRNA V-D-J2 region, from hybridoma BrM1,
partial cds.
ACCESSION M31957
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM1., cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 366)
AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B.
TITLE Diversity in the available repertoire of murine antibodies reactive
with bromelain-treated isologous erythrocytes
JOURNAL J. Immunol. 143, 4044-4052 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 366 Ig mu-chain precursor V-D-J2 region (AA at 1)
matp 1 > 366 Ig mu-chain
recomb 294 295 V-region DNA end/D-region DNA start
recomb 321 322 D-region DNA end/J2-region DNA start
BASE COUNT 93 a 90 c 98 g 80 t 5 others
ORIGIN Chromosome 12.
1 caggtccaac tgcngcagcc tgggactgaa ctggtgaagc cttgggcttc agtgaagctg
61 tcctgcaagg cttctggcca caccttcaca agctantgga taagctgggt gaagnagaat
121 cctggacagg gccttgagtg gattggatnt attnatccga gaagtggtga tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca aatcctccag cacagcgtac
241 atggacgtcc gcagcctgac atctgaggac tctgcggtct attattgtgc aagaagctat
301 tactacggta gtagctacgg ggactttgac tactggggcc aaggcaccac tctcacagtc
361 tcctca
//
LOCUS MUSIGHRQ 384 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig active mu-chain mRNA V-D-J3 region, from hybridoma BrM8,
partial cds.
ACCESSION M31958
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM8, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 384)
AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B.
TITLE Diversity in the available repertoire of murine antibodies reactive
with bromelain-treated isologous erythrocytes
JOURNAL J. Immunol. 143, 4044-4052 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 384 Ig mu-chain precursor V-D-J3-region (AA at 1)
matp < 1 > 384 Ig mu-chain
recomb 297 298 V-region DNA end/D-region DNA start
recomb 339 340 D-region DNA end/J3-region DNA start
BASE COUNT 96 a 92 c 99 g 95 t 2 others
ORIGIN Chromosome 12.
1 gaggtccagc tgcaacaatc tggacctgag ctggtgaagc ctggggcttc agtgaagata
61 tcctgtaagg cttctggata cacgttcact gactactana tgaactgggt gaagcagagc
121 catggaaaga gccttgantg gattggagat attaatccta acaatggtgg tactagctac
181 aaccagaagt tcaagggcaa ggccacattg actgtagaca agtcctcctc cagcacagcc
241 tacatggtac gccgcagcct gacatctgag gactttgcag tctattactg tgcaagatcc
301 ctctactact tggggagctc tctctactat gattacggct ggtttgctta ctggggccaa
361 gggactctgg tcactgtctc tgca
//
LOCUS MUSIGHRR 331 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma BrM11,
partial cds.
ACCESSION M31959
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM11, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 331)
AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B.
TITLE Diversity in the available repertoire of murine antibodies reactive
with bromelain-treated isologous erythrocytes
JOURNAL J. Immunol. 143, 4044-4052 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 331 Ig mu-chain V-D-J1-region (AA at 2)
recomb 262 263 V-region DNA end/D-region DNA start
recomb 283 284 D-region DNA end/J1-region DNA start
BASE COUNT 79 a 84 c 87 g 77 t 4 others
ORIGIN Chromosome 12.
1 aatggagcct ggagggtccc ggaaactctc ctgtgcagcc tctggattca ctttcagtag
61 cattggaatg cactgggttc gtcaggctcc agagaagggg ctggagttgg tngcatacat
121 tagtagtggc agtagtacca tctactatgc agacacagng aagggccgat tcaccatctc
181 cagagacaat gccaagaaca ccctnttcct gcaaatgacc agtctaaggt ctgaggacac
241 ggccatgtat tactgtgcaa ggtgccctta tgattacgac tcttngtact tcgatgtctg
301 gggcacaggg accacggtca ccgtctcctc a
//
LOCUS MUSIGHRS 195 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma BrMs19,
partial cds.
ACCESSION M31960
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrMs19, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 195)
AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B.
TITLE Diversity in the available repertoire of murine antibodies reactive
with bromelain-treated isologous erythrocytes
JOURNAL J. Immunol. 143, 4044-4052 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 195 Ig mu-chain V-D-J1-region (AA at 1)
recomb 135 136 V-region DNA end/D-region DNA start
recomb 144 145 D-region DNA end/J1-region DNA start
BASE COUNT 50 a 56 c 41 g 40 t 8 others
ORIGIN Chromosome 12.
1 agtgngnnaa cnntctacaa cccatctctc cagagcccca tctccattac tagagaaacg
61 tcaaagaacc agtncttcct ccaattgaac tctgtgacca cagaggacac agccatgtat
121 tactgtgcag gagacagtgc tggctactgg tacttcgatg tctggggcac agggaccacg
181 gtcaccgtct ccnna
//
LOCUS MUSIGKCRU 321 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J4 region, from hybridoma BrM4,
partial cds.
ACCESSION M31955
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM4, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B.
TITLE Diversity in the available repertoire of murine antibodies reactive
with bromelain-treated isologous erythrocytes
JOURNAL J. Immunol. 143, 4044-4052 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 321 Ig kappa-chain precursor VJ4-region (AA at 1)
matp 1 > 321 Ig kappa-chain
recomb 285 286 V-region end/J4-region start
BASE COUNT 94 a 74 c 76 g 72 t 5 others
ORIGIN Chromosome 6.
1 gacatcaaga tgaccnagtc nccatcctcc atgtatgcat cgctgggaga gagagtcact
61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca
121 tggaaatctc ctaagaccct gatctatnat gcaacaagct tggcagatgg ggtcccatca
181 agattcagtg gcagtggatc tgggcaagat tnttctctaa ccatcagcag cctggagtct
241 gacgatacag caacttntta ctgtctacag catggtgaga gcccattcac gttcggctcg
301 gggacaaagt tggaaataaa a
//
LOCUS MUSIGKCRV 261 bp ss-mRNA ROD 20-FEB-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J2 region, from hybridoma S12M1,
partial cds.
ACCESSION M31961
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma S12M1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 261)
AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B.
TITLE Diversity in the available repertoire of murine antibodies reactive
with bromelain-treated isologous erythrocytes
JOURNAL J. Immunol. 143, 4044-4052 (1989)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 261 Ig kappa-chain VJ2-region (AA at 1)
recomb 225 226 V-region end/J2-region start
BASE COUNT 79 a 58 c 65 g 55 t 4 others
ORIGIN Chromosome 6.
1 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca
61 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca
121 aganncagtg gcagtggatc tgggcaagat tnttctctaa ccatcagcag cctggagtct
181 gacgatacag caacttatna ctgtctacag catggtgaga gcccgtacac gttcggaggg
241 gggaccaagc tggaaataaa a
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS ECOCMA 1007 bp ds-DNA BCT 15-MAR-1989
DEFINITION Plasmid pColBM-Cl139 from E.coli cma gene encoding colicin M
activity peptide, complete cds.
ACCESSION M16754
KEYWORDS colicin M activity.
SOURCE Plasmid pColBM-Cl139 DNA, clone pTO4.
ORGANISM Plasmid Colicin BM-Cl139
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1007)
AUTHORS Koeck,J., Oelschlaeger,T., Kamp,R.M. and Braun,V.
TITLE Primary structure of colicin M, an inhibitor of murein biosynthesis
JOURNAL J. Bacteriol. 169, 3358-3361 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 60 875 colicin M activity peptide
mRNA < 1 901 cma mRNA
BASE COUNT 283 a 227 c 198 g 299 t
ORIGIN NaeI site.
1 ggcggaggcg ttagccagcg gcgaactctg ttatcttgtt aacttataag gagttatgta
61 tggaaacctt aactgttcat gcaccatcac catcaactaa cttaccaagt tatggcaatg
121 gtgcattttc tctttcagca ccacatgtgc ctggtgctgg ccctctttta gtccaggttg
181 tttatagttt tttccagagt ccaaacatgt gtcttcaggc tttaactcaa cttgaggatt
241 acatcaaaaa acatggggcc agcaaccctc tcacattgca gatcatatcg acaaatattg
301 gttacttctg taacgccgac cgaaatctgg ttcttcaccc tggaataagc gtttatgacg
361 cttaccactt cgcaaaacca gcgccaagtc aatatgacta tcgctcaatg aatatgaaac
421 aaatgagcgg taatgtcact acaccaattg tggcgcttgc tcactattta tggggtaatg
481 gcgctgaaag gagcgttaat atcgccaaca ttggtcttaa aatttcccct atgaaaatta
541 atcagataaa agacattata aaatctggtg tagtaggcac attccctgtt tctacaaagt
601 tcacacatgc cactggtgat tataatgtta ttaccggtgc atatcttggt aatatcacac
661 tgaaaacaga aggtacttta actatctctg ccaatggctc ctggacttac aatggcgttg
721 ttcgttcata tgatgataaa tacgatttta acgccagcac tcaccgtggc attatcggag
781 agtcgctcac aaggctcggg gcgatgtttt ctggtaaaga gtaccagata ctgcttcctg
841 gtgaaattca cattaaagaa agtggtaagc gataatataa gggccggggt tccggctttt
901 tatgcagcct ttataacagt gtattagtca ttcgcatatt caacatatcc ctcctcacaa
961 tggtacatgc cataatacat tctcttaccc tccgggcctt cagcttt
//
LOCUS ECOCOLM 810 bp ds-DNA BCT 15-SEP-1989
DEFINITION Plasmid pColBM-Cl139 colicin M resistance (cmi) locus, complete
cds.
ACCESSION M17810
KEYWORDS cmi protein; colicin M resistance protein.
SOURCE Plasmid pColBM-Cl139 DNA, isolated from E.coli strain JM101.
ORGANISM Plasmid Colicin BM-Cl139
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 810)
AUTHORS Oelschlaeger,T. and Braun,V.
TITLE Sequence, expression, and localization of the immunity protein for
colicin M
JOURNAL J. Bacteriol. 169, 4765-4769 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 322 675 colicin M resistance protein
pept 337 675 colicin M resistance protein (alt.)
BASE COUNT 252 a 146 c 177 g 235 t
ORIGIN 7 bp upstream of PstI site.
1 ggctgcagag cctgatgggg cataagtcca tcagctcaaa cgaggtctac acgaaggtgt
61 ttgcactgga tgtggctgta cggcaccggg tgcagttttc gatgcctgag tccgatgcgg
121 tcacaatgct gaaaaacaga catgcataat aagtcacaat tatgaattgt gatttcttct
181 ataaaaaaga gaccactgca atatgtgatc tcttgtatta tttcataatt gttaaagcca
241 cttcacagta tgctcacatt gtactgatat attcgtaatg tttttttata tcgaatgaac
301 gacagaagtt gtggagattt tatgaaagta attagcatga aatttatttt tattttaacg
361 attattgctc ttgctgctgt ttttttctgg tctgaagata aaggtccggc atgctatcag
421 gtcagcgatg aacaggccag aacgtttgta aaaaatgatt acctgcaaag aatgaaacgc
481 tgggacaacg atgtacaact tcttggtaca gaaatcccga aaattacatg ggaaaagatt
541 gagagaagtt taacagatgt tgaagatgaa aaaacacttc ttgtcccatt taaagctgaa
601 ggcccggacg gtaagagaat gtattatggc atgtaccatt gtgaggaggg atatgttgaa
661 tatgcgaatg actaatacac tgttataaag gctgcataaa aagccggaac cccggccctt
721 atattatcgc ttaccacttt ctttaatgtg aatttcacca ggaagcagta tctggtactc
781 tttaccagaa aacatcgccc cgacggatcc
//
LOCUS ECOORID 1890 bp ds-DNA BCT 15-DEC-1989
DEFINITION E.coli plasmid pColBM-Cl139 D protein gene, complete cds, oriV
region, and ORF.
ACCESSION J03312
KEYWORDS D protein; ORF; oriV region.
SOURCE E.coli (strain K12 Cl139) plasmid pColBM-Cl139 DNA.
ORGANISM Plasmid Colicin BM-Cl139
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1890)
AUTHORS Thumm,G., Oelschlaeger,T. and Braun,V.
TITLE Plasmid pCoIBM-CI139 does not encode a colicin lysis protein but
contains sequences highly homologous to the D protein (resolvase)
and the oriV region of the miniF plasmid
JOURNAL Plasmid (1988) In press
STANDARD simple staff_review
COMMENT Draft entry and sequence for [1] kindly submitted by G.Thumm,
18-AUG-1988. Sequence [1] complemented.
FEATURES from to/span description
pept 1108 1890 D protein
ORF 654 1106 ORF8
site 257 576 oriV region
BASE COUNT 434 a 476 c 534 g 446 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcatct tttggccgtt tacgtctgtt ccgttatcct gatgatacga tgttctgcac
61 gttctgccgg gaagatgcag atgattcgct taaaagtatt atgacccatc tctgggagct
121 ggatgcagag atgacagatc ctgtcatagc tatgtttaat cacgtctgag tgccgtgagt
181 gatttctgtc ttttatgcaa cagtgccaag atattgtaat caaaaaaaag cattaatgca
241 ttttggacag taatctattt taattgatga catagaggca ttaatctttc tttttcttca
301 ggaagatccg aaaactcctg gtcacggatc ttcctctccc ccacacaacg ccacctcctg
361 taagcacaac atgtggtgcc ggattcagct gctgatgaca ctatatgttg tgtcatctcc
421 ctgacctgtg atgcgtcgcg caggggcgga aaacagcgat atgatgattt cctcggcgtg
481 gtacacttcc ggaaagttgt gatattccgg aaagtcggat ctgacggaaa cggctctccg
541 gtaatttaac ggcgtggtta tatggatgct tgttatcatg gtgatgatga taacggcatg
601 atgttatcag acggcgtgac ggtaagggca gtgatgatgg atgacgttat cgcatgaccg
661 tccctgcccg gaaaagaaaa aaggagtcac ccatgttttt tattgagaat gaaggtcagg
721 ctgtcgccgg aacggattac tggcagtctg tacaggcgca ggccggatat gtctacctca
781 gctggaatgc cggcgcagcc aggctgcttg tcccggatgc ggcaaaacat ttactcaggg
841 agatgcgggg ggctgagtac gtcatcatca gtaagggagc actgcatggc cgcgatgcgc
901 tggaactggt atttgaagac ggcagcgatg cgccgtttgt gatccacatg ctgagtgagc
961 agtgcgatcg cctgctcccc gaaaacaacc agggaggggg ttttgttgtc accgtctgga
1021 cgcgtggcgg taaccagctc cgttatccgg gaaagtaccg ggttgtggaa aacctgcccg
1081 acgtttcccc gtggagtgaa cactgatatg cagcacctgc cggcaccgat ccaccatgcc
1141 cgggatgctg ttcagcttcc tgttgccatc gattatccgg cagcgctggc actccgccag
1201 atgtcgatgg ttcatgatga actgcccaaa tacctgctgg cccctgaagt gagcgccctg
1261 ctccattacg tcccggatct gcgccgcaag atgctgctgg ccacactgtg gaacaccggt
1321 gcgcgcatta atgaagcact ggcgctgacg cggggggatt tttcgctcac gcctccgtat
1381 ccgtttgtgc agctggccac tctgaagcag cggacagaaa aagccgccag gacggcagga
1441 agaatgcccg ccggtcagca gactcaccgg ctggttccgc tctccgactc ctggtacgtc
1501 agccagctgc agacgatggt agccacactg aaaatcccca tggaacggcg taataaacga
1561 acaggcagga cagagaaagc gcggatctgg gaagtgacgg acagaacggt caggacctgg
1621 attggggagg cggttgccgc cgctgccgct gatggtgtga cgttctctgt cccggtcacg
1681 ccacatacgt tccgccattc ctatgcgatg cacatgctgt atgccggtat accgcttaag
1741 gttctgcaga gtctgatggg gcataagtcc atcagctcaa cagaggtcta cacgaaggtg
1801 tttgcactgg atgtggctgc acggcaccgg gtgcagtttt cgatgcctga gtccgatgcg
1861 gtcacaatgc tgaaaaacag acatgcataa
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS CEC38P 1455 bp ds-DNA BCT 31-AUG-1987
DEFINITION Plasmid ColE3-CA38 colicinogenic region containing colicin E3
(colE3), immunity (immE3), and putative hic and immE8 genes.
ACCESSION J01574 J01575 M14038
KEYWORDS colicin; colicin release protein; immune response gene;
lysis protein; unidentified reading frame.
SOURCE Plasmid ColE3-CA38 (from E.coli) DNA.
ORGANISM Plasmid Colicin E3-CA38
Prokaryota; Bacteria.
REFERENCE 1 (bases 52 to 651)
AUTHORS Masaki,H. and Ohta,T.
TITLE A plasmid region encoding the active fragment and the inhibitor
protein of colicin E3-CA38
JOURNAL FEBS Lett. 149, 129-132 (1982)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 651)
AUTHORS Mock,M., Miyada,C.G. and Gunsalus,R.P.
TITLE Nucleotide sequence for the catalytic domain of colicin E3 and its
immunity protein. Evidence for a third gene overlapping colicin
JOURNAL Nucleic Acids Res. 11, 3547-3557 (1983)
STANDARD full staff_review
REFERENCE 3 (bases 640 to 1455)
AUTHORS Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P.
TITLE Characterization and nucleotide sequence of a colicin-release gene
in the hic region of plasmid ColE3-CA38
JOURNAL Gene 29, 175-184 (1984)
STANDARD full staff_review
REFERENCE 4 (bases 640 to 1455)
AUTHORS Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P.
TITLE Corrigenda: Characterization and nucleotide sequence of a
colicin-release gene in the hic region of plasmid ColE3-CA38
JOURNAL Gene 42, 351-355 (1986)
STANDARD full staff_review
COMMENT There are three ORFs distal to the immunity gene (immE3) in plasmid
ColE3-CA38. ORF1 (bases 803 to 1066) is homologous to the
E2-immunity gene in plasmid ColE2-P9. [3] has tentatively assigned
ORF1 as the immE8 gene, but points out that a gene product has not
yet been identified. Through deletion mutation studies this region
was shown to be non-essential for colicin release.
There are two overlapping reading frames further downstream of ORF1
(bases 1070 to 1246 and 1128 to 1421), which are homologous to the
H' and H genes in plasmid CloDF13. In CloDF13 the H gene has been
shown to be the lysis gene. Because of gene homology and in vitro
studies which show that the ORF3 region is necessary for cell
lysis, [3] has identified ORF3 as the hic gene. However, the in
vitro studies do not exclude the possibility that ORF2 functions in
colicin release or that it contributes to the Hic phenotype. There
is also the possibility that the hic gene initiates at bp 1095
rather than 1128. The hic gene product also has not been
identified.
The immE8 and hic genes have been experimentally localized to the
regions annotated in the Features Table. Their coding regions were
deduced by finding the open reading frames and comparing them with
sequences of genes in plasmids with like phenotypes [3].
There are six inverted repeats in the hic region. IR-2, IR-3, and
IR-5 (positions 762-792, 964-984, 1131-1157) are "a" + "t" rich and
show resemblance to SOS boxes. IR-1 an IR-4 (673-708 and 1072-1107)
show attenuator-like structure. They may attenuate transcription of
the hic gene after SOS induction to a level more optimal for
colicin release. IR-6 (1287-1332) has a terminator-like structure.
A Shine-Delgarno sequence is present at positions 376-379 between
the colE3 and immE3 genes. The putative ribosome binding site for
the hic gene can be found at 1117-1122.
[1] refers to colicin E3 as protein A and the immunity protein as
protein B. [2] refers to the colE3 gene as the ceaC gene and the
immE3 gene as the ceaC gene.
Draft entry and clean copy sequence kindly provide by R.J.Watson,
May 1985 [3].
FEATURES from to/span description
pept < 1 375 colicin E3 (AA at 1)
pept 385 642 immunity protein-E3
pept 803 1066 immE8 protein (putative; gtg start codon)
pept 1128 1271 lysis protein (putative)
revision 755 756 gc in [4]; cg in [3]
revision 1265 1267 gca in [4]; ga in [3]
revision 1348 1349 tt in [4]; ttt in [3]
BASE COUNT 473 a 214 c 352 g 416 t
ORIGIN 150 bp upstream of Sau3A site.
1 gctatggaaa gcaggaagaa gaaagaagat aagaaaagga gtgctgaaaa taatttaaac
61 gatgaaaaga ataagcccag aaaaggtttt aaagattacg ggcatgatta tcatccagct
121 ccgaaaactg agaatattaa agggcttggt gatcttaagc ctgggatacc aaaaacacca
181 aagcagaatg gtggtggaaa acgcaagcgc tggactggag ataaagggcg taagatttat
241 gagtgggatt ctcagcatgg tgagcttgag gggtatcgtg ccagtgatgg tcagcatctt
301 ggctcatttg accctaaaac aggcaatcag ttgaaaggtc cagatccgaa acgaaatatc
361 aagaaatatc tttgagagga agttatggga cttaaattgg atttaacttg gtttgataaa
421 agtacagaag attttaaggg tgaggagtat tcaaaagatt ttggagatga cggttcagtt
481 atggaaagtc taggtgtgcc ttttaaggat aatgttaata acggttgctt tgatgttata
541 gctgaatggg tacctttgct acaaccatac tttaatcatc aaattgatat ttccgataat
601 gagtattttg tttcgtttga ttatcgtgat ggtgattggt gatcaaatat tatcagggat
661 gagttgatat acgggcttct agtgttcatg gatgaacgct ggagcctcca aatgtagaaa
721 tgttatattt tttattgagt tcttggttat aattgctccg caatgattta aataagcatt
781 atttaaaaca ttctcaggag aggtgaaggt ggagctaaaa aaaagtattg gtgattacac
841 tgaaaccgaa ttcaaaaaat ttattgaaga catcatcaat tgtgaaggtg atgaaaaaaa
901 acaggatgat aacctcgagt attttataaa tgttactgag catcctagtg gttctgatct
961 gatttattac ccagaaggta ataatgatgg tagccctgaa ggtgttatta aagagattaa
1021 agaatggcga gccgctaacg gtaagtcagg atttaaacag ggctgaaata tgaatgccgg
1081 ttgtttatgg atgaatggct ggcattcttt cacaacaagg agtcgttatg aaaaaaataa
1141 cagggattat tttattgctt cttgcagtca ttattctgtc tgcatgtcag gcaaactata
1201 tccgggatgt tcagggcggg accgtatctc cgtcatcaac agctgaagtg accggattag
1261 caacgcagta acccgaaatc ctctttgaca aaaacaaagc gtgtcaggct gattctgatg
1321 cgcttttttt ttgaaatgtc acaaaaattc catgtgggag atgggatcta aaatcctcgt
1381 gcagaacttt ccatccaggg ggagaaaact tgtcgttttg agccgttcgg tgttcagaac
1441 gcacgaaacc gatcg
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS BLCNNS 961 bp ss-RNA VRL 15-MAR-1990
DEFINITION Bunyamwera virus small RNA segment, N and NSs protein.
ACCESSION D00353
KEYWORDS N protein; NSs protein; SRNA; nonstructural protein;
nucleocapsid protein; small RNA.
SOURCE Bunyamwera virus, cDNA to viral RNA, clones pBU[NS14,N3/59,N308,
N309,N93].
ORGANISM Bunyamwera virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Bunyaviridae; Bunyavirus.
REFERENCE 1 (bases 1 to 961)
AUTHORS Elliott,R.M.
TITLE Nucleotide sequence analysis of the small(S) RNA segment of
Bunyamwera virus, the prototype of the family Bunyaviridae
JOURNAL J. Gen. Virol. 70, 1281-1285 (1989)
STANDARD full staff_entry
COMMENT Submitted in computer readable form by R.M. Elliott on 18-Jan-1989.
The virus contains the negative sense strand; the positive strand
is shown below.
FEATURES from to/span description
pept 86 787 N protein
pept 105 410 NSs protein
BASE COUNT 298 a 187 c 215 g 261 t
ORIGIN
1 agtagtgtac tccacactac aaacttgcta ttgttgaaaa tcgctgtgct attaaatcca
61 acagaaggtc attaaaggct ctttaatgat tgagttggaa tttcatgatg tcgctgctaa
121 caccagcagt acttttgacc cagaggtcgc atacgctaac tttaagcgtg tccacaccac
181 tgggcttagt tatgaccaca tacgaatctt ctacattaaa ggacgcgaga ttaaaactag
241 tctcgcaaaa agaagtgaat gggaagttac acttaacctt gggggctgga agattactgt
301 atataatacg aattttcctg gcaaccggaa caacccagtt cctgacgatg gtcttaccct
361 ccaccgcctc agtggattcc ttgccaggta cctacttgag aagatgctga aagtcagtga
421 accagagaaa ttgattatta aatcaaaaat aatcaaccct ttggctgaaa agaatgggat
481 cacttggaat gatggagagg aagtttatct ctctttcttc ccaggatcag agatgttctt
541 aggaactttc agattctacc ccttagcaat cgggatctac aaagttcagc gcaaggaaat
601 ggaaccaaaa taccttgaga aaacaatgcg gcagaggtac atgggactag aagcagcaac
661 ttggactgtt agtaaattga cagaagttca gtctgcactg acagttgtct ctagcttagg
721 ttggaagaaa accaatgtta gtgcagctgc cagggacttc cttgctaaat tcggaatcaa
781 catgtaagca gggatgcatt tttaatcggg ctaaagtcat ctgttttaat ttggctaaaa
841 gggttgtttc aacccacaaa ataacagctg cttgggtggg tggttgggga cagaaagaca
901 gcgggctaaa tcaacattat attgttaatg gtattttaag ttttaggtgg agcacactac
961 t
//
LOCUS DEN2NGC 2357 bp ss-RNA VRL 15-MAR-1990
DEFINITION Dengue virus type 2 (New Guinea C strain), cDNA to genomic RNA.
ACCESSION D00346
KEYWORDS E protein; M protein; prM protein; structural protein.
SOURCE Dengue virus type 2 (New Guinea C strain), cDNA to genomic RNA.
ORGANISM Dengue virus type 2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Flaviviridae; Flavivirus (arbovirus group B).
REFERENCE 1 (bases 1 to 2357)
AUTHORS Gruenberg,A., Woo,W.S., Biedrzycka,A. and Wright,P.J.
TITLE Partial Nucleotide Sequence and Deduced Amino Acid Sequence of the
Structural Proteins of Dengue Virus Type 2, New Guinea C and
PUO-218 Strains;
JOURNAL J. Gen. Virol. 69, 1391-1398 (1988)
STANDARD simple staff_review
COMMENT Nucleotide 1 in the NGC sequence corresponds to nucleotide 77
counting from the 5'end of the DEN-2(JAM) sequence.
FEATURES from to/span description
pept 21 > 2357 viral polyprotein
matp 21 362 C protein
matp 363 860 prM protein
matp 636 860 M protein
matp 861 2345 E protein
matp 2346 2357 NS 1 protein (amino end)
BASE COUNT 782 a 471 c 595 g 509 t
ORIGIN 20bp upstream from the C protein amino terminal end
1 aattagagag cagatctctg atgaataacc aacgaaaaaa ggcgagaaat acgcctttca
61 atatgctgaa acgcgagaga aaccgcgtgt cgactgtaca acagctgaca aagagattct
121 cacttggaat gctgcaggga cgaggaccat taaaactgtt catggccctg gtggcgttcc
181 ttcgtttcct aacaatccca ccaacagcag ggatactgaa gagatgggga acaattaaaa
241 aatcaaaagc cattaatgtt ttgagagggt tcaggaaaga gattggaagg atgctgaaca
301 tcttgaacag gagacgcaga actgcaggca tgatcattat gctgattcca acagtgatgg
361 cgttccattt aaccacacgt aacggagaac cacacatgat cgtcagtaga caagagaaag
421 ggaaaagtct tctgtttaaa acagaggatg gtgtgaacat gtgtaccctc atggccatgg
481 accttggtga attgtgtgaa gatacaatca cgtacaagtg tccttttctc aggcagaatg
541 aaccagaaga catagattgt tggtgcaact ctacgtccac atgggtaact tatgggacgt
601 gtaccaccac aggagaacac agaagagaaa aaagatcagt ggcactcgtt ccacatgtgg
661 gaatgggact ggagacacga actgaaacat ggatgtcatc agaaggggcc tggaaacatg
721 cccagagaat tgaaacttgg atcttgagac atccaggctt taccataatg gcagcaatcc
781 tggcatacac cataggaacg acacatttcc aaagagccct gattttcatc ttactgacag
841 ctgtcgctcc ttcaatgaca atgcgttgca taggaatatc aaatagagac tttgtagaag
901 gggtttcagg aggaagctgg gttgacatag tcttagaaca tggaagctgt gtgacgacga
961 tggcaaaaaa caaaccaaca ttggattttg aactgataaa aacagaagcc aaacaacctg
1021 ccactctaag gaagtactgt atagaggcaa agctgaccaa cacaacaaca gattctcgct
1081 gcccaacaca aggagaaccc agcctaaatg aagagcagga caaaaggttc gtctgcaaac
1141 actccatggt ggacagagga tggggaaatg gatgtggatt atttggaaaa ggaggcattg
1201 tgacctgtgc tatgttcaca tgcaaaaaga acatgaaagg aaaagtcgtg caaccagaaa
1261 acttggaata caccattgtg ataacacctc actcagggga agagcatgca gtcggaaatg
1321 acacaggaaa acatggcaag gaaatcaaaa taacaccaca gagttccatc acagaagcag
1381 agttgacagg ctatggcact gtcacgatgg agtgctctcc gagaacgggc ctcgacttca
1441 atgagatggt gttgctgcaa atggaaaata aagcttggct ggtgcacagg caatggttcc
1501 tagacctgcc gttgccatgg ctgcccggag cggacacaca aggatcaaat tggatacaga
1561 aagagacatt ggtgactttc aaaaatcccc atgcgaagaa acaggatgtt gttgttttgg
1621 gatcccaaga aggggccatg cacacagcac tcacaggggc cacagaaatc cagatgtcat
1681 caggaaactt actgttcaca ggacatctca agtgcaggct gaggatggac aaactacagc
1741 tcaaaggaat gtcatactct atgtgcacag gaaagtttaa agttgtgaag gaaatagcag
1801 aaacacaaca tggaacaata gttatcagag tacaatatga aggggacggt tctccatgta
1861 agatcccttt tgagataatg gatttggaaa aaagacatgt tttaggtcgc ctgattacag
1921 tcaacccaat cgtaacagaa aaagatagcc cagtcaacat agaagcagaa cctccattcg
1981 gagacagcta catcatcata ggagtagagc cgggacaatt gaagctcaac tggtttaaga
2041 aaggaagttc tatcggccaa atgattgaga caacaatgag gggagcgaag agaatggcca
2101 ttttaggtga cacagcttgg gattttggat ccctgggagg agtgtttaca tctataggaa
2161 aggctctcca ccaagttttc ggagcaatct atggggctgc cttcagtggg gtctcatgga
2221 ctatgaaaat cctcatagga gtcattatca catggatagg aatgaattca cgcagcacct
2281 cactttctgt gtcactagta ttggtgggag tcgtgacgct gtatttggga gttatggtgc
2341 aggccgatag tggttgc
//
LOCUS HS1IRLULR 13052 bp ds-DNA VRL 15-MAR-1990
DEFINITION Herpes simplex virus type 1 (HSV-1) genome, rightmost part of the
long unique region (UL) and all of the internal long repeat region
(IRL).
ACCESSION D00374
KEYWORDS IE110; IE63; UL54; UL55; UL56; immediate-early protein;
internal long repeat region; long repeat region;
long unique region; transcriptional activator;
transcriptional modulating protein.
SOURCE HSV-1 (strain 17) DNA, clones BamHI b, XhoI c, BamHI k, HpaI s plus
v.
ORGANISM Herpes simplex virus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 13052)
AUTHORS Perry,L.J. and McGeoch,D.J.
TITLE The DNA sequences of the long repeat region and adjoining parts of
the long unique region in the genome of herpes simplex virus type 1
JOURNAL J. Gen. Virol. 69, 2831-2846 (1988)
STANDARD full staff_entry
COMMENT There were two small divergences within the two versions of the UL
proximal part of RL (discussed in [1]).
FEATURES from to/span description
pept 413 1951 immediate-early transcriptional modulating
protein IE63 (gene UL54)
ORF 2175 2735 ORF of gene UL55
ORF 3602 3009 (c) ORF of gene UL56
pept 10787 10731 (c) IE110 exon 1
9965 9299 (c) IE110 exon 2
9162 7559 (c) IE110 exon 3
mRNA 275 1974 IE63 mRNA
pre-msg 10935 7350 (c) IE110 mRNA and introns
IVS 10730 9967 (c) IE110 intron 1
IVS 9298 9163 (c) IE110 intron 2
rpt 3837 4017 reiteration set 1
rpt 4224 4244 reiteration set 2
rpt 4465 4496 reiteration set 3
rpt 7170 7317 reiteration set 4
rpt 10422 10583 reiteration set 5
rpt 12007 12060 reiteration set 6
rpt 12730 12952 reiteration set 7
refnumbr 1 1 numbered 113322 in [1]
signal 247 251 TATA box
signal 3792 3786 (c) TATA box
signal 10962 10958 (c) TATA box
signal 1956 1961 polyadenylation signal
signal 2777 2782 polyadenylation signal
signal 2880 2875 (c) polyadenylation signal
signal 2884 2879 (c) polyadenylation signal
signal 7372 7367 (c) polyadenylation signal
signal 7412 7407 (c) polyadenylation signal
variant 1055 1062 eight c residues in HpaI s plus v clone; seven
c residues in BamHI b clone
site 3836 3836 end of UL
BASE COUNT 1933 a 4879 c 4243 g 1997 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcccaac gaccccgccc atgggtccca attggccgtc ccgttaccaa gaccaaccca
61 gccagcgtat ccacccccgc ccgggtcccc gcggaagcgg aacggggtat gtgatatgct
121 aattaaatac atgccacgta cttatggtgt ctgattggtc cttgtctgtg ccggaggtgg
181 ggcgggggcc ccgcccgggg ggcggaacga ggaggggttt gggagagccg gccccggcac
241 cacgggtata aggacatcca ccacccggcc ggtggtggtg tgcagccgtg ttccaaccac
301 ggtcacgctt cggtgcctct ccccgattcg ggcccggtcg ctcgctaccg gtgcgccacc
361 accagaggcc atatccgaca ccccagcccc gacggcagcc gacagcccgg tcatggcgac
421 tgacattgat atgctaattg acctcggcct ggacctctcc gacagcgatc tggacgagga
481 cccccccgag ccggcggaga gccgccgcga cgacctggaa tcggacagca gcggggagtg
541 ttcctcgtcg gacgaggaca tggaagaccc ccacggagag gacggaccgg agccgatact
601 cgacgccgct cgcccggcgg tccgcccgtc tcgtccagaa gaccccggcg tacccagcac
661 ccagacgcct cgtccgacgg agcggcaggg ccccaacgat cctcaaccag cgccccacag
721 tgtgtggtcg cgcctcgggg cccggcgacc gtcttgctcc cccgagcagc acgggggcaa
781 ggtggcccgc ctccaacccc caccgaccaa agcccagcct gcccgcggcg gacgccgtgg
841 gcgtcgcagg ggtcggggtc gcggtggtcc cggggctgcc gatggtttgt cggacccccg
901 ccggcgtgcc cccagaacca atcgcaaccc tgggggaccc cgccccgggg cggggtggac
961 ggacggcccc ggcgcccccc atggcgaggc gtggcgcggc agtgagcagc ccgacccacc
1021 cggaggccag cggacacggg gcgtgcgcca agcacccccc ccgctaatga cgctggcgat
1081 tgcccccccg cccgcggacc cccgcgcccc ggccccggag cgaaaggcgc ccgccgccga
1141 caccatcgac gccaccacgc ggttggtcct gcgctccatc tccgagcgcg cggcggtcga
1201 ccgcatcagc gagagctttg gccgcagcgc acaggtcatg cacgacccct ttggggggca
1261 gccgtttccc gccgcgaata gcccctgggc cccggtgctg gcgggccaag gagggccctt
1321 tgacgccgag accagacggg tctcctggga aaccttggtc gcccacggcc cgagcctcta
1381 tcgcactttt gccggcaatc ctcgggccgc atcgaccgcc aaggccatgc gcgactgcgt
1441 gctgcgccaa gaaaatttca tcgaggcgct ggcctccgcc gacgagacgc tggcgtggtg
1501 caagatgtgc atccaccaca acctgccgct gcgcccccag gaccccatta tcgggacgac
1561 cgcggctgtg ctggataacc tcgccacgcg cctgcggccc tttctccagt gctacctgaa
1621 ggcgcgaggc ctgtgcggcc tggacgaact gtgttcgcgg cggcgtctgg cggacattaa
1681 ggacattgca tccttcgtgt ttgtcattct ggccaggctc gccaaccgcg tcgagcgtgg
1741 cgtcgcggag atcgactacg cgacccttgg tgtcggggtc ggagagaaga tgcatttcta
1801 cctccccggg gcctgcatgg cgggcctgat cgaaatccta gacacgcacc gccaggagtg
1861 ttcgagtcgt gtctgcgagt tgacggccag tcacatcgtc gcccccccgt acgtgcacgg
1921 caaatatttt tattgcaact ccctgtttta ggtacaataa aaacaaaaca tttcaaacaa
1981 atcgcccctc gtgttgtcct tctttgctca tggccggcgg ggcgtgggtc acggcagatg
2041 gcgggggtgg gcccggcgta cggcctgggt gggcggaggg aactaaccca acgtataaat
2101 ccgtccccgt tccaaggccg gtgtcatagt gcccttagga gcttcccgcc cgggcgcatc
2161 cccccttttg cactatgaca gcgacccccc tcaccaacct gttcttacgg gccccggaca
2221 taacccacgt ggccccccct tactgcctca acgccacctg gcaggccgaa acggccatgc
2281 acaccagcaa aacggactcc gcttgcgtgg ccgtgcggag ttacctggtc cgcgcctcct
2341 gtgagaccag cggcacaatc cactgctttt tctttgcggt atacaaggac acccaccaca
2401 cccctccgct gattaccgag ctccgcaact ttgcggacct ggttaaccac ccgccggtcc
2461 tacgcgaact ggaggataag cgcggggtgc ggctgcggtg tgcgcggccg tttagcgtcg
2521 ggacgattaa ggacgtctct gggtccggcg cgtcctcggc gggagagtac acgataaacg
2581 ggatcgtgta ccactgccac tgtcggtatc cgttctcaaa aacatgctgg atgggggcct
2641 ccgcggccct acagcacctg cgctccatca gctccagcgg catggccgcc cgcgcggcag
2701 agcatcgacg cgtcaagatt aaaattaagg cgtgatctcc aaccccccca tgaatgtgtg
2761 taaccccccc caaaaaaata aagagccgta acccaaccaa accaggcgtg gtgtgagttt
2821 gtggacccaa agccctcaga gacaacgcga caggccagta tggaccgtga tacttttatt
2881 tattaactca caggggcgct taccgccaca ggaataccag aataatgacc accacaatcg
2941 cgaccacccc aaatacagca tggcgccaca ccacgccaca acagccctgt cgccggtatg
3001 gggcatgatc agacgagccg cgccgcgcgt tgggccctgt acagctcgcg cgaattgacc
3061 ctaggaggcc gccacgcgcc cgagttttgc gttcgtcgct ggtcgtcggg cgccaaagcc
3121 ccggacggct gttcggtcga acgaacggcc acgacagtgg cataggttgg ggggtggtcc
3181 gacatagcct cggcgtacgt cgggaggccc gacaagaggt cccttgtgat gtcgggtggg
3241 gccacaagcc tggtttccgg aagaaacagg ggggttgcca ataacccgcc agggccaaaa
3301 ctccggcgct gcgcacgtcg ttcggcgcgg cgccgggcgc gccgagcggc tcgctgggcg
3361 gcttggcgtg agcggccccg ctccgacgcc tcgccctctc cggaggaggt tggcggaatt
3421 ggcacggaca acaggggccc agcagagtac ggtggaggtg ggtccgtggg ggtgtccaga
3481 tcaataacga caaacggccc ctcgttccta ccagacaagc tatcgtaggg gggcggggga
3541 tcagcaaacg cgttccccgc gctccataaa cccgcgtcgg gttgcgccgc ctccgaagcc
3601 atggatgcgc cccaaagcca cgactcccgc gcgctaggtc cttggggtaa tggaaaaggc
3661 cctactcccc atccaagcca gccaagttaa cgggctacgc cttcgggaat gggactggca
3721 ccccggcgga ttttgttggg ctggcatgcg tcgcccaacc gagggccgcg tccacgggac
3781 gcgcctttta taaccccggg ggtcattccc aacgatcaca tgcaatctaa ctggctcccc
3841 tctccccccc tctcccctct ccccccctct cccctctccc cccctctccc ctctcccccc
3901 ctctcccctc tccccccctc tcccctctcc ccccctctcc cctctccccc cctctcccct
3961 ctccccccct ctcccctctc cccccctctc ccctctcccc ccctctcccc tctcccctct
4021 gctctttccc cgtgacaccc gacgctgggg gcgtggctgc cgggaggggc cgcggatggg
4081 cgggcctact tggtttcccg cccccccccc ccccccccga accgccccgc cggctttgcc
4141 cccctttgat cccctgctac ccccaacccg tgctggtggt gcgggttggg gggggatgtg
4201 ggcgggggtg cgcgggaggt gtcggtggtg gtggtggtgg tggtagtagg aatggtggtg
4261 aggggggggg ggcgctggtt ggtcaaaaaa gggagggacg ggggccggca gaccgacggc
4321 gacaacgctc cccggcggcc gggtcgcggc tcttacgagc ggcccggccc gcgctcccac
4381 cccccgggcc gtgtccttgc tttccccccg tctccccccc ccccgccttc tcctcctcct
4441 cctcgttttt ccaaaccccg cccacccggc ccggcccggc ccggcccggc ccggccaccg
4501 ccgcccaccc acccacctcg ggatacccag ccccggtccc ccgttccccg ggggccgtta
4561 tctccagcgc cccgtccggc gcgccgcccc ccgccgctaa accccatccc gcccccggga
4621 ccccacatat aagcccccag ccacacgcaa gaacagacac gcagaacggc tgtgtttatt
4681 taaataaacc aatgtcggaa taaacaaaca caaacacccg cgacgggggg acggagggga
4741 cggagggagg gggtgacggg ggacgggaac agacacaaaa acaaccacaa aaaacaacca
4801 cccaccgaca cccccacccc agtctcctcg ccttctccca cccaccccac gcccccactg
4861 agcccggtcg atcgacgagc acccccgccc acgcccccgc ccctgccccg gcgacccccg
4921 gcccgcacga tcccgacaac aataacaacc ccaacggaaa gcggcggggt gttgggggag
4981 gcgaggaaca accgagggga acgggggatg gaaggacggg aagtggaagt cctgataccc
5041 atcctacacc cccctgcctt ccaccctccg gccccccgcg agtccacccg ccggccggct
5101 accgagaccg aacacggcgg ccgccgcagc cgccgcagcc gccgccgaca ccgcagagcc
5161 ggcgcgcgca ctcacaagcg gcagaggcag aaaggcccag agtcattgtt tatgtggccg
5221 cgggccagca gacggcccgc gacacccccc ccccgcccgt gtgggtatcc ggccccccgc
5281 cccgcgccgg tccattaagg gcgcgcgtgc ccgcgagata tcaatccgtt aagtgctctg
5341 cagacagggg caccgcgccc ggaaatccat taggccgcag acgaggaaaa taaaattaca
5401 tcacctaccc acgtggtgct gtggcctgtt tttgctgcgt catctcagcc tttataaaag
5461 cgggggcgcg gccgtgccga tcgcgggtgg tgcgaaagac tttccgggcg cgtccgggtg
5521 ccgcggctct ccgggccccc ctgcagccgg ggcggccaag gggcgtcggc gacatcctcc
5581 ccctaagcgc cggccggccg ctggtctgtt ttttcgtttt ccccgtttcg ggggtggtgg
5641 gggttgcggt ttctgtttct ttaacccgtc tggggtgttt ttcgttccgt cgccggaatg
5701 tttcgttcgt ctgtcccctc acggggcgaa ggccgcgtac ggcccgggac gaggggcccc
5761 cgaccgcggc ggtccgggcc ccgtccggac ccgctcgccg gcacgcgacg cgaaaaaggc
5821 cccccggagg cttttccggg ttcccggccc ggggcctgag atgaacactc ggggttaccg
5881 ccaacggccg gcccccgtgg cggcccggcc cggggccccg gcggacccaa ggggccccgg
5941 cccggggccc cacaacggcc cggcgcatgc gctgtggttt ttttttcctc ggtgttctgc
6001 cgggctccat cgcctttcct gttctcgctt ctcccccccc ccttcttcac ccccagtacc
6061 ctcctccctc ccttcctccc ccgttatccc actcgtcgag ggcgccccgg tgtcgttcaa
6121 caaagacgcc gcgtttccag gtaggttaga cacctgcttc tccccaatag agggggggga
6181 cccaaacgac agggggcgcc ccagaggcta aggtcggcca cgccactcgc gggtgggctc
6241 gtgttacagc acaccagccc gttcttttcc ccccctccca cccttagtca gactctgtta
6301 cttacccgtc cgaccaccaa ctgccccctt atctaagggc cggctggaag accgccaggg
6361 ggtcggccgg tgtcgctgta accccccacg ccaatgaccc acgtactcca agaaggcatg
6421 tgtcccaccc cgcctgtgtt tttgtgcctg gctctctatg cttgggtctt actgcctggg
6481 gggggggagt gcgggggagg gggggtgtgg aaggaaatgc acggcgcgtg tgtacccccc
6541 ctaaagttgt tcctaaagcg aggatacgga ggagtggcgg gtgccggggg accggggtga
6601 tctctggcac gcgggggtgg gaagggtcgg gggagggggg gatggagtac cggcccacct
6661 ggccgcgcgg gtgcgcgtgc ctttgcacac caaccccacg tcccccggcg gtctctaaga
6721 agcaccgccc cccctccttc ataccaccga gcatgcctgg gtgtgggttg gtaaccaaca
6781 cgcccatccc ctcgtctcct gtgattctct ggctgcaccg cattcttgtt ttctaactat
6841 gttcctgttt ctgtctcccc cccccccacc cctccgcccc accccccaac acccacgtct
6901 gtggtgtggc cgaccccctt ttgggcgccc cgtcccgccc cgccacccct cccatccttt
6961 gttgccctat agtgtagtta accccccccg ccctttgtgg cggccagagg ccaggtcagt
7021 ccgggcgggc aggcgctcgc ggaaacttaa cacccacacc caacccactg tggttctggc
7081 tccatgccag tggcaggatg ctttcgggga tcggtggtca ggcagcccgg gccgcggctc
7141 tgtggttaac accagagcct gcccaacatg gcacccccac tcccacgcac ccccactccc
7201 acgcaccccc actcccacgc acccccactc ccacgcaccc ccactcccac gcacccccac
7261 tcccacgcac ccccactccc acgcaccccc actcccacgc acccccactc ccacgcatcc
7321 ccgcgataca tccaacacag acagggaaaa gatacaaaag taaaccttta tttcccaaca
7381 gacagcaaaa atcccctgag ttttttttta ttagggccaa cacaaaagac ccgctggtgt
7441 gtggtgcccg tgtctttcac ttttcccctc cccgacacgg attggctggt gtagtgggcg
7501 cggccagaga ccacccagcg cccgaccccc ccctccccac aaacacgggg ggcgtccctt
7561 attgttttcc ctcgtcccgg gtcgacgccc cctgctcccc ggaccacggg tgccgagacc
7621 gcaggctgcg gaagtccagg gcgcccacta gggtgccctg gtcgaacagc atgttcccca
7681 cgggggtcat ccagaggctg ttccactccg acgcgggggc cgtcgggtac tcggggggca
7741 tcacgtggtt acccgcggtc tcggggagca gggtgcggcg gctccagccg gggaccgcgg
7801 cccgcagccg ggtcgccatg tttcccgtct ggtccaccag gaccacgtac gccccgatgt
7861 tccccgtctc catgtccagg atgggcaggc agtcccccgt gatagtcttg ttcacgtaag
7921 gcgacagggc gaccacgcta gagacccccg agatgggcag gtagcgcgtg aggccgcccg
7981 cggggacggc cccggaagtc tccgcgtggc gcgtcttccg ggcacacttc ctcggccccc
8041 gcggcccaga agcagcgcgg gggccgaggg aggtttcctc ttgtctccct cccagggcac
8101 cgacggcccc gcccgaggag gcggaagcgg aggaggacgc ggccccggcg gcggaagagg
8161 cggcccccgc gggggtcggg gccgaggagg aagaggcaga ggaggaagag gcggaggccg
8221 ccgaggacgt caggggggtc ccgggcccac cctggccgcg cccccccggc cctgagtcgg
8281 agggggggtg cgtcgccgcc ctcttggccc ctgccggcgc gaggggggga cgcgtggact
8341 ggggggaggg gttttcctgg cccgacccgc gcctcttcct cggacgcacc gccgcctcct
8401 gctcgacaga ggcggcggag gggagcgggg cggcgccgga gggggcggcg ccgcgggagg
8461 gcccgtgccc accctccacg cccggccccc ccgagccgcg cgccaccgtc gcacgcgccc
8521 ggcacagact ctgttcttgg ttcgcggcct gagccaggga cgagtgcgac tggggcacac
8581 ggcgcgcgtc cgcggggcgg gcggccggct ccgccccggg ggccggggcg cgggggccgg
8641 gccccggagg cggcgctcgc acgcacgggg ccacggccgc gcgggggcgc gcgggtcccg
8701 acgcggccgc ggacgcgggg ggcccggggc ggggggcgga gcctggcatg ggcgccgcgg
8761 ggggcctgtg gggagaggcc gggggggagt cgctgatcac tatggggtct ctgttgtttg
8821 caaggggggc gggtctgttg acaagggggc ccgtccggcc cctcggccgc cccgcctccg
8881 cttcaacaac cccaacccca accccaaccc ccccggaggg gccagacgcc ccccgcggcg
8941 ccgcggctcg cgactggcgg gagccgccgc cgccgctgct gttggtggtg gtgttggtgt
9001 tactgctgcc gtgtggcccg atgggcgccg aggggggcgc tgtccgagcc gcggccggct
9061 ggggggctgc gtgagacgcc ccgcccgtca cggggggcgc ggcggcgcct ctgcgtgggg
9121 gggcgcgggg cgtccggcgg ggggcgggcg gtacgtagtc tgctgcaaga gacaacgggg
9181 ggcgcgatca ggttacgccc cctccccggc ccgccctttc ctcgcccgcc cgcctattcc
9241 tccctccccc cccctcctcc tcctcctccc ccagggtcct tgccgccccc cgcctcaccg
9301 tcgtccaggt cgtcgtcatc ctcgtccgtg gtgggctccg ggtgggtggg cgacagggcc
9361 ctcaccgtgt gcccccccag ggtcaggtac cgcggggcga accgctgatt gcccgtccag
9421 ataaagtcca cggccgtgcc cgccctgacg gcctcctcgg cctccatgcg ggtctggggg
9481 tcgttcacga tcgggatggt gctgaacgac ccgctgggcg tcacgcccac tatcaggtac
9541 accagcttgg cgttgcacag cgggcaggtg ttgcgcaatt gcatccaggt tttcatgcac
9601 gggatgcaga agcggtgcat gcacgggaag gtgtcgcagc gcaggtgggg cgcgatctca
9661 tccgtgcaca cggcgcacac gtcgccctcg tcgctccccc cgtcctctcg agggggggcg
9721 cccccgcaac tgccggggtc ttcctcgcgg ggggggctcc cccccgagac cgccccccca
9781 tccacgccct gcggccccag cagccccgtc tcgaacagtt ccgtgtccgt gctgtccgcc
9841 tcggaggcgg agtcgtcgtc atggtggtcg gcgtcccccc gcccccccac ttcggtctcc
9901 gcctcagagt cgctgctgtc cggcaggtct cggtcgcagg gaaacaccca gacatccggg
9961 gcgggctaag gggaaaaaag gggggcgggt aagaatgggg ggggatttcc cgcgtcaatc
10021 agcacccacg agttccccct ctcccccccc cgcctcacaa agtcctgccc ccctgctggc
10081 ctcggaagag gggggagaaa ggggtctgca accaaaggtg gtctgggtcc gtcctttgga
10141 tcccgacccc tcttcttccc tcttctcccg ccctccagac gcaccggagt cgggggtccc
10201 acggcgtccc ccaaatatgg cgggcggctc ctccccaccc ccctagatgc gtgtgagtaa
10261 ggggggcctg cgtatgagtc agtggggacc acgcccccaa cacggcgacc ccggtccttg
10321 tgtgtttgtt gtgggggcgt gtctctgtgt atgagtcagg gggtcccacg gcgaccccgg
10381 gccctgcgtc tgagtcaaag gggccatgtg tatgtgttgg gggtctgtat atataaagtc
10441 agggggtcac atggcgaccc ccaacagggc gaccccggtc cctgtatata tagggtcagg
10501 gggttccgca ccccctaaca tggcgccccc ggtccctgta tatatagtgt cacggggttc
10561 cacgccccct aacatggcgc cccaacatgg cgcccggctc ccgtgtatga gtgggggtcc
10621 cccaacatgg cggccggttc cagtgtaagg gtcgggggtc ccccaacatg gcgcccccca
10681 atatggcgcc ccccaatatg gcgccccaga catggcgccc ggcccctcac ctcgcgctgg
10741 gggcggccct caggccggcg ggtactcgct ccggggcggg gctccatggg ggtcgtatgc
10801 ggctggaggg tcgcggacgg agggtccctg ggggtcgcaa cgtaggcggg gcttctgtgg
10861 tgatgcggag agggggcggc ccgagtctgc ctggctgctg cgtctcgctc cgagtgccga
10921 ggtgcaaatg cgaccagact gtcgggccag ggctaactta taccccacgc ctttcccctc
10981 cccaaagggg cggcagtgac gattccccca atggccgcgc gtcccagggg aggcaggccc
11041 accgcggggc ggccccgtcc ccggggacca acccggcgcc cccaaagaat atcattagca
11101 tgcacggccc ggcccccgat ttgggggccc aacccggtgt cccccaaaga accccattag
11161 catgcccctc ccgccgacgc aacaggggct tggcctgcgt cggtgccccg gggcttcccg
11221 ccttcccgaa gaaactcatt accatacccg gaaccccagg ggaccaatgc gggttcattg
11281 agcgacccgc gggccaatgc gcgaggggcc gtgtgttccg ccaaaaaagc aattagcata
11341 acccggaacc ccaggggagt ggttacgcgc ggcgcgggag gcggggaata ccggggttgc
11401 ccattaaggg ccgcgggaat tgccggaagc gggaagggcg gccggggccg cccattaatg
11461 agtttctaat taccataccg ggaagcggaa caaggcctct tgcaagtttt taattaccat
11521 accgggaagt gggcggcccg gcccattggg cggtaactcc cgcccaatgg gccgggcccc
11581 gaagactcgg cggacgctgg ttggccgggc cccgccgcgc tggcggccgc cgattggcca
11641 gtcccgcccc cgaggcggcc cgccctgtga gggcgggctg gctccaagcg tatatatgcg
11701 cggctcctgc catcgtctct ccggagagcg gcttggtgcg gagctcccgg gagctccgcg
11761 gaagacccag gccgcctcgg gtgtaacgtt agaccgagtt cgccgggccg gctccgcggg
11821 ccagggcccg ggcacgggcc tcgggcccca ggcacggccc gatgaccgcc tcggcctccg
11881 ccacccggcg ccggaaccga gcccggtcgg cccgctcgcg ggcccacgag ccgcggcgcg
11941 ccaggcgggc ggccgaggcc cagaccacca ggtggcgcac ccggacgtgg ggcgagaagc
12001 gcacccgcgc gggggtcgcg ggggtcgcgg gggtcgcggg ggtcgcgggg gtcgcggggg
12061 gctccggcgc cccctccccg cccgcgcgtc gcaggcgcag gcgcgccagg tgctccgcgg
12121 tgacgcgcag gcggagggcg aggcgcggcg gaaggcggaa ggggcgcgag ggggggtggg
12181 aggggtcagc cccgcccccc gggcccacgc cgggcggtgg gggcccgggg ggcggggggc
12241 ggcggcggtg ggccgggcct ctggcgccga ctcgggcggg gggctgtccg gccagtcgtc
12301 gtcatcgtcg tcgtcggacg cggactcggg aacgtggagc cactggcgca gcagcagcga
12361 acaagaaggc gggggcccac cggcgggggg cggcggcggg gcggccgcgg gcgcgctcct
12421 gaccgcgggt tccgagttgg gcgtggaggt tacctgggac tgtgcggttg ggacggcgcc
12481 cgtgggcccg ggcggccggg ggcggcgggg gccgcgatgg cggcggcggc gggccatgga
12541 gacagagagc gtgccggggt ggtagagttt gacaggcaag catgtgcgtg cagaggcgag
12601 tagtgcttgc ctgtctaact cgctagtctc ggccgcgggg ggcccgggct gcccgccgcc
12661 accgctttaa agggccgcgc gcgacccccg gggggtgtgt tttggggggg gcccgttttc
12721 ggcgtctggc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg
12781 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg
12841 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg
12901 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cccgcggccc
12961 cgccccccac gcccgccgcg cgcgcgcacg ccgcccggac cgccgcccgc cttttttgcg
13021 cgcgcgcgcg cccgcggggg gcccgggctg cc
//
LOCUS HS5IE5KB1 2520 bp ds-DNA VRL 15-DEC-1989
DEFINITION Human cytomegalovirus genome, BamHI-HindIII fragment (5'-terminal
part of the 5 kb transcript from the immediate-early region).
ACCESSION D00328
KEYWORDS immediate-early gene; transforming region.
SOURCE Human cytomegalovirus (strain AD169) genomic DNA, clone pAT153
provided by J. D. Oram and R. G. Downing.
ORGANISM Human cytomegalovirus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Betaherpesvirinae.
REFERENCE 1 (bases 1 to 2520)
AUTHORS Kouzarides,T., Bankier,A.T. and Barrell,B.G.
TITLE Nucleotide sequence of the transforming region of human
cytomegalovirus
JOURNAL Mol. Biol. Med. 1, 47-58 (1983)
STANDARD full staff_entry
REFERENCE 2 (sites; 5 kb RNA start site)
AUTHORS Plachter,B., Traupe,B., Albrecht,J. and Jahn,G.
TITLE Abundant 5 kb RNA of human cytomegalovirus without a major
translational reading frame
JOURNAL J. Gen. Virol. 69, 2251-2266 (1988)
STANDARD full staff_entry
COMMENT In [2], the 5' end of the 5 kb RNA was determined by primer
extension. In [1], the BamHI-HindIII fragment was described as the
sequence containing the region capable of transforming NIH3T3
cells.
FEATURES from to/span description
site 1664 1664 5 kb RNA start site
BASE COUNT 647 a 713 c 589 g 571 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcccgca gcagtccgtt ggcggagtcc gaggagtgct gaccgccgct cccgccgccg
61 ccaccgccac caccagcgcc gccgcctcca ccaccaccgg cagacgagga cgactttttg
121 cgccgttctt cgtgacgctg ttcctgcctt cgccgttgct gttcctccgc agaagggccg
181 tcgcgagtcc cgccgctgcc acccagcgga ggacacgcag acggcggaag cggtagacgc
241 ggcgccgcaa ccaccgcctc cgctggagga ttcgccgtgg tttttcaggt aatgccgcac
301 gtaagtcact tgcaaattac cgttctcgga aatcatggtg agcagcgcgc tctcattggg
361 tccgctggag cccaccaccg aggagacgga tttgttgaag acgataccgc cgcgtacaaa
421 gaggtgctcc tgcagctccc cgtcgcccgt aatgtcaata gacatgaagc cctgctgcgt
481 cttggcgccg gccgaagcct cgccgtgctg cataatggta gcgcagagcc agcccttgtt
541 gaggtgcagc accttgccat cgccgtccac gcagttgacc agacgcgcgg tatcgaagac
601 gaactggcgc acgtcgaaag tctgattgac gctttgatgc aggatgcgat taggattcgc
661 aaaagtccag tattttcgca cgacggtagt agggagatcc atgacgcggc ggcgcaaagc
721 gcgagcgcaa cgctcgtcgg aggccgtgga gcgagtgccg ccgcagccgg cagagcgccg
781 aaccccgtcg cagactctat ttatacatca tctttccagc ccgcctagca acacccacaa
841 acaacgtcac gacgcaacgt ggttaaacag tacgtttatt aaagtaactg ggtgaacgac
901 accggagcgg actgcaaatc gcaacgctac tttctcgagt gcagatactc ttcgagacgg
961 ctaaacaacg tgtccaactc gctgagacct ttccgcgtgc cgttatccga tttcctttcc
1021 gcctcctgag acagccgact aatcacggac ttatctccgc aacctaacag aggctgggag
1081 cccgacaaaa gtaaaacagc gtgctatgaa cacgttgtta cctctgtgcg gacagcgccg
1141 ccacagagac acttacacat tgccgcatgt ctttgtagat ggattctagc gtcgagcgca
1201 tactatgcaa ttccgtcttg agtccgggat agacgtggtc gcctgcggga aacacgatct
1261 ccagataccg cctcaacaac cagtccatga cgctgcatcc ccaacagcct ttgaccaccg
1321 taccgtcgag ccacacggag tagtcgtcct cacgttgcta caagaggaaa actacgtcac
1381 ccgacacgcg gaaaagaaag accgtcgcaa taaaccgtac ctacgtgacc taccaacgta
1441 ggttttactc gatgaaaggt gacgcggaga tcttgcaatc tggtcgcgta atcctctgga
1501 cgacactgcg gctttgtatt ctttatcgtc gtcgtcgccg gcttcgcctc ctcggaagcg
1561 cctagaaaaa agacgatcag gaccagagag gaagagacca tcaccgacag catcgccgca
1621 gcatgccgtc ccagtccgcc gcccaactgc gcgtcccagg taggtggtcc tttatgttat
1681 gatgtttttg tcaatttttt ttttcaattt ctttcttccg cggttagaat agtttctgta
1741 ggaaccaatt atcaatctga cgggttatcg tcaccacttg atggcaaaac gaaatttttt
1801 ttttcattgc cttgaagtct ctcccgccac caccaccacc gccgttgtct ccggctggag
1861 atcaagacga aattcctcct ctctaaaaaa aaaggtggtg ggcttaattg atcatggcaa
1921 gaagaaaaac tatactgaat aaactgtgtg caaaactact agtaacaaca aaaatagcga
1981 ctagatacac cacggacaat ctcagcagat actctctcaa aagaaaaaaa agacgccgta
2041 acgtcggaga atctggtatc tactgcctga cgaatttttt tttcgtccat gtatgtgatt
2101 acgagtagta gtggtatgta gaacaagaag aaaaatcgta gtccccaaaa ggataataaa
2161 aataacactc atagagaatc acagattttc tctagacaac tctctatcca aataacgaat
2221 gtgaagcgta caaagtaaga tattcaaaga atagcacctt catagattca tttcagcttt
2281 ctactccttg taatttaaag ttgcactaaa caaagctctt aaagaaggtt cgagccgctc
2341 tcgatcactc atcgatcacg cgagtcttat tattccacca caacgtaaca ttcttcactt
2401 tgtagagaca ctttatcgta gagtaaccct cgatttccta gctgttgttt tttgattatt
2461 ttgttcgctc taagagagat actcgaaatc ctacttacac caaggaccct acatcatcgc
//
LOCUS HS5IE5KB2 1291 bp ds-DNA VRL 15-DEC-1989
DEFINITION Human cytomegalovirus genome, 3'-terminal part of the 5 kb
transcript from the immediate-early region.
ACCESSION D00327
KEYWORDS immediate-early gene.
SOURCE Human cytomegalovirus (strain AD169) genomic DNA, clones pGJ0.1,
pGJ0.2, pGJ0.3, pGJ0.5, pGJ0.6, and pGJ0.7.
ORGANISM Human cytomegalovirus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Betaherpesvirinae.
REFERENCE 1 (bases 1 to 1291)
AUTHORS Plachter,B., Traupe,B., Albrecht,J. and Jahn,G.
TITLE Abundant 5 kb RNA of human cytomegalovirus without a major
translational reading frame
JOURNAL J. Gen. Virol. 69, 2251-2266 (1988)
STANDARD full staff_entry
COMMENT One of the predominant transcripts from the immediate early region
is a 5 kb RNA. This sequence analysis revealed multiple stop
codons throughout the AT-rich potential coding region.
FEATURES from to/span description
RNA < 1 1267 5 kb RNA (3'-terminal part) (alt.)
RNA < 1 1280 5 kb RNA (3'-terminal part) (alt.)
BASE COUNT 384 a 333 c 275 g 299 t
ORIGIN 15 bp upstream of EcoRI site.
1 ctttttattt tttcgaattc atgttcgaaa acacaagctt ccataacaag aacccgtacc
61 gaagaaaagt tccatcgact aaaaagaaaa aagaaaacga agcaagacct cgacgacaac
121 aacacatcaa agaaagacga ccagctgatt atgttcttag aattccacac acccgcgagc
181 cgatccgcaa acgtcgtgcg aggcgcgctt tctctggctc gacacaatga tcacaccgca
241 cgctatagac acgtcgtcgt ggacgacgat gacctcaggc cacgaatgac aaccaacatg
301 ggcaaagtcc aattagccaa aaagacgacg attctaagaa ttgatgaatc ctcgatatac
361 gcctatcgat aggtttcaat tgtgtcatat acatcaaatg aaaaacagga cgcacgataa
421 aagcttcctt acagcataac tgtaacatac gatcatggaa catctcctca catacctttc
481 tcctctcaca taggaaaaca aaactctttt ttttctttcc tgtcaaggaa aaaatcaatg
541 taccaccaca tcactttctc ctcggtcccg gcgacggatg ggcgcgcacg cggacaaaga
601 cccaccggcc acttccactt attttttgtt gttaatcgtc ttctcccccg cacgcggacg
661 accaccaacg ctagctgctc attccgtcaa ccagtcacac cgcgcacgga gaaggggccg
721 gggtccgcgg gcacccgcgg cggaggcgcg gttccctctc tctaattccc tggaaaacaa
781 gtaatgacaa acaaaaagac gacaaaaggt ctctattctg ctacatgaga gaaattatag
841 ctgttggcaa tttttcaaaa tacatgttat aaggcatcct ctctgccaca cgcgcagtca
901 cggataggat cagtgcgtat tcattataaa aaaaaacaca aacaacccat atatgtgaag
961 cagaatgatg accgaccgca cggagcgacg ccgtcgactg tcagcctcgc gaggagacac
1021 cgcggaccgg ggaaacggat aagtttacga acagaaatct caaaagacgc tgacccgata
1081 agtaccgtca cggagacacg gtggtttttt attgaatttc cagtgtatcg agccaccgtg
1141 atgcaggtac ggtggtttta tgtaaagtgc cgctatctat aggcgatgtg ttcctgacgg
1201 tgtgtgtttt tttggggata gacaacgtgg ttcttgtacg tggtttttac cctgctcaat
1261 aaagtcacgt tttccttaca ggtgttgtgt c
//
LOCUS HSE1GB 4283 bp ds-DNA VRL 15-DEC-1989
DEFINITION Equine herpesvirus 1(EHV1) glycoprotein B (gB) gene and 3' end
of an overlapping upstream gene with homology to the HSV1 ICP18.5
gene.
ACCESSION D00401
KEYWORDS gB gene; glycoprotein; glycoprotein B.
SOURCE Equine herpesvirus 1 (isolate HVS 25A) genomic DNA, clones
pMAC[209, 221].
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 4283)
AUTHORS Whalley,J.M., Robertson,G.R., Scott,N.A., Hudson,G.C.,
Bell,C.W. and Woodworth,L.M.
TITLE Identification and nucleotide sequence of a gene in equine
herpesvirus 1 analogous to the herpes simplex virus gene encoding
the major envelope glycoprotein gB
JOURNAL J. Gen. Virol. 70, 383-394 (1989)
STANDARD full staff_entry
COMMENT Submitted in computer readable form by Whalley,J.M. on 19-Nov-1988.
The EHV1 gB ORF appears to be overlapped at its5' end by 135 nt of
the 3' end of an upstream ORF the potential translation product of
which has approximately 50% identity with HSV gene ICP 18.5 and VZV
gene 30 product.
FEATURES from to/span description
ORF 951 3893 equivalent to the gB glycoprotein gene of HSV.
ORF < 1 1089 analogous ORF to HSV1 ICP18.5.
sigp 951 1205 signal peptide.
signal 719 723 putative CAT box.
signal 802 806 putative TATA box.
signal 3902 3907 putative polyA signal.
site 823 831 similar sequence to putative HSV1 mRNA start
site.
BASE COUNT 1090 a 1168 c 1118 g 907 t
ORIGIN map position aprox. 0.41-0.44 unit.
1 ctgcagaggc tcacggaccc agacaccagc aacagagagg ccctcaagca gctgctgggt
61 cgcatagggg tggataccga cgacggggcc ggcgagttgg gggacgcctt agacgtggat
121 ttggataatc taggtggggc ccctcctgtc aacagcaccc cctgtggtga ggacgccctc
181 tgtcgaaccg tttccgagga acgcccgtgg gacaaacttt tagagcgggc gactgcggat
241 gcttcgcagc gcaggcgcat gtacgcggag cgtctgtcaa agcgttccat cgccagtttg
301 gggcgctgcg tgcgcgaaca gcgaagagaa ctagaaaaaa ccctgagagt taacgtgtat
361 ggcgaagtgc tgctacatac gtacgtatcg tcctacaacg ggttttgcgc caggcgcggg
421 ttttgcgcgg cggtgagtcg agcgggtacc atcatagata accgctctag cacgtccgcg
481 ttcgactcgc atcagttcat gaaggcggcg ctgcttcgcc accccattga ccagtcgctc
541 atgccgtcca taacacacaa gtttttcgag ctgatcaacg ggcccgtgtt tgacaacgct
601 ggccacaact ttgcgcagcc gccaaacacg gcattatatt acagcgttga aaacgttggg
661 ttgttaccgc atctcaagga ggaactagct cggtttatga ttactgcggc taaaggtgat
721 tggtcaatta gcgagtttca aaggttttat tgctttgagg gagtgacagg tgtgacggcc
781 acgcagcggc tggcgtggaa atatatcggg gagctcatcc tagccgccgc agtattctcc
841 tcggttttcc actgtggaga ggtgcgcctc ctgcgcgcag atcgtaccta cccggactcc
901 agcggcgcac agcgctgcgt gagcggcatt tacataacct acgaggcgtc atgtcctctg
961 gttgccgttc tgtcggcggc tccacatggg gcaattggcg cggagacggt ggtgatttac
1021 gacagcgacg tgttctctct cctgtatgca gtgctccagc agctggctcc tggatcggga
1081 gccaactagg caatgttgga aacttactcg ccacccccca cccgctggga aagccggcat
1141 catcgagggt gggcacaata gttctagcct gtttgttgct ttttggaagc tgtgttgtta
1201 gagccgtacc caccacgcca agccccccaa ctagtactcc cacttccatg tcaacgcact
1261 cccatgggac agtagaccct acgctgctcc ccacagaaac gcccgaccca ctcagactgg
1321 ctgtgcgcga gtccggtata ctcgctgagg atggagactt ttacacctgc ccaccgccta
1381 ccggatccac cgtcgtacgc atcgaaccac ctagaacttg ccccaagttt gaccttggga
1441 gaaacttcac ggaggggatt gctgttattt ttaaggaaaa catcgctccc tacaaattca
1501 gggcaaacgt atactacaag gacatcgttg taacacgtgt gtggaaagga tacagccata
1561 cgtccctgtc cgacagatac aatgacaggg ttccggtttc ggtggaggag atcttcggtc
1621 tcatcgacag taagggaaaa tgttcgtcaa aggccgagta cctcagagat aacatcatgc
1681 accacgcgta ccacgacgac gaggacgagg tggagcttga tttgtgccgt ccaagtttgc
1741 aactccgggg ggccagagcc tggcagacca ccaacgatac tacgtcttac gtggggtgga
1801 tgccatggag gcactacacg tcaacgtctg tcaactgcat cgtcgaggag gtggaggcgc
1861 ggtccgtcta cccctacgac tccttcgccc tgtccaccgg tgatattgtg tacgcgtctc
1921 cgttttacgg cctgagggct gccgctcgca tagagcacaa tagctacgcg caggagcgtt
1981 tcaggcaagt tgaagggtac aggccccgcg acttagacag taaactacaa gccgaagagc
2041 cggttaccaa aaattttatc actaccccgc atgtcaccgt cagctggaac tggaccgaga
2101 agaaagtcga ggcgtgtacg ctgaccaaat ggaaagaggt cgacgaactc gtcagggacg
2161 agttccgcgg gtcctacaga tttactattc gatccatctc gtctacgttt atcagtaaca
2221 ctactcaatt taagttggaa agtgcccccc ttactgaatg tgtatccaaa gaagcaaagg
2281 aagccataga ctcgatatac aaaaagcagt acgagtctac gcacgtcttt agcggtgatg
2341 tggaatatta cctggcacgc ggggggttct taattgcatt cagacctatg ctctccaacg
2401 aactcgccag gctgtacctg aacgagcttg tgagatctaa ccgcacctac gacctaaaaa
2461 atctattgaa ccccaatgca aacaataaca ataacaccac gcgaagacgc aggtctctcc
2521 tgtcagtacc agaacctcag ccaacccaag atggtgtgca tagagaacaa attctacatc
2581 gcttgcacaa acgagcagtg gaggcaacgg caggtaccga ttcttccaac gtcaccgcca
2641 aacagctgga gctcatcaaa accacgtcgt ctatcgagtt tgccatgcta cagtttgcat
2701 acgatcacat ccaatcccac gtcaatgaaa tgctaagtag aatagcaact gcgtggtgta
2761 ccctccaaaa caaagagcgg accctatgga acgaaatggt gaagattaac ccgagcgcca
2821 tagtctccgc aacccttgac gagcgagttg cagcgagggt cctgggggac gtgatagcta
2881 taacgcactg cgccaaaata gagggcaacg tgtacttgca aaactccatg cgctcgatgg
2941 acagtaacac gtgctactcc cgcccccccg taacatttac aattactaag aatgcaaaca
3001 acagagggtc gatagaaggc cagctgggag aggagaacga gattttcacg gagcgcaagc
3061 tgatcgagcc gtgcgccctc aatcagaagc gctactttaa gtttggcaaa gagtacgttt
3121 actacgagaa ctacacgttc gtccgcaaag tgccccccac ggaaatcgag gttatcagca
3181 cgtacgttga actaaacttg acccttttgg aagaccgcga gtttctgccc ctggaggtgt
3241 acacgcgggc tgagctggag gacaccggcc tgctagacta cagcgaaata cagcgccgca
3301 accagctcca cgctctcagg ttttacgaca tcgacagcgt ggtcaacgtg gacaataccg
3361 cagtgattat gcaggggatc gccagctttt tcaagggcct gggtaaagtg ggggaggccg
3421 tgggaacgct cgttctcggc gccgccggcg ctgttgtttc aaccgtatct ggaatagctt
3481 cgtttttaaa caacccattt ggggggctag ccatcggcct gctggtaatc gccggcctgg
3541 tagctgcgtt ttttgcttac agatatgtaa tgcagatccg cagtaacccc atgaaagctc
3601 tataccccat aacaacaaag gccttgaaaa acaaagccaa aacttcctac ggccagaacg
3661 aggaggacga tgggagcgac tttgatgagg ccaagcttga agaggctcgc gaaatgatca
3721 aatacatgtc tatggtttcg gccctggaaa agcaggaaaa gaaagctata aagaaaaaca
3781 gtggggttgg cctgatcgcc agtaacgtct caaagctggc cctgcgaagg cgcggtccca
3841 aatatacccg actccaacag aacgatacca tggaaaatga aaaaatggtt taaacatgtt
3901 taataaatat tatgacacgt actcaaagtg tgacctcata tttgcataac cactttctag
3961 ttccggcccc aaggatattt aagcctagta tctccgccga ggtttcatcc tcattcacca
4021 actcacactt agagttgacg cttcctcttg cgcctttgct ctcgccgctc ctgtgttagc
4081 gtatactgcc caagaaatgg attctccacg cggtatctcc acagctaccg gtgatgccca
4141 cgccgaggcc gcggtttccc cagccgcgaa atccagataa aaacgaagcc cccgatgtag
4201 acggaccaga agccactact gagtgtttag accacaccta cacccaacag acaagcgggg
4261 gtgatggcct agatgctatc gat
//
LOCUS HSEIEP 8174 bp ds-DNA VRL 15-DEC-1989
DEFINITION Equine herpesvirus type 1 immediate-early protein gene, complete
cds.
ACCESSION J04366
KEYWORDS immediate-early protein; nonstructural protein; regulatory gene.
SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 8174)
AUTHORS Grundy,F.J., Baumann,R.P. and O'Callaghan,D.J.
TITLE DNA sequence and comparative analyses of the equine herpesvirus
type 1 immediate early gene
JOURNAL Virology 172, 223-236 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by D.J.O'Callaghan, 13-JUN-1989.
FEATURES from to/span description
pept 988 5451 immediate-early protein
signal 334 339 TATA box
signal 5735 5740 polyA signal
BASE COUNT 1171 a 2937 c 2790 g 1276 t
ORIGIN
1 cccggggagg agacgcatgc agatgagatg tgcatcgagg tgtcatggcg tccaggggcg
61 ttcaccttta tgcatatgag aggcgctatt cggcatcccg ttggcgcgac gcgcttccct
121 gggaggagac atacgcaaat tagaaacgac acacgggttc taattggttg gagcgggggg
181 gaggcgaaaa gcgcatgcaa atgcaaagcg cgggaccggg ccccataggc tagagccgct
241 acacgcccac cgcccatcat caacggccaa tcacaatcga tagtgtgggc tggccactcc
301 cactaggggg aaggcaaaac tccctcgtag tagtataaag cacctgttgc ttacccatcg
361 tagcatcgcg gactagagag cctttcagct cactggacca gccagccttc gaggactatc
421 gatcgcatct tggaaagctt acccgctctt ggcactcctt cttcggcttg cggaggtaag
481 agctccccgg ggacacgacc ggcttcgatc tgcttcttct cccggggaga gcgttagaga
541 acggggcgag tgccaaaaag gccatggaac ccctccaaca acgatgtccc gagggggtgg
601 ctccgaggcc cgcttcgacc tagcggtcga agcgcggtgg ggatacttac ctcgaagccg
661 gcgaaggcta taccttcccc gggcagaccc gggcggcttc tgcctcggcg gagctcggcg
721 cggaagcctg gatatctgac ggggcgtggt taccacccaa gcgggggaga ggcccgggcc
781 gcccgcgttc ccttttacca ttcggctccg ctccaactca acatcttttc cgcctctgct
841 tttccagggt agagaagcgg cgcccgtcgt ccgagcgccc gccgcggaac cccgccaccg
901 gccacccgcc aaccttccct tctcggtctt ccgagcgagc cttctcgtgc ggttggttct
961 cgaccccgaa gccggagcta gcacgccatg gccagccagc gcagcgactt cgccccggac
1021 ctctacgact tcatcgagag caacgacttc ggcgaggacc ccctcatccg cgcagccagc
1081 gcggccgaag aggggttcac ccagcccgcc gcgcccgacc tgctgtacgg cagccagaac
1141 atgttcgggg tggacgacgc tccgctctcc accccggtgg tggtcatccc tccgccgtct
1201 ccggctcccg agccccgcgg agggaaggcg aagcggtcgc cctcggccgc cggcagcggc
1261 ggtcctccta ccccggcggc tgccgcccag ccggcgtccc cggcacccag cccggctccg
1321 gggctcgccg cgatgctgaa gatggtccac tcctccgtgg ccccggggaa cggtcgccgg
1381 gccacgggct cctcatcacc cggcggtggg gacgcggccg acccggtcgc cctcgacagc
1441 gataccgaga cctgcccggg gtccccgcag cccgagtttc catcctcggc ctccccgggc
1501 ggagggtccc cggcaccccg ggtccggagc atctccatct catcgtcgtc ctcgtcctcg
1561 tcctcgatgg acgaggacga ccaggcggat ggtgccgggg cgagtagctc ctcttcgtcg
1621 tcctccgacg acagcgacag cgacgaaggc ggcgaggagg agacccctcg cccgcggcac
1681 tcgcagaacg ccgcgaagac cccgtcggcc gccggctctc ccgggccgtc ctccggaggg
1741 gatcgcccgg ccgctggggc cgccaccccg aagagctgcc gctccggcgc cgcttccccc
1801 ggcgcacccg ctccggctcc agcttcggcg cccgctccca gccgcccggg aggaggcctc
1861 ctccctccgg gggctcgcat tttagagtac ctggagggcg tccgcgaggc caatctggcc
1921 aagacgctgg agaggcccga accgcccgcg gggatggctt ctccgccggg ccggagccct
1981 caccggctcc ccaaggacca gcgtccgaaa tcggctctgg cgggagcgtc gaagcgcaag
2041 cgggccaacc ccagacccag accccagacc cagacccagg caccggccga ggaggccccg
2101 cagacggccg tgtgggactt gctggacatg aactcatccc aggctaccgg ggcggcggca
2161 gcagcagcat cggccccggc ggcggcttcg tgcgccccgg gcgtctacca gcgcgagccg
2221 cttctcaccc cgtccgggga cccctggccc gggtcggatc caccaccgat ggggagggtg
2281 cgatacgggg ggaccgggga ctcgcgggac gggctgtggg acgaccccga gatagtcctg
2341 gccgcctcgc gctacgccga ggcgcaggcc ccagtaccgg tcttcgtgcc ggagatgggg
2401 gactccacca agcagtacaa cgctctggtc cgcatggtgt tcgagagccg cgaagccatg
2461 tcctggctgc agaactctaa gctcagcggg caagaccaga acctggcgca gttctgccag
2521 aagttcatcc acgctccgcg cggacacggg tccttcatca ccgggagcgt ggccaacccc
2581 ctgccccaca tcggggacgc catggcggcc gggaacgcgc tctgggccct gccacacgcg
2641 gccgcctcgg tggccatgag ccgccgctac gatcgcactc agaagagctt catcctccag
2701 agcctccggc gcgcctacgc ggacatggcc tacccgagag acgaggcggg gaggccggac
2761 tcactcgccg ccgtggccgg ctgcccggcc caggccgccg ctgccgcggc cagccagcaa
2821 cagcccgagg ccccggcgcc ctcggtccgc gtccgcgaag cgtacacccg ggtctgcgcg
2881 gccctcgggc cccgacgcaa ggctgccgcg gccgcggccg ctccggggac cagggcgccc
2941 aggccgtccg ccttcagact cagggagctc ggggacgcct gcgtgctggc ctgccaggcc
3001 gtcttcgagg ccctcctgcg cctccgcggc ggggcgtccg ccgtccccgg actggacccc
3061 agcgagatcc cctctcccgc ctgccctccc gaggcgctgt gctccaaccc ggccgggctg
3121 gagacggcgg ccctctccct ctacgaactc agggacctgg tcgagcgggc caggctcctc
3181 ggggactctg accctaccca ccgcctgggc tccgacgagc tgcgcctcgc ggtgcgcgcc
3241 gttctggtgg tggcccggac cgtggcgccg ctggtgcgct acaacgccga gggggcccgg
3301 gcccgggcct cggcctggac cgtcacccag gccgtgttca gcatacccag cctggtcggg
3361 gggatgttgg gggaggccgt gtccctgctg gccccaccga ctcggtccca gcagccctca
3421 tcgtcctcgc ccggcggcga gcccttctcc ggctccgcgg ccgcggaggg gagccttcag
3481 accctgccgc ccctgtggcc caccgtcccc gggaagcagt ccgcgacggt cccctcgtcc
3541 cactcccagt ccccccagca ctcccagagc ggcggaggcg ccggggctac gaccgccacc
3601 tgctgccggg ccacccagac aaacgcccgc tcccgggggc agcagcacca gccgcagaag
3661 gcccgctccc ctcaggcggc cgcctccccg gcccacctca gccaggaggc gatgcccggc
3721 tcctcctcgg acgaccgtgc catccacggg cgccccaggg gcaagagcgg caagcggcgc
3781 tccgagcccc tggagccggc ggcccaggcc ggagcctcgg cctccttctc ctcgtccgcc
3841 cgggggtacg atccctcggg gccggtcgac agccctccgg cccccaagcg cagggtggcc
3901 accccgggcc accaggctcc ccgggccctg ggacccatgc cagccgaggg ccccgaccgt
3961 cggggcggat tcaggcgcgt tccccgcgga gactgccaca ctccgcggcc cagcgacgcg
4021 gcttgcgcgg cctactgtcc ccccgagctg gtggcggagc tcatcgacaa ccagctgttc
4081 cccgaggcct ggcgcccggc gctcaccttc gatccccagg ccctggccac catcgcggcc
4141 cgctgcagcg gccccccggc ccgggacggc gcgcgcttag gggagctggc ggccagcggc
4201 ccgctgagac ggagggccgc ctggatgcac cagatccccg accccgagga cgtgaaggtg
4261 gtggtcctct actccccgct ccaggacgag gacctgctgg gcggactccc ggcctcccgc
4321 cccggcggct ctcggcgcga gcccctctgg tccgacctca aggggggact ctcggcgctg
4381 ctggcggccc tggggaaccg catcctcacc aagcggtccc acgcctgggc cggcaactgg
4441 accggggccc cggacgtctc ggccctcaac gcccaggggg tcctgctgct gtcgaccggg
4501 gacctggcct tcaccggctg cgtcgagtac ctctgcctgc gcctgggctc cgccaggcgc
4561 aagctcctgg tgctggacgc ggtctccacc gaggattggc cccaggacgg tcccgcgatc
4621 agccagtacc acatctacat gcgggccgcc ctgactccgc gggtcgcctg cgccgtgcgc
4681 tggcccgggg agcgccacct cagccgcgcg gtcctcacct ccagcaccct cttcgggccc
4741 ggactgttcg cgagggccga ggccgcgttc gcgcgcctgt acccggactc tgcgcccctg
4801 aggctgtgcc gctcctccaa cgtggcctac acggtggaca ctcgcgccgg cgagcgcacc
4861 cgcgttcccc tggctccgag ggagtaccgc cagcgcgtcc tgcccgacta cgacggctgc
4921 aaggacatgc gggcccaggc cgagggcctc gggttccacg acccggactt tgaggagggc
4981 gccgcgcaga gccaccgcgc ggccaaccga tggggactcg gggcctggct gcgccccgtg
5041 tacctcgcct gcggccggcg cggcgctggg gccgtggagc cctcggagct tctgatcccc
5101 gagctgctga gcgagttctg ccgggtggcg ctgctggagc ccgacgccga ggccgagccc
5161 ctggtgctgc ccatcaccga ggctccccgc cgccgagccc cgcgggtcga ctgggagccc
5221 gggttcggct ctcgctccac ctcggtcctg cacatggggg ccacggagct gtgcctgccg
5281 gagcccgacg acgagctcga gatcgacggg gccggcgatg tggagctggt ggttgagcac
5341 cccggcccga gccccggcgt ggcccaggcc ctccgccgcg ctcccatcaa gatcgaggtg
5401 gtgtcggacg acgaggacgg aggagactgg tgcaatccgt acctctcctg aacacgatgg
5461 agcgcctccc tgcggccgaa aacaagaaaa atcagtacat ccacaactat gtgtccgccc
5521 agcacaacgc agactccgcc tagactcccg cctccatccg ctgacgctga accccgcccc
5581 gccctctgct gacgcgaaga caaggccctc cccggacgac atgtgaggaa cgaagggggc
5641 gttgtatcta gcagcccacg ttccttattg ctcacatgtc tgcccaatcg gtgggcactt
5701 ccaggctttc ccctatcgct gagtggttgt ttttaataaa gtttttttta aattttgatt
5761 gaccgcgtgg tctttgttta ctgggcgggt tgatgggcgg gttgatgggc gggttgatgg
5821 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg
5881 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg
5941 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg
6001 gcgggttgat gggcgggttg atggttcctg ctcctcccct tcctgctcct ccccttcctg
6061 ctcctcccct tcctgctcct ccccttcctg ctcctcccct tcctgctcct ccccttcctg
6121 ctcctcccct tcctgctcct ccccttcctg ctcctcccct tcctgctcct ccccttcctg
6181 ctcctcccct tcctgctcct ccccttccgc tacgtcacta ccgcctacgt cactaccgga
6241 ctcctcccct tccgcttccg gccacgcccc ttccggtgag ccccagcata gcagtgagcc
6301 ccagcatagc agtgacgtca ctttgacccc cccccttaga ccacgccccc ctattcaaat
6361 gcggggggga gacgcgggct gggggggcca ggctctctct cgggcgcggg cccgtgaccc
6421 ttgaccagat atggcccggg gccaggctct ctctcgggcg cgggcccgtg acccttgacc
6481 agatatggcc cggggccagg ctctctctcg ggcgcgggcc cgtgaccctt gaccagatat
6541 ggcccggggc caggctctct ctcgggcgcg ggcccgtgac ccttgaccag atatggcccg
6601 gggccaggct ctctctcggg cgcgggcccg tgacccttga ccagatatgg cccggggcca
6661 ggctctctct cgggcgcggg cccgtgaccc ttgaccagat atggcccggg gccaggctct
6721 ctctcgggcg cgggcccgtg acccttgacc agatatggcc cggggccagg ctctctctcg
6781 ggcgcgggcc cgtgaccctt gaccagatat ggcccggggc caggctctct ctcgggcgcg
6841 ggcccgtgac ccttgaccag atatggcccg gggccaggct ctctctcggg cgcgggcccg
6901 tgacccttga ccagatatgg cccggggcca ggctctctct cgggcgcggg cccgtgaccc
6961 ttgaccagat atggcccggg gccaggctct ctctcgggcg cgggcccgtg acccttgacc
7021 agatatggcc cggggccagg ctctctctcg ggcgcgggcc cgtgaccctt gaccagatat
7081 ggcccggggc caggctctct ctcgggcgcg ggcccgtgac ccttgaccag atatggcccg
7141 gggccaggct ctctctcggg cgcgggcccg tgacccttga ccagatatgg cccggggcca
7201 ggctctctct cgggcgcggg cccgtgaccc ttgaccagat atggcccggg gccaggctct
7261 ctctcgggcg cgggcccgtg acccttgacc agatatggcc cggggccagg ctctctctcg
7321 ggcgcgggcc cgtgaccctt gaccagatat ggcccggggc caggctctct ctcgggcgcg
7381 ggcccgtgac ccttgaccag atatggcccg gggccaggct ctctctcggg cgcgggcccg
7441 tgacccttga ccagatatgg cccgggtaga gagagactgg gttcagaaga gccagagtgg
7501 gtctgtaaag acaagggagt gggacgcggg tggtgggaag tggctcaaca ccgtggccgg
7561 agatggttgg ggagggggaa aatgggggaa atatagtaaa ctagtttact actggtacta
7621 ttccacggtt atagcatttc taagctggtc cgaggaggag agtagaaagg actcaatgtg
7681 tccatttgtg tgatatatag tctgtgaccc ctagtaacac tactgccatt agtttctccc
7741 cactatatgc tcagcttgtc tataccgcgc tcacactcag gaggttaggt gtgctaatag
7801 gccaatcggg gggggggggg ggtgtggtgg taaatagcgg catcccccct agagcagata
7861 aactggagtt taatagggct agggcagggg gctagggcag ggggctaggg cagggggcta
7921 gggcaggggg ctagggcagg gggctagggc agggggctag ggcagggggc tagggcaggg
7981 ggctagggca gggggctagg gcagggggct agggcagggg gctagggcag ggggctaggg
8041 cagggggcta gggcaggggg ctagggcagg gggctagggc agggggctag ggcagggggc
8101 tagggcaggg ggctagggca gggggctagg gcagggggct agggcagggg gctagggcag
8161 ggggctaggg cagg
//
LOCUS MLVCASBRE 3335 bp ds-DNA VRL 15-MAR-1989
DEFINITION Murine leukemia virus (Cas-Br-E MuLV), 3' end of proviral genome,
encoding pol polyprotein, partial cds, and env polyprotein,
complete cds.
ACCESSION M14702
KEYWORDS env gene; glycoprotein; pol gene; provirus.
SOURCE Murine leukemia virus (isolate pBR-NE-8) proviral DNA, from mouse
brain.
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 3335)
AUTHORS Rassart,E., Nelbach,L. and Jolicoeur,P.
TITLE Cas-Br-E murine leukemia virus: Sequencing of the paralytogenic
region of its genome and derivation of specific probes to study its
origin and the structure of its recombinant genomes in leukemic
tissues
JOURNAL J. Virol. 60, 910-919 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 850 pol polyprotein
pept 790 2775 env polyprotein
matp 2185 2772 Prp15E glycoprotein
LTR 2817 3335 3' long terminal repeat
rpt 3192 3259 3' LTR R region
rpt 2817 2828 inverted terminal repeat copy A
rpt 3324 3335 inverted terminal repeat copy B
BASE COUNT 851 a 928 c 798 g 758 t
ORIGIN 84 bp upstream of HindIII site.
1 gatcgatttc accgaggtaa aacctagatt gtatggctat aagtatcttt tagtttttgt
61 agatactttc tctggctgga tagaagcttt cccaaccaag aaagaaaccg ccaaggtcgt
121 gactaagaaa ctgctagaag agatcttccc taggttcggc atgccgcagg tattgggaac
181 tgacaatggg cctgccttcg tctccaaggt gagtcagaca gtggccgatc tgttggggat
241 tgattggaaa ttacattgtg catacagacc ccaaagctca ggtcaggtag aaagaatgaa
301 taggaccatc aaggagactt taactaaatt aacgcttgca actggctcta gagactgggt
361 cctcctactc cccttagccc tgtaccgagc ccgcaacacg ccgggccccc atggcctcac
421 cccatatgag atcttatatg gggcaccccc gccccttgta aacttccctg accctgacat
481 gaccagagtt actaacagcc cctctctcca agctcactta caggctctct acttagtcca
541 gcacgaagtt tggagaccac tggcggcagc ttaccaagaa caactggacc ggccggtggt
601 gcctcaccct taccgggtcg gcgacaccgt gtgggtccgc cgacatcaaa ccaagaacct
661 agaacctcgc tggaaaggac cttacacagt cctgctgacc acccccaccg ctctcaaagt
721 ggacggcatc tctgcgtggg tacacgccgc tcacgtaaag gcagcaacga cttctccggc
781 cagaacagca tggaaggtcc agcgttctca aaatccccta aagataagac tatcgagaga
841 gccttcctag gggttttggg gatcttattc gtgacaggag ggttagcgag cagagacaac
901 ccccaccagg tatataatat aacttgggaa gtaacaaatg gagaacaaga cactgtgtgg
961 gcagtaaccg gcaaccaccc cttgtggact tggtggccag acctcacacc agacctttgt
1021 atgctggccc tacatggccc aactcattgg ggcctagaca accaccctcc atattcctct
1081 cccccggggc ccccttgttg ttcaggagat gcaggggctg tgtcaggctg tgctagagac
1141 tgtgatgagc ccttgacctc ttactccccc cggtgcaata cagcctggaa tagactgaaa
1201 ctggcccggg taacacatgc acctaaagag ggattttata tctgccctgg gtcacatcgc
1261 cccaggtggg ctcggtcgtg cgggggtcta gacgcctatt attgtgcctc ctgggggtgc
1321 gaaactacag gccgagcagc ctggaaccca acttcatctt gggactatat cacagtaagc
1381 aataatttaa cttcctcaca ggccaccaaa gcctgcaaaa ataatggctg gtgcaacccc
1441 cttgtcatac gattcacggg tccaggaaaa agggccacct cctggactac aggtcatttc
1501 tggggactgc gcctgtacat ctctggacat gacccagggc tcacttttgg gattcggcta
1561 aaagtgacag atctgggacc tagagttcca atagggccaa atcctgtctt gtcagatcag
1621 cgaccgccct cccggcctgt acctgccaga cctcccccac cttcagcctc accttccact
1681 cccaccatac ctccacagca ggggaccggg gacaggttac ttaatctggt ccagggagcc
1741 tacctcacac tcaatatgac tgatcccacc agaacccagg agtgttggtt atgcctagtc
1801 tccgagcctc cgtattatga aggggtggcc gtgttgagag agtacactag tcatgagacg
1861 gcacctgcta actgctcctc cggatcccaa cataagctga ccttatctga ggtaactgga
1921 cagggaagat gtctaggaac ggttcccaaa actcaccagg ctctatgcaa ccgcaccgag
1981 cccaccgtaa gtggttccaa ttacttggtg gctcccgaag gtaccctctg ggcatgcagc
2041 accgggctca ctccctgtct gtctactact gtgctcaact taaccactga ttactgtgtc
2101 ctagttgaac tctggccaaa ggtgacctac cactcccctg actatgtcta tactcagttt
2161 gaaccagggg ccagattccg aagagagccg gtgtcgctga ccctcgccct gctaccagaa
2221 ggtctcacca tgggtggaat tgccgcagga gtagggacag ggacaactgc cctggtcgcc
2281 acccaacagt ttcaacaact tcaggctgct atgcacaacg acctcaagga agttgaaaaa
2341 tcaattacta atctagaaaa gtctctgacc tcgctgtcag aagtggtttt gcagaaccgc
2401 agaggcctag atctactatt tctaaaagag ggaggccttt gcgcggctct aaaagaagag
2461 tgctgctttt atgcagacca cacaggatta gtgagagata gcatggccaa acttagagaa
2521 agactaaacc agagacaaaa attgtttgaa tcaggacaag ggtggtttga aggactgttt
2581 aataggtccc catggttcac aaccctgata tccactatta tgggccctct gatagtactt
2641 ttattaatcc tacttttcgg accctgcatt ctcaatcgat tggtccaatt tgttaaagac
2701 aggatctcag tggtccaggc tctggttttg actcagcaat atcaccagct aaaacccata
2761 gagtacgagc cgtgaataaa ataaaagatt ttatttagtt tccagaaaaa ggggggaatg
2821 aaagacccac catcaggttt agcaagctag cttaagtaac gccatttatt ttgcaaggcc
2881 tggaaaaata ccgaactgag aatagggaag ttcggatcaa ggtcaggaac agaaaaacag
2941 ctgaagttgg gccaaacagg atatctgtgg taagcagttt cggccccggc ccgaggccag
3001 aacagatggt ccccagatat ggcccaatcc tcagcagttt ctagggaccc atcagatgtt
3061 ttcaggctgc cccaaagacc tgaagtgacc ctgtgcctta tttgaactaa ccaatcagct
3121 cgcttctcgc ttcggtttgc gcgcttctgc tccccgagct ctataaaaga gcacacaacc
3181 cctcactcgg cgcgccagtc ctccgataga ctgagtcgcc cgggtacccg tgtatccaat
3241 aaaccctctt gctgttgcat ccgactggtg gtctcgctgt tccttgggag ggtctcctca
3301 gagtgattga ctacccgcct cgggggtctt tcatt
//
LOCUS PPMCG 5089 bp ds-DNA VRL 15-MAR-1990
DEFINITION Monkey B-lymphotropic papovavirus complete genome.
ACCESSION M30540
KEYWORDS complete genome; large T-antigen; small T-antigen; viral protein.
SOURCE Monkey C-lymphotropic papovavirus DNA, clone pL02, passed in human
B-lymphoblastoid cell line BJA-B.
ORGANISM Monkey B-lymphotropic papovavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Papillomavirus.
REFERENCE 1 (bases 1 to 5089)
AUTHORS Furuno,A., Kanda,T. and Yoshiike,K.
TITLE Monkey B-lymphotropic papovavirus genome: The entire DNA sequence
and variable regions
JOURNAL Jpn J Med Sci Biol 39, 151-161 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 152 721 small T-antigen
pept 152 388 large T antigen, exon 1
744 2600 large T antigen, exon 2
pept 3770 2664 (c) VP-1
pept 4362 3649 (c) VP-3
pept 4719 3649 (c) VP-2
rpt 4724 4783 repeat copy A
rpt 4784 4843 repeat copy B
BASE COUNT 1516 a 1015 c 1054 g 1504 t
ORIGIN 695 bp upstream of HindIII site.
1 cccctagcct cctcctcttc tttcaacaaa gagagaggct ttggaggctt ttccaaaaac
61 tcattaggta agctgccctg agatattttc ccatataatt aagtattaag gccacctagg
121 taattaaatt tattccattt tattcacagc catggaccaa acgctgtcta aggaggagag
181 aaatgagctt atggatttat tgcaaataac tagagctgca tggggaaatc tttctatgat
241 gaaaaaagcc tataaaaatg tctccaagct ctaccatcct gataaaggag gagattcagc
301 taaaatgcag cggctcaatg aattatttca aagggtccag gttaccttga tggagataag
361 gagtcaatgt ggatcctctt cttcccaggt agcttggttt ttttgggatg agaattttag
421 aaccctagga gcttttctag gagaaaaatt taatgaaaaa attattggac tctaccctac
481 ttgcactaaa tttgtaagag ctaattgtaa ttgtatagta tgtctgctaa aaaagcagca
541 tgcaggtaca aaaaaaaatt taaaaaagcc atgtttagtc tggggagaat gttggtgcta
601 caaatgttat ttagtatggt ttggctttcc tgaggatttc acctcttttc gctactggac
661 ccttcttatg gcaaatatgg atttatctat gctcaagctt tggacggaac tgggattcta
721 atgtaagtat ttttattttc tagggttact tcagtgagga cttctacttt gggcctacca
781 cctttcaata tagccctatg gatcgagatg cagttcggga ggatcttcca aatccagggg
841 aagggtcttg ggggaaatgg tggagagagt ttgttaatag gcaatgttgt gatgatttgt
901 tttgctcaga aacaatgagt agttcaagtg atgaagacac ccccccagcg gcgcaacctc
961 ctcctcctcc tgccccttcc ccagaagaag aggatgaaat agaatttgta gaagagaccc
1021 caagttcctg tgatggatct tcttctcaaa gctcctacac ctgcaccccc cctaaaagga
1081 agaaaactga agaaaagaag ccagatgatt ttcctgtatg tttatattcc tttttaagtc
1141 atgcaattta tagtaataag actatgaata gttttttaat atatactact ttggagaaag
1201 ccaggcaact gtataaaact gtggaaaaat ctaaaattgt agttgatttt aaggctagtt
1261 tttcttatca ggatgaggaa ggggaggggt gtttgctgtt tttaattact ttaggaaaac
1321 atagagtgtc tgctgttaag catttttgtg tatcccaatg tacttttagt tttattcatt
1381 gtaaagctgt tgttaaacct ctagagttat ataagacctt aagtaaacca ccttttaagt
1441 tgttggaaga gaacaaaccg ggtgtatcca tgtttgagtt ccaagaggag aaggaacagt
1501 ctgttaattg gcaagaaata tgtaactttg caaatgaggc caacatttct gatgtcttat
1561 tgttgcttgg catctacata gattttgcag tggaacctgg caaatgtggc aagtgtgaaa
1621 aaaagcagca caaattccac tataattatc acaaagcaca tcatgccaat gcttgcctct
1681 tcttggagag tagagcccaa aaaaacattt gccaacaagc agttgaccag gtcctagcag
1741 ctaaaaggtt aaaattagta gaatgcagta gaattgaatt attagaagag agatttttgc
1801 agctttttga tgaaatggat gacttcctgc atggtgagat agaaattcta agatggatgg
1861 cgggtgtggc ctggtacacc attttactag ataattcttg ggatgttttt caaaatatcc
1921 tacaattaat aactaccagc caacccaaaa aaaggaatgt cctgataaag ggaccaatta
1981 acagtggtaa aactactttg gcttctgctt tcatgcattt ttttgatggc aaagctctaa
2041 atataaattg tcctgcagat aaactgtcct ttgaacttgg ctgtgctatt gatcaattct
2101 gtgttttgtt agatgatgtg aagggccaaa taaccttaaa taagcacttg caaccaggtc
2161 aaggggtaaa taatcttgat aacctgagag atcatcttga tggaacaatt aaagttaatt
2221 tagaaaagaa acatgtaaac aaaaggagtc aaatttttcc cccggttatt atgactatga
2281 atgagtactt gttgcctcct accataggag ttagatttgc tcttcatctg catttaaaac
2341 ctaaggctta tcttaaacaa agcctggaaa aaagtgacct ggtagccaaa agaatattaa
2401 attcaggata tactattttg ctccttttgt tatggtacaa tcctgtggat tcttttactc
2461 caaaagtgca agaaaaagtg gtgcaatgga aagaaaccct tgaaaaatat gtgtcaatta
2521 ctcagtttgg taatattcag caaaatatca ttgatggaaa agaccccttg catggaattg
2581 taattgaaga acaaatgtaa ataatgtaat catcattttc tgttttattt ctggtacaat
2641 aaagtcttac aatgcattca gcctcacata tcatttgaga cagggagaac agtctggttc
2701 tgacaaaatt tatcaacata tctattaagg tcagggtccc ctgggagtcc ttctgttccc
2761 tcaaatattc tgacttcttc cacttgtcct gagacccctt ccattggttg tccctgaatt
2821 tggggcataa gaccagagaa gaagctattt agaagagagc tgacaggata aggattttta
2881 acaatccttt tcctgagggt cacattgaaa tatctgggaa gccccctcca actttgggtt
2941 tcagaatagt tggtatgaac tccagcaata tcagcacaag acagaaacag tttgtcccct
3001 ttacaaagag gcccaactcc attttcatcc agcagcacag ttgtgacaga attagtgaac
3061 tgcataactg gtggggtggt ggctccccct gtaaaactcc cataatatct agtattttca
3121 tttttagagg ggtcagggca ccacacctcc actgggtact ttccatcttt atccagcaag
3181 gctttggcct ttggatctag gccttggttt cctggtttca tatttttaat agcaactaca
3241 tcatcaggat aggtagctgt agagctagca actaggcctt ggagttccag gggctctcct
3301 ccaacagcaa acatgtgata ggtagtgccc tgcacgggga cacaccctga ggatgaccca
3361 tagatgtact ttcctccctg gtgcaaatta actagtgagg aaattccaac aacttcagtc
3421 tttacagaca ctgcttccca catcaaaatg gtgtcacagg tcatgtcttc atttaggagg
3481 gggagtttaa taacagctac tgaataacaa ggaagggtgc ctttgttggg ggtgtcagag
3541 gccttactga aagcagtatt tatagaatta ctatatccat acaagtcctc agaaggaata
3601 ttatttccca ttctaggatt aagataggcc tcaatttggg taatagcatc aggccctgtt
3661 cttacttcta gcacctctac tcctcctttt actaggagcc tggggacggg agcgggaata
3721 gggcatgttt ttttgcatgc tccgtcttgc ctttttcttt gaggggccat cttcttcttt
3781 ctccaattta ttaagctcca cttcccaggt gggagttata tcaccatata aacctagaat
3841 tagaggaagc atccagtctt gagttactct ttggtgtgcc cctcctggag cagtataatg
3901 ttctacatac tgaccagacc taggttcatc ataacccagt tctcttcttg ccctttgacc
3961 ttccctattt tcttgatatt caaaatcagc tctacttgga ggaggctctc ctctgtttct
4021 atattcttgt cttagctgga tgggatttct agcaggtaga tacctgtaat aatcttggac
4081 actggaataa atatggacag gcccactggt caaagcccac ctagcatttt cagcaatttg
4141 agccaaggtg tgactcagtt cattggtact tctgacagcc acagcccttg tagcttgacc
4201 aatttgcaaa gtggcctgtc tcatcaaatg cctccatact tctctgccca cagcatgaaa
4261 caatgattca ccccagtcaa gtacagcatt caggtagtag ctaaaagagg taaatcccgg
4321 gaacaaataa tcaacttgag gaaaccaagg cacaagagcc atattaacaa ctggtacttc
4381 tttggagtat ccaaaagttg tcactcctgc agcaaccaca gcactggcac ctgaaacagt
4441 ttgaaaaaaa actcctattc ctatggcatt gttgagagct gttgggatag cacttaggag
4501 ggaaaactgc tctgttgtaa gtccagtaag agacaaggcc tctagagtac taagtccagc
4561 aagatccact gcttctattt caatgagcca ggctgcctca gtacttacag cagcaaaagc
4621 ctccccagta aggatagcat caactgtaaa tccagtactt aagcttaatt cagcagcaat
4681 ttcagaaata ttaaacaaaa gagataatac accccccatt tcttaccaaa tggcgggcta
4741 atttaaaaaa ggcgggcttc ttggcggcgc tgatgtaaat gagtaccaaa tggcgggcta
4801 atttaaaaaa ggcgggcttc ttggcggcgc tgatgtaaat gagtaacttc ctctacttga
4861 ggttgctaag taggttgcta agcgccacct agcaactaga ccgcagaaca gttgtttgtc
4921 acttatcagg aaatgtcaca aaaagtcccc gggcggtgcg gtgagcgagt ctaaccacag
4981 cttcctctat cagttgattc tgcaaaaaca acctgttatt gaagtctgca agtctgcaaa
5041 atcactatgg caaccctagt tttttttacc tggtataaga ggccagggg
//
LOCUS PVYAAA 9704 bp ss-RNA VRL 15-MAR-1990
DEFINITION Potato virus Y (N strain) genomic RNA, complete.
ACCESSION D00441
KEYWORDS 38K protein; HC protein; NIa protein; NIb protein; Vpg protein;
capsid protein; genome-linked protein; helper component protein;
inclusion protein; polymerase; polyprotein; protease.
SOURCE Potato virus Y (N strain), 5'end of genomic RNA and cDNA to genomic
RNA.
ORGANISM Potato virus Y
Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses;
Potyvirus.
REFERENCE 1 (bases 1 to 9704)
AUTHORS Robaglia,C., Durand-Tardif,M., Tronchet,M., Boudazin,G.,
Astier-Manifacier,S. and Casse-Delbart,F.
TITLE Nucleotide sequence of potato virus Y (N strain) Genomic RNA
JOURNAL J. Gen. Virol. 70, 935-947 (1989)
STANDARD full staff_entry
COMMENT Most of the sequence was obtained from a shotgun cloning procedure.
The 150 nucleotides at the 5'end were directly sequenced on the
virul RNA.
FEATURES from to/span description
virion 1 9704 genomic RNA
pept 185 9376 polyproteins
matp 185 1009 putative extreme 5'protein
matp 1010 2656 putative helper component protein, HC
matp 2657 3655 putative 38K protein
matp 3656 5557 cytoplasmic inclusion protein
matp 5558 5713 putative genome linked protein, Vpg
matp 5714 7009 putative nuclear inclusion protein NIa
matp 7010 8572 putative nuclear inclusion protein NIb
matp 8573 9376 putative capsid protein
BASE COUNT 3004 a 1818 c 2273 g 2609 t
ORIGIN putative 5'end of RNA genome.
1 aattaaaaca actcaataca acataagaaa aacaacgcaa aaacactcat aaacgctcat
61 tctcactcaa gcaacttgct aagtttcagt ttaaatcatt tccttgcaat tctctagaac
121 aatattggaa accatttcaa ctcaacaagc aatttcatca cttccaacca atttcagatc
181 ctcaatggca acttacatgt caacaatctg ttttggttcg tttgaatgca agctaccata
241 ctcaccagcc tcttgcgagc atattgtgaa ggaacgagaa gtgccggctt ccgttgatcc
301 tttcgcagat ctggaaacac aacttagtgc acgattgctc aagcaaaaat atgctactgt
361 tcgtgtgctc aaaaacggta cttttacgta ccgatacaag actgatgccc agataatgcg
421 cattcagaag aaactggaga ggaaggatag ggaagaatat cacttccaaa tggccgctcc
481 tagtattgtg tcaaaaatta ctatagctgg cggagatcct ccatcaaagt ctgagccaca
541 agcaccaaga gggatcattc atacaactcc aaggatgcgt aaagtcaaga cacgccccat
601 aataaagttg acagaaggcc agatgaatca cctcattaag cagataaaac agattatgtc
661 ggagaaaaga gggtctgtcc acttaattag taagaaaacc actcatgttc aatataagaa
721 gatacttggt gcatactccg cagcggttcg aactgcacat atgatgggtt tgcgacggag
781 agtggactcc gatgtgatat gtggacagtt ggacttttgc aacgtctcgc tcggacggac
841 aaatggttcc aatcaagtcc gcactatcaa catacgaagg ggtgatagtg gagtcatctt
901 gaacacaaaa agcctcaaag gccactttgg tagaagttca ggaggcttgt tcatagtgcg
961 tggatcacac gaagggaaat tgtatgatgc acgttctaga gttactcaga gtattttaaa
1021 ctcaatgatc cagttttcga atgccgacaa tttttggaag ggtctggacg gtaattgggc
1081 acgaatgaga tatccttcgg atcacacatg tgtagctggt ttacctgtcg aagattgtgg
1141 tagggtagct gcattgatgg cacacagtat ccttccgtgc tataagataa cttgccccac
1201 ctgtgctcaa cagtatgcca gcttgccagt tagcgatctg tttaagctat tgcataaaca
1261 tgcaagagat ggtttgaatc gattgggagc ggataaagac cggtttatac atgttaataa
1321 gttcttgata gcgttagagc atctaactga accggtggac ctgaatctcg agcttttcaa
1381 tgagatattt aaatccatag gggagaaaca gcaagcaccg ttcaagaatt taaatgtctt
1441 aaataatttc ttcctgaaag gaaaagaaaa tacagctcat gaatggcagg tagctcaatt
1501 gagtttgctc gaattagcaa ggttccagaa gaacagaact gataacatca agaaaggtga
1561 tatatctttc ttcagaaata aattatctgc caaggcaaac tggaatctgt atttgtcgtg
1621 cgacaaccag ctggataaaa atgcaaactt cctctgggga caaagggagt atcatgctaa
1681 gcggtttttc tcaaacttct ttgaggaaat tgatccagca aagggatact cagcatatga
1741 aatccgcaag catccaagtg gaacaaggaa gctctcaatt ggtaacttag ttgtcccact
1801 tgatttagct gagtttaggc agaagatgaa aggtgactat aggaaacaac caggggtcag
1861 caaaaagtgc acgagttcga aagatggtaa ttatgtgtat ccctgttgtt gcacaacact
1921 tgatgatggt tcagccattg aatcaacatt ctatccacca actaaaaagc accttgtaat
1981 tggcaatagt ggtgaccaaa aatttgttga tttaccaaaa ggggattcgg agatgttata
2041 cattgccaag cagggttatt gttatattaa cgtgtttctt gcaatgctga ttaacattag
2101 cgaggaggat gcaaaggatt tcacaaagaa agttcgcgac atgtgtgtgc caaagcttgg
2161 aacctggcca actatgatgg atttggcgac cacttgtgct caaatgagaa tattctatcc
2221 tgacgtacat gatgcagaat tgcccagaat attggttgac catgacactc aaacgtgtca
2281 tgtggttgac tcatttggct cgcagacaac tggatatcat attctaaaag catccagcgt
2341 gtctcaactt atcttgtttg caaatgatga attagaatct gatataaaac attatagagt
2401 tggtggtgtt cctaatgcta gccctgaact tgggtccaca atatcacctt tcagagaagg
2461 aggagttata atgtctgagt cggcagcgct gaaactgctt ttgaagggaa tttttagacc
2521 taaggtgatg agacagttgc tgttagatga gccttacctg ttgattctat caatactatc
2581 ccctggcata ctgatggcta tgtataataa tgggattttt gaacttgcgg tgaggttgtg
2641 gattaatgag aaacaatcca tagctatgat agcatcgcta ctatcagctt tagccctacg
2701 agtgtcagcg gcagaaacac tcgtcgcaca gaggattata attgatgctg cagctacaga
2761 cctccttgat gctacgtgtg atgggttcaa cctacatcta acgtacccca ctgcattgat
2821 ggtgttgcaa gttgttaaga atagaaatga atgtgatgat accctattca aggcgggttt
2881 tccaagttac aacacgagcg tcgtacagat tatggaaaaa aattatctaa atctcttgaa
2941 cgatgcttgg aaagatttaa cttggcgaga aaattatccg caacatggta ctcatacaga
3001 gcaaaacgct ctatccactc ggtacataaa acccacagaa aaggcagatt tgaaagggtt
3061 atacaacata tcaccacaag cgttcttggg ccgaagcgcc caggtggtca aaggcactgc
3121 ctcaggattg agcgagcgat ttaataatta tttcaatact aagtgtgtaa atatttcatc
3181 ctttttcatt cgtagaatct ttaggcgttt gccaaccttt gtcacttttg ttaactcatt
3241 attagttatt agtatgttaa ccagcgtagt ggcagtgtgt caggcaataa ttttagatca
3301 gaggaagtat aggagagaaa tcgagttgat gcagatagag aagaatgaga ttgtctgcat
3361 ggagctatat gcaagtttac agcgcaaact tgaacgcgat ttcacatggg atgagtacat
3421 tgagtatttg aagtcagtaa accctcagat agttcagttt gctcaagcgc agatggaaga
3481 atatgatgtg cgacaccagc gttccacacc agttgttaaa aatttggaac aagtggtagc
3541 atttatggct ttagtcatca tggtgtttga tgctgaaagg agtgattgcg tgttcaaaac
3601 tctcaataaa tttaagggtg tcctttcctc actggattat gaagttagac atcagtcctt
3661 agacgatgtg atcaagaatt ttgatgagag gaatgagatt attgattttg aattgagtga
3721 ggacacaatt cgaacttcat cagtgctaga tacaaagttt agtgattggt gggatcgaca
3781 aatccagatg ggacatacac ttccacatta cagaactgag gggcacttca tggaatttac
3841 aagagcaact gctgttcaag tggctaatga cattgcccat agcgaacacc tagacttttt
3901 agtacgggga gctgttgggt ctggaaagtc aactgggttg cctgttcatc ttagtgtggc
3961 cggatctgtg cttttaattg aaccaacgcg accactagcg gagaacgttt tcaaacagct
4021 atctagtgaa ccattcttca agaagccaac actgcgtatg cgtggaaata gtatatttgg
4081 ctcttctcca atctccgtca tgactagcgg atttgcgcta cactacttcg ccaataatcg
4141 ctctcaatta gctcagttca actttgtaat atttgatgag tgtcatgttc tggatccttc
4201 cgcgatggcg ttccgcagtc tgctgagtgt ttatcatcaa gcatgcaaag tattaaaagt
4261 gtcagctact ccagtgggaa gagaggttga attcacaaca cagcagccag tcaagttaat
4321 agtggaggac acagtgtctt tccaatcatt tgttgatgca caaggttcta aaactaatgc
4381 tgatgttgtt cagtttggtt caaacgtact tgtgtacgtg tcgagctaca atgaagttga
4441 caccttggcc aagctcctaa cagacaagaa tatgatggtc acaaaggttg atggcagaac
4501 aatgaagcac ggttgcctag aaattgtcac aaaaggaacc agtgcgagac cacattttgt
4561 tgtagcaacc aacataattg agaatggagt gactttggac atagacgtgg ttgtggactt
4621 tgggttgaaa gtctcaccgt tcttggacat tgacaatagg agcattgctt acaataaggt
4681 gagtgttagc tatggtgaga gaattcaaag gctgggtcgt gttggacgct tcaagaaagg
4741 agtagcattg cgcattggac acactgaaaa gggaattatt gaaattccaa gcatggtcgc
4801 tactgaggcg gctcttgctt gctttgcata taacttgcca gtgatgacag gaggcgtttc
4861 aactagtctg attggcaatt gtactgtgcg ccaggttaaa acaatgcagc aatttgaatt
4921 gagtcccttc tttatccaga atttcgttgc tcatgatgga tcaatgcatc ctgtcataca
4981 tgacattctt aaaaagtata aacttcgaga ttgtatgacg cctttgtgcg atcagtctat
5041 accatacagg gcatcgagta cttggttatc ggttagtgaa tatgagcgac ttggagtggc
5101 cttagaaatt ccaaagcaag tcaaaattgc attccatatc aaagagatcc ctcctaagct
5161 ccacgaaatg ctttgggaaa cggttgtcaa gtacaaagac gtttgcttat ttccaagcat
5221 tcgagcatcg tccatcagca aaatcgcata cacattgcgt acagatctct tcgccatccc
5281 aagaactcta atattggtgg agagattgct tgaagaggag cgagtgaagc agagccaatt
5341 cagaagtctc atcgatgaag ggtgctcaag catgttttca attgttaact taaccaacac
5401 tctcagagct agatatgcaa aagattacac cgcagagaac atacaaaaac ttgagaaggt
5461 gagaagtcaa ctaaaagaat tctcaaattt ggatggttct gcatgtgagg agaatttaat
5521 aaagaggtat gagtcgttgc agttcgttca tcaccaagct gcgacgtcac ttgcaaagga
5581 tctcaagttg aaggggattt ggaacaagtc attagtggct aaagacttga tcatagcagg
5641 cgctgttgca attggtggaa taggactcat atatagttgg ttcacacaat cagttgagac
5701 tgtgtctcat caagggaaaa ataaatccaa aagaatccaa gccttgaagt ttcgccatgc
5761 tcgtgacaaa agggctggct ttgaaattga caacaatgat gacacaatag aggaattctt
5821 cggatctgca tacaggaaaa agggaaaagg taaaggtacc acagttggta tgggtaagtc
5881 aagcaggagg ttcatcaaca tgtatgggtt tgatccaaca gagtactcat tcatccaatt
5941 cgttgatcca ctcactgggc ggcaaataga agaaaatgtc tatgctgaca ttagagatat
6001 tcaagagaga tttagtgaag tgcgaaagaa aatggttgag aatgatgaca ttgaaatgca
6061 agccttgggt agtaacacga ccatacatgc atacttcagg aaagattggt gtgataaagc
6121 tttgaagatt gatttaatgc cacataaccc actcaaagtt tgtgacaaaa caaatggcat
6181 tgccaaattt cctgagagag agctcgaact aaggcagact gggccagctg tagaagtcga
6241 tgtgaaggac ataccagcac aggaggtgga gcatgaagct aaatcgctca tgagaggctt
6301 gagagacttc aacccaattg cccaaacagt ttgtaggctg aaagtatctg ttgaatatgg
6361 ggcatcagag atgtacggtt ttggatttgg agcatacata gtagcgaacc accatttatt
6421 taggagttac aatggttcca tggaggtgca atccatgcac ggtacattca gggtgaagaa
6481 tctacacagt ttgagcgttc tgccaattaa aggtagggac atcatcctca tcaaaatgcc
6541 gaaagatttc cctgtctttc cacagaaatt gcatttccga gctcctacac agaatgaaag
6601 aatttgttta gttggaacca acttccaaga gaagtatgct tcgtcgatca tcacagaaac
6661 aagcactact tacaatatac caggcagcac attctggaag cattggattg aaacagataa
6721 tggacattgt ggactaccag tggtgagcac cgccgatgga tgtatagtcg gaattcacag
6781 tctggcaaac aatgcacaca ccacgaacta ctactcagcc ttcgatgaag attttgaaag
6841 caagtacctc cgaaccaatg agcacaatga atgggtcaag tcttgggttt ataatccaga
6901 cacagtgttg tggggcccgt tgaaacttaa agacagcact cccaaagggt tattcaaaac
6961 aacaaagctt gtgcaagatc taatcgatca tgatgtagtg gtggagcaag ctaagcattc
7021 tgcatggatg tttgaagcct tgacaggaaa tttgcaagct gtcgcaacaa tgaagagcca
7081 attagtaacc aagcatgtag ttaaaggaga gtgtcgacac ttcacagaat ttctgactgt
7141 ggatgcagag gcagaggcag aggcattctt caggcctttg atggatgcgt atgggaaaag
7201 cttgctaaat agagatgcgt acatcaagga cataatgaag tattcaaaac ctatagatgt
7261 tggtgtcgtg gatcggatgc atttgaggaa gccatcaata gggttatcat ctacctgcaa
7321 tgtgcacggc ttcaagaagt gtgcatatgt cactgatgag caagaaattt tcaaagcgct
7381 caacatgaaa gctgcagtcg gagccagtta tgggtgcaaa aagaaagact attttgagca
7441 tttcactgat gcagataagg aagaaatagt catgcaaagc tgtctgcgat tgtataaagg
7501 tttgcttggc atttggaacg gatcattgaa ggcagagctc cggtgtaagg agaagatact
7561 tgcaaataag acgaggacgt tcactgctgc acctctagac actttgctgg gtggtaaagt
7621 gtgtgttgat gacttcaata atcaatttta ttcaaagaat attgaatgct gttggacagt
7681 tgggatgact aagttttatg gtggttggga taaactgctt cggcgtttac ctgagaattg
7741 ggtatactgt gatgctgatg gctcacagtt tgatagttca ctaactccat acctaatcaa
7801 tgctgttctc accatcagaa gcacatacat ggaagactgg gatgtggggt tgcagatgct
7861 gcgcaattta tacactgaga ttgtttacac accaatttca actccagatg gaacaattgt
7921 caagaagttt agaggtaata atagtggtca accttctacc gttgtggata attctctcat
7981 ggttgtcctt gctatgcatt acgctctcat taaggagtgc gttgagtttg aagaaatcga
8041 cagcacgtgt gtattctttg ttaatggtga tgacttattg attgctgtga atccggagaa
8101 agagagcatt ctcgatagaa tgtcacaaca tttctcagat cttggtttga actatgattt
8161 ttcgtcgaga acaagaagga aggaggaatt gtggttcatg tcccatagag gcctgctaat
8221 cgagggtatg tacgtgccaa agcttgaaga agagagaatt gtatccattc tgcaatggga
8281 tagagctgat ctgccagagc acagattaga agcgatttgc gcagctatga tagagtcctg
8341 gggttattct gaactaacac accaaatcag gagattctac tcatggttat tgcaacagca
8401 accttttgca acaatagcgc aggaagggaa ggctccttat atagcaagca tggcactaag
8461 gaaactgtat atggataggg ctgtggatga ggaagagcta agagccttca ctgaaatgat
8521 ggtcgcatta gatgatgagt ttgagcttga ctcttatgaa gtacaccatc aagcaaatga
8581 cacaattgat gcaggaggaa gcaacaagaa agatgcaaaa ccagagcagg gcagcatcca
8641 gccaaacccg aacaaaggaa aggataagga tgttaatgca ggcacatctg ggacacatac
8701 tgtgccgaga atcaaggcta tcacgtccaa aatgagaatg cccacaagca agggagcaac
8761 cgtgccaaac ttagaacatt tgcttgagta tgctccacaa caaattgata tttcaaatac
8821 tcgggcaact caatcacagt ttgatacgtg gtatgaggca gtgcggatgg catacgacat
8881 aggagaaact gagatgccaa ctgtgatgaa tgggcttatg gtttggtgca ttgaaaatgg
8941 aacctcgcca aatgtcaacg gagtttgggt tatgatggat gggaatgaac aagttgagta
9001 cccgttgaaa ccaatcgttg agaatgcaaa accaaccctt aggcaaatca tggcacattt
9061 ctcagatgtt gcagaagcgt atatagaaat gcgcaacaaa aaggaaccat atatgccacg
9121 atatggttta attcgaaatc tgcgggatat gggtttagcg cgttatgcct ttgactttta
9181 tgaggtcaca tcacgaacac cagtgagggc tagggaagcg cacattcaaa tgaaggccgc
9241 agcattgaaa tcagcccaac ctcgactttt cgggttggac ggtggcatca gtacacaaga
9301 ggagaacaca gagaggcaca ccaccgagga tgtctctcca agtatgcata ctctacttgg
9361 agtcaagaac atgtgatgta gtgtctctcc ggacgatata taagtattta catatgcagt
9421 aagtattttg gcttttcctg tactactttt atcataatta ataatcgttt gaatattact
9481 ggcagatagg ggtggtatag cgattccgtc gttgttagtg accttagctg tcggttctgt
9541 attattaagt cttagataaa aagtgccggg ttgttgttgt gtgactgatc tatcgattag
9601 gtgatgctgt gattctgtca tagcagtgac tatgtctgga tttagttact tgggtgatgc
9661 tgtgattctg tcatagcagt gactgtaaac ttcaatcagg agac
//
LOCUS ROBTRFVP2 2687 bp ss-RNA VRL 15-MAR-1990
DEFINITION Bovine rotavirus mRNA for RNA binding protein VP2.
ACCESSION X14057 X14507
KEYWORDS RNA binding protein.
SOURCE Bovine rotavirus.
ORGANISM Bovine rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 2687; enum. 1 to 2687)
AUTHORS Cohen,J.
TITLE ;
JOURNAL Unpublished (1989) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 2687; enum. 1 to 2687)
AUTHORS Kumar,A., Charpilienne,A. and Cohen,J.
TITLE Nucleotide sequence of the gene encoding for the RNA binding
protein (VP2) of RF bovine rotavirus
JOURNAL Nucleic Acids Res. 17, 2126-2126 (1989)
STANDARD simple automatic
COMMENT *source: strain=RF;
Data kindly reviewed (21-APR-1989) by Cohen J.
[1] Author address Cohen J., INRA, Station de Virologie et
d'Immunologie Moleculaires , INRA, CRJ, Domaine de Vilvert, 78350
Jouy en Josas, France.
Submitted (16-JAN-1989) on tape to the EMBL data library
FEATURES from to/span description
pept 17 2659 VP2 protein (AA 1-880)
site 1622 1637 leucine zipper (AA 536-557)
site 2009 2074 leucine zipper (AA 665-686)
BASE COUNT 997 a 425 c 520 g 745 t
ORIGIN
1 ggctattaaa ggttcaatgg cgtacaggaa acgtggagcg cgccgtgagg cgaatataaa
61 taataatgac cgaatgcaag agaaagatga cgagaaacaa gatcaaaaca atagaatgca
121 gttgtctgat aaagtacttt caaagaaaga ggaagtcgta accgacagtc aagaagaaat
181 taaaattgct gatgaagtga agaaatcgac gaaagaagaa tctaaacaat tgcttgaagt
241 tttgaaaaca aaagaagagc accaaaaaga gatacaatat gaaattttgc aaaaaacgat
301 accaacattt gaaccaaaag agtcaatatt gaaaaaattg gaggatatca aaccggaaca
361 agcgaagaag cagactaagc tatttagaat atttgaaccg agacagctac caatttatag
421 agcgaatggt gaaaaagagt tgcgtaacag atggtattgg aagctgaaga aagatacttt
481 accagatgga gattatgatg ttagagaata ctttctaaat ttgtatgatc aggttcttac
541 tgaaatgcca gattatttac tattaaaaga tatggcagtt gaaaataaaa attcgagaga
601 tgccggtaaa gttgttgatt ctgaaacagc aagtatctgt gatgctatat ttcaagatga
661 ggaaacagaa ggtgcagtga gacgattcat tgcggagatg agacagcgcg tacaagctga
721 cagaaacgtt gtcaattacc catcaatatt gcatccaata gattacgctt ttaatgagta
781 ttttttgcaa caccaattag ttgaaccatt gaataatgat ataatattca attacattcc
841 tgaaaggata aggaatgacg ttaactatat acttaatatg gacagaaatc tgccatcaac
901 agctagatat ataagaccta atttactaca agacagactg aatttgcatg acaattttga
961 atccttgtgg gatacaataa caacttcaaa ctatattctg gcaagatcgg tagtaccaga
1021 tttaaaggaa ttagtttcaa ccgaagcgca aattcaaaaa atgtcacaag acttgcaact
1081 agaagcatta acaatacagt cagaaacgca gtttttaaca ggtataaact cacaagcagc
1141 aaatgactgt ttcaaaactc tgattgcagc aatgttaagt caacgaacca tgtcgcttga
1201 tttcgtgact acaaattata tgtcattaat ttcaggcatg tggttactaa ctgtagtgcc
1261 aaatgacatg ttcataaggg aatcattggt tgcatgtcaa ctggctatag tgaatacaat
1321 aatatatcca gcgttcggaa tgcaacgaat gcattataga aacggagacc cacaaagacc
1381 atttcagata gcagaacaac aaatacaaaa ttttcaagta gcgaattggc tgcattttgt
1441 caataacaat caatttagac aagtagttat tgatggtgta ttgaatcagg tgctgaatga
1501 caatattaga aatggacatg tcattaatca attgatggaa gctttaatgc aactatcacg
1561 acaacagttt ccaacaatgc ctgttgatta taagaggtca atccagcgtg gaatattatt
1621 gctatcaaat aggcttggtc aattagttga tttaactagg ttattagctt acaactacga
1681 aacactaatg gcatgtgtta cgatgaatat gcaacatgtt cagactttga caacagaaaa
1741 attacagtta acttcagtca catcgttgtg tatgcttatt ggaaatgcaa ccgttatacc
1801 cagcccgcag acattgtttc actattataa tgttaatgtt aattttcatt caaattataa
1861 tgaaagaatt aatgatgcag tggccataat aactggagct aatagactaa atttatatca
1921 gaaaaagatg aaggcaatag ttgaagattt tttaaaaaga ttacatattt tcgatgtagc
1981 tagagttcca gatgatcaaa tgtatagatt aagggataga ctacgactat tgccagtaga
2041 agtaagacga ttggatattt ttaatttgat actgatgaac atggatcaga tagaacgcgc
2101 atcagataaa attgcgcaag gtgttattat tgcgtaccgc gatatgcaat tggaaagaga
2161 cgaaatgtat ggctacgtga atatagctag aaatttagat gggttccagc aaataaacct
2221 agaagaattg atgagaacag gcgattatgc acaaataact aacatgctct tgaataatca
2281 accagtagcg ctagttggag ctcttccatt tgttacagac tcgtcagtca tatcgttgat
2341 agcgaacgtt gacgctacag tttttgccca aatagttaaa ttacggaaag ttgatacctt
2401 gaaaccaata ttgtataaaa taaattcaga ttcgaatgac ttttacctag ttgccaacta
2461 tgattgggtg cctacttcaa ccacaaaagt atataagcaa gttccacagc aatttgattt
2521 cagaaattcg atgcatatgt taacatcaaa tcttactttc actgtttact ctgatctgct
2581 tgcattcgta tcggccgata cagtagaacc tataaatgca gttgcatttg ataatatgcg
2641 catcatgaac gagttgtaaa cgccaacccc actgtggaga tatgacc
//
LOCUS VACH3K 4536 bp ds-DNA VRL 15-DEC-1989
DEFINITION Vaccinia virus, HindIII K fragment.
ACCESSION D00382
KEYWORDS nonessential gene; serine protease inhibitors.
SOURCE Vaccinia virus HindIII K fragment originally from vaccinia virus
strain WR, cloned in pBR322, was a gift from R. Wittek, transformed
into Escherichia coli strain TG1.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 4536)
AUTHORS Boursnell,M.E.G., Foulds,I.J., Campbell,J.I. and Binns,M.M.
TITLE Non-essential genes in the vaccinia virus HindIII K fragment: a
gene related to serine protease inhibitors and a gene related to
the 37K vaccinia virus major envelope antigen
JOURNAL J. Gen. Virol. 69, 2995-3003 (1988)
STANDARD full staff_entry
COMMENT One gene, predicted to encode a 42.2K protein, is highly related to
the family of serine protease inhibitors. It shows approximately
25% identity to human antithrombin III and 19% identity to the
cowpox virus 38K protein gene which is also related to serine
protease inhibitors. The product of another gene shows a similar
high level of identity to the 37K vaccinia virus major envelope
antigen. The existance of viable deletion mutants and recombinants
containing foreign DNA inserted into both these genes indicates
that they are non-essential.
FEATURES from to/span description
ORF 50 < 1 (c) ORF KO, amino end.
ORF 1381 272 (c) ORF K1
ORF 1697 1431 (c) ORF K2
ORF 3023 1749 (c) ORF K3
ORF 3185 3051 (c) ORF K4
ORF 3604 3200 (c) ORF K5
ORF 3818 3573 (c) ORF K6
ORF 3957 4406 ORF K7
ORF 4235 4041 (c) ORF8
BASE COUNT 1454 a 789 c 712 g 1581 t
ORIGIN 1 bp upstream of HindIII site
1 aagcttttca gctgcttaga cttccaagta ttaattcgtg acagatccat gtctgaaacg
61 agacgctaat tagtgtatat tttttcattt tttataattt tgtcatattg caccagaatt
121 aataatatct ctaatagatc tgattagtag atacatggct atcgcaaaac aacatataca
181 catttaataa aaataatatt tattaagaaa attcagattt cacgtaccca tcaatataaa
241 taaaataatg attccttaca ccgtacccat attaaggaga ttccacctta cccataaaca
301 atataaatcc agtaatatca tgtctgatga tgaacacaaa tggtgtatta aattccagtt
361 tttcaggaga tgatctcgcc gtagctacca taatagtaga tgcctctgct acagttcctt
421 gttcgtcgac atctatcttt gcattctgaa acattttata aatatataat gggtccctag
481 tcatatgttt aaacgacgca ttatctggat taaacatact aggagccatc atttcggcta
541 tcgacttaat atccctctta ttttcgatag aaaatttagg gagtttaaga ttgtacactt
601 tattccctaa ttgaaacgac caatagtcta attttgcagc cgtaatagaa tctgtgaaat
661 gggtcatatt atcacctatt gccaggtaca tactaatatt agcatcctta tacggaaggc
721 gtaccatatc atattcttcg tcatcgattg tgattgtatt tccttgcaat ttagtaacta
781 cgttcatcat gggaaccgtt ttcgtaccgt acttattagt aaaactagca ttgcgtgttt
841 tagtgatatc aaacggatat tgccatatac ctttaaaata tatagtatta atgattgccc
901 atagagtatt attgtcgagc atattagaat ctactacatt agacataccg gatctacgtt
961 ctactataga attaatttta ttaaccgcat ctcgtctaaa gtttaatcta tataggccga
1021 atctatgata ttgttgataa tacgacggtt taatgcacac agtattatct acgaaacttt
1081 gataagttag atcagtgtac gtatatttag atgttttcag cttagctaat cctgatatta
1141 attctgtaaa tgctggaccc agatctcttt ttctcaaatc catagtcttc aataattcta
1201 ttctagtatt acctgatgca ggcaatagcg acataaacat agaaaacgaa taaccaaacg
1261 gtgagaagac aatattatca tcttgaatat ttttatacgc tactataccg gcattggtaa
1321 atccttgcag acgataggta gacactgaac acgttaacga tagtatcaat aacgcaatca
1381 tgattttatg gtattaataa ttaaccttat ttttatgttc ggtataaaaa ttattgatgt
1441 ctacacatcc ttttgtaatt gacatctata tatccttttg tataatcaac tctaatcact
1501 ttaactttta cagttttccc taccagttta tccctatatt caacatatct atccatatgc
1561 atcttaacac tctctgccaa gatagcttca aagtgaggat agtcaaaaag ataaatatat
1621 agagcataat ccttctcgta tactctgccc tttattacat cacccgcatt gggcaacgaa
1681 taacaaaatg caagcatctt gttaacgggc tcgtaaattg ggataaaaat tatgttttta
1741 tatctatttt attcaagaga atattcagga atttcttttt ccggttgtat ctcatcgcag
1801 tatatatcat ttgtacattg tttcatattt tttaatagtc tacacctttt agtaggacta
1861 gtatcgtaca attcatagct gtattttgaa ttccaatcac gcataaaaat atcttccaat
1921 tgttgacgaa gacctaatcc atcatccggt gtaatattaa tagatgctcc acatgtatcc
1981 gtaaagtaat ttcctgtcca atttgaggta cctatatacg ccgttttatc ggttaccata
2041 tatttggcat ggtttaccct agaatacgga atgggaggat cagcatctgg tacaataaat
2101 agctttactt ctatatttat gtttttagat tttagcatag cgatagatct taaaaagttt
2161 ctcatgataa acgaagatcg ttgccagcaa ctaatcaata gcttaactga cacttgtctg
2221 tctatagcgg ctcttcttaa ttcatcttct atataaggcc aaaacaaaat attgcctgcc
2281 ttcgaataaa taatagggat aaagttcata acagatacat aaacgaattt actcgcattt
2341 ctgatacatg acaataaagc ggttaaatca ttggttcttt ccatagtaca tagttgttgc
2401 ggtgcagaag caataaatac agagtgtgga acgccgctta cgttaatact aagaggatga
2461 tctgtattat aatacgacgg ataaaagttt ttccaattat atggtagatt gttaactcca
2521 agataccagt atacctcaaa aatttgagtg agatccgctg ccaagttcct attattgaag
2581 atcgcaatac ccaattcttt gacctgagtt agtgatctcc aatccatgtt agcgcttcct
2641 aaataaatat gtgtattatc agatatccaa aattttgtat gaagaactcc tcctaggata
2701 tttgtaatat ctatgtatcg tacttcaact ccggccattt gtagtctttc aacatccttt
2761 aatggtttgt tagatttatt gacggctact ctaactcgta ctcctctttt gggtaattgt
2821 acaatcttgt ttaatattat cgtgccgaaa ttcgtaccca cttcatccga taaactccaa
2881 taaaaagatg atatatctag tgtttttgtg gtattggata gaatttccct ccacatgtta
2941 aatgtagaca aatatacttt atcaaattgc atacctatag gaatagtctc tgtaatcact
3001 gcgattgtat tatccggatt cattttattt gttaaaagaa taatcctata tcacttcact
3061 ctattaaaaa tccaagtttc tatttctttc atgactgatt ttttaacttc atccgtttcc
3121 ttatgaagat gatgtttggc accttcataa atttttattt ctctattaca atttgcatgt
3181 tgcatgaaat aatatgcacc taaaacatcg ctaatcttat tgtttgttcc ctggagtatg
3241 agagtcgggg ggtgttaatc ttggaaatta tttttctaac cttgttggta gccttcaaga
3301 cctgactagc aaatccagcc ttaatttttt catgattgat taatgggtcg tattggtatt
3361 tataaacttt atccatatct ctagatactg attctggaca tagctttccg actggcgcat
3421 ttagtgtgat ggttcccata agtttggcag ctagcagatt cagttttgaa acagcatctg
3481 cattaactag aggagacatt agaatcattg ctgtaaacaa gtttggatta tcgtaagagg
3541 ctagctccca tggaatgacc caataagtag atttaatagt taccacgtgc tgtaccaaag
3601 tcatcaatca tcattttttc accattactt cttccatgtc caatatgatc atgtgagaat
3661 actaaaattc ctaacgatga tatgttttca gctagttcgt cataacgtcc agaatgttta
3721 ccagctccat gacttatgaa tactaatgcc ttaggatatg taataggttt ccaatatatg
3781 taatcattgt ccagattgaa catacagttt gcactcatga ttcacgttat ataactatca
3841 atattaacag ttcgtttgat gatcatatta tttttatgtt ttattgataa ttgtaaaaac
3901 atacaattaa atcaatatag aggaaggaga cggctactgt cttttgtgag atagtcatgg
3961 cgactaaatt agattatgag gatgctgttt tttactttgt ggatgatgat aaaatatgta
4021 gtcgcgactc catcatcgat ctaatagatg aatatattac gtggagaaat catgttatag
4081 tgtttaacaa agatattacc agttgtggaa gactgtacaa ggaattgatg aagttcgatg
4141 atgtcgctat acggtactat ggtattgata aaattaatga gattgtcgaa gctatgagcg
4201 aaggagacca ctacatcaat tttacaaaag tccatgatca ggaaagttta ttcgctacca
4261 taggaatatg tgctaaaatc actgaacatt ggggatacaa aaagatttca gaatctagat
4321 tccaatcatt gggaaacatt acagatctga tgaccgacga taatataaac atcttgatac
4381 tttttctaga aaaaaaattg aattgatgat ataggggtct tcataacgca taattattac
4441 gttagcattc tatatccgtg ttaaaaaaaa ttatcctatc atgtatttga gagttttata
4501 tgtagcaaac atgatagctg tgatgccaat aagctt
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS FPLTRAC 3227 bp ds-DNA BCT 05-FEB-1990
DEFINITION Plasmid F TraC protein (traC) gene, complete cds and TrbI protein
(traI) gene, partial cds.
ACCESSION M30936
KEYWORDS TraC protein; TrbI protein.
SOURCE plasmid F DNA, clone PKAS2.
ORGANISM Plasmid F
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 3227)
AUTHORS Schandel,K.A., Maneewannakul,S., Vonder Haar,R.A.,
Ippen-Ihler,K. and Webster,R.E.
TITLE Nucleotide sequence of the F plasmid gene traC and identification
of its product
JOURNAL Unpublished (1990)
STANDARD full staff_review
FEATURES from to/span description
pept 268 2895 TraC protein (gtg start codon)
pept 2892 > 3227 TrbI protein
binding 256 261 Shine-Delgarno sequence
BASE COUNT 785 a 827 c 900 g 715 t
ORIGIN
1 gcatgcggaa atcctattcc ggaagcccgg cggaaaatat ttcccggtgt gacgttgtgc
61 gttgaatgtc aggcgtatca ggaaagacag agaaaacatt atgcataagt cagtcgcaga
121 acatagtgat ttaattccgg atgaacatga gtggatattc agaaaacaga aatctttgta
181 tatgcgccgg gaaatggcgc gttaattaca ggtattccct tcatggctac cgcatttctc
241 gctttatttt tcaactaagg aattcatgtg aataacccac ttgaggccgt cactcaggcg
301 gttaactccc tcgtcacagc actgaaactg cctgacgaat ccgcaaaggc caatgaagtt
361 ctgggcgaaa tgagcttccc gcagttcagc cgtctgctgc cgtaccgtga ttacaaccag
421 gaatccggtc tgttcatgaa tgacaccacg atgggcttta tgctggaagc cattcccatc
481 aatggggcga atgagtccat tgtggaggcc ctcgatcata tgctgcgcac caaactgccg
541 cgcggtattc cgttgtgtat ccatctgatg tccagtcagc tggttggtga caggattgaa
601 tacgggctgc gtgagttctc ctggtctggc gaacaggccg aacggtttaa cgcgattacc
661 cgggcctatt atatgaaagc ggcagcgaca cagtttccgc tgccggaggg aatgaatctg
721 cccctgaccc tgcgccatta ccgggtgttt atctcgtact gttctccctc gaagaaaaaa
781 agccgggccg acattctgga aatggaaaac ctggtgaaaa tcatccgggc gtcgttacag
841 ggggccagta tcaccacaca gacggtggat gcacaggcct ttatcgatat tgtcggggag
901 atgattaacc ataacccgga ttccctgtac ccgaaaagac gtcagctgga cccgtattct
961 gatctgaatt atcagtgtgt ggaggacagt tttgatctga aagttcgggc tgattacctg
1021 acactgggcc tgcgtgagaa cggcaggaac agcacggccc gcatcctgaa tttccatctg
1081 gcccgtaacc cggaaatcgc cttcctgtgg aacatggccg acaactacag caacctgctg
1141 aacccggaac tgtccatctc ctgtccgttc atcctgacgc tgactctggt ggtggaagac
1201 caggtgaaaa cccacagcga agccaacctg aagtacatgg acctggagaa aaagtcgaag
1261 acctcctatg ccaaatggtt tccgtccgtg gagaaagagg cgaaggagtg gggggaactg
1321 cgtcagcggc tgggctccgg tcagtcctct gtcgtgtcct acttcctcaa catcacagcc
1381 ttctgcaagg acaataatga aacggcactg gaagtggagc aggacatcct gaacagcttc
1441 cgtaaaaacg gttttgagct gatttcaccg cgctttaacc acatgcgcaa tttcctgacc
1501 tgtctgccct ttatggccgg gaaagggttg tttaaacagc tgaaagaggc cggagtggta
1561 cagcgcgcag agagctttaa tgtggccaac ctgatgccgt tagtggcgga taaccccctg
1621 acaccggcag gtctgctggc acccacctac cgtaaccagc tggcgtttat cgatattttc
1681 ttccggggga tgaataacac caactataac atggcggtct gtggcacctc cggggccggt
1741 aaaaccgggc tgatacagcc acttatccgc agcgtgctgg actccggagg ctttgccgtg
1801 gtgttcgaca tgggggatgg atacaagtcc ctgtgtgaga acatgggcgg ggtgtatctg
1861 gacggtgaaa ccctgcgttt taacccgttt gcgaacatca ccgatattga ccagtcagcg
1921 gagcgtgtcc gtgaccagtt gtcagtgatg gccagcccca acggtaacct ggatgaagtg
1981 catgaaggtc tgctgttgca ggcggtcaga gcctcctggc tggccaaaga gaacagagca
2041 cgtattgatg acgtggtgga tttcctgaaa aacgccagtg acagcgagca gtatgccgag
2101 tcaccgacta tccgcagccg tctggacgaa atgattgtgc tgcttgacca gtacactgcc
2161 aacggcactt acggccagta ttttaactct gatgagccgt ccctgcggga tgacgcaaaa
2221 atggtggtgc tggagctggg cggactggaa gaccgtccgt cactgctggt tgcggtgatg
2281 ttctccctga ttatctacat cgagaacagg atgtaccgca cgccgcgtaa cctcaagaaa
2341 ctgaacgtta ttgatgaagg ctggcgtctg ctggacttca aaaatcacaa ggtcggtgaa
2401 tttattgaga aaggctaccg tacggcccgc cgtcataccg gtgcctatat caccatcaca
2461 cagaacatcg tcgactttga ctctgataag gcctccagtg ctgcccgcgc ggcatggggt
2521 aactcctcct acaaaattat cctcaaacag agtgcgaagg agttcgcgaa atacaaccaa
2581 ctgtatccgg accagttcct gccactgcag cgcgacatga ttggtaagtt tggtgcggcc
2641 aaagaccagt ggttcagttc cttcctgctg caggtggaaa accattcctc ctggcaccgt
2701 ctgtttgtgg acccgttaag ccgcgccatg tacagctctg acggcccgga ttttgagttt
2761 gtgcagcaga agcgtaaaga ggggctgagt attcatgagg cagtgtggca gctggcgtgg
2821 aagaagtcag ggccggaaat ggcttcgctg gaagcctggc tggaagaaca tgagaaatac
2881 aggagtgtgg catgagttca acgcagaaac ccgccgacgt cacggcagaa cgacgcagcc
2941 actggtggtg gacggtgccg ggatgcctgg caatggtgtt actgaacgca gccgtcagtt
3001 acggcattgt cagactgaac gcaccggtga ctgtcgcctt caacatgaaa cagaccgtgg
3061 atgcgttttt tgacagcgcc agtcagaaac aactgtcaga agcgcagtca aaagcccttt
3121 cagcgcgttt taacacggca ctggaagcca gcctgcaggc gtggcagcag aaacaccacg
3181 cggtcattct ggtgtcgcct gctgtggtac agggggcacc ggatatc
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS DROLARM 7606 bp ss-mRNA INV 19-OCT-1989
DEFINITION D.melanogaster protein tyrosine phosphatase (DLAR) mRNA, complete
cds.
ACCESSION M27700
KEYWORDS protein tyrosine phosphatase.
SOURCE Drosophila melanogaster (strain Oregon R) 4-12 hour embryo, cDNA to
mRNA, clones DLAR[23,55].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (sites)
AUTHORS Streuli,M., Krueger,N.X., Tsai,A.Y.M. and Saito,H.
TITLE A family of receptor-linked protein tyrosine phosphatases in humans
and Drosophila
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8698-8702 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 7606; for [1])
AUTHORS Saito,H.
JOURNAL Unpublished (1989) Dana-Farber Cancer Inst,44 Binney St.,Boston MA
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Saito, 07-SEP-1989.
FEATURES from to/span description
pept 118 6207 protein tyrosine phosphatase (DLAR) precursor
sigp 118 213 protein tyrosine phosphatase (DLAR) signal
peptide
matp 214 6204 protein tyrosine phosphatase (DLAR)
BASE COUNT 1999 a 1973 c 1926 g 1708 t
ORIGIN
1 aaagagaata aaaagagcat ttaaagttat tagtaaccat cgttgttgtt gagttgttgt
61 tgttgttacc aactgtcgac atacccttgc aactgccggc gaaaacatag cgaaataatg
121 ggtctgcaga tgacagcagc ccgtccaatc gcagctctca gcctactagt gttgtccttg
181 ctcacctgga ctcaccccac aatcgtggat gcagcccatc cgccggagat catcaggaag
241 ccgcagaatc agggagtccg agtgggcggc gttgctagct tctattgtgc ggcccgcggt
301 gatccgcctc catcgatagt gtggcgcaaa aatggcaaaa aagtttcggg aacccagtcg
361 cgttacacgg tgctggagca gcccggcggg atttccatac tccggattga gcccgtgcgg
421 gcgggacgcg atgatgcacc atacgagtgt gtggcggaga acggggtggg cgatgccgtt
481 tccgcagatg caactttaac catatatgaa ggcgataaaa cacccgcagg ctttccggtt
541 ataacccagg gtccgggaac tcgcgtcatt gaagtgggtc acacggtcct catgacatgc
601 aaagccatcg gcaatccgac gccaaacatt tactggatta agaatcagac aaaggttgat
661 atgagcaatc cgcgctactc tctcaaggat ggcttcctgc aaatcgaaaa cagtcgcgag
721 gaggatcagg gcaaatacga gtgtgtggct gagaactcaa tgggcacgga gcactcgaag
781 gccaccaact tgtatgtgaa agtccgtcgt gttccgccca ccttttcccg cccaccagag
841 accatcagcg aggtgatgtt gggatctaat ctgaatctat cctgcattgc cgtcggctca
901 cccatgccgc atgtcaagtg gatgaagggc tctgaagatc ttacacccga gaatgagatg
961 ccaatcggac gaaatgtcct gcagctgatc aatatccagg agagcgccaa ctacacttgc
1021 atagcggcct ccactttggg ccaaatcgat tccgtttcgg tggttaaagt gcaatctctg
1081 cccaccgcac ccaccgatgt gcaaatctcc gaggtgaccg ccacttcggt gcgtctggag
1141 tggtcgtaca agggtcccga ggacttgcaa tattacgtga tccagtacaa gccgaagaac
1201 gccaaccagg ccttcagcga gataagcggc atcatcacca tgtactatgt ggtccgtgcc
1261 ctgagtccct acacggagta cgagttctac gtgatagccg tgaacaatat tggacgcgga
1321 ccgccctcgg caccagcgac atgtaccacc ggtgagacaa aaatggaaag tgcaccacgt
1381 aatgtccaag tgcgcacgct gagctcgtcc acgatggtta ttacttggga accaccagag
1441 acgcccaatg gacaagtgac cggctacaag gtgtactaca cgaccaattc gaatcagccg
1501 gaggcgtcgt ggaactccca gatggtcgac aatagcgaac tgaccacagt ctcggacgtg
1561 acgccccacg ccatctacac ggtccgggtt caggcctaca catcgatggg agccggtcca
1621 atgtccacgc cggtccaggt gaaggcccag caaggtgtgc catcgcaacc gagcaatttc
1681 cgggcaaccg atatcggcga gaccgcagtc acactgcaat ggaccaagcc gacgcattcc
1741 agcgagaata tcgtgcacta cgagctctac tggaatgaca catacgccaa tcaggcccat
1801 cacaagcgca tttccaactc ggaggcgtat accctggacg gactctaccc cgataccctc
1861 tactacatct ggctggctgc aaggtcgcaa cgtggcgagg gggccaccac cccgcccatt
1921 ccggtgcgca ccaagcaata tgtaccaggt gctccgcctc gaaatatcac cgccatagcc
1981 accagctcga cgaccatatc cctcagctgg ctgcctccgc ccgtcgagcg atcgaacggc
2041 cggatcatat actataaggt gttcttcgtg gaggtgggtc gcgaagacga cgaggccacc
2101 accatgaccc tcaatatgac cagcattgtg ctggacgagc tgaagcgctg gacagagtac
2161 aagatctggg tgctggccgg cacctccgtc ggggatgggc cgcggtcgca tcccatcatt
2221 ttgcgcaccc aagaggatgt gcccggcgat ccgcaagatg tgaaggccac gcctttgaac
2281 tccacttcga tccatgtcag ctggaagccg cctctcgaaa aggatcgcaa tggcatcatc
2341 cgtgggtatc atatacacgc ccaggagctg cgagatgagg gcaagggctt tttgaacgaa
2401 cccttcaagt ttgatgtggt agacacgctg gagttcaatg tgactggctt gcagccggat
2461 acaaagtact ccattcaggt ggcggcacta actcgtaaag gagatggtga ccggagtgct
2521 gcgattgtgg tgaaaactcc tggcggagta ccagtccgac caacggtgag tctgaagatc
2581 atggagcggg aaccgatcgt gtccatcgaa ctcgaatggg agcgaccggc gcagacctat
2641 ggcgaattgc gtggctatcg acttcgatgg ggcgtcaagg accaagcact gaaggaggag
2701 atgctgtcag gaccgcagat gaccaagaaa cggtttgata acttggaacg cggagttgag
2761 tacgaatttc gtgtggcggg cagcaatcat attggtatcg gacaagagac ggtgaaaata
2821 tttcagacac ccgagggaac acccggtgga ccgccttcta acattaccat tcgcttccaa
2881 actccggatg tactgtgcgt gacctgggat ccaccaacta gggagcaccg gaatggcata
2941 atcacccgct atgatgttca gtttcacaag aaaatcgatc atggcctggg atccgagcga
3001 aatatgactc tccggaaggc ggtgttcaca aatctggagg agaacaccga gtatatcttc
3061 cgggtgaggg cttatacgaa gcagggagct ggtcccttca gcgacaagtt aatcgtggag
3121 acagaacgtg acatgggtcg agcacctatg tccctgcagg cagaggcaac atcggagcaa
3181 actgcggaga tctggtggga accggtaaca agtcgtggca agttgctggg ctacaagatc
3241 ttttacacca tgacagctgt cgaggatctg gacgattggc aaacgaaaac cgttggactt
3301 acggaatccg ctgatcttgt taatctcgag aagtttgccc aatatgccgt ggccattgcg
3361 gcgaggttca agaacggatt gggacgtctt agtgaaaagg ttacagtacg catcaagccg
3421 gaggatgtgc ccttaaatct tcgcgctcac gatgtcagca cccattcgat gaccttgagt
3481 tggtcgccac ccattcgcct aaccccggtc aactacaaga tcagcttcga tgccatgaag
3541 gtgtttgtgg actcgcaggg attctcccag acccagatcg ttcccaagcg agagattatc
3601 cttaagcact atgtgaagac ccacactatc aacgaactca gtccgtttac cacgtacaat
3661 gtgaatgtga gtgccattcc ctcggattat tcctaccggc cgcccacaaa gattacggtc
3721 acaacgcaaa tggctgcacc tcagccaatg gtgaagccgg atttctacgg cgttgttaat
3781 ggcgaggaaa ttctggtgat actgcctcag gcttcggagg aatatggacc catatcgcac
3841 tattatttgg tggtggtccc ggaggacaag tccaatctgc acaagatacc cgatcagttc
3901 cttaccgatg atctcttgcc gggcaggaac aagccagagc gtccgaatgc accgtacatt
3961 gcagccaagt tcccgcagcg ttccattccg ttcacattcc acctgggatc tggcgatgat
4021 tatcataact ttacaaatcg caaattggag cgagagaagc gctaccgcat ctttgtgcga
4081 gcggtggtgg atacgccaca gaagcacctc tacacctcca gtcccttctc tgagttccta
4141 tcgctggaca tgagggaagc tccgccaggt gagcggcccc accgacccga tcccaattgg
4201 cccgcggagc cggaagtgtc ggtgaaccgc aacaaggacg aaccggagat tctgtgggtg
4261 gtgctgcccc tgatggtgtc cacattcatt gtgtccaccg ccctgatcgt tctctgtgtg
4321 gtaaagcgtc gtcgccagcc gtgcaagact ccggatcagg cagctgtcac aaggccactg
4381 atggccgccg acctgggagc cggacctacg cccagcgatc cagtggacat gaggcgcttg
4441 aacttccaga cacccggcat gatctcccat ccgcccatac cgatatccga gtttgccaac
4501 cacatcgaac gactcaagtc caatgacaat cagaagtttt cgcaggaata cgaaagcatt
4561 gagccgggcc aacagttcac ctgggacaac tccaatctgg agcataacaa gtctaagaat
4621 cgctatgcaa atgttaccgc ctacgatcat tcacgcgtcc agttgccagc ggtggagggt
4681 gtggttggat cagattacat caatgccaat tactgtgacg gctatcggaa gcacaatgcc
4741 tacgtggcga cccaaggtcc gttgcaggag acctttgtgg acttctggcg catgtgttgg
4801 gaactgaaga cggccaccat tgtgatgatg acgcgattgg aggaacgaac gcgcataaag
4861 tgcgatcagt attggcccac tcgcggaacg gaaacctatg gtcagatctt tgtgaccatc
4921 acggagacac aggaactggc cacctacagc atccgcacct tccagttgtg ccggcagggc
4981 tttaacgatc ggcgtgagat caagcagctg cagttcacag cctggccaga tcatggagtg
5041 cccgatcatc cggctccctt ccttcagttc ttgcgccggt gtcgcgccct cacgccaccg
5101 gaatccggac ccgtgattgt tcactgctct gcgggagttg gtcgcactgg ctgttatatc
5161 gtaatcgatt caatgttgga acgaatgaaa cacgagaaga ttatcgacat ctatgggcat
5221 gttacttgtt tacgggcgca acggaactac atggtgcaga cggaggatca gtacatcttc
5281 attcatgacg ccatcctgga ggccatcatc tgtggggtga cggaggtgcc ggctcgcaat
5341 ctacacaccc acctacagaa actattgatc acggagcccg gcgagaccat ctcgggcatg
5401 gaggtggagt tcaagaagct gtctaacgtc aagatggact cgtccaagtt cgtaacggcc
5461 aatctgccgt gcaacaagca caagaatcgc ctggtccaca ttctgccgta cgagtcaagt
5521 cgcgtctacc tgacccccat ccatggaatc gagggaagcg actatgtcaa cgccagcttc
5581 atcgacggct atcgttaccg ttccgcatac atcgccgcac agggtcctgt ccaggatgcc
5641 gctgaggact tttggcgcat gctctgggag cacaactcca ccattgtggt catgctgacc
5701 aagctcaagg aaatgggaag ggaaaagtgc ttccagtact ggccccatga gcgatccgta
5761 cgctatcagt attatgtcgt ggatcccatt gctgagtaca acatgccgca gtataagctg
5821 cgtgaattta aggtcacgga tgcccgagat ggctcatcgc gcaccgtccg ccagttccag
5881 ttcatcgatt ggccggagca gggtgtgccc aagtcgggcg agggcttcat cgacttcatc
5941 ggacaggtgc acaagaccaa ggagcagttt ggccaggatg gacccattac cgtgcactgt
6001 tcggcgggcg tgggacgttc gggtgtcttt atcactctga gcatcgttct ggaacgaatg
6061 cagtacgagg gagtactgga cgtcttccag acagtgcgca tactgcgatc ccagcgtccg
6121 gctatggtac aaaccgagga tcaataccac ttctgctatc gcgctgcact ggagtacttg
6181 ggctcattcg acaattatac aaactgagtg catttctcaa tgggattgcc aggtccacag
6241 agctaaaatc tcatcgaccc tgcgttcgtt gcatacttat ttcggacttt acgcataata
6301 cgcatttacg atatacacgg atatcataac tctaaagtat tatagccagc tactgcccac
6361 acttacttaa tacacctata cttatatacg tggatatgtt tagttgataa gcgcaccccc
6421 gcccgcaggt tatgttacca ttacgattct ttgggggact cgatcttata tattgcgttc
6481 tattttattg ataatgtaaa ctaaagaaga ctgttacacg tttaattatc aaaaccttat
6541 atatcaaccc acatgcactt ctatgtacga gtatgtgcat caagatttgc tcgctaatgg
6601 agaaggatgt caatcaactt gttgttgttc gtggtactgc cgccggagtg accatgaatg
6661 aggggcgcca ctggaaaccg atacatattt gaactaagga cttggtagct attagtctta
6721 ccgcctaaat cgaatctcga acggaagcaa agggtcgaaa gtagaagcca ttttaagtaa
6781 atcgaaaact tctggcgatt tggcgcaaac tccagacgct attataatat ataatacaat
6841 caatcaaaca ctgctatgtt aatcagtttt catgccttgg ttaatggaaa ccagtataaa
6901 cggtccgtgc aattagtcaa aagtcttgca tagcgcatat tttataaggc agctagattc
6961 taaaattagt tttcaaacac aacttacgtt ttgatctccc cgcaaaagat gacgaacctt
7021 atggactgat cgatgtcttg gtagctaaca gacagttcaa atttcaaaga ccattatgat
7081 agatcacata cgcaacttcc agccaagcag ctattttaac ttcaacctca ttccctcgac
7141 ttctattcag ttccctagta cattcctgta tataccacag tcgatacccg agctagccac
7201 ttatgtatgt gtaatttaga ttgaaaagct atgaaaagtt attgtaccta aggtgtatga
7261 agacatagag aataaccgag cgaacagatt agaatgataa ctgtagatat gtactgtagc
7321 atataatttt taccataaaa tagagcgaca cacgcggtta taaataaaca taaacataaa
7381 atcgagacat tggcaatagt tcttaaggta cataataatg tatataactg ataagctgtt
7441 ttgcaaaacg tttgaattcg gcctcgtaac tatacataaa tatataaaga gttagcaagc
7501 aaatgcttaa aagcaaacac taaattatta gctgaaacac atgttaaaca accatagtca
7561 tatgtaaact aaatatttat acgaaataaa ggtatttgtt ataact
//
LOCUS DROPTPM 4477 bp ss-mRNA INV 19-OCT-1989
DEFINITION D.melanogaster protein tyrosine phosphatase (DPTP) mRNA, complete
cds.
ACCESSION M27699
KEYWORDS protein tyrosine phosphatase.
SOURCE Drosophila melanogaster (strain Oregon R) 4-12 hour embryo, cDNA to
mRNA, clone DPTP12.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (sites)
AUTHORS Streuli,M., Krueger,N.X., Tsai,A.Y.M. and Saito,H.
TITLE A family of receptor-linked protein tyrosine phosphatases in humans
and Drosophila
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8698-8702 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 4477; for [1])
AUTHORS Saito,H.
JOURNAL Unpublished (1989) Dana-Farber Cancer Inst,44 Binney St.,Boston MA
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Saito, 07-SEP-1989.
FEATURES from to/span description
pept 55 4443 protein tyrosine phosphatase (DPTP) precursor
sigp 55 123 protein tyrosine phosphatase (DPTP) signal
peptide
matp 124 4440 protein tyrosine phosphatase (DPTP)
BASE COUNT 1456 a 996 c 970 g 1054 t 1 others
ORIGIN
1 atataatttt tgacatatgt atttatccgg cncggagtga gcaattagca agccatggcg
61 ttactctacc gacgcatgtc gatgctatta aatataattc ttgcatatat tttcctttgt
121 gcgatatgtg tgcaaggatc tgtgaaacaa gagtgggcag aaataggtaa aaacgtatct
181 ctagagtgcg catctgaaaa cgaagcagta gcatggaaac tgggcaacca aactattaat
241 aaaaaccata caagatataa aataagaact gagccattga aatccaacga cgatggcagt
301 gaaaacaacg atagtcaaga cttcatgaag tataagaatg tcctgacgct tctcgatgtc
361 aatataaatg actcggggaa ctacacctgc acagcccaaa cggggcaaaa ccattcgact
421 gaattccaag ttaagccata ccttccatcc aaagtcctgc aaagtacccc cgacaggatc
481 aagagaaaaa tcaagcagga tgtcatgttg tactgtttga tcgagatgta cccacaaaat
541 gagacaacta atagaaacct caagtggctt aaggacggca gccagtttga attcctggac
601 accttctcgt ccatttcgaa gctaaacgat acgcacttaa acttcaccct ggaattcaca
661 gaagtttaca agaaagagaa tggcacctac aagtgcaccg tcttcgatga caccggactc
721 gaaattacct ccaaagagat tactcttttc gtaatggaag tgccacaagt tagcattgat
781 ttcgccaagg cagttggtgc taataaaata tacctaaatt ggaccgttaa cgacggcaac
841 gatccaattc agaagttctt catcactctg caggaggctg gaacaccgac ttttacttat
901 cataaggact ttattaacgg cagccataca tcgtatattc tggaccattt caagccgaac
961 acaacctatt ttttaagaat cgtgggaaag aactcgattg gcaatggcca acccacccag
1021 tatccccagg gaatcaccac gcttagttat gatcccatat ttataccaaa agtcgagacc
1081 accggcagca cggcgtccac gataacgatt ggctggaatc caccgccgcc ggatctcatt
1141 gattatatac aatactacga actgattgtc tccgaatcgg gcgaggtacc caaagtgatc
1201 gaagaggcca tttaccagca gaattctcga aacttgccat acatgtttga taagcttaag
1261 accgccacag actacgaatt tagggtaagg gcatgtagtg atctaaccaa gacttgtgga
1321 ccatggtccg aaaacgtgaa cggcacaacg atggacggcg tggcaacaaa acccaccaac
1381 ttgagcatac agtgtcatca tgacaacgtc acgagaggca actccatcgc cattaactgg
1441 gacgttccca agacgccaaa cggcaaggtt gtgtcatatt taattcactt gctgggcaat
1501 cccatgagca cagtggatag agagatgtgg ggaccaaaga ttcgaagaat cgatgaaccc
1561 catcacaaga ccctctacga aagtgttagc ccgaacacaa actacacggt gacggtgtct
1621 gccataacgc ggcacaagaa gaacggcgaa ccggccaccg gaagttgtct aatgcccgtc
1681 tccacgccgg atgccattgg ccgtaccatg tggtcaaagg tgaacctgga ctccaagtac
1741 gttctcaagt tgtatctacc gaagatcagc gaacgcaatg ggcccatatg ctgctataga
1801 ttgtatctag ttagaattaa caatgacaac aaggaattgc cggacccgga gaagctaaac
1861 atcgccacat atcaggaggt tcacagcgat aacgtgacta gaagtagtgc atatatagcg
1921 gaaatgataa gtagcaagta ctttaggccg gaaatattct tgggcgctga aaagagattt
1981 agtgagaaca acgatataat ccgcggcaac gacgaaattt gtcggaaatg cttggagggt
2041 actccatttc tgaggaaacc cgagatcatt cacatacccc cacaaggttc actatcaaat
2101 tccgattctg aactgcccat tttgtcggag aaggacaact tgatcaaagg agcaaactta
2161 acagagcatg ctctgaaaat cttagaaagt aagttaaggg ataaaagaaa cgcggtgacc
2221 agcgatgaga atccaattct aagcgccgtc aacccaaatg tgccactcca cgattctagt
2281 cgagatgtct tcgatggtga gatagatatt aactccaact acaccggatt cctagagata
2341 atagttcggg atcgaaacaa tgccctgatg gcttatagca aatactttga tataattact
2401 ccggcgactg aagctgaacc catccaatcc ttgaacaata tggactacta cctaagcatt
2461 ggggtcaagg ctggagccgt attacttggt gtcatacttg tatttattgt gctgtgggtt
2521 ttccatcaca agaaaaccaa gaatgaactg cagggcgaag acactttaac actaagagat
2581 tctttgagca gggcattgtt cggtcgacga aatcacaacc acagtcattt tataacttcc
2641 ggaaatcaca aaggattcga cgctggcccc attcacagat tagatttaga aaacgcctac
2701 aagaaccgac ataaggacac tgactacgga tttctgcggg aatatgagat gctgccaaat
2761 cgctttagcg atcggacaac taaaaacagt gatttaaagg agaacgcctg caagaacagg
2821 tatcccgata taaaggctta cgatcagacg cgcgtaaagt tggctgtcat caatggcctg
2881 caaactacgg actacatcaa tgctaacttc gtaattggct acaaggagag aaagaagttt
2941 atctgtgcac agggtccaat ggaaagtacc atcgatgatt tttggcgtat gatttgggaa
3001 caacatctag aaataattgt gatacttaca aatttagagg aatataacaa ggccaagtgc
3061 gcgaaatatt ggccagaaaa agtatttgat acgaaacagt tcggagatat tttagtgaaa
3121 tttgcacaag aacgtaagac tggtgattat attgaacgta ccctgaacgt ttccaagaac
3181 aaagccaatg tcggcgagga ggaggaccgt agacaaatca cccaatacca ctacctaacg
3241 tggaaggact tcatggcacc agagcatcca catggcatca tcaaattcat acgtcaaatc
3301 aattccgtct actccctgca aaggggtcca attttagtgc attgcagtgc tggtgtgggt
3361 agaaccggaa ccctggtggc tttggattcc ctaatccaac agctggagga agaagactcg
3421 gtgtccattt acaacacagt gtgtgatcta cgacaccaac gaaatttttt agtccaatct
3481 ctgaaacaat acatctttct ttaccgggct ctattagata ctggaacctt tggaaacacg
3541 gatatttgca ttgatacaat ggcttctgca attgaatctc ttaagcggaa acccaacgag
3601 ggtaaatgca aattggaaat ggaattcgag aaactactgg ccacagcaga tgagataagt
3661 aaatcatgta gtgtgggcga aaacgaggag aacaatatga agaacagaag tcaagagatt
3721 ataccctacg accgcaacag agttatactg acaccacttc cgatgcggga aaactcaaca
3781 tacattaacg catcattcat agagggctat gataatagcg aaaccttcat cattgcccag
3841 gatccatttg aaaacactat aggagacttc tggcgaatga tctcggaaca gagtgttacc
3901 accctcgtca tgatatccga aatcggagat ggtcccagga aatgcccgcg atactgggca
3961 gatgatgagg ttcaatacga ccacatactc gtgaaatacg tgcacagcga aagttgtcca
4021 tactatactc gccgcgaatt ttatgttacg aattgcaaaa tagacgatac gctgaaagtc
4081 acacaatttc aatacaatgg ttggcccacc gtggacggag aagttcctga agtctgccgt
4141 ggcattatag aacttgtaga tcaagcatac aaccattata agaacaacaa gaattctggt
4201 tgtcgatctc cactcacagt tcattgcagt ctgggaaccg atcgaagttc cattttcgtt
4261 gccatgtgta ttttggtcca gcatctcagg ctggagaagt gtgtcgacat ctgtgccaca
4321 acaaggaaat tacgatctca gcgaacggga cttatcaact catacgcaca atacgagttc
4381 ctacatcgcg caataattaa ttattcagac ttacatcaca tagccgagtc aacattggat
4441 taagttagat atgtatatac ttatacgagt accgaat
//
LOCUS MUSPPC1 643 bp ds-DNA ROD 29-AUG-1989
DEFINITION Mouse secretory granule proteoglycan peptide core gene, exon 1.
ACCESSION M27391 J05044
KEYWORDS proteoglycan peptide core.
SEGMENT 1 of 3
SOURCE M.musculus (strain BALB/c) liver DNA, clone lambda-MG-PG1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 643)
AUTHORS Avraham,S., Austen,K.F., Nicodemus,C.F., Gartner,M.C. and
Stevens,R.L.
TITLE Cloning and characterization of the mouse gene that encodes the
peptide core of secretory granule proteoglycans, and expression of
this gene in transfected rat-1 fibroblasts
JOURNAL J. Biol. Chem. 264, 16719-16726 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence [1] kindly
provided by R.L. Stevens, 02-AUG-1989.
FEATURES from to/span description
pept 546 + 618 secretory granule proteoglycan peptide core
pre-msg 505 > 643 peptide core mRNA and introns
IVS 619 > 643 peptide core intron A
BASE COUNT 141 a 134 c 180 g 188 t
ORIGIN
1 aattctagca gactctggac gttaacggag accgctcatc ctgggggctg agaacccagc
61 tcggctcgga atgttccctg cttgtgcctg actctgtgcg cgcccagctt ctctttgatg
121 tgcgctgtgg atgagccgag ctcagttctg gaacagctga gtcctcctgt ctgtttagat
181 tgttacctga aggaagggag ggggaagaaa gtgctgattc gactttttga tggggaaaac
241 ttttttttta aacatgcaaa tgacagatgg cagagctttt tggaaaaaga aaaaataata
301 accacacagc aaacgcctag ggggagtccg gtggagtttc atcatgggta tgaacagttg
361 ttgttttttt caactttctt cttctttctg ggtgttgatg tggatctctt tctatttgtt
421 caggaaactg tgacgtgtgt tcttgggcag ggtctgaggt tttggaacct ctttctaaaa
481 gggacagaaa gagcaccctg ctacatttgc taatccagag gctgagtgga gccgagctgg
541 tcaggatgca ggttcccgtc ggcagcaggc ttgtcctggc tctcgccttc gtcctggttt
601 ggggatcttc agtgcaaggt aagagaccca ggatctttaa ttc
//
LOCUS MUSPPC2 183 bp ds-DNA ROD 29-AUG-1989
DEFINITION Mouse secretory granule proteoglycan peptide core gene, exon 2.
ACCESSION M27392 J05044
KEYWORDS proteoglycan peptide core.
SEGMENT 2 of 3
SOURCE M.musculus (strain BALB/c) liver DNA, clone lambda-MG-PG1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 183)
AUTHORS Avraham,S., Austen,K.F., Nicodemus,C.F., Gartner,M.C. and
Stevens,R.L.
TITLE Cloning and characterization of the mouse gene that encodes the
peptide core of secretory granule proteoglycans, and expression of
this gene in transfected rat-1 fibroblasts
JOURNAL J. Biol. Chem. 264, 16719-16726 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence [1] kindly
provided by R.L. Stevens, 02-AUG-1989.
FEATURES from to/span description
pept + 19 + 163 secretory granule proteoglycan peptide core
pre-msg < 1 > 183 peptide core mRNA and introns
IVS < 1 18 peptide core intron A
IVS 164 > 183 peptide core intron B
BASE COUNT 46 a 44 c 44 g 49 t
ORIGIN
1 ggttccttgt tcgcacaggt tatcctgctc ggagagccag gtaccagtgg gtccgctgca
61 aaccgaatgg cttttttgcg aactgcatcg aggagaaggg accacagttt gacctaatag
121 atgaatccaa taacatcggc cctcccatga ataatcctgt tttgtaagta gactttcatc
181 gat
//
LOCUS MUSPPC3 681 bp ds-DNA ROD 29-AUG-1989
DEFINITION Mouse secretory granule proteoglycan peptide core gene, exon 3.
ACCESSION M27393 J05044
KEYWORDS proteoglycan peptide core.
SEGMENT 3 of 3
SOURCE M.musculus (strain BALB/c) liver DNA, clone lambda-MG-PG1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 681)
AUTHORS Avraham,S., Austen,K.F., Nicodemus,C.F., Gartner,M.C. and
Stevens,R.L.
TITLE Cloning and characterization of the mouse gene that encodes the
peptide core of secretory granule proteoglycans, and expression of
this gene in transfected rat-1 fibroblasts
JOURNAL J. Biol. Chem. 264, 16719-16726 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence [1] kindly
provided by R.L. Stevens, 02-AUG-1989.
FEATURES from to/span description
pept + 20 260 secretory granule proteoglycan peptide core
pre-msg < 1 > 678 peptide core mRNA and introns
IVS < 1 19 peptide core intron B
BASE COUNT 208 a 135 c 123 g 215 t
ORIGIN
1 ttttttcttt gtattttagg atggaaggac cctcaaaaga tttcatctcc aattatgatg
61 actatgggtc aggttcgggc tccggctctg gctccggctc tggctcgggt tccggctccg
121 gaagtggctt cctaggtgac atggaatggg aataccagcc aacagatgaa agcaatattg
181 tctatttcaa ctataagcct tttgacagga ttctcactga gcaaaaccaa gaccaaccag
241 aagacgattt tattatatga atgtgacggt ctctgtctcc ccacctccat gtggaacaat
301 gtattcagta tacttagtgt accacgttta aatgaccagt ctcaggataa agagttttac
361 agaaaattta aaatgcctgg aaaagactct tgaatcctgt tacccctttc ctcattaact
421 cgtaaggaat tatgctttaa tgctgttacc tatcttgttg ttctggaaaa tgcctgcatt
481 tatgtgtatt gaatcaacat ttaagaaatt aacacacacc cccattatta tacaataact
541 ttcaaagcca tactggtttt gaaaatttta atttgatagc aagttgatga acaatctttc
601 atacctaaag tgttcaggaa cccaactcgc attgtgaatt acaaatatat tcctttatgt
661 gattaaaaag aaaataaagt g
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS CE3LYS 392 bp ds-DNA BCT 15-JUN-1989
DEFINITION Plasmid ColE3 lysis protein gene, complete cds.
ACCESSION X03631
KEYWORDS inverted repeat; lysis protein.
SOURCE Plasmid Colicin E3 (from E.coli) DNA.
ORGANISM Plasmid Colicin E3
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 392; enum. 1 to 392)
AUTHORS Toba,M., Masaki,H. and Ohta,T.
TITLE Primary structures of the CoIE2-P9 and CoIE3-CA38 lysis genes
JOURNAL J. Biochem. 99, 591-596 (1986)
STANDARD simple automatic
COMMENT EMBL features not translated to GenBank features:
key from to description
INVREP 233 247 imp. inverted repeat A
INVREP 249 263 imp. inverted repeat A'
SITE 264 269 T-stretch
OPR 233 269 pot. transcription terminator
FEATURES from to/span description
pept 65 208 lysis protein (aa 1-47)
BASE COUNT 110 a 78 c 93 g 111 t
ORIGIN
1 tgaaatatga atgccggttg tttatggatg aatggctggc attctttcac aacaaggagt
61 cgttatgaaa aaaataacag ggattatttt attgcttctt gcagtcatta ttctgtctgc
121 atgtcaggca aactatatcc gggatgttca gggcgggacc gtatctccgt catcaacagc
181 tgaagtgacc ggattagcaa cgcagtaacc cgaaatcctc tttgacaaaa acaaagcgtg
241 tcaggctgat tctgatgcgc tttttttttg aaatgtcaca aaaattccat gtgggagatg
301 ggatctaaaa tcctcgtgca gaactttcca tccaggggga gaaaacttgt cgttttgagc
361 cgttcggtgt tcagaacgca cgaaaccgat cg
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS CECCOLE3IM 915 bp ds-DNA BCT 16-JUN-1986
DEFINITION Plasmid ColE3-CA38 immunity protein gene, complete cds, and colicin
E3 gene, 3' end.
ACCESSION X01162
KEYWORDS colicin; immune response gene; nuclease; toxin.
SOURCE Plasmid ColE3-CA38 DNA, from E.coli.
ORGANISM Plasmid Colicin E3-CA38
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 915)
AUTHORS Lau,P.C.K., Rowsome,R.W., Zuker,M. and Visentin,L.P.
TITLE Comparative nucleotide sequences encoding the immunity proteins and
the carboxyl-terminal peptides of colicins E2 and E3
JOURNAL Nucleic Acids Res. 12, 8733-8745 (1984)
STANDARD simple staff_review
COMMENT EMBL features not translated to GenBank features:
key from to description
SITE 22 255 pot. receptor-binding domain
SITE 256 546 pot. RNase domain
FEATURES from to/span description
pept < 1 549 colicin E3 (AA at 1)
pept 559 816 immunity protein (aa 1-85)
BASE COUNT 297 a 129 c 236 g 253 t
ORIGIN 21 bp upstream of TaqI site.
1 gctgaaataa aacaatttaa tcgatttgcc catgacccaa tggctggcgg tcacagaatg
61 tggcaaatgg ccgggcttaa agcccagcgg gcgcagacgg atgtaaataa taagcaggct
121 gcatttgatg ctgctgcaaa agagaagtca gatgctgatg ctgcattgag ttctgctatg
181 gaaagcagga agaagaaaga agataagaaa aggagtgctg aaaataattt aaacgatgaa
241 aagaataagc ccagaaaagg ttttaaagat tacgggcatg attatcatcc agctccgaaa
301 actgagaata ttaaagggct tggtgatctt aagcctggga taccaaaaac accaaagcag
361 aatggtggtg gaaaacgcaa gcgctggact ggagataaag ggcgtaagat ttatgagtgg
421 gattctcagc atggtgagct tgaggggtat cgtgccagtg atggtcagca tcttggctca
481 tttgacccta aaacaggcaa tcagttgaaa ggtccagatc cgaaacgaaa tatcaagaaa
541 tatctttgag aggaagttat gggacttaaa ttggatttaa cttggtttga taaaagtaca
601 gaagatttta agggtgagga gtattcaaaa gattttggag atgacggttc agttatggaa
661 agtctaggtg tgccttttaa ggataatgtt aataacggtt gctttgatgt tatagctgaa
721 tgggtacctt tgctacaacc atactttaat catcaaattg atatttccga taatgagtat
781 tttgtttcgt ttgattatcg tgatggtgat tggtgatcaa atattatcag ggatgagttg
841 atatacgggc ttctagtgtt catggatgaa cgctggagcc tccaaatgta gaaatgttat
901 attttttatt gagtt
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS FSOFARS1A 132 bp ds-DNA PLN 15-MAR-1990
DEFINITION F.oxysporum autonomous replicating sequence (ARS).
ACCESSION M29451
KEYWORDS autonomous replicating sequence.
SOURCE F.oxysporum lycopersici (strain 73) ARS DNA, clone pFOLT4R4.
ORGANISM Fusarium oxysporum
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE 1 (bases 1 to 132)
AUTHORS Powell,W.A. and Kistler,H.C.
TITLE In vivo rearrangement of foreign DNA by Fusarium oxysporum produces
linear self-replicating plasmids
JOURNAL Unpublished (1989) College Env. Sci., S.U.N.Y., Sayracuse, NY 13210
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.A.Powell 27-OCT-1989.
FEATURES from to/span description
site 1 9 DNA homologous to yeast core sequence
rpt 7 12 telomere repeat unit
site 111 117 ligation point of telomeres -> ApaI site
BASE COUNT 26 a 8 c 56 g 42 t
ORIGIN 116 bp upstream of ApaI site.
1 tttggtttag ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg
61 ttagggttag ggttagggtt agggttaggg ttagggttag ggttagggtt agggccctaa
121 ccctaaacca aa
//
LOCUS MZEGST3AA 913 bp ss-mRNA PLN 15-SEP-1989
DEFINITION Maize GSH glutathione S-transferase III mRNA, complete cds.
ACCESSION X06755
KEYWORDS glutathione S-transferase; tandem repeat; transferase.
SOURCE Maize, cDNA to mRNA, clone pGTC27.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 913)
AUTHORS Grove,G., Zarlengo,R.P., Timmermann,K.P., Li,N., Tam,M.F. and
Tu,C.-P.D.
TITLE Characterization and heterospecific expression of cDNA clones of
genes in the maize GSH S-transferase multigene family
JOURNAL Nucleic Acids Res. 16, 425-438 (1988)
STANDARD simple staff_entry
COMMENT Data kindly reviewed (16 JUNE 1988) by D.Tu.
FEATURES from to/span description
pept 75 743 glutathione S transferase III (EC 2.5.1.18)
mRNA < 1 873 GST mRNA (alt.)
mRNA < 1 884 GST mRNA (alt.)
mRNA < 1 898 GST mRNA (alt.)
mRNA < 1 913 GST mRNA (alt.)
rpt 54 71 tandemly repeated trinucleotide
BASE COUNT 145 a 332 c 270 g 166 t
ORIGIN
1 cccactttac tcctatccac tgcggcctgg acgcgtgcga gaggcttgac caagcagcag
61 cagcagcagc agcgatggcg cctctgaagc tgtacgggat gccgctgtcc cccaacgtgg
121 tgcgcgtggc caccgtgctc aacgagaagg gcctcgactt cgagatcgtc cccgtcgacc
181 tcaccaccgg cgcccacaag cagcccgact tcctcgccct caaccctttc ggccagatcc
241 cggctctcgt cgacggagac gaagtcctct tcgagtcccg tgcgatcaac cggtacatcg
301 ccagcaagta cgcgtcggag ggcacggacc tgctccccgc gacggcgtcg gcggcgaagc
361 tggaggtgtg gctggaggtg gagtcgcacc acttccaccc gaacgcgtcg ccgctggtgt
421 tccagctgct cgtgaggccg ctcctgggcg gcgcccccga cgcggcggtg gtggagaagc
481 acgcggagca gctcgccaag gtgctcgacg tgtacgaggc gcacctggcc cgcaacaagt
541 acctcgccgg ggacgagttc acgctcgccg acgccaacca cgcgctccta cctgctctca
601 cctcagcaag acccccaagg ccgggctgcg tcgccgcccg cccccacgtc aaggcctggt
661 gggaggccat cgccgcccgc cccgcgttcc agaagaccgt cgccgccatc cccttgcccc
721 cgccgccctc ctcctcggct tgacctcgcc ttgcgttgcc tgggtcgcgg atgcgtcgga
781 gccctgagtc gaataaaaga ggccgcatcc tgtcttgcat ttgctcgtgc gccatgtgct
841 ataacagcct gtgtaataaa cactgttgcc tttcgtgtgt gttcattgcc tttttgttgg
901 ttggtctttg caa
//
LOCUS MZEGSTI 929 bp ss-mRNA PLN 15-SEP-1989
DEFINITION Maize GSH glutathione S-transferase I mRNA, complete cds.
ACCESSION X06754
KEYWORDS glutathione S-transferase.
SOURCE Maize, cDNA to mRNA, clone pGTC2.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 929)
AUTHORS Grove,G., Zarlengo,R.P., Timmermann,K.P., Li,N., Tam,M.F. and
Tu,C.-P.D.
TITLE Characterization and heterospecific expression of cDNA clones of
genes in the maize GSH S-transferase multigene family
JOURNAL Nucleic Acids Res. 16, 425-438 (1988)
STANDARD simple staff_entry
COMMENT Data kindly reviewed (16 June 1988) by D. Tu
FEATURES from to/span description
pept 82 726 GST glutathione S transferase I (EC 2.5.1.18)
mRNA < 1 915 GST mRNA (alt.)
mRNA < 1 918 GST mRNA (alt.)
mRNA < 1 924 GST mRNA (alt.)
mRNA < 1 929 GST mRNA (alt.)
rpt 35 65 tandemly repeated pentanucleotide
BASE COUNT 211 a 230 c 277 g 211 t
ORIGIN
1 gggcgcagag aatccccaag caaacaaaca gggtagaggg agaggagagg agaggagagg
61 agaggttggg tctgggacac catggctccg atgaagctgt acggggcggt gatgtcgtgg
121 aacttgacga ggtgcgcaac ggcgctggag gaggctggct ccgactacga gatcgtgccc
181 atcaacttcg ccaccgccga gcacaagagc cccgagcacc tcgtccgcaa cccgtttggt
241 caggttccag ctctgcagga tggtgacttg tacctcttcg aatcacgagc aatctgcaag
301 tacgctgctc gcaaaaacaa gccagagctg ttgagggaag gaaacctcga ggaggcagca
361 atggtggatg tttggatcga ggtggaggct aaccagtaca ccgctgcact gaatcccatc
421 ctcttccagg tcctcatcag tcctatgctt gggggaacca ccgaccagaa agttgtggat
481 gagaaccttg agaagctgaa gaaggtgcta gaggtgtacg aggcacgcct gaccaagtgc
541 aagtaccttg ctggagactt cctcagcctc gccgacctga accatgtgtc tgtcactctc
601 tgcctgttcg ctacgcccta cgcatctgtg ctcgacgcct acccgcatgt gaaggcctgg
661 tggtctggtc tgatggagag gccgtctgtc cagaaggtcg ctgccctgat gaagccatct
721 gcttgaaacg gttgccctgt caatcggtga atggaataag ccaagtctgt tctgggtgtt
781 tgcttgttca gtgatgtgtc ttctatgacg aaagttgagt gccttatgtg tgagcatgct
841 tgcttgtaac gagaccttga attgttcgga aatgagcatt cctgtgctct tcttgaaaat
901 aaactttgcg ttaggttcct caggatctt
//
LOCUS MZEPCSSU 830 bp ss-mRNA PLN 15-DEC-1988
DEFINITION Maize ribulose 1,5-bisphosphate carboxylase small subunit mRNA,
complete cds.
ACCESSION D00170
KEYWORDS RuBPC; ribulose 1,5-bisphosphate carboxylase; small subunit.
SOURCE Maize leaf, cDNA to mRNA.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 830)
AUTHORS Matsuoka,M., Kano-Murakami,Y., Tanaka,Y., Ozeki,Y. and Yamamoto,N.
TITLE Nucleotide sequence of cDNA encoding the small subunit of ribulose-
1,5-bisphosphate carboxylase from maize
JOURNAL J. Biochem. 102, 673-676 (1987)
STANDARD simple automatic
COMMENT Comparison with the RuBPC small subunit genes from other plants
revealed that the maize small subunit is similar to the wheat one,
there being 73% homology between the transit peptides and 64%
between the mature proteins. This indicates that there is no
noteworthy difference between the C3 and C4 small subunit
structures. corresponds to DDBJ (rel3.0) entry <RUBPCSSU>(D00170)
dated 29-MAR-1988
FEATURES from to/span description
pept 66 578 ribulose 1,5-bisphosphate carboxylase small
subunit (SSu)
BASE COUNT 147 a 281 c 220 g 182 t
ORIGIN 65 bp upstream of the SSu initiation codon.
1 caggacgacc caagcaagca agcaagcagc gagtacatac atacatacta ggcagccagg
61 cagccatggc gcccaccgtg atgatggcct cgtcggccac cgccgtcgct ccgttccagg
121 ggctcaagtc caccgccagc ctccccgtcg cccgccgctc ctccagaagc ctcggcaacg
181 tcagcaacgg cggaaggatc cggtgcatgc aggtgtggcc ggcctacggc aacaagaagt
241 tcgagacgct gtcgtacctg ccgccgctgt cgacggacga cctgctgaag caggtggact
301 acctgctgcg caacggctgg ataccctgcc tcgagttcag caaggtcggc ttcgtgtacc
361 gcgagaactc cacctccccg tgctactacg acggccgcta ctggaccatg tggaagctgc
421 ccatgttcgg ctgcaacgac gccacccagg tgtacaagga gctgcaggag gccatcaaat
481 cctacccgga cgccttccac cgcgtcatcg gcttcgacaa catcaagcag acgcagtgcg
541 tcagcttcat cgcctacaag cccccgggca gcgactagac cgcgcccgcc ggccgccccc
601 cgccggctag ctagctagct agctcctgcg tgagctagta gctagtgcca tgcgtcgtct
661 ctgtcgttcg gttttgcttc gggtcaccgt accctttgct tgcttggttt cttctttcct
721 tttttccttt tttttttctt cttttccccg gccatggttc ctttgctttc agcagttctc
781 tgtgatgtga tgtatccatt gttgcaagca tgcatggcct tgcattggct
//
LOCUS PETRBCA 318 bp ss-mRNA PLN 15-MAR-1990
DEFINITION Petunia ribulose 1,5-bisphosphate carboxylase small subunit mRNA
(clone pSSU 117), partial cds.
ACCESSION M29641
KEYWORDS ribulose 1,5-bisphosphate carboxylase.
SOURCE Petunia (strain Mitchell) leaf cDNA to mRNA, clone pSSU 117.
ORGANISM Petunia hybrida
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Dunsmuir,P., Smith,S. and Bedbrook,J.
TITLE A number of different nuclear genes for the small subunit of
RuBPCase are transcribed in petunia
JOURNAL Nucleic Acids Res. 11, 4177-4183 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 318 ribulose 1,5-bisphosphate carboxylase (AA at
1)
BASE COUNT 89 a 67 c 76 g 86 t
ORIGIN
1 tacgagacac tctcatactt tcccgatttg accgacgaac aattgttcaa agaagttgag
61 taccttttga ataagggatg ggttccttgt ttggaattcg agctgaaaca caaatttatc
121 taccgtgaat atcacgcatc accaggatac tatgatggaa ggtactggac aatgtggaag
181 ttgcccatgt ttggttgcac tgatgctacc caagtcttgg gtgagctcca agaggccaag
241 aaggcttacc caaatgcatg gatcaggatc atcggattcg agaacgttcg tcaagtgcag
301 tgcatcagtt tcattgcc
//
LOCUS PETRBCB 212 bp ss-mRNA PLN 15-MAR-1990
DEFINITION Petunia ribulose 1,5-bisphosphate carboxylase small subunit mRNA
(clone pSSU 51), partial cds.
ACCESSION M29642
KEYWORDS ribulose 1,5-bisphosphate carboxylase.
SOURCE Petunia (strain Mitchell) leaf cDNA to mRNA, clone pSSU 51.
ORGANISM Petunia hybrida
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Dunsmuir,P., Smith,S. and Bedbrook,J.
TITLE A number of different nuclear genes for the small subunit of
RuBPCase are transcribed in petunia
JOURNAL Nucleic Acids Res. 11, 4177-4183 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 207 ribulose 1,5-bisphosphate carboxylase (AA at
1)
BASE COUNT 53 a 57 c 53 g 49 t
ORIGIN
1 catgcatctc cagggtacta tgatggcagg tactggacca tgtggaagct gcccatgttc
61 gggtgcactg atgccaccca agtcttgggt gagctccaag aggccaagaa ggcttacccc
121 aatgcctgga tcagtatcat tggcttcgac aacgtacgtc aagtgcaatg catcagtttc
181 attgcctaca agccccaagg ctactagatt ac
//
LOCUS PETRBCC 251 bp ss-mRNA PLN 15-MAR-1990
DEFINITION Petunia ribulose 1,5-bisphosphate carboxylase small subunit mRNA
(clone pSSU 41), partial cds.
ACCESSION M29643
KEYWORDS ribulose 1,5-bisphosphate carboxylase.
SOURCE Petunia (strain Mitchell) leaf cDNA to mRNA, clone pSSU 41.
ORGANISM Petunia hybrida
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 251)
AUTHORS Dunsmuir,P., Smith,S. and Bedbrook,J.
TITLE A number of different nuclear genes for the small subunit of
RuBPCase are transcribed in petunia
JOURNAL Nucleic Acids Res. 11, 4177-4183 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 141 ribulose 1,5-bisphosphate carboxylase (AA at
1)
BASE COUNT 77 a 67 c 61 g 46 t
ORIGIN
1 accgatgcaa cccaggtgtt gggtgagctc caagaggcca agaaggctta ccctaatgca
61 tggatcagaa tcatcggatt cgacaacgtc cgtcaagtgc aatgcatcag tttcattgcc
121 tacaagccac caggctacta agttatatta ggacagcttc ccatgtgtat ttaggggcag
181 tcccggcgga ttgggagcca aggaccccgg ccggccaaaa atagccagga acccaccaaa
241 aaaccaaaaa a
//
LOCUS SPIPSI2 863 bp ss-mRNA PLN 15-MAR-1990
DEFINITION Spinach psAD mRNA for subunit II precursor of photosystem I
reaction center.
ACCESSION X14017
KEYWORDS photosynthesis; photosystem I; photosystem I reaction center.
SOURCE spinach (Spinacia oleracea).
ORGANISM Spinacia oleracea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE 1 (bases 1 to 863)
AUTHORS Muench,S., Ljungberg,U., Steppuhn,J., Schneiderbauer,A.,
Nechushtai,R., Beyreuther,K. and Herrmann,R.G.
TITLE Nucleotide sequences of cDNAs encoding the entire precursor
polypeptides for subunits II and III of the photosystem I reaction
center from spinach
JOURNAL Curr. Genet. 14, 511-518 (1988)
STANDARD simple automatic
COMMENT *source: tissue=seedling; library=lambda gt11; clone=p6SocPS12-4.
seq. differs from that reported by Lagoutte (1988), see Y00759; the
psaD gene for subunit II probably exists only in a single copy per
spinach genome
EMBL features not translated to GenBank features:
key from to description
MSG <1 863 psaD mRNA
SITE 842 846 put. polyA signal
POLYA 863 863 polyA site
FEATURES from to/span description
pept 155 793 subunit II preprotein (AA -50 to 162)
sigp 155 307 transit peptide (AA -50 to -1)
matp 305 790 mature subunit II (AA 1 - 162)
BASE COUNT 279 a 214 c 158 g 212 t
ORIGIN
1 ctaaggaaaa cagaaaaatt gccacaaatt gggctacagg aagctttaat tcaaactaca
61 gtctaatggt acacattcaa caagtcctat aaagtacaat tagttgacaa aacaaaaaac
121 caatccaaag ttccaaaaca acaacaatca aattatggcc atggcaactc aagccaccct
181 cttctcccca tcttctctct cctccgcaaa acccattgac acccgcctca ccacctcttt
241 caaacaaccc tccgccctct ctttcgcctc caagcccgcc tcccgccacc actctatccg
301 ggcagccgcc gcagctgaag gcaaagctgc caccccaact gaaaccaagg aggcgcctaa
361 aggttttact cctccggagt tggaccctaa cactccctcc cccatctttg ccggcagcac
421 aggtggacta ttgaggaagg cacaagtaga ggagttttac gtgataacat gggaatcacc
481 aaaagagcaa atattcgaga tgccaacagg aggagcagca ataatgaggg aaggaccaaa
541 cttgctaaaa ttagcacgta aagagcaatg tttagcactt ggaacaagac ttaggtctaa
601 atacaagatt aagtaccaat tttacagggt tttcccaagt ggtgaagttc aatatttaca
661 ccctaaagat ggtgtttacc cagaaaaggt taaccctggt agacaaggtg ttgggttaaa
721 catgaggtct attggtaaga atgttagccc tattgaggtt aaatttactg gcaagcaacc
781 ttatgatttg taataacttt ttatcttatt ttcatcatgt tcatgctact atgttttgga
841 caaatacaat gtgattattt tag
//
LOCUS SPIPSI3 653 bp ss-mRNA PLN 15-MAR-1990
DEFINITION Spinach psaE mRNA for subunit III precursor of photosystem I
reaction center.
ACCESSION X14018
KEYWORDS photosynthesis; photosystem I; photosystem I reaction center;
psaE gene.
SOURCE spinach (Spinacia oleracea).
ORGANISM Spinacia oleracea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE 1 (bases 1 to 653)
AUTHORS Muench,S., Ljungberg,U., Steppuhn,J., Schneiderbauer,A.,
Nechushtai,R., Beyreuther,K. and Herrmann,R.G.
TITLE Nucleotide sequences of cDNAs encoding the entire precursor
polypeptides for subunits II and III of the photosystem I reaction
center from spinach
JOURNAL Curr. Genet. 14, 511-518 (1988)
STANDARD simple automatic
COMMENT *source: tissue=seedling; library=lambda gt11; clone=p6SocPS12-4.
at least two genes for subunit III are present in spinach genome
designated psaE-1 and psaE-2 by [1]
EMBL features not translated to GenBank features:
key from to description
MSG <1 653 psaE mRNA
MSG <1 608 altern. psAE mRNA
POLYA 608 608 altern. polyA site
POLYA 653 653 polyA site
FEATURES from to/span description
pept 106 483 subunit III preprotein (AA -34 to 91)
sigp 106 207 transit peptide (AA -34 to -1)
matp 208 480 subunit III (AA 1 - 91)
BASE COUNT 198 a 140 c 139 g 176 t
ORIGIN
1 aaaacacaaa gctactagtt caaagggact ccctcctttt cactacttgc aaaatttatt
61 tcttgtaaga agaaaaaaca gagagagaga gggagaaaat taataatggc tagcattgca
121 tcatcagtag ctgttaggct tgggctaaca caagtgttgc ccaacaaaaa cttctcatct
181 ccaaggagca ctaggcttgt tgtccgggct gctgaagagg ctgctgctgc acccgctgcc
241 gcatcgcccg aaggtgaagc acccaaagct gctgctaagc caccaccaat tggccccaag
301 aggggttcta aggtgagaat catgagaaag gaatcatact ggtacaaggg tgttggctca
361 gtggttgctg ttgatcagga tccaaaaaca agatatccag ttgtggtgag attcaacaag
421 gtgaattacg cgaatgtatc aaccaacaac tacgcactgg atgagattca agaagttgca
481 tgatcaactc tgttataagt ttgcgaaatt tccaaacatt tagatgaatt aggagctatt
541 tctgctgtgt gtatccaatg taaatctact tcttttgagt atcaacttct ttgcccatga
601 aacttgtcag atatgctcct catcatccta agtattcacc atattctttc ctt
//
LOCUS SPIPSI4 659 bp ss-mRNA PLN 15-MAR-1990
DEFINITION Spinach mRNA for photosystem I subunit V.
ACCESSION X13134 Y00967
KEYWORDS photosystem I; photosystem I reaction center.
SOURCE spinach (Spinacia oleracea).
ORGANISM Spinacia oleracea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE 1 (bases 1 to 659)
AUTHORS Herrmann,R.G.
TITLE ;
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 659)
AUTHORS Steppuhn,J., Hermans,J., Nechushtai,R., Ljungberg,U., Thuemmler,F.,
Lottspeich,F. and Herrmann,R.G.
TITLE Nucleotide sequence of cDNA clones encoding the entire precursor
polypeptides for subunits IV and V of the photosystem I reaction
center
JOURNAL FEBS Lett. 237, 218-224 (1988)
STANDARD simple automatic
COMMENT *source: tissue=green tissue; clone=p6SocPI5-7.
EMBL features not translated to GenBank features:
key from to description
MSG <1 659 PSI subunit V mRNA
SITE 620 624 put. polyA signal
POLYA 659 659 polyA site
[1] Author address Herrmann R.G., Botanisches Institut der Ludwig
Maximilians Universiteit, Menzigerstr. 67, 8000 Muenchen 19, FRG.
Submitted (06-OCT-1988) on tape to the EMBL data library
FEATURES from to/span description
pept 16 519 PSI subunit V preprotein (AA -69 to 98)
sigp 16 222 transit peptide (AA -69 to -1)
matp 223 516 mature PSI subunit V (AA 1 - 98)
BASE COUNT 185 a 128 c 131 g 215 t
ORIGIN
1 taaatccgta aaaaaatggc agcagcaaca gcatctttga gcagcacttt actagcaccc
61 tgcagctcaa agcagccgca gccgcaacag caacatcagc atcagcagct caaatgcaag
121 tccttttccg gcctccgccc actgaagcta aacatcagca gcaacaacag cagcagcagt
181 ctatcgatgt cttctgcaag aagatcaatg acttgccgtg cagagttgag cccatcattg
241 gtgataagtc tgagtacagg gctatccctt tttttaggaa ggtttgtgtt cttcaatttc
301 cagagggaga atatggccaa acaggtgcct gaacaaaatg gtatgtctca ttttgaggcg
361 ggcgatactc gtgctaagga gtatgtgagc cttcttaagt ctaatgatcc tgttgggttt
421 aacattgttg atgttcttgc ttggggttct attggtcata ttgttgctta ttatatcttg
481 gctactgcta gcaatggtta tgaccctagc ttcttttaaa ttatcatatt tcggatttta
541 gtttcttttt taattgttat attattatta ttattattat tattatgttg agttgttgtg
601 taattaacag attctgctca aatattattg cattcacttc tgtaaaaaaa aaaaaaaaa
//
LOCUS SPIPSI5 923 bp ss-mRNA PLN 15-MAR-1990
DEFINITION Spinach mRNA for photosystem I subunit IV.
ACCESSION X13133 Y00967
KEYWORDS photosystem I; photosystem I reaction center.
SOURCE spinach (Spinacia oleracea).
ORGANISM Spinacia oleracea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE 1 (bases 1 to 923)
AUTHORS Herrmann,R.G.
TITLE ;
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 923)
AUTHORS Steppuhn,J., Hermans,J., Nechushtai,R., Ljungberg,U., Thuemmler,F.,
Lottspeich,F. and Herrmann,R.G.
TITLE Nucleotide sequence of cDNA clones encoding the entire precursor
polypeptides for subunits IV and V of the photosystem I reaction
center
JOURNAL FEBS Lett. 237, 218-224 (1988)
STANDARD simple automatic
COMMENT *source: tissue=green tissue; clone=p6SocPI4-7;
EMBL features not translated to GenBank features:
key from to description
MSG <1 923 PSI subunit IV mRNA
MSG 923 923 polyA site
[1] Author address Herrmann R.G., Botanisches Institut der Ludwig
Maximilians Universiteit, Menzigerstr. 67, 8000 Muenchen 19, FRG.
Submitted (06-OCT-1988) on tape to the EMBL data library
FEATURES from to/span description
pept 30 725 PSI subunit IV preprotein (AA -77 to 154)
sigp 30 260 signal peptide (AA -77 to -1)
matp 261 722 PSI subunit IV (AA 1 - 154)
BASE COUNT 249 a 183 c 202 g 289 t
ORIGIN
1 cctcacttcc ttccatccaa aacacaacca tgtctttcac aatccccact aacctctaca
61 aacctcttgc taccaaacct aagcatctat cttcttcctc ctttgcacca agatcaaaga
121 ttgtttgcca acaagaaaat gatcaacaac aacctaagaa gctagagcta gctaaggtag
181 gtgccaacgc tgccgctgcg ttggctcttt cctcagtgct actttcctca tggagtgtag
241 cacccgatgc tgccatggct gacattgcag ggctaacacc atgcaaggag tccaagcagt
301 ttgctaagag ggaaaagcag gccttgaaga agttgcaagc ctcgttgaag ctgtacgcag
361 atgatagcgc accagcttta gcgatcaagg ctacaatgga gaagaccaag aagaggttcg
421 ataactatgg aaagtatggt ttgctttgtg ggtctgatgg acttcctcat ttgatagtga
481 gtggtgatca gagacactgg ggtgagttta tcacaccagg gattttgttc ttgtatattg
541 ctgggtggat tgggtgggtt ggaaggagtt atttgattgc tattagggat gagaagaaac
601 ctactcagaa ggagattatt attgatgttc ctttggcttc aagtcttctt tttagaggct
661 tcagttggcc tgttgctgct tatagagagt tgctcaatgg tgagcttgtt gacaacaact
721 tttaatttta tcatcctttt atcaaatcaa ttaatcctgt gttatattgt ctactatgta
781 tatcttaaat actccttttt ggaaacttga atgtgtgttt cgattgtttg attgattcat
841 ccggatgtaa tcatcatccg attagaattg tgtaagtgtt tgtttaatga tgctaattgg
901 aaccgacgat cttttacaaa aaa
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS ATPCPRBCLA 1790 bp ds-DNA ORG 20-FEB-1990
DEFINITION A.rosea chloroplast ribulose bisphosphate carboxylase/oxygenase
large subunit (rbcL) gene, complete cds.
ACCESSION X15924
KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE A.rosea chloroplast DNA.
ORGANISM Chloroplast Atriplex rosea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae; Atriplex rosea.
REFERENCE 1 (bases 1 to 1790)
AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
Andrews,T.J. and Whitfield,P.R.
TITLE Comparisons of rbcL genes for the large subunit of
ribulose-bisphosphate carboxylase from closely related C3 and C4
plant species
JOURNAL J. Biol. Chem. 265, 808-814 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
G.S. Fudson, 20-JUL-1989.
FEATURES from to/span description
pept 259 1686 ribulose bisphosphate carboxylase/oxygenase.
large subunit (EC 4.1.1.39)
BASE COUNT 521 a 314 c 417 g 538 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccgaaa ttagaaactt gaaaaacaac ggttacgatt gggttgcgcc atatatatga
61 aagagtatac aataatgatg tatttgacga atcaaataca tggtctatta gcgaaccgtt
121 tgattagttg ataatattaa ttgagaattt attttatgaa agattcctat aaaaggtttc
181 attaggggct aatttatgtc gagtagacct tgttgttttg ttgtaaaaat ttaaaattga
241 agttgtaggg agggacttat gtcaccacaa acagagacta aagcaaatgt tggatttaaa
301 gctggtgtta aagattacaa attgacttat tatactcctg agtatgaaac cctagatact
361 gatatcttgg cagcattccg agtaagtcct caaccgggag ttccacccga agaagcaggg
421 gctgcagtag ctgctgaatc ttctactggt acatggacaa ctgtatggac cgacggactt
481 accagtcttg atcgttacaa aggacgatgc taccacatcg agcctgttgc tggagaagaa
541 aatcaatata tttgttatgt agcatatccc ttagaccttt ttgaagaagg ttctgttact
601 aatatgttta cttccattgt gggtaacgta tttgggttca aagccctgcg tgctctacgt
661 ttggaggatt tgcgaatccc tgttgcttat gtaaaaactt tccaaggccc gcctcacggt
721 atccaagttg agagagataa attgaataag tatggccgcc ccctattggg atgcactatt
781 aaaccaaaat tggggttatc cgctaaaaac tatggtcgag cagtttatga agtccttcgt
841 ggtggacttg attttaccaa agatgatgaa aacgtgaact cccagccgtt tatgcgttgg
901 agagaccgtt tcctattttg tgccgaagct ctttataaag cacaagccga aacaggcgaa
961 atcaagggtc attacttgaa tgctactgcg ggtacatgcg aagacatgat gaaaagggct
1021 gtatttgcca gagaattggg agttcctatc gtaatgcatg actacttaac agggggattc
1081 actgcaaata ctactttgtc tcattattgc cgagataatg gtctacttct gcacatccac
1141 cgtgcaatgc acgcagttat tgatagacag aagaatcatg gtatacactt ccgtgtacta
1201 gctaaagcgt tacgtctgtc tggtggagat catattcatt ctggtaccgt agtaggtaag
1261 cttgaaggag aaagagatat tactttaggt tttgttgatt tactacgtga tgattatact
1321 gaaaaagacc gaagtcgtgg tatttatttt agtcaatctt gggtttccac accaggtgtt
1381 ctgcccgttg cttcgggagg tattcacgtt tggcatatgc ctgctctaac cgagatcttt
1441 ggggatgatt ctgtactaca gtttggtgga ggaactttag gacacccttg ggggaatgca
1501 ccgggtgctg tagcgaatcg agtagctcta gaagcatgtg tacaagctcg taatgaggga
1561 cgtgatcttg ctcgtgaggg taatacaatt attcgcgagg ctagcaaatg gagtcctgaa
1621 ctagctgctg cttgtgagat atggaaggaa atcaaatttg aattcccagc aatggataca
1681 gtttaggcta agtaattaat gttcggtctc ttagttgaat tgtaattaaa caaactcggc
1741 ccaatctttt actaaaagga ttgagccgaa tacaattatt gtagatatat
//
LOCUS ATPCPRBCLB 1786 bp ds-DNA ORG 20-FEB-1990
DEFINITION A.patula chloroplast ribulose bisphosphate carboxylase/oxygenase
large subunit (rbcL) gene, complete cds.
ACCESSION X15925
KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE A.patula chloroplast DNA.
ORGANISM Chloroplast Atriplex patula
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae; Atriplex patula.
REFERENCE 1 (bases 1 to 1786)
AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
Andrews,T.J. and Whitfield,P.R.
TITLE Comparisons of rbcL genes for the large subunit of
ribulose-bisphosphate carboxylase from closely related C3 and C4
plant species
JOURNAL J. Biol. Chem. 265, 808-814 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
G.S. Fudson, 20-JUL-1989.
FEATURES from to/span description
pept 255 1682 ribulose bisphosphate carboxylase/oxygenase.
large subunit (EC 4.1.1.39)
BASE COUNT 517 a 316 c 419 g 534 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccgaaa ttagaaactt gaaaaacaac ggttacgatt gggttgcgcc atatatatga
61 aagagtatac aataatgatg tatttgacga atcaaataca tggtctatta gcgaaccgtt
121 tgattagttg ataatattaa ttgagaattt tatgaaagat tcctataaaa ggtttcatta
181 ggggctaatt tatgtcgagt agaccttgtt gttttgttgt aaaaatttaa aattgaagtt
241 gtagggaggg acttatgtca ccacaaacag agactaaagc aagtgttgga tttaaagctg
301 gtgttaaaga ttacaaattg acttattata ctcctgagta tgaaacccta gatactgata
361 tcttggcagc attccgagta agtcctcaac cgggagttcc acccgaagaa gcaggggctg
421 cagtagctgc tgaatcttct actggtacat ggacaactgt atggaccgac gggcttacca
481 gtcttgatcg ttacaaagga cgatgctacc acatcgagcc tgttgctgga gaagaaaatc
541 aatatatttg ttatgtagca tatcccttag acctttttga agaaggttct gttactaata
601 tgtttacttc cattgtgggt aacgtatttg ggttcaaagc cctgcgtgct ctacgtttgg
661 aggatttgcg aatccctgtt gcttatgtaa aaactttcca aggcccgcct cacggtatcc
721 aagttgagag agataaattg aacaagtatg gccgccccct attgggatgc actattaaac
781 caaaattggg gttatccgct aaaaactatg gtcgagcagt ttatgaagtc cttcgtggtg
841 gacttgattt taccaaagat gatgaaaacg tgaactccca gccgtttatg cgttggagag
901 accgtttcct attttgtgcc gaagctcttt ataaagcaca agccgaaaca ggcgaaatca
961 agggtcatta cttgaatgct actgcgggta catgcgaaga catgatgaaa agggctgtat
1021 ttgccagaga attgggagtt cctatcgtaa tgcatgacta cttaacaggg ggattcactg
1081 caaatactac tttgtctcat tattgccgag ataatggtct acttctgcac atccaccgtg
1141 caatgcacgc agttattgat agacagaaga atcatggtat gcacttccgt gtactagcta
1201 aagcgttacg tctgtctggt ggagatcata ttcattctgg taccgtagta ggtaagcttg
1261 aaggagaaag agatattact ttaggttttg ttgatttact acgtgatgat tatactgaaa
1321 aagaccgaag tcgtggtatt tattttactc aatcttgggt ttccacacca ggtgttctgc
1381 ccgttgcttc gggaggtatt cacgtttggc atatgcctgc tctaaccgag atctttgggg
1441 atgattctgt actacagttt ggtggaggaa ctttaggaca cccttggggg aatgcaccgg
1501 gtgctgtagc gaatcgagta gctctagaag catgtgtaca agctcgtaat gagggacgtg
1561 atcttgctcg tgagggtaat acaattattc gcgaggctag caaatggagt cctgaactag
1621 ctgctgcttg tgagatatgg aaggaaatca aatttgaatt cccagcaatg gatacagttt
1681 aggctaagta attaatgttc ggtctcttag ttgaattgta attaaacaaa ctcggcccaa
1741 tcttttacta aaaggattga gccgaataca attattgtag atatat
//
LOCUS BSPMANN 1939 bp ds-DNA BCT 06-FEB-1990
DEFINITION Bacillus sp. beta-mannanase gene, complete cds.
ACCESSION M31797
KEYWORDS 1,4-beta-D-mannan mannanohydrolase;
mannan endo-1,4-beta-mannosidase; mannanase.
SOURCE Bacillus sp. (strain AM-001) DNA, clone pMAH5.
ORGANISM Bacillus sp.
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1939)
AUTHORS Akino,T., Kato,C. and Horikoshi,K.
TITLE Two bacillus beta-mannanases having different COOH termini are
produced in Escherichia coli carrying pMAH5
JOURNAL Appl. Environ. Microbiol. 55, 3178-3183 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 220 1761 beta-mannanase precursor (EC 3.2.1.78)
sigp 220 297 beta-mannanase signal peptide
matp 298 1758 beta-mannanase
signal 1826 1840 pot. transcription termination signal
signal 1861 1889 pot. transcription termination signal
signal 1905 1916 pot. transcription termination signal
BASE COUNT 637 a 343 c 420 g 539 t
ORIGIN 1 bp upstream of XbaI site.
1 tctagactcc aaaggttact atcaacctgt ctatttattt aactgtacag tagatggggt
61 agaatcaaac catcatcatc cctgccatct aaattcatta tatgaactcc tcaatagaga
121 acaacaaatc ataatccaac catatttttc taatcaatca ctatgttaag ataaaaaatg
181 taatcgctta caattaaaag gatagaggag gattatgtaa tgaaggtgta caagaaggtg
241 gcttttgtta tggcttttat tatgtttttt tcggtcctgc cgacgatctc aatgtcgtca
301 gaagcaaacg gtgctgcatt atcgaatcct aatgcgaacc aaacgacaaa aaacgtgtat
361 agttggttag ccaatctacc aaacaagagt aataaacgtg tggtgtcggg acacttcgga
421 gggtacagtg attctacctt agcctggatc aaacaatgcg caagggagct gacaggaaaa
481 atgccaggaa tattatcttg tgattataag aattggcaga cgcgattgta tgtagccgat
541 caaattagct atggctgcaa tcaagaatta ataaactttt ggaaccaagg aggtttggtc
601 acgatcagtg tacacatgcc aaatccaggg tttcattcgg gggaaaacta caaaacaatt
661 ttgcctactt cacagttcca aaatctaacc aatcacagga caacagaggg tagaaggtgg
721 aaggatatgc tggataagat ggcagatggg ttggacgagc tacagaacaa tggagtgacg
781 gttcttttcc gtcctttaca tgaaatgaat ggagaatggt tctggtgggg agcagaaggt
841 tacaatcaat ttgatcaaac acgtgccaat gcctatatca gcgcatggag agatatgtat
901 caatatttta ctcatgagcg taagctgaat aaccttattt gggtttactc acctgatgtt
961 tacagagatc atgtaacaag ttactaccca ggagcaaatt atgtagatat tgtggctctt
1021 gattcctacc atcctgatcc acatagcctt actgaccaat ataatcgaat gatcgcttta
1081 gataaacctt ttgcttttgc tgaaatcggt cctcctgaaa gcatggctgg ttcctttgat
1141 tattcaaatt atattcaagc aattaaacaa aaatatccac gtactgtcta tttcctagct
1201 tggaatgata aatggagtcc acataacaac agaggagcat gggatctatt taatgattca
1261 tgggttgtaa ataggggaga gattgattat ggtcaatcaa atccagccac tgttctctat
1321 gattttgaaa acaatacgct atcgtggtcc gggtgtgaat ttacggacgg aggaccatgg
1381 acttcgaatg aatggtcggc aaatggtact caatcgttga aagcagatgt cgttctgggc
1441 aataatagct accatttgca aaaaacagtg aatcgaaatc ttagttcatt caaaaaccta
1501 gaaattaaag tgagccattc ttcgtgggga aatgtaggaa gtggcatgac agcaagagtt
1561 ttcgtcaaaa cagggagtgc ttggagatgg aatgcaggtg aattttgtca gtttgcaggc
1621 aaacgaacaa ccgcactatc tattgatttg acgaaagtaa gtaatctgca tgatgttcga
1681 gagataggtg tagagtataa agcaccagca aatagcaacg ggaagacggc gatttactta
1741 gatcatgtga ccgtaagata atacaaaaaa aagtggttga aagcggtaac atatctagca
1801 tatgatgata gggactagat aataatagac tgtcagacta ggaggtaagt cataatgaaa
1861 aaaagtctga tcctcttgct cggactttta ttagctttct ccatgctatt aatagcctat
1921 ctatcattca cccctgcag
//
LOCUS FTRCPRBCLA 1839 bp ds-DNA ORG 20-FEB-1990
DEFINITION F.bidentis chloroplast ribulose bisphosphate carboxylase/oxygenase
large subunit (rbcL) gene, complete cds.
ACCESSION X15923
KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE F.bidentis chloroplast DNA.
ORGANISM Chloroplast Flaveria bidentis
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Asterales; Asteraceae; Flaveria bidentis.
REFERENCE 1 (bases 1 to 1839)
AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
Andrews,T.J. and Whitfield,P.R.
TITLE Comparisons of rbcL genes for the large subunit of
ribulose-bisphosphate carboxylase from closely related C3 and C4
plant species
JOURNAL J. Biol. Chem. 265, 808-814 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
G.S. Fudson, 20-JUL-1989.
FEATURES from to/span description
pept 280 1737 ribulose bisphosphate carboxylase/oxygenase.
large subunit (EC 4.1.1.39)
BASE COUNT 523 a 325 c 437 g 554 t
ORIGIN
1 aggggggaat ttcttagttt ttgggtgatt tttaggtatt tggataaaaa aaaaagcatt
61 gggttgcgcc atatatatga aagagtatac aataatgatg tatttgccga atcaaatacc
121 atggtctaat aatcaagcat tctgattagt tgataatttt actattagtt gggaattttg
181 tgaaaggttc ctgtaaaaag tttcattaac gcctaattca tgtcgagtag accttgttgt
241 tgtgagaatt cttaattcat gagttgtagg gagggattta tgtcaccaca aacagagact
301 aaagcaagtg ttggattcaa agctggtgtt aaagattata aattgactta ttatactcct
361 gaatatgaaa ccaaggatac tgatatcttg gcagcatttc gagtaactcc tcaacctgga
421 gttccgcctg aagaagcagg ggccgcagta gctgccgaat cttctactgg tacatggaca
481 actgtatgga ccgatggact tacgagcctt gatcgttaca aaggccgatg ctatggaatc
541 gagcctgttc ctggagaaga caatcaatat attgcttatg tagcttaccc attagacctt
601 tttgaagaag gttctgttac taacatgttt acttccattg taggtaatgt atttgggttc
661 aaagccctgc gtgctctacg tctggaagat ttgcgaatcc ctactgcgta tgttaaaact
721 ttcgcgggtc cgcctcacgg tatccaagtt gagagagata aattgaacaa gtatggtcgt
781 cccctgttgg gctgtactat taaacctaaa ttggggttat ccgctaaaaa ctacggtaga
841 gcttgttatg aatgtcttcg tggtggcctt gattttacta aagatgatga gaacgtgaac
901 tcccaaccat ttatgcgttg gagagaccgt ttcttatttt gtgccgaagc tatttataaa
961 gcacaagctg aaacaggtga aatcaaaggg cattacttga atgctactgc gggtacatgc
1021 gaagaaatgc ttaaaagggc tgtatttgcc agagaattgg gagttcctat cataatgcat
1081 gactacctaa cagggggatt cactgcaaat actagcttgt ctcattattg ccgagataac
1141 ggcctacttc ttcacatcca ccgcgcaatg catgcagtta ttgatagaca gaagaatcat
1201 ggtattcact tccgtgtact agctaaagcg ttacgtatgt ctggtggaga tcacattcat
1261 tccggtaccg tagtaggtaa acttgaaggg gaaagagaaa tcactttggg ctttgttgat
1321 ttattgcgtg atgattttat tgaaaaagat agaagtcgcg gtatttattt cacccaagat
1381 tgggtctctc taccaggtgt tctgcctgta gcttcggggg gtattcacgt ttggcatatg
1441 cctgctctga ccgagatctt tggggatgat tccgtactac agttcggtgg aggaacttta
1501 gggcaccctt gggggaatgc acctggtgcc gtagctaatc gagtagctct agaagcatgt
1561 gtacaagctc gtaatgaggg acgtgatctt gctactgagg gtaatgagat tatccgtgag
1621 gctaccaaat ggagccctga actagctgct gcttgtgaag tatggaagga gatcaaattt
1681 gagttccagg caatggatac tttggatacg gataaggata aggataagaa gagataacgt
1741 tcgttctctt aattgaattt caatgaaact cggcccaatc ttttactaaa aggattgagc
1801 cgaatccaac atgcatatat tttagataga tacatactt
//
LOCUS FTRCPRBCLB 1842 bp ds-DNA ORG 20-FEB-1990
DEFINITION F.pringlei chloroplast ribulose bisphosphate carboxylase/oxygenase
large subunit (rbcL) gene, complete cds.
ACCESSION X15922
KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE F.pringlei chloroplast DNA.
ORGANISM Chloroplast Flaveria pringlei
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Asterales; Asteraceae; Flaveria pringlei.
REFERENCE 1 (bases 1 to 1842)
AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
Andrews,T.J. and Whitfield,P.R.
TITLE Comparisons of rbcL genes for the large subunit of
ribulose-bisphosphate carboxylase from closely related C3 and C4
plant species
JOURNAL J. Biol. Chem. 265, 808-814 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
G.S. Fudson, 20-JUL-1989.
FEATURES from to/span description
pept 283 1740 ribulose bisphosphate carboxylase/oxygenase.
large subunit (EC 4.1.1.39)
BASE COUNT 529 a 324 c 437 g 552 t
ORIGIN
1 aggggggaat ttcttagttt ttgggtgatt tttaggtatt tcgataaaaa aaaaaaaaga
61 attgggttgc gccatatata tgaaagagta tacaataatg atgtatttgc cgaatcaaat
121 accatggtct aataatcaag cattctgatt agttgataat tttactatta gttgggaatt
181 ttgtgaaagg ttcctgtaaa aagtttcatt aacgcctaat tcatgtcgag tagaccttgt
241 tgttgtgaga attcttaatt catgagttgt agggagggat ttatgtcacc acaaacagag
301 actaaagcaa gtgttggatt caaagctggt gttaaagatt ataaattgac ttattatact
361 cctgaatatg aaaccaagga tactgatatc ttggcagcat ttcgagtaac tcctcaacct
421 ggagttccgc ctgaagaagc aggggccgca gtagctgccg aatcttctac tggtacatgg
481 acaactgtat ggaccgatgg acttacgagc cttgatcgtt acaaaggccg atgctatgga
541 atcgagcctg ttcctggaga agacaatcaa tatattgctt atgtagctta cccattagac
601 ctttttgaag aaggttctgt tactaacatg tttacttcca ttgtaggtaa tgtatttggg
661 ttcaaagccc tgcgtgctct acgtctggaa gatttgcgaa tccctactgc gtatgttaaa
721 actttcgagg gtccgcctca cggtatccaa gttgagagag ataaattgaa caagtatggt
781 cgtcccctgt tgggatgtac tattaaacct aaattggggt tatccgctaa aaactacggt
841 agagcttgtt atgaatgtct tcgtggtggc cttgatttta ctaaagatga tgagaacgtg
901 aactcccaac catttatgcg ttggagagac cgtttcttat tttgtgccga agctatttat
961 aaagcacaag ctgaaacagg tgaaatcaaa gggcattact tgaatgctac tgcgggtaca
1021 tgcgaagaaa tgcttaaaag ggctgtattt gccagagaat tgggagttcc tatcgtaatg
1081 catgactacc taacaggggg attcactgca aatactagct tgtctcatta ttgccgagat
1141 aacggcctac ttcttcacat ccaccgcgca atgcatgcag ttattgatag acagaagaat
1201 catggtatgc acttccgtgt actagctaaa gcgttacgta tgtctggtgg agatcacatt
1261 cattccggta ccgtagtagg taaacttgaa ggggaaagag aaatcacttt gggctttgtt
1321 gatttactgc gtgatgattt tattgaaaaa gatagaagtc gcggtattta tttcacccaa
1381 gattgggtct ctctaccagg tgttctgcct gtagcttcgg ggggtattca cgtttggcat
1441 atgcctgctc tgaccgagat ctttggggat gattccgtac tacagttcgg tggaggaact
1501 ttagggcacc cttgggggaa tgcacctggt gccgtagcta atcgagtagc tctagaagca
1561 tgtgtacaag ctcgtaatga gggacgcgat cttgctactg agggtaatga aattatccgt
1621 gaggctacca aatggagtcc tgaactagct gctgcttgtg aagtatggaa ggagatcaaa
1681 tttgagttcc aggcaatgga tactttggat acggataagg ataaggataa gaagagataa
1741 cgttcgttct cttaattgaa tttcaatgaa actcggccca atcttttact aaaaggattg
1801 agccgaatcc aacatgcata tattttagat agatacatac tt
//
LOCUS HUMHPRT1 1331 bp ss-mRNA PRI 15-DEC-1989
DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) mRNA, complete
cds.
ACCESSION M31642 J00205
KEYWORDS hypoxanthine phosphoribosyltransferase; transferase.
SOURCE Human, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1331)
AUTHORS Jolly,D.J., Okayama,H., Berg,P., Esty,A.C., Filpula,D., Bohlen,P.,
Johnson,G.G., Shively,J.E., Hunkapillar,T. and Friedmann,T.
TITLE Isolation and characterization of a full-length expressible cDNA
for human hypoxanthine phosphoribosyltransferase
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 477-481 (1983)
STANDARD full staff_review
FEATURES from to/span description
pept 86 742 hypoxanthine phosphoribosyltransferase
/nomgen="HPRT" /map="Xq26"
/hgml_locus_uid="LE0031K"
mRNA < 1 1331 hprt mRNA
BASE COUNT 385 a 237 c 281 g 428 t
ORIGIN Chromosome Xq26.
1 tcttgctgcg cctccgcctc ctcctctgct ccgccaccgg cttcctcctc ctgagcagtc
61 agcccgcgcg ccggccggct ccgttatggc gacccgcagc cctggcgtcg tgattagtga
121 tgatgaacca ggttatgacc ttgatttatt ttgcatacct aatcattatg ctgaggattt
181 ggaaagggtg tttattcctc atggactaat tatggacagg actgaacgtc ttgctcgaga
241 tgtgatgaag gagatgggag gccatcacat tgtagccctc tgtgtgctca aggggggcta
301 taaattcttt gctgacctgc tggattacat caaagcactg aatagaaata gtgatagatc
361 cattcctatg actgtagatt ttatcagact gaagagctat tgtaatgacc agtcaacagg
421 ggacataaaa gtaattggtg gagatgatct ctcaacttta actggaaaga atgtcttgat
481 tgtggaagat ataattgaca ctggcaaaac aatgcagact ttgctttcct tggtcaggca
541 gtataatcca aagatggtca aggtcgcaag cttgctggtg aaaaggaccc cacgaagtgt
601 tggatataag ccagactttg ttggatttga aattccagac aagtttgttg taggatatgc
661 ccttgactat aatgaatact tcagggattt gaatcatgtt tgtgtcatta gtgaaactgg
721 aaaagcaaaa tacaaagcct aagatgagag ttcaagttga gtttggaaac atctggagtc
781 ctattgacat cgccagtaaa attatcaatg ttctagttct gtggccatct gcttagtaga
841 gctttttgca tgtatcttct aagaatttta tctgttttgt actttagaaa tgtcagttgc
901 tgcattccta aactgtttat ttgcactatg agcctataga ctatcagttc cctttgggcg
961 gattgttgtt taacttgtaa atgaaaaaat tctcttaaac cacagcacta ttgagtgaaa
1021 cattgaactc atatctgtaa gaaataaaga gaagatatat tagtttttta attggtattt
1081 taatttttat atatgcagga aagaatagaa gtgattgaat attgttaatt ataccaccgt
1141 gtgttagaaa agtaagaagc agtcaatttt cacatcaaag acagcatcta agaagttttg
1201 ttctgtcctg gaattatttt agtagtgttt cagtaatgtt gactgtattt tccaacttgt
1261 tcaaattatt accagtgaat ctttgtcagc agttcccttt taaatgcaaa tcaataaatt
1321 cccaaaaatt t
//
LOCUS NRACPRBCLA 2010 bp ds-DNA ORG 20-FEB-1990
DEFINITION N.tenuifolia chloroplast ribulose bisphosphate
carboxylase/oxygenase (rbcL) gene, complete cds.
ACCESSION X15920
KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE N.tenuifolia chloroplast DNA.
ORGANISM Chloroplast Neurachne tenuifolia
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Graminneae; Neurachne tenuifolia.
REFERENCE 1 (bases 1 to 2010)
AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
Andrews,T.J. and Whitfield,P.R.
TITLE Comparisons of rbcL genes for the large subunit of
ribulose-bisphosphate carboxylase from closely related C3 and C4
plant species
JOURNAL J. Biol. Chem. 265, 808-814 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
G.S. Fudson, 20-JUL-1989.
FEATURES from to/span description
pept 438 1874 ribulose bisphosphate carboxylase/oxygenase.
large subunit (EC 4.1.1.39)
BASE COUNT 614 a 345 c 453 g 598 t
ORIGIN
1 agtagcccga tataaaataa ataaaaagaa aggggcctat taagaacttt caaaattgta
61 aaataaggat tagggattgg tttgggttgc gctatatcta tcaaagagta tacaataatg
121 atggatttgg tgaatcaaat ccatggttta ataacaaatc gtgttaactt accataacaa
181 caactcaatt cctatcgaat tcctatagta gaattcctac aggatagaac gtacacaggg
241 tgtacgcatt atatatgaat gaaacatatt cattaactta agcatactcc tttttttatt
301 taatgagttg atattaattg aatatctttg tttttttttt tttagatttt tgcaaaggtt
361 tcatttacgc ctaatccata tcgagtagac cctgtcgttg tgagaattat taattaatga
421 gttgtaggga gggacttatg tcaccacaaa cagaaactaa agcaagtgtt ggatttaaag
481 ctggtgttaa ggattataaa ttgacttact acaccccgga gtacgaaacc aaggatactg
541 atatcttggc agcatttcga gtaactcctc agcccggggt tccgcctgaa gaagcagggg
601 ctgcagtagc tgcggaatct tctactggta catggacaac tgtttggact gatggactta
661 ccagtcttga tcgttacaaa ggacgatgct atcacatcga gcccgttcct ggggaggaag
721 atcaatatat ctgttatgta gcttatccat tagacctatt tgaagagggt tctgttacta
781 acatgtttac ttccattgtg ggtaacgtat ttggtttcaa agccctacgc gctctacgtt
841 tggaggatct acgaattccc cctacttatt caaaaacttt ccaaggtccg cctcacggta
901 tccaagttga aagggataag ttgaacaagt atgggcgtcc tttattggga tgtactatta
961 aaccaaaatt gggattatct gcaaaaaatt acggtagagc gtgttatgag tgtctacgcg
1021 gtggacttga ttttaccaaa gatgatgaaa acgtaaactc acaaccattt atgcgctgga
1081 gagaccgttt tgtcttttgt gcggaagcaa tttataaagc acaagccgaa accggtgaaa
1141 tcaagggaca ttacttgaat gcgactgcag gtacatgcga agaaatgatg aagagagcta
1201 tatttgcaag agaattaggg gttcctattg taatgcatga ctacttaact ggaggattca
1261 ccgcaaatac tagtttggct atgtattgcc gcgacaacgg cctacttctt cacattcacc
1321 gagcaatgca tgcagttatt gatagacaga aaaatcatgg tatgcatttc cgtgtattag
1381 ctaaagcatt gcgtatgtcg gggggagatc atgtccacgc cggtacagta gtaggtaagt
1441 tagaagggga acgtgaaata actttaggtt ttgttgattt attgcgcgat gattttattg
1501 aaaaagatcg atctcgcggt gtctttttca ctcaggactg ggtatccatg ccaggtgtta
1561 taccggtggc ttcagggggt attcatgttt ggcatatgcc agctctgacc gaaatctttg
1621 gagacgattc cgtattacaa tttggtggag gaactttagg acatccttgg ggaaatgcac
1681 ctggtgcagc agctaatcgt gtggctttag aagcctgtgt acaagctcgt aacgaagggc
1741 gtgatcttgc tcgtgaaggt aatgaaatta tcaaagcagc ttgcaaatgg agtcctgaac
1801 tagccgcagc ttgtgaagta tggaaggcga tcaaatttga gttcgcgccg gtggataccg
1861 tagataaggt ctaaataaaa taaaaaagaa gtaaaataga aagagaaaaa atcagttacg
1921 aaatgcagta attcttcttt attcttctaa ttgattgcaa ttaaattcgg ctcaatcttt
1981 taaaagattg agccgaattt aaatagatct
//
LOCUS NRACPRBCLB 1990 bp ds-DNA ORG 20-FEB-1990
DEFINITION N.munroi chloroplast ribulose bisphosphate carboxylase/oxygenase
large subunit (rbcL) gene, complete cds.
ACCESSION X15921
KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE N.munroi chloroplast DNA.
ORGANISM Chloroplast Neurachne munroi
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Gramineae; Neurachne munroi.
REFERENCE 1 (bases 1 to 1990)
AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
Andrews,T.J. and Whitfield,P.R.
TITLE Comparisons of rbcL genes for the large subunit of
ribulose-bisphosphate carboxylase from closely related C3 and C4
plant species
JOURNAL J. Biol. Chem. 265, 808-814 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
G.S. Fudson, 20-JUL-1989.
FEATURES from to/span description
pept 418 1854 ribulose bisphosphate carboxylase/oxygenase.
large subunit (EC 4.1.1.39)
BASE COUNT 613 a 340 c 445 g 592 t
ORIGIN
1 agtagcccga tataaaataa ataaaaagaa aggggcctat taagaacttt caaaatccta
61 aaataaggat tagggattgg tttgggttgc gctatatcta tcaaagagta tacaataatg
121 atggaatcaa atccatggtt taataacgaa tcgtgttaac ttaccataac aacaactcaa
181 ttcctataga attcctatag gatagaacgt acacagggtg tacgcattat atatgaatga
241 aacatattca ttaacttaag catactcctt tttttattta atgagttgat attaattgaa
301 tatttttgaa tatttttttt tttagatttt tgcaaaggtt tcatttacgc ctaatccata
361 tcgagtagac cttgtcgttg tgagaattat taattaatga gttgtaggga gggacttatg
421 tcaccacaaa cagaaactaa agcaagtgtt ggatttaaag ctggtgttaa ggattataaa
481 ttgacttact acaccccgga gtacgaaacc aaggatactg atatcttggc agcatttcga
541 gtaactcctc agcccggggt tccgcctgaa gaagcagggg ctgcagtagc tgcggaatct
601 tctactggta catggacaac tgtttggact gatggactta ccagtcttga tcgttacaaa
661 ggacgatgct atcacatcga gcccgttcct ggggaagaag atcaatatat ctgttatgta
721 gcttatccat tagacctatt tgaagagggt tctgttacta acatgtttac ttccattgtg
781 ggtaacgtat ttggtttcaa agccctacgc gctctacgtt tggaggatct acgaattccc
841 cctacttatt caaaaacttt ccaaggtccg cctcacggta tccaagttga aagggataag
901 ttgaacaagt atgggcgtcc tttattggga tgtactatta aaccaaaatt gggattatct
961 gcaaaaaatt acggtagagc gtgttatgag tgtctacgcg gtggacttga ttttaccaaa
1021 gatgatgaaa acgtaaactc acaaccattt atgcgctgga gagaccgttt tgtcttttgt
1081 gcggaagcaa tttataaagc acaagccgaa accggtgaaa tcaagggaca ttacttgaat
1141 gcgactgcag gtacatgcga agaaatgatg aagagagctg catttgcaag agaattaggg
1201 gttcctattg taatgcatga ctacttaact ggaggattca ccgcaaatac tagtttgtct
1261 atgtattgcc gcgacaacgg cctacttctt cacattcacc gagcaatgca tgcagttatt
1321 gatagacaga aaaatcatgg tatccatttc cgtgtattag ctaaagcatt gcgcatgtcg
1381 gggggagatc atgtccactc cggtacagta gtaggtaagt tagaagggga acgtgaaata
1441 actttaggtt ttgttgattt attgcgcgat gattttattg aaaaagatcg atctcgcggt
1501 gtctttttca ctcaggactg ggtatccatg ccaggtgtta taccggtggc ttcagggggt
1561 attcatgttt ggcatatgcc agctctgacc gaaatctttg gagacgattc tgtattacaa
1621 tttggtggag gaactttagg acatccttgg ggaaatgcac ctggtgcagt agctaatcgt
1681 gtggctttag aagcctgtgt acaagctcgt aacgaagggc gtgatcttgc tcgtgaaggt
1741 aatgaaatta tcaaagcagc ttgcaaatgg agtcctgaac tagccgcagc ttgtgaagta
1801 tggaaggcga tcaaatttga gttcgagccg gtggataccg tagataaggt ctaaataaaa
1861 taaaaaagaa gtaaaataga aagagaaaaa ataagttacg aaatgcagta attcttcttt
1921 attcttctaa ttgattgcaa ttaaattcgg ctcaatcttt taaaagattg agccgaattt
1981 aaatagatct
//
LOCUS RATPGKXL 1675 bp ss-mRNA ROD 07-FEB-1990
DEFINITION Rat X-chromosome linked phosphoglycerate kinase mRNA, complete cds.
ACCESSION M31788
KEYWORDS phosphoglycerate kinase.
SOURCE Rat liver, cDNA to mRNA, clones 6, 15, and 16.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1675)
AUTHORS Ciccarese,S., Tommasi,S. and Vonghia,G.
TITLE Cloning and cDNA sequence of the rat X-chromosome linked
phosphoglycerate kinase
JOURNAL Biochem. Biophys. Res. Commun. 165, 1337-1344 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 41 1294 phosphoglycerate kinase
BASE COUNT 426 a 359 c 459 g 431 t
ORIGIN 289 bp upstream of PstI site.
1 ggcctctctc cgggcctctc gacctcacgg tgttgccaaa atgtcgcttt ctaacaagct
61 gactttggac aagctggacg tgaaggggaa gcgggtcgtg atgagggtgg acttcaatgt
121 tcctatgaag aacaaccaga taacgaataa ccaaaggatc aaggctgctg tcccaagcat
181 caaattctgc ttggacaatg gagccaactc ggttgtgctt atgagccacc tgggccgtcc
241 tgatggtgtg cccatgcccg acaagtactc cttagagcca gttgctgcag aactcaaatc
301 tctgctgggc aaggatgttc tgttcttgaa ggattgtgtg ggctcagaag tagagaatgc
361 ctgtgccaac ccagcggctg ggactgtcat cctcctggag aacctccgct ttcatgtaga
421 ggaagaaggg aagggaaaag atgcttctgg gaacaaggtt aaagctgagc cagctaaaat
481 tgatgctttc cgagcctccc tgtccaaact tggagatgtc tatgtcaatg atgcttttgg
541 gactgcacac agagcccaca gttccatggt gggtgtgaat ctgccacaga aggctggtgg
601 atttttgatg aagaaggagc tgaactactt tgccaaggct ttggagagtc cagagcgacc
661 cttcctggct atcttgggag gagctaaagt tgcagacaag atccagctga tcaataatat
721 gctagacaaa gtcaatgaga tgatcatcgg tgggggaatg gcttttacct tccttaaggt
781 gctcaacaac atggagattg gcacatctct gtatgatgaa gagggagcca agattgtcaa
841 agatctcatg accaaagctg agaaaaatgg tgtgaagatt accttgcctg ttgactttgt
901 cactgctgac aaatttgatg agaatgcaaa gactggccaa gctactgtgg cctctggtat
961 acctgctggc tggatgggct tggactgtgg tactgagagc agtaagaaat atgctgaggc
1021 tgtggctcga gctaagcaga ttgtttggaa cggccctgtt ggggtatttg aatgggaagc
1081 atttgccagg ggaaccaagt ccctcatgga tgaggtggtg aaagccacgt ctaggggctg
1141 catcactatc ataggtggtg gagacaccgc cacttgctgt gccaaatgga acacagagga
1201 taaagtcagc catgtgagca ctgggggcgg cgccagtttg gagctcctgg aaggtaaagt
1261 ccttcctggg gtggatgctc tcagcaatgt ttagtatttt cctgcctttt ggttcctgtg
1321 cacagcccct aagtcgactt agtgttttcc gcatctccat ttggtgttag tgcagctagt
1381 ggccaagacg agcaccagga acctaagcag cagctgcaca gcatctcagc tcgtctttac
1441 tgcatgggat tcattatagt taagtccatt taatttctta gtgactaaac cttgcattaa
1501 ggcatatatt tatactctgc ctgtgaaagg aagtgagctg taaaagctta gctctcttcg
1561 ctgtatgtag cctctggtta gcctgtcact gttcatgaca gcatggaaat aacgatgagg
1621 attccagctg taggtttggg aggttgatga tattgaacca ataaaagatc ccact
//
LOCUS RATQRE1 504 bp ds-DNA ROD 20-FEB-1990
DEFINITION Rat liver NAD(P)H:quinone reductase gene, exon 1.
ACCESSION M31801
KEYWORDS NAD(P)H:quinone reductase.
SEGMENT 1 of 5
SOURCE Rat cell line H5-6 DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 504)
AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone
reductase gene expression by planar aromatic compounds and
determination of the exon structure of the quinone reductase
structural gene
JOURNAL J. Biol. Chem. 264, 21793-21797 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 325 + 331 NAD(P)H:quinone reductase, exon 1
pre-msg 206 > 504 NAD(P)H:quinone reductase mRNA and introns
IVS 332 > 504 NAD(P)H:quinone reductase intron A
BASE COUNT 112 a 150 c 127 g 115 t
ORIGIN
1 actctcaagc tcccagaaaa cccttttgac ttccatatac agaagtccta ggccagccct
61 aaactgcttc tccctgccaa taacttggta tcctcccccc agcgcctctg ggctcggcaa
121 tccagccccg ccctcgctgg ctgccctgca cagtgggctg ggccggaaaa gcaagatata
181 aagcctgaaa gtgctcagta cagctcgcac tagcctaggc tgtggcacgc aggatctttc
241 cgaagcattt cagggtcgtc ctggcaacca gctgctcagc caatcagcgc ttgacactac
301 gatccgcccc caacttctgg agccatggcg ggtgagtatg gctccaactc cagcctaatt
361 catcctgagg aggatgtagg ggcttgctat ggggtttgtt ccttgcctcg aagttgaaaa
421 gtgtagagat taggatcctg gatgagcctc ggtgagtccc cggaaggaga gcttcttctc
481 agaaccatag gtgcagatta ttct
//
LOCUS RATQRE2 634 bp ds-DNA ROD 20-FEB-1990
DEFINITION Rat liver NAD(P)H:quinone reductase gene, exons 2 and 3.
ACCESSION M31802
KEYWORDS NAD(P)H:quinone reductase.
SEGMENT 2 of 5
SOURCE Rat liver cell line H5-6 DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 634)
AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone
reductase gene expression by planar aromatic compounds and
determination of the exon structure of the quinone reductase
structural gene
JOURNAL J. Biol. Chem. 264, 21793-21797 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 167 331 NAD(P)H:quinone reductase, exon 2
402 + 532 NAD(P)H:quinone reductase, exon 3
pre-msg < 1 > 634 NAD(P)H:quinone reductase mRNA and introns
IVS < 1 166 NAD(P)H:quinone reductase intron A
IVS 332 > 401 NAD(P)H:quinone reductase intron B
IVS 533 > 634 NAD(P)H:quinone reductase intron C
BASE COUNT 156 a 158 c 161 g 159 t
ORIGIN Undtermined number of bp after segment 1.
1 ctttagattt tcagcttaaa gacgtttagt tgataatgca gccactgttt cccgctctga
61 gaaactaagg tggggaacgt gtctggtccc aagcactttt agattaggga ctcacccgtc
121 ctgtttggat tttctttcct cacctcctca cgtacgcctt aaacagtgag aagagccctg
181 attgtattgg cccacgcaga gaggacatca ttcaactatg ccatgaagga ggctgctgtg
241 gaggctctga agaagaaagg atgggaggtg gtcgaatctg acctctatgc tatgaacttt
301 aaccccctca tttccagaaa cgacatcaca ggtaagaatc gtctccctcc actgacagtg
361 gaccacgtga cccagcctca gcccctcttg cctcccaaca ggggagccga aggactcgga
421 gaactttcag taccctgttg agtcatctct ggcgtataag gaaggccgcc tgagcccgga
481 tattgtagct gaacagaaaa agctggaagc tgcagacctg gtgatatttc aggtacctca
541 taggacggaa gtcatgagag cagtgtctag ggtgtctgtc cttattgtcc ctagagccct
601 ctgcattcta agtggtgtca gaaagtgtgc ggct
//
LOCUS RATQRE3 248 bp ds-DNA ROD 20-FEB-1990
DEFINITION Rat liver NAD(P)H:quinone reductase gene, exon 4.
ACCESSION M31803
KEYWORDS NAD(P)H:quinone reductase.
SEGMENT 3 of 5
SOURCE Rat liver cell line H5-6 DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 248)
AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone
reductase gene expression by planar aromatic compounds and
determination of the exon structure of the quinone reductase
structural gene
JOURNAL J. Biol. Chem. 264, 21793-21797 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 27 + 140 NAD(P)H:quinone reductase, exon 4
pre-msg < 1 > 248 NAD(P)H:quinone reductase mRNA and introns
IVS < 1 26 NAD(P)H:quinone reductase intron C
IVS 141 > 248 NAD(P)H:quinone reductase intron D
BASE COUNT 48 a 52 c 74 g 74 t
ORIGIN Undtermined number of bp after segment 2.
1 tgacctcttg gtcttctgtc ccacagttcc cattgtattg gtttggggtg cccgccattc
61 tgaaaggctg gtttgagaga gtgcttgtag caggattcgc ctacacgtat gccaccatgt
121 atgacaaggg tcctttccag gtaggtgaac tgttgggtat ttcctgagta gattctgccc
181 tgacatcaca gatggactct taagaccggg ttggggtggg gttggggggt gcactgtaag
241 cttataaa
//
LOCUS RATQRE4 199 bp ds-DNA ROD 20-FEB-1990
DEFINITION Rat liver NAD(P)H:quinone reductase gene, exon 5.
ACCESSION M31804
KEYWORDS NAD(P)H:quinone reductase.
SEGMENT 4 of 5
SOURCE Rat liver cell line H5-6 DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 199)
AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone
reductase gene expression by planar aromatic compounds and
determination of the exon structure of the quinone reductase
structural gene
JOURNAL J. Biol. Chem. 264, 21793-21797 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 63 + 164 NAD(P)H:quinone reductase, exon 5
pre-msg < 1 > 199 NAD(P)H:quinone reductase mRNA and introns
IVS < 1 62 NAD(P)H:quinone reductase intron D
IVS 165 199 NAD(P)H:quinone reductase intron E
BASE COUNT 41 a 55 c 53 g 50 t
ORIGIN Undetermined number of bp after segment 3.
1 cccccttcat taactcacgg ggaccctgga gaggagcggg tgatacttgt tttctctttc
61 agaataagaa gaccttgctt tccatcacca ccgggggcag cggctccatg tactctctgc
121 agggtgtcca cggggacatg aacgtcattc tctggccaat tcaggtagct cctcacgggg
181 aacttccaga ggagtttgt
//
LOCUS RATQRE5 964 bp ds-DNA ROD 20-FEB-1990
DEFINITION Rat liver NAD(P)H:quinone reductase gene, exon 6.
ACCESSION M31805
KEYWORDS NAD(P)H:quinone reductase.
SEGMENT 5 of 5
SOURCE Rat liver cell line H5-6 DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 964)
AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone
reductase gene expression by planar aromatic compounds and
determination of the exon structure of the quinone reductase
structural gene
JOURNAL J. Biol. Chem. 264, 21793-21797 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 57 362 NAD(P)H:quinone reductase, exon 6
pre-msg < 1 > 964 NAD(P)H:quinone reductase mRNA and introns
IVS < 1 56 NAD(P)H:quinone reductase intron E
site 727 757 region of dyad symmetry
BASE COUNT 222 a 229 c 240 g 273 t
ORIGIN Undetermined number of bp after segment 4.
1 tgcttacagt gcagactcac tcccgtgcct tacctctgtg ttctgtgtct ctccagagtg
61 gcattctgcg cttctgtggc ttccaggtct tagaacctca actggtgtac agcattggcc
121 acaccccacc cgatgcccgc gtgcaggtcc tggaagggtg gaagaagcgt ctggagactg
181 tctgggagga gtcaccactc tactttgctc caagcagctt gtttgaccta aacttccagg
241 caggattctt actgaaaaaa gaggttcaag aggagcagaa aaagaacaag tttggccttt
301 ctgtgggcca tcatttgggc aagtccattc cagccgacaa ccagatcaaa gctagaaaat
361 aaggttttcc ataccatgta gttagaccca ggtttctttt tccccagttt gcctggcttg
421 ctttcagttt tcgcctttgt tccacaagga taggaaaagg aggaggctcg cctcatgcgt
481 ttttggatag ttctgccacg gcgtgtgaca gcaaaatgaa cgaggtcaga ttaggggcct
541 caggtggcct gggatatgaa tcagggagag gtgtagccgc gagcggggaa aatactcttc
601 taggtctttt gtacactata agcttttttc ttcgggctag cctggctaaa tggcatccaa
661 tcctccaccc acttgttgct attagttacc tctctgtggt ttagggcagg agggaattgc
721 tcaaacaatg gctgagggac taacttgttt agcagttagc taaagcctgt ttatgatcca
781 tcctggtttc aattactgtg cagtgactga caagcctcgg gggattgctc tccagctctt
841 ctctgccttg tacatagcac acccaggtcc tgggaaatga atacaaaaac aggtctccgc
901 ctcattcttg ctattctttc tttgtgtgtg tgtgtgtgga aataaatgga tatttcacac
961 gtca
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS FPLTRAC 3227 bp ds-DNA BCT 05-FEB-1990
DEFINITION Plasmid F TraC protein (traC) gene, complete cds and TrbI protein
(traI) gene, partial cds.
ACCESSION M30936
KEYWORDS TraC protein; TrbI protein.
SOURCE plasmid F DNA, clone PKAS2.
ORGANISM Plasmid F
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 3227)
AUTHORS Schandel,K.A., Maneewannakul,S., Vonder Haar,R.A.,
Ippen-Ihler,K. and Webster,R.E.
TITLE Nucleotide sequence of the F plasmid gene traC and identification
of its product
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Printed copy of sequence [1] kindly submitted by R.E.Webster,
20-DEC-1989
FEATURES from to/span description
pept 268 2895 TraC protein (gtg start codon)
pept 2892 > 3227 TrbI protein
binding 256 261 Shine-Delgarno sequence
BASE COUNT 785 a 827 c 900 g 715 t
ORIGIN
1 gcatgcggaa atcctattcc ggaagcccgg cggaaaatat ttcccggtgt gacgttgtgc
61 gttgaatgtc aggcgtatca ggaaagacag agaaaacatt atgcataagt cagtcgcaga
121 acatagtgat ttaattccgg atgaacatga gtggatattc agaaaacaga aatctttgta
181 tatgcgccgg gaaatggcgc gttaattaca ggtattccct tcatggctac cgcatttctc
241 gctttatttt tcaactaagg aattcatgtg aataacccac ttgaggccgt cactcaggcg
301 gttaactccc tcgtcacagc actgaaactg cctgacgaat ccgcaaaggc caatgaagtt
361 ctgggcgaaa tgagcttccc gcagttcagc cgtctgctgc cgtaccgtga ttacaaccag
421 gaatccggtc tgttcatgaa tgacaccacg atgggcttta tgctggaagc cattcccatc
481 aatggggcga atgagtccat tgtggaggcc ctcgatcata tgctgcgcac caaactgccg
541 cgcggtattc cgttgtgtat ccatctgatg tccagtcagc tggttggtga caggattgaa
601 tacgggctgc gtgagttctc ctggtctggc gaacaggccg aacggtttaa cgcgattacc
661 cgggcctatt atatgaaagc ggcagcgaca cagtttccgc tgccggaggg aatgaatctg
721 cccctgaccc tgcgccatta ccgggtgttt atctcgtact gttctccctc gaagaaaaaa
781 agccgggccg acattctgga aatggaaaac ctggtgaaaa tcatccgggc gtcgttacag
841 ggggccagta tcaccacaca gacggtggat gcacaggcct ttatcgatat tgtcggggag
901 atgattaacc ataacccgga ttccctgtac ccgaaaagac gtcagctgga cccgtattct
961 gatctgaatt atcagtgtgt ggaggacagt tttgatctga aagttcgggc tgattacctg
1021 acactgggcc tgcgtgagaa cggcaggaac agcacggccc gcatcctgaa tttccatctg
1081 gcccgtaacc cggaaatcgc cttcctgtgg aacatggccg acaactacag caacctgctg
1141 aacccggaac tgtccatctc ctgtccgttc atcctgacgc tgactctggt ggtggaagac
1201 caggtgaaaa cccacagcga agccaacctg aagtacatgg acctggagaa aaagtcgaag
1261 acctcctatg ccaaatggtt tccgtccgtg gagaaagagg cgaaggagtg gggggaactg
1321 cgtcagcggc tgggctccgg tcagtcctct gtcgtgtcct acttcctcaa catcacagcc
1381 ttctgcaagg acaataatga aacggcactg gaagtggagc aggacatcct gaacagcttc
1441 cgtaaaaacg gttttgagct gatttcaccg cgctttaacc acatgcgcaa tttcctgacc
1501 tgtctgccct ttatggccgg gaaagggttg tttaaacagc tgaaagaggc cggagtggta
1561 cagcgcgcag agagctttaa tgtggccaac ctgatgccgt tagtggcgga taaccccctg
1621 acaccggcag gtctgctggc acccacctac cgtaaccagc tggcgtttat cgatattttc
1681 ttccggggga tgaataacac caactataac atggcggtct gtggcacctc cggggccggt
1741 aaaaccgggc tgatacagcc acttatccgc agcgtgctgg actccggagg ctttgccgtg
1801 gtgttcgaca tgggggatgg atacaagtcc ctgtgtgaga acatgggcgg ggtgtatctg
1861 gacggtgaaa ccctgcgttt taacccgttt gcgaacatca ccgatattga ccagtcagcg
1921 gagcgtgtcc gtgaccagtt gtcagtgatg gccagcccca acggtaacct ggatgaagtg
1981 catgaaggtc tgctgttgca ggcggtcaga gcctcctggc tggccaaaga gaacagagca
2041 cgtattgatg acgtggtgga tttcctgaaa aacgccagtg acagcgagca gtatgccgag
2101 tcaccgacta tccgcagccg tctggacgaa atgattgtgc tgcttgacca gtacactgcc
2161 aacggcactt acggccagta ttttaactct gatgagccgt ccctgcggga tgacgcaaaa
2221 atggtggtgc tggagctggg cggactggaa gaccgtccgt cactgctggt tgcggtgatg
2281 ttctccctga ttatctacat cgagaacagg atgtaccgca cgccgcgtaa cctcaagaaa
2341 ctgaacgtta ttgatgaagg ctggcgtctg ctggacttca aaaatcacaa ggtcggtgaa
2401 tttattgaga aaggctaccg tacggcccgc cgtcataccg gtgcctatat caccatcaca
2461 cagaacatcg tcgactttga ctctgataag gcctccagtg ctgcccgcgc ggcatggggt
2521 aactcctcct acaaaattat cctcaaacag agtgcgaagg agttcgcgaa atacaaccaa
2581 ctgtatccgg accagttcct gccactgcag cgcgacatga ttggtaagtt tggtgcggcc
2641 aaagaccagt ggttcagttc cttcctgctg caggtggaaa accattcctc ctggcaccgt
2701 ctgtttgtgg acccgttaag ccgcgccatg tacagctctg acggcccgga ttttgagttt
2761 gtgcagcaga agcgtaaaga ggggctgagt attcatgagg cagtgtggca gctggcgtgg
2821 aagaagtcag ggccggaaat ggcttcgctg gaagcctggc tggaagaaca tgagaaatac
2881 aggagtgtgg catgagttca acgcagaaac ccgccgacgt cacggcagaa cgacgcagcc
2941 actggtggtg gacggtgccg ggatgcctgg caatggtgtt actgaacgca gccgtcagtt
3001 acggcattgt cagactgaac gcaccggtga ctgtcgcctt caacatgaaa cagaccgtgg
3061 atgcgttttt tgacagcgcc agtcagaaac aactgtcaga agcgcagtca aaagcccttt
3121 cagcgcgttt taacacggca ctggaagcca gcctgcaggc gtggcagcag aaacaccacg
3181 cggtcattct ggtgtcgcct gctgtggtac agggggcacc ggatatc
//GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS DDIGP24 3000 bp ds-DNA INV 12-SEP-1989
DEFINITION D.discoideum glycoprotein 24 A and B (GP24A and GP24B) genes,
complete cds.
ACCESSION M27588
KEYWORDS adhesion protein; glycoprotein.
SOURCE D.discoideum DNA, clones DG[2,3].
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 3000)
AUTHORS Loomis,W.F. and Fuller,D.L.
TITLE A pair of tandemly repeated genes code for gp24, a putative
adhesion protein of Dictyostelium discoideum
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 886-890 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.F.Loomis, 08-SEP-1989, for release after publication.
FEATURES from to/span description
pept 1072 1132 glycoprotein 24A (GP24A), exon 1
1234 1481 glycoprotein 24A (GP24A), exon 2
pept 2585 2645 glycoprotein 24B (GP24B), exon 1
2747 2997 glycoprotein 24B (GP24B), exon 2
pre-msg 1023 > 2997 GP24A mRNA and intron
pre-msg 2533 > 2997 GP24B mRNA and intron
IVS 1133 1233 GP24A intron A
IVS 2646 2746 GP24B intron A
signal 812 817 GP24A G box
signal 822 826 GP24A G box
signal 994 999 GP24A TATA box
signal 2334 2339 GP24B G box
signal 2342 2346 GP24B G box
signal 2504 2509 GP24B TATA box
BASE COUNT 1123 a 285 c 323 g 1269 t
ORIGIN
1 actccttttg atttggttgg cgttaatgaa tttgaatttt taattaattc tgattttaat
61 tttaatgatt cctgagaaat aaaatattag tattataatt attataatta ttattattac
121 tataatatat aaatataaat aatattaata aaaatatttt acaatattct ctatctcttc
181 taaaatttgt ttaattggtt tctttaataa attcgttaaa ttatatattt gattttgata
241 aagaggttga atattctcac tattttgaga ttttactact ctagtaattc tgtgacctga
301 aatgatgtaa tatttatacg cttttgtgat cctaatggta ttgttgatgt tattgttgtt
361 gttgttgttg ttgttgttgt tgttgttgtt gttgtcgata atgatggtgg attcgatttc
421 ttttcaaatg agatcgaatt tgttttgaat cttttattgc tattatcatt actattatta
481 tctacattct catcattaat tatatcattt gtatcactta agtttctttt tgtacctaac
541 atgtttttat tatttgttgt atctattaca acatttgaaa aaaataaagt tgaataaaaa
601 aaaaaaaaaa aaaaaaaaaa gactttattt tttaatttgg gcgccaattc aaaaaaaaaa
661 aaaaaaaaaa aattaatttg ggaaatacag tttttttttt ttatgcccac catttgacat
721 agataacctt ttataaataa taataaaacc tctaatttaa actgtttata aaaaaatggt
781 ggaatgataa aaaaaaaaaa actgttttga ttgtggggga ggggtgaaga tgattgtcat
841 tttattttgt tttatttatt attttttttt ttaatgggga tacaagtgag atataaatta
901 caataattta ttattaaaat aaaacaatta attttttttt tctttttttt ttatttttta
961 tttttttctt tttatttttt ttttttaatt atttaataaa attttttttc ctcattcaat
1021 ttaaatataa ttactcttaa aaataaataa ataataataa taaattttaa aatggttgat
1081 ttgaaaatta cattagttaa tgaagatggg gaatcaacaa taagcggtaa aggtaattgg
1141 aataattaat ttctaaatat ttaattttat ttatatatat ttaaaaatct aacatatcat
1201 ttcatttcat tctggctctt tttaaaaata taggtcatcc attaccagct cccttaatat
1261 ttcctccaat atatattttc cgttttaccc aataccaaac tgaaggtaaa ctttgggaca
1321 aaaatgaatt tcaaataaaa tcaggtaaaa ttgaattcga cggagaagaa tatgatattc
1381 ctgaatctaa aggcacatgg agtaaagatg atgaggaaaa cgcaatcgat gttaatttac
1441 atctatttag accaccagaa aaatttttcc caaaaaatta aagatcttaa aacagatctg
1501 atacaatgaa aacaaagaaa taaaaaattt cattaatctt attttttcat catatataat
1561 ttctatgttt ttaaaattta attatctgac atttaaataa ataagaaaca tatatgatgt
1621 aaatgtatta ggtggttttg gttttcgttt gttcaattga ggaaacataa gataaattta
1681 tctgacattt aaataaataa gaaacatata tgatgcaaat gtatttggag ttttgaaatt
1741 aagaaattca acatcatatt aggtttaata atttacattt ctctgaaatg ggagatatta
1801 ttcatcatat agttatacca aacattcagt tattgcaatt acattctttc cacaaaaagt
1861 tttatttata atttgtttta tttgatttta attttgcaga tttgttgtag tgagttttca
1921 cactaaattt tttttttttt tttttttaaa ttaaaaacaa ttaatttaac caaaacaata
1981 aaaaaaacta tttataaaca gttatttgat tgttttaaga ttaaataaca atttacttta
2041 aaatttgcgc caaatttttt ttttttaatt attttttttt ttttctatta tttttttttt
2101 tttttttgct agtcagattt tagcaaattt tagaaatcaa attcaatgat tggtaaaaat
2161 caagattttc aatcttttaa taatatatta cagcatcaac aacactcacc aaattaccat
2221 atcacaaatt attgttatca gtttaataat tgacatagaa taaacttttt ttaaataata
2281 ataaaacctc taatttaaac tgtttataaa aaaaaaaaaa aaactgtttt gattgtggga
2341 ggggtgaaga tgattgtcat tttattttat tttatttatt attttttttt aatggggata
2401 caagtgagat ataaattaca ataatttatt attaaaaaaa acaatttttt tatttttttt
2461 tttttatttt tttttttctt ttttttattt ttttttaatt atttaataaa attttttttc
2521 aacattcaat ttaaatataa ttactcttaa aaataaataa ataataataa taataaattt
2581 taaaatgact gatttgaaaa ttacattagt taatgaagat ggggaatcaa caataagtgg
2641 taaaggtaat tggaataatt aatttctaaa tatttaattt tatttatata tatttaaaaa
2701 tctaacatat catttcattt tattctggat tttttataaa atataggtca tccattacca
2761 gctcccctaa tatttcctcc aatatattgt ttctgtttta tccaatataa aactgaaggt
2821 aaactttggg acaaaaatga ttttcaaata aaatcaggta aaattgaatt cggcggagaa
2881 gaatatgata ttactgaatc taaaggcaca tggagtaaag atgatgaaga aaaccacatc
2941 aaagttagtt tacatctaat tgtaccacca aaaaaaattt tccaaaaaaa tttttagatc
//GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS BSUHEMAC 3795 bp ds-DNA BCT 15-FEB-1990
DEFINITION B.subtilis delta-aminolevulinate synthase (hemA) uroporphyrinogen I
synthase (hemC) genes, complete cds.
ACCESSION M32130
KEYWORDS delta-aminolevulinate synthase; uroporphyrinogen I synthase.
SOURCE B.subtilis DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 3795)
AUTHORS Petricek,M., Rutberg,L., Schroeder,I. and Hederstedt,
TITLE Cloning and characterization of the hemA region of the Bacillus
subtilis chromosome
JOURNAL Unpublished (1990) Univ. of Lund, Sweden
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
I.C.Schroeder, 16-FEB-1990.
FEATURES from to/span description
pept 232 1599 delta-aminolevulinate synthase (hemA, EC
2.3.1.37)
pept 1607 2437 ORF2
pept 2470 3414 uroporphyrinogen I synthase (hemC, EC 4.3.1.8)
BASE COUNT 1068 a 689 c 953 g 1085 t
ORIGIN
1 atgcatatca ccttcttgtt ttttagagct gatgtgtagt aaatttctgc tgtttttggt
61 attgtcaata ggaatgcttc ttttccctga agctttttct aatatagcat aagaatttta
121 aaatctgttc acattttgtg aaagaaacta tgttataatt attataaata atgagttcta
181 tgttagaatg attataaatt aagattgggt gttgggggtg taattagagc gatgcatata
241 cttgttgtgg gagtagatta taaatccgcc cctattgaga tacgtgaaaa agtaagtttt
301 cagccgaatg agctggcaga agcaatggtg cagctgaaag aagagaaaag cattcttgaa
361 aacatcattg tctcaacctg caaccgcact gaaatttatg cggtagtcga ccagcttcat
421 accggccgtt attatataaa aaagttttta gctgattggt ttcaattaag caaagaagag
481 ctgtcaccgt tcttaacgtt ttatgagagc gatgccgctg ttgagcattt attccgtgta
541 gcctgcggac ttgattctat ggtgattggc gaaacgcaga ttctcggaca ggtacgcgac
601 agctttaaaa cagctcagca agaaaaaacg atcgggacta tttttaatga gctgtttaag
661 caggcagtta cagtgggcaa acggactcac gccgaaacag acattggctc aaatgcggtg
721 tcagtaagct atgctgcagt tgaacttgcc aaaaaaatct tcggaaatct ttcaagcaag
781 cacatattga ttctcggtgc gggaaaaatg ggcgagcttg ctgcggaaaa cctgcacgga
841 cagggaatcg gcaaggtcac tgtcattaac cgaacatact tgaaagcgaa ggagcttgca
901 gaccgttttt caggtgaagc gagaagcttg aatcagcttg aaagcgcgct tgcggaggct
961 gatattttaa tcagttcaac cggtgcaagt gaatttgtcg tgtccaaaga gatgatggaa
1021 aacgcgaata agcttcgcaa gggacgtccg ctgtttatgg tcgacattgc cgtgcctaga
1081 gatcttgatc cggcgctgaa tgatcttgaa ggtgtttttc tttatgatat cgacgatctg
1141 gaaggcattg tagaagcgaa catgaaagag cggagagaaa cagctgaaaa agttgaactg
1201 ttaattgaag aaaccattgt ggaatttaaa caatggatga atacacttgg tgttgtgcct
1261 gttatttctg cattgcgcga aaaggcgctt gccatccagt cagaaacgat ggacagcatt
1321 gagcgtaagc tgcctcactt aagcacaaga gagaaaaaac tgttgaacaa acacaccaaa
1381 agtattatta accaaatgct tcgtgatccg attttaaagg tgaaagagct tgcggcagat
1441 gctgattctg aagaaaagct cgcgttgttt atgcagattt ttgatattga agaagctgcg
1501 ggccgtcaaa tgatgaaaac cgttgaaagc agccagaagg tccactcttt taagaaggct
1561 gaatcaaaag cgggctttag cccacttgta agtgagtgaa agctgaatga ttgatactgc
1621 aatggcaaga cttaatgagg ggacaatcgt catttacgcg ttaagtgtac tcttttattt
1681 tatagatttt cttcaacaca accggaaggc tggaaaaatg gccttctggt tgctttctat
1741 tgtctggact ctgcaaaccg tgtatttggc ctattttatg tgggtgacgg ggcggtttcc
1801 ggtattaaat gtgacagagg cactttattt ttatgcctgg gtgcttgtca cgctgtcact
1861 tgtactgaca aagcttttac gtgttgactt tatcgtgttt tttacaaatg ttataggatt
1921 ttctatgatc gccattcaca cattttcacc gacagagcag cagtcagctg ctttttccgg
1981 gcagcttgta tccgagcttt tggtgattca tattacaatg gcgattcttt catacggcgc
2041 tttttccctt tctttcgttt tttctgtgct atatatgttt caatatcatg tgctgaaaaa
2101 gaaaaagtgg ggaaaatggc tgttgagaat agaagattta tctaagcttg attatatggc
2161 gtatgtttta aatgtcattg gggttccgat gctgctgctg agtttgattc tcggcgtcat
2221 ttgggcgtat gtctcactag aaacgctgta ttggtttgac gccaaagtgc ttggttcgtt
2281 tgtcgtcctg ctgctgtaca gctattatct ttatatcagg ctgattaagg agctgcaagg
2341 aaaggtcgct gcactgtgga atacggcttg ttttctggtg ctgatgatca attatttcct
2401 gcttggaagc ctgtcgcaat tccattggtt cagttaaacg atgtcccaag cagattcggg
2461 aggaaagaaa tgatgagaac gattaaagta ggttccagac ggagcaaact cgctatgact
2521 caaacaaaat gggttattca aaaactgaag gaaatcaatc cttcgtttgc ttttgaaatt
2581 aaagagatcg tgacaaaggg cgaccggatt gtcgatgtta cactctcaaa agtgggtgga
2641 aaagggcttt ttgtcaaaga aattgaacag gcgcttttaa acgaagagat tgatatggca
2701 gtgcacagca tgaaggacat gcctgctgtt ttgcctgaag gccttgtgat cggctgtatt
2761 cctgaacggg aggacccgcg tgatgccctt atttcaaaga atcgcgtaaa gctttcagaa
2821 atgaagaaag gtgctgtcat tggcacaagc agtttaagaa gaagcgcgca gcttttgatt
2881 gagcgccctg accttacaat taaatggatt agaggtaata ttgatacaag acttcaaaag
2941 ctggaaacag aggattatga cgcaattatt ttagcggctg ccggcctttc cagaatgggt
3001 tggaagcaag atgtcgtaac cgaattcctt gagcctgagc gctgtttgcc tgctgtgggg
3061 cagggagccc tggcgattga gtgccgagaa tcggatgaag agctgttggc gttgttttct
3121 cagtttacag atgaatatac aaaacggact gtcttagcgg aacgtgcttt tttaaacgcg
3181 atggagggcg gctgccaggt tccgatcgcg ggctactccg tgttaaatgg acaggatgaa
3241 attgaaatga caggtcttgt cgcttcacct gacggcaaaa tcatttttaa agaaaccgtc
3301 accggaaacg atccggagga agtaggaaag cgctgtgccg ctcttatggc tgacaaagga
3361 gcaaaagatt taattgatcg tgtaaaacgg gagcttgacg aggatggaaa atgattttcc
3421 gttgaaagga aaaacagtgc ttgtcacccg gaataaggca caggcagcat catttcagca
3481 aaaagtggag gcgcttggcg gtaaagcggt tttaacctct ttgattacgt ttcgccgcgc
3541 tttgccgaat gatgttgcgg aacaggtaag agaggatctt gccgcgccag gctggcttgt
3601 ttttacaagt gtgaacgggg cagacttctt tttttcttat ctgaaggaaa atcagcttat
3661 tctccctgcg cataaaaaaa ttgcagccgt cggtgaaaaa accgcgcgcc gtttaaaaat
3721 gcataacgta tcggttgatg tgatgccaca ggagtatatt gctgaacaat tgcgtgacgc
3781 tcttaagcag catgc
//
LOCUS ECOAFR1 1476 bp ds-DNA BCT 15-FEB-1990
DEFINITION E.coli AF/R1 major pili subunit (afrA) gene, complete cds.
ACCESSION M32083
KEYWORDS afrA gene; major pili subunit; surface antigen.
SOURCE E.coli (strain RDEC-1) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1127)
AUTHORS Wolf,M.K. and Boedeker,E.C.
TITLE Cloning of the genes for AF/R1 pili from rabbit enteroadherent
Escherichia coli RDEC-1 and DNA sequence of the major structural
subunit
JOURNAL Infect. Immun. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1476)
AUTHORS Wolf,M.K. and Boedeker,E.C.
JOURNAL Unpublished (1990) Walter reed Army Inst. of Res. Washington DC
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
M.K. Wolf, 13-FEB-1990.
FEATURES from to/span description
pept 627 1115 AF/R1 major pili subunit (afrA)
site 349 390 region of dyad symmetry
signal 493 498 pot. -35 region
signal 515 520 pot. -10 region
binding 612 616 pot. ribosome binding site
BASE COUNT 447 a 278 c 287 g 462 t 2 others
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcccta gtgaatgtct gctgggaatc ataaaacaat ctttctgata tatccacaat
61 ttttaggttg gtaaatctta aaagaatagc cgctcgcgtt atcctgctta attgaatgta
121 tttacctaaa gtaacaccta tgttttcttt aaacagtaat tgcagatacc gtctgctgta
181 tccggagtaa tcaacgaggg catttatatc tatagatata ctctctaaat tatcatcaat
241 gtactgtgtt atcgcgttta tcgtaagtgt tttcagcatg tacgtagctc ctatatgtat
301 gtttacgtgt taccccacat catgttaata aaaccccttc tgttttttta gctgattgtg
361 cattgtacac ataccgtgca caattagcta acaacgcaga ccaatatttt ttaaaatacc
421 ccgcgttttc acatgacttg tatctattct cttagagaaa ttaatgcatc tctatcacat
481 catgtgtagt actggacaaa tagtcatggg agcctattac cgaacagcga agatggcata
541 tgttttctta ttaagaaaga ggaaagaata tggcgcactc gttttatctc aattttggta
601 aaaaaaatat atggagaatg tcagaaatga aaaaaacatt tattgcgtct gtaattgtaa
661 taaccataaa tacgggatca gcaattgctg ctcaaggcga tgttcagttc tttggtaccg
721 ttactgcgaa gacctgtgat cttgtcgttg aacacgaggg ggctgtggtc aatatgattc
781 agttgggttc tgtaactaat ggtggaacta atgctggcac cgatatcgga gcaaacaaat
841 cgtttaccct gaagccagca tcaggggtga catgcaatac catcactact gctaaaatgg
901 catggtcttc tcctgcaatg accgttaatg gtattggtaa tctatcaggt aaggctattg
961 atgcccatgt gaagttagtg gcgattaaca gcacgggtaa agttcaaact gataccaacg
1021 cagataagga aattaaagcg ggtcaaaata cagttgatta ctcaattact ggttctggcc
1081 tactgatgaa ggctttaaat ttaaagctca gttaattggc ggtaccattc caggtgactt
1141 cgatagtgct gctgcatatt ccgttgcata caactaatat ttgaatgtaa atccgggaag
1201 cccctccctt cccggattta atatttagaa cagcatattt aactggtgcc cttaactttg
1261 cttaggtgtg aagaggttag cttatgaaat taaaaacatt tcctaaaata tctctactgg
1321 ccctgagtat atggtattct cactccagct tggctgatga acttaatctg gattttatac
1381 agaacgtcag cgttattcca tcaattctga aaagtgacgc aatttacccg gaaggacaat
1441 atatcgttga cgtaaccgta aataaagaac gtatdd
//
LOCUS ECOCYS 5755 bp ds-DNA BCT 14-FEB-1990
DEFINITION E.coli thiosulfate binding protein (cysP), sulfate permease (cysT,
cysW, cysA) and o-acetylserine (thiol)-lyase-B (cysM) genes,
complete cds.
ACCESSION M32101
KEYWORDS cysA gene; cysM gene; cysP gene; cysT gene; cysW gene;
o-acetylserine (thiol)-lyase-B; sulfate permease;
thiosulfate binding protein.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1292 to 5755)
AUTHORS Sirko,A., Hryniewicz,M., Hulamicka,D. and Boeck,A.
TITLE Sulfate and thiosulfate transport in E.coli K12: Nucleotide
sequence and expression of the cysTWAM gene cluster
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1291)
AUTHORS Hryniewicz,M., Sirko,A., Palucha,A., Boeck,A. and Hulamicka,D.
TITLE Sulfate and thiosulfate transport in E.coli K12: Identification of
a gene encoding a novel protein involved in thiosulfate binding
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
D.Halanicka, 14-FEB-1990.
FEATURES from to/span description
pept 559 1575 thiosulfate binding protein (cysP)
pept 1575 2408 sulfate permease (cysT)
pept 2408 3283 sulfate permease (cysW)
pept 3273 4370 sulfate permease (cysA)
pept 4505 5416 o-acetylserine (thiol)-lyase-B (cysM; gtg
start codon; EC 4.2.99.8)
signal 516 521 -10 region
site 5737 5755 right end of mu
BASE COUNT 1259 a 1477 c 1682 g 1337 t
ORIGIN 52 min on K12 map.
1 gttaacgcca tttgcccggg atacgtgcgc acaccaatgg cggaaagcat tgcccgccag
61 tcgaacccgg aagatccaga gtcggtgctg actgaaatgg cgaaagcaat cccgatgcgt
121 cgcctcgccg atccgctgga agtcggcgaa ctggcggcct tcctcgcatc ggatgaatcc
181 agctatttaa ccggtacaca gaatgtgatt gatggcggca gcacactgcc ggagacggtt
241 agcgtcggta tctgattcac ctctgtttcc tccctgcatt tgtggggagg atttcgtctt
301 gaactaagtt caccaggcta ttttatttgt cattttggcc ccgggcagtg ctcgaaatcc
361 tcacgtacta tgtgtacgct ccggtttctc cgcgctgttc gtgtccaaac tgactgcaac
421 aattacgcct gttgaaccaa gttcttattc ccttttcaac ttccaaatca ccaaacggta
481 tataaaaccg ttactccttt cacgtccgtt ataaatatga tggctattag aaagtcatta
541 aatttataag ggtgcgcaat ggccgttaac ttactgaaaa agaactcact cgcgctggtc
601 gcttctctgc tgctggcggg ccatgtacag gcaacggaac tgctgaacag ttcttatgac
661 gtctcccgcg agctgtttgc cgccctgaat ccgccgtttg agcaacaatg ggcaaaagat
721 aacggcggcg acaaactgac gataaaacaa tctcatgccg ggtcatcaaa acaggcgctg
781 gcgattttac agggcttaaa agccgacgtt gtcacttata accaggtgac cgacgtacaa
841 atcctgcacg ataaaggcaa gctgatcccg gccgactggc agtcgcgcct gccgaataat
901 agctcgccgt tctactccac catgggcttc ctggtgcgta agggtaaccc gaagaatatc
961 cacgattgga acgacctggt gcgctccgac gtgaagctga ttttcccgaa cccgaaaacg
1021 tcgggtaacg cgcgttatac ctatctggcg gcatggggcg cagcggataa agctgacggt
1081 ggtgacaaag gcaaaaccga acagtttatg acccagttcc tgaaaaacgt tgaagtgttc
1141 gatactggcg gtcgtggcgc gaccaccact tttgccgagc gcggcctggg cgatgtgctg
1201 attagcttcg aatcggaagt gaacaacatc cgtaaacagt atgaagcgca gggctttgaa
1261 gtggtgattc cgaaaaccaa cattctggcg gaattcccgg tggcgtgggt tgataaaaac
1321 gtgcaggcca acggtacgga aaaagccgcc aaagcctatc tgaactggct ctatagcccg
1381 caggcgcaaa ccatcatcac cgactattac taccgcgtga ataacccgga ggtgatggac
1441 aaactgaaag acaaattccc gcagaccgag ctgttccgcg tggaagacaa atttggctcc
1501 tggccggaag tgatgaaaac ccacttcacc agcggcggcg agttagacaa gctgttagcg
1561 gcggggcgta actgatgttt gctgtctcct ccagacgcgt gctgccgggc tttaccttaa
1621 gcctcggcac cagtctgctg tttgtgtgcc tgattttgct gctgccgctc tccgcgctgg
1681 tgatgcaact ggcccagatg agctgggcgc agtactggga ggtgatcacc aacccgcagg
1741 tggtcgcggc ctacaaagta acgctgctgt cggcgtttgt ggcatcgatt tttaacggcg
1801 ttttcggtct gctgatggcg tggatcctaa cccgctatcg cttcccaggc cgcacgctgc
1861 ttgatgcgct gatggattta ccctttgcgc tgccaacggc tgtcgccggt ttaacgctgg
1921 cctcgctctt ttccgtaaac ggtttttacg gtgaatggct ggcgaagttt gatatcaaag
1981 tcacctatac atggctgggg attgcggtgg ctatggcctt taccagcatt ccgtttgtgg
2041 tgcgtaccgt gcagccggtg ctggaagagt taggcccgga atatgaagaa gcggcggaaa
2101 cgcttggtgc aacgcgctgg cagagtttct gcaaagtggt gctgccggag ctttctccgg
2161 cgctggtggc gggcgtggcg ctgtcgttta cccgtagtct tggtgaattt ggcgcggtga
2221 tttttatcgc cggaaatatc gcgtggaaga cggaagtgac gtcgctgatg atttttgtgc
2281 gcttacagga gtttgattac ccggcagcga gcgcgattgc ttcggtgatc ctcgcggcat
2341 ctctgctgct gctgttctca attaacactc tgcaaagtcg ctttggtcgg cgtgtggtag
2401 gtcattaatg gcggaagtta cccaattgaa gcgttatgac gcgcgcccga ttaactgggg
2461 caaatggttt ctgattggca tcgggatgct ggtttcggcg ttcatcctgc tggtgccgat
2521 gatttacatc ttcgtgcagg cattcagcaa ggggctgatg ccggttttac agaatctggc
2581 cgatccggac atgctgcacg ccatctggct gacggtgatg atcgcgctga ttgccgtacc
2641 ggtaaacctg gtgttcggca ttctgctggc ctggctggtg acgcgcttta acttccctgg
2701 acgccagtta ctgctgacgc tactggacat tccgtttgcc gtatcgccgg tggttgccgg
2761 tctggtgtat ttgctgttct acggctctaa cggcccgctc ggcggttggc tcgacgagca
2821 taacctgcaa attatgttct cctggccggg aatggtgctg gtcaccatct tcgtgacgtg
2881 tccgtttgtg gtgcgcgaac tggtgccggt gatgttaagc cagggcagcc aggaagacga
2941 agcggcgatt ttgcttggcg cgtccggctg gcagatgttc cgtcgcgtca cattaccgaa
3001 catccgctgg gcgctgcttt atggcgtggt gttgaccaac gcccgcgcaa ttggcgagtt
3061 tggcgcggtg tcggtggttt ccggctcgat tcgcggcgaa accctgtcgc tgccgttaca
3121 gattgaattg ctggagcagg actacaacac cgtcggctcc tttaccgctg cggcgctgtt
3181 aacgctgatg gcgattatca ccctgttttt aaaaagtatg ttgcagtggc gcctggagaa
3241 tcaggaaaaa cgcgcacagc aggaggaaca tcatgagcat tgagattgcc aatattaaga
3301 agtcgtttgg tcgcacccag gtgctgaacg atatctcact ggatattcct tcaggtcaga
3361 tggtcgcgtt gctggggccg tccggttccg ggaaaaccac gctgctgcgc attatcgccg
3421 ggctggagca tcaaaccagc gggcatattc gcttccacgg caccgacgtg agccgcctgc
3481 acgcacgtga tcgtaaagtc ggtttcgtgt tccagcatta cgcgctgttc cgccatatga
3541 cggtgttcga caatatcgct tttggcctga cggtgctgcc gcgtcgcgag cgcccgaatg
3601 ccgcagccat caaagcgaaa gtgacaaaat tgctggaaat ggtccagctt gcccatctgg
3661 cggatcgtta tccggcgcac gtttccggcg gccagaaaca gcgcgtggcg ctggcgcgcg
3721 cgctggctgt ggaaccgcaa attctgctgc ttgatgaacc gtttggcgcg ctggatgcgc
3781 aggtgcgtaa agagctgcgt cgctggctgc gtcaactcca tgaagaacta aaattcacca
3841 gcgtttttgt gacccacgat caggaagaag cgaccgaagt agctgatcgt gtagttgtga
3901 tgagccaggg caatattgaa caggctgacg cgccggatca ggtatggcgc gaaccggcga
3961 cccgttttgt gctcgaattt atgggcgaag tgaaccgcct gcagggaacc attcgcggcg
4021 ggcagttcca tgttggcgcg catcgctggc cgctgggcta cacacctgcg tatcaggggc
4081 cggtggatct cttcctgcgc ccttgggaag tggatatcag ccgccgtacc agcctcgatt
4141 cgccgctgcc ggtacaggta ctggaagcca gcccgaaagg tcactacacc caattagtgg
4201 tgcagccgct ggggtggtac aacgaaccgc tgacggtcgt gatgcatggc gacgatgccc
4261 cgcagcgtgg cgagcgttta ttcgttggtc tgcaacatgc gcggctgtat aacggcgacg
4321 agcgtatcga aacccgcgat gaggaacttg ctctcgcaca aagcgcctga taggttgagt
4381 gaatgttaaa cgcccggagg cgcttcccgc gagtccgggc tttttaatgg caaggtttgt
4441 aacctgtaga cctgataaga cgcgcaagcg tcgcatcagg caacaccacg tatggataga
4501 gatcgtgagt acattagaac aaacaatagg caatacgcct ctggtgaagt tgcagcgaat
4561 ggggccggat aacggcagtg aagtgtggtt aaaactggaa ggcaataacc cggcaggttc
4621 ggtgaaagat cgtgcggcac tttcgatgat cgtcgaggcg gaaaagcgcg gggaaattaa
4681 accgggtgat gtcttaatcg aagccaccag tggtaacacc ggcattgcgc tggcaatgat
4741 tgccgcgctg aaaggctatc gcatgaaatt gctgatgccc gacaacatga gccaggaacg
4801 ccgtgcggcg atgcgtgctt atggtgcgga actgattctt gtcaccaaag agcagggcat
4861 ggaaggtgcg cgcgatctgg cgctggagat ggcgaatcgt ggcgaaggaa agctgctcga
4921 tcagttcaat aatcccgata acccttatgc gcattacacc accactgggc cggaaatctg
4981 gcagcaaacc ggcgggcgca tcactcattt tgtctccagc atggggacga ccggcactat
5041 caccggcgtc tcacgcttta tgcgcgaaca atccaaaccg gtgaccattg tcggcctgca
5101 accggaagag ggcagcagca ttcccggcat tcgccgctgg cctacggaat atctgccggg
5161 gattttcaac gcttctctgg tggatgaggt gctggatatt catcagcgcg atgcggaaaa
5221 caccatgcgc gaactggcgg tgcgggaagg aatattctgt ggcgtcagct ccggcggcgc
5281 ggttgccgga gcactgcggg tggcaaaagc taaccctgac gcggtggtgg tggcgatcat
5341 ctgcgatcgt ggcgatcgct acctttctac cggggtgttt ggggaagagc attttagcca
5401 gggggcgggg atttaaggat taatagcatc ggagactgat gacaaacgca aaactgcctg
5461 atgcgctacg cttatcaggc ctacaaggtt tctgcaatat attgaattag cacgattttg
5521 taggccggat aaggcgttta cgccgcatcc ggcataaaca aagcgcactt ttttaacagt
5581 tgttgctgcc gacaaatgca gtatttaatt ttcgtgagga aacgccgtaa ggtcattgaa
5641 gcggcgcacg aaaaaccgaa agcgtttcac gataaatgcg aaaactttac gtttcgcgct
5701 tcaaatgaaa cagatgtatt aattactact ttttattcat tacatgggga tccag
//
LOCUS HUMCOLA2I 1994 bp ds-DNA PRI 15-FEB-1990
DEFINITION Human collagen type I alpha-2 (COL1A2) gene, exon 1 (partial).
ACCESSION M31886
KEYWORDS collagen.
SOURCE Human DNA, clone pCOL-alpha-2-bGH.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1994)
AUTHORS Sherwood,A.L., Bottenus,R.E., Martzen,M.R. and Bornstein,P.
TITLE Structural and functional analysis of the first intron of the human
alpha-2(I) collagen-encoding gene
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.Bornstein, 02-FEB-1990.
FEATURES from to/span description
pre-msg < 1 > 1994 collagen alpha-2 type I mRNA and intron
/nomgen="COL1A2" /map="7q21.3-q22.1"
/hgml_locus_uid="LP0002V"
IVS 156 > 1994 COL1A2 intron A
binding 141 154 NF1 binding site
binding 1034 1040 AP1 binding site
binding 1061 1067 AP1 binding site
site 1370 1409 gt-rich region
BASE COUNT 580 a 413 c 456 g 545 t
ORIGIN
1 gcatgcccgc gcccgccagg tgatacctcc gccggtgacc caggggctct gcgacacaag
61 gagtctgcat gtctaagtgc tagacatgct cagctttgtg gatacgcgga ctttgttgct
121 gcttgcagta accttatgcc tagcaacatg ccaatgtaag tgccttcagc ttgtttgggg
181 gagactgggt agagaggtta gatgggaggg caccctgccc tgaaaaggaa aacctgtaac
241 ctgaattcca ggtacacttg gagggcagac tctcaggcat gtgggaaaac gccggaattg
301 ataagaaaca tggaaattac tttaaaaaat gaaaacataa aagccttgcc aaaagttagg
361 gaacttttcc tctaagttca gagtgagaca gttaactcgg tctggctcct cagcttagta
421 acccccaaag ggagcggaag gtctttttcc ctaaggatga gatattaacg accaatgtgg
481 tggaggaagt caagggcctg caccccacag gccccataac cgcactgatg tccaccttgt
541 aaaacttgag gcctgcgtta gaaagccctt caactgagta atgtaaaact cacctcctaa
601 gagcttttat cttctgggca ttgtaaggct tgtccggagg aggaggatga cgatgctgat
661 atgatgatgg ttataaggcg ccctctggag gaaggaaaat gaaagtacag gggacagggc
721 cttaagcaga tggaatccca attaaagctt ctacggattt atacagatta atgatcagca
781 tttctggttg gagcctttcc cagtggctag tcagtgaacc ctggaaagaa gaatggatgc
841 tacttggagt gggtacattc tgaaaagtaa tataagtgtc tcaattcact ttctagtcat
901 ggaaatggta acatttttta actcaaatct gctctaaatt ttgtttgagc ctgagaatta
961 cccctttgac atgttcccag tgataagcaa acattatgaa cgcagcaagt tgagaaatat
1021 caacattgag atgagactca agagaccggg gtttttccca tgagtctgac accaatttgc
1081 tgcgtgactt tgggcaagtc aaacggcctt ttctaaaatg tgagacagag attaaaggga
1141 ccccaaggcc actttccagc tctaggttcc atggccagac tttcatgtca acagagaatg
1201 aagaagatca gtccgttttc atcttgaaaa tggctgccaa agtgctagac aaagatattg
1261 actagatggg ggatggtatt gtctgaccac acccagtact ccaaaaagtt gttccaccca
1321 cacagcacgg tgtctaccac tgcataattt ctaatgcatt tgtgtgcttg tgtgtgtgtg
1381 tgtgtgtgtg tgtctgtgtg tctgtgtgtc tcttccccct tcattcactt ttagtataca
1441 tactgtggat actaaggagt aattgcagtg aacaaattca cattaccgag ttcatatttt
1501 taatgagatc ttgagagtgg gaggaaagag tcggctccta gagaataaaa tgaaggcaga
1561 cttagggaaa tttgaaggta caaaggcaac ttaccttctg atcaacagcc aaccacagtc
1621 tggaataaat gttatcaaac acacattctt caaaatggtc cgtgtctgag taattaaaag
1681 gcaaatttcc aaaatcataa ggacttccgt taatcaagtc aggcataatt attcttccta
1741 ctgatgacac aatgaagtaa acatatcatt cttgtaattt aacagtaatt ctcgtaaatt
1801 gcccttaaat gtcagtgctg gatgtggtcc accctcctaa attgtgactg ttgcaacaga
1861 tgttctcact tcaaataacg cacttcttgg ccacctaatt aaagcaattt ttggggtgat
1921 tcatcctact gcaagcttgg ccacacttgt atcctgtatt aacctataat ttttgtaccg
1981 taggagaaga attc
//
LOCUS HUMP120PC 2612 bp ss-mRNA PRI 14-FEB-1990
DEFINITION Human proliferating-cell nucleolar protein P120 mRNA, complete cds.
ACCESSION M32110
KEYWORDS proliferating cell nuclear protein.
SOURCE Human fetal liver cell line CML and testis, cDNA to mRNA, and lymph
node DNA (bases 1 to 30).
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2612)
AUTHORS Fonagy,A., Henning,D., Jhiang,S., Haidar,M., Busch,R.K., Larson,R.,
Valdez,B. and Busch,H.
TITLE Cloning of the cDNA and sequence of the human proliferating cell
nuclear protein P120
JOURNAL Cancer Communications 1, 243-251 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
H.Busch, 14-FEB-1990.
FEATURES from to/span description
pept < 1 2612 proliferating cell nuclear protein P120 (AA at
3)
BASE COUNT 674 a 701 c 725 g 512 t
ORIGIN 346 bp upstream of AvaI site.
1 cacgcgcgac gccaccttct cccatttctg cctgccacag taccatgggg cgcaagttgg
61 accctacgaa ggagaagcgg gggccaggcc gaaaggcccg gaagcagaag ggtgccgaga
121 cagaactcgt cagattcttg cctgcagtaa gtgacgaaaa ttccaagagg ctgtctagtc
181 gtgctcgaaa gagggcagcc aagaggagat tgggctctgt tgaagcccct aagacaaata
241 agtctcctga ggccaaacca tcgcctggaa agctaccaaa agggatctct gcaggagctg
301 tccagacagc tggtaagaag ggaccccagt ccctatttaa tgctcctcga ggcaagaagc
361 gcccagcacc tggcagtgat gaggaagagg aggaggaaga ctctgaagaa gatggtatgg
421 tgaaccacgg ggacctctgg ggctccgagg acgatgctga tacggtagat gactatggag
481 ctgactccaa ctctgaggat gaggaggaag gtgaagcgtt gctgcccatt gaaagagctg
541 ctcggaagca gaaggcccgg gaagctgctg ctgggatcca gtggagtgaa gaggagaccg
601 aggacgagga ggaagagaaa gaagtgaccc ctgagtcagg ccccccaaag gtggaagagg
661 cagatggggg cctgcagatc aatgtggatg aggaaccatt tgtgctgccc cctgctgggg
721 agatggagca ggatgcccag gctccagacc tgcaacgagt tcacaagcgg atccaggata
781 ttgtgggaat tctgcgtgat tttggggctc agcgggagga agggcggtct cgttctgaat
841 acctgaaccg gctcaagaag gatctggcca tttactactc ctatggagac ttcctgcttg
901 gcaagctcat ggacctcttc cctctgtctg agctggtgga gttcttagaa gctaatgagg
961 tgcctcggcc cgtcaccctc cggaccaata ccttgaaaac ccgacgccga gaccttgcac
1021 aggctctaat caatcgtggg gttaacctgg atcccctggg caagtggtca aagactggac
1081 tagtggtgta tgattcttct gtgcccattg gtgctacccc cgagtacctg gctgggcact
1141 acatgctgca gggagcctcc agcatgttgc ccgtcatggc cttggcaccc caggaacatg
1201 agcggatcct ggacatgtgt tgtgcccctg gaggaaagac cagctacatg gcccagctga
1261 tgaagaacac gggtgtgatc cttgccaatg acgccaatgc tgagcggctc aagagtgttg
1321 tgggcaactt gcatcggctg ggagtcacca acaccattat cagccactat gatgggcgcc
1381 agttccccaa ggtggtgggg ggctttgacc gagtactgct ggatgctccc tgcagtggca
1441 ctggggtcat ctccaaggat ccagccgtga agactaacaa ggatgagaag gacatcctgc
1501 gctgtgctca cctccagaag gagttgctcc tgagtgctat tgactctgtc aatgcgacct
1561 ccaagacagg aggctacctg gtttactgca cctgttctat cacagtagaa gagaatgagt
1621 gggtggtaga ctatgctctg aaaaagagga atgtgcgact ggtgcccacg ggcctagact
1681 ttggccagga aggttttacc cgctttcgag aaaggcgctt ccaccccagt ctgcgttcta
1741 cccgacgctt ctaccctcat acccacaata tggatgggtt cttcattgcc aagttcaaga
1801 aattttccaa ttctatccct cagtcccaga caggaaattc tgaaacagcc acacctacaa
1861 atgtagactt gcctcaggtc atccccaagt ctgagaacag cagccagcca gccaagaaag
1921 ccaagggggc tggaaagaca aagcagcagc tgcagaaaca gcaacatccc aagaaggcct
1981 ccttccagaa gctgaatggc atctccaaag gggcagactc agaattgtcc actgtacctt
2041 ctgtcacaaa gacccaagct tcctccagct tccaggatag cagtcagcca gctggaaaag
2101 ccgaagggat cagggagcca aaggtgactg ggaagctaaa gcaacgatca cctaaattac
2161 agtcctccaa gaaagttgct ttcctcaggc agaatgcccc tcccaagggc acagacacac
2221 aaacaccggc tgtgttatcc ccatccaaga ctcaggccac cctgaaacct aaggaccatc
2281 atcagcccct tggaagggcc aagggggttg agaagcagca gttcgcagag cagccttttg
2341 agaaagctgc cttccagaaa cagaatgata cccccaaggg cctcagcctc ccactgtgtc
2401 tcccatccgt tccagccgcc ccccaccagc aaagaggaag aaatctcagt ccaggggcaa
2461 cagccagctg ctgctatctt agatggttga aaactagacg ggtggctcac tgccattgtc
2521 accaggttgg aactcttgcc tctgtgagga tgccttctct actgtgcata cccatgaaat
2581 ttaatacaca ttttaaaacc tctggccact ga
//
LOCUS MUSH2RIIBP 2204 bp ss-mRNA ROD 10-APR-1990
DEFINITION Mouse MHC class I regulatory element binding protein (H-2RIIBP)
mRNA, 3' end.
ACCESSION M26804
KEYWORDS MHC class I regulatory element binding protein.
SOURCE Mouse liver, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2204)
AUTHORS Hamada,K., Gleason,S.L., Levi,B.-Z., Hirschfeld,S., Appella,E. and
Ozato,K.
TITLE H-2RIIBP, a member of the nuclear hormone receptor superfamily that
binds to both the regulatory element of major histocompatibility
class I genes and the estrogen response element
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8289-8293 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by author, 10-AUG-1989.
FEATURES from to/span description
pept < 1 1341 MHC class I regulatory element binding protein
(H-2RIIBP)
mRNA < 1 2204 H-2RIIBP mRNA
BASE COUNT 436 a 645 c 648 g 475 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccccc gaagcccaga cagctcctcc ccaaatcccc tttctcaggg gatccgtccg
61 tcttctcctc ctggcccacc tcttacccct tcagcacctc cacctccaat gccacccccg
121 ccactgggct cccccttccc agtcatcagt tcttccatgg ggtcccctgg tctgccccct
181 ccggctcccc caggattctc cgggcctgtc agcagccctc agatcaactc cacagtgtcg
241 ctccctgggg gtgggtctgg cccccctgaa gatgtgaagc caccggtctt aggggtccgg
301 ggcctgcact gtccaccccc tccaggtggt cctggggctg gcaaacggct ctgtgcaatc
361 tgcggggacc gaagctcagg caagcactat ggggtttaca gctgcgaggg ctgcaagggt
421 ttcttcaagc gcaccattcg gaaggacctg acctactcgt gtcgtgataa caaagactgt
481 acagtggaca agcgccagcg gaatcgctgt cagtactgtc gctatcagaa gtgcctggcc
541 actggcatga aaagggaggc ggttcaggag gagcgtcaac gggggaagga caaagacggg
601 gatggagatg gggctggggg agcccctgag gagatgcctg tggacaggat cctggaggca
661 gagcttgctg tggagcagaa gagtgaccaa ggcgttgagg gtcctggggc caccgggggt
721 ggtggcagca gcccaaatga cccagtgact aacatctgcc aggcagctga caaacagctg
781 ttcacactcg ttgagtgggc aaagaggatc ccgcacttct cctccctacc tctggacgat
841 caggtcatac tgctgcgggc aggctggaac gagctcctca ttgcgtcctt ctcccatcgg
901 tccattgatg tccgagatgg catcctcctg gccacgggtc ttcatgtgca cagaaactca
961 gcccattccg caggcgtggg agccatcttt gatcgggtgc tgacagagct agtgtccaaa
1021 atgcgtgaca tgaggatgga caagacagag cttggctgcc tgcgggcaat catactgttt
1081 aatccagacg ccaagggcct ctccaaccct ggagaggtgg agatccttcg ggagaaggtg
1141 tacgcctcac tggagaccta ttgcaagcag aagtaccctg agcagcaggg ccggtttgcc
1201 aagctgctgt tacgtcttcc tgccctccgc tccatcggcc tcaagtgtct ggagcacctg
1261 ttcttcttca agctcattgg cgacaccccc attgacacct tcctcatgga gatgcttgag
1321 gctccccacc agctagcctg agcccagatg cacaccgagt gtcactgagg aggacttgag
1381 cctgggcagg gggcagagcc atgggacagg tgcagagcag gaggggactt gcccagcctg
1441 ccagggatct ggcaacactt agcagggttc gcttggtctc caagtcgaag gggaccccag
1501 atccctgtga ggactttatg tctaccttca gtggccttga gtctctgaat ttgtcggggt
1561 ctcccatggt gcaggtgatt cttcatcctg gctccccagc acaaagcact gccctgcttc
1621 cttctcattt ggcctcactc ccttctgaag agtggaacag agctccccca gaaaggggtg
1681 ttgtggggca ggccccccaa gctgatgatc atgggagcag ggctctgaca gcctttatcc
1741 tctcagactt gacagatggg ggcagaggag ggacctgcct ctgtctcctg tcagccccat
1801 ttccacagtc cctcctgcag tcagactgaa gaataaaggg gtagtgaagg ggctgctgga
1861 ggtggaggaa cccattgctc ttttaatttc ctgtgaggag agactgggag ttagactcaa
1921 agaagtactg tacatcccca ggttgactta aatgtcaggg ctggagatgg catgtgggca
1981 aggaggcccc tcaggtgggc tgtcccaaag ctccctgggc tctgcctcgg gtggccctac
2041 agctcttccc tagtcttaag cacagctagg ctgggagcaa gtggggacat tgatgggggt
2101 ggccagcctg cagagttggg tgctgggctg catggttttt gccctggacc tcttttgggg
2161 gttccctccc atctttcact tgcacataaa gttgctttcc agtt
//
LOCUS MUSID 927 bp ss-mRNA ROD 15-FEB-1990
DEFINITION Mouse helix-loop-helix DNA binding protein regulator (Id) mRNA, 3'
end.
ACCESSION M31885
KEYWORDS helix-loop-helix DNA binding protein regulator;
helix-loop-helix protein; regulatory protein.
SOURCE Mouse (strain DBA2) erythroleukemia cell line MEL, cDNA to mRNA,
clone pMH18.
REFERENCE 1 (bases 1 to 927)
AUTHORS Benezra,R., Davis,R.L., Lockshon,D., Turner,D.L. and Weintraub,H.
TITLE The protein Id: A negative regulator of helix-loop DNA binding
proteins
JOURNAL Cell (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
R.Benezra, 02-FEB-1990
FEATURES from to/span description
pept < 1 533 helix-loop-helix protein (Id) (AA at 3)
site 261 305 helix 1
site 306 335 loop
site 336 383 helix 2
BASE COUNT 192 a 256 c 259 g 220 t
ORIGIN
1 attgtacaac ctttctccaa cttcttgttc tcttcccaca ctctgttctc agcctcctcc
61 gctcccctcc gcctgttctc aggatcatga aggtcgccag tggcagtgcc gcagccgctg
121 caggccctag ctgttcgctg aaggcgggca ggacagcggg cgaggtggta cttggtctgt
181 cggagcaaag cgtggccatc tcgcgctgcg ctgggacgcg cctgcccgcc ttgctggacg
241 agcagcaggt gaacgtcctg ctctacgaca tgaacggctg ctactcacgc ctcaaggagc
301 tggtgcccac cctgccccag aaccgcaaag tgagcaaggt ggagatcctg cagcatgtaa
361 tcgactacat cagggacctg cagctggagc tgaactcgga gtctgaagtc gggaccaccg
421 gaggccgggg actgcctgtc cgcgccccgc tcagcaccct gaacggcgag atcagtgcct
481 tggcggccga ggcggcatgt gttccagccg acgatcgcat cttgtgtcgc tgaggcggcg
541 cactgaggga ccagatggac tccagccctt caggaggcaa gaggaaaaaa gtgctctcgg
601 ttccccaggg gatctctggg aaagacacta ccgcagccac cggactcttg gcggatcggt
661 ccagtgggta gagggtttga tcaacagagc ctcaccctct ccacctttca gcctccagag
721 actttgggga gggggttaat caaccccgcg tgtttctgtt ttattgaaaa agcagacatt
781 ttttttaaat ggtcacattt cgtgcttctc ggatttctga ggaaatattt tatattgtat
841 attacaatga tcactggctg aaaatattgt tttacaatag ttctatgggg gtgggttttt
901 tgttgttatt aaacaaacac tttagat
//
LOCUS MZEPPDK 644 bp ds-DNA PLN 16-FEB-1990
DEFINITION Z.mays pyruvate orthophosphate dikinase (PPDK) gene, 3' end.
ACCESSION M32081
KEYWORDS pyruvate orthophosphate dikinase.
SOURCE Z.mays mays (strain B73) DNA.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 644)
AUTHORS Glackin,C.A. and Grula,J.W.
TITLE Organ-specific transcripts of different size and abundance
JOURNAL Unpublished (1990) In Press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
C.A.Glacken, 13-FEB-1990.
FEATURES from to/span description
pept < 1 23 pyruvate orthophosphate dikinase (PPDK) (AA at
3)
mRNA < 1 322 PPDK mRNA
BASE COUNT 150 a 148 c 148 g 198 t
ORIGIN
1 ctgcagctca ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc ctgcagctca
61 ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc tgttggtgca tgctggtgat
121 taataatact actatgacag agccatatgc tgttggtgca tgctggtgat taataatact
181 actatgacag agccatatgc tctgtgaaga gtattagtag cagcgctcat aaaagctaca
241 gttccatcta tctgtgaaga gtattagtag cagcgctcat aaaagctaca gttccatcta
301 ttttctcagc tatgtaaaac ttccaaactg ttcatgctta aaactgaggg ttttctcagc
361 tatgtaaaac ttccaaactg ttcatgctta aaactgaggg ttttcgtggt gtgagatgtg
421 catgtcgttg ttgaggccat tgctgcacat ttttcgtggt gtgagatgtg catgtcgttg
481 ttgaggccat tgctgcacat tccacctatt gaggccctcc tcaaattaag cctcgaacaa
541 gctgatcatc tccacctatt gaggccctcc tcaaattaag cctcgaacaa gctgatcatc
601 ttttctgaga actctagact cgttttctga gaactctaga ctcg
//
LOCUS RATCLATP 4350 bp ss-mRNA ROD 15-FEB-1990
DEFINITION Rat ATP citrate-lyase mRNA, complete cds.
ACCESSION J05210
KEYWORDS ATP citrate-lyase.
SOURCE Rat liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4350)
AUTHORS Elshourbagy,N.A., Near,J.C., Metz,P.J., Sathe,G.M., Southan,C.,
Stickler,J.E., Gross,M., Young,J.F., Wells,T.N.C. and Groot,P.H.E.
TITLE Rat ATP citrate-lyase: Molecular cloning and sequencing analysis of
a full length cDNA and mRNA abundance as a function of diet, organ,
and age
JOURNAL J. Biol. Chem. 265, 1430-1435 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
N.A.Elshourbagy, 13-FEB-1990.
FEATURES from to/span description
pept 73 3375 ATP citrate-lyase
mRNA < 1 4350 ATP cytrate-lyase mRNA
BASE COUNT 1084 a 1114 c 1136 g 1016 t
ORIGIN
1 taagctggtg cttacggaca gagagccaca ctcgggcttt ctcgaagagg taaaccaggt
61 ccctctgcag ccatgtcagc caaggcaatt tcagagcaga ccggcaaaga actcctttac
121 aagtacatct gtaccacctc agccatccag aaccggttca agtatgcccg ggttactccc
181 gacacagact gggcccatct cctgcaggac cacccctggc tgcttagcca gagcttggta
241 gtcaagccgg accagctgat caaacgtcga ggaaagcttg gtctagtcgg ggtcaacctc
301 tctctggatg gagtcaaatc ctggctgaaa cctcgactgg gacatgaggc caccgtcggc
361 aaggccaaag gcttcctcaa gaactttctg attgagccct tcgtccccca cagtcaggcg
421 gaggagttct acgtgtgcat ctatgctacc cgggaaggag actacgtcct gttccaccat
481 gaagggggtg tggatgtggg cgatgtggac accaaagccc agaagctgct tgtgggtgtg
541 gacgagaaac tgaacgctga agacattaag agacacctgt tggtccacgc ccccgaagac
601 aagaaagaaa tcctggccag cttcatctcc ggcctattca atttctacga agatctttac
661 ttcacctacc ttgagatcaa cccccttgtg gtgaccaaag atggtgtcta catccttgac
721 ctggcggcca aggtggacgc cactgctgac tacatctgca aagtcaagtg gggtgatata
781 gagttccctc ccccctttgg gcgtgaggca tacccagagg aagcctacat tgcagacctg
841 gatgccaaaa gtggggcgag cttgaagctg accttgctga accccaaggg gcggatctgg
901 accatggttg ccgggggtgg cgcctctgtc gtgtacagtg ataccatctg tgatcttgga
961 ggtgtcaacg aactggcgaa ttacggggag tactctggtg cccccagtga acaacagacc
1021 tatgactacg ccaagaccat cctctcactt atgactcgag agaagcaccc ggatggcaag
1081 atcctcatca ttggaggcag cattgcaaac ttcaccaacg tggccgccac cttcaagggc
1141 attgtgagag caattcgaga ttaccagggt tccctgaagg agcacgaggt caccatcttt
1201 gttcgaagag gtggcccgaa ctatcaagag ggattacgag tgatgggaga agttgggaag
1261 accactggaa tccccatcca tgtctttggc acagaaactc acatgacggc cattgtgggc
1321 atggcctggg caccggccat tcccaaccag ccacccacag cggctcacac tgccaacttc
1381 ctccttaatg ccagtgggag cacatcgaca ccagcaccca gcaggacagc gtctttttcc
1441 gagtccagag ctgacgaggt ggcccctgca aagaaagcca agccagccat gccccaagat
1501 tcagtcccaa gtccaagatc cctgcaagga aagagtgcca ccctcttcag ccgacatacc
1561 aaggctatcg tatggggcat gcagacccgg gctgtgcaag gcatgctgga ctttgactac
1621 gtgtgctccc gagatgagcc ttcagtggct gctatggtct acccgttcac gggggatcat
1681 aagcagaagt tttactgggg acacaaggaa atcctgatcc ctgtcttcaa gaacatggct
1741 gacgccatga aaaagcatcc ggaggtagac gtgctgatca actttgcatc tctgcgatcg
1801 gcttatgaca gcaccatgga gaccatgaac tatgcacaga tccggaccat agccatcata
1861 gcagaaggca tccctgaggc tctcacacgg aagctcatca agaaggcaga ccagaagggc
1921 gtgaccatca ttgggccagc cacggttggg ggcatcaagc ctggatgctt taagattggg
1981 aatactggtg ggatgctgga caacatcctg gcctccaaac tgtatcgccc aggcagtgtg
2041 gcctacgtct cgcgttcagg aggcatgtct aacgaactca ataatatcat ctctcggacc
2101 acagatggtg tctacgaggg tgttgccatc ggcggggaca ggtaccctgg gtccacattc
2161 atggatcacg tgctgcgtta ccaagacact ccaggagtca agatgattgt agttcttggg
2221 gagatagggg gtacagaaga atataagatc tgccggggca tcaaggaggg ccgcctcacc
2281 aagccagtgg tctgctggtg catcgggacc tgtgccacca tgttctcttc tgaggtccag
2341 tttggccacg ctggggcttg tgccaaccag gcttctgaaa cggcagtagc caagaaccag
2401 gccttgaagg aagcgggagt gtttgtgccc cgaagctttg atgagctcgg agaaatcatt
2461 cagtccgtgt atgaagatct tgtggccaaa ggcgccattg tacctgctca ggaagtgcca
2521 cctccaacag tacccatgga ctactcttgg gccagggagc tgggtttaat ccgaaaacct
2581 gcctcattca tgaccagcat ctgtgacgag cgggggcagg aactcattta tgcgggcatg
2641 cccatcaccg aggtcttcaa ggaagagatg ggcattggtg gtgtcctggg cctcctctgg
2701 ttccagagaa ggttgcccaa gtattcctgc cagttcattg agatgtgtct catggtcacc
2761 gctgatcacg ggccagctgt ctccggggcc cataacacta tcatctgtgc tcgggctggg
2821 aaggacctgg tctccagcct cacctcaggg ctgctcacca ttggggaccg gtttgggggt
2881 gccttggacg cagcagcgaa gatgttcagt aaagcctttg acagcggcat tattcccatg
2941 gagtttgtga acaagatgaa gaaggagggg aaactgatca tgggcatcgg ccatcgagtc
3001 aaatcgataa acaacccaga catgcgagtg cagatcctca aagactttgt caaacagcac
3061 ttccccgcca ccccgctgct cgactatgca ctggaagtgg agaaaatcac cacctcaaag
3121 aagccaaatc ttatcctgaa cgtggatggt ttcatcggcg ttgcgtttgt ggacatgctt
3181 aggaactgtg gctccttcac ccgggaggaa gctgacgagt atgttgacat tggagccctc
3241 aatggcgtct ttgtgctggg aaggagtatg ggcttcatcg ggcactatct tgaccagaag
3301 aggctgaagc aagggctgta tcgtcacccc tgggacgaca tttcctatgt tctcccggaa
3361 cacatgagca tgtaaccgag ccagcagccc taccgtagaa aaaggaagac aaaaactccc
3421 tcctcgacaa tatagcggac agacagctgg aaacagagcc cgttatgggc tgggcctgga
3481 atggaaatag ccattgatgt gcaggcatgg aaagccaaca ccacaggccc attcagtcca
3541 cacagagaag cttagtattt ttttttatat atatatctat atatatataa gcatagaaat
3601 ttaaaaccaa gccaatactt gtgacgtttg cgctgctacc tgctgtatct attacatgga
3661 agactgtaag caagcgctgt cagaataatg ttcttctagg gccttatgat gttgctttct
3721 ttttttaatt agttgaaaat ttatttttcc tctagaacta gtggatccga cttttaagac
3781 ttcaggatac tatctgtttg taggaccact gtctggtatc ccacctccca ctcatcttca
3841 caccacatga agaacactgt attaatctga ttttttagga tctttttttt tttttttgtg
3901 ttatgtgtta agggtttatt tagtatccca ctgaaacgtt ctgtgtttcg gaccaatgtc
3961 tacttatgtc aaggggagga gggttggggc cattgtaccc ttagccatcg tcacacatgt
4021 ggagtagtaa cttaaatgta aagttgtaac atacaagtgt ttaaaatgga aaccgcaaag
4081 caaaaagctg tgaaacgtct cgtgtcttgt gttctctgtg ttcatgcagc tgacttgtct
4141 gttactgaag tgtgggtcca aagactcaca tctgttccgc atctgtaacc cacagagatt
4201 ctggcagctg ccacctcagt ctcttctctg tattatcatg tttggtttaa ataaactaga
4261 tagtaaaaag aattcctgca gcccggggga tccactagtt ctagagcggc gcaccgcggt
4321 ggagctccag cttttgttcc ctttagtgag
//
LOCUS RATLOX 5351 bp ss-mRNA ROD 15-FEB-1990
DEFINITION Rat aorta lysyl oxidase mRNA, complete cds.
ACCESSION J02903
KEYWORDS lysyl oxidase.
SOURCE Rat neonatal aorta, cDNA to mRNA, (library of Clontech), clones 7,
13, 8-1 and IIB.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5351)
AUTHORS Trackman,P.C., Pratt,A.M., Wolanski,A., Tang,S.-S., Offner,G.D.,
Troxler,R.F. and Kagan,H.M.
TITLE Cloning of rat aorta lysyl oxidase cDNA: Complete codons and
predicted amino acid sequence
JOURNAL Biochemistry (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.C.Trackman, 07-FEB-1990.
FEATURES from to/span description
pept 287 1516 lysyl oxidase precursor
sigp 287 349 pot. lysyl oxidase signal peptide
matp 350 1513 lysyl oxidase
site 2252 2274 region of dyad symmetry
BASE COUNT 1302 a 1528 c 1245 g 1276 t
ORIGIN 192 bp upstream of TaqI site.
1 cttctacttc agacactgtg cgctctcccg gaccgtcgtg cgccgctccc cgtcgccttc
61 caggactggg aaaggggaga ggaggacggt gccacgtccg acggcctcct gggctggggg
121 cagggtctgc cgctcgccct tggcaccagt ccctgcgacc catccccgag cctcgccctc
181 ctcctccctg ctcgaagagg tctccctcct tcgcgggatc tgagtccctg tcttcatttt
241 tctcctagcc acgtccctcc ccgagaaggg acgagccggg agcatcatgc gtttcgcctg
301 gaccgtgctc tttctgggac agctgcagtt ctgtcccctt ctccgctgcg ccccgcaggc
361 cccgcgcgag cctcccgccg cccccggtgc ctggcgccag acaatccaat gggagaacaa
421 cgggcaggtg ttcagtctgt tgagcctggg ggcgcagtac cagcctcagc gacgccgcga
481 ctccagcgcc actgccccga gagccgacgc aacgctgcag cacagccacg cacgcccatt
541 ctgctgctgc gtgacaaccg cactgcctct gcccgtgcga ggactccaag cccatctggg
601 gtcgccgcgg gtcgtccccg gcccgcagcc cgccactggt tccaagttgg tttctcgccg
661 tcgggggccg gcgatggagc ctcaaggcgc gcagaaccgg actgcgtcgc cacagcctcc
721 gcagctcagt aatctgaggc cacccagcca cgtagatcgc atggtggcga cgacccctac
781 aatccctaca agtactccga cgacaacccc tattataact actatgacac ttatgagaga
841 ccggtccggg agcaggcacc gacctggata tggcaccggt tacttccagt acggtctccc
901 ggacctggta cccgatccct actacatcca ggcatccacg tacgtacaaa agatgtctat
961 gtacaacctg agatgcgctg cggaagaaaa ctgcctggcc agttcagcat atagggcgga
1021 tgtcagagac tatgaccaca gggtactgct acgatttcct cagagagtga aaaaccaagg
1081 gacgtctgac ttcttaccaa gccgcccccg ctactcctgg gagtggcaca gctgccacca
1141 acattaccac agcatggatg aattcagcca ctacgacctg ctggatgcca gcacacagag
1201 gagagtggcc gagggccaca aagcaagctt ctgtctggag gacacttcct gtgattatgg
1261 gtaccacaga cgatttgcct gtactgcaca cacacagggg ttgagtcccg gatgttatga
1321 tacttatgca gcagacatag actgccagtg gattgatatt acagatgtac aacccggaaa
1381 ttacattcta aaggtcagtg taaaccccag ctacctggtg cctgaatcag actacagtaa
1441 caatgtcgta cgctgtgaaa ttcgctacac aggacatcac gcctatgcct caggctgcac
1501 catttcaccg tattagaaag aagctcacct tcccaaagga tgaagcagta cctggtgttt
1561 ggacctatga aaaccgtaga ttagcttaag taggaagact tagatatttt aaaaggcaaa
1621 cggaaaaaca acaaagaagg ttttgtttgg actctttcac aacaaatcac ataactggat
1681 tttgagtgtt taaatcagca ttagattggc acattttaaa tacttattca tgttgcttta
1741 tgaagtaatg gtgtttcaat tctgtgggtg catagtgggc tctttcaaag aattctgaat
1801 ttcttacctt cttttgaaat tatagtgcaa aaagaagagg atattttaat gaatgagcca
1861 caatttgaac tgattacttt ctaaattgcc agacccatga gacaatgatg atgggtttgt
1921 atttgcctca acatagattc gctttttaaa aagggtgttc ctattgtata ggcaaaaatg
1981 gatacacttg gtgctgagga agggtcaaat actaactatt gttgtcacga aatataggtc
2041 tacagcagag agatggtgag tatatattca gatagttaca tccctatata aactatgttt
2101 acattttaga tgcttttctt tctgttaatt gcttaatctc actctgactt gaggtacaac
2161 ttctgttttg gaatgaatta gataattcca gattctggtt tgataattgt tgacattccc
2221 ccatgctact ttttctgagg gcagaaacgt ctaatgtgac gactcttcac attaccatta
2281 cgaggataca cagcacagcg aaatcattcc gatgacaggt gtgatagatg gagagctaac
2341 atgcaactgc cgagtgtttc actgttagcc agaactaagt cacttgcccc acacagcaat
2401 tacaccatga atctctaaca tcacaacctt ctttcaaata cccacggact catccatcct
2461 tccatccgtc atccatccat ccgtccgtcc gtccgtcctg actgcctagt gccactgtct
2521 ggctaggcac acccactatc aacctggttc acctgtcatg gcagcctgta cccacccccg
2581 ccacacaccc cgacgctggc ctatagtgca aaggttgtgc gggctggtcc ttcccacaat
2641 gcagtactgt aatccccgtc cctcctggag cccgaattcc ttctacttca gacactgtgc
2701 gctctcccgg accgtcgtgc gccgctcccc gtcgccttcc aggactggga aaggggagag
2761 gaggacggtg ccacgtccga cggcctcctg ggctgggggc agggtctgcc gctcgccctt
2821 ggcaccagtc cctgcgaccc atccccgagc ctcgccctcc tcctccctgc tcgaagaggt
2881 ctccctcctt cgcgggatct gagtccctgt cttcattttt ctcctagcca cgtccctccc
2941 cgagaaggga cgagccggga gcatcatgcg tttcgcctgg accgtgctct ttctgggaca
3001 gctgcagttc tgtccccttc tccgctgcgc cccgcaggcc ccgcgcgagc ctcccgccgc
3061 ccccggtgcc tggcgccaga caatccaatg ggagaacaac gggcaggtgt tcagtctgtt
3121 gagcctgggg gcgcagtacc agcctcagcg acgccgcgac tccagcgcca ctgccccgag
3181 agccgacgca acgctgcagc acagccacgc acgcccattc tgctgctgcg tgacaaccgc
3241 actgcctctg cccgtgcgag gactccaagc ccatctgggg tcgccgcggg tcgtccccgg
3301 cccgcagccc gccactggtt ccaagttggt ttctcgccgt cgggggccgg cgatggagcc
3361 tcaaggcgcg cagaaccgga ctgcgtcgcc acagcctccg cagctcagta atctgaggcc
3421 acccagccac gtagatcgca tggtggcgac gacccctaca atccctacaa gtactccgac
3481 gacaacccct attataacta ctatgacact tatgagagac cggtccggga gcaggcaccg
3541 acctggatat ggcaccggtt acttccagta cggtctcccg gacctggtac ccgatcccta
3601 ctacatccag gcatccacgt acgtacaaaa gatgtctatg tacaacctga gatgcgctgc
3661 ggaagaaaac tgcctggcca gttcagcata tagggcggat gtcagagact atgaccacag
3721 ggtactgcta cgatttcctc agagagtgaa aaaccaaggg acgtctgact tcttaccaag
3781 ccgcccccgc tactcctggg agtggcacag ctgccaccaa cattaccaca gcatggatga
3841 attcagccac tacgacctgc tggatgccag cacacagagg agagtggccg agggccacaa
3901 agcaagcttc tgtctggagg acacttcctg tgattatggg taccacagac gatttgcctg
3961 tactgcacac acacaggggt tgagtcccgg atgttatgat acttatgcag cagacataga
4021 ctgccagtgg attgatatta cagatgtaca acccggaaat tacattctaa aggtcagtgt
4081 aaaccccagc tacctggtgc ctgaatcaga ctacagtaac aatgtcgtac gctgtgaaat
4141 tcgctacaca ggacatcacg cctatgcctc aggctgcacc atttcaccgt attagaaaga
4201 agctcacctt cccaaaggat gaagcagtac ctggtgtttg gacctatgaa aaccgtagat
4261 tagcttaagt aggaagactt agatatttta aaaggcaaac ggaaaaacaa caaagaaggt
4321 tttgtttgga ctctttcaca acaaatcaca taactggatt ttgagtgttt aaatcagcat
4381 tagattggca cattttaaat acttattcat gttgctttat gaagtaatgg tgtttcaatt
4441 ctgtgggtgc atagtgggct ctttcaaaga attctgaatt tcttaccttc ttttgaaatt
4501 atagtgcaaa aagaagagga tattttaatg aatgagccac aatttgaact gattactttc
4561 taaattgcca gacccatgag acaatgatga tgggtttgta tttgcctcaa catagattcg
4621 ctttttaaaa agggtgttcc tattgtatag gcaaaaatgg atacacttgg tgctgaggaa
4681 gggtcaaata ctaactattg ttgtcacgaa atataggtct acagcagaga gatggtgagt
4741 atatattcag atagttacat ccctatataa actatgttta cattttagat gcttttcttt
4801 ctgttaattg cttaatctca ctctgacttg aggtacaact tctgttttgg aatgaattag
4861 ataattccag attctggttt gataattgtt gacattcccc catgctactt tttctgaggg
4921 cagaaacgtc taatgtgacg actcttcaca ttaccattac gaggatacac agcacagcga
4981 aatcattccg atgacaggtg tgatagatgg agagctaaca tgcaactgcc gagtgtttca
5041 ctgttagcca gaactaagtc acttgcccca cacagcaatt acaccatgaa tctctaacat
5101 cacaaccttc tttcaaatac ccacggactc atccatcctt ccatccgtca tccatccatc
5161 cgtccgtccg tccgtcctga ctgcctagtg ccactgtctg gctaggcaca cccactatca
5221 acctggttca cctgtcatgg cagcctgtac ccacccccgc cacacacccc gacgctggcc
5281 tatagtgcaa aggttgtgcg ggctggtcct tcccacaatg cagtactgta atccccgtcc
5341 ctcctggagc c
//
LOCUS SHFIPAH 2900 bp ds-DNA BCT 16-FEB-1990
DEFINITION S.flexner invasion plasmid antigen (ipaH) gene, complete cds.
ACCESSION M32063
KEYWORDS invasion plasmid antigen.
SOURCE S.flexner (strain M90T-W), serotype S) DNA, clone pWR390.
ORGANISM Shigella flexneri
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2900)
AUTHORS Hartman,A.B., Venkatesan,M.M., Oaks,E.V. and Buysse,J.M.
TITLE Sequence and molecular characterization of a multicopy invasion
plasmid antigen gene, ipaH, of Shigella flexner
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A.B.Hartman, 12-FEB-1990.
FEATURES from to/span description
pept 251 1849 invasion plasmid antigen
pept 2277 > 2900 ORF3
signal 139 144 -35 region
signal 162 167 -10 region
signal 2111 2116 -35 region
signal 2132 2137 -10 region
BASE COUNT 868 a 657 c 578 g 797 t
ORIGIN
1 catagaaaac ctccataaat aaattacaac taacttctgt tatgtgtaaa atggaaacta
61 ttaaaactta atatcggaaa tggtaagtga aatttgtata aatatacaat tttaaatatt
121 tattctcaca aatataaggt tgacctagca ttatgttctc tgtaaataat acacactcat
181 cagtttcttg ctccccctct attaactcaa actcaaccag taatgaacat tatctgagaa
241 tcctgactga atgggaaaag aactcttctc ccgggaagag cgaggcattg cttttaacag
301 actctcccag tgctttcaga atcaagaagc agtattaaat ttatcagacc taaatttgac
361 gtctcttccc gaattaccaa agcatatttc tgctttgatt gtagaaaata ataaattaac
421 atcattgcca aagctgcctg catttcttaa agaacttaat gctgataata acaggctttc
481 tgtgatacca gaacttcctg agtcattaac aactttaagt gttcgttcta atcaactgga
541 aaaccttcct gttttgccaa accatttaac atcattattt gttgaaaata acaggctata
601 taacttaccg gctcttcccg aaaaattgaa atttttacat gtttattata acaggctgac
661 aacattaccc gacttaccgg ataaactgga aattctctgt gctcagcgca ataatctggt
721 tacttttcct caattttctg atagaaacaa tatcagacaa aaggaatatt attttcattt
781 taatcagata accactcttc cggagagttt ttcacaatta gattcaagtt acaggattaa
841 tatttcaggg aatccattgt cgactcgcgt tctgcaatcc ctgcaaagat taacctcttc
901 gccggactac cacggcccgc agatttactt ctccatgagt gacggacaac agaatacact
961 ccatcgcccc ctggctgatg ccgtgacagc atggttcccg gaaaacaaac aatctgatgt
1021 atcacagata tggcatgctt ttgaacatga agagcatgcc aacacctttt ccgcgttcct
1081 tgaccgcctt tccgataccg tctctgcacg caatacctcc ggattccgtg aacaggtcgc
1141 tgcatggctg gaaaaactca gtgcctctgc ggagcttcga cagcagtctt tcgctgttgc
1201 tgctgatgcc actgagagct gtgaggaccg tgtcgcgctc acatggaaca atctccggaa
1261 aaccctcctg gtccatcagg catcagaagg ccttttcgat aatgataccg gcgctctgct
1321 ctccctgggc agggaaatgt tccgcctcga aattctggag gacattgccc gggataaagt
1381 cagaactctc cattttgtgg atgagataga agtctacctg gccttccaga ccatgctcgc
1441 agagaaactt cagctctcca ctgccgtgaa ggaaatgcgt ttctatggcg tgtcgggagt
1501 gacagcaaat gacctccgca ctgccgaagc tatggtcaga agccgtgaag agaatgaatt
1561 tacggactgg ttctccctct ggggaccatg gcatgctgta ctgaagcgta cggaagctga
1621 ccgctgggcg caggcagaag agcagaagta tgagatgctg gagaatgagt actctcagag
1681 ggtggctgac cggctgaaag catcaggtct gagcggtgat gcggatgcgc agagggaagc
1741 cggtgcacag gtgatgcgtg agactgaaca gcagatttac cgtcagctga ctgacgaggt
1801 actggccctg cgattgtctg aaaacggctc acgactgcac cattcataat cacgtcgcat
1861 aagcataaac cgcagaccgg attgactccg gaaaaactgt gacccgatta cggaccttaa
1921 caacaacccg taaatcctcg ctcaataccg gcagggattt acggcgtgca actgactttt
1981 ttgaggggat aaccaaccag atcgtttgct atgggaatat cgagacagta atgagttaaa
2041 tgataaaaat tgtttgaaaa tataggggat aaagatcaat ccaaactgga tgaaagtaga
2101 actggtcaca ttaacatggg tagactgata taacaatcga cggttactgg aaagacagga
2161 acatattcct ccagccggaa tgaaaacgcc gataaagctc taggattgtt tttttaaaga
2221 ctttctcgtt ttatttgcat taatagacca agatatgaat agtgaggggt taataaatga
2281 aaccgatcaa caatcattct ttttttcgtt ccctttgtgg cttatcatgt atatctcgtt
2341 tatcggtaga agaacagtgt accagagatt accaccgcat ctgggatgac tgggctaggg
2401 aaggaacaac aacagaaaat cgcatccagg cggttcgatt attgaaaata tgtctggata
2461 cccgggagcc tgttctcaat ttaagcttac tgaaactacg ttctttacca ccactccctt
2521 tgcatatacg tgaacttaat atttccaaca atgagttaat ctccctacct gaaaattctc
2581 cgcttttgac agaacttcat gtaaatggta acaacttgaa tatactcccg acacttccat
2641 ctcaactgat taagcttaat atttcattca atcgaaattt gtcatgtctg ccatcattac
2701 caccatattt acaatcactc tcggcacgtt ttaatagtct ggagacgtta ccagagcttc
2761 catcaacgct aacaatatta cgtattgaag gtaatcgcct tactgtcttg cctgaattgc
2821 ctcatagact acaagaactc tttgtttccg gcaacagact acaggaacta ccagaatttc
2881 ctcagagctt aaaatatttg
//
LOCUS VACCSBP 1020 bp ds-DNA VRL 06-DEC-1989
DEFINITION Vaccinia virus cell surface-binding protein gene, complete cds.
ACCESSION J05190
KEYWORDS antigen; carbonic anhydrase-related transmembrane protein;
cell surface-binding protein; envelope protein.
SOURCE Vaccinia virus (wild type WR) DNA.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 1020)
AUTHORS Maa,J.-S., Rodriguez,J.F. and Esteban,M.
TITLE Structural and functional characterization of a cell surface
binding protein of vaccinia virus
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Esteban 30-NOV-1989.
FEATURES from to/span description
pept 61 975 cell surface-binding protein
site 822 972 attachment site
site 9 19 alpha helix
site 103 114 alpha helix
site 270 280 alpha helix
site 286 292 alpha helix
BASE COUNT 354 a 182 c 167 g 317 t
ORIGIN Map position HindIII-D.
1 catccattgt aattcccata ctaagagcta tttttaaaca gttatcattt catttttact
61 atgccgcaac aactatctcc tattaatata gaaactaaaa aagcaatttc taacgcgcga
121 ttgaagccgt tagacataca ttataatgag tcgaaaccaa ccactatcca gaacactgga
181 aaactagtaa ggattaattt taaaggagga tatataagtg gagggtttct ccccaatgaa
241 tatgtgttat catcactaca tatatattgg ggaaaggaag acgattatgg atccaatcac
301 ttgatagatg tgtacaaata ctctggagag attaatcttg ttcattggaa taagaaaaaa
361 tatagttctt atgaagaggc aaaaaaacac gatgatggac ttatcattat ttctatattc
421 ttacaagtat tggatcataa aaatgtatat tttcaaaaga tagttaatca attgcattcc
481 attagatccg ccaatacgtc tgcaccgttt gattcagtat tttatctaga caatttgctg
541 cctagtaagt tggattattt tacatatcta ggaacaacta tcaaccactc tgcagacgct
601 gtatggataa tttttccaac gccaataaac attcattctg atcaactatc taaattcaga
661 acactattgt cgtcgtctaa tcatgatgga aaaccgcatt atataacaga gaactataga
721 aatccgtata aattgaacga cgacacgcaa gtatattatt ctggggagat tatacgagca
781 gcaactacct ctccagcgcg cgagaactat tttatgagat ggttgtccga tttgagagag
841 acatgttttt catattatca aaaatatatc gaagagaata aaacattcgc aattattgcc
901 atagtattcg tgtttatact taccgctatt ctctttttta tgagtcgacg atattcgcga
961 gaaaaacaaa actagattcg ataccttgtt gagcctccat tagaacggca gtgacttcgc
//GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS CATL11AA 424 bp ds-DNA MAM 28-FEB-1990
DEFINITION Cat LINE-1 DNA sequence region 1.
ACCESSION M32251
KEYWORDS L1 repetitive sequence; repetitive sequence.
SOURCE Cat DNA.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 424)
AUTHORS Fanning,T. and Singer,S.
TITLE The LINE-1 DNA sequences in four mammalian orders predict proteins
that conserve homologies to retrovirus proteins
JOURNAL Nucleic Acids Res. 15, 2251-2260 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rpt 1 424 L1 repetitive element
BASE COUNT 182 a 71 c 61 g 110 t
ORIGIN
1 caaggatttt tcaatacaaa aagcaatatc agtgtaagat accatattaa cagaatgaag
61 ggaaaaaacc cacatcatta tctcaactga tacagaaaaa gcatttgaca acattcaata
121 ccctttcatg ataaaaacat tgtgaaaact agggatagaa gaaattacct tcattgaata
181 aaagccatat atgaaaaacc cacaagtaat atcatattca atggtaaaag actaaaatct
241 tttcctctaa atcaggaaaa ggcaaggatg cctactttca ccacttctat tcaatgtgta
301 ctggaaattc tagctagagc aattaggcaa gaaaaagaaa ggcatccaaa ttgaaaagga
361 tgaagtaaaa ttattcctat ttgcagttga tatgatctta tacatagaaa accctaaaga
421 ttac
//
LOCUS CATL12AA 377 bp ds-DNA MAM 28-FEB-1990
DEFINITION Cat LINE-1 DNA sequence region 2.
ACCESSION M32252
KEYWORDS L1 repetitive sequence; repetitive sequence.
SOURCE Cat DNA.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 377)
AUTHORS Fanning,T. and Singer,S.
TITLE The LINE-1 DNA sequences in four mammalian orders predict proteins
that conserve homologies to retrovirus proteins
JOURNAL Nucleic Acids Res. 15, 2251-2260 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rpt 1 377 L1 repetitive element
BASE COUNT 146 a 81 c 85 g 65 t
ORIGIN
1 gcaatgacac atcggacaaa gggctagtat ccaaaacctg taaagaactc accaaactcc
61 acacccaaaa aacaaataat ccagtgaaga aatgggcaga agacatgaat agacactttc
121 ctaaacaaga ccagatggtc aacaggcaca tgaaaagatg ctcaacgtca ctccccatga
181 gggagataca aatcaaaacc acaatgagat atcacctcac accagtcaga gtggctaaaa
241 tgagaaaatc aggagatgct gatgctggag aggatgtgga gaaacgggaa ccctcttgca
301 ctgttggtgg gaatgcaagc aaactggtgc agctgctctg gaaaacagtg tggaggttcc
361 tcagaaaatt aaaaata
//
LOCUS HUMCOL4A6 1056 bp ss-mRNA PRI 09-MAR-1990
DEFINITION Human collagen type IV mRNA.
ACCESSION M33653
KEYWORDS alternative splicing; collagen type IV.
SOURCE Human tumor cell line HT-1080, cDNA to mRNA, clones HT-[125,133].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1056)
AUTHORS Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and
Prockop,D.J.
TITLE Partial characterization of a low molecular weight human collagen
that undergoes alternative splicing
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 740 collagen type IV /hgml_locus_uid="LJ0118J"
/map="13q34" /nomgen="COL4A2"
BASE COUNT 277 a 228 c 333 g 218 t
ORIGIN
1 agatccggac gctggccttg atggggcctc ttggtcttct tgggcaaatt ggcccacttg
61 gagctctagg gatgctaggc cagaaggggg agattggact gctaggccct ctaggacacg
121 atgggaaagg acctcgcggt aaactaggag acatgggccc tcttggtccc caaggccccc
181 taggaaagga tggacctcca ggagtgaagg gagaaaacgg gcacccaggg agcccaggag
241 agaaggggga aaaaggggag acaggacaag caggctcacc ggttcctggg ctgctagggc
301 cagaggggcc tcccggacct ccggggctcc aaggtgttcc tggaccaaag ggggaagcag
361 gactagacgg agcaaaagga gagaaaggct tccagggaga aaaaggagac cgtggtcccc
421 tgggactacc cggagcttca ggtttggacg gcaggcctgg gccaccgggt actccaggac
481 caattggagt tccaggccca gcgggaccaa agggcgagag gggcagcaaa ggagaccctg
541 ggatgacagg accaacggga gcagctgggc ttcctggttt acatggacca cccggggaca
601 agggaaaccg gggggagagg gggaagaaag gctctagagg gcctaaaggg gacaagggag
661 accaaggagc gcctggatta gatgccccct gcccattggg gcaagatggc ttaccagtcc
721 aaggctgctg gaacaagtga tgcctctaac cttggattgg cctgtgtgtg tgtttgtaca
781 tagaatattt atttttatac agttttcact ttttgaaaat gccagaagta tgatgcatct
841 tacagattat taaaaaagaa agaaaaaccg ttgcatattt tgtacagaaa atatcaacct
901 cttccctttt gtttacaaga tgttttgtat aagcctatgt ctctaataca ttttttgttt
961 ggtcgtaatg tctgcatgat atttgtgcac atttattaag tatcgaaggt taataaattg
1021 ttgtgtcctg gtgccaaggg gggccagcca gaactg
//
LOCUS HUMCOL4A7 629 bp ss-mRNA PRI 09-MAR-1990
DEFINITION Human collagen type IV mRNA.
ACCESSION M15524
KEYWORDS alternative splicing; collagen type IV.
SOURCE Human tumor cell line HT-1080, cDNA to mRNA, clone HT-127.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 629)
AUTHORS Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and
Prockop,D.J.
TITLE Partial characterization of a low molecular weight human collagen
that undergoes alternative splicing
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 403 collagen type IV (AA at 2)
/hgml_locus_uid="LJ0118J" /map="13q34"
/nomgen="COL4A2"
BASE COUNT 175 a 131 c 185 g 138 t
ORIGIN
1 agaggggcct cccggacctc cggggctcca aggtgttcct ggaccaaagg gggaagcagg
61 actagacgga gcaaaaggag agaaaggctt ccagggagaa aaaggagacc gtggtcccct
121 gggactaccc ggtactccag gaccaattgg agttccaggc ccagcgggac caaagggcga
181 gaggggcagc aaaggagacc ctgggatgac aggaccaacg ggagcagctg ggcttcctgg
241 tttacatgga ccacccgggg acaagggaaa ccggggggag agggggaaga aaggctctag
301 agggcctaaa ggggacaagg gagaccaagg agcgcctgga ttagatgccc cctgcccatt
361 ggggcaagat ggcttaccag tccaaggctg ctggaacaag tgatgcctct aaccttggat
421 tggcctgtgt gtgtgtttgt acatagaata tttattttta tacagttttc actttttgaa
481 aatgccagaa gtatgatgca tcttacagat tattaaaaaa gaaagaaaaa ccgttgcata
541 ttttgtacag aaaatatcaa cctcttccct tttgtttaca agatgttttg tataagccta
601 tgtctctaat acattttttg tttggtcgt
//
LOCUS HUMRB1RA 4600 bp ss-mRNA PRI 09-APR-1990
DEFINITION Human retinoblastoma associated (RB1) mRNA.
ACCESSION M33647 J02994
KEYWORDS retinoblastoma protein.
SOURCE Human retinoblast cDNA to mRNA, clone p4.7R.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4600)
AUTHORS Friend,S.H., Horowitz,J.M., Gerber,M.R., Wang,X.-F., Bogenmann,E.,
Li,F.P. and Weinberg,R.A.
TITLE Deletions of a DNA sequence in retinoblastomas and mesenchymal
tumors: Organization of the sequence and its encoded protein
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9059-9063 (1987)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence for [1] kindly
provided by S.H.Friend, 10-FEB-1988.
FEATURES from to/span description
pept 4 2790 retinoblastoma-associated protein
/hgml_locus_uid="LE0159J" /map="13q14.2"
/nomgen="RB1"
BASE COUNT 1489 a 842 c 815 g 1454 t
ORIGIN
1 gtcatgccgc ccaaaacccc ccgaaaaacg gccgccaccg ccgccgctgc cgccgcggaa
61 cccccggcac cgccgccgcc gccccctcct gaggaggacc cagagcagga cagcggcccg
121 gaggacctgc ctctcgtcag gcttgagttt gaagaaacag aagaacctga ttttactgca
181 ttatgtcaga aattaaagat accagatcat gtcagagaga gagcttggtt aacttgggag
241 aaagtttcat ctgtggatgg agtattggga ggttatattc aaaagaaaaa ggaactgtgg
301 ggaatctgta tctttattgc agcagttgac ctagatgaga tgtcgttcac ttttactgag
361 ctacagaaaa acatagaaat cagtgtccat aaattcttta acttactaaa agaaattgat
421 accagtacca aagttgataa tgctatgtca agactgttga agaagtatga tgtattgttt
481 gcactcttca gcaaattgga aaggacatgt gaacttatat atttgacaca acccagcagt
541 tcgatatcta ctgaaataaa ttctgcattg gtgctaaaag tttcttggat cacattttta
601 ttagctaaag gggaagtatt acaaatggaa gatgatctgg tgatttcatt tcagttaatg
661 ctatgtgtcc ttgactattt tattaaactc tcacctccca tgttgctcaa agaaccatat
721 aaaacagctg ttatacccat taatggttca cctcgaacac ccaggcgagg tcagaacagg
781 agtgcacgga tagcaaaaca actagaaaat gatacaagaa ttattgaagt tctctgtaaa
841 gaacatgaat gtaatataga tgaggtgaaa aatgtttatt tcaaaaattt tatacctttt
901 atgaattctc ttggacttgt aacatctaat ggacttccag aggttgaaaa tctttctaaa
961 cgatacgaag aaatttatct taaaaataaa gatctagatg caagattatt tttggatcat
1021 gataaaactc ttcagactga ttctatagac agttttgaaa cacagagaac accacgaaaa
1081 agtaaccttg atgaagaggt gaatgtaatt cctccacaca ctccagttag gactgttatg
1141 aacactatcc aacaattaat gatgatttta aattcagcaa gtgatcaacc ttcagaaaat
1201 ctgatttcct attttaacaa ctgcacagtg aatccaaaag aaagtatact gaaaagagtg
1261 aaggatatag gatacatctt taaagagaaa tttgctaaag ctgtgggaca gggttgtgtc
1321 gaaattggat cacagcgata caaacttgga gttcgcttgt attaccgagt aatggaatcc
1381 atgcttaaat cagaagaaga acgattatcc attcaaaatt ttagcaaact tctgaatgac
1441 aacatttttc atatgtcttt attggcgtgc gctcttgagg ttgtaatggc cacatatagc
1501 agaagtacat ctcagaatct tgattctgga acagatttgt ctttcccatg gattctgaat
1561 gtgcttaatt taaaagcctt tgatttttac aaagtgatcg aaagttttat caaagcagaa
1621 ggcaacttga caagagaaat gataaaacat ttagaacgat gtgaacatcg aatcatggaa
1681 tcccttgcat ggctctcaga ttcaccttta tttgatctta ttaaacaatc aaaggaccga
1741 gaaggaccaa ctgatcacct tgaatctgct tgtcctctta atcttcctct ccagaataat
1801 cacactgcag cagatatgta tctttctcct gtaagatctc caaagaaaaa aggttcaact
1861 acgcgtgtaa attctactgc aaatgcagag acacaagcaa cctcagcctt ccagacccag
1921 aagccattga aatctacctc tctttcactg ttttataaaa aagtgtatcg gctagcctat
1981 ctccggctaa atacactttg tgaacgcctt ctgtctgagc acccagaatt agaacatatc
2041 atctggaccc ttttccagca caccctgcag aatgagtatg aactcatgag agacaggcat
2101 ttggaccaaa ttatgatgtg ttccatgtat ggcatatgca aagtgaagaa tatagacctt
2161 aaattcaaaa tcattgtaac agcatacaag gatcttcctc atgctgttca ggagacattc
2221 aaacgtgttt tgatcaaaga agaggagtat gattctatta tagtattcta taactcggtc
2281 ttcatgcaga gactgaaaac aaatattttg cagtatgctt ccaccaggcc ccctaccttg
2341 tcaccaatac ctcacattcc tcgaagccct tacaagtttc ctagttcacc cttacggatt
2401 cctggaggga acatctatat ttcacccctg aagagtccat ataaaatttc agaaggtctg
2461 ccaacaccaa caaaaatgac tccaagatca agaatcttag tatcaattgg tgaatcattc
2521 gggacttctg agaagttcca gaaaataaat cagatggtat gtaacagcga ccgtgtgctc
2581 aaaagaagtg ctgaaggaag caaccctcct aaaccactga aaaaactacg ctttgatatt
2641 gaaggatcag atgaagcaga tggaagtaaa catctcccag gagagtccaa atttcagcag
2701 aaactggcag aaatgacttc tactcgaaca cgaatgcaaa agcagaaaat gaatgatagc
2761 atggatacct caaacaagga agagaaatga ggatctcagg accttggtgg acactgtgta
2821 cacctctgga ttcattgtct ctcacagatg tgactgtata actttcccag gttctgttta
2881 tggccacatt taatatcttc agctcttttt gtggatataa aatgtgcaga tgcaattgtt
2941 tgggtgattc ctaagccact tgaaatgtta gtcattgtta tttatacaag attgaaaatc
3001 ttgtgtaaat cctgccattt aaaaagttgt agcagattgt ttcctcttcc aaagtaaaat
3061 tgctgtgctt tatggatagt aagaatggcc ctagagtggg agtcctgata acccaggcct
3121 gtctgactac tttgccttct tttgtagcat ataggtgatg tttgctcttg tttttattaa
3181 tttatatgta tattttttta atttaacatg aacaccctta gaaaatgtgt cctatctatc
3241 ttccaaatgc aatttgattg actgcccatt caccaaaatt atcctgaact cttctgcaaa
3301 aatggatatt attagaaatt agaaaaaaat tactaatttt acacattaga ttttatttta
3361 ctattggaat ctgatatact gtgtgcttgt tttataaaat tttgctttta attaaataaa
3421 agctggaagc aaagtataac catatgatac tatcatacta ctgaaacaga tttcatacct
3481 cagaatgtaa aagaacttac tgattatttt cttcatccaa cttatgtttt taaatgagga
3541 ttattgatag tactcttggt ttttatacca ttcagatcac tgaatttata aagtacccat
3601 ctagtacttg aaaaagtaaa gtgttctgcc agatcttagg tatagaggac cctaacacag
3661 tatatcccaa gtgcactttc taatgtttct gggtcctgaa gaattaagat acaaattaat
3721 tttactccat aaacagactg ttaattatag gagccttaat ttttttttca tagagatttg
3781 tctaattgca tctcaaaatt attctgccct ccttaatttg ggaaggtttg tgttttctct
3841 ggaatggtac atgtcttcca tgtatctttt gaactggcaa ttgtctattt atcttttatt
3901 tttttaagtc agtatggtct aacactggca tgttcaaagc cacattattt ctagtccaaa
3961 attacaagta atcaagggtc attatgggtt aggcattaat gtttctatct gattttgtgc
4021 aaaagcttca aattaaaaca gctgcattag aaaaagaggc gcttctcccc tcccctacac
4081 ctaaaggtgt atttaaacta tcttgtgtga ttaacttatt tagagatgct gtaacttaaa
4141 ataggggata tttaaggtag cttcagctag cttttaggaa aatcactttg tctaactcag
4201 aattattttt aaaaagaaat ctggtcttgt tagaaaacaa aattttattt tgtgctcatt
4261 taagtttcaa acttactatt ttgacagtta ttttgataac aatgacacta gaaaacttga
4321 ctccatttca tcattgtttc tgcatgaata tcatacaaat cagttagttt ttaggtcaag
4381 ggcttactat ttctgggtct tttgctacta agttcacatt agaattagtg ccagaatttt
4441 aggaacttca gagatcgtgt attgagattt cttaaataat gcttcagata ttattgcttt
4501 attgcttttt tgtattggtt aaaactgtac atttaaaatt gctatgttac tattttctac
4561 aattaatagt ttgtctattt taaaataaat tagttgttag
//
LOCUS MV1MOBPA 2436 bp ds-DNA BCT 15-DEC-1989
DEFINITION Plasmid pMV158 mobilization protein gene.
ACCESSION M28538
KEYWORDS mobilization.
SOURCE Plasmid pMV158 from Streptococcus agalactiae, DNA.
ORGANISM Plasmid pMV158
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 2436)
AUTHORS Priebe,S.D. and Lacks,S.A.
TITLE Region of the streptococcal plasmid pMV158 required for conjugative
mobilization
JOURNAL J. Bacteriol. 171, 4778-4784 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence for [1] kindly
provided by S.A.Lacks, 18-JAN-1990.
FEATURES from to/span description
pept 628 2112 mobilization peptide
BASE COUNT 824 a 393 c 565 g 654 t
ORIGIN
1 agggatttct aaatcgttaa gggatcaact ttgggagaga gttcaaaatt gatccttttt
61 ttataacagg aattcaaatc tttttgttcc attaaagggc gcgattgctg aataaaagat
121 acgagagacc tctcttgtat cttttttatt ttgagtggtt ttgtccgtta cactagaaaa
181 ccgaaagaca ataaaaattt tattcttgct gagtctggct ttcggtaagc tagacaaaac
241 ggacaaaata aaaatcagca agggacaggt agtatttttt gagtgatctt ctcaaaaaat
301 actacctgtc ccttgctgat ttttaaacga gcacgagagc aaaacccccc tttgctgagg
361 tggcagaggg caggtttttt tgtttctttt ttctcgtaaa aaaaagaaag gtcttaaagg
421 ttttatggtt ttggtcggca ctgccgacag cctcgcagag cacacacttt atgaatataa
481 agtatagtgt gttatacttt acatggaagt tataccgaaa ttgtgctaaa ctaccagtta
541 agatgttgct gattaagacg agcaatagac tgtgtcggct cgtctgtctt tcggtaatcg
601 aaagacaagt cttaaggagg gaaatctatg agttacatgg ttgcaagaat gcagaagatg
661 aaagctggca atttgggagg agcttttaag cataatgaac gtgtttttga gacgcactca
721 aataaagata taaacccgag caggtcacac ctcaactacg agttgacgga tcgtgaccgc
781 tcggtgtcgt atgaaaagca gattaaagac tatgtgaacg aaaataaggt ttctaatcga
841 gcaatccgaa aagatgcagt tctatgtgat gagtggatta tcacatctga taaagatttt
901 tttgaaaagt tggatgagga acagacgaga acgttttttg aaacggctaa aaattatttt
961 gcagagaact atggcgagtc aaatattgct tatgcgagtg ttcacttgga tgaaagcacc
1021 cctcacatgc acatgggagt agtacctttt gagaacggta aattgtcatc aaaagcaatg
1081 tttgaccgtg aggagctaaa acatatccaa gaggacttgc ctagatacat gagcgaccac
1141 ggttttgagt tggaacgtgg caagctgaat agtgaagcta agcataagac tgtagctgaa
1201 ttcaagaggg caatggctga tatggaactc aaagaggaac ttcttgaaaa atatcatgca
1261 ccgctttttg ttgatgagag aacaggcgag ttgaacaatg acacggaagc tttttggcat
1321 gaaaaagagt ttgctgatat gtttgaagtt caatctccga tacgtgaaac aactaaccaa
1381 gaaaaaatgg actggttaag aaaacagtac caagaagagc tgaaaaaact agaatcgtct
1441 aaaaagcccc tagaagacga tttaagccat ttagaagagt tgcttgataa aaagaccaag
1501 gaatatatta aaatcgattc tgaggcctct gagagggcct cagagctatc taaagccgag
1561 ggatatataa ataccctaga aaatcattcg aagagcttag aagcgaaaat agagtgttta
1621 gagagtgata atctacaatt ggaaaaacaa aaggcgacaa aactcgaagc gaaagcgttg
1681 aacgagagtg agttgcgaga actaaagcct aagaagaatt ttctaggaaa agagcattat
1741 gagttaagtc ctgaacaatt tgaagggttg aaggcagaag tttatcgtag tagaactcta
1801 ttgcaccaca aagatattga actggagcaa gcaaaacgtc aagtatctct gagagcctct
1861 aaaaactatt ttacagctag tttagagcga gctaaggaaa aagctaaagg tgagagtata
1921 gaccgtctta aaagcgaaat aaagcgacta aaaaacgaaa attcaatttt acgtcagcaa
1981 aatgacaaga tgctagggaa attaagagag ttaatgcctg ataaagcctt taagaatttg
2041 ttatcagaac ttaaggcgat taagccaatc gtgaatataa ttaaaaaggc tattgaaaag
2101 agcttgttct gagcgattta tgccgtgaaa gctatttgac aataagcagt gacagagtac
2161 gctaggacgt gccgagccga aaggctttag cgtttcggac ggacacggac aaaggacggc
2221 agtcactggt tacttgttgt caaatagacc atggaataaa aagcgtcaaa agtcttgagt
2281 ggatgatacc ctatggtact ctattcgcct tttgactttt ttgctataat ttaagtgtcg
2341 ccagttcttc cgtcaggtaa tgcgaactta gactggaggt gagcgttgtg aagacattcc
2401 tcgagcttgt ctttgtccct tttgtggttg gcgttg
//
LOCUS RATIGFI1 958 bp ss-mRNA ROD 15-JUN-1988
DEFINITION Rat mRNA (clone IGF1AB1) for insulin-like growth factor I.
ACCESSION X06107 Y00429 M32260
KEYWORDS insulin-like growth factor I.
SOURCE rat (Rattus norvegicus).
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 958; enum. 1 to 958)
AUTHORS Rotwein,P.
JOURNAL Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA.
STANDARD simple automatic
REFERENCE 2 (bases ??)
AUTHORS Shimatsu,A. and Rotwein,P.
TITLE Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing
Within the 5' Untranslated Region
JOURNAL Nucleic Acids Res. 15, 7196-7196 (1987)
STANDARD simple automatic
COMMENT Another IGF-I mRNA of rat liver differing in the 5' UT-region is
described in $<X06118>$
*source: tissue=liver; strain=Sprague-Dawley;
EMBL features not translated to GenBank features:
key from to description
SITE 1 258 5' UT-region
SITE 736 958 3' UT-region
FEATURES from to/span description
pept 259 738 IGF-I (AA 1-159)
BASE COUNT 260 a 246 c 209 g 243 t
ORIGIN
1 cgaatgttcc cccagctgtt tcctgtctac agtgtctgtg ttttgtagat aaatgtgagg
61 attttctcta aatccctctt ctgcttgcta aatctcactg tcgctgctaa attcagacca
121 gatagagcct gcgcaatcga aataaagtcc tcaaaattga aatgtgactt tgctctaaca
181 tctcccatct ctctggattc tttttgcctc attattcctg cccaccaatt catttccaga
241 ctttgtactt cagaagcgat ggggaaaatc agcagtcttc caactcaatt atttaagatc
301 tgcctctgtg acttcttgaa gataaagata cacatcatgt cgtcttcaca tctcttctac
361 ctggcactct gcttgctcac ctttaccagc tcggccacag ccggaccaga gaccctttgc
421 ggggctgagc tggtggacgc tcttcagttc gtgtgtggac caaggggctt ttacttcaac
481 aagcccacag gctatggctc cagcattcgg agggcaccac agacgggcat tgtggatgag
541 tgttgcttcc ggagctgtga tctgaggagg ctggagatgt actgtgctcc gctgaagcct
601 acaaagtcag ctcgttccat ccgggcccag cgccacactg acatgcccaa gactcagaag
661 tcccagcccc tatcgacaca caagaaaagg aagctgcaaa ggagaaggaa aggaagtaca
721 cttgaagaac acaagtagag gaagtgcagg aaacaagacc tacagaatgt aggaggagcc
781 tcccgaggaa cagaaaatgc cacgtcaccg caagatcctt tgctgcttga gcaacctgca
841 aaacatcgga acacctgcca aatatcaata atgagttcaa taccatttca gagatgggca
901 tttccctcaa tgaaatacac aagtaaacat tccgacattg tctttaggag tgtttgtt
//
LOCUS RATIGFI2 798 bp ss-mRNA ROD 15-JUN-1988
DEFINITION Rat mRNA (clone IGF1AB2) for insulin-like growth factor I.
ACCESSION X06108 Y00429 M32339
KEYWORDS insulin-like growth factor I.
SOURCE rat (Rattus norvegicus).
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 798)
AUTHORS Rotwein,P.
JOURNAL Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA.
STANDARD simple automatic
REFERENCE 2 (bases ??)
AUTHORS Shimatsu,A. and Rotwein,P.
TITLE Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing
Within the 5' Untranslated Region
JOURNAL Nucleic Acids Res. 15, 7196-7196 (1987)
STANDARD simple automatic
COMMENT Another IGF-I mRNA of rat liver differing in the 5' UT-region is
described in $<XX6107>$.. *source: tissue=liver;
strain=Sprague-Dawley;
EMBL features not translated to GenBank features:
key from to description
SITE 1 72 5' UT-region
SITE 553 798 3' UT-region
FEATURES from to/span description
pept 73 552 IGF-I (AA 1-159)
BASE COUNT 238 a 196 c 183 g 181 t
ORIGIN
1 gaatgttccc ccagctgttt cctgtctaca gtgtctgtgt tttgtagata aatactttgt
61 acttcagaag cgatggggaa aatcagcagt cttccaactc aattatttaa gatctgcctc
121 tgtgacttct tgaagataaa gatacacatc atgtcgtctt cacatctctt ctacctggca
181 ctctgcttgc tcacctttac cagctcggcc acagccggac cagagaccct ttgcggggct
241 gagctggtgg acgctcttca gttcgtgtgt ggaccaaggg gcttttactt caacaagccc
301 acaggctatg gctccagcat tcggagggca ccacagacgg gcattgtgga tgagtgttgc
361 ttccggagct gtgatctgag gaggctggag atgtactgtg ctccgctgaa gcctacaaag
421 tcagctcgtt ccatccgggc ccagcgccac actgacatgc ccaagactca gaagtcccag
481 cccctatcga cacacaagaa aaggaagctg caaaggagaa ggaaaggaag tacacttgaa
541 gaacacaagt agaggaagtg caggaaacaa gacctacaga atgtaggagg agcctcccga
601 ggaacagaaa atgccacgtc accgcaagat cctttgctgc ttgagcaacc tgcaaaacat
661 cggaacacct gccaaatatc aataatgagt tcaataccat ttcagagatg ggcatttccc
721 tcaatgaaat acacaagtaa acattccgac attgtcttta ggagtgtttg ttaaaaaaaa
781 aaaaaaaaaa acaaaaac
//
LOCUS VSVNJLA 590 bp ss-RNA VRL 14-MAR-1990
DEFINITION Vesicular stomatitis virus L gene.
ACCESSION M17436
KEYWORDS L gene.
SOURCE Vesicular stomatitis virus (New Jersey serotype; Ogden strain) cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 590)
AUTHORS Luk,D., Masters,P.S., Gill,D.S. and Banerjee,A.K.
TITLE Intergenic sequences of the vesicular stomatitis virus genome (New
Jersey serotype): Evidence for two transcription initiation sites
within the L gene
JOURNAL Virology 160, 88-94 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 66 > 590 L protein
BASE COUNT 189 a 110 c 117 g 174 t
ORIGIN
1 atggctcatt tccgttaata tatgaaaaaa actggaatag aagcataact ccaacaggaa
61 tcaaaatgga tttcgatcta atagaagatt ctgctaactg ggaagatgat gagtcggact
121 tttttttgcg agacattcta tctcaagaag atcagatgtc ttacttaaac accgcagact
181 ataatttgaa ttctccactc atttccgatg acatggttta tataattaaa agaatgaatc
241 acgaggaggt tccccccatt tggagatcta aggagtggga ttctcctctt gatatgctta
301 gaggttgtca agctcaaccc atgtctcacc aagagatgca caattggttt ggaacttgga
361 ttcaaaatgt tcagcatgat tccgcccaac ccttcacatt cttaaaagag gtggacaaag
421 agtcagaaat gacctatgac ttggtctcca cttttctgaa ggggtgggtg gggaaagatt
481 acccattcaa gtcaaagaac aaagaaattg atagtatggc attagtcggt cctctttgtc
541 aaaaattcct tgatttgcat aaaatcactc tgatattaaa tgcagtttct
//GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS CATL11AA 424 bp ds-DNA MAM 28-FEB-1990
DEFINITION Cat LINE-1 DNA sequence region 1.
ACCESSION M32251
KEYWORDS L1 repetitive sequence; repetitive sequence.
SOURCE Cat DNA.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 424)
AUTHORS Fanning,T. and Singer,S.
TITLE The LINE-1 DNA sequences in four mammalian orders predict proteins
that conserve homologies to retrovirus proteins
JOURNAL Nucleic Acids Res. 15, 2251-2260 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rpt 1 424 L1 repetitive element
BASE COUNT 182 a 71 c 61 g 110 t
ORIGIN
1 caaggatttt tcaatacaaa aagcaatatc agtgtaagat accatattaa cagaatgaag
61 ggaaaaaacc cacatcatta tctcaactga tacagaaaaa gcatttgaca acattcaata
121 ccctttcatg ataaaaacat tgtgaaaact agggatagaa gaaattacct tcattgaata
181 aaagccatat atgaaaaacc cacaagtaat atcatattca atggtaaaag actaaaatct
241 tttcctctaa atcaggaaaa ggcaaggatg cctactttca ccacttctat tcaatgtgta
301 ctggaaattc tagctagagc aattaggcaa gaaaaagaaa ggcatccaaa ttgaaaagga
361 tgaagtaaaa ttattcctat ttgcagttga tatgatctta tacatagaaa accctaaaga
421 ttac
//
LOCUS CATL12AA 377 bp ds-DNA MAM 28-FEB-1990
DEFINITION Cat LINE-1 DNA sequence region 2.
ACCESSION M32252
KEYWORDS L1 repetitive sequence; repetitive sequence.
SOURCE Cat DNA.
ORGANISM Felis catus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE 1 (bases 1 to 377)
AUTHORS Fanning,T. and Singer,S.
TITLE The LINE-1 DNA sequences in four mammalian orders predict proteins
that conserve homologies to retrovirus proteins
JOURNAL Nucleic Acids Res. 15, 2251-2260 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rpt 1 377 L1 repetitive element
BASE COUNT 146 a 81 c 85 g 65 t
ORIGIN
1 gcaatgacac atcggacaaa gggctagtat ccaaaacctg taaagaactc accaaactcc
61 acacccaaaa aacaaataat ccagtgaaga aatgggcaga agacatgaat agacactttc
121 ctaaacaaga ccagatggtc aacaggcaca tgaaaagatg ctcaacgtca ctccccatga
181 gggagataca aatcaaaacc acaatgagat atcacctcac accagtcaga gtggctaaaa
241 tgagaaaatc aggagatgct gatgctggag aggatgtgga gaaacgggaa ccctcttgca
301 ctgttggtgg gaatgcaagc aaactggtgc agctgctctg gaaaacagtg tggaggttcc
361 tcagaaaatt aaaaata
//
LOCUS HUMCOL4A6 1056 bp ss-mRNA PRI 09-MAR-1990
DEFINITION Human collagen type IV mRNA.
ACCESSION M33653
KEYWORDS alternative splicing; collagen type IV.
SOURCE Human tumor cell line HT-1080, cDNA to mRNA, clones HT-[125,133].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1056)
AUTHORS Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and
Prockop,D.J.
TITLE Partial characterization of a low molecular weight human collagen
that undergoes alternative splicing
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 740 collagen type IV /hgml_locus_uid="LJ0118J"
/map="13q34" /nomgen="COL4A2"
BASE COUNT 277 a 228 c 333 g 218 t
ORIGIN
1 agatccggac gctggccttg atggggcctc ttggtcttct tgggcaaatt ggcccacttg
61 gagctctagg gatgctaggc cagaaggggg agattggact gctaggccct ctaggacacg
121 atgggaaagg acctcgcggt aaactaggag acatgggccc tcttggtccc caaggccccc
181 taggaaagga tggacctcca ggagtgaagg gagaaaacgg gcacccaggg agcccaggag
241 agaaggggga aaaaggggag acaggacaag caggctcacc ggttcctggg ctgctagggc
301 cagaggggcc tcccggacct ccggggctcc aaggtgttcc tggaccaaag ggggaagcag
361 gactagacgg agcaaaagga gagaaaggct tccagggaga aaaaggagac cgtggtcccc
421 tgggactacc cggagcttca ggtttggacg gcaggcctgg gccaccgggt actccaggac
481 caattggagt tccaggccca gcgggaccaa agggcgagag gggcagcaaa ggagaccctg
541 ggatgacagg accaacggga gcagctgggc ttcctggttt acatggacca cccggggaca
601 agggaaaccg gggggagagg gggaagaaag gctctagagg gcctaaaggg gacaagggag
661 accaaggagc gcctggatta gatgccccct gcccattggg gcaagatggc ttaccagtcc
721 aaggctgctg gaacaagtga tgcctctaac cttggattgg cctgtgtgtg tgtttgtaca
781 tagaatattt atttttatac agttttcact ttttgaaaat gccagaagta tgatgcatct
841 tacagattat taaaaaagaa agaaaaaccg ttgcatattt tgtacagaaa atatcaacct
901 cttccctttt gtttacaaga tgttttgtat aagcctatgt ctctaataca ttttttgttt
961 ggtcgtaatg tctgcatgat atttgtgcac atttattaag tatcgaaggt taataaattg
1021 ttgtgtcctg gtgccaaggg gggccagcca gaactg
//
LOCUS HUMCOL4A7 629 bp ss-mRNA PRI 09-MAR-1990
DEFINITION Human collagen type IV mRNA.
ACCESSION M15524
KEYWORDS alternative splicing; collagen type IV.
SOURCE Human tumor cell line HT-1080, cDNA to mRNA, clone HT-127.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 629)
AUTHORS Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and
Prockop,D.J.
TITLE Partial characterization of a low molecular weight human collagen
that undergoes alternative splicing
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 403 collagen type IV (AA at 2)
/hgml_locus_uid="LJ0118J" /map="13q34"
/nomgen="COL4A2"
BASE COUNT 175 a 131 c 185 g 138 t
ORIGIN
1 agaggggcct cccggacctc cggggctcca aggtgttcct ggaccaaagg gggaagcagg
61 actagacgga gcaaaaggag agaaaggctt ccagggagaa aaaggagacc gtggtcccct
121 gggactaccc ggtactccag gaccaattgg agttccaggc ccagcgggac caaagggcga
181 gaggggcagc aaaggagacc ctgggatgac aggaccaacg ggagcagctg ggcttcctgg
241 tttacatgga ccacccgggg acaagggaaa ccggggggag agggggaaga aaggctctag
301 agggcctaaa ggggacaagg gagaccaagg agcgcctgga ttagatgccc cctgcccatt
361 ggggcaagat ggcttaccag tccaaggctg ctggaacaag tgatgcctct aaccttggat
421 tggcctgtgt gtgtgtttgt acatagaata tttattttta tacagttttc actttttgaa
481 aatgccagaa gtatgatgca tcttacagat tattaaaaaa gaaagaaaaa ccgttgcata
541 ttttgtacag aaaatatcaa cctcttccct tttgtttaca agatgttttg tataagccta
601 tgtctctaat acattttttg tttggtcgt
//
LOCUS HUMRB1RA 4600 bp ss-mRNA PRI 09-APR-1990
DEFINITION Human retinoblastoma associated (RB1) mRNA.
ACCESSION M33647 J02994
KEYWORDS retinoblastoma protein.
SOURCE Human retinoblast cDNA to mRNA, clone p4.7R.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4600)
AUTHORS Friend,S.H., Horowitz,J.M., Gerber,M.R., Wang,X.-F., Bogenmann,E.,
Li,F.P. and Weinberg,R.A.
TITLE Deletions of a DNA sequence in retinoblastomas and mesenchymal
tumors: Organization of the sequence and its encoded protein
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9059-9063 (1987)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence for [1] kindly
provided by S.H.Friend, 10-FEB-1988.
FEATURES from to/span description
pept 4 2790 retinoblastoma-associated protein
/hgml_locus_uid="LE0159J" /map="13q14.2"
/nomgen="RB1"
BASE COUNT 1489 a 842 c 815 g 1454 t
ORIGIN
1 gtcatgccgc ccaaaacccc ccgaaaaacg gccgccaccg ccgccgctgc cgccgcggaa
61 cccccggcac cgccgccgcc gccccctcct gaggaggacc cagagcagga cagcggcccg
121 gaggacctgc ctctcgtcag gcttgagttt gaagaaacag aagaacctga ttttactgca
181 ttatgtcaga aattaaagat accagatcat gtcagagaga gagcttggtt aacttgggag
241 aaagtttcat ctgtggatgg agtattggga ggttatattc aaaagaaaaa ggaactgtgg
301 ggaatctgta tctttattgc agcagttgac ctagatgaga tgtcgttcac ttttactgag
361 ctacagaaaa acatagaaat cagtgtccat aaattcttta acttactaaa agaaattgat
421 accagtacca aagttgataa tgctatgtca agactgttga agaagtatga tgtattgttt
481 gcactcttca gcaaattgga aaggacatgt gaacttatat atttgacaca acccagcagt
541 tcgatatcta ctgaaataaa ttctgcattg gtgctaaaag tttcttggat cacattttta
601 ttagctaaag gggaagtatt acaaatggaa gatgatctgg tgatttcatt tcagttaatg
661 ctatgtgtcc ttgactattt tattaaactc tcacctccca tgttgctcaa agaaccatat
721 aaaacagctg ttatacccat taatggttca cctcgaacac ccaggcgagg tcagaacagg
781 agtgcacgga tagcaaaaca actagaaaat gatacaagaa ttattgaagt tctctgtaaa
841 gaacatgaat gtaatataga tgaggtgaaa aatgtttatt tcaaaaattt tatacctttt
901 atgaattctc ttggacttgt aacatctaat ggacttccag aggttgaaaa tctttctaaa
961 cgatacgaag aaatttatct taaaaataaa gatctagatg caagattatt tttggatcat
1021 gataaaactc ttcagactga ttctatagac agttttgaaa cacagagaac accacgaaaa
1081 agtaaccttg atgaagaggt gaatgtaatt cctccacaca ctccagttag gactgttatg
1141 aacactatcc aacaattaat gatgatttta aattcagcaa gtgatcaacc ttcagaaaat
1201 ctgatttcct attttaacaa ctgcacagtg aatccaaaag aaagtatact gaaaagagtg
1261 aaggatatag gatacatctt taaagagaaa tttgctaaag ctgtgggaca gggttgtgtc
1321 gaaattggat cacagcgata caaacttgga gttcgcttgt attaccgagt aatggaatcc
1381 atgcttaaat cagaagaaga acgattatcc attcaaaatt ttagcaaact tctgaatgac
1441 aacatttttc atatgtcttt attggcgtgc gctcttgagg ttgtaatggc cacatatagc
1501 agaagtacat ctcagaatct tgattctgga acagatttgt ctttcccatg gattctgaat
1561 gtgcttaatt taaaagcctt tgatttttac aaagtgatcg aaagttttat caaagcagaa
1621 ggcaacttga caagagaaat gataaaacat ttagaacgat gtgaacatcg aatcatggaa
1681 tcccttgcat ggctctcaga ttcaccttta tttgatctta ttaaacaatc aaaggaccga
1741 gaaggaccaa ctgatcacct tgaatctgct tgtcctctta atcttcctct ccagaataat
1801 cacactgcag cagatatgta tctttctcct gtaagatctc caaagaaaaa aggttcaact
1861 acgcgtgtaa attctactgc aaatgcagag acacaagcaa cctcagcctt ccagacccag
1921 aagccattga aatctacctc tctttcactg ttttataaaa aagtgtatcg gctagcctat
1981 ctccggctaa atacactttg tgaacgcctt ctgtctgagc acccagaatt agaacatatc
2041 atctggaccc ttttccagca caccctgcag aatgagtatg aactcatgag agacaggcat
2101 ttggaccaaa ttatgatgtg ttccatgtat ggcatatgca aagtgaagaa tatagacctt
2161 aaattcaaaa tcattgtaac agcatacaag gatcttcctc atgctgttca ggagacattc
2221 aaacgtgttt tgatcaaaga agaggagtat gattctatta tagtattcta taactcggtc
2281 ttcatgcaga gactgaaaac aaatattttg cagtatgctt ccaccaggcc ccctaccttg
2341 tcaccaatac ctcacattcc tcgaagccct tacaagtttc ctagttcacc cttacggatt
2401 cctggaggga acatctatat ttcacccctg aagagtccat ataaaatttc agaaggtctg
2461 ccaacaccaa caaaaatgac tccaagatca agaatcttag tatcaattgg tgaatcattc
2521 gggacttctg agaagttcca gaaaataaat cagatggtat gtaacagcga ccgtgtgctc
2581 aaaagaagtg ctgaaggaag caaccctcct aaaccactga aaaaactacg ctttgatatt
2641 gaaggatcag atgaagcaga tggaagtaaa catctcccag gagagtccaa atttcagcag
2701 aaactggcag aaatgacttc tactcgaaca cgaatgcaaa agcagaaaat gaatgatagc
2761 atggatacct caaacaagga agagaaatga ggatctcagg accttggtgg acactgtgta
2821 cacctctgga ttcattgtct ctcacagatg tgactgtata actttcccag gttctgttta
2881 tggccacatt taatatcttc agctcttttt gtggatataa aatgtgcaga tgcaattgtt
2941 tgggtgattc ctaagccact tgaaatgtta gtcattgtta tttatacaag attgaaaatc
3001 ttgtgtaaat cctgccattt aaaaagttgt agcagattgt ttcctcttcc aaagtaaaat
3061 tgctgtgctt tatggatagt aagaatggcc ctagagtggg agtcctgata acccaggcct
3121 gtctgactac tttgccttct tttgtagcat ataggtgatg tttgctcttg tttttattaa
3181 tttatatgta tattttttta atttaacatg aacaccctta gaaaatgtgt cctatctatc
3241 ttccaaatgc aatttgattg actgcccatt caccaaaatt atcctgaact cttctgcaaa
3301 aatggatatt attagaaatt agaaaaaaat tactaatttt acacattaga ttttatttta
3361 ctattggaat ctgatatact gtgtgcttgt tttataaaat tttgctttta attaaataaa
3421 agctggaagc aaagtataac catatgatac tatcatacta ctgaaacaga tttcatacct
3481 cagaatgtaa aagaacttac tgattatttt cttcatccaa cttatgtttt taaatgagga
3541 ttattgatag tactcttggt ttttatacca ttcagatcac tgaatttata aagtacccat
3601 ctagtacttg aaaaagtaaa gtgttctgcc agatcttagg tatagaggac cctaacacag
3661 tatatcccaa gtgcactttc taatgtttct gggtcctgaa gaattaagat acaaattaat
3721 tttactccat aaacagactg ttaattatag gagccttaat ttttttttca tagagatttg
3781 tctaattgca tctcaaaatt attctgccct ccttaatttg ggaaggtttg tgttttctct
3841 ggaatggtac atgtcttcca tgtatctttt gaactggcaa ttgtctattt atcttttatt
3901 tttttaagtc agtatggtct aacactggca tgttcaaagc cacattattt ctagtccaaa
3961 attacaagta atcaagggtc attatgggtt aggcattaat gtttctatct gattttgtgc
4021 aaaagcttca aattaaaaca gctgcattag aaaaagaggc gcttctcccc tcccctacac
4081 ctaaaggtgt atttaaacta tcttgtgtga ttaacttatt tagagatgct gtaacttaaa
4141 ataggggata tttaaggtag cttcagctag cttttaggaa aatcactttg tctaactcag
4201 aattattttt aaaaagaaat ctggtcttgt tagaaaacaa aattttattt tgtgctcatt
4261 taagtttcaa acttactatt ttgacagtta ttttgataac aatgacacta gaaaacttga
4321 ctccatttca tcattgtttc tgcatgaata tcatacaaat cagttagttt ttaggtcaag
4381 ggcttactat ttctgggtct tttgctacta agttcacatt agaattagtg ccagaatttt
4441 aggaacttca gagatcgtgt attgagattt cttaaataat gcttcagata ttattgcttt
4501 attgcttttt tgtattggtt aaaactgtac atttaaaatt gctatgttac tattttctac
4561 aattaatagt ttgtctattt taaaataaat tagttgttag
//
LOCUS MV1MOBPA 2436 bp ds-DNA BCT 15-DEC-1989
DEFINITION Plasmid pMV158 mobilization protein gene.
ACCESSION M28538
KEYWORDS mobilization.
SOURCE Plasmid pMV158 from Streptococcus agalactiae, DNA.
ORGANISM Plasmid pMV158
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 2436)
AUTHORS Priebe,S.D. and Lacks,S.A.
TITLE Region of the streptococcal plasmid pMV158 required for conjugative
mobilization
JOURNAL J. Bacteriol. 171, 4778-4784 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence for [1] kindly
provided by S.A.Lacks, 18-JAN-1990.
FEATURES from to/span description
pept 628 2112 mobilization peptide
BASE COUNT 824 a 393 c 565 g 654 t
ORIGIN
1 agggatttct aaatcgttaa gggatcaact ttgggagaga gttcaaaatt gatccttttt
61 ttataacagg aattcaaatc tttttgttcc attaaagggc gcgattgctg aataaaagat
121 acgagagacc tctcttgtat cttttttatt ttgagtggtt ttgtccgtta cactagaaaa
181 ccgaaagaca ataaaaattt tattcttgct gagtctggct ttcggtaagc tagacaaaac
241 ggacaaaata aaaatcagca agggacaggt agtatttttt gagtgatctt ctcaaaaaat
301 actacctgtc ccttgctgat ttttaaacga gcacgagagc aaaacccccc tttgctgagg
361 tggcagaggg caggtttttt tgtttctttt ttctcgtaaa aaaaagaaag gtcttaaagg
421 ttttatggtt ttggtcggca ctgccgacag cctcgcagag cacacacttt atgaatataa
481 agtatagtgt gttatacttt acatggaagt tataccgaaa ttgtgctaaa ctaccagtta
541 agatgttgct gattaagacg agcaatagac tgtgtcggct cgtctgtctt tcggtaatcg
601 aaagacaagt cttaaggagg gaaatctatg agttacatgg ttgcaagaat gcagaagatg
661 aaagctggca atttgggagg agcttttaag cataatgaac gtgtttttga gacgcactca
721 aataaagata taaacccgag caggtcacac ctcaactacg agttgacgga tcgtgaccgc
781 tcggtgtcgt atgaaaagca gattaaagac tatgtgaacg aaaataaggt ttctaatcga
841 gcaatccgaa aagatgcagt tctatgtgat gagtggatta tcacatctga taaagatttt
901 tttgaaaagt tggatgagga acagacgaga acgttttttg aaacggctaa aaattatttt
961 gcagagaact atggcgagtc aaatattgct tatgcgagtg ttcacttgga tgaaagcacc
1021 cctcacatgc acatgggagt agtacctttt gagaacggta aattgtcatc aaaagcaatg
1081 tttgaccgtg aggagctaaa acatatccaa gaggacttgc ctagatacat gagcgaccac
1141 ggttttgagt tggaacgtgg caagctgaat agtgaagcta agcataagac tgtagctgaa
1201 ttcaagaggg caatggctga tatggaactc aaagaggaac ttcttgaaaa atatcatgca
1261 ccgctttttg ttgatgagag aacaggcgag ttgaacaatg acacggaagc tttttggcat
1321 gaaaaagagt ttgctgatat gtttgaagtt caatctccga tacgtgaaac aactaaccaa
1381 gaaaaaatgg actggttaag aaaacagtac caagaagagc tgaaaaaact agaatcgtct
1441 aaaaagcccc tagaagacga tttaagccat ttagaagagt tgcttgataa aaagaccaag
1501 gaatatatta aaatcgattc tgaggcctct gagagggcct cagagctatc taaagccgag
1561 ggatatataa ataccctaga aaatcattcg aagagcttag aagcgaaaat agagtgttta
1621 gagagtgata atctacaatt ggaaaaacaa aaggcgacaa aactcgaagc gaaagcgttg
1681 aacgagagtg agttgcgaga actaaagcct aagaagaatt ttctaggaaa agagcattat
1741 gagttaagtc ctgaacaatt tgaagggttg aaggcagaag tttatcgtag tagaactcta
1801 ttgcaccaca aagatattga actggagcaa gcaaaacgtc aagtatctct gagagcctct
1861 aaaaactatt ttacagctag tttagagcga gctaaggaaa aagctaaagg tgagagtata
1921 gaccgtctta aaagcgaaat aaagcgacta aaaaacgaaa attcaatttt acgtcagcaa
1981 aatgacaaga tgctagggaa attaagagag ttaatgcctg ataaagcctt taagaatttg
2041 ttatcagaac ttaaggcgat taagccaatc gtgaatataa ttaaaaaggc tattgaaaag
2101 agcttgttct gagcgattta tgccgtgaaa gctatttgac aataagcagt gacagagtac
2161 gctaggacgt gccgagccga aaggctttag cgtttcggac ggacacggac aaaggacggc
2221 agtcactggt tacttgttgt caaatagacc atggaataaa aagcgtcaaa agtcttgagt
2281 ggatgatacc ctatggtact ctattcgcct tttgactttt ttgctataat ttaagtgtcg
2341 ccagttcttc cgtcaggtaa tgcgaactta gactggaggt gagcgttgtg aagacattcc
2401 tcgagcttgt ctttgtccct tttgtggttg gcgttg
//
LOCUS RATIGFI1 958 bp ss-mRNA ROD 15-JUN-1988
DEFINITION Rat mRNA (clone IGF1AB1) for insulin-like growth factor I.
ACCESSION X06107 Y00429 M32260
KEYWORDS insulin-like growth factor I.
SOURCE rat (Rattus norvegicus).
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 958; enum. 1 to 958)
AUTHORS Rotwein,P.
JOURNAL Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA.
STANDARD simple automatic
REFERENCE 2 (bases 1 to 958)
AUTHORS Shimatsu,A. and Rotwein,P.
TITLE Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing
Within the 5' Untranslated Region
JOURNAL Nucleic Acids Res. 15, 7196-7196 (1987)
STANDARD simple automatic
COMMENT Another IGF-I mRNA of rat liver differing in the 5' UT-region is
described in $<X06118>$
*source: tissue=liver; strain=Sprague-Dawley;
EMBL features not translated to GenBank features:
key from to description
SITE 1 258 5' UT-region
SITE 736 958 3' UT-region
FEATURES from to/span description
pept 259 738 IGF-I (AA 1-159)
BASE COUNT 260 a 246 c 209 g 243 t
ORIGIN
1 cgaatgttcc cccagctgtt tcctgtctac agtgtctgtg ttttgtagat aaatgtgagg
61 attttctcta aatccctctt ctgcttgcta aatctcactg tcgctgctaa attcagacca
121 gatagagcct gcgcaatcga aataaagtcc tcaaaattga aatgtgactt tgctctaaca
181 tctcccatct ctctggattc tttttgcctc attattcctg cccaccaatt catttccaga
241 ctttgtactt cagaagcgat ggggaaaatc agcagtcttc caactcaatt atttaagatc
301 tgcctctgtg acttcttgaa gataaagata cacatcatgt cgtcttcaca tctcttctac
361 ctggcactct gcttgctcac ctttaccagc tcggccacag ccggaccaga gaccctttgc
421 ggggctgagc tggtggacgc tcttcagttc gtgtgtggac caaggggctt ttacttcaac
481 aagcccacag gctatggctc cagcattcgg agggcaccac agacgggcat tgtggatgag
541 tgttgcttcc ggagctgtga tctgaggagg ctggagatgt actgtgctcc gctgaagcct
601 acaaagtcag ctcgttccat ccgggcccag cgccacactg acatgcccaa gactcagaag
661 tcccagcccc tatcgacaca caagaaaagg aagctgcaaa ggagaaggaa aggaagtaca
721 cttgaagaac acaagtagag gaagtgcagg aaacaagacc tacagaatgt aggaggagcc
781 tcccgaggaa cagaaaatgc cacgtcaccg caagatcctt tgctgcttga gcaacctgca
841 aaacatcgga acacctgcca aatatcaata atgagttcaa taccatttca gagatgggca
901 tttccctcaa tgaaatacac aagtaaacat tccgacattg tctttaggag tgtttgtt
//
LOCUS RATIGFI2 798 bp ss-mRNA ROD 15-JUN-1988
DEFINITION Rat mRNA (clone IGF1AB2) for insulin-like growth factor I.
ACCESSION X06108 Y00429 M32339
KEYWORDS insulin-like growth factor I.
SOURCE rat (Rattus norvegicus).
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 798)
AUTHORS Rotwein,P.
JOURNAL Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA.
STANDARD simple automatic
REFERENCE 2 (bases 1 to 798)
AUTHORS Shimatsu,A. and Rotwein,P.
TITLE Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing
Within the 5' Untranslated Region
JOURNAL Nucleic Acids Res. 15, 7196-7196 (1987)
STANDARD simple automatic
COMMENT Another IGF-I mRNA of rat liver differing in the 5' UT-region is
described in $<XX6107>$.. *source: tissue=liver;
strain=Sprague-Dawley;
EMBL features not translated to GenBank features:
key from to description
SITE 1 72 5' UT-region
SITE 553 798 3' UT-region
FEATURES from to/span description
pept 73 552 IGF-I (AA 1-159)
BASE COUNT 238 a 196 c 183 g 181 t
ORIGIN
1 gaatgttccc ccagctgttt cctgtctaca gtgtctgtgt tttgtagata aatactttgt
61 acttcagaag cgatggggaa aatcagcagt cttccaactc aattatttaa gatctgcctc
121 tgtgacttct tgaagataaa gatacacatc atgtcgtctt cacatctctt ctacctggca
181 ctctgcttgc tcacctttac cagctcggcc acagccggac cagagaccct ttgcggggct
241 gagctggtgg acgctcttca gttcgtgtgt ggaccaaggg gcttttactt caacaagccc
301 acaggctatg gctccagcat tcggagggca ccacagacgg gcattgtgga tgagtgttgc
361 ttccggagct gtgatctgag gaggctggag atgtactgtg ctccgctgaa gcctacaaag
421 tcagctcgtt ccatccgggc ccagcgccac actgacatgc ccaagactca gaagtcccag
481 cccctatcga cacacaagaa aaggaagctg caaaggagaa ggaaaggaag tacacttgaa
541 gaacacaagt agaggaagtg caggaaacaa gacctacaga atgtaggagg agcctcccga
601 ggaacagaaa atgccacgtc accgcaagat cctttgctgc ttgagcaacc tgcaaaacat
661 cggaacacct gccaaatatc aataatgagt tcaataccat ttcagagatg ggcatttccc
721 tcaatgaaat acacaagtaa acattccgac attgtcttta ggagtgtttg ttaaaaaaaa
781 aaaaaaaaaa acaaaaac
//
LOCUS VSVNJLA 590 bp ss-RNA VRL 14-MAR-1990
DEFINITION Vesicular stomatitis virus L gene.
ACCESSION M17436
KEYWORDS L gene.
SOURCE Vesicular stomatitis virus (New Jersey serotype; Ogden strain) cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 590)
AUTHORS Luk,D., Masters,P.S., Gill,D.S. and Banerjee,A.K.
TITLE Intergenic sequences of the vesicular stomatitis virus genome (New
Jersey serotype): Evidence for two transcription initiation sites
within the L gene
JOURNAL Virology 160, 88-94 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 66 > 590 L protein
BASE COUNT 189 a 110 c 117 g 174 t
ORIGIN
1 atggctcatt tccgttaata tatgaaaaaa actggaatag aagcataact ccaacaggaa
61 tcaaaatgga tttcgatcta atagaagatt ctgctaactg ggaagatgat gagtcggact
121 tttttttgcg agacattcta tctcaagaag atcagatgtc ttacttaaac accgcagact
181 ataatttgaa ttctccactc atttccgatg acatggttta tataattaaa agaatgaatc
241 acgaggaggt tccccccatt tggagatcta aggagtggga ttctcctctt gatatgctta
301 gaggttgtca agctcaaccc atgtctcacc aagagatgca caattggttt ggaacttgga
361 ttcaaaatgt tcagcatgat tccgcccaac ccttcacatt cttaaaagag gtggacaaag
421 agtcagaaat gacctatgac ttggtctcca cttttctgaa ggggtgggtg gggaaagatt
481 acccattcaa gtcaaagaac aaagaaattg atagtatggc attagtcggt cctctttgtc
541 aaaaattcct tgatttgcat aaaatcactc tgatattaaa tgcagtttct
//GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS LEIKPCAA 234 bp ss-mRNA ORG 28-MAR-1990
DEFINITION L.tarentolae kinetoplast ATPase 6 edited mRNA, partial cds.
ACCESSION M33229
KEYWORDS ATPase 6.
SOURCE L.tarentolae kinetoplast (strain University of California) cDNA to
mRNA.
ORGANISM Kinetoplast Leishmania tarentolae
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Leishmania tarentolae.
REFERENCE 1 (bases 1 to 234)
AUTHORS Bhat,J.G., Koslowsky,D.J., Feagin,J.E., Smiley,B.L. and Kenneth,S.
TITLE An extensively edited mitochondrial transcript in kinetoplastids
encodes a protein homologous to ATPase subunit 6
JOURNAL Cell (1990) In press
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by J.G.Bhat
23-MAR-1990.
Each uridine of the mRNA within the ranges specified ("t" in
GenBank conventions) are the result of post-transcriptional
editing. The "u addition"s and "u deletions" are alterations of
the maxicircle DNA.
FEATURES from to/span description
pept 32 > 234 ATPase 6
site 33 48 post-transcriptional editing (u addition)
site 52 64 post-transcriptional editing (u addition)
site 70 77 post-transcriptional editing (u addition)
site 28 29 post-transcriptional editing (u deletion)
site 80 98 post-transcriptional editing (u addition)
site 31 32 post-transcriptional editing (u deletion)
site 108 127 post-transcriptional editing (u addition)
site 130 136 post-transcriptional editing (u addition)
site 139 212 post-transcriptional editing (u addition)
site 66 67 post-transcriptional editing (u deletion)
site 129 130 post-transcriptional editing (u deletion)
BASE COUNT 58 a 7 c 35 g 134 t
ORIGIN
1 tatataaaaa attatatcag attaagaaaa aatgtttgtt ttttttgttt gtgatttagt
61 aattatgcgt attttattat gtttttgtta tagtgtttga tccagaatta tttttgtgtt
121 attttataat gttttttata tttgtaccga gttaatgttt tgtatttttg atgtttattt
181 atttgttggt ttatgtatgt ttatttgttt atgattcgtg ttatttaatt ttta
//
LOCUS TRBKPEATA 828 bp ss-mRNA ORG 28-MAR-1990
DEFINITION T.brucei kinetoplast ATPase 6 edited mRNA, complete cds.
ACCESSION M33228
KEYWORDS ATPase 6.
SOURCE T.brucei kinetoplast (strain EATRO 164) procyclic Istar1 serodeme
cell line [isolated from Bos Taurus blood], cDNA to mRNA.
ORGANISM Kinetoplast Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Trypanosoma brucei.
REFERENCE 1 (bases 1 to 828)
AUTHORS Bhat,J.G., Koslowsky,D.J., Feagin,J.E., Smiley,B.L. and Kenneth,S.
TITLE An extensively edited mitochondrial transcript in kinetoplastids
encodes a protein homologous to ATPase subunit 6
JOURNAL Cell (1990) In press
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by J.G.Bhat
23-MAR-1990.
Each uridine of the mRNA within the ranges specified ("t" in
GenBank conventions) are the result of post-transcriptional
editing. The "u addition"s and "u deletions" are alterations of
the maxicircle DNA.
FEATURES from to/span description
pept 34 678 ATPase 6
site 33 59 post-transcriptional editing (u addition)
site 67 132 post-transcriptional editing (u addition)
site 135 170 post-transcriptional editing (u addition)
site 172 173 post-transcriptional editing (u deletion)
site 177 199 post-transcriptional editing (u addition)
site 201 202 post-transcriptional editing (u deletion)
site 205 245 post-transcriptional editing (u addition)
site 253 256 post-transcriptional editing (u addition)
site 258 259 post-transcriptional editing (u deletion)
site 261 267 post-transcriptional editing (u addition)
site 274 275 post-transcriptional editing (u deletion)
site 276 278 post-transcriptional editing (u addition)
site 280 281 post-transcriptional editing (u deletion)
site 282 283 post-transcriptional editing (u addition)
site 286 370 post-transcriptional editing (u addition)
site 374 380 post-transcriptional editing (u addition)
site 383 384 post-transcriptional editing (u deletion)
site 384 402 post-transcriptional editing (u addition)
site 407 408 post-transcriptional editing (u deletion)
site 408 473 post-transcriptional editing (u addition)
site 480 498 post-transcriptional editing (u addition)
site 505 528 post-transcriptional editing (u addition)
site 531 617 post-transcriptional editing (u addition)
site 621 631 post-transcriptional editing (u addition)
site 635 642 post-transcriptional editing (u addition)
site 646 647 post-transcriptional editing (u addition)
site 650 651 post-transcriptional editing (u deletion)
site 654 657 post-transcriptional editing (u addition)
site 660 661 post-transcriptional editing (u deletion)
site 662 699 post-transcriptional editing (u addition)
site 702 703 post-transcriptional editing (u deletion)
site 703 775 post-transcriptional editing (u addition)
site 776 777 post-transcriptional editing (u deletion)
site 778 779 post-transcriptional editing (u addition)
site 782 783 post-transcriptional editing (u addition)
site 785 786 post-transcriptional editing (u deletion)
BASE COUNT 156 a 21 c 137 g 514 t
ORIGIN
1 aaaaataagt attttgatat tattaaagta aatatgtttt tatttttttt ttgtgattta
61 ttttggttgc gtttgttatt atgtatgtat tattgtgtat gatctaggtt atgttttatt
121 gtgtatttta attgtttaat gttgattttt gattttttat tattttgttt gtttgatttg
181 tatttgtttg ttggtttgtg tttgttttta ttgttgtggt ttatgttgtt taatttatat
241 agtttaattt tgtattattg tattacttat ttgaatttgt atttgttgtt ttgtattgtt
301 tttttattgt atattgcatt tttatttttg ttttgttttt tatgtgattt ttttttgttt
361 aataatttgt tagttggtga tagttttatg gatgtttttt ttattcgttt tttgttgtgt
421 tttttagagt gtttttcttt gttgtgtcgt tgtttgtcga cgtttttgcg tttgttttgt
481 aatttattat catcccattt tttattgttg atgttttttg atttttttta ttttattttt
541 gttttttttt ttttatggtg ttttttgtta ttgatttatt ttatttattt ttgtgttttg
601 tttttgttta ttattttatg tgtttttata tttgttggat ttatttgccg ccatattaca
661 gttatttatt ttttgtaata tgattttgca gttgataatg gattttttgt tgtttttgtt
721 gtttgtttag ttttgtattt gatttttgat agttattata ttgttgttga aatttggttt
781 gttattggag ttatagaata agatcaaata agttaataat aaaaaaaa
//GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS RATCA2 446 bp ds-DNA ROD 13-NOV-1989
DEFINITION Rat calcium/calmodulin-dependent protein kinase II alpha-subunit
gene, 5' end.
ACCESSION M29699
KEYWORDS calcium/calmodulin-dependent protein kinase.
SOURCE Rat DNA, clone c4.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 446)
AUTHORS Sunyer,T. and Sahyoun,N.
TITLE Sequence analysis and DNA-protein interactions within the 5'
flanking region of the Ca-2+/calmodulin-dependent protein kinase II
alpha-subunit gene
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 278-282 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Sunyer, 03-NOV--1989, for release after publication.
FEATURES from to/span description
pept 429 > 446 calcium/calmodulin-dependent protein kinase II
mRNA 280 > 446 calcium/calmodulin-dependent protein kinase II
mRNA (alt.)
mRNA 282 > 446 calcium/calmodulin-dependent protein kinase II
mRNA (alt.)
signal 114 120 tata box
binding 201 205 Sp1 binding site
BASE COUNT 96 a 126 c 134 g 90 t
ORIGIN
1 gatcacttgt ggactaagtt tgttcacatc ccgttctcca accccctcag tacatcaccc
61 tgggagaaca aggtccactt gcttctgggc ccacacagtc ctgcagtatt gtgtatataa
121 ggccagggca acggaggagc aggttttgaa gtgaaaggca ggcaggtgtt ggggaggcag
181 ttaccggggc aacgggaaca gggcgtttcg gaggtggttg ccatggggac ctggatgctg
241 acgaaggctc gcgagctgtg agcagccaca gtgccctgct cagaagcccc aagctcgtca
301 atcaagctgg ttctccattt gcactcagga gcacgggcag gcgagtggcc cctagttctg
361 ggagcagagt atcagcatcc cagtcctagt cccgagccta aagcctcgcc tgcctgccca
421 gtgccaggat ggctaccatc acctgc
//GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS ECOTRWA 76 bp ss-tRNA RNA 15-SEP-1989
DEFINITION E.coli Trp-tRNA.
ACCESSION M24301
KEYWORDS transfer RNA-Trp.
SOURCE E.coli tRNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Eisenberg,S.P., Yarus,M. and Soll,L.
TITLE The effect of an Escherichia coli regulatory mutation on transfer
RNA structure
JOURNAL J. Mol. Biol. 135, 111-126 (1979)
STANDARD simple staff_entry
COMMENT In the TrpX mutation the base at position 37 ("a") is not modified.
In the "uga" suppressor mutation base 24 "g" is changed to "a".
FEATURES from to/span description
tRNA 1 76 Phe-tRNA
modified 16 16 d
modified 17 17 d
modified 20 20 d
modified 32 32 cm
anticdn 34 36 Phe-tRNA anticodon cca
modified 37 37 ms2i6a
modified 46 46 gm7
modified 55 55 psi
BASE COUNT 14 a 21 c 24 g 17 t
ORIGIN 5' end of mature tRNA.
1 aggggcgtag ttcaattggt agagcaccgg tctccaaaac cgggtgttgg gagttcgagt
61 ctctccgccc ctgcca
//
LOCUS ECOTRWSUP 76 bp ss-tRNA RNA 15-SEP-1989
DEFINITION E.coli Trp-tRNA suppressor mutant.
ACCESSION M25066
KEYWORDS transfer RNA-Trp suppressor.
SOURCE E.coli tRNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Eisenberg,S.P., Yarus,M. and Soll,L.
TITLE The effect of an Escherichia coli regulatory mutation on transfer
RNA structure
JOURNAL J. Mol. Biol. 135, 111-126 (1979)
STANDARD simple staff_entry
COMMENT In the TrpX mutation the base at position 37 ("a") is not modified.
In Trp-tRNA base 24 "a" is changed to "g".
FEATURES from to/span description
tRNA 1 76 Phe-tRNA
modified 16 16 d
modified 17 17 d
modified 20 20 d
modified 32 32 cm
anticdn 34 36 Phe-tRNA anticodon cca
modified 37 37 ms2i6a
modified 46 46 gm7
modified 55 55 psi
BASE COUNT 15 a 21 c 23 g 17 t
ORIGIN 5' end of mature tRNA.
1 aggggcgtag ttcaattggt agaacaccgg tctccaaaac cgggtgttgg gagttcgagt
61 ctctccgccc ctgcca
//
LOCUS ECOTRWX 76 bp ss-tRNA RNA 15-SEP-1989
DEFINITION E.coli Trp-tRNA TrpX mutation.
ACCESSION M25067
KEYWORDS transfer RNA-TrpX.
SOURCE E.coli tRNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Eisenberg,S.P., Yarus,M. and Soll,L.
TITLE The effect of an Escherichia coli regulatory mutation on transfer
RNA structure
JOURNAL J. Mol. Biol. 135, 111-126 (1979)
STANDARD simple staff_entry
COMMENT In the Trp-tRNA the base at position 37 ("a") is modified, whereas
it is not modified in the TrpX mutation. In the suppressor
Trp-tRNA base 24 "g" is changed to "a".
FEATURES from to/span description
tRNA 1 76 Phe-tRNA
modified 16 16 D
modified 17 17 D
modified 20 20 D
modified 32 32 Cm
anticdn 34 36 Phe-tRNA anticodon cca
modified 46 46 Gm7
modified 55 55 psi
BASE COUNT 14 a 21 c 24 g 17 t
ORIGIN 5' end of mature tRNA.
1 aggggcgtag ttcaattggt agagcaccgg tctccaaaac cgggtgttgg gagttcgagt
61 ctctccgccc ctgcca
//
LOCUS HCYSA 313 bp ds-DNA SYN 15-MAR-1990
DEFINITION Human cystatin A gene.
ACCESSION D00404
KEYWORDS cystatin A; cysteine-proteinase inhibitor.
SOURCE Artificial gene.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 313)
AUTHORS Kaji,H., Kumagai,I., Takeda,A., Miura,K. and Samejima,T.
TITLE Studies on chemical synthesis of human cystatin A gene and its
expression in Escherichia coli
JOURNAL J. Biochem. 105, 143-147 (1989)
STANDARD simple staff_entry
COMMENT Submitted in computer readable form by H. Kaji on 14-Apr-1989. The
expression plasmid for cystatin A was constructed by introducing
the synthetic gene downstream of the tac promoter of an E. coli
plasmid which is a derivative of pKK223-3 and the gene was
expressed in E. coli JM109 without IPTG-induction.
FEATURES from to/span description
pept 6 302 cystatin A
refnumbr 6 8 numbered codon 1 in [1]
BASE COUNT 96 a 74 c 83 g 60 t
ORIGIN
1 aattcatgat cccgggaggc ttaagcgaag ccaagccagc tacaccggag atccaggaga
61 tcgtcgacaa ggttaagccg cagctcgagg agaagaccaa cgagacatat ggtaagcttg
121 aggccgttca gtataagacc caggttgttg ccggcaccaa ctactacatc aaagttcgtg
181 ctggcgacaa caaatacatg catctgaagg tgttcaaaag cttacctggc cagaacgaag
241 atctggtact gactgggtac caggtcgaca agaacaagga cgacgagtta actggcttct
301 gatgaattct gca
//
LOCUS HUMPRG 1182 bp ss-mRNA PRI 15-MAR-1990
DEFINITION Human PRG gene (alt. hpg) encoding hemopoietic proteoglycan core
protein, complete cds.
ACCESSION X17042 M25538
KEYWORDS hemopoietic PG core protein.
SOURCE Human adult hematopoietic myeloid cell, cDNA to mRNA, clone D-D2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1182)
AUTHORS Stellrecht,C.M. and Saunders,G.F.
TITLE Nucleotide sequence of a cDNA encoding a hemopoietic proteoglycan
core protein
JOURNAL Nucleic Acids Res. 17, 7523-7523 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
provided by C.M.Stellrecht, 27-JUL-1989.
[1] Author Address: C.M.Stellrecht, University of Texas,
M.D.Anderson Cancer Center, Department of Biochemistry and
Molecular Biology, Box 117, 1515 Holcomb Blvd., Houston, TX, 77054.
FEATURES from to/span description
pept 25 501 proteoglycan core protein precursor
/nomgen="PRG" /map="10"
/hgml_locus_uid="LU0037W"
sigp 25 105 proteoglycan core protein signal peptide
matp 106 498 proteoglycan core protein
signal 279 315 CS substitution signal
rpt 304 357 Ser-Gly repeat
BASE COUNT 359 a 223 c 224 g 376 t
ORIGIN Chromosome 10q22.1.
1 gaattccgct agactaagtt ggtcatgatg cagaagctac tcaaatgcag tcggcttgtc
61 ctggctcttg ccctcatcct ggttctggaa tcctcagttc aaggttatcc tacgcagaga
121 gccaggtacc aatgggtgcg ctgcaatcca gacagtaatt ctgcaaactg ccttgaagaa
181 aaaggaccaa tgttcgaact acttccaggt gaatccaaca agatcccccg tctgaggact
241 gacctttttc caaagacgag aatccaggac ttgaatcgta tcttcccact ttctgaggac
301 tactctggat caggcttcgg ctccggctcc ggctctggat caggatctgg gagtggcttc
361 ctaacggaaa tggaacagga ttaccaacta gtagacgaaa gtgatgcttt ccatgacaac
421 cttaggtctc ttgacaggaa tctgccctca gacagccagg acttgggtca acatggatta
481 gaagaggatt ttatgttata aaagaggatt ttcccacctt gacaccaggc aatgtagtta
541 gcatatttta tgtaccatgg ttatatgatt aatcttggga caaagaattt tatagaaatt
601 tttaaacatc tgaaaaagaa gcttaagttt tatcatcctt ttttttctca tgaattctta
661 aaggattatg ctttaatgct gttatctatc ttattgttct tgaaaatacc tgcatttttt
721 ggtatcatgt tcaaccaaca tcattatgaa attaattaga ttcccatggc cataaaatgg
781 ctttaaagaa tatatatata tttttaaagt agcttgagaa gcaaattggc aggtaatatt
841 tcatacctaa attaagactc tgacttggat tgtgaattat aatgatatgc cccttttctt
901 ataaaaacaa aaaaaaaata atgaaacaca gtgaatttgt agagtggggg tatttgacat
961 attttacagg gtggagtgta ctatatacta ttacctttga atgtgtttgc agagctagtg
1021 gatgtgtttg tctacaagta tgattgctgt tacataacac cccaaattaa ctcccaaatt
1081 aaaacacagt tgtgctgtca atacctcata ctgctttacc tttttttcct ggatatctgt
1141 gtattttcaa atgttactat atattaaagc agaaatataa cc
//
LOCUS SUSMTA 4191 bp ds-DNA INV 15-MAR-1990
DEFINITION S.purpuratus metallothionein-A (SpMTA) gene, complete cds.
ACCESSION M30606
KEYWORDS metal binding protein; metallothionein; metallothionein-A.
SOURCE S.purpuratus sperm DNA, clone lambda-MT206.
ORGANISM Strongylocentrotus purpuratus
Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa;
Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE 1 (bases 1 to 4191)
AUTHORS Harlow,P., Watkins,E., Thornton,R.D. and Nemer,M.
TITLE Structure of an ectodermally expressed sea urchin metallothionein
gene and characterization of its metal-responsive region
JOURNAL Mol. Cell. Biol. 9, 5445-5455 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Nemer, 11-DEC-1989.
FEATURES from to/span description
pept 757 787 metallothionein-A, exon 1
1908 1994 metallothionein-A, exon 2
3080 3156 metallothionein-A, exon 3
pre-msg 679 4154 MTA mRNA and introns
IVS 788 1907 MTA, intron A
IVS 1995 3079 MTA, intron B
IVS 3189 3738 MTA, intron C
signal 652 656 TATA box
BASE COUNT 1194 a 758 c 821 g 1391 t 27 others
ORIGIN
1 gtctacgtaa tttagtatgg aaaaataacc attttatcat acaggttgat catcaatata
61 tcattatgca gtgaacacca tgaatatttc gaaatggagg gatagttgca tttctggcgg
121 agatatccgg gtcgggaatc atctactttt attccccctc cccatttttt tcttcttctc
181 cttgttcttc ttctttttcg tttcttaatc atggtattaa tcatttgcaa atcaattaca
241 tgcctttata tatatataat aatcatggtg ctacatgatt ttgaaatact ttatggaaac
301 cttacgtttt accagacctt gacacaaagt tgacagtgtt acctttatca ttcacttccg
361 tactgtttca gcacatgagc aacactttgt tctttgaggg caaacgtttt gggggagggg
421 agggcatccc acggcgcttt aaaaacaaaa aggatttttg cttcccttcc atatcccgta
481 taatttatcc gacacgagcc tagtcatcac cattcctgat agaagcgtgt gcaaaactag
541 aaaggcccac ctagatcgag ctcctcaggc accgctaaca ggttgcctag ctcgtgccta
601 aacgtggtgt gtttacctat ttgtttgcac acggatattg taatgaaaac atataaaacc
661 ccgaatttca ccagtttttg tcacaagctg aagcgtttga gtgtgattta cattaccttc
721 tgaagaaaca tacaactcga gttcaatttc atcaccatgc ctgatgtcaa gtgtgtctgc
781 tgcaagggta agcaaaaaat cggaatcaat gttctcatta aaataccgag ttacagaata
841 ccgtgtacgt acggtgtagc gtttctagaa acaaaccgct cgatgttacg tgcatacgat
901 gatacgatac agccatggcc atgaccatga gaggcccccg atcacgacac acgcagctag
961 ctagaccgaa ttttccgtct ctttcgttcg ttaagcctct tatttgggta gggtcggctc
1021 tctctttaga gacggaagcc gactagacct agggcgcgtt catgcttgat tttgaaggca
1081 gaatgagcga tttgtaacgc tcatcgtaaa cgctcatcgt aatcgcagtt gcacggtgca
1141 ctgttcatgc tcacttctcc aggcaattct ccttgggtgc cttcagctgg attcgcaaac
1201 ctgtgtgcgc atgctacgag agcccataga ggtcagtgcg ttgaccccat ttaaacgcgt
1261 ttacaaattg taaacgcatc tttttgtgag tttacgatcg gcgatcatga atcggctttt
1321 tcggtgattc tacaaaatca agcatgaacg cgcccctata ctcattccct tgctaacacc
1381 aaggtgcatt tatttttann nntatnccna naannncccc aaatgtcatt ttacctcgtc
1441 gcatgttata ttgtaaatta ggcctaggct ataataaatc ggttccactt accaacttga
1501 ctgaaataat ttaatggtcc ctttttagga aattatgggt ctgaaatcga aatccatgtt
1561 gaaaatcttg caaaacgtat tttaactctc gctgcgcacg acgtaatcta aatctcatcc
1621 cttctagtct acaccgaagg caaaatgatt atgaaggaaa aactgtgcgc gagcttcacg
1681 ttcgttatca cagtttttct tcataatcat tttcatgttt tcttgcccgt ttcttcatac
1741 ttgttacatc aacagttatg tcaataacta agtttttacc aacaagcact gctggatttg
1801 tggtttcaaa ttgcaagatt gttattactt ttttaaaatt ttatatcatt atatagacta
1861 tatatatata tatatatata tatttttttt tttaattctg aacacagagg gcaaggagtg
1921 tgcctgcttt ggacaagatt gctgtaaaac tggtgaatgc tgcaaggatg gaacctgctg
1981 tggaatatgc acaagtaagc ctttcatcat tttcagtgaa ccctttaaaa ttaaagcagg
2041 cgcgtacgag gattttttaa gggggggggg ggggggttta acatttataa atcgggccga
2101 aaattctgca acgactcagc cgaccgctga ataagtgnng gggggggggg gggtagtaag
2161 aggacacttt ttattttttt ttggtctcga aattagaaaa tttgacattt tgctccgttg
2221 gggggggggg ggggaaggtc agctttttta ggtcagccat gggaattttt ttttttaata
2281 aaaaaaaaaa acatccagan nnnggggggg ggtttttaag gaaatttagg gggggggttt
2341 acacacaaaa aattaccaaa gggtgggggt ttaaccnnnn nnnncccctg tgtaagcncc
2401 tgcatcccag taattaaggc ttttttatac taaactgttt acacaaggct tttaaaagta
2461 tttatattca tagaaaaaaa ttgtcccaat tgctcatgca tttttttaat ttctttttac
2521 tctcaccttt aaatgggtgc aggcttaaat gtcaggctaa ctggcaatga ttatttgtgc
2581 tcattatcaa tattctagtg tatccatggt tttgttatgg tcgggggggg gggatctgac
2641 aaaaaatatt gacagccaag aatataggtc tatatgattt gtcagggaga tatgaatttc
2701 acatctccta ggttttattg ttgttgtcac caattaaatt attttaaaaa tcactgacaa
2761 gcttggaaaa ggattttgaa aataatttaa aacatgttca tctcattttc ctggtgaatt
2821 ttgggtctga aaagcgctat ataagaacta gctattatta tatatatata tatatatata
2881 taattagatt ttttattttt ttggaggggg gggggggggn ctttttccat gtatgcctaa
2941 atgaggggcg cagaccaatt gtgcccttta tatatcctgc tgtgtagtct gcatgaaaga
3001 gaaaatgcag ttataccaac aatagaatag agacatttca atttgatgtt attacttggt
3061 taattttatt tgtccacaga cgctgcgtgc aagtgtgcta atggctgcaa gtgtggaagc
3121 ggctgctcat gcactgaggg gaactgtgca tgctagacga gcttggccct ttgctatagg
3181 gatggcaggt gagattttat ttattttcat ggctacatag taaaaattgt ttccaggcaa
3241 aaacgttagc tagttatggg atatgacagg gtctgaaatg taagactgta gatttgcact
3301 gcataaagct gtgccatcat ttttttttaa atagaatttt ccgagagagc atcaaaatta
3361 atgtccaaat gtcgaaagcc aactgaatat gttaaatatt ggctgagatg tgaccatctg
3421 aataaatagg ttccctatag gaactcagtg ttattttgct gtggttcata acatcacgna
3481 ccagagcaaa ataacactga gttcctacac aaaatctatt tattacatta ctgcattgcg
3541 atatacaaaa aaggtaaaat tagaaaatct ttgcaagtta tggcaatatg tatagacctt
3601 ccgtgtcttt catgactttt gtatattttt ttcaacaacc ggaacaaatt cattaaccat
3661 ttctgtgttt tcatactttt ttgtaaaaat attttcaagc attttgtttt tctctccgtg
3721 tctgtttctc ttctacagat tgagcataca gctgatgaag tcactcagcg catctgccaa
3781 catgctccaa gcagacctga ctttgtatat actttcgacc cgatgtatta tattttttgt
3841 tgtaagatag ttgtatatta agttaattta atctttattt ttttctgaaa atatgagact
3901 aattggcatt aataggacta cttaaacaac aattatttat taaggctgac agtaatgcct
3961 gatagaattt cacttcattt tttgctgccc tgttttgtaa attcttcagt ctgtttctaa
4021 cttctatcaa gttgcaaatt tggaatgaaa ataataataa taatacatgt ggagagtact
4081 tgtagtgcgc aaactccaca tgactaatgt gttaaagact cgctcatgat taaatttaaa
4141 taatgctaca aagtatattc agtatgcgtt gcttagtttt ttctcatgat t
//
LOCUS SYNCYSTA 318 bp ds-DNA SYN 15-MAR-1990
DEFINITION Artificial cystatin alpha gene.
ACCESSION X13226
KEYWORDS artificial gene; cystatin; cysteine-proteinase inhibitor.
SOURCE Artificial gene.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 318)
AUTHORS Katunuma,N., Yamato,M., Kominami,E. and Ike,Y.
TITLE Total synthesis of the cystatin alpha gene and its expression in E.
coli
JOURNAL FEBS Lett. 238, 116-118 (1988)
STANDARD simple automatic
COMMENT *source: library=pBR322; clone=pTPI-010;
Data kindly reviewed (23/5/89) by Katunuma N.
FEATURES from to/span description
pept 6 317 cystatin alpha (AA 1 - 103)
BASE COUNT 99 a 63 c 76 g 80 t
ORIGIN
1 aattcatgga cccgggtact actggtatcg ttggtggtgt ttctgaagct aaaccggcta
61 ctccggaaat ccaggaagtt gctgacaaag ttaaacgtca gctggaagaa aaaactaacg
121 aaaaatacga aaaattcaaa gttgttgaat acaaatctca ggttgttgct ggtcagatcc
181 tgttcatgaa agttgacgtt ggtaacggtc gtttcctgca catgaaagtt ctgcgtggtc
241 tgtctggtga cgacgacctg aaactgctgg actaccagac taacaaaact aaaaacgacg
301 aactgactga cttctagg
//GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS YSCPDC1UP 1193 bp ds-DNA PLN 15-MAR-1990
DEFINITION Yeast PDC1 gene for pyruvate decarboxylase (EC 4.1.1.1.).
ACCESSION X14016
KEYWORDS PDC1 gene; pyruvate decarboxylase.
SOURCE yeast (Saccharomyces cerevisiae).
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1193; enum. 1 to 1193)
AUTHORS Butler,G. and McConnell,D.J.
TITLE Identification of an upstream activation site in the pyruvate
decarboxylase structural gene (PDC1) of Saccharomyces cerevisiae
JOURNAL Curr. Genet. 14, 405-412 (1988)
STANDARD simple automatic
COMMENT *source: clone=YIpDC1-2; The sequence overlaps with that reported
by Kellermann et. al. in Nucl. Acids Res. 14:8963-8977(1986)
X04675.
EMBL features not translated to GenBank features:
key from to description
SITE 555 569 RPG box (upstream activation site)
FEATURES from to/span description
pept 1191 > 1193 pyruvate decarboxylase
BASE COUNT 362 a 251 c 206 g 374 t
ORIGIN
1 gatcaagcga gtccatcggc gacctgccca ccgctaaact ctcttaacat acggccgtac
61 agaaaaaacg gcactggagt gagcagtgtc tccgacgatt tggatatcga tttcaacgat
121 agttttgcct cagaagaaag ttataaacgt gccgctgcag tgacatctac cctaaaggcc
181 agaattgaaa aaatgaaggc caaatcaagg cgggaaggga caaccaggac gtaaagggta
241 gcctccccat aacataaact caataaaata tatagtcttc aacttgaaaa aggaacaagc
301 tcatgcaaag aggtggtacc cgcacgccga aatgcatgca agtaacctat tcaaagtaat
361 atctcataca tgtttcatga gggtaacaac atgcgactgg gtgagcatat gttccgctga
421 tgtgatgtgc aagataaaca agcaagacag aaactaactt cttcttcatg taataaacac
481 accccgcgtt tatttaccta tctttaaact tcaacacctt atatcataac taatatttct
541 tgagataagc acactgcacc cataccttcc ttaaaaacgt agcttccagt ttttggtggt
601 tctggcttcc ttcccgattc cgcccgctaa acgcataatt ttgttgcctg gtggcatttg
661 caaaatgcat aacctatgca tttaaaagat tatgtatgct cttctgactt ttcgtgtgat
721 gaggctcgtg gaaaaaatga ataatttatg aatttgagaa caattttgtg ttgttacggt
781 attttactat ggaataatca atcaattgag gattttatgc aaatatcgtt tgaatatttt
841 tccgaccctt tgagtacttt tcttcataat tgcataatat tgtccgctgc ccgtttttct
901 gttagacggt gtcttgatct acttgctatc gttcaacacc accttatttt ctaactattt
961 tttttttagc tcatttgaat cagcttatgg tgatggcaca tttttgcata aacctagctg
1021 tcctcgttga acataggaaa aaaaaatata taaacaaggc tctttcactc tccttggaat
1081 cagatttggg tttgttccct ttattttcat atttcttgtc atattctttt ctcaattatt
1141 atcttctact cataacctca cgcaaaataa cacagtgcaa atcaatcaaa atg
//GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS APH5A10A 721 bp ss-RNA VRL 21-FEB-1990
DEFINITION Foot and mouth disease virus 5' non-coding region, typeA-12.
ACCESSION M32257
KEYWORDS .
SOURCE Foot and mouth disease virus (serotype A-12) cDNA to viral RNA.
ORGANISM Foot and mouth disease virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Picornaviridae; Aphthovirus.
REFERENCE 1 (bases 1 to 721)
AUTHORS Clarke,B.E., Brown,A.L., Currey,K.M., Newton,S.E.,
Rowlands,D.J. and Carroll,A.R.
TITLE potential secondary and tertiary structure in the genomic rna of
foot and mouth disease virus
JOURNAL Nucleic Acids Res. 15, 7067-7079 (1987)
STANDARD simple staff_entry
BASE COUNT 178 a 202 c 181 g 160 t
ORIGIN
1 ccccctaagt tctaccgtcg ttcccgacgc aaaagggagg taaccacaag cttgacgccg
61 tctttcccga cgttaaaggg tagtaaccgc aagcttctac cgcccatccc ggcgttaaaa
121 gggtgtaacc acaagatgca ccttcgtccc ggaagtaaaa cggcaatttc acacagtttt
181 gcccgttttc acgagaaacg ggacgtctgc gcacgaaacg cctgtcgctt gaggaggact
241 tgtacaaaca cgatctaagc aggtttcccc aactgacaca aaccgtgcaa tttggaactc
301 cgcctggtct ttccaggtct agaggggtga cactttgtac tgtgtttggc tccacgctcg
361 gtccactggc gagtgttagt aacagcactg ttgcttcgta gcggagcatg atggccgtgg
421 gaactcctcc ttggtaacaa ggacccacgg ggccgaaagc cacgtccaat cggacccatc
481 atgtgtgcaa ccccagcaca gcaacttttc tgcgaaactc acttcaaggt gacactgata
541 ctggtactca aacactggtg acaggctaag gatgcccttc aggtaccccg aggtaacacg
601 cgtcactcgg gatctgagaa ggggactggg gcttctataa aagcgtccag gttaaaaagc
661 ttctatgcct gaataggtga ccggaggccg gcaccttttc tttacagcca ctgactttat
721 g
//
LOCUS APH5A10B 685 bp ss-RNA VRL 21-FEB-1990
DEFINITION Foot and mouth disease virus 5' non-coding region, typeA-10.
ACCESSION M32258
KEYWORDS .
SOURCE Foot and mouth disease virus (serotype A-10) cDNA to viral RNA.
ORGANISM Foot and mouth disease virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Picornaviridae; Aphthovirus.
REFERENCE 1 (bases 1 to 685)
AUTHORS Clarke,B.E., Brown,A.L., Currey,K.M., Newton,S.E.,
Rowlands,D.J. and Carroll,A.R.
TITLE potential secondary and tertiary structure in the genomic rna of
foot and mouth disease virus
JOURNAL Nucleic Acids Res. 15, 7067-7079 (1987)
STANDARD simple staff_entry
BASE COUNT 169 a 194 c 174 g 148 t
ORIGIN
1 ccccctaagt tttgccgtcg tttccgacgt aaaaggaagt aatcacaagc ttgacaccgc
61 ctgtcccggc ggtaaaggga agtaaccgca agcttactac cgcctacccc ggcgttaacg
121 gggaacaacc acaaacacac ttttgcccat ttttgcgaga aatgggacgt ctgcgcacga
181 aacgcgccgt cgcttgagga ggacttgtac aaacacgatc tacgcaggtt tccccaactg
241 acacttaccg tgcaacttga aatcccgcct ggtctttcca ggtctagagg ggtgacaatt
301 tgtactgtgg ttgactccac gctcggtcca ctggcgagtg ctagtagcag cactgttgtc
361 tcgtagcgga gcatgatggc cgtgggaatc cctccttggt aacaaggacc cacggggcca
421 aaagccacgt cctaacggac ccatcatgtg tgcaacccca gcacggcgac tttcctgtga
481 aacccacttt aaggtgacac tgatactggt actcaaacac tggtggcagg ctaaggatgc
541 ccttcaggta ccccgaggta acacgcgaca cttgggatct gagaagggga ctgaggcttc
601 tttaaaagcg ctcggtttaa aaagcttcta cgcctgaaca ggtgaccgga ggccggcacc
661 tttctctaca accactgata ttatg
//
LOCUS CHKHMG17 1327 bp ss-mRNA VRT 15-MAR-1988
DEFINITION Chicken mRNA for high mobility group protein HMG-17.
ACCESSION Y00416 M32256
KEYWORDS DNA binding protein; high mobility group protein.
SOURCE chicken (Gallus gallus).
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 1327; enum. 1 to 1327)
AUTHORS Landsman,D.
TITLE ;
JOURNAL Unpublished (1987) NIH/NCI, Bethesda, MD 20892, USA.
STANDARD simple automatic
REFERENCE 2 (bases 1 to 1327; enum. 1 to 1327)
AUTHORS Landsman,D. and Bustin,M.
TITLE Chicken Non-Histone Chromosomal Protein HMG-17 cDNA Sequence
JOURNAL Nucleic Acids Res. 15, 6750-6750 (1987)
STANDARD simple automatic
COMMENT *source: developmental stage=embryo.
FEATURES from to/span description
pept 175 447 high mobility group protein HMG-17 (AA 1-90)
BASE COUNT 355 a 320 c 309 g 343 t
ORIGIN
1 gaattccgcc agcgcagcga gccggccgcc agccccgccg cgccgccccg ctctccccct
61 cggccctccc ccgcttctcg ccgccaccga gcgagcccgg ctgcccgccc ccgcccgccc
121 cctccgctcg ctctctccct cctcgcacaa cacacgcacg cgccgcccgg agctatgccg
181 aagagaaagg ctgaaggaga taccaagggc gataaggcca aagttaagga tgagccacaa
241 cggagatcgg caaggttatc tgctaaacct gcccctccga agccagagcc taaacctaaa
301 aaggcagctc caaagaagag tgagaaggtg cccaagggaa agaaggggaa agctgatgct
361 ggcaaggagg gaaacaaccc tgcagaaaat ggagatgcca aaacagacca ggcacagaaa
421 gccgaaggtg ctggtgatgc caagtaaaat gtgtgaattt ttgataactg tgtacttctg
481 gtgactgtac agtttgaaat actatttttt atcaagtttt ataacaatgc agaattttgt
541 tttacttttt tttaagctat gttgttagca cacagaccgc tttgttgttg tgttttgagg
601 ggggggcagt gggacaaacg tcacttaatc tgtttcttgg aacctaaatt ttaaaagttt
661 accccttccc agttttttag aaggactctt cctaaatgga gcaggaaggg attccttcgt
721 gctgcacacc tcttccgttt tgtggaccgc atcagagtga acggaagctc ccgagatgcc
781 tgttgccaac ttcagaactg cagtttgcag tgccctctgc gtttcctttc atgccctccc
841 tttttgccta gagcctatca ctccgaaata cagcagacat ggcatgttgg gactcaccac
901 tctaaatgca ttgtcaggtg atctggactt ctggtgtcta atttgggata taatagctct
961 aaaaggagct gcatttcctc tttcatattg tagatctaca gattaaggaa tctgcagttt
1021 ttaatttttc ctcgcaaagt cagggtagat ttgtgaagag ttgttaaaca acatgctaaa
1081 tgtgaaagtg tccgccctca ctctaaacat ttccctctac aagtatacaa aaatgaagat
1141 ttgtcggttt tatagcaacc tttatgtttg ggtagtccat gaagggaggg gagtttgaca
1201 gttgttgtaa aatgttgcag attgtagccc atgtcctgcc taaattacca tgattgttta
1261 tgaaaagtac ctttaataaa gctggatacg gtttggcttg gaaaaaaaaa aaaaaaaaac
1321 ggaattc
//
LOCUS HUMSAU3E 50 bp ds-DNA PRI 28-FEB-1990
DEFINITION Human Sau3A related sequence, clone pUCXba30.
ACCESSION M32265
KEYWORDS alphoid-like repetitive sequence; repetitive sequence.
SOURCE Human DNA, clone pUCXba30.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Okumura,K., Kiyama,R. and Oishi,M.
TITLE Sequence analysis of extrachromosomal Sau3A and related family DNA:
Analysis of recombination in the excision event
JOURNAL Nucleic Acids Res. 15, 7477-7489 (1987)
STANDARD simple staff_entry
BASE COUNT 16 a 11 c 7 g 16 t
ORIGIN
1 tgaaactctc tttctttgga ttctgcatct aaaaaccaaa cggaagcatt
//
LOCUS HUMSAU3F 88 bp ds-DNA PRI 28-FEB-1990
DEFINITION Human Sau3A related sequence, clone pUCXba25.
ACCESSION M32266
KEYWORDS alphoid-like repetitive sequence; repetitive sequence.
SOURCE Human DNA, clone pUCXba25.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 88)
AUTHORS Okumura,K., Kiyama,R. and Oishi,M.
TITLE Sequence analysis of extrachromosomal Sau3A and related family DNA:
Analysis of recombination in the excision event
JOURNAL Nucleic Acids Res. 15, 7477-7489 (1987)
STANDARD simple staff_entry
BASE COUNT 22 a 18 c 21 g 27 t
ORIGIN
1 cagcagtttc caatcactct ttctgtggaa tctgcaagtg atatttgggc ctctctgagg
61 atttcgttgg aaacgggata aaatgcac
//
LOCUS HUMSAU3G 1192 bp ds-DNA PRI 28-FEB-1990
DEFINITION Human Sau3A related sequence, clone pUCXba34.
ACCESSION M32267
KEYWORDS alphoid-like repetitive sequence; repetitive sequence.
SOURCE Human DNA, clone pUCXba34.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1192)
AUTHORS Okumura,K., Kiyama,R. and Oishi,M.
TITLE Sequence analysis of extrachromosomal Sau3A and related family DNA:
Analysis of recombination in the excision event
JOURNAL Nucleic Acids Res. 15, 7477-7489 (1987)
STANDARD simple staff_entry
BASE COUNT 328 a 239 c 273 g 352 t
ORIGIN
1 cctgtggtgg aaaaggaact atctactcat aaaagctaga tggaagcatt ctcagaaact
61 actttgtgat gattgctttc aggtcacaga gttgaacatt ccctttgata gagccgtttg
121 gagacacaat tttggttcaa tcggcaaggg gagatttgga ccgctttgag gcctatgcga
181 gtagaggaaa tcactgccca taaaaactag acagcagcat tctcaggaaa cactttgtga
241 cgattgagtt gaactcacag agcggaacat tcctttggat ggagcagttt tgaaacacac
301 tttttgtaga atctgcaagt ggacagttgg acctccctga ggatttcttt ggaaacggga
361 taacgtcacc taactaaaca gaagctttcg cagaaacttc tttgtgacgt ttgcattcaa
421 agtccagagt tgaaccttcc cttgatagtt cacgtttgaa acactctttt tgtaggatct
481 gcaagtggat attgggagca ctttgaggcc ttcgttcgaa atgggtatat cttcacataa
541 aatccagaca gaagccttct cagaaacttc tctgtgatga ttgcatgcaa ctcacagagt
601 tgaacattcc tttgcataga gcagtttcga aactctcttt tttctgaatc tgcacatgga
661 taggtgcaac tctgtgaaga tttccttgga aacgggaata tcttcactta aagagtaaac
721 ggatgccttc tcagaaactt ctttgtgagg catgtgttca actcccagac tttaaccttg
781 cttttcatag agcagttttg aaacattctt ttcgtagagt ctccaagtgg acgtttggag
841 cgctttcagg cctgtggtgg aaaaggaaat atcttcacct caaaactaga gagaagcatt
901 gtcagaaacg tctttgtgat gatggcattc aactcacgga gttgaaggtt ccttttgata
961 cagcagtttg gaaacactct ttcagtggga cctgcaagcg gatatttggg gctctttgga
1021 gatttcgatg gaaaaggcat aatcttccca taaaagctaa acggaagcgt gctcagagcc
1081 ttcttggtga tgtttgcatt caactcacag agttgtactt tcctttcgac agagcagctt
1141 tgaaaccctc tctttctaga atctgcaagt ggacatttgg agggcttcga gg
//
LOCUS HUMSAU3H 91 bp ds-DNA PRI 28-FEB-1990
DEFINITION Human Sau3A related sequence, clone pUCXba34.
ACCESSION M32268
KEYWORDS alphoid-like repetitive sequence; repetitive sequence.
SOURCE Human DNA, clone pUCXba34.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 91)
AUTHORS Okumura,K., Kiyama,R. and Oishi,M.
TITLE Sequence analysis of extrachromosomal Sau3A and related family DNA:
Analysis of recombination in the excision event
JOURNAL Nucleic Acids Res. 15, 7477-7489 (1987)
STANDARD simple staff_entry
BASE COUNT 20 a 15 c 26 g 30 t
ORIGIN
1 gttgaaggtt ccttttgata caacagtttg gaaacactct ttcagtggga cctgcaagcg
61 gatatttggg gctctttgga gatttcgatg g
//
LOCUS NEUMTINT1 1377 bp ss-mRNA ORG 27-FEB-1990
DEFINITION N.crassa mitochondrial RNA precursor intergenic region, segment 1.
ACCESSION M32263
KEYWORDS .
SEGMENT 1 of 3
SOURCE N.crassa mitochondrial cDNA to precursor RNA.
ORGANISM Mitochondrion Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa.
REFERENCE 1 (bases 1 to 1377)
AUTHORS Agsteribbe,E. and Hartog,M.
TITLE Processing of precursor RNAs from mitochondria of Neurospora crassa
JOURNAL Nucleic Acids Res. 15, 7249-7263 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 9 ATPase 6 (AA at 1)
pept 1369 > 1377 MAL protein
BASE COUNT 456 a 213 c 277 g 431 t
ORIGIN
1 ctacattaga acagtcctgt gcagtccttt aacaaaatta cgctgcttta ggtgacacct
61 gaaagcggtc caccctcctc ttacgtatcc gaaggatact aaaggtcccc aatatcaagt
121 taaagtcaca tatgcaatca aatagttaaa cgtctcgttg taacttgata agattactct
181 attcgaggtc tcttatagca aatagagaga ggaatgccca ctcgcaaacg ggttcggtac
241 ttaaaatgaa cagagatacg caagttcgca tctgaaaaag tggacggaga aggaacttat
301 ggccactcta ccgagtggtt gtagcaaatg cagaatcgac ccgctatagg tttaacttct
361 cctaggatgt agtttgggag gcacgcccaa actataaaag tgctaagggg taggaaggtt
421 cgaatcctga gttcccacca ccatgggtgg cgtgcctcga gaagggggcg tgcctctgga
481 tagcttgcgc ttattttttt ttttagccgt ccccccgccc tgcagtactg cagggggggg
541 aagctaaaaa aaattaaaga ggtggccaga ggctaaaaaa aaaaatgcta gctttagcta
601 aaaaaaaaat aaccgataaa ccaaattttt aatagcatat atagacgact aatttaatta
661 tattagtctg agcggagcta agctgtgcgg attaggatta ggattaatcc taatcgtact
721 taatgattat ctttaataat atgaaataaa tatgaatatt tatttcacca tgtaacatta
781 ataatgttaa tgtgtgattt aatttggtac gtatcacgta aggggaatga tggatgatac
841 attatgctat gtaatgatgt aatgtaatgt aatgtaatgt tatatccact atgtactgca
901 aatgtaatca tctgatagat aatataaaat gtaacttaat gtaacttaac ttaatgtaat
961 gtaatgtaac ttaatgtaac ttaatgttat gtaactgtaa tgtaatattt tgtaatgtaa
1021 ctgtaatgta acttaatata atgtaactgt aatgtaatat aatgtacagt aatgtaactg
1081 taatgtaatg ttacattaca tgtaatgtaa tgtaatgtat tgtaatgtaa tctttatatg
1141 taatgtacta aagattacga tagctagcca gttatatact ggctagctat cgtatgatta
1201 ttgtagccat tagggatagc ctaatggcta cttgattatc attagatata ctgcaacagc
1261 agtaaataca ctgttcggga gcctcagtgg tatttgctgc tttgcggtgg tggaattgaa
1321 gaggttacta gattctagtt aatttttgat tccaccattg tatatataat gatacaa
//
LOCUS NEUMTINT2 712 bp ss-mRNA ORG 28-FEB-1990
DEFINITION N.crassa mitochondrial RNA precursor intergenic region, segment 2.
ACCESSION M32264
KEYWORDS .
SEGMENT 2 of 3
SOURCE N.crassa mitochondrial cDNA to precursor RNA.
ORGANISM Mitochondrion Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa.
REFERENCE 1 (bases 1 to 712)
AUTHORS Agsteribbe,E. and Hartog,M.
TITLE Processing of precursor RNAs from mitochondria of Neurospora crassa
JOURNAL Nucleic Acids Res. 15, 7249-7263 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 9 MAL protein (AA at 1)
pept 704 > 712 COII protein
BASE COUNT 199 a 149 c 175 g 189 t
ORIGIN 225 bp after segment 1.
1 gttgcatagt aatgatagag attaggaata aaatgatgta tggctgctta aataaaaaat
61 atggctgctc ttcctgctta gctagctagc tagcttcgag ggaccccacg aatacggccc
121 acacacgaag gtgtgtgggg tattcgtggg gggccccctc ctacttcctg cctacgaagt
181 ctaggcaggc aggaagcagc cttaataatt tagctagcca taatatcgcg accaccctcc
241 tccactcaac cccccccttg gcggccccca tctttccccc gccctgcaat actgcagggg
301 ggcataatta aatgtgatgg ggaccttgag caaaggaggg ggtacgcgag tggaggaggg
361 tggtactttt ttgggaggga agctgagcga agctgagcga agctgagcga agtgagcgaa
421 gtgtgcgaag ctgacggaag gtaattcatt tttttgggcc cccgccccga agggcggtcc
481 cccgaaaaaa caaaggacgc tagcgtagcc taaccttatg agcagcaggc tataaaaacc
541 cattaggcac agggtatact tcgtttagtg gataagaatt agcctagcct taatatacaa
601 tacagtttaa tgttatattt aaattctata ttttatgaaa agattataat atatttatta
661 ttatggttta atattcaata tagaatattc tatttctttt tttatgggat ta
//
LOCUS NEUMTINT3 691 bp ss-mRNA ORG 28-FEB-1990
DEFINITION N.crassa mitochondrial RNA precursor intergenic region, segment 3.
ACCESSION M33670
KEYWORDS .
SEGMENT 3 of 3
SOURCE N.crassa mitochondrial cDNA to precursor RNA.
ORGANISM Mitochondrion Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa.
REFERENCE 1 (bases 1 to 691)
AUTHORS Agsteribbe,E. and Hartog,M.
TITLE Processing of precursor RNAs from mitochondria of Neurospora crassa
JOURNAL Nucleic Acids Res. 15, 7249-7263 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 9 COII protein (AA at 1)
BASE COUNT 181 a 154 c 198 g 158 t
ORIGIN 754 bp after of segment 2.
1 gaacagtaag acgataaaat agtttttggc taaataaagg aactcggggg accccacgaa
61 tacggcccac acacgaaggt gtgtggggta ttcgtggggc cctatgtatt tgagtatgaa
121 gccatctagt ggttatctct cccccccctc cccgcaggtt tgcggggggg agcgaggggg
181 tagcgataaa aagtttctaa tatgaagagg atgacttatg gaattaactc cacccccccc
241 ccccttaagg gggggagtgg agtggaacgg aatgtggaga cgccgcgtgc cccccgcaaa
301 tttgcggggg gggcgggtgg atggaacatc cgggaggtgg tgggttatat tataaacaat
361 attcggggga accccgcccc ttcgggggcg gaggccatta actggaaagc acccccccct
421 taaggggggt ggcttttaat tattattatt tttataggtt aattaggttt aaaaggctac
481 ccgccccccc ccctgcagta ctgcagggcg gggaagggtt aaaataacaa gcaagttcct
541 gaagtctggc gtagctagcg gaaggaacct tgccttaaat ccaaaaacct tttaaagagt
601 taccaagact gaaccggggg ctagcccccg attaggtaat taataagttt atgtatatct
661 agtaagggta taactctaat cccctgcagt a
//
LOCUS PMUMUC1 965 bp ds-DNA PHG 15-MAR-1990
DEFINITION Bacteriophage mu C operon 5' region
involved in late gene transcription.
ACCESSION Y00419 M32261
KEYWORDS C operon; activator protein; late gene transcription;
regulatory protein; trans-activator.
SOURCE Bacteriophage mu DNA, clone pWM6.
ORGANISM Bacteriophage mu
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 965)
AUTHORS Stoddard,S.F. and Howe,M.M.
TITLE DNA sequence within the mu C operon
JOURNAL Nucleic Acids Res. 15, 7198-7198 (1987)
STANDARD simple automatic
REFERENCE 2 (bases 161 to 360)
AUTHORS Stoddard,S.F. and Howe,M.M.
TITLE Characterization of the C operon transcript of bacteriophage mu
JOURNAL J. Bacteriol. 172, 361-371 (1990)
STANDARD simple staff_entry
COMMENT The mu C gene product is a transactivator of mu late transcription.
see also X03992 for Mu C gene
Data kindly reviewed (27-OCT-1987) by Howe M.M.
EMBL features not translated to GenBank features:
key from to description
SITE 53 43 (C) pot. MuA binding site
SITE 123 111 (C) pot.IHF binding site
SITE 185 173 (C) pot. Mu repressor binding site
SITE 193 180 (C) pot. Mu repressor binding site
SITE 183 193 pot. MuA binding site
INVREP 190 197 inverted repeat A
INVREP 203 210 inverted repeat A'
TERM 190 210 transcription terminator of an
early mRNA
PRM 225 230 pot. -10 region
SITE 237 240 region of 5' end of Mu C mRNA
(by S1 mapping)
RBS 251 253 pot. ribosome binding site
SITE 414 426 pot. IHF binding site
SITE 454 467 pot. Mu repressor binding site
RBS 462 465 pot. ribosome binding site
INVREP 782 800 inverted repeat B
SITE 820 811 (C) pot. MuA binding site
SITE 846 858 pot. Mu repressor binding site
SITE 903 890 (C) pot. IHF binding site
SITE 893 902 pot. MuA binding site
FEATURES from to/span description
ORF 262 480 open reading frame 1
ORF 278 406 open reading frame 2
ORF 390 97 (c) open reading frame 3
ORF 473 835 open reading frame 4
ORF 845 964 open reading frame 5
ORF 933 739 (c) open reading frame 6
BASE COUNT 183 a 230 c 269 g 283 t
ORIGIN
1 ctacattcca cgcggtcagg ctctggactc gctgattcgg gatttgcgta tctggaatga
61 ttttaacggt cgcaacgtca gcgaactgac cacccgttat ggtgtcactt tcaatacggt
121 gtacaaagcc attcgccgga tgcgacggct gaaataccgc cagtaccagc cctcacttct
181 gtaaacagta aagccggtta atccggcttt ttttacgtcc tcaatatcct gtgatgaata
241 accgtaccgg ggaaaatcat catgggtaaa ggctggaatg catcctttca tcttggcaga
301 cgtgagcggc tgcgtcagga ggttttgcac cgtgtcgccg gtgggccacg tcctgcgccc
361 cgcgactata ccggtcatga tggcacccac ggcagttatt acatgaaagg ctggcagtca
421 gttgatatgc cggaaattct tcatcattgc ctgctttaca gggagaaaca ttatgtctga
481 gcgttctgct cgtcagtggc ctgatttttt atcggtggtg ttactggcgc tgttattgtg
541 gataagtctg ttttgtggct ggcgcgcgct gatgttttgc tgtgcctcgg ttttcagtgt
601 tgctttatgt gttgccgctg attgtctgga tgcgctgata atgagctgcc gggtgcctga
661 acattttgcc cggttcgtct ggcctctgac ctggctgggg agtctgtccg ggctgggatt
721 agccgtgatg gcaacgtctc aactgaaaac gggtcctgag catgtcatct gggcgctggc
781 cggattgctg acgttctggc tctcatttcg ttttcgcgct cgtctgttcg ggtaaagggg
841 caccatgtac agaaaattca gtgatgaatg tttcgggccg tccacgctga ttaatgcgat
901 aaaagtgatt gcccttgtgg ttctgataac catcagtgcc gtggtgtatc tttctgtctg
961 ttaac
//
LOCUS PNSTETFA1 97 bp ds-DNA BCT 28-FEB-1990
DEFINITION Plasmid pNS1981 flanking region of tet resistance gene, 5' end.
ACCESSION M32262
KEYWORDS .
SEGMENT 1 of 2
SOURCE Plasmid pNS1981 from B.subtilis.
ORGANISM Plasmid pNS1981
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 97)
AUTHORS Sakaguchi,R. and Shishido,K.
TITLE A unique DNA structure of the junction of homologous and
nonhomologous regions between tetracycline-resistance plasmid
pNS1981 and kanamycin-resistance plasmid pUB110
JOURNAL Nucleic Acids Res. 15, 7202-7202 (1987)
STANDARD simple staff_entry
BASE COUNT 38 a 8 c 18 g 33 t
ORIGIN
1 tgggaacgga aaaattattt tattaaagag tagttcaaca aacgggccat attgttgtat
61 aagtgatgaa atactgaatt taaaacttag tttatat
//
LOCUS PNSTETFA2 94 bp ds-DNA BCT 28-FEB-1990
DEFINITION Plasmid pNS1981 flanking region of tet resistance gene, 3' end.
ACCESSION M33669
KEYWORDS .
SEGMENT 2 of 2
SOURCE Plasmid pNS1981 from B.subtilis.
ORGANISM Plasmid pNS1981
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 94)
AUTHORS Sakaguchi,R. and Shishido,K.
TITLE A unique DNA structure of the junction of homologous and
nonhomologous regions between tetracycline-resistance plasmid
pNS1981 and kanamycin-resistance plasmid pUB110
JOURNAL Nucleic Acids Res. 15, 7202-7202 (1987)
STANDARD simple staff_entry
BASE COUNT 27 a 14 c 16 g 37 t
ORIGIN
1 tttataacag gaattcaaat ctttttgttc cattaaaggg cgcgattgct gaataaaaga
61 tacgagagac ctctcttgta tcttttttat tttg
//
LOCUS STMTGFMA 147 bp ds-DNA BCT 28-FEB-1990
DEFINITION S.rimosus initiator tRNA gene.
ACCESSION M32254
KEYWORDS transfer RNA-fMet.
SOURCE Streptomyces rimosus DNA.
ORGANISM Streptomyces rimosus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 147)
AUTHORS Gamulin,V. and Soell,D.
TITLE The initiator tRNA genes from Streptomyces rimosus
JOURNAL Nucleic Acids Res. 15, 6747-6747 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
tRNA 51 125 tRNA-fMet
BASE COUNT 28 a 39 c 51 g 29 t
ORIGIN
1 gtgatttgcg ctggtcgacg gcggtggtgc gtagactggt cacacaacgg cgcggggtgg
61 agcagctcgg tagctcgctg ggctcataac tcagaggtcg caggttcaaa tcctgtcccc
121 gctactgatg aagaaaggcc cggatcc
//
LOCUS STMTGFMB 147 bp ds-DNA BCT 28-FEB-1990
DEFINITION S.rimosus initiator tRNA gene.
ACCESSION M32255
KEYWORDS transfer RNA-fMet.
SOURCE Streptomyces rimosus DNA.
ORGANISM Streptomyces rimosus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 147)
AUTHORS Gamulin,V. and Soell,D.
TITLE The initiator tRNA genes from Streptomyces rimosus
JOURNAL Nucleic Acids Res. 15, 6747-6747 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
tRNA 51 125 tRNA-fMet
BASE COUNT 28 a 48 c 49 g 22 t
ORIGIN
1 gcgtgggccg ggcaccacct ccgtgccgtg gtaccgtgga cacacaacgg cgcggggtgg
61 agcagctcgg tagctcgctg ggctcataac ccagaggtcg caggttcaaa tcctgtcccc
121 gctactggaa acgaaaggcc cggatcc
//GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS MUSIGR 2003 bp ss-mRNA ROD 18-JUL-1989
DEFINITION Mouse interferon-gamma receptor mRNA, complete cds.
ACCESSION M25764
KEYWORDS interferon-gamma receptor.
SOURCE Mouse (strain BALB/c) plasmacytoid lymphosarcoma cell line ABPL4,
cDNA to mRNA, clone 45.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2003)
AUTHORS Kumar,C.S., Muthukumaran,G., Frost,L.J., Noe,M., Ahn,Y.H.,
Mariano,T.M. and Pestka,S.
TITLE Molecular characterization of the murine interferon gamma receptor
cDNA
JOURNAL J. Biol. Chem. 264, 17939-17946 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence for [1] kindly
submitted by S.Pestka, 30-JUN-1989, for release after publication.
FEATURES from to/span description
pept 31 1464 interferon-gamma receptor precursor
sigp 31 111 interferon-gamma receptor signal peptide
matp 112 1461 interferon-gamma receptor
BASE COUNT 534 a 462 c 489 g 518 t
ORIGIN Chromosome 10.
1 ccccctgtca gaggtgtccc tcgcgcagga atgggcccgc aggcggcagc tggcaggatg
61 attctgctgg tggtcctgat gctgtctgcg aaggtcggga gtggagcttt gacgagcacc
121 gaggatcctg agcctccctc ggtgcctgta ccgacgaatg ttctaattaa gtcttataac
181 ttgaaccctg tcgtatgctg ggaataccag aacatgtcac agactcctat ttttactgta
241 caggtaaagg tgtattcggg ttcctggact gattcctgca ccaacatttc tgatcattgt
301 tgtaatatct atggacaaat tatgtatcct gatgtatctg cctgggccag agttaaagct
361 aaggttggac aaaaagaatc tgactatgca cggtcaaaag agttccttat gtgcctaaag
421 ggaaaggtcg ggccccctgg cctggagatc aggaggaaga aggaagaaca gctctccgtc
481 ctcgtatttc accctgaagt cgttgtgaat ggagagagcc agggaaccat gtttggtgac
541 gggagcacct gttacacatt cgactatact gtgtatgtgg agcataaccg gagtggggag
601 atcctacata cgaaacatac ggtcgaaaaa gaagagtgta atgagactct gtgtgagtta
661 aacatctcag tatccacact ggattccaga tattgtattt cagtagacgg aatctcatct
721 ttctggcaag ttagaacaga aaaatcgaaa gacgtctgta tccctccttt ccatgatgac
781 agaaaggatt caatttggat tctggtggtt gctcctctta ccgtctttac agtagttatc
841 ctggtatttg cgtattggta tactaagaag aattcattca agagaaaaag cataatgtta
901 cctaagtcct tgctctctgt ggtaaaaagt gccacgttag agacaaaacc tgaatcgaag
961 tattcacttg tcacaccgca ccagccagct gtcctagaga gtgagacggt gatctgtgaa
1021 gagcccctgt ccacagtgac agctccagac agccccgaag cagcagaaca ggaagaactt
1081 tcaaaagaaa caaaggctct ggaggctgga ggaagcacgt ctgccatgac cccagacagc
1141 cctccaactc cgacacaaag acgcagcttt tccctgttaa gtagtaacca gtcaggccct
1201 tgtagcctca ccgcctatca ctcccgaaac ggctctgaca gtggcctcgt gggatcgggc
1261 agctccatat cggacttgga atctctccca aacaacaact cagaaacaaa gatggcagag
1321 cacgaccctc cacccgtgag aaaggccccc atggcctccg gttatgacaa accgcacatg
1381 ttggtggacg tgcttgtgga tgttgggggg aaggagtctc tcatggggta tagactcaca
1441 ggagaggccc aggagctgtc ctaaggtctc ccgaggcctg ctggtggtaa agaaactgac
1501 cttttaggca gtttttctgc attgatttca tgaaagaagc tatacattag ctaatactaa
1561 ccacatagaa tatcagactt agatacgtga ataaggatcc tgtgggcact gctgggtcca
1621 ctctgcaaat gccaagacta tcaaaggaac gtattgtcgc ttctggctcc ttcccaggtg
1681 ggctagcatc tgtgagtttg cctcggctag ccttgcttcc tacagccgcc actgctcctc
1741 caccctgatc atctcacagg acagggtgga ccgggttttt tttttttttt tcacacacct
1801 ttgtatatgt aagttcatgt atataatatg tttacatgtt tcactttgaa ctgaaagcta
1861 ctcaaagcca gccgtaagtc tatggtagaa tgtgatggaa catgttggtg gaagcttgta
1921 caatagaaca cattggtggg agcttgtaca tactttttta tggagcatta cttacgattt
1981 tttaagtaaa atgttttgaa acc
//GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS DROELAVK 2505 bp ss-mRNA INV 10-OCT-1988
DEFINITION D.melanogaster elav protein, complete cds.
ACCESSION M21152
KEYWORDS RNA-binding protein; elav protein; neural protein.
SOURCE D.melanogaster (Canton-S) Kauvar 3-12 hour embryo library cDNA to
mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2505)
AUTHORS Robinow,S., Campos,A.R., Yao,K.-M. and White,K.
TITLE The elav gene of Drosophila, required in neurons, has three RNP
consensus motifs
JOURNAL Science 242, 1570-1572 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence kindly submitted
by K.White, 26-OCT-1988.
FEATURES from to/span description
pept 492 1943 elav protein
BASE COUNT 772 a 669 c 629 g 435 t
ORIGIN Chromosome X; 1B5-9 on the cytogenetic map.
1 tttccacgcc aaccatcgtg cgccgcggtc tgaacgctcc tgccacagaa aaagaaaata
61 aaagcaagga aaattctaat cgaataccaa atatcgtgct tgtgtgctct ttccgcaatt
121 gatttttttt aagtagtgca tgacaataac cgttgagttg actccaaccg aagtaaccat
181 aactggaagc tgacaacagc cattggccgg aatctcatct cacaccgagc gaaatacggc
241 gacgccagcg tttatttatt aaaaacgatt acattacaac aaacaccaaa gtcctggaca
301 accgaagaga gtgtgggaga aaatcgaagc agagagcaag gaggagcgcc gtgagagcgc
361 gcgcgcaagg atttgagatt gcaaagagaa cttagcaatt agttcgcttt gtttgtccag
421 ccaaacaaca acaacaacca accagctccc caaaaccaac tacacacata ctcgctccct
481 aaactaaaac aatggacttt attatggcaa ataccggagc tggcggcgga gtagacacac
541 aggcacagct aatgcagagt gccgctgcag ccgcagcagt ggcggcaaca aacgcggccg
601 ccgctcccgt acagaatgca gccgccgtgg cggccgccgc ccagctgcag cagcaacagg
661 tgcaacaggc aatcctgcag gtgcagcagc agcagacaca gcaagcggtg gccgcggccg
721 ctgccgcagt gacccagcag ctccaacagc aacagcaggc tgtcgtggcc caacaggctg
781 tagtgcagca gcaacaacag caggcggcgg cagtggtgca acaggcggcg gtccaacagg
841 ctgtggtgcc ccagccgcag caggcgcagc ccaatacgaa tggcaatgca ggatcgggat
901 cgcaaaatgg cagcaacggc agcacggaga cgcgcacaaa ccttattgtc aactacttgc
961 cgcaaacaat gaccgaagac gagatccgtt cgctcttctc cagcgtcggc gagattgagt
1021 cggtgaagct gatacgcgac aagtcgcagg tctacatcga tcctctcaat ccgcaggcgc
1081 ccagcaaggg ccaaagtctg ggctacggct ttgttaacta tgtccggccg caagatgccg
1141 agcaggctgt taatgttcta aacggcctgc gactgcagaa caaaaccata aaggtgtcgt
1201 ttgcccgccc gtcgtccgat gccattaaag gcgccaacct ttatgtgtcg gggctgccaa
1261 agacgatgac ccagcaggaa ctggaggcca tcttcgcacc attcggagca ataatcacat
1321 cgcgcattct gcagaacgct ggcaacgata cgcagacgaa aggagtaggc tttattcgat
1381 tcgataagcg ggaggaggcc acaagggcta tcattgcctt gaacggtact acaccatcca
1441 gctgcacaga ccccatagtg gtaaaattct ccaatacgcc cggcagcacc agtaagatca
1501 tccagccgca gctgcccgct ttcctcaatc cccaactggt gcgacgcatc gggggcgcca
1561 tgcacacgcc agttaacaag ggtctggccc ggttttcacc aatggcgggc gacatgctgg
1621 acgtaatgct gcccaatgga ctaggagcgg cggcggcggc ggccacaacg ctggccagtg
1681 ggcctggcgg cgcgtatccc attttcatct acaacttggc gccggaaacg gaggaggctg
1741 ccctgtggca gctgtttggt ccatttggag ctgtgcaatc ggtaaagatt gtcaaggatc
1801 ccacaacgaa tcagtgcaag ggctacggct tcgtttcgat gaccaactac gatgaggcgg
1861 ccatggccat tcgggcgctc aacggctaca ccatgggcaa tcgggtgctg caggtcagct
1921 tcaagaccaa caaagccaag tagagcggcc caaatggaag tggacaagga cgcagcggga
1981 gcaccagcaa ccacaaccca ttatcttgaa tctaaacctg accacacaac acacatatat
2041 aaatatgcat ggtataacgg taactaagcg caacaaaaca gtatgtgtac aacacacaag
2101 aaaaaaacaa gaaagtagca ggcaggagaa aatactcttt ccactaaacg acaacgaaac
2161 ggcggaaatc cccctaaaaa cacaaatctc ctccccaaaa ccgccggcca aaagcgttgc
2221 aatagaaaaa ttcttcttgt ttagcattta agtaaaaaac cttaactaaa cgaagcgagc
2281 agaagaagtt ataagatcgg gagaagcgaa taggtagggg gttgagaaag aacaatgaga
2341 ttttaagaaa taatattgca aaatgaaaac acactgtggc agtggcagct gtgaaaggtc
2401 aaaggttggc acagtcgatc tagtcacaaa gcaaaacaaa gaaatcaacc aaacaaacac
2461 aaaacacaca aaaacgaaaa caaatttaac tttattagca aaaaa
//
LOCUS DROELAVM 9285 bp ds-DNA INV 26-OCT-1988
DEFINITION D.melanogaster elav protein, complete cds.
ACCESSION M21153
KEYWORDS RNA-binding protein; elav protein; neural protein.
SOURCE D.melanogaster (Canton-S) Maniatis library DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 9285)
AUTHORS Robinow,S., Campos,A.R., Yao,K.-M. and White,K.
TITLE The elav gene of Drosophila, required in neurons, has three RNP
consensus motifs
JOURNAL Science 242, 1570-1572 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence kindly submitted
by K.White, 26-OCT-1988.
BASE COUNT 2786 a 2147 c 2058 g 2294 t
ORIGIN Chromosome X; 1B5-9 on the cytogenetic map.
1 ggatccagtc gagggatcac tgaactaaaa agtaaagaga ggagaaacag gagaagccct
61 taaagtgcga gttcgctgga gtgcctatcg atccttgagg gcagcgctcg atccttatta
121 ctattggcag gatcatttta taccccttac ataaaaatat cagtatcagt ttgttccctt
181 atcatgcgtg ctctttgaca tatccgcggg aatttatttc tatttttgct tagtcaaact
241 gaagaccatt ttgtcccaac ataaaatatg taaaatattt ttttttatca ttttaaattc
301 tcagggatta agtcaatgtt tgaaaacttc tagctgactt gcgctccact accactcggc
361 atgcgtgccc ctgattagta ggccaggtgg tgggcctttg gcacgatcgc aaccctaagg
421 actgcaaccg aatagcattg agaaggagac caggatttct gcactcaaag gtgaaattgt
481 ctgagcacgg aacacggaaa aacaacaaca aaaacatggc agacggaaaa caatgccgcc
541 agcggcaagg agggagggtg gaaacacatg ccacgattca gtgtcgaatg tcctgtgatc
601 ccacattcat atattttgtt attgtgtgcc tgcatccctc gcctgctttt aattcctgca
661 tcgatccttc tttgctcctg gtctacgaac ccgatgtctt atggtaattt tatgattatt
721 acaattagat tcgggggaaa ctgcatctgc cgccgtctgc agatccgatt catttggatt
781 ccgtctgctc aatccagatc catgatggga tcatggccat ttgggcattg gtacatggaa
841 agttaaaagc gccctaggcg ttctaatcca actcgtctca agaaaaacgg gtgaatgcat
901 ggaacgctgg atctggttct ggaagatccg gctaagtcaa caagaaatat gtaacctcgc
961 cattatagct ttactttcga agcagaaaat gcaatcgaga atatcgaatg cacgctccag
1021 gttcctatgg ctcctagctt tcggtgcagc acaacaaata tttgggttgg ttcgtttttc
1081 cgccacgatt ttaaaacgct aagggtggct ttttcggagt ccctctcatt tgagccggga
1141 aaaagaccag accctccctt tccctcccca gccactttcc agctttgttg taacctcgtg
1201 ttgtgcctgc tttttaatcc cgctttgttg taacgcaggc catttgctgt ttttgttttg
1261 tttctggcgt ggctaaaacg gattatgtga cccaaccctt tcgatttcga gtctcattgt
1321 attacatcct tttcccaaaa atgactaaaa aagcgtattg cctatttctg gggtggacag
1381 aatgaaacgg ggtcagcacg tggccgcaaa cttgaaatac cattatggat ccaataggac
1441 tgtgaagagt tattagtgaa acatttcgaa caaactattc ctttaattga aatttgccaa
1501 ttccccactt aagcatcgca tttgcatatt tgtatgctaa ccaaactctc tgcgacgtcg
1561 atctgtctga atctgaatcc gcatatcttt cgagggcagc aatagcgcca ccccacccca
1621 tttccgttgc cactagcagc gcaccaccct ttgaaaggac atcaaaaacc accccctgcc
1681 caaagttgcg ttacaattcc cgaagtgcag tgcagcgtac ataaaacata caatgaaatc
1741 cgataaaatt ttctatgtcc tttatatcct cataacatgc ccaactataa accgagccaa
1801 attcttctcg tgaaaaagtt gcacacaacg cgatttcact gtaccttaca ttgttctgta
1861 aactggaaca gagcgtccga ctggaagagg cgaaggaaaa aatcagcaca gggaacagga
1921 agttcacaag gattgggggt gtccgagggt ggctacatat atatggcgat atatggcata
1981 cagtatgtgc atacatatac atacatatac acatatatgt atgtatgtac catactatgt
2041 ggcgtgtcta gtggcgcgaa taacgaatac aggagcaatg aagcacacga acgaagcgaa
2101 acgaaacgaa gggaaatatt gaaaaagcca gaattgcatt acttcaaaac tataccggct
2161 cttggatatg ggtcctgggt ctcggttcct tgcttctggc tttcgcttcg tagggggtgt
2221 acgagcaata ctgccaggag aaaccaccct agcctttggc gttggtggtt gatagtgcaa
2281 aaataataat aacggaaacg gaaccataaa cccaaaacag cccttacaac gcgactgatt
2341 tgtaaggata atggtcgcat gcatcataaa acggttgacg acctcgtcgg aaagagccaa
2401 cattttgcaa tgatttccaa ttgctcctgc tatctaggcc caatgggccg ctaagtacat
2461 atgaagcata catagctaaa tacatatgaa gcatacatac atcgggttaa ctataaaacc
2521 aatgtgttct agatgagttt gttaatttta caaagtaagt tagggctgct ccagtggttt
2581 taagttgctt aactggacaa gagaacttgg ctctctcccc catctatctc ttgtcttcgg
2641 gaaaggtaac ctccattttg atcggttgaa tttgtatcaa aaagcatgaa aaactttgaa
2701 tacagcataa accactgaga gagcagccga aagccagggg tgtatcaaaa ggggttctta
2761 gaaatattcc actttgcaac gattatagac tcgtttctta tcagacaaaa ctttctccac
2821 ttctacgtgg tatttacaga aacaaaactt aaggtaaaat gttaggtggc tatatattaa
2881 agctattaaa atatattaaa ggtttccgga ataaccccct atctcggcgc gcagatacgc
2941 ccatgtcgcc actctttcgc tctctacctt ccactctcgc tccgaacacc attgcaaccc
3001 cgagagagct gctcgagagg caactatgag atatgagaat gagaggttgc tctatgtgaa
3061 aatggaaacc gaaaccgtca atgcctcaat ttcgcatttc attatttcat ttccacgcca
3121 accatcgtgc gccgcggtct gaacgctcct gccacagaaa aagaaaataa aagcaaggaa
3181 aattctaatc gaataccaaa tatcgtgctt gtgtgctctt tccgcaattg atttttttta
3241 agtagtgcat gacaataacc gttgagttga ctccaaccga agtaaccata actggaagct
3301 gacaacagcc attggccgga atctcatctc acaccgagcg aaatgtaaga gtatccccag
3361 gatatccaca atttgtgcaa ctgaatgaaa ggcaatgata acaatatcaa tcaacaactt
3421 tgaaacagta cttttttttt gctaatgatc gcgtcgcgct gcataattgt gcaatttact
3481 cacttataat ccgcgtgtct gcatatatcc atatgtacct atatctattt ttttgctagt
3541 gggtgtgtga tgtgtgtaag ttgtatacat tcaaacgaat attaataggc aaaacaacaa
3601 tatgcaattc gatggaggaa aaagcgaacc tcccaaagag aatggcggag agatgggcgg
3661 cggttagtca gagtgagagg gggcggccag tgcggagata gtagatgtgt tcagatctac
3721 gcctctgtga gctgtgtatc tgtaaatact tgctgctcaa atttaaatct gtgtgtgtat
3781 gtgtgtgtgg gcttgcgtgt gcgggagaga agagcgctga gtgttagagc gagagccagt
3841 tccgaagagg gcgcagctag catatcgatt ttatgatttc atgtgcaagg agcattgtac
3901 gcaacaacaa acggaggaac acgaccaact aaccgttaga agtagccatt ttttgagcgc
3961 agcggagagg gtgggcgagt gtgtgtgggt ggcggagtac ttgcagtcaa caaaacgccg
4021 gttgccataa aagcaaatgc acgcaggcat gaatcagacc catgtcgtag tagcaatatt
4081 aaacatagag gattcgcgat tcacagcaaa ggagtttttt ctcaaaaaac gacaacaatt
4141 agttgttttt tttcccagtg gtcgtctgtg tttatgcttg tgtggaagag agctcgaaat
4201 aggtagatat atacacgcgt atttatttat tatgtgtaca tatgtgcttg taaaaacaat
4261 atcagccata caaagtttga gtcccctccg cccacctcct aacaacaaca tcgatatcta
4321 aatcggtgca ggtaaagagc ggttgctaca ggcccaaata acctctcttc agtagaattt
4381 taccaacatc tcaaagtatt ctcgtgcgcc acatgtatgc gtatctgcac gcagatatat
4441 tttttgtatc gcctgtgaaa tatgaattca atgctttgga tggggtaaaa tcgataatat
4501 cgattggcga taacaatcgc acgcacacac acaggcccgc aaacacgaac aactaaaact
4561 acaacaacaa ccaccaaacc catcgaattt accgcttata gttttgtttt atttttgttt
4621 ttgtccacat cagacggcga cgccagcgtt tatttattaa aaacgattac attacaacaa
4681 acaccaaagt cctggacaac cgaagagagt gtgggagaaa atcgaagcag agagcaagga
4741 ggagcgccgt gagagcgcgc gcgcaaggat ttgagattgc aaagagaact tagcaattag
4801 ttcgctttgt ttgtccagcc aaacaacaac aacaaccaac cagctcccca aaaccaacta
4861 cacacatact cgctccctaa actaaaacaa gtgagtattg aaagacgacg ggccgagctg
4921 acgagatcgc aggacgaaag agagaaaact ttcgaagggt gcggtgtcgt gtcaaggcaa
4981 cacaatcaaa ccactaatgc ggaatattaa aaaaaaaaat taatttaaat cataagaggt
5041 tagctatatt tgatatatat atgtgtgtat ccatgtacat atgtctatgt tagacaagta
5101 gctctttctt gccgtattgt agttcgggtt atcgttaccc gcctctaatt ggcttgccat
5161 gacgaaagtg ggtggtacag gacgacgacg tgcgtgcgcg tgaccttttg actcttccat
5221 aagcgcctca ataaaaggtg cgagtcggta catacatatg tatgagaata gaggatatta
5281 aaaacgcgcc aaactctaag aaaagcatcc accaaagtaa aaacagtagc aactaaaaac
5341 ttttcagtct cagctaacca gtccaaacca gctatgtagt ttttagcacg actttgggag
5401 tgtcgaatgc gttaactaaa gctcaaatat atatcgagac ccgctgtagt ccctatatgc
5461 aaatcaaatt gctaccattt ccgcagcgat cgcttttcgc gacagccccc tctctctgtc
5521 gcacttagtt cagtgaacca acacattcgc tggtttcgtt ccgagcagca tcctctgtct
5581 ctcttgctca gcctcttcgc cgccccctct gcctaagcct ctgtttctgc tgtccgcacc
5641 ccttaccaag gtgtcatgta aatggcggag ccggcttagg gggtggcaat gccgtgcttt
5701 taacccttat cggtggcaaa tggcgctcgt acatttggag aaaatgccgt tctacgtttc
5761 gttttgcttc gtttcgccaa gcgaagtatt acaacttgtt tgactaaggg tgggtctacc
5821 gtctattgct catttatggt gttttgatgc tttctcaagg ggtgcgggct ggtttctcgc
5881 atgtcctgac cctctgggtc atccctctgt tcaccgcatt acataactgg gcggggtgca
5941 cagcgcgaca cactcttatg gagagagcgc tgtaatttgc ggaaaggaga cgagggggag
6001 aagggacgca agagcagcta cagcgcgcag cagtgccaga aagtcctgcg ctcgccccag
6061 aagcatcctg ttcctgtgac gcagcggatc tggtctcgcg ttcgctccac tacctccact
6121 tcctacacct acagtacaca tatcacatga acttaaaact agtgaacgga ttaattaaaa
6181 aggaatcgca aaacaaaagc tcgctaatcg cgcagacaca gcagaagctt aaattcagaa
6241 aagtttagtt cttcagtgcc gaaggtgtgt ttgcagagct ttgatttcta gtgcgcgggt
6301 gtgggaagag aggactaaat agagacagag agagcggccg gatgagaaag tggggcgctg
6361 ggcggcgcct gttagcaaag cttcgttgac caaccgttat ttcctgaaca aaggaataac
6421 atacattcat cacttaatca ccaactgttt ttaaatccac agaattataa tttagcgtat
6481 gattcaagat ttgggaattg tgcgcaccca ggataaccgc gtttgtgttt aatttacaaa
6541 catgccccta atgtctatta aataaaagtt cagcgtcgga aaattaaaga gtctggtaag
6601 gattacagta tatttatgct gtaatatcct ttaattgtcc cgggtcacat ttttccgtat
6661 ccccacgccc taaagcattt tcgtatccat taattaaaca agtaaaggtg taggcttata
6721 ttatgttata cagtccgctc tcacaagcat tataagaaat atattacttg tctgttcacc
6781 aaataggtaa aaaatataca tatcatatca aacgatttaa ataattgcgc cctgggcaga
6841 tgtaaataaa gaaatataaa ttaaaaaata tcgttagaat aaaccaacaa attttctatt
6901 caaaccctag ctgattgata atcaattagc ccgttataac tactaaaacc cataataata
6961 caaaaaacta ctaacacatc atgcaaatgt tttaattttg acatttaact attctttcca
7021 tgcgcctacc cctgcgttgg tcgtgacaac caacaataaa cttctcgact ctgcccgatc
7081 ctctcaccta tcactcagtg gactttatta tggcaaatac cggagctggc ggcggagtag
7141 acacacaggc acagctaatg cagagtgccg ctgcagccgc agcagtggcg gcaacaaacg
7201 cggccgccgc tcccgtacag aatgcagccg ccgtggcggc cgccgcccag ctgcagcagc
7261 aacaggtgca acaggcaatc ctgcaggtgc agcagcagca gacacagcaa gcggtggccg
7321 cggccgctgc cgcagtgacc cagcagctcc aacagcaaca gcaggctgtc gtggcccaac
7381 aggctgtagt gcagcagcaa caacagcagg cggcggcagt ggtgcaacag gcggcggtcc
7441 aacaggctgt ggtgccccag ccgcagcagg cgcagcccaa tacgaatggc aatgcaggat
7501 cgggatcgca aaatggcagc aacggcagca cggagacgcg cacaaacctt attgtcaact
7561 acttgccgca aacaatgacc gaagacgaga tccgttcgct cttctccagc gtcggcgaga
7621 ttgagtcggt gaagctgata cgcgacaagt cgcaggtcta catcgatcct ctcaatccgc
7681 aggcgcccag caagggccaa agtctgggct acggctttgt taactatgtc cggccgcaag
7741 atgccgagca ggctgttaat gttctaaacg gcctgcgact gcagaacaaa accataaagg
7801 tgtcgtttgc ccgcccgtcg tccgatgcca ttaaaggcgc caacctttat gtgtcggggc
7861 tgccaaagac gatgacccag caggaactgg aggccatctt cgcaccattc ggagcaataa
7921 tcacatcgcg cattctgcag aacgctggca acgatacgca gacgaaagga gtaggcttta
7981 ttcgattcga taagcgggag gaggccacaa gggctatcat tgccttgaac ggtactacac
8041 catccagctg cacagacccc atagtggtaa aattctccaa tacgcccggc agcaccagta
8101 agatcatcca gccgcagctg cccgctttcc tcaatcccca actggtgcga cgcatcgggg
8161 gcgccatgca cacgccagtt aacaagggtc tggcccggtt ttcaccaatg gcgggcgaca
8221 tgctggacgt aatgctgccc aatggactag gagcggcggc ggcggcggcc acaacgctgg
8281 ccagtgggcc tggcggcgcg tatcccattt tcatctacaa cttggcgccg gaaacggagg
8341 aggctgccct gtggcagctg tttggtccat ttggagctgt gcaatcggta aagattgtca
8401 aggatcccac aacgaatcag tgcaagggct acggcttcgt ttcgatgacc aactacgatg
8461 aggcggccat ggccattcgg gcgctcaacg gctacaccat gggcaatcgg gtgctgcagg
8521 tcagcttcaa gaccaacaaa gccaagtaga gcggcccaaa tggaagtgga caaggacgca
8581 gcgggagcac cagcaaccac aacccattat cttgaatcta aacctgacca cacaacacac
8641 atatataaat atgcatggta taacggtaac taagcgcaac aaaacagtat gtgtacaaca
8701 cacaagaaaa aaacaagaaa gtagcaggca ggagaaaata ctctttccac taaacgacaa
8761 cgaaacggcg gaaatccccc taaaaacaca aatctcctcc ccaaaaccgc cggccaaaag
8821 cgttgcaata gaaaaattct tcttgtttag catttaagta aaaaacctta actaaacgaa
8881 gcgagcagaa gaagttataa gatcgggaga agcgaatagg tagggggttg agaaagaaca
8941 atgagatttt aagaaataat attgcaaaat gaaaacacac tgtggcagtg gcagctgtga
9001 aaggtcaaag gttggcacag tcgatctagt cacaaagcaa aacaaagaaa tcaaccaaac
9061 aaacacaaaa cacacaaaaa cgaaaacaaa tttaacttta ttagcaaaaa agaaaatgag
9121 aaattttaaa aatttattaa ctttaagtaa aacgcagcaa tgaaatcttc caaacaaaag
9181 aaagcaaaac agcaaatctg tttgttacac ttcaactttt tctaccataa aaactttgaa
9241 tcttgatact aaacctaaag cagaattaag aaaatagagg aattc
//
LOCUS HUMDBTEFH 2335 bp ds-DNA PRI 15-SEP-1989
DEFINITION Human beta-globin intergenic region spanning the deletion which
causes delta-beta-thalassemia.
ACCESSION M22800 X14550
KEYWORDS .
SOURCE Human patient s.h. leukocyte cell DNA, togi clone d.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 643)
AUTHORS Mishima,N. and Gilman,J.G.
JOURNAL Unpublished (1989)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Mishima,N. and Gilman,J.G.
TITLE DNA deletion in indian delta-beta-thalassemia with elevated fetal
hemoglobin extends from 1 kb 3' of a-GAMMA globin gene into
downstream KpnI (L1) repeated sequence
JOURNAL FASEB J. 2, 546-546 (1988)
STANDARD simple staff_review
REFERENCE 3 (bases 1 to 2335)
AUTHORS Mishima,N. and Gilman,J.G.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by J.G.Gilman 28-FEB-1989; [3], 11-APR-1990.
This sequence is across the DNA deletion in the beta-globin gene
cluster.
BASE COUNT 931 a 462 c 393 g 530 t 19 others
ORIGIN Chromosome 11.
1 aagcttgggt tttgagtttt cattatttac ccaaaattca ttcaggagca ggttgtttaa
61 tttccatgta attgcatagt tttcatctat tcattagtct tgaaatctat ttttattcct
121 ctgtggtctg agagtgattg atatgatttt ttttcatttt ctgaggattg ttttaggtcc
181 tcccaaggct gaaccaggaa gaaatcaatt tcctgaagag atccatgatg agctctgaaa
241 atgattcagt aataaatagc ctatcaaaca aaaaaagccc tggacaatat gattcacaac
301 caaattctat cagatgcaca aagaagagtt ggaaaaattc ctactgaagc tactccaaaa
361 aattgagggg gagggattca acctcagctc attttatgag gccagcatca tcctgatacc
421 aaaacctgat agacacacaa caacaaaaaa agaaaacttc aggccaatat ccttgaacat
481 tgatgcaaaa atactagcaa gccaaatcca gcaacacatc acaaagctaa tccacctcaa
541 tcaagtaggc tttatccctg agaagcaaga ttggttcagc atatacaaat tgataaatgt
601 gattcatcat atatacagca ctaaaaacag aaaccacatg atcatttcaa tatgtgcaaa
661 aaaagatttt gataaaattc aacttcactt catattaaaa cctctcaaca aactagctat
721 ttaaggacat gacctcaaaa taatatgagc catctatgac aaacccacag ccaacatcat
781 acagaatggg caaaacctgg aagcattctt cttgaaaact ggcacaagac aaggatgccc
841 tctctcacca ctgctattca gtatagtact ggcagtcctg gccagagcaa tcaggcaaga
901 gaagggaata aagggcatcc aaataggaag agagaaagtc aaagtatccc tgttttctgt
961 atctagaaaa caccatagtc aaggcccaaa agctccttca gctgataact tcagcaaagt
1021 ttcaggatat naaatcaatg tacaaaaatc actagcattc ccatacatca acaatagcca
1081 agcctagagc caaatcagga agtcaatccc atttaccgtt gccacaaaaa gaataaaata
1141 cctaaggata tatctaacca aggagttgaa agatctctac aatgagattt acaaaacact
1201 gctcaaagaa atcagagctg atagaaacaa atggaaaaac tttccatgct catggataaa
1261 aagaatcaat attgttaaaa tgtccatact gcctagagca atttacagtt tcaaagctat
1321 tcctatcaaa ctaccaatga cattcttcac agaactagaa aaatacactt taaagttcat
1381 atggaacaac aacaaaaaaa gtccaaatag cctaggaaat cctaatactc acaggtggga
1441 attgaaaaat gagaacactt ggacacaggg tggggaacat cacacaccgg ggcctgtcgt
1501 ggggtggtag gagagnngag ggataanatt aggagatata tctaatgtaa atgacgagtt
1561 aacgggtgca gcacaccaat atggcacacg tatcactatg taacaaactg cacaggtacc
1621 gtagaactta gagtacaaaa aaaaattgag aaaaagaaaa aagctggaga taacacattg
1681 cctaacttca aaccatacca cagggctaca gtaaccaaaa cagtttggta ctgttacaga
1741 aaagacacac agacaaatgg aacagaatag agagcccaga aataagacca catgcctaca
1801 accatctgat ctttgacaaa gctgataana acaagcaatg aggaaaagac accctatnca
1861 acaagtcgtg ttgggataac tgactaancg tatntagaag attgaaacca gaccccgtnc
1921 tatatcatat ncncaaataa ncncaagatg gattagagac ttaaatgcaa agcccaaact
1981 atnanaatgc ttgaaagcaa cctagacaat accattccgg acatgagcag gggcaaagat
2041 ttcatgacaa agatgccaag agtcattgca gcaaaagcaa aaattgacaa ataggatcta
2101 attaaaataa agtttctgca cagcaaaaga aactatcaac tgagtaaaca acctatagaa
2161 tnggagaaaa tnttnncaaa ctatgcatcc aacaaaggcc taatattcag aatctataag
2221 gaacttaaat gaatctacaa gaacaaacaa acaatgccac aaaaaatggg taaaggacat
2281 gaacagacag tattcaaaag aagacatcac atatggccac caagcatata aaaaa
//GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS DROELAVK 2505 bp ss-mRNA INV 10-OCT-1988
DEFINITION D.melanogaster elav protein, complete cds.
ACCESSION M21152
KEYWORDS RNA binding protein; elav protein; neural protein.
SOURCE D.melanogaster (Canton-S) Kauvar 3-12 hour embryo library cDNA to
mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2505)
AUTHORS Robinow,S., Campos,A.R., Yao,K.-M. and White,K.
TITLE The elav gene of Drosophila, required in neurons, has three RNP
consensus motifs
JOURNAL Science 242, 1570-1572 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence kindly submitted
by K.White, 26-OCT-1988.
FEATURES from to/span description
pept 492 1943 elav protein
BASE COUNT 772 a 669 c 629 g 435 t
ORIGIN Chromosome X; 1B5-9 on the cytogenetic map.
1 tttccacgcc aaccatcgtg cgccgcggtc tgaacgctcc tgccacagaa aaagaaaata
61 aaagcaagga aaattctaat cgaataccaa atatcgtgct tgtgtgctct ttccgcaatt
121 gatttttttt aagtagtgca tgacaataac cgttgagttg actccaaccg aagtaaccat
181 aactggaagc tgacaacagc cattggccgg aatctcatct cacaccgagc gaaatacggc
241 gacgccagcg tttatttatt aaaaacgatt acattacaac aaacaccaaa gtcctggaca
301 accgaagaga gtgtgggaga aaatcgaagc agagagcaag gaggagcgcc gtgagagcgc
361 gcgcgcaagg atttgagatt gcaaagagaa cttagcaatt agttcgcttt gtttgtccag
421 ccaaacaaca acaacaacca accagctccc caaaaccaac tacacacata ctcgctccct
481 aaactaaaac aatggacttt attatggcaa ataccggagc tggcggcgga gtagacacac
541 aggcacagct aatgcagagt gccgctgcag ccgcagcagt ggcggcaaca aacgcggccg
601 ccgctcccgt acagaatgca gccgccgtgg cggccgccgc ccagctgcag cagcaacagg
661 tgcaacaggc aatcctgcag gtgcagcagc agcagacaca gcaagcggtg gccgcggccg
721 ctgccgcagt gacccagcag ctccaacagc aacagcaggc tgtcgtggcc caacaggctg
781 tagtgcagca gcaacaacag caggcggcgg cagtggtgca acaggcggcg gtccaacagg
841 ctgtggtgcc ccagccgcag caggcgcagc ccaatacgaa tggcaatgca ggatcgggat
901 cgcaaaatgg cagcaacggc agcacggaga cgcgcacaaa ccttattgtc aactacttgc
961 cgcaaacaat gaccgaagac gagatccgtt cgctcttctc cagcgtcggc gagattgagt
1021 cggtgaagct gatacgcgac aagtcgcagg tctacatcga tcctctcaat ccgcaggcgc
1081 ccagcaaggg ccaaagtctg ggctacggct ttgttaacta tgtccggccg caagatgccg
1141 agcaggctgt taatgttcta aacggcctgc gactgcagaa caaaaccata aaggtgtcgt
1201 ttgcccgccc gtcgtccgat gccattaaag gcgccaacct ttatgtgtcg gggctgccaa
1261 agacgatgac ccagcaggaa ctggaggcca tcttcgcacc attcggagca ataatcacat
1321 cgcgcattct gcagaacgct ggcaacgata cgcagacgaa aggagtaggc tttattcgat
1381 tcgataagcg ggaggaggcc acaagggcta tcattgcctt gaacggtact acaccatcca
1441 gctgcacaga ccccatagtg gtaaaattct ccaatacgcc cggcagcacc agtaagatca
1501 tccagccgca gctgcccgct ttcctcaatc cccaactggt gcgacgcatc gggggcgcca
1561 tgcacacgcc agttaacaag ggtctggccc ggttttcacc aatggcgggc gacatgctgg
1621 acgtaatgct gcccaatgga ctaggagcgg cggcggcggc ggccacaacg ctggccagtg
1681 ggcctggcgg cgcgtatccc attttcatct acaacttggc gccggaaacg gaggaggctg
1741 ccctgtggca gctgtttggt ccatttggag ctgtgcaatc ggtaaagatt gtcaaggatc
1801 ccacaacgaa tcagtgcaag ggctacggct tcgtttcgat gaccaactac gatgaggcgg
1861 ccatggccat tcgggcgctc aacggctaca ccatgggcaa tcgggtgctg caggtcagct
1921 tcaagaccaa caaagccaag tagagcggcc caaatggaag tggacaagga cgcagcggga
1981 gcaccagcaa ccacaaccca ttatcttgaa tctaaacctg accacacaac acacatatat
2041 aaatatgcat ggtataacgg taactaagcg caacaaaaca gtatgtgtac aacacacaag
2101 aaaaaaacaa gaaagtagca ggcaggagaa aatactcttt ccactaaacg acaacgaaac
2161 ggcggaaatc cccctaaaaa cacaaatctc ctccccaaaa ccgccggcca aaagcgttgc
2221 aatagaaaaa ttcttcttgt ttagcattta agtaaaaaac cttaactaaa cgaagcgagc
2281 agaagaagtt ataagatcgg gagaagcgaa taggtagggg gttgagaaag aacaatgaga
2341 ttttaagaaa taatattgca aaatgaaaac acactgtggc agtggcagct gtgaaaggtc
2401 aaaggttggc acagtcgatc tagtcacaaa gcaaaacaaa gaaatcaacc aaacaaacac
2461 aaaacacaca aaaacgaaaa caaatttaac tttattagca aaaaa
//
LOCUS DROELAVM 9285 bp ds-DNA INV 26-OCT-1988
DEFINITION D.melanogaster elav protein, complete cds.
ACCESSION M21153
KEYWORDS RNA binding protein; elav protein; neural protein.
SOURCE D.melanogaster (Canton-S) Maniatis library DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 9285)
AUTHORS Robinow,S., Campos,A.R., Yao,K.-M. and White,K.
TITLE The elav gene of Drosophila, required in neurons, has three RNP
consensus motifs
JOURNAL Science 242, 1570-1572 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence kindly submitted
by K.White, 26-OCT-1988.
BASE COUNT 2786 a 2147 c 2058 g 2294 t
ORIGIN Chromosome X; 1B5-9 on the cytogenetic map.
1 ggatccagtc gagggatcac tgaactaaaa agtaaagaga ggagaaacag gagaagccct
61 taaagtgcga gttcgctgga gtgcctatcg atccttgagg gcagcgctcg atccttatta
121 ctattggcag gatcatttta taccccttac ataaaaatat cagtatcagt ttgttccctt
181 atcatgcgtg ctctttgaca tatccgcggg aatttatttc tatttttgct tagtcaaact
241 gaagaccatt ttgtcccaac ataaaatatg taaaatattt ttttttatca ttttaaattc
301 tcagggatta agtcaatgtt tgaaaacttc tagctgactt gcgctccact accactcggc
361 atgcgtgccc ctgattagta ggccaggtgg tgggcctttg gcacgatcgc aaccctaagg
421 actgcaaccg aatagcattg agaaggagac caggatttct gcactcaaag gtgaaattgt
481 ctgagcacgg aacacggaaa aacaacaaca aaaacatggc agacggaaaa caatgccgcc
541 agcggcaagg agggagggtg gaaacacatg ccacgattca gtgtcgaatg tcctgtgatc
601 ccacattcat atattttgtt attgtgtgcc tgcatccctc gcctgctttt aattcctgca
661 tcgatccttc tttgctcctg gtctacgaac ccgatgtctt atggtaattt tatgattatt
721 acaattagat tcgggggaaa ctgcatctgc cgccgtctgc agatccgatt catttggatt
781 ccgtctgctc aatccagatc catgatggga tcatggccat ttgggcattg gtacatggaa
841 agttaaaagc gccctaggcg ttctaatcca actcgtctca agaaaaacgg gtgaatgcat
901 ggaacgctgg atctggttct ggaagatccg gctaagtcaa caagaaatat gtaacctcgc
961 cattatagct ttactttcga agcagaaaat gcaatcgaga atatcgaatg cacgctccag
1021 gttcctatgg ctcctagctt tcggtgcagc acaacaaata tttgggttgg ttcgtttttc
1081 cgccacgatt ttaaaacgct aagggtggct ttttcggagt ccctctcatt tgagccggga
1141 aaaagaccag accctccctt tccctcccca gccactttcc agctttgttg taacctcgtg
1201 ttgtgcctgc tttttaatcc cgctttgttg taacgcaggc catttgctgt ttttgttttg
1261 tttctggcgt ggctaaaacg gattatgtga cccaaccctt tcgatttcga gtctcattgt
1321 attacatcct tttcccaaaa atgactaaaa aagcgtattg cctatttctg gggtggacag
1381 aatgaaacgg ggtcagcacg tggccgcaaa cttgaaatac cattatggat ccaataggac
1441 tgtgaagagt tattagtgaa acatttcgaa caaactattc ctttaattga aatttgccaa
1501 ttccccactt aagcatcgca tttgcatatt tgtatgctaa ccaaactctc tgcgacgtcg
1561 atctgtctga atctgaatcc gcatatcttt cgagggcagc aatagcgcca ccccacccca
1621 tttccgttgc cactagcagc gcaccaccct ttgaaaggac atcaaaaacc accccctgcc
1681 caaagttgcg ttacaattcc cgaagtgcag tgcagcgtac ataaaacata caatgaaatc
1741 cgataaaatt ttctatgtcc tttatatcct cataacatgc ccaactataa accgagccaa
1801 attcttctcg tgaaaaagtt gcacacaacg cgatttcact gtaccttaca ttgttctgta
1861 aactggaaca gagcgtccga ctggaagagg cgaaggaaaa aatcagcaca gggaacagga
1921 agttcacaag gattgggggt gtccgagggt ggctacatat atatggcgat atatggcata
1981 cagtatgtgc atacatatac atacatatac acatatatgt atgtatgtac catactatgt
2041 ggcgtgtcta gtggcgcgaa taacgaatac aggagcaatg aagcacacga acgaagcgaa
2101 acgaaacgaa gggaaatatt gaaaaagcca gaattgcatt acttcaaaac tataccggct
2161 cttggatatg ggtcctgggt ctcggttcct tgcttctggc tttcgcttcg tagggggtgt
2221 acgagcaata ctgccaggag aaaccaccct agcctttggc gttggtggtt gatagtgcaa
2281 aaataataat aacggaaacg gaaccataaa cccaaaacag cccttacaac gcgactgatt
2341 tgtaaggata atggtcgcat gcatcataaa acggttgacg acctcgtcgg aaagagccaa
2401 cattttgcaa tgatttccaa ttgctcctgc tatctaggcc caatgggccg ctaagtacat
2461 atgaagcata catagctaaa tacatatgaa gcatacatac atcgggttaa ctataaaacc
2521 aatgtgttct agatgagttt gttaatttta caaagtaagt tagggctgct ccagtggttt
2581 taagttgctt aactggacaa gagaacttgg ctctctcccc catctatctc ttgtcttcgg
2641 gaaaggtaac ctccattttg atcggttgaa tttgtatcaa aaagcatgaa aaactttgaa
2701 tacagcataa accactgaga gagcagccga aagccagggg tgtatcaaaa ggggttctta
2761 gaaatattcc actttgcaac gattatagac tcgtttctta tcagacaaaa ctttctccac
2821 ttctacgtgg tatttacaga aacaaaactt aaggtaaaat gttaggtggc tatatattaa
2881 agctattaaa atatattaaa ggtttccgga ataaccccct atctcggcgc gcagatacgc
2941 ccatgtcgcc actctttcgc tctctacctt ccactctcgc tccgaacacc attgcaaccc
3001 cgagagagct gctcgagagg caactatgag atatgagaat gagaggttgc tctatgtgaa
3061 aatggaaacc gaaaccgtca atgcctcaat ttcgcatttc attatttcat ttccacgcca
3121 accatcgtgc gccgcggtct gaacgctcct gccacagaaa aagaaaataa aagcaaggaa
3181 aattctaatc gaataccaaa tatcgtgctt gtgtgctctt tccgcaattg atttttttta
3241 agtagtgcat gacaataacc gttgagttga ctccaaccga agtaaccata actggaagct
3301 gacaacagcc attggccgga atctcatctc acaccgagcg aaatgtaaga gtatccccag
3361 gatatccaca atttgtgcaa ctgaatgaaa ggcaatgata acaatatcaa tcaacaactt
3421 tgaaacagta cttttttttt gctaatgatc gcgtcgcgct gcataattgt gcaatttact
3481 cacttataat ccgcgtgtct gcatatatcc atatgtacct atatctattt ttttgctagt
3541 gggtgtgtga tgtgtgtaag ttgtatacat tcaaacgaat attaataggc aaaacaacaa
3601 tatgcaattc gatggaggaa aaagcgaacc tcccaaagag aatggcggag agatgggcgg
3661 cggttagtca gagtgagagg gggcggccag tgcggagata gtagatgtgt tcagatctac
3721 gcctctgtga gctgtgtatc tgtaaatact tgctgctcaa atttaaatct gtgtgtgtat
3781 gtgtgtgtgg gcttgcgtgt gcgggagaga agagcgctga gtgttagagc gagagccagt
3841 tccgaagagg gcgcagctag catatcgatt ttatgatttc atgtgcaagg agcattgtac
3901 gcaacaacaa acggaggaac acgaccaact aaccgttaga agtagccatt ttttgagcgc
3961 agcggagagg gtgggcgagt gtgtgtgggt ggcggagtac ttgcagtcaa caaaacgccg
4021 gttgccataa aagcaaatgc acgcaggcat gaatcagacc catgtcgtag tagcaatatt
4081 aaacatagag gattcgcgat tcacagcaaa ggagtttttt ctcaaaaaac gacaacaatt
4141 agttgttttt tttcccagtg gtcgtctgtg tttatgcttg tgtggaagag agctcgaaat
4201 aggtagatat atacacgcgt atttatttat tatgtgtaca tatgtgcttg taaaaacaat
4261 atcagccata caaagtttga gtcccctccg cccacctcct aacaacaaca tcgatatcta
4321 aatcggtgca ggtaaagagc ggttgctaca ggcccaaata acctctcttc agtagaattt
4381 taccaacatc tcaaagtatt ctcgtgcgcc acatgtatgc gtatctgcac gcagatatat
4441 tttttgtatc gcctgtgaaa tatgaattca atgctttgga tggggtaaaa tcgataatat
4501 cgattggcga taacaatcgc acgcacacac acaggcccgc aaacacgaac aactaaaact
4561 acaacaacaa ccaccaaacc catcgaattt accgcttata gttttgtttt atttttgttt
4621 ttgtccacat cagacggcga cgccagcgtt tatttattaa aaacgattac attacaacaa
4681 acaccaaagt cctggacaac cgaagagagt gtgggagaaa atcgaagcag agagcaagga
4741 ggagcgccgt gagagcgcgc gcgcaaggat ttgagattgc aaagagaact tagcaattag
4801 ttcgctttgt ttgtccagcc aaacaacaac aacaaccaac cagctcccca aaaccaacta
4861 cacacatact cgctccctaa actaaaacaa gtgagtattg aaagacgacg ggccgagctg
4921 acgagatcgc aggacgaaag agagaaaact ttcgaagggt gcggtgtcgt gtcaaggcaa
4981 cacaatcaaa ccactaatgc ggaatattaa aaaaaaaaat taatttaaat cataagaggt
5041 tagctatatt tgatatatat atgtgtgtat ccatgtacat atgtctatgt tagacaagta
5101 gctctttctt gccgtattgt agttcgggtt atcgttaccc gcctctaatt ggcttgccat
5161 gacgaaagtg ggtggtacag gacgacgacg tgcgtgcgcg tgaccttttg actcttccat
5221 aagcgcctca ataaaaggtg cgagtcggta catacatatg tatgagaata gaggatatta
5281 aaaacgcgcc aaactctaag aaaagcatcc accaaagtaa aaacagtagc aactaaaaac
5341 ttttcagtct cagctaacca gtccaaacca gctatgtagt ttttagcacg actttgggag
5401 tgtcgaatgc gttaactaaa gctcaaatat atatcgagac ccgctgtagt ccctatatgc
5461 aaatcaaatt gctaccattt ccgcagcgat cgcttttcgc gacagccccc tctctctgtc
5521 gcacttagtt cagtgaacca acacattcgc tggtttcgtt ccgagcagca tcctctgtct
5581 ctcttgctca gcctcttcgc cgccccctct gcctaagcct ctgtttctgc tgtccgcacc
5641 ccttaccaag gtgtcatgta aatggcggag ccggcttagg gggtggcaat gccgtgcttt
5701 taacccttat cggtggcaaa tggcgctcgt acatttggag aaaatgccgt tctacgtttc
5761 gttttgcttc gtttcgccaa gcgaagtatt acaacttgtt tgactaaggg tgggtctacc
5821 gtctattgct catttatggt gttttgatgc tttctcaagg ggtgcgggct ggtttctcgc
5881 atgtcctgac cctctgggtc atccctctgt tcaccgcatt acataactgg gcggggtgca
5941 cagcgcgaca cactcttatg gagagagcgc tgtaatttgc ggaaaggaga cgagggggag
6001 aagggacgca agagcagcta cagcgcgcag cagtgccaga aagtcctgcg ctcgccccag
6061 aagcatcctg ttcctgtgac gcagcggatc tggtctcgcg ttcgctccac tacctccact
6121 tcctacacct acagtacaca tatcacatga acttaaaact agtgaacgga ttaattaaaa
6181 aggaatcgca aaacaaaagc tcgctaatcg cgcagacaca gcagaagctt aaattcagaa
6241 aagtttagtt cttcagtgcc gaaggtgtgt ttgcagagct ttgatttcta gtgcgcgggt
6301 gtgggaagag aggactaaat agagacagag agagcggccg gatgagaaag tggggcgctg
6361 ggcggcgcct gttagcaaag cttcgttgac caaccgttat ttcctgaaca aaggaataac
6421 atacattcat cacttaatca ccaactgttt ttaaatccac agaattataa tttagcgtat
6481 gattcaagat ttgggaattg tgcgcaccca ggataaccgc gtttgtgttt aatttacaaa
6541 catgccccta atgtctatta aataaaagtt cagcgtcgga aaattaaaga gtctggtaag
6601 gattacagta tatttatgct gtaatatcct ttaattgtcc cgggtcacat ttttccgtat
6661 ccccacgccc taaagcattt tcgtatccat taattaaaca agtaaaggtg taggcttata
6721 ttatgttata cagtccgctc tcacaagcat tataagaaat atattacttg tctgttcacc
6781 aaataggtaa aaaatataca tatcatatca aacgatttaa ataattgcgc cctgggcaga
6841 tgtaaataaa gaaatataaa ttaaaaaata tcgttagaat aaaccaacaa attttctatt
6901 caaaccctag ctgattgata atcaattagc ccgttataac tactaaaacc cataataata
6961 caaaaaacta ctaacacatc atgcaaatgt tttaattttg acatttaact attctttcca
7021 tgcgcctacc cctgcgttgg tcgtgacaac caacaataaa cttctcgact ctgcccgatc
7081 ctctcaccta tcactcagtg gactttatta tggcaaatac cggagctggc ggcggagtag
7141 acacacaggc acagctaatg cagagtgccg ctgcagccgc agcagtggcg gcaacaaacg
7201 cggccgccgc tcccgtacag aatgcagccg ccgtggcggc cgccgcccag ctgcagcagc
7261 aacaggtgca acaggcaatc ctgcaggtgc agcagcagca gacacagcaa gcggtggccg
7321 cggccgctgc cgcagtgacc cagcagctcc aacagcaaca gcaggctgtc gtggcccaac
7381 aggctgtagt gcagcagcaa caacagcagg cggcggcagt ggtgcaacag gcggcggtcc
7441 aacaggctgt ggtgccccag ccgcagcagg cgcagcccaa tacgaatggc aatgcaggat
7501 cgggatcgca aaatggcagc aacggcagca cggagacgcg cacaaacctt attgtcaact
7561 acttgccgca aacaatgacc gaagacgaga tccgttcgct cttctccagc gtcggcgaga
7621 ttgagtcggt gaagctgata cgcgacaagt cgcaggtcta catcgatcct ctcaatccgc
7681 aggcgcccag caagggccaa agtctgggct acggctttgt taactatgtc cggccgcaag
7741 atgccgagca ggctgttaat gttctaaacg gcctgcgact gcagaacaaa accataaagg
7801 tgtcgtttgc ccgcccgtcg tccgatgcca ttaaaggcgc caacctttat gtgtcggggc
7861 tgccaaagac gatgacccag caggaactgg aggccatctt cgcaccattc ggagcaataa
7921 tcacatcgcg cattctgcag aacgctggca acgatacgca gacgaaagga gtaggcttta
7981 ttcgattcga taagcgggag gaggccacaa gggctatcat tgccttgaac ggtactacac
8041 catccagctg cacagacccc atagtggtaa aattctccaa tacgcccggc agcaccagta
8101 agatcatcca gccgcagctg cccgctttcc tcaatcccca actggtgcga cgcatcgggg
8161 gcgccatgca cacgccagtt aacaagggtc tggcccggtt ttcaccaatg gcgggcgaca
8221 tgctggacgt aatgctgccc aatggactag gagcggcggc ggcggcggcc acaacgctgg
8281 ccagtgggcc tggcggcgcg tatcccattt tcatctacaa cttggcgccg gaaacggagg
8341 aggctgccct gtggcagctg tttggtccat ttggagctgt gcaatcggta aagattgtca
8401 aggatcccac aacgaatcag tgcaagggct acggcttcgt ttcgatgacc aactacgatg
8461 aggcggccat ggccattcgg gcgctcaacg gctacaccat gggcaatcgg gtgctgcagg
8521 tcagcttcaa gaccaacaaa gccaagtaga gcggcccaaa tggaagtgga caaggacgca
8581 gcgggagcac cagcaaccac aacccattat cttgaatcta aacctgacca cacaacacac
8641 atatataaat atgcatggta taacggtaac taagcgcaac aaaacagtat gtgtacaaca
8701 cacaagaaaa aaacaagaaa gtagcaggca ggagaaaata ctctttccac taaacgacaa
8761 cgaaacggcg gaaatccccc taaaaacaca aatctcctcc ccaaaaccgc cggccaaaag
8821 cgttgcaata gaaaaattct tcttgtttag catttaagta aaaaacctta actaaacgaa
8881 gcgagcagaa gaagttataa gatcgggaga agcgaatagg tagggggttg agaaagaaca
8941 atgagatttt aagaaataat attgcaaaat gaaaacacac tgtggcagtg gcagctgtga
9001 aaggtcaaag gttggcacag tcgatctagt cacaaagcaa aacaaagaaa tcaaccaaac
9061 aaacacaaaa cacacaaaaa cgaaaacaaa tttaacttta ttagcaaaaa agaaaatgag
9121 aaattttaaa aatttattaa ctttaagtaa aacgcagcaa tgaaatcttc caaacaaaag
9181 aaagcaaaac agcaaatctg tttgttacac ttcaactttt tctaccataa aaactttgaa
9241 tcttgatact aaacctaaag cagaattaag aaaatagagg aattc
//
LOCUS HUMDBTEFH 2335 bp ds-DNA PRI 15-SEP-1989
DEFINITION Human beta-globin intergenic region spanning the deletion which
causes delta-beta-thalassemia.
ACCESSION M22800 X14550
KEYWORDS .
SOURCE Human patient s.h. leukocyte cell DNA, togi clone d.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 643)
AUTHORS Mishima,N. and Gilman,J.G.
JOURNAL Unpublished (1989)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Mishima,N. and Gilman,J.G.
TITLE DNA deletion in indian delta-beta-thalassemia with elevated fetal
hemoglobin extends from 1 kb 3' of a-GAMMA globin gene into
downstream KpnI (L1) repeated sequence
JOURNAL FASEB J. 2, 546-546 (1988)
STANDARD simple staff_review
REFERENCE 3 (bases 1 to 2335)
AUTHORS Mishima,N. and Gilman,J.G.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by J.G.Gilman 28-FEB-1989; [3], 11-APR-1990.
This sequence is across the DNA deletion in the beta-globin gene
cluster.
BASE COUNT 931 a 462 c 393 g 530 t 19 others
ORIGIN Chromosome 11.
1 aagcttgggt tttgagtttt cattatttac ccaaaattca ttcaggagca ggttgtttaa
61 tttccatgta attgcatagt tttcatctat tcattagtct tgaaatctat ttttattcct
121 ctgtggtctg agagtgattg atatgatttt ttttcatttt ctgaggattg ttttaggtcc
181 tcccaaggct gaaccaggaa gaaatcaatt tcctgaagag atccatgatg agctctgaaa
241 atgattcagt aataaatagc ctatcaaaca aaaaaagccc tggacaatat gattcacaac
301 caaattctat cagatgcaca aagaagagtt ggaaaaattc ctactgaagc tactccaaaa
361 aattgagggg gagggattca acctcagctc attttatgag gccagcatca tcctgatacc
421 aaaacctgat agacacacaa caacaaaaaa agaaaacttc aggccaatat ccttgaacat
481 tgatgcaaaa atactagcaa gccaaatcca gcaacacatc acaaagctaa tccacctcaa
541 tcaagtaggc tttatccctg agaagcaaga ttggttcagc atatacaaat tgataaatgt
601 gattcatcat atatacagca ctaaaaacag aaaccacatg atcatttcaa tatgtgcaaa
661 aaaagatttt gataaaattc aacttcactt catattaaaa cctctcaaca aactagctat
721 ttaaggacat gacctcaaaa taatatgagc catctatgac aaacccacag ccaacatcat
781 acagaatggg caaaacctgg aagcattctt cttgaaaact ggcacaagac aaggatgccc
841 tctctcacca ctgctattca gtatagtact ggcagtcctg gccagagcaa tcaggcaaga
901 gaagggaata aagggcatcc aaataggaag agagaaagtc aaagtatccc tgttttctgt
961 atctagaaaa caccatagtc aaggcccaaa agctccttca gctgataact tcagcaaagt
1021 ttcaggatat naaatcaatg tacaaaaatc actagcattc ccatacatca acaatagcca
1081 agcctagagc caaatcagga agtcaatccc atttaccgtt gccacaaaaa gaataaaata
1141 cctaaggata tatctaacca aggagttgaa agatctctac aatgagattt acaaaacact
1201 gctcaaagaa atcagagctg atagaaacaa atggaaaaac tttccatgct catggataaa
1261 aagaatcaat attgttaaaa tgtccatact gcctagagca atttacagtt tcaaagctat
1321 tcctatcaaa ctaccaatga cattcttcac agaactagaa aaatacactt taaagttcat
1381 atggaacaac aacaaaaaaa gtccaaatag cctaggaaat cctaatactc acaggtggga
1441 attgaaaaat gagaacactt ggacacaggg tggggaacat cacacaccgg ggcctgtcgt
1501 ggggtggtag gagagnngag ggataanatt aggagatata tctaatgtaa atgacgagtt
1561 aacgggtgca gcacaccaat atggcacacg tatcactatg taacaaactg cacaggtacc
1621 gtagaactta gagtacaaaa aaaaattgag aaaaagaaaa aagctggaga taacacattg
1681 cctaacttca aaccatacca cagggctaca gtaaccaaaa cagtttggta ctgttacaga
1741 aaagacacac agacaaatgg aacagaatag agagcccaga aataagacca catgcctaca
1801 accatctgat ctttgacaaa gctgataana acaagcaatg aggaaaagac accctatnca
1861 acaagtcgtg ttgggataac tgactaancg tatntagaag attgaaacca gaccccgtnc
1921 tatatcatat ncncaaataa ncncaagatg gattagagac ttaaatgcaa agcccaaact
1981 atnanaatgc ttgaaagcaa cctagacaat accattccgg acatgagcag gggcaaagat
2041 ttcatgacaa agatgccaag agtcattgca gcaaaagcaa aaattgacaa ataggatcta
2101 attaaaataa agtttctgca cagcaaaaga aactatcaac tgagtaaaca acctatagaa
2161 tnggagaaaa tnttnncaaa ctatgcatcc aacaaaggcc taatattcag aatctataag
2221 gaacttaaat gaatctacaa gaacaaacaa acaatgccac aaaaaatggg taaaggacat
2281 gaacagacag tattcaaaag aagacatcac atatggccac caagcatata aaaaa
//GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS CULESTB1 3105 bp ds-DNA INV 27-FEB-1990
DEFINITION C.pipiens esterase B1 gene, complete cds.
ACCESSION M32328
KEYWORDS esterase.
SOURCE Mosquito (C.pipiens quinquefasciatus, strain Tem-R) 4th instar
larva DNA.
ORGANISM Culex pipiens
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae.
REFERENCE 1 (bases 1 to 3105)
AUTHORS Mouches,C., Pauplin,Y., Agarwal,M., Lemieux,L., Herzog,M.,
Abadon,M., Beyssat-Arnaouty,V., Hyrien,O.,
Robert de Saint Vincent,B., Georghiou,G.P. and Pasteur,N.
TITLE Characterization of amplification core and esterase B1 gene
responsible for insecticide resistnace in Culex
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
N.Pasteur, 23-FEB-1990.
FEATURES from to/span description
pept 406 546 esterase B1, exon 1
1322 1459 esterase B1, exon 2
1512 2393 esterase B1, exon 3
2455 2916 esterase B1, exon 4
pre-msg 268 3016 esterase B1 mRNA and introns
IVS 547 1321 esterase B1 intron A
IVS 1460 1511 esterase B1 intron B
IVS 2394 2454 esterase B1 intron c
rpt 155 171 direct repeat copy 1
rpt 176 192 direct repeat copy 2
signal 246 250 pot. TATA box
signal 2998 3003 poly-A signal
BASE COUNT 904 a 671 c 700 g 830 t
ORIGIN Chromosome 2.
1 tacataccgc aataatcaaa atataacaaa aataatgttt tttttcctct caaatatgtt
61 gaacaatccc attttacccc gcatctgcat tcattttaag agagaattgt ctgtgtatag
121 gtagagtagt ctcgcaggta gggtacaaac acgagcgatg aacacatttg catttgcgat
181 gaacacattt gcttgatgcg tgttttaaaa cttagcagtg caaagcagaa taccaccaac
241 atcgcaattt tcagctccac aaatcatcag tacagagtgg gcagccgcac cgagctgttg
301 gtgcaagtca attcagctga gcaaaccgaa aaaaaaaaac ttcgaagagt cacacccagc
361 tgatagcgaa aatttaagca acaaaaaaac tccaatctac gtaggatgag tttggaaagc
421 ttaaccgttc agaccaaata cggcccggtc cggggcaaac ggaacgtatc gttgctggga
481 caggagtacg tcagctttca gggaattccg tacgcccggg caccggaagg ggagctgcgg
541 tttaaggtga gagtggtaaa ttgtttcaag tgctgttcaa attttatgga tgtgcaagtg
601 catttttgtt caaataaaga gcaacgagtg ctgctgatta gcgctgtatc taagagtgtg
661 acctccgcgg cttgatactt caccgcaatg acatgacata tttgttgata aaaataaaat
721 gtaataaata ttttgtactt ggacacgtac aaaccagtca tggtctaggc tggaaataaa
781 ttgaagagtg acaaagtcaa acataaattt ggcagtgact tccattttga taaaagataa
841 gaccataaac ttaatccaaa attgtttgtt ggacaattga taatgaagtc tgtgcaagta
901 gaagttattg aaacattaga ctgtttaaaa tttaatttaa tctatgattt aatctttatt
961 ttgttcaata atctaaattt ttgtaaaaca aaaaaatctt cagaaaacct atcaactgca
1021 tgctataaca ttttcaaatg acttcgaaat attaccgaaa taacattaaa tttgtcaatc
1081 aaatttagta gttttgacat tttgaaaatg ctaaaaaaat gatcaaactt gcaaaaccat
1141 tttttttttt cataaatcct cagcaatttt gcgaacaaat tattaataaa aagaaacgtt
1201 caaatatcac acctcactgt gtaaaatatt ttaaccagca ctgtatttcc cggtgacttg
1261 aacgcaaaac aagcaacgaa caaaactgga aatttaaaaa cataacctta aaacatttca
1321 ggcaccagtt ccaccgcaaa agtggaccga aacgttggac tgcacgcagc aatgcgagcc
1381 ctgctatcac ttcgaccggc gcctccagaa gatcgtcggc tgcgaggaca gtctgaagat
1441 caacgtgttt gcgaaggagg tgagttggtt tcaaagaatc aatttcaact ctgaattcac
1501 gatttctcca gatcaaccct tcaacccctc ttccggtgat gctgtacatc tacggcgggg
1561 gcttcacgga aggaaccagc ggaaccgaac tgtacgggcc ggatttcctg gttcagaagg
1621 atatcgtgtt ggtgtcgttc aattaccgta ttggggcgtt aggttttctg tgttgtcaat
1681 cggagcagga tggcgtaccc ggtaatgccg gactcaaaga tcagaacttg gccattcgtt
1741 gggttctgga gaacattgcc gcctttggag gagacccgaa gcgcgtgacc ctggccggcc
1801 atagcgcagg tgccgcttcg gttcagtatc atctgatttc ggatgcgtcc aaggacttgt
1861 ttcagcggcg tatcgtaatg tctgggagta cgtattccag ttggtctttg accaggcaac
1921 gcaactgggt tgagaagttg gcgaaggcca tcggttggga tggacagggt ggtgagtccg
1981 gagcgttgag attcttgaga cgtgccaaac cggaggacat tgttgctcac caggagaagc
2041 ttctgacgga ccaggacatg caggatgata tctttactcc gtttggacct accgttgaac
2101 cgtacctgac ggaacagtgc ataataccga aggcaccgtt cgagatggct cgaacagctt
2161 ggggtgacaa gattgatatc atgatcggtg gtacttctga agaaggactg ctactgctgc
2221 aaaagatcaa gttgcatccg gaactactgt cccatcctca tctattcctg ggaaatgttc
2281 ctccaaattt gaagatcagc atggaaaaac gaatcgagtt tgctgccaag ctgaaacaac
2341 gttactaccc cgacagcatt ccttcaatgg agaacaacct gggatacgtt catgtaagtc
2401 caaacctaac ctcaatcacc acatcaactt atcaaccctc cacaaaaatt ccagatgatg
2461 tccgaccggg tcttctggca cggcctgcac cgcaccatcc ttgcccgcgc cgctcgatcg
2521 cgcgcccgca ccttcgtgta ccggatctgt ctggattcgg agttttacaa ccactaccgc
2581 atcatgatga tcgacccgaa gctgcgcggc acggcccatg ccgacgagct gtcctatctg
2641 ttttccaact ttacccagca ggtccccggc aaggaaacgt tcgagtaccg cggtctgcaa
2701 acgctggtcg atgtgttcag cgcgttcgtc atcaacgggg atccaaactg tggcatgacg
2761 gcgaagggtg gtgtggtctt tgagccgaac gcgcagacga agcccacgtt caagtgtctg
2821 aacattgcca acgacggggt ggcgttcgtt gactatccgg atgcggaccg gttggacatg
2881 tgggacgcaa tgtacgtgaa tgatgagctg ttttgaggag aaagttttta aaatatccta
2941 ttgagatttt gcaatgcctt atttaatctg ttgttttatt ttaattatgt attgttgaat
3001 aaatttatac tatataacaa aagttttatt cttggagtca ttccgctaac ttttttgttt
3061 gtgttcaaca tattgaagat ctgacaaccc tatcaaaggt tatag
//
LOCUS DROFASI 15288 bp ds-DNA INV 26-FEB-1990
DEFINITION D.melanogaster fasciclin I (FasI) gene, complete cds.
ACCESSION M32311
SOURCE D.melanogaster DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 15288)
AUTHORS McAllistar,L., Zinn,K., Rehm,J. and Goodman,?.
TITLE no title given
JOURNAL Unpublished (1990)Howard Hughes Med. Inst. Berkeley, CA 94720
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
L.B. McAllister, 22-FEB-1990
FEATURES from to/span description
pept 1593 1697 fasciclin I, exon 2 (first expressed exon)
7090 7222 fasciclin I, exon 3
7304 7470 fasciclin I, exon 4
7545 7775 fasciclin I, exon 4
7843 8011 fasciclin I, exon 5
8089 8249 fasciclin I, exon 6
9814 9822 fasciclin I, exon 7
10989 11143 fasciclin I, exon 8
11210 11300 fasciclin I, exon 9
11364 11584 fasciclin I, exon 10
11644 11806 fasciclin I, exon 11
11871 12018 fasciclin I, exon 12
12428 12633 fasciclin I, exon 13
IVS 824 1449 FasI intron A
IVS 1698 7089 FasI intron B
IVS 7223 7303 FasI intron C
IVS 7471 7544 FasI intron D
IVS 7776 7842 FasI intron E
IVS 8012 8088 FasI intron F
IVS 8250 9813 FasI intron G
IVS 9823 10988 FasI intron H
IVS 11144 11209 FasI intron I
IVS 11301 11363 FasI intron J
IVS 11585 11643 FasI intron K
IVS 11807 11870 FasI intron L
IVS 12019 12427 FasI intron M
ORF 1 164 ORF (AA at 3)
BASE COUNT 4691 a 3381 c 2990 g 4225 t 1 others
ORIGIN Chromosome 3, map position 89e-f.
1 ctcgaggaga ccccgaaggt ccagctggag accccgatgc tggcgttgca gcctctgcag
61 ctcggcagtt gccaactgat ccagctcgct ttggtcagtc atcgttattg gctttctgaa
121 ttcggtcttt gcaatggtat tgtcaaaact gataatgata ttaatttgat ttcaatagca
181 aacaagcgtt gctggtggca agttgcttag taacggcgtg gtttatcgaa gacttttacc
241 atttacttaa agtaagaaga accaaatagg tacactgctt gttaaaattc aatgttatct
301 ccttagaaaa gtaaaaaata agtgcttgaa aattataacc ggttcaattt ggaataaaat
361 ctttcaaaca gtagagactc tgcttagcgg agttatccac cgcattctgt cgttgccaca
421 ttttgcgaaa cggttaaaat gattatttat gcttttaaac gaaaaatgag aaaatttaaa
481 gcagtcttct tgtaataata ataagattgg ataaacagac attattataa atctgacaca
541 gaagagcaac gcagcgctcg atttgtacta atattttacc tctaacaaga ttcggaaaag
601 cataaaaagc acataaataa gagtcaccgc caaagacatg cacaaataaa tataaatgtt
661 aagcgattta ttaacagtgc acagtgttgc gatttgaaac atcgacagct atcgataaca
721 ctcgttatcg tcaccacacg ctcggcttaa acctgttctc cgcttttcgt tttaattcga
781 aaaaaacgtt cgagcgtggc gcacgttgcg attcctccga ctggtacttg ctccccttat
841 attgaaactc tgtggaaaat ctatactgtg catagggcat aactctaaaa tagctgcacg
901 gcccatttcg ctgcccttgc tattgattta ccaccgcgct gcgttcgttt gtgtgtgcta
961 cttaaagaat ttgtgtgttt tccaagtttg gtccgtggac attgcataat ttctgcggcg
1021 gccgttaatg ttaattcctg cagcccgagt ttccgagaat tacgcagaat aaagaccaga
1081 gagaaaacta taaaatcgaa aacagaaaaa agtgccgcag cagcgaaatg caaaggcgca
1141 taataattaa acacacagcg acggaatgaa gaaaaaataa tacacaataa gcgcagcttt
1201 gtttctagtt aaattgcgtt tgtgttgtgc ttgccgattc ctcggtggtc cgtttttcgc
1261 ttgttgttct atgtgacata acggaactct gggcaaaagc gaacaggaag cagcgataac
1321 cttgcaaaaa caaagaaaat accaaggagg acaaaaagca tgccaagcat atatctgtga
1381 aataatattt tcttttccga ggaaatgctg tttgtcgtcg gctaactgtt gtttgccttt
1441 gaattgcaga tcttaatcgt agagcagcac tcacaccagc acacgccccc cgcaaaacag
1501 cacacacagc cacactcaca aacgacgtac actgccagcg cataaacaag tgctgttgtg
1561 gttgttgttg ctgctgctgc tcgtccgcta aaatgctgaa cgctgcagcg ctgcttttgg
1621 cgctgctctg cgccgcgaac gcagccgccg ccgccgattt ggcggacaaa ttgcgcgatg
1681 attcggaact ctctcaggtg agtcagaaga agaagaagaa gcgaataacg agaacgtgga
1741 ggatgaggag gtgaaaggag gtggtagagt aagagcacga agaaaagcag tagcagcaga
1801 aacagttggt gcaaataacc gaaacacttg tccgtagtca attaagcccc cagcaccacc
1861 caccaaacgt cgccccccct ccccccgctc acccttgggg tacagtaggg attgcctgcc
1921 atgaacatta gtaatccttt tctatctttg caaccgatta aattaatgcg aagagttaca
1981 cttcgaatat ttgtcaaaag tatgtttgtt taggtatccg tacgcgcaac atgtaaaatt
2041 tgaacacatt caggggcttt tattagtcta atatatcatt tcgttggaaa agtatttctt
2101 aactgcctac attttataca gtaatcagtc gcattttttc tgtagccagg gaaactgcca
2161 aaattaaaat aacttttgtg tgagcaaaaa acaaactatt tttagcagac atttatgcgt
2221 ttaaataatt ttttcagcct tttgaatatg tttcatatat ttctgtgatg ccttaatttc
2281 ttctcatggt cggatacctt attgcaatga taaaatcaat tttttattat gcctgacttg
2341 gctttggcga agtcaataaa tttaccttgc cacctaatag tggcaactta tcaaagttcg
2401 gctgtaattg tgccaccacc cacaagacgc cgccccaatt cgttggcctt gaaaggaaaa
2461 gccttggtgg agtgtgcgag gaggcggggc agatcgggcg ggaaaaagga gcctgcgcaa
2521 gagggacgca cccgcatttg ggcacaggta gcgtagcctt cgttccggca gcatcgtcct
2581 tgacctttag ccatctcact tgttgttgga ctgcggaaaa tcgatgatgc tgcctaaaat
2641 ccctggcggc agttcaggtc cagccgaccg ctaagctcca cttggccaca tttcgtatgc
2701 atggcggagt tcgttatgta aaaatgaaaa gccacataaa ttaagcccag gtctgctgaa
2761 ttggctctcc ccagcaaaag gcaaatggta gaaaaaacgc cggccgttga cgtccactgc
2821 gcataacagt tttatattta gttttgcctt tttacacctt cacaggccaa ctcacataca
2881 aagtacatag aaaacaaaaa cacattaatg tgtaaaaaac tcagaagcca acaaaggtgc
2941 gcggctgaaa taacaagaac aacatgtgta atgaattcaa aggcgggggt tcagtcacca
3001 gaaggggtat tagcattaca ccaggagaaa tatatatata tttttttaat atgttaaggt
3061 ttatggtagc ccattaagcc acataactta tactcaaaaa atagaattga tttgagtgga
3121 aaatagtaac atagttcgtc tattgtaatt gcttttatct tatatagtag gtgaagtagt
3181 agttgaatac gcgtttttaa gtaactgact tgagtccctg cttaattttc tagtctgcct
3241 taacactgca tttaacaccc gaaatacacg ctgcttcaaa agtaaattag acttccaagt
3301 aaaaagaaag cgtgtgacag actgtaattg ttgcagcaag tgtctcaggg aagtagttgc
3361 atccacacag aaaaggcaac cttttcgagt tacaataggc gaaacagagc gtacaactaa
3421 aaaaaattaa aaacaagaag ggggactcgg cttgctcact tgcacccttg ctttagggca
3481 ttgtaacttt gcccagaagt ttgccacttt cggagcaaag catttccggc ctcatatctg
3541 gagtactgct atacatgagg gtgggaccat ttggtcgcca taaaaaagta agggcggaaa
3601 gtaatcttct ggcatttatg caaattaagt gccgtaaaac taatggatca aaaagtcagg
3661 cttgggaaat cccttttaaa gttacttctt aagactatga aataatcatt ttattaataa
3721 tatgtcgtat tgcaggtagc tgctttcaat ccaatcccct ttttaaattt tgcgatttat
3781 atattttctc agttgtattt aaaatgtaga caagactggt tcaaaagggt atctccatct
3841 tctaaagtcg aagtcgggct tctttactct ggtgcctgca cttctcatta catgctgatg
3901 gcttagataa atgtaggttg cctgttttgg tttgcggctt acggcttacg gcttgtggtt
3961 ttgcatggca aaccaccgag tttaggagag agtttcgaaa ttagcagata attgagccac
4021 gccgatctga gttttatatt caacttcttg acagccgaaa tgaaagtaca ctccgaggaa
4081 cgagctgacc tcctgcgttt gacccccact tcctgtcgtg cgtcagccaa cgtcttgcgc
4141 gttttattaa ttttaatttt taacctttga ctgatgcaat tgcagttggc cgagtggggg
4201 tgtggacagc ttgggttgac aactgcgggg ctggctgaac tctgtctagg aacactcact
4261 cgtatggctc acggcacaga acagtccaca ctcgcagcat catcgaattc aattaaattt
4321 aatttaagat ttcaatgcga tgccaaggcg cgttatttga tatgcgacga gctccattac
4381 aggcgaggcg atgaatgagc cacgactgac cttaatggat ccagatgctc atagacggat
4441 ctctgtccga cggagcactg aaggtgcaca gttttacgat cctaggcgtg ggggcggaat
4501 ggcctcggag tgatgccagt gtagaacccc ccatttcgat ccctttccac tgctgaaccg
4561 ccaataaaag gagaatgctg ttgccccaat ctctcgattc cgaaccggaa tccgaagtcg
4621 ggggttgaga tcgctttcgc ttttgcttct gttttctgtt tctgctgccg ctgcaattat
4681 cgaaaagaaa atatactgaa caaccgacat atgcatacat aatttgtacc aattaaattt
4741 atttatacaa catgcactcg cccagaagga aacaacctaa ccctcctgct ccgatttcag
4801 tccctagtcc aaaatacgaa gcttggctcg ctctcttggc taattagttt taatggctcc
4861 attagatgat ggaaattgaa tttcctccaa acaggcttac cctctcatga ttgttgttgt
4921 ttttggtata ggaattagta caatttttgg ctacaaatcg aaagaaaaca gaacaaattt
4981 gaactgccag cggttctctc tctttctctt tctttctttc tctctctcta tctctttcag
5041 agcactgcca aatccttgag gcgccttatg ccagttaggg taagcagcaa ttccaaaaaa
5101 ccaaacagaa acacacgaac acaattagct attgtaaatt tggaacaaac tcaaaagaca
5161 ttgtaaatgt acaactcaca tgtttttggc ttaatttagt taaccaaccc gtactcatcg
5221 ggccattggc aaattattgt ttgctccgcc gtctgactct cacccctgat gttcgttgtt
5281 cgtctgtggc cccagggtca ttggcttggc actcgggaaa atcaaaatgc gcttttgaat
5341 aaacccaccc tcttgggagc acttttggcg gttgggcggc gagtggtgtt tgcatagtaa
5401 atgagacgcc ccagaccgaa agggaaactg tttatcgatg acagcataat actctaacaa
5461 ttgtatcaaa ttttcgggct tccaaaaaag gactccccgg ggaagagctg gccttatcgc
5521 gatgatgtgc caaaagagcc ggaaactcat taagatctgt tcccttccgg cccggccttt
5581 catcaaatgc aaaacattcg ataacggaaa tttatgaaag taattgcatc taattgtcaa
5641 aggcttccag ccagaaccca agggggttga atttagtaag gatgctgaca gatggccatt
5701 ggaaaatggc ggccaagaga aagtgtcgca aacaatatca ttgcctattc aataaaactt
5761 ctctgtactc tattggccat tatacctgaa cgtatcccat atcaacctgt gacactgtga
5821 catatgccac agccattccc caggacccca cttaacagca gggcactctc ttggccagat
5881 aagcgaaaac atttcgagag cgatatagac accagcaagc agttctgacc aagctggcca
5941 agaagacggt gtaagaacaa agcgcgctgg cgactcgaat gtccacgacc aggacaacaa
6001 caacgctaac ggccccgaca actgacggtc tcaaattacc aaaaagaaaa tgaatgcaga
6061 gcgaggataa atgtgtattt acactcagga aaaggacgaa aattcctcaa aaaaagagta
6121 tttcaatttt aaggtgttaa gttcgttcgt tgatgtttga aaggaattca aaggcatatc
6181 aatatcagta aaagaaatta caagtaagga tcccactggc aatttaaatt taagcatata
6241 tttccaatta tgaattgaat ggattgaatg gacgtgaagc acctatttct aggcctagcc
6301 actttttttt cggtgtacat gcgtgtgtat tggcgatgcc agagcataga tatgtacagc
6361 aggcagggga aatgggaatg gaaaagggaa aggacggact tttaaaccca aattgccatg
6421 actcatgccg ggcagcacac gaaaacacga gagtcacacg accgttttaa aggagggcac
6481 ggaaaaaggt gttgtcctcg gcaggacacg cacccacaca ggcacaccaa gacacagata
6541 tgtccgcagg acctcataaa gccgactggg aatgtgcagc ggggcgtggc ggcagtggtt
6601 ctccacgccg ggaagccgca gagtgtgcgg ttgcatgtgg cctcatatgg caggggcagg
6661 tggctggccc gcccgagcaa atgactctgg gcccggttta gggtcagtgg ctggcgtttt
6721 ggggtggctg gaaaatgcat tttcacatcc cgctctcaat tgaatgacac acaatccact
6781 tgggcggcgt aacccaccca gaggcttacg taattggttc aataaacctt ttttttctag
6841 tacgccgccc tggcatcacc accaggacat gcataaagtt gttcctcatt ttattagttt
6901 gttagacgtt gccatggtgc tggccggttg ccgttttgcc aaaaccgccc agcaaccgtc
6961 tgcgaaccac cccttaaggc acccactcca tcctagatcc accacccact actccatccc
7021 actcggttcg ataagatatt ggcaatagag cgacgcgaat gagtgtaacg aacccgttta
7081 tctttgcagt tctacagcct gctggagagc aatcaaattg ccaactcaac gctttcgctg
7141 cgcagctgca cgatctttgt gcccaccaat gaagccttcc agcgctacaa gagcaaaacc
7201 gcccatgtgc tctatcacat tagtgagttc agtttcaatc aggctgatta aaaataataa
7261 aaatcaaaat aacatgactc taccctctga ttactctttg cagccactga ggcgtacacc
7321 cagaaacgac tgccgaatac cgtgtcatcg gacatggccg gcaatccacc gctgtacatc
7381 acaaagaact cgaatggcga catctttgtg aacaatgccc ggatcatacc ctcgctcagt
7441 gtggagacaa acagcgatgg caagcggcag gtcagtggcc ccacatatgc atatgttgca
7501 tttcatagag tgattgaaca actctcttca ttacttaccg acagatcatg cacatcatcg
7561 acgaggtact ggagccgctc accgtcaagg ctggccattc ggataccccc aacaatccga
7621 atgctctcaa gttcctgaag aacgccgagg agttcaacgt ggacaacatc ggtgtgcgca
7681 cgtaccgcag ccaggtgacg atggccaaga aagagtcggt ctatgatgcc gccggacagc
7741 acacgttcct ggttcccgtc gatgaaggct tcaaggtgct tttctagttt aattgcatta
7801 tttgttacgc attgtaatct cactgttgct ttatgactcc agctctcggc tcgcagcagc
7861 ctcgtggacg gcaaggtcat cgatggccat gtgataccaa acactgtcat cttcactgcc
7921 gctgcccagc atgacgatcc caaggcttcc gccgcttttg aggacttact caaggtcacc
7981 gtcagtttct tcaagcagaa gaacggcaaa agtaagattg cactaaagac ataccatttt
8041 actaaaggtt tacgaatgat tgtatggttg atataatttt acttgcagtg tacgtcaagt
8101 caaacaccat tgtgggtgat gccaaacacc gcgtgggcgt ggttctggcc gagatcgtga
8161 aggcgaacat cccagtgagc aacggagtag tccatctgat ccaccgcccg ctgatgatca
8221 tcgatacgac ggtcacccaa ttcctgcagg tatgtagggt gttgaagact agcatccagt
8281 gcaattgtag acacacagac acgacacaga cagtccaaga aacaaacaca tagaaacaat
8341 gatagtgcct gtggcttgtg ataaggcttg gcattttgat tggctcggca cgcactgaaa
8401 gatagcgctg gcggcagacc gatggactgg gaacataaat cacttgcctg ctgccccagc
8461 gaatgctaca aacacatgag ctaaacacat tgaaacatac acactgataa gcggcttttc
8521 tttgccctaa gtgagcagct cctccaagtg gcgccaaagg gataatattt ccaaggttga
8581 agtttcgtcg gggctcaact gagggcaaag gccaaaacca aataatcttg cacccagaaa
8641 ccactgccta aacacataca gtataaataa ataagatcaa tgcagacact agttttcagt
8701 atcccctaga tatatctaac taatacaaat gaaaacccaa aagactgatt ttatttggaa
8761 ctgaattcgc tcaaaagtct gtattccgat tcggttctat gtaaaatcaa aagcaaaatt
8821 ttattttaaa ataattttta aaatataaat tttattttaa aaaaattttg ggtgtaatgt
8881 aagtattgcc aaaaagctga aaaacaaaaa aaaaaaaaac aaatcttaaa gcaatgtatt
8941 tcatgtaaat ttttctatga atcagatttt cgtatcaata tgcatttcac aagcagttgt
9001 ttgcattgca attgcaaaaa actgactcag tagggcaacg cgctcagaat tggccctcga
9061 aattcaatac gtgatttagt caccaattgg caaacactaa aaaaanttgc ttagcggctg
9121 cgctttgctt gatctgaccg aaatcgacca aaacttcatt cacaaaaact tccattgttc
9181 tatattgaaa tcgaccaaaa cttcattcac aaaaacttcc atttttctat attgaatgta
9241 ttggtacttt aatggcgtca atgaactaag caagtgtata aattattccc atcattgatt
9301 ggctaaagtt cttcgatccc tcgaccgccg tgcatatcca aagatcagtc atctgtacgc
9361 tcgcctacag ccggctagct ttgatcatca ccgtaaccgt agatgtatat gtatacacca
9421 agcataaact gataacaaac cacccacccg ctcgataaat agttaccctc ctcataggca
9481 tatacattag aaacagtgcg tttcgaaact gtgaggcaaa gtgtcgccga acagttttga
9541 gtccgcactg tagaacggct atatcattcg accgctcgaa tcgtaagcgc cttcatatcg
9601 tgctatcgtt ttcgaaattt tcaaactagt tttgttgtaa ttacgtgcga tactctcctt
9661 ttgctaattt actgtatcca tacctctatc ctttgtttcc gtcacatgcc gaataattgt
9721 acttaaaact acaaaacaca atccaaccca accacatatc gctcgaaatt gaatcggtgc
9781 acccaaatat tgaaacaaaa ccaaaacgtt cagtcgttca aggtaaacaa cagactcaaa
9841 atccgatttt aattttaagt tgttcggttt cttttgactt gcattttgtt ttgcatcccc
9901 tgctatttta gctatgcatg agtttattac aaaaaatgtg tagacctttt ttgtaaaatt
9961 agtccattgg ttgttaaagt cgcgatgcct tagggtctaa gaatatttct tctgattttt
10021 attcacaaat agtctttata ggatgtacag cactagttta aatttgcaaa tgaagcaagt
10081 gcacatgata ctttaaagtt aaagagtatt tatcgtatga tgagtttaca gaaattttac
10141 ctaattactt aagataaact tctatttttt gtcggagctt ttatatcctt gttttatttt
10201 tttaaatcat agtaatggaa attattatga ttcgctgtaa attgttgaaa aatgcacgta
10261 actcagtaac attgtacata atggagccaa accgtttgcc tgttctttga aaaatgtcta
10321 gaacaatgaa agattaaata taattccctt gtgcttcatt tcaaacgttc gtttgtggcg
10381 gaacaactcg ggcaatctgc gtggaaccca aatcccatat taaacaatat ttaaattcaa
10441 ttttattcga aacttgtctt caccaaatca cttcgaatga atattgaact atttctgcta
10501 acactccgtt ttaaaactga tttagtttat gaacgtaagt aaacatcaca ccagcagaca
10561 aagaacaaaa accacaacag tcatcagtgc aacgctttgc tatcaaattg ccatccaacc
10621 caaccaccgt gcaaacaatt caatgattcg gtagattttc cgtcattaaa tcgcataaat
10681 tcagctgtgt taagtgagtc attaagcgag cattcccatt gattcacaac ttaattggcc
10741 agccatccat tcccccaccc atatgtgtat aactggctta cattataatc ggatatattt
10801 ccaattgttg tatttttttc ctaattcttt gtactactcc caaaaaaaaa aaaacaattt
10861 ataagcttat gcatacccct cttccattca ccgtttattt atttatatgc attttgtttt
10921 aaggtacttc aaatagcatt atgatatgtt cattggtatt aattttgcaa tctatcttga
10981 aggagaagga gaatgctgag aacggagctc tgcgcaagtt ctacgaagtt ataatggaca
11041 atggtggagc agttctggac gacatcaata gcctgacaga agtgaccatt ttggctccca
11101 gcaatgaggc ttggaactcc tcgaacatca acaatgtttt gcggtaatgt tgctcaaatc
11161 gggaagataa tatttaatta tcataataat aaataaatac atcttacaga gatcggaata
11221 agatgaggca gatcctgaac atgcatatca tcaaggaccg cttaaatgtg gacaagatca
11281 ggcagaaaaa tgcaaatttg gtgagcttta catgagctta taatcaaatg gaaatttttt
11341 acaaacacca ttcttttcaa aagattgccc aggtgcccac tgtcaacaac aacactttcc
11401 tgtacttcaa cgttcgcggt gagggatcgg ataccgtgat aacagttgag ggaggcggcg
11461 tgaatgccac cgttatccag gctgatgtgg cccagactaa tggttatgtt cacatcatcg
11521 accatgtgct gggcgtgcct tacactacag ttcttggcaa acttgaatcc gatcccatga
11581 tgaggtatgt tcaattttaa agctttatta gtacgctgtt taaattttat cgtattcctt
11641 aagtgacacc tataagatgg gaaaattctc gcactttaat gaccagctga acaacacaca
11701 acgccgcttc acctactttg tgcccaggga caagggctgg cagaagaccg agctggatta
11761 cccatcggct cacaagaagc tttttatggc cgacttttcc tatcatgtag gtttttcacc
11821 ctaagcctat taggcttttc tattatactc aaaatgtttt gtaaccgcag tccaagtcca
11881 ttctggagcg tcatttggct atttcggata aggagtacac catgaaggat ctggttaagt
11941 tttcgcaaga atcgggcagc gtaatcctac ccacgttccg cgactctttg agtatccgcg
12001 tggaggagga agctggacgt aagtataaca cattggatac caagggctca gttgcttgtc
12061 ctttcaattc aagtcgattc gtttcaccta atttgattga ttctattggt cttcccatac
12121 cgtgctgcat tactaaccga acctgttttt ttacaactct taaactaaat ttttcgcttt
12181 ctgcttaact aaaactaatc gttgtccaat caaaagatct ccatgatgag tatgctagtc
12241 acgaatggac tggtgagtgc tgccgttttt caaacccaca tcgcaccgcc cacaacgaaa
12301 tccttgacct ggccatgtcc acccatagct cgctcattgg gcaatccact agcattaacc
12361 acccatcgat tgcataccaa ttttctattg tctagtggct attaacaacg attacttcca
12421 tttgcaggct atgtgatcat ttggaactac aagaagatca acgtataccg gcccgatgtt
12481 gagtgcacca acggaattat ccacgtcatc gactacccac tcctggagga aaaggatgtg
12541 gtcgtggccg gaggtagcta tttgccagaa tcaagcattt gcatcatctt ggccaacctc
12601 ataatgataa cagtagcaaa gttcttgaac taaatgcatc cgatatgtaa aaacaaatcc
12661 aatccaaagc aaatgcaaat caaacacaac aacaacagtc gtctacagaa caagaaccaa
12721 caacactcag tatcagacta acttaacatc cacatggatc taaataatca gcaccagttt
12781 gttgataccg atcgaaaacc acaagcaacc caaacagtat ctgtaatata tacgtcacaa
12841 ggaacgatca tcattccaac ccaacatgcc cacgtccacg ccatctacat acccacatac
12901 acacactcag aaaaaaacga tgcagaaaac tgcctgctaa atttgctttt gcatcgacag
12961 cttctcccaa aatagctggc aagagagact aagataaagt cgaagaaacc cctatgttta
13021 agtattaaac tcgattttcc ctcaaacggc cttatgtata atattgaata tgaaactttc
13081 tgtaaaacat tttaatttta tgttcgccct atactggtag cttacgttca tcagctcgct
13141 atgttttaag tttagaccca cactgacatt ggtatcgttc catgtaacaa ctgattgtgt
13201 ttatgatttt aagtgtacat tttttgtgaa ttgcttgttt tagttaatgg tctaatttat
13261 gatttatgaa atgagttatc ttaagtagaa tgcgaatctc attgtacatc tcgaaaacga
13321 agaagctaag cataagcaat aagcgcatca cacactctca tactcgagcg cccacacatt
13381 ccccctattc ctatccataa atacatacaa catgtaacga aacaaagttc attgtaagat
13441 ttgaaagtgt gaaatttgat aataaactga aagtttttca tttgaacctt actattaaga
13501 gcggcagcaa tcttgtctcg aaccaatcgc gtgacccctg taggcctaga actatatcag
13561 tgtcggacaa gaaatgtgca accaacaaac ccaattatat gatatgccat cctgcactta
13621 gttactagcc ccgctatgca tgtgaacccc cagaaaaacc cattcgccca cacacaccag
13681 ccacttggag cgagagtagc cactgcaact aggttagtcc aatgtataag ttccgaaact
13741 gccgacaact acgaaaacta tacagcatac ataaactccc tgtaatctaa atcactcacg
13801 gcacacaact acaactacaa ctacatgtca tcattgaatg gattttgata ccgattttaa
13861 cttgcatata aacaaaaaca agaactagac aacgtgaagg agattttaaa caaaattctc
13921 cctcggtcga gcagttgcat ttcaaacttt gtacgtagtt taaaactagt tttttagtcc
13981 gacgtagaac aacccaattg ctaactatat accaactttc tttctatttc tctctgtctc
14041 tccccctaat gctatgtact tataggttag acattgtaac tattgtaatc aacccagtgc
14101 gttaaacccg agtgttaagt cgaacagtaa cacagaattg tactatcccc aaatgaataa
14161 ctatcagcct accagtacac tgtctcaact ctcaccacca ccaccaccca cttaggaact
14221 cagtcgaact tgaactcgaa atcaaagatc cagttgtggc agtcgcttca cgtagttgct
14281 aattcccaat tcgaaccgat cctttccgaa agtcttatct ttagtatagg tggtttagtt
14341 tcatttggag ccgtgcagtg ccgtagcagc taagtaaaaa tgtatgaaat gaagatgaac
14401 acgagatcga aatcgtacgg aatgatcaga aatcagaaat aaataatgaa tacgctaatg
14461 aattgtacaa gtaagcttta aagaattgct ggaggagcgc ggatcggaga acttagagga
14521 gggagaaccg cattgcaatc gcattgcaat ttgtgtcgta gtcagtagtt acacgttaag
14581 cggcgtctta acgtgtaact agtgccttac taaagataaa cgcattacct taacctttat
14641 acaaatttac tcaaaacata cttgtacccc aagcatacgt tccgcttcga atgataccca
14701 gatatatata cggagttaca ccccaagaat acaagtataa ctacaaatga tattgcgcca
14761 cacgctattt acaccaaata caccaaacaa atcgagaaat gcatattttt catatattta
14821 attgtcagaa taatataacg tatatgtaat atgtagttta tttactgtaa aacgcaagaa
14881 cctaacaagt ggaatttgaa tcacatacaa ttgatgtata ttagctattg agtttctaag
14941 caagcgttag acactgaaat atatgttcaa atatatgatt caaatatata caatatgcga
15001 aaccaagcaa actatggaaa actggagtgc caaaagaata ttattccatt ttattttacg
15061 acaagcgctt tttacaaata aaccgaatcc atttaaatta ctcgtaaaga acagacagat
15121 tatatttagc attagttaaa ctaattatta catgtactag aaaaccgaat gtcaaccgag
15181 aatcttcagc aagcttgagc gaataataaa actttaaaac taactataaa taaatcgacc
15241 gtcctttttt gcacttaatc atgggttatg gttgtggaac ctgaattc
//
LOCUS DROIAB 2131 bp ds-DNA INV 14-FEB-1990
DEFINITION D.melanogaster abdominal-A (iab) 1.7 and 2.0 kb RNAs, complete
cds.
ACCESSION M32109
KEYWORDS .
SOURCE D.melanogaster (strain Canton S) embryo DNA, and cDNA to mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2131)
AUTHORS Cumberledge,S., Zaratzian,A. and Sakonju,S.
TITLE Characterization of two RNAs transcribed from the cis-regulatory
region of the abd-A domain within the Drosophila bithorax complex
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1900) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.Sakonju, 14-FEB-1990.
FEATURES from to/span description
RNA 141 1802 1.7 kb RNA
RNA 141 2101 2.0 kb RNA
BASE COUNT 592 a 468 c 478 g 593 t
ORIGIN Chromosome III, bithorax complex.
1 ttagttcgaa gcacgtgcgt gtgagttccg catgaacggt acaccaacac tttcgaacag
61 aaacccctca cagatactcc tcacacagat actctactgt actgggctac gatatctgtg
121 agatacgcac agataccggc aggccgccgg cgtcaaagtc gacgtgctgc cggcgtttga
181 gagaccgagc ggcgtgcaag aaatcgtcgg gcaacggctt caaatgaacg caaaacgttg
241 gaaagcaaac aaccggcaaa agccatcagc agctaaaaat aaaactgcaa ctatcaaagt
301 gaatctgtag agaaagagtg atagctcagt gtgcccagtg tttatttaat tacttattta
361 aatttgattt tcaaaataag ttgtgatttt ggttcgtttg tttgtttccc cattaaatcg
421 catcgcggaa ctggtttgca ttttgtatct tttatcgctg cagttcgttt ctgtgtggtt
481 tcatgtattc cccatctatt ccatctcttt ttgcagatac aagtgaatga aaccctaacg
541 tttaaacgcc accataaaca ccaaataaca aattttctgg caaacatggc atgtcacggt
601 cccatttaac gtggaaaatc ggcccagaaa ttcaacacga cgacaacttt ttcgactgtc
661 ccagaaagca aatatgtaaa gtgtatttgt gtgcgtgcca ttgacattga cttatggtca
721 tgaaggggtt agggggacgg gtctaagggt gggggttgat ggctaaaggg gggtctcgcc
781 gctacgacaa aaacttggat tctctcgttt cttttttgtc tttcgtcttc aacgcacatt
841 tatggtttgt tttgctcaca cacacactca aaaagcctac gcagcggcag agagggcaca
901 agcagcagag aaagagacga aacatgtctc ggatgatgat aaacgcctct cgtctatttt
961 ggcgggtcca tgaatgaaac aattgcctcg aatgactgac tgccagacag atagctagaa
1021 actgccctta cggcgggcaa attggggtat tttgggtttc ggtgactttg agggaccggg
1081 acctgagggc ccggaaagcc gacagcaata aacactgtgc tcagacaaaa gtctgatgag
1141 acagaccatc atgatgctca tcatcatcgg cgataataat gattttcatt tgcacattcc
1201 acccgtttcg ctgttgccgg cttccagcat tcagattcag tttgcattca gtggggcagg
1261 ttatttatgg cacttttcct atcgccactc actggggtaa tcgattttta ttggattaat
1321 attgctttcc gcatgatttg gccttctttt cgttatcact tagcaggttt cttgacgctg
1381 aggagtggac aatgactaaa actcagtcat ctggaaaagc tggaccaagg acatccatat
1441 acaagtgctg gctaaacgtc tgtggttatc gcaatcattt atggcggaat agaaggacac
1501 gtcttatgtg tcgattttcg ggccccatcc ctgttgaaaa atcagccgga cagcagcaac
1561 caccgaacac aagacacagg acgaccggaa gtgaggaaaa cgcttgtaaa atcggtgcac
1621 acgagtgcga gcgagacacg agcgaacggg ccgaaaggaa acgacaatta cgaggacacg
1681 ccggcttgct tttctgcttt tcctgtctgc tcctttcagt tgcggccgtg caatcgtcgg
1741 gtcaacaacg ccgcgtgtcc gatttcaatt tacaaattag aatatacatc tttacatctg
1801 gatttggttt catctttatc tgcatttgga tccgcgacga cgcataattt tcagcacttt
1861 tgaataatgg cgattctgtg cgacctcgta aacgtatact gaatgtatcc tgagtgtatc
1921 ctatccggta taccttcagt atacgtaaca cgagagagca ctagcaacgt gggatgtgaa
1981 acacttttta agacacctta atacacgtag cactttgtta tatggtctta cactttttaa
2041 atatctcgac ttataaatat ataatttaaa tataaatact ttaataatgc gatataagtt
2101 aaatacttcg atttactggc tggaagttca t
//
LOCUS FVBOPD 1693 bp ds-DNA BCT 05-MAR-1990
DEFINITION Flavobacterium sp. parathion hydrolase gene, complete cds.
ACCESSION M29593
KEYWORDS parathion hydrolase.
SOURCE Flavobacterium sp. (strain ATCC 27551) DNA, clone pPDL2.
ORGANISM Flavobacterium sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Acidiphilium cryptum; Neisseriaceae.
REFERENCE 1 (bases 1 to 1693)
AUTHORS Mulbry,W.W. and Karns,J.S.
TITLE Parathion hydrolase specified by the Flavobacterium opd gene:
Relationship between the gene and protein
JOURNAL J. Bacteriol. 171, 6740-6746 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 419 1516 parathion hydrolase
signal 312 317 -35 region
signal 334 339 -10 region
binding 408 411 ribosme binding site
BASE COUNT 372 a 497 c 477 g 347 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccgagc ggcttaccgg cgccctgctc gaccggttga cccaccatgt ccatatcctc
61 gagatgaacg ggggcagcta tcggcttgcc agcagccgga aacggcaaaa gcacagccac
121 cacgcacagc aagaggaggt gcccccccca tgacctgatc agaaaacccc tcatctgctg
181 tgctgaacgg ccttcgctac gctccagacc gttcagcaca gcagatgaaa gcaccgcctc
241 gacaagaggc tttttgttca atccaactgg tacactctta caccggaatc ttgcacaatt
301 ttaccccggc attgacatct gacgcgtcaa cagtaaaaga aacaaccggt tcagatctgc
361 agcctgactc ggcaccagtc gctgcaagca gagtcgtaag caatcgcaag ggggcagcat
421 gcaaacgaga agggttgtgc tcaagtctgc ggccgccgca ggaactctgc tcggcggcct
481 ggctgggtgc gcgagcgtgg ctggatcgat cggcacaggc gatcggatca ataccgtgcg
541 cggtcctatc acaatctctg aagcgggttt cacactgact cacgagcaca tctgcggcag
601 ctcggcagga ttcttgcgtg cttggccaga gttcttcggt agccgcaaag ctctagcgga
661 aaaggctgtg agaggattgc gccgcgccag agcggctggc gtgcgaacga ttgtcgatgt
721 gtcgactttc gatatcggtc gcgacgtcag tttattggcc gaggtttcgc gggctgccga
781 cgttcatatc gtggcggcga ccggcttgtg gttcgacccg ccactttcga tgcgattgag
841 gagtgtagag gaactcacac agttcttcct gcgtgagatt caatatggca tcgaagacac
901 cggaattagg gcgggcatta tcaaggtcgc gaccacaggc aaggcgaccc cctttcagga
961 gttagtgtta aaggcggccg cccgggccag cttggccacc ggtgttccgg taaccactca
1021 cacggcagca agtcagcgcg atggtgagca gcaggccgcc atttttgagt ccgaaggctt
1081 gagcccctca cgggtttgta ttggtcacag cgatgatact gacgatttga gctatctcac
1141 cgccctcgct gcgcgcggat acctcatcgg tctagaccac atcccgcaca gtgcgattgg
1201 tctagaagat aatgcgagtg catcagccct cctgggcatc cgttcgtggc aaacacgggc
1261 tctcttgatc aaggcgctca tcgaccaagg ctacatgaaa caaatcctcg tttcgaatga
1321 ctggctgttc gggttttcga gctatgtcac caacatcatg gacgtgatgg atcgcgtgaa
1381 ccccgacggg atggccttca ttccactgag agtgatccca ttcctacgag agaagggcgt
1441 cccacaggaa acgctggcag gcatcactgt gactaacccg gcgcggttct tgtcaccgac
1501 cttgcgggcg tcatgacgcc atctggatcc ttccagccag cggccactat tccccgtcaa
1561 gataccgaac gatgaagtcg cgcatcgatc gataggcatc ttcaatttga tcagggctgc
1621 cacctccaaa gccgtggcca cccctgtcga tagtcttgag gacgtagggc acaccgtgct
1681 tttcgaactg cag
//
LOCUS HUMCSAE 1372 bp ss-mRNA PRI 26-FEB-1990
DEFINITION Human epithelial glycoprotein (EGP) mRNA, complete cds.
ACCESSION M32306
KEYWORDS cell surface antigen; cell surface glycoprotein;
epithelial glycoprotein; transmembrane protein;
tumor-associated antigen.
SOURCE Human colon carcinoma cell line HT-29, cDNA to mRNA, (library
HT-29/cdm8).
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1372)
AUTHORS Simon,B., Podolsky,D.K., Moldenhauer,G., Isselbacher,K.J.,
Gattoni-Celli,S. and Brand,S.J.
TITLE Epithelial glycoprotein is a member of a family of epithelial cell
surface antigens homologous to nidogen, a matrix adhesion protein
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1900) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
B.Simon, 23-FEB-1990.
FEATURES from to/span description
pept 22 966 epithelial glycoprotein (EGP) precursor
sigp 22 84 epithelial glycoprotein signal peptide
matp 85 963 epithelial glycoprotein
mRNA < 1 1372 EGP mRNA
BASE COUNT 436 a 233 c 317 g 386 t
ORIGIN
1 gctctaaagg cgcgcgcagg catggcgccc ccgcaggtcc tcgcgttcgg gcttctgctt
61 gccgcggcga cggcgacttt tgccgcagct caggaagaat gtgtctgtga aaactacaag
121 ctggccgtaa actgctttgt gaataataat cgtcaatgcc agtgtacttc agttggtgca
181 caaaatactg tcatttgctc aaagctggct gccaaatgtt tggtgatgaa ggcagaaatg
241 aatggctcaa aacttgggag aagagcaaaa cctgaagggg ccctccagaa caatgatggg
301 ctttatgatc ctgactgcga tgagagcggg ctctttaagg ccaagcagtg caacggcacc
361 tccatgtgct ggtgtgtgaa cactgctggg gtcagaagaa cagacaagga cactgaaata
421 acctgctctg agcgagtgag aacctactgg atcatcattg aactaaaaca caaagcaaga
481 gaaaaacctt atgatagtaa aagtttgcgg actgcacttc agaaggagat cacaacgcgt
541 tatcaactgg atccaaaatt tatcacgagt attttgtatg agaataatgt tatcactatt
601 gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc tgatgtggct
661 tattattttg aaaaagatgt taaaggtgaa tccttgtttc attctaagaa aatggacctg
721 acagtaaatg gggaacaact ggatctggat cctggtcaaa ctttaattta ttatgttgat
781 gaaaaagcac ctgaattctc aatgcagggt ctaaaagctg gtgttattgc tgttattgtg
841 gttgtggtga tagcagttgt tgctggaatt gttgtgctgg ttatttccag aaagaagaga
901 atggcaaagt atgagaaggc tgagataaag gagatgggtg agatgcatag ggaactcaat
961 gcataactat ataatttgaa gattatagaa gaagggaaat agcaaatgga cacaaattac
1021 aaatgtgtgt gcgtgggacg aagacatctt tgaaggtcat gagtttgtta gtttaacatc
1081 atatatttgt aatagtaaag cctgtactca aaatataagc agcttgaaac tggctttacc
1141 aatcttgaaa tttgaccaca agtgtcttat atatgcagat ctaatgtaaa atccagaact
1201 tggactccat cgttaaaatt atttatgtgt aacattcaaa tgtgtgcatt aaatatgctt
1261 ccacagtaaa atctgaaaaa ctgatttgtg attgaaagct gcctttctat ttacttgagt
1321 cttgtacata catacttttt tatgagctat gaaataaaac cattttaaac tg
//
LOCUS HUMNOXF 2206 bp ss-mRNA PRI 15-FEB-1990
DEFINITION Human neutrophil oxidase factor (p67-phox) mRNA, complete cds.
ACCESSION M32011
KEYWORDS neutrophil oxidase factor.
SOURCE Human promyelocytic leukemia myeloid cell line HL60, cDNA to mRNA,
clone 10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2206)
AUTHORS Leto,T.L., Lomax,K.J., Volpp,B.D., Nunoi,H., Sechler,J.M.G.,
Nauseef,W.M., Clark,R.A., Gallin,J.I. and Malech,H.L.
TITLE Cloning of a 67K neutrophil oxidase factor and its similarity to a
noncatalytic region of P60c-src
JOURNAL Science (1900) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
H.L.Malech, 08-FEB-1990.
FEATURES from to/span description
pept 68 1648 neutrophil oxidase factor (p67-phox)
mRNA < 1 2206 p67-phox
signal 2030 2035 poly-A signal
signal 2181 2186 poly-A signal
BASE COUNT 624 a 499 c 573 g 510 t
ORIGIN
1 ctagtctttc agccttcagg ctgtttttgg cttgaagctc tcttggcctc ctagtttcta
61 cctaatcatg tccctggtgg aggccatcag cctctggaat gaaggggtgc tggcagcgga
121 caagaaggac tggaagggag ccctggatgc cttcagtgcc gtccaggacc cccactcccg
181 gatttgcttc aacattggct gcatgtacac tatcctgaag aacatgactg aagcagagaa
241 ggcctttacc agaagcatta accgagacaa gcacttggca gtggcttact tccaacgagg
301 gatgctctac taccagacag agaaatatga tttggctatc aaagacctta aagaagcctt
361 gattcagctt cgagggaacc agctgataga ctataagatc ctggggctcc agttcaagct
421 gtttgcctgt gaggtgttat ataacattgc tttcatgtat gccaagaagg aggaatggaa
481 aaaagctgaa gaacagttag cattggccac gagcatgaag tctgagccca gacattccaa
541 aatcgacaag gcgatggagt gtgtctggaa gcagaagcta tatgagccag tggtgatccc
601 tgtgggcaag ctgtttcgac caaatgagag acaagtggct cagctggcca agaaggatta
661 cctaggcaag gcgacggtcg tggcatctgt ggtggatcaa gacagtttct ctgggtttgc
721 ccctctgcaa ccacaggcag ctgagcctcc acccagaccg aaaaccccag agatcttcag
781 ggctctggaa ggggaggctc accgtgtgct atttgggttt gtgcctgaga caaaagaaga
841 gctccaggtc atgccaggga acattgtctt tgtcttgaag aagggcaatg ataactgggc
901 cacggtcatg ttcaacgggc agaaggggct tgttccctgc aactaccttg aaccagttga
961 gttgcggatc caccctcagc agcagcccca ggaggaaagc tctccgcagt ccgacatccc
1021 agctcctcct agttccaaag cccctggaaa accccagctg tcaccaggcc agaaacaaaa
1081 agaagagcct aaggaagtga agctcagtgt tcccatgccc tacacactca aggtgcacta
1141 caagtacacg gtagtcatga agactcagcc cgggctcccc tacagccagg tccgggacat
1201 ggtgtctaag aaactggagc tccggctgga acacactaag ctgagctatc ggcctcggga
1261 cagcaatgag ctggtgcccc tttcagaaga cagcatgaag gatgcctggg gccaggtgaa
1321 aaactactgc ctgactctgt ggtgtgagaa cacagtgggt gaccaaggct ttccagatga
1381 acccaaggaa agtgaaaaag ctgatgctaa taaccagaca acagaacctc agcttaagaa
1441 aggcagccaa gtggaggcac tcttcagtta tgaggctacc caaccagagg acctggagtt
1501 tcaggaaggg gatataatcc tggtgttatc aaaggtgaat gaagaatggc tggaagggga
1561 gtgcaaaggg aaggtgggca ttttccccaa agtttttgtt gaagactgcg caactacaga
1621 tttggaaagc actcggagag aagtctagga tgtttcacaa actacaaagc tgaagaaaat
1681 gaagccctat tacttgtttg taagatttag cacccttctg ctgtatactg tactgagaca
1741 ttacagtttg gaagtgttaa ctatttattc cctgttaaaa tttaacctac tagacaatga
1801 tgtgagtacc caggatgatt tcctggggca cagtgggtga ggagatgggg acaggtgaat
1861 ggaggagtta ggggagagga aaagtggatg gaagtgtctg gaaagggcac gagagagtct
1921 tccaggtact gatcctgttt cttgctctga gtgctagcta gccagctgtg ttcacactgt
1981 aaacattcat caagctgtac atttggtgca cttttctgtg tcataccaca ataaaaaaaa
2041 acctatcatc atcttacaaa aacaagacac ccaagtccag gcccaaggag taagtacaaa
2101 tattcctgtt tctgaaccat tactgtaatt ggctcttaag gcttgaagta accttatagg
2161 ttactcataa ggcatataca aataaacttg tttgttttct tttttc
//
LOCUS HUMUDPGTA 1855 bp ss-mRNA PRI 26-FEB-1990
DEFINITION Human 3,4-catechol estrogen UDP-glucuronosyltransferase mRNA,
complete cds.
ACCESSION J05428
KEYWORDS 3,4-catechol estrogen UDP-glucuronosyltransferase.
SOURCE Human liver, cDNA to mRNA, clone 63-11.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1855)
AUTHORS Ritter,J.K., Sheen,Y.Y. and Owens,I.S.
TITLE Cloning and expression of human liver UDP-glucuronosyltransferase
in COS-1 cells: 3,4-catechol estrogens and estriol as primary
substrates
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.S.Owens, 22-FEB-1990.
FEATURES from to/span description
pept 15 1604 UDP-glucuronosyltransferase (EC 2.4.1.17)
BASE COUNT 571 a 337 c 394 g 553 t
ORIGIN
1 tgcattgcac caggatgtct gtgaaatgga cttcagtaat tttgctaata caactgagct
61 tttgctttag ctctgggaat tgtggaaagg tgctggtgtg ggcagcagaa tacagccatt
121 ggatgaatat aaagacaatc ctggatgagc ttattcagag aggtcatgag gtgactgtac
181 tggcatcttc agcttccatt ctttttgatc ccaacaactc atccgctctt aaaattgaaa
241 tttatcccac atctttaact aaaactgagt tggagaattt catcatgcaa cagattaaga
301 gatggtcaga ccttccaaaa gatacatttt ggttatattt ttcacaagta caggaaatca
361 tgtcaatatt tggtgacata actagaaagt tctgtaaaga tgtagtttca aataagaaat
421 ttatgaaaaa agtacaagag tcaagatttg acgtcatttt tgcagatgct atttttccct
481 gtagtgagct gctggctgag ctatttaaca taccctttgt gtacagtctc agcttctctc
541 ctggctacac ttttgaaaag catagtggag gatttatttt ccctccttcc tacgtacctg
601 ttgttatgtc agaattaact gatcaaatga ctttcatgga gagggtaaaa aatatgatct
661 atgtgcttta ctttgacttt tggttcgaaa tatttgacat gaagaagtgg gatcagtttt
721 atagtgaagt tctaggaaga cccactacgt tatctgagac aatggggaaa gctgacgtat
781 ggcttattcg aaactcctgg aattttcagt ttcctcatcc actcttacca aatgttgatt
841 ttgttggagg actccactgc aaacctgcca aacccctgcc taaggaaatg gaagactttg
901 tacagagctc tggagaaaat ggtgttgtgg tgttttctct ggggtcaatg gtcagtaaca
961 tgacagaaga aagggccaac gtaattgcat cagccctggc ccagatccca caaaaggttc
1021 tgtggagatt tgatgggaat aaaccagata ccttaggtct caatactcgg ctgtataagt
1081 ggatacccca gaatgacctt ctaggtcatc caaagaccag agcttttata actcatggtg
1141 gagccaatgg catctacgag gcaatctacc atgggatccc tatggtgggg attccattgt
1201 ttgccgatca acctgataac attgctcaca tgaaggccag gggagcagct gttagagtgg
1261 acttcaacac aatgtcgagt acagacttgc tgaatgcatt gaagagagta attaatgatc
1321 cttcatataa agagaatgtt atgaaattat caagaattca acatgatcaa ccagtgaagc
1381 ccctggatcg agcagtcttc tggattgaat ttgtcatgcg ccacaaagga gctaaacacc
1441 ttcgggttgc agcccacgac ctcacctggt tccagtacca ctctttggat gtgattgggt
1501 tcctgctggt ctgtgtggca actgtgatat ttatcgtcac aaaatgttgt ctgttttgtt
1561 tctggaagtt tgctagaaaa gcaaagaagg gaaaaaatga ttagttatat ctgagatttg
1621 aagctggaaa acctgatagg tgagactact tcagtttatt ccagcaagaa agattgtgat
1681 gcaagatttc tttcttcctg agacaaaaaa aaaaaaaaga aaaaaaaatc ttttcaaaat
1741 ttactttgtc aaataaaaat ttgtttttca gagatttacc acccagttca tggttagaaa
1801 tattttgtgg caatgaagaa aacactacgg aaaataaaaa ataagataaa gcctt
//
LOCUS PSERCBC 1060 bp ds-DNA BCT 08-MAR-1990
DEFINITION P.aeruginosa carbenicillinase gene, complete cds.
ACCESSION J05162
KEYWORDS carbenicillinase.
SOURCE P.aeruginosa (strain Dalgleish) DNA, clone pMON709.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Acidiphilium cryptum; Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1060)
AUTHORS Boissinot,M. and Levesque,R.C.
TITLE Nucleotide sequence of the PSE-4 carbenicillinase gene and
correlations with the Staphylococcus aureus PC1 beta-lactamase
crystal structure
JOURNAL J. Biol. Chem. 265, 1225-1230 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed copy of sequence kindly provided by
authors, 25-JAN-1990.
FEATURES from to/span description
pept 122 1036 carbenicillinase
BASE COUNT 316 a 195 c 246 g 303 t
ORIGIN 357 bp upstream of HincII site.
1 tcgggcatcc aagcagcaag cgcgttacgc cgtgggtcga tgtttgatgt tatggagcag
61 caacgatgtt acgcagcagg gcagtcgccc taaaacaaag ttagccatat tatggagcct
121 catgctttta tataaaatgt gtgacaatca aaattatggg gttacttaca tgaagttttt
181 attggcattt tcgcttttaa taccatccgt ggtttttgca agtagttcaa agtttcagca
241 agttgaacaa gacgttaagg caattgaagt ttctctttct gctcgtatag gtgtttccgt
301 tcttgatact caaaatggag aatattggga ttacaatggc aatcagcgct tcccgttaac
361 aagtactttt aaaacaatag cttgcgctaa attactatat gatgctgagc aaggaaaagt
421 taatcccaat agtacagtcg agattaagaa agcagatctt gtgacctatt cccctgtaat
481 agaaaagcaa gtagggcagg caatcacact cgatgatgcg tgcttcgcaa ctatgactac
541 aagtgataat actgcggcaa atatcatcct aagtgctgta ggtggcccca aaggcgttac
601 tgatttttta agacaaattg gggacaaaga gactcgtcta gaccgtattg agcctgattt
661 aaatgaaggt aagctcggtg atttgaggga tacgacaact cctaaggcaa tagccagtac
721 tttgaataaa tttttatttg gttccgcgct atctgaaatg aaccagaaaa aattagagtc
781 ttggatggtg aacaatcaag tcactggtaa tttactacgt tcagtattgc cggcgggatg
841 gaacattgcg gatcgctcag gtgctggcgg atttggtgct cggagtatta cagcagttgt
901 gtggagtgag catcaagccc caattattgt gagcatctat ctagctcaaa cacaggcttc
961 aatggaagag cgaaatgatg cgattgttaa aattggtcat tcaatttttg acgtttatac
1021 atcacagtcg cgctgataag gctaacaagg ccatcaagtt
//
LOCUS RAT5MDAA 2030 bp ss-mRNA ROD 15-DEC-1988
DEFINITION Rat iodothyronine 5' monodeiodinase (5'MD) mRNA, 3' end.
ACCESSION M21018
KEYWORDS iodothyronine 5' monodeiodinase.
SOURCE Rat liver, cDNA to mRNA, clone 2302.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2030)
AUTHORS Boado,R.J., Campbell,D.A. and Chopra,I.J.
TITLE Nucleotide sequence of rat liver iodothyronine 5'-monodeiodinase
(5' MD): Its identity with the protein disulfide isomerase
JOURNAL Biochem. Biophys. Res. Commun. 155, 1297-1304 (1988)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly
submitted by D.Campbell, 14-OCT-1988.
FEATURES from to/span description
pept < 1 1449 iodothyronine 5' monodeiodinase (AA at 1)
BASE COUNT 527 a 493 c 532 g 478 t
ORIGIN
1 gtcctggtgc tgaagaagag caacttcgca gaggcgctgg cggcgcacaa ctacctgctg
61 gtggagttct atgccccatg gtgtggccac tgcaaagcac tggccccaga gtatgccaaa
121 gctgctgcaa aactgaaggc agaaggctct gagatccgac tagcaaaggt ggacgccaca
181 gaagagtctg acctggccca gcagtatggt gtccgtggct accccacaat caagttcttc
241 aagaatggag acacagcctc cccaaaggaa tatacagctg gcagggaagc tgacgacatt
301 gtgaactggc tgaagaaacg cacaggccca gcagccacaa ccctgtctga cactgcagct
361 gcagagtcct tggtggactc aagcgaagtg acggtcatcg gcttcttcaa ggacgcaggg
421 tcagactccg ccaagcagtt cttgctggca gcagaggctg ttgatgacat accttttgga
481 atcacttcca atagcgatgt gttttccaag taccagctgg acaaggatgg ggtggtcctc
541 tttaagaagt ttgatgaagg ccgcaacaat tttgaaggtg agatcaccaa ggagaagctg
601 ttagacttca tcaagcacaa ccagctgcct ttggtcatcg agttcactga acagacagct
661 ccaaagattt tcggaggtga aatcaaaaca catattctgc tgttcctgcc caagagtgtg
721 tctgactacg atggcaaatt gagcaacttt aagaaagcgg ccgagggctt taagggcaag
781 atcctgttca tcttcatcga tagtgaccac actgacaacc agcgcatact tgagttcttt
841 ggcctgaaga aggaggaatg tccagctgtg cggcttatta ccctggagga agagatgacc
901 aagtacaaac cggagtcaga cgagctgaca gctgagaaga tcacacaatt ttgccaccac
961 ttcctggagg gcaagatcaa gccccacctg atgagccagg aactgcctga agactgggac
1021 aagcagccag tgaaagtgct agttgggaaa aactttgagg aggttgcttt tgatgagaaa
1081 aagaacgtgt ttgttgaatt ctatgctccc tggtgtggtc actgcaagca gctagccccg
1141 atttgggata aactgggaga gacatacaaa gaccatgaga atatcgtcat cgctaagatg
1201 gactcaacag ccaatgaggt ggaagctgtg aaggtgcaca gctttcccac actcaagttc
1261 ttcccagcaa gtgcagacag aacggtcatt gattacaacg gtgagcggac actagatggt
1321 tttaagaaat tcttggagag cggtggccag gatggagcgg gggacaatga cgacctcgac
1381 ctagaagaag ctttagagcc agatatggaa gaagacgacg atcagaaagc cgtgaaggat
1441 gaactgtagt gcagaagcca gatctgggcg cctgaaccca aaacctcggt ggccatgtcc
1501 cagcagccca catctccgga gcctgagcct caccccagga ggagcgccat cagaacccag
1561 ggaatctttc tgaagccaca ctcatctgac acacgtacac ttaaacctgt ccttcttttt
1621 ttgcttttca attttggaaa gggatctctg accaggccag cccatcttga agggctacgt
1681 tttgttttaa ttggtggtgt actttttgta cgtggatttt gtcccaagtg cttgctacca
1741 tatttgggat ttcacactgg taatgtcttt cctgttagag aggtttatgc tatcacttca
1801 gatttcgtct gtgagatgtt tcatcttcct gacatgtctt catgtcgagg tacttgttcc
1861 accacgcaga cctccctgag accccttctg ccctgcgcag gaggcgatgg ttctgggtcg
1921 tatgctctct ctctctccac cttgtactag tgttcgcatg acagcatggc ttttgtagtt
1981 tgcatttaac ctggggattt ctgcatcctg tcagagggtg ggtccccacc
//
LOCUS RIRANT17KA 539 bp ds-DNA BCT 23-FEB-1990
DEFINITION R.conorii 17K genus-common antigen gene, complete cds.
ACCESSION M28480
KEYWORDS antigen.
SOURCE R.conorii DNA.
ORGANISM Rickettsia conorii
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE 1 (bases 1 to 539)
AUTHORS Anderson,B.E. and Tzianabos,T.
TITLE Comparative sequence analysis of a genus-common rickettsial antigen
gene
JOURNAL J. Bacteriol. 171, 5199-5201 (1989)
STANDARD full staff_entry
COMMENT Draft entry and printed copy of sequence for [1] kindly submitted
by B.Anderson, 27-OCT-1989.
FEATURES from to/span description
pept 60 539 17 kD antigen precursor
sigp 60 119 17 kD antigen signal peptide
matp 120 536 17 kD antigen
mRNA 38 > 539 17 kD mRNA
signal 1 6 -35 region
signal 21 27 -10 region
BASE COUNT 179 a 93 c 125 g 142 t
ORIGIN
1 tttacaaaat tctaaaaacc atatacttat taattatata ttaatttaga gagaattata
61 tgaaactatt atctaaaatt atgattatag ctcttgcaac ttctatgtta caagcctgta
121 acggtccggg cggcatgaat aaacaaggta caggaacact tcttggcggt gctggcggcg
181 cattacttgg ttctcaattc ggtaagggca aaggacagct tgttggagta ggtgtaggtg
241 cattacttgg agcagttctt ggtggacaaa tcggtgcagg tatggatgaa caggatagaa
301 gacttgcaga gcttacctca cagagagctt tagaaacagc tcctagtggt agtaacgtag
361 aatggcgtaa tccggataac ggcaattacg gttacgtaac acctaataaa acttatagaa
421 atagcactgg tcaatattgc cgtgagtaca ctcaaacagt tgtaataggc ggaaaacaac
481 aaaaagcata cggtaatgca tgccgccaac ctgacggaca atggcaagtt gtgaattga
//
LOCUS RIRANT17KB 535 bp ds-DNA BCT 23-FEB-1990
DEFINITION R.typhi 17K genus-common antigen gene, complete cds.
ACCESSION M28481
KEYWORDS antigen.
SOURCE R.typhi DNA.
ORGANISM Rickettsia typhi
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE 1 (bases 1 to 535)
AUTHORS Anderson,B.E. and Tzianabos,T.
TITLE Comparative sequence analysis of a genus-common rickettsial antigen
gene
JOURNAL J. Bacteriol. 171, 5199-5201 (1989)
STANDARD full staff_entry
COMMENT Draft entry and printed copy of sequence for [1] kindly submitted
by B.Anderson, 27-OCT-1989.
FEATURES from to/span description
pept 56 535 17 kD antigen precursor
sigp 56 115 17 kD antigen signal peptide
matp 116 535 17 kD antigen
mRNA 37 > 535 17 kD mRNA
signal 1 6 -35 region
signal 21 27 -10 region
BASE COUNT 182 a 93 c 122 g 138 t
ORIGIN
1 tttacaaaat tctaaaaacc atatacttat taaataatag attaaagagg attatatgaa
61 actattatct aaagtaatga ttttagctct tgcagcttct atgttacaag catgtaatgg
121 tccgggtggt atgaacaaac aagggactgg aacacttctt ggtggcgccg gcggtgcatt
181 acttggttct caattcggtc acggtaaagg acaacttgtc ggagtaggtg taggcgcatt
241 acttggggca gttcttggtg gacaaatcgg tgcaagtctg gatgagcagg atagaaaact
301 tctagaacta acatcacaaa gagctttaga atctgctcct agcggtagta acatagaatg
361 gcgcaatcca gataatggca atcatggtta cgtaacacct aataaaactt ataggaacag
421 tacaggtcaa tattgccgtg aatacactca aacagttgta atagggggaa aacaacaaac
481 aacatatgga aatgcatgcc gccaacctga cggacaatgg caagttgtga attga
//
LOCUS RIRANT17KC 532 bp ds-DNA BCT 23-FEB-1990
DEFINITION R.typhi 17K genus-common antigen gene, complete cds.
ACCESSION M28482
KEYWORDS antigen.
SOURCE R.typhi DNA.
ORGANISM Rickettsia typhi
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE 1 (bases 1 to 532)
AUTHORS Anderson,B.E. and Tzianabos,T.
TITLE Comparative sequence analysis of a genus-common rickettsial antigen
gene
JOURNAL J. Bacteriol. 171, 5199-5201 (1989)
STANDARD full staff_entry
COMMENT Draft entry and printed copy of sequence for [1] kindly submitted
by B.Anderson, 27-OCT-1989.
FEATURES from to/span description
pept 56 532 17 kD antigen precursor
sigp 56 115 17 kD antigen signal peptide
matp 116 529 17 kD antigen
mRNA 37 > 532 17 kD mRNA
signal 1 6 -35 region
signal 21 27 -10 region
BASE COUNT 183 a 88 c 121 g 140 t
ORIGIN
1 tttacaaaat tctaaaaacc atatacttat ttaaataata attagagagg attatatgaa
61 actattatct aaaataatga ttatagctct tgcagcttct atgttacaag cttgtaatgg
121 tcagagtggt atgaacaaac aaggtaccgg tacacttctt ggtggcgcag gaggtgcatt
181 acttggttct caattcggtc aaggtaaagg acaacttgtc ggagtaggtg taggcgcatt
241 acttggggca gttcttggtg gacaaatcgg cgcaagtatg gatgagcagg atagaagact
301 tctagaacta acatcacaaa gagctttaga atctgcacct agcggtagta acatagaatg
361 gcggaatcca gataacggca atcatggtta cgtcacacct aataaaactt ataggaacag
421 tgcaggtcaa tattgccgtg aatacactca aacagttata ataggcggaa aacaaaaaac
481 atatggtaat gcatgccgtc aacctgatgg gcaatggcaa gttgtgaatt ga
//
LOCUS RIRANT17KD 539 bp ds-DNA BCT 23-FEB-1990
DEFINITION R.rickettsia 17K genus-common antigen gene, complete cds.
ACCESSION M28479
KEYWORDS antigen.
SOURCE R.rickettsia DNA.
ORGANISM Rickettsia rickettsia
Unclassified.
REFERENCE 1 (bases 1 to 539)
AUTHORS Anderson,B.E. and Tzianabos,T.
TITLE Comparative sequence analysis of a genus-common rickettsial antigen
gene
JOURNAL J. Bacteriol. 171, 5199-5201 (1989)
STANDARD full staff_entry
COMMENT Draft entry and printed copy of sequence for [1] kindly submitted
by B.Anderson, 27-OCT-1989.
FEATURES from to/span description
pept 60 539 17 kD antigen precursor
sigp 60 119 17 kD antigen signal peptide
matp 120 536 17 kD antigen
mRNA 37 > 539 17 kD mRNA
signal 1 6 -35 region
signal 21 27 -10 region
BASE COUNT 179 a 92 c 125 g 143 t
ORIGIN
1 tttacaaaat tctaaaaacc atatacttat taattatata ttaatttaga gagaattata
61 tgaaactatt atctaaaatt atgattatag ctcttgcaac ttctatgtta caagcctgta
121 acggtccggg cggtatgaat aaacaaggta caggaacact tcttggcggt gctggcggcg
181 cattacttgg ttctcaattc ggtaagggca aaggacagct tgttggagta ggtgtaggtg
241 cattacttgg agcagttctt ggtggacaaa tcggtgcagg tatggatgaa caggatagaa
301 gacttgcaga gcttacctca cagagagctt tagaaacagc tcctagtggt agtaacgtag
361 aatggcgtaa tccggataac ggcaattacg gttacgtaac acctaataaa acttatagaa
421 atagcactgg tcaatattgc cgtgagtaca ctcaaacagt tgtaataggc ggaaaacaac
481 aaaaagcata cggtaatgca tgccgccaac ctgacggaca atggcaagtt gtgaattga
//
LOCUS ROTNS53 1611 bp ds-RNA VRL 02-MAR-1990
DEFINITION Simian rotavirus SA11 segment 5 nonstructural protein (NS53) gene,
complete cds.
ACCESSION X14914
KEYWORDS nonstructural protein.
SOURCE Simian rotavirus, cDNA to viral RNAi, clone 35.
ORGANISM Simian (SA11) rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 1611)
AUTHORS Mitchell,D.B. and Both,G.W.
TITLE Conservation of a potential metal binding motif despite extensive
sequence diversity in the rotavirus nonstructural protein NS53
JOURNAL Virology 174, 618-621 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
D.B.Mitchell, 03-APR-1989.
FEATURES from to/span description
pept 31 1518 nonstructural protein
BASE COUNT 571 a 214 c 288 g 538 t
ORIGIN
1 ggcttttttt tgaaaagtct tgtgttagcc atggctactt ttaaagatgc atgctttcat
61 tatcgtagat taactgcttt aaatcggaga ttatgcaaca ttggtgcaaa ttctatttgc
121 atgccagttc ctgatgagaa aattaagggg tggtgtttag aatgttgtca aatagctgat
181 ttaacccatt gttatggttg ctcattgccg catgtttgca aatggtgtgt tcagaacaga
241 agatgcttcc ttgacaatga acctcatttg cttaagctta gaactgtgaa acatccaatt
301 accaaagaca aattacagtg tatcatagac ttgtacaata taatatttcc aattaatgat
361 aaagtaatta gaaaatttga aagaatgata aagcaaagaa aatgtaggaa tcaatataaa
421 attgaatggt ataatcattt gctgctccca attacattaa atgctgctgc atttaagttt
481 gatgaaaata atctttatta tgtttttggg ttatatgaga aatcagtcag tgatatatat
541 gctccatata gaattgttaa ctttataaat gaatttgata aattattgct tgatgatatt
601 aactttacaa gaatgtccaa tctaccaata gagttgagaa ccattatgca agaatacttc
661 caattatcaa gactgccatc atcaaaacta aagcaaattt acttttcaga ttttactaaa
721 gaaactgtga tttttaatac ttatacaaaa acgccaggaa gatcaatata cagaaatgta
781 actgaattta attggagaga tgaattggag ctttattctg atttaaaaaa tgataagaat
841 aaattaattg ctgcaatgat gacgagtaag tatactcggt tctatgctca tgataataat
901 tttggaaggt tgaaaatgac aatatttgag ttgggacatc attgtcagcc taactacgtg
961 gcatctaatc acccaggcaa tgcttccgat atccagtact gtaaatggtg taatataaaa
1021 tattttctta gtaaaattga ttggcggatt cgtgatatgt ataatttatt gatggaattt
1081 attaaggatt gttataaaag taatgttaac gttggacatt gtagttctgt tgaaaacata
1141 tatcctttaa ttaaaagatt aatttggagt ttgtttacta atcacatgga tcaaacaatt
1201 gaagaagtgt ttaatcacat gtcgccagtg tcagttgaag gtacgaatgt catcatgttg
1261 attcttggat tgaatattag tttgtataat gaaattaagc gcactttgaa tgtagatagc
1321 ataccaatgg tacttaattt aaatgaattc agtagtatag ttaaatcaat tagcagtaaa
1381 tggtataatg ttgatgaatt ggataaattg ccaatgtcaa taaaatcaac ggaggaactg
1441 attgaaatga agaattctgg aactttaact gaagaatttg agctactgat ctccaactca
1501 gaagatgaca atgagtgaaa ttactaatgt cactatctaa ttatacagta tttagccatc
1561 acaagaccgt ccagactaga gtagcgccta gctggcaaaa tactgtgaac c
//
LOCUS SPINR 3284 bp ss-mRNA PLN 06-MAR-1990
DEFINITION Spinach NADH nitrate reductase mRNA, complete cds.
ACCESSION M32600
KEYWORDS NADH nitrate reductase; nitrate reductase.
SOURCE Spinach, cDNA to mRNA, clone pIMP4.
ORGANISM Spinacia oleracea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE 1 (bases 1 to 3284)
AUTHORS Prosser,I.M. and Lazarus,C.M.
TITLE Complete nucleotide sequence of spinach nitrate reductase cDNA
JOURNAL Unpublished (1990) Long Ashton Res. Stn, Long Ashton, Bristol, UK
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.M.Prosser, 05-MAR-1990.
FEATURES from to/span description
pept 116 2896 NADH nitrate reductase
BASE COUNT 894 a 642 c 849 g 899 t
ORIGIN
1 aaaaaccata acacctctgg aaaaatattg aggaagaacc gtagttcttc tttccttttt
61 ctttttttct ttttaattac atttcaaaaa accctacatt tgatcgattt gagctatggc
121 ggcgtcagtc gatcgtcaat atcatccggc gccgatgagc ggtgtcgttc gtaccccatt
181 tagcaaccac caccgttccg actctccggt tcggaacggt tatactttta gtaacccgcc
241 ttcttcaaat ggcgttgtta aacccggtga aaaaatcaag ctcgtagata ataatagtaa
301 tagtaataat ggtagcaata ataataataa tcgctatgat agcgatagtg aagaagatga
361 tgacgaaaat gagatgaatg tctggaatga gatgatcaaa aaaggtaaca gcgaattgga
421 gccgtcttcc gtcgatagcc gtgatgaagg aacggctgat caatggattg agaggaatcc
481 ttcaatgatc cgtctcacag ggaagcaccc gttcaattcg gagcctccat tgacccgatt
541 gatgcaccac gggtttctca ccccggttcc gcttcattac gttcggaacc acggtccagt
601 ccccaacgct aagtgggagg attggaccgt tgaggtgacc gggttagtca aaagaccaat
661 ccggttcact atggaccaat tggtcaatga cttccaaagc cgtgagtttc cggtgacgct
721 tgtctgcgcc ggtaaccgga ggaaggaaca gaatatgacg aaacaatcga tagggtttaa
781 ctggggttcc gccgccgtgt caacttccgt gtggcgcggt gttccacttc gcgacgtgct
841 taagcggtgt ggggtcatga gttcgttaaa aggggctctc aacgtttgtt ttgagggggc
901 tgaggattta ccaggaggtg gcgggtcgaa atatggtaca agtgttaaaa gggagtttgc
961 aatggatccg gcccgtgaca ttattctagc gtacatgcaa aatggggaga agttgagccc
1021 ggatcatggg tatccggttc ggatgatcat acccgggttt attggtgggc ggatggttaa
1081 gtggttgaag aggattattg tgactacaac agagtctgat aattattacc attacaagga
1141 taatagggtc cttccctctc atgttgatgc tgaacttgcc aattctgaag cttggtggta
1201 caaacaagag tacatcataa atgagctgaa cgttaactcg gtgataacat cgccgtgcca
1261 cgaagaaatc ttgcctatca acgcctggac tacccaacgt ccttacacaa tgaggggtta
1321 cgcttattct gggggaggaa gaaaagtgac ccgagtggaa gtgacaatgg atggtggaga
1381 cacatgggac atatgtgaat tggaccacca agagagggga agcaaatatg gtaaattctg
1441 gtgttggtgt ttctggtctt tagaggtcga agtactagac ttactcggag ccaaagagat
1501 tggggtccgt gcttgggatg aaagcctcaa cacccaacct gaaaagctca tttggaatgt
1561 catgggtatg atgaacaact gctggtttcg tgttaagacc aacgtgtgca agcctcacaa
1621 gggagagatc ggtatcgtgt tcgaacaccc gacccaaccc ggtaacaagt cgggcgggtg
1681 gatggcccgt gaacgccacc tcgagatctc cgattctggc ccaacattga agaggacagc
1741 atccacccca ttcatgaaca ccacctcaaa gatgtactca atgtcagagg tgaagaaaca
1801 caacactgct gactcagctt ggattgtagt ccatggaaat gtctacaacg cgacccggtt
1861 cctaaaggat cacccgggtg ggtcagacag tatcctcatc aatgcaggaa ccgactgcac
1921 cgaagagttc gatgcaattc actcagacaa ggccaagagg cttctagaag atttcaggat
1981 aggtgagttg atctcaacgg gctacacctc tgactcgtcc tcgcccggta actcagtcca
2041 tggcgggtcg gtttactccg ggttagccgg gttagcccca attacagaag cggttccatt
2101 acgtaatgtt gctttaaacc cacgggttaa gatcccatgc aaacttattg agaaagtttc
2161 actttcccat gacgtaagaa ggttccgatt cgggttaccc tctgaagacc aggtgttggg
2221 tttacctgta ggaaaacaca tcttcctttg tgctaatgtt gatgataagc tttgtatgag
2281 agcttatact ccatcaagta ccattgatgt ggttgggtac tttgatcttg tggttaaggt
2341 ttacttcaag gatgttcacc ctaggttccc taacggtggg gtgatgtcac aacatctaga
2401 ctcactttct ttgggctcga ttgttgacgt gaaaggccca ttgggccata ttgagtattt
2461 aggaaagggg aatttcacag tccatggaaa gcccaagttt gctaaaaagc ttgctatgat
2521 atcaggtggg accgggatta ccccgattta tcaggtgatg caggcaatac tgaaagatcc
2581 tgaggataag actgagatgc atgttgtgta tgcaaatcgg acggaggagg atatcttgct
2641 tagagaagag cttgataaat gggctgatga gtttagagat cgggttaaag tatggtacgt
2701 tgttgaaaag gcggaggaag ggtggaaata tgataccggg tttatatctg agaagatact
2761 tagagatcac gtgccggcgg ttggggatga tgtgttggca ctcacgtgcg gtcctcctcc
2821 tatgattcaa tttgcggttc aacctaactt ggataagatg ggttttgata ttaaggaaca
2881 gttgttgatt ttttgaaagt actgttacgt tgggaagtga atttactaca ctacccctga
2941 agatatttgt gttacgtgca cattgcaata tcgacgtgtg taacacatgc gttgtgcttt
3001 cgaatgtata gtagaagttg ggtaaaatgg agtatatgaa cactactata tgtattctat
3061 gtatgtaatg taatttgtgg agctttttct tctttcttta cccggtgaaa ttatgtgaat
3121 ataagtgctc cactagattg tttgttgata agctttttgt tgtccttatg aaaattggtt
3181 gatggaggaa gaagtatgta gaaggaagaa attcaagtag ctaaggttta tgtattgtaa
3241 gagggagatt actatttggt agtctccatt taccatatta ttat
//
LOCUS SRGMDH 4638 bp ds-DNA PLN 06-MAR-1990
DEFINITION S.vulgare NADP malate dehydrogenase gene, compete cds.
ACCESSION M31965
KEYWORDS NADP malate dehydrogenase.
SOURCE S.vulgare (haplotype 2n) 5 day old leaf mesophyll cell DNA.
ORGANISM Sorghum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 4638)
AUTHORS Luchetta,P., Cretin,C. and Gadal,P.
TITLE Structure and characterization of the sorghum vulgare gene encoding
NADP-malate dehydrogenase
JOURNAL Gene (1900) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.Luchetta, 02-MAR-1990.
FEATURES from to/span description
mRNA 763 783 NADP-malate dehydrogenase, exon 1
873 963 NADP-malate dehydrogenase, exon 2
1461 1547 NADP-malate dehydrogenase, exon 3
1643 1727 NADP-malate dehydrogenase, exon 4
1984 2165 NADP-malate dehydrogenase, exon 5
2420 2490 NADP-malate dehydrogenase, exon 6
2570 2660 NADP-malate dehydrogenase, exon 7
2749 2823 NADP-malate dehydrogenase, exon 8
2912 3013 NADP-malate dehydrogenase, exon 9
3100 3222 NADP-malate dehydrogenase, exon 10
3303 3371 NADP-malate dehydrogenase, exon 11
3456 3524 NADP-malate dehydrogenase, exon 12
3619 3672 NADP-malate dehydrogenase, exon 13
3765 4638 NADP-malate dehydrogenase, exon 14
pre-msg 603 4638 NADP-malate dehydrogenase mRNA and introns
IVS 784 872 NADP-malate dehydrogenase intron A
IVS 964 1460 NADP-malate dehydrogenase intron A
IVS 1548 1642 NADP-malate dehydrogenase intron A
IVS 1728 1983 NADP-malate dehydrogenase intron A
IVS 2166 2419 NADP-malate dehydrogenase intron A
IVS 2491 2569 NADP-malate dehydrogenase intron A
IVS 2661 2748 NADP-malate dehydrogenase intron A
IVS 2824 2911 NADP-malate dehydrogenase intron A
IVS 3014 3099 NADP-malate dehydrogenase intron A
IVS 3223 3302 NADP-malate dehydrogenase intron A
IVS 3372 3455 NADP-malate dehydrogenase intron A
IVS 3525 3618 NADP-malate dehydrogenase intron A
IVS 3673 3764 NADP-malate dehydrogenase intron A
BASE COUNT 1255 a 1044 c 993 g 1346 t
ORIGIN
1 aggcctagtc aaaaagtaaa tgtgattatt tacaaacatt gtctctacat tttgcatggc
61 ttcaaaacca aagatcgcac acacttttgc caaaattttg ccaccgttgc taaccaatcg
121 atgtgtaacg atgagaagtg ccatcacact ctctgaaaag actcaggccc tggcactggc
181 aggaccgcag gaaaggcact gagcaaagat gatggtcgtg ggagaataaa caaccttttt
241 ttttttttgc ctttgtaata aatgtccccg ccgtctctca cgtagtacgt agtttcggtg
301 tggctttgtt ttaaaaggga accaaatcta ccaatgagtc tttttaaaaa gaagaaaaat
361 actggtcagc aaaactgaga ttctccagca acgaaaatga aaaagacttg gttgtcagtg
421 ttaaccacca ccaggatagc aggcgtgtgg acgtacaaga tgccgaacag tcgctgtcgc
481 gcgacgtgtc ccgcgccggg cggtccgctc gccttgcctg ccgctaatcc aattcctacg
541 cgcgctcctg ctcctgcgta caaccagaga ggctataatc tcctccactc cctcccctgt
601 agctgcagcc cgttccactc ctatccatcc gtgctccagc tccaccatgg gcctctcaac
661 ggcttactcc ccggtcggat cgcacctcgc gccggcgcct ctcgggcacc gtcgctccgc
721 ccagctccac cgcccgcgcc gggccctgct cgccaccgtc cgatgctccg tcgacgccgc
781 caagtgagta gtagtatact acccggacgt ggacatgaat gcgtgccatt tatcaactac
841 taggctgact agccccctat acgttgatgc aggcaggtgc aggatggcgt ggcgacggcg
901 gaggcgccgg cgacgcggaa ggactgcttc ggggtcttct gcaccaccta cgacctcaag
961 gcggtgagtc cctcccttcg cctcaggcgg tggtggtcct ccccggttcc cggtccggcc
1021 tccccgtgtg tgttcgctga aatgctccat cgacttttgc cttttgcttc aaactcaaag
1081 tagaatccct tttccaccta aagctagcaa cgttgctttt ctccaaaaga gaaaaaaaaa
1141 gaaaaagcta gcacgtcgtt ggtgtcatgg tgtggaccgt tggaggtgtg gtgtggatat
1201 atgctacgat actggatagc ttgcactcat atcaacagct ggcgttatgc ctctataaat
1261 aaaaataaat atcatacaga ccaccaaaac caaagcgcag ggtggagtgc cacgcatgag
1321 acgctgcact gcacgccact gctgcttgac ttaatttggt ttctaattaa cttgctgctg
1381 tatatatcgt catatagtag tacagtacta acaatctgtc atcctatgcc attgaatgga
1441 ttgggagttg ggacacacag gaagacaaga ccaagtcgtg gaagaagcta gtgaacattg
1501 ctgtgtcagg cgcggctggg atgatatcga accacctgct gttcaaagta actaaatcgt
1561 ttactgactc tacttgtttc ctatcttgag ttgcagtatt gtaatgctga ccattcattt
1621 tattaacaca aaaaaaattc agcttgcctc tggtgaggtt ttcggacaag accaaccaat
1681 agcactgaag ttacttggtt cagagagatc gtttcaagct ctcgaaggtg aatgactgac
1741 tgcctcttct tctttctgcc ttgctatctg agaagaagaa tcaattcata ttttccactc
1801 actctgttcc aaaagctaaa atgccttata tataatttgg aacaggggga gtacatacgt
1861 atagctttag tttcaagctg cccttacagc ttccagtgtt gtactctctt atatcaatgg
1921 attactgtga cagtgaatat ttatcacttc ttcctcatgt gctctcaaaa actgcaactg
1981 taggtgttgc tatggaactg gaggactcgc tatatccatt gctgagggaa gtcagcattg
2041 gtatagatcc ttatgaggtc tttgaagatg tagattgggc ccttcttatt ggtgctaagc
2101 cccgaggtcc tggcatggag cgagctgcgt tactggatat caatggtcaa atctttgctg
2161 atcaggtttc ttttgcactc tgttcctcct ttcttctcaa gtactatgtc accaaattgt
2221 aagaagtcct ctcaggaaag gtgccaagaa gcattttctg aaacttcaac ctcaaatttt
2281 aaggattcct aagcattaga acctgcaaat tgccaaaacc agaaaccaag tacaaatttg
2341 agtcaaagcc cacctaaata aacgaccaga gtaacctgtt attcttgtta ttatctgaat
2401 tttttttaaa ttgtgtcagg ggaaagcact taatgccgtg gcctcgaaga acgtgaaagt
2461 cttagttgtt ggaaatccct gtaacactaa gtacggacat tgattgttca ccaagattaa
2521 cgattttatg tcatcaccat gtcttagttg tggcttgccc ttgatgcagt gcgttaattt
2581 gcttgaaaaa tgctccagac ataccagcaa aaaattttca tgcactgacg aggttggatg
2641 aaaatagagc aaagtgccag gtgactgacg ttgaacctta ttatgtctcg tccttccatt
2701 ccaaacagga aaacagatta agaaatcaca cacaatctcc ttttgtagct agcactcaaa
2761 gcaggtgtat tttatgacaa agtatcaaac gtgactattt gggggaacca ttcgacaact
2821 caggtgaagt tttgcattat aaattaaatg agcagactat tttgcagaca catttcttcg
2881 ctgaatggtt ggcactaact tctctgtgta ggttcctgat ttcttgaatg ccaaaattga
2941 tgggagacca gtgaaagaag tcattaagga taccaagtgg ttagaagaag agttcaccat
3001 tacggttcaa aaggtatgac aatccatttg gagtgtctgt tactcggata aataaacaag
3061 gttgtgacca taattcgttt cattcttata tctaaacagc gtggaggtgc gctcatccaa
3121 aaatggggca gatcttcagc tgcatcaacc gctgtatcaa tagcggatgc tattaaatcc
3181 ctggtaactc ctaccccaga aggcgactgg ttctccacag gggtaagttt tgatgccttt
3241 gaattgatca cagctttatt tcggcattcc tctaatggaa aaaagtaata aaaaaaacac
3301 aggtttatac gactggaaat ccttatggca tagcagagga tatcgtgttc agcatgccat
3361 gcagatcgaa ggtatgatct actaatcaaa ttgtccttat tatttttgtc ctaagataag
3421 gtaccactcc aatgatcgta aaattcttcc aacagggtga tggtgattac gaactagcta
3481 ctgatgtgtc aatggacgat tttctctggg aacggattaa aaaggtaagg gctataggaa
3541 ctgccatcac tgatatgctg agaatataag cagcactaga gattttaccc taacagttgg
3601 ccctgtacat attttcagag tgaagctgaa ttgcttgctg agaagaaatg cgttgcccat
3661 cttactggag aggtatgtca cagtgacgta ttttatttag cttaaaaaga gagaaaccgt
3721 gccggaaagg attggcttag ctttttctgt ttgtgcaaat gcaggggaat gcatattgtg
3781 atgttccgga ggataccatg ctaccgggag aagtgtagtt tagttacaga gaagcaatag
3841 caggggaatg catttgtcta tactatagct gagactcgag agaaacagct gcttgcaaga
3901 tgcagcaact acttttactt atatatatat attgtttata ttctgtaccc gctcagagcg
3961 ttgtgtgcac acgctgtgtg attccgggac cgtctttacg atcgttttgc cttttttggg
4021 ccataccgct tcgctcccgt gttgcgcttc cattattgag ttattaccac tctgttttat
4081 aatatttatt actgccgaga ttttgtgtag atattacaat cttgtgtttg gttgtgtctc
4141 tgctcatacg caagtgctgc tgcaactctt gtttttattc acatttagtt acaactgtgt
4201 tattttgtag taacgacgtg ttatatgtgc tgttacaatc gtattgtaac tctattcttt
4261 tcctcatttc tccttttact gtacttattt aacagaatta gtcgattttc agtcacagaa
4321 taatattttc tctcacaaca aatcaacatc aatcagtgtc gtcgaatctc tacctcgtgg
4381 agccatggac cgttgtctcg tgggcaatcc caagtgagcc ctatatgagg cagttctttc
4441 gctaatcatt tttaaaacat ataccacaag acccatacac cgattatgtt tgaaactttc
4501 aatgctagca acttcatcga catacagggc tcacacgtta gtaaaaaaat gtgaattaag
4561 cactgtctct gcattttgca tggcttcaaa caaaagatct cgcactttga ctacatttcg
4621 ccactgctgc taaggctt
//
LOCUS TRBHSRP 2378 bp ds-DNA INV 15-FEB-1990
DEFINITION T.brucei heat shock-related protein gene, complete cds.
ACCESSION M32139
KEYWORDS heat shock-related protein.
SOURCE T.brucei (strain 427-60) DNA, clone H8.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 2378)
AUTHORS Lee,M.G.-S., Polvere,R.I. and Van der Ploeg,L.H.T.
TITLE Evidence for segmental gene conversion between a cognate hsp 70
gene and the temperature sensitively transcribed hsp 70 genes of
Trypanosoma brucei
JOURNAL Mol. Biochem. Parasitol. (1900) In press
STANDARD full staff_entry
COMMENT The conserved domains are 100 % homologous to the corresponding
regions of Hsp70 genes 2-6 of T.brucei.
Draft entry and computer-readable sequence [1] kindly submitted by
L.T.H.Van der Ploeg, 16-FEB-1990.
FEATURES from to/span description
pept 331 2361 heat shock-related protein
site 301 463 conserved domain
site 532 613 conserved domain
site 732 920 conserved domain
site 985 1028 conserved domain
BASE COUNT 575 a 535 c 717 g 551 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttcctt tattttcttt ttccttttga gctaactcta cctccgtagc cgccagcgtt
61 ctcaatgcag ttttgcgtgt aagtcacttg cctcttggcc aagtagactc ggtatgcaac
121 accaggagcg ccgcacgcat aacaatttgc gtttacgcgt ctacgttagt gtgcagatgt
181 tatgatactt ttgttttaac tttgctccgc gtccttggcg ttggtattgt tgcaatcttt
241 catcggccag tgcctgtata tttcaatatc ctcttgcttt tcatccttgc tactgcctct
301 gaaaacaagg aatatctgcc tctttgaagg atgacatacg aaggcgccat tggtatcgac
361 ctcggtacga cgtactcgtg cgttggtgtg tggcagaacg aacgtgtgga aatcattgca
421 aatgaccagg gtaaccgtac gacgccgtcc tacgttgctt tcgtcaataa tgaggttctt
481 gttggcgatg cggcgaagag ccacgccgct cgtggctcga atggtgttat attcgacgcg
541 aaacggctga tcggacggaa gttctccgac tctgtcgtgc agtccgacat gaagcactgg
601 cccttcaagg tcgaggaggg cgagaaaggt ggtgctgtga tgagggtaga gcatcttgga
661 gaggggatgt tactgcagcc cgagcaaata tctgcccgtg tccttgcgta cctgaagtct
721 tgtgcggagt cgtatcttgg gaagcaggtg gctaaggctg tcgtaacagt gcctgcatac
781 ttcaacgact cgcagcgcca ggcaacgaag gacgcgggga cgattgctgg tctggaggtg
841 ctgcgcatca tcaacgagcc aacggctgct gcaattgcct atggcctgga caaggctgat
901 gaggggaagg aacgcaacgt tttggtgttt gactttggag gtggaacgtt tgacgtctca
961 attatttcag taagcggcgg tgttttcgag gtgaaagcca caaatggtga cacccacctt
1021 ggtggagagg atgtggacgc cgcactgctg gagcacgcac ttgctgatat aaggaaccgt
1081 tatggaatag aacagggctc tctgtcccaa aagatgctat ccaagctgcg ttcacggtgt
1141 gaggaagtga agcgagtact gtcacactcg acagtggggg aaattgcact cgacgggctg
1201 cttcccgacg gcgaagaata cgtgctcaag ctaacgcgag cgaggttgga agagctatgc
1261 acgaaaattt ttgctcgctg tttgagtgtg gtacagaggg cgttgaagga cgcttcaatg
1321 aaggttgagg atattgagga tgttgtgctt gtcgggggaa gttctcggat tcctgctgtg
1381 caggctcagt taagggagct attcagaggt aagcaactct gcagcagtgt gcaccctgac
1441 gaggcagtgg cgtatggggc tgcatggcag gcgcacgtac tttccggtgg atatggggag
1501 agctctagga cagcaggtat agtgcttctg gatgttgtgc ccctttcaat cggcgttgaa
1561 gtggacgacg gtaagtttga tgtgataatt cgtcggaaca caaccattcc ataccttgcg
1621 accaaggagt acagcactgt ggatgacaat cagagtgaag tggagataca ggttttcgaa
1681 ggtgagcgcc ccctcacgcg tcacaaccac aggctaggct cctttgttct tgacggtatt
1741 accccggcaa agcacggtga accgactatc accgttactt ttagtgttga cgccgacggg
1801 atcctgaccg ttacagccgc agaagaactc gggagcgtaa caaaaacact tgttgtggag
1861 aactcagagc gacttaccag tgaggaagta cagaagatga ttgaagttgc acaaaagttt
1921 gccttaacgg acgccaccgc tctggcgagg atggaggcca ccgagaggct cacgcagtgg
1981 tttgaccgac tcgaagctgt aatggagacc gtcccgcaac catactcaga gaagcttcaa
2041 aagcgtattg cctttcttcc gcacgggaaa gagtgggtcg gcacacagct tcatacgtac
2101 actgacgcgg cgtcgataga ggcgaaagta gcgaagattg aacggcttgc aaagagggcc
2161 ctgaaatcgg cgcgtcgtga aggaaaggat ggatgggcac cgggcaatga ggacaatgga
2221 agcggtgatg acaacgacgg tgacgataac agtgatgagg aggacgagct ccaaaggggg
2281 aggggcgtga cagagggatc ggggaggtcc ccgataagga aacgcgaccg catagaggct
2341 atcaacgcaa acacagagta agtcgtagag gtgtctaa
//
LOCUS YSCALG1 2011 bp ds-DNA PLN 06-MAR-1990
DEFINITION S.cerevisiae beta-mannosyltransferase (alg1) gene, complete cds.
ACCESSION J05416
KEYWORDS beta-mannosyltransferase.
SOURCE S.cerevisiae DNA.
REFERENCE 1 (bases 1 to 2011)
AUTHORS Albright,C.F. and Robbins,P.W.
TITLE The sequence and transcript heterogeneity of the yeast gene ALG1,
an essential mannosyltransferase involved in N-glycosylation
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.F.Albright 06-MAR-1990.
FEATURES from to/span description
pept 115 339 upstream ORF
pept 379 1728 beta-mannosyltransferase (alg1)
mRNA 102 1826 alg1 mRNA (alt., both ends approx.)
mRNA 203 1826 alg1 mRNA (alt., both ends approx.)
BASE COUNT 639 a 341 c 390 g 641 t
ORIGIN Chromosome II; 2.1 map units from Lys2.
1 gggcaataaa aaattctaag aatagaatca tatcacagta cacgaaaata taaatacatc
61 ttgaaaatgt ctcaaattgg gtggaaaaga gggtgcttga attcattagg acatatggtt
121 gatgttctca ggttctactt atgtttgctt tgtcgatttt tgcatgcttt gactgtgact
181 ttcctgtctg atatctttgt atggcttgtt gctaaaacaa ggtcaatcca agcagtgatc
241 atcttgcacg ttgcttcaat agaacgcgca tatagcaatc atcaagttaa ttggtcatat
301 atctttcaat cagcaattag caaagccatt agagggtaat tgtaacggca ttttaggggc
361 tatagtcaat tggtgaaaat gtttttggaa attcctcggt ggttacttgc cttaataata
421 ttataccttt ccataccgtt agtggtttat tatgttatac cctacttgtt ttatggcaac
481 aagtcgacca aaaaaaggat catcatattt gtgctgggtg atgtaggaca ctctccaagg
541 atatgctatc acgctataag tttcagtaag ttaggttggc aagtcgagct atgcggttat
601 gtggaggaca ctctacccaa aattatttcc agtgatccaa atatcaccgt ccatcatatg
661 tcaaacttga aaagaaaggg aggcggaaca tcagttatat ttatggtaaa gaaggtgctt
721 tttcaagttt taagtatttt caaattactt tgggaattga gaggaagcga ttacatacta
781 gttcaaaatc caccgagcat acccattctt ccgattgctg tgctatacaa gttgaccggt
841 tgtaaactaa ttattgattg gcacaatcta gcatattcga tattgcaact aaaatttaaa
901 ggaaactttt accatccttt agtgttgata tcttacatgg tagagatgat attcagcaaa
961 tttgctgatt ataacttgac tgttactgaa gcaatgagga aatatttaat tcaaagcttt
1021 cacttgaatc caaagagatg tgctgttctc tacgaccgcc cggcttccca atttcaacct
1081 ttggcaggtg acatttctcg tcaaaaagcc ctaactacca aagcctttat aaagaattat
1141 attcgcgatg attttgatac agaaaaaggc gataaaatta ttgtgacttc aacatcattc
1201 acccctgatg aagatattgg tattttatta ggtgccctaa agatttacga aaactcttat
1261 gtcaaatttg attcaagttt gcctaagatc ttgtgtttta taacgggtaa aggaccacta
1321 aaggagaaat atatgaagca agtagaagaa tatgactgga agcgctgtca aatcgaattt
1381 gtgtggttgt cagcagagga ttacccaaag ttattacaat tatgcgatta cggagtttcc
1441 ctgcatactt caagttcagg gttggacctg ccaatgaaaa ttttagatat gtttggctca
1501 ggtcttcctg ttattgcaat gaactatcca gtgcttgacg aattagtaca acacaatgta
1561 aatgggttaa aatttgttga tagaagggag cttcatgaat ctctgatttt tgctatgaaa
1621 gatgctgatt tataccaaaa attgaagaaa aatgtaacgc aggaagctga gaacagatgg
1681 caatcaaatt gggaacgaac aatgagagat ttgaagctaa ttcattgagt caatggtaac
1741 tcagcctttc ttttttgaaa attactattt tcgactcttt tttatacagt tacatagtac
1801 tacctctaat acacattcat gattaacaat gtttcaaaca atataaagtc ccgataacga
1861 ccttttgaag tggtgacgtt accgctcttc gttgacaaga ttcaagaggg ctgtcagaat
1921 aacagctatc atggtggaaa gtaggtgacc ctttgaacaa ggcagcatat atccatcgac
1981 tatcatgttc ctaaaatgtt atcatagccg g
//
LOCUS YSCHEM15 2916 bp ds-DNA PLN 07-MAR-1990
DEFINITION S.cerevisiae ferrochelatase (HEM15) gene, complete cds,
and Val-tRNA gene.
ACCESSION J05395
KEYWORDS delta repetitive element; ferrochelatase; protoheme ferro-lyase;
transfer RNA-Val.
SOURCE S.cerevisiae (strain FL200) DNA, clones pHEM15-[1,2].
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2916)
AUTHORS Labbe-Bois,R.
TITLE The ferrochelatase from Saccharomyces cerevisiae: Sequence,
disruption and expression of its structural gene HEM15
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Lobbe-Bois, 06-MAR-1990.
FEATURES from to/span description
pept 841 2022 ferrochelatase precursor (EC 4.99.1.1)
sigp 841 933 ferrochelatase signal peptide
matp 934 2019 ferrochelatase
tRNA 2253 2325 Val-tRNA
rpt 2435 2765 variant solo delta element
rpt 2776 2888 truncated delta-element
BASE COUNT 907 a 563 c 527 g 918 t 1 others
ORIGIN
1 aaaatagcat tcagtgggaa cgatcccaat aagcagatag catatcgcag tgtaaaacta
61 tcaatcccat agttggtaat tatctttgtt aaaacagcgt ccacaggatt gtacatgttg
121 gaagcgatca gcagcagaat taaatcacca cggtttgtct tggcgtatng gtttgtcttc
181 ttgcttgacc tgtgcctgta ttttgtcctt ggctttaaat ctgtttaata agaacccttt
241 atttagatga ccattgtttt cgacggtttg tggtttcgac gcgcaatgtg aaagatcttc
301 aatgttttgt tgaacagtaa agcaaaacag ctcaacagca ggacaaaaaa agctaaaaaa
361 gaactcttcg cacttttaac aagcagtttt actgaacaat tttagaagta aatttcagga
421 aggttcaaca gctttacatg aaagaggatg ctggccattt ttttttttta aaaaagtttt
481 ttatatacac aaactcagta aaataattac aaatatgtag catgtgtagg atgccttgaa
541 acatctccgc tggcaatagc gttagacatg cccaatcttc aatggtcacc gtaagcctta
601 tatgacgaag gagttgctgc ccttcgtttt ctcatgctgc tttttttcgt aggggatatg
661 ggattgaaga aggaggacgc gaaggaatac ttttggtcgg tcaaggtcac tcttggcttg
721 acaccaaaca agcgtaaatt ctcgtgttcc acttgagtcg gtttacaaga agtacagaaa
781 tatttaacac atacctgcta tttggacaaa tttagcagta aaaaaaagaa cgtttaaaga
841 atgctttcca gaacaatccg tacacaaggt tccttcctaa gaagatcaca actgaccatt
901 acaagatcat tttcggttac attcaacatg cagaatgcac aaaagagatc acccacagga
961 attgttttga tgaacatggg tggcccctct aaagttgagg aaacatatga ttttttgtat
1021 caattatttg ccgataatga cctaattccc attagtgcta agtatcagaa gacaattgct
1081 aaatatattg ctaagtttcg tacccccaag atagagaagc aatataggga aattggtggg
1141 ggctccccaa tccggaaatg gtctgagtat caagccactg aggtctgtaa aatcttagat
1201 aaaacctgtc cagaaacggc gcctcataag ccttacgtgg cgtttcgtta cgcaaagccg
1261 ctaaccgcag aaacttataa acaaatgcta aaagatggcg tgaagaaggc agtggccttt
1321 tctcaatatc ctcatttctc ttattccact accgggtcat ccatcaacga attgtggaga
1381 cagattaagg cattggactc cgagagatct atatcttggt cggttattga tcgttggcct
1441 acaaatgaag gtctaatcaa ggccttctcc gaaaatatca ccaaaaaact acaagagttt
1501 ccgcaacctg tcagagacaa ggttgtttta ttgttttccg cacattctct acccatggat
1561 gttgttaaca ccggtgatgc ctacccagct gaggtagctg cgacggttta caacatcatg
1621 caaaaattaa agtttaaaaa cccttatagg ttggtttggc aatcccaagt tggaccaaaa
1681 ccatggttgg gagcgcagac agctgaaatt gcggaatttt taggccccaa agttgatggc
1741 ctaatgttta ttcctatcgc ctttacctct gatcatattg aaacattgca tgaaattgac
1801 ttaggcgtca ttggggaatc ggaatataag gataaattta agagatgcga atctttaaat
1861 ggcaaccaga cctttattga aggcatggca gatctcgtca aaagccactt acagagtaac
1921 caactctatt ctaatcaact acctcttgat tttgcacttg gcaagtccaa tgatcctgta
1981 aaggaccttt cattggtatt tggcaatcac gaatctactt gataagggcc attcatccca
2041 caatctcaat atcagtatat tctcttattt attaattgag gttgcgtatt tatgttttct
2101 ttgtggtata taatatatca ttatttattg cattttaaaa cttttttttt cattattttt
2161 ctctgttgaa tgcagtcaac atctccctta atattcgata ctcatcttta atactatcag
2221 caaaaagttt aagataaaat attcttaaaa acgatttcgc ccaggatcga actggggacg
2281 ttctgcgtgt taagcagatg ccataaccga ctagaccacg aaaccactat ttgttgaaat
2341 ctttaacaag cgaggaaact ttagtgaaat attaaaagga ctaaaccgga ccggttcttc
2401 tatttctaca tatctaatca cccaatgtag tttttgaata tttgtgaata gttggataat
2461 tgttgggatc ccattgtgca caaaggccat aatattatgt ctacagaata tactagatgt
2521 cctccctata ggatatagta atcctctaaa tggaaccgat atttctacat aataatatta
2581 cgattattcc tccttccgtt ttatatgttt cattatccta gcacactatc aatctttgca
2641 tttcagcttc cattagattt gatgactatt tctcaatctt tatgttatct ccttacgccg
2701 catgtgataa tatactgcta gtatgactac tagttgatag aagatagttg atttttactc
2761 caacaaaagt aacaatatta tttagaacta tagattccat tttgtgcatt cccatattct
2821 cgaggaaaac ttttagtata ttctgtagac ataatattat cgcctttgtg aacaatagaa
2881 tcccaacaat tgtcgcaaat ttaccaattt tctaga
//
LOCUS YSCOSCP 1451 bp ds-DNA PLN 05-MAR-1990
DEFINITION S.cerevisiae oligomycin sensitivity conferring protein (OSCP) gene,
complete cds.
ACCESSION M32487
KEYWORDS oligomycin sensitivity conferring protein.
SOURCE S.cerevisiae (strain D-273-10B, ATCC 25657) DNA, clone 62-17.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1451)
AUTHORS Misook,U., Jones,D. and Mueller,D.M.
JOURNAL Unpublished (1990)Univ. of Health Sci., North Chicago, IL 60064
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
U.Misook, 02-MAR-1990.
FEATURES from to/span description
pept 498 1136 oligomycin sensitivity conferring protein
(OSCP)
mRNA 429 > 1136 OSCP mRNA (alt.)
mRNA 430 > 1136 OSCP mRNA (alt.)
BASE COUNT 484 a 245 c 273 g 449 t
ORIGIN
1 agatctggct gcaaacctgg tctacataaa tgtagtacat tataagggac taccaagaaa
61 tacgaagcgc tcagatatct cttaaagttc atttttttgc cggaataatc ttggtttagt
121 aacttataaa actacttacg tccaataaat cgttatattt tatgccgtcg tcataaagtg
181 gacactttcg agagacggtg ctggccaata agatgctgat atttgaaata tcatttaccc
241 gggcccgggt gggaaaaaca aggaattaaa gataatgaag tgatcttttg gcgagatagt
301 aaaactttca attatctagg aacaatttca aacgccaatt gttaactttg tgtacagtat
361 ttctagtttt tatctcattg aaaacaaaat tcagtcaact tggaaattta agcaggctac
421 tgtggcattg agatattttt ccgcagagta agaaccgatt agcaagttac cgcattattt
481 ttaaaccgtg gacaatcatg tttaatagag tctttaccag gtcatttgca tcaagcttaa
541 gagctgctgc ttccaaagct gctgctcccc caccggtgag attgttcggt gttgagggca
601 cttacgcaac agctttatat caggccgctg caaagaactc ctctattgat gctgcctttc
661 aatctttgca aaaagttgaa agcacagtca aaaagaatcc aaaattaggt cacttattac
721 tgaatcctgc attgtcattg aaggacagaa attctgtcat tgatgccatc gtggagactc
781 acaagaattt ggatggttat gtcgttaacc tattgaaggt tctttccgaa aataacagac
841 tgggatgttt tgaaaaaatt gcgtctgact ttggcgtttt aaacgatgct cacaacgggt
901 tacttaaagg tacagtgact agtgctgaac cattggatcc taaatctttc aagaggattg
961 aaaaagcttt gagcgcctcc aaattggtgg ggcaaggtaa atctttgaag ctggaaaacg
1021 tggttaagcc agagattaag ggtggtttga ttgtagaact tggtgacaag actgttgact
1081 taagcatttc tacaaagatt caaaaactga ataaggtctt agaggacagc atttaacgtt
1141 aataatatag ttggatatgc tcaactcgtt tgatgttact ttctttgcct ccctcctaat
1201 ttttgattct gttcaaatat atattctttt ccgtgaattt ttcctaaaca tcaaaattac
1261 gtacatatac gaataaataa gaaacccctc taaaaagtga atttttttgt tcgagacaac
1321 cattttaatc atatatatgt tcgtgtatcc aggcaaactt atattaatcg caaaaaaaaa
1381 aattataagt gtatatatgg aataaacata aaaaataata aaagaagaaa aaaaaaaata
1441 atgaagaatt c
//GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS CHKMIM1 5818 bp ds-DNA VRT 25-OCT-1989
DEFINITION Chicken myeloid protein-1 (mim-1) gene, complete cds.
ACCESSION M29448
KEYWORDS myeloid protein-1.
SOURCE Chicken (strain SPAFAS) hatchling myeloblast DNA, clone 35B-J612.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 445 to 704)
AUTHORS Ness,S.A., Marknell,A. and Graf,T.
TITLE The v-myb oncogene product binds to and activates the promyelocyte-
specific mim-1 gene
JOURNAL Cell 59, 1115-1125 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 444; 705 to 5818)
AUTHORS Ness,S.A.
JOURNAL Unpublished (1989) EMBL pf 10.2209, 6900 Heidelberg 1, Germany
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.A.Ness, 25-OCT-1989, for release after publication.
FEATURES from to/span description
pept 713 758 myeloid protein-1, exon 1
1313 1445 myeloid protein-1, exon 2
2411 2562 myeloid protein-1, exon 3
3069 3245 myeloid protein-1, exon 4
3992 4109 myeloid protein-1, exon 5
4628 4782 myeloid protein-1, exon 6
5316 5515 myeloid protein-1, exon 7
pre-msg 687 5546 mim-1 mRNA and introns
IVS 759 1312 mim-1 intron A
IVS 1446 2410 mim-1 intron B
IVS 2563 3068 mim-1 intron C
IVS 3246 3991 mim-1 intron D
IVS 4110 4627 mim-1 intron E
IVS 4783 5315 mim-1 intron F
signal 445 704 promoter sequence
signal 625 621 (c) CAAT box
signal 659 664 TATA box
signal 5526 5531 poly-A signal
BASE COUNT 1785 a 1235 c 1132 g 1666 t
ORIGIN
1 aactgtggct tggcaccatc catttcagaa tctgagaggt tttaacaaga tttctctgtg
61 aaaaaaacag ggttccagag aaaagactcc atggattgat gtccagacat ggtgaaaata
121 tcagacgaac agaaacagtc tatgtcaaga gcacctgcac tcttagccta agtaatatgt
181 gtactaaaca aacattacac tttcagtgtc ttctgtcgtg tcgtattgaa gccataatgg
241 gcacatacag ctggggagga ggatctggaa ccagcatgag aaagacaact ggtgatgccc
301 acaacagaca acacatggag ttttgcccat cttttcccac cagctgaatc ctctggatcc
361 aattcttgtt tcctatcatt tttggcctaa gcgtgtcctt gcttggggaa ggagaagaaa
421 aagcgttaag aaatggttca accaaagctt cctgaaaagt gggattgttc atttccatct
481 gttaatgtga gctaagacac ccgttacttt acctgtcttt cccaaccagc tctaaaaaac
541 cgttataatg tacagatatc tttagtgcct aatatggcaa ggctgtctgc tgcggcgctg
601 taacacttgg ctcaggactg attggccaac acaacagtcc cacactttgg aaaacacata
661 aatactacag agagctactg taggacgtct ttgactcgct caaaggcacg agatgccagc
721 cctcagcctg atcgccctgc tcagcctggt gtccactggt gagttgggac tgctttggtg
781 tggtttgtct cgagtctgta cagacagaca ctaattaata actgaaatta atttcagcat
841 aaagctaagc tctttaaatg tgtttttatg atgaacatat gtaatgtgcg ctagtatata
901 actattaaca aaaatacaaa gggtgtgttt tacagataga gctatcctga agtgctttgg
961 tttcatggaa aaagtagttg gcattatagc agaaactaat attagaaatc tgtgtctaca
1021 gctttacttc cttttactgt atgtaccttt atcaaggcac aaggcttcca ggaaagcagt
1081 caaatatgca taacaggttg ttcagtgttc ttacagctgc agtttggggc attatgaagg
1141 cccacaaacc atcatttacc tgtaattaag gaggaaacct ctcctgtgag gcagaagtca
1201 cgacacgtgt cttccatctt aaacattgca ttttgcagct taccactttc agttgcacaa
1261 cactaacgca ttcctatttc tattttcctg ttttcctctt ctgtatgggc agcttttgcc
1321 aggcagtggg aggtgcaccc tccgcagcag cagggcaggc actgggcgca gatatgcagc
1381 gggaatcctt tcaatagaat ccggggctgt gacagatacg gctgcggcaa ttacggagcc
1441 agcaggtaac agggcaggca gcaggggcct ggagcaggaa tgcatcatgg ctttcctaca
1501 ccaaatgttc acaaagcaag aatgagatat aaccaacttt taaaaacgaa aggcttgcta
1561 atattttgcc ttctttctcc aagatttaag agcaatacac aaaatcacca gtcgtgcctt
1621 tcatataaca ttcattgata tttctgctgt actttaaatg aaagagagct tctctatgtt
1681 ttctcttact cactttgaga cattactaag caatctcaca gaaaaaaata tcagtgtaat
1741 ttagactaac agatgtatgg gtaaagagta aagaagagag accactacat aaactagcag
1801 gctgagtagg aaccgcaatg gaaagcattc caaggactag tgtaattcat ccttggttct
1861 tatagacaag acacttaact agcaatcatc aatagagctt ataggaagct ctacaaatta
1921 ctttacacaa gctaaagatc tgcagacacc actgcctctg cataggcacc ttataacaga
1981 gaaggttatg ctgtcctaac aaatgctccc aggcccttcc agctcaggat attctctgat
2041 tctatgatct ctactacaga acttaagaat tgctaagtga ggaaaatgcc aaatatacat
2101 ttctatttga catggcacag atgcattctg gggtgggaac aggggtactg tgatggtctg
2161 gcgtcctatg aatttgggag aattaaaaac attcattaaa gtttgaaatt ctgcccccat
2221 ttaaagtggg tcatactaat acaaacacgt tgcaacaaaa atacgactga aactagcttc
2281 tccccactag cttctcccaa tttcactgca gatgtaaaac cttacgtttt aagagaagct
2341 tcatatgtag tgctacgtga tctatctatt aggcctgaca gattgcataa ttacttaatt
2401 atgtcttcag acagggtaaa ggagaaaagc acaagggcgt ggatgtcatc tgcactgatg
2461 gatcaatagt gtacgctccc ttcagcggcc agctctccgg acccattcga ttctttcata
2521 atggaaatgc cattgatgat ggagtccaaa tctctgggtc aggtaagtaa taacagcaca
2581 caggtcagaa ggcaggcgct acagatctta tttacctgag taagatattt ctgtgcctct
2641 tgaagtcaag cagattgctg atttagtctc tctagttctt atactgaagc acaactacac
2701 tgctgctcag aaataccaaa agctccagca ggatacagcc tttttcttag agagaacaga
2761 agaagggagt cgaaaaacac atagcagaaa aacgactgca agtcaaacaa aggaaacagt
2821 aaaaacaaag catccgaatc ctcagttttg atgagtggat attttcctgc ctctgttttc
2881 caacttcctt tttttccttt ttcctatttt ctatagggat gtgatactcg caagattgtt
2941 ggtaccttga ctccatttca gcttcaccac tgtagccccc agtaacagcc cagtgaggaa
3001 gatttcttgc actctgcctc aacagatgca gctggtgttt cctgaccttc ctctcccatc
3061 ttttccaggt tactgtgtaa aactggtctg cattcatccc atcagatacc acggccaaat
3121 ccagaaaggg caacaacttg gaagaatgct gccaatgcaa aaagtgtttc ctggcattgt
3181 gtctcacatt cacgtcgaga actgcgacca gtctgatcct actcatctcc tcagacctat
3241 acctggtaaa aaaaacatgc aacaaacatt tgagttttat catgaatgaa aatatgttta
3301 attgcaaata acaataatat taaacaatag gagtctcctc tgtcataggg tctcctaaag
3361 agcaacctcc aaaagagaga agacatgcat tatagcataa caggaccaca gaaaggaagg
3421 aacatgtatt agcttactga gctttatttc tcacccctca ctgccctccc atggcacagc
3481 agcatcaagt tccagcaaca gagtcagatg tctcttttga tccttctaac taccagcaat
3541 gtttccaata ttacatctta actagatcaa actttaaatt tgcacttgaa aacaatcatc
3601 ctaaaggttc tggtgatggt ttaggactta aggactttga ttccattagt gtaggcacag
3661 aataagaaaa atccaaatcc taccccctca gtttacactc tggtttataa ctgtgataat
3721 cctgtttttg tgtttttttt cctctttgaa gcgtgttctt tatatcagct caccgtaatc
3781 acaaagcatt aagatgtagt ttgaggcact tgttccactt atgtttccaa aggggtgaca
3841 aattttatgg agaatcaaac accaactgaa catgcaagat gtaattccag catgtttctt
3901 cctaactaaa atgaaaatgt aatagttgca aatatgaaat acattaaaaa aaaccaacac
3961 taccttcatt tcctgtttct gtatttcata gatatttcac caccattccc acaacaagat
4021 gctcactggg ctgtagtatg tgctgggaat cctacaaatg agataagagg ctgtgataaa
4081 tatggctgtg gatactttgg agctccaagg tactatactt gaacacatgg gggttttatt
4141 attgctctgt tggaatagac ttgaactctg tttaagctct gaaagtttag gttgaacata
4201 agggaacact tactacagag gtaatgcagc ataaagaaca ggctgccaat gttgttcatg
4261 aaaccgtcat acttggaggg tgttgggtta gacaaagtca tgcctgacct ggtgttggga
4321 acaatacaag ctgaaggtag ggctaaacgc cctccacagg taccttccaa ccagtacttc
4381 tccagctcta gagttctatt gcaaagctag ggagaaatac atatccaccc acacaattcc
4441 caaaagctgt aaatctagca gactatataa cattacctta cctaccaaaa cagaatttct
4501 acacattggt caatcatggc agcaccatct tagtactact aacctgtttt tagaacttgc
4561 taagcgacct ctctctgcat aacgtaaggt ctggaagaga cttctctcat tacatatttg
4621 tttccaggcg caatggtaaa ggagagaagc acaagggcgt ggatgtcatc tgtgctgatg
4681 gtgcaacagt gtatgctccc ttttctggtg agctgtctgg accagttaaa ttctttcata
4741 atggaaacgc cattgatgat ggagtccaaa tcaggggatc aggtataaag ctattttcat
4801 ttatttcatt taatagttca ttttgactca agttcatgta ggatacgttc cctcacaggt
4861 agttgagtat atcagcaaag cacacatttc cagctgctta gctccacatt cagaaatttc
4921 tgctcgtttc ttaggatgaa aatataaaca ctcaatctta aagacaacga gggaagatct
4981 gccttacatg aactattatt tagttttctc tctacaagga aaaaaaaaat tggttgggct
5041 gcaaattaca tcagaaaaaa ttaaaaacgg ggagcttgag ttctgtacaa tgtcttcaag
5101 ctcttctgct aaagcaagaa ttctcacctg tcccttcatt ttcctttccc taacaagcca
5161 gcaactccaa ccgctctctc tactttttac agtatttttg atagcattgc atttccaaaa
5221 ctgctatctc agtggaaagg acaagagttg ttcaaataga atgcaaaaga acaaatgtac
5281 atcatctaat tttttttctt cctcttctta accaggcttc tgtgtcaaac tgctgtgcat
5341 ccatcccatc agatacaatg gtaggatttc taagggacaa gtccttggca gaatgttgcc
5401 gatgcaaaga gtatttcccg ggtacatatc tcatattcat gttgagaact gcgatcgctc
5461 agatcctact agcaatcttg aaagggggaa aggagaaagt gaaatggaag tgtaataagt
5521 tctgaataaa tacatctcag cattcaaatg ctgtttttgt tctcacatgg atggtctatc
5581 tcaatgaaag aaggaaaatg tactttacaa atagaataaa gtgtacacag caggcacaac
5641 aatagattac tgatgcaata tttttgggcc ctacttaata ctgtaaggtc ctaacatcaa
5701 catctctcca ggtaccagtg aggtactgat ttttaagtat tggtagttgt gaatagctac
5761 tgttgcagct tcgccagttt acaaactttc tgcattttgg aatacaaagt tatcgaaa
//
LOCUS CHKMMIM1X 1054 bp ss-mRNA VRT 25-OCT-1989
DEFINITION Chicken myeloid protein (mim-1) mRNA, complete cds.
ACCESSION M29449
KEYWORDS myeloid protein-1.
SOURCE Chicken (strain SPAFAS) hatchling myeloblast, cDNA to mRNA, clone
J612.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (sites; for [2])
AUTHORS Ness,S.A., Marknell,A. and Graf,T.
TITLE The v-myb oncogene binds to and activates the promyelocyte-
specific mim-1 gene
JOURNAL Cell 59, 1115-1125 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1054)
AUTHORS Ness,S.A.
JOURNAL Unpublished (1989) EMBL pf 10.2209, 6900 Heidelberg 1, Germany
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.A.Ness, 25-OCT-1989, for release after publication.
FEATURES from to/span description
pept 27 1007 myeloid protein-1
BASE COUNT 297 a 241 c 265 g 251 t
ORIGIN
1 gtctttgact cgctcaaagg cacgagatgc cagccctcag cctgatcgcc ctgctcagcc
61 tggtgtccac tgcttttgcc aggcagtggg aggtgcaccc tccgcagcag cagggcaggc
121 actgggcgca gatatgcagc gggaatcctt tcaatagaat ccggggctgt gacagatacg
181 gctgcggcaa ttacggagcc agcagacagg gtaaaggaga aaagcacaag ggcgtggatg
241 tcatctgcac tgatggatca atagtgtacg ctcccttcag cggccagctc tccggaccca
301 ttcgattctt tcataatgga aatgccattg atgatggagt ccaaatctct gggtcaggtt
361 actgtgtaaa actggtctgc attcatccca tcagatacca cggccaaatc cagaaagggc
421 aacaacttgg aagaatgctg ccaatgcaaa aagtgtttcc tggcattgtg tctcacattc
481 acgtcgagaa ctgcgaccag tctgatccta ctcatctcct cagacctata cctgatattt
541 caccaccatt cccacaacaa gatgctcact gggctgtagt atgtgctggg aatcctacaa
601 atgagataag aggctgtgat aaatatggct gtggatactt tggagctcca aggcgcaatg
661 gtaaaggaga gaagcacaag ggcgtggatg tcatctgtgc tgatggtgca acagtgtatg
721 ctcccttttc tggtgagctg tctggaccag ttaaattctt tcataatgga aacgccattg
781 atgatggagt ccaaatcagg ggatcaggct tctgtgtcaa actgctgtgc atccatccca
841 tcagatacaa tggtaggatt tctaagggac aagtccttgg cagaatgttg ccgatgcaaa
901 gagtatttcc cgggtacata tctcatattc atgttgagaa ctgcgatcgc tcagatccta
961 ctagcaatct tgaaaggggg aaaggagaaa gtgaaatgga agtgtaataa gttctgaata
1021 aatacatctc agcattcaaa aaaaaaaaaa aaaa
//GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS SYCTREA 76 bp ss-tRNA RNA 15-JUN-1989
DEFINITION Synechocystis sp. Glu-tRNA.
ACCESSION M19535
KEYWORDS transfer RNA; transfer RNA-Glu.
SOURCE Synechocystis PCC 6803 tRNA.
ORGANISM Synechocystis sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Chroococcales.
REFERENCE 1 (bases 1 to 76)
AUTHORS O'Neill,G., Peterson,D., Schon,A., Chen,M.-W. and Soll,D.
TITLE Formation of the chlorophyll-precursor delta-aminolevulinic acid in
cyanobacteria requires aminoacylation of a tRNA-Glu
JOURNAL J. Bacteriol. (1988) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
D.Soll, 30-JUN-1988
FEATURES from to/span description
tRNA 1 76 Glu-tRNA
modified 20 20 d
modified 54 54 t
modified 55 55 p
site 34 34 modified bsae (presumably uracil)
BASE COUNT 15 a 25 c 21 g 14 t 1 others
ORIGIN
1 gcccccatcg tctagaggcc taggacacct ccctntcacg gaggcgacag ggattcgaat
61 tcccttgggg gtacca
//
LOCUS SYCTREAB 1000 bp ds-DNA BCT 13-FEB-1990
DEFINITION Synechocystis sp. Glu-tRNA gene.
ACCESSION M32099
KEYWORDS transfer RNA; transfer RNA-Glu.
SOURCE Synechocystis PCC 6803 DNA, clone pS6TE.
ORGANISM Synechocystis sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Chroococcales.
REFERENCE 1 (bases 1 to 1000)
AUTHORS O'Neill,G.P. and Soll,D.
TITLE Expression of the Synechocystis strain 6803 tRNA-Glu provides a
functional excess of tRNA for protein and chlorophyll biosynthesis
JOURNAL Unpublished (1990) Yale Univ. New Haven, CT 06511
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
D.Soll, 14-FEB-1990.
FEATURES from to/span description
tRNA 201 273 Glu-tRNA
signal 97 136 inverted repeat/stem loop
signal 130 135 -35 transcriptional promoter element
signal 154 159 -10 transcriptional promoter element
signal 293 320 inverted repeat/rho-dependent terminator
BASE COUNT 261 a 223 c 254 g 262 t
ORIGIN
1 cccgatggca ctaaactggt gacggtccac gaacccattc gctagtaatg agctggccgg
61 tcagggcgga ctgtttttta ccgttaaacc ccaggcctgt tgaccgtaga aaaaattcat
121 tgtctagggt tgacagtgat caatctttat cgctatattt aaaaacgtct gaaataacga
181 actgttatcg agactgccta gcccccatcg tctagaggcc taggacacct ccctttcacg
241 gaggcgacag ggattcgaat tcccttgggg gtattaaaaa aaaagaaaaa acgaaaagaa
301 gttcttcggg gcttcttttt tgttgctcac ccaaacaatt ccccaagaag tttaaaaccc
361 tctttcggtt tatggaacag aagaagaagc ggggagcttg tcgagggaaa cagcgcagaa
421 atggcaagac ccaaactcca tctatggcag tataggggcg tagttctttg tccttgaggc
481 tctgtctggc aaatgttaat cctgcgtcgc ctaatactag ttgtttccct tagttttgcc
541 acgatcgcca ttgacattac tacgagacca gggctggaac cagtgcaagg ccagggggtg
601 gagacagtag actcgcaaac actaaaagtg ggggtggtgg gcaatcctcc ctttgtcttt
661 tatggagaag gaaaaaacgc agcatttacc ggtattagcc tagatgtgtg gagagctgtg
721 gccgagtccc agaagtgaat tcagaatatg taaggcaaaa ttccatctcc gcaggcatca
781 cagcggtggc ggaaggggaa ttggatattt taattggtcc cattagtgtt accccagagc
841 gggccgccat agagggcatt accttcaccc agccctattt cagtagtggc attggcctgt
901 attccaggca aaccggtgag tttatgggaa agattttcgc ctttttttgg catagcgctt
961 tgtcttcagc gggggtactt actctacttt tatttttagg
//GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS RATADHX1 546 bp ds-DNA ROD 27-OCT-1989
DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 1.
ACCESSION M29516 J04747
KEYWORDS alcohol dehydrogenase.
SEGMENT 1 of 8
SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH6.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 124; 125 to 239)
AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
Qulali,M., Zhang,K. and Edenberg,H.J.
TITLE Structure and expression of the rat class I alcohol dehydrogenase
gene
JOURNAL Genomics 5, 906-914 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 239 to 546)
AUTHORS Crabb,D.W.
JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES from to/span description
pept 161 + 178 alcohol dehydrogenase, exon 1 (EC 1.1.1.1)
pre-msg < 1 > 422 ADH mRNA and introns
IVS 179 > 546 ADH intron A
BASE COUNT 162 a 101 c 127 g 156 t
ORIGIN
1 gaataaactg atgggtgtgt cgtcgtgaag ggctagatca catgtgggat cagctgacac
61 cccagcaaat aaaatgctgt ggcaactttg gcccagttca ctgcaggagg gaagaagtct
121 acgaggatcc atccatttcc tgcccacagc gaaggacagc atgagcacag ctggaaaagt
181 aaggaaaaat acctcaccct tggaaatata ttctcattga tatgtgaaga gatgcagaat
241 cctaacttag aaatgacagg catcgctggg gtctggtgca gcttgactgt actaagtatc
301 tttaatgaga gggtatagcg tctaggtttg tgaatgcctt catttcctca ttaactctgt
361 aggacttttt ccaagccaat ggtgactgct ttgcaaatag aaagttttta ggtcttttat
421 cgaaagtctg tatttatctt aagataccaa caaatgtgct gtccttcgta atcatatttt
481 attatgaaca caggaacgtt aaatcggaca gagtttcata tgaggaattt gcttcaggga
541 atttag
//
LOCUS RATADHX2 237 bp ds-DNA ROD 27-OCT-1989
DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 2.
ACCESSION M29517 J04747
KEYWORDS alcohol dehydrogenase.
SEGMENT 2 of 8
SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH6.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 233)
AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
Qulali,M., Zhang,K. and Edenberg,H.J.
TITLE Structure and expression of the rat class I alcohol dehydrogenase
gene
JOURNAL Genomics 5, 906-914 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 234 to 237)
AUTHORS Crabb,D.W.
JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES from to/span description
pept + 112 + 213 alcohol dehydrogenase, exon 2 (EC 1.1.1.1)
pre-msg < 1 > 237 ADH mRNA and introns
IVS < 1 111 ADH intron A
IVS 214 > 237 ADH intron B
BASE COUNT 54 a 49 c 67 g 67 t
ORIGIN About 2.0 Kb after segment 1.
1 aattatgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gagtgataga
61 aggatacaat tgcgtgatct cgttggagct gaagtaaccc tttgcttcca ggtaatcaaa
121 tgcaaagcgg ccgtgctatg ggagcctcac aagcccttca ccatcgagga catagaagtc
181 gcacccccca aggcccatga agttcgcatt aaggtgaaca ccccttttca agtctgt
//
LOCUS RATADHX3 186 bp ds-DNA ROD 27-OCT-1989
DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 3.
ACCESSION M29518 J04747
KEYWORDS alcohol dehydrogenase.
SEGMENT 3 of 8
SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH6.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 11 to 186)
AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
Qulali,M., Zhang,K. and Edenberg,H.J.
TITLE Structure and expression of the rat class I alcohol dehydrogenase
gene
JOURNAL Genomics 5, 906-914 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 10)
AUTHORS Crabb,D.W.
JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES from to/span description
pept + 30 + 169 alcohol dehydrogenase, exon 3 (EC 1.1.1.1)
pre-msg < 1 > 186 ADH mRNA and introns
IVS < 1 29 ADH intron B
IVS 170 > 185 ADH intron C
BASE COUNT 44 a 44 c 56 g 42 t
ORIGIN
1 gacaactgaa cttgatcttt aacccacaga tggtggccac cggagtctgc cgctcagacg
61 atcacgcggt tagtggatcc ctgttcacgc ctcttcctgc agttctaggc cacgagggag
121 ctggcattgt tgagagcatt ggagaagggg tgacttgtgt gaaaccaggg tacagaatag
181 agctga
//
LOCUS RATADHX4 586 bp ds-DNA ROD 27-OCT-1989
DEFINITION Rat alcohol dehydrogenase (ADH) gene, exons 4 and 5.
ACCESSION M29519 J04747
KEYWORDS alcohol dehydrogenase.
SEGMENT 4 of 8
SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 119 to 582)
AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
Qulali,M., Zhang,K. and Edenberg,H.J.
TITLE Structure and expression of the rat class I alcohol dehydrogenase
gene
JOURNAL Genomics 5, 906-914 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 118; 583 to 586)
AUTHORS Crabb,D.W.
JOURNAL Unpublished 1900 (1989) Med Res. build. rm 424 Indiana U. Ind., IN
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES from to/span description
pept + 154 240 alcohol dehydrogenase, exon 4 (EC 1.1.1.1)
332 + 554 alcohol dehydrogenase, exon 5
pre-msg < 1 > 586 ADH mRNA and introns
IVS < 1 153 ADH intron C
IVS 241 331 ADH intron D
IVS 555 > 586 ADH intron E
BASE COUNT 140 a 157 c 136 g 153 t
ORIGIN About .9 Kb after segment 3.
1 ttttagagca tattcataat agagacacac atgcacctga aagccaaaac tgactatgag
61 gaaatgatta tggcttggga acatgcatgc tcttctggag gccctaactg ggtcattcac
121 aaccacttgc cattctgttt tctcttcatc cagtgataaa gtcatcccgc tgttctctcc
181 ccagtgtgga aaatgcagga tctgcaagca cccggaaagc aacctctgtt gccaaactaa
241 gtgatgtgtt tcttattgtc ttcttgcaca attgggcggc actctggttt cattctgtct
301 cacatcgttt gtattcctgt gtttcaccaa ggaatctgac acagcctaag ggagctttgc
361 tggacggcac cagcaggttc tcctgcaggg gaaagcccat tcaccacttc atcagcacca
421 gcaccttctc ccagtacact gtggtagatg acatagcggt ggccaaaatc gatgcggctg
481 caccgctgga caaagtctgc ctcatcggct gtggcttctc gactggctat ggctctgccg
541 tccaagtcgc caaggtagga tggacaatgg cccctggaac aaggct
//
LOCUS RATADHX5 409 bp ds-DNA ROD 27-OCT-1989
DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 6.
ACCESSION M29520 J04747
KEYWORDS alcohol dehydrogenase.
SEGMENT 5 of 8
SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 110 to 404)
AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
Qulali,M., Zhang,K. and Edenberg,H.J.
TITLE Structure and expression of the rat class I alcohol dehydrogenase
gene
JOURNAL Genomics 5, 906-914 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 109; 405 to 409)
AUTHORS Crabb,D.W.
JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES from to/span description
pept + 122 + 382 alcohol dehydrogenase, exon 6 (EC 1.1.1.1)
pre-msg < 1 > 409 ADH mRNA and introns
IVS < 1 121 ADH intron E
IVS 383 > 409 ADH intron F
BASE COUNT 108 a 94 c 106 g 101 t
ORIGIN About 2.3 Kb after segment 4.
1 gggtttccaa ctgatggaat gaagtagcca cgataaagga catagcaagg cagtttgtgc
61 agtgtggaac acttccgaac agtcaccatt cagtccactc ttgcgttttc tgaaaataca
121 ggtgacccca ggctccacct gtgccgtgtt tggcctggga ggtgttggtc tgtctgtcgt
181 cattggctgt aaaacagcag gagcagccaa gatcattgcc gtggacatca acaaagacaa
241 gtttgcgaag gccaaagagt taggtgccac tgactgtatc aaccctcaag actacaccaa
301 acccatccag gaagttctcc aggagatgac tgatggaggg gtggactttt catttgaagt
361 cattggccgt cttgatacca tggtatgtac tgtggcatac cttaagatc
//
LOCUS RATADHX6 262 bp ds-DNA ROD 27-OCT-1989
DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 7.
ACCESSION M29521 J04747
KEYWORDS alcohol dehydrogenase.
SEGMENT 6 of 8
SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 49 to 216)
AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
Qulali,M., Zhang,K. and Edenberg,H.J.
TITLE Structure and expression of the rat class I alcohol dehydrogenase
gene
JOURNAL Genomics 5, 906-914 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 48; 217 to 262)
AUTHORS Crabb,D.W.
JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES from to/span description
pept + 63 + 198 alcohol dehydrogenase, exon 7 (EC 1.1.1.1)
pre-msg < 1 > 262 ADH mRNA and introns
IVS < 1 62 ADH intron F
IVS 199 > 262 ADH intron G
BASE COUNT 62 a 68 c 65 g 67 t
ORIGIN About 1.8 Kb after segment 5.
1 attgcctaaa gctgcacagg accacaaaaa gaatctactt taactattaa tgtttattcc
61 agacttctgc cctgttaagc tgccattcag catgcggtgt aagcgtcatt gtcggggtgc
121 ctcccagtgc ccaaagcctc tccgttaacc ccatgtcgct gctgctggga cgcacctgga
181 aaggagcaat attcggcggt atgtatttac agcgaggaga tcaacgtccg catctgttca
241 tgcatgtcaa ggtgggcgtg cg
//
LOCUS RATADHX7 186 bp ds-DNA ROD 27-OCT-1989
DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 8.
ACCESSION M29522 J04747
KEYWORDS alcohol dehydrogenase.
SEGMENT 7 of 8
SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 21 to 186)
AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
Qulali,M., Zhang,K. and Edenberg,H.J.
TITLE Structure and expression of the rat class I alcohol dehydrogenase
gene
JOURNAL Genomics 5, 906-914 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 20)
AUTHORS Crabb,D.W.
JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES from to/span description
pept + 31 + 169 alcohol dehydrogenase, exon 8 (EC 1.1.1.1)
pre-msg < 1 > 186 ADH mRNA and introns
IVS < 1 30 ADH intron G
IVS 170 > 186 ADH intron H
BASE COUNT 47 a 37 c 45 g 57 t
ORIGIN About .9 Kb after segment 6.
1 gatctcatgg agatgctgct ttcccttcag ggtttaagag taaagatgcc gtccccaaac
61 ttgtcgctga cttcatggct aagaagtttc cgttggagcc gctgattact catgttttac
121 cttttgaaaa gataaatgaa gcatttgacc tgctccgtgc tggaaagagg taagctttga
181 gagtat
//
LOCUS RATADHX8 545 bp ds-DNA ROD 27-OCT-1989
DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 9.
ACCESSION M29523 J04747
KEYWORDS alcohol dehydrogenase.
SEGMENT 8 of 8
SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 65 to 360)
AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
Qulali,M., Zhang,K. and Edenberg,H.J.
TITLE Structure and expression of the rat class I alcohol dehydrogenase
gene
JOURNAL Genomics 5, 906-914 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 64; 361 to 545)
AUTHORS Crabb,D.W.
JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES from to/span description
pept + 94 118 alcohol dehydrogenase, exon 9 (EC 1.1.1.1)
pre-msg < 1 > 360 ADH mRNA and introns
IVS < 1 93 ADH intron H
BASE COUNT 130 a 150 c 109 g 156 t
ORIGIN About .7 Kb after segment 7.
1 gttgtctcag agatcgtaac acatgtttta ctcgtgcata tggggagtgt aggtgtggag
61 tgtgaaaacg gtatcttcta ttttgacttt cagtatccgt accgtcctga cgttctgaga
121 acacatgaat gccttcccgg tttctgactc ctccgcctac cgaatcaagc gccagccaca
181 tcacaacctt aaccttgctc ttcagagaca cacagccaat aaagcacctg tgcaagcttt
241 ccgtgagaaa ctgagttttc gatggaacct gttccaaccg catgtttaca atcctatctt
301 attgtttcgc acctcagctg cctgttaaac accgtgatca aaacacccag cccattgctc
361 ttcgcccctc cccggaacac tgtactattt ctggaggtgt tgggtcagtc attatttcag
421 gtttttattg aaggaaacaa ctccatattc attgtccacc aaagggcact tagaacgcac
481 gagcgctcat gtgtggtgct gccttttagt tcttacaaca gagcttctcc acgcttccag
541 ccagc
//GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS CHTHYPAB 2659 bp ds-DNA BCT 12-FEB-1990
DEFINITION C.trachomatis HypA and HypB protein genes, complete cds.
ACCESSION M31739
KEYWORDS .
SOURCE C.trachomatis (serovar A, isolate HAR-13) elementary body DNA,
clone pTA571.
ORGANISM Chlamydia trachomatis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE 1 (bases 1 to 2659)
AUTHORS Morrison,R.P., Lyng,K., Su,H., Yuan,Y. and Zhang,Y.-X.
JOURNAL Unpublished (1990) Rocky Mountain Laboratory, Hamilton, MT 59840
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.P.Morrison, 31-JAN-1990.
FEATURES from to/span description
pept 441 779 HypA protein
pept 817 2451 HypB protein
BASE COUNT 830 a 527 c 613 g 689 t
ORIGIN
1 gtcgattttt gtgatgctcg tcagggggcg gagcctatgg aaaaacgcca gcaacgcggc
61 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc
121 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag
181 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa
241 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga
301 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc
361 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca
421 atttcacaca ggaaacagct atgaccatga ttacgaattc gagctcggta cccggggatc
481 aagcaacgac cctcaagatt aaacctttgg gagatagaat tttagttaaa agagaagaag
541 aagcttccac tgcaagaggc ggaatcattc ttcctgacac tgccaagaaa aagcaagata
601 gagctgaagt tttagctcta ggaacaggca aaaaagatga taaagggcag caacttcctt
661 ttgaagttca ggttggtgac atcgttttaa ttgataaata ttctggccaa gaacttactg
721 tcgaaggtga agagtacgtc atcgttcaaa tgagcgaagt tatcgcagtt ctgcaataaa
781 aactaagaga gtgaagaaga tttaaggagc gcatcaatgg tcgctaaaaa cattaaatac
841 aacgaagaag ccagaaagaa aattcaaaaa ggagttaaga ctttagctga agctgtaaaa
901 gtcactctag ggcctaaagg acgacatgtt gtcatagata aaagcttcgg atcccctcaa
961 gtaactaaag atggtgttac cgttgcgaaa gaagttgagc ttgccgacaa acatgaaaat
1021 atgggcgctc aaatggtcaa agaagtcgcc agcaaaactg ctgacaaagc tggagacgga
1081 actacaacag ctactgttct tgctgaagct atctatacag aaggattacg caatgtaaca
1141 gctggagcaa atccaatgga cctcaaacga ggtattgata aagctgttaa ggttgttgtt
1201 gatcaaatca aaaaaatcag caagcctgtt cagcatcata aagaaattgc tcaagttgca
1261 acaatttctg ctaataatga tgcagaaatc gggaatctga ttgctgaagc aatggagaaa
1321 gttggtaaaa acggctctat cactgttgaa gaagcaaaag gatttgaaac cgttttggat
1381 gttgttgaag gaatgaattt caatagaggt tacctctcta gctacttcgc aacaaatcca
1441 gaaactcaag aatgtgtatt agaagacgct ttggttctaa tctacgataa gaaaatttct
1501 gggatcaaag atttccttcc tattttacaa caagttgctg aatccggccg tcctcttctt
1561 attatagcag aagacattga aggcgaagct ttagctactt tggtcgtgaa cagaattcgt
1621 ggaggattcc gggtttgcgc agttaaagct ccaggctttg gagatagaag aaaagctatg
1681 ttggaagaca tcgctatctt aactggcggt caactcatta gcgaagagtt gggcatgaaa
1741 ttagaaaacg ctaacttagc tatgttaggt aaagctaaaa aagttatcgt ttctaaagaa
1801 gacacgacca tcgtcgaagg aatgggtgaa aaagaagctt tagaagctcg ttgcgaaagc
1861 atcaaaaaac aaattgaaga cagctcttct gattacgata aagaaaaact ccaagagcgt
1921 cttgctaagc tctctggtgg agtagcagtc attcgcgttg gagctgcaac agagattgag
1981 atgaaagaga aaaaagatcg tgtagacgat gctcaacatg ctacaatcgc tgctgttgaa
2041 gaaggaattc ttcctggtgg aggaacagca ttaatccgtt gtatccctac tcttgaagcc
2101 ttcttgccaa tgttgactaa tgaagatgag caaattggag ctcgcattgt tttgaaagct
2161 ctttccgctc ctttgaaaca aattgctgca aacgcaggaa aagaaggtgc tatcatcttc
2221 caacaagtta tgtcccgttc tgcgaacgaa ggatatgatg cattgcgtga tgcatacaca
2281 gatatgcttg aagctggtat tttagatcct gctaaagtaa cccgttctgc tttagaaagc
2341 gcggcttccg tagctggatt acttttgaca acagaagctc tcattgcaga gattccagaa
2401 gaaaaacctg ctgcagctcc agcaatgcct ggcgcaggaa tggactatta attcctctaa
2461 tgggaacaaa tagattcttc gagcctcgtt tccaaaagga acgaggcttt tttttagatt
2521 cctaatattt ctctattcct ctatcgtaaa catctagtgc ttacgaccat ccttttctat
2581 gtttaaacta atcaagagcg catttctcat agcctgttgt attgtagggt acttctggat
2641 aaaaaaagaa agtattgtt
//
LOCUS HUMCA1XIA 6158 bp ss-mRNA PRI 15-MAR-1989
DEFINITION Human alpha-1 type XI collagen (COL11A1) mRNA, complete cds.
ACCESSION J04177 J05407
KEYWORDS alpha-1 (XI) collagen; collagen; type XI collagen.
SOURCE Human placenta fibroblast, cDNA to mRNA, clone OK4 [1].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1773 to 6158)
AUTHORS Bernard,M., Yoshioka,H., Rodriguez,E., van der Rest,M., Kimura,T.,
Ninomiya,Y., Olsen,B.R. and Ramirez,F.
TITLE Cloning and sequencing of pro-alpha-1(XI) collagen cDNA
demonstrates that type XI belongs to the fibrillar class of
collagens and reveals that the expression of the gene is not
restricted to cartilagenous tissue
JOURNAL J. Biol. Chem. 263, 17159-17166 (1988)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1835)
AUTHORS Yoshioka,H. and Ramirez,F.
TITLE Pro alpha-1(XI) collagen, structure of the amino-terminal
propeptide and expression of the gene in tumor cell lines
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Ramirez, 02-FEB-1990; for [2] by M.Bernard, 21-SEP-1988.
FEATURES from to/span description
pept 162 5582 alpha-1 (type XI) collagen precursor
/nomgen="COL11A1" /map="1p21"
/hgml_locus_uid="LH0170J"
sigp 162 269 alpha-1 (type XI) collagen signal peptide
matp 270 1745 alpha-1 (type XI) collagen
matp 5013 5579 alpha-1 (type XI) collagen
BASE COUNT 1693 a 1355 c 1746 g 1361 t 3 others
ORIGIN
1 aaccatcaaa tttagaagaa aaagcccttt gactttttcc ccctctccct ccccaatggc
61 tgtgtagcaa acatccctgg cgataccttg gaaaggacga agttggtctg cagtcgcaat
121 ttcgtgggtt gagttcacag ttgtgagtgc ggggctcgga gatggagccg tggtcctcta
181 ggtggaaaac gaaacggtgg ctctgggatt tcaccgtaac aaccctcgca ttgaccttcc
241 tcttccaagc tagagaggtc agaggagctg ctccagttga tgtactaaaa gcactagatt
301 ttcacaattc tccagaggga atatcaaaaa caacgggatt ttgcacaaac agaaagaatt
361 ctaaaggctc agatactgct tacagagttt caaagcaagc acaactcagt gccccaacaa
421 aacagttatt tccaggtgga actttcccag aagacttttc aatactattt acagtaaaac
481 caaaaaaagg aattcagtct ttccttttat ctatatataa tgagcatggt attcagcaaa
541 ttggtgttga ggttgggaga tcacctgttt ttctgtttga agaccacact ggaaaacctg
601 ccccagaaga ctatcccctc ttcagaactg ttaacatcgc tgacgggaag tggcatcggg
661 tagcaatcag cgtggagaag aaaactgtga caatgattgt tgattgtaag aagaaaacca
721 cgaaaccact tgatagaagt gagagagcaa ttgttgatac caatggaatc acggtttttg
781 gaacaaggat tttggatgaa gaagtttttg agggggacat tcagcagttt ttgatcacag
841 gtgatcccaa ggcagcatat gactactgtg agcattatag tccagactgt gactcttcag
901 cacccaaggc tgctcaagct caggaacctc agatagatga gtatgcacca gaggatataa
961 tcgaatatga ctatgagtat ggggaagcag agtataaaga ggctgaaagt gtaacagagg
1021 gacccactgt aactgaggag acaatagcac agacggaggc aaacatcgtt gatgattttc
1081 aagaatacaa ctatggaaca atggaaagtt accagacaga agctcctagg catgtttctg
1141 ggacaaatga gccaaatcca gttgaagaaa tatttactga agaatatcta acgggagagg
1201 attatgattc ccagaggaaa aattctgagg atacactata tgaaaacaaa gaaatagacg
1261 gcagggattc tgatcttctg gtagatggag atttaggcga atatgatttt tatgaatata
1321 aagaatatga agataaacca acaagccccc ctaatgaaga atttggtcca ggtgtaccag
1381 cagaaactga tattacagaa acaagcataa atggccatgg tgcatatgga gagaaaggac
1441 agaaaggaga accagcagtg gttgagcctg gtatgcttgt cgaaggacca ccaggaccag
1501 caggacctgc aggtattatg ggtcctccag gtctacaagg ccccactgga ccccctggtg
1561 accctggcga taggggcccc ccaggacgtc ctggcttacc aggggctgat ggtctacctg
1621 gtcctcctgg tactatgttg atgttaccgt tccgttatgg tggtgatggt tccaaaggac
1681 caaccatctc tgctcaggaa gctcaggctc aagctattct tcagcaggct cggattgctc
1741 tgagaggccc acctggccca atgggtctaa ctggaagacc aggtcctgtg ggggggcctg
1801 gttcatctgg ggccaaaggt gagagtggtg atccaggtcc tcagggccct cgaggcgtcc
1861 agggtccccc tggtccaacg ggaaaacctg gaaaaagggg tcgtccaggt gcagatggag
1921 gaagaggaat gccaggagaa cctggggcaa agggagatcg agggtttgat ggacttccgg
1981 gtctgccagg tgacaaaggt cacaggggtg aacgaggtcc tcaaggtcct ccaggtcctc
2041 ctggtgatga tggaatgagg ggagaagatg gagaaattgg accaagaggt cttccaggtg
2101 aagctggccc acgaggtttg ctgggtccaa ggggaactcc aggagctcca gggcagcctg
2161 gtatggcagg tgtagatggc cccccaggac caaaagggaa catgggtccc caaggggagc
2221 ctgggcctcc aggtcaacaa gggaatccag gacctcaggg tcttcctggt ccacaaggtc
2281 caattggtcc tcctggtgaa aaaggaccac aaggaaaacc aggacttgct ggacttcctg
2341 gtgctgatgg gcctcctggt catcctggga aagaaggcca gtctggagaa aagggggctc
2401 tgggtccccc tggtccacaa ggtcctattg gatnnccggg cccccgggga gtaaagggag
2461 cagatggtgt cagaggtctc aagggatcta aaggtgaaaa gggtgaagat ggttttccag
2521 gattcaaagg tgacatgggt ctaaaaggtg acagaggaga agttggtcaa attggcccaa
2581 gagggnaaga tggccctgaa ggacccaaag gtcgagcagg cccaactgga gacccaggtc
2641 cttcaggtca agcaggagaa aagggaaaac ttggagttcc aggattacca ggatatccag
2701 gaagacaagg tccaaagggt tccactggat tccctgggtt tccaggtgcc aatggagaga
2761 aaggtgcacg gggagtagct ggcaaaccag gccctcgggg tcagcgtggt ccaacgggtc
2821 ctcgaggttc aagaggtgca agaggtccca ctgggaaacc tgggccaaag ggcacttcag
2881 gtggcgatgg ccctcctggc cctccaggtg aaagaggtcc tcaaggacct cagggtccag
2941 ttggattccc tggaccaaaa ggccctcctg gaccaccagg aaggatgggc tgcccaggac
3001 accctgggca acgtggggag actggatttc aaggcaagac cggccctcct gggccagggg
3061 gagtggttgg accacaggga ccaaccggtg agactggtcc aataggggaa cgtgggtatc
3121 ctggtcctcc tggccctcct ggtgagcaag gtcttcctgg tgctgcagga aaagaaggtg
3181 caaagggtga tccaggtcct caaggtatct cagggaaaga tggaccagca ggattacgtg
3241 gtttcccagg ggaaagaggt cttcctggag ctcagggtgc acctggactg aaaggagggg
3301 aaggtcccca gggcccacca ggtccagttg gctcaccagg agaacgtggg tcagcaggta
3361 cagctggccc aattggttta cgagggcgcc cgggacctca gggtcctcct ggtccagctg
3421 gagagaaagg tgctcctgga gaaaaaggtc cccaagggcc tgcagggaga gatggagttc
3481 aaggtcctgt tggtctccca gggccagctg gtcctgccgg ctcccctggg gaagacggag
3541 acaagggtga aattggtgag ccgggacaaa aaggcagcaa gggtggcaag ggagaaaatg
3601 gccctcccgg tcccccaggt cttcaaggac cagttggtgc ccctggaatt gctggaggtg
3661 atggtgaacc aggtcctaga ggacagcagg ggatgtttgg gcaaaaaggt gatgagggtg
3721 ccagaggctt ccctggacct cctggtccaa taggtcttca gggtctgcca ggcccacctg
3781 gtgaaaaagg tgaaaatggg gatgttggtc catgggggcc acctggtcct ccaggcccaa
3841 gaggccctca aggtcccaat ggagctgatg gaccacaagg acccccaggt tctgttggtt
3901 cagttggtgg tgttggagaa aagggtgaac ctggagaagc aggaaaccca gggcctcctg
3961 gggaagcagg tgtaggcggt cccaaaggag aaagaggaga gaaaggggaa gctggtccac
4021 ctggagctgc tggacctcca ggtgccaagg ggccgccagg tgatgatggc cctaagggta
4081 acccgggtcc tgttggtttt cctggagatc ctggtcctcc tggggaactt ggccctgcag
4141 gtcaagatgg tgttggtggt gacaagggtg aagatggaga tcctggtcaa ccgggtcctc
4201 ctggcccatc tggtgaggct ggcccaccag gtcctcctgg aaaacgaggt cctcctggag
4261 ctgcaggtgc agagggaaga caaggtgaaa aaggtgctaa gggggaagca ggtgcagaag
4321 gtcctcctgg aaaaaccggc ccagtcggtc ctcagggacc tgcaggaaag cctggtccag
4381 aaggtcttcg gggcatccct ggtcctgtgg gagaacaagg tctccctgga gctgcaggcc
4441 aagatggacc acctggtcct atgggacctc ctggcttacc tggtctcaaa ggtgaccctg
4501 gctccaaggg tgaaaaggga catcctggtt taattggcct gattggtcct ccaggagaac
4561 aaggggaaaa aggtgaccga gggctccctg gaactcaagg atctccagga gcaaaagggg
4621 atgggggaat tcctggtcct gctggtccct taggtccacc tggtcctcca ggcttaccag
4681 gtcctcaagg cccaaagggt aacaaaggct ctactggacc cgctggccag aaaggtgaca
4741 gtggtcttcc agggcctcct gggcctccag gtccacctgg tgaagtcatt cagcctttac
4801 caatcttgtc ctccaaaaaa acgagaagac atactgaagg catgcaagca gatgcagatg
4861 ataatattct tgattactcg gatggaatgg aagaaatatt tggttccctc aattccctga
4921 aacaagacat cgagcatatg aaatttccaa tgggtactca gaccaatcca gcccgaactt
4981 gtaaagacct gcaactcagc catcctgact tcccagatgg tgaatattgg attgatccta
5041 accaaggttg ctcaggagat tccttcaaag tttactgtaa tttcacatct ggtggtgaga
5101 cttgcattta tccagacaaa aaatctgagg gagtaagaat ttcatcatgg ccaaaggaga
5161 aaccaggaag ttggtttagt gaatttaaga ggggaaaact gctttcatac ttagatgttg
5221 aaggaaattc catcaatatg gtgcaaatga cattcctgaa acttctgact gcctctgctc
5281 ggcaaaattt cacctaccac tgtcatcagt cagcagcctg gtatgatgtg tcatcaggaa
5341 gttatgacaa agcacttcgc ttcctgggat caaatgatga ggagatgtcc tatgacaata
5401 atccttttat caaaacactg tatgatggtt gtacgtccag aaaaggctat gaaaaaactg
5461 tcattgaaat caatacacca aaaattgatc aagtacctat tgttgatgtc atgatcagtg
5521 actttggtga tcagaatcag aagttcggat ttgaagttgg tcctgtttgt tttcttggct
5581 aagattaaga caaagaacat atcaaatcaa cagaaaatgt accttggtgc caccaaccca
5641 ttttgtgcca catgcaagtt ttgaataagg atgtatggaa aacaacgctg catatacagg
5701 taccatttag gaaataccga tgcctttgtg ggggcagaat cacagacaaa agctttgaaa
5761 atcataaaga tataagttgg tgtggctaag atggaaacag ggctgattct tgattcccaa
5821 ttctcaactc tccttttcct atttgaattt ctttggtgct gtagaaaaca aaaaaagaaa
5881 aatatatatt cataaaaaat atggtgctca ttctcatcca tccaggatgt actaaaacag
5941 tgtgtttaat aaattgtaat tattttgtgt acagttctat actgttatct gtgtccattt
6001 ccaaaacttg cacgtgtccc tgaattccgc tgactctaat ttatgaggat gccgaactct
6061 gatggcaata atatatgtat tatgaaaatg aagttatgat ttccgatgac cctaagtccc
6121 tttctttggt taatgatgaa attcctttgt gtgtgttt
//
LOCUS STYENVM 1656 bp ds-DNA BCT 14-FEB-1990
DEFINITION S.typhimurium envM protein gene, complete cds.
ACCESSION M31806
KEYWORDS .
SOURCE S.typhimurium (strain AG701i50) DNA, clone KF501.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1656)
AUTHORS Turnowsky,F., Fuchs,K., Jeschek,C. and Hoegenauer,G.
TITLE envM genes of Salmonella typhimurium and Escherichia coli
JOURNAL J. Bacteriol. 171, 6555-6565 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 259 558 ORF-1
pept 676 1464 envM protein
BASE COUNT 413 a 395 c 394 g 454 t
ORIGIN
1 cgatgatatc tcttaaatac ctttctgata tctggcaact atgatttccg atctgaaatg
61 taaacggcat atgtctcatc ctctgaaaat ttattataat acacatgcaa acatataaaa
121 acctgatttg taactactgt aaccacctga tttccaaaat ttttcatatt cagtattact
181 ctgcatcaat aaatagagaa aaccttcagc acctaaaaaa gaaattattt ttgttaagca
241 tttgctggtt aaagttacat gtccataaca tcgctaatag gtgaggtaaa ccatttggat
301 gaatattcag gcttcctaca gaacctttat ttgtctggtg gaaagcgcat cgctatcgta
361 cttgctatcc gtcccggtgt ctctggttgg atgaggcgaa ccttacacca gctggtatac
421 tgcaccgtag ggcctgtccc attagactct ttgatttgcc agtttggccc tggggagtgc
481 tcaaaatcct cagggacttc gttaccgtgt ggttttagaa cgtcacctgc cggagatagc
541 ggcggaaaaa tagcgtgaca cacggggcca ggagtgggat ataatttcag actaatttcg
601 aataatgttg aaaagttgct gttaaccatt gttcgtaaaa gcaaaaaata agcaatgact
661 ataaggatta aagctatggg ttttctttcc ggtaagcgca ttctggtcac tggcgtggcc
721 agtaaactgt ccatcgccta cggtatcgca caggcgatgc accgcgaagg agctgaactg
781 gcgttcacct accagaacga caaactgaaa ggccgcgtag aagaatttgc tgcccagctg
841 ggttccagca tcgttttacc gtgtgatgtc gctgaagatg ccagcattga cgctatgttt
901 gcggagctgg gtaacgtttg gccgaaattt gacggttttg tacactcgat cggtttcgcg
961 cctggcgatc agctggacgg cgattatgtt aatgcggtta cccgcgaagg ctttaaagtc
1021 gctcacgata ttagctctta cagctttgtt gccatggcga aagcctgccg caccatgctg
1081 aatccgggtt ctgcgctgtt gaccttgtct tatctgggag cggagcgcgc catcccgaac
1141 tacaatgtca tgggtctggc gaaagcctct ctggaagcca acgtgcgtta tatggcaaac
1201 gctatgggac cggaaggggt tcgcgtcaac gccatctctg cgggtccaat ccgcacgctg
1261 gcggcgtccg gtattaaaga cttccgtaaa atgctggcgc attgtgaagc cgttacgcca
1321 attcgtcgca ccgtgaccat tgaagatgtg ggtaactcag cggcgtttct gtgttccgat
1381 ctctcggcag gcatttctgg cgaagtcgtc catgttgacg gcgggttcag cattgcggca
1441 atgaatgaac tggaactgaa gtaagcgcat taagtttatt agcgggcggc acggctgtgc
1501 tgccccattc ccctcactca ccactcatct cgttatatag cgttctgata tttgttatca
1561 cctaacaatc tttcatccct ttcccgcata cgccaggata atccagcata aacgatccga
1621 cgccgaatca ttatcgatcg ccatttcatc ctcggg
//GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS DROEMCA1 934 bp ds-DNA INV 07-MAR-1990
DEFINITION D.melanogaster extramacrochaetae protein gene, exon 1.
ACCESSION M32636
KEYWORDS extramacrochaetae protein; helix-loop-helix protein;
regulatory protein.
SEGMENT 1 of 2
SOURCE D.melanogaster 8-12 hour embryo, cDNA to mRNA, clones pNB5B, and
DNA, clones pP7, lambda-B[11,16].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 934)
AUTHORS Ellis,H.M., Spann,D.R. and Posakony,J.W.
TITLE Extramacrochaetae, a negative regulator of sensory organ
development in Drosophila, defines a new class of helix-loop-helix
proteins
JOURNAL Cell (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.W.Posakony, 06-MAR-1990.
FEATURES from to/span description
pept 347 + 910 extramacrochaetae protein, exon 1
pre-msg 85 > 934 emc mRNA and introns (minor alt.)
IVS 911 > 934 emc intron
BASE COUNT 270 a 273 c 244 g 147 t
ORIGIN Chromosome 3, left arm 61D1,2.
1 cgaacccgag agcgaaagac tccgctatcg gattcggaaa tgccgttcgg ggggtataaa
61 agcgggcgct ctgagcgaga caccactcat tcgttgtctg accgctcgca gcaaaagata
121 tagaaatcga gaacgtaaac gcagaaaaca cgctggatta aagtgtttcg tttcgaacat
181 cgacagagtg agaaaacaaa gtgaagtgag cgcacaaagc cgaaatcaaa tcaaattgaa
241 aagccagtca agagcaatct ttaactgaac ctcgcaagta atccggtacg accgtgtaac
301 aaccgccaag attctactca aggagcgtaa agcaatcatc tccagaatga agtccctgac
361 ggccgtctgc cagacaggtg cctccggaat gccggccttg aatgccagcg ggcgcatcca
421 gcgccatccc acgcaccgcg gcgacgggga gaacgccgag atgaagatgt atctgtccaa
481 actgaaggac ctcgttccgt tcatgcccaa gaacaggaag ctcaccaagc tggagatcat
541 ccagcacgtc atcgactaca tctgcgacct gcagaccgag ctggagacgc accccgagat
601 gggcaacttc gatgcggcag ccgctctgac ggcggtgaac ggactccacg aggacgagga
661 cagcgacatg gaggatgcgg atgccgaggc agaagcggaa gtcgatccag atatcctcgc
721 ccagcgcctg aatgccgagc agccggcgaa agtctctagt cccgccgccc gtctcccgct
781 taccgatcgc caaacgccca acactcttgt ggcgcccgcc catccgcagc agcatcagca
841 gcagcagcaa ctgcaactgc agcagcaaca actgcaatca cagcagcaac tgtccaacag
901 tttagcaacg gtgagttaac aaacaaattg ctag
//
LOCUS DROEMCA2 1307 bp ds-DNA INV 07-MAR-1990
DEFINITION D.melanogaster extramacrochaetae protein gene, exon 2.
ACCESSION M32637
KEYWORDS extramacrochaetae protein; helix-loop-helix protein;
regulatory protein.
SEGMENT 2 of 2
SOURCE D.melanogaster 8-12 hour embryo, cDNA to mRNA, clones pNB5B, and
DNA, clones pP7, lambda-B[11,16].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1307)
AUTHORS Ellis,H.M., Spann,D.R. and Posakony,J.W.
TITLE Extramacrochaetae, a negative regulator of sensory organ
development in Drosophila, defines a new class of helix-loop-helix
proteins
JOURNAL Cell (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.W.Posakony, 06-MAR-1990.
FEATURES from to/span description
pept + 31 66 extramacrochaetae protein (emc), exon 1
pre-msg < 1 1140 emc mRNA and introns
IVS < 1 30 emc intron
BASE COUNT 499 a 244 c 193 g 371 t
ORIGIN Chromosome 3, left arm 61D1,2.
1 taatttgaat cttccgcttc tctcttttag ccacagaatg cggagaaaga cagcagacag
61 tcgtaagcgt ggaaacaccc agccgggaaa acccacggaa cctataagcc tattaactaa
121 atgcataatg aaagtacagt caaaactcca ctcgcaacct tcgtgtgaag atctccgcat
181 aagataacgt gtgaaactct caagaaaacc acagcccagg cgcatctctt caacgctcct
241 tttcccggga aagggaaaac ccgggaaaac gagacctcgc ttttggctcc aacagagttt
301 agtttaaaca tttaataaca agaatacaac aacacaaaac ctagcattag agcagcgtaa
361 gcaagtacac attatgatta ttaattaaac attacaacta ggatgatggc gcttaagaat
421 gaacggaaga aagaaaactt ggcaaagctt taaacaattt aattttcctt taacgcttga
481 aattacactg aagcatatat ttttgttttc ctaattttag ttttgttttg gttaaaacaa
541 cgcaagaaaa aatgttacta atgcaaagca aacagagatt agcaataaaa aaaaattata
601 aaaatgcaca ggcaccaaac tggaattacg ttgaatcacc gtttcctgca ttacgattat
661 atgatttcaa tccaaatgat tttcctatta attttaatca atgaagagtt ttcctattat
721 tgattagcca cagccaataa acaacaagaa caaccaactc agcagcacac acacattcaa
781 attcattaat ttatatcgag agagtacgtt tagagtttaa gttcgagaaa gttcatttcc
841 aggcactaat tttcactcac accactcaca tagaaaaaca caaacacctc attttgtagt
901 tttccgagtc ccttggatcg ttttcctctt ttttaatcga aataacgcat aatgcataat
961 aattattacc agatatacac taaaacctat aaatatatat taatatatgt tctgtgatct
1021 gtagcatact ttgtacaaag tgtttttttt agcataatta tgcgatttta attatgatcc
1081 gcaatcgaaa ttagagattt ttgttctcat ttcgatggta aaaacactct atcatgatgt
1141 aaaaaaaaaa aacaaaagaa aacatttgca tgatgttgcg aaaaaatata ttatctaaaa
1201 cataaagaaa aacaaacact agcctcttaa taaattgcaa aaatttaaaa aacattaaaa
1261 atcgaatatg tcaatggatt tccgttggga gcaggggatt aagaaac
//
LOCUS DROFMRFA1 1328 bp ds-DNA INV 08-MAR-1990
DEFINITION D.melanogaster FMRFamide neuropeptide gene, 5' flank, exon 1.
ACCESSION M32640 J05415
KEYWORDS FMRFamide neuropeptide.
SEGMENT 1 of 2
SOURCE D.melanogaster (strain Canton S) DNA, (library of T.Maniatis).
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1328)
AUTHORS Schneider,L.E. and Taghert,P.H.
TITLE Organization and expression of the Drosophila FMRFamide
neuropeptide gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.E.Schneider, 07-MAR-1990.
FEATURES from to/span description
pre-msg 1139 > 1328 FMRFamide neuropeptide mRNA and intron
IVS 1248 > 1328 FMRFamide neuropeptide intron
signal 1066 1071 CAAT box
signal 1109 1114 TATA box
site 40 51 conserved sequence 1
site 130 140 conserved sequence 2
site 309 326 conserved sequence 3
site 374 388 conserved sequence 4
site 394 414 conserved sequence 5
site 507 545 conserved sequence 6
site 574 602 conserved sequence 7
site 637 649 conserved sequence 8
site 671 721 conserved sequence 9
site 746 766 conserved sequence 10
site 860 872 conserved sequence 11
site 894 909 conserved sequence 12
site 1015 1027 conserved sequence 13
site 1028 1041 conserved sequence 14
site 1053 1061 conserved sequence 15
site 1063 1076 conserved sequence 16
site 1086 1095 conserved sequence 17
site 1108 1175 conserved sequence 18
BASE COUNT 340 a 331 c 319 g 338 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttctgg ggtgagtcca tagattacag agcagcgcag ttttcttgac cagatgatgg
61 tctagctgaa ctatttttaa ctgggcgatt ggcacacgac ttggcctggt catcaaactc
121 aagccaagat ttgaacttga actagtctcg ccttatcaca ttgccttcta attcgttttc
181 attttcgatt ttcgcatcta cttatgctaa tgaagccatc tgcagacgtg gttttcgaac
241 gtatttatat tgattatggg tgatcgtcaa caagagcagt ggacacccaa taaacctgtc
301 caaaacccga cacatttctg cccagtcatg cgtggtggac aatagccaaa tgccattgat
361 gagactcgtc tccaaaactt tggccttttg ccgggccgta attacagact tccgtctttt
421 gaacagtttt ttcagcccca cccaagagtc gagtcttgaa aagctggctg ggatggggtg
481 gtttcgggtg ctggacgaga tgccagaggc gccacaatgt atcctgttac aggttacagg
541 gccataaagc gccataaacg ccgcgacggc aatggcaaat tataacgcat acggacacgt
601 agtcgatcca ctggctagaa ggctaattgg acgtgcccgg ccaggatgtc cctgctcatc
661 ccacagtcga cttaaaagtg ggcgtaacat aacaattgtg ggcgatttgc atacaatttg
721 gtgctcgagt ggccacctaa ctggcgtcga aatgttgcta cgtcattgtt gctgcccctc
781 gtgcttcttg gctaataagt cagcatgaat gcggtctgtg ttttggcctt ttgtcccgct
841 gacaaccgct gacaagcaaa acaaacatat cagcggctga ccagcgacca tttgccgaca
901 ttgtcaaggt ttttttttca gccccgaata cacctgctaa tctgtggtgg tgactagcgc
961 gctcggaagg cagaaaaaac tgcagcgaaa atcagttcca ttgaatgtcc tctggcaaca
1021 tgttgccgca gcacttgttg cgcgcgtcca ctcgcagagg cacctcaaca tcagcctcct
1081 tgggcgccag cacaaaggac gcaacgcata taaaacagga tgtttcgcca aacatcctca
1141 cagaactgtt ccaccttcga gcgggcaaca agtgtgtgtg cggcccaaaa ggatccccag
1201 accttcgaat tcactctagt ttcctagtaa ggggacaggt ttcagaggtg agtgccatgc
1261 gaaaagtgtg aaaacagtgc tgagccttaa gcagtcgagt tattctttgg aaaagccaat
1321 agccaatt
//
LOCUS DROFMRFA2 1603 bp ds-DNA INV 08-MAR-1990
DEFINITION D.melanogaster FMRFamide neuropeptide gene, exon 2.
ACCESSION M32641 J05415
KEYWORDS FMRFamide neuropeptide.
SEGMENT 2 of 2
SOURCE D.melanogaster (strain Canton S) DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1603)
AUTHORS Schneider,L.E. and Taghert,P.H.
TITLE Organization and expression of the Drosophila FMRFamide
neuropeptide gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.E.Schneider, 07-MAR-1990.
FEATURES from to/span description
pept 220 1263 FMRFamide neuropeptide precursor
sigp 220 291 FMRFamide neuropeptide signal peptide
matp 527 570 FMRFamide neuropeptide A
matp 571 663 FMRFamide neuropeptide B
matp 664 696 FMRFamide neuropeptide C
matp 697 729 FMRFamide neuropeptide D
matp 730 762 FMRFamide neuropeptide E
matp 763 795 FMRFamide neuropeptide F
matp 796 828 FMRFamide neuropeptide G
matp 829 861 FMRFamide neuropeptide H
matp 862 894 FMRFamide neuropeptide I
matp 895 921 FMRFamide neuropeptide J
matp 922 954 FMRFamide neuropeptide K
matp 976 1002 FMRFamide neuropeptide L
matp 1003 1035 FMRFamide neuropeptide M
matp 1036 1065 FMRFamide neuropeptide N
pre-msg < 1 1538 FMRFamide neuropeptide mRNA and introns
IVS < 1 219 FMRFamide neuropeptide intron
signal 1404 1408 poly-A signal
signal 1482 1486 poly-A signal
signal 1507 1511 poly-A signal
BASE COUNT 409 a 398 c 431 g 365 t
ORIGIN Undetermined number of bp after segment 1.
1 gcggctggtt aacttttaac gcgccggccc aacagatctg gccataaata tcatgatggt
61 taattagcag gacactttgt ctgcgcgtcc tggctgccgg tggctactgt taccctaagt
121 aaaacgaacg gtcttaatgt gattaaaagt tgcaatagat aacgcgagat gccgccgcct
181 cataattatc ctgccgtcac tccattctta tccttgcaga tgggcattgc cttgatgttc
241 ctgctggccc tgtaccagat gcagtcggcc atccacagcg agatcatcga tacgcccaac
301 tatgcgggca actcgttgca ggacgctgac tccgaggtga gtccatcgca ggacaatgac
361 ctggtagatg cactgctcgg caacgatcag accgagaggg cggagctgga gttccggcac
421 cccatctctg tgattggcat cgactactcg aagaacgccg tggtgctgca cttccagaaa
481 cacggccgga aaccgcgcta caagtacgat cccgagctgg aggccaagcg aaggtccgtg
541 caggacaact tcatgcactt cggcaagagg caggcggagc agctgccacc ggagggcagc
601 tatgctgaat ccgatgaact ggagggcatg gccaagcgag cagctatgga tcggtatggc
661 agagatccca agcaggactt catgcggttt ggtcgggatc cgaaacagga cttcatgagg
721 tttggcaggg atccaaagca ggacttcatg agattcggtc gggatcccaa gcaggatttc
781 atgagattcg gtcgagatcc caagcaggat ttcatgaggt ttggacgcac tccggctgag
841 gatttcatga ggttcggacg cactccggcg gaggacttca tgaggttcgg acgctccgac
901 aatttcatgc gcttcggacg cagtccccac gaggagcttc gcagtcccaa acaggatttc
961 atgcgattcg gtcgcccgga caacttcatg cgcttcgggc gttccgctcc gcaggatttt
1021 gtgcgctccg ggaagatgga ctcaaacttc attcgattcg gtaagagctt gaagccggcg
1081 gctcccgagt ccaagccagt caagtccaat caaggcaacc caggcgaacg cagtccagtg
1141 gacaaggcca tgacggagct gttcaagaaa caggagctgc aggatcagca ggtgaagaac
1201 ggcgcacagg cgaccaccac gcaggatggg agtgtggaac aggaccagtt cttcggccag
1261 tgaggtagtc ctgcgggacg cctccttgta aatagatatg gacaaatgta cgcaaggatc
1321 taaattgata tacgtatata acccactcct cacacgaact cctgacttat gcctgaacta
1381 tgaattttta atgaatgggc tggattaaaa attcaccgtg ctttgaagtt cttatctata
1441 aatatatcta gtgtaatatt gaagaaattg aaattggcgt gaataaaatc ctgtggcaac
1501 attttaaata aagattgctt tactgtaaat tatggcgcac tcgaatagcc ttatttcatt
1561 cggtcgaggt acatattgta tagccattat tagtttctcc act
//
LOCUS DRVFMRFRN1 3189 bp ds-DNA INV 08-MAR-1990
DEFINITION D.virilis FMRFamide-related neuropeptide gene, 5' flank, exon 1.
ACCESSION M32642
KEYWORDS FMRFamide-related neuropeptide; neuropeptide;
secretory granule protein.
SEGMENT 1 of 2
SOURCE D.virilis DNA.
ORGANISM Drosophila virilis
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 3189)
AUTHORS Taghert,P.H. and Schneider,L.E.
TITLE Inter-specific comparison of a Drosophila gene encoding FMRF
amide-related neuropeptides
JOURNAL J. Neurosci. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.H.Taghert 07-MAR-1990.
FEATURES from to/span description
pre-msg 2890 > 3189 FMRFamide-related neuropeptide mRNA and
introns
IVS 3059 > 3189 FMRFamide-related neuropeptide intron A
site 1211 1222 conserved sequence 1
site 1225 1236 conserved sequence 2
site 1300 1416 conserved sequence 3
site 1491 1504 conserved sequence 4
site 1513 1533 conserved sequence 5
site 1636 1668 conserved sequence 6
site 1695 1724 conserved sequence 7
site 1426 1438 conserved sequence 8
site 1814 1864 conserved sequence 9
site 1888 1908 conserved sequence 10
site 2116 2128 conserved sequence 11
site 2144 2159 conserved sequence 12
site 2635 2646 conserved sequence 13
site 2674 2688 conserved sequence 14
site 2754 2853 conserved sequence 15
site 2691 2704 conserved sequence 16
site 2041 2050 conserved sequence 17
site 2856 2933 conserved sequence 18
BASE COUNT 1034 a 630 c 656 g 869 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttctat cagtaaacaa caacaactac cgttattgct caatacctgt agttgttgtc
61 gctgctgctg atttcccatg cagcttagaa catgacgcaa ctgaacagca gcagcagcag
121 cagccgccat cgcacaaata tgttcgttta ccgttttgaa tttatgttta tagcagcagc
181 agcacatgtt gttacgtctt gcaattgttt atggcataaa tctttatagc atacacacac
241 acatacatgc atgtgtacat gcataaattc cgtgtgtgcc cattaattag agagaattgg
301 ccatggataa gctggtcggt tacttattag gttaccacct ggtggactcg aaaacaaata
361 aaactcggca cataaacaaa aaaactaaaa attaagacga aaattcaaca atggcattga
421 catcaaacac attgcctgcg ctttcaaacg gcccatgaca acaagagcta actaactaaa
481 tcgtaaaaaa agagctagta aaatccaatt tttttaacca tctaaaatcg taaagcctat
541 caaaacacgg cttttggcat ttggagcaaa cggttgtcaa gcggagcccc attgagctaa
601 tcaggttctg ctgttgctca ttgacggaca ccaattttat ttatctattt ttattgactc
661 ttctagatgt attccatatc cattgttgag ctgtgatatg cttaaaaaag taacgattaa
721 ctggaaaata ctctttagtg aaatcaacgt aataaaatac ttattggtat ttataataga
781 acaaatgcaa atacaattat atgatcggaa ataaaatttt tttaagtaat gcatttgaaa
841 agaaaagaaa aataaatata tttacaatta ggtttctttt attttggtaa cggatgcata
901 gcctagaaga ggtatatatc gaaaaatatg tctatattta attgtctaat ctgtcaaaaa
961 tgcaaactga aagattatct aaatattagt tgagcagtaa tggattctag tttatttaac
1021 tgatttccaa ttacctaaaa gtaaagtaag aaagaagcag ttgatatatt taaaagattg
1081 cagggcatcg aaaattcgtt gaagtgaact taaagcacgc ccaatcttag tttccgatat
1141 atataattta agttatttat ttcgtatgtg ttactcactg atggaacagc gcgacggagt
1201 ccacgctctc ggtcaagaaa acgctttgaa cttgagcgtt gttagcgtta ggtatttatg
1261 caaatggatt cagcatcatg catatttatc gatgcaaatg tctggctatt gattttcgac
1321 gatcgttaga gaacaaaaaa aaaaaaaaga agctaaaatg aaataagaat tgaagcgaac
1381 tgccagacac acaatggttg acacatttct tgcaagtcat ggtccacatc ttggccggac
1441 gctgggcggg ttcttcaaat tcagacacgc aaaaaacggg ttgaaaacaa aaaacttggc
1501 ctttcaaagt gtggccgtaa ttacagactt ccgcttacga aaccgaaagt gtccgcccag
1561 ccaaatggcc aagaagtatt ggccgaagat ggggttgggg ctgggcgcag tcgggcagac
1621 gttttgggct gctaaaggcg ccacaatgta tcccgttaca ggggccataa aactccataa
1681 gcgactggcg acaaggccaa attataccgc atacgcacac gtagccatgg tctaagttgt
1741 ggacccggct aattggacgc gcaccagggg tatggtcagg gggtggtggg gttgtttttg
1801 gattctctat aaacttaaaa gtgggcggca cataacaatt gtgggcgatt tgcatacaat
1861 ttggcgcttg gattaccagc aaagcgcgtc aaaatgttgc tgcgtcatcg cgggccaaaa
1921 accagatgac tttgccatgt gctaaatata aacacatgca catatatata tatatatatg
1981 tgtatatata atatatatag tatatatgca catgcaagcg gctaataaag atattgtatg
2041 gccagcacaa tgcagctgtc tgcgggccat tgtttgtctg acagtccgtc tgacagtttt
2101 tgtctgtcca tcacaaacaa acatatcaca gccaaaatgg caggccgaca ttgtcaaggc
2161 tggcttctgt tcaaagcttg cagtttaacg ataaatgaat tgcacaaaga caccaaaaaa
2221 aaaccaacta tatatgctcc agttgcaagt gcttgactag cagataccct gtagccagcg
2281 acaagctgcc ttgatttgag ttctttgcaa agaaatagtt gatggtcctg ataaataatg
2341 gggctccaat tggattactt taagcctcct tggagcctgg acctgtccaa agccttctcc
2401 gttgtgagag tcgaaaaacc caagagcaag caagtttcta gctcacgaaa cttgcgataa
2461 agaccctctg gtcgggcctc gtagcagcag gagtatttac cagacaggac tcggttgttg
2521 atgctgatca ggaatatata tatatatata tacttaatgg gcagagcttg ctgacacttt
2581 gtatcctctt tcacccgttt acaatgggct tagggtataa agagagaagt cgctgcaact
2641 gttgccagct aacatgttgc ataaacatgc cgcagcaaca cttgttgctg cctcaacaac
2701 acccacacac acacacacac atacacacac acacacacag ctgcacagca acacgcagag
2761 gctttggact gccgcctgcc tattatcctg aacaaaggac accgtcggct gtcggccgtc
2821 ggccggttgt acgttgtcgg ccgccgtcta acggtatata aaaccggatg tttcgccaaa
2881 catttgcaca gaagtgtttc acattgccac ggtcaacaag tgtcctgacc aaaatcaggt
2941 agaagaagaa ccacacgagt ggccagtgaa aagtgtttta aaatttccca tttcccaacg
3001 gaatgtcgat ttgattgcat tgcaatattc gtataatttt aaatatttga aaaaaaaggt
3061 gagtttttca aaattttgaa gccctgagca aagtgatttg agtgaaatac taaagcttag
3121 ttaatgggtg gaagaaagct taaatagtct taaaggcttt aaggcttaaa gaaaaagttt
3181 gaatctaga
//
LOCUS DRVFMRFRN2 1803 bp ds-DNA INV 08-MAR-1990
DEFINITION D.virilis FMRFamide-related neuropeptide gene, exon 2.
ACCESSION M32643
KEYWORDS FMRFamide-related neuropeptide; neuropeptide;
secretory granule protein.
SOURCE D.virilis DNA.
ORGANISM Drosophila virilis
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1803)
AUTHORS Taghert,P.H. and Schneider,L.E.
TITLE Inter-specific comparison of a Drosophila gene encoding
FMRFamide-related neuropeptides
JOURNAL J. Neurosci. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.H.Taghert 07-MAR-1990.
FEATURES from to/span description
pept 551 1570 FMRFamide-related neuropeptide precursor
pre-msg < 1 > 1570 FMRFamide-related neuropeptide mRNA and intron
IVS < 1 539 FMRFamide-related neuropeptide intron A
BASE COUNT 471 a 438 c 459 g 435 t
ORIGIN
1 tgacggagtg cgaacgttca tatacatcag ctccggcgcc cagcccctgc tcctggcgta
61 gaccatcagc ttagcgcctt taacaagttg tattaaggtt gttgatggcc gtgtgtgtgt
121 gtgtgtgtgt gtgacgggga ggggggcaac tcatgttgct gttgttgctg caggtcgatt
181 gttgttgctg ttgctgttgt tgctgccttt aatgcgcttg gttaactttt tacgcgcaca
241 ttcatatttc gctcttggcc aggcggatta tcgcataaat attgttatgg ttaattagcg
301 ggacactttg tctggccggg caaacaaaag gcgttaagac gctgccaccg gagtagcctg
361 ctgctgcctg gtttttgctg ctgttgccag ttaagtaaaa gtgaatgcgt tcattgcttt
421 atccatgggg gcgttaatgt gattaaaact tgttatagat aatgcggcaa gtgatttagt
481 caacacctca taattatgtt caatttctcc ctctctccct ccctttcact ccatctcagc
541 accttcaatc atgggcatag cactgatgtt cctgctggcc ctctaccaga tgcagtcggc
601 catccatagc gagattattg agacgccatc cagctataat gataactcgc tgctggaggc
661 agctgcggag gagcccaatt cgagagccac cgcgtctgag tcggatctgc tggatggact
721 gatgagcacc gataatccca atccggagca gcaaaccgaa ctggagtttc gctatccgat
781 ctcggccatt ggcattggtt atgccaagaa ttcggtggtg ttgcgtttcc agaagcacgc
841 gcgcaaacag aacttcaagt acgatccgga ctacgagatg aagcgcaagt cgctacagga
901 caatttcatg cactttggca agcggcaggc ggagcagctg cctcaggcaa ctggacccgg
961 ctactatgag tgcatcaagc gctccgccat ggatcgctac ggacgcgatc ccaagcagga
1021 ctttatgcgt ttcgggcgtg ctccgccttc ggactttatg cgcttcggac gtgcaccctc
1081 ggactttatg cgtttcggac gcgatcccag ccaggatttc atgcgcttcg ggcgctccga
1141 taactttatg cgctttggca ggaacctgaa ctttcacgag gagctgcgca gtcccaaaca
1201 ggatttcatg cgtttcggtc gcccggacaa tttcatgcgc ttcggccgat ccgcgcccac
1261 agaattcgag cgtaatggca aaatggactc gaacttcatg cgctttggca agagatcagg
1321 tgtaatggcc aaattgacca aatcccagct gcagcaaaac aaattgacaa ctgccgatgg
1381 caagcagcag ccagccgagg agggcaatcc cacggacaag gccatctcga tgctattcaa
1441 taagcaccag cagcagcagc agcagcagca ggggcagagg ctgcagcagg aagaccgaca
1501 gcagatgaag agcagcgcag agcagaacaa cttggaggaa gccagcgtgg agcagttcta
1561 tgagccatag attctaccgc gcacagcaca ctgtaaatat aaattggcca aaaaaaccga
1621 acactatgat aattgtactt aacactatgt atgtatgtac ttaaacgcat tatagacatt
1681 gcagaagaaa tttaaaaaag gaaaaaaaaa aacgtaattt gaattttaat ggggctggat
1741 taaaaattca ccacgctttg atctgatcat aagaaaacaa aagaaaaaaa agtagaaaac
1801 tct
//
LOCUS ECACELS 1224 bp ds-DNA BCT 06-MAR-1990
DEFINITION E.carotovora beta(1,4)-glucan glucanohydrolase (celS) gene,
complete cds.
ACCESSION M32399
KEYWORDS beta(1,4)-glucan glucanohydrolase; endoglucanase.
SOURCE E.carotovora carotovora (strain SCC3193) DNA, clone pHSK25.
ORGANISM Erwinia carotovora
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1224)
AUTHORS Saarilahti,H.T., Henrissat,B. and Palva,E.T.
TITLE CelS: A novel endoglucanase identified from Erwinia carotovora
subsp carotovora
JOURNAL Gene (1900) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
H.T.Saarilahti, 02-MAR-1990.
FEATURES from to/span description
pept 208 1002 endoglucanase (celS) precursor
sigp 208 303 endoglucanase signal peptide
matp 304 999 endoglucanase
binding 196 200 ribosome binding site
BASE COUNT 353 a 278 c 253 g 340 t
ORIGIN
1 taaaattaac accctgattt tatttcacat tattcatatt tattaagact ttaatcattt
61 ccccatgaat tggattattc cgacagagtg gaaataacat aaccagttaa actaaccaca
121 tctcagactt gatcatagtc acgtgataac gatgtgttaa ctttcagatt aaaacacctc
181 aagaatatat ttaaaaagag atttattatg cagactgtga atacccagcc acaccgtatt
241 ttccgcgtat tattgcccgc agtattttct tccttgctac tctcatcact aacagtgtct
301 gctgccagtt catcaaacga tgcggataaa ctctatttcg gtaataataa atattaccta
361 ttcaataacg tatggggaaa agatgaaata aaaggatggc agcaaacgat tttctataat
421 agcccaatta gcatggggtg gaactggcac tggccaagca gtacccacag tgttaaagcc
481 tatccttcac tggtgagcgg ttggcactgg acggcgggct atacggaaaa tagcggattg
541 ccgatacaat tatccagtaa taagagtatc accagtaacg ttacttattc tatcaaggct
601 acgggtactt acaacgccgc ttacgacatt tggttccaca ctaccgataa agccaactgg
661 gactcgtcgc ctaccgacga attgatgatt tggctaaatg ataccaatgc aggccctgcg
721 ggtgattata ttgaaaccgt tttcctcggc gacagcagtt ggaacgtatt caaaggctgg
781 atcaacgcgg ataatggtgg aggttggaat gtcttttcct ttgtccatac ttccggcacc
841 aacagtgcat cactcaatat tcgtcacttc accgactatt tggtgcagac aaagcaatgg
901 atgagcgatg aaaaatatat cagcagcgtt gagtttggca ccgagatctt tggcggcgat
961 gggcagatcg acatcaccga atggcgcgta gacgtaaaat agcgcgtctg gcgttgctat
1021 tcctccccgt tcagccgcaa taaacacgtt aacgtagagc tgaaacggat tggtctcatc
1081 gacgttctct tttcattttc agttcagaca gacgttgccc gttctttagg cgggttcgct
1141 tccccccaca acaaaattcg tgagctgtgg cacgtaaatt tcattagcgt tatgataatc
1201 ccctcggcta atcaggaatg tccc
//
LOCUS ECODCM 2472 bp ds-DNA BCT 26-FEB-1990
DEFINITION E.coli DNA cytosine methylase (dcm) and patch repair protein genes,
complete cds.
ACCESSION M32307
KEYWORDS DNA cytosine methylase; patch repair protein.
SOURCE E.coli (K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2472)
AUTHORS Sohail,A., Lieb,M., Dar,M. and Bhagwat,A.S.
TITLE Gene required for very short patch repair in Escherichia coli
partly overlaps the DNA cytosine methylase gene
JOURNAL Unpublished (1990)Wayne St. Univ. Detroit, MI 48202
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.S.Bhagwat, 23-FEB-1990.
FEATURES from to/span description
pept 250 1668 DNA cytosine methylase (dcm)
pept 1649 2119 pot. patch repair protein
BASE COUNT 606 a 620 c 695 g 550 t 1 others
ORIGIN 43 min on K12 map.
1 gatcatttcc agactaagtt gctgaacgta ccgcaaacca tgcaaaccgc aagaggcaag
61 cacgtggcgc agcacaacgc gcatttttta gtcgagttta tggcgaagct cagtgccgaa
121 ctggcggggg agaatgaagg tgtcgatcac aaggtgatag atgcgttttc atccgctggc
181 tgagcgcgtg gccctaaatg gctgtaatta tgttaacctg tcggccatct cagatggccg
241 gtgaaatcta tgcaggaaaa tatatcagta accgattcat acagcaccgg gaatgccgca
301 caggcaatgc tggagaaact gctgcaaatt tatgatgtta aaacgttggt ggcgcagctt
361 aatggtgtag gtgagaatca ctggagcgcg gcaattttaa aacgtgcgct ggcgaatgac
421 tcggcatggc accgtttaag tgagaaagag ttcgcccatc tgcaaacgtt attacccaaa
481 ccaccggcac atcatccgca ttatgcgttt cgctttatcg atctattcgc cggaattggc
541 ggcatccgtc gcggttttga atcgattggc ggacagtgcg tgtttaccag cgaatggaac
601 aaacatgcgg tacgcactta taaagccaac cattattgcg atccggcgac gcatcatttt
661 aatgaagata tccgcgacat caccctcagc cataaagaag gcgtgagtga tgaggcggcg
721 gcggaacata ttcgtcaaca cattcctgaa cacgatgttt tactggccgg tttcccttgt
781 cagccatttt cgctggctgg cgtatcgaaa aagaactcgc tcgggcgggc gcacggtttt
841 gcctgcgata cccagggcac gctgtttttt gatgtggtac gcattatcga cgcgcgtcgt
901 ccggcgatgt ttgtgctcga aaacgtcaaa aacctgaaaa gtcacgacca gggtaaaacg
961 ttccgcatca tcatgcagac gctggacgaa ctgggctatg acgtggctga tgcagaagat
1021 aatgggccag acgatccgaa aatcatcgac ggcaaacatt ttctgccgca gcaccgtgaa
1081 cgcatcgtgc tggtgggttt tcgtcgcgat ctgaatctga aagccgattt taccctgcgt
1141 gatatcagcg aatgtttccc tgcgcagcga gtgacgctgg cgcagctgtt ggacccgatg
1201 gtcgaggcga aatatatcct gacgccggtg ctgtggaagt acctctatcg atatgcgaaa
1261 aaacatcagg cgcgcggtaa cggcttcggt tatggaatgg tttatccgaa caatccgcaa
1321 agcgtcacgc gtacgctgtc tgcgcgttat tacaaagatg gcgcggaaat tttaatcgat
1381 cgcggctggg atatggccac gggtgagaaa gactttgacg atccgctgaa tcagcaacat
1441 cgtccacgtc ggttaacgcc tcgggaatgc gcgcgcttaa tgggttttga agcgccggga
1501 gaagcgaaat tccgtattcc ggtttcggac actcaggcct atcgccagtt cggtaactcg
1561 gtggtcgtgc cggtctttgc cgcggtggca aaactgcttg agccaaaaat caaacaggcg
1621 gtggcgttgc gtcagcaaga ggcacaacat ggccgacgtt cacgataagg ccactcgcag
1681 caaaaatatg cgcgcgattg ccacgcgtga tacggcgata gagaagcgcc tcgccagtct
1741 gttaaccggg cagggcctgg catttcgcgt tcaggacgcc agtctgcccg gacgtccgga
1801 ttttgtcgtt gatgaatatc gctgcgtgat atttacccat ggctgcttct ggcatcatca
1861 tcactgctat ctgtttaaag tgcctgcgac tcgaaccgag ttctggctgg agaagatagg
1921 taaaaatgtt gagcgcgatc gccgcgatat cagtcgcttg caggaactcg gctggcgcgt
1981 attgattgtc tgggagtgcg cgttacgtgg gcgcgagaag ctgacggatg aagcgcttac
2041 cgagcgtctg gaagagtgga tctgcggcga aggtgccagc gcgcagatcg acacgcaggg
2101 gattcattta ctcgcttgat gcgtcctgaa taactggcgc aactacgggt tttgccggga
2161 agagatattt tcccaacgtg accagtacca ccgcgaagac aattacgccg agcgccagcc
2221 attcaatctt cgacagtgtt tctccaccca gtcccgtacc cagcaagacc gcgaccaccg
2281 ggttaacgta agcgtagctg gtggcgagag ccggactgac attacggatt aaatacatat
2341 aagcgttgat ggcgataatc gaaccaaaca gcgccagata gccgaccgca aggaagcctg
2401 aaagggaagg gagcgccgtc agtttttnac ccgcaatcat cgacgcgatc attaacacca
2461 cgcctgccgc cg
//
LOCUS FCLPOLCYS 3865 bp ss-RNA VRL 06-MAR-1990
DEFINITION Feline calicivirus RNA-dependent RNA polymerase, cystein protease,
and 2C protein,.
ACCESSION M32296
KEYWORDS RNA polymerase; cysteine protease; nonstructural protein.
SOURCE Feline calicivirus (isolate CFI/68 FIV), cDNA to viral RNA, clones
pCV[2,8].
ORGANISM Feline calicivirus
Viridae; ss-RNA nonenveloped viruses; Caliciviridae.
REFERENCE 1 (bases 1 to 3865)
AUTHORS Neill,J.D.
TITLE Nucleotide sequence of a region of the feline calicivirus genome
which encodes picornavirus-like RNA-dependent RNA polymerase,
cysteine protease and 2C polypeptides
JOURNAL Unpublished (1990)Pioneer Hi-Bred International, Johnston, IA 50131
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
J.D.Neill, 23-FEB-1990.
FEATURES from to/span description
pept < 1 3865 polyprotein (AA at 2)
matp < 1 451 2C-like protein
matp 1802 2242 cysteine protease-like protein
matp 2521 3762 RNA-dependent RNA polymerase-like protein
BASE COUNT 1057 a 865 c 894 g 1049 t
ORIGIN
1 ggttcctgta tgttacattc ttactggacc acctggatgc gggaaaacaa ctgcagcgca
61 ggcattagcc aagaagttat ctgaccaaga accatcggtc atcaaccttg atgtagacca
121 ccatgatacg tacactggca atgaggtgtg cataattgat gaatttgatt cctctgataa
181 ggttgattat gcaaattttg ttattggaat ggttaactcg gcaccaatgg tactaaattg
241 tgatatgctg gaaaataagg gcaagttgtt cacctcaaag tacataataa tgacttccaa
301 ttctgaaaca cctgtcaaac catcctccaa acgcgctggc gcattttatc ggagagttac
361 ctatcatgat gttgctacct tggttgaatc gcacaagcgt gcacggcctg gtactgctgt
421 cccccgtagt tgctacaaga aaaacttttc acatctatcg cttgctaaac gaggcgctga
481 gtgctggtgc aaagagtatg tccttgatcc taagggattg caacaccaga gcacaaaggc
541 tcctccccca accttcctta atattgattc tttggcccaa acaatgaagc aagactttgc
601 tttaaagaac atggcatttg aagctgaggt aggatgtagt gaacatcggt atggatttgt
661 atgtcagcag agtgaagttg aaacagttcg ccgactcctt aatgccatta gaatgaggct
721 taacgctacg tttaccgttt gcgttggcct tgaagcttct aattccgttg ggtgtactgc
781 gcacgtttta acacctgacg aaccattcaa tggaaaaagg ttcgttgtgt cacgctgcaa
841 tgaggcatcg ctatccgcac tagaaggaaa ctgcgttcaa actgcattag gagtgtgcat
901 gtccaataag gacctcactc acctgtgtca cttcattaag gggaaaattg tcaatgatag
961 tgtcaggtta gatgaactac ccgccaatca acatgtggta accgttaatt cggtgtttga
1021 tttggcctgg gctcttcgtc gtcactctac actaactggt cagttccaag caatcagagc
1081 cgcatatgat gtgcttcatg tccctgacaa ggtccccgcc atgttgcggc actggatgga
1141 tgaaacctcc ttctctgatg aacacgtggt aacgcagttc attacacctg gtggagttgt
1201 aattcttgag tcgtgtggcg gtgcgcgcat ctgggcactt ggtaacaatg ttatcagggc
1261 tggaggcgtc actgccatac ccactggggg atgtgttagg ttaatgggac tctcagctca
1321 aacaatgcca tggtctgaaa ttctttcgga actcttctct ctgctaggga aaatctggtc
1381 tagtgttaaa gtctctaccc ttattctcac cgctctcagc atgtatgcat ctaggttcag
1441 gcctaaaaca gaagcaaagg gaaaaacaaa atccaaaatt ggtccatata gaggccgagg
1501 tgtggcactt accgatgatg agtacgacga atggaaagaa cacaatgccg ctaggaagtt
1561 ggacctctca gttgaagatt ttctaatgtt gcgtcaccgt gccgcactag gggcggatga
1621 cactgatgcc gtcaaattta ggtcatggtg gaattcaagg tctagattgg ctgatgactt
1681 tgaagacgtc accgtgattg gtaagggtgg cgtaaagcat gagaaaatta gaacaaatac
1741 cctcagagcc gttgaccgtg gctacgatgt cagctttgcc gaggagtcag ggcccggggc
1801 aaaattccac aaaaacgcaa ttgggtcagt cactgacgtc tgtggagaac acaagggcta
1861 ttgtgttcac atgggtcacg gggtctacgc ctcagtagca cacgtggtta agggtgactc
1921 tttctttctg ggtgaaagaa tctttgactt aaaaactaat ggtgagttct gttgcttccg
1981 tagcactaag atcttgccta gtgctgctcc tttcttttct ggaagaccaa cgcgtgaccc
2041 ttggggatct cctgtagcaa ctgactggaa accaaaacct tactcaacca catcggggaa
2101 aattgtaggg tgtttcgcaa caacatcaac tgaaacccac cctggcgact gtggcctccc
2161 gtacattgac gacaatggta gagtcactgg gttgcacaca ggctctgggg gacccaagac
2221 cccaagtgca aaattggttg ttccctacgt acatattgac atgaaaacta aatctgtcac
2281 tgctcaaaag tatgatgtca ctaaaccaga tattagttat aaaggtttag tttgtaaaca
2341 attggatgaa attagaatta tcccaaaagg taccaggcta cacgtatctc ctgcacattt
2401 agaagatttt gaggaatgct ctcaccagcc tgcctctcta ggtagtggtg atcctcggtg
2461 tcccaaatcg ctcactgcaa ttgttgttga ctctcttaag ccctactgtg tggtagttaa
2521 tgggcccccc catgatatct tacatagggt acaaaagatg ctaattgatc atctttctgg
2581 ttttgtcccc atgaacatat catctgatac ctccatgctt tctgctttcc acaaactcaa
2641 ccatgataca tcttgtgggc catatctagg tggtagaaag aaagatcaca tggtcaatgg
2701 agaacctgac aaggcgctat tagatttgtt gtcttcaaaa tggaagttgg caacacaggg
2761 aattgcactc ccacatgagt acacaattgg tttgaaagat gaactaaggc ccattgaaaa
2821 agtgcaagaa gggaaaagaa ggatgatctg gggctgtgat gtcggcgttg ccactgtatg
2881 tgcggctgca tttaaaggtg ttagtgacgc cattacggca aaccatcaat atggccctat
2941 acaggttggc attaacatgg acagtcccag tgttgaggca cttttccaaa ggatcaaaag
3001 cgcacgaaag gtgtttgctg ttgattactc aaagtgggat tcaacccaat ccccccgcgt
3061 tagtgccgct tcaattgaca tactgcggta cttttctgat cggactccaa ttgttgattc
3121 ggccacaaac acgcttaaaa gcccccccat tgctgtattt aatggagtag ctgtcaaggt
3181 gtcctctggt ctcccatctg gcatgcccct cacatctgta atcaactcgc tcaatcactg
3241 tttatatgtt ggctgtgcaa tcttgcaatc tttggaagct cgtaacgtgc ctgtcacttg
3301 gaatcttttc tcaactttcg acatgatgac ttatggtgat gatggtgttt acatgttccc
3361 cacaatgtat gcaagtatta gtgaccaaat ttttgccaat ctttctgcat atggtcttaa
3421 gcccacaaga gttgacaaat cagttggttc aattgaacca attgatccta attcagttgt
3481 cttcctaaaa agaacaatta cacgaacccc ccaaggaata cgaggcttgc ttgatcgcag
3541 ctcaattctc aggcaattct actacatcaa gggtgagaac acagataatt ggaaagagcc
3601 cccaaaaact atagatccaa tgtcaagagg gcaacaattg tggaatgcat gtctgtatgc
3661 aagtcagcat ggtattgact tttataataa ggtatacaaa ttggccgaaa aagcagtcga
3721 gtacgaagga ttgcatcttg agcccccttc ttacagcaca gccttggagc attacaacag
3781 ccaatttaat ggtgtggagg cgcggactga ccagatcgat acgagtggca tggccgccct
3841 tcactgtgat gtgttcgaag tttga
//
LOCUS HUMGALOPS 794 bp ds-DNA PRI 06-MAR-1990
DEFINITION Human 1,3-galactosyltransferase pseudogene, 3' end.
ACCESSION J05421
KEYWORDS 1,3-galactosyltransferase; pseudogene.
SOURCE Human peripheral blood leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 794)
AUTHORS Larsen,R.D., Rivera-Marrero,C.A., Ernst,L.K., Cummings,R.D. and
Lowe,J.B.
TITLE Frameshift and nonsense mutation in a human genomic sequence
homologous to a murine UDP-gal-beta-D-gal alpha-1,3
galactosyltransferase cDNA
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
J.B. Lowe. 02-MAR-1990.
FEATURES from to/span description
pept.ps / 63 755 pseudo-1,3-galactosyltransferase
site 394 394 frameshift mutation
site 475 475 frameshift mutation
site 427 429 in-frame stop codon
site 708 710 in-frame stop codon
BASE COUNT 227 a 165 c 181 g 221 t
ORIGIN
1 cagcttgtgg tttctttcag gaatcccaga ggataaatgt tttgcttttc ttctttgttt
61 cagatataat gatcattact tggaggagtt cataacatct gctaataggt acttcatggt
121 tggccacaaa gtcatatttt acatcatggt ggatgatgtc tccaagctgc cgtttataga
181 gctgggtcct ctgcattcct tcaaaatgtt tgaggtcaag ccagagaaga ggtggcaaga
241 catcagcatg atgcgtatga agatcactgg ggagcacatc ttggcccaca tccaacacga
301 ggtcgacttc ctcttctgca tggatgtgga ccaggtcttc caagaccatt ttggggtgga
361 gaccctaggc cagtcagtgg ctcagctaca ggctggcggt acaaggcaga tccctatgac
421 tttacctagg agaggtggaa agagtcagca ggatacattc catttggcca ggggattttt
481 attaccatgc agccatttct ggaggaacac ccattcaggt tctcaacatc acccaggagt
541 gctttaaggg aatcctcctg gacaagaaaa atgacataga agccaagtgg catgatgaaa
601 gccacctaaa caagtatttc cttctcaata aaccctctaa aatcttatcc ctaaaatact
661 gctgggatta tcatataggc ctgccttcag atattaaaac tgtcaagtga tcgtggcaga
721 caaaagagta taatttggtt agaaataatg tctgacttca aattgtgcca gtagatttct
781 gaatttaaga gaga
//
LOCUS MEFTGSRNA 3367 bp ds-DNA BCT 06-MAR-1990
DEFINITION M.fervidus 7S RNA, Ser-tRNA, 16S rRNA and Ala-tRNA genes.
ACCESSION M32222
KEYWORDS 16SrRNA; 7S RNA; transfer RNA-Ala; transfer RNA-Ser.
SOURCE Methanothermus fervidus.
ORGANISM Methanothermus fervidus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanothermaceae.
REFERENCE 1 (bases 1 to 3367)
AUTHORS Haas,E.S., Brown,J.W., Daniels,C.J. and Reeve,J.N.
TITLE Genes encoding the 7S RNA and a Ser-tRNA are linked to one of the
two rRNA operons in the genome of the extremely thermophilic
archaebacterium Methanothermus fervidus
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.J.Daniels, 02-FEB-1990.
FEATURES from to/span description
pept 132 887 ORF 260
RNA 944 1258 7S RNA
tRNA 1273 1359 Ser-tRNA
rRNA 1566 3046 16S rRNA
tRNA 3105 3178 Ala-tRNA
BASE COUNT 898 a 751 c 998 g 720 t
ORIGIN
1 ggagacagtg gttcaccagt catacaagat ggtaaggttg tgggtaatat tgtcttcaat
61 atattacaca aatgctacag ctatagcatc ttccctagaa aaatttttag aaaggaagta
121 atgtggtggc catgaggatt ggatttattg gatttggaga agtaagttct actttatcac
181 aattttttaa ggataaggtg gaagtacaaa catgtgttaa aggtaggagt gaaaaaacta
241 aaaaaattgc aaaaaaactt ggagtaaaga tatacaaaga ttataaagac cttgtaaaaa
301 attctgacat tgttatttca gctgttactc cattttcagc tttagatgtg gctaaaaaat
361 atggaaaata tgtaaaaggt atatatgtag atgtgaataa tgtatcacct ctaacaaaac
421 ataaaatttt gaaatatatt gacgaggaaa aatttgttga ttgtgcaata attggaagaa
481 taaaacgtaa atttaaaatg atttgttcag gtaaaaatgc aaataaactt aaaatattag
541 aaaaatttgg agtacctatt gaagttatag gttcaaaggt tggggaagcc tcaacattaa
601 aaatgctcag aagtttatat actaaaagtc ttgctgcaat attattagaa gttttttcgg
661 ttgctaataa attaggactt atcgatgaat tacttgaaat attagaggaa acagaaggga
721 agaaatttgt agatttatgt aagtcaagag tggtaggatc atttattcac tcaaggagaa
781 gatatgaaga aatatgtgaa atagaaaaat ttatattatc tcataatttg aaaccaataa
841 tgatcaaatg tacaaaaaat atgtttaaac atatagagga atgttgacaa agattatatc
901 aaaaaattta aataagattg aaaaataaaa tataaaatgg cagcgaggct aggccggggg
961 gttaggggtc ccctgtaagc gcaaatcccc tatatggcgc ggccgaagcc caggaggcgg
1021 caagaccgcc agacatcggc ctgagggtta aacaatgaag cctcgtccca cagggccacc
1081 ggtggcgagg gtccagctgg agggctggac ctaatcgcct ttgctgcggg aacgggtcag
1141 gcccggaagg gagcagccct accgcagacg gatggtgctt gtgggtcaac ggggtggagt
1201 ctataaccct cagatcaccg gtgtctggtg gtcttgtcca ctcctgggcg tgcctcgctg
1261 ccaaatgtct tcagtagccg gggtggccca gcctggtagg gcgtcggcct gctaagccga
1321 tgatccgtta aggatcgcgc gggttcaaat cccgtccccg gcgtgcaata ttttctaatt
1381 tttttgtatt gatagtgttg ctaggtatgt aatgccatgg taccggcgtg gattcggagt
1441 gcctatttag gctcgatgaa gaatcgcact ccagccatgg caacaaaatt ggtgagagcc
1501 tgagtgaata actcaggctt gatacatagg tcccacgggt gctggtaaag tcccaaagga
1561 atcccccgcc aactccgttt gatcctggcg gaggccactg ctatgggggt ccgactaagc
1621 catgcaagtc gaacgggcct tgtgcccgtg gcgaacggct cagtaacacg tggacaacct
1681 accctggggt ccgggataac cccgggaaac tggggctaat cccggatagg cgaggtctcc
1741 tggaatgggg cctcgccgaa aggtcttttg accgccccag gatgggtctg cggccgatta
1801 ggtagttggt agggtaacgg cctaccaagc ctacgatcgg tacgggttgt gagagcaaga
1861 gcccggagac ggggcctgag acaaggcccc gggccctacg gggcgcagca ggcgcgaaaa
1921 ctccgcaatg cgcgaaagcg cgacgggggg acccccagtg ccactccgta aggagtggct
1981 tttccggagt gtaaaaagct ccgggaataa gggctgggca agaccggtgc cagccgccgc
2041 ggtaacaccg gcagcccgag tggtggccgc gtttattggg cctaaagcgt ccgtagccgg
2101 tccggtaagt ctccggtgaa agcccgcagc tcaactgcgg gagtagccgg agatactgcc
2161 ggacttgggg ccgggagagg ccggaggtac ccccggggta ggggtgaaat cctgtaatcc
2221 cggggggacc acctgtggcg aaggcgtccg gctggaacgg gcccgacggt gagggacgaa
2281 agccagggga gcgaaccgga ttagataccc gggtagtcct ggccgtaaac gatgcggact
2341 tggtgttggg gcaacctcga gttgccccag tgccgaaggg aagccgttaa gtccgccgcc
2401 tggggagtac ggccgcaagg ctgaaactta aaggaattgg cgggggagca ccacaacgcg
2461 tggagcctgc ggtttaattg gattcaacgc cggacacctc accgggggcg acggcaggat
2521 gatggccagg ttgatgacct tgcctgacga gccgagagga ggtgcatggc cgccgtcagc
2581 tcgtaccgtg aggcgtcctg ttaagtcagg caacgagcga gacccgcgcc cctagttgcc
2641 agcgggtccc gtaagggacg ccgggcacac tagggggacc gccagcgata agctggagga
2701 aggtgcgggc gacggtaggt ccgtatgccc cgaaaccccc gggctacacg cgggctacaa
2761 tggccgggac aatgggtacc gaccccgaaa gggggaggta atcccataaa cccggccgta
2821 gttcggatcg agggctgcaa ctcgccctcg tgaagctgga atgcgtagta atcgcgggtc
2881 actatcccgc ggtgaatacg tccctgctcc ttgcacacac cgcccgtcac gccacccaaa
2941 cggggttcgg atgaggccat ggcctctggc catggtcgaa tccgggcccc gtgaggaggg
3001 cgaagtcgta acaaggtagc cgtaggggaa cctgcggctg gatctaacct ccttaaaata
3061 ataatattcc tttgggacac caaaagcacc cgtgggacct tttttatttt tgggcccata
3121 gctcagcctg ggagagcgcc gcccttgcaa ggcggaggcc ccgggttcaa atcccggtgg
3181 gtccatgcaa ctagccatct tatgataggc tagtgaagct agatgatcaa agatgattct
3241 aggtgcatag gtagttcgtg gacaccaact gtagggaact ggtccaaggt ttaagccgcc
3301 cggtggatgg ctcggcttgg gcgccgacga aggccgtggc aagctgcgat aagcccgggt
3361 gaggcgc
//
LOCUS STASAU3AIM 3360 bp ds-DNA BCT 06-MAR-1990
DEFINITION S.aureus Sau3AI-restriction-enzyme and Sau3AI-modification-enzyme
genes, complete cds.
ACCESSION M32470
KEYWORDS Sau3AI-modification-enzyme; Sau3AI-restriction-enzyme.
SOURCE S.aureus (strain 3AI) DNA.
ORGANISM Staphylococcus aureus
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Micrococcaceae.
REFERENCE 1 (bases 1 to 3360)
AUTHORS Seeber,S., Kessler,C. and Goetz,F.
TITLE Cloning, expression and characterization of the Sau3AI restriction
and modification genes in Staphylococcus carnosus TM300
JOURNAL Unpublished (1990) Univ. of Tuebingen, D-7400 Tuebingen 1
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.t Seeber, 02-MAR-1990.
FEATURES from to/span description
pept 448 1917 Sau3AIR protein (ttg start codon)
pept 2017 3255 Sau3AIM protein
binding 437 441 ribosome binding site
binding 2004 2010 ribosome binding site
BASE COUNT 1295 a 393 c 638 g 1034 t
ORIGIN
1 aagtcgacct tcaccaagac cgaattttcc gcgtataccc gcagtaccaa acgttaattt
61 actttcaaaa ccttctcgct gttcaatgtc agattgctgc tcataaaaat gtttaactaa
121 actatcatta gctctttcta tccataattc tttatccatt gttgctaaac atcctttcaa
181 aatctcagtt agacttaata aaacatgaaa actaaagccc ttacatttat gtaatgaatt
241 ataaagaaat acgccccaaa agtaaaaaaa cacagcccca agacaatact tttcacaagt
301 attatataat agatgtgtat gaaaatgcat ggagtagatg taagagtgat attcaaaatg
361 tgtaaaaaat atggataatt ctatataatt atattattga aattttaaat agcggaaata
421 attatttaat gttaagaggg gataattttg gaaagttatt tgacaaaaca agccgtacat
481 aatcgcgcaa aagaagctgt tggtaaaagt gtattagaat taaatggtgg tgaatcgatt
541 aaacaaagta agagttcagt tggtgatgca tttgaaaatt ggtttggtaa gaaaaaagac
601 agtgatagta aaccagatat ggcagaagct ggggtggaac ttaaggcaac gccatttaaa
661 aagttgaaaa acggaaagta tagctccaaa gaaagattag tattaaatat tataaactat
721 gagaaagtgg caaatgaaaa ttttgaaact agtagttttt tatctaagaa taatactata
781 gaattagctt tctatgaata tatcaaggga acacctagtg ataattggat tattaaagaa
841 gcggtgcttt atgaaatgca taaaaacccg attgattatg aaataattaa acaagattgg
901 gaaataataa atcaatatat taatgaagga aaggcacatg aattgagtga aggtttgaca
961 agttatttag cgccatgtac aaagggtgcg aatgctagtt ctttaagaaa tcagccttat
1021 tcagacataa aagcaaagca aagagcattt tctttgaagt ctgggtatat gacatctatt
1081 ttacgcaaat atgttctagg tgatgaaaaa atagattcaa ttgtcaaaga cccatttgaa
1141 ataaaagaaa aatcaataga ggacatagtc tttgaaaaat ttcagccata tataaactgg
1201 tcaatcgata aattatgcga acatttttct atcaataaag gtgagaaagg tttaaattat
1261 agaatagcct ctgccatttt aaatctaaaa ggtaaaacta ctaaaagtaa accattcccg
1321 gaagttgaag agtttgaaaa atcatctata gtagtcaaaa cagttcattt taataaaaag
1381 aatgtgaata aagaaagtat gtcatttgga gcttttaaat ttgaagaact agctaatgag
1441 gaatgggaag atagtgaagg atatcctagt gcacaatggc gaaacttttt gttagaaaca
1501 aggtttttat tttttgttgt taaagaagat gaagatggtg tagacatatt caaaggaata
1561 aaatttttta gtatgcctga agaagacata aacggacctg tcaaaagaat gtgggatgat
1621 acagtgaaga aattaaaaga gggtgtcaca ttagaagctg taccggacaa aagtacaaag
1681 gatggttgga gaataaaaaa taattttgta gataaaagtg atgatttaat ttgccatgtt
1741 agaccacaca ctaataacag agactatcgt ggaggaagta atgcagataa gcttcctaaa
1801 aagattaact ggattaatag acctgactca gatgattatt cggatgagtg gatgactaaa
1861 caaagttttt ggataaataa tgactacata aaaaagcaag ttgaagattt attgtagtta
1921 aagtatgtta aaatataaga tattctttta aaatatcgaa cgatcgttcg tattttgtgt
1981 tataataagg ttgaattaag tataggaggt cgcctaatga ataaaattaa agtagtagaa
2041 ttgtttgcgg gtgtaggcgg gtttcgttta ggtttagaaa atacgaaaaa tggtatattt
2101 gacataactt gggcaaatca atgggagccc tcacgaaaaa tccaacatgc atttgattgt
2161 tatagtaaaa gatttaagaa cggcatccat agtaataagg atattgccca ggtatctgat
2221 gaagaaatgg caaatactga agctgatatg attgtaggag gatttccttg ccaagattat
2281 tcagttgcaa ggagtttaaa tggagaatta ggaatacaag gaaaaaaggg cgttttattc
2341 tggcaaatta ttagatatat tcaaaataca tttcctaaat acttgttgct tgaaaatgtt
2401 gatagattat tgaaatcacc ttcgagtcag agagggagag actttgctgt aatgttatca
2461 accttaaatg agttaggcta taatgttgaa tggcgcgtga ttaatgctgc tgattatggc
2521 aatgctcaaa gacgtagaag ggtatttata tttggatata agcaagattt aaactatagc
2581 aaagctatgg aagaaagtcc gttggataaa attatatatc acaatggttt gtttgctgaa
2641 gcttttccga ttgaagatta tgccaataaa aatagagtaa ataggactca tattactcat
2701 gatatagtcg atatttcaga taatttcagt tttcaatttt ataatagtgg aatcatgaaa
2761 aatggagaaa ttttaactat tgacacaata ccaaaatatg aaaaatcagt aaccttagga
2821 gaaattattg aaagtaatgt agatgatggt ttttcattaa atcaagatca aattgataaa
2881 tttaaatatt taagaggacc caaaaagatt aaacgaacta ctaaagatgg tcatgaatat
2941 tatttttcag aaggtggtat gtctgaaaca gattcattag agttacctgc aagaacaatg
3001 cttacaagtg aatcatctat taatagaagt actcattttt taaacgtaga tggtgtttat
3061 agaactttga cacctattga agcagaaagg ttgaatgggt ttccagataa ttggacagaa
3121 ggtatgccaa ttaagatgag atacttttgt atgggcaatg ctcttgttgt gcctttgatt
3181 actagaatag gtaatcaaat tgaaaaaatt gatagtatta caaatgatga attcagtcag
3241 ctacgtttat tttaaataaa tcaaatgtag agtgcgtttg atttaacaag tttgtatact
3301 aaagattcat agattattgt atattgacca gtatttatca gcgtattatt ttaatatata
//
LOCUS TRPTYF1 1321 bp ds-DNA BCT 06-MAR-1990
DEFINITION T.pallidum pallidum antigen TyF1 gene, complete cds.
ACCESSION M32401
KEYWORDS antigen.
SOURCE T.pallidum pallidum (strain Nichols) DNA.
ORGANISM Treponema pallidum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes;
Spirochaetales; Spirochaetaceae.
REFERENCE 1 (bases 1 to 1321)
AUTHORS Noordhoek,G.T., Hermans,P.W.M., Paul,A.N., Schouls,L.M.,
van der Sluis,J.J. and van Embdon,J.D.A.
TITLE Treponema pallidum subspecies pallidum (Nichols) and Treponema
pallidum subspecies pertenue (CDC 2575) differ in at least one
nucleotide: Comparison of two homologous antigens
JOURNAL Microb. Pathog. 6, 29-42 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
G.T. Noordhoek, 02-MAR-1990.
FEATURES from to/span description
pept 615 1148 antigen TyF1
variant 736 736 g in T.pallidum pallidum; a in T.pallidum
pertenue
BASE COUNT 349 a 297 c 353 g 322 t
ORIGIN
1 aacagcacgc gggcccccgg actacatcgc tcttgtgaac ctcaacggtc gcctttccat
61 actctaagga ataggagcac gagtacgtat cccccggtag tttccccgcc ttccgaacta
121 aaataagagg tattcccatg cgatctgcaa aaggcgcggc aaaaataaag ccacgtgatt
181 cgattgctgc gaccgcggta acgtgctcat cgcggtagaa ttccaccatt tgatcaagac
241 agtaacgaaa tacagccgcg ttcatcagca cgccagtaat gtcgtagtag agaattcctt
301 ttttagggaa atcaatccgc ttacgaattg cgcggtccag cgccgcgtgt ccgtccatag
361 gggcatggta acgtccaata ccacgcacgt caatgatctt accggtttgt tgggaggctt
421 ggtggattga gaattacgtc tcctggaaaa aagatttcgc tgaaacttca cgaaatctcg
481 gtgaaaataa atgattattt taccaatcgg tgaaaaaaag ccgggaaaag tccaaaaaga
541 cagtggttat gctccatttc tttcgatttt ttgttggcat ggtttttgct ttaaagtttg
601 gaggagaaag aacgatgaac atgtgtacag atggaaaaaa ataccacagc accgccacga
661 gcgctgcagt tggagccagc gcccccggtg taccggacgc tcgtgccatt gctgctatct
721 gcgagcaatt gcgccggcat gtagcggatc tgggagtact gtatatcaag ctacataact
781 atcactggca catctacggc attgagttta aacaggtgca tgagctcctt gaagagtatt
841 atgtatcagt tactgaagcc tttgatacga ttgccgagcg gttgttacag ctgggcgcgc
901 aggctcctgc gtctatggct gaataccttg cgttgagtgg aattgcagaa gagacggaga
961 aagagatcac tatcgtctct gcgcttgcgc gcgtaaagcg ggattttgaa tacctaagta
1021 cgcgattcag ccaaacgcaa gtacttgcag ctgaaagtgg ggatgcagtg actgacggca
1081 ttatcacaga catactgagg acgttgggaa aggccatttg gatgcttggt gctaccctga
1141 aagcctaggt agagcaggct gtacgtacaa cacacgtacg gccatgcgct ggaagtcctg
1201 tattttgcac ataaggcctc tctcccgtta cagcatgagg ggagggaggt gttggttgaa
1261 gtgcttgggg aagtgtgcat aatcgtccta cggaaggggg cgttttgtgg aaaaaattgt
1321 t
//
LOCUS YS5MTII 1257 bp ds-DNA PLN 07-MAR-1990
DEFINITION C.glabrata metallothionein (MTII) gene, complete cds.
ACCESSION J05398
KEYWORDS metallothionein.
SOURCE C.glabrata (ATCC 2001) DNA.
ORGANISM Candida glabrata
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina.
REFERENCE 1 (bases 1 to 1257)
AUTHORS Mehra,R.K., Garey,J.R. and Winge,D.R.
TITLE Selective and tandem amplification of a member of the
metallothionein gene family in Candida glabrata
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.R.Garey 07-MAR-1990.
FEATURES from to/span description
pept 631 789 metallothionein (MTII)
rpt 1246 > 1257 amplifiable repeat
BASE COUNT 420 a 255 c 199 g 383 t
ORIGIN
1 ccccgggatc gtttagaccc caagaatcca agatatgttt tagctgacag ggtgcatggc
61 tactccgtga tttaccatat gtccatcgca ttcgtcatgg cctgatgcca atgaccatac
121 tctttagact cggatccgag aagtcatccc tatccatacg atatcttctt ccgagtccat
181 tagataaggc aacgctagat ttagctgatt gattgccctg agaattaaag ggaatcagcg
241 aaaatagaaa aaaaggtttt ggatggatat ttttgttctt tttttgagac attggtgtca
301 ctgccacagc ttaataacat ggcaatcagc agtgtctaga ttcagcggaa tttagtggaa
361 gcgaaatgaa ttcggctgac ttagtgccta tcagtaatat acatatttgc tttttgcttt
421 ggcattgaat ttatcaataa ctgcataccc aaaattaaaa ataaaataga aaaaatagta
481 atatataaag cacccgtaac ttcccatttc tgggaaactt ggaattcatt tctcccattt
541 atcctttctt ctatatatcg aatcaacaca tcaacaatac caacaaactt caacttatac
601 acaacatcta atatttatta tagcttcgaa atgcctgaac aagtcaactg ccaatacgat
661 tgccactgct ccaactgtgc ttgtgaaaat acttgcaact gctgtgccaa gccagcatgt
721 gcttgcacaa actctgcttc caatgaatgc tcctgccaaa cttgcaagtg tcaaacatgc
781 aagtgctaaa cagcattcaa agaagaataa tttctagtat ttttgctatt gttgctttat
841 ttttcatgat tattgataag gtactgggta tcttctgcta atgacgctga actagctcta
901 tatgctagct atatacatta cgtataaaaa aatataaaaa tataaaaata taaaaaaaaa
961 ataaaaaaat cacttcaaaa actataacaa ctaccaagtt tctattttct ctttgactaa
1021 atcaaatcga tctagtaaac attgcagaac aatttgtgta attatcgtct gatctaaggg
1081 ccggcgtaag caattctaac cgaccacact acgttactta acaaccacac ttgctcttga
1141 atacgaaaca gcaagctcta cctatatgaa atggtctaca aaagaagttt tctctagtac
1201 tctgagaagg ttatattata catcattgaa atgcttaaag ctgatttgca tcccggg
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATA2UGLB 506 bp ds-DNA ROD 13-FEB-1990
DEFINITION Rat alpha-2u globulin gene, exon 1.
ACCESSION M33690 M32014
KEYWORDS alpha-2u globulin.
SOURCE Rat (strain Wistar-Lewis) adult liver DNA, clone 1H.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 138 to 506)
AUTHORS Shaw,P.H., Walter-Sierra,R., Tamone,F. and Schibler,U.
TITLE Rapid identification of DNA fragments containing promoters for RNA
polymerase II
JOURNAL Gene 84, 371-381 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 137)
AUTHORS Tamone,F.
JOURNAL Unpublished (1990) Dept. Mol. Biol. U. of Geneva, Geneva, SWITZ.
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Tamone, 07-FEB-1990.
FEATURES from to/span description
pept 436 > 506 alpha-2u globulin, exon 1
mRNA 377 > 506 alpha-2u globulin mRNA
signal 1 376 promoter
BASE COUNT 146 a 120 c 117 g 123 t
ORIGIN Chromosome 5.
1 cttcttccac actccactca agtcttgaag caaagattct ctccatgctt gaccttccag
61 ttcagtaccc acccactcca taactggttt tctgattcta agtcagatcc aatgttgcat
121 gtggttccag atggccctct agaacaccca ctgtttttct tgggaatatg ttttgcgaaa
181 tgtatgagtg atagaatcaa tccataggag atgacatcgc caagtttcaa aagggcagga
241 acaatccttg gcttcacatc agtacatgag gaaaacattc cacaaagcct gaaggatgga
301 ggcccatatg agaaggaaaa aaaaacactc aaacccagag agagtataaa gacgagcaaa
361 gtgctggagg tggagtgtgg gcaccatcag cagagggatt gtcccgacag agaggcaatt
421 ctattcccta ccaacatgaa gctgttgctg ctgctgctgt gtctgggcct gaccctggtc
481 tgcggccatg cagaagaagc tagttt
//
LOCUS RATA2UGLBA 371 bp ds-DNA ROD 13-FEB-1990
DEFINITION Rat alpha 2u globulin pseudogene, exon 1.
ACCESSION M33652 M32014
KEYWORDS alpha-2u globulin.
SOURCE Rat (strain Wistar-Lewis) adult liver DNA, clone 1F.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 371)
AUTHORS Shaw,P.H., Walter-Sierra,R., Tamone,F. and Schibler,U.
TITLE Rapid identification of DNA fragments containing promoters for RNA
polymerase II
JOURNAL Gene 84, 371-381 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F. Tamone, 07-FEB-1990.
FEATURES from to/span description
pept.ps 301 > 371 pseudo-alpha-2u globulin, exon 1
mRNA 243 > 371 alpha-2u globulin mRNA
BASE COUNT 119 a 71 c 96 g 85 t
ORIGIN Chromosome 5.
1 tctagaacat ccattgtttt tcttgggaat atgttttgtg aaatgtatga gtgataaaat
61 catccatagg agatgagatt gcccaagttt gaaaagggca ggaacaatcc ttggcttcac
121 atcagtacat gagaaaacat tccacaaagc ctgagggaag gaggcccata tgagaaggaa
181 aaattaaaaa acactcgacc cagagagagt ataaagagga gcaaagtgct tgaggtggag
241 tgtgggcacc atcaacagag agagtgtccc gacagagagg caattctatt ccctaccaac
301 atgaagctgt agctgctgct gctgtgtctg ggcctgaccc tggtctgtgg ccatgcagaa
361 gaagctaatt c
//
LOCUS RIRSTPSTA 2906 bp ds-DNA BCT 12-FEB-1990
DEFINITION R.tsutsugamushi heat shock protein 11 (stp11) and major antigen 58
(sta58) genes, complete cds.
ACCESSION M31887
KEYWORDS antigen; heat shock protein; heat shock protein 11;
major antigen 58; outer membrane protein.
SOURCE R.tsutsugamushi (strain KARP) DNA, clone pRTS58H2.9.
ORGANISM Rickettsia tsutsugamushi
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE 1 (bases 1 to 2906)
AUTHORS Stover,C.K., Marana,D.P., Dasch,G.A. and Oaks,E.V.
TITLE Molecular cloning and sequence analysis of the sta58 major antigen
of Rickettsia tsutsugamushi: Sequence homology and antigenic
comparison to the 60 kilodalton family of stress proteins
JOURNAL Unpublished (1990) Walter Reed Hospital, Washington,D.C. 20307
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.V.Oaks, 02-FEB-1990.
FEATURES from to/span description
pept 780 1064 heat shock protein 11
pept 1137 2804 major antigen 58
binding 770 775 ribosome binding site
signal 675 680 -35 promoter
signal 700 705 -10 promoter
BASE COUNT 1083 a 354 c 599 g 870 t
ORIGIN
1 aagctttgca acattaattg ccaattcttt tgacagtaat gtaattgcgg ttagaaaaag
61 tgactgaaac ctcagaaaaa agaattgtag gagttaattg cttaatacaa tatttacatc
121 taatagtaaa acatttgact ttcttagttt taattttagg ctagtgaaaa ctgatcaatc
181 taaaattaaa attcagaaaa atgtctagaa cgagagcaca aacttctctg ctaaaaatac
241 taaaagaaat attctgcaag tttatattgt aaccaattga tagagtaata tatcttatca
301 accagatatt acatggcttg ataaattact ttaggattgg taactctagc agttgtttta
361 gatatgtaat ggacttgttt gataagatag taagacatca tctaatgcaa tcaagtggat
421 tgaaaggctt ttgctggaaa aggttgtgta gaagaaagtt atatgaaaag gtggaattat
481 ataatgatta ccaaattcga tagatctgtg gtaagcaata acttctgata ggctataagc
541 aatgatatag ctaaaccagt ataaaacagt tatgttatat actttttcaa attaattaag
601 aaaaacaatt agtgcgatat taatggttta aactcagcgt ataaaagaaa aatataaaaa
661 ttttttattc acacttgaaa taagtagtaa cagaagatat atgattaggg catagtaaaa
721 gtatagatta actaattatg ctaatttaat tatatagagt taaatttttg gaggattata
781 tgaaatacca accactgtat gatcgtgtgc tagttgagcc aatacaaaat gatgaagcac
841 atggaaaaat tcttattcca gataccgcaa aggagaaacc aacagaagga atagtagtta
901 tggttggcgg tggctataga aatgataaag gtgatattac accgttaaaa gtaaagaaag
961 gagatactat agtctataca aaatgggctg gtactgaaat aaaattagaa agtaaagact
1021 atgtagttat taaagaatca gatatattat tagttaaaag ctaaaatgct taatgtatta
1081 ataattaaag gatgctagac taatcaaatt ataaaatatt ttaaggagtt ttcagtatga
1141 gtaaacaaat tgtacatggc gatcaatgtc gtaaaaaaat aatagaaggc ataaatgttg
1201 tagctaatgc agtaggtata acattaggac ccaaaggtag atgtgtagca atagaacaat
1261 cttatggtcc tcctaaaatt actaaagatg gagttagtgt tgctaaagca attcaactta
1321 aggataaatc tcttaatgtt ggagcccaat ttgttatatc agttgctagt aaaacagctg
1381 atgtggctgg tgatggtaca actactgcta cagttatagc tgatgctgca gttagggagc
1441 taaataaagc tgaagttgca ggcattgata tacaggaagt aagaaaaggc gctgagaaag
1501 ctgttgaagc agttattgct gatgtacgta aaaacagcag ccctgttaaa aatgaggaag
1561 aaattgctca agtagctact gtttcatcta atggagaccg cgaaattggt gaaaagattg
1621 ctaatgcaat gaaacaagtt ggccaagaag gagtgataac tgttgaagat tctaaaaatt
1681 ttaactttga agttgaagtt gttaaaggaa tgagatttga ccgcggatat atatcacagt
1741 actttgcaac gaatcgtgaa aagatgatta ccgaatttga aaatccttat attttgctat
1801 tggatcagaa ggtatctaca gtgcagccac tggttcctgt gcttgaagct gttgctcaca
1861 ctggcaagcc attagtattg attgctgatg atgtagacgg agaagctctt actgcattga
1921 tattaaacaa tttaaaaggt tctattaaag tagtagctgt taaggcacct ggttttggtg
1981 atcgaaaaaa agaaatgtta gaagatattg ctattttaac aaatggagaa gtaattactg
2041 aacaactagg aatcaaacta gaaaaggtta atgatacatc taagttagga actgcaaata
2101 gagttattgt tacaaaagat catactacta tagttcatga taagaataat agtgatatag
2161 agaagaaagt taattctcgt tgtgaacaga tccgtgaagc aattaaagat acaacttcag
2221 attatgaaaa agaaaaatta caagagcgct tagcaaaact tagaaacggt gtagctgtac
2281 taaaagttgg tggtgctact gaagtagaac aaaaagaacg taaggataga gtagaagatg
2341 ctttacatgc tactagagca gctgttgaag aaggtatagt gcctggtggt ggtgttgcgc
2401 tattttatgc ttctcgtgtt ttagatagtt tgaagtttga taatgaagat caaagggtag
2461 gtattaacat tatcaaaaaa gtattagaag ctccggtaag acaaatagtt aaaaatgctg
2521 gaggcaaaga ggatgttgtt gttaatgaac taagcaagag tacagataaa aaccgtggat
2581 tcgatgctag aacaatgcaa tacgttgata tgataaaagc aggtattgtg gatccaacta
2641 aagttgtacg tactgcgtta caagatgctt tttctgtggc ttcattagta attgctacct
2701 cagctatgat tactgatcat gaggaagata ataatactgg taaccgcagt ggtggtggag
2761 taggtggtgg acatcatggt ggtatgggcg gaatggattt ctaatccgcc cccatgtcta
2821 ccttcaaaaa aaatagagta taacttttgt ataaggttgc tagttatgta tttttttaca
2881 aaattgcagt ataatatttc aagctt
//
LOCUS YSCTUP1A 2891 bp ds-DNA PLN 12-FEB-1990
DEFINITION S.cerevisiae TUP1 protein gene, complete cds.
ACCESSION M31733
KEYWORDS regulatory protein.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2891)
AUTHORS Williams,F.E. and Trumbly,R.J.
TITLE Characterization of TUP1: A mediator of glucose repression in
Saccharomyces cerevisiae
JOURNAL Unpublished (1990) Dept. Biochem. Ohio Med. Coll., Toledo, OH 43699
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
R.J.Trumbly, 30-JAN-1990.
FEATURES from to/span description
pept 509 2650 TUP1 protein
mRNA 234 2780 TUP1 mRNA
signal 108 112 TATA box
BASE COUNT 899 a 700 c 530 g 762 t
ORIGIN Chromosome III.
1 tgtacatccc aaagacataa acatatgtat ataagaacaa cataaacaaa taaataaacc
61 aggaaagccc taatttaccc ttctttttgg ctgagcaatc taggagttaa taattttttg
121 gaactttccc cctaacattt ttagagacag aaccctctta cattatcgct accgacggac
181 ctttaatttt tcgttacttc tcaaaaaaca ctgaaaaact gcgaaaaaat taaaaagctc
241 tcccgtcaaa gcaacaaaca acaaaacaaa ataacaagta cataagtaaa caaatagaga
301 gaaattcccc catctacccc caattgataa acatcagcga agcaagaaca actggctgaa
361 cacgtgcccc tctatctgtc cttctgcaat tgtttaattt tctttcacgc caagttacct
421 ttcgcacacc actactcttt ttctattgtt tttttgtctt ttttgataag caggggaaga
481 aagaaatcag ctttccatcc aaaccaatat gactgccagc gtttcgaata cgcagaataa
541 gctgaatgag cttctcgatg ccatcagaca ggagtttctc caagtctcac aagaggcaaa
601 tacctaccgt cttcaaaacc aaaaggatta cgatttcaaa atgaaccagc agctggctga
661 gatgcagcag ataagaaaca ccgtctacga actggagcta actcacagga aaatgaagga
721 cgcgtacgaa gcagagatca agcacttgaa actagggctg gagcaaagag accatcaaat
781 tgcatctttg accgtccagc aacagcagca acagcaacag cagcaacagg tccagcagca
841 tttacaacag caacaacagc agctagccgc tgcatctgca tctgttccag ttgcgcaaca
901 accaccggct actacttcgg ccaccgccac tccagcagca aacacaacta ctggttcgcc
961 atcggccttc ccagtacaag ctagccgtcc taatctggtt ggctcacagt tgcctaccac
1021 cactttgcct gtggtgtcct caaacgccca acaacaacta ccacaacagc aactgcaaca
1081 gcagcaactt caacaacagc aaccacctcc ccaggtttcc gtggcaccat tgagtaacac
1141 agccatcaac ggatctccta cttctaaaga gaccactact ttaccctctg tcaaggcacc
1201 tgaatctacg ttgaaagaaa ctgaaccgga aaataataat acctcgaaga taaatgacac
1261 cggatccgcc accacggcca ccactaccac cgcaactgaa actgaaatca aacctaagga
1321 ggaagacgcc accccggcta gtttgcacca ggatcactac ttagtccctt ataatcaaag
1381 agcaaaccac tctaaaccta tcccaccttt ccttttggat ctagattccc agtctgttcc
1441 cgatgctctg aagaagcaaa caaatgatta ttatatttta tacaacccgg cactaccaag
1501 agaaattgac gttgagttac acaaatcttt ggatcatact tcagttgttt gttgcgtgaa
1561 gttcagtaac gatggtgaat acttagccac aggctgcaac aaaactactc aagtgtatcg
1621 cgtttcagac ggttctctgg tggcccgttt atctgacgat tctgctgcca ataaccatcg
1681 aaattcgatc actgaaaata acaccaccac gtccacggat aacaatacaa tgacaaccac
1741 tactaccacc acaattacta ccacagcgat gacttcggca gcagaattgg caaaagatgt
1801 ggaaaacctg aacacttcgt cttccccatc atccgacttg tatatccgtt cagtgtgttt
1861 ttctccagat gggaaatttt tggcaacagg tgctgaagac agactgatta gaatttggga
1921 tattgaaaat agaaagattg ttatgattct tcaaggccac gaacaagata tttattcatt
1981 ggactacttt ccctcaggtg acaaattagt ctccggttct ggtgaccgta ccgttcgtat
2041 ttgggactta cgtacaggcc agtgttcatt gactttatcc attgaagatg gtgttaccac
2101 cgtcgctgta tcaccaggtg atggtaaata catcgctgct ggttctctag atcgtgctgt
2161 gagagtttgg gattccgaga ccggattctt ggtggaaaga ctagattcgg aaaacgaatc
2221 cggtacaggc cacaaggact ctgtttatag cgttgtcttc acgagagatg gacaaagcgt
2281 tgtatccggc tcattagata gatctgttaa gctctggaat ttgcagaatg caaacaacaa
2341 gagcgattcg aaaactccaa attccggcac ttgtgaagtt acgtatatcg ggcataaaga
2401 ctttgtattg tccgtggcca ccacacaaaa tgatgagtac atcttgtccg gttccaaaga
2461 tcgtggtgtc ctgttttggg ataagaaatc cggcaatccg ttattgatgt tgcaaggtca
2521 taggaattca gttatatctg tggctgtggc aaacgggtct tcgctgggtc cagaatataa
2581 cgtttttgct actggtagcg gtgattgtaa agcaaggatt tggaagtata aaaaaatagc
2641 gccaaattaa gaacagaaca caaaaggaac actttacaaa tgtaactaac taaactattc
2701 ttcaattcat tgtcattata tctttctctc tattcttttt tatctactta tttgttatgt
2761 tatgatccag gtacgcttta aaacaaacag ttactttcct caccattcat cccttcccct
2821 ccttttcctg tgtctgaata ttgcggttca ttattctttg atatgaattt atttgtgtat
2881 tgtataagct t
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS PFAMSAA 433 bp ds-DNA INV 13-FEB-1990
DEFINITION P.falciparum (strain B358) major surface antigen p190 gene,
partial cds.
ACCESSION M32111
KEYWORDS major surface antigen p190.
SOURCE P.falciparum (strain B358) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 433)
AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
TITLE Genetic diversity in the major merozoite surface antigen of
Plasmodium falciparum: High prevalence of a third polymorphic form
detected in strains derived from malaria patients
JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A. Scherf, 14-FEB-1990.
FEATURES from to/span description
pept < 1 > 433 major surface antigen p190 (AA at 1)
BASE COUNT 148 a 75 c 83 g 127 t
ORIGIN
1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atggtttatt tcataaggaa
61 aaaatgatct taaatgaaga agaaattact acaaaaggtg caagtgctca aggttcaagt
121 ggtccaagtg gtacaccaag tggtacaagt ggtccaagtg gtacaagtgg tccaagtggt
181 acaagtggtc caagtggtac aagtccatca tctcgttcaa acactttacc tcgttcaaat
241 acttcatctg gtgcaagccc tccagctgat gcaagcgatt cagatgctaa atcttacgct
301 gatttaaaac acagagtacg aaattacttg ttcactatta aagaactcaa atatcccgaa
361 ctctttgatt taaccaatca tatgttaact ttgtgtgata atattcatgg tttcaaatat
421 ttaattgatg gat
//
LOCUS PFAMSAB 388 bp ds-DNA INV 13-FEB-1990
DEFINITION P.falciparum (strain B342) major surface antigen p190 gene,
partial cds.
ACCESSION M32112
KEYWORDS major surface antigen p190.
SOURCE P.falciparum (strain B342) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 388)
AUTHORS Kimura,E., Mattei,D., Mana di Santi,S.A. and Scherf,
TITLE Genetic diversity in the major merozoite surface antigen of
Plasmodium falciparum: High prevalence of a third polymorphic form
detected in strains derived from malaria patients
JOURNAL Unpublished (1990) Inst. pasteur, Paris Cedex 15
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A. Scherf, 14-FEB-1990.
FEATURES from to/span description
pept < 1 > 338 major surface antigen p190 (AA at 1)
BASE COUNT 138 a 63 c 70 g 117 t
ORIGIN
1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa
61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagcctgc aggtgctgta
121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact
181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat
241 gctaaatctt acgctgattt aaaacataga gctcaaaatt acttgttcac tattaaagaa
301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttacc taaaaatgtt
361 gatggtttca aatatttaat tgatggat
//
LOCUS PFAMSAC 379 bp ds-DNA INV 14-FEB-1990
DEFINITION P.falciparum (strain B608) major surface antigen p190 gene,
partial cds.
ACCESSION M32113
KEYWORDS major surface antigen p190.
SOURCE P.falciparum (strain B608) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 379)
AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
TITLE Genetic diversity in the major merozoite surface antigen of
Plasmodium falciparum: High prevalence of a third polymorphic form
detected in strains derived from malaria patients
JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A. Scherf, 14-FEB-1990.
FEATURES from to/span description
pept < 1 > 379 major surface antigen p190 (AA at 1)
BASE COUNT 135 a 64 c 62 g 118 t
ORIGIN
1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atggtttatt tcataaggaa
61 aaaatgctct taaatgaaga agaaattact acaaaaggtg caagtgctca aagtggtaca
121 agttgtacaa gtggtccaag tggtacaagt ccatcatctc gttcaaacac tttacctcgt
181 tcaaatactt catctggtgc aagccctcca gctgatgcaa gcgattcaga tgctaaatct
241 tactctgatt taaaacacag agtacgaaat tacttgttaa ctatcaaaga actcaaatat
301 cctcaactct tttatttaac taatcatatg ttgactttat ctaaaaatgt tgatggtttc
361 aaatatttaa ttgatggat
//
LOCUS PFAMSAD 388 bp ds-DNA INV 13-FEB-1990
DEFINITION P.falciparum (strain B440) major surface antigen p190 gene,
partial cds.
ACCESSION M32114
KEYWORDS major surface antigen p190.
SOURCE P.falciparum (strain B440) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 388)
AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
TITLE Genetic diversity in the major merozoite surface antigen of
Plasmodium falciparum: High prevalence of a third polymorphic form
detected in strains derived from malaria patients
JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A. Scherf, 14-FEB-1990.
FEATURES from to/span description
pept < 1 > 388 major surface antigen p190 (AA at 1)
BASE COUNT 138 a 62 c 69 g 119 t
ORIGIN
1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa
61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagccagt acctgctgta
121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact
181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat
241 gctaaatctt acgctgattt aaaacataga gttcaaaatt acttgttcgc tattaaagaa
301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttatc taaaaatgtt
361 gatggtttca aatatttaat tgatggat
//
LOCUS PFAMSAE 388 bp ds-DNA INV 13-FEB-1990
DEFINITION P.falciparum (strain B411) major surface antigen p190 gene,
partial cds.
ACCESSION M32115
KEYWORDS major surface antigen p190.
SOURCE P.falciparum (strain B411) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 388)
AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
TITLE Genetic diversity in the major merozoite surface antigen of
Plasmodium falciparum: High prevalence of a third polymorphic form
detected in strains derived from malaria patients
JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A. Scherf, 14-FEB-1990.
FEATURES from to/span description
pept < 1 > 388 major surface antigen p190 (AA at 1)
BASE COUNT 138 a 61 c 70 g 119 t
ORIGIN
1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa
61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagcctgc aggtgctgta
121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact
181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat
241 gctaaatctt acgctgattt aaaacataga gttcaaaatt acttgttcac tattaaagaa
301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttatc taaaaatgtt
361 gatggtttca aatatttaat tgatggat
//
LOCUS PFAMSAF 388 bp ds-DNA INV 14-FEB-1990
DEFINITION P.falciparum (strain B439) major surface antigen p190 gene,
partial cds.
ACCESSION M32116
KEYWORDS major surface antigen p190.
SOURCE P.falciparum (strain B439) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 388)
AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
TITLE Genetic diversity in the major merozoite surface antigen of
Plasmodium falciparum: High prevalence of a third polymorphic form
detected in strains derived from malaria patients
JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A. Scherf, 14-FEB-1990.
FEATURES from to/span description
pept < 1 > 388 major surface antigen p190 (AA at 1)
BASE COUNT 138 a 61 c 70 g 119 t
ORIGIN
1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa
61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagcctgc aggtgctgta
121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact
181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat
241 gctaaatctt acgctgattt aaaacataga gttcaaaatt acttgttcac tattaaagaa
301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttatc taaaaatgtt
361 gatggtttca aatatttaat tgatggat
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS HPEPP 2570 bp ss-RNA VRL 13-APR-1990
DEFINITION Hepatitis E virus polyprotein (encoding NTP-binding protein and
RNA-dependent RNA polymerase) mRNA, 3' end.
ACCESSION M32400
KEYWORDS NTP-binding protein; NTP-binding protein; polyprotein.
SOURCE Hepatitis E virus (strain Burma; enterically transmitted non-A,
non-B), cDNA to viral RNA, (host human liver and bile).
ORGANISM Hepatitis E virus
Viridae; ss-RNA nonenveloped viruses.
REFERENCE 1 (bases 1 to 2570)
AUTHORS Reyes,G.R., Purdy,M.A., Kim,J.P., Luk K,-C., Young,L.M.,
Fry,K.E. and Bradley,D.
TITLE Isolation of a cDNA from the virus responsible for enterically
transmitted non-A, non-B hepatitis
JOURNAL Science 247, 1335-1339 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2570)
AUTHORS Tam,A.W., Smith,M.M., Kim J,P., Young,L.M., Piatak,M.,
Feldman,R.A., Purdy,M.A., Bradley,D.W., Reyes,G.R. and Fry,K.E.
TITLE Hepatitis E virus: A positive strand RNA virus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Authorin entry and computer-readable sequence for [1] kindly
submitted by K.E.Fry, 02-MAR-1990.
FEATURES from to/span description
pept < 1 2184 polyprotein (NTP-binding protein/RNA-dependent
polymerase region) (AA at 1)
BASE COUNT 429 a 800 c 711 g 630 t
ORIGIN
1 gttgttcagt accagtttac tgcaggtgtg cctggatccg gcaagtcccg ctctatcacc
61 caagccgatg tggacgttgt cgtggtcccg acgcgtgagt tgcgtaatgc ctggcgccgt
121 cgcggctttg ctgcttttac cccgcatact gccgccagag tcacccaggg gcgccgggtt
181 gtcattgatg aggctccatc cctcccccct cacctgctgc tgctccacat gcagcgggcc
241 gccaccgtcc accttcttgg cgacccgaac cagatcccag ccatcgactt tgagcacgct
301 gggctcgtcc ccgccatcag gcccgactta ggccccacct cctggtggca tgttacccat
361 cgctggcctg cggatgtatg cgagctcatc cgtggtgcat accccatgat ccagaccact
421 agccgggttc tccgttcgtt gttctggggt gagcctgccg tcgggcagaa actagtgttc
481 acccaggcgg ccaagcccgc caaccccggc tcagtgacgg tccacgaggc gcagggcgct
541 acctacacgg agaccactat tattgccaca gcagatgccc ggggccttat tcagtcgtct
601 cgggctcatg ccattgttgc tctgacgcgc cacactgaga agtgcgtcat cattgacgca
661 ccaggcctgc ttcgcgaggt gggcatctcc gatgcaatcg ttaataactt tttcctcgct
721 ggtggcgaaa ttggtcacca gcgcccatca gttattcccc gtggcaaccc tgacgccaat
781 gttgacaccc tggctgcctt cccgccgtct tgccagatta gtgccttcca tcagttggct
841 gaggagcttg gccacagacc tgtccctgtt gcagctgttc taccaccctg ccccgagctc
901 gaacagggcc ttctctacct gccccaggag ctcaccacct gtgatagtgt cgtaacattt
961 gaattaacag acattgtgca ctgccgcatg gccgccccga gccagcgcaa ggccgtgctg
1021 tccacactcg tgggccgcta cggcggtcgc acaaagctct acaatgcttc ccactctgat
1081 gttcgcgact ctctcgcccg ttttatcccg gccattggcc ccgtacaggt tacaacttgt
1141 gaattgtacg agctagtgga ggccatggtc gagaagggcc aggatggctc cgccgtcctt
1201 gagcttgatc tttgcaaccg tgacgtgtcc aggatcacct tcttccagaa agattgtaac
1261 aagttcacca caggtgagac cattgcccat ggtaaagtgg gccagggcat ctcggcctgg
1321 agcaagacct tctgcgccct ctttggccct tggttccgcg ctattgagaa ggctattctg
1381 gccctgctcc ctcagggtgt gttttacggt gatgcctttg atgacaccgt cttctcggcg
1441 gctgtggccg cagcaaaggc atccatggtg tttgagaatg acttttctga gtttgactcc
1501 acccagaata acttttctct gggtctagag tgtgctatta tggaggagtg tgggatgccg
1561 cagtggctca tccgcctgta tcaccttata aggtctgcgt ggatcttgca ggccccgaag
1621 gagtctctgc gagggttttg gaagaaacac tccggtgagc ccggcactct tctatggaat
1681 actgtctgga atatggccgt tattacccac tgttatgact tccgcgattt tcaggtggct
1741 gcctttaaag gtgatgattc gatagtgctt tgcagtgagt atcgtcagag tccaggagct
1801 gctgtcctga tcgccggctg tggcttgaag ttgaaggtag atttccgccc gatcggtttg
1861 tatgcaggtg ttgtggtggc ccccggcctt ggcgcgctcc ctgatgttgt gcgcttcgcc
1921 ggccggctta ccgagaagaa ttggggccct ggccctgagc gggcggagca gctccgcctc
1981 gctgttagtg atttcctccg caagctcacg aatgtagctc agatgtgtgt ggatgttgtt
2041 tcccgtgttt atggggtttc ccctggactc gttcataacc tgattggcat gctacaggct
2101 gttgctgatg gcaaggcaca tttcactgag tcagtaaaac cagtgctcga cttgacaaat
2161 tcaatcttgt gtcgggtgga atgaataaca tgtcttttgc tgcgcccatg ggttcgcgac
2221 catgcgccct cggcctattt tgttgctgct cctcatgttt ttgcctatgc tgcccgcgcc
2281 accgcccggt cagccgtctg gccgccgtcg tgggcggcgc agcggcggtt ccggcggtgg
2341 tttctggggt gaccgggttg attctcagcc cttcgcaatc ccctatattc atccaaccaa
2401 ccccttcgcc cccgatgtca ccgctgcggc cggggctgga cctcgtgttc gccaacccgc
2461 ccgaccactc ggctccgctt ggcgtgacca ggcccagcgc cccgccgttg cctcacgtcg
2521 tagacctacc acagctgggg ccgcgccgct aaccgcggtc gctccggccc
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATTNT 17249 bp ds-DNA ROD 20-MAY-1987
DEFINITION Rat fast skeletal TnT gene encoding troponin T isoforms, complete
cds.
ACCESSION M15202 M14514 M14515 M14516 M14517 M14518 M14519 M14520 M14521
M14522 M14523 M14524 M14525 M14527 M14528 M14529 M14530 M14531
M10059 M14532 M14533 M14534 K02429 J00794
KEYWORDS alternate splicing; troponin.
SOURCE Rat adult skeletal muscle DNA [2],[4], clone pTnT-1 [2]; myogenic
cell line L6E9, cDNA to mRNA, clone pTnT-15 [1],[2],[3].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 12798 to 16893; exons 12, 13, 14, 15, 17, 18)
AUTHORS Garfinkel,L.I., Periasamy,M. and Nadal-Ginard,B.
TITLE Cloning and characterization of cDNA sequences corresponding to
myosin light chains 1, 2, and 3, troponin-C, troponin-T,
alpha-tropomyosin, and alpha-actin
JOURNAL J. Biol. Chem. 257, 11078-11086 (1982)
STANDARD full staff_review
REFERENCE 2 (bases 13478 to 16906)
AUTHORS Medford,R.M., Nguyen,H.T., Destree,A.T., Summers,E. and
Nadal-Ginard,B.
TITLE A novel mechanism of alternative RNA splicing for the
developmentally regulated generation of troponin T isoforms from a
single gene
JOURNAL Cell 38, 409-421 (1984)
STANDARD full staff_review
REFERENCE 3 (bases 313 to 16893; exons and intron/exon boundaries)
AUTHORS Breitbart,R.E., Nguyen,H.T., Medford,R.M., Destree,A.T.,
Mahdavi,V. and Nadal-Ginard,B.
TITLE Intricate combinatorial patterns of exon splicing generate multiple
regulated troponin T isoforms from a single gene
JOURNAL Cell 41, 67-82 (1985)
STANDARD full staff_review
REFERENCE 4 (bases 1 to 17249)
AUTHORS Breitbart,R.E. and Nadal-Ginard,B.
TITLE Complete nucleotide sequence of the fast skeletal troponin T gene:
Alternatively spliced exons exhibit unusual interspecies divergence
JOURNAL J. Mol. Biol. 188, 313-324 (1986)
STANDARD full staff_review
REFERENCE 5 (bases 14620 to 14621; sequence correction)
AUTHORS Breitbart,R.E.
JOURNAL Unpublished (1990) Children's Hosp., Boston MA 02115
STANDARD full staff_review
COMMENT Other exon combinations, besides the ones given in the Features
table, are possible in theory, but no mRNAs corresponding to the
right lengths were found [3]. There still is a question if there
might be an alternate exon 9, which would again double the possible
exon combinations and proteins. Even though no proteins have been
identified for each and every possible protein described below,
mRNAs of the appropriate lengths have been identified. The
variation in mRNAs encoding tropomyosin T may be due to different
half-lives at particular developmental stages [3].
Distinct similarities are found among exons 3, 4, 5, 6, and 8 [4].
These exons may have originated by gene duplication [4]. The
variable region of troponin T is represented by exons 4-8 and the
constant regions by exons 2 and 3, 9-15 and 18 [3],[4]. Exons 16
and 17 are mutually exclusive [3],[4]. When exon 16 is translated
the alpha forms of the proteins are found, and when exon 17 is
expressed the beta forms are produced [3],[[4].
Class Ia mRNAs are expressed only after the first postnatal week;
class IVb mRNAs are found in L6E9 myotubes, but not in the fetus
and are then increasingly expressed postnatally. Class IIa mRNAs
increase continually through 14 days.
FEATURES from to/span description
pept 3146 3162 tropomyosin T class Ia alpha-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ia alpha-1, exon 3
4313 4330 tropomyosin T class Ia alpha-1, exon 4
5538 5555 tropomyosin T class Ia alpha-1, exon 5
5737 5751 tropomyosin T class Ia alpha-1, exon 6
6722 6733 tropomyosin T class Ia alpha-1, exon 7
6935 6949 tropomyosin T class Ia alpha-1, exon 8
9703 9721 tropomyosin T class Ia alpha-1, exon 9
11980 12025 tropomyosin T class Ia alpha-1, exon 10
12591 12707 tropomyosin T class Ia alpha-1, exon 11
12795 12872 tropomyosin T class Ia alpha-1, exon 12
13082 13195 tropomyosin T class Ia alpha-1, exon 13
13295 13404 tropomyosin T class Ia alpha-1, exon 14
13515 13605 tropomyosin T class Ia alpha-1, exon 15
14613 14653 tropomyosin T class Ia alpha-1, exon 16
16666 16720 tropomyosin T class Ia alpha-1, exon 17
pept 3146 3162 tropomyosin T class Ia alpha-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ia alpha-2, exon 3
5538 5555 tropomyosin T class Ia alpha-2, exon 4
5737 5751 tropomyosin T class Ia alpha-2, exon 5
6722 6733 tropomyosin T class Ia alpha-2, exon 6
6935 6949 tropomyosin T class Ia alpha-2, exon 7
9703 9721 tropomyosin T class Ia alpha-2, exon 8
11980 12025 tropomyosin T class Ia alpha-2, exon 9
12591 12707 tropomyosin T class Ia alpha-2, exon 10
12795 12872 tropomyosin T class Ia alpha-2, exon 11
13082 13195 tropomyosin T class Ia alpha-2, exon 12
13295 13404 tropomyosin T class Ia alpha-2, exon 13
13515 13605 tropomyosin T class Ia alpha-2, exon 14
14613 14653 tropomyosin T class Ia alpha-2, exon 15
16666 16720 tropomyosin T class Ia alpha-2, exon 16
pept 3146 3162 tropomyosin T class Ib alpha-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ib alpha-1, exon 3
4313 4330 tropomyosin T class Ib alpha-1, exon 4
5737 5751 tropomyosin T class Ib alpha-1, exon 5
6722 6733 tropomyosin T class Ib alpha-1, exon 6
6935 6949 tropomyosin T class Ib alpha-1, exon 7
9703 9721 tropomyosin T class Ib alpha-1, exon 8
11980 12025 tropomyosin T class Ib alpha-1, exon 9
12591 12707 tropomyosin T class Ib alpha-1, exon 10
12795 12872 tropomyosin T class Ib alpha-1, exon 11
13082 13195 tropomyosin T class Ib alpha-1, exon 12
13295 13404 tropomyosin T class Ib alpha-1, exon 13
13515 13605 tropomyosin T class Ib alpha-1, exon 14
14613 14653 tropomyosin T class Ib alpha-1, exon 15
16666 16720 tropomyosin T class Ib alpha-1, exon 16
pept 3146 3162 tropomyosin T class Ib alpha-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ib alpha-2, exon 3
5737 5751 tropomyosin T class Ib alpha-2, exon 4
6722 6733 tropomyosin T class Ib alpha-2, exon 5
6935 6949 tropomyosin T class Ib alpha-2, exon 6
9703 9721 tropomyosin T class Ib alpha-2, exon 7
11980 12025 tropomyosin T class Ib alpha-2, exon 8
12591 12707 tropomyosin T class Ib alpha-2, exon 9
12795 12872 tropomyosin T class Ib alpha-2, exon 10
13082 13195 tropomyosin T class Ib alpha-2, exon 11
13295 13404 tropomyosin T class Ib alpha-2, exon 12
13515 13605 tropomyosin T class Ib alpha-2, exon 13
14613 14653 tropomyosin T class Ib alpha-2, exon 14
16666 16720 tropomyosin T class Ib alpha-2, exon 15
pept 3146 3162 tropomyosin T class Ic alpha-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ic alpha-1, exon 3
4313 4330 tropomyosin T class Ic alpha-1, exon 4
5538 5555 tropomyosin T class Ic alpha-1, exon 5
6722 6733 tropomyosin T class Ic alpha-1, exon 6
6935 6949 tropomyosin T class Ic alpha-1, exon 7
9703 9721 tropomyosin T class Ic alpha-1, exon 8
11980 12025 tropomyosin T class Ic alpha-1, exon 9
12591 12707 tropomyosin T class Ic alpha-1, exon 10
12795 12872 tropomyosin T class Ic alpha-1, exon 11
13082 13195 tropomyosin T class Ic alpha-1, exon 12
13295 13404 tropomyosin T class Ic alpha-1, exon 13
13515 13605 tropomyosin T class Ic alpha-1, exon 14
14613 14653 tropomyosin T class Ic alpha-1, exon 15
16666 16720 tropomyosin T class Ic alpha-1, exon 16
pept 3146 3162 tropomyosin T class Ic alpha-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ic alpha-2, exon 3
4313 4330 tropomyosin T class Ic alpha-2, exon 4
6722 6733 tropomyosin T class Ic alpha-2, exon 5
6935 6949 tropomyosin T class Ic alpha-2, exon 6
9703 9721 tropomyosin T class Ic alpha-2, exon 7
11980 12025 tropomyosin T class Ic alpha-2, exon 8
12591 12707 tropomyosin T class Ic alpha-2, exon 9
12795 12872 tropomyosin T class Ic alpha-2, exon 10
13082 13195 tropomyosin T class Ic alpha-2, exon 11
13295 13404 tropomyosin T class Ic alpha-2, exon 12
13515 13605 tropomyosin T class Ic alpha-2, exon 13
14613 14653 tropomyosin T class Ic alpha-2, exon 14
16666 16720 tropomyosin T class Ic alpha-2, exon 15
pept 3146 3162 tropomyosin T class Ic alpha-3, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ic alpha-3, exon 3
5538 5555 tropomyosin T class Ic alpha-3, exon 4
6722 6733 tropomyosin T class Ic alpha-3, exon 5
6935 6949 tropomyosin T class Ic alpha-3, exon 6
9703 9721 tropomyosin T class Ic alpha-3, exon 7
11980 12025 tropomyosin T class Ic alpha-3, exon 8
12591 12707 tropomyosin T class Ic alpha-3, exon 9
12795 12872 tropomyosin T class Ic alpha-3, exon 10
13082 13195 tropomyosin T class Ic alpha-3, exon 11
13295 13404 tropomyosin T class Ic alpha-3, exon 12
13515 13605 tropomyosin T class Ic alpha-3, exon 13
14613 14653 tropomyosin T class Ic alpha-3, exon 14
16666 16720 tropomyosin T class Ic alpha-3, exon 15
pept 3146 3162 tropomyosin T class Ic alpha-4, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ic alpha-4, exon 3
6722 6733 tropomyosin T class Ic alpha-4, exon 4
6935 6949 tropomyosin T class Ic alpha-4, exon 5
9703 9721 tropomyosin T class Ic alpha-4, exon 6
11980 12025 tropomyosin T class Ic alpha-4, exon 7
12591 12707 tropomyosin T class Ic alpha-4, exon 8
12795 12872 tropomyosin T class Ic alpha-4, exon 9
13082 13195 tropomyosin T class Ic alpha-4, exon 10
13295 13404 tropomyosin T class Ic alpha-4, exon 11
13515 13605 tropomyosin T class Ic alpha-4, exon 12
14613 14653 tropomyosin T class Ic alpha-4, exon 13
16666 16720 tropomyosin T class Ic alpha-4, exon 14
pept 3146 3162 tropomyosin T class Ia beta-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ia beta-1, exon 3
4313 4330 tropomyosin T class Ia beta-1, exon 4
5538 5555 tropomyosin T class Ia beta-1, exon 5
5737 5751 tropomyosin T class Ia beta-1, exon 6
6722 6733 tropomyosin T class Ia beta-1, exon 7
6935 6949 tropomyosin T class Ia beta-1, exon 8
9703 9721 tropomyosin T class Ia beta-1, exon 9
11980 12025 tropomyosin T class Ia beta-1, exon 10
12591 12707 tropomyosin T class Ia beta-1, exon 11
12795 12872 tropomyosin T class Ia beta-1, exon 12
13082 13195 tropomyosin T class Ia beta-1, exon 13
13295 13404 tropomyosin T class Ia beta-1, exon 14
13515 13605 tropomyosin T class Ia beta-1, exon 15
15251 15291 tropomyosin T class Ia beta-1, exon 16
16666 16720 tropomyosin T class Ia beta-1, exon 17
pept 3146 3162 tropomyosin T class Ia beta-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ia beta-2, exon 3
5538 5555 tropomyosin T class Ia beta-2, exon 4
5737 5751 tropomyosin T class Ia beta-2, exon 5
6722 6733 tropomyosin T class Ia beta-2, exon 6
6935 6949 tropomyosin T class Ia beta-2, exon 7
9703 9721 tropomyosin T class Ia beta-2, exon 8
11980 12025 tropomyosin T class Ia beta-2, exon 9
12591 12707 tropomyosin T class Ia beta-2, exon 10
12795 12872 tropomyosin T class Ia beta-2, exon 11
13082 13195 tropomyosin T class Ia beta-2, exon 12
13295 13404 tropomyosin T class Ia beta-2, exon 13
13515 13605 tropomyosin T class Ia beta-2, exon 14
15251 15291 tropomyosin T class Ia beta-2, exon 15
16666 16720 tropomyosin T class Ia beta-2, exon 16
pept 3146 3162 tropomyosin T class Ib beta-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ib beta-1, exon 3
4313 4330 tropomyosin T class Ib beta-1, exon 4
5737 5751 tropomyosin T class Ib beta-1, exon 5
6722 6733 tropomyosin T class Ib beta-1, exon 6
6935 6949 tropomyosin T class Ib beta-1, exon 7
9703 9721 tropomyosin T class Ib beta-1, exon 8
11980 12025 tropomyosin T class Ib beta-1, exon 9
12591 12707 tropomyosin T class Ib beta-1, exon 10
12795 12872 tropomyosin T class Ib beta-1, exon 11
13082 13195 tropomyosin T class Ib beta-1, exon 12
13295 13404 tropomyosin T class Ib beta-1, exon 13
13515 13605 tropomyosin T class Ib beta-1, exon 14
15251 15291 tropomyosin T class Ib beta-1, exon 15
16666 16720 tropomyosin T class Ib beta-1, exon 16
pept 3146 3162 tropomyosin T class Ib beta-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ib beta-2, exon 3
5737 5751 tropomyosin T class Ib beta-2, exon 4
6722 6733 tropomyosin T class Ib beta-2, exon 5
6935 6949 tropomyosin T class Ib beta-2, exon 6
9703 9721 tropomyosin T class Ib beta-2, exon 7
11980 12025 tropomyosin T class Ib beta-2, exon 8
12591 12707 tropomyosin T class Ib beta-2, exon 9
12795 12872 tropomyosin T class Ib beta-2, exon 10
13082 13195 tropomyosin T class Ib beta-2, exon 11
13295 13404 tropomyosin T class Ib beta-2, exon 12
13515 13605 tropomyosin T class Ib beta-2, exon 13
15251 15291 tropomyosin T class Ib beta-2, exon 14
16666 16720 tropomyosin T class Ib beta-2, exon 15
pept 3146 3162 tropomyosin T class Ic beta-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ic beta-1, exon 3
4313 4330 tropomyosin T class Ic beta-1, exon 4
5538 5555 tropomyosin T class Ic beta-1, exon 5
6722 6733 tropomyosin T class Ic beta-1, exon 6
6935 6949 tropomyosin T class Ic beta-1, exon 7
9703 9721 tropomyosin T class Ic beta-1, exon 8
11980 12025 tropomyosin T class Ic beta-1, exon 9
12591 12707 tropomyosin T class Ic beta-1, exon 10
12795 12872 tropomyosin T class Ic beta-1, exon 11
13082 13195 tropomyosin T class Ic beta-1, exon 12
13295 13404 tropomyosin T class Ic beta-1, exon 13
13515 13605 tropomyosin T class Ic beta-1, exon 14
15251 15291 tropomyosin T class Ic beta-1, exon 15
16666 16720 tropomyosin T class Ic beta-1, exon 16
pept 3146 3162 tropomyosin T class Ic beta-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ic beta-2, exon 3
4313 4330 tropomyosin T class Ic beta-2, exon 4
6722 6733 tropomyosin T class Ic beta-2, exon 5
6935 6949 tropomyosin T class Ic beta-2, exon 6
9703 9721 tropomyosin T class Ic beta-2, exon 7
11980 12025 tropomyosin T class Ic beta-2, exon 8
12591 12707 tropomyosin T class Ic beta-2, exon 9
12795 12872 tropomyosin T class Ic beta-2, exon 10
13082 13195 tropomyosin T class Ic beta-2, exon 11
13295 13404 tropomyosin T class Ic beta-2, exon 12
13515 13605 tropomyosin T class Ic beta-2, exon 13
15251 15291 tropomyosin T class Ic beta-2, exon 14
16666 16720 tropomyosin T class Ic beta-2, exon 15
pept 3146 3162 tropomyosin T class Ic beta-3, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ic beta-3, exon 3
5538 5555 tropomyosin T class Ic beta-3, exon 4
6722 6733 tropomyosin T class Ic beta-3, exon 5
6935 6949 tropomyosin T class Ic beta-3, exon 6
9703 9721 tropomyosin T class Ic beta-3, exon 7
11980 12025 tropomyosin T class Ic beta-3, exon 8
12591 12707 tropomyosin T class Ic beta-3, exon 9
12795 12872 tropomyosin T class Ic beta-3, exon 10
13082 13195 tropomyosin T class Ic beta-3, exon 11
13295 13404 tropomyosin T class Ic beta-3, exon 12
13515 13605 tropomyosin T class Ic beta-3, exon 13
15251 15291 tropomyosin T class Ic beta-3, exon 14
16666 16720 tropomyosin T class Ic beta-3, exon 15
pept 3146 3162 tropomyosin T class Ic beta-4, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class Ic beta-4, exon 3
6722 6733 tropomyosin T class Ic beta-4, exon 4
6935 6949 tropomyosin T class Ic beta-4, exon 5
9703 9721 tropomyosin T class Ic beta-4, exon 6
11980 12025 tropomyosin T class Ic beta-4, exon 7
12591 12707 tropomyosin T class Ic beta-4, exon 8
12795 12872 tropomyosin T class Ic beta-4, exon 9
13082 13195 tropomyosin T class Ic beta-4, exon 10
13295 13404 tropomyosin T class Ic beta-4, exon 11
13515 13605 tropomyosin T class Ic beta-4, exon 12
15251 15291 tropomyosin T class Ic beta-4, exon 13
16666 16720 tropomyosin T class Ic beta-4, exon 14
pept 3146 3162 tropomyosin T class IIa alpha-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIa alpha-1, exon 3
4313 4330 tropomyosin T class IIa alpha-1, exon 4
5538 5555 tropomyosin T class IIa alpha-1, exon 5
5737 5751 tropomyosin T class IIa alpha-1, exon 6
6935 6949 tropomyosin T class IIa alpha-1, exon 7
9703 9721 tropomyosin T class IIa alpha-1, exon 8
11980 12025 tropomyosin T class IIa alpha-1, exon 9
12591 12707 tropomyosin T class IIa alpha-1, exon 10
12795 12872 tropomyosin T class IIa alpha-1, exon 11
13082 13195 tropomyosin T class IIa alpha-1, exon 12
13295 13404 tropomyosin T class IIa alpha-1, exon 13
13515 13605 tropomyosin T class IIa alpha-1, exon 14
14613 14653 tropomyosin T class IIa alpha-1, exon 15
16666 16720 tropomyosin T class IIa alpha-1, exon 16
pept 3146 3162 tropomyosin T class IIa alpha-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIa alpha-2, exon 3
4313 4330 tropomyosin T class IIa alpha-2, exon 4
5737 5751 tropomyosin T class IIa alpha-2, exon 5
6935 6949 tropomyosin T class IIa alpha-2, exon 6
9703 9721 tropomyosin T class IIa alpha-2, exon 7
11980 12025 tropomyosin T class IIa alpha-2, exon 8
12591 12707 tropomyosin T class IIa alpha-2, exon 9
12795 12872 tropomyosin T class IIa alpha-2, exon 10
13082 13195 tropomyosin T class IIa alpha-2, exon 11
13295 13404 tropomyosin T class IIa alpha-2, exon 12
13515 13605 tropomyosin T class IIa alpha-2, exon 13
14613 14653 tropomyosin T class IIa alpha-2, exon 14
16666 16720 tropomyosin T class IIa alpha-2, exon 15
pept 3146 3162 tropomyosin T class IIa alpha-3, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIa alpha-3, exon 3
5538 5555 tropomyosin T class IIa alpha-3, exon 4
5737 5751 tropomyosin T class IIa alpha-3, exon 5
6935 6949 tropomyosin T class IIa alpha-3, exon 6
9703 9721 tropomyosin T class IIa alpha-3, exon 7
11980 12025 tropomyosin T class IIa alpha-3, exon 8
12591 12707 tropomyosin T class IIa alpha-3, exon 9
12795 12872 tropomyosin T class IIa alpha-3, exon 10
13082 13195 tropomyosin T class IIa alpha-3, exon 11
13295 13404 tropomyosin T class IIa alpha-3, exon 12
13515 13605 tropomyosin T class IIa alpha-3, exon 13
14613 14653 tropomyosin T class IIa alpha-3, exon 14
16666 16720 tropomyosin T class IIa alpha-3, exon 15
pept 3146 3162 tropomyosin T class IIa alpha-4, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIa alpha-4, exon 3
5737 5751 tropomyosin T class IIa alpha-4, exon 4
6935 6949 tropomyosin T class IIa alpha-4, exon 5
9703 9721 tropomyosin T class IIa alpha-4, exon 6
11980 12025 tropomyosin T class IIa alpha-4, exon 7
12591 12707 tropomyosin T class IIa alpha-4, exon 8
12795 12872 tropomyosin T class IIa alpha-4, exon 9
13082 13195 tropomyosin T class IIa alpha-4, exon 10
13295 13404 tropomyosin T class IIa alpha-4, exon 11
13515 13605 tropomyosin T class IIa alpha-4, exon 12
14613 14653 tropomyosin T class IIa alpha-4, exon 13
16666 16720 tropomyosin T class IIa alpha-4, exon 14
pept 3146 3162 tropomyosin T class IIa beta-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIa beta-1, exon 3
4313 4330 tropomyosin T class IIa beta-1, exon 4
5538 5555 tropomyosin T class IIa beta-1, exon 5
5737 5751 tropomyosin T class IIa beta-1, exon 6
6935 6949 tropomyosin T class IIa beta-1, exon 7
9703 9721 tropomyosin T class IIa beta-1, exon 8
11980 12025 tropomyosin T class IIa beta-1, exon 9
12591 12707 tropomyosin T class IIa beta-1, exon 10
12795 12872 tropomyosin T class IIa beta-1, exon 11
13082 13195 tropomyosin T class IIa beta-1, exon 12
13295 13404 tropomyosin T class IIa beta-1, exon 13
13515 13605 tropomyosin T class IIa beta-1, exon 14
15251 15291 tropomyosin T class IIa beta-1, exon 15
16666 16720 tropomyosin T class IIa beta-1, exon 16
pept 3146 3162 tropomyosin T class IIa beta-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIa beta-2, exon 3
4313 4330 tropomyosin T class IIa beta-2, exon 4
5737 5751 tropomyosin T class IIa beta-2, exon 5
6935 6949 tropomyosin T class IIa beta-2, exon 6
9703 9721 tropomyosin T class IIa beta-2, exon 7
11980 12025 tropomyosin T class IIa beta-2, exon 8
12591 12707 tropomyosin T class IIa beta-2, exon 9
12795 12872 tropomyosin T class IIa beta-2, exon 10
13082 13195 tropomyosin T class IIa beta-2, exon 11
13295 13404 tropomyosin T class IIa beta-2, exon 12
13515 13605 tropomyosin T class IIa beta-2, exon 13
15251 15291 tropomyosin T class IIa beta-2, exon 14
16666 16720 tropomyosin T class IIa beta-2, exon 15
pept 3146 3162 tropomyosin T class IIa beta-3, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIa beta-3, exon 3
5538 5555 tropomyosin T class IIa beta-3, exon 4
5737 5751 tropomyosin T class IIa beta-3, exon 5
6935 6949 tropomyosin T class IIa beta-3, exon 6
9703 9721 tropomyosin T class IIa beta-3, exon 7
11980 12025 tropomyosin T class IIa beta-3, exon 8
12591 12707 tropomyosin T class IIa beta-3, exon 9
12795 12872 tropomyosin T class IIa beta-3, exon 10
13082 13195 tropomyosin T class IIa beta-3, exon 11
13295 13404 tropomyosin T class IIa beta-3, exon 12
13515 13605 tropomyosin T class IIa beta-3, exon 13
15251 15291 tropomyosin T class IIa beta-3, exon 14
16666 16720 tropomyosin T class IIa beta-3, exon 15
pept 3146 3162 tropomyosin T class IIa beta-4, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIa beta-4, exon 3
5737 5751 tropomyosin T class IIa beta-4, exon 4
6935 6949 tropomyosin T class IIa beta-4, exon 5
9703 9721 tropomyosin T class IIa beta-4, exon 6
11980 12025 tropomyosin T class IIa beta-4, exon 7
12591 12707 tropomyosin T class IIa beta-4, exon 8
12795 12872 tropomyosin T class IIa beta-4, exon 9
13082 13195 tropomyosin T class IIa beta-4, exon 10
13295 13404 tropomyosin T class IIa beta-4, exon 11
13515 13605 tropomyosin T class IIa beta-4, exon 12
15251 15291 tropomyosin T class IIa beta-4, exon 13
16666 16720 tropomyosin T class IIa beta-4, exon 14
pept 3146 3162 tropomyosin T class IIIa alpha, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIIa alpha, exon 3
4313 4330 tropomyosin T class IIIa alpha, exon 4
5538 5555 tropomyosin T class IIIa alpha, exon 5
9703 9721 tropomyosin T class IIIa alpha, exon 6
11980 12025 tropomyosin T class IIIa alpha, exon 7
12591 12707 tropomyosin T class IIIa alpha, exon 8
12795 12872 tropomyosin T class IIIa alpha, exon 9
13082 13195 tropomyosin T class IIIa alpha, exon 10
13295 13404 tropomyosin T class IIIa alpha, exon 11
13515 13605 tropomyosin T class IIIa alpha, exon 12
14613 14653 tropomyosin T class IIIa alpha, exon 13
16666 16720 tropomyosin T class IIIa alpha, exon 14
pept 3146 3162 tropomyosin T class IIIb alpha, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIIb alpha, exon 3
5538 5555 tropomyosin T class IIIb alpha, exon 4
9703 9721 tropomyosin T class IIIb alpha, exon 5
11980 12025 tropomyosin T class IIIb alpha, exon 6
12591 12707 tropomyosin T class IIIb alpha, exon 7
12795 12872 tropomyosin T class IIIb alpha, exon 8
13082 13195 tropomyosin T class IIIb alpha, exon 9
13295 13404 tropomyosin T class IIIb alpha, exon 10
13515 13605 tropomyosin T class IIIb alpha, exon 11
14613 14653 tropomyosin T class IIIb alpha, exon 12
16666 16720 tropomyosin T class IIIb alpha, exon 13
pept 3146 3162 tropomyosin T class IIIa beta, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIIa beta, exon 3
4313 4330 tropomyosin T class IIIa beta, exon 4
5538 5555 tropomyosin T class IIIa beta, exon 5
9703 9721 tropomyosin T class IIIa beta, exon 6
11980 12025 tropomyosin T class IIIa beta, exon 7
12591 12707 tropomyosin T class IIIa beta, exon 8
12795 12872 tropomyosin T class IIIa beta, exon 9
13082 13195 tropomyosin T class IIIa beta, exon 10
13295 13404 tropomyosin T class IIIa beta, exon 11
13515 13605 tropomyosin T class IIIa beta, exon 12
15251 15291 tropomyosin T class IIIa beta, exon 13
16666 16720 tropomyosin T class IIIa beta, exon 14
pept 3146 3162 tropomyosin T class IIIb beta, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IIIb beta, exon 3
5538 5555 tropomyosin T class IIIb beta, exon 4
9703 9721 tropomyosin T class IIIb beta, exon 5
11980 12025 tropomyosin T class IIIb beta, exon 6
12591 12707 tropomyosin T class IIIb beta, exon 7
12795 12872 tropomyosin T class IIIb beta, exon 8
13082 13195 tropomyosin T class IIIb beta, exon 9
13295 13404 tropomyosin T class IIIb beta, exon 10
13515 13605 tropomyosin T class IIIb beta, exon 11
15251 15291 tropomyosin T class IIIb beta, exon 12
16666 16720 tropomyosin T class IIIb beta, exon 13
pept 3146 3162 tropomyosin T class IVa alpha, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IVa alpha, exon 3
5538 5555 tropomyosin T class IVa alpha, exon 4
5737 5751 tropomyosin T class IVa alpha, exon 5
6722 6733 tropomyosin T class IVa alpha, exon 6
9703 9721 tropomyosin T class IVa alpha, exon 7
11980 12025 tropomyosin T class IVa alpha, exon 8
12591 12707 tropomyosin T class IVa alpha, exon 9
12795 12872 tropomyosin T class IVa alpha, exon 10
13082 13195 tropomyosin T class IVa alpha, exon 11
13295 13404 tropomyosin T class IVa alpha, exon 12
13515 13605 tropomyosin T class IVa alpha, exon 13
14613 14653 tropomyosin T class IVa alpha, exon 14
16666 16720 tropomyosin T class IVa alpha, exon 15
pept 3146 3162 tropomyosin T class IVb alpha, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IVb alpha, exon 3
4313 4330 tropomyosin T class IVb alpha, exon 4
5538 5555 tropomyosin T class IVb alpha, exon 5
5737 5751 tropomyosin T class IVb alpha, exon 6
6722 6733 tropomyosin T class IVb alpha, exon 7
9703 9721 tropomyosin T class IVb alpha, exon 8
11980 12025 tropomyosin T class IVb alpha, exon 9
12591 12707 tropomyosin T class IVb alpha, exon 10
12795 12872 tropomyosin T class IVb alpha, exon 11
13082 13195 tropomyosin T class IVb alpha, exon 12
13295 13404 tropomyosin T class IVb alpha, exon 13
13515 13605 tropomyosin T class IVb alpha, exon 14
14613 14653 tropomyosin T class IVb alpha, exon 15
16666 16720 tropomyosin T class IVb alpha, exon 16
pept 3146 3162 tropomyosin T class IVc alpha-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IVc alpha-1, exon 3
4313 4330 tropomyosin T class IVc alpha-1, exon 4
5737 5751 tropomyosin T class IVc alpha-1, exon 5
6722 6733 tropomyosin T class IVc alpha-1, exon 6
9703 9721 tropomyosin T class IVc alpha-1, exon 7
11980 12025 tropomyosin T class IVc alpha-1, exon 8
12591 12707 tropomyosin T class IVc alpha-1, exon 9
12795 12872 tropomyosin T class IVc alpha-1, exon 10
13082 13195 tropomyosin T class IVc alpha-1, exon 11
13295 13404 tropomyosin T class IVc alpha-1, exon 12
13515 13605 tropomyosin T class IVc alpha-1, exon 13
14613 14653 tropomyosin T class IVc alpha-1, exon 14
16666 16720 tropomyosin T class IVc alpha-1, exon 15
pept 3146 3162 tropomyosin T class IVc alpha-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IVc alpha-2, exon 3
5737 5751 tropomyosin T class IVc alpha-2, exon 4
6722 6733 tropomyosin T class IVc alpha-2, exon 5
9703 9721 tropomyosin T class IVc alpha-2, exon 6
11980 12025 tropomyosin T class IVc alpha-2, exon 7
12591 12707 tropomyosin T class IVc alpha-2, exon 8
12795 12872 tropomyosin T class IVc alpha-2, exon 9
13082 13195 tropomyosin T class IVc alpha-2, exon 10
13295 13404 tropomyosin T class IVc alpha-2, exon 11
13515 13605 tropomyosin T class IVc alpha-2, exon 12
14613 14653 tropomyosin T class IVc alpha-2, exon 13
16666 16720 tropomyosin T class IVc alpha-2, exon 14
pept 3146 3162 tropomyosin T class IVd alpha-1, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IVd alpha-1, exon 3
4313 4330 tropomyosin T class IVd alpha-1, exon 4
5538 5555 tropomyosin T class IVd alpha-1, exon 5
6722 6733 tropomyosin T class IVd alpha-1, exon 6
9703 9721 tropomyosin T class IVd alpha-1, exon 7
11980 12025 tropomyosin T class IVd alpha-1, exon 8
12591 12707 tropomyosin T class IVd alpha-1, exon 9
12795 12872 tropomyosin T class IVd alpha-1, exon 10
13082 13195 tropomyosin T class IVd alpha-1, exon 11
13295 13404 tropomyosin T class IVd alpha-1, exon 12
13515 13605 tropomyosin T class IVd alpha-1, exon 13
14613 14653 tropomyosin T class IVd alpha-1, exon 14
16666 16720 tropomyosin T class IVd alpha-1, exon 15
pept 3146 3162 tropomyosin T class IVd alpha-2, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IVd alpha-2, exon 3
4313 4330 tropomyosin T class IVd alpha-2, exon 4
6722 6733 tropomyosin T class IVd alpha-2, exon 5
9703 9721 tropomyosin T class IVd alpha-2, exon 6
11980 12025 tropomyosin T class IVd alpha-2, exon 7
12591 12707 tropomyosin T class IVd alpha-2, exon 8
12795 12872 tropomyosin T class IVd alpha-2, exon 9
13082 13195 tropomyosin T class IVd alpha-2, exon 10
13295 13404 tropomyosin T class IVd alpha-2, exon 11
13515 13605 tropomyosin T class IVd alpha-2, exon 12
14613 14653 tropomyosin T class IVd alpha-2, exon 13
16666 16720 tropomyosin T class IVd alpha-2, exon 14
pept 3146 3162 tropomyosin T class IVd alpha-3, exon 2 (first
expressed exon)
3344 3357 tropomyosin T class IVd alpha-3, exon 3
5538 5555 tropomyosin T class IVd alpha-3, exon 4
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS TRSSTOBAA 360 bp ss-RNA VRL 14-MAR-1990
DEFINITION Tobacco ringspot virus (TobRV) mRNA.
ACCESSION M17439
KEYWORDS .
SOURCE Tobacco ringspot virus (isolate 62L), cDNA to viral RNA.
ORGANISM Tobacco ringspot virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Nepovirus.
REFERENCE 1 (bases 1 to 360)
AUTHORS Buzayan,J.M., McNinch,J.S., Schneider,I.R. and Bruening,G.
TITLE A nucleotide sequence rearrangement distinguishes two isolates of
satellite tobacco ringspot virus RNA
JOURNAL Virology 160, 95-99 (1987)
STANDARD simple staff_review
BASE COUNT 70 a 90 c 110 g 90 t
ORIGIN
1 accggatgtg ctttccggtc tgatgagtcc gtgaggacga aacaggactg tcaggtggcc
61 gaaagccacc acgtaaacta atgaaccgtg ctgcgtagcg ttgtgcctcg tggaggtggg
121 atgccacctc gtggagcagc cttcgtgtgg gcgcggcggt gtagttagtc aaggcgtacc
181 aggtaatata ccacaacgtg tgtttctctg gttgacttct ctgtttgttg tgtcattagg
241 tcccgggtct cgcattagcg gcgacggggt attctcttac gacatggaag tttgagagac
301 cgcgcctcta cactatgcgc ggccggggcg aatccaaata ttctagcccg ataccctgtc
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATMYHOA 260 bp ss-mRNA ROD 14-MAR-1990
DEFINITION Rat cardiac beta-myosin heavy chain mRNA, 3' end, clone pCMHC-21.
ACCESSION M32697
KEYWORDS beta-myosin; myosin; myosin heavy chain.
SOURCE Rat (strain Wistar) three months-old ventricle, cDNA to mRNA, clone
pCMHC-21.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 260)
AUTHORS Mahdavi,V., Lompre,A.M., Chambers,A.P. and Nadal-Ginard,B.
TITLE Cardiac myosin heavy chain isozymic transitions during development
and under pathological conditions are regulated at the level of
mRNA availability
JOURNAL Eur. Heart J. 5, 181-191 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 204 beta-myosin heavy chain (AA at 1)
BASE COUNT 77 a 69 c 79 g 35 t
ORIGIN
1 ctgcagttga aggtgaaggc ctacaagcgc caggctgaag aggcggagga acaggccaac
61 accaacctgt ccaagttccg caaggtgcag cacgagctgg atgaggcaga ggagagggcg
121 gacattgccg agtcccaggt caacaagctg cgggccaaga gccgtgacat tggcgccaag
181 cagaaaatcc acgatgagga ataacctgtc cagcagaaag agcctcgccg ttgccatccc
241 acaataaata cgaatgttcg
//
LOCUS RATMYHOB 303 bp ss-mRNA ROD 14-MAR-1990
DEFINITION Rat cardiac beta-myosin heavy chain mRNA, 3' end, clone pCMHC-5.
ACCESSION M32698
KEYWORDS beta-myosin; myosin; myosin heavy chain.
SOURCE Rat (strain Wistar) three months-old ventricle, cDNA to mRNA, clone
pCMHC-5.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 303)
AUTHORS Mahdavi,V., Lompre,A.M., Chambers,A.P. and Nadal-Ginard,B.
TITLE Cardiac myosin heavy chain isozymic transitions during development
and under pathological conditions are regulated at the level of
mRNA availability
JOURNAL Eur. Heart J. 5, 181-191 (1984)
STANDARD simle staff_review
FEATURES from to/span description
pept < 1 198 beta-myosin heavy chain (AA at 1)
BASE COUNT 87 a 80 c 89 g 47 t
ORIGIN
1 ctgcagttga aggtgaaggc ctacaagcgc caggctgaag aggcggagga acaggccaac
61 accaacctgt ccaagttccg caaggtgcag cacgagctgg atgaggcaga ggagagggcg
121 gacattgccg agtcccaggt caacaagctg cgggccaaga gccgtgacat tggcgccaag
181 ggcctgaatg aagagtagat cttgctctac ccaaccctaa ggatgcctgt gaagccctga
241 gacctggagc ctttgaaaag caccttcagg cagaaacaca ataaagcaat tttccttcaa
301 gcc
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS HUMP42LA 2465 bp ss-mRNA PRI 11-JAN-1990
DEFINITION Human erythrocyte membrane protein 4.2 (HUMP4.2L) mRNA,
complete cds.
ACCESSION M30647
KEYWORDS .
SOURCE Human sickle cell patient reticulocyte, cDNA to mRNA, clones 7 and
12.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2465)
AUTHORS Sung,L.A., Chien,S., Chang,L.-S., Lambert,K., Bliss,S.A.,
Bouhassira,E.E., Nagel,R.L., Schwartz,R.S. and Rybicki,A.C.
TITLE Molecular cloning of human protein 4.2: A major component of the
red erythrocyte membrane
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 955-959 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.Sung, 12-DEC-1989, for release after publication.
FEATURES from to/span description
pept 228 2393 protein 4.2 (HUMP4.2L) (put.)
mRNA < 1 2465 protein 4.2 (HUMP4.2L) mRNA
BASE COUNT 603 a 661 c 697 g 504 t
ORIGIN
1 gaactggtcc aggagatacc agcagagagg gagtaggaga gaagaaacat gtcagggtgc
61 tcacaggagt agtgggggga ggttttgcta tttccagatt cttaagccaa caaaagtgcc
121 ttcatatttt ctgtctggaa gacagaaagc ccagaaggag cccagaagca acagtttgag
181 agaggcgctt tctgcggcca agtggataag aggagcggcc tgcaaccatg ggacagggtg
241 agccaagcca gcgctcgaca gggcttgctg gactgtatgc agcccccgca gcatcacctg
301 ttttcattaa aggaagtggg atggatgccc tgggtatcaa gagctgtgac tttcaggcag
361 caagaaacaa tgaggagcac cacaccaagg ccctcagctc ccggcgcctc tttgtgagga
421 gggggcagcc cttcaccatc atcctgtact tccgcgctcc agtccgtgca tttctgcctg
481 ccctgaagaa ggtggccctc actgcacaaa ctggagagca gccttccaag atcaacagga
541 cccaagccac attcccaatt tccagtctgg gggaccgaaa gtggtggagt gcagtggtgg
601 aggagagaga tgcccagtcc tggaccatct ctgtgaccac acctgcagac gctgtcattg
661 gccactactc gcttctgctg caggtctcag gcaggaagca actcctcttg ggtcagttca
721 cactgctttt taacccctgg aatagagagg atgctgtgtt cctgaagaat gaggctcagc
781 gcatggagta cttgttgaac cagaatggtc tcatctacct gggtacagct gactgcatcc
841 aggcagagtc ctgggacttt ggccagttcg agggggatgt cattgacctc agcctgcgct
901 tgctgagcaa ggacaagcag gtagagaagt ggagccagcc ggtgcacgtg gcccgtgtgt
961 tgggtgcctt gctgcatttt ctcaaggagc agagggtcct gcccaccccg cagacccagg
1021 ccacccagga aggggccttg ctgaacaagc gccggggcag cgtgcccatc ctgcggcagt
1081 ggctcaccgg ccgaggccga cctgtgtatg atggccaggc ctgggtgttg gctgctgttg
1141 cttgcacagt gctgcgatgc ctgggaatcc ctgcccgcgt ggtgaccacg tttgcctcag
1201 cacagggcac cggtgggcgt cttctcatag atgaatacta taatgaggag ggacttcaga
1261 acggagaagg ccagagaggc agaatctgga tcttccagac ttccacagag tgctggatga
1321 agcgcggcct gccttgccag ggttatgatg gatggcagat tctgcaccca agtgctccta
1381 atggaggtgg agtcctgggg tcctgtgatc tggtgccggt cagagcagtc aaggagggga
1441 cgctggggct gaccccagca gtgtcagacc tttttgctgc cataaatgcc tcatgtgtgg
1501 tctggaagtg ctgtgaggat gggacactgg agttgactga ctccaacaca aagtatgttg
1561 gcaacaacat cagcaccaag ggtgtgggca gtgaccgctg cgaggacatc actcagaact
1621 acaagtatcc tgaagggtct cttcaggaaa aagaggtgct ggagagagtc gagaaagaga
1681 aaatggaacg tgagaaagac aacggcatcc gtcctcccag tctcgagact gccagtcctc
1741 tgtacctgct cttgaaagca cccagctccc tacccctgag aggggatgcc cagatctcag
1801 tgacgctggt taatcacagt gagcaggaga aggcagtgca gctggcaatt ggggtccagg
1861 ctgtacacta caacggtgtc cttgctgcca agctctggag gaagaagctg cacctcacgc
1921 tcagtgccaa cctggaaaag ataataacca tcggcctgtt cttctccaat tttgagcgaa
1981 acccacccga gaacaccttc cttagactca ccgccatggc aacacactct gaatccaacc
2041 ttagctgctt tgctcaggaa gacattgcca tttgtagacc acaccttgcc atcaagatgc
2101 cagagaaagc agagcagtat caacccctca cagcctcagt cagcctccag aactccctag
2161 atgcccccat ggaggactgt gtgatctcca tcctgggaag ggggctcatt cacagagaga
2221 ggagctacag attccgttca gtgtggcctg aaaacaccat gtgtgccaag ttccagttca
2281 cgccaacaca tgtggggctc cagagactca ctgtggaagt ggactgcaac atgttccaga
2341 acctaaccaa ctataaaagc gtcaccgtgg tagcccctga actatcagct taaacttcca
2401 gctctatcac cactctcctg ccaacccttg ttctacaatc taaaccaaac atgtgctagg
2461 aagag
//
LOCUS HUMP42SA 2375 bp ss-mRNA PRI 11-JAN-1990
DEFINITION Human erythrocyte membrane protein 4.2 (HUMP4.2S) mRNA,
complete cds.
ACCESSION M30646
KEYWORDS .
SOURCE Human sickle cell patient reticulocyte, cDNA to mRNA, clones 7 and
16.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2375)
AUTHORS Sung,L.A., Chien,S., Chang,L.-S., Lambert,K., Bliss,S.A.,
Bouhassira,E.E., Nagel,R.L., Schwartz,R.S. and Rybicki,A.C.
TITLE Molecular cloning of human protein 4.2: A major component of the
red erythrocyte membrane
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 955-959 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.Sung, 12-DEC-1989, for release after publication.
FEATURES from to/span description
pept 228 2303 protein 4.2 (HUMP4.2S) (put.)
mRNA < 1 2375 protein 4.2 mRNA
BASE COUNT 583 a 638 c 669 g 485 t
ORIGIN
1 gaactggtcc aggagatacc agcagagagg gagtaggaga gaagaaacat gtcagggtgc
61 tcacaggagt agtgggggga ggttttgcta tttccagatt cttaagccaa caaaagtgcc
121 ttcatatttt ctgtctggaa gacagaaagc ccagaaggag cccagaagca acagtttgag
181 agaggcgctt tctgcggcca agtggataag aggagcggcc tgcaaccatg ggacaggccc
241 tgggtatcaa gagctgtgac tttcaggcag caagaaacaa tgaggagcac cacaccaagg
301 ccctcagctc ccggcgcctc tttgtgagga gggggcagcc cttcaccatc atcctgtact
361 tccgcgctcc agtccgtgca tttctgcctg ccctgaagaa ggtggccctc actgcacaaa
421 ctggagagca gccttccaag atcaacagga cccaagccac attcccaatt tccagtctgg
481 gggaccgaaa gtggtggagt gcagtggtgg aggagagaga tgcccagtcc tggaccatct
541 ctgtgaccac acctgcagac gctgtcattg gccactactc gcttctgctg caggtctcag
601 gcaggaagca actcctcttg ggtcagttca cactgctttt taacccctgg aatagagagg
661 atgctgtgtt cctgaagaat gaggctcagc gcatggagta cttgttgaac cagaatggtc
721 tcatctacct gggtacagct gactgcatcc aggcagagtc ctgggacttt ggccagttcg
781 agggggatgt cattgacctc agcctgcgct tgctgagcaa ggacaagcag gtagagaagt
841 ggagccagcc ggtgcacgtg gcccgtgtgt tgggtgcctt gctgcatttt ctcaaggagc
901 agagggtcct gcccaccccg cagacccagg ccacccagga aggggccttg ctgaacaagc
961 gccggggcag cgtgcccatc ctgcggcagt ggctcaccgg ccgaggccga cctgtgtatg
1021 atggccaggc ctgggtgttg gctgctgttg cttgcacagt gctgcgatgc ctgggaatcc
1081 ctgcccgcgt ggtgaccacg tttgcctcag cacagggcac cggtgggcgt cttctcatag
1141 atgaatacta taatgaggag ggacttcaga acggagaagg ccagagaggc agaatctgga
1201 tcttccagac ttccacagag tgctggatga agcgcggcct gccttgccag ggttatgatg
1261 gatggcagat tctgcaccca agtgctccta atggaggtgg agtcctgggg tcctgtgatc
1321 tggtgccggt cagagcagtc aaggagggga cgctggggct gaccccagca gtgtcagacc
1381 tttttgctgc cataaatgcc tcatgtgtgg tctggaagtg ctgtgaggat gggacactgg
1441 agttgactga ctccaacaca aagtatgttg gcaacaacat cagcaccaag ggtgtgggca
1501 gtgaccgctg cgaggacatc actcagaact acaagtatcc tgaagggtct cttcaggaaa
1561 aagaggtgct ggagagagtc gagaaagaga aaatggaacg tgagaaagac aacggcatcc
1621 gtcctcccag tctcgagact gccagtcctc tgtacctgct cttgaaagca cccagctccc
1681 tacccctgag aggggatgcc cagatctcag tgacgctggt taatcacagt gagcaggaga
1741 aggcagtgca gctggcaatt ggggtccagg ctgtacacta caacggtgtc cttgctgcca
1801 agctctggag gaagaagctg cacctcacgc tcagtgccaa cctggaaaag ataataacca
1861 tcggcctgtt cttctccaat tttgagcgaa acccacccga gaacaccttc cttagactca
1921 ccgccatggc aacacactct gaatccaacc ttagctgctt tgctcaggaa gacattgcca
1981 tttgtagacc acaccttgcc atcaagatgc cagagaaagc agagcagtat caacccctca
2041 cagcctcagt cagcctccag aactccctag atgcccccat ggaggactgt gtgatctcca
2101 tcctgggaag ggggctcatt cacagagaga ggagctacag attccgttca gtgtggcctg
2161 aaaacaccat gtgtgccaag ttccagttca cgccaacaca tgtggggctc cagagactca
2221 ctgtggaagt ggactgcaac atgttccaga acctaaccaa ctataaaagc gtcaccgtgg
2281 tagcccctga actatcagct taaacttcca gctctatcac cactctcctg ccaacccttg
2341 ttctacaatc taaaccaaac atgtgctagg aagag
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATFERA1 563 bp ds-DNA ROD 14-MAR-1990
DEFINITION Rat ferritin heavy subunit gene, partial exon X.
ACCESSION M18051
KEYWORDS ferritin.
SEGMENT 1 of 3
SOURCE Rat (strain Fischer) liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 563)
AUTHORS Murray,M.T., White,K. and Munro,H.N.
TITLE Conservation of ferritin heavy subunit gene structure: Implications
for the regulation of ferritin gene expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7438-7442 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept / 349 + 459 ferritin heavy subunit, exon X
pre-msg < 1 > 563 ferritin H mRNA and intron
IVS 460 > 563 ferritin H intron A
BASE COUNT 87 a 204 c 173 g 99 t
ORIGIN
1 agctcagaga cccaagagcc gcctcacaat cacacaggct cctccccgcc cacgcactgc
61 tggcttgggc aacacgccta caggaagagg cggggctggg cggcccaccg cgctgattgg
121 ccggagcgcg cctgacgcag gatcccgcta taaagtgcgg cccgctggtc cctacgccag
181 acgttctcgc ccagagtcgc cgcggtttcc tgcttcaaca gtgcttgaac ggaacccggt
241 gctcgacccc tccgaccccc gtccggccgc tttgagcctg agccctttgc aacttcgtcg
301 ctccgccgct ccagcgtcgc ctccgcgcct cgtccagccg ccatcatgac caccgcgtct
361 ccctcgcaag tgcgccagaa ctaccaccag gactcggagg ctgccatcaa ccgccagatc
421 aacctggagt tgtatgcctc ctacgtctat ctgtccatgg tgagtgcggc ctggcctttg
481 cgggggcgga aagagggtgc ggcctggcct cccttgggcc acttggtgag ctggcggagg
541 gtgggttggg gcgtggctgc ggg
//
LOCUS RATFERA2 232 bp ds-DNA ROD 14-MAR-1990
DEFINITION Rat ferritin heavy subunit gene, exon X+1.
ACCESSION M18052
KEYWORDS ferritin.
SEGMENT 2 of 3
SOURCE Rat (strain Fischer) three months-old liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 232)
AUTHORS Murray,M.T., White,K. and Munro,H.N.
TITLE Conservation of ferritin heavy subunit gene structure: Implications
for the regulation of ferritin gene expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7438-7442 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept + 51 + 197 ferritin heavy subunit, exon X+1
pre-msg < 1 > 232 ferritin H mRNA and introns
IVS < 1 50 ferritin H intron A
IVS 198 > 232 ferritin H intron B
BASE COUNT 66 a 48 c 57 g 61 t
ORIGIN About 1 kb after segment 1.
1 gcatctgcct tgctgtgggg atcaataaca aatacccttt ccactttcag tcttgttatt
61 ttgaccggga tgatgtggcc ctgaagaact ttgccaaata ctttctccat caatctcatg
121 aagagaggga acatgctgag aaactgatga agctgcagaa ccagcgaggt ggacgaatct
181 tcctgcagga tatcaaggta agtagactat gggactgcgt taaatgagca gt
//
LOCUS RATFERA3 764 bp ds-DNA ROD 14-MAR-1990
DEFINITION Rat ferritin heavy subunit gene, last two exons.
ACCESSION M18053
KEYWORDS ferritin.
SEGMENT 3 of 3
SOURCE Rat (strain Fischer) three months-old liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 764)
AUTHORS Murray,M.T., White,K. and Munro,H.N.
TITLE Conservation of ferritin heavy subunit gene structure: Implications
for the regulation of ferritin gene expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7438-7442 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept + 34 159 ferritin heavy subunit, exon X+2
252 413 ferritin heavy subunit, exon X+3
pre-msg < 1 566 ferritin H mRNA and introns
IVS < 1 33 ferritin H intron B
IVS 160 251 ferritin H intron C
BASE COUNT 209 a 166 c 188 g 201 t
ORIGIN About 150 bp after segment 2.
1 ctgcagatga attgacatgt ttctttgatt cagaaacctg accgtgatga ctgggagagc
61 gggctgaatg caatgaggtg tgcactgcac ttggaaaaga gtgtgaatca gtcactactg
121 gaacttcaca aactggctac tgacaagaat gatccccacg tgagtatcag aaacacgggg
181 tgagtggaga tgatttgcca cagggcttgg gagagctgac cagtaaccct gtcccatgtt
241 ctctttccta gttatgtgac ttcattgaga cgcattacct gaatgagcag gtgaaatcca
301 ttaaagaact gggtgaccac gtgaccaact tacgcaagat gggagcccct gaatctggca
361 tggcagaata tctctttgac aagcacaccc tgggacacgg tgatgagagc taagctgacg
421 tccccaaggc catgtgactt tactggctca ctgaggcagt gcatgcatgt caggctgcct
481 ttatcttttc tataagttgc accaaaacat ctgcttaaaa gttctttaat ttgtaccatt
541 tcttcaaata aagaattttg gtacccagct cttgttgtga ttgaggatga gcgcaccagc
601 ttcccttgcg tcggctatat aaccacactg caacgcctga aagaatattt attaaactcg
661 tagttgggga aagatagtga aagacaggtg tgttcagaca ggactaagca gtcctggttc
721 tgagttacct gccagactgc catgggaaca tattcttgag tgtc
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS HUMHPRTB 56736 bp ds-DNA PRI 15-MAR-1990
DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete
cds.
ACCESSION M26434
KEYWORDS Alu repetitive element; LINE repetitive element;
hypoxanthine phosphoribosyltransferase.
SOURCE Human cell line 4X,Y (GM1202), fetal liver, and beta thalasemia
patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 56736)
AUTHORS Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J.,
Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W.
TITLE Automated DNA sequencing of the human HPRT locus
JOURNAL Genomics 6, 593-608 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Edwards, 26-JUL-1989.
FEATURES from to/span description
pept 1677 1703 hypoxanthine phosphoribosyltransferase, exon 1
/nomgen="HPRT" /map="Xq26"
/hgml_locus_uid="LE0031K"
14780 14886 hypoxanthine phosphoribosyltransferase, exon 2
16603 16786 hypoxanthine phosphoribosyltransferase, exon 3
27891 27956 hypoxanthine phosphoribosyltransferase, exon 4
31617 31634 hypoxanthine phosphoribosyltransferase, exon 5
34937 35019 hypoxanthine phosphoribosyltransferase, exon 6
39815 39861 hypoxanthine phosphoribosyltransferase, exon 7
40033 40109 hypoxanthine phosphoribosyltransferase, exon 8
41454 41501 hypoxanthine phosphoribosyltransferase, exon 9
IVS 1704 14779 HPRT intron A
IVS 14887 16602 HPRT intron B
IVS 16787 27890 HPRT intron C
IVS 27957 31616 HPRT intron D
IVS 31635 34936 HPRT intron E
IVS 35020 39814 HPRT intron F
IVS 39862 40032 HPRT intron G
IVS 40110 41453 HPRT intron H
rpt 258 41 (c) Alu repeat copy A
rpt 1012 694 (c) Alu repeat copy B
rpt 3532 3342 (c) Alu repeat copy C
rpt 3982 3343 (c) Alu repeat copy D
rpt 6232 5937 (c) Alu repeat copy E
rpt 6524 6288 (c) Alu repeat copy F
rpt 7504 7807 Alu repeat copy G
rpt 8713 8408 (c) Alu repeat copy H
rpt 9477 9169 (c) Alu repeat copy I
rpt 9699 10142 Alu repeat copy J
rpt 10407 10293 (c) Alu repeat copy K
rpt 12154 12310 Alu repeat copy L
rpt 13302 12998 (c) Alu repeat copy M
rpt 13638 13314 (c) Alu repeat copy N
rpt 14238 13923 (c) Alu repeat copy O
rpt 14610 14239 (c) Alu repeat copy P
rpt 16090 15843 (c) Alu repeat copy Q
rpt 16518 16293 (c) Alu repeat copy R
rpt 18783 18439 (c) Alu repeat copy S
rpt 18962 19084 Alu repeat copy T
rpt 19397 19104 (c) Alu repeat copy U
rpt 20064 19778 (c) Alu repeat copy V
rpt 20486 20636 Alu repeat copy W
rpt 20990 20696 (c) Alu repeat copy X
rpt 21293 20996 (c) Alu repeat copy Y
rpt 22171 21878 (c) Alu repeat copy Z
rpt 23772 23460 (c) Alu repeat copy AA
rpt 25310 25426 Alu repeat copy AB
rpt 26388 26279 (c) Alu repeat copy AC
rpt 29863 29560 (c) Alu repeat copy AD
rpt 33618 33148 (c) Alu repeat copy AE
rpt 36575 36645 Alu repeat copy AF
rpt 38764 39083 Alu repeat copy AG
rpt 39091 39388 Alu repeat copy AH
rpt 40622 40925 Alu repeat copy AI
rpt 40952 41262 Alu repeat copy AJ
rpt 42640 42936 Alu repeat copy AK
rpt 44776 44486 (c) Alu repeat copy AL
rpt 46627 46305 (c) Alu repeat copy AM
rpt 47520 47763 Alu repeat copy AN
rpt 49257 49551 Alu repeat copy AO
rpt 49705 49993 Alu repeat copy AP
rpt 50002 50312 Alu repeat copy AQ
rpt 51066 50774 (c) Alu repeat copy AR
rpt 52276 51978 (c) Alu repeat copy AS
rpt 52407 52674 Alu repeat copy AT
rpt 54653 54551 (c) Alu repeat copy AU
rpt 55465 55164 (c) Alu repeat copy AV
rpt 56523 56689 Alu repeat copy AW
rpt 11312 11254 (c) LINE repeat copy A (partial)
rpt 18284 18242 (c) LINE repeat copy B (partial)
rpt 27091 26843 (c) LINE repeat copy C (partial)
rpt 37558 37300 (c) LINE repeat copy D (partial)
rpt 39451 39427 (c) LINE repeat copy E (partial)
rpt 51317 51100 (c) LINE repeat copy F (partial)
rpt 261 321 short interspersed repeat copy A
rpt 1013 1054 short interspersed repeat copy B
rpt 2902 2869 (c) short interspersed repeat copy C
rpt 3078 3107 short interspersed repeat copy D
rpt 4228 4254 short interspersed repeat copy E
rpt 4463 4494 short interspersed repeat copy F
rpt 21582 21635 short interspersed repeat copy G
rpt 28430 28394 (c) short interspersed repeat copy H
rpt 28923 28981 short interspersed repeat copy I
rpt 35562 35604 short interspersed repeat copy J
rpt 44042 44108 short interspersed repeat copy K
rpt 46628 46688 short interspersed repeat copy L
rpt 51715 51687 (c) short interspersed repeat copy M
unsure 17479 17479 polymorphism or cloning error
unsure 24903 24903 polymorphism or cloning error
BASE COUNT 15689 a 11281 c 11599 g 18167 t
ORIGIN 1 bp upstream of EcoRI site; chromosome Xq26.
1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt
61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc
121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc
181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt
241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga
301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc
361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag
421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc
481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt
541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct
601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga
661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt
721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat
781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga
841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag
901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct
961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc
1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag
1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac
1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc
1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg
1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt
1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg
1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg
1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg
1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc
1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc
1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg
1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg
1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg
1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc
1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga
1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca
1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc
2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat
2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg
2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag
2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg
2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct
2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt
2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg
2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca
2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg
2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt
2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg
2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa
2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt
2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg
2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc
2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc
3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt
3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt
3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct
3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat
3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt
3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag
3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc
3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc
3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta
3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac
3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc
3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt
3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc
3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc
3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc
3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag
3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc
4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca
4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc
4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc
4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa
4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca
4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga
4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac
4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg
4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt
4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac
4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg
4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag
4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg
4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca
4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg
4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat
4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg
5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc
5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta
5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc
5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact
5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct
5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg
5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg
5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc
5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa
5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat
5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa
5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg
5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc
5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa
5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt
5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt
6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg
6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga
6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg
6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc
6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt
6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc
6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg
6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt
6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga
6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac
6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct
6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag
6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc
6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt
6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg
6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc
6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca
7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc
7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga
7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt
7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc
7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg
7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag
7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag
7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca
7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt
7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac
7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga
7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt
7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa
7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct
7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact
7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt
7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc
8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac
8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct
8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg
8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat
8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag
8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat
8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg
8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg
8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc
8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac
8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg
8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc
8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat
8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga
8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta
8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg
9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt
9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag
9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt
9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc
9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag
9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt
9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca
9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc
9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc
9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat
9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg
9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc
9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga
9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca
9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat
9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta
9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc
10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc
10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa
10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa
10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat
10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt
10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct
10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt
10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga
10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca
10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct
10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg
10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg
10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa
10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta
10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa
10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca
10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct
11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa
11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc
11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat
11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc
11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag
11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag
11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga
11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg
11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg
11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt
11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac
11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa
11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa
11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa
11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt
11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt
12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag
12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat
12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat
12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc
12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt
12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt
12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct
12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat
12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa
12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag
12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt
12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac
12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca
12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct
12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt
12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc
12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta
13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc
13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg
13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc
13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct
13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt
13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg
13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat
13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca
13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga
13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt
13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac
13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta
13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg
13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc
13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat
13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg
13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc
14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt
14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga
14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt
14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc
14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg
14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc
14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc
14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca
14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg
14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata
14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg
14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac
14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg
14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat
14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac
14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa
15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt
15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg
15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg
15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg
15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt
15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat
15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt
15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata
15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat
15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag
15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg
15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt
15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga
15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc
15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca
15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag
15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt
16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag
16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt
16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa
16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa
16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt
16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact
16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac
16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac
16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa
16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg
16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg
16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag
16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa
16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa
16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta
16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta
16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg
17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt
17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt
17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt
17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc
17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg
17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc
17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt
17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa
17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc
17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat
17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc
17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat
17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact
17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa
17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt
17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa
18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat
18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat
18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt
18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat
18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc
18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg
18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt
18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt
18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca
18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc
18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt
18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg
18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg
18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat
18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa
18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat
18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg
19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa
19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag
19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga
19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc
19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa
19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg
19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc
19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa
19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg
19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa
19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa
19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg
19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag
19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga
19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag
19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt
19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct
20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt
20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac
20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct
20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga
20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat
20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg
20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat
20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt
20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc
20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag
20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt
20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca
20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta
20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg
20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc
20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt
21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc
21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg
21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc
21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc
21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa
21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat
21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa
21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat
21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat
21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt
21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt
21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta
21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt
21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt
21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct
21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct
21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat
22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc
22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg
22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca
22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag
22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc
22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat
22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg
22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct
22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca
22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt
22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc
22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta
22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct
22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta
22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct
22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa
22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa
23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag
23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc
23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa
23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc
23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac
23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga
23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt
23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc
23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc
23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat
23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc
23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc
23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc
23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga
23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga
23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa
24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg
24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat
24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat
24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct
24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt
24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga
24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg
24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt
24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc
24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt
24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc
24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg
24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat
24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa
24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt
24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg
24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc
25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag
25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc
25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact
25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta
25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg
25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc
25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag
25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc
25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca
25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag
25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag
25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt
25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag
25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat
25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa
25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata
25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg
26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt
26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat
26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt
26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag
26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc
26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact
26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg
26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat
26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag
26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt
26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt
26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata
26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta
26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat
26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg
26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata
27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata
27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg
27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc
27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat
27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag
27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga
27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt
27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac
27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa
27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg
27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt
27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat
27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct
27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata
27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt
27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat
27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct
28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg
28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca
28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat
28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa
28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca
28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc
28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg
28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa
28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt
28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt
28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt
28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat
28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa
28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac
28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac
28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca
28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct
29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac
29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa
29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc
29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga
29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt
29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca
29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca
29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt
29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt
29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca
29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg
29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt
29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct
29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg
29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag
29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct
30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga
30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact
30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt
30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa
30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt
30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg
30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc
30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc
30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag
30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga
30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta
30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata
30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct
30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata
30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa
30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca
30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga
31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg
31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac
31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat
31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga
31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg
31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt
31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt
31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa
31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat
31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg
31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac
31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat
31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg
31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt
31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt
31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac
31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta
32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa
32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag
32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta
32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt
32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt
32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact
32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc
32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat
32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa
32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta
32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca
32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg
32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc
32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt
32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt
32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt
33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc
33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac
33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta
33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac
33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta
33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta
33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata
33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata
33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc
33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat
33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa
33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct
33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc
33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt
33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga
33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct
33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag
34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc
34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat
34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac
34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt
34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga
34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt
34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc
34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt
34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca
34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct
34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac
34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa
34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc
34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc
34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg
34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt
34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac
35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt
35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat
35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta
35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag
35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga
35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc
35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc
35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt
35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag
35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat
35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta
35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat
35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta
35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg
35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt
35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa
36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt
36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag
36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat
36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga
36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa
36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt
36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga
36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct
36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg
36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg
36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga
36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag
36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga
36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta
36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt
36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag
36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta
37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa
37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc
37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc
37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg
37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc
37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg
37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac
37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag
37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg
37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc
37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt
37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc
37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt
37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc
37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc
37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag
37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa
38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat
38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc
38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat
38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga
38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata
38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa
38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag
38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt
38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg
38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc
38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg
38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa
38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg
38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa
38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt
38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg
39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa
39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt
39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg
39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc
39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga
39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga
39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc
39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca
39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac
39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa
39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc
39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg
39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct
39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa
39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct
39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga
39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat
40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg
40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt
40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa
40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct
40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag
40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag
40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc
40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact
40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg
40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg
40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag
40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac
40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag
40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg
40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca
40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca
40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct
41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg
41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc
41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac
41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg
41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct
41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc
41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt
41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag
41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg
41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta
41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac
41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc
41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt
41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata
41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga
41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg
42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt
42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc
42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt
42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt
42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta
42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg
42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct
42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt
42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta
42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag
42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta
42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca
42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa
42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg
42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag
42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac
42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa
43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt
43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc
43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct
43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat
43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat
43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc
43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt
43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc
43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt
43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt
43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc
43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca
43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag
43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg
43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa
43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg
43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt
44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca
44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc
44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta
44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact
44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc
44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca
44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg
44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg
44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc
44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac
44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg
44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta
44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca
44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc
44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact
44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt
45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga
45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt
45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg
45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc
45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta
45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg
45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc
45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac
45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct
45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca
45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa
45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa
45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa
45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat
45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg
45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt
45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt
46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca
46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc
46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca
46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg
46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt
46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag
46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct
46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt
46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc
46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc
46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc
46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa
46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc
46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat
46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg
46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc
46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac
47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac
47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca
47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg
47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca
47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc
47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga
47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc
47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa
47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact
47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag
47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg
47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc
47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa
47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga
47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg
47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga
48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc
48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct
48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc
48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg
48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg
48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt
48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg
48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa
48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat
48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc
48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg
48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga
48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat
48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg
48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc
48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt
48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt
49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg
49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag
49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc
49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct
49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat
49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa
49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc
49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc
49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc
49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc
49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta
49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg
49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa
49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc
49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag
49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct
49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt
50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat
50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc
50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt
50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct
50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga
50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca
50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt
50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca
50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt
50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg
50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca
50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa
50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga
50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct
50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt
50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac
51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca
51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca
51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct
51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca
51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac
51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc
51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac
51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa
51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc
51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct
51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc
51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc
51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga
51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt
51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac
51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg
51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa
52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga
52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag
52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa
52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg
52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat
52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg
52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca
52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa
52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag
52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca
52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt
52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta
52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa
52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa
52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac
52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc
52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca
53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata
53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt
53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag
53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc
53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact
53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc
53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat
53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag
53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca
53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag
53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc
53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc
53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc
53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt
53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta
53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct
54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg
54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac
54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt
54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg
54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc
54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc
54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg
54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga
54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag
54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc
54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta
54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta
54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg
54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta
54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga
54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa
54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca
55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc
55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca
55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc
55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc
55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc
55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta
55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg
55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg
55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt
55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt
55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg
55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat
55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac
55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt
55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa
55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa
55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta
56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg
56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag
56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca
56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt
56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca
56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt
56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct
56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg
56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg
56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa
56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc
56701 gccgcccctg gtgatggacc agcggggctt cgacga
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS THARGA 635 bp ds-DNA BCT 06-MAR-1990
DEFINITION T.acidophilum 5S rRNA gene.
ACCESSION M32297
KEYWORDS 5S ribosomal RNA.
SOURCE T.acidophilum (strain 122-1B2) DNA, clone pT5PH-23.
ORGANISM Thermoplasma acidophilum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Thermoplasmales.
REFERENCE 1 (bases 1 to 635)
AUTHORS Ree,H.K. and Zimmerman,R.A.
TITLE The organization and expression of the three rRNA genes from
Thermoplasma acidophilum
JOURNAL Unpublished (1990) Univ. of Massachusetts, Dept. of Biochemistry,
Amherst, MA 01003
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.K.Ree 23-FEB-1990.
FEATURES from to/span description
rRNA 213 335 5S rRNA
RNA 211 346 5S RNA ( 3' end +/- 4 bp)
site 184 191 put. promoter
site 333 354 put. terminator
BASE COUNT 175 a 140 c 140 g 180 t
ORIGIN
1 ctgcagcctt atcatgggaa aatgggcatc tgctggagac gactatgtac ttcatgcatc
61 catattatct tatgatatta aaatgctaaa gcgcatgatc gcttcggaag gaatatcgac
121 agggcgagat tcgctgtctc ataatcgccc ggatctgcgc tgaagccgta aaatcacgaa
181 aatcttatat agatgtgttc tatatagtgt tcggcaacgg tcatagcagc agggaaacac
241 cagatcccat tccgaactcg acggttaagc ctgctgcgta ttgcgttgta ctgtatgccg
301 cgagggtacg ggaagcgcaa tatgctgtta ccacttttga aatgaaagtt ttttacatct
361 atctttaatt cagtggcaat gattaatcct tggtcatcat ctgatttctt cgactatgag
421 aggctaaaga gaggaattcg gcataggcga tatgaacgtt cccttcgatc attttctctt
481 caggacttat acttggccag cgatccatag actatatcga ctacgcttta aagcatcaca
541 tgaaattcaa cgttatgact ggcctcatgc cttcaggaga gatgcatctt ggcaacaaga
601 gcgccataga ccaggtcatc ttcttccaga agctt
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS MUSFILA 765 bp ds-DNA ROD 06-MAR-1990
DEFINITION Mouse filaggrin gene, partial cds (type A repeat subunit).
ACCESSION M32300 J05198
KEYWORDS epidermal matrix protein; filaggrin.
SOURCE Mouse cell line 3T3 DNA, clone 6.1A2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 765)
AUTHORS Rothnagel,J.A. and Steinert,P.M.
TITLE The structure of the gene for mouse filaggrin and a comparison of
the repeating units
JOURNAL J. Biol. Chem. 265, 1862-1865 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Rothnagel, 23-FEB-1990.
FEATURES from to/span description
pept < 1 > 765 filaggrin (AA at 1)
rpt 1 765 type A repeat
BASE COUNT 178 a 257 c 250 g 80 t
ORIGIN
1 ggtggactgg cagctgacgc ctccaggcgc tctggggcgc gtcaaggcca ggcatctgcc
61 cagggtcgcg cagggtcaca aggccaagca cagggccgcg tcggctcgtc agctgacagg
121 caagggcgtc gaggggtcag cgagagtcag gccagcgaca gcgagggcca ctccgacttc
181 tcagaaggcc aggcagtagg agctcacagg cagtctggag ccggccagag gcacgagcag
241 cggtcctccc gaggccagca cggctccgga ttctacccag tatactacta ctatgaacaa
301 gaacacagcg aggaggagtc cgattcccag catcagcacg gccaccagca cgaacagcaa
361 cgcgggcacc agcaccagca ccagcaccag catgagcacg aacagcccga gtccggccac
421 aggcagcagc agtccagtgg gagaggacac cagggagccc accaggaaca gggaagggac
481 agcgccaggt cccggggttc taaccaaggg cacagctcct ctcgccacca ggccgactct
541 cccagggtca gcgcaagatc aggctcagga ggaagaggac agtcccccga cgcctctggc
601 cgcagctcaa acaggaggga cagaccccgg cagcccagcc ccagccagtc cagtgacagc
661 caagtccatt ctggagtcca ggtcgaaggc cggcgcgggc agtcctcatc tgccaacagg
721 agggccggat ccagctccgg ctcaggggtc cagggggcct ctgca
//
LOCUS MUSFILB 750 bp ds-DNA ROD 06-MAR-1990
DEFINITION Mouse filaggrin gene, partial cds (type B repeat subunit).
ACCESSION M32301 J05198
KEYWORDS epidermal matrix protein; filaggrin.
SOURCE Mouse cell line 3T3 DNA, clone 6.1A2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 750)
AUTHORS Rothnagel,J.A. and Steinert,P.M.
TITLE The structure of the gene for mouse filaggrin and a comparison of
the repeating units
JOURNAL J. Biol. Chem. 265, 1862-1865 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Rothnagel 23-FEB-1990.
FEATURES from to/span description
pept < 1 > 750 filaggrin (AA at 1)
rpt 1 750 type B repeat
BASE COUNT 174 a 252 c 248 g 76 t
ORIGIN
1 ggtggactgg cagctgacgc ctccaggcgc tctggggcgc ttcaaggcca ggcatctgcc
61 cagggtcgcg cagggtcaca aggccaagca cagggccgcg tcggctcgtc agctgacagg
121 caagggcgtc gaggggtcag cgagagtcag gccagcgaca gcgagggcca ctctgacttc
181 tcagaaggcc aggcagtagg agctcacagg cagtctggag ccggccagag gcacgagcag
241 cggtcctccc gaggccagca cggctccgga tactactatg agcaagaaca cagcgaggag
301 gagtccgatt cccagcacca gcacggccac cagcacgaac agcaacgcgg gcaccagcac
361 cagcaccagc accagcatga gcacgaacag cccgagtccg gccacaggca gcagcagtcc
421 agtgggagag gacaccaggg agcccaccag gaacagggaa gggacagcgc caggtcccgg
481 ggttctaacc aagggcacag ctcctctcgc caccaggccg actctcccag ggtcagcgca
541 agatcaggct caggaggaag aggacagtcc cccgacgcct ctggccgcag ctcaaacagg
601 agggacagac cccggcagcc cagccccagc cagtccagtg acagccaagt ccattctgga
661 gtccaggtcg aaggccggcg cgggcagtcc tcatctgcca acaggagggc cggatccagc
721 tccagctcag gggtccaggg ggcctctgca
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS MUSJUNDR 1596 bp ss-mRNA ROD 17-JAN-1990
DEFINITION Mouse junD proto-oncogene mRNA, complete cds.
ACCESSION J05205
KEYWORDS JUN-D protein, junD proto-oncogene.
SOURCE Mouse myocyte cell line BC-3H1, cDNA to genomic RNA, clone jun16.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1596)
AUTHORS Li,L., Hu,J.-S. and Olson,E.N.
TITLE Different members of the jun proto-oncogene family exhibit distinct
patterns of expression in response to type beta transforming growth
factor
JOURNAL J. Biol. Chem. 265, 1556-1562 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Li, 12-JAN-1990, for release after publication.
FEATURES from to/span description
pept 43 1068 JUN-D protein
BASE COUNT 264 a 559 c 492 g 280 t 1 others
ORIGIN
1 gagaggagat gacagagctt catgagtgat tcagtcccac ggatggaaac gcccttgtat
61 ggcgaggagg cgctgagcgg cctggctgcg ggtgcgtcga gcgtcgctgg tgctactggg
121 gcccccggcg gtggtggctt cgcgcccccg ggccgcgctt tccccggggc gcccccgacg
181 agcagcatgc tgaagaaaga cgcgctgacg ctcagcctgg cggacgaggg agcggcggga
241 ttgaaaccag ggtcggccac tgcaccttct gcgctgcgcc ccgacggcgc ccccgacggg
301 ctgctggctt cgccggatct tgggctgctc aaactcgcgt cgccggagct ggagaggctg
361 atcatccagt ccaacgggct ggtgaccact accccgacca gtacgcagtt cctctacccg
421 aaggtggcag ccagcgagga gcaggagttc gccgaaggct tcgtcaaggc gctggaggac
481 ctgcacaagc aaagccagct gggtgcggcc accgcggcca cctcaggggc tcccgcgcct
541 cccgcgcccg ccgacctggc cgccaccccc ggggccacgg agaccccggt ctacgccaac
601 ctgagcagtt tcgcgggtgg cgccgggccc cctgggggcg cggccaccgt ggctttcgcc
661 gcggagccag tgcccttccc gccgcccccg ggcgcgctgg ggccgccgcc acctccgcat
721 ccaccgcgcc tggccgcgct caaggacgag ccgcagaccg tgccggacgt gccgagcttc
781 ggcgacagcc ctccgctgtc gcccatcgac atggacacgc aagaacgcat caaggcggag
841 cgcaagaggc tgcgcaaccg catcgccgcc tccaaatgcc gcaagcgcaa gctggagcgt
901 atctcgcgcc tggaggagaa agtcaagacc ctcaaaagcc agaacaccga gctggcgtcc
961 accgccagcc tgctgcgcga gcaggtggcg cacgtcaaac agaaagtcct cagccacgtc
1021 aacagcggct gccagctgct gccccagcac caggtcccgg cgtactgagc ccgagcncgg
1081 ggcgcatgcg cggactagct gcggtggggg ggcgccccgg actctttcga gactcggtgc
1141 ccccggactc gacaagccgg acccccctta actctggatg gggaacccga gcgcacgacc
1201 cccgccctcg cgccgcctct ctacccccag tcctgcgcgt ggctgcccct ttgatcctcc
1261 gagtaggggc tctaaggggc gcgcagcctc acgctctgcc tttccttttt tttttgcctt
1321 ttggaagaga gaacgggagt gtccgattcc gccctattta tgtttctact cgggaacaaa
1381 cgttggttgc gtgttgagtg tgtgtttcct tgtgttggtt ttttaaagaa acgggaagaa
1441 gaaaaaaaaa accctccccc ctcccccccc cgctctctct cctcccgaca ccatcccctc
1501 tctttttgtt tggttttgtt ttgctacgag tccacattcc tgttccgtaa tcctggttcg
1561 ccggttttgt gttttcagta aagtcttcgt tacgcc
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS CECCOLE3A 3066 bp ds-DNA BCT 31-AUG-1987
DEFINITION Plasmid ColE3-CA38 colicin E3, immunity proteins E3 and E8,
and lysis genes, complete cds.
ACCESSION J01574 J01575 M14038 X02397
KEYWORDS colicin; colicin release protein; immune response gene;
immunity protein; lysis protein.
SOURCE Plasmid ColE3-CA38 (from E.coli) DNA.
ORGANISM Plasmid Colicin E3-CA38
Prokaryota; Bacteria.
REFERENCE 1 (bases 1663 to 2662)
AUTHORS Masaki,H. and Ohta,T.
TITLE A plasmid region encoding the active fragment and the inhibitor
protein of colicin E3-CA38
JOURNAL FEBS Lett. 149, 129-132 (1982)
STANDARD full staff_review
REFERENCE 2 (bases 1612 to 2662)
AUTHORS Mock,M., Miyada,C.G. and Gunsalus,R.P.
TITLE Nucleotide sequence for the catalytic domain of colicin E3 and its
immunity protein. Evidence for a third gene overlapping colicin
JOURNAL Nucleic Acids Res. 11, 3547-3557 (1983)
STANDARD full staff_review
REFERENCE 3 (bases 2251 to 3066)
AUTHORS Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P.
TITLE Characterization and nucleotide sequence of a colicin-release gene
in the hic region of plasmid ColE3-CA38
JOURNAL Gene 29, 175-184 (1984)
STANDARD full staff_review
REFERENCE 4 (bases 2251 to 3066)
AUTHORS Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P.
TITLE Corrigenda: Characterization and nucleotide sequence of a
colicin-release gene in the hic region of plasmid ColE3-CA38
JOURNAL Gene 42, 351-355 (1986)
STANDARD full staff_review
REFERENCE 5 (bases 1 to 2854)
AUTHORS Masaki,H. and Ohta,T.
TITLE Colicin E3 and its immunity genes
JOURNAL J. Mol. Biol. 182, 217-227 (1985)
STANDARD simple automatic
REFERENCE 6 (bases 1 to 3066; revises [5])
AUTHORS Akutsu,A., Masaki,H. and Ohta,T.
TITLE Molecular structure and immunity specificity of colicin E6, an
evolutionary intermediate between E-group colicins and cloacin DF13
JOURNAL J. Bacteriol. 171, 6430-6436 (1989)
STANDARD simple staff_review
COMMENT [1] refers to colicin E3 as protein A and the immunity protein E3
as protein B. [2] refers to the colE3 gene as the ceaC gene and the
immE3 gene as the ceaC gene.
Draft entry and printed copy of sequence for [3] kindly provide by
R.J.Watson, May 1985.
FEATURES from to/span description
pept 331 1986 colicin E3
pept 1996 2253 immunity protein E3
pept 2420 2677 immunity protein E8 (gtg start codon)
pept 2739 2882 lysis protein
binding 317 322 ribosome binding site (pot.)
binding 1988 1993 ribosome binding site (pot.)
revision 132 134 atc in [6]; attc in [5]
revision 755 756 gc in [4]; cg in [3]
revision 1265 1267 gca in [4]; ga in [3]
revision 1348 1349 tt in [4]; ttt in [3]
BASE COUNT 897 a 546 c 796 g 827 t
ORIGIN
1 aactcggttt taatcagacc tggcatgagt ggaagcggga cgaacagcac aggcaacaac
61 aacgccgccc cgggcacttc cggggcatga gtatgtgata tccggggctg caccccggac
121 cccgccaaca catcacgggc cacaaaattt tttgtggccc gctctgcgtt ttctaagtgt
181 tatccctcct gatttctaaa aaattttcca cctgaacttg acagaaaaaa cgatgacgag
241 tactttttga tctgtacata aacccagtgg ttttatgtac agtattaatc gtgtaatcaa
301 ttgttttaac gcttaaaaga gggaattttt atgagcggtg gcgatggacg cggccataac
361 acgggcgcgc atagcacaag tggtaacatt aatggtggcc cgaccgggct tggtgtaggt
421 ggtggtgctt ctgatggctc cggatggagt tcggaaaata acccgtgggg tggtggttcc
481 ggtagcggca ttcactgggg tggtggttcc ggtcatggta atggcggggg gaatggtaat
541 tccggtggtg gttcgggaac aggcggtaat ctgtcagcag tagctgcgcc agtggcattt
601 ggttttccgg cactttccac tccaggagct ggcggtctgg cggtcagtat ttcagcggga
661 gcattatcgg cagctattgc tgatattatg gctgccctga aaggaccgtt taaatttggt
721 ctttgggggg tggctttata tggtgtattg ccatcacaaa tagcgaaaga tgaccccaat
781 atgatgtcaa agattgtgac gtcattaccc gcagatgata ttactgaatc acctgtcagt
841 tcattacctc tcgataaggc aacagtaaac gtaaatgttc gtgttgttga tgatgtaaaa
901 gacgagcgac agaatatttc ggttgtttca ggtgttccga tgagtgttcc ggtggttgat
961 gcaaaaccta ccgaacgtcc gggtgttttt acggcatcaa ttccaggtgc acctgttctg
1021 aatatttcag ttaataacag tacgccagca gtacagacat taagcccagg tgttacaaat
1081 aatactgata aggatgttcg cccggcagga tttactcagg gtggtaatac cagggatgca
1141 gttattcgat tcccgaagga cagcggtcat aatgccgtat atgtttcagt gagtgatgtt
1201 cttagccctg accaggtaaa acaacgtcaa gatgaagaaa atcgccgtca gcaggaatgg
1261 gatgctacgc atccggttga agcggctgag cgaaattatg aacgcgcgcg tgcagagctg
1321 aatcaggcaa atgaagatgt tgccagaaat caggagcgac aggctaaagc tgttcaggtt
1381 tataattcgc gtaaaagcga acttgatgca gcgaataaaa ctcttgctga tgcaatagct
1441 gaaataaaac aatttaatcg atttgcccat gacccaatgg ctggcggtca cagaatgtgg
1501 caaatggccg ggcttaaagc ccagcgggcg cagacggatg taaataataa gcaggctgca
1561 tttgatgctg ctgcaaaaga gaagtcagat gctgatgctg cattgagttc tgctatggaa
1621 agcaggaaga agaaagaaga taagaaaagg agtgctgaaa ataatttaaa cgatgaaaag
1681 aataagccca gaaaaggttt taaagattac gggcatgatt atcatccagc tccgaaaact
1741 gagaatatta aagggcttgg tgatcttaag cctgggatac caaaaacacc aaagcagaat
1801 ggtggtggaa aacgcaagcg ctggactgga gataaagggc gtaagattta tgagtgggat
1861 tctcagcatg gtgagcttga ggggtatcgt gccagtgatg gtcagcatct tggctcattt
1921 gaccctaaaa caggcaatca gttgaaaggt ccagatccga aacgaaatat caagaaatat
1981 ctttgagagg aagttatggg acttaaattg gatttaactt ggtttgataa aagtacagaa
2041 gattttaagg gtgaggagta ttcaaaagat tttggagatg acggttcagt tatggaaagt
2101 ctaggtgtgc cttttaagga taatgttaat aacggttgct ttgatgttat agctgaatgg
2161 gtacctttgc tacaaccata ctttaatcat caaattgata tttccgataa tgagtatttt
2221 gtttcgtttg attatcgtga tggtgattgg tgatcaaata ttatcaggga tgagttgata
2281 tacgggcttc tagtgttcat ggatgaacgc tggagcctcc aaatgtagaa atgttatatt
2341 ttttattgag ttcttggtta taattgctcc gcaatgattt aaataagcat tatttaaaac
2401 attctcagga gaggtgaagg tggagctaaa aaaaagtatt ggtgattaca ctgaaaccga
2461 attcaaaaaa tttattgaag acatcatcaa ttgtgaaggt gatgaaaaaa aacaggatga
2521 taacctcgag tattttataa atgttactga gcatcctagt ggttctgatc tgatttatta
2581 cccagaaggt aataatgatg gtagccctga aggtgttatt aaagagatta aagaatggcg
2641 agccgctaac ggtaagtcag gatttaaaca gggctgaaat atgaatgccg gttgtttatg
2701 gatgaatggc tggcattctt tcacaacaag gagtcgttat gaaaaaaata acagggatta
2761 ttttattgct tcttgcagtc attattctgt ctgcatgtca ggcaaactat atccgggatg
2821 ttcagggcgg gaccgtatct ccgtcatcaa cagctgaagt gaccggatta gcaacgcagt
2881 aacccgaaat cctctttgac aaaaacaaag cgtgtcaggc tgattctgat gcgctttttt
2941 tttgaaatgt cacaaaaatt ccatgtggga gatgggatct aaaatcctcg tgcagaactt
3001 tccatccagg gggagaaaac ttgtcgtttt gagccgttcg gtgttcagaa cgcacgaaac
3061 cgatcg
//
LOCUS CECCOLE6A 3065 bp ds-DNA BCT 16-FEB-1990
DEFINITION Plasmid ColE6-CT14 colicin E6, immunity proteins E6 and E8, and
lysis protein genes, complete cds.
ACCESSION M31808
KEYWORDS colicin E6; immunity protein; lysis protein.
SOURCE Plasmid ColE6-CT14 (from E.coli) DNA.
ORGANISM Plasmid Colicin E6-CT14
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 3065)
AUTHORS Akutsu,A., Masaki,H. and Ohta,T.
TITLE Molecular structure and immunity specificity of colicin E6, an
evolutionary intermediate between E-group colicins and cloacin DF13
JOURNAL J. Bacteriol. 171, 6430-6436 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 331 1986 colicin E6
pept 1996 2253 immunity protein E6
pept 2420 2677 immunity protein E8 (gtg start codon)
pept 2739 2882 lysis protein
BASE COUNT 903 a 545 c 799 g 818 t
ORIGIN
1 aactcggttt taatcagacc tggcatgagt ggaagcggga cgaacagcac aggcaacaac
61 aacgccgccc cgggcacttc cggggcatga gtatgtgata tccggggctg caccccggac
121 cccgccaaca catcacgggc cacaaaattt tttgtggccc gctctgcgtt ttctaagtgt
181 tatccctcct gatttctaaa aaattttcca cctgaacttg acagaaaaaa agatgacgag
241 tactttttga tctgtacata aaaccagtgg ttttatgtac agtattaatc atgtaattaa
301 ttgttttaac gcttaaaaga gggaattttt atgagcggtg gcgatggacg cggccataac
361 acgggcgcgc atagcacaag tggtaacatt aatggtggcc cgaccgggct tggtgtaggt
421 ggtggtgctt ctgatggctc cggatggagt tcggaaaata acccgtgggg tggtggttcc
481 ggtagcggca ttcactgggg tggtggttcc ggtcatggta atggcggggg gaatggtaat
541 tccggtggtg gctcgggaac aggcggtaat ctgtcagcag tagctgcgcc agtggcattt
601 ggttttccgg cactttccac tccaggagct ggcggtctgg cggtcagtat ttcagcggga
661 gcattatcgg cagctattgc tgatattatg gctgccctga aaggaccgtt taaatttggt
721 ctttgggggg tggctttata tggtgtattg ccatcacaaa tagcgaaaga tgaccccaat
781 atgatgtcaa agattgtgac gtcattaccc gcagatgata ttactgaatc acctgtcagt
841 tcattacctc tcgataaggc aacagtaaac gtaaatgttc gtgttgttga tgatgtaaaa
901 gacgaacgac agaatatttc ggttgtttca ggtgttccga tgagtgttcc ggtggttgat
961 gcaaaaccta ccgaacgtcc aggtgttttt acggcatcaa ttccaggtgc acctgttctg
1021 aatatttcag ttaataacag tacgccagca gtacagacat taagcccagg tgttacaaat
1081 aatactgata aggatgttcg cccggcagga tttactcagg ggggtaatac cagggatgca
1141 gttattcgat tcccgaagga cagcggtcat aatgccgtat atgtttcagt gagtgatgtt
1201 cttagccctg accaggtaaa acaacgtcag gatgaagaaa atcgccgtca gcaggaatgg
1261 gatgctacgc atccggttga agcggctgag cgaaattatg aacgcgcgcg tgcagagctg
1321 aatcaggcaa atgaagatgt tgccagaaat caggagcgac aggctaaagc tgttcaggtt
1381 tataattcgc gtaaaagcga acttgatgca gcgaataaaa ctcttgctga tgcaatagct
1441 gaaataaaac aatttaatcg atttgcccat gacccaatgg ctggcggtca cagaatgtgg
1501 caaatggccg ggcttaaagc ccagcgggcg cagacggatg taaataataa gcaggctgca
1561 tttgatgctg ctgcaaaaga gaagtcagat gctgatgctg cattgagttc tgctatggaa
1621 agcaggaaga agaaagaaga taagaaaagg agcgctgaaa ataaattaaa cgaggaaaaa
1681 aacaagcctc gcaagggagt taaagattac ggtcatgatt atcatccaga tcctaaaact
1741 gaagatataa aagggctggg tgagttaaaa gagggtaaac caaaaactcc aaagcaaggt
1801 ggtggcggta aacgtgctag atggtatgga gataaagggc gtaagattta tgagtgggac
1861 tctcagcatg gtgagcttga ggggtatcgt gccagtgatg gtcagcatct tggctcattc
1921 gagcctaaga ctggtaatca gttgaaagga cctgatccaa aacgaaatat caaaaagtat
1981 ctttgagagg atgttatggg gcttaaatta catattaatt ggtttgataa gacgaccgag
2041 gaatttaaag gtggtgagta ttcaaaagat tttggagatg atggctcggt cattgaacgt
2101 cttggaatgc ctttaaaaga taatatcaat aatggttggt ttgatgttat agctgaatgg
2161 gtacctttgc tacaaccata ctttaatcat caaattgata tttccgataa tgagtatttt
2221 gtttcgtttg attatcgtga tggtgattgg tgatcaaata ttatcaggga tgagttgatg
2281 tacgggcttc tagtgttcat ggatgaacgc tggagcctcc aaatgtagaa gtgttatatt
2341 ttttattgag ttcttggtta taattgctcc gcaataattt aaataggcat tatttaaaac
2401 attctcagga gaggtgaagg tggagctaaa gaaaagtatt ggtgattaca ctgaaaccga
2461 attcaaaaaa attattgaaa acatcatcaa ttgtgaaggt gatgaaaaaa aacaggatga
2521 taacctcgag cattttataa gtgttactga gcatcctagt ggttctgatc tgatttatta
2581 cccagaaggt aataatgatg gtagccctga agctgttatt aaagagatta aagaatggcg
2641 agctgctaac ggtaagtcag gatttaaaca gggctgaaat atgaatgccg gttgtttaag
2701 gatgaatgac tggcattctt tcacaacaag gagtcgttat gaaaaaaata acagggatta
2761 ttttattgct tcttgcagtc attattctgg ctgcatgtca ggcaaactat atccgtgatg
2821 ttcagggcgg gactgtatca ccgtcgtcaa ctgctgaact gaccggagtg gaaacgcagt
2881 aacccgaaat cctctttgac aaaaacaaag cgtgtcaggc tgattctgat gcgctttttt
2941 ttgaaatgtc acaaaaattc catgtgggag atgggatcta aaatccccgt gcagaacttt
3001 ccatccagag ggagaaaact tgtcgttttg agccgttcgg tgttcagaac gcacgaaacc
3061 gatcg
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
1 and 2.
ACCESSION M32476 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 1 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2238)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene and analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept 704 767 carcinoembryonic antigen-related protein
(CGM1), exon 1
1770 + 2129 carcinoembryonic antigen-related protein, exon
2
pre-msg 512 > 2238 CGM1 mRNA and introns
IVS 768 1769 CGM1 intron A
IVS 2130 > 2238 CGM1 intron B
BASE COUNT 615 a 532 c 539 g 552 t
ORIGIN
1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
2221 tgcattgatc tacatgtt
//
LOCUS RATCGM1AC2 539 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32477 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 2 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 539)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 539 carcinoembryonic antigen-related protein
intron B
BASE COUNT 126 a 127 c 138 g 148 t
ORIGIN
1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS RATCGM1AC3 828 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32478 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 3 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 828)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 828 carcinoembryonic antigen-related protein
intron B
BASE COUNT 193 a 255 c 190 g 190 t
ORIGIN
1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS RATCGM1AC4 642 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 3.
ACCESSION M32479 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 4 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 642)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 61 + 420 carcinoembryonic antigen-related protein
(CGM1), exon 3
pre-msg < 1 > 642 CGM1 mRNA and introns
IVS < 1 60 CGM1 intron B
IVS 421 > 642 CGM1 intron C
BASE COUNT 151 a 160 c 154 g 177 t
ORIGIN
1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS RATCGM1AC5 616 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 4.
ACCESSION M32480 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 5 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 616)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 90 + 449 carcinoembryonic antigen-related protein
(CGM1), exon 4
pre-msg < 1 > 616 CGM1 mRNA and introns
IVS < 1 89 CGM1 intron C
IVS 450 616 CGM1 intron D
BASE COUNT 152 a 153 c 147 g 164 t
ORIGIN
1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
601 ttgtctgatt cccttc
//
LOCUS RATCGM1AC6 654 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron 4.
ACCESSION M32481 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 6 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 654)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 654 carcinoembryonic antigen-related protein
intron D
BASE COUNT 185 a 146 c 155 g 168 t
ORIGIN
1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS RATCGM1AC7 492 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 5.
ACCESSION M32482 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 7 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 492)
AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 118 + 471 carcinoembryonic antigen-related protein
(CGM1), exon 5
pre-msg < 1 > 492 CGM1 mRNA and introns
IVS < 1 117 CGM1 intron D
IVS 472 492 CGM1 intron E
BASE COUNT 134 a 134 c 105 g 119 t
ORIGIN
1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
481 tctcaaggtc tc
//
LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 6.
ACCESSION M32483 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 8 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1341)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 640 / 987 carcinoembryonic antigen-related protein
(CGM1), exon 6
pre-msg < 1 > 1341 CGM1 mRNA and introns
IVS < 1 639 CGM1 intron E
IVS 988 1341 CGM1 intron F
BASE COUNT 357 a 329 c 321 g 334 t
ORIGIN
1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
1321 atccgttgcc tgacggtcga c
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
1 and 2.
ACCESSION M32476 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 1 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2238)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene and analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept 704 767 carcinoembryonic antigen-related protein
(CGM1), exon 1
1770 + 2129 carcinoembryonic antigen-related protein, exon
2
pre-msg 512 > 2238 CGM1 mRNA and introns
IVS 768 1769 CGM1 intron A
IVS 2130 > 2238 CGM1 intron B
BASE COUNT 615 a 532 c 539 g 552 t
ORIGIN
1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
2221 tgcattgatc tacatgtt
//
LOCUS RATCGM1AC2 539 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32477 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 2 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 539)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 539 carcinoembryonic antigen-related protein
intron B
BASE COUNT 126 a 127 c 138 g 148 t
ORIGIN
1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS RATCGM1AC3 828 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32478 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 3 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 828)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 828 carcinoembryonic antigen-related protein
intron B
BASE COUNT 193 a 255 c 190 g 190 t
ORIGIN
1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS RATCGM1AC4 642 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 3.
ACCESSION M32479 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 4 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 642)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 61 + 420 carcinoembryonic antigen-related protein
(CGM1), exon 3
pre-msg < 1 > 642 CGM1 mRNA and introns
IVS < 1 60 CGM1 intron B
IVS 421 > 642 CGM1 intron C
BASE COUNT 151 a 160 c 154 g 177 t
ORIGIN
1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS RATCGM1AC5 616 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 4.
ACCESSION M32480 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 5 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 616)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 90 + 449 carcinoembryonic antigen-related protein
(CGM1), exon 4
pre-msg < 1 > 616 CGM1 mRNA and introns
IVS < 1 89 CGM1 intron C
IVS 450 616 CGM1 intron D
BASE COUNT 152 a 153 c 147 g 164 t
ORIGIN
1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
601 ttgtctgatt cccttc
//
LOCUS RATCGM1AC6 654 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron 4.
ACCESSION M32481 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 6 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 654)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 654 carcinoembryonic antigen-related protein
intron D
BASE COUNT 185 a 146 c 155 g 168 t
ORIGIN
1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS RATCGM1AC7 492 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 5.
ACCESSION M32482 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 7 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 492)
AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 118 + 471 carcinoembryonic antigen-related protein
(CGM1), exon 5
pre-msg < 1 > 492 CGM1 mRNA and introns
IVS < 1 117 CGM1 intron D
IVS 472 492 CGM1 intron E
BASE COUNT 134 a 134 c 105 g 119 t
ORIGIN
1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
481 tctcaaggtc tc
//
LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 6.
ACCESSION M32483 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 8 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1341)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 640 / 987 carcinoembryonic antigen-related protein
(CGM1), exon 6
pre-msg < 1 > 1341 CGM1 mRNA and introns
IVS < 1 639 CGM1 intron E
IVS 988 1341 CGM1 intron F
BASE COUNT 357 a 329 c 321 g 334 t
ORIGIN
1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
1321 atccgttgcc tgacggtcga c
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AA 3190 bp ss-mRNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (rnCGM1) mRNA,
complete cds.
ACCESSION M32474 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SOURCE R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation,
cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3190)
AUTHORS Rebstock,S., Lucas,K., Thompson,F.A. and Zimmermann,W.
TITLE cDNA and gene analysis imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1900) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept 122 2251 carcinoembryonic antigen-related protein
precursor (rnCGM1)
sigp 122 221 carcinoembryonic antigen-related protein
signal peptide
matp 222 2248 rnCGM1 protein
BASE COUNT 871 a 798 c 693 g 828 t
ORIGIN
1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta
61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca
121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct
181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc
241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga
301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact
361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt
421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac
481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt
541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc
601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga
661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc
721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac
781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt
901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc
961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga
1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc
1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac
1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt
1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc
1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct
1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta
1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta
1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact
1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac
1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt
1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc
1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag
1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc
1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa
1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca
1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg
2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct
2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag
2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct
2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga
2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg
2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca
2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac
2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc
2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct
2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt
2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag
2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa
2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat
2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct
2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag
2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct
3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt
3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag
3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt
3181 ttctccaaat
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
1 and 2.
ACCESSION M32476 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 1 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2238)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene and analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept 704 767 carcinoembryonic antigen-related protein
(CGM1), exon 1
1770 + 2129 carcinoembryonic antigen-related protein, exon
2
pre-msg 512 > 2238 CGM1 mRNA and introns
IVS 768 1769 CGM1 intron A
IVS 2130 > 2238 CGM1 intron B
BASE COUNT 615 a 532 c 539 g 552 t
ORIGIN
1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
2221 tgcattgatc tacatgtt
//
LOCUS RATCGM1AC2 539 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32477 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 2 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 539)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 539 carcinoembryonic antigen-related protein
intron B
BASE COUNT 126 a 127 c 138 g 148 t
ORIGIN
1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS RATCGM1AC3 828 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32478 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 3 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 828)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 828 carcinoembryonic antigen-related protein
intron B
BASE COUNT 193 a 255 c 190 g 190 t
ORIGIN
1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS RATCGM1AC4 642 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 3.
ACCESSION M32479 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 4 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 642)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 61 + 420 carcinoembryonic antigen-related protein
(CGM1), exon 3
pre-msg < 1 > 642 CGM1 mRNA and introns
IVS < 1 60 CGM1 intron B
IVS 421 > 642 CGM1 intron C
BASE COUNT 151 a 160 c 154 g 177 t
ORIGIN
1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS RATCGM1AC5 616 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 4.
ACCESSION M32480 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 5 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 616)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 90 + 449 carcinoembryonic antigen-related protein
(CGM1), exon 4
pre-msg < 1 > 616 CGM1 mRNA and introns
IVS < 1 89 CGM1 intron C
IVS 450 616 CGM1 intron D
BASE COUNT 152 a 153 c 147 g 164 t
ORIGIN
1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
601 ttgtctgatt cccttc
//
LOCUS RATCGM1AC6 654 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron 4.
ACCESSION M32481 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 6 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 654)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 654 carcinoembryonic antigen-related protein
intron D
BASE COUNT 185 a 146 c 155 g 168 t
ORIGIN
1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS RATCGM1AC7 492 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 5.
ACCESSION M32482 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 7 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 492)
AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 118 + 471 carcinoembryonic antigen-related protein
(CGM1), exon 5
pre-msg < 1 > 492 CGM1 mRNA and introns
IVS < 1 117 CGM1 intron D
IVS 472 492 CGM1 intron E
BASE COUNT 134 a 134 c 105 g 119 t
ORIGIN
1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
481 tctcaaggtc tc
//
LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 6.
ACCESSION M32483 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 8 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1341)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 640 / 987 carcinoembryonic antigen-related protein
(CGM1), exon 6
pre-msg < 1 > 1341 CGM1 mRNA and introns
IVS < 1 639 CGM1 intron E
IVS 988 1341 CGM1 intron F
BASE COUNT 357 a 329 c 321 g 334 t
ORIGIN
1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
1321 atccgttgcc tgacggtcga c
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
1 and 2.
ACCESSION M32476 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 1 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2238)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene and analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept 704 767 carcinoembryonic antigen-related protein
(CGM1) precursor, exon 1
1770 + 2129 carcinoembryonic antigen-related protein, exon
2
sigp 704 767 carcinoembryonic antigen-related protein
(CGM1) signal peptide
1770 1804 carcinoembryonic antigen-related protein
signal peptide
matp 1805 + 2129 carcinoembryonic antigen-related protein
pre-msg 512 > 2238 CGM1 mRNA and introns
IVS 768 1769 CGM1 intron A
IVS 2130 > 2238 CGM1 intron B
BASE COUNT 615 a 532 c 539 g 552 t
ORIGIN
1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
2221 tgcattgatc tacatgtt
//
LOCUS RATCGM1AC2 539 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32477 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 2 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 539)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 539 carcinoembryonic antigen-related protein
intron B
BASE COUNT 126 a 127 c 138 g 148 t
ORIGIN
1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS RATCGM1AC3 828 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32478 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 3 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 828)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 828 carcinoembryonic antigen-related protein
intron B
BASE COUNT 193 a 255 c 190 g 190 t
ORIGIN
1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS RATCGM1AC4 642 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 3.
ACCESSION M32479 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 4 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 642)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 61 + 420 carcinoembryonic antigen-related protein
(CGM1), exon 3
matp + 61 + 420 carcinoembryonic antigen-related protein
pre-msg < 1 > 642 CGM1 mRNA and introns
IVS < 1 60 CGM1 intron B
IVS 421 > 642 CGM1 intron C
BASE COUNT 151 a 160 c 154 g 177 t
ORIGIN
1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS RATCGM1AC5 616 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 4.
ACCESSION M32480 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 5 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 616)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 90 + 449 carcinoembryonic antigen-related protein
(CGM1), exon 4
matp + 90 + 449 carcinoembryonic antigen-related protein
pre-msg < 1 > 616 CGM1 mRNA and introns
IVS < 1 89 CGM1 intron C
IVS 450 616 CGM1 intron D
BASE COUNT 152 a 153 c 147 g 164 t
ORIGIN
1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
601 ttgtctgatt cccttc
//
LOCUS RATCGM1AC6 654 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron 4.
ACCESSION M32481 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 6 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 654)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 654 carcinoembryonic antigen-related protein
intron D
BASE COUNT 185 a 146 c 155 g 168 t
ORIGIN
1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS RATCGM1AC7 492 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 5.
ACCESSION M32482 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 7 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 492)
AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 118 + 471 carcinoembryonic antigen-related protein
(CGM1), exon 5
matp + 118 + 471 carcinoembryonic antigen-related protein
pre-msg < 1 > 492 CGM1 mRNA and introns
IVS < 1 117 CGM1 intron D
IVS 472 492 CGM1 intron E
BASE COUNT 134 a 134 c 105 g 119 t
ORIGIN
1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
481 tctcaaggtc tc
//
LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 6.
ACCESSION M32483 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 8 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1341)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 640 / 987 carcinoembryonic antigen-related protein
(CGM1), exon 6
matp + 640 / 987 carcinoembryonic antigen-related protein
pre-msg < 1 > 1341 CGM1 mRNA and introns
IVS < 1 639 CGM1 intron E
IVS 988 1341 CGM1 intron F
BASE COUNT 357 a 329 c 321 g 334 t
ORIGIN
1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
1321 atccgttgcc tgacggtcga c
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AA 3190 bp ss-mRNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) mRNA, complete
cds.
ACCESSION M32474 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SOURCE R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation,
cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3190)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analysis imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept 122 2251 carcinoembryonic antigen-related protein
precursor (CGM1)
sigp 122 220 carcinoembryonic antigen-related protein
signal peptide
matp 221 2248 CGM1 protein
BASE COUNT 871 a 798 c 693 g 828 t
ORIGIN
1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta
61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca
121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct
181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc
241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga
301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact
361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt
421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac
481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt
541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc
601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga
661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc
721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac
781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt
901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc
961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga
1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc
1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac
1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt
1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc
1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct
1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta
1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta
1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact
1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac
1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt
1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc
1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag
1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc
1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa
1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca
1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg
2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct
2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag
2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct
2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga
2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg
2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca
2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac
2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc
2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct
2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt
2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag
2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa
2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat
2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct
2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag
2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct
3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt
3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag
3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt
3181 ttctccaaat
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM4AA 4627 bp ds-DNA ROD 05-MAR-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM4) gene, exons 2
and 3.
ACCESSION M32475 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SOURCE Rat (SPrarue-Dawley) liver DNA, clone lambda-rnCGM415-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4627)
AUTHORS Zimmermann,W.
JOURNAL Unpublished (1990) Univ. of Freiburg, D7800 Freiburg FRG
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept / 145 489 carcinoembryonic antigen-related protein
(CGM4), exon 2 (AA at 147)
3379 / 3738 carcinoembryonic antigen-related protein exon
3
pre-msg < 1 > 4627 CGM4 mRNA and introns
IVS < 1 144 CGM4 intron B
IVS 490 3378 CGM4 intron C
IVS 3739 > 4627 CGM4 intron D
BASE COUNT 1213 a 1165 c 1162 g 1087 t
ORIGIN
1 agatctgggt cgacctgcag gtcaacggat ctgggcctta gcaggagtgt gggcagagct
61 ctgggaaggc agaagtgtga ttttttaaaa aaccaacaga tttcacctgc tcaatatcga
121 tggttgctct gtcttccctt ttagcctccc ttctaacctg ttggctcctg actactgccc
181 aggtcaacat tgaatcggtg ccattcaatg tggttgaagg ggaaaacgtc cttcttcttg
241 tccacaatct gccagagaat ctcatagcct ttgcctggta tagagggctg aggaaaattg
301 gagtatacat actgaacact gaagtaagtg tgacggggcc aatgtacagc ggtagagaga
361 cagtgtacag caatggttcc ctgtgtatcc gcaatgtcac ccagaaggac acaggattct
421 acactctacg aacagtcaac acacgtggag aaactgtatc aacaacatcc ttgtacctct
481 atgtgtacag taagtgatac tttgtgaact ctgggtgttg tgtggggttc attccgtaga
541 cacacacaga agaggcaggc ctacctaccc tttgcattgt gtctccttat tgaggtgtga
601 acatttaact caggctaagg agagtaatgc caattgaata gaatccttct tttgacttta
661 ccttgtagtc agctggatgt gtggttaact cagtgaagga catcagccct tgtctagact
721 tctggggttc ttagcagtaa tgtgtccttg ggaaagacct tgagggaagg agattgggtt
781 tgaatgagat agccatagga tcctcatgga agtgagaacc agaaagccct ggctccagac
841 ctctgtcctg actcatctcc tgatggcccc gagaagcatt ttacaaaggc tggattctga
901 catctgttgg cagggaacag tgcttttgag gagcaaatcc ttgtgccaca tacaatcacc
961 tggtgcacgg ccatgagagc cacagttagg cgaggtctcc tggatctctc cagtgactca
1021 tcagggagag aatagaaaga cagatgtccc ggccactaag ttaactgtta tgatggcctt
1081 atgagacttc caggaaggtc atggttgcca ggaagaggga caaaggacac agatccccct
1141 gacagttgct tgtcctttgg ggtccagctc atagaagtct gtccgcaggc aaatgacacc
1201 aggctctgct gatgtggata gctccccaga tctgagctgc agttctccca gcgatcacga
1261 gggccgcctc agggaaacac aattaacacc cagaagagta tttgtctaaa ccaggaactt
1321 acctcctcct ctggctagct cccctgttcc tacagacatg ggggtcacac agccttctca
1381 gacctaccag ctgcctcctt ttctgctgcc ttgctaggga attatgtgta gtggctgctt
1441 tgtgtatttt ctttggaaaa gatagagtat cctaagggaa tcacccagac agaggttcaa
1501 ggcatctctg aaaggccagg cagcacatgg cagagccacc tcacagctca ggacccagag
1561 gaagtgtgcc caccatcttg aatccatgca tgggacgatg gagcccagag ctacgttcca
1621 ggactcaggt cacctcccac acactcaaga agtgaggctc ctgacacagc tgctcctggg
1681 ccccttttct ccctgagaat cctgactggt ggctgcagtg agaacacatc tgtcccctcc
1741 cccactcgtc acacagctgg cccttgggat cctcacacac atctctgtct ccttcctcct
1801 gagagcaaac tacctctttg acgggcactg agaacacagg gcagactggg tgcccagctg
1861 gttctgggtc acccagggag tgcagaggct cactcactgg tgctgactga gccaggaaga
1921 ggccagaaca gagggatgcc ccccgggtga gctgctgtct tcttagggca cagagatgct
1981 cagaggtttg tttgtcactg tgagctctgt ggcatgagac agaaagagcc cagaggagag
2041 gttaggtgtg taggactgag tgtgcacagg gcagagaaca gagttaccca cagcccacgg
2101 gactctggga tatgatcctg tctggcggag gctgagctca gaggatcaga gaacttggga
2161 gctgtattgg agcagatgtg ctacagactg aggacagatc tggccacaga gaccagggcg
2221 gtgctctgta ccatctgcaa acaatgcccc acctgttggt gctcctgctc acagatgagg
2281 agaccacatt ttacagtgtg tgagaggaga ggactcacct actgtctaaa gtctcttcaa
2341 ggggacaggg actggagaag agtttcaggt ttgtagggct gaaaacacta aagtataggg
2401 gctcatcatc atcatcatca ccaccgccat caccaccacc accaccacca ccaccaccac
2461 caccaccacc accaccacca ccatcatcat catcatcatg aggctcttgg taaataagaa
2521 gaagcagggg gaggaggaga ttattgtcaa cccacagttc accatcaatg agcccagtgt
2581 tctgaagact gaggttctca gctgtgatgc cccaaataag aaaccaagct ggtgttgatc
2641 agtgacatgg ctcagtggat ctgggtgttt gcttcatgtc tgacaacctg agaaccagtg
2701 aacacaagtt gtccctgacc tccacctagg gacggcgttt tgcacccaac acagacacac
2761 tgaggcatgc ccttgcacat gaactcatac accaatataa taagcaaatg cataaaaatt
2821 atagcaaatg gaagcagtca acactgtatt cccaaacata ctaatttgtt aaataaatcc
2881 atggccatgt attcattcat tcattcattc actcattcat ttactctcca agatatttga
2941 gttttctttt gcagtctttt ttttttaaaa gataatataa gacaaatccc agttctcatt
3001 attccctagc cctagactgg aagacgacca gtgaagaaag ctagaaggcg aatcagtcac
3061 taaaggacaa gaaacaaaag agtcagagtg tgacggtcgg gaggcttcac cccaacaccc
3121 atcgactgac actgagggtg agcagggatc tgaggacggt gaggcagggc catgttgaca
3181 cctgaggaga gagcagcata gagaggaaat gacaagtgag gggcgcggag tgcatggagg
3241 taatgcactg acctccacta gctagggcag ggagactccc acacctcagc tgaccactgg
3301 acacagctgc tcggactcag gcaccatctt agccaaatac taaagtcctg atgttgacgg
3361 atctctcttc ccttctagcc tctcttttca tctgtgggcg tccttttaac cctgccaagc
3421 tcactattga atcagtgccg cccagtgttg ctgaaggggg aagcgttctt ctcctcgttc
3481 acaatctcca ggacgagctt cgagggtttt tctggtacaa aggggcgtct atgtctagca
3541 accatgagat agcccgatac agaacagcaa agaattcaag tgtgccaggc cctgcccaca
3601 gtggtagaga gacggtgtac agcaatggat ccctcctgct ccagaatgtc acccggaatg
3661 acactgggtt ctacacccta cgcactctga aaagacatca gaaaatggaa ttggcacacg
3721 tgcaacttca ggtggacagt aagtgatttt ccgtgatcgt tcagtgctgg gtgggtcttt
3781 gacacacagg actgtcaccc ctggcatgtg gctacctcct ctctgccttt ttatccccat
3841 gttgtggtta accactatgt gcaggacaca tgtgatggaa agaaatgccc atgggtcaga
3901 cttatcatct gactctcccc tgtatcaagg acagtaactc aaccctaggt gctagactct
3961 gcccagtcat ctggggcatc ttgccatgca acgtgaggaa accatggatc ctcacagcgt
4021 ggtgagcacc aggaagctct gatctcagtc gtttgtccca gacttgactg caaatgtctc
4081 taggagcatt ttgtcaggag tgctgcttac tgcctctctc ctcacagcct gccatcctga
4141 tcttatagta acccaggaca ctgagcccag gggtgaaaat gctcccagtt gggctgggct
4201 ctcccacatc aatcaccaat taaaaatgta ctacaggtta gcccacaggt tattttggtg
4261 gtggcatttt aaattgaggc ccttgtttca aaaaattcta gcttgtgtta agttgacata
4321 aagccagcag cacgattcct gagccctccc caatacctat atctgccaag aagaccagac
4381 tgttcccacc catcatccgg ccttagtcct gggtgctata ggctgggacg tgagaacatg
4441 tggaatgtga agtctgagga tgaccgcagg tacaaaggag atgagaaagt cagagagtgt
4501 gtatccaggg tgtgtagaga ccaaaggtca ggggaggcat catcccaaag cacagtgtgc
4561 atgagtatgt gcaatgtctg aatgagggca gtgagggaca gccacggaga caccaaggac
4621 agagctc
//GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS YSCSEN2X 2813 bp ds-DNA PLN 26-FEB-1990
DEFINITION S.cerevisiae tRNA splicing endonuclease beta-subunit (SEN2) gene,
complete cds.
ACCESSION M32336
KEYWORDS tRNA splicing endonuclease beta-subunit.
SOURCE S.cerevisiae (strain S288C) YCp50 library of Rose DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2813)
AUTHORS Ho,C.K., Rauhut,R. and Abelson,J.
TITLE Saccharomyces cerevisiae SEN2 gene encodes the middle subunit of
tRNA splicing endonuclease
JOURNAL Unpublished (1990) Cal. Tech., Div. of Bio., Pasadena, CA 91125
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.K.Ho, 23-FEB-1990. Dr. Ho relayed updated title information
by phone on 09-APR-1990.
FEATURES from to/span description
pept 520 1653 tRNA splicing endonuclease beta-subunit
BASE COUNT 927 a 522 c 539 g 825 t
ORIGIN
1 aaaattttct tcaagatgca agtttcccat tcagttctat aggaagttaa aacaaaagag
61 ctgtttgaga gctcgtaggc tttggttgtt aagagaacgg actaaataac accaggaggt
121 gtgtctagac ttagttgaac gagttttttc tcgtgctatt atatctgttc ttaatacttc
181 cactttattt taatgatgat agcatgattt atgttgctcg aagttaatcg atgtgagcac
241 aatgatttct caagactaca acggtatctg aattttattt cttgtcaaaa atgttttata
301 gaagccactc ttgctataga tttatattgc cacacataac tttttcacgt aaatttgaaa
361 gagctatatt agctggtgct ttcactaaca tctctttgct ctagtatttc atttcggtgc
421 ataaaacatt tgaacttcca tcgaaattca atatgctaaa acgtttcatt gtaacacgac
481 gctgaaaaga caataagtac gaaggactca agaaggagca tgtctaaagg gagggtcaat
541 cagaagcgtt acaagtatcc tcttccaatt caccccgtag atgaccttcc agagttaatc
601 cttcataatc cgttatcttg gctatattgg gcataccgct attacaagag tacgaacgca
661 ctgaacgata aagtacatgt agattttata ggagatacca ctcttcacat cacggttcaa
721 gatgacaagc aaatgctata tctttggaat aatggatttt ttggcactgg tcaatttagc
781 aggagtgagc ctacatggaa agctagaaca gaggccagac tgggtctcaa tgatactccc
841 ctccacaatc ggggaggaac aaagagtaat actgaaacgg agatgacttt agagaaggtc
901 acacaacaga ggagattaca aagattagag ttcaagaaag aacgtgcgaa gttagaaaga
961 gaattactag aattaagaaa aaaaggtggt catatcgatg aggaaaacat cctccttgag
1021 aaacaacgag aatcattaag aaaatttaaa ctaaaacaaa cagaagacgt gggcatcgtt
1081 gcacaacagc aagacatttc cgaatcaaat ttaagagatg aagacaacaa tcttcttgat
1141 gagaacggtg atcttttgcc actagagtca ctagaactaa tgccagtgga agctatgttt
1201 ttaacttttg cacttcctgt tcttgacata tctcccgcgt gcctggcagg gaaactgttt
1261 caattcgatg ccaaatataa agatattcac tcctttgtca gatcatacgt tatataccat
1321 cactacagat cacacggttg gtgcgtaaga tctggtataa aattcggctg cgattattta
1381 ttatataaga gagggccacc atttcaacac gctgaatttt gtgttatggg tcttgaccac
1441 gacgtctcta aagattatac gtggtattct agcatagccc gtgttgtggg cggcgcaaag
1501 aagacgtttg tgttatgcta cgtggagaga ctaatctcgg aacaggaggc gatagcacta
1561 tggaaatcaa acaactttac taaattgttc aatagttttc aagttggtga agtattgtat
1621 aagagatggg ttcccggaag aaatagagac tagaatatca atgagaagat aaatccttcg
1681 tatgtatata tgtatgtgta tgtatttttt atacagattt caatggtttt tgtaagcatc
1741 tgatacccat ccacaatccc taaacccctt gttctttgct ttggggccct tgacagggtc
1801 atcatatatt gctctaaact tttcattgtg tgtggctggt ttggatatgc aaacaaattt
1861 attatcagga aggacacatt tcaactgaga tttagggaag ggacaaggac agtctttctg
1921 ctgtgcaacg cacgttaggg tgtccggaca caaataccca tcacaagggt tatttaaaac
1981 ttgatcttca tatgattgtt gctgttgctg ttgctgttgc tgatgatgac cgaaattgaa
2041 gaaggcattc actccagtgc tgccaaaaat aaacaatgca aagatgagaa cgatactcca
2101 gcgactttga ctcatactac agtaaaattt ggtcgttttt tgcctggcaa accatgccta
2161 tttttcgtca ttaaaggtct tctgagcttt catcaaaatt atctctataa ttataaagga
2221 cagctcagag ccacacccaa acatctgcaa aggtaaaggc catctttcat ctcacattgc
2281 ctaacaagaa agtcacattg caattgtttc gagaagaaaa cattaattca attacgtggt
2341 tcccatgcgt ctgtgttaag cgtcaattta tcatttgacg aactcgttgc aaccacaaac
2401 aaagcaaggg ccgcatttat caaaggaaaa ccaaccaaaa gcataaataa acgagatttt
2461 tattatgaaa aagccatgcg aatcctacta gaataaattc aaagtgaaga catttagtca
2521 attcgatctt ctctctcttg ctttaactag tcgtaaaacc taataagttt cattcacgac
2581 gcgtattttt ttttcaacat atcatctaac gcgttttaag aaatttcagt aaataatgaa
2641 gctgtagtaa tgccaacaat ccattgtggc tactttatac aacgcactat agagcaataa
2701 aagtgtagca ataaacacta gagagaaggc acataataac aagaatatac tcacgcacat
2761 gtattatgga atcagccagt ttagcgaacc gtacaacaaa atcttaagga att
//GenBank-Updates@genbank.bio.net (05/25/90)
LOCUS FCLCAPS 2493 bp ss-RNA VRL 25-MAY-1990
DEFINITION Feline calicivirus capsid protein gene (put.), complete cds.
ACCESSION M32819
KEYWORDS capsid protein.
SOURCE Feline calicivirus (CFI/68 FIV), cDNA to viral RNA, clone pCV[2,7].
ORGANISM Feline calicivirus
Viridae; ss-RNA nonenveloped viruses; Caliciviridae.
REFERENCE 1 (bases 1 to 2493)
AUTHORS Neill,J.D.
TITLE Nucleotide sequence and expression of the capsid protein gene of
Feline calicivirus
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.D.Neill, 14-MAR-1990. Pioneer Hi-Bred International, 7300 NW
62nd Ave., Johnston, IA 50131
FEATURES from to/span description
pept 127 2133 capsid protein ORF (put.)
pept 2130 2450 capsid protein ORF (put.)
BASE COUNT 666 a 593 c 531 g 703 t
ORIGIN
1 gcccccttct tacagcacag ccttggagca ttacaacagc caatttaatg gtgtggaggc
61 gcggactgac cagatcgata cgagtggcat ggccgccctt cactgtgatg tgttcgaagt
121 ttgagcatgt gctcaacctg cgctaacgtg cttaaatact atgattggga tcctcacatc
181 aaattggtaa tcaaccccaa caaatttcta catgttggct tctgcgataa ccctttaatg
241 tgttgttatc ctgaattact acctgaattt ggcaccatgt gggattgtga tcaatcgcca
301 ctccaagtct accttgagtc aatcctgggt gatgatgaat ggtcctccac tcatgaagca
361 attgacccag ttgtgccacc aatgcattgg gatgaagccg gaaaaatctt ccaaccacac
421 cctggcgtcc ttatgcatca cctcatctgt aaggttgcag aaggatggga cccaaacctg
481 ccacttttcc gcttggaagc ggacgatggt tccatcacga cacctgaaca gggaacaatg
541 gttggtggag tcattgctga gcccaacgcc caaatgtcaa ccgcagctga catggccact
601 gggaaaagtg tggactctga gtgggaagcc ttcttctcct ttcacactag tgtgaactgg
661 agcacatctg aaactcaggg gaagatactc tttaaacaat ccttaggacc attgctcaac
721 ccctacctta cccatcttgc aaagctgtat gttgcttggt ctggttctgt tgatgttagg
781 ttttctattt ctggatctgg tgtctttgga gggaaattag ctgctattgt tgtgccgcca
841 ggaattgatc ctgttcaaag tacttcaatg ctgcaatatc ctcatgtcct ctttgatgct
901 cgtcaagttg aacctgttat cttttccatt cccgatctaa gaagcacctt atatcacctt
961 atgtctgaca ctgataccac atcgttggta atcatggtgt acaatgatct tattaacccc
1021 tatgctaatg actcaaactc ttcgggctgc attgtcactg tggaaactaa accggggcca
1081 gatttcaagt ttcacctctt aaaacctcct gggtctatgt taactcacgg atctatccca
1141 tctgatctaa tcccaaaatc atcttcgctt tggattggaa atcggttttg gtctgacata
1201 accgattttg taattcggcc ttttgtgttc caggcaaatc gacactttga tttcaaccaa
1261 gagacagcag gttggagcac cccaaggttt cgcccaatta ctatcactat cagtgttaag
1321 gagtcagcaa agcttggtat tggagtggcc accgactaca ttgttcccgg cataccagat
1381 ggatggcccg acacaacaat cccaggtgag ttggtacctg ttggtgacta tgccatcact
1441 aatggcacca acaatgatat caccacagct gcgcagtacg atgcagccac tgagattaga
1501 aacaacacca atttcagagg catgtacatt tgtggttctc ttcaaagagc ttggggggat
1561 aagaagattt caaatactgc ttttatcaca accggcacgg ttgatggagc caaattgata
1621 cccagtaata ccattgacca aacaaaaatt gccgtattcc aagacacaca tgcgaataag
1681 catgtccaga cctcggacga cacattggcc ctgcttggtt atactggtat tggtgaggaa
1741 gcaattggtg ctgaccgcga tagagttgtg cgaattagcg tcctcccgga acgtggcgca
1801 cgtggtggca atcacccaat cttccacaaa aactctatca agcttggtta tgtaattagg
1861 tccattgatg tgttcaattc tcaaattctg catacctcta ggcaactttc cctcaatcat
1921 tacttattgt cgcctgactc ctttgctgtc tataggatta ttgactctaa tggatcctgg
1981 tttgacatag gcattgataa tgatggattt tcttttgttg gtgtatcaag tattggtaaa
2041 ttagagtttc ctttaactgc ctcctacatg ggaattcaat tggcaaaaat tcgacttgcc
2101 tctaacatta ggagtgtgat gacaaaatta tgaattcaat tttgggctta attgacactg
2161 tcacgaacac aattggcaaa gctcaacaaa tcgaattgga taaggctgca cttggtcagc
2221 aacgcgagct ggcactccaa cgtatgaact tggatcgcca ggctctaaat aatcaagtgg
2281 agcaatttaa caaactgctt gagcagaggg tacaaggccc aatccaatct gtgcgcctgg
2341 cacgcgcagc tggtttcagg gtcgaccctt actcatacac aaatcaaaac ttttatgacg
2401 atcaattaaa tgcaatcaga ctatcatata gaaatttgtt caagaattga tcacttaacc
2461 ctttgggtgc cgcacttgcg cctaacccca ggg
//
LOCUS RIRSTA56 2280 bp ds-DNA BCT 25-MAY-1990
DEFINITION R.tsutsugamushi scrub typhus antigen 56 (sta 56) gene, complete
cds.
ACCESSION M33004
KEYWORDS antigen; outer membrane protein; scrub typhus antigen 56.
SOURCE R.tsutsugamushi (strain Karp) DNA, clone pRTS56H2.3.
ORGANISM Rickettsia tsutsugamushi
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE 1 (bases 1 to 2280)
AUTHORS Stover,C.K., Marana,D.P., Carter,J.M., Roe,B.A., Mardis,E. and
Oaks,E.V.
TITLE The 56 kilodalton major protein antigen of Rickettsia
tsutsugamushi: Molecular cloning and sequence analysis of sta 56
gene, and precise identification of a strain specific epitope
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.V.Oaks, 16-MAR-1990. Walter Reed Institute, Dept. of Enteric
Infections, Walter Reed Army Institute of Research, Washington,
D.C. 20307
FEATURES from to/span description
pept 556 2154 scrub typhus antigen 56 precursor (sta 56)
sigp 556 621 scrub typhus antigen 56 signal peptide
matp 622 2151 scrub typhus antigen 56
signal 77 82 -35 region
signal 99 103 -10 region
binding 545 550 ribosome binding site
BASE COUNT 764 a 319 c 489 g 708 t
ORIGIN
1 aagcttgttc attttttatg tgggctaatt ttagataatg caatgttagt ataattatgt
61 ggttaattaa tgtatcttga tttaagattt tatataaata taataagatt tatgtagggc
121 ttaattatta gcttaaaaaa ctgttgctat tttagctaaa aataaaagtt tgggcaagaa
181 aaattattaa taattgaagg tagttgttgc gtaaaaagct gtgttatgct atctaaggtt
241 aaatgtagca aggtgctaat agataattaa tgtattttcg aacgtgtctt taagctatat
301 ataagagcag tattctattg aatattgttt ctaagtatat aaaaaataaa aatacatttt
361 acaattgata aaacgctttg agcacatttt taacacagtg ttttatagat tgtttaaatt
421 attttacaag tactattaaa tattagtata ctaaataata gttttttgat ataaaactaa
481 agttagtgtg gctaaataat tagtttagaa tggttaccac taaaaaataa atttaattct
541 tttaaggaga ttagaatgaa aaaaattatg ttaattgcta gtgcaatgtc tgcgttgtcg
601 ttgccatttt cagctagtgc aatagaattg ggggaagaag gattagagtg tggtccttat
661 gctaaagttg gagttgttgg aggaatgatt actggcgtag aatctgctcg cttggatcca
721 gctgatgctg aaggcaaaaa acacttgtca ttaacaaatg ggctgccatt tggtggaacg
781 ttggctgcag gtatgacaat cgctccagga tttagagcag agataggtgt tatgtacctt
841 acaaatataa ctgctcaggt tgaagaaggt aaagttaagg cagattctgt aggtgagaca
901 aaggcagatt ctgtaggtgg gaaagatgct cctatacgta agcggtttaa acttacacct
961 cctcagccta ctataatgcc tataagtata gctgtacgtg actttgggat tgatattcct
1021 aaccagacct cagcagcaag cacaagccgc agcctcaggc ttaatgatga gcaacgtgct
1081 gcagctagga tcgcttggtt aaagaattgt gctggtattg actatagggt aaaaaaccct
1141 aatgatccta atgggcctat ggttataaat ccgatattgt taaatattcc acagggtaac
1201 cctaatcctg ttggaaatcc accgcagcga gcaaatccgc ctgcaggttt tgcgatacat
1261 aaccatgagc aatggaggca tttggtagtt gggcttgctg cattatcaaa tgctaataaa
1321 cctagcgctt ctcctgtcaa agtattaagt gataaaatta ctcagatata tagtgatata
1381 aagcatttgg ctgatatagc tggtattgat gttcctgata ctagtttgcc taatagtgca
1441 tctgtcgaac agatacagaa taaaatgcaa gaattaaacg atctattgga agagctcaga
1501 gaatcttttg atgggtatct tggtggtaat gcttttgcta atcagataca gttgaatttt
1561 gtcatgccgc agcaagcaca gcagcagggg caagggcagc aacagcaagc tcaagctaca
1621 gcgcaagaag cagtagcagc agcagctgtt aggcttttaa atggcaatga tcagattgcg
1681 cagttatata aagatcttgt taaattgcag cgtcatgcag gaattaagaa agcgatggaa
1741 aaattagctg cccaacaaga agaagatgca aagaatcaag gtgaaggtga ctgcaagcag
1801 caacaaggaa catctgaaaa atctaaaaaa ggaaaagaca aagaggcaga gtttgatctg
1861 agtatgattg tcggccaagt taaactctat gctgacgtaa tgataactga atcagtctca
1921 atatatgctg gtgttggtgc agggttagct tatacttctg gaaaaataga taataaggat
1981 attaaagggc atacaggcat ggttgcatca ggagcacttg gtgtagcaat taatgctgct
2041 gaaggtgtgt atgtggacat agaaggtagt tatatgtact cattcagtaa aatagaagag
2101 aagtattcaa taaatcctct tatggcaagt gtaagtgtac gctataactt ctagtttttt
2161 ctttaaaatt ataaaaaaag cagctaaaag ttctttacag ggtttttagc tgctttttca
2221 gagttttttt ataataataa aaataacttt attctttgct atttaattag cttgaagctt
//
LOCUS HUMTROPI 816 bp ss-mRNA PRI 25-MAY-1990
DEFINITION Human slow-twitch skeletal troponin I (TNN1) mRNA, complete cds.
ACCESSION J04760
KEYWORDS slow-twitch skeletal troponin I; troponin I.
SOURCE Human adult slow-twitch skeletal muscle, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 816)
AUTHORS Wade,R., Eddy,R., Shows,T.B. and Kedes,L.
TITLE cDNA sequence, tissue-specific expression and chromosomal mapping
of the human slow-twitch skeletal muscle isoform of troponin I
JOURNAL Genomics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Wade, 15-MAR-1990.
FEATURES from to/span description
pept 77 640 slow-twitch skeletal troponin I (TNN1)
BASE COUNT 180 a 247 c 244 g 145 t
ORIGIN Chromosome 1q12-qter..
1 tagtctgcag tctacggcga ggcacaggcc agcccagctc cacgaggact gaacaaggtg
61 ctgtctcact gccaccatgc cggaagtcga gagaaaaccc aagatcactg cctcccgcaa
121 actcttgctg aagagcctga tgctggccaa ggccaaggaa tgctgggagc aggagcacga
181 ggagcgcgag gctgagaagg tgcgctacct ggcagagcgc atccccacgc tgcagacccg
241 tggcctgtcc ctcagtgccc tgcaggacct gtgccgggag ctgcacgcca aggtggaggt
301 ggtggatgag gagcgatacg acattgaggc caaatgcctc cacaacacca gggagattaa
361 ggacctgaag ctgaaggtga tggacctccg tgggaagttc aagcgcccgc ccctgcgtcg
421 agtccgtgtc tcggctgacg ccatgctccg ggccctgctg ggctccaagc acaaggtgtc
481 catggatctg cgggccaacc tcaagtctgt gaagaaggaa gacacagaga aggagcggcc
541 tgtggaggtg ggtgactgga ggaagaacgt ggaggccatg tctggcatgg aaggccggaa
601 gaagatgttt gatgccgcca atgctccgac ctcacaatag aggccagctt gctgtgctgc
661 gctctgagct cctgcttcat gcttcttctc caacccagct cactcacctc tctgcctgtg
721 tctggagcat cccttcccac ctctccccca cttcttccct ccagcctgca atgccctcct
781 ctggaactgg gattaaacag atacccaaga ggcagg
//
LOCUS ASOTAAG1 2935 bp ds-DNA PLN 25-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa-G1) gene, complete cds.
ACCESSION M33218
KEYWORDS Taka-amylase A.
SOURCE A.oryzae (strain JCM02239) DNA.
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 2935)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 607 772 Taka-amylase A (Taa-G1) precursor, exon 1
828 868 Taka-amylase A precursor, exon 2
955 1070 Taka-amylase A precursor, exon 3
1140 1248 Taka-amylase A precursor, exon 4
1317 1545 Taka-amylase A precursor, exon 5
1603 1765 Taka-amylase A precursor, exon 6
1830 1976 Taka-amylase A precursor, exon 7
2041 2281 Taka-amylase A precursor, exon 8
2360 2647 Taka-amylase A precursor, exon 9
sigp 607 669 Taka-amylase A signal peptide
matp 670 772 Taka-amylase A
828 868 Taka-amylase A
955 1070 Taka-amylase A
1140 1248 Taka-amylase A
1317 1545 Taka-amylase A
1603 1765 Taka-amylase A
1830 1976 Taka-amylase A
2041 2281 Taka-amylase A
2360 2644 Taka-amylase A
pre-msg 543 > 2789 Taa-G1 mRNA and introns
IVS 773 827 Taa-G1 intron A (no splice consensus)
IVS 869 954 Taa-G1 intron B
IVS 1071 1139 Taa-G1 intron C
IVS 1249 1316 Taa-G1 intron D
IVS 1546 1602 Taa-G1 intron E
IVS 1766 1829 Taa-G1 intron F
IVS 1977 2040 Taa-G1 intron G
IVS 2282 2359 Taa-G1 intron H
signal 2784 2789 poly-A signal
BASE COUNT 818 a 752 c 657 g 708 t
ORIGIN
1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact
61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca
121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca
181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag
241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat
301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat
361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt
421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc
481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg
541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag
601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct
661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat
721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt
781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg
841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg
901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac
961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc
1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat
1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga
1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt
1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct
1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct
1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact
1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct
1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga
1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc
1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg
1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt
1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca
1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt
1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca
1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca
1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa
1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag
2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg
2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa
2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc
2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa
2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga
2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg
2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga
2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt
2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg
2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag
2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag
2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga
2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca
2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc
2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg
//
LOCUS ASOTAAG2A1 197 bp ds-DNA PLN 25-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 5' end.
ACCESSION M33220
KEYWORDS Taka-amylase A.
SOURCE A.oryzae (strain JCM02239) DNA.
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 197)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 195 > 197 Taka-amylase A (Taa-G2) precursor
pre-msg 131 > 197 Taa-G2 mRNA and introns
signal 2 11 CAAT box
signal 95 100 TATA box
BASE COUNT 59 a 52 c 36 g 50 t
ORIGIN
1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa
61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg
121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc
181 cacagaaggc atttatg
//
LOCUS ASOTAAG2A2 198 bp ds-DNA PLN 25-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 3' end.
ACCESSION M33222
KEYWORDS Taka-amylase A.
SOURCE A.oryzae (strain JCM02239) DNA.
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 198)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 9 Taka-amylase A (Taa-G2) (AA at 1)
pre-msg < 1 151 Taa-G2 mRNA and introns (alt.)
pre-msg < 1 156 Taa-G2 mRNA and introns (alt.)
BASE COUNT 45 a 48 c 53 g 52 t
ORIGIN About 2.1 kb after segment 1.
1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc
181 gtcctccaga ggcgagga
//
LOCUS ASOTTAM1 191 bp ss-mRNA PLN 25-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 3' end.
ACCESSION M33219
KEYWORDS Taka-amylase A.
SEGMENT 1 of 2
SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 191)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 189 > 191 Taka-amylase A (Taa)
mRNA 125 > 191 Taa mRNA
BASE COUNT 56 a 52 c 32 g 51 t
ORIGIN
1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca
61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt
121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga
181 aggcatttat g
//
LOCUS ASOTTAM2 156 bp ss-mRNA PLN 25-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 5' end.
ACCESSION M33221
KEYWORDS Taka-amylase A.
SEGMENT 2 of 2
SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 156)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 9 Taka-amylase A (Taa) (AA at 1)
mRNA < 1 151 Taa mRNA (alt.)
mRNA < 1 156 Taa mRNA (alt.)
BASE COUNT 40 a 35 c 42 g 39 t
ORIGIN About 1.5 kb after segment 1.
1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
121 ggcaccgggc cccgactgag agcccgtctc gaatcc
//
LOCUS CIPNADGAPD 1354 bp ss-mRNA PLN 25-MAY-1990
DEFINITION M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH)
mRNA, complete cds.
ACCESSION J05223
KEYWORDS D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase;
glyceraldehyde-3-phosphate dehydrogenase.
SOURCE M.crystallinum, cDNA to mRNA.
ORGANISM Mesembryanthemum crystallinum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Aizoaceae.
REFERENCE 1 (bases 1 to 1354)
AUTHORS Ostrem,J.A., Vernon,D.M. and Bohnert,H.J.
TITLE Increased expression of a gene coding for NAD:glyceraldehyde-3-
phosphate dehydrogenase during the transition from C-3
photosynthesis to crassulacean acid metabolism in Mesembryanthemum
crystallinum
JOURNAL J. Biol. Chem. 265, 3497-3502 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 55 1068 glyceraldehyde-3-phosphate dehydrogenase (EC
1.2.1.12)
BASE COUNT 315 a 308 c 345 g 386 t
ORIGIN
1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct
61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc
121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac
181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag
241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg
301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga
361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt
421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac
481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg
541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc
601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg
661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt
721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg
781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct
841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat
901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga
961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac
1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga
1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc
1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt
1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta
1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc
1321 agcatttatg aaacattgct gtgagctttt gagt
//
LOCUS ECOPRIA 2658 bp ds-DNA BCT 25-MAY-1990
DEFINITION E.coli primosomal protein n' (priA) gene, complete cds, and cytR
gene, 5' end.
ACCESSION M33293
KEYWORDS cytR gene; priA gene; primosomal protein.
SOURCE E.coli (strain W3110) DNA, clone pEL042.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2658)
AUTHORS Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A.
TITLE The priA gene encoding the primosomal, replicative n' protein of
E.coli
JOURNAL Unpublished (1990) Stanford U., Beckman Cntr., Stanford, CA 94305
STANDARD full automatic
COMMENT Authorin sequence for [1] kindly submitted by G.C.Allen Jr.,
26-MAR-1990.
FEATURES from to/span description
pept 64 2262 primosomal protein n'
pept 2418 > 2658 cytR gene product
signal 27 32 -10 region
signal 5 10 -35 region
binding 53 59 ribosome binding site
BASE COUNT 578 a 738 c 756 g 586 t
ORIGIN
1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat
61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg
121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag
181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag
241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga
301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc
361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt
421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa
481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa
541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa
601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg
661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc
721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa
781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg
841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc
901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt
961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt
1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac
1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc
1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt
1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg
1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac
1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt
1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac
1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc
1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg
1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg
1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca
1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg
1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc
1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac
1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa
1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg
1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg
2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg
2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg
2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt
2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg
2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg
2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg
2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg
2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata
2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac
2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc
2641 cggatatctg cgatcccc
//
LOCUS EWCTELRNA 657 bp ds-DNA INV 25-MAY-1990
DEFINITION E.crassus telomerase RNA component gene, complete cds.
ACCESSION M33461
KEYWORDS telomerase RNA.
SOURCE E.crassus DNA.
ORGANISM Euplotes crassus
Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora;
Spirotricha; Heterotrichida; Clevelandellina.
REFERENCE 1 (bases 1 to 657)
AUTHORS Shippen-Lentz,D. and Blackburn,E.H.
TITLE Functional evidence for an RNA template in telomerase
JOURNAL Science 247, 546-552 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
RNA 152 342 telomerase RNA component
site 186 197 functional telomeric template
BASE COUNT 202 a 122 c 111 g 222 t
ORIGIN
1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga
61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt
121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg
181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg
241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt
301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg
361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt
421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat
481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga
541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac
601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt
//
LOCUS HS4DWXJ 160 bp ds-DNA VRL 25-MAY-1990
DEFINITION Epstein-Barr virus defective WZhet junction.
ACCESSION M33474
KEYWORDS .
SOURCE Epstein-Barr virus (strain HR-1, clinical sample 9) DNA.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 160)
AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy
leukoplakia
JOURNAL J. Virol. 64, 397-400 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 87 90 WZhet junction
BASE COUNT 41 a 45 c 41 g 33 t
ORIGIN
1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc
61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact
121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac
//
LOCUS HS4WXJ 181 bp ds-DNA VRL 25-MAY-1990
DEFINITION Epstein-Barr virus WZhet junction, HR-1 clone 5.
ACCESSION M33473
KEYWORDS .
SOURCE Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 181)
AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy
leukoplakia
JOURNAL J. Virol. 64, 397-400 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 108 111 WZhet junction
BASE COUNT 50 a 42 c 46 g 43 t
ORIGIN
1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt
61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca
121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga
181 c
//
LOCUS HUMREGA 4251 bp ds-DNA PRI 25-MAY-1990
DEFINITION Human regenerating protein (reg) gene, complete cds.
ACCESSION J05412
KEYWORDS pancreatic stone protein; pancreatic thread protein;
regenerating protein.
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4251)
AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
TITLE Complete nucleotide sequence of the human reg gene and its
expression in normal and tumoral tissues
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
H.Okamoto, 23-FEB-1990.
FEATURES from to/span description
pept 1571 1634 regenerating protein (reg), exon 2 (first
expressed exon)
2270 2388 regenerating protein, exon 3
2696 2833 regenerating protein, exon 4
3549 3660 regenerating protein, exon 5
3856 3923 regenerating protein, exon 6
pre-msg 1196 4116 reg mRNA and introns
IVS 1224 1524 reg intron A
IVS 1635 2269 reg intron B
IVS 2389 2695 reg intron C
IVS 2834 3548 reg intron D
IVS 3661 3855 reg intron E
signal 1169 1174 TATA box
BASE COUNT 1161 a 927 c 869 g 1294 t
ORIGIN
1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc
61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat
121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt
181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc
241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt
301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc
361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc
421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat
481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac
541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg
601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg
661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata
721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag
781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc
841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga
901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt
961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag
1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc
1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa
1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa
1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat
1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta
1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc
1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag
1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac
1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt
1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct
1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact
1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt
1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat
1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg
1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt
1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa
1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac
2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga
2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat
2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa
2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc
2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc
2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag
2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca
2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc
2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac
2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca
2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta
2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt
2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca
2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta
2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag
2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata
3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt
3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca
3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct
3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa
3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga
3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt
3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa
3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc
3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc
3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg
3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag
3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg
3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt
3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt
3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct
3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc
3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca
4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt
4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc
4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct
4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c
//
LOCUS HUMREGRELA 1524 bp ds-DNA PRI 25-MAY-1990
DEFINITION Human reg-related sequence, complete cds.
ACCESSION J05413
KEYWORDS pancreatic stone protein; pancreatic thread protein;
regenerating protein.
SOURCE Human esophageal mucosa DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1524)
AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
TITLE Complete nucleotide sequence of the human reg gene and its
expression in normal and tumoral tissues
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
H.Okamoto, 23-FEB-1990.
BASE COUNT 382 a 368 c 345 g 429 t
ORIGIN
1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt
61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt
121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg
181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat
241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg
301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga
361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata
421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat
481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag
541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa
601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc
661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac
721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta
781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca
841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt
901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat
961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct
1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc
1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga
1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc
1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct
1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct
1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc
1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg
1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc
1501 tataaatcat ataatgtaaa ggat
//
LOCUS MDPCGA 4801 bp ss-RNA VRL 25-MAY-1990
DEFINITION Aleutian mink disease parvovirus complete genome.
ACCESSION M20036
KEYWORDS complete genome.
SOURCE Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4.
ORGANISM Aleutian mink disease parvovirus
Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE 1 (bases 1 to 4592)
AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
Wolfinbarger,J.B.
TITLE Nucleotide sequence and genomic organization of Aleutian mink
disease parvovirus (ADV): Sequence comparisons between a
nonpathogenic and pathogenic strain of ADV
JOURNAL J. Virol. 62, 2903-2915 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 4593 to 4801)
AUTHORS Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W.,
Perryman,S. and Wolfinbarger,J.B.
TITLE Nucleotide sequence of the 5' terminal palindrome of Aleutian mink
disease parvovirus (ADV) and construction of an infectious
molecular clone
JOURNAL J. Virol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by M.E.Bloom, 18-MAR-1990.
Sequence reported below is (+) strand.
FEATURES from to/span description
ORF 116 1978 Left ORF
ORF 1983 2207 Middle ORF2
ORF 1993 2211 Middle ORF1
ORF 2241 4399 Right ORF
BASE COUNT 1740 a 912 c 943 g 1206 t
ORIGIN
1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt
61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt
121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag
181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact
241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt
301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact
361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc
421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac
481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt
541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta
601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag
661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa
721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc
781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga
841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta
901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt
961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga
1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg
1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa
1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt
1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga
1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt
1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga
1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa
1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg
1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta
1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat
1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac
1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg
1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa
1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa
1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca
1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt
1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca
2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa
2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg
2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata
2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta
2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc
2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag
2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg
2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg
2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt
2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact
2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag
2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata
2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta
2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa
2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt
2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg
3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc
3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct
3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta
3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt
3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg
3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa
3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg
3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt
3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac
3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga
3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat
3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac
3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt
3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc
3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg
3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg
3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac
4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg
4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa
4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag
4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag
4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg
4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact
4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc
4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta
4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac
4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt
4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt
4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct
4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg
4801 a
//
LOCUS MDPUPS 3454 bp ss-RNA VRL 25-MAY-1990
DEFINITION Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial
sequence.
ACCESSION M32981
KEYWORDS .
SOURCE Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA.
ORGANISM Aleutian mink disease parvovirus
Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE 1 (sites)
AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
Wolfinbarger,J.B.
TITLE Nucleotide sequence and genomic organization of Aleutian mink
disease parvovirus (ADV): Sequence comparisons between a
nonpathogenic and pathogenic strain of ADV
JOURNAL J. Virol. 62, 2903-2915 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3454, for [1])
AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
Wolfinbarger,J.B.
JOURNAL Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by M.E.Bloom 18-MAR-1990.
Sequence reported below is (+) strand.
BASE COUNT 1268 a 673 c 698 g 815 t
ORIGIN
1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc
61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac
121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac
181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta
241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa
301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga
361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat
421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt
481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa
541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta
601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat
661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag
721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga
781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat
841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc
901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact
961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt
1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac
1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac
1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa
1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta
1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag
1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac
1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg
1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa
1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac
1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca
1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga
1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc
1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg
1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta
1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt
1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga
1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag
2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag
2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac
2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt
2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga
2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc
2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg
2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat
2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta
2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg
2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa
2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg
2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta
2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc
2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa
2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt
2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca
3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg
3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct
3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga
3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt
3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc
3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt
3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa
3421 aacctatggt tactttactt ggagaggcaa gctt
//
LOCUS MUSAA2DEL 300 bp ds-DNA ROD 25-MAY-1990
DEFINITION Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion
fragment.
ACCESSION M33468
KEYWORDS deletion mutant.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
TITLE Molecular analysis of two mouse dilute locus deletion mutations:
Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
lethal Aa2 alleles
JOURNAL Mol. Cell. Biol. 10, 501-509 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 130 133 deletion breakpoint
BASE COUNT 82 a 45 c 76 g 97 t
ORIGIN
1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact
61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt
121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt
181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata
241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa
//
LOCUS MUSSL20JA 300 bp ds-DNA ROD 25-MAY-1990
DEFINITION Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion
fragment.
ACCESSION M33467
KEYWORDS deletion mutant.
SOURCE Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
TITLE Molecular analysis of two mouse dilute locus deletion mutations:
Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
lethal Aa2 alleles
JOURNAL Mol. Cell. Biol. 10, 501-509 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept.ps / 42 122 dilute gene, exon 3
179 > 300 dilute gene, exon 5
recomb 129 130 deletion breakpoint
BASE COUNT 86 a 51 c 61 g 102 t
ORIGIN
1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac
61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac
121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta
181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta
241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc
//
LOCUS PRVVP4 2359 bp ds-RNA VRL 25-MAY-1990
DEFINITION Porcine rotavirus capsid protein VP4 gene, complete cds.
ACCESSION M33516
KEYWORDS capsid protein VP4.
SOURCE Porcine rotavirus (Gottfried strain; serotype 4) DNA.
ORGANISM Porcine rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 2359)
AUTHORS Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K.
TITLE Similarity of the outer capsid protein VP4 of the Gottfried strain
of porcine rotavirus to that of asymptomatic human rotavirus
strains
JOURNAL J. Virol. 64, 414-418 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 10 2337 capsid protein VP4
BASE COUNT 812 a 406 c 442 g 699 t
ORIGIN
1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa
61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg
121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat
181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca
241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac
301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga
361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg
421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta
481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat
541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa
601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag
661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta
721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag
781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt
841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca
901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt
961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc
1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac
1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg
1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt
1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc
1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca
1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga
1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac
1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg
1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata
1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt
1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct
1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc
1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata
1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga
1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct
1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat
1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat
2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc
2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata
2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc
2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa
2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag
2341 ctctatagag gatgtgacc
//
LOCUS RATSTAA 1000 bp ss-mRNA ROD 25-MAY-1990
DEFINITION Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds.
ACCESSION M33329
KEYWORDS hydroxysteroid sulfotransferase a.
SOURCE Rat (strain Sprague-Dawley) female liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1000)
AUTHORS Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K.,
Yamazoe,Y. and Kato,R.
TITLE cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a
strong homology in amino acid sequence with the senescence marker
protein SMP-2 in rat livers
JOURNAL Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 38 892 hydroxysteroid sulfotransferase a (STa)
mRNA < 1 1000 STa mRNA
signal 977 982 poly-A signal
BASE COUNT 299 a 187 c 211 g 303 t
ORIGIN
1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga
61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa
121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg
181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt
241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa
301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct
361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc
421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta
481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc
541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa
601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga
661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc
721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg
781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt
841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa
901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa
961 gtgaatgtgg tcattgaata aattctattc tggcattgtg
//
LOCUS SMFPOLENV 3534 bp ss-RNA VRL 25-MAY-1990
DEFINITION Simian foamy virus type 1 polymerase (pol) gene, 3' end; and
envelope (env) gene, complete cds.
ACCESSION M33561
KEYWORDS envelope protein; polymerase.
SOURCE Simian foamy virus type 1, cDNA to viral RNA.
ORGANISM Simian foamy virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Spumavirinae.
REFERENCE 1 (bases 1 to 3534)
AUTHORS Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A.
TITLE Relationship of the env genes and the endonuclease domain of the
pol genes of simian foamy virus type 1 and human foamy virus
JOURNAL J. Virol. 64, 406-410 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 582 polymerase (AA at 1)
pept 524 3481 envelope protein
BASE COUNT 1162 a 645 c 670 g 1046 t 11 others
ORIGIN
1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac
61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta
121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat
181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta
241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc
301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg
361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc
421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt
481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg
541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac
601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag
661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg
721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat
781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt
841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt
901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt
961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt
1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt
1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt
1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca
1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac
1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca
1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa
1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa
1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga
1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg
1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc
1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt
1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca
1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa
1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg
1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt
1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa
1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc
2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac
2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa
2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag
2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg
2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt
2341 atatttactn nnnnnnnnnn tggtaaccct gatggaagct gcccttcatg atgtttcgat
2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat
2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt
2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta
2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga
2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt
2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa
2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt
2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg
2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg
2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt
3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc
3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt
3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa
3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg
3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc
3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg
3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact
3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg
3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat
//
LOCUS ENHCYPA 495 bp ds-DNA INV 25-MAY-1990
DEFINITION Entamoeba histolytica cysteine protease gene, partial cds.
ACCESSION M27307
KEYWORDS cysteine protease.
SOURCE Entamoeba histolytica (strain HM-1) trophozoites DNA, clone E452.
ORGANISM Entamoeba histolytica
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Lobosa; Gymnamoeba; Schizopyrenida; Tubulina;
Entamoebidae.
REFERENCE 1 (sites; for [2])
AUTHORS Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H.
TITLE Amplification and sequencing of genomic DNA fragments encoding
cysteine proteases from protozoan parasites
JOURNAL Mol. Biochem. Parasitol. 39, 1-8 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 495)
AUTHORS Eakin,A.E.
JOURNAL Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by A.E.Eakin, 24-AUG-1989.
FEATURES from to/span description
pept < 1 > 495 cysteine protease (AA at 1)
BASE COUNT 161 a 75 c 107 g 152 t
ORIGIN
1 caagggcaat gcgggtcgtg ctggactttc tgtacaactg cagttcttga aggaagagtt
61 aacaaagatc ttggaaaact ttactcattc tctgaacaac aattagttga ttgtgatgct
121 tctgataatg gatgtgaacg aggacatcca tctaactcac ttaaattcat ccaagaaaat
181 aatggattag gattagaaag cgattatcca tataaagctg ttgctggtac ttgcaagaaa
241 gttaaaaacg ttgctactgt tactggttct agaagagtta ctgatggaag tgaaactgga
301 cttcaaacta ttattgctga aaacggacct gttgctgttg gtatggatgc tagcagacca
361 tcattccaat tatataagaa aggaactatc tattctgata ctaaatgtag atcaagaatg
421 atgaatcact gtgttactgc tgttggttat ggttcaaata gtaatggtaa atattggatt
481 gtcaagaatt cctgg
//
LOCUS TRBCYPA 495 bp ds-DNA INV 25-MAY-1990
DEFINITION Trypanosoma brucei cysteine protease gene, partial cds.
ACCESSION M27306
KEYWORDS cysteine protease.
SOURCE Trypanosoma brucei gambiense trypomastigote DNA, clone B502.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (sites; for [2])
AUTHORS Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H.
TITLE Amplification and sequencing of genomic DNA fragments encoding
cysteine proteases from protozoan parasites
JOURNAL Mol. Biochem. Parasitol. 39, 1-8 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 495)
AUTHORS Eakin,A.E.
JOURNAL Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by A.E.Eakin, 24-AUG-1989.
FEATURES from to/span description
pept < 1 > 495 cysteine protease (AA at 1)
BASE COUNT 116 a 115 c 140 g 124 t
ORIGIN
1 caagggcagt gtgggtcgtg ctgggccttt tcaactatcg gcaacatcga agggcagtgg
61 caggtggcag gaaatcctct cgtatccctc tcggagcaga ttctagtgta ctgtgatcca
121 ttgattggct gtggtggtgg gctgatggac aatgccttca actggatagt aaattcaaac
181 ggtggaaacg tattcacgga ggcgagctat ccctatgttt ctgggaatgg tgagcagcca
241 cagtgccaga tgaatggtca cgagatcggt gctgcgataa cagaccatgt tgacttaccg
301 caggatgagg acgccatcgc cgcgtatttg gcagaaaacc gtccccttgc tattgccgtt
361 gaagcaccgc agttttatgg acataacggt gggattctga cttcatgcac ctccgagcaa
421 ctggatcatg gtgtgctcct cgttggttac aatgataata gcaatccacc ctattggatc
481 gtcaagaact cctgg
//
LOCUS TRCCYPA 495 bp ds-DNA INV 25-MAY-1990
DEFINITION Trypanosoma cruzi cysteine protease gene, partial cds.
ACCESSION M27305
KEYWORDS cysteine protease.
SOURCE Trypanosoma cruzi (strain RA; Argentine) epimastigote DNA, clone
R501.
ORGANISM Trypanosoma cruzi
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (sites; for [2])
AUTHORS Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H.
TITLE Amplification and sequencing of genomic DNA fragments encoding
cysteine proteases from protozoan parasites
JOURNAL Mol. Biochem. Parasitol. 39, 1-8 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 495)
AUTHORS Eakin,A.E.
JOURNAL Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by A.E.Eakin, 24-AUG-1989.
FEATURES from to/span description
pept < 1 > 495 cysteine protease (AA at 1)
BASE COUNT 94 a 143 c 158 g 100 t
ORIGIN
1 cagggccaat gcggctcgtg ctgggccttc tccgccattg gcaacgttag tggccagtgg
61 tttcttgccg gtcacccgct gactaacctg tcggagcaga tgctcgtgtc gtgcgacaaa
121 acggactctg gctgcagtgg tggcctgatg aacaacgcct ttgagtggat tgtgcaggag
181 aataacggcg gcgtgtacac ggaggacagc tacccttatg cgtcgggcga ggggatatcg
241 ccgccgtgca cgacgtcagg ccacacggtg ggtgccacga ttaccggtca cgttgaatta
301 cctcaggacg aggcccaaat agccgcatgg cttgccgtca atggcccggt tgccgttgcg
361 cacgccagca gctggatgac ctacacgggc ggcgttatga cgagctgcgt ctccgagcag
421 ctggatcacg gccttcttct cgtcggctac aatgacagcg ccgcagtgcc gtactggatc
481 atcaagaact cgtgg
//
LOCUS STRDEXB 1800 bp ds-DNA BCT 25-MAY-1990
DEFINITION S.mutans dextran glucosidase (dexB) gene, complete cds.
ACCESSION M30944
KEYWORDS dextran glucosidase.
SOURCE S.mutans (strain INGBRITT) DNA.
ORGANISM Streptococcus mutans
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Streptococcaceae.
REFERENCE 1 (bases 1 to 1800)
AUTHORS Russell,R.R.B. and Ferretti,J.J.
TITLE Nucleotide sequence of the dextran glucosidase (dexB) gene from
Streptococcus mutans
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma
City, OK 73190
FEATURES from to/span description
pept 91 1701 dextran glucosidase (dexB) (EC 3.2.1.11)
binding 78 83 ribosome binding site
BASE COUNT 585 a 290 c 376 g 549 t
ORIGIN
1 aaaacggctt aaattgtttt agattttctc agtctcttta ttagaactaa agaaactaac
61 atgtcataat aaattaaagg tggtgataag atgcaaaaac attggtggca caaggcaact
121 gtttatcaaa tttatccaaa atcttttatg gatacaaatg gtgatggaat tggtgatctc
181 aaaggtatta cgagtaaatt ggattatttg caaaagttag gggttatggc tatttggcta
241 tctccagttt atgatagccc catggatgac aatggctatg acattgcgaa ctatgaagca
301 attgcggata tttttggcaa tatggctgat atggataatt tgctgacgca ggcaaaaatg
361 cgcgacataa aaatcattat ggatctagtg gttaatcata cctcagatga acatacttgg
421 tttattgaag cacgtgagca tccagacagt tctgaacgcg attattatat ttggtgtgac
481 cagccaaatg atttggaatc tattttcggt ggttctgctt ggcagtatga tgataagtcc
541 gatcaatatt atttgcattt ttttagtaag aagcagccag atctaaactg ggaaaacgca
601 aacttacgtc agaagattta tgatatgatg aatttctgga ttgataaagg tattggcggc
661 tttcggatgg acgtcattga tatgattggg aaaattcctg ctcagcatat tgtcagtaac
721 ggaccaaaat tgcatgctta tcttaaggag atgaatgccg ctagttttgg tcaacatgat
781 ctgctgactg tgggggaaac ttggggagca acgcctgaga ttgcgaagca atattcaaat
841 ccagtcaatc acgaactctc tatgattttt caatttgaac atattggtct tcagcataaa
901 ccagaagctc ctaaatggga ttatgtgaag gaacttaatg ttcctgcttt aaaaacaatc
961 tttaataaat ggcagactga gttggaatta ggacaggggt ggaattcgtt attctggaat
1021 aaccatgacc tgcctcgtgt tttatcaatc tggggaaata cgggcaaata tcgtgagaag
1081 tctgctaaag cactggctat tcttcttcac cttatgcgtg ggacacctta tatttatcaa
1141 ggtgaagaga ttgggatgac caattatcct tttaaagatt taaatgaact tgatgatatt
1201 gaatcactta attatgctaa ggaagctttt acaaatggta agtctatgga aactatcatg
1261 gacagtattc gtatgattgg ccgtgataat gccagaacac ctatgcaatg ggatgcttct
1321 caaaatgccg gattttcaac agcggataaa acatggctgc cagttaatcc aaactataaa
1381 gacatcaatg ttcaagcagc tctgaaaaat tccaattcta tcttttacac ctatcaacaa
1441 ctcattcagc ttcgaaaaga aaatgattgg ctagtagatg ccgattttga attgctccct
1501 acagcggaca aagtatttgc ctatttacga aaggtaagag aagaaaggta tcttatagtg
1561 gtcaatgttt cagatcagga agaagttcta gagattgatg ttgacaaaca agaaactctc
1621 attagcaata caaatgaaag cgctgctctt gccaatcaca aactccagcc ttgggatgct
1681 ttttgtatta agataaacta ggatagaata agaaaacaac agacgatacc acttaaaaag
1741 tgacggcctg ttgttttttt aatgtttgta ttaagtaata aagcgattct gattgactat
//
LOCUS STRGBP 1968 bp ds-DNA BCT 25-MAY-1990
DEFINITION S.mutans glucan-binding protein (gbp) gene, complete cds.
ACCESSION M30945
KEYWORDS glucan binding protein.
SOURCE S.mutans (strain INGBRITT) DNA.
ORGANISM Streptococcus mutans
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Streptococcaceae.
REFERENCE 1 (bases 1 to 1968)
AUTHORS Banas,J.A., Russell,R.R.B. and Ferretti,J.J.
TITLE Sequence analysis of the gene for the glucan-binding protein of
Streptococcus mutans INGBRITT
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma
City, OK 73190.
FEATURES from to/span description
pept 124 1815 glucan-binding protein (gbp)
signal 12 17 -35 region
signal 37 42 -10 region
binding 112 116 ribosome binding site
BASE COUNT 648 a 341 c 445 g 534 t
ORIGIN
1 ccggctataa gttgaaatat tgtaggtatt aaaaactatc tttagtttag tatttacatt
61 aattttaaaa atgttatagt ggaagtgtca tgttgattac tattttttta aggaggtaaa
121 atgatgaaag aaaagacacg ttttaaactg cacaaggtta aaaagcagtg ggtggcgatt
181 gccgtgacta gtctagctct agctgcgata ttgtcaggag ctcacttgac tcaggctgag
241 gaacaatccg gcggtactga cagtaagcca agactgacag cgactgtaca ggaaagctca
301 gaacaaccaa ttacaaaagc tccagcagct gattcatctg tagaaaataa cagtgctaac
361 gctgttaaaa gttctgaaac agcagaggca gctgaagtat ccgatggagg cagagccagc
421 caaactgaag cagtaacaaa ccaaacaaac tctgaagagc accatccagc agaaaaagcg
481 acagccgttt ctggagaagc tcagtcagtg caaaatgctc catcagaaaa tgctgcccag
541 caggaaacgg ctaaaaccga gccagcgact gctgcagaaa ataatgacgc tgctccaacc
601 aatagcttct ttaaaaaaga tggtaaatgg tactacaaaa aggccgatgg acagctggca
661 accggttggc agataattga tggaaagcag ctctatttca accaagatgg tagtcaggtc
721 aaaggagaaa ttcatgtgga gacaggggat caaatcattt atcatcctgt tttcataagt
781 gattcacctt cagttttgga agtcaataag atttattact ttgatcctga tagtggtgaa
841 ctctggaagg atcgttttgt ctattctagt tatgcagatc ccctccatta tgaaaatatt
901 aaacatgaag gctggttcta tcttggagaa gatggaaagg ctgctatcgg ctggagaact
961 attggcggta aaaaatacta ttttgacact aatggtgttc aagtcaaagg aaagctaatt
1021 agtacagatg gcaattataa tctaattagc cagaagtatg gcaagaaatc tttcctagat
1081 cctgacaccg gtgaagcttg gactaatcgt tttgtcaatg caaagtatta tttctacaac
1141 tttgcaggat acgtctctac gacagactgg ttctatatgg gagccgatgg tatcggcgtg
1201 accgattggc aaaagatcga tggtatggat tactatttcg aaccttccag tggtattcag
1261 gttaaaggcg acattgctga gcgtgatggc aaggtctatt atttagatga agacagtgga
1321 caagttgtta agaatcgttt tggcacaaca cctgccgagc gtatcagtac agttgaggct
1381 cgtttcccta aaacttatta ttttggagcg gacggtagcc gcaaagatct aactggttgg
1441 cagattattg atggtaaaac ttattacttt aaggatgatc acagcataaa agcaaagtca
1501 gagtatagtc aaattggtgg ttctgtgcct gatgacggtt ttgcagagat tgatggtgat
1561 ggttactttt ttgatactca aggtcaattc gtaacgaata gatttgtcag aaaatacgac
1621 tacagtaata tttggtatta ttatggaagc gatggcaaac gtgtatcagg ctggcaaact
1681 atcgacggta agcgctacta ctttagccaa gatgaaaaga caaagggccg tcaaattaaa
1741 ggacaaacca tcactatcga tggtaaagaa tatacttttg acaaagacag cggtgaagtt
1801 atcaatagta actagttggt aaatcccatg gcacacaaaa acgagcagat ttcatactct
1861 gttcgttttt tcgccttaaa acttatatat ttataaatcg tcaataaagt gtttacttga
1921 aaacggtaaa tatgccaaga gtttgactgt tatcaattaa tgggaaag
//
LOCUS STRGTFS 4376 bp ds-DNA BCT 25-MAY-1990
DEFINITION S.downei glucosyltransferase S (gtfs) gene, complete cds.
ACCESSION M30943
KEYWORDS glucosyltransferase S.
SOURCE S.downei (strain MFe28) DNA.
ORGANISM Streptococcus downei
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Streptococcaceae.
REFERENCE 1 (bases 1 to 4376)
AUTHORS Gilmore,K.S., Russell,R.R.B. and Ferretti,J.J.
TITLE Sequence analysis of a Streptococcus downei gtfs gene which
specifies a glucosyltransferase synthesizing soluble glucans
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma
City, OK 73190
FEATURES from to/span description
pept 247 4344 glucosyltransferase S (gtfs)
signal 127 132 -35 region
signal 150 154 -10 region
binding 233 237 ribosome binding site
BASE COUNT 1282 a 944 c 1000 g 1150 t
ORIGIN
1 tggcctatgg agtagcggtc attctcacca tcttgaacat caagttaatt attgatttgt
61 tccaataaat atgaatctct ttaaagaata aatttaaaga gatttttatg ttgttttaca
121 atctaatgac aaatttagga tttctgagat taaacttaaa taacatgtta aaataaattg
181 ataattaaaa ttatttggtt attaaagttt taattgtcga atttttttat aaggagaatg
241 atattcatgg aaaaaaattt gcgctacaaa ttgcataagg ttaaaaagca atgggtagcc
301 attggtgtga cgaccgtaac gcttagcttt ctagctggag gccaagtcgt tgcagccgac
361 acaaacaata atgacgggac aagtgttcag gtcaacaaaa tggtgccaag tgatcccaag
421 tttgacgctc aagcccaaaa cggtcaactt gctcaagcta tgtttaaggc tgctaatcaa
481 gcagatcaaa ctgctacaag tcaagttagc ccagcaacag atggcagggt tgataaccag
541 gtgacaccag cagctaatca acctgcggct aatgtggcca atcaagatgt ggccaatcca
601 gcgactgatg ccggagcact taatcgccaa tcggccgctg atacgtcaac agatggtaag
661 gctgtacctc agactagtga ccaaccaggt catctggaaa cagttgatgg caaaacttac
721 tatgtggatg ccaatggtca acggttgaaa aactattcaa tggtcattga tggcaagacc
781 tattactttg acggtcagac aggcgaagcg caaaccgact tgcctaagac cggtcaagct
841 aatcaagaca atgtgcccga tagctatcag gccaataatc aagcctatag taatgaggct
901 tctagctttg agacagttga taattacctg acagcggatt cctggtatcg ccctcgcaag
961 attttgaaaa atggccagag ctggcaggct agttcagaag gtgacctgcg tcctattctc
1021 atgacctggt ggccagatgc ggccaccaag gcagcctatg ccaacttctg ggctaaagaa
1081 ggcctgattt caggttctta caggcaaaat tctgccaatc ttgatgcggc tacccagaat
1141 attcaatctg ccattgaaaa gaagattgcc agtgaaggca ataccaactg gttgcgcgac
1201 aagatgtccc aattcgtcaa gtcacaaaac cagtggagta ttgcttctga aaatgagact
1261 gtctatccta atcaggatca tatgcaaggg ggggccctgc tcttttcaaa cagtaaggat
1321 accgagcatg ctaactcaga ctggcgcctg ctcaaccgca atccaacctt ccaaactggt
1381 aaacaaaagt attttacaac caactatgcg ggttatgaat tgctcttagc caatgatgtt
1441 gataattcta acccggtcgt ccaagccgaa cagctcaatc acctccacta ccttatgaac
1501 tggggtgaca ttgttatggg ggataaggat gctaactttg atggtgtccg agttgatgcg
1561 gttgataatg tcaatgccga cctgctacaa attcagaggg attattataa ggccaaatat
1621 ggcaccgatc aaaatgaaaa aaatgccatt gaccacctct ctatcctaga agcctggtct
1681 ggcaacgaca atgactacgt caaggaccaa aacaactttt ctctctctat tgacaatgat
1741 caacgcagtg gtatgttgaa agctttcggt tatgcttctg cctatcgtgg taatttgagt
1801 aatctcgcaa ctgctggctt gaaaaaccgt agtgcaaatc ctgatagtga cccagttcca
1861 aactatgtct ttattcgtgc ccatgactct gaggtacaga cacggattgc taagattatt
1921 cgtgagaaat tgggtaagac caatgctgat ggtttgacca atctgacctt agatgattta
1981 aacaaggcct ttgacatcta caaccaggat atgaatgcca ccgataaggt ttattatcct
2041 aataatttac cgatggccta tgcttggatg ctgcaaaata aggatacggt cactcgggtc
2101 tactatggtg atatgtacac cgataatggc cagtatatgg cgaccaagac cccattttat
2161 aatgctattg agaccctgct taagggccgg attaagtatg ttgctggtgg tcaagcggtc
2221 tcctacaagc aagattggtc tagtggaatt ttgacctcgg ttcgctatgg taagggagct
2281 aattcggcta gtgatgctgg gaacacggaa acccgcaatt ctggtatggc cctcctgatt
2341 aacaatcgac ctaacttcag ggcttatcgg aacttgacct tgaacatggg ggcagcccat
2401 aagagccaag cctaccgtcc tctcttgctg tctaccaagg atggcattgc cacctacctt
2461 aatgatagcg atgtggattc tcgccaatac aagtacaccg atagtcaggg gaacttaagc
2521 tttagtgcct ctgaactgca aagtgttgct aacgcccaag tctcaggtat gattcaggtt
2581 tgggtacccg tcggtgcggc agataaccaa gatgtccgta cttctccaag tacacaagcg
2641 accaaggatg gcaatatcta tcatcaaagt gacgccctag attcccaagt catctatgaa
2701 ggtttctcta atttccaagc cttcgcccaa agtcctgacc aatataccaa tgctgtcatt
2761 gctaaaaatg gcgacctctt taagtcttgg ggcattaccc aatttgaaat ggcaccgcag
2821 tacgtgtcat cagaagatgg aactttcctt gattccgtta ttttaaatgg ttatgccttc
2881 tcagaccgct atgacctggc tatgagtaag aataataaat atggctccaa gcaggatttg
2941 gccaatgcca ttaaaggact tcagtctgca ggtatcaagg tcttgtcaga cttggtaccg
3001 aaccaacttt ataatctgcc tggaaaagaa gtggtaacgg caacccgggt caatcaatat
3061 ggtcaagcca aatcaggggc gaccatcaat aagactcctt atgtggccaa tacccgttcc
3121 tatggtgact accaagaaca gtacggtgga aaattcttgg atgacttgca aaagctctac
3181 ccaagactct ttagcaccaa gcaaatttca acgggcaaac caattgatcc atctgttaag
3241 attaccaatt ggtcggctaa gtatttcaat ggctccaata ttttggggcg tggtgccaag
3301 tatgttctga gtgagggcaa taagtatctc aacttggctg atggcaagct cttcttgcca
3361 acggtcctca ataatactta tggtcagccg caagtatcgg ctaatggttt tatttctaag
3421 aatggcggta ttcattatct tgataaaaat ggtcaggaag tcaagaatcg gttcaaggaa
3481 atttctggca gctggtatta cttcgattct gacggtaaga tggctactgg aaaaacgaag
3541 attggcaatg atacctacct ctttatgcct aatggtaaac aacttaagga aggtgtctgg
3601 tatgatggta agaaggccta ctattatgat gataatggta ggacgtggac caataagggc
3661 ttcgttgaat ttagagttga cggtcaagac aagtggcgtt acttcaatgg tgatggcact
3721 atcgccattg gactagttag tctagataat cgcaccctct actttgatgc ctatggctat
3781 caagtcaagg gacaaaccgt gacgattaat ggtaagtcct atacctttga tgccgatcag
3841 ggtgacttgg tccaaacaga caatgccaat ccagctcctc agggtcaagc aggctggaaa
3901 ctcctaggag ataaccagtg gggctaccgc aaggacggtc aactcttgac gggtgagcaa
3961 actattgatg gtcaaaaggt cttcttccaa gataatggcg tccaagtcaa aggtggaact
4021 gcgacagatg cttcaggtgt cttgcgtttc tacgaccgtg accagggcca ccaagttggc
4081 aagggctggt actcaacctc cgacgataat tgggtctatg tcaatgaatc cggtcaagtt
4141 ctgacaggct tacaaaccat tgatggtcag acggtctatt tcgatgacaa gggtatccag
4201 gccaagggca aggctgtctg ggatgaaaat ggcaacctgc gttactttga tgccgattca
4261 ggtaatatgc ttcgggaccg ttggaagaac gttgacggaa actggtatta cttcaaccgc
4321 aatggcctag ccacgaggtg gtaagcctaa tactcgctaa atattaaaag agctcg
//
LOCUS VCHTCPA 1048 bp ds-DNA BCT 25-MAY-1990
DEFINITION V.cholerae major pilin subunit (tcpA) gene, complete cds.
ACCESSION M33514
KEYWORDS antigen; pilin.
SOURCE V.cholerae (strain Z17561 (Classical, Inaba)) DNA, clone pPM2114.
ORGANISM Vibrio cholerae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 1048)
AUTHORS Faast,R., Ogierman,M.A., Stroeher,U.H. and Manning,P.A.
TITLE Nucleotide sequence of the structural gene, tcpA, for a major pilin
subunit of Vibrio cholerae
JOURNAL Gene 85, 227-231 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 144 818 major pilin subunit (tcpA) precursor
sigp 144 218 major pilin subunit (tcpA) signal peptide
matp 219 815 major pilin subunit (tcpA)
ORF 865 > 1048 ORF
BASE COUNT 327 a 182 c 240 g 299 t
ORIGIN
1 aaaaaataaa aaacacagca aaaatgacat ctgtcaattg taggtgactt tgtgtggtta
61 aatgtgcgtg ttgcttacgt tatctaaaaa agaccaagcg acgcatttct ttaaagacag
121 taaaatggtg gagttacata aatatgcaat tattaaaaca gctttttaag aagaaatttg
181 taaaagaaga acacgataag aaaaccggtc aagagggtat gacattactc gaagtgatca
241 tcgttctagg cattatgggg gtggtttcgg cgggggttgt tactctggcg cagcgtgcga
301 ttgattcgca gattatgacc aaggccgcgc aaagtctcaa tagtatccaa gttgcactga
361 cacagacata ccgtggtcta ggtaattatc cagcaacagc tgatgcgaca gctgctagta
421 agctaacttc aggcttggtt agtttaggta aaatatcatc cgatgaggca aaaaacccat
481 tcaatggtac aaatatgaat attttttcat ttccgcgtaa tgcagcagct aataaagcat
541 ttgcaatttc agtggatggt ctgacacagg ctcaatgcaa gacacttatt accagtgtcg
601 gtgatatgtt cccatatatt gcaatcaaag ctggtggcgc agtagcactt gcagatctag
661 gtgattttga gaattctgca gcagcggctg agacaggcgt tggtgtgatc aaatctatcg
721 ctcccgctag taagaattta gatctaacga acatcactca cgttgagaaa ttatgtaaag
781 gtactgctcc attcggcgtt gcatttggta acagctaatt caaataagtt tgtttaactt
841 aatcttaacg ttgcccatta aataatgggc aacttattaa attcaatgtg gtatcaatat
901 gagaaaatac caacaaggtg tcggattatt ggaggcgatt ctggcttctg cgtattaggg
961 atggcattgg tcgctgctgg gagctattac aagcgggaag ctgaactcat gattaaatcc
1021 agtaacgcat ttgatgttat tgagttgt
//
LOCUS HPCPOLYP 7310 bp ss-RNA VRL 25-MAY-1990
DEFINITION Hepatitis C virus polyprotein gene, partial cds.
ACCESSION M32084
KEYWORDS polyprotein.
SOURCE Hepatitis C virus, cDNA to viral RNA, clones K9-1 through 15e,
isolated from chimpanzee (individual 910) blood plasma.
ORGANISM Hepatitis C virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Togaviridae incertae sedis.
REFERENCE 1 (bases 1 to 7310)
AUTHORS Choo,Q.-L., Richman,K. and Han,J.
TITLE The nucleotide sequence of the Hepatitis C viral genome
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.Houghton, 22-FEB-1990. Chiron Corporation, 4560 Horton Street,
Emeryville CA 94608.
FEATURES from to/span description
pept < 1 > 7310 polyprotein (AA at 3)
BASE COUNT 1495 a 2218 c 2058 g 1539 t
ORIGIN
1 caggctgtcc tgagaggcta gccagctgcc gaccccttac cgattttgac cagggctggg
61 gccctatcag ttatgccaac ggaagcggcc ccgaccagcg cccctactgc tggcactacc
121 ccccaaaacc ttgcggtatt gtgcccgcga agagtgtgtg tggtccggta tattgcttca
181 ctcccagccc cgtggtggtg ggaacgaccg acaggtcggg cgcgcccacc tacagctggg
241 gtgaaaatga tacggacgtc ttcgtcctta acaataccag gccaccgctg ggcaattggt
301 tcggttgtac ctggatgaac tcaactggat tcaccaaagt gtgcggagcg cctccttgtg
361 tcatcggagg ggcgggcaac aacaccctgc actgccccac tgattgcttc cgcaagcatc
421 cggacgccac atactctcgg tgcggctccg gtccctggat cacacccagg tgcctggtcg
481 actacccgta taggctttgg cattatcctt gtaccatcaa ctacaccata tttaaaatca
541 ggatgtacgt gggaggggtc gaacacaggc tggaagctgc ctgcaactgg acgcggggcg
601 aacgttgcga tctggaagac agggacaggt ccgagctcag cccgttactg ctgaccacta
661 cacagtggca ggtcctcccg tgttccttca caaccctacc agccttgtcc accggcctca
721 tccacctcca ccagaacatt gtggacgtgc agtacttgta cggggtgggg tcaagcatcg
781 cgtcctgggc cattaagtgg gagtacgtcg ttctcctgtt ccttctgctt gcagacgcgc
841 gcgtctgctc ctgcttgtgg atgatgctac tcatatccca agcggaggcg gctttggaga
901 acctcgtaat acttaatgca gcatccctgg ccgggacgca cggtcttgta tccttcctcg
961 tgttcttctg ctttgcatgg tatttgaagg gtaagtgggt gcccggagcg gtctacacct
1021 tctacgggat gtggcctctc ctcctgctcc tgttggcgtt gccccagcgg gcgtacgcgc
1081 tggacacgga ggtggccgcg tcgtgtggcg gtgttgttct cgtcgggttg atggcgctga
1141 ctctgtcacc atattacaag cgctatatca gctggtgctt gtggtggctt cagtattttc
1201 tgaccagagt ggaagcgcaa ctgcacgtgt ggattccccc cctcaacgtc cgaggggggc
1261 gcgacgccgt catcttactc atgtgtgctg tacacccgac tctggtattt gacatcacca
1321 aattgctgct ggccgtcttc ggaccccttt ggattcttca agccagtttg cttaaagtac
1381 cctactttgt gcgcgtccaa ggccttctcc ggttctgcgc gttagcgcgg aagatgatcg
1441 gaggccatta cgtgcaaatg gtcatcatta agttaggggc gcttactggc acctatgttt
1501 ataaccatct cactcctctt cgggactggg cgcacaacgg cttgcgagat ctggccgtgg
1561 ctgtagagcc agtcgtcttc tcccaaatgg agaccaagct catcacgtgg ggggcagata
1621 ccgccgcgtg cggtgacatc atcaacggct tgcctgtttc cgcccgcagg ggccgggaga
1681 tactgctcgg gccagccgat ggaatggtct ccaaggggtg gaggttgctg gcgcccatca
1741 cggcgtacgc ccagcagaca aggggcctcc tagggtgcat aatcaccagc ctaactggcc
1801 gggacaaaaa ccaagtggag ggtgaggtcc agattgtgtc aactgctgcc caaaccttcc
1861 tggcaacgtg catcaatggg gtgtgctgga ctgtctacca cggggccgga acgaggacca
1921 tcgcgtcacc caagggtcct gtcatccaga tgtataccaa tgtagaccaa gaccttgtgg
1981 gctggcccgc tccgcaaggt agccgctcat tgacaccctg cacttgcggc tcctcggacc
2041 tttacctggt cacgaggcac gccgatgtca ttcccgtgcg ccggcggggt gatagcaggg
2101 gcagcctgct gtcgccccgg cccatttcct acttgaaagg ctcctcgggg ggtccgctgt
2161 tgtgccccgc ggggcacgcc gtgggcatat ttagggccgc ggtgtgcacc cgtggagtgg
2221 ctaaggcggt ggactttatc cctgtggaga acctagagac aaccatgagg tccccggtgt
2281 tcacggataa ctcctctcca ccagtagtgc cccagagctt ccaggtggct cacctccatg
2341 ctcccacagg cagcggcaaa agcaccaagg tcccggctgc atatgcagct cagggctata
2401 aggtgctagt actcaacccc tctgttgctg caacactggg ctttggtgct tacatgtcca
2461 aggctcatgg gatcgatcct aacatcagga ccggggtgag aacaattacc actggcagcc
2521 ccatcacgta ctccacctac ggcaagttcc ttgccgacgg cgggtgctcg gggggcgctt
2581 atgacataat aatttgtgac gagtgccact ccacggatgc cacatccatc ttgggcatcg
2641 gcactgtcct tgaccaagca gagactgcgg gggcgagact ggttgtgctc gccaccgcca
2701 cccctccggg ctccgtcact gtgccccatc ccaacatcga ggaggttgct ctgtccacca
2761 ccggagagat ccctttttac ggcaaggcta tccccctcga agtaatcaag ggggggagac
2821 atctcatctt ctgtcattca aagaagaagt gcgacgaact cgccgcaaag ctggtcgcat
2881 tgggcatcaa tgccgtggcc tactaccgcg gtcttgacgt gtccgtcatc ccgaccagcg
2941 gcgatgttgt cgtcgtggca accgatgccc tcatgaccgg ctataccggc gacttcgact
3001 cggtgataga ctgcaatacg tgtgtcaccc agacagtcga tttcagcctt gaccctacct
3061 tcaccattga gacaatcacg ctcccccagg atgctgtctc ccgcactcaa cgtcggggca
3121 ggactggcag ggggaagcca ggcatctaca gatttgtggc accgggggag cgcccctccg
3181 gcatgttcga ctcgtccgtc ctctgtgagt gctatgacgc aggctgtgct tggtatgagc
3241 tcacgcccgc cgagactaca gttaggctac gagcgtacat gaacaccccg gggcttcccg
3301 tgtgccagga ccatcttgaa ttttgggagg gcgtctttac aggcctcact catatagatg
3361 cccactttct atcccagaca aagcagagtg gggagaacct tccttacctg gtagcgtacc
3421 aagccaccgt gtgcgctagg gctcaagccc ctcccccatc gtgggaccag atgtggaagt
3481 gtttgattcg cctcaagccc accctccatg ggccaacacc cctgctatac agactgggcg
3541 ctgttcagaa tgaaatcacc ctgacgcacc cagtcaccaa atacatcatg acatgcatgt
3601 cggccgacct ggaggtcgtc acgagcacct gggtgctcgt tggcggcgtc ctggctgctt
3661 tggccgcgta ttgcctgtca acaggctgcg tggtcatagt gggcagggtc gtcttgtccg
3721 ggaagccggc aatcatacct gacagggaag tcctctaccg agagttcgat gagatggaag
3781 agtgctctca gcacttaccg tacatcgagc aagggatgat gctcgccgag cagttcaagc
3841 agaaggccct cggcctcctg cagaccgcgt cccgtcaggc agaggttatc gcccctgctg
3901 tccagaccaa ctggcaaaaa ctcgagacct tctgggcgaa gcatatgtgg aacttcatca
3961 gtgggataca atacttggcg ggcttgtcaa cgctgcctgg taaccccgcc attgcttcat
4021 tgatggcttt tacagctgct gtcaccagcc cactaaccac tagccaaacc ctcctcttca
4081 acatattggg ggggtgggtg gctgcccagc tcgccgcccc cggtgccgct actgcctttg
4141 tgggcgctgg cttagctggc gccgccatcg gcagtgttgg actggggaag gtcctcatag
4201 acatccttgc agggtatggc gcgggcgtgg cgggagctct tgtggcattc aagatcatga
4261 gcggtgaggt cccctccacg gaggacctgg tcaatctact gcccgccatc ctctcgcccg
4321 gagccctcgt agtcggcgtg gtctgtgcag caatactgcg ccggcacgtt ggcccgggcg
4381 agggggcagt gcagtggatg aaccggctga tagccttcgc ctcccggggg aaccatgttt
4441 cccccacgca ctacgtgccg gagagcgatg cagctgcccg cgtcactgcc atactcagca
4501 gcctcactgt aacccagctc ctgaggcgac tgcaccagtg gataagctcg gagtgtacca
4561 ctccatgctc cggttcctgg ctaagggaca tctgggactg gatatgcgag gtgttgagcg
4621 actttaagac ctggctaaaa gctaagctca tgccacagct gcctgggatc ccctttgtgt
4681 cctgccagcg cgggtataag ggggtctggc gagtggacgg catcatgcac actcgctgcc
4741 actgtggagc tgagatcact ggacatgtca aaaacgggac gatgaggatc gtcggtccta
4801 ggacctgcag gaacatgtgg agtgggacct tccccattaa tgcctacacc acgggcccct
4861 gtacccccct tcctgcgccg aactacacgt tcgcgctatg gagggtgtct gcagaggaat
4921 atgtggagat aaggcaggtg ggggacttcc actacgtgac gggtatgact actgacaatc
4981 tcaaatgccc gtgccaggtc ccatcgcccg aatttttcac agaattggac ggggtgcgcc
5041 tacataggtt tgcgcccccc tgcaagccct tgctgcggga ggaggtatca ttcagagtag
5101 gactccacga atacccggta gggtcgcaat taccttgcga gcccgaaccg gacgtggccg
5161 tgttgacgtc catgctcact gatccctccc atataacagc agaggcggcc gggcgaaggt
5221 tggcgagggg atcacccccc tctgtggcca gctcctcggc tagccagcta tccgctccat
5281 ctctcaaggc aacttgcacc gctaaccatg actcccctga tgctgagctc atagaggcca
5341 acctcctatg gaggcaggag atgggcggca acatcaccag ggttgagtca gaaaacaaag
5401 tggtgattct ggactccttc gatccgcttg tggcggagga ggacgagcgg gagatctccg
5461 tacccgcaga aatcctgcgg aagtctcgga gattcgccca ggccctgccc gtttgggcgc
5521 ggccggacta taaccccccg ctagtggaga cgtggaaaaa gcccgactac gaaccacctg
5581 tggtccatgg ctgtccgctt ccacctccaa agtcccctcc tgtgcctccg cctcggaaga
5641 agcggacggt ggtcctcact gaatcaaccc tatctactgc cttggccgag ctcgccacca
5701 gaagctttgg cagctcctca acttccggca ttacgggcga caatacgaca acatcctctg
5761 agcccgcccc ttctggctgc ccccccgact ccgacgctga gtcctattcc tccatgcccc
5821 ccctggaggg ggagcctggg gatccggatc ttagcgacgg gtcatggtca acggtcagta
5881 gtgaggccaa cgcggaggat gtcgtgtgct gctcaatgtc ttactcttgg acaggcgcac
5941 tcgtcacccc gtgcgccgcg gaagaacaga aactgcccat caatgcacta agcaactcgt
6001 tgctacgtca ccacaatttg gtgtattcca ccacctcacg cagtgcttgc caaaggcaga
6061 agaaagtcac atttgacaga ctgcaagttc tggacagcca ttaccaggac gtactcaagg
6121 aggttaaagc agcggcgtca aaagtgaagg ctaacttgct atccgtagag gaagcttgca
6181 gcctgacgcc cccacactca gccaaatcca agtttggtta tggggcaaaa gacgtccgtt
6241 gccatgccag aaaggccgta acccacatca actccgtgtg gaaagacctt ctggaagaca
6301 atgtaacacc aatagacact accatcatgg ctaagaacga ggttttctgc gttcagcctg
6361 agaagggggg tcgtaagcca gctcgtctca tcgtgttccc cgatctgggc gtgcgcgtgt
6421 gcgaaaagat ggctttgtac gacgtggtta caaagctccc cttggccgtg atgggaagct
6481 cctacggatt ccaatactca ccaggacagc gggttgaatt cctcgtgcaa gcgtggaagt
6541 ccaagaaaac cccaatgggg ttctcgtatg atacccgctg ctttgactcc acagtcactg
6601 agagcgacat ccgtacggag gaggcaatct accaatgttg tgacctcgac ccccaagccc
6661 gcgtggccat caagtccctc accgagaggc tttatgttgg gggccctctt accaattcaa
6721 ggggggagaa ctgcggctat cgcaggtgcc gcgcgagcgg cgtactgaca actagctgtg
6781 gtaacaccct cacttgctac atcaaggccc gggcagcctg tcgagccgca gggctccagg
6841 actgcaccat gctcgtgtgt ggcgacgact tagtcgttat ctgtgaaagc gcgggggtcc
6901 aggaggacgc ggcgagcctg agagccttca cggaggctat gaccaggtac tccgcccccc
6961 ctggggaccc cccacaacca gaatacgact tggagctcat aacatcatgc tcctccaacg
7021 tgtcagtcgc ccacgacggc gctggaaaga gggtctacta cctcacccgt gaccctacaa
7081 cccccctcgc gagagctgcg tgggagacag caagacacac tccagtcaat tcctggctag
7141 gcaacataat catgtttgcc cccacactgt gggcgaggat gatactgatg acccatttct
7201 ttagcgtcct tatagccagg gaccagcttg aacaggccct cgattgcgag atctacgggg
7261 cctgctactc catagaacca cttgatctac ctccaatcat tcaaagactc
//
LOCUS YSCREP1A 1498 bp ds-DNA PLN 25-MAY-1990
DEFINITION S.cerevisiae rep1 protein gene, complete cds.
ACCESSION M31942
KEYWORDS rep1 protein.
SOURCE S.cerevisiae (strain SKQ-2n) plasmid Scp3 DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1498)
AUTHORS Neuville,P., Bonneu,M. and Aigle,M.
TITLE 2 micron plasmid heterogeneity in the yeast Saccharomyces
cerevisiae: A new sequence for the rep1 gene
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Neuville, 13-FEB-1990.
FEATURES from to/span description
pept 295 1437 rep1 protein
BASE COUNT 513 a 245 c 315 g 425 t
ORIGIN
1 aagctttgac aaaaagcatt gagtagctaa ctcttctatg caatctatag ctgttttata
61 aggcatttaa tgacaaattg attgtttaga aacatactag tgaaagtagc cttaatcgcc
121 tctcgaaatt aataatgcag tatggtgtaa aaaatgcaac tcgcgttgct ctactttttt
181 ccgaatttcc gaatacgcag ctggggtgat tgatcgattt cgtatcaaaa gttttgttta
241 taaaaaccgc gaaatccttc ttcaacagat ggatttttac agtgctgaca tacaatgaca
301 acaactataa tagaaaataa ccgaaatatg aatggagaga gactacttgt ttgtattaag
361 caatgtatta tgcagcactt ccaacctatg atatacgatg aaagtagatg tgtaattgaa
421 accacaaggg gaactttttc agttcctgac aattataaga aatataaaac cttagcattt
481 gcttttgttg gacacgtact gaatacagat gacacaccgg taattgaaaa agaactggat
541 tggcctgatc ctgcactagt gtacaataca attgtcgatc gaatcagaaa tcacccagaa
601 ttatcacagt ttatatcggt tgcattcatt agtcaattaa aggcggccat aggggaggtt
661 agatatcaat gtaagaggca cgttaaaccg cagggaaagg gtataagaag gccgaacgta
721 ttttttagat acatggaatc cccgtttgtc aataccaagg ttactgactt cttttcttat
781 cttcgagact ataataaaat tgcctcagag tatcacaata atactaaatt cattctcacg
841 ttttcatgta aagcatattg ggcatctggc caaaactttt ccgccttgaa gaatgttatt
901 aggtgctcca tagttcatga atacatttct aggtttgtgg aaagagaaca gggtcttata
961 ggagatcagg agcaaccacg tgaagaaaac tcttcccctg aactaagtaa tgtgcaacat
1021 gaagtcaata atttaatcga acaagatgcc gcggcggatg aagggttgtg gagtgagata
1081 gattcattat gtgaaaaatg gcagtctgaa gcggaagacc aagctgaagc ggagataata
1141 gccgacagaa taattggaaa tagtcaaagg atggcgaacc ttaaaattcg tcgtacaaag
1201 ttcaaaagtg tcttgtatca tatactaaag gaactaattc aatctcaggg aaccgtaaag
1261 gtttatcgcg gtagtagctt ttcacacgat tctataaaga taagcttaca ttatgaagag
1321 cagcatatta cagccgtatg ggtctaccta acagtgaaat ttgaagagca ttggaaacca
1381 gttgatgtag aagtcgagtt taggtgcaag ttcaaggagc gaaaggtcaa tgagtaggtt
1441 atatagggat atagaagata tagaacatag caaagagctg cttttgagca atgtttgt
//
LOCUS MUSREN1C 758 bp ds-DNA ROD 25-MAY-1990
DEFINITION Mouse renin (Ren-1-c) gene, 5' end.
ACCESSION M34190
KEYWORDS renin.
SOURCE M.musculus (strain C57BL/10) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 758)
AUTHORS Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J.,
Pioli,D. and Brammar,W.J.
TITLE The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d,
and its upstream region
JOURNAL Gene 84, 91-104 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 729 > 758 renin (Ren-1-c), exon 1
pre-msg 701 > 758 Ren-1-c mRNA and introns
BASE COUNT 208 a 156 c 245 g 149 t
ORIGIN
1 tctagagtca ttgggctcag ccacccttcc cacaccccca tgcctgccac cactctgctc
61 tgtgaccagg tcctgcttat cctataccta cctagcttgg tctcacaggc taagatttat
121 cagagctgcc ctgccatggg ccttgttggg cacctggcgt ggggaaataa aggaggtggg
181 cgagactcac cagagctcag agagatgggc gggcagacag caggggaagg cactgggtgg
241 tctggcagct ggaaatgctg ggaggccttc ttgggggaga ttagataaag ctcttcaggg
301 gaaggcctat tccatgactc cagcatggtg agtctagatg aaaggaggta gtctatggtt
361 ttagagcttt attgtagaag agagagagag aaggtagaga agtagaagcc agtcattgcc
421 atgaagaagg aagggggaga aggagagcga aaggtaagag taagaaaaca agagctttag
481 gagagagaca agagagagag gaggaggcaa gcagccaggt aactctgggg gtggagtctg
541 gacagcctac atgactgatg gccacagaat tatggagctg ggtccttggc cagaaaacag
601 gctgcctttc atggtcccac aggccctggg gtaataaatc aaagcagagc ctgtgataca
661 tggtgtgtat aaaagaaggc tcagggggtc tgggctacac agctcttaga aagccttggc
721 tgaaccagat ggacagaagg aggatgcctc tctgggca
//
LOCUS MUSREN1D 11231 bp ds-DNA ROD 25-MAY-1990
DEFINITION Mouse renin (Ren-1-d) gene, complete cds.
ACCESSION M32352
KEYWORDS renin.
SOURCE M.musculus (strain DBA/2J) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 11231)
AUTHORS Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J.,
Pioli,D. and Brammar,W.J.
TITLE The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d,
and its upstream region
JOURNAL Gene 84, 91-104 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 1286 1377 renin (Ren-1-d), exon 1
4697 4847 renin, exon 2
5364 5487 renin, exon 3
6056 6174 renin, exon 4
6955 7151 renin, exon 5
8940 9059 renin, exon 6
9487 9631 renin, exon 7
9884 9982 renin, exon 8
10547 10708 renin, exon 9
pre-msg 1258 10887 Ren-1-d mRNA and introns
IVS 1378 4696 Ren-1-d intron A
IVS 4848 5363 Ren-1-d intron B
IVS 5488 6055 Ren-1-d intron C
IVS 6175 6954 Ren-1-d intron D
IVS 7152 8939 Ren-1-d intron E
IVS 9060 9486 Ren-1-d intron F
IVS 9632 9883 Ren-1-d intron G
IVS 9983 10546 Ren-1-d intron H
BASE COUNT 2809 a 2725 c 3051 g 2646 t
ORIGIN
1 ggtacttagg gagagtaata ctgccagcca gggtcaggtc acttcgacag ctctgccagg
61 cttctgagag gtacagggaa gaaggacagg gggcaaaagg ataggtctaa gggagtgaag
121 ggccagtagg agggggtcag ctggaagcca gctgcactta agacctcctt tcttttagag
181 ctacccttct gtccaccatc actaacccag gagggacacc caccctctcc ctgatgccag
241 aggttagatg acaggttact tttccaacta ggcagcagcc aaaaggaatc actcatgatc
301 aagatggaag actagggcag agcctaggag gagccagggg tgatttgaag gttaatatag
361 attaatagat aggttaatat ataatattag gttaatatgc aggtctcggg ggcatgagtt
421 ggtagggtct gagttagagg tttgaagcca gggtctatgc tgtctgtacc ctgacctcat
481 gctgaacagc acagctcctc attagcctct gtctaagcta gaggttgcgg ggccaggcca
541 aacagggact ctagagtcat tgggctcagc cacccttccc acacccccat gcctgccacc
601 actctgctct gcgaccaggt tctgcttatc ctatacctac ctaacttggt ctcacaggct
661 agaatttatc aggactgccc tgccatgggc cttgttgggc acctggcatg gggaaataaa
721 ggaggtgggc gagacctgtg tctcaccaga gctcagagtg actggatggg cagacagcag
781 gggaaggcac tgggtggtct ggcagctgga aacgctggga ggccttcttg ggggaaatta
841 gatgaagctc ttcaggggaa ggcctattcc atgactccag catggtgagt ctagatgaaa
901 ggaggtagtc tatggtttta gagctttatt gtagaagaga gagagagaag gtagagaagt
961 agatgccagc cattgccacg aagaaggaag ggggagaagg agagcaaaag gtaagagtaa
1021 gaaaacaaga gctttaggag agagacaaga gagagaggag gaggcaagca gccaggtaac
1081 tctgggggtg gagtctggac agcctacatg actgatggcc acagaattat ggagctgggt
1141 ccttggccag aaaacaggct gcctttcatg gtcccacagg ccctggggta ataaatcaaa
1201 gcagatcctg tgatacatgg tgtgtataaa agaaggctca gggggtctgg gctacacagc
1261 tcttagaaag ccttggctga accagatgga caggaggagg atgcctctct gggcactctt
1321 gttgctctgg agtccttgca ccttcagtct cccaacacgc accgctacct ttgaacggta
1381 acttgggcag agaaggggtg gcagggtaca ggaactggca tcttactaac gccctcagct
1441 gtctatgcgt tgggtcatcc agtccttttt ggccagccag tcagggatgt atgatcctgc
1501 tctgacaatc ctagggaata gcccaaagcc ctagtggacg gccataatta aggggtacca
1561 caacttcccc cttctactac ttttttaaaa ggcgaggggt cactggagtg actgtgtcat
1621 ttgtggtgaa aattttgggt ggagtggaag tatcagaaga aacatgtagg ccaaaggagt
1681 aacatgacta tcagttaatt aatctagata gacattaagt tttccttgtg atcatgtaac
1741 tccccccaag ttcagcagcc gtttggtttg gcgagagagt tcatctgtag ttggaagacc
1801 tcaagtctgt cctctgtggg gttcagctcg tgtctctgga tctcatgtgt gtcttgggag
1861 tcttctgtca gatgaggtgt ctggaggaag gcagctttgt ggtctgtagg tctcaggaga
1921 cctgtctatg ctggatctgc agctgcaggc actgcccttt acttttgtgg ggataaacct
1981 gtggggtaat cacagcagag gagcagaacc ctagcagcag gatgtctgca tagggtggat
2041 ggggaacgtt aagcctgcaa tggtttgctt ggaagcacat cctggcctgt tgttaggatc
2101 ctagaaaagc aagggtgttt gaggaatgta gtcctggaag gcatccttta agtctgcttt
2161 gggtggctaa ggaggaaaat atcatgttaa ttgtgtatct gggataggca gtaacagatt
2221 gtctgcaaaa ggaagctggt tgtcagttaa ggattggatg acggtaatca gcatggttaa
2281 ttcaacccag ggaagcacat tagatggctt ttggccctta aaggaaatca tgaatgctat
2341 ggagaaaaat aaacattaaa ataaagaagc aagaaaattt tggattgaaa agcttactag
2401 cagcagatga gggtaagcag caagctaatc tcaccaatcc tcagtccgcc attcaaaaac
2461 aggatgaaag cagagcctgc ttcaccggct gctttgaaga tttgatgaga ggataagcat
2521 agcacttccg atgccaggcg gggaccggtg taagaagcca tggttacagc atggattacc
2581 acagagtgta ggaaagggaa ggctcctggc agccccacga gggaggatga aacagccatt
2641 tagtgcagtt ccagggctgt aggaacaagg cttcttgagg caggagcagg gcagaggcaa
2701 gagaaggctc acccagcaat ggtctgggga ctatgatgag gaagtcctga aagagtcaag
2761 gcaatgagaa ctgtgaatct gcttggtgct taggagctgg tgggttgctt tcatcctaaa
2821 ttgagtctcc agacagacat cagcacccat gttgcagatg aggatgtcct ccagaaaggt
2881 ctgaggactc actggtcttt gtttccaaag ccctgccatc ttgctcccag catgcagcag
2941 agagcgtggg attaactaga gggcgtgggg ttctgtgctc cctgctctgt gtaaagtatt
3001 gagcaagtca ccttgagggg ctgagcaatg ggctttcttc tcaaggccat tacatgcctt
3061 gcatacacac aatgtccagc ctccacgctt agcctgagct gtgcctgggc gaaaaaacca
3121 tcaaggtgcc agaggacgaa gaggttagtg aatgctcaga accgtgcagc atgccctcag
3181 catcaggtgg ctctatcccc gttgtccctg taggctaggc ttgtggggag cactcagggg
3241 aaatggaccc attggctggc atccttggtg agagcaccta ggtatgggct caggtcttga
3301 aacattaagg atctaggaga ggaatagaag gggttttatg tttattttct tttgggacag
3361 agcctcatta tggagcctgg aactcactat atagaccaga ctggcctcaa acccacagag
3421 atcagccttc ctctgcctcc tgaggactga aattaatgtc atgccctacc acacaacaga
3481 gaggttttct aaaagaagga agggtcataa aagaaacagc aagcattggg tctttttcag
3541 gaaattggtg gagaactaga cctgtcttcc tcccttgaag agtggggaaa taaaacacac
3601 ccatctggtg agatggctgg ggtcaactaa cacaagagtg gaaatacttg caagccgtgc
3661 taggactaag tcatttctgc tggagattac tcggataggt gtgaaatatg ggcaaagctg
3721 ggaggtgagg cacggagtgc tgggaggaag gttagggctc acactggaga agaaagaggg
3781 ctctctggat gtctgggaaa gccctggctc aggactggtg gcatcttcct ccaactgcaa
3841 aacagtctgt gtggtcccgg cagtcctggg taggaaagac agacagacag agcttggact
3901 gtctctgttt ggcatttctg agtcatcttc atccttgtga accttcaatg tctccgccac
3961 caccgccgga aaccagggag ctagatgtca tgtggtttgt atgagggtgc gtgccaggca
4021 gctggcctgc ggccaggagt agctcaggga gctgctgttc taaatgaaat ctcttgaatt
4081 ttgccagtca ataatgattg gaggtgagga ggggcccagg attcaggccc aagacacatc
4141 agtccccagg tgaggcacac cggcaagcta tagacactgg cctatccaat ccaagtcaag
4201 ctaggttcat tgggcacata tcacctggct ctgtgatacc ccagggagca acaagtcctt
4261 gcagcctgct gtgggcatct gctgccactt acaggccatc cgtggtactg ccttgcccgt
4321 caatccaagg agatggcaag aggctgtata gaccttttat gttcctctcc agaacagtga
4381 ggaagagtag gtgttcaaca cacgatgatg ataatggtag atagatgaac agacggactg
4441 actgatgaac agacgggtgg atggacagac ggatggacgg atgaacaaac tgctttatct
4501 aactaaaata agatctggga attacttagt accagaataa gtgactccaa gaaaccccag
4561 ggaactaaaa gtgattcgta attgcaatgg gagagggtta gtgttagacg taccaaaaag
4621 caaggcaaca tctgggctct gtcaacgttg gagcactgca cgactcttct gtttttcttt
4681 ctctaccctg gagcagaatc ccactcaaga aaatgccctc tgtccgggaa atcctggagg
4741 agcggggagt ggacatgacc aggctcagtg ctgaaagggg cgtattcaca aagaggcctt
4801 ccttgatcaa tcttacctcc cccgtggtcc tcaccaacta cctgaatgtg agtcctggct
4861 catctgctat ctccttttct ctctgggact agcaggctcc cactccttcc acaccatgtt
4921 tgactgagag ccaaagagca ccccatagaa ctcctattta ccagccgaga agtctgatgg
4981 ggaggggctg ttgtgattgt tttgttgttt ggtttggttt gtttttgaga cagggtctca
5041 ctgtgtagcc tggcaagcct ggaactcact ttgtagacca gactgacctc gaactcacag
5101 agttctacct acctctgcct cccaggtgct gggattaaag catgtgacat catgcccagc
5161 aattttaatt gctttgatta agcttgcccc agcaagcctc ctgtatccca ggccagcaaa
5221 agacaatgct gagtcctcac atctcagcaa aaagaaagcc aagagccgag aaggggtcca
5281 gacaaccccc caaaccagag gccccaagcc ctgaagtcac ccccatctct ctgggcccca
5341 ttttcctttt gcccccgcca cagacccagt actacggcga gattggcatc ggtaccccac
5401 cccagacctt caaagtcatc tttgacacgg gttcggccaa cctctgggtg ccctccacca
5461 agtgcagccg cctctacctt gcttgtggta agagtcaaga gtcagcctcg tgcctctccc
5521 cagctctgcc tgctgcacct atgacctgct tctcctcagc tccacaccac tgactgagga
5581 agagagagga agtgtctctc ttctttttga aatggcagcc aagttaataa gccagctttg
5641 ggaatgcctg ggcacctgga aggccttttg ccatgtccca aggattcaga caggaacaaa
5701 gtgtaaaccc actccaggtc ttgaggcttt ttcgatgctc ccatgctgag tacttactct
5761 gtgcccagaa tcctctttag cactatatca gtgttagcaa tggctttaga actctgtgaa
5821 aaaaatgtct cagaaagcac gagcacataa gacgaaaaca aaaaaaacca ctatccttgc
5881 ctggtttaca tttttccaag ctggagactc aggaaactaa acacacacac acacacacac
5941 acacacacac acacacacct aagtcttcaa ccatgtttat catttgtcct gcctctgagc
6001 cttctatgta aatcaatctc tcttgtgctg actggggcct gtaaccctct gctagggatt
6061 cacagcctct atgagtcctc tgactcctcc agctacatgg agaacgggtc cgacttcacc
6121 atccactacg gatcaggaag agtcaaaggc ttcctcagcc aggacgtggt gactgtgagt
6181 aggatcgcct ctatttacca cagggcccat gtggttccag acaggtatac tccagccctc
6241 ctccctaccc ctcccactcc tctgcaggag cctgggaagt tgaagtgcag cctccttggc
6301 agcttcttct agctgaagct ccgaagccaa ttgcaggaga gccaaacctg ctgacactga
6361 tgcacaaggc tttgcatcta acttgcatgt ggatgtgtgc ctaaacccca cagtttcctt
6421 cctagtaacc agtctgactt tgctaattga gaccaggttt ggagtactga gattaggaat
6481 caggggtttg tgtccgggca gttttgattt ctttccttca gtgtctaagc ctgttctggc
6541 atatctcagc cccgctgtgt cagaggaaga gaaagacaca tacagaggtc agaggaaatc
6601 agtgcgagcc cttccctcgc actgcccccg cggaatctgg ggagggcaga aggagccctg
6661 gtaatgacga ggcatcctca gcctcggctg gtgggctagc acgaaagcgg gcctctagga
6721 gagttccggt cacctgtctg gcaggacctc tttctcagcc tttatagcat gagtcacccc
6781 catctctctg ggccccaagc atagaggtgt tgtcagtgtg tgtgcagact tgacggaata
6841 gtcttaactg ctgtacagag cgtgtgtatc tctgctgccc tggtatgaaa gccagagcag
6901 atcccagcca aggttaggca gtaacctatg tcccttctgg ttcttctctc acaggtgggt
6961 ggaatcactg tgacacagac ctttggagag gtcaccgagc tgcccctgat ccctttcatg
7021 ctggccaagt ttgacggtgt cctaggcatg ggctttcctg ctcaggccgt tggcggggtt
7081 acccctgtct ttgaccacat tctctcccag ggggtgctga aggaggaagt gttctctgtc
7141 tactacaaca ggtgggcctt tgacaccaag gccagggcag ggaggggtgg agagcaggga
7201 gagcagccac cgtccagttc aggatttata tgcttctaat taggtggttc tcagtcccca
7261 tctagaacca cagtagccaa ttagaacaaa tggaagtcaa gcattcggtc cccagggact
7321 ggccacacag acacagccag cttcatgcct ttgtttatct gttctcattg atccggttag
7381 tgctctgtta ccatgtgcta tggtaggctc caggggtgga cagtgaacag acgacccctg
7441 ccctccatcc tcacagagct cactgcccag caggggagat gagaggtaca acacaatgag
7501 tgccattcta aagctgcaga tggttttcag ttctaggcca ggattctggg cagttcagcc
7561 aggcctgggg aaactgtggt cagtgcccca gcaggggtgg gatctgctgc cagtggctgg
7621 gtttagaaat atggtctaag ctggaggtaa gatctttacg ggtaagaata gaggccaggg
7681 gtggtgagag tggaagggag tggtgtcctg gaacaatctt agaggaagca ggagtctctg
7741 gatcatgact ctggcagact tctgtgcaaa gaggggcatt gagccaatga caccactcgg
7801 aggctcctgc agtcaatggt cttgggactg tgtttcaaag agaccagaaa aacaaagtgt
7861 ctaagagtgg aagtagatgt cagaaagggg gaggggggac tccgcaaaat gtgggcagtt
7921 ctggaaattc aacagggaag gagatgtggg gcagagaagc gggagagggg aggtgggtgc
7981 tctaaacgct agaggatgct tgagcaatgc tgtgggtgga agaaggagag ctgtggaaaa
8041 cacatgcatg aggctgagaa gcaggagccc tggaagggat gaggacccag caacaggtag
8101 caagcagtcc tgggaagcag agacctgctt cctttccctt gggctgggaa gaaaggaaga
8161 tctagataag tcactgactc ttgcacagaa cctgtgtaca cctaacccat gtacatatgt
8221 gcacatacaa atgtgtgttg tgaattctaa gaaactccat catatacaca tgagtatctt
8281 aaaaccaccc aaggcaatcc tgagcttagg tagcatcatg cccgtggaga gagaagtgag
8341 ggtagtcagc tctggctggg aaggagagcc tggctccagc catttggtga ctcagttgcc
8401 tggggggggg gggtgtgaaa attaacctct accaaacaag ggttgctgat tctttcctac
8461 gaagtctccc ctctgtgtcc ctctttgcct cagcagaaat ctcaaggagg gaggagcccc
8521 atatggcccc aaagctccaa tacttcccag acccttaccc tggaatgata cccaccaccc
8581 acacactcca ttcatccagg cccaactcca ctaaatgtct gccttctccc tagtgtccac
8641 atgcccagga aagacgtgat ttgccacttg tctgcatcta cccaatgtcc ttgagctaga
8701 cagttttctg ggtgcaggtg aacagtccta gaaatacttc tgttacattt agggaggatc
8761 cagcaaggga agatgcaatg gggtctgtgt gcctcctgat gacagcttac agccttgcca
8821 tctacccact aaacccttcc tctatcccaa agtcctctgc cctaccccac accagactct
8881 actctccttc atccctgtat ctcgggtatg tctctaactt ccaatgttct ttcctttagg
8941 ggttcccacc tgctgggggg cgaggtggtg ctaggaggca gtgacccgca gcattaccaa
9001 ggcaattttc actatgtgag catcagcaag actgactcct ggcagatcac gatgaagggg
9061 tgggtcagca agcgtaactc tcccaaccat gcaaactgat gtcactgtgt gattggggtg
9121 ggctgggaat gggatgaggg ggacacttct tagggcacag ccaccatctg cctctctgct
9181 tgctctgtcc taggatgggg gcatgaaaat ttgaggatgg gaaggaaggg actcagggaa
9241 aggaggggga gtccggaggc tacatgctga tgcaaggaac agttagtacc tttccatagt
9301 gtctcggccc tcttccagtt catcatcaga ccatgtatgg cctcctatca cactacccta
9361 aattgggaaa ggcatgcagg cggcgggaaa agggtgcccg acagcttcta cctgcctgcc
9421 ttgtctgtct gtctgactgt ctgcctgcct gcctgcctgc ctgccagtct aactatagct
9481 ctgcagggtg tctgtggggt cttccaccct gctatgtgaa gaaggctgtg cggtagtggt
9541 ggacactggt tcatccttta tctcggctcc tacgagctcc ctgaagttga tcatgcaagc
9601 cctgggagcc aaggagaaga gaatagatga agtaagagat ccgtggggag ggctgggtgg
9661 tggtgagggg tggcagcagc actgtgcggc tccagcctgc aaatcgtcca gacagcactg
9721 gatcggaaac aaagctttcc cctcctcccc tctcccaccc cctgaatgta tggcatgtct
9781 tccatagaag gtgaccagag actggcacag gctctctggc acagggtgct ctagtctgct
9841 tcgaagcttt ccaccctgac aacccctctc tttattcccc cagtatgttg tgaactgtag
9901 ccaggtgccc accctccccg acatttcctt tgacctggga ggcagggcct acacactcag
9961 cagtacggac tacgtgctac aggtgaggct gggacaaggg tgggcagggg cactgaaggg
10021 ggacaagtgt aatgtacaga tcccccagta gactcgtggt tgatgaatac ggctcatgtc
10081 atcatctgga gaatttttag cagtcccagt acccaggcca catacaagtc tcgcctagca
10141 gatcggagct tccacaacag gcatctggga atcagagttt taaaacgact tccccagtag
10201 gttccattat gaagctctgc tgtgactcag tgagccccct ctctccatta ttgagtagcc
10261 ctctttctag tcctccatcc ttttcatgct cccaggctta gagaagttca atgacttgtt
10321 ttgggtcaga agctaagtga gcccgtgagc aaggggtaga cacaccattt gcctgatacc
10381 tactacagtg cctgcctacc ccaggaaggc tgggatttag gataggtaaa acagttctgg
10441 gaggatgtag gagagaggag gcatatactg aagagctgtt gcttcactct ggaggtggga
10501 atgcgttgtg tggcgggcag tagagcaagc ctctgcttct tgccagtatc cctacaggag
10561 agacaagctg tgcacactgg ctctccatgc catggacatc ccaccaccca ctgggcctgt
10621 ctgggtcctg ggtgccacct tcatccgcaa gttctataca gagtttgatc ggcataacaa
10681 tcgcattgga ttcgccttgg cccgctaagg ccctctgcca cccagtaacc ctaggccaag
10741 ccaagctggc actcctgggg gccattttgt ctggctttgt ccccaacata gggacactgg
10801 acacagagac cctaacgagt gtttgcccct tcacctgcac tcacccttcc ctgctttaag
10861 gaaaaaccga ataaagattt catgtttaaa gcctgtttcg gatgggttct ttggagtttg
10921 gaggaggtac acctgcatgg tcacacgtgg agaagactat gacacgtgag gccagagatg
10981 gaggtttgtt agtaaagtgc ttgcttttca agtgtgagaa cctgagtttg attccaggaa
11041 cccacactaa aaccaacaaa caaacaaaca gggctggaga gatggctcag tagttaagtg
11101 cactgagtgt tcttcctgat gtccttgcgt tcaattccca gcaacacacg tggtggctca
11161 caaccatgtg tatggggtct gatgccctct tctggcatgt ctaagagagt gataatgtac
11221 tcctaattaa t
//
LOCUS MUSREN2D 1116 bp ds-DNA ROD 25-MAY-1990
DEFINITION Mouse renin (Ren-2-d) gene, 5' end.
ACCESSION M34191
KEYWORDS renin.
SOURCE M.musculus (strain DBA/2J) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1116)
AUTHORS Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J.,
Pioli,D. and Brammar,W.J.
TITLE The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d,
and its upstream region
JOURNAL Gene 84, 91-104 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 1087 > 1116 renin (Ren-2-d), exon 1
pre-msg 1059 > 1116 Ren-2-d mRNA and introns
BASE COUNT 309 a 248 c 303 g 256 t
ORIGIN
1 tctagagtca ttgggctcag cacccttccc acacccccat gcctgccacc actctgctct
61 gcgaccagtg tcttgcttat cctatacctc acctaagcta catgccataa atcaatgctt
121 gttattcaag gcatcatctc ctttgggggg cacttctcaa aaagtcctct cctgtcccta
181 gatatctgag actcctcagt ggccttccat ttagaagggt gttgtctctg aagtaaaaac
241 ttcagttcct ctcacaggct aagatttatc agagctgcct gccatgggcc ttgttggcac
301 tggcgtggga ataagggggt ggcgagactg tgtcccgcag agctcagaga ctgggcaggc
361 agacagcagg agaagacact gggtggtctg gcagctggaa acgctgggag gccttcttgc
421 gggagattag atgaggctat tcaggggaag gcctattcca tttctccagc atggtgagtc
481 tagatgaaaa gaggtagtga aatagaggcc agccattgcc acaaagaagg taagagtaag
541 agaacaagag cttaagaaag agccaagatg ggctggagag atggctcagt gggtaagagc
601 acccgactgc tcttccaaag gtccgaagtt caaatcccag caaccacatg gtggctcaca
661 accatctcat aatgagatct gactccctct tcttggagtg tctgaagaca gctacagtat
721 atttacatat aataaaaata aatttaaaaa ataaaaaaat aaaaataaat ttaagagaga
781 gagagaggag gagccaagca gccccattta tagtgggctg ggctaccttg ctgttgtagg
841 gtaactgtgg ggaggagcaa acctggctat tgccaggtaa ctctgggggg tggagttacc
901 ttgactactt gactgatggc cacagaatga tggaggtgga tccttggcca gaaaacaggc
961 tgcgtttcat ggtcccacag gccctggggt aataaatcag agcagagcct gtgatacatg
1021 gtgtgtataa aagaaggctc agggggtctg ggctacacag ctcttagaaa gccttggctg
1081 aaccagatgg acaggaggag gatgcctctc tgggca
//
LOCUS HUMASPX 7787 bp ss-mRNA PRI 25-MAY-1990
DEFINITION Human nonerythroid alpha-spectrin (SPTAN1) mRNA, complete cds.
ACCESSION J05243
KEYWORDS alpha-fodrin; nonerythroid alpha-spectrin.
SOURCE Human lung fibroblast cell line WI38, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 7787)
AUTHORS Moon,R.T. and McMahon,A.P.
TITLE Generation of diversity of nonerythroid spectrins: Multiple
polypeptides are predicted by sequence analysis of cDNAs
encompassing the coding region of human nonerythroid alpha-
spectrin
JOURNAL J. Biol. Chem. 265, 4427-4433 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.T.Moon, 12-DEC-1989.
FEATURES from to/span description
pept 103 7521 nonerythroid alpha-spectrin
/hgml_locus_uid="LK0022P" /nomgen="SPTAN1"
/map="9q33-q34"
BASE COUNT 2147 a 1926 c 2239 g 1475 t
ORIGIN
1 gaattcgggg aacggtgtgg agcggaggcc gcggaggctc ctcggtcctt cagcacccct
61 cggcccgacg cacccacgcc cctcaccccc cgagagccga aaatggaccc aagtggggtc
121 aaagtgctgg aaacagcaga ggacatccag gagaggcggc agcaggtcct agaccgatac
181 caccgcttca aggaactctc aacccttagg cgtcagaagc tggaagattc ctatcgattc
241 cagttctttc aaagagatgc tgaagagctg gagaaatgga tacaggaaaa acttcagatt
301 gcatctgatg agaattataa agacccaacc aacttgcagg gaaagcttca gaagcatcaa
361 gcatttgaag ctgaagtgca ggccaactca ggagccattg ttaagctgga tgaaactgga
421 aacctgatga tctcagaagg gcattttgca tctgaaacca tacggacccg tttgatggag
481 ctgcaccgcc agtgggaatt acttttggag aagatgcgag aaaaaggaat caaattgctg
541 caggcccaga acttggtgca gtacttacga gaatgtgagg acgtgatgga ctggatcaat
601 gacaaggaag caattgttac ttctgaagag ctgggccagg atctggagca tgtagaggtt
661 ttacagaaga aatttgaaga gtttcaaaca gatatggctg ctcatgaaga aagagttaat
721 gaagtgaacc agtttgctgc caaactcata caggagcagc accctgagga ggaactgatc
781 aagactaagc aggatgaagt caatgcagcc tggcagcggc tgaagggcct ggctctgcag
841 aggcagggga agctctttgg ggcagcagaa gttcagcgct ttaacaggga tgtggatgag
901 actatcagtt ggattaagga aaaggagcag ttaatggcct ctgatgattt tggccgagac
961 ctggcaagtg ttcaggctct gcttcggaag cacgagggtc tggagagaga tcttgctgct
1021 ctagaagaca aggtcaaagc cctgtgtgct gaggctgacc gcctgcaaca gtcccaccct
1081 ctgagtgcaa cacagattca agtgaagcga gaggaactga ttacaaactg ggagcagatc
1141 cgcaccttgg cggcagagag acatgcacgg ctcaatgatt catacaggct tcaacgcttc
1201 cttgctgact tccgtgacct caccagctgg gtgactgaga tgaaagccct catcaatgca
1261 gatgagcttg ccagtgatgt ggctggggct gaagccctgc tagatagaca ccaagagcac
1321 aagggtgaaa ttgatgccca tgaagacagc ttcaaatctg cagatgaatc tggacaggca
1381 ctgcttgctg ctggtcacta tgcctcagat gaagtgaggg agaagctgac cgtcctttcc
1441 gaggagagag cggcgctgct ggagctgtgg gagctgcgca ggcagcagta cgagcagtgc
1501 atggacctgc agctcttcta ccgggacact gagcaggtgg acaactggat gagcaagcag
1561 gaggcgttcc tgttgaatga agacttggga gatttcttgg atagtgtgga agcgcttctt
1621 aagaagcacg aagactttga gaaatccctt agtgcccagg aggaaaagat tacagcatta
1681 gatgaatttg caaccaagct aattcagaac aaccactatg caatggaaga tgtggccact
1741 cgccgagacg ctctgttgag ccgccgcaat gcccttcacg agagagccat gcgtcgccgg
1801 gcccagctag ccgattcttt ccatctgcag cagtttttcc gtgattctga tgagctcaag
1861 agttgggtga atgagaagat gaaaactgcc acagatgaag cttataaaga tccatccaac
1921 ctacaaggaa aagtacagaa gcatcaggct tttgaggctg agctctcagc aaaccagagc
1981 cgaattgatg ccttggagaa agctggccaa aagctgattg atgtcaacca ctatgccaag
2041 gatgaagtgg cagctcgtat gaatgaggtg atcagtttgt ggaagaaact gctagaggcc
2101 actgaactga aaggaataaa gcttcgtgaa gccaaccagc aacagcaatt taatcgcaat
2161 gttgaggata ttgaattgtg gctatatgaa gtagaaggtc acttggcttc ggatgattac
2221 ggcaaagatc ttaccaatgt gcagaacctc cagaagaaac atgccctgct agaggcagat
2281 gtggctgctc accaggaccg aattgatggc gtcaccattc aggcccgcca gttccaagat
2341 gctggccatt ttgatgcaga aaacatcaag aagaaacagg aagccctcgt ggctcgctat
2401 gaggcactca aggagcccat ggttgcccgg aagcagaagc tggccgattc tctgcggttg
2461 cagcagctct tccgggatgt tgaggatgag gagacgtgga ttcgagagaa agagcccatt
2521 gccgcatcta ccaacagagg taaggattta attggggtcc agaatctgct aaagaaacat
2581 caagccttac aagcagaaat tgctggacat gaaccacgca tcaaagcagt tacacagaag
2641 gggaatgcca tggtggagga aggccatttt gctgcagagg atgtgaaggc caagcttcac
2701 gagctgaacc aaaagtggga ggcactgaaa gcaaaagctt cccagcgtcg gcaggacctg
2761 gaggactctc tgcaggccca gcagtacttt gctgatgcta acgaggctga atcctggatg
2821 cgggagaagg aacccattgt gggcagcact gactatggca aggacgaaga ctctgctgag
2881 gctctactga agaaacacga agctttgatg tcagatctca gtgcctacgg cagcagcatc
2941 caggctttgc gagaacaagc acagtcctgc cggcaacaag tggcccccac ggatgatgag
3001 actgggaagg agctggtctt ggctctctac gactatcagg agaagagtcc ccgagaggtc
3061 accatgaaga agggagatat ccttacctta ctcaacagca ccaacaagga ttggtggaaa
3121 gtggaagtga acgatcgtca gggttttgtg ccggctgcgt acgtgaagaa attggacccc
3181 gcccagtcag cctcccggga gaatctcctg gaggagcaag gcagcatagc actgcggcag
3241 gagcagattg acaatcagac acgcataact aaggaggccg gcagtgtatc tctgcgtatg
3301 aagcaggtgg aagaactata tcattctctg ctggaactgg gtgagaagcg taaaggcatg
3361 ttggagaaga gttgcaagaa gtttatgttg ttccgtgaag cgaatgaact acagcaatgg
3421 atcaatgaga aggaagccgc tctgacaagt gaggaggtcg gagcagactt ggagcaggtt
3481 gaggtgctcc agaagaagtt tgatgacttc cagaaggacc tgaaggccaa tgagtcacgg
3541 ttgaaggaca ttaacaaggt agctgaagac ctggagtctg aaggtcttat ggcagaggag
3601 gtgcaggctg tgcaacaaca ggaagtgtat ggcatgatgc ccagggatga aactgattcc
3661 aagacagcct ccccgtggaa gtctgctcgt ctgatggttc acaccgtggc cacctttaat
3721 tccatcaagg agctgaatga gcgctggcgg tccctacagc agctggccga ggaacggagc
3781 cagctcttgg gcagcgccca tgaagtacag aggttccaca gagatgctga tgaaaccaaa
3841 gaatggattg aagagaagaa tcaagctcta aacacagaca attatggaca tgatctcgcc
3901 agtgtccagg ccctgcaacg caagcatgag ggcttcgaga gggaccttgc ggctctcggt
3961 gacaaggtaa actcccttgg tgaaacagca gagcgcctga cccagtccca tcccgagtca
4021 gcagaagacc tgcaggaaaa gtgcacagag ttaaaccagg cctggagcag cctggggaaa
4081 cgtgcagatc agcgcaaggc aaagttgggt gactcccacg acctgcagcg cttccttagc
4141 gatttccggg acctcatgtc ttggatcaat ggaatacggg ggttggtgtc ctcagatgag
4201 ctagccaagg atgtcaccgg agctgaggca ttgctggagc gacaccagga acaccggaca
4261 gaaatcgatg ccagggctgg cactttccag gcatttgagc agtttggaca gcagctgttg
4321 gctcacggac actatgccag ccctgagatc aagcagaaac ttgatattct tgaccaggag
4381 cgtgcagacc tggagaaggc ctgggttcag cgcaggatga tgctggatca gtgccttgaa
4441 ctgcagctgt tccatcggga ctgtgagcaa gctgagaact ggatggctgc ccgggaggcc
4501 ttcttgaata ccgaagacaa aggagactca ctggacagcg tagaggctct gatcaaaaaa
4561 catgaagact ttgacaaagc gattaacgtc caggaagaga agattgctgc tctgcaggcc
4621 tttgccgacc agctcatcgc tgccggccat tatgccaagg gagacatttc tagccggcgc
4681 aatgaggtct tggacaggtg gcgacgtctg aaagcccaga tgattgagaa aaggtcaaag
4741 ctaggagaat ctcaaaccct ccaacagttc agccgggatg tggatgagat tgaggcttgg
4801 atcagtgaaa aattgcaaac agcgagtgat gagtcgtaca aggatcccac caacatccag
4861 agcaagcacc agaagcacca ggcttttgaa gcagagctgc atgccaacgc tgaccggatc
4921 cgtggggtta tcgacatggg caactccctc attgaacgtg gagcctgtgc cggcaatgag
4981 gatgctgtca aggcccgcct ggctgcctta gctgaccagt ggcaattctt ggtgcaaaag
5041 tcagcggaaa agagccagaa actgaaagaa gccaacaagc agcagaactt caacacaggg
5101 atcaaggaca ttgcattctg gctgtctgag gtggaggccc tgctggcatc cgaagattat
5161 ggcaaagacc tggcttctgt gaacaacctg ctgaaaaagc atcaactgct ggaagcagat
5221 atatctgccc atgaggatcg cctgaaggac ctgaacagcc aggcagacag cctgatgacc
5281 agcagtgcct tcgacacctc ccaagtaaag gacaagaggg acaccatcaa cgggcgcttc
5341 cagaagatca agagcatggc ggcctcccgg cgagccaagc tgaatgaatc ccatcgcctg
5401 caccagttct tccgggacat ggatgacgag gagtcctgga tcaaggagaa gaagctgctg
5461 gtgggctcag aggactacgg ccgggaccta actggcgtgc agaacctgag gaagaagcac
5521 aagcggctgg aagcagaact ggctgcgcat gagccggcta ttcagggtgt cctggacact
5581 ggcaagaagc tgtccgatga caacaccatc gggaaagagg agatccagca gcggctggcg
5641 cagtttgtgg agcactggaa agagctgaag cagctggcag ctgcccgggg tcagcggctg
5701 gaagagtcct tggaatatca gcagtttgta gccaatgtgg aagaggaaga agcctggatc
5761 aatgagaaaa tgaccctggt ggccagcgaa gattatggcg acactcttgc cgccatccag
5821 ggcttactga agaaacatga agcttttgag acagccttca ccgtccacaa ggatcgcgtg
5881 aatgatgtct gcaccaatgg acaagacctc attaagaaga acaatcacca tgaggagaac
5941 atctcttcaa agatgaaggg cctgaacggg aaagtgtcag acctggagaa agctgcagcc
6001 cagagaaagg cgaacgtgga tgagaactcg gccttccttc agttcaactg gaaggcggac
6061 gtggtggagt cctggatcgg tgaaaaggag aacagcttga agacagatga ttatggccga
6121 gacctgtctt ctgtgcagac gctcctcacc aaacaggaaa cttttgacgc tgggctgcag
6181 gccttccagc aggaaggcat tgccaacatc actgccctca aagatcagct tctcgccgcc
6241 aaacacgttc agtccaaggc catcgaggcc cggcacgcct ccctcatgaa gaggtggagc
6301 cagcttctgg ccaactcagc cgcccgcaag aagaagcttc tggaggctca gagtcacttc
6361 cgcaaggtgg aggacctctt cctgaccttc gccaaaaagg cttctgcctt caacagctgg
6421 tttgaaaatg cagaggagga cttaacagac cccgtgcgct gcaactcctt ggaagaaatc
6481 aaagctttgc gcgaggccca cgacgccttc cgctcctccc tcagctctgc ccaggctgac
6541 ttcaaccagc tggccgagct ggaccgccag atcaagagct tccgcgtagc ctccaacccc
6601 tacacctggt ttaccatgga ggccctggag gagacctgga ggaacctaca gaaaatcatc
6661 aaggagaggg agctggagct gcagaaggaa cagcggcggc aggaggagaa cgacaagctg
6721 cgccaggagt ttgcccagca cgccaacgcc ttccaccagt ggatccaaga gaccaggaca
6781 tacctcctcg atgggtcctg tatggtggaa gagtcgggga ccctcgaatc ccagcttgaa
6841 gctaccaaac gcaagcacca ggaaatccga gccatgagaa gtcagctcaa aaagatcgag
6901 gacctggggg ccgccatgga ggaggccctc atcctggaca acaagtacac ggagcacagc
6961 accgtgggcc tcgcccagca gtgggaccag ctggaccagc tgggcatgcg catgcagcac
7021 aacctggagc agcagatcca ggccaggaac acaacaggtg tgactgagga ggccctcaaa
7081 gaattcagca tgatgtttaa acactttgac aaggacaagt ctggcaggct gaaccatcag
7141 gatggcaaat cttgcctgcg ctccctgggc tatgacctgc ccatggtgga ggaaggggaa
7201 cctgaccctg agttcgaggc aatcctggac acggtggatc cgaacagaga tggccatgtc
7261 tccttgcaag aatacatggc tttcatgatc agccgcgaaa ctgagaacgt caagtccagc
7321 gaggagattg agagcgcctt ccgggccctc agctcagagg gaaagcctta cgtgaccaag
7381 gaggagctct accagaacct gacccgggaa caagccgact actgcgtctc ccacatgaag
7441 cccatcgtgg acggcaaggg ccgcgagctc cccaccgcgt tcgactacgt ggagttcacc
7501 cgctcgcttt tcgtgaactg agccactccc tgggtcaccc acccctcgct gcttgccctg
7561 cgtcgccttg ctgcatgtcc gctcctctgt gtgctctcac tttccactgt aaccttaagc
7621 ctgcttagct tggaataaga cttaggagaa aatggtgctt cactaacccg cttccggtcc
7681 agtcacaatc atcatgtcac tgtgggaccc agatctgtgt cttgaagcag ctgccctcat
7741 tccgacttca gaaaatcgaa gcagctggcg cctccccttc ggaattc
//
LOCUS CIPFNRA 1419 bp ss-mRNA PLN 25-MAY-1990
DEFINITION M.crystallinum ferredoxin-NADP+ reductase (fnrA) mRNA, complete
cds.
ACCESSION M25528
KEYWORDS ferredoxin-NADP+ reductase.
SOURCE M.crystallinum, cDNA to mRNA, clone lambdaMC-fnr1.
ORGANISM Mesembryanthemum crystallinum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Aizoaceae.
REFERENCE 1 (bases 1 to 1419)
AUTHORS Michalowski,C.B., Schmitt,J.M. and Bohnert,H.J.
TITLE Expression during salt stress and nucleotide sequence of cDNA for
ferredoxin-NADP+ reductase from Mesembryanthemum crystallinum
JOURNAL Plant Physiol. 89, 817-822 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.J.Bohnert, 13-JUN-1989.
FEATURES from to/span description
pept 75 1172 ferredoxin-NADP+ reductase precursor (fnrA; EC
1.6.7.1)
sigp 75 230 ferredoxin-NADP+ reductase signal peptide
matp 231 1169 ferredoxin-NADP+ reductase
mRNA 45 1395 fnrA mRNA (alt.)
mRNA < 1 1419 fnrA mRNA (alt.)
BASE COUNT 416 a 283 c 322 g 398 t
ORIGIN
1 tttttttttt ttctggttcc tcttcttctt cttcttcttc ttctgcaaag atatcttctt
61 caatcagctc agcaatggct gccgcagtaa ctgccgcagt ctctttccct tctactaaat
121 ccactcccct ttctaccaga acatcttccg tcatcaccca tgagaaaatt aacttcaaca
181 aggttcctct atactacagg aatgtatcag ttggtgggaa agtgggaacc attagggcag
241 tagcctcaga tgttgaggca cctgttgcaa aggtggagaa gcattcaaag aaaatggagg
301 aaggagtgat agtcaacaag tacaagccca agaaccctta cactggcaga tgcctcctca
361 acactaagat tactggagat gatgctcctg gagagacctg gcacatggtt ttctcccatg
421 aaggagagat tccctacaga gaggggcagt ctgttggagt gattcccgaa ggtattgaca
481 agaatggaaa gccccacaag ttgagattgt actcaattgc cagcaggccc ttgggcgact
541 ttggtgactc taaaactgtt tccttgtgtg taaaacgact tatctacacc aatgacaacg
601 gagaaatagt caagggtgtt tgctcaaact ttttgtgtga tttgaaacct ggtagtgaag
661 tagtgctaac tggaccagtt gggaaggaaa tgctgatgcc aaaagaccct aatgccacaa
721 ttatcatgct tgcaactgga actggtattg ctcctttccg ttcgttcttg tggaaaatgt
781 tctttgagaa gcacgatgac tacaagttca atggtttggc atggctcttc ttgggtgtac
841 ccactagcag ttctttgtta tacaaggagg aatttgagaa gatgaaggag aaagctccag
901 aaaacttcag gttggatttc gctgtgagca gagaacaaac taatgagaaa ggggagaaaa
961 tgtacatcca aacccgtatg gcacaatatg atagagagct atgggaattg ctcaagaaag
1021 acaacactta cgtctacatg tgtgggctga aaggcatgga gaaaggaatc gatgacatta
1081 tggtttcatt ggccgcagaa gatggtattg actggttcga ctacaagaag cagttgaaga
1141 aggcagaaca gtggaacgtg gaagtctact gataaacttg tacaaaactc tcttctttct
1201 ttttgtgaag cagaatatat ctatttgtca tctcccactg caccttgtag ataggtcagt
1261 ttcttttcga tattgttatg aatatcagct aattgcattg tagaagctat agccaaaaaa
1321 gctctgtttt ggagtactca tgtatcgcgt aattatactg gaaaccagtg tcaacacttc
1381 tacactgaga aattcatgag gatgaacaca ttatgctca
//
LOCUS HUMCAMPPK 3036 bp ss-mRNA PRI 25-MAY-1990
DEFINITION Human cAMP-dependent protein kinase type I-alpha subunit, complete
cds.
ACCESSION M33336
KEYWORDS cAMP-dependent protein kinase;
cAMP-dependent protein kinase type I; protein kinase.
SOURCE Human testis, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3036)
AUTHORS Sandberg,M., Skalhegg,B. and Jahnsen,T.
TITLE The two mRNA forms for the type I-alpha regulatory subunit of
cAMP-dependent protein kinase from human testis are due to the use
of different polyadenylation site signals
JOURNAL Biochem. Biophys. Res. Commun. 167, 323-330 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 88 1233 protein kinase type I-alpha subunit
/nomgen="PRKAR1" /map="7p13-qter"
/hgml_locus_uid="LW0133F"
BASE COUNT 821 a 552 c 682 g 981 t
ORIGIN
1 gctgggagca aagcgctgag ggagctcggt acgccgccgc ctcgcacccg cagcctcgcg
61 cccgccgccg cccgtcccca gagaaccatg gagtctggca gtaccgccgc cagtgaggag
121 gcacgcagcc ttcgagaatg tgagctctac gtccagaagc ataacattca agcactgctc
181 aaagattcta ttgtgcagtt gtgcactgct cgacctgaga gacccatggc attcctcagg
241 gaatactttg agaggttgga gaaggaggag gcaaaacaga ttcagaatct gcagaaagca
301 ggcactcgta cagactcaag ggaggatgag atttctcctc ctccacccaa cccagtggtt
361 aaaggtagga ggcgacgagg tgctatcagc gctgaggtct acacggagga agatgcggca
421 tcctatgtta gaaaggttat accaaaagat tacaagacaa tggccgcttt agccaaagcc
481 attgaaaaga atgtgctgtt ttcacatctt gatgataatg agagaagtga tatttttgat
541 gccatgtttt cggtctcctt tatcgcagga gagactgtga ttcagcaagg tgatgaaggg
601 gataacttct atgtgattga tcaaggagag acggatgtct atgttaacaa tgaatgggca
661 accagtgttg gggaaggagg gagctttgga gaacttgctt tgatttatgg aacaccgaga
721 gcagccactg tcaaagcaaa gacaaatgtg aaattgtggg gcatcgaccg agacagctat
781 agaagaatcc tcatgggaag cacactgaga aagcggaaga tgtatgagga attccttagt
841 aaagtctcta ttttagagtc tctggacaag tgggaacgtc ttacggtagc tgatgcattg
901 gaaccagtgc agtttgaaga tgggcagaag attgtggtgc agggagaacc aggggatgag
961 ttcttcatta ttttagaggg gtcagctgct gtgctacaac gtcggtcaga aaatgaagag
1021 tttgttgaag tgggaagatt ggggccttct gattattttg gtgaaattgc actactgatg
1081 aatcgtcctc gtgctgccac agttgttgct cgtggcccct tgaagtgcgt taagctggac
1141 cgacctagat ttgaacgtgt tcttggccca tgctcagaca tcctcaaacg aaacatccag
1201 cagtacaaca gttttgtgtc actgtctgtc tgaaatctgc ctcctgtgcc tcccttttct
1261 cctctcccca atccatgctt cactcatgca aactgcttta ttttccctac ttgcagcgcc
1321 aagtggccac tggcatcgca gcttcctgtc tgtttatata ttgaaagttg cttttattgc
1381 accattttca atttggagca ttaactaaat gctcatacac agttaaataa atagaaagag
1441 ttctatggag actttgctgt tactgcttct ctttgtgcag tgttagtatt caccctgggc
1501 agtgagtgcc atgctttttg gtgagggcag atccagcacc tattgaatta ccatagagta
1561 atgatgtaac agtgcaagat tttttttttt aagtgacata attgtccagt tataagcgta
1621 tttagactgt ggccatatat gctgtatttc tttgtagaat aaatggtttc tcattaaact
1681 ctaaagatta gggaaatgga tatagaaaat cttagtatag tagaaagaca tctgcctgta
1741 attaaactag tttaagggtg gaaaaatgaa aatttttgct aattatcaat gggatatgat
1801 tggttcagtt ttttttttcc agagttgttg tttgccaagc taatctgcct ggtttattta
1861 tatcttgtta ttaatgtttc ttctccaatt ctgaaatact tttgagtatg gctatctata
1921 cctgcctttt aagtttgaaa ctaactcata gatgcaaata ttggttagta tttaactaca
1981 tctgcctcgg ctcacaaatt ccgattagac ctttatccag ctagtgccaa ataattgatc
2041 agatgctgaa ttgagaataa gaatttgagg tctacattct tggttgttaa tttagagcgt
2101 ttggttaaag tatgtccttc agctgactcc agtataatct cctctgctca ttaaactgat
2161 tccaggagat tggatttgct gtgactagat acagatggag caaatgtcct aacagagaaa
2221 tagaggtgat gctgctaaag ggagaaatgc caggcggaca aagttcagtg tcgggaattt
2281 tccccgtgac attcactggg gcatgagatt ttggaagaag ttttttactt tggtttagtc
2341 tttttttcct cctttttatt cagctagaat ttctggtggg ttgatggtag ggtataatgt
2401 gtctgtgttg cttcaaattg gtctgaaagg ctatcctgct gaaagtcctg ctttcctatc
2461 tagcatttat tcctctggca aacttttctt tcttttcttt tttaaagtaa acttgtgtat
2521 tgagtcttaa ctgtatttca gtattttcca gccttatgtg ttacattatt ccaatgatac
2581 ccaacagttt atttttatta tttttttaaa caaaatttca cagttctgta atgtaggcac
2641 ttttattttc attgtgattt atatataagg taatgtaggg ttatatttgg gagtgactgc
2701 aagcattttt ccatctgtgt gcaactaact gactctgtta ttgatccctt ctcctgccct
2761 ttcccaggta atttaaattg gtcatggtag atttttttca tagatttgaa aaacttttag
2821 gttgttacca agtatgaagt ataaatctgg ggaagaggtt ttatttacat tttagggtgg
2881 gtaagaaagc caccttgtta caaatttttt aatttccaaa ataatctata ttaaatgagg
2941 gtttctgatc tgtactttgt gtttagctac ctttttatat ttaaaaaatt aaaaatgaaa
3001 attatgttct tacaagctta aagcttgatt tgatct
//
LOCUS HUMIGHCU 426 bp ss-mRNA PRI 25-MAY-1990
DEFINITION Human Ig rearranged H-chain mRNA V-J4-region, hybridoma AE6-5,
5' end.
ACCESSION M29468
KEYWORDS immunoglobulin heavy chain; joining exon; processed gene;
variable region; variable region subgroup VH-I.
SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuUCVH, derived
from B-CLL cell line UC729-6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 426)
AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
Berman,J.E., Alt,F.W. and Latov,N.
TITLE Cloning and sequence analysis of the variable heavy (VH) and light
(VL) chain regions of an anti-myelin/DNA antibody from a patient
with peripheral neuropathy and chronic lymphocytic leukemia
JOURNAL J. Immunol. 144, 2821-2828 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
L.A.Spatz, 26-OCT-1989, for release after publication.
Columbia University, Department of Neurology BB-322, 630 W. 168th
street, New York, NY 10032.
FEATURES from to/span description
pept 1 > 426 Ig H-chain V-J4-region precursor
/nomgen="IGHV" /map="14q32.33"
/hgml_locus_uid="LK0080M"
sigp 1 57 Ig H-chain signal peptide
matp 58 > 426 Ig H-chain
recomb 384 385 V-region DNA end/J4-region DNA start
BASE COUNT 93 a 114 c 129 g 90 t
ORIGIN
1 atggactgga cctggaggat cctcttcttg gtggcagcag ccacaggagc ccactcccag
61 gtgcagctgg tgcaatctgg ggctgaggtg aagaagcctg gggcctcagt gaaggtctcc
121 tgcgaggctt ctggatacac cttcaccggc cactatatgc actgggtcgg acaggcccct
181 ggacaagggc ttgagtggat ggggtggatc aaccctaaca gtggtggcac aaactatgca
241 cagaagtttc agggcagggt caccatcacc agggacacgt ccatcaacac agcctacatg
301 gagctgagca ggctgagatc tgacgacacg gccgtgtatt actgtgcgag agcctcatat
361 tgtggttatg actgctatta cttctttgac tactggggcc agggaaccct ggtcaccgtc
421 tcctcg
//
LOCUS HUMIGHCV 489 bp ss-mRNA PRI 25-MAY-1990
DEFINITION Human Ig rearranged anti-myelin H-chain mRNA V-J4-region, hybridoma
AE6-5, 5' end.
ACCESSION M29470
KEYWORDS autoantibody; immunoglobulin heavy chain; joining exon;
processed gene; variable region; variable region subgroup VH-III.
SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuPopVH, derived
from B-CLL cell line UC729-6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 489)
AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
Berman,J.E., Alt,F.W. and Latov,N.
TITLE Cloning and sequence analysis of the variable heavy (VH) and light
(VL) chain regions of an anti-myelin/DNA antibody from a patient
with peripheral neuropathy and chronic lymphocytic leukemia
JOURNAL J. Immunol. 144, 2821-2828 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
L.A.Spatz, 26-OCT-1989, for release after publication.
Columbia University, Department of Neurology BB-322, 630 W. 168th
street, New York, NY 10032.
FEATURES from to/span description
pept 64 > 489 Ig H-chain V-J4-region precursor
/nomgen="IGHV" /map="14q32.33"
/hgml_locus_uid="LK0080M"
sigp 64 120 Ig H-chain signal peptide
matp 121 > 489 Ig H-chain
recomb 450 451 V-region DNA end/J4-region DNA start
BASE COUNT 101 a 121 c 144 g 123 t
ORIGIN
1 cccagccctg ggattttcag gtgttttcat ttggtgatca ggactgaaca gagagaactc
61 accatggagt ttgggctgag ctggcttttt cttgtggcta ttttaaaagg tgtccagtgt
121 gaggtgcagc tgttggagtc tgggggaggc ttggtacagc ctgaggggtc cctgagactc
181 tcctgtgcag tctccggatt cacttttagc agctttgcca tgacctgggt ccgccaggct
241 ccagggaagg ggctggagtg ggtctcagct attagtacta gtggtggtag cacatactac
301 gcagagtccg tgaagggccg cttcaccatc tccagagaca attccatgca cacgctgtat
361 ctgcaaatga acagcctgag agccgaggac acggccgtct attactgtgc gaaaggtcct
421 acatattgta gtagaatcag ctgccctccg gactactggg gccagggaac cctggtcacc
481 gtctcctca
//
LOCUS HUMIGHVW 778 bp ds-DNA PRI 25-MAY-1990
DEFINITION Human Ig germline H-chain pseudogene V-J6-region, hybridoma AE6-5,
5' end.
ACCESSION M29466
KEYWORDS germline; immunoglobulin heavy chain; joining exon;
variable region; variable region subgroup VH-I.
SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone II4a, derived from
B-CLL cell line UC729-6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 257 to 778)
AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
Berman,J.E., Alt,F.W. and Latov,N.
TITLE Cloning and sequence analysis of the variable heavy (VH) and light
(VL) chain regions of an anti-myelin/DNA antibody from a patient
with peripheral neuropathy and chronic lymphocytic leukemia
JOURNAL J. Immunol. 144, 2821-2828 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 256)
AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
Berman,J.E., Alt,F.W. and Latov,N.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
L.A.Spatz, 26-OCT-1989, for release after publication.
Columbia University, Department of Neurology BB-322, 630 W. 168th
street, New York, NY 10032.
FEATURES from to/span description
pept.ps 257 302 pseudo-Ig H-chain V-J6-region precursor, exon
1 /nomgen="IGHV" /map="14q32.33"
/hgml_locus_uid="LK0080M"
384 > 778 pseudo-Ig H-chain V-J6-region precursor, exon
2
sigp.ps 257 302 pseudo-Ig H-chain signal peptide
384 394 pseudo-Ig H-chain signal peptide
matp.ps 395 > 778 pseudo-Ig H-chain
recomb 733 734 V-region DNA end/J6-region DNA start
signal 157 164 octamer recognition sequence
BASE COUNT 193 a 197 c 216 g 172 t
ORIGIN
1 aacacagagt tcagggatgg gtaggggatg cgtggcctct aacaggatta cggcttgaac
61 cctcagcttc tacaattgtg tcgtccatgt gtcatgtatt tgctctttct catcctgggt
121 caggaattgg gctattaaat agcatccttc atgaatatgc aaataactga ggtgaatata
181 gatatctgtg tgccctgaga gcatcaccca aaaaccacac ccctccttgg gagaatccct
241 agatcacagc tctaccatgg actggacctg gagcatcctt ttcttggtgg cggcagcaac
301 actaacggac tccccagtcc cggggctgag agagaaacca ggccagtcat gtgagacttc
361 acccatcctg tgtctctcca caggtgccca ctcccaggtt cgactggtgc agtctggagc
421 tgacgtgaag aagcctgggg cctcagtgaa ggtctcctgc aaggcttctg gttacacctt
481 taccagctat gatatcagct gggtgcgaca ggcccctgga caagggcttg agtggatggg
541 atggatcagc gcttttaatg gtaacacaga ctatgcacag aagttccagg gcagggtcac
601 catgaccaca gacacatcca cgagcacagc ctacacggag ctgaggagcc tgagatctga
661 cgacacggcc gtgtattact gtgcgagaga cgggggcttg ggagtggtgg tgatagttga
721 aacctaaagc aagtactaca acgatatgga cgtctggggc caagggacca cggtcacg
//
LOCUS HUMIGKAW 405 bp ss-mRNA PRI 25-MAY-1990
DEFINITION Human Ig rearranged kappa-chain mRNA V-J1-region, hybridoma AE6-5,
5' end.
ACCESSION M29467
KEYWORDS joining exon; kappa-immunoglobulin;
processed gene. variable region subgroup VK-IV; variable region.
SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuUCVK, derived
from B-CLL cell line UC729-6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 405)
AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
Berman,J.E., Alt,F.W. and Latov,N.
TITLE Cloning and sequence analysis of the variable heavy (VH) and light
(VL) chain regions of an anti-myelin/DNA antibody from a patient
with peripheral neuropathy and chronic lymphocytic leukemia
JOURNAL J. Immunol. 144, 2821-2828 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
L.A.Spatz, 26-OCT-1989, for release after publication.
Columbia University, Department of Neurology BB-322, 630 W. 168th
street, New York, NY 10032.
FEATURES from to/span description
pept 1 > 405 Ig kappa-chain V-J1-region precursor
/nomgen="IGKV" /map="2p12"
/hgml_locus_uid="LF0081F"
sigp 1 60 Ig kappa-chain signal peptide
matp 61 > 405 Ig kappa-chain
recomb 369 370 V-region DNA end/J1-region DNA start
BASE COUNT 95 a 110 c 101 g 99 t
ORIGIN
1 atggtgttgc agacccaggt cttcatttct ctgttgctct ggatctctgg tgcctacggg
61 gacatcgtga tgacccagtc tccagactcc ctggctgtgt ctctgggcga gagggccacc
121 atcaactgca agtccagcca gagtctttta tacagctcca acaataagaa ctacttagct
181 tggtaccagc agaaaccagg acagcctcct aagttgctca tttactgggc atctacccgg
241 gaatccgggg tccctgaccg attcagtggc agcgggtctg ggacagattt cactctcacc
301 atcagcagcc tgcaggctga agatgtggca gtttattact gtcagcaata ttatagtact
361 cctccgatgt tcggccaagg gaccaaggtg gaaatcaaac gaact
//
LOCUS HUMIGKAX 407 bp ss-mRNA PRI 25-MAY-1990
DEFINITION Human Ig rearranged anti-myelin kappa-chain mRNA V-J4-region,
hybridoma AE6-5, 5' end.
ACCESSION M29469
KEYWORDS autoantibody; joining exon; kappa-immunoglobulin; processed gene;
variable region; variable region subgroup VK-IIIa.
SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuPopVK, derived
from B-CLL cell line UC729-6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 407)
AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
Berman,J.E., Alt,F.W. and Latov,N.
TITLE Cloning and sequence analysis of the variable heavy (VH) and light
(VL) chain regions of an anti-myelin/DNA antibody from a patient
with peripheral neuropathy and chronic lymphocytic leukemia
JOURNAL J. Immunol. 144, 2821-2828 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
L.A.Spatz, 26-OCT-1989, for release after publication.
Columbia University, Department of Neurology BB-322, 630 W. 168th
street, New York, NY 10032.
FEATURES from to/span description
pept 24 > 407 Ig kappa-chain V-J4-region precursor
/nomgen="IGKV" /map="2p12"
/hgml_locus_uid="LF0081F"
sigp 24 83 Ig kappa-chain signal peptide
matp 84 > 407 Ig kappa-chain
recomb 368 369 V-region DNA end/J-region DNA start
BASE COUNT 95 a 123 c 102 g 87 t
ORIGIN
1 ctcagttagg acccagagga accatggaag ccccagctca gcttctcttc ctcctgctac
61 tctggctccc agataccacc ggagaaattg tgttgacaca gtctccagcc accctgtctt
121 tgtctccagg ggaaagagcc accctctcct gcagggccag tcagagtgtt agcagctact
181 tagcctggta ccaacagaaa cctggccagg ctcccaggct cctcatctat gatgcatcca
241 acaaggccac tggcgtccca gccaggttca gtggcagtgg gtctgggaca gacttcactc
301 tcaccatcag cagcctagag cctgaagatt ttgcagttta ttactgtcag cagagtagca
361 agtggcctct cactttcggc ggagggacca aggtggagat caaagga
//
LOCUS HUMLSP1A 1631 bp ss-mRNA PRI 25-MAY-1990
DEFINITION Human lymphocyte-specific protein 1 (LSP1) mRNA, complete cds.
ACCESSION M33552
KEYWORDS lymphocyte-specific protein; phosphoprotein.
SOURCE Human CTL AH2 cell line, cDNA to mRNA, clones hlsp1.[3,5-1].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1631)
AUTHORS Jongstra-Bilen,J., Young,A.J., Chong,R. and Jongstra,J.
TITLE Human and mouse LSP1 genes code for highly conserved
phosphoproteins
JOURNAL J. Immunol. 144, 1104-1110 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 109 1128 lymphocyte-specific protein 1 (LSP1)
mRNA < 1 1631 LSP1 mRNA
BASE COUNT 373 a 496 c 500 g 262 t
ORIGIN
1 tttaacaccc atagtaggcc taaaagcagc caccaattaa gaaagccaca gcacgtacac
61 ccactccagg gatctgccag caccctgtgg ggcccagact acaggctgat ggcggaggct
121 tcgagtgacc cgggtgctga ggagcgggaa gagttgctgg ggcccactgc tcagtggagc
181 gtggaggacg aggaggaggc cgtccacgag caatgccagc atgagagaga caggcagctt
241 caggcccagg acgaggaggg aggcggccat gtccccgagc ggccgaagca ggagatgctc
301 ctcagcctga agccctcgga ggcccctgaa ctggatgagg acgagggctt tggcgactgg
361 tcccagaggc cagagcagcg gcagcagcac gagggggcgc agggcgcctt ggacagcgga
421 gagccccccc agtgcaggag tcctgagggg gagcaagagg acaggcccgg cctgcatgcc
481 tacgaaaagg aggacagtga tgaagtccac ctggaggagt tgagtctgag caaggagggg
541 ccaggcccag aggacactgt ccaggacaac ctgggggccg caggggctga ggaggaacag
601 gaggagcacc agaaatgtca gcagcccagg acacccagcc ccttggtctt ggaggggacc
661 atcgaacaga gctcgcctcc cctgagccct accaccaaac tcatcgacag gaccgagtcc
721 ctaaaccgct ccatagagaa gagtaacagt gtgaagaaat cccagccaga cttgcccatc
781 tccaagattg atcagtggct ggaacaatac acccaggcca tcgagaccgc tggccggacc
841 cccaagctag cccgccaggc ctccatagag ctgcccagca tggctgtggc cagtaccaag
901 agtcggtggg agacgggtga ggtacaggct cagtctgcgg ccaagactcc gtcctgcaag
961 gatattgtgg ctggagacat gagcaagaaa agcctctggg agcagaaggg aggctccaag
1021 acctcatcaa caattaagag caccccatct gggaagaggt ataagtttgt ggccaccggg
1081 catgggaagt atgagaaggt gcttgtggaa gggggcccgg ctccctaggc gtcccatctc
1141 gcttcctggg tctgcaggtc cagccggctg gcaccctcca tgtacccagg ggagattcca
1201 gccagacacc cgccccccgg ccctggctaa gaagttgctt cctgttgcca gcatgaccta
1261 ccctcgcctc tttgatgcca tccgctgcca cctccttttg ctcctggacc ctttagcctc
1321 tctgcccttc cactctctga ccaccgcccc cgccctcccc acccagctcc gcttcttgtt
1381 acttggggga ggaaagaaac tcctgatcat tggccaaagg gacttacccc tggagaggcc
1441 aagtgccttc taggaagtta ggaggttgag gcacagcctg tgcagagagg gtgggtcacc
1501 cccccagatc caaggggaaa ctgcaggtca agggctgata acggccatgc aggatgcttg
1561 atgctgcgtc ccccgctgct tgccgccccc caccccgcca ttttgtataa taaagctccc
1621 tgtgtattct c
//
LOCUS MUSVG1A 1985 bp ss-mRNA ROD 25-MAY-1990
DEFINITION Mouse Vgr-1 protein mRNA, complete cds.
ACCESSION J04566
KEYWORDS Vgr-1 protein.
SOURCE Mouse (strain ICR) 8.5 day embryo (George), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1985)
AUTHORS Lyons,K., Graycar,J.L., Lee,A., Hashmi,S., Lindquist,P.B.,
Chen,E.Y., Hogan,B.L.M. and Derynck,R.
TITLE Vgr-1, a mammalian gene related to Xenopus Vg-1, is a member of the
transforming growth factor beta gene superfamily
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 4554-4558 (1989)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by A.Lee.
22-MAY-1989, for release after publication.
FEATURES from to/span description
pept 16 1332 Vgr-1 protein
BASE COUNT 448 a 566 c 551 g 420 t
ORIGIN
1 ccttgaccgt aagacatgag gaagatgcaa aaggagatcc tgtcggtgct ggggcccccg
61 cacaggccgc ggcccctgca cggtctccag cagcctcagc ccccggtgct cccgccacag
121 cagcagcagc agcagcagca gcagcagacg gcccgcgagg agccccctcc agggcggctg
181 aagtccgctc cactcttcat gctggatctc tacaacgccc tgtccaatga cgacgaagag
241 gatggggcat cggagggtgt ggggcaagag cctgggtccc acggaggggc cagctcgtcc
301 cagctcaggc agccgtctcc cggcgctgca cactccttga accgcaagag tctcctggcc
361 ccgggacccg gtggcggtgc gtccccactg actagcgcgc aggacagcgc tttcctcaac
421 gacgcggaca tggtcatgag ctttgtgaac ctggtggagt acgacaagga gttctcccca
481 catcaacgac accacaaaga gttcaagttc aacctatccc agattcctga gggtgaggcg
541 gtgacggctg ctgagttccg cgtctacaag gactgtgtgg tggggagttt taaaaaccaa
601 acctttctta tcagcattta ccaagtcttg caggagcatc agcacagaga ctctgaccta
661 tttttgttgg acacccgggt ggtgtgggcc tcagaagaag gttggctgga atttgacatc
721 acagcaacta gcaatctgtg ggtggtgaca ccgcagcaca acatggggct ccagctgagt
781 gtggtgactc gggatggact ccacgtcaac ccccgtgcgg cgggcctggt gggcagagac
841 ggcccttacg acaagcagcc cttcatggtg gccttcttca aggtgagcga ggtccacgtg
901 cgcaccacca ggtcagcctc cagtcggcgg cggcagcaga gtcgcaaccg gtccacccag
961 tcgcaggacg tgtcccgggg ctccggttct tcagactaca acggcagtga gttaaaaaca
1021 gcttgcaaga agcatgagct ctatgtgagc ttccaggacc tgggatggca ggactggatc
1081 attgcaccca aaggctacgc tgccaactac tgtgatggag agtgttcctt cccactcaac
1141 gcacacatga atgccaccaa ccacgccatt gtacagacct tggtccacct tatgaatccc
1201 gagtacgtcc ccaaaccatg ctgcgcacca accaaactga atgccatctc ggttctttac
1261 ttcgatgata actccaatgt catcttgaaa aagtacagga atatggtcgt gagagcttgt
1321 ggttgccatt aagttgaagc tggtgtgtgt gtgtgggtgg gggcatggtt ctgccttgga
1381 ttcctaacaa caacatctgc cttaaaccac gaacaacagc acagcgaagc gggatggtga
1441 cacacagagg gatcgtgaca cgcagacaca tctcccgctg gtgccttacc cacggaggct
1501 tttatgagga ccttgtcaag ggctttccca gttcctaact gagcagttgc tggtctgcag
1561 gaagctggaa ggcttgtagt acaggcctgg aaactgcagt tacctaatgt tcgcctcccc
1621 caaccccgcc cggagtagtt ttagctttta gatctagctg cttgtggtgt aagtaaactt
1681 gaaggaatat taaatatccc tgggttgaaa gacccggtgg tggctctaca gcacccatcc
1741 cagggagatt tttgcagaca tccgaatgga ggggagaagg gcactctttc aggttccatt
1801 cccagcaagg gcagctcaca caggacctgc agcctggcca tcagcaggct ctgtggaggt
1861 gccttctgtc tactgttgta gttacgtgtt ttgtgttgac tctcggtggt gtgagaatgt
1921 actaatctct gtcaagacaa actgtagcat ttccacccca tcctcctccc tccctcacag
1981 aattc
//
LOCUS NPAIFP1A 1730 bp ds-DNA VRL 25-MAY-1990
DEFINITION AcNPV mutant with an S.frugiperda insertion element IFP1.6.
ACCESSION M32775
KEYWORDS insertion element; insertion element IFP1.6.
SOURCE AcNPV (mutant strain AcFP875.2) DNA.
ORGANISM Autographa californica nuclear polyhedrosis virus
Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus;
Nuclear polyhedrosis viruses.
REFERENCE 1 (bases 1 to 1730)
AUTHORS Beames,B. and Summers,M.D.
TITLE Sequence comparison of cellular and viral copies of host cell DNA
insertions found in Autographa californica nuclear polyhedrosis
virus
JOURNAL Virology 174, 354-363 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 118 119 AcNPV DNA end/S.frugiperda DNA start
recomb 1683 1684 S.frugiperda DNA end/AcNPV DNA start
site 115 118 viral target sequence
site 1684 1687 viral target sequence
BASE COUNT 610 a 279 c 252 g 589 t
ORIGIN
1 tcgtcgattg cgttttgatc aacgacttga gcagagacac gttaatcaac tgttcaaatt
61 gatccatatt aactatatca acccgatgcg tatatggtgc gtaaaatata ttttttaacc
121 ttttgaccgc cagagcataa acgcttcgct gtgccctcca cgccaagcca caaattcaac
181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat
241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta
301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc
361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat
421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta
481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg
541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat
601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa
661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa
721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc
781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc
841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa
901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa
961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta
1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc
1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg
1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat
1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa
1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt
1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca
1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat
1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt
1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa
1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga
1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa
1681 aggttaaccc tcttatactt tgcactctgc gttaatacgc gttcgtgtac
//
LOCUS NPAIFP2A 2225 bp ds-DNA VRL 25-MAY-1990
DEFINITION AcNPV with an S.frugiperda insertion element IFP2.2.
ACCESSION M32779
KEYWORDS insertion element; insertion element IFP2.2.
SOURCE AcNPV DNA.
ORGANISM Autographa californica nuclear polyhedrosis virus
Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus;
Nuclear polyhedrosis viruses.
REFERENCE 1 (bases 1 to 2225)
AUTHORS Beames,B. and Summers,M.D.
TITLE Sequence comparison of cellular and viral copies of host cell DNA
insertions found in Autographa californica nuclear polyhedrosis
virus
JOURNAL Virology 174, 354-363 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 26 27 AcNPV DNA end/S.frugiperda DNA start
recomb 2191 2192 S.frugiperda DNA end/AcNPV DNA start
BASE COUNT 760 a 407 c 349 g 709 t
ORIGIN
1 ccgtctcacc ataacaccgt ttttaccaca gaacatataa agagagtaga aatgaaatag
61 ttcctgattt gtatcagaac gcgtgtcgcc gtttctccca ccctttaatt tcactacaag
121 caagagacca atgtctgagc ggattacccc cacaaataaa taacacacat ttaaatctcg
181 aatttgctat tggttgtcac acaaacgcaa tgactgcatt tgacgtacac tctaacacgc
241 ataaaaactc atgtatgcta gggcgagaca aaggtcgatt gttgatatcg ataaaattta
301 agtaatcttg tttttgtttt acaagtttgt tgaatattgt gaaattgtat tcatgcgaaa
361 tagaaaccaa gtaataaaag taaaataaat aaaaaataca taactgaatt ttaaaaccaa
421 ttttattact caataagtta aaaatcagta aaaaataaat aaatcatagg cacttttgaa
481 cgtcaaagca aatataataa tattaataac gtctgtctgt cggtcagccc tctagtgaag
541 ctatttgctc gttccaaagt gtagattcct atggagaaga tcgaagatct gttcgttcga
601 tctttttcaa ttagattcac aatacaattc ttgataacat tgcttttctt tgcatcgatt
661 gcttgaacta tgtgagaaca atgtaaaact aatattcagt caaagcgata gaaaaaaaaa
721 ccttaagaac aacaaaatta atacagtctg gagctgacca gtcccagttg acagcgctcg
781 ttcaccaaca tgacacgtac accagcaccg tccaactcaa ccatgttgca gggaatcgaa
841 cgatccaatg cccgtgccac cgccagtgag acctttgagt gagcatgaca actctaagct
901 gacacaaatg cattctacta gtctaattta gctaattaca gggctctcac atttacaaaa
961 atctgtataa agtacagaac acattgaaat aacatggtta tattatttca ttttttaaat
1021 ttacgaaaga tgatatttta aatatcgcca tttttagaca aaaaaactga tttaatgaga
1081 ctaatttaaa attttctaaa ataaaatcag tttacttact gaattacaca ttgcgatatt
1141 gtgcggataa acgcctaatt ttaagtttct tttccctgac tttgttatta tgtaattatc
1201 aacatcgaaa tgttgcgaac gaaatatgct acatttagta ggaatccagg agcccatagt
1261 acaggttttc ctagtttggg ttcctgaatc ccaccacttg tcatttcaat tgtcttaaac
1321 ctggtcgtgt aacttgctac ataatttatt atactattaa ttattatata agcttagttt
1381 ttaagaccaa tgtctgtata aatatggtgg agataaaaat atacgcagtt tctaatctgt
1441 gtggtggaac ttgaaaaaaa aaaaataatt ctaccagttg tgtcaatcca caattttgtt
1501 acacctggat ctgttggaaa cctaaaacat aatattaaaa taaatcatat gtcacattct
1561 ataaaaaaat aaaaacataa aactcactca aaatctatat tactaattag tcgtaataaa
1621 taatacttat tatttaaatt ataaagtgtg ctattgatta taatagactg tatcatagta
1681 acaatcgaca ttgattacaa atttgcccaa cactatagat gtcatagatt ttgtggaaat
1741 gagcttttgt ttcggtactc gggactctcg caatggacgt catacattgc gcgattgctc
1801 acgtagtgca agattgtact catctatcgc aagcaatgta ttactattag tctgcttgcg
1861 attatatcgt ttttgtctat aaatttcata atactaatgt tataattaca aatatcacac
1921 gtatttttat gtttttggac ttaccggtga aaagaaattc cttctcgtgc tatgctacac
1981 acctggtttc ttttccgaca cgttataata gcgcaagacg gcataatata atatttttgt
2041 ggagatacgt acgctgtcaa aacatgacag cgattgctag ccggtaagta tgaacatttg
2101 agcgttatga cagcggtatg cacgttcgtt cgcaccctcg gcacgcgtgg cccgcccaca
2161 aggccaatcc agtctctttc ttttatccat ggtttttaca aatggaaatg tatttgtaaa
2221 acggc
//
LOCUS RATPFKFBP1 2917 bp ds-DNA ROD 25-MAY-1990
DEFINITION Rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase gene, 5'
flank and exon 1.
ACCESSION M26215
KEYWORDS 6-phosphofructo-2-kinase; alternative splicing;
fructose-2,6-bisphosphatase.
SEGMENT 1 of 2
SOURCE Rat liver DNA, clone lambda-20.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2917)
AUTHORS Darville,M.I., Crepin,K.M., Hue,L. and Rousseau,G.G.
TITLE 5' flanking sequence and structure of a gene encoding rat
6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6543-6547 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.I.Darville, 08-AUG-1989.
FEATURES from to/span description
pept 1374 / 1401 M-type 6-phosphofructo-2-kinase (EC
2.7.1.105)/fructose-2,6-bisphosphatase (EC
3.1.3.46; PFK-2/FBPase-2), exon 1
pre-msg 1255 > 2917 PFK-2/FBPase-2 mRNA and introns (alt.)
IVS 1402 > 2917 PFK-2/FBPase-2 intron A
BASE COUNT 779 a 627 c 605 g 906 t
ORIGIN
1 ggatccagac agaactccgg cttaagtgag cggcccctgg cggtggctgt gaggggcttc
61 tgcctggagt cctttgggga actttctcag ttgaaagatg gagcagttaa gaatttactt
121 agctagcctc cgttttaatg gagtctggag gaaggagact acctcaaatc ctccctgtag
181 agccatattt gtcttgactt tcaaagcgtg agtcagactt gcatttggct ccatttctca
241 gatgagcaaa atcgaggcct agaacattta agctccggag cagtatcttt tccctatcct
301 gtctacgctt tgtcctttca ttcttaggat aaaccaggaa gaggcactga agctaagtag
361 aagttgaata aattcgtctc agcctcacag aagttataga gccaagattt aagcccttct
421 taatcaggta aacgattgat tttctgtcca cacaactctg tgtcttttcc cgtgggcacc
481 actgcacttt acgatccatt ctagtgcaac atttaccaca gtgaacagct cccattgatt
541 tccattgtca ctagaccgct aacagcctga atgcagtggc catcttggac ccagcttata
601 tgataacgga ataaatattt agcagatgga tgaagcagca ggccagtagc gatgagttct
661 tgacttattt tttttaacag gaaatgccca atttccagga aaactataac tttgcatcgt
721 aaggtttttt cactgaaaat tgctacagag ttattcccac tcgaatacac acaccctcag
781 aatctcacag cctttaagcc atgatcttca gtgggttgct cctctaccct cagaggactt
841 aaaggtcact agctattgac attagaaaaa catctatgta attaaaagtc tgtgccttga
901 gcttttccat gcattgggag ttactgacta aagatcgagt cccaggagtg ggttaattag
961 gacaagggaa atgaacgttt ttcttcatgt ttccagattg ctttcaagca agggattttg
1021 ccaatgctgt actaaaggca gacagaaacc tttgtcccaa ggaaatccag cttttgctca
1081 cctcaatagc tggttttaat ttctccattc tccagtggct tgttaatatt agtactgacc
1141 tttggggcaa ggtgaacaca tggttggact gaagagaaaa cgctctgggt ggctcaggaa
1201 cttctttggc aagtacagca gctaatattt caacactgca cacacccctt aacaagggcg
1261 cttcctcagc cttcccagga accaacggat gtctccgagt tttctatttg atgccaccca
1321 ccggacctgc tttgggggtc tgtaaatgca agagagccat tggaaaatta gcgatggaag
1381 aaaaagcctc taagagaaca ggtaggtgag cctaaaggaa aaacagacag gtctttttac
1441 aggtcaaaaa tccttctttc tggcttcttg gaagattttt tctgcatagg attctgattt
1501 agctttatta aaggggatgg gagggttggg gaatgataat atgaatacct aattcaactg
1561 ccaatatttt atttttttaa ttattttatt aatattccaa atgttgcccc ctccctattg
1621 tcagtatttt aaaattcatt tgtacttgac tttttaggac cctctctaaa ggtcagaccc
1681 ctactatctt tgaaattcca atttttgttt tatttagttt tggaaaatga atttaactgt
1741 gcttcccatg ggagcctgaa actttctatg tagcctggag tggccatgat tgtgtaaccc
1801 tcccagccta gcttccacct atgtagtgta tgtaattttt taaattaaat gcacaataga
1861 gagtgcatct ctgacctctc ctctagtgta taaagttgag ttgatttgct gtgaaacttt
1921 gacacattct ttagtttctc tgggtgtcaa tggatacctt agtaaaacag aaattaaagc
1981 ccctagcctc cttcccacag aggactggcc tgatgatcca gaatctctaa taggcagaat
2041 ccagagttct tttgcttcag ggctttcatg ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt
2101 gtgtgtgtgt gtgtgtgtgt gtgtgtgaca ttcatgttct tggcttttat gtggcgtcca
2161 caggtagtac cacccacagc cctctctcta tgtgctctcc ttcctaagac tcgtcctaga
2221 taagaatgca gtgagctttg tggagcagac aggagtctga aaatagtggc ataatggggc
2281 cagagcagtc actgagaaca aagggattct aggctgctgt cccagaagtc ctgatatata
2341 ttcatacatg aatctagcct gtaatcatgt gattatccgt cagccttcta gcctataaca
2401 aatatattca caaccacaca tttcaagact taccaataag gagaccccat ttatgacttt
2461 attgcccatc ctatcttgat aaaatgagct tcctcaccac attgtacatg gaggtgggag
2521 caggctctat ggccgaacat catactgcat agtttcttcc ctagagatag ttctacttta
2581 tggttcagcc tttagccctg agtatgaagc tgagtaaacc ctgtgtctat aaccaataca
2641 ttaaaacgtg ttagaatttt atcattttat tctgtgccat cctaaagttt taaggttttt
2701 tttttttttt ggatgttgct ctttacctca gaaaaaccag ctctcccctg tccatttccc
2761 tgaaagctgt cataatatct accatggtga actgggattt cttgtcacca tgtttatctc
2821 ttctgcagca gagagatgta gattgtgaac atgatcactg aacaagtgtc ctccatatca
2881 gagcccaaga tgaacagtca gcaggtcaca gggatcc
//
LOCUS RATPFKFBP2 3720 bp ds-DNA ROD 25-MAY-1990
DEFINITION Rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase gene, exon
1'.
ACCESSION M26216
KEYWORDS 6-phosphofructo-2-kinase; alternative splicing;
fructose-2,6-bisphosphatase.
SEGMENT 2 of 2
SOURCE Rat liver DNA, clone lambda-20.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3720)
AUTHORS Darville,M.I., Crepin,K.M., Hue,L. and Rousseau,G.G.
TITLE 5' flanking sequence and structure of a gene encoding rat 6-
phosphofructo-2-kinase/fructose-2,6-bisphosphatase
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6543-6547 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.I.Darville, 08-AUG-1989.
FEATURES from to/span description
pept 2913 / 3009 L-type 6-phosphofructo-2-kinase (EC
2.7.1.105)/fructose-2,6-bisphosphatase (EC
3.1.3.46; PFK-2/FBPase-2), exon 1'
pre-msg 2664 > 3720 PFK-2/FBPase-2 mRNA and introns (alt.)
IVS 3010 > 3720 PFK-2/FBPase-2 intron A'
BASE COUNT 1020 a 752 c 859 g 1089 t
ORIGIN About o.1 kb after segment 1.
1 aagcttctct caaggctata atactgactt tcagagaaga tgaacactaa aactggtctt
61 aactaagaca gagttcagtg agaataaaag atacttcagg ctgaggaaga gcctaaaagc
121 aggaaaacgc tgtgttcgag ggtcctgtaa tcagccagtg tggttaaaca tgggggagtc
181 ctggggggta gaagactgat aggtggctta ggacttgatt tagaagaagt ttgaatgcta
241 gtctgagtca tttctctaag gcttgttttt tggtgtttct gagttcagaa ttgctaacgg
301 tagtgaacaa tttcatagtt agttctaggc agggttaaat tagggtgtca aacctggacc
361 ctgagtttac ccattcagac cttgtcagag gccttctagc tacctagctc agatttcttg
421 ctactaaaat tatcttcttg attcttggca gtgttgtaag gacaaaattg taaacataaa
481 agacatgttc atgttgttgg ctagattacc ttagacaagc catcccctcc tcaggcccat
541 ttccctgtgc tattactggg gaagtttcaa agaaacaaat aagataaaga aaatgttact
601 ttgtgcgcac tggaaaacat acaaagttac ggtaagatca ttaaaacaga cttccactaa
661 ttatgtgact tactaggatt aagagactca tgtcttgcaa agtctctttc tgagactcta
721 ggagttatct gctaatctgg aaaaaactca ttatattgta ttttttactt taaattgata
781 gataattgtt tatacttgtg tggtacagtg catgtatata atgtacaatg gttaatagct
841 ttatgtatca cctcaaatat ctatcatttc tcctttccac cagctatatt gaagtatcca
901 attatctgtt gtgaaccata gttatccaga tgtgctcttg aacatcagaa gttattcctc
961 ctgtttccca gaatccatcc tgtttgttct ctcttttctc cctctccttt gcccactagt
1021 ctatactcag tttctcacca atctctttta cctttgtcca gtaggtgaga gcatgtagta
1081 tttgatgttt tttgccagac ttaagttaac agaaacctag gaaggagttt acctttctaa
1141 ttgacactca agaagtcttt acagtataaa tgttaatccc gtgtcacaat aacgtgctag
1201 accttgcaga ttacttggag ggaaaaagat gaatcaatta ctgcccaaag gaaacaacag
1261 tttgaatcat atgtattcag cttccccagg tttcttcatg tgtatcaata tattgactct
1321 tcgctaacat tgagggaatg gaccctacta tccttatatc tagcaatggg cagaagtgag
1381 agtccaccca gacagtatgg ttcctaatcc cattgttatt gttcttaact gctctcatcc
1441 attgcctcag taggctcctt gagataggta ggccttcacg ttgaaagtct tatatgctca
1501 gcaactgcag tgaatctaac ccttctatag actaaggttc tgagaagata ttttaggaag
1561 tgtcttaaaa ggcaaagtaa agctgggagc atttgagggc agggactgga gatttgtttt
1621 cccgagagat tgcccttgca tactccagct cagtggaaga aagttgagcc tcctaatacc
1681 caatcactgg agctttgctg ggggtgggaa ggttacaaag aatgaagtaa aagggcactt
1741 ggacatcttc caggcaagcc atgcagttgc atgaattcta cagataaaag tctgaaaata
1801 actgggggaa tggcaataag agtggggaca agtgaaaggg agacatctgg atgagatatc
1861 tctgcctgtg gatttgtgct atttaaatag atgccattac ctcttattcc ccatagtctc
1921 ctcagtgtgg tttccacctc tcctaagatc ccatcccaac actttggctt gaagagtttt
1981 agcacgggtt tgaattagat tctgatgcct ttagtgctgg gtcatttcag cattggcagc
2041 catgtgagcg acccttactt atcggctctc tgcccagcaa tggctgccct cactctcatt
2101 cctaaggtgt cagatttact ctgatttatt taaccttggc tggagagagt ataaaggtga
2161 tcagatagtg ccaatctctt tgaagtgccc aggacagctc attttaagag ggagaggtcc
2221 ggtagagagg cagggaacaa actctaggga aagacagttt tacagtgcct tgttgaatag
2281 ggtgagtggt gaggtagtga aagtctactt ccagtttggg cctttaattg ctagctcaaa
2341 acattgcatt ttgattagga acattatgtt tgtggttttg ttgtgggggg gggggcttgt
2401 aatgaacctc tgtttctgag ttggagctta atgttctaat taggcctgct ttgaaatcaa
2461 tttcaaagct tagacctatc tccatcacct ttgtcatgca cacaacacag gatgtaagca
2521 catgatcttt ggtcttttat ttgcatactc tactagtcct gccttcattg gcgagaagta
2581 actggccaga cctgggctgt cttgctggct gccttccttg ctgggctcca acttatcttc
2641 tatgtacata gcccttggag ttcagaggcc tctgctgact tctgctcagc ctctggctct
2701 cagcacccac atccccactt gtacatctgg gagcagaaga aaaagaggga gctggagata
2761 atctgtgaga ggctccgaag cccagtacat ccccctgact cagccctgcc cacctgctgc
2821 tgtgactaca gcaacagaca agacagctag gaagatagga agtgaggtcc aataccttgt
2881 gggcagtggt gtcattagct gcaacgccta agatgtctcg agagatggga gaactcactc
2941 aaaccaggtt acagaagatc tggattccac acagcagcag tagcagcgtg ctgcaacggc
3001 gaaggggctg taagtgaggg ttgtcatttg tgggtggtgg tgatgggtgg gtctagaaga
3061 acaagtggtg ttccccatcc actcctaccc tcacaaagaa gttggaatag aagatcattt
3121 ggatgaaatt gctagagtgt aacaagaggc taatatgaga agggtaagga agtaagagaa
3181 aggcaggtag aaaagacaac gatgaggctt agataaaaag aacccaaact acagggaccc
3241 ctgacctaag ggctgtctcc ctttatttga gaaacggtcg gaatgcttct tgtctggaat
3301 gcgagcgttc tagtgtagtg ttacccgaga gtattttatt ttagcttctc ttatgaaagc
3361 tctggaaaca ggtcttgttt aatttatgcc agggtgaaag aaggtgatag ttcctggcgt
3421 ggtttctcgg aggctaggtg cttagtccac accagtaccg tgtcactgct cagcattgcc
3481 cttcctcttc caccatcctc ttaaaaataa ctgtacctgc ccagctgggg acaactgcgt
3541 acttggcctt gacattcaga gacagtcttg ctgttctgag gagtcgaggg gacagacagt
3601 atcacaaaga ctgatctggt gttcagttgg agcttaacat cctaattaga cctgatgtct
3661 ttattttaaa actcaggcct ttgaaatttt aaggaggagc agaggcaaag taagcaaagc
//
LOCUS SPFIFP1A 1801 bp ds-DNA INV 25-MAY-1990
DEFINITION S.frugiperda insertion element IFP1.6 DNA, clone lambda 879.
ACCESSION M32776
KEYWORDS insertion element; insertion element IFP1.6.
SOURCE S.frugiperda DNA, clone lambda 879.
ORGANISM Spodoptera frugiperda
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Noctuoidea; Noctuidae.
REFERENCE 1 (bases 1 to 1801)
AUTHORS Beames,B. and Summers,M.D.
TITLE Sequence comparison of cellular and viral copies of host cell DNA
insertions found in Autographa californica nuclear polyhedrosis
virus
JOURNAL Virology 174, 354-363 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 115 118 5' insertion target site
site 1755 1758 3' insertion target site
BASE COUNT 629 a 274 c 273 g 625 t
ORIGIN
1 gagaatcaat gttcactgga aatatatccg ttttagtttg taacgaagaa aaagatgcga
61 tgaagtgttt tgtgacgtgg attcttagaa ggaagaattc aatgttctat tttattaacc
121 ttttgaccgc cagagcataa acgcttcgcc gtgccctcca cgccgagaca caaattcaac
181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat
241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta
301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc
361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat
421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta
481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg
541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat
601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa
661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa
721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc
781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc
841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa
901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa
961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta
1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc
1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg
1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat
1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa
1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt
1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca
1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat
1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt
1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa
1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga
1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa
1681 aggtaatgtg tttattttgt ttttctctgt gatctgtttc ttggcatcta tagatgtcgg
1741 tggcgcacag ggcattaata tctatcacct taccttttat tattacttta atgtataatg
1801 g
//
LOCUS SPFIFP1B 1820 bp ds-DNA INV 25-MAY-1990
DEFINITION S.frugiperda insertion element IFP1.6 DNA, clone lambda 883.
ACCESSION M32777
KEYWORDS insertion element; insertion element IFP1.6.
SOURCE S.frugiperda DNA, clone lambda 883.
ORGANISM Spodoptera frugiperda
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Noctuoidea; Noctuidae.
REFERENCE 1 (bases 1 to 1820)
AUTHORS Beames,B. and Summers,M.D.
TITLE Sequence comparison of cellular and viral copies of host cell DNA
insertions found in Autographa californica nuclear polyhedrosis
virus
JOURNAL Virology 174, 354-363 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 115 118 5' insertion target site
site 1774 1777 3' insertion target site
BASE COUNT 636 a 281 c 270 g 633 t
ORIGIN
1 agagttaagt ggctttgatg atttgtgcat atagtgccat tgtgcataca aaaccaaaat
61 gttttatatt caaacttaat tttatcgttg tcatctaaaa cgaaagccat ttctttaacc
121 ttttgaccgc cagagcataa acgcttcgct gtgccctcca cgccaagcca caaattcaac
181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat
241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta
301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc
361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat
421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta
481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg
541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat
601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa
661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa
721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc
781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc
841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa
901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa
961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta
1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc
1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg
1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat
1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa
1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt
1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca
1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat
1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt
1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa
1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga
1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa
1681 aggacacgtg aaagtaatga tttaatgtgt ttattttgtt tttctctgtg atctgtttct
1741 tggcatctat agatgtcggt ggcgcacagg gcattaatcc attttctatc ataagttatt
1801 tgatacttag atctgtaagt
//
LOCUS SPFIFP1C 1741 bp ds-DNA INV 25-MAY-1990
DEFINITION S.frugiperda insertion element IFP1.6 DNA, clone lambda 889.
ACCESSION M32778
KEYWORDS insertion element; insertion element IFP1.6.
SOURCE S.frugiperda DNA, clone lambda 889.
ORGANISM Spodoptera frugiperda
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Noctuoidea; Noctuidae.
REFERENCE 1 (bases 1 to 1741)
AUTHORS Beames,B. and Summers,M.D.
TITLE Sequence comparison of cellular and viral copies of host cell DNA
insertions found in Autographa californica nuclear polyhedrosis
virus
JOURNAL Virology 174, 354-363 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 60 63 5' insertion target site
site 1696 1699 3' insertion target site
BASE COUNT 609 a 275 c 263 g 594 t
ORIGIN
1 ggtaggcagt gacaatggaa cgccaattgc tacgaatctt acatacttca ttcgcttcat
61 taaccttttg accgccagag cataaacgct tcgctgtgcc ctccacgcca agccacaaat
121 tcaacgaaat aaccttgaaa atattaggga ttccaaaatg ttatcgataa accaaagtga
181 gtaataattt tcgtctgatt tttttttgaa taaactgctt ataacataaa aagaatattt
241 atttaaccta atctcccctg taataaactt aagaccacaa taatagcaac tatcaataaa
301 atatcagcct tttgagacac aaaaatacat ataaaattgc agcaagttac agcactatta
361 tctatcagtt acgcgccatt ggcatgtata catgccaatg gcgcggcaac tatcaatttt
421 ttttaaatct ctcgtttatg attagggatg tagctgaata atgtaactta tcgagttcat
481 ttatgtatta ggtaatcaaa tttgaaaact gctaaatagg tacagatttt agtgatatta
541 tttatacact ttcatattct taaaatagtt gccttgtatt tgtaaaatgt cacatgaaac
601 ttaaaaatat aaaaaatcaa caatcgtatt tatttattat gctcattaac agccttgtaa
661 ttgaaccgtc acgtgaaact taaaaatata taaaaaatca acaattgtgt ataattatta
721 tgctcaatac agccttgtat ttgtaaaatg tcacatgaaa cttaaaaata taaaaaatca
781 caatcgtatt tatttattat gctcattaac agccttgtaa ttgaaaaagg ttacgtgaaa
841 cttaaaaata tataaaaatc aacaattgta cctataaaac attatgctca ctaacagtct
901 tggaattgta aaatttaaaa ctaaattgta tattatctat aattattatc aataacaaaa
961 atttaataaa taaatagcaa ttataatgta ggtactcgat aaaatatttt ttgcaacttc
1021 cccgcattta aaaacacggc acgcgcttac gacgtcacga gtccgacatc tatacatgtc
1081 aatggcgcag aagggattgc atatcggtag tatgctagga tgcgcatgat tggcgaaaag
1141 ggaatgtcga taatattgaa aatatagtaa atatgctatt atacccaaga atttttcagt
1201 ttaaaatata gtatatttac cttttttctc agagactaag gttatatata tactttttaa
1261 tagttattac aaaaaataag cgagtcattt agctgaagtt ttaatattac ttgaattaca
1321 ctacatcact agtaggcatg tatacatgcc attggcgcga gagatacagt tcttttttag
1381 ccgatattta agttgttttt atcaatcaaa tattagttat tctatattgc aattttgttc
1441 actgtgtaca attatgtaat aactgggtat tcagatctta agatacatca ataaaacacg
1501 tgaaagtaat gatttaatgt gtttattttg tttttctctg tgatctgttt cttggcatct
1561 atagatgtcg gtggcgcaca gggcacgctt gcgcatgaca tctatacatg ccattggcgt
1621 tcaaaaggtg tttattttgt ttttctctgt gatctgtttc ttggcatcta tagatgtcgg
1681 tggcgcacag ggcattaaca taacttggta taaattgttg cataattgca ttgagagcag
1741 a
//
LOCUS SPFIFP6A 1792 bp ds-DNA INV 25-MAY-1990
DEFINITION S.frugiperda insertion element IFP1.6 DNA.
ACCESSION M32780
KEYWORDS insertion element; insertion element IFP2.2.
SOURCE S.frugiperda DNA, clone lambda 858.
ORGANISM Spodoptera frugiperda
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Noctuoidea; Noctuidae.
REFERENCE 1 (bases 1 to 1792)
AUTHORS Beames,B. and Summers,M.D.
TITLE Sequence comparison of cellular and viral copies of host cell DNA
insertions found in Autographa californica nuclear polyhedrosis
virus
JOURNAL Virology 174, 354-363 (1990)
STANDARD simple staff_entry
BASE COUNT 586 a 326 c 287 g 593 t
ORIGIN
1 atgatgtttg tacataatgt tggtacgttt ctcccaccct ttaatttcac tacaagcaag
61 agaccaatgt ctgagcggat tacccccaca aataaataac acacatttaa atctcgaatt
121 tgctattggt tgtcacacaa acgcaatgac tgcatttgac gtacactcta acacgcataa
181 aaactcatgt atgctagggc gagacaaagg tcgattgttg atatcgataa aatttaagta
241 atcttgtttt tgttttacaa gtttgttgaa tattgtgaaa ttgtattcat gcgattcaaa
301 ttgtatcacc aacatgacac gtacaccagc accgtccaac tcaaccatgt tgcagggaat
361 cgaacgatcc aatgcccgtg ccaccgccag tgagaccttt gagtgagcat gacaactcta
421 agctgacaca aatgctgaaa tgcattctac tagtctaatt tagctaatta cagggctctc
481 acatttacaa taatctgtat aaagtacaga acacattgaa ataacatggt tatattattt
541 cattttttaa atttacgaaa gatgatattt taaatatcgc catttttaga caaaaaaact
601 gatttaatga gactaattta aaattttcta aaataaaatc agtttactta ctgaattaca
661 cattgcgata ttgtgcggat aaacgcctaa ttttaagttt cttttccctg actttgttat
721 tatgtaatta tcaacatcga aatgttgcga acgaaatatg ctacatttag taggaatcca
781 ggagcccata gtacaggttt tcctagtttg ggttcctgaa tcccaccact tgtcatttca
841 attgtcttaa acctggtcgt gtaacttgct acataattta ttatactatt aattattata
901 taagcttagt ttttaagacc aatgtctgta taaatatggt ggagataaaa atatacgcag
961 tttctaatct gtgtggtgga acttgaaaaa aaaaaaatta attctaccag ttgtgtcaat
1021 ccacaatttt gttacacctg gatctgttgg aaacctaaaa cataatatta aaataaatca
1081 tatgtcacat tctataaaaa aataaaaaca taaaactcac tcaaaatcta tattactaat
1141 tagtcgtaat aaataatact tattatttaa attataaagt gtgctattga ttataataga
1201 ctgtatcata gtaacaatcg acattgatta caaatttgcc caacactata gatgtcatag
1261 attttgtgga aatgagcttt tgtttcggta ctcgggactc tcgcaatgga cgtcatacat
1321 tgcgcgattg ctcacgtagt gcaagattgt actcatctat cgcaagcaat gtattactat
1381 tagtctgctt gcgattatat cgtttttgtc tataaatttc ataatactaa tgttataatt
1441 acaaatatca cacgtatttt tatgtttttg gacttaccgg tgaaaagaaa ttccttctcg
1501 tgctatgcta cacacctggt ttcttttccg acacgttata atagcgcaag acggcataat
1561 ataatatttt tgcggagata cgtacgctgt caaaacatga cagcgattgc tagccggtaa
1621 gtatgaacat ttgagcgtta tgacagcggt atgcacgttc gttcgcaccc tcggcacgcg
1681 tggcccgccc acaaggccaa tccagtctct ttcttttatc catggttttt acaaatggaa
1741 atgtatttgt aaaacggcgt tggtacatct gtacctaaaa atgatatagt at
//
LOCUS TACLSEG 7102 bp ss-RNA VRL 25-MAY-1990
DEFINITION Tacaribe virus L protein and P-11 protein genes, complete cds.
ACCESSION J04340 M33513
KEYWORDS L protein; P-11 protein.
SOURCE Tacaribe virus (strain T.RVL.II 573), cDNA to viral RNA.
ORGANISM Tacaribe virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Arenaviridae.
REFERENCE 1 (bases 1 to 449)
AUTHORS Iapalucci,S., Lopez,N., Rey,O., Zakin,M.M., Cohen,G.N. and
Franze-Fernandez,M.T.
TITLE The 5' region of Tacaribe virus L RNA encodes a protein with a
potential metal binding domain
JOURNAL Virology 173, 357-361 (1989)
STANDARD simple staff_entry
REFERENCE 2 (bases 420 to 7102)
AUTHORS Iapalucci,S., Lopez,R., Rey,O., Lopez,N., Franze-Fernandez,M.T.,
Cohen,G.N., Lucero,M., Ochoa,A. and Zakin,M.M.
TITLE Tacaribe virus L gene encodes a protein of 2210 amino acid residues
JOURNAL Virology 170, 40-47 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by S.Iapalucci, 08-AUG-1989.
FEATURES from to/span description
pept 70 357 P-11 protein
pept 7072 440 (c) L protein
BASE COUNT 2071 a 1653 c 1240 g 2138 t
ORIGIN L segment.
1 cgcaccgggg atcctaggcg ttacgtgcac tcttctttat tgggctggat tacacaaaac
61 ttttcaagca tgggcaactg caatcgaact cagaaaccca gttcgtcttc taacaacctc
121 gagaagccac cacaagctgc agaatttaga agaacggcag agcccagtct atatgggagg
181 tacaattgca aatgttgttg gtttgctgac aaaaacttaa tcacctgtag tgaccattat
241 ctttgtctaa gatgtcacca aatcatgctg agaaactcag agctgtgcaa catctgctgg
301 aagcccctgc caacatccat aagagtcccg ctagaggcaa gtgctccgga tctgtaacca
361 acgacaaaca aggtcgctga ccccccgggg ggcccccaag ggggcccccg tgggggtggc
421 ccccggggga gtttgggttc tactcaatgt cctccactgt gtttctccta gctagaggtc
481 tgcacataat gcctctcaac ctcaaggacc cctggttgtc agatacaaca aggtctttca
541 gggacttact gtatgcaacc ttgaagttga caaattcaac ccagtccccg atcaactcca
601 gggacctgat caagccagat ctatgtaaat catagacaag gacaaagtcc tcttgttgga
661 atagcatcac ctgtgcagtt gattttaaat agggccagag ttggttcaac aaagcattca
721 catcttcagg gtcgtccagt gcgccaattg ccttcactat tatatcacca gtcaactcca
781 ctctgtaagt agaaactttc tcagttccca taaaaaagtg gccgtctctt aacaccaatg
841 gctccccttg atcgactggc ccattgagtg tataggcata cttgtcactc aattcagata
901 aaaatccacc aaagctgaat cttacatccg tcagaattag gttgtcattt gtctccacaa
961 gatctgcatc catcatcagg gaagccatat taaaagtgcc ttgacctgtg aatatcggaa
1021 gtcctgcaat catggaaact ccatccaaca agaggtgact tccccccgac cctgacactg
1081 actgcgtcac atccaccacc tgtaactcta cgcttgcttc tttaagatca atgactttag
1141 aaacacatgt aacactaaat gagtaagtca ttttcctctt aaacctgtac ttagttgatg
1201 ttctagaatg ctccacctgg atcaaaaacc tcaagtcaca atcctcatgg tcatcaaagt
1261 tcccaagaag atccagcaag tcattgatgt gtctccctgg agccagtggt tgctctccaa
1321 ttgtgatttt gtcaatgtac aatctaacat cttcaagatt tgaaatcaca taacccattg
1381 ctatgtcttc aataaacatc gctctactga tcttctgatc cctaactaag aaggtgaaat
1441 ctgttaactc acccaattca atctgaccat catcttccag ctccaatact tcattaaacc
1501 aaaaaaacga ttttaggcaa gaagttgaca tcactaaagg gttcaccatc gatgatagga
1561 tgatttgaac catgaagttg taacaaacct gttgaggact cagtccaact tccttatcta
1621 cgtaatcaga ctgcctaacg tgtgccatgg cacagtcgct ctgtttgact atgtagacac
1681 tattgtctct ggctttcagt aggtggctga cgattcctag attctcattc atcaaatcca
1741 atgctatgca aacatctaag aacttcagct tcctcccatc cagtgttctg ttcacttgca
1801 aatcgctcat gaagggagac aagtgctcat caaataattt cgggtaattt ttccgaatgg
1861 aatacaatac atgattcata cccagttgat cttctaattg acgaacaatt ctaggcttaa
1921 cagcccagaa caaattcctg ttacttaaca ccactggaat ctgtggctct ttcacagatg
1981 agaataccca ctcccctatt tcgcaggcat tggtcaaaac taaagaaaag taatcccaaa
2041 gaataggttt ggaaaatttt tcaatttttt ccagtgaggg tctgcaataa atcacacctc
2101 ttttcatgtt gcacacctca cacagacagg gcttcacctt tgtataaata tctttaatgt
2161 aaaggaaacc gccctcacca tctctgacac atttgctgcc tatactttta caaaaaccta
2221 taaaaccaga tgccactgaa gactggaatg cagatttgtt tatggactcc gctaagatct
2281 ttttaacacc ctttgtgaag ttccttgaca acctggactg caatgtctta atcagtgttg
2341 gcacctcttc tctctcaaga acccttctgc tcgtcattaa cttggtcctc aatactagtc
2401 ttaggtcacc aaaacttgac aaattcaccc atctaaagcc taaggcacgc ataatatctt
2461 gctctgagca acttgaataa cgcaagaacc ccacaagggc gtcatcacca tctctaccaa
2521 taagactgat caggttttcc tcaaaaactt tgcctctttt tatatcatta aaaacccttt
2581 tagcacagct tctaatgaag cctgtcatct catcatcaga aaatattgat tcaatcttcc
2641 tctgcaatcg ataacccctc gtcccatcca cccaatcttt tacatcttgt ttctcaacag
2701 caccaaaagg ggtttccttg aagccggagt atttgatcag ctggttcacc ctttgagaga
2761 tctttgagac aatttggaca ctaacaccat ttgcgacaca ctggtcacag attgtgtcta
2821 ttgtttctga caactgtgtt ggtgttttgc atttcacatt gtgcaacgct gcagcaacaa
2881 attgagttaa tagcggggtt tcttccccca tcacaaaaaa tcttgatttg aattcagcta
2941 caaaggtgcc cgctacactc ttggggctaa caaatttgtt caattttgag gagaggtact
3001 catgaaaaca caacatttcc aaccactcat cattaacttg attatcatct gaagcacaag
3061 gcaatttgat aagtgtgacc tggtcatcgc tagaagtgta ggaagttgga atgacatcat
3121 atagtaaatc tagcacataa tttaagaact gttctgttat caatccatac aaatcagaca
3181 tgttatgcag gatgccttgg cccatatcta gaacagacat tatgtgactt ggtatctctc
3241 cttggactgg catgaattgg tggaagaatt cctctcccac gcttgagcta ccacaaccca
3301 tcaaccctaa gcttctcttt agattgccag tacagtaagc ctcagcaaca tttactggaa
3361 cttctataac tttatggagg tgccatctca gcacatttaa gactggttca agattaataa
3421 tgttcctttc ttttggggtc ctaagttcca aaaattggag gaatgtcaag aaaagcgcag
3481 gactcatggt tggtccccat ttagagtgat caagagaaca tgataggtca ccattgttca
3541 ctgctaactt catgtcacag atggctctat caaattcttt ttctgaactg agacatgtat
3601 atctcatgga actgccaact gcctctgcaa agtcttcaac taaccttgtt atcagcttag
3661 tgttaagatc cccaacgtaa agttccctat tggaacccac ttgctccttg taacttaagc
3721 caaactttaa gttaccatta tttgaactga tactggtaaa ctcagtagga gaatcatcag
3781 agtaaaagca caggttcctt agtgcagcat tggtaaaata actcctatcc aacttatctg
3841 ctatagcttc agaattgctc tctctcatgc taagtcttac ttcatctttc actctcagtg
3901 cttcattagt gaaaccaagc cttgatctac tcctatgttc ataacttcct agtttttgat
3961 cataatgacc ttgtatcaat aagtatttga agcactcaaa atactcttca tcaaaatatg
4021 cagaagtagt gagatttttc aaaagcaatc ctagaggaca aacttctaaa ggtctctcca
4081 agaaatactc tggtttgctt gaggagtaat agaccacatc acatagttct ttatacaagt
4141 ctgaagcgag tacatcagga tcaaagtctt tcacctcatg aagtgagacc tctgtcctaa
4201 tctctctcat aatcccaaaa ggaccccaaa gttcctcaag tgtacctgtg ctgcttttaa
4261 ccctgccttc cttttgattc ggacttagat tgacatctcc agatttggag tccttcattt
4321 tgaccattgc taattgcacg gaatttttta tatcattgaa agcttctgct tgctcttctg
4381 tcaacccttc aaacatcatt gacagctctt cacaatctct ctgggtttta gaaccaatac
4441 taacaagccc ggttaagttt tttagtatca agtaatctaa tgtacttgga tcaatgttgt
4501 atcttccctt tgttttgaac ctttccgcca tggaagaaac acatgaagaa atcatcttct
4561 ctctatcata taccgaaaga acattcccaa gttcatctag ttttggtact actacgctct
4621 tattgctaga gaggtctaaa gcagtgctag taatgtttgg cttgaatggg tcgttcttaa
4681 gctcacctgt gacttttagc tttcctttat taaataatga gatgcaataa ctcaacagtt
4741 cattgctgac ccctggtaat ttggtgtcag aacagtcttt gagggtcttt gaataaaatt
4801 tatccagccc gtccaacatc attccttctt gagcttctgt cagttgacca tttaaagatg
4861 ggttgactat aacacagtta aaatcaattt tgggttcaat gaatttctca aaacatttga
4921 tttggtctgt taatctgtca ggtgtttcct ttgttattaa atgacaaagg taacttacat
4981 ttaagagata cttgaacctc ctagcaaaat aggggtccga atcaacacct tcaccaagaa
5041 ggcattggaa aaggtccact gccagcctct gaactatgac ttcagagcca gacatacact
5101 ccacttttaa tttggacagt agatccacgt ggtggaattg atttgagtat gccatcaaaa
5161 aataccttag cccttgcaga aatacctgat ttcttttgga cggggcgcat aaaatgctta
5221 acaataaaag tcttagtttc tctttggata tcaccataag atcactgttg aagtccagcc
5281 aagttgtcat ctcatcacac atctcaacca acacctcttg tgaaaagatt ggcagaaaaa
5341 atctcttcgg gtcacagtaa aaagagccct tctcgacaag aacaccatct tcacatgtgt
5401 aaattgagta gcatcctgaa cgctcacctg tcttttgata aaaaagaata acaatcagtc
5461 catcagttaa aaagaatctc tgttggtagc actcccttag tctcacattg ccaaaatacc
5521 ttgagtattc tttttcatta ataagcaacc ttgaagaaaa cgaggttttc atagagttta
5581 ttaaggaaag agacactgag gagagtattt tgaaagcctc aacctgttca gcatctgaca
5641 tcatgtgaaa tgtcttatgt ttacattcag tgttaaagtt ccccccagca gagtatctca
5701 taatgggtct atcagaagat attgtctgag ctatagatat tagcttggat aaatcaggat
5761 atcttgccca caccgatatt tgtgaccaga tatcttgttg agacctcctc aagattttag
5821 agcaattctc cacccactta acattcgtct ctattttctt tttaattaat ttcaccgcct
5881 ccaaaaaatc caaggccctg tcattgacta gtgtgaccat ttgacccagg atgagaccag
5941 accttctcaa agagtccagt gcttcgggtt tggctgtaaa taggtcatct aaaaagttta
6001 ccattatgag atctagaaag agcaacgtat ttcttcgggt gttaaaaact tttaatccct
6061 taattttatt gcaaatcgac aacacgccta aataggcctt ctcatggggg cagacaacat
6121 actgatctaa ttcagagttt gatttctccc tgtcctgcct ttttagtgct gtgaaagtca
6181 aactatagat tatgtccatg aatttgttga cacaaaaggg acaatctgag gtggtgttat
6241 cctcatgttt ccctaaccta attaaattgt tcagaagctc tgatctatta gttatcctaa
6301 acttccccaa gtttttgtca aaaacttcct ttttgtacca gagtcttaat ttcattaatt
6361 cttccctaat cacagattct cgaacatcgc ctttgactcc aacaaataat gatttggcca
6421 gtctatcatc atactccaaa agatttcttt tgagacactc tatattctca aatcctaaac
6481 tttgattacg aacgtttgaa agactgtgta tcagacgttg atactcttga tcctcaatat
6541 caatctctgc ggacgatcgg gatttccgca aaatttcaat taacaaccat ctgatcctct
6601 ccaccaccca ttcaggagtg aacctattag aataattagt tctaccgtca acaacaggaa
6661 caagtgtgat gcctgacctc agaaggtctt gtttgagatt tagtagtttt gtcatgtcac
6721 tcttccattt tctctcaaat tcttctggat tgaccctcac aaatgtttcc aatagtatta
6781 atgtttttcc agtgagtttg aatccatcag gtatcacttt agggagagtg ggacatggga
6841 ttccatgatc actgaggatc acatcaacat atttttggca ggtgttcaag tggcatgcat
6901 gagattcaca ggactctaac tccacaaggt tggagagaag tttaaaacct tcttgaagca
6961 gtgaccctga atgacagtgt gacaggaagg catctttctg gtgggcaaac tcatgcctat
7021 ttggaatgtg ttttctaacc aagtctttga gttcagacac agtttcatcc atggtcaagt
7081 gccgcctagg atccactgtg cg
//
LOCUS DRHBCDA 922 bp ss-mRNA INV 25-MAY-1990
DEFINITION D.heteroneura bicoid (bcd) mRNA, 3' end.
ACCESSION M32125
KEYWORDS bicoid protein.
SOURCE D.heteroneura, cDNA to mRNA.
ORGANISM Drosophila heteroneura
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 922)
AUTHORS Macdonald,P.M.
TITLE Bicoid mRNA localization signal: Phylogenetic conservation of
functional and RNA secondary structure
JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
P.M.Macdonald, 14-FEB-1990.
FEATURES from to/span description
pept < 1 36 bicoid protein (bcd) (AA at 1)
BASE COUNT 309 a 157 c 140 g 316 t
ORIGIN
1 ggtggcaaca cgcagttcgc atactgtttc aattaggctg ctacattaat aataattgta
61 aagaaaaaaa aaaaaaaaaa acaatttcaa tgtttgtgga gtttatagct ttaaccgaca
121 cgttgtttcc tgattgtaca aagatctaat aattgtagat atccatatgc atagtattta
181 agccaattta acattgttta aaccaagtat gtaacgcata gttgtagctg ccgagttttt
241 taatgcattt tctactgaca taaactgaag tgccattatt tggaaacttt atttgtaaag
301 cgttctttga tctcaacgct gtctggctgg acatttgcca aagcccatga atgcccaacc
361 agacactgtt gagacgaata tgggctttaa ttgaacgctt tacagaagaa gtttataaat
421 acacataaaa tgcatttctt tcttataact caggccaact ggcatagcaa gtaaaactgt
481 actgatctaa atgaaaaatg tttctcttcc tgataatcgc acgtccttct atggacatta
541 tatgccatta tcagtcgaag atcgaacgtt taaccaataa tgttcttcag aatattttta
601 agctttgtta tgccacagtt tgcatgttgc aattaaacat tcttcacgtg agaattgttt
661 acaaattgtc tctaaacaca tgtgaaccac aggcctcact tatctaaacg aaatctgtac
721 ttactttcag aatgtaagat agttttatca attgaaaatg cattccaaat atgtatctca
781 gttacccttg aatggtgcac catttaaata ttattctaaa cgtaacttat ctatatatct
841 gtatgaaata gtatatttgg ggatatattt aaaatattca gagtcctgtt ttgtatacta
901 ttttaataaa tcttacaaac ta
//
LOCUS DRQBCDA 987 bp ss-mRNA INV 25-MAY-1990
DEFINITION D.sechellia bicoid (bcd) mRNA, 3' end.
ACCESSION M32124
KEYWORDS bicoid protein.
SOURCE D.sechellia, cDNA to mRNA.
ORGANISM Drosophila sechellia
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 987)
AUTHORS Macdonald,P.M.
TITLE Bicoid mRNA localization signal: Phylogenetic conservation of
functional and RNA secondary structure
JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
P.M.Macdonald, 14-FEB-1990.
FEATURES from to/span description
pept < 1 90 bicoid protein (bcd) (AA at 1)
BASE COUNT 301 a 204 c 169 g 313 t
ORIGIN
1 tatcaatgca cgatggatac gataatgcaa gcgtataatc cccatcggaa cgccgcgggc
61 aactcgcagt ttgcctactg cttcaattag cctggacgag aggcgtgtta gagagtttca
121 ttagctttag gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat
181 ctacgcgtag aaagttaggt ctagtcctaa gatccgtgta aatggttccc agggaagttt
241 tatgtactag cctagtcagc aggcgacacg gattccagtg catatcttag tgatactcca
301 gttaactcca tactttccct gcaatacgct attcgcctca gatgtatctg ggtggttgct
361 ccattaaagc ccaggaatat ccagccagtt acatttgagg ccatttgggc ttaagcgtat
421 tccatggaaa gttttcgccc cacatttcgg aaattaaatt ccgagccatc aagaaaatct
481 tctctgttac aatttgacat agctaaaaac tgtactaatc aaaatgaaaa atgtttctct
541 tgggcgtaat ctcatacaat gattaccctt aaagatcgaa catttaaaca ataatatttg
601 atatgatatt ttcaatttct atgctatgcc aaagtgtctg acataatcaa acatttgcac
661 attctttgac caagaatagt cagcaaattg tattttcaat caatgcagac catatgttcc
721 agtttcggag attttttgct gccaaacgga atacttataa aaacccaaat tctatttaca
781 tcactaagaa gagcattgca atctgtttag cctcaagttt aattttaaaa tggtgcacct
841 ttgacgctgt ctctttaagc tttgtatttt taattacgaa aatatataag aactactcca
901 ctcgggtaaa ttgtgactaa ctacacaact taacaaattt cggaattata cttagatact
961 tatcccatat ttccgtccct ttctaga
//
LOCUS DRSBCDA 978 bp ss-mRNA INV 25-MAY-1990
DEFINITION D.simulans bicoid (bcd) mRNA, 3' end.
ACCESSION M32123
KEYWORDS bicoid protein.
SOURCE D.simulans, cDNA to mRNA.
ORGANISM Drosophila simulans
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 978)
AUTHORS Macdonald,P.M.
TITLE Bicoid mRNA localization signal: Phylogenetic conservation of
functional and RNA secondary structure
JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
P.M.Macdonald, 14-FEB-1990.
FEATURES from to/span description
pept < 1 90 bicoid protein (bcd) (AA at 1)
BASE COUNT 296 a 203 c 171 g 308 t
ORIGIN
1 tatcaatgca cgatggatac gataatgcaa gcgtataatc cccatcggaa ccgcgcgggc
61 aactcgcagt ttgcctactg cttcaattag cctggacgag aggcgtgtta gagagtttca
121 ttagctttag gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat
181 ctacgcgtag aaagttaggt ctagtcctaa gatctgtgta aatggttccc agggaagttt
241 tatgtactag cctagtcagc aggcgacacg gattccagtg catatcttag tgatactcca
301 gttaactcca tactttccct gcaatacgct attcgcctca gatgtatttg ggtggttgct
361 ccactaaagc ccaggaatat ccagccagtt acatttgagg ccatttgggc ttaagcgtat
421 tccatggaaa gttttctccc cacatttcgg aaattaaatt ccgagccagc aagaaaatct
481 tctctgttac aatttgacat ggctaaaaac tgtactaatc aaaatgaaaa atgtttctct
541 tgggcgtaat ctcatacaat gattaccctt aaagatcgaa catttaaaca ataatatttg
601 atatgatatt ttcaatttct atgctatgcc aaagtgtctg acataatcaa acatttgcac
661 attctttgac caagaatagt cagcaaattg tattttcaat caatgcagac catatgttcc
721 agtttcggag attttttgct gccaaacgga atacttataa aaacccacat tctatttaca
781 tcactaagaa gagcattgca atctgtttag cctcaagttt aattttaaaa tggtgcacct
841 ttgacgttgt ctctttaagc tttgtatttt taattacgaa aatatataag aactactcca
901 ctcgggtaaa ttgtgactaa ctacacaact taacaaattc cgaaggatac ttagcccata
961 tttccgtccc tttctaga
//
LOCUS DRTBCDA 963 bp ss-mRNA INV 25-MAY-1990
DEFINITION D.teissieri bicoid (bcd) mRNA, 3' end.
ACCESSION M32121
KEYWORDS bicoid protein.
SOURCE D.teissieri, cDNA to mRNA.
ORGANISM Drosophila teissieri
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 963)
AUTHORS Macdonald,P.M.
TITLE Bicoid mRNA localization signal: Phylogenetic conservation of
functional and RNA secondary structure
JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
P.M.Macdonald, 14-FEB-1990.
FEATURES from to/span description
pept < 1 90 bicoid protein (bcd) (AA at 1)
BASE COUNT 281 a 195 c 168 g 319 t
ORIGIN
1 tatcaatgta cgatggatac gataatgcaa gcgtataatc cccatcgtaa cgccgcgggc
61 aactcgcagt ttgcctactg cttcaattag cctggacgaa cggcgtatta gagagtttta
121 ttagccttat gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat
181 ctacgcgtag aaagttaggt ctagtcctaa tggttcccag ggaagtttta tgtagtagac
241 tagtcagcag gctacacgga ttccagtgca cgtcataggg ataccctagt ttactcgata
301 ctttccctgt aatacgcttt tcgcctcaat tgtatctggg tggctgctcc ataaagcccg
361 ggaatatcca accagttaca tttgaagcca tttgggcttt agcgtattcc acagaaagtt
421 atcgtcccac aacttatatt ccgagccagc aagaaaatct tctctgtaac tatttgacat
481 agctaaaaac tgtactaatc aaaatgaaaa atgtttctct tgggcgtaat cttatacaat
541 gattaccctc aaagatcgaa catttaagca ataatatttg atatgatatt ttcaatttct
601 atgctatgcc aaagcgtctg acataatcaa acatttgcac attcttcgac caagaatagt
661 cagcaaattg tattttcaat caatgcagat catgtgttcc agtttcggag attttttgct
721 gtcaacggaa taactatcct acccacactg tctatttcta tcacttagaa gagcattgaa
781 atctgttttg tttagcctca actttaattt taaaatggtg cacctttgat gttgtctttt
841 taagatttgt atttttaatt atgaaaatat ataagaacta ctccactcgg gtaatttgtg
901 actaacttag tattttaaca aatttcaagg gacgcgtagg cattctttcc ctctctttct
961 aga
//
LOCUS DRVBCDA 1023 bp ss-mRNA INV 25-MAY-1990
DEFINITION D.virilis bicoid (bcd) mRNA, 3' end.
ACCESSION M32122
KEYWORDS bicoid protein.
SOURCE D.virilis, cDNA to mRNA.
ORGANISM Drosophila virilis
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1023)
AUTHORS Macdonald,P.M.
TITLE Bicoid mRNA localization signal: Phylogenetic conservation of
functional and RNA secondary structure
JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
P.M.Macdonald, 14-FEB-1990.
FEATURES from to/span description
pept < 1 90 bicoid protein (bcd) (AA at 1)
BASE COUNT 326 a 176 c 160 g 361 t
ORIGIN
1 tatcaatgta cgatggatac gataatgcaa gcgtataatc cgcatcgtaa tgccggtggc
61 aacacgcagt tcgcctactg ttttaattag atcgggacac caggcgcccc atataatata
121 ataattattt taagtacaca aattttaatg tttgtggtgt ttctttagct ttaagtgaca
181 cgttagttcc tgattgtaca aagatttagt gattgtagat atctatgcgt agtattttaa
241 gcaaattatc aatattttaa atgaattttg taaagcttaa ttaagtcata gctgtaattt
301 tccaagcatt ttcaatgtat ttccataaga aacttccttg gtaaaacctt cattggtctc
361 aactctgtct ggctgggcat ttgccaaagc ccatgagtgc ccatccagac atacttgaga
421 caataatggg cttaagctga aggttttacc gaagaagttt tcgtatgaaa atctcatgta
481 agattcattt ctataatttt ggcaccttgg catagcttac aaaaaaactg cactgatcta
541 aatggaaaat gtttctcttc ctgataatcg cacgcctttc tttttggcct gccattatca
601 gccgaagatc gaacatttaa acaataatac tcgtcagaat atttttaatt cttgctatgc
661 cccggttttc atgttacaat tgaacattct tcacgtgaga attgttcaaa tattgtacca
721 caatacatga ataccacatg cctaagttat cgaaacgaat actatatatt actggaggat
781 tttcgttttt ttcttataat tgaaaatggg ttgaaaattt accagttacc ttacgaatcg
841 tgcacctttc aactgatgtt ttaaagctaa gttatcttta atatagtgca tttgggaata
901 tatattaaat gttcaataca atatttccat aaatttatat attaataaat cttacaaact
961 aaccaatttt aagaagtgtt ttgcttttaa cgttttacga taaacaaagt aaacgctttc
1021 gaa
//
LOCUS M33420 71 bp ds-DNA BAD 25-MAY-1990
DEFINITION Figure 3. Nucleotide sequence of junctions in the multicloning
site and promoter probe plasmids.
ACCESSION M33420
KEYWORDS .
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 71)
AUTHORS Joseph-Liauzun,E., Fellay,R. and Chandler,M.
TITLE Transposable elements for efficient manipulation of a wide range of
gram-negative bacteria: Promoter probes and vectors for foreign
genes
JOURNAL Gene 85, 83-89 (1989)
STANDARD simple staff_entry
BASE COUNT 20 a 15 c 13 g 23 t
ORIGIN
1 aattcggtaa tgactccaac ttattgatag agtactcata tgtctagact cgaggatccc
61 gtcgttttac a
//
LOCUS MUSTCM1AA 805 bp ds-DNA ROD 25-MAY-1990
DEFINITION Mouse T-cell receptor germline alpha-chain V-region, 5' end.
ACCESSION M33586
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; germline;
variable region.
SOURCE Mouse (strain C.B20) lymphocyte DNA, clone p2111.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 805)
AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
receptor secondary rearrangements in a murine T cell lymphoma
JOURNAL J. Immunol. 144, 1094-1103 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 188 > 526 T-cell receptor alpha-chain V-region
sigp 188 244 T-cell receptor alpha-chain V-region signal
peptide
matp 245 526 T-cell receptor alpha-chain V-region
BASE COUNT 169 a 218 c 203 g 215 t
ORIGIN
1 caagaatcag atattcaaaa ccttccaggc tcgtgtttgc agcgtcagtt tttatgtgaa
61 cgggagactt ctccagcaga ctcttcccta cacctgcttc aagctcttgc tttcctcagc
121 ctctttcctg cctgtactgt tccagagttc ctccacgaca gagctgcagc cttcccaagg
181 ctcagccatg ctcctggcgc tcctctcagt cctggggata cactttctcc tgagagatgc
241 ccaagctcag tcagtgacac agcccgatgc tcgcgtcact gtctctcaag gagcctctct
301 gcagctgaga tgcaagtatt cctactctgg gacaccttat ctgttctggt atgtccagta
361 cccgcggcag gggctgctgc tgctcctcaa gtactattcc ggagacccag tggttcaagg
421 agtgaacggc tttgaggctg agttcagcaa gagtaactct tccttccacc cgcggaaagc
481 ctccgtgcac tggagcgact cggctgtgta cttctgtgct gtgagcgcac agtgtgtggg
541 gctgcagggg gagctgaaca caaacaccga tggctgtggt gctcagactc aagatctctc
601 ctgtggtctc tgtggcatct ctttggactt ggagagcaaa gaaggaaagt ttcgcagtct
661 cagctgtctt tctgcaactc atttccaact aaaacatcca ctatatgcaa tgctcaattg
721 tagttctgtg gaatgaggag actgtgatgt tatcctgagt gtatggctga gtgaacattg
781 cttgggcagt gactagacta gggta
//
LOCUS MUSTCM1AB 265 bp ss-mRNA ROD 25-MAY-1990
DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
partial cds.
ACCESSION M33587
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region;
joining exon; variable region.
SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 265)
AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
receptor secondary rearrangements in a murine T cell lymphoma
JOURNAL J. Immunol. 144, 1094-1103 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 265 T-cell receptor alpha-chain V-region (AA at 2)
BASE COUNT 54 a 78 c 67 g 64 t 2 others
ORIGIN
1 cctctcctgg tacatccagc accatggccg tggcctccag tttctcctca agtactattc
61 gggaaaccca gtggttcaag gagtgaacgg cttcgaggct gagttcagca agagcgactc
121 ttccttccac cttcggaaag cctccgtgca ctggagcgac tcggctgtgt acttctgtgc
181 tgcgaccngn ttatctggta gcttcaataa gttgaccttt ggagcaggga ccagactggc
241 tgtgtgccca tacatccaga accca
//
LOCUS MUSTCM1AC 163 bp ss-mRNA ROD 25-MAY-1990
DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
partial cds.
ACCESSION M33588
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region;
joining exon; variable region.
SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-6.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 163)
AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
receptor secondary rearrangements in a murine T cell lymphoma
JOURNAL J. Immunol. 144, 1094-1103 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 163 T-cell receptor alpha-chain V-region (AA at 3)
BASE COUNT 42 a 42 c 44 g 35 t
ORIGIN
1 tccacttgca gaaagcctca gtgcaagagt cagactcggc tgtgtactac tgtgctctga
61 ggccggggac acaggttgtg gggcagctca ctttcgggag cggtacaaga ctccaagttt
121 atgcaaacat ccagaaccca gaacctgctg tgtaccagtt aaa
//
LOCUS MUSTCM1AD 287 bp ss-mRNA ROD 25-MAY-1990
DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
partial cds.
ACCESSION M33589
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region;
joining exon; variable region.
SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 287)
AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
receptor secondary rearrangements in a murine T cell lymphoma
JOURNAL J. Immunol. 144, 1094-1103 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 287 T-cell receptor alpha-chain V-region (AA at 3)
BASE COUNT 69 a 88 c 67 g 63 t
ORIGIN
1 ctattgcctt tttctggtat gtgcaatatc tcaacgaagc ccctcaggta ctcctgcgga
61 gctccacaga caacaagagg accgagcacc aagggttcca cgccactctc cataagagca
121 gcagctcctt ccatctgcag aagtcctcag cgcagctgtc agactctgcc ctgtactact
181 gtgctctgat ccggggacac aggttgtggg gcagctcact ttcgggagcg gtacaagact
241 ccaagtttat gcaaacatcc agaacccaga acctgctgtg taccagt
//
LOCUS MUSTCM1AE 283 bp ss-mRNA ROD 25-MAY-1990
DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
partial cds.
ACCESSION M33590
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region;
joining exon; variable region.
SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-6.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 283)
AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
receptor secondary rearrangements in a murine T cell lymphoma
JOURNAL J. Immunol. 144, 1094-1103 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 283 T-cell receptor alpha-chain V-region (AA at 1)
BASE COUNT 73 a 88 c 62 g 60 t
ORIGIN
1 attgcctttt tctggtatgt gcaatatctc aacgaagccc ctcaggtact cctgcggagc
61 tccacagaca acaagaggac cgagcaccaa gggttccacg ccactctcca taagagcagc
121 agctccttcc atctgcagaa gtcctcagcg cagctgtcag actctgccct gtactactgt
181 gctctgagaa cctcaggagg aaactacaaa cctacgtttg ggaaagggac cagcctcgtg
241 gttcatccat acatccagaa cccagaacct gctgtgtacc agt
//
LOCUS MUSTCM1AF 246 bp ss-mRNA ROD 25-MAY-1990
DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
partial cds.
ACCESSION M33591
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region;
joining exon; variable region.
SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-7.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 246)
AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
receptor secondary rearrangements in a murine T cell lymphoma
JOURNAL J. Immunol. 144, 1094-1103 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 246 T-cell receptor alpha-chain V-region (AA at 3)
BASE COUNT 69 a 80 c 53 g 44 t
ORIGIN
1 aatatctcaa cgaagcccct caggtactcc tgcggagctc cacagacaac aagaggaccg
61 agcaccaagg gttccacgcc actctccata agagcagcag ctccttccat ctgcagaagt
121 cctcagcgca gctgtcagac tctgccctgt actactgtgc tctgagaacc tcaggaggaa
181 actacaaacc tacgtttggg aaagggacca gcctcgtggt tcatccatac atccagaacc
241 cagaac
//
LOCUS SYNCATAA 66 bp ds-DNA SYN 25-MAY-1990
DEFINITION Synthetic chloramphenicol acetyl-transferase (cat) gene, partial
cds.
ACCESSION M33419
KEYWORDS chloramphenicol acetyl-transferase.
SOURCE Synthetic DNA, clones pEJL4[30,49].
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 66)
AUTHORS Joseph-Liauzun,E., Fellay,R. and Chandler,M.
TITLE Transposable elements for efficient manipulation of a wide range of
gram-negative bacteria: Promoter probes and vectors for foreign
genes
JOURNAL Gene 85, 83-89 (1989)
STANDARD simple staff_entry
BASE COUNT 16 a 12 c 18 g 20 t
ORIGIN
1 aattcggtga tgctgccaac ttactgattt agtggatccg gtgattgatt gagcaagctt
61 cgacga
//
LOCUS SYNIS1AA 65 bp ss-mRNA SYN 25-MAY-1990
DEFINITION Synthetic insertion element IS1.
ACCESSION M33418
KEYWORDS .
SOURCE Synthetic DNA.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 65)
AUTHORS Joseph-Liauzun,E., Fellay,R. and Chandler,M.
TITLE Transposable elements for efficient manipulation of a wide range of
gram-negative bacteria: Promoter probes and vectors for foreign
genes
JOURNAL Gene 85, 83-89 (1989)
STANDARD simple staff_entry
BASE COUNT 21 a 12 c 11 g 21 t
ORIGIN
1 aattcggtaa tgactccaac ttattgatag agtactcata tgtctagact cgaggatcca
61 aattt
//GenBank-Updates@genbank.bio.net (05/26/90)
LOCUS ECOSPEA 3236 bp ds-DNA BCT 26-MAY-1990
DEFINITION E.coli arginine decarboxylase (speA) gene, complete cds, agmatinase
(speB) and methionine adenosyltransferase (metK) genes, 5' end.
ACCESSION M31770
KEYWORDS agmatinase; arginine decarboxylase; metK gene;
methionine adenosyltransferase; speA gene; speB gene.
SOURCE E.coli (strain K12) DNA, clones pLC2-5 and lambda-[1H10,23G45].
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 3236)
AUTHORS Moore,R.C. and Boyle,S.M.
TITLE Nucleotide sequence and analysis of the speA gene encoding arginine
decarboxylase of Escherichia coli
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
R.C.Moore, 02-FEB-1990.
FEATURES from to/span description
pept 192 < 1 (c) methionine adenosyltransferase (metK)
pept 987 2963 arginine decarboxylase (speA) (EC 4.1.1.19)
pept 3101 > 3236 agmatinase (speB)
signal 3030 3067 rho-independent transcription terminator
signal 811 816 -35 region
signal 839 844 -10 region
binding 977 980 ribosome binding site
site 1878 1886 pyridoxal phosphate binding site (put.)
BASE COUNT 743 a 784 c 860 g 849 t
ORIGIN 62.9 min on K12 map.
1 tacccaaggt cgctggtggt gatttcgccg ccaactaaaa ccaatgccgg tttttacgta
61 ggtttcgcaa gcaacgcgtg ctttcggatc ctgttcgagg atcgcgtcta aaacggcatc
121 agaaatttgg tcagcaattt tgtcaggatg cccttcagag acggactcgg acgtaaaaag
181 gtgttttgcc atatttaata tcacctaaag agaatttggt tagctcaaac tgttgtgtgg
241 attttctgtg gtagcggatc ctaccacgac tctgcaggtt aaaaacactg gcagtctgag
301 tgttaatcgg tatggatgga ttaacatctg gatggctatt ttaggtcaat tcttcaccct
361 atttccactt ttttttgaat cgtgtctcat tctgttaaaa acgtggctgg aaatttttcc
421 tgacaatgcc ggcattctgc gtatttatct tttgcaattt tctgccattg tggggtataa
481 aacgcggcgc gcggcttaaa taaaaagcac acgacgtttc tttcgtgttg ccacttccag
541 ccgggttcaa atcagagttt tggcttgtgg gttcgtctta acaggcggcc gtggaggtga
601 tacgaaataa tgaaccgttg tctgctgctt aacctgtctc accgttctgg tgaagattcg
661 ttccccgcac tctgcatctc tgctttgcat acctgccgat gttataccca tctcggcgct
721 tctcaggatt caagagctgg ttacagttac tgaggactga acaagggcgc tcttgtaaaa
781 acaagagttt tctcgtggtt tcgccgaact ttcacactta cgttcggtta tgtgcttaat
841 aatgttatga aaaagaaacc ggttgcgcag ttggagcgtc agcattcact gctggaaaat
901 ccatgtgctt atgggttgtt atcgcagttc caggctgcga tagtcgttaa ctgttttaca
961 cttaataaaa taatttgagg ttcgctatgt ctgacgacat gtctatgggt ttgccttcgt
1021 cagcgggcga acacggtgta ctacgctcca tgcaggaggt tgcaatgagc tcccaggaag
1081 ccagcaagat gctgcgtact tacaatattg cctggtgggg caataactac tatgacgtta
1141 acgagctggg ccacattagc gtgtgcccgg acccggacgt cccggaagct cgcgtcgatc
1201 tcgcgcagtt agtgaaaact cgtgaagcac agggccagcg tctgcctgca ctgttctgtt
1261 tcccacagat cctgcagcac cgtttgcgtt ccattaacgc cgcgttcaaa cgtgcgaggg
1321 aatcctacgg ctataacggc gattacttcc ttgtttatcc gatcaaagtt aaccagcacc
1381 gccgcgtgat tgagtccctg attcattcgg gcgaaccgct gggtctggaa gccggttcca
1441 aagccgagtt gatggcagta ctggcacatg ctggcatgac ccgtagcgtc atcgtctgca
1501 acggttataa agaccgcgaa tatatccgcc tggcattaat tggcgagaag atggggcaca
1561 aggtctatct ggtcattgag aagatgtcag aaatcgccat tgtgctggat gaagcagaac
1621 gtctgaatgt cgttcctcgt ctgggcgtgc gtgcacgtct gcgttcgcag ggttcgggta
1681 aatggcagtc ctccggcggg gaaaaatcga agttcggcct ggctgcgact caggtactgc
1741 aactggttga aaccctgcgt gaagccgggc gtctcgacag cctgcaacta ctgcacttcc
1801 acctcggttc gcagatggcg aatattcgcg atatcgcgac aggcgttcgt gaatccgcgc
1861 gtttctatgt ggaactgcac aagctgggcg tcaatattca gtgcttcgac gtcggcggcg
1921 gtctgggcgt ggattatgaa ggtactcgtt cgcagtccga ctgttcggtg aactacggcc
1981 tcaatgaata cgccaacaac attatctggg cgattggcga tgcgtgtgaa gaaaacggtc
2041 tgccgcatcc gacggtaatc accgaatcgg gtcgtgcggt gactgcgcat cacaccgtgc
2101 tggtgtctaa tatcatcggc gtggaacgta acgaatacac ggtgccgacc gcgcctgcag
2161 aagatgcgcc gcgcgcgctg caaagcatgt gggaaacctg gcaggagatg cacgaaccgg
2221 gaactcgccg ttctctgcgt gaatggttac acgacagtca gatggatctg cacgacattc
2281 atatcggcta ctcttccggc atctttagcc tgcaagaacg tgcatgggct gagcagcttt
2341 atttgagcat gtgccatgaa gtgcaaaagc agctggatcc gcaaaaccgt gctcatcgtc
2401 cgattatcga cgagctgcag gaacgtatgg cggacaaaat gtacgtcaac ttctcgctgt
2461 tccagtcgat gccggacgca tgggggatcg accagttgtt cccggttctg ccgctggaag
2521 ggctggatca agtgccggaa cgtcgcgctg tgctgctgga tattacctgt gactctgacg
2581 gtgctatcga ccactatatt gatggtgacg gtattgccac gacaatgcca atgccggagt
2641 acgatccaga gaatccgccg atgctcggtt tctttatggt cggcgcatat caggagatcc
2701 tcggcaacat gcacaacctg ttcggtgata ccgaagcggt tgacgtgttc gtcttccctg
2761 acggtagcgt agaagtagaa ctgtctgacg aaggcgatac cgtggcggac atgctgcaat
2821 atgtacagct cgatccgaaa acgctgttaa cccagttccg cgatcaagtg aagaaaaccg
2881 atcttgatgc tgaactgcaa caacagttcc ttgaagagtt cgaggcaggt ttgtacggtt
2941 atacttatct tgaagatgag taagtcctgt gttacttgaa tccgcttaat ttagcggtga
3001 taatccgcca caatttattg tgacaaatcc aacccttcct cgtcgggcct aacgacgcgg
3061 aagggttttt ttatatcgac tttgtaatag gagtccatcc atgagcacct taggtcatca
3121 atacgataac tcactggttt ccaatgcctt tggtttttta cgcctgccga tgaacttcca
3181 gccgtatgac agcgatgcag actgggtgat tactggcgtg ccgttcgata tggcca
//
LOCUS FIBEGASE 2310 bp ds-DNA BCT 26-MAY-1990
DEFINITION F.succinogenes endoglucanase 3 (cel3) gene, complete cds.
ACCESSION M29047 M29681
KEYWORDS cellobiosidase; endoglucanase.
SOURCE F.succinogenes (strain S85, ATCC 19169) DNA.
ORGANISM Fibrobacter succinogenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 2310)
AUTHORS McGavin,M.J., Forsberg,C.W., Crosby,B., Bell,A.W., Dignard,D. and
Thomas,D.Y.
TITLE Structure of the cel-3 gene from Fibrobacter succinogenes S85 and
characteristics of the encoded gene product, endoglucanase 3
JOURNAL J. Bacteriol. 171, 5587-5595 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Dignard, 14-OCT-1989.
FEATURES from to/span description
pept 177 2153 endoglucanase 3 precursor
sigp 177 245 endoglucanase 3 signal peptide A (alt.)
sigp 177 251 endoglucanase 3 signal peptide A' (alt.)
matp 246 2150 endoglucanase 3 A (alt.)
matp 252 2150 endoglucanase 3 A' (alt.)
site 167 172 ribosome binding site
site 2172 2213 region of dyad symmetry
BASE COUNT 649 a 653 c 529 g 479 t
ORIGIN
1 ggatccgggt gcgtcagtta aataaaatat tttttaacgt ttttcgtaca gaaagtggac
61 ttttagacca aaacacttat tacacttttt attccgatat atcattttac atagcataaa
121 accgaccccc aaatatatct ttggtaaaaa agaaaaaatc accttaagag ggttttatgc
181 aactcaagaa tttctatccc aaaatgagcg ttctcggtat cgcaaccgtg atggcactta
241 ccgcctgtgg cgatgaaaat acccaggcac tgttcgccaa caatccggtt ccgggtgccg
301 aaaatcaggt tccggtttct agcagcgaca tgagcccgac ctctagcgac gctgtcattg
361 acccgacctc cagctctgcc gcagtggtcg acccgtctac gctccctgca gaaggtccta
421 ttaccatgcc ggaaggtctc ggcactttgg tcgatgactt tgaagatggc gataacttga
481 gcaaaatcgg tgattactgg tacacctaca acgataacga caacggtggt gcatccatca
541 tcacgactcc gctaaacgaa gaagaaaaca tcatcccggg ccgcgtcaac aacggttcca
601 actacgcctt gcaagtcaac tacacgcttg atagaggcga ttacgaatac gatccgtacg
661 taggctgggg cgtgcaggtc gcaccggacg aagccaacgg acatttcggc ggccttacct
721 actggtacaa gggcggcgca cacgaagtac atatcgaaat caccgacgtc gaagactacg
781 acgtgcatct cgccaagttc ccggcatccc gcacatggaa gcaggctgtc gtccgcttca
841 aggacctcgt tcaaggtggc tggggcaagg aaattccgtt cgacgccaag cacatcatgg
901 caatcagctt ccaggccaag ggaaacaaga gcaagctcgt gaccgactcc ctcttcatcg
961 acaacatcta cctgcaggat tcttccgaag ttgaaaagga ccagccggat atggaaatca
1021 aggacccggt cattccggtc gttgaattta ccgaagctga aatcactgtg acgaacccgt
1081 tgcaggaaaa ggccatgaag tacctcaaca agggtgtcaa ctttaccaac tggctcgaaa
1141 acgcagatgg caagttcaag tcctttgaat tgggcgaaag cgacgtcaag attcttgccg
1201 acaacggatt caagagcctc cgcttgccga ttgaccttga cctctatgcc acaaaccgtg
1261 acgcattcat cgcaggcacc gacacagaac tcaagttcga tgacgacacc ttgttcctgg
1321 ttctcgactc cttcgtagaa tggaccgcca agtacaacat gtctttcgtg attgactacc
1381 atgaatatga caacagctac aacaccacca gcgctaagga ccccaactac atcaagatga
1441 tggcagaaac gtggaagcat gttgcagccc actacgccga aagcccccgc gaagacttgt
1501 tcttcgaact cttgaacgaa ccggacatga gcgatggtaa ggtcactgca gcaacatgga
1561 ccaccgcagc ccaggccatg attgacgcca tccgcacggt tgataccaag cacaccatcc
1621 tcttcggtga tgcccagtgg tactccatca cgctcctcgc caagcgcact ccgttcaccg
1681 atgacaacat catctacgtg atccacacct acgaaccgtt cgccttcacg catcagggcg
1741 gttcctggac ggactacgcc accatccacg atattccgtt cccctacgat ccggcaaagt
1801 ggtctacggt ttctggcgac ttcggtgtca acaagagcac aaagtcctac gtgaaaacca
1861 acatcaagaa ctactacaag accggcagca aggaagccat cttggaacag attctcaagg
1921 ccaagaagtg ggccgccacc aacaacgtac cggtgatcat caacgaattc ggcgcattga
1981 acctccgctc taccgctgaa tcccgcctca actacctcac ggccatgcgc gaaatctgcg
2041 ataccctcca gattccttgg acgcactggg gctacaccgg caacttctcc gtgatcgaaa
2101 acggcaagtt gattgaaggc ctcgacaagg cactcggcgt cggtagcaaa taagtctctc
2161 cttaaaaccc cctcaaaaaa aggtcacgca gaaatgcgtg gcttttttag taggaagtag
2221 acggtaggaa gttggaagtt agaagtagga agtaacagga atggcgcaat ggatacagtt
2281 gacacagata cattacaaaa ccccggatcc
//
LOCUS SFSSA 1747 bp ss-RNA VRL 26-MAY-1990
DEFINITION Sandfly fever sicilian virus S RNA encoding N protein, complete
cds, and NS-s protein, complete cds.
ACCESSION J04418
KEYWORDS N protein; NS-s protein.
SOURCE Sandfly fever Sicilian virus, cDNA to viral RNA.
ORGANISM Sandfly fever Sicilian virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Bunyaviridae; Uukuvirus.
REFERENCE 1 (bases 1 to 1747)
AUTHORS Marriott,A.C., Ward,V.K. and Nuttall,P.A.
TITLE The S RNA segment of Sandfly fever sicilian virus: Evidence for an
ambisense genome
JOURNAL Virology 169, 341-345 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 693 to 695)
AUTHORS Marriott,A.C., Ward,V.K. and Nuttall,P.A.
JOURNAL Unpublished (1990) Oxford, UK
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A.C.Marriott 12-JAN-1989.
FEATURES from to/span description
pept 42 782 N protein
pept 1727 924 (c) NS-s protien
revision 693 695 ttc in [2]; tc in [1]
BASE COUNT 483 a 358 c 459 g 447 t
ORIGIN Unreported.
1 acacaaaggt ccctagttaa tctgagtgag ctaagtttga aatggacgag taccagaaaa
61 ttgctgttga gtttggagag caggctattg atgagactgt gatccaggat tggctacaag
121 catttgcgta tcaaggattt gatgccagaa caattataca caaccttgtg cagcttggag
181 ggaagagttg ggaagaggat gccaagaaga tgatcatcct atccctaact cgtggcaaca
241 agcccaagaa gatggttgag agaatgtctc cagagggagc aagagaagtt aagagcctgg
301 ttgcaaagta taagatagta gagggcagac caggcaggaa tggaattacc ctgtcaaggg
361 tgctgcagcc ctggctgggt ggacagtcca agctgtggaa gtggttgaaa acttcttacc
421 agtcccaggg agcacaatgg accgcattgt gtggacaaac ataccccagg cagatgatgc
481 atccaagctt tgccggtctc attgacccaa gcctcgacca ggaggatttt aatgcagtat
541 tggatgctca caaacttttc ttgttcatgt tttccaaaac aatcaatgtc agcctccgcg
601 gtgcgcagaa gagagacatt gaggaatcat tctctcaacc aatgcttgct gctatcaata
661 gctcattcat tgacaacact cagaggaggg cattcttgac taagtttggg atcctaactt
721 ctggagcaag agctacagca gttgtaaaga agattgcaga agtttacagg aaactagagt
781 aagctgctgc tagtgtgggg tgggatgggg attctgggtt gggggttctg gggtggaggg
841 tggctaggtg gggggtggca agggtggatt cggtttgggt tggggtcatg gggaggggtg
901 ggtctggggc tgggcagcgg agatcaaaag tcagagtcag acgagctctc atcattttca
961 tccacatgac tgtgtattgg ggtccaaaga gaattgccat actcggtgag gccagtagaa
1021 gggtcacttg ctctatagga tctaatcact gttcttacat caagtgcctc cccagaggag
1081 gcagtgtcaa aaggctctgc attgataagt ctgagacaaa ccagagatcc tatctctcta
1141 aatagatcgt atccattgta atgctcatca ctaagaccca acctcctagc ttcttgtagt
1201 atctttttgt gtgcctgaac tatgcactca tccaagctat gtgaatcccc cattctcaga
1261 atgtaagaca ttagctgatc ccttgtttgt agccctctca caaatctatc actgcatatg
1321 ctaaagatct cacaatcagg gatacctagt ggccagctaa gagccttcag gacatttggc
1381 agcccctttc tagagaaact tgtgaggtca aacctggaga ggtcacttgc cataccttgg
1441 aaggtataca tcataggctt gacagaacta aaatagcatg ctgggcccca agaagctggc
1501 aactctccaa gggaataaaa gtcagccagt gagtttctgc gtccaaaccc aagtcttaac
1561 ttctctagtg gtatttcaca atgctcataa gttgaaacgt catgagtgtg aaatttattg
1621 taggcaacat aagacacact ggagaggagt ctatgacacc tcacatcaat gttaattgcc
1681 gggtagtcaa acatgtactg gctgttcatc atgttgttgt tgatcattga ctagggggtc
1741 tttgtgt
//
LOCUS RSSB800AB 437 bp ds-DNA BCT 26-MAY-1990
DEFINITION R.sphaeroides B800-850 alpha and beta subunits of major
light-harvesting complex.
ACCESSION X05200
KEYWORDS light-harvesting complex.
SOURCE Rhodobacter sphaeroides.
ORGANISM Rhodobacter sphaeroides
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 437; no enum.)
AUTHORS Ashby,M.K., Coomber,S.A. and Hunter,C.N.
TITLE Cloning,nucleotide sequence and transfer of genes for the B800-850
light harvesting complex of Rhodobacter sphaeroides
JOURNAL FEBS Lett. 213, 245-248 (1987)
STANDARD simple automatic
FEATURES from to/span description
pept 40 195 B800-850 beta subunit (AA 1-51)
pept 210 374 B800-850 alpha subunit (AA 1-54)
BASE COUNT 78 a 156 c 124 g 79 t
ORIGIN
1 gccctagcgc acaccgtcga tttaccattg gagacgcaca tgactgacga tctcaacaaa
61 gtctggccga gcggcctcac cgttgccgaa gccgaagaag ttcataagca actcatcctc
121 ggcacccgcg tcttcggtgg catggctctg ctcgcgcact tcctcgccgc cgctgcgacc
181 ccctggctcg gctgatatga gagactgaca tgaccaacgg caaaatctgg ctcgtggtga
241 aaccgaccgt cggcgttccg ctgttcctca gcgctgccgt catcgcctcc gtcgttatcc
301 acgctgctgt gctgacgacc accacctggc tgcccgccta ctaccaaggc tcggctgcgg
361 tcgcggccga gtaatgctgc gcaagcgcgg gcctgcgggc ccacgccagc cagtccgtga
421 gtccgagcag gccggga
//
LOCUS RSSPETA 316 bp ds-DNA BCT 26-MAY-1990
DEFINITION R.sphaeroides Rieske Fe-S protein cytochrome b (petA) gene, 5' end.
ACCESSION M18577
KEYWORDS cytochrome b.
SOURCE R.sphaeroides (strain GA) DNA.
ORGANISM Rhodobacter sphaeroides
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 316)
AUTHORS Davidson,E. and Daldal,F.
TITLE fbc operon, encoding the Rieske Fe-S protein cytochrome b, and
cytochrome c1 apoproteins previously described from
Rhodopseudomonas sphaeroides, is from Rhodopseudomonas capsulata
JOURNAL J. Mol. Biol. 195, 25-29 (1987)
STANDARD full staff_entry
FEATURES from to/span description
pept 32 > 316 Rieske Fe-S protein cytochrome b (gtg start
codon)
BASE COUNT 54 a 111 c 99 g 52 t
ORIGIN Unreported.
1 ctgcagcggc ccgaggaagg gagaagttct cgtgtccaac gcagaagatc acgcaggcac
61 tcgcagggat ttcctgtatt acgccacggc cggagccggg gcggtggcca ccggggccgc
121 cgtctggccg ctgatcaacc aaatgaatcc gtcggccgac gtgcaggccc tcgcctccat
181 cttcgtcgat gtgagctcgg tcgagccggg tgtccagctg accgtcaagt tcctcggcaa
241 accgatcttc atccgccgcc gcaccgaggc cgacatcgag ctcggccgct ccgtccagct
301 cggccagctg gtcgac
//
LOCUS HUMERCC3A 2751 bp ss-mRNA PRI 26-MAY-1990
DEFINITION Human DNA repair helicase (ERCC3) mRNA, complete cds.
ACCESSION M31899
KEYWORDS Cockayne's syndrome; DNA repair; excision repair; helicase.
SOURCE Human lymphoid cell line K562 cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2751)
AUTHORS Weeda,G., Van Ham,R.C.A., Vermeulen,W., Bootsma,D.,
Van der Eb,A.J. and Hoeijmakers,J.H.J.
TITLE Identification of the molecular defect involving the human repair
disorders xeroderma pigmentosum and Cockayne's syndrome in the
ERCC-3-encoding, a presumed DNA repair helicase
JOURNAL Mol. Cell. Biol. 10, 2570-2581 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G. Weeda, 07-FEB-1990, for release after publication.
FEATURES from to/span description
pept 96 2444 DNA repair helicase /hgml_locus_uid="LF0034Q"
/map="2q21" /nomgen="ERCC3"
BASE COUNT 727 a 668 c 726 g 630 t
ORIGIN
1 gggagcttcc ggattgagcc ggaagtcccc ccagagcgga tgccgcggcg ggcctgtggg
61 agcggggtca tcttctctct gctgctgtag ctgccatggg caaaagagac cgagcggacc
121 gcgacaagaa gaaatccagg aagcggcact atgaggatga agaggatgat gaagaggacg
181 ccccggggaa cgaccctcag gaagcggttc cctcggcggc ggggaagcag gtggatgagt
241 caggcaccaa agtggatgaa tatggagcca aggactacag gctgcaaatg ccgctgaagg
301 acgaccacac ctccaggccc ctctgggtgg ctcccgatgg ccatatcttc ttggaagcct
361 tctctccagt ttacaaatat gcccaagact tcttggtggc tattgcagag ccagtgtgcc
421 gaccaaccca tgtgcatgag tacaaactaa ctgcctactc cttgtatgca gctgtcagcg
481 ttgggctgca aaccagtgac atcaccgagt acctcaggaa gctcagcaag actggagtcc
541 ctgatggaat tatgcagttt attaagttgt gtactgtcag ctatggaaaa gtcaagctgg
601 tcttgaagca caacagatac ttcgttgaaa gttgccaccc tgatgtaatc cagcatcttc
661 tccaggaccc cgtgatccga gaatgccgct taagaaactc tgaaggggag gccactgagc
721 tcatcacaga gactttcaca agcaaatctg ccatttctaa gactgctgaa agcagtggtg
781 ggccctccac ttcccgagtg acagatccac agggtaaatc tgacatcccc atggacctgt
841 ttgacttcta tgagcaaatg gacaaggatg aagaagaaga agaagagaca cagacagtgt
901 cttttgaagt caagcaggaa atgattgagg aactccagaa acgttgcatc cacctggagt
961 accctctgtt ggcagaatat gacttccgga atgattctgt caaccctgat atcaacattg
1021 acctaaagcc cacagctgtc ctcagaccct atcaggagaa gagcttgcga aagatgtttg
1081 gaaacgggcg tgcacgttcg ggggtcattg ttcttccctg cggtgctgga aagtccctgg
1141 ttggtgtgac tgctgcatgc actgtcagaa aacgctgtct ggtgctgggc aactcagctg
1201 tttctgtgga gcagtggaaa gcccagttca agatgtggtc caccattgac gacagccaga
1261 tctgccggtt cacctccgat gccaaggaca agcccatcgg ctgctccgtt gccattagca
1321 cctactccat gctgggccac accaccaaaa ggtcctggga ggccgagcga gtcatggagt
1381 ggctcaagac ccaggagtgg ggcctcatga tcctggatga agtgcacacc ataccagcca
1441 agatgttccg aagggtgctc accatcgtgc aggcccactg taagctgggt ttgactgcga
1501 ccctcgtccg cgaagatgac aaaattgtgg atttaaattt tctgattggg cctaagctct
1561 acgaagccaa ctggatggag ctgcagaata atggctacat cgccaaagtc cagtgtgctg
1621 aggtctggtg ccctatgtct cctgaatttt accgggaata tgtggcaatc aaaaccaaga
1681 aacgaatctt gctgtacacc atgaacccca acaaatttag agcttgccag tttctgatca
1741 agtttcatga aaggaggaat gacaagatta ttgtctttgc tgacaatgtg tttgccctaa
1801 aggaatatgc cattcgactg aacaaaccct atatctacgg acctacgtct cagggggaaa
1861 ggatgcaaat tctccagaat ttcaagcaca accccaaaat taacaccatc ttcatatcca
1921 aggtaggtga cacttcgttt gatctgccgg aagcaaatgt cctcattcag atctcatccc
1981 atggtggctc caggcgtcag gaagcccaaa ggctagggcg ggtgcttcga gctaaaaaag
2041 ggatggttgc agaagagtac aatgcctttt tctactcact ggtatcccag gacacacagg
2101 aaatggctta ctcaaccaag cggcagagat tcttggtaga tcaaggttat agcttcaagg
2161 tgatcacgaa actcgctggc atggaggagg aagacttggc gttttcgaca aaagaagagc
2221 aacagcagct cttacagaaa gtcctggcag ccactgacct ggatgccgag gaggaggtgg
2281 tggctgggga atttggctcc agatccagcc aggcatctcg gcgctttggc accatgagtt
2341 ctatgtctgg ggccgacgac actgtgtaca tggagtacca ctcatcgcgg agcaaggcgc
2401 ccagcaaaca tgtacacccg ctcttcaagc gctttaggaa atgatgctta ggcagggtac
2461 ttcgttcaag accggcgctt ggcacccttg ttggaaaggg attttcagca taacattttc
2521 cttccacctc tttgaccttc cctccagcgt tggccaaatt gtgctgagga agatgcatca
2581 agggcttggc tgtgccttca taggtcatct agggttttat aaaggaggag gagacaatat
2641 tttttcaaac tttttgggga gtggggtcat ttctgtatat aaaaaatgtt aatatttaag
2701 gtgtatttat gttaccgttc tgaataaaca gaatggacca ttgaaccagt a
//
LOCUS BOLREPA 182 bp ds-DNA PLN 26-MAY-1990
DEFINITION B.campestris tandemly repeated DNA.
ACCESSION M30962
KEYWORDS repetitive DNA.
SOURCE B.campestris (strain Var B-85) seedling DNA.
ORGANISM Brassica campestris
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 182)
AUTHORS Das Gupta,J. and Mandal,R.K.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.K.Mandal, 21-DEC-1989. Bose Institute, Dept. Biochemistry,
Centenary Building, P 1/12 CIT Scheme VIIM, Calcutta 700 054 INDIA.
FEATURES from to/span description
BASE COUNT 54 a 39 c 29 g 60 t
ORIGIN
1 aagcttctta catcgtgatt catcctggtt tgattagaat gacaaagaag ctgtccaatt
61 cccaaacagg aaaactggga tcacctgatt tgaaagtggg ttagcttctt catcctaact
121 cctatgagat ttcttcaact tcctagtgat tctccattac tttaagtatc aaaatcaagc
181 tt
//
LOCUS BOLREPB 182 bp ds-DNA PLN 26-MAY-1990
DEFINITION B.juncea tandemly repeated DNA.
ACCESSION M30963
KEYWORDS repetitive DNA.
SOURCE B.juncea (strain Var B-9) DNA.
ORGANISM Brassica juncea
Unclassified.
REFERENCE 1 (bases 1 to 182)
AUTHORS Das Gupta,J. and Mandal,R.K.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.K.Mandal, 21-DEC-1989. Bose Institute, Dept. Biochemistry,
Centenary Building, P 1/12 CIT Scheme VIIM, Calcutta 700 054 INDIA.
FEATURES from to/span description
BASE COUNT 53 a 42 c 30 g 57 t
ORIGIN
1 aagcttctta cagagtcatt tatcctggtt tgattggaac accgaagaag ctgtcctatt
61 cccaaactgg gaaactggaa tcacctgatt agaaagtggg ataacttctt catcccaact
121 cctatgagat ttattcaact tcctggtgat tctccaacac tttatgtatc caaatcaagc
181 tt
//
LOCUS HUMHPV16A1 336 bp ds-DNA PRI 26-MAY-1990
DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H022.
ACCESSION M33610
KEYWORDS insertion site.
SEGMENT 1 of 2
SOURCE Human cervical cancer DNA, clone H022.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 336)
AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T.
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical
cancers: Amplification of viral sequences together with cellular
flanking sequences
JOURNAL J. Virol. 64, 813-821 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 325 326 Human DNA end/HPV-16 DNA start
BASE COUNT 114 a 61 c 56 g 105 t
ORIGIN
1 aggtatataa atggccaagg tagaagatat caaaatgagg tggatttgat ttctcatgtg
61 agactcatag ctaatttaaa tgaaaattta aataagattt atttgacatg attgggaaca
121 attcaattca actttacaaa cactgattaa atgtctacca tctggatggc accgtgctaa
181 gtgagtctcc aaacctgaac tgtgattata aagggcattt ataaactttc cctcaaagat
241 aggacatttg cccatgtaat catgccatct ttaaaagcat cactctaaat tatttaggtg
301 acttctaact ttgcccagta ctctgtccca cagcta
//
LOCUS HUMHPV16A2 1002 bp ds-DNA PRI 26-MAY-1990
DEFINITION Human DNA/HPV-16 insertion site, 3' flank, clone H022.
ACCESSION M33611
KEYWORDS insertion site.
SEGMENT 2 of 2
SOURCE Human cervical cancer DNA, clone H022.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1002)
AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T.
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical
cancers: Amplification of viral sequences together with cellular
flanking sequences
JOURNAL J. Virol. 64, 813-821 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 9 10 HPV-16 DNA end/Human DNA start
BASE COUNT 284 a 190 c 206 g 322 t
ORIGIN About 3187 bp after segment 1.
1 gaagtggaat aaagtgaaag cctcactctt ctctagccta agttttagag tccagtgaag
61 cattgcaagc ataggctttg tagtcagaaa accctgagat caaatcctgg ttctaccact
121 tgctatagcg atcttgggca aggggtcaga tctctctaag cctgtttcct catctgtaag
181 gaagggtatt atatcacata aggttactgt gaggactaaa ttagactaag tatgcaatag
241 gaatacaggg tccagttttc tttggatgta atgggcctgg aaaattcctt aaaatccttt
301 tcacctacaa aatcttatga agttctgcct attttctgct taaaaacttt aaaaaattaa
361 tagaaataaa agagaattct actagagaga taggttgacg ttacttcttc cttgcttttt
421 ccttaaagtg gaatgttaaa aactaggata tgcctggaaa gtgttctatc tacaaaaaag
481 gaagttagca gccgctgaaa agtaactaca gatggctatt cactttactc tgaaagcatt
541 tgctgttgat ataatcacac cacaggaaaa catcataatg ttggctgaaa gaaatctgaa
601 atgacacagc aataatgctt catcatgtag aagttggttt caagtttttt tttttttttc
661 ggtctggata gtgtgattgc aagaagggag gctatgctag cttggttata agcagggaag
721 ttggctgtga ggagataaac agagatctca caggaattct ggggtagaaa tcactggacc
781 ggaactgaag ggctatctcc cagcttctgt ttctgccttt tcattcagtc attccctcgt
841 ttactcaaca gttccctctg ctttggtggc agtttctgct ccttctcaag gctgacttgc
901 acatggctct gacttgctgt ggcctcctct ccatcattct ctgcatcagg tgctttcaac
961 cttgatttta ttgtttatat atacttatga acttttctgc ag
//
LOCUS HUMHPV16B 871 bp ds-DNA PRI 26-MAY-1990
DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H404.
ACCESSION M33612
KEYWORDS insertion site.
SOURCE Human cervical cancer DNA, clone H404.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 871)
AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T.
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical
cancers: Amplification of viral sequences together with cellular
flanking sequences
JOURNAL J. Virol. 64, 813-821 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 861 862 Human DNA end/HPV-16 DNA start
BASE COUNT 291 a 147 c 164 g 269 t
ORIGIN
1 atactctgag taaacaagta aaacatttgg taaaataact ggaaggatat ataccatagt
61 aaatgattct ttttcaaatt ttctattata tagctatata aggtatgaat ctagtagtta
121 ccctcaaatt agggtaaaca atttcctcag cagtttgagc agctcatctc ataatacttt
181 gcaaagatag ccacacaagg gaatgggctg cttgatttga acacaggtgg ggatggatta
241 atagaactgg ggatcaggga acattgggca ggactaataa gaattaggca gtcagaaaaa
301 gatttacaaa aaagactgta taacgagtct aaagataaat tctacctatt taacatttct
361 gcctgagttt ggagaaggca agaaaacatt cttctcttcc tcttacgtac acagacaatt
421 agggaagcca caatgagata atttatgcta tgttagtgag taacacataa ttttccttca
481 cagctgatat aacttgatta ctggagtggc agtggaaggg catggagacc caggccatgg
541 tcacttttct aggtgctcct acgactcaat ttctctcttc tgtcttgatt cctttgggag
601 attcctggat tttagaaaat cagatgagta agttgttatc atctgaaaaa tgccctctta
661 ccacacaatt atctattaga ggaaagttta ggaacagttg gtttaactga gagaaataaa
721 gataatctct atctcccttg cctgctctta ggataagggt tctgagatcc tatataatct
781 tatatcattt aacataaaca caatttctta ctttgcttga aaagttgtat taaagattcc
841 agggtgcagt taaatacact tcacaatata c
//
LOCUS HUMHPV16C1 1130 bp ds-DNA PRI 26-MAY-1990
DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H705.
ACCESSION M33613
KEYWORDS insertion site.
SEGMENT 1 of 2
SOURCE Human cervical cancer DNA, clone H705.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1130)
AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T.
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical
cancers: Amplification of viral sequences together with cellular
flanking sequences
JOURNAL J. Virol. 64, 813-821 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 1120 1121 Human DNA end/HPV-16 DNA start
BASE COUNT 321 a 222 c 244 g 343 t
ORIGIN
1 tgccatcatt aatgcagctg gcacgacagg ttcccgactg aaacggcagt gagcgcaacg
61 caattaatgt gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg
121 ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgaca
181 tgattacaat tcgagctcgg taccagcaca atgaggaatg catgctagca caagtgaaac
241 tcatagatgt ccattgtgct atgcattttt tccttgggcc tgatccattt atccatttac
301 tggtttcctg tctgtaaatt tagaaaagat acaggctctc tgaaaagtaa tttctgtctc
361 ttacaagtga agggttaatc aaccaatcca cataattttc tccagtactg agagatcatt
421 tgttttaata aatgcaaata aggtttctta tagttaaagg taattggctt ttcattgtaa
481 ttcttgatgc tggtcatttt gtgtctgagt tgttcctaat tgctttggtt cagagtctga
541 gaaatgaaat agccccttga ctataactgt aactacaatt ataacaattt atttatttaa
601 atcagcaatc cctgcaaagt catttacagt ttgtttattt cagtatgttt tacaaggtgc
661 aacaaaagca gcctcatcac atagcaaatc tttcttacag gattaaaagt taatgggtaa
721 ggtaagtctg gcataggcat taaagtggaa gcattgtttc ttcttgactg gtcaacttta
781 gagacaactt ttcccattcg aagttatcta tcctctaaaa tatacagaga ttgaggccag
841 gtgggatggc tcacccctgt aatcccagaa ctttgggagg ccaagatgag tggattgctt
901 gagctcagaa gtttgagacc ttggtaacat ggcgggatgc cggtggtgcc atgcctgtgg
961 tcccagcttc ttgggggctg aggtgggagg accttctgag cctggtggca aagttgcagt
1021 aagctgtgtt ggtatcactg cactccagcc tgcactcctg cacaaagcaa gaccctgtct
1081 caaaacaata aataaattaa aatatagaga gactttgcat tgcaaaggca
//
LOCUS HUMHPV16C2 148 bp ds-DNA PRI 26-MAY-1990
DEFINITION Human DNA/HPV-16 insertion site, 3' flank, clone H705.
ACCESSION M33614
KEYWORDS insertion site.
SEGMENT 2 of 2
SOURCE Human cervical cancer DNA, clone H705.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 148)
AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T.
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical
cancers: Amplification of viral sequences together with cellular
flanking sequences
JOURNAL J. Virol. 64, 813-821 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 28 29 HPV-16 DNA end/Human DNA start
BASE COUNT 45 a 24 c 28 g 51 t
ORIGIN About 489 bases after segment 1.
1 attatcacag atggtacaat gggcctactg atgcagtgat aatagtactg agatgtacta
61 ttatcccaca tttagttaag ttaggattga tcctagattc acatgttgtc agtgtgatgc
121 cttaaatatc aagtttccaa ttaagctt
//
LOCUS HUMHPV16D2 510 bp ds-DNA PRI 26-MAY-1990
DEFINITION Human DNA/HPV-16 insertion site, 3 ' flank, clone H901.
ACCESSION M33616
KEYWORDS insertion site.
SEGMENT 2 of 2
SOURCE Human cervical cancer DNA, clone H022.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 510)
AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T.
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical
cancers: Amplification of viral sequences together with cellular
flanking sequences
JOURNAL J. Virol. 64, 813-821 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 10 11 HPV-16 DNA end/Human DNA start
BASE COUNT 140 a 133 c 94 g 143 t
ORIGIN About 3994 bp after segment 1.
1 acattattat ggaaacagat ctgtgagtac caagaaaaga ggataaagat tcatcccatc
61 caccagtcat tcccatgcac ctctacccgc catcccctgt atccaggaca acccccttct
121 gacaccaaaa tgcatttcac cattggctgc tgtcggtaga taatacctgc tcagcatttg
181 ggacaagttc cagacataac ttcctcttag tgaatgatcc tgacaggaga aagaattgag
241 cttaatttat gccatctaat aacctcagtg cagctacttg ggaagttagc cctccagagt
301 ttcccccaaa gttttctcca gtgaattaca gtgccatata ttctcattgc taccagcgct
361 gctcccaaaa tctatctgct gtttaatagt ttttaccttt caaaaatgca agctggctgg
421 gcgtggattt ttgaaagcat tcctcctgcc ttggcctctc aaagtgctgg attagagggt
481 gccttctaat cccagcaatc agcattggaa
//
LOCUS HUMHPV1D1 510 bp ds-DNA PRI 26-MAY-1990
DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H901.
ACCESSION M33615
KEYWORDS insertion site.
SEGMENT 1 of 2
SOURCE Human cervical cancer DNA, clone H022.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 510)
AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T.
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical
cancers: Amplification of viral sequences together with cellular
flanking sequences
JOURNAL J. Virol. 64, 813-821 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
site 500 501 Human DNA end/HPV-16 DNA start
BASE COUNT 159 a 82 c 101 g 168 t
ORIGIN
1 attcgagctc ggtacccaac atctcaaaat tttgttcttc agtctgtaaa atgggatgat
61 aaatctctca ggtttggtgt aagaaaaaaa taatatgctc acctaataga ccttcaatta
121 ctggtagttt ccatcatctt aatgaggatt atatctttat agtgagcacc cattagatgg
181 tgttgataaa tacatcaatg agtattttag gcagaaagca gagtaaagca gaagtactgg
241 cattctttgc tgtactcagt tttattaact gattttatat tgatcacgtt ctttgttaca
301 tgtcagtatt atagtggcag ttgaaggtgg taatattttt agtctccgtt agtgaaatga
361 caggcattga gctctcagtc atacctttgt aggccttcgt tgaggtgaat acctacctct
421 taactagaaa aagatggaga atttcttgct tggaaggaaa ttaatgcaat gtccaggtca
481 tctcctaaaa agcctgaagg aaacaaagta
//
LOCUS HUMMHDQBH 1104 bp ss-mRNA PRI 26-MAY-1990
DEFINITION Human MHC HLA-DQ beta mRNA, complete cds.
ACCESSION M32577
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human (DR4-Dw14), cDNA to mRNA, LS40 homozygous cell line.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1104)
AUTHORS Hilden,J.M., Curtsinger,J.M., Cairns,J.S. and Bach,F.H.
TITLE DQ beta sequences in HLA-DR4 haplotypes
JOURNAL Hum. Immunol. 18, 261-264 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 754 MHC HLA-DQ beta precursor (AA at 2)
/nomgen="LS0098W" /map="6p21.3"
/hgml_locus_uid="HLA-DQB1"
sigp < 1 64 MHC HLA-DQ beta signal peptide (AA at 2)
matp 65 751 MHC HLA-DQ beta
BASE COUNT 231 a 324 c 303 g 246 t
ORIGIN
1 aggccttcgg gtagcaactg tgaccttgat gctggcgatg ctgagcaccc cggtggctga
61 gggcagagac tctcccgagg atttcgtgta ccagtttaag ggcatgtgct acttcaccaa
121 cgggacggag cgcgtgcgtc ttgtgaccag atacatctat aaccgagagg agtacgcacg
181 cttcgacagc gacgtggggg tgtatcgggc ggtgacgccg ctggggccgc ctgccgccga
241 gtactggaac agccagaagg aagtcctgga gaggacccgg gcggagttgg acacggtgtg
301 cagacacaac taccagttgg agctccgcac gaccttgcag cggcgagtgg agcccacagt
361 gaccatctcc ccatccagga cagaggccct caaccaccac aacctgctgg tctgctcagt
421 gacagatttc tatccagccc agatcaaagt ccggtggttt cggaatgacc aggaggagac
481 aactggcgtt gtgtccaccc cccttattag gaacggtgac tggaccttcc agatcctggt
541 gatgctggaa atgactcccc agcgtggaga cgtctacacc tgccacgtgg agcaccccag
601 cctccagaac cccatcatcg tggagtggcg ggctcagtct gaatctgccc agagcaagat
661 gctgagtggc attggaggct tcgtgctggg gctgatcttc ctcgggctgg gccttattat
721 ccatcacagg agtcagaaag ggctcctgca ctgactcctg agactatttt aactgggatt
781 ggttatcact tttctgtaac gcctgcttgt ccctgcccag aattcccagc tgcctgtgtc
841 agcctgtccc cctgagatca gagtcctaca gtggctgtca cgcagccacc aggtcatctc
901 ctttcatccc cacctcgagg ctgatggctg tgaccctgct tcctgcactt acccagagcc
961 tctgcctgtg cacggccagc tgcgtctact gaggccccaa ggggtttctg tttctattct
1021 ctcctcagac tgctcaagag aagcacatga aaaccattac ctgactttag agctttttta
1081 cataattaaa catgatcctg agtt
//
LOCUS HUMMHDR1C 1191 bp ss-mRNA PRI 26-MAY-1990
DEFINITION Human class II HLA-DRB1-BON mRNA, complete cds.
ACCESSION M33600
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human (haplotype DRB1-BON) DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1191)
AUTHORS Coppin,H.L., Avoustin,P., Fabron,J., Huchenq,A., Garnier,J.M.,
Thomsen,M. and De Preval,C.
TITLE Evolution of the HLA-DR1 gene family: Structural and functional
analysis of the new allele "DR-BON"
JOURNAL J. Immunol. 144, 984-989 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 71 871 MHC HLA-DR1-BON precursor
sigp 71 157 MHC HLA-DR1-BON signal peptide
matp 158 868 MHC HLA-DR1-BON
BASE COUNT 258 a 312 c 344 g 277 t
ORIGIN
1 gcccaagtat caagagggag agtgagactt gcctgcttct ctggcccctg gtcctgtcct
61 gttctccagc atggtgtgtc tgaagctccc tggaggctcc tgcatgacag cgctgacagt
121 gacactgatg gtgctgagct ccccactggc tttggctggg gacacccgac cacgtttctt
181 gtggcagctt aagtttgaat gtcatttctt caatgggacg gagcgggtgc ggttgctgga
241 aagatgcatc tataaccaag aggagtccgt gcgcttcgac agcgacgtgg gggagtaccg
301 ggcggtgacg gagctggggc ggcctgatgc cgagtactgg aacagccaga aggacatcct
361 ggaagacgag cgggccgcgg tggacaccta ctgcagacac aactacgggg ttggtgagag
421 cttcacagtg cagcggcgag ttgagcctaa ggtgactgtg tatccttcaa agacccagcc
481 cctgcagcac cacaacctcc tggtctgctc tgtgagtggt ttctatccag gcagcattga
541 agtcaggtgg ttccggaacg gccaggaaga gaaggctggg gtggtgtcca caggcctgat
601 ccagaatgga gattggacct tccagaccct ggtgatgctg gaaacagttc ctcggagtgg
661 agaggtttac acctgccaag tggagcaccc aagtgtgacg agccctctca cagtggaatg
721 gagagcacgg tctgaatctg cacagagcaa gatgctgagt ggagtcgggg gcttcgtgct
781 gggcctgctc ttccttgggg ccgggctgtt catctacttc aggaatcaga aaggacactc
841 tggacttcag ccaacaggat tcctgagctg aaatgcagat gaccacattc aaggaagaac
901 cttctgtccc agctttgcag aatgaaaagc tttcctgctt ggcagttatt cttccacaag
961 agagggcttt ctcaggacct ggttgctact ggttcggcaa ctgcagaaaa tgtcctccct
1021 tgtggcttcc tcagctcctg cccttggcct gaagtcccag cattgatgac agcgcctcat
1081 cttcaacttt tgtgctcccc tttgcctaaa ccgtatggcc tcccgtgcat ctgtacctca
1141 ccctgtacga caaacacatt acattattaa atgtttctca aagatggagt t
//
LOCUS HUMMHDRBBB 1216 bp ss-mRNA PRI 26-MAY-1990
DEFINITION Human MHC class II HLA-DR beta-1 mRNA (DR2.3), 5'end.
ACCESSION M32578
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human type I diabetic (Dw4/LD MN2), cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1216)
AUTHORS Freeman,S.M., Saunders,T.L., Madden,M., Segall,M., Bach,F.H. and
Wu,S.
TITLE Comparison of DR beta-1 alleles from diabetic and normal
individuals
JOURNAL Hum. Immunol. 19, 1-6 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 62 862 MHC HLA-DR beta-1 precursor /nomgen="LV0063D"
/map="6p21.3" /hgml_locus_uid="HLA-DRB1"
sigp 62 148 MHC HLA-DR beta-1 signal peptide
matp 149 859 MHC HLA-DR beta-1
BASE COUNT 265 a 331 c 341 g 279 t
ORIGIN
1 agttctccct gagtgagact tgcctgctcc tctggcccct ggtcctgtcc tgttctccag
61 catggtgtgt ctgaagctcc ctggaggttc ctacatggca gtgctgacag tgacactgat
121 ggtgctgagc tccccactgg ctttggctgg ggacacccga ccatgtttct tgcagcagga
181 taagtatgag tgtcatttct tcaacgggac ggagcgggtg cggttcctgc acagaggcat
241 ctataaccaa caggagaacg tgcgcttcga cagcgacgtg ggggagtacc gggcggtgac
301 ggagctgggg cggcctgacg ctgagtactg gaacagccag aaggacatcc tggagcaggc
361 gcgggccgcg gtggacacct actgcagaca caactacggg gctgtggaga gcttcacagt
421 gcagcggcga gttgagccta aggtgactgt gtatcctgca aggacccaga ccctgcagca
481 ccacaacctc ctggtctgct ctgtgaatgg tttctatcca ggcagcattg aagtcaggtg
541 gttccggaac ggccaggaag agaaggctgg ggtggtgtcc acaggcctga ttcagaatgg
601 agactggacc ttccagattc tggtgatgct ggaaacagtt cctcggagtg gagaggttta
661 cacctgccaa gtggagcacc caagcgtgac gagccctctc acagtggaat ggagagcaca
721 gtctgaatct gcacagagca agatgctgag tggaatcggg ggctttgtgc tgggcctgct
781 cttccttggg gccgggctat tcatctactt caagaatcag aaagggcact ctggacttca
841 cccaacagga ctcgtgagct gaagtgcaga tgaccacatt caagggggaa ccttctgccc
901 cagctttgca tgatgaaaag ctttcctgct tggctcttat tcttccacaa gagaggactt
961 tctcaggccc tggttgctac cggttcagca actctgcaga aaatgtccat ccttgtggct
1021 tcctcagctc ctgcccttgg cctgaagtcc cagcattgat ggcagtgcct catcttcaac
1081 tttagtgctc ccctttacct aaccctacgg cctcccatgc atctgtactc cccctgtgcc
1141 acaaatggac tacgttatta aatttttctg aagcccagag ttaaaaatca tctgtccacc
1201 tggcaccaaa gacaaa
//
LOCUS J05239 162 bp ds-DNA BAD 26-MAY-1990
DEFINITION Figure 1. Sequence of the 166-bp restriction fragment.
ACCESSION J05239
REFERENCE 1 (bases 1 to 162)
AUTHORS Jones,B.K. and Yeung,A.T.
TITLE dna base composition determines the specificity of uvrabc
endonuclease incision of a psoralen cross-link
JOURNAL J. Biol. Chem. 265, 3489-3496 (1990)
STANDARD unannotated staff_entry
COMMENT Bad entry: secondary reference to PNASU 75, 5314-5318 (1978): lac
promoter sequence.
FEATURES from to/span description
BASE COUNT 40 a 43 c 40 g 39 t
ORIGIN
1 cctccgttga gccatctgga tcggcagcgt tgtcttcatc aaccggaacg agcatgccgg
61 agagcagctc actcattagg caccccaggc tttacacttt atgcttccgg ctcgtataat
121 gtgtggaatt gtgagcggat aacaatttca cacaggaaac ag
//
LOCUS MLVENVB 2002 bp ss-RNA VRL 26-MAY-1990
DEFINITION Murine leukemia virus 10A1 derivative env gene, complete cds.
ACCESSION M33470
KEYWORDS envelope protein.
SOURCE Murine leukemia virus 10A1 derivative viral RNA, clone 10A1.
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 2002)
AUTHORS Ott,D., Friedrich,R. and Rein,A.
TITLE Sequence analysis of amphotropic and 10A1 murine leukemia viruses:
Close relationship to mink cell focus-inducing viruses
JOURNAL J. Virol. 64, 757-766 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 65 2002 env protein
BASE COUNT 528 a 553 c 482 g 439 t
ORIGIN
1 ggatccacgc cgctcacgta aaggcggcga caacccctcc ggccggaaca gcatcaggac
61 cgacatggaa ggtccagcgt tctcaaaacc ccttaaagat aagattaacc cgtggaagtc
121 cttaatggtc atgggggtct atttaagagt agggatggca gagagccccc atcaggtctt
181 taatgtaacc tggagagtca ccaacctgat gactgggcgt accgccaatg ccacctccct
241 tttaggaact gtacaagatg ccttcccaag attatatttt gatctatgtg atctggtcgg
301 agaagagtgg gacccttcag accaggaacc atatgtcggg tatggctgca aataccccgg
361 agggagaaag cggacccgga cttttgactt ttacgtgtgc cctgggcata ccgtaaaatc
421 ggggtgtggg gggccaagag agggctactg tggtgaatgg ggttgtgaaa ccaccggaca
481 ggcttactgg aagcccacat catcatggga cctaatctcc cttaagcgcg gtaacacccc
541 ctgggacacg ggatgctcca aaatggcttg tggcccctgc tacgacctct ccaaagtatc
601 caattccttc caaggggcta ctcgaggggg cagatgcaac cctctagtcc tagaattcac
661 tgatgcagga aaaaaggcta attgggacgg gcccaaatcg tggggactga gactgtaccg
721 gacaggaaca gatcctatta ccatgttctc cctgacccgc caggtcctca atatagggcc
781 ccgcatcccc attgggccta atcccgtgat cactggtcaa ctacccccct cccgacccgt
841 gcagatcagg ctccccaggc ctcctcagcc tcctcctaca ggcgcagcct ctatagtccc
901 tgagactgcc ccaccttctc aacaacctgg gacgggagac aggctgctaa acctggtaga
961 aggagcctat caggcgctta acctcaccaa tcccgacaag acccaagaat gttggctgtg
1021 cttagtgtcg ggacctcctt attacgaagg agtagcggtc gtgggcactt ataccaatca
1081 ttctaccgcc ccggccagct gtacggccac ttcccaacat aagcttaccc tatctgaagt
1141 gacaggacag ggcctatgca tgggagcact acctaaaact caccaggcct tatgtaacac
1201 cacccaaagt gccggctcag gatcctacta ccttgcagca cccgctggaa caatgtgggc
1261 ttgtagcact ggattgactc cctgcttgtc caccacgatg ctcaatctaa ccacagacta
1321 ttgtgtatta gttgagctct ggcccagaat aatttaccac tcccccgatt atatgtatgg
1381 tcagcttgaa cagcgtacca aatataagag ggagccagta tcgttgaccc tggcccttct
1441 gctaggagga ttaaccatgg gagggattgc agctggaata gggacgggga ccactgccct
1501 aatcaaaacc cagcagtttg agcagcttca cgccgctatc cagacagacc tcaacgaagt
1561 cgaaaaatca attaccaacc tagaaaagtc actgacctcg ttgtctgaag tagtcctaca
1621 gaaccgaaga ggcctagatt tgctcttcct aaaagaggga ggtctctgcg cagccctaaa
1681 agaagaatgt tgtttttatg cagaccacac gggactagtg agagacagca tggccaaact
1741 aagggaaagg cttaatcaga gacaaaaact atttgagtca ggccaaggtt ggttcgaagg
1801 gcagtttaat agatccccct ggtttaccac cttaatctcc accatcatgg gacctctaat
1861 agtactctta ctgatcttac tctttggacc ctgcattctc aatcgattgg tccaatttgt
1921 taaagacagg atctcagtgg tccaggctct ggttttgact caacaatatc accagctaaa
1981 acctatagag tacgagccat ga
//
LOCUS MLVENVC 2001 bp ss-RNA VRL 26-MAY-1990
DEFINITION Murine leukemia virus env gene, complete cds.
genes.
ACCESSION M33469
KEYWORDS envelope protein.
SOURCE Murine leukemia virus viral RNA, clone 4070A.
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 2001)
AUTHORS Ott,D., Friedrich,R. and Rein,A.
TITLE Sequence analysis of amphotropic and 10A1 murine leukemia viruses:
Close relationship to mink cell focus-inducing viruses
JOURNAL J. Virol. 64, 757-766 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 37 2001 env protein
BASE COUNT 532 a 560 c 472 g 437 t
ORIGIN
1 ggccgacacc cagagtggac catcctctgg acggacatgg cgcgttcaac gctctcaaaa
61 ccccctcaag ataagattaa cccgtggaag cccttaatag tcatgggagt cctgttagga
121 gtagggatgg cagagagccc ccatcaggtc tttaatgtaa cctggagagt caccaacctg
181 atgactgggc gtaccgccaa tgccacctcc ctcctgggaa ctgtacaaga tgccttccca
241 aaattatatt ttgatctatg tgatctggtc ggagaggagt gggacccttc agaccaggaa
301 ccgtatgtcg ggtatggctg caagtacccc gcagggagac agcggacccg gacttttgac
361 ttttacgtgt gccctgggca taccgtaaag tcggggtgtg ggggaccagg agagggctac
421 tgtggtaaat gggggtgtga aaccaccgga caggcttact ggaagcccac atcatcgtgg
481 gacctaatct cccttaagcg cggtaacacc ccctgggaca cgggatgctc taaagttgcc
541 tgtggcccct gctacgacct ctccaaagta tccaattcct tccaaggggc tactcgaggg
601 ggcagatgca accctctagt cctagaattc actgatgcag gaaaaaaggc taactgggac
661 gggcccaaat cgtggggact gagactgtac cggacaggaa cagatcctat taccatgttc
721 tccctgaccc ggcaggtcct taatgtggga ccccgagtcc ccatagggcc caacccagta
781 ttacccgacc aaagactccc ttcctcacca atagagattg taccggctcc acagccacct
841 agccccctca ataccagtta ccccccttcc actaccagta caccctcaac ctcccctaca
901 agtccaagtg tcccacagcc acccccagga actggagata gactactagc tctagtcaaa
961 ggagcctatc aggcgcttaa cctcaccaat cccgacaaga cccaagaatg ttggctgtgc
1021 ttagtgtcgg gacctcctta ttacgaagga gtagcggtcg tgggcactta taccaatcat
1081 tccaccgctc cggccaactg tacggccact tcccaacata agcttaccct atctgaagtg
1141 acaggacagg gcctatgcat gggggcagta cctaaaactc accaggcctt atgtaacacc
1201 acccaaagcg ccggctcagg atcctactac cttgcagcac ccgccggaac aatgtgggct
1261 tgcagcactg gattgactcc ctgcttgtcc accacggtgc tcaatctaac cacagattat
1321 tgtgtattag ttgaactctg gcccagagta atttaccact cccccgatta tatgtatggt
1381 cagcttgaac agcgtaccaa atataaaaga gagccagtat cattgaccct ggcccttcta
1441 ctaggaggat taaccatggg agggattgca gctggaatag ggacggggac cactgcctta
1501 attaaaaccc agcagtttga gcagcttcat gccgctatcc agacagacct caacgaagtc
1561 gaaaagtcaa ttaccaacct agaaaagtca ctgacctcgt tgtctgaagt agtcctacag
1621 aaccgcagag gcctagattt gctattccta aaggagggag gtctctgcgc agccctaaaa
1681 gaagaatgtt gtttttatgc agaccacacg gggctagtga gagacagcat ggccaaatta
1741 agagaaaggc ttaatcagag acaaaaacta tttgagacag gccaaggatg gttcgaaggg
1801 ctgtttaata gatccccctg gtttaccacc ttaatctcca ccatcatggg acctctaata
1861 gtactcttac tgatcttact ctttggacct tgcattctca atcgattggt ccaatttgtt
1921 aaagacagga tctcagtggt ccaggctctg gttttgactc agcaatatca ccagctaaaa
1981 cccatagagt acgagccatg a
//
LOCUS MTYRPVP 6331 bp ss-RNA VRL 26-MAY-1990
DEFINITION Eggplant mosaic virus genome.
ACCESSION J04374
KEYWORDS replicase protein; virion protein.
SOURCE Eggplant mosaic tymovirus viral RNA.
ORGANISM Eggplant mosaic virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Tymovirus.
REFERENCE 1 (bases 1 to 6331)
AUTHORS Osorio-Keese,M.E., Keese,P. and Gibbs,A.
TITLE Nucleotide sequence of the genome of eggplant mosaic tymovirus
JOURNAL Virology 172, 547-554 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 102 2051 overlapping out-of-phase protein
pept 109 5628 replicase protein (putative)
pept 5633 6199 virion protein
BASE COUNT 1337 a 2441 c 987 g 1566 t
ORIGIN
1 gtaatcagaa ccagaactaa ccctgttatc agccttagtt cttttacttt cctgtccaaa
61 tttctgaacc gactagtgcc ttcctagaac ccactacgtc aatgcctcat ggcctttcag
121 tctgctctcg aagctctcaa ctcaactact cacagagatg cttctacaaa tccaattctg
181 aactccgtcg tggaacctct ccgcgactct ctatccctat atccctggct ccttcccaaa
241 gaagccgttc cccaccttct atcctggggc atcccgaact ccggcctcgg agtcactccc
301 cacccccacc caatccacaa aacagtcgag acttttctcc tgttcaatca ctggcatgct
361 ctcgctcgcc tgccttcaac tgtgatgttc atgaaaccgt ccaagtttca aaaacttgcg
421 gctctaaacc caaaattcca agagttgatc aactttcgac tcactgccgc cgacaccact
481 cgctacccct ccacctcact cacttttcca agcaattcaa tttgcttcat gcacgatgct
541 ctgatgtact tttctccagc tcagatcgtc gatctcttca ctcagtctcc cgcactcgag
601 accctgtact gcagtctcat agtgcctcca gagtctcatt tcacagatct ctctctcttc
661 cccgagatct acacttacaa gatctcaggt cagactctcc actacatccc ggagaatcac
721 cactccggct cgtacaatca gcccctccaa gccccatctt ggctgaagat ttcctccatc
781 ctctcgcctt ccctcgcttt gtctgtgacc aagctggaat cttggggccc agtccactcc
841 atattgatcc agcgaggcct accaccaaag ccctctctct ctgcacgccc ccccgtcctg
901 ccaaatcaac ctccccgtgc aacaactccc aactcccaaa accaactgct gcatcagaca
961 agccagctat tcttccaact gcagcagcct caactcagcc tggtctcctt ccgaattcca
1021 gactgcgtag aactgccaca agccaccttt ctgcgccaac ctctccgcca ccggctagtg
1081 ccaacaagcg tttacaacgc tctcttcacc tacactcgcg cagtccgcac tcttcgcact
1141 tccgacccag ccggatttgt gcgaactcaa agcaacaaac ccgagcacgc ttgggtcact
1201 ccaaacgcgt gggacaatct gcagaccttg tctgtcaatg ccccccaccg cccccaagta
1261 tgctaccact tcttctcctc ccccgtggca aggttaaagc tccacttcgc ccaacactgg
1321 cgagcctatc ttttggctct caccccattc cttaccacgt cacctcttct cctcccctta
1381 ttcaatttca acaccccttt ccccctccct cggctacttt ctctgtttcg ccgctcggtg
1441 tcctcaccac ggcttttgca ctcaatccta cccagtcagc tgagaggagc tgcgatcccg
1501 aatcgcccac tcccactctg ggtcacaaaa ctacatcact ttctcgactc ccactccctc
1561 ctccccactc cccccattcg gcccaggata gagcttcagc gcttgccact gatgtctcta
1621 attccgaaac caaaaattgt ccttccccta ctgtccctcc tcctttcctc cccaaccatc
1681 tacatccact tcttccaggc acagaccccc caacaactcc acgacaatta tcaccttcac
1741 cttcatccct ctcgcttcga actttcctgg actctgcagt catatcatgt gactcaagcc
1801 cagtccttcc tccctctcct tctcccagct cccactcaag ctcaagcttc caatcctgca
1861 cctcgccccc ccgctttcca tgctatcccc ctcccccctc agccctcgac ctcctcttct
1921 cctccactcc aggaaccgac cctttccccc cacctgatac acccccccct cacaagagaa
1981 ccatcgccct tgaacggctg cgcctgcgac agtgcgctac tcccttccac agctgcgatg
2041 acgtctgctg aacatcccac tccactcaac ccccccacac ctagcccaac accagacgtc
2101 cctcctcccg actcacccgg taacccatca cttttgaagc aagtccctcc cgaagcgaac
2161 ttgcatccta tccacaaccc agacctcccc tcttccacca ctcttccttc tggggccctg
2221 acactggtcc cagccaaaac tccttccatc tacgccaatc ccaccccccc cagttcccat
2281 ccgttcaccc cactggctga tgaccccact gctgtgggtc cttgcctacc gttccacgtt
2341 ctccacccgg ctgactactt tcctctttca gccgagtttc tcacacggac ccggcatgtc
2401 cccccctctt ctctctcaca tccaaaactc aattgcctac tcacctgctt ttctgaactt
2461 tcaggacact ctgagtcaga tctttggttg tccctgcaat caatacttcc tgactcccaa
2521 ctccaaaatc ctgaagtctc gacacttggc ctgtccactg acattctcac agctctctgc
2581 ttcatctacc attcatctgt gactctccat gccccctcag gagtttatca ctacggcata
2641 gcctcctctt ctaccgtcta tgtcatccac tatcaaccag gccctcctcc tcatttttct
2701 ctctccccta gacttgccgc ttctgctcct cgctgcaacc ccaccaacag cagattggtc
2761 agacaagctc tgcggtttaa attgaacggc gagtttctcc ccttcaccca ggcttacgcg
2821 catgaatctt ccatcaccca tgccaaaaac ctcatctcca acatgaagaa tggttttgat
2881 ggaatcatgt cttctctcac tgactcctct aagggtccct ccccccgtga aaaactgacc
2941 actctcgact ctctcataga tgtcgctgcc cctcgcgaag tttctctcat ccacatcgcc
3001 ggcttcgcag gctgcggcaa gacccacccc atccaaaaac tcctccaaac ttcccctttt
3061 cacgacttcc gaatctcatg ccccactaat gaactccgat ccgaatggaa gcgtgatatg
3121 caaccaacag ctgaaaatgt ttggaggttc tccacatggg aatccagcct gctcaaacat
3181 tccgagatcc tcgtaatcga cgagatttac aagctccctc gtggctacct agatctctcc
3241 atccttgctg atccaactct ctccttggtc atcatccttg gtgaccctct ccaaggagag
3301 tatcactcga cctctcctca cagctccaat cactttcttc caagtgaggt ccaccgcttc
3361 aagtcttaca tcgactgcta ctgtttttgg tcccaccgca ttccaaagca gatagcatcc
3421 ttgttcggcg tagtatgcca caacacgaac gaaggtttcg tgagagccct cacatctcat
3481 ccccccaatt ccaaaaacct caccaatgcg accaacactg ctctcagtct ccaacagatg
3541 ggccaccacg ctatcaccat cagcgccaga agggtcacct tcaccgaggc ccatacaatt
3601 ctgcttgatc gtcataccaa ccttctctcc cccaacaact gtcttgttgc cctcacccgc
3661 agccgcactg gcgtctactt cgtcggcaat ctgcacctgg catcaaacag ctttggcaca
3721 aactacatgt tctctcaagc tctctgccaa ggcacaatcg acctaaacaa cgtgttcccc
3781 cacatcatgc ctcacctccc gaaaatgtat gaacccatcc gctcccggtc caaccgtttt
3841 gtgtctgggt ccctcaattt tcgaccaacc accaattccc gcctcctttc cagtctcact
3901 aagccaaccc acctcccccc tcacatccct accaaccact ccctggatgt cctagtttcc
3961 aaccctgtgc tccttggtga gaccctcgac cctcgattgg aggtcctcca cctcccccca
4021 actcgcctcc cattgcatct ggacctcctg cccacagtac cttcctcttc cagcttctcc
4081 tcagtcgacc atcttttccc aacccccatc tcccccgcta tctgcggcta caccttcgaa
4141 aatttggccg cattcttcct cccagctcat gacccggacc taaaggaggt gctcatcaat
4201 gaccaaaaga gcaaccagtt cccatacttg gacgcccctt ttgagctttc gtgccaaccc
4261 tcctcactgt tggcaccaat tcacaagccg gcctcggatc caacccttct ccctggctcc
4321 atcaagaaac gcctcagatt ccgcgcttct tcctccccat attccatcac tccatctgat
4381 caacttcttg gtcaacacct cttctcttct ttgtgcctgg cttatgggcg caaccccaat
4441 tctgtcctcc ccttccaacc tgagctcttc agtgagtgca tatgcattaa tgattacgct
4501 caactctcct ccaagactca agccaccatc gtggccaatc atcaaaggtc tgatcctgac
4561 tggcgcctaa ctgctgtccg catctttgcc aaggctcaac acaaagtaaa cgacgcttcc
4621 atcttttccg ggtggaaggc ttgccaaact ctagccctga tgcacggtta catcattctc
4681 gtactcggcc cagtcaagaa ataccaacgc atttttgatt ccaaggacag acctccccac
4741 atctactacc actgcggtaa aactccctcc cagctctccc aatggtgcca aactcacctt
4801 tctggctctt cctacatcgc caacgactac actgcctttg atcagtccca acacggcgag
4861 gctgtggtcc tggaatgttt gaagatgcgc cgcctctcca tcccggactc tctcattcag
4921 ctccactccc acctcaagtg ttccgtcgac acccagttcg gccccctcac ctgcatgcgc
4981 ctcactggcg agccgggcac ttatgatgac aactctgact acaacctagc tgtcatctac
5041 tcccaatact ccctcaatgg ccaccccatt ctgatctcag gcgatgactc cgtcctttgc
5101 ggcacaccgc ccccttctcc actttggccc actctcaaga aaatgcttca tctccgtttc
5161 aagatcgaac ggacctccca ccccctcttc tgcgggtatt acgtctcccc tcatggcgct
5221 gcccgcaacc cgtatgctct cttcgccaag ctcatgatct gcgttgatga caagagcctc
5281 catgacaaga agttgtccta tctctctgaa ttctccactg gccatctggc tggcgacctg
5341 gtcacctcca ttctcccttc ccacctactt ccctatcagt ccgccgtgca cgacttcttc
5401 tgccggaatt gcacgcccgc ggaaaaaatt ctcctgtctc tggacccaat ccctgagtcc
5461 aaaatcctcc agctcattct caaagttcgc tgggcttctc aagctttctt ttcctacctg
5521 cctcaaaaag ctcgcgaact ccttgtggca cgctcttctc tcccgtccct ctattccaat
5581 cccaaagtct ctcaactgga gtctgaattg cttcccttct ctcaatagat caatggaaga
5641 cacagcaatc atcagaagcc ctcagccctc cataaacgca ccaggcttcc atctgccacc
5701 caccgactca caacaatcct ctgctattga actccccttc cagtttcagg ccaccacttt
5761 tggcgcgact gaaacagctg ctcaaatcag tctggcctcc gccaacgcta ttaccaagct
5821 cgcgtctctc taccgccatg tgcggctcac gcagtgcgct gccaccatca ctccgacagc
5881 ggccgccatt gccaatcctc tcactgtcaa catcgtctgg gtgtctgaca attccactgc
5941 caagcccacc gagattctca atgtctttgg tggatcttcc tacacgtttg gcggcgccct
6001 caatgccacc aagcccctta ccatccctct ccccatgaac tcggtcaact gtatgctcaa
6061 ggactctgtt ctttacacag attgcccaaa gctcctggcc tactcagctg ctcccagctc
6121 tccctccaaa accccaaccg ccactatcca aatccatggc aagctccgct tgtcctcccc
6181 cctcctccaa gccaattaac tctctctccc tcagccacca cctcgctcct cccccatctc
6241 ctatggtaat tgcggacagt tccgctccct ctagcacaca gaggtccatt tgggtgcgac
6301 tcccccccct cccgtgggtc aacgggaacc a
//
LOCUS RATRGHA 542 bp ds-DNA ROD 26-MAY-1990
DEFINITION Rat growth hormone (rGH) gene, intron B repetitive DNA.
ACCESSION M32696
KEYWORDS repetitive DNA.
SOURCE Rat (strain Sprague-Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 542)
AUTHORS Guitierrez-Hartmann,A., Lieberburg,I., Gardner,D., Baxter,J.D. and
Cathala,G.G.
TITLE Transcription of two classes of rat growth hormone gene-associated
repetitive DNA: Differences in activity and effects of tandem
repeat structure
JOURNAL Nucleic Acids Res. 12, 7153-7173 (1984)
STANDARD simple staff_entry
BASE COUNT 199 a 114 c 118 g 111 t
ORIGIN
1 aacagtaatg acagagaggg ctggagagat ggctcagtgg ttaagagcac ccgactgctc
61 ttccaaaggt cctgagttca attccagcaa ccacatggtg gctcacaacc atctgtaaag
121 agatccgatg ccctcttctg gtgtgtctga agacagctac agtgtactta tataataaac
181 aaataaatct ttaaaaaaaa aaacaaaaac ggggctggag agatggctca gcggttaaga
241 gcgcccgact gctcttccag aggtcatgag ttcaattcca gcaaccacat ggtggctcac
301 aaccatctgt aaagagatct gatgccctct tctggtgtat ctgaagacag ctacagtgta
361 cttatatata ataaataaat aaatctttaa aaaaaaaaca aaacaggggc tggggattta
421 gctcagtggt agagcgctta cctaggaagc gcaaggccct gggttcggtc cccagctccg
481 aaaaaaagaa ccaaaaaaaa aaaaaaaaac caaaacaaaa acaaaacagt aatgacagag
541 ag
//
LOCUS ALRVSRC 1801 bp ss-RNA VRL 26-MAY-1990
DEFINITION Rous sarcoma virus (Schmidt-Ruppin D strain) v-src gene, complete
cds.
ACCESSION M33292
KEYWORDS oncogene; pp60v-src; src gene; tyrosine kinase.
SOURCE Rous sarcoma virus (strain Schmidt-Ruppin D) RNA, clone psrc1.
ORGANISM Rous sarcoma virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Avian sarcoma viruses.
REFERENCE 1 (bases 1 to 1801)
AUTHORS Reddy,S., Mazzu,D., Mahan,D. and Shalloway,D.
TITLE Sequence and functional differences between Schmidt-Ruppin D and
Schmidt-Ruppin A strains of pp60v-src
JOURNAL Unpublished (1990) 406 S. Frear Bldg, University Park, PA 16802
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.I.Shalloway, 26-MAR-1990.
FEATURES from to/span description
pept 65 1645 pp60v-src protein
BASE COUNT 392 a 537 c 551 g 321 t
ORIGIN
1 actctgctgg tggcctcgcg taccactgtg gccaagcggt agctggaacg tgcagccgac
61 caccatgggg agtagcaaga gcaagcctaa ggaccccagc cagcgccggc gcagcctgga
121 gccacccgac agcacccacc acgggggatt cccagcctcg cagaccccca acaagacagc
181 agcccccgac acgcaccgca cccccagccg ctccttcggg accgtggcca ccgagcccaa
241 gctcttcgag gacttcaaca cttctgacac cgttacgtcg ccgcagcgtg ccggggcact
301 ggctggcggc gtcaccactt tcgtggctct ctacgactac gagtcctgga ttgaaacgga
361 cttgtccttc aagaaaggag aacgcctgca gattgtcaac aacacggaag gtaactggtg
421 gctggctcat tccgtgacta caggacagac gggctacatc cccagtaact atgtcgcgcc
481 ctcagactcc atccaggctg aagagtggta ctttgggaag atcactcgtc gggagtccga
541 gcggctgctg ctcaaccccg aaaacccccg gggaaccttc ttggtccggg agagcgagac
601 gacaaaaggt gcctattgcc tctccgtttc tgactttgac aacgccaagg ggctcaatgt
661 gaagcactac aagatccgca agctggacag cggcggcttc tacatcacct cacgcacaca
721 gttcagcagc ctgcagcagc tggtggccta ctactccaaa catgctgatg gcttgtgcca
781 ccgcctgacc aacgtctgcc ccacgtccaa gccccagacc cagggactcg ccaaggacgc
841 gtgggaaatc ccccgggagt cgctgcggct ggaggtgaag ctggggcagg gctgctttgg
901 agaggtctgg atggggacct ggaacggcac caccagagtg gccataaaga ctctgaagcc
961 cggcaccatg tccccggagg ccttcctgca ggaagcccaa gtgatgaaga agctccagca
1021 tgagaagctg gttcaactgt acgcagtcgt gtcggaagag cccatctaca tcgtcattga
1081 gtacatgagc aaggggagcc tcctggattt cctgaaggga gagatgggca agtacctgcg
1141 gctgccacag ctcgttgata tggctgatca gattgcatcc ggcatggcct atgtggagag
1201 gatgaactac gtgcaccgag acctgcgggc ggccaacatc ctggtggggg agaacctggt
1261 gtgcaaggtg gctgactttg ggctggcacg cctcatcgag gacaacgagt acacagcacg
1321 gcaaggtgcc aagttcccca tcaagtggac agcccccgag gcagccctct atggccggtt
1381 caccatcaag tcggatgtct ggtccttcgg catcctgctg actgagctga ccaccaaggg
1441 ccggatgcca tacccaggga tgggcaacgg ggaggtgctg gaccgggtgg agaggggcta
1501 ccgcatgccc tgcccgcccg agtgccccga gtcgctgcat gaccttatgt gccagtgctg
1561 gcggagggac cctgaggagc ggcccacttt tgagtacctg caggcccagc tgctccctgc
1621 ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt taagctacaa caaggcaagg
1681 cttggccgac aattgcatga agaatctgct tagggttagg cgttttgcgc tgcttcgcga
1741 tgtacgggcc agatatacgc gtatctgagg ggactagggt gtgtttaggc gaaaagcggg
1801 g
//
LOCUS AVIH2AA 3800 bp ds-DNA BCT 26-MAY-1990
DEFINITION A.vinelandii H2 uptake hydrogenase (hoxK), complete cds,
and H2 uptake hydrogenase (hoxG), complete cds.
ACCESSION M33152
KEYWORDS H2 uptake hydrogenase.
SOURCE A.vinelandii (strain OP) DNA, clone pALM21.
ORGANISM Azotobacter vinelandii
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Azotobacteraceae.
REFERENCE 1 (bases 1 to 3800)
AUTHORS Menon,A.L., Stultz,L.W., Robson,R.L. and Mortenson,L.E.
TITLE Cloning, nucleotide sequence and characterization of the (NiFe)
hydrogenase structural genes and hoxG from Azotobacter vinelandii
JOURNAL Unpublished (1990) U of Georgia, Dep Biochemistry, Athens, GA 30602
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.L.Robson, 22-MAR-1990.
FEATURES from to/span description
pept 149 1225 H2 uptake hydrogenase (hoxK) precursor
sigp 149 283 H2 uptake hydrogenase signal peptide (put.)
matp 284 1222 H2 uptake hydrogenase
pept 1222 3030 H2 uptake hydrogenase (hoxG)
ORF 3047 3769 ORF3
BASE COUNT 686 a 1318 c 1213 g 583 t
ORIGIN
1 tgtatcaagc catgacaaaa acatggcatt ggcgcattat tcgtgcggtt ttcattcagc
61 aaccgtgggc catacaaccg gcgcgccgtc atagccgaag gacggtgcgc aggggcgccg
121 ataacgacct ggccacaagg gtaacggcat gtctcgactc gaaactttct atgacgtgat
181 gcggcgtcag ggcatcacgc gccgcagctt tctcaaatat tgcagcctga ccgccgcggc
241 cctgggcctc ggcccggcct tcgccccgcg gatcgcccac gcgatggaaa ccaagccgcg
301 cactccggtg ctctggctgc acggcctgga gtgcacctgc tgctccgagt cgttcatccg
361 ttcggcccac ccgctggtca aggacgtggt gctgtcgatg atctcgctgg actacgacga
421 caccctgatg gccgccgccg gccaccaggc cgaggccgcc ctcgaagaga ccatgcgcaa
481 gtacaagggc gagtacatcc tcgccgtgga gggcaacccg ccgctcaacg aggacggcat
541 gttctgcatc gtcggcggca agccgttcat cgagcagctc aggcatgtgg cgaaggacgc
601 caaggcggtg atcgcctggg gcagttgcgc cagttggggc tgcgtgcagg cggcccggcc
661 caacccgacc caggcggtgc cgatccacaa ggtcatcacc gacaagccga tcgtcaaggt
721 gcccggctgc ccgccgatcg ccgaggtgat gaccggggtg atcacctaca tgctgacctt
781 cggcaagctg cccgagctgg accgccaggg gcggccgaag atgttctacg gccagcgcat
841 ccacgacaag tgctaccgcc gcccgcactt cgacgccggc cagttcgtcg agcactggga
901 cgacgagggc gcgcgcaagg gctactgcct gtacaaggtc ggctgcaagg gcccgaccag
961 ctacaacgcc tgctcgacgg tgcgctggaa cgagggcact tccttcccga tccaggccgg
1021 ccacggctgc atcggctgct cggaggacgg tttctgggac aagggctcgt tctatgaacg
1081 cctgaccacc attccgcagt tcggcatcga gaagaacgcc gacgaaatcg gcgccgccgt
1141 cgccggcggg gtcggcgcgg ccatcgccgc gcatgccgcg gtcaccgcca tcaagcgcct
1201 gcagaacaag ggggatcgcc catgagcagc ctgccgaacg ccagccaact ggacaagtcc
1261 ggcaggcgca tcgtcgtcga cccggtgacc cgcatcgagg gccacatgcg ctgcgaggtc
1321 aacgtcgacg ccagcaacgt gatcaccaac gccgtctcca ccggcaccat gtggcgcggc
1381 ctggaggtca tcctcaaggg ccgcgacccg cgcgacgcct gggccttcgt cgagcgcatc
1441 tgcggcgtct gcaccggcac ccatgcgctg acctcggtgc gcgcggtgga ggatgccctg
1501 gacatccgca tcccctacaa cgcccacctg atccgcaacc tgatggacaa gacgctgcag
1561 gtgcacgacc acatcgtgca cttctaccac ctgcacgcgc tggactgggt caacccggtc
1621 aacgccctga aggccgatcc caaggctacc tccgccctgc agcaggcggt ttcgccggcc
1681 catgccaagt ccagccccgg ctacttccgc gacgtgcaga cgcgcctgaa gaagttcgtc
1741 gagagcggcc agctcggcct gttctccaac ggctactggg acaatccggc ctacaagctg
1801 ccgcccgagg cggacctgat ggccgtggcc cactacctgg aggcgctgga cctgcagaag
1861 gacatcgtca agatccatac catcttcggc ggcaagaacc cgcatccgaa ctacatggtc
1921 ggcggcgtgg cctgcgccat caacctggac gacgtcggcg ccgccggcgc gccggtcaac
1981 atgaccagcc tgaacttcgt cctcgaacgc atccacgagg cccgcgagtt caccaggaac
2041 gtctacctgc cggacgtgct ggcggtcgcc gggatctaca aggactggct gtacggcggc
2101 ggtctggccg cgcacaacct gctgtcctac ggcaccttca ccaaggtgcc ctacgacaag
2161 tccagcgacc tgttgccggc cggcgccatc gtcggcggca attgggacga ggtgctgccg
2221 gtcgacgtgc gcgatcccga ggagatccag gagttcgtca gccactcctg gtacagctac
2281 gccgacgaaa ccaaggggct gcatccctgg gacggcgtca ccgagccgaa attcgagctc
2341 ggcccgaaca ccaagggcag ccgcacccac atccaggaaa tcgacgaggc gcacaagtac
2401 agctggatca aggcgccgcg ctggcgcggc cacgctatgg aggtcggccc gctggcacgt
2461 tacatcatcg cctacgcttc gggccgcgaa tacgtgaagg aacaggtcga ccgctcgctg
2521 gccgccttca accagagcac cggcctgaac ctcggcctca agcagttcct gccctcgacc
2581 ctcggccgca ccctggcgcg cgccctggag tgcgagctgg cggtggacag catgctcgac
2641 gactggcagg ccctggtcgg caacatcaag gccggcgacc gcgccaccgc caacgtcgag
2701 aagtgggacc cgagcacctg gccgaaggag gccaagggcg tgggcatcaa cgaggcgccg
2761 cgcggcgccc tgggccactg gatcaggatc aaggacggca agatcgagaa ctaccaggcg
2821 atcgtgccga ccacctggaa cggcaccccg cgcgaccatc tgggcaacat cggcgcctac
2881 gaggccgcgc tgctcaacac caggatggag cgcccggacg agccggtgga gatcctgcgc
2941 accctgcaca gcttcgaccc ctgcctggcc tgttcgaccc acgtgatgtc gccggacggc
3001 caggagctga cccgggtgaa ggtccgctga accggaggat tgcgcgatgg cactggaaaa
3061 atccctggaa accggcgacg gccaggagaa ggtccgcaag cagaccgcgg tgtacgtcta
3121 cgaggcgccg ctgcgcctct ggcactgggt cacggcgctg tccatcgtcg tgctcggcgt
3181 gaccggctac ttcatcggcg cgccgctgcc gacgatgccc ggcgaggcga tggacaacta
3241 cctgatgggc tacatccgct tcgcccactt cgccgccggc tacgtgctgg cgatcggctt
3301 cctcggccgg gtctactggg ccttcgtcgg caaccaccac gcccgcgagc tgttcctcgt
3361 gccggtgcac cgcaaggcct ggtggaagga gctgtggcac gaggtgcgct ggtacctgtt
3421 cctggaaaag accccgaaga agtacatcgg ccacaacccc ctgggccagt tggcgatgtt
3481 ctgcttcttc gtggtcggcg cggtgttcat gagcgtcacc ggcttcgccc tctacgccga
3541 ggggctgggg cgggacagct gggccgaccg gctgttcggc tgggtgatcc cgctgttcgg
3601 ccagagccag gacgtgcaca cctggcacca cctgggcatg tggtacctcg tcgtcttcgt
3661 catggtgcat gtctacctgg ccgtgcgcga agacatcgtt tcccggcagt cgctgatctc
3721 caccatggtc ggcggctggc ggatgttcaa ggacgaccgg ccggattgag ccccgtgtcg
3781 tcccttccgt ccgggccggt
//
LOCUS RABIGHAS 402 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-1.
ACCESSION M29412
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-1.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 402)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept 1 > 402 Ig mu-chain V-D-J precursor
sigp 1 57 Ig mu-chain signal peptide
matp 58 > 402 Ig mu-chain
recomb 339 340 V-region end/D-region start
recomb 355 356 D-region end/J-region start
BASE COUNT 86 a 108 c 118 g 90 t
ORIGIN
1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
121 acagcctctg gattctccct cagtagttac tacatgcaat gggtccgcca ggctccaggg
181 aaggggctgg aatggatcgg aatcattggt agtagtggta gcacatacta cgcgagctgg
241 gtgaagggcc gattcaccat ctccaaaacc tcgaccacgg tggatctgaa aatgaccagt
301 ctgacaaccg aggacacggc cacctatttc tgtgccagag catatattag taatactgat
361 ggttctggct ttaacttgtg gggccaaggc accctggtca cc
//
LOCUS RABIGHAT 399 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-3-1.
ACCESSION M29413
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-3-1.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 399)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept 1 > 399 Ig mu-chain V-D-J precursor
sigp 1 57 Ig mu-chain signal peptide
matp 58 > 399 Ig mu-chain
recomb 342 343 V-region end/D-region start
recomb 361 362 D-region end/J-region start
BASE COUNT 88 a 114 c 111 g 86 t
ORIGIN
1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
61 tcggtggagg agtccggcgg tcgcctggta aagcctgacg aatccctgac actcacctgc
121 acagcctctg gattctccct cagtacctac aacatgatct gggtccgcca ggctccagga
181 aaggggctgg aatacatcgg ccacattagt tttggtggta gcacatacta cgcgagctgg
241 gcgaaaggtc gatgcaccat atccaaaacc tcgaccacgg tggatctgaa aatgaccagt
301 ctgacaaccg aggacacggc cacctatttc tgtgccaggg gatggactcc taaaagtctt
361 tcagccttta acttgtgggg cccaggcacc ctggtcacc
//
LOCUS RABIGHAU 390 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-5.
ACCESSION M29414
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-5.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 390)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept 1 > 390 Ig mu-chain V-D-J precursor
sigp 1 57 Ig mu-chain signal peptide
matp 58 > 390 Ig mu-chain
recomb 339 340 V-region end/D-region start
recomb 368 369 D-region end/J-region start
BASE COUNT 86 a 100 c 119 g 85 t
ORIGIN
1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
61 tcggtggagg agtccgggga tcgcctggtc acgcctggga cacccctgac actcacatgc
121 acagtctctg gattctccct caatagttat gtagtgggct gggtccgcca ggctccagag
181 aagggactgg aatacatcgg aaccatttgg gtcgatggta agacatacta cgcgagctgg
241 acgaagggcc gattcaccat ctctaaaacc tcgaccacgg tggatctgaa aatgaccagt
301 ctgacaaccg aggacacggc cacatatttc tgtgccagat atggtagtag tggtgattta
361 ggcgtgtggg gccaagggac cctggtcacc
//
LOCUS RABIGHAV 351 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 2-1.
ACCESSION M29415
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 2-1.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 351)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept < 1 > 351 Ig mu-chain V-D-J precursor (AA at 1)
sigp < 1 21 Ig mu-chain signal peptide
matp 22 > 351 Ig mu-chain
recomb 306 307 V-region end/D-region start
recomb 339 340 D-region end/J-region start
BASE COUNT 77 a 101 c 100 g 73 t
ORIGIN
1 gtgctcaaag gtgtccagtg tcagtcgctg gaggagtccg ggggtcgcct ggtcacgcct
61 gggacacccc tgacactcac ctgcacagcc tctggattct ccctcagtag ctactggatg
121 acctgggtcc gccaggctcc agggaagggg ctggaatgga tcggaatcat tgttcatggt
181 gatagcgcat actacgcgag ctgggcgaaa ggccgattca ccatctccag aacctcgacc
241 acggtggatc tgaaaatcac cagtccgaca accgaggaca cggccaccta tttctgtgcc
301 agagaatatt atggtactat taacttgtgg ggcccaggca ccctggttac c
//
LOCUS RABIGHAW 408 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-2.
ACCESSION M29416
KEYWORDS diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain;
joining exon; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-2.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 408)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept 1 > 408 Ig gamma-chain V-D-J precursor
sigp 1 57 Ig gamma-chain signal peptide
matp 58 > 408 Ig gamma-chain
recomb 342 343 V-region end/D-region start
recomb 369 370 D-region end/J-region start
BASE COUNT 77 a 112 c 127 g 92 t
ORIGIN
1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
61 tcgctggagg agtccggggg tcgcctggtc acgcctggga catccctgac actcacctgc
121 acagtctctg gattctccct cagtactagt gcaatggcct gggtccgcca ggctccaggg
181 aaggggctgg aatatgtcgg agtcattagt ggaagtggtg gcacatacta cgcgagctgg
241 gcgagcggcc ggttcaccat ttccaaagcc tcgtcgacca cggtggatct gaaaatgacc
301 agtctgacaa ccgaggacac ggccacctat ttctgtgcca gagtcaggga tagtcatggt
361 tatattggtg atgcttttga tccctggggc ccaggcaccc tggtcacc
//
LOCUS RABIGHAX 390 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-3-1.
ACCESSION M29417
KEYWORDS diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain;
joining exon; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-3-1.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 390)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept 1 > 390 Ig gamma-chain V-D-J precursor
sigp 1 57 Ig gamma-chain signal peptide
matp 58 > 390 Ig gamma-chain
recomb 339 340 V-region end/D-region start
recomb 356 357 D-region end/J-region start
BASE COUNT 77 a 110 c 120 g 83 t
ORIGIN
1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
61 tcgctggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
121 acagtctctg gattctccct cagtagtcgc tggatgagct gggtccgcca ggctccaggg
181 gaggggctgg aatccatcgg agccattgat actggtggta gcgcatacta cgcgaactgg
241 gtgaaaggcc gactcaccat ctccaaaacc tcgtcgacca cggtggattt gaaaatgacc
301 agtctgacaa ccgaggacac ggccacctat ttctgtgcca gagattatag tggtggactt
361 gacttgtggg gcacaggcac cctggtcacc
//
LOCUS RABIGHAY 399 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-4.
ACCESSION M29418
KEYWORDS diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain;
joining exon; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-4.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 399)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept 1 > 399 Ig gamma-chain V-D-J precursor
sigp 1 57 Ig gamma-chain signal peptide
matp 58 > 399 Ig gamma-chain
recomb 339 340 V-region end/D-region start
recomb 364 365 D-region end/J-region start
BASE COUNT 83 a 116 c 116 g 84 t
ORIGIN
1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
61 tcgctggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
121 acagcctctg gattcaccat cagtagctac cacatgatct gggtccgcca ggctccaggg
181 gaggggctgg aatacatcgg atggattagt actggtggta gcgcatacta cgcgaactgg
241 gcaaaaggcc gattcaccat ctccagaacc tcgaccacgg tggatctgaa aatgaccagt
301 ctgacaaccg aggacacggc cacctatttc tgttgcagaa ctcctgctgt tagtaaatgg
361 gacttgtggg gcccgggcac cctagtcacc gtctcctca
//
LOCUS RABIGHAZ 384 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 4-1.
ACCESSION M29419
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 4-1.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 384)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept 1 > 384 Ig mu-chain V-D-J precursor
sigp 1 57 Ig mu-chain signal peptide
matp 58 > 384 Ig mu-chain
recomb 339 340 V-region end/D-region start
recomb 365 366 D-region end/J-region start
BASE COUNT 82 a 114 c 113 g 75 t
ORIGIN
1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
121 acagtctctg gaatcgacct cagtggctac cacatgagct gggtccgcca ggctccaggg
181 gaggggctgg aatggatcgg aaccatgagt actactgata acacatatta cgcgagctgg
241 gcaaaaggcc gattcaccat ctccaaaacc tcgaccacgg tggatctgaa aatgaccagt
301 ctgacagccg cggacacggc cacctatttc tgtgccagag gacaggcaac ttttattccc
361 tggggcccag gcaccctggt cacc
//
LOCUS RABIGHBA 393 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 5-2.
ACCESSION M29420
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 5-2.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 393)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept 1 > 393 Ig mu-chain V-D-J precursor
sigp 1 57 Ig mu-chain signal peptide
matp 58 > 393 Ig mu-chain
recomb 339 340 V-region end/D-region start
recomb 364 365 D-region end/J-region start
BASE COUNT 80 a 103 c 118 g 92 t
ORIGIN
1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
121 acagtctctg gaatcgacct cagtagcttt gcaatggcct gggttcgcca ggctccaggg
181 aaggggctgg agtggatcgg aatcattaat ggttatggta ctacatacta cgcgagctgg
241 gtgaatggcc gattcaccat ctccaaaacc tcgacctcgg tggatctgaa aatgaccagt
301 ctgacaaccg aggacacggc cacctatttc tgtgtcagat atcttagtga tggttggtat
361 ctagacttgt ggggccaagg caccctggtc acc
//
LOCUS RABIGHBB 375 bp ss-mRNA MAM 26-MAY-1990
DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 7-2.
ACCESSION M29421
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region.
SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 7-2.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 375)
AUTHORS DiPietro,L.A. and Knight,K.L.
TITLE Restricted utilization of germ-line VH gene and diversity of D
regions in rabbit splenic Ig mRNA
JOURNAL J. Immunol. 144, 1969-1973 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.DiPietro, 25-OCT-1989.
FEATURES from to/span description
pept < 1 > 375 Ig mu-chain V-D-J precursor (AA at 1)
sigp 1 57 Ig mu-chain signal peptide
matp 58 > 375 Ig mu-chain
recomb 339 340 V-region end/D-region start
recomb 361 362 D-region end/J-region start
BASE COUNT 82 a 98 c 112 g 83 t
ORIGIN
1 gtgctcaaag gtgtccagtg tcagtcggtg gaggagtccg ggggtcgcct ggtcacgcct
61 gggacacccc tgacactcac ctgcacagtc tctggattct ccctcaataa ttatgcaatg
121 ggctgggtcc gccaggctcc agggaagggg ctagaatgga tcggaaccat tggtactggt
181 ggtagcgtat actacgcgaa ctgggcaaaa ggccgattca ccatctccag aacctcgacc
241 acggtggatc tgaaaatgac cagtctgaca accgaagaag gacacgccac ctatttctgt
301 gccagagtgg ctggtggtac tgtttttggc tatgtggggt actttaactt gtggggccaa
361 ggcaccctgg tcacc
//
LOCUS PHVARCA 902 bp ss-mRNA PLN 26-MAY-1990
DEFINITION P.vulgaris arcelin 2 mRNA, complete cds.
ACCESSION M28470
KEYWORDS arcelin.
SOURCE P.vulgaris, cDNA to mRNA, clone pARC2-11 and pARC2-191.
ORGANISM Phaseolus vulgaris
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 902)
AUTHORS John,M.E. and Long,C.M.
TITLE Sequence analysis of arcelin 2: A lectin-like plant protein
JOURNAL Gene 86, 171-176 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by M.E.John, 29-SEP-1989.
FEATURES from to/span description
pept 1 798 arcelin 2
BASE COUNT 249 a 282 c 164 g 207 t
ORIGIN
1 atggcttcct ccaacttact caccctagcc ctcttccttg tgcttctcac ccacgcaaac
61 tcaagcaacg acgcctcctt caacgtcgag acgttcaaca aaaccaacct catcctccaa
121 ggcgatgcca ccgtctcatc cgaaggccac ttactactaa ccaatgttaa aggcaacgaa
181 gaggactcta tgggccgcgc cttctactcc gcccccatcc aaatcaatga cagaaccatc
241 gacaacctcg ccagcttctc caccaacttc acattccgta tcaacgctaa gaacaatgaa
301 aattccgcct atggccttgc ctttgctctc gtccccgtcg gctctcggcc caaacttaaa
361 ggccgttatc taggtctttt caacacagcc aactacgacc gcgacgccca tactgtggct
421 gtggtgttcg acaccgtcag caaccgtatt gaaatcgacg tgaactccat ccggcctatc
481 gcaacggagt cttgcaattt cggccacaac aacggagaaa aggccgaggt tcggatcacc
541 tattactccc ccaagaacga cttgagggtt tctctgcttt acccttcttc ggaagaaaag
601 tgccacgtct ctgccacagt gccgctggag aaagaagttg aggactgggt gagcgttggg
661 ttctctgcca cctcagggtc gaaaaaagag accactgaaa cgcacaacgt cctctcttgg
721 tctttttctt ccaacttcat caattttgag ggcaaaaaat ctgaacgttc caacatcctc
781 ctcaacaaga tcctctagac tcccaaagcc agcttcactg tgacagtaaa accttcctta
841 tacgctaata atgttcatct gtcacacaaa ctacaataaa taaaatggga gcaataaata
901 aa
//
LOCUS DROGOALA 2204 bp ss-mRNA INV 26-MAY-1990
DEFINITION Drosophila melanogaster G-o-alpha-like protein, clone lambda-DGo59.
ACCESSION M29731 J05089
KEYWORDS G protein; guanine nucleotide-binding protein.
SOURCE D.melanogaster adult head cDNA to mRNA, clone lambda-DGo59.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2204)
AUTHORS Thambi,N.C., Quan,F., Wolfgang,W.J., Spiegel,A. and Forte,M.
TITLE Immunological and molecular characterization of G-o-alpha-like
proteins in the Drosophila central nervous system
JOURNAL J. Biol. Chem. 264, 18552-18560 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Thambi 04-OCT-1989.
FEATURES from to/span description
pept 166 1230 G-o-alpha-like protein
BASE COUNT 757 a 493 c 459 g 495 t
ORIGIN
1 gaattccgtg ctcggcaagt gcaacgttga aatcgttaaa ctgtacataa gcaaataaga
61 cataaagaaa aaagtccagg aaaattggaa aacaaaagcc cgaaaaccga aaagccccgt
121 gtaaatccga atccgaatcc aaatcagtat ccaaacccaa ccacaatggg ctgcaccaca
181 tccgccgaag aacgcgccgc catccagcga tccaaacaga tcgagaagaa tctaaaggag
241 gatggaatcc aggcggccaa ggacatcaag ctcctgctgc tgggtgccgg tgagtcgggc
301 aagagcacaa tagtcaaaca gatgaaaatc attcacgaga gcggcttcac tgcggaggac
361 tttaaacaat atcgaccggt tgtctacagc aacacaatac aatcattagt tgcaatattg
421 cgcgcgatgc caaccctaag tattcagtac agcaataacg agcgggagag cgatgccaag
481 atggtgttcg acgtatgcca acgcatgcac gacaccgagc ccttctcgga ggagctgctg
541 gccgccatga aacgcctctg gcaggacgcc ggtgtccagg agtgcttctc gcgcagcaac
601 gaataccaac taaacgattc cgcaaaatat ttcctggacg atttggatcg gttaggcgcc
661 aaggattacc agccaactga acaagatatc ttgcgcactc gcgtcaagac cactggcatc
721 gttgaggtac acttctcctt caaaaacctc aactttaaat tgtttgacgt gggcggtcag
781 cgctcggaac gtaagaaatg gatacactgc ttcgaagatg tcacggcgat cattttctgc
841 gtggccatgt ccgagtacga tcaagtcttg catgaggatg aaaccacgaa ccgcatgcaa
901 gagtcgctga aactgtttga ctcgatctgt aacaacaaat ggttcacgga cacctcgatt
961 attctatttc tgaacaagaa ggatttgttc gaggagaaga ttcgcaagag tcccctgacg
1021 atttgcttcc ccgaatacac aggtggacag gagtacggcg aggcggctgc ttacattcag
1081 gctcaatttg aagcgaaaaa caaatcaacc tcaaaagaaa tctactgcca catgacgtgt
1141 gccacagata ccaataacat tcagtttgta ttcgatgctg tcaccgatgt catcatagca
1201 aacaacctgc gcggctgtgg actgtactaa gatggattcc aggccggatc ccgacgatgt
1261 cgacgtccga gtcgatattg atgacgatga cgattatgtg gagcagaatg ggggcgttac
1321 gagggaacac cgtaacggta ttaaagagca gcgcggagca caacaaccca ccagcattga
1381 tcaaaaaacc aaacaattta ggagcagatg atagaaccaa ccaacaaacc aaccgcaaac
1441 cacacagaaa acataggaca ctgaacaagc aaagcccaaa gaacttttat ttgtttaaca
1501 aaaaaacggc ggacggacgg aaatcccgaa tggatgttat agggaaaatg agcgacaagt
1561 acattacata atatcgataa tattgaagca gatgcagatg caaatacaca caatgctaat
1621 gatgatcagg gcgactatga ctaaatgagg cagcaggcaa ctgacactgg gacacgcgat
1681 taaagtcaca tctgaaaaaa ggcagttgat tgaaaggcat ttctatatac aaacatatac
1741 aaacacatac atatgcatta tgcaaagcca catgtacgac atgacactaa cacactcaca
1801 cgacaaacac aagcgccaac attgcataca gttgttgttt ggtctgaata atttttatag
1861 aatttcataa tttatgtgta gtttagtttc ctcatgtatt tattaaaaca aaaaccaaac
1921 gagcgtatat ctacatatac cgcatatata tatatacata cacttctata catatatata
1981 tatatatata catatatata aatattatat attaaatgtt tcctgttgca atctctcttt
2041 aaaattattc atgccatcaa cgctctgcat ttgtcatgct tgtttagact taagttcgaa
2101 agtttcaaca aaatccagcg tcaaaggaaa tatcaatatt catttgattg agtgtcagcg
2161 tgtggtctaa agtaaatata taaaataaca aaccaaaaaa aaaa
//
LOCUS DROGOALB 2558 bp ss-mRNA INV 26-MAY-1990
DEFINITION Drosophila melanogaster G-o-alpha-like protein, clone lambda-DGo21.
ACCESSION M29732 J05089
KEYWORDS G protein; guanine nucleotide-binding protein.
SOURCE D.melanogaster adult head cDNA to mRNA, clone lambda-DGo21.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2558)
AUTHORS Thambi,N.C., Quan,F., Wolfgang,W.J., Spiegel,A. and Forte,M.
TITLE Immunological and molecular characterization of G-o-alpha-like
proteins in the Drosophila central nervous system
JOURNAL J. Biol. Chem. 264, 18552-18560 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Thambi 04-OCT-1989.
FEATURES from to/span description
pept 520 1584 G-o-alpha-like protein
BASE COUNT 867 a 557 c 530 g 604 t
ORIGIN
1 gaattccggt tgcctatttc tctcgcttac ctatttattt agcatacatt ttccaagcat
61 cctgtgaaaa aaccatcaca agttttcctt cgaacggaat gccaagtgca ttctggaagg
121 aaatcgttgt acatctacat aatgccaata aagaaaatgt aactaaagta aaaaaaaaaa
181 aaaagagcta aaccgttaaa ttaaagtttt aaagttaaaa aaacgctgaa taagtgttaa
241 atatatataa caaaaatatt gttgaattga agaaaaccaa agttcaaaaa cctgaaaaaa
301 ccataaagaa gtgattgaaa aatcagttga agtgccgtac tgaaaattaa agtccagtga
361 cacgatcgaa tccctcggat agcggagtta gtttagcccc ccgaattcga gtccccgcac
421 gttgtacacc tggtttttct cgctggcaac gtagtcggcc attgagttgg ccgataccaa
481 acgaccttca aaacgttttg cgtcgaggca atacgcacca tgggctgcgc acagtctgcc
541 gaggagcgag ccgcagccgc caggagtcgc ctcatcgagc gcaatctaaa ggaggatgga
601 atccaggcgg ccaaggacat caagctcctg ctgctgggtg ccggtgagtc gggcaagagc
661 acaatagtca aacagatgaa aatcattcac gagagcggct tcactgcgga ggactttaaa
721 caatatcgac cggttgtcta cagcaacaca atacaatcat tagttgcaat attgcgcgcg
781 atgccaaccc taagtattca gtacagcaat aacgagcggg agagcgatgc caagatggtg
841 ttcgacgtat gccaacgcat gcacgacacc gagcccttct cggaggagct gctggccgcc
901 atgaaacgcc tctggcagga cgccggtgtc caggagtgct tctcgcgcag caacgaatac
961 caactaaacg attccgcaaa atatttcctg gacgatttgg atcggttagg cgccaaggat
1021 taccagccaa ctgaacaaga tatcttgcgc actcgcgtca agaccactgg catcgttgag
1081 gtacacttct ccttcaaaaa cctcaacttt aaattgtttg acgtgggcgg tcagcgctcg
1141 gaacgtaaga aatggataca ctgcttcgaa gatgtcacgg cgatcatttt ctgcgtggcc
1201 atgtccgagt acgatcaagt cttgcatgag gatgaaacca cgaaccgcat gcaagagtcg
1261 ctgaaactgt ttgactcgat ctgtaacaac aaatggttca cggacacctc gattattcta
1321 tttctgaaca agaaggattt gttcgaggag aagattcgca agagtcccct gacgatttgc
1381 ttccccgaat acacaggtgg acaggagtac ggcgaggcgg ctgcttacat tcaggctcaa
1441 tttgaagcga aaaacaaatc aacctcaaaa gaaatctact gccacatgac gtgtgccaca
1501 gataccaata acattcagtt tgtattcgat gctgtcaccg atgtcatcat agcaaacaac
1561 ctgcgcggct gtggactgta ctaagatgga ttccaggccg gatcccgacg atgtcgacgt
1621 ccgagtcgat attgatgacg atgacgatta tgtggagcag aatgggggcg ttacgaggga
1681 acaccgtaac ggtattaaag agcagcgcgg agcacaacaa cccaccagca ttgatcaaaa
1741 aaccaaacaa tttaggagca gatgatagaa ccaaccaaca aaccaaccgc aaaccacaca
1801 gaaaacatag gacactgaac aagcaaagcc caaagaactt ttatttgttt aacaaaaaaa
1861 cggcggacgg acggaaatcc cgaatggatg ttatagggaa aatgagcgac aagtacatta
1921 cataatatcg ataatattga agcagatgca gatgcaaata cacacaatgc taatgatgat
1981 cagggcgact atgactaaat gaggcagcag gcaactgaca ctgggacacg cgattaaagt
2041 cacatctgaa aaaaggcagt tgattgaaag gcatttctat atacaaacat atacaaacac
2101 atacatatgc attatgcaaa gccacatgta cgacatgaca ctaacacact cacacgacaa
2161 acacaagcgc caacattgca tacagttgtt gtttggtctg aataattttt atagaatttc
2221 ataatttatg tgtagtttag tttcctcatg tatttattaa aacaaaaacc aaacgagcgt
2281 atatctacat ataccgcata tatatatata catacacttc tatacatata tatatatata
2341 tatacatata tataaatatt atatattaaa tgtttcctgt tgcaatctct ctttaaaatt
2401 attcatgcca tcaacgctct gcatttgtca tgcttgttta gacttaagtt cgaaagtttc
2461 aacaaaatcc agcgtcaaag gaaatatcaa tattcatttg attgagtgtc agcgtgtggt
2521 ctaaagtaaa tatataaaat aacaaaccaa aaaaaaaa
//
LOCUS MTYCLCGA 6319 bp ss-RNA VRL 26-MAY-1990
DEFINITION Turnip yellow mosaic virus Club Lake isolate, complete genome.
ACCESSION J04373
KEYWORDS complete genome; nucleotide binding protein; replicase;
virion protein.
SOURCE Turnip yellow mosaic virus Club Lake isolate cDNA to viral RNA.
ORGANISM Turnip yellow mosaic virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Tymovirus.
REFERENCE 1 (bases 1 to 6319)
AUTHORS Keese,P., Mackenzie,A. and Gibbs,A.
TITLE Nucleotide sequence of the genome of an Australian isolate of
turnip yellow mosaic tymovirus
JOURNAL Virology 172, 536-546 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Gibbs, 04-AUG-1989.
FEATURES from to/span description
pept 96 5630 replicase polyprotein
pept 89 1975 Unknown protein
pept 5645 6214 virion protein
BASE COUNT 1461 a 2426 c 1061 g 1371 t
ORIGIN
1 gtaatcaact accaattcca gctctctttt gacaactggt cttataccaa ctttccgtac
61 acttgcaacc ctcgtaagac aattgcaaat gagtaatggc cttccaatta gcattggacg
121 cccttgcacc cacgactcac agagatccct ctctgcatcc gattctcgaa tccacagtag
181 attcgattcg ctcctcgata cagacctacc catggtccat tccgaaggaa cttctgcccc
241 tactcaactc ctacggcatc ccaacatctg gtttgggaac atcccaccac ccccacgccg
301 cccacaagac aatcgagact tttctccttt gcacccactg gtctttccag gccaccactc
361 ccagctccgt catgttcatg aaacccagca agttcaacaa acttgcccag gtgaactcaa
421 actttcggga attgaagaac taccgcctgc accccaacga cagcactcgt taccccttca
481 catcaccaga ccttcccgtt ttccccacca ttttcatgca cgacgccctg atgtactacc
541 atccctccca gatcatggac ctgttcttgc agaaaccaaa cctcgaacgt ctgtacgcca
601 gcctcgtagt accacccgag gcccatcttt ccgaccaatc cttcttcccg aagttgtaca
661 cgtacacgac gacccgccac actcttcact acgtcccgga aggtcacgaa gccggcagct
721 acaaccaacc atccgacgcc cactcttggc tccgaatcaa ttcaattcgc ctcggcaacc
781 accacctctc agtgacgatc ctggaatcct ggggccctgt ccactcgctc ctaattcaac
841 gagggacccc cccccccgac ccatcactcc aggccccttc aacacccatg gcgtccgacc
901 tctttcggtc ttaccaagag ccccgcctcg acgtggtctc cttccgaatc ccagacgcca
961 tcgaacttcc acaggccaca ttccttcaac aaccgcttcg agaccgactg gtcccccgag
1021 ccgtctacaa cgccctgttc acctacacca gagcggtccg cacactccgg acttcagacc
1081 cagcggcatt cgtaaggatg cattcctcca aaccggacca cgattgggtc acctcgaacg
1141 cctgggacaa tctgcagacc ttcgcacttc tgaacgtacc ccttcgacca aacgtcgtct
1201 accacgtcct tcagagccca attgcctccc tagctcttta cctgaggcaa cattggcgcc
1261 gtcttaccgc caccgccgtt cccatcctct ccttcctaac cctcctgcag cgcttcctcc
1321 cattgcctat acctctggca gaggtaaaat ccatcacagc cttccgaagg gagctctacc
1381 gaaagaaggc cccccaccac cccctcgacg tcttccatct ccagcaacac ctccgcaatc
1441 accactccgc gatctcggcc gtacgcccag cttccccacc ccaccaaaga cttccacacg
1501 cgctccagaa agctgcattg ctgctcctcc gaccgatatc gcccctcttg acagcgaccc
1561 cgttctttcg gtccgaacag aagtccatgc tcccgaacgc cgaactttca tggaccctga
1621 agcgcttcgc gctgccttgg caggcctccc tagtcctcct ctctctgtcg gaatcatccg
1681 tactgcttca caaactgttc tccccaccaa ctctccaagc ccaacacgac acctaccacc
1741 gacatcttca ccctggatcc tacagtctcc agtgggagag gacgccattg tcgattccga
1801 ggacgacagc atttcttcct ttcactccca cgacttcaac agcccctccg gaccactccg
1861 aagccagtct ccctcccgct ttcgcctcca cctccgttcc ccgtccacct ccagtggcat
1921 cgagccttgg agcccagcct cctacgacta cggcagcgcc cccgacaccg attgaaccca
1981 cccagcgcgc tcatcaaaat tctgacctca cgcttgaaag ttcaacccca attgaacccc
2041 ccccaccccc catccaatcc tccgacatcc cgccttccgc ccccgttctt ttcccagaaa
2101 tcaactcacc gcatcgtttt tcccccaaac ttcccaccac acccgatttc gaacccaccc
2161 gcacttcacc ccctccttcc acttcgcatc aagattcgac tgaccccgcg gaccccctga
2221 tgggctccca ccttctgcac cattcactac ctgcacctcc cacccacccg cttcaatctt
2281 cacagctctt gcccgcacct ttgacaaacg accccaccgc gatcggcccg gtactcccct
2341 ttgaagaact ccacccacgc aggtaccccg aaaacaccgc cactttcctc acgaggctcc
2401 gttcacttcc ttcaaaccat ctaccacaac ccaccctgaa ttgtctcctc tctgctgtct
2461 ccgaccaaac caaggtttcc gaggatcacc tctgggagtc cctacagaca attctcccag
2521 acagccaact caggaacgaa gagatcaact ctctcgggct ttcaactgaa cacctcactg
2581 cgttggccca tctttacaac ttccaggcaa ccatctactc cgatcgtggt cccatcctct
2641 tcggcccatc cgacaccatt aagagaatcg acatcaccca caccaccgga ccgccatccc
2701 acttttcacc cggcaaaaga cttttaggca gccaaccctc agctaagggc catccctccg
2761 actcactcat cagagccatg aagtctttca aagtatccgg caactacctt cccttctctg
2821 aggcccacaa ccatcccacc tccatctcac atgccaagaa cttggtttca aacatgaaga
2881 atggattcga cggcatcctc tcccttctcg acgtctccac aggccaacga accggaccca
2941 cccccaaaga cgcgatcatt cagatagacc actacctcga caccaacccc ggcaaaacca
3001 cccctgtggt gcattttgct ggtttcgctg gctgtggaaa gacatatccg atccaacagc
3061 tccttaaaac taaactgttc aaagactttc gggtctcctg ccccaccaca gaactcagaa
3121 ccgaatggaa gactgcgatg gaacttcatg gctcccagtc atggcgcttt aacacttggg
3181 agtcttccat tctcaagtca tccagaattc tggtcatcga tgaaatctac aaaatgccaa
3241 gagggtacct cgacctttcc attctcgctg accccgccct cgaactcgtc ataattctcg
3301 gtgatcctct ccagggcgag taccactctc aatccaaaga ctcatccaat caccgccttc
3361 cctccgaaac tctcaggctg ctaccataca ttgacatgta ctgctggtgg agttatcgca
3421 ttccccaatg tatcgcccga ctcttccaaa ttcacagctt caatgcctgg cagggaatca
3481 tcggctccgt ttcaactccc caggatcaat cccccgttct caccaacagt catgcctcat
3541 ctctcacctt caacagcctg ggatatcgct cctgcacgat cagctctagc caaggcctca
3601 cattctgcga ccctgccatc atcgtcctgg acaactacac caagtggctc tcctcggcca
3661 acggcctcgt cgccctcacc cgatccagat caggtgtcca attcatgggc ccctcttcct
3721 atgtcggggg aaccaacggc tcttctgcca tgttttctga cgccttcaac aacagcctca
3781 tcatcatgga tcgctacttc ccatccctgt tcccacaact caagctcatc acctcccccc
3841 tcacaactcg cagccccaaa ctcaacgggg ccacccccag cgcatctccc acccatcgct
3901 cgccaaactt ccacctcccc ccacacattc ccctctctta tgatcgtgat ttcgtcacgg
3961 tcaacccaac tctccctgat cagggacccg aaacaagact cgacacccac ttcctcccac
4021 cttctcggct cccgcttcat ttcgatctcc caccagctat cacccccccc ccgatttcca
4081 caagcgtcga cccgccacaa gctaaagcta gccccgtcta tccaggcgag ttcttcgatt
4141 ctctggcggc gttcttctta ccagcacacg acccatcaac aagggaagta ctccacaaag
4201 atcaatctag caaccagttc ccttggttcg accgaccctt cagcttgtcc tgccagccct
4261 caagtttaat ttctgccaag catgcaccca accacgatcc gacccttctg cctgcctcca
4321 tcaataaacg cttgcgattc agacccagtg aagcaccgca ccaaatcacc gcagacgacg
4381 tggtcctagg cctgcaactc ttccactctc tctgccgcgc ctactcacgt caacccaaca
4441 tcaccgttcc attcaaccct gaacttttcg cagaatgtat ctctctgaat gaatacgcgc
4501 agctcagttc caaaacccaa tccaccatag tggccaacgc ttcacgctcc gacccagact
4561 ggcgacacac caccgtcaag atttttgcga aagctcaaca caaagtcaac gacggctcca
4621 tcttcggttc atggaaggcc tgccaaactc tcgcactcat gcatgattac gtaattctgg
4681 ttcttggacc cgtcaagaaa tatcaaagaa tcttcgacaa cgttgatcgg ccatctcaca
4741 tctactcaca ctgcggcaag acacccaacc aacttcgaga ttggtgccag gaacatctca
4801 ctcattccac cccaaaaatc gcaaacgact acaccgcctt cgaccaatcc cagcatggag
4861 aatccgtggt tcttgaagcc ctcaaaatga agagactgaa cattccgagc catttgattc
4921 agctccatgt ccacctcaag accaacgtct ccacccagtt cggccccctc acatgcatgc
4981 gcctgaccgg ggaacccgga acctacgacg acaacactga ctacaacctc gcagtcatct
5041 actctcagta tgacgttggt tcctgcccca tcatggtctc tggcgacgac tcactcatag
5101 accaccctct tcccactcgc cacgactggc cctctgttct caaacgcctc cacctccgct
5161 ttaaacttga actcacttct catcccctct tttgtggcta ctacgtcggt ccagcaggct
5221 gcatccgcaa ccccttggcc cttttctgca agctcatgat cgcagtggac gatgacgccc
5281 tcgacgaccg acgactcagc tacctcaccg agttcaccac cggacacctc cttggcgaat
5341 cactatggca cctcctccct gaaacccacg tccagtatca gtcagcttgc tttgacttct
5401 tctgcagacg ttgcccaaaa cacgagaaga tgctcctcga tgattccaca cccacactca
5461 gcctcctcga acgaatcact tcttcaccga ggtggctcac caagaacgcc atgtacctcc
5521 tccccgccaa gctcagactg gctatcacct ctctgtctca aacgcaatct ttcccagaat
5581 ccattgaggt ttcccacgct gagtctgaat tgcttcacta tgtccaatag caatcagccc
5641 cgacatggaa atcgacaaag aactcgcccc ccaagaccgc accgtcaccg tcgccaccgt
5701 tttaccgact gtccccggcc cctcaccttt caccatcaaa caaccgttcc agtctgaagt
5761 tctgtttgct gggaccaaag atgccgaggc ctctctcacc atcgccaaca tcgacagcgt
5821 ttccaccctc accaccttct atcgtcatgc ctctctggaa tcactctggg tcaccatcca
5881 tcctaccttg caagccccag ctttcccgac cacggttggc gtttgctggg tacccgccaa
5941 ctccccagtc actcccaccc aaatcaccaa gacctacggc ggccagatct tctgcattgg
6001 aggcgccatc aacactctct cacccctcat tgtcaagtgc ccacttgaaa tgatgaaccc
6061 ccgggtcaaa gattcaattc aataccttga ctcgcccaaa ctcctcatct ccatcaccgc
6121 tcaacccacc gctccccccg catcgacctg cataataact gtatcaggaa ctctctcgat
6181 gcattctccg ctcatcacgg acacttccac ctaagttctc gatctttaaa atcgttagct
6241 cgccagttag cgaggtctgt ccccacacga cagataatcg ggtgcaactc ccgccccttt
6301 tccgagggtc atcggaacc
//
LOCUS RATTH2BAA 181 bp ds-DNA ROD 26-MAY-1990
DEFINITION Rat TH2B gene promoter region.
ACCESSION M33578
KEYWORDS H2B histone; histone; transcription regulatory element.
SOURCE Rat DNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 181)
AUTHORS Hwang,I., Lim,K. and Chae,C.-B.
TITLE Characterization of the S-phase-specific transcription regulatory
elements in a DNA replication-independent testis-specific H2B
(TH2B) histone gene
JOURNAL Mol. Cell. Biol. 10, 585-592 (1990)
STANDARD simpl staff_entry
FEATURES from to/span description
mRNA 161 > 181 H2B histone mRNA
signal 61 68 octamer signal
signal 110 115 hexamer signal
BASE COUNT 45 a 47 c 32 g 57 t
ORIGIN
1 acctgattgg ctgattggtg atgaattaac caatcagaaa gcaccacttg aattcccctt
61 atttgcatac aaggaacatt tattgtccaa tcatctttcg cgtgctcata cgtcatccaa
121 ggcccacgcc tataaatacc tctcttcttg gccttcaagc ggtgtgtttt ctcagcagtt
181 g
//
LOCUS TCVDIGAA 347 bp ss-RNA VRL 26-MAY-1990
DEFINITION Turnip crinkle virus defective interfering RNA.
ACCESSION M29290
KEYWORDS defective interfering RNA.
SOURCE Turnip crinkle virus cDNA to RNA.
ORGANISM Turnip crinkle virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Tombusvirus.
REFERENCE 1 (bases 1 to 347)
AUTHORS Li,X.H., Heaton,L.A., Morris,T.J. and Simon,A.E.
TITLE turnip crinkle virus defective interfering rnas intensify viral
symptoms and are generated de novo
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9173-9177 (1989)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
A.E.Simon, 20-OCT-1989.
FEATURES from to/span description
RNA 1 347 defective interfering RNA
BASE COUNT 90 a 98 c 86 g 73 t
ORIGIN
1 gggataaaaa aggaggctta ccaaccttct ctctattcac gatgcctctt ctacacacac
61 tcaaaacagc gctcgcagtg ggactccttg gagccaggta ctaccccgaa ggttcaaaac
121 caagaccccc aagtcgcttt actttgagat gtgttagaaa gccccaaggt cattttactt
181 tgacctgtgt tagagaccca aaacggtggc agcactgtct agctgcgggc attagactgg
241 aaaactagtg ctctctgggt aaccactaaa atcccgaaag ggtgggctag tggcgaccct
301 ccgaactaaa agacagcctc cctcctcgcg gggggggggg cctgccc
//GenBank-Updates@genbank.bio.net (05/30/90)
LOCUS FSBISO 676 bp ss-mRNA VRT 30-MAY-1990
DEFINITION C.commersoni isotocin/neurophysin mRNA, complete cds.
ACCESSION X16621 M25145
KEYWORDS isotocin; neurophysin.
SOURCE C.commersoni hypothalamic region, cDNA to mRNA, clones pCIT-1,
M132.1 and M135.1.
ORGANISM Catostomus commersoni
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes;
Actinopterygii; Cypriniformes; Cyprinoidei; Catostomidae.
REFERENCE 1 (bases 1 to 676)
AUTHORS Figueroa,J., Morley,S.D., Heierhorst,J., Krentler,C.,
Lederis,K. and Richter,D.
TITLE Two isotocin genes are present in the white sucker Catostomus
commersoni both lacking introns in their protein coding regions
JOURNAL EMBO J. 8, 2873-2877 (1989)
STANDARD full automatic
REFERENCE 2 (bases 1 to 676)
AUTHORS Heierhorst,J., Morley,S.D., Figueroa,J., Krentler,C.,
Lederis,K. and Richter,D.
TITLE Vasotocin and isotocin precursors from the white sucker, Catostomus
commersoni: Cloning and sequence analysis of the cDNAs
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 5242-5246 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.D.Morely, 31-MAY-1989.
FEATURES from to/span description
pept 43 507 isotocin-1 precursor
sigp 43 102 isotocin-1 signal peptide
matp 103 129 isotocin-1
matp 139 504 neurophysin (put.)
mRNA < 1 676 isotocin mRNA
signal 573 578 poly-A signal
signal 657 662 poly-A signal
BASE COUNT 147 a 169 c 176 g 184 t
ORIGIN
1 ctgaatcgtt tcactacctg tcaatttcag cctttgcaaa atatgtctgg aagcatgttc
61 tcagtgtttt ctctgctgta tctgctgtct gtctgctcag cctgttacat ctccaactgc
121 cccatcggtg gtaaacgagc catacaggat tcaccctctc gacagtgtat gtcttgcggt
181 ccgggggatc gcggccggtg ttttggtccc agtatctgct gtggtgaggg tctcggctgt
241 ctgctgggct ctccggagac tcagcgatgt ctggaggagg actttctgcc ctccccgtgt
301 gaggccggtg ggaaagtgtg cggatatgag ggccgctgtg ctgctccagg tgtctgctgt
361 gactcagagg gctgcagcgt tgatcagtcg tgtgttgatg gtgacggtga cgctacagct
421 gtcagtcaac ctgccagcag ccaagattta ctgctgaaac tactgcacct gtcaaacccc
481 gcccacccat acagactcca ccaatgacat cagagtgaca tccgctcagg agaccaccag
541 ccaatcagaa ttattcttga agaccttttg taaataaagt ttgtatagcc gatagacatg
601 tacatggctc atacagatat tagtagaact gtattgatag tgtttactgt gcagctatta
661 aagtcagaga gaacgt
//
LOCUS HUMTCBYAL 416 bp ss-mRNA PRI 30-MAY-1990
DEFINITION Human T-cell receptor beta-chain mRNA V9-D-J2.3-C2 region, partial
cds.
ACCESSION M33240
KEYWORDS T-cell receptor beta-chain; constant region; diversity exon;
joining exon; pseudogene; variable region.
SOURCE Human (individual VW) T-cell line VW114, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 416)
AUTHORS Hansen,T. and Lundin,K.
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Hansen, 22-MAR-1990. Institute of Transplantation Immunology,
The National Hospital, N-0027 Oslo 1 Norway
FEATURES from to/span description
pept.ps < 1 > 416 T-cell receptor beta-chain V-D-J-C region
precursor /nomgen="TCRB" /map="7q35"
/hgml_locus_uid="LP0151Q" (AA at 1)
sigp.ps < 1 35 T-cell receptor beta-chain signal peptide
matp.ps 36 > 416 T-cell receptor beta-chain
recomb 317 318 V-region DNA end/D-region DNA start
recomb 330 331 D-region DNA end/J-region DNA start
recomb 375 376 J-region DNA end/C-region DNA start
BASE COUNT 121 a 105 c 87 g 103 t
ORIGIN
1 atgtggccct ctgcctcctg caagcagatc cactggacac agccgtttcc cagactccaa
61 aatacctggt cacacagacg ggaaaaaagg agtctcttaa atgagaacaa aatctgggcc
121 ataatgctat gtattggtat aaacaggact ctaagaaatt tctgaagaca atgtttatct
181 acagtaacaa ggagccaatt ttaaatgaaa cagttccaaa tcgcttctca cctgactctc
241 cagacaaagt tcatttaaat cttcacatca attccctgga gcttggtgac tctgctgtgt
301 atttctgtgc cagcagccaa gggcaattgc acagatacgc agtattttgg cccaggcacc
361 cggctgacag tgctcgagga cctgaaaaac gtgttcccac ccgaggtcgc tgtgtt
//
LOCUS RATODCAB 2153 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Rat ornithine decarboxylase (ODC) mRNA, complete cds.
ACCESSION J04791
KEYWORDS ornithine decarboxylase.
SOURCE Rat (strain Sprague-Dawley) testis, cDNA to mRNA, clone pODC2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2153)
AUTHORS Wen,L., Huang,J.-K. and Blackshear,P.J.
TITLE Rat ornithine decarboxylase gene: Nucleotide sequence, potential
regulatory elements, and comparison to the mouse gene
JOURNAL J. Biol. Chem. 264, 9016-9021 (1989)
STANDARD full staff_review
COMMENT Draft entry for [1] kindly submitted by L.Wen, 28-MAY-1989, for
release after publication.
FEATURES from to/span description
pept 160 1545 ornithine decarboxylase (EC 4.1.1.17)
ORF 39 71 ORF
signal 1855 1860 poly-A signal
BASE COUNT 541 a 460 c 565 g 587 t
ORIGIN
1 cggccgggtg cgccttgggg tttagtggcg gcttctccat gggtccagcc agccgcttcc
61 ctgtgctgtg aggagacagc attcagagtt gaccttgtga gagctggcca taatttaatt
121 ccatctctag gttttctgta ggcacatcga gaaccaacca tgggcagctt tactaaggaa
181 gagtttgact gccatatcct cgatgaaggt ttcactgcta aggacattct ggaccaaaaa
241 atcaatgaag tttcttcctc tgatgataag gatgctttct atgttgcgga cctcggagac
301 gttctaaaga agcatctgag gtggctgaaa gctcttcccc gtgttactcc cttctatgct
361 gtcaagtgta atgacagcag agccatagtg agcaccctgg ctgccattgg gacaggattt
421 gattgtgcaa gcaagactga aatacagttg gtgcaggggc ttggggtgcc tccagagagg
481 attatctatg caaatccttg taagcaagtg tctcagatca agtatgctgc cagtaatgga
541 gtccagatga tgacttttga cagtgaaatt gagttgatga aagttgccag agcacatcca
601 aaggcaaagt tggttttgcg gattgccact gatgattcca aagcagtttg tcggctcagt
661 gttaagtttg gtgccacact gaaaaccagc aggcttctct tggaacgggc aaaagagcta
721 aatattgatg tcattggtgt cagcttccat gtgggcagtg ggtgtactga ccctgagacc
781 ttcgtgcagg cagtgtcaga tgcccgctgt gtctttgaca tgggaacaga agttggtttc
841 agcatgtatc tgcttgacat tggtggtggc tttcctgggt ctgaagacac gaagcttaaa
901 tttgaggaga tcaccagtgt aatcaaccca gctctggaca agtacttccc atcggactct
961 ggagtgagaa tcatagctga gccaggcaga tactacgtcg catcagcttt cacacttgca
1021 gtgaatatca ttgccaaaaa aaccgtgtgg aaggagcaga ccggctcgga cgatgaagat
1081 gagtcaaacg agcaaacttt gatgtattac gtgaatgatg gagtgtatgg gtcatttaac
1141 tgcattcttt atgaccatgc acatgtgaag gccctgctgc agaagagacc caagccagat
1201 gagaagtatt actcatccag catctgggga ccaacatgtg atggccttga tcggatcgtc
1261 gagcgctgta gcctgcctga aatgcatgtg ggtgattgga tgctgtttga gaacatgggt
1321 gcatacactg ttgctgctgc ttctactttc aatgggttcc agaggccaaa catctactac
1381 gtaatgtcac ggtcaatgtg gcaactcatg aagcaaatcc agagccatgg cttcccgcca
1441 gaagtggagg agcaggatgt tggcactctg cccatgtctt gtgcccagga gagcgggatg
1501 gaccgtcacc ctgcagcctg tgcttctgct agtatcaatg tatagatgcc attcttgtag
1561 ctcttacctg caagtttagc ttgagttcac ggcatttggg gggaccattt aacttaatta
1621 ctgctagttt ggaatgtctt tgtaagagta gggttggcac caatgcagta tggaaagact
1681 aggagatggg ggtcacactt actgtgttcc tatggaaact ttgaatattt tatatggatt
1741 tttattcact tttcagacct gatactaatg agtgcccctc ggctgctgag caagcatttg
1801 tagcttgtac attggcagaa tgggctaaaa gcttatgttg tgacccattt tgaaaataaa
1861 gtatcttgaa atgattggac attggagaat gtgtgcaagt atcccttaca gaaggcacaa
1921 acttctgcac aggctgtgtg ttacagcagt gagtctagcc cagcagagat gtggatgata
1981 caaagctgtg ccccctctgt acagcatcaa tgtgcttagc ccatctcaag tgtttactgt
2041 gaacttggtg cccaaagtct cttaagagtg tcatctgcct agtggcctct tgacttggcc
2101 acttcctaag gagagggcat ctgaggctct ttgaaccttg cctgcagaaa ccc
//
LOCUS BPOORF2 3135 bp ds-DNA BCT 30-MAY-1990
DEFINITION B.polymyxa beta-1,4-endoglucanase gene, complete cds, and ORF2,
complete cds.
ACCESSION M33791 M33840
KEYWORDS beta-1,4-endoglucanase.
SOURCE B.polymyxa DNA.
ORGANISM Bacillus polymyxa
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 3135)
AUTHORS Baird,S.D., Johnson,D.A. and Seligy,V.L.
TITLE Molecular cloning, expression, and characterization of endo-beta-
1,4-glucanase genes from Bacillus polymyxa and Bacillus circulans
JOURNAL J. Bacteriol. 172, 1576-1586 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1447 2640 beta-1,4-endoglucanase
ORF 1072 509 (c) ORF2
BASE COUNT 848 a 689 c 676 g 922 t
ORIGIN
1 gaattcataa aaaacgtttt caaaaagatg aaaaggactt attcaaacat tcagttcagc
61 tttaggaccg attctctgtg gttgatgacc ggattcatac ggatgatttc tgacagctcc
121 tcatgatcct gttctttttg atcaattttc tccagcagta gcctggcact tcttttcccc
181 atttcaaaac tcggctggga aacggttgtg atcggcggat tataaaaaga cgcaaaggat
241 acatcatcta tgccaatcaa ggaaaagtca ttaggtatgg tcagatgatt attttttgca
301 taaatcagca cctgctccaa aatcatgtca ttcgtagcaa taattgcagt gggcggattc
361 tccatgtgaa atagttgtcc aatcagatta ggcatttcat ccaacctgcc gcttttgata
421 tagttttcat tcacgggtat gtcatgttta cccagcgtat ttcggtaacc tgacagtcgc
481 tcacttcgtg tggtgatcgc ttttttgcct aaggggaacg tgataatccc aatgttgtta
541 tgaccatgct caatgagtgc ttcaacgcaa gctcgctggc cttttcatta ttcagtagca
601 ccgtatctac gctgagttca tcgatttttc ggtccacaaa cacgaaggga tagccatttt
661 tgacaaggga aacatataat ttcttgttct cttccgtcgg gaaaatgata agtccatcca
721 cttgccgagc aatcatcgac tggacatact ttttctcttt gagcgaatca tcatcggtat
781 tgcagacaat gacctggaca ttttccagtt ggcattcatc ttcaatggca cgcacgactt
841 ctgttgtaaa tctcgataaa atggttgaag aaataacccc cacaatgaag gtcttcttct
901 gcttgagact acgtgccacc tcattgggta tataactgag atcctcaatc gcttgctcga
961 tcttcttctt ggtgttttca ctcatatagt tgtatctctt ttgcagaaac tgcgaaactg
1021 tactcttcga caccccggca cgttctgcca cttccttcat cgttgttacc ataattaccc
1081 tcctttcttt tttgcatgtc atgcgtgcta attttcaaat atatattctg caacctgtat
1141 gtaaagtgaa tttcacttta ttatatacct tcttttacag caggaaaata taataaataa
1201 agagttagag cgtgcctcta ctcatataca agcgcggcca aaaatatggg gtggaaactc
1261 cctcacgaat cttttacaag gggaactttc ctacttagct tgataatccg ttgttataaa
1321 tctattgatt tcgaacgaaa tcttagaaat atacgaattt tatgaacttg tttgatggaa
1381 gatttgaaca taaaatgaaa atgtaatcgg ttacaaattc ttttaaataa aggaggaagc
1441 agaattatga agaaaaaagg attaaaaaaa acatttttcg tcattgcctc cctcgtaatg
1501 ggcttcacac tgtatggcta tacacccgtt tctgcagatg cagccagtgt gaaaggatat
1561 tatcacaccc aaggaaacaa gattgtagac gaatccggga aagaagcggc atttaacggc
1621 ctgaactggt tcggtctgga aactcctaat tacaccttgc atggactgtg gagccgctca
1681 atggacgaca tgctggatca ggtgaagaaa gaaggctaca atctgattcg tctgccttac
1741 agcaatcagt tgttcgattc cagttcccgt ccagacagta ttgattatca caaaaaccct
1801 gatctggtcg gattaaaccc gattcaaatt atggacaagc tgatcgaaaa agctggacaa
1861 cgcggtattc agattatcct tgaccgtcac cgtccaggct caggtgggca atccgagctg
1921 tggtacacat cccagtaccc tgagtctcgc tggattagtg actggaaaat gttggctgat
1981 cgttataaaa ataaccccac cgtcattggt gcggatttgc acaacgagcc acacggtcaa
2041 gcaagctggg gtacaggcaa tgcctccaca gactggcgtc tggcggcaca acgtgcaggg
2101 aatgcgattc tgtccgtgaa tccgaattgg ctgattctcg tagaaggtgt agaccacaat
2161 gtacaaggca acaatagcca atactggtgg ggtggcaacc tgacaggtgt agccaactat
2221 cctgtcgttc tggacgtacc gaaccgtgtc gtatattctc cacacgatta cggccccggt
2281 gtgtcttcgc agccatggtt caacgacccg gccttcccgt ccaacctgcc agcgatctgg
2341 gatcaaacct ggggctacat cagcaaacaa aacatagctc cggtgctggt tggtgaattc
2401 ggcggccgta atgttgattt gtcctgccct gaggggaaat ggcaaaatgc gcttgttcac
2461 tatattggtg ccaacaacct gtactttacg tactggtccc tgaatccgaa tagcggcgac
2521 acaggcggtc tgctgctgga tgactggact acctggaatc gtccgaagca agatatgctg
2581 ggtcgaatta tgaagcctgt tgtttccgta gcccagcaag cggaagcagc agccgaatag
2641 gcacaggcct tcaacttcac ttaattaaaa aagccgattt tcctccttca ggccagatta
2701 ctgctgggct atgaagatgt ggagaaacgg ctttttttgt tcttccagtt ttacacattg
2761 taaattcccc attgaatggg tcattttgta aactaaatct ctctttcaac cgataaatac
2821 atatgttgga ttttaattta gaaaggaagt attcatgtga ttatttggaa gggttttggt
2881 attttaaata ttattattcc agggatttta tttgttattg tcggtagttt agtatccgct
2941 ctagggcttg actccataga ttcccggctg ccaatggctt ttgtgttcat tgtgtcggga
3001 gttattatct ggtatttggg gaaagcactc aactccgact caaagtgcta gttgatatgg
3061 aaacgggaca acgctatcga atgggaaccc agcacagtct gttttttatc cctatgcatt
3121 attggggccc tgcag
//
LOCUS SHPMHDQB 3123 bp ds-DNA MAM 30-MAY-1990
DEFINITION Ovine MHC class II OLA-DQB beta-chain gene, partial cds.
ACCESSION M33307
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Ovine (strain Merino) DNA, clone B10-1, library Ch.28.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 3123)
AUTHORS Scott,P.C. and Brandon,M.R.
TITLE Nucleotide sequence of ovine MHC class II beta genes: DQB and DRB
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.C.Scott 28-MAR-1990. Univ of Melbourne, Veterinary Preclinical
Centre, Parkville, Victoria, AUSTRALIA 3052
FEATURES from to/span description
pept / 204 473 MHC OLA-DQB beta-chain, exon 2 (AA at 3)
876 1157 MHC OLA-DQB beta-chain, exon 3
1656 1766 MHC OLA-DQB beta-chain, exon 4
2232 2255 MHC OLA-DQB beta-chain, exon 5
2697 2710 MHC OLA-DQB beta-chain, exon 6
IVS < 1 203 MHC OLA-DQB intron A
IVS 474 875 MHC OLA-DQB intron B
IVS 1158 1655 MHC OLA-DQB intron C
IVS 1767 2231 MHC OLA-DQB intron D
IVS 2256 2696 MHC OLA-DQB intron E
signal 3014 3020 poly-A signal
BASE COUNT 635 a 773 c 851 g 707 t 157 others
ORIGIN
1 cgggaggccg gggacgagcc gctggaggac ctgggttcgc atgcacgaag gcgcagggcc
61 gggctggaac cgaacagggg ccgagcggcg gcgggttgcc ggtttcggga cccgctggcg
121 cgcggccggc acctccccct ctgccgagcg cgccccgtgg ggctgtgtgg gctgagcctg
181 accgagcggc tgtctccccg cagaggattt cgtgtaccag tttataggcc actgttactt
241 caccaacggg acggagcggg tgcggcatgt gaccagatac atctacaacc aggaggagtt
301 cgtgcgcttc gacagcggac tgggacgagc accggcggtg accctgctgg ggcggccgga
361 cgccgagtac tggaacagcc agaaggacat catggagcgg acgcgggccg aggtggacac
421 ggtgtgcaga cacaactacc aggtggaagc cgccttcacc tggcagcggc gaggtgagtg
481 ccggcgcccg ctccgcgggg ccgccctcca cccgccagga ctccgcaccg aaggactgag
541 tctccggggt cggtccccag accacggatg ggacagagag gcgctgaggg acaggggccc
601 aggggcacag cgtgagggcg gggcatcgaa tttggcaccc tggagccttc ctgcagnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tcccacctgg aatgtgctgt gtggtctcac
841 gtctcactgt cttttcctgc ctgttcttcc ctcagtggaa cctacagtga ccgtctcccc
901 atccaggact gaggctctaa accaccacaa cctgctggtc tgctcggtga cagatttcta
961 tccaggccag atcaaggttc ggtggttccg gaatgaccgg gaggagacag ccggcgttgt
1021 gtccacccct cttattagga acggggactg gaccttccag atcctcgtga tgctggaaat
1081 gaccccccag cgaggagatg tgtacacctg ccgcgtggag caccccagcc tccagagccc
1141 catctcggtg gaatggcgta agggcacttg gtttcctttc actgtgggcc caacaggtca
1201 gggcagacag agcttcccca gtccatgccc tctcatccct tgtccccacc atcactactg
1261 aactggaaat cacaggacac aagagtgctc atgcctccta gcacaggcat cagaagagcc
1321 aaatcacatt gtcttttcac atacagggaa ctcactctac acatcatggc cccagagccc
1381 agcctggtag ctctgtagaa ttgactggtg accacagtct taaggtctaa agttatagaa
1441 gtgtccctga gagcagggat ccactttcac cttctctcac ctgcccactg tgtccaaagc
1501 tctgttggca ggtcccttcc atgggggtgg tcagaatgga gagccaggtt cccctgacac
1561 ctccacctcc tgtaccttag actagacctc aagcttccta aagggacacc atgagatgtg
1621 gggacaaacg ctgatactca ggctctgctc cccaggggcg cagtctgaat ctgcccagag
1681 taagatgctg agtggtgttg ggggcttcgt gctggggctg atcttcctca gtctgggcct
1741 catcatccat cacaggagcc agaagggtaa ggaactctgg ggaaatggga aaatgggctg
1801 tgattgagac cctctgttca gggcagtgtc tgctctgggt gtagttcttt ccacctgatc
1861 ctgaaacagg gaggcggggc tggggggtgg gaggaaatgg acaacttagg gagacattgg
1921 agtctgactt tactggtttg agagggtagc cctgccatgg agatggcagt ggaatttatt
1981 ctaagactac cttaaagttc atcattgtct catgggctcc tttctgaaaa cttcctgcat
2041 taagaggatc agagcactgg tttcctttcc ttctagtgac agcttcattc atttggggga
2101 attttagctt agggcagtta aggcctgtgg tcaatggata aaaaagaaat acattttccc
2161 ttgagtagct tatctcattt ccctttggat gagtaaggca gtggctgttt gtgtcacgag
2221 agcttttgta ggacattgtg ggcccctacc agcaggtgct ctgtttctgt tctgtgtcag
2281 tggggttgtg ggataaagga gggaagggtg tgaggtgagt gtacctgggt gcagtgtctc
2341 gttcatgacc tgttccctac tatggaatca ggagttaggg aagaagtttc tgtaggaggt
2401 tctgtaggaa gccataactt tgtggcatct ttctgtgaaa cttggagcca gagctctggt
2461 ttgaaagata gataccagga tgttgcctac tttgtgccac atgttggtgg ctactgcctg
2521 tgggcattta taagtgactg aatgtggtag aaagaaggtg agctatcatt aaatttacta
2581 aaaaattgaa atcttcacat cccccagaag gacaatagcc gctcccctgc ttcccatgcc
2641 tccttgccag gttgaagtgt gggtcctgtg tgctgatcac tctctctctt ctacagggct
2701 cacgcgctga ctcctgagga tattttggga ttggtgtttg ctcttcttta atcatgtgcc
2761 tgttccgccc ggaattcnnn cttctgagat cagagttggt caccaggtca tttcccgtgg
2821 ccatccacca accaaggctc tggctgtgat gctgcttcct ccactgaccc tggagcctct
2881 gactgtgcat tttcagctgc atctactcag accccaagag cttcctctgc tcccattttt
2941 acccaacaga gtgtgcaaaa gaagcacatt gaaataattt accaggctgt agagtttttt
3001 tccctttaat cataattaaa catgattatg aggttatctg tactctgaac ttccttaaat
3061 gagtaggagg gtagggaacc actgcagaat gaagggaaac tattttgggg tgaaccagcc
3121 aag
//
LOCUS SHPMHDRB 2277 bp ds-DNA MAM 30-MAY-1990
DEFINITION Ovine MHC class II OLA-DRB beta-chain pseudogene, partial cds.
ACCESSION M33306
KEYWORDS cell surface glycoprotein; class II gene;
major histocompatibility complex;
pseudogene. pseudogene. integral membrane protein.
SOURCE Ovine (strain Merino) DNA, clone C5-2, library CH.28.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 2277)
AUTHORS Scott,P.C. and Brandon,M.R.
TITLE Nucleotide sequence of ovine MHC class II beta genes: DQB and DRB
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.C.Scott, 28-MAR-1990. Univ of Melbourne, Veterinary
Preclinical Centre, Parkville, Victoria, AUSTRALIA 3052
FEATURES from to/span description
pept.ps / 891 1169 MHC OLA-DRB beta-chain, exon 3
1555 1665 MHC OLA-DRB beta-chain, exon 4
1914 1937 MHC OLA-DRB beta-chain, exon 5
2144 2155 MHC OLA-DRB beta-chain, exon 6 (alt.)
2232 / 2243 MHC OLA-DRB beta-chain, exon 6' (alt.)
IVS < 1 890 MHC OLA-DRB intron B (no splice consensus)
IVS 1170 1554 MHC OLA-DRB intron C (no splice consensus)
IVS 1666 1913 MHC OLA-DRB intron D (no splice consensus)
IVS 1938 2143 MHC OLA-DRB intron E (alt.; no splice
consensus)
IVS 1938 2231 MHC OLA-DRB intron E' (alt.; no splice
consensus)
IVS 2156 > 2277 MHC OLA-DRB intron F (alt.; no splice
consensus)
IVS 2244 > 2277 MHC OLA-DRB intron F' (alt.; no splice
consensus)
BASE COUNT 485 a 437 c 447 g 513 t 395 others
ORIGIN
1 ccaacgtcgc taaaaaagag agactgtata aaatgatcag ttcagtaaaa agactgctta
61 ttttgcaaag ggggcctcct tggtggctga tacagcaaag aatgtgcctg caatgtgggg
121 gacccaggtt caaactctag gttgggaagt tcccctggag aagggaatgg ctaccctctc
181 cagtattctt gtctggagaa ttccatggac agagctgctg tccagtgggt agcaaagttg
241 gacacaactg agtgactaac actttcactt tcacttcatt ttgcaaagga gagaaattga
301 tcagctgagt gtaattgcac gtgttctttt tttaaaaaat aaatgactat ttgctccttt
361 ccttctcctt cagagctcnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
481 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngtcaaccct
601 aaataccttg caaagattgg taaacacatc aggtcataat gaataattct tttgggctat
661 gaaaaaacaa aacaaaatta atttcctcca catctctgaa atctaacagt cccgtattgt
721 aagaattaaa gtctaagctc atggttctaa ctcctgttgg gtcacatggc cagttttcag
781 agaggtctgc caagactgga gcatgggact agagactagg ggcagaaagg cagccaggga
841 gatttactct gccttcctca ctgatttcct ccactttttc tctcccagtg gagcctacag
901 tgactgtgta tcctgaaaag acccagcccc tgcagcacca cgacctcctg gtctgctctg
961 tggatggttt ctatctgggc catgctgaag tcaggtggtt ctggaaaggc caggaagagg
1021 aggctggggt gatctccaca ggcctgatcc agaacggaga ctagaacctc cagaccatgg
1081 tgatgcttga aatggttcct cagagtggag aggtctacac ctgccaagtg gagcacccca
1141 gccggacgag ccctctcaca gtggaacgag tgagaagctt ctgatcttgt gagttcctac
1201 caccgtggag ggggcttgtt ttccctgagt gtcaggtgcc tctctccaca ccatgtgttc
1261 atttgctcat gtttctcnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1321 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1381 nnnnnnnnnn nntgatcgac ttagcgtgat ttttcacact tctttctatg cagtggtcag
1441 ttattttagt tcttccggct gccctccctc attcctgaag ctctcagtct cagagtctca
1501 atcagcaaag ttgtatttca gataaaaatc acaaaaccat ggtttctttt ctcaggggca
1561 cagtctgact ctgcttagag taagatgatg agtggaatag gggactttgt tctgggtttg
1621 ctctttcttg tggtggggct gttcacccac ttcaggaatc agaaacgtga gagcctgttt
1681 acagggggat cctcatannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1741 nnnnnnnnnn nnnnnnnnnn nnnntcgggt cgctctcttt ctgccctgga tcatctggtg
1801 tggttgcagt gtcgagaaat ttcagatggg gctttgggtc tgaggacatt gaatctggta
1861 acagaattgc cttcatatct ggtaaatctg tatatcttcc cttcagtttc ccaggacact
1921 ctggacttca gccaacaggt catgcccttt aatcctcttt tagagataga tttggtctct
1981 ggctcagtgg taaagaacta tcctgccaat gcagagacac aagagacatg ggtttcatcc
2041 ctgagtcggg aatatctctt ggaataggaa atagcaacac actccagtat tcttgcctgg
2101 aaaatttcat ggaccagagg agcctgggac ctacagtccc ctagggttga aaagagtcag
2161 acatgattgt gtgagtgcat gggcacaagc gcaccctgta cacacacaca cacacacaca
2221 cacacacaca cggtctccta gaattgatcg tagagatgac aagacaggag acagaca
//
LOCUS ASGALDAA 3497 bp ds-DNA PLN 30-MAY-1990
DEFINITION A.niger aldehyde dehydrogenase (aldA) gene, complete cds.
ACCESSION M32351
KEYWORDS aldehyde dehydrogenase.
SOURCE A.niger DNA.
ORGANISM Aspergillus niger
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 3497)
AUTHORS O'Connell,M.J. and Kelly,J.M.
TITLE Physical characterization of the aldehyde-dehydrogenase-encoding
gene of Aspergillus niger
JOURNAL Gene 84, 173-180 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 1187 1251 aldehyde dehydrogenase (aldA), exon 1
1321 2522 aldehyde dehydrogenase, exon 2
2580 2649 aldehyde dehydrogenase, exon 3
2703 2859 aldehyde dehydrogenase, exon 4
pre-msg 1158 2969 aldA mRNA and introns (alt., + 1 bp)
pre-msg 1158 3016 aldA mRNA and introns (alt., +/- 1 bp)
IVS 1252 1320 aldehyde dehydrogenase intron A
IVS 2523 2579 aldehyde dehydrogenase intron B
IVS 2650 2702 aldehyde dehydrogenase intron C
BASE COUNT 833 a 995 c 806 g 863 t
ORIGIN
1 agatctcact agtcagtggc taattggact atacagggat gtttgcaaag tgcgtgaact
61 agcatctact aagtactaat ccccatcatg aaaggaacaa ggggactggc tgactaaaat
121 gttacagctt gcaggtcaac aaactagctc aagagactag attccacttg gtccccgcca
181 cttacaccga ggtggagggg ttccggaaat acgaaaagag cttgctccaa tcgctgtgag
241 taatattccc tagtgagtta caaaggagat gactgataac tcactactaa gaggttgcct
301 agctactagc ctcgtcttct cccggccggc gaaggtcctc cattcttctt cggatgatgt
361 aatgaagaaa ccagatgtaa ctgactagta accatatcaa gtcaagacac gataaccatc
421 cccttccact accactacca gtaccagtac cagtaccaca gggactcttc gatggatgaa
481 tactagttag ttacacggta aaatggctga cacaggggga atctgggtgg tggtggtggt
541 ggtggtggtg gccaggaaaa agagccacgg atagcgaaac cccgcacgat cgctgccgat
601 tgttggatag agagtgggga acgggctggc atgtgggaat tcacctactt ttcaggttcc
661 cttcatgccc tggtcactta cgctttaccg cgatacaata taatatgatg gttacttata
721 ggggtctcgg gagtacgtag gttaactcgc ggtgggtttc gggctcaagg agcttggtct
781 ctccaatcgg agcatctaga tgagggggtg gttatccgcg gagggattag aatgaggtaa
841 atggcaatgg atgagacccg gaatttctcc catggatccg actagaccag tccaattgaa
901 ctaaactaaa ctaaacccag tggacgagga tgatctccgc tggcgtttcc ccctcgcgcc
961 tcgcaagctt cccctccctt caaccatcaa ccatcaaacc aacttctctt tcctctttct
1021 ctctcttccc ccccccactt ccaccccacc actagtacaa agtaactaaa cttcctgtct
1081 gctttattaa ttgatagctc tccccctctc cgctcttctt tcttcttctc ttccctctct
1141 ctctttcttt ctcttctaac caacacctcc tctcccatat ctcatcatgt ctgacctctt
1201 cgccaccatc acgaccccca acggcgtcaa atacgaacag cctctgggct tgtatgttat
1261 ccccgcgtcc cgtcctctcg atatcgcgct tttggaaact aaccactgtt tactttacag
1321 gttcattgac ggtgaattcg tcaagggcgc ggaaggcaag accttcgaga ctatcaaccc
1381 cagcaacgag aagcccattg tcgccgttca cgaagccacg gagaaggatg tggatactgc
1441 cgttgccgct gctcgcaagg ccttcgaggg ctcctggcgc caggtcactc cttccacccg
1501 tggccgcatg ctgaccaagc tggccgacct tttcgagcgc gatgccgaga tcctggcttc
1561 catcgaagct ctggacaatg gcaagtctat caccatggcc cacggtgata tcgctggcgc
1621 cgccggttgc ctgcgctact atggtggctg ggccgacaag atccatggtc agaccatcga
1681 caccaactcc gagaccttga actacactcg ccacgaaccc atcggtgtct gcggacagat
1741 cattccctgg aacttcccct tgctgatgtg ggcctggaag attggtcctg ccatcgctac
1801 cggtaacact gttgttatca agactgctga gcagactcct ctgtccggtc tctatgccgc
1861 gaacgtgatc aaggaggctg gcatccccgc tggtgtggtt aacgtcatct ccggtttcgg
1921 tcgtgtggcc ggttccgcca tctctcacca catggacatc gacaaggtcg ccttcactgg
1981 ctccaccctt gttggccgta ctatcctcca ggccgccgcc aagagcaacc tcaagaaggt
2041 gaccctcgag ctgggtggca agtcccccaa cattgtcttc aacgatgccg acattgacaa
2101 cgctatctcc tgggccaact tcggtatctt ctacaaccac ggtcagtgct gctgtgctgg
2161 ttctcgtatc ctggttcagg aaggcatcta cgacaagttc attgcccgtc tcaaggagcg
2221 tgctctccag aacaaggtcg gtgacccctt cgccaaggac accttccaag gtccccaggt
2281 ttcgcagctc cagttcgacc gcatcatgga atacatccag cacggtaagg acgccggtgc
2341 taccgtggct gttggtggtg agcgtcacgg cacagagggt tacttcatcc agcctaccgt
2401 cttcaccgac gtcacctccg acatgaagat caaccaggag gagatcttcg gccccgtcgt
2461 cactgtccag aagttcaagg acgtcgagga cgctatcaag atcggcaaca gcacttctta
2521 tggtgagttc tttttcgcca ctaaattgca gtgtgtcaag tgctaatctt tcgtcctagg
2581 tctcgctgcc ggtatccaca ccaaggatgt caccaccgcc atccgtgtct ccaacgccct
2641 ccgtgctggg taagctatac ttaacactac attcctgtat tcattcacta acatgcacat
2701 agaaccgtct gggtcaacag ctacaacctg atccaatacc aggttccctt cggaggattc
2761 aaggagtccg gtatcggtcg cgagctcggc tcgtacgccc tcgagaacta cacccagatc
2821 aaggccgtgc actaccgtct gggcgacgct ctcttctaat ttaatatttg aaaccatgag
2881 aaaatgggaa gagttcgctt cagcgacgag ctcgtggata ttgggttacc tttttcgcag
2941 atagccatga ctggttcatg caaaacaatc aggatacgaa tttacgatga ttacaaattg
3001 ttcttacatt gcagtaactg tagacctttg taaatataat ctgtctcaag cttcaacctc
3061 cacataatct tggtactggc tataatcact ccctagtggt taactaagga cgaatcacca
3121 ccccttccac tcgttcaatt gcaatcgacc tccatcaggc cattgcatat cgtacccatt
3181 gttacacacc caaattcccg ggacaaccac cggcgagttt gagtagcctg ggtctagtct
3241 cgggagaatt actacgccga gaagagagac cccctcgggt cgaagatggc ccgaagcgag
3301 ggcattcatc agcgaataca caaggaccga atgattggtt atgtcaccga cgttgatgat
3361 gatcagccat tcaaaaagac agctaatttc aatcttcagt acttcatagg ttgattgatt
3421 acttgctggt gccgaaaata ggtattcggg aactataatc tcctatcttg tcagttctgg
3481 tactagaatg catcgat
//
LOCUS POABETA1 2293 bp ds-DNA PLN 30-MAY-1990
DEFINITION P.agilis beta-1 tubulin (beta-1-tub) gene, complete cds.
ACCESSION M33372
KEYWORDS beta-1 tubulin.
SOURCE P.agilis (strain Aragoa) DNA.
ORGANISM Polytomella agilis
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Volvocales; Dunaliellaceae.
REFERENCE 1 (bases 1 to 2293)
AUTHORS Conner,T.W., Thompson,M.D. and Silflow,C.D.
TITLE Structure of the three beta-tubulin-encoding genes of the
unicellular alga, Polytomella agilis
JOURNAL Gene 84, 345-358 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 233 250 beta-1 tubulin (beta-1-tub), exon 1
327 353 beta-1 tubulin, exon 2
432 555 beta-1 tubulin, exon 3
624 848 beta-1 tubulin, exon 4
974 1911 beta-1 tubulin, exon 5
IVS 251 326 beta-1 tubulin intron A
IVS 354 431 beta-1 tubulin intron B
IVS 556 623 beta-1 tubulin intron C
IVS 849 973 beta-1 tubulin intron D
BASE COUNT 524 a 578 c 464 g 727 t
ORIGIN
1 tttttttgta aggtttaaga ggtcatttgc ttcctaaatg gagtttaaaa aaattaaatg
61 tgtaggtgct ggttgattat atgcttccgg ctcgtatgtt gtgtggaatg tgagcggata
121 acaatttcac acaggaaaca gctatgacat gattacgaat tatatatatg tgccgtcctc
181 ccaaaaaatt taaattgaac tacgcctgtt ttgtcaattt aaccggtagc aaatgcgtga
241 aatcgttcat gtgagtaata tttatttgat atctgccact tactctcctt attctaatag
301 catcttctcc ttttcctcac ctacagattc aaggtggcca gtgtggtaac caggtgtgtt
361 tttcgccctc ttgcctgcta aatgttcgtt ttgactctat caacacctct aacatgcttc
421 tctccctgta gattggtgcc aagttctggg aagtcgtttc tgacgagcat ggcattgacc
481 ccaccggtac tgctcttggt gacagcgacc tccagcttga gcgcatcaac gtctacttca
541 acgaggccac tggaagtaag taatttgagc ccctcctcaa tatttttttc agctccaatt
601 aaaatctttt tttctcactc caggccgcta tgtcccccgt gccatcctca tggatcttga
661 gcccggtacc atggattccg tccgctctgg tccctatggc cagattttcc gccctgacaa
721 ctttgtcttc ggtcagactg gtgccggtaa caactgggct aagggccact atactgaggg
781 tgctgagctc attgactccg tccttgatgt tgtccgcaag gaggctgagt cctgcgattg
841 ccttcagggt aagaatttga tatcttttat ttctatgatt cttagattat tttttcaacc
901 taacatagtt aatatttcaa attgtatctc actgttcatt tttataaaat taatttttat
961 tatgtcttta aaggcttcca ggtctgccac tccctcggtg gtggtaccgg ctccggtatg
1021 ggaaccctcc ttatttccaa gatccgtgag gagtaccctg accgcatgat gctcaccttc
1081 tccgtcgtcc cctcccccaa ggtgtctgat accgtcgttg agccctacaa cgccaccctc
1141 tccgttcatc agctcgttga gaacgccgat gagtgcatgg tcctcgataa cgaggctctc
1201 tacgatatct gcttccgcac ccttaagctc accaccccca ccttcggtga ccttaaccat
1261 cttatctccg ccgtcatgtc cggcatcacc tgctgcctcc gcttccccgg tcagctcaac
1321 gctgatctcc gtaagcttgc cgtcaacctt atccccttcc cccgtctcca tttcttcatg
1381 gttggtttca cccccctcac ctcccgtggc agccagcagt accgtgccct caccgtcccc
1441 gagcttaccc agcagatgtg ggatgccaag aacatgatgt gcgccgctga tccccgccac
1501 ggtcgttacc tcaccgcctc cgcccttttc cgtggtcgta tgtccaccaa ggaggtcgat
1561 gagcagatgc ttaacgtcca gaacaagaac tcttcttact tcgttgagtg gatccccaac
1621 aatgtcaagt cctccgtttg cgatatccct cctaagggtc ttaagatgtc tgctaccttc
1681 attggtaaca gcactgctat ccaggagatg ttcaagcgtg tctctgagca gttcactgcc
1741 atgttcaggc gcaaggcttt cttgcattgg tatactggtg agggtatgga tgagatggag
1801 ttcactgagg ccgagtccaa catgaacgat ctcgtctccg agtaccagca gtaccaggat
1861 gcctctgctg aagaggaggg cgagttcggt gaggaagagg aggagaacta aactatcaca
1921 aactatttgt gaaactagcc gctgatttta aggtttatgt agaaggggga tacccacaat
1981 ttttgcttta atgattcagt ttatgtttaa ctgtatgtat gtttttttat aatgaatagc
2041 agtcactgtc cactctataa tgtaatttta ttttgttttc ttcactaata gcgtaatgcc
2101 gagagcttgt aatgataaag gacgttaatt acgaagagta atatagatat aaatattttt
2161 tacatgatgt ttataattat aataacaata aaatactttt tgactcttct aacgttaatg
2221 ttattattct ctatcctcgg tcacgtttac gtttttttta ccctttgttt tatttcctgc
2281 cgcatcttta taa
//
LOCUS POABETA2 2121 bp ds-DNA PLN 30-MAY-1990
DEFINITION P.agilis beta-2 tubulin (beta-2-tub) gene, complete cds.
ACCESSION M33373
KEYWORDS beta-2 tubulin.
SOURCE P.agilis (strain Aragoa) DNA.
ORGANISM Polytomella agilis
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Volvocales; Dunaliellaceae.
REFERENCE 1 (bases 1 to 2121)
AUTHORS Conner,T.W., Thompson,M.D. and Silflow,C.D.
TITLE Structure of the three beta-tubulin-encoding genes of the
unicellular alga, Polytomella agilis
JOURNAL Gene 84, 345-358 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 233 277 beta-2 tubulin (beta-2-tub), exon 1
377 500 beta-2 tubulin, exon 2
565 789 beta-2 tubulin, exon 3
910 1847 beta-2 tubulin, exon 4
IVS 278 376 beta-2 tubulin intron A
IVS 501 564 beta-1 tubulin intron B
IVS 790 909 beta-1 tubulin intron C
BASE COUNT 479 a 530 c 439 g 673 t
ORIGIN
1 tttttttgta aggtttaaga ggtcatttgc aagatcagag ctagcgtatc cttatgaatc
61 gctcctacac atcttatagt gtggttacga aaatcaaatt tggagtagtt tggatcccat
121 ggatatttac tataaactag gtgtccaaac caaatatttc aaaagctatt ttttgaagct
181 ccgattgcag agacttcatt ttccttcata atatattagt agaaaaacaa ccatgcgtga
241 aattgttcat attcagggtg gccagtgcgg taaccaggta aattattaat cgttttttat
301 tctttgcact tattgcacag tcctcttaag ccctacgtct aattcttttc cttttttact
361 attcaacctc ctgtagattg gtgccaagtt ttgggaggtc gtctctgacg agcacggcat
421 tgaccccacc ggtactgcaa ttggtgacag cgatcttcag ctcgagcgta tcaacgttta
481 ctttaacgaa gccaccggca gtatgtatac ccgtattctt aagaatccaa tattttttcc
541 tgatgtttta cttgatatat acaggccgtt atgtcccccg tgccatcctc atggatcttg
601 agcctggtac tatggactcc gttcgttctg gtccctatgg tcagatcttc cgtcctgaca
661 acttcgtctt tggccagact ggtgccggta acaactgggc taagggtcat tataccgagg
721 gtgccgagct catcgactcc gtccttgatg ttgtccgcaa ggaggccgag tcttgcgact
781 gtctccaagg tagatttctt tttgcatttt tctttttcaa atttgcgtat taattcttca
841 acctattaaa ctttgacttt tccgatttgc tgaaatattt tttctcaatg ttcctttttt
901 ttgtttcagg tttccaggtt tgtcactctc tcggtggtgg tactggctct ggtatgggaa
961 ctctccttat ttccaagatc cgtgaggagt accctgaccg tatgatgctc accttctccg
1021 tcgtcccctc ccccaaggtg tctgataccg tcgttgagcc ctacaacgcc accctctccg
1081 ttcatcagct cgtcgagaac gccgatgagt gcatggtcct cgataacgag gctctctacg
1141 atatctgctt ccgcaccctt aagctcacca cccccacctt cggtgacctt aaccacctta
1201 tctccgccgt catgtccggc atcacctgct gcctccgctt ccccggtcag ctcaacgctg
1261 atctccgtaa gcttgctgtc aaccttatcc ccttcccccg tctccacttc ttcatggttg
1321 gtttcacccc cctcacctcc cgtggcagcc agcagtaccg tgctctcacc gtccccgagc
1381 ttacccagca gatgtgggat gccaagaaca tgatgtgcgc cgctgatccc cgccacggtc
1441 gttacctcac tgcctccgcc cttttccgtg gtcgtatgtc caccaaggag gtcgatgagc
1501 agatgcttaa cgtccagaac aagaactctt cttacttcgt tgagtggatc cccaacaacg
1561 tcaagtcctc cgtttgcgat atccctccta agggtcttaa gatgtctgct accttcattg
1621 gtaacagcac tgccatccag gagatgttca agcgtgtctc tgagcagttc actgccatgt
1681 tcaggcgcaa ggctttcttg cactggtaca ctggtgaggg tatggatgaa atggagttca
1741 ctgaggccga gtccaacatg aacgatctcg tctccgagta ccagcagtac caggatgcct
1801 ctgctgagga agagggcgag ttcggtgagg aagaagagga gaactaagcg ctcttttttt
1861 gtgagaatta attgaaacgc tcgatatatt tttacaaaga attagaggag tgataggtta
1921 aagcctgatc ttaatttttc tttgctgaat atgtattgat gtgttgtagt attagtcatg
1981 tgagtatgtt tgtaatatag taaaccttta tgcatctttt cgtttttata tgtagcagga
2041 gttaaaaact aaacttttga attttccaaa taacttttca tccttaataa attgtaatat
2101 tggtatatgc tttatagcca c
//
LOCUS POABETA3 2202 bp ds-DNA PLN 30-MAY-1990
DEFINITION P.agilis beta-3 tubulin (beta-3-tub) gene, complete cds.
ACCESSION M33371
KEYWORDS beta-3 tubulin protein.
SOURCE P.agilis (strain Aragoa) DNA.
ORGANISM Polytomella agilis
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Volvocales; Dunaliellaceae.
REFERENCE 1 (bases 1 to 2202)
AUTHORS Conner,T.W., Thompson,M.D. and Silflow,C.D.
TITLE Structure of the three beta-tubulin-encoding genes of the
unicellular alga, Polytomella agilis
JOURNAL Gene 84, 345-358 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 233 250 beta-3 tubulin (beta-3-tub), exon 1
325 351 beta-3 tubulin, exon 2
429 552 beta-3 tubulin, exon 3
620 844 beta-3 tubulin, exon 4
984 1921 beta-3 tubulin, exon 5
IVS 251 324 beta-3 tubulin intron A
IVS 352 428 beta-3 tubulin intron B
IVS 553 619 beta-3 tubulin intron C
IVS 845 983 beta-3 tubulin intron D
BASE COUNT 494 a 552 c 464 g 692 t
ORIGIN
1 tttttttgta aggtttaaga ggtcatttgc ttcctaaatg gagtttaaaa aaattaaatg
61 tgtaggtgct ggttgattat tacttttatg gatcaatact caaaactcga aagtgtatca
121 aattcataac gacactaaaa attcgagaat tgtcgtacaa cctatggcct tatataaatc
181 atgctcccct tttaaaaatc aacttgttta gtttttagcc aaataaataa ccatgcgtga
241 aatagttcat gtaagttgca attgttgttg attttgccgt tttgttcata ttcttattct
301 ttttgctgtt ttctgcctcc acagattcaa ggtggccagt gcggtaacca ggtggcttct
361 gccagttttg gctccctttc ctcagctttt gaatacatga gcctttctaa cttgtcttct
421 tactgcagat cggtgccaag ttctgggagg tcgtctctga tgagcacggc atcgacccca
481 ccggtaccgc ccttggtgac agcgacctcc agcttgagcg catcaacgtc tacttcaatg
541 aggccactgg aagtaagtaa tcagcgtgtt gccaatattt tttggggctt ggatttacta
601 tttttttact tctcttcagg ccgttatgtc ccccgtgcta tcctcatgga tcttgagccc
661 ggtaccatgg attccgtccg ctctggtccc tatggtcaga ttttccgccc tgacaacttt
721 gtcttcggcc agactggtgc cggtaacaac tgggctaagg gtcactacac tgagggtgct
781 gagctcattg actctgtcct tgatgttgtc cgcaaggagg ctgagtcctg cgattgcctt
841 cagggtaaga ttatttttat tttccacttt tatgttcctt ataatagggt ggggcattaa
901 ttttgtatta ttcagcacat ctctcgtata tttaaatata taaatatatt tcttatttaa
961 aatttttaaa taatatgtta caggcttcca ggtctgccac tctctcggtg gtggtaccgg
1021 ttccggtatg ggaaccctcc ttatttccaa gatccgtgag gagtaccctg accgcatgat
1081 gctcaccttc tccgtcgtcc cctcccccaa ggtgtctgat accgtcgttg agccctacaa
1141 cgccaccctc tccgttcatc agctcgttga gaacgccgat gagtgcatgg tcctcgataa
1201 cgaggctctc tacgatatct gcttccgcac ccttaagctc accaccccca ccttcggtga
1261 ccttaaccat cttatctccg ccgtcatgtc cggcatcacc tgctgcctcc gcttccccgg
1321 tcagctcaac gctgatctcc gtaagcttgc cgtcaacctt atccccttcc cccgtctcca
1381 tttcttcatg gttggtttca cccccctcac ctcccgtggc agccagcagt accgtgccct
1441 caccgtcccc gagcttaccc agcagatgtg ggatgccaag aacatgatgt gcgctgctga
1501 tccccgtcac ggtcgttacc tcactgcctc cgcccttttc cgtggtcgta tgtccaccaa
1561 ggaggtcgat gagcagatgc ttaacgtcca gaacaagaac tcttcttact tcgttgagtg
1621 gatccccaac aatgtcaagt cctccgtttg cgatatccct cctaagggtc ttaagatgtc
1681 tgctaccttc attggtaaca gcactgccat ccaggagatg ttcaagcgtg tctctgagca
1741 gttcactgcc atgttcaggc gcaaggcttt cttgcattgg tacactggtg agggtatgga
1801 tgagatggag ttcactgagg ccgagtccaa catgaacgat ctcgtctccg agtaccagca
1861 gtaccaggat gcctctgctg aagaggaagg cgagttcggt gaggaggagg aggagaacta
1921 aagaactcaa ttgcatttca tgttttaaat tcaggaatga tctagagata ctttgctttt
1981 ttttttcttg cttgtagcac ctctcgtcgc gattttgaaa gcattatcaa tacatcctag
2041 tcagctatgt tcggtggatg atataatgaa gcttttgcac gacaatattt gtgtgctgaa
2101 ccgcttatga ctgaattaag gttatagata atttcagtgt aaattattag tgctccgcta
2161 ataccaaatc cattttttat tttatgtcca tgaagttata ta
//
LOCUS CELDAF1 5899 bp ds-DNA INV 30-MAY-1990
DEFINITION C.elegans serine/threonine protein kinase (daf-1) gene, complete
cds.
ACCESSION M32877
KEYWORDS serine protein kinase; threonine protein kinase;
transmembrane protein.
SOURCE C.elegans (strain N2) DNA, clones DR#[168,169], and cDNA to mRNA,
clones DR#[113,114,118].
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 5899)
AUTHORS Georgi,L.L., Albert,P.S. and Riddle,D.L.
TITLE daf-1, a Caenorhabditis elegans gene controlling dauer larva
development, encodes a novel receptor protein kinase
JOURNAL Cell (1990) In press
STANDARD full staff_review
COMMENT Authorin submission for [1] kindly submitted by L.L.Georgi,
16-MAR-1990.
FEATURES from to/span description
pept 268 313 serine/threonine protein kinase precursor 1
1693 1779 serine/threonine protein kinase precursor 2
2112 2239 serine/threonine protein kinase precursor 3
2972 3190 serine/threonine protein kinase precursor 4
3243 3999 serine/threonine protein kinase precursor 5
4182 4345 serine/threonine protein kinase precursor 6
4473 4739 serine/threonine protein kinase precursor 7
4785 5058 serine/threonine protein kinase precursor 8
5371 5438 serine/threonine protein kinase precursor 9
sigp 268 312 serine/threonine protein kinase signal peptide
matp 313 313 serine/threonine protein kinase
1693 1779 serine/threonine protein kinase
2112 2239 serine/threonine protein kinase
2972 3190 serine/threonine protein kinase
3443 3999 serine/threonine protein kinase
4182 4345 serine/threonine protein kinase
4473 4739 serine/threonine protein kinase
4785 5058 serine/threonine protein kinase
5371 5435 serine/threonine protein kinase
pre-msg 233 5899 daf-1 mRNA and introns
IVS 314 1692 daf-1 intron A
IVS 1780 2111 daf-1 intron B
IVS 2240 2971 daf-1 intron C
IVS 3191 3242 daf-1 intron D
IVS 4000 4181 daf-1 intron E
IVS 4346 4472 daf-1 intron F
IVS 4740 4784 daf-1 intron G
IVS 5059 5370 daf-1 intron H
rpt 2302 2832 direct repeat
rpt 2520 2634 direct repeat
BASE COUNT 1759 a 1194 c 1121 g 1825 t
ORIGIN Map position 29.2-29.4.
1 cttcttctct attcaattcc attcatagtg gactaggtga gccgctcgtc agtatctgct
61 cactgtctgc gtctctgctt ggctcttctc tttctctcca tgtgctccgg ctccccctcc
121 tcacactcaa ccacacacac acacacacgt atacatatct tcttatagtt ttgttgcatg
181 cttcatctct tttctcatct tttgccttct atgtactttg ctttttttcc agatctcctg
241 atgaactaaa ctcttgggaa ataaattatg aggatacggc acgtggtgtt ctgcctgctc
301 gcactggttt atggtgagta ctttgaggtt ttggtgtttg atagcaaaag tcatcaatct
361 atttttcggg ttaattgggt taatataggt attattatgc agctagatga tgatttggtt
421 ctatctaaac aaaaaagtgc aattaaaatc gaattctcac tggtttacat aaatttttag
481 ttataatcac aaatttaata gtttacgctc tatttttttt tttgacaaac tagatatagt
541 taaacatatt tgaaatttga tccaatatcc aatataagtg aaatcaagtt aaggttgtta
601 agtttgatat atgtagtagt aaatatatag aaatgttttt catatcaggt tcagccaatc
661 atattcgagt tgcgtagtgc ttatgcagtt gataggtttt ttggtataat ctagttatgt
721 tgtaactcta aaaaaatttt ctaattatca tcacgaatat gtatatcgtt taagaagtta
781 gaaaaaaaca ctgattattt ttttattagc tcaaaaagta tgtattagtc ccatttagct
841 ggcttttaga aattcgtgtt taattttgat tagtttgact atggaaattg tttgagccca
901 aagttaagca atttatgtta ttgcagatta gctgcagact ttttctatta gaatagaatc
961 ttattacata gttattattg cctaatttca gctatacaaa atgcgactta tcaacttatc
1021 attttagttg tagttgatcg cagttgaaga aaattgaaat gttattatga tcttaaaaaa
1081 atacatatca ctttaattat gagtgacttg aaattgctat tttttattgc aattcatagt
1141 tgatatatgt tcaaaaaagt caacaagtta actgatttgc tgatagtgaa tattaaattt
1201 agcgcaatga gcaattgatg caattaaatg ttagttgcag ttttattctg ttagtgggta
1261 tttagaaaat agctaggtgc aatttaaact aaattttcaa caatttagga atttaaactt
1321 tactttttag tcgcatcact tatctaggca ccccactttt taactttaaa agcgcctaac
1381 aataagtatc taagttgtcc tgtattttat tttaaaattg ccaatttgtt aattttttct
1441 ctatataatt aattatctat tgatatttac aaaatgtgcc gagtcctgcc cttccaaagg
1501 aaactagtgg ggtgcaatac caatagaaga gtacaaaatt tacagaagtc caaaataata
1561 tgcatcacgg ttgagagctt tttaaatttg ttatcagtat ttaggccttg gctcaaacac
1621 atcataaatt ggtacctgtg caagaaaatt aaccaaatac agctttctac gacccttttc
1681 aattttttcc aggcgccgaa acttccgacg atgatctcga cgaacggaca aatatcttca
1741 tccgtgataa actgattccg gcgttgaagc ttgcagaagg tagtttgttt ttttggactc
1801 aacagaaaat aatcgccggc actcaaatta gatggcattt atcgtcagaa gaagctttgg
1861 gcaaaatccc ccggtagcaa agcaaaataa tttcgttcga tcaaacatta aaagagattt
1921 tcaactatat atataatcag tgataagcct ataatcttta gtgtttcccc cgaaagcact
1981 aatctatggg aaatttcctt ttcgggggac gtgagtttat gcagaaaaaa aaattgaagt
2041 ttagtttatt tttggaagga tttttaatct tgtgataagg gatttaacaa gattttctga
2101 ttttcttgca gtgaccaagg tgaatttcac acgattacac ttatgccatt gctcaaggga
2161 agttggatgc aacgcgagga ctaccggatg ggtgcctgga atcgagttcc ttaatgagac
2221 ggatcgcagt ttttatgagg ttaggcttta accatgtggt cgagtttttc cttgcccgca
2281 atagatcctt attgaccaaa actacagtaa tcctacagta cctcaaccat cttcctctat
2341 tagacctgtc accaaagtat ttgacaaagc aaaaaaacaa gaataattta attactcgat
2401 aactccaaaa aacaacaaaa aattgtattt tggtttttta aatttcttta tctgattgtg
2461 aatattttcg ttttccccgc gatttcttta aaagatcaaa agactaagac taagataatc
2521 ttacagtacc ccgaccctat ccctccacta accttaatct cttcaaaaaa caatgagcat
2581 gttttctcct aagctacagt aatcctacag tacccctaca gtaccccagt actactacag
2641 taccccgacc ctatccctct actaatctta atctcttcaa aaaacaatga acatgttttc
2701 ttctaagcta cagtactcct acagtaccca tacagtaccc ttacaacact actacagtac
2761 ccgaccctat ccctctacta accttaatct cttcaaaaaa caaaaaaaac atgtttctcc
2821 tatactacag tactcttaca gtaccccaga ggtgtcctta taagtttgac tagtttttgt
2881 catttggcag taaaactacg gttcttctaa aatttcggtt acttggttac taggtctcaa
2941 ttctcacact agaccaatcc tctctttcca gaacacgtgc tacacggacg gaagctgcta
3001 ccagagcgct cggccgtcac ccgaaatctc acatttcgga tgcatggacg agaaatctgt
3061 gaccgacgag accgagtttc acgatacggc agcgaaagtg tgcacgaaca acacgaaaga
3121 cccgcacgct accgtctgga tttgctgtga taagggaaat ttttgtgcga acgagacgat
3181 catccacttg gtaagtgtga tctctttttt gacatccgga ttttgaattt tcaaattttc
3241 aggcgccagg cccgcagcag tcatccacgt ggctgattct tactatcctc gccctgctca
3301 cgttcatcgt gctcctgggc attgcaattt tcctcaccag aaaatcatgg gaggcgaaat
3361 tcgactggta tataagattc aagccaaaac ctggagatcc tctacgcgag acggagaaca
3421 atgtaccgat ggttacaatg ggcgacggtg ctggaagttc tgtgccagaa gttgctccaa
3481 tcgagcagca gggaagcaca atgagcacat ctgcagggaa ctcgttccct cctggaatca
3541 tgcctaataa catgaaggat atgcttgatg tactcgagga gacaagtgga tctggtatgg
3601 gtccaactac acttcacaaa ctcacgattg gtggacagat caggttaact ggtagagttg
3661 gatccggaag atttggtaac gtgtcacgtg gagactaccg aggtgaagcg gtcgctgtga
3721 aggttttcaa tgcgctcgac gagccggcgt tccacaagga gacggagatc tttgagacgc
3781 ggatgttgag gcatccgaat gtgctgagat atattggaag tgatcgggtt gatactggat
3841 ttgtaacaga actttggctg gtgacagagt atcatcccag cgggtccttg cacgattttc
3901 tgctcgagaa tacggttaac attgagacgt attataattt gatgaggtcg acggcaagtg
3961 gacttgcgtt cctgcacaat cagattggag ggagcaaagg taagtgcaat atacatgtgt
4021 ctcagaaatc taggagccta gactaggccc ctgagaagaa aaatttggac caacggtgtc
4081 tgaaactttt ctattcagag agtagtctag gacgccaact gagtccgctg gccagacgta
4141 ctctgtcgtt gaacatatat taatcaaaat acaaattcca gagtccaaca agccagcgat
4201 ggctcaccgg gatatcaaga gcaagaacat catggtcaag aacgatctga cgtgtgccat
4261 cggagatctc ggtctctcgt tatccaagcc agaagacgcg gcatcagata ttattgccaa
4321 tgagaactac aagtgcggta ctgttgtgag ttttgggatc taggccattt tagtcagaga
4381 gctcaaggct cgagaccctg ccaagacgga ggttttggcg taaagtctag ctataatctt
4441 ttttcaaaat tttcacaaat taaatttttt agagatacct tgcaccggaa atcctgaaca
4501 gtacgatgca gttcaccgtc ttcgagagct accaatgtgc cgatgtctac tcattttctc
4561 tagtcatgtg ggaaacccta tgcagatgtg aagatggaga tgttctgccg agagaagcgg
4621 ccacagtgat cccatatatt gagtggaccg atcgggatcc acaagacgct cagatgtttg
4681 atgtggtttg cacgcggagg ctcagaccta cagagaatcc tctgtggaag gatcatccgg
4741 taagataata aactttatgt aaagtacttc taactacttt tcaggaaatg aaacacatta
4801 tggagattat caaaacgtgc tggaatggca atccgagcgc ccgtttcacc tcctatattt
4861 gccgcaaacg aatggatgaa cggcagcaac tgttgctgga caagaaggca aaagctgtgg
4921 cccagacggc aggggtgaca gttcaggatc ggaagatttt gggtccccag aaaccgaaag
4981 acgagtcacc ggcaaatggt gctccgcgga tagttcagaa ggagattgat cgagaagatg
5041 agcaggaaaa ttggcggggt aagaattttg atcaggctcc tattgtggac cgagctacta
5101 agcccggacc aaaaaaaaac caacattttg ggtatcatgt ctatctggtg ccttaaatac
5161 cctacgaatt tagaaacttt tctcaaaagc ttccaactaa acaaattatc tttaaccttg
5221 aaggtcagat ctacctacta cctaatacta tagtagcgga cacttaataa gtccaggtcc
5281 ctaacctagc atagtttata tgcccgccac cgtagaattt tgcgccttta aagtgaagaa
5341 cacttttcaa cataaattaa tttttttcag aaactgcaaa aactcccaat ggacacattt
5401 ccagtaatga cgactcgtca cgcccacttc ttggttagag cccattttat cttcgttctt
5461 aggcgccaat atatcaaaat tattataatt attatgttct atatactaca cttatcacct
5521 ctcgtgactt cctcccaccc ccatatcgat gatctcattt ctcttcctca accactaact
5581 gtctgctacc tatttcttgt ttattagcgg gtgcattatg cagctaattg tccggcgccc
5641 catcattcta aaagagactc ctcaccgtgt acctcagaaa aacataaaaa tcttgtgata
5701 ggtatgtact gcataagctt tattaccctc ccccgaaact ttgtagtttg tagtttaccc
5761 tgtttttttt ctgtaccatg atttcaagtg tttttgaaat tttattttgt ttttattgtt
5821 attattatta ctattttcaa aaaaggagaa aacacatttc aatgcaaaaa ctgaaaatga
5881 aaataaaaaa tttgatttt
//
LOCUS MUSIGHAAI 318 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 3-14-9.
ACCESSION M32760
KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene;
variable region.
SOURCE Mouse (strain BALB/c) hybridoma 3-14-9, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 318 Ig H-chain V-D-region (AA at 1)
recomb 294 295 V-region DNA end/D-region DNA start
BASE COUNT 92 a 69 c 86 g 71 t
ORIGIN
1 gaggtgaagc ttctcgagtc tggaggtggc ctggtgcagc ctggaggatc cctgaatctc
61 tcctgtgcag cctcaggatt cgattttagt agatactgga tgagttgggc tcggcaggct
121 ctagggaaag ggcaggaatg gatcggagaa atcaatctag gaagcagtac gataaactat
181 acgccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac
241 ctgcaaatga gcaaagtgag atctgaggac acagcccttt attactgtgc aagactactg
301 gccaaggcta tggactac
//
LOCUS MUSIGHAAJ 318 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 1-5-1.
ACCESSION M32761
KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene;
variable region.
SOURCE Mouse (strain BALB/c) hybridoma 1-5-1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 318 Ig H-chain V-D-region (AA at 1)
recomb 294 295 V-region DNA end/D-region DNA start
BASE COUNT 93 a 67 c 82 g 76 t
ORIGIN
1 gaggtgaagc ttctccagtc tggaggtggc ctggtgcagc ctggaggatc cctgaaactc
61 tcctgtgcag cctcaggaat cgattttagt agatactgga tgagttgggt tcggcgggct
121 ccagggaaag gactagaatg gattggagaa attaatccag atagcagtac atttaactat
181 gcaccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac
241 ctgcaaatga gcaaagtgag atctgaggac acagcccttt attactgtgc aaaaaactgg
301 gactggtact tcgatgtc
//
LOCUS MUSIGHAAK 258 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 2-28-9.
ACCESSION M32762
KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene;
variable region.
SOURCE Mouse (strain BALB/c) hybridoma 2-28-9, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 258)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 258 Ig H-chain V-D-region (AA at 1)
recomb 234 235 V-region DNA end/D-region DNA start
BASE COUNT 84 a 52 c 61 g 61 t
ORIGIN
1 tcctgtgcag cctcaggaat cgattttagt agatactgga tgagttgggt tcggcgggct
61 ccagggaaag gactagaatg gattggagaa attaatccag atagcagtac aataaactat
121 gcaccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac
181 ctgcaaatga gcaaagtgag atctgaggac acagcccttt attactgtgc aaaaaactgg
241 gactggtact tcgatgtc
//
LOCUS MUSIGHAAO 243 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 3-27-6.
ACCESSION M32759
KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene;
variable region.
SOURCE Mouse (strain BALB/c) hybridoma 3-27-6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 243)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 243 Ig H-chain V-D-region (AA at 1)
recomb 219 220 V-region DNA end/D-region DNA start
BASE COUNT 82 a 46 c 58 g 57 t
ORIGIN
1 ggaatcgatt ttagtagata ctggatgagt tgggttcggc gggctccagg gaaaggacta
61 gaatggattg gagaaattaa tccagatagc agtacaataa actatgcacc atctctaaag
121 gataaattca tcatctccag agacaacgcc aaaaatacgc tgtacctgca aatgagcaaa
181 gtgagatctg aggacacagc cctttattac tgtgcaaaaa actgggactg gtacttcgat
241 gtc
//
LOCUS MUSIGHAAP 246 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 2-1-3.
ACCESSION M32763
KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene;
variable region.
SOURCE Mouse (strain BALB/c) hybridoma 2-1-3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 246)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 246 Ig H-chain V-D-region (AA at 1)
recomb 222 223 V-region DNA end/D-region DNA start
BASE COUNT 78 a 50 c 57 g 61 t
ORIGIN
1 tcaggaatcg attttagcag atactggatg agctgggtcc ggcgggctcc agggaaaggg
61 ctagaatgga ttggagaaat taatccaaat agcagtacaa taaactatgc accatctcta
121 aaggataaat tcatcatctc cagagacaac gccaaaaatg cgctgtacct gcaagtgagc
181 aaagtgagat ctgttaacac atccctttat tactgtgcaa gatcttacgg tagctggttt
241 gcttac
//
LOCUS MUSIGHAAQ 261 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 10L16-5.
ACCESSION M32764
KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene;
variable region.
SOURCE Mouse (strain BALB/c) hybridoma 10L16-5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 261)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 261 Ig H-chain V-D-region (AA at 1)
recomb 219 220 V-region DNA end/D-region DNA start
BASE COUNT 83 a 49 c 63 g 66 t
ORIGIN
1 ggattcgatt ttagtagata ctggatgatt tgggtccggc aggctccagg gaaagggcta
61 gaatggattg gagaaattaa tccagatagc agtacgataa actatacgcc atctctaaag
121 gataaattca tcatctccag agacaacgcc aaaaatacgc tgtacctgca aatgagcaaa
181 gtgagatctg aggacacagc cctttattac tgtgcaagaa aggggtattt ttactacggt
241 agtagctcgt acttcgatgt c
//
LOCUS MUSIGHAAY 291 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 3-27-6.
ACCESSION M32766
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) hybridoma 3-27-6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1)
recomb 285 286 V-region DNA end/J-region DNA start
BASE COUNT 85 a 73 c 61 g 72 t
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
241 gaagatattg ccacttactt ttgccaacag ggttatacgc ttcctcggac g
//
LOCUS MUSIGHABA 291 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 1-5-1.
ACCESSION M32768
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) hybridoma 1-5-1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1)
recomb 285 286 V-region DNA end/J-region DNA start
BASE COUNT 84 a 72 c 60 g 75 t
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccatcagcaa cctggaacct
241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttcctcggac g
//
LOCUS MUSIGKAAX 288 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-region.
ACCESSION M32765
KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 288)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 288 Ig kappa-chain V-region (AA at 1)
BASE COUNT 83 a 70 c 60 g 75 t
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatgtattac acatcaagtt tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
241 gaagatattg ccagttacta ttgtcagcag tatagtaagc ttcctcca
//
LOCUS MUSIGKAAZ 291 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 3-14-9.
ACCESSION M32767
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) hybridoma 3-14-9, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1)
recomb 285 286 V-region DNA end/J-region DNA start
BASE COUNT 86 a 73 c 61 g 71 t
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca ggacattagc aatttattaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaagat tacactacgg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac g
//
LOCUS MUSIGKABB 291 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 2-28-9.
ACCESSION M32769
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) hybridoma 2-28-9, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1)
recomb 285 286 V-region DNA end/J-region DNA start
BASE COUNT 84 a 73 c 61 g 73 t
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca gggcattagc aattatttaa actggtcaca gcagaaacca
121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccatcagcaa cctggaacct
241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttcctcggac g
//
LOCUS MUSIGKABC 291 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 2-1-3.
ACCESSION M32770
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) hybridoma 2-1-3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1)
recomb 285 286 V-region DNA end/J-region DNA start
BASE COUNT 83 a 72 c 61 g 75 t
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttcctcggac g
//
LOCUS MUSIGLKABD 246 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 10L16-5.
ACCESSION M32771
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) hybridoma 10L16-5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 246)
AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
French,D., Rothe,J. and Bona,C.
TITLE A molecular and structural analysis of the V-H and V-K regions of
monoclonal antibodies bearing the A48 regulatory idiotype
JOURNAL J. Immunol. 144, 614-624 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 246 Ig kappa-chain V-J-region (AA at 1)
recomb 240 241 V-region DNA end/J-region DNA start
BASE COUNT 74 a 54 c 52 g 66 t
ORIGIN
1 ggagacagag tcaccatcag ttgcagtgta agtcagggca ttagcaatta tttaaactgg
61 tatcagcaga aaccagatgg aactgttaaa ctcctgatct ataacacatc aactttatac
121 tcaggagtcc catcaaggtt cagtggcagt gggtctggga cagattattc tctcaccatc
181 agtaacctgg aacctgaaga tattgccact tattattgtc agcagtatag taagcttcct
241 ccgacg
//
LOCUS HUMACTAF 1123 bp ds-DNA PRI 30-MAY-1990
DEFINITION Human alpha-actin (ACTA) gene, non-coding exon 1.
ACCESSION J05193
KEYWORDS alpha-actin.
SOURCE Human placental DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1123)
AUTHORS Reddy,S., Ozgur,K., Lu,M., Chang,W., Mohan,S.R., Kumar,C.C. and
Ruley,H.E.
TITLE Structure of the human smooth muscle alpha-actin gene: Analysis of
a cDNA and 5' upstream region
JOURNAL J. Biol. Chem. 265, 1683-1687 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Reddy, 20-DEC-1989.
FEATURES from to/span description
pre-msg 892 > 1123 alpha-actin mRNA and introns
/hgml_locus_uid="LW0057W" /nomgen="ACTA"
/map="1p21-qter"
IVS 935 > 1123 ACTA intron A
signal 863 869 TATA box
BASE COUNT 286 a 258 c 258 g 321 t
ORIGIN
1 gaattcgaga cgagatttgg gtggggacgt agaaccaaac catatcacct ggtctctcta
61 cttcctgtca aggaggttag tgggcagaga ggagggctag aggcttcctt tgaacaatct
121 cctttctttt ccaaactact tctttgacag gctgctgggt agactctctg gtcaaaggat
181 ggtccctact tatgctgcta aattgctcgg tgacaaatta gtagacaaag ctaatgcacc
241 aaaaaaatga atgtagttat agtaatgcta acatccaaat tcctctttgt aagacatagg
301 cctgtcaacc ttgtctccat acttcaattc ctatttccac tcacctccct caagaacttg
361 atttataaac agtgtgccta ccataaaatc atcactccct ctatgtattt atagacgact
421 gaaggaatat ctttcttctt tgacatgcta ccgtggtaga agggttttaa aagtccgtgc
481 taggcagagg cagccctttc tgcccctttc tgttctcagt ttattaggaa atggcctgaa
541 attccagcat gatagcaagc tggcatcctc tgtggaatgt gcaaaccatg cctgcatctg
601 cccattaccc tagctcagtg tctctgggca tttctgcagt tgttctgaag gcttggcgtg
661 tttatctccc acaggcggct gaaccgcctc ccgtttcatg agcagaccag tggaatgcag
721 tggaagagac ccaggcctcc ggccacccag attagagagt tttgtgctga ggtccctata
781 tggttgtgtt agactgaaca gcaggctcaa gtctgtcttt gctccttgtt tgggaagcaa
841 gtgggaggag agcagaaggg gctatataac ccttcagctt tcagcttccc tgaacaccac
901 ccagtgtgga gcagcccagc caagcactgt cagggtaagt ggcgccaggc caaggatgtg
961 acttatagat tccagtggct cttttaatta cccggtataa taagacatca tctgcaggga
1021 tttggctggg ttcatgcact gatatttctg aatgaagatt gtactactaa aatgattgta
1081 gcttttggct ttaatgatct aacgttaaag acagggctaa tat
//
LOCUS HUMACTAM 1329 bp ss-mRNA PRI 30-MAY-1990
DEFINITION Human alpha-actin (ACTA) mRNA, complete cds.
ACCESSION J05192
KEYWORDS alpha-actin.
SOURCE Human umbilical artery, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Reddy,S., Ozgur,K., Lu,M., Chang,W., Mohan,S.R., Kumar,C.C. and
Ruley,H.E.
TITLE Structure of the human smooth muscle alpha-actin gene: Analysis of
a cDNA and 5' upstream region
JOURNAL J. Biol. Chem. 265, 1683-1687 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Reddy, 20-DEC-1989.
FEATURES from to/span description
pept 49 1182 alpha-actin /hgml_locus_uid="LW0057W"
/nomgen="ACTA" /map="1p21-qter"
mRNA < 1 1329 ACTA mRNA
signal 1305 1310 poly-A signal
BASE COUNT 326 a 361 c 327 g 315 t
ORIGIN
1 gcagcccagc caagcactgt acaggaatcc tgtgaagcag ctccagctat gtgtgaagaa
61 gaggacagca ctgccttggt gtgtgacaat ggctctgggc tctgtaaggc cggctttgct
121 ggggacgatg ctcccagggc tgttttccca tccattgtgg gacgtcccag acatcagggg
181 gtgatggtgg gaatgggaca aaaagacagc tacgtgggtg acgaagcaca gagcaaaaga
241 ggaatcctga ccctgaagta cccgatagaa catggcatca tcaccaactg ggacgacatg
301 gaaaagatct ggcaccactc tttctacaat gagcttcgtg ttgcccctga agagcatccc
361 accctgctca cggaggcacc cctgaacccc aaggccaacc gggagaaaat gactcaaatt
421 atgtttgaga ctttcaatgt cccagccatg tatgtggcta tccaggcggt gctgtctctc
481 tatgcctctg gacgcacaac tggcatcgtg ctggactctg gagatggtgt cacccacaat
541 gtccccatct atgagggcta tgccttgccc catgccatca tgcgtctgga tctggctggc
601 cgagatctca ctgactacct catgaagatc ctgactgagc gtggctattc cttcgttact
661 actgctgagc gtgagattgt ccgggacatc aaggagaaac tgtgttatgt agctctggac
721 tttgaaaatg agatggccac tgccgcatgg tcatcctccc ttgagaagag ttacgagttg
781 cctgatgggc aagtgatcac catcggaaat gaacgtttcc gctgcccaga gaccctgttc
841 cagccatcct tcatcgggat ggagtctgct ggcatccatg aaaccaccta caacagcatc
901 atgaagtgtg atattgacat caggaaggac ctctatgcta acaatgtcct atcagggggc
961 accactatgt accctggcat tgccgaccga atgcagaagg agatcacggc cctagcaccc
1021 agcaccatga agatcaagat cattgcccct ccggagcgca aatactctgt ctggatcggt
1081 ggctccatcc tggcctctct gtccaccttc cagcagatgt ggatcagcaa acaggaatac
1141 gatgaagccg ggccttccat tgtccaccgc aaatgcttct aaaacacttt cctgctcctc
1201 tctgtctcat gcacacaact gtgaatgtcc tgtggaatta tgccttcagt tcttttccaa
1261 atcattccta gccaaagctc tgactcgtta cctatgtgtt ttttaataaa tctgaaatag
1321 gctactggt
//
LOCUS ASOTAAG1 2935 bp ds-DNA PLN 30-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa-G1) gene, complete cds.
ACCESSION M33218
KEYWORDS Taka-amylase A.
SOURCE A.oryzae (strain JCM02239) DNA.
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 2935)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 607 772 Taka-amylase A (Taa-G1) precursor, exon 1
828 868 Taka-amylase A precursor, exon 2
955 1070 Taka-amylase A precursor, exon 3
1140 1248 Taka-amylase A precursor, exon 4
1317 1545 Taka-amylase A precursor, exon 5
1603 1765 Taka-amylase A precursor, exon 6
1830 1976 Taka-amylase A precursor, exon 7
2041 2281 Taka-amylase A precursor, exon 8
2360 2647 Taka-amylase A precursor, exon 9
sigp 607 669 Taka-amylase A signal peptide
matp 670 772 Taka-amylase A
828 868 Taka-amylase A
955 1070 Taka-amylase A
1140 1248 Taka-amylase A
1317 1545 Taka-amylase A
1603 1765 Taka-amylase A
1830 1976 Taka-amylase A
2041 2281 Taka-amylase A
2360 2644 Taka-amylase A
pre-msg 543 > 2789 Taa-G1 mRNA and introns
IVS 773 827 Taa-G1 intron A (no splice consensus)
IVS 869 954 Taa-G1 intron B
IVS 1071 1139 Taa-G1 intron C
IVS 1249 1316 Taa-G1 intron D
IVS 1546 1602 Taa-G1 intron E
IVS 1766 1829 Taa-G1 intron F
IVS 1977 2040 Taa-G1 intron G
IVS 2282 2359 Taa-G1 intron H
signal 2784 2789 poly-A signal
BASE COUNT 818 a 752 c 657 g 708 t
ORIGIN
1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact
61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca
121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca
181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag
241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat
301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat
361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt
421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc
481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg
541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag
601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct
661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat
721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt
781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg
841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg
901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac
961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc
1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat
1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga
1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt
1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct
1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct
1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact
1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct
1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga
1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc
1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg
1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt
1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca
1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt
1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca
1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca
1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa
1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag
2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg
2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa
2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc
2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa
2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga
2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg
2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga
2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt
2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg
2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag
2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag
2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga
2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca
2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc
2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg
//
LOCUS ASOTAAG2A1 197 bp ds-DNA PLN 30-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 5' end.
ACCESSION M33220
KEYWORDS Taka-amylase A.
SOURCE A.oryzae (strain JCM02239) DNA.
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 197)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 195 > 197 Taka-amylase A (Taa-G2) precursor
pre-msg 131 > 197 Taa-G2 mRNA and introns
signal 2 11 CAAT box
signal 95 100 TATA box
BASE COUNT 59 a 52 c 36 g 50 t
ORIGIN
1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa
61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg
121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc
181 cacagaaggc atttatg
//
LOCUS ASOTAAG2A2 198 bp ds-DNA PLN 30-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 3' end.
ACCESSION M33222
KEYWORDS Taka-amylase A.
SOURCE A.oryzae (strain JCM02239) DNA.
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 198)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 9 Taka-amylase A (Taa-G2) (AA at 1)
pre-msg < 1 151 Taa-G2 mRNA and introns (alt.)
pre-msg < 1 156 Taa-G2 mRNA and introns (alt.)
BASE COUNT 45 a 48 c 53 g 52 t
ORIGIN About 2.1 kb after segment 1.
1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc
181 gtcctccaga ggcgagga
//
LOCUS ASOTTAM1 191 bp ss-mRNA PLN 30-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 3' end.
ACCESSION M33219
KEYWORDS Taka-amylase A.
SEGMENT 1 of 2
SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 191)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 189 > 191 Taka-amylase A (Taa)
mRNA 125 > 191 Taa mRNA
BASE COUNT 56 a 52 c 32 g 51 t
ORIGIN
1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca
61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt
121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga
181 aggcatttat g
//
LOCUS ASOTTAM2 156 bp ss-mRNA PLN 30-MAY-1990
DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 5' end.
ACCESSION M33221
KEYWORDS Taka-amylase A.
SEGMENT 2 of 2
SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 156)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 9 Taka-amylase A (Taa) (AA at 1)
mRNA < 1 151 Taa mRNA (alt.)
mRNA < 1 156 Taa mRNA (alt.)
BASE COUNT 40 a 35 c 42 g 39 t
ORIGIN About 1.5 kb after segment 1.
1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
121 ggcaccgggc cccgactgag agcccgtctc gaatcc
//
LOCUS CIPNADGAPD 1354 bp ss-mRNA PLN 30-MAY-1990
DEFINITION M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH)
mRNA, complete cds.
ACCESSION J05223
KEYWORDS D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase;
glyceraldehyde-3-phosphate dehydrogenase.
SOURCE M.crystallinum, cDNA to mRNA.
ORGANISM Mesembryanthemum crystallinum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Aizoaceae.
REFERENCE 1 (bases 1 to 1354)
AUTHORS Ostrem,J.A., Vernon,D.M. and Bohnert,H.J.
TITLE Increased expression of a gene coding for NAD:glyceraldehyde-3-
phosphate dehydrogenase during the transition from C-3
photosynthesis to crassulacean acid metabolism in Mesembryanthemum
crystallinum
JOURNAL J. Biol. Chem. 265, 3497-3502 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 55 1068 glyceraldehyde-3-phosphate dehydrogenase (EC
1.2.1.12)
BASE COUNT 315 a 308 c 345 g 386 t
ORIGIN
1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct
61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc
121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac
181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag
241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg
301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga
361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt
421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac
481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg
541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc
601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg
661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt
721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg
781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct
841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat
901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga
961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac
1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga
1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc
1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt
1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta
1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc
1321 agcatttatg aaacattgct gtgagctttt gagt
//
LOCUS ECOPRIA 2658 bp ds-DNA BCT 30-MAY-1990
DEFINITION E.coli primosomal protein n' (priA) gene, complete cds, and cytR
gene, 5' end.
ACCESSION M33293
KEYWORDS cytR gene; priA gene; primosomal protein.
SOURCE E.coli (strain W3110) DNA, clone pEL042.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2658)
AUTHORS Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A.
TITLE The priA gene encoding the primosomal, replicative n' protein of
E.coli
JOURNAL Unpublished (1990) Stanford U., Beckman Cntr., Stanford, CA 94305
STANDARD full automatic
COMMENT Authorin sequence for [1] kindly submitted by G.C.Allen Jr.,
26-MAR-1990.
FEATURES from to/span description
pept 64 2262 primosomal protein n'
pept 2418 > 2658 cytR gene product
signal 27 32 -10 region
signal 5 10 -35 region
binding 53 59 ribosome binding site
BASE COUNT 578 a 738 c 756 g 586 t
ORIGIN
1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat
61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg
121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag
181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag
241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga
301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc
361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt
421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa
481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa
541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa
601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg
661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc
721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa
781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg
841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc
901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt
961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt
1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac
1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc
1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt
1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg
1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac
1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt
1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac
1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc
1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg
1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg
1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca
1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg
1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc
1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac
1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa
1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg
1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg
2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg
2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg
2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt
2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg
2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg
2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg
2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg
2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata
2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac
2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc
2641 cggatatctg cgatcccc
//
LOCUS EWCTELRNA 657 bp ds-DNA INV 30-MAY-1990
DEFINITION E.crassus telomerase RNA component gene, complete cds.
ACCESSION M33461
KEYWORDS telomerase RNA.
SOURCE E.crassus DNA.
ORGANISM Euplotes crassus
Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora;
Spirotricha; Heterotrichida; Clevelandellina.
REFERENCE 1 (bases 1 to 657)
AUTHORS Shippen-Lentz,D. and Blackburn,E.H.
TITLE Functional evidence for an RNA template in telomerase
JOURNAL Science 247, 546-552 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
RNA 152 342 telomerase RNA component
site 186 197 functional telomeric template
BASE COUNT 202 a 122 c 111 g 222 t
ORIGIN
1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga
61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt
121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg
181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg
241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt
301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg
361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt
421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat
481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga
541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac
601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt
//
LOCUS HS4DWXJ 160 bp ds-DNA VRL 30-MAY-1990
DEFINITION Epstein-Barr virus defective WZhet junction.
ACCESSION M33474
KEYWORDS .
SOURCE Epstein-Barr virus (strain HR-1, clinical sample 9) DNA.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 160)
AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy
leukoplakia
JOURNAL J. Virol. 64, 397-400 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 87 90 WZhet junction
BASE COUNT 41 a 45 c 41 g 33 t
ORIGIN
1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc
61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact
121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac
//
LOCUS HS4WXJ 181 bp ds-DNA VRL 30-MAY-1990
DEFINITION Epstein-Barr virus WZhet junction, HR-1 clone 5.
ACCESSION M33473
KEYWORDS .
SOURCE Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 181)
AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy
leukoplakia
JOURNAL J. Virol. 64, 397-400 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 108 111 WZhet junction
BASE COUNT 50 a 42 c 46 g 43 t
ORIGIN
1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt
61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca
121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga
181 c
//
LOCUS HUMREGA 4251 bp ds-DNA PRI 30-MAY-1990
DEFINITION Human regenerating protein (reg) gene, complete cds.
ACCESSION J05412
KEYWORDS pancreatic stone protein; pancreatic thread protein;
regenerating protein.
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4251)
AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
TITLE Complete nucleotide sequence of the human reg gene and its
expression in normal and tumoral tissues: The reg protein,
pancreatic stone protein, and pancreatic thread protein are one and
the same product of the gene
JOURNAL J. Biol. Chem. 265, 7432-7439 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
H.Okamoto, 23-FEB-1990.
FEATURES from to/span description
pept 1571 1634 regenerating protein (reg), exon 2 (first
expressed exon)
2270 2388 regenerating protein, exon 3
2696 2833 regenerating protein, exon 4
3549 3660 regenerating protein, exon 5
3856 3923 regenerating protein, exon 6
pre-msg 1196 4116 reg mRNA and introns
IVS 1224 1524 reg intron A
IVS 1635 2269 reg intron B
IVS 2389 2695 reg intron C
IVS 2834 3548 reg intron D
IVS 3661 3855 reg intron E
signal 1169 1174 TATA box
BASE COUNT 1161 a 927 c 869 g 1294 t
ORIGIN
1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc
61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat
121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt
181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc
241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt
301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc
361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc
421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat
481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac
541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg
601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg
661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata
721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag
781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc
841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga
901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt
961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag
1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc
1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa
1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa
1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat
1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta
1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc
1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag
1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac
1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt
1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct
1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact
1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt
1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat
1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg
1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt
1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa
1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac
2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga
2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat
2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa
2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc
2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc
2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag
2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca
2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc
2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac
2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca
2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta
2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt
2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca
2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta
2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag
2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata
3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt
3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca
3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct
3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa
3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga
3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt
3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa
3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc
3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc
3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg
3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag
3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg
3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt
3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt
3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct
3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc
3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca
4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt
4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc
4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct
4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c
//
LOCUS HUMREGRELA 1524 bp ds-DNA PRI 30-MAY-1990
DEFINITION Human reg-related sequence, complete cds.
ACCESSION J05413
KEYWORDS pancreatic stone protein; pancreatic thread protein;
regenerating protein.
SOURCE Human esophageal mucosa DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1524)
AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
TITLE Complete nucleotide sequence of the human reg gene and its
expression in normal and tumoral tissues: The reg protein,
pancreatic stone protein, and pancreatic thread protein are one and
the same product of the gene
JOURNAL J. Biol. Chem. 265, 7432-7439 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
H.Okamoto, 23-FEB-1990.
BASE COUNT 382 a 368 c 345 g 429 t
ORIGIN
1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt
61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt
121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg
181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat
241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg
301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga
361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata
421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat
481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag
541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa
601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc
661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac
721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta
781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca
841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt
901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat
961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct
1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc
1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga
1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc
1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct
1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct
1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc
1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg
1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc
1501 tataaatcat ataatgtaaa ggat
//
LOCUS MDPCGA 4801 bp ss-RNA VRL 30-MAY-1990
DEFINITION Aleutian mink disease parvovirus complete genome.
ACCESSION M20036
KEYWORDS complete genome.
SOURCE Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4.
ORGANISM Aleutian mink disease parvovirus
Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE 1 (bases 1 to 4592)
AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
Wolfinbarger,J.B.
TITLE Nucleotide sequence and genomic organization of Aleutian mink
disease parvovirus (ADV): Sequence comparisons between a
nonpathogenic and pathogenic strain of ADV
JOURNAL J. Virol. 62, 2903-2915 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 4593 to 4801)
AUTHORS Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W.,
Perryman,S. and Wolfinbarger,J.B.
TITLE Nucleotide sequence of the 5' terminal palindrome of Aleutian mink
disease parvovirus (ADV) and construction of an infectious
molecular clone
JOURNAL J. Virol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by M.E.Bloom, 18-MAR-1990.
Sequence reported below is (+) strand.
FEATURES from to/span description
ORF 116 1978 Left ORF
ORF 1983 2207 Middle ORF2
ORF 1993 2211 Middle ORF1
ORF 2241 4399 Right ORF
BASE COUNT 1740 a 912 c 943 g 1206 t
ORIGIN
1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt
61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt
121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag
181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact
241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt
301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact
361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc
421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac
481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt
541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta
601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag
661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa
721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc
781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga
841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta
901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt
961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga
1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg
1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa
1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt
1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga
1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt
1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga
1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa
1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg
1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta
1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat
1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac
1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg
1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa
1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa
1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca
1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt
1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca
2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa
2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg
2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata
2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta
2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc
2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag
2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg
2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg
2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt
2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact
2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag
2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata
2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta
2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa
2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt
2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg
3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc
3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct
3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta
3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt
3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg
3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa
3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg
3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt
3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac
3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga
3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat
3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac
3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt
3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc
3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg
3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg
3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac
4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg
4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa
4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag
4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag
4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg
4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact
4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc
4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta
4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac
4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt
4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt
4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct
4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg
4801 a
//
LOCUS MDPUPS 3454 bp ss-RNA VRL 30-MAY-1990
DEFINITION Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial
sequence.
ACCESSION M32981
KEYWORDS .
SOURCE Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA.
ORGANISM Aleutian mink disease parvovirus
Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE 1 (sites)
AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
Wolfinbarger,J.B.
TITLE Nucleotide sequence and genomic organization of Aleutian mink
disease parvovirus (ADV): Sequence comparisons between a
nonpathogenic and pathogenic strain of ADV
JOURNAL J. Virol. 62, 2903-2915 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3454, for [1])
AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
Wolfinbarger,J.B.
JOURNAL Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by M.E.Bloom 18-MAR-1990.
Sequence reported below is (+) strand.
BASE COUNT 1268 a 673 c 698 g 815 t
ORIGIN
1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc
61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac
121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac
181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta
241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa
301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga
361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat
421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt
481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa
541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta
601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat
661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag
721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga
781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat
841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc
901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact
961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt
1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac
1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac
1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa
1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta
1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag
1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac
1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg
1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa
1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac
1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca
1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga
1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc
1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg
1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta
1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt
1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga
1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag
2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag
2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac
2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt
2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga
2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc
2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg
2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat
2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta
2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg
2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa
2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg
2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta
2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc
2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa
2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt
2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca
3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg
3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct
3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga
3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt
3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc
3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt
3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa
3421 aacctatggt tactttactt ggagaggcaa gctt
//
LOCUS MUSAA2DEL 300 bp ds-DNA ROD 30-MAY-1990
DEFINITION Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion
fragment.
ACCESSION M33468
KEYWORDS deletion mutant.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
TITLE Molecular analysis of two mouse dilute locus deletion mutations:
Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
lethal Aa2 alleles
JOURNAL Mol. Cell. Biol. 10, 501-509 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 130 133 deletion breakpoint
BASE COUNT 82 a 45 c 76 g 97 t
ORIGIN
1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact
61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt
121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt
181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata
241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa
//
LOCUS MUSSL20JA 300 bp ds-DNA ROD 30-MAY-1990
DEFINITION Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion
fragment.
ACCESSION M33467
KEYWORDS deletion mutant.
SOURCE Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
TITLE Molecular analysis of two mouse dilute locus deletion mutations:
Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
lethal Aa2 alleles
JOURNAL Mol. Cell. Biol. 10, 501-509 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept.ps / 42 122 dilute gene, exon 3
179 > 300 dilute gene, exon 5
recomb 129 130 deletion breakpoint
BASE COUNT 86 a 51 c 61 g 102 t
ORIGIN
1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac
61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac
121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta
181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta
241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc
//
LOCUS PRVVP4 2359 bp ds-RNA VRL 30-MAY-1990
DEFINITION Porcine rotavirus capsid protein VP4 gene, complete cds.
ACCESSION M33516
KEYWORDS capsid protein VP4.
SOURCE Porcine rotavirus (Gottfried strain; serotype 4) DNA.
ORGANISM Porcine rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 2359)
AUTHORS Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K.
TITLE Similarity of the outer capsid protein VP4 of the Gottfried strain
of porcine rotavirus to that of asymptomatic human rotavirus
strains
JOURNAL J. Virol. 64, 414-418 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 10 2337 capsid protein VP4
BASE COUNT 812 a 406 c 442 g 699 t
ORIGIN
1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa
61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg
121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat
181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca
241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac
301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga
361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg
421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta
481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat
541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa
601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag
661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta
721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag
781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt
841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca
901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt
961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc
1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac
1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg
1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt
1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc
1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca
1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga
1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac
1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg
1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata
1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt
1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct
1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc
1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata
1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga
1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct
1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat
1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat
2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc
2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata
2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc
2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa
2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag
2341 ctctatagag gatgtgacc
//
LOCUS RATSTAA 1000 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds.
ACCESSION M33329
KEYWORDS hydroxysteroid sulfotransferase a.
SOURCE Rat (strain Sprague-Dawley) female liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1000)
AUTHORS Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K.,
Yamazoe,Y. and Kato,R.
TITLE cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a
strong homology in amino acid sequence with the senescence marker
protein SMP-2 in rat livers
JOURNAL Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 38 892 hydroxysteroid sulfotransferase a (STa)
mRNA < 1 1000 STa mRNA
signal 977 982 poly-A signal
BASE COUNT 299 a 187 c 211 g 303 t
ORIGIN
1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga
61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa
121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg
181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt
241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa
301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct
361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc
421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta
481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc
541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa
601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga
661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc
721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg
781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt
841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa
901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa
961 gtgaatgtgg tcattgaata aattctattc tggcattgtg
//
LOCUS SMFPOLENV 3534 bp ss-RNA VRL 30-MAY-1990
DEFINITION Simian foamy virus type 1 polymerase (pol) gene, 3' end; and
envelope (env) gene, complete cds.
ACCESSION M33561
KEYWORDS envelope protein; polymerase.
SOURCE Simian foamy virus type 1, cDNA to viral RNA.
ORGANISM Simian foamy virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Spumavirinae.
REFERENCE 1 (bases 1 to 3534)
AUTHORS Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A.
TITLE Relationship of the env genes and the endonuclease domain of the
pol genes of simian foamy virus type 1 and human foamy virus
JOURNAL J. Virol. 64, 406-410 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 582 polymerase (AA at 1)
pept 524 3481 envelope protein
BASE COUNT 1162 a 645 c 670 g 1046 t 11 others
ORIGIN
1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac
61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta
121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat
181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta
241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc
301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg
361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc
421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt
481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg
541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac
601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag
661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg
721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat
781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt
841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt
901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt
961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt
1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt
1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt
1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca
1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac
1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca
1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa
1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa
1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga
1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg
1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc
1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt
1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca
1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa
1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg
1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt
1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa
1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc
2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac
2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa
2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag
2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg
2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt
2341 atatttactn nnnnnnnnnn tggtaaccct gatggaagct gcccttcatg atgtttcgat
2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat
2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt
2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta
2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga
2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt
2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa
2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt
2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg
2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg
2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt
3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc
3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt
3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa
3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg
3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc
3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg
3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact
3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg
3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat
//
LOCUS IPBFATPD 1035 bp ss-mRNA PLN 30-MAY-1990
DEFINITION Sweet potato F-1-ATPase (mitochondrial type) delta subunit mRNA,
complete cds.
ACCESSION J05397
KEYWORDS F-1-ATPase delta subunit.
SOURCE I.batatas (strain Kokei No.4) tuberous root, cDNA to mRNA.
ORGANISM Ipomoea batatas
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Convolvulaceae.
REFERENCE 1 (bases 1 to 1035)
AUTHORS Kimura,T., Takeda,S., Asahi,T. and Nakamura,K.
TITLE Primary structure of a precursor for the delta-subunit of sweet
potato mitochondrial F-1-ATPase deduced from full-length cDNA
JOURNAL J. Biol. Chem. 265, 6079-6085 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Nakamura, 23-FEB-1990.
FEATURES from to/span description
pept 115 849 F-1-ATPase delta subunit precursor (EC
3.6.1.3)
sigp 115 249 F-1-ATPase delta subunit signal peptide
matp 250 846 F-1-ATPase delta subunit
signal 964 969 poly-A signal
BASE COUNT 283 a 225 c 238 g 289 t
ORIGIN
1 catagggtaa actgcagcgt tctttgggct tcgttctcaa acccttccct tctgtctccg
61 aattctctcg cgccggcgaa cgattcagct tagagcaaag aagagctctc tgatatggca
121 atgacaggac gtgctagatc gatgggcttc tctattctcc agaaggccct ctcttcagcc
181 cagagatcta acgctcatcg atccattctc tgccctaccc tctcaaactc tgagctgctg
241 aggaactatg caactgcctc tgcttcaaaa gaacaaaaaa ttaaggtgcc tctgacaatg
301 tatggggttt ctggaaacta tgcatctgct ttgtaccttg cagcagtaaa gtctaacaca
361 ttagagaagg ttgagtcaga gctctatgac cttgttgagg cttcaaagaa gagtcctaca
421 ttttctcagt tcatgaggga tccatctgta cctgttgata ctagagtgaa tgccataaag
481 gaaatttgtg ctcaagctaa atttggggat accacccaga acttcttgct tattttggct
541 gaaaatggga ggctgaaaca cattgatcgc attgtgaagc gattcaaaga gctgaccatg
601 gcacacaggg gagaagtcaa agccactgtg acaacagtaa ttccccttcc cgctgatgag
661 gagaaagaat tgaaagccac attgcaggaa atggttggac agggaaagag tgttcaaatt
721 gaacagaaga ttgatccaac cattcttggt ggccttgttg tggaattcgg gcagaaggtc
781 tttgacatgt ctataaggac acgggcacgc cagatggaga gattcttgcg tgaaccactc
841 aacttttgat tgcctccaat cccgttagaa ggcgtttgtt tccatcttga ggaaagcacg
901 atttcggtct gggctccttg agctgcctgt tttccgataa acattgttct gctctttttt
961 gcaaataaag aaaacttgct ttttgtaaga atttgatggg aaaaaaagaa aacctaatgt
1021 tgtttttact tgctg
//
LOCUS YSCSAC7P 1500 bp ds-DNA PLN 30-MAY-1990
DEFINITION S.cerevisiae SAC7 protein gene, complete cds.
ACCESSION M32335
KEYWORDS SAC7 protein.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 301 to 1500)
AUTHORS Dunn,T.M. and Shortle,D.
TITLE Null alleles of SAC7 suppress temperature-sensitive actin mutations
in Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 2308-2314 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 300)
AUTHORS Dunn,T.M.
JOURNAL Unpublished (1990) Univ. of Health Sciences, Bethesda MD 20814-4799
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1], [2] kindly
submitted by T.M.Dunn, 23-FEB-1990.
FEATURES from to/span description
pept 606 1430 SAC7 protein
BASE COUNT 526 a 313 c 288 g 373 t
ORIGIN Chromosome 4R, between ADE8 and TRP4.
1 ggatcaccaa cttctctttc aaggaatgct caaccaacta ctttaaaacg gcctccatta
61 tcttcaagac catattcata taatacccca accaaagata ggaaatcctt ttccaaatcc
121 gcgaaacaga acaataacaa taacaacgct aattccggaa catcgcctca cgcagagttc
181 aaaaattaca gagatatgtt tttatctaat agaaatggtt tcacaggtag ggttttcggt
241 gttactttag cagaatcgtt gagcgtcgcc agtgcagagg tcattgttca aagtgagttg
301 gtgagttttg gtcggatacc catcgtggta gccaagtgcg gcgcatactt aaaagcaaat
361 ggattggaga cctcgggtat atttcgtata gcgggcaatg gtaaaagagt aaaagccctt
421 caatacatat tctcgtcgcc acctgattat ggtaccaaat tcaacgattg gaaacatata
481 cagtgcacga tgttgcatcg ctcctgagga gataccttaa taatttggcc gaaccactaa
541 tacctttatc cctatatgaa caattcagaa acccgctacg atctagacca agaatcctaa
601 ggcatatgtt gacccacgaa gtttctcatc cgaatgcgaa taaaacaaat aatgtaacag
661 taaaatcaag tagacagaac tataatgatg atggtgctaa tgatggtgac atcgaaaagg
721 aggacgccaa agatgatgaa gaaaaaagac gaagaaaaat acgtcataag agaaggctga
781 cccgcgatat aagggcagct atcaaggaat atgaagagct ctttgttacc ttatcaaatg
841 acacgaaaca gctaactata tacctactcg atttactgag tctttttgca aggcaatcac
901 agtttaacct aatgtctggt agaaacttgg cggccatctt ccaaccttca atattatcgc
961 atcctcaaca tgatatggac cccaaagaat acgagttatc ccgactagta gtggaatttt
1021 tgattgaata ctcgtacaag ctattacccc atcttttgaa gttggctaaa agggaacaac
1081 aggaacggtt gtcaaccgaa aataagaaaa ataatggaga taaacagaaa actgatccta
1141 tagaaatacc aaagatcacc tcatcggatt cgccaccaat agtttcttcc aataaaaacc
1201 caccggcgat tgacaataac aataaactag accatacgac gttatcgcca atatctactt
1261 cgataccgga gaactcatca gacctacaga cttcaaaaat gttaaagcct ccgaagcaaa
1321 gaagaccaca ctcaaaatcg tttggttcta ctccggttct ccggatgtta ttgctagtaa
1381 taaaagaaga acgagtttat tcccgtggtt acataaaccg ggaattttga gtgacacagg
1441 agacaatggc gacttaactg ctactgaagc tgaaggtgat gattatgaag aagaaaatgt
//
LOCUS BOVINSMN 960 bp ss-mRNA MAM 30-MAY-1990
DEFINITION Bovine inositol monophosphatase mRNA, complete cds.
ACCESSION J05394
KEYWORDS inositol 1-phosphatase; inositol monophosphatase.
SOURCE Bovine calf brain, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 960)
AUTHORS Diehl,R.E., Whiting,P., Potter,J., Gee,N., Ragan,C.I.,
Linemeyer,D., Schoepfer,R., Bennett,C. and Dixon,R.A.F.
TITLE Cloning and expression of bovine brain inositol monophosphatase
JOURNAL J. Biol. Chem. 265, 5946-5949 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
P.Whiting, 07-FEB-1990.
FEATURES from to/span description
pept 61 894 inositol monophosphatase
BASE COUNT 266 a 197 c 248 g 249 t
ORIGIN
1 cggaacggcc gcgcgtctga caggctggtg cgcccgcggc cgcgcgcccg ccgccctgag
61 atggctgatc cttggcagga atgcatggat tatgcagtaa ccctggcagg acaagctgga
121 gaggtggttc gtgaagcgct caagaatgaa atgaatatta tggttaaaag ttctccggct
181 gatttggtaa ctgctactga ccaaaaagtt gaaaaaatgc ttatcacatc cataaaggaa
241 aagtatccat ctcacagttt cattggtgag gaatctgtgg cagctgggga aaaaagtatc
301 ttaaccgaca accctacatg gatcattgac cctattgatg gaacaactaa ctttgtacat
361 ggatttcctt ttgtagctgt ttcaattggc tttgtggtaa ataaaaagat ggagtttgga
421 attgtgtaca gttgcctgga ggataagatg tacactggcc ggaaaggaaa aggtgccttt
481 tgtaatggtc aaaaactaca ggtttcacac caagaagata ttaccaaatc tctcttggtg
541 acagagttgg gctcttccag aacaccagag actgtgagaa ttattctttc taatatagaa
601 aggcttcttt gccttcccat ccatgggatc cggggtgttg gaacagcagc tctgaacatg
661 tgcctggtgg ccgctggggc tgcggacgcg tattatgaaa tggggatcca ctgctgggat
721 gttgcaggcg ccggcatcat tgtgactgaa gctggcggag tgctcctgga tgtcacaggt
781 ggaccatttg atttgatgtc acgaagagta attgcttcaa gcaataaaac attagcagaa
841 aggatagcca aagaaattca gataatacct cttcaaagag atgatgaaga ttaactccga
901 cggcttcaca cattcaatca cagttgcgtc tctccaggtt ttctaatgca ctcatggtgt
//
LOCUS FVBPNGF 1350 bp ds-DNA BCT 30-MAY-1990
DEFINITION F.meningosepticum endoglycosidase F (PNGase F) gene, complete cds.
ACCESSION J05411
KEYWORDS N-glycanase; endoglycosidase F;
peptide-N-4-(N-acetyl-beta-D-glucosaminyl) asparagine amidase F.
SOURCE F.meningosepticum (ATCC 33958) DNA, clone pGB29, subclone J.
ORGANISM Flavobacterium meningosepticum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1350)
AUTHORS Barsomian,G.D., Johnson,T.L., Borowski,M., Denman,J.,
Ollington,J.F., Hirani,S., McNeilly,D.S. and Rasmussen,J.R.
TITLE Cloning and expression of peptide-N-4-(N-acetyl-beta-D-
glucosaminyl) asparagine amidase F in Escherichia coli
JOURNAL J. Biol. Chem. 265, 6967-6972 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.Barsomian, 29-MAR-1990, for release after publication.
FEATURES from to/span description
pept 139 1203 endoglycosidase F precursor (EC 3.5.1.52)
matp 259 1200 endoglycosidase F
BASE COUNT 470 a 219 c 276 g 385 t
ORIGIN
1 ggatctgtac tgagtattgt aaaggtttta taatttgatt aaaaatgaat ctttgttaat
61 aaatcagggt ttaatttgac ttttaaaacg atttcttaaa taaatacaat atattcgtat
121 aacaaatcgc aataaattat gagaaaacta ctaattttta gtatatctgc ttacttgatg
181 gcaggtatcg tttcgtgtaa aggtgtagac agtgcaacac ctgtaacaga agatcgctta
241 gctctgaatg cggtaaatgc tccggcagat aataccgtaa atattaaaac attcgacaaa
301 gtaaaaaatg cctttggtga cggattgtcc caaagtgcag aaggaacctt tacatttccg
361 gccgatgtaa caactgtaaa aacgattaag atgttcatca aaaatgaatg tcctaataaa
421 acctgtgatg aatgggatcg ttatgccaat gtttatgtaa aaaataaaac aacaggagaa
481 tggtatgaaa taggacgctt tattactcca tattgggtgg gtacggaaaa attacctcgt
541 ggactggaaa ttgatgttac cgatttcaaa tctttactgt ccggaaatac agaacttaaa
601 atttatacgg agacttgctt ggccaaagga agagaataca gtgtagactt tgatattgta
661 tatggtacac cggattataa atattcggca gtagtacctg taatccaata taacaaatca
721 tccattgatg gtgttcctta tggtaaagca catacactgg gattaaaaaa gaatattcag
781 ttaccaacaa acacggaaaa agcttatctt agaactacta tttccggatg gggacatgcc
841 aagccatatg atgcgggaag caggggctgt gcagaatggt gcttcagaac acatactata
901 gcaataaata atgcgaatac tttccaacac cagctgggtg ctttaggatg ttcagcaaac
961 cctattaata atcagagtcc gggaaattgg gctcctgaca gagcagggtg gtgtccggga
1021 atggcagtgc caacacgtat agatgtgttg aataactctt taacgggtag tacttttagt
1081 tatgaatata agttccagag ttggacaaac aacggaacca atggagatgc tttttatgca
1141 atttccagtt ttgtgattgc aaaaagtaat acacctatta gtgctccggt agttacaaac
1201 taaaaaacag catatattca tttatagata aaaccgcctt ggcaaggaat caattacttt
1261 tcgggtattt gcaaaaacac cccgcagatg aaaatctgcg gggtgtatga aaaaggtaag
1321 aagttagaga tatctttcct cagccgaaat
//
LOCUS POTPPIIIK 945 bp ds-DNA PLN 30-MAY-1990
DEFINITION Potato proteinase inhibitor IIK gene , 5' flank.
ACCESSION M29965
KEYWORDS proteinase inhibitor IIK.
SOURCE Potato (strain Russet Burbank) DNA.
ORGANISM Solanum tuberosum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 945)
AUTHORS Palm,C.J., Costa,M.A., An,g. and Ryan,C.A.
TITLE Wound-inducible nuclear protein binds DNA fragments that regulate a
proteinase inhibitor II gene from potato
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 603-607 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.An, 18-NOV-1989.
FEATURES from to/span description
mRNA 893 > 945 proteinase inhibitor IIK
BASE COUNT 374 a 129 c 138 g 304 t
ORIGIN
1 tcgacccaat tcaaagaact tgttcattaa ttattatttt tccagcaaaa tgaaaatttt
61 tagttaggat gataaaattt atccaaattt ctaccaccac ccaaaactaa atcttccaat
121 tcagaaaaaa tcattcatta caactcttgg atttgtttgt atgatgtgtt agaagcagaa
181 atgggtttaa aaggtgaatt aaatggtatt tttaactaaa gaaatgaatg cgggatatat
241 ttaagttttt tcagataatt catgaatatt tttatcctct ttccgtgtgc aataatagag
301 ttccaactta attatcacgt ggacttataa gaaaccgatg ctggatgata attatttaaa
361 aaaacaagca agtcggggtc agtacagctt gctgaatagg agaaattaaa gatagcaacc
421 aggaaaagtt aagagctttt ggctcctccg tccaattata accgtccact atatatattt
481 gagactcgta ttgagaaaca ataaataata agaatgatat tactatatta ccttttgaat
541 atattaaatt taatgctttg aaaaatatat ttgataatga ataatatcta atagcagcaa
601 gaataaaata gacacaaaca agtaaaatta ctcattgatc tttcaaatta gacaagtatt
661 atcggacatc tacttttagt atagtaaaca agtaaagatc ggataaagag agtaataaag
721 aagaagcaag cgtaagtacc ttgccaaaat aattaactaa caagcacatc tttttttttt
781 tatcaaatat taataaaaat aatttatatt aatatgaaga aaaaaaaggt tttagtttgc
841 tatctttttg atcactcgtt tgctataaat aggtggagga ggacagacac tcttcacccc
901 aaaattaaaa gaaaaagagg cagtactaat taattatcca tcatg
//
LOCUS CADPOLYP 1631 bp ss-RNA VRL 30-MAY-1990
DEFINITION Canine distemper virus polymerase-associated (P) protein and C
protein mRNA, complete cds.
ACCESSION M32418
KEYWORDS C protein; P protein; polymerase-associated protein.
SOURCE Canine distemper virus, cDNA to viral RNA, clone pCDV 30.
ORGANISM Canine distemper virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Paramyxoviridae; Morbillivirus.
REFERENCE 1 (bases 1 to 1631)
AUTHORS Barrett,T., Shrimpton,S.B. and Russell,S.E.H.
TITLE Nucleotide sequence of the entire protein coding region of Canine
distemper virus polymerase-associated (P) protein mRNA
JOURNAL Virus Res. 3, 367-372 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept 41 1564 polymerase-associated (P) protein
pept 63 587 C protein
BASE COUNT 526 a 367 c 396 g 342 t
ORIGIN
1 tacccgatca tcattcaccg accacccgtt ctatccctaa atggcagagg aacaggccta
61 ccatgtcagc aaagggctgg aatgcctcaa agccctcaga gagaatcctc ctgacattga
121 ggagattcaa gaggtcagca gcctcagaga ccaaacctgc aacccaggcc aagagaatgg
181 aaccacaggc atgcaggaag aggaggactc tcagaatctc gatgaatcac acgagccaac
241 aaaaggatca aactatgtcg gccatgtacc tcaaaataat ccgggatgtg gagaacgcaa
301 tactgcgctt gtggaggcgg agcggccccc tagagaggac atccaaccag gacctggaat
361 acgatgtgat catgtttatg atcacagcgg tgaagaggtt aagggaatcg aagatgctga
421 cagtctcgtg gtacctgcag gcactgtcgg taatcgagga ttcgagagag gagaaggaag
481 ccttgatgat agcactgagg attctggcga agattattcc gaaggaaatg cttcatctaa
541 ctggggatat tctttcggcc ttaaaccgga cagagcagct gatgtgagca tgctgatgga
601 agaggaatta agtgctctac tcaggacaag cagaaatgta gggattcaga aaagggatgg
661 gaagactctg cagttcccac ataatcccga agtaagacaa gggatccgga gtgtggatcc
721 attaaaaagg ggcacagaag agaggtcagt ctcacatgga atggggatag ttgctggatc
781 gacaagtggt gcaacccaat ctgcactcaa gtcaactggg ggatcatcag agccaagtgt
841 ttctgcgggg aatgtccgcc aacctgcaat gaatgcaaag atgacccaga aatgcaaact
901 cgagtctggc acgcaactcc ctcccaggac ctcaaatgag gctgagtctg acagtgagta
961 cgatgatgag cttttctctg agatacaaga aattcgatct gccattacta aactaactga
1021 agataatcaa gcaatactta ctaaactgga taccttatta ctgcttaaag gagagactga
1081 ttcaattaag aaacaaatca gcaaacaaaa tattgctatt tccacgattg aggggcatct
1141 atcaagcatt atgatagcta tacctggttt tggaaaggac acgggagatc ctacggcaaa
1201 tgtcgacatt aacccagagc tccgccctat catagggaga gattcaggaa gaccactagc
1261 agaagttctc aagcagcccg catcatcccg cggtaatcgg aaggacagtg gtattactct
1321 gggctcaaaa ggtcaactat tgagagacct ccagctgaaa cccattgaca aagagtctag
1381 ctcggcaatc ggatacaaac cgaaggatac cgcaccttcc aaagctgtac ttgcatcatt
1441 gattagatca agcagagttg atcaaagtca caaacataac atgctggctc tgctcaaaaa
1501 tatcaagggg gatgacaacc taaacgagtt ctaccaaatg gtcaaaagta ttactcatgc
1561 ttaatctgta gcgttgacta atctactaac cggcgcaaaa ctgctttcac tatcgcttaa
1621 aagcaattat a
//
LOCUS ROHSEG4A 2359 bp ds-RNA VRL 30-MAY-1990
DEFINITION Human rotavirus segment 4 gene, complete cds.
ACCESSION M32559
KEYWORDS .
SOURCE Human rotavirus (strain RV-5, serotype 2) RNA.
ORGANISM Human rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 2359)
AUTHORS Kantharidis,P., Dyall-Smith,M.L. and Holmes,I.H.
TITLE Marked sequence variation between segment 4 genes of human RV-5 and
simian SA 11 rotaviruses
JOURNAL Arch. Virol. 93, 111-121 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 10 2337 segment 4 protein
BASE COUNT 871 a 343 c 413 g 732 t
ORIGIN
1 ggctataaaa tggcttcgct catttataga caacttctca ctaattcata ttcggtagac
61 ttgcatgatg aaatagaaca gattggatcg gagaaaactc aaagtgtaac agtaaatcca
121 ggtccatttg cacaaaccag atacgctcca gttaattggg gacatgggga gattaatgat
181 tcaactacag tggaaccagt tttagatggt ccttatcaac ccactacatt caaaccaccc
241 aatgattatt ggttccttat tagttcaaat acagatggag tagtctacga aagtacaaat
301 aataatgatt tttggacagc agttatcgca gttgaaccac atgttagtca aacaaatagg
361 caatatattt tatttggtga aaataaacag tttaacgtag aaaacaattc agataaatgg
421 aaatttttcg aaatgtttaa aggtagtagt cagggtgatt tttctaatag acggactcta
481 acctctaaca atagacttgt agggatgcta aaatatggtg gaagagtatg gacatttcat
541 ggtgaaacac caagagctac tactgatagt tcaaatactg cggatttaaa taatatatca
601 attatgattc attcagagtt ttatattatt ccaagatccc aagaatctaa atgtaacgag
661 tatatcaaga atggtttacc accaattcag aatactagaa acgtagttcc attatcttta
721 tcatccagat ctattcagta taggagagca caagttaatg aagatattac aatttcaaaa
781 acttcattat ggaaggaaat gcaatataat agagatatta taataagatt taaatttggt
841 aatagtgtta taaaactagg aggattggga tataaatggt ctgaaatatc atataaagca
901 gcgaattatc agtatagtta ttcacgtgat ggtgaacaag ttactgcaca taccacttgt
961 tcagtaaatg gagtaaataa ttttagctat aatggaggtt cactacctac tgatttcagt
1021 atttcgagat atgaagttag taaagaaaat tcttatgtgt atatagacta ctgggatgat
1081 tcaaaagcat ttagaaatat ggtgtatgtt agatcgctag cagcaaattt aaattcagtg
1141 aaatgtacag gtgggagtta tgattttaga ttgcccgtag gtggatggcc tattatgaat
1201 ggtggtgctg tatcattaca ttttgctgga gttacattat ctacacagtt cactgatttt
1261 gtatcattaa attcactgcg atttagattc agtttaacag tagatgaacc atctttctca
1321 ataatacgaa cacgtacaat aaacttgtac ggattaccag cagctaatcc gaacaatgga
1381 aatgagtact atgaaatgtc aggaaggttt tcacttattt ctttagttct aaccaatgat
1441 gattatcaaa ctccaattat gaattcagta acagtaaggc aggatttaga acgccagctt
1501 aatgatttgc gagaagaatt taattcattg tcacaagaaa tagctatgtc acaattaatt
1561 gatttagcat tactaccttt agatatgttc tctatgtttt cgggaataaa aagtacaatt
1621 gatctaacca aatcaatggc aactagtgta atgaaaaaat ttagaaaatc gaaattagct
1681 acatcaattt cagaaatgac taattcattg tcagatgcgg cttcatcagc atcaagaagt
1741 gcttctatta gatcaaattt atctacgatt tcaaattgga ctaatacttc aaaaagtgtg
1801 tcaaatgtaa ctgactcagt aaatgatgtt tcaacacaaa catctacaat tagtaagaaa
1861 cttagattaa gagagatgat tactcaaact gaaggaatga gttttgatga tatttcagca
1921 gctgtactga aaacgaaaat agatatgtcc acacaaattg gaaaaaacac tttacctgat
1981 atagttactg aagcgtctga aaaatttatt ccaaaacgat catatcgagt attaaaagat
2041 aatgaagtaa tggaaattaa cactgaagga aagttttttg catataaagt ggatacactc
2101 aatgagatac catttgatat aaataaattc gccgaacttg tcacggattc tccagttata
2161 tcagcgataa tagactttaa gacgttaaaa aatttaaacg acaattatgg aattactcgt
2221 attgaagcgt ttaatttaat taaatcgaat ccaaatgtgc tacgtaattt cattaatcaa
2281 aataatccaa ttataagaaa tagaattgag cagttaattc tacaatgtaa gttgtgagaa
2341 tgtcatccag gatgtgacc
//
LOCUS MUSCAIA 1224 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse carbonic anhydrase I (CAI) mRNA, complete cds.
ACCESSION M32452
KEYWORDS carbonic anhydrase; carbonic anhydrase I; zinc metalloenzyme.
SOURCE Mouse anemic spleen, cDNA to mRNA, clone pMCAI.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1224)
AUTHORS Fraser,P.J. and Curtis,P.J.
TITLE Molecular evolution of the carbonic anhydrase genes: Calculation of
divergence time for mouse carbonic anhydrase I and II
JOURNAL J. Mol. Evol. 23, 294-299 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 46 831 carbonic anhydrase I
BASE COUNT 358 a 299 c 234 g 333 t
ORIGIN
1 gccagcatct tctctgaagc tgaatcacaa cctaaataag agaaaatggc aagtgcagac
61 tggggatatg gaagcgaaaa tggtcctgac caatggagca agctgtatcc cattgccaat
121 ggtaacaacc agtctcctat tgatattaaa accagtgaag ccaatcatga ctcctctctg
181 aaaccactca gcatctccta taatcctgca actgccaaag aaattgttaa cgtgggacat
241 tctttccatg taatttttga tgacagtagc aaccaatctg ttctgaaagg tggccctctt
301 gctgatagct atcggctcac tcagttccat tttcactggg gcaactcaaa cgaccatggc
361 tctgagcaca ccgtggatgg aactagatat tctggagagc ttcacttagt tcactggaat
421 tctgcaaagt actccagtgc ttctgaagcc atctccaagg ctgatggcct ggcaatcctt
481 ggcgttttga tgaaggttgg tccagccaac ccaagcctgc agaaagtact tgatgctcta
541 aactcagtta aaactaaggg aaaacgagcc ccattcacaa attttgaccc atccagtctg
601 cttccttcat ctctggatta ctggacctac tttggctctc tgactcaccc tcctcttcat
661 gaaagtgtga cctgggtgat ctgcaaggat agcatcagtc taagcccaga gcagctggcc
721 cagctccgtg gtcttctgtc aagtgcagag ggagagtctg cagttccagt tctgagcaac
781 caccgtccac cccaacccct gaagggcaga acagtcagag cctcattttg agtcccagca
841 aggaatgagt cctcaattat gacctggccc cctctacagc agcatccaat aaagggatca
901 cgattaagaa acagacttat gtcagtgcta gacagtatac ctgcaaattt aatccacaga
961 actaaaatgc tttcatttta attcctgata cttaatgcaa atactctata agtttgccta
1021 ctgtaatttt tatgcattcg tttttgctta aatacacctt tttgtggaag tcgttcataa
1081 ttacaactaa gtgattcatt ctctttctgc tctcttcact ttaaataaac caataaataa
1141 atcttttaaa aaaaaaaaaa aaaaaagctg agaaataagc gctgatagca gccttccagt
1201 tctgttcaag tatatggctg cctg
//
LOCUS RATGLOBBET 287 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Rat hemoglobin beta-chain mRNA, 3' end.
ACCESSION M32509
KEYWORDS hemoglobin; hemoglobin beta-chain.
SOURCE Rat reticulocyte, cDNA to mRNA, clone pBRrg X.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 287)
AUTHORS Crkvenjakov,R., Bucan,M., Konstantinovic,M., Fogel,M., Savic,A. and
Glisin,V.
TITLE Characterization of two rat globin cDNA clones
JOURNAL Hemoglobin 8, 597-611 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 158 hemoglobin beta-chain (AA at 3)
BASE COUNT 67 a 72 c 68 g 80 t
ORIGIN
1 agctgcatgt ggatcctgag aacttcaggc tcctgggcaa catgattgtg attatgatgg
61 gccaccacct gggcaaggaa ttcaccccga gtgcacaggc tgccttccag aaggtggtgg
121 ctggagtggc cactgccttg gctcacaagt accactaaac ctcttttcct gctctggtct
181 ttgtgcaatg gtcaattgtt ccctagagag catctgtcag ttgttgtcaa aatgacatcg
241 acctttgaaa atctgtccta ctaataaaag gcatttactt tcactgc
//
LOCUS RATHEMALPH 194 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Rat hemoglobin alpha chain mRNA, partial cds.
ACCESSION M32510
KEYWORDS hemoglobin; hemoglobin alpha-chain.
SOURCE Rat reticulocyte, cDNA to mRNA, clone pBRrg 5.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 194)
AUTHORS Crkvenjakov,R., Bucan,M., Konstantinovic,M., Fogel,M., Savic,A. and
Glisin,V.
TITLE Characterization of two rat globin cDNA clones
JOURNAL Hemoglobin 8, 597-611 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 194 hemoglobin alpha-chain (AA at 3)
BASE COUNT 35 a 73 c 39 g 47 t
ORIGIN
1 tcgacctgcc tggtgccctg tccactctga gcgacctgca tgcccacaaa cttgatcctg
61 tcaacttcaa gttcctgagc cactgcctgc tggtgacctt ggcttgccac caccctggag
121 atttcacacc cgccatgcac gcctctctgg acaaattcct tgcctccgtg agcaccgtgc
181 ttacctccaa gtac
//
LOCUS HUMIGKVAE 1241 bp ds-DNA PRI 30-MAY-1990
DEFINITION Human germline kappa-chain gene (A2) V-region, exons 1 and 2.
ACCESSION M31952
KEYWORDS germline; immunoglobulin light chain; kappa-immunoglobulin;
variable region; variable region subgroup VK-II.
SOURCE Human DNA, isolated from immune serum of Hib-PS vaccine recipients.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1241)
AUTHORS Scott,M.G., Crimmins,D.L., McCourt,D.W., Zocher,I., Thiebe,R.,
Zachau,H.G. and Nahm,M.H.
TITLE Clonal characterizaton of the human IgG antibody repertoire to
Haemophilus influenzae type B polysaccharide: III. A single
V-kappa-II gene and one of several JK genes are joined by an
invariant arginine to form the most common L chain V region
JOURNAL J. Immunol. 143, 4110-4116 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 325 373 Ig kappa-chain V-region, exon 1
742 / 1052 Ig kappa-chain V-region, exon 2
IVS 374 741 IgK intron A
iDNA 1053 > 1241 intervening DNA (5' end put.)
BASE COUNT 318 a 306 c 283 g 334 t
ORIGIN
1 ccgacaagaa tttggaagcc ctgacatcct ataaaacgtt acttgcccaa gattgaaact
61 ttcaaattca ggtccttctt cctcctctat gatgaattag gttttattag tttcctccaa
121 gggacacttt atatcacatt gctcacagag aagacatatc tacccccttc accccccacc
181 caatggctct ttccacacca ctgcacccac caggtgattt gcatattgtc ccctagggag
241 gaccttccct tgtgaggcct gagataaaag ctcagctcta accttgcctt gactgatcag
301 gacttctcag ttcatcttct caccatgagg ctccctgctc agctcctggg gctgctaatg
361 ctctggatac ctggtaagga tggaaggaga tgagggagga ggagggggtg ggaagctgag
421 ctctggcggc cccactgatt cccgtgttta ttctaaccat gtgttaaagg aatatggcct
481 atgctccagg gagaggaatt catattttgc cctgatgatg atttgaaaac tcctaaaagc
541 agtgctctga ataatatctt gagaaatgaa agaactcttg tgcctattta ataaagggtt
601 catttaaaga gtttgttttt atgatatgaa tacaaatttg taaaaataaa agattagcca
661 taaatcaata ccataaggca aatctcaaaa gttgttcatt atgctttcac ataaccttgc
721 acttctctct cataatttca ggatccagtg cagatattgt gatgacccag actccactct
781 ctctgtccgt cacccctgga cagccggcct ccatctcctg caagtctagt cagagcctcc
841 tgcatagtga tggaaagacc tatttgtatt ggtacctgca gaagccaggc cagcctccac
901 agctcctgat ctatgaagtt tccaaccggt tctctggagt gccagatagg ttcagtggca
961 gcgggtcagg gacagatttc acactgaaaa tcagccgggt ggaggctgag gatgttgggg
1021 tttattactg catgcaaagt atacagcttc ctcccacagt ggtacagacc aatacagaaa
1081 cctccctgct ggggtgtccc agctgctcac ttgcactgct tgtctgggga gtagctcagc
1141 agggactctg agtctgcaga agaggaggct gttggagaac tcagtggcag ggcttgctgt
1201 tgaggactct ggcccatgag gtctcagcag cacctcagtc c
//
LOCUS RATS14 350 bp ds-DNA ROD 30-MAY-1990
DEFINITION Rat S14 protein gene, 5' flank.
ACCESSION M33553
KEYWORDS S14 protein.
SOURCE Rat (strain Sprague-Dawley) male liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 350)
AUTHORS Jump,D.B., Bell,A. and Santiago,V.
TITLE Thyroid hormone and dietary carbohydrate interact to regulate rat
liver S14 gene transcription and chromatin structure
JOURNAL J. Biol. Chem. 265, 3474-3478 (1990)
STANDARD simple staff_review
BASE COUNT 71 a 77 c 96 g 106 t
ORIGIN
1 ctagacacta agtgacatat gttcatatag atagtgtttt taaactggaa agactgtccc
61 taccttggtt tgtagccacc tgggctgagg gccagctgag gttacttggg gcctggcagc
121 ctgggtacca cttgctcatc ttacccttgg atcacattca catgatgttc tggccaggcc
181 cttgacccca gttcctctgg ggccaattca atttgtccag cctagggcat tctgggtgag
241 gttattccaa aagatgtgtc agccaaccct ccgtttctcc acagggttgt agtcatggaa
301 agactagaga aagtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt
//
LOCUS MUSPTKA 211 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD15.
ACCESSION M33421
KEYWORDS protein tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 211)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 211 protein tyrosine kinase (AA at 3)
BASE COUNT 47 a 55 c 64 g 45 t
ORIGIN
1 ggatccacag ggacctggct gctcggaact gcctggtgac agagaagaat gtcctgaaga
61 tcagcgactt tgggatgtcc cgcgaagaag ctgatgggat ctatgccgcc tgcagcggcc
121 tcagacaagt ccctgttaag tggactgccc ctgaggccct taactacgga cgctactcct
181 cagagagtga tgtgtggagc tttggaattc c
//
LOCUS MUSPTKB 211 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD16.
ACCESSION M33422
KEYWORDS protein tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD16.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 211)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 211 protein tyrosine kinase (AA at 3)
BASE COUNT 50 a 50 c 59 g 52 t
ORIGIN
1 ggatccacag agaccttgct gctaggaact gcatggatgc cgaagatttc acagtaaaaa
61 ttggagattt cggtatgaca cgagacatct acgagacgga ctactaccgg aaaggcggga
121 aggggttgct gcctgtgcgc tggatgtctc tcgagtccct caaggatggt gtcttcacta
181 ctcattctga cgtctggtcc ttcggaattc c
//
LOCUS MUSPTKC 214 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD17.
ACCESSION M33423
KEYWORDS protein tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD17.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 214)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 214 protein tyrosine kinase (AA at 3)
BASE COUNT 69 a 42 c 58 g 45 t
ORIGIN
1 ggatccacag ggacctggca acaaggaaca tattggtgga aaatgagaac agggttaaaa
61 taggagactt cggattaacc aaagtcttgc cgcaggacaa agaatactac aaagtaaagg
121 agccagggga aagaccgata ttctggtacg cacctgaatc cttgacggag agcaagtttt
181 ctgtggcctc agatgtctgg tcctttggaa ttcc
//
LOCUS MUSPTKD 217 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD19.
ACCESSION M33424
KEYWORDS protein tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD19.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 217)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 217 protein tyrosine kinase (AA at 3)
BASE COUNT 64 a 44 c 57 g 52 t
ORIGIN
1 ggatccacag agacttagct gcaagaaact gcatgttgga tgaaaaattc actgtcaagg
61 ttgctgattt cggtcttgcc agagacatgt acgataaaga gtactatagt gtccacaaca
121 agacgggtgc caagctacca gtgaagtgga tggctttaga gagtctgcaa aggcagaagt
181 tcaccaccac gtcagatgtg tggtcctttg gaattcc
//
LOCUS MUSPTKE 214 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD22.
ACCESSION M33425
KEYWORDS protein tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD22.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 214)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 214 protein tyrosine kinase (AA at 3)
BASE COUNT 59 a 46 c 55 g 54 t
ORIGIN
1 ggatccaccg ggacttagca gcaagaaatg tccttgttga gagtgagcat caagtgaaga
61 tcggagactt tggtttaacc aaagcaattg aaaccgataa ggagtactac acagtcaagg
121 acgaccggga cagcccagtg ttctggtacg ctccggagtg tttaatccag tgtaaatttt
181 atatcgcctc tgacgtctgg tcctttggaa ttcc
//
LOCUS MUSPTKF 208 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD175.
ACCESSION M33426
KEYWORDS protein tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD175.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 208)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 208 protein tyrosine kinase (AA at 3)
BASE COUNT 49 a 53 c 55 g 51 t
ORIGIN
1 ggatccaccg tgatctgcga gctgctaacg tcctggtctc tgagtcactc atgtgcaaga
61 ttgcagactt tggcctcgcg agagtcatcg aagataacga gtacacagca agggaaggtg
121 cgaagttccc tatcaagtgg acagctccag aggcgttcaa cttcggctgc ttcactatca
181 aatctgacgt gtggtccttt ggaattcc
//
LOCUS MUSPTKG 208 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds.
ACCESSION M33427
KEYWORDS protein tyrosine kinase.
SOURCE Mouse haemopoietic cell line WEH1-3B D+, cDNA to mRNA, clone W3.13.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 208)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 208 protein tyrosine kinase (AA at 3)
BASE COUNT 53 a 44 c 56 g 55 t
ORIGIN
1 ggatccacag agacctggct gccagaaatt gtctagtgaa tgaagcagga gttgtcaaag
61 tatctgattt tggaatggcc aggtacgttc tggatgatca gtacacaagt tcttctggcg
121 ccaagttccc tgtgaagtgg tgtcccccag aagagtttaa ttacagccgc tttagcagca
181 agtcagacgt gtggtcctat ggaattcc
//
LOCUS DIRPARAM 2545 bp ss-mRNA INV 30-MAY-1990
DEFINITION D.immitis paramyosin mRNA, partial cds.
ACCESSION M29733
KEYWORDS paramyosin.
SOURCE D.immitis, cDNA to mRNA.
ORGANISM Dirofilaria immitis
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 2545)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 2545 paramyosin (AA at 1)
BASE COUNT 949 a 420 c 544 g 632 t
ORIGIN
1 gcattcggta gtatgtcggt tgccgatttg ggaagtttaa ctcgcttgga ggacaagatt
61 cgtctactgc aagaagatct cgaatccgaa cgtgaacttc gaaatagaat cgaacgagaa
121 agagctgatc ttagtgtaca actgattgca ttaactgata gacttgaaga tgctgagggt
181 actactgata gtcagattga atcaaatcgt aaacgtgaag cagaattgca aaaattacgt
241 aaattattgg aagaatcaca attagaaaat gaagatgcaa tgaatgtttt acgtaaaaag
301 catcaagatg catgtctcga ttacgctgaa caaattgaac aattacaaaa gaaaaattca
361 aagattgatc gtgaacgtca acgtctgcaa catgaagtaa ttgagcttac tgcgacaatt
421 gatcaacttc aaaaggataa gcatttggcg gaaaaagcag cggaacgttt tgaagcgcaa
481 actatcgaat tgagtaataa agttgaagat ttaaatcgac atgttaatga tttagctcaa
541 caacgtcaac gtttacaagc tgaaaataac gatcttctca aagagattca tgatcaaaaa
601 gtacaattgg ataatttgca acacgtgaaa tatcaacttg cgcaacaact tgaagaagca
661 cgtcgacgtt tagaagatgc tgaacgcgaa cgttctcaac ttcaagcaca acttcatcaa
721 gtacaattag aattggattc tgtacgtacc gcattagatg aagaatcagc agctcgagca
781 gaagctgaac ataaacttgc tttagcaaat accgagatta cccaatggaa gagtaaattt
841 gatgctgaag ttgctcttca tcatgaagag gttgaagatt tacgaaagaa aatgctccaa
901 aaacaggctg aatatgagga acaaattgaa ataatgttgc aaaaaatttc acaacttgaa
961 aaagctaaat cacgcttaca aagtgaagtt gaagtattaa ttgttgattt ggaaaaagca
1021 caaaatacaa tagcaattct tgaacgagca aaagaacaac tcgaaaaaac tgttaatgaa
1081 ttaaaggtac gtattgatga acttactgtt gaacttgaag cagcacaacg tgaagctcgt
1141 gctgcattag cagaattaca aaagatgaaa aatttatacg aaaaagcaat tgagcaaaaa
1201 gaagccttag ctcgagaaaa taaaaaatta caagatgatt tgcatgaagc aaaagaagcg
1261 cttgctgatg caaatcgtaa attgcatgag cttgatcttg aaaatgcacg attagctggt
1321 gaaatacgag aattgcaaac agctcttaag gaatctgaag ctgcacgacg tgatgctgag
1381 aatcgtgcac aacgtgcact tgctgaatta caacaacttc gtattgaaat ggaacgtcgt
1441 ttgcaagaga aagaggaaga aatggaagca cttcgtaaga atatgcaatt tgaaattgat
1501 cgacttacag ctgcattagc cgatgctgaa gcacgtatga aggcggaaat ttcacgtcta
1561 aagaagaagt atcaagcgga aattgccgaa cttgaaatga ccgttgataa tttgaatcgg
1621 gcaaatattg aagcgcaaaa gacgattaaa aaacaaagtg aacaacttaa aatattgcaa
1681 gcatcacttg aggatactca acgtcaacta cagcaaactc ttgatcagta tgcattggcg
1741 caacgtaaag tttccgcact tagcgctgaa ttagaagaat gcaaagttgc acttgataat
1801 gcaattcgtg ctcgtaaaca agctgaaatt gatcttgaag aagcgaatgc tcgtattact
1861 gatttggtat caattaacaa taatttaaca gcaattaaaa ataaattgga aactgaatta
1921 tcgaccgcac aagctgattt ggatgaagca acgaaagaat tgcatgccgc ggatgagcgg
1981 gctaatcgtg ctttagcaga tgcagctcga gctgttgaac aacttcatga ggagcaggaa
2041 cattccatga aaattgatgc tctcaggaaa tcattagaag agcaggtgaa gcaattacaa
2101 gtacaaattc aagaagctga agctgcagca ttattgggtg gtaaacgtgt aattgctaaa
2161 cttgaaacac gtatacgtga tttggaaact gctttggatg aagaaacacg tcgacataag
2221 gaaacgcaag gtgcgcttag aaagaaagat cgacgcatca aagaagttca aatgcaagtt
2281 gatgaagagc ataagatgtt tgtgatggcg caagatactg ctgataggct tcttgaaaaa
2341 cttaacattc aaaagtggca acttggagaa gctgaatctt taacgatggc taatcttcaa
2401 cgagtacgtc gatatcaacg tgaattggaa gatgctgaag gtcgtgctga tcaagctgaa
2461 agctcattac atcttattcg tgctaaacat cgttcgtcag ttgttaccgg caaaaatgct
2521 tcagcatcta aaatttacgt tctcg
//
LOCUS ONGPARAM1 134 bp ss-mRNA INV 30-MAY-1990
DEFINITION O.volvulus paramyosin mRNA partial cds, clone OV-1.
ACCESSION M29734
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 134)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 134 paramyosin (AA at 1)
BASE COUNT 53 a 23 c 28 g 30 t
ORIGIN
1 ttacaaaaga aaaattcaaa aattgaccgt gaacgtcaac gtctgcaaca tgaagtaatt
61 gagcttacag cggcaattga tcaacttcaa aaggataagc atttggctga gaaagcagca
121 gaacgttttg aagc
//
LOCUS ONGPARAM2 135 bp ss-mRNA INV 30-MAY-1990
DEFINITION O.volvulus paramyosin mRNA partial cds, clone OV-2.
ACCESSION M29735
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 135)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 135 paramyosin (AA at 2)
BASE COUNT 43 a 31 c 26 g 35 t
ORIGIN
1 tttgcaacat gtcaaatatc aacttgcgca acaacttgaa gaagcacgtc gtcgtttgga
61 agatgctgaa cgtgaacgtt cacaacttca agcacaactt catcaagttc agctagaatt
121 ggattccgta cgtac
//
LOCUS ONGPARAM3 176 bp ss-mRNA INV 30-MAY-1990
DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-3.
ACCESSION M29736
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 176)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 176 paramyosin (AA at 1)
BASE COUNT 74 a 20 c 36 g 46 t
ORIGIN
1 aaagttgctc ttcatcatga agaggttgaa gatttacgaa agaaaatgtt gcaaaagcaa
61 gctgaatatg aagagcaaat tgaaataatg ctgcaaaaaa tttcacaact tgaaaaggct
121 aaatcacgtt tgcaaagtga agttgaagta ttgattgttg atttagaaaa agcaca
//
LOCUS ONGPARAM4 414 bp ss-mRNA INV 30-MAY-1990
DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-4.
ACCESSION M29737
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 414)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 414 paramyosin (AA at 3)
BASE COUNT 143 a 70 c 101 g 100 t
ORIGIN
1 ccttagctcg agaaaataaa aagttgcaag atgatttgca tgaagctaaa gaagcgcttg
61 ctgatgcaaa tcgtaaattg catgaacttg atcttgagaa tgcacgatta gctggtgaaa
121 ttcgagaatt gcaaacagca ctcaaagaat ctgaagcagc tcgacgtgat gctgagaatc
181 gtgcacaacg tgcacttgct gaattgcaac aacttcgcat cgaaatggaa cgtcgtttgc
241 aagagaagga agaagaaatg gaagcacttc gtaagaatat gcaatttgaa attgatcgtc
301 ttacagctgc attagctgat gctgaagcac gtatgaaggc ggagattgca cgtttgaaga
361 aaaagtatca ggcggaaatt gccgaacttg aaatgaccgt tgataatttg aatc
//
LOCUS ONGPARAM5 145 bp ss-mRNA INV 30-MAY-1990
DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-5.
ACCESSION M29738
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 145)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 145 paramyosin (AA at 3)
BASE COUNT 48 a 28 c 28 g 41 t
ORIGIN
1 tattgcaagc atcacttgaa gatactcaac gtcaattaca acaaactcta gatcagtatg
61 ctttggcaca acgtaaagta tctgcgctta gtgctgaatt ggaagaatgc aaagttgcac
121 ttgataatgc aattcgtgct cgtaa
//
LOCUS ONGPARAM6 129 bp ss-mRNA INV 30-MAY-1990
DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-6.
ACCESSION M29739
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 129)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 129 paramyosin (AA at 3)
BASE COUNT 48 a 20 c 31 g 30 t
ORIGIN
1 aagtacaaat tcaagaagct gaagcagcag cattattggg tggtaaacgt gtaattgcta
61 aacttgaaac acgtatacgt gatttggaaa ctgctttgga tgaagaaaca cgtcgacata
121 aggaaacgc
//
LOCUS MUSA1AT 1350 bp ss-mRNA ROD 30-MAY-1990
DEFINITION Mouse alpha-1-antitrypsin (AT) mRNA, complete cds.
ACCESSION M33567
KEYWORDS alpha-1-antitrypsin.
SOURCE Mouse liver, cDNA to mRNA.
ORGANISM Mus caroli
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1350)
AUTHORS Latimer,J.J., Berger,F.G. and Baumann,H.
TITLE Highly conserved upstream regions of the alpha-1-antitrypsin gene
in two mouse species govern liver-specific expression by different
mechanisms
JOURNAL Mol. Cell. Biol. 10, 760-769 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 49 1287 alpha-1-antitrypsin (AT)
mRNA 1 > 1350 alpha-1-antitrypsin mRNA
BASE COUNT 355 a 391 c 306 g 298 t
ORIGIN
1 gtctcctcag cctccgtcac cacccagctc tgggacagca agctgaaaat gactccctcc
61 atttcatggg gtctactgct tctggcaggc ctgttttgcc tggtccccag ctttctagct
121 gaggatgttc aggagacaga cacctccaga agggattcag tcccagcctc ccatgatact
181 ccctacaacc tcgagctttc catcagccta taccgggagc ttggacacaa gtccaccact
241 tccaacatct tcttctccca ggtgagcatt gccacagcct ttgctatgct ctccctaggg
301 gagaagggtg acactcacac gcagatccta gagggcctgc agttcaacct cacacaaaca
361 tcggaggctg acatccacaa ggccttccaa cacctcctcc aaaccctcaa caggccagac
421 agtgagctgc agctgagcac aggcaatggc tctttgctca acaatgacct gaagctggtg
481 gagaagtttc tggaagaggc caagaacaat tatcactcag aagtcttctc cgtcaacttt
541 gcagagtcag aggaggccaa gaaagtgatc aatgattttg tggagaaggg aacccaagga
601 aagatagctg aggcggtgaa agaccctgac gaagacacag ttttcgccct ggcgaattac
661 attctcttta aaggcaaatg gaagaagcca ttcgatccta agcacactga ggaagctgaa
721 ttccacgtgg acacagtcac cacggtgaag gtgcccatga tgaccctcac gggcatgctt
781 gacgtgcacc attgcagcac gctgtccagc tgggtgctgc tgatggatta cctgggcaat
841 cgcactgctg tcttcctcct gcccgatgat ggcaagatgc agcatctgga gcaaactctc
901 aacaaggagc tcatctctaa gttcctgcta aacaggcaca gaaggttagc ccaggtccac
961 ttacccagat tgtccctctc tggaaactat actttgaaca cactcatgag tcacctgggc
1021 atcacccgga tcttcaacaa tggggctgac ctctccggaa tcacagagga gaatgctccc
1081 ctgaagctca gcaaggctgc agataaggcc gtgctgacca tggatgagac aggaacagaa
1141 gctgcagcag ctacagtctt acaagccgtt cctatgtcta tgccccctat cctgaacttc
1201 aacaaacctt tcattttcat aatagttgaa gaacacactc agagccccct ctttgtggga
1261 aaagtggtag atcccacacg taaatgacca ccctaagaaa tcatccttcc tttctgaatt
1321 gggtcccttc cttaaactct gacggctgct
//
LOCUS XEBALPA 586 bp ss-mRNA VRT 30-MAY-1990
DEFINITION X.borealis alpha-I globin mRNA, complete cds.
ACCESSION M32453
KEYWORDS alpha globin.
SOURCE X.borealis adult immature red blood cell, cDNA to mRNA.
ORGANISM Xenopus borealis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 586)
AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
TITLE Globin evolution in the genus Xenopus: Comparative analysis of
cDNAs coding for adult globin polypeptides of Xenopus borealis and
Xenopus tropicalis
JOURNAL J. Mol. Evol. 23, 211-223 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 42 470 alpha-I globin protein
BASE COUNT 175 a 140 c 111 g 160 t
ORIGIN
1 atctggtgta gactgcttat tgcacaacac aaacaacaac catgcttctt tcagccgatg
61 acaagaaaca catcaaggca attatgcctt ctattgctgc tcatggcgac aaatttggtg
121 gagaagcttt gtacaggatg ttcttggtca accccaagac caaaacctac ttccctactt
181 ttgacttcca ccacaattca aaacagatca gtgctcatgg caagaaagtt gtggatgctc
241 tgaatgaagc ctccaaccat ttggataaca ttgctggaag cctgagcaag ctgagtgacc
301 tccatgccta tgacctgaga gtggatccgg gcaacttccc attgctggcc cataatattt
361 tggtggttgt tgctatgaac ttccctaagc agtttgatcc tgcaacccat aaggccctgg
421 ataagttcct ggctaccgtt tcttctgttc tgacatccaa atatcgttaa ggctcaacag
481 caacagtagc agaagtctca acatcagaca tcagttaatt atatgcaatt aaactgacaa
541 agcatgttaa agaatgttct gaaataaaca tttgaaccat tatcct
//
LOCUS XEBALPB 525 bp ss-mRNA VRT 30-MAY-1990
DEFINITION X.borealis alpha-II globin mRNA, complete cds.
ACCESSION M32455
KEYWORDS alpha globin.
SOURCE X.borealis adult immature red blood cell, cDNA to mRNA.
ORGANISM Xenopus borealis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 525)
AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
TITLE Globin evolution in the genus Xenopus: Comparative analysis of
cDNAs coding for adult globin polypeptides of Xenopus borealis and
Xenopus tropicalis
JOURNAL J. Mol. Evol. 23, 211-223 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 43 471 alpha-II globin protein
BASE COUNT 159 a 123 c 98 g 145 t
ORIGIN
1 atctggtgtg tactgctaat tgcaacaaaa caaacagcaa caatgcttct tacagctgat
61 gacaagaaac acatcaaggc aattttgcct tccatagctg ctcatggcga caaatttggt
121 ggagaagctt tgtacagaat gttcttgatt aatcctaaaa ccaaaaccta ctttcctaat
181 tttgacttcc accacaattc aaaacaaatc agtgctcatg gcaagaaagt tgtggatgct
241 ctgaatgaag ctgccaacca cttggataac attgctggaa gcatgagcaa gctgagtgac
301 ctccatgcct atgacttgag agtggatccg ggcaacttcc cattgctggc ccataatatt
361 ctggtgactg ttgctatgta cttccctcag cagtttgatc ctcataccca taaggctctg
421 gataagttcc tggcttccgt atcttctgtt ctgacatcca aatatcgtta aggctcagca
481 gcaaaagcaa cagcacaagt ttcaatatta atcatcagta aatca
//
LOCUS XEBBETA 605 bp ss-mRNA VRT 30-MAY-1990
DEFINITION X.borealis beta-I globin mRNA, complete cds.
ACCESSION M32456
KEYWORDS beta globin.
SOURCE X.borealis adult immature red blood cell, cDNA to mRNA.
ORGANISM Xenopus borealis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 605)
AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
TITLE Globin evolution in the genus Xenopus: Comparative analysis of
cDNAs coding for adult globin polypeptides of Xenopus borealis and
Xenopus tropicalis
JOURNAL J. Mol. Evol. 23, 211-223 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 47 487 beta-I globin protein
BASE COUNT 177 a 146 c 126 g 156 t
ORIGIN
1 acgggttatt tttgcagaag ctcagaataa actctaaact ttgaccatgg gtttgacagc
61 acatgatcgt cagctgatca acagcacctg gggcaaagta tgtgccaaga ctataggaaa
121 agaggccctt ggacgtctgc tgtggactta tccctggacc caaagatact ttagttcttt
181 tgggaacctc aacagtgctg acgccgtctt ccacaatgaa gcagtggctg ctcatggtga
241 aaaggtggtg acatctattg gagaggccat caagcacatg gatgacatta agggatatta
301 tgctcagctg agcaaatacc actcagaaac cctacatgtg gatccatgca acttcaagcg
361 cttcggtggc tgcttgtcca tttccctggc tcgccaattc catgaagaat atacacctga
421 gctacatgct gcctatgaac atctgtttga tgcaattgct gatgcccttg ggaagggtta
481 ccactaaacc atcctcaaga acacccaaat ggagtctcta agctacataa taccaaccta
541 caaaatgttt tcctctaaaa cgtagccatt catatctgct cctaataaaa agaaagtttc
601 ttcac
//
LOCUS XEBBETB 619 bp ss-mRNA VRT 30-MAY-1990
DEFINITION X.borealis beta-II globin mRNA, complete cds.
ACCESSION M32458
KEYWORDS beta globin.
SOURCE X.borealis adult immature red blood cell, cDNA to mRNA.
ORGANISM Xenopus borealis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 619)
AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
TITLE Globin evolution in the genus Xenopus: Comparative analysis of
cDNAs coding for adult globin polypeptides of Xenopus borealis and
Xenopus tropicalis
JOURNAL J. Mol. Evol. 23, 211-223 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 48 488 beta-II globin protein
BASE COUNT 185 a 152 c 130 g 152 t
ORIGIN
1 cacgagttct ttttgcagaa actcagaata ctctctcaat tttaaccatg ggtttgacag
61 cacatgaaaa acagctcatc accggcagct ggggcaaaat aaatgccaaa gccataggaa
121 aagaggccct cggcaggctg ctgaatactt ttccttggac ccaaagatac tttagttctt
181 ttgggaacct cggcagtgct gaagccatct tccacaatga agctgtagct gctcatgggg
241 aaaaggtagt gacatctgtt ggagaggcca tcaaacacat ggatgacata aagggctatt
301 atgctgagct gagcaagtac cactctgaaa ccctgcatgt ggatccaaac aacttcaagc
361 gcttcggtgg ctgcttgtcc attaccctgg gtcaccactt tggtgaagaa tatacacccg
421 agttacatgc tgcctatgaa catctgtttg atgcaattgc cgacgccctt ggcaagggtt
481 accactgaac cagcctcaag aacacctgaa cggagtcgca aagctacata gaaacaacct
541 acagtacaca ttgctaaaat gttttttccc cccaaaatgt agccattcat atgagctcct
601 aataaaataa agtttcttc
//
LOCUS XETALPA 581 bp ss-mRNA VRT 30-MAY-1990
DEFINITION X.tropicalis alpha globin mRNA, complete cds.
ACCESSION M32454
KEYWORDS alpha globin.
SOURCE X.tropicalis adult immature red blood cell, cDNA to mRNA.
ORGANISM Xenopus tropicalis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 581)
AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
TITLE Globin evolution in the genus Xenopus: Comparative analysis of
cDNAs coding for adult globin polypeptides of Xenopus borealis and
Xenopus tropicalis
JOURNAL J. Mol. Evol. 23, 211-223 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 42 470 alpha globin protein
BASE COUNT 170 a 138 c 119 g 154 t
ORIGIN
1 agcatctggt gtggactgct tattgcacaa cacaaacaac aatgcatctt acagctgatg
61 acaagaaaca catcaaggcc atttggcctt ctgtagctgc tcatggtgac aaatatggcg
121 gagaagcttt gcacaggatg ttcatgtgtg ctcccaagac caaaacctac tttcctgatt
181 ttgacttcag cgaacattca aaacacatct tggctcatgg caagaaagtt tcggatgctc
241 tgaatgaggc ttgcaaccat ctggacaaca ttgccggatg cctgtccaag ctgagtgacc
301 tccatgccta tgacctgaga gtggatccag gcaacttccc attgctggcc catcaaattc
361 tggtggttgt tgctatccat ttccctaagc agtttgaccc tgcaacccat aaggccctgg
421 acaagttcct ggtttccgta tctaatgttc tgacatccaa atatcgttaa ggctcagcag
481 taacagtagc agaagtttgg acatcagaca tcagttaatg acaaacaatc aaactgacac
541 agcttgtgaa agaatgttct gaaataaaca tttttaaaat t
//
LOCUS XETBETA 563 bp ss-mRNA VRT 30-MAY-1990
DEFINITION X.tropicalis beta globin mRNA, complete cds.
ACCESSION M32457
KEYWORDS beta globin.
SOURCE X.tropicalis adult immature red blood cell, cDNA to mRNA.
ORGANISM Xenopus tropicalis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 563)
AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
TITLE Globin evolution in the genus Xenopus: Comparative analysis of
cDNAs coding for adult globin polypeptides of Xenopus borealis and
Xenopus tropicalis
JOURNAL J. Mol. Evol. 23, 211-223 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 52 495 beta globin protein
BASE COUNT 170 a 133 c 115 g 145 t
ORIGIN
1 aacacaggct ctttttttgc agaagcttaa aataaacgct cagctttgac catggttaat
61 ctgacagcaa aggaacgcca gctcatcact ggcacctgga gcaaaatatg tgcaaagacc
121 ttgggaaaac aggcccttgg cagtatgctg tatacttatc cttggaccca aagatacttt
181 agtagttttg ggaacctcag cagtatcgaa gccatcttcc acaatgcagc tgtggctact
241 catggtgaaa aggtgttgac atctattgga gaggccatca aacacatgga tgacataaag
301 ggatattatg ctcagctgag caagtaccac tcagaaaccc tacatgtgga tccatacaat
361 ttcaagcgct tctgtagctg cacgattatt tccatggctc aaaccctcca agaagacttt
421 acacctgagc tacaggctgc tttcgaaaag ctgtttgctg caattgctga tgcccttggc
481 aagggttacc actaaaccaa cctcaagaac atgtgactgg agtctcttag ctacacagaa
541 acaaaatctc gttttttttc aaa
//
LOCUS ROB11AA 667 bp ds-RNA VRL 30-MAY-1990
DEFINITION Bovine rotavirus gene 11, complete cds.
ACCESSION M33606
KEYWORDS .
SOURCE Bovine rotavirus (strain VMRI) viral RNA.
ORGANISM Bovine rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 667)
AUTHORS Matsui,S.M., Mackow,E.R., Matsuno,S., Paul,P.S. and Greenberg,H.B.
TITLE Sequence analysis of gene 11 equivalents from "short" and "super
short" strains of rotavirus
JOURNAL J. Virol. 64, 120-124 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 22 618 gene 11 protein
BASE COUNT 219 a 123 c 139 g 186 t
ORIGIN
1 ggcttttaaa gcgctacagt gatgtctctc agtattgacg tgacgagtct tccttctttt
61 tcttctagca tttataagaa tgaatcatct gcaacagcgt caactctttc tggaaaatct
121 attggtagga gtgtacaata tgtttcacca gatgcagaag cattcagtaa atatatgctg
181 tcaaagtctc cagaagatat tggaccatct gattctgctt caaacgatcc actcaccagc
241 ttttcgatta gatcgaatgc agttaagaca aatgcagacg ctggggtgtc tatggattca
301 tcagtacaat cacgaccatc aataaatgtt gggtgcgatc aagtggattt ctcctttaat
361 aaaggaatca aagtaaatgc taacctggat tcatcaatat cagtgtcaac aaattcaaga
421 aaggagaaat ccaaaggtga tcgtaaaagt aggaaacact accctaaaat tgaagcagaa
481 tctgattcgg atgagtacgt acttgacgat tcagacagtg atgatggcaa atgtaggaat
541 tgtaaatata aaaggaagta tttcgcacta agaatgagaa tgaaacaagt agctatgcaa
601 ttaatcgaag atttgtaggt ctgacctgag aggtcactag ggagctcccc actcccgttt
661 tgtgacc
//
LOCUS ROH11AA 658 bp ds-RNA VRL 30-MAY-1990
DEFINITION Human rotavirus gene 11, complete cds.
ACCESSION M33607
KEYWORDS .
SOURCE Human rotavirus (strain 69M) viral RNA.
ORGANISM Human rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 658)
AUTHORS Matsui,S.M., Mackow,E.R., Matsuno,S., Paul,P.S. and Greenberg,H.B.
TITLE Sequence analysis of gene 11 equivalents from "short" and "super
short" strains of rotavirus
JOURNAL J. Virol. 64, 120-124 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 22 609 gene 11 protein
BASE COUNT 221 a 118 c 127 g 192 t
ORIGIN
1 ggcttttaaa gcgctacagt gatgtctctc agtattgacg tgacaagtct tccctcaatt
61 tcttctagta tttataaaca tgaatcgtct tcaacaacgt caactatttc tggaaaatct
121 attggtagga gtgaacagta catttcacca gatgcagaag ctttcaataa atacatgttg
181 tcgaagtctc cagaggatat tggaccatct gattctgcat caaacgatcc actcaccagc
241 ttttcgatta gatcgaatgc agttaagaca aatgcagatg ctggcgtgtc tatggattca
301 tcaacacaat cacgaccatc aagtgatatt ggatgcgatc aagtggattt ctccttaaac
361 agaggtatta agattgatgc tatggtagat tcttcaatat caatatctac taaatctaag
421 aaagagaaat ccaaacatga aactaagaac agaaaatgtt atccaaagat tgaagcagag
481 tcagattctg atgactacgt tttagatagt gatgatggta aatgtaagaa ctgtaaatat
541 aaaaagaagt attttgcact tcggttaaga atgaaacaag ttgcaatgca attaattgaa
601 aatttgtgag tctaacctgg gaggtcacta gggagctccc cactcccgtt ttgtgacc
//
LOCUS ROH11AB 667 bp ds-RNA VRL 30-MAY-1990
DEFINITION Human rotavirus gene 11, complete cds.
ACCESSION M33608
KEYWORDS .
SOURCE Human rotavirus (strain DS-1) viral RNA.
ORGANISM Human rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 667)
AUTHORS Matsui,S.M., Mackow,E.R., Matsuno,S., Paul,P.S. and Greenberg,H.B.
TITLE Sequence analysis of gene 11 equivalents from "short" and "super
short" strains of rotavirus
JOURNAL J. Virol. 64, 120-124 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 22 618 gene 11 protein
BASE COUNT 231 a 121 c 121 g 194 t
ORIGIN
1 ggcttttgaa gcgctacagt tatgtctctc agtattgatg tgactggtct tccttcaatt
61 tcttcaagtg tttataaaaa tgaatcgtct tcaacaacgt caactatttc tggaaaatct
121 attggtagga gtgaacagta catttcacca gatgcagaag ctttcagaaa atacatgttg
181 tcaaaatctc cagaagatat tggaccttct gattctgcat cgaacgatcc actcaccagc
241 ttttcgatta gatcgaatgc agttaagaca aatgcagatg ctggcgtgtc tatggattca
301 tcagcgcaat cacgaccatc tagcgacatt ggatacgatc aaatggattt ctccttgaat
361 aaaggtatta aatttgatgc tacagtggac tcttcaatat caatatctac cacatcaaag
421 aaggagaaat ctaaaaacaa aaataagtat aaaaaatgtt acccaaaaat cgaagcagag
481 tctgattccg atgattacat attagatgat tcagatagcg atgatggaaa atgtaaaaat
541 tgcaaatata aaaagaaata ttttgcactt cgtttaagaa tgaagcaagt tgcaatgcaa
601 ttaattaaag atttatgagt ctaacctgag agatcactag ggagctcccc actcccgttt
661 tgtgacc
//GenBank-Updates@genbank.bio.net (05/31/90)
LOCUS SHPMHDQAA 2387 bp ds-DNA MAM 31-MAY-1990
DEFINITION Sheep MHC class II OLA-DQA alpha-chain gene, 3' end.
ACCESSION M33304
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Sheep DNA, clone OLAIIC8-1, library CH.28.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 2387)
AUTHORS Scott,P.C. and Brandon,M.R.
TITLE Nucleotide sequence, polymorphism and expression of ovine MHC class
II alpha genes DQA
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.C.Scott 28-MAR-1990. University of Melbourne, Veterinary
Preclinical Centre, Parkville, Victoria, Australia 3052
FEATURES from to/span description
pept / 359 607 MHC OLA-DQA alpha-chain, exon 2 (AA at 361)
1085 1366 MHC OLA-DQA alpha-chain, exon 3
1604 1758 MHC OLA-DQA alpha-chain, exon 4
pre-msg 358 > 2228 MHC OLA-DQA alpha-chain mRNA
IVS < 1 358 MHC OLA-DQA intron A
IVS 608 1084 MHC OLA-DQA intron B
IVS 1367 1603 MHC OLA-DQA intron C
IVS 608 1084 MHC OLA-DQA intron A
IVS 1367 1603 MHC OLA-DQA intron B
signal 2223 2228 poly-A signal
BASE COUNT 608 a 610 c 489 g 680 t
ORIGIN Chromosome 20.
1 ggatccagct gaggctaaaa ctgtaaaagt gaattagttc caattgacac agtagagtga
61 tactcttcag catcagttgg tagcccagga caagtaccag atgaaaagga tattacatgt
121 atttctgcct ggaaaaggag tcaagatgca ttgcaaagat aaggaggctt tgttgcaagg
181 ctttttaact aactggacaa ctaccaacac agagcgaaga aggagcaggt actgggaatt
241 ttgtctagaa actggccaca aatgaagccc acaatgtttg atagtcaatt tcttctttca
301 ctgcttaatg aagatctttt ctctattttt ccctttcttg ctcctcaccc tgactcagct
361 gaccacattg gcacctatgg cgtaaacatc taccaaacat atggtccctc tggctactat
421 acccatgaat ttgatggaga tgaagagttc tacgtggacc tggaaaagag ggagactgtc
481 tggcgtctgc ctgagtttag taaatttaca agttttgacc ctcagggtgc actgagaaac
541 atagctacgg tgaaacataa tttggagatc ttgattcaaa ggtccaactc tactgctgct
601 accaacagta tgtgttcacc attctgcctc tctttgttgt tcttcccctt cataccagct
661 tcactccctt tttccctagg gatagatacc cttcaccact ctataaaact ttctcctttc
721 caaggagtca ccacattttc tcatggtaat atctgaaccc tcatcttctg ccttctttcc
781 actcatatat ttccatatga tagaaagatc cttacttcca taatgcaagc tttaatattt
841 cataggagag ccccacagac atcccacttg acaggtcgtc aaaaacaggg ggacaaggat
901 aaagcagaag caacatatag tgcctcccaa cagaagggaa gcaagagctt ctcctctgtc
961 agagtgcaac tgttggtgga agggctcccc caggaggcag tgcagtaaag gctcccccag
1021 gaggcagtgc agaatcaggg cagagctact tccgtttcac atctgtgctg tttcctcacc
1081 acagaggttc ctgaggtgac tgtgttttcc aagtctcccg tgatgctggg tcaacccaac
1141 accctcatct gtcacgtgga caacatcttt ccccctgtga tcaacatcac atggctgagg
1201 aacgggcact cggtcacaga gggagtttct gagaccagtt ttctcatcaa gagtgattat
1261 tcttttctca agatcaatta tctcaccttc ctcccttctg atgatgatgt ttatgactgc
1321 aaagtggagc actggggtct ggatgagcca cttctgaaac actggggtat gtacgagttc
1381 cagccttttg gtaccttctt ttctctgtca ggtccacaat gtcatgtttg ttatccccta
1441 atcctcatgg ccttgaactt tgagggattc taatgataga cctcactctc ttctctcagc
1501 tccatgcccc aagtctgtta gaaccaaccc ctcttcaccc atcccacata cacacacaga
1561 aacactctgt attgtgactt ccacaaaact tcagctttca cagagcctga gattccagtc
1621 cctatgtcag agctgacaga gactgtggtc tgtgccctgg ggttgaccgt gggcctcgtg
1681 ggcatcgtgg tgggcaccgt cctcatcatc cgaggtctgc gctcaggtgg cccctccaga
1741 caccaggggc cgttgtgagt catactccag aaggaaggta agaattcaga tctgttagat
1801 ctgggaacat agtacagacg agggaaagtg ggaggaggct gtggacacaa atgtgctgaa
1861 agtttctaaa ttgggaaaca gcatggcagt ggcacaggag ccctgagccc actcatctca
1921 tgccttttct gttgcaggtg cgctgtccat ctttaagaac agaagaacgg acttgctaga
1981 tgacctagag ctattttctg cccaagtcca tcatgtacct tctctcctcc tcctaccctc
2041 ctcctcatac ttcttctctg gacttaaggt gctgtatcat ctcgagctca catacatttg
2101 ggattctccc ccgaccttct aatttttttt tttctcaatt gttacctatt atgggatcac
2161 tgagatatcc cacccagcta cctaatccac caatgaccct aaaataatat atccatggaa
2221 gaaataaatg cctttatgag gtctttactg actttttccc atctttcatc tcagggctga
2281 ttagttccat gtcctcttct gtttcaggcc cttttaacct catttcccag attatgtttc
2341 acggcccagt aacaccggag cagcttgtag agactagtaa tattttt
//
LOCUS SHPOLADQAB 3355 bp ds-DNA MAM 31-MAY-1990
DEFINITION Sheep MHC class II OLA-DQA alpha-chain gene, 3' end.
ACCESSION M33305
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Sheep (strain Merino) DNA, clone OLAIIC17-2, library CH.28.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 3355)
AUTHORS Scott,P.C. and Brandon,M.R.
TITLE Nucleotide sequence, polymorphism and expression of ovine MHC class
II alpha genes DQA
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.C.Scott 28-MAR-1990. University of Melbourne, Veterinary
Preclinical Centre, Parkville, Victoria, Australia 3052
FEATURES from to/span description
pept / 1057 1305 MHC OLA-DQA alpha-chain, exon 2 (AA at 1059)
1751 2032 MHC OLA-DQA alpha-chain, exon 3
2281 2435 MHC OLA-DQA alpha-chain, exon 4
IVS < 1 1056 MHC OLA-DQA intron A
IVS 1306 1750 MHC OLA-DQA intron B
IVS 2033 2280 MHC OLA-DQA intron C
signal 2913 2918 poly-A signal
BASE COUNT 888 a 842 c 665 g 960 t
ORIGIN
1 ctgcagtgca gagacctggg tttgatccct gggtcgggaa gatcccctgg agaaggaaat
61 ggctacccac catgagtatt cttgcctgga taatccatgg acagaggaac ccagcaggct
121 aaagtccatg ggatcgcaaa gcgttggaca cgactgagtg actttcacat taaagaaaga
181 aataagggga aggaagtgtc agaaatgcaa aataaagctt catttttttt tttattttca
241 gaaggaatca cggcattcac tttaagaact cactacggtt aacaaaatgg acaccttaga
301 agcattgttt attctttata tgctttgtta tgttgtcact gtgcttgcta tactgtttct
361 tataccttat aaaagaggaa tctctgttaa aatagtagga tcatgtgtgc ttttccttgg
421 ttaaaaaaaa aaaataaaga aggaatctct atgcagtaaa tgcttctgct atgcctaccc
481 ccgagggaaa agaactcgaa tgaatgtgtt attatctgat gaaaaacttt tgaagaagag
541 gaaacaatgc taataccagg aaaattcatt cattcattcc cttttttctt ctctctttcc
601 attataatct ttattacaga ctacaatgtg ccaggcactg agctgggcag gcaggaaata
661 gcagtaaacg aagcctgtga aatcctgccc ttgtgagcat acaagctcag tcgtgatgga
721 actctcaaac ccatggactg tagcctttcc cttctccagg ggacttccca acccagggat
781 cgaacccagg tctcccactg caggcggatt ctttaccagc caagccgcaa gggactctca
841 agaatcacat atgttacagt gcaaaagcag cctggattct aacaggacag ctaccaacac
901 ttagggaaaa ggaagcaggt gctggcactt tgcttagaga cattgtgcca aaggtgaagc
961 ccaccgtgtt tgaaagttag tttcttcagc tactttgttt aatatggcct tttctctctg
1021 ttttccacct tcctgctcct caccctcact tatcagctga ccactttggc tcctatggca
1081 cagagatcta ccaatctcat ggtccctctg gccagtacac ccaggaattt gatggagacg
1141 agctgtttta tgtggacctg gggaagaagg agactgtctg gaggctgcct atgtttagcc
1201 agtttgcagg ttttgatcca cagggtgcac tgagtgaaat agctacagca aaacacaact
1261 tggatatcct gactaaacgc tccaacttta cccctgctat caatggtaag tgtccaccat
1321 tctacttctc tttactgaat ctattctttc atatcaggct tcactccctt cttttctaag
1381 gagagatatc cttcaccatg ctatgaaact ttcccaagtg tccccagatt tcatagtaat
1441 tattgaacga ctcatcctct cccacctcaa agatcacata tttccatgta atataaggac
1501 ccttactccc ataacatatt ccttgaatcc ctcaaggagg agtcccacag acctcctcct
1561 taacaagcat gcccacagac agcacgggga taaagcatgg gcaacatata gcatctccca
1621 gcagaaggcg aacaagagct cctcctctgt cagactggga aacgttgtga gagggctccc
1681 ccaggaggca gtgcagaatc agggccaagc tttttcccca tttcacatct gtgctgtttt
1741 ctcaccatag aagttcctga ggtgactgtg ttttccaagt ctcccgtgat gctgggtcaa
1801 cccaacaccc tcatctgtca cgtggacaac atttttcccc ctgtgatcaa catcacatgg
1861 ctgaagaacg ggcatgcagt cacagagggt gtttctgaga ccagcttcct tcccaaggat
1921 gatcattcct tcctcaagat tggttatctc accttcctcc cttctgatga tgacgtttat
1981 gactgcaaag tggagcactg gggcctggat gagccacttc tgaaacactg gggtatgtag
2041 agttcaccct tttggtactc tctctttttc taccaagtac aaaacactga gttttggtcc
2101 tcatctcaca gtcaagcttg ttttccacac ttcaaggttt ctaaaattat agaacttcat
2161 tctcttccct tagcctggtg ccctgagttt ttgtagcatg aacacacact ctgcccaatc
2221 tcatacacag gcacatgaac caaccctctg tattctgact ttcataactt cattttccag
2281 agcctgagat tccagcccct atgtcagagc tgacagagac tgtggtctgt gccctggggt
2341 tgaccgtggg ccttgtgggc atcgtggtgg gcaccatctt catcatccaa ggcctgcgct
2401 caggtggggc ctccagacac cagggtcccc tgtgagtcgc accctagaaa ggaaggtaag
2461 gattcatatt tgtcagtgcc acagacacac ttcaggagaa agcaaagcgg gaagtaaatt
2521 gtagacacaa atgtggttga aagttgtggg aaacgtggga accagcatga ctacagcaca
2581 gaagctccct aggactcatc agtttcatgc ctttcctgtt gcaggtctct gcccatctag
2641 gattgcagaa gagaggacgt tctagacgac ctagaactag tttctggcca agttcatcat
2701 atactctctc ttcccctaca cactctgccc ctcctctctt ctctgggact taagatgctg
2761 ttacatctca gagctcacat ataccccaga gttctcccct gactttctga tatttttttc
2821 tgttctcagt agttgcctac catgagatca ctggggtatt ccacccagtt acctactccc
2881 tagtgacccc aaccccatat caccatggaa gcaataaact ccccttcaat agaaattttt
2941 cctgtctttc atcacggagc tgaccaagat catgttctct tatgctttag gccccttaat
3001 ttcatttctc agatcatgtt tcatgctgaa gaacataaga gcaactttta caaattcata
3061 atatttcagt atgttatgtt cactatggtt aatatttctc tcttcctact atttccaatc
3121 ttggctcttg gcacttccat ccacccctta attcaggcat cagtgaaagt aatatatacc
3181 ttttttcctt ggtttatatt tgttatagca gaaatacaaa aaccaagaga ggagtctgta
3241 ctcatctctt tttttttttt ggctgcactg tccagcctgt gagatcttag ttctccaacc
3301 tggcatcgaa ctcagaacca cagcaaagaa agctggtgtt cactactgaa tcagg
//
LOCUS BSUPOLCA 240 bp ds-DNA BCT 31-MAY-1990
DEFINITION B.subtilis DNA polymerase III (polC) gene, partial cds.
ACCESSION M33543
KEYWORDS DNA polymerase III.
SOURCE B.subtilis DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 240)
AUTHORS Barnes,M.H., Hammond,R.A., Foster,K.A., Mitchener,J.A. and
Brown,N.C.
TITLE The cloned polC gene of Bacillus subtilis: Characterization of the
azp12 mutation and controlled in vitro synthesis of active DNA
polymerase III
JOURNAL Gene 85, 177-186 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 240 DNA polymerase III (polC) (AA at 1)
mut 76 76 t in wt; g in azp12 mutant
BASE COUNT 59 a 52 c 56 g 73 t
ORIGIN
1 gaattcggaa cccgatttgt ccggcagatg cttgaagata caaagccgac cactttttct
61 gagctcgttc agatttcagg cttgtctcac ggaactgatg tatggcttgg caatgcacag
121 gagctcatcc acaataatat ttgtgagctg agtgaggtta tcggctgccg tgatgacatt
181 atggtttatt taatctatca aggccttgag ccgtcccttg cctttaaaat catggaattc
//
LOCUS FIBGLUC 1426 bp ds-DNA BCT 31-MAY-1990
DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene,
complete cds.
ACCESSION M33676
KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase.
SOURCE F.succinogenes (strain S85) DNA, clone PJI5.
ORGANISM Fibrobacter succinogenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1426)
AUTHORS Teather,R.M. and Erfle,J.D.
TITLE DNA sequence of a Fibrobacter succinogenes mixed linkage
beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.M.Teather, 11-APR-1990.
FEATURES from to/span description
pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase
precursor (EC 3.2.1.73)
sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase
signal peptide
matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase
binding 132 137 ribosome binding site
signal 62 66 -35 region
signal 85 90 -10 region
BASE COUNT 371 a 346 c 335 g 374 t
ORIGIN
1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt
61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt
121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc
181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt
241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg
301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc
361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc
421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct
481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac
541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac
601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt
661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat
721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat
781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt
841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc
901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa
961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct
1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac
1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt
1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg
1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt
1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa
1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt
1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa
//
LOCUS CEATAT 264 bp ss-RNA VRL 31-MAY-1990
DEFINITION Caprine arthritis encephalitis virus tat gene, complete cds.
ACCESSION M33674
KEYWORDS tat protein.
SOURCE Caprine arthritis encephalitis virus (strain Cork) proviral DNA,
clone pCo1.9, isolated from goat synovial membrane.
ORGANISM Caprine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
Robinson,M.M. and Cheevers,W.P.
TITLE Genetic structure of the pol-env region of the Caprine arthritis
encephalitis lentivirus genome: Possible role in trans-activation
of the viral long terminal repeat
JOURNAL Unpublished (1990) see COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.P.Cheevers, 11-APR-1990. Washington State University, Dept.
of Veterinary Microbiology and Pathology, Pullman, WA 99164-7040
FEATURES from to/span description
pept 1 264 tat protein
BASE COUNT 97 a 40 c 80 g 47 t
ORIGIN Map position 5856.
1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta
61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg
121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg
181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta
241 tgtaacccgg ggtggggaac ataa
//
LOCUS CEATATX 264 bp ss-RNA VRL 31-MAY-1990
DEFINITION Caprine arthritis encephalitis virus tat protein mRNA, complete
cds.
ACCESSION M33675
KEYWORDS tat protein.
SOURCE Caprine arthritis encephalitis virus (strain G63) cDNA to viral
RNA, clone pC63-49, isolated from goat synovial membrane.
ORGANISM Caprine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
Robinson,M.M. and Cheevers,W.P.
TITLE Genetic structure of the pol-env region of the Caprine arthritis
encephalitis lentivirus genome: Possible role in trans-activation
of the viral long terminal repeat
JOURNAL Unpublished (1990) see COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.P.Cheevers, 11-APR-1990. Washington State University, Dept.
of Veterinary Microbiology and Pathology, Pullman, WA 99164-7040
FEATURES from to/span description
pept 1 264 tat protein
BASE COUNT 99 a 43 c 75 g 47 t
ORIGIN Map position 5856.
1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata
61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg
121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg
181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta
241 tgtaacccgg ggtgggaaac ataa
//
LOCUS MUSLRPA 3070 bp ss-mRNA ROD 31-MAY-1990
DEFINITION Mouse LCA-related phosphatase (LRP) mRNA, complete cds.
ACCESSION M33671
KEYWORDS leukocyte common antigen-related phosphatase; tyrosine phosphatase.
SOURCE Mouse (strain (C57BL/6 X DBA/2)F1) pre-B cell line 70Z/3, cDNA to
mRNA, clones mLRP-[B20,C26].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3070)
AUTHORS Matthews,R.J., Cahir,E.D. and Thomas,M.L.
TITLE Identification of a novel member of the protein tyrosine
phosphatase family: Evidence for alternative splicing in the
tyrosine phosphatase domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.L.Thomas, 11-APR-1990.
FEATURES from to/span description
pept 26 2515 LCA-related phosphatase precursor (EC 3.13.48)
sigp 26 82 LCA-related phosphatase signal peptide
matp 83 2512 LCA-related phosphatase
mRNA < 1 3070 LCA-related phosphatase mRNA
site 828 936 variably spliced exon
BASE COUNT 874 a 754 c 732 g 710 t
ORIGIN
1 ccgcccagcg ccgggctcgg tcagcatgga ttcctggttc attcttgtcc tgtttggcag
61 tggtctaata catgttagtg ccaacaatgc tactacagtt tcaccttctt taggaacgac
121 aagattaatt aaaacatcaa caacagaatt ggctaaggaa gagaataaaa cctcaaattc
181 aacctcttca gtaatttctc tttctgtggc accaacattc agcccaaacc tgactctgga
241 gcccacctat gtgactactg ttaattcttc acactctgac aatgggacca ggagggcagc
301 cagcacggaa tctggaggca ctaccatttc cccgaacgga agctggctta ttgagaacca
361 gttcacggat gccataacag aaccctggga ggggaactcc agcactgcag caaccactcc
421 agaaaccttc cccccggcag atgagacacc aattattgcg gtgatggtgg ccctgtcctc
481 tctgctagta atcgtgttta ttatcatagt tctgtacatg ttaaggttta agaaatacaa
541 gcaagctggg agtcattcca actctttccg cctgtcaaat ggccgcacgg aggatgtgga
601 gccccaaagt gtaccacttc tggccaggtc cccaagcacc aacaggaagt acccaccact
661 gcctgtggac aagctggaag aggagattaa ccggagaatg gctgatgaca ataagatctt
721 cagagaagaa ttcaacgctc tccctgcttg tcctatccag gccacctgtg aggctgcctc
781 caaggaagaa aacaaggaaa aaaaccgcta tgtaaacatc ctgccctttc tctctttagc
841 tgtgagcaag gatgcagtga aagcactgaa caaaaccact ccattgttag aaagaaggtt
901 tattgggaaa tcaaactcca gaggctgtct ctcagatgac cactctagag tgcacctgac
961 acctgttgaa ggggtcccag attctgatta catcaacgct tcattcatta atggctacca
1021 ggaaaagaac aaattcatcg ctgcacaagg accaaaagaa gaaacagtga atgacttctg
1081 gagaatgata tgggaacaaa acacagctac tattgtcatg gtgaccaacc tgaaggagag
1141 aaaggagtgt aaatgtgccc aatactggcc agaccaaggc tgctggacct atgggaatgt
1201 ccgtgtgtct gtcgaggatg tgactgttct ggtggactac acagtacgga aattctgcat
1261 ccagcaggtg ggcgacgtga ccaacaggaa accacagcgc ctcatcactc agttccactt
1321 caccagctgg ccagactttg gggtgccttt caccccaatt ggcatgctca agttcctcaa
1381 gaaggtgaag gcctgtaacc ctcagtacgc aggggctatc gtggtccact gcagtgcagg
1441 tgtagggcgc actggcacct ttgttgtcat cgatgccatg ctggacatga tgcattcgga
1501 acgcaaagtg gatgtatacg ggtttgtgag ccggatccgg gcccagcgct gccagatggt
1561 acagacagac atgcagtacg tcttcatata ccaggccctt ctggagcatt atctgtatgg
1621 ggacacagaa ctggaagtga cttctctaga aacccaccta caaaaaattt ataacaagat
1681 cccagggacg agcaacaacg ggttagagga ggagtttaag aaattaactt caatcaaaat
1741 ccagaatgac aagatgcgca cgggaaacct tccagccaac atgaagaaga accgggtttt
1801 acagatcatt ccatatgaat ttaacagagt gatcattcca gtcaaacgag gcgaagagaa
1861 cacagactat gtgaacgcat ccttcattga tggataccgg cagaaagact cctacattgc
1921 cagccagggc cctcttctcc acacgattga ggacttctgg cgaatgatct gggagtggaa
1981 gtcctgttct atcgtaatgc tgacagaact ggaagagaga ggccaggaga agtgtgccca
2041 gtactggcca tctgatggcc tggtgtccta tggagacatc acagttgagc tgaagaagga
2101 ggaggaatgt gaaagctaca ctgtccgaga cctcctggtc accaacacca gggagaacaa
2161 gagtcggcaa atccggcagt tccacttcca cggctggcct gaggtgggca tccccagcga
2221 cggcaagggc atgatcaaca tcattgcagc agtgcagaag cagcagcagc agtcggggaa
2281 ccatcccatc actgtgcact gcagtgccgg ggcaggacgg acaggaacct tctgtgcctt
2341 gagcacagtc ctggaacgtg tgaaagcaga aggaatttta gatgtcttcc aaactgtcaa
2401 gagcctgcgg ctgcagaggc cacacatggt ccagacactg gaacagtatg aattctgcta
2461 caaggtggta caggagtaca ttgacgcctt ttcagattat gccaacttca agtgacaggt
2521 gacaaggccc acagacagga gaattgcctt taatattttg taatattctg tttttgttaa
2581 tatacccaaa attgtatata tcttataact gttttagaaa tggcacatag gcttctatta
2641 cctgttaggt ggagattttg tatgtaaatg tgttagcact gatagtcctt ttccagtgtt
2701 ttattgggaa attaaatagt gtgatatttg ggttgatata atgaaatcct cagcctggaa
2761 actgggccag attgttcctt gcttcaaata tcttttccta aagaagataa acctaagact
2821 cattccaggt agctcagtgc caactaaaac aaagcacaaa gttctcagag ctcttgagga
2881 aatggttgtc tccctgtccc caggcaggcc tcttcccctc cctgtcctgt aaatatccct
2941 cccctctcca gtccaccctc atctcccacc aagatcagcc acctcaggca tggggagtaa
3001 tgagaccaga gcgcctctct ggcaccacag cagggatcgt caggtaataa acactcttga
3061 ttccctgagg
//
LOCUS BOV33DPTP 982 bp ss-mRNA MAM 31-MAY-1990
DEFINITION Cow 33-kDa phototransducing protein mRNA, 3' end.
ACCESSION M33529
KEYWORDS phototransducing protein.
SOURCE Cow adult photoreceptor rod cell, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 982)
AUTHORS Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S.,
Yamaki,K. and Shinohara,T.
TITLE Analysis of the human, bovine and rat 33-kDa proteins and cDNA in
retina and pineal gland
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Shinohara, 03-APR-1990.
FEATURES from to/span description
pept < 1 719 33-kDa phototransducing protein (AA at 3)
BASE COUNT 334 a 164 c 215 g 269 t
ORIGIN
1 aaagtttgga ggaagacttt gaaggacagg cctcacatac aggacccaaa ggagtaataa
61 atgactggag aaagtttaaa ttggagagtg aagatagtga ttcagttgca cccagcaaga
121 aggagattct cagacaaatg tcttctcctc agagtagaga tgacaaagac tcaaaagaaa
181 gattcagcag aaagatgagc gttcaagaat atgaactaat ccacaaagac aaagaagatg
241 aaaattgcct tcgtaaatac cgcagacagt gtatgcagga tatgcaccag aagctgagtt
301 ttgggcctag atatgggttt gtgtatgagc tggaatctgg ggagcaattc ctggaaacca
361 ttgaaaagga acagaaaatc accactatcg ttgttcatat ttatgaagat ggtattaagg
421 gctgtgatgc tctaaacagt agcttgatat gccttgcagc cgaataccct atggtcaagt
481 tttgtaaaat aaaggcttct aatacaggtg ccggagaccg cttttcctca gatgtactcc
541 ccacgctgct tgtctacaaa ggtggggaac tcctaagcaa tttcattagt gttactgaac
601 agctggctga agaatttttt actggggatg tggagtcttt cctaaatgaa tatgggttat
661 tacctgaaaa agagatgcat gtcctagagc agagcaaaat ggaagaggat atggaataaa
721 gattcactat gtcaatgttt catatttcgt atttctcctt taagcattga acactgattt
781 tggtagtatt cacattcttt tagggaatac caaacatagc cctggctttt ctaatttggg
841 gaagaaaaac tccagactga cactaaaatt atatgattag catgtcttaa tattagttac
901 tcaagctgat ataacacttt acctcaaaac attgtagtct tcagcaatat gttagtagac
961 aaagagaata tgaaaaatac tg
//
LOCUS HUM33DPTP 1182 bp ss-mRNA PRI 31-MAY-1990
DEFINITION Human 33-kDa phototransducing protein mRNA, complete cds.
ACCESSION M33478
KEYWORDS phototransducing protein.
SOURCE Human adult photoreceptor rod cell, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1182)
AUTHORS Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S.,
Yamaki,K. and Shinohara,T.
TITLE Analysis of the human, bovine and rat 33-kDa proteins and cDNA in
retina and pineal gland
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Shinohara, 30-MAR-1990.
FEATURES from to/span description
pept 52 792 33-kDa phototransducing protein
mRNA < 1 1181 33-kDa phototransducing protein mRNA
BASE COUNT 405 a 187 c 242 g 348 t
ORIGIN
1 aggacaccag gcacagagat ccaaactatt atatcaaatc caatccctaa aatggaagaa
61 gccaaaagcc aaagtttgga ggaagacttt gaaggacagg ccacacatac aggacccaaa
121 ggagtaataa atgattggag aaagtttaaa ttagagagtc aagacagtga ttcaattcca
181 cctagcaaga aggagattct caggcaaatg tcttctcctc agagtaggaa tggcaaagat
241 tcaaaggaac gagtcagcag aaagatgagc attcaagaat atgaactaat ccataaagag
301 aaagaggatg aaaactgcct tcgtaaatac cgtagacagt gtatgcagga tatgcaccag
361 aagctgagtt ttgggcctag atatgggttt gtgtatgagc tggaaactgg aaagcaattc
421 ctagaaacaa ttgaaaagga actgaagatc accacaattg ttgttcacat ttatgaagat
481 ggtattaagg gttgtgatgc tctaaacagt agtttaacat gccttgcagc agaataccct
541 atagttaagt tttgtaaaat aaaagcttcg aatacaggtg ctggggaccg cttttcctta
601 gatgtacttc ctacactgct catctataaa ggtggggaac tcataagcaa ttttattagt
661 gttgctgaac agtttgctga agaatttttt gctggggatg tggagtcttt cctaaatgaa
721 tatgggttac tacctgaaag agaggtacat gtcctagagc ataccaaaat agaagaagaa
781 gatgttgaat gaagattcac tatgtcaata tctcatgttt atcctttagg tattggatga
841 tggttttggt agtatctata ttgcttttgt gaacacagag tatgggcacg gctatgctaa
901 cttgacaaaa atgactgatg caacaatcga gttattagca tttcatagta ttagttactc
961 aaattgatac aatgcttgac tacaaaacaa agctgtcttc agcaacatta ttagtagaca
1021 aagaggatgt ggataatatt atgacatttt tcaaaaatcc ctttcaagtt atgttttgtc
1081 ttttttactc cattttccct catcactgtt attatttgga cttttcaaat tacattattc
1141 attataattt tctttgtgta ataaaaatga aatctcatga ag
//
LOCUS RAT33DPPT 1221 bp ss-mRNA ROD 31-MAY-1990
DEFINITION Rat 33-kDa phototransducing protein mRNA, complete cds.
ACCESSION M33530
KEYWORDS phototransducing protein.
SOURCE Rat (strain Sprague-Dawley) adult photoreceptor rod cell, cDNA to
mRNA, pineal gland library.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1221)
AUTHORS Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S.,
Yamaki,K. and Shinohara,T.
TITLE Analysis of the human, bovine and rat 33-kDa proteins and cDNA in
retina and pineal gland
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Shinohara, 03-APR-1990.
FEATURES from to/span description
pept 92 832 33-kDa phototransducing protein
BASE COUNT 399 a 234 c 282 g 306 t
ORIGIN
1 gttcagtgga cagagattct cacccactgg acataatcta ggactccagg aacagagacc
61 ccaaactact acaccgaaca caacccataa aatggaagaa gccgcaagcc aaagcttaga
121 ggaagatttt gaaggacagg ccacacacac aggacccaaa ggagtaataa atgactggag
181 aaagtttaaa ttagaaagtg aagatggtga ttcaattcca cccagcaaga aggagatcct
241 cagacaaatg tcctctcctc agagcagaga tgacaaagac tcgaaagaaa gaatgagcag
301 aaagatgagc attcaagaat atgaactaat tcatcaggac aaagaagatg aaggttgcct
361 tcgcaaatac cgcagacagt gcatgcagga tatgcatcag aagctgagct ttgggcctag
421 gtatgggttt gtgtatgagc tggaaacagg ggagcaattc ctggaaacca tcgaaaagga
481 gcagaaggtc accaccatcg tggttaacat ttacgaggat ggtgtcaggg gctgtgacgc
541 actcaacagc agtttagaat gccttgcagc agaataccca atggtcaagt tctgtaaaat
601 aagagcttcg aatactggag ctggggaccg cttttcctca gacgtactcc cgacattgct
661 cgtatacaaa ggtggggagc tcataagcaa ttttattagt gtcgctgaac aatttgccga
721 agattttttc gctgcggatg tggagtcttt cctaaatgaa tatggcttac taccagaaag
781 agagatacat gacctagggc agaccaacac ggaagatgaa gatatcgagt aagcacgcac
841 ggtgcagtat ctcgtgacta tcctttgcac agtgaacatc gatggttttt ttggtagtat
901 ctatattcct ttagcaaaca ctaaatacag tcaggccaac tcaaattggg gaggggacac
961 taaaattatg taaatagcat ttttagtact aattattcaa attgagataa tattttactg
1021 caaaaaatat tgtggttctg agcaaataac tactgaacag aggaggtggg tgatgtcaac
1081 attaggtcat tgtaaaaatc cctttccagt tccgtgttag ctttttactc ctccttttcc
1141 ccttggcatt tatttttggg cttttgaggc tgaatgctac ttataaattg gtttgtatga
1201 taagaataaa gtcctatgaa g
//
LOCUS RAT33DPTP 1272 bp ss-mRNA ROD 31-MAY-1990
DEFINITION Rat 33-kDa phototransducing protein mRNA, complete cds.
ACCESSION M33528
KEYWORDS phototransducing protein.
SOURCE Rat (strain Sprague-Dawley) adult photoreceptor rod cell, cDNA to
mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1272)
AUTHORS Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S.,
Yamaki,K. and Shinohara,T.
TITLE Analysis of the human, bovine and rat 33-kDa proteins and cDNA in
retina and pineal gland
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Shinohara, 03-APR-1990.
FEATURES from to/span description
pept 82 822 33-kDa phototransducing protein
mRNA < 1 1272 33-kDa phototransducing protein mRNA
BASE COUNT 420 a 244 c 289 g 319 t
ORIGIN
1 cagagattct cacccactgg acataatcta ggactccagg aacagagacc ccaaactact
61 acaccgaaca caacccataa aatggaagaa gccgcaagcc aaagcttaga ggaagatttt
121 gaaggacagg ccacacacac aggacccaaa ggagtaataa atgactggag aaagtttaaa
181 ttagaaagtg aagatggtga ttcaattcca cccagcaaga aggagatcct cagacaaatg
241 tcctctcctc agagcagaga tgacaaagac tcgaaagaaa gaatgagcag aaagatgagc
301 attcaagaat atgaactaat tcatcaggac aaagaagatg aaggttgcct tcgcaaatac
361 cgcagacagt gcatgcagga tatgcatcag aagctgagct ttgggcctag gtatgggttt
421 gtgtatgagc tggaaacagg ggagcaattc ctggaaacca tcgaaaagga gcagaaggtc
481 accaccatcg tggttaacat ttacgaggat ggtgtcaggg gctgtgacgc actcaacagc
541 agtttagaat gccttgcagc agagtaccca atggtcaagt tctgtaaaat aagagcttcg
601 aatactggag ctggggaccg cttttcctca gacgtactcc cgacattgct catatacaaa
661 ggtggggagc tcataagcaa ttttattagt gtcgctgaac aatttgccga agattttttc
721 gctgcggatg tggagtcttt cctaaatgaa tatggcttac taccagaaag agagatacat
781 gacctagggc agaccaacac ggaagatgaa gatatcgagt aagcacgcac ggtgcagtat
841 ctcgtgacta tcctttgcac agtgaacatc gatggttttt ttggtagtat ctatattcct
901 ttagcaaaca ctaaatacag tcaggccaac tcaaattggg gaggggacac taaaattatg
961 taaatagcat ttttagtact aattattcaa attgagataa tattttactg caaaaaatat
1021 tgtggttctg agcaaataac tactgaacag aggaggtggg tgatgtcaac gttaggtcat
1081 tgtaaaaatc cctttccagt tccgtgttag cttattactc ctccttttcc ccttggcatt
1141 tatttttggg cttttgaggc tgaatgctac ttataaattg gtttgtatga taagaataaa
1201 gtcctatgaa gaaacaaata gtctgtctaa agtctgccgt gagtcaagga cttaaaaact
1261 tcataaagct tt
//
LOCUS HUMCPIIA3A 1748 bp ss-mRNA PRI 31-MAY-1990
DEFINITION Human cytochrome P450IIA3 (CYP2A3) mRNA, complete cds.
ACCESSION M33318 M33316
KEYWORDS coumarin 7-hydroxylase; cytochrome P450; cytochrome P450IIA3.
SOURCE Human liver DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1748)
AUTHORS Yamano,S., Tatsuno,J. and Gonzalez,F.J.
TITLE The CYP2A3 gene product catalyzes coumarin 7-hydroxylation in human
liver microsomes
JOURNAL Biochemistry 29, 1322-1329 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 10 1494 cytochrome P450IIA3 /nomgen="CYP2A"
/map="19q13.1-q13.2" /hgml_locus_uid="LN0074T"
mRNA < 1 1748 CYPIIA3 mRNA
allele 60 60 g in allele CYP2A3; a in allele CYP2A3v
allele 488 488 t in allele CYP2A3; a in allele CYP2A3v
allele 1645 1645 g in allele CYP2A3; c in allele CYP2A3v
BASE COUNT 395 a 478 c 516 g 359 t
ORIGIN
1 gctaccacca tgctggcctc agggatgctt ctggtggcct tgctggtctg cctgactgtg
61 atggtcttga tgtctgtttg gcagcagagg aagagcaagg ggaagctgcc tccgggaccc
121 accccattgc ccttcattgg aaactacctg cagctgaaca cagagcagat gtacaactcc
181 ctcatgaaga tcagtgagcg ctatggcccc gtgttcacca ttcacttggg gccccggcgg
241 gtcgtggtgc tgtgtggaca tgatgccgtc agggaggctc tggtggacca ggctgaggag
301 ttcagcgggc gaggcgagca agccaccttc gactgggtct tcaaaggcta tggcgtggta
361 ttcagcaacg gggagcgcgc caagcagctc cggcgcttct ccatcgccac cctgcgggac
421 ttcggggtgg gcaagcgagg catcgaggag cgcatccagg aggaggcggg cttcctcatc
481 gacgccctcc ggggcactgg cggcgccaat atcgatccca ccttcttcct gagccgcaca
541 gtctccaatg tcatcagctc cattgtcttt ggggaccgct ttgactataa ggacaaagag
601 ttcctgtcac tgttgcgcat gatgctagga atcttccagt tcacgtcaac ctccacgggg
661 cagctctatg agatgttctc ttcggtgatg aaacacctgc caggaccaca gcaacaggcc
721 tttcagttgc tgcaagggct ggaggacttc atagccaaga aggtggagca caaccagcgc
781 acgctggatc ccaattcccc acgggacttc attgactcct ttctcatccg catgcaggag
841 gaggagaaga accccaacac ggagttctac ttgaaaaacc tggtgatgac cacgttgaac
901 ctcttcattg ggggcaccga gaccgtcagc accaccctgc gctatggctt cttgctgctc
961 atgaagcacc cagaggtgga ggccaaggtc catgaggaga ttgacagagt gatcggcaag
1021 aaccggcagc ccaagtttga ggaccgggcc aagatgccct acatggaggc agtgatccac
1081 gagatccaaa gatttggaga cgtgatcccc atgagtttgg cccgcagagt caaaaaggac
1141 accaagtttc gggatttctt cctccctaag ggcaccgaag tgtaccctat gctgggctct
1201 gtgctgagag accccagttt cttctccaac ccccaggact tcaatcccca gcacttcctg
1261 aatgagaagg ggcagtttaa gaagagtgat gcttttgtgc ccttttccat cggaaagcgg
1321 aactgtttcg gagaaggcct ggccagaatg gagctctttc tcttcttcac caccgtcatg
1381 cagaacttcc gcctcaagtc ctcccagtca cctaaggaca ttgacgtgtc ccccaaacac
1441 gtgggctttg ccacgatccc acgaaactac accatgagct tcctgccccg ctgagcgagg
1501 gctgtgccgg tgcaggtctg gtgggcgggg ccagggaaag ggcagggcca agaccgggct
1561 tgggagaggg gcgcagctaa gactgggggc aggatggcgg aaaggaaggg gcgtggtggc
1621 tagagggaag agaagaaaca gaaggggctc agttcacctt gataaggtgc ttccgagctg
1681 ggatgagagg aaggaaaccc ttacattatg ctatgaagag tagtaataat agcagctctt
1741 atttcctg
//
LOCUS HUMCYIIA4A 1747 bp ss-mRNA PRI 31-MAY-1990
DEFINITION Human cytochrome P450IIA4 (CYP2A4) mRNA, complete cds.
ACCESSION M33317
KEYWORDS cytochrome P450; cytochrome P450IIA4.
SOURCE Human liver DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1747)
AUTHORS Yamano,S., Tatsuno,J. and Gonzalez,F.J.
TITLE The CYP2A3 gene product catalyzes coumarin 7-hydroxylation in human
liver microsomes
JOURNAL Biochemistry 29, 1322-1329 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 10 1494 cytochrome P450IIA4 /hgml_locus_uid="LN0074T"
/nomgen="CYP2A" /map="19q13.1-q13.2"
mRNA < 1 1747 CYP2A4 mRNA
BASE COUNT 396 a 471 c 517 g 363 t
ORIGIN
1 gctaccacca tgctggcctc agggctgctt ctggtggcct tgctggcctg cctgactgtg
61 atggtcttga tgtctgtctg gcagcagagg aagagcaggg ggaagctgcc tccgggaccc
121 accccactgc ccttcattgg aaactacctc cagctgaaca cagagcacat atgtgactcc
181 atcatgaagt tcagtgagtg ctatggcccc gtgttcacca ttcacttggg gccccggcgg
241 gtcgtggtgc tgtgtggaca tgatgccgtc agggaggctc tggtggacca ggctgaggag
301 ttcagcgggc gaggcgagca agccaccttc gactgggtct tcaaaggcta tggcgtggcg
361 ttcagcaacg gggagcgcgc caagcagctc ctgcgctttg ccatcgccac cctgagggac
421 ttcggggtgg gcaagcgagg catcgaggag cgcatccagg aggagtcggg cttcctcatc
481 gaggccatcc ggagcagcca cggcgccaat atcgatccca ccttcttcct gagccgcaca
541 gtctccaatg tcatcagctc cattgtcttt ggggaccgct ttgactatga ggacaaagag
601 ttcctgtcac tgctgagcat gatgctagga atcttccagt tcacgtcaac ctccacgggg
661 cagctctatg agatgttctc ttcggtgatg aaacacctgc caggaccaca gcaacaggcc
721 tttaagttgc tgcaagggct ggaggacttc atagccaaga aggtggagca caaccagcgc
781 acgctggatc ccaattcccc acaggacttc atcgactcct ttctcatcca catgcaggag
841 gaggagaaga accccaacac ggagttctac ttgaagaacc tgatgatgag cacgttgaac
901 ctcttcattg caggcacgga gaccgtcagc accaccctgc gctatggctt cttgctgctc
961 atgaagcacc cagaggtgga ggccaaggtc catgaggaga ttgacagagt gatcggcaag
1021 aaccggcagc ccaagtttga ggaccggacc aagatgccct acatggaggc agtgatccac
1081 gagatccaaa gatttggaga cgtgatcccc atgagtttgg cccgcagagt caaaaaggac
1141 accaagtttc gggatttttt cctccctaag ggcaccgaag tgttccctat gctgggctcc
1201 gtgctgagag accccagttt cttctccaac ccccaggact tcaatcccca gcacttcctg
1261 gatgacaagg ggcagtttaa gaagagtgat gcttttgtgc ccttttccat cggaaagcgg
1321 tactgtttcg gagaaggcct ggccagaatg gagctctttc tcttcttcac caccgtcatg
1381 cagaacttcc gcctcaagtc ctcccagtca cctaaggaca ttgacgtgtc ctccaaacac
1441 gtgggctttg ccacgatccc acgaaactac accatgagct tcctgccccg ctgagcgagg
1501 gctgtgccgg tgcaggtctg gtgggcgggg ccagggaaag gcggggtcag ggcggggttc
1561 gcggaagagg cgggtataag aatgggggga agatgcggga aaggaagggg cgtggtggct
1621 agagggaaga gaagaaacag aaggggctca gttcaccttg ataaggtgct tccgtggtgg
1681 gatgagagga agggaaacct tacattatgc tatgaagagt agtaataata gcagctctta
1741 tttcctg
//
LOCUS TRLKPRGXY 2568 bp ds-DNA ORG 31-MAY-1990
DEFINITION Leptomonas sp. 9S and 12S ribosomal RNA genes.
ACCESSION J03814
KEYWORDS 12S ribosomal RNA; 9S ribosomal RNA; ribosomal RNA gene.
SOURCE Leptomonas sp. kinetoplast DNA.
ORGANISM Kinetoplast Leptomonas sp.
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Leptomonas sp..
REFERENCE 1 (bases 1 to 2568)
AUTHORS Lake,J.A., de la Cruz,V.F., Ferreira,P.C.G., Morel,C. and
Simpson,L.
TITLE Evolution of parasitism: Kinetoplastid protozoan history
reconstructed from mitochondrial rRNA gene sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 4779-4783 (1988)
STANDARD simple staff_review
COMMENT Draft entry and computer_readable sequence for [1] kindly provided
by L.Simpson, 06-MAY-1988.
FEATURES from to/span description
rRNA 351 1500 12S rRNA (put.)
rRNA 1651 2250 9S rRNA (put.)
BASE COUNT 968 a 207 c 322 g 1071 t
ORIGIN BamHI site.
1 gatcctaggt aatctcgtca gctgctgatc acaccaaaaa acatatatta caataaatta
61 ctaattatac taatattaaa taagaaagat agaaatgtgg ttaatatgca ttattaaggg
121 atgtaatgat gtgatgacat gtaattaaaa gacctcggtt atatgaatta tttataaaaa
181 cttgtagggt gtggaattaa aagagtatct aagcctaagc cctaagtccc ctctctctca
241 tcgtacttta catcaattat tactgtaatt aattttcgtg agcggagcac atggcgttta
301 atttgagaca taaattagta agaaaagggt aaaaataaat tgtagatttt attttgtttt
361 tatttatcaa tatagttaat aaaataatct agaattttat gttaaatata taattatatt
421 tttgattatt atattttgtt attttattta agttaattaa attgtattat atttaatttt
481 taaatttgtt gttttatatt tagttttatg tttataattt aatgcaatac tgcacatatt
541 ttaaaatttt aatgaactgt tatttatagt tgattatttt tagtttaatg tttaaatatt
601 taactaatgg aggcacagtt gttctatatg taccaataaa aaatagtaaa attaatttaa
661 ttaatttaat aaataattat ttgatcaaaa ttagtacaaa tattatgtaa aatttaaaaa
721 ttaaatattt tatctaatat taacttatat taaatatttg gtttacattg aatattaaat
781 ttacagattt aatttgtaat taatttaata ttgtgtgttt attaaaatag aggattaatg
841 taataaattt ttaattttaa tatttaatat ttgttaaatt aatagtaaca taacttaatt
901 aaaaagaatt atttatacgt ataaattttt aaaattataa ttatattgtt aataaaatta
961 tcaagtttca aaagcgttta ttaaatgcgt ctgtctaaga tttatattta agagtattct
1021 tgtatatagt tttaagtttt aataatccta tttaattata aagttaacct caaaaagcat
1081 ttataagtag catagtaatt tgttaactaa atattaaagt gttccataga aaattttaaa
1141 attataacaa tcatcgtaac taataaatta aaataaaaat tttaaaaaaa attaaaaaat
1201 taaaataggg caagtcctac tctcctttac aaagagaaca tttaatatgt agttgtatgt
1261 ttgattgggg caatactata tcttgttata tagaaaaaga actataatta ttgaaataat
1321 aaaaggttcg agcaggttaa caagcattaa tactaaatgt gtttcatcgt ctacttattg
1381 ctataaaaaa ttgattgttc atcaaaaatg caattcgtta gttgggttaa aatcgttgta
1441 aagcagattt gtttatatat ttaatatttt tattatttta aaaattaata ttagtacgca
1501 aggattcatt atttgttgct ttatttatat ttatatttat atttatattt atatttataa
1561 tatttattat attttaattg cttttttgcg tttgtatcgt ataattacat atttattata
1621 aatatataat tctatattta aatgtagtta ttttatatat gtatatgtat atgtagtcaa
1681 ttgttattat tcatattaat ttttttaatt gtttttaaat tgttttatat tagtttattg
1741 aaatacttat ttattttcaa tgtttaggaa tagttaataa taatttataa ttttgattag
1801 attactttgt tgatgctata agaaggggtg tggaaatagt gttaaatgta agatatataa
1861 ctataataaa taacatataa attattagtc agaaatggat gccagccgtt gcggtaatat
1921 ctatgctttt aaatattata catttattat attaatttgt tacttattaa ttttagtcaa
1981 taaaaagttt tatatatttt aatttgtttt tatacaccat caggtatatg caaatataaa
2041 atgacattaa ttattaatta tattatatta tatttattca tatctttata cacataaaat
2101 ttttagattt tgacaccatg aaaaggctat cgaatggaat tgtatatttt ataatcaaaa
2161 ttaattaatt atattaaatt attaaatttt agataaaaaa ataaattcaa aaggtattgt
2221 tgcccaccaa tttttataat aaaaataacg tgcagtaatt aatgaactta taaaagtaca
2281 ttttttcata ttaaactttc attatctttt ttgttattta aatatttatg cattaaatct
2341 taaaaattta aatatttgca cacatgtgta taatatatta aattgggaac cccgttgaag
2401 gagggacaga ccaagaggac agagaggtcg ggaatttcag cgatttgatt tttttttttt
2461 tgggggagcg gagcagtcga ggaaagccca gagttttcag agcgttggcg aagaagggtc
2521 gttttattcg gaaaataaag accgttctgg aaggggagtt ttttcagg
//
LOCUS MUSCALP1 83 bp ds-DNA ROD 31-MAY-1990
DEFINITION Mouse calpactin I heavy chain (p36) gene, exon 1.
ACCESSION M33321
KEYWORDS calpactin I; calpactin I heavy chain; p36 protein.
SEGMENT 1 of 2
SOURCE Mouse DNA, clones RW22 and B34.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 83)
AUTHORS Amiguet,P., D'Eustachio,P., Kristensen,T., Wetsel,R.A.,
Saris,C.J.M., Hunter,T., Chaplin,D.D. and Tack,B.F.
TITLE Structure and chromosome assignment of the murine p36 (calpactin I
heavy chain) gene
JOURNAL Biochemistry 29, 1226-1232 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 30 / 77 calpactin I heavy chain (p36), exon 1
pre-msg 23 > 83 calpactin I heavy chain mRNA and introns
IVS 78 > 83 calpactin I heavy chain intron A
BASE COUNT 17 a 16 c 16 g 34 t
ORIGIN
1 tttttttttc agcttttttt ttcttcaaaa tgtctactgt ccacgaaatc ctgtgcaagc
61 tcagcctgga gggtgatgta agt
//
LOCUS MUSCALP2 573 bp ds-DNA ROD 31-MAY-1990
DEFINITION Mouse calpactin I heavy chain (p36) gene, exon 12.
ACCESSION M33322
KEYWORDS calpactin I; calpactin I heavy chain; p36 protein.
SEGMENT 2 of 2
SOURCE Mouse DNA, clones RW22 and B34.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 573)
AUTHORS Amiguet,P., D'Eustachio,P., Kristensen,T., Wetsel,R.A.,
Saris,C.J.M., Hunter,T., Chaplin,D.D. and Tack,B.F.
TITLE Structure and chromosome assignment of the murine p36 (calpactin I
heavy chain) gene
JOURNAL Biochemistry 29, 1226-1232 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 7 63 calpactin I heavy chain (p36), exon 12
pre-msg < 1 343 calpactin I heavy chain mRNA and introns
IVS < 1 6 calpactin I heavy chain intron K
BASE COUNT 137 a 145 c 132 g 159 t
ORIGIN About 22 kb after segment 1.
1 ctgcaggaca ccaagggtga ctaccagaag gcactgctgt acctgtgtgg tggggatgac
61 tgaagggctc agcacagtgg atcacccaga agtggctcta cctgtgcccc aacctggcgt
121 tctagagact tcgctctcca ctaatggacc cctgagctcc tccctgtgag gatgatgaca
181 gggctgccga ccctttcccc atcttagctg cccttgcctg gctttctcct cattctctcc
241 tttatgccaa agaagtgaac attccaggga gtggggcgtc agtctgtgac atgagacact
301 tcctcttatg tactgtgtcg tgaataaacc gtttttactt tagaaacaag aatgccgttg
361 cctttgcttt caagccactg ttcagaaagc agacatactt gggatttgat tgttcgccca
421 gggagttagg cccctagagg atgaaagttc tagaacttat ctcttccctc tggtagccac
481 tatacagatg gcactttttt aaaaggatga gtgaatttaa aattatatac agttcaaatt
541 ttacccctta ggcaaagggc tacagctctt atc
//
LOCUS HUMLSERP1 1235 bp ds-DNA PRI 31-MAY-1990
DEFINITION Human leuserpin 2 (hLS2) gene, 5' flank.
ACCESSION M33659 J03921
KEYWORDS leuserpin 2; serine protease inhibitor.
SEGMENT 1 of 2
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1235)
AUTHORS Ragg,H. and Preibisch,G.
TITLE Structure and expression of the gene coding for the human serpin
hLS2
JOURNAL J. Biol. Chem. 263, 12129-12134 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Ragg, 17-JAN-1988
FEATURES from to/span description
IVS 1134 > 1235 leuserpin 2 (hLS2) intron A
BASE COUNT 345 a 313 c 302 g 275 t
ORIGIN
1 ctcgggaggt tgaggctgca gtgagccaag atcacgccac tgcacttcag cctgggtaac
61 agagtgagac cctgtctcaa aaaacacata gggcaggcgt ggtggctcac gcatgtaatc
121 ccagcacttt gggaggccga gacgggagga tcacttcact ccaggagttc aacaccagcc
181 tggccaacat agtgaaaccc cgtctctact aaaaatacaa aaaattagtt ggacatggtg
241 gtgtgcgcct gtaatctcag ccactcagga ggctgaggca ggagaatcgc ttgaacttgg
301 gagacagagg ttgcagtgag ctgagatcgc accactgcac tccagcatgg gcagcagcgc
361 gaaactctgt ctcaaaacaa acaaacaaac aaacaaacac ccataaacac aaaatgtatc
421 acagcctcag agatccccac gaatgcctaa gtggccctga atttgggagg cactgctcag
481 taatagtcct atctgtccca caacagacag gagtgctggg ctgcacctac tggcaacaaa
541 cacagcaacc cttgactgaa gaaaggtcca tgccacaatc cccttattct gtaagccact
601 aattttgtcc tctctcctcc acctttcact gaggaacgag ctcttggaag gacagggaca
661 cccgcctagt agctgagcca gccacatcag tcctggagag caggtggagg gcagatgctg
721 tgatcatccc agaagagagg acacagttgg aggcagatgc atggtctcta ctttcagcta
781 ccctcaatgc agcctggtcc ccagaggcct gaagagcgcc ttgtttatgt ggtgacctca
841 agaggggctg ctcctgcacc aaggctatgt gtgcatgcta acacagtaac cgtcatatac
901 tcaaaatgtc agctctaaga actggagatg aggagctgca agccactcta cagttatcaa
961 aggcacagct gagggggttt gtgctgacca agctggttgc ctggtgtttg gattgggact
1021 tatttacttt ggaaaatatg cagcaacagc ccagcaccaa agttcacatc aaaatcccac
1081 tgatgacctt ggctgctttc atctctgaag cgccacttct cagaaacaca gaggtaagtt
1141 gggtttctaa tgtttctgct gattataaat tatttttggt gtttacggat aggcaactgg
1201 ttcatttttc tagcaaacta agaattcaga agctt
//
LOCUS HUMLSERP2 408 bp ds-DNA PRI 31-MAY-1990
DEFINITION Human leuserpin 2 (hLS2) gene, exon 2.
ACCESSION M33660 J03921
KEYWORDS leuserpin 2; serine protease inhibitor.
SEGMENT 2 of 2
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 408)
AUTHORS Ragg,H. and Preibisch,G.
TITLE Structure and expression of the gene coding for the human serpin
hLS2
JOURNAL J. Biol. Chem. 263, 12129-12134 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Ragg, 17-JAN-1988
FEATURES from to/span description
pept 52 > 408 leuserpin 2 (hLS2) precursor, exon 2 (first
expressed exon)
sigp 52 108 leuserpin 2 (hLS2) signal peptide
matp 109 > 408 leuserpin 2 (hLS2)
IVS < 1 36 hLS2 intron A
BASE COUNT 107 a 111 c 98 g 92 t
ORIGIN About 5.2 kb after segment 1.
1 aggccgcctt tcactgtgtt ctgttttccc tcccagcttt agctccgcaa aatgaaacac
61 tcattaaacg cacttctcat tttcctcatc ataacatctg cgtggggtgg gagcaaaggc
121 ccgctggatc agctagagaa aggaggggaa actgctcagt ctgcagatcc ccagtgggag
181 cagttaaata acaaaaacct gagcatgcct cttctccctg ccgacttcca caaggaaaac
241 accgtcacca acgactggat tccagagggg gaggaggacg acgactatct ggacctggag
301 aagatattca gtgaagacga cgactacatc gacatcgtcg acagtctgtc agtttccccg
361 acagactctg atgtgagtgc tgggaacatc ctccagcttt ttcatggc
//
LOCUS MUSIGHAAN 375 bp ss-mRNA ROD 31-MAY-1990
DEFINITION Mouse Ig rearranged gamma-chain mRNA V-D-J4-region, hybridoma A6.1,
partial cds.
ACCESSION M33558
KEYWORDS N-region; diversity exon; gamma-immunoglobulin;
immunoglobulin heavy chain; joining exon; processed gene;
variable region.
SOURCE Mouse (strain BW) hybridoma A6.1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 375)
AUTHORS Tsao,B.P., Ebling,F.M., Roman,C., Panosian-Sahakian,N.,
Calame,K. and Hahn,B.H.
TITLE Structural characteristics of the variable regions of
immunoglobulin genes encoding a pathogenic autoantibody in murine
lupus
JOURNAL J. Clin. Invest. 85, 530-540 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 375 Ig gamma-chain V-D-J4-region (AA at 1)
recomb 300 301 V-region DNA end (3' end put.)/D-region DNA
start (5' end put.)
recomb 324 325 D-region DNA end/N-region DNA start
recomb 327 328 N-region DNA end/J-region DNA start
BASE COUNT 93 a 90 c 97 g 95 t
ORIGIN
1 gaggtccagc tgcagcagtc tggacctgag ctggtgaagc ctggggcttc agtgaagatg
61 tcctgcaagg cttctggtta ctcctttact ggctacttta tgaactgggt gaagcagagc
121 catggaaaga gccttgagtg gattggacgt attaatcctt acaatggtga tactttctac
181 aaccagaagt tcaagggcaa ggccacattg actgtagaca aatcctctag cacagcccac
241 atggagctcc ggagcctgac atctgaggac tctgcactct attattgtgc aagagattcc
301 ccttattact acggtagtag ctacgggtat gctatggact actggggtca aggaacctca
361 gtcaccgtct cctca
//
LOCUS MUSIGKABE 321 bp ss-mRNA ROD 31-MAY-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J2-region, hybridoma A6.1,
partial cds.
ACCESSION M33559
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BW) hybridoma A6.1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Tsao,B.P., Ebling,F.M., Roman,C., Panosian-Sahakian,N.,
Calame,K. and Hahn,B.H.
TITLE Structural characteristics of the variable regions of
immunoglobulin genes encoding a pathogenic autoantibody in murine
lupus
JOURNAL J. Clin. Invest. 85, 530-540 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 321 Ig kappa-chain V-J2-region (AA at 1)
recomb 291 292 V-region DNA end/J-region DNA start
BASE COUNT 82 a 74 c 76 g 89 t
ORIGIN
1 gacatccaga tgattcagtc tccatcgtcc atgtttgcct ctctgggaga cagagtcagt
61 ctctcttgtc gggctagtca gggcattaga ggtaatttag actggtatca gcagaaacca
121 ggtggaacta ttaaactcct gatctactcc acatccaatt taaattctgg tgtcccatca
181 aggttcagtg gcagtgggtc tgggtcagat tattctctca ccatcagcag cctagagtct
241 gaagattttg cagactatta ctgtctacag cgtaatgcgt atcctctcac gttcggaggg
301 gggaccaagc tggaaataaa a
//
LOCUS RATA2UGLBB 852 bp ss-mRNA ROD 31-MAY-1990
DEFINITION Rat alpha-2u-globulin (L type) mRNA, complete cds., clone pA110.
ACCESSION M26836
KEYWORDS alpha-2u-globulin; globulin.
SOURCE Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA110.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 852)
AUTHORS Ichiyoshi,Y., Endo,H. and Yamamoto,M.
TITLE Length polymorphism in the 3' noncoding region of rat hepatic
alpha-2u-globulin mRNAs
JOURNAL Biochim. Biophys. Acta 910, 43-51 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 4 549 alpha-2u-globulin
mRNA < 1 852 alpha-2u-globulin mRNA
BASE COUNT 246 a 186 c 198 g 222 t
ORIGIN
1 aacatgaagc tgttgctgct gctgctgtgt ctgggcctga cactggtctg tggccatgca
61 gaagaagcta gttccacaag agggaacctc gatgtggaca agctcaatgg ggattggttt
121 tctattgtcg tggcctctga caaaagagaa aagatagaag agaatgccag catgagagtt
181 tttatgcagc acatcgatgt cttggagaat tccttaggct tcaagttccg tattaaggaa
241 aatggagagt gcagggaact atatttggtt gcctacaaaa cgccagagga tggcgaatat
301 tttgttgagt atgacggagg gaatacattt actatactta agacagacta tgacagatat
361 gtcatgtttc atctcattaa tttcaagaac ggggaaacct tccaggcgat ggtgctctac
421 ggcagaacaa aggatctgag ttcagacatc aaggaaaagt ttgcaaaact atgtgaggcg
481 catggaatca ctagggacaa tatcattgat ctaaccaaga ctgatcactg tctccaggcc
541 cgaggatgaa gaaaggcctg agactccagg tgggcaatct ccagtgagag caagtgctga
601 gtggagactt ctcaccagga ctctagcatc accatttcct gtccatggag catcctgaga
661 caaattctgc gatctgattt ccatcctctg tcacagaaaa gtgcaatcct ggtctctcca
721 gcatcttccc tagttaccca ggacaacaca tcgagaatta aaagctttct taaatttctc
781 ttggccccac ccatgatcat tccgcacaaa tttcttgctc ttgcagttca ataaatgatt
841 acccttgcac tt
//
LOCUS RATA2UGLBC 838 bp ss-mRNA ROD 31-MAY-1990
DEFINITION Rat alpha-2u-globulin (S type) mRNA, complete cds., clone pA120.
ACCESSION M26837
KEYWORDS alpha-2u-globulin; globulin.
SOURCE Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA120.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 838)
AUTHORS Ichiyoshi,Y., Endo,H. and Yamamoto,M.
TITLE Length polymorphism in the 3' noncoding region of rat hepatic
alpha-2u-globulin mRNAs
JOURNAL Biochim. Biophys. Acta 910, 43-51 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 15 560 alpha-2u-globulin
mRNA < 1 838 alpha-2u-globulin mRNA
BASE COUNT 240 a 186 c 189 g 223 t
ORIGIN
1 tattccctac caacatgaag ctgttgctgc tgctgctgtg tctgggcctg acactggtct
61 gtggccatgc agaagaagct agttccacaa gagggaacct cgatgtggct aagctcaatg
121 gggattggtt ttctattgtc gtggcctcta acaaaagaga aaagatagaa gagaatggca
181 gcatgagagt ttttatgcag cacatcgatg tcttggagaa ttccttaggc ttcaagttcc
241 gtattaagga aaatggagag tgcagggaac tatatttggt tgcctacaaa acgccagagg
301 atggcgaata ttttgttgag tatgacggag ggaatacatt tactatactt aagacagact
361 atgacagata tgtcatgttt catctcatta atttcaagaa cggggaaacc ttccagctga
421 tggtgctcta cggcagaaca aaggatctga gttcagacat caaggaaaag tttgcaaaac
481 tatgtgaggc gcatggaatc actagggaca atatcattga tctaaccaag actgatcgct
541 gtctccaggc ccgaggatga agaaaggcct gagcctccag tgctgagtgg agacttctca
601 ccaggactct agcatcacca tttcctgtcc atggagcatc ctgagacaaa ttctgcgatc
661 tgatttccat cctctgtcac agaaaagtgc aatcctggtc tctccagcat cttccctagt
721 tacccaggac aacacatcga gaattaaaag ctttcttaaa tttctcttgg ccccacccat
781 gatcattccg cacaaatatc ttgctcttgc agttcaataa atgattaccc ttgcactt
//
LOCUS RATA2UGLBD 835 bp ss-mRNA ROD 31-MAY-1990
DEFINITION Rat alpha-2u-globulin (S type) mRNA, complete cds., clone pA108.
ACCESSION M26838
KEYWORDS alpha-2u-globulin; globulin.
SOURCE Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA108.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 835)
AUTHORS Ichiyoshi,Y., Endo,H. and Yamamoto,M.
TITLE Length polymorphism in the 3' noncoding region of rat hepatic
alpha-2u-globulin mRNAs
JOURNAL Biochim. Biophys. Acta 910, 43-51 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 12 557 alpha-2u-globulin
mRNA < 1 835 alpha-2u-globulin mRNA
BASE COUNT 240 a 185 c 190 g 220 t
ORIGIN
1 tccctaccaa catgaagctg ttgctgctgc tgctgtgtct gggcctgaca ctggtctgtg
61 gccatgcaga agaagctagt tccacaagag ggaacctcga tgtggacaag ctcaatgggg
121 attggttttc tattgtcgtg gcctctgaca aaagagaaaa gatagaagag aatggcagca
181 tgagagtttt tatgcagcac atcgatgtct tggagaattc cttaggcttc aagttccgta
241 ttaaggaaaa tggagagtgc agggaactat atttggttgc ctacaaaacg ccagaggatg
301 gcgaatattt tgttgagtat gacggaggga atacatttac tatacttaag acagactatg
361 acagatatgt catgtttcat ctcattaatt tcaagaacgg ggaaaccttc caggcgatgg
421 tgctctacgg cagaacaaag gatctgagtt cagacatcaa ggaaaagttt gcaaaactat
481 gtgaggcgca tggaatcact agggacaata tcattgatct aaccaagact gatcactgtc
541 tccaggcccg aggatgaaga aaggcctgag actccagtgc tgagtggaga cttctcacca
601 ggactctagc atcaccattt cctgtccatg gagcatcctg agacaaattc tgcgatctga
661 tttccatcct ctgtcacaga aaagtgcaat cctggtctct ccagcatctt ccctagttac
721 ccaggacaac acatcgagaa ttaaaagctt tcttaaattt ctcttggccc cacccatgat
781 cattccgcac aaatttcttg ctcttgcagt tcaataaatg attacccttg cactt
//
LOCUS RATA2UGLBE 867 bp ss-mRNA ROD 31-MAY-1990
DEFINITION Rat alpha-2u-globulin (L type) mRNA, complete cds., clone pA8.
ACCESSION M26835
KEYWORDS alpha-2u-globulin; globulin.
SOURCE Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA8.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 867)
AUTHORS Ichiyoshi,Y., Endo,H. and Yamamoto,M.
TITLE Length polymorphism in the 3' noncoding region of rat hepatic
alpha-2u-globulin mRNAs
JOURNAL Biochim. Biophys. Acta 910, 43-51 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 19 564 alpha-2u-globulin
mRNA < 1 867 alpha-2u-globulin mRNA
BASE COUNT 248 a 192 c 198 g 229 t
ORIGIN
1 attctattcc ctaccaacat gaagctgttg ctgctgctgc tgtgtctggg cctgacactg
61 gtctgtggcc atgcagaaga agctagttcc acaagaggga acctcgatgt ggctaagctc
121 aatggggatt ggttttctat tgtcgtggcc tctaacaaaa gagaaaagat agaagagaat
181 ggcagcatga gagtttttat gcagcacatc gatgtcttgg agaattcctt aggcttcaag
241 ttccgtatta aggaaaatgg agagtgcagg gaactatatt tggttgccta caaaacgcca
301 gaggatggcg aatattttgt tgagtatgac ggagggaata catttactat acttaagaca
361 gactatgaca gatatgtcat gtttcatctc attaatttca agaacgggga aaccttccag
421 ctgatggtgc tctacggcag aacaaaggat ctgagttcag acatcaagga aaagtttgca
481 aaactatgtg aggcgcatgg aatcactagg gacaatatca ttgatctaac caagactgat
541 cgctgtctcc aggcccgagg atgaagaaag gcctgagcct ccaggtgggc aatctccagt
601 gagagcaagt gctgagtgga gacttctcac caggactcta gcatcaccat ttcctgtcca
661 tggagcatcc tgagacaaat tctgcgatct gatttccatc ctctgtcaca gaaaagtgca
721 atcctggtct ctccagcatc ttccctagtt acccaggaca acacatcgag aattaaaagc
781 tttcttaaat ttctcttggc cccacccatg atcattccgc acaaatatct tgctcttgca
841 gttcaataaa tgattaccct tgcactt
//
LOCUS MZENAR1 386 bp ss-mRNA PLN 31-MAY-1990
DEFINITION Maize glyceraldehyde-3-phosphate dehydrogenase mRNA, 5' end.
ACCESSION M31481 M31482
KEYWORDS glyceraldehyde-3-phosphate dehydrogenase.
SEGMENT 1 of 2
SOURCE Maize leaf (strain L. W64xW128E), cDNA to mRNA, clone Zm6.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 386)
AUTHORS Gowri,G. and Campbell,W.H.
TITLE cDNA clones for corn leaf NADH:nitrate reductase and chloroplast
NAD(P)+: glyceraldehyde-3-phosphate dehydrogenase
JOURNAL Plant Physiol. 90, 792-798 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 12 > 386 glyceraldehyde-3-phosphate dehydrogenase
precursor
sigp 12 209 glyceraldehyde-3-phosphate dehydrogenase
signal peptide
matp 210 > 386 glyceraldehyde-3-phosphate dehydrogenase
mRNA < 1 > 386 glyceraldehyde-3-phosphate dehydrogenase mRNA
BASE COUNT 57 a 143 c 125 g 61 t
ORIGIN
1 ggtccgtggc catggcgtcg tccatgctct ccgctaccac cgtgccactc cagcaggggg
61 gcggcctgtc cgagttctcc gggctcagga gctccgcgtc gctgcccatg cgccggaatg
121 ccacctccga cgacttcatg tccgccgtct ccttcaggac ccacgcggtc ggtacgagcg
181 gcgggccgcg gcgggcgccg acggaggcaa agctgaaggt ggccatcaac gggttcggcc
241 gcatcgggcg caacttcctg cggtgctggc acgggcgcgg cgacgcgtcg cccctcgacg
301 tcatcgccat caacgacacc ggaggcgtca agcaggcgtc ccacctgctc aagtacgact
361 ccacgctcgg catcttcgat gccgat
//
LOCUS MZENAR2 456 bp ss-mRNA PLN 31-MAY-1990
DEFINITION Maize glyceraldehyde-3-phosphate dehydrogenase, 3' end.
ACCESSION M31483
KEYWORDS glyceraldehyde-3-phosphate dehydrogenase.
SEGMENT 2 of 2
SOURCE Maize leaf (strain L. W64xW1282E), cDNA to mRNA, clone Zm6.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 456)
AUTHORS Gowri,G. and Campbell,W.H.
TITLE cDNA clones for corn leaf NADH:nitrate reductase and chloroplast
NAD(P)+: glyceraldehyde-3-phosphate dehydrogenase
JOURNAL Plant Physiol. 90, 792-798 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 312 glyceraldehyde-3-phosphate dehydrogenase
precursor (AA at 1)
matp < 1 309 glyceraldehyde-3-phosphate dehydrogenase
mRNA < 1 456 glyceraldehyde-3-phosphate dehydrogenase mRNA
BASE COUNT 87 a 150 c 122 g 97 t
ORIGIN About 525 bp after segment 1.
1 cgggtgccca ccccgaacgt ctccgtcgtc gacctcgtcg tgcaggtctc caagaagacc
61 ctcgccgagg aggtgaacca ggcgttccgc gacgccgcgg ccaacgagct cacgggcatc
121 ctcgaggtct gcgacgtgcc gctcgtgtcc gtcgacttca ggtgctccga cgtctcctcc
181 accatcgacg cctccctcac catggtcatg ggcgacgaca tggtcaaggt catctcctgg
241 tacgacaacg agtggggcta ctcgcagagg gtcgtcgacc tcgctgacat ctgcgccaac
301 cagtggaagt gaagcgttgc attgtaatag cactttgtaa ctcctcaccc tccacagggc
361 tttgggattc accccgaaat gactcacttt cggtgaaatt cacccgtttc ggcgaggtcg
421 tccgaatata ccggccaaat gtttccattg gatttc
//GenBank-Updates@genbank.bio.net (06/01/90)
LOCUS ECONANA 1243 bp ds-DNA BCT 01-JUN-1990
DEFINITION Escherichia coli nanA gene encoding N-acetylneuraminate lyase,
complete cds.
ACCESSION M20207 D00067 N00067
KEYWORDS N-acetylneuraminate lyase; NALase; nanA gene; npl gene.
SOURCE E.coli JE1011 DNA, 1.2 kb HindIII-EcoRI fragment DNA containing
nanA gene, which is cloned in pNL4, for [1]. E.coli C600 DNA, clone
pMK6, for [2].
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1243)
AUTHORS Kawakami,B., Kudo,T., Narahashi,Y. and Horikoshi,K.
TITLE Nucleotide sequence of the N-acetylneuraminate lyase gene of
Escherichia coli
JOURNAL Agric. Biol. Chem. 50, 2155-2158 (1986)
STANDARD simple automatic
FEATURES from to/span description
pept 93 986 N-acetylneuraminate lyase
BASE COUNT 300 a 309 c 322 g 312 t
ORIGIN 1 bp upstream of HindIII site.
1 aagctttctg tatggggtgt tgcttaattg atctggtata acaggtataa aggtatatcg
61 tttatcagac aagcatcact tcagaggtat ttatggcaac gaatttacgt ggcgtaatgg
121 ctgcactcct gactcctttt gaccaacaac aagcactgga taaagcgagt ctgcgtcgcc
181 tggttcagtt caatattcag cagggcatcg acggtttata cgtgggtggt tcgaccggcg
241 aggcctttgt acaaagcctt tccgagcgtg aacaggtact ggaaatcgtc gccgaagagg
301 cgaaaggtaa gattaaactc atcgcccacg tcggttgcgt cagcaccgcc gaaagccaac
361 aacttgcggc atcggctaaa cgttatggct tcgatgccgt ctccgccgtc acgccgttct
421 actatccttt cagctttgaa gaacactgcg atcactatcg ggcaattatt gattcggcgg
481 atggtttgcc gatggtggtg tacaacattc cagccctgag tggggtaaaa ctgaccctgg
541 atcagatcaa cacacttgtt acattgcctg gcgtaggtgc gctgaaacag acctctggcg
601 atctctatca gatggagcag atccgtcgtg aacatcctga tcttgtgctc tataacggtt
661 acgacgaaat cttcgcctct ggtctgctgg cgggcgctga tggtggtatc ggcagtacct
721 acaacatcat gggctggcgc tatcagggga tcgttaaggc gctgaaagaa ggcgatatcc
781 agaccgcgca gaaactgcaa actgaatgca ataaagtcat tgatttactg atcaaaacgg
841 gcgtattccg cggcctgaaa actgtcctcc attatatgga tgtcgtttct gtgccgctgt
901 gccgcaaacc gtttggaccg gtagatgaaa aatatctgcc agaactgaag gcgctggccc
961 agcagttgat gcaagagcgc gggtgagttg tttcccctcg ctcgccccta ccggtgaggg
1021 gaaataaacg catctgtacc ctacaatttt cataccaaag cgtgtgggca tcgcccaccg
1081 cgggagactc acaatgagta ctacaaccca gaatatcccg tggtatcgcc atctcaaccg
1141 tgcacaatgg cgcgcatttt ccgctgcctg gttgggatat ctgcttgacg gttttgattt
1201 cgttttaatc gccctggtac tcaccgaagt acaaggtgaa ttc
//
LOCUS MACEPO 1462 bp ss-mRNA PRI 01-JUN-1990
DEFINITION Monkey (Cynomolgus) erythropoietin mRNA, complete cds.
ACCESSION M18189 M15818 M15819 M18188
KEYWORDS erythropoietin.
SOURCE Monkey cDNA to mRNA.
ORGANISM Macaca fascicularis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini;
Cercopithecidae; Cercopithecinae.
REFERENCE 1 (bases 1 to 1462)
AUTHORS Lin,F.-K., Lin,C.-H., Lai,P.-H., Browne,J.K., Egrie,J.C.,
Smalling,R., Fox,G.M., Chen,K.K., Castro,M. and Suggs,S.
TITLE Monkey erythropoietin gene: Cloning, expression and comparison with
the human erythropoietin gene
JOURNAL Gene 44, 201-209 (1986)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by F.-K.Lin, 04-AUG-1987.
FEATURES from to/span description
pept 217 795 erythropoietin precursor
sigp 217 288 erythropoietin signal peptide
matp 289 792 erythropoietin
BASE COUNT 297 a 455 c 433 g 277 t
ORIGIN Unreported.
1 agagtccctg ggccaccccg gtcgctcgct gcgtgcgtcc cggagccgga ccggggccac
61 cgcgccggct ctgcaccgat cccgcgcccc ctggacagcc gccctctcct ccaggcccgt
121 ggggctggcc ctgccccgct gaacttcccg ggatgaggac tcccggtgtg gtcaccgcgc
181 gcctaggtcg ctgagggacc ccggccaggc gcggagatgg gggtgcacga atgtcctgcc
241 tggctgtggc ttctcctgtc tctcgtgtcg ctccctctgg gcctcccagt cccgggcgcc
301 ccaccacgcc tcatctgtga cagccgagtc ctggagaggt acctcttgga ggccaaggag
361 gccgagaatg tcacgatggg ctgttccgaa agctgcagct tgaatgagaa tatcaccgtc
421 ccagacacca aagttaactt ctatgcctgg aagaggatgg aggtcgggca gcaggctgta
481 gaagtctggc agggcctggc cctgctctca gaagctgtcc tgcggggcca ggccgtgttg
541 gccaactctt cccagccttt cgagcccctg cagctgcaca tggataaagc catcagtggc
601 cttcgcagca tcaccactct gcttcgggcg ctgggagccc aggaagccat ctccctccca
661 gatgcggcct cggctgctcc actccgaacc atcactgctg acactttctg caaactcttc
721 cgagtctact ccaatttcct ccggggaaag ctgaagctgt acacggggga ggcctgcagg
781 agaggggaca gatgaccagg tgcgtccagc tgggcacatc caccacctcc ctcaccaaca
841 ctgcctgtgc cacaccctcc ctcaccactc ccgaacccca tcgaggggct ctcagctaag
901 cgccagcctg tcccatggac actccagtgc cagcaatgac atctcagggg ccagaggaac
961 tgtccagagc acaactctga gatctaagga tgtcgcaggg ccaacttgag ggcccagagc
1021 aggaagcatt cagagagcag ctttaaactc aggagcagag acaatgctgg gaaaacacct
1081 gagctcactc ggccacctgc aaaatttgat gccaggacac gctttggagg caatttacct
1141 gtttttgcac ctaccatcag ggacaggatg acctggagaa cttaggtggc aagctgtgac
1201 ttctccaggc ctcacgggca ctcccttggt ggcaagagcc cccttgacac tgagagaata
1261 ttttgcaatc tgcagcagga aaaattacgg acaggttttg gaggttggag ggtacttgac
1321 aggtgtgtgg ggaagcaggg cggtaggggt ggagctggga tgcgagtgag aaccgtgaag
1381 acaggatggg ggctggcctc tggttctcgt ggggtccaag ctttgtgtat tcttcaatct
1441 cactggcaag aactgaaacc ac
//
LOCUS AFACFXP1 1751 bp ds-DNA BCT 01-JUN-1990
DEFINITION A.eutrophus phosphoribulokinase P (cfxP) gene, complete cds.
ACCESSION M33562
KEYWORDS phosphoribulokinase.
SOURCE A.eutrophus (strain H16) DNA.
ORGANISM Alcaligenes eutrophus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Alcaligenaceae.
REFERENCE 1 (bases 1 to 1751)
AUTHORS Kossmann,J., Klintworth,R. and Bowien,B.
TITLE Sequence analysis of the chromosomal and plasmid genes encoding
phosphoribulokinase from Alcaligenes eutrophus
JOURNAL Gene 85, 247-252 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 621 phosphoribulokinase F(AA at 1)
pept 633 1511 phosphoribulokinase P
BASE COUNT 310 a 614 c 542 g 285 t
ORIGIN
1 ctgcagcccg gcagcgccca ggtggcggcc ggctacgcgc tctacggtcc caccaccatg
61 ctggtgctga ccgtgggcaa tggcgtcaac ggcttcacgc tcgatcccaa cctgggcgag
121 ttcttcctga cgcaccccca cctgcgggtg ccgcccgata cccaggaatt tgccatcaac
181 gcgtcgaaca gccgcttctg ggaagcgccg gtgcagcgct acatcggcga gtgcatggct
241 ggcaagagcg ggccgcgcgg caaggatttc aatatgcgct ggatcgcatc gatggtggcc
301 gaggcgcacc gcatcctgat gcgcggcggt gtcttcatgt acccgcgcga caccaaggat
361 cccgccaagc cgggccgcct tcgcctgctg tacgaggcca acccgatcgc cttcctgatg
421 gagcaggccg gcgggcgcgc cagcacgggc cggcagacgc tgatgtcggt ggcgccgggc
481 gcgctgcacc agcgtatcgg cgtgatcttc ggctcgcgca atgaagtgga acggatcgag
541 ggctaccaca ccaaccagac cgaccccgac cttccgagtc ccctgttcaa cgagcgcagc
601 ctgttccgcg cgtccgcctg aggtgcctgg ccatgtcaga acgttatccc attatcgcta
661 tcaccgggtc ctcgggcgcc ggtaccacgt cggtgacccg taccttcgag aacatcttct
721 gccgtgaagg cgtgaagtcg gtcgtgatcg aaggcgacag cttccaccgc tacgaccgtg
781 ccgagatgaa ggtcaagatg gccgaggccg aacgcaccgg caatatgaac ttcagccact
841 tcggcgcgga gaacaacctg ttcggcgacc tggagagcct gttccgctcc tacgcggaat
901 ccggcacggg catgcgccgg cgctatctgc acagcacgga ggaggccgcg ccgttcgggc
961 agcagcccgg caccttcacc gcgtgggagc cattgccggc cgataccgac ctgctgttct
1021 acgaaggcct gcacggcggc gtggtcaccg acgaggtcaa tgtcgcccag tatcccaacc
1081 tgctgatcgg cgtggtgccc gtcatcaacc tggagtggat tcagaagctc tggcgcgaca
1141 agaaacagcg cggctactcg accgaggccg tgaccgacac catcctgcgc cgcatgccgg
1201 actacgtgaa ctacatctgc ccgcagttct cgcgcacgca tgtgaacttc cagcgggtgc
1261 cgtgcgtgga cacgtccaac ccctttatct cgcgcgaaat ccccgcgccc gatgaaagca
1321 tggtggtgat ccgctttgcc aacccgaagg ggatcgactt ccagtacctg ctgagcatga
1381 tccacgactc cttcatgtcg cgcgccaaca ccatcgtggt gccgggcggc aagatggaac
1441 tggccatgca gctgatcttc acgcccttcg tgctgcgcat gatggagcgc cgcaagcgcg
1501 ccgcgctgta aggagccgag atgaacgcac ccgaacgcat cgaccccgca gcgcgttgcg
1561 ccaacgcgct gcgcttcctg gccgccgacg cggtggagtt ggcccgctcc ggccaccccg
1621 gcgcccccat gggcatggcc gagatggccg aggtcgtgtg gcggcgccac ctgcgccaca
1681 acccggccaa cccggcctgg cccgaccgtg accgcttcgt gctgtccaac ggccatgcct
1741 ccatgctgca g
//
LOCUS AFACFXP2 1641 bp ds-DNA BCT 01-JUN-1990
DEFINITION A.eutrophus phosphoribulokinase P (cfxF) gene, complete cds.
ACCESSION M33563
KEYWORDS phosphoribulokinase.
SOURCE A.eutrophus (strain H16) DNA.
ORGANISM Alcaligenes eutrophus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Alcaligenaceae.
REFERENCE 1 (bases 1 to 1641)
AUTHORS Kossmann,J., Klintworth,R. and Bowien,B.
TITLE Sequence analysis of the chromosomal and plasmid genes encoding
phosphoribulokinase from Alcaligenes eutrophus
JOURNAL Gene 85, 247-252 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 480 phosphoribulokinase F
pept 492 1370 phosphoribulokinase P
BASE COUNT 306 a 555 c 505 g 275 t
ORIGIN
1 ctgcaggtgc cggccgatac ccaggaattt gccatcaatg cgtcgaacag ccgcttctgg
61 gaagcgccgg tgcagcgcta catcgccgag tgcatggccg gcaagagcgg gccgcgcggc
121 aaggatttca atatgcgctg gatcgcgtcg atggtggccg aggcgcaccg catcctgatg
181 cgtggcggcg tcttcatgta cccgcgcgac tccaaggatc ccgccaagcc gggccgcctg
241 cgcctgctgt acgaggccaa tccgatcgcc ttcctgatgg agcaggctgg cgggcgcgcc
301 agcacgggcc ggcagacgct gatgtcggtg gcgccgggtg cgctgcacca gcgcattggc
361 gtgatcttcg gctcgcgcaa tgaagtggaa cggatcgagg gctaccacac cgaccagacc
421 gatcccgacc ttccgagtcc cctgttcaac gagcgcagcc tgttccgcgc gtctgcctga
481 ggtgcctggc catgtcagaa cgttatccca ttatcgccat caccggctcc tcgggcgccg
541 gtaccacgtc ggtgacccgt accttcgaga acatcttccg ccgcgagggc gtgaagtcag
601 tggtgatcga aggcgacagc ttccaccgct atgaccgcgc cgagatgaag gtcaagatgg
661 ccgaggccga gcgcaccggc aatatgaact tcagccactt cggcgaggag aacaacctgt
721 tcggcgagct ggagaacctg ttccgctcct atgccgaaac cggcacgggc atgcaccggc
781 actacctgca tagcccggag gaagccgcgc cgttcgggca ggaacccggc accttcacgc
841 agtgggagcc gctgccggcc gataccgatc tgctgttcta cgagggcctg cacggcggcg
901 tggtcaccga ttcggtcaat gtcgcccagt atcccaacct gctgatcggc gtggtccctg
961 tcatcaacct ggaatggatc cagaagctct ggcgcgacaa gaagcagcgc ggctattcga
1021 ccgaggccgt gaccgacacc atcctgcgcc gcatgccgga ctacgtgaat tacatctgtc
1081 cgcagttctc gcgcacgcat gtgaacttcc agcgcgtgcc gtgcgtggat acgtccaacc
1141 cctttatctc gcgcgaaatc cccgcgcccg atgaaagcat ggtggtgatc cgctttgcca
1201 acccgaaggg gatcgacttc cagtacctgc tgagcatgat ccacgactcc ttcatgtcgc
1261 gcgccaacac catcgtggtg cccggcggca agatggagct ggccatgcag ctgatcttca
1321 cgcccttcgt gctgcgcatg atggagcgcc gcaagcgcgc cgcgcagtaa ggactcgacc
1381 gtgaaacgcc gactcgcgag gatctcctga atgaacgcac cagaacgcat cgattccgca
1441 gcgcgctgtg ccaacgcgct gcgcttcctg gccgccgacg cggtggagca ggccaaatcg
1501 ggccaccccg gcgcacccat gggcatggcc gaaatggccg aagtgctgtg gcggcgccac
1561 ctgcgacaca acccggccaa cccggcctgg cccgaccgtg accgcttcgt gctgtccaac
1621 ggccatgcgt ccatgctgca g
//
LOCUS BIFLLD 1767 bp ds-DNA BCT 01-JUN-1990
DEFINITION B.longum L-lactate dehydrogenase (ldh) gene, complete cds.
ACCESSION M33585
KEYWORDS L-lactate dehydrogenase.
SOURCE B.longum (strain aM101-2) DNA.
ORGANISM Bifidobacterium longum
Prokaryota; Bacteria; Irregular, nonsporing, gram-positive rods.
REFERENCE 1 (bases 1 to 1767)
AUTHORS Minowa,T., Iwata,S., Sakai,H., Masaki,H. and Ohta,T.
TITLE Sequence and characteristics of the Bifidobacterium longum gene
encoding L-lactate dehydrogenase and the primary structure of the
enzyme: A new feature of the allosteric site
JOURNAL Gene 85, 161-168 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 555 1517 L-lactate dehydrogenase (EC 1.1.1.27)
mRNA 493 > 1517 L-lactate dehydrogenase mRNA
BASE COUNT 373 a 555 c 488 g 351 t
ORIGIN
1 gtcgacgcgg tcaatgacgt gttggcggac atcgaaggca cggcctcgat tccgcgtatt
61 ctcgtattca acaaggccga tcaggcggac gaggcgactc gtgaacgact cgccgcgctg
121 cagccagatg cgttcatcgt ctccgcctat accggtgagg gattggacga gctgcgtacc
181 gcggtcgaaa gtctgctgcc ggtcccgcat gtgcatgtca acgctctgct gccgtatacc
241 gctggctccc tgatctctcg tgtacgcgaa tacggcaagg tagacaaggt ggagtaccgc
301 gatgatggca tacagcttga agcggacgtt gatgcccatc ttgcgctcag gtggtcgaac
361 agtccattga ctaacgtgat aaacatcaca gtatattcgt gagcgctaac aaccgttgaa
421 aacattacca tacggttgtc aaacagggtg gtgtgccggt agcaaaacgt cttagcgggt
481 ttatagagtg aagacgttag ttacaaggcc tgccattcat cagcagaccg cctttgaaga
541 gaggttcatc catcatggcg gaaactaccg ttaagcccac gaagcttgct gttattggtg
601 ccggtgccgt tggctccacc ctcgccttcg ccgctgccca gcgtggcatc gctcgcgaga
661 tcgtgcttga agacatcgcc aaggagcgcg tggaagccga agtgctcgac atgcagcatg
721 gctccagctt ctacccgacc gtgtccatcg acggttccga cgatcctgag atctgccgcg
781 acgccgacat ggtcgtcatc accgctggtc cgcgtcagaa gccgggtcag tctcgtcttg
841 agctcgttgg cgctaccgtc aacatcctca aggccatcat gccgaacctg gtcaaggtgg
901 ctccgaacgc catctacatg ctcatcacca acccggtcga catcgctacc cacgtggctc
961 agaagctcac cggtctgccc gagaaccaga tcttcggttc cggcaccaac ctggactccg
1021 ctcgtctgcg cttcctgatt gcccagcaga ccggcgtcaa cgtcaagaac gtgcacgcct
1081 acatcgccgg cgagcacggc gactccgaag tcccgctgtg ggagtccgcc accatcggtg
1141 gcgtccccat gtgcgactgg accccgctgc ccggccacga tccgctcgac gccgacaagc
1201 gcgaggagat ccaccaggaa gtcaagaacg ccgcttacaa gatcatcaac ggtaagggtg
1261 ccaccaacta cgccatcggc atgtccggcg tcgacatcat cgaagccgtc ctgcacgaca
1321 ccaaccgcat tctgcccgtg agctccatgc tcaaggactt ccacggcatc tccgacatct
1381 gcatgtccgt gccgaccctc ctcaaccgtc agggcgtcaa caacaccatc aacaccccgg
1441 tctccgacaa ggagctcgcc gctctgaagc gctctgccga gacgctgaag gaaaccgccg
1501 cccagttcgg cttctgataa aaaatcgctg tacggagcgc ctttcgcacc gcagcaagac
1561 tcgacctacc tttgtaggcc ttcaccttgt gcggcacgaa aatcgcacac gtacagcgat
1621 tttttatacc ctgctgaatg ctcccgttgg gagcattttt tattcgtgga gttcgttgga
1681 ggagggggag ctatagccct cgggctcgag ctggaaggtg gtgtggggga ccgagaccgg
1741 gaagtgctcg cgcaggcaat cctgcag
//
LOCUS BOVP450SCC 1471 bp ds-DNA MAM 01-JUN-1990
DEFINITION Bovine P-450scc gene, exon 1.
ACCESSION J05245
KEYWORDS P-450scc.
SOURCE Bovine calf thymus DNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1471)
AUTHORS Ahlgren,R., Simpson,E.R., Waterman,M.R. and Lund,J.
TITLE Characterization of the promoter/regulatory region of the bovine
CYP11A (P-450-scc) gene: Basal and cAMP-dependent expression
JOURNAL J. Biol. Chem. 265, 3313-3319 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 948 / 1216 P-450scc, exon 1
pre-msg 897 > 1471 P-450scc mRNA and introns
BASE COUNT 324 a 367 c 396 g 384 t
ORIGIN
1 gcagatgtcc ctggtgattc ctgaaacagg ccctctgttt aaattcttca gcagttagag
61 ggaaggtcaa tttttcccaa ggcttttggg ctttgattgt tttcattttt aaattatctg
121 cattctaaag agatattttg ggtggcagat tttgctctcc tacaggactt tgtctaggag
181 acggctctca ggccagctcc gacgactgtt ccaaagaagt aagggaaagc tagggtttat
241 atcaatcttt ttttttgctg ggagaagggg gatgaacatg tagtcaaaca taaaaagatc
301 actgctaatc ccaaacaaca gacacctcaa gtgaatggtt ttagtgtttt tctatatatg
361 ttgtttagtc actaagtcct gtccgactct tttgcgactc catagactgt agcccaccaa
421 gctcctctgt ccatgggatt tttctaggca agaatactgg agtgggttgc catttccttc
481 tccctgggat cttcctaacc caaggactga acccttgtct cctgcattgc aggtggattt
541 tttaccgact gagccaccag ggaagttatg tgtgcaagaa tccggggtca tggaaatttt
601 cccttagata tacatcgtat ctagggacca gtacaatgca aatgcttcct gtttttcttc
661 atcctgaagt ctcctcaggg tgcattgagg gagggagtcc cctcaggtgg gtgaccacag
721 tggctgacgc ttgatgttgt agaactggaa tgatgggtta cattctttcg tttacagtac
781 tgagtctggg aggagctgtg tgggctggag tcagccggag gaggctgacc gccctgtcag
841 cttctcactt agccttgagc tggtgattat aagctgggtc ccagggtccc agggccagag
901 tcacctgctg cagtacgagc agagacagca gcagctgtgg gggcagcatg ctagcaaggg
961 ggcttcccct ccgttcagcc ctggtcaaag cctgcccacc catcctgagc tcagtggggg
1021 agggctgggg ccaccacagg gtgggcactg gagagggagc tggcatctcc acaaagaccc
1081 ctcgccccta cagtgagatc ccctcccctg gtgacaatgg ctggcttaac ctctaccatt
1141 tctggaggga gaagggctca cagagaatcc actttcgcca catcgagaac ttccagaagt
1201 atggccccat ttacaggtaa gcctggcagg aggattgggg ctggcgggat agggaagcct
1261 gtggtggccc cctccctgaa aggtctgccc tccccttcca ggctctggtt cacctctgac
1321 tttatttctt cctgcctggc ggtggcagga gtagagttaa tgcttcccag acagtgggtt
1381 cacttcccag ccctgaggcc tcaacagtcc ccgggctcta cacccttaga aactttgggg
1441 aggtggggag gcccaagaaa ataagccccg g
//
LOCUS BOVTGN 725 bp ds-DNA MAM 01-JUN-1990
DEFINITION Bovine minisatellite repeat DNA.
ACCESSION J04756
KEYWORDS minisatellite sequence.
SOURCE Bovine (strain Holstein-Friesian) blood DNA, clone lambda-gBJ32.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 725)
AUTHORS Kashi,Y., Iraqi,F., Tikochinski,Y., Ruzinzki,B., Nave,A.,
Beckmann,J.S., Friedmann,A., Soller,M. and Gruenbaum,Y.
TITLE (TG)n Uncovers a sex-specific hybridization pattern in cattle
JOURNAL Genomics 7, 31-36 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Gruenbaum, 22-JAN-1990.
FEATURES from to/span description
rpt 311 703 minisatellite repeat
BASE COUNT 134 a 131 c 226 g 234 t
ORIGIN
1 aagcttcaca tcccgagaat tccctcccag cgctcgtggt cccacagagg gctctgctgg
61 acctgcctcg ggtcacatgg caggtctggg gaggacacac ctctccccgg cagagaaatg
121 gccagaagcc aggtctgctc cacacgtgcc ttctcccaat actctctaac tttaaaaaaa
181 ctgccaaaga aaaagcggta cgtaataaca agcgcacaga tacgtaattt ataatggctg
241 acacggttgg cagggaaatg tgttacgcag gaattatgtt tttatttatg tgtgtcctgt
301 tttggagaca gcataagtaa tcatgggtgt gtgtgtgtgt gtgtgtgtgt gttgcctgtc
361 tccagcgtaa gtaatcatgt gtgtgtgtgt gtgtgtgtgt tgcctgtctc cagcgtaagt
421 aatcgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gttgcctgtc tccagagtaa
481 gtaatcatgg gtgtgtgtgt gtgtgtgttg cctgtctcca gcataagtaa tcatgggtgt
541 gtgtgtgtgt gtgtgttgcc tgtctccagc ataagtaatc atgggtgtgt gtgtgtgtgt
601 gtgtgtgttg cctgtctcca gcataagtaa tcatgggggg gtgtgtgtgt gtgtgtgtgt
661 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tgcctgtctc cagggacttt tgtacagaga
721 agctt
//
LOCUS CELCDPK 2305 bp ss-mRNA INV 01-JUN-1990
DEFINITION C.elegans cAMP-dependent protein kinase subunit R mRNA,
complete cds.
ACCESSION J05220
KEYWORDS cAMP-dependent protein kinase.
SOURCE C.elegans mRNA.
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 2305)
AUTHORS Lu,X., Gross,R.E., Bagchi,S. and Rubin,C.S.
TITLE Cloning, structure, and expression of the gene for a novel
regulatory subunit of cAMP-dependent protein kinase in
Caenorhabditis elegans
JOURNAL J. Biol. Chem. 265, 3293-3303 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 132 1262 cAMP-dependent protein kinase subunit R
mRNA < 1 2305 cAMP-dependent protein kinase subunit R mRNA
BASE COUNT 651 a 543 c 417 g 694 t
ORIGIN
1 gcggcatccc ccgtcaaaag tctgttacct tccccgccaa gttttcatgt catttcaaac
61 gtgtgttttc tgctattttt ctcgcatttt tgtctctttt gtttgtttaa aacaagaaat
121 gactgaaaat tatgaacaat tattcaggtg acattgtttt tatgtcgggt ggaaacgaag
181 aggatcagtt ggcacaatgc caggcctatg tgcaacgaca caacattcag cagcttgtca
241 aagaagcaat tgttgttctg tgcatacaca aacccgacaa cccggttctc ttcttgaagg
301 atcattttga aaaactcaac gaacaacgag cacaggaagg aggaaatccg gacgctgctg
361 atgatgacga catcattgtg gagccaccaa agagatcagg tggacgcaga accggaatct
421 ctgccgaacc aatcaaggag gacgataccg agtacaagaa ggttgtgatt ccaaaggatg
481 atgctacaag aaggtcgctt gagtcagcca tgaggaagaa cttgctcttt gctcatttgg
541 aggaggatga gcaaaaaacc atgtacgatg ccatgttccc agttgagaag tctgctggag
601 agaccatcat cgaacaggga gaagaaggag ataacttcta tgttattgat aagggaaccg
661 ttgatgtcta cgtcaatcac gagtacgttc tcaccatcaa cgagggtgga tcatttggag
721 aacttgcttt gatctatgga actccacgtg ctgcaaccgt cattgccaag attgacgtga
781 aattgtgggc catcgaccgt ctcacctacc gtcgtatctt gatgggatca gtaacaaaga
841 agagaaagat gtatgacgag ttcctctcca aggttcaaat cttggccgat ctcgaccagt
901 gggagcgcgt taacgtggct gatgctcttg agagatgcga cttcgagcca ggaactcatg
961 tcgtcgagca aggacaaccc ggagacgaat tcttcatcat tctcgaggga gaagccaatg
1021 tgttgcagaa gagaagcgac gacgcaccat tcgacgttgt cggacatctt ggaatgtcgg
1081 actatttcgg agaaatcgct cttcttctcg accgcccacg tgccgccacc gttgttgcca
1141 aaacccatct caagtgcatc aagctggatc gtaaccgctt cgaacgtgtc atgggaccag
1201 ttcgtgagat cctgaagaga gacgtttcaa actacaactc atacgtcaaa ctgatgacct
1261 aatcggatta caatccgttt gccccgcatc ttcctcatca ccgtatcttc ttttccacct
1321 tttcgccaaa gtactcgtaa ctattttttg aaagtatccc aaaaattctt ccttagtttt
1381 tttttagtat tgtatttttt cctgtatttt tttggatttt ctattattct gcatactgtc
1441 tagacctgta accttcttct ctcaccaacc gtgacatctt caagggcccc ccccccaatc
1501 aattcgttcc ctatcttcat ttctagtcag tcgatgtcaa attaaattct aaatgtctaa
1561 atttctaact catagccaaa acaacattca ttgttttaac cagcccaaaa tcctagtcag
1621 aagtttgcag aaaactattt gaaactcctc gtgaaaaagc taaaaaaatg ttcaactaga
1681 atagttttag tttttattcc ctgtgtaaca aaaaaaacag ttttgcatgc ttttctatcc
1741 accctaaaat ttcaacattt tcagtagctt tacagctcaa tctttaccaa ccatttcctt
1801 caattcctat aggttaactc aaagaagaaa cgaacaccat tcttcattaa ttaacattaa
1861 gcagtgaatg ccaaatcttg tctttataat atgtaacata taattttatg agaaataaga
1921 aatgcgcatc cacaacaatc gtcccccccc cccctaccca gcacacattt ttttcttatt
1981 tgttccactt ctgaattctg aactccgtca cccttatcca gcattgcgaa gtctcttatc
2041 tacgccccct tccctcaaca tccaacttct tttttttttc atttttatat atttttcaca
2101 ttcattggaa atccgctgtc atgtcgccat ctcaatgttt tcttgtctct tttttttcat
2161 ttctccacca cattttccac tagatttctg ttatccaaaa tttttgtcgg tttcccccgc
2221 gcacacacac acaaaaaaca gtatgttata aactatctat aatgtaatct tagaggttta
2281 ttggactaat aaaatttgga agttg
//
LOCUS ECHOGL 1803 bp ds-DNA BCT 01-JUN-1990
DEFINITION E.chrysanthemi oligogalacturonate lysase (ogl) gene, complete cds.
ACCESSION M33583
KEYWORDS oligogalacturonate lysase.
SOURCE E.chrysanthemi DNA.
ORGANISM Erwinia chrysanthemi
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1803)
AUTHORS Reverchon,S., Huang,Y., Bourson,C. and Robert-Baudouy,J.
TITLE Nucleotide sequences of the Erwinia chrysanthemi ogl and pelE genes
negatively regulated by the kdgR gene product
JOURNAL Gene 85, 125-134 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 271 1437 oligogalacturonate lysase (ogl)
BASE COUNT 480 a 393 c 441 g 489 t
ORIGIN
1 agtgaatcaa tacagccgga ggatatttat tctgacgttt ggtctggctg cttattgccg
61 aatctggatc aatatcaggc aaaataatct tgcgtcagcg caaacagtca gtgaaaaatg
121 aaagaatgtt ttataaaata aaaccacgat cacggaaaaa tgaaacgttg tttctacaat
181 accggtataa caaggcgatt ggccgggatg acgggtaatc tttcacctgg cggcgtctga
241 tttttcacga taagcgctaa ggatttacgg atggccaaag gtaaaaagct ttctttttcg
301 ttccatactt accaggattc agtcaccggc accgaagtgg tgcgtctcac tcctcccgat
361 gttatctgcc accgcaacta cttctatcag aagtgttttt ccaatgatgg cagcaagctg
421 ctttttggtg gcgcctttga cgggccgtgg aactactatt tgctggatct gaaaactcag
481 caggcgacgc aactgaccga aggtaccggc gacaatactt ttggtggttt tctgtcacca
541 gatgatgacg cgctttatta tgtaaagaac gttcgtaatt tgatgcgtgt tgacctgaat
601 acactggaag aaaccaatat ttatcaggtg ccggacgact gggtcgggta cggtacctgg
661 gttgccaact ccgactgcac caaaatggtc ggtatcgaga tcaagaaaga ggattggaaa
721 ccactgaccg actggaaaaa attccaggaa ttctacttta ccaatccatg ctgccgtttg
781 attcgtatcg atctgaaaac cggcgaagcc accaccattc tgaaggaaaa ccaatggctg
841 ggtcatccta tttaccgtcc gggtgacgat aatacggtgg ccttctgcca tgaaggtccg
901 catgacctgg ttgatgcgcg tatgtggttc atcaatgaag atggctccaa tatgcgtaag
961 gtaaaagagc atgcgccggg cgaaagctgc actcacgaat tctgggtgcc gaatggttct
1021 gcgctggcct acgtttccta tctgaaaggc agtactaacc gtttcatttg cagcgttgat
1081 ccggtaacgc tggaaaaccg tcagttgact gaaatgccgc cgtgttctca cctgatgagt
1141 aactacgatg gtacgttgat ggtgggagat gggtgtaatg cgccggtgga tgtgaaagat
1201 gacggtggct acaagactga aaacgatccg ttcctgtatg tgttcaatat gaagaccggg
1261 aaacatttcc aggtcgctca acacaacacc tcctgggaag tgctggaggg cgatcgtcag
1321 gtaacgcatc cacatccgtc ctttacgccg gatgacaagc acattctgtt tacgtctgat
1381 gtcgatggta agccggcgtt atatctggca aaagtgcctg attccgtctg gcaataactg
1441 tacctgcaac cgtgcccgcc agggcacgtt ttttcgcttt tacaataaga ttattaattc
1501 ggcagtgcta ttgtgatgga tgatattaat cggtctgtca cttatagtaa tgcccgattg
1561 ccatattcaa agcgttccct ttatactaaa acacgtgttc tatttttttt aaaacaaaaa
1621 atactgagta gggtaaccac aaaaatggct attgcagatt tagataaaca acccgattcc
1681 gtgtcatctg tcttaaaggt ctttggtatt ttgcaggcgt taggcgaaga gcgtgagatt
1741 ggcattactg aactttcaca gcgcgtgatg atgtctaaaa gtactgttta tcgtttcctg
1801 cag
//
LOCUS ECHPLE 1758 bp ds-DNA BCT 01-JUN-1990
DEFINITION E.chrysanthemi pectate lysase e isoenzyme (pelE) gene, complete
cds.
ACCESSION M33584
KEYWORDS pectate lysase e isoenzyme.
SOURCE E.chrysanthemi DNA.
ORGANISM Erwinia chrysanthemi
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1758)
AUTHORS Reverchon,S., Huang,Y., Bourson,C. and Robert-Baudouy,J.
TITLE Nucleotide sequences of the Erwinia chrysanthemi ogl and pelE genes
negatively regulated by the kdgR gene product
JOURNAL Gene 85, 125-134 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 251 1465 pectate lysase e (pelE)
BASE COUNT 534 a 480 c 379 g 365 t
ORIGIN
1 tttaaaactt catttttaat ttaatcataa aaaataaaaa accaagtaac acaaagttac
61 aaatacagtc aatagtttat tttttattaa taaaaaccat tgtcatcatc gtgacaaagt
121 tcacaaaata gacactcaac cgcataaaca ttcgccaaag caaaagatga aatggtattt
181 cgtttttaga cacacatgta acaaatggac accatttgga tcgctcactg agcacacaag
241 gaaaattgcc atgaacaact cacgtatgtc ttccgtttca acacagaaaa caacaggacg
301 ttctgccttg ggaaccaaaa gcgcgctggc cgcgattatc gccaccacca tgatggtctc
361 agtcgcttcc gccgccagtc tgcaaaccac caaagcgaca gaagcggcgt caaccggctg
421 ggcaacgcag agcggcggca ccaccggcgg cgccaaagcg tcatcatcca aaatctacgc
481 ggtgaaaagc atcagcgaat tcaaagcagc actgaacgga accgattcgt cgcccaagat
541 catccaggtc accggagcaa ttgatatcag cggcggtaaa gcctacaccg actttgacga
601 tcagaaagcc cgcagccaga tcagcattcc gtccaacacc accatcatcg gtattggcaa
661 caaaggcaaa ttcaccaacg gttcgctggt ggtgaaaggc gtcagcaacg ttatcctgcg
721 taacctgtac atcgaaacgc cggtggacgt ggcgccgcat tacgaagagg gagatggttg
781 gaacgccgag tgggacgcag tagtgattga cagcacagac cacgtctggg ttgaccatgt
841 caccatcagc gacggcagcc tcaccgacga caaatacacc accaaaaacg gcgaaaaata
901 tgtgcagcat gacggctcgc tggatatcaa gcgcggctcc gactacgtta ccgtttctaa
961 tagccgcttc gagctgcacg acaaaaccat cctgatcggc cacagcgaca acaacggctc
1021 gcaggacgcc ggcaaactgc gcgtcacctt ccacaacaac ctgtttgacc gggtcggcga
1081 acgtaccccg cgcgtacgct tcggcagcgt tcacgcttac aacaacgttt acgttggcga
1141 cgtcaaccac aaagcctatc gctatcagta cagcttcggt atcggcacca gcggtagcct
1201 gctgtctgag tccaacgcgt ttaccattga taacatgaaa aaaatcagcg gccgcgacaa
1261 agaatgcagc gtggtcaagg cgtttaacgg taagatcttc tctgataaag gctcgatcat
1321 caacggcgct tcgtacaatc tgaatggttg cggctttggc ttcagcgcat acagcgccaa
1381 gatcccgtac aaatactcag cccaaaccat caccaccagc ctggcaaaca gcatcagcag
1441 caacgcgggc tacggtaaac tgtaatttcc cgcccagcgt aaaggccact tgcaaaagtg
1501 gcttttattt tcccatattc ccaccactcg tattaccccg atgttctccg aagcgccact
1561 caatacggtt caggttaatc cggccttgcc acaggcgttc atcaagaaat cgaccggttt
1621 cccccatcaa atgcagggac tagcaggaat tagtaagaga cacagattaa taaaattcta
1681 tttaataaaa atcgaatcaa ttgttaactg tttgggttat ttcagataaa aacgcttata
1741 catatagctg aatttaaa
//
LOCUS ECOFIMA1 66 bp ds-DNA BCT 01-JUN-1990
DEFINITION E.coli type 1 fimbriae (fimA) gene with viral insertion site,
partial cds.
ACCESSION M33412
KEYWORDS type 1 fimbriae.
SOURCE E.coli (strain HB101) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 66)
AUTHORS Hedegaard,L. and Klemm,P.
TITLE Type 1 fimbriae of Escherichia coli as carriers of heterologous
antigenic sequences
JOURNAL Gene 85, 115-124 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 66 type 1 fimbriae (AA at 1)
recomb 3 4 E.coli DNA end/ viral DNA start
recomb 63 64 viral DNA end/ E.coli DNA start
BASE COUNT 12 a 19 c 17 g 18 t
ORIGIN
1 gttgatcccc ggacctgcat gatcatcgct caaggaacct ctatgtatcc ctcctgttgg
61 ggggat
//
LOCUS ECOFIMA2 66 bp ds-DNA BCT 01-JUN-1990
DEFINITION E.coli type 1 fimbriae (fimA) gene and viral insertion site,
partial cds.
ACCESSION M33413
KEYWORDS type 1 fimbriae.
SOURCE E.coli (strain HB101) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 66)
AUTHORS Hedegaard,L. and Klemm,P.
TITLE Type 1 fimbriae of Escherichia coli as carriers of heterologous
antigenic sequences
JOURNAL Gene 85, 115-124 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 66 type 1 fimbriae (AA at 1)
recomb 3 4 E.coli DNA end/ viral DNA start
recomb 62 63 viral DNA end/ E.coli DNA start
BASE COUNT 14 a 17 c 17 g 18 t
ORIGIN
1 gttgatccaa acctgcgtgg agatctccag gttctggctc agaaagttgc tcgtacctta
61 ccggat
//
LOCUS ECOSPDE 2792 bp ds-DNA BCT 01-JUN-1990
DEFINITION E.coli speED operon speE and speD genes encoding S-adenosyl-
methionine decarboxylase and spermidine synthase, complete cds.
ACCESSION J02804 J04247
KEYWORDS S-adenosylmethionine decarboxylase; speD gene; speE gene;
spermidine synthase.
SOURCE E.coli K12 DNA, clones pT7-5-1 and LC37-29.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 643 to 2524)
AUTHORS Tabor,C.W. and Tabor,H.
TITLE The speED operon of Escherichia coli: Formation and processing of a
proenzyme form of S-adenosylmethionine decarboxylase
JOURNAL J. Biol. Chem. 262, 16037-16040 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 642; 2525 to 2792)
AUTHORS Xie,Q.-W., Tabor,C.W. and Tabor,H.
TITLE Spermidine biosynthesis in Escherichia coli the promoter and the
termination regions of the speED operon
JOURNAL J. Bacteriol. 171, 4457-4465 (1989)
STANDARD full staff_review
COMMENT Draft entry and printed copy of sequence for [1] kindly submitted
by H.Tabor, 02-DEC-1987 and for [2] 13-MAR-1990.
FEATURES from to/span description
pept 314 661 unidentified protein
pept 767 1633 spermidine synthase /nomgen="speE"
pept 1649 2443 S-adenosylmethionine decarboxylase (ttg start
codon) /nomgen="speD"
signal 2448 2482 hairpin terminator
BASE COUNT 714 a 738 c 637 g 703 t
ORIGIN
1 aactgaatgc gattacgggc atcggtcgtg agcaaatcag ggatcggtaa cgttgggcgt
61 tctgccgcaa atactgcgcg gctccacagc ggcaaagcga agccacaccc agcgcgacgg
121 aatattttaa gaaatcacga cgttgcatag ttatttcctt attcttaagc aggcgaaaca
181 aaatcaaacg ttgagcatag tccttcccct tacgggaagg tcaagccgca gacacaattt
241 aatcgacaat aatggtcgtc gcctcgctcg cagtgtgcta acgtttatct tctttaagcc
301 cctggtagaa gcaatgaaga cgtttttcag aacagtgtta ttcggcagcc tgatggccgt
361 ctgcgcaaac agttacgcgc tcagcgagtc tgaagccgaa gatatggccg atttaacggc
421 agtttttgtc tttctgaaga acgattgtgg ttaccagaac ttacctaacg ggcaaattcg
481 tcgcgcactg gtctttttcg ctcagcaaaa ccagtgggac ctcagtaatt acgacacctt
541 cgacatgaaa gccctcggtg aagacagcta ccgcgatctc agcggcattg gcattcccgt
601 cgctaaaaaa tgcaaagccc tggcccgcga ttccttaagc ctgcttgcct acgtcaaata
661 atccctgata cctttttgta gaaatattga ccgtgcatcc gcggtcaatg ttagctatta
721 tgttgcgccc tttttttacg ggtgttaaca aaggaggtat caacccatgg ccgaaaaaaa
781 acagtggcat gaaacgctac acgaccagtt tgggcagtac tttgcggtag ataacgttct
841 gtatcatgaa aagaccgatc accaggatct gatcattttt gagaacgctg catttggtcg
901 cgtaatggcg ctggatggcg tagtacaaac caccgagcgc gacgagttta tctatcatga
961 gatgatgacc catgttccgc tactggccca tggtcacgcg aaacatgtgc tgattatcgg
1021 cggcggcgac ggtgccatgc tgcgtgaagt aacccgacat aaaaacgttg agtcaatcac
1081 gatggtggaa atcgatgcgg gtgtcgtatc gttctgccgt cagtatctac ccaaccataa
1141 cgccggtagc tacgacgatc cgcgctttaa gctggtgatc gacgatggcg tcaatttcgt
1201 taatcaaacc agccagacct ttgatgtcat tatctccgac tgcaccgatc ctatcggtcc
1261 cggcgaaagc cttttcactt cggcatttta tgaaggctgc aaacgttgcc tgaatcctgg
1321 cggtatcttc gtcgcacaaa acggcgtctg ctttttacag caggaagaag ccatcgacag
1381 ccatcgcaaa ctcagccatt acttcagcga cgttggcttt tatcaggcgg cgatcccgac
1441 ctattacggc ggtatcatga cttttgcatg ggcgacagat aacgacgcct tacgccatct
1501 ctcaaccgaa attattcagg cgcgttttct cgcctctggc ctgaaatgcc gttattacaa
1561 tccggcaatc catacggcag cttttgcctt acctcagtat ctgcaagacg cactggcttc
1621 acagccgtcc taaggagaag ataagaaatt gaaaaaactg aaactgcatg gctttaataa
1681 tctgaccaaa agtctgagtt tttgtattta cgatatctgc tacgccaaaa ctgccgaaga
1741 gcgcgacggt tatattgctt atatcgatga actctataat gccaaccgtc tgaccgaaat
1801 cctgtcagaa acctgttcca ttatcggggc taatattctt aacatcgccc gccaggatta
1861 cgaaccacag ggtgccagcg tcactattct ggtgagtgaa gaaccggttg acccgaaact
1921 catcgacaaa acagaacacc ccggcccact gccagaaacg gtcgttgccc atcttgataa
1981 aagtcatatt tgcgtacata cctacccgga aagtcatcct gaaggcggtt tatgtacctt
2041 ccgcgccgat attgaagtct ctacctgcgg cgtgatttct ccgctgaagg cgctgaatta
2101 cctgatccac cagcttgagt ccgatatcgt aaccattgat tatcgcgtgc gcggttttac
2161 ccgcgacatt aacggtatga agcactttat cgaccatgag attaattcga ttcagaactt
2221 tatgtctgac gatatgaagg cgctgtatga catggtggat gtgaacgtct atcaggaaaa
2281 tatcttccat accaagatgt tgcttaaaga gttcgacctt aagcactaca tgttccacac
2341 caaaccggaa gacttaaccg acagcgagcg ccaggaaatt accgctgcgc tgtggaaaga
2401 aatgcgcgag atttattacg ggcgcaatat gccagctgtt taacggctct ggcggagctc
2461 ccaggctccg ccagatttat ttacttctgc tgcacgaaat tgcggtaagc cgccacgacc
2521 tgcagaaaat cctcaacgcc gcatagcgac aggctttctt cgtcgtagta gttcatcccc
2581 tcttccattt catcgccagc gaattccagt tgattggcgc gaaccatcac ctcttcaccg
2641 tccatccaca gggtgtattc atgccctgcc cgttgccagg accgttcgct acctttcagt
2701 gcgtgcgctg cttgttccac ttcatcaagc aaggccaggt tttctttcac ctcttcatta
2761 aaccagtgcc cgaccacttc atgccccatg ga
//
LOCUS HUMBTRO 608 bp ds-DNA PRI 01-JUN-1990
DEFINITION Human beta-tropomyosin gene, exon 1.
ACCESSION J05247
KEYWORDS beta-tropomyosin.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 608)
AUTHORS Libri,D., Mouly,V., Lemonnier,M. and Fiszman,M.Y.
TITLE A nonmuscle tropomyosin is encoded by the smooth/skeletal beta-
tropomyosin gene and its RNA is transcribed from an internal
promoter
JOURNAL J. Biol. Chem. 265, 3471-3473 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 441 / 572 beta-tropomyosin
IVS 573 > 608 beta-tropomyosin intron A
BASE COUNT 102 a 198 c 223 g 83 t 2 others
ORIGIN
1 gtctctgcag cagatgtgtg gggactctgg agagcacccc tggatacact ccatgcagag
61 ctgctcacag cctctgcggc cccttttcct ctcaggaaaa taaacacccc gaccgccttc
121 ccctctgcct gctgggctcg ctgcggcatg cagccctcct ggnntctgtg ccctgctttg
181 gggctgtgcg cccatgctgg ggtcgacagc gtgcctccac ggggcagcca tgagccttat
241 cgggggacga cgtcgctcag cgcagggagc gcggcgggag cgggtcggtg tgcggcagga
301 ggaggaggag gaggaggagc aggaagggat ggaggcaggg cgggagcggc cgccgcctga
361 gcgcttcctg cccgagccag gcggatccca caaagggctc agcggagcgg ccttgctgcg
421 ctccccgccg gccgcaggcc atggccggca tcagctccat cgatgccgtc aagaagaaga
481 tccagagcct gcagcaggtg gctgacgagg cggaggaacg cgccgagcac ctgcagcggg
541 aggccgatgc cgagcggcag gcccgggagc gggtaaggct cctgggggca ccccgcgacc
601 cccagacc
//
LOCUS HUMGPIIB1 2238 bp ds-DNA PRI 01-JUN-1990
DEFINITION Human platelet glycoprotein IIb (GPIIb) gene, exon 1.
ACCESSION M33319
KEYWORDS platelet glycoprotein IIb.
SEGMENT 1 of 3
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2238)
AUTHORS Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S.,
Schwartz,E. and Poncz,M.
TITLE Organization of the gene for platelet glycoprotein IIb
JOURNAL Biochemistry 29, 1232-1244 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1286 + 1473 platelet glycoprotein IIb (GPIIb), exon 1
pre-msg 1254 > 2238 GPIIb mRNA and introns
IVS 1474 > 2238 GPIIb intron A
BASE COUNT 596 a 554 c 595 g 493 t
ORIGIN
1 gtcaacggat cagaaaatag aaatcaaaag gaaaatgtgg ctatggttac ccctagcgga
61 cctcttaaat cttcctgaga acctgctttt ttgggaaggc atgagtgcca gtaagacttg
121 gcactcctcc tcttccgctt accgagagaa aatgactttg cctttctgct caaaactcat
181 cccttcactt tgtcacccta tgtttgcatc ttccatcctt agtgtgtgtt tccatccatc
241 cagtctttca gcaatacacg tactacacat tggactcttg ggtagtctct agggctgtag
301 caaggagcct tgctcccaag ggactcattt acacaatcct gtgaacggac caagagtaaa
361 cagtgtgctc aatgctgtgc ctacgtgtgt tagcccacgc ggccagcctg aggagtcagg
421 gaaggctccc ctaggcaaag cccccaacca gaatcaagtc ttaatggtta aagagctcca
481 tcacccaaaa aggattgagg gcctaccttc aactgaacag ctaatgcata atctcagaaa
541 ctgtgagtca aaattccctg gaataactcc actttatccc caatctcctt gccacctaga
601 ccaaggtcca ttcaccaccc tgtccccagc actgactgca ctgctgtggc cacactaaag
661 cttggctcaa gacggaggag gagtgaggaa gctgctgcac caatatggct ggttgaggcc
721 gcccaaggtc ctagaaggag gaagtgggta aatgccatat ccaaaaagat acagaagcct
781 caggttttat cgggggcagc agcttccttc tccttccccg acctgtggcc aagtcacaaa
841 gcaccacagc tgtacagcca gatgggggaa gggaggagat tagaactgta ggctagagta
901 gacaagtatg gaccagttca caatcacgct atcccaagca gaaagtgatg gtggcttgga
961 ctagcacggt ggtagtagag atggggtaaa gattcaagag acatcattga taggcagaac
1021 caataggaca tggtaataaa ctattctcag gaaaggggag gagtcatggc tttcagccat
1081 gagcatccac cctctgggtg gcctcaccca cttcctggca attctagcca ccatgagtcc
1141 aggggctata gccctttgct ctgcccgttg ctcagcaagt tacttggggt tccagtttga
1201 taagaaaaga cttcctgtgg aggaatctga agggaaggag gaggagctgg cccattcctg
1261 cctgggaggt tgtggaagaa ggaagatggc cagagctttg tgtccactgc aagccctctg
1321 gcttctggag tgggtgctgc tgctcttggg accttgtgct gcccctccag cctgggcctt
1381 gaacctggac ccagtgcagc tcaccttcta tgcaggcccc aatggcagcc agtttggatt
1441 ttcactggac ttccacaagg acagccatgg gaggtgagcc gtaagggaag ttggggtatt
1501 gggagagagc aggacccctc cccatcactg cttctggggg cttcgagttt cccatttgcg
1561 atagcagttg agcaaggtga cttgtggggc ctattcaggt tgatttcttg tcaagaatgt
1621 tggggtccag gggactggct caggtgaagg tataagggca gggcacatgt gggctgatgg
1681 gcactgaaaa ctacagcaag aacaaaggga agacaagagt tgatgcttta ttttttcccc
1741 aagggtcagt tgtatgaacc actccaccct caacaccttg aaatgcagag aggaggccgg
1801 gcgcggtggc tcatgcctgt aatcccagca ctttgggagg ccgaggcggg cagatcacct
1861 gaggtcgaga attcgagacc agcctgacca acatggagaa accccgtctc tactaaaaat
1921 acaaaaaaaa aaaaaaaaaa aagaggccag gcacagtggc tcacacctgc aatcccagca
1981 ctttgggagg cagaggtggg cagatcatga ggtcaggagt tcaagaccag cctggccaat
2041 atggtgaaac cctgtctcta ttaaaaatac aaaaattagc tgagcatggt ggcacactcc
2101 tgtagtccca gctactcggg aggctgaggt aggagaatca cttgaacccg ggaggtggag
2161 gttgcagtga gctgagacta tgccactgca ctcccagccc tggggttgac agagtgacac
2221 tccgtctcaa aaaaaaaa
//
LOCUS HUMGPIIB2 13204 bp ds-DNA PRI 01-JUN-1990
DEFINITION Human platelet Glycoprotein IIb (GPIIb) gene, exons 2-29.
ACCESSION M33320
KEYWORDS platelet Glycoprotein IIb.
SEGMENT 2 of 3
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 13204)
AUTHORS Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S.,
Schwartz,E. and Poncz,M.
TITLE Organization of the gene for platelet glycoprotein IIb
JOURNAL Biochemistry 29, 1232-1244 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 498 619 platelet Glycoprotein IIb (GPIIb), exon 2
709 806 platelet Glycoprotein IIb, exon 3
912 1077 platelet Glycoprotein IIb, exon 4
1293 1342 platelet Glycoprotein IIb, exon 5
1419 1464 platelet Glycoprotein IIb, exon 6
1552 1680 platelet Glycoprotein IIb, exon 7
2042 2089 platelet Glycoprotein IIb, exon 8
2245 2288 platelet Glycoprotein IIb, exon 9
2461 2514 platelet Glycoprotein IIb, exon 10
2653 2705 platelet Glycoprotein IIb, exon 11
2897 3108 platelet Glycoprotein IIb, exon 12
5536 5718 platelet Glycoprotein IIb, exon 13
5952 5997 platelet Glycoprotein IIb, exon 14
6106 6210 platelet Glycoprotein IIb, exon 15
6295 6350 platelet Glycoprotein IIb, exon 16
6443 6594 platelet Glycoprotein IIb, exon 17
6783 6908 platelet Glycoprotein IIb, exon 18
7886 7953 platelet Glycoprotein IIb, exon 19
8087 8234 platelet Glycoprotein IIb, exon 20
8803 8895 platelet Glycoprotein IIb, exon 21
9506 9585 platelet Glycoprotein IIb, exon 22
10202 10282 platelet Glycoprotein IIb, exon 23
10406 10505 platelet Glycoprotein IIb, exon 24
10605 10757 platelet Glycoprotein IIb, exon 25
10874 10999 platelet Glycoprotein IIb, exon 26
11478 11591 platelet Glycoprotein IIb, exon 27
11828 11929 platelet Glycoprotein IIb, exon 28
12117 + 12233 platelet Glycoprotein IIb, exon 29
pre-msg < 1 > 13204 GPIIb mRNA and introns
IVS < 1 497 GPIIb intron A
IVS 620 708 GPIIb intron B
IVS 807 911 GPIIb intron C
IVS 1078 1292 GPIIb intron D
IVS 1343 1418 GPIIb intron E (no splice consensus)
IVS 1465 1551 GPIIb intron F
IVS 1681 2041 GPIIb intron G
IVS 2090 2244 GPIIb intron H (no splice consensus)
IVS 2289 2460 GPIIb intron I
IVS 2515 2652 GPIIb intron J
IVS 2706 2896 GPIIb intron K
IVS 3109 5535 GPIIb intron L
IVS 5719 5951 GPIIb intron M
IVS 5998 6105 GPIIb intron N
IVS 6211 6294 GPIIb intron O
IVS 6351 6442 GPIIb intron P
IVS 6595 6782 GPIIb intron Q
IVS 6909 7885 GPIIb intron R
IVS 7954 8086 GPIIb intron S
IVS 8235 8802 GPIIb intron T
IVS 8896 9505 GPIIb intron U
IVS 9586 10201 GPIIb intron V
IVS 10283 10405 GPIIb intron W
IVS 10506 10604 GPIIb intron X
IVS 10758 10873 GPIIb intron Y
IVS 11000 11477 GPIIb intron Z
IVS 11592 11827 GPIIb intron AA
IVS 11930 12116 GPIIb intron BB
IVS 12235 > 13204 GPIIb intron CC
BASE COUNT 3046 a 3579 c 3857 g 2722 t
ORIGIN About 2000 bp after segment 1.
1 ctgcaggtca acggatctgc tagggtcctc ctatcagcac acacactcca gccccacttt
61 agaggtaccc gctaccttcc ctcattaaaa ccagctctca agaggggatc tggtaacagt
121 ctaggcaggc attccaggga gcatgtgaac cgctggttct tgttgcgggt ggaggatgga
181 ggtgttgtac agagtttagg tctttttcag caaagatctc caaaccccgg gtgttcaaaa
241 tcaaaccaaa ggggattata gtcccagctc tactcacaac tcactggtta ctttagccac
301 gagattgccc tcgctgagag tcggtttcac tgtccataag atgaagaagt acatcacggt
361 ggtctgtgag gtgtcattga ggaaagatgg tccagtgccc ccatgccaca tggccttcgg
421 gcagtgctcc cagcgccggc gccagggcct gggatacgct ggaatctgcg cggcgctcac
481 ccagctttcc tatgcagagt ggccatcgtg gtgggcgccc cgcggaccct gggccccagc
541 caggaggaga cgggcggcgt gttcctgtgc ccctggaggg ccgagggcgg ccagtgcccc
601 tcgctgctct ttgacctccg tgagtcccag gcaaggagag caaggttggg gtcagaggga
661 cgtggactgc ccgggcttca gcgccccacc ccttcttgtg ccttccaggt gatgagaccc
721 gaaatgtagg ctcccaaact ttacaaacct tcaaggcccg ccaaggactg ggggcgtcgg
781 tcgtcagctg gagcgacgtc attgtggtgg gccccgcggt acagggcaca gggaacaatc
841 gggggcaggg acactggggc caggaggagc ccaagtctcg cgccccgtcc ccatctgtgg
901 ccctttctca ggcctgcgcc ccctggcagc actggaacgt cctagaaaag actgaggagg
961 ctgagaagac gcccgtaggt agctgctttt tggctcagcc agagagcggc cgccgcgccg
1021 agtactcccc ctgtcgcggg aacaccctga gccgcattta cgtggaaaat gattttagta
1081 agcgccagct acgacctggc cccgcccact cgcgacggct tggccccgcc ccccatcgga
1141 tcccgccccc agcgccgcag cccttgcttt ggatctggcc tcgccccagg gccccgccga
1201 ctcaaggccc cgcccctgtc ccccagccct cctccgggct cgcgcgcgcc tcccttcacc
1261 cctgggctga cccctcctcc ttgtctcctc aggctgggac aagcgttact gtgaagcggg
1321 cttcagctcc gtggtcactc aggcgagtag ggagcaaaag cgcagtgggg gcggctccca
1381 aacagggccc cctctcaccc tcaggacttc ccttccaggc cggagagctg gtgcttgggg
1441 ctcctggcgg ctattatttc ttaggtacgt gcccatccgt acacctccct cccttctcgc
1501 ggccgaagga gaccgctttg ggcttcacac ccgctgtccc tcccgcccta ggtctcctgg
1561 cccaggctcc agttgcggat attttctcga gttaccgccc aggcatcctt ttgtggcacg
1621 tgtcctccca gagcctctcc tttgactcca gcaacccaga gtacttcgac ggctactggg
1681 gtaacaccgc cattccagac ttccagcacc ccgagggtca ccgcccaccg cagacggtca
1741 ggtcctgccc ctgtgggagc ctccatggcc acccctgccg gccaacccac cgcctaagcc
1801 gctcccgccc tccgctcctg cgcttccccg cagaccgccc acctcccatg cgcccaccgc
1861 tcccttccac tgcggactcg tagcgcagcc tggggcaggg cttggcccct cgaaggcctc
1921 cgtttttcca tctgcacaat gcagggctgg ggctgagtgg ccttaatctc ctccttcttt
1981 gccctccgtc ccctctgtgc ttcctcccct ggaaaagact aatttgcgcc cttgtcctca
2041 gggtactcgg tggccgtggg cgagttcgac ggggatctca acactacagg caagaaatcc
2101 acttagggcg ggagttgggt agcccagccc ggggaggagc gccttcctga aatctcccct
2161 atgtagctgg gtgcagaacg gggagcggga agtgggtagg ttctaaggct ctcattccct
2221 gagcctggct ctccctatcg ccagaatatg tcgtcgtgcc ccccacttgg agctggaccc
2281 tgggagcggt aagtgccccc accactgggc ctcccgaagc cccttatccc agttctcagg
2341 ctgacaactc ctgagcgccc cccacccccg ccccgcctcc accaaaccac cctttctcac
2401 ctggagtggg aggttgcttt gggtacaaga atgatgctct cgcctgcgct gtccgtgcag
2461 gtggaaattt tggattccta ctaccagagg ctgcatcggc tgcgcggaga gcaggtgggg
2521 gccaggtccc agtgggcgtg gctgggtgga gggggaactg agacttcaga atatttcatg
2581 ggaggtgagg gcccatttct taaagaggat gcttgtccag cggcgtgaat gatggtgctc
2641 ctcatcttgc agatggcgtc gtattttggg cattcagtgg ctgtcactga cgtcaacggg
2701 gatgggtgag gagggacatg cccccacccc tacccagttg ggtcccaaat taccagagct
2761 gcccctctgt ctccctttcc tagccctagt ctcacgtatc cactggagga acaggagagc
2821 aagggtcgag gagatttggc cctagcccca atatacccct ggtccagtcc catgtaacca
2881 ctcatctggc ccacaggagg catgatctgc tggtgggcgc tccactgtat atggagagcc
2941 gggcagaccg aaaactggcc gaagtggggc gtgtgtattt gttcctgcag ccgcgaggcc
3001 cccacgcgct gggtgccccc agcctcctgc tgactggcac acagctctat gggcgattcg
3061 gctctgccat cgcacccctg ggcgacctcg accgggatgg ctacaatggt gagggaagag
3121 aggagcccta cttgctgcag aggggttaac agccactcaa aaagcatgga gttggcctga
3181 gggcagccag aaccaggatg ggttttaagc atataagtat gtggcttaga cacatggggt
3241 gctgagtgga gagcagatgg gagagttgaa gactaattag gaagtgtttg ccttaatcca
3301 agcaagagac aatgaccacc tggatgtgga ttttggcagt ggagttagag atgggagtga
3361 cttcacagat atttaggact cggattatta ggacttggtg ggagactgga tgtggggcca
3421 ggggagaggt tggagttggg tgcctgtgat ggcctccact gcctggaact caggccgtgc
3481 agcaggtgct ggggagaggc gggagatcag cagttcagct ctggacctgt tgagcttgaa
3541 gggcttgggt gctttaggcg gaaatatcca aagaacagtt gggagtggct ctccccgctt
3601 ccacaagaga gatctgaatg ggagacaggg gtttggggaa agtggatgag gtcccgggac
3661 ctgtgaaata agaggcccag gatagagccc tagggagcaa aagcatttag gtgactccta
3721 caggaggtaa gtctgagaag gagacagagg agtgtccaga gagggaggag ggaacccagg
3781 gggtctgatg gcccgggact caaggaagag catgcgttaa agagcatgca caggaggaag
3841 tgggcgctgc agctcctgct gctgctgcaa gatacaatta ggtggggctg gagaaatatt
3901 catgggcttt agcaagaaga gggtgccagg catggtggct catacctgta atcccagcta
3961 cttgggaaat tgaagcagga gaatctcttg aacccgggaa gtggaggttg cactgagctg
4021 agcttgcgcc actactgcac tccagcctgg gtgacagagc aagactccat ctcaacaaaa
4081 taaaaaaaaa aatagagaaa gaaaggaaga aagaaaaaag aaggggaggt tattggtgac
4141 agtgacataa attgattcag gccaagatag ggtcagaagc cagaatgcaa tggggtaagg
4201 tatgaatgga gatgaaaaat tggatgcagc taatgtagac agctctttca acaggtttgt
4261 ggtaaaaagg aatttgagga atagaaagga aaaaaaaaaa catgtttgac tataagagga
4321 aaaagagaaa aggtgatcac agaaaagaga tgagggtcaa gggaagatta tttcaatgtg
4381 gaagaacatg tagtaggttg aaaatgatgt tgtggggaaa tggggggatg agccagcaga
4441 gagtccctgt gatgcctcag ggggtgggag ggtgactggc ccagtgtcag ggtgaaggaa
4501 ggaaacctct tccagggtca aatggggaaa gggaaaaaga aagttggtgt gggattatag
4561 cataacagtg ggctgcctct cttcctgaag taagagatta cgtcacctgc tgaaggaagt
4621 gtggggggtc tgggagtttg atggaatgga gaaggctaga aatagatgct agatggccag
4681 gcacggtggc tcacacctgg aatcccagca ctttgggagg ccgaggcagg aggatcactg
4741 gagcctagga gtttgacacc agcctggcca acatagggag atctcgtctc cataaaaatt
4801 tttaaaaatt agctgggcat ggtggctata gtctcaactg cttgggaagc tgaggtggga
4861 ggattgcttt agtccagaag gttgaggctg cagtaagcca tggttgcacc actgcacttc
4921 agcctgaatg acaagtgcaa gactgtctta aaataaaaaa tttaaagggc ttgggcacgg
4981 tggctcacac ctgtaatcca gcactttggg agcccaaggt gggcagatca cttgaggtca
5041 ggagttcgag atcagcctgg ccaatgtggt gaaaccccgt ctctactgaa aatacaaaaa
5101 ttagccgggc atggtggtag gcgcctgtaa tcccagctac tgaagaggct gaggcacaag
5161 aatcacttta acgggggagg cagaggttgc agtgagccga gatcgcacca ctgcactcca
5221 gccaggacaa cagagcgaga ctccatctca aaaaaaaaaa aatttagaaa agggaataat
5281 gatgcttaat tttcaggata tattttcctc aatagacagt gagagttgtc actgttttta
5341 taacaatcct acttggcagg tccctctccc acctgattgt taactcctgg agggtagggc
5401 agtgcctcct tcacccacac tttgcacccc tttcctagtc tcctgggatg ttcccagaga
5461 agctcaggaa agttttacag tcatctaggg aggctgaata acaatcagcc acttcctttc
5521 tgttactcct tccagacatt gcagtggctg ccccctacgg gggtcccagt ggccggggcc
5581 aagtgctggt gttcctgggt cagagtgagg ggctgaggtc acgtccctcc caggtcctgg
5641 acagcccctt ccccacaggc tctgcctttg gcttctccct tcgaggtgcc gtagacatcg
5701 atgacaacgg atacccaggt gccctggact gcctccagct agaaatgccc aagaaaggcc
5761 cttggacatt cgctggaagt gccaagagac acggccaggg ctcatgcctg gcctggtgtc
5821 ccactatgga ctgccagagg ggctgggtga aacctccagt gggggaggtg gtgtggggaa
5881 cccctgggaa gatgagatga ggatccccat accctaatcg ccaattctga cccattcctc
5941 gatgtctata gacctgatcg tgggagctta cggggccaac caggtggctg tgtacaggtg
6001 agcactggct ccaggggcgg gatggggaag gtcctgtgcc atcaagagga ggccaggcca
6061 ggaggagcca caatggcaag cctccccatc accctatccc atcagagctc agccagtggt
6121 gaaggcctct gtccagctac tggtgcaaga ttcactgaat cctgctgtga agagctgtgt
6181 cctacctcag accaagacac ccgtgagctg gtgaggaggc agagggcatg ggccttaaag
6241 gatctgggac ctcagaaagg ctccaacccc tgagccccac ttacgtcttt gcagcttcaa
6301 catccagatg tgtgttggag ccactgggca caacattcct cagaagctat gtgagtggca
6361 tgaagggggc aggagggagg tgggcttgga ctcccccgga ggctggccag ggaggtcctg
6421 actcttctgc ttgccctgcc agccctaaat gccgagctgc agctggaccg gcagaagccc
6481 cgccagggcc ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctgaacctg
6541 gatctgggcg gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagtacgc
6601 ccaggcaggg gattggcagg gctgggagag tagaacttac ccactggact tgttcatcta
6661 gccctggggc actgagctgg gtgctgtgag tccgggggtg gtcaggacac aggtgcctac
6721 tggccaggag aaggtgggat gtgtatggta gcaagatggc ctgactcttg cccctgtcct
6781 aggatgaggc agacttccgg gacaagctga gccccattgt gctcagcctc aatgtgtccc
6841 taccgcccac ggaggctgga atggcccctg ctgtcgtgct gcatggagac acccatgtgc
6901 aggagcaggt agggacaggc agggacaggc cagggaggtg caggacccct gatagcaaat
6961 caggattagg gttagtgcca agtcacaatg taaccccaaa accttgatgt cattccaaac
7021 cctaatgaaa acctcaaaat ccagccagtc atggtggctc acacctgtaa tcccagcact
7081 ttgggagacc gaggcaggca gattgcctga ggtcaggagt tagagaccaa cctggccaac
7141 atggtgaaaa cccatctcta ctaaaaatac aaaaaaaatt agccgggtgt ggtgacgcat
7201 gcctgtaatt ccagctactc gggaggctga agcaggagaa tcacttgaac ccaggaggca
7261 gaggttgcag tgagccaaga gtgtgccaca gcactccagc ctgggtgaca gagcaagact
7321 ctgtctcaaa aaaaaaaaaa aaagccaggc gcagtggcct cacgcctgta atcccagcac
7381 tttgggaggc caaggcgggt ggatcacgag gtcaggagat caagaccatc ctggctaaca
7441 cagtgaaacc ccgtctacta aaaatacaaa aaaaaaaaaa aaattagctg ggcgtggtgg
7501 cgggtacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaaccccgg
7561 gggcggacgt tgcagtgagc cgagatagtg ccactgcact ccagcctgga cgacagagcg
7621 agactccgtc tccaaaaata aaaaaacacc tgaaaatccc agtatcccct aagctctgat
7681 gtaaattgac aaaccctgac attgtcccaa acctccaaat ataacccgag ccccgatacc
7741 atctacaaac tccttttcgt cctcagatct tcttactccc taagccccta tgtgaacccc
7801 aagcccactg ttttcctaac cctgatgtaa tccctaaacc tcacacatcc ccaacttacc
7861 cgcacacccc aatgtgcccc tctagacacg aatcgtcctg gactgtgggg aagatgacgt
7921 atgtgtgccc cagcttcagc tcactgccag cgtgtgagga ggcctcccat tctgcccgac
7981 cctggccctt tctgcctatc atacctgctc cacaccttag tcccctcttt tcccacatcc
8041 tgggcccaga cccaggctcc ctggcttcac tcctctttcc ccacaggacg ggctccccgc
8101 tcctagttgg ggcagataat gtcctggagc tgcagatgga cgcagccaac gagggcgagg
8161 gggcctatga agcagagctg gccgtgcacc tgccccaggg cgcccactac atgcgggccc
8221 taagcaatgt cgaggtatgg cccccaccct gggaacagta cccgggacct gggaggcact
8281 ggagccttgg ctctctcatc tccctccctg agagtccctc ttctcttctg ctttgctgtc
8341 aaagatgtaa tttttttttt aatttggagg aggatacttg ctaatggtca gtcagaattc
8401 caaaactcta ttacaaaaac cagaaaaaca aaaaaggttt aggaaccaaa tgttaacagg
8461 aacctctgtt aacatttggt ggatttcctt ccagtctttt tttcaatatt gactcacact
8521 cacataagta tatatttatt ttttatgttg ttaatatagt ttataataat gggggtcata
8581 ctctaatgtt ttgtgttttt tatttccaaa atgaaaatgc ctaaaaagta gtagtgctac
8641 agcaatacac acactagcat gtgacagtcc cttgagcgac cccaccccaa gaaacccccc
8701 cctccctacc ttggcacaca aatctttcca gaccttccaa gggagcttaa atatatatat
8761 atgatgctct gtaatttctt tcttggaact gccttcctga agggctttga gagactcatc
8821 tgtaatcaga agaaggagaa tgagaccagg gtggtgctgt gtgagctggg caaccccatg
8881 aagaagaacg cccaggtgag gctgctgggt cgtggtaccg ggtctccacc aggggctcat
8941 gaataaccag attttagggg tgaggtttta gagccacata gttctgggcc agaatcttgg
9001 tcctcacact ccctttgcca acattgtcct tgggtgagtg actttccctc tctgagcccc
9061 tttaccagtg ggcttccagg taaaatagaa ataataatgg tggcctggtg cggtcgtcac
9121 gcctgtaatc ccagcactct gggaggccag agcgggtgga tcacgaggtc aggagttcaa
9181 gaccagcctg gccaacatag caaaaccccg tctctactaa aaatacaaaa attacccggg
9241 catggtggcg cacgcctata gtcagagcta ctcgggaggt tgaggcagaa aaatcacttg
9301 aacctgggag gtggaggttg cagtgagccg agatcatgcc actgcactcc agcctgggtg
9361 acagagtgag actccgtctc ggaaaaaaaa aaaaagaaaa agaatagtgg tgatcttgga
9421 gggtgaagac tggaggccac attcagggca gggctgtcct aagtggggca cttgggcagt
9481 gaccttggcc ctcctcatct cccagatagg aatcgcgatg ttggtgagcg tggggaatct
9541 ggaagaggct ggggagtctg tgtccttcca gctgcagata cggaggtact gacctggcga
9601 gcgtgcctac ccaccaccct tcccccgtct gacccccgtg cagagcccct caggtccctt
9661 ccatacagaa gggtctttcg aggccaggcg cagtggctca cacctgtaat cccagcacgt
9721 tgcgaggcca aggcagaagg atcactggag gtcaggagtt ggagaccagc ctggccaaca
9781 tggtgaaacc ccatctctac taaaatataa aattagctgg gcatggtggt gcgcacctac
9841 aatcccagct actcgggagg ctgaggcagg agaatagctt gaaccgaacc tgggaggtgg
9901 aggttgcagt gagctgagat tgggccactg cactccagcc ttccagcctg ggcgacagtg
9961 cgagattcta tctcaaaaga aaaaaaaaaa aaggtcttga agaagcctgg ttccctttct
10021 tcctcagaga tttagcgagt cttggagccc tagaggaagt tctttcccag gtctaacttc
10081 agtgtggcat gctctttgta taattagctc tctctgaact ctctaaaatt ctggcctcac
10141 ccccagaaag tcactgggct ggtgtccctg gccctgtttc tcctcatccc ctcccctcta
10201 gcaagaacag ccagaatcca aacagcaaga ttgtgctgct ggacgtgccg gtccgggcag
10261 aggcccaagt ggagctgcga gggtgagagg ccaggggtgg agaagggaga tggcattcag
10321 ggctctaaac tccagggggc gctggggaaa cctcacaggc caatcagggc atcacactct
10381 ctctgggggt cttgggcacc tgcaggaact cctttccagc ctccctggtg gtggcagcag
10441 aagaaggtga gagggagcag aacagcttgg acagctgggg acccaaagtg gagcacacct
10501 atgaggtatt ggggagcctc gcgtccctgg ctggggtgag cgggtcctca gaactccggg
10561 tgaggcgcta agctccccac accctgccac caccacccct tcagctccac aacaatggcc
10621 ctgggactgt gaatggtctt cacctcagca tccaccttcc gggacagtcc cagccctccg
10681 acctgctcta catcctggat atacagcccc aggggggcct tcagtgcttc ccacagcctc
10741 ctgtcaaccc tctcaaggta agagctgggt ggaagaaaga cctgggaagg cggccccaga
10801 ccaaccaccg gggcacctct gtgggctggg gttcggggga gacctgggcc tgaccactcc
10861 tttgcccccc caggtggact gggggctgcc catccccagc ccctccccca ttcacccggc
10921 ccatcacaag cgggatcgca gacagatctt cctgccagag cccgagcagc cctcgaggct
10981 tcaggatcca gttctcgtag tgagcaggct ctctggtctc gggcccggcc tccccgggac
11041 ccacggggca gaggggatgg gaggagggag aggggtccgg gtgtgctgtg ggcctctgtg
11101 ggccacgctt ggtccctggg agcacttcaa gtgaacatgg aggagcatgc tggcttgtgt
11161 ctggggtgag ctgaaagaca cttgcacttt ttaaaagctt cccagtacgt taaggagcat
11221 aaaacaatgc caaagcaagg ttatcataga tctgagcatt gtgcgctggg ggatgaccct
11281 ccctgcatct ctgggactat gtgagcaagc ccgtggaaag acagcatccg aagcttggat
11341 ccaaggccct tcctgatggg aaggccaccg cttcctgaac ccccggcccc ttctgcgttg
11401 ggtcctgggg gtaagggggt gggggatgat ggggtgatgg gccgggacgg ctggggactg
11461 acgatgcttc ccctcagagc tgcgactcgg cgccctgtac tgtggtgcag tgtgacctgc
11521 aggagatggc gcgcgggcag cgggccatgg tcacggtgct ggccttcctg tggctgccca
11581 gcctctacca ggtggggtgg gccgtggtgg ggcggggccg ggccttctgg gccgggacca
11641 ctttgctctg ggaggggcgg ggtttggtgt gggagggcag gaagagaggg aaggcaaggt
11701 ttactttggg ggattgcagt gggattaggt cagaggcagg gcttccccgc cgggtgtggg
11761 acctggactc cgtgcaacca ataggcctct tgtgggtgta aacggctttc aaccccaacc
11821 tgtccagagg cctctggatc agtttgtgct gcagtcgcac gcatggttca acgtgtcctc
11881 cctcccctat gcggtggccc cgctcagcct gccccgaggg gaagctcagg tgagtgtggg
11941 gggatggagc agagaccagt cctgcaggac ccattgtccc ccagtcagtg cccagccaga
12001 aaagtctgag gggtggtacg ggtgggtggc atggctggag gtcaccagcc tgaggtttga
12061 gtctttgtga aaggcaggtg tcaaggtgac tgaggagaca cgtgggtttg ccccaggtgt
12121 ggacacagct gctccgggcc ttggaggaga gggccattcc aatctggtgg gtgctggtgg
12181 gtgtgctggg tggcctgctg ctgctcacca tcctggtcct ggccatgtgg aaggtgaggt
12241 gtgaaggacg gtggagtccc cagcggggca caggcttggc tctgccctgc ctcacaggga
12301 gtcaaggaga gatggtggcc cacccaagtg ggtaatccag ggaccagggg tctatgtctc
12361 cactattaga atgtcattct cgtccagggg ggtggctcac acctgtaatc ccagcacttt
12421 ggcaggcaaa gcgtttagat cacctgaggt caagagttcg agaccagcct ggccaacatg
12481 gtgaaacccc atctctacta aaaataccaa attagccggg cgtgttgaca catgcctgta
12541 atctcagcta ctcgggaggc tgaggcagta gaattgcatg aacccaggag gcggaggttg
12601 cagtgagccg agatcacacc actgcactcc agcttgggca acagagcgag cctccatctc
12661 aaaaaaaaaa caaaaaaata gaatgtcttt ctctagtaga gcaaaaggca aaacaaacac
12721 aaaaatgtca ttctcctggg aacccttcca gacacatacc actggaaagg atagcacctg
12781 aaattctgag gcctttagac acccctgcca ccaaaaagat tcagaggata tagagggtat
12841 agagggtgta agtcctgcct tcaggaattc ctggctggtc tcaaggacaa gatgcacttc
12901 ttcctagccc tgcccttccc cttgagtgag gaagaggcca aggattggtc tagaccctat
12961 tccatacctt cctatgtggc cctggagggt cactcgctcc tctgcacctg gaggagtctc
13021 aagcacactg aagggaagac atggtgcttt tagggaaaac cacgcactag acccacaata
13081 atcaaataca tatcatcata tgctcgagtc atgcagacac aaacttcagt ataagaaaaa
13141 ttccaggctg ggcgttggtg gctcacaccg gtaaaatccc agcactttgg gaggccgagg
13201 tggg
//
LOCUS HUMGPIIB3 364 bp ds-DNA PRI 01-JUN-1990
DEFINITION Human platelet Glycoprotein IIb (GPIIb) gene, exon 30.
ACCESSION M34344
KEYWORDS platelet Glycoprotein IIb.
SEGMENT 3 of 3
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 364)
AUTHORS Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S.,
Schwartz,E. and Poncz,M.
TITLE Organization of the gene for platelet glycoprotein IIb
JOURNAL Biochemistry 29, 1232-1244 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 110 169 platelet Glycoprotein IIb (GPIIb), exon 30
pre-msg < 1 > 364 GPIIb mRNA and introns
IVS < 1 109 GPIIb intron CC
BASE COUNT 68 a 111 c 98 g 87 t
ORIGIN About 600 bp after segment 2.
1 ttctgcgctg gtccagggag gtgctcatat gctagcatac ttcctcacat gtgctctggg
61 gccagcaaat catctgtata ccctgacctt ggcccccgtg tacccccagg tcggcttctt
121 caagcggaac cggccacccc tggaagaaga tgatgaagag ggggagtgat ggtgcagcct
181 acactattct agcaggaggg ttgggcgtgc tacctgcacc gccccttctc caacaagttg
241 cctccaagct ttgggttgga gctgttccat tgggtcctct tggtgtcgtt tccctcccaa
301 cagagctggg ctaccccccc tcctgctgcc taataaagag actgagccct gatgctgagc
361 atgc
//
LOCUS HUMIMPH 2858 bp ss-mRNA PRI 01-JUN-1990
DEFINITION Human IMP dehydrogenase type 1 mRNA complete cds.
ACCESSION J05272
KEYWORDS IMP dehydrogenase.
SOURCE Human spleen, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2858)
AUTHORS Natsumeda,Y., Ohno,S., Kawasaki,H., Konno,Y., Weber,G. and
Suzuki,K.
TITLE Two distinct cDNAs for human IMP dehydrogenase
JOURNAL J. Biol. Chem. 265, 5292-5295 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
Y.Natsumeda, 15-FEB-1990.
FEATURES from to/span description
pept 601 2145 IMP dehydrogenase type 1 (EC 1.1.1.205)
signal 2843 2848 poly-A signal
BASE COUNT 545 a 923 c 860 g 530 t
ORIGIN
1 tcggaagggg ccaggagaca ctggaaggtc cggacggcag ggaaggggac ggggttcttt
61 ccagtcccac ccgtgtaggg acacctctcc ccctcatccc ccgatgtacc ctcgctgaat
121 ctgggatggg agagacgaac cgagtctagg catctgcgta gcagcgccgg ggagagcggg
181 gagcccaggc ggagcccagt cgactcccgg attcccctgc cccgcccccg gcacgaggcc
241 ccgccccggc gccccgcccc tcctcgggac tcgaccgggc tgcgctcact gcccagccgg
301 ggccccggga gcctccaggc tcgcccgccc tgagctgcgg cctccgcatg gagggccact
361 cactccacca ccgctgcagg gaggcggacg gcgctgttcc ggagcccgga gcccggcaac
421 acccgggaca cgagacggcg gcgcagggct acagcgcccg actgctgcag gccggctacg
481 agcccgagag ccctagattg gacctcgcta cacacccgac gacaccccgt tcagaactat
541 cttcagtggt cttactggca ggtgttggtg tccagatgga tcgccttcgc agggctagcc
601 atggcggact acctgatcag cggcggcacc ggctacgtgc ccgaggatgg gctcaccgcg
661 cagcagctct tcgccagcgc cgacgacctc acctacaacg acttcctgat tctcccagga
721 ttcatagact tcatagctga tgaggtggac ctgacctcag ccctgacccg gaagatcacg
781 ctgaagacgc cactcatctc ctcccccatg gacactgtga cagaggctga catggccatt
841 gccatggctc tgatgggagg tattgggttc attcaccaca actgcacccc agagttccag
901 gccaatgaag tacgcaaggt caagaacttt gaacagggct tcatcacgga ccctgtggtg
961 ctgagcccct cgcacactgt gggcgatgtg ctggaggcca agatgcggca tggcttctct
1021 ggcatcccca tcactgagac gggcaccatg ggcagcaagc tggtgggcat cgtcacctcc
1081 cgagacatcg actttcttgc tgagaaggac cacaccaccc tcctcagtga ggtgatgacg
1141 ccaaggattg aactggtggt ggctccagca ggtgtgacgt tgaaagaggc aaatgagatc
1201 ctgcagcgta gcaagaaagg gaagctgcct atcgtcaatg attgcgatga gctggtggcc
1261 atcatcgccc gcaccgacct gaagaagaat cgagactacc ctctggcctc caaggattcc
1321 cagaagcagc tgctctgtgg ggcagctgtg ggcacccgtg aggatgacaa ataccgtctg
1381 gacctgctga cccaggcggg ggtcgacgtc atagtcttcc actcgtccca agggaattcg
1441 gtgtatcaga tcgccatggt gcattacatc aaacagaagt acccccacct ccaggtgatt
1501 ggggggaacg tggtgacagc agcccaggcc aagaacctga ttgatgctgg tgtggacggg
1561 ctgcgcgtgg gcatgggctg cggctccatc tgcatcaccc aggaagtgat ggcctgtggt
1621 cggccccagg gcactgctgt gtacaaggtg gctgagtatg cccggcgctt tggtgtgccc
1681 atcatagccg atggcggcat ccagaccgtg ggacacgtgg tcaaggccct ggcccttgga
1741 gcctccacag tgatgatggg ctccctgctg gccgccacta cggaggcccc tggcgagtac
1801 ttcttctcag acggggtgcg gctcaagaag taccggggca tgggctcact ggatcccatg
1861 gagaagagca gcagcagcca gaaacgatac ttcagcgagg gggataaagt gaagatcgca
1921 cagggtgtct cgggctccat ccaggacaaa ggatccattc agaagttcgt gccctacctc
1981 atagcaggca tccaacacgg ctgccaggat atcggggccc gcagcctgtc tgtccttcgg
2041 tccatgatgt actcaggaga gctcaagttt gagaagcgga ccatgtcgcc ccagattgag
2101 ggtggtgtcc atggcctgca ctcttacgaa aagcggctgt actgaggaca gcggtggagg
2161 ccgaggtggt ggaggggatg caccccagtg tccacttttg ggcacaggct ccctccataa
2221 ctgagtggtc cacagatttg cactacgggt tctccagctc ctttccaggc agagaggagg
2281 ggaggtcctg aggggactgc tgcccctcac tcggcatccc ctgcagagtc aggactgctc
2341 ccgggggcca ggctgccctg ggaggccccc tccgagacca gccagccagg ctctcaggac
2401 ctgcgctgcc ttaggatctt tcttgctgca gcctgctcca gcctggcccc caccccaggg
2461 gcaggcggcc cctcctggct tctcctgtag ggcacctccc tgcccctagc ctcccagcaa
2521 atggtgctct cctggccctg ctctggccct tcccgggccg tgcccctcag ccatgtggca
2581 cttctgagct cctgacctag gccaagggga ggtctctgcc cccttccccg gccctgggct
2641 acccttgggt cctgctcctc aggccgctcc cctgtccctg gccatgggta ggagactgcc
2701 ctggtcatgg ccgcctgcct gtcattcctg actcaccacc gtccccaggt gaaccattcc
2761 tcccttctcc tcagctgcag tcgaaggctt taactttgca cacttgggat cacagttgcg
2821 tcattgtgta ttaaatactt ggaataaatc aagcaggt
//
LOCUS MBIFTRX 1260 bp ds-DNA BCT 01-JUN-1990
DEFINITION M.thermoautotrophicum tetrahydromethanopterin formyltransferase
(FTR) gene, complete cds.
ACCESSION J05173
KEYWORDS tetrahydromethanopterin formyltransferase.
SOURCE M.thermoautotrophicum.
ORGANISM Methanobacterium thermoautotrophicum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanobacteriaceae.
REFERENCE 1 (bases 1 to 1260)
AUTHORS DiMarco,A.A., Sment,K.A., Konisky,J. and Wolfe,R.S.
TITLE The formylmethanofuran: Tetrahydromethanopterin formyltransferase
from Methanobacterium thermoautotrophicum delta-H: Nucleotide
sequence and functional expression of the cloned gene
JOURNAL J. Biol. Chem. 265, 472-476 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 153 unknown protein (AA at 1)
pept 211 1101 tetrahydromethanopterin formyltransferase
BASE COUNT 371 a 260 c 305 g 324 t
ORIGIN
1 ggatccttgg agggagatga gggagatgag ggtatagtaa atatttttgt tctctggtgc
61 tgtaataacc atatgtttat cacccggcgc ataaccctct tattattcct aggggccctc
121 agattcatgg ttttacagtt atttctaagc tgaattgaaa gtagaaaaat atattaaaca
181 ttctcccagt aaatcaacta ggtgatagtt atggagataa atggtgttga aatagaagac
241 acatttgcag aggcctttgg cataaaggtt tcaagggttc ttgtaaccgc aacaaaaaaa
301 cttgccaaaa tagcagcaac cgaggcaaca ggttacggta cctcagttat agggtgccct
361 gcagaggctg gcatcgactg ctatgttcca cctgaggaga cacctgacgg aagaccaggg
421 tacattataa tgatatgcaa cccatccaag aagagccttg accatgagct ccttgaaagg
481 atagggatgg gtatactgac agcaccaacc accgctgtct tcgatgcact ggatgatgag
541 gatgagaagc tcaacatagg attcaaacta aaattcttcg gtgacggcta cgagaaggaa
601 cttgaaattg atggaagaaa gatccactca atcccaataa tgtcagggga cttcctgatt
661 gaaagccagt tcggaataaa ggatggagtt gccggcggaa acttcttcat aatgggtgac
721 agtcaggcat cagcgctact ggctgcacag gcagctgttg atgccatagc tgcagttgaa
781 ggtactgtaa caccattccc tggtggagtg gttgcttcag gatcaaaggt cggctcaaac
841 aagtacaagt tcctcaacgc ctcaaccaat gagaagatgt gtgtgaccct gaaggacgaa
901 gttgaggaca cccagatacc tgagaacgtc aacggggtct acgagatagt catagatggt
961 gttgatgaag aggccgtaag ggaggccatg aaggaaggta taaaggctgc ctgcacggtt
1021 ccaggcataa taaagataag tgcaggaaac tatggcggta acctgggagc ctacaagata
1081 aaactccatg acctcttctg atttttttat ccatgccctg atcctatcag ggttgacctt
1141 tttttattct atgcagaatt tttccatcat ttcgctttca actatttccc atatctcgtc
1201 tcgcggaggt taacaccgag agtatacccc ttttcctgag gtcatcttca attttcatta
//
LOCUS PFAROP1 3061 bp ds-DNA INV 01-JUN-1990
DEFINITION P.falciparum rhoptry associated protein 1 (rop1) gene, complete
cds.
ACCESSION M32853
KEYWORDS rhoptry associated protein 1.
SOURCE P.falciparum (strain K1) blood stage DNA, clones lambda rop[1-3].
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 3061)
AUTHORS Ridley,R.G., Takacs,B., Lahm,H.-W., Delves,C.J., Goman,M.,
Certa,V., Matile,H., Woollett,G.R. and Scaife,J.G.
TITLE Characterisation and sequence of a protective antigen from
Plasmodium falciparum
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by R.G.Ridley, 15-MAR-1990.
FEATURES from to/span description
pept 628 2976 rhoptry associated protein 1 (rop1) precursor
sigp 628 695 rhoptry associated protein 1 (rop1) signal
peptide
matp 696 2973 rhoptry associated protein 1 (rop1)
BASE COUNT 1266 a 335 c 357 g 1103 t
ORIGIN
1 gtatacacac atacaaatat gtaaaaatag acatttcaaa atttttaata taaaataaca
61 tatatatata taaataaata gtaaaattaa ttttttttat ttcatcattt tatatataca
121 tatatatata tatatatata tatatattta tttatttatt tatatacata tataagttat
181 ttatttattt atttcctttt tggttgtatt ttcgtttttt tgattctatt tatatattta
241 tagtaatatt tataattata attattttat ttaatttttt atctttttca ttttatatat
301 taataacttt cttttaaata ttatgatatt aatttttatt ttttagtatt tatataagat
361 tttaaaatag acaaaataaa tataaataaa ttatatatta tatatatata tatatatttt
421 ttttttttta tagttatgca catatatatg tattaatata tattatcttt aaactattag
481 aatatatact ttattattct aaatattatt aatttgtaat actcacatat atatatatat
541 atatatatat atttatttat ttatttattt atttatttat ttatatagta aaaaatcaat
601 taattttttt tttttttttt atatataatg agtttctatt tgggtagctt agtaataata
661 ttccatgtac tcttccgtaa tgtcgctgat ggtataaatg taaacggaga taataattat
721 gggaaaacaa taatcaataa tgatttcaat tttgatgatt acaattattg gacaccaata
781 aataaaaagg aatttttaaa ttcctatgaa gatgaatttt caagtgaatc ctttttagaa
841 aataaatcta gtgttgatga tggaaatata aatttaacag atacaagtac atcaaataaa
901 agttctaaaa aaggacatgg tagaagtaga gtaagatcag catcagctgc tgcaattctt
961 gaagaagatg attcaaaaga tgatatggaa tttaaagctt ctccttcagt tgttaaaaca
1021 tctactccat caggtacaca gacatctggt ttaaaatcat ctagtccatc tagtacaaag
1081 tcatcaagtc catcaaatgt aaaatcagct agtccacatg gtgaatctaa ttcttctgaa
1141 gaaagtacta ctaaatcctc aaagagaagt gcttcggttg caggtattgt aggtgccgac
1201 gaagaagcac ctcctgcacc aaaaaacacc ctcactccat tagaagaatt atatcctact
1261 aatgttaatt tatttaacta taaatattca ttaaacaata tggaagaaaa tatcaatata
1321 cttaaaaacg aaggagattt agttgcacaa aaagaagaat ttgaatatga tgaaaatatg
1381 gaaaaagcta aacaagacaa aaaaaaagca cttgagaaaa taggaaaaca atcagacgaa
1441 gaacctttta tgttttcaga aaataaattt cttgaaaatc aagtaaaaga aagaaatgtt
1501 gctggatcct tttctcgatt tttcagtaaa ttaaatcctt ttaagaaaga tgaagtaata
1561 gaaaaaactg aagtatcaaa gaaaacattt tcaggtatag gttttaatct tactgacaaa
1621 gaagctaaag tattaggtgt aggtgcaacc tatcaagaat atccagaaac catgttatat
1681 aactgtccaa acaattctaa tttgtttgat actatagaat cattacaagg aagaataatt
1741 gatattaaaa aaagagaaag catgatatca acaactttcg aacaacaaaa agaatgttta
1801 aaaaatatgg gtgtacttga tcttgaatta aacgatacac aatgtaaatt tggtacatgt
1861 ataggtagct ttggagaaca tcatcttaga ttatacgaat ttgagaatga cttatttaaa
1921 tttcatccaa atattgatta tttaacttta gctgatggat ataaattaca aaaaaatcat
1981 atatatgaat tatcccatgt aaacttttgc ttattaaatc ctaaaacatt agaagaattt
2041 ttaaaaaaaa aagaaatcaa ggatcttatg ggtggtgatg atcttataaa atataaagaa
2101 aattttgata actttatgag tatatctata acatgccata ttgaatcttt aatatatgat
2161 gatattgaag catctcaaga tattgctgct gtattaaaaa ttgctaaaag taaattacat
2221 gtaataacat caggtttatc atataaagca agaaaattag tatataaaat ttatagtgaa
2281 attcaaaaaa atccagatga actctatgaa aaattaacat ggatttatga taatatctat
2341 atgattaaaa gatattatac tgcatatgct ttagaaggtg tctgttcata tcttgaacat
2401 gataaaagtc aaatgtatac agaattacat atttataaca aaatagtcga ctctgttcgt
2461 tattatagtt catgctttaa aaacgttatt gtttataatg ctatcatttc tggtatacat
2521 gaaaaaataa aacatttctt aaaattagta ccaagacaca actttctttt ggattatcac
2581 tttaattcaa tttttgaaaa agaaattaaa ccagccaaaa aatatagtac ttcacatatt
2641 tattttgatc caactgttgc atcatatgct tattataatt tagatagaag aaccatggtt
2701 actattatta atgattattt cgaagcaaaa aaaaaagaat taaccgttat agtatctcgt
2761 atgaaaacag atatgctcag tcttcaaaat gaagaatcaa aaataccaaa tgacaaaagt
2821 gcaaattcaa aactagctac aagattaatg aaaaaattta aagctgaaat cagagatttc
2881 ttcaaagaaa tgcgtataca atatgctaaa ttaataaaca tacgttacag atctcactta
2941 aagaaaaact actttgcctt caagagatta gattaagaat ataaacttga aaaatatata
3001 taatgtaaaa atatactaaa tatattaata taatatatat atatatatat atatatatat
3061 a
//
LOCUS STYFLICD 350 bp ds-DNA BCT 01-JUN-1990
DEFINITION S.typhimurium phase-1 flagellin C and D (fliC, fliD) genes,
partial cds.
ACCESSION M33541
KEYWORDS phase-1 flagellin.
SOURCE S.typhimurium DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 350)
AUTHORS Inoue,Y.H., Kutsukake,K., Iino,T. and Yamaguchi,S.
TITLE Sequence analysis of operator mutants of the phase-1 flagellin-
encoding gene, fliC, in Salmonella typhimurium
JOURNAL Gene 85, 221-226 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 313 > 350 phase-1 flagellin C
pept 56 < 1 (c) phase-1 flagellin D
BASE COUNT 120 a 63 c 78 g 89 t
ORIGIN
1 aacaactggt ctaacggtaa gtttgatccc acacctaatg atgaaattga agccatgcct
61 tcttcctttt tgattgcaaa cagtagttaa gcgcgttatc ggcaatctgg aggcaaagtt
121 taatgataat tttgcaaaaa taatgcgcgg aataatgatg cataaagcgg ctatttcgcc
181 gcctaagaaa aagatcgggg gaagtgaaaa attttctaaa gttcgaaatt caggtgccga
241 tacaagggtt acggtgagaa accgtgggca acagcccaat aacatcaagt tgtaattgat
301 aaggaaaaga tcatggcaca agtcattaat acaaacagcc tgtcgctgtt
//
LOCUS SYNCRPBS1 88 bp ds-DNA SYN 01-JUN-1990
DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA.
ACCESSION M33414
KEYWORDS Artificial promoter; cyclic AMP activated promoter.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 88)
AUTHORS Aiba,H., Hanamura,A. and Tobe,T.
TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of
Escherichia coli
JOURNAL Gene 85, 91-97 (1989)
STANDARD simple staff_entry
BASE COUNT 24 a 22 c 18 g 24 t
ORIGIN
1 aaatgtgatc tagatcacat ttcaagcttg actgactgag atcagcttgc aacctttccg
61 gtcgcggcga taaagacatc ttcaccgt
//
LOCUS SYNCRPBS2 88 bp ds-DNA SYN 01-JUN-1990
DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA.
ACCESSION M33415
KEYWORDS Artificial promoter; cyclic AMP activated promoter.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 88)
AUTHORS Aiba,H., Hanamura,A. and Tobe,T.
TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of
Escherichia coli
JOURNAL Gene 85, 91-97 (1989)
STANDARD simple staff_entry
BASE COUNT 25 a 24 c 18 g 21 t
ORIGIN
1 aaatgtgatc tagatcacat ttggaattcc cggggatccg ggccctcttc aaacaggccc
61 agttcgccaa taaaatcacc tgattcag
//
LOCUS SYNCRPBS3 56 bp ds-DNA SYN 01-JUN-1990
DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA.
ACCESSION M33416
KEYWORDS Artificial promoter; cyclic AMP activated promoter.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 56)
AUTHORS Aiba,H., Hanamura,A. and Tobe,T.
TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of
Escherichia coli
JOURNAL Gene 85, 91-97 (1989)
STANDARD simple staff_entry
BASE COUNT 17 a 10 c 14 g 15 t
ORIGIN
1 aaatgtgatc tagatcacat ttggagtgcc tgaatgggcg tacagttatg aaaccc
//
LOCUS SYNCRPBS4 56 bp ds-DNA SYN 01-JUN-1990
DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA.
ACCESSION M33417
KEYWORDS Artificial promoter; cyclic AMP activated promoter.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 56)
AUTHORS Aiba,H., Hanamura,A. and Tobe,T.
TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of
Escherichia coli
JOURNAL Gene 85, 91-97 (1989)
STANDARD simple staff_entry
BASE COUNT 20 a 10 c 9 g 17 t
ORIGIN
1 aaatgtgatc tagatcacat ttggaattcc aaatgtgatc tacagttatg aaaccc
//
LOCUS TRFCRE1 3940 bp ds-DNA INV 01-JUN-1990
DEFINITION C.fasciculata retrotransposable element I (CRE 1) gene
complete cds.
ACCESSION M33009
KEYWORDS integrase; retrotransposable element I; reverse transcriptase.
SOURCE C.fasciculata.
ORGANISM Crithidia fasciculata
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 3940)
AUTHORS Gabriel,A., Yen,T.J., Schwartz,D.C., Smith,C.L., Boeke,J.D.,
Sollner-Webb,B. and Cleveland,D.W.
TITLE Rapidly rearranging retrotransposon within the miniexon gene locus
of Crithidia fasciculata
JOURNAL Mol. Cell. Biol. 10, 615-624 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Gabriel, 21-MAR-1990.
FEATURES from to/span description
ORF < 416 3837 ORF (AA at 2)
rpt 1 415 mini-exon gene repeat
site 386 415 target site duplication
site 415 415 mini-exon splice junction
rpt 3899 3940 mini-exon gene repeat
site 3899 3927 target site duplication
BASE COUNT 890 a 1032 c 1330 g 688 t
ORIGIN
1 aagcttccgg aaacaaccgg cacaaatttt gaggcggaag cgctgctttt ttttgtgtcc
61 gggggggtgc tccttggggt ccccctgtcc agccccagcc ggtcgcccac cacataggaa
121 tttgcgaagg acccccaaaa atcccggtcc ccggggcgag ttgtcccaac tttttcaaac
181 ctcatgaaga gctagttgcg tcattgaaaa gttcgtgtgc agaaaccccc tcccccacgt
241 ttgtacaatg gaagagttta cgatacaggt tttctcacgg ttttgaggtg ttttttcgaa
301 aaacaaaaaa tatagaggtg tatagcgctt atttttgaca cccccctcaa aacatgctgg
361 gggtataggt ccttccaact aacgctatat aagtatcagt ttctgtactt tattgaacgg
421 cattcggtct agtgggtcca ctcccaacct tctcctcctt ggtctcgggt tcgattccgg
481 tcgggcacga aactctcttc ctctctaaac tccacatata catccacaat aaccactctc
541 ataactgttc tggtgcgggg ccatttcgaa ccatattgac atttggcgca ccaaactcac
601 acatactaac caaaaatggc agagccaccc ggggcgacgg tccgagcgtg gcggagcacc
661 tcaagcgccg gcgggtcgaa ggatctgagc cggtcgtggt ggggtctcgc caggagggtc
721 tctccggtga gtcagcagtc gaagcgattg ttgtggaaag tggcagcgag gctgacgaag
781 aatccacggc gagggggctg ggagcctctc gacaagcgcc aggtgatgac tcgggcgggc
841 cggtgcccag ggcggaggag gacctcgccc cggttgggta cctgtatccc gagaacctgg
901 ccgtcccaca gggggcggcg tgtccggtgg tgggttgtgg gtaccgaccg aacacccggg
961 tgggaccgag actggtggaa cacctgaaca cggtgcaccg ggatatcctg ggcaacattc
1021 ctgtcgacgc ctggcgacgt cagggactcg tccgctgcct acggtgtggc tcggccttga
1081 ccgcgtccgg tcatggtcgg ggtgcacacg gtgggaagtg tggtccgtac agaagcagaa
1141 acgcggccat tagggctcgg acgcagagtt tctttggtgg aatttcgcag aattctgaca
1201 cacttacggc gggaaacggg gtctcagaag ctggtggtct agtggaagtg ggtagggtag
1261 aagacccaac cacagacccg tggtacaggg ttcgaacccc cttgaagcga caaatttacc
1321 gtacggactt ccgggtgtgg cagggcttgg cccggccagt actcctgggg tactctctgg
1381 cggatacagc agggaaggag ggacgccttc tcgccctgct caatctcccc agggaccatc
1441 tggaggtcca ggtaaatgct aagaggggaa tccaaccaca gccggctgaa atcgaggccc
1501 aggttcgaag gaaggtagtg gagttggcag ggataggagc ggtaggacga gcgatggcag
1561 tgatgactcg ggggcggctg gtcgaggtgc ccctggagcg ggtgatggag cagctcgagg
1621 agctccaccc gcaggaggac ccgcggggtt atccggcagc accagatacg agcgaggtgc
1681 tgcgggccaa ggagcagaag gtgcggcggg cgatcgcggc acggatgggg aggggtacgg
1741 cgcccggcct cgatggctgg acgcgggagc tcctcctccc cctcgcagaa gacccggccc
1801 tgctacacga gatcacgtcg gtggtctcgg atatcatgca ggggaaggtg gccgaggtgg
1861 tggcgcggag gctgcggagc agcgccgtca ccccgattcc gaaggacgag gcggggacga
1921 agatacgccc gatcgtgccg gagtcggcct ggctgaagct ggcctcgctg gtggcgatgg
1981 cggagatacc atccagcttc aaggagacct tcaaggggtg gcagtacggg gtctgggggg
2041 acgtcgccaa agcagtggcg aagatccgcc gggacagcga ggagcacgag tacctggtgg
2101 cactcgacgg ggtcaatgca tacaatacga tgagcagggc ccacatcctc caagccgtgt
2161 acgccgagca gcgcctgaag ccgatctggg gggtggtgaa ggtggcgctt ggggggccgg
2221 ggttcctggg agtatacagg gacggctgcc tcaagggcaa cctgtggtcc accaagggaa
2281 tccggcaggg catggtgctg ggccccctcc tgtacgcgac cgggatggca gcggccatcg
2341 ggccggtacg gcagcgtatc cccggggtcc ccgtgacggc ctacatcgac gacatcaccc
2401 tcgcggccag cggggcggag ggagccaggg cagccgaggc atacgcagac gccctcgaga
2461 cggtcggggt ggtcaccaac gccaggaagt cgatggtggt ggggccagaa ggcacccggg
2521 tggggatcgg gggtgtagac ctgccggtgg tggccgaggc ccggatcctg ggggcccact
2581 tccgagcaag ggggacacca gaggcccgta ccatcgagtg gctgcaggcg gccgtcgaga
2641 agtggcgacc catccaccag aagctgcggc aggacatcat cccgaagaac attgcgatga
2701 tgatgacccg catcagcctg gggtccaaga tgaccttcct cctccagacc cactcgccac
2761 aggaactgga gaccgcagcg aagacggcag acgacgaggt cgagcagacc ctccagcacc
2821 ttatggggca ggtagagatc acgccccgag caaggctgct ggcacaactc ccgatcagag
2881 agggggggct gggtctccgg cgaagcagcg agattgcgaa gttcgcacag gcagacgtgg
2941 ggcaaggcga ggcccaccag gcacacacaa aggcactaga tgaagggatc aagcaccagt
3001 tacaaccact cctctcggag tccgaggtgc agatcctgaa gtcgaacgct ggaatggggg
3061 ccgggcgggt gctgacagat agtagcctga ggatcccaga cgtggcagca acaatcgcgc
3121 tgagggagag actcctgctc agggtgctcc cggagggatg cagtgtgtgt gtggggggga
3181 cgcgacgaac taccatgtac acacgtgctc caacataccc accaagcccc ggacccgacg
3241 acacgacggg gtggtggatg agctggtggc cctggccagg aagatggggt acgagcccag
3301 caaggagccg agggcggacg ttgacgagta gggcgaggcc ggacctgtac atcacaggaa
3361 gcctgaagcc ggcggcgacg gatgtaacaa taacctaccc gggcaggcag gcgaggggag
3421 cacactcccg ttgcagcagg cctaccggaa taagatgggg gcctgggagg catgggggaa
3481 cctgcgaggg gtggacatgc agccggtggt cctcgggacg aacgcggaga tacacccgga
3541 gagtgcgaat ggatacgaag gttgacctcg gtcgaagaca aagacaaaat acataccagt
3601 tacaacgagg tgacgggacg aatcgtggag acggtgttgg ttgggaacgt ggagctgttc
3661 aacgcagtga cgaacctggc gctggtcagg gagttgatgt aggaaggtac gggtgggtag
3721 tagtagaata tcgaacggcg actaggaagt tggaccattg ttacttgtta cttgtgattc
3781 tgacgacgac gaaagattgt tccttgttac ttgttaccct gaaatttgat attacttgaa
3841 ttgatatgat atgattgata caaaaaaatt taaaaaaaaa aaaaaaaaaa aaaaaaaaat
3901 ataagtatca gtttctgtac tttattggta taagaagctt
//
LOCUS SQDKINEA 3693 bp ss-mRNA INV 01-JUN-1990
DEFINITION Squid kinesin heavy chain mRNA, complete cds.
ACCESSION J05258
KEYWORDS kinesin; kinesin heavy chain.
SOURCE Squid, cDNA to mRNA.
ORGANISM Loligo pealii
Eukaryota; Animalia; Eumetazoa; Mollusca; Cephalopoda; Coleoidea;
Teuthoidea; Myopsida; Loliginidae.
REFERENCE 1 (bases 1 to 3693)
AUTHORS Kosik,K.S., Orecchio,L.D., Schnapp,B., Inouye,H. and Neve,R.L.
TITLE The primary structure and analysis of the squid kinesin heavy chain
JOURNAL J. Biol. Chem. 265, 3278-3283 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 421 3324 kinesin heavy chain
BASE COUNT 1220 a 684 c 822 g 967 t
ORIGIN
1 aaaagaaatc tttctaggcg gccataagtc gctgttctac aaataatacc gagatcgtca
61 taacaaaccg cctatttaac acttccgaac aacttttata gaatatttac gactgttttg
121 aatagtatat ttagctgtca ggctgcgatt tgcttccttt tatttattta tttatttatt
181 tatatttttt gtagtagaaa tttacatttt ttttttttgc gggggagacg tcaggatatt
241 attacctact tttgtttgtg cgcgtgtgtt gaagaaacga aagacgttca agctttattg
301 gattagagaa ttgataaaga tcggattaac tgtataataa cgctaaactt taatttattt
361 gggacctagg aaagtcccat ataaacccaa gaatttatat cgtcctcaaa caacgccgcc
421 atggacgtgg cttccgagtg taacattaaa gtcatctgcc gagttagacc ccttaacgaa
481 gccgaggaac gggccggcag caagttcatc cttaaattcc caactgatga ttccatctct
541 atcgctggga aagtgttcgt atttgataag gttttaaaac ctaatgtctc acaagaatat
601 gtgtacaatg ttggtgctaa gcccattgtt gctgatgtac ttagtggttg caatggaaca
661 atttttgctt atggacaaac atccagtggg aaaactcata ctatggaggg tgtgctggac
721 aaaccctcca tgcacggcat cattccacgc attgtacaag acatcttcaa ctacatttat
781 ggaatggacg aaaacttgga gtttcatatc aagatttcat attatgaaat ttacttggac
841 aaaatcaggg atcttttgga tgttaccaag accaacttgg ctgtccatga agacaaaaac
901 agagtcccgt ttgtcaaggg tgccactgaa cgatttgtat ccagtcctga agaagtaatg
961 gaagttattg atgagggtaa aaacaacaga catgtagctg tgacaaacat gaatgaacac
1021 agttcgagaa gtcatagtgt atttttaatc aatgtcaagc aagaaaatgt tgaaacccag
1081 aaaaaattaa gtggtaaact ttacctagta gatcttgctg gtagtgaaaa agtaagcaaa
1141 actggtgcag aaggtgcagt ccttgatgaa gctaaaaaca tcaataaatc tttgtcagct
1201 ctgggaaatg tcatttcagc tcttgctgat ggaaataaat ctcacgtccc atacagagac
1261 agtaaattga cgagaatctt gcaggaaagt ttgggtggta atgcccgaac cactatggtc
1321 atatgttgtt cacctgcctc ttacaatgaa tccgagacca agtctacttt gctttttggc
1381 caaagagcta agactattaa gaatgtggta tcggttaatg aggagcttac agctgatgaa
1441 tggaaacgaa gatatgagaa agaaaaagaa agggttacta agcttaaagc cactatggct
1501 aagcttgagg cagaactaca gagatggaga actgggcaag ctgtatctgt agaagaacaa
1561 gttgatctta aggaagatgt ccctgctgaa tctcctgcca ctagtacaac tagcctggct
1621 ggaggactca tcgcttcaat gaatgagggt gaccgcaccc aactggaaga agagcgtctc
1681 aagctttatc aacagttgga tgacaaggat gatgaaatta acaatcagtc tcagttaatt
1741 gaaaaattga aggagcagat gatggaacaa gaagatttaa ttgcccaatc tcgtcgtgat
1801 tatgaaaatc tacaacaaga tatgtctcga atccaagctg acaatgagtc tgccaaggat
1861 gaagtcaaag aggtcttgca agctcttgag gaactggcca tgaactatga tcagaagtct
1921 caggaggtgg aagacaaaaa taaggagaat gagaatctaa gtgaggaact taaccagaaa
1981 ttgagcacct tgaatagctt acagaatgaa ttggatcagc tgaaggattc atctatgcac
2041 cataggaaac gagttactga catgatgatt aacttgttga aagatcttgg tgatattggt
2101 actattgtag gtggaaatgc tgcagaaact aagccaactg ctggttctgg tgagaaaatt
2161 gaggaggaat tcactgtagc tcgtctgtat attagcaaaa tgaaatctga ggtgaagacg
2221 ttggtgagcc gaaacaacca actggaaaat actcagcagg acaattttaa gaaaattgaa
2281 acacatgaaa aggatttgtc caactgcaaa ttactcattc aacaacatga agccaaaatg
2341 gctagtttac aggaagccat taaagattct gagaacaaga agaggatgtt ggaagacaat
2401 gtggattctc taaatgagga atatgccaaa cttaaagctc aagaacaaat gcatttagct
2461 gctctgtccg aacgggagaa ggagaccagc caagcaagtg aaactcgaga agttcttgag
2521 aaacagatgg aaatgcatcg ggaacaacac caaaaacaac ttcaatcttt acgggatgaa
2581 atttcagaaa agcaggctac tgtggataat cttaaagatg ataaccagag actttctttg
2641 gccctggaga aacttcaggc tgattatgac aaactcaaac aagaggaagt tgaaaaggct
2701 gccaagttgg ctgacttatc attgcagatt gaccgtcggg agcaagccaa acaggatctc
2761 aaaggacttg aagagactgt ggcaaaggag ctgcaaacac tacacaatct tagaaaactt
2821 tttgtacaag atctacaaaa taaggtcaaa aagtcatgca gcaagaccga agaagaagat
2881 gaagatactg gtggtaatgc tgcccagaaa caaaagattt ctttcttaga aaataatctt
2941 gagcagctga ctaaggttca caagcagctt gttcgagaca atgctgactt gcgctgtgag
3001 ttgcccaagc tggagaaaag gttacgtgct actatggagc gagtcaagtc cttggaatct
3061 gctctcaagg atgccaagga aggcgctatg agagacagaa aaaggtacca gcatgaggtg
3121 gatagaatca aggaagctgt ccgccagaag aatctcgctc gccgaggaca tgctgcacaa
3181 atagccaaac caatccggcc tggacaacac cagtctgttt ctcctgcgca ggcagcagct
3241 attcgcggtg gtggtggttt gtcccagaat ggaccaatga ttacatctac gccaattcga
3301 atggcacctg aaagcaaggc ataaatatct agttggaaga ggaattttta ctcccattca
3361 tcaaagaaca acagaactac aaagctttct agtgcattca ttgataattg aaatggagga
3421 tatgtatctc ctcctcctct actattatat ctcctcctcc tacatgaagc tgcattccgt
3481 tttgttagcg tgtaccctgt gtctcgctgc tacagtggag tgtatgtggg ctcttgcata
3541 ccactgcata gtggcagtta ggatggctgg ttacttagct atttaactgc ctttctacct
3601 gcctgcttcc cacctgcctt agtgatggca agtgagtgac tgctctggtg gcaatccaac
3661 ttttgtacaa ttattaaaaa aaaaaaaaaa aaa
//GenBank-Updates@genbank.bio.net (06/02/90)
LOCUS RATMRF4AA 1271 bp ss-mRNA ROD 02-JUN-1990
DEFINITION Rat muscle regulatory factor mRNA MRF4, complete cds.
ACCESSION M27151
KEYWORDS differentiation protein; muscle regulatory factor.
SOURCE Rat cDNA to mRNA, clone lambda-cRS4-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1271)
AUTHORS Rhodes,S.J. and Konieczny,S.F.
TITLE Identification of MRF4: A new member of the muscle regulatory
factor gene family
JOURNAL Genes Dev. 3, 2050-2061 (1989)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
S.F.Konieczny 16-AUG-1989, for release after publication.
FEATURES from to/span description
pept 45 773 muscle regulatory factor MRF4
BASE COUNT 347 a 292 c 303 g 329 t
ORIGIN
1 gtccatcacc cagttcagat cagtcagagg ccaaggagga gaacatgatg atggaccttt
61 ttgaaactgg ctcctatttc ttctacttag atggagaaaa tgtgactctt cagccattag
121 aagtggcaga gggctctcct ttgtacccag ggagtgatgg taccctatcc ccttgccagg
181 accaaatgcc ccaggaagcc gggagcgaca gcagtggaga ggaacacgtt ctggctcccc
241 caggccttca gccaccccac tgcccaggtc agtgtctgat ctgggcttgc aagacttgca
301 agagaaaatc tgcccccaca gatcgtcgga aagcagctac cctgcgcgaa aggaggaggc
361 ttaagaaaat caacgaagcc tttgaggcct tgaagcgtag aactgtggcc aaccccaacc
421 agaggctgcc caaggtggag attctgagaa gtgccatcaa ctacattgag cgtctgcagg
481 acctgctgca ccggctggat cagcaagaga aaatgcagga gctgggggtg gacccttaca
541 gctacaaacc caagcaagaa attcttgagg gtgcggattt cctgcgcacc tgcagcccgc
601 agtggccaag tgtttcggat cattccaggg gcctggtgat aactgctaag gaaggaggag
661 caagcgtcga tgcttcagcc tccagcagtc ttcagcgcct ttcttccatc gtggacagta
721 tttcctcaga ggaacgcaaa ctccccagcg tggaggaggt ggtggagaag taactcagtc
781 agcatttgga acattcttcg ctcagcagga agagcccctt tccgcctaat catttagatt
841 agggctcaca gaccccagaa tttatgaaag gcaagagact tagtgttaaa aaagaaacct
901 ctccccacct caagtgaaaa tccttcggct tggggctttt attataacta ttattgtatc
961 tgaaccgcta gtggcttagc tctagaaccc taattttgtt tttagtttgg ttggtttttt
1021 ataacatatt aacttttgct atgatcacgt gaccctttcc tgtcagttgc aaacgaagtt
1081 cattcctgtc taaatcaaag tgggaacgtt taatcctaag agtatttaat gtacttttgt
1141 aaatagtctt agtactttcg tttttatgta aacctaaagg acatatttta aatgtggaat
1201 taagcattgt atataaaatg tgtgagagtc tgatattgta atattaaaat attaatacgt
1261 ttctacacgt a
//GenBank-Updates@genbank.bio.net (06/05/90)
LOCUS HUMBREAKZ 680 bp ds-DNA PRI 05-JUN-1990
DEFINITION Human chromosome 10q24 breakpoint region.
ACCESSION M33603 M31943
KEYWORDS Chromosomal translocation.
SOURCE Human lung fibroblast DNA, clones G10-1 and G10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 680)
AUTHORS Zutter,M., Hockett,R.D., Roberts,C.W.M., McGuire,E.A.,
Bloomstone,J., Morton,C.C., Deaven,L.L., Crist,W.M.,
Carroll,A.J. and Korsmeyer,S.J.
TITLE The t(10;14)(q24;q11) of T-cell acute lymphoblastic leukemia
juxtaposes the delta T-cell receptor with TCL3, a conserved and
activated locus at 10q24
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3161-3165 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.M.Zutter, 07-FEB-1990.
FEATURES from to/span description
site 256 257 breakpoint
BASE COUNT 119 a 215 c 196 g 150 t
ORIGIN Chromosome 10q24.
1 ggatccacta gccgggtcag gcagagagaa ggaggccata gattcaggac caccttctgc
61 agctactctt ggcctgggaa tcttagagaa aggggtgcaa cgtgaacagt ttcttccgtc
121 ctggctgctg ttctaggacc caggaagcga gccccaggat cagacacaag aaaagaacag
181 gttcccccac ctccagtccc ctacacacat gcacttcgcg cccctagctc cggtgcccgt
241 ggtgctgcgg ggccagtggg ggctctgagc cccggtaaat cagcagaacc agtggccttt
301 tttttttttt ccggagagca ggcttcatct gggaggatgg cagcggggct tagccgccca
361 ggacagctag cttttcattt tcgttctctc ccggctgggt gtcctacccc gggcccagcc
421 ctgttatctt gggcacgaac aatgcaccgg taggctggtg atcggtggcg ggaagactac
481 tcccgggaca ttgggtacta agtggtggcc cgtgccgcct tctctgctgc cccgtcccgc
541 cttcctctgc tggccctggc tccctctgct ccgtttccac tctgggcact caactctccc
601 tctggcgtgg atcccccggg ctgcaggaat tcgatatcaa gcttatcgat accgtcgacc
661 tcgagggggg gcccggatcc
//
LOCUS HUMTCDXB 60 bp ds-DNA PRI 05-JUN-1990
DEFINITION Human T-cell receptor rearranged delta-chain gene V-region
(V2-N-J1).
ACCESSION M34439 M31943
KEYWORDS T-cell receptor; delta chain; rearranged gene; variable region.
SOURCE Human (T-cell lymphoblastic leukemia patient 1143) DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 60)
AUTHORS Zutter,M., Hockett,R.D., Roberts,C.W.M., McGuire,E.A.,
Bloomstone,J., Morton,C.C., Deaven,L.L., Crist,W.M.,
Carroll,A.J. and Korsmeyer,S.J.
TITLE The t(10;14)(q24;q11) of T-cell acute lymphoblastic leukemia
juxtaposes the delta T-cell receptor with TCL3, a conserved and
activated locus at 10q24
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3161-3165 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.M.Zutter, 07-FEB-1990.
FEATURES from to/span description
pept < 1 > 60 T-cell receptor delta-chain V2-N-J1-region (AA
at 3) /hgml_locus_uid="LP0165Q" /nomgen="TCRD"
/map="14q11.2"
recomb 29 30 V2-region end/N-region start
recomb 35 36 N-region end/J1-region start
BASE COUNT 19 a 15 c 11 g 15 t
ORIGIN
1 aagggtctta ctactgtgcc tgtgacacca aacgtacacc gataaactca tctttggaaa
//
LOCUS HUMTRANSX 50 bp ds-DNA PRI 05-JUN-1990
DEFINITION Human T-cell leukemia t(10:14)(q24:q11) chromosomal translocation.
ACCESSION M33602 M31943
KEYWORDS T-cell receptor; chromosomal translocation; oncogene.
SOURCE Human (T-cell lymphoblastic leukemia patient 1143) DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Zutter,M., Hockett,R.D., Roberts,C.W.M., McGuire,E.A.,
Bloomstone,J., Morton,C.C., Deaven,L.L., Crist,W.M.,
Carroll,A.J. and Korsmeyer,S.J.
TITLE The t(10;14)(q24;q11) of T-cell acute lymphoblastic leukemia
juxtaposes the delta T-cell receptor with TCL3, a conserved and
activated locus at 10q24
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3161-3165 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.M.Zutter, 07-FEB-1990.
FEATURES from to/span description
ORF < 1 > 50 TCL3 oncogene (put.)
recomb 20 21 10q24 DNA end/N DNA start
recomb 27 28 N DNA end/14q11 DNA start
BASE COUNT 12 a 10 c 19 g 9 t
ORIGIN
1 cggtggtgct gcggggccag ggagggaacc gataaactca tctttggaaa
//
LOCUS MUSTCPB 2369 bp ds-DNA ROD 05-JUN-1990
DEFINITION Mouse t complex protein-10 (Tcp-10b) gene, complete cds.
ACCESSION M22597 J03141
KEYWORDS t complex responder.
SOURCE Mouse (haplotype tw5) testes germ cell DNA, clone T66B-a.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 519)
AUTHORS Schimenti,J., Cebra-Thomas,J.A., Decker,C.L., Islam,S.D.,
Pilder,S.H. and Silver,L.M.
TITLE A candidate gene family for the mouse t complex responder (Tcr)
locus responsible for haploid effects on sperm function
JOURNAL Cell 55, 71-78 (1988)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2369)
AUTHORS Schimenti,J., Cebra-Thomas,J.A., Decker,C.L., Islam,S.D.,
Pilder,S.H. and Silver,L.M.
JOURNAL Unpublished (1989) Lewis Thomas Lab, Princeton, NJ 08544-1014
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by L.M.Silver, 07-JUN-1989.
The sequence is interrupted by introns that are not indicated in
[1] or [2].
FEATURES from to/span description
pept 421 1737 t complex protein-10
BASE COUNT 707 a 565 c 594 g 493 t 10 others
ORIGIN Chromosome 17.
1 tgcagaaaac catggggccc gggccatatg aaggtgcagt gggttatgaa taagctctgc
61 tataggtgcc tttgactctc agggtgaaat accttgtctc cagtcagatc ccctaccgct
121 gtggctaaac ctctcattct tgtttctata cagcctaggt tatggagttg taggttgggg
181 acaaggggtt tgtcctgatg agaggcagcg aaaccttcca cagcaaaggt ctgacgaaga
241 actaagacct cccaggcgcc ctctcaaggc cctgttcacc tcaaggtaat gctggcagta
301 ctcatgtgat caggcaccat gctgactctc tgaaccaaag tgttttccac cctcctctgg
361 ggacacagcc cttaatgctt actggtcacc tcgaggctaa atagtcagca gagacaagcc
421 atgttggagg atctgagtca aggaaaaggg tccaaccatg agaagaggaa gatggagagc
481 acagcccaga tcactgagga agacagcaag cttgatgagg ttgtggggct gcagaagcag
541 atctgtgacc ttgggacaga gctcacaaga caatcatctt ggtggtgcgt agctcacaaa
601 gacctccaaa gccagatcga tgctctgata aaggagaacc aggagatccg tgcggagctg
661 aagaccttga agaagcagga tgcggaggcc accaaagcct gtataggctc gcccaccccg
721 gcaagagcaa gcaacactct gccagtgtac ataaagatag agggaattga ttccgagaag
781 acaacctcat gggatgaaag agatgagctt tctggaagtc ctccaaacag aagcacaatg
841 gccaccggag gaacagactc ccaggatgaa aggctgtctt ttacatctgt ggatgaaaag
901 gttatacaca tgtcttccaa atttctgcaa agaagcttcg gcagaatgtc accagaacca
961 ctgtctgaca gcacattcct ggacacagag tcactggctg acatctggtc ctcaaatcca
1021 gagacttcgg acagtgaact tctcctgcat gctcaagcaa gcagggtcat tccttgtttt
1081 tccccaaatg cactgtgggt gcagaatatt ccaacaaagt caagagctcc taaagaaata
1141 cagcaaacct cagacactac aaagactgat gagacaaagg aaaagcgaca cccaaacggc
1201 aaggtggagc ggatgctcag tgacgggcga accatcatca ccttccccaa tggaaccagg
1261 aaggagatca gtgctgacaa gaagaccacc ctcatcaggt tttttaacgg tgacatgaag
1321 aagatcaagt ccgatcagaa agtgatttat tattatgcgg acacacaaac aatgcacaca
1381 acctacccag atggtgttga agtggtgcag tttcctaaca agtggactga aaaattctac
1441 ccggatggct ccaaggaaac cgtgtttcct gatgggacag tgaaacagct taaggatgga
1501 tgtgaagaga cggtgttccc cgatgggaca tttgtgacag tgaagaggaa cggagacaaa
1561 accatcatgt tcagcaacgg agagaaagaa atccacacgg ccaggttcaa gnggatggaa
1621 ttcccagatg gnaccaccaa gactgtgtat tgcaatggct gccaggagac caagtatgcc
1681 tcagggaggg tcagggtcaa agatgagaag ggaactgtca tcctggactg gaagtagtgc
1741 atccaacaga ctcaaggcca ggcagtaggc tccaatctcc aaatatagtt ttgaccaaaa
1801 cagacaaagc gactcacgat tcccttgaca atcctggaga tcccctgcac atgaggagta
1861 aagagacatc caaaaattgt aaagaaagag accatgtagc caccaccatc ttgggagaga
1921 ggtcagagaa gcagtagcca gaaatgaggt cctggacatg gcttccgtgg ccttgtncnn
1981 accatgcctt ctgtcggagt ctggccctag agcagagggt acaatgggag caacgtgacc
2041 actgccagct agttcctggg agcagccgtt aacaacagct ggcncngtgt ttttgcttgg
2101 ccaacacctg cctgcatcta cttctatccc cttctagagt cctcatcaat gggatccagg
2161 gggatgtatc ttcactggtg cctgatgtat gaatagactt gatttaacca tcctgaagtg
2221 tctagacaga acctcatgtg gcattctatc ccataaattc acacaatttt gatntgtcaa
2281 tcaacaagga gttgagnaaa agcacagata aaaacaaaga cagaaacata gtgngccaca
2341 cctcctacct gttcctgtga gtctgccct
//GenBank-Updates@genbank.bio.net (06/06/90)
LOCUS HUMARC1 1753 bp ds-DNA PRI 06-JUN-1990
DEFINITION Human androgen receptor gene, exon 1.
ACCESSION M27423
KEYWORDS androgen receptor; dihydrotestosterone receptor;
testosterone receptor.
SEGMENT 1 of 8
SOURCE Human (ATCC library XLAOXNLO1) DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1753)
AUTHORS Lubahn,D.B., Brown,T.R., Simental,J.A., Higgs,H.N., Migeon,C.J.,
Wilson,E.M. and French,F.S.
TITLE Sequence of the intron/exon junctions of the coding region of the
human androgen receptor gene and identification of a point mutation
in a family with complete androgen insensitivity
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9534-9538 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1)
AUTHORS Lubahn,D.B.
JOURNAL Unpublished (1990) lubahnd%uncvx1.bitnet
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence [1] kindly
provided by D.B. Lubahn, 29-AUG-1989, [2] 04-JUN-1990.
FEATURES from to/span description
pept 79 + 1691 androgen receptor, exon 1 /nomgen="AR"
/map="Xq12" /hgml_locus_uid="LL0058U"
IVS 1692 > 1753 AR intron A
revision 1 1 a in [2]; c in [1]
BASE COUNT 344 a 546 c 573 g 290 t
ORIGIN
1 agcctgttga actcttctga gcaagagaag gggaggcggg gtaagggaag taggtggaag
61 attcagccaa gctcaaggat ggaagtgcag ttagggctgg gaagggtcta ccctcggccg
121 ccgtccaaga cctaccgagg agctttccag aatctgttcc agagcgtgcg cgaagtgatc
181 cagaacccgg gccccaggca cccagaggcc gcgagcgcag cacctcccgg cgccagtttg
241 ctgctgctgc agcagcagca gcagcagcag cagcagcagc agcagcagca gcagcagcag
301 cagcagcagc aagagactag ccccaggcag cagcagcagc agcagggtga ggatggttct
361 ccccaagccc atcgtagagg ccccacaggc tacctggtcc tggatgagga acagcaacct
421 tcacagccgc agtcggccct ggagtgccac cccgagagag gttgcgtccc agagcctgga
481 gccgccgtgg ccgccagcaa ggggctgccg cagcagctgc cagcacctcc ggacgaggat
541 gactcagctg ccccatccac gttgtccctg ctgggcccca ctttccccgg cttaagcagc
601 tgctccgctg accttaaaga catcctgagc gaggccagca ccatgcaact ccttcagcaa
661 cagcagcagg aagcagtatc cgaaggcagc agcagcggga gagcgaggga ggcctcgggg
721 gctcccactt cctccaagga caattactta gggggcactt cgaccatttc tgacaacgcc
781 aaggagttgt gtaaggcagt gtcggtgtcc atgggcctgg gtgtggaggc gttggagcat
841 ctgagtccag gggaacagct tcggggggat tgcatgtacg ccccactttt gggagttcca
901 cccgctgtgc gtcccactcc ttgtgcccca ttggccgaat gcaaaggttc tctgctagac
961 gacagcgcag gcaagagcac tgaagatact gctgagtatt cccctttcaa gggaggttac
1021 accaaagggc tagaaggcga gagcctaggc tgctctggca gcgctgcagc agggagctcc
1081 gggacacttg aactgccgtc taccctgtct ctctacaagt ccggagcact ggacgaggca
1141 gctgcgtacc agagtcgcga ctactacaac tttccactgg ctctggccgg accgccgccc
1201 cctccgccgc ctccccatcc ccacgctcgc atcaagctgg agaacccgct ggactacggc
1261 agcgcctggg cggctgcggc ggcgcagtgc cgctatgggg acctggcgag cctgcatggc
1321 gcgggtgcag cgggacccgg ttctgggtca ccctcagccg ccgcttcctc atcctggcac
1381 actctcttca cagccgaaga aggccagttg tatggaccgt gtggtggtgg tgggggtggt
1441 ggcggcggcg gcggcggcgg cggcggcggc ggcggcggcg gcggcggcgg cggcgaggcg
1501 ggagctgtag ccccctacgg ctacactcgg ccccctcagg ggctggcggg ccaggaaagc
1561 gacttcaccg cacctgatgt gtggtaccct ggcggcatgg tgagcagagt gccctatccc
1621 agtcccactt gtgtcaaaag cgaaatgggc ccctggatgg atagctactc cggaccttac
1681 ggggacatgc ggtaagtttt tccttccaga aatgtcgcct ttcggcccag ggcagagtca
1741 ctctgtgttc tgg
//
LOCUS ONOMICACTA 2374 bp ds-DNA INV 06-JUN-1990
DEFINITION O.nova micronuclear MIC-ACT-1 gene, complete cds.
ACCESSION M25531
KEYWORDS actin.
SOURCE O.nova micronuclear DNA.
ORGANISM Oxytricha nova
Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora;
Spirotricha; Hypotrichida; Sporadotrichina; Oxytrichidae.
REFERENCE 1 (bases 1 to 2374)
AUTHORS Greslin,A.F., Prescott,D.M., Oka,Y., Loukin,S.H. and Chappell,J.C.
TITLE Reordering of nine exons is necessary to form a functional actin
gene in Oxytricha nova
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6264-6268 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by A.F.Greslin, 13-JUN-1989.
FEATURES from to/span description
mRNA 2028 2220 actin mRNA
1956 2016 actin mRNA
427 918 actin mRNA
947 1393 actin mRNA
1666 1703 actin mRNA
1405 1555 actin mRNA
1723 1836 actin mRNA
2245 2270 actin mRNA
1848 1939 actin mRNA
site 427 918 actin, exon 1
site 947 1393 actin, exon 2
site 1405 1555 actin, exon 3
site 1666 1703 actin, exon 4
site 1723 1836 actin, exon 5
site 1848 1939 actin, exon 6
site 1956 2016 actin, exon 7
site 2028 2220 actin, exon 8
site 2245 2270 actin, exon 9
BASE COUNT 745 a 531 c 450 g 648 t
ORIGIN
1 ttaagcagta aaaccataat agccagtact tactagtcac tgatgaaaat gatagtatgc
61 acaagaagcc agctaaaaag aagatatatg gattgaatat ccactatgca attggaagaa
121 tctcaccaca aggccagagg aatgcaaacc tttttggaag aacaatatcc gaagagtaag
181 agtccaagtc ctatttcaca gtaaaaattt aaatacctat gaaaagagtc ctcgaatttc
241 aaggatataa tctaatatga accacttata taccttaaaa attatataca aattatattt
301 ataaataata taaaaaaaaa ttacaatagt aagatgatgt taaaatttaa tgataaattg
361 ttctatcaag catatatttt ccatttcaaa aatgagcatt caatcttcct cttagctatt
421 aagacatcgg agtcgtcaag gctggtttct ccggcgaaga tgctcccaga gctgtgttcc
481 cttccatcat cggaagaccc aagaacgtca gcgctttgat cggagttgac tccgcttctg
541 agtacatcgg agacgaggcc caacaaaaga gaggagtcct caagatcttc taccccatcg
601 agcacggtat catcaaggat tgggaagaca tggagaagat ctggaaccac accttctacg
661 ttgaactcag agtccagccc gatgagcacc ccgtcctcct cactgaggct cccctcaacc
721 ccaagaccaa cagagaaaag atgacccaga tcatgttcga gaccttcaat gttcccgctc
781 tctacgtcgc catccaggcc gtcctctccc tctactccgc tggtagaacc accggtatcg
841 tttgcgatgc tggtgatggt gtcacccaca ccgtccccat ctatgagggt ttctccatcc
901 cccacgccgt ctcaagaatc tcattttatt actcatttaa tatggagtaa tccagcttgc
961 cggtagagac ttgaccactt tcatggccaa gctcctgact gagaagggct acgtgttcac
1021 ttcctccgct gagatggaaa tcgtcagaga catcaaggaa aagctttgct tcgtcgcttt
1081 ggactacgaa gctgccatga agcagtccta cgagagcacc accttcgaga agaactacga
1141 gcttcccgac ggtagagtca ttaccattgg taacgctaga ttcagatgcc ccgagtatct
1201 cttcaagccc ctcgagatga acggaaagga actcgactcc atccaatcac tcacctacaa
1261 ctccatccag gagtgcgatg tcgatgtcag aagagacctc taccaaaaca tcatcctctc
1321 cggaggtacc accatgtacg agggtattgg tgagagactc ctcaaggaaa ttgaggccag
1381 agctcccaag tccataatag aatatagcca agccccgaca gaagattcgc cgtctggaga
1441 ggaggttcca ccctcacttc tctgtccact ttcgccagca tgtggatcac caaggaagac
1501 tacgacgaga acggagcaag catcgtccac agaaagtgcc tctgagccca aaactctatc
1561 cctatattta tacagctgtt cagattacaa tataaagttt catcatttac tagctcattt
1621 tattgaaaca attaaatatc tctctcctat tggaaaataa atggttttct cccaagtcca
1681 tcaacgtgaa ggttattgcc agccccataa tagtatttgg gaagacaaaa ctctagaaag
1741 atattgttta atttatgaaa cgctaataca aggggtgcag gatgaggatg tccgtagcct
1801 ctgcttcctt ttaactctga tgactcgcct ttgggttgaa tattatagtt aggttgaatg
1861 agaacttagg atatgtgttc cctttgactg catctcacta atttattaag catttcccct
1921 attttatttt atttgactcc ttttaataaa attgttagct tgacgactcc ggaaccatta
1981 tcaacaacca cagtttgttt gtctgccatg tgtagtaagc ttataaaaca tattgtgcta
2041 attgattctg aggaaatata ataaatgaaa gatagaggat ttgcgggggc gaagtagtga
2101 gattgtgata ttaattatat tagagtgggt gggaaagaga agcaagttta atgctacttt
2161 aatagatata gcgccaaaga tacagccttt aataatatta aatctttcta ctacttacta
2221 cacattgaag ccataattcc cttttggttg aggccaggtt gaatgaatat ttttatcatc
2281 acaagcaaat tttaaattaa acttacacca aaatatataa attgataact taaagtctat
2341 aaatcagaat tgagcagtca tctgacttgt gatc
//
LOCUS ONOMICACTB 2301 bp ds-DNA INV 06-JUN-1990
DEFINITION O.nova micronuclear MIC-ACT-2 gene, complete cds.
ACCESSION M25530
KEYWORDS actin.
SOURCE O.nova micronuclear DNA.
ORGANISM Oxytricha nova
Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora;
Spirotricha; Hypotrichida; Sporadotrichina; Oxytrichidae.
REFERENCE 1 (bases 1 to 2301)
AUTHORS Greslin,A.F., Prescott,D.M., Oka,Y., Loukin,S.H. and Chappell,J.C.
TITLE Reordering of nine exons is necessary to form a functional actin
gene in Oxytricha nova
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6264-6268 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by A.F.Greslin, 13-JUN-1989.
FEATURES from to/span description
mRNA 2030 2225 actin mRNA
1959 2019 actin mRNA
429 920 actin mRNA
949 1395 actin mRNA
1669 1706 actin mRNA
1407 1558 actin mRNA
1726 1839 actin mRNA
2241 2266 actin mRNA
1851 1942 actin mRNA
site 429 920 actin, exon 1
site 949 1395 actin, exon 2
site 1407 1558 actin, exon 3
site 1669 1706 actin, exon 4
site 1726 1839 actin, exon 5
site 1851 1942 actin, exon 6
site 1959 2019 actin, exon 7
site 2030 2225 actin, exon 8
site 2241 2266 actin, exon 9
BASE COUNT 704 a 525 c 445 g 627 t
ORIGIN
1 ttaagcagta aaaccataat agccagtact tactagtcac tgatgaaaat gatagtatgc
61 acaagaagcc agctaaaaag aagatatctg gattgaatat ccactatgca attggaagaa
121 tctcaccaca aggccagagg aatgcaaacc tttttggaag aacaatatcc gaagagtaag
181 agtcccaagt cctatttcag agtaaaaatt taaataccta tgaaaagagt cctcgaattt
241 caaggatata atctaatatg aaccacttat ataccttaaa aattatatac aaattatatt
301 tataaataat atataataac tacattagta agatgatgtt aaaatttaat gataattgtt
361 ctatcaagca tatattttcc atttcaaaaa tgagcattca atcttcctct tagatatgaa
421 ggcataggag tcgtcaaggc tggtttctcc ggcgaagatg ctcccagagc tgtgttccct
481 tccatcatcg gaagacccaa gaacgtcagc gctttgatcg gagttgactc cgcttctgag
541 tacatcggag acgaggccca acaaaagaga ggagtcctca agatcttcta ccccatcgag
601 cacggtatca tcaaggattg ggaagacatg gagaagatct ggaaccacac cttctacgtt
661 gaactcagag tccagcccga tgagcacccc gtcctcctca ctgaggctcc cctcaacccc
721 aagaccaaca gagaaaagat gacccagatc atgttcgaga ccttcaatgt tcccgctctc
781 tacgtcgcca tccaggccgt cctctccctc tactccgctg gtagaaccac cggtatcgtt
841 tgcgatgctg gtgatggtgt cacccacacc gtccccatct atgagggttt ctccatcccc
901 cacgccgtct caagaatctc attttattac tcacttaata tggagtaatc cagcttgccg
961 gtagagactt gaccactttc atggccaagc tcctgactga gaagggctac gtgttcactt
1021 cctccgctga gatggaaatc gtcagagaca tcaaggaaaa gctttgcttc gtcgctttgg
1081 actacgaagc tgccatgaag cagtcctacg agagcaccac cttcgagaag aactacgagc
1141 ttcccgacgg tagagtcatc accattggta acgctagatt cagatgcccc gagtatctct
1201 tcaagcccct cgagatgaac ggaaaggaac tcgactccat ccaatcactc acctacaact
1261 ccatccagga gtgcgatgtc gatgtcagaa gagacctcta ccaaaacatc atcctctccg
1321 gaggtaccac catgtacgag ggtatcggtg agagactcct caaggaaatt gaggccagag
1381 ctcccaagtc cataatagaa tatagccagc cccgacagaa gattcgccgt ctggagagga
1441 ggttccaccc tcacttctct gtccactttc gccagcatgt ggatcaccaa ggaagactac
1501 gacgagaacg gagcaagcat cgtccacaga aagtgcctct gagcccaaaa ctctatccct
1561 atatttatac agctgttctt attacaatat aaagtttcat catttactag ctcattttat
1621 tgaaacaatt aaatatctct ctcctattgg aaaataaatg gttttctccc aagtccatca
1681 acgtgaaggt tattgccagc cccataatag tatttgggaa gacaaaactc tagaaagata
1741 ttgtttagtt tatgaaacgc taatacaagg ggtgcaggat gaggatgtcc gtagcctctg
1801 cttcctttta actctgatga ctcgcctttg ggttgaatat tatagttagg ttgaatgaga
1861 acttaggata tgtgttccct ttgactgcat ctcactaatt tattaagcat ttcccctatt
1921 ttattttatt tgactccttt taataaaatt gttagcttga cgactccgga accattatca
1981 acaaccacag tttgtttgtc tgccatgtgt agtaagctta taaacattat gtgcaaattg
2041 attctgagga aatagaataa tgaaagatag aggatttgcg ggggcgaagt agtgagattg
2101 tgatattaat tatattagag tgggtgggaa agagaagcaa gtttaatgct actttaatag
2161 atatagcgcc aaagatacaa cctttaataa tattaaatct ttctactact tactacacat
2221 tgaagccaat ttcatattca ttccctttgg gttgaggcca ggttgaatga ctattttttt
2281 catcactaag caaattttaa a
//
LOCUS BSUTRPS 1450 bp ds-DNA BCT 06-JUN-1990
DEFINITION B.subtilis trpS gene encoding tryptophanyl-tRNA synthetase,
complete cds.
ACCESSION M24068
KEYWORDS tryptophanyl-tRNA synthetase.
SOURCE B.subtilis (strain QB928) DNA, clone pTSQ2.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1450)
AUTHORS Chow,K.-C. and Wong,J.T.-F.
TITLE Cloning and nucleotide sequence of the structural gene coding for
Bacillus subtilis tryptophanyl-tRNA synthetase
JOURNAL Gene 73, 537-543 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by K.-C.Chow 05-MAY-1989.
FEATURES from to/span description
pept 171 1163 tryptophanyl-tRNA synthetase (EC 6.1.1.2)
BASE COUNT 433 a 306 c 331 g 380 t
ORIGIN
1 ctgaccgaac tggaaaaaag taggctcagc gagcagtctc cgttacaaac gtcagagtga
61 ttccatttta atggaataat cagggtggta ccacggttca ttcgtccctt ttttacaggg
121 gaagaatgag ccttttttat tatgttttaa gaaatgaggt tgatgttttc atgaaacaaa
181 cgattttttc aggcattcag ccaagcggct cagtgacgct cggcaactat atcggtgcaa
241 tgaagcagtt tgtcgaactg cagcatgatt ataacagcta tttttgcatc gtcgatcagc
301 atgcgataac tgttcctcaa gaccggcttg agcttagaaa gaatatccgc aatctcgcgg
361 cgctttactt agctgtcgga cttgatccag aaaaagcaac attgtttatt cagtcagagg
421 tccccgcaca tgcgcaggcc ggatggatga tgcagtgtgt cgcctatatc ggcgagcttg
481 agcggatgac tcaatttaag gacaaatcca aaggcaatga agctgtcgtc tccggcctgt
541 taacatatcc gccgctgatg gccgctgata ttctgctgta cggaacggat cttgtacctg
601 tcggcgagga tcaaaagcag caccttgagc tgacgcggaa tcttgcagaa cgcttcaaca
661 aaaaatacaa cgacatcttt acgattccgg aagtgaaaat tccaaaagtc ggtgcacgta
721 tcatgtctct gaatgatccg ctgaagaaaa tgagcaaatc tgatccgaat cagaaagctt
781 atattacatt gctggatgag ccgaagcagc ttgaaaagaa aatcaaaagc gcagtaacgg
841 attctgaagg cattgtcaaa tttgataagg aaaacaaacc gggcgtttcc aaccttctta
901 caatttattc aatcctcggc aatacgacaa ttgaagagct tgaagcaaag tacgaaggaa
961 aaggctacgg cgagtttaaa ggtgatttgg cagaagtcgt agtgaacgca ttaaaaccga
1021 tccaggaccg ctattacgag ctgatagaat ctgaagaatt agaccggatt cttgatgaag
1081 gcgcggaacg agcgaatcgg acagcaaaca aaatgctgaa aaaaatggag aatgccatgg
1141 gtcttggaag aaaaagacgc taatcaaaaa accgctcttt gcaaagagcg gtttttttca
1201 gttgaccttt gattcgtttt ccatttccca aagcttttcg aaaaaaggct ggcctgttat
1261 taggttttcg aaaaagctag gtgtttgtcc gaccagcctt caagtgtgtc gttatacagc
1321 atgtcccaaa tctcttcaaa gctcatcctt ttgatttgtt catacatatg cggattccac
1381 tctgtgtacc agtagccaat ttccgctcgt ttttcagccc ttttaattgg tcaagcgaca
1441 taaacatcag
//
LOCUS ERGTUBB 2475 bp ds-DNA PLN 06-JUN-1990
DEFINITION E.graminis beta-tubulin gene, complete cds.
ACCESSION X51326 M30927
KEYWORDS beta-tubulin.
SOURCE E.graminis hordei (race CR3) DNA.
ORGANISM Erysiphe graminis
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Erysiphales; Erysiphaceae.
REFERENCE 1 (bases 1 to 2475)
AUTHORS Sherwood,J.E. and Somerville,S.C.
TITLE Sequence of the Erysiphe graminis f. sp. hordei gene encoding
beta-tubulin
JOURNAL Nucleic Acids Res. 18, 1052-1052 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.C.Somerville, 20-DEC-1989.
FEATURES from to/span description
pept 518 529 beta-tubulin, exon 1
581 604 beta-tubulin, exon 2
653 677 beta-tubulin, exon 3
729 771 beta-tubulin, exon 4
833 887 beta-tubulin, exon 5
934 1724 beta-tubulin, exon 6
1772 2162 beta-tubulin, exon 7
IVS 530 580 beta-tubulin intron A
IVS 605 652 beta-tubulin intron B
IVS 678 728 beta-tubulin intron C
IVS 772 832 beta-tubulin intron D
IVS 888 933 beta-tubulin intron E
IVS 1725 1771 beta-tubulin intron F
BASE COUNT 664 a 569 c 537 g 703 t 2 others
ORIGIN 3 bp upstream of SmaI site.
1 cccggggcaa atcacactct gcctctctag cctcctnccc gaaggtcgtg ctgaaatttc
61 tggaaacagc gtaattgctg tatggtagct tagcccaact ttttttgtgc cgtccagggc
121 tctagggagt gctgttnagc tagtgagaat agaagtcttc cgagatattt gaaagcctac
181 catagctctg aaggcattgt ggcaggacct agaggatcgt aagagatatg actgacaagt
241 gttgagtcct tgcgtcctaa ttttgtgtat tattaccgtt gtgagacatc acggcgacgc
301 agaccgatct gcacttttca gtgccaagat ggtatgtaag cagtcccgtg atcggagcta
361 gcgcagcaac ggtttagtgt gagccaggtc caccgtcaac ccgcaattac tttctcgctg
421 cgccaaatct ccaatttcta catcccaact aacctccgga aacgcaccta cactataccc
481 cctatcatcc tctaccgcct atctttcatc agccaatatg cgtgaaattg ttagttaaat
541 tccatcacgg caactcttga aatcgctaat gataatttag gtccatttac agaccggtca
601 atgcgtacgt taatatttag tttgccttgt aactctacta atagagtttt agggaaacca
661 aatcggagcc gcattctgtt cgtagaatct caagcttcta gctcaactcg ctcacaccca
721 ctctacaggg caaacaattt ctggtgagca tggacttgat ggttcaggag tgtaagttcc
781 ctcgcataat taggtatccg cattcatcaa cgaattataa tgactccatt agctacaatg
841 gtacatcaga tttacaactt gagaggatga acgtatactt taacgaggtc tggtgaagct
901 ctcaaaagag aagttatcgc taaccctctt aaggcttctg gtaacaaata tgttcctcgt
961 gctgttctcg tcgacttgga gccaggtact atggatgctg tacgagctgg tccttttggc
1021 cagctattca gaccagacaa cttcgtcttc ggacaatctg gagccggaaa taattgggca
1081 aaaggtcact acactgaagg cgcggagctt gtcgaccaag tactagatgt agtgcgaaga
1141 gaagcagagg gatgtgactg tcttcagggg tttcaaataa cacattctct tgggggtggt
1201 acaggtgccg gtatgggtac gttattaatt tcaaaaatcc gggaagaatt ccctgatcga
1261 atgatggcaa ccttctcagt tgtgccgtcc cctaaggtgt ctgacactgt tgttgagcca
1321 tacaacgcaa ctctttcagt ccatcagttg gtcgaaaact ccgacgagac tttctgtatc
1381 gacaatgagg cactttacga catctgtatg aggacgttaa agctttctaa cccctcatat
1441 ggtgatctaa atcacttggt atccgctgta atgtcaggcg taactacttg ccttcgattc
1501 cctggccagc taaactctga tttgcgcaaa ctggcagtta acatggttcc tttcccacgt
1561 cttcactttt ttatggttgg atttgcaccg cttacaagcc gtggagcgca ctctttccgt
1621 gccgtaactg ttcctgagtt aactcagcag atgtttgatc cgaaaaatat gatggcagca
1681 tccgacttcc gaaatggtcg ttacttgaca tgctctgcca ttttgtaagt cagctcgtta
1741 tatacgcata ttctatacta acatattaca gccgcggtaa ggtatctatg aaggaagtag
1801 aagatcagat gcgaaatgtc caacaaaaga acgtatccta ctttgttgag tggattccaa
1861 ataatgttca aactgcccta tgttctatcc cgcctcgtgg cctaaaaatg tcttcgacat
1921 tcgtcggaaa ctcgacttcc atccaagagc tcttcaaacg tgtcggagat caattcacgg
1981 ccatgttccg gagaaaggca tttctgcatt ggtacactgg cgaaggaatg gacgaaatgg
2041 agtttactga ggccgagtct aatatgaacg atcttgtctc agagtatcaa caataccaag
2101 aagcttcaat ctcggagggt gaggaagagt atccagagga agtaagcaac gaagaagaat
2161 agatcaaaat tgtcctttta acacattgtt ctgtactgta tctgaagtag cggatacgat
2221 gaactgtacc ccattaatcg taccgttcct cttgtttgga aaattaaaag tcaccaacca
2281 aaatgcgata gaaagtgccg gtgtactggc ctagcacatc cttgtctggg tggatgtcga
2341 acactttatt tgctccatta gctagtctct ttccgcgcga caggatgaaa actttctaat
2401 catcaatcac atttcttttc tacgaaagtt tcaattctgt tactgtagag ttttttttgt
2461 aaaaccatct ctaga
//GenBank-Updates@genbank.bio.net (06/07/90)
LOCUS DROIAB 2131 bp ds-DNA INV 07-JUN-1990
DEFINITION D.melanogaster infra-abdominal (iab-4) 1.7 and 2.0 kb RNAs,
complete cds.
ACCESSION M32109
KEYWORDS .
SOURCE D.melanogaster (strain Canton S) embryo DNA, and cDNA to mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2131)
AUTHORS Cumberledge,S., Zaratzian,A. and Sakonju,S.
TITLE Characterization of two RNAs transcribed from the cis-regulatory
region of the abd-A domain within the Drosophila bithorax complex
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3259-3263 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.Sakonju, 14-FEB-1990.
FEATURES from to/span description
RNA 141 1802 1.7 kb RNA
RNA 141 2101 2.0 kb RNA
BASE COUNT 592 a 468 c 478 g 593 t
ORIGIN Chromosome III, bithorax complex.
1 ttagttcgaa gcacgtgcgt gtgagttccg catgaacggt acaccaacac tttcgaacag
61 aaacccctca cagatactcc tcacacagat actctactgt actgggctac gatatctgtg
121 agatacgcac agataccggc aggccgccgg cgtcaaagtc gacgtgctgc cggcgtttga
181 gagaccgagc ggcgtgcaag aaatcgtcgg gcaacggctt caaatgaacg caaaacgttg
241 gaaagcaaac aaccggcaaa agccatcagc agctaaaaat aaaactgcaa ctatcaaagt
301 gaatctgtag agaaagagtg atagctcagt gtgcccagtg tttatttaat tacttattta
361 aatttgattt tcaaaataag ttgtgatttt ggttcgtttg tttgtttccc cattaaatcg
421 catcgcggaa ctggtttgca ttttgtatct tttatcgctg cagttcgttt ctgtgtggtt
481 tcatgtattc cccatctatt ccatctcttt ttgcagatac aagtgaatga aaccctaacg
541 tttaaacgcc accataaaca ccaaataaca aattttctgg caaacatggc atgtcacggt
601 cccatttaac gtggaaaatc ggcccagaaa ttcaacacga cgacaacttt ttcgactgtc
661 ccagaaagca aatatgtaaa gtgtatttgt gtgcgtgcca ttgacattga cttatggtca
721 tgaaggggtt agggggacgg gtctaagggt gggggttgat ggctaaaggg gggtctcgcc
781 gctacgacaa aaacttggat tctctcgttt cttttttgtc tttcgtcttc aacgcacatt
841 tatggtttgt tttgctcaca cacacactca aaaagcctac gcagcggcag agagggcaca
901 agcagcagag aaagagacga aacatgtctc ggatgatgat aaacgcctct cgtctatttt
961 ggcgggtcca tgaatgaaac aattgcctcg aatgactgac tgccagacag atagctagaa
1021 actgccctta cggcgggcaa attggggtat tttgggtttc ggtgactttg agggaccggg
1081 acctgagggc ccggaaagcc gacagcaata aacactgtgc tcagacaaaa gtctgatgag
1141 acagaccatc atgatgctca tcatcatcgg cgataataat gattttcatt tgcacattcc
1201 acccgtttcg ctgttgccgg cttccagcat tcagattcag tttgcattca gtggggcagg
1261 ttatttatgg cacttttcct atcgccactc actggggtaa tcgattttta ttggattaat
1321 attgctttcc gcatgatttg gccttctttt cgttatcact tagcaggttt cttgacgctg
1381 aggagtggac aatgactaaa actcagtcat ctggaaaagc tggaccaagg acatccatat
1441 acaagtgctg gctaaacgtc tgtggttatc gcaatcattt atggcggaat agaaggacac
1501 gtcttatgtg tcgattttcg ggccccatcc ctgttgaaaa atcagccgga cagcagcaac
1561 caccgaacac aagacacagg acgaccggaa gtgaggaaaa cgcttgtaaa atcggtgcac
1621 acgagtgcga gcgagacacg agcgaacggg ccgaaaggaa acgacaatta cgaggacacg
1681 ccggcttgct tttctgcttt tcctgtctgc tcctttcagt tgcggccgtg caatcgtcgg
1741 gtcaacaacg ccgcgtgtcc gatttcaatt tacaaattag aatatacatc tttacatctg
1801 gatttggttt catctttatc tgcatttgga tccgcgacga cgcataattt tcagcacttt
1861 tgaataatgg cgattctgtg cgacctcgta aacgtatact gaatgtatcc tgagtgtatc
1921 ctatccggta taccttcagt atacgtaaca cgagagagca ctagcaacgt gggatgtgaa
1981 acacttttta agacacctta atacacgtag cactttgtta tatggtctta cactttttaa
2041 atatctcgac ttataaatat ataatttaaa tataaatact ttaataatgc gatataagtt
2101 aaatacttcg atttactggc tggaagttca t
//
LOCUS ECOMUT1X 1545 bp ds-DNA BCT 07-JUN-1990
DEFINITION E.coli mutT1 mutant DNA with an insertion sequence IS1.
ACCESSION M29970
KEYWORDS insertion sequence IS1.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 300 to 315; 1084 to 1105)
AUTHORS Bhatnagar,S.K., Bullions,L.C., Lew,G. and Bessman,M.J.
TITLE Characterization of the defect in the Escherichia coli mutT1
mutator gene
JOURNAL J. Bacteriol. 172, 2802-2803 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1545)
AUTHORS Bhatnagar,S.K., Bullions,L.C., Lew,G. and Bessman,M.J.
JOURNAL Unpublished (1989) Johns Hopkins, Baltimore, MD 21218
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.J.Bessman, 19-NOV-1989.
FEATURES from to/span description
trns 316 1083 IS1
site 308 315 5' insertion target sequence
site 1084 1091 3' insertion target sequence
BASE COUNT 379 a 366 c 423 g 377 t
ORIGIN
1 gatccttgcc cgtgcggttc tggtaaaaaa tacaagcagt gccatggccg cctgcaataa
61 aagctaactg ttgaagtaaa aggcgcagga ttctgcgcct tttttatagg tttaagacaa
121 tgaaaaagct gcaaattgcg gtaggtatta ttcgcaacga gaacaatgaa atctttataa
181 cgcgtcgcgc agcagatgcg cacatggcga ataaactgga gtttcccggc ggtaaaattg
241 aaatgggtga aacgccggaa caggcggtgg tgcgtgaact tcaggaagaa gtcgggatta
301 ccccccaaca tttttggtga tgctgccaac ttactgattt agtgtatgat ggtgtttttg
361 aggtgctcca gtggcttctg tttctatcag ctgtccctcc tgttcagcta ctgacggggt
421 ggtgcgtaac ggcaaaagca ccgccggaca tcagcgctat ctctgctctc actgccgtaa
481 aacatggcaa ctgcagttca cttacaccgc ttctcaaccc ggtacgcacc agaaaatcat
541 tgatatggcc atgaatggcg ttggatgccg ggcaacagcc cgcattatgg gcgttggcct
601 caacacgatt ttacgtcact taaaaaactc aggccgcagt cggtaacctc gcgcatacag
661 ccgggcagtg acgtcatcgt ctgcgcggaa atggacgaac agtggggcta tgtcggggct
721 aaatcgcgcc agcgctggct gttttacgcg tatgacagtc tccggaagac ggttgttgcg
781 cacgtattcg gtgaacgcac tatggcgacg ctggggcgtc ttatgagcct gctgtcaccc
841 tttgacgtgg tgatatggat gacggatggc tggccgctgt atgaatcccg cctgaaggga
901 aagctgcacg taatcagcaa gcgatatacg cagcgaattg agcggcataa cctgaatctg
961 aggcagcacc tggcacggct gggacggaag tcgctgtcgt tctcaaaatc ggtggagctg
1021 catgacaaag tcatcgggca ttatctgaac ataaaacact atcaataagt tggagtcatt
1081 accacatttt tcgctatttg aaaaactgga atatgaattc ccggacaggc atataacact
1141 gtggttttgg ctggtcgaac gctgggaagg ggagccgtgg ggtaaagaag ggcaacccgg
1201 tgagtggatg tcgctggtcg gtcttaatgc cgatgatttt ccgccagcca atgaaccggt
1261 aattgcgaag cttaaacgtc tgtaggtcag ataaggcgtt ttcgccgcat ccgacattcg
1321 cacacgatgc ctgatgcgac gctggcgcgt cttatcaggc ctaaagggat ttctaactca
1381 ttgataaatt tgtttttgta ggtcggataa ggcgttcacg ccgcatccga catttgcaca
1441 agatgcctga tgcgacgctg tccgcgtctt atcaggccta cgtgcggcat cagacaaatg
1501 tcactgcttt ggttcttcgc tccagtcatc gctttcggaa agatc
//
LOCUS YSAEF1B 2411 bp ds-DNA PLN 07-JUN-1990
DEFINITION C.albicans elongation factor 1-alpha (TEF-1) gene, complete cds.
ACCESSION M29934
KEYWORDS elongation factor.
SOURCE C.albicans (strain SC5314) DNA.
ORGANISM Candida albicans
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina.
REFERENCE 1 (bases 1 to 2411)
AUTHORS Sundstrom,P., Smith,D. and Sypherd,P.S.
TITLE Sequence analysis and expression of the two genes for elongation
factor 1-alpha from the dimorphic yeast Candida albicans
JOURNAL J. Bacteriol. 172, 2036-2045 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Sundstrom, 17-NOV-1989.
FEATURES from to/span description
pept 624 2000 elongation factor 1-alpha
variant 1487 1487 g in EcoRI 2.3 kb clone; a in HindIII 4.9 kb
clone
BASE COUNT 760 a 402 c 477 g 772 t
ORIGIN
1 acggtaaatt agtgggaggc aaatctccca aatcaattaa acggaaatta ctcgaagctg
61 caaatctgtt tgctgatgga cagaatgatc atagtattaa tccagatgtt acaattgaag
121 aggatctgat gtctcaaata aggtatcaag acaacgaatc aatgatcaac caagaaagat
181 tattagagaa attggatgga tcagctgtgc ttgtggaaac actgatacga catgccagta
241 aacgaagtag aaaaaactaa ctgtatattc gctaacaaaa aaattgtatt atttttagac
301 atcgtaaatg gggctgtagc cctaattatt ttttatttct cgtgatctgc acgtgtactg
361 ttgatttttt tgtgcgattg aaattatatt gtgctgttat cttaataaaa tcagtcatac
421 cttttttttt ttgggttttt gttgtttaat ttgtgataaa catcccatga ggaacagcga
481 gaaagttttt tgctgttcac tttctcgttc aacttttgca aagtaaagta aagaaaaaaa
541 aattttccct cgtcatcttg atttttactt cttcttcttc ttcttctttt tcctttcttc
601 acacattata gtcataatca atcatgggta aagaaaaaac tcacgttaac gttgttgtta
661 ttggtcacgt cgattccggt aaatctacta ccaccggtca cttaatttac aagtgtggtg
721 gtatcgataa aagaaccatt gaaaaattcg aaaaagaagc tgctgaattg ggtaaaggtt
781 ctttcaaata cgcttgggtc ttggacaaat tgaaggctga aagagaaaga ggtatcacca
841 ttgatattgc tttgtggaaa ttcgaaactc caaaatacca cgttaccgtc attgatgctc
901 caggtcacag agatttcatc aagaatatga tcactggtac ttctcaagct gattgtgcta
961 ttttgattat tgctggtggt actggtgaat tcgaagccgg tatttctaag gatggtcaaa
1021 ccagagaaca cgctttgttg gcttacactt tgggtgtcaa acaattgatt gttgctgtca
1081 acaagatgga ctctgtcaaa tgggacaaaa acagatttga agaaatcatc aaggaaacct
1141 ccaacttcgt caagaaggtt ggttacaacc caaagactgt tccattcgtt ccaatctctg
1201 gttggaatgg tgacaacatg attgaaccat ccaccaactg tccatggtac aagggttggg
1261 aaaaggaaac caaatccggt aaagttactg gtaagacctt gttagaagct attgacgcta
1321 ttgaaccacc aaccagacca accgacaaac cattgagatt gccattgcaa gatgtttaca
1381 agattggtgg tattggtact gtgccagtcg gtagagttga aactggtatc atcaaagccg
1441 gtatggttgt tactttcgcc ccagctggtg ttaccactga agtcaagtcc gttgaaatgc
1501 atcacgaaca attggctgaa ggtgttccag gtgacaatgt tggtttcaac gttaagaacg
1561 tttccgttaa agaaattaga agaggtaacg tttgtggtga ctccaagaac gatccaccaa
1621 agggttgtga ctctttcaat gcccaagtca ttgttttgaa ccatccaggt caaatctctg
1681 ctggttactc tccagtcttg gattgtcaca ctgcccacat tgcttgtaaa ttcgacactt
1741 tggttgaaaa gattgacaga agaactggta agaaattgga agaaaatcca aaattcgtca
1801 aatccggtga tgctgctatc gtcaagatgg tcccaaccaa accaatgtgt gttgaagctt
1861 tcactgacta cccaccatta ggtagattcg ctgtcagaga tatgagacaa accgttgctg
1921 ttggtgtcat caaatctgtt gaaaaatccg acaaagctgg taaagttacc aaggctgctc
1981 aaaaagctgc taagaaataa gctagttgaa tattatgtaa gatctgttag agtttttatt
2041 ttgtattcat ttatttagtt tattttctta ttatatttga aatataatat atattttaaa
2101 aaaaaaattt acagtgtaga atttttggta gtccggtttg ttttgaaatc agtggtggta
2161 ttcaatattt gattaaattt tggtatgaat ttgtgttgaa aaataaaaaa taagcgagaa
2221 atttgcgtgg catattattt gtaatgttcg aatattctct cgtacaccaa aaggtttacg
2281 attattcatt gacatataat aaaatctaat tactaaagag ttttgctgga aatgcatgta
2341 tagtttatat catttctatt atctggacca tcatggaaga atagcaatta tttcttgtta
2401 tcggtagtaa a
//
LOCUS YSAEF2A 2369 bp ds-DNA PLN 07-JUN-1990
DEFINITION C.albicans elongation factor 1-alpha (TEF-2) gene, complete cds.
ACCESSION M29935
KEYWORDS elongation factor.
SOURCE C.albicans (strain SC5314) DNA.
ORGANISM Candida albicans
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina.
REFERENCE 1 (bases 1 to 2369)
AUTHORS Sundstrom,P., Smith,D. and Sypherd,P.S.
TITLE Sequence analysis and expression of the two genes for elongation
factor 1-alpha from the dimorphic yeast Candida albicans
JOURNAL J. Bacteriol. 172, 2036-2045 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Sundstrom, 17-NOV-1989.
FEATURES from to/span description
pept 656 2032 elongation factor 1-alpha
variant 2188 2190 gta in one strand of clone 2.6 EcoRI; gtata in
opposite strand.
BASE COUNT 750 a 428 c 430 g 761 t
ORIGIN
1 gtatcaagaa aaaattgtat agtgcttgct gttcgatatt gctagaggca aagaaaaagg
61 attacctcac gcctaaaatg gaatttgaaa attgaggaga tgagttgagc ttttgtcttt
121 tacactttcc ctattcaatt gtgccgcaaa aacttcaata tctaacacag tgtgaaatat
181 ttgttatgta taccgaaagc caaataccat ttaaactggt atttcaagat ttcataaatc
241 aagtttctgg ttaaattaat tgctgttaaa cagaacaaga gcttcacatt gatgttggaa
301 taactactat agaattataa aatacacaaa accaggtcgt gtacgcaata acttttcaat
361 gtatgaaaaa attttttttt tttttgcttc ttctcttcgc ttcttttttc tagttatcaa
421 tttggcaatg aattgatttg attgcccttt gtttagttta ctatgaaaaa aaattttttt
481 tttgtctctg tcagattttc tctcattttt ttttttgcct cataatgaac aactcaaaaa
541 agaatataaa tttctcatca aagacctttc caattgaaaa ttttttcttt ccttcttttc
601 cttttttttt ttctttcttt cttcttttct aagtatacat tttatataat caatcatggg
661 taaagaaaaa actcacgtta acgttgttgt tattggtcac gtcgattccg gtaaatccac
721 taccaccggt cacttaattt acaagtgtgg tggtattgat aaaagaacca ttgaaaagtt
781 cgaaaaagaa gctgctgaat tgggtaaagg ttctttcaaa tacgcttggg tcttggacaa
841 attgaaggct gaaagagaaa gaggtatcac cattgatatc gctttgtgga aattcgaaac
901 tccaaaatac cacgttaccg tcattgatgc tccaggtcac agagatttca tcaagaatat
961 gatcactggt acttctcaag ctgattgtgc tattttgatt attgctggtg gtactggtga
1021 attcgaagcc ggtatttcta aggatggtca aaccagagaa cacgctttgt tggcttacac
1081 tttgggtgtc aaacaattga ttgttgctgt caacaagatg gactctgtca aatgggacaa
1141 aaacagattt gaagaaatca tcaaggaaac ctccaacttc gtcaagaagg ttggttacaa
1201 cccaaagact gttccattcg ttccaatctc tggttggaat ggtgacaaca tgattgaacc
1261 atccaccaac tgtccatggt acaagggttg ggaaaaggaa accaaatccg gtaaagttac
1321 tggtaagacc ttgttagaag ctattgacgc tattgaacca ccaaccagac caaccgacaa
1381 accattgaga ttgccattgc aagatgttta caagatcggt ggtattggta ctgtgccagt
1441 cggtagagtt gaaactggta tcatcaaagc cggtatggtt gttactttcg ccccagctgg
1501 tgttaccact gaagtcaaat ccgttgaaat gcatcacgaa caattggctg aaggtgttcc
1561 aggtgacaat gttggtttca acgttaagaa cgtttccgtt aaagaaatta gaagaggtaa
1621 cgtttgtggt gactccaaga acgatccacc aaagggttgt gactctttca atgcccaagt
1681 cattgttttg aaccatccag gtcaaatctc tgctggttac tctccagtct tggattgtca
1741 cactgcccac attgcttgta aattcgacac tttggttgaa aagattgaca gaagaactgg
1801 taagaaattg gaagaaaatc caaaattcgt caaatccggt gatgctgcta tcgtcaagat
1861 ggtcccaacc aaaccaatgt gtgttgaagc tttcactgac tacccaccat taggtagatt
1921 cgctgtcaga gatatgagac aaaccgttgc tgttggtgtc atcaaatctg ttgaaaaatc
1981 cgacaaagct ggtaaagtta ccaaggctgc tcaaaaagct gctaagaaat aaactaatta
2041 aatacctttg tttaaatagt tgtgattata ttattttttt aaaatatata taaataagag
2101 aataatcata caacatgtag taaggagcga agagtcccag ctctgtaaat aaaaaaaaga
2161 agaggagtta tgtgcttctt tgtttatgta tatatatata tatatatata tatattctta
2221 ccagttagtt aatcttacac atatgttgat ggtgctatat tgtacataca aactctaaat
2281 catatcagaa ctgcttggaa acctttccca ctaacaacac ccatccaatc cccttggttc
2341 tgtctccaac acacgttatt gatttccaa
//GenBank-Updates@genbank.bio.net (06/08/90)
LOCUS BSUHEMAC 3795 bp ds-DNA BCT 08-JUN-1990
DEFINITION B.subtilis 5-aminolevulinate synthase (hemA) and porphobilinogen
deaminase (hemC) genes, complete cds.
ACCESSION M32130
KEYWORDS 5-aminolevulinate synthase; porphobilinogen deaminase.
SOURCE B.subtilis (strain 168) DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 3795)
AUTHORS Petricek,M., Rutberg,L., Schroeder,I. and Hederstedt,L.
TITLE Cloning and characterization of the hemA region of the Bacillus
subtilis chromosome
JOURNAL J. Bacteriol. 172, 2250-2258 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
I.C.Schroeder, 16-FEB-1990.
FEATURES from to/span description
pept 232 1599 5-aminolevulinate synthase (hemA, EC 2.3.1.37)
pept 1607 2437 unidentified protein (ORF2)
pept 2470 3414 porphobilinogen deaminase (hemC, EC 4.3.1.8)
pept 3404 > 3795 unidentified protein (ORF4)
BASE COUNT 1068 a 689 c 953 g 1085 t
ORIGIN Map position 245 degrees.
1 atgcatatca ccttcttgtt ttttagagct gatgtgtagt aaatttctgc tgtttttggt
61 attgtcaata ggaatgcttc ttttccctga agctttttct aatatagcat aagaatttta
121 aaatctgttc acattttgtg aaagaaacta tgttataatt attataaata atgagttcta
181 tgttagaatg attataaatt aagattgggt gttgggggtg taattagagc gatgcatata
241 cttgttgtgg gagtagatta taaatccgcc cctattgaga tacgtgaaaa agtaagtttt
301 cagccgaatg agctggcaga agcaatggtg cagctgaaag aagagaaaag cattcttgaa
361 aacatcattg tctcaacctg caaccgcact gaaatttatg cggtagtcga ccagcttcat
421 accggccgtt attatataaa aaagttttta gctgattggt ttcaattaag caaagaagag
481 ctgtcaccgt tcttaacgtt ttatgagagc gatgccgctg ttgagcattt attccgtgta
541 gcctgcggac ttgattctat ggtgattggc gaaacgcaga ttctcggaca ggtacgcgac
601 agctttaaaa cagctcagca agaaaaaacg atcgggacta tttttaatga gctgtttaag
661 caggcagtta cagtgggcaa acggactcac gccgaaacag acattggctc aaatgcggtg
721 tcagtaagct atgctgcagt tgaacttgcc aaaaaaatct tcggaaatct ttcaagcaag
781 cacatattga ttctcggtgc gggaaaaatg ggcgagcttg ctgcggaaaa cctgcacgga
841 cagggaatcg gcaaggtcac tgtcattaac cgaacatact tgaaagcgaa ggagcttgca
901 gaccgttttt caggtgaagc gagaagcttg aatcagcttg aaagcgcgct tgcggaggct
961 gatattttaa tcagttcaac cggtgcaagt gaatttgtcg tgtccaaaga gatgatggaa
1021 aacgcgaata agcttcgcaa gggacgtccg ctgtttatgg tcgacattgc cgtgcctaga
1081 gatcttgatc cggcgctgaa tgatcttgaa ggtgtttttc tttatgatat cgacgatctg
1141 gaaggcattg tagaagcgaa catgaaagag cggagagaaa cagctgaaaa agttgaactg
1201 ttaattgaag aaaccattgt ggaatttaaa caatggatga atacacttgg tgttgtgcct
1261 gttatttctg cattgcgcga aaaggcgctt gccatccagt cagaaacgat ggacagcatt
1321 gagcgtaagc tgcctcactt aagcacaaga gagaaaaaac tgttgaacaa acacaccaaa
1381 agtattatta accaaatgct tcgtgatccg attttaaagg tgaaagagct tgcggcagat
1441 gctgattctg aagaaaagct cgcgttgttt atgcagattt ttgatattga agaagctgcg
1501 ggccgtcaaa tgatgaaaac cgttgaaagc agccagaagg tccactcttt taagaaggct
1561 gaatcaaaag cgggctttag cccacttgta agtgagtgaa agctgaatga ttgatactgc
1621 aatggcaaga cttaatgagg ggacaatcgt catttacgcg ttaagtgtac tcttttattt
1681 tatagatttt cttcaacaca accggaaggc tggaaaaatg gccttctggt tgctttctat
1741 tgtctggact ctgcaaaccg tgtatttggc ctattttatg tgggtgacgg ggcggtttcc
1801 ggtattaaat gtgacagagg cactttattt ttatgcctgg gtgcttgtca cgctgtcact
1861 tgtactgaca aagcttttac gtgttgactt tatcgtgttt tttacaaatg ttataggatt
1921 ttctatgatc gccattcaca cattttcacc gacagagcag cagtcagctg ctttttccgg
1981 gcagcttgta tccgagcttt tggtgattca tattacaatg gcgattcttt catacggcgc
2041 tttttccctt tctttcgttt tttctgtgct atatatgttt caatatcatg tgctgaaaaa
2101 gaaaaagtgg ggaaaatggc tgttgagaat agaagattta tctaagcttg attatatggc
2161 gtatgtttta aatgtcattg gggttccgat gctgctgctg agtttgattc tcggcgtcat
2221 ttgggcgtat gtctcactag aaacgctgta ttggtttgac gccaaagtgc ttggttcgtt
2281 tgtcgtcctg ctgctgtaca gctattatct ttatatcagg ctgattaagg agctgcaagg
2341 aaaggtcgct gcactgtgga atacggcttg ttttctggtg ctgatgatca attatttcct
2401 gcttggaagc ctgtcgcaat tccattggtt cagttaaacg atgtcccaag cagattcggg
2461 aggaaagaaa tgatgagaac gattaaagta ggttccagac ggagcaaact cgctatgact
2521 caaacaaaat gggttattca aaaactgaag gaaatcaatc cttcgtttgc ttttgaaatt
2581 aaagagatcg tgacaaaggg cgaccggatt gtcgatgtta cactctcaaa agtgggtgga
2641 aaagggcttt ttgtcaaaga aattgaacag gcgcttttaa acgaagagat tgatatggca
2701 gtgcacagca tgaaggacat gcctgctgtt ttgcctgaag gccttgtgat cggctgtatt
2761 cctgaacggg aggacccgcg tgatgccctt atttcaaaga atcgcgtaaa gctttcagaa
2821 atgaagaaag gtgctgtcat tggcacaagc agtttaagaa gaagcgcgca gcttttgatt
2881 gagcgccctg accttacaat taaatggatt agaggtaata ttgatacaag acttcaaaag
2941 ctggaaacag aggattatga cgcaattatt ttagcggctg ccggcctttc cagaatgggt
3001 tggaagcaag atgtcgtaac cgaattcctt gagcctgagc gctgtttgcc tgctgtgggg
3061 cagggagccc tggcgattga gtgccgagaa tcggatgaag agctgttggc gttgttttct
3121 cagtttacag atgaatatac aaaacggact gtcttagcgg aacgtgcttt tttaaacgcg
3181 atggagggcg gctgccaggt tccgatcgcg ggctactccg tgttaaatgg acaggatgaa
3241 attgaaatga caggtcttgt cgcttcacct gacggcaaaa tcatttttaa agaaaccgtc
3301 accggaaacg atccggagga agtaggaaag cgctgtgccg ctcttatggc tgacaaagga
3361 gcaaaagatt taattgatcg tgtaaaacgg gagcttgacg aggatggaaa atgattttcc
3421 gttgaaagga aaaacagtgc ttgtcacccg gaataaggca caggcagcat catttcagca
3481 aaaagtggag gcgcttggcg gtaaagcggt tttaacctct ttgattacgt ttcgccgcgc
3541 tttgccgaat gatgttgcgg aacaggtaag agaggatctt gccgcgccag gctggcttgt
3601 ttttacaagt gtgaacgggg cagacttctt tttttcttat ctgaaggaaa atcagcttat
3661 tctccctgcg cataaaaaaa ttgcagccgt cggtgaaaaa accgcgcgcc gtttaaaaat
3721 gcataacgta tcggttgatg tgatgccaca ggagtatatt gctgaacaat tgcgtgacgc
3781 tcttaagcag catgc
//
LOCUS SHFIPAH 2900 bp ds-DNA BCT 08-JUN-1990
DEFINITION S.flexner invasion plasmid antigen H (ipaH) gene, complete cds.
ACCESSION M32063
KEYWORDS invasion plasmid antigen.
SOURCE S.flexner (strain M90T-W; serotype 5) DNA, clone pWR390.
ORGANISM Shigella flexneri
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2900)
AUTHORS Hartman,A.B., Venkatesan,M., Oaks,E.V. and Buysse,J.M.
TITLE Sequence and molecular characterization of a multicopy invasion
plasmid antigen gene, ipaH, of Shigella flexneri
JOURNAL J. Bacteriol. 172, 1905-1915 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A.B.Hartman, 12-FEB-1990.
FEATURES from to/span description
pept 251 1849 invasion plasmid antigen H
pept 2277 > 2900 ORF3
signal 139 144 -35 region
signal 162 167 -10 region
signal 2111 2116 -35 region
signal 2132 2137 -10 region
BASE COUNT 868 a 657 c 578 g 797 t
ORIGIN
1 catagaaaac ctccataaat aaattacaac taacttctgt tatgtgtaaa atggaaacta
61 ttaaaactta atatcggaaa tggtaagtga aatttgtata aatatacaat tttaaatatt
121 tattctcaca aatataaggt tgacctagca ttatgttctc tgtaaataat acacactcat
181 cagtttcttg ctccccctct attaactcaa actcaaccag taatgaacat tatctgagaa
241 tcctgactga atgggaaaag aactcttctc ccgggaagag cgaggcattg cttttaacag
301 actctcccag tgctttcaga atcaagaagc agtattaaat ttatcagacc taaatttgac
361 gtctcttccc gaattaccaa agcatatttc tgctttgatt gtagaaaata ataaattaac
421 atcattgcca aagctgcctg catttcttaa agaacttaat gctgataata acaggctttc
481 tgtgatacca gaacttcctg agtcattaac aactttaagt gttcgttcta atcaactgga
541 aaaccttcct gttttgccaa accatttaac atcattattt gttgaaaata acaggctata
601 taacttaccg gctcttcccg aaaaattgaa atttttacat gtttattata acaggctgac
661 aacattaccc gacttaccgg ataaactgga aattctctgt gctcagcgca ataatctggt
721 tacttttcct caattttctg atagaaacaa tatcagacaa aaggaatatt attttcattt
781 taatcagata accactcttc cggagagttt ttcacaatta gattcaagtt acaggattaa
841 tatttcaggg aatccattgt cgactcgcgt tctgcaatcc ctgcaaagat taacctcttc
901 gccggactac cacggcccgc agatttactt ctccatgagt gacggacaac agaatacact
961 ccatcgcccc ctggctgatg ccgtgacagc atggttcccg gaaaacaaac aatctgatgt
1021 atcacagata tggcatgctt ttgaacatga agagcatgcc aacacctttt ccgcgttcct
1081 tgaccgcctt tccgataccg tctctgcacg caatacctcc ggattccgtg aacaggtcgc
1141 tgcatggctg gaaaaactca gtgcctctgc ggagcttcga cagcagtctt tcgctgttgc
1201 tgctgatgcc actgagagct gtgaggaccg tgtcgcgctc acatggaaca atctccggaa
1261 aaccctcctg gtccatcagg catcagaagg ccttttcgat aatgataccg gcgctctgct
1321 ctccctgggc agggaaatgt tccgcctcga aattctggag gacattgccc gggataaagt
1381 cagaactctc cattttgtgg atgagataga agtctacctg gccttccaga ccatgctcgc
1441 agagaaactt cagctctcca ctgccgtgaa ggaaatgcgt ttctatggcg tgtcgggagt
1501 gacagcaaat gacctccgca ctgccgaagc tatggtcaga agccgtgaag agaatgaatt
1561 tacggactgg ttctccctct ggggaccatg gcatgctgta ctgaagcgta cggaagctga
1621 ccgctgggcg caggcagaag agcagaagta tgagatgctg gagaatgagt actctcagag
1681 ggtggctgac cggctgaaag catcaggtct gagcggtgat gcggatgcgc agagggaagc
1741 cggtgcacag gtgatgcgtg agactgaaca gcagatttac cgtcagctga ctgacgaggt
1801 actggccctg cgattgtctg aaaacggctc acgactgcac cattcataat cacgtcgcat
1861 aagcataaac cgcagaccgg attgactccg gaaaaactgt gacccgatta cggaccttaa
1921 caacaacccg taaatcctcg ctcaataccg gcagggattt acggcgtgca actgactttt
1981 ttgaggggat aaccaaccag atcgtttgct atgggaatat cgagacagta atgagttaaa
2041 tgataaaaat tgtttgaaaa tataggggat aaagatcaat ccaaactgga tgaaagtaga
2101 actggtcaca ttaacatggg tagactgata taacaatcga cggttactgg aaagacagga
2161 acatattcct ccagccggaa tgaaaacgcc gataaagctc taggattgtt tttttaaaga
2221 ctttctcgtt ttatttgcat taatagacca agatatgaat agtgaggggt taataaatga
2281 aaccgatcaa caatcattct ttttttcgtt ccctttgtgg cttatcatgt atatctcgtt
2341 tatcggtaga agaacagtgt accagagatt accaccgcat ctgggatgac tgggctaggg
2401 aaggaacaac aacagaaaat cgcatccagg cggttcgatt attgaaaata tgtctggata
2461 cccgggagcc tgttctcaat ttaagcttac tgaaactacg ttctttacca ccactccctt
2521 tgcatatacg tgaacttaat atttccaaca atgagttaat ctccctacct gaaaattctc
2581 cgcttttgac agaacttcat gtaaatggta acaacttgaa tatactcccg acacttccat
2641 ctcaactgat taagcttaat atttcattca atcgaaattt gtcatgtctg ccatcattac
2701 caccatattt acaatcactc tcggcacgtt ttaatagtct ggagacgtta ccagagcttc
2761 catcaacgct aacaatatta cgtattgaag gtaatcgcct tactgtcttg cctgaattgc
2821 ctcatagact acaagaactc tttgtttccg gcaacagact acaggaacta ccagaatttc
2881 ctcagagctt aaaatatttg
//
LOCUS BOVMTAK5 3852 bp ds-DNA MAM 08-JUN-1990
DEFINITION Bovine mitochondrial adenylate kinase gene, exon 6 and 7.
ACCESSION D90069
KEYWORDS ATP:AMP phosphotransferase; adenylate kinase.
SEGMENT 5 of 5
SOURCE Bovine mitochondria DNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 3852)
AUTHORS Tanaka,H., Yamada,M., Kishi,F. and Nakazawa,A.
TITLE Isolation and characterization of bovine mitochondrial adenylate
kinase gene
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hiroshi
Tanaka Department of Biochemistry Yamaguchi University School of
medicine 1144 Kogushi Ube, Yamaguchi 755 Japan Phone: 0836(22)2215
Fax: 0836(22)2315
FEATURES from to/span description
pept / 126 347 adenylate kinase 2A exon 6(a+b) (EC 2.7.4.3)
pept / 126 + 321 adenylate kinase 2B exon 6a
pept + 2572 2576 adenylate kinase 2B exon 7
pre-msg < 1 491 adenylate kinase 2A mRNA and introns
pre-msg < 1 3448 adenylate kinase 2B mRNA and introns
IVS 322 2571 adenylate kinase 2B intron
BASE COUNT 942 a 915 c 933 g 1062 t
ORIGIN
1 tctagattgt ctgtctctcc ctgtgatgga ctttgggatc agacagcatc tcagatctgc
61 atctggcctt ggattaggag accttgctga gtcccagcct ttcattgccc tcttcttctt
121 ctcagatcac tggggaaccg ttgatccgcc gatcagatga taataagaag gccttgaaaa
181 tccgcctgga agcctaccac actcaaacca ccccgctggt ggagtactac agtaaacggg
241 ggatccactc cgccatcgat gcgtcccaga cccctgacgt cgtgtttgca agcatcctag
301 cagcattctc caaagccaca tgtaaagact tggttatgtt tatttaatgt tgggtccaaa
361 aaggaacttc tttcttcttc ctttccctgt tgaacgagtg ggtgggaatg gcagagtagg
421 cagagggaag cttcctcagg ccagcaagaa tatcatttga tgtattgatt aaaaaagcac
481 ttgcttcatg tacctttggt gtgtgcgcaa gtctcatctc atctgtgtgt gtgtgtggtg
541 tgtatgtgcg tgcacgtgca cacacgtgtg taagtataag tatgtgtaca ctcttatact
601 tcttaaattg taggcgagac tgtttacttc tttagccgta ctctttattt tcgacttccc
661 tggtggcgca gacggtaaag catctgtcta caatgcagga gacccgggtt cgatccctgg
721 gttgggaaga tcccctggag aaggaaatgg caatccactc taggactgtt gcctggaaaa
781 tcccatggac agaggagcct ggtaggctac agtccatggg gttgcaaaga gtcagacaca
841 actgagcgac ttcacttcac ttcactttat tttcaaccaa aataaactaa ctcaagtatc
901 cgccaggcag acaagttcct ggtatttttc catctccagc agctaattag aagtcctggg
961 tcacatgcag tcaggcaggg ctacagttcc tgatggcagc ctgttggcca gaaattccac
1021 ttgttttctc acccataatg aaaagtcttt gagtcattgt ggaaagggat cattaatttt
1081 ttccccctaa acaggaatga aaaggcactt actttataga ttccagaaat tactgggaga
1141 gggtatcacc atagaaagag cctggccaag ttgaattatt tttgtgatct gccccatggt
1201 tctataaagt aggaatcatt cggcacctgt gttgaatact aaattgatgc caaagaatta
1261 aaccaggtga aaggtcctct tgaattcaga ctgtcttctg aacatccagg ctaggcatct
1321 gagagcagcc agtccacttc cccaaagaca gcaggataga tttgtcttta ttttcaatgt
1381 ttgctctgtt tgcaagtgtg aacaaaacgg tgatttgagg atcatttggg gatcatgttg
1441 attagcattt attcaagccc agtctctaca ggatatgaag tcacatatat cccttttgaa
1501 cgcagccctc tcccttccat atctagtcct tcacccctta tgtcatgacg gtggtggcca
1561 cagtctcccc aagaaaggac cacccccaaa ataagatcac ctatggccgt aaccagcttt
1621 tcttcctaac ctcggctccc ggctcttagg catctcttga gatccacact gtgccttttg
1681 gtctctggtt tgatctgtgg caaacggagg aattaataga ctgctatgag ggaccacaaa
1741 aacagcaggc tctggaaaaa ccattaaaaa gtcagtgaca ggtccagtaa ataatgccac
1801 ctgcccaggg tactctgctg actcagtgca gtggcctcgg aagtgcccag cccccacctg
1861 acctgagctc tcctgctgtc tgtgagacca ccagagggct tgtcctctgt ccgcctggaa
1921 actggctgcg gtacgtggcc acctgtgtcc ctctctgtct gcttcaagtg agtgtctgct
1981 ggctctgctc tgccttgttt ccctggttgg agactaactc cacccattct taacggaaac
2041 ccagcctggc atgggatgtt tctgggaagc acatgatttc tgagaatgga caaggaaagg
2101 gagtgaaacc aaaactgtct gctgtgtctg cagggatctg ggctgcttct ctgggtgagg
2161 tggcctcatg aatcttagag ttgactcccc tttgacgggg gacagttgca ggcacttggc
2221 ctcaaagttg aggtccttga atgattgctg atagcgtgtg catgatctgt gctcaaatat
2281 ttgtctgtgt gcattcacac gtgcacacat atacatgtga tatttgggct accaatggga
2341 aagggaaggt cgtcgggggt gcatatgtcc tcttgggggt gggtagaatt caatttttgc
2401 tttgcttaat tccagagcct taacctcagc aggcatgaga gctttcacct ggcagagaaa
2461 tgtccagtgt ttaatgcatt tggttactgg ggtgtgcatg ccaaccttct ctgatggtta
2521 cttggttttt aaaaatgcat tttgttttct cttgtctcct ctttatttca gcctagtaac
2581 agaaggccag gcgggactgc accactactc atcaccccac ggcatgatcc ctgctcttag
2641 gtgctgggca gagggagggg cggtcagggt aaggatgggg acagaggagc ggtgagggcc
2701 caagcggggg agaggagctg tgttgccatg aagcagtttc atttacgtgg agtagaactc
2761 ttaaaagcat aagcaaaggg gaaaattaat tttttaaaac acttgattgg agggtataaa
2821 tagaaacaag gagatgcaat attatttcta aggaatcatg ctttcattta ctttggactg
2881 gtgacaactg gacaggtctc agtgccctga gacctcagct ttgatcagaa cctcatgcca
2941 gcccaggaat gcgggaaatc acactgttgt cctgtctgac ccatagcttg gaacagggca
3001 ggagttgatt actaatcctg gttacccata ccatgaaatc agaacgtcat ctccccacgt
3061 gcaatagaca tggaatgtgt tcctgggggc ggcgtctcaa caaatccggc ttactggagt
3121 catggggcag gctggtgtcc ctcccttcca tactcaccac tgatttacca gcccacctgc
3181 cctcacgggt gagccctcgg cagccaccca gcatacgcca cagtcctgtg ctcttgcctt
3241 cctccatcca catcatgtga aaggactctt tttaatcaat gagcaagtgt cctaagcaac
3301 attatccaaa gactgtcctt tccatcctca aatcctatga ctgggatcag tctacaacac
3361 tgtgatgtat tattttcaat gaggtgcctt tcttaactgt ccaaatgctg ccttgtttgg
3421 cccttaaata aagtgtgtta aaagtttgta tcccctgttg tggcattttt taaggagctg
3481 tgggctagta aaatgacgct gaattctgga tctgacttag aagccctttt ctgggtcatt
3541 agaagtcata agaaggcaga aaggtccaag agtgtgttca taagaagagt ctgctccctc
3601 ttggccttca gctcatctgc aggattttct tgagcccaag gacaaggcag cttgggccaa
3661 tggagacagt gctgtgcttt gggttcaggg atgtacaaca aatcccagcc aatcatttgc
3721 tcctgcttcc ttgtgtgaga aagagactag tggtggtcca gtggttaaga ctcttacctg
3781 ccactgcaga ggacactggt tcgatcccgg gtctgggaag atcccacatg ccgtgggcac
3841 tggacccatg tc
//
LOCUS BBRHWP 4330 bp ds-DNA BCT 08-JUN-1990
DEFINITION B. brevis cell wall protein (HWP) gene.
ACCESSION D90050
KEYWORDS HWP; cell wall protein.
SOURCE Bacillus brevis (strains HPD31, HPD52 and HPO33) genomic DNA.
ORGANISM Bacillus brevis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 4330)
AUTHORS Ebisu,S., Tsuboi,A., Takagi,H., Naruse,Y., Yamagata,H.,
Tsukagoshi,N. and Udaka,S.
TITLE Conserved structures of cell wall protein genes among
protein-producing Bacillus brevis strains
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Norihiro
Tsukagoshi Department of Food Science and Technology Faculty of
Agriculture, Nagaya University Furo-cho, Chikusa-ku Nagoya 464
Japan Phone: 052-781-5111 x6869 Fax: 052-781-4447
FEATURES from to/span description
pept 548 3898 HWP precursor (ttg start codon)
sigp 548 706 HWP signal peptide
sigp 638 706 HWP signal peptide
matp 707 3895 HWP mature protein
site 132 132 transcription initiation site
site 255 255 transcription initiation site
site 325 325 transcription initiation site
site 373 373 transcription initiation site
site 386 386 transcription initiation site
BASE COUNT 1401 a 804 c 1000 g 1125 t
ORIGIN
1 atcgattacc aaaaaaccgc gaaagattcc gcggctgaga gctatcgttt gaccaatttg
61 cgctttgaaa cggtctggca acgaccctcg aagtgattca gccgaggagg aattatccaa
121 ccgtgagaat cagtaccaaa aagcgatcct gcagtacaat ctggctgtgg taaactttga
181 aactgccctc ggaagctaag aagaatttgt aaggttttga atggaactgg aaaaaggttc
241 aatggtgaca gtccgtccgg cgtaccctat aatacgagtt gtggcggatg tcactgctta
301 catattacaa gtgaatacga cccatgaaaa tttctttaaa ctttttttct gaggcgccgc
361 aacttttgct cgactgaggc gtttaatagg gtgtcacacg aaaaacgggg aattgtgtaa
421 aaaagattca cggattctag catttgtgtt acactagtga ttgttacatt ttacacaata
481 gctgaatata ctagagattt ttaacacaaa aaagcgaagc tgtcctgcga aaggaggtga
541 cacgcgcttg caggattcgg gctttaaaaa gaaagataga tcaacaaata ttccccaaga
601 acaatttgtt tatactagag gaggagaaca caaggttatg aaaaaggtcg ttaacagtgt
661 attggctagt gcgctcgcca tcacagttgc tccaatggct ttcgcagcag aggacacaac
721 tacagctcct aaaatggatg ctgcaatgga gaaaaccgta aaacgtctgg aagctcttgg
781 cctggtagca ggttatggca acggcgattt cggtgcagac aaaaccatca ctcgtgcaga
841 gttcgctact ctaatcgttc gcgctcgcgg tctggagcaa ggtgcgaaac tggcacaatt
901 caacactact tacacagatg ttagatccac tgactggttc gctggttttg taaacgtagc
961 ttccggtgaa gaaatcgtaa aaggtttccc ggacaaatct tttaaaccac aaaaccaagt
1021 tacttatgct gaagcagtaa ccatgatcgt tcgtgctttg ggttatgagc catccgttcg
1081 aggtgtatgg ccgaacagca tgatctccaa aggttccgaa ctgaacattg caaaaggtat
1141 caacaaccct aacatgcagc agttcgcggc gacaatcttc aaaatgctgg acaacgctct
1201 tcgcgttaag ctgatggagc aaatcgaata cggtactgac atccgtttaa acgtaactga
1261 cgaaactctc ttgactaaat atttgaaagt taccgtacgt gatatggact gggctcacga
1321 aaagggtaac aattctgatg aattgccact tgtaacaaac gtacctgcta ttggtctggg
1381 tagtttgaaa gcaaatgaag ttactttgaa tggaaaagat gctgatctgg gtagcaacac
1441 tacttataaa gtagctgaag gcatcaatcc taacgcattt gatggtcaaa aagtacaagt
1501 gtggatcaaa gatgaccgag aaaatgtcat cgtttggatg gaaggttccg aagacgaaga
1561 tgtcgttatg gaccgtgtga gtgctctgta cctgaaaggt aaagccttca cagatgatat
1621 tgtaaaagat cttagcaagt ctgatttgga tgatgtaaaa atcgaaatgg atggtagcga
1681 gaaatcgtat cgcttgacgg aagatacgaa aatcacttat aacttcactc gtttcaacga
1741 tccagtagat gctctgagca aaatttacaa agacaacgac acgtttggtg ttaaagttgt
1801 tttgaatgat aacaatgaag ttgcatacct ccacatcatt gacgatcaaa caattgataa
1861 aagcgtaaaa ggcgttaaat acggttccaa agttattagc aaaatcgatg ctgataagaa
1921 gaaaatcacc aacttggata actccaaatt tagcgacttg gaagatcagg atgaaggaaa
1981 agacttcttg gtattccttg atggccaacc agctaaactg ggtgatctga aagaatccga
2041 tgtttactcg gtatactatg ctgatggtga taaagataag tacttggtgt ttgctaaccg
2101 taacgttgca gaaggaaaag tagaaaaagt ggtaagccgt aacaagacag acattcgtct
2161 gactgttggt ggtaaaactt acaaagtata tccagatgct tcttactccg aaaacgcaaa
2221 taaagatgtt aagaaggtta attccgactt ggatctgatt agcaaccttg atggagaaga
2281 agtaaaactt ctcttggatc catccggtcg agttcgtcac atcgaaacaa aagacgcgat
2341 cgatgatcgt aaaccattgg caatcatcac taagggtgct acctataact ccagcaaaga
2401 tacttatgac tttactgtta tgacccaaaa aggtaaaaca caaatcgtat ctttggatca
2461 aaaggacatc tacgatagat atggggttaa ctacgataaa tccaatgata agcgtcaagc
2521 tttcgaaaag gatttagttg agcttctgca acctaaggta gttaaagagg actcagctac
2581 tgatgccaat caaaccgttc ttttggaagt taattttgat tccaaaggcg aagttgataa
2641 agtcaaagtt cttgatagca agctgaaata ctccgagaaa agcacttggg ataaacttgc
2701 ggatgaagat gacgacgtcg ttggtgatta tgaagtaacc gataagactg ctgtcttcaa
2761 aatgactggt gaccttactc cagctactgg aactaaacgt ggtgaactga aaaacgctgg
2821 tactgctaaa tttaaggatg ttgctaagaa gagcgatctg aaggtttggt actcagtgga
2881 ggaagacaaa ggtgaagttc aagcgatctt cgtagtagat ggcagtggtt tgggtggcga
2941 ccatcagttt ggtatggtca aacaatatgg tactgctagt aaacaggaca ctatcactat
3001 agttacaaaa gatggtgatt ctgtcactga aaaagagtac aaattggatg gcgacgctga
3061 tgatttgaaa gttgatcaag acatcagacg tggagatgtc atttctttca cactgaattc
3121 tgatggagaa gttattgttg acgatgtagt cgaggttgta aataacaacc acattgataa
3181 cactgcttct aaatcagcta cgctcatgcc tgaagacgaa cgtcaaaaag caggaatcga
3241 caaattggtt gttgctcgcg ttgacgaagt tgatggtaac actatttcct tgaactatgc
3301 tgacggaaag acacaaaaat attacacaaa agcatccact gcgttcattg atgtgtatga
3361 cggtcttgaa ggaattgatg gagtagatga aggcgactac atcgtaatga tcgatagcgc
3421 cgatattgac ggaactcgct ttgactatgt actggtagtt tctagcgatg atgagatccg
3481 cacgcaacac atctccacta aagcagttac ggacttcctg aacaagccaa ccagactatg
3541 taccaaatcc tggcgatggg gaagaagtag tcacggcacc aaagttaata cagttaacga
3601 tgaagcagtt gtagatggta ttgtaactct tccagctgat gcatctgtta gaaacttcaa
3661 cattgcattt gatcaagaaa ttaacagcaa agatgcaacg gtaactgtta ctaatgaaga
3721 tacgcttggt aacgtaacgg tatctgaggt tgcgacagat gcaaaagtat tgagcttcaa
3781 gactgctaaa ctggacacta caaaaactta catcatcaca gttaaaggct tgaaagataa
3841 aaacggtaaa gcagtgaaag atgtaactct ttacgttgaa tttgttgctg gagtgtgact
3901 tgttgctcta taaacactaa tagtgtgaaa tgagtgaaga agggagaagg attaaaacct
3961 tctcccttct ttaactgttt tggcgaattg aagggtgtaa cccccttccg tggaaacaac
4021 cattaatcca cgtgactaat cctcctgcat gcgttgtaag ctaactctta cagggtgtga
4081 agccaggtga agacgtaacc cttagtggtg cgggtctccg taaaagctaa ctatggttag
4141 gaaacgaatc tacgtctgag gcgtgtcagg tttggtaatt agcccttcag ctgctaatta
4201 ccccacaggc tgagatgcct ggagggttct aacgaattaa ggttgttaac gttagttcat
4261 cttgatatgt atagtaggaa cctaaggttt agcaaaaaaa gggatagcca aaatggaaca
4321 agtgagatct
//
LOCUS BSPRSDA 2996 bp ds-DNA BCT 08-JUN-1990
DEFINITION Bacillus sp. raw-starch-digesting amylase gene.
ACCESSION D90112
KEYWORDS alpha amylase; raw-starch-digesting amylase.
SOURCE Bacillus sp.(strain B1018) DNA.
ORGANISM Bacillus sp.
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2996)
AUTHORS Itokor,P., Tsukagoshi,N. and Udaka,S.
TITLE Nucleotide sequence of the raw-starch-digesting amylase gene from
Bacillus sp. B1018 and its strong homology to the cyclodextrin
glucanotransferase genes
JOURNAL Biochem. Biophys. Res. Commun. 166, 630-636 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Shigezo
Udaka Department of Food Science and Technology, Faculty of
Agriculture Nagoya University Furo-cho, Chikusa-ku Nagoya 464 Japan
Phone: 052-782-5111 x6356 Fax: 052-781-4447
FEATURES from to/span description
pept 313 2454 raw-starch-digesting amylase precursor (EC
3.2.1.1)
sigp 313 393 raw-starch-digesting amylase signal peptide
matp 394 2454 raw-starch-digesting amylase mature peptide
binding 302 306 ribosome binding site
signal 101 106 -35 region
signal 125 130 -10 region
rpt 2567 2580 inverted repeat
rpt 2585 2598 inverted repeat
BASE COUNT 764 a 852 c 774 g 606 t
ORIGIN 10 bp upstream of RsaI site.
1 ttatttgagt acattttatg tattcccaca ttgcgcccga tatctacgct tagaaaaaaa
61 tcgtcggaaa agcgccccaa aaaattttta ttgttattta ttgacagttg tattcgcttt
121 catctacaat gatggaggaa cgcaatactc gatataattt aagggccatg cattccgtga
181 ccgcacaccc ggtatggaac aaccccggta tctcgatgga gaagccgggg ttttttgtcg
241 ccctttttta ggaggtgatc cggcgacagc ggatcaagcc tggaattcaa ataattacat
301 aggaggtata acatgaagaa atttctgaaa atgacagccg cgttttccct gggattatcc
361 ctggcgttcg ggcttttcag ccccgcccag gccgcgccgg atacctcggt atccaacaag
421 caaaatttca gcaccgacgt catctatcaa attttcaccg acaggttttc ggacggcaat
481 cccgccaaca atccgaccgg cgcggcgttt gacggaacct gcacgaacct ccggctgtat
541 tgcggcggcg actggcaggg catcatcaac aaaatcaacg acggttacct gaccgggatg
601 ggcgttaccg ccatctggat ctcccagccg gtcgaaaaca tctacagcat catcaattat
661 tccggcgtca acaacacggc ctatcacggc tactgggccc gggacttcaa gaagacgaat
721 ccggcctacg gcacgattgc ggacttccag aacctgatcg ccgccgcgca tgccaaaaac
781 atcaaagtca ttatcgactt cgccccgaac catacgtcgc ccgcctcgtc cgaccagcct
841 tcctttgcgg aaaacggccg gctgtacgat aacggcacgc tgctcggggg atacacgaac
901 gatacgcaga acctgttcca ccataacggc ggcacggact tttccacgac cgaaaacggc
961 atctacaaaa acctgtacga tctcgccgac ctgaaccata acaacagcac gtcggacgtc
1021 tacttgaagg acgcgatcaa aatgtggctg gatctcggca tcgacggcat ccgcatggat
1081 gcggtgaagc atatgccgtt cggctggcag aagagcttta tggctgccgt caacaactat
1141 aagccggtct ttaccttcgg cgaatggttc ctgggcgtaa acgaagtagg cccggaaaac
1201 cataagtttg ccaacgaatc cggcatgagc ctgcttgatt tccgttttgc ccaaaaggtg
1261 cggcaggtgt tccgggacaa caccgacaat atgtacggcc tgaaggcgat gctggagggc
1321 tccgcagccg attacgccca ggtggatgac caggtgacgt tcatcgacaa ccatgacatg
1381 gagcgtttcc acgcaagcaa tgcaaaccgc cggaagctgg agcaagcgct ggcgttcacg
1441 ctgatcctcg cgcgcgtccc cgccatttat tacggcaccg agcagtacat gtcgggtggg
1501 accgatccgg acaaccgggc gcggatccct tccttctcca cgtcgacgac cgcctatcaa
1561 gtcattcaaa agctggcgcc gctgcgcaag tccaacccgg ccatcgccta cggatcgacg
1621 caggagcgct ggatcaacaa cgacgtgctc atttatgagc gcaaattcgg cagcaacgtt
1681 gccgtcgttg ccgtcaaccg caatttgaac gcgccggctt ccatttcggg acttgtcact
1741 tccctgccgc aaggcagcta caatgacgtc cttggcggcc ttctgaacgg caacacgtta
1801 acggtaggct ccggcggagc cgcctccaat ttcacgcttg cggccggcgg cacggcggtg
1861 tggcagtaca ccgcggcaac ggcgacgccg accatcgggc atgtcgggcc gatgatggcc
1921 aagccgggcg tgacgatcac gatcgacggc cgcggcttcg gctctagcaa aggcaccgtc
1981 tacttcggca cgacggcggt gagcggcgcc aacatcacgt cttgggaaga cacgcagatc
2041 aaagtgaaaa ttccggccgt cgcaggcggc atctacaaca ttaaagtcgc aaacgccgcc
2101 ggaacggcaa gcaacgtgta cgacaacttc gaggtattgt ccggagacca ggtcagcgtc
2161 cgcttcgtgg tcaacaacgc gacaacggcc cttgggcaaa atctctacct gacgggcaat
2221 gtcagcgagc tggggaactg ggacccggca aaagcgatcg ggccgatgta caaccaggtc
2281 gtttaccaat atccgaactg gtattatgac gtcagcgttc cggccggcaa aacgatcgag
2341 ttcaagtttt tgaaaaaaca aggctccacc gtcacgtggg aaggcggcag caaccacacc
2401 ttcaccgcgc cgtccagcgg caccgcgacc attaacgtga attggcagcc ataaggcgtg
2461 agggataggc ggctggcatt cattggaaaa ggcggactat atgacgtccg ttccgtgagc
2521 aacgctcatc gctccgttca aaccgccaca aggctgatct tcagccaaaa aaagagggga
2581 cctttcccct ctttttttat ttccgttgac taacggtatt cccaaaaatt acattggggg
2641 ataagctccc tcccctctaa tagcaataac aagagcgtaa acccaaccag gtgatccata
2701 gcgtgcggtc gcctttaatc ccggtatcaa aatgtatcct accttacaaa aatgatcgga
2761 tcatacaaaa tagtgcgtac tactcaacga aatagaacct acatacagaa cgatcgatcc
2821 agatttcaac gaacggcacg gtcgtttaaa aaaatggtgt gcggggtgcg agaatatgca
2881 agaatatcaa ctgactttga aagataagcg gatcgtatgg gggaaggcga tcgaccttga
2941 gcctctcatt ggcaaatatc ctggcgactc gattagacag ggcatgaacg aagctt
//
LOCUS HUMALPL 3101 bp ds-DNA PRI 08-JUN-1990
DEFINITION Human alkaline phosphatase (EC 3.1.3.1) gene.
ACCESSION D90054
KEYWORDS alkaline phosphatase.
SOURCE Human liver DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3101)
AUTHORS Matsuura,S., Kishi,F. and Kajii,T.
TITLE Characterization of a 5'-flanking region of the human liver/bone/
kidney alkaline phosphatase gene: Two kinds of mRNA from a single
gene
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Fumio
Kishi Department of Pediatrics Yamaguchi University School of
Medicine Ube, Yamaguchi 755 Japan Phone: 0836-22-2258 Fax:
0836-22-2696
FEATURES from to/span description
pre-msg 2130 3101 alkaline phosphatase mRNA and intron
IVS 2341 3101 alkaline phosphatase intron
rpt 375 664 Alu sequence
rpt 2631 2926 Alu sequence
BASE COUNT 775 a 752 c 692 g 882 t
ORIGIN chromosome 1; map position p34-36.1.
1 aagctttctc cagcgagtat gatggtttct gcaggttctt ggcataaagc ctttatcaga
61 ttaaggaaat tcttttcaat acctggtttg ctgagggctt ctgtcacatc gttttctgtg
121 accccattcc ctctccctag gtgagcacgt caagtttgat cagggtgtta aactgccacc
181 cctgtgccta tgattcccaa atttatactc taacccagac ttctttttca aatgccagag
241 ccaaatattc agctgcctcc ttagtgtctc cacttctaaa agacatctcc aactcaacat
301 atccaaaaac aagttcctga ttgtctccac ctcatgcctc aaaagaccac cccaaacgcc
361 gaaaggctga atgctttttt ctttttcttt tttttttttt tctgagatgg agtctcactc
421 tgttgcccag gctggactgc agtgatgcga tctcagctca ctgcaaactc tgcttcctgg
481 gttcaagtga ttctcctacc tcagcctctc aggtagctgg gactacaggt gcacaccacc
541 atgcccagct aatttttgta gagagagttt caccatgttg gccaggctgg tctcaaacac
601 ctgaccttaa gggatccacc cgcctcagcc tctcaaagtg ctgggattac aggtgtgagc
661 catcgcactt ggctcggtag tatatggctc agaaacattg ccatttacaa tagttcccca
721 aaaagcaaaa ttcttaggta taaatctgga ttcagagtcc agaatgctaa ccattacacg
781 atggaacccg taggtataaa tctaagaaaa catatccaag atctacaggc tgaagactac
841 agagtgctga taaaaccgaa gaactctgac tgaatgagtg gagagacgtg gtgtcttcat
901 gactgggcaa ctccatgtgg tatagacgta aaccctccca cattgatctg tggatttaat
961 accataccta tcaaaaacac agtggtggag gacagatcag ggatcgccag gtttagggat
1021 ggggggattg tgtaactata aagaacgcaa gagagatttt tggggtggca gagctgttct
1081 gggtcctgac ggtggcggtg gtggttacat aaatctatcc atgtgtcaaa cgtcagaaca
1141 ctcattttac acttgggggc aacagaaatc cctccctctg gagggggtga ctgatggtaa
1201 cctgattgct aattctggaa tcaggagccc tgtggtcagg tttctgctct gcaacttcct
1261 gttggtaacc ttgggcaagt ctccgtccag agccttggtt ttctcatctg taaaaggaga
1321 tgataggtcc ttttctgtcc actgcatagc tgattagtga aacatcatgg tgaaattctt
1381 tatgaactat ggagtgcagc acatagactt gctttcattt tgtcagtatc ctttatagat
1441 tgttcatgta agctcccaaa gagtagtatt tattttattg aaataaaatg cacgtagaga
1501 aaaatgtgtg tatcatacat tgacagctga acccaccgtg taaccagcac ccacccaccc
1561 agatcaatca taaaccgaac cgcaccagca ccccagcagc ccgttcccgt ttccgtaccc
1621 tccacgtgga gcctccgttc tgtctcccaa cgccctgggt tagtttttat actttctgtc
1681 atcggaatca cactgtaagt gctcttgggt ttagcttcct ttgctcaagc ttaccttgtg
1741 cgattcattc atgttgttgt gaggagctgt ggatcatcca ttctccttgc tgtctgtggt
1801 ggtttctgtg ttgtgaacac acacaatgta ttatccagcc tgccgtagat ggaggcagtt
1861 ttgaagccat tataaacagg gctgatgtgc acattctgct ggagagaaac gggtcccagg
1921 gtacaggtag gatgatcagc ttcggtagat cctgccggtt ttcccatgcg ctgtgcctgt
1981 ctgcactcca ccaacggcga gcggaccttc cggtagttaa acatcttcac gaactcttgg
2041 actttcctgc acacacagag aagataattt tggatggctc ttcccttccc cccacaacct
2101 tccttagggc actggctttc aactgatgta aatatttact atgccaagca ctaggagggc
2161 agagacaaac aagacaaagt cctcacactt agaaactccc ggtgtggcag ctgagatggc
2221 ccaggaaaga actatattac cttcaaaaag agaggtacat gcgatgtttg aggtggcatg
2281 aagctcagtg gtgttatatt ggaatgagtg agtgaccatc ctggagcctt cctgaaagag
2341 gtgacttcat ttttaagtga ttttaaataa tagtttaatg aattagtatt tcgtattcag
2401 ttaataacat ttttctgatt ttaggatttg ctatagaaat atttggaaac cgtaaagtag
2461 aacaaaaaaa aaatgtagga atcatctgaa attccaaatt ctaccactca cagttaagtg
2521 ttgttagatg ttagatgtgg gatattgcct tttaatttcc actctgcgcc gctaccccca
2581 gcccctaccc cagagccgtc acttctggca ctggagcgca gcttgcgtgg tttttttttt
2641 tttttttttt tttttgagac agagtcctgc ctgtcgccca ggctggagtg cagtggcgcg
2701 atctcggctc actgcaactc cccctcccgg gttcacgcca ttctcctgcc tcagcctcct
2761 gagtagctgg gactacaggc gcccgctacc tctcccggct aattttttgt atttttagta
2821 gagacggggt ttcactgtgt tagccaggat ggtctcgatc tcctgacttc gtgatccgcc
2881 cgcctcggcc tcccaaagtg ggcagatcac ctgaggtaga gagttcgaga ccagacctga
2941 ccaacatgga ccccatctct actaaaaata caaaattggc cagggcatgg tggcgcaagc
3001 tgctaatccc agccactcag ggaggctgag gctggaaaat tgcttgaacc cgacctgcag
3061 gcatgcaagc ttggcgtaat catggtcata gctgttttcc t
//
LOCUS RATCNRAA 2337 bp ss-mRNA ROD 08-JUN-1990
DEFINITION Rat calcineurin A alpha mRNA, complete cds.
ACCESSION D90035
KEYWORDS calcineurin; calcineurin A alpha; calmodulin binding protein;
calmodulin-dependent protein phosphatase; isoform.
SOURCE Rat brain, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2337)
AUTHORS Ito,A., Hashimoto,T., Hirai,M., Takeda,T., Shuntoh,H., Kuno,T. and
Tanaka,C.
TITLE The Complete Primary Structure of Calcineurin A, a Calmodulin
Binding Protein Homologous with Protein Phosphatases 1 and 2A
JOURNAL Biochem. Biophys. Res. Commun. 163, 1492-1497 (1989)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Takayoshi
Kuno Department of Pharmacology Kobe University School of Medicine
7-5-1 Kusunoki-cho, Chuo-ku Kobe 650 Japan Phone: 078-341-7451
x3273 Fax: 078-351-6531 Peptides, 78-329 and 391-414, seem to be
putative catalytic domain and calmodulin binding domain,
respectively.
FEATURES from to/span description
ORF 208 1773 calcineurin A alpha
signal 1944 1950 polyadenylation signal
BASE COUNT 649 a 523 c 596 g 569 t
ORIGIN
1 cgggaggagg agtgaaggcg gcggcggcgg aggagggacg cgcggagccg gcagtaactt
61 tcgagccagc ccagagcccg gagctccagc cgagcggttt gcagcgcggc ggcgcggcgc
121 tgagtgtctg gcccgccggt gcggtcgggg tgtgcagtcg gacgggacca gcagcgcgtc
181 gctgtccccc cctcccggtg actggagatg tccgagccca aggcgattga tcccaagttg
241 tcgactacgg acagggtggt gaaagccgtt ccatttccgc caagtcaccg gctgacagca
301 aaggaagtgt ttgataacga tgggaagcct cgtgtggata tcttaaaagc acatctcatg
361 aaggaaggca ggctggaaga aagtgtcgcg ttgagaataa taacagaggg tgcttcgatt
421 ctccgacagg aaaaaaactt gctggatatt gatgccccag tcacagtttg cggggacatc
481 catggacaat tctttgactt gatgaagctc tttgaagtgg gaggatctcc tgccaacact
541 cgctacctct tcttagggga ctatgttgac agagggtact tcagtatcga atgtgtgctg
601 tatttgtggg ccttgaaaat tctttacccc aaaacactgt ttttacttcg tggaaaccat
661 gaatgtaggc acctaacaga gtatttcacg tttaaacaag aatgtaaaat aaagtattca
721 gaacgcgttt atgacgcctg tatggatgcc ttcgactgcc ttcccctggc tgcgctgatg
781 aaccaacaat tcctgtgtgt acacggtggt ttgtctccag agattaacac tctagatgac
841 atcagaaaat tagaccgatt caaagaacca cctgcttatg ggcctatgtg tgacatcttg
901 tggtcagacc ccctggagga ctttggaaat gagaagactc aggaacattt cactcacaac
961 acagtcaggg gttgttcgta cttctacagt tacccggctg tatgtgactt cctgcagcac
1021 aataatttgt tgtccatact ccgagcccac gaagcccagg acgcagggta ccgcatgtac
1081 aggaaaagcc aaacaactgg cttcccgtct ctaattacga tcttctcggc accaaattac
1141 ttagatgtgt acaataataa agctgcagtg ttgaagtacg agaacaacgt gatgaacatc
1201 aggcagttca actgctcccc ccatccgtac tggctcccaa atttcatgga tgttttcacc
1261 tggtcgctgc catttgttgg ggagaaagtg actgagatgc tggtaaacgt cctgaacatc
1321 tgctcagatg atgaactggg gtcagaagaa gatggatttg acggagccac ggctgcagcc
1381 cggaaggagg tcatcaggaa caagatccga gcaataggca aaatggccag agtattctca
1441 gttctcagag aagagagtga gagcgttcta actctgaagg gcctgacccc gactggcatg
1501 ctccccagcg gagtgctctc tggcgggaaa caaactctgc aaagcgctac tgttgaggcc
1561 attgaggctg atgaagccat caaaggattc tcaccacaac ataagattac cagcttcgag
1621 gaggccaagg gcttagaccg aattaacgag aggatgccgc ctcgcagaga cgccatgcct
1681 tccgacgcca accttaactc catcaacaag gctctcgcct cagagactaa cggcacagac
1741 agcaacggca gtaatagcag caatattcag tgaccacttc ctgttcactt tttttttttg
1801 agctgcaggg catgatgggt ttgctgcatc tcagcagttg gatgttcttg cctctgacgg
1861 tagcttgttt gctctggggg ggccaggaat tggattcagt ttacactatc atgaaaaaaa
1921 aaaagaggga gagagagaga gataataaaa ctatattttg gtgagggtgg tgattaaaca
1981 cctcttttgg gtatgccttt aaaaatgctt ctaggaaaaa aaaagtttta aaaagaaagc
2041 taatgctagt ctatacttca atgttagggg aatgaacacg ttttcctagc gcactgggga
2101 cttttagata ggttaatgaa aggcctttta ttctgttact ggacacgaaa actttgtcta
2161 atttcttata ctctattgta cgtttacagt cgcagcacta aaaatggatg acatcaaaca
2221 tttttaaaca gaaaaaaaag atgtacaaac taaataagga ctatttattg ataatgtttt
2281 gctactcttg tcagacaatg gctataaact gaattaggca gtcttaaaaa aaaaccg
//
LOCUS BOVPAS 3589 bp ss-mRNA MAM 08-JUN-1990
DEFINITION Bovine poly(ADP-ribose) synthetase mRNA.
ACCESSION D90073
KEYWORDS DNA binding protein; helix-turn-helix; nuclear location signal;
poly(ADP-ribose) synthetase.
SOURCE Bovine thymus, cDNA to mRNA, clone pSO-7.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 3589)
AUTHORS Saito,I., Hatakeyama,K., Kido,T., Ohkubo,H., Nakanishi,S. and
Ueda,K.
TITLE Cloning of a full-length cDNA encoding bovine thymus
poly(ADP-ribose) synthetase: evolutionally conserved segments and
their potential functions
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Isao
Saito Department of Clinical Science and Laboratory Medicine
Faculty of Medicine, Kyoto University Shogoinkawahara-cho 54,
Sakyo-ku Kyoto 606 Japan Phone: 075-751-3467 Fax: 075-771-4792
Zinc-binding finger motifs are observed in seq. 21-51 and 128-165.
The sequences, 200-220 and 250-270, showed helix-turn-helix
structure. Nucleotide-binding fold was found in seq. 890-903.
FEATURES from to/span description
pept 77 3127 poly(ADP-ribose) synthetase (EC 2.4.2.30)
signal 224 231 nuclear location signal
BASE COUNT 975 a 879 c 1010 g 725 t
ORIGIN
1 tggatgagac ggcactagca gcagcgggtc tccaggtcgc ggcgtcgggc ttccggagct
61 gtgagcggtt agaaggatgg cggagtcttc agacaagctc taccgggtcg agtacgccaa
121 gagcgggcgc gcgtcttgca agaaatgcaa agagagcatc cccaaggact cgatccggat
181 ggccttcatg gtggagtcgc ccatgttcga tgggaaaatc ccgcactggt accacctctc
241 ctgcttctgg aaggtcggct tctccatctg gcaccctgat gtcgaggtgg aggggttctc
301 tgagctccgc tgggatgacc agcagacgat caagaagatg gccgagactg gcggacggac
361 agatgtttca ggcaaaggcc aagatggagt tggcagcaag accgagaaga cgttgattga
421 cttcggggca gggtacgcca agtccaacag aagcacgtgc aagagctgca tggagaagat
481 agacaagggc caggtgcgcc tgtctaagaa ggtggtgtac cccgataagc cccagctggg
541 catggttgac tgctggtacc acccaaagtg ttttgttcag aaacgggagg agctgggctt
601 ccgtcccgag ttcagcgcaa cgcacctcat gggcttcagc gtcctcaccg cagaggacca
661 agaaaccctc aagaagcaac tcccggccat caagggtgaa agaaagagaa aaggtgatga
721 ggtggatgga atagatgaag tgaccaagaa gaagtctaaa aaagaaaaag acaaggagat
781 taaacttgaa aaggccctta aggcccagaa cgacctgatc tggaatgtca aggacgagct
841 aaagaaagcg tgctctacga acgacctgaa agagttgctc atcttcaaca agcaggaagt
901 gccttccggg gagtcggcga tcttggaccg cgtggccgac ggtatggtgt ttggcgccct
961 ccttccctgc gaggaatgct cgggccagct ggtcttcaag ggcgacgcct attactgtac
1021 cggggatgtg actgcctgga ccaagtgtat ggtcaagaca cagacgccca accggaagga
1081 gtgggtgacc ccaaaggaat tccgagaaat ctcttacttc aagaaactga agatcaaaaa
1141 gcaggaccgt atattccccc cagagagcag caccccagtg ggggcagcag ccccaccctc
1201 cgcagcttca gcgcctgccg ctgtgcactc tggcccccca gacaagccat tatccaacat
1261 gaagatcctg actctcggga aactctccca gaacaaggat gaagtgaagg ccacgattga
1321 gaaactcggg gggaaattga cagggacggc caacaaggcc tccctgtgta tcagcaccaa
1381 aaaggaggtg gacaagttga ataaaaagat ggaggaagta aaagaagcca acatccgtgt
1441 cgtgtctgag gacttcctcc aagacatctc cgcctccacc aagagccttc aggagttgct
1501 ctccacccac ctcttgtccc cctggggagc cgaggtgaag gtggagcctg ttgaagcagt
1561 gggcccaaag gggaagtcgg gggccgcgcc ctccaagaag agcaagggtc ccgtcaagga
1621 ggaaggtacc aacaaatctg aaaagaggat gaaattaact cttaaaggag gagcagctgt
1681 cgaccctgat tcaggtctgg aacacaatgc acacgtcctc gagaaaggcg ggaaggtctt
1741 cagcgccacc ctcgggctcg tggacatcgt caaagggacc aactcctatt acaagctgca
1801 gctcctggag gatgacaaag agagcaggta ctggatattc aggtcctggg gccgtgtggg
1861 cacggtgatt ggtagtaaca aactggagca gatgccatcc aaggaggatg ccattgagca
1921 ttttatgaaa ttatatgaag agaaaaccgg aaacgcctgg cactctaaaa actttacgaa
1981 gcatcccaaa aagttctacc ctctggagat tgactacggc caggatgaag aggcggtgaa
2041 gaagttgaca gtaaaccctg gcaccaagtc caagctcccc aagccagtgc agaacctcat
2101 taagatgatc tttgatgtag agagtatgaa gaaagccatg gtggagtatg agattgacct
2161 tcagaagatg cccttgggga agctgagcaa aaggcagatc caggctgcat actccatcct
2221 cagtgaggtc cagcaggcac tgtcccaggg cagcagtgac tctcacatcc tggatctctc
2281 caaccgcttc tacaccctga tcccccacga cttcgggatg aagaagcccc cgctgctgaa
2341 caacgcaaac agcgtgcagg ccaaggtgga aatgctagac aacctgctgg atattgaggt
2401 ggcctacagt ctacttaggg gtggttctga tgacagcagc aaggacccca ttgatgtcaa
2461 ctatgagaag ctcaaaactg acattaaggt tgtggacaaa gattccgaag aagccgagat
2521 cattaggaag tatgtgaaga acactcacgc gaccacacac aacgcgtacg acttagaagt
2581 cgtcgacatc ttcaagatag agcgcgaagg ggaaagccag cgttacaagc cgtttaagca
2641 gctgcataac cggaggctgc tgtggcacgg gtccaggacc accaacttcg cgggcatcct
2701 gtcccagggt ctccggatag ccccacctga agcacctgtg acgggctaca tgtttggtaa
2761 gggaatctat ttcgcggaca tggtctccaa gagtgccaac tactgccaca cgtcccaggg
2821 agacccaata ggcttgatcc tgttgggaga agctgccctt ggaaacatgt atgaattgaa
2881 acatgctcga catatcagca agttacccaa gggcaagcac agtgtcaaag gtttaggcaa
2941 aactacccct gacccatcgg ctagtattac tgtggatggt gtggaggtgc ctctcgggac
3001 agggatttca tctggtgtta atgacacctg tctgttgtat aacgagtaca tcgtctacga
3061 cattgctcag gtccacctga agtacctgct gaagctgaag ttcaacttta agacatccct
3121 gtggtgagct gtgaggtgcg caggctggtg actggcgtgc tttgcatgac tggccaggca
3181 gaaacctcag ctgagttgtc gaggcatagc tagcttcact cagtctcctt agaagggatt
3241 ttatatgaac acttggtcac atgtttcaga gcgattttcc cagcttttca aatcccttga
3301 ttgtgttggg cgggggagcg ggcggggggt ggtttcgggg tttttctcct gccagccata
3361 accctggtag gtgcaactga caatagagga gaagctgggg agggtcttac ctgtgtctag
3421 attagtccaa tgaaagagcc tcgcaggcca cattagaaca tctgccttac tggtttcccc
3481 agggaaggaa aaaaataatc ttctaccctt gtttttaagt attcatcttt agttttggaa
3541 aaatgttaag tgtttatttt gaggtcaaaa taaaaactaa tttcatact
//
LOCUS CHKNMYC 6360 bp ds-DNA VRT 08-JUN-1990
DEFINITION Chicken N-myc protein gene.
ACCESSION D90071
KEYWORDS N-myc gene; nuclear protein; oncogene.
SOURCE Chicken (strain white leghorn) embryo DNA, clone CNG1.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 6360)
AUTHORS Sawai,S., Kato,K., Wakamatsu,Y. and Kondoh,H.
TITLE Organization and expression of the chicken N-myc gene
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hisato
Kondoh Department of Molecular Biology School of Science, Nagoya
University Furocho, Chikusaku Nagoya 464-01 Japan Phone:
052-781-5111 x6685 Fax: 052-782-8575
FEATURES from to/span description
pept 2872 + 3598 N-myc protein exon 1
pept + 4682 5280 N-myc protein exon 2
pre-msg 2354 6208 N-myc mRNA and introns
IVS 2496 2769 intron A
IVS 3599 4681 intron B
BASE COUNT 1318 a 1648 c 1735 g 1659 t
ORIGIN
1 tgagagtaga cctaaaaagg agaacttgac gtagtattgg tggatgaaaa gctgttagtg
61 tgtgcctgaa acccagaagc caaccatatc ctggaccaca tgaaaagaag catggccagc
121 acggcaaggg tggtgattgt ccctctctac tccattctca aattcctgta cccaaagaga
181 aaacatagct atatacatat atgtatatgt gggggttttt ttgtttgttt ttttgctgtt
241 cccaggacta tgtttagcca atgcatcaaa acgcatacaa gcatttgcgg tcacttgagt
301 tggcactgca ctgtgcctcg tgctctggtt gccacccaga tgctgatcac acaccgtggg
361 tttggtggtt gctgtaccat ggatgcaccg ggtgtgaggc agggccttca cactgagatg
421 ggagccccac agctgccatc ttcctcctcc ggactctgtg ctgcagccag gtccaaccag
481 ctccatctgg ccagcccctc tgtatgcttt cagctctgca gcggacatcc aaacaacagt
541 gtgaacgctt aaggcgacct caccttcctg gaagggcctt gatacctcag tcttggccac
601 ccaagcacaa catggaggca agcgggcatc accccaggga acagcgtagc ccaggggctg
661 gagcgcaatg gggcaacgcg gtgaaggtgc acggtgagga actattaccg tctggctctg
721 ttctcagaaa caaactcctc ctcagcgact ctcacacaga ccctccctca gagctgccac
781 ttttgaagcc gccgggctca ccccgggcgg ctcgggagcg gcgccccgcc cggccgcaca
841 caaacggcgc cggcagcctt cctgcgagca acgcccacgt ccccgcctca acctgcagcc
901 tatgagcggg taggcgcctc cccaagcggg cgccggaccg ggatcgcaca gccgaggacg
961 gcgccgcgct gccgtaccgc ggtctccccg ccgctgtccg cgcccgcggg gaaggaggca
1021 gagcggtggg tgtccgccgc cggggggacg cccggccgcc gcctccggcc ctccgcaagg
1081 tgtcgccgcg cgcagcggaa cgccgggtcc ggccgggagc tgccgtgcgg tgcggcgggg
1141 ccggaggtgt gcgcggcgag cggggaggcg atcaccgcag gacgagcgct agttctttat
1201 aaagcccaga gcgagggcga ccccgcgttc cagacacttc cattattcct ctcctttatt
1261 agtatttttt tcacctctcc ctcggtcttc gcagataggg ccaccgctct cacgcagcgc
1321 cgcgtccccc cggggagggg cggccccgca cccccgggcg tgccgggcgc ctcgcaccgc
1381 gttggagaat aaagagcgaa tggggctttt ggctcgcaag cggaggcacc tccctcgtct
1441 atggaaatca gggggtgggg ttacgccgcg ctcccccccc cccttttttt ttttttttta
1501 atgctccccg ggtgtgtcag ctatctttat ttttttattt tttttttcct cccctccctt
1561 ccccccaccc cgccactcct ccccctcgca acacccaccg ccggcagcgg ggatttaaga
1621 ggaggaggaa gcggcagggt ctgggctgcc agcagcggga ggacaaaacc cccatgaaat
1681 aaatacaatc ccacatccag ctgggaggat gggacgggag cggcagcctc cgccggccct
1741 caatgggggt ttcattgagc ggatatttgg gctgtgctgc ggagcggtcg caccgagctc
1801 gccgcactct gtaggtgcct ctttgctctt cgcattttta acggtttcgt gtatttcggg
1861 agcagggaag cgcacaatgc tggaaaggag cagggggcgg cgcgggggcg agcagctgag
1921 ccgcgtttcg gtgcatttca tcagagcgct ttgcacgact gtgtgtgtgt gtgtgtgttt
1981 tttttttttt tttcctcccc cttttgctct ccgtcagctg ttcgcaaccc agcacacgtt
2041 ttcccttcgc tgggtcggga ctaagagctc atttcggttt caatgcccga tgcagaggag
2101 catggggctg cgggtgagcg gccgcgggta gcgcgggggc accgaggtaa gggcgctgct
2161 cccgggggcg aaccgtgtgg ctcttgtttt aacctgcaaa gagggagaaa aacttcctgt
2221 agcgagtcag gctttgttgg ggtgtgttca tcgtttggtg tgcgtcggtg tttgctgctg
2281 ctggattccc ctgatggagt tgcacggctc ggggagcccc ttctttgcat ccttctctcc
2341 ccactacact ggaaatgaca aaaggagtaa acttcctttg cttgcggtcg cgtataggag
2401 ggtttctgca gggaggcgtg catctgcccg cggctgtgct gctctgagct ctggaggcgg
2461 tcgggcaggg cgcggagctc cgtgcaggac cggaggtgtg tgtgtgtgtg tgtgtgtgtg
2521 tgtgtgtgtg caccctggag cggctcacgg ccgctgcctg cggcgtccga gaaggtgcgg
2581 tgcgttcctc cttaacttgt aaaggctcct ggtttgtttg ggtttttttt tccctatcca
2641 taatggaatt gtgttttttt atttttcctt tttttttttt tttccccact acataatagg
2701 gaattacagg cgtctttttt tttcgtgttt taattgtgtt cgtttcttta tttccatttt
2761 ctgctttagt ttcaggtcct ggacacacct ctcctctcct tggtggaact accgtaaaac
2821 ccgcctctaa cacgagcccc cagcccggga atcgacacgg agagccccgc catgccggga
2881 atgatcagca agaacccgga cctcgagttc gattccttac agccttgttt ctacccggac
2941 gaagatgatt tctatttatg cgggccggac tcggcccccc ccggggaaga catctggaag
3001 aagttcgagc tgctgcccac ccctccgctg tcccccagcc gggccgggct gcaggagcca
3061 ccccccgggg ggggctccat tgccgtgggg aggggcggac ctggggagtg ccgccccgtt
3121 gatcccctgg actgggcgtc cgagttgctc ctcctgcccc ccgaggccga gctgtggggc
3181 agcacggacg gcgcggattt cttcgagacg ggcctcgggg cgagcaacaa cctcaactcc
3241 atcatcatcc aggactgcat gtggagtgcg ttctcggccc gcgagaagct ggagcgggcg
3301 gtcagcgaga agctgcagag caagcctccc gccgccgccc cgccgcctcc gcccccggtt
3361 gtacccaccg ccgcttgtcg ccgccgcgaa cagccccagc gcggtcccgg ccgcgcggag
3421 ctgggcggct ccgtgcccga gtgcgtggac ccggccgtgg tcttcccctt ccccgtcaac
3481 aagcgggagg cggcggtgcc aagcggcggc gagaccccgc ggggcgggcg ccggccgggt
3541 cccgcggggg agagccgggc cagcagcagc tccggggacg acacgctcag cgactcgggt
3601 caagcgcgcg gctgacgggt gggcggtgcg gggtcgggcc gccgcagcgc acactcaaca
3661 gccgcggggc ccgtgcgcgc cccacggggc tgggttctcc ggggctgtgg gccaacagca
3721 aggacggtgt gcccgggctc gggctgtgcc gtgcggctcc tcgggcttcc catgtcctca
3781 tgtccatcgg ggtgtgtgtg cgccattctg ggggccgttt tggggagcgt gtggtcagaa
3841 gcgaggggca gtgccatgct gaaccacgtt cggtggggga tgtttacggc gttgggaggc
3901 tggaacagag cggcagcccg cagagggtcc ttgtctgtac ggtgacagag cctaaaatgc
3961 cctcagatag ttcccatcgg agaaatccgg ttcggcagtg tagttctggc tgtgtaagtg
4021 agcggtccgt atgatcgtca ccataaactt ggcgatccgt ctcggctcgc ggagcctact
4081 gtccctttaa tgtctggttt gacagctttg ggtgaggaag cacttccaac agctgtcttc
4141 ttggcactgc accaagcgcc ggttaaaggg tccccggcgg gagcagcttc accttctgca
4201 ctagaacaaa cccagcgatt gtttcgtttc ccgtctgctt ttccaccaac ccagggctgt
4261 gttgtggctc tgtacgtgca gtttgtgccc caggtctgca gctgtagtaa aaatgaatgc
4321 attttgtatg aaaacttttc gccactggga ttagttgggt ggtagcccca ctttcattta
4381 aaatgagaag ttgttaaaga aaaggagttg gagacaatcc ctatgggtcc cttcaaactc
4441 ggggtattct atgaaaaaca aacctcataa agttccccca aacttgcaga gatctatttt
4501 gtttagtttt gtgtatgtta tatacgtctc tagaactata tttgattatg gacagtgagc
4561 acaccctatt ttttttacca tcagttattt tattggtatt tctaatgtgc atatcttgtc
4621 aataaatatt attgcttttg agagaatatc aatttctttt tccattttct ttaaccgaaa
4681 gatgatgaag acgaggagga agaggatgat gaagaagaaa tagatgttgt gacagtggag
4741 aaaagacgct cctcctccaa caaggctgtt accaccctca ctattacagt gcgtcctaaa
4801 aataccactt ttccatcggt caggacacag cagaatgaac tgattttaaa gcgttgtgcg
4861 ccaattcacc agcagcataa ttatgccgct ccttctccgt atatggagag tgaagatgtg
4921 ccaccacaga aaaagttaaa agccgaggtg ccccgtccag taaaacccat gatccaacca
4981 aagtctaaga gttcaagtcc tcgaaactct gattcggagg acagcgaacg tcgacgtaac
5041 cataatatct tggagcgtca gaggcgtaat gatctgcggt ccagtttcct cacgttaagg
5101 gaccacgttc cagaacttgt taaaaatgag aaagctgcaa aagttgtgat tttgaaaaaa
5161 gccactgaat atgtccattc ccttcaggca gaggagcaga agttattgct agaaaaggag
5221 aaattgcaag ccaggcaaca acagttgcta aagaaaatag agtacaagcg gacttgctaa
5281 acttcttttt ctcgtttttt ttcttttctt tttggttttt tttttttttt ttggctgacc
5341 aggacagtca ttgccacttt gcacattttt gattctttaa aaaaaaaaaa aaaattgtgt
5401 tttttgacgt taagaatgtt ggttttactt tcaattcagt ccctgaagta attgacaaac
5461 tttattatcc gggtacgagc aaatgggtgt tcttgcaaga agtttattgc gagactacca
5521 aacacaatgg actgcctttg tttttcattt taagaactgt agatggtgga tttttttaat
5581 tttttttaaa ttgttgtgag catttggagc tgctgatgac atctagttga gttgaaaaca
5641 tccattccta atttttatgg tgcttatgtt ctaacagatg ttactttagg gggttggcat
5701 ttgtacccct ctggaatttt ctgtaaatac catctacaca cttgcctttt gtacatgtct
5761 tgggttatga gaggtggctt ttgctgccag tattagactg gaagttcata cctaagtact
5821 gtaatacctc aatgtttgag gagcatgttt ttgtatacaa atatattgtt aatctctgtt
5881 atgtactgta ctaattctta cattgcctgt atactttagt acgtcgctga tacataacta
5941 aatttgatac ttatattttc gtatgaaaat gagttgtgaa agttttgagt agatattact
6001 ttatcacttt tttgaactaa gaaacttttg taaagaaatt tactatatat gccttttcct
6061 agcctgtttc ttccagttaa tgtatttgtt aatgtttggt gcatagaact gggtaactgc
6121 aaagttctgt gtttaatttc ttccaatggt gtacatttag tgctgcgtct tatagcactt
6181 tgaaatacct catgtttatg aaaataaata gcaattacat gatgtgccat ttactatttt
6241 ttcttttaaa tggagttcct tgaaataaag tttttaactt tgcaatgttt atgaaataaa
6301 aacctagtcc catatgtaag gatttgaact ctcaagtata ggtgatgttt acggacaagc
//
LOCUS BMEGDH1 2834 bp ds-DNA BCT 08-JUN-1990
DEFINITION B. megaterium glucose dehydrogenase gene and ORFs.
ACCESSION D90043
KEYWORDS glucose dehydrogenase.
SOURCE Bacillus megaterium (strain IAM1030) DNA.
ORGANISM Bacillus megaterium
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2834)
AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
Urabe,I. and Okada,H.
TITLE Active and silent isozyme genes of glucose dehydrogenase from
Bacillus megaterium IAM1030
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Toshihide
Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan.
FEATURES from to/span description
pept 1964 2749 glucose dehydrogenase (EC 1.1.1.47)
ORF 181 867 ORF1
ORF 1086 1946 ORF2
signal 89 94 put. -35 region for ORF1
signal 116 121 put. -10 region for ORF1
signal 167 172 ORF2 ribosome binding site
signal 893 928 termination signal
signal 972 977 put. -35 region for ORF2
signal 997 1003 put. -10 region for ORF2
signal 1070 1076 ORF2 ribosome binding site
signal 1949 1954 glucose dehydrogenase ribosome binding site
signal 2751 2783 termination signal for glucose dehydrogenase
BASE COUNT 896 a 442 c 644 g 852 t
ORIGIN
1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga
61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag
121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa
181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg
241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt
301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat
361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca
421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat
481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag
541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta
601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat
661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct
721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt
781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt
841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg
901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt
961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt
1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt
1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt
1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt
1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg
1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta
1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca
1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg
1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag
1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa
1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat
1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca
1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg
1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc
1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga
1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag
1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga
1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg
2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa
2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt
2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt
2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt
2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga
2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt
2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa
2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat
2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg
2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc
2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc
2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca
2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga
2821 ctggtgttga attc
//
LOCUS BMEGDH2 1202 bp ds-DNA BCT 08-JUN-1990
DEFINITION B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene.
ACCESSION D90044
KEYWORDS glucose dehydrogenase; isozyme.
SOURCE Bacillus megaterium (strain IAM1030) DNA.
ORGANISM Bacillus megaterium
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1202)
AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
Urabe,I. and Okada,H.
TITLE Active and silent isozyme genes of glucose dehydrogenase from
Bacillus megaterium IAM1030
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Toshihide
Mitamura Department of Fermentation Technology Osaka University 2-1
Yamada-oka Suita, Osaka 565 Japan Phone: 06-877-5111 x4373 Fax:
06-876-9036
FEATURES from to/span description
pept 125 910 glucose dehydrogenase
signal 27 32 put. -35 region
signal 49 55 put. -10 region
signal 111 116 SD sequence
signal 928 959 termination signal
BASE COUNT 427 a 190 c 268 g 317 t
ORIGIN
1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa
61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt
121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt
181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg
241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc
301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc
361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt
421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg
481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa
541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc
601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa
661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa
721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg
781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt
841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg
901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt
961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg
1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa
1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg
1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg
1201 ag
//
LOCUS BMOPTTHP1 1023 bp ss-mRNA INV 08-JUN-1990
DEFINITION B.mori PTTH mRNA.
ACCESSION D90082
KEYWORDS PTTH; preproPTTH.
SOURCE B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to
mRNA, clones P1, P2, C2, C9 and C19.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 1023)
AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone
JOURNAL Science 247, 1333-1335 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hironori
Ishizaki Department of Biology, School of Science, Nagoya
University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111
x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323
FEATURES from to/span description
pept 34 708 preproPTTH
matp 379 708 PTTH subunit
signal 768 773 polyadenylation signal
signal 826 831 polyadenylation signal
signal 997 1002 polyadenylation signal
signal 1004 1009 polyadenylation signal
BASE COUNT 358 a 179 c 186 g 300 t
ORIGIN
1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc
61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg
121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac
181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg
241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta
301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa
361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct
421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc
481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc
541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct
601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg
661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt
721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata
781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc
841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt
901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca
961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt
1021 aaa
//
LOCUS BMOPTTHP4 944 bp ss-mRNA INV 08-JUN-1990
DEFINITION B.mori preproPTTH mRNA.
ACCESSION D90083
KEYWORDS PTTH; preproPTTH.
SOURCE B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to
mRNA, clone P4 and C21.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 944)
AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone
JOURNAL Science 247, 1333-1335 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hironori
Ishizaki Department of Biology, School of Science, Nagoya
University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111
x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323
FEATURES from to/span description
pept < 1 631 preproPTTH
matp 302 631 PTTH subunit
signal 691 696 polyadenylation signal
signal 749 754 polyadenylation signal
signal 918 923 polyadenylation signal
signal 925 930 polyadenylation signal
BASE COUNT 337 a 163 c 177 g 267 t
ORIGIN
1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt
61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag
121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga
181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa
241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag
301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa
361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg
421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag
481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga
541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag
601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt
661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa
721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag
781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt
841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata
901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa
//
LOCUS HUMMTSDHB 958 bp ss-mRNA ORG 08-JUN-1990
DEFINITION Human mitochondrial succinate-ubiquinone oxidoreductase
(EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA.
ACCESSION D90047
KEYWORDS Ip; complex II; iron sulfur subunit; sdh B;
succinate-ubiquinone oxidoreductase.
SOURCE Human liver mitochondrion, cDNA to mRNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 958)
AUTHORS Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M.
TITLE Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning
of iron sulfur(Ip) subunit of liver mitochondria
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Kiyoshi
Kita Department of Parasitology Juntendo University 2-1-1,Hongo
Bunkyo-ku, Tokyo 113 Japan Phone: 03-813-3111 x3542 Fax:
03-814-9300
FEATURES from to/span description
pept < 1 789 succinate-ubiquinone oxidoreductase (sdh B)
(AA at 1)
site 74 95 iron-sulfur binding site I
site 167 179 iron-sulfur binding site II
site 224 236 iron-sulfur binding site III
BASE COUNT 319 a 212 c 204 g 223 t
ORIGIN
1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt
61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg
121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag
181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt
241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac
301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat
361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag
421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag
481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc
541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg
601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc
661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat
721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct
781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt
841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata
901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS HUMNCAW 2287 bp ss-mRNA PRI 08-JUN-1990
DEFINITION Human nonspecific cross-reacting antigen (NCA-W272) mRNA.
ACCESSION D90064
KEYWORDS CEA; CEA gene family; PI-anchored membrane protein.
SOURCE Human white blood cells, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2287)
AUTHORS Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and
Matsuoka,Y.
TITLE Characterization of a cDNA clone encoding a new species of the
nonspecific cross-reacting antigen (NCA), a member of the CEA gene
family
JOURNAL Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Fumiko
Arakawa First Department of Biochemistry School of Medicine Fukuoka
University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone:
092-801-1011 x2892 Fax: 092-801-3600
FEATURES from to/span description
ORF 87 1136 nonspecific cross-reacting antigen ORF
BASE COUNT 618 a 593 c 453 g 623 t
ORIGIN
1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag
61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc
121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca
181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc
241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa
301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag
361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg
421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg
481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca
541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc
601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc
661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac
721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga
781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag
841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt
901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa
961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca
1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag
1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc
1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa
1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact
1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc
1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa
1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt
1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc
1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga
1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga
1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc
1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt
1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt
1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa
1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca
1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat
1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga
2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt
2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac
2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc
2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg
2281 gctgatc
//
LOCUS PIGDESTN 1681 bp ss-mRNA MAM 08-JUN-1990
DEFINITION Porcine destrin mRNA.
ACCESSION D90053 J05290
KEYWORDS actin-binding protein; cofilin; destrin.
SOURCE Pig adult brain, cDNA to mRNA, clone PD2.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1681)
AUTHORS Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S.,
Iida,K. and Yahara,I.
TITLE Destrin, a mammalian Actin-Depolymerizing Protein, Is Closely
Related to Cofilin --Cloning and Expression of Porcine Brain
Destrin cDNA
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Kenji
Moriyama Department of Biophysics and Biochemistry Faculty of
Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan
Phone: 03-821-2111 x4408
FEATURES from to/span description
pept 54 551 destrin
signal 1645 1660 polyadenylation signal
BASE COUNT 507 a 322 c 372 g 480 t
ORIGIN
1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt
61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt
121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag
181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa
241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg
301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc
361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg
421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc
481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat
541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc
601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg
661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct
721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga
781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt
841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt
901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt
961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac
1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca
1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca
1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt
1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta
1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag
1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc
1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg
1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga
1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt
1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta
1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taacttaaaa aaaaaaaaaa
1681 a
//
LOCUS RATPMP70X 3324 bp ss-mRNA ROD 08-JUN-1990
DEFINITION Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA.
ACCESSION D90038 J05256
KEYWORDS PMP70; peroxisomal membrane protein.
SOURCE Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181,
189,201].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3324)
AUTHORS Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T.
TITLE The 70-kDa Peroxisomal Membrane Protein Is a Member of the
Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Keiju
Kamijo Department of Biochemistry Shinshu University School of
Medicine Matsumoto 390 Japan Phone: 263-35-4600 x5182 Fax:
263-33-6458
FEATURES from to/span description
pept 36 2015 peroxisomal membrane protein (PMP70)
signal 3271 3276 polyadenylation signal (put.)
BASE COUNT 921 a 658 c 773 g 972 t
ORIGIN
1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga
61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc
121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg
181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac
241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta
301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac
361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact
421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg
481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag
541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc
601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta
661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg
721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa
781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta
841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa
901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct
961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg
1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca
1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc
1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg
1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta
1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg
1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa
1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg
1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat
1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc
1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa
1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg
1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg
1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac
1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca
1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc
1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa
1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt
2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta
2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca
2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat
2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg
2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga
2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct
2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt
2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat
2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt
2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt
2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg
2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac
2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg
2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg
2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca
2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc
3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg
3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa
3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt
3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat
3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag
3301 attaaaaaaa aaaaaaagga attc
//
LOCUS VACSANT 1525 bp ds-DNA VRL 08-JUN-1990
DEFINITION Vaccinia virus surface (S) antigen gene.
ACCESSION D90076
KEYWORDS S gene; surface antigen.
SOURCE Vaccinia virus DNA.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 1525)
AUTHORS Ueda,Y., Morikawa,S. and Matsuura,Y.
TITLE Identification and nucleotide sequence of the gene encoding a
surface antigen induced by vaccinia virus
JOURNAL Virology (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Yoshiaki
Ueda National Institute of Health Gakuen, Musashimurayama Tokyo
190-12 Japan Phone: 0425-61-0771 Fax: 0425-65-3315
FEATURES from to/span description
pept 382 1437 surface antigen S
BASE COUNT 568 a 229 c 258 g 470 t
ORIGIN
1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata
61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat
121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa
181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat
241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac
301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata
361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca
421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc
481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt
541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt
601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa
661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta
721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca
781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc
841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga
901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat
961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta
1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat
1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa
1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact
1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga
1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc
1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt
1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata
1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt
1501 gctatagaat gaatgaggat acgcg
//
LOCUS YSCA1 881 bp ds-DNA PLN 08-JUN-1990
DEFINITION S. cerevisiae acidic ribosomal protein A1 (YSCA1).
ACCESSION D90072 X13682
KEYWORDS acidic ribosomal protein; ribosomal protein.
SOURCE S. cerevisiae (strain IFO-40028) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 881)
AUTHORS Mitsui,K. and Tsurugi,K.
TITLE Identification of A1 protein as the fourth member of 13 kDa-type
acidic ribosomal protein family in yeast Saccharomyces cerevisiae
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 315; 631 to 881)
AUTHORS Mitsui,K. and Tsurugi,K.
TITLE Identification of A1 protein as the fourth member of 13 kDa-type
acidic ribosomal protein family in yeast Saccharomyces cerevisiae
JOURNAL Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989)
STANDARD full staff_entry
REFERENCE 3 (bases 277 to 742)
AUTHORS Tsurugi,K. and Mitsui,K.
TITLE cDNA and deduced amino acid sequence of acidic ribosomal protein A1
from Saccharomyces cerevisiae
JOURNAL Nucleic Acids Res. 16, 3574-3574 (1988)
STANDARD simple automatic
COMMENT These data kindly submitted in computer readable form by: Kazuhiro
Mitsui Department of Biochemistry Yamanashi Medical college Tamaho,
Nakakoma-gun Yamanashi 409-38 Japan Phone: 0552-73-1111 x2257
FEATURES from to/span description
pept 313 633 acidic ribosomal protein A1
signal 125 135 UASrpg box1
signal 182 192 UASrpg box2
signal 716 721 poly(A) signal
variant 303 303 a in [1]; g in [3]
variant 684 685 tt in [1]; t in [3]
variant 719 719 a in [1]; t in [3]
BASE COUNT 270 a 165 c 157 g 289 t
ORIGIN
1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt
61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca
121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa
181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca
241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa
301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac
361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca
421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac
481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct
541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc
601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg
661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa
721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg
781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc
841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c
//
LOCUS YSCCDC23X 3107 bp ds-DNA PLN 08-JUN-1990
DEFINITION S. cerevisiae CDC23 protein gene.
ACCESSION D90081
KEYWORDS CDC23 protein.
SOURCE Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line
D22) DNA, clone YX34.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3107)
AUTHORS Doi,A. and Doi,K.
TITLE Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces
cerevisiae
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Kenji Doi
The Institute of Scientific and Industrial Research Osaka
University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111
Fax: 06-877-4977
FEATURES from to/span description
ORF 765 2645 ORF for CDC23
site 153 161 calcium-binding site
BASE COUNT 1010 a 595 c 629 g 873 t
ORIGIN
1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact
61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt
121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc
181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt
241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg
301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct
361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt
421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga
481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag
541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat
601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact
661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc
721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc
781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac
841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag
901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag
961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt
1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt
1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc
1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta
1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc
1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga
1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag
1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa
1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact
1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat
1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta
1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg
1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc
1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg
1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt
1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg
1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact
1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt
2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca
2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca
2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt
2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg
2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca
2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat
2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg
2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa
2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta
2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata
2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat
2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc
2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact
2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc
2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc
2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga
3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga
3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag
//
LOCUS RATLIPAA 3225 bp ss-mRNA ROD 08-JUN-1990
DEFINITION Rat mRNA for hormone sensitive lipase.
ACCESSION X51415 J03087
KEYWORDS hormone sensitive lipase; lipase.
SOURCE Rat adipose tissue cDNA to mRNA.
ORGANISM Rattus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3225)
AUTHORS Holm,C., Kirchgessner,T.G., Svenson,K.L., Lusis,A.J.,
Belfrage,P. and Schotz,M.C.
TITLE Nucleotide sequence of rat adipose hormone sensitive lipase cDNA
JOURNAL Nucleic Acids Res. 16, 9879-9879 (1988)
STANDARD full staff_entry
REFERENCE 2 (sites; for [1])
AUTHORS Holm,C., Kirchgessner,T.G., Svenson,K.L., Fredrikson,G.,
Nilsson,S., Miller,C.G., Shively,J.E., Heinzmann,C., Sparkes,R.S.,
Mohandas,T., Lusis,A.J., Belfrage,P. and Schotz,M.C.
TITLE Hormone-sensitive libase: Sequence, expression, and chromosomal
localization to 19 cent-q13.3
JOURNAL Science 241, 1503-1506 (1988)
STANDARD full staff_entry
FEATURES from to/span description
pept 616 2889 hormone sensitive lipase
signal 3209 3214 polyA signal (put.)
signal 3225 3225 polyA signal
BASE COUNT 701 a 952 c 916 g 656 t
ORIGIN
1 gaagaaactc gtaatggggc ttgagtgtgt gggtaaggaa gggaactgct gagcgtccag
61 atttcccaga acaagaagga caagtccaaa gacaataaac aaagatagga gttcatcgtt
121 gaatacctgg aaggaagaag gaagagggtg ggcagcttct gggatagaga agcagattct
181 ttggagttca ggagattgga ctcctagatc caggaaagag ggcaaaagcc atggacagtg
241 ggtttctgat tctgaagagg agagataggc aggggctgct tagcctgggg cttcgactcc
301 cagccgatag gcagaaagag gatgaggccc cagacttcta gaagacaagg agataaattc
361 cgaggtgtga gagagaagat aatggaaagc tcctgcatct ccaggaggct tggaacagac
421 cggtgaacct gaggaaggag agggctacgg ctttggactc ctgaatctcc tgtatcttgg
481 cagggatggt cctcggtttg aatactgggt tgggcctagg gagagagagg taggagctgt
541 acccctgggc ctgacagagc aaagagtaat cccagacctt cccatcttcc cgcagcctcg
601 gcattctcac acagcatgga tttacgcaca atgacacagt cgctggtggc gctcgcagaa
661 gacaacatgg ccttcttctc aagccagggc ccaggagaga cagcacggcg gctgtccaac
721 gtctttgcag gtgttcggga acaggcactg gggctagaac caactctagg tcagctcttg
781 ggtgtggcac accatttcga cctggacact gagacaccag ccaacggata ccgtagcttg
841 gtgcacacag ctcgttgctg cctggcacac ctactacaca aatcccgcta tgtggcctct
901 aaccgcagaa gtatcttctt ccgtgccagc cacaacctag cagaactgga ggcctacctg
961 gctgccctca cccagctccg tgctctagcc tactacgccc agcgcctgct gaccatcaac
1021 cgaccaggag tgctcttctt tgagggcgat gagggactca gcgctgactt cctacaggac
1081 tatgtcacgc tacataaagg ctgcttttac ggtcgctgcc tgggcttcca gttcacacct
1141 gccatccggc cgttcctgca gactctctcc atcgggctgg tgtccttcgg ggaacactac
1201 aaacgcaacg agacgggcct cagtgtgact gcaagttccc tctttacggg tggccgattc
1261 gccatagacc cagagttgcg tggggctgaa tttgagcgca tcatacagaa cctggacgtg
1321 cacttctgga aagccttctg gaatatcacg gagatcgagg tgctatcgtc tctggccaac
1381 atggcatcaa ccaccgtgag ggtaagccgc ctgcttagct tgccgcctga ggcctttgag
1441 atgccactaa cctctgaccc caagctcaca gttaccatct cacctccctt ggcacacacg
1501 ggaccaggcc ctgtgctagc caggctcatc tcctatgacc tgcgggaagg gcaggacagc
1561 aagatgctca acagcctggc aaaatctgag ggcccacgcc tggagctgcg cccacggcct
1621 caacaagccc cccgctcacg agccctggtt gtccacattc acggtggtgg ctttgtggca
1681 cagacctcca aatcccatga gccctacctc aagaactggg cccaggagct aggcgtcccc
1741 atcatctcca tcgactactc cctggcccct gaggccccct tcccccgagc actggaggag
1801 tgtttttttg cctactgctg ggctgtcaag cactgtgaac tgcttggttc aaccggagag
1861 cggatatgcc ttgcggggga cagcgcaggt gggaacctct gcatcaccgt gtcccttcgg
1921 gcagcagcct atggggtgag ggtgccagat ggcatcatgg cagcctaccc agttaccacc
1981 ctgcagtcct ctgcttctcc ctctcgtctg ctgagcctca tggatcctct tctaccactg
2041 agcgtactct ccaagtgtgt gagcgcctat tcagggacag agacggagga ccattttgac
2101 tcagaccaga aggcattggg cgtgatgggg ctggtgcaga gagacacgtc cctgttcctc
2161 agagacctcc gcctgggcgc ctcctcatgg ctcaactcct tcctggagtt aagtgggcgc
2221 aagccccata agaccccagt tgcctgcaac agagacactg cgccccacgg attctgggcg
2281 cttacggagt ctatgcgcag gagtgtgtct gaggcagccc tggcccagcc tgagggcttg
2341 ctgggcacag attccttgaa gaagctaaca ataaaggact tgagctttaa gggcaactca
2401 gagccatcag acagccccga gatgtcacag tcaatggaga cacttggccc ctccacaccc
2461 tcggatgtca acttttttct gcgatccggg aattcccagg aagaggctga aaccagagat
2521 gatataagcc ccatggacgg aatcccccgc gtgcgcgctg ccttccctga tggtttccac
2581 ccacggcgct caagccaagg tgtcctccac atgcccctct actcgtcacc catagtcaag
2641 aaccccttca tgtctcctct gctggcccct gacgtcatgc tgaagaccct gccgcccgtg
2701 caccttgtgg cgtgcgctct ggaccccatg ctggatgact cggtcatgtt cgcgcggcga
2761 ctgaaggacc tgggccagcc cgtgacgctg aaagtggtag aggacctgcc gcatggtttc
2821 ctgagcttag cagccctgtg tcgtgagacc ggcaggccgc ggagttgtgc gtgcagcgca
2881 tccggctgat cctcaccccc cctgctgcac cgctgacctg agctgggaac ggcggggggc
2941 ggcactaaaa gacctcttgc acccttctgc gcgggcttcc gtgatgagtg cgctccgagt
3001 cgggctctag gccccctcgg tcgggctggg cgggaggggg cgggccgtgc cttaaccata
3061 agtggggcgg ggcaggggcc aaaagctgaa cctgggggag ggggacgcac acacacacac
3121 ctgtcaccga gacagctgga cctgcactct accactgcct tctactgctg ctgtgaccga
3181 ccaccgctag tcggttttgc tttttgtaaa taaaagttat ttaat
//GenBank-Updates@genbank.bio.net (06/27/90)
LOCUS KAEASTAB 3324 bp ds-DNA BCT 27-JUN-1990
DEFINITION K.aerogenes arylsulfatase (atsA) and atsB genes, complete cds.
ACCESSION M31938
KEYWORDS arylsulfatase.
SOURCE K.aerogenes (strain W70) DNA, clone pAS123.
ORGANISM Klebsiella aerogenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 3324)
AUTHORS Murooka,Y., Ishibashi,K., Yasumoto,M., Sasaki,M., Sugino,H.,
Azakami,H. and Yamashita,M.
TITLE A sulfur- and tyramine-regulated Klebsiella aerogenes operon
containing the arylsulfatase (atsA) gene and the atsB gene
JOURNAL J. Bacteriol. 172, 2131-2140 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
Y.Murooka, 02-FEB-1990, for release after publication.
FEATURES from to/span description
pept 452 1669 atsB protein
pept 1685 3079 arylsulfatase (atsA) (EC 3.1.6.1)
mRNA 420 3172 atsB and atsA mRNA (alt.)
mRNA 421 3172 atsB and atsA mRNA (alt.)
signal 383 388 atsB -35 region
signal 408 413 atsB -10 region
binding 441 444 atsB ribosome binding site
binding 1672 1675 atsA ribosome binding site
rpt 3118 3142 inverted terminal repeat, copy A
rpt 3148 3172 inverted terminal repeat, copy B
BASE COUNT 758 a 994 c 928 g 644 t
ORIGIN
1 cggctggctt ttaacaaagc agacatgata gggcctcccc gtcgagatga tggtacatga
61 cacatggatg actactgata attgaagcat acatttcaca tttaagtaac acaatcatta
121 acatattaac aatgtgatag ccgttacaaa attttcagca atacattgtt atataaagca
181 tttaaaaaac accccagcca tgctttacga tagcggcgtt aaaacataaa caaataagat
241 taacacccgg cgtacctgcc cctggtgata aaaaaaccct attcccgacg cgtgattctc
301 atcctaaaga acgatggatt tttgtttttt ttcacttata cccttaacag gatgcattta
361 attttcgccg cggcttaatc atgtgattac gcctgacccg ctgttgccaa aataatatca
421 acagtaccgg tcattaaccg aggataagcc gatgctgaat atcgccgccc tgcgccagca
481 gcaaattccg ctggccgctg agccgcgctc gccggtgccg tttcatattc tgatgaagcc
541 gattggcccc gcctgcaatc tcgcctgccg ctattgctat tacccgcagg acgaaacgcc
601 ggtcaacaag atggatgacg cgcggctgga gcagtttatc cgccgctata ttgccgccca
661 gcccgccggc gcgcgggaaa tcaactttgt ctggcagggg ggcgagccgc tgctggccgg
721 cctgagcttc tacaaaaaag cgctcgccct gcaggcacgc tatgcccccg acggcgtgac
781 tatcagcaac agcctgcaga ccaacgggac gctgatcaac gacgcatggt gccgactgtt
841 ccgcgaacat ggctttatta tcgggttgag cctcgaaggc aacgaagcgc tgcaggacta
901 ccatcgtccg gataaacgcg gccggtcgac ctggtcggcg gcgctgcgcg gcattgacct
961 gctccatcag catcaggtgg actttaatct gctggtggtg gtgcataacg agatggcggc
1021 ccacgcggcg gcgatttatg accggctggt cagcctcggc gcgcgctatc tgcagtttca
1081 gccgctgatg agcgaaggcg cggccctgcg cgaaggatac cagctcagcg ccgataactg
1141 gggacgtttt atggtcggca tctggcgaca gtggcggaag cgctgcgata gagggcgggt
1201 gttcgttatc aatatcgaac aggcgtgggc gcagtatttc actcatacca gcggcagctg
1261 cgtgcacagc gcccgctgcg gcagcaacct ggtgatggag cccgacggac agctctacgc
1321 ctgcgatcat ctgatcaacg ccgaacatcg gctgggtcgc cttgacgagc agacgcttgc
1381 cgccgcagtt gacgcctcgg tgcaactgcc tttcggtcag cagaaaagtc tgcgccgcga
1441 atgccagact tgctcggtaa aaatggtctg ccagggcggc tgcccggcgc atctcaacgc
1501 cgcaggcaac aaccgcctct gcggaggcta ctaccgcttc tttagcgaca tcctcgcgcc
1561 attacgcccc ttttcccgcg accttaatgg cctgaaggcc tggcgggccg cgtttgttgg
1621 gactgccgca tactgcgtag caccttaccc tgatgatatc cccctctgaa caggagagtc
1681 agtcgtgaac aaaaaagcca tggccgcggc ggtcagcatg atcctcgccg gtggcgcgca
1741 cgccgcgcag caggagcgtc caaacgtcat cgtgattatc gccgatgata tgggctactc
1801 ggacatcagc ccctttggcg gcgagatccc cacccccaac ctgcaggcga tggccgagca
1861 gggaatgcgc atgagccagt attacacctc gccgatgtcg gccccggcgc gctcaatgct
1921 gctcaccggc aacagtaacc agcaggctgg gatgggcggc atgtggtggt acgacagcac
1981 cattggcaag gagggctacg agctgcggtt gaccgaccgc gtcaccacca tggccgagcg
2041 ctttaaagac gcggggtata acaccctgat ggccggtaaa tggcacctcg gttttgttcc
2101 cggcgccacg ccgaaagatc gcggctttaa tcacgccttc gcctttatgg gcggcggcac
2161 cagccacttt aacgacgcga taccgctggg gaccgttgaa gccttccaca cctactacac
2221 ccgcgacggc gagcgcgtct ccctaccgga tgatttttac tccagcgaag cctacgcccg
2281 ccagatgaac agctggatta aagcgacgcc gaaggaacag ccggtcttcg cctggctggc
2341 cttcaccgcc cctcatgatc ccctgcaggc gcctgacgag tggattaagc gcttcaaagg
2401 ccagtatgag cagggttacg ccgaggtcta tcgccagcgc atcgcccgcc tgaaagcgct
2461 gggaattatc catgacgata ccccgctgcc gcatctggag ctggataaag agtgggaagc
2521 gctaacgcca gagcagcaga aatatacggc gaaagtgatg caggtgtatg cggcgatgat
2581 cgccaatatg gacgcgcaaa tcggcaccct gatggagacg ctgaaacaaa ccgggcgcga
2641 taaaaatacc ctgctggtct ttttaaccga taacggcgcc aacccggcgc agggtttcta
2701 ctacgaatct accccggaat tctggaagca gttcgataac agctatgaca acgtcggccg
2761 caaaggatca tttgtctcct acggcccgca ctgggccaac gtcagcaacg ccccctacgc
2821 caattatcac aaaaccacca gcgcccaggg cggcatcaat accgacttta tgatctccgg
2881 tcccgggatc acccgccacg gtaaaatcga cgcctcgacg atggcggtgt atgacgtggc
2941 gccgacgcta tatgaattcg ccggcatcga tccgaacaag tcgctggcga aaaagccggt
3001 gttgccgatg atcggcgtca gtttaagcgc tatctcaccg gcgaagtaca ggagccgccg
3061 cgccgaacta cggggttgaa ctgcatcatc aggcggcctg ggtcgatggc gaatggaagc
3121 tgcgacggct ggtgccgcgc ggcctcaccg ccggcgacgc gccgtggcag ctatttaatc
3181 tgcacgacga cccgctggag acgcatgatg tcgcggccga acatccgatc gggtcaaagc
3241 catgagcgag gcctacgagg catttgctaa gcgcaccatg gttaccaaag cgcagggcaa
3301 aatgaatgct acgtcggtat cgat
//
LOCUS RATGAH 1003 bp ss-mRNA ROD 27-JUN-1990
DEFINITION Rat L-glutamine amidohydrolase mRNA, 3' end.
ACCESSION J05499
KEYWORDS L-glutamine amidohydrolase.
SOURCE Rat (strain Sprague-Dawley) liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1003)
AUTHORS Smith,E.M. and Watford,M.
TITLE Molecular cloning of a cDNA for rat hepatic glutaminase: Sequence
similarity to kidney-type glutaminase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Waterford, 08-MAY-1990.
FEATURES from to/span description
pept < 1 660 L-glutamine amidohydrolase (EC 3.5.1.2)
BASE COUNT 270 a 243 c 258 g 232 t
ORIGIN
1 ttccggatgt gtctgtcacc cccgttagac aagctgggga acagccacag gggcatcagc
61 ttctgccaga agttggtgtc tctgtttaac ttccacaact acgacaacct gcggcactgt
121 gctcggaagt tagacccacg gagggaaggg ggggaagttc ggaacaagac cgtggtgaac
181 ctgttatttg ctgcatatag tggagatgtc tcagctcttc gaaggtttgc cttgtctgcc
241 gtggatatgg agcagaagga ctatgattcc cgcacagccc tacatgtggc ggcagcggaa
301 ggacacattg acgttgtcaa gtttctgatc gaggcttgca aagtgaatcc ttttgtcaag
361 gacaggtggg gcaacattcc cctggatgat gccctgcagt tcaatcacct ggaggtggtc
421 aaactgcttc aggattacca tgactcctac atgctgtctg agactcaagc tgaggtacag
481 ctgagactct gtcaaaagag aactgagaga gcatgtgtga gcacaggcca gggcagcccg
541 tgctcaagaa aaagcatgag cgggccacaa tttaacccaa ggccaccaaa aatactattg
601 caagctgctt cagtgggatc aacacagcca tctggtgaca caggccagtg ttttctgtga
661 gaatcaaaat gccccattcc ctcatcggac agcacagaga aaagcttcag tggacacctg
721 agcagagcta gccacggaga cctcaaggta tagcttaagt gacatcctcc accagaaagt
781 agcccaggct tttacccagg tccccatttc aacttccttg gagagcgtct agctacatgc
841 atatgtatct gtcacagagc aagagaggtg ggtgagagcc caatcacctg gctttagaaa
901 tctgcagaga tctgtccatc ttagccaaga catgctgcta ctgctgacag gagttttata
961 gacaaagtat tttgtgttca aataaacttt aattaccgga att
//
LOCUS PSEPROC 1167 bp ds-DNA BCT 27-JUN-1990
DEFINITION P.aeruginosa delta-1-pyrroline-5-carboxylate reductase gene,
complete cds.
ACCESSION M33557
KEYWORDS delta-1-pyrroline-5-carboxylate reductase.
SOURCE P.aeruginosa (strain PAO1) DNA.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1167)
AUTHORS Savoiz,A., Jeenes,D.J., Kocher,H.P. and Haas,D.
TITLE Comparison of proC and other housekeeping genes of Pseudomonas
aeruginosa with their counterparts in Escherichia coli
JOURNAL Gene 86, 107-111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Savioz, 04-APR-1990.
FEATURES from to/span description
pept 102 923 delta-1-pyrroline-5-carboxylate reductase (EC
1.5.1.2)
binding 90 94 ribosome binding site
binding 921 927 ribosome binding site
BASE COUNT 182 a 431 c 380 g 174 t
ORIGIN Chromosome PAO1.
1 ctcgaggcag ccatcggcga aggtgcgacc tgggtccgca tcggtaccgc cctgttcggc
61 gcccgcgact acggcgcgcc ggcttcttga aggaatcccc tatgagcaca ccccgcatag
121 cgttcatcgg cgccggcaac atggccgcca gcctgatcgg cggcctgcgc gcccaaggcg
181 tgccggcggc gcagatccgc gccagcgacc cgggcgccga gcaacgcgcg aagatcgccg
241 gcgagttcgc catcgacgtg gtcgagtcca acgccgaggc cgtggccgac gccgacgtcg
301 tggtcctgtc ggtcaagccg caggccatga aagccgtgtg ccaggccttg gcgccggcgc
361 tgaagccgga gcaactgatc gtctccatcg ccgccggcat cccctgcgcc agcctcgaag
421 cctggctcgg ccagccgcgc ccggtggtcc gctgcatgcc caacaccccg gcgctgctgc
481 gccagggcgc cagcgggctg tacgccaacg cgcaggtcag cgccgcgcaa tgcgagcagg
541 ccgggcagtt gctgtccgcg gtgggcatcg ccctgtggct ggacgacgaa gcgcagatcg
601 acgcggtgac cgcagtgtcg ggcagcggcc cggcgtattt cttcctgctg atgcaggcca
661 tgaccgacgc cggcgagaag ctcggcctgt cccgcgagac agcctcgcgc ctgaccctgc
721 agaccgccct cggtgcggcg cagatggcgc tgtccagcga ggtcgagccg gccgaactgc
781 ggcgccgggt gacttcgccc aacggcacca ccgaggcggc gatcaaatcc ttccaggcca
841 acggcttcga agcactggtg gagcaggccc tgaacgccgc cagccagcgc tccgccgagc
901 tggccgaaca gcttggccaa taaggagtag tcgatgatcg gcctcaacac cgctgctatc
961 tatatcctcc agaccctcgg cagcctctac ctgctgatcg tgctgctgcg cttcatcctg
1021 cagttggtcc gcgcggactt ctacaacccc ctcagccagt tcatcgtgcg cgccaccaag
1081 ccgctgctga acccgctgcg gcggatcatc cccggcttcg gcgggatcga cctggcctcg
1141 ctggtgctgg cgatcctgat ccagctg
//
LOCUS ECOPEPD 2311 bp ds-DNA BCT 27-JUN-1990
DEFINITION E.coli, peptidase D (pepD) gene, complete cds.
ACCESSION M34034
KEYWORDS peptidase D.
SOURCE E.coli (strain K12; substrain CM17) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2311)
AUTHORS Henrich,B., Monnerjahn,U. and Plapp,R.
TITLE Peptidase D gene (pepD) of Escherichia coli K12: Nucleotide
sequence, transcript mapping, and comparison with other peptidase
genes
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Henrich, 04-MAY-1990.
Author address: B.Henrich
Universitaet Kaiserlautern, Fac Biologie,
Postach 3049, D-6750 Kaiserslautern, FRG
FEATURES from to/span description
pept 187 1644 peptidase D
mRNA 94 1693 peptidase D mRNA
mRNA 127 1693 peptidase D mRNA (alt.)
binding 173 179 ribosome binding site
signal 83 88 TATA box
signal 113 118 TATA box
BASE COUNT 558 a 608 c 590 g 555 t
ORIGIN
1 gatctggcgc actaaaaacc agtatttcac atgagtccgc gtctttttac gcactgcctc
61 tccctgacgc gggataaagt ggtattctca aacatatctc gcaagcctgt cttgtgttga
121 caacattttc tgctaaccct gtgacctgca atactgtttt gcgggtgatc gacaaggaga
181 cttaacgtgt ctgaactgtc tcaattatct ccacagccgc tgtgggatat ttttgccaaa
241 atctgttcta ttcctcaccc gtcctatcat gaagagcaac tcgctgaata cattgttggt
301 tgggcaaaag agaaaggttt ccatgtcgaa cgcgatcagg taggtaatat cctgattcgt
361 aaacctgcta ccgcaggtat ggaaaatcgt aaaccggtcg tcttacaggc ccacctcgat
421 atggtgccgc agaaaaataa cgacaccgtg catgacttca cgaaagatcc tatccagcct
481 tatattgatg gcgaatgggt taaagcgcgc ggcaccacgc tgggtgcgga taacggcatt
541 ggtatggcct ctgcgctggc ggttctggct gacgaaaacg tggttcacgg cccgctggaa
601 gtgctgctga ccatgaccga agaagccggt atggacggtg cgttcggctt acagggcaac
661 tggttgcagg ctgatattct gattaacacc gactccgaag aagaaggtga aatctacatg
721 ggttgtgcgg ggggtatcga cttcacctcc aacctgcatt tagatcgtga agcggttcca
781 gctggttttg aaaccttcaa gttaacctta aaaggtctga aaggcggtca ctccggcggg
841 gaaatccacg ttgggctggg taatgccaac aaactgctgg tgcgcttcct ggcgggtcat
901 gcggaagaac tggatctgcg ccttatcgat ttcaacggcg gcacactgcg taacgccatc
961 ccgcgtgaag cctttgcgac cattgctgtc gcagctgata aagtcgacgt cctgaaatct
1021 ctggtgaata cctatcagga gatcctgaaa aacgagctgg cagaaaaaga gaaaaatctg
1081 gccttgttgc tggactctgt agcgaacgat aaagctgccc tgattgcgaa atctcgcgat
1141 acctttattc gtctgctgaa cgccaccccg aacggtgtga ttcgtaactc cgatgtagcc
1201 aaaggtgtgg ttgaaacctc cctgaacgtc ggtgtggtga ccatgactga caataacgta
1261 gaaattcact gcctgatccg ttcactgatc gacagcggta aagactacgt ggtgagcatg
1321 ctggattcgc tgggtaaact ggctggcgcg aaaaccgaag cgaaaggcgc atatcctggc
1381 tggcagccgg acgctaattc tccggtgatg catctggtac gtgaaaccta tcagcgcctg
1441 ttcaacaaga cgccgaacat ccagattatc cacgcgggcc tggaatgtgg tctgttcaaa
1501 aaaccgtatc cggaaatgga catggtttct atcgggccaa ctatcaccgg tccacactct
1561 ccggatgagc aagttcacat cgaaagcgta ggtcattact ggacactgct gactgaactg
1621 ctgaaagaaa ttccggcgaa gtaattattt gatttgctgc cggatggcgt ttaatcgcct
1681 tccggcagtt tcatccttca ttatccttcg ataaaagcca tccctgtaaa tgtccgtcgc
1741 gggttgccac gttcaatctg gtgatggaac attcgccgct gcgatttcag cgccgcgcta
1801 ttttcctgtt gctgttgctc cagcttccag gcaatcagca atcgtgccag ccgcttgtta
1861 gcatgctgac tacgctctga ctgaaccttc acgctaatac cggatgccaa atgcgtggcg
1921 cgtaccgccg agtcggtttt attgacatgt tgaccgcccg gccccgacga acgcagcgtc
1981 tcataacgga ttgcatccga ttgttcctgc tcatcagcgg taaaacgccc aatgcccaga
2041 aaccagtttt tgcgcccatg atgaggccga tacggactcg gacaaatcca ctgaatagtg
2101 ccacaccact ttcgcttaat gcccatgcgt tatcgccatc gagagaaacc agcgccgaac
2161 gcagtgtgtc agagtagcgg cccgtttctg tttccagcac cgttaccgcg acgtcttgcc
2221 gggtaaaaac gaccaacaca gaagaagctg cgccaatcgg gtagccgcgt ccggggtgta
2281 aatctggcat ccccactacg cgctgcatgt t
//
LOCUS HUMTGFB1B 5089 bp ss-mRNA PRI 27-JUN-1990
DEFINITION Human transforming growth factor beta-1-binding protein mRNA,
complete cds.
ACCESSION M34057
KEYWORDS transforming growth factor beta-1-binding protein.
SOURCE Human fibroblast, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 5089)
AUTHORS Kanzaki,T., Olofsson,A., Moren,A., Wernstedt,C., Hellman,U.,
Miyazono,K., Claesson-Welsh,L. and Heldin,C.-H.
TITLE TGF-beta-1 binding protein: A component of the large latent complex
JOURNAL Cell (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.-H.Heldin, 08-MAY-1990.
FEATURES from to/span description
pept 91 4275 transforming growth factor beta-1-binding
protein precursor
sigp 91 151 transforming growth factor beta-1-binding
protein signal peptide
matp 152 4272 transforming growth factor beta-1-binding
protein
BASE COUNT 1355 a 1127 c 1255 g 1352 t
ORIGIN
1 tgaattcggc acgaggtggg ctctctcttt tcccctcttg ctcctttctt ttcttttttt
61 ctgttttttt aaaccttcca aggcaagttc atggatacta agctgatgtg tttgttgttc
121 tttttctccc tgcctccgct cctagtgagt aaccacactg gccgcatcaa ggtggtcttt
181 actccgagca tctgtaaagt gacctgcacc aagggcagct gtcagaacag ctgtgagaag
241 gggaacacca ccactctcat tagtgagaat ggtcatgctg ccgacaccct gacggccacg
301 aacttccgag tggtaatttg ccatcttcca tgtatgaatg gtggccagtg cagttcaagg
361 gacaaatgtc agtgccctcc aaatttcaca ggaaaacttt gtcagatccc agtccatggt
421 gccagcgtgc ctaaacttta tcagcattcc cagcagccag gcaaggcgtt ggggacgcat
481 gtcatccatt caacacatac cttgcctctg accgtgacta gccagcaagg agtcaaagtg
541 aaatttcctc ctaacatagt caatatccat gtgaaacatc ctcctgaagc ttccgtccag
601 atacatcagg tttcaagaat tgatggccca acaggccaga agacaaaaga agctcaacca
661 ggccaatccc aagtctcgta ccaagggctt cctgtccaga agacccagac catacattcc
721 acatactccc accagcaggt cattcctcac gtctaccccg tggctgctaa gacacagctt
781 ggccggtgct tccaggaaac cattgggtca cagtgtggca aagcgctccc tggcctttca
841 aagcaagagg actgctgtgg aactgtgggt acctcctggg gctttaacaa atgccagaaa
901 tgccccaaga aaccatctta tcatggatac aaccaaatga tggaatgcct accgggttat
961 aagcgggtta acaacacctt ttgccaagat attaatgaat gtcagctaca aggtgtatgc
1021 cctaatggtg agtgtttgaa taccatgggc agctatcgat gtacctgcaa aataggattt
1081 gggccggatc ctaccttttc aagttgtgtt cctgatcccc ctgtgatctc ggaagagaaa
1141 gggccctgtt accgacttgt cagttctgga agacagtgta tgtaccctct gtctgttcac
1201 ctcaccaagc agctctgctg ttgtagtgtg ggcaaggctg ggccacactg tgagaaatgt
1261 ccccttccag gcacagctgc ttttaaggaa atctgtcctg gtggaatggg ttatacggtt
1321 tctggcgttc atagacgcag gccaatccat caccatgtag gtaaaggacc tgtatttgtc
1381 aagccaaaga acactcaacc tgttgctaaa agtactcatc ctccacctct cccagccaag
1441 gaagagccag tggaggccct gaccttctcc cgggaacacg gggccaggag tgcggagcca
1501 gaagtggcaa ctgcaccccc tgaaaaggaa ataccttcat tggatcaaga gaaaaccaaa
1561 cttgagcctg gtcaacccca gctgtctcca ggcatttccg ctattcatct gcatccacag
1621 tttccagtag tgattgaaaa aacatcacct cctgtgcctg ttgaagtagc tcctgaagct
1681 tctacgtcta gtgccagcca agtgattgct cctactcaag tgacagaaat caatgaatgt
1741 actgtgaacc ctgatatctg tggagcagga cactgcatta acctaccagt gagatatacc
1801 tgtatatgct acgagggcta caggttcagt gaacaacaga ggaaatgtgt ggatattgat
1861 gagtgtactc aggtccaaca cctctgctcc cagggccgct gtgaaaacac cgagggaagt
1921 ttcttgtgca tttgcccagc aggatttatg gccagtgagg agggtactaa ctgcatagat
1981 gttgacgaat gcctgaggcc ggacgtctgt ggggaggggc actgtgtcaa tactgtgggg
2041 gccttccggt gtgaatactg tgacagcggg taccgcatga ctcagagagg ccgttgtgag
2101 gatattgatg aatgtttgaa tccaagcact tgtccagatg agcagtgtgt gaattctcct
2161 ggatcttacc agtgcgttcc ctgcacagaa ggattccgag gctggaatgg acagtgcctt
2221 gatgtggacg agtgcctgga accaaacgtc tgcgcaaatg gtgattgttc caaccttgaa
2281 ggctcctaca tgtgttcatg ccacaaaggc tatacccgga ctccggacca caagcactgt
2341 agagatattg atgaatgtca gcaagggaat ctatgtgtaa acgggcagtg caaaaatacc
2401 gagggctcct tcaggtgcac ctgtggacag gggtaccagc tgtcggcagc taaagaccag
2461 tgtgaagaca ttgatgaatg ccagcaccgt catctctgtg ctcatgggca gtgcaggaac
2521 actgagggct cttttcaatg tgtgtgtgac cagggttaca gagcatctgg gcttggagac
2581 cactgtgaag atatcaatga atgcttggag gacaagagtg tttgccagag aggagactgc
2641 attaatactg cagggtccta tgattgtact tgtccggatg gatttcagct agatgacaat
2701 aaaacatgtc aagatattaa tgaatgtgaa catccagggc tctgtggtcc gcaaggggag
2761 tgcctaaaca cagagggttc tttccattgt gtctgccagc agggtttctc aatctctgca
2821 gatggccgta cgtgtgaaga tattgatgaa tgtgtaaaca acactgtttg tgacagtcac
2881 gggttttgtg acaatacagc tggctccttc cgctgcctct gttatcaggg ctttcaagcc
2941 ccacaggatg ggcaagggtg tgtggatgtg aatgaatgtg aactgctcag tggggtgtgt
3001 ggtgaagcct tctgtgaaaa cgtggaaggg tccttcctgt gcgtgtgtgc tgatgaaaac
3061 caagagtaca gccccatgac tgggcagtgc cgctcccgga cctccacaga tttagatgta
3121 gatgtagatc aacccaaaga agaaaagaaa gaatgctact ataatctcaa tgacgccagt
3181 ctctgtgata atgtgttggc ccccaatgtc acgaaacaag aatgctgctg tacatcaggc
3241 gcgggatggg gagataactg cgaaatcttc ccctgcccgg tcttgggaac tgctgagttc
3301 actgaaatgt gtcccaaagg gaaaggtttt gtgcctgctg gagaatcatc ttctgaagct
3361 ggtggtgaga actataaaga tgcagatgaa tgcctacttt ttggacaaga aatctgcaaa
3421 aatggtttct gtttgaacac tcggcctggg tatgaatgct actgtaagca agggacgtac
3481 tatgatcctg tgaaactgca gtgctttgat atggatgaat gtcaagaccc cagtagttgt
3541 attgatggcc agtgtgttaa tacagagggc tcttacaact gcttctgtac tcaccccatg
3601 gtcctggatg cgtcagaaaa aagatgtata cgaccggctg agtcaaacga acaaatagaa
3661 gaaactgatg tctaccaaga tttgtgctgg gaacatctga gtgatgaata cgtgtgtagc
3721 cggcctcttg tgggcaagca gacaacgtac actgagtgct gctgtctgta tggagaggcc
3781 tggggcatgc agtgtgccct ctgccccctg aaggattcag atgactatgc tcagctgtgt
3841 aacatccccg tgacgggacg ccggcagcca tatggacggg acgccttggt tgacttcagt
3901 gaacagtata ctccagaagc cgatccctac ttcatccaag accgttttct aaatagcttt
3961 gaggagttac aggctgagga atgcggcatc ctcaatggat gtgaaaatgg tcgctgtgtg
4021 agggtccagg aaggttacac ctgcgattgc ttggatgggt atcacttgga tactgccaag
4081 atgacctgtt tcgatgtaaa tgaatgcgat gagttgaaca accggatgtc tctctgcaag
4141 aatgccaagt gcattaacac cgatggttcc tacaagtgtt tgtgtctgcc aggctacgtg
4201 ccttctgaca agccaaacta ctgcactccg ttgaataccg ccttgaattt agagaaagac
4261 agtgacctgg agtgaaacag aatctacata acctaagccc atatactctg cactgtgtaa
4321 aggaaaaggg agaaatgtat tatacttgag acattgcacc taccccggaa ggctggaaat
4381 acagaaacag catggagttg caagtcctct gaagacaatg agaggattta ggatgagccc
4441 gataggtgtg gcagaccaaa tggacatttc tctaaaaaac cagtatatat agtctgttca
4501 tatgtaaaat tcaatggaag agaggtggaa cagtgctgtt attttaaaca gaaggttgta
4561 ttattatgtt gttttgtttt ttttactatt gcttgattaa atttggcatt taaatagtgg
4621 tggaaatatt tttatataat tttcattttt tggttgtgca gttccttggc tactgttttt
4681 cttttacttc agttttttaa aaatctcaaa tgaaaaagtc ttcgatacaa tattgttaag
4741 ctgtattata agtattgtta cacagggtta tgcaattccc ggcctggagc atttttgaaa
4801 ttcaaattgt ctgtcctgtg gagcaggcag tgattttgtt ccaaaacttt gtatacacat
4861 ttggagaaaa gtactttata ttttcagtgt tttgtctgat tttaatgtcc gttcttagcc
4921 aagctgctag caggtgttaa ttggatccct ttccttcact gaaatggaag agtttataag
4981 cttacgttag tattgtaata tgtaaagtaa gcccaacaaa aatttttaaa aatttgatga
5041 tccccaatat atctaccatt gtatgttaaa taaatctcgt gccgaattc
//
LOCUS ALRVSRC 1801 bp ss-RNA VRL 27-JUN-1990
DEFINITION Rous sarcoma virus (Schmidt-Ruppin D strain) v-src gene, complete
cds.
ACCESSION M33292
KEYWORDS oncogene; pp60v-src; src gene; tyrosine kinase.
SOURCE Rous sarcoma virus (strain Schmidt-Ruppin D) RNA, clone psrc1.
ORGANISM Rous sarcoma virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Avian sarcoma viruses.
REFERENCE 1 (bases 1 to 1801)
AUTHORS Reddy,S., Mazzu,D., Mahan,D. and Shalloway,D.
TITLE Sequence and functional differences between Schmidt-Ruppin D and
Schmidt-Ruppin A strains of pp60v-src
JOURNAL J. Virol. 64, 3545-3550 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.I.Shalloway, 26-MAR-1990.
FEATURES from to/span description
pept 65 1645 pp60v-src protein
BASE COUNT 392 a 537 c 551 g 321 t
ORIGIN
1 actctgctgg tggcctcgcg taccactgtg gccaagcggt agctggaacg tgcagccgac
61 caccatgggg agtagcaaga gcaagcctaa ggaccccagc cagcgccggc gcagcctgga
121 gccacccgac agcacccacc acgggggatt cccagcctcg cagaccccca acaagacagc
181 agcccccgac acgcaccgca cccccagccg ctccttcggg accgtggcca ccgagcccaa
241 gctcttcgag gacttcaaca cttctgacac cgttacgtcg ccgcagcgtg ccggggcact
301 ggctggcggc gtcaccactt tcgtggctct ctacgactac gagtcctgga ttgaaacgga
361 cttgtccttc aagaaaggag aacgcctgca gattgtcaac aacacggaag gtaactggtg
421 gctggctcat tccgtgacta caggacagac gggctacatc cccagtaact atgtcgcgcc
481 ctcagactcc atccaggctg aagagtggta ctttgggaag atcactcgtc gggagtccga
541 gcggctgctg ctcaaccccg aaaacccccg gggaaccttc ttggtccggg agagcgagac
601 gacaaaaggt gcctattgcc tctccgtttc tgactttgac aacgccaagg ggctcaatgt
661 gaagcactac aagatccgca agctggacag cggcggcttc tacatcacct cacgcacaca
721 gttcagcagc ctgcagcagc tggtggccta ctactccaaa catgctgatg gcttgtgcca
781 ccgcctgacc aacgtctgcc ccacgtccaa gccccagacc cagggactcg ccaaggacgc
841 gtgggaaatc ccccgggagt cgctgcggct ggaggtgaag ctggggcagg gctgctttgg
901 agaggtctgg atggggacct ggaacggcac caccagagtg gccataaaga ctctgaagcc
961 cggcaccatg tccccggagg ccttcctgca ggaagcccaa gtgatgaaga agctccagca
1021 tgagaagctg gttcaactgt acgcagtcgt gtcggaagag cccatctaca tcgtcattga
1081 gtacatgagc aaggggagcc tcctggattt cctgaaggga gagatgggca agtacctgcg
1141 gctgccacag ctcgttgata tggctgatca gattgcatcc ggcatggcct atgtggagag
1201 gatgaactac gtgcaccgag acctgcgggc ggccaacatc ctggtggggg agaacctggt
1261 gtgcaaggtg gctgactttg ggctggcacg cctcatcgag gacaacgagt acacagcacg
1321 gcaaggtgcc aagttcccca tcaagtggac agcccccgag gcagccctct atggccggtt
1381 caccatcaag tcggatgtct ggtccttcgg catcctgctg actgagctga ccaccaaggg
1441 ccggatgcca tacccaggga tgggcaacgg ggaggtgctg gaccgggtgg agaggggcta
1501 ccgcatgccc tgcccgcccg agtgccccga gtcgctgcat gaccttatgt gccagtgctg
1561 gcggagggac cctgaggagc ggcccacttt tgagtacctg caggcccagc tgctccctgc
1621 ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt taagctacaa caaggcaagg
1681 cttggccgac aattgcatga agaatctgct tagggttagg cgttttgcgc tgcttcgcga
1741 tgtacgggcc agatatacgc gtatctgagg ggactagggt gtgtttaggc gaaaagcggg
1801 g
//
LOCUS RATLACREC 2041 bp ss-mRNA ROD 27-JUN-1990
DEFINITION Rat lactogen receptor mRNA, complete cds.
ACCESSION M34083
KEYWORDS lactogen receptor.
SOURCE Rat (strain Sprague-Dawley) luteinized ovary, cDNA to mRNA, clone
R1.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2041)
AUTHORS Zhang,R., Buczko,E., Tsai-Morris,C.H., Hu,Z.-Z. and Dufau,M.L.
TITLE Isolation and characterization of two novel rat ovarian lactogen
receptor cDNA species
JOURNAL Biochem. Biophys. Res. Commun. 168, 415-422 (1990)
STANDARD full staff_review
COMMENT Authorin submission for [1] kindly submitted by M.L.Dufau,
08-MAY-1990.
FEATURES from to/span description
pept 93 1925 lactogen receptor precursor
sigp 93 149 lactogen receptor signal peptide
matp 150 1922 lactogen receptor
signal 2031 2036 poly-A signal
BASE COUNT 615 a 497 c 460 g 469 t
ORIGIN
1 gtgagcactg cagatgtttt gcacatgagc cctgaaggga gcctctgata cattgcctat
61 agcaagaaga aggggccaac ctgaagaaaa acatgccatc tgcacttgct ttcgtcctac
121 ttgttctcaa catcagcctc ctgaagggac agtcaccacc agggaaacct gagatccaca
181 aatgtcgctc tcctgacaag gaaacattca cctgctggtg gaatcctggg acagatggag
241 gacttcctac caattattca ctgacttaca gcaaagaagg agagaaaacc acctacgaat
301 gtccagacta caaaaccagt ggccccaact cctgcttctt tagcaagcag tacacttcca
361 tctggaaaat atatatcatc acagtaaatg ccacgaacca aatgggaagc agttcctcgg
421 atccacttta tgtggatgtg acttacatcg ttgagccaga gcctcctcgg aacctgacat
481 tagaagtaaa acagctaaaa gacaaaaaaa catatctgtg ggtaaaatgg tccccaccca
541 ccataactga tgtgaaaact ggttggttta caatggaata tgaaattcga ttaaagcctg
601 aagaagcaga agagtgggag atccatttta caggtcatca aacacagttt aaagtttttg
661 acctatatcc agggcaaaag tatcttgtcc agactcgctg caagccagac catggatact
721 ggagtagatg gagccaggag agttccgttg aaatgccaaa tgacttcacc ttgaaggaca
781 caaccgtgtg gatcattgcg gccattctct ctgctgtcat ctgtttgatt atggtctggg
841 cagtggcttt gaagggctat agcatgatga cctgcatctt tccaccagtt cctgggccaa
901 aaataaaagg atttgatacc catctgctgg agaagggcaa gtctgaagag ctgctgagtg
961 ccttggggtg ccaagacttt ccccctactt ctgactgtga ggacttgctg gtggagttct
1021 tagaagttga tgacaatgag gacgagcggc taatgccatc ccattccaaa gagtatccag
1081 gtcaaggtgt taagcccaca cacctagatc ccgacagtga ctctgttcac ggaagctatg
1141 acagccattc tcttttatct gaaaagtgtg aggaacccca ggcctacccc cctactttgc
1201 acatccctga gatcactgag aagccagaga atcctgaagc aaatattcct cccaccgtgg
1261 acccccaaag caccaacccc aattttcatg tagatgcacc caaatcttca acatggccat
1321 tactgcctgg ccaacacatg cccagatctc cttaccacag tgttgctgat gtgtgcaagc
1381 tagccggaag tcctgtgaat acactggact ctttcttgga caaagcagag gaaaatgttc
1441 taaagttgtc taaagccctt gagactggag aggaagaagt ggctgaggaa aaagggggaa
1501 aaagcttccc ttctgacaaa caaaacacac cttggccgct gctccaggag aaaagcccca
1561 ctgtctatgt taaaccccca gattatgtgg agattcacaa agtcaacaaa gatggagtgc
1621 tatcattatt ccccaagcag agagaaaaca accagacaga gaagcctggg gttcctgaaa
1681 ccagtaagga gtatgccaag gtgtctggca ttacggataa caatatcctc gtattagtgc
1741 cagactcacg agccaagaac acagcgttgc tcgaggaatc agccaagaag gctccaccat
1801 cgtttgaagc tgaccaatct gagaaagatc tggccagctt cactgcaacc tcaagcaacc
1861 gcagactcca actgggtagg ctggattacc tggatcctac gtgcttcatg cactcctttc
1921 actgatagct agacttatgg aacgattggc taaactgtga tttctcttca ggtaacacta
1981 cagagtcatg aagtaatgtg gtctgctagc aaatgttaca ggatgtgggt ataaaaaaaa
2041 a
//
LOCUS YSCCP1A 1403 bp ds-DNA PLN 27-JUN-1990
DEFINITION S.cerevisiae centromere-binding protein CP1 gene, complete cds.
ACCESSION M34070
KEYWORDS CP1 protein; DNA-binding protein; centromere-binding protein.
SOURCE S.cerevisiae (strain X2180) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1403)
AUTHORS Baker,R.E. and Masison,D.C.
TITLE Isolation of the gene encoding the Saccharomyces cerevisiae
centromere-binding protein CP1
JOURNAL Mol. Cell. Biol. 10, 2458-2467 (1990)
STANDARD full staff_review
COMMENT Authorin submission for [1] kindly submitted by R.E.Baker,
25-APR-1990.
FEATURES from to/span description
pept 256 1311 centromere-binding protein CP1
BASE COUNT 508 a 284 c 310 g 301 t
ORIGIN
1 ggatccttcg ggaccaccat gtctcaccca ccaaggtcaa aaaaaataaa atcggacgag
61 aaaagtattg ggcaaacaac aattatccac gtgcttatca cgttttttta tttaatgtag
121 aatgtaaacc attatacctt aaagataaac actttgttaa ttattcgttt tagttggcgg
181 tcattttcat ttctgctaca agtaccaaca tcaagtgctt aaaatataat acggttttct
241 acacttttat taacgatgaa ctctctggca aataataata agctttctac tgaggatgag
301 gaaatccatt cggcaagaaa aagaggctat aacgaggagc agaactacag cgaggccagg
361 aaaaagcaaa gagatcaagg cttgttgtca caagagagca atgacggaaa tattgactct
421 gctctgttga gcgaaggggc tacgctaaaa gggactcaaa gtcagtatga aagcggactg
481 acgtctaaca aagatgagaa aggaagtgat gacgaagatg cgtcagtggc tgaggctgct
541 gttgccgcta ccgtcaatta tacagactta atccagggcc aggaagatag ttctgatgcc
601 cacacatcta atcaaacgaa cgcgaatggt gagcacaagg attcccttaa tggggaaaga
661 gccataaccc catcaaatga aggagtcaag ccaaacacct ccctggaggg aatgacgtct
721 tcaccaatgg aatctacaca gcaatctaaa aacgacatgc tcataccatt ggcagaacac
781 gatcgtgggc cagaacacca acaggatgat gaagacaatg atgacgctga cattgatctc
841 aaaaaggata taagcatgca gccgggtcgt cgtggaagaa aacctactac tttggccaca
901 acagacgagt ggaaaaagca aagaaaagac tcccataaag aagtcgaaag gcgtcgccgc
961 gaaaacatca acactgcaat caacgtttta agcgacctcc tgcccgtgag agaatcaagt
1021 aaggcagcaa tattggcgcg tgccgctgaa tacattcaaa aattgaaaga aaccgatgaa
1081 gcaaacatcg aaaagtggac gttacaaaaa ttgcttagcg agcaaaacgc atcgcaatta
1141 gccagtgcaa atgagaaact gcaggaagaa ctgggaaatg cttacaaaga aatcgagtac
1201 atgaaacgcg ttttaaggaa ggagggaata gaatacgagg atatgcacac tcacaagaaa
1261 caagagaacg aaagaaaaag cactaggagc gataatccac atgaggcttg agagttaaaa
1321 atagatagct aaatgtattt cgagtctccc tatgtatctg cgtctctgcc ccatagcacc
1381 tctgttaagt ttttatcaat att
//
LOCUS MUSCD3A 1347 bp ss-mRNA ROD 27-JUN-1990
DEFINITION Mouse CD3-eta protein mRNA, complete cds.
ACCESSION M33158
KEYWORDS CD3-eta protein.
SOURCE Mouse (strain B6/CBA F1J) female thymus, cDNA to mRNA, clone pBS17.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1347)
AUTHORS Jin,Y.-J., Clayton,L.K., Howard,F.D., Koyasu,S., Sieh,M.,
Steinbrich,R., Tarr,G.E. and Reinherz,E.L.
TITLE Molecular cloning of the CD3-eta subunit identifies a
CD3-zeta-related product in thymus-derived cells
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3319-3323 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.L.Reinherz, 22-MAR-1990.
FEATURES from to/span description
pept 139 759 CD3-eta protein precursor
sigp 139 201 CD3-eta signal peptide
matp 202 756 CD3-eta protein
mRNA < 1 1347 CD3-eta mRNA
signal 1329 1334 poly-A signal
BASE COUNT 374 a 342 c 336 g 295 t
ORIGIN
1 cgaggagagg cacccggtac atgttctcta ggagaacaaa gccagcagag actccatcag
61 cgcctccttt tctcctcatc ctcccaggca tagctgcctc tgcctctgcc tctgggtacc
121 atcccaggga agcagaagat gaagtggaaa gtgtctgttc tcgcctgcat cctccacgtg
181 cggttcccag gagcagaggc acagagcttt ggtctgctgg atcccaaact ctgctacttg
241 ctagatggaa tcctcttcat ctacggagtc atcatcacag ccctgtacct gagagcaaaa
301 ttcagcagga gtgcagagac tgctgccaac ctgcaggacc ccaaccagct ctacaatgag
361 ctcaatctag ggcgaagaga ggaatatgac gtcttggaga agaagcgggc tcgggatcca
421 gagatgggag gcaaacagca gaggaggagg aacccccagg aaggcgtata caatgcactg
481 cagaaagaca agatggcaga agcctacagt gagatcggca caaaaggcga gaggcggaga
541 ggcaaggggc acgatggcct ttaccaggac agccacttcc aagcagtgca gttcgggaac
601 agaagagaga gagaaggttc agaactcaca aggacccttg ggttaagagc ccgccccaaa
661 ggtgaaagca cccagcagag tagccaatcc tgtgccagcg tcttcagcat ccccactctg
721 tggagtccat ggccacccag tagcagctcc cagctctaag gccctgtgct cagctctggt
781 gatgaccctg gctgctgtca catgagttgt ggtgaggatg ggacttttga aaaatctgat
841 gttccaattt ctttcatgca tgctctactc agaagtgagc aagggccaaa actcctgggc
901 atgcatctga atgaatctct caacttagaa aagacttgcc tgcccctctg agctggccag
961 gtgtccccac ctaccctttg gcatgcctcc aagtgccagg acgccacaga ctgcattggg
1021 gccgaagact tcccttttct ttggtttttt tgtttgtttt gagtttatac aatcattaag
1081 aaatctttgg ttttggctgg aaatggaaaa acaaaacaaa tcaaagaaac caccctcccc
1141 tggcttatag cagcagtatt atgacctgac ctggctgagc tttccccccc tccaactttg
1201 ggggtggaaa ttgcaagtta agaactacat tcaaagaaaa cgttgaaagg gccggagaag
1261 cagcttccag aaagcccgtc ggatataaga ttgtcaaata ataataacta ttattataat
1321 atattaataa taaacttaaa tatttgg
//
LOCUS YSCPRP5 3017 bp ds-DNA PLN 27-JUN-1990
DEFINITION S.cerevisiae pre-RNA processing protein (PRP5) gene, complete cds.
ACCESSION M33191
KEYWORDS PRP5 protein; helicase; pre-RNA processing protein.
SOURCE S.cerevisiae (strain SPJ 5.41) vegetative DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3017)
AUTHORS Dalbadie-McFarland,G. and Abelson,J.
TITLE PRP5: A helicase-like protein required for mRNA splicing in yeast
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4236-4240 (1990)
STANDARD full staff_review
COMMENT Authorin submission for [1] kindly submitted by
G.Dalbadie-McFarland, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept 284 2833 PRP5
BASE COUNT 1101 a 469 c 636 g 811 t
ORIGIN Chromosome 2R.
1 gttgacattc tacttttcta agacggataa tggacgactc cacgttgtag tagttgtcac
61 attaatcgga aaagtgtctc acatgtaaag gattaaaata agaaatcttc taatttatgt
121 tgaaatatct tgatttacca ttacccggat tagatgaaac taattaaagt catttagagc
181 aatggaccaa actaaaggta ttagagttct ttctaatata atacagcgcc aaataattcg
241 agaagggaaa tcataacgct gttgccttgg gtttatttga ggtatggaaa ctattgattc
301 gaagcaaaat attaataggg agtctttatt ggaggaaagg aggaaaaaac tagcaaaatg
361 gaaacaaaaa aaagcacaat ttgatgctca gaaagagcat caaacttcac gcaatgacat
421 tgttactaac agtttagaag gtaagcagac tactgaaaag tttacggaaa gacaggaacg
481 tgtaaaagaa gagcttcgga agagaaaaaa tgaatttagg aaatctgacg aacctgtatc
541 ggttaagcct tcgaagaaaa agtcgaagag aagtaaggta aagaagaaaa tatccttcga
601 cttcagtgat gacgatgatt ctgaaatagg agtttctttt agatcgaagg agcacattca
661 aaaagctcca gagcatgata atgaaaaaga tccactggat gagtttatga cgtcattgaa
721 ggaggagaaa atgagtaata gtaaaggcat gtacgacaga ggcgacattc ttgatgtaga
781 ggatcaattg tttgaacttg gaggaaccga cgatgaggat gttgaagata atacggataa
841 ttctaatata gcgaaaattg ctaaactcaa agcaaaaaaa cgtgtaaagc aaatttatta
901 ctctccagaa gagcttgagc cgttccaaaa aaatttttat atagaatctg aaacagtttc
961 ctcgatgtca gagatggaag ttgaggagct tagacttagt ctggataata tcaaaataaa
1021 gggaacaggt tgccccaaac cagtcacaaa gtggtctcaa ctgggactat caacggacac
1081 catggtttta attacagaaa agttgcactt tggctcctta acgcctattc aatcccaggc
1141 tcttcccgct attatgtcag gtcgtgatgt tataggaata tcaaaaactg gctccggcaa
1201 gaccatctcc tatcttttac cattgctaag acaagtaaaa gctcaacggc cattatcaaa
1261 acatgaaaca gggcccatgg gtttaattct ggctccaact agagagctag ctttacaaat
1321 acatgaagaa gttactaaat tcacagaagc agatacatcc attagatcag tatgctgtac
1381 aggaggttct gaaatgaaaa agcagattac tgatcttaaa agaggcactg agattgttgt
1441 tgccacaccg ggacgattta ttgatatatt aacactaaat gatgggaaat tacttagtac
1501 taaaagaata acgttcgtag taatggatga ggcagacagg ctgttcgatt taggttttga
1561 acctcaaata acgcaaatca tgaaaactgt tcgaccggat aaacagtgtg ttctatttag
1621 tgcaactttt ccgaacaaac tacgcagttt tgccgtaaga gttttgcatt ccccaatatc
1681 tattacgatt aattcaaagg gaatggttaa tgaaaacgta aaacaaaagt ttagaatatg
1741 ccattcagaa gatgaaaaat ttgacaatct ggtgcagctt atccatgagc gcagtgaatt
1801 ttttgacgag gttcaaagtg aaaatgacgg acagtcaagc gatgttgaag aagttgatgc
1861 taaagccatt atatttgtat caagtcagaa tatttgcgac ttcatttcaa aaaagctgtt
1921 aaatgctgga attgtgacct gtgctattca tgcaggtaag ccatatcaag aaaggcttat
1981 gaacttagaa aagttcaaac gagaaaagaa cagtattctt ctctgtacag aggttctctc
2041 aagaggttta aatgttcccg aagtgtcgtt ggttattatc tataacgccg tcaaaacttt
2101 cgcgcaatat gttcatacta ctggaagaac tgccagaggt agtcgctccg gtactgctat
2161 tactcttcta ttgcatgatg aactgtcagg tgcttatatc ttgagcaagg caatgcgcga
2221 tgaagaaata aaagctttag accctctgca agcaaaagaa cttcaggaaa tgagtgccaa
2281 gtttgaatcc ggtatgaaaa agggtaagtt taggttgtcg aaagggttcg gcggcaaagg
2341 actggaaaat atcaaaagta aaagagagga agctcaaaat aaagatttgg agctcaaaaa
2401 gaatgataag aggagcgatg acttggaaaa gaaaatcagc aatccacgcg aagggcatga
2461 cagtgtgtct gagtcttctg cgcttattcc gagactcaat tatgaactat tcaaagagtc
2521 tacagacggg tcgatcatct tttacgccaa ggtttatatt aatgatttac cccagattgt
2581 aagatgggag gctacaaaga atacaacctt attatttatc aagcatgaaa ccggatgtag
2641 tattactaac aagggtaaat tttaccctga agggaaggaa ccaaagaatg aaaacgatga
2701 gcctaaacta tacttattga tcgaaggcca agatgagaaa gacatacaat taagtataga
2761 attgctggag caaaaagtta aagagggggt cgtaaaggct gcaagcttgt ctttgaagag
2821 tactaaatac tagtttttag aatttaactc actcgtggtg ctatatactt tcgtagttag
2881 ttctgaaatt tttagggtgg taatttaaag catattcaat cataataaaa aaaaaaaaag
2941 agaattatta ggaaaaataa aaactaaaat aatatataat atatgatgta gaaaaaaatc
3001 ttgaaatttt tatcgat
//
LOCUS STMACM 1155 bp ds-DNA BCT 27-JUN-1990
DEFINITION S.globisporus N-acetylmuramidase M1 (acm) gene, complete cds.
ACCESSION M30645
KEYWORDS N-acetylmuramidase M1.
SOURCE S.globisporus (strain ATCC 21553) DNA.
ORGANISM Streptomyces globisporus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 1155)
AUTHORS Lichenstein,H.S., Hastings,A.E., Langley,K.E., Mendiaz,E.A.,
Rohde,M.F., Elmore,R. and Zukowski,M.M.
TITLE Cloning and nucleotide sequence of the N-acetylmuramidase
M1-encoding gene from Streptomyces globisporus
JOURNAL Gene 88, 81-86 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.S.Lichenstein, 12-DEC-1989, for release after publication.
FEATURES from to/span description
pept 175 1059 N-acetylmuramidase M1 precursor
matp 406 1056 N-acetylmuramidase M1
BASE COUNT 175 a 454 c 372 g 154 t
ORIGIN
1 cgccattgct ctagaccggc ggagccgaca gatgggtctt tacgggggtc tacgcgcgtg
61 cattacttgt gtcgtgctca tggcgaccgg cctttccggg ccccttcgcc ggggcaagca
121 ccggtcattc gcgcgttcca cggctacatc ccccactcgt gcctggaggc agtcatgccc
181 gcgtacagct ctctcgcacg ccgcggccgc agacccgcgg tcgtcctcct cggcggtctc
241 gtcagcgcct ccctggcgct caccctggcg cccaccgccg ccgccgcgcc cctcgcgccc
301 ccgcccggca aggacgtcgg gcccggcgag gcgtacatgg gtgtcggcac ccgcatcgag
361 caggggctcg gcgccggccc cgacgagcgc accatcggcc cggccgacac cagcggtgtc
421 caggggatcg acgtgtcgca ctggcagggc tccatcaact ggagctcggt gaagtcggcc
481 gggatgtcct tcgcctacat caaggcgacc gagggcacca actacaagga cgaccggttc
541 agcgcgaact acaccaacgc gtacaacgcg gggatcatcc ggggcgccta ccacttcgcc
601 cgcccgaacg cctccagcgg cacggcgcag gccgactact tcgccagcaa cggcggcggc
661 tggtcccgcg acaaccggac cctgccgggc gtcctggaca tcgagcacaa cccctccggc
721 gccatgtgct acgggctctc caccacgcag atgcgcacct ggatcaacga cttccacgcc
781 cggtacaagg cgcgcaccac ccgcgacgtc gtcatctaca ccacggcgag ctggtggaac
841 acctgcaccg gcagctggaa cggcatggcg gccaagtccc cgttctgggt ggcccactgg
901 ggcgtgagcg ccccgacggt gccgagcggc ttcccgacct ggacgttctg gcagtactcg
961 gcgaccggcc gggtcggcgg cgtcagcggg gacgtcgacc gcaacaagtt caacggctcc
1021 gccgcccgtc tgctggccct ggccaacaac acggcgtgag acggccggag ggccgggggc
1081 acggcacgca cgccctgccc ccggccctcc cccgcgcccg gcgcggctac cgcatccgcc
1141 cgaggtgctc gcgca
//
LOCUS YSCKRE5 4383 bp ds-DNA PLN 27-JUN-1990
DEFINITION S.cerevisiae Kre5 protein (KRE5) gene, complete cds.
ACCESSION M33556
KEYWORDS Kre5 protein.
SOURCE S.cerevisiae (strain GRF88) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 4383)
AUTHORS Meaden,P., Hill,K., Wagner,J., Slipetz,D., Sommer,S.S. and
Bussey,H.
TITLE The yeast KRE5 gene encodes a probable endoplasmic reticulum
protein required for (1->6)-beta-D-glucan synthesis and normal cell
growth
JOURNAL Mol. Cell. Biol. 10, 3013-3019 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
H.Bussey, 04-APR-1990, for release after publication.
FEATURES from to/span description
pept 77 4174 KRE5 protein precursor
sigp 77 128 KRE5 protein signal peptide
matp 129 4171 KRE5 protein
BASE COUNT 1448 a 758 c 826 g 1351 t
ORIGIN Chromosome 15R.
1 tatataacgt ggcatattaa agattaattg tcctggtaga atatagacgt atcagtgtga
61 gtgcctctgt tgattaatga gactacttgc gttggtattg ttattgttgt gtgcgccgct
121 tcgtgcatgg acttatagct tacgatatgg catacccgaa tctgctcagg tctggtctat
181 tttagttcat ttactgggcg atgttgataa tcagctgtta actaatttat atcctttggt
241 taccggtttg gatgacgaga ttgatattca agaaaatctt gtggcgctaa cttccaatgt
301 attaagggag cgatacgata aagaggatgt ggctgattta ttggaactgt atgctagtct
361 ttaccctatg ggtatgatac agcacgatat cagttccaat gcagaacaag acgatgcaaa
421 tagtagctat ttcgttttga atggtaatag gtacgaaaag cccgacgacg tgttctactt
481 gaaatctaag gatttaacaa ttcaacagaa agtcccagat gttgatgtta tacaacctta
541 cgatgttgtc attggtacta actcagaagc gccgatattg atcttgtacg gttgtcctac
601 cgttattgac tccgacttcg aagaattcaa taggaattta tttatggaag caatgaatgg
661 agagggaaaa tttagattta tttggagatc cacatgttcc cttgatggga aaagcgtgga
721 gtatccctta actcatccgc ttgaaattac tttacaaaat ggttctagaa tgagctccat
781 acctcaatta aaaaaaatac tatatactgt acccaaagaa atattggttg gagcagacaa
841 cgatgatcag ctccatgatc tagaaccaga agaattacgt gaacttgatt tgagagtaac
901 atcgttaatc tcagaatttt accaatataa aaaggatatc acagccactc taaatttcac
961 caaaagtatt gttaacaact ttccactaat ctctaaacaa ctgattaagg tttcatctgt
1021 taacaaggat ataataacaa gtaatgaaga actcaatagt aaaggcttcg attacaacat
1081 gctaggtctc tatattaatg gacagaattg gaaaattacc tcactgactc cgtacaattt
1141 gcttactgct ttaaaaactg aataccaaag tttactgaaa attacgaacc ttttgcaaga
1201 actcgagcca tcgaaatgca tactagattc caagttttta ctcaataagt tttctcaatt
1261 ttcattgggg aagttgcaaa acttacaacc aatcaaaatg gatctccaca caattccagg
1321 gttctcagaa tcagtaatat acttcaatga tatcgaaagc gacccgcaat atgacgaatt
1381 agtaaatagt gttcaagcat tttttgataa atcgaaattc ggagagttgc ctgaaataaa
1441 gcaaaactgg tcagagatca tattcgttat agatttcgcc cgtttagaag atagtgaggt
1501 gaaggaggca ttgggtgggt tggttcgtgc cgttaatgtt gtctcccagg gatatccgca
1561 aagagtcgga ctattgccat ttagttcaga tagtgacaag tccgttgtta ataaaattta
1621 cgagctgaag aactcaactg acaatttaac agaattaaaa agttttttgg agacaatgct
1681 gcttgcagat ggcctttccg cgaatgcaaa acattcaaaa cacataccag ttccagatgt
1741 tttccatcta cttgatgaac ttcaaattga cgaaacatca attataatca atggagagat
1801 ttacccattt agaaaaaatt ggaattattt aattgcaaaa gttatcaaaa aggacactga
1861 atttattcgt aaagaattga gcaattcttc tccgaaaaac aaacaaatta gcgtaaggga
1921 cttattgcat tacaaatctg caaatctgag acataataaa tatacaccaa attattttgc
1981 tgattcggta tattcttcgg tcaacaatac tgcattggaa agcgtatgct cagaaagaat
2041 aggctactat actaaaaatg aagaatacaa tttattacac acaatcacat tagtggatga
2101 ttttggctct attcatgctt tgaaaagatt gagaaacttg ttgcatactt cctttgttgg
2161 tgttaggatc agaatcattc acgtaggtga tatttctgat atttggtatc aattgcgtgg
2221 aagtctttcc caaaaagatc caataggctc aataaataca tttattgatg ctttgaaact
2281 taaaaaggta aaaagtcaca cgtacaaaaa aagcggctta aaccagttag gccttcataa
2341 atggcttcct gacattccat tatttgaatt gcaaaagggt tcatttattg ctttgaacgg
2401 tagatttatc atcttgatca aaatgaagtg ccagaaacag aacatttcga aggccaaaat
2461 cataaagaga gaagctctaa gaacgatcga ttcagttttc gccctagatt tactttttcc
2521 aggtttctca caggaaataa tcaatcctga tttgatagaa atgatctcct ccattttaac
2581 taggttgttt taccaaggta cacatatata caataatggt attgattata ccactgaaag
2641 tagcttacca agaatggatt tgagcgagtt ttttagacct aataatttaa cgatgtttga
2701 agatggaaaa tcagcttcta ttgatttact actaatttta gatccacttg aagagagaac
2761 gcaaatgatt ctttctcttg ttgagcaatt caggcctttg aaatttgtta atattcaggt
2821 aattttaatg ccgacactgg aattaaacat tgtccctatt agaagaatat acgttgatga
2881 cgcagatatt gtcaaatcaa taacttctga ggatagcaga tcagatccag aagtagatat
2941 tgaaatggat gttcctaatt ctttcattgt agataataat tatcggataa aaaaattgct
3001 catagaatta cattccttct ctagcaaaac agtcctttca actggcaata ttgatggtat
3061 ggggggtgta tgcctagcac ttgtcgattc tgcagggaac attattgaca aaactacaac
3121 aatgaaaacc tttggctatg gacaatttca taccgacaaa tttttaaagg gttgctatat
3181 aaaaagttgt gattcaagat ataccgttca gtcattttct actgacgggc atcccgactt
3241 tataccatca gattccttgg atatactgtc gtacaatcca caaaaaatcg ctgtaaaaat
3301 ttcagaagag cctacacacg aggaagaata cgaggaaggt cgcaacaatg atacaataat
3361 caatattttt actattttag agtccgggcc agatgaggaa gagaggtaca tgcaaatgat
3421 tttatccatt ttgtcaaagt gtcccgaaac gcaaaaggtg aattttttca ttttagatca
3481 gccgtttatc tccgacactt taaggaaatc atgtgagtat ataaattcct ctgatgaaat
3541 gagaggcaat gtcatttttt tgaattatga atggcctcaa tggttaagac cgcaaagatt
3601 ttcttcaagg agaagggatg tctctagatt tctgttcttg gatgtccttt tacctcaaaa
3661 catctccaaa gtgttatata tgagtccaac tgaagtaccg ctggatcctt ttgacatttt
3721 tcaatttcaa ggcctcaaac gtgcacctct aggactattc cgaatgagtg gtgatggtta
3781 ttggaaagaa ggatactggg aaaaaatgtt aagggagaat aatttagaat tttattctac
3841 cgaaccggcc tttttagtaa acttagagag gtttcgggag ttagatgctg gtgataaata
3901 caggattcac tatcaacgta tttctacaga cgccatgtct cttgtcaata tcggccaaga
3961 tctagttaac aacctacaac tcgaggttcc gattaggttt ctcaagggat cgtataagaa
4021 gaaattagtt attaatgatg aatgtgtttc tgaatggaag aaaaaaataa ataagttcgc
4081 atcctctcct ggcgatgaag acgtacctgg agaaagtgtt agcagcaaat accaagattc
4141 cgacaatgcc gctcctctgc atgacgaatt ataactactc cagaagagtt cattacgcga
4201 ctgtccaaga gcgtgaagaa ttgcttctgc gcataagtct tcttccataa tagattttta
4261 tatcattttt agaacataaa atttctcgcc aagctagttt tttagctaaa agcagatatc
4321 cagtaacatg ggttccgctt tttgcagcga atactatgaa gagttttgcc cgactggctc
4381 ccc
//
LOCUS STMCURA 2192 bp ds-DNA BCT 27-JUN-1990
DEFINITION S.curacoi curC, CurA genes, complete cds, curD gene, 3'
end and curB gene 5' end.
ACCESSION M33704
KEYWORDS .
SOURCE S.curacoi (strain ATCC 13385) DNA.
ORGANISM Streptomyces curacoi
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 2192)
AUTHORS Bergh,S. and Uhlen,M.
TITLE Cloning, analysis and heterologous expression of the polyketides
synthesis genes of Streptomyces curacoi
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Authorin submission for [1] kindly submitted by S.Bergh,
13-APR-1990.
Author address: S.Bergh, Dept. of Biochemistry, Teknikringen 30,
Stockholm, SWEDEN S-100 44. E-mail: staffan@physchem.kth.se
FEATURES from to/span description
pept < 1 32 curD protein (AA at 3)
pept 77 541 curC protein
pept 538 1806 curA protein
pept 1803 > 2192 curB protein
BASE COUNT 349 a 823 c 731 g 289 t
ORIGIN
1 tgatcaccga ccgcagggcc gcgcagccct gacagcgcag ggacccccac cccaccgacc
61 gacctggaga cagaccatgg acaagatgcg cccgcgcgtc gtggacgtca aagagatcga
121 gcccaaccgc aagcgcggcg gtgacctgcg cacgctgctc acccctgtca cggtgggtgc
181 caccagcggg ttcatgggcc tggccatcat gcggccgggc gaacgcatca gcgagcacta
241 ccacccgtac tccgaggagt tcgtgtacgt cgtcgagggc aggctggagg tcgacctcga
301 tggcgaaacg tttcctctcc gtgccgacca gggcctcatg atccccatcg acatgcggca
361 ccgcttccgc aacgtcggtg acgaggaagc ccggatggtc ttccatctga gcccgctggc
421 accgaagccg agcctcggcc acgtcgacac ggaggccccg gcgatcagcg acgacgtcaa
481 ggcgtacccg ctcgtccagg aggagagcgg gcggccggaa cgacccgggg tcctgtcgtg
541 acccggcggg tggcggtcac cggcataggc gtggtcgctc cgggaggcat cggggtgccc
601 gcgttctggg acctcctctc gagcggccgt accgcgacac gtggcatcac cctgttcgac
661 cccgaggggc tgcggtcccg catcgccgcc gagtgcgact tcgacccgct cgcccacggc
721 ctcgaccccg agctggtcga acgcgccgac cggtacatac agttcgccct ggtcgccgcc
781 gacgaggccg tgaccgacag cggcatcgac ttcggcaccg agaacccctg gcgcgtggcc
841 gtgtccctgg gcagcgcggt gggcgggacg accaggctgg agcacgacta cgtcctcgtc
901 agcgaacggg gacagcgctg ggacgtcgac caccgcgccg ccgagccgga actgcaccgc
961 gccttctcgc ccagcacgct ggccgcggac gtcgcggagc ggttcggcgc ccagggaccg
1021 gtgcagaccg tgtccaccgg ctgcacctcc gggctcgacg cggtgggtta cgccttccac
1081 accatcgagg agggccgcgc cgacgtctgc atcgccgggg cgtcggactc gccgatctcc
1141 ccaatcacca tggcctgctt cgacgccatc aaggccacgt cgcccaacaa cgacgacccg
1201 gagcacgcct cccgcccctt cgacgcccac cgcgacggct tcgtcatggg cgagggcgcc
1261 gccgtcctcg tcctggagga actggaacac gcccgtgcgc gcggcgcaca tgtctactgc
1321 gagatcggcg gctacgccac cttcggcaac gcctaccaca tgaccggact gaccagtgag
1381 ggcctggaga tggcccgggc catcgacgtc gcgctcgacc acgcgcgcgt cgacccgacc
1441 gacatcgact acgtcaacgc ccacggctcg ggcacccggc agaacgaccg ccacgagacc
1501 gccgcggtca agaagtccct gggcgcccac gcctacgaca cccccatgag ctcgatcaag
1561 tccatggtgg gccactcgct cggcgcgatc ggcgcgatcg aggtcgtcgc ctgcgtactc
1621 gccctggccc ggcaggtggt cccgccgacg gcgaactacg agaccccgga ccccgagtgc
1681 gacctggact acgtaccgcg caccgcacgg ccccgccggc tcgaccacgt gctctccgtg
1741 ggcagcggct tcggcgggtt ccagtccgcg gtgctcctga cggggccggc tgggaggaaa
1801 cgatgaacgg cagtggtagt ggcatacgca cccggcggac cgccgtcacg gggatcggtg
1861 tggtcgcgcc caacggcctg cacgccgaca cctactggaa gtccgtcaag gagggcgcga
1921 gcgtactgga ccgcatcacc cgggagggct gcgagcacct tccgctccgc gtcgccggtg
1981 aggtccgcgg attcgacccg tcggcgctca tcgaggagac cttcctcgtc cagaccgacc
2041 tgttcaccca cttcgccctg gccgccgcgg acgccgccct ccaggacgcg gggctgacca
2101 aggccgccgc cgtggccgac tcgccgtact cggtcggtgt cgtcaccgcc gccgggtccg
2161 gtggcggcga gttcgggcag cgggaactgc ag
//
LOCUS MUSTRKB 2484 bp ss-mRNA ROD 27-JUN-1990
DEFINITION Mouse tyrosine protein kinase B (trkB) mRNA, complete cds.
ACCESSION M33385
KEYWORDS tyrosine protein kinase.
SOURCE Mouse adult brain, cDNA to mRNA, TK-negative clone pFRK42.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2484)
AUTHORS Klein,R., Conway,D., Parada,L.F. and Barbacid,M.
TITLE The trkB tyrosine protein kinase gene codes for a second neurogenic
receptor that lacks the catalytic kinase domain
JOURNAL Cell 61, 647-656 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Barbacid, 29-MAR-1990, for release after publication.
FEATURES from to/span description
pept 1 1431 tyrosine protein kinase B (trkB) precursor
sigp 1 93 tyrosine protein kinase B (trkB) signal
peptide
matp 94 1428 tyrosine protein kinase B (trkB)
BASE COUNT 638 a 614 c 549 g 683 t
ORIGIN
1 atgtcgccct ggctgaagtg gcatggaccc gccatggcgc ggctctgggg cttatgcctg
61 ctggtcttgg gcttctggag ggcctctctc gcctgcccga cgtcctgcaa atgcagttcc
121 gctaggattt ggtgtactga gccttctcca ggcatcgtgg cattcccgag gttggaacct
181 aacagcgttg acccggagaa catcacggaa attctcattg caaaccagaa aaggctagaa
241 atcatcaatg aagatgacgt tgaagcttac gtggggctga gaaaccttac aattgtggat
301 tccggcttaa agtttgtggc ttacaaagcg tttctgaaaa acagcaacct gcggcacata
361 aatttcacac gaaacaagct gacgagtttg tccaggagac atttccgcca ccttgacttg
421 tctgacctga tcctgacggg taatccgttc acgtgctcct gcgacatcat gtggctcaag
481 actctccagg agactaaatc cagccccgac actcaggatt tgtactgcct caatgagagc
541 agcaagaaca tgcccctggc gaacctgcag atacccaatt gtggtctgcc atctgcacgt
601 ctggctgctc ctaacctcac cgtggaggaa ggaaagtctg tgaccctttc ctgcagtgtg
661 gggggtgacc cactccccac cttgtactgg gacgttggga atttggtttc caagcacatg
721 aatgaaacaa gccacacaca gggctcctta aggataacga acatttcatc tgatgacagt
781 ggaaagcaaa tctcttgtgt ggcagaaaac cttgtaggag aagatcaaga ttctgtgaac
841 ctcactgtgc attttgcgcc aactatcacg tttctcgagt ctccaacctc agatcaccac
901 tggtgcattc cattcactgt gagaggcaac cccaagcctg cgcttcagtg gttctacaat
961 ggggccatac tgaatgagtc caagtacatc tgtactaaga tccacgtcac caatcacacg
1021 gagtaccatg gctgcctcca gctggataac cccactcata tgaataacgg agactacacc
1081 ctgatggcca agaacgagta tgggaaggat gagagacaga tctccgctca cttcatgggc
1141 cggcctggag tcgactacga gacaaaccca aattaccctg aagtcctcta tgaagactgg
1201 accacgccaa ctgacattgg ggatactacg aacaaaagta atgaaatccc ctccacggat
1261 gttgctgacc aaagcaatcg ggagcatctc tcggtctatg ccgtggtggt gattgcatct
1321 gtggtgggat tctgcctgct ggtgatgttg ctcctgctca agttggcgag acattccaag
1381 tttggcatga aaggttttgt tttgtttcat aagatcccac tggatgggta gctgagataa
1441 aggaaagaca aaggctgggg ctgtggtgct tgttgcctga cgccctgtga gctgaactct
1501 gggactgctg ttgcctatcc caggaagtgc tgcttatttg agggtgtctg gtggaaatgg
1561 gtaatctccg aggatgtctg cagcctgctt gttgtgagct gtgactgggg aaccccaagg
1621 cagaggcagg ggtcaggcag ctgagaagca gcagaagaac acacttagat tcaccttctg
1681 ttcttacaat agttcaaata tagaatcgaa gtgaaatctc attggattat gcctctctaa
1741 tgaaaagcga gctgtttgac tatacggaaa atgtgctgac attaattgct tctgtttatt
1801 aaaggtgatt tgcaaattaa aaactctgca tctatcatct atccatctat ctgtttgtct
1861 atcatatcta tctgtctgtc tatctgtcta tcatctatct acctacctct ctatcatatc
1921 tatctgtctg tctatctatc tatctatcta tctatctatc tatctatcta tctatctatc
1981 tatctatcat ctatctacct atcatcgatc tacttatcta tcatctatct atctacctat
2041 catcgattta cttatctatc atctatctat ctatctatct atctatctat ctatctatct
2101 atctgtcatc tatctaaagt catagctagg tctaagtgca cactaaaagt ctaatccaca
2161 cataacacct atttcagcaa catcttctgt tctctaacct ttgctaactt ctgtgatttc
2221 cacctacaac cctgcgactg atagacttaa aggcacattg gtggtgtcat tagtaggttc
2281 tttgttttgc tggcagcaaa gacccaaact cttcgctaac gattgctttc aaagtccacc
2341 cggcaggtag aacggagcag caccagggac tgtgtggcca ggagtatgga cctgaattaa
2401 ccacagcctg agaataaata atggtagggt atatgcatat agggaattaa aatcttgtcc
2461 ctttccattg ccctctgcta accg
//
LOCUS STRPRH 1784 bp ds-DNA BCT 27-JUN-1990
DEFINITION Streptococcus protein H gene, complete cds.
ACCESSION M29398
KEYWORDS IgG binding protein; protein H.
SOURCE Streptococcus (group A, strain AP1) DNA.
ORGANISM Streptococcus sp.
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Streptococcaceae.
REFERENCE 1 (bases 1 to 1784)
AUTHORS Gomi,H., Hozumi,T., Hattori,S., Tagawa,C., Kishimoto,F. and
Bjoerck,L.
TITLE The gene sequence and some properties of protein H: A novel IgG
binding protein
JOURNAL J. Immunol. 144, 4046-4052 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
H.Gomi, 25-OCT-1989, for release after publication.
FEATURES from to/span description
pept 328 1458 protein H precursor
sigp 328 471 protein H signal peptide
matp 472 1455 protein H
binding 316 321 ribosome binding site
signal 246 251 -35 region
signal 270 274 -10 region
site 1489 1514 terminator
BASE COUNT 734 a 343 c 338 g 369 t
ORIGIN
1 gaattccgtt accatcaaca ggtgaaacag ctaacccatt cttcacagcg gcagccctta
61 ctgttatggc aacagctgga gtagcagcag ttgtaaaacg caaagaagaa aactaagcta
121 tcactttgta atactgagtg aacatcaaga gagaaccagt cggttctctc ttttatgtat
181 agaagaatga ggttaaggag aggtcacaaa ctaaacaact cttaaaaagc tgacctttac
241 tccttttgat taactatata taataaaaat attaggaaaa taatagcact attaattttc
301 ttttttaata aaatcaagga gtagataatg actagacaac aaaccaagaa aaattattca
361 ctacggaaac taaaaaccgg tacggcttca gtagccgttg ctttgaccgt tttgggcgca
421 ggttttgcaa accaaacaac agttaaggcg gaaggggcta aaattgattg gcaagaagag
481 tataaaaagt tagacgaaga taatgctaaa cttgttgagg ttgttgaaac cacaagtttg
541 gaaaacgaaa aactcaagag tgagaatgag gagaataaga aaaatttaga caaacttagc
601 aaagaaaatc aaggaaagct cgaaaaattg gagcttgact atctcaaaaa attagatcac
661 gagcacaaag agcaccaaaa agaacaacaa gaacaagaag agcgacaaaa aaatcaagaa
721 caattagaac gtaaatacca acgagaagta gaaaaacgtt atcaagaaca actccaaaaa
781 caacaacaat tagaaacaga aaagcaaatc tcagaagcta gtcgtaagag cctaagccgt
841 gaccttgaag cgtctcgtgc agctaaaaaa gaccttgaag ctgagcacca aaaacttgaa
901 gctgagcacc aaaaacttaa agaagacaaa caaatctcag acgcaagtcg tcaaggccta
961 agccgtgacc ttgaagcgtc tcgtgcagct aaaaaagagc ttgaagcaaa tcaccaaaaa
1021 cttgaagctg agcaccaaaa acttaaagaa gacaaacaaa tctcagacgc aagtcgtcaa
1081 ggcctaagcc gtgaccttga agcgtctcgt gcagctaaaa aagagcttga agcaaatcac
1141 caaaaacttg aagcagaagc aaaagcactc aaagaacaat tagcgaaaca agctgaagaa
1201 cttgcaaaac taagagctgg aaaagcatca gactcacaaa cccctgatac aaaaccagga
1261 aacaaagctg ttccaggtaa aggtcaagca ccacaagcag gtacaaaacc taaccaaaac
1321 aaagcaccaa tgaaggaaac taagagacag ttaccatcaa caggtgaaac agctaaccca
1381 ttcttcacag cggcagccct tactgttatg gcaacagctg gagtagcagc agttgtaaaa
1441 cgcaaagaag aaaactaagc tatcactttg taatactgag tgaacatcaa gagagaacca
1501 gtcggttctc tcttttatgt atagaagaat gaggttaagg agaggtcaca aactaaacaa
1561 ctcttaaaaa gctgaccttt actaataatc gtctttgttt tataatgaaa acattaacga
1621 aataatttat taaggagaga atactaatga atattagaaa taagattgaa aatagtaaaa
1681 cactactatt tacatccctt gtagccgtgg ctctactagg agctacacaa ccagtttcag
1741 ccgaaacgta tacatcacgc aattttgact ggtctgggga attc
//
LOCUS BSUSACP 3518 bp ds-DNA BCT 27-JUN-1990
DEFINITION B.subtilis sacT gene, complete cds, enzyme II (sacP) gene (of the
sucrose phosphotransferase system), complete cds, and sucrase gene,
5' end.
ACCESSION J03006 M33761
KEYWORDS enzyme II; sucrase; sucrose transferase system.
SOURCE B.subtilis (strain Marburg 168) DNA, clone pBSG8-10.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1987 to 3518)
AUTHORS Fouet,A., Arnaud,M., Klier,A. and Rapoport,G.
TITLE Bacillus subtilis sucrose-specific enzyme II of the
phosphotransferase system: Expression in Escherichia coli and
homology to enzymes II from enteric bacteria enzymes II from
enteric bacteria
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 8773-8777 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2070)
AUTHORS Debarbouille,M., Arnaud,M., Fouet,A., Klier,A. and Rapoport,G.
TITLE The sacT gene regulating the sacPA operon in B.subtilis shares
strong homology with transcriptional antiterminators
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and printed copy of sequence for [1] kindly provided by
A.Fouet, 16-SEP-1987. Draft entry and computer-readable sequence
for [2] kindly submitted by M.Debarbouille 18-APR-1990.
Author address: M.Debarbouille
Institut Pasteur
25-Rue du Dr. Roux
75724
Paris Cedex 15
FRANCE
Emr:DEDONDER%PASTEUR.BITNET@CUNYVM.CUNY.EDU
FEATURES from to/span description
pept 136 897 sacT protein
pept 2008 3390 enzyme II
pept 3387 > 3518 sucrase
BASE COUNT 1013 a 712 c 867 g 926 t
ORIGIN
1 ctttacatgg cggtcaaaag gaggtaaaat cccagtcaaa agcaaattgg cgggagagat
61 aacctcttga aaatctataa agtattaaac aacaatgcgg ctttaataaa agaggatgat
121 caggaaaaaa tcgtgatggg accgggaatc gcttttcaaa aaaagaaaaa tgatctcatc
181 cctatgaata aagtggaaaa gatctttgtc gtgcgcgacg agaatgaaaa gttcaaacaa
241 atcctgcaaa cactgccgga ggagcatata gaaatagctg aggatatcat cagctatgcg
301 gaaggagagc tcgcggcgcc gctgagcgat cacattcata tcgcgctttc cgaccatttg
361 tcctttgcga tcgaaaggat tcaaaatggg ctgcttgtgc agaataaatt gctgcatgag
421 ataaaggcgc tctataaaaa agaatatgag atcggcctgt gggcgatcgg acatgtaaaa
481 gagacattgg gcgtgtctct gcctgaagat gaagcgggtt atattgccct tcacatccat
541 acggcgaaga tggatgcgga gagcatgtat tcagcgctga agcatacgac catgatcaaa
601 gaaatgatag agaaaataaa acaatacttt aatcgaaagg tggatgaaaa cagcatttcc
661 tatcaacgcc ttgtcacgca tttgcgatac gcggtcagcc ggttggaatc aaatgaagcg
721 ctccatcgta tggacgagga gatgctttat ttcatccaaa aaaagtattc attcgcctat
781 caatgtgcgc tggagctggc ggagtttttg aaaaatgaat atcaattaca tttgccggaa
841 tccgaggccg gctatatcac gctgcatgtc cagcgtcttc aagatctctc ggaataaccg
901 ctttgacttg cagggagtga tctctggaag tttttttatt gatcaggatt cagtgtgtag
961 tacatccagc ccatcataac tattcccgcg gtcagattgc cgagcgtaac ggggattaag
1021 tttctgactg ctcccatcag tgtcactgta tcagggtgct cgatcaaaag cgagatggcg
1081 aatgtgcaca tattggcaat gctgtgttca aagccggaaa taaagaagca gaaaacgaaa
1141 agcatcatgg taaaaagctt tgctccttcc cctttgagag acattggaat gaaaaaggcg
1201 aggcacacaa gccaattgca cagcattcct ctgaaaaaca attcggaagc cggcggctcc
1261 attttgtgct ctgccaaatg aatcaaaaag gaatgaacag aaggctcctc aaaaagtccc
1321 gtcgcgctga tcaggatggc aaacagaatg gcgccgatta aattgccggc atagcttgac
1381 atccacaagt atagggtgtc gcgccagctg atttttttcc ggagcgcggt ataggtgaaa
1441 taaaaggtgt tgccggtaaa taaatctccg cctccgtaag caatcatcag aatagcggcc
1501 ccgaaagtga cagcggctgc cggaaaggca aacggagaat cagccataaa gaaatagctg
1561 cctgttttgc ttgcggctgt aatgccaaaa ccaataaata tactggccaa aatggaccga
1621 agcacataac ggattttgct tgaagcgaat atgttttgtt ttttcaaagc atactgttca
1681 accttttgta atgcttgagt ttccattgtc ttttcctgct ttctttcatg attgagcata
1741 gtatggcacg atattgtgaa gtatttcaca attaattttt acgaattttt gtctacatgt
1801 tatatcaaac tatttaacta aaaaaatagt tgacgaaaac gctatcatga tttatgatga
1861 aagcgtattc ttaatctgaa taagcgggat tgtgactggt aaagcaggca agacctaaaa
1921 tttgcgtaaa tgaaaaagga tcgctgtgtc ctttattcgt tggcgaattt taggtctttt
1981 ttgtttaaat aaaaggggga gaaatagatg gattacaaag agactgcaaa acgcctcatt
2041 gagcttctcg gagggaaaga aaatattatc agcgcggctc attgtgcaac aagactgcgt
2101 ttagtgatga aagatgaatc aaagatagat caagcacaag tagaagagct tgatggcgtt
2161 aaaggggctt tcagcagctc tggccagtac cagatcattt tcggaacagg ccttgtgaat
2221 aaagtattcg atgccttttc taaagaagct gatatcgagc gtgaagaaca cgtcaatcat
2281 caggatgcgg caaaagaaaa gctgaatccc gctgcgagat ttgcgaaaac gctttctaat
2341 atttttgttc caatcatccc agctattgta gccagcggcc tattaatggg attgctgggt
2401 atgatcaatg cgtttcattg gatgagcaag gattctgcgc ttttgcagct gcttgatatg
2461 ttttcaagtg cagcattcat tttcttgccg attctaatcg gggtcagcgc ttcaaaagag
2521 tttggcagca atccatactt gggagcggtc atcgggggaa tcatgattca tccgaatctt
2581 ttaaatccat ggggattggc ggaagaacaa ctgattacat gcatcttttc ggatttgata
2641 tcgctcttct cggctacagg gaactgttat ccctgtcctg cttgcggtgt atgtgatgag
2701 caaggtagag aaatggacga gaaaagtggt tccacatgcg gtgatttgct tgttacacca
2761 tttgttactg tgattgtcac cgggtttgtt gcttttatcg cgattggccc tctaggcaga
2821 gcgcttggct ccggcattac ggttgcttta acttatgtat atgaccatgc cggttttgtc
2881 gcaggtctga ttttcggggg cacgtattca ctcatcgtgc tgacaggcgt tcatcacagc
2941 ttccatgcga ttgaggcagg gctaattgca gatataggca aaaactactt gctgccgatc
3001 tggtcaatgg cgaatgtcgc acagggcggg gcaggtcttg ccgtcttctt tatggcgaaa
3061 aaggcaaaga caaaagaaat cgcgcttccg gcagcatttt ccgcttttct cggcattact
3121 gagcccgtca tattcggagt caatctgcgc taccgaaaac cgtttatcgc cgcgatgatc
3181 ggaggcgcat tgggtggggc atatgtcgtc tttacacatg tagctgcaaa cgcttacggg
3241 ttaacgggaa ttccgatgat cgccattgcg gctccgtttg gcttcagcaa cttgattcac
3301 tatctgattg gcatggcgat tgcggcagtg tcagcgttca ttgcagcatt tgtcatgaag
3361 attaatgagg atgaggagag gaaaaaatga cagcacatga ccaggagctt cgtcgccggg
3421 cttatgaaga agtggagaaa aaagagccca tcgctaacag cgatccgcac cgccagcatt
3481 ttcatatcat gccgccggtt gggctgctga atgacccg
//
LOCUS MUSFGFR 2983 bp ss-mRNA ROD 27-JUN-1990
DEFINITION Mouse FGF receptor mRNA, complete cds.
ACCESSION M33760
KEYWORDS FGF receptor.
SOURCE Mouse cell line NIH3T3, cDNA to mRNA, clone A-15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2983)
AUTHORS Mansukhani,A., Moscatelli,D., Talarico,D., Levytska,V. and
Basilico,C.
TITLE A murine FGF receptor expressed in CHO cells is activated by both
bFGF and K-FGF
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Basilico, 22-MAR-1990.
FEATURES from to/span description
pept 44 2245 FGF receptor precursor
sigp 44 113 FGF receptor signal peptide
matp 114 2242 FGF receptor
BASE COUNT 699 a 834 c 799 g 651 t
ORIGIN
1 gaattccgga gccttgttac caacctctaa ccgcagaact gggatgtggg gctggaagtg
61 cctcctcttc tgggctgtgc tggtcacagc cactctctgc actgccaggc cagccccaac
121 cttgcccgaa caagatgcac tcccatcctc ggaagatgat gacgacgacg atgactcctc
181 ctcggaggag aaagagacgg acaacaccaa accaaaccgt aggcctgtag ctccctactg
241 gacatcccca gagaaaatgg agaagaaact gcatgcggtg cccgctgcca agacggtgaa
301 gttcaagtgc ccgtcgagtg ggacacccaa ccccactctg cgctggttga aaaatggcaa
361 agagtttaag cctgaccacc gaattggagg ctacaaggtt cgctatgcca cctggagcat
421 cataatggat tctgtggtgc cttctgacaa gggcaactac tcctgcatcg tggagaatga
481 gtatgggagc atcaaccaca cctaccagct tgacgtcgtg gaacgatctc cgcaccgacc
541 catccttcag gcagggctgc ctgcaaacaa gacagtggcc ctggcgagca atgtggagtt
601 catgtgtaag gtgtacagcg atccgcagcc tcacattcag tggctgaagc acatcgaggt
661 gaacgggagt aagatcgggc cagacaactt gccgtatgtc cagatcctga agactgctgg
721 agttaatacc accgacaagg aaatggaggt gcttcaccta cggaatgtct cgtttgagga
781 tgcgggggag tatacgtgct tggcgggtaa ctctatcgga ctctcccatc actctgcatg
841 gttgaccgtt ctggaagccc tggaagagag accagctgtg atgacctcac cgctctacct
901 ggagatcatt atctactgca ccggggcctt cctgatctcc tgcatgttgg gctctgtcat
961 catctataag atgaagagcg gcaccaagaa gagcgacttc catagccaga tggctgtgca
1021 caagctggcc aagagcatcc ctctgcgcag acaggtaaca gtgtcagctg actccagtgc
1081 atccatgaac tctggggttc tcctggttcg gccctcacgg ctctcctcca gcgggacccc
1141 catgctggct ggagtctccg aatatgagct ccctgaggat ccccgctggg agctgccacg
1201 agacagactg gtcttaggca aaccacttgg cgagggctgc ttcgggcagg tggtgttggc
1261 tgaggccatc gggctggata aggacaaacc caaccgtgtg accaaagtgg ccgtgaagat
1321 gttgaagtcc gacgcaacgg agaaagacct gtcggatctg atctcggaga tggagatgat
1381 gaaaatgatt gggaagcaca agaatatgat caaccttctg ggagcgtgca cacaggatgg
1441 tcctctttat gtcattgtgg agtacgcctc caaaggcaat ctccgggagt atctacaggc
1501 ccggaggcct cctgggctgg agtactgcta taaccccagc cacaaccccg aggaacagct
1561 gtcttccaaa gatctggtat cctgtgccta tcaggtggct cggggcatgg agtatcttgc
1621 ctctaagaag tgtatacacc gagacctggc tgctaggaac gtcctggtga ccgaggataa
1681 cgtaatgaag atcgcagact ttggcttagc tcgagacatt catcatatcg actactacaa
1741 gaaaaccacc aacggccggc tgcctgtgaa gtggatggcc cctgaggcgt tgtttgaccg
1801 gatctacaca caccagagcg atgtgtggtc ttttggagtg ctcttgtggg agatcttcac
1861 tctgggtggc tccccatacc ccggtgtgcc tgtggaggaa cttttcaagc tgctgaagga
1921 gggtcatcga atggacaagc ccagtaactg taccaatgag ctgtacatga tgatgcggga
1981 ctgctggcat gcagtgccct ctcagagacc tacgttcaag cagttggtgg aagacctgga
2041 ccgcattgtg gccttgacct ccaaccagga ttatctggac ctgtccatac cgctggacca
2101 gtactcaccc agctttcccg acacacggag ctccacctgc tcctcagggg aggactctgt
2161 cttctctcat gagccgttac ctgaggagcc ctgtctgcct cgacacccca cccagcttgc
2221 caacagtgga ctcaaacggc gctgactacc aaccctgtcc cagttttctc ccattccgtc
2281 gtcacccgtg ccctcaccca caaatcccct gttggacaca ctgccttctc ctcctccttg
2341 ccgctggcaa gagccagtgc ctgactgaag gccttcctgt gttgtggcct tccccctcca
2401 tcacccccaa gacccctctt ctccctcttc ttagcctgct gtgtgagaga ggagccaaga
2461 ggcaggtgct tgccgacggc cgcatcctcc ttcccaggtg ttggaccaag acccgccccg
2521 ctgctggcac tgcttggagg tgtgcagagc ggaagcaagt ggagacgtat ccggggcatt
2581 cctgttgacc catcagcccc ttctgttctg gcggcagggc cttggggctc ctggaagccg
2641 tgaggtttct gtttaggcct taaccgaagg caacctctgc tccagatgga tggtaccagt
2701 agcttcttaa ttccaatact aatttgcttt gctgaccaaa tacctgcctg gtaccagaag
2761 acagggaggc agagactggg agccgtgatg tgcccttggc tgagcctaga cttggggctc
2821 tgtacatagc tatgaagaaa aacacaaagt gtataaatct tgagatatat ttacatgtct
2881 ttttaaaaag ggtcgttact agagatttac catgggggag acgcccaggg tagcatccgt
2941 tgctatatat taaaaacaca aacgaacaaa aaaaaaagaa ttc
//
LOCUS BOVTCRA01 402 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, leader sequence,
variable region, and joining region.
ACCESSION D90010
KEYWORDS T-cell receptor; alpha chain; joining region; leader sequence;
variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA52.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 402)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept 1 > 402 T-cell receptor alpha chain, leader sequence,
variable region and joining region
BASE COUNT 114 a 111 c 91 g 86 t
ORIGIN
1 atgatgaaat cctcaagagt tttaccagtg atcctgtggc ttcagttgat ctcggtgagc
61 agccagcaga acacagtgga gcagagtcct gcgtctctgc ccgtcccaga gggagaggcc
121 gcctctctcg gctgcactta ttctgacagt gcttctcagt acttcatatg gtacagacag
181 taccctggga aaggccctga gtttctgctg caagtgtatg ccaacaaaga caaagaggaa
241 ggaaaattta cagcacagtc caataaaacc agcaagcatg tctccctgcg tatcagagac
301 tccgagccca gcgactcagc cacctacctc tgtgcagctt ctaactacca aggcagtcaa
361 ttaaactttg gaacaggaac tagactcacc atcacagcca aa
//
LOCUS BOVTCRA02 408 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, leader sequence,
variable region, and joining region.
ACCESSION D90011
KEYWORDS T cell receptor; alpha chain; joining region; leader sequence;
variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA29.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 408)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinary Public Health Obihiro Univ. of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept 1 > 408 T-cell receptor alpha chain, leader sequence,
variable region and joining region
BASE COUNT 115 a 109 c 95 g 89 t
ORIGIN
1 atgatgaaat cctcaagagt tttattagtg atcctgttgg cttcagttga tcttgagcag
61 ccagcagaac acagtggagc agagcctgcg tctctgcccg tcccagaggg agccgctgca
121 tctctcggct gcacttattc tgacagcaat tctctgtact tcacatggta cagacagtac
181 cctgggaaag gccctgagtt tctgctgcaa gtgtatgcca acaacaacaa agaggaagga
241 aaatttacag cgcagtccaa taaaaccaac aagcatgtct ccctgcgtat cagagactcc
301 gagcccagcg actcagccac ctacctctgt gcagttgata cgatatccac cactgcaggg
361 accaagctaa cctttggaga aggaaccagg ctgatagtca agctgaaa
//
LOCUS BOVTCRA03 411 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, leader sequence,
variable region, and joining region.
ACCESSION D90012
KEYWORDS T cell receptor; alpha chain; joining region; leader sequence;
variable region.
SOURCE Bovine T-cell, cDNA to mRNA, clone BTA44.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 411)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept 1 > 411 T-cell receptor alpha chain, leader sequence,
variable region and joining region
BASE COUNT 104 a 114 c 101 g 92 t
ORIGIN
1 atgatgaaat cctcaagagt tttattggtg atcctgtggc ttcagttgat cttggcgagc
61 acccagcaga acacagtgga gcagagtcct gcgtctctgc ccgtcccaga gggagccgcc
121 gcatctctcc gctgcactta ttctgacagc acttctcggt acttcacatg gtacagacag
181 taccctggga aaaggcctga gtttctcgtg caagtgtatg ccaacaacaa caaagaggaa
241 ggaaaattta cagcgcagtc caataaaacc agcaagcacg tctccctgcg catcagagac
301 tctgagccca gcgactcagc cacctacctc tgtgcagtga gcaatgatta tgggggtgct
361 gccaaccagg tcatctttgg gtctggcacg ctgctttctg tcaagccaaa a
//
LOCUS BOVTCRA04 393 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable
region, joining region.
ACCESSION D90013
KEYWORDS T cell receptor; alpha chain; joining region; leader sequence;
variable region.
SOURCE Bovine T-cell, cDNA to mRNA, clone BTA43.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 393)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinary Public Health Obihiro Univ. of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
Japan 080 Phone: 0155-48-5111 x281
FEATURES from to/span description
pept 1 > 393 T-cell receptor alpha chain, leader sequence,
variable and joining region
BASE COUNT 111 a 88 c 95 g 99 t
ORIGIN
1 atgaagacac aaagaagagt cttgctgagt ctcttgtgga tacagatttg ctgcctcaga
61 gtgcagatga aggtggagca gagtccgggg gttctgactc tccaagaggg gagaaattcc
121 tctctgatat gcaattattc tatttccatg accagtgtgc agtggttcca acaaaatcct
181 gatggacgcc tcatctcctt gttttatata gcttcaggaa tgcagcagaa aggaagactg
241 aaatccacca ttaatagtaa ggagcgttac agtcaactct acatcagaga ctcccagcct
301 ggggactcag ccacttactt ctgtgctgcg atcaggactg acaattccag aatcttcttt
361 ggaactggaa cccaggtggt ggtaaagccc aaa
//
LOCUS BOVTCRA05 393 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable
region, and joining region.
ACCESSION D90014
KEYWORDS T cell receptor; alpha chain; joining region; leader sequence;
variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA20.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 393)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept 1 > 393 T cell receptor alpha chain, leader sequence,
variable and joining region
BASE COUNT 96 a 97 c 93 g 107 t
ORIGIN
1 atgtccttgg tgttcacctt gatgcttgag atgctcctgt ttctgagagc tggagcccag
61 tcagtgaccc agcctgatga ccacattgct gtctctgaag agccgtctgg agctgaagtg
121 caactactca tcttctgttt caccgatctc ttctggtaca tacagtaccc caaccaagga
181 ctccagcttc tcctgaagta cgtgtctgga gacaatcttg tttcaggcat caaaggtttt
241 gaggctgaat ttaggaacag tgagacgtct ttccacctga ggaaaatacc agctcattgg
301 aaagactcgg ccaagtactt ctgtgctctg agtaggactg acaattccag aatcttcttt
361 ggaactggaa cccaggtggt ggtaaagccc aaa
//
LOCUS BOVTCRA06 393 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable
region, and joining rgion.
ACCESSION D90015
KEYWORDS T cell receptor; alpha chain; joining region; leader sequence;
variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA25.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 393)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept 1 > 393 T cell receptor alpha chain, leader sequence,
variable and joining region
BASE COUNT 102 a 91 c 113 g 87 t
ORIGIN
1 atgaagagaa aatggggagc cctgctgggg tttctgtggg tccagatttg cttggtgagg
61 ggagtaaagg tggagcagag tccttcagtc ttgagcctgc aggagggagc caactctact
121 ctgaggtgca atttttctga cacagtggac agtgtgcagt ggttccagca gaatcccgga
181 ggagccctca ccaccctgtt tttcatagct tcagggacga agaagaatga aaggatgagt
241 tccacagtga attctaagga gcggtatagc accctacaca tcacagcctc ccagctggaa
301 gacgcagcca cctacctctg tgcggtcgat ctcggttcag gaaacaggca acttgtcttt
361 ggaaaaggca caagacttgc tgtgactcca aaa
//
LOCUS BOVTCRA07 399 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable
region, and joining region.
ACCESSION D90016
KEYWORDS T cell receptor; alpha chain; joining region; leader sequence;
variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA15.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 399)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept 1 > 399 T-cell receptor alpha chain, leader sequence,
variable and joining region.
BASE COUNT 91 a 90 c 103 g 115 t
ORIGIN
1 atggctttgc agagcactct ggaagcagtg tggctgggtg tcctcctcag ctctctctgg
61 aaggttgcag agagcaagga gcaagtattt cagtctccca ctgtggtctc tttggaggga
121 gctgtggcag aaatttcctg taatcactct atatccaatg tttatgactt cctctggtac
181 tttcacttcc caggatttgc accaagactc ctcattaagg gcttagggcc ttctcaacag
241 ggacgctaca acatgacata tgagaggttc tcttcatccc tgctcatcct ccaggtgcag
301 acggcagatg caggggttta ctattgtgct ctgaaggcga tttcaagtgg ccagaagttg
361 gtttttggaa gtgggaccat gttaaaggtg aatcttaaa
//
LOCUS BOVTCRA08 348 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90017
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA49.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 348 T-cell receptor alpha chain, variable and
joining region. (AA at 1)
BASE COUNT 99 a 90 c 75 g 84 t
ORIGIN
1 ttgagtcttg ctaagacctc ccagcccatc ttcattgact catacgaagg acaagaagtg
61 aacatatctt gtaaccacac cacaatacgt acaatgagca atatcttgcg gtatgtgaag
121 gttccccatc cagggaccaa atttatcatt caaggatata agacaaacgt ggaaaatgaa
181 gtggcatctc tgttaattcc tcctgacagg aagttcagca ccctgagcct gccccaggct
241 tccctgagag acaccgctgt gtactactgc atcgtgagag atcaatatgg gggtgctgcc
301 aaccagctca tctttgggac tggcacgctg ctttctgtca agccaaaa
//
LOCUS BOVTCRA09 306 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90018
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA28.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 306)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 306 T-cell receptor alpha chain, variable and
joining region. (AA at 1)
BASE COUNT 85 a 76 c 78 g 67 t
ORIGIN
1 ttgcttctgc aagaaggaga gaacttcacc acatactgca attcctcaag cattttaagc
61 agcttacagt ggtacaagca gagccctggg ggcagtcctg tcgtcttgat gatattagct
121 aagggtggag aagtgaagac agagcagaga ctgacaggtc agtttggaga gaccaaacag
181 cacagctccc tgcacctcac ggctgcccag ctctcagatg caggaaccta cttctgcgtc
241 agggatcgga caaacctctg gcaaaattgt ctttggaaga gggactcagc ttcatgttct
301 cccaaa
//
LOCUS BOVTCRA10 282 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90019
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA42.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 282)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 282 T-cell receptor alpha chain, variable and
joining region (AA at 1)
BASE COUNT 80 a 70 c 70 g 62 t
ORIGIN
1 accagctact gcaattcctc aagcatttta agcagcttac agtggtataa gcagaggtct
61 gggagcagtc ctgtcctctt gatgatatta gctaagggtg gagaagtgaa gacagagcag
121 agactgacag gtcggcttgg agagaccaaa cagcacagct ccctgcacct cacggctgcc
181 cagctctcag atgcaggaac ctacttctgc acggggattc tgagcacagg ttatcagaaa
241 ttcgtatttg gaactggcac ccaacttttg atcaacccaa aa
//
LOCUS BOVTCRA11 258 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90020
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA13.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 258)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 258 T-cell receptor alpha chain, variable and
joining region (AA at 1)
BASE COUNT 81 a 70 c 54 g 53 t
ORIGIN
1 agacacagct tcagactact tcccttggat aaacaaggac ctgggaaagg tccccaactc
61 cttatagcta ttcgttcaaa tatgggtgaa aaaaaagacc agagactgac tgttttattg
121 aataagacca ccaaacatct ctccctgcac atcgcaacca tccaacctgg agactcagct
181 gtctacttct gtgcagcaag aggtgcaggc cagaagctgg tattcgggcg aggaaccagg
241 ctgaccatca accccaaa
//
LOCUS BOVTCRA12 189 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90021
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA18.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 189)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 189 T-cell receptor alpha chain, variable and
joining region. (AA at 1)
BASE COUNT 59 a 52 c 33 g 45 t
ORIGIN
1 tttaacattc cttcaggaac aaaacaagat ggaagattaa acgccacgac aatccctaaa
61 gaacgtcgca gctcactgca catttcctct tcacagacca cagactcagg cacttacttc
121 tgtgctggtg aacaaacctc tggcaaaatt gtctttggaa gagggactca gcttcatgtt
181 ctccccaaa
//
LOCUS BOVTCRA13 171 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90022
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA40.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 171)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 171 T-cell receptor alpha chain, variable and
joining region. (AA at 1)
BASE COUNT 39 a 36 c 50 g 46 t
ORIGIN
1 gatctcgatg cggcaagggg atccagtgac tgtgaagtga ccctatacag tttcggggag
61 cccattatct tttctgggag tgactctacc ttgtacttct gtgctgtgag cggggattat
121 gggaacagac tcacttttgg aaatgggact cgagtgctgg tcacaccaaa a
//
LOCUS BOVTCRA14 156 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90023
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA15.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 156)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 156 T-cell receptor alpha chain, variable and
joining region. (AA at 1)
BASE COUNT 39 a 36 c 31 g 50 t
ORIGIN
1 tttttttttt cattccttag acgctctgat gcacacagtt acctccttct gaaggaactc
61 cacatgaaag actttgcctc ttacctctgt gttgtgatgg atggcaacta tcagtggatc
121 tggggctctg ggaccaaact aattataaag ccagaa
//
LOCUS BOVTCRA15 156 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90024
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA21.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 156)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 156 T-cell receptor alpha chain, variable and
joining region. (AA at 1)
BASE COUNT 49 a 34 c 27 g 46 t
ORIGIN
1 tttttttttc aaaagggaaa aaaaatcatc agcctcacca tctcttcctt gtacctggcc
61 gactcagcaa agtatttctg tgctcttcgg gatagtggaa ctagcagcta taaacagaca
121 tttggaaaag gcactctctt aattgtgaat ccaaaa
//
LOCUS BOVTCRA16 147 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90025
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA26.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 147)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Dep. of Veterinart Public Health Obihiro University of
Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 147 T-cell receptor alpha chain, variable and
joining region (AA at 1)
BASE COUNT 45 a 31 c 38 g 33 t
ORIGIN
1 aagaaggatg aaaactactt ccacctgatg aaagcttcag cccattggag tgattctgcc
61 aagtacttct gtgctctgtg tgatccatgg ggaaacacag gaagactcac ctttgggcag
121 gggacagtgt tgcaagtaaa accagaa
//
LOCUS BOVTCRA17 135 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90026
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA27.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 135)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Department of Veterinary Public Health Obihiro University
of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
080
FEATURES from to/span description
pept < 1 > 135 T-cell receptor alpha chain, variable and
joining region (AA at 1)
BASE COUNT 36 a 40 c 34 g 25 t
ORIGIN
1 aggagcggaa tgtccctaca catcacagcc tcccagctgg aagacgcagc cacctacctc
61 tgtgtggtgg acaggactgg tgtctacaat aagctgacct ttggaaccgg aaccaggctg
121 actgtacact cacaa
//
LOCUS BOVTCRA18 123 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90027
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA36.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 123)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Department of Veterinary Public Health Obihiro University
of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
080
FEATURES from to/span description
pept < 1 > 123 T-cell receptor alpha chain, variable and
joining region (AA at 1)
BASE COUNT 34 a 39 c 28 g 22 t
ORIGIN
1 gagcggtata gcaccctaca catcacagcc tcccagctgg aagacgcagc cacctacctc
61 tgtgcggtgg aggaaaacac actgatcttt ggctcaggaa ccactttgac agtcaagcct
121 caa
//
LOCUS BOVTCRA19 123 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90028
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA41.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 123)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Department of Veterinary Public Health Obihiro University
of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
080
FEATURES from to/span description
pept < 1 > 123 T-cell receptor alpha chain, variable joining
region. (AA at 1)
BASE COUNT 23 a 34 c 32 g 34 t
ORIGIN
1 tttttccttt ttgcggacca cagactcagg cacttacttc tgtgctgtgc agctggaagg
61 ggggtgctgc caaccagctc atctttggga ctggcacgct gctttctgtc aagccaagca
121 caa
//
LOCUS BOVTCRA20 51 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION D90029
KEYWORDS T cell receptor; alpha chain; joining region; variable region.
SOURCE Bovine T cell, cDNA to mRNA, clone BTA46.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 51)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Department of Veterinary Public Health Obihiro University
of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
080 Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 > 51 T-cell receptor alpha chain, variable and
joining region (AA at 1)
BASE COUNT 12 a 16 c 11 g 12 t
ORIGIN
1 gccaaccagc tcatctttgg gactggcacg ctgctttctg tcaagccaaa a
//
LOCUS BOVTCRA21 955 bp ss-mRNA MAM 27-JUN-1990
DEFINITION Bovine T-cell receptor alpha chain, constant and 3' untranslated
region.
ACCESSION D90030
KEYWORDS T cell receptor; alpha chain; constant region.
SOURCE Bovine T cell, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 955)
AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M.
TITLE Sequence analysis of bovine T-cell receptor alpha chain
JOURNAL Immunogenetics 31, 57-60 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Naotaka
Ishiguro Department of Veterinary Public Health Obihiro University
of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
080 Japan Phone: 0155-48-5111 x281
FEATURES from to/span description
pept < 1 423 T-cell receptor alpha chain, constant region
signal 935 940 polyadenylation signal
BASE COUNT 223 a 288 c 211 g 233 t
ORIGIN
1 gtcaaggacc ccaaccccac tgtgtaccag ctgagaagcc cccaatccag tgacacttct
61 gtctgcctat tcaccgattt tgattcaaat caagtcaaca tggaaaaaat catggggtcc
121 gaggggagca cggtgcacaa aacaaacagc accgtgctca acatggagat cctgggatcc
181 aagagcaacg ggatagtgac ttggggaaac accagcgatg ctggatgcgc atacaccttc
241 aacgagacca ttcccttcgc ctccagcttg gaaatctcct gtaatgccaa gctggtagag
301 aaaagctttg aaacagatat aaacctaaac tcccaaaacc tgtcagtgat agtgttccgc
361 atcctcctcc tgaaggtggt cgggtttaat ctgctcatga cgctgcggct ctggtccagt
421 tgaggtcacc gacaacctga gagccccgtg ctccctcgcc ccttgctcct cgtcgccctt
481 cctctccctc ttcaagcaga gaggcgcact ctctgccccc atggatgaga aggctccctc
541 ctcctctctg gcctggctgg ccataacacc aactggatcc tccacatact ggtgatcaag
601 atgctgacaa gctgcccagc acggctgcca cccactctct tcctcactgc tgcttgtcac
661 tgcctggcat tcttggcaaa gtcggggggc tccccacccc caccccccag agactgcctc
721 tgatgtccca ctggatggtg gatccccagt gggttctcct gggctctagc tcctggagaa
781 tgttgtgtgt tgtttatatt tttttacata gtgttcataa aaaaatatat atcacccttt
841 tccccaacat gtggggaaaa attatctttt cacgatctag gccctgctct tctgtgtatc
901 cgagccacat tgtatattct gctgccacgg cttcaataaa agcgatttgg aagag
//
LOCUS SUSFCK 3985 bp ss-mRNA INV 27-JUN-1990
DEFINITION S.purpuratus flagellar creatine kinase mRNA, complete cds.
ACCESSION M33763
KEYWORDS creatine kinase; flagellar creatine kinase.
SOURCE S.purpuratus sperm, cDNA to mRNA, clone WD2.2.
ORGANISM Strongylocentrotus purpuratus
Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa;
Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE 1 (bases 1 to 3985)
AUTHORS Wothe,D.D., Charbonneau,H. and Shapiro,B.M.
TITLE The phosphocreatine shuttle of sperm: Flagellar creatine kinase
results from a gene triplication
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.D.Wothe, 18-APR-1990.
Author address: D.D.Wothe
University of Washington
Dept. of Biochemistry
SJ-70
Seattle, WA 98195
E-mail:WOTHE@toby.acs.washington.edu
FEATURES from to/span description
pept 95 3619 flagellar creatine kinase (E.C. 2.7.3.2)
signal 3961 3966 poly-A signal
BASE COUNT 1184 a 923 c 910 g 968 t
ORIGIN
1 ccctaaactt caatcacact tctaatcaag aaaggaacaa atttgtatta attatttcat
61 tttctacata tgcattgtaa tcagtgaccc agcaatggga tgcgcagcat cgtcacaaca
121 aacgacggcc acaggtggtc aaccagctgc tggtgaaaag gcaaacccgg cccccgctaa
181 taacaatcca aacgcagcaa acaaagcaga aacgacagga gcagctgaag aactcactaa
241 agaatctgaa ccatttgtgg aaccggatcc caattaccct gacctaagca agcacaacaa
301 ttacctggca gaatctttaa cgccttccat ctacaacaaa atctgtaacc ttcgcacact
361 ttctggttac tctgtggatg gttgcatgca gaccggcgtg gacaatccag gacatccttt
421 catcaagact gtcggattgg tggctgggga tgaagagtgt tacgatctct ttgctgacct
481 ctttgatcca actattgaca aaagacacaa cggatatccc cgcaatgcaa agcacacaac
541 tgatctcaac cctgatcatc tcaagggtgg tgatgacttt gatccaaagt atgtcctttc
601 atgcagagtt cgtacaggcc gctgcattag agggtacggc ctcccaccac attgcacgag
661 ggctgaacgt cgagacgttg agaaagtttg caaagacgca ctggcaactc tagatgggcc
721 actcaaaggc acgtattacc cactaacggg gatgacagag gaaatgcagg acaaactaat
781 tgcagatcat ttcctattcg ataagcccgt ttctcctttg cttatgtcag ctcgcatggc
841 ccgtgattgg cctgatggac gtgggatctg gcacaatgcc gataaaaact tcctggtttg
901 gataaatgag gaagatcaca ctcgggttat ttccatggaa acaagtggca acatgaagaa
961 tgtctttaaa cgattctgta atggtctcaa caaagtcgag aatgctctca aagcaaaagg
1021 gtatgaattc tcatggaatg agcatcttgg gtatgtgctc acttgtccat ccaaccttgg
1081 tacaggtgta cgtgcaggag tccacatcaa gattcctctt ttcagcaagc acgctggatt
1141 tgaatccatt ctgaagcatt atcgtctcca gaaacgtggc acaggaggcg tggatacggc
1201 atccactgac ggcacgttcg acatttcaaa cctagatcgt cttggcactt ctgaagtaca
1261 acaggttcaa tccgttgttg atggtgtcaa gaagctgatt gagttggaaa aagccttaga
1321 aaagggatca gatatcagcg gccagattcc ccgggaccca gcaatagttc gtgcagaaca
1381 agtgaaagaa ggatatccag acctatccaa acacaacaat catctggccc actgtctgac
1441 atacgacatt tggaaaagtc tcaaggacaa aaagacgcca agtggcttta ctctggatgg
1501 ctgcatccaa acgggagtca tgaaccctgg tcatcctcac atcatgaccg ttggaatggt
1561 tgcaggagat gaggaatcat atgacgtctt cgctgatatc tttgacccgg tcattgatgc
1621 acgacacgga ggttatccca aagatgccgt ccatgtcacc aacataaacc atgcagatct
1681 taaaggagga gataacctgg atcccaaata tgttttgtct tgccgtgttc gcacaggacg
1741 cagcatcatc gggtattctc tgcccccgca ttgcaccgtg gaagagagag ctgccgtaga
1801 gacgatcacg atcggagctc ttgataaatt tgatggagac ttgcaaggaa aatactaccc
1861 actggaagga atgtctgatg aaacacaaac acagctgata gatgaccatt tcctcttcga
1921 taagcctgtt tctcctcttc ttaccgccgc aagaatgcat cgtgattggc cacagggaag
1981 aggaatttgg cataacgaaa acaagaactt cttggtgtgg gtcaatgaag aagatcatat
2041 ccgtgtcatc tctatggaga aggatgggaa tatgcgagcg gtcttcaagc gattctgtga
2101 aggtttgcag aagttcgagc agatgatcaa aaaggatggc aaggagttca tgtggaacaa
2161 acatctgggt tacgtactta cttgtccgtc taacctggga accggcctac gcgctggcgt
2221 acatgtgaaa ctgcctctgc tgagcaagta tccgagattc gaccagatcc taagagccct
2281 ccgacttcag aagagaggta caggtggagt agatactgcc tcaactgatg gaaccttcga
2341 catctcgaac ctcgacaggc ttggatcatc agaagtacag caagttcagt ttgttgtcga
2401 tggcgttgaa cttctcgtcc aaatggagaa gaaacttgaa aagggcgagg acatttttga
2461 tattctacca cagcaatgcc gtcccaaacc accaataaag cctttcagtt atgactaccc
2521 cgatttctct ctgcacaaca actggatgtc taaatgtatg acagaagaaa tctacaacaa
2581 actttgcaac ttgaagacaa aaggtggagt caccctgaat gactgcattc aaactggcat
2641 tgacaatcct ggacatccat acatcatgac tgttggtttg gtagctggag atgaagagtg
2701 ctacgaggtt ttcgctcctc tcttcgaccc ggtgataagt gccaggcacg gcggctacgc
2761 tcttgatgcc aaacacccta ccaacctcaa tgcagcagaa ctcaaaggcg gagatgattt
2821 ggaccctgag ttcgtcctat cttgtcgtgt acgtactggt cgttgcatcc gcggcttggc
2881 tctcccgcca tgttgcacca gagcagaacg tgcggaagta gagaagatta caacagaagc
2941 attatctacc ctaagtggac ctctgaaggg aaagtactac ccactaaccg gtatgacaga
3001 cgaagagcaa gagaagctca tcgaagacca cttcctcttt gacaagcccg tatccccgct
3061 gttgctttgt gccaatatgg cacgtgactg gccccagggt cgtggcatat ggcacaatga
3121 tgagaagaat ttccttgtct gggtcaacga ggaagatcac actcgagtca tctccatgga
3181 gaagagcggg aacatgaagc gagtgtttga gaggttctgc gacggtctca aaaaggtcga
3241 agactccatc aagtccaagg gttaccagtt catgtggaat gagcatcttg gttatgtact
3301 gacatgtcca tccaacctcg ggacgggact gcgtgcaggc gtccatgtca aggttccttt
3361 gttaagtcaa cagaaaatct ttgactctat ccttgatcac atgcgtctgc aaaagcgagg
3421 aacaggaggt gtagacacag cctctactga cggcaccttt gatatctcca actcggatcg
3481 catcggtttc tccgaagtcc atcttgtcca acaactcgta gacggtgtca agctcctcgt
3541 caacctcgaa aaggctctca tgaagggaga agacatcaac agtctcctgc cagagaagct
3601 aagggaagat tcctcataaa gccaggatcc acagcttgca gaatagtggc cgttgtcgcg
3661 caagtgttat aaatgccacc tttagaagat gtagcctaca cattaaaaaa aaatacaatt
3721 agaattgaaa tctataagga cgatttgaat ataccataaa gatttgccca tgattttgaa
3781 tcaaaataac ttcatgctga acttgtaaat gaaaaagtaa tgttagaaaa ctctgtaaat
3841 tgggaagata taaaatttca tggccgtttt tctttctgat gaagtacact aattatatta
3901 atttctctta ttaatatgta tggaaacttt taattatgta aaaacaagga aacgaactga
3961 aatacaattc aaactacttt tcatt
//
LOCUS HAMPPTPCA 2126 bp ss-mRNA ROD 27-JUN-1990
DEFINITION Hamster dihydroorotate synthetase mRNA, partial cds.
ACCESSION M33702
KEYWORDS CAD; aspartate transcarbamoylase; carbamoyl phosphate synthetase;
dihydroorotase; dihydroorotate synthetase;
pyrimidine pathway trifunctional protein complex.
SOURCE Hamster SV40 transformed cell line BHK21, cDNA to mRNA, clone
pCAD142.
ORGANISM Cricetulus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 2126)
AUTHORS Williams,N., Simpson,R.J., Moritz,R.L., Peide,Y., Crofts,L.,
Minasian,E., Leach,S.J., Wake,R.G. and Christopherson,R.I.
TITLE Location of dihydroorotase domain within trifunctional hamster
dihydroorotate synthetase
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.I.Christopherson, 13-APR-1990. Univ. of Sydney, Dept. of
Biochem., NSW, 2006 AUSTRALIA
FEATURES from to/span description
pept < 1 > 2126 dihydroorotase synthetase complex (AA at 3)
(EC 3.5.2.3)
BASE COUNT 415 a 676 c 609 g 426 t
ORIGIN Short arm of chromosome B9.
1 gtcgacggct ttcctccttc gtcaccaagg gctaccgcac gcggcgcctg gctgctgact
61 tctctgtgcc tctcatcatc gacatcaagt gcaccaaact cttcgtggag gccctgggtc
121 agattggccc cgccccgcct ttgaaggttc atgtagactg catgacctcc cagaaactgg
181 tgcggctccc tgggttgatc gacgtccatg tgcaccttcg ggagccaggt gggacacaca
241 aagaagactt tgcctcgggc acagctgctg ccttggctgg gggtgtcacc atggtctgtg
301 ccatgcctaa tacccggccc cccatcattg atgcccctgc tctggccctg gcccagaagc
361 tggcagaggc cggtgcccgc tgtgactttg ccctgttcct tggggcctca tctgaaaacg
421 cggggactct gggtgctgta gccgggtctg cagcaggact aaagctctac ctcaacgaga
481 ccttctccga gctgcggctg gacagtgtgg cccaatggat ggagcatttt gaaacgtggc
541 cttcccacct ccccattgtg gcccatgcag agcggcagag tgttgccgca gtcctcatgg
601 tggctcagct gacccagcgc ccagtgcaca tatgtcacgt ggctcggaag gaagagatcc
661 tgctgattaa aactgcgaag gcacaggggc tgcctgtgac ctgtgaggtc gcaccccacc
721 acctcttcct gaatcgggaa gacttggagc gcctgggacc tgggagggga gaggtccggc
781 ctgagcttgg ttcccgagag gatatggagg ctctgtggga gaatatggct gtcatcgact
841 gctttgcctc agaccacgct ccccataccc tggaggagaa gtgtgggccc aagcctccac
901 ccggcttccc agggctggag accatgctgc cgctcctgct gacagctgtg agtgagggcc
961 ggctcagtct ggatgacctg ctacagcgcc tgcaccacaa cccgcgccgg atcttccacc
1021 tgccccttca ggaggacacc tatgtagagg tggatctgga gcatgagtgg acaatcccta
1081 gccacatgcc cttctccaag gcccgctgga ccccttttga ggggcagaag gtgaagggta
1141 ccatccgccg tgtggtcctg cgaggggagg ttgcttatat cgatggacag gtgttggtac
1201 ccccaggcta tggacaagat gtacggaagt ggcctcaggg ggctgttccc cagccccctc
1261 cttcagctcc tgccaccaca gaaataacca cgacacctga gagaccacgc cgagtcatcc
1321 caggccttcc tgatggccgc ttccacttgc caccccgaat ccacagagcc tctgacccag
1381 gtttgccagc tgaggaaccc aaagagaagc catccaggaa agtagtggag ccagagctca
1441 tggggacccc cgacggtccc tgctaccctg caccgccggt gcctagacag gcatcacctc
1501 agaacctggg ctcttctggc ctactgcacc cacagacttc acccctgctg cactccttag
1561 tgggccaaca catcctgtct gtcaagcagt tcactaagga tcagatgtct catctgttca
1621 acgtcgcgca cacactacgg atgatggtgc agaaagagcg gagccttgac atcctaaagg
1681 gcaaggtcat ggcctccatg ttctacgagg tgagcacccg caccagtagc tcctttgcag
1741 cagccatggc ccggctcggg ggcgctgtcc tcagcttttc agaagccacg tcctccgtcc
1801 agaaggggga atcccttgcc gactctgtgc agaccatgag ttgctacgct gatgtcgttg
1861 tgctccggca ccctcagcct ggagctgtgg agctggcagc caaacactgt cgcagaccag
1921 tgatcaatgc tggggatgga gtcggagagc accctactca ggccctgctg gacatcttca
1981 ctatccggga agagctgggg actgtcaatg gcatgacgat caccatggta ggtgacctga
2041 agcatgggcg cacagtgcac tccctggcct gcctgctcac ccagtaccgt gtgagcctac
2101 gctacgtggc acctcccagc ctgcgc
//
LOCUS TRBALDA 1119 bp ds-DNA INV 27-JUN-1990
DEFINITION T.brucei fructose-bisphosphate aldolase (ald) gene, complete cds.
ACCESSION M19994
KEYWORDS fructose-bisphosphate aldolase.
SOURCE T.brucei brucei (strain 427) DNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1119)
AUTHORS Marchand,M., Poliszczak,A., Gibson,W.C., Wierenga,R.K.,
Opperdoes,F.R. and Michels,P.A.M.
TITLE Characterization of the genes for fructose-bisphosphate aldolase in
Trypanosoma brucei
JOURNAL Mol. Biochem. Parasitol. 29, 65-76 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.A.M.Michels, 03-AUG-1988.
FEATURES from to/span description
pept 1 1119 fructose-bisphosphate aldolase (ald) (EC
4.1.2.13)
BASE COUNT 240 a 334 c 305 g 240 t
ORIGIN
1 atgtccaagc gtgttgaagt tctgcttacc caactccctg cgtacaaccg cctgaagacg
61 ccatatgaag cggagctgat tgaaaccgcc aaaaagatga ccgcccccgg taagggtctc
121 ctcgccgccg atgagtccac tggttcttgc tcgaagcgct ttgcgggcat cggtctcagc
181 aacactgcag agcaccgccg tcagtaccgc gctctcatgc tggaatgcga aggtttcgag
241 cagtacatca gcggtgtcat cctgcacgat gagaccgtgt atcagaaagc taagaccggc
301 gaaacattcc ctcagtacct ccgtcgtcgt ggtgtggtgc ctggcatcaa aaccgattgc
361 ggtctcgagc ccctcgtgga gggtgccaag ggcgagcaga tgactgctgg tctcgacggt
421 tacatcaaac gggccaagaa atactacgct atgggctgcc gcttctgcaa gtggcgcaac
481 gtgtacaaga tccagaacgg caccgtgtct gaagctgttg ttcgtttcaa cgctgaaaca
541 ctcgctcgct acgctatcct ctcccaactc tgcggtcttg tcccgattgt ggagcctgag
601 gtgatgatcg atggcactca cgacattgag acttgccaac gcgtttcgca gcatgtgtgg
661 tcggaggtgg tttctgcact ccaccgccac ggcgttgtat gggagggatg cctgctgaag
721 cccaacatgg tcgtccctgg cgctgaatcc ggcctcaagg ccactgcgga gcaggttgcc
781 gagtacaccg tcaaaaccct cgcccgtgtt attccccctg cgctccccgg tgtgacgttc
841 ctatcaggtg gcctcagtga ggttatggcc tccgagtacc tcaacgctat gaacaactgc
901 ccgttaccac gcccatggaa actgaccttt tcatacgccc gtgcacttca gtccagcgcc
961 ataaagcggt ggggtggaaa ggaatctggt gtcgaagctg gccgccgtgc gttcatgcac
1021 cgcgcaaaga tgaactcact tgcccaactc ggcaagtaca accgtgctga tgacgacaag
1081 gactcgcagt ctctgtacgt tgctggcaac acatactaa
//GenBank-Updates@genbank.bio.net (06/28/90)
LOCUS HUMSODB 8841 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human ornithine decarboxylase gene, complete cds.
ACCESSION M33764
KEYWORDS ornithine decarboxylase.
SOURCE Human sultan myeloma cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 8841)
AUTHORS Hickok,N.J., Wahlfors,J., Crozat,A., Halmekyto,M., Alhonen,A.,
Janne,J. and Janne,O.A.
TITLE Human ornithine decarboxylase-encoding loci: Nucleotide sequence of
the expressed gene and characterization of a pseudogene
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.J.Hickok, 18-APR-1990.
FEATURES from to/span description
pept 3687 3788 ornithine decarboxylase, exon 3 (first
expressed exon; E.C. 4.1.17) /nomgen="ODC1"
/map="2p25" /hgml_locus_uid="LP0064V"
4072 4245 ornithine decarboxylase, exon 4
4452 4624 ornithine decarboxylase, exon 5
4883 5017 ornithine decarboxylase, exon 6
5148 5229 ornithine decarboxylase, exon 7
5406 5489 ornithine decarboxylase, exon 8
6546 6708 ornithine decarboxylase, exon 9
6791 6903 ornithine decarboxylase, exon 10
6997 7211 ornithine decarboxylase, exon 11
7851 7995 ornithine decarboxylase, exon 12
pre-msg 388 8338 ODC1 mRNA and introns
IVS 602 3454 ODC1 intron A
IVS 3565 3669 ODC1 intron B
IVS 3789 4071 ODC1 intron C
IVS 4246 4451 ODC1 intron D
IVS 4625 4882 ODC1 intron E
IVS 5018 5147 ODC1 intron F
IVS 5230 5405 ODC1 intron G
IVS 5490 6545 ODC1 intron H
IVS 6709 6790 ODC1 intron I
IVS 6904 6996 ODC1 intron J
IVS 7212 7850 ODC1 intron K
signal 359 363 TATA box
rpt 2277 2586 Alu-repeat
BASE COUNT 2261 a 1884 c 2279 g 2417 t
ORIGIN
1 aggctctgcc gttcagctgc cgcgggcggg gccggggcct gcggcgtcgt gcgccgtgcg
61 ggaccagttc caggcgggcg agaccgcgca gggcggggcg gggcgaggcg gccgcagggc
121 ggggagggcg gggagaggcg gccgcagggc ggggagggcg gggcgcgaag ccgggggcgg
181 gggccacgcg tggggcaggc ggtgctcggc tcggctgacg tcggcccgcc ggcgccccac
241 cacgtccgcg cgggcccggg ttgccaccgc cggcccccgc ccctcccccg gcggtgtccc
301 ggccggaacc gatcgtggct ggtttgagct ggtgcgtctc catggcgacc cgccggtgct
361 ataagtaggg agcggcgtgc cgtggggctt tgtcagtccc tcctgtagcc gccgccgccg
421 ccgccgcccg ccgcccctct gccagcagct ccggcgccac ctcgggccgg cgtctccggc
481 gggcgggagc caggcgctga cgggcgcggc gggggcggcc gagcgctcct gcggctgcga
541 ctcaggctcc ggcgtctgcg cttccccatg gggctggcct gcggcgcctg ggcgctctga
601 ggtgagggac tccccggccg cggaggaagg gagggagcga gggcgggagc ggggcgggct
661 gcgggcccgg gcccgggcac gtgtgcggcg cgcctcgccg gcctgcggag acacgtggtc
721 gccgagcggg ccacgacctt gaggcgccgc ttcctcccgg cccggggttc tcccgcggct
781 ggataagggt gatccgggcg cctcgttctg cccccgtctt cacagctcgg ggctggaggg
841 gcctagggga gacccacccg gagaccctgc ggccccgcgc cggcctcttt cccaaccctt
901 cggcggccgc gcgctggccg gggagccgtt ggggaggccc tggcggccgc gcagcaggtg
961 caggggcgca gagcctgggc tcgccttggt acagacgagc gggccccggc cttggcgcct
1021 tcagtttcct tccagttttt attttcgctg tgtctacaga gcagatgaca ccaatttgga
1081 aacccgcgag agtgggtaga gctaagatag tcttgctgta gtagctgtga tattagatgc
1141 tcggccatga cttagaggtg tttatttaag gactgtgaat gactcggtga tttcggaaaa
1201 gcttggctta gatgaacgga catacacagg ggagacagcc ctaaggtttg cagaaaaggc
1261 tgattgtgct gtttgcgaag tcgaaataat tggtgaaagt gtagaaggca gaacctctca
1321 ggaatgtctg gggaggacaa agaatgtgtt ggctgacttt gtttaaacat aaaattgggc
1381 agactttaat tgatttgtga aatttttttc aaagtttgtt tgaattagcc cctatctctt
1441 ctaacattat cctcttgtgc taattgattg accattttaa ataacttagc tgttacagaa
1501 agaccgaaag gtgttcttca gtaaaatata ttcaagtaag ttacttaagt aacgccttaa
1561 aagatacaga aaagcaaaaa agtattggcg tattaaaaag aaatcaaaac tttccaagtt
1621 taggcctgaa cattgcctta aaaatattta ataaggcctc aaatgaccca gtccgagact
1681 gcatgagcct atttattatt aaattgtaaa tattcttcat ataaacaaaa atatataacc
1741 atgtctgtaa caaaaatggt tttgctagcg ttgttactct cttcccttct ccgaggggtg
1801 atttaggcaa cttcggaggt tgacaatgcc aagcagtcac aatagataga gctttaaagc
1861 aaattctatg catgggtttg gatttatgac aggcccgtca ccctgggcct gtcatagtac
1921 cccatgccag agcaaactgt gtccccgaac cattgcctgg cctctgtgcc cgtaggctgc
1981 tggcactgaa gtgggttgca cagtggaaaa gaagaaagct ctacctggca gaaattttta
2041 aaggttaaaa taaataattt taagaaagct ggttcacaag gtgccacatt tgatgaaagc
2101 aaaatacagt ggcttttatt gttactagag tgatgttctt gcttgttttt cttttttggt
2161 gaagttagcc ccaaattatt ctcatagcta agcaaatacg agagtgactg taaggacagt
2221 tggcattccc ggaattgcta aacttggtag gcaacgctgg tttaagaata ctgagttcta
2281 gccgggcgtg gtggctcacg cctgtaatcc caacactttg ggaggctgag gcaggcggat
2341 cacctgaggt cgggagttgg agaccagcct gactaacatg gagaaacgcc atctccacta
2401 aaaatataaa attagccagg ccccgggtgt ggtggcacat gccggtaatc ccagctactc
2461 gggagactga ggcaggagaa tcgcttgaac ccaggaggcg gaggttgagg tgagccgaga
2521 tcatgccatt gcactccagc ctgggcaaca agagtaaaac tctgtctcaa aaaaaaaaaa
2581 aaaaaatact gaattctgat caggtaacag caactgtaat acaatgtgat aagttgactt
2641 gaagattaca gtttttaaga agtatatacc cagctaatac atgaaaatta actcgtaaaa
2701 tctcaaatgc tccagacatt tccatgatgc ctgttggtca gtaaaaatca ttctaagact
2761 tagtggaagt aggaaatgtt tgtatggcaa cgtggtgaaa tcctgtctct actaaaaatg
2821 tgtataaagg ctataatgta atcccagcac tttggaagac cgaggcgggt ggatcacctg
2881 gggtcaggag tttgagaccc acctggacca caaaaattag ccgggcatgg tggcaggcgc
2941 ctgtaatccc agctgctggg gaggctgagg caggagaatc gcttgaaccc gggaggcaga
3001 ggttgcagtg agccaagatt gcaccgctgc actccagcct gggtgacagc gtgagactct
3061 gtctcaaaaa aaataaaaaa gtctataatg ctattttaag tttctaagga actgaaactg
3121 ctctgaaata aatcagacca ttataagact tttttccata tcagtgagct aagtgcagat
3181 aagcttctga aacttgcatg ctagattttt ttggtacaaa tatttgaaat gcttagtgtg
3241 ctgccttgga aaaacctggt attttttgtt gtgtccttat actgccaagg tttatggaat
3301 catgtacctt atgcctagta ataattagga tgaccaggcc agtgagtggt tcatatccgg
3361 ggcatgatta gctctgcgtg tgctcagcca gtgccccatc ttcaactcga tgtgttccta
3421 aggtagacag caaattccct attttatttc tcagattgtc actgctgttc caagggcaca
3481 cgcagaggga tttggaattc ctggagagtt gcctttgtga gaagctggaa atatttcttt
3541 caattccatc tcttagtttt ccatgtaagt attcagttta catttatgtt gcaggttaat
3601 cttaagaatt gtattgctaa ggcttctaag tgaatttctc cactctattt gcattttgtt
3661 gcatttcaga ggaacatcaa gaaatcatga acaactttgg taatgaagag tttgactgcc
3721 acttcctcga tgaaggtttt actgccaagg acattctgga ccagaaaatt aatgaagttt
3781 cttcttctgt aagtatatga ggcccatgct ggcagtgcag ctgagagtgc caggcaagtg
3841 gaaaactttg gcaaggtcta aggaagagca atgaggctta catgtcttgt tatggaatgt
3901 agaaattaat tcactggtgg taaattaata gtgataatgg tgatactcat atcagtggct
3961 agactcaaaa gagcaggatt cattgtgact gatgggaatg aaggtcgctg gctattggtg
4021 tggtgtgtgg tgaggctgct agtgagtcac ctgtgaccac tcttgtttca ggatgataag
4081 gatgccttct atgtggcaga cctgggagac attctaaaga aacatctgag gtggttaaaa
4141 gctctccctc gtgtcacccc cttttatgca gtcaaatgta atgatagcaa agccatcgtg
4201 aagacccttg ctgctaccgg gacaggattt gactgtgcta gcaaggtaag cgatagcagc
4261 aggcctcaaa agcgttgtat aaaatgggcc tggtattccc cacgaggcag atacaagttg
4321 tgttttttgg gcaataaatg ctcactaaag gcaaatgggg cgggggggta catgacaact
4381 tcccatgctt ttctgtttat tccacgtgtt aagccacata tggatagcat gacaccactc
4441 ttctttttca gactgaaata cagttggtgc agagtctggg ggtgcctcca gagaggatta
4501 tctatgcaaa tccttgtaaa caagtatctc aaattaagta tgctgctaat aatggagtcc
4561 agatgatgac ttttgatagt gaagttgagt tgatgaaagt tgccagagca catcccaaag
4621 caaagtgagt tattccccca tctgagggca agatcgggag cataagatat gtggattctt
4681 atcaaacaaa cttaaatttc tgattattat atttctatac tttagtagaa agtagttgaa
4741 acccccattg agtcatgaag cctgggactc aaactacaga atatatcagc gacagtattt
4801 agaacaggat tgtttttatt ttaattgtgg ctataagtga acatctatca tgagacattt
4861 gctgcacttt ccttgcttgt aggttggttt tgcggattgc cactgatgat tccaaagcag
4921 tctgtcgtct cagtgtgaaa ttcggtgcca cgctcagaac cagcaggctc cttttggaac
4981 gggcgaaaga gctaaatatc gatgttgttg gtgtcaggtg agattttggt gggatagcta
5041 gaggtcaaga cattgaacag tttgagtttt acaggctttc tcctagtgtt tgctattatt
5101 ttaagaaata ctaagacaca gtgtctcgtc tctttatttt accccagctt ccatgtagga
5161 agcggctgta ccgatcctga gaccttcgtg caggcaatct ctgatgcccg ctgtgttttt
5221 gacatggggg tgagtatacg tgaccctgtt agggaagggc gggacacaac tgacaataac
5281 tagtcttaat tctagagtta actttttatg gcagttggtt ctgtattaca tgggtttcag
5341 cctatctgct gcatacattt ttgttattag ctgtggatct ggctgactta ttttcttgat
5401 tctaggctga ggttggtttc agcatgtatc tgcttgatat tggcggtggc tttcctggat
5461 ctgaggatgt gaaacttaaa tttgaagagg taatttagaa caaaactgta atactcagta
5521 gccgttctaa taaattcctt tttggaatat ttcaaaattt aagtgtctta actaatacca
5581 caatgggctg aagtgtcttg gtgtgatatt tttgagtgat ttctttgtgc tgtctgacat
5641 tacacttgat accatttggt tttctaaagt gtgaatcagc tttcccagaa gtcttggata
5701 attggttaca ttggaaatca tggctcacac ctgtaatcca gcacttgggg aggccaaggt
5761 ggtaggatca cttgagccca ggagtttgag accagcctgg gcaacacagt gagaccccat
5821 ctctacaaaa aaaattttaa aattagcctg gtgtggtggc gggcacctgt aatcccagct
5881 acttggaagg ctgaggtggg aggatcactt gagcccagga ggttgaggct gcagtgagcc
5941 atgatcatgc cactgcactc agcctgggct acagagtgag accctgtctc aaaaaaaaaa
6001 aagaaaaagc atgttgctgt gggcttccta gagaatatgc tgactgtagc acatcatcac
6061 cccaaatgtg ctttgctaga cctatgcttc ctctccttaa aatacttgaa atgtttagtc
6121 acttaggaag ttaagccatt atattggtgc ttgaatttat aaaatatatc cacatggttt
6181 gttaaaatca tgacgtaggc agaataggat ttttatcctg ttggcatgta tttgttaaaa
6241 tgttttgaca tcttgatgcc ttcctaggta gtagttagtt gcgtactgtt ctttgataaa
6301 aatcataccc ataacatcct aaaggagata gggtgcctgg aggggaatga aaacgagcca
6361 cctgggatat gtagcctggt tttcagggag atgttgatgt ttttttgctt ttgttacttt
6421 aatgataaac ctgtctgttg atgcctggtc tcatgatgtc atgtcacaag gccctgtgat
6481 gttactcccc catgtgaatt tcccacaatg aaggctgctc tttcttttct gtttcactct
6541 cttagatcac cggcgtaatc aacccagcgt tggacaaata ctttccgtca gactctggag
6601 tgagaatcat agctgagccc ggcagatact atgttgcatc agctttcacg cttgcagtta
6661 atatcattgc caagaaaatt gtattaaagg aacagacggg ctctgatggt atgtataaag
6721 gacgaatcac ttcatgtata actgaaagct gatgcaaaaa gtcattaaga ttgttgatct
6781 gcctttctag acgaagatga gtcgagtgag cagaccttta tgtattatgt gaatgatggc
6841 gtctatggat catttaattg catactctat gaccacgcac atgtaaagcc ccttctgcaa
6901 aaggtaattt ctgagcatac tgtataaaac aattaagagg actggtcaca acacgtgtaa
6961 ttaagtagta cttcctctct ccgtctcttt atatagagac ctaaaccaga tgagaagtat
7021 tattcatcca gcatatgggg accaacatgt gatggcctcg atcggattgt tgagcgctgt
7081 gacctgcctg aaatgcatgt gggtgattgg atgctctttg aaaacatggg cgcttacact
7141 gttgctgctg cctctacgtt caatggcttc cagaggccga cgatctacta tgtgatgtca
7201 gggcctgcgt ggtaagtaag ccatgcatgt tgatggtgct gccaagaata ggcaccttct
7261 tggatgtgtg cttcttgtct agacgaataa gaaattgtct tgcctaagat taaatatata
7321 tggatatttt tcctaagaaa agttttagaa aagactgatg agtgtatttc tatgtaattg
7381 gaatatattt aagttcatgc catgtgtctt gtggtttcct tattaccaaa acggtgactg
7441 aagaaacgct tgctttagaa atacattgaa ttggccaggt gtgctggctc acacctgaaa
7501 tcacaacaca ttgggaggcc aaggcagaag gatcacttga gcccaggagt tcgagcctgg
7561 gcaacatagt gagaccctgt ctctacaaaa aattaaaaaa ttagttggcc atggtagtgg
7621 gcgcctgtag tcccagctgc ttggctaagg tgagaggttt gcttgagcct gggaggttga
7681 ggctgcggtg agctatgata gcaccattgt attccagcct gagtaacaga gaaagaccct
7741 gtctcagaaa aaaaaaaaat acattgaatt gtttcctgat ggaagtaaat actctcatgc
7801 ccagttagga gtgagtcagg gtttttaata tgccactttt tctttctcag gcaactcatg
7861 cagcaattcc agaaccccga cttcccaccc gaagtagagg aacaggatgc cagcaccctg
7921 cctgtgtctt gtgcctggga gagtgggatg aaacgccaca gagcagcctg tgcttcggct
7981 agtattaatg tgtagatagc actctggtag ctgttaactg caagtttagc ttgaattaag
8041 ggatttgggg ggaccatgta acttaattac tgctagtttt gaaatgtctt tgtaagagta
8101 gggtcgccat gatgcagcca tatggaagac taggatatgg gtcacactta tctgtgttcc
8161 tatggaaact atttgaatat ttgttttata tggattttta ttcactcttc agacacgcta
8221 ctcaagagtg cccctcagct gctgaacaag catttgtagc ttgtacaatg gcagaatggg
8281 ccaaaagctt agtgttgtga cctgttttta aaataaagta tcttgaaata attaggcatt
8341 gggacgtttt tatggtgtgt tcattccaga cagttcacga atcccgtata gctcgctctg
8401 attctcagag aacaatgagt gggtccaccc acacacaggt aggaggacag gtgagacgga
8461 agccccatcc tcccatgtgg acggtgcaca tctgctcagc ccaccccaca tgtccagagt
8521 tggctgcaaa ctccttgtcc agagcctctg gtggtgggac ctacttaagt ctgacggacc
8581 tgtcctgtcc aggccagtgc ccagggaagg tgtgggaggc cctttgagcc tggcctgcag
8641 agaccatccg tgtcccctcc caccttcatg cctgtgagaa gttaggaatg tatacggtac
8701 cacatttggc agtcagctta ttttaataaa ttcagcaaca gcaagtccct accatgttgt
8761 gtatcttcac catcttgtct gaccatgacc actggccttg tgtgttcttt tactcaacgt
8821 gtacccccgc tctcccccaa a
//
LOCUS XELFIMB1 1341 bp ss-mRNA VRT 28-JUN-1990
DEFINITION X.laevis integumentary mucin B.1 (FIM-B.1) mRNA, complete cds.
ACCESSION J02910
KEYWORDS integumentary mucin; mucin.
SOURCE X.laevis skin, cDNA to mRNA, clones pREP1218 and pXGL-3'-1.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1341)
AUTHORS Probst,J.C., Gertzen,E.-M. and Hoffmann,W.
TITLE An integumentary mucin (FIM-B.1) from Xenopus laevis homologous
with Von Willebrand factor
JOURNAL Biochemistry (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Hoffmann, 26-APR-1990.
FEATURES from to/span description
pept < 1 1197 integumentary mucin (FIM-B.1; AA at 1)
mRNA < 1 1341 FIM-B.1 mRNA
signal 1319 1324 poly-A signal
rpt 1 279 repeat sequence
BASE COUNT 472 a 290 c 268 g 311 t
ORIGIN
1 cctgcacctt cagaaacaac aggagagtca acacctgcac cttcagaaac aacaggagag
61 tcaacacctg caccttcaga aacaacagga gagtcaacac ctgcaccttc agaaacaaca
121 ggagagtcaa cacctgcacc ttcagaaaca acaggagagt caacacctgc accttcagaa
181 acaacagtac catcaggaga gtcaacacct gcaccttcag aaacaacagt accatcagta
241 ccatcaggag agtcaacacc tgcaccttca gaaacaacag agctgagaat catcccacca
301 gaagtatcta cagttgcagt gccagtaaca actggacaga taactcctgc agtaacaact
361 gaacacagca cagaagaaat actgacactt cctccgccag tggtaggacc agttcttcca
421 gcaaagccaa cagtagatat ctccaaatat acaaacacta caactactaa atcgactgtc
481 cctacaacta ccattcctcc taaagctact tgctgtggat cgtctggaga gtcagtacaa
541 gcaggtcata tgtggcaaac tggttgtgat gtgtgcacat gcaatggaac ctctggaaaa
601 acacaatgtg caccacgcca gtgtgaaaag gagataatct gtaagagcga cgaaaggagg
661 gtactgagaa aacctgggaa atcatgctgt ggatattgcg agccgctgac atgtaaacat
721 aatggaacag aatataagct tggggctacc tttatagaca aatcgaatcc ttgtatcact
781 tatagatgtg acgcatcagg cttgacagtg aatgttaaat catgcccaaa tgaacaagtt
841 tgcagtaaga gtgaaagaac ctatgattca gatggctgtt gtttctcatg tgacacatca
901 tgtaaaccag tccctgcgac agtgggtatc caaggggagt acgattatca gaatgaaaaa
961 acaaactgct ctgctaatat cataatggca aaatgttctg gtcaatgcca acataaattg
1021 acctatgata caattgacaa taaagttgtt accaaatgtc ggtgttgtaa ggcagaccgt
1081 gttgaaccaa gaaaagctca tctggtgtgt gataatggaa agaaaaaaat atataaatac
1141 aaacatatca catcttgcaa atgcacatct tgtaccgctt acaatataag actgtgaaga
1201 ctttcatgca tttgctaact gtcagctttt gcatgtacaa taaaataaac atgggggata
1261 tattggaggc agagatatct gttatggtac ataaagtatt ttttgtctgt tattgaaaaa
1321 taaatggttt ttttcagctg t
//
LOCUS LACISTRAN 808 bp ds-DNA BCT 28-JUN-1990
DEFINITION L.lactis insertion sequence IS946 transposase gene, complete cds.
ACCESSION M33868
KEYWORDS transposase.
SOURCE L.lactis DNA.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 808)
AUTHORS Romero,D.A. and Klaenhammer,T.R.
TITLE Characterization of gram-positive insertion sequence IS946, an
Iso-ISS1 element, isolated from the conjugative Lactococcal plasmid
pTR2030
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.A.Romero, 26-APR-1990.
Author address: D.A.Romero
North Carolina State University
Dept. of Food Science
Box 7624, Raleigh, NC 27695
FEATURES from to/span description
pept 74 754 transposase (put.)
rpt 1 18 inverted repeat
rpt 791 808 inverted repeat
signal 6 35 promoter region
binding 62 65 ribosomal binding site
BASE COUNT 281 a 137 c 158 g 232 t
ORIGIN
1 ggttctgttg caaagttttc tgataagtct attttagtgt aaaatgaata aaaatgacag
61 cgaggatata tcaatgaact attttaaagg taaacaattt caaaaagatg tgattattgt
121 cgctgttggt tactacctgc gttacaatct aagctatcgt gaagttcagg aattgttata
181 tgatcgtgga ataaatgttt gtcatactac gatttatcgt tgggtgcaag aatacagtaa
241 agtcctctat catctctgga aaaagaaaaa tagacagtcc ttctattcgt ggaaaatgga
301 cgaaacctat atcaaaatca aaggtcgttg gcattatctc tatcgtgcaa ttgatgcgga
361 tggcttaacc ttagatatct ggttacgaaa gaaacgggat acgcaagcag cctatgcttt
421 cttaaaacga ctccataaac agtttggtca accaagagta attgtcacgg ataaagcgcc
481 ctctattggt tctgcattta gaaagttaca gagtaacggt ttatatacta agacagagca
541 tcgaaccgtg aagtatctca ataacctcat tgagcaagac catcgaccaa tcaaacgacg
601 caataaattt tatcgaagtc tacgaactgc ctcaaccacg attaagggca tggaaacaat
661 tcgaggaata tacaaaaaga accgaagaaa tggaacgctc ttcggatttt cggtatctac
721 tgagattaag gtcttaatgg gaatattagc ttaagaacaa gaaggattat aaaccttgta
781 tttgattttt aaactttgca acagaacc
//
LOCUS RSSTRXA 464 bp ds-DNA BCT 28-JUN-1990
DEFINITION R.sphaeroides Y thioredoxin (trxA) gene, complete cds.
ACCESSION M33806
KEYWORDS thioredoxin Y.
SOURCE R.spaeroides DNA, clone pUTC3.
ORGANISM Rhodobacter sphaeroides
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 464)
AUTHORS Pille,S., Chuat,J.-C., Breton,A.M., Clement-Metral,J.D. and
Galibert,F.
TITLE Cloning, nucleotide sequence, and expression of the Rhodobacter
sphaeroides Y thioredoxin gene
JOURNAL J. Bacteriol. 172, 1556-1561 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 57 377 thioredoxin (trxA)
BASE COUNT 84 a 147 c 153 g 80 t
ORIGIN
1 ccggcagggg gccatcctta ctttccaagc aagcctacgg atcaggagaa ttgaatatgt
61 ccaccgttcc cgtgacggac gccaccttcg acaccgaggt gcgcaagtcc gacgtgcccg
121 tcgtcgtcga tttctgggcc gaatggtgcg gcccctgccg gcagatcggc ccggcgctcg
181 aggagctctc gaaggaatat gccggcaagg tgaagatcgt gaaggtcaat gtcgacgaga
241 accccgagag cccggcgatg ctgggcgttc gcggcatccc ggcgctgttc ctgttcaaga
301 acggtcaggt cgtgtcgaac aaggtcggcg ctgcgccgaa ggccgcgctg gccacctgga
361 tcgcttcggc gctctgagat cctctgccat agcagagacc ggacggggcg ccttcgggcg
421 cccttttcct tgagcaagcg accgaccgcg acgggcgagg atcc
//
LOCUS PSECATRA 1070 bp ds-DNA BCT 28-JUN-1990
DEFINITION P.putida catR regulatory protein gene, complete cds, and catBC
gene, 5' end..
ACCESSION M33817
KEYWORDS regulatory protein.
SOURCE P.putida DNA.
ORGANISM Pseudomonas putida
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1070)
AUTHORS Rothmel,R.K., Aldrich,T.L., Houghton,J.E., Coco,W.M.,
Ornston,L.N. and Chakrabarty,A.M.
TITLE Nucleotide sequencing and characterization of Pseudomonas putida
catR: a positive regulator of the catBC operon is a member of the
lysR family
JOURNAL J. Bacteriol. 172, 922-931 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 135 1004 catR regulatory protein
pept 3 < 1 (c) cis,cis-muconate-lactonizing enzyme I (catB)
(EC 5.3.3.4)
mRNA 61 < 1 (c) catBC mRNA
BASE COUNT 199 a 355 c 329 g 187 t
ORIGIN
1 cattgttgcc aggtccgttg tttcgggctg ccagccgcgg gcccagtggt gatttacttg
61 ttgttcaagg attgcgcgag accctgatag ccgtccaata tcgaatgaat ctcccaccat
121 accctggagg tctgatggag ctgcgccact tgcgttactt caaggtcctg gccgaaaccc
181 tgaacttcac tcgcgccgcc gagctgctgc acattgccca gccgccgctg agccggcaga
241 tcagccaact cgaggaccag ctcggtacct tgctggtagt gcgcgagcgc ccgctgcggc
301 tgacagaggc gggtcgcttc ttctacgaac agagctgcac cgtgctgcag ctgcagaaca
361 tcagcgacaa cacccgtcgc attggccagg gccagcgcca gtggctgggg atcggcttcg
421 ccccgtcgac cctgtacaag gtgctgccgg agctgatccg cgagctgcgc caggacagcg
481 agctggaact gggcctcaac gagatgacca cgctgcagca ggtggaggcg ctgaaaagcg
541 ggcgcatcga catcgccttc gggcgcattc gcatcgatga cccggcgatt caccaacaag
601 tgctgtgcga ggacccgctg gtggcggtat tgcccaagga tcacccgctg gcgagcagcc
661 cgcttacact tgcccagctg gctggcgagg cgttcatcct ctacccggcc aacccgcggc
721 ccagctatgc cgaccatgtg ctggcactgt tcgcccacca cggcatgagc atccacgtca
781 gccaatgggc caacgaactg cagaccgcca tcggcctggt ggccgtcggc gtgggcgtga
841 ccctggtgcc ggcgtcggtg caacagcagc accgcaccga tatcgaatat gtaagcctgc
901 tcgacagcgg cgccgtcagc ccgatcatcc tcagccggcg caagggcgat gtgagcccga
961 tcgtgcagcg ctgcctgaca ttgattgcac aacaggccga atgagggcgc gctcctggaa
1021 tcaacggtct ggctaaacac ctgccccctg tgggagcggg ttcacccgcg
//
LOCUS HUMMHDQ1A 1515 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human MHC class II HLA-DQ-alpha gene, regulatory region.
ACCESSION M33765
KEYWORDS cell surface antigen; cell surface glycoprotein; class II gene;
major histocompatibility complex.
SOURCE Human (DR4, DR4) DNA, clone H11A.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1515)
AUTHORS Ombra,M.N., Del Pozzo,G., Perfetto,C. and Maffe,A.
TITLE Effect of the AIR-I locus on the activation of an enhancerless
HLA-DQA1
JOURNAL Immunogenetics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Guardiola, 18-APR-1990.
BASE COUNT 441 a 322 c 340 g 412 t
ORIGIN Chromosome 6p21.3.
1 atataactga tgacctcatc tatccatgag gctgctcacg aaatgccctc ccctggccag
61 gcacggtggc tcatgcctgt aatccaagca ctttgggagg ctgaggcagg cagatcatga
121 ggtcaggagt tcaagaccag cgtggccaac atagtgaaac tctgtctcta ctaaaaatac
181 agaattagcc gggcatggtg gcagcgctgc agtcaccagc tactcgggag ggtgaggcag
241 gagaatcgct tgaacctggg aggcaaaggt tgcagtgagc cgagaccatg ccattgtacc
301 tcagcctggg tgacagagtg agactacgtc tcgaaaagaa aagaaaaaga aaagaaaaag
361 aaaaaaaaag aaaaatgccc atccctcttg cgaatggcag acatgcacac accagagaag
421 attccaattt agtgtcctcc ctctgttcat agaacaattc ctcaagtcca ctctgagtag
481 aggctgcatc acaacaaggg gattgccctg tctccttcca gggctcttaa tacaaactct
541 tcagctagta actgagatgt caccatgggg gatttttcta attggccaaa acctgacctg
601 gcagggtttg gtttgggtgt cttcagattt ccttgtcttg aggccctcac aattgctcta
661 cagctcagaa cagcaactgc tgaggctgcc ttgggaagag gatgatccta aacaaagctc
721 tggatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg tgaagacatt
781 gtgggtgagt gcatgagtga ggaaatgttc tctggagctg aaaaacagta aattgaagga
841 aaagagagaa agcgatttgc agagaaattg tagagatttc ctaagaccct ttcactatta
901 agagaaatta aaaattatag ctgttcctcc ttcaggaaac cagagaccca atccactctt
961 tttgttatgt atgcttttgt gttcactaag gatgctattc tgtttatatt atattcagtg
1021 acaacagcct ggaggtctct atgtcgttcc gtcatgattg cctcaaaaat tagtgaagtt
1081 tccatcagtg gaataatttt ttattattaa aaatgtatga agtgtcattc tcaaatttcc
1141 ctgaacaact tttgaagctt ttcgtatgtc tcctgtagta gatcttgggg tcgttccatc
1201 aattatatac tctatagata ttaaaaaagt tgcccgtttc tttctctcag acttactcac
1261 atttccacat gggaactggc acaggtgggg agtgggtaaa ggagtccagc aggctgaatg
1321 ccttcaacaa tcattttacc acatggtcct cacttactct cagctgcctc atatgtgtca
1381 cctcacaaaa taatcaaaat aaaaatgggc atgtagctaa gctttgtaaa tagtgaaaac
1441 atggaatgtc aattgttttt acatatttct attacaggta tagcttcaca tttttcttta
1501 gcaaaataag ggatc
//
LOCUS FSOSTI35X 1856 bp ds-DNA PLN 28-JUN-1990
DEFINITION F.solani STI35 protein gene, complete cds.
ACCESSION M33642
KEYWORDS STI31; heat-shock gene.
SOURCE F.solani phaseoli mycelial ATCC 60860 isolate DNA, clone cKAES018.
ORGANISM Fusarium solani
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE 1 (bases 1 to 1856)
AUTHORS Choi,G.H., Marek,E.T., Schardl,C.L., Richey,M.G., Chang,S. and
Smith,D.A.
TITLE STI35, a stress-responsive gene in Fusarium spp
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.L.Schardl, 08-APR-1990.
FEATURES from to/span description
pept 564 1443 STI35 protein, exon 1
1492 1586 STI35 protein, exon 2
IVS 1444 1491 STI35 intron A
BASE COUNT 401 a 585 c 445 g 425 t
ORIGIN
1 acacactctc tgttgacgca tcctcagact cggaaataca tttcgttggc cgatatcgcg
61 ggttactcca tcatgatatc agccatccat tacatcgtta cccctcatct gcggtataaa
121 agccagccat tgtctcctgc tgcttgatcc attctctctt cttcttcatc aactcatcaa
181 caacttgaac caaaaccctt atcaagactc aaggtttgta ctcgaggtga gaggtgatga
241 actggtctga cacctgtgac atgctagata tcatcacttc cggtacggat cctccgtcga
301 ggaatctgca gccatggagt acctttttgc gggtagttgt ccgaacctga ggttggagcc
361 attcaggtaa agcatcatgc atgagccggt gcccgttctt cccttcttgg ccattgtctg
421 tactcctgtg tacatcaatg tcaagaagag gagagcgtgg ctgagattat acggcgaaac
481 ttgatctgga taataccagc gaaaggatca tgctctcctc cccccattga ccccaatccc
541 ctaacacaac cccaggccca acaatgtctc ctcccgctgc cgtctccccc cctgcccgct
601 ctgctgagct cgcctccgcc cccgccgtca agctccccgt cggcctgagc aagaacagcg
661 ctgctgccac caccgtcgag gagatggagg gcaagtggga cgacttcaag tttgccccca
721 tccgcgagag ccaggtctcc cgggccatga cccgccgcta cttccaggac ctcgacaact
781 acgccgagtc cgacattgtc attgtcggcg cgggatcctg cggtcttagc acccgctaca
841 tcctcggcaa gaagaggccc gacctcaaga tcgccatcat cgaggcttct gtgtctcctg
901 gtggtggtgc ctggctcggt ggccagctct tctcggccat ggtgatgcgc aagcccgccg
961 atgccttcct ccgcgaagtc ggcgtccctt acgaggacga gggcaactat gtcgtcgtca
1021 agcacgctgc cctcttcacc tcgaccatca tgtccaaggt gctccagctc cccaactgca
1081 agctcttcaa cgccacctgc gtcgaggacc tcatcacccg accttccaag gagggagtcc
1141 gcatctctgg tgtcgtcacc aactggaccc tcgtgtcgat gcaccacgac gaccagtcct
1201 gcatggaccc caacaccatc aacgcgcccc tcgtcatctc caccaccggc cacgacgccc
1261 ccatgggtgc cttttgcgtc aagcgtctcg tcagcatggg acgcatcgag aagctcggcg
1321 gcatgcgtgg tctcgacatg aacgtggccg aggatgccat cgtcaaggga acccgtgaga
1381 ttgtccctgg tctgattgtc ggaggaatgg agctttccga ggttgacgga gccaaccgaa
1441 tgggtgagtt tctcgacacg agtatcaata actggttgct aactcgacta ggccctactt
1501 tcggtgccat ggttctgagc ggtctcaagg ctgccgagga ggccctcaag gtcattgaca
1561 tccgccagaa gcagaactcg ttctaagcga gttgacgtac aagaagaaca tggactggga
1621 aacacgatga ttcaggaaac atgaattcag gagccaggac ctcgtctctc tttcgaggca
1681 tggatccttg tcgctcatta ggccctttca gagaatgaaa ttggatatca aaactgctga
1741 attgctcata ctttctcgat ataaaacttg acgtacaatg tgcaactcca agaactttac
1801 tcatcctcca tctccttccc ctttggcttc tcgggcggaa acttcatccc cccgtt
//
LOCUS FSOSTI35Y 1177 bp ss-mRNA PLN 28-JUN-1990
DEFINITION F.oxysporum STI35 protein mRNA, complete cds.
ACCESSION M33643
KEYWORDS STI35 protein.
SOURCE F.oxysporum cucumerinum (isolate B1-GK) mycelial culture, cDNA to
mRNA, clone pKAE5030.
ORGANISM Fusarium oxysporum
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE 1 (bases 1 to 1177)
AUTHORS Choi,G.H., Marek,E.T., Schardl,C.L., Richey,M.G., Chang,S. and
Smith,D.A.
TITLE sti35, a stress-responsive gene in Fusarium spp
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.L.Schardl, 08-APR-1990.
FEATURES from to/span description
pept 29 991 STI35 protein
mRNA < 1 1145 STI35 mRNA (alt.)
mRNA < 1 1177 STI35 mRNA (alt.)
BASE COUNT 264 a 328 c 291 g 294 t
ORIGIN
1 aagatatcat aacctgccca agatcaaaat ggcccctccc gctgctgtct cccctccctc
61 tcgctctgca gagcttgcta catctaccaa gctcccagtc atgagcaaga acatcaacac
121 caagaccgtt gaggagatgc tcggtcagtg ggatgatttc aagtttgctc ccatccgcga
181 gagccaggtc tcccgcgcca tgactcgccg ctacttccag gaccttgata actatgccga
241 gtctgacatt gttatcatcg gtgctggttc gtgcggtctg agcgccgcgt acattcttgg
301 caagaagcgt cctgatctca agatcgccat cattgaggct tctgtgtctc ctggtggtgg
361 tgcttggctt ggtggacagc tcttctccgc catgatcatg cgcaagcctg ctgatgcttt
421 cctccgcgag gtcggtgttc cttatgaaga tgagggtaac tacgtcgttg tcaagcacgc
481 cgctctcttc acctcgacca tcatgtccaa ggttcttcag atgcccaaca tcaagctctt
541 caatgccact tgtgttgagg atctcatcac ccgaccttcc gaggagggag tccgcattgc
601 cggtgttgtc accaactgga ctcttgtttc catgcaccac gatgaccagt cttgcatgga
661 ccccaacact attaacgctc ctttgatcat ctccaccacc ggccacgatg gcccgatggg
721 agctttctgt gtcaagcgcc ttgtcagcat gcagcgcatt gagaagctcg gtggtatgcg
781 tggtcttgac atgaacctcg ccgaggatgc cattgtcaag ggaacccgtg agattgttcc
841 tggtcttatt gttggtggaa tggaactttc cgaggttgac ggtgccaacc gcatgggtcc
901 taccttcggt gccatggctc tcagtggtct caaggctgcc gaggaagctc tcaagatctt
961 cgacactcgc aagaagcaga acgatctgta agcgagatca tcaaagaagg tcatagttgg
1021 gaaacggaac atgtgaattc aggatcctgg acctcgtctc tcattcgagg catggaacct
1081 tgtcattcat ttagtctgag cgactcaagt ttggaaaaca tgaacccaaa gtcggatgcc
1141 actcgaaaga gacgccaaat aaagcaatac taaattg
//
LOCUS STMGLUSYN 1208 bp ds-DNA BCT 28-JUN-1990
DEFINITION S.hygroscopicus glutamine synthetase (GLNB) gene, complete cds.
ACCESSION M33783
KEYWORDS glutamine synthetase.
SOURCE S.hygroscopicus (strain ATCC 21705) DNA.
ORGANISM Streptomyces hygroscopicus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 1208)
AUTHORS Kumada,Y., Takano,E., Nagaoka,K. and Thompson,C.J.
TITLE Streptomyces hygroscopicus has two glutamine synthetase genes
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.J.Thompson, 20-APR-1990.
Author address: C.J.Thompson
Institut Pasteur
28 Rue du Dr. Roux
75015 Paris FRANCE
Tel: (33) 145688842
FEATURES from to/span description
pept 22 1158 glutamine synthetase (GLNB) precursor
sigp 22 84 glutamine synthetase signal peptide
matp 85 1155 glutamine synthetase
BASE COUNT 198 a 432 c 418 g 160 t
ORIGIN
1 gaattccttg gaaacggctg gatgtccggc tcggtaacct gtggttcaca aacgggcaac
61 ggacgagaaa tggcgcgttg cgagagtgcg gaggtacgcc cgcaccgtac ccgcaccgac
121 cgtgaagagg accccgtgag catcaaggcc gagtacatct ggatcgacgg cacgcagccg
181 accgccaagc tccgctccaa gaccaagatc ctgtccgacg gcagccggct tccgcggtgg
241 ggcttcgacg gttccagcac caaccaggcc gaaggccacg cctcggacct cgtactggag
301 ccggtgttca gctgcccgga cccgatccgc ggcggcgacc acctgctggt gctgtgcgag
361 gtgctgcaca ccgacctcac cccgcacccc tccaacaccc gggcgctgct gcgcccggtc
421 gcggagcggt tcgccggcca ggagccgatc ttcggcatcg agcaggagta caccttcctc
481 aagggcgacc gcccgctcgg cttccccgag ggcggcggct acccggcccc gcaggccgac
541 tactactgcg gcgtgggcgc cgacgcgatc ttcggccggg agatcgtcga gaagcacctc
601 gacctgtgcc tggcggccgg tctgggcctg tccggcatca acgccgaggt catgcccggc
661 cagtgggagt tccaggtcgg cgcgctgccg ccgctggagg tctcggacca catgtgggtg
721 gcgcgctggc tgctgcaccg ggtggcggag gagttcggcg tcaccgcgtc gctggacgcc
781 aagccggcca agggcgactg gaacggcgcg ggcgcgcaca ccaacttctc cacccgcgcg
841 atgcgcgagg gctacgaccc gatcatcacc gcctgcgagg cgctgggcca ggacgacaag
901 ccgctggagc acgtccgcca gtacggcacc ggcatcgagg accggctgac cggcgcgcac
961 gagaccgccc cctgggacgc gtactcctac ggcgcctccg accgcggcgc ctcggtgcgc
1021 atcccctggc aggtcgaggt cgagaagaag ggctacatcg aggaccggcg cccgaacgcc
1081 aacgtcgacc cgtacgtggt cacccggctg atggtggaca cctgctgcac ggagctggcg
1141 cggcgcgagc agatctgacg cctgcggtca cgtggccgtg cggtcacgtg accgcacggt
1201 cgcgcgag
//
LOCUS BLLEBG 2323 bp ds-DNA BCT 28-JUN-1990
DEFINITION B.lautus endo-beta-1,4-glucanase (celB) gene, complete cds.
ACCESSION M33762
KEYWORDS endo-beta-1,4-glucanase.
SOURCE B.lautus (strain PL236) DNA.
ORGANISM Bacillus lautus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2323)
AUTHORS Joergensen,P.L. and Hansen,C.K.
TITLE Multiple endo-beta-1,4-glucanases from Bacillus lautus PL236 and
characterization of the celB gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.K.Hansen, 18-APR-1990.
Author address: C.K.Hansen
Institut Pasteur
24 rue du Dr. Roux
75724
Paris Cedex 15
FRANCE
Email: DEDONDER%PASTEUR.BITNET@CUNYVM.CUNY.EDU
FEATURES from to/span description
pept 172 1872 endo-beta-1,4-glucanase precursor (celB) (E.C.
3.2.1.4)
sigp 172 262 endo-beta-1,4-glucanase signal peptide
matp 263 1869 endo-beta-1,4-glucanase
signal 46 51 -35 region
signal 70 75 -10 region
binding 158 164 ribosome binding site
signal 1956 1981 transcription terminator
BASE COUNT 644 a 508 c 563 g 608 t
ORIGIN
1 ggggttttgt tcatttcaag aaagagcacg aataaacatc gctaatttac acatgattta
61 ttctctaatt attatggtgc atgccaattg gaaaatgtag tagattagtc atcgtaacct
121 gctttcatgc caagatatgt aatttttaaa aaagagtaaa ggagaattca gatgaaaaaa
181 cgtagaagca gtaaagttat tctttcgttg gccatcgttg ttgcattatt ggcagccgtc
241 gaacctaatg ccgctttagc agcggctcca ccaagtgcca tgcagtccta tgttgaagcg
301 atgcagcctg gctggaacct tggcaattct ctggatgctg tcggtgcgga tgagacgctg
361 gcacggggca atccgcggat cacgaaagag ctcattcaga acatcgctgc gcaaggctat
421 aagagcatac ggattcctgt tacctgggat tcccatatcg gcgcggcccc aaattatcaa
481 attgaagctg cgtacctcaa tcgagtgcag gaggtcgtac agtgggcttt ggacgcgaac
541 ctctatgtga tgattaatgt ccatcatgat tcctggctat ggatcagcaa aatggagtcg
601 cagcacgatc aagtactggc ccgttataat gcgatttgga cgcaaattgc caacaagttc
661 aagaacagcc cgagcaagct gatgttcgag agcgtgaatg agcctcgctt tacggatggc
721 ggaactacgg atgaagccaa gcagcaaaaa atgctggacg agctgaacgt atcctttttc
781 aacatcgtca gaaattccgg cggccagaac gcgactcgcc cgctagttct ttctacgttg
841 gaggcctctc ccacccaaga gagaatgacg gcgctttata atacgatgac caaactgaac
901 gacaagaatc tgatcgcaac cgttcatttt tatggattct ggccgtttag cgtaaatatc
961 gcaggatata cgaaatttga tgcggagacg caaaatgata ttataacgac cttcgataac
1021 gtgtataaca catttgtagc aaagggaatc ccggtggtag tcggcgaata tggccttctt
1081 ggattcgata agaataccgg cgtcattgaa cagggtgaga aattgaaatt tttcgagttt
1141 tttgcccagt atgtgaagca aaaaagcatt tccactatgc tatgggataa cggacagcac
1201 ttcaaccgca cgagcttcaa gtggtctgac ccggatttat tcaatatgat caaggccagt
1261 tggaccggac gttcatccac ggcttccagc gacctgatcc atgtcaagca gggcacggcg
1321 gtaaaagata cttcggttca gctcaatctt aacgggaata cgctaacttc cctttccgta
1381 aatggaacga cactgaaatc aggcacagat tacactttaa acagcagcag attaactttt
1441 aaagcgagcc agttgaccaa gctgacctcc ttgggcaaat tgggggtcaa cgcgacgatc
1501 gtgactaaat tcaatagagg cgccgactgg aagttcaacg tagtcctgta caatacgcct
1561 aagcttagca gtacgacggg gactacttct tcctttgcga ttccaacggc tttcaacggg
1621 gatcagcttg ctacgatgga agcggtctat gtaaacggcg gcaatgccgg tccgcataac
1681 tggacttcct ttaaggaatt cgaaacgacg ttcagccccg cttatagcga ggggaaaatc
1741 aaactgcagc aggcgttctt taatgaagtg aatgatacca cagtcacgct caagttccaa
1801 ttctggagcg gggagatcgt caactacacg attaaaaaga gcggttcgac ggtgacgggt
1861 acggcttcat aagcgagttt ggcaaaaaag gaccgatata ctgcctaatt tggtattgcc
1921 ttagttgaaa gcaattgctc cgaataaaca gaatgaagcc ccggccagct ggccgggact
1981 tatgcgttta ggaagtataa acgaatcatc agcaatttat ttagctcgtc tcagttcagc
2041 aatatcggct tcatgtgaaa cggagcggat gaacaatctt tcgagcaatt tctcatgctc
2101 ctgctgggtt tggagaacgg tttgctgatt agttttaagt acagatatat cctcacggac
2161 ttgattgatt catgtggtcc gttagttctt ctacctttgt atttgtggca gcaacgatat
2221 gaattaattg ttgaatgtgc ccgccatgac tgtttagctg ctcattgtgg ctttgtaact
2281 gttctcggat ttctttgaat tcttggtcgt gctcattaag ctt
//
LOCUS HLFRRD1 394 bp ss-rRNA BCT 28-JUN-1990
DEFINITION Haloferax 16S ribosomal RNA, partial sequence.
ACCESSION M33803
KEYWORDS 16S ribosomal RNA.
SEGMENT 1 of 3
SOURCE Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA.
ORGANISM Haloferax sp.
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 394)
AUTHORS Holmes,M.L. and Dyall-Smith,M.L.
TITLE A plasmid vector with a selectable marker for halophilic
archaebacteria
JOURNAL J. Bacteriol. 172, 756-761 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 394)
AUTHORS Holmes,M.L. and Dyall-Smith,M.L.
TITLE A plasmid vector with a selectable marker for halophilic
archaebacteria
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by M.L.Dyall-Smith, 19-JUN-1990.
FEATURES from to/span description
rRNA < 1 > 394 16S rRNA
revision 249 249 c in [1]; g in [2]
BASE COUNT 102 a 101 c 126 g 65 t
ORIGIN
1 gaaaagctca gtaacacgtg gccaaactac cctacagaga acgataacct cgggaaactg
61 aggctaatag ttcatacggg agtcatgctg gaatgccgac tccccgaaac gctcaggcgc
121 tgtaggatgt ggctgcggcc gattaggtag acggtggggt aacggcccac cgtgccgata
181 atcggtacgg gttgtgagag caagagcccg gagacggaat ctgagacaag attccgggcc
241 ctacggggcg cagcaggcgc gaaaccttta cactgcacgc aagtgcgata aggggacccc
301 aagtgcgagg gcatatagtc ctcgcttttc acgactgtaa ggcggtcgtg gaataagagc
361 tgggcaagac cggtgccagc cgccgcggta atac
//
LOCUS HLFRRD2 254 bp ss-rRNA BCT 28-JUN-1990
DEFINITION Haloferax 16S ribosomal RNA, partial sequence.
ACCESSION M33804
KEYWORDS 16S ribosomal RNA.
SEGMENT 2 of 3
SOURCE Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA.
ORGANISM Haloferax sp.
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 254)
AUTHORS Holmes,M.L. and Dyall-Smith,M.L.
TITLE A plasmid vector with a selectable marker for halophilic
archaebacteria
JOURNAL J. Bacteriol. 172, 756-761 (1990)
STANDARD full staff_review
FEATURES from to/span description
rRNA < 1 > 254 16S rRNA
BASE COUNT 67 a 59 c 85 g 43 t
ORIGIN About 140 bp after segment 1.
1 tacgtccggg gtaggagtga aatcccgtaa tcctggacgg accaccgatg gcgaaagcac
61 ctcgagaaga cggatccgac ggtgagggac gaaagctagg gtctcgaacc ggattagata
121 cccgggtagt cctagctgta aacgatgctc gctaggtgtg acacaggcta cgagcctgtg
181 ttgtgccgta gggaagccga gaagcgagcc gcctgggaag tacgtccgca aggatgaaac
241 ttaaaggaat tggc
//
LOCUS HLFRRD3 539 bp ss-rRNA BCT 28-JUN-1990
DEFINITION Haloferax 16S ribosomal RNA, partial sequence.
ACCESSION M33805
KEYWORDS 16S ribosomal RNA.
SEGMENT 3 of 3
SOURCE Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA.
ORGANISM Haloferax sp.
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 539)
AUTHORS Holmes,M.L. and Dyall-Smith,M.L.
TITLE A plasmid vector with a selectable marker for halophilic
archaebacteria
JOURNAL J. Bacteriol. 172, 756-761 (1990)
STANDARD full staff_review
FEATURES from to/span description
rRNA < 1 > 539 16S rRNA
BASE COUNT 132 a 140 c 159 g 108 t
ORIGIN About 45 bp after segment 2.
1 ctcaacgccg gacatctcac cagctccgac tacagtgatg acgatcaggt tgatgacctt
61 atcacgacgc tgtagagagg aggtgcatgg ccgccgtcag ctcgtaccgt gaggcgtcct
121 gttaagtcag gcaacgagcg agacccgcac ttctaattgc cagcagcagt ttcgactggc
181 tgggtacatt agaaggactg ccgctgctaa agcggaggaa ggaacgggca acggtaggtc
241 agtatgcccc gaatgagctg ggctacacgc gggctacaat ggtcgagaca atgggttgct
301 atctcgaaag agaacgctaa tctcctaaac tcgatcgtag ttcggattga gggctgaaac
361 tcgccctcat gaagctggat tcggtagtaa tcgcatttca atagagtgcg gtgaatacgt
421 ccctgctcct tgcacacacc gcccgtcaaa gcacccgagt gaggtccgga tgaggccacc
481 acacggtggt cgaatctggc ttcgcaaggg ggcttaagtc gtaacaaggt agccgtagg
//
LOCUS ZMOADHA 1365 bp ds-DNA BCT 28-JUN-1990
DEFINITION Z.mobilis alcohol dehydrogenase I (adhA) gene, complete cds.
ACCESSION M32100
KEYWORDS alcohol dehydrogenase I.
SOURCE Z.mobilis (strain CP4) DNA.
ORGANISM Zymomonas mobilis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods.
REFERENCE 1 (bases 1 to 1365)
AUTHORS Keshav,K.F., Yomano,L.P., An,H. and Ingram,L.O.
TITLE Cloning of the Zymomonas mobilis structural gene encoding alcohol
dehydrogenase I (adhA): Sequence comparison and expression in
Escherichia coli
JOURNAL J. Bacteriol. 172, 2491-2497 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
K. Keshav, 14-FEB-1990, for release after publication.
FEATURES from to/span description
pept 218 1231 alcohol dehydrogenase I (adhA) (EC 1.1.1.1)
mRNA 108 > 1365 adhA mRNA (alt.)
mRNA 137 > 1365 adhA mRNA (alt.)
site 1261 1289 adhA transcription termination signal
BASE COUNT 370 a 288 c 340 g 367 t
ORIGIN
1 tttttgggcg tatttttatc aagaaaattt aaataatcac atttttatta ttttagattt
61 aagtattgat acaagtgata tctataaatg tttttataac tttctggatc gtaatcggct
121 ggcaatcgtt ttccctatat tcgcaagatg tatgtcagcc gcagattttt cgactgacct
181 ctatctctcc gagatatatc aacaaaaggt agtcaccatg aaagcagccg tcataactaa
241 agatcatacg atcgaagtga aagacaccaa attacgccct ctgaaatacg gggaagcgct
301 tttggaaatg gaatattgcg gggtatgtca taccgatctc catgtgaaaa acggggattt
361 cggcgatgaa accggcagaa ttaccggcca tgaaggtatc ggtatcgtca agcaggtcgg
421 ggaaggggtt acttctctga aagccggtga ccgcgccagt gttgcatggt tcttcaaagg
481 ctgcggccat tgcgaatatt gtgtcagcgg gaatgaaacg ctttgccgca acgttgaaaa
541 tgccggttat acggttgacg gcgctatggc agaagaatgc atcgtcgttg ccgattactc
601 ggtcaaggtg ccagatggtc ttgatcctgc ggttgccagc agcatcactt gcgcgggtgt
661 aaccacctat aaagcagtca aagtttctca gatacagccg ggacaatggc tggccatcta
721 tggcttgggc ggtttaggca atctagccct tcaatatgcc aagaatgttt tcaacgccaa
781 agtgatcgcg atcgatgtca atgatgaaca gctcgctttt gccaaagagt tgggcgcaga
841 tatggtcatc aatccgaaaa acgaagatgc tgccaaaatc attcaggaaa aagtcggcgg
901 cgcacatgcg acggtggtga cggctgttgc caaatccgcc tttaactcgg ctgttgaggc
961 tatccgcgcg ggtggccgtg ttgtcgccgt tggtctgcct cctgaaaaaa tggatttgag
1021 cattcctcgt ttggtgcttg acggtatcga agtcctaggt tccttggtcg gaacgcggga
1081 agatttgaaa gaggccttcc agtttgcagc cgaaggtaag gtcaagccga aagttaccaa
1141 gcgtaaagtc gaagaaatca accaaatctt tgacgaaatg gaacatggta aattcacagg
1201 ccgtatggtt gttgatttta cccatcacta ggttttcgtg aaggcggaag cataaacgga
1261 aaaagccttt ctcttaccag aaaggctttt tctttgtcgt ctgataaaaa ttttcataca
1321 gaattttaac acggcactcg ttgctataag ccgctatcca agctt
//
LOCUS MUSIGKABG 349 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V32 region, 5' end.
ACCESSION M33992
KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain BXSB) adult spleen, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS D'Hoostelaere,L.A. and Klinman,D.
TITLE Characterization of new mouse Vk groups
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.D'Hoostelaere, 04-MAY-1990.
The frame shift between 178 and 184 could occur anywhere within
those boundaries.
Author address: L.A.D'Hoostelaere
Grenzacherstrasse 487
Postfach 4005
Basel, Switzerland
E-mail: IN%BASEL@UTOROCI.bitnet
FEATURES from to/span description
pept 1 178 Ig kappa-chain V32 region precursor
180 > 349 Ig kappa-chain V32 region precursor
sigp 1 60 Ig kappa-chain V-region signal peptide
matp 61 > 349 Ig kappa-chain V-region
site 178 184 frame shift
BASE COUNT 99 a 84 c 76 g 90 t
ORIGIN
1 atgagggtcc ttgctgagct cctggggctg ctgctgttct gctttttagg tgtgagatgt
61 gacatccaga tgaaccagtc tccatccagt ctgtctgcat cccttggaga cacaataacc
121 atcacttgcc atgccagtca gaaaattaat gtttggttaa gctggtacca gcagaaaaaa
181 aaaaggaaat attcctaaac tattgatcta taggacttcc aacttgcaca caggcgtccc
241 atcaaggttc agtggcagtg gatcaggaac aggtttcaca ttaaccatca gcagcctgca
301 gcctgaagac attgccactt actactgtca acagggtcaa aattatccg
//
LOCUS MUSIGKABH 349 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V33 region, 5' end.
ACCESSION M33993
KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain BXSB) adult spleen, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS D'Hoostelaere,L.A. and Klinman,D.
TITLE Characterization of new mouse Vk groups
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.D'Hoostelaere 04-MAY-1990.
The frame shift between 178 and 184 could occur anywhere within
those boundaries.
Author address: L.A.D'Hoostelaere
Grenzacherstrasse 487
Postfach 4005
Basel, Switzerland
E-mail: IN%BASEL@UTOROCI.bitnet
FEATURES from to/span description
pept 1 178 Ig kappa-chain V33-region precursor
180 > 348 Ig kappa-chain V33-region precursor
sigp 1 60 Ig kappa-chain V-region signal peptide
matp 61 > 348 Ig kappa-chain V-region
BASE COUNT 99 a 84 c 76 g 90 t
ORIGIN
1 atgagggtcc ttgctgagct cctggggctg ctgctgttct gctttttagg tgtgagatgt
61 gacatccaga tgaaccagtc tccatccagt ctgtctgcat cccttggaga cacaataacc
121 atcacttgcc atgccagtca gaaaattaat gtttggttaa gctggtacca gcagaaaaaa
181 aaaaggaaat attcctaaac tattgatcta taggacttcc aacttgcaca caggcgtccc
241 atcaaggttc agtggcagtg gatcaggaac aggtttcaca ttaaccatca gcagcctgca
301 gcctgaagac attgccactt actactgtca acagggtcaa aattatccg
//
LOCUS HUMCAIX 2785 bp ss-mRNA PRI 28-JUN-1990
DEFINITION Human carbonic anhydrase I (CAI) mRNA, complete cds.
ACCESSION M33987
KEYWORDS carbonic anhydrase I.
SOURCE Human EBV transformed SH B cell line DNA, and cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2785)
AUTHORS Lowe,N., Brady,H.J.M., Barlow,J.H., Sowden,J.C., Edwards,M. and
Butterworth,P.H.W.
TITLE Structure and methylation patterns of the gene encoding human
carbonic anhydrase I
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Since no intron sequences were provided this entry is treated as if
originating from an mRNA.
Draft entry and computer-readable sequence for [1] kindly submitted
by N.Lowe, 09-MAY-1990.
Author address: N.Lowe
Department of Biochemistry
University College London
Gower Street, London
WC1E 6BT, U.K.
E-mail:UCBCMAR%EUCLID.UCL.AC.UK@CUNYVM.CUNY.EDU
FEATURES from to/span description
pept 1048 1833 carbonic anhydrase I (EC 4.2.1.1)
/hgml_locus_uid="LS0141U" /nomgen="CA1"
/map="8q13-q22"
mRNA 902 1939 carbonic anhydrase I mRNA (alt.)
mRNA 902 2165 carbonic anhydrase I mRNA (alt.)
signal 874 879 TATA box
BASE COUNT 840 a 603 c 492 g 850 t
ORIGIN
1 ctttagccca acagtcaaaa ataattgatg ctaccctaca aatgtccaaa actctagtat
61 atcatatttc taagttacag caaatattag tcctgctaaa ccagggagct ttggcaaaaa
121 tgttttttga cagtaaattt gtccttgatt atatattaac tagtcaaaga ggtgtttgta
181 acattattag agcttcttgt tgtaggtggg ttaacaccac caatcaagag gtcattctaa
241 cagaaagcct ggatcagaaa accatcaccc taaaaaaaca tgccttacat atttaacaca
301 ctctgaaatc cagtcaaaat atgactaaag gcccttgcca tgactgatgt attctcctgg
361 ccaacgccaa acaaatggga gcctggttac gagtcagcct tcagggactt gtcacatttc
421 tacttggttt cttccttgtt attgtcataa taaaatgttt tctatgctgt ttagtgcaac
481 ttaggcccta ttctgtagaa gtctcctcta ctattcaggc cactcaaaca ccccaaataa
541 ttgagttcaa aatcgacatc aagatataaa ggaatcagtg actaaatata tttcatatat
601 ggtattttta ttgattattg tgctgtcttg acctagtatg gaggccttgg ctagaggctg
661 gtcagtttcc tctcttgagc agctgattaa atccacaccc caaccacttc ccttatcagg
721 ttctcacact ctggggccac tatgtaccca ctctaatcac cacagggcca gacatcagac
781 aattaaggac agcgcccatg ccccaaagcc cgccaaaatt atgcaaatta ttcaaaatta
841 ttcaacctag ctaaccccac cctttttgct gtacataagc tgcccattcc ccctccagcc
901 tgtggtaccc agtcctcagg tgcaaccccc tgcgtggtcc tctgtggcag ccttctctca
961 ttcagagctg ttttccacag aggtagtgaa aagaactgga ttttcaagtt cactttgcaa
1021 gagaaaaaga aaactcagta gaagataatg gcaagtccag actggggata tgatgacaaa
1081 aatggtcctg aacaatggag caagctgtat cccattgcca atggaaataa ccaatcccct
1141 gttgatatta aaaccagtga aaccaaacat gacacctctc tgaaacctat tagtgtctcc
1201 tacaacccag ccacagccaa agaaattatc aatgtggggc attctttcca tgtaaatttt
1261 gaggacaacg ataaccgatc agtgctgaaa ggtggtcctt tctctgacag ctacaggctc
1321 tttcagtttc attttcactg gggcagtaca aatgagcatg gttcagaaca tacagtggat
1381 ggagtcaaat attctgccga gcttcacgta gctcactgga attctgcaaa gtactccagc
1441 cttgctgaag ctgcctcaaa ggctgatggt ttggcagtta ttggtgtttt gatgaaggtt
1501 ggtgaggcca acccaaagct gcagaaagta cttgatgccc tccaagcaat taaaaccaag
1561 ggcaaacgag ccccattcac aaattttgac ccctctactc tccttccttc atccctggat
1621 ttctggacct accctggctc tctgactcat cctcctcttt atgagagtgt aacttggatc
1681 atctgtaagg agagcatcag tgtcagctca gagcagctgg cacaattccg cagccttcta
1741 tcaaatgttg aaggtgataa cgctgtcccc atgcagcaca acaaccgccc aacccaacct
1801 ctgaagggca gaacagtgag agcttcattt tgatgattct gagaagaaac ttgtccttcc
1861 tcaagaacac agccctgctt ctgacataat ccagttaaaa taataatttt taagaaataa
1921 atttatttca atattagcaa gacagcatgc cttcaaatca atctgtaaaa ctaagaaact
1981 taaattttag ttcttactgc ttaattcaaa taataattag taagctagca aatagtaatc
2041 tgtaagcata agcttatctt aaattcaagt ttagtttgag gaattcttta aaattacaac
2101 taagtgattt gtatgtctat ttttttcagt ttatttgaac caataaaata attttatctc
2161 tttctttctg ttgtgcattc agtttctaaa accattaagt ttctactcca tttacattca
2221 aaaatcttaa atactttact tgcaagagta ttttgcttca aatacaacaa cctaagagca
2281 gctggagatg aaatattggg aaattcattt gcttactcct gaagacaaaa atatagctga
2341 gatgaccact ggatttaata tcgttatgct ggcccaacat tgctaccatt tgtgttgtct
2401 gtgatcaaaa tgattatctt ttatatagga agatgacgct tctggatatt gctttcactt
2461 cttctcccca cgttagcaag gacaatgctt ctctgccatt attacaacta gttagtttgc
2521 atggagaatc tttactttaa aattggaaga aaagtcacaa gtgaatggtt tataaaaatg
2581 ctaaagaagt cattcttgct tagaatcata tagaaacatc atgcaatctt ttagtcagat
2641 gtgcgcttca ccttatgcta tttttatctt taattgacac acaataattg tacatgttta
2701 tggagtatag tgtggtgttt tctgtttgtt tgtttgtttt ttgagacaag gtctcactct
2761 gccagtcagg gtggagtgcg atggt
//
LOCUS HUMPDHBZ 1408 bp ss-mRNA PRI 28-JUN-1990
DEFINITION Human pyruvate dehydrogenase E1-beta subunit mRNA, 3' end.
ACCESSION M34056 J05549
KEYWORDS pyruvate dehydrogenase E1-beta subunit.
SOURCE Human liver, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1408)
AUTHORS Huh,T.L., Casazza,J.P., Huh,J.W., Chi,Y.T. and Song,B.J.
TITLE Characterization of two cDNA clones for pyruvate dehydrogenase
E1~subunit and its differential regulation in cultured fibroblasts
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.J.Song, 07-MAY-1990.
Author address: B.J.Song
Laboratory of Metabolism and Molecular Biology
National Institute on Alcohol Abuse and Alcoholism
12501 Washington Ave.
Rockville, MD 20852
E-mail: landsman@thebard.nlm.nih.gov
FEATURES from to/span description
pept < 1 1008 pyruvate dehydrogenase E1-beta subunit
precursor /hgml_locus_uid="LU0223C"
/nomgen="PDHB" /map="unassigned" (AA at 1)
sigp 1 18 pyruvate dehydrogenase E1-beta subunit signal
peptide
matp 19 1005 pyruvate dehydrogenase E1-beta subunit
signal 1397 1402 poly-A signal
BASE COUNT 404 a 268 c 339 g 397 t
ORIGIN Chromosome 3p13-3q23.
1 tggaccgcgc cggctgccgt gcaggtgaca gttcgtgatg ctataaatca gggtatggat
61 gaggagctgg aaagagatga gaaggtattt ctgcttggag aagaagttgc ccagtatgat
121 ggggcataca aggttagtcg agggctgtgg aagaaatatg gagacaagag gattattgac
181 actcccatat cagagatggg ctttgctgga attgctgtag gtgcagctat ggctgggttg
241 cggcccattt gtgaatttat gaccttcaat ttctccatgc aagccattga ccaggttata
301 aactcagctg ccaagaccta ctacatgtct ggtggccttc agcctgtgcc tatagtcttc
361 aggggaccca atggtgcctc agcaggtgta gctgcccagc actcacagtg ctttgctgcc
421 tggtatgggc actgcccagg cttaaaggtg gtcagtccct ggaattcaga ggatgctaaa
481 ggacttatta aatcagccat tcgggataac aatccagtgg tggtgctaga gaatgaattg
541 atgtatgggg ttccttttga atttcctccg gaagctcagt caaaagattt tctgattcct
601 attggaaaag ccaaaataga aaggcaagga acacatataa ctgtggtttc ccattcaaga
661 cctgtgggcc actgcttaga agctgcagca gtgctatcta aagaaggagt tgaatgtgag
721 gtgataaata tgcgtaccat tagaccaatg gacatggaaa ccatagaagc cagtgtcatg
781 aagacaaatc atcttgtaac tgtggaagga ggctggccac agtttggagt aggagctgaa
841 atctgtgcca ggatcatgga aggtcctgcg ttcaatttcc tggatgctcc tgctgttcgt
901 gtcactggtg ctgatgtccc tatgccttat gcaaagattc tagaggacaa ctctatacct
961 caggtcaaag acatcatatt tgcaataaag aaaacattaa atatttagtt tggacttgaa
1021 tatcaagtcg ttgaaattta tttgaaatac ttgctggcac tgcacctgga tttgtactgc
1081 aagacctgac tattcataaa ggaaaacgat ttctaaagca acagcaggta tttttgtaca
1141 gggaagttta aatgtgtttg tgtatggaaa actctccact ctcctcccct agatgccatg
1201 cttccttttg tctgttacgg ttgccatgtt ctttgaataa caaattatat cacattttat
1261 cctctctcac cacaaggaca aagtatggat gtggcagagt cctgatgaaa gatgtatcca
1321 aacaagataa cttatatgta taaaattaaa gcatataata cacatttact gttagtttgt
1381 tttgataagg aataaaggaa tttctaac
//
LOCUS HUMPYRUV 1095 bp ss-mRNA PRI 28-JUN-1990
DEFINITION Human pyruvate dehydrogenase E1-beta subunit mRNA, complete cds.
ACCESSION M34055 J05549
KEYWORDS pyruvate dehydrogenase E1-beta subunit.
SOURCE Human liver, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1095)
AUTHORS Huh,T.L., Casazza,J.P., Huh,J.W., Chi,Y.T. and Song,B.J.
TITLE Characterization of two cDNA clones for pyruvate dehydrogenase
E1~subunit and its differential regulation in cultured fibroblasts
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.J.Song, 07-MAY-1990.
Author address: B.J.Song
Laboratory of Metabolism and Molecular Biology
National Institute on Alcohol Abuse and Alcoholism
12501 Washington Ave.
Rockville, MD 20852
E-mail: landsman@thebard.nlm.nih.gov
FEATURES from to/span description
pept 6 1085 pyruvate dehydrogenase E1-beta subunit
precursor /hgml_locus_uid="LU0223C"
/nomgen="PDHB" /map="unassigned"
sigp 6 95 pyruvate dehydrogenase E1-beta subunit signal
peptide
matp 96 1082 pyruvate dehydrogenase E1-beta subunit
mRNA < 1 1095 pyruvate dehydrogenase E1-beta subunit mRNA
signal 1061 1066 poly-A signal
BASE COUNT 288 a 221 c 302 g 284 t
ORIGIN Chromosome 3p13-3q23.
1 ccaagatggc ggcggtgtct ggcttggtgc ggagacccct tcgggaggtc tccgggctgc
61 tgaagaggcg ctttcactgg accgcgccgg ctgccgtgca ggtgacagtt cgtgatgcta
121 taaatcaggg tatggatgag gagctggaaa gagatgagaa ggtatttctg cttggagaag
181 aagttgccca gtatgatggg gcatacaagg ttagtcgagg gctgtggaag aaatatggag
241 acaagaggat tattgacact cccatatcag agatgggctt tgctggaatt gctgtaggtg
301 cagctatggc tgggttgcgg cccatttgtg aatttatgac cttcaatttc tccatgcaag
361 ccattgacca ggttataaac tcagctgcca agacctacta catgtctggt ggccttcagc
421 ctgtgcctat agtcttcagg ggacccaatg gtgcctcagc aggtgtagct gcccagcact
481 cacagtgctt tgctgcctgg tatgggcact gcccaggctt aaaggtggtc agtccctgga
541 attcagagga tgctaaagga cttattaaat cagccattcg ggataacaat ccagtggtgg
601 tgctagagaa tgaattgatg tatggggttc cttttgaatt tcctccggaa gctcagtcaa
661 aagattttct gattcctatt ggaaaagcca aaatagaaag gcaaggaaca catataactg
721 tggtttccca ttcaagacct gtgggccact gcttagaagc tgcagcagtg ctatctaaag
781 aaggagttga atgtgaggtg ataaatatgc gtaccattag accaatggac atggaaacca
841 tagaagccag tgtcatgaag acaaatcatc ttgtaactgt ggaaggaggc tggccacagt
901 ttggagtagg agctgaaatc tgtgccagga tcatggaagg tcctgcgttc aatttcctgg
961 atgctcctgc tgttcgtgtc actggtgctg atgtccctat gccttatgca aagattctag
1021 aggacaactc tatacctcag gtcaaagaca tcatatttgc aataaagaaa acattaaata
1081 tttagtttgg acttg
//
LOCUS ENTOMPX 664 bp ds-DNA BCT 28-JUN-1990
DEFINITION E.cloacae outer membrane protein X (OmpX) gene, complete cds.
ACCESSION M33878
KEYWORDS outer membrane protein X.
SOURCE E.cloacae (strain 2249-1) DNA.
ORGANISM Enterobacter cloacae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 664)
AUTHORS Stoorvogel,J., van Bussel,M.J.A.W.M., Tommassen,J. and
van de Klundert,J.A.M.
TITLE Molecular characterization of an Enterobacter cloacae outer
membrane protein OmpX
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by J.Stoorvogel, 26-APR-1990.
Author address: J.Stoorvogel
Dept of Medical Microbiol, Univ Hospital,
P.O.Box 9600, 2300 RC Leiden,
THE NETHERLANDS
FEATURES from to/span description
pept 83 601 outer membrane protein X precursor
sigp 83 151 outer membrane protein X signal peptide
matp 152 598 outer membrane protein X
signal 18 25 -35 region
signal 43 48 -10 region
binding 74 77 ribosome binding site
signal 620 649 transcription termination signal
BASE COUNT 171 a 156 c 162 g 175 t
ORIGIN
1 atcgtgaatg acaaagagtt tcacagattt ttttgatata tttaaaactt acggacttac
61 ttgaagcaca tttgaggtgg ttatgaaaaa aattgcatgt ctttcagcac tggcagctgt
121 tctggctgtt tccgcaggta ccgctgtagc ggcaacttct actgtaactg gtggttacgc
181 tcagagcgat atgcagggcg tgatgaacaa aaccaacggt ttcaacctga agtaccgtta
241 cgagcaagac aacaacccgc tgggtgtgat cggttctttc acttacaccg agaaagatcg
301 tactgaaaat ggctcttaca ataaaggtca gtactacggc atcaccgcgg gtcctgctta
361 ccgcctgaat gactgggcaa gcatctacgg tgttgtaggt gttggctacg gtaaattcca
421 gcagaccgaa aaccagggtc tgaaccgtac tgcaagcaac agcgactacg gtttctccta
481 tggcgcaggt atgcagttca acccaatcga aaacgttgct ctggacttct cctatgagca
541 gagccgtatc cgcaacgttg acgttggcac ctggatcgcg ggcgtaggtt accgcttcta
601 atcacttcgg tgagccagta aaaaatccgc ccaatgtggc ggattttttt tgtttgtacc
661 ctct
//
LOCUS ACARGA 1212 bp ds-DNA INV 28-JUN-1990
DEFINITION A.castellanii 5S RNA.
ACCESSION M34003
KEYWORDS 5S RNA.
SOURCE A.castellanii (strain Neff) trophozoite DNA, clone pAc5S.1.
ORGANISM Acanthamoeba castellanii
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina;
Acanthamoebidae.
REFERENCE 1 (bases 1 to 1212)
AUTHORS Zwick,M.G., Wiggs,M. and Paule,M.R.
TITLE Cloning, sequencing and genomic organization of a 5S RNA gene from
the eukaryotic protist Acanthamoeba castellanii
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by M.R.Paule, 04-MAY-1990.
Author address: M.R.Paule
Colorado State University, Dept. Biochemistry,
Fort Collins, CO 80523
FEATURES from to/span description
RNA 489 610 5S RNA precursor
RNA 489 607 5S RNA
BASE COUNT 195 a 374 c 329 g 314 t
ORIGIN
1 ctgcaggaga ttcgtcgggc tcgtgccgcg atcgtcttga tgatcgagac cgacggcccg
61 ttcctcactc cgcgcacctt gcgccttgcc cgcgccgcaa cgagccgtct tacctcaccc
121 acgtcctaca gagtacttac atcctccaat gtgttgtgtg cgtgtcatgt gcccaatgta
181 tgtggtgtgt gcggtgtgtg cggtgtgtga ggtgtgtggt gtgcgcgtgt catgtctgtt
241 acacgttctg ttgaccaaca ttttttatgg tgtgtacagc ggtcgccgac tgcaccggac
301 gatcggtaga ggaggtggcc ggtgaaacca cgcagaccgc ggtgcgcttc ttcggtctgg
361 accaaatccg taaggagtga atgtagcaca cccctcccct ccatgtcatt gaagcccaag
421 ttagtgtgca atttaaatga tggtgtgacc aagagggatc acatgggcga agtccctggt
481 aggctgatgg atacggccat actgcgcaga aagcaccgct tcccatccga acagcgaagt
541 taagctgcgc caggcggtgt tagtactggg gtgggcgacc acccgggaat ccaccgtgcc
601 gtatcctcct ttttgcttct tcactctcgc cactattgct tcaggtcgcc tcatcgcctg
661 ccacagctga ctacaaactg ggtgagccaa acctgccttt tgcactggtc cagtgtacaa
721 ggctggaaca actttgtttc acacttgctg aggtcaatca atccagcctt gttggttgag
781 gccacaggag ggatctgcca cctttcacac catctcgtcg agaaggctgc ttgacgcttc
841 ttcgctgtcc ttcggggctg atgtctcgac tgtgtgtccg tcttcatcgc gattgtcact
901 tgcgttgttg ttgttgcggc tttcgtccct gtcctcatcg ccgtcgtcgt cttcatcgct
961 gttgtcgcgc gactcactcc gcaaaccgtc tggccgtgtg ctgctcctcc tcttcatcgt
1021 tgctgtcatc tcggtcgctg cggccacgcc tcgaagaaga cagcccgccc tcttcggccg
1081 cctcggtccc gcccctcgca aagcggatct cgcctcgtcg gctgccgctg gccgtcgctc
1141 cgcctggcgt cgttgtcttc cctcgcatcg tcctcaaact tgtcgtgcgt ccacttgtcg
1201 gcaaatgtcg ac
//
LOCUS HUMCDR34 2412 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene,
complete cds.
ACCESSION M31423 M34243
KEYWORDS cerebellar-degeneration-related antigen.
SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2412)
AUTHORS Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A.,
Chaganti,R.S.K., Posner,J.B. and Old,L.J.
TITLE Cerebellar degeneration-related antigen: A highly conserved
neuroectodermal marker mapped to chromosomes X in human and mouse
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
Y.-T.Chen, 17-JAN-1990
FEATURES from to/span description
pept 503 1174 cerebellar-degeneration-related antigen
(CDR34)
BASE COUNT 743 a 334 c 669 g 666 t
ORIGIN
1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct
61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat
121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg
181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga
241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg
301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag
361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg
421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga
481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt
541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt
601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga
661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat
721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt
781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga
841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct
901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt
961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga
1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt
1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat
1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg
1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa
1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac
1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa
1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt
1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact
1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat
1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct
1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga
1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct
1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac
1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg
1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata
1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa
1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc
2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt
2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta
2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga
2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta
2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat
2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt
2401 gagatgctgc ag
//
LOCUS RATHGF 2485 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Rat hepatocyte growth factor mRNA, complete cds.
ACCESSION M32987
KEYWORDS hepatocyte growth factor.
SOURCE Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2485)
AUTHORS Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M.,
Shimizu,S. and Nakamura,T.
TITLE Deduced primary structure of rat hepatocyte growth factor and
expression of the mRNA in rat tissues
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by T.Nakamura, 26-MAR-1990.
FEATURES from to/span description
pept 143 2329 hepatocyte growth factor
BASE COUNT 750 a 533 c 577 g 625 t
ORIGIN
1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc
61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga
121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct
181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca
241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa
301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa
361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc
421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt
481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa
541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg
601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct
661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag
721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat
781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg
841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga
901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta
961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc
1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg
1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca
1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa
1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat
1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta
1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg
1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc
1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg
1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga
1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca
1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa
1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc
1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat
1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct
1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct
1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac
1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg
2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt
2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga
2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt
2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc
2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa
2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa
2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca
2461 ttaatatatg tggcgttttc tgttg
//
LOCUS DDIDPYK1A 1090 bp ss-mRNA INV 28-JUN-1990
DEFINITION D.discoideum protein-tyrosine kinase-1 (DPYK1) mRNA, complete cds.
ACCESSION M33785
KEYWORDS protein-tyrosine kinase-1.
SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 1090)
AUTHORS Tan,J.L. and Spudich,J.A.
TITLE Developmentally regulated protein-tyrosine kinase genes in
Dictyostelium
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.L.Tan, 20-APR-1990.
Author address: J.L.Tan
Stanford University School of Medicine
Dept. of Cell Biology
Stanford, CA 94303
(415) 723-7408
FEATURES from to/span description
pept < 1 1014 protein-tyrosine kinase-1 (DPYK1; AA at 1)
BASE COUNT 371 a 198 c 186 g 335 t
ORIGIN
1 cgcccatttg gtggttggga aactcaatca tcattatcac atccaccatc acgtccacca
61 ccacctccac caccaccacc acaactacca gttagatcag aatacgagat tgatttcaat
121 gaattagaat ttggtcaaac cattggtaaa ggtttctttg gtgaagtaaa gagaggttat
181 tggagagaga ctgatgttgc cataaaaatc atctatcgtg atcaattcaa aaccaaatca
241 tcattggtta tgtttcaaaa tgaagttgga atactaagta aattaagaca tccaaatgta
301 gttcaatttt tgggtgcatg tactgcagga ggtgaagatc atcattgtat agtaacagaa
361 tggatgggtg gaggtagttt aagacagttc ttgactgatc atttcaattt actcgaacaa
421 aatccacata ttcgtttgaa gttggctttg gatattgcaa aaggaatgaa ttatctacat
481 ggttggactc cacccattct tcatcgtgac ttatcctcaa gaaacatttt attggatcac
541 aacatcgatc caaagaatcc gttagtttcc tcaagacaag atattaaatg taagatctct
601 gattttggtc taagtagatt aaagaaggaa caagcctctc aaatgactca atcggttggt
661 tgtattccct acatggcacc agaggttttc aaaggcgata gtaatagtga aaagagtgat
721 gtttactcct atggcatggt tttgtttgaa ctattaacct ctgatgaacc tcaacaagat
781 atgaaaccaa tgaaaatggc tcacttggct gcttatgaat cttatcgtcc tccaattcca
841 ttaactacct cttccaagtg gaaagaaatt ctaactcaat gttgggattc taatcctgat
901 agtcgtccaa cctttaaaca aatcattgtt catctcaaag aaatggaaga tcaaggtgta
961 tcttcttttg catctgtacc tgttcaaact attgatactg gtgtttatgc ttaatttttt
1021 ttttataatt aaaaaaaaaa aaaacaaaac aaaaaaaaaa aataataata aatataatca
1081 cttcaactcg
//
LOCUS DDIDPYK2A 1292 bp ss-mRNA INV 28-JUN-1990
DEFINITION D.discoideum protein-tyrosine kinase-2 (DPYK2) mRNA, complete cds.
ACCESSION M33784
KEYWORDS protein-tyrosine kinase-2.
SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 1292)
AUTHORS Tan,J.L. and Spudich,J.A.
TITLE Developmentally regulated protein-tyrosine kinase genes in
Dictyostelium
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.L.Tan, 20-APR-1990.
Author address: J.L.Tan
Stanford University School of Medicine
Dept. of Cell Biology
Stanford, CA 94303
(415) 723-7408
FEATURES from to/span description
pept < 1 1234 protein-tyrosine kinase-2 (DPYK2; AA at 2)
BASE COUNT 491 a 207 c 204 g 390 t
ORIGIN
1 cgattctaca agtacaacaa actctactaa agatatcaca tttttagttt gtgataatcc
61 tgattcaact aaagaaaaga gtaacgtttc aaatacttca tcaataattt ccgcttcaaa
121 tttaaataga catataacac caaattctca tatgagacct agaggtagat caatttctga
181 atctttaatt atgtcaccaa ttaataaaga atctttaaat gatattcaaa gagcaattga
241 aagtgaaaaa ataaagaaaa ctaaatttga agaattaaaa tcaatattgg gcgaaagaga
301 atatataatt gatataaatg atattcaatt tatacaaaaa gttggagaag gtgcattcag
361 tgaagtttgg gaaggttggt ggaaaggtat tcatgttgcc ataaaaaagt taaagattat
421 aggagatgaa gaacaattca aagagagatt cattagagag gttcaaaatt tgaaaaaagg
481 aaatcatcaa aacattgtca tgtttattgg tgcatgttat aaaccagcat gtatcataac
541 agagtatatg gcaggtggta gtctttacaa tatacttcat aatccaaata gttccactcc
601 aaaagttaaa tattctttcc cattggtttt gaaaatggca accgacatgg cattgggctt
661 attacatctt cattccatca ccattgtgca tcgtgattta accagtcaaa acattctatt
721 ggatgaattg ggtaatataa agatctctga ttttggttta tctgctgaaa agagtagaga
781 aggttcaatg acaatgacaa atggtggcat ttgcaatcca agatggagac cacccgaatt
841 gacaaagaat ttaggtcact actcggaaaa ggttgatgtc tattgtttct ctctagtagt
901 ttgggaaatt ttaactggcg aaattccttt ctctgattta gatggatctc aacgatccgc
961 tcaagtagct tatgctggtt taagaccacc aataccagag tattgcgatc ctgaattaaa
1021 attactctta actcaatgtt gggaggctga tccaaatgat agacctccct ttacctatat
1081 agtaaacaaa ttaaaagaaa tctcttggaa taatccaatt ggtttcgtct ctgatcaatt
1141 ctatcaatat agcgaacctt caactccaag attagcatta tcaaatcaat cttcaaattc
1201 aagtagtatt tctttatcac caactaaatt ataaaaaaaa aaaaaaaaaa aaacaaattt
1261 caaacaccaa acaccaccac tcatcaaaat cg
//
LOCUS MUSH2A1X 929 bp ds-DNA ROD 28-JUN-1990
DEFINITION Mouse histone H2A.1 gene, complete cds.
ACCESSION M33988
KEYWORDS histone H2A.1.
SOURCE Mouse (BALB/c) DNA, clone 53.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 929)
AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and
Sch|mperli,D.
TITLE Structure of a mouse histone gene cluster
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.Sch|mperli, 02-MAY-1990.
Author address: Dr.Schmperli
Zoologisches Institut, Baltzerstrasse 4
CH-3012 Bern
SWITZERLAND
FEATURES from to/span description
pept 164 556 histone H2A.1
mRNA 119 624 histone H2A.1 mRNA
BASE COUNT 238 a 247 c 222 g 222 t
ORIGIN
1 gagctcaaat tctggctttc tattgggtac gatatattaa ccaatgggag aaacacaaac
61 agaatacctc cagttagtat aaatgcttgc tgttcagttg cagaatttac tatatattct
121 tttcctttct ctgctttgcc tttactgata cttaaacgca tacatgtctg gacgcggaaa
181 gcaagggtgc aaggcccgcg ctaaggccaa gacccgctcc tcccgggccg gcctgcagtt
241 ccccgtgggc cgcgtgcacc ggctgctccg caagggcaac tactcggagc gcgtgggcgc
301 cggcgccccg gtgtacctgg cggctgtgct ggagtacctg acggccgaga tcctggagct
361 ggcgggcaat gcggcccgcg acaacaagaa gacgcgcatc atcccgcgcc acctgcagct
421 ggccatccgc aacgacgagg agctcaacaa gctgctgggc cgcgtgacca tcgcgcaggg
481 cggcgtcctg cccaacatcc aggccgtgct gctgcccaag aagaccgaga gccaccacaa
541 ggccaagggg aagtaatctg gcgattgtct gtactgccca gttgaaagtt aaccaaaaca
601 aaggctcttt tcagagccac ccacatcttt ccataaaatg agctgccacc tcgtgaaacg
661 ttcttccact acagttttta tactacatat gaaaaagtta cgaagtagct ttcaatctta
721 gtaaattgat tttaatactg ttagtccctg cgataaatct tacgaccttc cttagtttga
781 gtcaaaagtg tgtaagagat gaaaccttta gaacatacta taaattttta gtagaaattt
841 ggcacccagg tttgtcattc acgtcacgat tgtctagagc ataatggtag taagggctaa
901 gggccattaa atcccacttc catagtttc
//
LOCUS MUSH2A1XX 693 bp ds-DNA ROD 28-JUN-1990
DEFINITION Mouse histone H3.2 gene, complete cds.
ACCESSION M33989
KEYWORDS histone H3.2.
SOURCE Mouse (strain BALB/c) DNA, clone 53.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 693)
AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and
Sch|mperli,D.
TITLE Structure of a mouse histone gene cluster
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.Sch|mperil, 02-MAY-1990.
Author address: D.Schlmperil
Zoologisches Institut, Baltzerstrasse 4
CH-3012 Bern
SWITZERLAND
FEATURES from to/span description
pept 151 561 histone H3.2
mRNA 116 611 histone H3.2 mRNA
BASE COUNT 159 a 206 c 184 g 144 t
ORIGIN
1 caggatttag aagcagaggc tgaccaatcc caacaaagcg cgggcccttt gaatgttctt
61 cggtccaata gcggatagtc tgattgtata aaaggtggac agcgccttgc agctcactat
121 agtgtcagtc tattttccct tgttaaagtc atggctcgta ctaagcagac cgctcgtaag
181 tccaccggtg gcaaggcccc gcgtaagcag ctggccacca aggccgcccg caagagcgcc
241 ccggccaccg gcggcgtgaa gaagcctcac cgctaccgtc ccggcaccgt ggcgctgcgc
301 gagatccggc gctaccagaa gtcgaccgag ctgctgatcc gcaagctgcc gttccagcgc
361 ctggtgcgcg agatcgcgca ggacttcaag accgacctgc gcttccagag ctcggctgtc
421 atggctctgc aggaggcgag cgaggcctac cttgtgggtc tgtttgagga caccaacctg
481 tgcgccatcc acgccaagcg tgtcaccatc atgcccaagg acatccagct cgcccgccgc
541 attcgtgggg agagggctta aacgaaacaa agtgcaaact ggaacccaaa ggctcttttc
601 agagccaccc acagtttctg taaaagcagc tgttactttt ctactgtctt ggaaactttg
661 tttttcaaag atttatctag ttacaaagta tac
//
LOCUS NEUPSAS 3681 bp ds-DNA PLN 28-JUN-1990
DEFINITION N.crassa arginine-specific carbamoyl phosphate synthetase (ARG-2)
gene, complete cds.
ACCESSION J05512
KEYWORDS carbamoyl phosphate synthetase.
SOURCE N.crassa (strain 74-OR23-IVA)asexual phase DNA, clone pAR231, and
cDNA to mRNA, clone pAB1.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 3681)
AUTHORS Orbach,M.J., Sachs,M.S. and Yanofsky,C.
TITLE The Neurospora crassa arg-2 locus: Structure and expression of the
gene encoding the small subunit of arginine-specific carbamoyl
phosphate synthetase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by M.S.Sachs, 07-MAY-1990.
FEATURES from to/span description
pept 1451 1630 arginine-specific carbamoyl phosphate
synthetase, exon 1 (ARG-2) (EC 6.4.5.5)
1687 2868 arginine-specific carbamoyl phosphate
synthetase, exon 2
pept 870 902 ORF, exon 1
1270 1282 ORF, exon 2
1359 1387 ORF, exon 3
pre-msg 752 3082 ARG-2 mRNA and introns
pre-msg 809 3082 ARG-2 mRNA and introns (alt.)
IVS 1631 1686 ARG-2 intron A
IVS 903 1269 ORF intron A
IVS 1283 1358 ORF intron B
signal 161 166 TGACTC box
signal 568 573 TGACTC box
signal 1008 1013 TGACTC box
signal 1307 1312 TGACTC box
BASE COUNT 760 a 1093 c 863 g 965 t
ORIGIN
1 ggatccagac gctatatttg attgttccct cttctgtcag cgtgcggcct ccgttgtccg
61 tgcagccgat tacgacggcc ggctctggcg caacctgggt ttgggagctt cagaagcttg
121 aaagcgaggc gaatcgcgac tgtacaagtc cccgcaactt gagtcaccga aagcgcgggg
181 tatccaccat ctttatctgc ggcggtaggt aggcggggac tttgcggggg ctttgcggga
241 cccgaccact ccaactccga gattcgttgg gggtgccccg ggcaggagat gccgagctgg
301 agtcgcgttg tttgtgaatg tcgtggctgg ggttgtgggc ataggttgcg atgtgtagtg
361 tacgtagtac atagtaggta tgttgaacac cggatgggtg attccagttg tgaggctccg
421 ggactagtag gggtggtaac cgtgatggtc aaagggcatg tggaacttgc agcgtcccaa
481 cgaagtccag gtccccagtc atcccaacgc cgcgatcaat cacccctcgg tccccgtaaa
541 tcggcctttg ggactgccta gcagcgggag tcaccgagtc ctggaccaat caggtcggcg
601 ctgaccctgt cactgcggca gaaaaaagac aaagagctca aaaaattcta ccagatccaa
661 tcaaatgtcg ccctttttct atttgcctca cccactttcg caccctcttt tatctccaag
721 tttctcccct cgacactaca actcactccg tcatcaactc ttgcccatct acaaaacaca
781 caataacccc aatcgcttcc cccgactttc ggactctttc gtccaacctt ctttcttcag
841 ctgaacttgt cttgtcgcaa tctgccctta tgaacgggcg cccgtcagtc ttcacctctc
901 aggtacgata acgcgatagc ccctccatcg tgctggtatc cagccaccat ccacccagtc
961 catcaaagaa gttcccccgg atgagcaatg aagcccccat accagcctga ctcgcgatgc
1021 tgtatcggag tagttgggac gacccgccag taacccacca acctgggtgc ttggcatcat
1081 gccgctctga gttggttttg ggagatgtgg ccagcggtga tccccaaagt tcagctcttc
1141 acggattccg attgcatatc tctcaagatc gcgcagacag ccatcctggg ggagacttct
1201 tctcctcgct tctgccgctt gccgtctgtt tgtagtgtct gatcaaaact aattctgttg
1261 cctttgtagg attacctctc aggtccgcat ccccgccgac tcccgttgac tcaccctttt
1321 ccggctggaa tactgacgat gaattttgtg tgttacagac catctgtgga gagcccttaa
1381 cgcataagag cctctcatca cccagcagcc gtaccaatca ccaccgcacc ccatcaccat
1441 tcaagtcaag atgttctctc gcttggccgc tcgtctcccc aaggcttctg ccctcaatgg
1501 cgtcgccgcc cgtcaggttc gcaacctgag ccagcccgcc attaccggca gcaagggcag
1561 gaacatgccc gcccgtgagc cgcgcactac tgccgctgcc accggcgccg aggccacctt
1621 cactatcagg gtgcgtacct acaactagtc ttctctcgtg gctaaaccta acattttctc
1681 gctcaggatg gccccgtttt ccagggtacc gccttcggtg ccaacaccaa catttctggt
1741 gaagccgttt tcaccacctc ccttgttggt taccccgagt cgatgaccga tccctcgtac
1801 cgcggccaga ttttggtctt cacccagccc ttgattggca actacggtgt cccttcgaat
1861 gagcgtgacg agttcaacct cctaagtact tcgagtctcc ccacatccag tgcgccggta
1921 tcgttgtctc cgatgtcgct acccagtaca gccactggac tgctgttcag agcttgggcg
1981 agtggtgcgc cagcgagggc attcccgcca ctctccggtg ttgacactcg tgcgattgtc
2041 acctacctcc gtgagcaggg ttcctctctc gccaggatct ccattggcga cgagtacgac
2101 gccgatgagg atgagggctt cattgaccct ggccagatca accttgtcaa gcgcgtgtcc
2161 accaaggctc cctttgtggt taccaacccc aacgccaagt tccacgtcgc tctgatcgac
2221 tgcggtgtca aggagaacat ccttcgtagc ttggtcagcc gtggcgcctc cgtcaccgtc
2281 ttcccctaca attaccccat ccacaaggtt gccgagaact tcgatggtgt cttcatttcc
2341 aacggccctg gtgatcccac ccactgccag gagactgttt acaacctcgc caagcttatg
2401 gagacttccc ctattcccat catgggtatc tgccttggtc accagcttct tgctctcgct
2461 gttggtgcca agaccatcaa gctaagtatg gtaatccgtg cccacaacat cccggctctt
2521 gacttgacca ctggtcagtg ccacatcacc tcgcagaacc acggttacgc cgtcgatatc
2581 agcaccctcc ctagcgactt caaggagtat ttcgtcaacc ttaacgacgg ctccaacgag
2641 ggtatgatgc acaagacccg ccccatcttc tctacccagt tccaccccga ggctaagggt
2701 ggtcccatgg actcttccta cctctttgac aagtacatgg agaatgtcga gcttttcaag
2761 agcaactccc aggtctaccg tgacaacagg cctactcagt tcatgattga cattctcagc
2821 aaggagcgtg tcggcgttga gcccactcct ctttctaacg ctgcttaacg gacgagaatg
2881 tctttgcatt ttcttagtga acggtgtttg tcattggtct tgggttattg ttttccagca
2941 caaaggatat gggttttgta aaagggactt tcttgggcga cgtcctctcg ttattaatgg
3001 tttagggtgt atatgatatc ttttgttact acttttgaat gcagaatcag ataagtttag
3061 tcggcattag gctgatatgg cgcagagagg acaatatcta gttgatacat acccatgtgc
3121 tatcacgtct aaacgtcttc cagtcgttgt attttaccca ttcctggtga ttgccgtatg
3181 tcttccagtc cagtacctcc cctaattcgg tgcccttcaa cattcagggt tatgttgaca
3241 gcgtgctgta taagcatggt tgccggggaa catggtgtca ttgactggta agcagggcag
3301 cctctggttc tgtgacacct ggttttccct ggtcttgatt tcaaactatt atcaaacaaa
3361 tcagcgtcgt cttagtagat ggcaggtatc ttagaacaga aagtagttat tcttcgtcgg
3421 atacgactcc aaaatcatat acgaacaagt ggatgcttta tactatgctt ctttttactg
3481 taatcttggt ctcgctcttc gcttagcctt cttcacctgg taaccggtat aggtaccttc
3541 gcaccctcaa ttggaaggat gcatcaaccc ctccatgcgc tgcttatcct gctatctatc
3601 catacatttc tatgttttta cagtgtagaa tatttcttgt gtatcagggt ccgtcccgcg
3661 atgctacccc ctgtcctgca g
//
LOCUS MUSIMPD 1707 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Mouse IMP dehydrogenase mRNA, complete cds.
ACCESSION M33934
KEYWORDS IMP dehydrogenase.
SOURCE Mouse, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1707)
AUTHORS Tiedemann,A.A. and Smith,J.M.
TITLE Isolation and DNA sequence of a cDNA encoding mouse IMP
dehydrogenase
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by J.M.Smith 27-APR-1990.
Author address:
Dr. J.M. Smith
Seattle Biomedical Research Institute
4 Nickerson St.
Seattle, WA 98109
FEATURES from to/span description
pept 67 1611 IMP dehydrogenase (EC 1.2.1.14)
BASE COUNT 457 a 408 c 451 g 391 t
ORIGIN
1 cctctgcggc gcggtcctcg gagcggcgcg gttctcggag ccacgcgtct gtcttcctcc
61 gtggtcatgg cggactacct gattagcgga ggcacctctt acgtgccgga cgacgggctc
121 acagcgcagc agctcttcaa ctgcggggac ggcctcacct acaatgattt tctcattctt
181 cctgggtata tcgacttcac tgcagatcag gtggacttga cgtctgctct aactaagaag
241 attacactaa agaccccatt ggtttcctca cccatggaca ctgtcacaga ggctggaatg
301 gccatcgcga tggcgcttac aggaggtatt ggtttcatcc accacaactg cacacctgaa
361 ttccaggcca atgaagttcg gaaagtgaag aaatacgaac agggattcat cactgacccc
421 gtggtcctta gccccaagga tcgtgtacgc gatgtttttg aggccaaagc caggcatggc
481 ttctgtggta tccccatcac agatacaggc cggatgggga gtcgattggt gggcatcatc
541 tcctcaaggg acattgattt cctcaaggag gaagagcatg accggttctt ggaagagatc
601 atgactaaga gggaagattt ggtggtcgcc cctgccggcg tcactctgaa agaggcaaat
661 gagattctgc agcgaagtaa aaagggaaag ttgcccattg tgaatgaaaa tgatgagctg
721 gtagccatca ttgcccggac agacctaaag aagaatcgtg attaccccct ggcctccaaa
781 gatgccaaga agcaactgct gtgtggggca gccattggca ctcatgagga tgacaagtat
841 aggctggact tactggccct tgctggtgtg gatgtagtgg ttttggactc ttcccaggga
901 aactccatct tccaaatcaa tatgatcaaa tacatcaagg agaagtatcc cagtctacag
961 gtcattggag gcaatgtagt cactgctgcg caagccaaga acctcataga tgcaggtgta
1021 gatgctttgc gagtcggcat gggaagtggt tccatctgca tcacccagga agtgttggcc
1081 tgtgggcggc cccaagccac agcagtgtac aaggtctctg agtatgcccg tcgctttggt
1141 gttcctgtta ttgctgatgg aggaatccaa aatgtgggtc atattgccaa agctttggct
1201 cttggggctt ccacagtcat gatgggctcc ctcctggctg ccaccactga ggcccctggc
1261 gagtacttct tctcagatgg gatccggctg aagaaatacc gaggtatggg ttctcttgat
1321 gccatggaca aacatctcag cagccagaac cgatacttca gtgaagctga caaaatcaaa
1381 gtggcccaag gagtttcagg ggcagtgcag gacaaggggt ctatccacaa gttcgttcct
1441 tacctgattg ctggcatcca gcattcctgt caagacattg gtgccaagag tttaacccaa
1501 gtcagagcca tgacgtactc gggggagctt aaatttgaga agaggacatc ctctgctcag
1561 gtggaaggtg gcgtccacag cctccattcg tacgagaaac ggcttttctg aaaacagatc
1621 cagtatatgc cttgaatttt tcaataaaag tttgggaaaa aaaaagtgaa aaaaaaaaaa
1681 aaaaaaaaaa aaaaaaaaaa aaaaaaa
//
LOCUS HUMOSTRO 1425 bp ss-mRNA PRI 28-JUN-1990
DEFINITION Human osteopontin mRNA, complete cds.
ACCESSION J04765
KEYWORDS osteopontin; phosphoprotein.
SOURCE Human adult osteoblast, cDNA to mRNA, clone HOP-10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1425)
AUTHORS Young,M.F., Fisher,L.W., McBride,O.W. and Termine,J.D.
TITLE cDNA cloning, chromosomal location and RFLP analysis of human
osteopontin
JOURNAL Genomics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by M.F.Young, 04-MAY-1990.
FEATURES from to/span description
pept 68 1012 osteopontin precursor
/hgml_locus_uid="LK0223T" /nomgen="OPN"
/map="unassigned"
sigp 68 115 osteopontin signal peptide
matp 176 1012 osteopontin
mRNA < 1 1425 osteopontin mRNA
BASE COUNT 449 a 293 c 296 g 387 t
ORIGIN Chromosome 4q13.
1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact
61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc
121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga
181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca
241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc
301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga
361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca
421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac
481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg
541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga
601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt
661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga
721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca
781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc
841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt
901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct
961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa
1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa
1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg
1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa
1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg
1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact
1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag
1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag
//
LOCUS RATALADPS 1612 bp ds-DNA ROD 28-JUN-1990
DEFINITION Rat delta-aminolevulinate dehydratase processed pseudogene,
complete cds.
ACCESSION J04764
KEYWORDS delta-aminolevulinate dehydratase; pseudogene.
SOURCE Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1612)
AUTHORS Bishop,T.R., Frelin,L.P. and Boyer,S.H.
TITLE Isolation and characterization of a rat delta-aminolevulinate
dehydratase processed pseudogene
JOURNAL Genomics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by T.R.Bishop, 20-APR-1990.
FEATURES from to/span description
pept.ps 271 1264 pseudo-delta-aminolevulinate dehydratase
rpt 101 108 8 bp flanking direct repeat
rpt 1385 1392 8 bp flanking direct repeat
BASE COUNT 404 a 411 c 418 g 379 t
ORIGIN
1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt
61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt
121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca
181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta
241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc
301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa
361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct
421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga
481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca
541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg
601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat
661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg
721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat
781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac
841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat
901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga
961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca
1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct
1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa
1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc
1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga
1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa
1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga
1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta
1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt
1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg
1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at
//
LOCUS MZEZEISP 215 bp ds-DNA PLN 28-JUN-1990
DEFINITION Z.mays zein storage protein gene, 5' flank.
ACCESSION M33830
KEYWORDS storage protein; zein.
SOURCE Z.mays endosperm DNA, clone pMS1.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 215)
AUTHORS Grasser,K.D., Maier,U.-G., Haass,M.M. and Feix,G.
TITLE Maize high mobility group proteins bind to CCAAT and TATA boxes of
a zein gene promoter
JOURNAL J. Biol. Chem. 265, 4185-4188 (1990)
STANDARD simple staff_review
BASE COUNT 84 a 42 c 26 g 63 t
ORIGIN
1 gtataaatta taacccaaca aagtaatcac tcaatgtcaa aaccaactag ataccatgtc
61 atctctacct tatcttacta atatcttttt gcaaattcga aattaatctt gcacaagcac
121 aaggactgag atgtgtataa atatctctta gattagtagc taatatatcg cacatattat
181 tgagaccaac tagcaacata gaaagcacaa tagtg
//
LOCUS CHKARCASR 2392 bp ss-mRNA VRT 28-JUN-1990
DEFINITION Chicken 5-aminoimidazole ribonucleotide (AIR) carboxylase-5-
aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR)
synthetase, complete cds.
ACCESSION M31764
KEYWORDS 5-aminoimidazole ribonucleotide carboxylase;
5-aminoimidazole-4-N-succinocarboxamide ribonucleotide synthetase.
SOURCE Chicken liver, cDNA to mRNA, clone pZD1.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 2392)
AUTHORS Chen,Z., Dixon,J.E. and Zalkin,H.
TITLE Cloning of a chicken liver cDNA encoding 5-aminoimidazole
ribonucleotide carboxylase and
5-aminoimidazole-4-n-succinocarboxamide ribonucleotide synthetase
by functional complementation of Escherichia coli pur mutants
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3097-3101 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Zalkin, 01-FEB-1990.
FEATURES from to/span description
pept 59 1339 5-aminoimidazole ribonucleotide (AIR)
carboxylase-5-
aminoimidazole-4-N-succinocarboxamide
ribonucleotide (SAICAR) synthetase
matp 59 835 SAICAR synthetase domain
matp 836 1336 AIR carboxylase domain
site 2341 2347 poly-A signal
BASE COUNT 696 a 466 c 527 g 702 t 1 others
ORIGIN
1 ggcgctcagt ttctttcctg tgccttaggt gcctgtcgcc gccgccgctg tcgccgccat
61 ggcccccgcc gcctcagaac tgaaacttgg taaaaaagtt aatgagggta aaacgaaaga
121 agtgtatgag ctgccggata tcccgggatg cgttctgatg cagtccaagg accaaataac
181 agcggggaat gccgctagga aggaccggat ggaggggaag gctgccatct ccaacaccac
241 caccagctgc gtgttccagc tgctgcagga agcaggaatc aaaacggctt ttgtcaggaa
301 acagagtgac acagctttca tagcagctca ctgtgaaatg atcccaattg aatgggtctg
361 cagaagaatt gctactggct ctttcctcaa aagaaaccct ggtgtcaaag aaggctataa
421 gttttaccca cctaaaattg agatgtttta caaggatgat gccaataatg atccacagtg
481 gtctgaggag cagctaattg aagcaaaatt ctcttttgct ggacttacta ttggcaagac
541 tgaagtggat attatggctc gttctactca agctattttt gagatcctgg aaaaatcatg
601 gcagccccaa aactgcactc tggtggacct gaagattgaa tttggtgtta atattttgac
661 caaagaaatt gttcttgctg atgttattga taatgattca tggagactgt ggccatcggg
721 agacagaagc cagcagaagg acaaacagtc ctatcgagat ctgaaggaag tgactcctga
781 agcattgcag atggttaaga gaaactttga atgggttgca gaaagagtag agttgcttct
841 gaaaacaaag agccaaggta gagttgtggt attgatggga tctacttctg accttggcca
901 ctgtgagaaa ataaagaagg catgtgcaac ctttggaatt ccttgtgagt taagagtaac
961 ctccgctcac aaagggccag atgaaactct gaggatcaaa gcagaatatg aaggagatgg
1021 aatcccgact gtgtttgttg cagtagctgg cagaagcaat ggtttagggc cagtaatgtc
1081 tggtaacact gcttaccctg ttgtcaactg tcctcccctc tcatctgact ggggtgctca
1141 ggatgtgtgg tcctctctca gactgcccag tggtcttggc tgtcctacta ctctgtcacc
1201 tgaaggagct gctcagtttg ctgcccagat ttttggttta aacaaccact tggtatgggc
1261 caaactgcga tcaaacatgt taaatacatg gatctctttg aagcaggctg acaaaaaatt
1321 gcgggagtgc accttgtaag tccaaccaac aagtaactcc tcctcgctac acaaagatag
1381 tggcgtgcat atgcatttgt atcaggattt gcagttggat gatgtgagac atcttttctt
1441 acagaaggaa gcatcttcag agcagcagct gagtctgcta tgtcctcatt ccttttcttg
1501 tgtatgtttt tggttttttt agtatagcca cttagtaaaa gatgagaagg taatatttta
1561 gttatccttc tgttgatcca aaatagtgct gttctgctct ttagtgtata ctgttaacta
1621 ctgatgatgg ttatgtcaac acaatgattg atttggtgtg atttaaatag aaaccccctt
1681 ttttttcccc ctgtaacagt aacattcttc catagtaata atctatgtcc tgaagcttat
1741 taactcaatg ggactgctaa ctaaaactag tgctgatgta gtgctgtaat aatggtatgc
1801 tttcttttag caataaaatg gcaacatatg ctgtattaat aagaaaatag gctaagaatg
1861 agtaaaagcc tgaaagaaat aactgcctta tttaaaagaa aaacatatat tttttcatgt
1921 gtatgtgtgt atgtatatac accccatcag taactttacc acatcatttg agtgagttaa
1981 cagttctctt gtttgtgaga agctctaatt ttgaatggaa ccataactag gcttgttgcc
2041 tacttgctgt ataactcttg actggtccag tcttttgtcc ttcaaatagt agacagatct
2101 actgtgtatg tatgtgatgc tgtcaaatta tccttataca catcaatgta tctatattcc
2161 ctcactctgt ttctatggga attaaactga actgaaggtg ggaataaaca gtaaagcaac
2221 agaatacacg ggcaagccta agcatgactg tcttgntctt tgcacacaac attctttaaa
2281 gtaggtactg attccaagaa atatgcaatt actcaatgta tgtacaattt atgcttttat
2341 aattaaataa actatttaat gcaaggaatt cgatatcaag cttatcgata cc
//
LOCUS ECORAFRO 1289 bp ds-DNA BCT 28-JUN-1990
DEFINITION E.coli raffinose repressor (rafR) gene, complete cds, and
alpha-galactosidase (rafA) gene, 5' end.
ACCESSION M29849
KEYWORDS alpha-galactosidase; raffinose repressor.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1289)
AUTHORS Aslanidis,C. and Schmitt,R.
TITLE Regulatory elements of the raffinose operon: Nucleotide sequences
of operator and repressor genes
JOURNAL J. Bacteriol. 172, 2178-2180 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Schmitt, 13-NOV-1989.
FEATURES from to/span description
pept 166 1176 raffinose repressor
pept 1270 > 1289 alpha-galactosidase
binding 156 161 ribosome binding site
binding 1258 1263 ribosome binding site
signal 120 126 -35 region
signal 144 149 -10 region
signal 1212 1216 -35 region
signal 1235 1240 -10 region
site 1196 1213 raf operator
site 1217 1234 raf operator
BASE COUNT 317 a 283 c 326 g 363 t
ORIGIN
1 aagcttatta tgcttccatc ggaaacaatg atctggcagc ctgagttcac agataaaata
61 tctctccagg aaacccgggg ctgttcatca tgcaagtctg tcgattactg gctttggtga
121 cggaattttc tggatttccg gcttagaacc acagcaggag ataatatgtc acttaaagcg
181 attgccacga cactcggtat ttctgtcacc actgtcagtc gggctcttgg aggcttttca
241 gatgtggctg cttctacccg tgagcgcgtg gaagcggaag cacgtcgacg aggttaccgc
301 cctaatacac aggcaagaag actcaaaacc ggtaaaaccg atgctatcgg tctggtttat
361 cctgaaaatg atgtgccgtt taacagcggt gtttttatgg atatggtcag ttgcatcagc
421 agggaacttg cttatcatga tattgactta ctgctgatcg ctgatgatga gcatgcagac
481 tgccacagct atatgcggct tgttgaaagt cgcagaattg atgctcttat cattgcacat
541 actctggatg acgatccccg tatcacacat cttcataaag caggtattcc gtttctggct
601 cttggacggg taccgcaggg cttgccctgt gcgtggtttg actttgataa tcatgccgga
661 acctggcagg caacccagaa gctgattgct ttgggacata agagtattgc gctgttgagc
721 gagaacactt cacattctta tgttattgca agacgtcagg gatggcttga tgcactgcat
781 gagcatggac tgaaagatcc attgttgcgg ctggtttctc ccacgcgacg agcgggctat
841 ctggctgtga tggagttaat gtcattaccg gcgccaccaa cagctattat tactgacaat
901 gacctgagtg gagatggtgc ggctatggcg ctgcagttga gagggcgtct ttcagggaaa
961 gaagctgtat ctctggttgt atatgatggt ttgcctcagg acagcattat tgagctggat
1021 gtggctgctg ttattcagtc aacacgaagt ctcgttggtc gtcagatttc tgacatggtg
1081 tatcagataa tcaatggtgc atcaccagaa tcactgcaga taacctggac accgatattt
1141 taccctggta gcacggttca ttctccttcc ttctgatttt ttatccagat cacacaaccg
1201 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg
1261 agatgtctaa tgatttcaaa gtactgcag
//
LOCUS HSEDILSTD 351 bp ds-DNA VRL 28-JUN-1990
DEFINITION Equine herpesvirus type 1 subfragments from near the left terminus
of STD DNA.
ACCESSION M34145
KEYWORDS defective interfering particle.
SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone Sma2.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 351)
AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
O'Callaghan,D.J.
TITLE Identification of the site of recombination in the generation of
the genome of DI particles of Equine herpesvirus type 1
JOURNAL Virology 175, 448-455 (1990)
STANDARD simple staff_review
BASE COUNT 55 a 117 c 127 g 52 t
ORIGIN
1 cgccgctcac gtcacactcc tccccaaaga gggtattcgg ggggtcgctc ggcggctcca
61 ggaaccgctg ctcgtaccgg tcctcggagt tcagtgcccg cagcgtggcg cgcagcggga
121 agctctgctc gggcagcagc tgcaggaggg actgccagca ctcgagtgtg gggcgcgtgt
181 gcgtgtaccc catggcgtaa aaatccagca gcgccctgcg cagggtgcgc gagtcctcgg
241 tcacgtggat cgaggaggtg gatatccccc tgaacaagcg gttgacgtcc cccgcgaggc
301 ggttcacatc ggggcgccac ggcggggcgc agaacgcccc tggcccccgg g
//
LOCUS HSEDISA 633 bp ds-DNA VRL 28-JUN-1990
DEFINITION Equine herpesvirus type 1 defective interfering particle DNA.
ACCESSION M34143
KEYWORDS defective interfering particle.
SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone pHSII.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 633)
AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
O'Callaghan,D.J.
TITLE Identification of the site of recombination in the generation of
the genome of DI particles of Equine herpesvirus type 1
JOURNAL Virology 175, 448-455 (1990)
STANDARD simple staff_entry
BASE COUNT 124 a 184 c 202 g 123 t
ORIGIN
1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac
61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa
121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt
181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa
241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc acgtcacact
301 cctccccaaa gagggtattc ggggggtcgc tcggcggctc caggaaccgc tgctcgtacc
361 ggtcctcgga gttcagtgcc cgcagcgtgg cgcgcagcgg gaagctctgc tcgggcagca
421 gctgcaggag ggactgccag cactcgagtg tggggcgcgt gtgcgtgtac cccatggcgt
481 aaaaatccag cagcgccctg cgcagggtgc gcgagtcctc ggtcacgtgg atcgaggagg
541 tggatatccc cctgaacaag cggttgacgt cccccgcgag gcggttcaca tcggggcgcc
601 acggcggggc gcagaacgcc cctggccccc ggg
//
LOCUS HSEDISTD 290 bp ds-DNA VRL 28-JUN-1990
DEFINITION Equine herpesvirus type 1 subfragments from the inverted repeat
of STD DNA.
ACCESSION M34144
KEYWORDS defective interfering particle.
SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone xbaG..
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 290)
AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
O'Callaghan,D.J.
TITLE Identification of the site of recombination in the generation of
the genome of DI particles of Equine herpesvirus type 1
JOURNAL Virology 175, 448-455 (1990)
STANDARD simple staff_review
BASE COUNT 69 a 72 c 77 g 72 t
ORIGIN
1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac
61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa
121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt
181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa
241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc
//
LOCUS HUMAR 1230 bp ss-mRNA PRI 28-JUN-1990
DEFINITION Human amphiregulin (AR) mRNA, complete cds.
ACCESSION M30704
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SOURCE Human cell line MCF-7 breast carcinoma, cDNA to mRNA, clones
lambda-AR1 and lambda-AR2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1230)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept 210 968 amphiregulin precursor
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
sigp 210 266 amphiregulin signal peptide
matp 510 962 amphiregulin (alt.)
matp 528 962 amphiregulin (alt.)
mRNA 1 1230 amphiregulin mRNA
BASE COUNT 375 a 273 c 285 g 297 t
ORIGIN
1 agacgttcgc acacctgggt gccagcgccc cagaggtccc gggacagccc gaggcgccgc
61 gcccgccgcc ccgagctccc caagccttcg agagcggcgc acactcccgg tctccactcg
121 ctcttccaac acccgctcgt tttgcggcag ctcgtgtccc agagaccgag ttgccccaga
181 gaccgagacg ccgccgctgc gaaggaccaa tgagagcccc gctgctaccg ccggcgccgg
241 tggtgctgtc gctcttgata ctcggctcag gccattatgc tgctggattg gacctcaatg
301 acacctactc tgggaagcgt gaaccatttt ctggggacca cagtgctgat ggatttgagg
361 ttacctcaag aagtgagatg tcttcaggga gtgagatttc ccctgtgagt gaaatgcctt
421 ctagtagtga accgtcctcg ggagccgact atgactactc agaagagtat gataacgaac
481 cacaaatacc tggctatatt gtcgatgatt cagtcagagt tgaacaggta gttaagcccc
541 cccaaaacaa gacggaaagt gaaaatactt cagataaacc caaaagaaag aaaaagggag
601 gcaaaaatgg aaaaaataga agaaacagaa agaagaaaaa tccatgtaat gcagaatttc
661 aaaatttctg cattcacgga gaatgcaaat atatagagca cctggaagca gtaacatgca
721 aatgtcagca agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca
781 tgattgacag tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg
841 tgatcctcac agctgttgct gttattacag tccagcttag aagacaatac gtcaggaaat
901 atgaaggaga agctgaggaa cgaaagaaac ttcgacaaga gaatggaaat gtacatgcta
961 tagcataact gaagataaaa ttacaggata tcacattgga gtcactgcca agtcatagcc
1021 ataaatgatg agtcggtcct ctttccagtg gatcataaga caatggaccc tttttgttat
1081 gatggtttta aactttcaat tgtcactttt tatgctattt ctgtatataa aggtgcacga
1141 aggtaaaaag tattttttca agttgtaaat aatttattta atatttaatg gaagtgtatt
1201 tattttacag ctcattaaac ttttttaacc
//
LOCUS HUMAR1 968 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human amphiregulin (AR) gene, exon 1.
ACCESSION M30698
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 1 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 968)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept 859 + 919 amphiregulin precursor, exon 1
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
sigp 859 915 amphiregulin signal peptide
pre-msg 649 > 968 amphiregulin mRNA and introns
IVS 920 > 968 amphiregulin intron A
BASE COUNT 206 a 280 c 257 g 225 t
ORIGIN
1 gaattcatat ccacctggct ttgaacatta tcggctgtga gatggtgtag gtaaaatttt
61 aagtgcataa tttggcaata ataaatcatc aataaatatt aatgttgatg aggcccctgg
121 gccacataaa gaaataggga gtgaggggat ttgaaattct ggccacttca cagaaatggg
181 tgggaagggg ctcttgattg agatagaagc ccatcctaca tgaagcaatt cctcattgag
241 ttctctcgtc ctttatcctt gttggaaaca tcaggcaaag tcactcttgg tcttaaagta
301 cttttacatc taaatacgga actcttctat ttaatccctg tctgttgtag atgttaagta
361 tacaaagagg ttgtcagagt ttgaaacatc tggacttctg tcaggtacta gctccggaac
421 tccagtcctg ctcgccctca aaaacggctt gcagctagag gtttaagttc cacttcctct
481 cagcgaatcc ttacgcacga gggaggcggg gcgtgtgtcc tccgcgcgtg gttttcgggt
541 agcaccttct ggggcgccgc ctgcctccac ccacggccgg gccttgacgt catgggctgc
601 ggccccctcc cggctgagcc tataaagcgg caggtgcgcg ccgccctaca gacgttcgca
661 cacctgggtg ccagcgcccc agaggtcccg ggacagcccg aggcgccgcg cccgccgccc
721 cgagctcccc aagccttcga gagcggcgca cactcccggt ctccactcgc tcttccaaca
781 cccgctcgtt ttgcggcagc tcgtgtccca gagaccgagt tgccccagag accgagacgc
841 cgccgctgcg aaggaccaat gagagccccg ctgctaccgc cggcgccggt ggtgctgtcg
901 ctcttgatac tcggctcagg tgaggattca acggcgctga actgctgggc tctcctccca
961 tggcaggt
//
LOCUS HUMAR2 348 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human amphiregulin (AR) gene, exon 2.
ACCESSION M30699
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 2 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept + 51 + 299 amphiregulin precursor, exon 2
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
matp 290 + 299 amphiregulin
pre-msg < 1 > 348 amphiregulin mRNA and introns
IVS < 1 50 amphiregulin intron A
IVS 300 > 348 amphiregulin intron B
BASE COUNT 88 a 75 c 86 g 99 t
ORIGIN
1 agcaccctac tttacctttt cgttttcttc ctttattccc tcccctgcag gccattatgc
61 tgctggattg gacctcaatg acacctactc tgggaagcgt gaaccatttt ctggggacca
121 cagtgctgat ggatttgagg ttacctcaag aagtgagatg tcttcaggga gtgagatttc
181 ccctgtgagt gaaatgcctt ctagtagtga accgtcctcg ggagccgact atgactactc
241 agaagagtat gataacgaac cacaaatacc tggctatatt gtcgatgatt cagtcagagg
301 tgagtagggg ataaagcaaa aatatggcct gtgagatgtg ggtttata
//
LOCUS HUMAR3 302 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human amphiregulin (AR) gene, exon 3.
ACCESSION M30700
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 3 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 302)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept + 52 + 252 amphiregulin precursor, exon 3
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
matp + 52 + 252 amphiregulin
matp 60 + 252 amphiregulin (alt.)
pre-msg < 1 > 302 amphiregulin mRNA and introns
IVS < 1 51 amphiregulin intron B
IVS 253 > 302 amphiregulin intron C
BASE COUNT 125 a 46 c 54 g 77 t
ORIGIN
1 aattatattc aagtttgaga gactcttgtc aataaatctt ttctttttta gttgaacagg
61 tagttaagcc cccccaaaac aagacggaaa gtgaaaatac ttcagataaa cccaaaagaa
121 agaaaaaggg aggcaaaaat ggaaaaaata gaagaaacag aaagaagaaa aatccatgta
181 atgcagaatt tcaaaatttc tgcattcacg gagaatgcaa atatatagag cacctggaag
241 cagtaacatg cagtaagttt tcctaaagca tatagatttt tgtatttcta gcaccatgtc
301 tg
//
LOCUS HUMAR4 254 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human amphiregulin (AR) gene, exon 4.
ACCESSION M30701
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 4 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 254)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept + 51 + 204 amphiregulin precursor, exon 4
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
matp + 51 91 amphiregulin (both alt.)
pre-msg < 1 > 254 amphiregulin mRNA and introns
IVS < 1 50 amphiregulin intron C
IVS 205 > 254 amphiregulin intron D
BASE COUNT 81 a 42 c 53 g 78 t
ORIGIN
1 cacaccgcac gtgagtgtga ttataatttt taaatgtgaa ttgcttgcag aatgtcagca
61 agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca tgattgacag
121 tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg tgatcctcac
181 agctgttgct gttattacag tccagtaagt atgacataac ttacaaattc ttaataaaat
241 aatgggaggt taat
//
LOCUS HUMAR5 217 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human amphiregulin (AR) gene, exon 5.
ACCESSION M30702
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 5 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 217)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept + 50 143 amphiregulin precursor, exon 5
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
pre-msg < 1 > 217 amphiregulin mRNA and introns
IVS < 1 49 amphiregulin intron D
BASE COUNT 89 a 26 c 41 g 61 t
ORIGIN
1 tatagatgaa tagaaccttg ataacattag aatgccttgt tctctgaagg cttagaagac
61 aatacgtcag gaaatatgaa ggagaagctg aggaacgaaa gaaacttcga caagagaatg
121 gaaatgtaca tgctatagca taactgaaga taaaattaca ggtttgagtt ttaaaatata
181 tctttagatc atatcctata attttgaaaa atttaac
//
LOCUS HUMAR6 354 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human amphiregulin (AR) gene, 3' flanking region.
ACCESSION M30703
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 6 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pre-msg < 1 294 amphiregulon mRNA and intron
BASE COUNT 104 a 48 c 52 g 150 t
ORIGIN
1 gtaacatttt gttttatttt attattttat tttattttat tttctcacag gatatcacat
61 tggagtcact gccaagtcat agccataaat gatgagtcgg tcctctttcc agtggatcat
121 aagacaatgg accctttttg ttatgatggt tttaaacttt caattgtcac tttttatgct
181 atttctgtat ataaaggtgc acgaaggtaa aaagtatttt ttcaagttgt aaataattta
241 tttaatattt aatggaagtg tatttatttt acagctcatt aaactttttt aaccaaacaa
301 attgagagtt tgaatattag ttctgatatt gcaagactcc agtgtacttt tctc
//
LOCUS HUMROSSAA 1890 bp ss-mRNA PRI 28-JUN-1990
DEFINITION Human Ro ribonucleoprotein autoantigen (Ro/SS-A), complete cds.
ACCESSION M32294
KEYWORDS Ro ribonucleoprotein autoantigen.
SOURCE Human T-cell line Jurkat, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1890)
AUTHORS McCauliffe,D.P., Lux,F.A., Lieu,T.-S., Sanz,I., Hanke,J.,
Newkirk,M.M., Bachinski,L.L., Itoh,Y., Siciliano,M.J., Reichlin,M.,
Sontheimer,R.D. and Capra,J.D.
TITLE Molecular cloning, expression and Chromosome 19 localization of a
human Ro/SS-A auto antigen
JOURNAL J. Clin. Invest. 85, 1379-1391 (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
D.P.McCauliffe, 23-FEB-1990.
FEATURES from to/span description
pept 67 1320 Ro ribonucleoprotein autoantigen (Ro/SS-A)
precursor
sigp 67 117 Ro ribonucleoprotein autoantigen signal
peptide
matp 118 1317 Ro ribonucleoprotein autoantigen
site 1881 1886 poly-A signal (put.)
BASE COUNT 464 a 497 c 520 g 409 t
ORIGIN Chromosome 19.
1 ccgtactgca gagccgctgc cggagggtcg ttttaaaggg ccgcgttgcc gccccctcgg
61 cccgccatgc tgctatccgt gccgctgctg ctcggcctcc tcggcctggc cgtcgccgag
121 cccgccgtct acttcaagga gcagtttctg gacggagacg ggtggacttc ccgctggatc
181 gaatccaaac acaagtcaga ttttggcaaa ttcgttctca gttccggcaa gttctacggt
241 gacgaggaga aagataaagg tttgcagaca agccaggatg cacgctttta tgctctgtcg
301 gccagtttcg agcctttcag caacaaaggc cagacgctgg tggtgcagtt cacggtgaaa
361 catgagcaga acatcgactg tgggggcggc tatgtgaagc tgtttcctaa tagtttggac
421 cagacagaca tgcacggaga ctcagaatac aacatcatgt ttggtcccga catctgtggc
481 cctggcacca agaaggttca tgtcatcttc aactacaagg gcaagaacgt gctgatcaac
541 aaggacatcc gttgcaagga tgatgagttt acacacctgt acacactgat tgtgcggcca
601 gacaacacct atgaggtgaa gattgacaac agccaggtgg agtccggctc cttggaagac
661 gattgggact tcctgccacc caagaagata aaggatcctg atgcttcaaa accggaagac
721 tgggatgagc gggccaagat cgatgatccc acagactcca agcctgagga ctgggacaag
781 cccgagcata tccctgaccc tgatgctaag aagcccgagg actgggatga agagatggac
841 ggagagtggg aacccccagt gattcagaac cctgagtaca agggtgagtg gaagccccgg
901 cagatcgaca acccagatta caagggcact tggatccacc cagaaattga caaccccgag
961 tattctcccg atcccagtat ctatgcctat gataactttg gcgtgctggg cctggacctc
1021 tggcaggtca agtctggcac catctttgac aacttcctca tcaccaacga tgaggcatac
1081 gctgaggagt ttggcaacga gacgtggggc gtaacaaagg cagcagagaa acaaatgaag
1141 gacaaacagg acgaggagca gaggcttaag gaggaggaag aagacaagaa acgcaaagag
1201 gaggaggagg cagaggacaa ggaggatgat gaggacaaag atgaggatga ggaggatgag
1261 gaggacaagg aggaagatga ggaggaagat gtccccggcc aggccaagga cgagctgtag
1321 agaggcctgc ctccagggct ggactgaggc ctgagcgctc ctgccgcaga gcttgccgcg
1381 ccaaataatg tctctgtgag actcgagaac tttcattttt ttccaggctg gttcggattt
1441 ggggtggatt ttggttttgt tcccctcctc cactctcccc caccccctcc ccgccctttt
1501 tttttttttt tttaaactgg tattttatcc tttgattctc cttcagccct cacccctggt
1561 tctcatcttt cttgatcaac atcttttctt gcctctgtgc cccttctctc atctcttagc
1621 tcccctccaa cctggggggc agtggtgtgg agaagccaca ggcctgagat ttcatctgct
1681 ctccttcctg gagcccagag gagggcagca gaagggggtg gtgtctccaa ccccccagca
1741 ctgaggaaga acggggctct tctcatttca cccctccctt tctcccctgc ccccaggact
1801 gggccacttc tgggtggggc agtgggtccc agattggctc acactgagaa tgtaagaact
1861 acaaacaaaa tttctattaa attaaatttt
//
LOCUS PRD1XII 1701 bp ds-DNA PHG 28-JUN-1990
DEFINITION Bacteriophage PRD1 early protein genes XII and XIX, complete cds.
ACCESSION M30146
KEYWORDS early protein XII; early protein XIX.
SOURCE Bacteriophage PRD1 (Viradea strain) DNA.
ORGANISM Bacteriophage PRD1
Viridae; ds-DNA nonenveloped viruses; Tectiviridae.
REFERENCE 1 (bases 1 to 1701)
AUTHORS Gerendasy,D. and Ito,J.
TITLE Nucleotide sequence and transcription of the right early region of
bacteriophage PRD1
JOURNAL J. Bacteriol. 172, 1889-1898 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Gerendasy, 27-NOV-1989.
FEATURES from to/span description
pept 242 724 early protein XII
pept 797 955 early protein XIX (put.)
mRNA 179 732 early protein XII mRNA
mRNA 181 1100 early protein XIX mRNA
binding 230 233 ribosome binding site
binding 785 788 ribosome binding site
rpt 1 111 inverted terminal repeat
site 145 172 right early promoter
BASE COUNT 419 a 447 c 390 g 445 t
ORIGIN
1 gggggatacg tgcccctccc cacctacccg cgcccctaac atttttattt ccgtctgtca
61 ataccccctg catccgatag gcccgaacta tcacaaacgg aaaagcgata ggaataaaat
121 tttgatttat tttcaccata actattgcac gtcctaaaaa tgtgcgtata atttcaatca
181 tcgaaagggc aaacacaaaa cccccgccga taacttccac ttaactttaa aggtaactat
241 catggaaatc gtaagcaagc tgactctgaa aaccattggc gctcaaccca agccacatag
301 cgtaaaagaa aataccgcgc tggcttccat ctatggccgc gttcgcggta agaaagttgg
361 tcaatccacc tttggcgact tcatcaagtt tgaaggtgaa tttgaaggcg tgaatatcgc
421 cactggtgaa gtgttccgtt ccggtgcgct gattctgccg aaggtactgg aaagcctgct
481 tgccggtgcc gtggatggtg aaaacacggt tgattttgcg gttgaaattt gggccaagcc
541 ttccgaaaag ggcaacactg gttatgaata cggtgtcaag ccgctgattg aacccgccgc
601 atcggatgaa ctggccgcgc ttcgcaatca ggttaaggcc gcgctgcctg cccctgccgc
661 tgccggtgaa gccgctgccg aagccaagcc cgccgccaag gccaaggcca aggccgaagc
721 ctaaacagcg caccacggcc ccggctgata ccagcgcggg gccattcttt acccttcaaa
781 ttcaaaggta acaaacatgg aaaagcaaac cgaaaatacc cgccctgaat gcccgaaggc
841 attttatttt gtatccattc ccggcgactt tgggcaaacc ccttttgctt catcgcttat
901 gtatggtagc accgcgcttg ccgctgttat caagtcaagg gcgctatccg cgtagtaagc
961 gaacattttg atttgcggtt tgctgataac ggttttacgc ctgctggcgt tacgcaagct
1021 gaatggcttg gcaagctgat tactgaaact ttcggctttc gcttggaact gtttctttaa
1081 accagttttt tatttcttga atttgaaggc cgcatgattc ccttgcggcc ttttcttttg
1141 cgtatgcttc caccaaatcc cgccacgtta taaacggata aacaggcgcg gcgcattcag
1201 ttagcaggct tgacggcgga accaaataaa ccgtttcaat tttcggggct gttgtcgaac
1261 aagccgcgaa catcggcagg cataggacta tcaagaattt gtttagtttc atcgcttacc
1321 cttcccaatc cgctaattgt tcggctaagt ttgccgctga taattgtctg gccttcctga
1381 attgccgcga ttgctttcgc atcgcttgcg cgtactgatt gaagggtttc caccgccttg
1441 gcgtttgcgg tcgccttttc ttcaatggct ttaatttcat tttgaagccc cgttattttt
1501 ttagccgcta aaattgaata tccaaagtta gcaagcatta aaagcgcaac cgcaatagcg
1561 gccacgcttt tcaggttaga caaaacccat ttaatcattg aaggtacgcc ccggcttgtc
1621 attgtcggaa tctatcggcc ccggctgatt gataggggca atcttgcttt gcacaaattg
1681 cgagaaaacc gccttgacaa a
//
LOCUS PSEALALPQ 3141 bp ds-DNA BCT 28-JUN-1990
DEFINITION P.aeruginosa exopolysaccharide alginate regulatory protein (algP
and algQ) genes, complete cds.
ACCESSION M32077
KEYWORDS alginate regulatory protein P; alginate regulatory protein Q.
SOURCE P.aeruginosa (strain PAO, isolate PA02003) DNA, from patient
PA02003.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 3141)
AUTHORS Konyecsni,W.M. and Deretic,V.
TITLE DNA sequence and expression analysis of algP and algQ, components
of the multigene system transcriptionally regulating mucoidy in
Pseudomonas aeruginosa: algP contains multiple direct repeats
JOURNAL J. Bacteriol. 172, 2511-2520 (1990)
STANDARD full staff_review
COMMENT draft entry and computer-readable sequence [1] kindly submitted by
V.Deretic, 16-FEB-1990.
FEATURES from to/span description
pept 781 1263 alginate regulatory protein Q
pept 2079 3137 alginate regulatory protein P (put.)
mRNA 661 1310 algQ mRNA
mRNA 2054 3141 algP mRNA
BASE COUNT 607 a 1026 c 1009 g 499 t
ORIGIN 1 bp upstream of BamHI site; map position 9 min.
1 ggatccgcca tgcccctggc cagcccccgt cagcttttcc ttctcgcgtt cctggcctgc
61 gtcgccatca tgggcggggc gctgtacctg gaacatgtgg ttggcctgga ggcctgcccg
121 ctgtgcgtcg tgcagcggat cttcttcatc ctgatcggcc tgacctgcct tgctggcgcg
181 atccaggggc ccggcctgcg tgggcggcgt atctactccg tgctggtgtt cctgctcgct
241 ctcggcggcg gggccacggc cgcccgccag gtatggttgc agaccgttcc gctggaccaa
301 ctgccggcct gcctgcccag cctcgactac atgatgcagg cgcttccctt ccaggaagtg
361 atccgcctgg tcctgcatgg caccgcggat tgtgcccagg tgagctggac gctattcacc
421 ctgagcattc ccgaatggag cctgctggcg ttcgttgcct atctcggctt ctccatcgtg
481 cagttcctcc gacgtgcctg atgcgggctg tcgcgtggtc tttcgtcagc tctcgagcgt
541 cggccgtatc gatttggcga aaaccagagc ggatgcggga taccgatcaa acatccgtat
601 gaatttttct gtacgggcgt cttgatgggg tcatcccggg ggcataatct ggctttccat
661 agcgccggag tccgcccctt gcacaggcgt tctcccacag agcgcagaga gtgcgtcagc
721 aatgcgtctc ggagcggacg atgattctcc gggacgtccg aagagggaag cgaggacaac
781 atgctcgaaa gctgccgtaa tgcccaagag cggtgggggg gtgtgcacca actgatcgat
841 cgctggctgc atgagcgcca gcaactggtg caggccttcg atgccctgag cggtattcag
901 gcaccggcgc ccaacgccga agaactccag catttctgcc aattgctgct ggactatgta
961 tctgccggac acttcgaggt ctacgagcaa ctgacggcgg aaggcaaggc cttcggcgat
1021 cagcgcggcc tggagctggc caagcagatc ttcccccggc tggaagccat caccgaatcc
1081 gcgctgaact tcaacgaccg ctgcgacaac ggcgattgcc gtgaaggagc ctgcctcatc
1141 gcggagctga aggtcctgcg gcaacagttg cacgaacgct tcgagctgga ggactgcctc
1201 atcgaagtcc tgcacaacgc ccacagccag agcggcgcag agggttcggc agtaccggtc
1261 tgattcaccg gcgagtcgtt acgaaacggc gcgtctccat gcgggagagc gccgtttttg
1321 catttcagcg gaagcccagc aggtcgatct cgaagaccag cggagcgtcg ggcgggatca
1381 ggtcgccagc gccttcgtga ccataggctt gcgccgaagg aatcaccact cgccagcgcg
1441 cgccgacagg catcgctcgc aacgctgtcc gccatccctc tatcacactg tccagggcga
1501 accactcggc gctttcgctc tggtcgaaca cttgaccgtc ggcaagcagg ccgcggtaac
1561 ggacgtgtac ctgggttgcc gcgccgatgc cgttaccttg gccacgacgt aattcgctga
1621 cgaggacgcc gccggtcaat tcgcggacgc cgaagcgggc tttttcgttg gccaggaagc
1681 gggcctcggc gggtgtggtc cgcggtgttt ccgtcgtggc gttctcgtgt tgcagcagaa
1741 gttgttcgat gcgttcgggg gggatttcaa gtgcctcgcc gcgataggcc tggcgcagtc
1801 cgagaagtag ctcggatagc tctaggccgg gcatttcctg ctgtaggcgc atacccaggc
1861 gggcgcctac ggcataggcc agttcgtcct tgggaggcgc ggcgaaaagc ggcagcgcag
1921 tagaaaaaga ccgatgaaaa ggcggcgcgg catgcggttt ctccaaacga atcggcgggc
1981 aatcctggcg cacgtcgctt gtagcacatc gaacccgttg gcgagagggg gtgtgcgggt
2041 ctagtatggg cgcaaccacg tccgcctgga ggcacgtcat gtcggccaac aagaagcccg
2101 tcaccacccc cttgcacctg ttgcagcaac tttcccacag ccttgtcgag cacctggaag
2161 atgcgtgcaa acaagcgctg gtcgattcgg aaaagctcct ggccaaactt gaaaagcaac
2221 gtggcaaagc ccaggaaaag ctgcacaagg ctcgcaccaa gctgcaggat gctgccaagg
2281 ccggcaagac caaggcacag gccaaggcgc gcgagaccat cagcgacctg gaagaggcgt
2341 tggataccct gaaggcccgg caggcggaca cccgtaccta catcgtcggc ctcaagcgtg
2401 acgtacagga aagcctcaag ctggcgcagg gtgtcggcaa ggtgaaggaa gctgctggca
2461 aggctctgga gagccgcaag gcgaaacccg cgaccaaacc tgctgcgaag gcggcagcca
2521 agcctgcggt gaaaaccgta gcggcgaagc ctgcggccaa gccggctgcg aagcctgctg
2581 cgaaaccggc ggccaagcct gcgacgaaaa ccgcggcagg caagcccgca gccaagccga
2641 cggcgaagcc tgctgcgaaa ccggcggcca agcccgcggc gaaaaccgca gccgcgaagc
2701 ccgcagccaa gccggcggcg aagcctgtgg cgaaaccggc ggccaagcct gcggcgaaaa
2761 ccgcagccgc gaagcccgcc gccaagccgg cagcgaagcc tgtcgcgaaa ccgacggcca
2821 agcccgcggc gaaaaccgca gccgcgaagc ccgcagccaa gccagctgcg aagcctgcgg
2881 cgaaaccggc ggccaagcct gtggcgaaat ccgcggccgc gaagcctgca gccaagccgg
2941 ctgcgaagcc tgcggcgaaa ccggcggcca agcctgcggc gaaacccgta gccgcgaaac
3001 ctgccgcaac caagcccgcc accgctcctg ctgcgaagcc tgcggcgact cccagcgccc
3061 cggcagccgc ctccagcgct gcttcggcaa cgcctgccgc gggcagcaac ggcgccgccc
3121 cgaccagcgc ctcctaaggc g
//
LOCUS STYAROF 1235 bp ds-DNA BCT 28-JUN-1990
DEFINITION S.typhimurium tyrosine sensitive 3-deoxy-d-arabino-heptulosonate
7-phosphate synthase (aroF) gene, complete cds.
ACCESSION M31302
KEYWORDS 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase.
SOURCE S.typhimurium (strain LT2) DNA, clone pGM59.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1235)
AUTHORS Muday,G.K. and Herrmann,K.M.
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 165 to 1235)
AUTHORS Muday,G.K. and Herrmann,K.M.
TITLE Regulation of the Salmonella typhimurium aroF gene in Escherichia
coli
JOURNAL J. Bacteriol. 172, 2259-2266 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.M.Herrmann, 13-JAN-1990.
Author address: K.M.Herrmann
Purdue University
Biochemistry department
West Lafayette IN, 47907
(317) 494-1653
FEATURES from to/span description
pept 165 1235 3-deoxy-d-arabino-heptulosonate 7-phosphate
synthase (E.C. 4.1.2.15)
site 1 20 operator aroFo2
site 52 71 operator aroFo1
site 69 113 promoter
site 75 94 TyrR box-1
BASE COUNT 309 a 286 c 343 g 297 t
ORIGIN
1 gtgtaaagtt tttgatacga aatgagggag attaatcgac gattgcgcat ggtgtaaagt
61 ttattttacg aaatatggat tgaaatcttt actttttgtg gtatggttta aacatcctcg
121 tggaggatca actatcgcaa acaagaaaag acaggatcgc catcatgcaa aaagacgcgc
181 tgaataacgt acgtatcacc gatgaacagg tattaatgac gccggagcag cttaaagcgg
241 cctttccctt gagcctggcg caggaagccc agatagcgca gtcccgggga atcatttctg
301 acattattgc cgggcgcgat ccgcgtctct tggtggtatg cggtccttgt tctattcacg
361 atcctgaaac cgctctggaa tatgcccgtc gatttaaagc ccttgccgca gaggtcagcg
421 atagcctcta tctggtaatg cgcgtctatt ttgaaaagcc gcgaactacc gtcggctgga
481 aagggctgat taacgatcct cacatggatg gctcatttga tgtggaagcc gggttgaaaa
541 tagcgcgtca gctactggtg gaactggtga atatggggtt gccattggcg accgaagcgt
601 tggatccgaa cagcccgcaa tacctgggcg atctgtttag ctggtcggcg ataggggcgc
661 gcacaaccga atcgcaaacc caccgcgaaa tggcgtctgg tctttctatg ccggtcgcgt
721 ttaaaaacgg cacggatggc agcctggcga cagcgattaa cgccatgcgc gccgctgcgc
781 aacctcatcg ttttgttggc attaaccagg ccggtcaggt tgcgttattg caaacccagg
841 gaaatccgca tggccatgtg attctgcgtg gcggcaaagc gccgaactat agcccggcag
901 atgtcgctca gtgtgaaaaa gagatggaac aggcgggact acgtccttcg ctgatggtag
961 attgcagtca tggtaactcc aataaagatt atcgccgcca gccagccgtt gccgaatctg
1021 tggttgcgca gattaaagat ggcaatcgtt caatcattgg cttaatgatt gaaagtaata
1081 ttcatgaggg taaccagtct tccgaacagc cgcgcagcga aatgaagtat ggcgtttccg
1141 tcaccgatgc ttgtattagc tgggagatga ccgatgccct gttacgtgaa attcataaag
1201 atttgagcgg ccagctggcg gtgcgcgtcg cataa
//
LOCUS TRBANTAT 18598 bp ds-DNA INV 28-JUN-1990
DEFINITION T.brucei expression site variant specific antigen 4 - 7 genes,
complete cds.
ACCESSION M33720
KEYWORDS expression site variant specific antigen.
SOURCE T.brucei (strain EATRO 1125), cDNA to mRNA, clone AnTat 1.3A.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 18598)
AUTHORS Pays,E., Tebabi,P., Pays,A., Coquelet,H., Revelard,P.,
Salmon,D. and Steinert,M.
TITLE The genes and transcripts of an antigen gene expression site from
T.brucei
JOURNAL Cell 57, 835-845 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Pays, 20-JAN-1989.
FEATURES from to/span description
pept 1174 2196 expression site variant specific antigen 7
pept 3528 4733 expression site variant specific antigen 6
pept 8519 9772 expression site variant specific antigen 5
pept 14450 18157 expression site variant specific antigen 4
mRNA 986 2676 ESAG-7 mRNA
mRNA 3483 5056 ESAG-6 mRNA
mRNA 8260 11304 ESAG-5 mRNA
trns 5305 5808 rime retroposon
BASE COUNT 5298 a 3468 c 4707 g 5125 t
ORIGIN
1 gcatgcgaag agcaggggtg caacggaaga gtgaaaggat tagaggaaat agtggaatta
61 gcatgattgg agattgtgag ggttaggaaa aacgtgacga gggaaatggt aactggacat
121 cgtgcgcgag gatggcgaag gcttgaagcg tgtagagagc gtgtgtaaat ggcccaaacg
181 gtgtgggtgg ggtaataacg cagagtgctg aggtgtttga caagttcccg cgaggtgctg
241 gggaggagga agcaagacag taaaatcgtc gtgagacgtg gctgtgctcg ggaatccgta
301 caaatatact gaaaattatt gatggttgat atgcgtggga acgacgcagg atgatgctgc
361 atcgccaaga tgcttcggga gaacaggaaa ataacgacga atcaacactg gtgagttgat
421 actataaaaa tcgcgtgggc tttgctgctt cgctttcata aatgccacct gcaatgaatt
481 tggagggtgg aggtgagctg ccctgtgtat gggtgggtat tgcagatgca gcggcgcttt
541 tgggtgattt ttggtttaca gtttgctgtt gttgggaagg ttggtggtgg caacaaaatt
601 ggagaaaatg atttcatctg gaaacctact tcaaaacccc ataaacaatc ggacctcgaa
661 gtgcaagtgt ctcccagaac gtaaacaagg ccttgaagac gaccagcaga tgaagcacag
721 caacagcgac agaggtggac tgagacgaaa gtataccata tgactatgcc acaagaattg
781 atgggtgatt ctgccaacca cggacgcagt caaaaacata gaaggagctg tagtgaatat
841 aaaaccactc atattttcta caaatattag agctttatta cttaagaacc tgttttagca
901 ccaaggttgc gtgtgtattg ccgttgctgt tacttattgt tgagttggta tttgaggaaa
961 atgttttttc aggtgcgtcg ttgttatata cggctgtgag ttggtgatgt ttatttgaat
1021 gatagcttga taggataaga tgaaataata ctaagcatgt attcttctgg cgtgtcgccg
1081 aaatggtaat ctatgttacc catagcacga aaaaattttt ttactcagta acgaagcgga
1141 aagttttaaa aagcataatt taaaaaagtg ataatgatga agttttggtt tgtgctgttg
1201 gctcttttgg gaaaagaaac acatgcatat tatgaaaata aaaggaatgc attaaatgca
1261 accgccgcta ataaagtgtg tgggctatcg acctatctta aaggaatagc gcacagagta
1321 aacagcgaga gtgccgtggt tacggaaaaa ctatcagatt tgaaaatgag aagcatccag
1381 ttgcagttat cagtaatgcg gaacagagtg ccttctggcg agcaggattg taaagacatc
1441 aggacactct tgaaaacagt attgaggaat gagtttacat tccagcagga gttggaggaa
1501 atgaggaacg catccgcgtt agcagcagct gcagctggga tagcagctgg aagactggaa
1561 gagtggattt ttgtatttgc acaggcagca ggcgggtcat cacagttttg cataagtgtg
1621 gggaagcata tcccggctga gcacggcaac ttgcaagaat gttttgatgg aataattgga
1681 cctgaaaccc tttacaaaat tgaggattca cgtgtaaaag agtcggcgca gaaaagcttg
1741 caactccatg aagtgttatc atccatttcc ttcagtagct tgggtgcgga aagtattgtt
1801 gagcaaggag agaacagagg atgtaaccta atgcggacgg ctgacggagg tttgctgaaa
1861 gatatttgtt tgaattgcaa cttcacatgg ggtggtggag tgatgaattt cggatcctgt
1921 gtggcgggga acctaaaaat aaaaggggga gaatacggtg atgtcagttc ccacgatgtg
1981 gtacggtgga ccgaagatcc cagtaaagtg agcatattta aggatgtcat ccgcctgttt
2041 gcgcgcttcc aagaagcaaa aaatgcagtg atgaataaaa taaaaactac tgtggatgaa
2101 ttggcgaaat gcatagggca gaaagaggtt gagctcacta atgatcagct ttacgaggaa
2161 ttcgaggcaa tacaaaagta cctagggtct ctgtgaaatg gagtaaaagc gaattcaact
2221 atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg gcagcagtac
2281 atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga gtgctgtgaa
2341 ggaaggatgc gacagaagct gcgctgctta gcgtgaaaga ttatggtaat ggagggttgt
2401 gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg ggcaaaagta
2461 acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa ggaaagatgt
2521 gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg agatgtaagg
2581 ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa tttagttaca
2641 acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt ctactgttta
2701 atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc gaatttcact
2761 tctccagaat ctttatttta gtgcgctata gatgatagtc caaagagaag gaattacaat
2821 tattataacc catccaaagc tatgtcagag actcctcttc tccgagtgat tcttttctat
2881 gtcacttgtg gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg aaatggggcg
2941 tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt atgatgcata
3001 gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta ggtggaagcg
3061 gctaatgatt tgtcggcacc accaacagag gaagttaatt gttgacgtag ccgtttgtga
3121 atgagataga aggttattgg gtggaacatt atcttggaaa gagtttttct tccactaaaa
3181 atgtgtagag tggtggtttg gagaaaatgg tttagatatt gaggaggaaa caaggatttg
3241 aggtgcaagt tgatgcgctg taagatttgt attgccgttg ctgttactta ttgttgagtt
3301 ggtatttgag gaaaatgttt tttcaggtgt gtcgttgtta tatacggctg tgagttggtg
3361 atgtttattt gaatgatagc ttgataggat aagatgaaat aatactaagc atgtattctt
3421 ctggcgtgtc gccgaaatgg taatctatgt tacccatagc acgaaaaaat ttttttactc
3481 agtaacgaag cggaaagttt taaaaagcat aatttaaaaa agtgataatg atgaagtttt
3541 ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa aataaaagga
3601 atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgacctat cttaaaggaa
3661 tagcgcacag agtaaacagc gagagtgccg tggttacgga aaaactatca gatttgaaaa
3721 tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct ggcgagcagg
3781 attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt acattccagc
3841 aggagttgga ggaaatgagg aacgcatccg cgttagcagc agctgcagct gggatagcag
3901 ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg tcatcacagt
3961 tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa gaatgttttg
4021 atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta aaagagtcgg
4081 cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat agcttgggtg
4141 cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga acagatactg
4201 acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt ggtggagtga
4261 tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa tacggtgatg
4321 ctacagagta cggtgcagta cggtggaccg aagatcccag taaagtgagc atatttaagg
4381 atgtcatccg cctgtttgcg cgcttccaag aagcaaaaaa tgaagtgatg aataaaataa
4441 aaactactgt ggatgaattg gcgaaatgca tagggcagaa agaggttgag ctcactgatg
4501 atcagcttta cgaggaattt atctgggaga ccatacacag attggagctg tcaaagagag
4561 tgagtgaaca actctcttta ggtgaagaag aggagacgat cctgaaatct aactatactg
4621 cagaaccggt ccgtggacct ttcacggtag cggggtccaa cgcggcagca gtacatttga
4681 gtgtttctac cgctgcactt tgtttttcag ttttattgtt gggagtgctg tgaaggaagg
4741 atgcgacaga agctgcgctg cttagcgtga aagattatgg taatggaggg ttgtgaaaga
4801 ttgggggaac aaaaacctat ttcttttatt tgggggaaca aatgggcaaa agtaacgtaa
4861 gtttccagtg ggagtggtat gtgtgtgtgt atggggctgg ctaaggaaag atgtgagttc
4921 ggcatgtggt atgtacaagc tacgaaaacg tgtgaaacaa aacgagatgt aaggggaaaa
4981 tgtaacaacc aactatgtta aatttcagga gactattttt caaatttagt tacaacaaag
5041 taaatgtcaa ataatgccaa ctataaagga aaactagact aattctactg tttaatgtga
5101 cgagaattgt ggcgtgaaac ctagtaactg cttatcatgg cgtcgaattt cacttctcca
5161 gaatctttat tttagtgcgc tatagatgat agtccaaaga gaaggaatta caattattat
5221 aacccatcca aagctatgtc agagactcct cttctccgag tgattctttt ctatgtctct
5281 tgtggaaggt tgatattttt gctaccccct ggcgatgccg gccacctcaa cgtggtgcca
5341 gggtccagtg ccccgtatca ttgggggaag ccaagagcca gcagcgttcc tttcatgggg
5401 aacactgctg tgctccggct acggcatcat acagcacagg gatcagcagc gtcttgctgg
5461 gacaccgttt ttcatttgtc ggtccctggg cacgtgccag cgtgccatca gcagtatcat
5521 ccgcgctaag atgctgctgt ccggtgatgt ggacctccaa aaaaggattg ccatttggca
5581 tcttttggag agtccagggt gggaggcttc tcgccccatc tgctgtattc cgttcaactg
5641 cggagctaca acaaaaaatt atagagggtg tgttaggatg aatgaaaaag ggagactctg
5701 ccacagtcgc cagaccgata gcatctcagg gctctacggt gttggctgat ggccgcgcca
5761 gaggggggga aactctcacg aaggcacgaa gaaaattcaa aaaaaaattt ttgctaccac
5821 aggctgtaag gataacgctg gagatggagc cgactgtggt ttttcttgtg aatgcgtatc
5881 ttcgtaaccg gttgttaaga agggatcgga cgaaggaatg tgtctgagta ggaggtaatg
5941 tcgatgtttt ctttccattg gatgtagtaa taatctccaa aatttgttgg aaacaatata
6001 atattattgc aataaatact tgaaaagtta aattattgat ggattgactt gagtatcatg
6061 attccaatga acactctcat gatgcttaga atagcaaacg tatgaacaat caacgtagcg
6121 tcatgtagac atggacattt tttgtaatga tacgactgct attgcacatc tgaaaatttc
6181 tatgacggac cagtgccgtt aatcaccgta caattgaatt caggatgaaa aatgataaat
6241 cagtgctcta atgtatgatg cagctgtcaa ggcaatgttt ctgcacaaaa agagtttaac
6301 acactgccat ttgcatgcga ataaaacaat gagtatatga agcggaggat cgcatggcaa
6361 tgataaaatc gaacactgtg gctgcgagac gcagggccac ccggaagcac cgggtaacaa
6421 agcgagttca ggagtcgtcg aaagtggcaa aagtcagcac caaaccaagt tgctttttta
6481 aaaagatcag caaacgtatt tctcactgac ttatacagca ttcaaaggaa atgaacttag
6541 cagcaggaca agggtgcaaa ctaaggcatc gattctgtat ggctgcaaaa gtgaagacat
6601 caagcaaaca aaggtaactt atttaccctt ataggagcag tggaaaaagc agcggactcg
6661 tatggagata acctgacgaa aatactacca gttttagtca caggaagcga tatcctgtca
6721 aaatgccggc agccactcaa tggcggtcaa aacttgatta aacagttgaa aaaaacagcg
6781 gaatcaaaag gaaccttgcc cgtgctcacg gtggtagata aagcaacgaa aaatgtaaag
6841 gtagaattta acggtaacaa ttcgaacaaa accgtactat cgtgcaaaaa tatttcagca
6901 caaaatcccg acaaactttg cggcgacaac acagcagaca acatggccaa gtgcacgaaa
6961 gcttccaaag cagcttttgg cgctgatggt gtatatttca aaagaaaaaa gggcaaacct
7021 tgactcagca gctataataa agtcagctag aaccggcgaa acagcggaag cgctgagcaa
7081 acttaaaaca ctggacatta tcggtgattt tatattgcgt ttatttgtcg aaaagctcat
7141 ggacatgacc gtggatgagt acacgactca aagagagact aacgacaaaa aacgctgcag
7201 tggcaaaagt agcgtgcctg tacaggggac ggagaacttg ataataatgg taatattaat
7261 cttagaagaa tgttgtgagt atgtgtcact tttacagtga tagttgtgaa atattcggaa
7321 tgacagttgt aatggaggtg aaagaagctg ttaggttgaa tttggggaac catctatgta
7381 tacaaagaag cacaagggtt gtgatagctt atttatatga tggagaagcc atgcatggta
7441 aaaaaaaaca acaagtgata tgtcggtgtt tgtggctaat gtagcataaa gactaacagc
7501 tacagcagtg catcaaccga tggagagacg ctttcagaat tttcaaatgt aagaaagtca
7561 gtaaattaca aagaagtggt tgagttgata ctcagaaaag tgggactaac aaactgttga
7621 tgaatgaagg tgatgccata cgtaccaggt acacaggtca gcaatatgat gctcactgaa
7681 aaagagagat ggtgtgctac tggcgggatg tcgctggaaa agtcttagca agtgtagttg
7741 cagcaccgag cacacacact catgggagtt atcaaagtat gaaaacgcaa aggaagctga
7801 tgtgtgtgtg tgtgtttttg tgtgtgtttg caggaaacaa atgtatcagc ttcgacatct
7861 ttccgcaaga agtagaagtt tcagttgtgg tcactgacgc attacgtgga ttgttgggtt
7921 gaaacaacac aaatgtggca accatttagg atgttttggc gctttggcga cggttgtact
7981 gcagtttgta tgccccttca agaaacaagg aatggtgtga aagcgttgtt tttttttaag
8041 tcaagtatgt gcattcaaga aatcatatca cagtgttacg cagtgctttg tcgtcatggg
8101 aggaccttcc tgcattgaga gtgcatgtaa ggaaacggca acaggatgct atggctccga
8161 atttttttta agtatctgct tttcagacca cttgcgttga tcgacttcag ttatgttaca
8221 ctccgcacat cttttttgct atgttttttt ctactttagt tgcacttagt gtagccaaca
8281 ttaacttaaa gtacaaactc tatcctaagc aggattcggt gttatttcta ttcgacgatg
8341 tcattggtat cgagcgcacc cgttttggtg ttcatttccg ttgtaacact tctgtgttgt
8401 atcatcggct ctgggcaacg gtctagtagt ccctttacgg aaccggcact caaaccaggg
8461 ggcattaagg ttgctattca agaggcggct gttacgccat tgttaccagt acttgcgtat
8521 gagttcgaaa ggtttatgga gaatatcacc attcccgagc aaaagggtta accgtgtgag
8581 tgtagaagag acttattttc gtaacgtgac ggtcggaagc gcgacagtta aattcgaaga
8641 accaaataaa atcattttga agttttggaa cgtgagcgct actgtgccgt ttacgcggtt
8701 tgtataccac tcgttttggt gttacctgta tccttgcagt gggtcggcac aggcggaaat
8761 acgcaacggc agtgtggcac tgtggctcga cgtgtccgca ggaagggggg gactactaga
8821 catacatgta gggagttcgg aaattggcat gcgtgaccca ctgataactc ttatcggtga
8881 agggaaatca aagatgccta agtggcttgg cgggagagtt aaagatatgt atgataagga
8941 tgtgcttccc aagcttgggc atcatataat tactgcggtg aatcaaattt tagccaacaa
9001 agcaaaggag atatttcata tgtttccaat tgtctttgtc aacagttcta aaattgaata
9061 tggacaaatg agacttgaac tggtcgtgct tccagatgcc gcagacaaat taatgcttac
9121 ggagaatgtt tttttcccgc ggcacgcttt tccaaacttt cctgttgctg ttgtttcttc
9181 gtttacggct ttgaataata tgcttcgttt gatgattaaa agtggccact tggtggtgcg
9241 tgtgcccttt ccgctgaggt atgttatctc cagcaacgcg ctcagcgtca gttggaccgt
9301 cttttgctct ggatgtgcct ctgaggctac attcgaacta aaaacagccc cgtggttgaa
9361 gtccctaaat aagaaactgt ttacatttaa ctatcgggat gtcgaagttg ccgtggacct
9421 gctgccaagg ggtggtgccc ccatttcctt gtttagtatg ttaatgaatg tgtctgccca
9481 ggcggcgcat atcgctcttg ttgacggtgc cacccatgcg aatttggatt ctgttgatac
9541 taatgtgagt gtcacatcct cacgtatcga tggcctggac tcaagtacaa tgaacacaaa
9601 aattcgtgat ttgatcaact tgtcttggat caacctgaat gttacatata catttcctgc
9661 tcccttcgac ttgtgtacga aacacgtgaa tattacaagt gaatgctatg tagcgggatt
9721 taaccttgtg cgagctcttg gttcactcag tatactccct cacctccgat gaaggattgc
9781 gacattcctg gtcactttgt aaaacacact tccatgcgct tatgtggaag ggggtaacac
9841 tcaaacctat cttgctttcc cctgtttacg cacaacttca catacgcact ttctctttta
9901 cgagatgttt gcatgccgca acaaccaccc tgcgcaatcg acgagaagct atccgtgttt
9961 acgggtgatc cttcgaatta tggcggagga gtaaaaacct catgatatgt gaagtgctgt
10021 ggctgaagaa ggaaataaga taaaaagggt atttttcatt gacccgacca ttactggttt
10081 tgactttttg cccgaggcga gaaggcaaat ccattgttga ttggctcatg aatgtttccg
10141 tgactatacg cgccaagcct tggatatgtc gtgggctttc ggcatatgtt gataatataa
10201 cgactgccag ttggctgagt gacggctcaa agtgttagag gcgagtgtca atggaagata
10261 tatctattga tttggtaaat aatattggat tcgagattaa ctgttcaagt gtgctacaac
10321 accagatagc atgttcgtag tgtaggggtg gcaatagaag tgatggattt atcacgacta
10381 attgttctat tggcaacgct tgtgtctcgt gactgctatt ggttgattgc tgatgtttcg
10441 atttgagttg tttcgtggcg gcacgaaaag tatcatcatg tgaagtaggt taaggggaaa
10501 agaaacagat atataaagag cctttcgtgt tgctcccaag gaattgtata aatctctgag
10561 cgagatttgt gctcggtaat gggttcaaga gtgtgtatat acaaatacta tattaagagt
10621 ggtgattgag accaagtgga gggtcgaaag agatccactt aaagccatgg aaagtgtgcg
10681 gttgtagagg gtagaaaact aagaaaaggc aaatgataaa tagtattata ggaaatgcag
10741 agtgatagca agagagaggg agatggagta accaatggtt taaaaacaag catgacgatg
10801 ggcagtatta tacccataag gatcaaagag ggtcaaaaaa atacaaaatg ggttcagtgc
10861 acaagagatg tgaagatgct acaactgata cggagaagcc tagaatgtga aaattcagaa
10921 aataatacta tatgaaaata gtgagagtga caactcacaa tacaaaacaa actgatacgg
10981 gagaagatta tgctaaacat aaaagtgaaa aaagctagaa tataagctga taaaacacca
11041 taaactacat tgatttgagt ttcatgatat aaaacgtgga tgctctattc agcagaggat
11101 tctagtaaat tttttaatat atcttaaagt aatagtgaac gtagaataaa tttctgttcc
11161 aatacacaga cgagaaaaat aaagaattga acgggtagca aatttaagta acttgcgaca
11221 atatttttgc tgaagctaac ttctaaaaca aaaatgcaaa tacgtgttca aaaatatagt
11281 gaggaggata atcttttaat tacaaacgca tacatcatga taatttcaag gcattgtttg
11341 catatgtaaa cattgcttta ttaactgagt gcagttttct ggtaacctat ttcttttcat
11401 agttttcttt cataaccctc tctaatattt taaggtatat taggaaatga aatttgttac
11461 cttgtgaact ttactattac caattatatc acatttagtg tgtttaagtg caatatacct
11521 tcaagatgaa gaagtaagtg acatgacaaa tatgacacat aatggtacta agaaacccag
11581 agactttgtg gttgtcgttc aaactcatgc aacacaagga tggtgtaggc tcatattgag
11641 tgcgaaagta agccatgtgg aaattattac aattgaaatg ggaggtttgt acagtcatgt
11701 caaaaggcct aaatgggtta tggagttttt agagaaacag aatgcaagtg atgaggatat
11761 tttaatagct aaggatggat ctgatgttat agttggtgat ggaaataagt atgagagttc
11821 tgtggaatat tttgtgtaaa aaagcaccct acagtaagga caaattttat gaagaggata
11881 ttcagaaaaa ggatataggg taatgttatt gatttttgag atggccacaa aagtggagga
11941 aaagaaaagt ctaccaactc tactaaaatg gcctttaact caaaaagcat acttaaatgg
12001 aggtggctta attgccagag tttgggcatt tagaaggttt gaatatgcat ttggagagtt
12061 actaaaggga agtgaggagt ggtggtcaga gcggagtatt tacaaacctt tgttaatatg
12121 gagtgcaatt caggaagaag ctgttggaca gagatttgtt ttgaagagag gaatgattgg
12181 attggattac gaagagagat ttttttctat agaaagtagt ggggtaattg gagaagcccc
12241 attcattcat tttcctggac agcctattgc ttgggaggag aaagcgagac tgattgtgaa
12301 aaatcttagt tgatataaaa ggttgaaagg aagtgaattt aattcgaagg atatggggaa
12361 aagtagaaac ttatgtagta gatggagaga attgggaatt tcgatatgaa agaatttgtg
12421 gggacgcggt caaggagaat gacttgttta aggcaaagaa actgtaaggg atacaaaact
12481 aaagattttc ttgcggctga gtcagtagtc taagcaaatg tgggaccaac acgcatggcc
12541 aagatggccg cacaagtcta tgttgacatg aaaatcctgg ggcagcaaca aagagaagct
12601 ggaagacaaa cctcaggggt accctaacta ttcggctgaa atggtatgtc gtgacggcaa
12661 tatcgatatt ggtttcgggg gtggtaacac gatggacgca aaatttagaa gcttctatta
12721 cctctccata aaccatgatc tcaaagctag agaagcacaa aacaataaca caactagtga
12781 cactatgcgt tttttatgca cctgcacaga agaagtaacc gacgcttgga ccgaagatgc
12841 gaatacatat tcggcatctg cagtataaaa ggtaattcag ctgctagaag gaaaaaattc
12901 ctgaagaccc cgctgtgcca gagatatgta aacaactgtt tgaaatacat gcactcaata
12961 tgtcactatc agacaagacc tggcaggaca agttcataaa ggctcaacga agagccactg
13021 cagtggggtg acgaagccaa caaacataag accactaata cggagtggca cgaagtgtgg
13081 cagcgctggg tctaagccag gcaaagacta aaaaataacg aatactgagg gcgataaagt
13141 aaggaaaaat gccgcctcta aatggcaaag ttcaaaagcg gcacaggccc gagtcaagct
13201 aaactctttg ctgatccatg cggaagcctt gaacgctgag ctggaaaaaa agggcaaata
13261 atggctaacg gcaaaaaaca atagaagtgg acgttcttgc agccctctac ggaacgccca
13321 cggcaaaagc agcgtttggg agcagtggct tattgtcatc atcatttttt tgttgttttc
13381 gccactagtg gttttctttg tttgcgtatt tgatatctcc ttcgtcacta ttcttatctt
13441 tatcgtctgg aatatgggtg agatatcatt cccggtgtgt tttgcagtca tatggcgtaa
13501 catccacctg tgttgatatt ctttacagca tatcttacat gcttctcttt ccaacagtgc
13561 ttcctgtagt ggtgtgcttg gctccccttt actcttctcc ggtatgcact tatatctggt
13621 gagccaggtt ttgcttttta tagacctctt ggaagcagag acatttgtat gcttctttta
13681 ctttctcatg ttcaacttcg ctgcttattt tcctgcattt ccgtttgttg tgtcacttca
13741 gtttgctttt ctacgatgtt taaggcgtat atgttcctag tcaactagcc atacgtggca
13801 tactgctttg ggcaacattg ccacttgtag acttttttac cgaattatct tcttctcgct
13861 tccttcctct tttgacgctc tgtccttact actggagttt taggaggctg ttgtggcatg
13921 tgccttgtgc ttcttttatg accaacaagt gttattttta cagtcgtttt tgcactggtc
13981 gtgtgtggcg aaaaatacgc ccccgctcct ccaaccaatt agtggcaaag gggcggctcc
14041 gttgaggcag cactgcttgt cctcttaatg gtacattgtt gttttaccta tcgctggttc
14101 actccaccgc acgcggtgca taccttggga gttctgtact gttttggggg tggttttgtg
14161 cattgtaaca gccgcctctg acacaacaag cacagtttca cgcctgcgtt actggcattt
14221 tatgcctgtg gtggaacaac gtgtgtgttg acgtccgtaa atcttaatat ggcttgttgc
14281 gtgctaccgt ttatattttt cccccttcga tatgggcatc ccgttcttgt cgttacgtgg
14341 ttcccagttt cacctaacaa cccttgccct ctcttaccca gcagccgctc cctttcgttc
14401 gttccagtgt aacaaccact ttcacgcgta gactatcaaa atcactgcga tgaatatgct
14461 tcacttgagc gaccgcaatg cctcactcgc accgagtggc ggggaacatt cccttccgac
14521 gggaggagct gtgtgccgtg acgccatgga catacttccc gtgatactga gagctccggt
14581 cgcgttgtta cttctgctcg tcgtcctgcc gcaattgtca gtgggcgccg aggctaatgt
14641 gacagtcaag gttctctccg ccacgtggaa ctggtacatg ccgcggaagt atgtaaccgc
14701 gattaatgct ggcttcaacg cctcactgaa gtcacgaaac tggacagtcg ctggaagcgt
14761 caacgttcag gtagtttacc ccagcaacct tgacctgatg cccgaagatt tcattaaaaa
14821 gcagttggaa ttggaaactg accagaataa gattgttata gtgtatggac ccctcggtga
14881 caaaagtgta atgcactcga ttccacattt gatgaatcac agagtagtgg cctttgggtt
14941 gataactggg tccactttta tccgacaatg gaatccgtac ctttactttc tgcgtgccga
15001 ccccgctgct gaaacattgg ttctcattcg gtattcgcta tgccagttgc gcgtgctgcg
15061 cttgggcttc atgtaccttc agggtgttca ttatggcgac gaggagtatg cattgacagt
15121 gaacgtgatg tctcggatgg gttatgaact gcatggtgtg ttcaccgtga tgagtccaga
15181 cggtaaaccc gctcctgatg ctgagtttaa agaggtgttt gagcgcttcg ccactgccct
15241 tcctcaggcc attattgtct ttggtgcgcc agtggatgat accgcgaagt tcttgatgat
15301 gatggcagtg gatgagcgaa tcgctcggtc gtatattctt agtccttcat ctgtccagtt
15361 gtcactcata gaaatgtggc aactcgctct ggaagctgca ggggcctcat tcgcgcccgg
15421 acagctactt ttcacaggca ccaatccgct tgcgaaggac agccagtaca cagcaatcaa
15481 gcgttttcaa gaagttatga gtgaatacct aaaggctcac gtcggcgaaa ccaacatcac
15541 tgaagcagat tatttcctga cccacgatct tgagggagag ttgatggtgt atgggtggat
15601 ttctggggag gtgttatctc aggcacttag caaccttgaa tggctcaagg accgcgctac
15661 atttgtcagg tcgctatacg accaacggcg ctacgtcatc aatgacattg tgatcggtga
15721 ctatggcggt acgtgtgagg gggacgcggc caagcatggc gccacctgtg agtgtaacca
15781 aggcagcaag gcggtgtatg tgaaagagat gttggaaaat gggcagaaga cgtcggtgcg
15841 aagtggcttc acagttctga aagcttcgct gtgctatacg gattcttcgg agcttcacgg
15901 gccactggat ggactcgtcg ttttcatgaa agacgacgac atagcgtcga aagcagctgc
15961 gctgtggcag aaggggacct cccatcttgt cggcaagggt gatttgggat actcggacag
16021 gttctttctg cacgcattta atactaccat agctgaggca gcgaacgatc ttcggcgtga
16081 ccaaggtgaa agaatcgtaa ccgctgtgtt tggccccgtg accgaggcga tgttggacac
16141 gccgaatata acctttatcg atcctctgga acttaaacca cggctgaata agttcagaag
16201 gaatgtgatt catctctcgc ccaccctgga acaacaactt tacgtccttt catcgtacct
16261 cgcgggcgcc ggggtcggta atgttgatgc cgtgatctgc agcaacgaag cggatggaat
16321 tgcagatttt ctgcggagtt cactaactga attcgctgtc tcgctacgtt ccgctgtcat
16381 acgcgaagat ggggaagatg ttggtaaata ccttccgatg agtgggaccg tgtttgtcat
16441 tggcctttcc gttccagatg tgaaggaaat cgcgaggaaa cttgaagaac gcaacgattt
16501 gcgtgtgatt gttctcttcg gggaattttc attcctgtat gatctgtttg ccacggccct
16561 caataacacc gcaggcgccg cacgtcttgt gtttgccacc agcttgccac attggggtga
16621 tactgaaacc tcatcgaaga ctgctcaact gttccatgat gttgaaaaag actcacggtt
16681 atggacaccc ctatcagttc tggcattcgc aaccggtcga ttgatgcggg taatacttct
16741 tcatgtagaa gaaatgagtc cagaaacgtt ggtaaatttc ttctataccg attcttccat
16801 cgtctccgat gacatgcgct acggtgtatt cgacgacaca aaatgcgtcg acaccgcaaa
16861 taaactttcg aagaatggtt gtgcctcaaa ctatggtgca acgcagatat cggtgtggtc
16921 tatggcccgt gctctgaacg cttctattcc tccgcttaca aatcccatga caccgtcaat
16981 gactttcaga aactctaacg cgggaaggat ctccggggcg tcgctggtgg gcatcatcat
17041 cggtggtgcc cttgctttgt ttcttgtggt ggctctgggc gtggttccct actttttcct
17101 gcgcaacacc gtgataacaa tctgcaccaa agatgacaga cccgtaacgc taatatttac
17161 tgacattgag agcagtacgg cgttgtgggc tgcacaccct gaggtgatgc ctgatgccgt
17221 tgcgacacat caccgcttaa ttcgcacatt gatttccaag tatgaatgct acgaagtcaa
17281 gactgttgga gattctttta tgatagcgag taagagtcct ttcgctgccg tccaactcgc
17341 acaggaacta cagctgtgtt tcttgcacca tgactgggga acaaatgcga ttgatgaatc
17401 ctaccagcag tttgagcagc agcgtgcgga agatgacagt gattacacac cgccaactgc
17461 tcggttggac cctaaagtgt acagtcgttt gtggaatggc cttcgagtgc gtgttggaat
17521 ccacaccggg ctgtgcgaca tccgacgcga tgaagtaacg aagggatatg actattatgg
17581 acgtacatcc aatatggcag caaggacgga gagtgtagca aatggtggtc aggtgctgat
17641 gacgcatgcg gcgtacatgt cgctgtcagc tgaggagcgt cagcaaattg atgtcactgc
17701 acttggtgat gtgccgctac gtggtgtgcc aaagcctgtg gaaatgtacc ggttgaatgc
17761 cgtgcctggt cgtacctttt ccgttctccg tcttgagctt gagcttctta acgatgatga
17821 ggatcagact acaacttcct gtagtgacca cagttcttca cggacagatc tgagtgtagc
17881 tgcacaaact attgctgctt cccttcagtc gctcttgggt accttcaccc ctgctcagcg
17941 ccagaaggca ttaataccct tttgcgaacg gtggcgtgtt ccgcttcctc aaaaggtcgg
18001 gaacgtgtgg gacgatgacg gttgtcagga agttgtacgg cgtgttgctg caaaggtggg
18061 ccgtgttatg gattttggaa cgaggaagcc ctccagcagt gtgacatcgt ggaaaggggt
18121 ggaagtgtct tctcaggtgg aggagcggct gctataatgc ttgcttcatc ctcaagttcc
18181 tcctgcgttg atgggcattg tgttaccgta caattgattg acttggaaaa tgacagcgct
18241 accacctaat attaccctca tatcattgct gtatttgcgc tctgctattg aagtgaatga
18301 atgagttatt ttatatgatg tgtgcaatct tttctttata ttcaattcgt ataaagtgga
18361 cattataact ttccctcccc attcttgtgc ctgaaatgat cttctcaaac ggtaaatgct
18421 ttaaatacat ttgtacatga agatggtttt tgtttctgac atggctactc tccttttgag
18481 aactgtgttt gaagattgta cattttcttg tgtttgatgt gcctttgaat gtgagtatcc
18541 tcgccacttt tgataatctc cctcgtttaa ataatatatt tattgactga ataagctt
//
LOCUS BABUGU2A 680 bp ds-DNA PRI 28-JUN-1990
DEFINITION Baboon U2 small nuclear RNA gene, complete cds.
ACCESSION M33777
KEYWORDS U2 small nuclear RNA.
SOURCE Baboon (P.hamadryas) DNA, clone lambda-Pu2.
ORGANISM Papio hamadryas
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini;
Cercopithecidae; Cercopithecinae.
REFERENCE 1 (bases 1 to 680)
AUTHORS Matera,A.G., Weiner,A.M. and Schmid,C.W.
TITLE Structure and evolution of the U2 snRNA multigene family in
primates: Gene amplification under natural selection
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.G.Matera, 20-APR-1990.
Author address: A.G.Matera
Dept. of Chemistry
University of California
Davis, CA 95616
FEATURES from to/span description
uRNA 191 378 U2 small nuclear RNA
signal 99 114 upstream control element I
signal 136 157 upstream control element II
signal 379 420 3' processing box
BASE COUNT 148 a 159 c 224 g 149 t
ORIGIN
1 ggatccaagc acgggtgtaa gatttccctt ttcaaaggtg gagaataaga aaggagcccg
61 agtgtgtaac ggcgtccata gtggggtgga cgagacagag gggatggggc aaggagcgag
121 ggtggggctc tcaccgcgac ttgaatgtgg atgagagtgg gacggtgacg gcgggcgcga
181 aggcgagcgg atcgcttctc ggccttttgg ctaagatcaa gtgtagtatc tgttcttatc
241 agtttaatat ctgatacgtc ctctatccga ggacaatata ttaaatggat ttttggagca
301 gggagatgga ataggagctt gctccgtcca ctccacgcat cgacctggta ttgcagtacc
361 tccaggaacg gtgcaccccc tccggggata caaccgtttg ctaaaagtag agggaggtga
421 gagacgacaa caccggcggg gcggctcgca cgccaagcgt ctgtgactac gtcttgccgg
481 ctcaactgct tcgttggagt accacggggt tcctgagctg cgggcggcag acggtaggct
541 tcgggcacgc ttccatttac gctgtgactg cagcgctttg ggaaggctac gaccccccgc
601 agccgaccga ctcctttgga ggtcgttagg atctcagctt ggcagtcgag tggtggtgac
661 cttttaaagg aatgggatcc
//
LOCUS BOVLMCB 326 bp ds-DNA SYN 28-JUN-1990
DEFINITION Synthetic bovine liver microsomal lipase-solubilized cytochrome b-5
gene, complete cds.
ACCESSION J02905
KEYWORDS cytochrome b-5.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 326)
AUTHORS Funk,W.D., Lo,T.P., Mauk,M.R., Brayer,G.D., MacGillivray,R.T.A. and
Mauk,A.G.
TITLE Mutagenic, electrochemical, and crystallographic investigation of
the cytochrome b-5 oxidation-reduction equilibrium: Involvement of
asparagine-57, serine-64, and heme propionate-7
JOURNAL Biochemistry (1990) In press
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
G.Mauk, 16-MAY-1990.
FEATURES from to/span description
pept 27 311 microsomal cytochrome b-5
binding 16 19 ribosome binding site
BASE COUNT 106 a 82 c 77 g 61 t
ORIGIN
1 gtacggtacc gtaaaggaga acaacaatga gcaaagctgt caaatactac actctggaac
61 aaatccaaaa acacaacaac tcgaaatcga cgtggctgat cctcgactac aaagtatacg
121 acctgactaa attcctggaa gagcacccgg gaggcgaaga agtcctgcgc gaacaggccg
181 gcggtgacgc gactgaaaac tttgaggatg ttggccatag taccgacgct cgtgaactgt
241 cgaaaacgtt catcatcggt gagctgcacc cggacgatcg ttctaaaatc accaaaccgt
301 ccgaatctta atgagaattc gtctga
//
LOCUS DROEST6P 5424 bp ds-DNA INV 28-JUN-1990
DEFINITION D.melanogaster carboxylesterase 6 and P (Est-6 and Est-P) genes,
complete cds.
ACCESSION M33780 M33781
KEYWORDS carboxylesterase.
SOURCE D.melanogaster (strain Dm145) adult DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 929 to 4027)
AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and
Richmond,R.C.
TITLE Molecular analysis of duplicated esterase genes in Drosophila
melanogaster
JOURNAL Mol. Biol. Evol. 7, 9-28 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 5424)
AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and
Richmond,R.C.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.C.Richmond, 19-APR-1990.
FEATURES from to/span description
pept 1173 2559 carboxylesterase 6 (Est-6) precursor (EC
3.1.1.1), exon 1
2611 2858 carboxylesterase 6 precursor, exon 2
sigp 1173 1235 carboxylesterase 6 signal peptide
matp 1236 2559 carboxylesterase 6
2611 2855 carboxylesterase 6
pept 3052 4438 carboxylesterase P (Est-P) precursor (EC
3.1.1.1), exon 1
4495 4742 carboxylesterase P precursor, exon 2
sigp 3052 3111 carboxylesterase P signal peptide
matp 3112 4438 carboxylesterase P
4495 4739 carboxylesterase P
pre-msg 1132 2867 Est-6 mRNA and intron (alt.)
pre-msg 1132 2998 Est-6 mRNA and intron (alt.)
IVS 2560 2610 Est-6 intron A
IVS 4439 4494 Est-P intron A
signal 1104 1110 TATA box
BASE COUNT 1536 a 1117 c 1235 g 1536 t
ORIGIN Chromosome 3L 69A1-5.
1 aagcttgcta tatatctatc tgtatatacc tatttgctta gaccgaactt aatgattttc
61 cctttttcca aaacttacac aaatatatac ctaatacctt acactgaaaa aactgttcaa
121 ttcactatgt acttattagt agttcttata aaccttgtta ttactaatta cgtacgaaaa
181 cagaacggga aataaatatc taaaaattta aatagtaaac tgagctgatg tttctacaca
241 aggatgaaca caagtatcaa agtgctcata ggtacagcat ttctagaaca caattacacc
301 agtcttttgc tagcaataaa ttttcactca tctcccactt ttctctagaa ctatcctgaa
361 tttatttatt tgattctatt ttgtcgcaag cacactcaat tcttgatctt tttaataata
421 atcgtagcgg aaagctgtat ctgtacaggc aagaccaccg cccctcaaaa tgccaacgaa
481 ctacagtcgg attcaaaagc gtgcacctgc aagtgcaagt tgtatgatat cttgcaccat
541 tttatttcga tacacgtttt ggtacttcga tttaatatgg ctaattgtta ttcagaaaat
601 aaacacaaaa acttaggaca catgttcaaa aatagaaata tgtacatatt tgttgcaaac
661 cttaaatctt attaattatc tttatctata atcaaaaagc ctactcattt gttaaatggt
721 ttcgttttaa agtgcattat caaattcatc tggataaaca ttcaaacaaa aaattgcatg
781 catgtaaagt aattaatcaa gttgactcga agcgattgtg caatataagt ctcacctgaa
841 gtgggattgg caactgcttg ggatgcatgt cgaaaatcta tattaagccc agtcaaatat
901 tttaagcgta aagtaacaaa cctaatttgt atacggctat cgttttaatt cgcacacgcc
961 tatcaactgg atgatgttca cactagagtt actcccattt gaaagccggg cattggaaaa
1021 ctaatctcat gcgtgcccag atctcaattg agactggttg actggatgtt caggtggccg
1081 ggtggcgata agccgatcga tggaataaaa ggggccgcaa ttgccgcatc tcaagatagt
1141 tgcggtctga attcgccgga gtgaggagca acatgaacta cgtgggactg ggacttatca
1201 ttgtgctgag ctgcctttgg ctcggttcga acgcgagtga tacagatgac cctctgttgg
1261 tgcagctgcc ccagggcaag ctacgtggtc gcgataatgg aagctactac agctacgaat
1321 cgattcccta cgccgaaccg cccactggcg atctacgatt cgaggctcca gagccgtaca
1381 aacaaaagtg gtcggatata ttcgatgcca ccaaaacccc ggtggcgtgc ctgcagtggg
1441 atcagttcac gcctggggcc aacaaattgg taggagagga ggattgccta accgtcagcg
1501 tctacaagcc gaagaatagc aagaggaata gctttccggt ggtggcccac attcacggag
1561 gtgcctttat gttcggtgca gcatggcaaa atggacacga gaacgtgatg cgtgagggca
1621 aattcattct ggtgaagata agctatcgcc tggggccatt gggtttcgtg agcaccggcg
1681 atagggatct tcccggaaac tatggactga aagatcaacg gctggctctc aaatggatta
1741 agcagaatat agccagtttt ggtggagaac cgcagaacgt actgttggtt ggtcactccg
1801 ctggaggagc ttcggtccat ctgcagatgc ttcgtgaaga tttcggccag ctggccaggg
1861 cggcattctc gtttagtgga aatgctctag atccatgggt tatacagaag ggagcaagag
1921 gacgagcctt tgaactggga cgcaacgtgg gatgtgaatc ggctgaagac tcgaccagcc
1981 tgaagaaatg cctaaagtca aagccagcca gtgaattagt caccgccgtc cgtaaattcc
2041 ttatattttc ctatgtgccc tttgctccat ttagtcctgt attggagcca tcggatgctc
2101 cagacgccat tatcacccag gatcccaggg atgtcattaa gagcggaaag ttcggacagg
2161 ttccgtgggc tgtttcctat gtcacagagg atggtggcta caatgccgcc ttgcttttga
2221 aggaacggaa atctggaata gttatcgatg atctaaacga gcgttggctt gagttggcac
2281 catatttact attctaccgg gacacgaaga ccaaaaagga tatggacgac tactcgcgga
2341 aaattaagca ggagtatata ggcaatcaga gatttgacat cgaaagctat tcagaattgc
2401 agcggctatt cacggatatt ctcttcaaga atagcacgca ggagtcattg gatcttcatc
2461 gcaaatatgg aaagagtcct gcctacgctt atgtctatga caatccagcc gaaaaaggaa
2521 tcgcacaggt cctggccaat cgaaccgatt atgattttgg taaggaaatc gtacttttaa
2581 atggacttag ttaaatcatt acttttatag gaactgtaca cggtgacgac tactttttga
2641 tattcgaaaa tttcgtacga gatgtggaaa tgcgtccgga tgagcagata atttcgagaa
2701 attttatcaa tatgctggca gattttgctt cgagtgataa tggctctcta aaatatggtg
2761 aatgcgattt caaagataat gtaggtagtg agaaattcca attattagct atttatattg
2821 atggctgcca gaataggcag catgtggaat ttccgtaagt tacatgaata aaatcaaaaa
2881 tttttcgttc tgtgtaattt ttaattattt aattattctc aactggcttt aaatatcatt
2941 tgtacaaaac gtgtttgtcg ctttatattt tggtttttgt tgtttcttta taaagaataa
3001 taaacgtttg tttacccgca gtttgattag tactttgtgg agttccagat catgagtata
3061 ttcaaacggc tgttgtgcct gactttgctg tggatagcag ctttagaatc tgaagctgat
3121 cccttgattg ttgagataac aaatggaaaa atccgtggca aagataatgg gttgtactac
3181 agctacgaat cgattcccta tgccgagcat ccaactggtg ccctccgttt tgaagcacct
3241 cagccgtata gtcatcattg gactgatgtt ttcaatgcca cgcagtctcc agttgagtgc
3301 atgcagtgga atcagtttat aaacgaaaac aataagctga tgggtgatga ggattgctta
3361 acggtaagca tctataagcc aaagaaaccc aatcggagca gctttcctgt cgtagtactc
3421 ctgcatggag gtgctttcat gttcggtagt ggatccatat atggacacga ctccattatg
3481 cgtgagggaa ctttgcttgt ggtaaaaata agctatcgtc ttggaccatt gggttttgca
3541 agtaccggcg atagacactt gccgggaaac tatggtctaa aggatcaacg tctggcccta
3601 caatggatca agaagaacat tgctcacttt ggtggaatgc cagataatat tgtgctcatt
3661 ggtcactctg caggcggtgc ttcggctcat ttgcagctgt tgcacgagga tttcaaacat
3721 ttggccaaag gagcgatttc ggtgagcggc aatgcattgg atccttgggt catacagcag
3781 ggtggacgac gacgtgcatt tgaactgggt cgtattgtcg gttgtggaca cacaaatgtc
3841 tccgcagaac tcaaggactg cttgaagtct aagccggcta gcgatatagt ctctgctgtc
3901 cgaagcttcc ttgtgttttc ctatgtaccc ttcagtgctt ttggacctgt tgtggagccg
3961 tcagatgcac cagacgcctt tctaaccgag gacccaagag cagtgattaa gagcgggaag
4021 tttgcccaag tcccttgggc tgtgacgtac accactgagg acgggggata caacgctgct
4081 cagctgttgg aaagaaacaa attaactggc gagagttgga ttgacctact caatgatcga
4141 tggtttgatt gggcaccata cttgctcttc tatcgggacg ccaagaaaac catcaaagat
4201 atggatgatc tttcatttga tctcaggcag cagtatctag cagatcggcg attcagtgtg
4261 gaaagttatt ggaacgtgca gcgaatgttt actgatgttc ttttcaagaa tagcgtgcca
4321 agtgcaatag atcttcaccg aaagtatggc aaaagtccgg tttattcttt tgtctacgat
4381 aatcctaccg attccggagt gggtcaattg ctttccaatc gaacagatgt acattttggt
4441 gggtacatac ttgcgtttca aatacttaaa gctctagctt tcttattatt aaaggtactg
4501 tccacggaga tgactttttc ttgattttca atacagctgc ataccgtatc ggcattcgtc
4561 cggatgaaga agttatttca aaaaagttta taggtatgct ggaggatttc gcactcaacg
4621 ataagggaac attaacattt ggagaatgta atttccaaaa taatgtgaac agcaaggaat
4681 atcaagtgct gcgtatttca cgaaacgctt gtaaaaacga ggaatatgct cggtttccct
4741 aaggaataaa tgcatttaaa aaatgtaaac agtataataa tgtaattaaa gaacaaattt
4801 attacacctt tatgccgcct caactgtcag aggtttttca ataagttcgt cttcttcttc
4861 tgggtcgttt tcctcgccct ccacattccc atacatctca atcttgctgt cgaggaagcg
4921 aaaatccttg aggatatgca acttgttgtg aaaccagcaa tggaaatgaa tttaatggcg
4981 atgtttgcca aacagcagag tagcaggcag tgtgctccac gggaatattg gcttgcagta
5041 ctccatttag agtgtaggat tgcacatgat tgtgctcacc accgattaag acacaatcgt
5101 ccacaaagtc acatagatgc acgcgtccgg ggaagctaaa gacacatgtg ctttccatgg
5161 aacgcaggtg gaaaattgaa gctttgggtc caccaccgca gagcagccaa tcctcgttga
5221 cagccactgc tcctatccat ttgccccaat ccggacgtaa cagatttggg ttcttgtaag
5281 gctccagcat tgaagtgtgc tgctgctgct tggtgctcca cactcgcaca gtgccatcct
5341 cggcgccaga gaagatctgt ccgttggcat tgcccaccac actgtgcacg tagtctgtgt
5401 gtccgcggta ctcgcgctga attc
//
LOCUS ECOLASNII 1530 bp ds-DNA BCT 28-JUN-1990
DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds.
ACCESSION M34277
KEYWORDS L-asparaginase II.
SOURCE E.coli (strain K-12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1530)
AUTHORS Jennings,M.P. and Beacham,I.R.
TITLE Analysis of the Escherichia coli gene encoding L-asparaginase II,
ansB, and its regulation by cyclic AMP receptor and FNR proteins
JOURNAL J. Bacteriol. 172, 1491-1498 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 385 1431 L-asparaginase II (ansB) precursor
sigp 385 450 L-asparaginase II signal peptide
matp 451 1428 L-asparaginase II
mRNA 323 > 1431 ansB mRNA
BASE COUNT 409 a 381 c 382 g 358 t
ORIGIN
1 aaatgggcgc gaaagcggtg ctgaaaagcg gcggtaaccc attacagaat gtgctgggaa
61 gcctgggaag cctggggggg ctgcaatcct caatccaaac cgagtggaaa aagcaggaaa
121 aagatttcca gcagtttggc aaagatgttt gtagccgcgt tgtgactctg gaagatagcc
181 gcaaagccct ggtcgggaat ttaaaataat cctctatttt aagacggcat aatacttttt
241 tatgccgttt aattcttcgt tttgttacct gcctctaact ttgtagatct ccaaaatata
301 ttcacgttgt aaattgttta acgtcaaatt tcccatacag agctaaggga taatgcgtag
361 cgttcacgta actggaggaa tgaaatggag tttttcaaaa agacggcact tgccgcactg
421 gttatgggtt ttagtggtgc agcattggca ttacccaata tcaccatttt agcaaccggc
481 gggaccattg ccggtggtgg tgactccgca accaaatcta actacacagt gggtaaagtt
541 ggcgtagaaa atctggttaa tgcggtgccg caactaaaag acattgcgaa cgttaaaggc
601 gagcaggtag tgaatatcgg ctcccaggac atgaacgata atgtctggct gacactggcg
661 aaaaaaatta acaccgactg cgataagacc gacggcttcg tcattaccca cggtaccgac
721 acgatggaag aaactgctta cttcctcgac ctgacggtga aatgcgacaa accggtggtg
781 atggtcggcg caatgcgtcc gtccacgtct atgagcgcag acggtccatt caacctgtat
841 aacgcggtag tgaccgcagc tgataaagcc tccgccaacc gtggcgtgct ggtagtgatg
901 aatgacaccg tgcttgatgg ccgtgacgtc accaaaacca acaccaccga cgtagcgacc
961 ttcaagtctg ttaactacgg tcctctgggt tacattcaca acggtaagat tgactaccag
1021 cgtaccccgg cacgtaagca taccagcgac acgccattcg atgtctctaa gctgaatgaa
1081 ctgccgaaag tcggcattgt ttataactac gctaacgcat ccgatcttcc ggctaaagca
1141 ctggtagatg cgggctatga tggcatcgtt agcgctggtg tgggtaacgg caacctgtat
1201 aaatctgtgt tcgacacgct ggcgaccgcc gcgaaaaccg gtactgcagt cgtgcgttct
1261 tcccgcgtac cgacgggcgc taccactcag gatgccgaag tggatgatgc gaaatacggc
1321 ttcgtcgcct ctggcacgct gaacccgcaa aaagcgcgcg ttctgctgca actggctctg
1381 acgcaaacca aagatccgca gcagatccag cagatcttca atcagtacta atcgcctcgc
1441 cccggtatcg tgccggggct ttttcacttc agactcacgt ccattgccaa ttttaattac
1501 cctaatgata atcaccggaa taaattattt
//
LOCUS HUMPADPRP 2085 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human poly(ADP-ribose) polymerase (PADPRP) gene, exon 1.
ACCESSION M34337
KEYWORDS poly(ADP-ribose) polymerase.
SOURCE Human peripheral blood leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2085)
AUTHORS Ogura,T., Nyunoya,H., Takahashi-Masutani,M., Miwa,M.,
Sugimura,T. and Esumi,H.
TITLE Characterization of a putative promoter region of the human poly
(ADP-ribose) polymerase gene: Structural similarity to that of the
DNA polymerase beta gene
JOURNAL Biochem. Biophys. Res. Commun. 167, 701-710 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1943 / 2062 poly(ADP-ribose) polymerase
/hgml_locus_uid="LZ0154E" /nomgen="PPOL"
/map="1q41-q42"
pre-msg 1777 > 2085 PPOL mRNA and introns
IVS 2063 > 2085 PPOL intron A
BASE COUNT 495 a 535 c 550 g 505 t
ORIGIN
1 tttagggatg atatagttgt caacccagag atggcatgat catgcctttt gacttggtca
61 ttctctaagt aaaactttta tttgttccat catattttcc acttattctg tttaccttca
121 aaatatcttt tttttttttt tttgagacag ggtcacactg tcacccaggc tagagtccag
181 tggcactatc atggctcacc acagcctcaa ccttcagggc tcaggtgatc ctcccacttc
241 agcctcccga gtagatggga ctacaggcac ctgccaccac ccccagctaa tttttgtaga
301 gacaaggttt tgccatgttg tccaggctgg tcttgaactc ctgggctcaa gggatccggc
361 cacctcagcc tcccaaagtg ctaggattat aggcatgagc cactgtgccc agcctacctt
421 caacgtatct aactggttac taacttttag gattcggcct atgtctcaca accttcttgc
481 ttactcaaca tccttgtctc ttaagccact agcttcttct ctatggttaa cactttttat
541 gagttttatt catctgctta tttttcttat cctctatacc agaattgaat attttcaaat
601 aaagcacact catgttacaa tctttgaaat ggaaaaaaaa aatgcatagg attagaaaag
661 aaaccaattt taataaacta tattttgaag tatagttcta tattaaacaa caagatctag
721 gccaggtgca gtggctcatg cctgtaatcc cagcaatttg ggaagtcgag gtgggaggat
781 tgcttgaggc caggggttca agaccagcct gggcaacatg gagagattcc ccatctcttt
841 ctttacacac acacacacac acacacacaa aatatctgat agcaacaggt gcagtcatta
901 ccacaatttc gagtagtgat gagcttaata atatttcgag ttatcaccaa caactgtaaa
961 ctaacatgaa aacgtctgtg atgactattg cccacaaagt cacaggtact gctaatactc
1021 ctggtatttg tagtaaattc ataataaagg aaatgctagg tttcagttgg tattttgtcc
1081 cgacggtctg tggacggcag gttagaacgc ccgtccaagc caggagggtg gacctagcac
1141 tgcagggtcc acctcgggcc aatcaactat attcccgagg cgggggcctg cgcttcccgg
1201 acccagctgc cctcagggga gagaggacac acttaagagt ttggggccgg cgtggtagct
1261 catgcccctg atcccagcac ttcgggaggc tgaggcgtga agatcacttg tagcaggagt
1321 ttgagaccag tctagccaac ttggcgagac cctgtcccta aaaaaaattt ttttttaatt
1381 agccagttgt ggtgagcgcc tgtagtccca gctactcggg aggctgaggt gggaggatcg
1441 ctgggctcag gagttccaga ctgcagtgag ccatgatggc ggcactgcac tccagcgcgg
1501 tgagactcag tctcaaaaat aaaaggggga ggggttgggg gtaaaattag ttgtgaaatc
1561 aagtaagact tcctgggaca gaacaatcaa aggggtggcg ccgggtcctc caaagagcta
1621 ctagctcagc ccaagccccg cctcggcccc cagggcagcg gccgcagagc tccacccggc
1681 aggcgcccgg gaaactccgc cccccggccg gcagggcgcg cgcccgccgg ccccgccccg
1741 tggacgcggg ttccgtggcg ttcccgcggc caggcatcag caatctatca gggaacggcg
1801 gtggccggtg cggcgtgttc ggtgcgctct ggccgctcag ccgtggcggc tgggtgagcg
1861 cacgcgaggc ggcgaggcgg caagcgtgtg tttctaggtc gtggcgtcgg gcttccggag
1921 ctttggcggc agctagggga ggatggcgga gtcttcggat aagctctatc gagtcgagta
1981 cgccaagagc gggcgcgcct cttgcaagaa atgcagcgag agcatcccca aggactcgct
2041 ccggatggcc atcatggtgc aggtgcgggc cgctgtgcgg cgggg
//
LOCUS HUMREVERB1 295 bp ss-mRNA PRI 28-JUN-1990
DEFINITION Human Rev-ErbA-alpha mRNA, 5' end.
ACCESSION M34339
KEYWORDS .
SEGMENT 1 of 2
SOURCE Human fetal skeletal muscle, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 295)
AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W.
TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription
from the noncoding DNA strand of a thyroid hormone receptor gene
results in a related protein that does not bind thyroid hormone
JOURNAL DNA 9, 77-83 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 204 > 295 Rev-ErbA-alpha protein (put.)
BASE COUNT 58 a 107 c 74 g 56 t
ORIGIN
1 ccgttgcctc aacgtccaac ccttcctgca gggctgcagt ccggccaccc caagcccttg
61 ctgcagggtg cttcggatcc tgatcgtgag tcgcggggtc cactccccgc ccttagccag
121 tgcccagggg gcaacagcgg cgatcgcaac ctctagtttg agtcaaggtc cagtttgaat
181 gaccgctctc agctggtgaa gacatgacca ccctggactc caacaacaac acaggtggcg
241 tcatcaccta cattggctcc agtggctcct ccccaagccg caccagccct gaatc
//
LOCUS HUMREVERB2 285 bp ss-mRNA PRI 28-JUN-1990
DEFINITION Human Rev-ErbA-alpha mRNA, partial cds.
ACCESSION M34340
KEYWORDS .
SEGMENT 2 of 2
SOURCE Human fetal skeletal muscle, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 285)
AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W.
TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription
from the noncoding DNA strand of a thyroid hormone receptor gene
results in a related protein that does not bind thyroid hormone
JOURNAL DNA 9, 77-83 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 285 Rev-ErbA-alpha protein (put.) (AA at 1)
BASE COUNT 62 a 79 c 99 g 45 t
ORIGIN About 1.7 kb after segment 1.
1 ctcgtaaagg agagagaagt gcagagttcg attctgtaca agggggcagc ggcagaaggc
61 cggccgggcg ggtcactggg cgtccacccg gaaggacagc agcttctcgg aatgcatgtt
121 gttcagggtc cgcaggtccg gcagcttgag cagcagcttg gtgaagcggg aagtctccaa
181 gggccggttc ttcagcacca gagcccgaag agcccgcagc agcgtctcct ggagctgctc
241 caccgaagcg gaattctcca tgcccgagcg gtctgcagag acaag
//
LOCUS HUMTFEB 1526 bp ss-mRNA PRI 28-JUN-1990
DEFINITION Human TFEB protein mRNA, partial cds.
ACCESSION M33782
KEYWORDS .
SOURCE Human BJAB B cell line, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1526)
AUTHORS Carr,C.S. and Sharp,P.A.
TITLE A helix-loop-helix protein related to the immunoglobulin E box
binding proteins
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.S.Carr, 19-APR-1990.
FEATURES from to/span description
pept < 1 > 1526 TFEB protein (AA at 1)
BASE COUNT 324 a 495 c 493 g 214 t
ORIGIN
1 ggtggagtga cagccggagc ccgggcgccg gcgcggggct cggtgacagc ggaggcggcg
61 gcccgggcgg gacgagggag cggccgcggc gccgacggcg ggcgggaagg gcggacgggg
121 cgggcctccc tggtggcacg gggccggagc aggccgggag caggtggcgg gcacagcttg
181 cgggccaggc acccgaactt gcgacaagtt gccggagccg gggcgcgggc ggcggacaga
241 ttgaccttca gagcgaggga gccagcgccg gcagccacca tggcgtcacg catagggttg
301 cgcatgcagc tcatgcggga gcaggcgcag caggaggagc agcgggagcg catgcagcaa
361 caggctgtca tgcattacat gcagcagcag cagcagcagc aacagcagca gctcggaggg
421 ccgcccaccc cggccatcaa tacccccgtc cacttccagt cgccaccacc tgtgcctggg
481 gaggtgttga aggtgcagtc ctacctggag aatcccacat cctaccatct gcagcagtcg
541 cagcatcaga aggtgcggga gtacctgtcc gagacctatg ggaacaagtt tgctgctgcc
601 cacatcagcc cagcccaggc tctccgaaac ccccaccagc cgcctcccca ggggtgcgag
661 ctggacacgt gctgtcctcc tccgctggca acagtgctcc caataagccc catggccatg
721 ctgcacattg gctccaaccc tgagagggag ttggatgatg tcattgacaa cattatgcgt
781 ctgacgatgt ctcttggcta catcaatcct gaaatgcaga tgcccaacac gctacccctg
841 tccagcagcc acctgaatgt gtacagcagc gacccccagg tcacagcctc cctggtgggc
901 gtcaccagca gctcctgccc tgcggacctg acccagaagc gagagctcac agatgctgag
961 agcagggccc tggccaagga gcggcagaag aaagacaatc acaacttaat tgaaaggaga
1021 cgaaggttca acatcaatga ccgcatcaag gagttgggaa tgctgatccc caaggccaat
1081 gacctggacg tgcgctggaa caagggcacc atcctcaagg cctctgtgga ttacatccgg
1141 aggatgcaga aggacctgca aaagtccagg gagctggaga accactctcg ccgcctggag
1201 atgaccaaca agcagctctg gctccgtatc caggagctgg agatgcaggc tcgagtgcac
1261 ggcctcccta ccacctcccc gtccggcatg aacatggctg agctggccca gcaggtggtg
1321 aagcaggagc tgcctagcga agagggccca ggggaggccc tgatgctggg ggctgaggtc
1381 cctgaccctg agccactgcc agctctgccc ccgcaagccc cgctgcccct gcccacccag
1441 ccaccgtccc cattccatca cctggacttc agccacagcc tgagctttgg gggcagggag
1501 gacgagggtc ccccgggcta ccccga
//
LOCUS MUSFCERII 2237 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Mouse low affinity IgE receptor (Fc-epsilon-RII) mRNA, complete
cds.
ACCESSION M34163
KEYWORDS IgE receptor.
SOURCE Mouse (strain Balb/c) splenic B cell, cDNA to mRNA, clone p23.18.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2237)
AUTHORS Gollnick,S.O., Trounstine,M.L., Yamashita,L.C., Kehry,M.R. and
Moore,K.W.
TITLE Isolation, characterization, and expression of cDNA clones encoding
the mouse Fc receptor for IgE (Fc-epsilon-RII)
JOURNAL J. Immunol. 144, 1974-1982 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 175 1170 low affinity IgE receptor (Fc-epsilon-RII)
mRNA < 1 2237 Fc-epsilon-RII mRNA
BASE COUNT 592 a 622 c 533 g 490 t
ORIGIN
1 ggagaagact actgtcttca acacactagc ctgagctacc ttatccaagt gctccacata
61 ttccagaagg agaaggacag acttcaagtt caaatcactt ccagagctgt gagtgacaag
121 tgccttggca ggtagtgcac gcctcatcac tgaaaggatc caaacaagac tgccatggaa
181 gaaaatgaat actcaggata ctgggaacct cctagaaagc gttgctgctg tgcaagacgt
241 gggacacagc tcatgttggt ggggctgctg agcacagcaa tgtgggctgg cctgctggcc
301 ctgcttcttc tgtggcactg ggaaacggag aagaatctaa aacagctggg agacactgca
361 attcagaatg tctctcatgt taccaaggac ttacaaaaat tccagagtaa tcaattggcc
421 cagaagtccc aggttgttca gatgtcacaa aacttgcaag aactccaagc tgaacagaag
481 caaatgaaag ctcaggactc tcggctctcc cagaacctga ccggactcca ggaggatcta
541 aggaacgccc aatcccagaa ctcaaaactc tcccagaacc tgaacagact ccaagacgat
601 ctagtcaaca tcaaatccct gggcttgaat gagaagcgca cagcctccga ttctctagag
661 aaactccagg aagaggtggc aaagctgtgg atagagatac tgatttcaaa gggaactgca
721 tgcaacatat gtcccaagaa ctggctccat ttccaacaga agtgctacta ttttggcaag
781 ggctccaagc agtggatcca ggccaggttc gcctgcagtg acctgcaagg gcgactagtc
841 agcatccaca gccaaaagga acaggacttc ctgatgcaac acatcaacaa gaaggattcc
901 tggattggcc tccaggatct caatatggag ggagagtttg tatggtcgga cgggagccct
961 gtgggttata gcaactggaa tccaggggag cccaataacg ggggccaggg tgaggactgt
1021 gtgatgatgc ggggatccgg ccagtggaac gacgccttct gccgcagcta cttggatgca
1081 tgggtgtgtg agcagctggc aacatgtgag atatctgccc ccttagcctc tgtgactcca
1141 acaaggccca ccccaaaaag tgaaccctga caaacttctg ctcacactct tctggatttc
1201 tcctctacct ttatcgtgga aacagctggg ccctgaggat acccctatca gggcccaggg
1261 ctctctgtga ccgaaggctt tgattatgtt cccacccata ctgaagcagc tggtggatgc
1321 cagctcctgc cagctaccca gaaaccctct ccagctctcc agctaagctg gccatcccat
1381 tccatctgcc ttcctcaaac ctgggcccca gccttgctag ctccctgact acgggcatgc
1441 atgtgggcag ctgagccaac cagggagctg ctgagaacaa agatttcgaa ggcttctttt
1501 gcagtcccca cctcctatca agttccccac tttctccccc tcggcatcag agaacagggg
1561 ttccctttcc ccaggatctg ggatgagtcc tcccatcaag tttgcatcag tggtcccagg
1621 actccgaccc tccttggagg ctaccaggtg tgctcctggt gcgggaggta ttgaaggaac
1681 tctaaacagc tccagcaagg cgagcctggc tctgtctggt aggcctggcc cttctctccc
1741 attccttcta ccttactaaa agctgttaga gaacagtcct aaagctagcc cccaaggtct
1801 attcccttat ttggccactt cctcctcctg aggctgacta caaggtccag ctatccaagt
1861 actgaagtct aacatcaaaa gccccctttg tctcacctaa gtagcaatgc ccaatcaaaa
1921 tacaccatca catcatagcc cagtctaaca gaccgccctt tttctcttca taaaattaca
1981 cctgcaacca ggcgtagtgg tgcaggcctt tagtcccagc acttgggagg cagagacaag
2041 cgaatttctg agttcgaggc cagcctggtc tacaaagtga gttccaggac agccagggct
2101 acacagagaa accctgtctc gaagaaagaa aaaaaaaaaa aattacacct gcgaggtcac
2161 ttgggctgct gtttttctgc ctgagtcaga gggcagccac ttaacttttc ttccctgctt
2221 aataaaggat ctctgtg
//
LOCUS RATATM1 1585 bp ds-DNA ROD 28-JUN-1990
DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 1b.
ACCESSION M34137
KEYWORDS alpha-tropomyosin.
SEGMENT 1 of 2
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1585)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
COMMENT [1] reports isolation of exon 1b which is the first exon coding for
the alternatively expressed low-M-r alpha-tropomyosins.
FEATURES from to/span description
pept 1275 / 1406 low-M-r alpha-tropomyosin
BASE COUNT 360 a 408 c 449 g 368 t
ORIGIN
1 agaggtgttc cagctttgga agataagtga ggcaggctgt tttcttttct acctttgttt
61 ccattctcca cctcccacat ttaagcaagt tcaggactga ctctttgaca gaggtgtagg
121 attaaaaagc tcggagactt gacaggcttt aaaaaaaaca tagcggaaga aggaatgaca
181 gaggcgggtt cttacagact gtgtccatgt ggctttccag ctggtaacca gaggttctgt
241 cacaggtgtt tgtggtcttg cagacccggg ttcctcagat tccagatatc aggctgaatt
301 gcctgaagtt gcacagttcg ttatggggta cagacaaaat cttatggtag cgtttaatta
361 catagtttga gttcttaaat aagtgaggca tatgtaaatg ttttatatca tctgataact
421 tcaacagcct tgtgaagtgt cctgtactat tatactcgtt tttaagagag gaaatccctg
481 agaagtcatt tgtccaaagg cacacagcca gcagagactt aagagccaga ttgaaacctt
541 aaaaagatgt atttagaaca ggagatttaa ccactacgct ttgccagcta gtagtttaca
601 cttggtgaga tagtttgggt gttgggattc ttgtagtgag gatcttgtgt tctgtgttct
661 tcacaaggat ctttaggata ttcctatttc aggctccgag ttccttgttc agactctggg
721 aaatcgggag ttgccaatgc ctttagtatt cggaatgatg gcgttcttta caactcaaag
781 agtttaggga acgttcaagg tctaggttcc tacccttact cagcttcagg ggaatctcga
841 tttcagagcg ctccaaatgg ccagcaggag tcgccatagg ctttagcatc cagttcctct
901 acttggagtc acccaggctc cgcggaatgg atgcaaacgc cctcctgcgg tagcctcgct
961 ctaggcaaag cagcccagag ccagccattc ccgcgcgcac gcgcagtggt tccaagggta
1021 gccagttcct cgggtccacc ggccccgccc gcctgcccgc cgcgcccatc ccctgcgttc
1081 gcgcacccgc ccgcggctct ctagtgcggc cctggggccg tgacttccgg gctgcttctt
1141 agccgcaggg ggcgccggcg ttgcaaagcg ggcctgggcg ggacgaaccg gacctgggct
1201 ccgtaactgc aggagcccag cagtagcagc tgcagagcgc ggaggcagag cacagcctca
1261 gccgccgcct cgccatggcg ggtagcagct cgctggaggc ggtgcggaga aagatccgga
1321 gcctgcagga acaggcagat gccgcggagg agcgtgcggg cagcctgcag cgagagctgg
1381 accaggagcg gaagctgcgg gaaaccgtaa gggatccacc catcacccga ggcccccacg
1441 ccgcacctgg cgcacctagg ccaactggca cccggctact cagcctgccg gtctatctcc
1501 tcggtctaaa gtaaaacctc caagggggag caagagtgct tgctgttgag aaggttctgg
1561 gggcgagcat gaaatgggtc ccggg
//
LOCUS RATATM2 462 bp ds-DNA ROD 28-JUN-1990
DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 9c.
ACCESSION M34138
KEYWORDS alpha-tropomyosin.
SEGMENT 2 of 2
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 462)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
COMMENT [1] reports isolation of exon 9c which is the final exon coding for
the alternatively expressed brain alpha-tropomyosins TMBr-1 and
TMBr-3.
FEATURES from to/span description
pept / 375 448 alpha-tropomyosin
BASE COUNT 97 a 141 c 100 g 124 t
ORIGIN Unknown number of bp after segment 1.
1 ctcgcctaga atctctgctg agctcccaca aagcatgagt gtctggggct gccctccagc
61 ctggccttag tccccaaatc tgttgtccta aactacacat aagtgagtca ctgggtggct
121 ttaatgaggt gcagaaggag cattggttct gcacctcttc gcctctgctc tacaccaagg
181 aaagattctg gcccctcctt tattctgcca gacctccatg cgcctcaagg gcacagggca
241 gagggcctgg ggtcagggta ccttgccgct gtcctttgta ttgtgttctc agtctgttaa
301 ccgtttctca ctgactcttg ctaacctctt ccttctgcct cttcttttct gctaaccctt
361 gctgacctgc ccagatcaac tctaccatca actcgagcaa aaccgccgtc taactaatga
421 actaaagctg gccctgaatg aggattaaaa ccctgggcca ag
//
LOCUS RATPLPA1 1658 bp ds-DNA ROD 28-JUN-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exons 1 and 2.
ACCESSION M34108
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 1 of 5
SOURCE Rat liver DNA, clone lambda-rPLPg10-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1658)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1477 + 1577 parathyroid hormone-like peptide (PLP)
precursor, exon 2 (first expressed exon)
sigp 1477 + 1577 parathyroid hormone-like signal peptide
pre-msg 951 > 1658 PLP mRNA and introns
IVS 1259 1452 PLP intron A
IVS 1578 > 1658 PLP intron B
BASE COUNT 325 a 507 c 466 g 360 t
ORIGIN
1 gggaaccagt tacacacaca cacacacaca cacacacaca cacacacaca cacacacact
61 ggtaagaacc agtagcccac aacattctcc ccctctcccc acccacccac ccagcagcgc
121 tgagaagctg aaacgactgg cataaggtta ctcagtgaat ctatagcagg acatagagta
181 ggttatcaga ggaagagtgg ctaagttgga ttctgtgtgc tccaggttct ctgagatcac
241 aaagtgggtc ttcgatcctt gttcacacac cctttctagg atctgctcca gatgtgatgt
301 ccccagctgt tctcaaacag ttaagaccaa aaatcgcctt tgctctcctg ggtgctaccc
361 ttagtcccct ccccagggga aggccatgcc aggcaggaag gctggcttgc aacagtgcga
421 gcttgcaaac gagcttggga ctcttgggat tcgcaggcca cccggagttc ctccttgccg
481 ccggtgggga ccagggctcc tggacctggg ctcctggact caggcggcag caggctggtt
541 caggaggagg agagcgtcag atcggcccac gcggcgactg gggtggggct ccgtggctcc
601 cagctcccct gtgctccccg cccccgccgg cccgctcacc ccagtctgct ccgccggcgc
661 gcgttcctcg ggcgccacct ctttgcgact cgctcacttc tcagcaggtt ggccacagag
721 cgtgtgaacc ttccaaggct gggctcacag ctcctctcca acctgcgccg cccagctggg
781 cgtccgcccc gccgccggaa ccgcctcgcg ccacctggcc ctcgcatcca cgacacgcgc
841 gcctgcaact tgttcaaggg cgttgtggaa tcaactttcc ggaagcaacc agcccaccgg
901 aggaggtaga ccgacagcta tgtatatata cgctctctgc cgcaagtggc tttgcaccca
961 gaggcctggt gcgcaggaca gctgactcct gcgggcacac tcgcgtttga agaggggttc
1021 gccctgcccc gcgacccaga gcgctgccgc caagactaat tagacattgc tatgggagcc
1081 acagcaacgc gtcacgcatc cccgacgcct atgtaaaacg gttcggtctt cgctctttca
1141 gaggaaactc tctgattgct ttttcccttc tcgggtccct tttttttttt tttttttttt
1201 ttttttttgc ctgtgcggtt tgagagaggc acagttggag gcgctgattc ctacaaaagt
1261 aagtgctcag gggctcggaa acactttttt gctttctttt tcttgttttc ttttttcctt
1321 ataacttgtg agatgccctc gacattggag tggcttgttg ggaagccacg gggttccaaa
1381 agagggagag gggcagtgag aggaagagat agatggagag ccaggactga ccctcgggtt
1441 ccactcttgc aggtcccaga gccagcgagc ggcacgatgc tgcggaggct ggttcagcag
1501 tggagcgtcc tggtgttcct gctcagctac tccgtgccct cccgcgggcg ctcggtggag
1561 gggcttggtc gcaggctgta agtaccggat cttcctcccg ggtgagggat ggaggagacc
1621 agcaccagag gtgccgaact gaggcgggca atgcggaa
//
LOCUS RATPLPA2 144 bp ds-DNA ROD 28-JUN-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B
(partial).
ACCESSION M34109
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 2 of 5
SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 144)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
IVS < 1 > 144 parathyroid hormone-like peptide intron B
BASE COUNT 42 a 26 c 21 g 55 t
ORIGIN 3 kb after segment 1.
1 gcagaacact atgttttgac tgaagtccaa gttgcctctg ataaactttt gcccaatttt
61 tcttttaatt tttttacagt aaccaaagac aaggataaag tataaatcca cttatttttc
121 catgctggca ctttggtttg taat
//
LOCUS RATPLPA3 145 bp ds-DNA ROD 28-JUN-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B
(partial).
ACCESSION M34110
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 3 of 5
SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 145)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
IVS < 1 > 145 parathyroid hormone-like peptide intron B
BASE COUNT 22 a 34 c 34 g 55 t
ORIGIN 640 bp after segment 2.
1 agtctccata gggtcaatcc actgtcatgt gcttcttatg ctgtgctgtt tcttgtcccc
61 ttgctgtgac actcacttct gcctttcttg gacttgagaa cacgttgggg acttttgaga
121 atcggagtcc tgttttgtgg acttt
//
LOCUS RATPLPA4 920 bp ds-DNA ROD 28-JUN-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 3.
ACCESSION M34111
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 4 of 5
SOURCE Rat liver DNA, clone lambda-rPLPg9-0.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 920)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 94 + 516 parathyroid hormone-like peptide (PLP)
precursor, exon 3
sigp + 94 100 parathyroid hormone-like signal peptide
matp 101 + 516 parathyroid hormone-like peptide
IVS < 1 93 PLP intron B
IVS 517 > 920 PLP intron C
BASE COUNT 198 a 353 c 166 g 203 t
ORIGIN 1.3 kb after segment 3.
1 tccctactct agaggtcaac tgttcaccca tctcattcct tcaagtccaa cacgggcagg
61 tcctcagctg acaactgtct tctcccctga cagaaaacgc gcggtgtctg agcaccagct
121 actgcatgac aagggcaagt ccatccaaga cttgcgccgc cgtttcttcc tccaccatct
181 gattgcggag atccacacag ctgaaatcag agctacctcg gaggtgtccc ctaactccaa
241 acctgctccc aacaccaaaa accaccctgt gcggtttggg tcagacgacg agggcagata
301 cctaactcag gaaaccaaca aggtggagac gtacaaagag cagccactca agacgcccgg
361 gaagaagaag aaaggcaagc cggggaaacg cagagaacag gagaaaaaaa agcgaaggac
421 tcggtctgcc tggccaggca caactgggag tggcctgctt gaggaccccc agccccacac
481 ctccccgacc tcgacctccc tggagcccag ctcaaggtat tgtctgtccg ctgtgctcag
541 tagggctccc ctccctccac cggcctgggt cccagagatg cagaatcctc tgtatcttga
601 tctgagacta catcagttag gccacaatct cagtccattc ctggttagcc tgctagacaa
661 tttaatttcc ctacttccct cccaccagcc caatcaacca tccttcctcc ctccctccct
721 ccttccctcc ctcactccct ccctccctcc atccctccct ccctaccatc ctccctccct
781 tccttctccc tttcctttcc atcccctccc ctcttctccc tttcctttcc atcccctccc
841 ctcttctcct cccctctctc tgattcctgt cttcctcact cctccccaca catagggtct
901 tcctcaagcc aaggctgact
//
LOCUS RATPLPA5 855 bp ds-DNA ROD 28-JUN-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 4.
ACCESSION M34112
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 5 of 5
SOURCE Rat liver DNA, clone lambda-rPLPg9-0.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 855)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 21 30 parathyroid hormone-like peptide (PLP)
precursor, exon 4
matp + 21 27 parathyroid hormone-like peptide
pre-msg < 1 449 PLP mRNA and introns
IVS < 1 20 PLP intron C
BASE COUNT 280 a 139 c 157 g 279 t
ORIGIN 3.6 kb after segment 4.
1 ccccttcctt gtcttcacag gacgcattga aactttcacc gaagatcttc caagacacgt
61 tacaggattc tgtaatagta aacatatgga aagtattaga catatttatt gcctgtacat
121 actgtaaatg cattgggatc aaactgtctc cccaggaaac tgcacatggg tcatgttacc
181 ataatttatt ttgtcaactg atgtatttat ttgtaaatgt atcttggtgc tgctgactct
241 gtttttttgt aacataatgc actttaggta tacatatcaa gtatgtcgat gaatttaaca
301 cgaaaaagtg tctctatttt gtggttcatt ttgatgagtt ctgaaatata attatctaga
361 ctgatctccc tctgcgcatg taaaaatggc actattttaa atttgtaaag aatgtctaat
421 aaaatataat ctaattacac cgtgacttac aatgttggtt ttaattcttt aaggtttcta
481 atcagagaaa cacagtacaa ttttttttta aattctactt gaataaaatc ctaagcttga
541 tgtgtttgtg tggggatcta ctccaatgtg tccgcattag cttttagtta aaaaccactt
601 aaaatgtatt ttgccaatat ttcagctcaa gtcaatagaa tggctgaaca gcctgaactt
661 caagttgaaa agaaagacaa gaaagggcaa aaaataagca gagaagtcag gagaaagggt
721 caaagagggt tgtccagagt taaaaccaga gaagtgggtc cagacttgaa gactctgaat
781 aaattttgcc aatgtaaagt ttagtttgca agggcttgct ctcgattcat aacaataata
841 aaagatctgc tcggg
//
LOCUS RATTMBR1A 1050 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Rat brain alpha-tropomyosin (TMBr-1) mRNA, complete cds.
ACCESSION M34135
KEYWORDS alpha-tropomyosin.
SOURCE Rat brain, cDNA to mRNA, clone pOk4.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1050)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 41 886 brain alpha-tropomyosin (TMBr-1)
mRNA < 1 1050 TMBr-1
BASE COUNT 336 a 220 c 306 g 188 t
ORIGIN
1 ctctggctcc tctgcctaag ggcccctcgc caccgccacc atggacgcca tcaagaagaa
61 gatgcagatg ctgaagctcg acaaagagaa cgccttggat cgagcagagc aggcggaggc
121 tgacaagaag gctgcggaag accggagcaa gcagctggaa gatgagctgg tgtcactgca
181 aaagaaactc aaggccactg aagatgaact ggacaaatac tccgaggctc tcaaagatgc
241 ccaggagaaa ctggagctgg cggagaaaaa ggccacagat gctgaagctg acgtagcatc
301 tctgaacaga cgcatccagc tggttgagga ggagttggat cgcgctcagg agcgtctggc
361 cacagctcta cagaagctgg aggaggctga gaaggctgca gatgagagtg agagaggcat
421 gaaagtcatt gaaagccgag cccaaaaaga tgaagaaaag atggagattc aggagatcca
481 gctgaaagag gccaagcaca ttgctgaaga tgctgaccga aagtatgaag aggtggcccg
541 taagctggtc atcatcgaga gcgatctgga gcgtgcggag gagagggctg agctctcgga
601 aggcaaatgt gccgagcttg aagaagagtt gaaaacggtg acgaacaact tgaagtcact
661 ggaggctcag gctgagaagt actctcagaa agaagacaag tatgaagagg agatcaaggt
721 tctctctgac aagctgaagg aggctgagac ccgggctgag tttgcagaga gatcagtaac
781 caaattggag aaaagcattg atgacttaga agatcaactc taccatcaac tcgagcaaaa
841 ccgccgtcta actaatgaac taaagctggc cctgaatgag gattaaaacc ctgggccaag
901 ttctaggaac ggagtccatg ttcagaaaat ctatgactct cggggttaaa aaatgataac
961 ttctagagaa atgggaagga aatgcattgg ctggaaggcc agccttgcct atgttttcct
1021 ctgtatatct taattaaatc ttcaataacc
//
LOCUS RATTMBR2A 1004 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Rat brain alpha-tropomyosin (TMBr-2) mRNA, complete cds.
ACCESSION M34134
KEYWORDS alpha-tropomyosin.
SOURCE Rat brain, cDNA to mRNA, clone pOk15.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1004)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 137 892 brain alpha-tropomyosin (TMBr-2)
mRNA < 1 1004 TMBr-2
BASE COUNT 265 a 242 c 321 g 176 t
ORIGIN
1 cttagccgca gggggcgccg gcgttgcaaa gcgggcctgg gcgggacgaa ccggacctgg
61 ctccgtaact gcaggagccc agcagtagca gctgcagagc gcggaggcag agcacagcct
121 cagccgccgc ctcgccatgg cgggtagcag ctcgctggag gcggtgcgga gaaagatccg
181 gagcctgcag gaacaggcag atgccgcgga ggagcgtgcg ggcagcctgc agcgagagct
241 ggaccaggag cggaagctgc gggaaaccgc tgaagctgac gtagcatctc tgaacagacg
301 catccagctg gttgaggagg agttggatcg cgctcaggag cgtctggcca cagctctaca
361 gaagctggag gaggctgaga aggctgcaga tgagagtgag agaggcatga aagtcattga
421 aagccgagcc caaaaagatg aagaaaagat ggagattcag gagatccagc tgaaagaggc
481 caagcacatt gctgaagatg ctgaccgaaa gtatgaagag gtggcccgta agctggtcat
541 catcgagagc gatctggagc gtgcggagga gagggctgag ctctcggaag gcaaatgtgc
601 cgagcttgaa gaagagttga aaacggtgac gaacaacttg aagtcactgg aggctcaggc
661 tgagaagtac tctcagaaag aagacaagta tgaagaggag atcaaggttc tctctgacaa
721 gctgaaggag gctgagaccc gggctgagtt tgcagagaga tcagtaacca aattggagaa
781 aagcattgat gacttagaag ataagtttct ttgcttctct cctcccaaga ctccttcatc
841 aagccggatg tcccacctct ctgagctctg catctgtctg ctctccagct gacccaggtt
901 tctttctagt gcccacccac cctagggcca ggcacagacc gtgctttcta ttgtacagag
961 gtgatcctcc cagtgtaaaa taaacactgt acgctagtcc tgtt
//
LOCUS RATTMBR3A 2488 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Rat brain alpha-tropomyosin (TMBr-3) mRNA, 3' end.
ACCESSION M34136
KEYWORDS alpha-tropomyosin.
SOURCE Rat brain, cDNA to mRNA, clone pOk10.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2488)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 604 brain alpha-tropomyosin (TMBr-3)
mRNA < 1 2488 TMBr-3
BASE COUNT 713 a 531 c 579 g 665 t
ORIGIN
1 tgaagctgac gtagcatctc tgaacagacg catccagctg gttgaggagg agttggatcg
61 cgctcaggag cgtctggcca cagctctaca gaagctggag gaggctgaga aggctgcaga
121 tgagagtgag agaggcatga aagtcattga aagccgagcc caaaaagatg aagaaaagat
181 ggagattcag gagatccagc tgaaagaggc caagcacatt gctgaagatg ctgaccgaaa
241 gtatgaagag gtggcccgta agctggtcat catcgagagc gatctggagc gtgcggagga
301 gagggctgag ctctcggaag gcaaatgtgc cgagcttgaa gaagagttga aaacggtgac
361 gaacaacttg aagtcactgg aggctcaggc tgagaagtac tctcagaaag aagacaagta
421 tgaagaggag atcaaggttc tctctgacaa gctgaaggag gctgagaccc gggctgagtt
481 tgcagagaga tcagtaacca aattggagaa aagcattgat gacttagaag atcaactcta
541 ccatcaactc gagcaaaacc gccgtctaac taatgaacta aagctggccc tgaatgagga
601 ttaaaaccct gggccaagtt ctaggaacgg agtccatgtt cagaaaatct atgactctcg
661 gggttaaaaa atgataactt ctagagaaat gggaaggaaa tgcattggct ggaaggccag
721 ccttgcctat gttttcctct gtatatctta attaaatctt caataaccaa gaagttttta
781 tgatacatta tgtatatggg aaaatattta taccaatcaa ttcagctctt cctgcatcag
841 caggccctct ttgaagtgat gataatgagg actcccccac caccaccatc cccccaaaaa
901 aaggaaaaaa ttttctattt cctctaaaca gctgtgagtc ctgctttaaa tatctcttct
961 acatttgtat tcctaacatc ctcctaacat gtatagtgtt tgaaatgtga tccaacgtgt
1021 acctatgctt gggcaaaata gcttttgaaa acaggagctc gtaccacaag tccctggttg
1081 aaaggtatgc tttgttaagt ttaacggtgt tggaatctga ttatcgtgtc atgctgttaa
1141 gcgggatact ataaaaagac taagagaatg tccctaatga agtgtgcatg agatgtgttc
1201 atataaccat ttgcgagccg tagaaataaa tctatttaaa aagtggtcag aaccttcctt
1261 acgtcatctt cgttttctgt gaggtcgtgt tctgacccac ctacccgatc tttaactgcc
1321 tgtaactcag aatcctgtgc tggtattaac gtcttcttcg gtctgtgtaa caatgctgta
1381 cctgtatttg gtttttgacc ctcccttgtt ccccgtagca cgcagtgggt gagccttcac
1441 caaacacacc ctgcacgtta cgtcagggtg ggttgcaggt gtaggtgcac ccggggcctt
1501 gtgtagatgt gagtgacttc caaagcatgt gtagtgccac tcatctgtgt gtgactggtg
1561 tttgcttact aaatgtacaa accactaccg agtgtgaagc tcagccattc atgactcaat
1621 tttctgatcc catcacagag aaagtggcac atgccaaaga agaaaacctt agtatgcacc
1681 agatgctgga ccagacttta ctggagctaa acaacatgtg aaaccctcct tagctgcggc
1741 cacattctct cgtttccccc tgttgtttta ttttgtcttt aaacacatgc ttaccatgaa
1801 accccttcaa tgcttttttt ttttatatat atatatacct ttaccactgt cactgcaaca
1861 tctgcagaga gccagctagg gcagggggtg gggaaagaca cacagaaagc aggtgtactt
1921 gttatttaaa tgtgccattt cccaggttga cattgccaca cttcacagag tttaggcatg
1981 cagtttgctt agccagtgta ggaagcctca cgaaaacaaa aggattaaca gcctaagtgc
2041 caatggtggg gtcatcagga agactgatgt tggagacaat caggtgcaga ttggtgctaa
2101 tttaaacaaa agaacccctg gtgtcttttg ttcagtctaa tacaatttag acttctgtcc
2161 aacactaatt tatttttgtc ttgagtagca ctgcgagagg atatgggttc catgtgcccg
2221 tgttcctcac agccaagggt ttgaaagcca cgctggtctt gtaagattcc cattccttcc
2281 cagctgacat gcttgtggtc ccttccaact cagtggggag agggctcagt gctgttcgcc
2341 cccccccccc ccaattctcc atgtcagggt aagagatcta ggcactacat atgtaattgg
2401 taaagaaaag gcattttctc gagagttata actatgtaaa cattgtataa tgatatgaaa
2461 taaaatgcac attgtaggac attttcta
//
LOCUS RSPNIFHD 1375 bp ds-DNA BCT 28-JUN-1990
DEFINITION R.rubrum dinitrogenase reductase (nifH) gene, complete cds; and
dinitrogenase alpha subunit (nifD') gene, 5' end.
ACCESSION M33774
KEYWORDS dinitrogenase alpha subunit; dinitrogenase reductase.
SOURCE R.rubrum DNA.
ORGANISM Rhodospirillum rubrum
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 1375)
AUTHORS Lehman,L.J., Fitzmaurice,W.P. and Roberts,G.P.
TITLE The cloning and functional characterization of the nifH gene of
Rhodospirillum rubrum
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.J.Lehman, 18-APR-1990.
FEATURES from to/span description
pept 367 1254 dinitrogenase reductase (nifH)
pept 1370 > 1375 dinitrogenase alpha subunit (nifD)
binding 355 358 ribosome binding site
binding 1357 1361 ribosome binding site
signal 145 160 upstream activating sequence
signal 175 190 upstream activating sequence
signal 280 296 ntr promoter
BASE COUNT 287 a 404 c 431 g 253 t
ORIGIN
1 ggggatccca aggagggacc ctccgtcctg accaagaact gtgcgttcgt gttttcgccc
61 attcctttat caccgatcgg gcggcgacgg gaagggcgtt ctgggcagta gcgcaccatc
121 gcgcatagcg tcgcgccgcc ccgctgttgg gtccgcgaca gggttttggg gcgttgtcgg
181 gtttgcgaca cggttgaaaa taaccctaca ttccttgtta gatagattta ttcctttgat
241 taaaaacgat aattttcaaa aaaatcgggc aaatcaggtt tggcacgctg ggtgcattaa
301 cggggtgagc gtggcggtgg agcttccccg ccttaggacc ggatcacgaa gaaaaggatt
361 gattccatga gcgcactgcg tcagatcgcg ttctacggca agggcggtat cggcaagtcg
421 acgacgtccc agaacacgct ggccgctctg gtcgaaatgg gccagcggat cctgattgtc
481 ggctgcgatc ccaaggccga ttccacccgc ctgatcctga acaccaagct gcaggacacc
541 gttcttcatc tggccgccga ggccggctcg gtcgaagacc tcgacgtcgc cgatgtggtg
601 aagatcggct acaagggcat caagtgcacc gaatccggtg gtccggagcc cggcgtcggc
661 tgtgccggcc gtggcgtgat caccgccatc aacttccttg aggaaaacgg cgcctacgac
721 gacctcgatt acgtctctta cgacgtgctg ggcgacgtgg tttgcggtgg cttcgccatg
781 ccgatccgcg agaacaaggc tcaggaaatc tacatcgtca tgtcgggcga gatgatggcg
841 ctgtatgccg ccaacaacat cgccaagggc attttgaaat acgcccacac cggtggggtt
901 cgtctgggcg gtctgatctg caacgagcgt cagactgata aggaagtcga actggccgag
961 gccctggccg gccgtttggg ctgccgcctc atccacttcg tgccgcgcga caacggcgtt
1021 cagcacgccg agctgcgccg ccaaacggtg atccaatacg cccccgacag caagcaggcc
1081 ggtgagtatc gcacgctggc gaccaagatc cacaacaact ccggccaggg cgtggtgccg
1141 accccgatca ccatggaaga cctggaagag atgctgatgg aattcggcat catgaagtcc
1201 gacgaagagg cgctggccga gctcgaagcc aaggaatccg ccgccgccaa ttaaggccgt
1261 cgccgtgcga cgttcccccc ggccggccgg tcaggcgcag gggcgacgga agcgggcgac
1321 ctccaagaag gcctcttcgc gaggcgatag tcgatcagag gagagggata tgagc
//
LOCUS YSCMTFDEL 70 bp ds-DNA ORG 28-JUN-1990
DEFINITION S.cerevisiae mitochondrial novel junction sequence.
ACCESSION M34150
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 70)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
FEATURES from to/span description
site 33 44 novel junction site
BASE COUNT 11 a 25 c 19 g 15 t
ORIGIN
1 cttcggagtt cggccccccc ataagggggg gacctcactc cttccccact gcactggatg
61 cgggacttat
//
LOCUS YSCMTGAP12 457 bp ds-DNA ORG 28-JUN-1990
DEFINITION S.cerevisiae mitochondrial gap 12 sequence.
ACCESSION M34149 X15185
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 457)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
COMMENT The gap 12 sequence is a segment of yeast mitochondrial DNA that
had not been determined previously.
BASE COUNT 161 a 57 c 57 g 182 t
ORIGIN
1 tacaatttat aatttaataa agaaggaaat aaataataat aactcctttt ggggttccgg
61 tggggttcac acctttataa ataataaata aagatgttta ctcctcttcg gggttcggtc
121 ccctttttgg gttccggaac taattaatat tttatataat aataataata tattaatata
181 atttcattat taataaatat ctcctgcggg gttcggtccc ccccgtaagg ggggggtcct
241 cactccttcg gagcgtacta ttattataaa taattatata ttataatata attaaaaagt
301 attataattg aaacgaaaat tgtaatttta aatggaataa taattattat atatttaata
361 tatttaataa agttataata tctctttcta ccggactatt ttattttatt ttattttatt
421 tttataaaga aaaatagtat aatattatct tctcctc
//
LOCUS YSCMTORI1A 70 bp ds-DNA ORG 28-JUN-1990
DEFINITION S.cerevisiae mitochondrial ori1 gene, 5' flank.
ACCESSION M34153
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 70)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
BASE COUNT 19 a 11 c 24 g 16 t
ORIGIN
1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag
61 ggggtgggtg
//
LOCUS YSCMTORI2A 70 bp ds-DNA ORG 28-JUN-1990
DEFINITION S.cerevisiae mitochondrial ori2 gene, 5' flank.
ACCESSION M34152
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 70)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 32 > 70 ori2 mRNA (+/- 4 bp)
BASE COUNT 23 a 0 c 18 g 29 t
ORIGIN
1 tttttttata agataatttt tgtaaatata taagtaataa attaagtttt atagggggag
61 ggggtgggtg
//
LOCUS YSCMTORI7A 70 bp ds-DNA ORG 28-JUN-1990
DEFINITION S.cerevisiae mitochondrial ori7 gene, 5' flank.
ACCESSION M34154
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 70)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
BASE COUNT 19 a 11 c 24 g 16 t
ORIGIN
1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag
61 ggggtgggtg
//
LOCUS CFXCSMA 859 bp ds-DNA BCT 28-JUN-1990
DEFINITION C.aurantiacus 5.7 kd chlorosomal protein (cmsA) gene, complete cds.
ACCESSION M33964
KEYWORDS chlorosomal protein.
SOURCE C.aurantiacus (strain J-10-fl) DNA.
ORGANISM Chloroflexus aurantiacus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Multicellular filamentous green bacteria; Chloroflexaceae.
REFERENCE 1 (bases 107 to 554)
AUTHORS Theroux,S.J., Redlinger,T.E., Fuller,R.C. and Robinson,S.J.
TITLE Gene encoding a 5.7 kd chlorosomal protein in Chloroflexus
aurantiacus: Regulated message levels and a predicted carboxy
terminal protein extension
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 859)
AUTHORS Theroux,S.J., Redlinger,T.E., Fuller,R.C. and Robinson,S.J.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.J.Robinson, 01-MAY-1990.
Author address: S.J.Robinson
University of Massachusetts
108 Morrill Science Center
Amherst, MA 01003
FEATURES from to/span description
pept 210 452 5.7 kd chlorosomal protein (cmsA)
mRNA 135 > 452 cmsA mRNA
binding 196 204 ribosome binding site
rpt 464 495 inverted repeat
BASE COUNT 158 a 232 c 283 g 186 t
ORIGIN
1 gatcggcgac ttccggcggg tcattacgct gccgaccagt ctggccggcc tggaagccgg
61 tcaggccgag atggagggtg aatggttgat tgtccccttc atggcgccgc aaatggcgca
121 acgctgaggc agcctgaact gtggggtttc tcggaggctt gcctccggtg acctgaataa
181 tggagtccgt ttgataagga ggtgtgtgca tggcgacgag aggctggttc tcggagtcgt
241 cggcgcaggt ggcgcaaatc ggcgacatca tgttccaggg ccactggcaa tgggtctcga
301 atgcgctaca ggccaccgcg gcagcggttg acaacatcaa ccgcaatgct tacccgggcg
361 tgtcccggag cggctcgggc gagggagcgt tcagcagcag cccgagcaac ggcttccgtc
421 cgaagcgcat tcgctcgcgc ttcaaccgct aggctggcgg gcaggcgtga ggcgtgcgtt
481 ctgtcgcctc acgccgatcc ctcgggagcg ttatgattaa cctgctctac tacagtcata
541 ccaccgtacc ggctcatctg gaagagatta cgcgactgga tgtgctcgat cctgacgagc
601 aagtgctggt cgcgatagat ggcgttttgg tcgatcagca gggacagcgg ctgagcggcc
661 caacgctgca cgattattgt ctgatcaccg acttgcgggt gatcctgtgg gcacgagatt
721 atggccggca tttgtgttat gcctttccat tgcatgagct gacgttgatt gatgggcgcg
781 ggattgatcc gatccacggc gcagtgacga tgggctttag cgcgccagat accgatgatc
841 agatttttac gctgactct
//
LOCUS VVCACTA 3385 bp ds-DNA PLN 28-JUN-1990
DEFINITION Volvox carteri actin gene, complete cds.
ACCESSION M33963
KEYWORDS actin.
SOURCE Volvox carteri (female strain HK10) DNA, clone lambda-VAc21.
ORGANISM Volvox carteri
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Volvocales; Volvocaceae.
REFERENCE 1 (bases 1 to 3385)
AUTHORS Cresnar,B., Mages,W., Mueller,K., Salbaum,J.M. and Schmitt,R.
TITLE Structure and expression of the single Volvox carteri actin gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Mages, 01-MAY-1990.
Author address: W.Mages
Lehrstuhl fuer Genetik
Universitaet Regensburg
Universitaetsstr.31, D-8400 Regensburg, FRG
FEATURES from to/span description
pept 450 509 actin, exon 2 (first expressed exon)
613 708 actin, exon 3
780 926 actin, exon 4
1009 1074 actin, exon 5
1150 1197 actin, exon 6
1396 1567 actin, exon 7
1667 1821 actin, exon 8
1959 2142 actin, exon 9
2425 2630 actin, exon 10
pre-msg 195 > 2630 actin mRNA and introns
IVS 258 439 actin intron A
IVS 510 612 actin intron B
IVS 709 779 actin intron C
IVS 927 1008 actin intron D
IVS 1075 1149 actin intron E
IVS 1198 1395 actin intron F
IVS 1568 1666 actin intron G
IVS 1822 1958 actin intron H
IVS 2143 2424 actin intron I
signal 120 124 CAAT box
signal 164 169 TATA box
BASE COUNT 775 a 830 c 928 g 852 t
ORIGIN
1 aagcttgaag agaggtctta tggaccctag tcggtcacga tccagacacc aaacgtaacg
61 aaaaaaagga ataattagcc gggcttgttg atgcatattc gcggctttac attatcccgc
121 caatacgttc acgtacggca caggacatcc ttttcactcg agctataaac tatcatcatc
181 cagattgctt gttttacctt gcaccagtta ctatcagcac caaccggccc ttgacttcgc
241 acagccttct aaaaaaggta cgcgcgacgt gtttagcgtt cgggcagccg cagctaggag
301 gtttggtggc tgcgaggccc ttctgcccca ggataagaag gggattaact tccgttgcca
361 taacatgaag tccggggcct tgtccatgtt cgatcgccaa cgatgcaacc tatgctgtct
421 taccgctggg cttatgcagg gcagcaacaa tggctgagga gggcgaggtt tctgctctgg
481 tgtgcgacaa cggctcgggc atggtcaagg tacgttttct ggctgctgct cggttgggcc
541 cggccaggcc caccccgcag ccttgctgag cttttgatgg atcgcatcta acacgtggcc
601 cctgatttgt aggctggctt tgctggtgat gatgccccac gagctgtgtt tccgagcatt
661 gttggtcggc cccgccatac gggtgtgatg gtcggcatgg ggcagaaggt gaatagaatg
721 acggcgatgc aggcacggac tcctcgacat ttttataatg atctttttgt tttcgccagg
781 attcctacgt gggcgacgag gcacagtcta agcgtggtat cttgacgttg cgctacccta
841 ttgagcacgg tatcgttacc aactgggatg acatggagaa gatctggcac cacactttct
901 ttaatgagct tcgtgtggca ccggaggtgc gtctgcgtac tagcttcctt ggctttcagc
961 tatggaccag ttcgacattg acaactgctt tccttcccaa atctgtagga gcacccagtc
1021 cttctcacgg aggcgcctct taatcccaag gcgaaccgtg agaagatgac gcaggtacgg
1081 atattgtttt atgcacatac cgttgcgttc gttcccaagc ttgcctcccg ataatctccg
1141 tttgcgcaga tcatgttcga gaccttcaac gtccctgcca tgtatgtggc gatccaggtg
1201 acgattgggg gaggggactt agtaatacct cgattgctgt tgttcgattc acccctctca
1261 tttcgatttg cgcacttggg gtggcatcat gtttcgtgcg tcggggtagt tgttaaccct
1321 gcaaccctag ctactgccac taatgacagt aggaaagtaa tatttaattg ctgtccttca
1381 tgctgacaac tgcaggccgt gctgtcgctg tatgctagcg gtcgtacgac gggtatcgtt
1441 ctagattccg gtgacggtgt aacccacacg gtgcccatct atgaaggcta cgcgctgccg
1501 cacgccatcc tgcgtctgga ccttgccggc cgtgacctga cggactacct gatgaagatt
1561 ctaatgggtg aggcattcgg atagggggac caaaataccg taccgcgaga aatgggttat
1621 gagtcgtgtg ttttgacttg tgttgaacgg cttcccttcg ttacagagcg tggttactca
1681 ttcacgacca ctgctgagcg tgaaatcgtg cgcgacatca aggagaagct ctgctatgtg
1741 gcgctggatt tcgagcagga aatggcaaca gcagcctcca gctcggcgct tgagaagacg
1801 tacgagctgc ctgatggcca ggtgcggccg tagctagacg ccgacgaata gcctaaaact
1861 ttcggtgatg atcagcttag atacatttga ttgtgtcggc attgcacgaa aaaagtcgtt
1921 gtccataatg ttaaaattgc cctcttttct ccctgcagcc aatcacaatt ggcaacgagc
1981 gcttccgttg ccctgaggtg ctgtacaacc ctagcctaat cggcatggag gcggtcggta
2041 tccacgacac tacctttaac agcatcatga agtgcgatgt cgatatccgc aaggatctgt
2101 acaacaacat cgtgctctct ggtggaacga ccatgttccc tggtaagcaa tgcatactac
2161 tgttagactt caggacatgt tgtgatcgca ttagcgatta gaaccgggtt agcgcgggac
2221 agacaggagg tacggtgttt ggggctcggg gaactcttat gaggatgaat aagcggttct
2281 tagcgtctat gtcacggacg gatctatgga attccgtcag tatctggagg cttgcatgtt
2341 gcatcgctcc ccctgaactc catcgtcctc cgtgaaggtc acagtgcctt gacaaatgcg
2401 atctgtgctg ttgaacaatt gcaggcatcg ctgatcgcat gaccaaggag atcacggccc
2461 ttgcgccgtc tgccatgaag atcaaggtcg tcgccccgcc ggagcgcaag tactctgtgt
2521 ggattggtgg ttcaatcctg gcgtccttgt cgaccttcca gcaaatgtgg atcgcgaaga
2581 gcgagtacga cgagagtggt ccgtcgattg tgcaccggaa gtgcttctaa gcataatagc
2641 ttgcggactg ttattcggcc tctttcggtt ataactgacc ggaagcgggt tactcttcag
2701 tcaaggcgga tgcctatgat tttgcaaaaa ttcgagcgga gatgtcatct tggtttcaag
2761 ctgcgtgggt gagaaatcaa ctgtgggatc catgtcgaca gcttaacaac agtctctacc
2821 atttcatggt atgcggctgt agccgtagcg agtacgtcgt catgagcagc ggtagtttga
2881 tcgtgcgaag tagtctgata ggggcagggc gcgtggggat ggctaataga acgaagagga
2941 tgtgtgcatt caagcgagtg gatgtcggcg attggtctag gccaagcaaa gtgcgtgtaa
3001 gggaaagctg gatgacacga ccattgctcg tgtgctttcg gaacttccac tcttgaactc
3061 aggcctatgc gctgccttga gtatcctgcc ggataaaagg cgggaggcgg taacgttagt
3121 agacccttcc aggggtataa atggtcaatt acaacaaaac tgtaacataa cgaagcgctg
3181 cacgtagtac gaaatccgtt gcaaacgtga aacccagcta cttgccaata cttaaacgcg
3241 catcagcatt gtgtacaaga acgtcaatga ggcattgcgt aaattacaca tgtgagcact
3301 aagaaaacag ctgtatgttg tcacgtttat tcactagcgc acagcccgca agtaagcggt
3361 ggcaaagccc agctttggta caaag
//
LOCUS HAMCD44 1089 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Hamster hyaluronate receptor (CD44), complete cds.
ACCESSION M33827
KEYWORDS CD44 hyaluronate receptor.
SOURCE Hamster, cDNA to mRNA.
ORGANISM Cricetulus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 1089)
AUTHORS Aruffo,A., Stamenkovic,I., Melnick,M., Underhill,C.B. and Seed,B.
TITLE CD44 is the principal cell surface receptor for hyaluronate
JOURNAL Cell (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Seed, 23-APR-1990.
FEATURES from to/span description
pept 1 1089 hyaluronate receptor (CD44)
BASE COUNT 299 a 282 c 277 g 231 t
ORIGIN
1 atggacaagt tttggtggca cgcagcttgg ggactctgcc tcttgccgct gagcctggcg
61 cacgagcaga tcgatttgaa cataacctgc cgctatgcag gtgtattcca cgtggagaaa
121 aatggccgct acagcatctc acggactgag gcagctgacc tctgccaagc tttcaacagc
181 actctgccca ccatggacca gatggtgatg gccctgagca agggctttga aacatgcagg
241 tatgggttca tagaaggcca cgtggtgatc ccgaggatcc agcccaatgc catctgtgca
301 gccaaccaca ctggggtgta tatcctcaca tccaacacat ctcactacga tacatattgc
361 ttcaatgcct cagcacccct tgaagaagac tgtacatctg tcacagacct gcccaattcc
421 ttcgaaggac cagttaccat aactattgtc aaccgtgatg gtacccgcta cagcaagaag
481 ggcgagtata gaacacacca agaagacatt gatgcctcaa ataccacaga tgatgatgtc
541 agcagcggat cctccagtga gaagagcacc tcagggggct atgttttcca cacctacctt
601 cccactatac actcaactgc agaccaggat gatccctact tcatcgggag caccatggcc
661 accagagacc aagactcatc catggatccc agggggaatt ccctcactgt gactgatgga
721 tccaaattaa ctgaacactc aagtgggaat caagacagtg ggcttaactc aacttctcgt
781 cctggaggaa aacctcgagt tccagaatgg ctcatcgtct tggcatctct cctggcgctg
841 gctctgattc ttgctgtttg cattgctgtc aacagtagga gaaggtgtgg acagaagaaa
901 aagctggtga tcaacagtgg caatggaaag gtggaggaca ggaagccaag tgagctcaac
961 ggggaggcca gcaagtctca ggaaatggtg catttggtga acaaggaacc atcagagact
1021 cctgaccagt ttatgacagc tgatgagacc cggaatctgc agaatgtgga catgaagatt
1081 ggggtgtag
//
LOCUS DROGNBPSA1 441 bp ds-DNA INV 28-JUN-1990
DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha
subunit gene, exon 1.
ACCESSION M33996
KEYWORDS guanine nucleotide-binding protein.
SEGMENT 1 of 3
SOURCE D.melanogaster (strain Canton S) embryo DNA, clones
lambda-DGs-[4,5,7,10,11,12].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 441)
AUTHORS Quan,F. and Forte,M.A.
TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by
alternate splicing involving an unusual splice site
JOURNAL Mol. Cell. Biol. 10, 910-917 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pre-msg 362 > 441 guanine nucleotide-binding protein (Gs-alpha)
mRNA and introns (alt.)
pre-msg 174 > 441 Gs-alpha mRNA and introns (alt.)
IVS 426 > 441 intron A
signal 152 157 promoter region (put.)
variant 365 365 c in DNA, g in cDNA
BASE COUNT 140 a 89 c 85 g 127 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcgcta ttctacatag ctcgggatac tttattttat tattaatatt cgctaaaata
61 gcacatcgtc atttggcata gatgtttttg gcccagtaac atatctatct acatcaccgc
121 ccgcactatc cgataggcag actgactatc gataaaagta tctacttcgt tgcatcccta
181 ggttgataca taaaataaaa aatatataaa aagtgagtta gtactggtat taaaagtaat
241 ttttaagcta caaattgatg aattgcgagc acacagagac taagaacatt gggaatccgg
301 aaaacagtgg ctggcgcgtc gggtgcgata gaattgcctg tttttcagac cccaaaagtc
361 gctcccaacc gagtcgtgca cctatagtct tggcccgaag ttgctcccat gaataaaaat
421 gttcggtaat ttcacggtaa a
//
LOCUS DROGNBPSA2 897 bp ds-DNA INV 28-JUN-1990
DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha
subunit gene, exons 2,3 and 4.
ACCESSION M33997
KEYWORDS guanine nucleotide-binding protein.
SEGMENT 2 of 3
SOURCE D.melanogaster (strain Canton S) embryo DNA, clones
lambda-DGs-[4,5,7,10,11,12].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 897)
AUTHORS Quan,F. and Forte,M.A.
TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by
alternate splicing involving an unusual splice site
JOURNAL Mol. Cell. Biol. 10, 910-917 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 265 540 guanine nucleotide-binding protein
(Gs-alpha-S), exon 2
607 703 guanine nucleotide-binding protein
(Gs-alpha-S), exon 3
760 + 880 guanine nucleotide-binding protein
(Gs-alpha-S), exon 4
pep$ 265 540 guanine nucleotide-binding protein
(Gs-alpha-L), exon 2
607 703 guanine nucleotide-binding protein
(Gs-alpha-L), exon 3
760 + 880 guanine nucleotide-binding protein
(Gs-alpha-L), exon 4
pre-msg < 1 > 897 Gs-alpha mRNA and introns
IVS < 1 29 intron A
IVS 541 606 intron B
IVS 704 759 intron C
IVS 881 > 897 intron D
BASE COUNT 266 a 225 c 226 g 180 t
ORIGIN About 1.4 kbp downstream of segment 1.
1 ccatctacta taccattgta cgaatgcagc ctgctgcaga gatccactga agcgctgagc
61 caacgccaac ggtggtgcca ctgagatcgg agatcggaga tcgcagaccg gagaccgacg
121 agcacacaag ggattgtggt ttgatccgag gaacccaagc actcagctat actggagcag
181 aacggagaaa gcacaagggg tgccgagtcg cccgacgcga accaggaaaa ccgagcgacg
241 tgttggtgga ccccgtgggc tgcgatgggt tgctttgggt cgcccacctc caagcagtcg
301 gacgtgaact cggaggactc gaagagccag aagcgccgga gcgatgcaat atctagacag
361 ttgcagaagg acaaacagct ctacagggcc acacacaggc tgctcctcct gggggcgggc
421 gagtccggca aatcaaccat agtcaagcaa atgcgaatat tgcatgtcga cggattttct
481 gactcggaaa agaaacagaa aattgatgat attaaaaaga atattcgaga cgctatcttg
541 gtgagttcca tcccgccggc catctaagcc caaacacttg cttaactcct tgccaatccc
601 ttacagacta ttacaggagc catgagcaca cttaatccac ctgtagcttt agaaaagaag
661 gaaaatgaac ccagagtgga gtacattcag gattatgcat ctagtaagag cctctgccgc
721 cccaacttaa gttgttatca accattatct gccccgcagg tccggacttt aattatcctc
781 ctgaatttta tgaacataca gaagaactat ggaaagacaa gggcgttctt caaacctatg
841 agaggtcgaa tgagtatcaa ttaatcgatt gtgcgaaata gtaagtaatg cccaccc
//
LOCUS DROGNBPSA3 1380 bp ds-DNA INV 28-JUN-1990
DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha
subunit gene, exons 5,6,7,8 and 9.
ACCESSION M33998
KEYWORDS guanine nucleotide-binding protein.
SEGMENT 3 of 3
SOURCE D.melanogaster (strain Canton S) embryo DNA, clones
lambda-DGs-[4,5,7,10,11,12].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1380)
AUTHORS Quan,F. and Forte,M.A.
TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by
alternate splicing involving an unusual splice site
JOURNAL Mol. Cell. Biol. 10, 910-917 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 4 132 guanine nucleotide-binding protein
(Gs-alpha-S), exon 5
206 385 guanine nucleotide-binding protein
(Gs-alpha-S), exon 6
445 569 guanine nucleotide-binding protein
(Gs-alpha-S), exon 7
746 828 guanine nucleotide-binding protein
(Gs-alpha-S), exon 8
909 1055 guanine nucleotide-binding protein
(Gs-alpha-S), exon 9
pep$ + 4 132 guanine nucleotide-binding protein
(Gs-alpha-L), exon 5
206 385 guanine nucleotide-binding protein
(Gs-alpha-L), exon 6
445 569 guanine nucleotide-binding protein
(Gs-alpha-L), exon 7
755 828 guanine nucleotide-binding protein
(Gs-alpha-L), exon 8
909 1055 guanine nucleotide-binding protein
(Gs-alpha-L), exon 9
pre-msg < 1 1115 Gs-alpha mRNA and introns (alt.)
pre-msg < 1 1176 Gs-alpha mRNA and introns (alt.)
pre-msg < 1 1243 Gs-alpha mRNA and introns (alt.)
IVS < 1 3 intron D
IVS 133 205 intron E
IVS 386 444 intron F
IVS 570 745 intron G (for Gs-alpha-S) (no splice
consensus)
IVS 570 754 intron G' (for Gs-alpha-L)
IVS 829 908 intron H
variant 780 780 t in DNA, c in cDNA
signal 1083 1088 polyA signal
signal 1154 1159 polyA signal
signal 1221 1226 polyA signal
BASE COUNT 441 a 265 c 254 g 420 t
ORIGIN About 500 bp downstream of segment 2.
1 aagcttcctg gaccgagtga gcacaatcaa gaatccaaac tacaccccta atgagcagga
61 tattcttcgg tgccgtgttt tgacttctgg aatatttgaa acaagatttc aagtggacaa
121 agtaaacttt cagtaagtgt aatcccaagt gatatcgcaa tattttagtt tctattccat
181 tctaatgata tgcattcgtg tgcagcatgt tcgatgtcgg tggccagcgg gacgagcgta
241 ggaaatggat tcagtgtttc aatgatgtaa ctgctatcat attcgtaact gcgtgctcaa
301 gttataacat ggttttgcgg gaagatccca cccagaaccg acttcgagaa tctttggatt
361 tgttcaagag tatttggaac aacaggtgaa ccgagcaatt tggcgccatc tcgacaaatg
421 gaaactaacg agttgttctt tcagatggct tcgcacgatt tctattatac tatttttaaa
481 taagcaagat ttgttagcag agaaaattaa ggctggaaaa agtaaattgt cggaatattt
541 ctccgagttt aacaaatacc aaacgccaag taagtaaaat agtataaccc atagtagagg
601 ccaattgagg gcgagggagt catcagatca accgactcag atctggccaa gaaattcgca
661 attccttttc aaatttgcaa accgtgccca agttaactta agaaatttaa tggataatgt
721 attttgtcta atttcaaatg tgctgtcgac acaggtgacg caataatgga atccaatgat
781 gacccagaag taatacgagc aaaatatttc atacgagacg agtttctggt gggttatatc
841 acatctccgt ttcttcctcc acctatctaa ctagccttaa tccttaatgg gatctctttc
901 ttatatagcg tatatctacc gctagcggag acggaaaaca ctactgctat ccacatttca
961 catgcgccgt tgacacagaa aacattaaac gtgtgtttaa tgattgcaga gacattattc
1021 aaaggatgca ccttcgtcaa tatgaattgt tataggttat ccccatcgcc gtaatgcaag
1081 taaataaaaa tattaatgac ttactgtttt aaatataatt atcaaatgta atttgtattt
1141 aaatagtccc ttgattaaaa caaaaaattc gctgcattta gatctggtat gacctttcac
1201 aaaactcaac attaaagaat aataaataaa catgttaata acagacacgc ttgtttcgat
1261 tattttggct attctaccag gatgcacctg ctttattttc tatagtccct tgacatgaag
1321 cattttagag tatttccgct atcagagtcc gtatccatat ccatgggcat ttacattcac
//
LOCUS SYCSBEFA 850 bp ds-DNA BCT 28-JUN-1990
DEFINITION Synechocystis sp. cytochrome B559 alpha subunit (psbE) and beta
subunit (psbF) genes, complete cds.
ACCESSION M33897 X06988
KEYWORDS cytochrome B559 alpha subunit; cytochrome B559 beta subunit;
photosystem II protein.
SOURCE Synechocystis sp. (strain PCC6803) DNA, clone pKW1261.
ORGANISM Synechocystis sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Chroococcales.
REFERENCE 1 (bases 1 to 553)
AUTHORS Pakrasi,B.P., Williams,J. and Arntzen,C.
TITLE Targeted mutagenesis of the psbE and psbF genes blocks
photosynthetic electron transport: Evidence for a functional role
of cytochrome B559 in photosystem II
JOURNAL EMBO J. 7, 325-332 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 850)
AUTHORS Pakrasi,H.B., Nyhus,K.J. and Granok,H.
TITLE Targeted deletion mutagenesis of the beta subunit of Cytochrome
B559 protein destabilizes the reaction center of photosystem II
JOURNAL Z. Naturforsch., C, Biosci. 45, 423-429 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Himadri, 01-MAY-1990.
Washington University Dept. of Biology
Campus Box 1137
St. Louis, MO 63130
FEATURES from to/span description
pept 121 366 cytochrome B559 alpha-subunit (psbE)
pept 403 537 cytochrome B559 beta subunit (psbF)
binding 109 112 ribosomal binding site (put.)
binding 392 396 ribosomal binding site (put.)
BASE COUNT 197 a 196 c 197 g 260 t
ORIGIN
1 gctaccccag caccgaggtg gcaatggttc cctaggcggc tcacaaaata gtagactaga
61 ctctacttgc tttgcatttg tcagtcaatg ttgttttgaa aaattgaagg agaacacaaa
121 atgtcaggga ctaccggcga gcgtccattt tccgatattg tcaccagcat tcgctactgg
181 gtgatccaca gcatcaccat cccgatgttg tttattgctg gttggttgtt tgtcagcacg
241 ggcttagcct acgatgcttt tggcactccc cgccccgatg aatatttcac ccagacccgt
301 caagagttgc ccattctcca ggaacgctac gacattaatc aggaaattca agagtttaat
361 caataaaaca tttaattgtt cttttttagt tggtaattaa caatggcaac ccaaaatcct
421 aatcaaccgg ttacttatcc catttttacg gtgcgctggc tggcggttca caccctggcg
481 gtgccctctg tcttctttgt cggggcgatc gccgcgatgc aatttattca acgctaggag
541 tttttcatgg acagaaattc aaacccaaac cgccaaccgg tggaattgaa ccgcacttct
601 ttatacctgg gtctattgtt ggtggctgtg ttggggattt tgttctccag ctatttcttt
661 aactaaactt ttttaatacg caatttagga ggcatggtat gttcgcagaa ggcagaatcc
721 ctttgtgggt ggtgggtgta gtggccggta ttggcgccat tggtgttcta ggattatttt
781 tctacggagc ctatgctggt ttaggttctt ccatgtaatc gagggctagc cgccacacaa
841 tatcatggtt
//
LOCUS ECOPURCA 2060 bp ds-DNA BCT 28-JUN-1990
DEFINITION E.coli 5-phosphoribosyl 5-aminoimidazole 4-N-succinocarboxamide
synthetase (SAICAR) and dapX gene, complete cds.
ACCESSION M33928
KEYWORDS 5'-P-ribosyl 5-aminoimidazole 4-N-succinocarboxamide synthetase;
SAICAR synthetase.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2060)
AUTHORS Tiedemann,A.A., DeMarini,D.J., Parker,J. and Smith,J.M.
TITLE DNA sequence of the purC gene encoding SAICAR synthetase and
organization of the dapA-purC region of Escherichia coli K12
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (sites only)
AUTHORS He,B., Shiau,A., Choi,K.Y., Zalkin,H. and Smith,J.M.
TITLE Genes of the Escherichia coli pur regulon are negatively controlled
by a repressor-operator interaction
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Authorin sequence for [1] kindly submitted by J.M.Smith,
27-APR-1990.
Seattle Biomedical Research Institute
4 Nickerson Street
Seattle, WA 98109-1651
e-mail: jms0234@max.acs.washington.edu
FEATURES from to/span description
pept 1281 1994 5'-phosphoribosyl-5-aminoimidazole-4-
N-succinocarboxamide synthetase (EC 6.3.2.6)
pept 637 1068 dapX protein
signal 1106 1111 -10 region
binding 1112 1127 purine repressor protein binding site (PUR
box)
site 1180 1216 repetitive extragenic palindrome
BASE COUNT 501 a 504 c 587 g 468 t
ORIGIN
1 gcatgccggt ttgctgtaaa gtttagggag atttgatggc ttactctgtt caaaagtcgc
61 gcctggcaaa ggttgcgggt gtttcgcttg ttttattact cgctgcctgt agttctgact
121 cacgctataa gcgtcaggtc agtggtgatg aagcctacct ggaagcggca ccgcttgcgg
181 agccttcatg ccccggctgg aatgattttg ccggtgacct ccggtgttat gcaatcccgg
241 tgaccaacgg tagtggtgct gtcggtaagg cgctgacatt cgtccaccag cccagccgct
301 ggcactggtt tctggcgcgc gtacccagtt cacgggcgat accgcttcat tgctggtgga
361 aaatggtcgt ggcaatactc tgtggccgca ggtggttagc gtgctgcagg cgaaaaacta
421 caccatcacc caagtgatga tgctggtcag acactgacca ccgattgggt acaatggaac
481 cgtctggacg aagacgagca gtatcgtggt cgttatcaaa tctctgttaa gccgcagggt
541 tatcagcagg cggttacggt taaactgctg aacctggaac aggcgggcaa accggttgca
601 gacgcggctt ccatgcagcg ttacagcacg gagatgatga acgttatttc cgccggtctg
661 gataaatctg ccactgacgc cgcgaaccgt gcgcaaaatc gtgcctccac cactatggac
721 gtacaaagtg cagctgatga caccggttta ccaatgctgg tcgtacgcgg gccgttcaat
781 gtggtttggc aacgtctgcc agcggcgctg gaaaaagtgg gcatgaaagt gaccgacagc
841 acccgttcgc agggcaacat ggccgtaact tataagccgc tgtctgacag cgactggcag
901 gaactgggcg cgagcgatcc aggcctggca tccggtgact ataaactgca ggttggcgat
961 ttagataacc gcagcagcct acagttcatc gatccgaaag gtcatactct gactcagagt
1021 cagaacgacg cgctggtagc tgtcttccag gctgcgttta gcaagtaaaa atacagggct
1081 ggaatcatcc ggcccttttt tctgatatga tacgcaaacg tgtgcgtctg caggaaaacg
1141 cgattttagc ggtaattcgc acgaaatttg tttgtcggac gtagttcgga taaggcgttc
1201 acgccgcatc cgacaaaaca tccggcacac cagacagcaa aagattttaa aacgttaatt
1261 cacacccagg agtgataaag atgcaaaagc aagctgagtt gtatcgtggt aaagcgaaaa
1321 ccgtatacag cacggaaaac ccggacctgt tggtgctcga attccgcaat gatacgtcag
1381 caggggatgg cgcgcgcatt gagcagtttg atcgcaaagg tatggtgaac aacaagttca
1441 actacttcat tatgagcaaa ctggctgaag cgggtatccc gactcaaatg gagcgtctgc
1501 tctccgatac cgaatgtctg gtgaaaaagc tggatatggt gccggttgag tgtgtcgtgc
1561 gtaaccgtgc tgctggctct ctggtgaaac gtcttggaat cgaagaaggt attgagctga
1621 acccgccgct gttcgatctg ttcctgaaaa acgacgccat gcacgatccg atggtcaacg
1681 aatcttactg cgaaaccttt ggctgggtga gcaaagagaa cctggcgcgt atgaaagagc
1741 tgacctacaa agcgaacgac gtgctgaaaa aactgttcga tgatgctggt ctgattctgg
1801 tcgacttcaa gctggaattt ggtctgtaca aaggcgaagt ggtactgggt gatgagttct
1861 ccccggacgg tagccgcctg tgggacaaag aaacgctgga gaaaatggac aaagaccgtt
1921 tccgccagag cctcggtggc ctgatcgaag cctatgaagc cgtcgcccgc cgcctgggtg
1981 tacagctgga ctgatttttc tgttcatcat cttgccgtgc agctggcacg gcaagacaac
2041 cgctctcgga atatatgcat
//
LOCUS PIGSGLT1A 3649 bp ss-mRNA MAM 28-JUN-1990
DEFINITION Pig Na+/glucose cotransporter protein (SGLT1) mRNA, 3' end.
ACCESSION M34044
KEYWORDS Na+/glucose cotransporter.
SOURCE Pig kidney cell line LLC-PK1, cDNA to mRNA, clone pPSGT-[B1,C2].
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 3649)
AUTHORS Ohta,T., Isselbacher,K.J. and Rhoads,D.B.
TITLE Regulation of glucose transporters in LLC-PK1 cells: Effects of
glucose and monosaccharides
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Ohta 03-MAY-1990.
Mass General Hospital
Building 49
13th Street
Charlestown, MA 02129
FEATURES from to/span description
pept < 1 1818 Na+/glucose cotransporter (SGLT1)
signal 1935 1940 polyA signal (put.)
signal 2086 2091 polyA signal (put.)
signal 2238 2243 polyA signal (put.)
signal 3205 3210 polyA signal (put.)
signal 3628 3633 polyA signal (put.)
BASE COUNT 905 a 790 c 871 g 1083 t
ORIGIN
1 ttcttcctgg ctggacgaag tatggtgtgg tggccggttg gagcttctct gtttgccagt
61 tatatcggaa gcgggcattt tgttgggctg gcaggaactg gagcagctgc aggcattgcc
121 actggtggct ttgaatggaa tgctctgatt tgggtggttg ttctgggctg gctgtttgtc
181 cccatttaca ttaaggcagg ggtggtgacg atgccagagt atctgcggaa gcgatttgga
241 ggcaagcgga tccaggtcta cctctctatc ctgtccctga tgctctacat tttcaccaag
301 atctcggcag acatcttctc tggggccata ttcatcacct tggccttggg cctggatctg
361 tacctggcca tctttctctt actggcaatc actggccttt acacaatcac agggggcctg
421 gctgctgtga tttatacgga taccttgcag acagcaatca tgctggtggg gtcttttatc
481 ctaactgggt ttgcttttca tgaagtggga ggttatgatg ccttcatcga aaaatacatg
541 aatgccattc cgactgtgat ttctgatgga aatatcacca tcaagaaaga atgttatgcc
601 ccgagggctg actccttcca tattttccga gatcctctca agggagacct gccatggcct
661 gggctcacct ttgggctgtc catcctcgcc ctgtggtact ggtgcacaga tcaggtcatt
721 gtgcagcgct gtctctcggc caaaaacatg tcacatgtga aggctggctg cgtcatgtgt
781 gggtacttta agctgctgcc catgtttgtc atagtgatgc cagggatgat cagccgcgtc
841 ctgtacacag aaaaaattgc ctgcaccgtc ccctcggaat gtgagaaata ttgcggcacc
901 aaggttggct gttccaacat tgcctacccg accttggtgg tggaactcat gcccaatgga
961 ttgcgaggcc tgatgctgtc ggtcattttg gcctctctca tgagctccct gacctccatc
1021 ttcaacagcg ccacgactct cttcaccatg gatgtctacg ccaagatccg gaagagagca
1081 tctgagaaag agctcatgat tgcgggaagg ctgttcatcc tggtgctgat tggcatcagc
1141 attgcctggg tgcccattgt gcagtcagca caaagtgggc agctcttcga ttacatccag
1201 tctgtcacca gttacttggg accacccatc gcagctgtct tcctacttgc tattttctgc
1261 aagagagtca atgaggaggg agccttttgg ggactggtca taggatgtat gattgggctg
1321 gcccgtatga ttaccgagtt tgcctatgga accgggagct gcgtggagcc cagcaactgt
1381 cccacaatta tctgtggtgt gcactacttg tactttgcca tcatcctctt tgtcatttcc
1441 atcatcatcg tcctggtcgt ctccctcttc accaagccca ttccagatgt gcatctctac
1501 cgcctgtgtt ggagcctgcg caacagcaaa gaggagcgta ttgacctgga tgcagaggag
1561 gaggacattc aagaagcccc agaagagacc attgaaatag aagttcctga ggagaagaaa
1621 ggatgcttca ggaggaccta tgacctgttt tgtggcctgg accagcagaa gggccccaaa
1681 atgaccaagg aagaggaggc agccatgaag ctgaagatga cagacacctc agagaagcct
1741 ttgtggcgga cagtagtgaa catcaatggc attatcctgc tgaccgtggc cgtattctgc
1801 catgcatatt ttgcctgaaa tcctgccttc tgccacaggc tgtgccctga aggctggact
1861 ctttactctg tttcctttta gtcccattct gtgctgtcaa agggacacca gccagttgta
1921 aattttgcct agtcaataaa tgtgtacatt tgtaattagc cagaaaaaaa aatcagtgat
1981 ctgttattaa cttatgtatt tgaggccagt gtcatacagt catctgtaca tatcagacct
2041 gcagaaggaa gtccactcag tcggatgtct aggaaaaggc agactaagaa acagaagtcc
2101 cgtgatgtct gatgcaagtt tgtctcaggt agattcagca tgtcagtgtt gtttgtgatc
2161 cttgaatgtt attttaagag ttgtggtcgc cctggttcgg ccacttccta taaacctcac
2221 ctctcataat ttttcaaaag aaaaaatccc ccccctcccg atatcatgtc aagtttttcc
2281 ggcacaagag taacaatttg ataagttgat gaacacattg aagctgagga tggaagtttg
2341 atggagtccc atcctgggtc tttgctctcc aaaggagagg tctgggatgg taatacttcc
2401 cccaaagaga gggacccagg tgccaatctc aggccactag ggaaatcagg taacttgagt
2461 caaggcgatt ttctgcattt cttagaagtg tccattgtga gttccttggg ttgagatttc
2521 attcattttg gcatatcttt tgcctgtact gtcttagtca ctcagcatgt ctgtggacca
2581 aatgaatgat ataacatttt gttttataat gtttgcttta tttttggtta actttttctt
2641 atggctgcac aagtacaacc tctcgtgact ataatgaatt gggtgcattt ccataggcct
2701 gtcttcctca tggtgccaat gctggcatac aggtctagaa gctgactctc tcacataata
2761 gagcaaatgc acctcttgga ggcagccgtc atctcaaagt aaccacactc tcctgaaaca
2821 ctgaaacttt ttggcctgag accctccttc tatgcgctca catgaccata gctcatattt
2881 ttttgaggtg tcttgattga tgatcatcca ctgaccaagg ttagagtggg tggtctctgt
2941 tccaggtggt agcacttgac actcttattt cttgtttcct agggagaatt tttaaaaatt
3001 aaatttaatt aatttaattc ttttaattcc accagtgaaa tattaattat aatataatcc
3061 tgtggaagtc atggtgttgg tattgactgg tttcaagcta gcttggaaaa gagaagtgat
3121 ctgagattct gtggagctaa gagcggttcc ttgggtataa aagtggctta acaggtatac
3181 aatggcttgt tcttgtataa ggaaaagaaa attttgatat gccctatatt ccccttaaag
3241 ctcaggcaga gaagatgcca aaagtggaga atttcacatt attcagagta cttgtgggag
3301 tttgtgatgt ttttgagctt tgagaggata aagagggatt atagaaagtt ggggaatgga
3361 acaggcataa gtatctagat tgtcctagat tctgtttccg actttgccaa tgattcactg
3421 tgtatattgc agtaaattat tttgtctcaa ctgtgtattt tcctctttga aaaatggaga
3481 ccttggtgtt atagatctac ctacctcctc tgggtgtgac aaggtcattt tgaaaaggtt
3541 agatataata catgaaagat gccaagcaaa tttatattaa gcttcattaa tgtagtccca
3601 tacattaaat tttactactt tggtaaaaat aaaagacatt aaaaatgat
//
LOCUS RATSHIIA 3064 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Rat K+ channel protein (KShIIA) mRNA, complete cds.
ACCESSION M34052
KEYWORDS K+ channel protein.
SOURCE Rat brain, cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3064)
AUTHORS McCormack,T., Vega-Saenz de Miera,E.V. and Bernardo,R.
TITLE Molecular cloning of a member of a novel class of K+ channel genes
in mammals
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by R.Bernardo, 07-MAY-1990.
FEATURES from to/span description
pept 80 1771 K+ channel protein (KShIIa)
BASE COUNT 777 a 718 c 729 g 840 t
ORIGIN
1 gccacggtga acatctggcc cacgagagct ttaacttggt gctgtgttcg ccttccctag
61 tcatgtctga gccacagaga tgggcaagat cgagaacaac gagagggtga tcctcaatgt
121 cggaggcacc aggcacgaaa cctaccgcag cactctcaag acccttcctg gaactcgcct
181 ggcccttctc gcctcctctg aacctcaggg cgactgcctg actgctgcgg gtgacaagct
241 gcagccgctg ccccctccgc tgtctccacc gccgcgaccg cctcccttgt cccctgtccc
301 cagcggctgc ttcgagggcg gcgcaggcaa ctgcagttcg cacggtggca atggcagcga
361 ccaccctggg ggaggccgcg aattcttctt cgatcgccac ccaggagtct tcgcctatgt
421 gctcaactac taccgcacgg gcaagctgca ctgccccgcc gacgtgtgtg gaccgctctt
481 cgaggaagag ctggcattct ggggcatcga tgagaccgac gtggagccct gctgctggat
541 gacctacagg cagcaccggg acgcggagga ggccctggat atcttcgaga cacccgacct
601 catcggaggc gaccctggtg atgatgagga cctagggggc aagagactgg gcattgagga
661 tgctgcgggg ctgggaggac ccgatggcaa gtctggccgc tggaggaagc tgcagcctcg
721 catgtgggct ctctttgagg acccctattc atccagagcc gctaggttta ttgcttttgc
781 ttctctgttc ttcattttgg tttccatcac aaccttttgc ctggagacac acgaagcttt
841 caatattgtt aaaaacaaga cagagccagt catcaacggc accagcgctg ttctccagta
901 tgaaatcgaa acggatcctg ccttgacata tgtggaagga gtgtgtgtgg tgtggtttac
961 ttttgaattt ttagtccgta ttgttttctc gcccaataaa cttgagttca tcaaaaatct
1021 attgaacatc attgactttg tggccatcct ccccttctac ttagaggtgg gactcagcgg
1081 gctgtcttcc aaagcggcta aagatgtgct cggctttctc agggtggtta ggtttgtgag
1141 gatcctgaga atcttcaagc ttacccgcca tttcgtaggt ctgagagtgc tcggacacac
1201 tcttcgtgcg agcaccaatg aatttttgtt gctgatcatc tttctggctc tgggagtttt
1261 gatattcgct acgatgatct actacgctga gcgagtaggg gctcaaccta atgatccctc
1321 agcgagtgag cacacacagt tcaaaaacat ccccattggt ttctggtggg ctgtggtgac
1381 catgactacc ttaggctatg gggatatgta cccccaaaca tggtcaggga tgttggtggg
1441 ggccttgtgt gctctggctg gagtgctgac catagctatg cctgtgcccg tcattgtcaa
1501 caattttggg atgtactact ccttggcaat ggcgaagcag aaacttccaa gaaaaagaaa
1561 gaagcacatt cctcctgccc ctctggcaag ctcacctaca ttttgcaaga cagaattaaa
1621 catggcttgt aacagtaccc agagtgacac atgtctgggc aaagaaaacc ggcttctgga
1681 acataacaga tcagtgttat caggtgacga cagtacagga agtgagccgc cattatcacc
1741 ttccggaaag gctccccatc agacgctcta gtaccagaga caaaaacaga agaggggaaa
1801 catgtttcct gttgacgaca ggtgattaca cgtgcgcttc tgatggagga atcaggaaag
1861 ataactgcaa agatgttgtc attactggtt acacgcaagc cgaggccaga tctcttactt
1921 aatgacttgg gaaaggcaca aaacatgaaa gaaagtgttg tacagaattt atcatggatt
1981 tttgcctgct gaaaatggga cattgagatt tagccattca aggattgtac tggaaacttc
2041 tgctactgaa tgtgaccgga tgtgaccggt ttgtgttcag aagagttccc gcattctctg
2101 aggcatttaa agcttgtaaa gaaatggtgg ctggaactga aatgggcttt gccaaggaaa
2161 tattctgctt gtcgacagac cagagtccac aaaacaactg taaataccaa cgtgtgcatg
2221 ggtccaccct tctcacagtc tcatagcaca gggagccaca ttcattgtca gcatctgtga
2281 agcttcaaga aatgcccaca aggctctgaa tggctcctct cgggctcatg gtggttttgc
2341 tgcaaattac agagggcagt gaatccaatg tagtataggt ctgttttcat ttcttttatt
2401 tgtacattac gtgttgtatc tacaaacacc gattgcctca ttttcttgca aaccacttat
2461 atatccgctt gtgaactatg ttgtaaactt ttagtgtcta catagaaatc taaccatttc
2521 ttagtcatgt attcatttag ttctggaacg ctgttactat ctagaagaca gtcatcacca
2581 gtattcatcg tataagtctt ttgtcattgt tacctcaatt ataaatatta aaaatataaa
2641 ttctggcaat gagaatattt ttttattaaa tgatcaagga aaatgtcagt atatagtaga
2701 atattatcaa attatatcct aaaatgtcta ttttgcataa aagagatatt cttcaatcga
2761 ttcctttttt tcgtgagctt tgtggctaat gaagcgtgtg tttgtcttca tcactgctgt
2821 agttgaaact gtgtaagagt ttttcatctt gcttaatcaa tatttccaga atctcttagt
2881 tcccctggga ttctgaatat actctatagc ctaattataa acccctgtat cgtgtacctt
2941 ttgtgaccat ctcaaggtgc atgcctaatc gttgtgataa gccaatggac atgttactaa
3001 ctgaaatgaa gaataaaagt caaacgatct ggggataaac ttgaatccta tctgattaaa
3061 tgat
//
LOCUS ECOHIP 726 bp ds-DNA BCT 28-JUN-1990
DEFINITION E-coli hip gene for integration host factor beta-subunit (IHF
beta).
ACCESSION X04864
KEYWORDS 30S ribosomal subunit protein S1; integration host factor;
integration host factor beta-subunit; rpsA gene.
SOURCE Escherichia coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 597)
AUTHORS Weisberg,R.A.
JOURNAL Unpublished (1987) NICHD/NIH, Bldg.6 Rm.306, Bethesda MD 20892.
STANDARD simple automatic
REFERENCE 2 (bases 1 to 597)
AUTHORS Flamm,E. and Weisberg,R.A.
TITLE Primary structure of the hip gene of Escherichia coli and of its
product,the beta-subunit of integration host factor
JOURNAL J. Mol. Biol. 183, 117-128 (1985)
STANDARD simple automatic
REFERENCE 3 (bases 1 to 726; revises [1],[2])
AUTHORS Mengeritsky,G., Giladi,H., Teff,D., Goitein,D., Weisberg,R.A. and
Oppenheim,A.B.
TITLE Revised DNA sequence of the hip gene of E.coli
JOURNAL Unpublished (1990)
STANDARD simple automatic
COMMENT **map:20';
EMBL features not translated to GenBank features:
key from to description
INVREP 495 508 inverted repeat B
INVREP 510 523 inverted repeat B'
TERM 495 523 put. transcription terminator
[3] Author address G.Mengeritsky
Dept of Mol Genetics,
Hebrew University-Hadassah Medical School
Jerusalem, Israel
GALINA%HUJIMD.BITNET@CUNYVM.CUNY.EDU
FEATURES from to/span description
pept < 1 29 rpsA
pept 190 474 IHF
revision 455 459 gcgcc in [3]; gc in [1],[2]
rpt 40 48 inverted repeat A
rpt 53 61 inverted repeat A'
signal 40 61 put. transcription terminator
signal 76 81 -35 region
signal 99 104 -10 region
binding 103 116 IHF binding site
site 475 483 Lambda-E.coli erossover point
mut 335 335 g in wt; a in hip157 (gly is asp)
BASE COUNT 204 a 167 c 162 g 193 t
ORIGIN
1 aagctttcaa agcagctaaa ggcgagtaat tctctgactc ttcgggattt ttattccgaa
61 gtttgttgag tttacttgac agattgcagg tttcgtcctg taatcaagca ctaagggcgg
121 ctacgggccg cccttaatca atgcagcaac agcagccgct taatttgcct ttaaggaacc
181 ggaggaatca tgaccaagtc agaattgata gaaagacttg ccacccagca atcgcacatt
241 cccgccaaga cggttgaaga tgcagtaaaa gagatgctgg agcatatggc ctcgactctt
301 gcgcagggcg agcgtattga aatccgcggt ttcggcagtt tctctttgca ctaccgcgca
361 ccacgtaccg gacgtaatcc gaagactggc gataaagtag aactggaagg aaaatacgtt
421 cctcacttta aacctggtaa agaactgcgc gatcgcgcca atatttacgg ttaagttttt
481 atactaagtt ggcattataa aaaagcattg cttatcaatt tgttgcaacg aacaggtcac
541 tatcagtcaa aataaaatca ttatttgatt tcaattttgt cccactccct gcctctgtca
601 tcacgatact gtgatgccat ggtgtccgac ttatgcccga gaagatgttg agcaaactta
661 tcgcttatct gcttctcata gagtcttgca gacaaactgc gcaactcgtg aaaggtaggc
721 ggatcc
//
LOCUS GPIGPC3A 5077 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Guinea pig complement C3 protein (GPC3) mRNA, complete cds.
ACCESSION M34054
KEYWORDS complement C3.
SOURCE Guinea pig (strain Hartley) liver, cDNA to mRNA.
ORGANISM Cavia porcellus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Hystricomorpha; Caviidae; Caviinae.
REFERENCE 1 (bases 1 to 5077)
AUTHORS Auerbach,H.S., Burger,R., Dodds,A. and Colten,H.R.
TITLE Molecular basis of complement C3 deficiency in guinea pigs
JOURNAL J. Clin. Invest. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by H.S.Auerbach 07-MAY-1990.
FEATURES from to/span description
pept 28 5028 complement C3 protein (GPC3) precursor
sigp 28 93 complement C3 protein signal peptide
matp 94 2040 complement C3 beta chain
matp 2053 5025 complement C3 alpha chain
signal 5055 5060 polyA signal (put.)
BASE COUNT 1174 a 1537 c 1504 g 862 t
ORIGIN
1 cctgggtcgc tcggctctcc ctgcaccatg ggacccgccg caggtcccag cctgctgctg
61 ctgctgctgg ccagcgtctc cctggccctg ggggacccca tgtactcgat catcaccccc
121 aacatcctgc ggctggagaa cgaggagacc gtggtgctgg aggcccacga agtgcaaggg
181 gacatccctg tcacagtcac tgtccacgac tttccggcca agaagaatgt gctgtccagt
241 gagaagaccg tgctgaccag tgccacggga tatctaggca ctgtcaccat caagatcccc
301 gccagcaagg agttcaagtc cgataagggg cgcaagttgg tggtcgtgca ggccgccttt
361 gggggcacgc agctggagaa ggtggtgctg gtgagcctgc agagcgggta cctcttcatc
421 cagacggaca agaccatcta cacgccgggc tccaccgtgc tctaccggat cttcaccgtg
481 gacagcgacc tgctgcccgt gggccggacc atcattgtca ccatcgagac acccgacggc
541 atccccatca agcgggacac cctgtcttcc aacaaccagc atggcatctt gcccttgtcc
601 tggaacatcc ccgagttggt caacatgggg cagtggaaaa tccaggcctt ctacgagaac
661 tcgcccaagc aagtcttctc ggctgagttc gaggtgaaag aatacgtgct gcccagcttc
721 gaggtcctgg tggagcccac ggagaagttc tactacatcg acgatccgaa aggcctggag
781 gtcaacatca tagccaggtt cctctacggg aagaacgtgg acggaacggc ctttgtcatc
841 ttcggggtcc aggacggtga ccagaggatc tccctggccc agtccctcac ccgagtcgtg
901 atcgaggacg gcagcggaga ggtcgtgctg agcaggcagg tgctgctgga cggggtgcag
961 ccctccaggc ccgaggccct ggtggggaag tccctgtacg tgtctgtcac ggtcatcctg
1021 cactcaggca gcgacatggt ggaggcagag cgcagtggga tccccatcgt cacctccccc
1081 taccagatcc acttcaccaa gacgcccaag tacttcaagc cggctatgcc cttcgaaatc
1141 atggtgctgg tgacgaaccc cgacggctct ccggcccccc acgtccccgt ggtgacccag
1201 ggctccaacg tgcagtctct cacccaagct gacggcgtgg ccaggctgag catcaacacc
1261 cccaacaccc ggcagcctct gagcgtcacg gttcaaacaa agaaaggggg catcccggac
1321 gcccggcagg ccatcaacac catgcaggcc ttgccctaca ccaccatgta caactccaac
1381 aactatttac acctctccat gccgcgcacc gagctcaagc ctggcgagac catcaacgtc
1441 aacttccacc tgcgaagcga cccgaaccag gaagctaaga tccgctatta cacctacctg
1501 atcatgaaca aggggaagct gctgaaggtg ggtcggcagc ctcgagagcc gggccaggcc
1561 ctggtggtgc tgcccatgcc catcaccaag gagctgattc cctccttccg cctggtggcc
1621 tactacactc tcatcggtgc cagcgcgcag agggaggtgg tggccgactc tgtgtgggca
1681 gacgtcagag actcctgtgt gggcacgctg gtcgtcaaag gcggctccgg gaaagatggc
1741 caggacaagc ggcagcagca tctaccccgg cagcagatga ccctcaggat agagggcaac
1801 cagggggccc gcgtgggcct ggtggccgtg gacaagggtg tgtttgtgct gaacaagaaa
1861 cacaaactga ctcagagtaa gatctgggac gtggtggaga aagcggacat cggctgcacc
1921 ccgggcagcg ggaaggacta cgcgggcgtc ttcacggacg cggggctgtc cttcaagtcc
1981 agcaaggcgg ggctgcagac ggcccagcgg gaaggtctgg actgccccaa gcccgccgcg
2041 cgccgccgcc gctcggtgca gctcatggag aggaggatgg acaaagccgg gaagtacaag
2101 agcaaggagc tgcgcaggtg ctgcgaggac ggcatgcgcg aaaaccccat gcagttttcc
2161 tgccagcgcc gcgcgcgcta cgtgagcctg ggcgaggcgt gcgtcaaggc cttcctggac
2221 tgctgcacct acatggccca gctgcgccag cagcaccggc gcgagcagaa cctgggcctg
2281 gcccggagtg acatggatga agacattata ccagaagagg acatcatttc cagaagccag
2341 tttccagaga gctggctgtg gaccatagag gaactgaagg agccagagag aaacggcatc
2401 tccaccaaga ccatgaacat cttcctcaaa gacagcatca ctacttggga gattctggcc
2461 gtgagcttgt cagataagaa aggcatctgc gtggcggacc ccttcgaggt cacggtgatg
2521 caggatttct tcatcgacct gcggctaccc tactccgtgg tacgcaacga gcaggtggag
2581 atccgcgcgg tcctctacaa ctaccgcgag gcccagagtc tcaaggtgag ggtggagctg
2641 ctgcacaacc ccgccttctg cagcctggcc accgccaaga agcgccacac ccagacggtg
2701 accatcggac ccaagtcctc tgtggcggtg ccctacgtcc tggtgcccct gaagatcggc
2761 ctgcaggagg tagaggtcaa agctgccgtc tacaactact tcatcagcga cggcgtgaag
2821 aagaccctca aggtcgtgcc cgaagggatg agagtcaaca aaacggtggc cattcgcact
2881 ctgaacccag agcagctggg ccaaggcggc gttcagaggg aggaaatccc agctgcagac
2941 ttaagcgacc aagtcccaga caccgattct gagaccaaga ttctcctcca agggaccccg
3001 gtggcccaga tggccgagga cgctgtggac gcggagcggc tgaagcacct gatcatcacg
3061 ccctcgggct gcggagagca gaacatgatc ggcatgacgc ccaccgtgat cgcggtgcac
3121 tacctggacc agacggagca gtgggagaag ttcggcctgg agaagcggca ggaggcgctg
3181 aacctcatca acagggggta cacacaacag ctggctttca aacagcccaa ctgggcctac
3241 gcggctttca agaaccgggc gtccagcact tggctgaccg cctacgtggt gaaggtcttc
3301 tcgctggcgg ccaatctcat tggcatcgac tcggaggtcc tctgcggggc ggtgaaatgg
3361 ctgatcctgg agaagcagaa gcccgacgga gtcttccagg aggacgggcc cgtgatacac
3421 caagagatga ttggtggtgt ccggaccgcc caggaggctg atgtgtccct cacggccttc
3481 gtgctcattg cattgcaaga ggctaaagat atctgcagag cacaggtcaa caacctggaa
3541 gcgaacatca acaaagctgg cgattacatc gagtcaagat acgcagacgt gaggagaccg
3601 tacacgctgg ccatcgctgg ctacgccttg gccctgctgg agaggctgaa cggcgcaacc
3661 ctccaaaaat ttctaaacgc agctacagag aagaaccgct gggaggaggc ccgtcagaag
3721 ctgtacagcg tggaagccac gtcctacgcc ctcctggccc tgctcctgct gaaggacttc
3781 gacgcggtgc ctcccgtcgt gcgctggctc aacgaacaga gatactacgg caggggctat
3841 ggctccaccc aggccacctt catggtgttc caagccttgg cccagtacca gacagatgtc
3901 cccgatcaca aggacctgaa catggaggtg gccctccagc tgcccagccg cagctccccg
3961 tccaaatttc gcctcgtctg ggaagccggt agcctcctgc ggtcagaagc gaccaagcaa
4021 aacgagggct tcaaactgac agctaaaggg aagggccagg gcacgttgtc ggtggtggct
4081 gtgtactatg ccaagacgaa acgtaaagtt gtctgcaaga acttcgacct cagggtcact
4141 ctgaagccgg cccccgatac agtcaagaag cctcaggagg ccaagagcac catgatcctt
4201 ggcatctgta ccaggtacct gggagaccag gatgccacta tgtccatcct ggacatatcc
4261 atgatgaccg gtttcattcc tgacacagat gacctcaagc tgctggccac cggcgtggac
4321 agatacatct ctaagtacga gatgaacaaa gacttcagca agaacaccct catcatctac
4381 ctggacaagg tctcacactc cgaggaagaa tgtctgtcct tcaaaatcca ccagttcttc
4441 aatgtggggc tcatccagcc tggctccgtc aaggtctact cctactataa cctagatgaa
4501 acatgtaccc agttctacca cccggagaag gaggacggga tgctgaacaa actctgccac
4561 aaggacttgt gccgttgcgc agaggaaaac tgcttcatac agctgccgga gaagatcacc
4621 ctggatgagc ggctggagaa ggcctgcgag cccggggtgg actacgtgta caagaccaag
4681 ctgctcaaga tggagctgtc ggatgacttc gatgagtaca tcatgacgat cgagcaggtc
4741 atcaagtcag gctccgatga ggtgcaggcc gggaaggagc gcagattcat cagccacatc
4801 aagtgcagag acgccctgca cctgaaggag gggaagcact acctcatgtg gggcctctcc
4861 tcggacctgt ggggggagag acccaacatg agttacatca tcgggaagga cacctgggtg
4921 gaagcgtggc ccgaggcgga agaatgccaa gacgaggaga accagcagca atgccaggac
4981 ctgggcactt tcacagagaa catggtggtt tttggctgcc ccaactgacc acgcccgtca
5041 gtcatcctcc ctccaataaa gcttcgggtc tatggca
//
LOCUS HUMPP14B 8076 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human placental protein 14 (PP14) gene, complete cds.
ACCESSION M34046
KEYWORDS placental protein 14.
SOURCE Human (cell line GM1416) DNA, clone PP14G.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 8076)
AUTHORS Vaisse,C., Atger,M., Potier,B. and Milgrom,E.
TITLE Human placental protein 14 gene: Sequence and characterization of a
short duplication
JOURNAL DNA (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Vaisse, 04-MAY-1990.
FEATURES from to/span description
pept 3019 3114 placental protein 14 precursor (PP14), exon 1
3510 3649 placental protein 14 precursor (PP14), exon 2
4036 4109 placental protein 14 precursor (PP14), exon 3
5482 5592 placental protein 14 precursor (PP14), exon 4
6644 6748 placental protein 14 precursor (PP14), exon 5
7019 7035 placental protein 14 precursor (PP14), exon 6
sigp 3019 3072 placental protein 14 signal peptide
matp 3073 3114 placental protein 14, exon 1
3510 3649 placental protein 14, exon 2
4036 4109 placental protein 14, exon 3
5482 5592 placental protein 14, exon 4
6644 6748 placental protein 14, exon 5
7019 7032 placental protein 14, exon 6
pre-msg 2971 8010 PP14 mRNA and introns
IVS 3115 3509 PP14 intron A
IVS 3650 4035 PP14 intron B
IVS 4110 5481 PP14 intron C
IVS 5593 6643 PP14 intron D
IVS 6749 7018 PP14 intron E
IVS 7036 7786 PP14 intron F
signal 2891 2899 CAAT box
signal 2942 2948 TATA box
signal 7992 7997 polyA signal
BASE COUNT 1636 a 2226 c 2203 g 2011 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcttta cctagcccta cgtcctgaag attttctctc acgctttctt ctaaaagttg
61 tatagtttta tgttttactt ttaaactatg agttaatgca tatgtcaggt gtgagtttta
121 gatggaggtt cttcgtttgc atgggatcga ttgcacgtga tgaattgctc cagcaccatt
181 tgttgcaaag actgtccttc ttttagaggg actcccgctt gccaggcctc tggtttaatg
241 aaacatgacc agagtgactc catcttaacg tgaataacta gacactcaca aggcacctat
301 aaggttatat aacgaggcta tgctgctcga tactgactac gacaatttcc tgtttctctg
361 cctacaggac accaccaccc ccatccagag catgatgtgc cagtacctgg gtgggtctca
421 cagcacatga gctcaacgtg ggtgagaggc agcagctact tccatggctg ggaaccctgg
481 ggagctgaca actggcttcc tgtccacctc agtgcctgtg ggctggtttt ttctttcttg
541 gttttttttt tatttgtttg tttgctttgt tttttttgag acagggtctc attctgtcac
601 tcaggctgga gtgcagtggc gtgatctcgg ctcactacaa cctccacctc ccagattcaa
661 gcgattctcg tgcctcagct tcccgagtag ctgggattac aggcgcgagc ttatggtctg
721 aaaatacccg catactaagc tgaccaccaa ttataactgc agaacattta tgcccatacg
781 aggcatctcc caccaagcct ggagaatgta ccgatgacct gggagtgcag ggggttatct
841 ttgctcacaa ataacgtcaa cgagtaggct gaggctgaag ggcaaatggt cattgatcac
901 actaggagcc cctatcttta gcgagtgcat ctgcatgatc caggtttcac tgtagctcat
961 tacagcttct tacaaacaga ggcactcaca gaggacgggc gttcctcctg ctcgctgagg
1021 ttgcccggct ctggcacaga gtcatttccc ataaacttgc tttcactgtg ctctgtgagt
1081 caccttgaat tctttcccgt gtgagatcta agaacccatt cttggggtct ggactgggac
1141 cctcttttcc gacaacactt cccccacgca ctgttcttgc agctttgtta aaggtctgtt
1201 gttgttgtgt gtgagtccat gtttggactc tctatttccc attgatctat gtgtccattt
1261 ctctgccaat accacaatct cttggtgact attgataata gtgcttgtaa acttgggtag
1321 actaactctt tatactttat tcttcttcaa aatggcttta gcaattgtag ctcctttgcc
1381 ttgacatgta aattttagaa aaaaaaattt ctctatatat ctacatatgg tggggtgtgg
1441 tggctcacac ctataattcc agcactttgg gaggccgagg tgagcagatc acttgaggtc
1501 agcagttcga gaccagcttg gccaaaatag tgaaaacccc tctctactaa aaagacaaac
1561 attagccagg tgtggtggtg ggcgcctata atcccagcta ctctggaggc tgaagcagca
1621 ggattacttg aacctgggat gcagaggttg cagtgagtca agatcgtacc actgcactcc
1681 accctgggca acagagcaag acttggcctc aaaaaaatat atatatgtat atatgtgtgt
1741 gtgtgtgtgt atacacacac acacacacac atgtctatag ctacctacaa aatatcttgc
1801 tgagaatttg ataggaattg cattaaatct ctaggttgag ttgaggagag ctaacacctt
1861 tgctgtgttg aagcttccag cccatgaaca cattgtgcct gtcctttacc ttaggtcttt
1921 gatttccctt ccttgggttc agagtttaag tcctgtccat atttggtgag atttttgcct
1981 acatgctgtg tgtgtgatta caaatcctga atttccagtt tcgggttctg tctactcctt
2041 gacactgtgt gggaacacgg tggatttttg tgcgattgac cttgctgacc tcactcggtt
2101 ctaggagggg ttttgtagag ttcttgcaat tttgtacctg ggcaatgatg ttatatgcga
2161 atagagacag gtatacctcc ccctctccaa tctgcctgcc tttagttacc atttcttgcc
2221 tagctgctct ggccggaatg acctgctctg atggatgggg gagtgggagt ggacacccgt
2281 gtcttgcctc aggactcaag tgctctccag ggagtgagat gcgggctcga ttgtaagtta
2341 taaagtgatg cttccctgtg ggaaaaagta catccaatat agactgtagg acaaagtctg
2401 aaagtccact gcctcccatt tccacccagt cttgcctgtg agtcagagag aaacagtgct
2461 aacaggcagt gtgcacctgt ccagaggctg gctgtttgag ggacacaggt gtgcacacct
2521 ggggatgtct gagtggaagg tacaggtcag gattatgacc gtgcagtcag tcacccactg
2581 gcatgatgtc tgtggcatcc tggggcagcc atggggctct caggccctct gcctgcccca
2641 caggccatac ccctgccctg gacacagctg tcctcagtgc tggcctctga cccaacattg
2701 tccaggagcc ccaacccaga aggtgctccc gccgctgcca gcctggaccc gacccaggcc
2761 cctcccgcct gaggccctgc caagaactgc ccagcccgga cacagaggag gttccgcgtg
2821 gacgcaggga agagcctccc attgccccag tggaggaagc tgcccagggg ccaaggatga
2881 gtcacaggtt cgaggaatca catggcgagg ctgtgggcgg ggatcttgtc tgccctcctc
2941 ctacataagg ccccctgagc ccacactgcc tcagcatccc tctggctcca gagctcagag
3001 ccacccacag ccgcagccat gctgtgcctc ctgctcaccc tgggcgtggc cctggtctgt
3061 ggtgtcccgg ccatggacat cccccagacc aagcaggacc tggagctccc aaaggtttga
3121 ggctggggga gcgggcactt tactgtggga ggcctggggc gggtgggagc tgcgggcagg
3181 cgggaagcca ggatctcaga aacctacagg aagcacagaa tggacgccat gacgtcagga
3241 agccctcagc cctgctctcc atctttaggg tggcctctct ggtttcccag catcctaggt
3301 gactcattat ttggactttg gaacactcct gagttagcac acactggtca ttttaagtac
3361 aggaaatttc atagcccagg atctggtaga tagcagacaa ccatccaatg ctcactgtac
3421 ccatcccagt tagactcagc cccgtctgca ccgggtgcaa cgagagccat ggtggggtgg
3481 gaccgccgtg cagcccaagg ccccctcagt tggcagggac ctggcactcc atggccatgg
3541 cgaccaacaa catctccctc atggcgacac tgaaggcccc tctgagggtc cacatcacct
3601 cactgttgcc cacccccgag gacaacctgg agatcgttct gcacagatgg tgggtttctc
3661 atcattgaga cgggctgggc gggggctcag tctcccccct caggggtcca ggactgggtg
3721 ggttgggcgg agctggactt agccccaggc attttctgac agccaggggc ttcactgtgg
3781 cccttccatg agggtggggt ggaaaaccag ggctccagac gttccctgtc cccttggatc
3841 ccctgcccca ggctctgggc caacagccaa ccacacagtg cagccccagg tcagactgag
3901 gagaaggtct gggcggctgc gggctgcggt gctccttgga cccggggaag ttcccgtggt
3961 gacctgattt taggagtgac agtgaaggca actccaattc aagtggccac tcatcctatt
4021 gtcaccacct ttcagggaga acaacagctg tgttgagaag aaggtccttg gagagaagac
4081 tgagaatcca aagaagttca agatcaactg tgagtgtccc caggccccaa gggctggctc
4141 agtgctggca tgctagccac gctctcccag aggcggctct gctggggcat gagggagtgg
4201 ggcctggcct gtccccactc tctctgcttc agggagtcag agtgtttact ccggtcaacc
4261 tgatgctgac cccagaggca tcttttacct ggagggcagg ggaagcacta attcttggca
4321 tgacatgact ggatgtgggt ctgcactgtg cccaggccaa ggggacaggt gctttgttgc
4381 actgttcact ctggcctcac aaaaggccag ggaggctgca ggcgagcagg tgggcaggtg
4441 ggcaggtggg taggtgggta ggtggatatg tatacaggtg gcaggagggt aggtgaacag
4501 gtgggtaggt gggcaggtgg ctaggtgagt aagtggttag gtgaacaggt gggcaggtga
4561 gcaggtggtt aggtgaacag gtgggcaggt ggataggtga acaggtgggc aggtgggtag
4621 gtgggtaggt atacaggtgg acaggtgggt aggtggacag gtgggcaggt gagtaggcga
4681 acaggtgggt gggtgaacag gtggccaggt gaacaagttg gtaggtgggc aggtgggtag
4741 gtgggtaggt gaacaggtgg gcaggtgggc aggtgggcag gtgggcaagt ggctgctgtt
4801 cccgtgggcc tggctgcctc ctgcgcactc tggggctgca gctctggtct taggctgagc
4861 tcccaggcct ctctggggga agagagaggg gcttacagca tgtccttggt ccactgaatt
4921 cttcctaaca atttgcaaca ttttgttcta ttttgttaat tattattttt ttaaaaagac
4981 agaggtggtc agggtctggg gcctcttatc ccctcatggg cacattttcc cagcaaatac
5041 agtttgcttc tcatgcttgg gacttgcctc aggcctttct gaccctgctt gccctcccca
5101 gaatcgagcc actctccaag gtccatttct tctccctcct cccgcccctg tgccctgttc
5161 ctgtgccatc tcccgccatc ctcacccgta cgtgacttct cagttggagt ctctccaggt
5221 cacagcctcc ctgcctgccg tgtctgcctc tccacggcac acctggcctc tcgccctcag
5281 ccggggctcc atggccctcc acattgcctc tcctcccctt tcctccctgg cttccctgat
5341 catggtccac agcaggggcc acgtcccatg gtgtcagtga tgaggaagcc acttagtgtg
5401 gtgggatgtc cacacacctg cacaggactc tgctgagacg gaggcttcat cttccttttg
5461 gttcttctct tctttcccca gatacggtgg cgaacgaggc cacgctgctc gatactgact
5521 acgacaattt cctgtttctc tgcctacagg acaccaccac ccccatccag agcatgatgt
5581 gccagtacct gggtgggtct cacagcacat gagctcaacg tgggtgagag gcagcagcta
5641 cgtccatggc tgggaaccct ggggagctga caactggctt cctgtccacc tcggtgcctg
5701 tgggctgact ttttctttct tggttttttt tatttgtttg ttgtttgctt tgtttttttg
5761 agaaagggtc tcattctgtc actcaggctg gagtgtagtg acgtgatctc ggctcactgc
5821 aacttctgcc tcccagattc aagcgattct cgtgcctcag cctccagagt agctgggatt
5881 acaggcgcac gccaccatgc ccagctaatt tttgtatttt ttggtagaga cggggtttca
5941 ccatgttggc caggctggtc tccaactcct ggatcaagtg atccacccgc cttggcctcc
6001 caaaggctgg gattacaggt catccaccac gcccggccag gctgagtttt tctccagcgg
6061 ttcatcgagt cctctgacaa agcaaggagc tgatataggg ccagtgggac ggtcgccagt
6121 caaggggctg ggcttggtgg atagattaat actcactggg cgtccagtca aaacgccctg
6181 aaacctatga tgctgtcaac caaacgaagg ccaggaatac caaaatagcc acataggcac
6241 agcccttccc catgtttctg agcacagtgt ttcctctggg gtcacacagg tgtcttcttg
6301 atcagcctca gccatgcttg gtgagagccg ggcactggga gagccaggca ctgtgctctc
6361 ctgtgacgct gtagacacca tcctaagctg tgcagacccc agcgctgccc agagcggagc
6421 agagggggcc gggcaaggag tgggagctgg ggtcagggaa cctggaggtg cagtggacag
6481 agccccggag accgccctag ggacctactc cagaccaaac tctgccagac ctcggagcac
6541 tggggcctcc ttctctgccc tccctcctca ggcaaggcct ctggagctcc ccagctctca
6601 tggaagcccc aggggcccag gactgaccca gcctcttcca cagccagagt cctggtggag
6661 gacgatgaga tcatgcaggg attcatcagg gctttcaggc ccctgcccag gcacctatgg
6721 tacttgctgg acttgaaaca gatggaaggt gagctctgcc taggacacgc ccagcctcag
6781 ctggaggaga agctgcctct ttcttagccc gagccccctg ctggctctgc aggactcagg
6841 tcactccttt ttggcccctc ccctgttctc ccctggcctt ctggggtgca gagccaccct
6901 gaggtggggt cctgccctct cccaccatcc tttcatccct tctctagccc tggggctgct
6961 gtgtccccag ctgtctcttc tctcgctgac acctccactg tcccatctcc tcccacagag
7021 ccgtgccgtt tctaggtgag ctcctgcctg gtcctgcctc ctgggtaatg tatcagcctc
7081 gcccactgtc tgcggctgcc tctctgggcc cctgggacag accctactgt gtccagttca
7141 gggctgaccc tacaggaatg aactggggtc tggtcttgtg attccagaaa gccaggctgc
7201 tgacgtcccc attcacgagc ccagcctgtg tcttgcagcc attgtattag tcacgggctt
7261 gtgccctata gtcagacctc atgctttctt ttggggttag gggtgttggt tggaaatggt
7321 gggggctata ggaggaggaa ggaggatggt tacatggaag ggcatgagaa gctggggacc
7381 tgcaggtctc ggtcccacgt tctttttttt ttttcttttt ttaagatgga gtctcgctct
7441 gtcaccaggc tggagtgcag tggcacaatc tcagctcact gcaacctcga cctcctgggt
7501 tcaagcgatt ctcctgcctc aaccccccga gttgctggaa ccacaggcgt gtgccaccat
7561 gcccagctaa tttttgtatt tttaatagaa acggggtttc accatgttgg ccacgatggt
7621 ctcaatctct tgacctcatg atccccccgc tttagcctcc caaagtgctg ggatttcagt
7681 gccacattct taagggggtg tgctcaagcc caccacatcc ttccagggct cccccgaaac
7741 accctgctct tcctccctct acttaagtga cctgtaaacc caacagctca cctccgcctc
7801 caggaagacc agactcccac ccttccacac ctccagagca gtgggacttc ctcctgccct
7861 ttcaaagaat aaccacagct cagaagacga tgacgtggtc atctgtgtcg ccatcccctt
7921 cctgctgcac acctgcacca cggccatggg gaggctgctc cctgggggca gagtctctgg
7981 cagaggttat taataaaccc ttggagcatg tcctgtctgg atgcgcagcc actgctgggt
8041 gtgggattca gggacgaggg cctggggtcg gggcag
//
LOCUS PHAMP1A 1313 bp ss-mRNA PLN 28-JUN-1990
DEFINITION P.chrysosporium Mn-dependent peroxidase (MP-1) mRNA, complete
cds.
ACCESSION J04980
KEYWORDS Mn-dependent peroxidase.
SOURCE P.chrysosporium (strain BKM-F-1767), cDNA to mRNA, clone lambda-
MP-1.
ORGANISM Phanerochaete chrysosporium
Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes;
Agaricales; Corticiaceae.
REFERENCE 1 (bases 1 to 1313)
AUTHORS Pease,E.A., Andrawis,A. and Tien,M.
TITLE Manganese-dependent peroxidase form Phanerochaete chrysosporium:
Primary structue deduced from cDNA sequence
JOURNAL J. Biol. Chem. 264, 13531-13535 (1989)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
E.A.Pease, 21-JUN-1989, for release after publication.
FEATURES from to/span description
pept 36 1184 Mn-dependent peroxidase precursor (MP-1)
sigp 36 72 Mn-dependent perocidase signal peptide
matp 73 1181 Mn-dependent perocidase
BASE COUNT 251 a 462 c 334 g 266 t
ORIGIN
1 tcagctctca aggacatccg cactcgaata tcgcaatggc cttcggttct ctcctcgcct
61 tcgtggctct cgccgccata actcgcgccg ccccgactgc ggagtctgca gtctgtccag
121 acggtacccg cgtcaccaac gcggcgtgct gcgctttcat tccgctcgca caggatttgc
181 aagagactct gttccagggt gactgtggcg aagatgccca cgaagtcatc cgtctgacct
241 tccacgacgc tattgcaatc tcccagagcc taggtcctca ggctggcggc ggtgctgacg
301 gctccatgct gcacttcccg acaatcgagc ccaacttctc cgccaacagc ggcatcgatg
361 actccgtcaa caacttgctt cccttcatgc agaaacacga caccatcagt gccgccgatc
421 ttgtacagtt cgccggtgcg gtcgcgctga gcaactgccc aggtgctcct cgcctcgagt
481 tcatggctgg acgtccgaac actaccatcc ccgcagttga gggcctcatt cctgagcctc
541 aagacagcgt caccaaaatc ctgcagcgct tcgaggacgc cggcaacttc tcgccgttcg
601 aggtcgtctc gctcctggct tcacacaccg ttgctcgtgc ggacaaggtc gacgagacca
661 tcgatgctgc gcccttcgac tcgacaccct tcaccttcga cacccaggtg ttcctcgagg
721 tcctgctcaa gggcacaggc ttcccgggct cgaacaacaa caccggcgag gtgatgtcgc
781 cgctcccact cggcagcggc agcgacacgg gcgagatgcg cctgcagtcc gactttgcgc
841 tcgcgcgcga cgagcgcacg gcgtgcttct ggcagtcgtt cgtcaacgag caggagttca
901 tggcggcgag cttcaaggcc gcgatggcga agcttgcgat cctcggccac agccgcagca
961 gcctcattga ctgcagcgac gtcgtccccg tcccgaagcc cgccgtcaac aagcccgcga
1021 cgttccccgc gacgaagggc cccaaggacc tcgacacgct cacgtgcaag gccctcaagt
1081 tcccgacgct gacctctgac cccggtgcta ccgagaccct catcccccac tgctccaacg
1141 gcggcatgtc ctgccctggt gttcagttcg atggccctgc ctaaaccact cacctccggc
1201 aatgcacctt tagtagatgt cgattctatt tggttgatta gaagtactgg gttctcgtgt
1261 catataccac caagttttca gaatgcaagt ctcctttctt tccaaaaaaa aaa
//
LOCUS PEAHSP177A 772 bp ss-mRNA PLN 28-JUN-1990
DEFINITION Pisum sativum 17.7 kDa heat shock protein (hsp17.7) mRNA,
complete cds.
ACCESSION M33901
KEYWORDS heat shock protein.
SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 772)
AUTHORS Vierling,E. and Helm,K.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Vierling, 01-MAY-1990.
University of Arizona
Department of Biochemistry Biological Sciences West Building
Tucson, AZ 85721
FEATURES from to/span description
pept 65 523 17.7 kDa heat shock protein (hsp17.7)
BASE COUNT 254 a 127 c 170 g 221 t
ORIGIN
1 caaaaatcaa aacgtgcgac aaacacaaaa tcatcccaca aagaaagcaa tggatttcag
61 gctaatggat ttggattctc cactcttcaa cactctccat catataatgg acctcaccga
121 cgacacaacc gagaagaact taaacgctcc aactcgaaca tatgtccgtg acgcaaaggc
181 aatggctgca actccagcgg acgtgaaaga gcatccaaat tcatacgtgt ttatggtgga
241 catgcctggg gtgaaatctg gtgacataaa ggttcaggtg gaagatgaga atgtgctatt
301 gataagtggc gagaggaaga gagaagaaga gaaagaaggt gttaaatatt tgaagatgga
361 aagaaggatt ggtaagttga tgaggaaatt tgtgttacct gagaatgcga atattgaagc
421 tatctctgct atttctcaag atggtgttct tacggttaca gttaataaat tgcctccacc
481 tgaacctaag aaaccaaaaa ctattcaagt taaggttgct tgatcggtgt acgatttcat
541 gtcaacaaat cagaaggaat gtttgtcttt ttagttggtt tgtgtagcaa tggttttgtg
601 tgttttcgcc tagttggccc tatatatgat gatcatcatg cgatgtaatt tgtaacaata
661 tgacatgaat gaattttaat tacttggttt ttctgcttgt aacattgttg cgttgccccc
721 atgataaaat tgagaaactg aagtattaaa gaaaagaaaa tgtttcattt ac
//
LOCUS PEAHSP179A 700 bp ss-mRNA PLN 28-JUN-1990
DEFINITION Pisum sativum 17.9 kDa heat shock protein (hsp17.9) mRNA,
complete cds.
ACCESSION M33900
KEYWORDS heat shock protein.
SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 700)
AUTHORS Vierling,E. and Helm,K.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Vierling, 01-MAY-1990.
University of Arizona
Department of Biochemistry Biological Sciences West Building
Tucson, AZ 85721
FEATURES from to/span description
pept < 1 469 17.9 kDa heat shock protein (hsp17.9)
BASE COUNT 209 a 123 c 170 g 198 t
ORIGIN
1 gataattcca agagtcttcg gtactggacg aagaaccaat gcattcgatc cattctcatt
61 agatttatgg gacccattcc agaacttcca actcgcaaga tccgccaccg gaaccaccaa
121 cgagacggca gcttttgcca acgctcacat tgactggaag gaaacaccgg aggctcacgt
181 gttcaaggct gatcttcccg gagtgaagaa ggaagaagtg aaagttgaaa tagaagaaga
241 tcgtgtgctc aagataagcg gagagaggaa aactgaaaag gaagacaaga acgacacctg
301 gcaccgtgtt gagcgtagtc aggggagttt cctccgccgt ttcaggttgc cggaaaatgc
361 taaagttgat caggtgaagg ctgctatgga aaacggtgtt cttaccgtta ctgttcctaa
421 agaggaggtt aagaagcctg aagctaagcc cattcagatt acaggatgag ctcttattct
481 tcctatattt tgatgtttgt gtctcttaat aaaatgttaa aataaaacaa ataataattg
541 tgtgtagtcg agttccagct ttaagagatt gagacatgta tggacttggc tattacttaa
601 gtgtagtagt ttgtgagtat tttgttgggt tatgttagtg tgtatgcaaa taactttttt
661 gagtatgtga aagtttcttt tgattaagct gtatttatcc
//
LOCUS PEAHSP181A 862 bp ss-mRNA PLN 28-JUN-1990
DEFINITION Pisum sativum 18.1 kDa heat shock protein (hsp18.1) mRNA,
complete cds.
ACCESSION M33899
KEYWORDS heat shock protein.
SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 862)
AUTHORS Vierling,E. and DeRocher,A.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Vierling, 01-MAY-1990.
University of Arizona
Department of Biochemistry Biological Sciences West Building
Tucson, AZ 85721
FEATURES from to/span description
pept 12 488 18.1 kDa heat shock protein (hsp18.1)
BASE COUNT 278 a 128 c 207 g 249 t
ORIGIN
1 ctatatcaaa catgtctctg attccaagtt tctttagtgg ccgaaggagc aatgttttcg
61 atcctttctc cctggacgtc tgggatcctt tgaaggactt tccattttca aattcttcac
121 cttccgcttc attccctcgt gagaatcctg cttttgtgag cacacgagtt gactggaagg
181 aaacaccgga agcgcatgtt ttcaaggctg atcttcctgg gctgaaaaag gaggaagtga
241 aagttgaagt tgaagatgat agggttctac agataagcgg agagagaagc gttgagaaag
301 aagataagaa tgatgaatgg catcgcgtgg aacgtagcag tggaaagttc ttaagaaggt
361 tcagattgcc tgagaatgct aaaatggata aagtgaaagc ttccatggag aacggcgttc
421 tgacagtgac cgttccaaaa gaagagataa agaaggctga ggttaagtct attgagattt
481 ctggttaaac ttagaatgag ctatgttact ctgttgcttt tcttggttat aatgttttcc
541 tttttgtggc gtgtgcaaga aataaatggt catgtaattc tgaaatgtta atgtataaat
601 aaataagtaa acagttgttg ttggttattc agaggtgtta tagtattcat attgtaatgt
661 atcagaatga atcttgagaa aagagctgct ataaatagag cttgaagttt taaataaaaa
721 aaaaggttcc agaaaggaat aaaaaactgg taacagctag cagagagaaa aagctcaaac
781 cactgtgtta aggtgaacag cggaagaaaa tgaagagatg ttcatagccc ttcttcttga
841 gtctctccaa gatggagaat tc
//
LOCUS PEAHSP227A 795 bp ss-mRNA PLN 28-JUN-1990
DEFINITION Pisum sativum 22.7 kDa heat shock protein (hsp22.7) mRNA,
complete cds.
ACCESSION M33898
KEYWORDS heat shock protein.
SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 795)
AUTHORS Vierling,E. and Helm,K.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Vierling, 01-MAY-1990.
University of Arizona
Department of Biochemistry Biological Sciences West Building
Tucson, AZ 85721
FEATURES from to/span description
pept 44 637 22.7 kDa heat shock protein (hsp22.7)
BASE COUNT 261 a 122 c 170 g 242 t
ORIGIN
1 ccaagttcca aacctcaaga acaaaaaaca cacatttcta agtatgagtc tgaaacctct
61 aaacatgtta ctcgttccat ttcttctgct tattctcgcg gctgattttc ctttgaaagc
121 aaaagcatca ctactaccat tcatagattc tcccaacact ctcttatcgg atctctggtc
181 tgatcgtttc ccagatccgt ttcgcgtctt agaacaaatt ccctatggag ttgagaaaca
241 cgaaccatcc ataacattgt cacatgctag agtagactgg aaggaaactc cagagggaca
301 tgtgataatg gtggacgtgc ctgggttgaa aaaagatgat ataaagatag aagtggaaga
361 gaatagggtg ctaagagtga gtggtgagag gaagaaagaa gaagataaaa aaggagatca
421 ttggcacaga gttgaaagat cttatggaaa gttctggagg cagtttaaat tacctcaaaa
481 tgttgatttg gattctgtca aagctaaaat ggaaaacggt gttcttactt taactcttca
541 taagttgtcg catgataaga ttaaaggtcc tagaatggtt agtattgtgg aagaggatga
601 caaaccatct aagatcgtca atgatgagtt gaaataatta tgtgatttgt actcataaaa
661 atgaaaaatg ttttttcatt gtgttatttg tgaataaagg aatgttacct atgatattgg
721 ttgtttgttg tatgtcaact aaagagtgct gtaaaggctt gttaatttca tagtgaataa
781 cttgttggct tttgt
//
LOCUS LACNISC 1222 bp ds-DNA BCT 28-JUN-1990
DEFINITION L.lactis insertion sequence IS981.
ACCESSION M33933
KEYWORDS insertion sequence.
SOURCE L.lactis (strain LM0230) DNA, clone pKM40.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 1222)
AUTHORS Polzin,K.M. and McKay,L.L.
TITLE Identification, DNA sequence and distribution of IS981, a new, high
copy number insertion sequence in the lactococci related to the
IS2/IS3 family of IS-elements
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by K.M.Polzin 27-APR-1990.
University of Minnesota
Dept. of Food Science
1334 Eckles Avenue, St. Paul, MN 55108
e-mail: kayla@molbio.cbs.umn.edu
FEATURES from to/span description
pept 58 318 ORF1
pept 330 1169 ORF2
binding 44 50 ribosomal binding site (put.)
rpt 1 40 5' inverted repeat
rpt 1183 1222 3' inverted repeat
signal 33 38 -10 region
signal 269 274 -35 region
signal 294 299 -10 region
BASE COUNT 442 a 233 c 206 g 341 t
ORIGIN 132 bp upstream of PvuII site.
1 gatgttatcc ttaaatctta gagtcactat tgtataattt agacaaagga caaaaacatg
61 caaaaacgct actcaaaaga atttaaagaa acccttatcg ccttctatca ttctggtcaa
121 tccgtcaccc agctgtctaa agaatatgac gtggcccctg caacaattta taaatggata
181 gacctctact ctaaatctaa tgaaagctcc gtctctaaag ctgattttct agaattaaaa
241 agacaactgg ctaaagttaa ggaagaacga gacatcttaa aaaaagtatt gaccatattc
301 gccgagaaaa agaagtgagt gctgcggata tggctcaaac catacaaact ttagcactca
361 atgtcagact aagctgtcaa ctccttgatg ttcctgaatc aagttattat gaacggatta
421 accgacaccc atctaaaact caattaagga gacaatacct gtcactcaaa atttctcaac
481 tcttcaatgc taaccgagga atctatggtg ctcctaaaat tcatcatctt ctacttaaac
541 aaggggaaaa agtcgggtta aaactggtac agaagctaat gaagcaactt caactcaagt
601 ctgtagtcat taagaaattt aagcctggat actcactaag tgatcacatc aatcgaaaaa
661 atctcataca gactgaacct acaaagaaaa ataaggtttg gtcaaccgac attacttata
721 ttcctactca acaaggatgg gcttatctct caaccattat ggatcgttat actaaaaaag
781 tcattgcttg ggatttgggc aagcgaatga ctgtagaatt agtgcaaaga actttaaata
841 aggccattaa atcacaagac tatccagaag ctgttattct tcattctgac caaggaagcc
901 agtatacgag tctagagtat gaagagttgc ttaagtatta tgggatgact cactctttca
961 gtcgaagggg atacccttat cataatgcca gtcttgaatc ttggcatgga catttaaaaa
1021 gagagtgggt gtaccaattt aaatataaga actttgaaga agcctatcag agtattttct
1081 ggtacatcga agccttttat aattcaaaac gaatccatca aagtttaggg tatcttacac
1141 ctaatcaatt tgaaaaggta agtgcttaaa ataaatagat taaaattcta cgtttgttac
1201 tctaaaaact tgacttaacg tc
//
LOCUS DRONCDA 2294 bp ss-mRNA INV 28-JUN-1990
DEFINITION D.melanogaster non-claret disjunctional protein (ncd) mRNA,
complete cds.
ACCESSION M33932
KEYWORDS non-claret disjunctional protein.
SOURCE D.melanogaster (strain dp cl cn bw) 0-4 hr embryo, cDNA to mRNA,
clone pNB40.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2294)
AUTHORS Mcdonald,H.B. and Goldstein,L.S.B.
TITLE Identification and characterization of a gene encoding a
kinesin-like protein in Drosophila
JOURNAL Cell (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.B.Mcdonald, 30-APR-1990.
FEATURES from to/span description
pept 111 2168 non-claret disjunctional protein
site 180 319 alpha helical domain
BASE COUNT 622 a 661 c 603 g 408 t
ORIGIN 1 bp upstream of EcoRI site; chromosome 3 map position 99BC.
1 gaattgataa aatcggttgc aaggaggcag acgtatcttc taagttaggc acaacacagt
61 tggcgatgga atcccggcta ccgaaaccgt cgggcctgaa gaaaccccaa atgccgatta
121 aaaccgtgct gcccacagat cgaattcgcg caggattggg aggtggagcc gctggagcag
181 gcgccttcaa tgtcaatgcc aaccagacat actgcggcaa cttattgccg cccctctcaa
241 gggacctcaa caatctgccc caggtgctgg agcgtcgcgg aggaggagca cgtgccgcct
301 ccccagagcc catgaagttg ggccaccggg ccaagctgag acgtagccgt agcgcttgcg
361 acatcaacga actgcgtggt aacaagcgca ctgcggctgc tccttcattg cccagcattc
421 ccagcaaagt atcccgcctg ggcggtgcac tcactgtttc cagccagcga ctagtgcgtc
481 ctgcggcgcc ttcgtcaata acagcaacag ctgtcaaaag accaccagta acgcgtcctg
541 ctccacgggc tgcaggagga gcagccgcca agaaaccagc aggaacagga gcagcagctt
601 cgtcaggagc cgcggctgct gctcccaagc gcatcgctcc ctacgacttc aaggcccgct
661 tccacgatct gctagagaag cacaaggtgc ttaagacaaa gtacgaaaag caaacagagg
721 acatgggcga gctggagtcc atgcctcagc aactggagga gacgcagaac aagcttatcg
781 agacggagtc ctcgctgaag aacacccaga gcgacaacga gtgtcttcag aggcaggtga
841 agcagcatac cgccaaaatt gaaacaatca catcgacgct gggcaggacc aaagaggagc
901 tatccgagct gcaagcaata catgagaaag taaaaacgga gcatgctgct ctaagcacag
961 aagtggtgca tctgcgccag cgcaccgagg aactcctgcg ctgcaatgag cagcaggccg
1021 ccgagctgga gacctgcaaa gagcagctct tccagtcgaa catggagcgc aaagagctgc
1081 acaacacggt catggacctg cgcggcaaca tccgggtctt ctgtcgaata cgaccgccgc
1141 tggagtccga ggagaaccgt atgtgttgca cctggaccta tcacgacgag tccaccgtgg
1201 agctgcagag cattgacgca caggccaaaa gcaagatggg ccagcagatc ttctcattcg
1261 accaggtctt ccacccgctc tcctcgcagt cggacatctt cgagatggtc tcgccgctca
1321 tccagtcggc cctggatggc tacaatatct gcatctttgc ctacggacag acgggcagtg
1381 gcaagaccta cacaatggac ggagtgccgg agagtgtggg cgtcataccg cgcacggtgg
1441 atctgctctt cgactccatc cggggatatc gcaacttggg ctgggagtac gagatcaagg
1501 ccacctttct ggagatctac aacgaggtgc tctacgatct gctgagcaac gagcagaagg
1561 acatggagat tcgaatggcc aagaacaaca agaacgacat ctacgtgtcc aacataacgg
1621 aggagacggt tctggatcca aatcacctgc gccacctcat gcacacggcc aagatgaacc
1681 gtgccaccgc ctcgacagct ggcaacgagc gctcctctcg ttcccacgcg gttaccaagc
1741 ttgagctcat cggacgccat gccgaaaagc aagagatctc cgtgggttcc ataaacctgg
1801 tggatttggc cggctctgag tctcccaaga cgagcacccg gatgaccgag acaaagaaca
1861 tcaatcgctc gctatcggag ctcaccaacg taatcctggc gctgctgcag aagcaggacc
1921 acatcccgta caggaactcc aagctgacgc acctgctgat gccctcgctg ggcggcaact
1981 cgaaaacgct tatgttcatc aacgtctcgc cgttccaaga ctgtttccaa gagtccgtca
2041 agtcgctgcg cttcgcggcc tccgtaaact cctgcaaaat gaccaaggcc aagcggaatc
2101 gctacctgaa caactcggtg gccaacagca gcacacagag caacaacagc ggcagtttcg
2161 ataaataaag aatgcattct gagcccagtt ttaacaattt tcaaatttct aacctgttat
2221 tgcttaattt atgtgtgttt acttttagtg caaataaact aataaagtgc tggaaaaaaa
2281 aaaaaaaaaa aaaa
//
LOCUS ECOPHOAJ 3133 bp ds-DNA BCT 28-JUN-1990
DEFINITION E.fergusonii alkaline phosphatase (phoA) gene, complete cds.
ACCESSION M33966
KEYWORDS alkaline phosphatase.
SOURCE E.fergusonii (ATCC 35469) DNA.
ORGANISM Escherichia fergusonii
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 3133)
AUTHORS DuBose,R.F. and Hartl,D.L.
TITLE The molecular evolution of bacterial alkaline phosphatase:
Correlating variation among enteric bacteria with experimental
manipulations of the protein
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.F.DuBose, 01-MAY-1990.
FEATURES from to/span description
pept 544 1962 alkaline phosphatase (phoA) (EC 3.1.3.1)
site 466 486 pho box regulatory sequence
BASE COUNT 874 a 711 c 774 g 774 t
ORIGIN
1 aactattaat taagcaattt tttaattcgt ggaattataa tgcgcgagaa acgtttcgcg
61 ctaatttgtg ttaagaatat ttgccaaaag tattcgttat gtcataaaaa tactactttc
121 agactctagg aagttttgag tctaaatgtt aacgtcacca tactttggat aaggaaatgc
181 agttatgaaa aatcttattt ctgagttgtt gattaaactt gcacaaaaag aagaagagtc
241 aaaggagttg gttgcccagg tagaagccct ggagattatc gtcacggcta tgttgcgtaa
301 tatggggcaa cctgaacaaa aacgcgtaat cgaccagata gaaggggctt tggccggggt
361 aaagcctgat gccagcgttc ctgacggcga tacggaaatg ctgcgcgcat atgtaaagaa
421 gctcttaagg catccccggc aataagacac agaaccgaca cgacactgtc ataaagttgt
481 catcctcgct acctatagtc gctttgtttt tattttactg tatttataca tggagataat
541 aaagtgaaac aaagcgctat tgcattggcc ctgttatctt gtctgattac cccggtaagc
601 caggctcaaa cttctcaaaa tatcaatatc ctggaaaatc gtgccgctca gggtgatatt
661 acaatgccag gtggcgctcg ccgtttaagt ggcgatcaaa ctgaggcatt acgtgcatct
721 cttaatgata agccagctaa aaatattatt ttattaattg gcgatggcat gggggattct
781 gaaattactg ctgcacgaaa ttatgcagaa ggtgcaggtg gttattttaa agggatcgat
841 gcgttgccat taaccgggca gtacactcac tatgcattgg acaaaaaaac cggcaaacct
901 gattacgtga cagattctgc tgcatccgca accgcatgga cgacaggcgt caaaacatac
961 aatggtgcgc tgggtgtcga tatccatgaa aatcctcata ccactattct tgagatggca
1021 aaagcagccg gtctggcaac aggcaacgtg tctacggctg agttgcagga tgcgacacct
1081 gctgcgttgg tttctcatgt cacctcacgc aaatgctatg gcccttctgt taccagtgaa
1141 aaatgcccgg gcaatgcact ggaaaagggg gggaaaggtt caatcactga acaactgttg
1201 aatgcccgtg cggatgtgac gttgggtggt ggtgcaaaaa cgtttgctga aacggcaacc
1261 gctggagagt ggcagggtaa aacattgcgt gagcaagctc tggctcgcgg ttatcagatt
1321 gtcagcgatg ccgcatccct cgcagcagtg acgcaagcag ggcaggacaa acctctgctc
1381 gggttatttg ctgaaggaaa tatgcctgta cgttggcatg ggccaaaggc gagctatcac
1441 gggaatctcg ataagcccgc agtaacctgc acgccaaatc ctcagcgtaa cgaaactgta
1501 ccgacactgg cacagatgac agataaagct attgaattac ttagcaaaaa tgagagaggt
1561 ttcttcctgc aagtagaagg ggcatctata gataagcaag atcacgcggc taatccatgt
1621 ggacagattg gtgagacggt tgatctcgac gaagctgttc aacgtgctct ggaatttgcc
1681 aaaaaagatg gcaatacact ggtgatcgta actgcagatc atgcgcattc cagccagatt
1741 gtggcaccag ataccaaagc gccaggttta actcaggcgc taaatacgaa agatggtgcg
1801 gtgatggcga taagctacgg taactctgaa gaagactctc aggagcatac cggtagccag
1861 ctgcgcatcg cagcgtatgg gccgaatgcc gctaatgtgg tcggactgac cgatcaaacg
1921 gatcttttct acaccatgaa agccgctctg ggcttgcaat aatccatacc gacggccatt
1981 aattgcgccg tcgggtggtt ttttcaccat cagcagccag acttataaac actatccttc
2041 gggcgtattt tccggcccga aggaaactgg tgaagttgca acctcacaaa aggatggtgc
2101 gatgaaaatg acattactgg ttaccctgct atttgggctg gtttttttaa ccactgtcgg
2161 tgctgccgag agaacattaa cgccgcaaca acagcgtatg accacctgta atcagcaagc
2221 tactgcacag gcgttaaaag gtgatgcccg aaaaacgtat atgagcgatt gcctgaaaaa
2281 cagcaaatct cccccgaatg aaaaaagcct gacaccacaa caacaaaaga tgcgtgaatg
2341 taatacccag gcgacggaac aatcactgaa aggtgacgac cgtaataaat ttatgagtgc
2401 gtgtctgaag aaagctgcat agatccccct tagtgcttac gggtgagaca cgcaaacgcc
2461 tcacccgaaa tatcatactt ctccctcacc ccccctccta taatttggga aatttgtttc
2521 taaatattcc caaaaaaata atgaatgatg aaaacttcct taaaaaagcg gtcgcaccgg
2581 gtgaagaacc accttcttca cctcaaaacg aacatcaacg ttccgggtta cgctttgccc
2641 gtcgtgtgag gctaccaaga gttgtcggtt ttggcggtat gttctcccca tcgcctcggc
2701 gctggtggag caaccgatcg ctggatggtg gtggttgatc cttgttggct ggtcattcgt
2761 ttggccacat ctggcctggc aaatgtccag tcgtgcaatg gaccccttaa accgtgaaat
2821 ttacaaccta aaaacggatg cgattactgc cgggatgtgg gttggcataa tgggtgttaa
2881 cgcactacct tcaacgacga tgctgatgat tatgtgcctt aacctgatgg gcgccggagg
2941 acttaggttg ttcgtcgccg ggctggtatt gatggtagtc gcatgtctgg tgacgctaca
3001 gttgaccggc atccgcgtcg cgtttaccag ttcgcccctt gagtggtggc tttcactccc
3061 tgtcattgtg ttgtatccct taattttcgc ctgggtcagc tatcagacag caactaaatt
3121 ggcagagcat aaa
//
LOCUS SMAPHOAA 2672 bp ds-DNA BCT 28-JUN-1990
DEFINITION S.marcescens alkaline phosphatase (phoA) gene, complete cds.
ACCESSION M33965
KEYWORDS alkaline phosphatase.
SOURCE S.marcescens (ATCC 13880) DNA.
ORGANISM Serratia marcescens
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2672)
AUTHORS DuBose,R.F. and Hartl,D.L.
TITLE The molecular evolution of bacterial alkaline phosphatase:
Correlating variation among enteric bacteria with experimental
manipulations of the protein
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.F.DuBose, 01-MAY-1990.
FEATURES from to/span description
pept 667 2094 alkaline phosphatase (phoA) (EC 3.1.3.1)
site 575 595 pho box regulatory sequence
BASE COUNT 588 a 845 c 749 g 490 t
ORIGIN
1 ggccagtgaa ttcctggcgc tgaacccgaa cggcaagatc ccggcgatca tcgatcccga
61 tggccccggc ggccggccgc tgccgctgtt cgaatccggc gccattctgc tctacctggc
121 ggaaaagagc ggccgtttcc tgccgcagga cccggcgcag cgttacgaaa ccattcaatg
181 ggtgttcttc cagatggccg ccgtcggccc gatgttcggc cagctcggtt tcttccataa
241 attcgccggg cgcgaatatg aagacaagcg gccgctggag cgctacaaaa acgaatcgaa
301 acgcctgctc ggcgtgctgg aaacgcgact tgaaggccgc gactggatca tgggcgcgga
361 gtacaccatc gccgatatct cgctgctggg ttgggtgcgc aacctgatcg gcttctatga
421 ggcgcgcgaa ctggtggagt tcgacagttt ccctcgcgtc gcgcagtggc tggagcgcgg
481 tctggcgcgc ccggcggtgc agcgtgggct cgctatcccg gcgcgcgcgg cctgagcgtc
541 aaacgggatg gccgccgggc cgtcccgcct tccgctgtca taaaattgtc atccctctgc
601 tctactctcc tttcgtaaca atttgatgtt atttacattg ttgtattacg acaaggagct
661 taccccatgc aacctgctgt atccctgatc gccggcgccg tactgtccgc cctgctctgc
721 tcctccgcca tcgcggcgga aacctcggcc aatgccgatg gcctgaccga tcgcgccgcg
781 cgcggcaacc tcgtcgaacc gggcggcgcc cgccgtctgg ccggcgatca gaccaccgcg
841 ctgaaggcat cgctgtcgga caaaacggcg aaaaacgtga tcctgctgat cggcgacggc
901 atgggggatt ccgagatcac cgccgcgcgc aactacgccg aaggcgcagg cggttacttc
961 aagggcatcg acgccctgcc gctgaccgga caatacaccc attactcgct ggacaggaaa
1021 acccacaagc cggactacgt gaccgactcc gccgcctccg ccaccgcctg ggccaccggg
1081 gtgaaaacct ataacggcgc gctcggcgtg gacgtgaacg gcaaggatca accgacgctg
1141 ctggagatcg ccaaggcggc cggcaaggcc accggcaacg tctccaccgc cgagttgcaa
1201 gacgccacgc cggcggcgct ggtctcgcac gtcatctcac gcaagtgcta tggcccggag
1261 gaaaccagcg agaaatgcgc cgctaacgcg ctggaaaatg gcggccgtgg ctccatcacc
1321 gaacaactgc tgaaaacccg cgccgacgtc acgctgggcg gcggcgcgaa atcgttcaac
1381 cagctggcaa aaagcggcga atggcaagga aaatcgctga aggaccaggc ggcggcgcag
1441 ggttatcagt gggtgagcaa cgccgacgaa ctgcaggccg ttaccctcgc caaccagcaa
1501 aaaccgctgc tggggctgtt cgccgacggc aatatgccgg tgcgttggct ggggccgaaa
1561 gccagctacc atggcaacct cgacaaaccg gcagtgacct gcgaaaacaa cccggcgcgc
1621 accgccgcca cgccaacgct ggcggcgatg accgaaaaag cgatcgcgct gttgaaagac
1681 aacccgaacg gcttcttcct gcaggtagag ggcgcgtcga tcgataaaca ggatcacgcc
1741 gccaacccgt gcgggcagat tggcgaaacc gtcgatctgg acgaagcggt acaaaaagcg
1801 ctggccttcg cccgcgccga cggcaacacg ctggtgatcg tcaccgccga ccacgcgcac
1861 gccagccaga tcgtctccgc cgacgccaaa gcgccgggcc tgacccaaaa gctgaccacc
1921 aaggacggcg cgccgatgac cctcagctac ggcaactctg aagaggaatc tcagggacac
1981 accggcaccc agctgcgcgt cgcggcctac ggcccgcacg ccgccaacgt agtgggcctg
2041 accgatcaga ccgatctgtt cttcaccatg cgcgatgcga tgggtatcaa gtaaccgacc
2101 cctgaacgat ggccgcccgc cacggcggcc atttattatt ggttaccgct aggtatttgc
2161 aatttaaccg caattattgt gtgataacga aataacgtct aattacgcgg caattaaccc
2221 atgaaaataa cgcggttata cggcctttct tccctctctg tttattcctg cgacagcaac
2281 cggccgtttt ttgcttatat tcattttttt tagttacaaa agtagttttt aacattctcg
2341 caaacaataa taccccgcca gtaacgtgaa ccgatcaatt tgcatccgga cggcctgcaa
2401 tagttcactt ttcgttaaat accctattaa caaaaaggca ttattgctgc ctatcggctt
2461 tcgccgggcg gtcggcttca ttatttccag ggtaactccc gctaaattct ttcacctgca
2521 atgagtagaa tttacacaca agagatttaa ttcaatgaga aaattactcg tcctgatttt
2581 cagtctcagc ctgtttggcg ttacgcaaca ggcggcggct gaagagaaga ataaaaccgt
2641 cgacctgcag gcatgcaagc ttggcactgg cc
//
LOCUS STYMALGA 1232 bp ds-DNA BCT 28-JUN-1990
DEFINITION S.typhimurium inner membrane protein genes (malG and malF) genes,
complete cds, and 3' end.
ACCESSION M33921
KEYWORDS .
SOURCE S.typhimurium (strain LT-2) DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1232)
AUTHORS Francoz,E., Schneider,E. and Dassa,E.
TITLE Sequence of malG in Salmonella typhimurium
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT malG protein is proposed to form with proteins malK, malF, and malE
a hetero-oligomeric protein complex that performs the ATP-dependent
translocation of maltose and of maltodextrins through the inner
membrane of enterobacteriaceae.
Draft entry and computer-readable sequence for [1] kindly submitted
by E.Dassa, 30-MAR-1990.
Institut Pasteur
Unite de Programation Moleculaire et Toxicologie Genetique
Inserm U169, CNRS URA 271
e-mail: hofnung%pasteur.bitnet@cunyvm.cuny.edu
FEATURES from to/span description
pept < 1 64 inner membrane protein (malF)
pept 79 969 inner membrane protein (malG)
binding 66 70 ribosomal binding site (put.)
signal 1050 1073 rho-independent transcription termination
signal (put.)
BASE COUNT 241 a 336 c 332 g 323 t
ORIGIN 91 minutes on the Styphimurium map.
1 cttcctgctg gtaggcgcgc tggcaatagt gaacctgaaa gccacgcgta tgaagtttga
61 ttaagggaga taataacaat ggctatggtc caacccaaat ctcagaaact gcgtctcctc
121 atcacgcacc tggggctgct gattttcatc gcagcgatta tgttcccgct gctgatggtc
181 atcgctatct cgctacgtga agggaacttc gccaccggaa gcctgatccc ggataaaatc
241 tcctgggaac actggcggct ggcgctgggt ttcagcgtgg aacatgcgga tggtcgggta
301 acgccgcctc ccttcccggt cctgctgtgg ctgtggaact cggtgaaaat cgccggtatt
361 accgccatcg gcatcgtggc gctctccacc acctgcgcct acgctttcgc acgaatgcgt
421 tttccaggta aagcaacact gctcaaaggg atgctgattt tccaaatgtt tccggcggtt
481 ttgtctctgg tagcgctgta tgcgttattt gatcgcctgg gacaatacat tccatttatc
541 gggctgaata cgcatggcgg cgtgatcttc gcctatcttg gcggcatcgc gctgcatgtt
601 tggacgatta aaggctactt cgaaactatc gacagttcgc tggaagaggc cgccgcgctg
661 gatggcgcaa ccccgtggca ggcgtttcgc ctggtgctgc tgccgctttc cgtaccgatt
721 ctggcggtgg tgtttattct gtcgttcatc gcagcgatta ccgaagttcc ggtcgcctcg
781 ttgttactgc gcgatgtgga cagctatacg ctggccgtgg gtatgcagca atatctcaac
841 ccgcaaaact acctgtgggg cgactttgcc gccgccgccg tcctttcagc tattcctatc
901 accctggtgt tcctgttggc gcaacgttgg ctggtcaacg gcctgacggc agggggtgtg
961 aaaggttaag ttttatcgaa gtacctgccc ggcaacgggc aatgccactg ccctcaagtt
1021 atgttatccc cacttatggt cgtattttgg cgctcttcgg agcgcttttt ttatgcgtat
1081 agacagcggc taatttattg ccacgccata cgtccgttac ggtgtggcga cgcctattct
1141 cgccgtagtc ggttcgaatt acataaccac aagtgataac caacaggagg atcgccgccg
1201 aatagagtaa tacatccatt gctttatgat cc
//
LOCUS ECOCLPPA 1236 bp ds-DNA BCT 28-JUN-1990
DEFINITION Escherichia coli ATP-dependent clp protease proteolytic component
(clpP) gene, complete cds.
ACCESSION J05534
KEYWORDS ATP-dependent protease.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1236)
AUTHORS Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J.,
Bowers,B. and Gottesman,S.
TITLE Sequence and structure of ClpP, the proteolytic component of the
ATP-dependent Clp protease of E.coli
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.Gottesman, 23-APR-1990.
FEATURES from to/span description
pept 378 1001 ATP-dependent protease (clpP)
mRNA 279 > 1236 clpP mRNA
pept < 1 133 ORF
signal 244 249 -35 region
signal 267 272 -10 region
signal 165 193 transcription termination signal
binding 367 372 ribosome binding site (put.)
BASE COUNT 329 a 291 c 318 g 298 t
ORIGIN
1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga
61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa
121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc
181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg
241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac
301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt
361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg
421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt
481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta
541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc
601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc
661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt
721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga
781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg
841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat
901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg
961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg
1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt
1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc
1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg
1201 cgcaagctga ttgccggtcc atccgtgtat atctgc
//
LOCUS PLYORIA 375 bp ds-DNA VRL 28-JUN-1990
DEFINITION Human polyomavirus BK (strain WWT) origin of replication.
ACCESSION M34048
KEYWORDS origin of replication.
SOURCE Polyomavirus BK (strain WWT) DNA, from human urine.
ORGANISM Polyomavirus BK
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Polyomaviruses.
REFERENCE 1 (bases 1 to 375)
AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P.,
Subramani,S. and Traavik,T.
TITLE At least two type of control regions can be found among naturally
occurring BK virus strains
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by T.Johansen, 07-MAY-1990.
University of Tromso
P.O. Box 977
N-9001, Tromso, NORWAY
e-mail: terje.johansen@esuit.uit.no
FEATURES from to/span description
orgrpl 97 119 polyomavirus origin of replication
site 1 375 regulatory region
site 143 210 P block
site 211 249 Q block
site 250 302 R block
BASE COUNT 123 a 86 c 95 g 71 t
ORIGIN
1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt
301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta
361 aactggacaa aggcc
//
LOCUS PLYORIB 424 bp ds-DNA VRL 28-JUN-1990
DEFINITION Human polyomavirus BK (strain UT) origin of replication.
ACCESSION M34049
KEYWORDS origin of replication.
SOURCE Human polyomavirus BK (strain UT) DNA, from human urine.
ORGANISM Polyomavirus BK
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Polyomaviruses.
REFERENCE 1 (bases 1 to 424)
AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P.,
Subramani,S. and Traavik,T.
TITLE At least two type of control regions can be found among naturally
occurring BK virus strains
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by T.Johansen, 07-MAY-1990.
University of Tromso
P.O. Box 977
N-9001, Tromso, NORWAY
e-mail: terje.johansen@esuit.uit.no
FEATURES from to/span description
orgrpl 97 119 polyomavirus origin of replication
site 1 424 regulatory region
site 143 210 P block
site 211 249 Q block
site 250 261 R block
site 263 315 P block
site 316 349 Q block
site 350 361 R block
BASE COUNT 134 a 103 c 105 g 82 t
ORIGIN
1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa
301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc
361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa
421 ggcc
//
LOCUS ECOUNCGA 85 bp ds-DNA BCT 28-JUN-1990
DEFINITION E.coli H+-ATPase gamma subunit (uncG) gene, 3' end.
ACCESSION M34095
KEYWORDS H+-ATPase.
SOURCE E.coli (strain KF12) DNA, clone pBWG1.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 85)
AUTHORS Iwamoto,A., Miki,J., Maeda,M. and Futai,M.
TITLE H+-ATPase gamma-subunit of Escherichia coli: Role of the conserved
carboxyl-terminal region
JOURNAL J. Biol. Chem. 265, 5043-5048 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 84 H+-ATPase gamma subunit (uncG) (AA at 1)
BASE COUNT 21 a 24 c 23 g 17 t
ORIGIN
1 ctgcagttgg tatacaacaa agctcgtcag gccagcatta ctcaggaact caccgagatc
61 gtctcggggg ccgccgcggt ttaaa
//
LOCUS HUMPIM1B 1737 bp ds-DNA PRI 28-JUN-1990
DEFINITION Human protein kinase (Pim-1) promoter DNA.
ACCESSION M34228
KEYWORDS protein kinase; proto-oncogene.
SOURCE Human cell line K562 DNA, clone pLH/P.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1737)
AUTHORS Meeker,T.C., Loeb,J., Ayres,M. and Sellers,W.
TITLE The human Pim-1 gene is selectively transcribed in different
hemato-lymphoid cell lines in spite of a G+C-rich housekeeping
promoter
JOURNAL Mol. Cell. Biol. 10, 1680-1688 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
mRNA 1699 > 1737 Pim-1 mRNA (alt.)
mRNA 1700 > 1737 Pim-1 mRNA (alt.)
mRNA 1703 > 1737 Pim-1 mRNA (alt.)
mRNA 1705 > 1737 Pim-1 mRNA (alt.)
mRNA 1706 > 1737 Pim-1 mRNA (alt.)
mRNA 1732 > 1737 Pim-1 mRNA (alt.)
mRNA 1733 > 1737 Pim-1 mRNA (alt.)
mRNA 1735 > 1737 Pim-1 mRNA (alt.)
binding 891 900 Sp1 binding site
binding 1422 1431 Sp1 binding site
binding 1433 1442 Sp1 binding site
binding 1496 1505 Sp1 binding site
binding 1222 1248 AP2 binding site
binding 1290 1308 AP2 binding site
binding 1662 1685 AP2 binding site
binding 1340 1366 PPF-348 binding site
BASE COUNT 360 a 546 c 426 g 405 t
ORIGIN
1 ctttgaatac tcaggaggtg aatttgggtc atttcatctc ctctcttagc tgctgtatga
61 ccctgtgcca tcctatttat gtttcatttt cctatctgca aaaagggagt aattcttgtt
121 cccttgggtt ttctcctagt cataaaaaat gaatgagttc ggccacagta cacaaacaaa
181 agaaaatatg taatattttg taggataaat tcataaaagt tgtggaggat ctgggcactt
241 ttataaagca agcactgtag aaaactttca aaggaaacat ttaaattcac taatgacagc
301 tatagggatc ctgatttttc ctttaaatgg caggcacttc aaaaattaaa atagaatgta
361 gttagctttc aattacttag gccactgaca attcaattta tggattatat agtattttaa
421 tttactcatt tcacacgtcg ttatcaactc tacatgaagt ttaaaaggac aaaggatgca
481 agtggaattg gtcagtgttc cgggttattt ccaagggagg cagagggtgg ggggtttcct
541 ttgagacaag acttggggtt ggccaataat tgctggtatt gcctgcctgg taataacagg
601 ctgatgaaaa ggtgactaca aaacgtgaaa actggttaaa tcaagcgcac cctcccaccc
661 tagttttaga tgaggaattt tccgccctca cagaaggggc tgaggcagca tctggcatca
721 caacactaac atttgcttcg tgatttcctc tttacccggc cctttgacac acatcccttc
781 ccagaaatca ggattcgctg gtgcttttgc atttctaaaa tgggaatccc gtggctgagc
841 ttttagccgg ccggaacgac tgagggctgc atccctttcg caggagcggc gtcccgcctc
901 cagttctgtt ccaggcctga ctcctccact ccctccgtga ctcatgtctg cggatccttc
961 gcccccgacg cgccccccaa cacacaaacc cccagaatcc gcccccagcc tacagcgcga
1021 cgtcagcccg ccccagccga cttggaggtc tcgggtctga gtcacacaga aagaccaccc
1081 tcgtcggcat ccccacacac agtccgacac ccggcgcgcc ggcctccccg cctgacacac
1141 taacgcccgt cgtctccgcg caacttgtta tgctccggct cgagcccttg acccaaaaac
1201 ctcagcgaaa cggagagccg cagagccggc ctcgggcggc ctttgatggc tttgttattg
1261 tttgggtttg aatcgatacg cccctcccca tccttcctcc ctcgcggccc tacacccagc
1321 tcccgcctcc cctcacgccc cgcgcccctc cccctccatt ttggcgcctt ttccttcccg
1381 ccacgtcgtg gcggcgtaga gaccattctg accgcgagag ctgggcgggg cgggggcggg
1441 gcgcgccgag ttatgcagat caatcggcct ctggttggct ggagtagcgc tggcaggggc
1501 ggggccgggg cgcggccaca gagcgcgcgg ggcgggggcc gaggggagtc gcccagtccc
1561 gccgcttccc caccccctct cctccctcgg ccggcccggc agccctgctc cccgccttgg
1621 cctcccggag aggccccgcc ccgtcccccc ccgccgcgcc ctccccgcgc gccctccccg
1681 ccggcgcgct cctccccttt actcctggct gcggggcgag ccgggcgtct gctgcag
//
LOCUS ONNCG 11835 bp ss-RNA VRL 28-JUN-1990
DEFINITION O'Nyong-nyong virus (Gulu strain) complete genome.
ACCESSION M20303 M33999
KEYWORDS nonstructural protein; polyprotein.
SOURCE O'Nyong-nyong virus (strain Gulu, isolated from Anopheles gambiae),
cDNA to viral RNA, clones 2,4,7,8,9,11,13,14,17,17s,18,22,23,27,
passed in BHK-21 cells.
ORGANISM O'Nyong-nyong virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Togaviridae; Alphaviridae.
REFERENCE 1 (bases 1 to 11835)
AUTHORS Levinsion,R.S., Strauss,J.H. and Strauss,E.G.
TITLE Complete sequence of the genomic RNA of O'Nyong-nyong virus and its
use in the construction of alphavirus phylogenetic trees
JOURNAL Virology 175, 110-123 (1990)
STANDARD simple staff_entry
REFERENCE 2 (bases 4079 to 7621)
AUTHORS Strauss,E.G., Levinson,R., Rice,C.M., Dalrymple,J. and Strauss,J.H.
TITLE Nonstructural proteins nsP3 and nsP4 of Ross River and
O'Nyong-nyong viruses: Sequence and comparison with those of other
alphaviruses
JOURNAL Virology 164, 265-274 (1988)
STANDARD full staff_entry
FEATURES from to/span description
pept 80 7624 O'Nyong-nyong polyprotein A
matp 80 1684 nonstructural protein P1
matp 1685 4078 nonstructural protein P2
matp 4079 5788 nonstructural protein P3
matp 5789 7621 nonstructural protein P4
pept 7670 11413 O'Nyong-nyong polyprotein B
matp 7670 8449 C protein
matp 8450 8641 E3 protein
matp 8642 9910 E2 protein
matp 9911 10093 6K protein
matp 10094 11410 E1 protein
mRNA 1 11835 O'Nyong-nyong mRNA
RNA 7621 7669 26S RNA
signal 11817 11835 polyA signal
BASE COUNT 3676 a 2860 c 2859 g 2440 t
ORIGIN
1 atagctgcgt gatacacaca cgcagcttac gggtttcata ctgctctact ctgcattgca
61 agagattaaa gtacccatca tggattcagt gtatgtagac atagatgctg acagcgcgtt
121 tctgaaggcg ttgcagcaag cataccccat gtttgaggtg gaaccaaagc aggtcacgcc
181 aaatgaccat gcaaacgcta gagcattttc gcatctagca ataaaactga tagagcagga
241 aattgatcca gactcaacca ttctagacat tggtagcgca ccagctagga ggatgatgtc
301 tgatagaaaa taccactgcg tctgcccgat gcgcagcgca gaagaccctg agaggctcgc
361 gaattacgcg agaaaacttg cgtcagccgc tggaaaggtg acagataaaa acatctccgg
421 aaaaattaat gatctacaag ctgtgatggc cgtaccgaat atggaaacat ccacattctg
481 cctacacact gatgctacat gcaaacaaag aggagacgtc gccatttatc aagacgtcta
541 cgccgtccat gcacctacct cgctgtacca tcaggcgatt aaaggagtcc gcgtggcata
601 ctggataggg ttcgatacga cacctttcat gtacaatgca atggctggcg catacccatc
661 atattcaaca aactgggctg atgagcaggt actgaaagct aagaacatag ggctgtgttc
721 aacagaccta tctgagggta gacgaggcaa actatccatc atgagaggca aaaaattgaa
781 gccatgcgac cgagtgctat tctcggtcgg ctcaacactc taccctgaaa gtcgtaaact
841 tctacaaagc tggcatttac catcggtatt tcatctgaag ggtaaactca gcttcacctg
901 ccgctgtgac acgatcgtct catgcgaagg atacgttgtc aagagagtga ccatgagtcc
961 aggcatctac ggaaagacat cggggtatgc tgtaactcat catgccggcg gcttcctgat
1021 gtgcaagacg acagatacag tagacggcga aagggtatca ttctccgtgt gtacttacgt
1081 accagctact atctgcgacc agatgactgg aatccttgcc actgaggtaa ccccagaaga
1141 cgcacagaaa ctactggttg ggctaaacca acggatagtg gtcaatggca ggacgcaacg
1201 taatacaaac accatgaaaa actacctgct cccaatagtt gctcaggcct tcagcaagtg
1261 ggccaaagaa tgtcgaaagg acatggagga cgaaaaactc ttgggtgtcc gagagaggac
1321 cttaacgtgc tgttgcctat gggcatttag aaagcacaag acgcatacgg tgtacaaaag
1381 accggatacc cagtcaatcc aaaaggtccc tgccgaattt gacagctttg tgataccaag
1441 tctgtggtcg tcaggtttat caattccgct gagaaccaga atcaagtggc tcttgagcaa
1501 agctccaaaa tacgagcaac taccgcacag cggaaacgcc gaggaagcag cccaggctga
1561 aacagatgcg gtagaagaac aggaggcaga gctaacccga gaagctatgc caccattgca
1621 ggcgacacag gatgacattc aggtagaaat tgatgtagag caacttgaag accgagcagg
1681 agcgggcata gtcgaaacac caagaggagc aatcaaagtc acagcccaac cgtcagacct
1741 tgttgtcgga gagtacttag tactgacacc gcaggcggtc ctgcgcagcc aaaaactcag
1801 tctgattcac gcgcttgcag agcaggtaaa aacgtgcaca catagtgggc gagcaggcag
1861 gtacgcggtt gaagcatacg atgggcgtgt tctagtgccc tcgggctacg cgatacccca
1921 ggaagacttc cagagcttaa gcgaaagtgc caccatggta tttaacgagc gagagtttgt
1981 gaaccggaag ttacaccaca tcgccatgca cggcccagcg ctgaacactg atgaagagtc
2041 atatgaactg gtaagggtag agaaaacaga acacgagtac gtctatgacg ttgatcagaa
2101 gaaatgttgt aagagggagg aagcaacagg actagtgcta gtaggcgact taactagccc
2161 accataccat gagttcgcct acgaaggact aaaaatccgc ccagcatgtc catacaaaac
2221 ggcagttata ggtgtcttcg gagtaccggg ttctggcaag tcggctataa tcaaaaacct
2281 ggtaaccagg caagacttgg tgactagtgg aaaaaaagaa aactgccaag aaatctccaa
2341 tgacgtaatg cggcaaagga aattggagat atctgcacgt acagtcgact cactactcct
2401 gaatggatgt aacaagccag tggaagtact gtacgtggac gaggcattcg cttgtcattc
2461 gggaaccctg ttggcactga tagccatggt tagaccgcgt cagaaggtcg tactttgtgg
2521 cgacccaaag cagtgcggat tcttcaatat gatgcaaatg aaggtcaact ataatcacaa
2581 catctgcaca caggtgtacc ataaaagcat atcaaggcgg tgtacactgc ctgtaacagc
2641 catcgtgtcc tcgttgcatt acgagagcaa gatgcgcact acaaatgagt acaaccagcc
2701 aatcgtagtg gatactacgg gcataacaaa accagaaccc ggggacttag tgttaacgtg
2761 tttccgggga tgggttaagc agctgcaaat agactaccgt ggaaacgaag tcatgacagc
2821 agctgcttct caggggctga ccagaaaagg tgtttatgca gtaaggcaga aagtcaacga
2881 aaaccctctg tatgcaccaa catcagaaca cgttaacgtg ctattgacac gcacagaggg
2941 caagttgaca tggaagacac tctcaggcga cccatggata aagatactgc agaacccccc
3001 aaaaggggac tttaaggcaa caatcaagga gtgggaagca gaacacgcct ccatcatggc
3061 aggaatatgc aatcaccaga tggcttttga cacatttcag aacaaagcta atgtatgctg
3121 ggctaaatgc ctggtcccta ttcttgacac tgctggaatc aaattaagtg acaggcagtg
3181 gtctcagata gtgcaagctt ttaaagaaga tagggcctac tctccagaag ttgcactgaa
3241 tgaaatatgc actcgcatat atggggtaga cctggacagc ggactattct caaagccact
3301 gatatccgtc tactatgcag acaaccactg ggacaataga ccaggaggaa aaatgttcgg
3361 gttcaaccct gaggtggcac ttatgcttga aaagaaatat ccctttacaa aaggtaagtg
3421 gaacattaac aagcagatat gtataactac cagaaaggtt gacgaattta accccgaaac
3481 caacataata ccggccaacc gcagactgcc gcactcactc gtggctgaac accactcagt
3541 gagaggggaa agaatggaat ggctggtaaa caaaatcagc ggtcaccaca tgttgttggt
3601 tagcggtcat aatcttatat taccaacaaa aagagtcacc tgggtagcac cgttaggcac
3661 ccgaggtgca gactacacat ataacctgga acttggtcta ccagccacac taggcagata
3721 tgacctggta gttatcaata tccatactcc attccgcata catcattacc agcagtgtgt
3781 agatcacgca atgaagctcc agatgctagg gggggactct ctacggctgt taaagccggg
3841 aggttcactt ctgattagag cttacgggta cgccgaccga accagtgaaa gggtcattag
3901 cgtattggga cgcaagttca gatcgtccag ggctctgaaa cctcagtgca tcacgagcaa
3961 tacagaaatg ttcttcctat ttagccgatt cgacaatgga agaaggaact tcaccacaca
4021 tgttatgaac aaccagctga acgcagtgta tgcaggactg gccactagag cgggctgtgc
4081 cccgtcatac cgagtgaaac ggatggacat cgcaaagaac actgaggaat gcgtggtaaa
4141 cgccgccaat ccgcgcggag taccaggcga tggagtatgt aaagccgtgt atagaaaatg
4201 gccagaatca ttcagaaaca gtgcaacacc agtggggact gcaaagacaa tcatgtgcgg
4261 tcaatacccc gtcatccacg cagtaggccc taacttctca aactattctg aggctgaagg
4321 ggatagggaa ttggcttcag tgtatagaga agtggcgaaa gaagtgtcta ggctaggagt
4381 gagcagtgta gccatccctt tgctctcaac cggtgtgtac tcaggaggca aagacagact
4441 gctgcaatca ctaaaccatc ttttcgcagc gatggattcg acagatgcag acgttgtcat
4501 ctactgcagg gacaaggaat gggagaagaa gatcactgaa gccatatcac taagatccca
4561 ggtagaatta ctagatgatc acatctcagt ggattgcgac attgtacgcg ttcatccaga
4621 cagcagcttg gcaggccgaa aggggtacag cacagtagag ggagcactct actcgtacct
4681 agagggaaca agattccacc aaactgcagt agatatggca gagatatata ccatgtggcc
4741 gaaacaaact gaagccaacg aacaggtctg cctatatgct ctgggggaga gtatagagtc
4801 cgtcaggcaa aaatgtcccg tagacgacgc cgacgcctca ttccctccga aaacagtccc
4861 gtgcctatgc cgttatgcta tgacgcctga acgagttgca cgtctacgca tgaatcatac
4921 caccagcatc atagtgtgct cgtcttttcc gctgccgaaa tacaaaatcg agggcgtgca
4981 aaaagtaaaa tgttcgaaag cactcttgtt tgatcacaac gtaccgtctc gagtgagccc
5041 gagaacgtac aggcctgcgg acgaaatcat acagacacct caaacaccaa ctgaagcgtg
5101 tcaggacgca caactcgtgc agtcaataaa tgatgaagca gtgccagttc cctcagactt
5161 agaggcttgt gacgcaacta tggactggcc ctctatcggc accgtatcaa caagacaaag
5221 acacgactca tctgacagcg agtatagtgg ctccagaagt aacatacaac tagtgacggc
5281 ggacgtgcat gcaccaatgt acgcacattc gctggcgtcc agcggaggtt caatgctgtc
5341 gctgtccagt gaaccagctc agaacggcac aatgatacta cttgactcag aagacacaga
5401 cagtataagc agagtaagca caccgatcgc cccgcccaga agacgtttgg gaaggaccat
5461 aaatgtgacc tgcgacgagc gggaagggaa aatactccct atggccagcg acaggttctt
5521 cactgctaag ccatacactg tcgcactgag cgtatcaaca gcagatatga ctgtgtatcc
5581 catccaggca ccgctaggat tgataccacc acctaccctc gaaccgatca ctttcggaga
5641 cttcgccgaa ggtgaaatag acaacctcct gacaggggca ttgacatttg gggacttcga
5701 gccaggtgaa gtggaagagc tgacggatag cgagtggtca acgtgctcgg acacagacga
5761 agagttacga ctagacagag cagggggtta catattctcc tctgacactg gtcaaggtca
5821 tctacagcaa aaatcagtac gtcaaacgac gctaccggta aacattgttg aagaggtcca
5881 cgaagagaaa tgctatccac ctaaattgga tgagatcaaa gagcaactac tacttaagag
5941 acttcaggag agtgcttcca cggctaaccg gagtaggtac caatctagaa aagtggaaaa
6001 catgaaagcc acgattatcc acagactgaa agagggttgc agactctatt tggcgtcaga
6061 aacaccgagg gtcccatctt accgagtcac atacccggcg cccatctact cgccttcaat
6121 caatatcaaa ctgactaacc cagagactgc agtagcagtg tgtaacgagt ttttggccag
6181 aaactatcca actgtggcat cctaccaagt cactgacgag tacgacgcgt acttggatat
6241 ggtagacggg tccgaaagtt gcctagacag agctacattc aacccgtcta aactcaggag
6301 ttacccaaaa caacactctt accacgcacc caccatcaga agtgcagtgc catcaccatt
6361 ccaaaatacg ttgcagaatg tcttggcagc ggccacaaaa agaaactgca acgtaacgca
6421 gatgagggaa ctacccacta tggactccgc agtgtttaac gtggagtgtt ttaagaagta
6481 cgcttgcaac caagagtact ggagagagtt cgcctcaagc cctataaggg taacgacaga
6541 gaatctgaca atgtatgtga cgaaactaaa ggggcctaaa gcggcggcac tcttcgcaaa
6601 aacacacaac ttgctgccgc tacaagaggt accgatggac aggttcacaa tggacatgaa
6661 acgtgatgtg aaagtgacac caggtacaaa gcacaccgag gaaaggccga aagtacaggt
6721 catacaggcg gcagaaccac tggcaacagc atacctgtgt ggcatacaca gagagttggt
6781 gagaagacta aatgcagttc tgctaccgaa tgtccacaca ctgttcgata tgtcagccga
6841 agacttcgat gcaattatag ccacccattt caaaccgggc gatgctgtac tagaaactga
6901 catagcctca tttgataaga gtcaagacga ctcgcttgcg tcgaccgcca tgatgttgct
6961 agaagacctt ggggtagatc aacctatcct ggatctgata gaagcagcat tcggcgaaat
7021 atccagttgt catctaccga cgggtacgcg gttcaagttc ggcgcaatga tgaaatcagg
7081 catgtttcta accctgtttg tcaataccct cctgaacatc accattgcca gtcgggtgtt
7141 agaggagcga ttgactactt cagcctgtgc agctttcatt ggggacgaca acataataca
7201 tggggttgtc tctgacgcac taatggctgc acgttgtgct acgtggatga acatggaagt
7261 gaaaatcatc gatgcagtag tgtcagagaa ggcgccatac ttctgtgggg gatttatttt
7321 acacgacacg gtgacaggca cgtcgtgcag agtagcagac cctttaaaga gactgttcaa
7381 gctaggcaaa cctctggcag ctggagacga acaggatgag gacagaagac gtgctctggc
7441 agatgaggtt actagatggc aaagaaccgg cttagtcaca gaattggaaa aagcagtata
7501 ttcaaggtat gaagtacaag gaataacagc cgtaataaca tcaatggcta cctttgcgaa
7561 tagcaaagaa aactttaaga aactaagagg gcccgtcgta accttgtacg gcggacctaa
7621 ataggtacgc actacagcta cctacttaga gcatacatcg ctaatagcca tggagttcat
7681 accagcacaa acttactaca atagaagata ccagcctaga ccctggactc aacgccctac
7741 tatccaggtg atcaggccaa aaccacgccg aagaaggcct gcaggacaac tcgcacaact
7801 gatatccgca gtcagcagac tagcactgcg tacagttccc cagaaaccac gccggacccg
7861 aaaaattaag aagcaaaagc aagtaaagca agaacaacag agtactacga accagaagaa
7921 aaaggcgccg aaacaaaagc agacccaaaa gaaaaagaga ccaggacgaa gggaaaggat
7981 gtgcatgaag attgaaaatg actgcatctt cgaagtcaga catgaaggaa aagtaacggg
8041 gtatgcatgc ctagtaggtg ataaggtaat gaaaccagca cacgtgaaag gaactattga
8101 caacgcagac ctagcgaagt tggcgttcaa aagatcatcc aaatatgatc tagagtgcgc
8161 acagatacca gtgcacatga aatcggacgc ctcaaagttc acccatgaaa aaccagaagg
8221 ctattacaac tggcatcacg gagcagtaca gtattctgga gggaggttca cgatccctac
8281 aggcgcagga aagcctgggg acagcggaag accaatcttt gacaacaagg ggcgtgtcgt
8341 ggctattgtt ctaggcggag caaacgaagg aaccaggaca gcactatctg tagtgacttg
8401 gaataaagac atagtcacaa aaatcacacc agaggggtca gttgaatgga gccttgccct
8461 ccctgtcatg tgcctgttgg caaatacaac cttcccatgt tcccaaccgc cttgcgcgcc
8521 gtgctgctac gaaaagaaac cggaagaaac cttgagaatg ctggaggaca acgtcatgca
8581 accaggatat taccagttac tcgattcagc attggcctgc tcacaacgtc gtcaaaaacg
8641 taatgcaaga gaaaacttca atgtctacaa agtcactagg ccgtacttag cccactgtcc
8701 tgactgcggg gagggacact catgccacag cccaatagca ttagaacgga tcagaagtga
8761 ggcaacagat ggtaccttga aaatccaggt atctctgcaa atcggaataa agacagacga
8821 cagccacgat tggacgaagc tacggtatat ggatagccat acacctgtgg atgcagaccg
8881 atccgggttg tttgtcagaa cgtcagcacc gtgcaccatc acgggaacga tgggacattt
8941 catactagca cgctgtccga aaggagagac gctgacggta ggatttgtag acagtagaag
9001 gatcagtcac acgtgcatgc acccgttccg ccacgagcca ccgctgatag ggagagagaa
9061 gtttcactcc cgcccgcagc atggcaaaga actaccttgc agtacatacg tccataccac
9121 agcggcaact gctgaggaaa tagaagtgca tatgccgcca gatacccctg actacacgct
9181 gatgacacag caagcgggaa acgttaagat cacagttgac ggccagacgg tacgatacaa
9241 gtgcaaatgt gacggctcca atgaaggatt aataaccgct gacaaagtca taaataactg
9301 caaagtagac caatgccaca cagcggttac aaaccacaag aaatggcaat acaattcacc
9361 gctgaccccg cggaactccg aacaaggaga tagaaaaggt aagatccata tcccatttcc
9421 actggtgaac acaacctgca gggtaccaaa agcaagaaat ccgactgtca catacggtaa
9481 aaacagagtc actctgctgt tacatccaga ccacccaaca ctcctttcgt accgcgccat
9541 gggaaggatc ccggattacc atgaagagtg gataacaaac aagaaggaaa taagtatcac
9601 agtaccagca gaaggcttag aggttacgtg gggtaataat gacccataca aatattggcc
9661 ccaactgtct acaaatggta ctgcgcacgg gcacccacat gaaataatcc tctattacta
9721 tgagctgtac ccaactacca caattgctgt actagctgct gcttctatcg taataacatc
9781 tttggtaggt ctatcattag gcatgtgcat atgcgcgaga cgcaggtgca tcacgccata
9841 tgagctgact ccaggagcta ccatcccatt cctcctaggt gtactatgct gtgccaggac
9901 tgcaaaagca gcatcgtact acgaagctgc aacatacctc tggaatgagc aacaaccatt
9961 attttggtta cagcttctaa tccctctgtc agctgcaatt gttgtgtgta attgcctaaa
10021 acttttacca tgctgctgca aaacattgac ttttttagcc gtcatgagca tcggtgcccg
10081 cactgtgacc gcgtacgagc acgcaacagt gatcccgaac acggtgggag taccgtgtaa
10141 gactcttgtt agcagaccag ggtacagccc tatggtctta gaaatggagc tacagtcggt
10201 cactctggaa ccagcattat ccttggatta cattacgtgt gagtataaaa caatcacacc
10261 gtccccgtac gtaaaatgct gtggtacagc tgaatgtaag gccaagaacc tgccagatta
10321 taactgcaaa gtattcacag gcgtctaccc atttatgtgg ggaggagcat actgcttctg
10381 tgacgcagag aacacacagc tcagcgaggc acacgttgag aaatcagaat catgcaaaac
10441 tgagtttgca tcagcctaca gagcccacac agcttcagta tcagctaaac tacgtgtctt
10501 ttaccaaggg aataatatca ccgtgtctgc atacgccaat ggtgatcatg cagttacggt
10561 ggaagacgcg aagtttgtca tcggtccact atcgtccgcc tggtcaccat ttgataataa
10621 gatcgtggtg tacaaaggcg aagtctacaa tatggactat ccacctttcg gcgcagggag
10681 gccaggacag ttcggtgaca tccagagccg cacgccagac agcaaggacg tctatgcgaa
10741 tacgcagtta atactgcaaa gaccagcggc aggagcaata cacgtgcctt actcccaggc
10801 accttcgggc tttaagtact ggctcaagga aaaaggggca tcattgcagc atactgcacc
10861 atttggctgt cagatagcaa caaacccggt aagagcagtg aactgtgcag tgggcaacat
10921 accagtctcc attgacatcc cagatgcagc tttcaccagg gtcactgacg ctccttccat
10981 cacagacatg tcctgcgaag tagcttcgtg tacccattca tctgattttg gaggtgccgc
11041 agtcataaag tacacagcta gtaaaaaagg aaaatgcgcc gtgcactctg taacaaatgc
11101 ggtcactatc cgcgaaccta acgtagatgt caagggaaca gcacaattgc aaattgcctt
11161 ctcgaccgca ctagctagtg cggaattcaa ggtgcagatc tgctccacac tggtacactg
11221 ctcagcgacg tgccatcctc ctaaagacca tatagtcaat tacccgtcac ctcacaccac
11281 actaggagtg caggacattt caacgacagc tatgtcttgg gtccagaaga ttacaggagg
11341 agtgggactc gtggttgcta tagctgcttt gatcttaatt atagttctct gcgtatcatt
11401 tagcagacac taaagtacac atgccccaaa gtaatatact gtacaggtat acgtgctccc
11461 tgagcagcac gatatatgta ttatccataa aagaaaaaca aaacaaaaat ataaaaatta
11521 taaaatacaa aagtataaaa acaggtattg gtaccccctt agaggtacat tatttaacca
11581 gtgaaaatag gtattggtgc ccccttagag gcacattata gaaccaggta taggtgcccc
11641 cttagaggta caccaacaat aggtataagt gcccccttag tggcacacta accaccacaa
11701 tcggtaatat gaagagacgt aggtatgaag cttcgtaaaa gctgccgaac ttactttaag
11761 atgtaggcgt accgaactct tcgacaattc tccgacgcag ggacgtagga gaagttattt
11821 tgtttttaat atttc
//
LOCUS PNGE155A 155 bp ds-DNA INV 28-JUN-1990
DEFINITION P.redivivus E155 gene.
ACCESSION M33925
KEYWORDS .
SOURCE P.redivivus (strain SC) adult DNA, clone E155.
ORGANISM Panagrellus redivivus
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia;
Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae.
REFERENCE 1 (bases 1 to 155)
AUTHORS de Chastonay,Y., Muller,F. and Tobler,H.
TITLE Two highly reiterated DNA sequences in the low C-value genome of
Panagrellus redivivus
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.De Chastonay, 26-APR-1990.
Institue of Zoology
Perolles, Fribourg CH-1700
SWITZERLAND
e-mail: stocker%cfruni51.bitnet@cunyvm.cuny.edu
BASE COUNT 43 a 33 c 30 g 49 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccaag gtggcccccg gggccatacg gtacttaaca aatttgcata aatgtgtgat
61 ttttgacaca aaacgagctt attaacatct ttggaattgc ccatgaatct tcatttgaat
121 gtggattata gatcttatcg ccccccaagc ttttg
//
LOCUS PNGE167A 167 bp ds-DNA INV 28-JUN-1990
DEFINITION P.redivivus E167 gene.
ACCESSION M33926
KEYWORDS .
SOURCE P.redivivus (strain SC) adult DNA, clone E167.
ORGANISM Panagrellus redivivus
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia;
Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae.
REFERENCE 1 (bases 1 to 167)
AUTHORS de Chastonay,Y., Muller,F. and Tobler,H.
TITLE Two highly reiterated DNA sequences in the low C-value genome of
Panagrellus redivivus
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.De Chastonay, 26-APR-1990.
Institue of Zoology
Perolles, Fribourg CH-1700
SWITZERLAND
e-mail: stocker%cfruni51.bitnet@cunyvm.cuny.edu
BASE COUNT 54 a 31 c 27 g 55 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcccga tgttcagatt tgaactttgt atgaatttga gatggcccat ttgatcattg
61 cggattcttc agaatccgaa tctaatggct cagacccaca gaataagctt taaataggct
121 attctaacta tattgattca actaatctaa agtatgcctt aaaacat
//
LOCUS SNDHNGP 2600 bp ss-RNA VRL 28-JUN-1990
DEFINITION Sendai virus hemagglutinin-neuraminidase (HN) gene, complete cds.
ACCESSION M34033
KEYWORDS glycoprotein; hemagglutinin; neuraminidase.
SOURCE Parainfluenza virus type 4A (strain Toshiba) from primary monkey
kidney cells, cDNA to viral RNA, clone pG4HF1.
ORGANISM Parainfluenza virus type 1
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Paramyxoviridae; Paramyxovirus.
REFERENCE 1 (bases 1 to 2600)
AUTHORS Bando,H., Kondo,K., Kawano,M., Komada,H., Tsurudome,M.,
Nishio,M. and Ito,Y.
TITLE Molecular cloning and sequence analysis of human parainfluenza type
4A virus HN gene: Its irregularities on structure and activities
JOURNAL Virology 175, 307-312 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 343 2064 hemagglutinin-neuraminidase (HN)
mRNA 53 2600 HN mRNA
mRNA < 1 52 fusion protein (F) mRNA
signal 39 52 consensus polyA signal
signal 2587 2600 consensus polyA signal
BASE COUNT 955 a 561 c 410 g 674 t
ORIGIN
1 acaaaacgac acccaatcaa acaatctcca catttcaatt ttaagaaaaa aactattata
61 ggggggaaca cacttctcag ccctgattgc tcaaggccct tgcatgtgca accgagacac
121 cccccacaag caccggaata agacctgaca acaaagtagc agccaccacg acccaaaaac
181 aaaattaaaa ggatccggta acagcccatc aaccagcaat catagaatcc aacaatccag
241 agagacgtca catcaactca tccacgaatc ttcgaaggga acatcccaga caaaatcaca
301 gcccattccc tgatcacgga taaactgaga aagatcacaa gaatgcaaga ctcacatggt
361 aatacacaaa tactcaacca ggcaaattca atggtgaaaa gaacatggag attactattt
421 cgaattgcaa ccttaatatt acttgtttca atatttgtgt tatcgctcat aattgtatta
481 cagtcaacac cggggaattt gcaaaacgat atcaatataa ttagaaagga gctcaatgaa
541 cttatggaga attttgaaac tacatctaag tcactgttaa gtgtatcaaa tcaaatcact
601 tacgatgtat cagtacttac tcctataaga caagaagcta ttgaaacaaa catcatttca
661 aaaataaaag atcattgcaa agatagagta attaaagaag gaagcacttg cacattgaat
721 cgcagccctt tgcatgatgt ctctttttta aatgggttca ataaattcta tttcacatat
781 aaagataata tgcaaattaa gtttaaatca ttattagatt accccaattt tattccaact
841 gctacaactc cccacggatg cattcgaatt ccatcattct ccttaggtca aacccattgg
901 tgttataccc ataatataaa cctactagga tgtgcagacc ctgcatctag caatcaatat
961 gtatcactag gaaccttaca agtcttaaaa atgggtgacc cttattttaa agtcgagcat
1021 agtcattatt taaatgacgg gaggaatcga aagagttgtt cagtggttgc tgtccccgac
1081 ggatgcctgc ggaattgtgt gaccatgaca aaaaatgaga cagagaattt caaagacctc
1141 aattggcaac acaattactt acatacatat catataatgg taccattaaa gactcgtata
1201 ataaatccac caggatcatc cagagattgg gttcatatcg caccaggggt aggctcgggc
1261 cttttgtatg ccaaattact tatatttcct ttgtatgggg gtctcacgga aaaatcagtg
1321 atacataata atcaatcagg gaaatatttt ttccctaatt caactaaatt gcaatgccgt
1381 aacagcacta tggaaaaaat aaaaggagca aaagattcat acacaataac ttacttctca
1441 gggagactta tacagagtgc atttctggtt tgtgatctaa gacaatttct ttctgaagat
1501 tgtgaaatct taattcctag taatgattac atgatggtcg gtgcagaggg tcgattatat
1561 aacattgaga acaacatatt ttattatcag agaggatcca gctggtggcc ttatccgagc
1621 ctctatagaa tcaggttaaa ccttagtaag aaatatccta gaataactga aattaaattt
1681 acaaaaattg aaatcgcccc aagaccaggc aacaaagatt gtccaggaaa taaggcttgc
1741 ccaaaagaat gtataacggg agtctaccaa gatatattgc cactaagtta tcccaatact
1801 gcatttccac acttaaaaca agcgtattat acaggttttt atcttaataa ctcgctcgag
1861 agacgcaatc caacatttta tactgctgac aatctagatt accatcaaca ggaaagatta
1921 ggtaaattca atcttactgc tggatactct actacaactt gttttaaaca gaccactact
1981 gcgaggttat actgtctcta cataattgaa gtgggtgact cagtcattgg ggactttcag
2041 atcacccttt ttttagcagc ttaatagacc agactgttaa ttaatcaaca aagttattct
2101 gtaatataaa ctgatcttat aagtgaaaag atgcctatcc aaggaggttg atagacaaat
2161 agtaaaagta gcaattgtaa caaaactcta aggaaaaagt aattcgagaa atattataga
2221 ctgacttcag agcaaacaca acatcgatcc ataatagtca atataatcaa taatactcta
2281 tgagacctta cctatcaaca gcaaaaaaca cagtccatca agcggaaccc aactcgctcc
2341 atccttaatc atccactgaa agaaaaaata tacgaaggac catcggccac cgggtccaaa
2401 caatctagca caaaaattca aacaaccgcc aaactctgtt cggcctcaac aaacaatccg
2461 ccaagccatc tgtcattcct ataccaacac acaaccatcc cattcctcaa aagcaattca
2521 atccgcgacc caaagaagac tctccacata tccagctaat ccgtcgatcc gacacatcat
2581 cgtatctttt aagaaaaaaa
//
LOCUS YSCRP026A 2133 bp ds-DNA PLN 28-JUN-1990
DEFINITION S.cerevisiae RNA polymerase II sixth subunit (RP026) gene, complete
cds.
ACCESSION M33924
KEYWORDS RNA polymerase II; oncogene; transmembrane protein;
tyrosine kinase.
SOURCE S.cerevisiae single-copy plasmid DNA, clone 2.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2133)
AUTHORS Archambault,J., Schappert,K.T. and Friesen,J.D.
TITLE A suppressor of an RNA polymerase II mutant of Saccharomyces
cerevisiae encodes a subunit common to RNA polymerase I, II, and
III
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.D.Friesen, 01-MAY-1990.
The Hospital for Sick Children
555 University Avenue
Toronto, Ontario, CANADA M5G 1X8
e-mail: jim@sickkids.toronto.edu
FEATURES from to/span description
pept 1026 1045 RNA polymerase II sixth subunit (RP026), exon
1
1122 1569 RNA polymerase II sixth subunit (RP026), exon
2
IVS 1046 1121 RP026 intron A
BASE COUNT 644 a 374 c 408 g 707 t
ORIGIN Chromosome XVI.
1 tctagaaaaa gacatatcag gacatatatg acacttccaa tttttggtta ctagtgagtc
61 gtcatgaata atcatgtgca tttgtaaacc attttccccc acacatggtt tgctacaaat
121 aggacatttt aatttaggat gatcattttt tatatgcgat tgcaattgtg accatatccg
181 aaactctttg caacaaccag caaaagtaca ttggtaagga ttttctacct caggatcgtg
241 atgtttagaa atgtgatttc ttagcctata tggcctctga aagcttttat tgcagtgtgg
301 acaggttagt ttatgtaaat gaacagataa aatatgtgcc cttaattgtg gatgcttgta
361 gaatcggagg ttgcatcctt cctctggaca aatgaaagat ttggtatgcg ttacttcgtg
421 tcgcttcagt tgctggcgag tcgtcactcc ttttccacaa taagaacatt ggaatggttt
481 cgtatcagaa tgcgtataca agtgtctctc taagtgactc tttttaacga aggattttgc
541 acactatcac actgaaatgc tcttaaaccc tgatgtacgc ttaattggtg ttcagtcaaa
601 attgaaggtc ttgtaaatgc ctatcacagc atcatagtca cagaaatatg tctttggcct
661 attgctcgat gagcttctag ttgatgttaa actgttcaat gattctgaac tctctgatcg
721 tgatatggga atggtctcct gtttaagttc ggccagtggc attccttcat tatttagaac
781 ctctcctccc atttgacagc gatttcaaca gttactactg tgcttaatgc ccttatttga
841 tagttagttc ttcttataat aaataatcat tgtatattga ttattcgtat agtatgatat
901 ttaattgaaa aaaaaatttt ttttttttta agaatatcat tcaaaggatc aatcacaacc
961 ttgaagaaag gctaaaaaga cacattttgc aggtaacagt gtaaagatta aggctacaag
1021 tgatcatgtc agactacgag gaggcgtatg tgcattcata ttggagaaat tgcgtgcttt
1081 agttttaatt atatttacta acttctatca ttgtgataca ggtttaacga cggtaatgag
1141 aacttcgaag attttgatgt agagcatttt tctgatgagg agacttatga ggaaaaacct
1201 caattcaagg atggtgaaac aaccgatgcc aacggtaaga ccatcgttac tggtggtaat
1261 ggcccagaag attttcaaca gcatgagcaa ataagaagaa agacacttaa ggaaaaggcc
1321 atcccaaaag accaaagagc aactactcca tacatgacca aatatgaaag ggcaagaatt
1381 ttaggtacaa gagccctaca aatttccatg aatgcgccag tttttgtaga tttagaaggt
1441 gaaactgatc cattgcgtat tgccatgaag gagttggctg aaaagaaaat tcctttggtt
1501 attagaagat atttaccaga tggttccttt gaggactgga gtgtggagga actcattgtg
1561 gatttgtgat tacacctgca ttttccttta tgtgtatttg ctttgtttgt atattattat
1621 aactataaag aaccgattta ttttgtaatg caaataactg gacgagagaa ccacgctata
1681 acctatttgg taataaaaat gtttgtatga ataaaaaagc tgctgactgc tttctttgat
1741 atgtgtatat agaatattta tcggtgattg atctaagaaa agcttacctc tgaaagttct
1801 ctcttcgtcg cttgctgatt tgtagtaaac aatttgaaaa gtttagcaaa ttcttcctca
1861 ggattttcaa aaccggcttc ctttaaagat caatgacctc attcaaggga acgtttaaat
1921 cgtgtcctct tccaatggct tttaaacttt cttccaaatt cttcacgctc ggggaaactg
1981 cgataagttc ttccccatga tagataagaa tatgggaaag cttacacctt cttctccaac
2041 ttccgcggta ctggtgtcat tgtcaggtac cattttggac cattcctcat ctgtcaatgt
2101 cttccccaat gttgcatata tcttggtaag atc
//
LOCUS MUS25OAS 1412 bp ss-mRNA ROD 28-JUN-1990
DEFINITION Mouse 2'-5' oligo A synthetase mRNA, complete cds.
ACCESSION M33863
KEYWORDS 2'-5' oligo A synthetase.
SOURCE Mouse cell line L-929, cDNA to mRNA, clone L3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1412)
AUTHORS Coccia,E.M., Nissim,A. and Chebat,J.
TITLE A full-length murine 2-5A synthetase cDNA transfected in NIH-3T3
cells impairs EMCV but not VSV replication
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by E.M.Coccia, 26-APR-1990.
FEATURES from to/span description
pept 36 1139 2'-5' oligo A synthetase
signal 1389 1394 poly-A signal
BASE COUNT 340 a 371 c 390 g 311 t
ORIGIN
1 ccaggctggg agacccagga agctccagac ttagcatgga gcacggactc aggagcatcc
61 cagcctggac gctggacaag ttcatagagg attacctcct tcccgacacc acctttggtg
121 ctgatgtcaa atcagccgtc aatgtcgtgt gtgatttcct gaaggagaga tgcttccaag
181 gtgctgccca cccagtgagg gtctccaagg tggtgaaggg tggctcctca ggcaaaggca
241 ccacactcaa gggcaggtca gacgctgacc tggtggtgtt ccttaacaat ctcaccagct
301 ttgaggatca gttaaaccga cggggagagt tcatcaagga aattaagaaa cagctgtacg
361 aggttcagca tgagagacgt tttagagtca agtttgaggt ccagagttca tggtggccca
421 acgcccggtc tctgagcttc aagctgagcg ccccccatct gcatcaggag gtggagtttg
481 atgtgctgcc agcctttgat gtcctgggtc atgttaatac ttccagcaag cctgatccca
541 gaatctatgc catcctcatc gaggaatgta cctccctggg gaaggatggc gagttctcta
601 cctgcttcac ggagctccag cggaacttcc tgaagcagcg cccaaccaag ctgaagagtc
661 tcatccgcct ggtcaagcac tggtaccaac tgtgtaagga gaagctgggg aagccattgc
721 ctccacagta cgccctagag ttgctcactg tctttgcctg ggaacaaggg aatggatgtt
781 atgagttcaa cacagcccag ggcttccgga ccgtcttgga actggtcatc aattatcagc
841 atcttcgaat ctactggaca aagtattatg actttcaaca ccaggaggtc tccaaatacc
901 tgcacagaca gctcagaaaa gccaggcctg tgatcctgga cccagctgac ccaacaggga
961 atgtggccgg tgggaaccca gagggctgga ggcggttggc tgaagaggct gatgtgtggc
1021 tatggtaccc atgttttatt aaaaaggatg gttcccgagt gagctcctgg gatgtgccga
1081 cggtggttcc tgtacctttt gagcaggtag aagagaactg gacatgtatc ctgctgtgag
1141 cacagcagca cctgcccagg agactgctgg tcaggggcat ttgctgctct gctgcaggcc
1201 catgacccag tgagggaggg ccccacctgg catcagactc cgtgcttctg atgcctgcca
1261 gccatgtttg actcctgtcc aatcacagcc agccttcctc aacagattca gaaggagagg
1321 aaagaacaca cgcttggtgt ccatctgtcc acctgttgga aggttctgtc tgacaaagtc
1381 tgatcaacaa taaaccacag caggtgccgt ca
//
LOCUS PSEHUTR 1604 bp ds-DNA BCT 28-JUN-1990
DEFINITION P.putida histidine utilization genes repressor protein (hut) gene,
complete cds.
ACCESSION M33922
KEYWORDS histidine utilization genes repressor protein.
SOURCE P.putida (ATCC 12633) DNA.
ORGANISM Pseudomonas putida
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1604)
AUTHORS Allison,S.L. and Phillips,A.T.
TITLE Nucleotide sequence of the gene encoding the repressor for the
histidine utilization genes of Pseudomonas putida
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.T.Phillips, 26-APR-1990.
FEATURES from to/span description
pept 167 913 histidine utilization genes repressor protein
(hut)
pept 910 1482 protein of unknown function
binding 46 65 hut binding site (put.)
binding 894 898 ribosomal binding site (put.)
site 82 95 operator site
signal 94 122 promoter (put.)
signal 1516 1543 transcription termination signal
BASE COUNT 295 a 484 c 513 g 312 t
ORIGIN
1 ggacatggct ggcccagccc gtaggcaaca gagcgcgttc ggcgaagtag gcggacatcg
61 gtcaaatcct gttattgtta acttgtatat acatatacag gcgtttgcct gccgggtaaa
121 ctgcggcaag ctaccgttca ttccctatgc acaaggatcc aacgccgtgc cgacacctcc
181 tgtctccgcg ctggttgccc agatgggcga gggcccggcg ccgctgtatg cccgggtcaa
241 acagatgatc atccagcaga tcgacaacgg cagctggccg ccgcatcacc gggtcccctc
301 ggagagtgaa ctggtcaacg agctaggctt cagccgcatg accatcaacc gtgccctgcg
361 cgaactcacg gccgacggcc tgctggtgcg catgcagggg gtcggcacgt tcgtagccga
421 gccaaagggc cgttcggcgt tgttcgaagt caacaacatt gccgatgaaa ttgccgcgcg
481 cggccatcag catagctgcc aggtgatcac gctcaccgag gaagcagccg gttccgaacg
541 ggccctggcc ctggacatgc gtgaaggcca gcgggtgttc cactcgctga tcgtgcattt
601 cgagaacggc gtgccggtgc agatcgagga ccgctacgtc aacgccgcga tcgcacccga
661 ctacctcaag caggatttca cccggcagac gccatatgcc tacctgtccc aggtagcgcc
721 gctgaccgag ggtgagcacg tggtcgaagc catcctggcc gagccggaag aatgccgcct
781 gctgcagatc gagcggggcg aaccttgcct gctgatccgc cgtcgtactt ggtccggccg
841 ccagccggta accgcggcgc ggctgatcca ccccggttcc cgtcatcgcc tggaaggacg
901 tttcagcaaa tgagccagct gcagttgttg cgcgcacagg attacccgcg catgccgtgg
961 aagaacggtg gcggtttcac cgaagagatc acccgcgaca gtggagaggg cctggacggc
1021 tttggctggc gcctgtcgat tgccgatatc gaagagtctg gcggcttttc caccttcgcc
1081 ggttaccagc ggatcatcac cgtgctgcag ggcgatggca tgcgcctgtt ggtcgatggc
1141 cagcccagcc ggccgttgct gccgttcgat gcctttgcct tcagcggcga aagccaggtc
1201 agctgcaagc tgctgggtgg ggcgatccgc gatttcaacc tgatctatgc accgcaacgg
1261 taccgggcga ggttgcagtg gtttgatggc acgagccgtt tgtacagctc ggcgtcgaca
1321 gtgctgttgt ttgctgccag cagtcacgtg gaagtgtcca tggcggggcg tgaggtgcag
1381 cggttggggt tgtatgactg cctgcggctg gagggcaacg atgagttgct tgggctggaa
1441 gttcaggggc ggttttgctt gattgagctc atttctcgct gatgggcttg gcgatacatt
1501 ttcatcgcct gtgagatcga gcgccgcgcg ggcggcgctc gatttgcgcg ccgccgcaaa
1561 actcaagccg gaccgacgct cgcttcaccc ccccaaaaaa aatc
//
LOCUS MUSCR2 427 bp ss-mRNA ROD 28-JUN-1990
DEFINITION M.musculus complement receptor type 2 (CR2) mRNA, 3' end.
ACCESSION M35685 J04153
KEYWORDS complement receptor type 2.
SOURCE M.musculus (strains BALB/c, DBA/2, CBA/J and C57B10.D2/OSN
C5-deficient) and Swiss nude (strain nu/nu), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 427)
AUTHORS Fingeroth,J.D., Benedict,M.A., Levy,D.N. and Strominger,J.L.
TITLE Identification of murine complement receptor type 2
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 242-246 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.D.Fingeroth, 13-JAN-1989.
FEATURES from to/span description
pept < 1 285 complement receptor type 2
BASE COUNT 148 a 95 c 76 g 108 t
ORIGIN Chromosome 1.
1 tgtttacaac cgaagatcaa aagagggcaa atattatcta ttttgaaaga tagttattca
61 tataatgaca ctgtggcatt ttcttgtgaa cctggcttca ccttgaaggg caacaggagc
121 attcgatgca atgctcatgg cacatcggag ccaccggtac cagtgtgtga aaaaggaaaa
181 cacagagaaa gcaattatta tacaaagaca agacccaaag aaggagctct tcatttagaa
241 acacgagaag tatattctat tgatccatat aacccagcaa gctgatgaca tgacaaatca
301 agatgtagaa ctctcagcta cctcttcagc accatatctg cttacatgcc accaagctac
361 cctccacgac aataatggac taaacctctg atttgtaagc cagccccaat taaatgtttt
421 tctctat
//
LOCUS RABCYP2C16 2006 bp ss-mRNA MAM 28-JUN-1990
DEFINITION Rabbit cytochrome P450IIC16 (CYP2C16) mRNA, complete cds.
ACCESSION M29968
KEYWORDS cytochrome P450; monooxygenase.
SOURCE Rabbit (strain New Zealand White) adult liver, cDNA to mRNA.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 2006)
AUTHORS Hassett,C. and Omiecinski,C.J.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Hassett, 20-NOV-1989.
Author Address [1]: C.Hasset
University of Washington
Department of Environmental Health SC-34
Seattle, WA 98195
FEATURES from to/span description
pept 66 1529 cytochrome P450IIC16 (CYP2C16)
mRNA < 1 2006 CYP2C16 mRNA
signal 1985 1989 poly-A signal
BASE COUNT 524 a 474 c 472 g 536 t
ORIGIN
1 cggcatcggt accaaataag atagacagtg ctactcagaa atccaagaaa atggtggaag
61 aaataatgga tccagttgtg gtcctggtgt tgggtctctg ctgtttgctt ctcctttcac
121 actggaagca aaattccggg agggggaagc tccctcccgg ccccactcct ttccccatta
181 ttggaaatat tctccagata gatgctaagg acatcagcaa atccctaact aagttctcag
241 aacgctatgg ccccgtgttc actgtgtatc tgggcatgaa gcccgctgta gtgctgcatg
301 gataccaggc agtgaaggag gccctggttg atcttggaga ggagtttgct ggaagaggca
361 gttttcctat gcttgataaa gttagtaagg gactcggaat cgttttcacc aatggaaaga
421 gatggaaaga gatccggcgc ttctcgctca tgaccctgcg gaatttcggg atggggaaga
481 ggagcattga ggaccgagtt caagaggagg cccgctgcct ggtggaggag ctgagaaaaa
541 ccaacgcctc accctgtgat cccaccttta tcctgggctg tgctccctgc aatgtgatct
601 gctccattat tttccataat cgctttgatt ataaagatga ggagtttctt aaactattgg
661 aaaaattcaa tgaaaatgtt aggattctga gttctccatg gttgcaggtc tgcaataatt
721 tccctgctct tattgattac ttaccaggaa gtcataagac cttactaaag aattctgatt
781 atgtgaaaaa ttttattatg gagaaagtga aggaacacca aaaattcctg gatgttaaca
841 atcctcggga ctttatagat tgtttcttga tcaaaatgga gcaggaaaac catttggagt
901 tcactcttga aagcttggta accactgtgt ttgatttgtt tggagctggg actgagacaa
961 cgagcacaac gctgagatac tccctcctgc tcctgctgaa gcaccccgag gtcgcagata
1021 aagtgcagga ggagattgag cgtgtgattg gcaggcaccg gagcccctgc atgcaggaca
1081 ggagccgcat gccttacaca gatgccgtaa tacatgagat ccagagattc attgacctgg
1141 tccccaataa tctgccccac acagtgaccc gtgacattaa attcagaaac tactttatcc
1201 ccaagggtac ggacatcatg acatcactga catccgtgct acatgatgaa aaagcatttc
1261 ctaacccaaa ggtatttgac cctggacact ttctggatga gagtggcaac ttcaagaaga
1321 gtgactactt catgcctttc tcagcaggaa aacggatctg tgtgggagag gccctggccc
1381 gcatggagct gtttttgttc ctgacctcca ttttgcagaa ctttaaactg caatctctgg
1441 ttgagccaaa ggacctggac atcactgcag ttctcaatgg atttgtttct gtgccacctt
1501 cgttccagct ctgcttcatt cctgtttgaa aaggagcaga ctggcttcta ctgtgccatc
1561 atttcaaagg cattgcccat caccttactg catttgagac acttctttaa cttttctcac
1621 atcttactat tcccttaaga tctagtgaaa acctaacttc tgtgggtgat cccctgagac
1681 tgcctgccct gaccatgcaa gaggtagaga gggcatggca agccatgctc ctgggaggga
1741 ccccacagcc tggctgctgg caggtggcgg gacccaggca catttctctc cattcctgcc
1801 tgtcaggtaa actgctccta gctgtgtcca aagcccatca agaaagctac cgtaggctat
1861 gtgaccttca agatgattgt aggagcatat cagtaccaat attgcctcta tcctatagaa
1921 ttagtactgc cctgaattag ttacaccctt tctgcctgcc ctttagaaag tgtgcatgct
1981 cattaataaa gtggatgcat tcactg
//GenBank-Updates@genbank.bio.net (06/29/90)
LOCUS MUSTKBLK 2094 bp ss-mRNA ROD 29-JUN-1990
DEFINITION Mouse B lymphocyte kinase (blk) mRNA, complete cds.
ACCESSION M30903
KEYWORDS B lymphocyte kinase; tyrosine kinase.
SOURCE Mouse (strain C57B410, haplotype b) B-lymphoid hybridoma cell line
LB27.4 + 71.67, cDNA to mRNA, clones 102, 103, 201, 205, 215.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (sites)
AUTHORS Dymecki,S.M., Niederhuber,J.E. and Desiderio,S.V.
TITLE Specific expression of a tyrosine kinase gene, blk, in B lymphoid
cells
JOURNAL Science 247, 332-336 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2094)
AUTHORS Dymecki,S.M., Niederhuber,J.E. and Desiderio,S.V.
JOURNAL Unpublished (1990) Howard Hughes Med. Inst., Baltimore MD 21205
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by S.V.Desiderio, 21-DEC-1989.
FEATURES from to/span description
pept 350 1849 B lymphocyte kinase
BASE COUNT 485 a 542 c 607 g 460 t
ORIGIN
1 cagcctggct gaaggtcagg tgcccatgaa acttctcaga aacacctccg gggctgagga
61 gccagaagtc tccagaagac ttgactggag ggcagagctc ccactgagct gcttcttagg
121 gctgttcatg gcggtctgat cacaacaggg ggcacacaca cctctgtgtg ctgttggcag
181 gaagccacaa gccactgaag ctgactgaga tgagaagaat ccatctgggg atgggattgg
241 cttttcttaa ctgggtccag agcttctggt tgccgctaga aagttcttca accacacaga
301 ccctgggatc tgcacagcta tgttgagaag ctactgaagc cctgcaagga tggggctgct
361 gagcagcaag aggcaggtca gtgagaaggg caagggctgg agccccgtga agatccgcac
421 ccaggacaag gctcccccac ccctgccacc cctggttgtc ttcaaccacc ttgccccacc
481 atctcctaac caggacccag atgaagagga gcgttttgtg gtggctctgt ttgactatgc
541 cgctgtgaat gacagggacc ttcaggtgct gaagggtgag aagctccagg tcttgaggag
601 cactggagac tggtggttgg ccaggtcact cgtcacagga agagaaggtt atgtgcccag
661 caactttgtg gccccagtag agactctgga agtagaaaaa tggttcttca ggaccatcag
721 ccggaaggat gctgagaggc agttgctggc tccgatgaac aaggccggct cctttctcat
781 cagagagagt gagagcaata aaggtgcctt ttccctgtcc gtgaaagata tcaccaccca
841 gggggaggtg gtcaagcact ataagatccg atcactggac aatggaggct attacatctc
901 cccccggatc acctttccca ccctccaggc cctggtgcag cactattcaa agaaagggga
961 tggtttgtgt cagaagttga ctctgccctg tgtgaacctg gccccgaaga acctttgggc
1021 ccaagatgaa tgggaaatcc ccaggcagtc tctcaagttg gtccggaaac ttgggtctgg
1081 gcagtttggc gaagtctgga tgggttatta caaaaataac atgaaggtgg ccatcaagac
1141 cctgaaggag ggaaccatgt caccggaagc tttcctgggc gaggccaacg tgatgaaaac
1201 cctgcagcat gagaggctgg ttcgtctcta cgctgtggtc accagagagc ccatttacat
1261 cgtcactgaa tacatggcca gaggatgctt gctggatttt ctgaagaccg atgaaggtag
1321 caggttgtcc cttccaaggc tgattgacat gtcagcccag gttgcagagg ggatggctta
1381 catagagcgc atgaattcca tccaccgtga cctgcgggca gccaacatcc tggtgtctga
1441 gacgttgtgc tgcaaaatcg ctgacttcgg cttggccagg atcattgaca gtgaatacac
1501 tgcccaagag ggggccaagt tccccatcaa gtggaccgcc ccggaggcca tccacttcgg
1561 ggtgtttacc atcaaggctg atgtgtggtc cttcggagtc ttgctgatgg tgattgtcac
1621 ctatgggcgc gttccctacc caggaatgag caaccctgag gtcatccgta gcctggagca
1681 cggctaccga atgccatgcc cggagacatg tccaccggag ttgtacaatg atatcatcac
1741 tgagtgctgg cggggccggc cagaggagcg gcctaccttt gagttcctgc agtcggtgtt
1801 ggaggacttc tacacagcca cggagggcca atatgagctg cagccctagc aaccaatggt
1861 ctcttcaggg ctcccagcct gagcctcctt ctggactctg actttgggtt aatatgttca
1921 cacttagctc ttcttccagg taggttgcag aatcagggaa gaatggtgtg cctgcttctc
1981 aagtgaggga gccaggagac aaagatggct tcatggtgag gcatgagctc tcaaggcaca
2041 acctcggtcc acctttcact ggctgtgtgc ctttggccaa gttactttgc tgcg
//
LOCUS CHKSUH2AC 75 bp ds-DNA SYN 29-JUN-1990
DEFINITION Chicken and sea urchin chimeric TATA-H2A-c gene.
ACCESSION J02525
KEYWORDS conalbumin; histone.
SOURCE Chicken and sea urchin DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 75)
AUTHORS Grosschedl,R., Wasylyk,B., Chambon,P. and Birnstiel,M.L.
TITLE Point mutation in the TATA box curtails expression of sea urchin
H2A histone gene in vivo
JOURNAL Nature 294, 178-180 (1981)
STANDARD full staff_review
COMMENT The TATA-H2A-c gene was formed by replacing the "TATA box" of sea
urchin H2A with that of chicken conalbumin. A single base change
from 't' to 'g' in the third position of the "TATA box" reduces the
rate of transcription by 80%.
FEATURES from to/span description
mRNA 68 > 75 H2A-c mRNA
recomb 23 24 sea urchin DNA/BamHI linker
recomb 25 26 BamHI linker/chicken DNA
mut 41 41 t in wild type; g in "TAGA" mutation
recomb 60 61 chicken DNA/sea urchin DNA
BASE COUNT 20 a 21 c 21 g 13 t
ORIGIN 68 bp upstream of H2A-c mRNA start
1 tcgctgaccg gtctctccga tccggagggc tgctcctcta taaaagggga agaaagagga
61 accattcaag ccagc
//
LOCUS CPVW8 70 bp ds-DNA VRL 29-JUN-1990
DEFINITION CPV-BR white pock mutant recombination junction, variant W8.
ACCESSION M11471
KEYWORDS recombination joint.
SOURCE Cow pox virus (strain Brighton red; CPV-BR) DNA, passed in
chorioallantoic membrane (CAM) of 11-day old chicken embryo.
ORGANISM Cowpox virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 70)
AUTHORS Pickup,D.J., Ink,B.S., Parsons,B.L., Hu,W. and Joklik,W.K.
TITLE Spontaneous deletions and duplications of sequences in the genome
of cowpox virus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 6817-6821 (1984)
STANDARD full staff_review
COMMENT In the white pock variants of cowpox virus, 32 to 38 kb of the 3'
end (region 2) were replaced by inverted copies, 21 to 50 kb long,
of the 5' end (region 1) of the genome. These variants possess
inverted terminal repeats. The limits of the inner ends of the
deletions appear to be defined by the location of the nearest
essential gene on one end and the location of the gene that encodes
pock-redness on the other. The sequences adjacent to the
recombination/transfer sites show no homology to each other or any
unusual structural feature. At the recombination site of variant
W8 one base pair "t" is shared by the two combining sequences.
FEATURES from to/span description
recomb 36 37 region 1 end/region 2 start (crossover event)
BASE COUNT 19 a 11 c 14 g 26 t
ORIGIN Undetermined number of bp downstream of EcoRI site.
1 ctttttattg agtggtggta gttacggata tctaatttat ccatccagta tgggtataca
61 acacgaattc
//
LOCUS DRORGM108 1059 bp ds-DNA INV 29-JUN-1990
DEFINITION D.melanogaster 28S rRNA gene with 1.006 kb type I insertion.
ACCESSION K03141
KEYWORDS insertion sequence.
SOURCE D.melanogaster ribosomal DNA, clone MB27.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1059)
AUTHORS Roiha,H. and Glover,D.M.
TITLE Duplicated rDNA sequences of variable lengths flanking the short
type I insertions in the rDNA of Drosophila melanogaster
JOURNAL Nucleic Acids Res. 9, 5521-5532 (1981)
STANDARD simple staff_review
COMMENT The rRNA genes of D.melanogaster can contain two types of
non-homologous insertion sequences, which are unlike intervening
sequences found in other genes. These insertion sequences appear
to have a repressive effect on the transcription of rRNA genes.
FEATURES from to/span description
rpt 24 30 rRNA duplication
rpt 1037 1043 28S rRNA duplication
recomb 30 31 28S rRNA end/insertion sequence start
recomb 1036 1037 insertion sequence end/28S rRNA start
BASE COUNT 237 a 244 c 311 g 265 t 2 others
ORIGIN 164 bp upstream of BstI site.
1 gaatggatta acgagattcc tactgtccct gtcttagctg ggagcagagg aagactcgcc
61 tagaggagtg tttaatccag agttggcaga acagatggga cgatgacagc gaaccaggac
121 gggtgacgca taggtttatc ccatacgtta gtcttgccta tcgggatcca agttttggat
181 tctcgatgag gacgtctttc ctgcttacag ggcacgggtc gttcaatgca tttttgcacg
241 ggagagccct cagcgatacc actgcttgcg catgtggcga tccatatgag gactggatgc
301 atatcttgtg cgcttgcccc ctatatgcag atctgcggga cctagatgga cttggagtgc
361 agcgccttgg cgaaactgga tcttcgaggg aatcctcgat gatcaagaga agactcaacg
421 gctggcaatg tttgcggaag aagtgttcct gaggaggagg ggcgtttagc tcagtcatct
481 ctcgcgtgtg gtttagcggg cgagaatact accacagttc cgctgttgct tgtcgtaaga
541 gacgactaat acagcgatag gattcctcta accctgcttg tcggagcaaa agggggaggc
601 ccaccgagcc tcttttcggt accacgggtt gagcagctat ccaagactgc tcattgaggt
661 aggccccctg gtgggagtat cgtggtggct gtggttggta cccatatcgc gggtagagcc
721 ttcatgctcg acgtttgagt tacggtgcta gttgcgcaaa actcgggtgc tgtgacccag
781 agatcagtag agattttagg tagatctcgc tcctcagcaa gggggagtgc ttgcccggca
841 agcangntac tcgaattgct accggggtgg tcgctatgta catagctata gcttctagtc
901 cgggacgctt gtctggcgta tccagacaca tgcaccatat gctcacttgt gggtgtatag
961 ggtgccgtgg ttgtaatccc ttcagtgtgg aacacgccac gtaaaataag ttcggaggga
1021 tccgaaaagc atacattgtc cctatctact atctagcaa
//
LOCUS ECOLACIS 113 bp ds-DNA BCT 29-JUN-1990
DEFINITION E.coli lacZ and IS1T insertion junction.
ACCESSION K03504
KEYWORDS insertion element IS1T; lacZ/IS1 fusion protein.
SOURCE E.coli K12 DNA, clone pCSH11.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 113)
AUTHORS Malamy,M.H., Rahaim,P.T., Hoffman,C.S., Baghdoyan,D.,
O'Connor,M.B. and Miller,J.
TITLE A frameshift mutation at the junction of an IS1 insertion within
lacZ restores beta-galactosidase activity via formation of an
active lacZ-IS1 fusion protein
JOURNAL J. Mol. Biol. 181, 551-555 (1985)
STANDARD full staff_review
COMMENT The site of integration of IS1 into lacZ is found 18 nucleotides
from the end of the sequence determining the C-terminus of
beta-galactosidase. Reversion to lac+, promoted by ICR191, results
from the loss of a "g" from the "ggg" sequence located at the
junction of lacZ and IS1. As a result an active but
temperature-sensitive lacZ/IS1 fusion protein is formed containing
six amino acids derived from IS1 which replace six amino acids
encoded by lacZ
FEATURES from to/span description
pept < 1 57 lacZ/IS1 fusion protein (AA at 1)
recomb 37 38 E.coli DNA end/IS1 DNA start
BASE COUNT 24 a 25 c 28 g 36 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccagc tgagcgccgg tcgctaccat taccaggtaa tgctgccaac ttactgattt
61 agtgtatgat ggtgatttta aggtgcttgc gtggcttcca tttccatcag atg
//
LOCUS ECOOMPASA 70 bp ds-DNA SYN 29-JUN-1990
DEFINITION E.coli outer membrane protein ompA gene-plasmid pBR322 junction
(synthetic), clone pOMPA21.
ACCESSION J02534
KEYWORDS artificial gene; membrane protein; outer membrane protein.
SOURCE E.coli and plasmid pBR322 DNA, clone pOMPA21.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 70)
AUTHORS Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E.,
Kurz,C. and Schaller,H.
TITLE Export of a protein into the outer membrane of Escherichia coli
K12: Stable incorporation of the OmpA protein requires less than
193 amino-terminal amino-acid residues
JOURNAL Eur. J. Biochem. 122, 223-231 (1982)
STANDARD full staff_review
COMMENT [1] explored to what extent ompA gene can be shortened without
impairing outer membrane incorporation of resulting protein
fragments, and found that more than 133 NH2-terminal residues of
the mature OmpA protein suffice but are unstable, while 193 such
residues are stable.
FEATURES from to/span description
pept < 1 70 OmpA' protein (modified OmpA; aa at 2)
recomb 13 14 E.coli end/pBR322 start
BASE COUNT 15 a 17 c 24 g 14 t
ORIGIN 13 bp upstream of BamHI site
1 gctgagcaac ctggatccac aggacgggtg tggtcgccat gatcgcgtag tcgatagtgg
61 ctccaagtag
//
LOCUS ECOOMPASB 66 bp ds-DNA SYN 29-JUN-1990
DEFINITION E.coli outer membrane protein ompA gene / AD3 insert /
plasmid pBR322 synthetic construct, clone pOMPA21-11.
ACCESSION J02532
KEYWORDS artificial gene; membrane protein; outer membrane protein.
SOURCE E.coli, AD3 and plasmid pBR322 DNA, clone pOMPA21-11.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 66)
AUTHORS Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E.,
Kurz,C. and Schaller,H.
TITLE Export of a protein into the outer membrane of Escherichia coli
K12: Stable incorporation of the OmpA protein requires less than
193 amino-terminal amino-acid residues
JOURNAL Eur. J. Biochem. 122, 223-231 (1982)
STANDARD full staff_review
COMMENT See comments for first entry in this series. AD3 is a derivative of
a synthetic lac operator.
FEATURES from to/span description
pept < 1 42 OmpA' (modified OmpA protein)
recomb 13 14 E.coli end/AD3 insert start
recomb 60 61 AD3 insert end/pBR322 start
BASE COUNT 19 a 14 c 17 g 16 t
ORIGIN 13 bp upstream of BamHI site
1 gcgatcactc ctggatccgg aattcaaatt gtgagcggat aacaatttga attccggatc
61 gcgaag
//
LOCUS ECOOMPASC 66 bp ds-DNA SYN 29-JUN-1990
DEFINITION E.coli outer membrane protein ompA gene / AD3 insert /
plasmid pBR322 synthetic construct, clone pOMPA21-16.
ACCESSION J02533
KEYWORDS artificial gene; membrane protein; outer membrane protein.
SOURCE E. coli, AD3 and plasmid pBR322 DNA, clone pOMPA21-16.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 66)
AUTHORS Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E.,
Kurz,C. and Schaller,H.
TITLE Export of a protein into the outer membrane of Escherichia coli
K12: Stable incorporation of the OmpA protein requires less than
193 amino-terminal amino-acid residues
JOURNAL Eur. J. Biochem. 122, 223-231 (1982)
STANDARD full staff_review
COMMENT See comments for first entry in this series. AD3 is a derivative of
a synthetic lac operator.
FEATURES from to/span description
pept < 1 34 OmpA' (modified OmpA; aa at 2)
recomb 13 14 E.coli end/AD3 insert start
recomb 60 61 AD3 insert end/pBR322 start
BASE COUNT 22 a 14 c 15 g 15 t
ORIGIN 20 bp upstream of EcoRI site
1 acagaccaag cacgatccgg aattcaaatt gtgagcggat aacaatttga attccggatc
61 tgatcg
//
LOCUS HUMIGLAM2 872 bp ss-mRNA PRI 29-JUN-1990
DEFINITION Human Ig rearranged lambda-chain mRNA VJC-region subgroup lambda-IV
from heterohybridoma H6-3C4.
ACCESSION M18645
KEYWORDS immunoglobulin; immunoglobulin light chain; lambda-immunoglobulin;
variable region; variable region subgroup lambda-IV.
SOURCE Human hybridoma H6-3C4 (cell fusion of mouse myeloma NS1 cells and
human peripheral blood lymphocytes), cDNA to mRNA, clone
p3C4-lambda-4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 872)
AUTHORS Yamasaki,N., Komori,S. and Watanabe,T.
TITLE Complementary DNA for a human subgroup IV immunoglobulin
lambda-chain
JOURNAL Mol. Immunol. 24, 981-985 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 31 732 Ig VJC-region
recomb 90 91 V-region start
recomb 375 376 V-region end/J-region start
recomb 412 413 J-region end/C-region start
recomb 729 730 C-region end
BASE COUNT 211 a 269 c 220 g 172 t
ORIGIN 254 bp upstream of BamHI site.
1 tcagaagcag agttctgggg tgtctccacc atggcctgga cccctctctg gctcactctc
61 ttcactcttt gcataggttc tgtggtttct tctgaactga ctcaggaccc tgttgtgtct
121 gtggccttgg gacagacagt caggatcaca tgccaaggag acagcctcag aacctatcat
181 gcaagctggt accagcagaa gccaagacag gcccctgtac ttgtcatcta tgatgaaaac
241 aaccggccct cagggatccc agaccgattc tctggctcca cctcaggaaa cacagcttcc
301 ttgaccatca ctggggctca ggcggaagat gaggctgact attactgtaa ctcccgggac
361 agcagtggta accgtctggt attcggcgga gggaccaagc tgaccgtcct aggtcagccc
421 aaggctgccc cctcggtcac tctgttcccg ccctcctctg aggagcttca agccaacaag
481 gccacactgg tgtgtctcat aagtgacttc tacccgggag ccgtgacagt ggcctggaag
541 ggagatagca gccccgtcaa ggcgggagtg gagaccacca caccctccaa acaaagcaac
601 aacaagtacg cggccagcag ctatctgagc ctgacgcctg agcagtggaa gtcccacaga
661 agctacagct gccaggtcac gcatgaaggg agcaccgtgg agaagacagt ggcccctaca
721 gaatgttcat aggttctcaa ccctcacccc caccacggga gactagagct gcaggatccc
781 aggggagggg tctctcctcc caccccaagg catcaagccc ttctccctgc actcaataaa
841 ccctcaataa atattctcat tgtcaaggaa tt
//
LOCUS HUMTCAXU 401 bp ss-mRNA PRI 29-JUN-1990
DEFINITION Human T-cell receptor active alpha-chain mRNA, clone HAP36
(V-alpha-9.1 family; J-alpha-H family).
ACCESSION M13737
KEYWORDS T-cell receptor; T-cell receptor alpha-chain.
SOURCE Human adult PHA-stimulated peripheral blood T lymphocyte, cDNA to
mRNA, clone HAP36.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 401)
AUTHORS Yoshikai,Y., Kimura,N., Toyonaga,B. and Mak,T.W.
TITLE Sequences and repertoire of human T-cell receptor alpha chain
variable region genes in mature T lymphocytes
JOURNAL J. Exp. Med. 164, 90-103 (1986)
STANDARD simple staff_review
FEATURES from to/span description
ORF < 1 > 401 T-cell receptor active alpha-chain precursor
recomb 321 322 V region end/D-J region start
recomb 387 388 D-J region end/C region start
BASE COUNT 115 a 102 c 87 g 97 t
ORIGIN
1 atgaagccca ccctcatctc agtgcttgtg ataatattta tactcagagg aacaagagcc
61 cagagagtga ctcagcccga gaagctcctc tctgtcttta aaggggcccc agtggagctg
121 aagtgcaact attcctattc tgggagtcct gaactcttct ggtatgtcca gtactccaga
181 caacgcctcc agttactctt gagacacatc tctagagaga gcatcaaagg cttcactgct
241 gaccttaaca aaggcgagac atctttccac ctgaagaaac catttgctca agaggaagac
301 tcagccatgt attactgtgc tctaagtgtt tataaccagg gaggaaagct tatcttcgga
361 cagggaacgg agttatctgt gaaacccaat atccagaacc c
//
LOCUS HUMTCAYR 406 bp ss-mRNA PRI 29-JUN-1990
DEFINITION Human T-cell receptor active alpha-chain V-region (V-J-C) mRNA,
partial cds, clone AA25.
ACCESSION M17657 J02992
KEYWORDS T-cell receptor alpha-chain; antigen receptor; constant region;
joining region; processed gene; variable region.
SOURCE Human peripheral blood lymphocyte, cDNA to mRNA, clone AA25.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 406)
AUTHORS Klein,M.H., Concannon,P., Everett,M., Kim,L.D.H.,
Hunkapiller,T. and Hood,L.
TITLE Diversity and structure of human T-cell receptor alpha-chain
variable region genes
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6884-6888 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 > 406 T-cell receptor alpha-chain V-region (V-J-C)
precursor /nomgen="TCRA" /map="14q11.2"
/hgml_locus_uid="LX0123X"
sigp 1 69 T-cell receptor alpha-chain signal peptide
matp 70 > 403 T-cell receptor alpha-chain
recomb 340 341 TCR V-region DNA end/TCR J-region DNA start
recomb 400 401 TCR J-region DNA end/TCR C-region DNA start
BASE COUNT 104 a 95 c 98 g 109 t
ORIGIN Chromosome 14q11.2.
1 atgatgatat ccttgagagt tttactggtg atcctgtggc ttcagttaag ctgggtttgg
61 agccaacgga aggaggtgga gcaggatcct ggacccttca atgttccaga gggagccact
121 gtcgctttca actgtactta cagcaacagt gcttctcagt ctttcttctg gtacagacag
181 gattgcagga aagaacctaa gttgctgatg tccgtatact ccagtggtaa tgaagatgga
241 aggtttacag cacacgtcaa tagagccagc cagtatattt ccctgctcat cagagactcc
301 aagctcagtg attcagccac ctacctctgt gtggtgaaca ttcgcccagg aaacacacct
361 cttgtctttg gaaagggcac aagactttct gtgattccaa atatcc
//
LOCUS LAMCG 48502 bp ds-DNA circular PHG 29-JUN-1990
DEFINITION Bacteriophage lambda, complete genome.
ACCESSION J02459 M17233
KEYWORDS DNA binding protein; circular; coat protein; complete genome;
origin of replication; repressor; unidentified reading frame.
SOURCE Lambda wild-type and lambda strain cI857s7.
ORGANISM Bacteriophage lambda
Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
REFERENCE 1 (bases 37224 to 37940)
AUTHORS Sauer,R.T.
TITLE DNA sequence of the bacteriophage lambda cI gene
JOURNAL Nature 276, 301-302 (1978)
STANDARD full staff_review
REFERENCE 2 (bases 37990 to 38982)
AUTHORS Schwarz,E., Scherer,G., Hobom,G. and Kossel,H.
TITLE Nucleotide sequence of cro, cII and part of the O gene in phage
lambda DNA
JOURNAL Nature 272, 410-414 (1978)
STANDARD full staff_review
REFERENCE 3 (bases 34957 to 35615; r-strand)
AUTHORS Franklin,N.C. and Bennett,G.N.
TITLE The N protein of bacteriophage lambda, defined by its DNA sequence,
is highly basic
JOURNAL Gene 8, 107-119 (1979)
STANDARD full staff_review
REFERENCE 4 (bases 37768 to 40293)
AUTHORS Schwarz,E., Scherer,G., Hobom,G. and Kossel,H.
TITLE The primary structure of the phage lambda P gene completes the
nucleotide sequence of the plasmid lambda-dvh93
JOURNAL Biochem. Int. 1, 386-394 (1980)
STANDARD full staff_review
REFERENCE 5 (bases 27724 to 29275)
AUTHORS Hoess,R.H., Foeller,C., Bidwell,K. and Landy,A.
TITLE Site-specific recombination functions of bacteriophage lambda: DNA
sequence of regulatory regions and overlapping structural genes for
int and xis
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 2482-2486 (1980)
STANDARD full staff_review
REFERENCE 6 (bases 43682 to 45218)
AUTHORS Daniels,D.L. and Blattner,F.R.
TITLE Nucleotide sequence of the Q gene and the Q to S intergenic region
of bacteriophage lambda
JOURNAL Virology 117, 81-92 (1982)
STANDARD full staff_review
REFERENCE 7 (bases 1 to 48502; fragments)
AUTHORS Sanger,F., Coulson,A.R., Hong,G.F., Hill,D.F. and Petersen,G.B.
TITLE Nucleotide sequence of bacteriophage lambda DNA
JOURNAL J. Mol. Biol. 162, 729-773 (1982)
STANDARD full staff_review
REFERENCE 8 (review; complete genome)
AUTHORS Daniels,D.L., Schroeder,J.L., Szybalski,W., Sanger,F. and
Blattner,F.R.
TITLE Appendix I: A molecular map of coliphage lambda
JOURNAL (in) Hendrix,R.W., Roberts,J.W., Stahl,F.W. and
Weisberg,R.A. (Eds.);
Lambda II: 469-517;
Cold Spring Harbor Laboratory, New York (1983)
STANDARD full staff_review
REFERENCE 9 (review; complete genome with annotation)
AUTHORS Daniels,D.L., Schroeder,J.L., Szybalski,W., Sanger,F.,
Coulson,A.R., Hong,G.F., Hill,D.F., Petersen,G.B. and Blattner,F.R.
TITLE Appendix II: Complete annotated lambda sequence
JOURNAL (in) Hendrix,R.W., Roberts,J.W., Stahl,F.W. and Weisberg,R.A.
(Eds.);
Lambda II: 519-674;
Cold Spring Harbor Laboratory, New York (1983)
STANDARD full staff_review
REFERENCE 10 (bases 45493 to 45963)
AUTHORS Imada,M. and Tsugita,A.
TITLE Amino acid sequence of lambda phage endolysin
JOURNAL Nature New Biol. 233, 230-231 (1971)
STANDARD full staff_review
REFERENCE 11 (bases 44588 to 44780)
AUTHORS Lebowitz,P., Weissman,S.M. and Radding,C.M.
TITLE Nucleotide sequence of a ribonucleic acid transcribed in vitro from
lambda phage deoxyribonucleic acid
JOURNAL J. Biol. Chem. 246, 5120-5139 (1971)
STANDARD full staff_review
REFERENCE 12 (bases 1 to 12; both strands)
AUTHORS Wu,R. and Taylor,E.
TITLE Nucleotide sequence analysis of DNA. II. Complete nucleotide
sequence of the cohesive ends of bacteriophage lambda DNA
JOURNAL J. Mol. Biol. 57, 491-511 (1971)
STANDARD full staff_review
REFERENCE 13 (bases 38597 to 38672)
AUTHORS Dahlberg,J.E. and Blattner,F.R.
TITLE In vitro transcription products of lambda DNA: Nucleotide sequences
and regulatory sites
JOURNAL (in) Fox,C.F. and Robinson,W.S. (Eds.);
Virus Research. Proceedings of 1973 ICN-UCLA Symposium: 533-544;
Academic Press, New York (1973)
STANDARD full staff_review
REFERENCE 14 (sites; fragments at the 3'-terminus)
AUTHORS Weigel,P.H., Englund,P.T., Murray,K. and Old,R.W.
TITLE The 3'-terminal nucleotide sequences of bacteriophage lambda DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 70, 1151-1155 (1973)
STANDARD full staff_review
REFERENCE 15 (bases 35434 to 35618)
AUTHORS Dahlberg,J.E. and Blattner,F.R.
TITLE Sequence of the promoter-operator proximal region of the major
leftward of bacteriophage lambda
JOURNAL Nucleic Acids Res. 2, 1441-1458 (1975)
STANDARD full staff_review
REFERENCE 16 (bases 35583 to 35600)
AUTHORS Kleid,D.G., Agarwal,K.L. and Khorana,H.G.
TITLE The nucleotide sequence in the promoter region of the gene N in
bacteriophage lambda
JOURNAL J. Biol. Chem. 250, 5574-5582 (1975)
STANDARD full staff_review
REFERENCE 17 (bases 37945 to 38018)
AUTHORS Maniatis,T., Jeffrey,A. and Kleid,D.G.
TITLE Nucleotide sequence of the rightward operator of phage lambda
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 72, 1184-1188 (1975)
STANDARD full staff_review
REFERENCE 18 (bases 44588 to 44773)
AUTHORS Sklar,J., Yot,P. and Weissman,S.M.
TITLE Determination of genes, restriction sites, and DNA sequences
surrounding the 6s template of bacteriophage lambda
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 72, 1817-1821 (1975)
STANDARD full staff_review
REFERENCE 19 (bases 37945 to 38027)
AUTHORS Maniatis,T., Ptashne,M., Backman,K., Kleid,D., Flashman,S.,
Jeffrey,A. and Maurer,R.
TITLE Recognition sequences of repressor and polymerase in the operators
of bacteriophage lambda
JOURNAL Cell 5, 109-113 (1975)
STANDARD full staff_review
REFERENCE 20 (bases 37946 to 38039)
AUTHORS Smith,G.R., Eisen,H., Reichardt,L. and Hedgpeth,J.
TITLE Deletions of lambda phage locating a p-rm mutation within the
rightward operator
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 73, 712-716 (1976)
STANDARD full staff_review
REFERENCE 21 (bases 35578 to 35667; 37903 to 38027)
AUTHORS Ptashne,M., Bachman,K., Humayun,M.Z., Jeffrey,A., Maurer,R.,
Meyer,B. and Sauer,R.T.
TITLE Autoregulation and function of a repressor in bacteriophage lambda
JOURNAL Science 194, 156-161 (1976)
STANDARD full staff_review
REFERENCE 22 (bases 37905 to 37989)
AUTHORS Walz,A., Pirrotta,V. and Ineichen,K.
TITLE Lambda repressor regulates the switch between p-r and p-rm
promoters
JOURNAL Nature 262, 665-669 (1976)
STANDARD full staff_review
REFERENCE 23 (bases 27617 to 27934)
AUTHORS Landy,A. and Ross,W.
TITLE Viral integration and excision: structure of the lambda att sites
JOURNAL Science 197, 1147-1160 (1977)
STANDARD full staff_review
REFERENCE 24 (bases 39062 to 39170)
AUTHORS Denniston-Thompson,K., Moore,D.D., Kruger,K.E., Furth,M.E. and
Blattner,F.R.
TITLE Physical structure of the replication origin of bacteriophage
lambda
JOURNAL Science 198, 1051-1056 (1977)
STANDARD full staff_review
REFERENCE 25 (bases 38610 to 38732)
AUTHORS Scherer,G., Hobom,G. and Koessel,H.
TITLE DNA base sequence of the p-o promoter region of phage lambda
JOURNAL Nature 265, 117-121 (1977)
STANDARD full staff_review
REFERENCE 26 (bases 38041 to 38241)
AUTHORS Roberts,T.M., Shimatake,H., Brady,C. and Rosenberg,M.
TITLE Sequence of cro gene of bacteriophage lambda
JOURNAL Nature 270, 274-275 (1977)
STANDARD full staff_review
REFERENCE 27 (bases 27616 to 28935)
AUTHORS Davies,R.W., Schreier,P.H. and Buechel,D.E.
TITLE Nucleotide sequence of the attachment site of coliphage lambda
JOURNAL Nature 270, 757-760 (1977)
STANDARD full staff_review
REFERENCE 28 (bases 35578 to 35667)
AUTHORS Humayun,Z., Jeffrey,A. and Ptashne,M.
TITLE Completed DNA sequences and organization of repressor-binding sites
in the operators of phage lambda
JOURNAL J. Mol. Biol. 112, 265-277 (1977)
STANDARD full staff_review
REFERENCE 29 (bases 37206 to 37263; 37914 to 37970)
AUTHORS Humayun,Z.
TITLE DNA sequence at the end of the cI gene in bacteriophage lambda
JOURNAL Nucleic Acids Res. 4, 2137-2143 (1977)
STANDARD full staff_review
REFERENCE 30 (bases 44467 to 44807)
AUTHORS Sklar,J.L.
TITLE Structure and function of two regions of DNA controlling the
synthesis of prokaryotic RNAs
JOURNAL Thesis (1977) Yale University, New Haven, CT, USA
STANDARD full staff_review
REFERENCE 31 (bases 38470 to 39189)
AUTHORS Hobom,G., Grosschedl,R., Lusky,M., Scherer,G., Schwarz,E. and
Koessel,H.
TITLE Functional analysis of the replicator structure of lambdoid
bacteriophage DNAs
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 43, 165-178 (1979)
STANDARD full staff_review
REFERENCE 32 (bases 37938 to 38016; 35589 to 35666; comp strand)
AUTHORS Flashman,S.M.
TITLE Mutational analysis of the operators of bacteriophage lambda
JOURNAL Mol. Gen. Genet. 166, 61-73 (1978)
STANDARD full staff_review
REFERENCE 33 (bases 38212 to 38362)
AUTHORS Rosenberg,M., Court,D., Shimatake,H., Brady,C. and Wulff,D.L.
TITLE The relationship between function and DNA sequence in an
intercistronic regulatory region in phage lambda
JOURNAL Nature 272, 414-423 (1978)
STANDARD full staff_review
REFERENCE 34 (bases 29711 to 29811; 31043 to 31058)
AUTHORS Davies,R.W., Schreier,P.H. and Buechel,D.E.
TITLE Determination of the endpoints of partial deletion mutants of the
attachment site of bacteriophage lambda by DNA sequencing
JOURNAL Nucleic Acids Res. 5, 3209-3218 (1978)
STANDARD full staff_review
REFERENCE 35 (bases 21661 to 31129; fragments)
AUTHORS Hoess,R.H. and Landy,A.
TITLE Structure of the lambda att sites generated by int-dependent
deletions
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 5437-5441 (1978)
STANDARD full staff_review
REFERENCE 36 (bases 38453 to 38500)
AUTHORS Sprague,K.U., Faulds,D.H. and Smith,G.R.
TITLE A single base-pair change creates a chi recombinational hotspot in
bacteriophage lambda
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 6182-6186 (1978)
STANDARD full staff_review
REFERENCE 37 (bases 13 to 72; 48391 to 48502)
AUTHORS Nichols,B.P. and Donelson,J.E.
TITLE 178-Nucleotide sequence surrounding the cos site of bacteriophage
lambda DNA
JOURNAL J. Virol. 26, 429-434 (1978)
STANDARD full staff_review
REFERENCE 38 (bases 38597 to 39688)
AUTHORS Scherer,G.
TITLE Nucleotide sequence of the O gene and of the origin of replication
in bacteriophage lambda DNA
JOURNAL Nucleic Acids Res. 5, 3141-3156 (1978)
STANDARD full staff_review
REFERENCE 39 (bases 38453 to 38500)
AUTHORS Smith,G.R., Faulds,D.H. and Sprague,K.U.
TITLE Nucleotide-sequence analysis of a chi site
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 43, 1067-1068 (1979)
STANDARD full staff_review
REFERENCE 40 (sites; transcription termination sites)
AUTHORS Adhya,S. and Gottesman,M.
TITLE Control of transcription termination
JOURNAL Annu. Rev. Biochem. 47, 967-996 (1978)
STANDARD full staff_review
REFERENCE 41 (bases 38008 to 39328)
AUTHORS Moore,D.D., Denniston-Thompson,K., Kruger,K.E., Furth,M.E.,
Williams,B.G., Daniels,D.L. and Blattner,F.R.
TITLE Dissection and comparative anatomy of the origins of replication of
lambdoid phages
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 43, 155-163 (1979)
STANDARD full staff_review
REFERENCE 42 (bases 27711 to 27826)
AUTHORS Ross,W., Landy,A., Kikuchi,Y. and Nash,H.
TITLE Interaction of int protein with specific sites on lambda att DNA
JOURNAL Cell 18, 297-307 (1979)
STANDARD full staff_review
REFERENCE 43 (bases 37305 to 37352)
AUTHORS Lieb,M.
TITLE Is5 increases recombination in adjacent regions as shown for the
repressor gene of coliphage lambda
JOURNAL Gene 12, 277-280 (1980)
STANDARD full staff_review
REFERENCE 44 (bases 29055 to 29131)
AUTHORS Miller,H.I., Abraham,J., Benedik,M., Campbell,A., Court,D.,
Echols,H., Fischer,R., Galindo,J.M., Guarneros,G., Hernandez,T.,
Mascarenhas,D., Montanez,C., Schindler,D., Schmeissner,U. and
Sosa,L.
TITLE Regulation of the integration-excision reaction by bacteriophage
lambda
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 45, 439-445 (1981)
STANDARD full staff_review
REFERENCE 45 (bases 36245 to 36343)
AUTHORS Pirrotta,V., Ineichen,K. and Walz,A.
TITLE An unusual polymerase binding site in the immunity region of phage
lambda
JOURNAL Mol. Gen. Genet. 180, 369-376 (1980)
STANDARD full staff_review
REFERENCE 46 (bases 37940 to 38016)
AUTHORS Rosen,E.D., Hartley,J.L., Matz,K., Nichols,B.P., Young,K.M.,
Donelson,J.E. and Gussin,G.N.
TITLE DNA sequence analysis of prm- mutations of coliphage lambda
JOURNAL Gene 11, 197-205 (1980)
STANDARD full staff_review
REFERENCE 47 (bases 38212 to 38467)
AUTHORS Wulff,D.L., Beher,M., Izumi,S., Beck,J., Mahoney,M., Shimatake,H.,
Brady,C., Court,D. and Rosenberg,M.
TITLE Structure and function of the cy control region of bacteriophage
lambda
JOURNAL J. Mol. Biol. 138, 209-230 (1980)
STANDARD full staff_review
REFERENCE 48 (bases 38237 to 38334)
AUTHORS Court,D., Brady,C., Rosenberg,M., Wulff,D.L., Behr,M.,
Mahoney,M. and Izumi,S.
TITLE Control of transcription termination: A rho-dependent termination
site in bacteriophage lambda
JOURNAL J. Mol. Biol. 138, 231-254 (1980)
STANDARD full staff_review
REFERENCE 49 (bases 38102 to 38166)
AUTHORS Calva,E. and Burgess,R.R.
TITLE Characterization of a rho-dependent termination site within the cro
gene of bacteriophage lambda
JOURNAL J. Biol. Chem. 255, 11017-11022 (1980)
STANDARD full staff_review
REFERENCE 50 (bases 28929 to 29198)
AUTHORS Abraham,J., Mascarenhas,D., Fischer,R., Benedik,M., Campbell,A. and
Echols,H.
TITLE DNA sequence of regulatory region for integration gene of
bacteriophage lambda
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 2477-2481 (1980)
STANDARD full staff_review
REFERENCE 51 (bases 27724 to 29525)
AUTHORS Davies,R.W.
TITLE DNA sequence of the int-xis p-i region of the bacteriophage lambda;
overlap of the int and xis genes
JOURNAL Nucleic Acids Res. 8, 1765-1782 (1980)
STANDARD full staff_review
REFERENCE 52 (bases 27501 to 27615)
AUTHORS Mizuuchi,M. and Mizuuchi,K.
TITLE Integrative recombination of bacteriophage lambda: extent of the
DNA sequence involved in attachment site function
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 3220-3224 (1980)
STANDARD full staff_review
REFERENCE 53 (bases 27479 to 27633)
AUTHORS Hsu,P.-L., Ross,W. and Landy,A.
TITLE The lambda phage att site: functional limits and interaction with
int protein
JOURNAL Nature 285, 85-91 (1980)
STANDARD full staff_review
REFERENCE 54 (bases 23131 to 23248)
AUTHORS Rosenvold,E.C., Calva,E., Burgess,R.R. and Szybalski,W.
TITLE In vitro transcription from the b2 region of bacteriophage lambda
JOURNAL Virology 107, 476-487 (1980)
STANDARD full staff_review
REFERENCE 55 (bases 30493 to 30569)
AUTHORS Smith,G.R., Schultz,D.W. and Crasemann,J.M.
TITLE Generalized recombination: Nucleotide sequence homology between chi
recombinational hotspots
JOURNAL Cell 19, 785-793 (1980)
STANDARD full staff_review
REFERENCE 56 (bases 37940 to 38023)
AUTHORS Meyer,B.J., Maurer,R. and Ptashne,M.
TITLE Gene regulation at the right operator (o-r) of bacteriophage
lambda. II. o-r-1, o-r-2, and o-r-3: their roles in mediating the
effects of repressor and cro
JOURNAL J. Mol. Biol. 139, 163-194 (1980)
STANDARD full staff_review
REFERENCE 57 (bases 29055 to 29124)
AUTHORS Abraham,J. and Echols,H.
TITLE Regulation of int gene transcription by bacteriophage lambda:
location of the start generated by an int constitutive mutation
JOURNAL J. Mol. Biol. 146, 157-165 (1981)
STANDARD full staff_review
REFERENCE 58 (bases 43681 to 45634)
AUTHORS Daniels,D.L.
TITLE Control of late transcription in bacteriophage lambda
JOURNAL Thesis (1981) University of Wisconsin, Madison, WI, USA
STANDARD full staff_review
REFERENCE 59 (bases 35468 to 35819)
AUTHORS Horn,G.T. and Wells,R.D.
TITLE The leftward promoter of bacteriophage lambda
JOURNAL J. Biol. Chem. 256, 1998-2002 (1981)
STANDARD full staff_review
REFERENCE 60 (bases 32503 to 35905)
AUTHORS Ineichen,K., Shepherd,J.C.W. and Bickle,T.A.
TITLE The DNA sequence of the phage lambda genome between p-l and the
gene bet
JOURNAL Nucleic Acids Res. 9, 4639-4653 (1981)
STANDARD full staff_review
REFERENCE 61 (bases 43860 to 45001)
AUTHORS Petrov,N.A., Karginov,V.A., Mikryukov,N.N., Serpinski,O.I. and
Kravchenko,V.V.
TITLE Complete nucleotide sequence of the bacteriophage lambda DNA region
containing gene Q and promoter p-r
JOURNAL FEBS Lett. 133, 316-320 (1981)
STANDARD full staff_review
REFERENCE 62 (bases 44972 to 45057)
AUTHORS Smith,G.R., Comb,M., Schultz,D.W., Daniels,D.L. and Blattner,F.R.
TITLE Nucleotide sequence of the chi recombinational hotspot chi+d in
bacteriophage lambda
JOURNAL J. Virol. 37, 336-342 (1981)
STANDARD full staff_review
REFERENCE 63 (bases 38686 to 39224)
AUTHORS Moore,D.D., Denniston,K.J. and Blattner,F.R.
TITLE Sequence organization of the origins of DNA replication in lambdoid
coliphages
JOURNAL Gene 14, 91-101 (1981)
STANDARD full staff_review
REFERENCE 64 (bases 35468 to 35711)
AUTHORS Remaut,E., Stanssens,P. and Fiers,W.
TITLE Plasmid vectors for high-efficiency expression controlled by the pl
promoter of coliphage lambda
JOURNAL Gene 15, 81-93 (1981)
STANDARD full staff_review
REFERENCE 65 (bases 35468 to 35541)
AUTHORS Drahos,D. and Szybalski,W.
TITLE Antitermination and termination functions of the cloned Nutl, N and
tl1 modules of coliphage lambda
JOURNAL Gene 16, 261-274 (1981)
STANDARD full staff_review
REFERENCE 66 (bases 27650 to 27741)
AUTHORS Kravchenko,V.V. and Mikryukov,N.N.
TITLE Localization of the promoter p-att of the binding site of
Escherichia coli polymerase on phage lambda DNA near the
integration site
JOURNAL Dokl. Biochem. 264, 148-151 (1982)
STANDARD full staff_review
REFERENCE 67 (bases 39219 to 39338)
AUTHORS Moore,D.D. and Blattner,F.R.
TITLE Appendix: Sequence of lambda ri c 5b
JOURNAL J. Mol. Biol. 154, 81-83 (1982)
STANDARD full staff_review
REFERENCE 68 (bases 37938 to 38018)
AUTHORS Hawley,D.K. and McClure,W.R.
TITLE Mechanism of activation of transcription initiation from the lambda
p-rm promoter
JOURNAL J. Mol. Biol. 157, 493-525 (1982)
STANDARD full staff_review
REFERENCE 69 (bases 25157 to 27484)
AUTHORS Hong,G.F.
TITLE A systematic DNA sequencing strategy
JOURNAL J. Mol. Biol. 158, 539-549 (1982)
STANDARD full staff_review
REFERENCE 70 (bases 35577 to 35647)
AUTHORS Hyman,H.C. and Honigman,A.
TITLE The use of the plasmid pha10 in the isolation of lambda pl promoter
mutations
JOURNAL Mol. Gen. Genet. 185, 515-517 (1982)
STANDARD full staff_review
REFERENCE 71 (bases 38262 to 38386)
AUTHORS Lau,L.F., Roberts,J.W. and Wu,R.
TITLE Transcription terminates at lambda tr1 in three clusters
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 6171-6175 (1982)
STANDARD full staff_review
REFERENCE 72 (bases 31299 to 31408)
AUTHORS Luk,K.-C. and Szybalski,W.
TITLE Transcription termination: Sequence and function of the
rho-independent t-l3 terminator in the major leftward operon of
bacteriophage lambda
JOURNAL Gene 17, 247-258 (1982)
STANDARD full staff_review
REFERENCE 73 (bases 35437 to 37348)
AUTHORS Landsmann,J., Kroeger,M. and Hobom,G.
TITLE The rex region of bacteriophage lambda: Two genes under three-way
control
JOURNAL Gene 20, 11-24 (1982)
STANDARD full staff_review
REFERENCE 74 (bases 40218 to 43972)
AUTHORS Kroeger,M. and Hobom,G.
TITLE A chain of interlinked genes in the NinR region of bacteriophage
lambda
JOURNAL Gene 20, 25-38 (1982)
STANDARD full staff_review
REFERENCE 75 (bases 31299 to 31408)
AUTHORS Luk,K.-C. and Szybalski,W.
TITLE Characterization of the cloned terminators t-r1, t-l3 and t-i, and
the Nutr antitermination site of coliphage lambda
JOURNAL Gene 20, 127-134 (1982)
STANDARD full staff_review
REFERENCE 76 (bases 48424 to 48500)
AUTHORS Miwa,T. and Matsubara,K.
TITLE Identification of sequences necessary for packaging DNA into lambda
phage heads
JOURNAL Gene 20, 267-279 (1982)
STANDARD full staff_review
REFERENCE 77 (bases 18414 to 18746)
AUTHORS Luk,K.-C. and Szybalski,W.
TITLE A cluster of leftward, rho-dependent t'j terminators in the J gene
of coliphage lambda
JOURNAL Gene 21, 175-191 (1983)
STANDARD full staff_review
REFERENCE 78 (bases 48469 to 48498)
AUTHORS Miwa,T. and Matsubara,K.
TITLE Lambda phage DNA sequences affecting the packaging process
JOURNAL Gene 24, 199-206 (1983)
STANDARD full staff_review
REFERENCE 79 (bases 33287 to 33486)
AUTHORS Knight,D.M. and Echols,H.
TITLE The cIII gene and protein of bacteriophage lambda
JOURNAL J. Mol. Biol. 163, 505-510 (1983)
STANDARD full staff_review
REFERENCE 80 (bases 33000 to 33244; 33420 to 33543; 33629 to 34080)
AUTHORS Luk,K.-C. and Szybalski,W.
TITLE The tl2 cluster of transcription termination sites between genes
bet and ral of coliphage lambda
JOURNAL Virology 125, 403-418 (1983)
STANDARD full staff_review
REFERENCE 81 (bases 37938 to 38019)
AUTHORS Shih,M.-C. and Gussin,G.N.
TITLE Mutations affecting two different steps in transcription initiation
at the phage lambda p-rm promoter
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 496-500 (1983)
STANDARD full staff_review
REFERENCE 82 (bases 1 to 56; 48474 to 48502)
AUTHORS Feiss,M., Kobayashi,I. and Widner,W.
TITLE Separate sites for binding and nicking of bacteriophage lambda DNA
by terminase
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 955-959 (1983)
STANDARD full staff_review
REFERENCE 83 (bases 29063 to 29140)
AUTHORS Benedik,M., Mascarenhas,D. and Campbell,A.
TITLE The integrase promoter and t1' terminator in bacteriophages lambda
and 434
JOURNAL Virology 126, 658-668 (1983)
STANDARD full staff_review
REFERENCE 84 (bases 45901 to 46443)
AUTHORS Taylor,A., Benedik,M. and Campbell,A.
TITLE Location of the R-z gene in bacteriophage lambda
JOURNAL Gene 26, 159-163 (1983)
STANDARD full staff_review
REFERENCE 85 (sites; cohesive ends)
AUTHORS Hohn,B.
TITLE DNA sequences necessary for packaging of bacteriophage lambda DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 7456-7460 (1983)
STANDARD full staff_review
REFERENCE 86 (sites; Pre-promoter mutations)
AUTHORS Wulff,D.L., Mahoney,M., Shatzman,A. and Rosenberg,M.
TITLE Mutational analysis of a regulatory region in bacteriophage lambda
that has overlapping signals for the initiation of transcription
and translation
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 555-559 (1984)
STANDARD full staff_review
REFERENCE 87 (sites; attP recombination site)
AUTHORS Craig,N.L. and Nash,H.A.
TITLE E. coli integration host factor binds to specific sites in DNA
JOURNAL Cell 39, 707-716 (1984)
STANDARD full staff_review
REFERENCE 88 (sites; major leftward transcription unit)
AUTHORS Edlind,T.D., Cooley,T.E., Richards,S.H. and Ihler,G.M.
TITLE Long range base-pairing in the leftward transcription unit of
bacteriophage lambda: Characterization by electron microscopy and
computer-aided sequence analysis
JOURNAL J. Mol. Biol. 179, 351-365 (1984)
STANDARD full staff_review
REFERENCE 89 (sites; prohead binding)
AUTHORS Frackman,S., Siegele,D.A. and Feiss,M.
TITLE A functional domain of bacteriophage lambda terminase for prohead
binding
JOURNAL J. Mol. Biol. 180, 283-300 (1984)
STANDARD full staff_review
REFERENCE 90 (sites; cII binding site mutations)
AUTHORS Place,N., Fien,K., Mahoney,M.E., Wulff,D.L., Ho,Y.-S., Debouck,C.,
Rosenberg,M., Shih,M.-C. and Gussin,G.N.
TITLE Mutations that alter the DNA binding site for the bacteriophage
lambda cII protein and affect the translation efficiency of the cII
gene
JOURNAL J. Mol. Biol. 180, 865-880 (1984)
STANDARD full staff_review
REFERENCE 91 (sites; nutR mutations)
AUTHORS Warren,F. and Das,A.
TITLE Formation of termination-resistant transcription complex at phage
lambda nut locus: Effects of altered translation and a ribosomal
mutation
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 3612-3616 (1984)
STANDARD full staff_review
REFERENCE 92 (sites; light chain oligonucleotides)
AUTHORS Coleclough,C. and Erlitz,F.L.
TITLE Use of primer-restriction-end adapters in a novel cDNA cloning
strategy
JOURNAL Gene 34, 305-314 (1985)
STANDARD full staff_review
REFERENCE 93 (sites; nutL antiterminator)
AUTHORS Peltz,S.W., Brown,A.L., Hasan,N., Podhajska,A.J. and Szybalski,W.
TITLE Thermosensitivity of a DNA recognition Site: Activity of a
truncated nutL Antiterminator of coliphage lambda
JOURNAL Science 228, 91-93 (1985)
STANDARD full staff_review
REFERENCE 94 (sites; rho utilization sites A and B)
AUTHORS Chen,C.-Y.A. and Richardson,J.P.
TITLE Sequence elements essential for rho-dependent transcription
termination at lambda-tR1
JOURNAL J. Biol. Chem. 262, 11292-11299 (1987)
STANDARD simple staff_review
REFERENCE 95 (bases 2521 to 3300)
AUTHORS Hong,G.F.
TITLE Sequencing of large double-stranded DNA using the dideoxy
sequencing technique
JOURNAL Biosci. Rep. 2, 907-912 (1982)
STANDARD simple staff_entry
COMMENT Contributed on tape by F.Sanger via D.L.Daniels. Most of references
[10] through [80] are either annotated by [8] and [9], which are
the immediate sources for the annotation below, or they are cited
in Table 3 of [8]. Only references [1] through [9] are represented
in the features table herein.
This is the best representation to date of the wild-type lambda
l-strand, though much of the sequence was determined for the
cI857s7 strain and changed to wild-type [8]. All reported
variations leading to the strains cI857s7, imm21, imm434, lac5,
Nin5 and b2 are included in the annotation. The first twelve bases
are the sticky ends. A significant fraction of the known mutations
affecting replication and transcription have been annotated below;
a large number of point mutations, deletions and substitutions have
not. For a complete account of lambda mutations in relation to the
sequence, see [8].
Each coding sequence belongs to a reading frame (orf) whose number,
given in parentheses, should indicate the number of amino acids
coded. The starting points for translation are known with varying
degrees of certainty; for example, the start site for the N
protein, given here as 35438, may turn out to be downstream (on the
complementary strand) at 35360. When direct empirical evidence such
as mutation or amino acid sequence is lacking, the start point is
said to be putative. For a summary of the evidence bearing upon the
coding sequences, see [7],[8]. Intergenic spaces in lambda are
typically short and overlapping: the multiple reading frames (mult)
range between a span of 1 and a span of 103. In most cases, a start
codon precedes a termination codon, exceptions being the m-l
boundary (13429) and the 314-194 boundary (21973) which show the
E.coli trp operon pattern of "translational coupling" (see
<ecotrp>).
Transcription in the central region, bases 22686 to 37940, is
leftward off the l-strand. In our annotation, this is indicated by
the letter "c" and the descriptive term "comp strand". Signals and
recognition sites in this region, without judgement made about
their polarity, are treated accordingly, hence their span should be
read toward the left rather than toward the right. Furthermore some
leftward transcription is located outside the central region, and
that is also indicated by "c" and "comp strand". In general, the
estimates for the extent or span of signals (e.g. operators),
binding sites (e.g. Nutr, int-binding sites, etc.) and of the
attachment site (att) vary in the literature. This annotation
follows [8]. No attempt is made to annotate promoters as signals
because of the indefiniteness of their span, however known promoter
mutants are given. The cII protein is known to bind in the -35
regions of p-i (29091) and pre(38369). Transcript termination sites
must be understood to be conditional on the N and Q proteins and
less than 100% efficient. There remain terminators to be found and
some of those annotated may have significance only in vitro.
FEATURES from to/span description
pept 191 736 nu1 (DNA packaging;181)
pept 711 2636 A (DNA packaging;641)
pept 2633 2839 W (head-tail joining;68)
pept 2836 4437 B (capsid component;533)
sigp 2839 2901 leader peptide
matp 2902 4434 processed B
pept 4418 5737 C (capsid component;439)
pept 5132 5737 nu3 (capsid assembly;201)
pept 5747 6079 D (head-DNA stabilization;110)
pept 6135 7160 E (capsid component;341)
pept 7202 7600 Fi (DNA packaging;117)
pept 7612 7965 Fii (head-tail joining;117)
pept 7977 8555 Z (tail component;192)
pept 8552 8947 U (tail component;131)
pept 8955 9695 V (tail component;256)
pept 9711 10133 G (tail component;140)
pept 10115 10549 T (tail component;144)
pept 10542 13103 H (tail component;853)
pept 13100 13429 M (tail component;109)
pept 13429 14127 L (tail component;232)
pept 14276 14875 K (tail component;199)
pept 14773 15444 I (tail component;223)
pept 15505 18903 J (tail:host specificity;1132)
pept 18965 19585 lom (outer host membrane;206a)
pept 19650 20855 orf-401
pept 21029 21973 orf-314
pept 21973 22557 orf-194
pept 23918 22686 (c) ea47
pept 25399 24509 (c) ea31 (296)
pept 26973 25396 (c) ea59 (525)
pept 28882 27812 (c) int (integration;356)
pept 29078 28860 (c) xis (excision;72)
pept 29655 29374 (c) ea8.5 (93)
pept 30395 29847 (c) ea22 (182)
pept 32028 31348 (c) exo (exonuclease;226)
pept 32810 32025 (c) bet (recombination;261)
pept 33232 32816 (c) gam (recombination;138)
pept 33330 33187 (c) kil(host-killing;54)
pept 33463 33299 (c) cIII (antitermination;89)
pept 33904 33536 (c) ea10 (ssb;122)
pept 34287 34087 (c) ral(restriction alleviation;66)
pept 35438 35037 (c) N (early gene regulator;133)
pept 36259 35825 (c) rexb (exclusion;144)
pept 37114 36275 (c) rexa (exclusion;279)
pept 37940 37227 (c) cI (repressor;237)
pept 38041 38241 cro (antirepressor; also tof;66)
pept 38360 38653 cII (antitermination;119)
pept 38686 39585 O (DNA replication;299)
pept 39582 40283 P (DNA replication;233)
pept 40280 40570 ren(exclusion;96)
pept 40644 41084 Nin 146 (pept unknown;146)
pept 41081 41953 Nin 290 (pept unknown;290)
pept 41950 42123 Nin 57 (pept unknown;57)
pept 42090 42272 Nin 60 (pept unknown;60)
pept 42269 42439 Nin 56 (pept unknown;56)
pept 42429 43043 Nin 204 (pept unknown;204)
pept 43040 43246 Nin 68 (pept unknown;68)
pept 43224 43889 Nin 221 (pept unknown;221)
pept 43886 44509 Q (late gene regulator;207)
pept 44621 44815 orf-64
pept 45186 45509 S (cell lysis;107)
pept 45493 45969 R (cell lysis;158)
pept 45966 46427 Rz (cell lysis;153)
mRNA 35582 18482 (c) mRNA-pl (alt.; via t'j4 terminator)
mRNA 35582 18597 (c) mRNA-pl (alt.; via t'j3 terminator)
mRNA 35582 18637 (c) mRNA-pl (alt.; via t'j2 terminator)
mRNA 35582 18671 (c) mRNA-pl (alt.; via t'j1 terminator)
mRNA 35582 27538 (c) mRNA-pl (alt.; via ti terminator)
mRNA 35582 31262 (c) mRNA-pl (alt.; via tl3 terminator)
mRNA 35582 33100 (c) mRNA-pl (alt.; via tl2d terminator)
mRNA 35582 33141 (c) mRNA-pl (alt.; via tl2c terminator)
mRNA 35582 33494 (c) mRNA-pl (alt.; via tl2b terminator)
mRNA 35582 33930 (c) mRNA-pl (alt.; via tl2a terminator)
mRNA 35582 34560 (c) mRNA-pl (alt.; via tl1 terminator)
mRNA 29065 27538 (c) mRNA int (integration; 356; via ti terminator)
mRNA 23231 < 23231 (c) mRNA-pbl
mRNA 37940 35798 (c) mRNA-prm (via timm terminator)
mRNA 38343 35798 (c) mRNA-pre (via timm terminator)
mRNA 36256 35798 (c) mRNA-plit (via timm terminator)
mRNA 38023 38135 mRNA-pr (alt.; via tr0 terminator)
mRNA 38023 38315 mRNA-pr (alt.; via tr1a terminator)
mRNA 38023 38337 mRNA-pr (alt.; via tr1b terminator)
mRNA 38023 38370 mRNA-pr (alt.; via tr1c terminator)
mRNA 38023 40624 mRNA-pr (alt.; via tr2 terminator)
mRNA 38675 38599 (c) mRNA-oop transcription mRNA
mRNA 44587 44780 mRNA-pr' transcription (late genes) mRNA
binding 27583 27602 int binding site 1
binding 27615 27634 int binding site 2
binding 27714 27747 int binding site 3
binding 27814 27854 int binding site 4
binding 35534 35518 (c) Nutl(N-utilization leftward)putative binding
site
binding 38265 38281 Nutr(N-utilization rightward)putative binding
site
binding 39034 39051 ori iteron 1(O binding site)
binding 39054 39071 ori iteron 2(O binding site)
binding 39078 39095 ori iteron 3(O binding site)
binding 39101 39118 ori iteron 4(O binding site)
mut 19368 23278 lac5 substitution
mut 21737 > 21737 b2 substitution terminating at the att site
recomb 24389 24390 lambda::lambdoid hybridization site
mut 27537 27537 t in sib3 , c in wild-type
mut 27547 27547 a in hef13 , g in wild-type
mut 27568 27568 a in sib2 , c in wild-type
mut 27573 27573 t in sib1 , g in wild-type
recomb 27723 27738 attachment core(att)for host chromosome
insertion
mut 29063 29063 a in xis am6 , g in wild-type
recomb 31266 31267 lambda::lambdoid hybridization site
mut 34378 38617 imm21 region
mut 35528 35528 a in Nutl63,g in Nutl96,t in Nutl18,c in
wild-type
mut 35530 35530 g in wild-type deleted in Nutl3
mut 35583 38245 imm434 region
mut 35596 35596 a in vir2, t in v003, c in wild-type
mut 35606 35606 c in vir101 , t in wild-type
signal 35607 35591 (c) operator-l1 (first base on comp strand)
mut 35621 35621 t in v305 , c in wild-type
mut 35622 35622 t in v305 , g in wild-type
signal 35631 35615 (c) operator-l2 (first base on comp strand)
signal 35651 35635 (c) operator-l3 (first base on comp strand)
mut 35940 35940 a in rex209 , g in wild-type
mut 35947 35947 a in rex111 , g in wild-type
mut 37287 37287 a in cIam14, c in wild-type
mut 37308 37308 c in cIam504, g in wild-type
mut 37313 37313 a in cIam505, g in wild-type
variant 37589 37589 t in strain cI857s7([2]); c in wild type
mut 37589 37589 t in ind1 , c in wild-type
mut 37629 37629 c in cIam499, g in wild-type
mut 37635 37635 c in cIam212, a in wild-type
mut 37680 37680 a in cIam34, c in wild-type
variant 37742 37742 t in strain ci857s7([2]); c in wild-type
mut 37742 37742 t in ci857 , c in wild-type
mut 37808 37808 a in cIam282, g in wild-type
mut 37872 37872 c in cIam302, a in wild-type
signal 37951 37967 operator-r3
mut 37954 37954 t in prm-e37 , c in wild-type
mut 37955 37955 g in vc3 , a in wild-type
mut 37957 37957 t in or3-r1 , c in wild-type
mut 37958 37958 t in or3-r2, a in or3-r3 mutants, g in
wild-type
mut 37965 37965 g in or3-c12 , a in wild-type
mut 37966 37966 c in or3-c10 , t in wild-type
mut 37971 37971 g inp-rmup-1 , a in wild-type
mut 37973 37973 t in prm-m104, 116, u31 mutants, c in
wild-type
signal 37974 37990 operator-r2
mut 37978 37978 t in prm-e104, g in vc3, a in wild-type
mut 37979 37979 a in virl, t in prm-e93, c in wild-type
mut 37985 37985 t in vn , g in wild-type
mut 37989 37989 t deleted in mah4 mutant
mut 37990 37990 g deleted in mch9 mutant
mut 37991 37991 g in pr-x3 , a in wild-type
signal 37998 38014 operator-r1
mut 38003 38003 a in vs326 , c in wild-type
mut 38007 38007 t in prm-uv8, a in vir3, c in wild-type
mut 38008 38008 a in prm-uv93, m36 mutants, g in wild-type
mut 38009 38009 c in vs387, t in vc1, g in wild-type
site 38249 38266 rho utilization site A (rutA)
site 38282 38301 rho utilization site B (rutB)
mut 38302 38302 a in cin-1 , g in wild-type
mut 38306 38306 c in cnc1 , t in wild-type
mut 38307 38307 g in cnc8 , a in wild-type
mut 38350 38350 g in cy3048, a in wild-type
mut 38354 38354 c in cy2001, t in wild-type
mut 38357 38357 t in cy3019, c in wild-type
mut 38364 38364 g in can1 , t in wild-type
mut 38370 38370 t in cy3003 , c in wild-type
mut 38371 38371 t in cy42 , a in wild-type
mut 38376 38376 g in cy844 , a in wild-type
mut 38379 38379 a in cy3008 , g in wild-type
mut 38380 38380 t in cy3001 , c in wild-type
mut 38430 38430 c in cII2002 , t in wild-type
signal 38543 38557 ice(inceptor signal for DNA replication)
mut 39122 39122 a in ti-12 , c in wild-type
recomb 39157 39158 lambda::lambdoid hybridization site
recomb 39165 39166 lambda::lambdoid hybridization site
mut 39268 39268 t in ric5b , c in wild-type
mut 39292 39292 a in ric5b , g in wild-type
mut 40501 43307 Nin5 substitution
variant 43082 43082 a in strain cI857s7 ([2]); g in wild-type
unsure 43082 43082 g or a, cited in [8]
recomb 43884 43885 lambda::lambdoid hybridization site
variant 45352 45352 a in strain cI857s7 ([2]); g in wild-type
mut 45352 45352 a in sam7 , g in wild-type
BASE COUNT 12334 a 11362 c 12820 g 11986 t
ORIGIN 5' end of the l-strand of the lambda chromosome (sticky end).
1 gggcggcgac ctcgcgggtt ttcgctattt atgaaaattt tccggtttaa ggcgtttccg
61 ttcttcttcg tcataactta atgtttttat ttaaaatacc ctctgaaaag aaaggaaacg
121 acaggtgctg aaagcgaggc tttttggcct ctgtcgtttc ctttctctgt ttttgtccgt
181 ggaatgaaca atggaagtca acaaaaagca gctggctgac attttcggtg cgagtatccg
241 taccattcag aactggcagg aacagggaat gcccgttctg cgaggcggtg gcaagggtaa
301 tgaggtgctt tatgactctg ccgccgtcat aaaatggtat gccgaaaggg atgctgaaat
361 tgagaacgaa aagctgcgcc gggaggttga agaactgcgg caggccagcg aggcagatct
421 ccagccagga actattgagt acgaacgcca tcgacttacg cgtgcgcagg ccgacgcaca
481 ggaactgaag aatgccagag actccgctga agtggtggaa accgcattct gtactttcgt
541 gctgtcgcgg atcgcaggtg aaattgccag tattctcgac gggctccccc tgtcggtgca
601 gcggcgtttt ccggaactgg aaaaccgaca tgttgatttc ctgaaacggg atatcatcaa
661 agccatgaac aaagcagccg cgctggatga actgataccg gggttgctga gtgaatatat
721 cgaacagtca ggttaacagg ctgcggcatt ttgtccgcgc cgggcttcgc tcactgttca
781 ggccggagcc acagaccgcc gttgaatggg cggatgctaa ttactatctc ccgaaagaat
841 ccgcatacca ggaagggcgc tgggaaacac tgccctttca gcgggccatc atgaatgcga
901 tgggcagcga ctacatccgt gaggtgaatg tggtgaagtc tgcccgtgtc ggttattcca
961 aaatgctgct gggtgtttat gcctacttta tagagcataa gcagcgcaac acccttatct
1021 ggttgccgac ggatggtgat gccgagaact ttatgaaaac ccacgttgag ccgactattc
1081 gtgatattcc gtcgctgctg gcgctggccc cgtggtatgg caaaaagcac cgggataaca
1141 cgctcaccat gaagcgtttc actaatgggc gtggcttctg gtgcctgggc ggtaaagcgg
1201 caaaaaacta ccgtgaaaag tcggtggatg tggcgggtta tgatgaactt gctgcttttg
1261 atgatgatat tgaacaggaa ggctctccga cgttcctggg tgacaagcgt attgaaggct
1321 cggtctggcc aaagtccatc cgtggctcca cgccaaaagt gagaggcacc tgtcagattg
1381 agcgtgcagc cagtgaatcc ccgcatttta tgcgttttca tgttgcctgc ccgcattgcg
1441 gggaggagca gtatcttaaa tttggcgaca aagagacgcc gtttggcctc aaatggacgc
1501 cggatgaccc ctccagcgtg ttttatctct gcgagcataa tgcctgcgtc atccgccagc
1561 aggagctgga ctttactgat gcccgttata tctgcgaaaa gaccgggatc tggacccgtg
1621 atggcattct ctggttttcg tcatccggtg aagagattga gccacctgac agtgtgacct
1681 ttcacatctg gacagcgtac agcccgttca ccacctgggt gcagattgtc aaagactgga
1741 tgaaaacgaa aggggatacg ggaaaacgta aaaccttcgt aaacaccacg ctcggtgaga
1801 cgtgggaggc gaaaattggc gaacgtccgg atgctgaagt gatggcagag cggaaagagc
1861 attattcagc gcccgttcct gaccgtgtgg cttacctgac cgccggtatc gactcccagc
1921 tggaccgcta cgaaatgcgc gtatggggat gggggccggg tgaggaaagc tggctgattg
1981 accggcagat tattatgggc cgccacgacg atgaacagac gctgctgcgt gtggatgagg
2041 ccatcaataa aacctatacc cgccggaatg gtgcagaaat gtcgatatcc cgtatctgct
2101 gggatactgg cgggattgac ccgaccattg tgtatgaacg ctcgaaaaaa catgggctgt
2161 tccgggtgat ccccattaaa ggggcatccg tctacggaaa gccggtggcc agcatgccac
2221 gtaagcgaaa caaaaacggg gtttacctta ccgaaatcgg tacggatacc gcgaaagagc
2281 agatttataa ccgcttcaca ctgacgccgg aaggggatga accgcttccc ggtgccgttc
2341 acttcccgaa taacccggat atttttgatc tgaccgaagc gcagcagctg actgctgaag
2401 agcaggtcga aaaatgggtg gatggcagga aaaaaatact gtgggacagc aaaaagcgac
2461 gcaatgaggc actcgactgc ttcgtttatg cgctggcggc gctgcgcatc agtatttccc
2521 gctggcagct ggatctcagt gcgctgctgg cgagcctgca ggaagaggat ggtgcagcaa
2581 ccaacaagaa aacactggca gattacgccc gtgccttatc cggagaggat gaatgacgcg
2641 acaggaagaa cttgccgctg cccgtgcggc actgcatgac ctgatgacag gtaaacgggt
2701 ggcaacagta cagaaagacg gacgaagggt ggagtttacg gccacttccg tgtctgacct
2761 gaaaaaatat attgcagagc tggaagtgca gaccggcatg acacagcgac gcaggggacc
2821 tgcaggattt tatgtatgaa aacgcccacc attcccaccc ttctggggcc ggacggcatg
2881 acatcgctgc gcgaatatgc cggttatcac ggcggtggca gcggatttgg agggcagttg
2941 cggtcgtgga acccaccgag tgaaagtgtg gatgcagccc tgttgcccaa ctttacccgt
3001 ggcaatgccc gcgcagacga tctggtacgc aataacggct atgccgccaa cgccatccag
3061 ctgcatcagg atcatatcgt cgggtctttt ttccggctca gtcatcgccc aagctggcgc
3121 tatctgggca tcggggagga agaagcccgt gccttttccc gcgaggttga agcggcatgg
3181 aaagagtttg ccgaggatga ctgctgctgc attgacgttg agcgaaaacg cacgtttacc
3241 atgatgattc gggaaggtgt ggccatgcac gcctttaacg gtgaactgtt cgttcaggcc
3301 acctgggata ccagttcgtc gcggcttttc cggacacagt tccggatggt cagcccgaag
3361 cgcatcagca acccgaacaa taccggcgac agccggaact gccgtgccgg tgtgcagatt
3421 aatgacagcg gtgcggcgct gggatattac gtcagcgagg acgggtatcc tggctggatg
3481 ccgcagaaat ggacatggat accccgtgag ttacccggcg ggcgcgcctc gttcattcac
3541 gtttttgaac ccgtggagga cgggcagact cgcggtgcaa atgtgtttta cagcgtgatg
3601 gagcagatga agatgctcga cacgctgcag aacacgcagc tgcagagcgc cattgtgaag
3661 gcgatgtatg ccgccaccat tgagagtgag ctggatacgc agtcagcgat ggattttatt
3721 ctgggcgcga acagtcagga gcagcgggaa aggctgaccg gctggattgg tgaaattgcc
3781 gcgtattacg ccgcagcgcc ggtccggctg ggaggcgcaa aagtaccgca cctgatgccg
3841 ggtgactcac tgaacctgca gacggctcag gatacggata acggctactc cgtgtttgag
3901 cagtcactgc tgcggtatat cgctgccggg ctgggtgtct cgtatgagca gctttcccgg
3961 aattacgccc agatgagcta ctccacggca cgggccagtg cgaacgagtc gtgggcgtac
4021 tttatggggc ggcgaaaatt cgtcgcatcc cgtcaggcga gccagatgtt tctgtgctgg
4081 ctggaagagg ccatcgttcg ccgcgtggtg acgttacctt caaaagcgcg cttcagtttt
4141 caggaagccc gcagtgcctg ggggaactgc gactggatag gctccggtcg tatggccatc
4201 gatggtctga aagaagttca ggaagcggtg atgctgatag aagccggact gagtacctac
4261 gagaaagagt gcgcaaaacg cggtgacgac tatcaggaaa tttttgccca gcaggtccgt
4321 gaaacgatgg agcgccgtgc agccggtctt aaaccgcccg cctgggcggc tgcagcattt
4381 gaatccgggc tgcgacaatc aacagaggag gagaagagtg acagcagagc tgcgtaatct
4441 cccgcatatt gccagcatgg cctttaatga gccgctgatg cttgaacccg cctatgcgcg
4501 ggttttcttt tgtgcgcttg caggccagct tgggatcagc agcctgacgg atgcggtgtc
4561 cggcgacagc ctgactgccc aggaggcact cgcgacgctg gcattatccg gtgatgatga
4621 cggaccacga caggcccgca gttatcaggt catgaacggc atcgccgtgc tgccggtgtc
4681 cggcacgctg gtcagccgga cgcgggcgct gcagccgtac tcggggatga ccggttacaa
4741 cggcattatc gcccgtctgc aacaggctgc cagcgatccg atggtggacg gcattctgct
4801 cgatatggac acgcccggcg ggatggtggc gggggcattt gactgcgctg acatcatcgc
4861 ccgtgtgcgt gacataaaac cggtatgggc gcttgccaac gacatgaact gcagtgcagg
4921 tcagttgctt gccagtgccg cctcccggcg tctggtcacg cagaccgccc ggacaggctc
4981 catcggcgtc atgatggctc acagtaatta cggtgctgcg ctggagaaac agggtgtgga
5041 aatcacgctg atttacagcg gcagccataa ggtggatggc aacccctaca gccatcttcc
5101 ggatgacgtc cgggagacac tgcagtcccg gatggacgca acccgccaga tgtttgcgca
5161 gaaggtgtcg gcatataccg gcctgtccgt gcaggttgtg ctggataccg aggctgcagt
5221 gtacagcggt caggaggcca ttgatgccgg actggctgat gaacttgtta acagcaccga
5281 tgcgatcacc gtcatgcgtg atgcactgga tgcacgtaaa tcccgtctct caggagggcg
5341 aatgaccaaa gagactcaat caacaactgt ttcagccact gcttcgcagg ctgacgttac
5401 tgacgtggtg ccagcgacgg agggcgagaa cgccagcgcg gcgcagccgg acgtgaacgc
5461 gcagatcacc gcagcggttg cggcagaaaa cagccgcatt atggggatcc tcaactgtga
5521 ggaggctcac ggacgcgaag aacaggcacg cgtgctggca gaaacccccg gtatgaccgt
5581 gaaaacggcc cgccgcattc tggccgcagc accacagagt gcacaggcgc gcagtgacac
5641 tgcgctggat cgtctgatgc agggggcacc ggcaccgctg gctgcaggta acccggcatc
5701 tgatgccgtt aacgatttgc tgaacacacc agtgtaaggg atgtttatga cgagcaaaga
5761 aacctttacc cattaccagc cgcagggcaa cagtgacccg gctcataccg caaccgcgcc
5821 cggcggattg agtgcgaaag cgcctgcaat gaccccgctg atgctggaca cctccagccg
5881 taagctggtt gcgtgggatg gcaccaccga cggtgctgcc gttggcattc ttgcggttgc
5941 tgctgaccag accagcacca cgctgacgtt ctacaagtcc ggcacgttcc gttatgagga
6001 tgtgctctgg ccggaggctg ccagcgacga gacgaaaaaa cggaccgcgt ttgccggaac
6061 ggcaatcagc atcgtttaac tttacccttc atcactaaag gccgcctgtg cggctttttt
6121 tacgggattt ttttatgtcg atgtacacaa ccgcccaact gctggcggca aatgagcaga
6181 aatttaagtt tgatccgctg tttctgcgtc tctttttccg tgagagctat cccttcacca
6241 cggagaaagt ctatctctca caaattccgg gactggtaaa catggcgctg tacgtttcgc
6301 cgattgtttc cggtgaggtt atccgttccc gtggcggctc cacctctgaa tttacgccgg
6361 gatatgtcaa gccgaagcat gaagtgaatc cgcagatgac cctgcgtcgc ctgccggatg
6421 aagatccgca gaatctggcg gacccggctt accgccgccg tcgcatcatc atgcagaaca
6481 tgcgtgacga agagctggcc attgctcagg tcgaagagat gcaggcagtt tctgccgtgc
6541 ttaagggcaa atacaccatg accggtgaag ccttcgatcc ggttgaggtg gatatgggcc
6601 gcagtgagga gaataacatc acgcagtccg gcggcacgga gtggagcaag cgtgacaagt
6661 ccacgtatga cccgaccgac gatatcgaag cctacgcgct gaacgccagc ggtgtggtga
6721 atatcatcgt gttcgatccg aaaggctggg cgctgttccg ttccttcaaa gccgtcaagg
6781 agaagctgga tacccgtcgt ggctctaatt ccgagctgga gacagcggtg aaagacctgg
6841 gcaaagcggt gtcctataag gggatgtatg gcgatgtggc catcgtcgtg tattccggac
6901 agtacgtgga aaacggcgtc aaaaagaact tcctgccgga caacacgatg gtgctgggga
6961 acactcaggc acgcggtctg cgcacctatg gctgcattca ggatgcggac gcacagcgcg
7021 aaggcattaa cgcctctgcc cgttacccga aaaactgggt gaccaccggc gatccggcgc
7081 gtgagttcac catgattcag tcagcaccgc tgatgctgct ggctgaccct gatgagttcg
7141 tgtccgtaca actggcgtaa tcatggccct tcggggccat tgtttctctg tggaggagtc
7201 catgacgaaa gatgaactga ttgcccgtct ccgctcgctg ggtgaacaac tgaaccgtga
7261 tgtcagcctg acggggacga aagaagaact ggcgctccgt gtggcagagc tgaaagagga
7321 gcttgatgac acggatgaaa ctgccggtca ggacacccct ctcagccggg aaaatgtgct
7381 gaccggacat gaaaatgagg tgggatcagc gcagccggat accgtgattc tggatacgtc
7441 tgaactggtc acggtcgtgg cactggtgaa gctgcatact gatgcacttc acgccacgcg
7501 ggatgaacct gtggcatttg tgctgccggg aacggcgttt cgtgtctctg ccggtgtggc
7561 agccgaaatg acagagcgcg gcctggccag aatgcaataa cgggaggcgc tgtggctgat
7621 ttcgataacc tgttcgatgc tgccattgcc cgcgccgatg aaacgatacg cgggtacatg
7681 ggaacgtcag ccaccattac atccggtgag cagtcaggtg cggtgatacg tggtgttttt
7741 gatgaccctg aaaatatcag ctatgccgga cagggcgtgc gcgttgaagg ctccagcccg
7801 tccctgtttg tccggactga tgaggtgcgg cagctgcggc gtggagacac gctgaccatc
7861 ggtgaggaaa atttctgggt agatcgggtt tcgccggatg atggcggaag ttgtcatctc
7921 tggcttggac ggggcgtacc gcctgccgtt aaccgtcgcc gctgaaaggg ggatgtatgg
7981 ccataaaagg tcttgagcag gccgttgaaa acctcagccg tatcagcaaa acggcggtgc
8041 ctggtgccgc cgcaatggcc attaaccgcg ttgcttcatc cgcgatatcg cagtcggcgt
8101 cacaggttgc ccgtgagaca aaggtacgcc ggaaactggt aaaggaaagg gccaggctga
8161 aaagggccac ggtcaaaaat ccgcaggcca gaatcaaagt taaccggggg gatttgcccg
8221 taatcaagct gggtaatgcg cgggttgtcc tttcgcgccg caggcgtcgt aaaaaggggc
8281 agcgttcatc cctgaaaggt ggcggcagcg tgcttgtggt gggtaaccgt cgtattcccg
8341 gcgcgtttat tcagcaactg aaaaatggcc ggtggcatgt catgcagcgt gtggctggga
8401 aaaaccgtta ccccattgat gtggtgaaaa tcccgatggc ggtgccgctg accacggcgt
8461 ttaaacaaaa tattgagcgg atacggcgtg aacgtcttcc gaaagagctg ggctatgcgc
8521 tgcagcatca actgaggatg gtaataaagc gatgaaacat actgaactcc gtgcagccgt
8581 actggatgca ctggagaagc atgacaccgg ggcgacgttt tttgatggtc gccccgctgt
8641 ttttgatgag gcggattttc cggcagttgc cgtttatctc accggcgctg aatacacggg
8701 cgaagagctg gacagcgata cctggcaggc ggagctgcat atcgaagttt tcctgcctgc
8761 tcaggtgccg gattcagagc tggatgcgtg gatggagtcc cggatttatc cggtgatgag
8821 cgatatcccg gcactgtcag atttgatcac cagtatggtg gccagcggct atgactaccg
8881 gcgcgacgat gatgcgggct tgtggagttc agccgatctg acttatgtca ttacctatga
8941 aatgtgagga cgctatgcct gtaccaaatc ctacaatgcc ggtgaaaggt gccgggacca
9001 ccctgtgggt ttataagggg agcggtgacc cttacgcgaa tccgctttca gacgttgact
9061 ggtcgcgtct ggcaaaagtt aaagacctga cgcccggcga actgaccgct gagtcctatg
9121 acgacagcta tctcgatgat gaagatgcag actggactgc gaccgggcag gggcagaaat
9181 ctgccggaga taccagcttc acgctggcgt ggatgcccgg agagcagggg cagcaggcgc
9241 tgctggcgtg gtttaatgaa ggcgataccc gtgcctataa aatccgcttc ccgaacggca
9301 cggtcgatgt gttccgtggc tgggtcagca gtatcggtaa ggcggtgacg gcgaaggaag
9361 tgatcacccg cacggtgaaa gtcaccaatg tgggacgtcc gtcgatggca gaagatcgca
9421 gcacggtaac agcggcaacc ggcatgaccg tgacgcctgc cagcacctcg gtggtgaaag
9481 ggcagagcac cacgctgacc gtggccttcc agccggaggg cgtaaccgac aagagctttc
9541 gtgcggtgtc tgcggataaa acaaaagcca ccgtgtcggt cagtggtatg accatcaccg
9601 tgaacggcgt tgctgcaggc aaggtcaaca ttccggttgt atccggtaat ggtgagtttg
9661 ctgcggttgc agaaattacc gtcaccgcca gttaatccgg agagtcagcg atgttcctga
9721 aaaccgaatc atttgaacat aacggtgtga ccgtcacgct ttctgaactg tcagccctgc
9781 agcgcattga gcatctcgcc ctgatgaaac ggcaggcaga acaggcggag tcagacagca
9841 accggaagtt tactgtggaa gacgccatca gaaccggcgc gtttctggtg gcgatgtccc
9901 tgtggcataa ccatccgcag aagacgcaga tgccgtccat gaatgaagcc gttaaacaga
9961 ttgagcagga agtgcttacc acctggccca cggaggcaat ttctcatgct gaaaacgtgg
10021 tgtaccggct gtctggtatg tatgagtttg tggtgaataa tgcccctgaa cagacagagg
10081 acgccgggcc cgcagagcct gtttctgcgg gaaagtgttc gacggtgagc tgagttttgc
10141 cctgaaactg gcgcgtgaga tggggcgacc cgactggcgt gccatgcttg ccgggatgtc
10201 atccacggag tatgccgact ggcaccgctt ttacagtacc cattattttc atgatgttct
10261 gctggatatg cacttttccg ggctgacgta caccgtgctc agcctgtttt tcagcgatcc
10321 ggatatgcat ccgctggatt tcagtctgct gaaccggcgc gaggctgacg aagagcctga
10381 agatgatgtg ctgatgcaga aagcggcagg gcttgccgga ggtgtccgct ttggcccgga
10441 cgggaatgaa gttatccccg cttccccgga tgtggcggac atgacggagg atgacgtaat
10501 gctgatgaca gtatcagaag ggatcgcagg aggagtccgg tatggctgaa ccggtaggcg
10561 atctggtcgt tgatttgagt ctggatgcgg ccagatttga cgagcagatg gccagagtca
10621 ggcgtcattt ttctggtacg gaaagtgatg cgaaaaaaac agcggcagtc gttgaacagt
10681 cgctgagccg acaggcgctg gctgcacaga aagcggggat ttccgtcggg cagtataaag
10741 ccgccatgcg tatgctgcct gcacagttca ccgacgtggc cacgcagctt gcaggcgggc
10801 aaagtccgtg gctgatcctg ctgcaacagg gggggcaggt gaaggactcc ttcggcggga
10861 tgatccccat gttcaggggg cttgccggtg cgatcaccct gccgatggtg ggggccacct
10921 cgctggcggt ggcgaccggt gcgctggcgt atgcctggta tcagggcaac tcaaccctgt
10981 ccgatttcaa caaaacgctg gtcctttccg gcaatcaggc gggactgacg gcagatcgta
11041 tgctggtcct gtccagagcc gggcaggcgg cagggctgac gtttaaccag accagcgagt
11101 cactcagcgc actggttaag gcgggggtaa gcggtgaggc tcagattgcg tccatcagcc
11161 agagtgtggc gcgtttctcc tctgcatccg gcgtggaggt ggacaaggtc gctgaagcct
11221 tcgggaagct gaccacagac ccgacgtcgg ggctgacggc gatggctcgc cagttccata
11281 acgtgtcggc ggagcagatt gcgtatgttg ctcagttgca gcgttccggc gatgaagccg
11341 gggcattgca ggcggcgaac gaggccgcaa cgaaagggtt tgatgaccag acccgccgcc
11401 tgaaagagaa catgggcacg ctggagacct gggcagacag gactgcgcgg gcattcaaat
11461 ccatgtggga tgcggtgctg gatattggtc gtcctgatac cgcgcaggag atgctgatta
11521 aggcagaggc tgcgtataag aaagcagacg acatctggaa tctgcgcaag gatgattatt
11581 ttgttaacga tgaagcgcgg gcgcgttact gggatgatcg tgaaaaggcc cgtcttgcgc
11641 ttgaagccgc ccgaaagaag gctgagcagc agactcaaca ggacaaaaat gcgcagcagc
11701 agagcgatac cgaagcgtca cggctgaaat ataccgaaga ggcgcagaag gcttacgaac
11761 ggctgcagac gccgctggag aaatataccg cccgtcagga agaactgaac aaggcactga
11821 aagacgggaa aatcctgcag gcggattaca acacgctgat ggcggcggcg aaaaaggatt
11881 atgaagcgac gctgaaaaag ccgaaacagt ccagcgtgaa ggtgtctgcg ggcgatcgtc
11941 aggaagacag tgctcatgct gccctgctga cgcttcaggc agaactccgg acgctggaga
12001 agcatgccgg agcaaatgag aaaatcagcc agcagcgccg ggatttgtgg aaggcggaga
12061 gtcagttcgc ggtactggag gaggcggcgc aacgtcgcca gctgtctgca caggagaaat
12121 ccctgctggc gcataaagat gagacgctgg agtacaaacg ccagctggct gcacttggcg
12181 acaaggttac gtatcaggag cgcctgaacg cgctggcgca gcaggcggat aaattcgcac
12241 agcagcaacg ggcaaaacgg gccgccattg atgcgaaaag ccgggggctg actgaccggc
12301 aggcagaacg ggaagccacg gaacagcgcc tgaaggaaca gtatggcgat aatccgctgg
12361 cgctgaataa cgtcatgtca gagcagaaaa agacctgggc ggctgaagac cagcttcgcg
12421 ggaactggat ggcaggcctg aagtccggct ggagtgagtg ggaagagagc gccacggaca
12481 gtatgtcgca ggtaaaaagt gcagccacgc agacctttga tggtattgca cagaatatgg
12541 cggcgatgct gaccggcagt gagcagaact ggcgcagctt cacccgttcc gtgctgtcca
12601 tgatgacaga aattctgctt aagcaggcaa tggtggggat tgtcgggagt atcggcagcg
12661 ccattggcgg ggctgttggt ggcggcgcat ccgcgtcagg cggtacagcc attcaggccg
12721 ctgcggcgaa attccatttt gcaaccggag gatttacggg aaccggcggc aaatatgagc
12781 cagcggggat tgttcaccgt ggtgagtttg tcttcacgaa ggaggcaacc agccggattg
12841 gcgtggggaa tctttaccgg ctgatgcgcg gctatgccac cggcggttat gtcggtacac
12901 cgggcagcat ggcagacagc cggtcgcagg cgtccgggac gtttgagcag aataaccatg
12961 tggtgattaa caacgacggc acgaacgggc agataggtcc ggctgctctg aaggcggtgt
13021 atgacatggc ccgcaagggt gcccgtgatg aaattcagac acagatgcgt gatggtggcc
13081 tgttctccgg aggtggacga tgaagacctt ccgctggaaa gtgaaacccg gtatggatgt
13141 ggcttcggtc ccttctgtaa gaaaggtgcg ctttggtgat ggctattctc agcgagcgcc
13201 tgccgggctg aatgccaacc tgaaaacgta cagcgtgacg ctttctgtcc cccgtgagga
13261 ggccacggta ctggagtcgt ttctggaaga gcacgggggc tggaaatcct ttctgtggac
13321 gccgccttat gagtggcggc agataaaggt gacctgcgca aaatggtcgt cgcgggtcag
13381 tatgctgcgt gttgagttca gcgcagagtt tgaacaggtg gtgaactgat gcaggatatc
13441 cggcaggaaa cactgaatga atgcacccgt gcggagcagt cggccagcgt ggtgctctgg
13501 gaaatcgacc tgacagaggt cggtggagaa cgttattttt tctgtaatga gcagaacgaa
13561 aaaggtgagc cggtcacctg gcaggggcga cagtatcagc cgtatcccat tcaggggagc
13621 ggttttgaac tgaatggcaa aggcaccagt acgcgcccca cgctgacggt ttctaacctg
13681 tacggtatgg tcaccgggat ggcggaagat atgcagagtc tggtcggcgg aacggtggtc
13741 cggcgtaagg tttacgcccg ttttctggat gcggtgaact tcgtcaacgg aaacagttac
13801 gccgatccgg agcaggaggt gatcagccgc tggcgcattg agcagtgcag cgaactgagc
13861 gcggtgagtg cctcctttgt actgtccacg ccgacggaaa cggatggcgc tgtttttccg
13921 ggacgtatca tgctggccaa cacctgcacc tggacctatc gcggtgacga gtgcggttat
13981 agcggtccgg ctgtcgcgga tgaatatgac cagccaacgt ccgatatcac gaaggataaa
14041 tgcagcaaat gcctgagcgg ttgtaagttc cgcaataacg tcggcaactt tggcggcttc
14101 ctttccatta acaaactttc gcagtaaatc ccatgacaca gacagaatca gcgattctgg
14161 cgcacgcccg gcgatgtgcg ccagcggagt cgtgcggctt cgtggtaagc acgccggagg
14221 gggaaagata tttcccctgc gtgaatatct ccggtgagcc ggaggctatt tccgtatgtc
14281 gccggaagac tggctgcagg cagaaatgca gggtgagatt gtggcgctgg tccacagcca
14341 ccccggtggt ctgccctggc tgagtgaggc cgaccggcgg ctgcaggtgc agagtgattt
14401 gccgtggtgg ctggtctgcc gggggacgat tcataagttc cgctgtgtgc cgcatctcac
14461 cgggcggcgc tttgagcacg gtgtgacgga ctgttacaca ctgttccggg atgcttatca
14521 tctggcgggg attgagatgc cggactttca tcgtgaggat gactggtggc gtaacggcca
14581 gaatctctat ctggataatc tggaggcgac ggggctgtat caggtgccgt tgtcagcggc
14641 acagccgggc gatgtgctgc tgtgctgttt tggttcatca gtgccgaatc acgccgcaat
14701 ttactgcggc gacggcgagc tgctgcacca tattcctgaa caactgagca aacgagagag
14761 gtacaccgac aaatggcagc gacgcacaca ctccctctgg cgtcaccggg catggcgcgc
14821 atctgccttt acggggattt acaacgattt ggtcgccgca tcgaccttcg tgtgaaaacg
14881 ggggctgaag ccatccgggc actggccaca cagctcccgg cgtttcgtca gaaactgagc
14941 gacggctggt atcaggtacg gattgccggg cgggacgtca gcacgtccgg gttaacggcg
15001 cagttacatg agactctgcc tgatggcgct gtaattcata ttgttcccag agtcgccggg
15061 gccaagtcag gtggcgtatt ccagattgtc ctgggggctg ccgccattgc cggatcattc
15121 tttaccgccg gagccaccct tgcagcatgg ggggcagcca ttggggccgg tggtatgacc
15181 ggcatcctgt tttctctcgg tgccagtatg gtgctcggtg gtgtggcgca gatgctggca
15241 ccgaaagcca gaactccccg tatacagaca acggataacg gtaagcagaa cacctatttc
15301 tcctcactgg ataacatggt tgcccagggc aatgttctgc ctgttctgta cggggaaatg
15361 cgcgtggggt cacgcgtggt ttctcaggag atcagcacgg cagacgaagg ggacggtggt
15421 caggttgtgg tgattggtcg ctgatgcaaa atgttttatg tgaaaccgcc tgcgggcggt
15481 tttgtcattt atggagcgtg aggaatgggt aaaggaagca gtaaggggca taccccgcgc
15541 gaagcgaagg acaacctgaa gtccacgcag ttgctgagtg tgatcgatgc catcagcgaa
15601 gggccgattg aaggtccggt ggatggctta aaaagcgtgc tgctgaacag tacgccggtg
15661 ctggacactg aggggaatac caacatatcc ggtgtcacgg tggtgttccg ggctggtgag
15721 caggagcaga ctccgccgga gggatttgaa tcctccggct ccgagacggt gctgggtacg
15781 gaagtgaaat atgacacgcc gatcacccgc accattacgt ctgcaaacat cgaccgtctg
15841 cgctttacct tcggtgtaca ggcactggtg gaaaccacct caaagggtga caggaatccg
15901 tcggaagtcc gcctgctggt tcagatacaa cgtaacggtg gctgggtgac ggaaaaagac
15961 atcaccatta agggcaaaac cacctcgcag tatctggcct cggtggtgat gggtaacctg
16021 ccgccgcgcc cgtttaatat ccggatgcgc aggatgacgc cggacagcac cacagaccag
16081 ctgcagaaca aaacgctctg gtcgtcatac actgaaatca tcgatgtgaa acagtgctac
16141 ccgaacacgg cactggtcgg cgtgcaggtg gactcggagc agttcggcag ccagcaggtg
16201 agccgtaatt atcatctgcg cgggcgtatt ctgcaggtgc cgtcgaacta taacccgcag
16261 acgcggcaat acagcggtat ctgggacgga acgtttaaac cggcatacag caacaacatg
16321 gcctggtgtc tgtgggatat gctgacccat ccgcgctacg gcatggggaa acgtcttggt
16381 gcggcggatg tggataaatg ggcgctgtat gtcatcggcc agtactgcga ccagtcagtg
16441 ccggacggct ttggcggcac ggagccgcgc atcacctgta atgcgtacct gaccacacag
16501 cgtaaggcgt gggatgtgct cagcgatttc tgctcggcga tgcgctgtat gccggtatgg
16561 aacgggcaga cgctgacgtt cgtgcaggac cgaccgtcgg ataagacgtg gacctataac
16621 cgcagtaatg tggtgatgcc ggatgatggc gcgccgttcc gctacagctt cagcgccctg
16681 aaggaccgcc ataatgccgt tgaggtgaac tggattgacc cgaacaacgg ctgggagacg
16741 gcgacagagc ttgttgaaga tacgcaggcc attgcccgtt acggtcgtaa tgttacgaag
16801 atggatgcct ttggctgtac cagccggggg caggcacacc gcgccgggct gtggctgatt
16861 aaaacagaac tgctggaaac gcagaccgtg gatttcagcg tcggcgcaga agggcttcgc
16921 catgtaccgg gcgatgttat tgaaatctgc gatgatgact atgccggtat cagcaccggt
16981 ggtcgtgtgc tggcggtgaa cagccagacc cggacgctga cgctcgaccg tgaaatcacg
17041 ctgccatcct ccggtaccgc gctgataagc ctggttgacg gaagtggcaa tccggtcagc
17101 gtggaggttc agtccgtcac cgacggcgtg aaggtaaaag tgagccgtgt tcctgacggt
17161 gttgctgaat acagcgtatg ggagctgaag ctgccgacgc tgcgccagcg actgttccgc
17221 tgcgtgagta tccgtgagaa cgacgacggc acgtatgcca tcaccgccgt gcagcatgtg
17281 ccggaaaaag aggccatcgt ggataacggg gcgcactttg acggcgaaca gagtggcacg
17341 gtgaatggtg tcacgccgcc agcggtgcag cacctgaccg cagaagtcac tgcagacagc
17401 ggggaatatc aggtgctggc gcgatgggac acaccgaagg tggtgaaggg cgtgagtttc
17461 ctgctccgtc tgaccgtaac agcggacgac ggcagtgagc ggctggtcag cacggcccgg
17521 acgacggaaa ccacataccg cttcacgcaa ctggcgctgg ggaactacag gctgacagtc
17581 cgggcggtaa atgcgtgggg gcagcagggc gatccggcgt cggtatcgtt ccggattgcc
17641 gcaccggcag caccgtcgag gattgagctg acgccgggct attttcagat aaccgccacg
17701 ccgcatcttg ccgtttatga cccgacggta cagtttgagt tctggttctc ggaaaagcag
17761 attgcggata tcagacaggt tgaaaccagc acgcgttatc ttggtacggc gctgtactgg
17821 atagccgcca gtatcaatat caaaccgggc catgattatt acttttatat ccgcagtgtg
17881 aacaccgttg gcaaatcggc attcgtggag gccgtcggtc gggcgagcga tgatgcggaa
17941 ggttacctgg attttttcaa aggcaagata accgaatccc atctcggcaa ggagctgctg
18001 gaaaaagtcg agctgacgga ggataacgcc agcagactgg aggagttttc gaaagagtgg
18061 aaggatgcca gtgataagtg gaatgccatg tgggctgtca aaattgagca gaccaaagac
18121 ggcaaacatt atgtcgcggg tattggcctc agcatggagg acacggagga aggcaaactg
18181 agccagtttc tggttgccgc caatcgtatc gcatttattg acccggcaaa cgggaatgaa
18241 acgccgatgt ttgtggcgca gggcaaccag atattcatga acgacgtgtt cctgaagcgc
18301 ctgacggccc ccaccattac cagcggcggc aatcctccgg ccttttccct gacaccggac
18361 ggaaagctga ccgctaaaaa tgcggatatc agtggcagtg tgaatgcgaa ctccgggacg
18421 ctcagtaatg tgacgatagc tgaaaactgt acgataaacg gtacgctgag ggcggaaaaa
18481 atcgtcgggg acattgtaaa ggcggcgagc gcggcttttc cgcgccagcg tgaaagcagt
18541 gtggactggc cgtcaggtac ccgtactgtc accgtgaccg atgaccatcc ttttgatcgc
18601 cagatagtgg tgcttccgct gacgtttcgc ggaagtaagc gtactgtcag cggcaggaca
18661 acgtattcga tgtgttatct gaaagtactg atgaacggtg cggtgattta tgatggcgcg
18721 gcgaacgagg cggtacaggt gttctcccgt attgttgaca tgccagcggg tcggggaaac
18781 gtgatcctga cgttcacgct tacgtccaca cggcattcgg cagatattcc gccgtatacg
18841 tttgccagcg atgtgcaggt tatggtgatt aagaaacagg cgctgggcat cagcgtggtc
18901 tgagtgtgtt acagaggttc gtccgggaac gggcgtttta ttataaaaca gtgagaggtg
18961 aacgatgcgt aatgtgtgta ttgccgttgc tgtctttgcc gcacttgcgg tgacagtcac
19021 tccggcccgt gcggaaggtg gacatggtac gtttacggtg ggctattttc aagtgaaacc
19081 gggtacattg ccgtcgttgt cgggcgggga taccggtgtg agtcatctga aagggattaa
19141 cgtgaagtac cgttatgagc tgacggacag tgtgggggtg atggcttccc tggggttcgc
19201 cgcgtcgaaa aagagcagca cagtgatgac cggggaggat acgtttcact atgagagcct
19261 gcgtggacgt tatgtgagcg tgatggccgg accggtttta caaatcagta agcaggtcag
19321 tgcgtacgcc atggccggag tggctcacag tcggtggtcc ggcagtacaa tggattaccg
19381 taagacggaa atcactcccg ggtatatgaa agagacgacc actgccaggg acgaaagtgc
19441 aatgcggcat acctcagtgg cgtggagtgc aggtatacag attaatccgg cagcgtccgt
19501 cgttgttgat attgcttatg aaggctccgg cagtggcgac tggcgtactg acggattcat
19561 cgttggggtc ggttataaat tctgattagc caggtaacac agtgttatga cagcccgccg
19621 gaaccggtgg gcttttttgt ggggtgaata tggcagtaaa gatttcagga gtcctgaaag
19681 acggcacagg aaaaccggta cagaactgca ccattcagct gaaagccaga cgtaacagca
19741 ccacggtggt ggtgaacacg gtgggctcag agaatccgga tgaagccggg cgttacagca
19801 tggatgtgga gtacggtcag tacagtgtca tcctgcaggt tgacggtttt ccaccatcgc
19861 acgccgggac catcaccgtg tatgaagatt cacaaccggg gacgctgaat gattttctct
19921 gtgccatgac ggaggatgat gcccggccgg aggtgctgcg tcgtcttgaa ctgatggtgg
19981 aagaggtggc gcgtaacgcg tccgtggtgg cacagagtac ggcagacgcg aagaaatcag
20041 ccggcgatgc cagtgcatca gctgctcagg tcgcggccct tgtgactgat gcaactgact
20101 cagcacgcgc cgccagcacg tccgccggac aggctgcatc gtcagctcag gaagcgtcct
20161 ccggcgcaga agcggcatca gcaaaggcca ctgaagcgga aaaaagtgcc gcagccgcag
20221 agtcctcaaa aaacgcggcg gccaccagtg ccggtgcggc gaaaacgtca gaaacgaatg
20281 ctgcagcgtc acaacaatca gccgccacgt ctgcctccac cgcggccacg aaagcgtcag
20341 aggccgccac ttcagcacga gatgcggtgg cctcaaaaga ggcagcaaaa tcatcagaaa
20401 cgaacgcatc atcaagtgcc ggtcgtgcag cttcctcggc aacggcggca gaaaattctg
20461 ccagggcggc aaaaacgtcc gagacgaatg ccaggtcatc tgaaacagca gcggaacgga
20521 gcgcctctgc cgcggcagac gcaaaaacag cggcggcggg gagtgcgtca acggcatcca
20581 cgaaggcgac agaggctgcg ggaagtgcgg tatcagcatc gcagagcaaa agtgcggcag
20641 aagcggcggc aatacgtgca aaaaattcgg caaaacgtgc agaagatata gcttcagctg
20701 tcgcgcttga ggatgcggac acaacgagaa aggggatagt gcagctcagc agtgcaacca
20761 acagcacgtc tgaaacgctt gctgcaacgc caaaggcggt taaggtggta atggatgaaa
20821 cgaacagaaa agcccactgg acagtccggc actgaccgga acgccaacag caccaaccgc
20881 gctcagggga acaaacaata cccagattgc gaacaccgct tttgtactgg ccgcgattgc
20941 agatgttatc gacgcgtcac ctgacgcact gaatacgctg aatgaactgg ccgcagcgct
21001 cgggaatgat ccagattttg ctaccaccat gactaacgcg cttgcgggta aacaaccgaa
21061 gaatgcgaca ctgacggcgc tggcagggct ttccacggcg aaaaataaat taccgtattt
21121 tgcggaaaat gatgccgcca gcctgactga actgactcag gttggcaggg atattctggc
21181 aaaaaattcc gttgcagatg ttcttgaata ccttggggcc ggtgagaatt cggcctttcc
21241 ggcaggtgcg ccgatcccgt ggccatcaga tatcgttccg tctggctacg tcctgatgca
21301 ggggcaggcg tttgacaaat cagcctaccc aaaacttgct gtcgcgtatc catcgggtgt
21361 gcttcctgat atgcgaggct ggacaatcaa ggggaaaccc gccagcggtc gtgctgtatt
21421 gtctcaggaa caggatggaa ttaagtcgca cacccacagt gccagtgcat ccggtacgga
21481 tttggggacg aaaaccacat cgtcgtttga ttacgggacg aaaacaacag gcagtttcga
21541 ttacggcacc aaatcgacga ataacacggg ggctcatgct cacagtctga gcggttcaac
21601 aggggccgcg ggtgctcatg cccacacaag tggtttaagg atgaacagtt ctggctggag
21661 tcagtatgga acagcaacca ttacaggaag tttatccaca gttaaaggaa ccagcacaca
21721 gggtattgct tatttatcga aaacggacag tcagggcagc cacagtcact cattgtccgg
21781 tacagccgtg agtgccggtg cacatgcgca tacagttggt attggtgcgc accagcatcc
21841 ggttgttatc ggtgctcatg cccattcttt cagtattggt tcacacggac acaccatcac
21901 cgttaacgct gcgggtaacg cggaaaacac cgtcaaaaac attgcattta actatattgt
21961 gaggcttgca taatggcatt cagaatgagt gaacaaccac ggaccataaa aatttataat
22021 ctgctggccg gaactaatga atttattggt gaaggtgacg catatattcc gcctcatacc
22081 ggtctgcctg caaacagtac cgatattgca ccgccagata ttccggctgg ctttgtggct
22141 gttttcaaca gtgatgaggc atcgtggcat ctcgttgaag accatcgggg taaaaccgtc
22201 tatgacgtgg cttccggcga cgcgttattt atttctgaac tcggtccgtt accggaaaat
22261 tttacctggt tatcgccggg aggggaatat cagaagtgga acggcacagc ctgggtgaag
22321 gatacggaag cagaaaaact gttccggatc cgggaggcgg aagaaacaaa aaaaagcctg
22381 atgcaggtag ccagtgagca tattgcgccg cttcaggatg ctgcagatct ggaaattgca
22441 acgaaggaag aaacctcgtt gctggaagcc tggaagaagt atcgggtgtt gctgaaccgt
22501 gttgatacat caactgcacc tgatattgag tggcctgctg tccctgttat ggagtaatcg
22561 ttttgtgata tgccgcagaa acgttgtatg aaataacgtt ctgcggttag ttagtatatt
22621 gtaaagctga gtattggttt atttggcgat tattatcttc aggagaataa tggaagttct
22681 atgactcaat tgttcatagt gtttacatca ccgccaattg cttttaagac tgaacgcatg
22741 aaatatggtt tttcgtcatg ttttgagtct gctgttgata tttctaaagt cggttttttt
22801 tcttcgtttt ctctaactat tttccatgaa atacattttt gattattatt tgaatcaatt
22861 ccaattacct gaagtctttc atctataatt ggcattgtat gtattggttt attggagtag
22921 atgcttgctt ttctgagcca tagctctgat atccaaatga agccataggc atttgttatt
22981 ttggctctgt cagctgcata acgccaaaaa atatatttat ctgcttgatc ttcaaatgtt
23041 gtattgatta aatcaattgg atggaattgt ttatcataaa aaattaatgt ttgaatgtga
23101 taaccgtcct ttaaaaaagt cgtttctgca agcttggctg tatagtcaac taactcttct
23161 gtcgaagtga tatttttagg cttatctacc agttttagac gctctttaat atcttcagga
23221 attattttat tgtcatattg tatcatgcta aatgacaatt tgcttatgga gtaatctttt
23281 aattttaaat aagttattct cctggcttca tcaaataaag agtcgaatga tgttggcgaa
23341 atcacatcgt cacccattgg attgtttatt tgtatgccaa gagagttaca gcagttatac
23401 attctgccat agattatagc taaggcatgt aataattcgt aatcttttag cgtattagcg
23461 acccatcgtc tttctgattt aataatagat gattcagtta aatatgaagg taatttcttt
23521 tgtgcaagtc tgactaactt ttttatacca atgtttaaca tactttcatt tgtaataaac
23581 tcaatgtcat tttcttcaat gtaagatgaa ataagagtag cctttgcctc gctatacatt
23641 tctaaatcgc cttgtttttc tatcgtattg cgagaatttt tagcccaagc cattaatgga
23701 tcatttttcc atttttcaat aacattattg ttataccaaa tgtcatatcc tataatctgg
23761 tttttgtttt tttgaataat aaatgttact gttcttgcgg tttggaggaa ttgattcaaa
23821 ttcaagcgaa ataattcagg gtcaaaatat gtatcaatgc agcatttgag caagtgcgat
23881 aaatctttaa gtcttctttc ccatggtttt ttagtcataa aactctccat tttgataggt
23941 tgcatgctag atgctgatat attttagagg tgataaaatt aactgcttaa ctgtcaatgt
24001 aatacaagtt gtttgatctt tgcaatgatt cttatcagaa accatatagt aaattagtta
24061 cacaggaaat ttttaatatt attattatca ttcattatgt attaaaatta gagttgtggc
24121 ttggctctgc taacacgttg ctcataggag atatggtaga gccgcagaca cgtcgtatgc
24181 aggaacgtgc tgcggctggc tggtgaactt ccgatagtgc gggtgttgaa tgatttccag
24241 ttgctaccga ttttacatat tttttgcatg agagaatttg taccacctcc caccgaccat
24301 ctatgactgt acgccactgt ccctaggact gctatgtgcc ggagcggaca ttacaaacgt
24361 ccttctcggt gcatgccact gttgccaatg acctgcctag gaattggtta gcaagttact
24421 accggatttt gtaaaaacag ccctcctcat ataaaaagta ttcgttcact tccgataagc
24481 gtcgtaattt tctatctttc atcatattct agatccctct gaaaaaatct tccgagtttg
24541 ctaggcactg atacataact cttttccaat aattggggaa gtcattcaaa tctataatag
24601 gtttcagatt tgcttcaata aattctgact gtagctgctg aaacgttgcg gttgaactat
24661 atttccttat aacttttacg aaagagtttc tttgagtaat cacttcactc aagtgcttcc
24721 ctgcctccaa acgatacctg ttagcaatat ttaatagctt gaaatgatga agagctctgt
24781 gtttgtcttc ctgcctccag ttcgccgggc attcaacata aaaactgata gcacccggag
24841 ttccggaaac gaaatttgca tatacccatt gctcacgaaa aaaaatgtcc ttgtcgatat
24901 agggatgaat cgcttggtgt acctcatcta ctgcgaaaac ttgacctttc tctcccatat
24961 tgcagtcgcg gcacgatgga actaaattaa taggcatcac cgaaaattca ggataatgtg
25021 caataggaag aaaatgatct atattttttg tctgtcctat atcaccacaa aatggacatt
25081 tttcacctga tgaaacaagc atgtcatcgt aatatgttct agcgggtttg tttttatctc
25141 ggagattatt ttcataaagc ttttctaatt taacctttgt caggttacca actactaagg
25201 ttgtaggctc aagagggtgt gtcctgtcgt aggtaaataa ctgacctgtc gagcttaata
25261 ttctatattg ttgttctttc tgcaaaaaag tggggaagtg agtaatgaaa ttatttctaa
25321 catttatctg catcatacct tccgagcatt tattaagcat ttcgctataa gttctcgctg
25381 gaagaggtag ttttttcatt gtactttacc ttcatctctg ttcattatca tcgcttttaa
25441 aacggttcga ccttctaatc ctatctgacc attataattt tttagaatgg tttcataaga
25501 aagctctgaa tcaacggact gcgataataa gtggtggtat ccagaatttg tcacttcaag
25561 taaaaacacc tcacgagtta aaacacctaa gttctcaccg aatgtctcaa tatccggacg
25621 gataatattt attgcttctc ttgaccgtag gactttccac atgcaggatt ttggaacctc
25681 ttgcagtact actggggaat gagttgcaat tattgctaca ccattgcgtg catcgagtaa
25741 gtcgcttaat gttcgtaaaa aagcagagag caaaggtgga tgcagatgaa cctctggttc
25801 atcgaataaa actaatgact tttcgccaac gacatctact aatcttgtga tagtaaataa
25861 aacaattgca tgtccagagc tcattcgaag cagatatttc tggatattgt cataaaacaa
25921 tttagtgaat ttatcatcgt ccacttgaat ctgtggttca ttacgtctta actcttcata
25981 tttagaaatg aggctgatga gttccatatt tgaaaagttt tcatcactac ttagtttttt
26041 gatagcttca agccagagtt gtctttttct atctactctc atacaaccaa taaatgctga
26101 aatgaattct aagcggagat cgcctagtga ttttaaacta ttgctggcag cattcttgag
26161 tccaatataa aagtattgtg taccttttgc tgggtcaggt tgttctttag gaggagtaaa
26221 aggatcaaat gcactaaacg aaactgaaac aagcgatcga aaatatccct ttgggattct
26281 tgactcgata agtctattat tttcagagaa aaaatattca ttgttttctg ggttggtgat
26341 tgcaccaatc attccattca aaattgttgt tttaccacac ccattccgcc cgataaaagc
26401 atgaatgttc gtgctgggca tagaattaac cgtcacctca aaaggtatag ttaaatcact
26461 gaatccggga gcactttttc tattaaatga aaagtggaaa tctgacaatt ctggcaaacc
26521 atttaacaca cgtgcgaact gtccatgaat ttctgaaaga gttacccctc taagtaatga
26581 ggtgttaagg acgctttcat tttcaatgtc ggctaatcga tttggccata ctactaaatc
26641 ctgaatagct ttaagaaggt tatgtttaaa accatcgctt aatttgctga gattaacata
26701 gtagtcaatg ctttcaccta aggaaaaaaa catttcaggg agttgactga attttttatc
26761 tattaatgaa taagtgctta cttcttcttt ttgacctaca aaaccaattt taacatttcc
26821 gatatcgcat ttttcaccat gctcatcaaa gacagtaaga taaaacattg taacaaagga
26881 atagtcattc caaccatctg ctcgtaggaa tgccttattt ttttctactg caggaatata
26941 cccgcctctt tcaataacac taaactccaa catatagtaa cccttaattt tattaaaata
27001 accgcaattt atttggcggc aacacaggat ctctctttta agttactctc tattacatac
27061 gttttccatc taaaaattag tagtattgaa cttaacgggg catcgtattg tagttttcca
27121 tatttagctt tctgcttcct tttggataac ccactgttat tcatgttgca tggtgcactg
27181 tttataccaa cgatatagtc tattaatgca tatatagtat cgccgaacga ttagctcttc
27241 aggcttctga agaagcgttt caagtactaa taagccgata gatagccacg gacttcgtag
27301 ccatttttca taagtgttaa cttccgctcc tcgctcataa cagacattca ctacagttat
27361 ggcggaaagg tatgcatgct gggtgtgggg aagtcgtgaa agaaaagaag tcagctgcgt
27421 cgtttgacat cactgctatc ttcttactgg ttatgcaggt cgtagtgggt ggcacacaaa
27481 gctttgcact ggattgcgag gctttgtgct tctctggagt gcgacaggtt tgatgacaaa
27541 aaattagcgc aagaagacaa aaatcacctt gcgctaatgc tctgttacag gtcactaata
27601 ccatctaagt agttgattca tagtgactgc atatgttgtg ttttacagta ttatgtagtc
27661 tgttttttat gcaaaatcta atttaatata ttgatattta tatcatttta cgtttctcgt
27721 tcagcttttt tatactaagt tggcattata aaaaagcatt gcttatcaat ttgttgcaac
27781 gaacaggtca ctatcagtca aaataaaatc attatttgat ttcaattttg tcccactccc
27841 tgcctctgtc atcacgatac tgtgatgcca tggtgtccga cttatgcccg agaagatgtt
27901 gagcaaactt atcgcttatc tgcttctcat agagtcttgc agacaaactg cgcaactcgt
27961 gaaaggtagg cggatcccct tcgaaggaaa gacctgatgc ttttcgtgcg cgcataaaat
28021 accttgatac tgtgccggat gaaagcggtt cgcgacgagt agatgcaatt atggtttctc
28081 cgccaagaat ctctttgcat ttatcaagtg tttccttcat tgatattccg agagcatcaa
28141 tatgcaatgc tgttgggatg gcaattttta cgcctgtttt gctttgctcg acataaagat
28201 atccatctac gatatcagac cacttcattt cgcataaatc accaactcgt tgcccggtaa
28261 caacagccag ttccattgca agtctgagcc aacatggtga tgattctgct gcttgataaa
28321 ttttcaggta ttcgtcagcc gtaagtcttg atctccttac ctctgatttt gctgcgcgag
28381 tggcagcgac atggtttgtt gttatatggc cttcagctat tgcctctcgg aatgcatcgc
28441 tcagtgttga tctgattaac ttggctgacg ccgccttgcc ctcgtctatg tatccattga
28501 gcattgccgc aatttctttt gtggtgatgt cttcaagtgg agcatcaggc agacccctcc
28561 ttattgcttt aattttgctc atgtaattta tgagtgtctt ctgcttgatt cctctgctgg
28621 ccaggatttt ttcgtagcga tcaagccatg aatgtaacgt aacggaatta tcactgttga
28681 ttctcgctgt cagaggcttg tgtttgtgtc ctgaaaataa ctcaatgttg gcctgtatag
28741 cttcagtgat tgcgattcgc ctgtctctgc ctaatccaaa ctctttaccc gtccttgggt
28801 ccctgtagca gtaatatcca ttgtttctta tataaaggtt agggggtaaa tcccggcgct
28861 catgacttcg ccttcttccc atttctgatc ctcttcaaaa ggccacctgt tactggtcga
28921 tttaagtcaa cctttaccgc tgattcgtgg aacagatact ctcttccatc cttaaccgga
28981 ggtgggaata tcctgcattc ccgaacccat cgacgaactg tttcaaggct tcttggacgt
29041 cgctggcgtg cgttccactc ctgaagtgtc aagtacatcg caaagtctcc gcaattacac
29101 gcaagaaaaa accgccatca ggcggcttgg tgttctttca gttcttcaat tcgaatattg
29161 gttacgtctg catgtgctat ctgcgcccat atcatccagt ggtcgtagca gtcgttgatg
29221 ttctccgctt cgataactct gttgaatggc tctccattcc attctcctgt gactcggaag
29281 tgcatttatc atctccataa aacaaaaccc gccgtagcga gttcagataa aataaatccc
29341 cgcgagtgcg aggattgtta tgtaatattg ggtttaatca tctatatgtt ttgtacagag
29401 agggcaagta tcgtttccac cgtactcgtg ataataattt tgcacggtat cagtcatttc
29461 tcgcacattg cagaatgggg atttgtcttc attagactta taaaccttca tggaatattt
29521 gtatgccgac tctatatcta taccttcatc tacataaaca ccttcgtgat gtctgcatgg
29581 agacaagaca ccggatctgc acaacattga taacgcccaa tctttttgct cagactctaa
29641 ctcattgata ctcatttata aactccttgc aatgtatgtc gtttcagcta aacggtatca
29701 gcaatgttta tgtaaagaaa cagtaagata atactcaacc cgatgtttga gtacggtcat
29761 catctgacac tacagactct ggcatcgctg tgaagacgac gcgaaattca gcattttcac
29821 aagcgttatc ttttacaaaa ccgatctcac tctcctttga tgcgaatgcc agcgtcagac
29881 atcatatgca gatactcacc tgcatcctga acccattgac ctccaacccc gtaatagcga
29941 tgcgtaatga tgtcgatagt tactaacggg tcttgttcga ttaactgccg cagaaactct
30001 tccaggtcac cagtgcagtg cttgataaca ggagtcttcc caggatggcg aacaacaaga
30061 aactggtttc cgtcttcacg gacttcgttg ctttccagtt tagcaatacg cttactccca
30121 tccgagataa caccttcgta atactcacgc tgctcgttga gttttgattt tgctgtttca
30181 agctcaacac gcagtttccc tactgttagc gcaatatcct cgttctcctg gtcgcggcgt
30241 ttgatgtatt gctggtttct ttcccgttca tccagcagtt ccagcacaat cgatggtgtt
30301 accaattcat ggaaaaggtc tgcgtcaaat ccccagtcgt catgcattgc ctgctctgcc
30361 gcttcacgca gtgcctgaga gttaatttcg ctcacttcga acctctctgt ttactgataa
30421 gttccagatc ctcctggcaa cttgcacaag tccgacaacc ctgaacgacc aggcgtcttc
30481 gttcatctat cggatcgcca cactcacaac aatgagtggc agatatagcc tggtggttca
30541 ggcggcgcat ttttattgct gtgttgcgct gtaattcttc tatttctgat gctgaatcaa
30601 tgatgtctgc catctttcat taatccctga actgttggtt aatacgcttg agggtgaatg
30661 cgaataataa aaaaggagcc tgtagctccc tgatgatttt gcttttcatg ttcatcgttc
30721 cttaaagacg ccgtttaaca tgccgattgc caggcttaaa tgagtcggtg tgaatcccat
30781 cagcgttacc gtttcgcggt gcttcttcag tacgctacgg caaatgtcat cgacgttttt
30841 atccggaaac tgctgtctgg ctttttttga tttcagaatt agcctgacgg gcaatgctgc
30901 gaagggcgtt ttcctgctga ggtgtcattg aacaagtccc atgtcggcaa gcataagcac
30961 acagaatatg aagcccgctg ccagaaaaat gcattccgtg gttgtcatac ctggtttctc
31021 tcatctgctt ctgctttcgc caccatcatt tccagctttt gtgaaaggga tgcggctaac
31081 gtatgaaatt cttcgtctgt ttctactggt attggcacaa acctgattcc aatttgagca
31141 aggctatgtg ccatctcgat actcgttctt aactcaacag aagatgcttt gtgcatacag
31201 cccctcgttt attatttatc tcctcagcca gccgctgtgc tttcagtgga tttcggataa
31261 cagaaaggcc gggaaatacc cagcctcgct ttgtaacgga gtagacgaaa gtgattgcgc
31321 ctacccggat attatcgtga ggatgcgtca tcgccattgc tccccaaata caaaaccaat
31381 ttcagccagt gcctcgtcca ttttttcgat gaactccggc acgatctcgt caaaactcgc
31441 catgtacttt tcatcccgct caatcacgac ataatgcagg ccttcacgct tcatacgcgg
31501 gtcatagttg gcaaagtacc aggcattttt tcgcgtcacc cacatgctgt actgcacctg
31561 ggccatgtaa gctgacttta tggcctcgaa accaccgagc cggaacttca tgaaatcccg
31621 ggaggtaaac gggcatttca gttcaaggcc gttgccgtca ctgcataaac catcgggaga
31681 gcaggcggta cgcatacttt cgtcgcgata gatgatcggg gattcagtaa cattcacgcc
31741 ggaagtgaat tcaaacaggg ttctggcgtc gttctcgtac tgttttcccc aggccagtgc
31801 tttagcgtta acttccggag ccacaccggt gcaaacctca gcaagcaggg tgtggaagta
31861 ggacattttc atgtcaggcc acttctttcc ggagcggggt tttgctatca cgttgtgaac
31921 ttctgaagcg gtgatgacgc cgagccgtaa tttgtgccac gcatcatccc cctgttcgac
31981 agctctcaca tcgatcccgg tacgctgcag gataatgtcc ggtgtcatgc tgccaccttc
32041 tgctctgcgg ctttctgttt caggaatcca agagctttta ctgcttcggc ctgtgtcagt
32101 tctgacgatg cacgaatgtc gcggcgaaat atctgggaac agagcggcaa taagtcgtca
32161 tcccatgttt tatccagggc gatcagcaga gtgttaatct cctgcatggt ttcatcgtta
32221 accggagtga tgtcgcgttc cggctgacgt tctgcagtgt atgcagtatt ttcgacaatg
32281 cgctcggctt catccttgtc atagatacca gcaaatccga aggccagacg ggcacactga
32341 atcatggctt tatgacgtaa catccgtttg ggatgcgact gccacggccc cgtgatttct
32401 ctgccttcgc gagttttgaa tggttcgcgg cggcattcat ccatccattc ggtaacgcag
32461 atcggatgat tacggtcctt gcggtaaatc cggcatgtac aggattcatt gtcctgctca
32521 aagtccatgc catcaaactg ctggttttca ttgatgatgc gggaccagcc atcaacgccc
32581 accaccggaa cgatgccatt ctgcttatca ggaaaggcgt aaatttcttt cgtccacgga
32641 ttaaggccgt actggttggc aacgatcagt aatgcgatga actgcgcatc gctggcatca
32701 cctttaaatg ccgtctggcg aagagtggtg atcagttcct gtgggtcgac agaatccatg
32761 ccgacacgtt cagccagctt cccagccagc gttgcgagtg cagtactcat tcgttttata
32821 cctctgaatc aatatcaacc tggtggtgag caatggtttc aaccatgtac cggatgtgtt
32881 ctgccatgcg ctcctgaaac tcaacatcgt catcaaacgc acgggtaatg gattttttgc
32941 tggccccgtg gcgttgcaaa tgatcgatgc atagcgattc aaacaggtgc tggggcaggc
33001 ctttttccat gtcgtctgcc agttctgcct ctttctcttc acgggcgagc tgctggtagt
33061 gacgcgccca gctctgagcc tcaagacgat cctgaatgta ataagcgttc atggctgaac
33121 tcctgaaata gctgtgaaaa tatcgcccgc gaaatgccgg gctgattagg aaaacaggaa
33181 agggggttag tgaatgcttt tgcttgatct cagtttcagt attaatatcc attttttata
33241 agcgtcgacg gcttcacgaa acatcttttc atcgccaata aaagtggcga tagtgaattt
33301 agtctggata gccataagtg tttgatccat tctttgggac tcctggctga ttaagtatgt
33361 cgataaggcg tttccatccg tcacgtaatt tacgggtgat tcgttcaagt aaagattcgg
33421 aagggcagcc agcaacaggc caccctgcaa tggcatattg catggtgtgc tccttattta
33481 tacataacga aaaacgcctc gagtgaagcg ttattggtat gcggtaaaac cgcactcagg
33541 cggccttgat agtcatatca tctgaatcaa atattcctga tgtatcgata tcggtaattc
33601 ttattccttc gctaccatcc attggaggcc atccttcctg accatttcca tcattccagt
33661 cgaactcaca cacaacacca tatgcattta agtcgcttga aattgctata agcagagcat
33721 gttgcgccag catgattaat acagcattta atacagagcc gtgtttattg agtcggtatt
33781 cagagtctga ccagaaatta ttaatctggt gaagtttttc ctctgtcatt acgtcatggt
33841 cgatttcaat ttctattgat gctttccagt cgtaatcaat gatgtatttt ttgatgtttg
33901 acatctgttc atatcctcac agataaaaaa tcgccctcac actggagggc aaagaagatt
33961 tccaataatc agaacaagtc ggctcctgtt tagttacgag cgacattgct ccgtgtattc
34021 actcgttgga atgaatacac agtgcagtgt ttattctgtt atttatgcca aaaataaagg
34081 ccactatcag gcagctttgt tgttctgttt accaagttct ctggcaatca ttgccgtcgt
34141 tcgtattgcc catttatcga catatttccc atcttccatt acaggaaaca tttcttcagg
34201 cttaaccatg cattccgatt gcagcttgca tccattgcat cgcttgaatt gtccacacca
34261 ttgattttta tcaatagtcg tagtcatacg gatagtcctg gtattgttcc atcacatcct
34321 gaggatgctc ttcgaactct tcaaattctt cttccatata tcaccttaaa tagtggattg
34381 cggtagtaaa gattgtgcct gtcttttaac cacatcaggc tcggtggttc tcgtgtaccc
34441 ctacagcgag aaatcggata aactattaca acccctacag tttgatgagt atagaaatgg
34501 atccactcgt tattctcgga cgagtgttca gtaatgaacc tctggagaga accatgtata
34561 tgatcgttat ctgggttgga cttctgcttt taagcccaga taactggcct gaatatgtta
34621 atgagagaat cggtattcct catgtgtggc atgttttcgt ctttgctctt gcattttcgc
34681 tagcaattaa tgtgcatcga ttatcagcta ttgccagcgc cagatataag cgatttaagc
34741 taagaaaacg cattaagatg caaaacgata aagtgcgatc agtaattcaa aaccttacag
34801 aagagcaatc tatggttttg tgcgcagccc ttaatgaagg caggaagtat gtggttacat
34861 caaaacaatt cccatacatt agtgagttga ttgagcttgg tgtgttgaac aaaacttttt
34921 cccgatggaa tggaaagcat atattattcc ctattgagga tatttactgg actgaattag
34981 ttgccagcta tgatccatat aatattgaga taaagccaag gccaatatct aagtaactag
35041 ataagaggaa tcgattttcc cttaattttc tggcgtccac tgcatgttat gccgcgttcg
35101 ccaggcttgc tgtaccatgt gcgctgattc ttgcgctcaa tacgttgcag gttgctttca
35161 atctgtttgt ggtattcagc cagcactgta aggtctatcg gatttagtgc gctttctact
35221 cgtgatttcg gtttgcgatt cagcgagaga atagggcggt taactggttt tgcgcttacc
35281 ccaaccaaca ggggatttgc tgctttccat tgagcctgtt tctctgcgcg acgttcgcgg
35341 cggcgtgttt gtgcatccat ctggattctc ctgtcagtta gctttggtgg tgtgtggcag
35401 ttgtagtcct gaacgaaaac cccccgcgat tggcacattg gcagctaatc cggaatcgca
35461 cttacggcca atgcttcgtt tcgtatcaca caccccaaag ccttctgctt tgaatgctgc
35521 ccttcttcag ggcttaattt ttaagagcgt caccttcatg gtggtcagtg cgtcctgctg
35581 atgtgctcag tatcaccgcc agtggtattt atgtcaacac cgccagagat aatttatcac
35641 cgcagatggt tatctgtatg ttttttatat gaatttattt tttgcagggg ggcattgttt
35701 ggtaggtgag agatctgaat tgctatgttt agtgagttgt atctatttat ttttcaataa
35761 atacaattgg ttatgtgttt tgggggcgat cgtgaggcaa agaaaacccg gcgctgaggc
35821 cgggttattc ttgttctctg gtcaaattat atagttggaa aacaaggatg catatatgaa
35881 tgaacgatgc agaggcaatg ccgatggcga tagtgggtat catgtagccg cttatgctgg
35941 aaagaagcaa taacccgcag aaaaacaaag ctccaagctc aacaaaacta agggcataga
36001 caataactac cgatgtcata tacccatact ctctaatctt ggccagtcgg cgcgttctgc
36061 ttccgattag aaacgtcaag gcagcaatca ggattgcaat catggttcct gcatatgatg
36121 acaatgtcgc cccaagacca tctctatgag ctgaaaaaga aacaccagga atgtagtggc
36181 ggaaaaggag atagcaaatg cttacgataa cgtaaggaat tattactatg taaacaccag
36241 gcatgattct gttccgcata attactcctg ataattaatc cttaactttg cccacctgcc
36301 ttttaaaaca ttccagtata tcacttttca ttcttgcgta gcaatatgcc atctcttcag
36361 ctatctcagc attggtgacc ttgttcagag gcgctgagag atggcctttt tctgatagat
36421 aatgttctgt taaaatatct ccggcctcat cttttgcccg caggctaatg tctgaaaatt
36481 gaggtgacgg gttaaaaata atatccttgg caaccttttt tatatccctt ttaaattttg
36541 gcttaatgac tatatccaat gagtcaaaaa gctccccttc aatatctgtt gcccctaaga
36601 cctttaatat atcgccaaat acaggtagct tggcttctac cttcaccgtt gttcggccga
36661 tgaaatgcat atgcataaca tcgtctttgg tggttcccct catcagtggc tctatctgaa
36721 cgcgctctcc actgcttaat gacattcctt tcccgattaa aaaatctgtc agatcggatg
36781 tggtcggccc gaaaacagtt ctggcaaaac caatggtgtc gccttcaaca aacaaaaaag
36841 atgggaatcc caatgattcg tcatctgcga ggctgttctt aatatcttca actgaagctt
36901 tagagcgatt tatcttctga accagactct tgtcatttgt tttggtaaag agaaaagttt
36961 ttccatcgat tttatgaata tacaaataat tggagccaac ctgcaggtga tgattatcag
37021 ccagcagaga attaaggaaa acagacaggt ttattgagcg cttatctttc cctttatttt
37081 tgctgcggta agtcgcataa aaaccattct tcataattca atccatttac tatgttatgt
37141 tctgagggga gtgaaaattc ccctaattcg atgaagattc ttgctcaatt gttatcagct
37201 atgcgccgac cagaacacct tgccgatcag ccaaacgtct cttcaggcca ctgactagcg
37261 ataactttcc ccacaacgga acaactctca ttgcatggga tcattgggta ctgtgggttt
37321 agtggttgta aaaacacctg accgctatcc ctgatcagtt tcttgaaggt aaactcatca
37381 cccccaagtc tggctatgca gaaatcacct ggctcaacag cctgctcagg gtcaacgaga
37441 attaacattc cgtcaggaaa gcttggcttg gagcctgttg gtgcggtcat ggaattacct
37501 tcaacctcaa gccagaatgc agaatcactg gcttttttgg ttgtgcttac ccatctctcc
37561 gcatcacctt tggtaaaggt tctaagctca ggtgagaaca tccctgcctg aacatgagaa
37621 aaaacagggt actcatactc acttctaagt gacggctgca tactaaccgc ttcatacatc
37681 tcgtagattt ctctggcgat tgaagggcta aattcttcaa cgctaacttt gagaattttt
37741 gcaagcaatg cggcgttata agcatttaat gcattgatgc cattaaataa agcaccaacg
37801 cctgactgcc ccatccccat cttgtctgcg acagattcct gggataagcc aagttcattt
37861 ttcttttttt cataaattgc tttaaggcga cgtgcgtcct caagctgctc ttgtgttaat
37921 ggtttctttt ttgtgctcat acgttaaatc tatcaccgca agggataaat atctaacacc
37981 gtgcgtgttg actattttac ctctggcggt gataatggtt gcatgtacta aggaggttgt
38041 atggaacaac gcataaccct gaaagattat gcaatgcgct ttgggcaaac caagacagct
38101 aaagatctcg gcgtatatca aagcgcgatc aacaaggcca ttcatgcagg ccgaaagatt
38161 tttttaacta taaacgctga tggaagcgtt tatgcggaag aggtaaagcc cttcccgagt
38221 aacaaaaaaa caacagcata aataaccccg ctcttacaca ttccagccct gaaaaagggc
38281 atcaaattaa accacaccta tggtgtatgc atttatttgc atacattcaa tcaattgtta
38341 tctaaggaaa tacttacata tggttcgtgc aaacaaacgc aacgaggctc tacgaatcga
38401 gagtgcgttg cttaacaaaa tcgcaatgct tggaactgag aagacagcgg aagctgtggg
38461 cgttgataag tcgcagatca gcaggtggaa gagggactgg attccaaagt tctcaatgct
38521 gcttgctgtt cttgaatggg gggtcgttga cgacgacatg gctcgattgg cgcgacaagt
38581 tgctgcgatt ctcaccaata aaaaacgccc ggcggcaacc gagcgttctg aacaaatcca
38641 gatggagttc tgaggtcatt actggatcta tcaacaggag tcattatgac aaatacagca
38701 aaaatactca acttcggcag aggtaacttt gccggacagg agcgtaatgt ggcagatctc
38761 gatgatggtt acgccagact atcaaatatg ctgcttgagg cttattcggg cgcagatctg
38821 accaagcgac agtttaaagt gctgcttgcc attctgcgta aaacctatgg gtggaataaa
38881 ccaatggaca gaatcaccga ttctcaactt agcgagatta caaagttacc tgtcaaacgg
38941 tgcaatgaag ccaagttaga actcgtcaga atgaatatta tcaagcagca aggcggcatg
39001 tttggaccaa ataaaaacat ctcagaatgg tgcatccctc aaaacgaggg aaaatcccct
39061 aaaacgaggg ataaaacatc cctcaaattg ggggattgct atccctcaaa acagggggac
39121 acaaaagaca ctattacaaa agaaaaaaga aaagattatt cgtcagagaa ttctggcgaa
39181 tcctctgacc agccagaaaa cgacctttct gtggtgaaac cggatgctgc aattcagagc
39241 ggcagcaagt gggggacagc agaagacctg accgccgcag agtggatgtt tgacatggtg
39301 aagactatcg caccatcagc cagaaaaccg aattttgctg ggtgggctaa cgatatccgc
39361 ctgatgcgtg aacgtgacgg acgtaaccac cgcgacatgt gtgtgctgtt ccgctgggca
39421 tgccaggaca acttctggtc cggtaacgtg ctgagcccgg ccaaactccg cgataagtgg
39481 acccaactcg aaatcaaccg taacaagcaa caggcaggcg tgacagccag caaaccaaaa
39541 ctcgacctga caaacacaga ctggatttac ggggtggatc tatgaaaaac atcgccgcac
39601 agatggttaa ctttgaccgt gagcagatgc gtcggatcgc caacaacatg ccggaacagt
39661 acgacgaaaa gccgcaggta cagcaggtag cgcagatcat caacggtgtg ttcagccagt
39721 tactggcaac tttcccggcg agcctggcta accgtgacca gaacgaagtg aacgaaatcc
39781 gtcgccagtg ggttctggct tttcgggaaa acgggatcac cacgatggaa caggttaacg
39841 caggaatgcg cgtagcccgt cggcagaatc gaccatttct gccatcaccc gggcagtttg
39901 ttgcatggtg ccgggaagaa gcatccgtta ccgccggact gccaaacgtc agcgagctgg
39961 ttgatatggt ttacgagtat tgccggaagc gaggcctgta tccggatgcg gagtcttatc
40021 cgtggaaatc aaacgcgcac tactggctgg ttaccaacct gtatcagaac atgcgggcca
40081 atgcgcttac tgatgcggaa ttacgccgta aggccgcaga tgagcttgtc catatgactg
40141 cgagaattaa ccgtggtgag gcgatccctg aaccagtaaa acaacttcct gtcatgggcg
40201 gtagacctct aaatcgtgca caggctctgg cgaagatcgc agaaatcaaa gctaagttcg
40261 gactgaaagg agcaagtgta tgacgggcaa agaggcaatt attcattacc tggggacgca
40321 taatagcttc tgtgcgccgg acgttgccgc gctaacaggc gcaacagtaa ccagcataaa
40381 tcaggccgcg gctaaaatgg cacgggcagg tcttctggtt atcgaaggta aggtctggcg
40441 aacggtgtat taccggtttg ctaccaggga agaacgggaa ggaaagatga gcacgaacct
40501 ggtttttaag gagtgtcgcc agagtgccgc gatgaaacgg gtattggcgg tatatggagt
40561 taaaagatga ccatctacat tactgagcta ataacaggcc tgctggtaat cgcaggcctt
40621 tttatttggg ggagagggaa gtcatgaaaa aactaacctt tgaaattcga tctccagcac
40681 atcagcaaaa cgctattcac gcagtacagc aaatccttcc agacccaacc aaaccaatcg
40741 tagtaaccat tcaggaacgc aaccgcagct tagaccaaaa caggaagcta tgggcctgct
40801 taggtgacgt ctctcgtcag gttgaatggc atggtcgctg gctggatgca gaaagctgga
40861 agtgtgtgtt taccgcagca ttaaagcagc aggatgttgt tcctaacctt gccgggaatg
40921 gctttgtggt aataggccag tcaaccagca ggatgcgtgt aggcgaattt gcggagctat
40981 tagagcttat acaggcattc ggtacagagc gtggcgttaa gtggtcagac gaagcgagac
41041 tggctctgga gtggaaagcg agatggggag acagggctgc atgataaatg tcgttagttt
41101 ctccggtggc aggacgtcag catatttgct ctggctaatg gagcaaaagc gacgggcagg
41161 taaagacgtg cattacgttt tcatggatac aggttgtgaa catccaatga catatcggtt
41221 tgtcagggaa gttgtgaagt tctgggatat accgctcacc gtattgcagg ttgatatcaa
41281 cccggagctt ggacagccaa atggttatac ggtatgggaa ccaaaggata ttcagacgcg
41341 aatgcctgtt ctgaagccat ttatcgatat ggtaaagaaa tatggcactc catacgtcgg
41401 cggcgcgttc tgcactgaca gattaaaact cgttcccttc accaaatact gtgatgacca
41461 tttcgggcga gggaattaca ccacgtggat tggcatcaga gctgatgaac cgaagcggct
41521 aaagccaaag cctggaatca gatatcttgc tgaactgtca gactttgaga aggaagatat
41581 cctcgcatgg tggaagcaac aaccattcga tttgcaaata ccggaacatc tcggtaactg
41641 catattctgc attaaaaaat caacgcaaaa aatcggactt gcctgcaaag atgaggaggg
41701 attgcagcgt gtttttaatg aggtcatcac gggatcccat gtgcgtgacg gacatcggga
41761 aacgccaaag gagattatgt accgaggaag aatgtcgctg gacggtatcg cgaaaatgta
41821 ttcagaaaat gattatcaag ccctgtatca ggacatggta cgagctaaaa gattcgatac
41881 cggctcttgt tctgagtcat gcgaaatatt tggagggcag cttgatttcg acttcgggag
41941 ggaagctgca tgatgcgatg ttatcggtgc ggtgaatgca aagaagataa ccgcttccga
42001 ccaaatcaac cttactggaa tcgatggtgt ctccggtgtg aaagaacacc aacaggggtg
42061 ttaccactac cgcaggaaaa ggaggacgtg tggcgagaca gcgacgaagt atcaccgaca
42121 taatctgcga aaactgcaaa taccttccaa cgaaacgcac cagaaataaa cccaagccaa
42181 tcccaaaaga atctgacgta aaaaccttca actacacggc tcacctgtgg gatatccggt
42241 ggctaagacg tcgtgcgagg aaaacaaggt gattgaccaa aatcgaagtt acgaacaaga
42301 aagcgtcgag cgagctttaa cgtgcgctaa ctgcggtcag aagctgcatg tgctggaagt
42361 tcacgtgtgt gagcactgct gcgcagaact gatgagcgat ccgaatagct cgatgcacga
42421 ggaagaagat gatggctaaa ccagcgcgaa gacgatgtaa aaacgatgaa tgccgggaat
42481 ggtttcaccc tgcattcgct aatcagtggt ggtgctctcc agagtgtgga accaagatag
42541 cactcgaacg acgaagtaaa gaacgcgaaa aagcggaaaa agcagcagag aagaaacgac
42601 gacgagagga gcagaaacag aaagataaac ttaagattcg aaaactcgcc ttaaagcccc
42661 gcagttactg gattaaacaa gcccaacaag ccgtaaacgc cttcatcaga gaaagagacc
42721 gcgacttacc atgtatctcg tgcggaacgc tcacgtctgc tcagtgggat gccggacatt
42781 accggacaac tgctgcggca cctcaactcc gatttaatga acgcaatatt cacaagcaat
42841 gcgtggtgtg caaccagcac aaaagcggaa atctcgttcc gtatcgcgtc gaactgatta
42901 gccgcatcgg gcaggaagca gtagacgaaa tcgaatcaaa ccataaccgc catcgctgga
42961 ctatcgaaga gtgcaaggcg atcaaggcag agtaccaaca gaaactcaaa gacctgcgaa
43021 atagcagaag tgaggccgca tgacgttctc agtaaaaacc attccagaca tgctcgttga
43081 agcatacgga aatcagacag aagtagcacg cagactgaaa tgtagtcgcg gtacggtcag
43141 aaaatacgtt gatgataaag acgggaaaat gcacgccatc gtcaacgacg ttctcatggt
43201 tcatcgcgga tggagtgaaa gagatgcgct attacgaaaa aattgatggc agcaaatacc
43261 gaaatatttg ggtagttggc gatctgcacg gatgctacac gaacctgatg aacaaactgg
43321 atacgattgg attcgacaac aaaaaagacc tgcttatctc ggtgggcgat ttggttgatc
43381 gtggtgcaga gaacgttgaa tgcctggaat taatcacatt cccctggttc agagctgtac
43441 gtggaaacca tgagcaaatg atgattgatg gcttatcaga gcgtggaaac gttaatcact
43501 ggctgcttaa tggcggtggc tggttcttta atctcgatta cgacaaagaa attctggcta
43561 aagctcttgc ccataaagca gatgaacttc cgttaatcat cgaactggtg agcaaagata
43621 aaaaatatgt tatctgccac gccgattatc cctttgacga atacgagttt ggaaagccag
43681 ttgatcatca gcaggtaatc tggaaccgcg aacgaatcag caactcacaa aacgggatcg
43741 tgaaagaaat caaaggcgcg gacacgttca tctttggtca tacgccagca gtgaaaccac
43801 tcaagtttgc caaccaaatg tatatcgata ccggcgcagt gttctgcgga aacctaacat
43861 tgattcaggt acagggagaa ggcgcatgag actcgaaagc gtagctaaat ttcattcgcc
43921 aaaaagcccg atgatgagcg actcaccacg ggccacggct tctgactctc tttccggtac
43981 tgatgtgatg gctgctatgg ggatggcgca atcacaagcc ggattcggta tggctgcatt
44041 ctgcggtaag cacgaactca gccagaacga caaacaaaag gctatcaact atctgatgca
44101 atttgcacac aaggtatcgg ggaaataccg tggtgtggca aagcttgaag gaaatactaa
44161 ggcaaaggta ctgcaagtgc tcgcaacatt cgcttatgcg gattattgcc gtagtgccgc
44221 gacgccgggg gcaagatgca gagattgcca tggtacaggc cgtgcggttg atattgccaa
44281 aacagagctg tgggggagag ttgtcgagaa agagtgcgga agatgcaaag gcgtcggcta
44341 ttcaaggatg ccagcaagcg cagcatatcg cgctgtgacg atgctaatcc caaaccttac
44401 ccaacccacc tggtcacgca ctgttaagcc gctgtatgac gctctggtgg tgcaatgcca
44461 caaagaagag tcaatcgcag acaacatttt gaatgcggtc acacgttagc agcatgattg
44521 ccacggatgg caacatatta acggcatgat attgacttat tgaataaaat tgggtaaatt
44581 tgactcaacg atgggttaat tcgctcgttg tggtagtgag atgaaaagag gcggcgctta
44641 ctaccgattc cgcctagttg gtcacttcga cgtatcgtct ggaactccaa ccatcgcagg
44701 cagagaggtc tgcaaaatgc aatcccgaaa cagttcgcag gtaatagtta gagcctgcat
44761 aacggtttcg ggatttttta tatctgcaca acaggtaaga gcattgagtc gataatcgtg
44821 aagagtcggc gagcctggtt agccagtgct ctttccgttg tgctgaatta agcgaatacc
44881 ggaagcagaa ccggatcacc aaatgcgtac aggcgtcatc gccgcccagc aacagcacaa
44941 cccaaactga gccgtagcca ctgtctgtcc tgaattcatt agtaatagtt acgctgcggc
45001 cttttacaca tgaccttcgt gaaagcgggt ggcaggaggt cgcgctaaca acctcctgcc
45061 gttttgcccg tgcatatcgg tcacgaacaa atctgattac taaacacagt agcctggatt
45121 tgttctatca gtaatcgacc ttattcctaa ttaaatagag caaatcccct tattgggggt
45181 aagacatgaa gatgccagaa aaacatgacc tgttggccgc cattctcgcg gcaaaggaac
45241 aaggcatcgg ggcaatcctt gcgtttgcaa tggcgtacct tcgcggcaga tataatggcg
45301 gtgcgtttac aaaaacagta atcgacgcaa cgatgtgcgc cattatcgcc tggttcattc
45361 gtgaccttct cgacttcgcc ggactaagta gcaatctcgc ttatataacg agcgtgttta
45421 tcggctacat cggtactgac tcgattggtt cgcttatcaa acgcttcgct gctaaaaaag
45481 ccggagtaga agatggtaga aatcaataat caacgtaagg cgttcctcga tatgctggcg
45541 tggtcggagg gaactgataa cggacgtcag aaaaccagaa atcatggtta tgacgtcatt
45601 gtaggcggag agctatttac tgattactcc gatcaccctc gcaaacttgt cacgctaaac
45661 ccaaaactca aatcaacagg cgccggacgc taccagcttc tttcccgttg gtgggatgcc
45721 taccgcaagc agcttggcct gaaagacttc tctccgaaaa gtcaggacgc tgtggcattg
45781 cagcagatta aggagcgtgg cgctttacct atgattgatc gtggtgatat ccgtcaggca
45841 atcgaccgtt gcagcaatat ctgggcttca ctgccgggcg ctggttatgg tcagttcgag
45901 cataaggctg acagcctgat tgcaaaattc aaagaagcgg gcggaacggt cagagagatt
45961 gatgtatgag cagagtcacc gcgattatct ccgctctggt tatctgcatc atcgtctgcc
46021 tgtcatgggc tgttaatcat taccgtgata acgccattac ctacaaagcc cagcgcgaca
46081 aaaatgccag agaactgaag ctggcgaacg cggcaattac tgacatgcag atgcgtcagc
46141 gtgatgttgc tgcgctcgat gcaaaataca cgaaggagtt agctgatgct aaagctgaaa
46201 atgatgctct gcgtgatgat gttgccgctg gtcgtcgtcg gttgcacatc aaagcagtct
46261 gtcagtcagt gcgtgaagcc accaccgcct ccggcgtgga taatgcagcc tccccccgac
46321 tggcagacac cgctgaacgg gattatttca ccctcagaga gaggctgatc actatgcaaa
46381 aacaactgga aggaacccag aagtatatta atgagcagtg cagatagagt tgcccatatc
46441 gatgggcaac tcatgcaatt attgtgagca atacacacgc gcttccagcg gagtataaat
46501 gcctaaagta ataaaaccga gcaatccatt tacgaatgtt tgctgggttt ctgttttaac
46561 aacattttct gcgccgccac aaattttggc tgcatcgaca gttttcttct gcccaattcc
46621 agaaacgaag aaatgatggg tgatggtttc ctttggtgct actgctgccg gtttgttttg
46681 aacagtaaac gtctgttgag cacatcctgt aataagcagg gccagcgcag tagcgagtag
46741 catttttttc atggtgttat tcccgatgct ttttgaagtt cgcagaatcg tatgtgtaga
46801 aaattaaaca aaccctaaac aatgagttga aatttcatat tgttaatatt tattaatgta
46861 tgtcaggtgc gatgaatcgt cattgtattc ccggattaac tatgtccaca gccctgacgg
46921 ggaacttctc tgcgggagtg tccgggaata attaaaacga tgcacacagg gtttagcgcg
46981 tacacgtatt gcattatgcc aacgccccgg tgctgacacg gaagaaaccg gacgttatga
47041 tttagcgtgg aaagatttgt gtagtgttct gaatgctctc agtaaatagt aatgaattat
47101 caaaggtata gtaatatctt ttatgttcat ggatatttgt aacccatcgg aaaactcctg
47161 ctttagcaag attttccctg tattgctgaa atgtgatttc tcttgatttc aacctatcat
47221 aggacgtttc tataagatgc gtgtttcttg agaatttaac atttacaacc tttttaagtc
47281 cttttattaa cacggtgtta tcgttttcta acacgatgtg aatattatct gtggctagat
47341 agtaaatata atgtgagacg ttgtgacgtt ttagttcaga ataaaacaat tcacagtcta
47401 aatcttttcg cacttgatcg aatatttctt taaaaatggc aacctgagcc attggtaaaa
47461 ccttccatgt gatacgaggg cgcgtagttt gcattatcgt ttttatcgtt tcaatctggt
47521 ctgacctcct tgtgttttgt tgatgattta tgtcaaatat taggaatgtt ttcacttaat
47581 agtattggtt gcgtaacaaa gtgcggtcct gctggcattc tggagggaaa tacaaccgac
47641 agatgtatgt aaggccaacg tgctcaaatc ttcatacaga aagatttgaa gtaatatttt
47701 aaccgctaga tgaagagcaa gcgcatggag cgacaaaatg aataaagaac aatctgctga
47761 tgatccctcc gtggatctga ttcgtgtaaa aaatatgctt aatagcacca tttctatgag
47821 ttaccctgat gttgtaattg catgtataga acataaggtg tctctggaag cattcagagc
47881 aattgaggca gcgttggtga agcacgataa taatatgaag gattattccc tggtggttga
47941 ctgatcacca taactgctaa tcattcaaac tatttagtct gtgacagagc caacacgcag
48001 tctgtcactg tcaggaaagt ggtaaaactg caactcaatt actgcaatgc cctcgtaatt
48061 aagtgaattt acaatatcgt cctgttcgga gggaagaacg cgggatgttc attcttcatc
48121 acttttaatt gatgtatatg ctctcttttc tgacgttagt ctccgacggc aggcttcaat
48181 gacccaggct gagaaattcc cggacccttt ttgctcaaga gcgatgttaa tttgttcaat
48241 catttggtta ggaaagcgga tgttgcgggt tgttgttctg cgggttctgt tcttcgttga
48301 catgaggttg ccccgtattc agtgtcgctg atttgtattg tctgaagttg tttttacgtt
48361 aagttgatgc agatcaatta atacgatacc tgcgtcataa ttgattattt gacgtggttt
48421 gatggcctcc acgcacgttg tgatatgtag atgataatca ttatcacttt acgggtcctt
48481 tccggtgatc cgacaggtta cg
//
LOCUS MUSIGKCLO 422 bp ss-mRNA ROD 29-JUN-1990
DEFINITION Mouse IgMk rearranged kappa light-chain mRNA variable region
(V-J-kappa) anti-DNA autoantibody.
ACCESSION M20834
KEYWORDS autoantibody; immunoglobulin light chain; kappa-immunoglobulin;
kappa-immunoglobulin light chain; variable region.
SOURCE Mouse (strain MRL/Mp -lpr/lpr) spleen hybridoma cell line
MRL-DNA22, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 422)
AUTHORS Kofler,R., Strohal,R., Balderas,R.S., Johnson,M.E., Noonan,D.J.,
Duchosal,M.A., Dixon,F.J. and Theofilopoulos,A.N.
TITLE Immunoglobulin kappa light chain variable region gene complex
organization and immunoglobulin genes encoding anti-DNA
autoantibodies in lupus mice
JOURNAL J. Clin. Invest. 82, 852-860 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
R. Kofler 28-JUL-1988
FEATURES from to/span description
pept 33 > 422 IgMk light chain precursor
sigp 33 98 Ig light chain signal peptide
matp 99 > 422 Ig light chain
mRNA 1 > 422 IgMk mRNA
recomb 386 387 V-region end/J-region start
BASE COUNT 112 a 109 c 95 g 106 t
ORIGIN Chromosome 6.
1 taattagcta gggaccaaaa ttcaaagaga aaatggattt tcaggtgcag attttcagct
61 tcctgctaat cagtgcctca gtcataatgt ccagaggaca aattgttctc acccagtctc
121 cagcaatcat gtctgcatct cctggggaac gggtcaccat gacctgcagt gccagctcaa
181 gtgtaagttc cagctacttg tactggtacc agcagaagcc aggatcctcc cccaaactat
241 ggatttatag cacatccaac ctggcttctg gagtccctgc tcgcttcagt ggcagtgggt
301 ctgggacctc ttattctctc acaatcagca gcatggaggc tgaagatgct gccacttatt
361 actgccagca gtacagtggt tacccattca cgttcggcac ggggacaaaa ttggaaataa
421 aa
//
LOCUS MUSIGKCLY 312 bp ss-mRNA ROD 29-JUN-1990
DEFINITION Mouse Ig unproductively rearranged kappa-chain VJ2C mRNA from
plasmacytoma BFPC 61B, partial cds.
ACCESSION M12189
KEYWORDS immunoglobulin; immunoglobulin light chain; kappa-immunoglobulin;
variable region.
SOURCE Mouse plasmacytomas, cDNA to mRNA, clone BFPC 61B.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Bernard,O., Gough,N.M. and Adams,J.M.
TITLE Plasmacytomas with more than one immunoglobulin kappa mRNA:
Implications for allelic exclusion
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 5812-5816 (1981)
STANDARD simple staff_entry
FEATURES from to/span description
pept.ps < 1 > 312 Ig kappa-chain VJC-regions (AA at 3)
recomb 245 246 V-region end/J2-region start
recomb 285 286 J2-region end/C-region start
BASE COUNT 75 a 88 c 82 g 67 t
ORIGIN
1 ctgcaccagg ggagaaggtc acaatgactt gcagggccag ctcaagtgta agttacatgg
61 actggtacca gcagaagcca ggatcctccc ccaaaccctg gatttatgcc acatccaccc
121 tagtttctgg agttcctgct cgcttcagtg gcagtgggtc tgggacctct tactctctca
181 caatcagcag agtggaggct gaagatgctg ccacttatta gtgcccgcag tggagtagta
241 acccacccga cgttcggtgg aggcaccaag ctggaaatca aacgggctga tgctgcacca
301 actgtatcca tc
//
LOCUS MUSTCBXZA 396 bp ss-mRNA ROD 29-JUN-1990
DEFINITION Mouse T-cell receptor active beta-chain V-region V11DJ mRNA.
ACCESSION M13676
KEYWORDS T-cell antigen receptor; T-cell receptor;
T-cell receptor beta-chain; antigen receptor; diversity exon;
joining exon; processed gene; variable region.
SOURCE Mouse (C57BL/6) T-helper hybridoma, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 390)
AUTHORS Behlke,M.A., Spinella,D.G., Chou,H.S., Sha,W., Hartl,D.L. and
Loh,D.Y.
TITLE T-cell receptor beta-chain expression: Dependence on relatively few
variable region genes
JOURNAL Science 229, 566-570 (1985)
STANDARD full staff_review
REFERENCE 2 (bases 391 to 396)
AUTHORS Behlke,M.A.
JOURNAL Unpublished (1985) Howard Hughes Med Inst, St.Louis, MO 63110.
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of the sequence in [2] were
kindly provided by M.A.Behlke, 25-OCT-1985.
FEATURES from to/span description
pept 1 > 396 T-cell receptor beta-chain precursor (V11DJ)
sigp 1 57 T-cell receptor beta-chain signal peptide
matp 58 > 396 T-cell receptor beta-chain
recomb 342 343 V11-region end/D-region start
recomb 345 346 D-region end/N-region start
recomb 348 349 N-region end/J-region start
recomb 390 391 J-region end/C-region start
BASE COUNT 113 a 96 c 81 g 106 t
ORIGIN Chromosome 6.
1 atgagctgca ggcttctcct ctatgtttcc ctatgtcttg tggaaacagc actcatgaac
61 actaaaatta ctcagtcacc aagatatcta atcctgggaa gaacaaataa gtctttggaa
121 tgtgagcaac atctgggaca taatgctatg tactggtata aacagagcgc tgagaagccg
181 ccagagctca tgtttctcta caatcttaaa cagttgattc gaaatgagac ggtgcccagt
241 cgttttatac ctgaatgccc agacagctcc aagctacttt tacatatatc tgccgtggat
301 ccagaagact cagctgtcta tttttgtgcc agcagccaga taactagtaa ccaagacacc
361 cagtactttg ggccaggcac tcggctcctc gtgtta
//
LOCUS P18XIS911 115 bp ds-DNA SYN 29-JUN-1990
DEFINITION Plasmid pACYC184/insertion element 91 (IS91), left junction.
ACCESSION K03543
KEYWORDS insertion sequence.
SEGMENT 1 of 2
SOURCE Plasmid pACYC184 and insertion element 91 DNA, clone pSU234.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 115)
AUTHORS Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F.
TITLE Transposition of IS91 does not generate a target duplication
JOURNAL J. Bacteriol. 169, 442-443 (1987)
STANDARD full staff_review
COMMENT No duplication of an insertion target sequence seems to take place
upon the insertion of IS91.
FEATURES from to/span description
recomb 16 17 pBR322 DNA end/IS91 DNA start
rpt 16 24 IS91 inverted terminal repeat
BASE COUNT 26 a 26 c 29 g 34 t
ORIGIN 16 bp upstream of TaqI site.
1 ttaacgaccc tgccctcgag taggcagcct ggcggctgcg gcttgtcatg gtctggaatt
61 accgttataa aaaaagataa tgtcattgtc tttcaggtag ttatatggcc cgttc
//
LOCUS P18XIS912 115 bp ds-DNA SYN 29-JUN-1990
DEFINITION Plasmid pACYC184/insertion element 91 (IS91), right junction.
ACCESSION M15256
KEYWORDS insertion sequence.
SEGMENT 2 of 2
SOURCE Plasmid pACYC184 and insertion element 91 DNA, clone pSU234.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 115)
AUTHORS Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F.
TITLE Transposition of IS91 does not generate a target duplication
JOURNAL J. Bacteriol. 169, 442-443 (1987)
STANDARD full staff_review
COMMENT No duplication of an insertion target sequence seems to take place
upon the insertion of IS91.
FEATURES from to/span description
rpt 93 100 IS91 inverted terminal repeat
recomb 101 102 IS91 DNA end/pACYC184 DNA start
BASE COUNT 25 a 20 c 29 g 41 t
ORIGIN About 1.85 kb after segment 1; 97 bp upstream of TaqI site.
1 tttaatagtc gtttccgctc atatggtgca caaggggtgt tgaagaaaca tccgttttgt
61 ggtgcttttt tagtcttttg gggatttaaa ttcctatcga tgaaccgacg accgg
//
LOCUS PBR322IS2 115 bp ds-DNA SYN 29-JUN-1990
DEFINITION Plasmid pBR322/insertion element 91 (IS91), right junction.
ACCESSION M15257
KEYWORDS insertion sequence.
SEGMENT 2 of 2
SOURCE Plasmid pBR322 and insertion element 91 DNA, clone pSU240.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 115)
AUTHORS Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F.
TITLE Transposition of IS91 does not generate a target duplication
JOURNAL J. Bacteriol. 169, 442-443 (1987)
STANDARD full staff_review
COMMENT No duplication of an insertion target sequence seems to take place
upon the insertion of IS91.
FEATURES from to/span description
rpt 93 100 IS91 inverted terminal repeat
recomb 101 102 IS91 DNA end/pBR322 DNA start
BASE COUNT 25 a 19 c 27 g 44 t
ORIGIN About 1.85 kb after segment 1; 97 bp upstream of TaqI site.
1 tttaatagtc gtttccgctc atatggtgca caaggggtgt tgaagaaaca tccgttttgt
61 ggtgcttttt tagtcttttg gggatttaaa ttcctatcga tcaaggatct taccg
//
LOCUS RATMYCSE 903 bp ds-DNA ROD 29-JUN-1990
DEFINITION Rat c-myc/Ig epsilon translocation DNA.
ACCESSION M23417
KEYWORDS epsilon immunoglobulin; switch region; translocation.
SOURCE Rat (strain LOU) immunocytoma cell line IR162 DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 903)
AUTHORS Tian,S.-S. and Faust,C.
TITLE A simple and precise aberrant translocation of the rat c-myc gene
into the epsilon-heavy chain switch region of the IgE-producing
immunocytoma, IR162
JOURNAL J. Biol. Chem. 264, 1846-1853 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 298 299 Ig epsilon switch region DNA end/c-myc DNA
start
signal 617 621 TATA box
signal 777 781 TATA box
BASE COUNT 212 a 250 c 238 g 203 t
ORIGIN
1 gggcccagtt cagctctgtg ccaggccccc agaaagcata ggatatttct gataaacctg
61 attgtctcta cctaccctga tatagtctgg ctactatatc aggacaatct tcttggagaa
121 cagttaaaaa catccataaa aaatttcgag caatctattt ttttttaaat taatttattt
181 tttggatttt ttttaaactt acatttcaaa tgttatcact tttcccccac ccaaccaccc
241 acccctttcc acctctctgt cctgacattc ccctacactg gtgtgtgtgt ggggtccaga
301 ctaatacacg caggacaaga acacagttca gccgagcgct gcgcccgaac aactgtacag
361 aaagggaaag gaccgacgcg cgagcaggag aaaatggtcg ggcgcgcagt taattcatgc
421 tgcgctatta ctgtttacac cccgagtcgg agtactgggc tgcgggactg aggctcctcc
481 tctttccccg gctccccact agtgcccctc ccgagttccc aaagcagagg gcgggggagc
541 gagaggagga aaaaaataga ggtcgggagg ggagggagaa agagattctc tggctaatcc
601 ccgcccaccc gccctttata atccgggggt ctgcgcggct aaggaccccc gggctgcgct
661 gctctccgct gccgcctccg ccgcgcccac tccgctcgcc tcctgcctcc aaaagggcag
721 ggcttcgccg aggcttggcg ggaaaaagaa gcgaggggag ggatccggag tcgcagtata
781 aaagaagctt ttcgggcgtt ttttttctga ctcgctgtag taattccagc gagagacaga
841 gggagtgagc gggcgggttg gaagagccca gtgtgcagag ccccactccg ggcttcctag
901 gaa
//
LOCUS RATRAFB 2719 bp ss-mRNA ROD 29-JUN-1990
DEFINITION Rat activated c-raf oncogene mRNA encoding a fused protein,
complete cds.
ACCESSION M15428
KEYWORDS fusion protein; oncogene; protein kinase; raf oncogene.
SOURCE Rat primary NIH 3T3 cell transformant IQ7-2, cDNA to mRNA, clone
pA5.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2719)
AUTHORS Ishikawa,F., Takaku,F., Nagao,M. and Sugimura,T.
TITLE Rat c-raf oncogene activation by a rearrangement that produces a
fused protein
JOURNAL Mol. Cell. Biol. 7, 1226-1232 (1987)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by F.Ishikawa, 29-MAY-1987.
The activated rat c-raf gene (see below) is transcribed to produce
a fused mRNA, in which the 5' half of the sequence is replaced by
an unknown rat sequence.
The normal and activated c-raf were each connected to the long
terminal repeat of Rous sarcoma virus and transfected into NIH 3T3
cells. Only the activated form had transforming activity.
FEATURES from to/span description
pept 206 2014 raf fusion protein
mRNA < 1 > 2525 raf mRNA
recomb 898 899 rat DNA of unknown origin end/raf DNA start
site 1032 1032 homology to v-raf start
BASE COUNT 762 a 644 c 698 g 615 t
ORIGIN 123 bp upstream of BglI site.
1 gccgtgcgca tagaggccgg tgcgcggccc ttgctcgttt aacgcgggac tatatttccc
61 agggtccgtc gcgggagtct ccggcgggca ggcgcgcggg agactgcgag cgaggcgccg
121 acggggcggc tcaggcgtct gggtccgcgc atctccttgc tccttcgctt ctccttcagc
181 cgctgctgcc acgaccccgg ccgacatggc ggcggtgttg cagcaagtgc tggagcgccc
241 ggagctgaac aagctgccta agtcgaccca gaacaaactt gagaagttcc tggctgaaca
301 gcagtccgaa atcgactgcc tgaaggggcg gcacgagaaa tttaaggtgg agagtgagca
361 acaatacttt gagatagaga agagactatc ccagagtcag gagaggcttg ttaatgaaac
421 ccgggagtgt cagaacttga ggctggagct tgagaagcta aataaccaag taaaagtatt
481 aactgagaaa aacaaagaac ttgaaactgc tcaagaccgc aatctaggca ttcagagcca
541 gtttacaaga gcaaaggaag agttagaagc tgaaaaaaga gatttaatca gaaccaatga
601 gaggttatct caggaagttg aatatttaac agaggatgtt aaacgtctaa acgaaaaact
661 taaagaaagc aatacaacga agggtgaact tcagttaaag ctggatgaac ttcaagcttc
721 tgatgtcact gtgaagtacc gagaaaaacg cttagaacaa gaaaaggaat tgctacacaa
781 tcaaaattca tggctaaaca cagagttgaa aaccaaaact gatgagctat tggctctagg
841 aagagaaaag ggaaatgaaa ttctggaact taagtgtact cttgaaaaca aaaaggaaga
901 ggatgcaatt cgaagtcaca gtgaatcagc ctcaccttca gccctgtcca gcagccccaa
961 caacctgagc ccaacaggct ggtcacagcc caaaacccct gtgccagcac aaagagagag
1021 ggcgccagga tctgggaccc aggaaaaaaa caaaattagg cctcgtgggc agagagattc
1081 aagttattac tgggaaatag aagccagtga ggtgatgctg tctactcgga ttggctcggg
1141 ctcctttggc actgtgtaca agggcaagtg gcatggagat gttgcagtaa agatcctaaa
1201 ggtggttgac ccaactccag agcaacttca ggccttcagg aacgaggtgg ctgttttgcg
1261 caaaacacgg catgttaata tcctgctgtt catggggtac atgacaaagg acaacctggc
1321 gattgtgacc cagtggtgtg aaggcagcag tctctacaaa cacctgcatg tccaggagac
1381 caaattccag atgttccagc taattgacat tgcccggcag acagctcagg gaatggacta
1441 tttacatgca aagaacatca tccacagaga catgaaatcc aacaatatat ttctccatga
1501 aggcctcacg gtgaaaatcg gagattttgg tttggcaaca gtgaagtcgc gctggagtgg
1561 ttctcagcag gttgaacagc ccactggctc tgtgctgtgg atggccccag aagtaatccg
1621 aatgcaggat aacaacccgt tcagcttcca gtccgatgtc tactcctatg gcattgtgct
1681 gtatgagctg atgactgggg agcttcccta ctcccacatc aacaaccgag accagatcat
1741 cttcatggtg ggccgtgggt acgcctcccc agatcttagc aggctctaca agaactgccc
1801 caaggcaatg aagaggttgg tggctgactg tgtgaagaaa gtcaaagaag aaaggccttt
1861 gtttcctcag atcctgtctt ccattgagct gcttcagcac tctctgccga aaatcaacag
1921 gagcgcctct gagccttccc tgcatcgggc agctcacact gaggacatca atgcttgtac
1981 gctgaccaca tccccaaggc taccagtctt ctagctgacg ttatagctgt tcttaggcca
2041 ccaggggacg aagaagagtc agcaggcacc actttctgtt tccttggggg cagaatgcat
2101 gtttccggaa aagctgctgc taaggaccta gactactcac agggccttaa cttcatattg
2161 ccttcttttc tacccttcct gccctggaaa tggaagctgt ccgccaagcc agcctgctcc
2221 agaggtatac aagtcagcga gtatttttag ggcaaatggc cttggagaga gaaggcaggg
2281 cactccggct actgcaggga catgcagttg ggaacttggc tcattgagct gtacagacag
2341 tggtgcagtg ccagttttgc acatggagtc ctggccacct gggggagcct gctttggtac
2401 tacagaactt cactttgtgg acacaccttc ctcttactga gtctaagatg tcctgtgcag
2461 aggatgcttt ccaagcacgg tgctccacct tctggcagcc tcccacacgc tgaatctgtc
2521 ttccaggagc tgccctatgg ggtgctgcag cccagcccta tctctatagt cacatccttg
2581 tctgtaagaa agccaggaat acaggttttc ttaatgattt tgggttttaa ttttgttttt
2641 attgagcctg ataaaataca gttatctgat ggttcctcaa ttatgttatt ttaataaaat
2701 aaattaaatt taaaaaaaa
//
LOCUS YSCINV 68 bp ss-mRNA SYN 29-JUN-1990
DEFINITION Yeast (S.cerevisiae) synthetic invertase signal peptide region
fused to human interferon gene.
ACCESSION M13346
KEYWORDS gene fusion; invertase signal peptide; signal peptide.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 68)
AUTHORS Chang,C.N., Matteucci,M., Perry,L.J., Wulf,J.J., Chen,C.Y. and
Hitzeman,R.A.
TITLE Saccharomyces cerevisiae secretes and correctly processes human
interferon hybrid proteins containing yeast invertase signal
peptides
JOURNAL Mol. Cell. Biol. 6, 1812-1819 (1986)
STANDARD full staff_review
FEATURES from to/span description
pept 12 > 68 yeast invertase signal/human interferon fusion
peptide
sigp 12 68 invertase signal peptide
recomb 8 9 EcoRI sequence end/yeast invertase signal
start
BASE COUNT 13 a 16 c 12 g 27 t
ORIGIN 3 bp upstream of EcoRI site.
1 ccgaattcat gatgcttttg caagctttcc ttttcctttt ggctggtttt gcagccaaaa
61 tatctgca
//
LOCUS ADGLS 1477 bp ds-DNA VRL 29-JUN-1990
DEFINITION Adenovirus type 7 (simian) left end of genome: 0.0-4.2 map units.
ACCESSION K03325
KEYWORDS alternate splicing; terminal repeat.
SOURCE Adenovirus type 7 (simian) DNA.
ORGANISM Mastadenovirus s7
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 1477)
AUTHORS Kimelman,D., Miller,J.S., Porter,D. and Roberts,B.E.
TITLE E1a regions of the human adenoviruses and of the highly oncogenic
simian adenovirus 7 are closely related
JOURNAL J. Virol. 53, 399-409 (1985)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by D.Kimelman (15-AUG-1985).
The mRNA transcripts (mRNAs I and II) were determined by mRNA-DNA
hybridization and S1 nuclease digestion. The peptide coding
regions were deduced by analogy to other adenoviruses and following
the most likely mode of translation. The intron of the mRNA III
transcribed from the E1a of other adenoviruses has not yet been
determined by S1 mapping.
Enhancer sequences are located at positions 165-173 and 265-270. A
TATA box is located at position 408-413. A polyadenylation signal
is located at posityion 1447-1452.
FEATURES from to/span description
pept 486 837 E1a 28 K protein from mRNA I, exon 1
1178 1407 E1a 28 K protein from mRNA I, exon 2
pept 486 1056 E1a 24 K protein from mRNA II, exon 1
1178 1407 E1a 24 K protein from mRNA II, exon 2
pre-msg 439 1477 E1a mRNA
IVS 838 1177 E1a 24K pept intron A
IVS 1057 1177 E1a 28K pept intron A
BASE COUNT 331 a 334 c 442 g 370 t
ORIGIN 5' end of the l strand of the genome.
1 tctatataat ataccttatt tgggaacggt gccaatatgc taatgaggtg ggcggagttt
61 ggtgacgtat gcggaaatgg gcggagttag gggcggggtt tggcggtagg cgtggctggg
121 ggagtgtccg ggcgtgggaa cggaagtgac gtagggggcg cgccggaggt gacgtcgtgt
181 ggggagtttt aaaccggaag caaggtattt taaacgcttg caagcgcaat tttgtcggtt
241 ttggcgcgaa aactgataaa aagcggaagt tcggttaatc attaattttt acgataggga
301 ggaatattta ccgagggccg gtgaactttg agcggtgacg cggtggtttc gttacgtggc
361 accaccacgc gactgctcaa agtccccgtt tattgtctag gtgagggtat ttaaaccggc
421 tcagaacgtc aagaggccac tcttgagtgc ccgcgagaag agctttctcc tctttcgctg
481 cgaaaatgag acacttggcg ttggaaatga tttctgaact gctggattta ggactggata
541 ccattgatgg ctggctgcac accgaatttc ggccggtacc ggcgggggtg agtcataaca
601 tgtcgctgca cgaaatgtac gacctggacg ttaccggcca ggaggatgag aacgaagagg
661 cggtagatgg tgttttttcc gatgcgatgc tcctggccgc ggaggaggga atagaaatgc
721 ctaatcttta ttctccggga cctctggttg ggggaggtga aatgcctgaa cttcagcctg
781 aggaggaaga tcttttctgc tacgaagatg gcttccctcc cagtgactct gaggaaggtg
841 agcattcgca ggtggagaca gaacgtaaaa tggcggaggc ggcggcagca ggtgcggcgg
901 cggccgcgcg gcgggagcaa gatgactttc gcttagactg tcctagcgta cctggccatg
961 gctgtagctc ctgtgactac catcgcaaaa ctagcggctg tcctgaaatt ctgtgctcgc
1021 tgtgctatct gagggctaac agcatgttta tttatagtaa gtaaattttt tctactaact
1081 ttctcgttgt gtgtttgctc gctcgctcgc tcgcaactgc tggggtgctt gctgttggga
1141 ctgagcttac aggtattttc tctgtaattt tccataggtc cagtttctga ctctgagcca
1201 gacgagcccg actccacaac agctgattca aatcatggca gcccgccaac ccttcgctgc
1261 accccaccca gggacttgcc gcgacctgtg ccagtgaagg cctctcctgg caagcgccca
1321 gcggtgaaca gcttgcatga cctcatagag gaggttgaac aaacagtacc tttggacctg
1381 tccctaaagc gctctaggag caattagggt tataaaaccc ctccccttcc ccttaagtta
1441 tcaggaaata aaaagattaa ctggattctt tgtgcct
//
LOCUS ADGRS 116 bp ds-DNA VRL 29-JUN-1990
DEFINITION adenovirus type 7 (simian) right end of genome.
ACCESSION V00035 J01991
KEYWORDS terminal repeat.
SOURCE adenovirus type 7 (simian) dna.
ORGANISM Mastadenovirus s7
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 116)
AUTHORS Broker,T.R.
TITLE terminal nucleotide sequences and restriction endonuclease cleavage
maps of simian and murine adenoviruses
JOURNAL (in) Tooze,J. (Ed.);
DNA Tumor Viruses, Second Edition Revised: 1037-1046;
Cold Spring Harbor Laboratory, Cold Spring Harbor (1981).
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 116)
AUTHORS Tolun,A., Alestroem,P. and Pettersson,U.
TITLE sequence of inverted terminal repetitions from different
adenoviruses: demonstration of conserved sequences and homology
between sa7 termini and sv40 dna
JOURNAL Cell 17, 705-713 (1979)
STANDARD simple staff_review
COMMENT L-strand is given, from 5' to 3'.
BASE COUNT 25 a 15 c 46 g 30 t
ORIGIN
1 atcaataata taccttattt gggaacggtg ccaatatgct aatgaggtgg gcggagtttg
61 gtgacgtatg cggaaatggg cggagttagg ggcggggttt ggcggtaggc gtggct
//
LOCUS ADGSA7PE1 2338 bp ds-DNA VRL 29-JUN-1990
DEFINITION Simian adenovirus SA7P, leftmost 2338 bp.
ACCESSION X01027
KEYWORDS E1A protein; E1b protein; inverted terminal repeat;
overlapping genes; transforming gene.
SOURCE Simian adenovirus SA 7P.
ORGANISM Mastadenovirus s7
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 2338; enum. 1 to 2338)
AUTHORS Dekker,B.M.M., Konings,D.A.M., Denisova,T.S., Gibadulin,R.A. and
Van Ormondt,H.
TITLE The nucleotide sequence of the leftmost XhoI fragment (6%) of
simian adenovirus SA7P
JOURNAL J. Gen. Virol. 65, 1699-1708 (1984)
STANDARD simple automatic
COMMENT **map: leftmost XhoI fragment (6%);
EMBL features not translated to GenBank features:
key from to description
SITE 1 183 inverted terminal repeat (ITR)
of SA7P
PRM 409 416 TATA E1A region
CAP 440 440 cap site E1A RNAs
MSG 440 838 "12S" E1A RNA
MSG 440 838 1st exon "12S" E1A RNA
IVS 839 1175 intron "12S" E1A RNA
MSG 1176 >1445 2nd exon "12S" E1A RNA
MSG 440 1057 1st exon "13S" E1A RNA
IVS 1058 1175 intron "13S" E1A RNA
MSG 1176 >1445 2nd exon "13S" E1A RNA
SITE 1445 1450 AATAAA E1A
PRM 1508 1514 TATA E1B
CAP 1538 1538 cap E1B RNA's
This virus, an independent isolate from Moscow, USSR, is highly
related to simian adenovirus 7 (SA7). The E1A sequences are 95%
homologous to those of SA7 see $<K03322>$ J. VVrol. 5:399-409(1985)
FEATURES from to/span description
pept 487 838 E1A 21K protein (part 1)
1176 / 1401 E1A 21K protein (part 2)
pept 487 1057 E1A 29K protein (part 1)
1176 / 1401 E1A 29K protein (part 2)
pept 1569 2111 pot. 21K E1B protein
pept 1874 > 2338 pot. 55K E1B protein
BASE COUNT 514 a 535 c 725 g 564 t
ORIGIN
1 catcatcaat aatatacctt atttgggaac ggtgccaata tgctaatgag gtgggcggag
61 tttggtgacg tatgcggaag tgggcggagc aaggggcggg gcgagaggcg gggcttttgg
121 taggcgtggc cgggcgtggg aacggaagtg acgtcggggg cgcgccggac gtgacgtgtt
181 ttgaggcgtt ttaaaccgga agcaaggtat tttaaacgct tgcagcgcaa ttttgccggt
241 tttggcgcga aaactgataa aaagcggaag ttcggttaat cattaatttt tacgataggg
301 aggaatattt accgagggcc ggtgaacttt gagcgatgac gcggtggttt cgttacgtgg
361 caccaccacg cgactgctca aagtccccgt ttattgtcta ggtgagggta tttaaaccgg
421 ctcagaccgt caagaggcca ctcttgagtg cccgcgagaa gagctttctc ctctttcgct
481 gcgaaaatga gacacttggc gttggagatg atgtctgaac tgctggattt aggactggat
541 accatcgata gctggctgca caccgaattc gcgccggtac cgacgggggt gagtcataac
601 atgtcgctgc acgaaatgta cgacctggac gttaccggcc aggaggatga gaacgaagag
661 gcggtagacg gtgttttttc cgatgcgatg ctcctggccg ctgaagaggg agtagaaatg
721 cctagtcttt attctccggg acctctggtt gggggaggtg agatgcctga gctacagcct
781 gaggaggtag atcttttctg ctacgaagat ggcttccctc ccagtgactc tgaggaaggt
841 gagcattcgc aggtggagac ggaacgtaaa atggcggagg tggcggcggc aggtgcggcg
901 gcggccgtcc gcggggagca agatgacttt cgcttagact gtcctagcgt acctggccat
961 ggctgtagct cctgtgacta ccatcgcaaa aatagcggct gtcctgaaat tctgtgctcg
1021 ctgtgctatc tgagggctaa cagcatgttt atttatagta agtgaatttt tctactaact
1081 ttctcgctgt gtgtttgctc gctcgttcgc taactgctgg ggtgcttggt gttgggactg
1141 agcttacagg tatttttctc tgtaattttc cacaggtcca gtttctgact ctgagccaga
1201 cgagcctgac tccacaacag ctgattcaaa tcatggcagc ccgccaaccc ttcgctgcac
1261 cccacccagg gacttgccgc gacctgtgcc agtgaaggcc tctcatggca agcgcccagc
1321 ggtgaacagc ttgcatgacc tcatagagga ggttgaacaa acagtacctt tggacctgtc
1381 cctaaagcgc tctaggagca attagggtca taaaacccct ccccttcccc ttaagttata
1441 aggaaataaa aagattaact ggattctttg tgcctgcttt gtttgttcgt gggcggtcct
1501 tggggattat aaaaggggtg agtcaggtta taagggtact taggcaacgc tcaggtaaca
1561 tcgcctccat ggatctccga acggcgcttc agacttttga gagcacccgc cgcttgctgg
1621 agctctgttc caatagaacc tcttttttgt ggaggtggtt atttggaact ccgctcagcc
1681 ggctggttag gcaggtgaaa ttagaatacg agaaggattt tgaaagaatt ttagatcagt
1741 gtcccggggt gtttgagtcc ctggagctgg gctatcataa ggtttttgag gagaagattg
1801 taaaggagtt ggatttttct tctcccggtc gggcggtcgc ggctgtggcc tttgcttcct
1861 acctgctgga tagatggaac acccggaccc acctgtcccc ggggtaccag atggattaca
1921 tcagcctgaa cctgtggaag ttttggttgc gccggcgggt ttacaattac tcgcgggggc
1981 tgcctcagct agggccggcg gcgccgctgg cgaggcaggg gtcgcagcag gaggagcagc
2041 agcagcggca ggaggaggag caggtgcagg aggagatgag gtccggcctg gaccctccaa
2101 cggagaactg atggcgcagg tggccggcgc ggcggatgcc agcgggccgc cttccaagcg
2161 ccctaagaat gaggcagagc agatgcagga gggtttaact cagttgaccc tgagtctcat
2221 taacaagtat cggcctgaga ccgtgtactg gaatgaactg gagcaggaat ttcagcatgg
2281 ggagatgcac ttgcagttcc actttagctt tgagcaatta aaaactcact ggctcgag
//
LOCUS ADGVARNA 850 bp ds-DNA VRL 29-JUN-1990
DEFINITION Simian adenovirus type 7 (SA7) virus-associated (VA) RNA gene.
ACCESSION M14082
KEYWORDS virus-associated RNA.
SOURCE Simian adenovirus type 7 DNA.
ORGANISM Mastadenovirus s7
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 850)
AUTHORS Larsson,S., Svensson,C. and Akusjaervi,G.
TITLE Characterization of a low-molecular-weight virus-associated (VA)
RNA encoded by simian adenovirus type 7 which functionally can
substitute for adenovirus type 5 VA RNA-1
JOURNAL J. Virol. 60, 635-644 (1986)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by S.Larsson, 15-JAN-1987.
Simian adenovirus type 7 VA RNA stimulates protein synthesis in a
transient expression assay and can functionally substitute for VA
RNA-I during lytic growth of human Ad5. The SA7 genome encodes
only one VA RNA species.
FEATURES from to/span description
RNA 181 352 virus-associated RNA (5' and 3' ends +/-2bp)
ORF 105 < 1 (c) ORF A cds
ORF 366 > 850 ORF B cds
BASE COUNT 140 a 260 c 338 g 112 t
ORIGIN 150 bp upstream of XbaI site; 30 mu on viral map.
1 gtagatgtag cgcgacatcc aggtgaggcc ggcggcggtg gtggcggcgc gggtgaactc
61 gcggacgcgg ttccagaggt tgcgcagggg ggcgaagcgt tgcatggtgg tcacgctctg
121 gccggtgaga cgggcgcagt cctgcacgct ctagacggaa acagagagag gcgtcaccga
181 ctcctctccg tagcttgggg gttaggtcgc aagggtgcgg tggcggggaa ccccggttcg
241 aaaccggccg gatccgccac tcccgacgcg gcggccccgc gtccacgacc ccaccagagg
301 ccgagaccca gccgcgacgc gcgcacccca aatacggagg ggagtctttt ggtgcttttt
361 tgtagatgca tcccgtgctg cggcagatgc gccccacgcg gggggcctcc tcgcgggaag
421 aagagctgga ggcgcaggag gaggcgcgcg gtggcgccgc gggccccgcc gagctggagc
481 tggaggaggg cgaaggcttg gcgcggctgg gcgccggagt gccggagcgc cacccgcggg
541 tgcaactggc tcgcgaccag cgccaggcct acgtgccgcc gcagaacctg tttagggacc
601 ggagcgggga ggaggcggag gaaatgcgtg actgccgttt tcgggccggg cgggagctgc
661 gggcgggctt ggagcgcgag cggctgctgc ggcccgagga ctttgagccc gaggagcgca
721 acggtatcag tccggcgcgc gcgcacgtct cggccgccaa cctggtgacg gcctacgagc
781 agacggtgaa cgaggagcgc aactttcaaa agagcttcaa caaccacgtg cgcaccctcg
841 tggccccggg
//
LOCUS ADZITRL 150 bp ds-DNA VRL 29-JUN-1990
DEFINITION adenovirus 31 inverted terminal repeat & repl ori left end.
ACCESSION J01958
KEYWORDS origin of replication; terminal repeat.
SOURCE ad31 (adenovirus 31).
ORGANISM Mastadenovirus 31
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 150)
AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A.
TITLE conserved sequences at the origin of adenovirus dna replication
JOURNAL J. Virol. 44, 530-537 (1982)
STANDARD simple staff_review
BASE COUNT 38 a 29 c 48 g 35 t
ORIGIN
1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag
61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt
121 gtgggagccg ggcgcgccgg atgtgacgcg
//
LOCUS ADZITRR 150 bp ds-DNA VRL 29-JUN-1990
DEFINITION adenovirus 31 inverted terminal repeat & repl ori right end.
ACCESSION J01959
KEYWORDS origin of replication; terminal repeat.
SOURCE ad31 (adenovirus 31).
ORGANISM Mastadenovirus 31
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 150)
AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A.
TITLE conserved sequences at the origin of adenovirus dna replication
JOURNAL J. Virol. 44, 530-537 (1982)
STANDARD simple staff_review
BASE COUNT 40 a 28 c 47 g 35 t
ORIGIN
1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag
61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt
121 gtgggagccg ggcgcgccgg atgtgacgaa
//
LOCUS HSKGAORA 3726 bp ds-DNA VRL 29-JUN-1990
DEFINITION Herpesvirus of turkey glycoprotein A (gA) gene and ORF of unknown
function.
ACCESSION M33303
KEYWORDS glycoprotein A.
SOURCE Herpesvirus of turkey DNA.
ORGANISM Gallid herpesvirus type 2
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 3726)
AUTHORS Kato,A., Sato,I., Ihara,T., Ueda,S., Ishihama,A. and Hirai,K.
TITLE Homologies between herpesvirus of turkey and Marek's disease virus
type-1 DNAs within two co-linearly arranged open reading frames,
one encoding glycoprotein A
JOURNAL Gene 84, 399-405 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 207 1676 glycoprotein A
pept 2118 2756 ORF2
BASE COUNT 1016 a 884 c 815 g 1011 t
ORIGIN
1 tcgtgcgcat cgcccatggc tggcggaacg ctctttcgaa ccgtgaataa aactttgtat
61 ctactaaaca ataactttgt gttttattga gcggtcgaaa acaatgagga gctgcaattt
121 aaagctaacc gcatacgccg ggcgggtaaa gaccatttta taccatatta cgcatctatc
181 gaaacttgtt cgagaaccgc aagtatatgg tttccaacat gcgcgttcta cgcgtactgc
241 gcctgacggg atgggtgggc atatttctag ttctgtcttt acagcaaacc tcttgtgccg
301 gattgcccca taacgtcgat acccatcata tcctaacttt caacccttct cccatttcgg
361 ccgatggcgt tcctttgtca gaggtgccca attcgcctac gaccgaatta tctacaactg
421 tcgccaccaa gacagctgta ccgacgactg aaagcactag ttcctccgaa gcgcaccgca
481 actcttctca caaaatacct gatataatct gcgaccgaga agaagtattc gtattcctta
541 acaatacagg aagaattttg tgtgacctta tagtcgaccc cccttcagac gatgaatggt
601 ccaacttcgc tcttgacgtc acgttcaatc caatcgaata ccacgccaac gaaaagaatg
661 tagaggttgc ccgagtggcc ggtctatacg gagtaccggg gtcggattat gcatacccta
721 ggaaatcgga attaatatcc tccattcgac gggatcccca gggttctttc tggactagtc
781 ctacaccccg tggaaataaa tatttcatat ggattaataa aacaatgcac accatgggcg
841 tggaagttag aaatgtcgac tacaaagaca acggctactt tcaagtgata ctgcgtgata
901 gatttaatcg cccattggta gaaaaacata tttacatgcg tgtgtgccaa cgacccgcat
961 ccgtggatgt attggcccct ccagttctca gcggagaaaa ctacaaagca tcttgcatcg
1021 ttagacattt ttatcccccg ggatctgtct acgtatcttg gagacgtaac ggaaacattg
1081 ccacaccccg caaggaccgt gacgggagtt tttggtggtt cgaatctggc cgcggggcca
1141 cactagtatc cacaataacc ctcggaaact ctggactcga atctcctcca aaggtttcct
1201 gcttggtagc gtggaggcaa ggcgatatga taagcacatc gaatgctaca gctgtaccga
1261 cggtatatta tcacccccgt atctctctgg catttaaaga tgggtatgca atatgtacta
1321 tagaatgtgt tccctctggg attactgtga ggtggttagt tcatgatgaa ccccagccta
1381 acacaactta tgatactgtg gttacaggtc tctgcaggac catcgatcgt tatagaaatc
1441 tcgccagtcg gattccagtc caggacaact gggcgaaaac gaagtatacg tgcagactaa
1501 ttggatatcc gttcgacgtg gatagatttc aaaattccga atattatgat gcaacgccgt
1561 cggcaagagg aatgccgatg attgtaacaa ttacggccgt tctaggactg gccttgtttt
1621 taggtattgg tatcattatc acagccctat gcttttacct accggggcgg aattaagatt
1681 aaccatcgta tgtgatataa aaattattaa gtgttataac cgatcgcatt cttctgtttc
1741 gattcacaat aaataaaatg gtattgtaat cagcaccatc gcattgtttc gtagatgact
1801 catgttcagt ccgcgtgatg tcaaaaatac gtatttttgg tatcacgcag cggccaaaat
1861 gcccattatg ttatttttac tccaaacgcg gtatttaaaa catcgggacg tacatcatgt
1921 ggcgcacgtt aatcgtatac ggtgccgcta cattaaaaat cgcaagtctc cgaatatcaa
1981 gctcacggcc aaaacgtcgg taataatctt acgcatcgaa tgtgatacgg ataccgtaca
2041 atcgctgagt agatttccta tatagttact cagtagtgat acacaatcac aaaatcgctg
2101 gggtatatca tataagaatg atgtcgccca cccctgaaga tgatcgcgat ctcgttgtgg
2161 ttcgtggacg tctccgaatg atggatagcg gcacggaaac agatagagag caacgacatc
2221 cacgtacgac ttggcgatcg atctgttgtg ggtgtacgat aggaatggta tttaccatat
2281 tcgttctcgt agcggcagta ttgttgggat cactattcac tgtttcatac atggccatgg
2341 aatcgggaac atgtcccgat gaatggattg gtttgggtta tagttgcatg cgcgtggccg
2401 ggaaaaatgc aactgatctt gaggcgttgg atacatgtgc tcggcataac agcaaactta
2461 ttgacttcgc aaacgccaaa gttctggttg aagctatcgc cccattcggt gtgccaaatg
2521 cagcatatgg ggaagtcttc cggttaaggg acagcaaaac cacgtgtata cgacctacca
2581 tgggaggacc cgtgtcggca gactgtcctg taacatgtac cgttatatgt cagcgaccca
2641 ggcctctaag taccatgtct tccatcatta gagatgcccg cgtgtatctt catttagaac
2701 gacgcgatta ttatgaagtc tacgcctctg tcctctctaa tgcgatgagt aaataaaaac
2761 gcacctctaa cggttactgt gttttattta tccaatcaca ccatagacat tataaacata
2821 atatggatct ttatttcata taatgtatta tacagaccgc gcatcgcgtg taaaatatta
2881 ttcgctggac actttcagcc tacgtaaaat atgttctacc ttggacactg gtatatctcc
2941 acgcattgtt ttttcatgaa agtatttggt tgtcggagta gacaccctgc cagattccga
3001 aactggaggg tcattgctcg ccaaagaagg ccccccagac ccatcggtct tacgaattgc
3061 accagtcaga ccgtttccga tgttcgcaca ttgagaactc gaccccgccc ccgcaaattc
3121 agtcaaacca ggaaccgtat cgccacagcc acattcgaca cgatcgccta attgatgtgt
3181 gggcgtactt cccgacttgt tcgagttttt atcttgcttc cgaatagatt tcataagccg
3241 tgcattaaca tcgcttattc gctttggcag taattttatc cccagtagag ccagggcgtt
3301 cttagacagt ttaacatcat cggttgcagc gcggccgaac caagtattta tgctgcactc
3361 gagagccccc catgacacgg gcaacatgct cggaaacaaa ttcccaagct gatgttcaaa
3421 ttggtcttgc tgcaacaggg cctcttttaa tatagtattt tccgtattac cgtccaacca
3481 tatcacatat cccctctgag ttaaattaat taggtactga caatgatggt tgagaagtga
3541 aaccaacgaa accactgcta gtatggcacg tgagttccga tcaccatcaa agtccaccag
3601 gcgggtttgc gaccataata cgctggtttg tagtaagggg gcgatgacat tgcctataat
3661 tcgacaaagc aatgatgtcg aactccattt gtctgatggt aaggaggaaa tttcttccat
3721 gaattc
//
LOCUS HSKTK 2039 bp ds-DNA VRL 29-JUN-1990
DEFINITION Herpesvirus of turkey thymidine kinase gene, complete cds.
ACCESSION M26659
KEYWORDS thymidine kinase.
SOURCE Herpesvirus of turkey (strain FC126) DNA, passed in chicken embryo
fibroblasts.
ORGANISM Gallid herpesvirus type 2
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 2039)
AUTHORS Martin,S.L., Aparisio,D.I. and Bandyopadhyay,P.K.
TITLE Genetic and biochemical characterization of the thymidine kinase
gene from herpesvirus of turkeys
JOURNAL J. Virol. 63, 2847-2852 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.L.Martin, 03-AUG-1989.
FEATURES from to/span description
pept 608 1540 thymidine kinase (EC 2.7.1.21)
BASE COUNT 570 a 458 c 437 g 574 t
ORIGIN 649 bp upstream of XbaI site.
1 atcgatgaaa gtctgtacat atagccttct tttttaatgg tgtcgaacgt ataactacaa
61 gcccgcgtct cgcgacaaac actaaaacgg ggcaaatgat agtatgatca gatcccctgg
121 gggccatgcg ctcgatcaac gttttagact ccagcaactg cttagtccct gttgcgcgtt
181 gctcattctt gctggccgtc ttgaggttgc ggtaaaatct gcaagttttc agttcgatta
241 tgacacagac attgttacag ccttccacca ataacccgcc agataaggtg aacatgcaaa
301 tgcaatctgg ccttctgcgc cccaggtcta cttcaaaggc caatcgaaac gccgacgccc
361 ctttaagtag ttcagaaggc aggatcttgc ttaatatctt ctgtaatgtt ctaccaactg
421 tattccccgc cgacgactgg tcaaagtttt cagcagcaag ctttttgtaa aaacgattat
481 gaccacggac acccgctttt agcaatcctg ccataaggtg gtttcgccgc gtgcttgcct
541 cgaagacaat tgccagctaa tccagcatta ccatatttcc ttggcttgca tttggatctg
601 cgcgtcgatg gcattgccga gaagaccgcc cacgttaacg cgagtttatc tagacggacc
661 gtttggtata ggcaaaacgt ctatactaaa cgctatgccc gaccacacgc ccgatggggc
721 tcctatattg aaagtgtacg aaccaatgaa atattggaga tgccagtcta ccgatttggt
781 ggtagctgcc aacgaaacgc cagaacgtag gcgtggtgga gctttatcac gattccaatc
841 tgacatgatc atggcatcta tacaagccag atttgccgat ccatatttgc tttttcacga
901 acggttatca tctaaatgta gaggaaaaat agaaatatgc gatactccag caattatatt
961 aatgctggat aggcaccctg tggcggcgat attatgtttc ccaatcactc gctatttact
1021 tggagaatat tctttggaaa tgttgattag ctctataata agacttccgt tggaatcccc
1081 cggatgcaac ctgacagtca caatccttcc cgacgaaaag gaacacgtta ataggatttg
1141 ttcaagagat agaccgggtg aaacggcaga tagaaatatg ctcagaacac tcaatgccgt
1201 atacgcatct ttggtggaca cggttaaata cgcaaatcta acatgccctt acgagaaaga
1261 aagctgggaa atggaatggt tgggacttcc ctggtttgaa gagtcattac ttgaagaatt
1321 catctcgcgt ccccgccctg ttatttgttc gagaactcga atgccgctgg accgaactct
1381 cctggccatt tttaaacgga aagagctgtg tagcgaaaat ggggagctgt taactcagta
1441 ttcttggata ttgtggggat tactgactaa actacacacc attaatgtcg aattatttga
1501 cattagcggt atgtcacgtc gagaatgcgc cacgctataa tgcatactat gccggagaga
1561 ttgtctactc tcgctagctg gaatgattta tgcgagcttg aagatgatgt aatttcctat
1621 aataagggaa tgtgtaacga ggttggagcg tctcgataat tcttcttaat ctgctggtat
1681 tggttactgc cataacttaa tattggtcca tgctagaata gtcatacgct acgatctgtt
1741 gctatatatg actatcgcca aactgttaaa ccgcgaagaa tatatttcat ataaacctaa
1801 gggcccctca gtctgatttt ttgtgaaaac gtgtatacat gaagttttac tgcataatcc
1861 gtttcatgat catagcgaat ctttattcat cttaccaaat atcgcttcca ggcacatatc
1921 catcgcaaat attgcttgac atgaagaact cgccgctcgt acgctttaat atatcgacgc
1981 gtgattataa agacgagaca ctctggatac ggaaaaattc gacatttgtt tatatcgat
//
LOCUS LACHSP 3095 bp ds-DNA BCT 29-JUN-1990
DEFINITION Plasmid pTR2030 (from L.lactis) aborts phage infection (hsp) gene,
complete cds.
ACCESSION M30192
KEYWORDS .
SOURCE Plasmid pTR2030 (from Lactococcus lactis lactis (strain ME2)) DNA.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 3095)
AUTHORS Hill,C., Miller,L.A. and Klaenhammer,T.R.
TITLE Sequence of HSP gene from pTR2030
JOURNAL Unpublished (1989) North Carolina State Univ., Raleigh, NC 27695
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Hill, 29-NOV-1989.
FEATURES from to/span description
pept 1105 2991 hsp gene (aborts phage infection)
BASE COUNT 1129 a 439 c 518 g 1009 t
ORIGIN
1 ataagagtca gaatgtgacc gtcataatga ttacgaggat agctgcaacg cttggtttcg
61 agttaattgt cgtgcaacga ttgacggaga attgaaaaat ttcaaagttc acgatgttga
121 gacttacgat aaaaagaaaa atagatttca tcggagattg tctgatgcct tggtcccaat
181 aatatcttca gaagatgttg aatttgaagc cgagcaattt ttgaaattgt atttcccagt
241 agctatggaa attcctcagc gtattgaccc gcttttgata gctgaaaaaa tgggacttac
301 ggttgagtac catgaaattt cagaagatgg gaacattttt ggtcaaattt attttcatga
361 tgctcttctt gacgggaaag aaatcaaggc aaaaactatc ctgattgatc ctcgtgtcat
421 cgaaagccgt gggattggtg gacttaataa cacaataatg cacgagtgtg tccattggca
481 taaacatagg cttgcatttg aattggttag gcttttccaa ccagagcttt cgaatattac
541 cacaactaaa gaagagttcg atgggctcat tgagaaaaat atgactccaa cagattggct
601 ggagattcaa gctagaaaga ttacgcctaa aatcttgatg cctaagaaga tgtttaaaca
661 agaagttgaa acgtttatgc gtcctgatgg tggaagtgga atagtggacc agctcctcat
721 aatcgaggga actatttcag agcttgctag ttttttcacc gtatcaaaat tatctgcaaa
781 aattcggatg gttgagctgg gatatgagat tgcgataggt gcttcaaact atgttgatgg
841 gcatcctgta cccccacact cttggaagca gggagctgta tctgctaatc agacttattc
901 aattggtttt gttgatgcaa ctatagagac attaaagaac ccaagattac tcgtcgcaat
961 aaaaaaagaa cttaacctca tatttcatag ggattgaagt tcttgttttt tatttttggt
1021 agctaaaatg gtatccaatt aataattagc tattttattt tggtaaaata gaataaatag
1081 atttttagaa cagggagtag gtaaatgata actttgcaac accaagattg ggaacgtgca
1141 gtcaatatga ttaaaaacat tccaccaagt gccaaaaata aatactttca aacttttcct
1201 ttctttttgt taagtgaaac cagctgggaa gagttacttt ctgaaaattt tttctactca
1261 tatataaagt caggagagtt tctgacttat caagaaaacc tttcattcta cgataggaca
1321 attcaaaaga gccatggagc ataccgacaa actcgaattg tatctccaat catctatata
1381 tttttaatag caattgcttc tcaagtggaa agaatatatg ttgaaaagcg tactaatgat
1441 atgtcagtat atttttcagg ttcttttgaa aaagaaaaaa acactgcaca ttataaacaa
1501 agttataaca cttatatgac tgaacttaat gcctgtcaag aagagtttga ttattatttt
1561 caaacagatt tttcaacatt ttttcatcta gtagatacag acaacttatt caataaaata
1621 gaccgattag atccgaaaag tgcattagtc tactcttccc taataaaaat gattggacaa
1681 ggaagaatgc ccatagttga tggaaattca ggtctttctt ttttaaatac tgtagtttat
1741 ctagatgact ttgataaaga aattattgat tccttaaaaa cgatagttga aatcgaaagt
1801 tttaaactag ttcgatatgt agatgactta catattttca taaaatgtgc aaataaagat
1861 ttagattttt taaattataa agtatataat ttgttatgcg aaaaagcaac gaaacatcat
1921 cttgagataa attcttcaaa aacaaaatca tttactccca caagtgaact aagcactaaa
1981 atgaatacag atttgtataa tttctttgtg tataacgaag atgttgattt tgagcaatat
2041 ttttcaaaaa ataccttaat agaatttcta gataaattaa ataatatgtc tgttaatgct
2101 gatttttcag agtacgaaaa agaagtcctg tatacattgg aaaaccctga aatagtatct
2161 gacggaagtt atatattaaa cgctattgtt tacaataaaa gtacgtggtc acaggattat
2221 gatataaaaa ataaaattag tttattagta aatagtaatt atagaaaact aagatattct
2281 gcaaaagcac tcataactct tgtcttgaat acaagagatg gtgatatcat aaaaggacta
2341 cttaataatt tatttactac attcaaaaat ggtacaaatg atattattga tgaaattata
2401 ttaattgaat acttggtaca aagaaagttt aatcataaag atttaatgac tatcttgaaa
2461 gctgacgacc atggtattaa ggaatatatt aaggcatatc aaacttctga tttcataaaa
2521 agtctagaaa aaaataaagt tattttttat actaatcaga aagaagttta tcctttaata
2581 agtaaggata aaatactcaa tttcatatat tttagagcta agtattttga atccttagac
2641 ctagttcttg aatcctttgc atattataaa aactattttg ataggtttgt tgctcacgct
2701 atgttctgta caggaattga ctctggaaga aaacctaatt ataagctata ttatacagaa
2761 ggaaagttaa tagatgggtt aaaacaacta aattttttat cttctgatga gattacaaaa
2821 ataattaatg aagcacataa aattagaaat agtaaccccg taagtcactc tagtgctggc
2881 ttacttcaaa acgaagattt tagtagatat agagttaagt caagtcttaa tgacttgaaa
2941 attatcatag aacaactttc aacattgcta caaaataaaa acagattata aaatattatt
3001 atctgtttta tttttttcaa aagaatcaaa gtttattaga tttttcaact gtcctgttat
3061 ctcctcatcc gcatttggat aaagatgact ataaa
//
LOCUS PH15RER 2016 bp ds-DNA PHG 29-JUN-1990
DEFINITION Bacteriophage phi-15 right early region containing genes 17,
16.5-16.9, complete cds.
ACCESSION M28830
KEYWORDS DNA replication protein.
SOURCE Bacteriophage phi-15 DNA.
ORGANISM Bacteriophage phi-15
Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE 1 (bases 1 to 2016)
AUTHORS Benes,V., Arnold,L., Smrt,J. and Paces,V.
TITLE Nucleotide sequence of the right early region of Bacillus phage
phi-15 and comparison with related phages: Reorganization of gene
17 during evolution
JOURNAL Gene 75, 341-347 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 203 664 gene 17 peptide
pept 666 992 gene 16.9 peptide (put.)
pept 989 1306 gene 16.8 peptide (put.)
pept 1308 1700 gene 16.7 peptide (put.)
pept 1697 1861 gene 16.6 peptide (put.)
pept 1858 1971 gene 16.5 peptide (put.)
BASE COUNT 744 a 374 c 382 g 516 t
ORIGIN
1 aaagtagggt acagcgacaa cataccccat ttccccattg accgactatc ttcgacaaca
61 atctaacaac taaaacacaa ctatatacct atactaaata acaacatcaa tttgtcgaaa
121 agggtagaca aactactgtt taacatggta tactataata gaagtaagga agtaagacaa
181 ccaatcatag gaggaattac acatgaacaa ctaccaatta actattaacg aggtaatcga
241 gatcatcaac cgcaatacag acattaataa gttagtcgct aaaaaggaca acttatatcc
301 aacagatttg tatgacttag acaaacaaca attaatcgct atcattctca acagcgattt
361 cgcattatct agcatcaaaa gagcattatt agaggtaact gtcgaagagc taagagaaca
421 ggacaatgat gacgatttag atgaaataga taacgagtta tacgagggcg cagaggcggc
481 tgatgttcca cgtgaaacaa tagtgaaagt attcgaggca gacaaatcaa tagtaacgtt
541 caacggtgag aaactaaaac attatgttaa tgttgatgac aatagctcat ctgttgatga
601 agtaaagaag atcgccaagg aaatatcaga acatgacttt aacgaaaaca aggagaacaa
661 gtaacatgag cgtacaactt aatgcattta cattcatctt agaacgccgt ggatggcgca
721 tggtatgcta cgaacaatta acaacgaacg ggacacgcat attacatttc tatctcaaag
781 ataacccaac gttctttgct acatactcat cacaattctt atcagacacc aaaatgataa
841 gacgctttgc atcatggagc gggcaattac ttgaaggttc aaactctgtg ttctggacta
901 acataacacc atttgagcca attgatgaag aaacagcaga agacatcaag aatctcgaca
961 aagtcgttga aggaatgaat tttacgttat gattgacatt attgtaaaag aggacaagcg
1021 cttaataact gttcaaacac cagagggaga cgaagtattt tacactttgt ctttctcaga
1081 cgagcataag ttactaaaac gatcaagtgc cagactaaga aacaacatat atgccattgg
1141 tgtagcaaat atcagatggg tgttagttga catggataac atgatactta gtgagtatat
1201 gcatcacgtt gacatcctaa aagacataga caggaaaatg agacaattag ggtatatagt
1261 catttcagaa tggcaacacg caaataaaaa agggacgagg agataatatg gaagctattt
1321 tgatgatcgg tgtaatcaca ttgtgcgtta tattcctttt atcaggacgc aataataaaa
1381 agaaacaaga aataagggag ctagaagatt accttgaaga cctcaatcaa agaatcgtgc
1441 aacgaacaca gatacttagt gagcttaacg aagtcatcac aaacagaagt gttgataagt
1501 cagtcaatat gtcagcttgt gaaatcgccg tccttgacct ctatgagcag tcaaatatcc
1561 gcattcctag tgatatcatc gaagatatgg ttaatcaacg tttacaatca gaacaagatg
1621 tgttaaacta tatagagaca cagcggacat actggaaact ggagaatcag aaaaaactat
1681 atcgggggtc attgaaatga agttgctcac tcacacgtgt cattactgta gcttctcttt
1741 cttcactcga aagtttgatg tatttggtgc aataaccaag aaagatactc ctgttgtttt
1801 ctgccccgca tgtgggaatc aatccctttc agtatcacac attgaggagg agatcatatg
1861 aatcagaaag aattccaagc cgttttagac tggatgttgt cacacactat tatacaattc
1921 cacgaataca actatatgct acaaaagagc ctaccgttcc tcagacgata ggcttttctt
1981 ttgattgttt cacgtggaac atcactgtat acgcat
//
LOCUS STLENV 4122 bp ss-RNA VRL 29-JUN-1990
DEFINITION Simian T-cell leukemia virus, pol-env-pX-3' LTR region.
ACCESSION M11373 M11374
KEYWORDS env gene; pol gene.
SOURCE Simian T-cell leukaemia virus genomic RNA.
ORGANISM Simian T-cell lymphotropic virus type I
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 4122)
AUTHORS Watanabe,T., Seiki,M., Tsujimoto,H., Miyoshi,I., Hayami,M. and
Yoshida,M.
TITLE Sequence homology of the simian retrovirus genome with human T-cell
leukemia virus type I
JOURNAL Virology 144, 59-65 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept 311 1777 env protein
BASE COUNT 905 a 1456 c 741 g 1020 t
ORIGIN
1 catttaaacg tattaaccca ttgccacaaa acccgatggc aacttcacca ctccctccgt
61 ctgcaaccta tcccagagac acactccccc atcagtaaac aaacccattg gtattatttc
121 aagcttcctg gtcttaatag ccgccagtgg aaaggaccac aggaagctct ccaagaagcc
181 gccggcgctg ctctcattcc ggtaagcgct aattctgccc agtggatccc gtggaggctc
241 ctcaagcaag ctgcatgccc aagacccgtc gggggccccg ccgatcccaa agaaaaagac
301 caccaacacc atgggtaagt ttcttaccac cttactttta ttcctccaac tccgcccccc
361 cattgtctgt gatcacagtc ccagctgctg cactctcaca gtcggagtct cctcatatca
421 ttctaaaccc tgcaatcccg cccaaccagt ctgctcatgg accctcgacc tgctagctct
481 ttcagcagac caagccctac aacccccctg ccctaatctg gtgggttact ccaactacca
541 tgctacctat tccctatatc tattccctca ttggattaaa aagccaaacc gaaatggcgg
601 aggctattat tcggcctctt attcagaccc ttgttcccta aagtgccctt acctaggatg
661 ccaatcatgg acctgtccct acacaggagc cgtctccagc ccctattgga aatatcagca
721 agatgtcaat ttcactcaag aagtctcacg cctcagcctt aatctccact tttcaaaatg
781 cggtttctcc ttctcccttc tggtcgatgc actaggatac gaccccatct ggttccttaa
841 caccgaacct agccaactgc cccccactgc ccttcctcta ctcccccatt ccaacttgga
901 ccacatcctc gagccttcca taccatggaa gtcaaaactc ctaactcttg tccaattaac
961 tctacaaagc actaactaca cttgtattgt ctgcatagat cgtgccagct tatctacttg
1021 gcacgtctta tactctccca acatctctgt tccatcctcc ccctctactc ccctccttta
1081 cccatcgtta gcgcttccag ccccccatct gacattacca ttcaactgga ctcactactt
1141 tgaccctcag attcaagcca tagtctcctc cccctgtcac aactccctca tcttgccccc
1201 cttttctctg tcacctgttc ccgccccagg gtcccgttcc cgccgggcag taccggtggc
1261 ggtctggctc gtctctgccc tggccatggg aaccggagtt gctggcggga ttaccggctc
1321 catgtccctc gcctcaggaa aaagtctcct acatgaagta gacaaagata tttcccaatt
1381 gacccaagca attgtcaaaa accacaaaaa tctactcaag attgcacaat atgctgccca
1441 aaacagacga ggccttgatc tcctgttttg ggagcaagga ggactatgta aagcactgca
1501 agaacaatgt tgtttcctaa atattactaa ttcccatgtc tcaatacttc aagaaagacc
1561 cccccttgaa aatcgagttc taactggttg gggcctcaac tgggaccttg gcctctcgca
1621 atgggcccga gaggccttac aaactggaat cacccttgtc gcattactcc ttctggtcat
1681 ccttgtaggg ccatgcatcc ttcgccaact acgacaactc ccctcgcgtg tcagataccc
1741 ccactactct cttataaacc ccgagtcctc cctataaacc tagcagttac tgcgaccaca
1801 ttgcttgcag cctcccctgc caatagttaa cctcccccat caaatcctcc ttttcctgca
1861 acaacctctt ccgttcggcc tccaaggagt ctacctcact ctccagctgt ctagcataac
1921 catcaactcc caactccttt attttttctt tcctagcatc acgctgtctc gccctcttag
1981 ccccttgttc tgccagtctc cgcttgcgct caagccgctt ctcctcctcc cgcctcctgc
2041 gccgagcctt cttctcttcc ttccttttca ggaactcggc aatctgttcc tcctcctctt
2101 tctcccgctc tctttttcgc ttcctcttct cctcagcccg tcgctgccga tcacgatgcg
2161 tccccccgcg aggtggcgtc tcctccctta gagggacccg ttgctgctgg ccattacatt
2221 cctcttcgag ggatagtaaa ccgtcaagca cagactcctc ctccgtcctt tgtttgttta
2281 agtcctcttc tagggataat agtccgtcca ccaagtcttc caccagcagg tcctccgggc
2341 agggaacagg taaacatcga aacggcccta cacatacaaa attaatcata cttattatca
2401 gcccacttcc caggttttgg acagagcctt ctttatggat acccagtcta cgtatttgga
2461 aactgtgtgc aaggcgactg gtgccccatc tctgggggac tatgctcggc ccgcctgcac
2521 cgtcacgccc tactggccac ctgtccagag catcagatca cctgggaccc caccgatgaa
2581 cgcgttatcg gctcagctct gcagttcctt atccctcgac tcccctccct ccccacccaa
2641 agaacctcta agaccctcaa ggtccttacc ccgccggcta ctcatacaac ccccaacatt
2701 ccaccctcct tcttccaggc cgtacgaaaa tactccccct tccgaaacgg atacatggag
2761 cccacccttg ggcaacaact cccaaccctg tcctttcccg accccggcct ccggccccaa
2821 aacttgtata ccctctgggg aaactccgtt gtctgcatgt acctctacca actttccccc
2881 cccatcacct ggcccctcct gccccatgta attttctgcc acccaggcca acttggggcc
2941 tttctcacca acgttcccta caaacgaatg gaagaacttc tctataaaat tttccttaat
3001 acaggggcca caatcattct acccgaggac tgcttgccca ccaccctatt ccgacccact
3061 agggcacccg cctcgctgac agccaggcaa aatggcctcc ttccgttcca gtcaaccctt
3121 actacgccag gccttatttg gacatttacc gatggcacgc ccatggtttc cgggccctgc
3181 cctagggacg gtcagccatc tttggtgttg cagtcctcct cctttatatt tcacaaattt
3241 caaaccaagg cctaccatcc ctctttccta ctctcacatg gcctcataca gtactcgtcc
3301 tttcataact tacatctcct ctttgaagag tacaccaata ttcccatttc tttacttttt
3361 aacaaagaag aggcaaatga tactgaccat gagccccaaa tgctccccgg gggcctaaaa
3421 cctcccaatg gaaaacattt ccgcgaaaca gatgtctgaa aaaggtcagg gcccagacta
3481 gggctctgac gtctcccccc ggagggcagc tcaacacaag gccctgacgt gtccccctgg
3541 ggacaaatca tgagcccaga cctccgggaa gccaccgaga accacccatt tcctccccat
3601 gtttgtcaag ccgtccccag gcgttgacga caacccctcg cctcaaaaac ttttcatggc
3661 acgcatacag ctcaataaaa caacaggagt ctataaaagc gtggagacag ttcaggaggg
3721 ggctagcttc tgctctacac gcgcccgccg ccctacctga ggccgccatc cacaccggtt
3781 gagttgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgaccg ccgtctgggt
3841 aagttcggag ctcgggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag
3901 attcggccgg ctctccacgc tttgcctgac cctgcttgct taactccaca tctttgtttc
3961 gttttctgtt ccgcgccgct gcagatcgaa agttccaccc ctttcccttt cattcacgtc
4021 tgactgccgg cttggcccac ggccaagcac cggcatcctc actggctcgg agccagtagc
4081 agcccatttt acaactctct ccagagagaa gtttagtaca ca
//
LOCUS FIVCG 9474 bp ss-RNA VRL 29-JUN-1990
DEFINITION Feline immunodeficiency virus, Petaluma strain, complete
genome.
ACCESSION M25381 M25729
KEYWORDS complete genome; env gene; gag gene; pol gene.
SOURCE Feline immunodeficiency virus, Petaluma strain, clones FIV-14 [1],
[3], and 34TF10 [2].
REFERENCE 1 (bases 1 to 355)
AUTHORS Olmstead,R.A., Barnes,A.K., Yamamoto,J.K., Hirsch,V.M.,
Purcell,R.H. and Johnson,P.R.
TITLE Molecular cloning of feline immunodeficiency virus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2448-2452 (1989)
STANDARD full automatic
REFERENCE 2 (bases 1 to 9472)
AUTHORS Talbott,R.L., Sparger,E.E., Lovelace,K.M., Fitch,W.M.,
Pedersen,N.C., Luciw,P.A. and Elder,J.H.
TITLE Nucleotide sequence and genomic organization of feline immuno-
deficiency virus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 5743-5747 (1989)
STANDARD full automatic
REFERENCE 3 (bases 1 to 9474)
AUTHORS Olmstead,R.A., Hirsch,V.M., Purcell,R.H. and Johnson,P.R.
TITLE Nucleotide sequence analysis of feline immunodeficiency virus:
genome organization and relationship to other lentiviruses
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8088-8096 (1989)
STANDARD full automatic
REFERENCE 4 (bases 7175 to 7182; revision)
AUTHORS Elder,J.H. and Scripps Institute LaJolla,CA.
JOURNAL Unpublished (1990)
STANDARD full automatic
COMMENT [2] sequence kindly provided in computer-readable form by John
Elder, Scripps Institute, La Jolla, CA. [3] sequence kindly
provided in computer-readable form by Robert Olmstead, NIAID, NIH,
Bethesda, MD. [1] contains the LTR sequence from the complete
genome in [3].
The sequence from [3] is shown (with differences from [2]); the
substantive difference from [2] is at position 6123 where the
latter possesses an early stop codon in orf2, which is thought to
encode the transactivator protein. Clone 34TF10 is, nevertheless,
biologically active as described in [2]. A difference earlier
reported at 7175-7182 turns out to have been a typo [2,4]. Orf1 is
weakly homologous (PLSEARCH) with the Q orf of visna and has no
detectable similarity to the vif protein of HIV and SIV. This is
consistent with the clustering of FIV with the ungulate
lentiviruses [2],[3]. Several "urfs" are identified by [2] and [3]
of which only orf3, possibly analogous to the L orf of visna, is
annotated below; a small, possibly associated, orf is found at
8955-9170. For a fuller discussion of the reading frames, see [2]
and [3].
FEATURES from to/span description
pept 628 1980 gag polyprotein
pept / 1869 5243 pol polyprotein (NH2 terminus uncertain) (AA
at 1869)
pept 5236 5991 orf1
pept 5992 6228 orf2
pept 6266 8836 env polyprotein
pept 6712 6918 orf3
LTR 1 355 5' LTR
LTR 9120 9474 3' LTR
rpt 217 281 R repeat 5' copy
rpt 9336 9400 R repeat 3' copy
binding 357 377 primer binding site
variant 12 12 t in [3]; c in [2]
variant 122 122 c in [3]; t in [2]
variant 146 146 g in [3]; t in [2]
variant 373 374 ga in [3]; a in [2]
variant 498 501 ggac in [3]; agca in [2]
variant 1575 1575 g in [3]; a in [2]
variant 1840 1840 g in [3]; c in [2]
variant 1977 1977 a in [3]; g in [2]
variant 2028 2028 g in [3]; a in [2]
variant 2033 2033 c in [3]; t in [2]
variant 2288 2288 a in [3]; g in [2]
variant 2464 2464 g in [3]; a in [2]
variant 2490 2490 c in [3]; t in [2]
variant 2580 2580 c in [3]; t in [2]
variant 2631 2631 t in [3]; c in [2]
variant 2735 2735 a in [3]; g in [2]
variant 3733 3733 a in [3]; g in [2]
variant 3743 3743 a in [3]; t in [2]
variant 3812 3812 g in [3]; a in [2]
variant 4202 4202 g in [3]; a in [2]
variant 4211 4211 c in [3]; t in [2]
variant 4271 4271 c in [3]; t in [2]
variant 4493 4493 a in [3]; g in [2]
variant 4540 4540 g in [3]; t in [2]
variant 4706 4706 c in [3]; t in [2]
variant 4891 4891 g in [3]; a in [2]
variant 5332 5332 g in [3]; a in [2]
variant 5497 5497 t in [3]; c in [2]
variant 5559 5559 t in [3]; c in [2]
variant 5986 5986 t in [3]; c in [2]
variant 6000 6000 c in [3]; a in [2]
variant 6123 6123 g in [3]; a in [2]
variant 6149 6149 t in [3]; c in [2]
variant 6179 6179 a in [3]; g in [2]
variant 6238 6238 g in [3]; t in [2]
variant 6245 6248 attt in [3]; ttt in [2]
variant 6306 6306 t in [3]; c in [2]
variant 6430 6430 c in [3]; t in [2]
variant 6548 6548 a in [3]; c in [2]
variant 6739 6739 c in [3]; t in [2]
variant 6753 6753 t in [3]; c in [2]
variant 6758 6758 a in [3]; g in [2]
variant 6764 6764 a in [3]; g in [2]
variant 6779 6779 a in [3]; g in [2]
variant 6782 6783 aa in [3]; gg in [2]
variant 7175 7182 acaggagg in [3]; caggagga in [2]
variant 7381 7381 a in [3]; g in [2]
variant 7484 7484 g in [3]; a in [2]
variant 7512 7513 ag in [3]; ca in [2]
variant 7618 7618 a in [3]; c in [2]
variant 7625 7625 g in [3]; a in [2]
variant 7670 7670 a in [3]; g in [2]
variant 7704 7708 aaaaa in [3]; caaat in [2]
variant 7750 7750 t in [3]; c in [2]
variant 7808 7810 atg in [3]; gta in [2]
variant 7824 7828 aaaag in [3]; caaaa in [2]
variant 7996 7996 g in [3]; a in [2]
variant 8054 8054 g in [3]; c in [2]
variant 8129 8129 g in [3]; a in [2]
variant 8222 8222 g in [3]; a in [2]
variant 8238 8238 a in [3]; g in [2]
variant 8329 8329 g in [3]; a in [2]
variant 8377 8377 t in [3]; c in [2]
variant 8394 8394 c in [3]; t in [2]
variant 8548 8550 gaa in [3]; aac in [2]
variant 8588 8588 g in [3]; a in [2]
variant 9131 9131 t in [3]; c in [2]
variant 9241 9241 c in [3]; t in [2]
variant 9265 9265 g in [3]; t in [2]
BASE COUNT 3571 a 1370 c 2119 g 2414 t
ORIGIN 5' terminus of 5' LTR
1 tgggatgagt attggaaccc tgaagaaata gaaagaatgc ttatggacta gggactgttt
61 acgaacaaat gataaaagga aatagctgag catgactcat agttaaagcg ctagcagctg
121 cctaaccgca aaaccacatc ctatggaaag cttgctaatg acgtataagt tgttccattg
181 taagagtata taaccagtgc tttgtgaaac ttcgaggagt ctctttgttg aggacttttg
241 agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta
301 tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg
361 cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc
421 tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg
481 agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata
541 aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca
601 acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg
661 gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga
721 gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt
781 gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga
841 gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg
901 gtagcaggac ttttaaatat gacggtgtct actgctgctg cagctgaaaa tatgtattct
961 caaatgggat tagacactag gccatctatg aaagaagcag gtggaaaaga ggaaggccct
1021 ccacaggcat atcctattca aacagtaaat ggagtaccac aatatgtagc acttgaccca
1081 aaaatggtgt ccatttttat ggaaaaggca agagaaggac taggaggtga ggaagttcaa
1141 ctatggttta ctgccttctc tgcaaattta acacctactg acatggccac attaataatg
1201 gccgcaccag ggtgcgctgc agataaagaa atattggatg aaagcttaaa gcaactgaca
1261 gcagaatatg atcgcacaca tccccctgat gctcccagac cattacccta ttttactgca
1321 gcagaaatta tgggtatagg attaactcaa gaacaacaag cagaagcaag atttgcacca
1381 gctaggatgc agtgtagagc atggtatctc gaggcattag gaaaattggc tgccataaaa
1441 gctaagtctc ctcgagctgt gcagttaaga caaggagcta aggaagatta ttcatccttt
1501 atagacagat tgtttgccca aatagatcaa gaacaaaata cagctgaagt taagttatat
1561 ttaaaacagt cattgagcat agctaatgct aatgcagact gtaaaaaggc aatgagccac
1621 cttaagccag aaagtaccct agaagaaaag ttgagagctt gtcaagaaat aggctcacca
1681 ggatataaaa tgcaactctt ggcagaagct cttacaaaag ttcaagtagt gcaatcaaaa
1741 ggatcaggac cagtgtgttt taattgtaaa aaaccaggac atctagcaag acaatgtaga
1801 gaagtgaaaa aatgtaataa atgtggaaaa cctggtcatg tagctgccaa atgttggcaa
1861 ggaaatagaa agaattcggg aaactggaag gcggggcgag ctgcagcccc agtgaatcaa
1921 atgcagcaag cagtaatgcc atctgcacct ccaatggagg agaaactatt ggatttataa
1981 attataataa agtaggtact actacaacat tagaaaagag gccagaaata ctcatatttg
2041 taaatggata tcctataaaa tttttattag acacaggagc agatataaca attttaaata
2101 ggagagattt tcaagtaaaa aattctatag aaaatggaag gcaaaatatg attggagtag
2161 gaggaggaaa gagaggaaca aattatatta atgtacattt agagattaga gatgaaaatt
2221 ataagacaca atgtatattt ggtaatgttt gtgtcttaga agataactca ttaatacaac
2281 cattattagg gagagataat atgattaaat tcaatattag gttagtaatg gctcaaattt
2341 ctgataagat tccagtagta aaagtaaaaa tgaaggatcc taataaagga cctcaaataa
2401 aacaatggcc attaacaaat gaaaaaattg aagccttaac agaaatagta gaaagactag
2461 aaagagaagg gaaagtaaaa agagcagatc caaataatcc atggaataca ccagtatttg
2521 ctataaaaaa gaaaagtgga aaatggagaa tgctcataga ttttagagaa ttaaacaaac
2581 taactgagaa aggagcagag gtccagttgg gactacctca tcctgctggt ttacaaataa
2641 aaaaacaagt aacagtatta gatatagggg atgcatattt caccattcct cttgatccag
2701 attatgctcc ttatacagca tttactttac ctagaaaaaa taatgcggga ccaggaagga
2761 gatttgtgtg gtgtagtcta ccacaaggct ggattttaag tccattgata tatcaaagta
2821 cattagataa tataatacaa ccttttatta gacaaaatcc tcaattagat atttaccaat
2881 atatggatga catttatata ggatcaaatt taagtaaaaa ggagcataaa gaaaaggtag
2941 aagaattaag aaaattacta ttatggtggg gatttgaaac tccagaagat aaattacagg
3001 aagaaccccc atatacatgg atgggttatg aattacatcc attaacatgg acaatacaac
3061 agaaacagtt agacattcca gaacagccca ctctaaatga gttgcaaaaa ttagcaggaa
3121 aaattaattg ggctagccaa gctattccag acttgagtat aaaagcatta actaacatga
3181 tgagaggaaa tcaaaaccta aattcaacaa gacaatggac taaagaagct cgactggaag
3241 tacaaaaggc aaaaaaggct atagaagaac aagtacaact aggatactat gaccccagta
3301 aggagttata tgctaaatta agtttggtgg gaccacatca aataagttat caagtatatc
3361 agaaggatcc agaaaagata ctatggtatg gaaaaatgag tagacaaaag aaaaaggcag
3421 aaaatacatg tgatatagcc ttaagagcat gctataagat aagagaagag tctattataa
3481 gaataggaaa agaaccaaga tatgaaatac ctacttctag agaagcctgg gaatcaaatt
3541 taattaattc accatatctt aaggccccac ctcctgaggt agaatatatc catgctgctt
3601 tgaatataaa gagagcgtta agtatgataa aagatgctcc aataccagga gcagaaacat
3661 ggtatataga tggaggtaga aagctaggaa aagcagcaaa agcagcctat tggacagata
3721 caggaaagtg gcaagtgatg gaattagaag gcagtaatca gaaggcagaa atacaagcat
3781 tattattggc attaaaagca ggatcagagg agatgaatat tataacagat tcacaatatg
3841 ttataaatat tattcttcaa caaccagata tgatggaggg aatctggcaa gaagttttag
3901 aagaattgga gaagaaaaca gcaatattta tagattgggt cccaggacat aaaggtattc
3961 caggaaatga ggaagtagat aagctttgtc aaacaatgat gataatagaa ggggatggga
4021 tattagataa aaggtcagaa gatgcaggat atgatttatt agctgcaaaa gaaatacatt
4081 tattgccagg agaggtaaaa gtaataccaa caggggtaaa gctaatgttg cctaaaggat
4141 attggggatt aataatagga aaaagctcga tagggagtaa aggattggat gtattaggag
4201 gggtaataga cgaaggatat cgaggtgaaa ttggagtaat aatgattaat gtatcaagaa
4261 aatcaatcac cttaatggaa cgacaaaaga tagcacaatt aataatattg ccttgtaaac
4321 atgaagtatt agaacaagga aaagtagtaa tggattcaga gagaggagac aatggttatg
4381 ggtcaacagg agtattctcc tcttgggttg acagaattga ggaagcagaa ataaatcatg
4441 aaaaatttca ctcagatcca cagtacttaa ggactgaatt taatttacct aaaatggtag
4501 cagaagagat aagacgaaaa tgcccagtat gcagaatcag aggagaacaa gtgggaggac
4561 aattgaaaat agggcctggt atctggcaaa tggattgcac acactttgat ggcaaaataa
4621 ttcttgtggg tatacatgtg gaatcaggat atatatgggc acaaataatt tctcaagaaa
4681 ctgctgactg tacagttaaa gctgtcttac aattgttgag tgctcataat gttactgaat
4741 tacaaacaga taatggacca aattttaaaa atcaaaagat ggaaggagta ctcaattaca
4801 tgggtgtgaa acataagttt ggtatcccag ggaacccaca gtcacaagca ttagttgaaa
4861 atgtaaatca tacattaaaa gtttggattc ggaaattttt gcctgaaaca acctccttgg
4921 ataatgcctt atctctcgct gtacatagtc tcaattttaa aagaagaggt aggataggag
4981 ggatggcccc ttatgaatta ttagcacaac aagaatcctt aagaatacaa gattattttt
5041 ctgcaatacc acaaaaattg caagcacagt ggatttatta taaagatcaa aaagataaga
5101 aatggaaagg accaatgaga gtagaatact ggggacaggg atcagtatta ttaaaggatg
5161 aagagaaggg atattttctt atacctagga gacacataag gagagttcca gaaccctgcg
5221 ctcttcctga aggggatgag tgaagaagat tggcaggtaa gtagaagact ctttgcagtg
5281 ctccaaggag gagtaaatag cgctatgcta tacatatcta ggctacctcc ggatgaaaga
5341 gaaaagtata aaaaagactt caagaaaaga ctttttgaca cagaaacagg atttataaag
5401 agactacgga aagctgaagg aataaaatgg agctttcata ctagagatta ttacatagga
5461 tatgtcagag aaatggtggc aggatccact acatcattaa gtctaaggat gtatatatat
5521 ataagtaacc cactatggca ttctcagtat cgtccaggtt tgaaaaattt caataaggaa
5581 tggccttttg taaatatgtg gataaaaaca ggatttatgt gggatgatat tgaaaaacaa
5641 aatatttgta taggaggaga agtttcacca ggatggggac cagggatggt aggtatagca
5701 ataaaagctt ttagttgtgg cgaaagaaag attgaggcta ctcctgtaat gattataaga
5761 ggagaaatag atccaaaaaa atggtgcgga gattgttgga atttaatgtg tcttagaaac
5821 tcacctccaa agactttaca aagactcgct atgttggcgt gtggcgtgcc ggctaagaag
5881 tggcgaggat gctgtaatca acgctttgtt tctccttaca gaacgcctgc tgatttagag
5941 gtcattcaat ccaagcccag ctggaacctg ttatggtcgg gagaattatg aatggaagac
6001 ataatagtat tattcaatag ggtcactgag aaactagaaa aagaattagc tatcagaata
6061 tttgtattag cacatcaatt agaaagggac aaagctatta gattactaca aggattattt
6121 tggagatata gatttaagaa accccgagta gattattgtt tatgttggtg gtgttgcaaa
6181 ttctattatt ggcagttgca atctacatta tcaataacta ctgcttagaa atatttagat
6241 taatatttca tttgcaacaa taagaatggc agaaggattt gcagccaata gacaatggat
6301 aggactagaa gaagctgaag agttattaga ttttgatata gcaacacaaa tgagtgaaga
6361 aggaccacta aatccaggag taaacccatt tagggtacct ggaataacag aaaaagaaaa
6421 gcaaaactac tgtaacatat tacaacctaa gttacaagat ctaaggaacg aaattcaaga
6481 ggtaaaactg gaagaaggaa atgcaggtaa gtttagaaga gcaagatttt taaggtattc
6541 tgatgaaagt gtattgtccc tggttcatgc gttcatagga tattgtatat atttaggtaa
6601 tcgaaataag ttaggatctt taagacatga cattgatata gaagcacccc aagaagagtg
6661 ttataataat agagagaagg gtacaactga caatataaaa tatggtagac gatgttgcct
6721 aggaacggtg actttgtacc tgattttatt tataggaata ataatatatt cacagacaac
6781 caacgctcag gtagtatgga gacttccacc attagtagtc ccagtagaag aatcagaaat
6841 aattttttgg gattgttggg caccagaaga acccgcctgt caggactttc ttggggcaat
6901 gatacatcta aaagctaaga caaatataag tatacgagag ggacctacct tggggaattg
6961 ggctagagaa atatgggcaa cattattcaa aaaggctact agacaatgta gaagaggcag
7021 aatatggaaa agatggaatg agactataac aggaccatca ggatgtgcta ataacacatg
7081 ttataatgtt tcagtaatag tacctgatta tcagtgttat ttagatagag tagatacttg
7141 gttacaaggg aaaataaata tatcattatg tctaacagga ggaaaaatgt tgtacaataa
7201 agttacaaaa caattaagct attgtacaga cccattacaa atcccactga tcaattatac
7261 atttggacct aatcaaacat gtatgtggaa tacttcacaa attcaggacc ctgaaatacc
7321 aaaatgtgga tggtggaatc aaatggccta ttataacagt tgtaaatggg aagaggcaaa
7381 agtaaagttt cattgtcaaa gaacacagag tcagcctgga tcatggttta gagcaatctc
7441 gtcatggaaa caaagaaata gatgggagtg gagaccagat tttgaaagta aaaaggtgaa
7501 aatatctcta cagtgcaata gcacaaaaaa cctaaccttt gcaatgagaa gttcaggaga
7561 ttatggagaa gtaacgggag cttggataga gtttggatgt catagaaata aatcaaaact
7621 tcatgctgaa gcaaggttta gaattagatg tagatggaat gtagggagta atacctcgct
7681 cattgataca tgtggaaaca ctcaaaaagt ttcaggtgcg aatcctgtag attgtaccat
7741 gtattcaaat aaaatgtaca attgttcttt acaaaacggg tttactatga aggtagatga
7801 ccttattatg catttcaata tgaaaaaggc tgtagaaatg tataatattg ctggaaattg
7861 gtcttgtaca tctgacttgc catcgtcatg ggggtatatg aattgtaatt gtacaaatag
7921 tagtagtagt tatagtggta ctaaaatggc atgtcctagc aatcgaggca tcttaaggaa
7981 ttggtataac ccagtggcag gattacgaca atccttagaa cagtatcaag ttgtaaaaca
8041 accagattac ttagtggtcc cagaggaagt catggaatat aaacctagaa ggaaaagggc
8101 agctattcat gttatgttgg ctcttgcagc agtattatct attgccggtg cagggacggg
8161 ggctactgct atagggatgg taacacaata ccaccaagtt ctggcaaccc atcaagaagc
8221 tgtagaaaag gtgactgaag ccttaaagat aaacaactta agattagtta cattagagca
8281 tcaagtacta gtaataggat taaaagtaga agctatggaa aaatttttgt atacagcttt
8341 cgctatgcaa gaattaggat gtaatcaaaa tcaatttttc tgcaaaatcc ctcctgagtt
8401 gtggacaagg tataatatga ctataaatca aacaatatgg aatcatggaa atataacttt
8461 gggggaatgg tataaccaaa caaaagattt acaacaaaag ttttatgaaa taataatgga
8521 catagaacaa aataatgtac aagggaagaa agggatacaa caattacaaa agtgggaaga
8581 ttgggtagga tggataggaa atattccaca atatttaaag ggactattgg gaggtatctt
8641 gggaatagga ttaggagtgt tattattgat tttatgttta cctacattgg ttgattgtat
8701 aagaaattgt atccacaaga tactaggata cacagtaatt gcaatgcctg aagtagaagg
8761 agaagaaata caaccacaaa tggaattgag gagaaatggt aggcaatgtg gcatgtctga
8821 aaaagaggag gaatgatgaa gtatctcaga cttattttat aagggagata ctgtgctgag
8881 ttcttccctt tgaggaaggt atgtcatatg aatccatttc gaatcaaatc aaactaataa
8941 agtatgtatt gtaaggtaaa aggaaaagac aaagaagaag aagaaagaag aaagccttca
9001 agaggatgat gacagagtta gaagatcgct tcaggaagct atttggcacg acttctacaa
9061 cgggagacag cacagtagat tctgaagatg aacctcctaa aaaagaaaaa agggtggact
9121 gggatgagta ttggaaccct gaagaaatag aaagaatgct tatggactag ggactgttta
9181 cgaacaaatg ataaaaggaa atagctgagc atgactcata gttaaagcgc tagcagctgc
9241 ctaaccgcaa aaccacatcc tatggaaagc ttgctaatga cgtataagtt gttccattgt
9301 aagagtatat aaccagtgct ttgtgaaact tcgaggagtc tctttgttga ggacttttga
9361 gttctccctt gaggctccca cagatacaat aaatatttga gattgaaccc tgtcgagtat
9421 ctgtgtaatc ttttttacct gtgaggtctc ggaatccggg ccgagaactt cgca
//
LOCUS ASTSSP12S 3341 bp ds-DNA PLN 29-JUN-1990
DEFINITION A.sativa 12S globulin seed storage protein gene, complete cds.
ACCESSION J05485
KEYWORDS 12S globulin; storage protein.
SOURCE A.sativa (genomic library lambda-Charon-34), clone lambda-OG1-E1.
ORGANISM Avena sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 3341)
AUTHORS Shotwell,M.A., Boyer,S.K., Chesnut,R.S. and Larkins,B.A.
TITLE Analysis of seed storage protein genes of oats
JOURNAL J. Biol. Chem. 265, 9652-9658 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.A.Shotwell, 17-APR-1990.
FEATURES from to/span description
pept 983 1310 12S globulin seed storage protein, exon 1
1428 1702 12S globulin seed storage protein, exon 2
1829 2344 12S globulin seed storage protein, exon 3
2445 2882 12S globulin seed storage protein, exon 4
pre-msg 945 > 3341 12S globulin seed storage protein mRNA and
introns
IVS 1311 1427 12S globulin seed storage protein intron A
IVS 1703 1828 12S globulin seed storage protein intron B
IVS 2345 2444 12S globulin seed storage protein intron C
signal 885 890 CAT box
signal 917 921 TATA box
signal 2915 2920 polyA addition signal
signal 2965 2970 polyA addition signal
BASE COUNT 1044 a 677 c 606 g 1014 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattctttt tggaaagtca ttttgcctcc tgaactccag tgtttcctat ttattaaaaa
61 aaactaaaaa ctatatttat aagtttgaaa aaatcatgaa acaaaattgt aaaaattgct
121 agtgatatat cccacaaacg tgcaaaatct caatttgaag tgctttgtat ttcgagctac
181 acaaaaatga caaagtgtga ctttttatgt gatttgaaat cactatacta cagatctaca
241 attttgttct ttttgtgaag ctataaatac acattatttt gagttgagat ttttcatgtt
301 tgtgctatga atcataggct acatcctgat ttatttttag aattttttgg aacctaaaat
361 atgttctaga ttatttttta aaaagtggga tcattgatgc ccatacacac gaaatctcca
421 ctcaattctt ttatacatta tctttctata tctactaacg tggattatac atcatagtaa
481 gtttcttact acatgtgctt tcttgttaca atgtggacat gactcttcca cttttgggct
541 ttatgttgta tggatatact catgacatgg aattttgtcc acacacgtag aaccatccat
601 atatattgtt gtgcatagaa caaaacacaa gcaagccatg aaaaaggagt cacaggtgcc
661 acaaactgtt gtaggaagta taactagtat gaggcctttt atttgacgtc ggacaatggc
721 caagagctac atataaaaga tggtgctaga tttgtgagtc agcaccagct gtaggcagaa
781 aacaacacat atcttttgag acaaagttat atctattcac atttaaaacc atgatctgtt
841 gagtcaccat gaatatcttc ttatctatgt taataattac atgtcatcat gtttatcctg
901 gactactttt tatggctata aaatcaaact tacaataagg aaactagcac caatccacct
961 tctacaatct cttcaaacaa tcatggcaac tactaggttt ccatcattgt tgttttactc
1021 ctatattttt ctcttgtgca atgggtcaat ggctcagcta ttcgggcaga gctttacccc
1081 atggcaaagc tctcgacaag gaggtttaag ggggtgcaga tttgataggc tacaagcatt
1141 tgaaccactt cgacaagtgg ggtcacaagc gggtatcatt gagtactttg atgagcagaa
1201 tgagcaattt cgttgtgcag gtgtatccgt catccgtcgt gttattgagc ctcaaggcct
1261 cttgttacct caataccaca acgctcctgg cttggtgtac atccttcaag gttagtgtct
1321 aattgattat aaaaattgcc tttgttatac ttcacttagg atttagatgt gccaaatgtt
1381 acaccgttca tattttttaa caatgaaaca aatgtgtttt cttttaggta ggggattcac
1441 agggttgact ttccctggat gcccggcgac cttccaacaa cagttccaac catttgatca
1501 agcccagttt gctgaaggtc aaagccaaag ccaaaatctt aaggatgaac accaaagagt
1561 tcaccacatc aaacaaggag atgttgttgc tctaccggct ggcatagtac actggtgcta
1621 caacgatggt gatgcaccga ttgtagctgt ctatgtcttc gacgtaaaca acaacgctaa
1681 tcagcttgaa ccaaggcaaa aggtaactat acaacttaat gcacacaaaa tatatacaac
1741 tatttacaag tgatccaact atgatttgaa gtatacatta gtgggatatt aatgaactct
1801 gtttaacttc atcgatataa aattttagga gttcttgttg gctggtaaca acaagagaga
1861 gcaacagttt ggacaaaaca tattcagtgg attcagtgtc caacttctta gtgaggccct
1921 tggtataagt cagcaagtag cacaaaagat tcagagtcaa aatgaccaaa gaggtgagat
1981 aattcgtgtg agtcaaggcc ttcaattctt gaagcctttt gtttcccaac aaggaccagt
2041 agagcatcaa gcctaccaac caattcaaag tcaagaagaa caatcaaccc aataccaggt
2101 agggcaatca ccacaatatc aagaaggaca atcaactcaa taccagccag gacagtcatg
2161 ggaccaaagt ttcaatggtt tggaggagaa tttctgttca ttggaggcaa ggcaaaacat
2221 cgaaaacccg aaacgtgccg acacgtacaa cccacgtgct ggcaggataa cacatctcaa
2281 tagcaagaat tttcccaccc ttaacctggt gcaaatgagt gctacaagag taaatttata
2341 ccaggtattt atgatactac attcaacaca ctatcttatt tttagatatt ctaagcttca
2401 tacaaccgat taataatatg gcatacaaat aattgctatt gcagaatgct attctttcac
2461 catactggaa cattaatgct cacagtgtca tgcacatgat ccaaggacgt gctcgagttc
2521 aagttgtcaa taaccatggt cagaccgtat tcaatgacat tcttcgtcgc ggacaactgc
2581 taatcatacc acaacactat gttgttctca agaaggcaga gcgtgaagga tgccagtata
2641 tttcattcaa gaccaacccc aattccatgg ttagccagat cgcaggaaag acctccatcc
2701 tacgtgcatt gcccgttgat gtcctcgcca atgcataccg catttctaga caggaagccc
2761 aaaacctcaa aaataaccgt ggagaagagt ttgatgcatt cacccctaag tttacacaaa
2821 cgggctccca gagttaccag gacgaggggg agtcatcttc gactgagaag gcatccgagt
2881 gaataagtga gtgtaatgga aactagtata gtgaaataaa ggcatcgcat gtttgcagcc
2941 tagtggtata taaccgctta tctcaataaa aaagtttctc cgtgttatat tgtttgcttg
3001 tttcttgtac tcttcttaat tttatctttt atgatcaacc aacttacctc tcttcttcaa
3061 ttttctgcca ctcttcacat atgcatatca aggagacgta tagtccactt gtactagatt
3121 tattttttgg tactatcttt tgtggtttga gtatttctta ggtccgataa aaacattcaa
3181 tttactgatg ctaaattttt tcagcaaacc tagccaccta aaattgtgaa tcacatacct
3241 tcaaatttct atgccaaagt gtgcgatttt tatgttctat agatccaagc ttgggctgca
3301 ggtcgactct agaggatcga tccccgggcg agctcgaatt c
//
LOCUS ASTSSPA 1548 bp ds-DNA PLN 29-JUN-1990
DEFINITION A.sativa seed storage protein gene, complete cds.
ACCESSION J05486
KEYWORDS avenin; prolamine; storage protein.
SOURCE A.sativa (genomic library lambda-Charon-34), clone lambda-AV45-X1.
ORGANISM Avena sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1548)
AUTHORS Shotwell,M.A., Boyer,S.K., Chesnut,R.S. and Larkins,B.A.
TITLE Analysis of seed storage protein genes of oats
JOURNAL J. Biol. Chem. 265, 9652-9658 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.A.Shotwell, 17-APR-1990.
FEATURES from to/span description
pept 484 1113 seed storage protein
mRNA 411 > 1548 seed storage protein mRNA
signal 343 347 CAT box
signal 380 384 TATA box
signal 1189 1194 polyA addition signal
signal 1252 1258 polyA addition signal
BASE COUNT 484 a 375 c 323 g 366 t
ORIGIN
1 tctagaacaa tccttcctgt tactagttta atctacctta tacacagaca agtttacaaa
61 ctttgcaagc catgacaata tggtttatgc ctaacagcct gtagaaaaac acaaacttag
121 tttcagaaaa tgatgcaata tagatcagtg attgacatgt aaagcgaaaa agatgagaca
181 tattgccaac tatcaagctt tcttgatagt cgaaaggtga tacatgtaat ggcaaagcaa
241 cttttatgat cgatccaaaa ataacatttg taggtagtgc caccaaacac aacataccaa
301 ataatcagtt cgagaagcat acaagcattt tttaaggaaa agcaaatgca aagatgaaaa
361 gaaactatgc catggcagat ataaatagac ctgcaccatg gagatcctcc tccatcattc
421 atccttcaca caccgagagc acaaacatta aaacctagca agcagtatca acacaaatct
481 accatgaaga acttcctcat ccttgccctc ctttccatgg cggcgaccat ggccactgcg
541 cagtttgacc ctagcgaaca ataccagcca tatcctgagc aacaacagcc aattctacaa
601 caacagttgc tgttgcagca acaacaacag atgttgttgc agcaacaacc attgttgcag
661 gttttgcagc aacagttgaa cccatgcagg cagttcctcg tgcaacagtg cagcccggtg
721 gcagcggtgt cattcctccg gtcgcaaatc ctgcaacaga gcagctgcca ggtgatgagg
781 caacaatgct gccgacggct ggagcagatc cccgagcagc tccggtgccc agccatccat
841 agcgtcgtcc aggccatcat tatgcagcaa caacaacaac agttcttcca gcctcagatg
901 caacaacagt tcttccagcc tcagatgcaa caggtgacac agggcatctt ccagcctcag
961 atgcaacagg tgacacaggg catcttccag actcagatgc aaggtcagat cgaggggatg
1021 agggcgtttg cgctgcaggc cctgccggcg atgtgcgatg tatacgtccc gccgcactgc
1081 cccgtcgcca ccaccccgct cagtggcttc taagaacact acaagagcta tagtactaca
1141 taaataccat cagcgtttag ccgatggacc gatcttgtag cggtgacaaa taaaataaaa
1201 gggtcatgca ctaacatgtg tgacagccga cccgtactag tccaaacttg gaaataaaag
1261 aatgcaaagt tcttgtctgc ataacattga ttgtcgtttt tccattcatg tttataccta
1321 accataagct catgcattaa ctagctgctt atgtagcctt cattgatgtg aacataataa
1381 cagagtgggt gtaaaagatg gatttgaatt gaggcactat taaagtgagg agtaagaacc
1441 aggtgcagat gatagctctt taatggtgag gcttgtgagt tgtgacctta ttggctgaca
1501 tgcattggat tctaaagagt acatgttgat gctttcaaca atgtcgac
//
LOCUS DOGSPC18A 847 bp ss-mRNA MAM 29-JUN-1990
DEFINITION Dog microsomal signal peptidase complex (SPC 18) mRNA, complete
cds.
ACCESSION J05466
KEYWORDS ER membrane protein; signal peptidase complex.
SOURCE Dog liver, cDNA to mRNA, lambda-18-[50,51].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 847)
AUTHORS Shelness,G.S. and Blobel,G.
TITLE Two subunits of the canine signal peptidase complex are homologous
to yeast sec-11 protein
JOURNAL J. Biol. Chem. 265, 9512-9519 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.S.Shelness, 04-APR-1990.
FEATURES from to/span description
pept 39 578 microsomal signal peptidase complex
BASE COUNT 212 a 162 c 216 g 257 t
ORIGIN
1 cctctccgct gcccgtgtcc tccttcaccg gccgcgccat gctgtctcta gattttttgg
61 acgatgtgcg gcgaatgaac aagcggcagc tgtattatca agtcctaaat tttggaatga
121 ttgtctcctc ggcactaatg atctggaagg gactgatggt cataactgga agtgaaagtc
181 caattgtagt ggtgctcagt ggcagcatgg agccagcatt tcatagagga gatcttctct
241 ttttaacaaa tcgagttgaa gatcccatcc gagtgggaga aattgttgtt tttaggatag
301 aaggaagaga gattcctata gttcaccgag tcttgaagat tcatgaaaag caaaatggac
361 atatcaagtt tttgaccaaa ggagataata atgccgttga tgaccgaggc ctctataaac
421 aaggacaaca ctggctagag aagaaggatg tcgtggggag agcaagggga tttgttcctt
481 atattggaat tgtgacgatc ctcatgaatg actatccgaa atttaagtat gccgtcctct
541 tcttgctggg tttattcgtg ctggtccatc gtgagtaaga agtctacctt gctgttcctg
601 gaagatgccg tacttttcgt tcctgactgt ttggagtaga tattggtcta tgattggtgg
661 agtgcagaac gtacatgttg gcgcttcttg gtagcaatgg tttgcattag tttctgtttc
721 cacgccaggg tctgtgtggg cgggtgcacc aagatgcgca ccgaggggac tctcaatcac
781 aggatttcat atgttgtcat tgtcacactt tcacattttt gtacatcagt gaatttttta
841 tattaaa
//
LOCUS DROCRPA 7226 bp ss-mRNA INV 29-JUN-1990
DEFINITION D.melanogaster crumbs protein mRNA, complete cds.
ACCESSION M33753
KEYWORDS crumbs protein.
SOURCE D.melanogaster (strain Oregon R) 3-12 hr embryo, cDNA to mRNA,
clone lambda-gt10.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 7226)
AUTHORS Tepass,U., Theres,C. and Knust,E.
TITLE Crumbs encodes an EGF-like protein expressed on apical membranes of
Drosophila epithelial cells and required for organization of
epithelia
JOURNAL Cell 61, 787-799 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by U.Tepab 17-APR-1990.
FEATURES from to/span description
pept 214 6633 crumb protein
BASE COUNT 1880 a 1860 c 1879 g 1607 t
ORIGIN Chromosome 95F9-13.
1 aaaaaaaata aaaataacct gttctaagcc aagtgccgat tgtgtgtgtg tgtgtgtgtg
61 cgtatgaagt gtttgcgtgg agcagagaga atgggagaat gcaattacca acaacaacaa
121 gctcaatgag tatcgtttga atagcaataa tatctagcca aaagcaaccc tttccaatac
181 gaaaactcga cttgagccaa caaatcagcg atcatggcta aaatcgccaa tgcgtcactg
241 tcgcaacaac aaaaacagcg gcaagcagaa acagcaacga ccacaacaac aacagtggcg
301 gcatcaggta gaaacagcaa caacaacagc aagaagtcga gaccgcacga aaagcgccac
361 gtcaaattac gctcgcattt gctcaagcgc cgcccatttc agtttattcg tcgccctcaa
421 tggataccgc tcttcatact tatttactta gcaacagatg tcgcctcagt ggcggtgccg
481 acgaaggagg cgtactttaa tggctccact tacctccgcc tcaccacgcc gatgcccatt
541 tgggatcact cggcgattag tttccgctcg tgccgcggcg gcgagatcct cgcccagcag
601 tacaacaaga actccattgt aatctcagtg ctcaatgact ttctgcaaat ctcactggct
661 ggacccgccg tccatgggcc caacaaccgg ctggatgtca agctgcccta ccaactgctg
721 gacaaccgct ggcatacgct gcagttcaag tacgagtacg gaaatctcta cctgcatgtg
781 gatcgcgcgg caagcatatt tgccaactcc acgtacaaca gtcagttcct gactaaccag
841 gacattggct acaaggacgc catcttgata ctgggaaact ccttctccgg ttgccttctg
901 gatggaccag gtctgcaatt tgtgaacaac tcgacggtgc agaatgtggt ctttggtcac
961 tgtcccttga ccccgggtcc ctgcagcgat cacgatcttt tcactcgact gccagacaac
1021 ttttgtctga atgatccctg tatgggccat ggaacctgct cgtccagtcc agagggatac
1081 gaatgtcggt gcacggcacg ttactcggga aaaaattgcc agaaggacaa tggctcgccg
1141 tgtgccaaaa atccatgtga gaacggtggt tcctgcctgg agaattccga ggggaattac
1201 cagtgcttct gtgatcccaa ccacagtggt cagcactgcg agacggaggt gaacatccac
1261 ccactctgcc agacgaaccc ctgcctgaac aatggagcat gtgtggtgat tggcggcagt
1321 ggagcactca cctgcgagtg tcccaaagga tacgctggtg ccaggtgcga ggtggacacg
1381 gatgagtgcg cctcacagcc gtgtcagaac aacggaagct gcatagaccg tatcaatgga
1441 ttcagctgcg attgcagcgg caccggttac acgggtgcgt tctgccagac gaatgtggat
1501 gagtgcgaca agaatccgtg cctgaacggc ggcagatgct tgcataccta cggatggtac
1561 acctgccagt gtctggatgg ttggggcggc gagatttgtg atcgacccat gacatgccag
1621 acgcagcagt gtttcaacgg tggcacctgc ctggacaagc ccatcggctt ccagtgcctc
1681 tgtccgccgg agtacactgg tgaactgtgc caaatcgcac ccagttgcgc acagcagtgc
1741 cccatcgatt cggagtgcgt aggcggcaaa tgcgtttgta agccaggctc atcgggttac
1801 aactgtcaaa cgagcacggg cgatggagct tctgctttgg ccctgacacc catcaactgc
1861 aatgccacca atggaaagtg tctcaatgga ggaacttgct ccatgaacgg aacccactgc
1921 tattgcgccg tcggctattc gggagatcgt tgcgagaagg ccgagaactg ttcgccactg
1981 aactgccagg aaccgatggt atgtgtccag aatcagtgcc tctgcccgga gaacaaggtg
2041 tgcaaccagt gcgccacaca accgtgccag aatggcggtg aatgtgtgga cctaccgaat
2101 ggagactacg agtgcaagtg cacccgtgga tggactggac gcacctgtgg caacgacgtg
2161 gacgagtgca ccctgcatcc gaagatctgc ggcaatggta tctgcaagaa cgagaaggga
2221 tcgtacaaat gctactgtac gcctggattt accggagtcc actgcgattc cgatgtggac
2281 gaatgcctta gctttccctg cctcaacgga gccacgtgcc acaacaagat aaatgcctac
2341 gagtgcgttt gccaaccagg atatgaggga gagaactgcg aagtagatat cgatgagtgc
2401 ggcagtaatc cctgctcgaa cggatccacc tgcatcgaca ggatcaacaa cttcacctgc
2461 aactgcattc cgggaatgag gggccgcata tgtgacatcg acattgacga ctgtgtggga
2521 gatccctgct tgaatggtgg tcagtgcatc gaccaattgg gtggcttccg ttgtgactgc
2581 agtggcaccg gttatgaggg cgagaattgt gagctgaaca tagacgaatg cctctcgaat
2641 ccgtgcacga atggtgccaa gtgcctggac agggttaagg actacttctg cgattgccac
2701 aacggctaca agggaaagaa ctgtgagcag gacatcaacg agtgcgagag taatccctgc
2761 cagtacaatg gcaactgcct agagcgttcg aacataacgc tttaccaaat gagtaggatc
2821 acggatctgc ccaaggtgtt tagccagccc tttagctttg agaatgccag cggctacgag
2881 tgcgtctgtg tgcccggcat cattggaaag aactgcgaga tcaacattaa tgagtgcgac
2941 agtaatccat gcagcaagca cggaaattgc aatgacggga ttggaaccta tacctgcgaa
3001 tgcgaacctg gcttcgaggg cacccactgt gagatcaaca tagatgaatg tgatcgctat
3061 aatccctgcc agaggggcac ttgctatgac cagatagacg actacgactg cgactgcgat
3121 gcgaactatg gaggcaaaaa ctgttccgtt cttctcaaag gctgtgacca gaatccttgt
3181 ttaaatggcg gtgcctgctt gccgtatttg atcaacgagg taactcatct gtacacctgc
3241 acctgcgaga acggcttcca gggcgataaa tgcgaaaaga ccactactct gtccatggtg
3301 gccactagcc tgatttcggt taccacggaa cgcgaggaag gctacgacat taatcttcaa
3361 tttaggacca cactgcccaa cggagttttg gctttcggaa ccaccggcga aaagaatgaa
3421 ccagttagct atattttgga gctgattaat ggacgactga atctacattc ctcgctccta
3481 aataagtggg agggcgtgtt tattggatcg aagctgaacg atagcaactg gcacaaggtg
3541 tttgtggcca taaacacttc gcatctagtc ctttcggcca acgacgagca ggctattttt
3601 ccggttggct cctatgaaac ggccaataat agccagcctt cattcccgcg cacatacctc
3661 ggaggcacca ttcccaatct gaagtcctat ctgcgccacc tcacccacca accatcagct
3721 tttgtgggct gcatgcaaga tataatggtc aatggtaaat ggatctttcc cgatgagcag
3781 gatgcgaata tcagttatac caaactggaa aatgtccaga gcggttgtcc gcgcaccgaa
3841 caatgtaaac cgaatccttg ccattccaac gttgaatgca cggacctttg gcacaccttc
3901 gcttgtcact gtcccagacc attctttggg cacacatgtc agcacaatat gactgctgcc
3961 acttttggcc acgagaacac aacccattcg gccgtgattg tggagaccac agatgtggct
4021 aggcgagcta ttagatccat cctagacatt tccatgttta ttcgtacacg cgagccaact
4081 ggtcaagtct tttacttggg cactgatcca cgcaaagccc ccactaaaaa tattggcgac
4141 tcatatgtgg cggccaaatt acacggcggc gagctgctgg taaagatgca gttcagcggt
4201 actccggagg cctacaccgt tggtggccag aaattggaca acggctataa ccacctgatc
4261 gaggtggtgc gcaatcaaac gctcgtgcag gtcaagctca atggcaccga gtacttccgc
4321 aagacgctgt ccacgacggg tctgctggac gcacagttgc tttacttggg cggacctgca
4381 cccacacgcg agtcacttct gggcgcgacc acagaaccag gaataattcc agttccggga
4441 gcaggaatac ccattgaaga cacaacagtg cccaaggagg cggacgacag cagggactac
4501 tttaagggca taatccagga cgtgaaggtt agcaatggct cgctcaacct gattgtggag
4561 atgtattccc taaacgtgac agatgtccaa gtgaatgcca aacctctggg cgctgtaacc
4621 atcgatcgtg cctccgtgtt acctggtgag gtgtccgatg atctgtgccg aaagaatgcc
4681 tgcctccata atgccgagtg caggaacaca tggaatgact acacttgcaa atgcccaaat
4741 ggttacaagg gaaagaagtg tgccaggaga tcagaatttt gccaacacgt tacatgtccg
4801 ggacagagct tgtgccaaaa tctggacgat ggctatgagt gtgttacgaa caccacattt
4861 actgggcagg aacgcagtcc tctcgccttc ttctacttcc aggagcagca atctgacgat
4921 attgtcagtg aagcgtctcc caagcaaact cttaaaccgg taattgatat tgccttccga
4981 ctcgtgctgg aggtactctg cctgtacata gacaatgtgg acggattctt tgaaattgga
5041 gtaaacggag gacgagtgac catcacctgg aagctaagtg cactgcattt tggcgagtct
5101 gcacgctttg agaaggagaa cactgacgga gaatggagtc gcatttacct cagggcacac
5161 aacagcaaac tggagggtgg atggaaggga tgggaatcga tggtggatcc gacgccagct
5221 ttttccacgg acatcgacca agcggccttc cagtccctga tcgctacaag tacccaggtt
5281 tacttgggtg gcatgccaga gtcgcgacaa gcacgaggat ccactttgtc tgcccagcag
5341 ggctctcagt tcaagggctg tgtgggagag gcaaggctcg gcgatctttt actgccctac
5401 ttctccatgg cggaactgta ttcgcgcacc aatgtttcag tacagcaaaa ggctcaattc
5461 cgtctaaatg ccacacgacc tgaggagggc tgcatcctgt gcttccagtc agactgcaaa
5521 aatgacggct tctgtcaatc tccttcagat gagtacgcct gcacctgtca gcctggattc
5581 gagggcgatg attgcggcac ggacatcgac gagtgtctta acacggaatg cttgaacaac
5641 ggcacctgca tcaaccaggt tgcagctttc ttctgccagt gtcagccagg attcgagggt
5701 cagcactgtg agcaaaacat cgacgagtgt gcggatcagc cgtgccacaa cggtggcaac
5761 tgcacggatc ttatcgcatc gtacgtgtgc gactgccctg aggactatat gggcccgcag
5821 tgcgacgtgc tgaagcaaat gacctgcgag aacgagccat gtcggaatgg atcaacctgc
5881 cagaatggat tcaatgcttc tactggcaat aactttacat gtacatgcgt gcccggcttc
5941 gagggtccac tgtgtgacat acccttctgt gaaataacgc cttgcgataa cggtggcctc
6001 tgcctgacca ctggagcggt accgatgtgc aaatgtagtc tgggatacac tggtcgcctg
6061 tgcgagcagg acattaacga gtgcgaatcg aatccatgcc agaacggtgg tcagtgtaag
6121 gacctcgtcg gcaggtacga gtgcgattgc agggcacgga ttcgagggat tcgctgtgaa
6181 aatgacatcg acgagtgcaa catggaggga gattactgcg gcggattggg ccggtgtttc
6241 aacaagcccg gatccttcca gtgcatctgc cagaaaccct attgcggagc ctactgcaac
6301 tttacggatc cctgcaacgc tacggacctc tgtctaaacg gcggtcgctg cgtagagtcc
6361 tgcggcgcca aaccggacta ctactgcgag tgtccggaag gtttcgcggg aaagaattgc
6421 acagcaccga ttacggccaa ggaggacggg ccttcgacca cagacattgc catcattgta
6481 atacccgtag tggtggtgct gctgctgatc gcgggagccc tcctgggcac cttcctggtg
6541 atggccagga acaagcgagc aaccaggggc acctatagcc cgagcgtcgt caagagtact
6601 gcaacccacg gctggaaatg gacaacgtac tgaagccacc gccggaagag cgactaattt
6661 agttttgagt tttgagcatg aacgacgatt agcaaagcaa acaaaagata tttttaaatc
6721 cgcccatata cacctagctg taggagtaac tcaatgtttt gtactaagtt ggcccctagt
6781 tatggtttac atcttaaggt gctcaaagca atagcagcag cagcagcagc caggcttcca
6841 cctcgactcg tcccttatcc cctgcctacg ctttagttag ttaataatgc cgttgtctat
6901 ttattctagt agttagatga catacgtacc gccctatagt cgttatgtag ttatgttccc
6961 gatagtttag attccgtatt cgatttctcg tatatgtaat cctaaagctg cgaagaaact
7021 tgagctccga actgtcgatt tcccctttgc attccacaag gaagcccatc aaatgtattg
7081 tatgagtatt tttgtaaata agattgtttg ccgactctta aagtaaatta gctgagctaa
7141 aatcgaaatt aaaaacagaa aatcctatat gcctatatat cgcgagtata tatcagtaaa
7201 tgtggcctta taacaaaaaa aaaaaa
//
LOCUS ECOINCRNA 263 bp ss-RNA BCT 29-JUN-1990
DEFINITION E.coli antisense Inc RNA.
ACCESSION M33752
KEYWORDS antisense RNA.
SOURCE E.coli clinical isolate RNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 263)
AUTHORS Lopez,X.X. and Rodriguez,J.C.
TITLE Isolation and nucleotide sequence of an incompatibility gene from
the incFv1 plasmid pSU401: comparison to other IncF genes
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Rodriguez, 17-APR-1990. Univ de Cantabria, Depto Biologia
Molecular, c/Cardenal Herrera Oria s/n, 39011 Santander, SPAIN
FEATURES from to/span description
signal 179 185 -10 promoter
signal 205 211 -35 promoter
BASE COUNT 70 a 54 c 71 g 68 t
ORIGIN
1 gatcgtttaa ggaattttgt ggctggccac gccgtaaggt ggcaaggaac tggttctgat
61 gtggatgtac aggagccaga aaagcgaaaa ccccgataat cttctctaac tttggcgagt
121 gcagaaagat taccggggcc atctaaaaac cgtatagcca acaattcagc tatgcgggga
181 gtatagttat atgcccggaa aagttcaaga cttctttctg tgctcgctcc ttctgcgcat
241 tgtaagtgca ggatggtgtg act
//
LOCUS FLASL287HA 1095 bp ss-RNA VRL 29-JUN-1990
DEFINITION Influenza A/SL/2/87, A(H1N1) virus hemagglutinin (seg 4) RNA, 5'
end.
ACCESSION M33748
KEYWORDS hemagglutinin.
SOURCE Influenza type A/SL/2/87 passed in rhesus monkey kidney cells.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1095)
AUTHORS Rajakumar,A., Swierkosz,E.M. and Schulze,I.T.
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
REFERENCE 2 (bases 33 to 1095)
AUTHORS Rajakumar,A., Swierkosz,E.M. and Schulze,I.T.
TITLE Sequence of an influenza virus hemagglutinin determined directly
from a clinical sample
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.T.Schulze 17-APR-1990. St. Louis School of Med, Dept of
Microbiology, 1402 S Grand Blvd St. Louis Missouri 63104
FEATURES from to/span description
pept 33 > 1095 hemagglutinin precursor protein
sigp 33 83 signal peptide
matp 84 1061 hemagglutinin 1 (HA1)
matp 1066 > 1095 hemagglutinin 2 (HA2)
BASE COUNT 381 a 233 c 237 g 244 t
ORIGIN
1 agcaaaagca ggggaaaata aaaacaacca aaatgaaagc aaaactacta gtcctgttat
61 gtgcatttac agctacagat gcagacacaa tatgtatagg ctaccatgcg aacaactcaa
121 ccgacactgt tgacacagta cttgaaaaga acgtgacagt gacacactct gtcaacctac
181 ttgaggacag tcacaacgga aaactatgtc gactaaaagg aatagcccca ctacaattgg
241 gtaattgcag tattgccgga tggattttag gaaacccaga atgcgaatca ctgttttcta
301 agaaatcatg gtcctacatt gcagaaacac caaactccga gaatggaaca tgttacccag
361 ggtatttcgc tgactatgag gaactgaggg agcaattgag ttcagtatca tcattcgaga
421 gattcgaaat attccccaaa gaaagctcat ggcccaacca caccgtaacc aaaggagtaa
481 cggcatcatg ctcccataag gggaaaagca gtttttacag aaatttgcta tggctgacgg
541 agaagaatgg ctcgtaccca aatctgagca agtcctatgt gaacaacaaa gagaaagaag
601 tccttgtact atggggtgtt catcacccgt ctaacatagg ggaccaaagg gccatctatc
661 atacagaaaa tgcttatgtc tctgtagtgt cttcacatta taacaggaga ttcaccccag
721 aaatagcaaa aagacccaaa gtaagagatc aagaaggaag aattaactac tactggactc
781 tgctggaacc cgaggacaca ataatatttg aggcaaatgg agatctaata gcgccatggt
841 atgcttttgc actgagtaga ggctttgggt caggaatcat cacctcaaac gcatcaatgg
901 atgaatgtga cgcgaagtgt caaacacccc agggagctat aaacagtagt cttcctttcc
961 agaatgtaca cccagtcaca ataggagagt gcccaaagta tgtcaggagt acaaaattaa
1021 gaatggttac aggactaagg aacatcccat ccattcaatc cagaggtttg tttggagcca
1081 ttgccggttt cattg
//
LOCUS TREKPMDA 999 bp ds-DNA ORG 29-JUN-1990
DEFINITION T.evansi kinetoplast DNA minicircle.
ACCESSION M33751
KEYWORDS .
SOURCE T.evansi bloodstream forms DNA, clone AnTat 3/3.
ORGANISM Kinetoplast Trypanosoma evansi
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Trypanosoma evansi.
REFERENCE 1 (bases 1 to 999)
AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
Steinert,M. and Hamers,R.
TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by E.Bajyana Songa 17-APR-1990.
FEATURES from to/span description
site 367 516 kinetoplast minicircle common region (ORI)
BASE COUNT 391 a 78 c 203 g 327 t
ORIGIN
1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS RATCASPAT 1535 bp ss-mRNA ROD 29-JUN-1990
DEFINITION Rat cytosolic aspartate aminotransferase mRNA, complete cds.
ACCESSION D00252
KEYWORDS cytosolic aspartate aminotransferase.
SOURCE Rat (strain Wistar), cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1535)
AUTHORS Horio,Y., Tanaka,T., Taketoshi,M., Nagashima,F., Tanase,S.,
Morino,Y. and Wada,H.
TITLE Rat cytosolic aspartate aminotransferase: Molecular cloning of cDNA
and expression in Escherichia coli
JOURNAL J. Biochem. 103, 797-804 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Wada, 07-SEP-1989.
FEATURES from to/span description
pept 43 1284 cytosolic aspartate aminotransferase (EC
2.6.1.1)
mRNA 1 1535 cytosolic aspartate aminotransferase mRNA
BASE COUNT 360 a 409 c 410 g 356 t
ORIGIN
1 agctcccagc tcgtgctcga ggtcccctca gattccatcg cgatggcccc tccatcattc
61 tttgcccagg ttccacaggc cccgccggtt ctggtcttta agctcattgc ggacttccgg
121 gatgatcccg atccccgcaa ggttaacctc ggcgtgggag cgtaccgcac agatgactct
181 cagccctggg ttttgccagt agtgaggaag gtcgaacaga agattgctaa cgaccacagt
241 ctcaaccacg agtacttgcc catcctgggc ctggcggagt tccggagctg tgcttctcag
301 ctagtacttg gggacaacag cccagctctc agggagaatc gggttggggg tgtgcagtct
361 ttgggaggga ccggtgcact tcgaattgga gctgacttct tagggcgatg gtacattggc
421 acagacaaca agaacacgcc cgtctacgta tcatcgccga cctgggagaa ccataatggc
481 gtgttttctg ccgctggttt taaagacatt cggtcctatc gctactggga tgcagagaag
541 agaggacttg atctccaggg tttcctgaat gatctggaga atgctcctga gttctccatc
601 tttgtcctcc acgcctgtgc acacaaccca acggggaccg acccaactga agaggagtgg
661 aagcagatcg ccgccgtcat gaagcgccgt tttctgttcc ccttctttga ctcagcctat
721 cagggctttg catctggaga cctagagaaa gatgcctggg ctattcgcta ttttgtgtct
781 gaaggcttcg agctcttctg tgcccagtcc ttctccaaga acttcgggct ctacaatgag
841 agagtgggga atctgaccgt ggtcggaaaa gagcatgaca gcgtcctgcg ggtcctttcc
901 cagatggaga agattgtacg aatcacctgg tccaatcccc ctgcccaggg agctcggatc
961 gtggccacca ccctctccaa ccctgagctc tttaaggagt ggaaaggaaa cgtgaagaca
1021 atggctgacc ggattctgac catgagatcc gaactcaggg cgcgactaga agctctcaag
1081 actcccggga cttggtctca catcactgag cagattggaa tgttcagctt tactgggttc
1141 aaccccaagc aggtcgagta tttggtcaac gagaagcaca tctatctgat gccgagcggt
1201 cggatcaaca tgtgcggctt gaccaccaag aacctagatt atgtggctac ctccatcaat
1261 gaagctgtca ccaaattcca gtgaagaaac accgagtagt tcataccacc aaagcagttc
1321 ctgtcacagc tttcctgcct gcgcaaacct agccgtacat gttgtttatt agagatgacc
1381 accatgggga ggcagccgct gtttagctgg ccccacaaga gaagacattt cttgaacatg
1441 aacctgggtc gggtgggggg atgactgggg ttagggcctt ttggaaacca gagcagatta
1501 aagttattta agaataaaaa aacccgttga tatgc
//
LOCUS RATGL5 599 bp ds-DNA ROD 29-JUN-1990
DEFINITION Rat NICER element (GL5-14) 3' long terminal repeat.
ACCESSION M33534
KEYWORDS long terminal repeat.
SOURCE Rat (strain NEDH) liver NGS-TC12 induced pheochromocytoma cell line
DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 599)
AUTHORS Cho,K.-O., Minsk,B. and Wagner,J.A.
TITLE NICER elements: A family of nerve growth factor inducible
cAMP-extinguishable retrovirus-like elements
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3778-3782 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Wagner, 03-APR-1990.
FEATURES from to/span description
rpt 21 575 3' viral long terminal repeat
recomb 575 576 viral DNA end/rat DNA start
signal 345 349 CAAT box
signal 402 406 TATA box
signal 481 486 Poly-A signal
BASE COUNT 174 a 154 c 139 g 132 t
ORIGIN
1 tggtagaaga agaggggaat ggaaaggaaa attatacaga tttaaggttt aaaaatatga
61 agttaaaaga gtatgtttca actcaggact aaacactgtg aaaagcaagt ccagacagcc
121 ccgccctgcc gctagaacta acagaccata aaaggaaagg aatgcagaac agaccaggag
181 taccggacct gactcacagg ccacctggca ggaagagata agcccccagc ccccgacatc
241 caggacgccc caaagctgcc aatgtgtgta gctataactt attacctcat catgtgaaat
301 agccaatcat atgtgaacat gtctttgtgc ctcgtttgaa tccaccaatc cccgtaacta
361 tgcatctgct tctgtacgcc cacttctgct tccccaatcc ctataaaatc cccatgctgg
421 agctgctggg tgcgcaagtc ctccgaagag tctgtgtgcc cgcaggtacc tgtgttttcc
481 aataaaccct cttgctgatg gcatcgcagt ggactcggct cggtcattgg ggcttggact
541 cctcctgagg gaaggtcctc tccgggggtc tttcagaaga agagaattaa cagaccaaa
//
LOCUS RATGL5A 599 bp ds-DNA ROD 29-JUN-1990
DEFINITION Rat NICER element (GL5-14) 5' long terminal repeat.
ACCESSION M33535
KEYWORDS long terminal repeat.
SOURCE Rat (strain NEDH, isolate GL4-14) liver NGS-TC12 induced
pheochromocytoma cell line DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 599)
AUTHORS Cho,K.-O., Minsk,B. and Wagner,J.A.
TITLE NICER elements: A family of nerve growth factor inducible
cAMP-extinguishable retrovirus-like elements
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3778-3782 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Wagner, 03-APR-1990.
FEATURES from to/span description
recomb 18 19 rat DNA end/viral DNA start
rpt 19 584 5' viral long terminal repeat
signal 344 348 CAAT box
signal 401 405 TATA box
mRNA 439 > 599 mRNA
signal 479 485 Poly-A signal
binding 579 586 primer binding site
BASE COUNT 160 a 158 c 138 g 143 t
ORIGIN
1 ttattatgtg tccttgaagg gaaaggaaaa ttatacagat ttaaggttta aaaatatgaa
61 gttaaaagag tatgtttcaa ctcaggacta aacactgtga aaagcaagtc cagacagccc
121 cgccctgccg ctagaactaa cagaccataa aaggaaagga atgcagaaca gaccaggagt
181 accggacctg actcacaggc cacctggcag gaagagataa gcccccagcc tccgacatcc
241 aggacgcccc aaagctgcca atgtgtgtag ctatacctta ttacctcatc atgtgaaata
301 gccaatcata tgtgaacatg tctttgtgcc tcgtttgaat ccaccaatcc ccgtaactat
361 gcatctgctt ctgtacgccc acttctgctt ccccaatccc tataaaagcc ccatgctgga
421 gctgctgggt gcgcaagtcc tccgaagaga ctgtgtgccc gaaggtacct gtgttttcca
481 ataaaccctc ttgctgattg catcgagtgg actcggctcg gtcattgggc gcttggactc
541 ctcctgaggg aaaggtcctc tccgggggtc tttcattatg gggctcgccc cgggatttg
//
LOCUS XELHMGCOA 2970 bp ss-mRNA VRT 29-JUN-1990
DEFINITION X.laevis 3-hydroxy-3-methylglutaryl-coenzyme A reductase mRNA,
complete cds.
ACCESSION M29258
KEYWORDS 3-hydroxy-3-methylglutaryl-coenzyme A reductase.
SOURCE X.laevis liver, cDNA to mRNA, clone 2.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2970)
AUTHORS Chen,H. and Shapiro,D.J.
TITLE Nucleotide sequence and estrogen induction of Xenopus laevis
3-hydroxy-3-methylglutaryl-coenzyme A reductase
JOURNAL J. Biol. Chem. 265, 4622-4629 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Chen, 20-OCT-1989.
FEATURES from to/span description
pept 115 2766 3-hydroxy-3-methylglutaryl-coenzyme A
reductase
mRNA < 1 2970 3-hydroxy-3-methylglutaryl-coenzyme A
reductase
BASE COUNT 821 a 676 c 714 g 759 t
ORIGIN
1 ggcgggaggt agaacatctg tgtgtgtcat ctccgtatcc ttgtggtatt tctctgccgc
61 cgccgggtca gtcaccagcc agcatcggcc aaagatatcc ttaatagagc caacatgctg
121 tccagactat ttcgcatgca tgggcagttt gtggcctctc atccctggga ggtcatcgtt
181 gggactgtga ctctcacaat ctgcatgatg tcaatgaaca tgttcactgg gaatgacaaa
241 atctgtggct ggaattatgc ttgccccaag tttgaagaag atgttctgag cagtgacatc
301 atcatcctga caatcacaag atgcatcgcc atcctttaca tttatttcca gttccaaaac
361 ctcagacagc tcggctcaaa atacatttta gggattgctg ggcttttcac catcttctca
421 agctttgttt ttagcacagt ggttattcac ttcctggata aagaactgac aggcctaaac
481 gaagctcttc catttttcct gcttttgatc gatctttcca aggctagtgc tttagccaaa
541 tttgctctaa gttccaactc acaggatgaa gtccgagaca atattgcccg tggcatggca
601 attctaggac ctacctttac ccttgaggcg cttgtagaat gtcttgtgat tggagtcggt
661 acaatgtcag gtgtgcggca gttagaaatc atgtgttgct tcggctgcat gtctgtcctt
721 gccaattatt ttgccttcat gaccttcttt ccagcttgtg tttcattggt gttagagctt
781 tctagagaga gccgagaggg ccgtccaatc tggcagctca gtcagtttgc tagtgtacta
841 gaagaagagg aggataataa acccaatcct gttactcaga gagtgaaaat gattatgtct
901 ttgggtttgg tccttgttca tgctcatagt cggtggataa gtgaaccctc ttcccaaaac
961 agcacctcaa tcagtgacca tgaagttacc accatgctgg atgatatgat gccaaagaga
1021 gttgagccta gcatgcctct ctggcagttc tatctttcta gaatggtgac catggatgta
1081 gaacagatca ttactcttgg tcttgctctc ctccttgccg tgaagtacat attctttgag
1141 caaacagaga cagaatcaac cttctctatg aagaacccga taatctcccc cgtagcggtg
1201 caaaagaagc aaatagagag ttgctgcaga cgggaacctg aacaggaaaa aactgtccat
1261 gtttccacaa cagaagaagc ttcatcaaag gaggaaactg aggcagttat aaagccatta
1321 ccccttgaga catcgcccaa agccaagttt atagtgggag attcttctcc actggaattg
1381 tccccagagg ataaaaatac catgtttgat ctaccagagg aaccccgtcc acttgatgaa
1441 tgtgttcgta tactcaagaa tccagataaa ggcgcacagt atctgactga tgcagaagtc
1501 attagcctgg tgaatgccaa acacatccca gcctacaaac tggaaaccat gatggaatct
1561 ccccgagagg gcgtggccat tcgccggcaa atgttgtctg acaagctccc ccagcgatca
1621 gcactgcaga gcttgcctta taagaactac aactactctt tagtaatggg tgcttgttgt
1681 gagaatgtca tcgggtacat gcccatccct gtaggtgtag ctggacctct cttgctgaac
1741 aacaaagagt atcaggtgcc catggcaaca actgaaggat gtcttgtggc cagcacaaac
1801 agaggttgca gagccataat gctgggagga ggtgccaaaa gtcgtgttct ggctgatggg
1861 atgactcgtg gaccggttgt taggttacca actgcctgtg atgcagcaga agtaaaggcc
1921 tggctagaca gcgcagaggg attcaaagtc ataaaagatg catttgatag cacaagcaga
1981 tttgcacgac tcgggagact acaaaactgc gttgctggca ggaacctgta catccgtttc
2041 cagtcaaaaa caggagatgc catgggaatg aacatgattt cgaaggtcac tgaacaagct
2101 cttgctagac tacaagaaga gttccctgat ctccatgttt tggctgtgag cggcaactac
2161 tgtacggata agaagccagc tgctattaac tggatagagg gaagaggaaa gtcagtggtt
2221 tgtgaggcca tcattccagc caaagttgta cgggaggttc taaagagctc aacagaagcg
2281 ttagttgaag tgaatatcaa taagaatttt attggttctg caatggcggg aagcatcgga
2341 gggtacaatg cacatgctgc aaacattgtg actgccattt acattgcttg cggacaggat
2401 gctgctcaga atgttgggag ctccaattgc atcacaataa tggaggctac cggtcccacg
2461 tatgaagatt tgtacatcag ctgcacaatg ccctctatag aaattggaac tgtaggaggt
2521 ggaactaact tggctcccca gcaagcttgc ttacagatgc tgggagtgca aggagcaagc
2581 acagagacac cagggaaaaa cgcctgccag cttgctcaga tagtgtgtag cacagttatg
2641 gccggggaac tgtcactaat ggcggcttta gcagcgggtc acctcgtcaa gagtcacatg
2701 gtccacaaca ggtcaaaaat taaccttcag gacctgccag gcacgtgcac aaagaaagct
2761 gcctgattgg ccgtaggaag agccgaatct taaaatgaac ttctctgaga aacggtgctg
2821 tatctcatca caacacaagg tggcgccaca tctgcctgag aaatggaatt gcctcatcta
2881 catatggatt cccaggcccg agcttcacaa tcaccatgtt aatccagtct tttattgatg
2941 atggcatctt aaaaagaaac ttacataatt
//
LOCUS HUMPFKMIX 1139 bp ss-mRNA PRI 29-JUN-1990
DEFINITION Human muscle phosphofructokinase (PFKM) alternatively spliced mRNA,
partial cds.
ACCESSION J05533
KEYWORDS phosphofructokinase.
SOURCE Human (34 week old) placenta, cDNA to mRNA, clone HPFKM1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 62; 1110 to 1139)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
TITLE Alternative splicing of the transcript encoding the human muscle
isoenzyme of phosphofructokinase
JOURNAL J. Biol. Chem. 265, 9006-9010 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1139)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
JOURNAL Unpublished (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 1139 muscle phosphofructokinase (AA at 3) (EC
2.7.1.11) /nomgen="PFKM" /map="1cen-q32"
/hgml_locus_uid="LL0081L"
BASE COUNT 285 a 262 c 337 g 255 t
ORIGIN
1 gaaaaccaat cacctcagaa gacatcaaga atggcagcag gatgggtgtg gaagcagtga
61 tggcactttt ggaggggacc ccagataccc cagcctgtgt agtgagcctc tctggtaacc
121 aggctgtgcg cctgcccctc atggaatgtg tccaggtgac caaagatgtg accaaggcca
181 tggatgagaa gaaatttgac gaagccctga agctgagagg ccggagcttc atgaacaact
241 gggaggtgta caagcttcta gctcatgtca gacccccggt atctaagagt ggttcgcaca
301 cagtggctgt gatgaacgtg ggggctccgg ctgcaggcat gaatgctgct gttcgctcca
361 ctgtgaggat tggccttatc cagggcaacc gagtgctcgt tgtccatgat ggtttcgagg
421 gcctggccaa ggggcagata gaggaagctg gctggagcta tgttgggggc tggactggcc
481 aaggtggctc taaacttggg actaaaagga ctctacccaa gaagagcttt gaacagatca
541 gtgccaatat aactaagttt aacattcagg gccttgtcat cattgggggc tttgaggctt
601 acacaggggg cctggaactg atggagggca ggaagcagtt tgatgagctc tgcatcccat
661 ttgtggtcat tcctgctaca gtctccaaca atgtccctgg ctcagacttc agcgttgggg
721 ctgacacagc actcaatact atctgcacaa cctgtgaccg catcaagcag tcagcagctg
781 gcaccaagcg tcgggtgttt atcattgaga ctatgggtgg ctactgtggc tacctggcta
841 ccatggctgg actggcagct ggggccgatg ctgcctacat ttttgaggag cccttcacca
901 ttcgagacct gcaggcaaat gttgaacatc tggtgcaaaa gatgaaaaca actgtgaaaa
961 ggggcttggt gttaaggaat gaaaagtgca atgagaacta taccactgac ttcattttca
1021 acctgtactc tgaggagggg aagggcatct tcgacagcag gaagaatgtg cttggtcaca
1081 tgcagcaggg tgggagccca accccatttg ataggaattt tgccactaag atgggcgcc
//
LOCUS J05525 20 bp ds-DNA BAD 29-JUN-1990
DEFINITION Human muscle phosphofructokinase (PFKM).
ACCESSION J05525
REFERENCE 1 (bases 1 to 20)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
TITLE Alternative splicing of the transcript encoding the human muscle
isoenzyme of phosphofructokinase
JOURNAL J. Biol. Chem. 265, 9006-9010 (1990)
STANDARD simple staff_entry
COMMENT Intron/exon boundary sequence. McLachlan promises to send us full
exon sequence before it is published in August 1990 using the same
accession numbers.
BASE COUNT 4 a 8 c 3 g 5 t
ORIGIN
1 ctcatctcag atacctggcc
//
LOCUS J05526 20 bp ds-DNA BAD 29-JUN-1990
DEFINITION Human muscle phosphofructokinase (PFKM).
ACCESSION J05526
REFERENCE 1 (bases 1 to 20)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
TITLE Alternative splicing of the transcript encoding the human muscle
isoenzyme of phosphofructokinase
JOURNAL J. Biol. Chem. 265, 9006-9010 (1990)
STANDARD simple staff_entry
COMMENT Intron/exon boundary sequence. McLachlan promises to send us full
exon sequence before it is published in August 1990 using the same
accession numbers.
BASE COUNT 5 a 6 c 5 g 4 t
ORIGIN
1 actcagcgag gtacttgcac
//
LOCUS J05527 20 bp ds-DNA BAD 29-JUN-1990
DEFINITION Human muscle phosphofructokinase (PFKM).
ACCESSION J05527
REFERENCE 1 (bases 1 to 20)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
TITLE Alternative splicing of the transcript encoding the human muscle
isoenzyme of phosphofructokinase
JOURNAL J. Biol. Chem. 265, 9006-9010 (1990)
STANDARD simple staff_entry
COMMENT Intron/exon boundary sequence. McLachlan promises to send us full
exon sequence before it is published in August 1990 using the same
accession numbers.
BASE COUNT 7 a 8 c 3 g 2 t
ORIGIN
1 tctcaaccag acaaggaccc
//
LOCUS J05528 20 bp ds-DNA BAD 29-JUN-1990
DEFINITION Human muscle phosphofructokinase (PFKM).
ACCESSION J05528
REFERENCE 1 (bases 1 to 20)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
TITLE Alternative splicing of the transcript encoding the human muscle
isoenzyme of phosphofructokinase
JOURNAL J. Biol. Chem. 265, 9006-9010 (1990)
STANDARD simple staff_entry
COMMENT Intron/exon boundary sequence. McLachlan promises to send us full
exon sequence before it is published in August 1990 using the same
accession numbers.
BASE COUNT 7 a 3 c 4 g 6 t
ORIGIN
1 catcaagaat gttcgtatga
//
LOCUS J05529 20 bp ds-DNA BAD 29-JUN-1990
DEFINITION Human muscle phosphofructokinase (PFKM).
ACCESSION J05529
REFERENCE 1 (bases 1 to 20)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
TITLE Alternative splicing of the transcript encoding the human muscle
isoenzyme of phosphofructokinase
JOURNAL J. Biol. Chem. 265, 9006-9010 (1990)
STANDARD simple staff_entry
COMMENT Intron/exon boundary sequence. McLachlan promises to send us full
exon sequence before it is published in August 1990 using the same
accession numbers.
BASE COUNT 2 a 5 c 6 g 7 t
ORIGIN
1 gtcccttcag ctggtggtta
//
LOCUS J05530 20 bp ds-DNA BAD 29-JUN-1990
DEFINITION Human muscle phosphofructokinase (PFKM).
ACCESSION J05530
REFERENCE 1 (bases 1 to 20)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
TITLE Alternative splicing of the transcript encoding the human muscle
isoenzyme of phosphofructokinase
JOURNAL J. Biol. Chem. 265, 9006-9010 (1990)
STANDARD simple staff_entry
COMMENT Intron/exon boundary sequence. McLachlan promises to send us full
exon sequence before it is published in August 1990 using the same
accession numbers.
BASE COUNT 6 a 4 c 5 g 5 t
ORIGIN
1 cagaattctg gtaagtcacg
//
LOCUS J05531 20 bp ds-DNA BAD 29-JUN-1990
DEFINITION Human muscle phosphofructokinase (PFKM).
ACCESSION J05531
REFERENCE 1 (bases 1 to 20)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
TITLE Alternative splicing of the transcript encoding the human muscle
isoenzyme of phosphofructokinase
JOURNAL J. Biol. Chem. 265, 9006-9010 (1990)
STANDARD simple staff_entry
COMMENT Intron/exon boundary sequence. McLachlan promises to send us full
exon sequence before it is published in August 1990 using the same
accession numbers.
BASE COUNT 4 a 6 c 8 g 2 t
ORIGIN
1 gctcctgcag ggcagcagga
//
LOCUS J05532 20 bp ds-DNA BAD 29-JUN-1990
DEFINITION Human muscle phosphofructokinase (PFKM).
ACCESSION J05532
REFERENCE 1 (bases 1 to 20)
AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
TITLE Alternative splicing of the transcript encoding the human muscle
isoenzyme of phosphofructokinase
JOURNAL J. Biol. Chem. 265, 9006-9010 (1990)
STANDARD simple staff_entry
COMMENT Intron/exon boundary sequence. McLachlan promises to send us full
exon sequence before it is published in August 1990 using the same
accession numbers.
BASE COUNT 4 a 3 c 7 g 6 t
ORIGIN
1 tgtccaggtg gtaagtactg
//GenBank-Updates@genbank.bio.net (06/30/90)
LOCUS DROGNBPSA1 441 bp ds-DNA INV 30-JUN-1990
DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha
subunit gene, exon 1.
ACCESSION M33996
KEYWORDS guanine nucleotide-binding protein.
SEGMENT 1 of 3
SOURCE D.melanogaster (strain Canton S) embryo DNA, clones
lambda-DGs-[4,5,7,10,11,12].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 441)
AUTHORS Quan,F. and Forte,M.A.
TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by
alternate splicing involving an unusual splice site
JOURNAL Mol. Cell. Biol. 10, 910-917 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pre-msg 362 > 441 guanine nucleotide-binding protein (Gs-alpha)
mRNA and introns (alt.)
pre-msg 174 > 441 Gs-alpha mRNA and introns (alt.)
IVS 426 > 441 intron A
signal 152 157 promoter region (put.)
variant 365 365 c in DNA, g in cDNA
BASE COUNT 140 a 89 c 85 g 127 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcgcta ttctacatag ctcgggatac tttattttat tattaatatt cgctaaaata
61 gcacatcgtc atttggcata gatgtttttg gcccagtaac atatctatct acatcaccgc
121 ccgcactatc cgataggcag actgactatc gataaaagta tctacttcgt tgcatcccta
181 ggttgataca taaaataaaa aatatataaa aagtgagtta gtactggtat taaaagtaat
241 ttttaagcta caaattgatg aattgcgagc acacagagac taagaacatt gggaatccgg
301 aaaacagtgg ctggcgcgtc gggtgcgata gaattgcctg tttttcagac cccaaaagtc
361 gctcccaacc gagtcgtgca cctatagtct tggcccgaag ttgctcccat gaataaaaat
421 gttcggtaat ttcacggtaa a
//
LOCUS DROGNBPSA2 897 bp ds-DNA INV 30-JUN-1990
DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha
subunit gene, exons 2,3 and 4.
ACCESSION M33997
KEYWORDS guanine nucleotide-binding protein.
SEGMENT 2 of 3
SOURCE D.melanogaster (strain Canton S) embryo DNA, clones
lambda-DGs-[4,5,7,10,11,12].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 897)
AUTHORS Quan,F. and Forte,M.A.
TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by
alternate splicing involving an unusual splice site
JOURNAL Mol. Cell. Biol. 10, 910-917 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 265 540 guanine nucleotide-binding protein
(Gs-alpha-S), exon 2
607 703 guanine nucleotide-binding protein
(Gs-alpha-S), exon 3
760 + 880 guanine nucleotide-binding protein
(Gs-alpha-S), exon 4
pep$ 265 540 guanine nucleotide-binding protein
(Gs-alpha-L), exon 2
607 703 guanine nucleotide-binding protein
(Gs-alpha-L), exon 3
760 + 880 guanine nucleotide-binding protein
(Gs-alpha-L), exon 4
pre-msg < 1 > 897 Gs-alpha mRNA and introns
IVS < 1 29 intron A
IVS 541 606 intron B
IVS 704 759 intron C
IVS 881 > 897 intron D
BASE COUNT 266 a 225 c 226 g 180 t
ORIGIN About 1.4 kbp downstream of segment 1.
1 ccatctacta taccattgta cgaatgcagc ctgctgcaga gatccactga agcgctgagc
61 caacgccaac ggtggtgcca ctgagatcgg agatcggaga tcgcagaccg gagaccgacg
121 agcacacaag ggattgtggt ttgatccgag gaacccaagc actcagctat actggagcag
181 aacggagaaa gcacaagggg tgccgagtcg cccgacgcga accaggaaaa ccgagcgacg
241 tgttggtgga ccccgtgggc tgcgatgggt tgctttgggt cgcccacctc caagcagtcg
301 gacgtgaact cggaggactc gaagagccag aagcgccgga gcgatgcaat atctagacag
361 ttgcagaagg acaaacagct ctacagggcc acacacaggc tgctcctcct gggggcgggc
421 gagtccggca aatcaaccat agtcaagcaa atgcgaatat tgcatgtcga cggattttct
481 gactcggaaa agaaacagaa aattgatgat attaaaaaga atattcgaga cgctatcttg
541 gtgagttcca tcccgccggc catctaagcc caaacacttg cttaactcct tgccaatccc
601 ttacagacta ttacaggagc catgagcaca cttaatccac ctgtagcttt agaaaagaag
661 gaaaatgaac ccagagtgga gtacattcag gattatgcat ctagtaagag cctctgccgc
721 cccaacttaa gttgttatca accattatct gccccgcagg tccggacttt aattatcctc
781 ctgaatttta tgaacataca gaagaactat ggaaagacaa gggcgttctt caaacctatg
841 agaggtcgaa tgagtatcaa ttaatcgatt gtgcgaaata gtaagtaatg cccaccc
//
LOCUS DROGNBPSA3 1380 bp ds-DNA INV 30-JUN-1990
DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha
subunit gene, exons 5,6,7,8 and 9.
ACCESSION M33998
KEYWORDS guanine nucleotide-binding protein.
SEGMENT 3 of 3
SOURCE D.melanogaster (strain Canton S) embryo DNA, clones
lambda-DGs-[4,5,7,10,11,12].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1380)
AUTHORS Quan,F. and Forte,M.A.
TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by
alternate splicing involving an unusual splice site
JOURNAL Mol. Cell. Biol. 10, 910-917 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 4 132 guanine nucleotide-binding protein
(Gs-alpha-S), exon 5
206 385 guanine nucleotide-binding protein
(Gs-alpha-S), exon 6
445 569 guanine nucleotide-binding protein
(Gs-alpha-S), exon 7
746 828 guanine nucleotide-binding protein
(Gs-alpha-S), exon 8
909 1055 guanine nucleotide-binding protein
(Gs-alpha-S), exon 9
pep$ + 4 132 guanine nucleotide-binding protein
(Gs-alpha-L), exon 5
206 385 guanine nucleotide-binding protein
(Gs-alpha-L), exon 6
445 569 guanine nucleotide-binding protein
(Gs-alpha-L), exon 7
755 828 guanine nucleotide-binding protein
(Gs-alpha-L), exon 8
909 1055 guanine nucleotide-binding protein
(Gs-alpha-L), exon 9
pre-msg < 1 1115 Gs-alpha mRNA and introns (alt.)
pre-msg < 1 1176 Gs-alpha mRNA and introns (alt.)
pre-msg < 1 1243 Gs-alpha mRNA and introns (alt.)
IVS < 1 3 intron D
IVS 133 205 intron E
IVS 386 444 intron F
IVS 570 745 intron G (for Gs-alpha-S) (no splice
consensus)
IVS 570 754 intron G' (for Gs-alpha-L)
IVS 829 908 intron H
variant 780 780 t in DNA, c in cDNA
signal 1083 1088 polyA signal
signal 1154 1159 polyA signal
signal 1221 1226 polyA signal
BASE COUNT 441 a 265 c 254 g 420 t
ORIGIN About 500 bp downstream of segment 2.
1 aagcttcctg gaccgagtga gcacaatcaa gaatccaaac tacaccccta atgagcagga
61 tattcttcgg tgccgtgttt tgacttctgg aatatttgaa acaagatttc aagtggacaa
121 agtaaacttt cagtaagtgt aatcccaagt gatatcgcaa tattttagtt tctattccat
181 tctaatgata tgcattcgtg tgcagcatgt tcgatgtcgg tggccagcgg gacgagcgta
241 ggaaatggat tcagtgtttc aatgatgtaa ctgctatcat attcgtaact gcgtgctcaa
301 gttataacat ggttttgcgg gaagatccca cccagaaccg acttcgagaa tctttggatt
361 tgttcaagag tatttggaac aacaggtgaa ccgagcaatt tggcgccatc tcgacaaatg
421 gaaactaacg agttgttctt tcagatggct tcgcacgatt tctattatac tatttttaaa
481 taagcaagat ttgttagcag agaaaattaa ggctggaaaa agtaaattgt cggaatattt
541 ctccgagttt aacaaatacc aaacgccaag taagtaaaat agtataaccc atagtagagg
601 ccaattgagg gcgagggagt catcagatca accgactcag atctggccaa gaaattcgca
661 attccttttc aaatttgcaa accgtgccca agttaactta agaaatttaa tggataatgt
721 attttgtcta atttcaaatg tgctgtcgac acaggtgacg caataatgga atccaatgat
781 gacccagaag taatacgagc aaaatatttc atacgagacg agtttctggt gggttatatc
841 acatctccgt ttcttcctcc acctatctaa ctagccttaa tccttaatgg gatctctttc
901 ttatatagcg tatatctacc gctagcggag acggaaaaca ctactgctat ccacatttca
961 catgcgccgt tgacacagaa aacattaaac gtgtgtttaa tgattgcaga gacattattc
1021 aaaggatgca ccttcgtcaa tatgaattgt tataggttat ccccatcgcc gtaatgcaag
1081 taaataaaaa tattaatgac ttactgtttt aaatataatt atcaaatgta atttgtattt
1141 aaatagtccc ttgattaaaa caaaaaattc gctgcattta gatctggtat gacctttcac
1201 aaaactcaac attaaagaat aataaataaa catgttaata acagacacgc ttgtttcgat
1261 tattttggct attctaccag gatgcacctg ctttattttc tatagtccct tgacatgaag
1321 cattttagag tatttccgct atcagagtcc gtatccatat ccatgggcat ttacattcac
//
LOCUS SYCSBEFA 850 bp ds-DNA BCT 30-JUN-1990
DEFINITION Synechocystis sp. cytochrome B559 alpha subunit (psbE) and beta
subunit (psbF), and photosystem II L protein genes, complete cds.
ACCESSION M33897 X06988
KEYWORDS cytochrome B559 alpha subunit; cytochrome B559 beta subunit;
photosystem II protein L.
SOURCE Synechocystis sp. (strain PCC6803) DNA, clone pKW1261.
ORGANISM Synechocystis sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Chroococcales.
REFERENCE 1 (bases 1 to 553)
AUTHORS Pakrasi,H.B., Williams,J.G.K. and Arntzen,C.J.
TITLE Targeted mutagenesis of the psbE and psbF genes blocks
photosynthetic electron transport: Evidence for a functional role
of cytochrome B559 in photosystem II
JOURNAL EMBO J. 7, 325-332 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 850)
AUTHORS Pakrasi,H.B., Nyhus,K.J. and Granok,H.
TITLE Targeted deletion mutagenesis of the beta subunit of Cytochrome
B559 protein destabilizes the reaction center of photosystem II
JOURNAL Z. Naturforsch., C, Biosci. 45, 423-429 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.B.Pakrasi, 01-MAY-1990.
Washington University Dept. of Biology
Campus Box 1137
St. Louis, MO 63130
FEATURES from to/span description
pept 121 366 cytochrome B559 alpha-subunit (psbE)
pept 403 537 cytochrome B559 beta subunit (psbF)
pept 547 666 L protein of photosystem II (PS2)
binding 109 112 ribosomal binding site (put.)
binding 392 396 ribosomal binding site (put.)
BASE COUNT 197 a 196 c 197 g 260 t
ORIGIN
1 gctaccccag caccgaggtg gcaatggttc cctaggcggc tcacaaaata gtagactaga
61 ctctacttgc tttgcatttg tcagtcaatg ttgttttgaa aaattgaagg agaacacaaa
121 atgtcaggga ctaccggcga gcgtccattt tccgatattg tcaccagcat tcgctactgg
181 gtgatccaca gcatcaccat cccgatgttg tttattgctg gttggttgtt tgtcagcacg
241 ggcttagcct acgatgcttt tggcactccc cgccccgatg aatatttcac ccagacccgt
301 caagagttgc ccattctcca ggaacgctac gacattaatc aggaaattca agagtttaat
361 caataaaaca tttaattgtt cttttttagt tggtaattaa caatggcaac ccaaaatcct
421 aatcaaccgg ttacttatcc catttttacg gtgcgctggc tggcggttca caccctggcg
481 gtgccctctg tcttctttgt cggggcgatc gccgcgatgc aatttattca acgctaggag
541 tttttcatgg acagaaattc aaacccaaac cgccaaccgg tggaattgaa ccgcacttct
601 ttatacctgg gtctattgtt ggtggctgtg ttggggattt tgttctccag ctatttcttt
661 aactaaactt ttttaatacg caatttagga ggcatggtat gttcgcagaa ggcagaatcc
721 ctttgtgggt ggtgggtgta gtggccggta ttggcgccat tggtgttcta ggattatttt
781 tctacggagc ctatgctggt ttaggttctt ccatgtaatc gagggctagc cgccacacaa
841 tatcatggtt
//
LOCUS MUSBFGFR 2526 bp ss-mRNA ROD 30-JUN-1990
DEFINITION Mouse basic fibroblast growth factor receptor (bFGF-R) mRNA,
complete cds.
ACCESSION M28998
KEYWORDS fibroblast growth factor receptor; transmembrane protein;
tyrosine kinase.
SOURCE Mouse (strain CBA/CaH), cDNA to mRNA, clone NE2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2526)
AUTHORS Reid,H.H., Wilks,A.F. and Bernard,O.
TITLE Two forms of basic fibroblast growth factor receptor-like mRNA are
expressed in the developing mouse brain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 1596-1600 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.Bernard, 11-OCT-1989.
FEATURES from to/span description
pept 58 2526 fibroblast growth factor-receptor precursor
sigp 58 117 fibroblast growth factor-receptor signal
peptide (put.)
matp 118 2523 fibroblast growth factor-receptor
BASE COUNT 579 a 710 c 708 g 529 t
ORIGIN
1 gtggaatatc catggaggta cggagccttg ttaccaacct ctaaccgcag aactgggatg
61 tggggctgga agtgcctcct cttctgggct gtgctggtca cagccactct ctgcactgcc
121 aggccagccc caaccttgcc cgaacaagct cagccctggg gagtccctgt ggaagtggag
181 tctctcctgg tccaccctgg cgacctgcta cagcttcgct gtcggcttcg cgatgatgtg
241 cagagcatca actggctgcg ggatggggtg cagctggtgg agagcaaccg tacccgcatc
301 acaggggagg aggtggaggt gcgggactcc atccccgctg actctggcct ctacgcttgc
361 gtgaccagca gcccctctgg cagcgatacc acctacttct ccgtcaatgt ctcagatgca
421 ctcccatcct cggaagatga tgacgacgac gatgactcct cctcggagga gaaagagacg
481 gacaacacca aaccaaaccg taggcctgta gctccctact ggacatcccc agagaaaatg
541 gagaagaaac tgcatgcggt gcccgctgcc aagacggtga agttcaagtg cccgtcgagt
601 gggacaccca accccactct gcgctggttg aaaaatggca aagagtttaa gcctgaccac
661 cgaattggag gctacaaggt tcgctatgcc acctggagca tcataatgga ttctgtggtg
721 ccttctgaca agggcaacta cacctgcatc gtggagaatg agtatgggag catcaaccac
781 acctaccagc ttgacgtcgt ggaacgatct ccgcaccgac cccatccttc agcagggctg
841 cctgccaaca agacagtggc cctgggcagc aatgtggagt tcatgtgtaa ggtgtacagc
901 gatccgcagc ctcacattca gtggctgaag cacatcgagg tgaacgggag taagatcggg
961 ccagacaact tgccgtatgt ccagatcctg aagactgctg gagttaatac caccgacaag
1021 gaaatggagg tgcttcatct acggaatgtc tcctttgagg atgcggggga gtatacgtgc
1081 ttggcgggta actctatcgg actctcccat cactctgcat ggttgaccgt tctggaagcc
1141 ctggaagaga gaccagctgt gatgacctca ccgctctacc tggagatcat tatctactgc
1201 accggggcct tcctgatctc ctgcatgttg ggctctgtca tcatctataa gatgaagagc
1261 ggcaccaaga agagcgactt ccatagccag atggctgtgc acaagctggc caagagcatc
1321 cctctgcgca gacaggtaac agtgtcagct gactccagtg catccatgaa ctctggggtt
1381 ctcctggttc ggccctcacg gctctcctcc agcgggaccc ccatgctggc tggagtctcc
1441 gaatatgagc tccctgagga tccccgctgg gagctgccac gagacagact ggtcttaggc
1501 aaaccacttg gcgagggctg cttcgggcag gtggtgttgg ctgaggccat cgggctggat
1561 aaggacaaac ccaaccgtgt gaccaaagtg gccgtgaaga tgttgaagtc cgacgcaacg
1621 gagaaggacc tgtcggatct gatctcggag atggagatga tgaaaatgat tgggaagcac
1681 aagaatatca tcaaccttct gggagcgtgc acacaggatg gtcctcttta tgtcattgtg
1741 gagtacgcct ccaaaggcaa tctccgggag tatctacagg cccggaggcc tcctgggctg
1801 gagtactgct ataaccccag ccacaacccc gaggaacagc tgtcttccaa agatctggta
1861 tcctgtgcct atcaggtggc tcggggcatg gagtatcttg cctctaagaa gtgtatacac
1921 cgagacctgg ctgctaggaa cgtcctggtg accgaggata acgtaatgaa gatcgcagac
1981 tttggcttag ctcgagacat tcatcatatc gactactaca agaaaaccac caacggccgg
2041 ctgcctgtga agtggatggc ccctgaggcg ttgtttgacc ggatctacac acaccagagc
2101 gatgtgtggt cttttggagt gctcttgtgg gagatcttca ctctgggtgg ctccccatac
2161 cccggtgtgc ctgtggagga acttttcaag ctgctgaagg agggtcatcg aatggacaag
2221 cccagtaact gtaccaatga gctgtacatg atgatgcggg actgctggca tgcagtgccc
2281 tctcagagac ctacgttcaa gcagttggtg gaagacctgg accacattgt ggccttgacc
2341 tccaaccagg agtatctgga cctgtccata ccgctggacc agtactcacc cagctttccc
2401 gacacacgga gctccacctg ctcctcaggg gaggactctg tcttctctca tgagccgtta
2461 cctgaggagc cctgtctgcc tcgacacccc acccagcttg ccaacagtgg actcaaacgg
2521 cgctga
//
LOCUS BSSBIODAYB 4382 bp ds-DNA BCT 30-JUN-1990
DEFINITION B.sphaericus bioDAYB operon genes, complete cds.
ACCESSION M29292
KEYWORDS adenosylmethionine-8-amino-7-oxononanoate aminotransferase; bioB;
biotin synthase; dethiobiotin synthase.
SOURCE B.sphaericus (strain IF03525) DNA.
ORGANISM Bacillus sphaericus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 456 to 511)
AUTHORS Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S.,
Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U.
TITLE Cloning and characterization of the Bacillus sphaericus genes
controlling the bioconversion of pimelate into dethiobiotin
JOURNAL Gene 87, 63-70 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 4382)
AUTHORS Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S.,
Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Goleckler, 20-OCT-1989, for release after publication.
FEATURES from to/span description
pept 509 1213 dethiobiotin synthase (bioD) (ttg start codon)
pept 1194 2561 adenosylmethionine-8-amino-7-oxononanoate
aminotransferase (bioA) (gtg start codon)
pept 2566 3156 biotin synthase (bioY)
pept 3156 4154 bioB protein (gtg start codon)
mRNA < 456 4364 bioDAYB mRNA
binding 491 503 ribosome binding site
binding 1179 1186 ribosome binding site
binding 2545 2556 ribosome binding site
binding 4317 4364 ribosome binding site
BASE COUNT 1376 a 746 c 952 g 1308 t
ORIGIN
1 aagctttgca cacttctgtt tcgtatcctc atattgaact tgatgaaacc ttcctatggc
61 cgtatgcatt gagatttttt ctcgatgttc tgcttgcaat gttcgatatt cttcttgccg
121 aatagctaca cgataccaaa attcataacg caacggtaaa tctcttattt cgtaagtaag
181 caaagtattt aaaatactgc tcatttgttc atatgtatct agctttttat ctgtctcctt
241 aaatagtcca aacattttgc caccccctgt tttgattaat actacaacct atgataaaaa
301 ccctttaata tttcttggga aataatccaa cgttgataaa acggggtgaa tatccgatca
361 atcgagtgaa atttaggata gaataccctc ggaaaaagca ttatctgaat catttatgta
421 aaaatgcaaa aaaaggcatt tacaaaagga aaaagaatgt gttaacttaa aaactatagt
481 tggttaacta aaagaggggg aggtacagtt gcaacacttt tgggttgttg gaacagatac
541 agatgttgga aaaacatttg tcaccacatt attaatgcgt aatttgcaaa aacagggcgt
601 acgtgtaacg ccttataaac cagtccaaac tggtgaagtg tatgatggtg aacaagccta
661 ttacttcgac acagcgatgt atgaaaaata ttccttgcaa ttgctagaca gagagaattt
721 aaatggctat tcatttaaag aggctgcatc gccacatttt gcggctcaac tggaggggca
781 gcaaattgac acacagcagt tattaaagca aatgcaactt ttacagcaaa catgggatgt
841 tgttatttgt gaaggagcgg gtgggctctt tgtgccatta gatgcatgtg gcgaaacgac
901 attgttggat gtcattgttg aaagtaaact acccgttgtc gtggttacac gaacagcact
961 aggaacaatt aaccatacgc tcttaacgtt agaggcattg actacacgga aaattgaagt
1021 gcttggtctt gtatttaacg gtgatatggg gagcaggatg gagcaagaca atatccaaac
1081 gattttacag tattatacat tgccctatat gacgatacca aagctggaag agctgtcgga
1141 cattaatgag tatgcaatta cgggcacatc attgtttgaa aggctgatta gacgtgaaac
1201 aagtattaac tgagctacaa gaaaaagatt tacaacatgt ctggcatcct tgctcacaaa
1261 tgaaagatta tgaggctttt ccaccaatcg ttataaaaaa aggcgaaggt gtatggctgt
1321 atgatgaaca gaatcaacgc tatcttgatg cggtatcttc atggtgggtc aatttatttg
1381 gacatgccaa tccacgtatt agccaagcat taagtgaaca agcatttacg ttggagcata
1441 caatttttgc gaatttttca catgagccag cgattaaact cgcacaaaaa ttagtagctt
1501 taacaccaca aagtttacaa aaagtatttt ttgcagataa tggttcatct gctatagaag
1561 tcgctttaaa aatgagtttt caatatcata tgcaaacggg gaaaacgcaa aaaaaacgct
1621 ttttggcatt aacggatgcc taccatggtg aaacattagg tgctttatcc gtcggtggcg
1681 tagatcttta taacgaagtg tatcaaccac tgttattgga tacggtacga gcacaaggcc
1741 cagattgttt ccgttgccca ttcaagcatc atccggatag ttgccatgcc caatgtatta
1801 gttttgtaga ggatcagttg cgcatgcatc ataaggaaat tacggcggtt attattgagc
1861 cactcattca agcggcagca gggatgaaaa tgtatccagc tatttatttg cgacgtttac
1921 gtgaactatg tacgcaatat gatgtgcatc taattgcaga cgaaattgct gtaggttttg
1981 ggcgcacagg tacacttttt gcctgtgagc aggctaatat ctctccggat tttatgtgtt
2041 tatcaaaagg tttaacaggt gggtatttac cactgtctgt cgtaatgacg acgaatgatg
2101 tatatcaggc attttatgat gattatgcca cgatgaaggc gtttttacat tcacatagtt
2161 acacagggaa tacacttgcc tgccgtgttg ctctagaggt attggcgata tttgaagaag
2221 aacagtatat agacgttgtg caagacaaag gtgaacgcat gcgaaagcta gccttggagg
2281 cttttagtga tttacctttt gttggtgaat atcggcaagt tgggtttgtc ggggcgattg
2341 aacttgtggc gaatcgcgat accaaagagc cattaccgag tgaggagcgc atcggctatc
2401 aaatatacaa aagagcttta gcaaaagggt tactgattcg tccacttggg aatgttttgt
2461 atttcatgcc accatacatt ataacggacg atgaaatgca atttatgatt caaacaacaa
2521 aagatacaat tgttcaattt tttgaagagc gggagggatg agggcatgtt gaaacaacag
2581 tcaacgttat cacttgtgat gattgcgatg tttgctgcat taacagcagt tggtgccttc
2641 attaaaattc cattaccgct cgtgccgttt acattacaaa ttgtctttgt ctttttagcg
2701 ggttgcttac tcggtggtcg caatggattt caaagtcagc tagtttacat aggaataggt
2761 ttagttggct tgccagtttt tacacaaggt ggaggcatta catatgtatt gcagccgact
2821 tttggttact taataggatt tgctcttgct gcattagtaa tcggctatat gattgatcga
2881 gtagaatcac caacgaaaaa gcatttcatt gttgccaata ttatagggct tatcattatt
2941 tatgcagtcg cagtacctta tttatatgta gcattaaatg tatggttaaa catgaaatca
3001 agttggtctc atgtattttt agtaggcttt gtcaatagta ttgttgcaga cttttgctta
3061 gcaattgctt ctgccctttt agctgaacgt ctatacaaag tattccgttc cgctagagct
3121 ataaaacttg tgcaaattga aaaggagaat gtttagtgaa ttggttacaa ttagcagatg
3181 aagtgattgc aggcaaggta attagcgatg atgaggcact tgccatttta aatagtgatg
3241 atgatgatat tttaaagcta atggacggcg catttgccat tcgtaagcac tattacggta
3301 aaaaagtaaa gttaaatatg attatgaatg ctaaaagtgg ctattgccca gaggattgtg
3361 gctattgctc gcagtcatct aaatcgaccg ctcctattga gaaatatccg ttcattacaa
3421 aagaagaaat attagcgggg gcaaagcgtg cgtttgaaaa taaaattggt acgtattgca
3481 tcgtcgcaag cggacgtggg ccgactcgta aagatgtcaa tgtagtgagt gaagccgttg
3541 aagaaattaa agcaaaatat ggcttaaaag tttgcgcttg cttaggttta ctaaaagaag
3601 aacaagcaca acaattaaaa gaagcgggtg ttgatcgcta caatcataac ttaaatacat
3661 cagagcgtca ccattcctat attacgacga cgcacacata tgaggatcgt gttaataccg
3721 ttgaggttgt aaagaaacat ggtatttccc catgttctgg agccattatt gggatgaaag
3781 aaacgaaaat ggatgtcgtg gaaattgcac gcgcattgca tcagttggac gcggattcaa
3841 ttccagttaa cttcttacat gcaattgatg gaacgaaact tgaaggaaca caggacttaa
3901 atcctcgcta ttgcttaaaa gtattagcgt tattccgcta catgaatcct tcgaaggaaa
3961 ttagaatttc cggtggtcgc gaagtcaatt taggattcct tcagccattt ggactgtatg
4021 cagcaaatag tatttttgtt ggggattact taactactga aggacaagaa gccaatagcg
4081 attatcgtat gcttgaagat ttgggctttg aaatcgagct gacacaaaag caagaagaag
4141 cattttgttc ttaattcaac caatcattat gaaataaaat ctactactac acaatatgat
4201 tacctcaaaa ccgtgtgagc gtcgtggaaa aggcgcacag acggtttttt ggtcgataaa
4261 agagaaggag aaaggtaaat aaatggttcc gataatatac ctataaaatg atggttttca
4321 caaaatgttc aatgaaagcg ttttgaaatt gaacagtttg tgaagggctt cacataaagc
4381 tt
//
LOCUS BSSBIOXWF 4530 bp ds-DNA BCT 30-JUN-1990
DEFINITION B.sphaericus bioXWF operon genes, complete cds.
ACCESSION M29291
KEYWORDS 7-keto-8-aminopelargonic acid synthetase; bioF; bioW; bioX.
SOURCE B.sphaericus (strain IF03525) DNA.
ORGANISM Bacillus sphaericus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1081 to 1151)
AUTHORS Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S.,
Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U.
TITLE Cloning and characterization of the Bacillus sphaericus genes
controlling the bioconversion of pimelate into dethiobiotin
JOURNAL Gene 87, 63-70 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 4530)
AUTHORS Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S.,
Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Goleckler, 20-OCT-1989, for release after publication.
FEATURES from to/span description
pept 1149 1649 bioX protein
pept 1630 2367 bioW protein
pept 2364 3533 7-keto-8-aminopelargonic acid synthetase
(bioF)
mRNA < 1081 3560 bioXWF mRNA
binding 1132 1140 ribosome binding site
binding 1612 1621 ribosome binding site
binding 2350 2355 ribosome binding site
BASE COUNT 1426 a 768 c 958 g 1378 t
ORIGIN
1 aagctttact acaaccagga ggatgtagtt taggcgctgt tttaccatct aaaacaagtg
61 atggtaaaac ttatgtcctt agaaattacg atttgtcacc agctatatct gatatgagac
121 tttgtacaac taaggtaaaa gggaaatata gtcatactgg attctctgtt tcgtactttg
181 gtcgaagtga aggcttaaat gaagagggat tctgtgtagc ttttgcctca tgcggtattc
241 cagttggaaa acatccagga atgaaaaagc ctatcctaaa aggacttcag tttatggtaa
301 tagtaagagc tttacttgaa aattgtaaag atgttgagga aggaattact tatttagaga
361 atatgcctat tggaacaaat atgaatttgc tactgtcaga tgccaaaggg aatgtagctt
421 tagtagaaac gtatgatgga gaaaagttcg tagaaagggg caatcaaaaa tcaggatttt
481 tgattgcaac taatcacgct gtaatgccca aaataatgaa gttagaagac agaaaattag
541 aacaatctga aatacgttat aattttttga aaaataactt agagagtgat gattttttta
601 caaagaataa attacaacaa ttaatgttta atgaataccc taatggtgta acagtgcaca
661 atttcgaaga aaatttctga actgttcatt caattctatt taatcttaac gacaaacaat
721 tagatatttc tttcggttcc cctataaata ataaaatcta taaattaaaa gtaggagaaa
781 ccttgccttt taatgaatta gaagttttca tagagaactc aaactatgga ccaaacttct
841 ggaagctaat caaataataa tatggccaaa gaaacacttt cagctgcatg atgagccgag
901 agtgtttttt tatcattcct ctattcaata ataaatcgct tatttcacct tcttggaggt
961 ttgaaatcct cttgtttcat aggtttctgg cgttttcctt tttgacgaag cactgttaaa
1021 gctctatcct tttttgaaaa aaatatagaa aaacaatttt aatctacctt cctatctata
1081 aatgtgttaa cttaattatt attaaggtta actcaaattg aagaagttag aatgggagga
1141 ataggaggat gcgaaagttt tctacatatg atcttgctca gatttcatta ctagcttgtc
1201 ttattatcgt tacaggcatg tttaagattc caacaggtat tcctggatct gagtttcaat
1261 tatcagcacc gattgccgtt gcgattgcag cagtatttgg atttaagcga tattttcttg
1321 cgggaatcat tgcaagtcta atcttatttt tactaggtat acactccatc ttaaatgttg
1381 aaatttcaat aattttccga ttgactgttg gtctaatcat tgttttatta ggaacttcaa
1441 ttccggtact agttgtggca ggaccgattg gaacaatggt tgctagactt ggattggctt
1501 ttacgttagg gaccccgttt ttgccactat tcgttttggc gattccaggg atggtcatta
1561 cggctgtcag tgtttatcca ataacgaaaa tgttatatgc aattaataag aaagtagcag
1621 gtgatcatca tgttagaaac gtgttatagc attcgaatgc gtgcagctga aaaaaatctc
1681 gaaggaggag aaaagcatat atctggtggg gaacggatag ggagtgaatt tcaaatagag
1741 ccaattgtaa aacagttatt gaacaaagca aggaatcatt cgcgcggaga tgctgacttt
1801 attcaaatta ccgttgaaaa acttacaggt gatcagatac tgtatatgcc accgttagaa
1861 ataacgacaa ttgatgagag ttcaattgaa agggcacata aagaagctag gagtatatta
1921 acctcagtag gtgtttccaa gcaggcacaa aatgttgctt ttcatctact tgctagtaat
1981 caaaatcttc gtggggctat cctccttcat agtcaaactg gcttacgact tgacaatcgc
2041 ggactgaaag gcgttcgtgt atcacgaatc gattggcaag acgctgatgt aggttacaat
2101 gagcgtgttc gtgaagcgct agctctggca acgaaagtgg caaattctcc gtataccatc
2161 gcagaattat gttggtcaga tgatccagaa tacgttactg gctatgtaag caatcatgag
2221 attggttatg tcagaattac gcctttaaaa agggaaggct gtgaaagtgg cggacgtatt
2281 ttttttgtgt cagatgaagt tgagctagaa tcatatatac actatttaga aagagaacct
2341 attctcatta gggggcattt aaaatgaatg atcgctttcg aagggaactg caagtaatag
2401 aagagcaagg attgacaagg aagttacgtt tgttttcaac tggaaatgaa agtgaggtag
2461 tgatgaatgg taagaaattt ttgctatttt catcgaataa ctacttaggc cttgcaacag
2521 atagtcgttt gaaaaagaaa gcaactgaag gcattagtaa atacggtaca ggggctggcg
2581 gttctcgact tacaactgga aacttcgaca ttcatgaaca gctagaatct gaaattgcag
2641 attttaaaaa gactgaagcg gccattgtat tcagcagtgg gtatttagcg aacgtaggtg
2701 tgatttcgag cgtgatgaag gcaggagata ctatcttttc tgatgcttgg aatcacgcga
2761 gtattataga tggttgtcga ttaagtaaag ccaaaacgat tgtttatgaa catgcggata
2821 tggtggattt agagcggaaa ttaaggcaat cacatgggga tggattgaag ttcatcgtaa
2881 cggatggcgt ttttagtatg gatggtgata ttgcgccact tccaaaaata gtagagttag
2941 ccaaggaata caaagcgtac ataatgattg atgatgcgca tgcaacaggt gttcttggca
3001 atgatggttg tggtaccgct gattattttg gtttgaaaga tgagattgat tttacagtag
3061 gcacgttgag taaagcgatt ggtgcagagg gtggatttgt atcgacatca tccattgcta
3121 agaactattt gttaaataac gcccgatctt ttattttcca aacagcttta tcgccaagtg
3181 cgattgaagc agcgcgagaa ggcatttcca tcatacagaa tgagcccgag cggagaaagc
3241 aattgctgaa aaatgcgcag tacttacgat tgaaattaga ggaatctggt tttgtaatga
3301 aagaagggga aacacctatt atttctctta tcattggtgg ttctcatgaa gccatgcagt
3361 tttctgcgaa actactggat gaaggtgtct ttattccagc gattcgacca ccaacagtgc
3421 cgaaagggtc aagtcggttg cgtataacgg taatggctac acatacaata gagcagctcg
3481 atatggtcat tagtaaaatt aagaaaatag gaaaagaaat ggggattgta taattgtttg
3541 agtgcctggc actcaaacaa ttttttgatt gctttttctt attaattaca tagcactaac
3601 cttcaagttc tgcaattgcc tctaattcaa ttaagcagcc atggcttaat tctttaattg
3661 ggataatggc gcgggcaggc ttatgactac cgaaaaattg tgcataagtt tcatttacgt
3721 ccgcccataa atcagcagat gttatataga cgcgacacat aacaacagcc tccttcgtaa
3781 gaccgcttgc ttgaagcacg acttccatct tttgaagcgc catgaatgtt tcagcgtgaa
3841 tgccaccagt tggaggtttg cctgtagctg gatcgactga agtttgcccc gaaatatata
3901 aattcccgtt actaatcatg cctggtgtat aatgcccgat tggctcttgt tctatcaaaa
3961 ctttcttcat aataagttcc tccttataaa cgttgtaatt gatttagccc aatattgcca
4021 cctgaaataa ggaagcaaac atggtcgttt ttcgtaatcg gtaaattgcc tcgaagagct
4081 gctcctatcc cgatggcaga ggagggttca gctaatattt tgccttctag tagcaatgtt
4141 ttcatcccat taagtatgga agcctcgtct acacgcacaa tgtcatcaac taatttttca
4201 acaatcggga agtttcgctc cctaggttgt aaagttaata gagcatccgc taacgattgt
4261 tgttcagcta atagtaatct tttaccaact tgaatacttt ctgaatagcg agggacgacg
4321 gctggttcaa caccaaaaat tttaatatgt ggtgctaaag attttaccgc agttgcgaca
4381 ccacttagca gaccaccacc accaatagga acaacaattg cacttacctg agagaactgt
4441 tctaaaattt ccaatcctac tgtcccttgc cccgccataa tatcgtaatc atcataagga
4501 tgcacaatgg cataaccatg ttctaagctt
//
LOCUS HUMATPAS01 1458 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exons 2-5.
ACCESSION M28284
KEYWORDS adenosine triphosphatase.
SEGMENT 1 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 1458)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 35 148 adenosine triphosphate alpha subunit, exon 2
(EC 3.6.1.3) (AA at 35)
308 442 adenosine triphosphate alpha subunit, exon 3
876 993 adenosine triphosphate alpha subunit, exon 4
1109 + 1377 adenosine triphosphate alpha subunit, exon 5
IVS < 1 34 adenosine triphosphate alpha subunit intron A
IVS 149 307 adenosine triphosphate alpha subunit intron B
IVS 443 875 adenosine triphosphate alpha subunit intron C
IVS 994 1108 adenosine triphosphate alpha subunit intron D
IVS 1378 > 1458 adenosine triphosphate alpha subunit intron E
BASE COUNT 267 a 493 c 429 g 269 t
ORIGIN
1 gtcgacggat caacatcccc acatctcccc acagctgtac ctgggcatcg tgctggcggc
61 cgtggtgatc atcactggct gcttctccta ctaccaggag gccaagagct ccaagatcat
121 ggagtccttc aagaacatgg tgccccaggt gaagggtgcc cagcaagggc cagagggggt
181 gttagtgtat gggctggggc cgggcccagc aagggccaga gggggtgtta gtgtatgggc
241 tggggccggg ccagtgaccc ccaggcagag ggagtctggg aggtgacatt actccatccc
301 acctcagcaa gccctggtga tccgggaagg tgagaagatg caggtgaacg ctgaggaggt
361 ggtggtcggg gacctggtgg agatcaaggg tggagaccga gtgccagctg acctgcggat
421 catctcagcc cacggctgca aggtgggcct gtagggcccg gctttaccct ccctcgggct
481 gcccaggagc tcaggcccag ccctcctccc tcacactcag gagtccaggt cccaacccct
541 cctccctcag acccaggagt ccaggccccc aggccctcct ccctcagacc caggagttta
601 ggccctgggc acctcctcac ccaacagtca ggagtccagg cccagccctc acctttcgga
661 gatccttagg gaccctagac cttggccagc agcactgtgc cttccccacc ttcaagatcc
721 cagcctctgg cctcccacac accactcacc cactgggcac ccaggcttct agctgtgatc
781 tccaggcaca caggcttcag cccaagccct gtacacaaat accctcctgt cccaggccct
841 ggactgaacc ctctctctgc tgcacctacc cccaggtgga caactcctcc ctgactggcg
901 aatccgagcc ccagactcgc tctcccgact gcacgcacga caaccccttg gagactcgga
961 acatcacctt cttttccacc aactgtgtgg aaggtgaggc gggtgcagag aagacacaca
1021 gctgggcaga ccagggatgt gtcccagggg tcaggcctca gaacctccct gagccacccc
1081 acctcagcct aacccctctg ggctgcaggc acggctcggg gcgtggtggt ggccacgggc
1141 gaccgcactg tcatgggccg tatcgccacc ctggcatcag ggctggaggt gggcaagacg
1201 cccatcgcca tcgagattga gcacttcatc cagctcatca ccggcgtggc tgtcttcctg
1261 ggtgtctcct tcttcatcct ctccctcatt ctcggataca cctggcttga ggctgtcatc
1321 ttcctcatcg gcatcatcgt ggccaatgtc ccagagggtc tgctggccac tgtcactgta
1381 aggccaggct cctgggtctg ggtggggagg gcctgggggt ctgggctcct gggtctgagg
1441 gaggaggggc tgggatcc
//
LOCUS HUMATPAS02 200 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exon 6.
ACCESSION M28285
KEYWORDS adenosine triphosphatase.
SEGMENT 2 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 200)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 33 + 200 adenosine triphosphate alpha subunit, exon 6
(EC 3.6.1.3)
IVS < 1 32 adenosine triphosphate alpha subunit intron E
BASE COUNT 44 a 65 c 52 g 39 t
ORIGIN
1 ccctgtaatt gcctgccttg ctcgtcctcc aggtgtgtct gaccgtgacc gccaagcgca
61 tggcccggaa gaactgcctg gtgaagaacc tggaggctgt agagaccctg ggctccacgt
121 ccaccatctg ctcagataag acagggaccc tcactcagaa ccgcatgaca gtcgccacca
181 tgtggtttga caaccagatc
//
LOCUS HUMATPAS03 200 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exon 7 (5' end).
ACCESSION M28286
KEYWORDS adenosine triphosphatase.
SEGMENT 3 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 200)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 67 / 200 adenosine triphosphate alpha subunit, exon 7
(EC 3.6.1.3)
IVS < 1 66 adenosine triphosphate alpha subunit intron F
(no splice consensus)
BASE COUNT 43 a 68 c 55 g 34 t
ORIGIN
1 ggccaccctc tgatcggtcc ccagctctcc atccatgaga ccgaggaccc caacgacaac
61 cgatacctgc tggtgatgaa gggtgccccc gagcgcatcc tggaccgctg ctccaccatc
121 ctgctacagg gcaaggagca gcctctggac gaggaaatga aggaggcctt tcagaatgcc
181 taccttgagc tcggtggcct
//
LOCUS HUMATPAS04 559 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exons 8 and 9.
ACCESSION M28287
KEYWORDS adenosine triphosphatase.
SEGMENT 4 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 559)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 155 330 adenosine triphosphate alpha subunit, exon 8
(EC 3.6.1.3) (AA at 157)
401 + 537 adenosine triphosphate alpha subunit, exon 9
IVS < 1 154 adenosine triphosphate alpha subunit intron G
IVS 331 400 adenosine triphosphate alpha subunit intron H
IVS 538 > 559 adenosine triphosphate alpha subunit intron I
BASE COUNT 125 a 165 c 157 g 112 t
ORIGIN
1 acttaatata tatatagtag tatgaaggaa gtatcctaga atgaatgctg actgccgtct
61 gctgatggga gatgaatgcg gcgatcagac atctaggcat gggtgcaggt tccgaggctg
121 gaaccctcac accccaaccc ttccctgcca ctaggtttct gccattatta cctgcccgag
181 gagcagtatc cccaaggctt tgccttcgac tgtgatgacg tgaacttcac cacggacaac
241 ctctgctttg tgggcctcat gtccatgatc ggcccacccc gggcagccgt ccctgacgcg
301 gtgggcaagt gtcgcagcgc aggcatcaag gtgtggcttg ggtgctgggg aggcaagcag
361 gcgtggcgga gaggccatcc ctaaaaaaca atgcctgcag gtcatcatgg tcaccggcga
421 tcaccccatc acggccaagg ccattgccaa gggtgtgggc atcatctctg agggcaacga
481 gactgtggag gacatcgccg cccggctcaa cattcccgtc agccaggtta accccaggtg
541 agccacccat tcccagcca
//
LOCUS HUMATPAS05 323 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exon 10.
ACCESSION M35821
KEYWORDS adenosine triphosphatase.
SEGMENT 5 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 323)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 62 + 212 adenosine triphosphate alpha subunit, exon 10
(EC 3.6.1.3)
IVS < 1 61 adenosine triphosphate alpha subunit intron I
IVS 213 > 323 adenosine triphosphate alpha subunit intron J
BASE COUNT 59 a 106 c 97 g 61 t
ORIGIN
1 gatcactttg ccactcctca cacaccctga cctcagccat cgctctctct gctcttccca
61 gggatgccaa ggcctgcgtg atccacggca ccgacctcaa ggacttcacc tccgagcaaa
121 tcgacgagat cctgcagaat cacaccgaga tcgtcttcgc ccgcacatcc ccccagcaga
181 agctcatcat tgtggagggc tgtcagagac aggtgggctg cgctcccgca gaggagggga
241 cggggccttg actcctgggt cctcactgag gccggggctg gtttcctggg tctgagggag
301 gaggggctgg gggtctgacc tgg
//
LOCUS HUMATPAS06 428 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exon 11.
ACCESSION M35822
KEYWORDS adenosine triphosphatase.
SEGMENT 6 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 428)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 147 + 315 adenosine triphosphate alpha subunit, exon 11
(EC 3.6.1.3)
IVS < 1 146 adenosine triphosphate alpha subunit intron J
IVS 316 > 427 adenosine triphosphate alpha subunit intron K
BASE COUNT 83 a 110 c 147 g 88 t
ORIGIN
1 tctagagagt ctgagcctaa ctcctagtct aggaagagag ttggacctaa cttctagtct
61 aagagagggc tgggtcctgg ctttctggat ctgaggaggg actctgggga ctggcctctg
121 gtgtcatcct taccctctct ccccagggtg caattgtggc tgtgaccggg gatggtgtga
181 acgactcccc cgctctgaag aaggccgaca ttggggtggc catgggcatc gctggctctg
241 acgtctccaa gcaggcagct gacatgatcc tgctggacga caactttgcc tccatcgtca
301 caggggtgga ggagggtgag ttggccaggg gtggcctgga gaccaggtca ctaccggagc
361 ctgagaccag caaggggaac tggccagggc tgcaggggga tgtgtggcag agaccaaggc
421 ccctgcct
//
LOCUS HUMATPAS07 939 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, intron J.
ACCESSION M28288
KEYWORDS adenosine triphosphatase.
SEGMENT 7 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 939)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
IVS < 1 > 939 adenosine triphosphate alpha subunit intron K
BASE COUNT 287 a 197 c 250 g 205 t
ORIGIN
1 ggatcccgtc ctgagtgaca cgagccagac acaaaagcat tcttagcatc caactccaac
61 tggagcaggc tcagcacccg tctctgggct aaaagtcagg ataatggtta tgtctaggag
121 gagggatata gactggaaga gagcagacaa aacctaaatg gggggctcgg aatggttttc
181 tatcttgact ggggtctggt ggttgcatgg ctgtgtacat gtgtaaaaat tgtatacacc
241 ctctctggtt gaatatttta atgagatata atatgcccag ctcagtgtat gcatgccata
301 tctcaataaa tattcaacca gtacaggttg tctaaagtaa acaaaataaa gattaatcca
361 ggccaggcgg tggggctaca cctgtaatcc caagactttg gagaccgaag caggaggatt
421 gcctgagatc aagagttcaa gaccagcctg gccaacatga tgaaacccca tctctactaa
481 aaatacaaaa attagctggg cttggtggtg ggcacctgta accccagcta cttgggaggc
541 tgaggcagga gaatcatttg aacctcagag gcagacgttg cagtgagccg caattgtgcc
601 attgcatcca gcctgggcaa cagagcaaga ctccatctca aaaaaataaa ataaaataaa
661 ataaacatta gtcaggcacg gtggctcgag cctgtggtcg cagctacttg ggaggctgag
721 gtgggaggat cacttgagga ggtcgaggtt gcagtgactg atgattgcag cactgggcta
781 ccagcctggg caacgaggtg gagaccttct ctctaaataa ataaataagt aaaaagatta
841 gcccatgaac aatgaggtca atcattaaga aagaacaggc gggcacggtg gcaaacacct
901 gtaatcccag cactttggga gcgctgaggc gggtggatc
//
LOCUS HUMATPAS08 1619 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exons 12 and 13.
ACCESSION M28289
KEYWORDS adenosine triphosphatase.
SEGMENT 8 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 1619)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 1069 1223 adenosine triphosphate alpha subunit, exon 12
(EC 3.6.1.3)
1301 + 1424 adenosine triphosphate alpha subunit, exon 13
IVS < 1 1068 adenosine triphosphate alpha subunit intron K
IVS 1224 1300 adenosine triphosphate alpha subunit intron L
IVS 1425 > 1619 adenosine triphosphate alpha subunit intron M
BASE COUNT 372 a 419 c 489 g 339 t
ORIGIN
1 ggatccaagg tccagctgtg gagttgtgtc tgtagctcgc cccttcttgc tgtgtagtat
61 ttgactgcat ggatgtaccg tggtttattc atccctcacc tgttgatgga catggggttg
121 actctggctt ttggctctga tgagtggtgc tatggtgaac attctcgtac ttgcttttcg
181 ctgggcataa acagtcctcc tctcttttgg gtataaaacc taggagtcgg attactcggt
241 cggtggggtg ggggatgttt gatagcaact ccgaaaccat cttccaaagt ggttatttca
301 agaatgtttc cacaaaggcg atctggctgg atgtttatgt ggaaaggtag gtgggtgagg
361 ggccaggtgt ggagacttat gcctgtaatc ccagcacttc gggaggctga gatggaatca
421 cagcccagaa attagacaag cctaggaaaa gtggtgagac tctgtctcta ctaaaaatac
481 aaaaattagc tggacgtggt ggcgcgtgac tgtagtctca gctacatggg aggctgaggc
541 aggaggatca cttgagcctg ggagtcgagg ttgcagtgag ccgtgatcat gccactgcac
601 tgcagcctgg gcaacagagt gagaccctgt cccaaaaaga aaataatggt gggtgggctg
661 gggggtagaa aaggacgcag gccgggcgcg gtcgtcacgc ctgtaatccc aggcacttca
721 ggaggccgag gagggcagat cacgaggtca ggagattgag accatgcagg ctaacacagt
781 gaaaccccgt ctctaataaa aatacaaaaa aattagccgg gcgtggtggc aggtgcctgt
841 agtgccagct actcgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt
901 gcagtgagcc gagatcgcac cactgcactc cagcctgggc aacacagcga gactctgtct
961 caaaaaaaaa aaaaaaaaaa aaaaggacgt tggatgaggg cagaggaggg gcagagggag
1021 tggggctccc tggcatgggc gcctgctctg agcctgcctg tgccacaggc cgcctgatct
1081 tcgacaacct aaagaagtcc attgcctaca ccctgaccag caatatcccg gagatcacgc
1141 ccttcctgct gttcatcatg gccaacatcc cgctgcccct gggcaccatc accatcctct
1201 gcatcgatct gggcactgac atggtgagcc ctggcagcca cccttggggc caggagggtg
1261 gagtcctccc ctctccggct caccggcctc ctccgcctag gtccctgcca tctcactggc
1321 gtacgaggct gccgaaagcg acatcatgaa gagacagccc aggaacccgc ggacggacaa
1381 attggtcaat gagagactca tcagcatggc ctacgggcag attggtgagg caccggggac
1441 tccatctcct taccaccatg ccggcctaga gcatgcctgg ccaccgtgcg tgcttgggac
1501 cctggcattg actcagggga gcagacgtgg gcaggaccaa ccagtgagct atctgagggg
1561 ggggtctgca ccccatcctt ctccacctcc tcctctctgc tgttgatgtg tgcggatcc
//
LOCUS HUMATPAS09 276 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exon 14.
ACCESSION M28290
KEYWORDS adenosine triphosphatase.
SEGMENT 9 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 276)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 86 + 231 adenosine triphosphate alpha subunit, exon 14
(EC 3.6.1.3)
IVS < 1 85 adenosine triphosphate alpha subunit intron M
IVS 232 > 276 adenosine triphosphate alpha subunit intron N
BASE COUNT 46 a 83 c 81 g 66 t
ORIGIN
1 ccgggccctc tgaggtgccc tgggttaggc tgactggccc caatctgagc ctctgcctga
61 ggacttacct ctctcctcct tccaggaatg atccaggctc tcggtggctt cttctcttac
121 tttgtgatcc tggcagaaaa tggcttcttg cccggcaacc tggtgggcat ccggctgaac
181 tgggatgacc gcaccgtcaa tgacctggaa gacagttacg ggcagcagtg ggtgagtagg
241 gcagggtgct gtgtctctgc ccaccgtaag atcccc
//
LOCUS HUMATPAS10 567 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exon 15.
ACCESSION M28291
KEYWORDS adenosine triphosphatase.
SEGMENT 10 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 567)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 366 + 496 adenosine triphosphate alpha subunit, exon 15
(EC 3.6.1.3)
IVS < 1 365 adenosine triphosphate alpha subunit intron N
IVS 497 > 567 adenosine triphosphate alpha subunit intron O
BASE COUNT 95 a 179 c 161 g 132 t
ORIGIN
1 ctgatgccct tgccagccca tctgagcctt tgtcccactg ctgagctcgt ctgccctgcc
61 agccttcttc cccacctctg tctgtccctt caaagcccag ttgctgtctg tctccttggg
121 aggccacagg gtcttatcct cctccctggc ccctggtgga tgtgagatca ccattaactt
181 tcttcttcct tgtacgtctc ctgactcctc cctctgggac tatgagcccg cagaaggaag
241 acacacctga ggccctgagg accccatgca ggatgggcgg ggcaaagagc acggaacgtc
301 ggggtggcgg ctagggctgc agtgccacta actgacgtcc cgtgccctgg tcaccgctcc
361 tgcagacata cgagcagagg aaggtggtgg agttcacctg ccacacggcc ttctttgtga
421 gcatcgttgt cgcccagtgg gccgatctga tcatctgcaa gacccggagg aactcggtct
481 tccagcaggg catgaagtga gggccggggg acatggtgac tggacagcca tctgtctgtc
541 cgagtcgtct gtctgtcgtg tgtactc
//
LOCUS HUMATPAS11 212 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exon 16.
ACCESSION M28292
KEYWORDS adenosine triphosphatase.
SEGMENT 11 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 212)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 28 + 129 adenosine triphosphate alpha subunit, exon 16
(EC 3.6.1.3)
IVS < 1 27 adenosine triphosphate alpha subunit intron O
IVS 130 > 212 adenosine triphosphate alpha subunit intron P
BASE COUNT 33 a 81 c 53 g 45 t
ORIGIN
1 cccagccctg cccttctgtg cctccaggaa caagatcatg atcttcgggc tgtttgagga
61 gacggccctg gctgccttcc tgtcctactg ccccggaatg gacgtggccc tgcgcatgta
121 ccctctcaag tgagtgcccc gctgccccca gccctgccca caccagcgcc tgccatggag
181 cctcttcctt agactaggtc tgaactaggc cc
//
LOCUS HUMATPAS12 1047 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit
gene, exon 17 and 18.
ACCESSION M28293
KEYWORDS adenosine triphosphatase.
SEGMENT 12 of 12
SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE 1 (bases 1 to 1047)
AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide
sequence related to the alpha-subunit
JOURNAL FEBS Lett. 213, 73-80 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 101 192 adenosine triphosphate alpha subunit, exon 17
(EC 3.6.1.3)
474 502 adenosine triphosphate alpha subunit, exon 18
IVS < 1 100 adenosine triphosphate alpha subunit intron P
BASE COUNT 145 a 391 c 236 g 275 t
ORIGIN
1 ggatcctggg agactgcccc tctgcgtccc ctccagtccc ctgaaactct gcctctcgtt
61 agggccccgc actcaagccc tcctgctctc ccctctgcag gcccagctgg tggttctgtg
121 ccttccccta cagtttcctc atcttcgtct acgacgaaat ccgcaaactc atcctgcgca
181 ggaacccagg gggtgaggga gctcggcaag gcagccgagg ggggcggggg gcagcagggt
241 ctcaggcagc tggtcccagg ctcccctcgc cctgctggat gctctgccac ctggttccca
301 ctcttctctc tcttcccatc tctccggcac ccactctgtc ttctcacggg tctctgtctg
361 tgtggtttcc ttgtctctct ccctctctgt ccctctctct gctgggcggc tcgccttgcc
421 tgtctctctc catctcttac tctgtctctt tctttccttc tttgtctctc caggttgggt
481 ggagaaggaa acctactact gacctcagcc ccaccacatc gcccatctct tccccgtccg
541 caggccagga ccgcccctgt cagtcccccc aattttgtat tctgggggga ggagccctct
601 cttcctgtgg ccccaccttg gcccccaccc cctccactat ctcctgccgc ccccactctg
661 gctggcttct ctcccctgcc ccaaacctct ctcctctctc ttttctgtgt cagtttctct
721 ccctctcctc acccctctat ccattcctcc cgccccagcc acctccctgg gctctttttt
781 actccccttc agccccccgg tgatgccatc tctggttctg gacaattatc aaatatatca
841 gtggggagag agaagcggtg tgtgtgttgt gcctgctttc cagacgggga ctgcggctgg
901 gacagcatcc cctctgttgg ggcgcgctgc caggcgactc tcgaaactgc aggaacttag
961 cctggcactg gcttggaagt cacggaatct cagagccatc taatcacgga gttttcaact
1021 ccacgaaagt cagagcagct tgaattc
//
LOCUS HUMATPASE1 309 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 2, clone
lambda-NK-alpha-RD-16.
ACCESSION M27571
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SEGMENT 1 of 3
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 309)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 21 + 289 Na+, K+ -ATPase alpha subunit, exon 2 (AA at
23)
IVS < 1 20 Na+, K+ -ATPase alpha subunit intron A
IVS 290 > 309 Na+, K+ -ATPase alpha subunit intron B
BASE COUNT 54 a 88 c 96 g 71 t
ORIGIN
1 gatgccccac catgttgcag gcactgccag gggcattgtg attgccacag gagaccggac
61 ggtgatgggc cgcatagcta ctctcgcctc aggcctggag gttgggcgga cacccatagc
121 aatggagatt gaacacttca tccagctgat cacaggggtc gctgtattcc tgggggtctc
181 cttcttcgtg ctctccctca tcctgggcta cagctggctg gaggcagtca tcttcctcat
241 cggcatcata gtggccaacg tgcctgaggg gcttctggcc actgtcactg tgagtgggtc
301 aggctgagg
//
LOCUS HUMATPASE2 239 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 3, clone
lambda-NK-alpha-RD-16.
ACCESSION M27576
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SEGMENT 2 of 3
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 239)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 21 + 219 Na+, K+ -ATPase alpha subunit, exon 3
IVS < 1 20 Na+, K+ -ATPase alpha subunit intron B
IVS 220 > 239 Na+, K+ -ATPase alpha subunit intron C
BASE COUNT 53 a 76 c 66 g 44 t
ORIGIN
1 ttctcctctt tctctaccag gtgtgcctga ccctgacagc caagcgcatg gcacggaaga
61 actgcctggt gaagaacctg gaggcggtgg agacgctggg ctccacgtcc accatctgct
121 cggacaagac gggcaccctc acccagaacc gcatgaccgt cgcccacatg tggtttgaca
181 accaaatcca tgaggctgac accaccgaag atcagtctgg tgattgggtg ctccagcgg
//
LOCUS HUMATPASE3 150 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 4, clone
lambda-NK-alpha-RD-16.
ACCESSION M27578
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SEGMENT 3 of 3
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 150)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 21 / 130 Na+, K+ -ATPase alpha subunit, exon 4
IVS < 1 20 Na+, K+ -ATPase alpha subunit intron C
IVS 131 > 150 Na+, K+ -ATPase alpha subunit intron D
BASE COUNT 31 a 49 c 38 g 32 t
ORIGIN
1 tccccctcat ttcctcccag gggccacttt tgacaaacga tcccctacgt ggacggccct
61 gtctcgaatt gctggtctct gcaaccgcgc cgtcttcaag gcaggacagg agaacatctc
121 cgtgtctaag gtagggggtc aggacacaca
//
LOCUS HUMATPASA1 308 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 2, clone
lambda-NK-alpha-TW-4.
ACCESSION M27572
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SEGMENT 1 of 2
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-TW-4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 308)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 21 / 288 Na+, K+ -ATPase alpha subunit, exon 2 (AA at
22)
IVS < 1 20 Na+, K+ -ATPase alpha subunit intron A
IVS 289 > 308 Na+, K+ -ATPase alpha subunit intron B
BASE COUNT 56 a 79 c 80 g 93 t
ORIGIN
1 tccctcccct tctttttaag gaccgcacgt ggtattgttg tctacactgg ggatcgcact
61 gtgatgggaa gaattgccac acttgcttct gggctggaag gaggccagac ccccattgct
121 gcagaaattg aacattttat ccacatcatc acgggtgtgg ctgtgttcct gggtgtgtct
181 ttcttcatcc tttctctcat ccttgagtac acctggcttg aggctgtcat cttcctcatc
241 ggtatcatcg tagccaatgt gccggaaggt ttgctggcca ctgtcacggt aagaggcagg
301 tgatggtc
//
LOCUS HUMATPASA2 95 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 4, 3' end, clone
lambda-NK-alpha-TW-4.
ACCESSION M27579
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SEGMENT 2 of 2
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-TW-4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 95)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 / 75 Na+, K+ -ATPase alpha subunit, exon 4 (AA at
1)
IVS 76 > 95 Na+, K+ -ATPase alpha subunit intron D
BASE COUNT 27 a 20 c 21 g 27 t
ORIGIN
1 gctctgtcca gaattgcagg tctttgtaac agggcagtgt ttcaggctaa ccaggaaaac
61 ctacctattc ttaaggtatg ctcaagagtt aacta
//
LOCUS HUMATPASEA 188 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 3 (5' end).
clone lambda-NK-alpha-SW3.2.
ACCESSION M27574
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-SW3.2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 188)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 21 > 188 Na+, K+ -ATPase alpha subunit, exon 3 (AA at
21)
IVS < 1 20 Na+, K+ -ATPase alpha subunit intron B
BASE COUNT 45 a 57 c 50 g 36 t
ORIGIN
1 gtccttccct ctcctgctag gtgaccctgt cgctgacagc aaaacggatg gccaagaaga
61 actgcctggt gaagaacctg gaggctgtgg agacccctgg ctccacctcc atcatctgct
121 cggacaagac tgggacactg acccagaaca ggatgacagt ggcccatctg tggttcgaca
181 atcagatc
//
LOCUS HUMATPASEB 141 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 3 (partial),
clone lambda-NK-alpha-R15-1.
ACCESSION M27575
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R15-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 141)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 141 Na+, K+ -ATPase alpha subunit, exon 3 (AA at
1)
BASE COUNT 32 a 41 c 41 g 27 t
ORIGIN
1 ctgtccctga cagccaagcg cctggccagt aagaactgcg tggtcaagaa cctggaggcg
61 gtggagacat tgggctccac ttcggtgatc tgctcggaca agacagggac tctcactcag
121 aaccgcatga ctgtgtccca t
//
LOCUS HUMATPAB1 309 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 2,
clone lambda-NK-alpha-R3-2.
ACCESSION M27570
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SEGMENT 1 of 3
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 309)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 21 + 289 Na+, K+ -ATPase alpha subunit, exon 2 (AA at
23)
IVS < 1 20 Na+, K+ -ATPase alpha subunit intron A
IVS 290 > 309 Na+, K+ -ATPase alpha subunit intron B
BASE COUNT 45 a 103 c 89 g 72 t
ORIGIN
1 ctaacccctc tggcctgcag gcacggctcg gggcgtggtg gtggccacgg gcgaccgcac
61 tgtcatgggc cgtatcgcca ccctggcatc agggctggag gtgggcaaga cgcccatcgc
121 catcgagatt gagcacttca tccagctcat caccggcgtg gctgtcttcc tgggtgtctc
181 cttcttcatc ctctccctca ttctcggata cacctggctt gaggctgtca tcttcctcat
241 cggcatcatc gtggccaatg tcccagaggg tctgctggcc actgtcactg taaggccagg
301 ctcctgggt
//
LOCUS HUMATPAB2 239 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 3,
clone lambda-NK-alpha-R3-2.
ACCESSION M27573
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SEGMENT 2 of 3
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 239)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 21 + 219 Na+, K+ -ATPase alpha subunit, exon 3 (AA at
22)
IVS < 1 20 Na+, K+ -ATPase alpha subunit intron B
IVS 220 > 239 Na+, K+ -ATPase alpha subunit intron C
BASE COUNT 54 a 77 c 68 g 40 t
ORIGIN
1 ctgccttgct cgtcctccag gtgtgtctga ccgtgaccgc caagcgcatg gcccggaaga
61 actgcctggt gaagaacctg gaggctgtag agaccctggg ctccacgtcc accatctgct
121 cagataagac agggaccctc actcagaacc gcatgacagt cgcccacatg tggtttgaca
181 accagatcca cgaggctgac accactgagg accagtcagg tgagcgcagg ccccgggta
//
LOCUS HUMATPAB3 150 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 4,
clone lambda-NK-alpha-R3-2.
ACCESSION M27577
KEYWORDS Na+, K+ -ATPase alpha-subunit.
SEGMENT 3 of 3
SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 150)
AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
TITLE The family of human Na+ K+ -ATPase genes: No less than five genes
and/or pseudogenes related to the alpha-subunit
JOURNAL FEBS Lett. 217, 275-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 21 / 130 Na+, K+ -ATPase alpha subunit, exon 4 (AA at
22)
IVS < 1 20 Na+, K+ -ATPase alpha subunit intron C
IVS 131 > 150 Na+, K+ -ATPase alpha subunit intron D
BASE COUNT 27 a 50 c 38 g 35 t
ORIGIN
1 ctcacacatg cctcccccag ggacctcatt tgacaagagt tcgcacacct gggtggccct
61 gtctcacatc gctgggctct gcaatcgccc tgtcttcaag ggtggtcagg acaacatccc
121 tgtgctcaag gtgggttagc tactggcctc
//
LOCUS NGORRDA 1486 bp ss-rRNA RNA 30-JUN-1990
DEFINITION N.gonorrhoeae 16S ribosomal RNA.
ACCESSION M34130
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE N.gonorrhoeae (strain 2013) ribosomal RNA.
ORGANISM Neisseria gonorrhoeae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1486)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
Dept. Microbiology
131 Burrill Hall
407 S. Goodwin Ave.
University of Illinois
Urbana, IL 61801
FEATURES from to/span description
rRNA 1 1486 16S ribosomal RNA
BASE COUNT 365 a 334 c 460 g 294 t 33 others
ORIGIN
1 nnaacataag agtttgatcc tggctcagat tgaacgctgg cggcatgctt tacacatgca
61 agtcggacgg cagcacaggg aagcttgctt ctcgggtggc gagtggcgaa cgggtgagta
121 acatatcgga acgtaccggg tagcggggga taactgatcg aaagatcagc taataccgca
181 tacgtcttga gagggaaagc aggggacctn cgggccntgc gctatcngag cggccgatat
241 ctgattagct ngtnggcggg gtaaaggccc accaaggcga cgatcagtag cgggtctgag
301 aggatgatcc gcnacactgg gactgagaca cggccnagac tcctacggga gncagcagtg
361 ggnnattttg gacaatgggc gcnagcctga tccagccatg cngcgtgtct gaagaaggcc
421 ttcgggttgt aaaggacttt tgtcagggaa gaaaaggccg ttgccaatat cggcggccga
481 tgacggtacc tgaagaataa gcaccggcta actacgtgcn agcagccncg nnnatacgta
541 gggtgcgagc gttaatcgga attactgggc gtaaagcggg cgcagacggt tacttaagca
601 ggatgtgaaa tcncngggct caacccggna actgcgttct gaactgggtg actcgagtgt
661 gtcagaggga ggtggaattc cacgtgtagc agtgaaatgc gtagagatgt ggaggaatac
721 cnatggcgaa ggcagcctcc tgggataaca ctgacgttca tgtccgaaag cgtgggtagc
781 aaacaggatt agataccctg gtagtccacg ccctaaacga tgtcaattag ctgttgggca
841 actnnattgc ttggtagcgt agctaacgcg tgaaattgac cgcctgggga gtacggtcgc
901 aagattaaaa ctcaaaggaa ttgacggggn accgcacaag cggtggatga tgtggattaa
961 ttcgatnnaa cgcgaagaac cttacctggt tttgacatgt gcggaatcct ccggagacgg
1021 agnagtgcct tcgggagccg taacacaggt gctgcatggc tgtcgtcagc tcgtgtcgtg
1081 agatgttggg ttaagtcccg caacgagcgc aacccttgtc attagttgcc atcattcggt
1141 tgggcactct aatgagactg ccggtgacaa gccggaggaa ggtggggatg acgtcaagtc
1201 ctcatggccc ttatgaccag ggcttcacac gtcatacaat ggtcggtaca gagggtagcc
1261 aagcgcgagc ggagccaatc tcacaaaacc gatcgtagtc cggattgcac tctgcaactc
1321 gagtgcatga agtcggaatc gctagtaatc gcaggtcagc atactgcggt gaatacgttc
1381 ccngnncttg tacacaccgc ccgtcacacc atgggagtgg ggnataccag aagtaggtag
1441 ggtaaccgca aggagtccgc ttaccacggt atgcttcatg actggg
//
LOCUS SPLRRDA 1492 bp ss-rRNA RNA 30-JUN-1990
DEFINITION S.volutans 16S ribosomal RNA.
ACCESSION M34131
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE S.volutans (strain ATCC 19554) ribosomal RNA.
ORGANISM Spirillum volutans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 1 to 1492)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
Dept. Microbiology
131 Burrill Hall
407 S. Goodwin Ave.
University of Illinois
Urbana, IL 61801
FEATURES from to/span description
rRNA 1 1492 16S ribosomal RNA
BASE COUNT 389 a 327 c 446 g 299 t 31 others
ORIGIN
1 tcaaatgaag agtttgatcc tggctcagat tgaacgctgn cngcatgcct tacacatgca
61 agtcgaacgg taacagggac ttcggtccgc tgacgagtgg cgaacgggtg agtaaagcat
121 aggaatacat ccattagtgg gggataacgc atcgaaaggt gtgctaatac cgcatacgct
181 ctaaggagga aagcanggga tccnagacct tgcgctaatg ggctagccta tgtccgatta
241 gctagttggt agagtaaaag cctaccaagg cgacgatcgg taactggtct gagaggatga
301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaatt
361 ttggacaatg ggggaaaccc tgatccagcc atgccgcgtg ggtgaagaag gccttcgggt
421 tgtaaagccc tttcgggagg aaagaaaagc attgctctaa catagcaatg tgatgacggt
481 acctccataa gaagcaccgg ctaactacgt gccagcngcc gcggtaatac gtagggtgcn
541 agcgttaatc ggaattactg ggcgtaaagc gtgcgcaggc ggctagataa gccagatgtg
601 aaatccccga gctcaacttg ggtatcgcgc ttggaactgt ctagcnagag tttggcngag
661 gggggtagaa ttccacgtgt agcagtgaaa tgcgtagaga tgtggaggaa taccgatggc
721 gaaggnngcc ccctgggcta aaactgacgc tcatgcncga aagcgtgggg agcaaacagg
781 attagatacc ctggtagtcc acgccctaaa cgatgtcaac tggttgttgg gagggtaaac
841 ctcttagtaa cgaagctaac gcgagaagtt gactgcctgg ggagtacggt cgcaagatta
901 aaactcaaag gaattgacgg gganccgcac aagcggtgga ttatgtggat taattcgatg
961 caacgcgaaa aaccttacct acctttgaca tgtcgagaat cctgcagaga tgcgggngtg
1021 ccttcgggaa ctcgaacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt
1081 gggttaagtc ccgcaacgag cgcaaccctt atcattagtn gctacgcaag ggcactctaa
1141 tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtcaagtcct catggccctt
1201 atgggtaggg cttcacacgt aatacaatgg tcggtacaga gggttgccaa cccgtgaggg
1261 ggagctaatc tcagaaagcc gatcgtagtc cggattgcag tctgcaactc gactgcatga
1321 agtcggaatc gctagtaatc gcggatcagc atgcngcggt gaatacgttc ccgggtcttg
1381 tacacaccgc ccgtcacacc atgggagtgg gtttcaccag aagtagttag cctaaccgcn
1441 aggagggcga ttaccacggt gngattcatg nnnnnnnnnn nnnngtaaca ag
//
LOCUS LOAREPDNA 839 bp ds-DNA INV 30-JUN-1990
DEFINITION Loa loa repetitive DNA sequence.
ACCESSION M34259
KEYWORDS repetitive DNA.
SOURCE Loa loa DNA.
ORGANISM Loa loa
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Orthorrhapha; Tabanoidea; Tabanidae.
REFERENCE 1 (bases 1 to 839)
AUTHORS Klion,A.D., Raghavan,N., Brindley,P.J. and Nutman,T.B.
TITLE Cloning and characterization of a species-specific repetitive DNA
sequence from Loa loa
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.D.Klion, 11-MAY-1990.
Author address: A.D.Klion
NIAID/NIH, Lab Parasitic Diseases
Bldg 4, Rm 126
9000 Rockville Pike
Bethesda, MD 20892
BASE COUNT 207 a 130 c 199 g 303 t
ORIGIN
1 gatcaaatac ctgatacgct cgtataaccg tgtcatccgg atatttatta tattaattat
61 taaagtgtta atcaaacatg agcaaaactt tttcgacgac gcaaattaca ttcctcgttt
121 tagtgcttta acccgaatgt aatataacat aacaaccatt tgcttaggct taggtttttc
181 gggtttatgc ttaggcttag gtttttcggg tttatgctta ggcttaggct taggttcttc
241 gggtttatgc ttaggcttag gcttaggttt ctcgggttta tgcttaggtt catgcttagg
301 cttaggctta ggcttgggct taggcttagg cttaggctta ggtttttcgg gtttatgctt
361 aggcttaggc ttaggcttgg gcttaggctt aggcttaggc ttaggttttt cgggtttatg
421 cttaggctta ggcttaggct taggcttggg cttaggctta ggcttaggct taggcttagg
481 cttaggtttt tcgggtttat gcttaggttt atgcttaggg tttatgctta gttttttttt
541 gaacactgtt cgataaccat ataagtatca taaatgtaaa catgtaaaat ttcagcattt
601 ataacaagaa gcaccaaaaa acaccgatgg atgaagcaaa agcggacgat gagcatgcca
661 cagtgaggag ctgttaagcg acttcgtgct gctacaaatt tgcattatgc aatggattac
721 agtaatccat tgttggatac agtgttgttt tctgatttgt agtaatccat cacagtaatt
781 cagcaaagtg aagtgtactc tgttgaaagc gtaagtattg cattggtgat attatgatc
//
LOCUS CFXRRDA 1486 bp ss-rRNA BCT 30-JUN-1990
DEFINITION C.aurantiacus 16S ribosomal RNA.
ACCESSION M34116
KEYWORDS 16S ribosomal RNA.
SOURCE C.aurantiacus (strain J10FL) ribosomal RNA.
ORGANISM Chloroflexus aurantiacus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Multicellular filamentous green bacteria; Chloroflexaceae.
REFERENCE 1 (bases 1 to 1486)
AUTHORS Oyaizu,H., Debrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and
Woese,C.R.
TITLE The green non-sulfur bacteria: A deep branching in the eubacterial
line of descent
JOURNAL Syst. Appl. Microbiol. 9, 47-53 (1987)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 15-MAY-1990.
FEATURES from to/span description
rRNA 1 1486 16S ribosomal RNA
BASE COUNT 301 a 394 c 495 g 224 t 72 others
ORIGIN
1 acaatgnaga gtttgatcct ggctcaggac gaacgctggc ggcgtgccta atgcatgcaa
61 gtcgaacgca gcaggcgtgc ctggctgcgt ggcgaacggc tgaggaacac gtgggtgacc
121 tgccccggag tgggggatac cccgtcgaaa gacgggacaa tcccgcatac gctcgacgga
181 ggaaagccga aggcgctctg nnaggggccn gcggcccatc aggtagttgg tgtggtaacg
241 gcgcaccaag ccaatgacgg gtacccggtc tgagaggacg accgggcaga ctgggactga
301 gacacggccc agactcctac gggaggcagc agcaaggaat tttccccaat gggcgcaagc
361 ctgagggagc aacgccgcgt ggaggacgac ggccttcggg ttgtaaactc ctttcgggtg
421 ggacgatgct gacggtacca ccagaagcag ccccggctaa ctctgtgcca gcagccgcgg
481 taagacagag ggggcnagcg ttgtccggag ttactgggcg taaagggcgc gcaggcggtg
541 ggctgcgtcg gcgctgaaag cgccccgctt aacggggcga ggcgcgccga tacgagtcca
601 ctcgaggcaa gcagagggtg gcggaattcc gggtggagcg gtgaaatgcg tagagatccg
661 gaggaacgcc agtggggaag ccggccacct gggcttgacc tgacgctgag gcgcgacagc
721 gtggggagca aaccggatta gatacccggg tagtccacgc cgtaaacgat gccggctcgg
781 cgtctggcgc acgttggtgt gctgggtgcc ttagctcacg cggtaagccg gccgcctggg
841 gannnnnnnn nnnngcttaa aactcaaagg aattgnnnnn nnnnngcaca agnnnnnnnn
901 nnnnnngttt aattcgacgc aacacgaaga accttacccg gacttgacat ggcgctgcat
961 cccctggaaa cagggncgcc tgncgaggcg caccacaggt gctgcatggc tgtcgtcagc
1021 tcgtgtcgtg agatgttggg ttcagtcccg caacgagcgc aacccgtgtc ggtagttaca
1081 ggtgtctacc gagactgccg ntgaccggcg gaggaaggcg cggatgacgt caagtcagca
1141 tggcccttac gtccggggcg acacacacgc tacaatggcc acgacaatgc gttgccaagc
1201 cgcaaggtgg agctaatcgc ctaaacgtgg tctcagtgca gantcggggc tgcaactcgc
1261 ccccgtgaag gcggagttgc tagtaaccgc gtatcagcca tggcgcggtg aatacgttcc
1321 cgggccttgt acacaccgcc cgtcacgtca tgggagtggc caatgcttga agtccgtgtg
1381 ctaaccccag tcggggaggc agcggccgag ggcaggggcc gcgacnnnnn nnnnnnngta
1441 acaagnnnnn ngtaccggaa gnnnnnnnnn gatcacctcc tttcta
//
LOCUS DVURRDA 1551 bp ds-DNA BCT 30-JUN-1990
DEFINITION D.desulfuricans 16S ribosomal RNA gene.
ACCESSION M34113
KEYWORDS 16S ribosomal RNA.
SOURCE D.desulfuricans (ATCC 27774) DNA.
ORGANISM Desulfovibrio desulfuricans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1551)
AUTHORS Oyaizu,H. and Woese,C.R.
TITLE Phylogenetic relationships among the sulfate respiring bacteria,
myxobacteria and purple bacteria
JOURNAL Syst. Appl. Microbiol. 6, 257-263 (1985)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
FEATURES from to/span description
rRNA 1 1551 16S ribosomal RNA
BASE COUNT 387 a 364 c 488 g 311 t 1 others
ORIGIN
1 tgaactggag agtntgattc tggctcagat tgaacgctgg cggcgtgctt aacacatgca
61 agtcgaacgc gaaagggact tcggtcctga gtaaagtggc gcacgggtga gtaacgcgtg
121 gataatctgc ccttatgatc gggataacag ttggaaacgg ctgctaatac cggatacgct
181 caaaatgaac tttttgagga aagatggcct ctgcttgcat gctatcacgt aaggatgagt
241 ccgcgtccca ttagcttgtt ggcggggtaa cggcccacca aggcatcgat gggtagccga
301 tttgagagga tgatcggcca cactggaact gaaacacggt ccagactcct acgggaggca
361 gcagtgggga atattgcgca atgggcgaaa gcctgacgca gcgacgccgc gtgagggatg
421 aaggttttcg gatcgtaaac ctctgtcaga agggaagaaa ctacgttgtg ctaatcagca
481 gcgtactgac ggtaccttca aaggaagcac cggctaactc cgtgccagca gccgcggtaa
541 tacggagggt gcaagcgtta atcggaatta ctgggcgtaa agcgcacgta ggctgtagtg
601 taagtcaggg gtgaaatccc acggctcaac cgtggaactg cctttgatac tgcacaactt
661 gaatccggga gagggtggcg gaattccagg tgtaggagtg aaatccgtag atatctggag
721 gaacatcagt ggcgaaggcg gccacctgga ccggtattga cgctgaggtg cgaaagcgtg
781 gggagcaaac aggattagat accctggtag tccacgctgt aaacgatgga tgctagatgt
841 cggggagtat tcttcggtgt cgtagttaac gcgttaagca tcccgcctgg ggagtacggt
901 cgcaaggctg aaactcaaag aaattgacgg gggcccgcac aagcggtgga gtatgtggtt
961 taattcgatg caacgcgaag aaccttacct aggtttgaca tccacggaac cctcccgaaa
1021 aggaggggtg cccttcgggg agccgtgaga caggtgctgc atggctgtcg tcagctcgtg
1081 tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ctatggatag ttgccagcaa
1141 gtaatgttgg gcactctatt cagactgccc gggttaaccg ggaggaaggt ggggacgacg
1201 tcaagtcatc atggccctta cgcctagggc tacacacgta ctacaatggc gcgcacaaag
1261 gggagcgaga ccgcgaggtg gagccaatcc caaaaaacgc gtcccagtcc ggattgcagt
1321 ctgcaactcg actgcatgaa gttggaatcg ctagtaattc gagatcagca tgctcgggtg
1381 aatgcgttcc cgggccttgt acacaccgcc cgtcacacca cgaaagtcgg ttttacccga
1441 agccggtgag ccaaccagca atggaggcag ccgtctacgg tagggccgat gattggggtg
1501 aagtcgtaac aaggtagccg taggggaacc tgcggctgga tcacctcctt t
//
LOCUS HPTRRDA 1484 bp ss-rRNA BCT 30-JUN-1990
DEFINITION H.aurantiacus 16S ribosomal RNA.
ACCESSION M34117
KEYWORDS 16S ribosomal RNA.
SOURCE H.aurantiacus (ATCC 23779) ribosomal RNA.
ORGANISM Herpetosiphon aurantiacus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales;
Cytophagaceae.
REFERENCE 1 (bases 1 to 1484)
AUTHORS Oyaizu,H., Debrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and
Woese,C.R.
TITLE The green non-sulfur bacteria: A deep branching in the eubacterial
line of descent
JOURNAL Syst. Appl. Microbiol. 9, 47-53 (1987)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 15-MAY-1990.
FEATURES from to/span description
rRNA 1 1484 16S ribosomal RNA
BASE COUNT 331 a 359 c 465 g 265 t 64 others
ORIGIN
1 nttgtcggcg agtttgatcc tggctcagga cgaacgctgg cggcgtgcct aatgcatgca
61 agtcgaacgc attcttcgga aggagtggcg cacggctgag gaacacgtga ctaacctacc
121 ccggtgtggg ggataacggg tcgaaagact cgctaatccc gccatacgat ngcctcggcg
181 gaggaaagcc gtaaggcgca ctggnngggg gtcgcgtccc attagatagt tggtgtggta
241 atggcgcacc aagtcgatga tgggtctctg gtctgagagg ncgaccagac agattgggac
301 tgagacacgg cccaaactcc tacggggngc agcagcaagg aattttcggc aatgggcgca
361 agcctgaccg agcaacgccg cgtggaggat gacggctctt gggttgtaaa ctccttttgg
421 gggggacgat aatgacggta ccctccgaat caggcccggc taactacgtg ccagcagccg
481 cggtaatacg taggggccaa gcgttgtccg gaattactgg gcgtaaagcg tgnntaggcn
541 gtcgatgatg tgccgcgtga aagcgccgga gtaatgccgg ccaggtcgcg gcagacacgt
601 tgactagagg ctcgcagagg aacgtggaat tcccggtgta gtggtgaaat gcgtagatat
661 cgggaggaac accagtggcg caagcggcgt tctgggcgag acctgacact gagccacgac
721 ggcgtgggga gcaaacagga ttagataccc tggtagtcca cgcagtaaac gatgcatacc
781 aggtgtggga tggcgttcgc gtcgttccgt gccgcagctt acgcgatgag tatgccgcct
841 ggggactacg agcgcaagct taaaactcaa aggaattgnn nnnnnnnngc acaagnnnnn
901 nnnnnnnnng tttaattcga cgcaacgcga agaaccttac ctagtcttga catagcactg
961 caagcttcgg aaatgaagtt cgccttcgag ggtgtgctac aggtgctgca tggctgtcgt
1021 cagctcgtgt cgtgagatgt tgggttaagt cccgcaacga gcgcaacccc tgtgaggtgt
1081 tacaagtgtc acctcagact gccgttgtca acaacggagg aaggcgggga tgacgtcaag
1141 tccgcatggc ccttacgact agggcgacac acacgctaca atggctggga gaatgcgccg
1201 cgacctggca acaggctgcg aatcgagaac accagtcaca gttcagattg ggggctgcaa
1261 ctcgccccca tgaaggcgga gttgctagta atcgccggtc agcatacggc ggtgaatcag
1321 tacccgggcc ttgtacacac cgcccgtcac gtcatggaag tgggaaacac ctgaagtccg
1381 tggctaaccg caaggaggca gcggccgagg gtgggtctcg taactgnnnn nnnnnngtaa
1441 caagnnnnnn nnnnnngaag nnnnnnnnng atcacctcct ttct
//
LOCUS MXARRDB 1540 bp ds-DNA BCT 30-JUN-1990
DEFINITION M.xanthus 16S ribosomal RNA gene.
ACCESSION M34114
KEYWORDS 16S ribosomal RNA.
SOURCE M.xanthus (strain MD207) DNA.
ORGANISM Myxococcus xanthus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria;
Myxococcaceae.
REFERENCE 1 (bases 1 to 1540)
AUTHORS Oyaizu,H. and Woese,C.R.
TITLE Phylogenetic relationships among the sulfate respiring bacteria,
myxobacteria and purple bacteria
JOURNAL Syst. Appl. Microbiol. 6, 257-263 (1985)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
FEATURES from to/span description
rRNA 1 1540 16S ribosomal RNA
BASE COUNT 384 a 372 c 488 g 295 t 1 others
ORIGIN
1 caattggaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa
61 gtcgagcgcg aataggggca acccttagta gagcggcgca cgggtgcgta acacgtggat
121 aatctgcctg agtgctcggg ataaccagtc gaaagattgg ctaataccgg ataagcccac
181 ggtttcttcg gagactgagg gaaaaggtgg cctctgtata caagctatca cattcagatg
241 agtccgcggc ccatcagcta gttggcgggg taatggccca ccaaggcaac gacgggtagc
301 tggtctgaga ggacgatcag ccacactgga actgagacac ggtccagact cctacgggag
361 gcagcagtgg ggaattttgc gcaatgggcg aaagcctgac gcagcaacgc cgcgtgtgtg
421 atgaaggtct ttggattgta aagcactttc gaccgggaag aaaacccgtt ggctaacatc
481 caacggcttg acggtaccgg gagaagaagc accggctaac tctgtgccag cagccgcggt
541 aatacagagg gtgcaagcgt tgttcggaat tattgggcgt aaagcgcgtg taggcggcgt
601 gacaagtcgg gtgtgaaagc cctcagctca actgaggaag tgcgcccgaa actgttgtgc
661 ttgagtgccg gagagggtgg cggaattccc caagtagagg tgaaattcgt agatatgggg
721 aggaacaccg gtggcgaagg cggccacctg gacggtaact gacgctgaga cgcgaaagcg
781 tggggagcaa acaggattag ataccctggt agtccacgcc gtaaacgatg agaactaggt
841 gtcgtgggag ttgacccccg cggtgccgaa gctaacgcat taagttctcc gcctgggaag
901 tacggtcgca agactaaaac tcaaaggaat tgacgggggn ccgcacaagc ggtggagcat
961 gtggtttaat tcgacgcaac gcgcagaacc ttacctggtc ttgacatcct cagaatcctt
1021 cagagatgag ggagtgcccg caagggaact gagagacagg tgctgcatgg ctgtcgtcag
1081 ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caaccctcgc ctttagttgc
1141 cacgcaagtg gatctctaga gggactgccg gtgttaaacc ggaggaaggt ggggatgacg
1201 tcaagtcctc atggccttta tgaccagggc tacacacgtg ctacaatggc cggtacagag
1261 cgttgccaac ccgcgagggg gagctaatcg cataaaaccg gtctcagttc agattggagt
1321 ctgcaactcg actccatgaa ggaggaatcg ctagtaatcg cagatcagca cgctgcggtg
1381 aatacgttcc cgggccttgt acacaccgcc cgtcacacca tgggagtcga ttgctccaga
1441 aatcatctca ccaagaggtg ctcaaggagt ggtcggtaac tggggtgaag tcgtaacaag
1501 gtagccgtag gggaacctgc ggctggatca cctcctttct
//
LOCUS RBLRRDA 1491 bp ss-rRNA BCT 30-JUN-1990
DEFINITION R.acidophila 16S ribosomal RNA.
ACCESSION M34128
KEYWORDS 16S ribosomal RNA.
SOURCE R.acidophila (strain 7050) ribosomal RNA.
ORGANISM Rhodopseudomonas acidophila
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 1491)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
Microbiology Department, 131 Burrill Hall
University of Illinois
407 S. Goodwill Ave.
Urbana, IL 61801
E-mail: ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1491 16S ribosomal RNA
BASE COUNT 364 a 341 c 449 g 300 t 37 others
ORIGIN
1 tacaacttga gagtttgatc ctggctcaga acgaacgctg gcggcaggct taacacatgc
61 aagtcgaacg ggcatagcaa tatgtcagtg gcagacgggt gaggaacacg tgggaacatg
121 ccctttggtt cggaacaact cagggaaact tgagctaata ccggataagc ctttcgaggg
181 aaagatttat cgccgaagga gtggcccgcg tccgattagc tagttggtga ggtaatggct
241 caccaaggcg acgatcggna gctngtctga gaggatgatc agccacattg ggactgagac
301 acggcccaaa ctcctacggg aggcagcagt ggggaatatt ggacaatggg cgcaagccng
361 atccagccat gccgcgtgag tgatgacggc cctagggttg taaagctctt tcacccacga
421 cgataatgac ggtagtgggc gaagaagccc cggctaactt cgtgccagca gccgcggtaa
481 tacgaagggg gcnagcgttg ttcggattta ctgggcgtaa agcgcacgta ggcngatctt
541 taagtcaggg gtgaaatgcc ggagctcaac ttcggaactg cctttgatac tggagatctt
601 gagttcgaga gaggtgagtg gaactgcgag tgtagaggtg aaattcgtag atattcgcaa
661 gaacaccagt ggcgaaggcg gctcactggc tcgatactga cgctgaggtg cnnaagcgtg
721 gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgga tgctagccgt
781 taggcagctt gctgcttagt ggcgcagcta acgctttaag catcccgcct ggggagtacg
841 gtcgcaagat taaaactcaa aggaattgac gggggcncgc acaagcggtg gagcatgtgg
901 tttaattcga agcaacgcgc agaaccttac cagcttttga catgtcttgg acggatagca
961 gagatgttat cttctcttcg gagccgagaa cacaggtgct gcatggctgt cgtcngctcg
1021 tgtcgtgaga tgttgggtta agtcccgcaa cgagcgcaac ccacgtcctt agttgccatc
1081 attaagttgg gcactctagg gagactgccg gtgataagcc gagaggaagg tgtggatgac
1141 gtcaagtcct catggccctt acaggctggg ctacacacgt gctacaatgg cggtgacaat
1201 gggaagcgaa agggcgacct ctagcaaatc tcaaaaaacc gtctcagttc ggattgtact
1261 ctgcaactcg agtgcatgaa ggtggaatcg ctagtaatcg cagatcagaa cgctgcggtg
1321 aatacgttcc cgggccttgt acacaccgcc cgtcacacca tgggagttgg ctttacccga
1381 aggcgtttcg ctaaccgcaa ggaggcaggc gaccacggta gggtcagcgn ctnggnnnnn
1441 nnngtaacaa gnnnnnnnnn nnngaacctg nnnnnngatc acctcctttc t
//
LOCUS RCARRDA 1470 bp ss-rRNA BCT 30-JUN-1990
DEFINITION R.capsulatus 16S ribosomal RNA.
ACCESSION M34129
KEYWORDS 16S ribosomal RNA.
SOURCE R.capsulatus (strain B10, ATCC 33303) ribosomal RNA.
ORGANISM Rhodobacter capsulatus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 1470)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
Microbiology Department, 131 Burrill Hall
University of Illinois
407 S. Goodwill Ave.
Urbana, IL 61801
E-mail: ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1470 16S ribosomal RNA
BASE COUNT 343 a 330 c 423 g 275 t 99 others
ORIGIN
1 ttcaacttga gagtttgatc ctggctcaga acgaacgctg gcggcaggcc taacacatgc
61 aagtcgagcg agaccttcgg gtctagcggc ggacgggtga gtaacgcgtg ggaacgtgcc
121 ctttgctacg gaatagcccc gggaaactgg gagtaatacc gtatgtgccc ttcggggnaa
181 agatttatcg gcaaaggatc ggcccgcgtt ggattaggta gttggtgggg taatggccta
241 ccaagccgac gatccatagc tggtttgaga ggatgatcag ccacactggg actgagacac
301 ggcccagact cctacgggag gcagcagtgg ggaatcttag acaatggggg aaaccctgat
361 ctagccatgc cgcgtgagcg atgaaggcct tagggttgta aagctctttc aggtgggaag
421 ataatgacgg taccaccaga agaagccccg gctaactccg nnnnnnnngc cgcggtaata
481 cgnnnnnnnn nnnnnnnnnn nngaattact gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
541 nnnnngnggt gaaatcccag ggctcaacct tggaactgcc tttgaaactc ctggtcttga
601 ggncgagaga ggtgagtgga attccgagtg tagaggtgaa attcgtagat attcggagga
661 acaccagtgg cgaaggcggc tcactggctc gatactgacg ctgaggtgcg aaagcgtggg
721 gagcaaacag gattagatac cctggtagtc cacgccgtaa acgatgaatg ccagtcgtcg
781 gcaggcatgc ctgtcggtga cacactaacg gattaagcat tccgcctggg gagaacggtc
841 gcaagattaa aactcaaagg aattgacggg ggcccgcaca agcggtggag catgtggttt
901 aattcgaagc aacgcgcaga accttaccaa cccttgacat cgagatcgcg gttaccagag
961 atggtttcct tcagttcggc tggatcttag acaggtgctg catggctgtc gtcagctcgt
1021 gtcgtgagat gttgggttaa gtccggcaac gagcgcaacc cacactttca gttgccatca
1081 ttcagttggg cactctggaa gaactgccga tgataagtcg gaggaaggtg tggaggacgt
1141 caagtcctca tggcccttac gggttgggct acacacgtgc tacaatggtg gtgacaatgg
1201 gccaatccca aaaagccatc tcagttcgga ttggggtctg caactcgacc ccatgaagtn
1261 ggaatcgcta gtaatcgcgt aacagcaaga cgcggtgaat acgttcccgg gccttgtaca
1321 caccgcccgt cacaccatgg gaattgggtc taccctaaga aggtgcgcca accagcaang
1381 gaggcagcna gccacggtag gcacaggnng gnnnnnnnnn nngtaacaag nnnnnnnnnn
1441 nngaacctgn nnnnngatca cctcctttct
//
LOCUS RMVRRDA 1484 bp ss-rRNA BCT 30-JUN-1990
DEFINITION R.vannielii 16S ribosomal RNA.
ACCESSION M34127
KEYWORDS 16S ribosomal RNA.
SOURCE R.vannielii (strain EY33) ribosomal RNA.
ORGANISM Rhodomicrobium vannielii
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 1484)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
Microbiology Department, 131 Burrill Hall
University of Illinois
407 S. Goodwill Ave.
Urbana, IL 61801
E-mail: ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1484 16S ribosomal RNA
BASE COUNT 343 a 343 c 433 g 292 t 73 others
ORIGIN
1 caaaatctga gagtttgatc ctggctcaga acgaacgctg gcggcaggct taacacatgc
61 aagtcgaacg caccgcaagg tgagtggcag acgggtgagt aacgcgtggg aaccttccct
121 ttggtacgga ataacttcgg gaaaccgaag ctaataccgt atatctcctc cgggagaaag
181 atttatcgcc aaaggatggg cccgcgttgg attagctagt tggtgtggta acggcgcacc
241 aaggcgacga tccatagctg gtctgagagg atgatcagcc acactgggac tgagacacgg
301 cccagactcc tacgggaggc ngcngtgggg aatcttggac aatgggggca accctgatcc
361 agccatgccg cgtgagtgaa gaaggcctta gggttgtaaa gctcttttgg cggggacgat
421 aatgacggta cccgcagaat aagccccggc taacttcgtg ccagcagccg cggtaatacg
481 aagggggcna gcgttgttcg gaatcactgg gcgtaaagcg tacgcaggcg gattgataag
541 tcaggggtga aatcccgggg ctcaacctcg gaattgcctt tgatactgtc tgtctcgagt
601 tcgggagagg ttggcggaat tcctagtgta gaggtgaaat tcgtagatat taggaagaac
661 accagtggcg aaggnngcca actggcccga tactgacgct catgtacgaa agcgtgggga
721 gcaaacagga ttagataccc tggtagtcca cgctgtaaac tatggatgct agccgttggg
781 gagcttgctc ttcagtggcg cagctaacgt cttaagcatc ccgcctgggg agtacggtcg
841 caagattaaa actcaaagga attgacgggg gcccgcacaa gcggtggagc atgtggttta
901 attcgaggca acgcgaagaa ccttaccagc tcttgacatg tcgtgctacg tggagagatt
961 cacggttccc ttcggggacg cgaacacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt
1021 gagatgttgg gttaagtccc gcaacgagcg caaccctcgc ccttagttgc taccatttag
1081 ttgagcactc taaggnnacc gccggtgata agccggagga aggtggggat gacgtcaagt
1141 catcatggcc cttacgggct gggctacaca cgtgctacaa tggcggtgac agtgggcagc
1201 gacacagcga tgtgatgcta atcccaaaaa gccgtctcag ttcagattgc actctgcaac
1261 tcgagtgcat gaagtcggaa tcgctagtaa tcgcggatca gcatgccgcg gtgaatacgt
1321 tcccgggcct tgtacacacc gcccgtcaca ccatgggagt tggttttacc cgaaggcgtn
1381 angctaaccg nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngtaa
1441 caagnnnnnn nnnnnngaac ctgnnnnnng atcacctcct ttct
//
LOCUS THRRRDA 1528 bp ds-DNA BCT 30-JUN-1990
DEFINITION T.roseum 16S ribosomal RNA gene.
ACCESSION M34115
KEYWORDS 16S ribosomal RNA.
SOURCE T.roseum (strain ATCC 27502) DNA.
ORGANISM Thermomicrobium roseum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1528)
AUTHORS Oyaizu,H., Debrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and
Woese,C.R.
TITLE The green non-sulfur bacteria: A deep branching in the eubacterial
line of descent
JOURNAL Syst. Appl. Microbiol. 9, 47-53 (1987)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 15-MAY-1990.
FEATURES from to/span description
rRNA 1 1528 16S ribosomal RNA
BASE COUNT 295 a 416 c 585 g 226 t 6 others
ORIGIN
1 ggggatggag agtttgatcc tggctcaggg ggaacgctgg cggcgtgcct aatgcatgca
61 agtcggacgg gangcacgcn ctctggcgtg ccgaccgtgg cggacgggtg cgtaacacgt
121 ggggaaccct cccgggtgcg ggggataacc cggggaaact cgggctaata ccccatacgc
181 ttggctggcg gtgggccggc caaggaaagg cgcaggcgac tgtgctgtgc tcggagggcc
241 ctgcggccta tcagctagac ggtagggtaa cggcctaccg tggcgatgac gggtagctgg
301 tctgagagga tggccagcca cacgggcact gagacacggg cccgactcct acgggaggca
361 gcagcaggga atcttccgca atgggggcaa ccctgacgga gcgacgccgc gtgcgggagg
421 aagcccttcg gggtgtaaac cgctgttcgg ggggacgatc gagcgacggt accctcggag
481 caagtcccgg ctaactacgt gccagcagcc gcggtaagac gtagggggcg agcgttaccc
541 ggagtcactg ggcgtaaagg gcgtgtaggc ggctgggtac gccgcgtgtg aaagtccccg
601 gctcaaccgg ggagggtcgc gcgggacggc ctggctcgag ggcgggagag gcgggtggaa
661 ttcccggtgt agcggtgaaa tgcgtagaga tcgggaggaa cgccggtggc gaaggcggcc
721 cgctggcccg tacctgacgc tgaggcgcga aggcgtgggg agcgaaccgg attagatacc
781 cgggtagtcc acgcagtaaa cgatgcgggc gaggtgtggg tggtgttgac cccatccgtg
841 ccggcgccaa cgcagtaagc ccgccgcctg gggagtacgg ccgcaaggct aaaactcaaa
901 ggaattgacg ggggcccgca caagcagcgg agcgtgtggt ttaattcgac gcaacgcgaa
961 gaaccttacc agggcttgac atgccaccga acctggctga aaggctgggg tgcccggtga
1021 gggagcggtg gcacaggtgc tgcatggctg tcgtcagctc gtgccgtgag gtgttgggtt
1081 aagtcccgca acgagcgcaa ccctcggggt cagttacgcg ggtgtctgac ccgactgccg
1141 gggaaagccc ggaggaagga ggggatgacg tcaagtcagc atggccctga cgccctgggc
1201 gacacacacg ctacagtgac cgggacagtg ggcagcgaag gggcgacctg gagccaatcc
1261 cgcaaacccg gtcgtggtgg ggatcgcagg ctgcaacccg cctgcgtgaa cgcggagttg
1321 ctagtaaccg ccggtcagcc atacggcggt gaatacgttc ccgggccttg tacacaccgc
1381 ccgtcacgtc acgaaagctg gcttcacctg aagctggtgg gccaaccggc gcacgctggg
1441 ggcagccgtc gagggtgggg ctggtgattg ggacgaagtc gtaacaaggt agccgtaccg
1501 gaaggnnnng gtggatcacc tcctttct
//
LOCUS RATTATFL 11973 bp ds-DNA ROD 30-JUN-1990
DEFINITION Rat tyrosine aminotransferase (TAT) gene, 5' flank.
ACCESSION M34257
KEYWORDS tyrosine aminotransferase.
SOURCE Rat (strain Wistar) adult male liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 8314 to 8413)
AUTHORS Boshart,M., Weih,F., Schmidt,A., Fournier,R.E.K. and Schuetz,G.
TITLE A cyclic AMP response element mediates repression of tyrosine
aminotransferase gene transcription by the tissue-specific
extinguisher locus Tse-1
JOURNAL Cell 61, 905-916 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 11973)
AUTHORS Labeit,S., Strahle,U. and Weih,F.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by M.Boshart, 11-MAY-1990.
BASE COUNT 3162 a 2618 c 2675 g 3518 t
ORIGIN
1 gttcatgcat gtacaagtac atgtgaatgt gtgtgaacaa cttcagatgt cattacttgg
61 gccataatta ctgttgttga tgttgatagg gcctatcttc gcctgggggt ggccagtgag
121 gctggctaac tgctcagtca atccagacat ttgctcatct ctccgttccc aatgctggca
181 ttactacccg tgcctcccac accaccatgc ctggtatttt ttttttttcc tttttgtctt
241 tttaaatctg ggctgtaagg ttgaactcag gtccttgtgc atgcaagtca agtgttttac
301 agactgagct atctctccta gtcctgtgct ggaaattccc tcagacactt ctacatcaga
361 ttcaaatcct ttctctaggt gagagatgcc attcatttgc caacaaggcg gaatgggtga
421 gcttgtccaa aataccacgt ttaatcaggt tgtctgtttt ggttggctta tttgacactt
481 acaagtcaac attgcaacat tattatatat ttgtcttctg attttaaaaa tgatgggtga
541 tgcgcttgtg ttttaacctt ttctaggaga taaacgtcta cctgttgaat taacagactt
601 actttttaaa catcctgtcg tacatgacac ttttttttaa acatttattc atttattata
661 tataagtaca ctgtagctgc tcttcagaca caccagaaga gggcatcgga tctctttaca
721 gatggttgtg agccaccacg tggttgctgg gaattgaact caggacctct ggaagagtag
781 tcgggtgctc ttaaccgctg agccatttct ccagcccgac actttttaat actatataaa
841 acaagttaaa taaatacgat cgttgccctt agaaatgcac aaacacgaaa tgggacagac
901 ttttcacttt ctttcatgcg tagcacttcc acgaaaccac aagttaggaa caatgatcaa
961 atatttacag gacccctaag catcacagtt aataattttc atgtggggac gcacctatat
1021 gtgattggca gttgaatcaa caaagcagca aaaatcaaca actttaaaat aagatgtgta
1081 ggtcattttg gtcaactaaa atttgagaaa tcgctgattt gtgtatgcac gtgtgttcct
1141 aaaaccgtta ctttatgtag ttctttttca aagaaaggag atgggcaccc ccaggaaagc
1201 tggttatacc cattgcttgg agtttctgat gactgctttg acatcagggc agcattctgt
1261 catcatccgt gtgtactctg ggatgacagg gacctggacc tgtgtagaag agtcccctaa
1321 ctcccttctc actccccagg gcttaaggct tgcctgggtc acctgatgct cgtttgcagt
1381 gggcgaggta gggcaggtga ggagagtggg tccattgccg tgagacacgc aatcagtggc
1441 aatagtgaag tcactgagcc ggacgtttct caatatttgc tctggcagag ggaggggggg
1501 gggagggagg gagggaggga ggtagagctg tttttttgtt ctaagggact gaaaactctt
1561 ctggaaatct acggggaatg catggagagc cagcactggg tttggggctc agggggtctc
1621 gtggacagca tgtccagcct ttcctagtgc taatctactt ggcaaatgga gattcctcct
1681 tcgggtgtca gctcaggggt agatgaagca ccctctgcct tttccgggct catcctgaac
1741 ttcctataga acactgatct ataagcctct cgcctgtcct cctggagcac ctgccattca
1801 caggattcat cagggtggtg ttcagggaag acacccattg gggtcgtcaa cacaatagag
1861 ccaactgaag aagggaggat ctgcctccca ttttatattt attttattat tttatttgag
1921 acagagtatg tctatgtagc cttggctagg ctggaacttt ccttttttga gcaggttggc
1981 ctcaaactca ccaagattat ctggctatac atctcctgag tgctgggatt aaaggagtgt
2041 accactgtac ctggcttctc tctctctctg tctctgtctc tgtctgtctc tctgtctctc
2101 tgtctctctg tctctctctc tctctctctc tctttctctc tctctctctt ccttccttcc
2161 tcttcctttc cttctttcct tcttttcttc tctctctctt tctttcttcc tttctttctt
2221 tctttctttc tttctttctt tctttctttc tttctttctt tctttctttc tttctttctt
2281 ctttctcttc tagtttttgt gtgtgtgtct gtgtgttttg gatacagtct tgctatgtga
2341 ccctacgctc ttctcaaact tgtatttatt ttgccacagt tttcccagag ctgagagttg
2401 ctttccttaa aaaaaacaaa aaatcacagg ggttggggat ttagctcagt ggtagcgcta
2461 tggcctagca agcgcgaagg ccctgggttc ggtcctcagc tccgaaaaaa aagggaaaaa
2521 aaattacata ttacttttag agaataacag gtttctgttg tgaaatttcc gtaagtattt
2581 acaacaaatg cagtactgac tattaaacca agagccatgt gtatgctagg caaatgctct
2641 acccctgacc tgcgtaatgc actttgatga cactcttaac ccgttgcatc ctggtcacac
2701 tcttctcctt cctcttccca aatagataat cttctgcctt tatgcctttt tcttttctct
2761 acatctagat tccgcatgtg agattcttcc ttagctggtc ttctcttact gaacaccgtg
2821 ctctgtccag ttgtctgcaa atggcatcat tccgtccttt agagctgaac agaactccac
2881 cgtgtatgcg tactgcatct tttttatcca catctctgtg gacggaccct tagctgattg
2941 cgaagtctgc ctattacgaa tacagtgacg gcagatgcgc aggcatctac ttccgctcct
3001 ccaaggtgcg caggcgtctc ggcagtacac cgacttccgc tccacccggg acatacctgc
3061 gagtggtata gctaggtcac agggtgattc tatttttcga atctatacta atttccatgc
3121 ggctatatta atttacattc ccacaacaat atatggaggt tccattccgc ctgtacatta
3181 tatcccccct ctctttctct gcctcaagta tgctaggcaa gcacttttac cagtcagcca
3241 acattcccag cccacttatg gttttcttgg tgatgccatt atgattggga tgaggtgaac
3301 atctccatgt agttaagatt ggccttttcc tgattctatt tccctcctgt tcatcagaca
3361 gaccactgaa tagcctaatc atatggggcc ctgaggatct ccagttataa aagttaatga
3421 ttctggttgc ttaattgact taggtaacag ttaataaaac acagggcctc atgatttact
3481 gcctatgagg cagaatagct taactaccct gctgttagag gaggccacct tggcccgagg
3541 tttggttctg ttctttagta gctgtgtggc ctttggcaag ttattcaact ctcctatgtt
3601 tgggtttctg cagttttcaa atgcgtttat aatagtacct accacatggt ggaatcagta
3661 aagtaacaca gacttagaat tgccaagtgc caagaacaca agagatgctt gatcaatatt
3721 tacgggtgac taaaattatt atttctacat ttttagtcat tcttaaaagg caatgctcct
3781 tccccccttt ttcttctgga cagaatagaa aaaacatcgt tttcaattca taataaattt
3841 ccattctcat aaaatatcca gtaccattta aggagttaat acaagctcct tgaccctcat
3901 cacaggtatt aataaagctt tatgaattac attatcagtc acttgttcat tcagccatcc
3961 tactgagcac tggctgtatg atagacacca tctcatgaaa caacgataaa taaatattct
4021 gctctcatgg aattaatatt ccagcagggg gaaataggca ataattaaaa tatgtaaaat
4081 gtatactata ttttactgat tctaaaacat ccattttccc agatgttaaa acccagaatc
4141 taggtgtggt gcctcccatt tgtaatccta gtattcagga ggctgagcag aaggctgcaa
4201 gttcaaggct tgcctcagtt aaaaaaataa attctcaggg ctggggattg tattacttgg
4261 ttgttaggat gcttacctaa gcacacatta gaccctaaat ttgaccctca atactgcata
4321 aaactgggtc tggtgacatc caggtggtgg aggcaggagg atcagaagtt taaagttatc
4381 tggctacata gtgagttcca agacaggtca gccacaacct tgtctaaaaa aaaaaaaaaa
4441 accacacaat taatattcat ttcacactga ctgaccagca tttttctttt gaacaaaaaa
4501 caaaacaaaa caacatcaga ccaagacaac aaatgcgtca acagcctctt agttttgatg
4561 ctggacacag ttgcggttag atatagtctc ctcctcctgc ccgcttgctg gaaggtaaaa
4621 ccaatatcct ttgggtagca aaatcccggc accttctctc ggactttgtt ttgctcatag
4681 ttctgtggca ggaaacttga tgtcctagac ttgggtaatt cactgtgatt tggccctcgc
4741 tgatggtcct ctcccctctg taggtcccag cagcgagtct ggataaagat tgctgacttg
4801 cataatctct ggcatttgtt accaggcggg gtagggaacg agtgatcctg agacttcact
4861 gagggaggtc agcagcaaag ctctggaaca ttccatgttc cttggccaag tgcatgctcc
4921 acaacttaca gatcatggag actgaactgg gactttctgc agcaggaagt caggctcagc
4981 ccctacctgt ccgaatggca gaccttgtat tacccattaa gccacgggac tgggatgctc
5041 aggattgctg tggtctaagc aggtagtgag agatgagagg ctggctctga ctggtttggg
5101 ttattatttg tgagtgctca aaaggaaaca gatggtgcgg tcaacctagg cagtttacga
5161 aaagggactt tcacttgtga ctgtgttcta gcaggatttg tctgtggcct cttgtgcttc
5221 tgagcggaga ctgggaacga ttacatgggt tcacatgtgg tcacagtcat cctgggccaa
5281 caacaggctc agatgggcta gccatccaaa atggcttctt catcggttca gtgcgccgat
5341 agttttgcat cccctctttg acaaaacaca cgggatttct tttagtttta ttttaccttt
5401 atattttttc atcttattaa atctttccct ccactttatt tgtattgcca tataaccttt
5461 tccccacata ctatttcctg attttttgat ccccttccct ctgctcccag gtttccctgc
5521 ctttcctcct atccagattc actctctttc tgtctctcac tagaaaacaa ccaggcttct
5581 aaaggataat aataaaatat aatataacac aagaaaccaa catctaatac actgggataa
5641 agacaaaata aaaagaagaa aagaacacaa taagagacag acacagagac ccactagttc
5701 atacacttag gaatcccaca aaaacactaa actggaagcc ataatataga cgcaaaggac
5761 ctgtagggta aaaggagaga aaaatatgaa ataattaaat acaaaaaaaa aaaatagccc
5821 tggcgtggca ctgtgaaaca aggcatctcc aatgttgagt tggtttccca tggaccatct
5881 cctgctgggc atggggcccg cccctaagag tagtttgttt ccccagcgag actcccttgg
5941 agaaaagtaa aattttattt gccagtgttt ttaagtggtg ataacttctg gattagggat
6001 ggggtacgtg tccaactctc ctttcaactc taggacccat ctggtgcagc cctgtgacag
6061 ccctgcacgt gctgccacag tctctgtgag ttcctgtgtg gatcagtcct gttgaccaga
6121 gggcttgttt tctcggcgtc ttccatgcct ctggctctcc tgcctcctct tctgcagggt
6181 tcccagagac acccatttaa gccggggtat gccaaggttt ctcacgctct gcatgctgtc
6241 tggctacggc tttctatttg tcccatctgc cgaagaagga agcctctctg atgatgactg
6301 agcagggcac tgatctactt ccctgactga agcacacact ggatttcttt cacgaaagct
6361 cagggtacta agcgagattt ggtagacgaa ctggcctgtg ttcagaacgc tcagaatgac
6421 tcctctggct ttcatcggtc aacccagtca tggcaaactt atttcctgat aaaatgctca
6481 ggatgaggac tggattgcag atgcggccac aaataaaaga acacgaagtt caccctggag
6541 acaaacggct gaggtaagag tcatggttgc cacagttatg caaaacacaa aacaaataag
6601 gtggtgctat ttgctaaact ccgtgagtgg ggcacccaga aggcacctca gtctgcaaac
6661 agatgaagat tttatttagc ttttgaagag cagagtgctg aagtagagag aaaagccaca
6721 acctttaaag ggtgagcagc cctttgaaaa cccactatcc ctttatacca tgaacaaaac
6781 agcagatcta atgctggctc ttcctcccct cattaccttc tccctttttt atgtgctatt
6841 tactgccaac tggtgagcac gtccagaccg aggcttatag gaagcaccac gtaagacaag
6901 agggcataag tggagaacaa ggaattgatt gtatgtgcag tacccaagtg atactgtcca
6961 ttattctgga tggaaaggaa cgacttagct ggtgaggaga tggaaaaggc aggcggatgt
7021 ttcctgcatc agctgaagag ttagttgact attttttttc ttaatttgtt tttgtttgtg
7081 tatgagtgct ttgcgcgcac ctagtctgtg taccacatgt gtgcaatgct caaggaggcc
7141 agaagagggc gccgtatctc ctggagctgg cgttgcagat ggtcccgaac ggccacgggg
7201 gtgctatgaa ttgaacaaaa agtaggccag caggaagttc atcgtctcag gaaaaagtct
7261 cattaaaata tcctaaaata agacccatag agtaccaagc tgagagagta aggatataga
7321 atgtacgtct ggaacaatag acacatttca acatgttaat catgggcaga aagaattgat
7381 attgcttgtg tccaggtggt agaattactg aagaactttt aaaggtttat ttgtttttat
7441 tttatgtgtg ggagtgtttt gcctgcagga tttgagtgtg ctacgcgcac gtctgttctt
7501 cttggagggt gtcagatggc ctgaactgaa gttacagttg gttgtgagcc aactgtgtag
7561 atgctggaaa ccaaatcccc atcaggtcaa acaaatgctc tgaactgctg agccatctct
7621 ccagccgcct ggcggtagaa ttattttaaa aatgcttttg tatgccaggc atggtggctc
7681 acagctgtaa agccagaact tgggaggtag aggcagaagg atcagcagtt tgggccatca
7741 caggctggct acattgtgag tatgagggtc tcaaaaaaca aaacaaaaat tatcccaaag
7801 ataaagaaca acgacaactc caaaacaaac aattaaccca cttctggcaa aagcgacagc
7861 gtacacttag aaaagctcac tagaagtcat ttggttttta ttttgatatt agcttttact
7921 taggtatcaa ttcgtttgtg tgtttagttt tgagataccc cagactatag ttgtttgttt
7981 ttcaagggaa ggacttcact gtgcctgttg gtgtaacctt tagtgagtgt cagagcttgc
8041 tgggccttga ctcgaggtct agtttatctg gtattgcaga aacttcctat acagctctgg
8101 gtcctgtcac acactctccc accctgcctg gtctgctttc tgctgaggga aaacattagc
8161 ttccagaggc aggtagcacc agaacttgtt tgctctgcct gtttcccctc tgacctcagt
8221 tctgatgttg gctcccacac ccagatgaag gtgtgttcgt ggcattgacc tgatgatgca
8281 gctgccctgg ctcccttcgc atgcccctct gccctgcagc ttctgcgtca gcgccagtat
8341 ttgctggacc acttgccaac ccctgtgcag cctgctgctc tttgatctgt atacctgaag
8401 ccaagccagc ccggcctgta agcagctgtt ctagagcttt gcaacccgca tcctaagcca
8461 aggtctcatc taaggcaaaa ataagggggt tattttctac agggacaggt gcgggacact
8521 cacagatgct tcaagaacac agtgggccga gaaaatgaac aatgtccgtc tttgattaaa
8581 tggggcaaag gcaacatctt taatattatc attccgtcaa tcactttgag aaaagcttat
8641 ttaagaagac aattcccaat ggttttcaaa ccaccggtta ggaaggtaat tagcctttct
8701 gaaaatacca tcaaggagtc tgaagaactg taacgtcggg gtgtggtggc ccacaccttg
8761 tcctcccagc acctgggagg ccgagacaaa gaagatgtag agttgggggc cagcttctgg
8821 tactctcgat gttcaaaacc aaaacagaac ggtttaaaaa ttaagactct ggcatagaat
8881 attctagaat caaacgtacc acaaacagat tcaggaatac tcggaattca gtataaacta
8941 aagcaacttt ttaaaattag tagggagaac caggtgtggt ggtacacacc tttaattcca
9001 gcacatggag gcaggggcag gcagatcttt gtgagttcaa gggcagcctg gtctacatgg
9061 caagttccag gctagccagg gcagcatagt gaagctctgc ttacaaaagc aaaacaaaaa
9121 accgggaaaa ccttcagttt ttgaataaga cacatatgtg agttcctaag atgcatgaac
9181 gccacagaga ccactcaaga gagcaccaag gctcctacgc aaaggcaagt tagctccaga
9241 gccctaggag agctgctcat gggacagggc agcgacagcc accaagctga gagtcagctt
9301 gtgtgcccag gagggaggcg ttgggtcaga gcctctggag gacccctgaa gtctcttctc
9361 agtgttctct atcacaggga gagctgtcag cccctggaat gtggttctat gtctagaaaa
9421 ctatcccata aataacagga agcccaaggt ttaccaatct ctgctgtaca ggatgttcta
9481 gctactttat ttgcaataga aaatctgaaa gtttccccat gtccaacaag actagaacaa
9541 acaagtcctg cgtagtcgcc tgtcggtttc tgggtgtggt ggtatagccc tgtaatccca
9601 gcatttggga agctgaggtg ggaggatcgg gagttcaagg tcagcttggg ctacttagaa
9661 agaccttgtc tcaaaagaag tggagggggg gtggtggtgg tggtggtggt ggtgtaaaat
9721 tgatctcttt gtatgataat gtccatacaa tatattaata ttgaaaacag tcatgccata
9781 gaacaatttt gttcagtaaa tgtacatcac ataagtaact tcaaatttta gaagggataa
9841 ttttaaagtc aggcaggcag cattaaagaa gaatgaggaa atcaatttaa aattcttttt
9901 taaagactta cttactggat ttttgcctat gggtaatttg catgtatgtt ataacaccct
9961 gtatgtttct ggttcctatg gggatcagaa gagggtatta ggtcctctgg gactggaggt
10021 atagaccgta gtttgtgagt caccgtgtgg gttctaagaa ctgaaccggg tcttctggaa
10081 gggcagccag tgctcttaac ctctgagcca tcttgccatc cccaataaaa ttagttaaga
10141 aattattatt attgttgttg tcgttttcgt ccttattagt gtgtgtgttc agcagtcagt
10201 tctgtccttc cacagtggca tctaagagtg gaacctaggt tggtggtcag gcttgtgcag
10261 gaagtgattt ttgccggctg agcaatctca tcaggcttcg gacatttttt aactgttttt
10321 atatttttat tttatagttt ttattgtttt aattcaaagt gtcagagaat ctttcaaaat
10381 gttgcaatcc atataaaaat gaacgaggtg gttggcattt ttaacactaa gtcttcgcaa
10441 tgcaacgtct cttttatatc aacaatgcat ctcactttag accagccaca tatcaaggat
10501 caaaagtcaa ttatggctgt agccattata ggaggccgtg caactctaaa gtatgtacag
10561 ccggcggtac agcgtttgct ttggttcaat ttcctatatc aaaaaggagt gtgtgggagg
10621 gagttgtaga aagtgactga ggaagacacc cagtgtccta atgtcaacag gaggatccac
10681 agcaggactc tggcaacacc cgtgtttatg aaacatctct gcgtgctgta gcttttattg
10741 gggggtttca gtgatccgtg tgtttgggca ctagaactct gtcggaccca gggggatgtg
10801 agccttgtgc atctaaatcg cagtccgcct tttcaggccc atcgcaggaa gaagccattg
10861 tagagatgca gcgtggtggt gggagcactg cacatgcgca gagacgctac tatgcaaata
10921 atagtctagc gcctcttgtg gacggtgttg tagctgcggt ctgtgcctgt tggagcaagg
10981 agcctgtgtg gccctggaat cagattacct gctttagggt aggagagaga tcctaccact
11041 gttgttggtt cctttgttaa cgttgtgatc agttacaagc tgctctggga attttccact
11101 gtttaaagaa cgacttcaga tcccttaact cgctgcactc agttttgtac tctgatgaaa
11161 atgaagctgc agagatcttt ttggctgcct tgtttcttga ttgattatta atttgtttga
11221 actgcccagt ttgttctaac tggagtccaa aactctgtgg tgtgccaaga aaaggaaatt
11281 aaggcatagg gcaaagaaat aataataacg actacggtga cgacgatgat gatgacgatg
11341 taaatgccat tatcgaacga gagcatgatt ctatctgtgc tttgaaaact tattaaattt
11401 tgtacgcatt tattctgcat ttctatttag ctatccgcat acttatctat tttggtgaat
11461 cagtgaaatt ttttatttgt ttttcccttg gaatgcggtt gaatttttgt ggatattacc
11521 gttgtccgta gcaaatccca catatggtag atgggtattc aaccaaactc ttatcattgc
11581 ttatatccag gagtgagact agagtgcaca gagaaagggg taccagaagg accctcgttt
11641 aaatgtattt ctgtactgtt aacacaattt tcataaccaa aaaagaccaa taaagttaat
11701 cactgtcacc ccaactgccc tccaccattt tctcaaacat ctcaccaaat gactagaaag
11761 agttaacagg attccagata cttgatgtaa ggacaaatcc cagattggaa ggtggcccag
11821 ggttggggtg agaaacagca gagtgggggg tggggtatgg gggtaggtcc gggggaggga
11881 cttagttctc actctcaacc aatagcacga aggcttcggg cccaacgccc attggctgaa
11941 actatttcaa gggtcaggac tgcacctgag ctc
//
LOCUS DRRADHA 1321 bp ds-DNA INV 30-JUN-1990
DEFINITION D.orena alcohol dehydrogenase (Adh) gene, 5' flank.
ACCESSION M33488 Z00032
KEYWORDS alcohol dehydrogenase.
SOURCE D.orena DNA.
ORGANISM Drosophila orena
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1321)
AUTHORS Moses,K., Heberlein,U. and Ashburner,M.
TITLE The Adh gene promoters of Drosophila melanogaster and Drosophila
orena are functionally conserved and share features of sequence
structure and nuclease-protected sites
JOURNAL Mol. Cell. Biol. 10, 539-548 (1990)
STANDARD simple staff_review
BASE COUNT 409 a 295 c 260 g 357 t
ORIGIN
1 tctagattga ttctactacg ctgcctccag cagacacgcc cccccagcac cccatcccct
61 cccgctggca ccagtttcag tattacacgt atggaaatta agccgaagtt caattgcgag
121 cgcaggaaca acacgagctt gcatctttct acacttcttg gcgcgctatg cttgacattc
181 acaaggtcga agctcttagc gttctgactc ggggtgctac actgcacaaa attacattat
241 gcattcttca aatatcccta atataccaaa tatgtattaa aaagtgatca tttaccgatc
301 gttcgaaaga cggtataggc ttacaaaaaa atttgccaaa gtaatttaca aattcataat
361 ggttttttca agtatataat atgtacatca atattttctt acatgtattt tatgggatga
421 ttatgtttta aactacactc aattttttct cagtgcactt cctggtgttc cattttctat
481 tgggtccgta cccggcgttt gtttgcagat cacttgcttg cctattatta tagcatttta
541 cacattacaa aattctggac gtcgctgctg cagccgctgt cgacgtcaac tgcactcgcc
601 cccacgacaa aacggtattt aaggcgctgt gcaagtccca gtcgccgatt atgtctcagt
661 gcagttgtca gttgcagttc agcagaccgg ctagcgagta cttgcatctc ttcaaattta
721 cctaattgat caagtaagtg ggaaagggac ctcattatgc aatgtcgaat agtaagagat
781 cactatcact aatggtggag cataataaaa tcaattgcat gcaatcgaaa tgaatgcaaa
841 ccggcacaag cagtagcaaa cctactaaac aaattaaaat ttggaggctg tgccgtggcg
901 aatatttgac ttggaaaatc acctgttgtt taaccgctaa aaataggaat tttaacatta
961 agcatccctg ttaatcggcg ccgtgccttc gttagctatc tcaaaagcgc gcgcgtgcag
1021 acgagcagta attttccaag catcaggcat agttgggcat aaattataaa catagaaact
1081 gaatactaat atagaaaaag ctttgccgcc acaaaatccc aaacaaaaac aaaatatgtg
1141 ccgaaaaata aaacaaaaat aaaccgtaaa ccgggcagcg ttgccgtcgt tgcgggctgt
1201 gaagcttacg tgaatagccg agagatcgcg taatgataga taaagaaagc tctacgtaag
1261 cgaagcttct gggggataga tcttcctata aatacgggac cgacgcgaac tggaaacgaa
1321 c
//
LOCUS M33489 1380 bp ss-mRNA BAD 30-JUN-1990
DEFINITION D.melanogaster alcohol dehydrogenase (Adh) mRNA, 5' flank.
ACCESSION M33489 Z00032
KEYWORDS alcohol dehydrogenase.
SOURCE D.melanogaster, cDNA to mRNA.
REFERENCE 1 (bases 1 to 1380)
AUTHORS Moses,K., Heberlein,U. and Ashburner,M.
TITLE The Adh gene promoters of Drosophila melanogaster and Drosophila
orena are functionally conserved and share features of sequence
structure and nuclease-protected sites
JOURNAL Mol. Cell. Biol. 10, 539-548 (1990)
STANDARD simple staff_review
COMMENT Secondary reference.
BASE COUNT 426 a 290 c 241 g 423 t
ORIGIN
1 tctagattga ttctacgctg cctccagcag ccacccctcc catccccatc cccatcacca
61 tccagtcccg ttggctccca gtcacagtat tacacgtatg caaattaagc cgaagttcaa
121 ttgcgaccgc aggaacaaca cgatctttct acacttctcc ttgctatgct tgacattcac
181 aaggtcaaag ctcttaatat tctggcttgt ggccctacac tgtaagaaat tactatagaa
241 ataagggtac acggaataag atattttttt tagtccatat gcttttaaca aatgtgtttt
301 gagtttatgt tatattattg ttagaaaaca ggtgtttttt tttaaatcgg ttaaaaaatt
361 actacgagag aaaaatacaa attttgtaaa taagattgac tctttttcga ttttggaata
421 ttttcattca ttttatgttt ttacgttttc acttatttgt ttctcagtgc actttctggt
481 gttccatttt ctattgggct ctttaccccg catttgtttg cagatcactt gcttgcgcat
541 ttttattgca ttttacatat tacacattat ttgaacgccg ctgctgctgc atccgtcgac
601 gtcgactgca ctcgccccca cgagagaaca gtatttaagg agctgcgaag gtccaagtca
661 ccgattattg tctcagtgca gttgtcagtt gcagttcagc agacgggcta acgagtactt
721 gcatctcttc aaatttactt aattgatcaa gtaagtagca aaagggcaca caattgaagg
781 aaattcttgt ttaattgaat ttattatgca agtgcggaaa taaaatgaca gtattaaata
841 gtaaatattt tgtaaaatca tatataatca aatttattca atcagaacta attcaagctg
901 tcacaagtag tgcgaactca attaattggc atcgaattaa aatttggagg cctgttccgc
961 atattcctct tggaaaatca cctgttagtt aacttctaaa aataggaatt ttaacataac
1021 tcgtccctgt taatcggcgc cgtgccttcg ttagctatct caaaagcgag cgcgtgcaga
1081 cgagcagtaa ttttccaagc atcaggcata taatatacta atactaatac taatactaat
1141 ataagaatac taatatagaa aaaaagcttt gccggtacaa aatcccaaac aaaaacaaac
1201 cgtgtgtgcc gaaaaataaa aataaaccat aaactaggca gcgctgccgt cgccggctga
1261 gcagcctgcg tacatagccg agatcgcgta acggtagata atgaaaagct ctacgtaacc
1321 gaagcttctg ctgtacggat cttcctataa atacggggcc gacacgaact ggaaaccaac
//
LOCUS RATRIBOI 1456 bp ds-DNA ROD 30-JUN-1990
DEFINITION Rat ribophorin I (Rpn-I) gene, 5'end.
ACCESSION M33508
KEYWORDS ribophorin I.
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1456)
AUTHORS Behal,A., Prakash,K., D'Eustachio,P., Adesnik,M., Sabatini,D.D. and
Kreibich,G.
TITLE Structure and chromosomal location of the rat ribophorin I gene
JOURNAL J. Biol. Chem. 265, 8252-8258 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Kreibich, 03-APR-1990.
FEATURES from to/span description
pept 1362 > 1456 ribophorin I (Rpn-I)
mRNA 1338 > 1456 Rpn-I mRNA (alt.)
mRNA 1341 > 1456 Rpn-I mRNA (alt.)
mRNA 1347 > 1456 Rpn-I mRNA (alt.)
mRNA 1352 > 1456 Rpn-I mRNA (alt.)
mRNA 1353 > 1456 Rpn-I mRNA (alt.)
mRNA 1359 > 1456 Rpn-I mRNA (alt.)
binding 1314 1324 SP1 binding site
binding 1325 1333 SP1 binding site
binding 1247 1256 SP1 binding site
signal 1297 1300 CAAT box
BASE COUNT 342 a 325 c 336 g 453 t
ORIGIN
1 ctgcaggtgc taatttcctg acttcttctc tgtagggagg agcatgtcct tccctccgct
61 ctctcattag ccagtgtcag tgtttgaata tttccatctc cttccaaaat ttttttttct
121 tttttttttt ttcggagctg gggaccgaac ccagggcctt agcgcttgct aggccaagct
181 accactgagc taaatcccca acccccaaaa tttaccttta aattacatcc ttcaagcaac
241 aatattggga catgttcatt agatcagaat tgataagagg caacacggaa atgaaagagg
301 tgacagatct ctaccaaggc cttattgccc ttagtcagag cgaatgaagg agcagcagtc
361 tccttttgtg agtggagact gtacaccaca atcgtgcagg gggcttcgag attttatagg
421 gttgaaacgg gactttaggg gagcaaagtt ttatcagtag atttttcttt atcattatta
481 tcatcattgt tgttgttgtt tttaaatggt ttcaaggccc aataatgcca gtagatgttc
541 tctttgctac attatttttc ttatcaggtg ttatcagtat accgactaga gtttttccac
601 tagcttgaag ttcatttgtt ttaggtcatt aattcttcag tgtgactttg ggagagacag
661 cactagagga atccatctgc aagaagcctt tgtaaaaggt cttgccgcag agagaccctt
721 tcctgccttt ctgtcatatg acgatatgcg attccttctt tagacaatgc aatgccaaag
781 ttccatcttg gtagtaagga gcagccctca cctgacagct tagtcctgcc agtaacttca
841 ccttggactt ctacaattgt gctaagtgcc aacagatact tgatgttaat atcatcagcc
901 ttggttagtt gcttaaggtt tctgccatcg ctcatattaa aaattgaact aaatattagg
961 ggagagactt gaatctttgg aaatttcccg tttttcttta aacttttgcc cacttagtat
1021 tcatatcaat gtagagaagg caaacttatt tccttagact aagctgtttc tgagcggaga
1081 ggccattttt tttttttttt ttttttttgc ataagcatga aggaaacagg agataaggaa
1141 ggacgactac cacagaaatc actggtgagt ctagctaata agcgctcaaa gacgtgtgcc
1201 tgcgctttaa gatttttcaa tcaggcgccc catcgcggtt gccttggaga cggggtcctt
1261 cctgacccgg cccagccatc ttgtggtgct gttggccaat caacggttgc gccgtggcgg
1321 gggggggcgg ggcgcgtgtc ggtggcgctc agctcgcggt catggaggcg ccgatcgtct
1381 tgctgctgct gctatggctc gccttgcccg acgcctggca gcgcctcttc ggaggctccg
1441 ccgctggtca acgagg
//
LOCUS DROCYCB 2612 bp ss-mRNA INV 30-JUN-1990
DEFINITION D.melanogaster cyclin B mRNA, complete cds.
ACCESSION M33192
KEYWORDS cyclin B.
SOURCE D.melanogaster 3-12 hour embryo, cDNA to mRNA, clone Nr 4/4.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2612)
AUTHORS Lehner,C.F. and O'Farrell,P.H.
TITLE The roles of Drosophila cyclins A and B in mitotic control
JOURNAL Cell 61, 535-547 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.F.Lehner, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept 250 1842 cyclin B
mRNA < 1 2612 cyclin B mRNA
BASE COUNT 710 a 670 c 635 g 597 t
ORIGIN Chromosome 2; map postion 59A
1 aattcaaacg gcagcttggc ctcgcttcgt gtgcaaaact cgatcaggtt ttcggataat
61 tgacaatccc gcactcgact tgtaaattat aaactattag taatcatata ttcggacaac
121 aaattcaaca gctaaacaag cgttcggtca cagaaacgcg atcaaaagtc aaccaaacca
181 agtgatagcc aagcgtctgc ctatcttcgt gttaattgtg tttgtacaga tagaaaagaa
241 gcaatcaaaa tggtgggcac aacactgaaa atgcgtggcg atgagaacgc ttcggagaac
301 ttcaagcaag tgcaattgaa gaaattgacg gttccttcca tggaggcaac aacaaaacgc
361 gcggccttgg gcgatttgca gaatcgcggc ataagtcgtc ccatcgcagc gaaggatgcg
421 gcacagaaag actccaagga tctcaagctc acagacgccc tgcgcaatgc caaagctcgg
481 gtggacagcc actggaagaa acagccactg ggcagcacca atggcaatgg caatggcgcc
541 gttccgccca aggtcaacga ggggggcgtg tcggcgtttt tgcgttcgaa ttcggtgcgc
601 aatcgcgttc cgaccaagac cactgtagaa cccactaaag ttacagtcaa gtccagttct
661 tccgagaacg tgaacgagcc caccttgaag cgcgaggaca gcaatctgtc gaagaagtcg
721 ctgaccaaac tcggtgccgc tttggccaaa cccgtgatgg gagtttcagg aattcgacgg
781 gaaccagtag ctgtttcccg caaagaggca gagaccaaga aggaactgcc agaaaccaag
841 aaggactcac tggaagtgaa aaaggatgcg accaggatgc cccttattag gggcaacagt
901 gcagtcacta cgaccacatc cacgatgccc accaccatgt ccctttccag caagcgcttg
961 gctgggatcg aggacattga tgccaatgac aaggagaacc tggtactggt ctccgaatat
1021 gtaaacgaca tctacgacta cttgtatcag gtggagctgg agcagcccat tcacaaggat
1081 cacctggccg gacagaagga ggtgtcccac aagatgcgag ccgtgctgat cgattggatc
1141 aacgaagtcc acctgcagtt ccatctggct gcagagacct tccagctggc ggtggctatc
1201 attgatcgct acctgcaggt ggtcaaggac accaaacgca cgtacttgca attggtggga
1261 gtgacagcac tcttcatagc caccaagtac gaggagctgt tcccgccggc aatcggagat
1321 ttcgtcttca tcacggacga cacctacact gcccggcaga tccgacagat ggagctgcaa
1381 atcttcaagg ccatcgactg taatctgtcg cgtccgctgc cgattcactt ccttcgacgc
1441 tactcgaagg ctgctggcgc cgaggacgag caccatacga tgtccaagta cttcatcgag
1501 ttagcttccg tggactacga aatggccact tacaggccat cggagattgc agctgcctca
1561 ctgttcctgt cgctgcactt gctcaatgga aaccaccggg ccggtacagg attcaacgac
1621 cgtcactgga cgcccactct gaccttctac tcgcgatact cggccgcgca cttgcgtccg
1681 attacccggc tgatcgcgaa actggcccgg gacgctcctc aggccaagct gaaggccatc
1741 tacaacaagt accagggcag caagttccag aagatcgcgc tgcgaacgga gctgaccggt
1801 gcgctgatgg actcgattgt gggccagagc cagaggaaat agtgcggtcc aaggcggact
1861 ggaagaccct gacttaccta gtttagttta atttgttttc atttttaaat ttgtagcgta
1921 tttcattttc tgttcgtttc gtgttcgtta aaaaatgcgt atagttaccg tagtcgcatt
1981 gccaactatc tttacctgca tcacccatcc ctaagatatc gtaatctgct ggagtccctt
2041 gagcagtttt cggctactgc caagagctgg ctcccggcat ctttgcccac ggagcacaag
2101 ttgctcgcga gccggccgct gggagtgagt tcctccggta agactaggaa ccgaactaaa
2161 ctggagcccg tcagctcctt tcgctgggtc accatctcag tgggagcgag cggcggagcg
2221 gtggacaccg agagagagag tgggcaacga gttcatttgc tggccgaaca catcggcgtt
2281 gtctctccaa gaattatatt ttataattgt acccgtgaaa tcatttcatc ccctacctac
2341 ctatcactaa ttgtatttgt cgttagtttt tagtttcgat gggaatgaat aaatccaaaa
2401 aattgtgcaa aattattccc ataaatcgca taaaaaagag acgtagacta tttgtaattt
2461 atatcatgta tttcgcacat tcatacaaaa acaacttatg ctagttatat tgtttaccac
2521 taagcaacga ttaaaacacg ttttaactaa ctgatgtttc ccaaaaatgt ctctgaacaa
2581 tacatagaaa ataaagtttt gtataatttt at
//
LOCUS RATSV4I1 37 bp ds-DNA ROD 30-JUN-1990
DEFINITION Rat/SV40 5' recombination junction.
ACCESSION K01218
KEYWORDS .
SEGMENT 1 of 3
SOURCE Rat (strain Fisher) SV40-transformed isogenic cell line SV14B.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 37)
AUTHORS Sambrook,J., Greene,R., Stringer,J., Mitchison,T., Hu,S.-L. and
Botchan,M.
TITLE Analysis of the sites of integration of viral DNA sequences in rat
cells transformed by adenovirus 2 or SV40
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 569-584 (1980)
STANDARD simple staff_review
COMMENT The three segments of this entry represent an integration of the
SV40 genome in which two SV40 fragments are integrated in opposite
orientations with a 40 bp insert of undetermined origin between
them. The authors report that SV40 and adenovirus inserts into host
cell DNA appears to be chaotic, with inversions, deletions, and
recombination with segments of uncertain origin.
FEATURES from to/span description
recomb 25 26 rat DNA end/SV40 DNA start
BASE COUNT 12 a 8 c 5 g 12 t
ORIGIN About 57 bp 3' to EcorI site.
1 acttcacatt ctaggatgtc tggctactaa aaaatct
//
LOCUS RATSV4I2 58 bp ds-DNA ROD 30-JUN-1990
DEFINITION Rat/SV40 recombination junction DNA and an insert of undetermined
origin.
ACCESSION K01219
KEYWORDS .
SEGMENT 2 of 3
SOURCE Rat (strain Fisher) SV40-transformed isogenic cell line SV14B.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 58)
AUTHORS Sambrook,J., Greene,R., Stringer,J., Mitchison,T., Hu,S.-L. and
Botchan,M.
TITLE Analysis of the sites of integration of viral DNA sequences in rat
cells transformed by adenovirus 2 or SV40
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 569-584 (1980)
STANDARD simple staff_review
COMMENT The three segments of this entry represent an integration of the
SV40 genome in which two SV40 fragments are integrated in opposite
orientations with a 40 bp insert of undetermined origin between
them. The authors report that SV40 and adenovirus inserts into host
cell DNA appears to be chaotic, with inversions, deletions, and
recombination with segments of uncertain origin.
FEATURES from to/span description
recomb 9 10 SV40 DNA end/insert of unkown origin DNA start
recomb 49 50 insert of unknown origin DNA end/SV40 DNA
start
BASE COUNT 11 a 17 c 11 g 19 t
ORIGIN About 3850 bp after segment 1.
1 tttgtgatct ttgcaatccc acagattgca gcataccagg cttccctgtc cttgtgca
//
LOCUS RATSV4I3 40 bp ds-DNA ROD 30-JUN-1990
DEFINITION Rat/SV40 3' recombination junction.
ACCESSION K01220
KEYWORDS .
SEGMENT 3 of 3
SOURCE Rat (strain Fisher) SV40-transformed isogenic cell line SV14B.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 40)
AUTHORS Sambrook,J., Greene,R., Stringer,J., Mitchison,T., Hu,S.-L. and
Botchan,M.
TITLE Analysis of the sites of integration of viral DNA sequences in rat
cells transformed by adenovirus 2 or SV40
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 569-584 (1980)
STANDARD simple staff_review
COMMENT The three segments of this entry represent an integration of the
SV40 genome in which two SV40 fragments are integrated in opposite
orientations with a 40 bp insert of undetermined origin between
them. The authors report that SV40 and adenovirus inserts into host
cell DNA appears to be chaotic, with inversions, deletions, and
recombination with segments of uncertain origin.
FEATURES from to/span description
recomb 16 17 SV40 DNA end/Rat cell DNA start
BASE COUNT 9 a 8 c 7 g 16 t
ORIGIN About 770 bp after segment 2.
1 gctttttaaa tcctcacttc aggggtaaga gtttcattct
//
LOCUS RATSV4J1 42 bp ds-DNA ROD 30-JUN-1990
DEFINITION Rat/SV40 DNA recombination junction, clone pEM2.
ACCESSION K02743
KEYWORDS .
SEGMENT 1 of 2
SOURCE Rat (strain Fisher) SV40 transformed cell line tsA30-N2 DNA, clone
pEM2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 42)
AUTHORS Hasson,J.-F., Mougneau,E., Cuzin,F. and Yaniv,M.
TITLE Simian virus 40 illegitimate recombination occurs near short direct
repeats
JOURNAL J. Mol. Biol. 177, 53-68 (1984)
STANDARD full staff_review
COMMENT [1] reports that of two junctions between rat and SV40 DNA studied,
one was a very abrupt transition, whereas the other (shown here)
had an area of homology (bp 16-26) between the two sequences at the
junction. The more likely recombination site at base 26 is given
in the sites table, although base 16 might be another candidate.
FEATURES from to/span description
recomb 25 26 rat DNA end/SV40 DNA (base 439) start
BASE COUNT 6 a 6 c 10 g 20 t
ORIGIN
1 atggctgtat gcttttttta gagcctttgc tgcaattttg tg
//
LOCUS RATSV4J2 1357 bp ds-DNA Rod 30-JUN-1990
DEFINITION Rat/SV40 recombination junction, clone pEM5.
ACCESSION K02744
KEYWORDS .
SEGMENT 2 of 2
SOURCE Rat (strain Fisher) SV40 transformed cell line tsA30-N2 DNA, clone
pEM5.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1357)
AUTHORS Hasson,J.-F., Mougneau,E., Cuzin,F. and Yaniv,M.
TITLE Simian virus 40 illegitimate recombination occurs near short direct
repeats
JOURNAL J. Mol. Biol. 177, 53-68 (1984)
STANDARD full staff_review
COMMENT [1] reports that of two junctions between rat and SV40 DNA studied,
one, shown here, was a very abrupt transition, whereas the other
had an area of homology between the two sequences at the junction.
A member of an interspersed middle-repetitive family bordered by 15
bp direct repeats is present at positions 595 to 790. Short direct
repeats (ttg)6 and (tatc)12 are present at positions 53-70 and
310-357, respectively. [1] speculates that the presence of these
short repeats provides a good environment for the viral integration
process.
FEATURES from to/span description
recomb 24 25 SV40 DNA (base 705) end/rat DNA start
unsure 306 307 tt could be ttt
BASE COUNT 336 a 273 c 228 g 520 t
ORIGIN About 5 kb after segment 1; 823 bp upstream of HindII site.
1 caacttgcat ctgttgctac tgttttttac tctgatgttg ctactgagaa ttttgttgtt
61 gttgttgttg gtggttttta aagacagtct cactatgtag tccaggctgg cctggaactt
121 gctctgtagg gtaagctggc catgaactca gtttgtatcc agtagctaga gtgagtcctt
181 gaacttggtc ctcctgcctt tgcctctcaa ctgctggcag ctactgagaa acttaactta
241 atctcattgc ttttttctct gacttttttt agaattggtt tttaatctta atactctatg
301 gtgcattcct atctatctat ctatctatct atctatctat ctatctatct atctatcacc
361 tatctctctt atatatctgt accctgtact ttttcagctt tttattatgg ggctcacgca
421 catgcacata atacgtctgc ataccatggc acacatgtga aggccacaca tgtagactaa
481 tcagttttct ccttctacta tgtgcatcct gggaatgaac ttaggtcatc gggcttggtg
541 gcaggtattt ttgtttgcta agctacctca ccagcccctc tggttctttt taaattcttt
601 tgagtttttt tttaatattt atttatttat tatacataag tacactgtag ctatcttcag
661 acacaccaga agagggcatt ggttgtgtga gccaccatgt agttgctggg atttgaactc
721 aggacctctg gaagagcagt cagtgctctt aaccgctgag ccatctctcc aacccttctt
781 ttgagttttt atgtgtcttt tctttttata aatttctaaa gtcaacgtaa aaagcattgg
841 atttgattac agcatgtttt tcttttatta aaaatattga ggcttttaac tctttagaga
901 caataaaccc aataagcatt agcataaata ccaattctta gaacaataca ttttcaaaca
961 cagaaacatt tactgaaaaa aggaactttt ttgcatcttt ataaatctgc tctattctag
1021 cttataaggt aaatgcttta ggctcatgtc caatttatat tcaatgtttg ttgatgtctg
1081 caatctatga ggaaaatcta acctcacaaa ggtgcatggt ttagaaacag aaagtttgac
1141 tttggttttg ttgttgtttg gatagtggtc tgtttggttg tgagtgtcac atttccttac
1201 atatgtgtgt gtcattatac ttttacatat tacttttaca tatgtgtcat tataactttg
1261 cctaatttga ccctctcctc ccctgctacc cgactctatt gctggttcaa ttcttcaccc
1321 caaatagtct ctcccagtat tcaggtcaca tgaattc
//
LOCUS RATSV4M1 37 bp ds-DNA ROD 30-JUN-1990
DEFINITION Rat/SV40 5' recombination junction.
ACCESSION K01093
KEYWORDS .
SEGMENT 1 of 3
SOURCE Rat cell line 14B DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 37)
AUTHORS Botchan,M., Stringer,J., Mitchison,T. and Sambrook,J.
TITLE Integration and excision of SV40 DNA from the chromosome of a
transformed cell
JOURNAL Cell 20, 143-152 (1980)
STANDARD full staff_review
COMMENT The authors report that parts of the SV40 genome are integrated in
opposite orientations and that a segment of SV40 has been deleted
in rat cell line 14B. There is a 40 bp insertion of unknown origin.
[1] analyzes 3 segments with rat proviral boundaries.
FEATURES from to/span description
recomb 25 26 rat DNA end/SV40 DNA start
BASE COUNT 12 a 8 c 5 g 12 t
ORIGIN
1 acttcacatt ctaggatgtc tggctactaa aaaatct
//
LOCUS RATSV4M2 58 bp ds-DNA ROD 30-JUN-1990
DEFINITION Rat SV40 proviral DNA with insert of undetermined origin.
ACCESSION K01094
KEYWORDS recombinant DNA.
SEGMENT 2 of 2
SOURCE Rat cell line 14B DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 58)
AUTHORS Botchan,M., Stringer,J., Mitchison,T. and Sambrook,J.
TITLE Integration and excision of SV40 DNA from the chromosome of a
transformed cell
JOURNAL Cell 20, 143-152 (1980)
STANDARD full staff_review
COMMENT The authors report that parts of the SV40 genome are integrated in
opposite orientations and that a segment of SV40 has been deleted
in rat cell line 14B. There is a 40 bp insertion of unknown origin.
[1] analyzes 3 segments with rat proviral boundaries.
FEATURES from to/span description
recomb 9 10 SV40 DNA end/DNA of unknown origin start
recomb 49 50 DNA of unknown origin end/SV40 DNA start
BASE COUNT 19 a 11 c 17 g 11 t
ORIGIN About 3085 bp after segment 1.
1 tgcacaagga cagggaagcc tggtatgctg caatctgtgg gattgcaaag atcacaaa
//
LOCUS RATSV4M3 40 bp ds-DNA ROD 30-JUN-1990
DEFINITION Rat/SV40 3' recombination junction.
ACCESSION K01095
KEYWORDS .
SEGMENT 3 of 3
SOURCE Rat cell line 14B DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 40)
AUTHORS Botchan,M., Stringer,J., Mitchison,T. and Sambrook,J.
TITLE Integration and excision of SV40 DNA from the chromosome of a
transformed cell
JOURNAL Cell 20, 143-152 (1980)
STANDARD full staff_review
COMMENT The authors report that parts of the SV40 genome are integrated in
opposite orientations and that a segment of SV40 has been deleted
in rat cell line 14B. There is a 40 bp insertion of unknown origin.
[1] analyzes 3 segments with rat proviral boundaries.
FEATURES from to/span description
prov < 1 16 SV40 DNA
BASE COUNT 9 a 8 c 7 g 16 t
ORIGIN About 818 bp after segment 2.
1 gctttttaaa tcctcacttc aggggtaaga gtttcattct
//
LOCUS XELXOTCH 9163 bp ss-mRNA VRT 30-JUN-1990
DEFINITION X.laevis Xotch protein mRNA, complete cds.
ACCESSION M33874
KEYWORDS Xotch.
SOURCE X.laevis, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 9163)
AUTHORS Coffman,C., Harris,W. and Kintner,C.
TITLE Xotch, the Xenopus homolog of the Drosophila notch gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by C.R.Coffman, 26-APR-1990.
Author address: C.R.Coffman
University of California
Dept. of Biology-B022,
La Jolla, CA 92093
FEATURES from to/span description
pept 346 7917 Xotch protein
mRNA < 1 9163 Xotch mRNA
signal 9146 9151 poly-A signal
rpt 409 4632 EGF-like repeats
rpt 4684 5025 lin-12/Notch repeats
rpt 5953 6591 cdc-10/SWI-6 repeats
BASE COUNT 2499 a 2192 c 2188 g 2284 t
ORIGIN
1 gaattccgga cgcctggtgg aactaaacag actggaatgc actggaacac ggaagcctat
61 gcggggatac attgtagcgc tgagcccttc cgctgtaaca ttgtagcgct gagccgctca
121 tacactgtag cgctgtaact tcaggctgtg cgcaaactga gtgcgctatt ctctctccaa
181 tctggaaact ctgtcgcttt tggaccaggg attgcactga cttggattat cttcaccttt
241 ctctaaggga tttatagatt tttgccttgg gacctttatc ctccccttgt cctgggacag
301 ggaccttcat caggatctat gttcctatag tgagactgca ggaaaatgga tcggataggg
361 ctggctgtgc tgctctgctc cctgcctgtc ctcacacaag gtcttcgatg cactcagact
421 gcagaaatgt gcctgaacgg tgggaggtgc gaaatgaccc ctggtggaac tggggtgtgc
481 ctatgcggta atttgtattt cggtgaaagg tgccagttcc ccaatccctg caccataaag
541 aatcagtgta tgaactttgg aacctgcgag cctgtgctac aggggaacgc catagacttc
601 atctgccact gccccgtggg tttcaccgac aaagtgtgcc tgactcccgt ggataacgcc
661 tgcgtcaaca acccgtgccg gaatggagga acctgtgaac ttttgaatag cgtgactgaa
721 tacaaatgca ggtgtccccc aggatggaca ggggattcgt gccagcaggc agatccatgt
781 gcctctaatc catgtgccaa tggtgggaag tgtttgccct ttgagatcca gtatatctgc
841 aagtgtcccc ctggattcca tggagctacc tgcaagcagg acattaatga gtgcagccaa
901 aatccctgca agaatggcgg ccagtgcatc aatgagtttg gatcataccg ctgcacctgt
961 cagaacaggt ttactggaag aaactgtgat gagccatatg tcccctgcaa cccatcgccg
1021 tgcttgaatg gagggacctg ccgccaaact gatgacacct cctatgattg cacttgtctt
1081 ccaggcttta gtggacaaaa ttgtgaagag aacattgacg actgcccgag caacaactgc
1141 cgcaatggtg ggacctgtgt ggatggagtc aacacctata actgccagtg tccacctgac
1201 tggacaggcc aatactgcac ggaggatgtg gacgaatgcc aactgatgcc aaacgcttgc
1261 caaaatgggg gaacctgcca caacacctac ggtggataca actgtgtttg tgtcaacggc
1321 tggaccggag aggactgcag cgagaacatt gacgattgtg ccaacgccgc ctgccatagt
1381 ggtgccacct gtcatgatcg cgtggcatct ttctactgcg agtgcccaca cggccggaca
1441 ggtctccttt gccacctgga taatgcctgt atcagcaatc cctgcaacga aggctccaac
1501 tgtgacacca acccagtcaa cgggaaagcc atttgcacct gcccacctgg gtacactggt
1561 cctgcttgca acaatgatgt ggatgagtgc tctctgggtg ccaatccatg tgagcatggt
1621 gggagatgta ccaacaccct gggttccttc cagtgcaact gcccccaggg atatgcagga
1681 ccccgctgcg agattgatgt caatgaatgt ctttccaacc cgtgccaaaa tgattccaca
1741 tgcctggacc agattggaga gtttcagtgt atatgtatgc caggttacga aggcctttac
1801 tgtgaaacca acatagatga atgtgccagc aacccctgcc tacataatgg caagtgcatt
1861 gataaaatca acgagttccg ctgcgactgt cccacaggct tcagtggaaa cctctgccag
1921 catgactttg atgagtgtac cagcactccg tgcaaaaatg gagccaaatg cttggatggt
1981 ccaaattctt acacctgcca atgtacagaa ggtttcaccg ggcgccactg cgaacaagac
2041 atcaatgaat gcattcctga tccgtgccac tacggcacct gcaaagacgg cattgctacc
2101 ttcacatgcc tgtgccggcc tgggtacacg gggcgtctgt gtgataatga catcaatgag
2161 tgcctgagca agccgtgcct aaacgggggg cagtgtacag accgagagaa tggatacata
2221 tgcacttgtc cgaaaggaac cacaggggtg aattgcgaaa ccaaaataga tgactgtgcc
2281 agtaacctgt gcgataatgg aaagtgcatt gacaagattg atggctacga gtgcacctgc
2341 gagcctggtt acacaggcaa gttgtgcaat attaacatta acgaatgtga cagcaatcca
2401 tgccgcaatg gagggacttg caaagaccag atcaatggct tcacatgtgt gtgcccagat
2461 ggctaccatg accacatgtg cctgtctgaa gtaaatgaat gcaacagcaa tccatgtatc
2521 cacggggcgt gccatgatgg agtcaatggg tataaatgtg actgtgaagc aggatggagc
2581 ggtagcaatt gtgacatcaa taacaatgag tgtgaatcca acccgtgtat gaatggtggt
2641 acctgcaaag acatgaccgg agcatacatc tgtacttgca aagcaggatt tagtgggcca
2701 aactgccaaa ctaacatcaa cgaatgttcc tctaacccct gcttgaacca tggaacgtgt
2761 attgatgatg ttgctggtta caaatgcaac tgcatgttac cttatacagg tgccatttgt
2821 gaggcagttt tggctccgtg tgcaggcagt ccttgtaaaa atggtggcag atgtaaagaa
2881 tcagaagact ttgaaacctt ctcttgtgaa tgtccacccg ggtggcaagg gcaaacatgt
2941 gagattgaca tgaatgaatg cgtaaatcgc ccttgccgta acggtgcaac gtgtcaaaac
3001 accaatggca gctacaagtg caactgcaaa cctggttaca ctgggcggaa ctgcgagatg
3061 gacattgatg actgccaacc aaatccttgt cacaatggag gatcctgttc tgacggcatt
3121 aatatgtttt tctgtaactg cccggccggg ttccgggggc ctaagtgcga agaggacatt
3181 aacgagtgtg ccagtaaccc gtgcaaaaat ggagccaact gcactgattg tgtaaacagc
3241 tacacatgca cctgccagcc tggatttagt ggcatccact gtgaaagcaa cacccctgac
3301 tgcaccgaga gctcgtgctt taatggtggc acctgcattg atgggatcaa tacgtttaca
3361 tgccagtgcc cacctggatt taccggaagc tactgccaac atgatatcaa tgaatgtgac
3421 tccaagcctt gtttgaacgg aggcacctgc caggacagtt acgggacata caagtgcacc
3481 tgtccgcaag gatacactgg gctgaactgc cagaacctgg tgcgttggtg tgactcttct
3541 ccctgtaaaa atggaggaaa gtgctggcag acaaacaact tttatcgttg tgagtgcaag
3601 agtggctgga ctggtgttta ctgtgatgtc cccagtgtct cctgcgaggt tgctgctaag
3661 cagcaaggtg tggatatcgt tcatctgtgt cgcaattctg gcatgtgtgt ggacactgga
3721 aacacacatt tctgccgctg tcaggctggc tacacaggca gttactgcga ggaacaggtg
3781 gacgaatgct cgccgaatcc ttgtcagaac ggtgccacat gcacagacta cttaggtggc
3841 tactcttgcg agtgtgttgc cggctaccat ggagtaaact gctccgagga gatcaatgag
3901 tgtttgtccc acccgtgcca gaatggagga acctgtattg atctcattaa cacctacaaa
3961 tgctcctgcc ccagaggaac tcaaggtgtg cactgtgaaa ttaatgtgga tgattgcacg
4021 cccttctacg actccttcac cttggaaccc aaatgcttca acaatggcaa atgtatagac
4081 cgcgtgggtg gctataactg catctgccct ccaggctttg taggggaacg ctgtgaaggc
4141 gatgtgaacg agtgcctgtc taatccctgt gactcccgtg gcacccagaa ctgcattcaa
4201 ctggtgaatg attaccggtg tgagtgccgg cagggattca caggaaggcg ctgtgaatct
4261 gttgtggatg gttgtaaggg aatgccctgc cgaaacggtg gaacatgtgc tgtcgccagc
4321 aatacagaac gtggatttat ttgcaaatgc cctcctgggt ttgacggagc cacttgcgag
4381 tatgattcac gaacctgcag caaccttcgc tgccagaatg gcggcacttg catttcggtg
4441 ctgacgagtt ccaaatgcgt gtgctcggaa ggctatactg gcgccacatg tcagtaccct
4501 gttatcagtc cttgtgcttc ccacccttgt tacaatggag gaacctgcca gttctttgct
4561 gaggaacctt tcttccagtg cttctgtcca aaaaacttta acgggctctt ctgccatatc
4621 ttggattatg agtttcccgg gggactaggc aagaatatta ctcctcctga taatgacgat
4681 atctgcgaaa acgagcagtg ctctgagctg gctgacaaca aagtctgcaa cgccaactgc
4741 aacaaccatg cttgcggatg ggatggagga gactgctctc tcaacttcaa tgacccatgg
4801 aagaactgca cccagtcgct gcagtgctgg aaatatttca atgatggcaa atgcgattct
4861 cagtgcaaca acactgggtg tctgtacgac gggtttgact gccagaaagt tgaggttcag
4921 tgcaaccctt tgtatgacca gtactgcaag gatcactttc aagatggcca ctgcgaccaa
4981 ggctgtaaca atgcagagtg cgaatgggat ggcctggact gcgccaacat gccagagaac
5041 ctggctgaag gcaccctggt gttggtcgtc ctcatgcccc ctgaaaggct taagaacaac
5101 tctgtcaatt tcctccgtga gctgagccga gtgctccaca caaatgtggt gttcaagaaa
5161 gacagcaaag gcgaatacaa gatctaccct tactatggca atgaggaaga acttaaaaaa
5221 catcacatca agaggtcaac tgattactgg tcggatgctc cttctgccat cttcagcaca
5281 atgaaagaat ccattttact aggaaggcat cgtagggagt tggatgaaat ggaagttagg
5341 gggtcaatcg tatacttgga aatagacaac cgccagtgtt acaaatcctc ctctcagtgc
5401 ttcaacagtg ctacagatgt tgccgcattc cttggggctt tggcctccct tgggagtctg
5461 gacacccttt cctataagat tgaggcagtg aaaagtgaga acatggagac ccccaagccc
5521 tctactctct atcccatgtt atccatgttg gtcattccgt tattaatcat cttcgtcttc
5581 atgatggtca tcgttaataa gaagcgtcgc cgtgaacacg acagctttgg ttccccgacg
5641 gctttattcc aaaagaaccc agcaaaaaga aacggcgaga cgccttggga ggattctgtt
5701 ggcttaaagc ccattaagaa tatgacagat ggctctttta tggacgataa tcagaatgaa
5761 tggggagatg aggagaccct ggaaaacaag aggttcaggt ttgaagagca agtgattctc
5821 ccagaacttg ttgacgacaa aactgacccc cgacagtgga cacggcagca ccttgacgcc
5881 gctgatctgc gcatttcatc catggcccca acgccgccgc agggagagat tgaggccgac
5941 tgtatggatg tcaatgtccg tggccctgat ggcttcaccc cacttatgat tgcctcctgt
6001 agcggaggtg gattagagac tggaaacagc gaagaggaag aggatgcttc agctaatatg
6061 atttccgact tcattgggca gggtgcccaa ctgcataacc aaaccgaccg cactggcgag
6121 acagcacttc atctggccgc aagatacgca cgtgctgatg cagccaagcg cctgttggaa
6181 tcgagtgcgg acgccaacgt ccaggataac atgggcagga ctcctctcca tgcagcagtg
6241 gcggctgatg ctcagggcgt attccagatt ctcattcgga accgagcgac agacttagac
6301 gcccgcatgt ttgatggcac tacccctctg atcctggccg ctcggctggc cgtggaaggg
6361 atggtggagg agcttatcaa tgctcatgca gatgtcaacg ctgttgatga atttggaaaa
6421 tctgctttgc attgggcagc ggctgtgaat aacgttgatg ctgcagctgt gcttctcaag
6481 aatagtgcaa ataaggacat gcaaaacaac aaggaagaga catccctgtt cttggccgca
6541 agagaaggca gctacgaaac tgccaaagtc cttttggatc actacgccaa ccgtgacatc
6601 acagaccaca tggatcggct gcctcgtgac atcgcccaag aacgcatgca ccacgacatt
6661 gttcacctgc tggatgaata taaccttgtg aagagcccaa cgctgcacaa tggtccgttg
6721 ggagcaacga cattatcacc tcccatctgc tcccctaatg gttacatggg gaacatgaag
6781 ccttctgttc agagcaagaa agcccgcaag cccagtatca aaggtaatgg ctgcaaagag
6841 gccaaagagc tgaaagccag aaggaaaaaa tctcaagatg ggaaaacaac tctcttggat
6901 tctggcagtt ctggagtgtt gtccccagtg gactccctgg agtcaacaca tggatactta
6961 tcagatgtgt cttctcctcc gctgatgacc tctccgtttc agcagtctcc atccatgcct
7021 ctgaaccact tgacaagcat gccagagtcc cagcttggca tgaatcacat aaacatggcc
7081 accaagcagg aaatggcagc aggttccaac agaatggctt ttgatgccat ggtgccacgt
7141 ctgacccatc tcaatgcctc aagccctaat accatcatga gcaatggatc catgcatttc
7201 actgtgggag gagctccgac tatgaacagc caatgtgact ggttagctag gctgcagaat
7261 gggatggtcc agaatcagta tgacccaatc agaaatggca tccaacaagg caatgctcaa
7321 caagctcaag ctcttcagca tggccttatg acctcgctcc ataatggtct gccagcaaca
7381 actctctccc aaatgatgac ctatcaggcc atgcccaaca caaggctagc caatcagcca
7441 catctaatgc aagcccagca aatgcaacag cagcaaaact tgcagttgca ccagagcatg
7501 cagcaacaac atcacaattc cagcacgacc tctactcaca tcaactcacc attctgcagc
7561 agtgacataa gccagacgga cctgcagcaa atgtcaagca acaacattca ttcagtaatg
7621 ccccaggaca ctcagatatt tgctgcatct ctgccttcca atcttacgca gtctatgaca
7681 actgcacaat ttttaacccc cccttcccag catagctact cctccccaat ggacaataca
7741 ccaagccatc aactacaagt accagaccac ccgttcctga cgccttctcc tgagtcacct
7801 gaccagtggt caagctcctc ccctcattcc aatatgtctg actggtcaga aggaatatca
7861 agtcctccca cgagtatgca gcctcagcgc acccacatac ctgaagcttt caagtaaaaa
7921 aaaaaaaagt ttaaaaaaat gtaaaatatt tttaaagaca ctgagagaga ctttaagaga
7981 ctgaaggaaa tttttatatg gtttttatac ttaaaataac agaacatttg aattttctag
8041 tatttattta tatatacgtt tgacctaaaa cactgccctt ttatttataa gctttttttc
8101 atttttagga caaaatattt taacttcttg ccttgaaagt ttttcagttc taaatcttat
8161 gaaattggtt cctgcctggt attgaaaacg gcaatgtatt tattttttat ttacctgaat
8221 agtatacagg aacaaaccac tggggtgggg gggttatcgg gatgtgtatt tagcagaaaa
8281 aagattttct ataaaatgaa atctttcagg ttttcattta tagcactaaa aagattccag
8341 tattaatttt aaattaaatc atgaagaaga tgctccgaat acctctcgct gtcagggagg
8401 tgcccaatag gttttttttt tttgtttttt ttttggaaaa cctgaaactc ttggtaagta
8461 cagaaagaaa gcataaaata caggagggca cgcctaggtc tgatgcattc tcaatgggaa
8521 taaaatcctg aaaagtgact cattttaaaa aattatattt gttagaaaaa acaaactgcc
8581 attttgaatc ccttgtgtcg catggggtat tggtcagaaa atgcatgcaa tttttggtac
8641 aagcccgtgg catattttct acagtttttt tttttttttt tttttttttg taaatatcta
8701 tggtggattt tgtttcaaaa atctatttgt atatcctgat ttatattgct tatccgagta
8761 ggagtgcagg ctggatggaa ccatgccgaa gacaaatcgc agattggaca ttgtttttta
8821 tttttttgcc tacagagact atttgcacat gaacaacaac aaaaaaaaga agatatattt
8881 aaatataaaa ttttatacaa acgaaaaaac aaaaaacaaa aactgtcgat ttgcctctag
8941 ttccacttct tctgtgatac gatgcagttg ggcagttcat tccttatgcc gtactgtaca
9001 aagatctgta gaatttagtt gtttttcctg aacaaataaa tatgtaatgt atcgagcaga
9061 aatgtagttt acaaaaaaaa ctatttgttg ttgttgctgt taacacttct gtaaacaaat
9121 tttttttggt ttgtttattt gttgtaataa aattgtacaa aat
//
LOCUS CCRFTRCX 2202 bp ds-DNA BCT 30-JUN-1990
DEFINITION C.crescentus transcriptional protein (ftrC) gene, complete cds.
ACCESSION M30946
KEYWORDS transcriptional protein.
SOURCE C.crescentus (strain CB15) DNA.
ORGANISM Caulobacter crescentus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Prosthecate bacteria.
REFERENCE 1 (bases 1 to 2202)
AUTHORS Van Way,S.M. and Mullin,D.A.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.A.Mullin, 20-DEC-1989.
Author address: D.A.Mullin
Tulane University Dept. of Biology
2000 Perciual Stern Hall
New Orleans, LA 70118
FEATURES from to/span description
pept 340 1707 transcriptional protein (ftrC)
BASE COUNT 383 a 741 c 750 g 328 t
ORIGIN
1 gaattcggag gggccatgct ggcctccgag gctccgatcg aactcagcga caagaccgct
61 gcggacctgg cgccggtctt cgacgttccg gtcaacatct cggccgtgct gggccgcgcc
121 aacatgtccg tggcgcagct gctgcagctg ggccagggct cgatcctgga gctggaccgc
181 aaggtcggcg aggcgatcga catctacgtc aataaccgcc tggtcgcccg gggcgaggtc
241 gtcgtcgtcg acgagcgcct gggcgtgacc atgacggaaa tcatcaagga cggcgaccag
301 ggctgacgcc cggtggtcgt tcgagaggga gagtaagaga tgcggcttct ggtcgttgga
361 aaactgaacg ggcagctctc ggtcgccgtg aagatggcga tgaacgcggg cgcgaaggtc
421 tcgcacgtcg aaacgacgga gcaggcgacc aacgcgctgc gggcggggca gggcgctgac
481 cttctgatgg tcgactatgt gctcgacatc gccggtctga tcgccgccaa cgaagccgag
541 cggatgcggg tgccggtggt ggcctgcggc gtcgacgccg atccgatgcg cgcggccaac
601 gccatcaagg ccggggccaa ggagttcatc ccgctgccgc cggacgccga gctgatcgcc
661 gccgtcctgg ccgccgtcac cgacgacgaa aagccgatgg tcgtccgcga cccggccatg
721 gagcaggtca tcaagctggc cgaccaggtc gccccctccg aagcctcgat cctgatcacc
781 ggggagagcg gctcgggtaa ggaggtcatg gcccgctacg tccacggcaa gtcgcgccgg
841 gccaaggcgc cgttcatcag cgtcaactgc gccgccatcc ccgagaacct gctggaaagc
901 gagctgttcg gccacgagaa gggcgccttc accggggcca tggcccgccg catcggcaag
961 ttcgaggagg ccgacggcgg caccctgctg ctggacgaaa tcagcgaaat ggacgtgcgc
1021 ctgcaagcca agctgctgcg cgccatccag gagcgcgaga tcgaccgcgt gggcggctcc
1081 aagccggtca aggtcaatat ccgcatcctg gccaccagca accgcgacct ggcccaggcg
1141 gtgaaggacg ggacgttccg ggaagacctg ctctaccgtc tgaacgtcgt gaacctgcgc
1201 ctgccgccgc tgcgcgagcg tccggccgac gtgatcagcc tgtgcgagtt cttcgtgaag
1261 aagtactcgg ccgccaacgg catcgaggaa aagccgatct cggccgaggc caagcgccgc
1321 ctgatcgctc accgctggcc gggcaacgtc cgcgagctgg aaaacgccat gcaccgggcg
1381 gtgctgctgt cggcgggccc ggagatcgag gagttcgcca tccgtctgcc cgacggccag
1441 ccgatggccc cggcgccgga cgtcgcggtt gcccgcggcg cccagatggc cgccgacgcc
1501 gcctcgcgcg ccttcgtcgg ctcgaccgtc gccgaggtcg agcagcagct gatcatcgac
1561 accctggagc actgcctggg caaccgcacc catgcggcca acatcctggg catctcgatc
1621 cgcaccctgc gcaacaagct gaaggaatat tccgacgccg gcgtgcaggt gccgccgccc
1681 cagggcgggg tcggcgcggc cgcttaagac cagggcgtcg aggtggcggg accgaaaacc
1741 tcgcccttcg acaagctcag ggtgaggttt tctatgcctg gcccggcgct tcattcgtcc
1801 tcaccctgag cctgtcgaag ggcgaggacg acgcaaagct ccgggccgat ctggccgccg
1861 ccgcgaccat cggtccggcg ccgcgccgcg aaaaatttgg ttaacaagat ggcggcgggc
1921 cgcaaatcgc tccaagcgaa agcctagctt cccatccccg gttcaaggtt ccgaatggcc
1981 gacgccgccg ccccgaacgc cagctcgatg cccagcgcca agtcgctgct cgacgggttg
2041 atgcgcggcg agatgggcct ggccctgggc gtggtcggca tcatcgtgct gctgatcatc
2101 ccggttccgg cgccgctgct ggacgtgctg ctggccattt cgctgacggg atcggtgctg
2161 atcctgatga cggcgattct gatcaagaag ccgctggaat tc
//
LOCUS TRCTC26 638 bp ss-mRNA INV 30-JUN-1990
DEFINITION T.cruzi AMP-inducible protein (TC26) mRNA, partial cds.
ACCESSION M34062
KEYWORDS AMP-inducible protein.
SOURCE T.cruzi (strain X10/4) trypomastigote, cDNA to mRNA, clone G26.
ORGANISM Trypanosoma cruzi
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 638)
AUTHORS Heath,S., Hieny,S. and Sher,A.
TITLE A cyclic AMP inducible gene expressed during the development of
infective stages of Trypanosoma cruzi
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.Heath, 08-MAY-1990.
FEATURES from to/span description
pept < 1 > 638 AMP-inducible protein (TC26) (AA at 1)
BASE COUNT 154 a 174 c 168 g 142 t
ORIGIN
1 accccaccag accaactggt aatggtagcg accggcgctc agcttggaat tccgtgcgct
61 gcttctagcc ctagagaagc tgatgatccc ccgtatccga cacaggcgta caaccctgct
121 tgtggttacg gacagtcagt ctcttctagc ggctctaaac aagggcccgc tcaatcagac
181 agactggacg gaggatcaga tctggcggcg tctcttgaca ctgacgtgtg ctggctggtc
241 gtgcacctgc agttttgtta cggacattgt ggagtacatg ctaacgagct tgcagatcag
301 tatgcgagga actatggaaa gtggacaata cacggagcaa ggaatcgcac ctttatggca
361 tacggatctg ctgacgtgtt ttactaccca gctcaccaac aagtggcgta ctaccattcg
421 tcaagacact catcgctacc tgctttgcgg cacaaggcca tcagatctcg cggtaaggac
481 ctgatcactc aggaagttct acaccgtcag aactggttca cctcgcaagg gcaaggtgcg
541 gggaatctga gctctggggc cgactatctg ggccgtgaga gattgcacga accaatgccg
601 tttctgcaac atctcaccgg aacagtctgc atatatat
//
LOCUS HUMSATM29A 700 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human minisatellite MS29 locus 1 polymorphism.
ACCESSION M28875
KEYWORDS minisatellite.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 700)
AUTHORS Wong,Z., Royle,N.J. and Jeffreys,A.J.
TITLE A novel human DNA polymorphism resulting from transfer of DNA from
chromosome 6 to chromosome 16
JOURNAL Genomics 7, 222-234 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.J.Jeffreys, 04-OCT-1989.
BASE COUNT 156 a 188 c 212 g 144 t
ORIGIN Chromosome 6p25-pter.
1 ggacagccag agtgcccaca agctcggcag gaaggaggct cacatcgccg cggaagtctc
61 gtgatggttc tggtagaatc gtcctcttga aaccttctgc tgccattatt gggtgctgtc
121 caatatgcct tacgcaaaac actgacaata gcatacacct ttccagaatg ccggcacgcg
181 ctgtgcaatg cccaaactgt ccctgtgagt tgaatgcccc tggcaggatg cgtgtccact
241 cccgcagggc cgtgggtgcc tcccttgtct cactcctctg cttcaaccct tcccagaact
301 ctgaaggttt gagatgaaat gactgagcca cacctagtgc aggaatgagc accagggcct
361 ctctgcaacg gctgcttaag ggccacagga taagcatagt gtggcgcccc cagttattcc
421 ctgtttcact cattggacca ccctgatagg cttcagggag gcaggaacct cctctaaggc
481 agcaggagag gggtggatgg ggcagccaac aggtgcatct catgcctccg tgttctccat
541 agggagggag gtgggctaag aggcgagtgg ggcagaggcc tgggccagcg tgaagtgtca
601 ggagtaaggg gggagaagac tggagggggc aaatccagga ggcagctgag gctcgactgt
661 gcaaatacaa gcccggttac gtctgtgtag cttagcttcc
//
LOCUS HUMSATM29B 700 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human minisatellite MS29 locus 2 polymorphism.
ACCESSION M28876
KEYWORDS minisatellite.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 700)
AUTHORS Wong,Z., Royle,N.J. and Jeffreys,A.J.
TITLE A novel human DNA polymorphism resulting from transfer of DNA from
chromosome 6 to chromosome 16
JOURNAL Genomics 7, 222-234 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.J.Jeffreys, 04-OCT-1989.
BASE COUNT 182 a 189 c 146 g 183 t
ORIGIN Chromosome 16p11.
1 ggacagccag agtgcccaca agctcggcag gaaggaggct cacatcgccg cggaagtctc
61 gtgatggttc tggtagaatc gtcctcttga aaccttctgc tgccattatt gggtgctgtc
121 caatatgcct tacgcaaaac actgacaata gcatacacct ttccagaatg ccggcacgcg
181 ctgtgcaatg cccaaactgt ccctgtgagt tgaatgcccc tggcaggatg cgtgtccact
241 cccgcagggc cgtgggtgcc tcccttgtct cactcctctg cttcaaccct tcccagaact
301 ctgaaggttt gagatgaaat gactgagcca gtgagtgaaa agaccagcca agaatgtgaa
361 aaacctcatc tggacacaag gcaaaataat tattttctcc cttggttacc ttttccattc
421 cctttcttca agggaaagaa ccatgtctta ttttacttgg catcagcagc tctcagaaca
481 gtggcttgac caataccaag cattcaataa atgtttattg aaggaatgaa tcaatggcca
541 cccacaacct atacctctta cagcccattt aatgtccttt agccccttcc ataggtattt
601 gcctggaatg tttgtaagat ttttgaaaat ctcagaaaca acaccagccc tccacccatg
661 ggaagatgtg tctcaggcta acacctgtgc tcttgttgat
//
LOCUS HUMSATOD 1900 bp ds-DNA PRI 30-JUN-1990
DEFINITION Human O family dispersed repeat element.
ACCESSION M28877
KEYWORDS repetitive element.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1900)
AUTHORS Wong,Z., Royle,N.J. and Jeffreys,A.J.
TITLE A novel human DNA polymorphism resulting from transfer of DNA from
chromosome 6 to chromosome 16
JOURNAL Genomics 7, 222-234 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.J.Jeffreys, 04-OCT-1989.
FEATURES from to/span description
rpt 410 785 O family dispersed repeat
BASE COUNT 484 a 501 c 344 g 567 t 4 others
ORIGIN
1 gccacctctg tctaatgttc tcatatagct ccaactctaa cacctcctta cattctacca
61 tctgtgtctt agactgcagc aagccctgta actggctctc ggtctccagg cttctcccga
121 cctagacttg cttacattgc tgtacaagtg aggtttttaa cattctcagt cttgtcactc
181 actgcttaaa ctccttcaag aattttcact cccttcaggc caagggctat atgaagccct
241 tcagatgtgg cccttgcttg tgtctccagc cccatctctc agcagtgctc ttaacactcc
301 caaccatcat gagctaatta ctatccttcc aacatgccat gctccttcac acccctcttg
361 ctttgaacac gctgactctt ctgcctgaaa tgtctttccc agttcccttc tgatatggtt
421 tggctgtgtc cccacccaaa tctcatcttg aattgtagtt cccataatcc ccacatgttg
481 tgggagggac ctggtaggag gtaattgagt catgggggtg attaccctca tgctgccatt
541 ctcatggtag tgaacgtgag ttctcacaag atcttatggt tttataaggg gcttctctcc
601 ctttgctcag cacttcacct tgctgccacc atgtgaagga cacgttttct tccccttcca
661 ccatgattgt aagtttcctg aggcctcctc agccatgctg aactgtgagt caattaactt
721 ctttccttta taaattaccc agtctcgggt atgtatttat tagcagtgtg agaatggact
781 aatacacccc cttcactcac ttttcagtct tcaaagtaca cagccatcct ttactgcccc
841 atgcagcacc ttcaggtctt gagctctatg cagggaggta agccacggaa gtcacaagca
901 catcttttat actagtgcac acaactgaaa actcacagac tggtaaaagc acatctttta
961 tactagtgca cacggctgaa aactcacaga ctggtaccaa accagcttgg ttctgttctg
1021 cttccactgc ctgtcaagac tgtgtgctct gggcaaggta cttcaaccac tgggtcttgg
1081 cttcctcatc tggaaaatga aaataaaact gcaactttaa aaagcttttc tgaaatcaaa
1141 tgcagcaatg tacataaaag gcaaataagc tcaccctatg ttttacctaa ttttcttttt
1201 atcctgctct gtgtatttct taagccaacc atcttacttc ctttttagaa tgcatcagga
1261 tagacacaaa taaatggaaa cactgttaca aagcccctgc tcacacttca gaattcagat
1321 tgcattacct cctccaggaa gtcttcctta atcactctct gctcgagtct catttagatg
1381 atctcctgtg tgcttgcata tcgtctgtgc atacctctgt gttgggagga gtggtaagag
1441 atggggctgg agactcaaga aaaggccaca tcggaagggc ctcgatcctt tcccctgaag
1501 ggagtaaagg ctctccaaag agtttaagca gtgagtgaaa agaccagcca agaatgtgaa
1561 aaacctcatc tggacacaag gcaaaataat tattttctcc cttggttacc ttttccattc
1621 cctttcttca agggaaagaa ccatgtctta ttttacttgg catcagcagc tctcagaaca
1681 gtggcttgnn nnataccaag cattcaataa atgtttattg aaggaatgaa tcaatggcca
1741 cccacaacct atacctctta cagcccattt aatgtccttt agccccttcc ataggtattt
1801 gcctggaatg tttgtaagat ttttgaaaat ctcagaaaca acaccagccc tccacccatg
1861 ggaagatgtg tctcaggcta acacctgtgc tcttgttgat
//
LOCUS BSURFOLA 4880 bp ds-DNA BCT 30-JUN-1990
DEFINITION B.subtilis folic acid biosynthetic operon encoding
para-aminobenzoic acid synthase, component I (pab), glutamine
amidotransferase (trpG), pot. p-aminobenzoate synthesis enzyme
(pabC), and pot. dihydropteroate synthase (sul) genes, complete
cds.
ACCESSION M34053
KEYWORDS H2-pteridine-CH2OH pyrophosphokinase; dihydropteroate synthase;
glutamine amidotransferase; para-aminobenzoic acid synthase.
SOURCE B.subtilis (strain VB157) vegetative cell DNA, clones pJS[1,2].
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 4880)
AUTHORS Slock,J., Stahly,D.P., Han,C.-y. and Crawford,I.P.
TITLE An apparent folic acid biosynthetic operon of Bacillus subtilis
containing pab, an amphibolic trpG, a third gene required for
synthesis of para-aminobenzoic acid, and the dihydropteroate
synthase gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.P.Stahly, 07-MAY-1990.
Author address: D.P.Stahly
University of Iowa
Dept. of Microbiology
Iowa City, IA 52242
e-mail: cmdsix@vaxa.weeg.uiowa.edu
FEATURES from to/span description
pept 313 1725 para-aminobenzoic acid synthase, component I
(pab)
pept 1739 2323 glutamine amidotransferase (trpG)
pept 2323 3204 pot. p-aminobenzoate synthesis enzyme (pabC)
pept 3186 4043 pot. dihydropteroate synthase (sul) (EC
2.5.1.15)
pept 4036 4398 ORF2
binding 300 303 ribosome binding site
binding 1724 1730 ribosome binding site
binding 2306 2313 ribosome binding site
binding 3175 3179 ribosome binding site
binding 4023 4027 ribosome binding site
binding 4381 4385 ribosome binding site
binding 1720 1728 methyltryptophan resistance protein binding
site
BASE COUNT 1512 a 939 c 1205 g 1224 t
ORIGIN
1 gaattctcgg aggcatctca tcaggtgcgg caatttatgc tgctcttcaa gtagcaaaaa
61 aactcggaaa aggcaaaaaa gtattggcca tcattccaag taacggcgaa cgttatctaa
121 gcacgccgct gtaccaattc gattaaaaaa agccaaaact cccggttcgc cgggagtttt
181 tttatatttc gtgcatcaaa tatgccatgt ctgcgacgaa tgttattctt ggaagagaaa
241 ttcacttttt tcactaacaa cattgcttta caattaaaaa caagtaaaaa agaaaataga
301 ggaatgatac aaatggcaca acgcagaccg gcaggcaaaa aaataccttt tcaaaaagac
361 tcattcttac aacaatttga gaaacttgcg caatcccgga aacatcatgt acttctcgaa
421 agtgcaagag gcggcagata tagtatagcc ggtcttgatc caattgcgac tgtgaaagga
481 aaagacggaa taactacaat taagcatggt gatgagatgc tgtttaaaga aggtgatcca
541 ttacgggcct tccacagctg gtttaaaaca ctggaaacag aaacgaatca tgagttccct
601 gactttcaag gcggggcaat cgggtttctc agctatgatt acgcacggta cattgaaaat
661 tttaaaatgc tctcattaga tgatttagaa acaccagata tttattttct tgtttttgat
721 gatatagcag tttatgacca tcaagaagag tctctatggc tgattactca tgttaatggt
781 tctgatcagg aaacagcgga tgtgaagcta tctgagttag agcagatgtg gttgactgag
841 cttcccgctg tcacttcgcg agagatgaag cctgaaacag ctggttcttt cgcggcgcca
901 tttaccgagg atgggttctc acaagctgta gagaaaatca aacaatacat tgccagcgga
961 gatgtgtttc aagtcaatct atcaataagg cagtcacagt cactgtctgt ccacccatat
1021 caaatttaca aaaccttgag agaagtaaat ccttctcctt atatggcgta tttagaaaca
1081 cctgatttcc aaatcatttg cggatcgcct gaactgcttg tcagcaaaaa gggcaagcta
1141 ttagagacga gaccgattgc gggcacccgt tccagaggga aaacaaatga agaagacgag
1201 gcgcttgcaa acgaattgat acacaatgaa aaagaacgcg cggaacatgt catgctggtt
1261 gatcttgagc gaaatgatct gggaagagta tcacgttacg ggtctgtgcg cgtaaatgaa
1321 ttcatggcaa ttgaaaaata ctcgcatgtg atgcacattg tgtctaatgt ccaaggtgaa
1381 ctgcaggatg ggtatgatgc tgtagatatt attcatgctg tgtttcccgg aggaaccatt
1441 actggtgcac cgaaagtaag aacgatggaa attatagaag aacttgagcc gacacgccga
1501 gggctttata ctggatctat aggatggttt ggatataatc acgatctgca gtttaatatc
1561 gtcattcgaa ccatttatgc aaccggaggg caggcattta tgcagtccgg tgcaggagtt
1621 gtgattgatt ctgttccgaa gcacgaatac aaggaatcat tcaaaaaagc ttttgcgatg
1681 caaagagcat tagagctgag cgaagaagag acaaaaatta gatgaggtga gcggagaaat
1741 gattttaatg attgataact acgattcatt cacgtacaac ttggtacagt atttgggcga
1801 gcttggggaa gagctggttg tgaaacgcaa tgacagcatc acaatcgatg aaattgaaga
1861 actgtctccg gactttctga tgatatctcc cggaccgtgc agccctgatg aggcgggaat
1921 cagcctcgaa gcaattaaac atttcgcagg gaaaattcct attttcggtg tatgtctcgg
1981 acatcagtcc atcgcacaag tgttcggtgg tgatgttgtt agggcagaac ggcttatgca
2041 cgggaaaacc tcggatatcg agcatgacgg caaaaccatt tttgaagggt tgaaaaatcc
2101 ccttgttgcg acgcgatacc actcgctgat cgtaaaacct gagacgctgc caagctgttt
2161 tacagtaaca gcacaaacga aagaaggaga aatcatggct attcgccaca atgacctccc
2221 gatagagggt gtgcaatttc acccagagtc tattatgacc tcctttggga aagaaatgct
2281 cagaaatttt attgagacat atcgcaagga agttattgcg tgatgatata tgtgaacggc
2341 cggtatatgg aggagaaaga tgcagttctt tctccttttg accatgggtt tttatatggg
2401 atcggtgttt tcgaaacctt caggctctac gaaggctgcc cgtttttgct tgattggcat
2461 atagagaggc ttgaacgcgc gcttaaggat ctgcaaatcg aatatactgt ttcaaagcat
2521 gagatccttg agatgctgga caagctgctt aagctaaatg atatcaagga tggcaatgcc
2581 cgcgtcagac tgaacatatc agcgggaatc agcgataaag gctttgtcgc ccagacgtac
2641 gataagccca ccgttttatg ctttgtgaat cagctgaaac cagaaagtct tcctttgcaa
2701 aaagaaggaa aggttttatc gatacggaga aacactccag agggatcgtt ccggttaaaa
2761 tctcatcatt atctaaataa tatgtatgcg aaacgggaaa ttggaaatga tccgcgtgtc
2821 gagggaattt tcttaacgga ggatggagct gttgcagagg gtatcatatc taatgtcttc
2881 tggagaaaag gtcgctgtat ctatacgcca tcgcttgata ccggtattct ggacggcgtc
2941 acgagacgtt ttatcataga aaatgcaaag gatatcggcc tggagctcaa aacgggccga
3001 tatgagctag aagctctctt gacagctgac gaagcgtgga tgacgaactc ggttcttgaa
3061 atcatcccgt ttactaaaat agaagaagta aactatggaa gccaaagcgg agaagcaacc
3121 tccgctcttc aattgttata caaaaaagaa ataaagaaca tgattcatga aaagggagga
3181 agagcatggc gcagcacaca atagatcaaa cacaagtaat ccacactaag cccagcgctt
3241 tatcatataa agagaagacg ctggtgatgg gaattttaaa cgtaacgcct gactctttct
3301 cggacggcgg aaaatatgac agcttggaca aggcgctgct gcacgcgaaa gagatgatcg
3361 atgatggtgc ccatatcatt gatattggag gggaatcgac aaggcctggc gctgagtgcg
3421 tatctgagga tgaggagatg tccagagtca ttccggtgat tgagcggatt acgaaagagc
3481 ttggtgttcc tatttctgta gacacgtaca aggcttctgt cgcagatgaa gcagtgaaag
3541 ccggtgcatc cattatcaat gatatttggg gagccaaaca tgatccgaag atggcttccg
3601 ttgcagctga acataatgtt ccaattgtac tcatgcataa ccgccctgaa agaaactaca
3661 atgacttatt gccggatatg ctgtcggact taatggagag tgtaaaaatt gctgttgagg
3721 ccggagtaga cgagaagaac attattcttg atcctggtat cggtttcccg aaaacctatc
3781 acgataactt ggcagtgatg aacaaactag agattttcag cggattggga tatccggttc
3841 ttctggcaac ctcccgaaaa agattcatcg gacgtgttct ggatcttccg cctgaggagc
3901 gggctgaggg cacaggcgcg actgtgtgtc tcggcattca aaaaggctgt gacattgtca
3961 gggtccatga tgtaaagcaa attgccagaa tggcgaaaat gatggacgcg atgctgaata
4021 agggaggggt gcaccatgga taaagtttat gtagaaggta tggagtttta cggatatcac
4081 ggtgtgttca cagaagaaaa caaacttggc cagcggttta aagtcgattt aaccgctgag
4141 ctggatttaa gcaaagctgg acagacagac gaccttgagc aaacgatcaa ctatgctgag
4201 ctctatcacg tatgtaaaga tatcgtggaa ggggagcctg tgaaattggt ggaaacgctg
4261 gcggaacgta ttgctggcac tgttctcgga aaatttcagc ctgttcagca atgtacggtg
4321 aaagtgatta agccagaccc gccaattccc ggacactata aatcagtagc aattgaaatt
4381 acgagaaaaa agtcatgaac aacatagctt atattgcact tggatctaat attggagata
4441 gagaaacgta tttaaggcaa gcagtggctt tactgcatca gcatgctgcg gtgacagtca
4501 ctaaagtgtc gtctatttac gaaactgacc cggtcggata cgaagatcaa gctcaatttt
4561 tgaatatggc tgttgaaatc aagacatcat tgaacccttt tgaactcctt gaactgacgc
4621 agcagataga aaatgaatta ggcagaacaa gggaagtaag atggggaccg cggacggcag
4681 accttgacat tttgttattt aatcgtgaaa atattgaaac agagcaacta attgttccgc
4741 atccgagaat gtatgagcgt ttgtttgtcc ttgcgccgct tgcggaaatt tgccagcagg
4801 ttgaaaaaga ggctacaagc gccgaaacag accaagaagg tgtaagagta tggaagcaga
4861 aatctggggt agacgaattc
//GenBank-Updates@genbank.bio.net (07/03/90)
LOCUS DROANNIX 1104 bp ss-mRNA INV 03-JUL-1990
DEFINITION D.melanogaster annexin IX mRNA, 3' end.
ACCESSION M34068 J05501
KEYWORDS annexin IX.
SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-6.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1104)
AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and
Suedhof,T.C.
TITLE Two novel annexins from Drosophila melanogaster: Cloning,
characterization and differential expression in development
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by T.C.Suedhof, 04-MAY-1990.
FEATURES from to/span description
pept < 1 891 annexin IX (AA at 1)
signal 1091 1096 poly-A signal
BASE COUNT 281 a 300 c 306 g 217 t
ORIGIN
1 attctgcgca aggcgatgaa gggcttcggc accgacgaga aggccatcat cgagatcctg
61 gccaggcgtg gcatcgtcca gcgtttggag atcgctgagg cgttcaagac ctcgtacggc
121 aaggatctga tctcggacct caagtccgag ctgggcggca agttcgagga tgttatcctg
181 gctctgatga cgccgctgcc ccagttctat gcccaggagc tgcacgacgc catctcggga
241 ctgggaaccg acgaggaggc catcatcgag atcctctgca cgctgtccaa ctacggcatt
301 aagaccattg cccagttcta cgagcagagc ttcggcaagt ccctagagtc cgacctaaag
361 ggcgacacca gtggccactt caagcggctg tgcgtctcgc tcgtccaggg caaccgggat
421 gagaaccagg gcgtggacga ggccgcggcc atcgccgatg cccaggctct gcacgacgcc
481 ggtgagggac agtggggcac agatgagtcc accttcaact cgatcctgat cacccgctcc
541 taccagcagc tgcgccagat cttcctcgaa tacgagaatc tgtcgggcaa cgacatcgag
601 aaggccatca agcgggagtt tagcggctcc gtggagaagg gtttcctggc catcgtcaag
661 tgctgcaagt ccaagatcga ctacttttcg gagcgcctgc acgactccat ggccggcttg
721 ggcaccaagg acaagacgct gatccgcatc atcgtcagcc ggtcggagat cgatctgggt
781 gacatcaagg aggcattcca gaacaagtac ggcaagagct tggagtcctg gatcaaggag
841 gatgccgaga ccgatattgg atacgtcctg gtcactctta cggcttggta gacggaagca
901 gccggaatat ccgaatatct atgagcaata ccccactgtt caagtagaaa atgccaaaaa
961 aaaaaacgtt gcatttcccc aaaaaaaagt ataacaaaag cgaagaacaa atggagttgg
1021 tctatataca gtagttgtga tgtgttctaa aaatccaatc tacaaaacgc ttagtatttt
1081 ccctctgtgc aataatcgga attc
//
LOCUS DROANNX 1192 bp ss-mRNA INV 03-JUL-1990
DEFINITION D.melanogaster annexin X mRNA, complete cds.
ACCESSION M34069 J05501
KEYWORDS annexin X.
SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-16.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1192)
AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and
Suedhof,T.C.
TITLE Two novel annexins from Drosophila melanogaster: Cloning,
characterization and differential expression in development
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by T.C.Suedhof , 04-MAY-1990.
FEATURES from to/span description
pept 91 1056 annexin X
signal 1175 1184 poly-A signal
BASE COUNT 271 a 348 c 363 g 210 t
ORIGIN Chromosome 93B or 19A-4,7.
1 gaattccaaa agtcccagga gaaagactga ttcgtgtgaa gtcgtctact gaagagccac
61 aaggaaccca aggaatcttc cagctgcata atggaataca aacccgtgcc cacggttaag
121 gacgcagctc ccttcgacgc ctcccaggac gcccaggtgc tgcgggcggc gatgaaggga
181 ttcggcaccg acgagcagga aatcatcgac gtgctcgtcg gcaggagcaa ccagcagagg
241 cagacgatca aggcggttta cgaagcggag ttcgagcgcg acctggtgga cgatcttaag
301 gacgagctgg gaggcaagtt cgaggacgtg atcgtgggtc taatgatgcc accagtggag
361 tacctgtgca agcaactgca cgccgccatg gcgggcatcg gaaccgagga ggccacgctc
421 gtcgagatcc tgtgcaccaa gaccaacgag gagatggccc agatcgtggc cgtctacgag
481 gagcgctacc agcgcccgct ggccgagcag atgtgcagcg agacctccgg ctttttccgc
541 cgcctgctca cgctgatcgt gaccggagta cgtgacggac tggacacgcc cgtcgacgtc
601 ggtcaggcca aggagcaggc cgcccagctc tactcggccg gcgaggccaa gctgggaacg
661 gacgaggagg tcttcaaccg gatcatgtcg cacgccagct tcccgcagct gcgacttgtc
721 ttcgaggagt acaaggtgct ctccgggcag accatcgagc aggccatcaa gcacgagatg
781 tccgacgagc tgcacgaggc catgatggcc atagttgagt gcgtccagtc accggcggcc
841 ttcttcgcca accgcctcta caaggccatg aatggcgccg gcaccgatga cgccacgctc
901 atccgcatca tcgtcagccg ctcggagatc gacctggaga ccattaagca ggagttcgag
961 cggatctaca accgtacgct gcacagcgcc gtggtggacg cggagacctc tggtgactac
1021 aagcgggccc tgacagccct acttggatcc gcctaggccc gaggatgtgg cagctggtcc
1081 gcccaatatt ttattcgtgt taatagcttt gatcgtagtg tgccttttag gaaaatcgct
1141 tttaatgtcg tctgcgcatg cgcacactgt tggcaataaa taaacggaat tc
//
LOCUS NEUMPPX 2038 bp ss-mRNA PLN 03-JUL-1990
DEFINITION N.crassa matrix processing peptidase (MPP) mRNA, complete cds.
ACCESSION J05484
KEYWORDS matrix processing peptidase.
SOURCE N.crassa, cDNA to mRNA.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 2038)
AUTHORS Schneider,H., Arretz,M., Wachter,E. and Neupert,W.
TITLE Matrix processing peptidase of mitochondria: Structure-function
relationships
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by W.Neupert, 17-APR-1990.
FEATURES from to/span description
pept 41 1774 matrix processing peptidase
BASE COUNT 439 a 626 c 552 g 421 t
ORIGIN
1 cccacattac gctgccgcat cacaattcct tgttgcagcc atgctgaatc gcttccggcc
61 agcgcggcta gtagcccaat cctccagatg cttgcccttg acgagggcgc gggcaggtcc
121 cttgcccgtt aacaatgcca ggactttggc tacgagagcc gctgctgtca acaccaagga
181 accgaccgaa cgcgacaaca tcaccactct ctccaatggt gtccgtgtcg cttccgagga
241 ccttcccgat gccttctccg gtgtaggtgt ctacatcgac gcggggtccc gatatgagaa
301 cgactatgtc cggggtgcca gtcacatcat ggaccggcta gccttcaagt ctacaagtgc
361 gaggactgcg gacgaaatgc tcgaaactgt tgagaagctc ggtggtaaca ttcagtgcgc
421 ttcttcgcgc gagtctatga tgtaccaggc ggccaccttc aacaaggcta ttcccaccgc
481 tgttgagctc atggccgaga ccatccgcga tcccaagctt acggacgagg agctggaggg
541 acagatcatg acggcgcaat atgaggtcaa cgagatctgg tccaaggccg aactgatcct
601 gcccgagttg gtgcacatgg ctgccttcaa ggacaacact cttggcaacc cgttgctttg
661 tcccaaggag aggttggatt acatcaaccg ggatgtcatc caaacatacc gcgacgcttt
721 ctacaggccc gagcgccttg ttgttgcctt tgctggtgtg cctcatgaga gggccgtcaa
781 gctcgcagag aagtactttg gtgatatgaa ggcctccgat gctcccggtc tctcgaggac
841 aggttccgaa acctccgtcg actcgctagt gtccgagtcc agcgaggcct cgagtgaatc
901 ttcatcatcc tcctcggact cttccgagtc gagtggcggg ctgctctcca agcttttctc
961 tcccaaggcc aagaaagcca cccccaaccc cttcctcacc cgggtaccta ttagcaccga
1021 agacttgact cggcctgctc actacacagg cggtttcctc accctcccat cacagccccc
1081 accgctcaac cccaaccttc ccacatttac tcacatacag ctcgccttcg agggcctcgc
1141 catctcggac gacgacatct acgccctcgc caccctgcag accctcctcg gcggcggcgg
1201 ctccttctct gccggcggtc ccggcaaggg catgtactcg cgtctctaca ctaacgttct
1261 caaccagcac ggctgggttg agtcctgcgt ggccttcaac cactcataca cggactcggg
1321 tctcttcggc atcgccgcct cgtgctaccc gggtcgcacc ctgcccatgc tccaggtcat
1381 gtgccgcgag ctgcacgccc tcaccaccga ccatggctac tcggccctgg gcgagctcga
1441 ggtttcgcgc gccaagaacc agctccgcag cagcctcctg atgaacctcg agagccgcat
1501 ggtcgagctc gaggatctgg gccgccaagt tcaggttcac ggtcgcaaga tcccggtccg
1561 cgagatgacg cgccgtatca acgagctgac ggtcaaggac ctccgaaggg tcgctaagcg
1621 cgtggttggt ggcatggcga ataacgccgg ccagggaagc ggtgcgccga cggtggtgct
1681 gcaggaggcg acggtgcaag gactcaagac tacggagctg gggtgggatc agatccagga
1741 tacaattgct cagtggaagc tcggtagacg gtaaacgttt gtcaagggga aaaaaagagt
1801 agggcgtgga gaagttatgt aagaggagcg ctgtattgaa cttggcgaca cgcacacacc
1861 ggaacgataa aggcgtttta ggttccccac gagcataggg aagaggctag atggttgctc
1921 tgtacaatcg caacttttct tggtgagtta tacaagatgt gtccaggtac atctttgcct
1981 taccatactg tacgatagca atgaagattt tctgatatat caaaagtcaa aagtcaaa
//
LOCUS HUMCYP2DG 5503 bp ds-DNA PRI 03-JUL-1990
DEFINITION Human debrisoquine 4-hydroxylase mutant allele (CYP2D6-MA1) gene,
complete cds.
ACCESSION M33189
KEYWORDS debrisoquine 4-hydroxylase.
SOURCE Human individual MAGA DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 5503)
AUTHORS Gonzalez,F.J.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Gonzalez, 23-MAR-1990, for release after publication.
Author address: F.Gonzalez
National Cancer Institute
Bldg. 37 Rm. 3E-24
National Institute of Health
Bethesda, Md 20892
FEATURES from to/span description
pept 814 993 debrisoquine 4-hydroxylase, exon 1
1696 1877 debrisoquine 4-hydroxylase, exon 2
2419 2571 debrisoquine 4-hydroxylase, exon 3
2661 2820 debrisoquine 4-hydroxylase, exon 4
3254 3430 debrisoquine 4-hydroxylase, exon 5
3621 3762 debrisoquine 4-hydroxylase, exon 6
3970 4157 debrisoquine 4-hydroxylase, exon 7
4612 4753 debrisoquine 4-hydroxylase, exon 8
4852 5030 debrisoquine 4-hydroxylase, exon 9
pre-msg 726 5103 debrisoquine 4-hydroxylase mRNA and introns
IVS 994 1695 debrisoquine 4-hydroxylase intron A
IVS 1878 2418 debrisoquine 4-hydroxylase intron B
IVS 2572 2660 debrisoquine 4-hydroxylase intron C
IVS 2821 3253 debrisoquine 4-hydroxylase intron D
IVS 3431 3620 debrisoquine 4-hydroxylase intron E
IVS 3763 3969 debrisoquine 4-hydroxylase intron F
IVS 4158 4611 debrisoquine 4-hydroxylase intron G
IVS 4754 4851 debrisoquine 4-hydroxylase intron H
signal 689 702 TATA box
BASE COUNT 1066 a 1537 c 1851 g 1049 t
ORIGIN Chromosome 22.
1 ggctgggaag tggggtactt ggtgccgggt ctgtatgtgt gtgtgactgg tgtgtgtgag
61 agagaatgtg tgccctaagt gtcagtgtga gtctgtgtat gtgtgaatat tgtctttgtg
121 tgggtgattt tctgcgtgtg taatcgtgtc cctgcaagtg tgaacaagtg gacaagtgtc
181 tgggagtgga caagagatct gtgcaccatc aggtgtgtgc atagcgtctg tgcatgtcaa
241 gagtgcaagg tgaagtgaag ggaccaggcc catgatgcca ctcatcatca ggagctctaa
301 ggccccaggt aagtgccagt gacagataag ggtgctgaag gtcactctgg agtgggcagg
361 tgggggtagg gaaagggcaa ggccatgttc tggaggaggg gttgtgacta cattagggtg
421 tatgagccta gctgggaggt ggatggccgg gtccactgaa accctggtta tcccagaagg
481 ctttgcaggc ttcaggagct tggagtgggg agagggggtg acttctccga ccaggcccct
541 ccaccggcct accctgggta agggcctgga gcaggaagca ggggcaagaa cctctggagc
601 agcccatacc cgccctggcc tgactctgcc actggcagca cagtcaacac agcaggttca
661 ctcacagcag agggcaaagg ccatcatcag ctccctttat aagggaaggg tcacgcgctc
721 ggtgtgctga gagtgtcctg cctggtcctc tgtgcctggt ggggtggggg tgccaggtgt
781 gtccagagga gcccatttgg tagtgaggca ggtatggggc tagaagcact ggtgcccctg
841 gccgtgatag tggccatctt cctgctcctg gtggacctga tgcaccggcg ccaacgctgg
901 gctgcacgct actcaccagg ccccctgcca ctgcccgggc tgggcaacct gctgcatgtg
961 gacttccaga acacaccata ctgcttcgac caggtgaggg aggaggtcct ggagggcggc
1021 agaggtgctg aggctcccct accagaagca aacatggatg gtgggtgaaa ccacaggctg
1081 gaccagaagc caggctgaga aggggaagca ggtttggggg acttcctgga gaagggcatt
1141 tatacatggc atgaaggact ggattttcca aaggccaagg aagagtaggg caagggcctg
1201 gaggtggagc tggacttggc agtgggcatg caagcccatt gggcaacata tgttatggag
1261 tacaaagtcc cttctgctga caccagaagg aaaggccttg ggaatggaag atgagttagt
1321 cctgagtgcc gtttaaatca cgaaatcgag gatgaagggg gtgcagtgac ccggttcaaa
1381 ccttttgcac tgtgggtcct cgggcctcac tgctcaccgg catggaccat catctgggaa
1441 tgggatgcta actggggcct ctcggcaatt ttggtgactc ttgcaaggtc atacctgggt
1501 gacgcatcca aactgagttc ctccatcaca gaaggtgtga cccccacccc cgccccagga
1561 tcaggaggct gggtctcctc cttccacctg ctcactcctg gtagccccgg gggtcgtcca
1621 aggttcaaat aggactagga cctgtagtct ggggggatcc tggcttgaca agaggccctg
1681 accctccctc tgcagttgcg gcgccgcttc ggggacgtgt tcagcctgca gctggcctgg
1741 acgccggtgg tcgtgctcaa tgggctggcg gccgtgcgcg aggcgatggt gacccgcggc
1801 gaggacacgg ccgaccgccc gcctgtgccc atcacccaga tcctgggttt cgggccgcgt
1861 tcccaaggca agcagcggtg gggacagaga cagatttccg tgggacccgg gtgggtgatg
1921 accgtagtcc gagctgggca gagagggcgc ggggtcgtgg acatgaaaca ggccagcgag
1981 tggggacagc gggccaagaa accacctgca ctagggaggt gtgagcatgg ggacgagggc
2041 ggggcttgtg acgagtgggc ggggccactg ccgagacctg gcaggagccc aatgggtgag
2101 cgtggcgcat ttcccagctg gaatccggtg tcgaagtggg gggcggggac cgcacctgtg
2161 ctgtaagctc agtgtgggtg gcgcggggcc cgcggggtct tccctgagtg caaaggcggt
2221 cagggtgggc agagacgagg tgggcaaagc cctgccccag ccaagggagc aaggtggatg
2281 cacaaagagt gggccctgtg accagctgga cagagccagg gactgcggga gaccaggggg
2341 agcatagggt tggagtgggt ggtggatggt ggggctaatg ccttcatggc cacgcgcacg
2401 tgcccgtccc acccccaggg gtgttcctgg cgcgctatgg gcccgcgtgg cgcgagcaga
2461 ggcgcttctc cgtctccacc ttgcgcaact tgggcctggg caagaagtcg ctggagcagt
2521 gggtgaccga ggaggccgcc tgcctttgtg ccgccttcgc caaccactcc ggtgggtgat
2581 gggcagaagg gcacaaagcg ggaactggga aggcggggga cggggaaggc gaccccttac
2641 ccgcatctcc cacccccaag acgccccttt cgccccaacg gtctcttgga caaagccgtg
2701 agcaacgtga tcgcctccct cacctgcggg cgccgcttcg agtacgacga ccctcgcttc
2761 ctcaggctgc tggacctagc tcaggaggga ctgaaggagg agtcgggctt tctgcgcgag
2821 gtgcggagcg agagaccgag gagtctctgc agggcgagct cccgagaggt gccggggctg
2881 gactggggcc tcggaagagc aggatttgcg tagatgggtt tgggaaagga cattccagga
2941 gaccccactg taagaagggc ctggaggagg aggggacatc tcagacatgg tcgtgggaga
3001 ggtgtgcccg ggtcaggggg caccaggaga ggccaaggac tctgtacctc ctatccacgt
3061 cagagatttc gattttaggt ttctcctctg ggcaaggaga gagggtggag gctggcactt
3121 ggggagggac ttggtgaggt cagtggtaag gacaggcagg ccctgggtct acctggagat
3181 ggctggggcc tgagacttgt ccaggtgaac gcagagcaca ggagggattg agaccccgtt
3241 ctgtctggtg taggtgctga atgctgtccc cgtcctcctg catatcccag cgctggctgg
3301 caaggtccta cgcttccaaa aggctttcct gacccagctg gatgagctgc taactgagca
3361 caggatgacc tgggacccag cccagccccc ccgagacctg actgaggcct tcctggcaga
3421 gatggagaag gtgagagtgg ctgccacggt ggggggcaag ggtggtgggt tgagcgtccc
3481 aggaggaatg aggggaggct gggcaaaagg ttggaccagt gcatcacccg gcgagccgca
3541 tctgggctga caggtgcaga attggaggtc atttgggggc taccccgttc tgtcccgagt
3601 atgctctcgg ccctgctcag gccaagggga accctgagag cagcttcaat gatgagaacc
3661 tgcgcatagt ggtggctgac ctgttctctg ccgggatggt gaccacctcg accacgctgg
3721 cctggggcct cctgctcatg atcctacatc cggatgtgca gcgtgagccc atctgggaaa
3781 cagtgcaggg gccgagggag gaagggtaca ggcgggggcc catgaacttt gctgggacac
3841 ccggggctcc aagcacaggc ttgaccagga tcctgtaagc ctgacctcct ccaacatagg
3901 aggcaagaag gagtgtcagg gccggacccc ctgggtgctg acccattgtg gggacgcatg
3961 tctgtccagg ccgtgtccaa caggagatcg acgacgtgat agggcaggtg cggcgaccag
4021 agatgggtga ccaggctcac atgccctaca ccactgccgt gattcatgag gtgcagcgct
4081 ttggggacat cgtccccctg ggtgtgaccc atatgacatc ccgtgacatc gaagtacagg
4141 gcttccgcat ccctaaggta ggcctggcgc cctcctcacc ccagctcagc accagcccct
4201 ggtgatagcc ccagcatggc tactgccagg tgggcccact ctaggaaccc tggccaccta
4261 gtcctcaatg ccaccacact gactgtcccc acttgggtgg ggggtccaga gtataggcag
4321 ggctggcctg tccatccaga gcccccgtct agtggggaga caaaccagga cctgccagaa
4381 tgttggagga cccagcgcct gcagggagag ggggcagtgt gggtgcctct gagaggtgtg
4441 actgcgccct gctgtggggt cggagagggt actgtggagc ttctcgggcg caggactagt
4501 tgacagagtc cagctgtgtg ccaggcagtg tgtgtccccc gtgtgtttgg tggcaggggt
4561 cccagcatcc tagagtccag tccccactct caccctgcat ctcctgccca gggaacgaca
4621 ctcatcacca acctgtcatc ggtgctgaag gatgaggccg tctgggagaa gcccttccgc
4681 ttccaccccg aacacttcct ggatgcccag ggccactttg tgaagccgga ggccttcctg
4741 cctttctcag caggtgcctg tggggagccc ggctccctgt ccccttccgt ggagtcttgc
4801 aggggtatca cccaggagcc aggctcactg acgcccctcc cctccccaca ggccgccgtg
4861 catgcctcgg ggagcccctg gcccgcatgg agctcttcct cttcttcacc tccctgctgc
4921 agcacttcag cttctcggtg cccactggac agccccggcc cagccaccat ggtgtctttg
4981 ctttcctggt gaccccatcc ccctatgagc tttgtgctgt gccccgctag aatggggtac
5041 ctagtcccca gcctgctccc tagccagagg ctctaatgta caataaagca atgtggtagt
5101 tccaactcgg gtcccctgct cacgccctcg ttgggatcat cctcctcagg gcaaccccac
5161 ccctgcctca ttcctgctta ccccaccgcc tggccgcatt tgagacaggg gtatgttgag
5221 gctgagcaga tgtcagttac ccttgcccat aatcccatgt cccccactga cccaactctg
5281 actgcccaga ttggtgacaa ggactacatt gtcctggcat gtggggaagg ggccagaatg
5341 ggctgactag aggtgtcagt cagccctgga tgtggtggag agggcaggac tcagcctgga
5401 ggcccatatt tcaggcctaa ctcagcccac cccacatcag ggacagcagt cctgccagca
5461 ccatcacaac agtcacctcc cttcatatat gacaccccaa aac
//
LOCUS CHKCOLCARB 1394 bp ss-mRNA VRT 03-JUL-1990
DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan mRNA, 5' end,
clone 7 and 13.
ACCESSION M28659 J05129
KEYWORDS IX collagen-proteoglycan; extracellular matrix protein.
SOURCE Chicken 17 day old embryo cartilage, cDNA to mRNA, clones 7 and 13.
ORGANISM Gallus domesticus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 1394)
AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R.
TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from
the use of two widely separated promoters
JOURNAL J. Biol. Chem. 264, 20033-20041 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
B.R.Olsen, 12-SEP-1989, for release after publication.
FEATURES from to/span description
pept 147 > 1394 alpha(IX) collagen-proteoglycan
BASE COUNT 372 a 341 c 358 g 323 t
ORIGIN
1 tccctccccg ctgactgcgt ggggcaggag gagcattctg cacccattca tactctcgtt
61 aacaggactt atgacaggga accagagagt gtgaatatat acaccaaata ttcacatgtg
121 agacgtgaag aaaaccagca gagaagatga aaagcaactg gaaaattaca gctttcttgt
181 atatgtgtag ttttctgggg tctttcatct cagctaccta ccagcaacaa tcaagattgc
241 cagtcattct gggtgctcgt caaagaactg atctctgccc aacaatcagg attggcgaag
301 atgacttgcc aggctttgac ctgatttctc agttccagat agaaaaagct gcttctcaag
361 gaattgtcca gagagtagtg ggttctactg ctctacaagt ggcttataaa ttgggaccca
421 atgtagactt caggattcca accagtgcaa tatattccaa tggattgcct gatgaatact
481 cctttcttac tacttttcgg atgactggag ccacacttca gaaatactgg actatttggc
541 agattcagga ttcttcagga aaagaacaag ttggagtgaa tctcaatggt ccaatgaaaa
601 gcgttgagtt ttcttataaa ggagtggatg gaagtctcca gactgcatca tttttacatt
661 tgcctttctt gtttgattcc caatggcaca agcttatgat aagtgtggaa acaaccagcg
721 ttacactttt tattgactgt ataaaggtag aaaccctaaa cataaaacca aaggggaaaa
781 tcagtgttga tggcttctca gtgcttggaa gactcaaaaa taatcctcaa atttcagttc
841 cgtttgaagt ccagtggatg ccgattcact gcgatcccct gcggccccag agagaaggtt
901 gtggtgagct cccagcccgg ataagccaga cagtgattga gagaggtctt cctggtccac
961 caggcccccc aggtccacca gggccaccag gagttcctgg cattgatggc atcgatggag
1021 agagaggacc taacggcccc cccggtccac cgggtccgga cggcgacgca ggcaaagcgg
1081 gatccccggg cctgcctgga gagccaggag ctgatgggtt aacaggccct gatggatcac
1141 caggtgccac aggaccgaaa ggacagaagg gtgagccagg acctccaggt gctcgtggac
1201 ttccgggcaa gggtcttctt ggaccacccg gtccagctgg tgctgcagga cttcccggtg
1261 aagtaggccg tgctggccca cctggtgatc caggaaaaag gggaccacca ggaccaccag
1321 gaccaccagg ccctcgagga acaattggtc tgcaagacgg tgacccattg tgtcccaatg
1381 cttgtccacc tggc
//
LOCUS CHKCOLCARC 776 bp ss-mRNA VRT 03-JUL-1990
DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan mRNA, 5' end,
clone YM43.
ACCESSION M28660 J05129
KEYWORDS IX collagen-proteoglycan; extracellular matrix protein.
SOURCE Chicken 17 day old embryo cartilage, cDNA to mRNA, clone YM43.
ORGANISM Gallus domesticus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 776)
AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R.
TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from
the use of two widely separated promoters
JOURNAL J. Biol. Chem. 264, 20033-20041 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
B.R.Olsen, 12-SEP-1989, for release after publication.
FEATURES from to/span description
pept 255 > 776 alpha(IX) collagen-proteoglycan
BASE COUNT 119 a 319 c 235 g 103 t
ORIGIN
1 gaattcccga cacccccacc tgcatcaccc cccccccatc tcgcagtccc tcgcccccat
61 caaagcccct ttgtgccacc tccgtcgcca cccggcccca gaatagcagc acgctcacct
121 gcaggggggg tcggagccag cgcctgccct cgtcccccgc tgctccatat taatcagccc
181 cttcctcctc ctcctcctcc tcctcctcct gccggtccct ccgcagtccg acacttacag
241 ccccgctccc ggccatggcc caccgcagcc ccgcgctctg cctgctgctc ctgcacgctg
301 cctgcctctg cctggcccag ctccgggggc caccaggaga gcccggccca cgagggcccc
361 caggtccgcc aggagtgccg ggagcggatg gcattgatgg tgacaaaggc tctcccggag
421 cccccggctc cccaggtgcc aaaggggagc ccggagcccc gggtccggat gggcctccag
481 ggaagccagg cttagacggt cttacgggag ccaaagggag ccggggccca tggggggggc
541 aaggactgaa gggtcagcct ggactgccgg ggccgccggg gctccccggt ccctcgctgc
601 caggaccacc cgggctgcca ggccaggtcg gactgcccgg ggagatcgga gtgccaggac
661 ccaagggcga tcctggaccc gatggcccac ggggcccccc gggtccccca gggaaacccg
721 gccccccagg acacatccaa ggagtggagg gaagcgcaga tttcttgtgc ccgacc
//
LOCUS CHKCOLCOR 602 bp ss-mRNA VRT 03-JUL-1990
DEFINITION Chicken cornea alpha-1(IX) collagen-proteoglycan mRNA, 5' end.
ACCESSION M28658 J05129
KEYWORDS IX collagen-proteoglycan; extracellular matrix protein.
SOURCE Chicken 8 day old embryo cornea, cDNA to mRNA, clone IN212.
ORGANISM Gallus domesticus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 602)
AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R.
TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from
the use of two widely separated promoters
JOURNAL J. Biol. Chem. 264, 20033-20041 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
B.R.Olsen, 12-SEP-1989, for release after publication.
FEATURES from to/span description
pept 89 > 602 alpha(IX) collagen-proteoglycan
BASE COUNT 121 a 175 c 186 g 120 t
ORIGIN
1 tgcagctgaa aggtgaactg ggctgtaagg cacattttgg atttctgtgt attgtagcac
61 ctaggtggct gcaaaatctg tccccactat ggcctgggct gcatggggcc ctctgcttct
121 cgggcttttc ttgcagattt tttgcctctg ccttgctcaa agaggtcttc ctggtccacc
181 aggcccccca ggtccaccag ggccaccagg agttcctggc attgatggca ttgatggaga
241 gagaggacct aacggccccc ccggtccacc gggtccggac ggcgacgcag gcaaagcggg
301 atccccgggc ctgcctggag agccaggagc tgatgggtta acaggccctg atggatcacc
361 aggtgccaca ggaccgaaag gacagaaggg tgagccagga cctccaggtg ctcgtggacc
421 tccgggcaag ggtcttcttg gaccacctgg tccagctggt gctgcaggac ttcccggtga
481 agtaggccct gctggcccac ctggtgatcc aggaaaaagg ggaccaccag gaccaccagg
541 accaccaggc cctcgaggaa caattggtct gcaagatggt gacccattgt gtcccaatgc
601 tt
//
LOCUS CHKCOLG1 840 bp ds-DNA VRT 03-JUL-1990
DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan gene, exon 1,
and cornea alpha-1(IX) collagen-proteoglycan gene, 5' flank.
ACCESSION M28662 J05129
KEYWORDS IX collagen-proteoglycan; extracellular matrix protein.
SEGMENT 1 of 2
SOURCE Chicken DNA.
ORGANISM Gallus domesticus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 840)
AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R.
TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from
the use of two widely separated promoters
JOURNAL J. Biol. Chem. 264, 20033-20041 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
B.R.Olsen, 12-SEP-1989, for release after publication.
FEATURES from to/span description
pept 807 / 820 cartilage alpha(IX) collagen-proteoglycan,
exon 1
pre-msg 661 > 840 cartilage alpha(IX) collagen-proteoglycan
IVS 821 > 840 cartilage alpha(IX) collagen-proteoglycan
intron A
BASE COUNT 249 a 202 c 157 g 232 t
ORIGIN
1 ccacccgtga gaattcctca agtgaaaatg caaatgaaca gaaattataa attgttcaga
61 aactgagtat atgttctcca aatttctctg aacgaggccc ctctctttgg aaagtataat
121 gtgtgtgtga ataacaactg aacaacagga gtcctcttag taatgcctat gtgcattcct
181 tgaaaaggtt caagtttaag cagtaaaagt ccttttaaat aattggtttt attcagaaga
241 atcaactagg acactaccag ataggcttct ccagagacct tctgatggat aaatcaacaa
301 gaactgaaaa tatcttcttt ataggactga tgttcttttc ttgtgaaagt ttttagcttt
361 aacaccacag tgaagccacc agtttccaca aaatcccttg gtacatgtta ttattctttt
421 atctgcctca ctgaacagtg cccctgccat ttggtgactg gcatcgctta actcatatag
481 tgttaatctt tctaccctga tgtcggcata agcagcaccc ctttcttcac tctcttggct
541 tctttatatt cagctggctc cagagatccg ccctcagacc ccaccaggat acagacgtct
601 gtccagcccc cacctccttc cctttgcaag attaaaacca acccagcagc ctgcacctcc
661 ctccccgctg agtcctgcgt ggggcaggag gagcattctg cacccattca tactctcgtt
721 aacaggactt atgacaggga accagagagt gtgaatatat acaccaaata ttcacatgtg
781 agacgtgaag aaaaccagca gagaagatga aaagcaactg gtaagagaac aagtgggatt
//
LOCUS CHKCOLG2 840 bp ds-DNA VRT 03-JUL-1990
DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan gene, exons 6
and 7, and cornea alpha-1(IX) collagen-proteoglycan gene, exon 1.
ACCESSION M28661 J05129
KEYWORDS IX collagen-proteoglycan; extracellular matrix protein.
SEGMENT 2 of 2
SOURCE Chicken DNA, clones 13 and 26.
ORGANISM Gallus domesticus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 840)
AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R.
TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from
the use of two widely separated promoters
JOURNAL J. Biol. Chem. 264, 20033-20041 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
B.R.Olsen, 12-SEP-1989, for release after publication.
The first amino acid for the open reading frame for exon 6 which is
indicated in the features as starting at nucleotide 9 could start
at nucleotide 11.
FEATURES from to/span description
pept 636 / 707 cornea alpha(IX) collagen-proteoglycan, exon 1
pept / 9 86 cartilage alpha(IX) collagen-proteoglycan,
exon 6 (AA at 9)
750 / 770 cartilage alpha(IX) collagen-proteoglycan,
exon 7
pre-msg 541 > 840 cornea alpha(IX) collagen-proteoglycan mRNA
and introns
pre-msg < 1 > 840 cartilage alpha(IX) collagen-proteoglycan mRNA
and introns
IVS < 1 8 cartilage alpha(IX) collagen-proteoglycan
intron E
IVS 87 749 cartilage alpha(IX) collagen-proteoglycan
intron F
IVS 771 > 840 cartilage alpha(IX) collagen-proteoglycan
intron G
IVS 708 > 840 cornea alpha(IX) collagen-proteoglycan intron
A
BASE COUNT 181 a 214 c 222 g 223 t
ORIGIN
1 cctaccagtt tgaagtccag tggatgctga ttcactgcga tcccctgcgg ccccagagag
61 aaggttgtgg tgagctccca gcccgggtga cccgcgttcc cagcctgaca gtgctgaact
121 gggctgccac taaatctatg aagttcacag gagcttcatt tttccccgtc tatgtccaga
181 gaagtctatt tcaccatacc tgactgaaat ttggtgcctt tagcaatcca gccccctgga
241 gtagcagcct tactttaact cttccatgcc ttcctatctt ttccttctca gccagtgcta
301 gggtcagagg cttttgaaag atatccctga cagcgaagag agactgctgt ctccttgcag
361 actcctgggc aacctgaggg agggaaaccc ttgcctggga ggtgagggag ggtgccaaaa
421 caacagcgag cagggcaaag ggttaaaggt actgctgtca ttcaatcctc ttcctcccag
481 ccttcagctc tcctccaatc ccacgaccct ctcccaggca gttaataagg aactgtgagg
541 ggtgccttgc agctgaaagg tgaactgggc tgtaaggcac attttggatt tctgtgtatt
601 gtagcaccta ggtggctgca aaatctgtcc ccactatggc ctgggctgca tggggccctc
661 tgcttctcgg gcttttcttg cagatttttt gcctctgcct tgctcaagta agtttattct
721 gactttatac ctgtttttct cccttacaga taagccagac agtgattgag gtaagtgtga
781 gggaagggat ggtgctgcat cgtaagggaa agggtttgga tgaagagggg ctgaaggctg
//
LOCUS RATIRF1A 2078 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Rat interferon regulatory factor 1 (IRF-1) mRNA, complete cds.
ACCESSION M34253
KEYWORDS interferon regulatory factor 1; transcription factor.
SOURCE Rat cell line Nb2-11c T-cell, cDNA to mRNA, clones 25,4b.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2078)
AUTHORS Yu-Lee,L.-Y., Hrachovy,J.A., Stevens,A.M. and Schwarz,L.A.
TITLE Interferon regulatory factor 1 is an immediate-early gene under
transcriptional regulation by prolactin in Nb2 T cells
JOURNAL Mol. Cell. Biol. 3087, 3094 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by L.-Y.Yu-Lee, 11-MAY-1990.
FEATURES from to/span description
pept 198 1184 interferon regulatory factor 1 (IRF-1)
BASE COUNT 542 a 544 c 540 g 452 t
ORIGIN
1 ctcgacgaag gagtaggacg agctctcact gtctgagcca aaccgaaccg ggccgagctg
61 agccgaggtc agcggtggcc agaggaaccc agcatctcgg gcatcattcg ctccgtgcac
121 gcatcgtgta cctacaccgc aactccgtgc ctcattcccg ggtaccctct gtgactcgct
181 cctgcagcaa agccaccatg cctatcactc ggatgcgaat gagaccctgg ctagagatgc
241 agattaattc caaccaaatt ccagggctga gctggatcaa taaagaagag atgatcttcc
301 agatcccatg gaagcatgct gccttgcacg gttgggatat caacaaggat gcctgtctgt
361 tccggagctg ggccattcac acaggccgat acaaagctgg ggaaaaagag ccagatccca
421 agacttggaa ggcaaacttc cggtgtgcca tgaactccct accagacatc gaggaagtga
481 aggaccagag caggaacaag ggcagctctg ctgtacgcgt gtaccggatg ctgccacccc
541 tcaccaagaa ccagaggaaa gagagaaagt ccaagtccag ccgtgacact aagagcaaaa
601 ccaagaggaa gctgtgcgga gattctagcc ctgacacctt atctgacgga ctgagcagct
661 ctactctgcc tgatgaccac agcagttaca cagctcaggg atacctgggt caggacttgg
721 acatggacag ggacattacc ccagctctgt caccgtgcgt cgtcagcagc agtctctctg
781 agtggcatat gcagatggac atcatgccag acagcaccac tgatctgtac aacttgcagg
841 tgtcgcccat gccctccacc tctgaagctg caacagatga ggatgaggaa gggaagttac
901 ctgaggacat catgaagctc tttgaacagt ctgagtggca gccgacgcac gtggatggca
961 agggatactt gctcaatgaa ccaggagccc aactctctac tgtctatgga gacttcagct
1021 gcaaggagga accagagatc gacagccctg gaggggacat cgagataggc atacagcgtg
1081 tcttcacaga gatgaagaat atggaccccg tcatgtggat ggacaccctg ctgggcaact
1141 ctaccaggcc gccctccatt caggctattc cttgtgcacc ataatttggg tccctgaccc
1201 gttcttgccc tcctgagtga gctaggtcca gcatcatggt ggctgtgata caacataaag
1261 ctaaacttcc gtggacccct tgatgtggca aaacataatc ccattgccaa gcagggaagg
1321 gaccaaacca tcctccttgg gtcagtggac tgactcttca gagcttagga ggcagggtct
1381 aagtttttca agctggtcct gactcctagg aagatggatt ggcgttctga ggttagtgtg
1441 aggcagagga cctggacgga agttaccttc tagctctttg aaagcttcat tgcttagaga
1501 gggtctcacc actgggctgg cctgggggat agaccagcgc ccacagaaga gcattgcact
1561 ggccttaggg ctggctccac actgggagac aattgcacta agtcctattc ccaaagaact
1621 gctgcccttc ccaaccgagc cctgggatgg ttctagagcc agtgaaatgt gaaggaaaaa
1681 atggggtcct gtgagggttg tctcccttag cctcagaggg attctgcctc actccctgct
1741 ccagctgtgg ggctcaggaa aaaaaaatgg cactttctct gtggactttg ccacatttct
1801 gatcagaagt gtacactaac atttctccca agtcttggcc tttgcattta tttatatagt
1861 gccttgccct gtgcctgctg tctctcctca ggcctcagca gtcctcagca ggcccaggga
1921 gggggttgtg agcgccttgg cgtgactctg aacattggaa acgccaccta actactaagt
1981 tgtgtctgat ctcgtgtgga tctgtgtaaa tatgtatatt catcttttta taaaaaccta
2041 agttgtttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS HUMINTB1A 1146 bp ss-mRNA PRI 03-JUL-1990
DEFINITION Human integrin beta-1 subunit mRNA, 3' end (cytoplasmic domain).
ACCESSION M34189
KEYWORDS integrin; integrin beta-1 subunit cytoplasmic domain.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1146)
AUTHORS Altruda,F., Cervella,P., Tarone,G., Botta,C., Balzac,F.,
Stefanuto,G. and Silengo,L.
TITLE A human integrin beta-1 subunit with a unique cytoplasmic domain
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by F.Altruda, 10-MAY-1990.
FEATURES from to/span description
pept < 1 117 integrin beta-1 subunit (AA at 1) (cytoplasmic
domain)
BASE COUNT 373 a 184 c 222 g 367 t
ORIGIN
1 aagcttttaa tgataattca tgacagaagg gagtttgcta aatttgaaaa ggagaaaatg
61 aatgccaaat gggacacggt aagttacaaa acatccaaaa agcaaagtgg cttataaagt
121 aaatgtaata ctcctaagac ttatgtatta gctgtcaggc tgattattaa agtcctttct
181 aagtatttta ttcccccaaa agtttcttac tcaaggaatt tgcatttagt gaaaaacaga
241 aagcatccta aatatatccc attgaaacaa aacattgatt ataagcatgt atattctggt
301 tcatgtggcc gatattttta tttctttaat gattttgatc ctaaatctgc cttttcatct
361 aatgtgaagt agaatcctaa ataatgttat ctgtgtagca agctattcaa tgggaaagct
421 gcttctttct ttaaaacaaa caaacaaaaa aaaccttcag tggaaagcca aattccaaaa
481 ggttatatac caagcttgtc caactcgcag ctcgtcggcc aggacatgca gcccagaata
541 gctttgaatg tggccccaac acaaatttgt aaactttctt agaaattgta attattatta
601 ttattttttt ttggtaactt tttttaaagc tcatcagcta tcgttagtgt attttatgtg
661 tggcccaaga cagttcttct tcttgccagt gtggcccagg gaagccaaaa gattggacac
721 ccctgctata tactatatga ttccatttag aggacattct ggaaaagcaa aactgtaggg
781 gcaaaaatca gtggttgcta ggggctggaa tgggggaaag tgttgaccac agaggggcgt
841 aagggatctt ccttgggatg acttgattgt gggtggattt atgtatttga aaactcacag
901 aactatgtac tttaaaaaga tgtatgttcc tctatgaaaa ttatatctca gtaaactttg
961 gcttataaaa atcttaaaag ccctaagtga ccgaaaggtt atgttagcat tgagtgcttt
1021 gaaatatgga gtcagagggt ggggtaacca aatgttggcc tttgtgtatt catcttttga
1081 tacaagaaag caatgccaat cttcagtatt tttaaattgt aaatgaattt tgtagttccc
1141 gaattc
//
LOCUS NEUAMTR 5928 bp ds-DNA PLN 03-JUL-1990
DEFINITION N.crassa mating type protein gene, complete cds.
ACCESSION M33876
KEYWORDS mating type protein.
SOURCE N.crassa (strain 74-ORS-A) DNA.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 5928)
AUTHORS Grotelueschen,J., Metzenberg,R.L. and Glass,N.L.
TITLE The Neurospora crassa A mating type region
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by N.L.Glass, 26-APR-1990.
FEATURES from to/span description
pept 4121 4283 mating type protein, exon 1
4343 5046 mating type protein, exon 2
IVS 4284 4342 mating type protein intron A
signal 3858 3906 GC signal
signal 3945 3949 CAAT box
signal 5319 5326 poly-A signal
BASE COUNT 1470 a 1459 c 1526 g 1473 t
ORIGIN
1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg
61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga
121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct
181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct
241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag
301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa
361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga
421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa
481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg
541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg
601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg
661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc
721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg
781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc
841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg
901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc
961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat
1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa
1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc
1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg
1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt
1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa
1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc
1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca
1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat
1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac
1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca
1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta
1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga
1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt
1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa
1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac
1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc
1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc
2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg
2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga
2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg
2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca
2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca
2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc
2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca
2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat
2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag
2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc
2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca
2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg
2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag
2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg
2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata
2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg
3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca
3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca
3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt
3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt
3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca
3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag
3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac
3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta
3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca
3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac
3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg
3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca
3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt
3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt
3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc
3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa
3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc
4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt
4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat
4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc
4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt
4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat
4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag
4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat
4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg
4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg
4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg
4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag
4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat
4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt
4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct
4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc
4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat
4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg
5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca
5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc
5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg
5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg
5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg
5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg
5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca
5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa
5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct
5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg
5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag
5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg
5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc
5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc
5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac
//
LOCUS HUMNCADH 3451 bp ss-mRNA PRI 03-JUL-1990
DEFINITION Human N-cadherin mRNA, complete cds.
ACCESSION M34064
KEYWORDS N-cadherin; cell adhesion molecule; transmembrane protein.
SOURCE Human muscle, cDNA to mRNA, clones lambda-[4-10,1-5,13,14].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3451)
AUTHORS Walsh,F.S., Barton,C.H., Putt,W., Moore,S.E., Kesell,D.,
Spurr,N. and Goodfellow,P.N.
TITLE The N-cadherin gene maps to human Chromosome 18 and is not linked
to the E-cadherin gene
JOURNAL J. Neurochem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by C.H.Barton, 08-MAY-1990.
FEATURES from to/span description
pept < 1 2247 N-cadherin (AA at 1)
BASE COUNT 1041 a 703 c 724 g 983 t
ORIGIN Chromosome 18.
1 gactgggtca tccctccaat caacttgcca gaaaactcca ggggaccttt tcctcaagag
61 cttgtcagga tcaggtctga tagagataaa aacctttcac tgcggatacg tgtaactggg
121 ccaggagctg accagcctcc aactggtatc ttcattctca accccatctc gggtcagctg
181 tcggtgacaa agcccctgga tcgccagcag aatgcccggt ttcatttagg ggcacatgca
241 gtagatatta atggaaatca agtggagacc cccattgaca ttgtcatcaa tgttattgac
301 atgaatgaca acagacctga gttcttacac caggtttgga atgggacagt tcctgaggga
361 tcaaagcctg gaacatatgt gatgaccgta acagcaattg atgctgacga tcccaatgcc
421 ctcaatggga tgttgaggta cagaatcgtg tctcaggctc caagcacccc ttcacccaac
481 atgtttacaa tcaacaatga gactggtgac atcatcacag tggcagctgg acttgatcga
541 gaaaaagtgc aacagtatac gttaataatt caagctacag acatggaagg caatcccaca
601 tatggccttt caaacacagc cacggccgtc atcacagtga cagatgtcaa tgacaatcct
661 ccagagttta ctgccatgac gttttatggt gaagttcctg agaacagggt agacatcata
721 gtagctaatc taactgtgac cgataaggat caaccccata caccagcctg gaacgcagtg
781 tacagaatca gtggcggaga tcctactgga cggttcgcca tccagaccga cccaaacagc
841 aacgacgggt tagtcaccgt ggtcaaacca atcgactttg aaacaaatag gatgtttgtc
901 cttactgttg ctgcagaaaa tcaagtgcca ttagccaagg gaattcagca cccgcctcag
961 tcaactgcaa ccgtgtctgt tacagttatt gacgtaaatg aaaaccctta ttttgccccc
1021 aatcctaaga tcattcgcca agaagaaggg cttcatgccg gtaccatgtt gacaacattc
1081 actgctcagg acccagatcg atatatgcag caaaaatatt taagatacac taaattatct
1141 gatcctgcca attggctaaa aatagatcct gtgaatggac aaataactac aattgctgtt
1201 ttggaccgag aatcaccaaa tgtgaaaaac aatatatata atgctacttt ccttgcttct
1261 gacaatggaa ttcctcctat gagtggaaca ggaacgctgc agatctattt acttgatatt
1321 aatgacaatg cccctcaagt gttacctcaa gaggcagaga cttgcgaaac tccagacccc
1381 aattcaatta atattacagc acttgattat gacattgatc caaatgctgg accatttgct
1441 tttgatcttc ctttatctcc agtgactatt aagagaaatt ggaccatcac tcggcttaat
1501 ggtgattttg ctcagcttaa tttaaagata aaatttcttg aagctggtat ctatgaagtt
1561 cccatcataa tcacagattc gggtaatcct cccaaatcaa atatttccat cctgcgcgtg
1621 aaggtttgcc agtgtgactc caacggggac tgcacagatg tggacaggat tgtgggtgcg
1681 gggcttggca ccggtgccat cattgccatc ctgctctgca tcatcatcct gcttatcctt
1741 gtgctgatgt ttgtggtatg gatgaaacgc cgggataaag aacgccaggc caaacaactt
1801 ttaattgatc cagaagatga tgtaagagat aacattttaa aatatgatga agaaggtgga
1861 ggagaagaag accaggacta tgacttgagc cagctgcagc agcctgacac tgtggagcct
1921 gatgccatca agcctgtggg aatccgacga atggatgaaa gacccatcca cgccgagccc
1981 cagtatccgg tccgatctgc agccccacac cctggagaca ttggggactt cattaatgag
2041 ggccttaaag cggctgacaa tgaccccaca gctccaccat atgactccct gttagtgttt
2101 gactatgaag gcagtggctc cactgctggg tccttgagct cccttaattc ctcaagtagt
2161 ggtggtgagc aggactatga ttacctgaac gactgggggc cacggttcaa gaaacttgct
2221 gacatgtatg gtggaggtga tgactgaact tcagggtgaa cttggttttt ggacaagtac
2281 aaacaatttc aactgatatt cccaaaaagc attcagaagc taggctttaa ctttgtagtc
2341 tactagcaca gtgcctgctg gaggctttgg cataggctgc aaaccaattt gggctcagag
2401 ggaatatcag tgatccatac tgtttggaaa aacactgagc tcagttacac ttgaatttta
2461 cagtacagaa gcactgggat tttatgtgcc tttttgtacc tttttcagat tggaattagt
2521 tttctgttta aggctttaat ggtactgatt tctgaaacga taagtaaaag acaaaatatt
2581 ttgtggtggg agcagtaagt taaaccatga tatgcttcaa cacgcttttg ttacattgca
2641 tttgctttta ttaaaataca aaattaaaca aacaaaaaaa ctcatggagc gattttatta
2701 tcttggggga tgagaccatg agattggaaa atgtacatta cttctagttt tagactttag
2761 tttgtttttt ttttttttca ctaaaatctt aaaacttact cagctggttg caaataaagg
2821 gagttttcat atcaccaatt tgtagcaaaa ttgaattttt tcataaacta gaatgttaga
2881 cacattttgg tcttaatcca tgtacacctt tttatttctg tatttttcca cttcactgta
2941 aaaatagtat gtgtacataa tgttttattg gcatacgtct atggagaagt gcagaaactt
3001 cagaacatgt gtatgtatta tttggactat ggattcaggt tttttgcatg tttatatctt
3061 tcgttatgga taaagtattt acaaaacagt gacatttgat tcaattgttg agctgtagtt
3121 agaatactca atttttaatt tttttaattt ttttattttt tattttcttt ttggtttggg
3181 gagggagaaa agttcttagc acaaatgttt tacataattt gtaccaaaaa aaaaaaaaaa
3241 ggaaaggaaa gaaaggggtg gcctgacact ggtggcacta ctaagtgtgt gtttttttaa
3301 aaaaaaaatg gaaaaaaaaa agcctttaaa ctggagagac ttctgacaac agctttgcct
3361 ctgtattgtg taccagaata taaatgatac acctctgacc ccagcgttct gaataaaatg
3421 ctaattttgg ataacaaaaa aaggggaatt c
//
LOCUS MHVNSGII 870 bp ss-RNA VRL 03-JUL-1990
DEFINITION Murine hepatitis virus non-structural protein gene-2 (NS2).
ACCESSION M34035
KEYWORDS non structural protein.
SOURCE Murine hepatitis virus (strain MHV-JHM), cDNA to viral RNA.
ORGANISM Murine hepatitis virus A59
Unclassified.
REFERENCE 1 (bases 1 to 870)
AUTHORS Schwarz,B., Routledge,E. and Siddell,S.G.
TITLE The coronavirus MHV 30 kDa non-structural protein NS2 is not
essential for virus replication in transformed murine cells
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.G.Siddell, 04-MAY-1990.
Author address: S.G.Siddell
Inst of Virology
Univ of Wuerzburg
Versbacherstrasse 7
8700 Wuerzburg
FEATURES from to/span description
pept 40 837 non-structural protein-2 (NS2)
BASE COUNT 269 a 147 c 187 g 267 t
ORIGIN
1 gcgatagcct agtaaatgtt aaataaatct atacttgtca tggctgcgag aatggccttt
61 gctgacaagc ctaatcattt tataaacttt cctctagccc aatttagtgg ctttatgggt
121 aagtatttaa agcttcagtc tcaacttgtg gaaatgggtt tggactgtaa attacaaaag
181 gtaccacatg ttagtattac cctgcttgac attaaagcag accaatacaa acaggtggaa
241 tttgcaatac aagaaataat agatgatctg gcggcatatg agggagatat tgtctttgac
301 aaccctcata tgcttggcag atgtcttgtt cttgatgtta aaggatttga agagttgcat
361 gaagatattg ttgaaattct ccgcagaagg ggttgcactg cagatcaatc cagacaatgg
421 attccgcact gcactgtggc ccaatttgat gaagaaaaag aaataaaaga aatgcaattc
481 tattttaaat tgcccttcta tctcaagcat aacaacctac ttacggatgc taggcttgag
541 cttgtgaaga taggttcttc caaagtaggt gggttttatt gtagtgaact aagtatttgg
601 tgtggtgaga gactttgtta caagccccca acccccaaat tcagtgatat atttggctat
661 tgctgcatag ataaaatacg tggtgattta gaaataggag acctaccgcc agatgatgag
721 gaagcgtggg ccgagctaag ttaccactat caaagaaaca cctacttctt cagacatgtg
781 cacgataata gtatctattt tcgtaccgta tgtagaatga agggttgtat gtgttgattt
841 gtttttacac tattagtgta ataaacttat
//
LOCUS MCAMV6 1904 bp ds-DNA VRL 03-JUL-1990
DEFINITION Cauliflower mosaic virus (CaMV) gene six protein gene, complete
cds.
ACCESSION M23620
KEYWORDS gene six protein.
SOURCE Cauliflower mosaic virus (strain D4) DNA.
ORGANISM Cauliflower mosaic virus
Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE 1 (sites)
AUTHORS Daubert,S. and Routh,J.
TITLE Determinants of symptomatology in the DNA sequence CaMV
JOURNAL mol plant microb interact (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1904; for [1])
AUTHORS Daubert,S. and Routh,J.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1], [2] kindly submitte
by S.Daubert, 08-AUG-1989, for release after publication.
FEATURES from to/span description
pept 1 1563 gene six protein
signal 1629 1635 TATA box
signal 1547 1550 CCAT enhancer 1
BASE COUNT 645 a 450 c 379 g 430 t
ORIGIN bps 5774 to 7678 of genome.
1 atggagaaca tagaaaaact cctcatgcaa gagaaaatac taatgctaga gctcgatcta
61 gtaaaagcaa aaataagctt agcaagagct aacggctctt cgcaacaagg agaactctct
121 ctccaccgtg aaacaccgga aaaagaagaa gcagttcatt ctgcactggc cacttttacg
181 ccaacccaag taaaagctat tccagagcaa acggctcctg gtaaagaatc aacaaatccg
241 ttgatggcta gtatcttgcc aaaagatatg aattcagttc agactgaaat taggctcaaa
301 aggccatcgg acttcttacg tccttatcag ggaatttcaa tcccacaaaa atctgagctt
361 aacagcacag ttactcttca cggagtagaa tcgggtattc aacaccctca tatcaactac
421 tacgttgtgt ataacggtcc acacgccggt atatacgatg actggggttg tacaaaggcg
481 gcaacaaacg gcgttcccgg agttgcacaa aagaagtttg ccactattac agaggcaaga
541 gcagcagctg acgcatacac aacaagtcag caaacagaca ggttgaactt catccccaaa
601 ggagaagctc aactcaagcc caagagcttt gcgaaggcct taaccagccc atcaaagcaa
661 aaagcccact ggctcacgct aggaaccaaa aggcccagca gtgatccagc cccaaaagag
721 atctcctttg ccccggagat caccatggac gactttctct atctctacga tctaggaaga
781 aagttcgacg gagaaggtga cgataccatg ttcaccactg ataatgagaa gattagcctc
841 ttcaatttca gaaagaatgc tgacccacag atggttagag aggcctacgc agcaggtctc
901 atcaagacga tctacccgag caataatctc caggagatca aataccttcc caagaaggtt
961 aaagatgcag tcaaaagatt caggactaac tgcatcaaga acacagagaa agatatattt
1021 ctcaagatca gaagtactat tccagtatgg acgattcaag gcttgcttca taaaccaagg
1081 caagtaatag aaattggagt ctctaagaaa gtagttccta ctgaatcaaa ggccatggag
1141 tcaaaaattc agatcgagga tctaacagaa ctcgccgtga agactggcga acagttcata
1201 cagagtcttt tacgactcaa tgacaagaag aaaatcttcg tcaacatggt ggagcacgac
1261 actctcgtct actccaagaa tatcaaagat acagtctcag aagaccaaag ggctattgag
1321 acttttcaac aaagggtaat atcgggaaac ctcctcggat tccattgccc agctatctgt
1381 cacttcatcg aaaggacagt agaaaaggaa ggtggcacct acaaatgcca tcattgcgat
1441 aaaggaaagg ctatcattca agatgcctct accgacagtg gtcccaaaga tggaccccca
1501 cccacgagga gcatcgtgga aaaagaagac gttccaacca cgtcttcaaa gcaagtggat
1561 tgatgtgaca tctccactga cgtaagggat gacgcacaat cccactaccc ttcgcaagac
1621 ccttcctcta tataaggaag ttcatttcat ttggagagga cacgctgaaa tcaccagtct
1681 ctctctacaa gactatctct ctctattttc tccagaataa tgtgtgagta gtttcccgat
1741 aagggaatta gggttcttat agggtttcgc tcatgtgttg agcatataag aaacccttag
1801 tatgtatttg tatttgtaaa atacttctat caataaaatt tctaattcct aaaaccaaaa
1861 tccagtacta aaatccagat ctcctaaagt ccctatagat cttt
//
LOCUS CREAPCYN 577 bp ss-mRNA PLN 03-JUL-1990
DEFINITION C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds.
ACCESSION J05524
KEYWORDS apoplastocyanin.
SOURCE C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone
PC6-2.
ORGANISM Chlamydomonas reinhardtii
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Volvocales; Chlamydomonadaceae.
REFERENCE 1 (bases 1 to 577)
AUTHORS Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and
Bogorad,L.
TITLE Isolation and characterization of a complementary DNA clone for an
algal pre-apoplastocyanin
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by K.Hill, 11-MAY-1990.
FEATURES from to/span description
pept 22 459 apoplastocyanin (PC6-2) precursor
sigp 22 162 apoplastocyanin signal peptide
matp 163 456 apoplastocyanin
BASE COUNT 91 a 189 c 171 g 126 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc
61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt
121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg
181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag
241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc
301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc
361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac
421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat
481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac
541 gcagagagcg tgtagctctg taactttttc ggaattc
//
LOCUS RATSVPIIA 4161 bp ds-DNA ROD 03-JUL-1990
DEFINITION Rat seminal vesicle secretion II protein (SVS II) gene, complete
cds.
ACCESSION J05443
KEYWORDS seminal vesicle secretion II protein.
SOURCE Rat (strain CHARLES RIVER) male seminal vesicle epithelial cell
DNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4161)
AUTHORS Harris,S.E., Harris,M.A., Johnson,C.M., Bean,M.F., Dodd,J.G.,
Matusik,R.J., Carr,S.A. and Crabb,J.W.
TITLE Structural characterization of the rat seminal vesicle secretion II
protein and gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.E.Harris, 25-APR-1990.
FEATURES from to/span description
pept 2065 2137 seminal vesicle secretion II protein (SVS II)
precursor, exon 1
2377 3548 seminal vesicle secretion II protein
precursor, exon 2
sigp 2065 2130 seminal vesicle secretion II protein signal
peptide
matp 2131 3545 seminal vesicle secretion II protein
signal 1934 1946 CAAT box
signal 2013 2020 TATA box
BASE COUNT 1243 a 902 c 822 g 1194 t
ORIGIN
1 tttcgatcca atgtgtggat tactcaccaa gtgtctgtct ttctttcttt ctctctttct
61 ctctttcttc ctcccttcct tccttccttc cttccttcct tccttccttc cttccttcct
121 ttctgttcaa ttgctcgttt ctcccttcat ctctcgccag tataccgcac actcaaactt
181 aaattttcat ttcaatgcgt tctcttctgg cacgtgcagc ataattacac tcatgattgt
241 caactccgtg atctgtttgc acaccttacc ccccccccca aggttttatc tgcatttaaa
301 aaaaagggat tatcaagaaa tttattctta attcagaaat gtgatcaaag ttgtcagatc
361 cgttctttac tgcctcctgt tggaaaaaaa aaatatccag ttcctggatt tttctaaaac
421 acagaaaaga gacctgggac aggggtatag gattgagcag gcatggtgag caattttata
481 ctgaatagat tcattgtgac ttaccggtct cctgagggaa ataatcactt ttcccaggta
541 gagagcagcc tagcaagaga tcagagtgca agcataaaac ccatgtgctt tataagtgta
601 tttattttat gcattttctg tttataagga catgagtgga ctttttattt gtcccttcca
661 tacaggacta cctagactat tgggatggga tgactgaaaa tatgttttca agtagacttc
721 cttccggaac taccttcata tggttctgaa ggcaaagtgg aacactgcac gggtgtcctc
781 ttctcccaag aacttggcca tggcgtcgtc gttttgagtc tatgtctgag ccacgaatgc
841 cataacagcc cttcctgtta ctctcacagt ggcacagagc tgtttctaaa caagaaggaa
901 gtcttccatc ttgtgtcagg atgctaatga cgtcaccaat ggcagtaagt gttcaccaca
961 gcccgttgct aaggcaatta tgttatccct cctgtcagag tttcctgtat taaaatatac
1021 tgagtttaat tttatgtcgg attccatgac atacattcag caaggaaacc aacagtatct
1081 tttgttcttt caacagtgat ttcctgtcac catttaactg ttgtctcgcc cccattcttt
1141 aaaatgtctc tgcacctcac cttgcctccc agatacactc ccaaactcat ttccctggac
1201 acacttgaaa tgttgctgct agcaagccac agctaccacg tcttctctgt cagggttcta
1261 gacaactcat ctaaagcagc accaggtctc tcttagaaat cagacatcgg atgtcatggt
1321 catagtatac ctcacagcta ctttggacat tcatgggccc agtattattt tccagggctg
1381 aggtttaact caagagcctc atgctcacat ggctggtggt ctggccacac agctatgact
1441 cgtctccatt tattcttcaa acttttattc ggagctccgt tgtgttctgt tgtctcctgt
1501 gcctttctat atgtgtgact gctcctttgc ctgtaaatga gaagctatgt caaattcaac
1561 gtaaaaaagg caacttcatg ggcttctgtg agatagcatg ctaaacagtg tcagctccac
1621 tacactgtga ccaggaaaat ttgatcaggc cctggttact ctcggagcat aaaagaagaa
1681 aaaaaaatct cttccccgct ctactctgga ttttgtttga aaataaaagg tccaatctgt
1741 ccttataaaa catgcataga ataaatatac tagaaaacac actttgtttg caaagggtat
1801 gtgataaagt cagagggttg ataaagattt gctgaggctt atgacataga aaaggtccct
1861 gacattgcat ccctgtgcaa agtacctggg aacattacca atgtccccaa ctgtgcagag
1921 gggaggaagt tgacatttag agataatttt tttaaaaaag caggcagtgc ttttgtagtg
1981 tcagttatat ctgtaataca tccagctaga gatatataaa tgtgaaagtc agctcagctc
2041 tcagtgaagg tccttcttga caagatgaag tcctctgtct tcattctatc tctgttcctc
2101 cttctggaaa gacaggcagc tgtggttgga cagtatggtg agtagggaga tggtgactag
2161 agggaaagtc actcagggag aatgttttta agggtgctct gggagtagca gatcctttca
2221 taggggaatt tttttttaaa tgagacctaa ttcttctcta ctgaaaacca aaacccttgt
2281 gggaacatca atggttttat gaggaaattt tggaaatgag acttggaagg actgtgcaga
2341 tcatgtaact taaaccttcc tcctctcaat taccaggtgg gacaaaaggt cacttccaga
2401 gcagctcatc agggtttatg cttggtcaga aaggccacct caattttggg ctcaaaggag
2461 gaagtgagga agcagctgaa gaaagcattt tcatgcaatc acaacaccag atgttcggcc
2521 aggatggtgg tgacatggcg cagacaagtg tttcacaaga gcatacaggt gtaaaggggg
2581 ccgcgatttg tcgtaaagga caagtatccc aattgaaatc ccaagaatcc caaataaaat
2641 cctttagaca agtaaaatcc agtggacagc tgaaatctgg aggatcccaa ttaaaatcct
2701 ttggacaagt gaaatccagt gagtcccaat taaaatcctt tggccaagtg aaagccagtg
2761 ggtcccaatt aaaatccttc ggacaagtga aagccagtgg gtcccaatta aaatcctatg
2821 gacaaatgaa atccagtggg tcccaagtga aatcctttgg acaaatgaaa tccagtgggt
2881 cccaagtaaa atcctttgga caaatgaaag ccagtgagtc ccaaataaaa tcctttggac
2941 aaagaaaatc ccaaggtggt caactacaat cctatggcca aatgaaatcc tatgggcaga
3001 cgaaatccct agaatcccag gccaaatcct tcggacaagt aaagtcccaa agtggccaaa
3061 tgaaatcctc ctatggtcag agaaaatcct atggtgaaga gactcaactg aagtctttcg
3121 accaagatgc ccaactaaaa tcctatggtc aacaaaaatc ccaaaaacaa tcctccttta
3181 gccaagtaaa atctcaaagt gcccaactaa agtcctttgg ccaacaaaaa tccctcaaag
3241 ggttttctca acaaactcaa cagaaaggat ttgccatgga tgaagatttg tcacaagtgc
3301 ggaaacaatt tgacgatgat gacctctctg tacaacagaa gtctacccaa cagatgaaaa
3361 cagaggaaga cttatcccaa tttggacaac aacgacaatt tggacaagaa cgctcccaat
3421 cctataaagg atatcttgca caatacagaa agaaattaca ggaacaacaa caacagaaaa
3481 attttaatca ggataacttt tttacaaagg gaggggcagg cctatatcag gctcaactta
3541 agggataaca tattcactga gcaactgaag accaagatca atgtcaaggt atgttccacc
3601 aagtaggaag atattatcca aatttacttg tggtatatag gaatcctgga tccattatgg
3661 attgataccc atttgttact atcagtagaa gtattgttac acacttttag aaggatgaag
3721 aacagaccct ggtaaaatga gtccttgtag agtaaaggca gagtaagcaa gctaagttaa
3781 caattggtcc tgaattacta cattcaggga gcacttttca gtgcttctct gagcacagac
3841 agtttatatt attaaatgtg taccacctat gcaatcatat ttaacatttc atgatggaat
3901 cttatttgtt cttacacttt gacttgataa aaaaaaaaga ttggtttctt gcttatattg
3961 gtataaggtg gtattgcagc tgagctcttt ctctacacca gtgcgtgttc ttgagtcccc
4021 tgggacctct gctttccatc acaatccatg gggttaagga ttagctgctt ttccatcaga
4081 tggaagattt ggttacaaag atctctgcct ggagcagaca ctatattcag ttgtatgtcc
4141 aatggtgacc ctgttgaatt c
//
LOCUS CODCPRRKA 94 bp ss-RNA RNA 03-JUL-1990
DEFINITION Codium fragile chloroplast 4.5S RNA, complete cds.
ACCESSION M35276 M15192
KEYWORDS 4.5S RNA.
SOURCE C.fragile chloroplast RNA.
ORGANISM Chloroplast Codium fragile
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Brypsidales; Codiaceae; Codium fragile.
REFERENCE 1 (bases 1 to 94)
AUTHORS Francis,M.A., Balint,R.F. and Dudock,B.S.
TITLE A novel variety of 4.5 S RNA from Codium fragile chloroplasts
JOURNAL J. Biol. Chem. 262, 1848-1854 (1987)
STANDARD simple staff_review
FEATURES from to/span description
RNA 1 94 4.5 S RNA
BASE COUNT 35 a 11 c 14 g 34 t
ORIGIN
1 aagtcctagt tgctataaat tcttaaatca aattatgtca gatttttaat aaaaagcagc
61 atttgtattt gaaaattgtt taggaactag gcac
//
LOCUS HAMAPBRBD 2339 bp ds-DNA ROD 03-JUL-1990
DEFINITION Hamster apolipoprotein (apoB) gene, partial cds
(LDL receptor-binding domain).
ACCESSION M35187
KEYWORDS apolipoprotein B.
SOURCE Hamster DNA.
ORGANISM Mesocricetus auratus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 2339)
AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N.
TITLE Sequence of the putative low-density lipoprotein receptor-binding
regions of apolipoprotein B in mouse and hamster
JOURNAL Gene 87, 309-310 (1990)
STANDARD simple staff_review
COMMENT Phone call to T.J.Smith on 26-JUN-1990 made sequence clarifications
on line 4 and line 17 of printed sequence. The hamster sequence
should be 1 bp to the left on these lines.
FEATURES from to/span description
pept < 1 > 2339 apolipoprotein (apoB) (LDL receptor-binding
domain) (AA at 1)
BASE COUNT 725 a 519 c 450 g 645 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccagc ttcctcgcct ctcacacaca attgagatac ctgcttttgg cagacttcat
61 ggaatcctga aaatccagtc tcccctcttt atattagatg caaatgccaa catacagaat
121 gtaactactt tagagaacaa agcagagatt gtggcctcca tcgctgctac aggagagtcc
181 gaaattgaag ctctcaattt tgattttcaa gcacaagctc aattcttgga gctaaaccct
241 aatcctctga tcctgaagga atccatgaac ttctccagca agcatgcgag aatggagcat
301 gagggtgaga tactattttc tggaaagttc attgagggaa aattggacac ggtcgcaagt
361 ttacagacag agaaaaatat ggtggagttt aataatggta tgattgtcaa gataaacaat
421 ccaatcatcc ttgacagtca cacaaagtat tttcacaagt tgagtatccc caggctggac
481 ttctccagta aggcttcctt taacaatgaa atcaagatgc tattagaagc tggacatgta
541 gcatggactt cttcagggac tgggtcatgg aattgggcct gtcccaactt ctcagatgag
601 ggcacacatt cgtccaaaat tagcttcact gtagaaggac ccattgcttt ttttggcttg
661 tctaataaca tcaacggcaa acacctgagg gttatccaga aattggctta tgaatctggc
721 ttcctcaact attccatgtt ggaagttgag tcaaaagttg aatctcagca tgtgggttcc
781 agcattctaa ctggcaaggg aacggtactg ctcagggagg caaaggcaga aatgactggc
841 gagcacaatg ctgacttgaa tggaaaagtt attgggactt tgaaaaactc tctttccttt
901 tcagcacaac catttatgat tactgcatcc acaaataatg atgggaattt gaaagttagt
961 tttccactaa agttgactgg gaaaatagac ttcctgaata actatgcact atttttgagt
1021 cctcatgccc agcaagcaag ctggcaagtg agtgctaggt tcaatcagta caaatataat
1081 caaaattttt ctgctataaa caatgaacat aacatagaag cccatgtagg aatgaatgga
1141 gatgccaacc tggatttctt aaccatacct ctaacaattc ctgaagtgaa actaccttac
1201 atagggctca cgactccctt gctgaaggat ttctccatat gggaagaaac aggcttgaaa
1261 gaatttttga agacaacaaa gcaatcgttt gatttaagtg taaaagctca atataaaaag
1321 aacagagaca ggcattccat tgcgattcct ctgaatgggt tttatgagtt tattctcaac
1381 aatgtcgact ccgggatagg gaagattggg aaagtcagag acagcgcatt agactatctt
1441 atttcatcct ataatgaagc aaaaaacaag tttgaaaatt cccttattca gccctccagg
1501 acctttcaaa agcgtggata cactatccca tttgtcaaca ttgaagtgac tccattcact
1561 gtagagacac tggcctccag ccatgtgatc ccaaaagcaa taaatacccc cagtgttcac
1621 attctgggcc ctaatgtcat tgtgccttca tacaggttag tgctgccctc cctggagctg
1681 ccagtccttc gtgtccccag gaatctactc aagttttccc tcccagattt caaggaattg
1741 agaacaattg acaatattta tattccagct cttggcaatt ttacctatga tttttccttt
1801 aaatcaagtg tcatcacgct gaataccaac gttggacttt ataaccggtc agacatcgtt
1861 gctcatttcc tttcttcctc ttcatttgtc acggatgccc tgcagtacaa attagagggt
1921 acttcacgtc tgactcggaa aagaggattg aagctagcca cagccgactc tctcactaac
1981 aaatttgtaa agggcaatca tgatagcacc tttagcttaa ccaagaaaaa catggaagca
2041 tcagtgaaaa caactgcaaa cctccatgct cccattttaa caatgaactt caagcaggaa
2101 cttaatggaa atgccaagtc aaagcccatt gtctcatcat ccattgaact aaactatgac
2161 ttcaattcct caaagctgta ctctactgct aaaggaggtg ttgaccacaa gtttagctta
2221 gaaagtctca cttcctactt ttccattgag tcatccacca aaggaaatat caagggatct
2281 gtcctttccc aggaatattc aggaagtgtt gccagtgagg ccaacacata cctgaattc
//
LOCUS MUSAPBRBD 2354 bp ds-DNA ROD 03-JUL-1990
DEFINITION Mouse apolipoprotein (apoB) gene, partial cds (LDL receptor-binding
domain).
ACCESSION M35186
KEYWORDS apolipoprotein B.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2354)
AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N.
TITLE Sequence of the putative low-density lipoprotein receptor-binding
regions of apolipoprotein B in mouse and hamster
JOURNAL Gene 87, 309-310 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 2354 apolipoprotein (apoB) (LDL receptor-binding
domain) (AA at 1)
BASE COUNT 757 a 525 c 423 g 649 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccaac ttcctcacct ctcacataca attgaaatac ctgcttttgg caaactgcat
61 agcatcctta agatccaatc tcctctcttt atattagatg ctaatgccaa catacagaat
121 gtaacaactt cagggaacaa agcagagatt gtggcttctg tcactgctaa aggagagtcc
181 caatttgaag ctctcaattt tgattttcaa gcacaagctc aattcctgga gttaaatcct
241 catcctccag tcctgaagga atccatgaac ttctccagta agcatgtgag aatggagcat
301 gagggtgaga tagtatttga tggaaaggcc attgagggga aatcagacac agtcgcaagt
361 ttacacacag agaaaaatga agtagagttt aataatggta tgactgtcaa agtaaacaat
421 cagctcaccc ttgacagtca cacaaagtac ttccacaagt tgagtgttcc taggctggac
481 ttctccagta aggcttctct taataatgaa atcaagacac tattagaagc tggacatgtg
541 gcattgacat cttcagggac agggtcatgg aactgggcct gtcccaactt ctcggatgaa
601 ggcatacatt cgtcccaaat tagctttact gtggatggtc ccattgcttt tgttggacta
661 tccaataaca taaatggcaa acacttacgg gtcatccaaa aactgactta tgaatctggc
721 ttcctcaact attctaagtt tgaagttgag tcaaaagttg aatctcagca cgtgggctcc
781 agcattctaa cagccaatgg tcgggcactg ctcaaggacg caaaggcaga aatgactggt
841 gagcacaatg ccaacttaaa tggaaaagtt attggaactt tgaaaaattc tctcttcttt
901 tcagcacaac catttgagat tactgcatcc acaaataatg aaggaaattt gaaagtgggt
961 tttccactaa agctgactgg gaaaatagac ttcctgaata actatgcatt gtttctgagt
1021 ccccgtgccc aacaagcaag ctggcaagcg agtaccagat tcaatcagta caaatacaat
1081 caaaactttt ctgctataaa caatgaacac aacatagaag ccagtatagg aatgaatgga
1141 gatgccaacc tggatttctt aaacatacct ttaacaattc ctgaaattaa cttgccttac
1201 acggagttca aaactccctt actgaaggat ttctccatat gggaagaaac aggcttgaaa
1261 gaatttttga agacaacaaa gcaatcattt gatttgagtg taaaggctca atataaaaag
1321 aacagtgaca agcattccat tgttgtccct ctgggtatgt tttatgaatt tattctcaac
1381 aatgtcaatt cgtgggacag aaaatttgag aaagtcagaa acaatgcttt acattttctt
1441 accacctcct ataatgaagc aaaaattaag gttgataagt acaaaactga aaattccctt
1501 aatcagccct ctgggacctt tcaaaatcat ggctacacta tcccagttgt caacattgaa
1561 gtatctccat ttgctgtaga gacactggct tccaggcatg tgatccccac agcaataagc
1621 accccaagtg tcacaatccc tggtcctaac atcatggtgc cttcatacaa gttagtgctg
1681 ccacccctgg agttgccagt tttccatggt cctgggaatc tattcaagtt tttcctccca
1741 gatttcaagg gattcaacac tattgacaat atttatattc cagccatggg caactttacc
1801 tatgactttt cttttaaatc aagtgtcatc acactgaata ccaatgctgg actttataac
1861 caatcagata tcgttgccca tttcctttct tcctcttcat ttgtcactga cgccctgcag
1921 tacaaattag agggaacatc acgtctgatg cgaaaaaggg gattgaaact agccacagct
1981 gtctctctaa ctaacaaatt tgtaaagggc agtcatgaca gcaccattag tttaaccaag
2041 aaaaacatgg aagcatcagt gagaacaact gccaacctcc atgctcccat attctcaatg
2101 aacttcaagc aggaacttaa tggaaatacc aagtcaaaac ccactgtttc atcatccatt
2161 gaactaaact atgacttcaa ttcctcaaag ctgcactcta ctgcaacagg aggcattgat
2221 cacaagttca gcttagaaag tctcacttcc tacttttcca ttgagtcatt caccaaagga
2281 aatatcaaga gttccttcct ttctcaggaa tattcaggaa gtgttgccaa tgaagccaat
2341 gtatatctga attc
//
LOCUS RATBPTT 1035 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Rat beta-tachykinin mRNA, complete cds.
ACCESSION M35277 M15191
KEYWORDS neurokinin A; substance P; tachykinin.
SOURCE Rat (Sprague-Dawley) rostral portion of the caudate putamen, cDNA
to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1035)
AUTHORS Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D.
TITLE Three rat preprotachykinin mRNAs encode the neuropeptides substance
P and neurokinin A
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 100 492 beta-preprotachykinin
sigp 100 156 beta-tachykinin signal peptide (3' end could
be 171)
matp 172 204 substance P
matp 294 321 neurokinin
mRNA < 1 1035 beta-preprotachykinin mRNA
BASE COUNT 289 a 212 c 245 g 289 t
ORIGIN
1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc
61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg
121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat
181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gcccgagccc
241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta
301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt
361 tatgggcatg gtcagatctc tcacaaaagg cataaaacag attcctttgt tggactaatg
421 ggcaaaagag ctttaaattc tgtggcttat gaaagaagcg caatgcagaa ctacgaaaga
481 aggcgtaaat aaaccctgta acgcactatc tattcatctc catctgtgtc cgcgagcagt
541 gagcggtaaa ataaaaatgt gcgctatgag gaatgattat ttatttaata tcaaatgttg
601 ttatgagtga aaaactcaaa aaagtgttta ttttttcata ttgtgccaat aagcattgta
661 attctaatgt ggtgacctcc tcagacagaa gtagaaatta gttgtaactt cagcaaagca
721 cagtgttgat ggagttgtac aagtttgcca gcgatgcaag tctccaaaga cagaaaggct
781 gctgtgaggc agtgcaggcg gctgctgctg gaggcagaga aactcctgtg tgtcttgcgc
841 ttcccttggt tgcttttatc ctaatgatgt actgagagtt tggtatctga ctctatttgt
901 atcctagcag catgtttcct gtgttgtgac tatatagaga tgtttttaaa agtttcaatg
961 tacttctctg gtcttcagtc attgtatgat gtgttgtgat agctaccatt ttaaataaaa
1021 gaatgtatct tcagg
//
LOCUS CHPRGIT 1051 bp ds-DNA PRI 03-JUL-1990
DEFINITION Chimpanzee rRNA gene internal transcribed spacer 1 (ITS1).
ACCESSION M30947
KEYWORDS internal transcribed spacer.
SOURCE Chimpanzee DNA.
ORGANISM Pan troglodytes
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE 1 (bases 1 to 1051)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
TITLE Ribosomal RNA gene sequences and hominoid phylogeny
JOURNAL Mol. Biol. Evol. 7, 203-219 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.L.Gonzalez, 20-DEC-1989.
BASE COUNT 69 a 411 c 425 g 146 t
ORIGIN Acrocentric chromosomes 14, 15, 17, 22, 23.
1 acggagccga aggggggcgc gaggccgcgg cggcgccgcc gcgcgcttcc ctccccccca
61 ccccgccgca acgcggcgcg tgcgcgggcg gggcccgtgt gccgttcgtt cgttcgttcg
121 ctgcccggcc ccgccgccgc gagagccgag aactcgggag ggcgacgttg gggggagagc
181 gagagagaca gaaagaaggg ggcgcgtgtt cgctgcgcgt gtcgtggggc cggcggggag
241 cggtccccgg cctcgggccc gacggacgtg tgtgtcggcg ggcgcggggg cggttctcgg
301 cggcgtcacg gcgggtttgg gggggggggt ctcggtgccc tcctccccgc cggggcccgt
361 cgtccggccc cgccgcgcgc cggctccccg tcgtcggggc cgggccggat tcccgtcgcc
421 gcctccgccg cgcgccgctc cgcgccaccg ggcacggccc cgctcgctct ccccggcctt
481 cccgctaggg cgtctcgagg gtcgggggcc ggacgccggt ccccccctcc tcgtccgccc
541 ccgccgtcca ggtacctagc gcgttccggc gcggaggttt aaagacccct tgggggatcg
601 cccgtccgcc cgcgggtcgg gggcggtggt gggcccgcgg gggagtcccg tcgggagggg
661 cccggcccct cccgcgcctc ccccgcggac tccgcccccg gccggggccg cgccgcctcg
721 ccggctcggg tcgcggcggc cgtcgggtgg gggctttacc cggcggccgt cgcgtgcgcg
781 cgtgccgcgc gtgtggcgtg cgccccgcgc cgtgggggcg ggaacccccc gggcgcctgt
841 ggggtggtgt ccgcgctcgc ccctgcgtgg gcggcgcgcg cctccccgtg gtgtgaaacc
901 ttccgacccc tctccggagt ccggtcccgt ttttgctgtc tctctggccg gcctgaggca
961 accccctctc ctctgggggg gggggacgtg ccgcgccagg agggcctccc ggtgtgtttg
1021 tcgggagcgc cctcgccaaa tcgacctcgt a
//
LOCUS CHPRGITX 2512 bp ds-DNA PRI 03-JUL-1990
DEFINITION Chimpanzee 28S ribosomal RNA gene fragment.
ACCESSION M30950
KEYWORDS 28S ribosomal RNA.
SOURCE Chimpanzee DNA.
ORGANISM Pan troglodytes
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE 1 (bases 193 to 261; 375 to 945; 1042 to 1079; 1334 to 1357;
1742 to 1958; 2190 to 2204)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
TITLE Ribosomal RNA gene sequences and hominoid phylogeny
JOURNAL Mol. Biol. Evol. 7, 203-219 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2512)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by I.L.Gonzalez, 20-DEC-1989.
FEATURES from to/span description
rRNA < 1 > 2512 28S ribosomal RNA
BASE COUNT 397 a 837 c 924 g 354 t
ORIGIN Chromosomes 14, 15, 17, 22, 23.
1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg
61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg
121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga
181 cccctccacc cgccctccct tccccccgcc gcccctcctc ctcctccccg gagggggcgg
241 gctccggcgg gtgcgggggt gggcgggcgg ggccgggggt ggggtcggcg ggggaccgtc
301 ccccgaccgg cgaccggccg ccgccgggcg catttccacc gcggcggtgc gccgcgaccg
361 gctccgggac ggctgggaag gcccggcggg gaaggtggct cggggggccc cgtccgtcct
421 cctcctcccc ccccgtctcc gccccccggc cccgcgtcct cccccgggag ggcgcgcggg
481 tcggggcggt ggcggcggcg gcggcggtgg cggcggtggc ggcgggaccg aaaccccccc
541 cgagtgttac agccccccgg cagcagcact cgccgaatcc cggggccgag ggagcgagac
601 ccgtcgccgc gctctccccc ctcccggcgc ccacccccgc ggggatatcc tccgcgaggg
661 gggtctcccc cgcgggggcg cgccggcgtc tcctcgtggg ggggccgggc cacccctccc
721 acggcgcgac cgctctccca cccctcctcc ccgcaacccc cctctcccgg cgacggggag
781 ggccgcgcgc gggtcggggg gcggggcgga ctgtccccag tgcgccccgg gcgggtcgcg
841 ccgtcgggcc cgggggaggt tctctcgggg ccacgcgcgc gtcccccgaa gagggggacg
901 gcggagccga gcgcacgggg tcggcggcga tgtcggccac ccacccgacc cgtcttgaaa
961 cacggaccaa ggagtctaac acgtgcgcga gtcgggggct cgcacgaaag ccgccgtggc
1021 gcaatgaagg tgaaggccgg cgcgctcgcc ggccgaggtg ggatccgagg cctctccagt
1081 ccgccgaggg cgcaccaccg gcccgtctcg cccgccgcgc cggggaggtg gagcacgagc
1141 gcacgtgtta ggacccgaaa gatggtgaac tatgcctggg cagggcaagc cagaggaaac
1201 tctggtggag gtccgtagcg gtcctgacgt gcaaatcggt cgtccgacct gggtataggg
1261 gcgaaagact aatcgaacca tctagtagct ggttccctcc gaagtttccc tcaggatagc
1321 tggcgctctc gcagacccga cgcacacccc cccacgcagt tttatccggt aaagcgaatg
1381 attagaggtc ttggggccga aacgatctca acctattctc aaactttaaa tgggtaagaa
1441 gcccggctcg ctggcgtgga gccggggtgg aatgcgagtg cctagtgggc cacttttggt
1501 aagcagaact ggcgctgcgg gatgaaccga acgccgggtt aaggcgcccg atgccgacgc
1561 tcatcagacc ccagaaaagg tgttggttga tatagacagc aggacggtgg ccatggaagt
1621 cggaatccgc taaggagtgt gtaacaactc acctgccgaa tcaactagcc ctgaaaatgg
1681 atggcgctgg agcgtcgggc ccatacccgg ccgtcgccgg cagtcgagag tggacgggag
1741 cggcgggggc ggcgcgggcg tgtgcgcgcg cgcgtgtgtg cgtgtgtgtc ggagggcggc
1801 ggcggtggcg gcgggggtgg ggtcctcccc ctcccccacg ccgcctcccc tcctcccacc
1861 caccaccgcc gccgccaccc ccgctccccg cccccggagc cccgcggacg ctacgccgcg
1921 acgagtagga gggccgctgc ggtgagcctt gaagcctagg gcgcgggccc gggtggagcc
1981 gccgcaggtg cagatcttgg tggtagtagc aaatattcaa acgagaactt tgaaggccga
2041 agtggagaag ggttccatgt gaacagcagt tgaacatggg tcagtcggtc ctgagagatg
2101 ggcgagcgcc gttccgaagg gacgggcgat ggcctccgtt gccctcggcc gatcgaaagg
2161 gagtcgggtt cagatccccg aatccggagt ggcggagatg ggcgccgcga ggcgtccagt
2221 gcggtaacgc gaccgatccc ggagaagccg gcgggagccc cggggagagt tctcttttct
2281 ttgtgaaggg cagggcgccc tggaatgggt tcgccccgag agaggggccc gtgccttgga
2341 aagcgtcgcg gttccggcgg cgtccggtga gctctcgctg gcccttgaaa atccggggga
2401 gagggtgtaa atctcgcgcc gggccgtacc catatccgca gcaggtctcc aaggtgaaca
2461 gcctctggca tgttggaaca atgtaggtaa gggaagtcgg caagccggat cc
//
LOCUS GORRGIT 987 bp ds-DNA PRI 03-JUL-1990
DEFINITION Gorilla rRNA gene internal transcribed spacer 1 (ITS1).
ACCESSION M30948
KEYWORDS internal transcribed spacer.
SOURCE Gorilla DNA.
ORGANISM Gorilla gorilla
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE 1 (bases 1 to 987)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
TITLE Ribosomal RNA gene sequences and hominoid phylogeny
JOURNAL Mol. Biol. Evol. 7, 203-219 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.L.Gonzalez, 20-DEC-1989.
BASE COUNT 65 a 398 c 390 g 134 t
ORIGIN Chromosomes 22 and 23.
1 acggagcgaa gggcgaggcc gcggcggtgg cgccgccgcg tgcttccctc ccccccaccg
61 acgcggcgcg tgcgcgggcg gggcccgtgc cgttcgttcg ttcgttcgtt cgctgcccgg
121 ccccgccgcc gcgagagccg aggactcggg agggagacgg ggggggagaa gagaaaggag
181 gcctgtccgt gtgtgcgtgt cgtggggccg gccgcgctgg tgagcggcgg cgaggcctcc
241 ccggccgcgg cccgacgacg tgtgtgtcgg cgggtgcggg ggcggttctc ggcggcgtca
301 cggcgggttt ggggcctcgg tgccctcctc cccgccgggg cccgtcgtcc ggccccgccg
361 ccggcccccc cgtcgtcggg gccggccggg ttcccgtcgc cgccgccgcc gccgccgtcg
421 tcgcctccgc cgcgccaccg ggaccggccc cgctcgctct ccccggcctt cccgctaggg
481 cgtctcgagg gtcgggggcc ggacgccggt ccccccctcc tcgtccgccc ctccccgccg
541 ttccaggtac ctagcgcgtt ccggcgcgga ggtttaaaga ccccttgggg gatcgcccgt
601 ccgccccgtg ggtcgggggc ggtgggcccg cgggggggtc ccgtcgggag gggcccggcc
661 cctcccgcgc ctccaccgcg gactccgccc cccggccggg gccgcggcgg ccgtcgggtg
721 ggggctttac ccggcggccg tgcgcccccg cgccgtgggg gcgggaaccc ccgggcgcct
781 gtggggcgtg tcagcgctcg cccccgcgtg ggcgccgcgc ctccccgtgg tgtgaaacct
841 tccgacccct ctccggagtc cggtcccgtt tgctgtccgt ctggccggcc tgaggcaacc
901 ccccctcctc cgtggggggg gggggacgtg ccgcgccagg agggccctcc cggtgtcggg
961 agcgccctcg ccaaatcgac ctcgtta
//
LOCUS GORRGITX 2467 bp ds-DNA PRI 03-JUL-1990
DEFINITION Gorilla 28S ribosomal RNA gene fragment.
ACCESSION M30951
KEYWORDS 28S ribosomal RNA.
SOURCE Gorilla DNA.
ORGANISM Gorilla gorilla
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE 1 (bases 193 to 261; 375 to 944; 1041 to 1079; 1334 to 1354;
1737 to 1913; 2145 to 2159)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
TITLE Ribosomal RNA gene sequences and hominoid phylogeny
JOURNAL Mol. Biol. Evol. 7, 203-219 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2467)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by I.L.Gonzalez, 20-DEC-1989.
FEATURES from to/span description
rRNA < 1 > 2467 28S ribosomal RNA
BASE COUNT 386 a 819 c 912 g 350 t
ORIGIN Chromosomes 22 and 23.
1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg
61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg
121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga
181 cccctccacc cgccctccct tcccccgccg cccctcctcc tcctccccgg agggggcggg
241 ctccggcggg tgcggggggt gggcgggcgg ggccgggggt ggggtcggcg ggggaccgtc
301 ccccgaccgg cgaccggccg ccgccgggcg catttccacc gcggcggtgc gccgcgaccg
361 gctccgggac ggctgggaag gcccggcggg gaaggtggct cggggggccc cgtccgtccg
421 tccgtccgtc ctcctccccc gtctccgccc cccggccccg cgtcctccct cgggaagggg
481 cgcgcgggtc ggggcggcgg cggcggcggt ggcggcggcg gcggcggcgg cgggaccgaa
541 acccccccga gtgttacagc cccccggcca gccagccatc gccgaatccc ggggccgagg
601 gagcgagacc cgtcgccgcg ctctcccccc tcccggcgcc cacccccgcg ggggtccccc
661 gcgagggggt cccccgcggg ggcgcgccgg cgtctcctcg tgggggggcc gggccacccc
721 tcccacggcg cgaccgctct cccacccctc gcttccccgc acactccccc ggcgacgggg
781 tgccgcgcgc gggtcggggg gcggggcgga ctgtccccag tgcgccccgg gcgggtcgcg
841 ccgtcgggcc cgggggaggt tctcccgggg ccacgcgcgc gtcccccgaa gagggggacg
901 gcggagcgag cgcacggggt cggcggcgat gtcggctacc cacccgaccc gtcttgaaac
961 acggaccaag gagtctaaca cgtgcgcgag tcgggggctc gcacgaaagc cgccgtggcg
1021 caatgaaggt gaaggccggc gcgctcgccg gccgaggtgg gatcccgagg cctctccggt
1081 ccgccgaggg cgcaccaccg gcccgtctcg cccgccgcgc cggggaggtg gagcacgagc
1141 gcacgtgtta ggacccgaaa gatggtgaac tatgcctggg cagggcaagc cagaggaaac
1201 tctggtggag gtccgtagcg gtcctgacgt gcaaatcggt cgtccgacct gggtataggg
1261 gcgaaagact aatcgaacca tctagtagct ggttccctcc gaagtttccc tcaggatagc
1321 tggcgctctc gcagacccct cctccccccc acgcagtttt atccggtaaa gcgaatgatt
1381 agaggtcttg gggccgaaac gatctcaacc tattctcaaa ctttaaatgg gtaagaagcc
1441 cggctcgctg gcgtggagcc gggtggaatg cgagtgcctg tgggccactt ttggtaagca
1501 gaactggcgc tgcgggatga accgaacgcc gggttaaggc gcccgatgcc gacgctcatc
1561 agaccccaga aaaggtgttg gttgatatag acagcaggac ggtggccatg gaagtcggaa
1621 tccgctaagg agtgtgtaac aactcacctg ccgaatcaac tagccctgaa aatggatggc
1681 gctggagcgt cgggcccata cccggccgtc gccggcagtc gagagtggac gggagcggcg
1741 ggggcggcgc gcgcgcgcgc gtgtggggtc ggagggcggc gtgtgggcgg tggggtcctc
1801 gcccccctcc cccgcgcctc ccctcctccc acccccgctc cccgcccccg ggagccccgc
1861 ggacgctacg ccgcgacgag taggagggcc gctgcggtga gccttgaagc ctagggcgcg
1921 ggcccgggtg gagccgccgc aggtgcagat cttggtggta gtagcaaata ttcaaacgag
1981 aactttgaag gccgaagtgg agaagggttc catgtgaaca gcagttgaac atgggtcagt
2041 cggtcctgag agatgggcga gcgccgttcc gaagggacgg gcgatggcct ccgttgccct
2101 cggccgatcg aaagggagtc gggttcagat ccccgaatcc ggagtggcgg agatgggcgc
2161 cgcgaggcgt ccagtgcggt aacgcgaccg atcccggaga agccggcggg agccccgggg
2221 agagttctct tttctttgtg aagggcaggg cgccctggaa tgggttcgcc ccgagagagg
2281 ggcccgtgcc ttggaaagcg tcgcggttcc ggcggcgtcc ggtgagctct cgctggccct
2341 tgaaaatccg ggggagaggg tgtaaatctc gcgccgggcc gtacccatat ccgcagcagg
2401 tctccaaggt gaacagcctc tggcatgttg gaacaatgta ggtaagggaa gtcggcaagc
2461 cggatcc
//
LOCUS ORARGIT 1070 bp ds-DNA PRI 03-JUL-1990
DEFINITION Orangutan rRNA gene internal transcribed spacer 1 (ITS1).
ACCESSION M30949
KEYWORDS internal transcribed spacer.
SOURCE Orangutan DNA.
ORGANISM Pongo pygmaeus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE 1 (bases 1 to 1070)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
TITLE Ribosomal RNA gene sequences and hominoid phylogeny
JOURNAL Mol. Biol. Evol. 7, 203-219 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.L.Gonzalez, 20-DEC-1989.
BASE COUNT 60 a 438 c 438 g 134 t
ORIGIN Chromosomes 11, 12, 13, 14, 15, 16, 17, 22, 23.
1 acggagcgaa gagcgaggcc cgcggcggcg ccgccgcggc gtccttcctc gtcggccggc
61 cggccgcgtt tctcccccgc ttcccgcggc gcgtgcgcgg gcggggcccg tgccgttcgc
121 gcgcacgcgc gggcgtgcgt gcgtgcgtcg cccggccccg ccggccgcga gagccggaga
181 acctcgggag ggagagagag gggggagaga gagagcggtg tgtgtgtgcg cgcgcgcgtg
241 tctcgggggc ggccggcgcg gcggggagcg gtccccggcc gcggccccga cgtgtgtgtc
301 ggcgggcgcg ggtgcggtcc tcggcggcgt cgcggcgggg tggggggtgt ctcggtgccc
361 ctccccgccg gggcccgtcg tcccgtcccc gacccgccgg ctccgcgtcg ggggccggcc
421 gggttcccgc cgcccccgtc gcctccgcca cgccgcgcca ccgggccggg ccggcccggc
481 ccgccccgct cgctctcccc ggccttcccg ctagggcgtc tcgagggtcg ggggccggac
541 gccggtcccc gcgcctcctc gtccgccccc ccctcccccc gccgtccagg tacctagcgc
601 gttccggcgc ggaggtttaa agaccccttg ggggatcgcc cgtccgcccg tgggtcgggg
661 gcggtgggcc cgcgtgggga gtcccgtcgg gaggggcccg gcccctcccg cgcctccacc
721 gcggactccg cccccccggc cggggcgctg ccgccgccgc cgcggtcgcg gcggccgtcg
781 ggtgggggct ttacccggcg gccgtcgtgc cgtccgtcgc gcgcgtgccc cgcgccgtgg
841 gggcgggaac cccccgggcg cctgtggggt ggtgtccgcg ctcgcccccg cgtgggcggc
901 gcgcgcctcc ccgtggtgtg cgacaccttc cgacccctct ccggagtccg gtcccgtttg
961 ccgtctgact ggccggcctg aggcgacccc cccctgcggg ggggaagtgc cgcgccaggg
1021 gcgagggcct cccggtgtgt cgggggcgcc ctcgcccgat cgagctcgta
//
LOCUS ORARGITX 2487 bp ds-DNA PRI 03-JUL-1990
DEFINITION Orangutan 28S ribosomal RNA gene fragment.
ACCESSION M30952
KEYWORDS 28S ribosomal RNA.
SOURCE Orangutan DNA.
ORGANISM Pongo pygmaeus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE 1 (bases 193 to 263; 377 to 985; 1084 to 1120; 1376 to 1394;
1780 to 1933; 2165 to 2179)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
TITLE Ribosomal RNA gene sequences and hominoid phylogeny
JOURNAL Mol. Biol. Evol. 7, 203-219 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2487)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by I.L.Gonzalez, 20-DEC-1989.
FEATURES from to/span description
rRNA < 1 > 2487 28S ribosomal RNA
BASE COUNT 393 a 814 c 921 g 359 t
ORIGIN Chromosomes 11, 12, 13, 14, 15, 16, 17, 22, 23.
1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg
61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg
121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga
181 cccctccacc cgccctccct cccccgccgc ccctcctcct cctccccgcg gggagggggc
241 gggctccggc gggtgcgggg gtgggcgggc ggggccgggg gtggggtcgg cgggggaccg
301 tcccccgacc ggcgaccggc cgccgccggg cgcatttcca ccgcggcggt gcgccgcgac
361 cggctccggg acggctggga aggcccggtg gggaaggtgg ctcggggggc cccgtccgtc
421 cgtccgtccg tcctcctccc tcctcccccc tcgtcttccc cccggccccg cgtcctccct
481 cgggagggcg cgcgggtcgg gggcggcggc gggggtggct gctgctgctg ctgcggcggc
541 ggcgggaccg aaccccccga gtgttacagc cccggcagca gcgctcgccg aacccggggc
601 cgagggagcg agacccgtcg ccgcgctctc ccccctcccg gcgcccaccc ccgcgggggt
661 cccccgcgag ggggtccccc ccgcgggggc gcgccggcgt ctcctcgcgt ggggggccgg
721 gccgcccctc ccacggcgcg accgctctcc cacccccccc ttccccgcgc acccccggcg
781 acgggggccc gcgcgggcgg ggggggcggg gcggactgtc cccagtgcgc cccgggcggg
841 tcgcgccgtc gggcccgggg aagagagagg gagaggaggg ggttctcctc ctcctcctcc
901 cctctcgggg ccacgcgcgc gtccctcgaa gagggggacg gcggagccga gcgcacgggg
961 tcggcggcga tgtcggccac ccacccgacc cgtcttgaaa cacggaccaa ggagtctaac
1021 acgtgcgcga gtcgggggct cgcacgaaag ccgccgtggc gcaatgaagg tgaaggccgg
1081 cgcgctcgcc ggccgaggtg ggatcccgag gcctctccag tccgccgagg gcgcaccacc
1141 ggcccgtctc gcccgccgcg ccggggaggt ggagcacgag cgcacgtgtt aggacccgaa
1201 agatggtgaa ctatgcctgg gcagggcgaa gccagaggaa actctggtgg aggtccgtag
1261 cggtcctgac gtgcaaatcg gtcgtccgac ctgggtatag gggcgaaaga ctaatcgaac
1321 catctagtag ctggttccct ccgaagtttc cctcaggata gctggcgctc tcgcagactc
1381 gaccgaccga ccgcagtttt atccggtaaa gcgaatgatt agaggtcttg gggccgaaac
1441 gatctcaacc tattctcaaa ctttaaatgg gtaagaagcc cggctcgctg gcgtggagcc
1501 gggcgtggaa tgcgagtgcc tagtgggcca cttttggtaa gcagaactgg cgctgcggga
1561 tgaaccgaac gccgggttaa ggcgcccgat gccgacgctc atcagacccc agaaaaggtg
1621 ttggttgata tagacagcag gacggtggcc atggaagtcg gaatccgcta aggagtgtgt
1681 aacaactcac ctgccgaatc aactagccct gaaaatggat ggcgctggag cgtcgggccc
1741 atacccggcc gtcgccggca gtcgagagtg gacgggagcg gcgggggcgg ggtgcgtgcg
1801 ggtgtggggg tgtgtgtggg ggggggtcct ccccccccgc cactcctcct cctcccaccc
1861 ctcccccgga gcagccccgc ggacgctacg ccgcgacgag taggagggcc gctgcggtga
1921 gccttgaagc ccagggcgcg ggcccgggtg gagccgccgc aggtgcagat cttggtggta
1981 gtagcaaata ttcaaacgag aactttgaag gccgaagtgg agaagggttc catgtgaaca
2041 gcagttgaac atgggtcagt cggtcctgag agatgggcga gcgccgttcc gaagggacgg
2101 gcgatggcct ccgttgccct cggccgatcg aaagggagtc gggttcagat ccccgaatcc
2161 ggagtggcgg agacgggcgc cgcgaggcgt ccagtgcggt aacgcgaccg atcccggaga
2221 agccggcggg agccccgggg agagttctct tttctttgtg aagggcaggg cgccctggaa
2281 tgggttcgcc ccgagagagg ggcccgtgcc ttggaaagcg tcgcggttcc ggcggcgtcc
2341 ggtgagctct cgctggccct tgaaaatccg ggggagaggg tgtaaatctc gcgccgggcc
2401 gtacccatat ccgcagcagg tctccaaggt gaacagcctc tggcatgttg gaacaatgta
2461 ggtaagggaa gtcggcaagc cggatcc
//
LOCUS ACCRRSAA 1536 bp ss-rRNA RNA 03-JUL-1990
DEFINITION A.calcoaceticus 16S ribosomal RNA.
ACCESSION M34139
KEYWORDS 16S ribosomal RNA.
SOURCE A.calcoaceticus (strain 33604) ribosomal RNA.
ORGANISM Acinetobacter calcoaceticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1536)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept. Microbiology
131 Burrill Hall
407 S. Goodwin Ave.
Urbana, IL 61801
(217) 333-9369
FEATURES from to/span description
rRNA 1 1536 16S ribosomal RNA
BASE COUNT 387 a 329 c 460 g 330 t 30 others
ORIGIN
1 ttaactgaag agtttgatca tggctcagat tgaacgctgg cggcaggctt aacacatgca
61 agtcgagcgg ggaaggttgc ttcggtaact gactagcggc ggacgggtga gtaatgctta
121 ggaatctgcc atttagtggg ggacaacatt ccgaanggaa tgctaatacc gcatacgtcc
181 tacaggagaa agcaggggat ctccggacct tgcgctaaat gatgagccta agtcggatta
241 gctagttggt ggggtaaagg cctaccaagg cgacgatctg tagcgggtct gagaggatga
301 tccgccacac tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata
361 ttggacaatg ggcgcaagcc ngatccagcc atgccgcgtg tgtgaagaag gccttttggt
421 tgtaaagcac tttaagcgag gaggaggctc tcttagttaa tacctaagat gagtggacgt
481 tactcgcaga ataagcaccg gctaactctg tgccagcagc cgcggtaata cagagngtgc
541 gagcgttaat cggatttact gggcgtaaag cgtgcgtagg cggcttttta agtcggatgt
601 gaaatccccg agcttaactt gggaattgca ttcgatactg ggaagctaga gtatgggaga
661 ggatggtaga attccaggtg tagcggtgaa atgcgtagag atctggagga ataccgatgg
721 cgaaggcagc catctggcct aatactgacg ctgaggtacg naagcatggg gagcaaacag
781 gattagatac cctggtagtc catgccgtaa acgatgtcta ctagccgttg gggcctttga
841 ggctttagtg gcgcagctaa cgcgataagt agactgcctg gggagtacgg tcgcaagact
901 aaaactcaaa tgaattgacg ggggcncgca caagcggtgg agcatgtggt ttaattcgat
961 gcaacgcgaa gaaccttacc tggccttgac atactagaaa ctttccagag atggattggt
1021 gccttcggga atctagatac aggtgctgca tggctgtcgt cagctcgtgt cgtgagatgt
1081 tgggttaagt cccgcaacga gcgcaaccct tttccttact tgccagcatt tcggatggga
1141 actttaagga tactgccagt gacaaactgg aggaaggcgg ggacgacgtc aagtcatcat
1201 ggcccttacg gctagggcta cacacgtgct acaatggtcg gtacaaaggg ttgctaccta
1261 gcgataggat gctaatctca aaaagccgat cgtagttcgg attggagtct gcaactcgac
1321 tccatgaagt cggaatcgct agtaatcgcg gatcagaatg ccgcggtgaa tacgttcccg
1381 ggccttgtac acaccgcccg tcacaccatg ggagtttgtt gcaccagaag tagctagcct
1441 aactgcaaag agggcggtta ccacggtgtg gccgatgact agggnnnnnn ngtaacaagn
1501 nnnnnnnnnn ngaacctgnn nnnngatcac ctcctt
//
LOCUS BDERRSAA 1553 bp ss-rRNA RNA 03-JUL-1990
DEFINITION B.stolpii 16S ribosomal RNA.
ACCESSION M34125
KEYWORDS 16S ribosomal RNA.
SOURCE B.stolpii (strain uki-2) ribosomal RNA.
ORGANISM Bdellovibrio stolpii
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 1 to 1553)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept Microbiology, 131
131 Burrill Hall
407 S. Goodwin Ave.
Urbana, IL 61801
(217) 333-9369
FEATURES from to/span description
rRNA 1 1553 16S ribosomal RNA
BASE COUNT 381 a 320 c 429 g 316 t 107 others
ORIGIN
1 agcatnnaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa
61 gtcgaacgtg aaagtccttc gggatgagta aagtggcgca cgggtgagta acacgtaggt
121 gacctgcctt ttagagggga ataaccagaa gaaattttgg ctaatgccgc atacgaagca
181 cggttttaag actgtgcttg aaagaatgcc tctgcatatg ngcattcgct attagatggg
241 cctgcgggac attagctagt tggtggggta aaggcctacc aaggcgacga tgtctatccg
301 gtctgagagg atgatcggac acactggaac tgagacacgg tccagactcc tacgggaggc
361 agcagtgggg aatattgcgc aatgggggaa accctgacgc agcaacgccg cgtgagtgag
421 gaaggacttc ggtctgtaaa gctctgttaa tgtggaaaaa tggcagttgg tctaataggc
481 cnattgtttg atggtacaca tagaggaagc accggctaac ttcgtgccag cagccgcggt
541 aatacgaagg gtgcnagcgt tgttcggatt tattgggcgt aaagcgcgcg taggcggacc
601 tgcaagtcag atgtgaaatc tcggggctca acctcgaaac tgcgtctgaa actacaggtc
661 tagaatctcg gagggggaag gggaatatcg catgtagggg taaaatccgt agatatgcga
721 tggaacacca gaggcgaagg cgccttcctg gacgagtatt gacgctgagg cncnnaagcg
781 tggggatcaa acaggattag ataccctggt agtccacgct gtaaacgatg aacactagat
841 attggaggat ttgacccctt cagtgtcgta gctaacgcgt caagtgttcc gcctgggaag
901 tacggtcgca agactaaaac tcaaaggaat tgnnnnnnnn nngcacaagn nnnngattat
961 gnngtttaat tcgnngcaac gcgcagaacc ttacctaggc ttgaaatcct acgaatccct
1021 tttaaacgag ggagtgctct tcggagaatg tagtgacagg cgctgcatgg ctgtcgtcag
1081 ctcgtgtcgt gagatgttgg gttaagtctc gcaacgagcg caacccccat ttttagttgc
1141 cagcattaag ttgggcactc tagaaagact gcntgggcta accaggagga aggtggggat
1201 gacgtcaagt cctcatggcc cttatgtcta gggctacaca cgtaatacaa tggtcggtac
1261 aaagggatgc gaactcgcga gggggagcca atctcaaaaa accgatctca gtccggattg
1321 gagtctgcaa ctcgactcca tgaagttgga atcgcgagta atcgcggatc agcacgccgc
1381 ggtgaatacg ttcccgggcc ttgtacacac cgcccgtcac accatgggag ttgtttttac
1441 ctgaagnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1501 nnnnnnngta acaagnnnnn nnnnnnngaa cctgnnnnnn gatcacctcc ttt
//
LOCUS PLTRRSAA 1525 bp ss-rRNA RNA 03-JUL-1990
DEFINITION P.staleyi 16S ribosomal RNA.
ACCESSION M34126
KEYWORDS 16S ribosomal RNA.
SOURCE P.staleyi (strain ATCC 27377) ribosomal RNA.
ORGANISM Planctomyces staleyi
Prokaryota; Bacteria; Eubacteriomycetes; Eubacteriales.
REFERENCE 1 (bases 1 to 1525)
AUTHORS Woese,C.R. and Oyalzu,H.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 15-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept Microbiology
131 Burrill Hall
407 S. Goodwin Ave.
Urbana, IL 61801
(217) 333-9369
FEATURES from to/span description
rRNA 1 1525 16S ribosomal RNA
BASE COUNT 376 a 348 c 482 g 315 t 4 others
ORIGIN
1 caattgaaga gtttgatcct ggctcagaat gaacgttggc ggcatggatt aggcatgcaa
61 gtcgtgcgcg atatgtagca atacatggag agcggcgaaa gggagagtaa tacgtaggaa
121 cctaccttcg ggtctgggat agcggcggga aactgccggt aataccagat gatgtttccg
181 aaccaaaggt gtgattccgc ctgaagaggg gcctacgtcg tattagctag ttggtagggt
241 aatggcctac caaggcaaag atgcgtatgg ggtgtgagag catgccccca ctcactggga
301 ctgagacact gcccagacac ctacgggtgg ctgcagtcga gaatcttcgg caatgggcga
361 aagcctgacc gagcgatgcc gcgtgcggga tgaaggcctt cgggttgtaa accgctgtcg
421 taggggatga agtgctaggg ggttctccct ctagtttgag ctgaacctag gaggaagggc
481 cggctaatct cgtgccanna gccgcggtaa tacgagaggc ccaaacgtta ttcggattta
541 ctgggcttaa agagttcgta ggcggtcttg taagtggggt gtgaaatccc tcggctcaac
601 cgaggaactg cgctccaaac tacaagactt gagggggata gaggtaagcg gaactgatgg
661 tggagcggtg aaatgcgttg atatcatcag gaacaccgga ggcgaaggcg gcttactggg
721 tcctttctga cgctgaggaa cgaaagctag gggagcaaac gggattagat accccggtag
781 tcctagccgt aaacgatgag cactggaccg gagctctgca cagggtttcg gtcgtagcga
841 aagtgttaag tgctccgcct ggggagtatg gtcgcaaggc tgaaactcaa aggaattgac
901 gggggctcac acaagcggtg gaggatgtgg cttaattcga ggctacgcga agaaccttat
961 cctagtcttg acatgcttag gaatcttcct gaaagggagg agtgctcgca agagagcctt
1021 tgcacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgtcgggt taagtccctt
1081 aacgagcgaa acccttgtcc ttagttacca gcgcgtcatg gcggggactc taaggagact
1141 gccggtgtta aaccggagga aggtggggat gacgtcaagt cctcatggcc tttatgatta
1201 gggctgcaca cgtcctacaa tggtgcacac aaagcgacgc aaactcgtga gagccagcta
1261 atcgcaaaaa atgtacctca gttcggattg caggctgcaa ctcgcctgca tgaagctgga
1321 atcgctagta atcgcgggtc agcataccgc ggtgaatntg ttcctgagcc ttgtacacac
1381 cgcccntcaa gccacgaaag tgggggggac ccaacagcgc tgccgtaacc gcaaggaaca
1441 aggcgcctaa ggtcaactcc gtgattggga ctaagtcgta acaaggtagc cgtaggggaa
1501 cctgcggctg gatcacctcc tttct
//
LOCUS RDCRRSAA 1478 bp ss-rRNA BCT 03-JUL-1990
DEFINITION R.purpureus 16S ribosomal RNA.
ACCESSION M34132
KEYWORDS 16S ribosomal RNA.
SOURCE R.purpureus (strain 6770) ribosomal RNA.
ORGANISM Rhodocyclus purpureus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 1478)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept. Microbiology
131 Burrill Hall
407 S. Goodwin Ave.
Urbana, IL 61801
(217) 333-9369
FEATURES from to/span description
rRNA 1 1478 16S ribosomal RNA
BASE COUNT 367 a 345 c 470 g 289 t 7 others
ORIGIN
1 tgaactgaag agtttgatcc tggctcagat tgaacgctgg cggcatgcct tacacatgca
61 agtcgaacgg taacgggncc ttcgggcgcc gaacgagtgg cgaacgggtg agtaatgcat
121 cggaacatgc cctgaagtgg gggataacgt agcgaaagtt acgctaatac cgcatattct
181 gtgagcagga aagcagggga ccttcgggcc ttgcgctttg ggagtggccg atgtcggatt
241 agctagttgg tggggtaaaa gcctaccaag gcaacgatcc gtagcgggtc tgagaggatg
301 atccgccaca ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat
361 tttggacaat gggcgaaagc ctgatccagc catgccgcgt gagtgaagaa ggccttcggg
421 ttgtaaagct ctttcggcgg ggaagaaatc gggtttccta atacggaacc cggatgacgg
481 tacccgaaga agaagcaccg gctaactacg tgccagcagc cgcggtaata cgtagggtgc
541 nagcgttaat cggaattact gggcgtaaag cgtgcgcagg cggttgtgta agacagacgt
601 gaaatccccg ggctcaacct gggaactgcg tttgtgactg cacagctaga gtacggcaga
661 ggggggtgga attccacgtg tagcagtgaa atgcgtagag atgtggagga acaccgatgg
721 cgaaggcagc cccctgggcc aatactgacg ctcatgcacg naagcgtggg gagcaaacag
781 gattagatac cctggtagtc cacgccctaa acgatgtcaa ctaggtgttg gtggggttaa
841 acccattagt gccgtagcta acgcgtgaag ttgaccgcct ggggagtacg gcggcaaggt
901 taaaactcaa aggaattgac gggganccgc acaagcggtg gatgatgtgg attaattcga
961 tgcaacgcga aaaaccttac ctacccttga catgtcagga atcctgagga gactcgggag
1021 tgcccgaaag ggnacctgaa cacaggtgct gcatggcngt cgtcagctcg tgtcgtgaga
1081 tgttgggtta agtcccgcaa cgagcgcaac ccttgtcatt aattgccatc attcagttgg
1141 gcactttaat gaaactgccg gtgacaaacc ggaggaaggt ggggatgacg tcaagtcctc
1201 atggccctta tgggtagggc ttcacacgtc atacaatggt cggtccatag ggttgcnaac
1261 ccgcgagggg gagctaatcc cagaaagccg atcgtagtcc ggattgcagt ctgcaactcg
1321 actgcatgaa gtcggaatcg ctagtaatcg cggatcagca tgtcgcggtg aatacgttcc
1381 cgggtcttgt acacaccgcc cgtcacacca tgggagcggg ttctgccaga agtagttagc
1441 ctaaccgcaa ggagggcgat taccacggca gcgttcgt
//
LOCUS HUMFGF2H 3365 bp ss-mRNA PRI 03-JUL-1990
DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form
mRNA, complete cds.
ACCESSION M34185
KEYWORDS FGF receptor; fibroblast growth factor receptor;
transmembrane tyrosine kinase.
SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
clone h2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3365)
AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
TITLE Diverse forms of a receptor for acidic and basic fibroblast growth
factors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.E.Johnson, 10-MAY-1990.
Author address: D.E.Johnson
University of California San Francisco
4th and Parnassus
Howard Hughes Medical Institute
San Francisco, CA 94143
(415) 476-4297
FEATURES from to/span description
pept 256 2457 fibroblast growth factor receptor (FGFr)
transmembrane form
BASE COUNT 786 a 962 c 917 g 700 t
ORIGIN
1 gcaccgagcg ccgccgggag tcgagcgccg gccgcggagc tcttgcgacc ccgccaggac
61 ccgaacagag cccgggggcg gcgggccgga gccggggacg cgggcacacg cccgctcgca
121 caagccacgg cggactctcc cgaggcggaa cctccacgcc gagcgagggt cagtttgaaa
181 aggaggatcg agctcactgt ggagtatcca tggagatgtg gagccttgtc accaacctct
241 aactgcagaa ctgggatgtg gagctggaag tgcctcctct tctgggctgt gctggtcaca
301 gccacactct gcaccgctag gccgtccccg accttgcctg aacaagatgc tctcccctcc
361 tcggaggatg atgatgatga tgatgactcc tcttcagagg agaaagaaac agataacacc
421 aaaccaaacc gtatgcccgt agctccatat tggacatccc cagaaaagat ggaaaagaaa
481 ttgcatgcag tgccggctgc caagacagtg aagttcaaat gcccttccag tgggacccca
541 aaccccacac tgcgctggtt gaaaaatggc aaagaattca aacctgacca cagaattgga
601 ggctacaagg tccgttatgc cacctggagc atcataatgg actctgtggt gccctctgac
661 aagggcaact acacctgcat tgtggagaat gagtacggca gcatcaacca cacataccag
721 ctggatgtcg tggagcggtc ccctcaccgg cccatcctgc aagcagggtt gcccgccaac
781 aaaacagtgg ccctgggtag caacgtggag ttcatgtgta aggtgtacag tgacccgcag
841 ccgcacatcc agtggctaaa gcacatcgag gtgaatggga gcaagattgg cccagacaac
901 ctgccttatg tccagatctt gaagactgct ggagttaata ccaccgacaa agagatggag
961 gtgcttcact taagaaatgt ctcctttgag gacgcagggg agtatacgtg cttggcgggt
1021 aactctatcg gactctccca tcactctgca tggttgaccg ttctggaagc cctggaagag
1081 aggccggcag tgatgacctc gcccctgtac ctggagatca tcatctattg cacaggggcc
1141 ttcctcatct cctgcatggt ggggtcggtc atcgtctaca agatgaagag tggtaccaag
1201 aagagtgact tccacagcca gatggctgtg cacaagctgg ccaagagcat ccctctgcgc
1261 agacaggtaa cagtgtctgc tgactccagt gcatccatga actctggggt tcttctggtt
1321 cggccatcac ggctctcctc cagtgggact cccatgctag caggggtctc tgagtatgag
1381 cttcccgaag accctcgctg ggagctgcct cgggacagac tggtcttagg caaacccctg
1441 ggagagggct gctttgggca ggtggtgttg gcagaggcta tcgggctgga caaggacaaa
1501 cccaaccgtg tgaccaaagt ggctgtgaag atgttgaagt cggacgcaac agagaaagac
1561 ttgtcagacc tgatctcaga aatggagatg atgaagatga tcgggaagca taagaatatc
1621 atcaacctgc tgggggcctg cacgcaggat ggtcccttgt atgtcatcgt ggagtatgcc
1681 tccaagggca acctgcggga gtacctgcag gcccggaggc ccccagggct ggaatactgc
1741 tacaacccca gccacaaccc agaggagcag ctctcctcca aggacctggt gtcctgcgcc
1801 taccaggtgg cccgaggcat ggagtatctg gcctccaaga agtgcataca ccgagacctg
1861 gcagccagga atgtcctggt gacagaggac aatgtgatga agatagcaga ctttggcctc
1921 gcacgggaca ttcaccacat cgactactat aaaaagacaa ccaacggccg actgcctgtg
1981 aagtggatgg cacccgaggc attatttgac cggatctaca cccaccagag tgatgtgtgg
2041 tctttcgggg tgctcctgtg ggagatcttc actctgggcg gctccccata ccccggtgtg
2101 cctgtggagg aacttttcaa gctgctgaag gagggtcacc gcatggacaa gcccagtaac
2161 tgcaccaacg agctgtacat gatgatgcgg gactgctggc atgcagtgcc ctcacagaga
2221 cccaccttca agcagctggt ggaagacctg gaccgcatcg tggccttgac ctccaaccag
2281 gagtacctgg acctgtccat gcccctggac cagtactccc ccagctttcc cgacacccgg
2341 agctctacgt gctcctcagg ggaggattcc gtcttctctc atgagccgct gcccgaggag
2401 ccctgcctgc cccgacaccc agcccagctt gccaatggcg gactcaaacg ccgctgactg
2461 ccacccacac gccctcccca gactccaccg tcagctgtaa ccctcaccca cagcccctgc
2521 tgggcccacc acctgtccgt ccctgtcccc tttcctgctg gcaggagccg gctgcctacc
2581 aggggccttc ctgtgtggcc tgccttcacc ccactcagct cacctctccc tccacctcct
2641 ctccacctgc tggtgagagg tggcaaagag gcagatcttt gctgccagcc acttcatccc
2701 ctcccagatg ttggaccaac acccctccct gccaccaggc actgcctgga gggcagggag
2761 tgggagccaa tgaacaggca tgcaagtgag agcttcctga gctttctcct gtcggtttgg
2821 tctgttttgc cttcacccat aagcccctcg cactctggtg gcaggtgcct tgtcctcagg
2881 gctacagcag tagggaggtc agtgcttcgt gcctcgattg aaggtgacct ctgccccaga
2941 taggtggtgc cagtggctta ttaattccga tactagtttg ctttgctgac caaatgcctg
3001 gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc ctggggccca
3061 gccccaaact gggggctctg tatatagcta tgaagaaaac acaaagtgta taaatctgag
3121 tatatattta catgtctttt taaaagggtc gttaccagag atttacccat cgggtaagat
3181 gctcctggtg gctgggaggc atcagttgct atatattaaa aacaaaaaag aaaaaaaagg
3241 aaaacgtttt taaaaaggtc atatattttt tgctactttt gctgttttat ttttttaaat
3301 tatgttctaa acctattttc agtttaggtc cctcaataaa aattgctgct gcttcaaaaa
3361 aaaaa
//
LOCUS HUMFGF3H 3503 bp ss-mRNA PRI 03-JUL-1990
DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form
mRNA, complete cds.
ACCESSION M34186
KEYWORDS FGF receptor; fibroblast growth factor receptor;
transmembrane tyrosine kinase.
SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
clone h3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3503)
AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
TITLE Diverse forms of a receptor for acidic and basic fibroblast growth
factors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.E.Johnson, 10-MAY-1990.
Author address: D.E.Johnson
University of California San Francisco
4th and Parnassus
Howard Hughes Medical Institute
San Francisco, CA 94143
(415) 476-4297
FEATURES from to/span description
pept 527 2722 fibroblast growth factor receptor (FGFr)
transmembrane form
BASE COUNT 777 a 1044 c 1005 g 677 t
ORIGIN
1 gcggaaccca aggacttttc tccggtccga gctcggggcg ccccgcaccg ggacggtacc
61 cgtgctgcag tcgggcacgc cgcgggcccg ccgggggcct ccgcagggcg atggagccgg
121 tctgcaagga aagtgaggcg ccgccgctgc gttctggagg aggggggcac aaggtctgga
181 gaccccgggt ggcggacggg agccctcccc ccgccccgcc tccggggcac cagctccggc
241 tccattgttc ccgcccgggc tggaggcgcc gagcaccgag cgccgccggg agtcgagcgc
301 cggccgcgga gtcttgcgac cccgccagga cccgaacaga gcccgggggc ggcgggccgg
361 agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc ccgaggcgga
421 acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg tggagtatcc
481 atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt ggagctggaa
541 gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta ggccgtcccc
601 gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc
661 ctcttcagag gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac
721 atccccagaa aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt
781 caaatgccct tccagtggga ccccaaaccc cacactgcgc tggttggaaa atggcaaaga
841 attcaaacct gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat
901 aatggactct gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta
961 cggcagcatc aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat
1021 cctgcaagca gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat
1081 gtgtaaggtg tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa
1141 tgggagcaag attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt
1201 taataccacc gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc
1261 aggggagtat acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt
1321 gaccgttctg gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga
1381 gatcatcatc tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt
1441 ctacaagatg aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa
1501 gctggccaag agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc
1561 catgaactct ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat
1621 gctagcaggg gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga
1681 cagactggtc ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga
1741 ggctatcggg ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt
1801 gaagtcggac gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa
1861 gatgatcggg aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc
1921 cttgtatgtc atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg
1981 gaggccccca gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc
2041 ctccaaggac ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc
2101 caagaagtgc atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt
2161 gatgaagata gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa
2221 gacaaccaac ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat
2281 ctacacccac cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct
2341 gggcggctcc ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg
2401 tcaccgcatg gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg
2461 ctggcatgca gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg
2521 catcgtggcc ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta
2581 ctcccccagc tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt
2641 ctctcatgag ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa
2701 tggcggactc aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc
2761 tgtaaccctc acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc
2821 tgctggcagg agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact
2881 cagctcacct ctccctccac ctcctctcca cctgctggtg agaggtggca aagaggcaga
2941 tcttttcact gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc
3001 accaggcact gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc
3061 ttcctgagct ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac
3121 tctggtggca ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc
3181 tcgattgaag gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac
3241 tagtttgctt tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg
3301 ggggcagtgt tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga
3361 agaaaacaca aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt
3421 accagagatt tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata
3481 tattaaaaac aaaaaaaaaa aaa
//
LOCUS HUMFGF4H 2283 bp ss-mRNA PRI 03-JUL-1990
DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form
mRNA, complete cds.
ACCESSION M34187
KEYWORDS FGF receptor; fibroblast growth factor receptor;
transmembrane tyrosine kinase.
SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
clone h4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2283)
AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
TITLE Diverse forms of a receptor for acidic and basic fibroblast growth
factors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.E.Johnson, 10-MAY-1990.
Author address: D.E.Johnson
University of California San Francisco
4th and Parnassus
Howard Hughes Medical Institute
San Francisco, CA 94143
(415) 476-4297
FEATURES from to/span description
pept 417 1325 fibroblast growth factor receptor (FGFr)
transmembrane form
BASE COUNT 566 a 621 c 612 g 484 t
ORIGIN
1 ggagcccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga ggggggcaca
61 aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct ccggggcacc
121 agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc gccgccggga
181 gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga gcccgggggc
241 ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc
301 ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg
361 tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt
421 ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta
481 ggccgtcccc gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg
541 atgatgactc ctcttcagag gagaaagaaa cagataacac caaaccaaac cgtatgcccg
601 tagctccata ttggacatcc ccagaaaaga tggaaaagaa attgcatgca gtgccggctg
661 ccaagacagt gaagttcaaa tgcccttcca gtgggacccc aaaccccaca ctgcgctggt
721 tgaaaaatgg caaagaattc aaacctgacc acagaattgg aggctacaag gtccgttatg
781 ccacctggag catcataatg gactctgtgg tgccctctga caagggcaac tacacctgca
841 ttgtggagaa tgagtacggc agcatcaacc acacatacca gctggatgtc gtggagcggt
901 cccctcaccg gcccatcctg caagcagggt tgcccgccaa caaaacagtg gccctgggta
961 gcaacgtgga gttcatgtgt aaggtgtaca gtgacccgca gccgcacatc cagtggctaa
1021 agcacatcga ggtgaatggg agcaagattg gcccagacaa cctgccttat gtccagatct
1081 tgaaggtaat catggcacca gtcttcgtgg gccagtctac tgggaaggag accactgtct
1141 cgggggctca agttcctgtg ggcaggctca gttgcccccg aatgggatca ttcctcacgc
1201 ttcaggcaca cacactccat ctcagtaggg atctagccac atcccccagg actagtaaca
1261 gaggtcacaa agtggaggtg agctgggaac agagggctgc agggatgggt ggtgctggtc
1321 tgtaataagc tttgagagca acgtcactgg ggctttgggg tcagctacac aaggaaggca
1381 tttggacccc tgccttttca ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt
1441 cttagccctg tgttctgagt tacgtacgat ctttctggca aatggggtgc atgataagag
1501 catctcttac gaagagttgg aaaaacaaat gccatatata aattctaagc catatgagga
1561 cgaggagtaa tggcattttc ttcctttttc ctctcactcc cagacattca ttgtccctga
1621 atgctccatt aatccaggga aggtaattgc ctaaatctcc agtggatctc gcaacaggaa
1681 ggaaccagaa gctgggaaag ttgtttacct ctttgtccca gagttagacc tcatcctccc
1741 ctagcttagc tgtctcagag atatactggc cctcccttct cttctctttg ctgctggtgc
1801 taaaactgct ctgtaggtca ttggccactg tctccactca caacccctgc tccagtcctg
1861 gagggagtgg gttaaacaca aatagaacat tccatttgaa gcagtgattc tttttttttt
1921 tttttttttt taatcaaatg ctttggactt ttgaagtcca cttgttctgt acttgtaaaa
1981 gggaaagaag gccgggcgca gtcgtcacgc ctgtaatccc agcactttag atcacttgag
2041 gtcaggagtt tgagaccagc ccggccaaca tggtgaaacc ccatctctac taaaaataca
2101 aaaattagct gtgcatagtg gttggcacct gtagtcccag ctactcagga ggctgaggca
2161 agctaactgc ttgaacccag aaggcagagg ttgcagtgag ctgagatcac gccactgcac
2221 tccagcctgg gtgacagagt gagtgagact ctgcgttaaa aaaaaaaaaa aaaaaaaaaa
2281 aaa
//
LOCUS HUMFGF5H 1625 bp ss-mRNA PRI 03-JUL-1990
DEFINITION Human fibroblast growth factor receptor (FGFr) secreted form mRNA,
complete cds.
ACCESSION M34188
KEYWORDS FGF receptor; fibroblast growth factor receptor.
SOURCE Human female placenta endothelial cell line HUVEC, cDNA to mRNA,
clone h5.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1625)
AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
TITLE Diverse forms of a receptor for acidic and basic fibroblast growth
factors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.E.Johnson, 10-MAY-1990.
Author address: D.E.Johnson
University of California San Francisco
4th and Parnassus
Howard Hughes Medical Institute
San Francisco, CA 94143
(415) 476-4297
FEATURES from to/span description
pept 523 1425 fibroblast growth factor receptor (FGFr)
transmembrane form
BASE COUNT 368 a 480 c 489 g 288 t
ORIGIN
1 cggaacccaa ggacttttct ccggtccgag ctcggggcgc cccgcaggcg acggtacccg
61 tgctgcagtc gggcacgccg cgggcccggg gcctccgcag ggcgatggag cccggtctgc
121 aaggaaagtg aggcgccgcc gctgcgttct ggaggagggg ggcacaaggt ctggagaccc
181 cgggtggcgg acgggagccc tccccccgcc ccgcctccgg ggcaccagct ccggctccat
241 tgttcccgcc cgggctggag gcgccgagca ccgagcgccg ccgggagtcg agcgccggcc
301 gcggagctct tgcgaccccg ccaggacccg aacagagccc gggggcggcg ggccggagcc
361 ggggacgcgg gcacacgccc gctcgcacaa gccacggcgg actctcccga ggcggaacct
421 ccacgccgag cgagggtcag tttgaaaagg aggatcgagc tcactgtgga gtatccatgg
481 agatgtggag ccttgtcacc aacctctaac tgcagaactg ggatgtggag ctggaagtgc
541 ctcctcttct gggctgtgct ggtcacagcc acactctgca ccgctaggcc gtccccgacc
601 ttgcctgaac aagatgctct cccctcctcg gaggatgatg atgatgatga tgactcctct
661 tcagaggaga aagaaacaga taacaccaaa ccaaaccccg tagctccata ttggacatcc
721 ccagaaaaga tggaaaagaa attgcatgca gtgccggctg ccaagacagt gaagttcaaa
781 tgcccttcca gtgggacccc aaaccccaca ctgcgctggt tgaaaaatgg caaagaattc
841 aaacctgacc acagaattgg aggctacaag gtccgttatg ccacctggag catcataatg
901 gactctgtgg tgccctctga caagggcaac tacacctgca ttgtggagaa tgagtacggc
961 agcatcaacc acacatacca gctggatgtc gtggagcggt cccctcaccg gcccatcctg
1021 caagcagggt tgcccgccaa caaaacagtg gccctgggta gcaacgtgga gttcatgtgt
1081 aaggtgtaca gtgacccgca gccgcacatc cagtggctaa agcacatcga ggtgaatggg
1141 agcaagattg gcccagacaa cctgccttat gtccagatct tgaaggtaat catggcacca
1201 gtcttcgtgg gccagtctac tgggaaggag accactgtct cgggggctca agttcctgtg
1261 ggcaggctca gttgcccccg aatgggatca ttcctcacgc ttcaggcaca cacactccat
1321 ctcagtaggg atctagccac atcccccagg actagtaaca gaggtcacaa agtggaggtg
1381 agctgggaac agagggctgc agggatgggt ggtgctggtc tgtaataagc tttgagagca
1441 acgtcactgg ggctttgggg tcagctacac aaggaaggca tttggacccc tgccttttca
1501 ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt cttagccctg tgtcctgagt
1561 tacgtacgat ctttctggca aatggggtgc atgataagag catctcttac gaagagttgg
1621 aaaaa
//
LOCUS SYNLACZA 6476 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD1.27.
ACCESSION M34296
KEYWORDS lacZ.
SOURCE Cloning vector pPD1.27.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6476)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3500 3501 E.coli lacZ end/SV40 start
recomb 3634 3635 SV40 end/synthetic start
recomb 3657 3658 synthetic end/pUC19 start
recomb 4076 4077 pUC19 end/C.elegans sup-7 start
recomb 4442 4443 C.elegans sup-7 end/pUC19 start
BASE COUNT 1560 a 1646 c 1676 g 1594 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc agctgagcgc cggtcgctac cattaccagt tggtctggtg tcaaaaataa
3241 taataaccgg gcaggccatg tctgcccgta tttcgcgtaa ggaaatccat tatgtactat
3301 ttaaaaaaca caaacttttg gatgttcggt ttattctttt tcttttactt ttttatcatg
3361 ggagcctact tcccgttttt cccgatttgg ctacatgaca tcaaccatat cagcaaaagt
3421 gatacgggta ttatttttgc cgctatttct ctgttctcgc tattattcca accgctgttt
3481 ggtctgcttt ctgacaaact cggaacttgt ttattgcagc ttataatggt tacaaataaa
3541 gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt
3601 tgtccaaact catcaatgta tcttatcatg tctggatcga caaagtcaaa gcggccgcct
3661 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct
3721 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc
3781 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt
3841 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa
3901 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
3961 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
4021 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa
4081 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc
4141 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt
4201 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt
4261 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt
4321 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac
4381 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg
4441 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt
4501 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc
4561 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat
4621 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct
4681 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca
4741 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg
4801 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa
4861 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg
4921 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga
4981 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg
5041 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt
5101 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg
5161 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc
5221 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca
5281 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc
5341 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat
5401 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc
5461 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg
5521 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct
5581 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct
5641 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct
5701 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg
5761 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc
5821 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga
5881 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg
5941 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
6001 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg
6061 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg
6121 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat
6181 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc
6241 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc
6301 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa
6361 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc
6421 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS SYNLACZB 6096 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD8.02.
ACCESSION M34297
KEYWORDS lacZ.
SOURCE Cloning vector pPD8.02.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6096)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 3275 3276 synthetic end/unknown DNA start
recomb 3696 3697 unknown DNA end/C.elegans sup-7 end
recomb 4062 4063 C.elegans sup-7 end/pUC19 start
BASE COUNT 1458 a 1571 c 1616 g 1451 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggcctact agtcggccgt acgggccctt aaggccgcct gatgcggtat tttctcctta
3301 cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg
3361 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt
3421 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc
3481 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat
3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg
3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc
3661 tcatgagaca ataaccctga taaatgcttc aataatacaa ttttcagaat acgttttttg
3721 tgggcttggg tatattgttt ttaatgttat acttgcagtc gtgaaatttg attttcaaat
3781 ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt gtcaaaaacc aatttcaaca
3841 aattttcgtg tgagaaatac attaccagaa ggcatttttt cacacgatta gcattttgga
3901 ctactttatt aaatttttgc gtgtaatttt gaattaaatt gtattatatt actacttaaa
3961 aaacaaaaaa tttgaccact gagcggatcg aacgcccaac ctttcgatct agagtcgaac
4021 gcgctaccat tgcgccaagc agtcatgtta ttctctcttg tcattgaaaa aggaagagta
4081 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg
4141 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac
4201 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg
4261 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc
4321 gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg
4381 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat
4441 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg
4501 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg
4561 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc
4621 ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt
4681 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct
4741 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc
4801 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca
4861 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct
4921 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt
4981 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga
5041 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca
5101 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac
5161 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg
5221 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag
5281 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac
5341 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt
5401 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg
5461 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc
5521 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc
5581 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc
5641 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa
5701 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt
5761 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg
5821 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag
5881 agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc
5941 acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc
6001 tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa
6061 ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS SYNLACZC 7376 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD8.33.
ACCESSION M34298
KEYWORDS lacZ.
SOURCE Cloning vector pPD8.33.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 7376)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 3244 3245 synthetic end/unknown DNA start
recomb 3524 3425 unknown DNA end/C.elegans sup-7 end
recomb 4555 4556 synthetic end/pUC19 start
recomb 4976 4977 pUC19 end/C.elegans sup-7 start
recomb 5342 5343 C.elegans sup-7 end/pUC19 start
BASE COUNT 1863 a 1793 c 1814 g 1906 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat
3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat
3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac
3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg
3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca
3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca
3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc
3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt
3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg
3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa
3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga
3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa
3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa
4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg
4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg
4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa
4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa
4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc
4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta
4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa
4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca
4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt aaggccgcct
4561 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct
4621 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc
4681 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt
4741 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa
4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa
4981 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc
5041 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt
5101 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt
5161 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt
5221 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac
5281 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg
5341 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt
5401 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc
5461 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat
5521 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct
5581 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca
5641 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg
5701 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa
5761 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg
5821 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga
5881 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg
5941 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt
6001 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg
6061 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc
6121 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca
6181 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc
6241 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat
6301 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc
6361 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg
6421 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct
6481 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct
6541 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct
6601 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg
6661 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc
6721 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga
6781 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg
6841 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
6901 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg
6961 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg
7021 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat
7081 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc
7141 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc
7201 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa
7261 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc
7321 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS SYNLACZD 5730 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD16.43.
ACCESSION M34299
KEYWORDS lacZ.
SOURCE Cloning vector pPD16.43.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 5730)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 3264 3265 synthetic end/unknown DNA start
recomb 3513 3514 unknown DNA end/synthetic start
recomb 3524 3525 synthetic end/pUC19 start
BASE COUNT 1348 a 1488 c 1580 g 1314 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggcctact agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga
3301 aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg
3361 gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa
3421 ctatgcggca tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca
3481 cagatgcgta aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat
3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg
3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc
3661 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta
3721 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg
3781 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg
3841 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac
3901 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg
3961 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt
4021 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg
4081 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac
4141 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt
4201 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag
4261 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc
4321 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc
4381 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta
4441 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg
4501 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga
4561 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac
4621 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa
4681 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat
4741 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc
4801 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg
4861 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc
4921 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg
4981 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg
5041 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa
5101 cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg
5161 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga
5221 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct
5281 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca
5341 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc
5401 ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg
5461 ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc
5521 caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca
5581 ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc
5641 attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga
5701 gcggataaca atttcacaca ggaaacagct
//
LOCUS SYNLACZE 7010 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD16.51.
ACCESSION M34300
KEYWORDS lacZ.
SOURCE Cloning vector pPD16.51.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 7010)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 4544 4545 synthetic end/pUC19 start
recomb 4793 4794 pUC19 end/synthetic start
recomb 3244 3245 synthetic end/unknown DNA start
recomb 4524 4525 unknown DNA end/synthetic start
recomb 4804 4805 synthetic end/pUC19 start
BASE COUNT 1753 a 1710 c 1778 g 1769 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat
3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat
3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac
3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg
3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca
3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca
3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc
3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt
3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg
3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa
3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga
3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa
3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa
4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg
4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg
4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa
4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa
4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc
4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta
4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa
4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca
4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt tcgtctcgcg
4561 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct
4621 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc
4681 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat
4741 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcggccttaa
4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg
4981 aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc
5041 attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga
5101 tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga
5161 gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg
5221 cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc
5281 tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac
5341 agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact
5401 tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca
5461 tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg
5521 tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact
5581 acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg
5641 accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg
5701 tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat
5761 cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc
5821 tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat
5881 actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt
5941 tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc
6001 cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt
6061 gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac
6121 tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt
6181 gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct
6241 gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga
6301 ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac
6361 acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagcattg
6421 agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt
6481 cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc
6541 tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg
6601 gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc
6661 ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc
6721 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag
6781 cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca
6841 ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat
6901 taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg
6961 tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct
//
LOCUS SYNLACZF 7088 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD18.32.
ACCESSION M34301
KEYWORDS lacZ.
SOURCE Cloning vector pPD18.32.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 7088)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 180 181 synthetic end/E.coli trpS start
recomb 264 265 E.coli trpS end/synthetic start
recomb 268 269 synthetic end/E.coli lacZ start
recomb 3262 3263 E.coli lacZ end/synthetic start
recomb 4622 4623 synthetic end/pUC19 start
recomb 4871 4872 pUC19 end/synthetic start
recomb 3322 3323 synthetic end/unknown DNA start
recomb 4602 4603 unknown DNA end/synthetic start
recomb 4882 4883 synthetic end/pUC19 start
BASE COUNT 1780 a 1724 c 1797 g 1787 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct
3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc
3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc
3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata
3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta
3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac
3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg
3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa
3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc
3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt
3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg
3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta
4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttaaa
4081 attaaacatt tttctaaatt ttatatgatt tcttttaaat ttgcaaaaat tacttaaatt
4141 tgaattcccg cgcaaatgag tgacttcatt ttctgcatta ttgtgttttc cggctatatt
4201 aataggtatt tgtttgtgtt tttctttatt ttatgattcg aactccaatt tgtaaatttt
4261 cgaacatatt tccctaaaga aaaaatatga ttaatctgga aaaattggaa aattattttt
4321 caaataaaaa acaaagaaaa aaatgaagaa aaacctatta gtttggccat aaaacgcaaa
4381 aatgtcgaaa atgacgtcac tcatctgcgc gggaaatcaa gaataattcg gcctttttta
4441 tttttttgga aaatcgtaaa acatttagaa aaatttttta atagttatag tgggactgta
4501 ttctgtcatt tagggcaaaa gccagagacg ctactccacc gttaacatga attatgaata
4561 ttattgcgac aagacccaaa cattgataaa ccgcaaatct agcctactag tcggccgtac
4621 gggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc
4681 ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc
4741 gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc agagcagatt
4801 gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac
4861 cgcatcaggc ggccttaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat
4921 aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat
4981 ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata
5041 aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct
5101 tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa
5161 agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa
5221 cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt
5281 taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg
5341 tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca
5401 tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa
5461 cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt
5521 gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc
5581 cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa
5641 actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga
5701 ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc
5761 tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga
5821 tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga
5881 acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga
5941 ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat
6001 ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt
6061 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct
6121 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc
6181 ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc
6241 aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc
6301 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc
6361 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg
6421 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata
6481 cctacagcgt gagcattgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta
6541 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc
6601 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg
6661 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt
6721 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt
6781 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga
6841 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc
6901 cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg
6961 cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca
7021 ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg
7081 aaacagct
//
LOCUS SYNLACZG 6563 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD21.28.
ACCESSION M34302
KEYWORDS lacZ.
SOURCE Cloning vector pPD21.28.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6563)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 180 181 synthetic end/E.coli trpS start
recomb 264 265 E.coli trpS end/synthetic start
recomb 268 269 synthetic end/E.coli lacZ start
recomb 3262 3263 E.coli lacZ end/synthetic start
recomb 4097 4098 synthetic end/pUC19 start
recomb 4346 4347 pUC19 end/synthetic start
recomb 3322 3323 synthetic end/unknown DNA start
recomb 4077 4078 unknown DNA end/synthetic start
recomb 4357 4358 synthetic end/pUC19 start
BASE COUNT 1587 a 1650 c 1723 g 1603 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct
3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc
3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc
3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata
3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta
3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac
3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg
3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa
3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc
3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt
3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg
3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta
4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttcct
4081 actagtcggc cgtacgggcc ctttcgtctc gcgcgtttcg gtgatgacgg tgaaaacctc
4141 tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga
4201 caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct taactatgcg
4261 gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc
4321 gtaaggagaa aataccgcat caggcggcct taagggcctc gtgatacgcc tatttttata
4381 ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt
4441 gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag
4501 acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca
4561 tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc
4621 agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat
4681 cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc
4741 aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg
4801 gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc
4861 agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat
4921 aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga
4981 gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc
5041 ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc
5101 aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt
5161 aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc
5221 tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc
5281 agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca
5341 ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca
5401 ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt
5461 ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta
5521 acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg
5581 agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc
5641 ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag
5701 cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa
5761 gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc
5821 cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc
5881 gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta
5941 caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc ccgaagggag
6001 aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct
6061 tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga
6121 gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc
6181 ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt
6241 atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg
6301 cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg
6361 caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc
6421 cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc
6481 accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata
6541 acaatttcac acaggaaaca gct
//
LOCUS SYNLACZH 6562 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD22.04.
ACCESSION M34303
KEYWORDS lacZ.
SOURCE Cloning vector pPD22.04.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6562)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 179 180 synthetic end/E.coli trpS start
recomb 263 264 E.coli trpS end/synthetic start
recomb 267 268 synthetic end/E.coli lacZ start
recomb 3261 3262 E.coli lacZ end/synthetic start
recomb 4096 4097 synthetic end/pUC19 start
recomb 4345 4346 pUC19 end/synthetic start
recomb 3321 3322 synthetic end/unknown DNA start
recomb 4076 4077 unknown DNA end/synthetic start
recomb 4356 4357 synthetic end/pUC19 start
BASE COUNT 1587 a 1650 c 1722 g 1603 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt gagggtaccg agctcagaaa aaatgactgc tccaaagaag aagcgtaagg
181 taccggtggg tgaagaccag aaacagcacc tcgaactgag ccgcgatatt gcccagcgtt
241 tcaacgcgct gtatggcgag atcgatcccg tcgttttaca acgtcgtgac tgggaaaacc
301 ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata
361 gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc
421 gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag tgcgatcttc
481 ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac gatgcgccca
541 tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc acggagaatc
601 cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag gaaggccaga
661 cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac gggcgctggg
721 tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca tttttacgcg
781 ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt tatctggaag
841 atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg cataaaccga
901 ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc agccgcgctg
961 tactggaggc tgaagttcag atgtgcggcg agttgcgtga ctacctacgg gtaacagttt
1021 ctttatggca gggtgaaacg caggtcgcca gcggcaccgc gcctttcggc ggtgaaatta
1081 tcgatgagcg tggtggttat gccgatcgcg tcacactacg tctgaacgtc gaaaacccga
1141 aactgtggag cgccgaaatc ccgaatctct atcgtgcggt ggttgaactg cacaccgccg
1201 acggcacgct gattgaagca gaagcctgcg atgtcggttt ccgcgaggtg cggattgaaa
1261 atggtctgct gctgctgaac ggcaagccgt tgctgattcg aggcgttaac cgtcacgagc
1321 atcatcctct gcatggtcag gtcatggatg agcagacgat ggtgcaggat atcctgctga
1381 tgaagcagaa caactttaac gccgtgcgct gttcgcatta tccgaaccat ccgctgtggt
1441 acacgctgtg cgaccgctac ggcctgtatg tggtggatga agccaatatt gaaacccacg
1501 gcatggtgcc aatgaatcgt ctgaccgatg atccgcgctg gctaccggcg atgagcgaac
1561 gcgtaacgcg aatggtgcag cgcgatcgta atcacccgag tgtgatcatc tggtcgctgg
1621 ggaatgaatc aggccacggc gctaatcacg acgcgctgta tcgctggatc aaatctgtcg
1681 atccttcccg cccggtgcag tatgaaggcg gcggagccga caccacggcc accgatatta
1741 tttgcccgat gtacgcgcgc gtggatgaag accagccctt cccggctgtg ccgaaatggt
1801 ccatcaaaaa atggctttcg ctacctggag agacgcgccc gctgatcctt tgcgaatacg
1861 cccacgcgat gggtaacagt cttggcggtt tcgctaaata ctggcaggcg tttcgtcagt
1921 atccccgttt acagggcggc ttcgtctggg actgggtgga tcagtcgctg attaaatatg
1981 atgaaaacgg caacccgtgg tcggcttacg gcggtgattt tggcgatacg ccgaacgatc
2041 gccagttctg tatgaacggt ctggtctttg ccgaccgcac gccgcatcca gcgctgacgg
2101 aagcaaaaca ccagcagcag tttttccagt tccgtttatc cgggcaaacc atcgaagtga
2161 ccagcgaata cctgttccgt catagcgata acgagctcct gcactggatg gtggcgctgg
2221 atggtaagcc gctggcaagc ggtgaagtgc ctctggatgt cgctccacaa ggtaaacagt
2281 tgattgaact gcctgaacta ccgcagccgg agagcgccgg gcaactctgg ctcacagtac
2341 gcgtagtgca accgaacgcg accgcatggt cagaagccgg gcacatcagc gcctggcagc
2401 agtggcgtct ggcggaaaac ctcagtgtga cgctccccgc cgcgtcccac gccatcccgc
2461 atctgaccac cagcgaaatg gatttttgca tcgagctggg taataagcgt tggcaattta
2521 accgccagtc aggctttctt tcacagatgt ggattggcga taaaaaacaa ctgctgacgc
2581 cgctgcgcga tcagttcacc cgtgcaccgc tggataacga cattggcgta agtgaagcga
2641 cccgcattga ccctaacgcc tgggtcgaac gctggaaggc ggcgggccat taccaggccg
2701 aagcagcgtt gttgcagtgc acggcagata cacttgctga tgcggtgctg attacgaccg
2761 ctcacgcgtg gcagcatcag gggaaaacct tatttatcag ccggaaaacc taccggattg
2821 atggtagtgg tcaaatggcg attaccgttg atgttgaagt ggcgagcgat acaccgcatc
2881 cggcgcggat tggcctgaac tgccagctgg cgcaggtagc agagcgggta aactggctcg
2941 gattagggcc gcaagaaaac tatcccgacc gccttactgc cgcctgtttt gaccgctggg
3001 atctgccatt gtcagacatg tataccccgt acgtcttccc gagcgaaaac ggtctgcgct
3061 gcgggacgcg cgaattgaat tatggcccac accagtggcg cggcgacttc cagttcaaca
3121 tcagccgcta cagtcaacag caactgatgg aaaccagcca tcgccatctg ctgcacgcgg
3181 aagaaggcac atggctgaat atcgacggtt tccatatggg gattggtggc gacgactcct
3241 ggagcccgtc agtatcggcg gaattccaac tgagcgccgg tcgctaccat taccaacttg
3301 tctggtgtca aaaataatag gggccgctgt catcagatcg ccatctcgcg cccgtgcctc
3361 tgacttctaa gtccaattac tcttcaacat ccctacatgc tctttctccc tgtgctccca
3421 ccccctattt ttgttattat caaaaaaact tcttcttaat ttctttgttt tttagcttct
3481 tttaagtcac ctctaacaat gaaattgtgt agattcaaaa atagaattaa ttcgtaataa
3541 aaagtcgaaa aaaattgtgc tccctccccc cattaataat aattctatcc caaaatctac
3601 acaatgttct gtgtacactt cttatgtttt ttttacttct gataaatttt ttttgaaaca
3661 tcatagaaaa aaccgcacac aaaatacctt atcatatgtt acgtttcagt ttatgaccgc
3721 aatttttatt tcttcgcacg tctgggcctc tcatgacgtc aaatcatgct catcgtgaaa
3781 aagttttgga gtatttttgg aatttttcaa tcaagtgaaa gtttatgaaa ttaattttcc
3841 tgcttttgct ttttgggggt ttcccctatt gtttgtcaag agtttcgagg acggcgtttt
3901 tcttgctaaa atcacaagta ttgatgagca cgatgcaaga aagatcggaa gaaggtttgg
3961 gtttgaggct cagtggaagg tgagtagaag ttgataattt gaaagtggag tagtgtctat
4021 ggggtttttg ccttaaatga cagaatacat tcccaatata ccaaacataa ctgtttccta
4081 ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct
4141 gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac
4201 aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg
4261 catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg
4321 taaggagaaa ataccgcatc aggcggcctt aagggcctcg tgatacgcct atttttatag
4381 gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg
4441 cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga
4501 caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat
4561 ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca
4621 gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc
4681 gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca
4741 atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg
4801 caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca
4861 gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata
4921 accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag
4981 ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg
5041 gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca
5101 acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta
5161 atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct
5221 ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca
5281 gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag
5341 gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat
5401 tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt
5461 taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa
5521 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga
5581 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg
5641 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc
5701 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag
5761 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc
5821 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg
5881 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac
5941 accgaactga gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga
6001 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt
6061 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag
6121 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg
6181 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta
6241 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc
6301 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc
6361 aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc
6421 gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca
6481 ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa
6541 caatttcaca caggaaacag ct
//
LOCUS SYNLACZI 6567 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD22.11.
ACCESSION M34304
KEYWORDS lacZ.
SOURCE Cloning vector pPD22.11.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6567)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 184 185 synthetic end/E.coli trpS start
recomb 268 269 E.coli trpS end/synthetic start
recomb 272 273 synthetic end/E.coli lacZ start
recomb 3266 3267 E.coli lacZ end/synthetic start
recomb 4101 4102 synthetic end/pUC19 start
recomb 4350 4351 pUC19 end/synthetic start
recomb 3326 3327 synthetic end/unknown DNA start
recomb 4081 4082 unknown DNA end/synthetic start
recomb 4361 4362 synthetic end/pUC19 start
BASE COUNT 1587 a 1651 c 1724 g 1605 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt gcttggaggg taccgagctc agaaaaaatg actgctccaa agaagaagcg
181 taaggtaccg gtgggtgaag accagaaaca gcacctcgaa ctgagccgcg atattgccca
241 gcgtttcaac gcgctgtatg gcgagatcga tcccgtcgtt ttacaacgtc gtgactggga
301 aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg
361 taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga
421 atggcgcttt gcctggtttc cggcaccaga agcggtgccg gaaagctggc tggagtgcga
481 tcttcctgag gccgatactg tcgtcgtccc ctcaaactgg cagatgcacg gttacgatgc
541 gcccatctac accaacgtaa cctatcccat tacggtcaat ccgccgtttg ttcccacgga
601 gaatccgacg ggttgttact cgctcacatt taatgttgat gaaagctggc tacaggaagg
661 ccagacgcga attatttttg atggcgttaa ctcggcgttt catctgtggt gcaacgggcg
721 ctgggtcggt tacggccagg acagtcgttt gccgtctgaa tttgacctga gcgcattttt
781 acgcgccgga gaaaaccgcc tcgcggtgat ggtgctgcgt tggagtgacg gcagttatct
841 ggaagatcag gatatgtggc ggatgagcgg cattttccgt gacgtctcgt tgctgcataa
901 accgactaca caaatcagcg atttccatgt tgccactcgc tttaatgatg atttcagccg
961 cgctgtactg gaggctgaag ttcagatgtg cggcgagttg cgtgactacc tacgggtaac
1021 agtttcttta tggcagggtg aaacgcaggt cgccagcggc accgcgcctt tcggcggtga
1081 aattatcgat gagcgtggtg gttatgccga tcgcgtcaca ctacgtctga acgtcgaaaa
1141 cccgaaactg tggagcgccg aaatcccgaa tctctatcgt gcggtggttg aactgcacac
1201 cgccgacggc acgctgattg aagcagaagc ctgcgatgtc ggtttccgcg aggtgcggat
1261 tgaaaatggt ctgctgctgc tgaacggcaa gccgttgctg attcgaggcg ttaaccgtca
1321 cgagcatcat cctctgcatg gtcaggtcat ggatgagcag acgatggtgc aggatatcct
1381 gctgatgaag cagaacaact ttaacgccgt gcgctgttcg cattatccga accatccgct
1441 gtggtacacg ctgtgcgacc gctacggcct gtatgtggtg gatgaagcca atattgaaac
1501 ccacggcatg gtgccaatga atcgtctgac cgatgatccg cgctggctac cggcgatgag
1561 cgaacgcgta acgcgaatgg tgcagcgcga tcgtaatcac ccgagtgtga tcatctggtc
1621 gctggggaat gaatcaggcc acggcgctaa tcacgacgcg ctgtatcgct ggatcaaatc
1681 tgtcgatcct tcccgcccgg tgcagtatga aggcggcgga gccgacacca cggccaccga
1741 tattatttgc ccgatgtacg cgcgcgtgga tgaagaccag cccttcccgg ctgtgccgaa
1801 atggtccatc aaaaaatggc tttcgctacc tggagagacg cgcccgctga tcctttgcga
1861 atacgcccac gcgatgggta acagtcttgg cggtttcgct aaatactggc aggcgtttcg
1921 tcagtatccc cgtttacagg gcggcttcgt ctgggactgg gtggatcagt cgctgattaa
1981 atatgatgaa aacggcaacc cgtggtcggc ttacggcggt gattttggcg atacgccgaa
2041 cgatcgccag ttctgtatga acggtctggt ctttgccgac cgcacgccgc atccagcgct
2101 gacggaagca aaacaccagc agcagttttt ccagttccgt ttatccgggc aaaccatcga
2161 agtgaccagc gaatacctgt tccgtcatag cgataacgag ctcctgcact ggatggtggc
2221 gctggatggt aagccgctgg caagcggtga agtgcctctg gatgtcgctc cacaaggtaa
2281 acagttgatt gaactgcctg aactaccgca gccggagagc gccgggcaac tctggctcac
2341 agtacgcgta gtgcaaccga acgcgaccgc atggtcagaa gccgggcaca tcagcgcctg
2401 gcagcagtgg cgtctggcgg aaaacctcag tgtgacgctc cccgccgcgt cccacgccat
2461 cccgcatctg accaccagcg aaatggattt ttgcatcgag ctgggtaata agcgttggca
2521 atttaaccgc cagtcaggct ttctttcaca gatgtggatt ggcgataaaa aacaactgct
2581 gacgccgctg cgcgatcagt tcacccgtgc accgctggat aacgacattg gcgtaagtga
2641 agcgacccgc attgacccta acgcctgggt cgaacgctgg aaggcggcgg gccattacca
2701 ggccgaagca gcgttgttgc agtgcacggc agatacactt gctgatgcgg tgctgattac
2761 gaccgctcac gcgtggcagc atcaggggaa aaccttattt atcagccgga aaacctaccg
2821 gattgatggt agtggtcaaa tggcgattac cgttgatgtt gaagtggcga gcgatacacc
2881 gcatccggcg cggattggcc tgaactgcca gctggcgcag gtagcagagc gggtaaactg
2941 gctcggatta gggccgcaag aaaactatcc cgaccgcctt actgccgcct gttttgaccg
3001 ctgggatctg ccattgtcag acatgtatac cccgtacgtc ttcccgagcg aaaacggtct
3061 gcgctgcggg acgcgcgaat tgaattatgg cccacaccag tggcgcggcg acttccagtt
3121 caacatcagc cgctacagtc aacagcaact gatggaaacc agccatcgcc atctgctgca
3181 cgcggaagaa ggcacatggc tgaatatcga cggtttccat atggggattg gtggcgacga
3241 ctcctggagc ccgtcagtat cggcggaatt ccaactgagc gccggtcgct accattacca
3301 acttgtctgg tgtcaaaaat aataggggcc gctgtcatca gatcgccatc tcgcgcccgt
3361 gcctctgact tctaagtcca attactcttc aacatcccta catgctcttt ctccctgtgc
3421 tcccaccccc tatttttgtt attatcaaaa aaacttcttc ttaatttctt tgttttttag
3481 cttcttttaa gtcacctcta acaatgaaat tgtgtagatt caaaaataga attaattcgt
3541 aataaaaagt cgaaaaaaat tgtgctccct ccccccatta ataataattc tatcccaaaa
3601 tctacacaat gttctgtgta cacttcttat gtttttttta cttctgataa attttttttg
3661 aaacatcata gaaaaaaccg cacacaaaat accttatcat atgttacgtt tcagtttatg
3721 accgcaattt ttatttcttc gcacgtctgg gcctctcatg acgtcaaatc atgctcatcg
3781 tgaaaaagtt ttggagtatt tttggaattt ttcaatcaag tgaaagttta tgaaattaat
3841 tttcctgctt ttgctttttg ggggtttccc ctattgtttg tcaagagttt cgaggacggc
3901 gtttttcttg ctaaaatcac aagtattgat gagcacgatg caagaaagat cggaagaagg
3961 tttgggtttg aggctcagtg gaaggtgagt agaagttgat aatttgaaag tggagtagtg
4021 tctatggggt ttttgcctta aatgacagaa tacattccca atataccaaa cataactgtt
4081 tcctactagt cggccgtacg ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa
4141 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag
4201 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta
4261 tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag
4321 atgcgtaagg agaaaatacc gcatcaggcg gccttaaggg cctcgtgata cgcctatttt
4381 tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa
4441 atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca
4501 tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc
4561 aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc
4621 acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt
4681 acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt
4741 ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg
4801 ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact
4861 caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg
4921 ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga
4981 aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg
5041 aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa
5101 tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac
5161 aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc
5221 cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca
5281 ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga
5341 gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta
5401 agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc
5461 atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc
5521 cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt
5581 cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac
5641 cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct
5701 tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact
5761 tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg
5821 ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata
5881 aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga
5941 cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag
6001 ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg
6061 agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac
6121 ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca
6181 acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg
6241 cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc
6301 gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa
6361 tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt
6421 ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt
6481 aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg
6541 gataacaatt tcacacagga aacagct
//
LOCUS SYNLACZJ 7242 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD26.77.
ACCESSION M34305
KEYWORDS lacZ.
SOURCE Cloning vector pPD26.77.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 7242)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 4776 4777 synthetic end/pUC19 start
recomb 5025 5026 pUC19 end/synthetic start
recomb 3244 3245 synthetic end/unknown DNA start
recomb 4756 4757 unknown DNA end/synthetic start
recomb 5036 5037 synthetic end/pUC19 start
BASE COUNT 1780 a 1801 c 1846 g 1815 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggcgaaac aaatcatctg acaccaccac cgtctgatgg atcgttctca tctccgtctc
3301 cacattatta tccgacgact acatcgacac cgaatcgaat ggaaacaagt ccggagtaca
3361 tgtttaacca tgaaatggtg ggtagatgat tattaaaatg tttaagaaaa ttaaataatt
3421 tgttttaggc accaccggtc aatgcgatgt ggtatactac accacctcct tatcaagatc
3481 caaactatcg tcatgtgcct ccaaatactg catttcaaaa tgcagagcaa atgaatggct
3541 ccttctactg ttaatctatt taattcatta atttttcatt tattgactgt atcccggatg
3601 tttcttgtcc tcccaacata tctcctaact gctcggttca ttttaaatat gctcatctca
3661 ctacatcacc cagacactgg tccccacaga gttttttgta tactatttcg ggtcattttt
3721 cttattctag actaatattg taagctataa gttgtagaat aattattgat ccaaatcaga
3781 ttaagagtat aagctttgtt ttttctcctt ttctttataa cttgttacaa tttttgaaat
3841 tccctttttt gacaggcttt tattacactg taactgtgtt tcttatcttg caaacattta
3901 atgaattgta attctttagt atcttgaggg ctttttgttt ttcgaattat tgaagctcaa
3961 agttccagtt ttactacgat ccagcgaatt ctcctcattt cgatccgatg caattgactt
4021 cagatcaata ttggttgcct gaaagaaata attgtgagca tttttgtcaa aaaacagaga
4081 actcaccatt ctcgaggctc ccgttccagg agcagtactt ggtgatggac acgtagattg
4141 attaaaccaa accaaaggtt ctttcagagt caacttacag cctcgagcgt agtccgtgat
4201 agcttctcgc agaacactga aaattggaaa tttattggaa taaaaacttt ttctgcactt
4261 tatagaataa aaaaatcatg aatttacccg aatttaacct ccgaatcgta gaccaaattg
4321 tccaagtaga tggaaatcac cttgaacatc ggatgttttt catatgctga aaataaatta
4381 atgaatttat gtaatttttt aaataattac ttttcaattt ggtgaacaat tcctgcttct
4441 ttgcataggc atctggacga gtgagtcctt tccaatcaat caatgtggtg tcgacctcga
4501 gggggggccc ggtacccagc ttttgttccc tttagtgagg gttaattccg agcttggcgt
4561 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca
4621 taggagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagg taactcacat
4681 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt
4741 aatgaatcgg ccaacgccta ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg
4801 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta
4861 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg
4921 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg
4981 tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcggcctt aagggcctcg
5041 tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg
5101 gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa
5161 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga
5221 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc
5281 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg
5341 gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc
5401 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat
5461 tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg
5521 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag
5581 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa
5641 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc
5701 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca
5761 cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc
5821 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc
5881 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg
5941 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta
6001 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag
6061 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga
6121 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc
6181 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa
6241 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa
6301 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc
6361 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt
6421 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc
6481 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac
6541 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca
6601 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcat tgagaaagcg
6661 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag
6721 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt
6781 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat
6841 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc
6901 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt
6961 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag
7021 cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca
7081 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga
7141 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt
7201 gtggaattgt gagcggataa caatttcaca caggaaacag ct
//
LOCUS SYNLACZK 6620 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD34.110.
ACCESSION M34306
KEYWORDS lacZ.
SOURCE Cloning vector pPD34.110.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6620)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 237 238 synthetic end/E.coli trpS start
recomb 321 322 E.coli trpS end/synthetic start
recomb 325 326 synthetic end/E.coli lacZ start
recomb 3319 3320 E.coli lacZ end/synthetic start
recomb 4154 4155 synthetic end/pUC19 start
recomb 4403 4404 pUC19 end/synthetic start
recomb 3379 3380 synthetic end/unknown DNA start
recomb 4134 4135 unknown DNA end/synthetic start
recomb 4414 4415 synthetic end/pUC19 start
BASE COUNT 1592 a 1665 c 1731 g 1632 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt ggagggtacc tcgagaaagc tggcaaaggg ctcttgtcct gctaatcgta
181 ctactcttca tcgtcatctt cgttattact gttttgttcg tcataagatc taacaaggta
241 ccggtgggtg aagaccagaa acagcacctc gaactgagcc gcgatattgc ccagcgtttc
301 aacgcgctgt atggcgagat cgatcccgtc gttttacaac gtcgtgactg ggaaaaccct
361 ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc
421 gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc
481 tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct
541 gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc
601 tacaccaacg taacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg
661 acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg
721 cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc
781 ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc
841 ggagaaaacc gcctcgcggt gatggtgctg cgttggagtg acggcagtta tctggaagat
901 caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact
961 acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta
1021 ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct
1081 ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc
1141 gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa
1201 ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac
1261 ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat
1321 ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat
1381 catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg
1441 aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac
1501 acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc
1561 atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc
1621 gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg
1681 aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat
1741 ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt
1801 tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc
1861 atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc
1921 cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat
1981 ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat
2041 gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc
2101 cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa
2161 gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc
2221 agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat
2281 ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg
2341 attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc
2401 gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag
2461 tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat
2521 ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac
2581 cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg
2641 ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc
2701 cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa
2761 gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct
2821 cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat
2881 ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg
2941 gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga
3001 ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat
3061 ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc
3121 gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc
3181 agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa
3241 gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg
3301 agcccgtcag tatcggcgga attccaactg agcgccggtc gctaccatta ccaacttgtc
3361 tggtgtcaaa aataataggg gccgctgtca tcagatcgcc atctcgcgcc cgtgcctctg
3421 acttctaagt ccaattactc ttcaacatcc ctacatgctc tttctccctg tgctcccacc
3481 ccctattttt gttattatca aaaaaacttc ttcttaattt ctttgttttt tagcttcttt
3541 taagtcacct ctaacaatga aattgtgtag attcaaaaat agaattaatt cgtaataaaa
3601 agtcgaaaaa aattgtgctc cctcccccca ttaataataa ttctatccca aaatctacac
3661 aatgttctgt gtacacttct tatgtttttt ttacttctga taaatttttt ttgaaacatc
3721 atagaaaaaa ccgcacacaa aataccttat catatgttac gtttcagttt atgaccgcaa
3781 tttttatttc ttcgcacgtc tgggcctctc atgacgtcaa atcatgctca tcgtgaaaaa
3841 gttttggagt atttttggaa tttttcaatc aagtgaaagt ttatgaaatt aattttcctg
3901 cttttgcttt ttgggggttt cccctattgt ttgtcaagag tttcgaggac ggcgtttttc
3961 ttgctaaaat cacaagtatt gatgagcacg atgcaagaaa gatcggaaga aggtttgggt
4021 ttgaggctca gtggaaggtg agtagaagtt gataatttga aagtggagta gtgtctatgg
4081 ggtttttgcc ttaaatgaca gaatacattc ccaatatacc aaacataact gtttcctact
4141 agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga
4201 cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa
4261 gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca
4321 tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta
4381 aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat ttttataggt
4441 taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg
4501 cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca
4561 ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt
4621 ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga
4681 aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga
4741 actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat
4801 gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca
4861 agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt
4921 cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac
4981 catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct
5041 aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga
5101 gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac
5161 aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat
5221 agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg
5281 ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc
5341 actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc
5401 aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg
5461 gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta
5521 atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg
5581 tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga
5641 tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt
5701 ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag
5761 agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa
5821 ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag
5881 tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca
5941 gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac
6001 cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg aagggagaaa
6061 ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc
6121 agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg
6181 tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc
6241 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc
6301 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag
6361 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa
6421 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga
6481 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc
6541 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca
6601 atttcacaca ggaaacagct
//
LOCUS SYNLACZL 5808 bp ds-DNA SYN 03-JUL-1990
DEFINITION Cloning vector pPD16.01.
ACCESSION M34307
KEYWORDS lacZ.
SOURCE Cloning vector pPD16.01.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 5808)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 180 181 synthetic end/E.coli trpS start
recomb 264 265 E.coli trpS end/synthetic start
recomb 268 269 synthetic end/E.coli lacZ start
recomb 3262 3263 E.coli lacZ end/synthetic start
recomb 3342 3343 synthetic end/unknown DNA start
recomb 3591 3592 unknown DNA end/synthetic start
recomb 3602 3603 synthetic end/pUC19 start
BASE COUNT 1375 a 1502 c 1599 g 1332 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
3301 gtctggtgtc aaaaataata ggcctactag tcggccgtac gggccctttc gtctcgcgcg
3361 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg
3421 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg
3481 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat
3541 gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc ggccttaagg
3601 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt
3661 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac
3721 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa
3781 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat
3841 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc
3901 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga
3961 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg
4021 cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc
4081 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag
4141 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc
4201 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg
4261 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg
4321 acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac
4381 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac
4441 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg
4501 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg
4561 tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg
4621 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac
4681 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg
4741 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg
4801 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc
4861 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc
4921 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt
4981 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc
5041 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact
5101 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac
5161 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag
5221 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg
5281 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg
5341 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga
5401 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt
5461 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct
5521 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg
5581 aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt
5641 aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta
5701 atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta
5761 tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct
//
LOCUS HUMMHDQ3L 967 bp ds-DNA PRI 03-JUL-1990
DEFINITION Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment,
long terminal repeat region.
ACCESSION M33841
KEYWORDS major histocompatibility complex.
SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 967)
AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S.
TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ
locus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by J.S.Lee, 25-APR-1990.
FEATURES from to/span description
rpt 1 7 inverted repeat A
rpt 961 967 inverted repeat B
rpt 560 810 R region
signal 792 797 poly-A signal
site 74 79 1/2 GRE
site 80 87 enhancer core
site 530 536 TATA box
BASE COUNT 254 a 235 c 222 g 256 t
ORIGIN Chromosome 6p21.3.
1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct
121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa
181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc
301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt
361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa
421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa
481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg
541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg
601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc
661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg
721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc
781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga
841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct
901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac
961 ccctaca
//
LOCUS HUMMHDQ5L 960 bp ds-DNA PRI 03-JUL-1990
DEFINITION Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment,
long terminal repeat region.
ACCESSION M33842
KEYWORDS major histocompatibility complex.
SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 960)
AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S.
TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ
locus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by J.S.Lee, 25-APR-1990.
FEATURES from to/span description
rpt 1 7 inverted repeat A
rpt 954 960 inverted repeat B
rpt 555 803 R region
signal 785 790 poly-A signal
site 74 79 1/2 GRE
site 80 87 enhancer core
site 525 531 TATA box
BASE COUNT 250 a 232 c 219 g 259 t
ORIGIN Chromosome 6p21.3.
1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct
121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa
181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag
301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa
361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt
421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa
481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt
541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca
601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac
661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct
721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat
781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg
841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa
901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca
//
LOCUS MUSMHEBF1 573 bp ds-DNA ROD 03-JUL-1990
DEFINITION Mouses MHC class II E-beta-f gene, exon 1.
ACCESSION M35677 M34123
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 1 of 3
SOURCE Mouse inbred strain B10.M) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 573)
AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P.
TITLE Characterization of the molecular defects in the mouse E-beta-f and
E-beta-q genes: Implications for the origin of MHC polymorphism
JOURNAL J. Immunol. 144, 1957-1964 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept 301 + 394 MHC E-beta-f, exon 1
IVS 395 > 573 MHC E-beta-f intron A
BASE COUNT 135 a 151 c 138 g 149 t
ORIGIN Chromosome 17.
1 cagctgcctc tgcctcctga gtgctgggat atgaggcatg gccagcagcc cagactgtgt
61 atccatgtaa tgaagagaac tgcaagtttc agaagggaac ctgcaaactg aatctctaac
121 taggaactga tgatgctgaa cttctttgat gctgattggc tcccagcact ggccttaccc
181 aatccagtgg caaagcagtg aatgccctgt ctcttattat cttagcaatg agtaaagaga
241 ataaagttac agtctgaagc ttgccttccc ctctgactct cgtgtctcct ctcctgcagc
301 atgatgtggc tccccagagt tccctgtgtg gcagctgtga tcctgttgct gacagtgctg
361 agccctccag tggctttggt cagagactcc agacgtaaat gcacacctca ggtgctggga
421 tgctcggggt cggggaagga aggagctaac attctcactg tccagtccaa gtccctcgaa
481 actattgata tcttctgtga gcatgcacag tcctcacatg aactctaaac tatgtcccca
541 aacagacgcc tggatgtttg tgctctcaga tct
//
LOCUS MUSMHEBF2 495 bp ds-DNA ROD 03-JUL-1990
DEFINITION Mouses MHC class II E-beta-f gene, exon 2.
ACCESSION M35678 M34123
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 2 of 3
SOURCE Mouse inbred strain B10.M) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 495)
AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P.
TITLE Characterization of the molecular defects in the mouse E-beta-f and
E-beta-q genes: Implications for the origin of MHC polymorphism
JOURNAL J. Immunol. 144, 1957-1964 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept + 61 + 330 MHC E-beta-f, exon 2
IVS < 1 60 MHC E-beta-f intron A
IVS 331 > 495 MHC E-beta-f intron B
BASE COUNT 111 a 112 c 180 g 92 t
ORIGIN About 3.0 kb after segment 1; chromosome 17.
1 cagctgagag ggactcgggc atcttgtcgg cagagaagaa gataattctt gtctccacag
61 catggttttt ggaatactgt aaatctgagt gtcatttcta caacgggacg cagcgcgtgc
121 ggtttctgaa aagatacttc tacaacctgg aggagaacct gcgcttcgac agcgacgtgg
181 gcgagttccg cgcggtgacc gagctggggc ggccagacgc cgagaactgg aacagccagc
241 cggagatcct ggaggatgcg cgggccgcgg tggacacgta ctgcagatac aactatgaga
301 tcttggataa attccttgtg cggcggagag gtgagacagg acagggtggg tggggcggaa
361 ccacggtgag ggtggggctg tggggagcag cagaaggcgg tgcgcatgtg cgcaggagcc
421 gcagggaatg ctgggttccc tgcagctgga gccacaggcg cttttaagca gcctcttggc
481 aggggaacgg aattc
//
LOCUS MUSMHEBF3 2155 bp ds-DNA ROD 03-JUL-1990
DEFINITION Mouses MHC class II E-beta-f gene, exons 3,4,5 and 6.
ACCESSION M35679 M34123
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 3 of 3
SOURCE Mouse inbred strain B10.M) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2155)
AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P.
TITLE Characterization of the molecular defects in the mouse E-beta-f and
E-beta-q genes: Implications for the origin of MHC polymorphism
JOURNAL J. Immunol. 144, 1957-1964 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept + 122 403 MHC E-beta-f, exon 3
971 1081 MHC E-beta-f, exon 4
1484 1507 MHC E-beta-f, exon 5
1802 1815 MHC E-beta-f, exon 6
IVS < 1 121 MHC E-beta-f intron B
IVS 404 970 MHC E-beta-f intron C
IVS 1082 1483 MHC E-beta-f intron D
IVS 1508 1801 MHC E-beta-f intron E
BASE COUNT 459 a 586 c 569 g 541 t
ORIGIN About 3.9 kb after segment 2; chromosome 17.
1 gatccattct ggatggatag atggaggtag gcaggcaggc aggcaggcag gcaggcatgc
61 agacagccta caaggaggac agctccaccc tcatggctcc ttctcacctc tctttctcta
121 gttgagccta cggtgactgt gtaccccaca aagacgcagc ccctggaaca ccacaacctc
181 ctggtctgct ctgtgagtga cttctaccct ggcaacattg aagtcagatg gttccggaat
241 ggcaaggagg agaaaacagg aattgtgtcc acgggcctgg tccgaaatgg agactggacc
301 ttccagacac tggtgatgct ggagacggtt cctcagagtg gagaggttta cacctgccag
361 gtggagtatc ccagcctgac cgaccctgtc acggtcgagt ggagtgagtg gtaacttcca
421 gactctgtga atgcccgccc gggtgggtgt ggtttatccc tgcctgtcag ctttctccac
481 ccacacactc tttccactgg ctttgtgctg tcctgccttt caccatggct tacagtgtag
541 gtgcgtgaag cttctacaag cacagttgcc ccctgggaag cagttatgcc cccatagact
601 catctgagcc tgccagtgac ataacaggtc ctggaatctt cttggcccct gctgcagtct
661 ctgccgttgc tgggttgtgt tcctcctgct gctgctgctg ctgacgatgg acaaggagca
721 gtgcagggtc atgactgaac tcagggacat atagtcatag ctctgccttt gctacccctc
781 agagctcagc agcttcctgt cagctcggct caggcctgtt tggttggttt ctcaacatga
841 ccaggaatgt tgacagccag atcttctaga acacacttct tccttgggct caaagctccg
901 agtctcaggg gtccggagtg gaaatgggat ttgggctaaa accctccaaa cctttggctt
961 cctttctcag aagcacagtc cacatctgca cagaacaaga tgttgagtgg agttgggggc
1021 ttcgtgctgg gcctcctctt cctcggagcg gggctgttca tctacttcag gaaccagaaa
1081 ggtaaggagc ctggtgggag ccccaactcc atagcatttc agggaaaagc catggctttg
1141 ttctcaggat gccattggcc ctgtgacctc aggtttcatt ggattctgaa tgcaacagtc
1201 tgtggttact tgatttgacc ctgaggaggg ataacacatg ggagagttaa gttgattctg
1261 gcttgagacc tgaggacaga ggaaggctgg ggggagccat gggcactgcc ggtgactgaa
1321 gctccctaag cccctccctc tgtccatgct cctcttggtt ctgtgtgctc tgggcagtat
1381 taccagagga atctcaggtg gcagctcaga gtctggggac atgtgtctgg ggacagatct
1441 gccttcatgc atgtaagcat ctattttatt ctctcttttc taggacagtc tggacttcag
1501 ccaacaggta acacccattg tcttctctca gagacagatc tgctttccct acagtatggg
1561 ggctggggtg atggactcag ggcacaaaat ggggaagact gagatcccag ggttggccag
1621 gcagttagca ctgagccttg ctccctgcac ttactgaagc ctgtgctctg aagcagcaat
1681 gactcggggc atgagaagtt cctctctgct cactgccatg ctgtaaggag aggcctgaag
1741 cagtcagaga agccactgca gagtgaggtc tggaaacagc cctgtcccct gtgctctaca
1801 ggactcctga gctgagatga agtaacaagg ctgaaggaag gagttccccc ccgtgtctcc
1861 atgccatgaa aacatgtcct gcttggccca catccctcca gagacactgc tcttccagga
1921 cctggctcct cctgattctc caccctggag atctgtgctc ctgatggctg cttatccctg
1981 acccaggcct tgcagctccc agaacagagg ccccactctt cacatctcct gtcccctttt
2041 gtcccttgcc ttttgtctgg cacttctgag ccagtctgct gtcatatgct tttttacatt
2101 tttctcaaat aaacaaataa tgaaagtcat ctgcttcata gagtttcaag cagaa
//
LOCUS MUSMHEBQ1 574 bp ds-DNA ROD 03-JUL-1990
DEFINITION Mouses MHC class II E-beta-q gene, exon 1.
ACCESSION M35680 M34124
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 1 of 3
SOURCE Mouse inbred strain B10.M) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 574)
AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P.
TITLE Characterization of the molecular defects in the mouse E-beta-f and
E-beta-q genes: Implications for the origin of MHC polymorphism
JOURNAL J. Immunol. 144, 1957-1964 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept 301 + 394 MHC E-beta-q, exon 1
IVS 395 > 574 MHC E-beta-q intron A
BASE COUNT 132 a 152 c 143 g 147 t
ORIGIN Chromosome 17.
1 cagctgcctc tgcctcctga gtgctgggat atgaggcatg gccagcagcc cagactgagt
61 atccatgtaa tgaagagaac tgcaagtttc agaaggggac ctgcaaactg aatctctaac
121 tagcaactga tgatgctgga ctcctttgat gctgattggc tcccagcact ggccttaccc
181 aatccagtgg caaagcagtg aatgccctgt ctcttattat cttagcaatg agtaaagaga
241 ataaagttac agtctgaagc ttgccttccc ctctgactcc tgtgtctcct ctcctgcagc
301 atggtgtggc tccccagagt tccctgtgtg gcagctgtga tcctgttgct gacagtgctg
361 agccctccag tggctttggt cagagactcc agacgttaag tgcacacctc aggtgctggg
421 atgctcgggg tcggggaagg aaggagctaa cattctcact gtccaggcca agtccctcgg
481 aactattgat atcttctgtg agcatgcaca gtcctcacat gaactctaaa ctatgtcccc
541 aaacagaagc ctggatgttt gtgctctcag atct
//
LOCUS MUSMHEBQ2 495 bp ds-DNA ROD 03-JUL-1990
DEFINITION Mouses MHC class II E-beta-q gene, exon 2.
ACCESSION M35681 M34124
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 2 of 3
SOURCE Mouse inbred strain B10.G) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 495)
AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P.
TITLE Characterization of the molecular defects in the mouse E-beta-f and
E-beta-q genes: Implications for the origin of MHC polymorphism
JOURNAL J. Immunol. 144, 1957-1964 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept + 61 + 330 MHC E-beta-q, exon 2
IVS < 1 60 MHC E-beta-q intron A
IVS 331 > 495 MHC E-beta-q intron B
BASE COUNT 113 a 116 c 176 g 90 t
ORIGIN About 3.0 kb after segment 1; chromosome 17.
1 cagctgagag ggactcgggc atcttgtcgg cagagaagaa gataattctt gtctccacag
61 catggttttt ggaatactgt aaatctgagt gtcatttcta caacgggacg cagcgcgtgc
121 ggtttctgaa aagatacttc tacaacctgg aggagaacct gcgcttcgac agcgacgtgg
181 gcgagttccg cgcggtgacc gagctggggc ggccagacgc cgagaactgg aacagccagc
241 cggagatcct ggagcaaaag cgggccgcgg tggacacgta ctgcagacac aactatgaga
301 tcttcgataa cttccttgtg cggcggagag gtgagacagg acagggtggc tggggcggaa
361 ccacggtgag ggtggggctg tggggagcag cagaaggcgg tgcgcatgtg cgcaggagcc
421 gcagggaatg ctgggttccc tgcagctgga gccacaggcg cttttaagca gcctcttggc
481 aggggaacgg aattc
//
LOCUS MUSMHEBQ3 2159 bp ds-DNA ROD 03-JUL-1990
DEFINITION Mouses MHC class II E-beta-q gene, exons 3,4,5 and 6.
ACCESSION M35682 M34124
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 3 of 3
SOURCE Mouse inbred strain B10.G) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2159)
AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P.
TITLE Characterization of the molecular defects in the mouse E-beta-f and
E-beta-q genes: Implications for the origin of MHC polymorphism
JOURNAL J. Immunol. 144, 1957-1964 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept + 126 407 MHC E-beta-q, exon 3
975 1085 MHC E-beta-q, exon 4
1488 1511 MHC E-beta-q, exon 5
1806 1819 MHC E-beta-q, exon 6
IVS < 1 125 MHC E-beta-q intron B
IVS 408 974 MHC E-beta-q intron C
IVS 1086 1487 MHC E-beta-q intron D
IVS 1512 1805 MHC E-beta-q intron E
BASE COUNT 459 a 589 c 572 g 539 t
ORIGIN About 3.9 kb after segment 2; chromosome 17.
1 gatccattct ggatggatag atggaggtag gcaggcaggc aggcaggcag gcaggcaggc
61 atgcagacag cctacaagga ggacagctcc accctcatgg ctccttctca cctctctttc
121 tctagttgag cctacggtga ctgtgtaccc cacaaagacg cagcccctgg aacaccacaa
181 cctcctggtc tgctctgtga gtgacttcta ccctggcaac attgaagtca gatggttccg
241 gaatggcaag gaggagaaaa caggaattgt gtccacgggc ctggtccgaa atggagactg
301 gaccttccag acactggtga tgctggagac ggttcctcag agtggagagg tttacacctg
361 ccaggtggag catcccagcc tgaccgaccc tgtcacggtc gagtggagtg agtggtaact
421 tccagactct gtgaatgccc gcccgggtgg gtgtggttta tccccgcctg tcagctttct
481 ccacccacac actctttcca ctggctttgt gctgtcctgc ctttcaccat ggcttacagg
541 gtaggtgcgt gaagcttcta caagcacagt tgccccctgg gaagcagtta tgcccccata
601 gactcatctg agcctgccag tgacataaca ggtcctggaa tcttcttggc ccctgctgca
661 gtctctgccg ttgctgggtt gtgttcctcc tgctgctgct gctgctgacg atggacaagg
721 agcagtgcag ggtcatgact gaactcaggg acatatagtc atagctctgc ctttgctacc
781 cctcagagct cagcagcttc ctgtcagctc ggctcaggcc tgtttggttg gtttctcaac
841 atgaccagga atgttgactg ccagatcttc tagaacacac ttcttccttg ggctcaaagc
901 tccgagtctc aggggtccgg agtggaaatg ggatttgggc taaaaccctc caaacctttg
961 gcttcctttc tcagaagcac agtccacatc tgcacagaac aagatgttga gtggagttgg
1021 gggcttcgtg ctgggcctcc tcttcctcgg agcggggctg ttcatctact tcaggaacca
1081 gaaaggtaag gagcctggtg ggagccccaa ctccatagca tttcagggaa aagccatggc
1141 tttgttctca ggatgccatt ggccctgtga cctcaggttt cattggattc tgaatgcaac
1201 agtctgtggt tacttgattt gaccctgagg agggataaca catgggagag ttaagttgat
1261 tctggcttga gacctgagga cagaggaagg ctggggggag ccatgggcac tgccggtgac
1321 tgaagctccc taagcccctc cctctgtcca tgctcctctt ggttctgtgt gctctgggca
1381 gtattaccag aggaatctca ggtggcagct cagagtctgg ggacatgtgt ctggggacag
1441 atctgccttc atgcatgtaa gcatctattt tattctctct tttctaggac agtctggact
1501 tcagccaaca ggtaacaccc attgtcttct ctcagagaca gatctgcttt ccctacagta
1561 tgggggctgg ggtgatggac tcagggcaca aaatggggaa gactgagatc ccagggttgg
1621 ccaggcagtt agcactgagc cttgctccct gcacttactg aagcctgtgc tctgaagcag
1681 caatgactcg gggcatgaga agttcctctc tgctcactgc catgctgtaa ggagaggcct
1741 gaagcagtca gagaagccac tgcagagtga ggtctggaaa cagccctgtc ccctgtgctc
1801 tacaggactc ctgagctgag atgaagtaac aaggctgaag gaaggagttc ccccccgtgt
1861 ctccatgcca tgaaaacatg tcctgcttgg cccacatccc tccagagaca ctgctcttcc
1921 aggacctggc tcctcctgat tctccaccct ggagatctgt gctcctgatg gctgcttatc
1981 cctgacccag gccttgcagc tcccagaaca gaggccccac tcttcacatc tcctgtcccc
2041 ttttgtccct tgccttttgt ctggcacttc tgagccagtc tgctgtcata tgctttttta
2101 catttttctc aaataaacaa ataatgaaag tcatctgctt catagagttt caagcagaa
//
LOCUS RATHPA1 3282 bp ds-DNA ROD 03-JUL-1990
DEFINITION Rat haptoglobin (Hp) gene, exons 1,2 and 3.
ACCESSION M34230
KEYWORDS haptoglobin.
SEGMENT 1 of 3
SOURCE Rat (strain Wistar) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3282)
AUTHORS Marinkovic,S. and Baumann,H.
TITLE Structure, hormonal regulation, and identification of the
interleukin-6- and dexamethasone-responsive element of the rat
haptoglobin gene
JOURNAL Mol. Cell. Biol. 10, 1573-1583 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1065 1069 haptoglobin (Hp), exon 1
2797 2879 haptoglobin, exon 2
3125 + 3226 haptoglobin, exon 3
pre-msg 1032 > 3282 Hp mRNA and intron
IVS 1070 2796 Hp intron A
IVS 2880 3124 Hp intron B
IVS 3227 > 3282 Hp intron C
BASE COUNT 858 a 743 c 830 g 831 t 20 others
ORIGIN
1 ctaatttaaa aacgtttttt aaaacgagtg aagccattta ggttgagcgg ctacattagc
61 gtgaacagca ttccagaaca ggtgtcgggc tgaaacattg gttttctcct gggctgcgaa
121 cacagcgagt ctgccattga ggactctgtc tctacactag catgtggtgt ggctttccgc
181 taacaacaat cagaggagac acagcaggct catttcactg atttcaaatc ggaagacttt
241 tagcaacagg aagatgtcct catgggtcgg gaagcaactg tgaaacggaa ccgatttctt
301 tttactgttc tgtgggcgag actgcaggaa tttctacact ggatttaagt gattccgaga
361 taagtccaga gaagggagcc agtacaaggg tcccatgtca gtctacctat agagctttag
421 tcactctgag attgaagagc agtggaccaa gacccaataa ctcagtctgc tgcctgcaaa
481 ttccagagct ctccacaccc aggagatggt catgcttggg caggagagtt gaaaaaagaa
541 aagacttctt ttatagtctg agttaagggc tgggtcacaa gggtgtttaa aaaaaaaaaa
601 aaagagggct ggggatttag ctcagtggta gagcgcttac ctaggaagca caaggcctgg
661 gttcggtccc agctcgaaaa aaagaccaaa aaaaaaaaaa aaaaaaaaaa aaaaagagag
721 gtctcgtccc tctcccagtt aagtatcaga ttaacagccc ctattccccg tcccactctc
781 tggggttatc acactgcggt gggtgggagg ggtcgtgaag ttgctagatt tcttcatgat
841 ttgtaaaata acaccacgag gagagccaag tatgaagcaa gagctcagct cttgaaaagg
901 ggtttgcttt gtggttactg gaacagtcac tgaccttagc aaggccgaca ttgtgcaaac
961 acagaaatgg aagaaaagga ggtggggtga aaccgaagca taaaaagggt gagcaggagt
1021 cagcacagcg cacgccttct ggaaagaggt gagagaggcc cacgatgagg tgagtccaca
1081 gtccacactt ttgggcacac aatgcagatg tctctgggag agtgagaaaa tgggatgcag
1141 gaacagggcc gatgggcacc gttctgtggg agttaagccc gcagcctgca ggcgcatatg
1201 gcgagggata gagctgtgga tgcattgcaa cacactgtaa acttacctga agcgttgtga
1261 gacttttttt tttttttttg gtcttttttt tcggagctgg ggaccgaacc caggccttgc
1321 tgccttccta ggcaaagtcc gctctaccac tgagctaaat tccccaaccc cgcgttgtga
1381 gacttttgtt ttataacttg actatgcagt ttgagtgtga attttgttgg gtgaagacct
1441 caggctgaaa tgtcaaaggc aggaagtgaa gggaccagtg acaaagcccc ttcctccctg
1501 tgtccatgag agatgggcag gacagacagg gctttctatc tctaaggagg atctttccca
1561 gtgagatgaa aggttttgtt ttttaccagg catgcagcag cttcctggga tgctggctgt
1621 gctgttaaca gacttcctgc ttttaaagga acaaagacaa tagtcacaca gtctagtggc
1681 accatcaagg catccccctt cctttttaaa atcaaaatat aaagactttg aaggttacaa
1741 aaagactaga agcatagtgt ccaaaaggaa ttcctaactg gccagaatct acagggaatt
1801 ggttaccgtt taagtgtggt ctgtgtacca atggtggcca caagtcatgc tgagaggaag
1861 ccagttttct ccaggtaact tctggtttga tacacaatcc ctttttttaa aattatttat
1921 ttatttattt gtttgtttgt ttctgtgagt acactgtcgc tgtctttaga cacaccagaa
1981 gagggcatcg gatctcatta cagaggttgt gagccaccat gtgttgctgg gaattgaact
2041 caggacctct ggaagagcag tcagtcgtct taaccgctgg gaattgaact caggacctct
2101 ggaagagcag tcagtgctct taaccgctga gccctctctc cagccctgat atataacctt
2161 aagaccaaat acttatgaag taataggagc aagcacatgt gagttatata catatgtata
2221 tatttgggtc atagtgcaca cccagggatt ctagagctga ggcagggtga agtctgggag
2281 ttcaggagtt gtgacagcta gaaagatgga ctgtgtctnn nnnnnnnnnn nnnnnnnnta
2341 accttttcat tttggaattc caaaaagaga agagccaaat aaattagagc catcatcttt
2401 aagttagcta cgatgtccta acaatgtctt catagctgga acttaatgat gcgtgcagag
2461 gcttcccctt gctgacgttg tggtcaccac cagaggcaga ggcagaggca gaggctcact
2521 ttgctctgtg cctcctcccc agttggttct tgttccacct cccactctcg ggcgggagac
2581 aggcacttgt tatgtagcac tacgtaaagc cccgatcctc ctgcctcaga gtggagagct
2641 ggggtagcac atatgcttcc acactggtgc tgctttcctt cgggtcatgg tgctcccttt
2701 ctaagcttct acaaaattcc ccagtgacac cttgcttgcg tgtaatgcac aaatgcaaga
2761 agaccaactc tactccttct tgccacttct ctacagagcc ctgggagctg tcgtcactct
2821 cctgctctgg ggtcagcttt ttgctgtgga attgggcaat gatgccacag acattgaagg
2881 tgagtctcag gggtttccca ggagctgtgc accccagcag gctgtggccc tgtctgacca
2941 catcagtccc gcactgtatt aaggaagacc cagacctcct ctcgcctaga ccctcggggc
3001 ctcccggcct cagcttccac tcggtgcaag ggagtctggt gttcagggca gctccgtctc
3061 ttctggcttt gcacggggag catctgatca ccacagccct ttcctcgctt ctttctcttg
3121 gcagatgaca gctgcccaaa gcccccagag attgcaaacg gctatgtgga acacttggtt
3181 cgttatcgct gccgacagtt ctacaaacta cagaccgaag gagatggtaa ggctgtttga
3241 gcgggtaggg ctaggctgtc acaccagaac ttaagtgctg ct
//
LOCUS RATHPA2 482 bp ds-DNA ROD 03-JUL-1990
DEFINITION Rat haptoglobin (Hp) gene, exon 4.
ACCESSION M34231
KEYWORDS haptoglobin.
SEGMENT 2 of 3
SOURCE Rat (strain Wistar) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 482)
AUTHORS Marinkovic,S. and Baumann,H.
TITLE Structure, hormonal regulation, and identification of the
interleukin-6- and dexamethasone-responsive element of the rat
haptoglobin gene
JOURNAL Mol. Cell. Biol. 10, 1573-1583 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 296 + 370 haptoglobin (Hp), exon 4
IVS < 1 295 Hp intron C
IVS 371 > 482 Hp intron D
BASE COUNT 118 a 108 c 120 g 136 t
ORIGIN
1 ttaacccgtg agccgtctcc agtccaggga gtgtagtcta tctacgactt tgtacagcct
61 acattcctga caatttctaa gagcttcatt gtgtctttaa agctcccgtg gttgtcatag
121 cctccttttg ggagagacac tctttaattc cattttttca atgaggaaac tgaggacgga
181 gatgccaagg tagcttgtga ggggaagagt cttgatctga actctgacct cttcctgtcc
241 aactctttca tcaggccaca ttcattttct ctgagctcac ctccttttgt ttcaggaatc
301 tacaccttaa acagtgagaa gcaatgggtg aacccagctg ctggcgataa actccccaag
361 tgtgaggcag gtgggtgttg aggtcttaaa gcatggggct aaaatggggc catgtttctc
421 ttgtgtgcct gagtgagtaa gacagggtca gagagacacg ctgcaaagga ggacaatgac
481 ta
//
LOCUS RATHPA3 1245 bp ds-DNA ROD 03-JUL-1990
DEFINITION Rat haptoglobin (Hp) gene, exon 5.
ACCESSION M34232
KEYWORDS haptoglobin.
SEGMENT 3 of 3
SOURCE Rat (strain Wistar) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1245)
AUTHORS Marinkovic,S. and Baumann,H.
TITLE Structure, hormonal regulation, and identification of the
interleukin-6- and dexamethasone-responsive element of the rat
haptoglobin gene
JOURNAL Mol. Cell. Biol. 10, 1573-1583 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 329 1107 haptoglobin (Hp), exon 5
IVS < 1 328 Hp intron D
BASE COUNT 309 a 297 c 342 g 297 t
ORIGIN
1 ctgcagaggc tctggaagaa tcagccacca ctgcttgcga aaccaacagt acaggaacac
61 tgcccttgcc acctgctccg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg
121 tgtgtgtgtg tgtgtacgtg tgtataaata tatatatgta tatacctaca tacatatgta
181 atcgtcatca cacatacaca ttccacaatc ctcttgaaag tcaatgacag acctgaaagc
241 tgtgtacatt tcattcttag acaaagttgc cctgcagggg cctggtgtga actgctgctc
301 acatcggtct ctcctcctcc ctccgcagtg tgtgggaagc ccaagcatcc tgtggaccag
361 gtacagcgca tcatcggtgg ttccatggac gccaaaggca gctttccttg gcaggccaag
421 atgatctcca gacatggact caccactggg gccacactga tcagtgacca gtggctgctg
481 accactgccc aaaacctctt cctgaatcac agtgagaatg cgacagccaa ggacattgcc
541 cctaccttaa cactctatgt ggggaaaaac cagctggtgg agattgagaa ggtagttctc
601 caccccgagc gctctgtggt ggatatcggg ctgatcaagc tcaaacagaa agtgcttgtc
661 actgagaaag tcatgcctat ctgcctgcct tccaaagact acgtagcgcc aggccgcatg
721 ctatgtgtcc ggttgggggc gcggaatgtc aactttagat ttactgaacg tctcaagtat
781 gtcatgctgc ctgtggctga ccaggagaag tgtgagctgc actatgagaa aagcacagtg
841 cctgagaaga aaggcgctgt aactcctgtt ggggtacagc ccatcttgaa taagcatacc
901 ttctgtgctg gccttaccaa gtatgaggaa gacacttgct atggtgacgc tggcagtgcc
961 tttgccgtcc atgacacgga ggaggacacc tggtatgcag ctgggatcct gagctttgac
1021 aagagttgtg ccgtagctga gtatggtgtg tatgtgaagg caactgatct gaaggactgg
1081 gtccaggaaa caatggccaa gaactagttc agggctgact agagggctgc acacagtggg
1141 gcagggcaat tcaccctgga agaggaagta gaagggttgg ggacataatc tgagggctgc
1201 tagccctgca ttgctcagtc aataataaaa aacgagcttt ggacc
//
LOCUS MUSTCAXL 331 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma LD1.
ACCESSION M34194
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 331)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 331 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 71 a 86 c 89 g 85 t
ORIGIN
1 cagtcagtga cgcagcccga tgctcgtgtc actgtctctg aaggagcctc tctgcagctg
61 agatgcaagt attcctcctc tgtgacacct tatctgttct ggtatgtcct gtacccgcgg
121 caggggctgc agctgctcct caagtactat tccggagacc cagtggttca aggagtgaat
181 ggctttgagg ctgagttcag caagagtaac tcttccttcc acctgcggaa agcctccgtg
241 cactggagcg actcggctgt gtacttctgt gctgtgagca tggatggaaa tgagaaaata
301 acttttgggg ctggaaccaa actcaccatt a
//
LOCUS MUSTCAXM 334 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma LD3.
ACCESSION M34196
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 334)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 334 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 97 a 84 c 75 g 78 t
ORIGIN
1 cagcaggtga gacaaagtcc ccaatctctg acagtctggg aaggagagac agcaattctg
61 aactgcagtt atgaggacag cacttttgac tacttcccat ggtaccgtct gttccctggg
121 gaaagccctg cactcctgat agccatacgt ccagtgtcca ataaaaagga agatggacga
181 ttcacaatct tcttcaataa aagggagaaa aagctctcct tgcacatcac agactctcag
241 cctggagact cagctaccta cttctgtgca gcaagaagta caggctttgc aagtgcgctg
301 acatttggat ctggcacaaa agtcattgtt ctac
//
LOCUS MUSTCAXN 327 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-14.
ACCESSION M34198
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 327)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 327 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 92 a 80 c 85 g 70 t
ORIGIN
1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata
61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga
121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg
181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa
241 gagtcagact cggctgtgta ctactgtgct ctgagtgatc agcgggggaa gcttatcttt
301 ggacagggaa ccaagttatc tatcaag
//
LOCUS MUSTCAXO 324 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-27.
ACCESSION M34200
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 324 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 92 a 80 c 83 g 69 t
ORIGIN
1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata
61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga
121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg
181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa
241 gagtcagact cggctgtgta ctactgtgct ctgaggagca actatcagtt gatctggggc
301 tctgggacca agctaattat aaag
//
LOCUS MUSTCAXP 297 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-6.
ACCESSION M34202
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 297)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 297 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 85 a 73 c 74 g 65 t
ORIGIN
1 gtgacattat ctgaaggaac ttctctgact gtgaactgtt cctatgaaac caaacagtac
61 ccaaccctgt tctggtatgt gcagtatccc ggagaaggtc cacagctcct ctttaaagtc
121 ccaaaggcca acgagaaggg aagcagcaga gggtttgaag ccacatacaa taaagaagcc
181 acctccttcc acttgcagaa agcctcagtg caagagtcag actcggctgt gtactactgt
241 gctctgagtg atcgggggac caatacaggc aaattaacct ttggggatgg gaccgtg
//
LOCUS MUSTCAXQ 193 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, T-cell clone V2.1.
ACCESSION M34204
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 193)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 193 T-cell receptor alpha-chain (AA at 2)
BASE COUNT 56 a 45 c 49 g 43 t
ORIGIN
1 cctctttaaa gtcccaaagg ccaacgagaa gggaagcagc agagggtttg aagccacata
61 caataaagaa gccacctcct tccacttgca gaaagcctca gtgcaagagt cagactcggc
121 tgtgtactac tgtgctctga gtggaggcaa taataagctg acttttggtc aaggaaccgt
181 tctgagtgtt ctg
//
LOCUS MUSTCAXR 333 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-33.
ACCESSION M34206
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 333)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 333 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 84 a 96 c 78 g 74 t 1 others
ORIGIN
1 gactccgtga cccagacaga aggcctggtc actgtcaccg aggggttgcc tgtgaagctg
61 aactgcacct atcagactac ttatttaact attgcctttt tctggtatgt gcaatatctc
121 aacgaagccc ctcaggtact cctgcggagc tccacagaca acaagaggac cgagcaccaa
181 gggttccacg ccactctcna taagagcagc agctccttcc atctgcagaa gtcctcagcg
241 cagctgtcag actctgccct gtactactgt gctctgagga atacaggagg tgcagataga
301 ctcacctttg ggaaaggaac tcagctgatc atc
//
LOCUS MUSTCAXS 339 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-7.
ACCESSION M34208
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 339)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 339 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 86 a 103 c 75 g 75 t
ORIGIN
1 gactcagtga cccagacaga aggcctggtc actctcaccg aggggttgcc tgtgatgctg
61 aactgcacct atcagactgc ttactcaact ttccttttct ggtatgtgca acatctcaat
121 gaagccccta aactactcct gaagagctcc acagacaaca agaggaccga gcaccaaggg
181 ttccacgcca ctctccataa gagcagcagc tccttccatc tgcagaagtc ctcagcgcag
241 ctgtcagact ctgccctgta ctactgtgct ctgagtgata agactggagc taacactgga
301 aagctcacgt ttggacacgg caccatcctt agggtccat
//
LOCUS MUSTCAXT 342 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma P1F12C4.
ACCESSION M34210
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 342)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 342 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 80 a 102 c 79 g 81 t
ORIGIN
1 gactccgtga cccagacaga aggcctggtc actctcaacg aggggttgcc tgtgatgctg
61 aactgcacct atcagactat ttactcaaat gctttccttt tctggtatgt gcactatctc
121 aatgaatccc cttggctact cctgcggagc tccacagaca acaagaggac cgagcaccaa
181 gggttccacg ccactctcca taagagcagc agctccttcc atctgcagaa gtcctcagcg
241 cagctgtcag actctgccct gtactactgt gctttgagtg agaggtctgg agctaacact
301 ggaaagctca cgtttggaca cggcaccatc cttagggtcc at
//
LOCUS MUSTCAXU 324 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma P1D3A6.
ACCESSION M34212
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 324 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 82 a 80 c 81 g 81 t
ORIGIN
1 cagcaagtgc agcagagccc cgcgtccttg gttctgcagg agggggagaa tgcagagctg
61 cagtgtaact tttccacatc tttgaacagt atgcagtggt tttaccaacg tcctgaggga
121 agtctcgtca gcctgttcta caatccttct gggacaaagc agagtgggag actgacatcc
181 acaacagtca tcaaagaacg tcgcagctct ttgcacattt cctcctccca gatcacagac
241 tcaggcactt atctctgtgc tatggaggct actggaggca ataataagct gacttttggt
301 caaggaaccg ttctgagtgt tata
//
LOCUS MUSTCAXV 210 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma 1E1O.
ACCESSION M34214
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 210)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 210 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 54 a 53 c 50 g 53 t
ORIGIN
1 gggggaagtc tcgtcagcct gttctacaat ccttctggga caaagcagag tgggagactg
61 acatccacta cagtcatcaa agaacgtcgc agctctttgc acatttcctc ctcccagaca
121 acagactcag gcacttatct ctgtgctatg gcggctactg gaggcaataa taagctgact
181 tttggtcaag gaaccgttct gagtgttata
//
LOCUS MUSTCAXW 234 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma 7/6AH1.
ACCESSION M34216
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 234)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 234 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 55 a 59 c 58 g 62 t
ORIGIN
1 atgcagtggt tttatcaacg tcctggggga agtctcgtca gcctgttcta caatccttct
61 gggacaaagc agagtgggag actgacatcc actacagtca tcaaagaacg tcgcagctct
121 ttgcacattt cctcctccca gacaacagac tcaggcactt atctctgtgc tatgggtgta
181 tctggtagct tcaataagtt gacctttgga gcagggacca gactggctgt gtgc
//
LOCUS MUSTCAXX 312 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma 2B11.
ACCESSION M34218
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 312 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 89 a 79 c 79 g 64 t 1 others
ORIGIN
1 aatccgtggg ccctgagngt ccacgagggt gaaagtgtca cggtgaattg tagttacaag
61 acatccataa ctgccctaca gtggtacaga cagaagtcag gcgaaggccc tgcccagcta
121 atcttaatac gttcaaatga gagagagaag cgcaatggaa gactcagagc cacccttgac
181 acctccagcc agagcagctc cttgtccatc actgctactc ggtgtgaaga caccgctgtg
241 tacttctgtg ctactgagac aggcaatact agaaaacaca tctttgggct ggggacaact
301 ttgcaagtgc aa
//
LOCUS MUSTCBYAO 153 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma LD1.
ACCESSION M34195
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 153)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 153 T-cell receptor beta-chain (AA at 1)
recomb 122 123 V-region end/J-region start
BASE COUNT 39 a 37 c 36 g 41 t
ORIGIN
1 caaataggag atgtccctga tgggtacaag gccaccagaa caacgcaaga agacttcttc
61 ctcctgctgg aattggcttc tccctctcag acatctttgt acttctgtgc cagcagtgta
121 ggttctggaa atacgctcta ttttggagaa gga
//
LOCUS MUSTCBYAP 111 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma LD3.
ACCESSION M34197
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 111)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 111 T-cell receptor beta-chain (AA at 1)
recomb 73 74 V-region end/J-region start
BASE COUNT 18 a 37 c 25 g 31 t
ORIGIN
1 ttcctcctgc tggaattggc ttctccctct cagacatctt tgtacttctg tgccgcgtcc
61 ccgacaggga acaccgacta caccttcggc tcagggacca ggcttttggt a
//
LOCUS MUSTCBYAQ 321 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-14.
ACCESSION M34199
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 321 T-cell receptor beta-chain (AA at 1)
recomb 292 293 V-region end/J-region start
BASE COUNT 106 a 81 c 61 g 73 t
ORIGIN
1 gacccgaaaa ttatccagaa accaaaatat ctggtggcag tcacagggag cgaaaaaatc
61 ctgatatgcg aacagtatct aggccacaat gctatgtatt ggtatagaca aagtgctaag
121 aagcctctag agttcatgtt ttcctacagc tatcaaaaac ttatggacaa tcagactgcc
181 tcaagtcgct tccaacctca aagttcaaag aaaaaccatt tagaccttca gatcacagct
241 ctaaagcctg atgactcggc cacatacttc tgtgccagca gccccaagac acgtcaaaac
301 accttgtact ttggtgcggg c
//
LOCUS MUSTCBYAR 210 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-27.
ACCESSION M34201
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 210)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 210 T-cell receptor beta-chain (AA at 1)
recomb 167 168 V-region end/J-region start
BASE COUNT 57 a 56 c 43 g 54 t
ORIGIN
1 aagattatgt ttagctacaa taataagcaa ctcattgtaa acgaaacagt tccaaggcgc
61 ttctcacctc agtcttcaga taaagctcat ttgaatcttc gaatcaagtc tgtagagccg
121 gaggactctg ctgtgtatct ctgtgccagc agctatcgga caccccccta tgctgagcag
181 ttcttcggac cagggacacg actcaccgtc
//
LOCUS MUSTCBYAS 255 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-6.
ACCESSION M34203
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 255)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 255 T-cell receptor beta-chain (AA at 1)
recomb 219 220 V-region end/J-region start
BASE COUNT 57 a 68 c 77 g 53 t
ORIGIN
1 cagtatccct ggatgagctg gtatcagcag gatctccaaa agcaactaca gtggctgttc
61 actctgcgga gtcctgggga caaagaggtc aaatctcttc ccggtgctga ttacctggcc
121 acacgggtca ctgatacgga gctgaggctg caagtggcca acatgagcca gggcagaacc
181 ttgtactgca cctgcagtgc ggggactggg ggggctacta acaccttgta ctttggtgcg
241 ggcacccgac tatcg
//
LOCUS MUSTCBYAT 213 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, T-cell clone V2.1.
ACCESSION M34205
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 213)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 213 T-cell receptor beta-chain (AA at 1)
recomb 171 172 V-region end/J-region start
BASE COUNT 48 a 59 c 58 g 48 t
ORIGIN
1 ctgaggctga tccattattc atatggtgct ggcagcactg agaaaggaga tatccctgat
61 ggatacaagg cctccagacc aagccaagag aacttctccc tcattctgga gttggctacc
121 ccctctcaga catcagtgta cttctgtgcc agcggtggcg gccgggggag ttatgctgag
181 cagttcttcg gaccagggac acgactcacc gtc
//
LOCUS MUSTCBYAU 207 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-33.
ACCESSION M34207
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 207)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 207 T-cell receptor beta-chain (AA at 1)
recomb 163 164 V-region end/J-region start
BASE COUNT 65 a 54 c 36 g 52 t
ORIGIN
1 ctagagttca tgttttccta cagctatcaa aaacttatgg acaatcagac tgcctcaagt
61 cgcttccaac ctcaaagttc aaagaaaaac catttagacc ttcagatcac agctctaaag
121 cctgatgact cggccacata cttctgtgcc agcagcaaaa gggccaacga aagattattt
181 ttcggtcatg gaaccaagct gtctgtc
//
LOCUS MUSTCBYAV 156 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-7.
ACCESSION M34209
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 156)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 156 T-cell receptor beta-chain (AA at 1)
recomb 118 119 V-region end/J-region start
BASE COUNT 30 a 48 c 40 g 38 t
ORIGIN
1 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg
61 gcttctccct ctcagacatc tttgtacttc tgtgccagca gtgtccgggt ctgggggcct
121 gaacagtact tcggtcccgg caccaggctc acggtt
//
LOCUS MUSTCBYAW 132 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma P1F12C4.
ACCESSION M34211
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 132)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 132 T-cell receptor beta-chain (AA at 1)
recomb 94 95 V-region end/J-region start
BASE COUNT 35 a 39 c 28 g 30 t
ORIGIN
1 ccaagccaag agaacttctc cctcattctg gagttggcta ccccctctca gacatcagtg
61 tacttctgtg ccagcggtgc cagacaggca aacacagaag tcttctttgg taaaggaacc
121 agactcacag tt
//
LOCUS MUSTCBYAX 303 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma P1D3A6.
ACCESSION M34213
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 303)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 303 T-cell receptor beta-chain (AA at 1)
recomb 259 260 V-region end/J-region start
BASE COUNT 80 a 72 c 79 g 72 t
ORIGIN
1 aaggtgacag taacaggagg aaacgtgaca ttgagctgtc gccagactaa tagccacaac
61 tacatgtact ggtatcggca ggacactggg catgggctga ggctgatcca ttactcatat
121 ggtgctggca accttcaaat aggagatgtc cctgatgggt acaaggccac cagaacaacg
181 caagaagact tcttcctcct gctggaattg gcttctccct ctcagacatc tttgtacttc
241 tgtgccagca gtgcaggagc tggaaatacg ctctattttg gagaaggaag ccggctcatt
301 gtt
//
LOCUS MUSTCBYAY 159 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma 1E1O.
ACCESSION M34215
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 159)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 159 T-cell receptor beta-chain (AA at 1)
recomb 113 114 V-region end/J-region start
BASE COUNT 35 a 40 c 39 g 45 t
ORIGIN
1 gatgtccctg atgggtacaa ggccaccaga acaacgcaag aagacttctt cctcctgctg
61 gaattggctt ctccctctca gacatctttg tacttctgtg ccagcagtgt gggttctgga
121 aatacgctct attttggaga aggaagccgg ctcattgtt
//
LOCUS MUSTCBYAZ 321 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma 7/6AH1.
ACCESSION M34217
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 321 T-cell receptor beta-chain (AA at 1)
recomb 275 276 V-region end/J-region start
BASE COUNT 86 a 78 c 81 g 76 t
ORIGIN
1 acccaaagcc ctagaaacaa ggtgacagta acaggaggaa acgtgacatt gagctgtcgc
61 cagactaata gccacaacta catgtactgg tatcggcagg acactgggca tgggctgagg
121 ctgatccatt actcatatgg tgctggcaac cttcaaatag gagatgtccc tgatgggtac
181 aaggccacca gaacaacgca agaagacttc ttcctcctgc tggaattggc ttctccctct
241 cagacatctt tgtacttctg tgccagcagt gtgggttctg gaaatacgct ctattttgga
301 gaaggaagcc ggctcattgt t
//
LOCUS MUSTCBYBA 339 bp ss-mRNA ROD 03-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma 2B11.
ACCESSION M34219
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 339)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 339 T-cell receptor beta-chain (AA at 1)
recomb 294 295 V-region end/J-region start
BASE COUNT 90 a 88 c 88 g 73 t
ORIGIN
1 gaggctgcag tcacccaaag ccctagaaac aaggtgacag taacaggagg aaacgtgaca
61 ttgagctgtc gccagactaa tagccacaac tacatgtact ggtatcggca ggacactggg
121 catgggctga ggctgatcca ttactcatat ggtgctggca accttcaaat aggagatgtc
181 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg
241 gcttctccct ctcagacatc tttgtacttc tgtgccagca ggagacaggg gcctagtcaa
301 aacaccttgt actttggtgc gggcacccga ctatcggtg
//
LOCUS CHKATHA 188 bp ss-mRNA VRT 03-JUL-1990
DEFINITION Chicken avian thymic hormone mRNA, partial cds.
ACCESSION M34330
KEYWORDS avian thymic hormone; parvalbumin.
SOURCE Chicken thymus, cDNA to mRNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 188)
AUTHORS Palmisano,W.A. and Henzl,M.T.
TITLE Partial nucleotide sequence of the parvalbumin from chicken thymus
designated "avian thymic hormone"
JOURNAL Biochem. Biophys. Res. Commun. 167, 1286-1293 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 188 avian thymic hormone (AA at 1)
BASE COUNT 53 a 43 c 50 g 42 t
ORIGIN
1 ccggatcaga tcaagaaggt ttttggaatc cttgatcagg acaagagcgg cttcattgaa
61 gaagaagagc ttcagctgtt tctgaagaac ttctcttcga gtgccagagt cctcacctct
121 gcggagacca aagctttcct ggctgcaggt gacaccgacg gcgacgacaa aataggcgta
181 gaagaatt
//
LOCUS DDISGSPA 1957 bp ds-DNA INV 03-JUL-1990
DEFINITION D.discoideum spore germination-specific protein (270-11) gene,
complete cds.
ACCESSION M33862
KEYWORDS spore germination-specific protein.
SOURCE D.discoideum (strain AX-3) germinating spore, cDNA to mRNA, and
DNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 1957)
AUTHORS Giorda,R., Ohmachi,T., Shaw,D.R. and Ennis,H.L.
TITLE a shared internal theronine-glutamic acid-threonine-proline repeat
defines a family of dictyostelium discoideum spore germination-
specific proteins
JOURNAL Biochemistry (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.L.Ennis 17-APR-1990.
FEATURES from to/span description
pept 171 228 spore germination-specific protein, exon 1
316 1856 spore germination-specific protein, exon 2
IVS 229 315 spore germination-specific protein intron A
BASE COUNT 724 a 359 c 258 g 616 t
ORIGIN
1 aaataatttt attattttct tttgtaaaag taattaaata aaaaaaaaaa taaaaaataa
61 ataaaattaa ataaagtcaa ttaaaaaaaa aaaaataata taaatatata taaaataaaa
121 aaaaaaaaac aaaacaataa tagtttatga tataaatttt taataataat atgaaaaata
181 tatatagttt attcttatta tttgcattaa taagtgcaac atttgcaagt aagttgaaaa
241 aaaaaaaaaa aaaattatat tgtaaatttt aaataaaaaa caatatacta attattaatt
301 ttaaaattaa attagataat gcatttattg tacattggaa ttcagattca atttcaaaaa
361 aattaacggg acaaattggt gatacaatct ctttttatac aagtgatgga aattctcatg
421 atgtaaaaag ttcagatggt tctgtttcgt caagtgtttt ctctggtagt cttacaaatc
481 ctggaatttt caaggtaaca cttactaaag aaggtaatat tgaatttacc agttcatatg
541 atgaaggtct ttctgcaaca atagtagttt cttctggtgg tcaaattccg attacaacaa
601 cttcatcaac tacaactgat ggtagttcaa ccccttccac tccaacttca acaacttcag
661 cctcaactac tacaagtggt ggtagtgcta caacaacaac aggagaacca attactgatg
721 gttctaatgg aggcgccagt tccacaactg gcaatagcgg gacgacaggt tctgctacca
781 ctactacttc ttcttcttcc gataattccg atggcagtgt aggtacttca actacaactt
841 caccagctat cacaacttca agtgggtcaa taatcgatcc aacttcacca cctacaactg
901 attcatcctc taatagtggt ggttatggtt catcatcttc aattgaaaat ggcgtagaat
961 gtttattaac aatcactcaa gatgcatttg attcttggac atatgataat attatttaca
1021 ccgtttatca agtaaattta acaaatattg gtacactttc agttgagtct gttattctca
1081 ctccaaatga taactcttta atttaccata cttgggaatt ggtttatgat ggaacttcac
1141 tcactcttcc aacctataga aaagctggtc caatcaatcc agaggaaacc attatctttg
1201 gttatatctc tagaaatagt actgatgtta catttgcttt aagtccaaca tgttcagatt
1261 catcaagtcc aactccaact cctactgaga ctccaactga gactccaact gagactccaa
1321 ctgagactcc aactgagact ccaactgaaa ctccaactga aactccaact gaaactgaaa
1381 ctccaacacc aacaccatca agctcatcta gtgatgtaga tagtggttca tcatctgaaa
1441 ttgaaacccc aacaccaact gaaactgata ccccaacccc aacaccatca agttcttcaa
1501 gtgaaggaag tggatcatca tcagaaactc aaccaccaat tactccacca ccaaccactg
1561 gtacttcttg tttagcccaa gtccaacaaa aagttatcaa ctcatggatt aatggtgaag
1621 ttgatcatta tatacaagtt gaggctacta ttgttaacca aggttcaact ccaatttcat
1681 cttttaattt ttattctgat gctgaacaaa tttggtcagt tgaaaaaaca ggaaccaata
1741 cctataaatt accaagttgg ttctcaacaa ttccagttgg tgggtcccat acctttggtt
1801 atattgttaa atctgctgaa ttatctgacc tcgaaggagt tcaatataca tgttgatttt
1861 aaaactctct ttttgtaata ataaaaaaaa aaaaaaaatt ttttggaaat aaatttaatt
1921 ttcaaaaact agttttgatt tcactttatt taataat
//
LOCUS DDISGSPB 3655 bp ds-DNA INV 03-JUL-1990
DEFINITION D.discoideum pore germination-specific protein gene, complete cds.
ACCESSION M33861
KEYWORDS endo-(1,4)-beta-D-glucanase; spore germination-specific protein.
SOURCE D.discoideum (strain AX-3) germinating spore, cDNA to mRNA, and
DNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 3655)
AUTHORS Giorda,R., Ohmachi,T., Shaw,D.R. and Ennis,H.L.
TITLE A shared internal theronine-glutamic acid-threonine-proline repeat
defines a family of Dictyostelium discoideum spore germination-
specific proteins
JOURNAL Biochemistry (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.L.Ennis, 17-APR-1990.
FEATURES from to/span description
pept 1346 1412 spore germination-specific protein, exon 1
1505 3555 spore germination-specific protein, exon 2
IVS 1413 1504 spore germination-specific protein intron
BASE COUNT 1346 a 482 c 459 g 1368 t
ORIGIN
1 tttttttttt ttaatatttt ttattttatt ttttttttta attattatta attattaatc
61 tttattataa acaaaatgca tatgtgttaa aattattata accaaaaatt aattaattta
121 aaaaactaag aactatagtt ctgagatttt caatagtttt tttcaaataa tatgatttct
181 ttttcaaggg tcattaaaat tatattatta gaactattta aaaaaaattc aaaagttaaa
241 tatttaactt ttgcattttt aaaaccatca attataataa ttaattattt tattattttt
301 tttttttttt tttttttttt aattattttt gttttttttt tttttttttt tttttttttt
361 ttttattaaa aaaactatga atactttaaa ttatagtttt tcattttttt attaactgat
421 cataatttaa tttaatttaa tttaatttat ttttttgtat ttaatactcg aaaaccacat
481 acccatgatt aattaaaaaa aataaaaaaa aataaaaaaa aaagaaaaag tactttttca
541 aataaaaaat gtttataaaa aaaaattttt ttttgaggcc aagttaatat ttttgggtag
601 ttaaaatact aagatttgtt ccaatttgga tttttaatgg tttttatttt taaaaataat
661 aatttaacat ttttctaatc aattttcaaa tttttttttt tataactgat ttcttttttt
721 tttattttaa ttttttttta attttttttt atttaaaaaa tatttcaagt tgtacatttc
781 cgttagaatt tcatttggaa gatattagat tttaatttaa aaacaatttt cctaaaaaat
841 aaaataaaaa atgcgaaatt taattttttt tttttattaa taattatttt gaattaaatt
901 tttttttttt tttttttttt ttcccagatt tccaatctta taaaaaggaa ttgtttttta
961 tttttttttt tttcattttc aaaaaactaa tttattagat ctttaaaaaa aaaaaaaaaa
1021 ataataataa taataaaaat aataatatta tctattatcc aaatttgttt ttgcaattaa
1081 tttcgttatt ttttttttta aaaaactcac cacatactta cacaccaaaa aataacaaaa
1141 ataataattc tattattata atcaatttat tgtagtataa gtttaacttt taaagttcta
1201 ttaaaaaaaa aaaaaaaaaa aaaaaaagaa aaaaaaaaat atataaaata ataaaacttt
1261 tgtttattat ttttatgtac tataaatttc aaattcctat atctaaattt ttaatatttc
1321 taaattttta taaattaaaa ccaatatgaa aatattgaaa aattgtatat tattaataat
1381 atttgggtta ttatcaactc aattaattaa tggtaaagta taaaaaaaaa aaaaaaaaaa
1441 aaatattata tttcttaaac aaaaaaaaaa acaaaatatt aattcttaat ttttttttta
1501 ttagcggata ccgattattg ttcattactt gaaaatgcat taatgtttta taaaatgaat
1561 agagctggtc gtttaccaga taacgatata ccatggagag gtaattcagc attgaatgat
1621 gcaagtccaa attcagctaa agatgccaat ggtgatggta atttaagtgg tggttatttt
1681 gatgctggtg atggtgttaa atttggttta ccaatggctt attctatgac tatgttgggt
1741 tggtcattca ttgaatatga atccaatatt gctcaatgtg gtttgacaag tttatacctc
1801 gatacaatta aatatggtac cgactggctt attgcagcac atactgccga taatgaattt
1861 gcaggccaag ttggtgatgg taatgttgat cattcttggt ggggtcctcc agaagatatg
1921 acaatggctc gtccaactta tatgttaaca accgaagcac caggtactga aattgcaatg
1981 gaagcagcat cagcattagc tgcagcttca atagcattta aatcttcaaa cccaacatac
2041 gctgcaactt gcttagcaca tgctaaaact cttcataatt tcgggtacac ttatcgtggt
2101 gtttattcag attccattac gaatgctcaa gctttttata attcatggtc tggctataag
2161 gatgatttag tttggggtag catttggtta tataaagcaa ctcaagattc agattattta
2221 acaaaagccg ttgcagatta tgcatcaggt ggtgttggtg gaatggcaca aggtaattct
2281 cacgattggg ataataaagc accaggttgt tgtttattat tatctaaatt agttccaacc
2341 acaagtactt ataaaactga tttcgaaggt tggttaaatt attggttacc aggtggaggt
2401 gtcacttata ctccaggtgg tttagcatgg atcagacaat ggggtccagc tcgttatgct
2461 gccactgccg ctttccttgg ttctttagct ggtactgaaa aaggcacaga tttcactcaa
2521 aaacaagttg actatttaat tggtaataat ccaaatcaac aatcatttgt agttggtatg
2581 ggtccaaatt atccaattaa tccacatcat cgtgctgccc atcattctac aactaatgat
2641 ataaataatc cagttaataa tttatacctc ttaaaaggtg ctttagttgg tggaccaggt
2701 tcaaatgatg aatatactga tgatagaact gattatattt caaatgaagt tgcaactgat
2761 tataatgctg gtttcgttgg tgcattagct tctcttgtaa atccatcttc aacttctgtt
2821 ccaaccacaa ctccaacagt aactgaaacc ccaacagaga ctccaactga gactccaact
2881 gagactccaa ctgagactcc aacagagact ccaacagaaa ctccaacaga gactccaaca
2941 gaaactccaa cagagactcc aacagaaact ccaacagaaa ctccaacaga aactccaaca
3001 gaaactccaa cagaaactcc aacagaaact ccaaccgaga ctccaactga aactgttact
3061 ccaaccccaa cagtaacacc aactgaaact ccatcaagtg gagaatcttt atcaatctat
3121 aaaagtggat taaaaaatga tttccaagat tggtcatggg gtgagcattc attaactgat
3181 acaacaaatg ttgaatctgg agaaaccaat tcaatttcat ttacaccaaa agcatatggt
3241 gcagtatttt taggatgttt cgaatgtatt gatactgata catacaataa tattgaattt
3301 gatattaatg gtggtagcag tggtgctcaa ttattaagaa taactgttgt taaaaatagt
3361 aaatctgttg gttccaaatt aattaccgat cttaatggtg gaactccaat cgaagcaaat
3421 tcatggacta aaattaaagc atcctttatt gatgacttta aagtatctgg taaagtcgat
3481 ggtatttgga ttcaagatat caaaggtgat acccaatcaa ctgtatacat aagtaatatt
3541 attgcaactg cttaaaaaaa tattaatatt aaatattaaa aaaagtataa ataaaataat
3601 cttaaattaa aaaaaataag tgttttcgaa attttctata gatatatatc taaaa
//
LOCUS ECOCYSXE 1396 bp ds-DNA BCT 03-JUL-1990
DEFINITION E.coli cysteine regulon 33 Kd (cysE) and 16 Kd protein (cysX)
genes, complete cds.
ACCESSION M34333
KEYWORDS cysE gene; cysX gene.
SOURCE E.coli (strain K-12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1396)
AUTHORS Tei,H., Murata,K. and Kimura,A.
TITLE Structure and expression of cysX, the second gene in the
Escherichia coli K-12 cysE locus
JOURNAL Biochem. Biophys. Res. Commun. 167, 948-955 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 221 1042 33 Kd protein (cysE)
pept 919 527 (c) 16 Kd protein (cysX)
BASE COUNT 325 a 362 c 393 g 316 t
ORIGIN
1 cgcgaactgg cgcatcgctt cggcgttgaa atgccaataa ccgaggaaat ttatcaagta
61 ttatattgcg gaaaaaacgc gcgcgaggca gcattgactt tactaggtcg tgcacgcaag
121 gacgagcgca gcagccacta accccaggga acctttgtta ccgctatgac ccggcccgcg
181 cagaacgggc cggtcattat ctcatcgtgt ggagtaagca atgtcgtgtg aagaactgga
241 aattgtctgg aacaatatta aagccgaagc cagaacgctg gcggactgtg agccaatgct
301 ggccagtttt taccacgcga cgctactcaa gcacgaaaac cttggcagtg cactgagcta
361 catgctggcg aacaagctgt catcgccaat tatgcctgct attgctatcc gtgaagtggt
421 ggaagaagcc tacgccgctg acccggaaat gatcgcctct gcggcctgtg atattcaggc
481 ggtgcgtacc cgcgacccgg cagtcgataa atactcaacc ccgttgttat acctgaaggg
541 ttttcatgcc ttgcaggcct atcgcatcgg tcactggttg tggaatcagg ggcgtcgcgc
601 actggcaatc tttctgcaaa accaggtttc tgtgacgttc caggtcgata ttcacccggc
661 agcaaaaatt ggtcgcggta tcatgcttga ccacgcgaca ggcatcgtcg ttggtgaaac
721 ggcggtgatt gaaaacgacg tatcgattct gcaatctgtg acgcttggcg gtacgggtaa
781 atctggtggt gaccgtcacc cgaaaattcg tgaaggtgtg atgattggcg cgggcgcgaa
841 aatcctcggc aatattgaag ttgggcgcgg cgcgaagatt ggcgcaggtt ccgtggtgct
901 gcaaccggtg ccgccgcata ccaccgccgc tggcgttccg gctcgtattg tcggtaaacc
961 agacagcgat aagccatcaa tggatatgga ccagcatttc aacggtatta accatacatt
1021 tgagtatggg gatgggatct aatgtcctgt gatcgtgccg gatgcgatgt aatcatctat
1081 ccggcctaca gtaactaatc tctcaatacc gctcccggat accccaactg tcgccaggct
1141 tcatacacca ctaccgacac cgcattggac agattcatgc tgcggctgtc cggcaccatc
1201 ggaatgcgaa ttttttgttc agcgggcagg gcatcaagaa tgctcgctgg caggccgcgt
1261 gtttccgggc cgaacatcag ataatcgcca tcctgatagc ttacggcgct gtgagcaggt
1321 gtacctttcg tggtgagggc gaacaggcgc tgggattttc tgcttcgagg aacgcgcgat
1381 agtcatgatg acgcgt
//
LOCUS ECOTRPP 74 bp ds-DNA SYN 03-JUL-1990
DEFINITION Expression plasmid pDS20 derivative.
ACCESSION M34334
KEYWORDS .
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 74)
AUTHORS Latta,M., Philit,M., Maury,I., Soubrier,F., Denefle,P. and
Mayaux,J.-F.
TITLE Tryptophan promoter derivatives on multicopy plasmids: A
comparative analysis of expression potentials in Escherichia coli
JOURNAL DNA 9, 129-137 (1990)
STANDARD simple staff_review
BASE COUNT 18 a 18 c 15 g 23 t
ORIGIN
1 ctcaaggcgc actcccgttc tggataatgt tttttgcgcc gacatcataa cggttctggc
61 aaatattctg aaat
//
LOCUS HUMCYTOK 1724 bp ss-mRNA PRI 03-JUL-1990
DEFINITION Human cytokeratin 8 mRNA, complete cds.
ACCESSION M34225
KEYWORDS cytokeratin 8.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1724)
AUTHORS Yamamoto,R., Kao,L.-C., McKnight,C.E. and Strauss,J.F.III.
TITLE Cloning and sequence of cDNA for human placental cytokeratin 8.
Regulation of the mRNA in trophoblastic cells by cAMP
JOURNAL Mol. Endocrinol. 4, 370-374 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 35 1486 cytokeratin 8
BASE COUNT 401 a 498 c 524 g 301 t
ORIGIN
1 ttcggcaatt cctacctcca ctcctgcctc caccatgtcc atcagggtga cccagaagtc
61 ctacaaggtg tccacctctg gcccccgggc cttcagcagc cgctcctaca cgagtgggcc
121 cggttcccgc atcagctcct cgagcttctc ccgagtgggc agcagcaact ttcgcggtgg
181 cctgggcggc ggctatggtg gggccagcgg catgggaggc atcaccgcag ttacggtcaa
241 ccagagcctg ctgagcccct tgtccctgga ggtggacccc aacatccagg ccgtgcgcac
301 ccaggagaag gagcagatca agaccctgaa caacaagttt gcctccttca tagacaaggt
361 acggttcctg gagcagcaga acaagatgct ggagaccaag tggagcctcc tgcagcagca
421 gaagacggct cgaagcaaca tggacaacat gttcgagagc tacatcaaca accttaggcg
481 gcagctggag actctgggcc aggagaagct gaagctggag gcggagcttg gcaacatgca
541 ggggctggtg gaggacttca agaacaagta tgaggatgag atcaataagc gtacagagat
601 ggagaacgaa tttgtcctca tcaagaagga tgtggatgaa gcatacatga acaaggtaga
661 gctggagtct cgcctggaag ggctgaccga cgagatcaac ttcctcaggc agctgtatga
721 agaggagatc cgggagctgc agtcccagat ctcggacaca tctgtggtgc tgtccatgga
781 caacagccgc tccctggaca tggagagcat cattgctgag gtcaaggcac agtacgagga
841 tattgccaac cgcagccggg ctgaggctga gagcatgtac cagatcaagt atgaggagct
901 gcagagcctg gctgggaagc acggggatga cctgcggcgc acaaagactg agatctcaga
961 gatgaaccgg aacatcagcc ggctccaggc tgagattgag ggcctcaaag gccagagggc
1021 ttccctggag gccgccattg cagatgccga gcagcgtgga gagctggcca ttaaggatgc
1081 caacgccaag ttgtccgagc tggaggccgc cctgcagcgg gccaagcagg acatggcccg
1141 gcagctgcgt gagtaccagg agctgatgaa cgtcaagctg gccctggaca tcgacatcgc
1201 cacctacagg aagctgctgg agggcgagga gagcccgctg gagtctggga tgcagaacat
1261 gagtattcat acgaagacca ccggcggcta tgcgggtggt ttgagctcgg cctatgggga
1321 cctcacagac cccggcctca gctacagcct gggctccagc tttggctctg gcgcgggctc
1381 cagctccttc agccgcacca gctcctccag ggccgtggtt gtgaagaaga tcgagacacg
1441 tgatgggaag ctggtgtctg agtcctctga cgtcctgccc aagtgaacag ctgcggcagc
1501 ccctcccagc ctacccctcc tgcgctgccc cagagcctgg gaaggaggcc gctatgcagg
1561 gtagcactgg gaacaggaga cccacctgag gctcagccct agccctcagc ccacctgggg
1621 agtttactac ctggggaccc cccttgccca tgcctccagc tacaaaacaa ttcaattgct
1681 tttttttttt ttggtcccaa aataaaacct cagctagctc tgcc
//GenBank-Updates@genbank.bio.net (07/04/90)
LOCUS TRBMVAT5A 1544 bp ss-mRNA INV 04-JUL-1990
DEFINITION T.brucei variant surface glycoprotein (MVAT5) mRNA, complete cds.
ACCESSION M33825
KEYWORDS variant surface glycoprotein.
SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1544)
AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E.
TITLE Sequences of three VSG mRNAs expressed in a mixed population of
Trypanosoma brucei rhodesiense
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Donelson, 23-APR-1990.
BASE COUNT 497 a 396 c 366 g 285 t
ORIGIN
1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt
61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt
121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa
181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg
241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt
301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa
361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac
421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg
481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc
541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg
601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca
661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca
721 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca
781 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat
841 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt
901 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac
961 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa
1021 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac
1081 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc
1141 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc
1201 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat
1261 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca
1321 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc
1381 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta
1441 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa
1501 gaatttttgc tactttaaaa acttctgata tattttaaca ccta
//
LOCUS TRBWRATATA 1544 bp ss-mRNA INV 04-JUL-1990
DEFINITION T.brucei variant surface glycoprotein (WRATat A) mRNA, complete
cds.
ACCESSION M33823
KEYWORDS variant surface glycoprotein.
SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1544)
AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E.
TITLE Sequences of three VSG mRNAs expressed in a mixed population of
Trypanosoma brucei rhodesiense
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Donelson, 23-APR-1990.
FEATURES from to/span description
pept 42 1457 variant surface glycoprotein (WRATat A)
mRNA < 1 1544 WRATat A mRNA
BASE COUNT 545 a 387 c 345 g 267 t
ORIGIN
1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc
61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt
121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac
181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg
241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg
301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc
361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa
421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct
481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca
541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg
601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac
661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca
721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa
781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac
841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg
901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt
961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg
1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca
1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg
1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc
1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag
1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt
1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga
1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg
1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta
1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg
//
LOCUS TRBWRATATB 1582 bp ss-mRNA INV 04-JUL-1990
DEFINITION T.brucei variant surface glycoprotein (WRATat B) mRNA, complete
cds.
ACCESSION M33824
KEYWORDS variant surface glycoprotein.
SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1582)
AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E.
TITLE Sequences of three VSG mRNAs expressed in a mixed population of
Trypanosoma brucei rhodesiense
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Donelson, 23-APR-1990.
FEATURES from to/span description
mRNA < 1 1582 WRATat B mRNA
BASE COUNT 532 a 384 c 376 g 290 t
ORIGIN
1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact
61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga
121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa
181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac
241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga
301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat
361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga
421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc
481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca
541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc
601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg
661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag
721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa
781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga
841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact
901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca
961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca
1021 cgacctcata acagctaaat ccggtacgac ataagcggcg caatcactcg gctaaagcaa
1081 ctacaaatcg ccaaagcaaa gctaaaagca agacggctac tgctgcaaaa cagggaacgg
1141 cagcaaacgc gacttatggc gctagcagac aagatgcaag aattgtacca agaggcctta
1201 catgacgagg ttcaactcag gaaggaagcg cagaacaaac ctcaagaaac accagattct
1261 gacaagcaaa aagcatgcga gaaatatcac aacaagtcaa aggaatgcaa agaaaatggt
1321 tgccaatgga gtggaactga agaaaccact aggaaagtgc gaagctaaac ccaaagcagg
1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac
1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt
1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact
1561 tctggtatat tttaacacct tt
//
LOCUS HUMTNC1 1085 bp ds-DNA PRI 04-JUL-1990
DEFINITION Human fast skeletal muscle troponin C gene, exon 1.
ACCESSION M33771 J05502
KEYWORDS troponin C.
SEGMENT 1 of 2
SOURCE Human HeLa cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1085)
AUTHORS Gahlmann,R. and Kedes,L.
TITLE Cloning, structural analysis and expression of the human
fast-twitch skeletal muscle troponin C gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by R.Gahlmann, 20-APR-1990.
FEATURES from to/span description
pept 1033 + 1035 troponin C, exon 1
pre-msg 968 > 1085 troponin C mRNA and introns (alt.)
pre-msg 969 > 1085 troponin C mRNA and introns (alt.)
IVS 1036 > 1085 troponin C intron A
BASE COUNT 266 a 276 c 306 g 237 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct
61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc
121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa
181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa
241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc
301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac
361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc
421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca
481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga
541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt
601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc
661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga
721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc
781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt
841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt
901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag
961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg
1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg
1081 tggac
//
LOCUS HUMTNC2 1773 bp ds-DNA PRI 04-JUL-1990
DEFINITION Human fast skeletal muscle troponin C gene, exons 2-6.
ACCESSION M33772 J05502
KEYWORDS troponin C.
SEGMENT 2 of 2
SOURCE Human HeLa cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1773)
AUTHORS Gahlmann,R. and Kedes,L.
TITLE Cloning, structural analysis and expression of the human
fast-twitch skeletal muscle troponin C gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by R.Gahlmann, 20-APR-1990.
FEATURES from to/span description
pept + 51 102 troponin C, exon 2
247 390 troponin C, exon 3
479 593 troponin C, exon 4
758 894 troponin C, exon 5
1506 1537 troponin C, exon 6
pre-msg < 1 1668 troponin C mRNA and introns
IVS < 1 50 troponin C intron A
IVS 103 246 troponin C intron B
IVS 391 478 troponin C intron C
IVS 594 757 troponin C intron D
IVS 895 1505 troponin C intron E
BASE COUNT 358 a 455 c 628 g 331 t 1 others
ORIGIN About 2.5 kb after segment 1.
1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc
61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag
121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg
181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc
241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg
301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg
361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat
421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc
481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac
541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct
601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag
661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc
721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc
781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg
841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg
901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct
961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc
1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta
1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat
1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac
1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc
1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa
1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa
1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg
1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc
1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca
1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg
1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc
1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc
1741 ggctggcacc acccggagca cccttgggga tcc
//
LOCUS ARRSEG11 631 bp ds-RNA VRL 04-JUL-1990
DEFINITION Adult diarrhea rotavirus segment 11 non-structural protein mRNA,
complete cds.
ACCESSION M34380
KEYWORDS non-structural protein.
SOURCE Adult diarrhea rotavirus, cDNA to viral RNA, clone AD63, and viral
RNA segment 11.
ORGANISM Adult diarrhea rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 631)
AUTHORS Chen,G.-M., Hung,T. and Mackow,E.R.
TITLE cDNA cloning of each genomic segment of the group B rotavirus ADRV:
Molecular characterization of the 11th RNA segment
JOURNAL Virology 175, 605-609 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 59 571 non-structural protein
BASE COUNT 246 a 104 c 122 g 159 t
ORIGIN
1 ggtatataaa agtcagtaga cggctggaaa cgttgcacgt actactcact acccagagat
61 ggcagaagcg tctgagttca attttaccat aaagagaaag cagagaacaa tgagtgatcg
121 aagaacgaga gaggatacaa aacaaaaaaa gattgaagag aaatctgata ttgatttggt
181 tgattcagcc tctgtctatt cacaagagtc ttctagaagt aactatagtg atgcttacga
241 taaattaaaa cgtgaaccaa tggttgaaga atcaaatgac gcaaagtata ggaattttga
301 attttctgaa gatgaggaag ttcatagacc atcaagtaaa gcatcagaca aatcatacag
361 agaaatgaaa cgtaagcatg atgacattaa tacgtcagat tctattttag agaaactgtc
421 agaattaaat ctcgagattg aaaagataaa acaaatgaac caaccaatta caattgacgc
481 agcattcaac atgattcttc gaaatgtcga caatctaaca ataaggcaga agcaggcctt
541 aataaatgca attgttaatt ctatgaacta gactgtggtc tccacctgtc atctactggg
601 gtcaacctct tgagttatat ttaaaaaacc c
//
LOCUS BOVFESUP 1112 bp ss-mRNA MAM 04-JUL-1990
DEFINITION Bovine Rieske iron-sulfur protein mRNA, complete cds.
ACCESSION M34336
KEYWORDS Rieske iron-sulfur protein.
SOURCE Bovine heart, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1112)
AUTHORS Usui,S., Yu,L. and Yu,C.-A.
TITLE Cloning and sequencing of a cDNA encoding the Rieske iron-sulfur
protein of bovine heart mitochondrial ubiquinol-cytochrome c
reductase
JOURNAL Biochem. Biophys. Res. Commun. 167, 575-579 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 16 825 Rieske iron-sulfur protein precursor
sigp 16 234 Rieske iron-sulfur protein signal peptide
matp 235 822 Rieske iron-sulfur protein
BASE COUNT 269 a 249 c 281 g 313 t
ORIGIN
1 ggagctgctg tcgccatgtt gtcggttgcc gcccgctcgc gccattcgcg cccgtcctat
61 cggccacgtc ccgcggggtg gcgcgcgctg cggccctggt acagccggtc gtccaaggag
121 tcgcctgtgc tggacttgaa gcgctccgtc ctgtgccggg agtcgctcag aggccaggcc
181 gcggcggctt tggtcgcttc cgtgagcctc aatgtccctg cttctgtccg ttattcccat
241 acagacatca aagtgcctga cttctctgac tatcgtcgcc ctgaagtgtt agatagtaca
301 aagtcttcaa aagagagcag tgaggctcgg aaaggtttct cctatttggt gactgcaaca
361 actactgtgg gtgttgcata tgctgccaag aatgtcgtct ctcagtttgt ttccagcatg
421 agtgcttctg ccgatgtgtt ggccatgtcg aaaattgaaa tcaagttatc cgatattcca
481 gaggggaaga acatggcttt caaatggaga ggcaaacccc tgttcgtgcg ccatcgaacc
541 aagaaggaaa ttgaccagga agctgcagtt gaagtgtccc agttgaggga cccacagcat
601 gacttagaac gagtaaagaa acctgagtgg gttatcttga taggtgtttg cactcatctt
661 ggttgtgtac ccattgcaaa tgcaggagat tttggtggtt attactgccc ttgccatggg
721 tcacactatg atgcgtctgg cagaatcaga aaggggcctg cacctctcaa ccttgaagtt
781 ccctcatatg agttcaccag cgatgatatg gtgattgttg gttagagaga cctggactcg
841 agttacaggc ttctttcgtc ttcatgtcac ctcagaagag ttacttgaag aacaagcctt
901 ctgtgctttg aattagttga tttgaaatat ttaagaattg ctaataatgt gtttgcaaac
961 atgtaaattg aatttaatgt tgaatacttt caagcattca cctaaaaaag ccccccatta
1021 aacattgtta tgctcaaagg gtgcagtgtc tttgatagtt aattctaatt aaaaattgta
1081 gatgagtgta taaaatattt gtgaaatctg ta
//
LOCUS HUMARX 3231 bp ss-mRNA PRI 04-JUL-1990
DEFINITION Human androgen-receptor mRNA, complete cds.
ACCESSION M34233
KEYWORDS androgen-receptor.
SOURCE Human testis and prostate cancer cell line LNCaP, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3231)
AUTHORS Govindan,M.V.
TITLE Specific region in hormone binding domain is essential for hormone
binding and trans-activation by human androgen receptor
JOURNAL Mol. Endocrinol. 4, 417-427 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 109 2829 androgen-receptor /hgml_locus_uid="LL0058U"
/nomgen="AR" /map="Xq12"
BASE COUNT 713 a 941 c 883 g 694 t
ORIGIN
1 agctagctgc agcgactacc gcatcatcac agcctgttga actcttctga gcaagagaag
61 gggaggcggg gtaagggaag taggtggaag attcagccaa gctcaaggat ggaagtgcag
121 ttagggctgg gaagggtcta ccctcggccg ccgtccaaga cctaccgagg agctttccag
181 aatctgttcc agagcgtccg cgaagtgatc cagaacccgg gccccaggca cccagaggcc
241 gcgagcgcag cacctcccgg cgccagtttg ctgctgctgc agcagcagca gcagcagcag
301 cagcagcagc agcagcagca gcagcaagag actagcccca ggcagcagca gcagcagcag
361 ggtgaggatg gttctcccca agcccatcgt agaggcccca caggctacct ggtcctggat
421 gaggaacagc aaccttcaca gccgcagtcg gccctggagt gccaccccga gagaggttgc
481 gtcccagagc ctggagccgc cgtggccgcc agcaaggggc tgccgcagca gctgccagca
541 cctccggacg aggatgactc agctgcccca tccacgttgt ccctgctggc ccccactttc
601 cccggcttaa gcagctgctc cgctgacctt aaagacatcc tgagcgaggc cagcaccatg
661 caactccttc agcaacagca gcaggaagca gtatccgaag gcagcagcag cgggagagcg
721 agggaggcct cgggggctcc cacttcctcc aaggacaatt acttaggggg cacttcgacc
781 atttctgaca acgccaagga gttgtgtaag gcagtgtcgg tgtccatggg cctgggtgtg
841 gaggcgttgg agcatctgag tccaggggaa cagcttcggg gggattgcat gtacgcccca
901 cttttgggag ttccacccgc tgtgcgtccc actccttgtg ccccattggc cgaatgcaaa
961 ggttctctgc tagacgacag cgcaggcaag agcactgaag atactgctga gtattcccct
1021 ttcaagggag gttacaccaa agggctagaa ggcgagagcc taggctgctc tggcagcgct
1081 gcagcaggga gctccgggac acttgaactg ccgtctaccc tgtctctcta caagtccgga
1141 gcactggacg aggcagctgc gtaccagagt cgcgactact acaactttcc actggctctg
1201 gccggaccgc cgccccctcc gccgcctccc catccccacg ctcgcatcaa gctggagaac
1261 ccgctggact acggcagcgc ctgggcggct gcggcggcgc agtgccgcta tggggacctg
1321 gcgagcctgc atggcgcggg tgcagcggga cccggttctg ggtcaccctc agccgccgct
1381 tcctcatcct ggcacactct cttcacagcc gaagaaggcc agttgtatgg accgtgtggt
1441 ggtggtgggg gtggtggcgg cggcggcggc ggcggcggcg gcggcgaggc gggagctgta
1501 gccccctacg gctacactcg gccccctcag gggctggcgg gccaggaaag cgacttcacc
1561 gcacctgatg tgtggtaccc tggcggcatg gtgagcagag tgccctatcc cagtcccact
1621 tgtgtcaaaa gcgaaatggg cccctggatg gatagctact ccggacctta cggggacatg
1681 cgtttggaga ctgccaggga ccatgttttg cccattgact attactttcc accccagaag
1741 acctgcctga tctgtggaga tgaagcttct gggtgtcact atggagctct cacatgtgga
1801 agctgcaagg tcttcttcaa aagagccgct gaagggaaac agaagtacct gtgcgccagc
1861 agaaatgatt gcactattga taaattccga aggaaaaatt gtccatcttg tcgtcttcgg
1921 aaatgttatg aagcagggat gactctggga gcccggaagc tgaagaaact tggtaatctg
1981 aaactacagg aggaaggaga ggcttccagc accaccagcc ccactgagga gacaacccag
2041 aagctgacag tgtcacacat tgaaggctat gaatgtcagc ccatctttct gaatgtcctg
2101 gaagccattg agccaggtgt agtgtgtgct ggacacgaca acaaccagcc cgactccttt
2161 gcagccttgc tctctagcct caatgaactg ggagagagac agcttgtaca cgtggtcaag
2221 tgggccaagg ccttgcctgg cctccgcaac ttacacgtgg acgaccagat ggctgtcatt
2281 cagtactcct ggatggggct catggtgttt gccatgggct ggcgatcctt caccaatgtc
2341 aactccagga tgctctactt cgcccctgat ctggttttca atgagtaccg catgcacaag
2401 tcccggatgt acagccagtg tgtccgaatg aggcacctct ctcaagagtt tggatggctc
2461 caaatcaccc cccaggaatt cctgtgcatg aaagccatgc tactcttcag cattattcca
2521 gtggatgggc tgaaaaatca aaaattcttt gatgaacttc gaatgaacta catcaaggaa
2581 ctcgatcgta tcattgcatg caaaagaaaa aatcccacat cctgctcaag acgcttctac
2641 cagctcacca agctcctgga ctccgtgcag cctattgcga gagagctgca tcagttcact
2701 tttgacctgc taatcaagtc acacatggtg agcgtggact ttccggaaat gatggcagag
2761 atcatctctg tgcaagtgcc caagatcctt tctgggaaag tcaagcccat ctatttccac
2821 acccagtgaa gcattggaaa ccctatttcc ccaccccagc tcatgccccc tttcagatgt
2881 cttctgcctg ttataactct gcactactcc tctgcagtgc cttggggaat ttcctctatt
2941 gatgtacagt ctgtcatgaa catgttcctg aattctatct gctgggcttt ttttttctct
3001 ttctctcctt tctttttctt cttccctccc tatctaaccc tcccatggca ccttcagact
3061 ttgcttccca ttgtggctcc tatctgtgtt ttgaatggtg ttgtatgcct taaatctgtg
3121 atgatcctca tatggcccag tgtcaagttg tgcttgttta cacgcatctc tgtgccagcc
3181 acacaaaccg tttacttact taccgcaagg gaacttagag agctagaatt c
//
LOCUS DBIDB58 2149 bp ds-DNA PLN 04-JUL-1990
DEFINITION D.biflorus lectin DB58 gene, complete cds.
ACCESSION M34271 J05280
KEYWORDS lectin.
SOURCE D.biflorus 3-week old leaf and stem DNA.
ORGANISM Dolichos biflorus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 2149)
AUTHORS Harada,J.J., Spadoro-Tank,J., Maxwell,J.C., Schnell,D.J. and
Etzler,M.E.
TITLE Two lectin genes differentially expressed in Dolichos biflorus
differ primarily by a 116-base pair sequence in their 5' flanking
regions
JOURNAL J. Biol. Chem. 265, 4997-5001 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 902 1729 lectin DB58
mRNA 865 > 2149 lectin DB58 mRNA
BASE COUNT 671 a 474 c 294 g 710 t
ORIGIN
1 gaagagatgt tgaaatttaa tatattttta ttattatgga tgaaaattta cgtgactctc
61 ttattaaaca atgattgcct tataattttt tttttcaata attaaaagat gcgttaaaaa
121 gaataaggaa aatcctcttt taatatatat aattactttt tataatactt tacgaaaaat
181 attattatta ttattattat tatattatta cagtagatta taaatagtta tatcttaaaa
241 aaaatttgtc aatatatcac tatttaaaaa ataattatag aatattcgtt atgtgagtat
301 tttttaaaag atattaataa aaggactcaa attcattctt tacctcatat ttattttagt
361 ttttttttaa atttcatttt aatttattgt gaatagtata atatataatt caattattac
421 aaatgaataa ttaaattatt aggaactgat aaagtaaagt aaaaaaagtg aagaacaata
481 tatctacatt aaaagaacat tatgatatat gtggagtttc tgcatcactt tgggaccgaa
541 agtggactat attggtaata tgtattgtac tacataaatt atatgattac caaagttata
601 tgcaccgtac tatctcaatt atatgtttcc aaaaccacaa aaaatataat tttttttctt
661 tttaactcta ttataatata ttgttttatt atttaatata aataaaacca tacaataaaa
721 gcatataaca agagaaatca ttttcaaaca cgtgtttatc tctatagatg atgtgattta
781 gattttgagt ttctatggtt aagaatacca aaacctagat gaggttttga actataaata
841 tacaacacag gcttctccat atgcatcacc catcaccata gttgagcaaa acatcaaagc
901 catggcttcc tccacagtct ctgttgtcct ctccctcttc ctgctgcttc tcacccaagc
961 atactcagct gacatccaat ccttctcctt caaaaacttc aactcatcca gcttcatcct
1021 ccaaggtgac gccaccgtat cctccagcaa gttacgactc acaaaagtta agggcaacgg
1081 ccttcccacc ctgtcttctc tcggccgcgc cttctactcc tcccccatcc aaatctatga
1141 caagtccacc ggcgccgtcg ccagctgggc cacctccttc acagccaaca ttttcgctcc
1201 caacaagtca tcctccgccg acgggattgc cttcgctttg gtacccgtcg ggtctgagcc
1261 caaaagcaac agtggctttc taggtgtttt cgacagcgat gtctacgaca actccgccca
1321 gactgtggca gtggagttcg acacgttctc caacaccgac tgggacccca ccagccgcca
1381 cattgggatc gacgtgaact ccatcaagtc tatcagaacg gcgtcgtggg gtttggccaa
1441 cggccaaaac gccgagattc tgatcaccta caacgccgcc acctcccttt tggtggcttc
1501 tctggtccac ccttctcgca gaacgagcta catcgtctcc gaaagagtgg acatcactaa
1561 tgaacttccg gagtatgtga gcattgggtt ctccgccacc accgggcttt ccgaaggcta
1621 cactgaaact cacgacgtcc tctcttggtc ttttgcttcg aagctcccag atgactccac
1681 cactgaacct ttggacattg ctagctacct cgtccgcaac gtcctctaga ctccttcttc
1741 ctccctgtga cagtgcttcc atgccataac gctctcttcc tcttcctctt catcttcctc
1801 ttcctcttcc cactccttct tctttcttcc taataatctt catctgtcac agtaattaat
1861 attatcttac cttctttcta cactctctgt tcttctttta tatgttaacc tcatctccta
1921 atcaactctg aacataatat tttaatacaa acatcacata taatatttat tataataatt
1981 acccgttaaa atgtctttta atacatatat taaaataaat atatgaagaa aatattagta
2041 tataattatt tgagaaatat aattaaattt ataatgtgaa ttaattgctc tatgagtgaa
2101 tagtccaaaa caaatatttg aaaaaggaaa aaaattgaaa atggatagg
//
LOCUS DBISLEC 2230 bp ds-DNA PLN 04-JUL-1990
DEFINITION D.biflorus seed lectin (SL) gene, complete cds.
ACCESSION M34270 J05280
KEYWORDS lectin.
SOURCE D.biflorus 28-day postanthesis seed DNA.
ORGANISM Dolichos biflorus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 2230)
AUTHORS Harada,J.J., Spadoro-Tank,J., Maxwell,J.C., Schnell,D.J. and
Etzler,M.E.
TITLE Two lectin genes differentially expressed in Dolichos biflorus
differ primarily by a 116-base pair sequence in their 5' flanking
regions
JOURNAL J. Biol. Chem. 265, 4997-5001 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1013 1840 seed lectin
mRNA 976 > 2230 seed lectin mRNA
BASE COUNT 677 a 498 c 347 g 708 t
ORIGIN
1 ttaacgcctt taagttttat gaagagatgg gtgcgaagat ggttgaagaa tacaggtttt
61 tcagaatgac tgggaaggct cttcaggatt ctggttaaac attcactgta acatgttatt
121 taacttattc ctattttatg atttctaata agattattaa ttatttttta taatacttta
181 cgaaaaaaaa ttattattat tattatatta ttataataga ttataaacaa ttatatctta
241 aaagaaatta tatcactatt taaaaaataa ttatagaata ttcgttatgt gagcattttt
301 taaaagatat taataaaagg actcaaattc attctttacg tcatattttt cgaaatgtat
361 tattttattt tttattaaat ttcattttaa tttattgtga atagtataat atataattat
421 gaataattaa tattattagg aagtgataaa gtaaagtaaa gaatgaagaa taatatatct
481 acactttaaa agaacattat gatatatgtc gagtttctgc atcactttgg gaccaaaagt
541 gtactatatt ggtaatatgt actgtggtac ataaattata tgataaccaa agttacatgg
601 accgtactat ctcaattata tgttaccgaa atcacaaaaa atataattat tttttctttt
661 taactctatt ataatatatt ttttattatt taatataaaa aaaaccatag gataaaagca
721 tataacaaga gaaaacaata tttaaacacg ttatctctat agcccaatag ggacggcaca
781 ccccccactt caagcgcacc tctgtgtcaa cagaatcact ctaagagtat agtcgtggaa
841 tcggtgaacc acgccagctg cttagatgcg tggcgcagag gatgtgattt agattttgag
901 tttctatggt taagattacc aaaatctaga tgaggttttc aactataaat agacaagaga
961 ggctcctcaa tatgcatcac ccatcaccat agttgagcaa aacatcaaag ccatggcttc
1021 ctccacagtc tctgttgtcc tctccctctt cctgctgctt ctcacccaag caaactcagc
1081 caacatccaa tccttctcct tcaaaaactt caactcaccc agcttcatcc tccaaggtga
1141 cgccaccgta tcctccggca agttacaact caccaaagtt aaggaaaacg gcattcccac
1201 cccgtcttct ctcggccgcg ccttctactc ctcccccatc caaatctatg acaagtccac
1261 aggcgccgtc gccagctggg ccacctcctt cacagtcaag atttcagctc ccagcaaggc
1321 atccttcgcc gatgggattg ccttcgcttt ggtacccgtc gggtctgagc ccagacgcaa
1381 cggtggctat ctaggtgttt tcgacagcga tgtctacaac aactccgccc agactgtggc
1441 agtggagttc gacacgttct ccaacagcgg ctgggacccc agcatgaagc acattgggat
1501 cgacgtgaac tccatcaagt ctatcgcaac ggtgtcgtgg gatttggcca acggcgaaaa
1561 cgccgagatt ctgatcacgt acaacgccgc cacctccctt ttggtggctt ctctggtcca
1621 cccttctcgc agaacgagct acatcctctc cgaaagagtg gacatcacta atgaacttcc
1681 cgagtatgtg agcgttgggt tctccgccac caccgggctt tccgaaggct acattgaaac
1741 tcacgacgtc ctctcttggt cttttgcttc caagctccca gatgactcca ccgctgaacc
1801 tctggacctt gctagctacc tcgtccgcaa cgtcctctag actccttctt cctccctgtg
1861 acggtgcttc catgccataa cactctcttc ctcttcctct tcctcttccc accccttctt
1921 ctttcttcat aataataatc ttcatctgtc acagtaataa tattatctta ccttctttct
1981 acactctccg ttcttctttt atattttaac ctcatctcct aatcaactct taacaacata
2041 atatttaata taaataaaat tacccgttaa aatgtctttt aatacatata ttaagataaa
2101 tatatgaaga aaagattagt atataattat ttgagaaata tgttattatg ggttaagtaa
2161 tgaatttatt ttaagaaaat atgtaattga gtaaacctgt tttcgtaaac aactttgtaa
2221 aatggaattc
//
LOCUS STMBLAC1 957 bp ds-DNA BCT 04-JUL-1990
DEFINITION S.badius beta-lactamase gene, complete cds.
ACCESSION M34178
KEYWORDS beta-lactamase.
SOURCE S.badius DNA.
ORGANISM Streptomyces badius
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 957)
AUTHORS Forsman,M., Haeggstroem,B., Lindgren,L. and Jaurin,B.
TITLE Molecular analysis of beta-lactamases from four species of
Streptomyces: Comparison of amino acid sequences with those of
other beta-lactamases
JOURNAL J. Gen. Microbiol. 136, 589-598 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 16 957 beta-lactamase precursor (EC 3.5.2.6)
sigp 16 93 beta-lactamase signal peptide
matp 94 954 beta-lactamase
BASE COUNT 156 a 357 c 320 g 124 t
ORIGIN
1 caggaggttt cgcccatgca tcacccccgc gtcaggaccg ccgtcgccgg tctggtcgcc
61 gtgctgtccc tcgtgccgct ggcggcctgc gggcagagcg actccaccgc gcccccttcg
121 tccgccaagc ccgccacgag cgcgtcggcc tcgctgccgc ggccgaagcc gtacaccggc
181 gacttcaaga agctggagcg cgagttcgac gcacggctgg gcgtgtacgc gatcgacacc
241 ggcaccggac gcgaggtgac ccacaacgac cgggcgcgct tcgcctacca ctccaccttc
301 aaggcgctcc aggccgcggt cgtcctcagc acctactccc tggacggcct cgacaagcgg
361 gtgacgtaca cccgcgagga cctggtcgcc cactctccgg tgaccgagaa gcacgtcgac
421 accggcatga cgctgaagga gctgtgcgac gcctccgtgc gctacagcga caacaccgcg
481 gccaacctcc tcttcgaccc acgtgcggcc gggcccaagg gcctggacgc ctcgctggag
541 aagctcggcg acgacatcac gcggatggac cgggaggagc cggagctgag ccgatgggtc
601 cccggtgaga agcgcgacac ctccacgccc cgggcgctgg ccgaggatct gcgcgcgttc
661 gtcctgggca aggccctgcg cgcacccgag cgggcgcagc tgacgacgtg gctgcggacg
721 aacaccaccg gggacgcggt gatcagggcg ggtgtgccgg agaactgggt cgtcggtgac
781 aagaccggga ccggcagcta ctacggggcc cgcaacgaca tcgccgtggt gtggcccccg
841 gactccgcgc ccatcgtgat cgccatcctg tcccaccgcg gtacgaagga cgcggagccg
901 gacgacgaac tcatcgccga agcggcctcc gtggtcgtcg actcgctgtc ctcctag
//
LOCUS STMBLAC2 948 bp ds-DNA BCT 04-JUL-1990
DEFINITION S.lavendulae beta-lactamase gene, complete cds.
ACCESSION M34180
KEYWORDS beta-lactamase.
SOURCE S.lavendulae DNA.
ORGANISM Streptomyces lavendulae
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 948)
AUTHORS Forsman,M., Haeggstroem,B., Lindgren,L. and Jaurin,B.
TITLE Molecular analysis of beta-lactamases from four species of
Streptomyces: Comparison of amino acid sequences with those of
other beta-lactamases
JOURNAL J. Gen. Microbiol. 136, 589-598 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 31 948 beta-lactamase precursor (gtg start codon) (EC
3.5.2.6)
sigp 31 133 beta-lactamase signal peptide
matp 134 945 beta-lactamase
BASE COUNT 136 a 373 c 333 g 106 t
ORIGIN
1 gaccgcaaga gccgtagggg agagcgcatc gtggggacca caggagcccg tccgagccgt
61 cgtgccgtac tgaccgccgc cgcgggggcg gcggtggcgg ggatccccct gggcgggtcc
121 acggccttcg cggccccgcg ggggaacccg gacgtcctgc ggcagctgcg cgccctcgaa
181 caggaacact ccgcccgcct gggcgtgtac gcccgggaca cggccaccgg ccggaccgtc
241 ctccaccgcg cggaggagcg ctttccgatg tgctcggtct tcaagaccct cgccgtcgcc
301 gccgtcctgc gcgacctcga ccgggacggc gagttcctgg ccacacgcct cttctacacc
361 gaacaggaag tgaaggactc cgggttcggg ccggtcaccg gcctgcccga gaacctcgcg
421 gccggcatga ccgtcgagcg gctgtgcgcc gccgccatct gccagagcga caacgccgcg
481 gcgaacctgc tgctccggga gctgggcggc ccggaggcgg tcacccgttt ctgccggtcg
541 gtgggggacc ggaccacccg gctggaccgc tgggagcccg aactgaactc cgccgagccc
601 ggccggctca ccgacaccac caccccgcgc gccatcgggg cgacctacgg cgaactcgtc
661 ctcggcgacg cgctcgcccc gcgggaccgc gagcggctca ccggctggct cctggcgaac
721 acgacgagca ccgaacgctt ccgcaagggc ctgcccgccg actggaccct gggcgacaag
781 accggcggcg gcgcgtacgg cacgaacaac gacgccggcg tcacctggcc cccgcaccgc
841 ccgccggtgg tgatggtcgt cctgacgacc cacgaccggc cggacgcggt cgccgacaac
901 ccgctcgtgg ccaagacggc agcgctgctg gcgtcggccc tcggctga
//
LOCUS STMBLAC3 951 bp ds-DNA BCT 04-JUL-1990
DEFINITION S.fradiae beta-lactamase gene, complete cds.
ACCESSION M34179
KEYWORDS beta-lactamase.
SOURCE S.fradiae DNA.
ORGANISM Streptomyces fradiae
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 951)
AUTHORS Forsman,M., Haeggstroem,B., Lindgren,L. and Jaurin,B.
TITLE Molecular analysis of beta-lactamases from four species of
Streptomyces: Comparison of amino acid sequences with those of
other beta-lactamases
JOURNAL J. Gen. Microbiol. 136, 589-598 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 31 951 beta-lactamase precursor (gtg start codon) (EC
3.5.2.6)
sigp 31 133 beta-lactamase signal peptide
matp 134 948 beta-lactamase
BASE COUNT 133 a 352 c 361 g 105 t
ORIGIN
1 tcgtcctgcc gtagggagaa ggggtccatc gtggatcgca ccaccgcacg tccgaaccgc
61 cgagccgtcc tcgccacagg ggtgggggcc gcgctggcgg ccaccgcggc agcggcgggc
121 cccgcgcacg ccgctccggg gcggggcgcc cgcgtcgagg ggcgcctgcg ggcgctggag
181 cgcacgcacg acgcgcggct cggcgcgttc gcgtacgaca cgggcacggg gcggacggtg
241 gcgtaccgcg cggacgagcg cttcccgatc gcgtcgatgt tcaagacgat cgccgtcgcc
301 gccgtgctgc gcgacctgga ccgggacggc gaggtcctcg cccggcgcgt ccactacacg
361 gcggactacg tgaagcggtc gggctactcc cccgtcaccg ggctgccgga gaacgtggcg
421 aacggcatga ccgtcgccga gctgtgcgag gcgacgctga cccgcagcga caacacggcg
481 gccaacctgc tcctgcgcga tctgggcggc ccgaccgcgg tgacccgttt ctgccgctcc
541 gtcggcgacc acgtcacccg gctggaccgc tgggagcccg agctgaactc ggccgagccc
601 gggagggtca ccgacaccac gtcgccgcgc gccatcggcc gcacgtacgg gcggctgatc
661 ctcggcgacc tgctggcggc gcacgaccgg gagcggctga cgcggtggat gctggacaac
721 cggacgagtg acgagcggtt ccgcaagggc ctgcccgcgg actggctgct ggcggacaag
781 acgggcggcg gcgactacgg caccaacaac gacgcggggg tcgcctggcc gcccggacgc
841 ccgccggtcg tgctggccgt ccagacgacc cggttcaccc cggacgccga ggccgacaac
901 gtcctcgtcg cggaggcggc acgcctcctg gcggaggcga tgaccgactg a
//
LOCUS LBKRRNA 1528 bp ss-rRNA RNA 04-JUL-1990
DEFINITION Lactobacillus kandleri 16S small subunit ribosomal RNA.
ACCESSION M23038
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE Lactobacillus kandleri (strain DSM 20593) rRNA.
ORGANISM Lactobacillus kandleri
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1528; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1528 16S rRNA
BASE COUNT 411 a 326 c 436 g 336 t 19 others
ORIGIN
1 nntaaattga gagtttgatc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc
61 aagtcgaacg cactgtggtt gaaatgagat gagaagcttg cttcaagtca aatgccaaca
121 ttgcagtgag tggcgaacgg gtgagtaaca cgtgggaaac ctncctctta gcaggggata
181 acatctggaa acaggtgcta ataccgtata acaataaaaa ccgcatggtt ttgatttaaa
241 agatggttct gctatcacta agagatggtc ccncggtgca ttagttagtt ggtgaggtaa
301 aggctcacca agacgatgat gcatagccga gttgagagac tgatcggcca caatgggact
361 gagacacggc cnatactcct acgggagnca gcagtaggga atcttccaca atggacgaaa
421 gtctgatgga gcaacgccgn gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta
481 agagaagaat gacattgaga gtaactgttc aatgtgtgac ggtatcttac cagaaaggaa
541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccnagcgtta tccggattta
601 ttgggcgtaa agcgagcgca gacggttatt taagtctgaa gtgaaagccc tcggcttaac
661 cgagnnattg ctttggaaac tggataactt gagtgcagta gaggaaagtg gaactccatg
721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctttctgga
781 ctgtaactga cgttgaggct cnnaagtgtg ggtagcaaac aggattagat accctggtag
841 tccacaccgt aaacgatgag tgctagctgt tcgagggttt ccgcccttga gtggcgaagc
901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg
961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaagnaacgc gaagaacctt
1021 accaggtctt gacatccttt gaccactcca gagatggagn tttcccttcg gggtcaaagt
1081 gacaggtggn gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccncaa
1141 cgagcgcaac ccttattgtt agttgccagc atttagttgg gcactctagc aagactgccg
1201 gtgacaaacc ggaggaaggc ggggatgacg tcaaatcatc atgcccctta tgacctgggc
1261 tacacacgtg ctacaatggc aagtacaacg agcagccaac ccgcgagggt gcgcaaatct
1321 cttaaagctt gtctcagttc ggactgtagg ctgcaactcg cctacacgaa gtcggaatcg
1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cgggncttgt acacaccgcc
1441 cgtcacacca tgagagtttg taacacccaa agtcggtgag gtaaccttta taggggccag
1501 ccgcctaagg tgggacagat gatnaggg
//
LOCUS LBMRRNA 1524 bp ss-rRNA RNA 04-JUL-1990
DEFINITION Lactobacillus minor, 16S small subunit ribosomal RNA.
ACCESSION M23039
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE L.minor(strain DSM 20014) rRNA.
ORGANISM Lactobacillus minor
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1524; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1524 16S rRNA
BASE COUNT 414 a 320 c 431 g 345 t 14 others
ORIGIN
1 nncaaaatga gagtttgatc ctggctcagg atnaacgctg gcggcgtgcc taatacatgc
61 aagtcgaacg ctttgtggtt caactgatat gaagagcttg ctcggatttg aagatgaaca
121 ttgcaaagag tggcgaacgg gtgagtaaca cgtgggaaac ctncctctta gcgggggata
181 acacttggaa acaagtgcta ataccgcata atactgataa ccgcatggtt atcagttgaa
241 agatggttct gctatcacta aganatggtc ccncggtgta ttagctagtt ggtaaggtaa
301 tggcttacca aggcaatgat acatagccga gttgagagac tgatcggcca caatgggact
361 gagacacggc cnatactcct acgggaggca gcagtaggga atcttccaca atggacgaaa
421 gtctgatgga gcaacgccgn gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta
481 agagaagaat gacattgaga gtaactgttc agtgtgtgac ggtatcttac cagaaaggaa
541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccaagcgtta tccggattta
601 ttgggcgtaa agcgagcgca gacggttgtt taagtctgaa gtgaaagccc acagcttaac
661 tgtggaagtg ctttggaaac tggataactt gagtgcagta gaggagagtg gaactccatg
721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctctctgga
781 ctgtaactga cgttgaggct cgnaagtgtg ggtagcaaac aggattagat accctggtag
841 tccacaccgt aaacgatgag tgctagatgt ttgagggttt ccgcccttaa gtgtcgcagc
901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg
961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaagnaacgc gaagaacctt
1021 accagctctt gacatccttt gaccacttca gagatgaagc tttcccttcg gggacaaagt
1081 gacaggtggn gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa
1141 cgagcgcaac ccttattact agttgccagc atttagttgg gcactctagt gagactgccg
1201 gtgacaaacc ggaggaaggt ggggatgacg tcaaatcatc atgcccctta tgagctgggc
1261 tacacacgtg ctacaatggc aagtacaacg agcagctaac ccgcgagggt acgcgaatct
1321 cttaaaactt gtctcagttc ggattgtagg ctgcaactcg cctacatgaa gtcggaatcg
1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cnggtcttgt acacaccgcn
1441 cgtcacacca tgagagtttg taacacccaa agccggtgag gtaacctttt aggagccaac
1501 cgtctaaggt gggacagatg atta
//
LOCUS LBTRRNA 1529 bp ss-rRNA RNA 04-JUL-1990
DEFINITION L.halotolerans 16S small subunit ribosomal RNA.
ACCESSION M23037
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE L.halotolerans (strain DSM 20190) rRNA.
ORGANISM Lactobacillus halotolerans
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1529; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1529 16S rRNA
BASE COUNT 406 a 329 c 437 g 328 t 29 others
ORIGIN
1 nncaaaatga gagtttgatc ctggctcagg angaacgctg gcggcgtgcc taatacatgc
61 aagtcgaacg ccttgtcgtt ctactgattt agagagcttg ctcaatactg acgtagaact
121 atacaaggag tggcgaacgg gtgagtaaca cgtgggaaac cnncctctta gtaggggata
181 acacctggaa acaggngcta ataccgcata acaacgataa ccgcatggtt atcgcttgaa
241 aggtggcgta agctaccgct aagagatggt cccgcggtgc attagttagt tggtgaggta
301 atggctcacn nagacgatga tgcatagccg agttgagaga ctgatcggcc acaatgggac
361 tgagacacgg ccnntnctcc tacgggagnc agcagtaggg aatcttccac aatggacgaa
421 agtctgatgg agcaacgcng ngtgtgtgat gaagggtttc ggctcgtaaa acactgttgt
481 aagagaagaa tgatactaag agtaactgtt tagtatgtga cggtatctta ccagaaagga
541 acggctaaat acgtgccagc agccncggta atacgtatgt tccnagcgtt atccggattt
601 attgggcgta aagcgagcgc agacggttat ttaagtctga agtgaaagcc cncagcttaa
661 ctgtggnagt gctttggaaa ctggataact tgagtgcagt agaggaaagt ggaactccat
721 gtgtagcggt gaaatgcgta gatatatgga agaacaccag cggcgaaggc ggctttctgg
781 actgtaactg acgttgaggc tcgnaagtgt gggtagcaaa caggattaga taccctggta
841 gtccacaccg taaacgatga gtgctagatg tttgggggtt tccgccccna agtgtcgtag
901 ctaacgcatt aagcactccg cctggggagt acgaccgcaa ggttgaaact caaaggaatt
961 gacggggacc ngcacaagcg gnggagcatg tggtttaatt cgaagnaacg cgaagaacct
1021 taccaggnct tgacatcctt tgaccacctc agagatgagg ctttcccttc ggggacaaag
1081 tgacaggtgg ngcatggttg tcgtcagctc gtgtcgtgag atgttgggtt aagtcccgca
1141 acgagcgcaa cccttatgat tagttgccag catttagttg ggcactctaa tcagactgcc
1201 ggtgacaaac cggaggaagg tggggatgac gtcaaatcat catgcccctt atgacctggg
1261 ctacacacgt gctacaatgg caagtacaac gagcagctaa cccgtaaggg cacgcgaatc
1321 tcttaaaact tgtctcagtt cggattgtgg gctgcaactc gcccncatga agtcggaatc
1381 gctagtaatc gcggatcagc acgccgcggt gaatacgttc ccgggtcttg tacacaccgc
1441 ncgtcacncc atgagagttt gtaacaccca aagccggtga ggtaacccgc aagggaacca
1501 gccgtctaag gtgggacaga tgattnggg
//
LOCUS LCORRNA 1525 bp ss-rRNA RNA 04-JUL-1990
DEFINITION L.confusus 16S small subunit ribosomal RNA.
ACCESSION M23036
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE L.confusus (strain DSM 20196) rRNA.
ORGANISM Lactobacillus confusus
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1525; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1525 16S rRNA
BASE COUNT 407 a 325 c 440 g 339 t 14 others
ORIGIN
1 nccaaattga gagtttgatc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc
61 aagtcgaacg ctttgtggtt caactgattt gaagagcttg ctcagatatg acgatggaca
121 ttgcaaagag tggcgaacgg gtgagtaacn cgtgggaaac cnacctctta gcaggggata
181 acatttggaa acagatgcta ataccgtata acaatgacaa ccgcatggtn gtnatttaaa
241 agatggttct gctatcacta agagatggtc ccgcggtgca ttagctagtt ggtaaggtaa
301 tggcttacca aggcgatgat gcatagccga gttgagagac tgatcggcca caatgggact
361 gagacacggc ccatactcct acgggaggca gcagtaggga atcttccaca atgggcgaaa
421 gcctgatgga gcaacgccnn gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta
481 agagaagaat gacattgaga gtaactgttc aatgtgtgac ggtatcttac cagaaaggaa
541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccnagcgtta tccggattta
601 ttgggcgtaa agcgagcgca gacggttatt taagtctgaa gtgaaagccc tcagctcaac
661 tgaggaattg ctttggaaac tggatgactt gagtgcagta gaggaaagtg gaactccatg
721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctttctgga
781 ctgtaactga cgttgaggct cnaaagtgtg ggtagcaaac aggattagat accctggtag
841 tccacaccgt aaacgatgag tgctaggtgt ttgagggttt ccgcccttaa gtgccgcagc
901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg
961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaannaacgc gaagaacctt
1021 accaggtctt gacatccctt gacaactcca gagatggagn gttcccttcg gggacaaggt
1081 gacaggtggn gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa
1141 cgagcgcaac ccttattact agttgccagc attcagttgg gcactctagt gagactgccg
1201 gtgacaaacc ggaggaaggt ggggatgacg tcaaatcatc atgcccctta tgacctgggc
1261 tacacacgtg ctacaatggc gtatacaacg agttgcaaac ccgcgagggt gagctaatct
1321 cttaaagtac gtctcagttc ggattgtagg ctgcaactcg cctacatgaa gtcggaatcg
1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cgggtcttgt acacaccgcc
1441 cgtcacacca tgagagtttg taacacccaa agccggtggg gtaaccttcg ggagccagcc
1501 gtctaaggtg ggacagatga ttagg
//
LOCUS LEURRNAC 1493 bp ss-rRNA RNA 04-JUL-1990
DEFINITION L.cremoris 16S small subunit ribosomal RNA.
ACCESSION M23034
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE L.cremoris (strain DSM 20346) rRNA.
ORGANISM Leuconostoc cremoris
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1493; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1493 16S rRNA
BASE COUNT 398 a 324 c 438 g 324 t 9 others
ORIGIN
1 tcaaattgag agtttgatcc tggctcagga tgaacgctgg cggcgtgcct aatacatgca
61 agtcgaagcc acagcgaaag gtgcttgcac ctttcaagtg agtggcgaac gggtgagtaa
121 cacgtggaca acctgcctca aggctgggga taacatttgg aaacagatgc taataccgaa
181 taaaacttag tgtcgcatga cacaaagtta aaaggcgctt cggcgtcacc tagagatgga
241 tccgcggtgc attagttagt tggtggggta aaggcctacc aagacaatga tgcatagccg
301 agttgagaga ctgatcggcc acattgggac tgagacacgg cccaaactcc tacgggaggc
361 tgcagtaggg aatcttccac aatgggcgaa agcctgatgg agcaacgccg cgtgtgtgat
421 gaaggctttc gggtcgtaaa gcactgttgt atgggaagaa cagctagaat aggaaatgat
481 tttagtttga cggtaccata ccagaaaggg acggctaaat acgtgccagc agccgcggta
541 atacgtatgt cccgagcgtt atccggattt attgggcgta aagcgagcgc agacggttta
601 ttaagtctga tgtgaaagcc cggagctcaa ctccggnatg gcattggaaa ctggttaact
661 tgagtgcagt agaggtaagt ggaactccat gtgtngcggt ggaatgcgta gatatatgga
721 agaacaccag tggcgaaggc ggcttactgg actgcaactg acgttgaggc tcgnaagtgt
781 gggtagcaaa caggattaga taccctggta gtccacaccg taaacgatga acactaggtg
841 ttaggaggtt tccgcctctt agtgccnaag ctaacgcatt aagtgttccg cctggggagt
901 acgaccgcaa ggttgaaact caaaggaatt gacggggacc cgcacaagcg gtggagcatg
961 tggtttaatt cgaagcaacg cgaagaacct taccaggtct tgacatcctt tgaagctttt
1021 agagatagaa gtgttctctt cggagacaaa gtgacaggtg gtgcatggtc gtcgtcagct
1081 cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccttattg ttagttgcca
1141 gcattcagat gggcactcta gcgagactgc cggtgacaaa ccggaggaag gcggggacga
1201 cgtcagatca tcatgcccct tatgacctgg gctacacacg tgctacaatg gcgtatacaa
1261 cgagttgcca acccgcgagg gtgagctaat ctcttaaagt acgtctcagt tcggattgta
1321 gtctgcaact cgactacatg aagtcggaat cgctagtaat cgcggatcag cacgccgcgg
1381 tgaatacgtt cccgggtctt gtacacaccg cccgtcacac catgggagtt tgtaatgccc
1441 aaagccggtg gcctaacctt tnaggaagga gcngtctaag gnagganaga nga
//
LOCUS LEURRNAL 1499 bp ss-rRNA RNA 04-JUL-1990
DEFINITION L.lactis 16S small subunit ribosomal RNA.
ACCESSION M23031 M23032
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE L.lactis (strain DSM 20202) rRNA.
ORGANISM Leuconostoc lactis
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1499; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1499 16S rRNA
BASE COUNT 396 a 320 c 435 g 316 t 32 others
ORIGIN
1 nntaaattga gagtttgatc ctggctcagg atnaacgctg gcggcgtgcc taatacatgc
61 aagtcgaagc cgcagcgaaa ggtgcttgca ccnttcaagc gagtggcgaa cgggtgagta
121 acacgtggat aacctgcctc aaggctgggg ataacatttg gaaacagatg ctaataccga
181 ataaaactta gtatcgcatg atacaaagtt caaaggcgct acggcgncac ctagagatgg
241 gnccgcggtg cattagttag ttggtggggt aaaggcctac caagacaatg atgcatagcc
301 gagttgagag actnatcggc cacattggga ctgagacacg gcccnaactc ctacgggagg
361 cagcagtagg gaatcttcca caatgggcga aagcctnatg gagcaacgcc nngtgtgtga
421 tgaaggcttt agggtcgtaa agcacngttg tatgggaaga aatgctagaa tagggaatga
481 ttctagttcg acggtaccat accagaaagg gacggctaaa tacgtgccag cagcnncggt
541 aatacgtatg tccnnagcgt tatccggatt tattgggcgt aaagcgagcg cagacggttg
601 attaagtctg atgtgaaagc cnggagctca actccggnat ggcattggaa actggttaac
661 ttgagtgttg tagaggtaag tggaactcca tgtgtagcgg tggaatgcgt agatatatgg
721 aagaacacca gtggcgaagg cggcttactg gacaacaact gacgttgagg ctcnnaagtg
781 tgggtagcaa acaggattag ataccctggt agtccacacc gtaaacgatg aatactaggt
841 gttaggaggt ttccgcctct tagtgccnaa gctaacgcat taagtattcc gcctggggag
901 tacgaccgca aggttgaaac tcaaaggaat tgacggggac cngcacaagc ggtggagcat
961 gtggtttaat tcgaagnaac gcgaagaacc ttaccaggtc ttgacatcct ttgaagcttc
1021 tagagataga agtgttctct tcggagacaa agtgacaggt ggngcatggt cgtcgtcagc
1081 tcgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccttatt gttagttgcc
1141 agcattcagt tgggcactct agcgagactg ccggtgacaa accggaggaa ggcggggacg
1201 acgtcagatc atcatgcccc ttatgacctg ggctacacac gtgctacaat ggcgtataca
1261 acgagttgcc aacccgcgag ggtgagctaa tctcttaaag tacgtctcag ttcggactgc
1321 agtctgcaac tcgactgcac gaagtcggaa tcgctagtaa tcgcggatca gcacgccgcg
1381 gtgaatacgt tcccnggnct tgtacacacc gnnnntcacn ccatgggagt ttgtaatgcc
1441 cnaagccggt ggcctaacct tatggaggga gccgtctaag gcaggacaga tgactaggg
//
LOCUS LEURRNAM 1554 bp ss-rRNA RNA 04-JUL-1990
DEFINITION L.mesenteroides 16S small subunit ribosomal RNA.
ACCESSION M23035
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE L.mesenteroides (DSM 20343) rRNA.
ORGANISM Leuconostoc mesenteroides
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1554; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1554 16S rRNA
BASE COUNT 406 a 338 c 447 g 334 t 29 others
ORIGIN
1 tcaaattgag agtttgatcc tggctcagga tgaacgctgg cggcgtgcct aatacatgca
61 agtcgaagcc acagcgaaag gtgcttgcac ctttcaagtg agtggcgaac gggtgagtaa
121 cacgtggaca acctgcctca aggctgggga taacatttgg aaacagatgc taataccgaa
181 taaaacttag tgtcgcatga cacaaagtta aaaggcgctt cggcgtcacc tagagatgga
241 tccgcggtgc attagttagt tggtggggta aaggcctacc aagacaatga tgcatagccg
301 agttgagaga ctgatcggcc acattgggac tgagacacgg cccaaactcc tacgggaggc
361 tgcagtaggg aatcttccac aatgggcgaa agcctgatgg agcaacgccg cgtgtgtgat
421 gaaggctttc gggtcgtaaa gcactgttgt atgggaagaa cagctagaat aggaaatgat
481 tttagtttga cggtaccata ccagaaaggg acggctaaat acgtgccagc agccgcggta
541 atacgtatgt cccgagcgtt atccggattt attgggcgta aagcgagcgc agacggttta
601 ttaagtctga tgtgaaagcc cggagctcaa ctccggnatg gcattggaaa ctggttaact
661 tgagtgcagt agaggtaagt ggaactccat gtgtngcggt ggaatgcgta gatatatgga
721 agaacaccag tggcgaaggc ggcttactgg actgcaactg acgttgaggc tcgnaagtgt
781 gggtagcaaa caggattaga taccctggta gtccacaccg taaacgatga acactaggtg
841 ttaggaggtt tccgcctctt agtgccnaag ctaacgcatt aagtgttccg cctggggagt
901 acgaccgcaa ggttgaaact caaaggaatt gacggggacc cgcacaagcg gtggagcatg
961 tggtttaatt cgaagcaacg cgaagaacct taccaggtct tgacatcctt tgaagctttt
1021 agagatagaa gtgttctctt cggagacaaa gtgacaggtg gtgcatggtc gtcgtcagct
1081 cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccttattg ttagttgcca
1141 gcattcagat gggcactcta gcgagactgc cggtgacaaa ccggaggaag gcggggacga
1201 cgtcagatca tcatgcccct tatgacctgg gctacacacg tgctacaatg gcgtatacaa
1261 cgagttgcca acccgcgagg gtgagctaat ctcttaaagt acgtctcagt tcggattgta
1321 gtctgcaact cgactacatg aagtcggaat cgctagtaat cgcggatcag cacgccgcgg
1381 tgaatacgtt cccgggtctt gtacacaccg cccgtcacac catgggagtt tgtaatgccc
1441 aaagccggtg gcctaacctt ttaggaagga gccgtctaag gcaggacaga tgaccggggn
1501 nnnnnngtaa caagnnnnnn nnnnnngaac ctgnnnnnng atcacctcct ttct
//
LOCUS LEURRNAO 1510 bp ss-rRNA RNA 04-JUL-1990
DEFINITION L.oenos 16S small subunit ribosomal RNA.
ACCESSION M35820 M23032
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE L.oenos (strain DSM 20252) rRNA.
ORGANISM Leuconostoc oenos
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1510; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1510 16S rRNA
BASE COUNT 387 a 316 c 431 g 351 t 25 others
ORIGIN
1 nttaaattga gagtttgatc ctggctcagg atgaacgctg gtggcgtgcc taatacatgc
61 aagtcgtacg tagccgctga attgatcttc gggtgaagtg aggcaatgac tagagtggcg
121 aactggtgag taacacgtna gaaacctgcc ctttagtggg ggataacatt tggaaacaga
181 tgctaatacc gcgtaacaac aaatcacacn tgtgatctgt ttgaaaggtc cttttggatc
241 gctagaggat ggtcttgcgg cgtattagct tgttggtagg gtagaagcct accaaggcaa
301 tgatgcgtag ccgagttgag agactggctg gccncattgg gactgagaca ctgcccnaac
361 tcctacggga gnctgcagta gggaattttc cgcaatgcac gaaagtgtga cggagcgacg
421 ccgcgtgtgt gatgaaggct ttcgggtcgt aaagcactgt tgtaagggaa gaataactga
481 attcagagaa agttttcagc ttgacggtac cttaccagaa agggatggct aaatacgtgc
541 cagcagccgc ggtaatacgt atgtcccnag cgttatccgg atttattggg cgtaaagcga
601 gcgcagacgg tttattaagt ctgatgtgaa atcccgaggc ccaacctcgg tactgcattg
661 gaaactgatt tacttgagtg cgatagaggc aagtggaact ccatgtgtag cggtgaaatg
721 cgtagatatg tggaagaaca ccagtggcga aagcggcttg ctagatcgta actgacgttg
781 aggctcgaaa gtatgggtag caaacgggat tagatacccc ggtagtccat accgtaaacg
841 atgggtgcta gttgttaaga ggtttccgcc tcctagtgac gtagcaaacg cattaagcac
901 ccngcctgag gagtacggcc gcaaggctga aacttaaagg aattgacggg gnnccgcaca
961 agcggtggag catgtggttt aattcgaaga tacgcnnaaa accttaccag gtcttgacat
1021 accaatgatc gcttttgtaa tgaaagcttt tcttcggaac attggataca ggtggtgcat
1081 ggtcgtcgtc agctcgtgtc gtgagatgtt gggttnagtc ccgcaacgag cgcnaccctt
1141 gttattagtt gccagcattt ngttgggcac tctaatgaga ctgccggtga tnaaccggag
1201 gaaggtgggg acgacgtcag atcatcatgc cccttatgac ctgggcaaca cacgtgctac
1261 aatgggaagt acaacgagtc gcaaaccggc gacggtaagc taatctctta aaacttctct
1321 cagttcggac tggagtctgc aactcgactc cacgaaggcg gaatcgctag tnatcgcgaa
1381 tcagcatgtc gcggtgaatn cgttcccngg ncttgtacac nccgcnngtc aaatcntggg
1441 agtcggaagt acccaaagtc gcttggctaa cttttngagg ccggtgccta aggtaaaatc
1501 gatgactggg
//
LOCUS LEURRNAPS 1524 bp ss-rRNA RNA 04-JUL-1990
DEFINITION L.paramesenteroides 16S small subunit ribosomal RNA.
ACCESSION M23033
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE L.paramesenteroides (strain DSM 20288) rRNA.
ORGANISM Leuconostoc paramesenteroides
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1524; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1524 16S rRNA
BASE COUNT 404 a 326 c 435 g 340 t 19 others
ORIGIN
1 nttaaaatga gagtttganc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc
61 aagtcgaacg ctttgtcttt aactgatatg aagagcttgc tctgatttga ttttatctga
121 caaagagtgg cgaacgggtg agtaacacgt gggtaaccnr cctcttagca ggggataaca
181 tttggaaaca agtgctaata ccgtataata ccaacaaccg catggtngtn ggttgaaaga
241 tggttctgct atcactaaga gatggacccn cggtgcatta gctagttggt aaggtaacgg
301 cttaccaagg caatgatgca tagccgagtt gagagactga tcggccacaa tgggactgag
361 acacggccca tactcctacg ggagncagca gtagggaatc ttccacaatg ggcgcaagcc
421 tgatggagca acgccnngtg tgtgatgaag ggtttcggct cgtaaaacac tgttataaga
481 gaagaacggc actgagagta actgttcagt gtgtgacggt atcttaccag aaaggaacgg
541 ctaaatacgt gccagcagcc gcggtaatac gtatgttccn agcgttatcc ggatttattg
601 ggcgtaaagc gagcgcagac ggttatttaa gtctgaagtg aaagccctca gctcaactga
661 ggaatggctt tggaaactgg atgacttgag tgcagtagag gaaagtggaa ctccatgtgt
721 agcggtgaaa tgcgtagata tatggaagaa caccagtggc gaaggcggct ttctggactg
781 taactgacgt tgaggctcga aagtgtgggt agcaaacagg attagatacc ctggtagtcc
841 acaccgtaaa cgatgagtgc tagatgttcg agggtttccg cccttgagtg tcgcagctaa
901 cgcattaagc actccgcctg gggagtacga ccgcaaggtt gaaactcaaa ggaattgacg
961 gggaccngca caagcggngg agcatgtggt ttaattcgaa gnaacgcgaa gaaccttacc
1021 aggncttgac atcccttgct aatcctagaa ataggangtt cccttcgggg acaaggtgac
1081 aggtggngca tggttgtcgt cagctcgtgt cgtgagatgt tgggttaagt cccgcaacga
1141 gcgcaaccct tattattagt tgccagcatt cagttgggca ctctagtgag actgccggtg
1201 acaaaccgga ggaaggtggg gatgacgtca aatcatcatg ccccttatga cctgggctac
1261 acacgtgcta caatggcata tacaacgagt cgccaacccg cgagggtgcg ctaatctctt
1321 aaagtatgtc tcagttcgga ttgtaggctg caactcgcct acatgaagtc ggaatcgcta
1381 gtaatcgcgg atcagcacgc cgcggtgaat acgttcccgg gncttgtaca caccgcccgt
1441 cacaccatga gagtttgtaa cacccaaagc cggtggggta accttttagg agccagccgt
1501 ctaaggtggg acagatgatn aggg
//
LOCUS LVIRRNAA 1582 bp ss-rRNA RNA 04-JUL-1990
DEFINITION L.viridescens 16S small subunit ribosomal RNA.
ACCESSION M23040
KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE L.viridescens (strain DSM 20410) rRNA.
ORGANISM Lactobacillus viridescens
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (sites)
AUTHORS Yang,D. and Woese,C.R.
TITLE Phylogenetic structure of the "leuconostocs": An interesting case
of a rapidly evolving organism
JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1582; for [1])
AUTHORS Yang,D. and Woese,C.R.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 10-MAR-1989, for release after publication.
Author address: C.R.Woese
Microbiology Department
131 Burrill Hall
407 S. Goodwin Ave
Urbana, IL 61801
Ph: 217-333-9369
E-mail carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1582 16S rRNA
BASE COUNT 426 a 334 c 436 g 351 t 35 others
ORIGIN
1 tacaaaatga gagtttgatc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc
61 aagtcgaacg ctttgtggtc caactgattt gaagagcttg ctcagatatg acgatggaca
121 ttgcaaagag tggcgaacgg gtgagtaaca cgtgggaaac ctacctctta gcaggggata
181 acacttggaa acaagtgcta ataccgtata acactaataa ccgcatggtt attagttaaa
241 agntggtctt gctatcacta agagatggtc ccacggtgta ttagctagtt ggtaaggtaa
301 tggcttacca aggcaatgat acatagccga gttgagagac tgatcggcca caatgggact
361 gagacacggc ccatactcct acgggaggca gcagtaggga atcttccaca atggacgaaa
421 gtctgatgga gcaacgccgc gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta
481 agagaagaat gacattgaga gtaactgttc agtgtgtgac ggtatcttac cagaaaggaa
541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccaagcgtta tccggattta
601 ttgggcgtaa agcgagcgca gacggttatt taagtccgaa gtgaaagccc acagcttaac
661 tgtggaagtg ctttggaaac tggataactt gagtgcagta gaggagagtg gaactccatg
721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctctctgga
781 ctgtaactga cgttgaggct cgaaagtgtg ggtagcaaac aggattagat accctggtag
841 tccacaccgt aaacgatgag tgctagatgt ttgagggttt ccgcccttaa gtgtcgcagc
901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg
961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaagcaacgc gaagaacctt
1021 accaggtctt gacatccttt gaccacttca gagatgaagc tttcccttcg gggacaaagt
1081 gacaggtggt gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa
1141 cgagcgcaac ccttattact agttgccagc attcagttgg gcactctagt gagactgccg
1201 gtgacaaacc ggaggaaggt ggggatgacg tcaaatcatc atgcccctta tgacctgggc
1261 tacacacgtg ctacaatggc aagtacaacg agcagctaac ccgcgagggt acgcgaatct
1321 cttaaaactt gtctcagttc ggattgtagg ctgcaactcg cctacatgaa gtcggaatcg
1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cgggncttgt acacaccgcc
1441 cgtcacacca tgagagtttg taacacccaa agccggtgag gtaacctttt aggagccaac
1501 cgtctaaggt gggacagatg attagggnnn nnnngtaaca agnnnnnnnn nnnnnnnnnn
1561 nnnnnnngat cacctccttt ct
//
LOCUS DIRPARAM 2545 bp ss-mRNA INV 04-JUL-1990
DEFINITION D.immitis paramyosin mRNA, partial cds.
ACCESSION M29733
KEYWORDS paramyosin.
SOURCE D.immitis, cDNA to mRNA.
ORGANISM Dirofilaria immitis
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 2545)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 2545 paramyosin (AA at 1)
BASE COUNT 949 a 420 c 544 g 632 t
ORIGIN
1 gcattcggta gtatgtcggt tgccgatttg ggaagtttaa ctcgcttgga ggacaagatt
61 cgtctactgc aagaagatct cgaatccgaa cgtgaacttc gaaatagaat cgaacgagaa
121 agagctgatc ttagtgtaca actgattgca ttaactgata gacttgaaga tgctgagggt
181 actactgata gtcagattga atcaaatcgt aaacgtgaag cagaattgca aaaattacgt
241 aaattattgg aagaatcaca attagaaaat gaagatgcaa tgaatgtttt acgtaaaaag
301 catcaagatg catgtctcga ttacgctgaa caaattgaac aattacaaaa gaaaaattca
361 aagattgatc gtgaacgtca acgtctgcaa catgaagtaa ttgagcttac tgcgacaatt
421 gatcaacttc aaaaggataa gcatttggcg gaaaaagcag cggaacgttt tgaagcgcaa
481 actatcgaat tgagtaataa agttgaagat ttaaatcgac atgttaatga tttagctcaa
541 caacgtcaac gtttacaagc tgaaaataac gatcttctca aagagattca tgatcaaaaa
601 gtacaattgg ataatttgca acacgtgaaa tatcaacttg cgcaacaact tgaagaagca
661 cgtcgacgtt tagaagatgc tgaacgcgaa cgttctcaac ttcaagcaca acttcatcaa
721 gtacaattag aattggattc tgtacgtacc gcattagatg aagaatcagc agctcgagca
781 gaagctgaac ataaacttgc tttagcaaat accgagatta cccaatggaa gagtaaattt
841 gatgctgaag ttgctcttca tcatgaagag gttgaagatt tacgaaagaa aatgctccaa
901 aaacaggctg aatatgagga acaaattgaa ataatgttgc aaaaaatttc acaacttgaa
961 aaagctaaat cacgcttaca aagtgaagtt gaagtattaa ttgttgattt ggaaaaagca
1021 caaaatacaa tagcaattct tgaacgagca aaagaacaac tcgaaaaaac tgttaatgaa
1081 ttaaaggtac gtattgatga acttactgtt gaacttgaag cagcacaacg tgaagctcgt
1141 gctgcattag cagaattaca aaagatgaaa aatttatacg aaaaagcaat tgagcaaaaa
1201 gaagccttag ctcgagaaaa taaaaaatta caagatgatt tgcatgaagc aaaagaagcg
1261 cttgctgatg caaatcgtaa attgcatgag cttgatcttg aaaatgcacg attagctggt
1321 gaaatacgag aattgcaaac agctcttaag gaatctgaag ctgcacgacg tgatgctgag
1381 aatcgtgcac aacgtgcact tgctgaatta caacaacttc gtattgaaat ggaacgtcgt
1441 ttgcaagaga aagaggaaga aatggaagca cttcgtaaga atatgcaatt tgaaattgat
1501 cgacttacag ctgcattagc cgatgctgaa gcacgtatga aggcggaaat ttcacgtcta
1561 aagaagaagt atcaagcgga aattgccgaa cttgaaatga ccgttgataa tttgaatcgg
1621 gcaaatattg aagcgcaaaa gacgattaaa aaacaaagtg aacaacttaa aatattgcaa
1681 gcatcacttg aggatactca acgtcaacta cagcaaactc ttgatcagta tgcattggcg
1741 caacgtaaag tttccgcact tagcgctgaa ttagaagaat gcaaagttgc acttgataat
1801 gcaattcgtg ctcgtaaaca agctgaaatt gatcttgaag aagcgaatgc tcgtattact
1861 gatttggtat caattaacaa taatttaaca gcaattaaaa ataaattgga aactgaatta
1921 tcgaccgcac aagctgattt ggatgaagca acgaaagaat tgcatgccgc ggatgagcgg
1981 gctaatcgtg ctttagcaga tgcagctcga gctgttgaac aacttcatga ggagcaggaa
2041 cattccatga aaattgatgc tctcaggaaa tcattagaag agcaggtgaa gcaattacaa
2101 gtacaaattc aagaagctga agctgcagca ttattgggtg gtaaacgtgt aattgctaaa
2161 cttgaaacac gtatacgtga tttggaaact gctttggatg aagaaacacg tcgacataag
2221 gaaacgcaag gtgcgcttag aaagaaagat cgacgcatca aagaagttca aatgcaagtt
2281 gatgaagagc ataagatgtt tgtgatggcg caagatactg ctgataggct tcttgaaaaa
2341 cttaacattc aaaagtggca acttggagaa gctgaatctt taacgatggc taatcttcaa
2401 cgagtacgtc gatatcaacg tgaattggaa gatgctgaag gtcgtgctga tcaagctgaa
2461 agctcattac atcttattcg tgctaaacat cgttcgtcag ttgttaccgg caaaaatgct
2521 tcagcatcta aaatttacgt tctcg
//
LOCUS ONGPARAM1 134 bp ss-mRNA INV 04-JUL-1990
DEFINITION O.volvulus paramyosin mRNA partial cds, clone OV-1.
ACCESSION M29734
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 134)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 134 paramyosin (AA at 1)
BASE COUNT 53 a 23 c 28 g 30 t
ORIGIN
1 ttacaaaaga aaaattcaaa aattgaccgt gaacgtcaac gtctgcaaca tgaagtaatt
61 gagcttacag cggcaattga tcaacttcaa aaggataagc atttggctga gaaagcagca
121 gaacgttttg aagc
//
LOCUS ONGPARAM2 135 bp ss-mRNA INV 04-JUL-1990
DEFINITION O.volvulus paramyosin mRNA partial cds, clone OV-2.
ACCESSION M29735
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 135)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 135 paramyosin (AA at 2)
BASE COUNT 43 a 31 c 26 g 35 t
ORIGIN
1 tttgcaacat gtcaaatatc aacttgcgca acaacttgaa gaagcacgtc gtcgtttgga
61 agatgctgaa cgtgaacgtt cacaacttca agcacaactt catcaagttc agctagaatt
121 ggattccgta cgtac
//
LOCUS ONGPARAM3 176 bp ss-mRNA INV 04-JUL-1990
DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-3.
ACCESSION M29736
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 176)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 176 paramyosin (AA at 1)
BASE COUNT 74 a 20 c 36 g 46 t
ORIGIN
1 aaagttgctc ttcatcatga agaggttgaa gatttacgaa agaaaatgtt gcaaaagcaa
61 gctgaatatg aagagcaaat tgaaataatg ctgcaaaaaa tttcacaact tgaaaaggct
121 aaatcacgtt tgcaaagtga agttgaagta ttgattgttg atttagaaaa agcaca
//
LOCUS ONGPARAM4 414 bp ss-mRNA INV 04-JUL-1990
DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-4.
ACCESSION M29737
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 414)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 414 paramyosin (AA at 3)
BASE COUNT 143 a 70 c 101 g 100 t
ORIGIN
1 ccttagctcg agaaaataaa aagttgcaag atgatttgca tgaagctaaa gaagcgcttg
61 ctgatgcaaa tcgtaaattg catgaacttg atcttgagaa tgcacgatta gctggtgaaa
121 ttcgagaatt gcaaacagca ctcaaagaat ctgaagcagc tcgacgtgat gctgagaatc
181 gtgcacaacg tgcacttgct gaattgcaac aacttcgcat cgaaatggaa cgtcgtttgc
241 aagagaagga agaagaaatg gaagcacttc gtaagaatat gcaatttgaa attgatcgtc
301 ttacagctgc attagctgat gctgaagcac gtatgaaggc ggagattgca cgtttgaaga
361 aaaagtatca ggcggaaatt gccgaacttg aaatgaccgt tgataatttg aatc
//
LOCUS ONGPARAM5 145 bp ss-mRNA INV 04-JUL-1990
DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-5.
ACCESSION M29738
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 145)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 145 paramyosin (AA at 3)
BASE COUNT 48 a 28 c 28 g 41 t
ORIGIN
1 tattgcaagc atcacttgaa gatactcaac gtcaattaca acaaactcta gatcagtatg
61 ctttggcaca acgtaaagta tctgcgctta gtgctgaatt ggaagaatgc aaagttgcac
121 ttgataatgc aattcgtgct cgtaa
//
LOCUS ONGPARAM6 127 bp ss-mRNA INV 04-JUL-1990
DEFINITION O.volvulus paramyosin mRNA, 3' end, clone OV-7.
ACCESSION M29740
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 127)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 56 paramyosin (AA at 30
mRNA < 1 127 paramyosin mRNA
BASE COUNT 44 a 20 c 24 g 39 t
ORIGIN
1 ctgctggcaa aaatgcttca gcatcgaaga tttacgttct cgaagatgat cagtagatag
61 atagatggtg ataatagttg aaatcaactg atatcatctg ttaacaatca ttaaaatgtg
121 tctcatc
//
LOCUS ONGPARAM7 129 bp ss-mRNA INV 04-JUL-1990
DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-6.
ACCESSION M29739
KEYWORDS paramyosin.
SOURCE O.volvulus, cDNA to mRNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 129)
AUTHORS Limberger,R.J. and McReynolds,L.A.
TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 129 paramyosin (AA at 3)
BASE COUNT 48 a 20 c 31 g 30 t
ORIGIN
1 aagtacaaat tcaagaagct gaagcagcag cattattggg tggtaaacgt gtaattgcta
61 aacttgaaac acgtatacgt gatttggaaa ctgctttgga tgaagaaaca cgtcgacata
121 aggaaacgc
//
LOCUS CORDTXRA 2529 bp ds-DNA BCT 04-JUL-1990
DEFINITION C.diphtheriae toxin repressor (dtxR) gene, complete cds.
ACCESSION M34239
KEYWORDS regulatory protein; toxin repressor.
SOURCE C.diphtheriae (strain C7) DNA, clone pHH2500..
ORGANISM Corynebacterium diphtheriae
Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE 1 (bases 1 to 1390)
AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R.
TITLE Molecular cloning and DNA sequence analysis of the diptheria tox
regulatory element (dtxR) from Corynebacterium diptheriae
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2529)
AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.R.Murphy, 11-MAY-1990.
University Hospital
Dept Medicine-Rm E613
88 East Newton Street
Boston, MA 02118-2393
FEATURES from to/span description
pept 681 1361 toxin repressor protein (dtxR)
pept 25 456 ORF1
pept 1384 2370 ORF3
signal 595 629 promoter sequence
binding 664 669 ribosomal binding site (put.)
binding 1200 1283 DNA-binding region (put.)
BASE COUNT 691 a 615 c 593 g 630 t
ORIGIN 1 bp upstream of HindIII site.
1 aagctttctc gaatcaaacg cgagatgtat caacagctcg gacgtgaagc aaccaacgaa
61 gagcttgcgg aagaatctgg cattgaagaa tccaaaattg aaatgttgct tcgccagtct
121 cgtgacccag taagcctaga tatgccagtt ggtgccgatg aagaggctcc acttggtgac
181 tttatcgaag attccgaggc gactgatgcc gaatcagctg tagttgcgtc gttgcgtcac
241 tctgatatca gaacagtctt ggacactttg gagcaacgtg aacaggacgt tattcgcctt
301 cgttatgggc tcgatgatgg agttccccgc actttagatc aaattggtcg tcaatttgga
361 ctgtctcgtg aaagagttcg ccaaattgaa cgcgaggtca tgtcgaaact gcgtgacggt
421 gcccgtgccg aaaaacttcg agcatacgca caataaagcg tttgcttaga tatgcctacc
481 aataaagaca taaacgccta ttaaaagcaa tctttagatt aggcgttata attaaagtct
541 catcgaaaaa cgcgctgcgg gactacaacg caacaagaaa actattccat atttttcacg
601 ctacaattcg ttgtagattg ataggaattg atcaccagca cacaacagtc tccatggcac
661 tataaggaaa gaggcttaca atgaaggact tagtcgatac cacagagatg tacttgcgta
721 ctatctatga gctggaagaa gagggagtca cccctcttcg cgctaggatc gctgagcgtc
781 tggaacaatc tggacctaca gttagccaaa ccgttgcccg tatggagcgc gatggacttg
841 tcgttgtcgc ctcagaccgc agtctacaaa tgacaccgac aggccgcact ttagcgactg
901 cagttatgcg taaacatcgc ttagctgagc gccttcttac cgatatcatt ggtctagata
961 tcaataaagt tcacgatgaa gcctgccgct gggaacacgt tatgagtgac gaagttgaac
1021 gcaggctcgt gaaagtattg aaagatgtca gtcggtcccc cttcggaaac ccaattccag
1081 gtctcgacga actcggcgta ggcaattctg acgcggcagc ccccggaact cgcgttattg
1141 acgctgccac cagcatgccc cgcaaagtac gcattgttca gattaacgaa atctttcaag
1201 ttgaaacgga tcagtttaca cagctcctcg atgctgacat ccgtgttgga tcagaagtcg
1261 aaattgtaga tagagacggc cacatcacgt tgagccacaa tggaaaagat gtcgaactcc
1321 tcgatgatct ggctcacact attcgtatcg aagaactcta aatactaaag gcggcgaaat
1381 tagatgaaac tcctcgttac cggtggcgcc ggatacgtag gaagtgtctg ttccactgtt
1441 ctgctcgaac aggggcacga agtaacaatt gtcgataatc ttacaaccgg caatcgcgat
1501 gctgttccgc taggagccac ttttgtcgag ggtgatatca aagacgttgc ggaaaacgtt
1561 ttatcatctg attcttttga tgcggttctt cacttcgcgg ctcgttcgct tgtaggcgaa
1621 tcagttgaaa agccagatga atattggcag cacaacatgg tgaccacact cgcacttctt
1681 gatgccatga aacgtaataa cgtgcgaaac attgtctttt cttctacagc tgctacatac
1741 ggcgaacccg aaacagtccc gattactgaa gacgctccta cccacccaac gaatccatac
1801 ggcgcaacca agctgtctat cgactacgcg atcacatcat acgcacatgc atatgggttt
1861 gctgcgacga gtctgcggta cttcaacgta gctggcgcat atggtttggt gggtgaaaat
1921 cgagaaatag aaacccatct cattccgctc gttttacaag tagcactggg acaccgtgac
1981 aagattttca tgtttggtga cgactggcca actgaagatg gaacccctat ccgcgattac
2041 attcatattc gtgatcttgc agatgcccat attttggcac ttcaaagtaa cgtcgaaggc
2101 agccaccgga tctttaatct cggatccggt gaaggatatt cggtcaaaca ggtcattgat
2161 acctgcagag aagttaccgg tcatcccatc cccgctgagg tagctcctcg acgggctggt
2221 gatcctgcag tgctcattgc atcatcggct aaggcgcaat ccgaactagg atggaagcca
2281 caacgcactg atctgcacac gattgtttct gatgcttggg cttttacgtc tcaactcgga
2341 gacaaagccc atagcgcttc tcgtggataa caacgagtag gtttatcttt tgcagtccgc
2401 ggtgtgatta cttaagtatt cacaccgcgg attttacatt tgttgactca ccaaaatgtt
2461 tctgagcttc ctgactccca ttgataacaa gatcaacaat ggatccagat agacctcgat
2521 aataagctt
//
LOCUS ECOANSBA 1643 bp ds-DNA BCT 04-JUL-1990
DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds.
ACCESSION M34234
KEYWORDS L-asparaginase II; ansB gene.
SOURCE E.coli (strain K12 isolate JM108) DNA, clones lambda-ansB and
pansB7.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1643)
AUTHORS Bonthron,D.T.
TITLE L-asparaginase II of Escherichia coli K-12: Cloning and sequencing
of the ansB gene and its position on the chromosomal map
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.T.Bonthron, 11-MAY-1990.
FEATURES from to/span description
pept 346 1392 L-asparaginase II (ansB) (EC 3.5.1.1)
sigp 346 411 L-asparaginase II signal peptide
matp 412 1389 L-asparaginase II
binding 335 339 ribosomal binding site (put.)
site 1400 1425 transcription termination signal (put.)
BASE COUNT 442 a 404 c 405 g 392 t
ORIGIN 63.8 min on K12 map.
1 gcggtaaccc attacagaat gtgctgggaa gcctgggggg gctgcaatcc tcaatccaaa
61 ccgagtggaa aaagcaggaa aaagatttcc agcagtttgg caaagatgtt tgtagccgcg
121 ttgtgactct ggaagatagc cgcaaagccc tggtcgggaa tttaaaataa tcctctattt
181 taagacggca taatactttt ttatgccgtt taattcttcg ttttgttacc tgcctctaac
241 tttgtagatc tccaaaatat attcacgttg taaattgttt aacgtcaaat ttcccataca
301 gagctaaggg ataatgcgta gcgttcacgt aactggagga atgaaatgga gtttttcaaa
361 aagacggcac ttgccgcact ggttatgggt tttagtggtg cagcattggc attacccaat
421 atcaccattt tagcaaccgg cgggaccatt gccggtggtg gtgactccgc aaccaaatct
481 aactacacag tgggtaaagt tggcgtagaa aatctggtta atgcggtgcc gcaactaaaa
541 gacattgcga acgttaaagg cgagcaggta gtgaatatcg gctcccagga catgaacgat
601 aatgtctggc tgacactggc gaaaaaaatt aacaccgact gcgataagac cgacggcttc
661 gtcattaccc acggtaccga cacgatggaa gaaactgctt acttcctcga cctgacggtg
721 aaatgcgaca aaccggtggt gatggtcggc gcaatgcgtc cgtccacgtc tatgagcgca
781 gacggtccat tcaacctgta taacgcggta gtgaccgcag ctgataaagc ctccgccaac
841 cgtggcgtgc tggtagtgat gaatgacacc gtgcttgatg gccgtgacgt caccaaaacc
901 aacaccaccg acgtagcgac cttcaagtct gttaactacg gtcctctggg ttacattcac
961 aacggtaaga ttgactacca gcgtaccccg gcacgtaagc ataccagcga cacgccattc
1021 gatgtctcta agctgaatga actgccgaaa gtcggcattg tttataacta cgctaacgca
1081 tccgatcttc cggctaaagc actggtagat gcgggctatg atggcatcgt tagcgctggt
1141 gtgggtaacg gcaacctgta taaatctgtg ttcgacacgc tggcgaccgc cgcgaaaacc
1201 ggtactgcag tcgtgcgttc ttcccgcgta ccgacgggcg ctaccactca ggatgccgaa
1261 gtggatgatg cgaaatacgg cttcgtcgcc tctggcacgc tgaacccgca aaaagcgcgc
1321 gttctgctgc aactggctct gacgcaaacc aaagatccgc agcagatcca gcagatcttc
1381 aatcagtact aatcgcctcg ccccggtatc gtgccggggc tttttcactt cagactcacg
1441 tccattgcca attttaatta ccctaatgat aatcaccgga ataaattatt ccgcgtgagg
1501 tttttcgggt gaaaaagcaa tggattgttg gcacggcact gcttatgttg atgactggta
1561 atgcctgggc agatggcgaa ccgccaactg aaaatatctt aaaagatcaa ttcaaaaagc
1621 agtatcacgg cattctcaag ctt
//
LOCUS HUMADE5RC 1946 bp ds-DNA SYN 04-JUL-1990
DEFINITION Synthetic human/adenovirus type 5 recombination junction.
ACCESSION M34061
KEYWORDS .
SOURCE Human fetal bone marrow fibroblasts transformed with Adeno5/SV40
recombinant virus (cell line H13.1) DNA, clone pH13.1SV1.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 1946)
AUTHORS Romani,M., De Ambrosis,A., Alhadeff,B., Purrello,M., Gluzman,Y. and
Siniscalco,M.
TITLE Preferential viral integration at the highly recombinogenic
chromosomal site 1p36 in human cells transformed with an
Adeno5/SV40 hybrid
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Romani 08-MAY-1990.
FEATURES from to/span description
recomb 429 430 adenovirus type 5 end/Human DNA start
rpt 430 1134 tandem Alu repeats
BASE COUNT 492 a 548 c 510 g 396 t
ORIGIN Chromosome 1p36.
1 tttgacccgg aacgcggaaa acacctgaga aaaacacctg ggcgagtctc cacgtaaacg
61 gtcaaagtcc ccgcggccct agacaaatat tacgcgctat gagtaacaca aaattattca
121 gatttcactt cctcttattc agttttcccg cgaaaatggc caaatcttac tcggttacgc
181 ccaaaattta ctacaacatc cgcctaaaac cgcgcgaaaa ttgtcacttc ctgtgtacac
241 cggcgcacac caaaaacgtc acttttgcca catccgtcgc ttacatgtgt tccgccacac
301 ttgcaacatc acacttccgc cacactacta cgtcacccgc cccgttccca cgccccgcgc
361 cacgtcacaa actccacccc ctcattatca tattggcttc aatccaaaat aaggtatatt
421 attgatgata tataatatag aggctgaggc aggagggttg cttgagcctg ggaggctgcg
481 gctgcagtga gccatgatca caccactgca cttcagcctg ggcaacagag cgagactccg
541 tctcaaaaaa tagtaatagg ccaggcgcgg tggctcatgc cagtaatccc agcgctttgg
601 gaggctgagg caggcagatc acctgaggtc aggagttcaa gaccagcctg gccaacatgg
661 tgaaaccctg tctctactaa aaatacaaaa attagttggg catggtggcg gcgcacacca
721 gtaatcccag ctactcaaga ggctgagggc acttgggcag cagaggaggt tgtagtgagc
781 agagatcacg ccactgcact ctagcctggg caacagaggg agactctgtc tcaaaaaata
841 ataggccggg cacggtggct catgcctgta atcccagcac tttgggaggc cgaggcgggt
901 ggatcacctg agatcaggag tttaagacca gcctggccaa catggtgaaa ccccatctct
961 actaaaaata caaaaaatta gccaggcgtg gtggcgggcg tctgtaatcc cagctacttg
1021 ggaggctgaa gcaggagaat cacttgagcc caggaggcag aggttgcagt gagctgagat
1081 catgccatta cactccagcc tgggcaacaa gagtgaaact ccgtctcaaa aaataataat
1141 aataatgata ataataataa taattgcatc tattcactga gcgcctcctg tacaccaggc
1201 accatgtggc attaggggcc catgctgtca tcatattctg ccagccaccc atggggtggt
1261 ggggtaattc cccaggccac acctgtccac acctctcttc aggtactgcg tgggcatgtg
1321 cggagacggc gccaatgact gtggggccct gaaggcggct gatgtcggca tctcgctgtc
1381 ccaggcagaa gcctcagtgg tctcaccctt cacctcgagc atggccagta ttgagtgcgt
1441 gcccatggtc atcaggtaag gcaggcaggg accggtgggt gagggctcag cagggctggt
1501 cagccaacct gggtccccta atgtccgtgc cctgccaccc agggaggggc gctgttccct
1561 tgacacttcg ttcagcgtct tcaagtacat ggctctgtac agcctgaccc agttcatctc
1621 cgtcctgatc ctctacacgg tgagtatctg cagagctccg tacccagagc cgtcccagga
1681 ctagggaggg gacacagccc tgtgccttgg agctggcagg tcccagatag atgacgtggg
1741 cccagatcct gaccctgcca ctctccggct gtgtggttgg agaggtggca aggcttgggg
1801 gagatgagaa aaacaccagg cacggagcct ggtacccact aggtcctcac aaatggcagc
1861 agctgtcatg ggctaacatc ttgaccgcaa ctctgggagc tccccaatct gcagggaaga
1921 ctgaacggag gcagggccaa aaccct
//
LOCUS MCLPCCA 1701 bp ds-DNA BCT 04-JUL-1990
DEFINITION M.laminosus phycoerythrocyanin (PCC) alpha and beta subunit genes,
complete cds, and linker protein, 5' end.
ACCESSION M34254
KEYWORDS light-harvesting biliprotein; phycoerythrocyanin.
SOURCE M.laminosus (strain Fischerella PCC 7603) DNA.
ORGANISM Mastigocladus laminosus
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria.
REFERENCE 1 (bases 1 to 1701)
AUTHORS Eberlein,M. and Kufer,W.
TITLE Genes encoding both subunits of phycoerythrocyanin, a light-
harvesting, biliprotein from the cyanobacterium Mastigocladus
laminosus
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Kufer, 11-MAY-1990.
Botanisches Institut
Ludwig-Maximilians- Universitaet Muenchen
Menzingerstr. 67
D-8000 Muenchen 19
Federal Republic of Germany
FEATURES from to/span description
pept 279 797 phycoerythrocyanin beta-subunit
pept 884 1372 phycoerythrocyanin alpha-subunit
pept 1528 > 1701 phycoerythrocyanin linker protein precursor
matp 1531 > 1701 phycoerythrocyanin linker protein
BASE COUNT 526 a 312 c 383 g 480 t
ORIGIN
1 agctttacat ttaaggatat ggcgctcaaa gaatgggagt tttacagtta ttaaataaag
61 ttcataaaaa agaatacaag taatatataa aatacgaaac taaggtttga aagtaattgc
121 aaatatcaag gcaaacgaaa agataaaggc taattgttgc acgtaaaaat ggtatttttt
181 tgatttttag gtgtagttat tggcggattt tatctggtct cggtttgtct tgattaaaaa
241 tgcaatagct gccatgaaat cgaggagaaa atagaaaaat gcttgatgct ttttccagag
301 tagttgaaca agctgataaa aaaggtgctt acctaagcaa tgatgaaata aatgctttgc
361 aggctatagt tgcagatagc aacaaacgtt tggatgtagt gaatcggctt acaagcaacg
421 cctcctcaat tgtggcgaat gcttatcgtg ctttagtagc agaacgtccg caggtattta
481 atcctggtgg tccttgtttt caccatcgca accaagctgc ttgcatccgc gatttaggat
541 ttattctgcg ttatgttacg tactccgtac ttgctggtga cactagtgtc atggacgatc
601 gctgcttgaa tggtttgcgc gaaacctatc aagctcttgg tactcccggt gatgctgtag
661 cttctggaat taaaaaaatg aaagaagctg cacttaaaat tgctaacgat cccaatggca
721 ttaccaaagg tgattgcagt caattaatgt ctgaattagc aagttatttt gaccgagcag
781 cggctgctgt tgcttaatgc agtcaaaggc gcgactaagt gacaaaattt tcaaactctt
841 ttttagtaat taatttgaca agtttttgga gaagaagcga atcatgaaga ccccgttaac
901 cgaagcgatc gccgccgccg atcttcgcgg ttcttatttg agcaacacag aattgcaagc
961 agtatttggt cgtttcaatc gcgctcgtgc tggtttagaa gcagcaagag cattcgctaa
1021 caacggtaag aaatgggcgg aagcagcagc caatcacgta tatcaaaaat ttccctacac
1081 tacccaaatg caaggacctc agtatgcttc tactccagaa gggaaagcaa agtgcgtgcg
1141 ggatatcgac cattacttac gcacaattag ctactgctgt gtagttggcg gtactggtcc
1201 cttggatgac tatgtagttg caggcttaaa ggaatttaac agcgccctgg gtttatctcc
1261 cagctggtac atagctgctc tagaatttgt ccgcgacaat cacggtttga ccggtgatgt
1321 tgctggtgaa gccaacacct acatcaacta tgcaattaat gcattgagct aaatctaaac
1381 agcgaatgtc ctttgtattt gcttaatagc taatggctga tagctgatgg ctaatggcta
1441 atggaaaaat gcaattagca attagcagtg agttattggc cattaactgt gtttgagaaa
1501 aatgataaaa atggagcgat tgttaacatg agtacatcag tcgcagaacg cctcgcaatt
1561 aaagatgaag tagacaaaaa aattgaattg cgccctaatt ggagcgaaga cgagctgcaa
1621 attgttttta aaacagccta cgagcaagtt tttggacgtc agggattata tgccagtcaa
1681 agatttgcaa ctgccgaagc t
//
LOCUS YSCTSDNAA 238 bp ds-DNA PLN 04-JUL-1990
DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-ii.
ACCESSION M34310
KEYWORDS .
SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-ii.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 238)
AUTHORS Wang,S.-S. and Zakian,V.A.
TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA
polymerase reveals two domains
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-S.Wang, 16-MAY-1990.
BASE COUNT 0 a 0 c 146 g 92 t
ORIGIN
1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg
61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg
121 gtgtggtgtg tgggtgtggt gtgtgggtgt ggtgtgggtg tgggtgtggt gtgtgtgggt
181 gtggtgtgtg ggtgtgggtg tggtgtggtg tgtgtgggtg tgtgggtgtg gtgtgtgt
//
LOCUS YSCTSDNAB 297 bp ds-DNA PLN 04-JUL-1990
DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-i.
ACCESSION M34311
KEYWORDS .
SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-i.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 297)
AUTHORS Wang,S.-S. and Zakian,V.A.
TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA
polymerase reveals two domains
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-S.Wang, 16-MAY-1990.
BASE COUNT 0 a 0 c 184 g 113 t
ORIGIN
1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg
61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg
121 gtgtggtgtg ggtgtgggtg tgggtgtggt gtgtgtgtgg gtgtggtgtg tgggtgtggg
181 tgtgtgtggg tgtgggtgtg ggtgtggtgt ggtgtgtggg tgtggtgtgt gggtgtggtg
241 tgtgtgggtg tggtgtgtgt gggtgtgggt gtgtgtgggt gtgggtgtgg tgtgtgg
//
LOCUS YSCTSDNAC 325 bp ds-DNA PLN 04-JUL-1990
DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-ii.
ACCESSION M34312
KEYWORDS .
SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-4-ii.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 325)
AUTHORS Wang,S.-S. and Zakian,V.A.
TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA
polymerase reveals two domains
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-S.Wang, 16-MAY-1990.
BASE COUNT 0 a 0 c 199 g 126 t
ORIGIN
1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg
61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg
121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg gtgtgtgggt gtggtgtggg tgtggtgtgt
181 gggtgtgtgg gtgtgggtgt gtgtggtgtg tggtgtgtgt ggtgtgggtg tggtgtgtgg
241 gtgtgggtgt ggtgtgtggg tgtgggtgtg gtgtgtgtgt gtgggtgtgg tgtgtgggtg
301 tgggtgtggt gtgtgggtgt gtggg
//
LOCUS YSCTSDNAD 248 bp ds-DNA PLN 04-JUL-1990
DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-i.
ACCESSION M34313
KEYWORDS .
SOURCE S.cerevisiae (strain XS595-6C) extrachromosomal plasmid DNA, clone
YLP108CA-4-i.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 248)
AUTHORS Wang,S.-S. and Zakian,V.A.
TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA
polymerase reveals two domains
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-S.Wang, 16-MAY-1990.
BASE COUNT 0 a 0 c 151 g 97 t
ORIGIN
1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg
61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg
121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg tgtgtgtggg tgtggtgtgg gtgtggtgtg
181 tgggtgtgtg tgggtgtggt gtgtgtgggt gtgtgggtgt ggtgtgtgtg ggtgtggtgt
241 gtgggtgg
//
LOCUS BOVMET 1047 bp ss-mRNA MAM 04-JUL-1990
DEFINITION Bovine metalloproteinase inhibitor mRNA, complete cds.
ACCESSION M32303
KEYWORDS metalloproteinase inhibitor.
SOURCE Bovine aorta endothelium, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1047)
AUTHORS Boone,T.C., Johnson,M.J., De Clerck,Y.A. and Langley,K.E.
TITLE cDNA cloning and expression of a metalloproteinase inhibitor
related to tissue inhibitor of metalloproteinases
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2800-2804 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
K.E.Langley, 23-FEB-1990, for release after publication.
FEATURES from to/span description
pept 291 953 metalloproteinase inhibitor precursor
sigp 291 368 metalloproteinase inhibitor signal peptide
matp 369 950 metalloproteinase inhibitor
signal 1042 1047 polyA signal
BASE COUNT 220 a 350 c 312 g 165 t
ORIGIN
1 gaattccggc ttctatggag cactcgggac caggtccgcg gcgcgcgcac tcgctcgctc
61 gccgcccccc agccagctct cgcttccgcg ccgccagccg cgccccgcgc ctcctcgctg
121 caccccgcga cctagagcca agaaagtttg tgtggcgagt gagggccgga gaggagagcg
181 cgcccgcgga gtgccgtcca gaccagcgcg gccccggcgg agaggggagc gccccgagcc
241 caggcggcgg cggctagccc gagtccgcga cccccgcccc tccgcccgcc atgggcgccg
301 ccgcccgcag cctgccgctc gcgttctgcc tcctgctgct ggggacgctg ctcccccggg
361 ccgacgcctg cagctgctcc ccggtgcacc cgcaacaggc gttttgcaat gcagacatag
421 tgatcagggc caaagcagtc aataagaagg aggtggactc tggcaacgac atctacggca
481 accccatcaa gcggattcag tatgagatca agcagataaa gatgttcaag ggacctgatc
541 aggacataga gtttatctac acagcccccg ccgctgccgt gtgtggggtc tcgctggaca
601 ttggaggaaa gaaggagtat ctcattgcag ggaaggccga ggggaatggc aatatgcata
661 tcaccctctg tgacttcatc gtgccctggg acaccctgag tgccacccag aagaagagcc
721 tgaaccacag gtaccagatg ggctgtgagt gcaagatcac tcgatgcccc atgatcccat
781 gctacatctc ctctccggac gagtgcctct ggatggactg ggtcacggag aagaacatca
841 acggacacca ggccaagttc ttcgcctgca tcaagagaag cgacggctcc tgcgcctggt
901 accgcggagc agcacccccc aagcaggagt ttctggacat cgaggacccg taagcaggcc
961 accaggactc ctggggccaa ttgacagtgt ccaagagttc agactggtcc agctccgaca
1021 tcccttcctg gacacagcat gaataaa
//
LOCUS CFDCG 1291 bp ss-DNA circular VRL 04-JUL-1990
DEFINITION Coconut foliar decay virus, complete genome.
ACCESSION M29963
KEYWORDS circular; complete genome.
SOURCE Coconut foliar decay virus DNA, clones NMORG, pORG, and pORGSAU3A.
ORGANISM Coconut foliar decay virus
Viridae; ss-DNA viruses.
REFERENCE 1 (bases 1 to 1291)
AUTHORS Rohde,W., Randles,J.W., Langridge,P. and Hanold,D.
TITLE Nucleotide sequence of a circular single-stranded DNA associated
with coconut foliar decay virus
JOURNAL Virology 176, 648-651 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
W.Rhode, 15-MAR-1989, for release after publication.
FEATURES from to/span description
pept 103 975 ORF 1
pept 314 775 ORF 2
pept 639 797 ORF 3
pept 1098 1286 ORF 4
pept 987 823 (c) ORF 5
pept 568 422 (c) ORF 6
site 40 70 stem-loop structure
BASE COUNT 336 a 323 c 332 g 300 t
ORIGIN
1 cgccaaaaac ctctgctaag tcccgtgcta agtataaata gccgcggggg ctagtattac
61 ccccgcggct ccccaacctc tgctaacccc gcttggctat aaatgggttc ctccattcgc
121 cgctggtgct tcactctgaa ctacgagaca gaggaagagg cggcgaatgt tgtaaggcgc
181 atcgagtccc ttaacttggt ctatgctata gttggggacg aggttgcacc ttcaactggc
241 caacgacacc tccaaggatt catccacttg aagaccggtc ggcgactgca aggattgaag
301 actgttcttg ggaatgacag gattcacctg gagccgaccc gtggttccga cgaacagaat
361 agagactact gttcgaagga acgggtgctt ctcgagcacg gagtcccgac tcgtcctgga
421 gtcaaaaggc cacgattggc ccaacgattt gctgaggaac ctgatgaact ccgcctggaa
481 gacccaggcg gataccgaag atgcgttgta cacggagctt cggtggaatg gacaagatgg
541 gccgctgaaa atccgttccc atttccatat cacaattggc agcttgaagt gctgtctgcg
601 atcggagagc cagcggacga tcgcacaatc ctctggatat gcggacgaga cggaggagac
661 gggaagtccg tgtttgccaa atatctcgga ctcaagcccg actggttcta cacatgtggt
721 ggaaccagaa aggacgtatt gtaccagtac atcgaggacc caaaacgaaa tttaatcctc
781 gatgtaccca ggtgtaattt agagtattta aattatgccc tgttagaatg tgttaagaac
841 agggcattca gttcggacaa atacgaaccc cttagttatc ttgggttcga ccatgtgcat
901 gtactcgtat ttgccaatgt cctgcctgat tatttgaaaa tcagcaggga cagaataaaa
961 ctgtggaata tttaaagtat gtgtcatcta aattacacca atacccgccc gcccacgcgc
1021 tatcgtttac atcttatgaa tatcctgccc aggccgaagg cctgggaggt gctacccggc
1081 cgaaggccgg gaacaatatg aatcgagtta tgggcgggcc cacaataaaa gattccattt
1141 ggataagaac gaatctgtta ctttgcttgc agtgcacgca accactttcc acgtcaccaa
1201 tccaggtgag tagcttgctg gagaagaaag ccgcaagcct ctatctaccg tccatttgtt
1261 tttgcgcgat cggacggctg agttgatctg g
//
LOCUS HUMMET 1035 bp ss-mRNA PRI 04-JUL-1990
DEFINITION Human metalloproteinase inhibitor mRNA, complete cds.
ACCESSION M32304
KEYWORDS metalloproteinase inhibitor.
SOURCE Human fetal aorta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1035)
AUTHORS Boone,T.C., Johnson,M.J., De Clerck,Y.A. and Langley,K.E.
TITLE cDNA cloning and expression of a metalloproteinase inhibitor
related to tissue inhibitor of metalloproteinases
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2800-2804 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
K.E.Langley, 23-FEB-1990, for release after publication.
FEATURES from to/span description
pept 255 917 metalloproteinase inhibitor precursor
/nomgen="TIMP" /map="Xp11.3-p11.23"
/hgml_locus_uid="LJ0125P"
sigp 255 332 metalloproteinase inhibitor signal peptide
matp 333 914 metalloproteinase inhibitor
signal 1011 1016 polyA signal
BASE COUNT 216 a 361 c 306 g 152 t
ORIGIN
1 gaattccggc ccgccgtccc ccaccccgcc gccccgcccg gcgaattgcg ccccgcgccc
61 ctcccctcgc gcccccgaga caaagaggag agaaagtttg cgcggccgag cggggcaggt
121 gaggagggtg agccgcgcgg gaggggcccg cctcggcccc ggctcagccc ccgcccgcgc
181 ccccagcccg ccgccgcgag cagcgcccgg accccccagc ggcggccccc gcccgcccag
241 ccccccggcc cgccatgggc gccgcggccc gcaccctgcg gctggcgctc ggcctcctgc
301 tgctggcgac gctgcttcgc ccggccgacg cctgcagctg ctccccggtg cacccgcaac
361 aggcgttttg caatgcagat gtagtgatca gggccaaagc ggtcagtgag aaggaagtgg
421 actctggaaa cgacatttat ggcaacccta tcaagaggat ccagtatgag atcaagcaga
481 taaagatgtt caaagggcct gagaaggata tagagtttat ctacacggcc ccctcctcgg
541 cagtgtgtgg ggtctcgctg gacgttggag gaaagaagga atatctcatt gcaggaaagg
601 ccgaggggga cggcaagatg cacatcaccc tctgtgactt catcgtgccc tgggacaccc
661 tgagcaccac ccagaagaag agcctgaacc acaggtacca gatgggctgc gagtgcaaga
721 tcacgcgctg ccccatgatc ccgtgctaca tctcctcccc ggacgagtgc ctctggatgg
781 actgggtcac agagaagaac atcaacgggc accaggccaa gttcttcgcc tgcatcaaga
841 gaagtgacgg ctcctgtgcg tggtaccgcg gcgcggcgcc ccccaagcag gagtttctcg
901 acatcgagga cccataagca ggcctccaac gcccctgtgg ccaactgcaa aaaaagcctc
961 caagggtttc gactggtcca gctctgacat cccttcctgg aaacagcatg aataaaacac
1021 tcatccccgg aattc
//
LOCUS HUMACEX 1141 bp ds-DNA PRI 04-JUL-1990
DEFINITION Human angiotensin-converting enzyme (ACE) gene, 5' end.
ACCESSION M34434
KEYWORDS angiotensin-converting enzyme.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1141)
AUTHORS Shai,S.-Y., Langford,K.G., Martin,B.M. and Bernstein,K.E.
TITLE Genomic DNA 5' to the mouse and human angiotensin-converting enzyme
genes contains two distinct regions of conserved sequence
JOURNAL Biochem. Biophys. Res. Commun. 167, 1128-1133 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1136 > 1141 angiotensin-converting enzyme
BASE COUNT 169 a 427 c 364 g 181 t
ORIGIN
1 ctgtccctgg ctccttccct gatcccaccg ccagcctcac cccacggttc ctccattgcc
61 ccacctccca ctgcgccgcc gggcctctgc cagggtcaag gggcttcccc cctctggcag
121 cagacgccat ggtgccgagg tggcctccac aaccgccctg tgcgccaata ggacaagact
181 gtcctccctc ccccacactt gtcactttga gggacacgtg gatgagacag gaaaacacag
241 gggagtgtgg agacctgagg tgacttggag caagcctctc aacctgagcg gcaatttctt
301 catctgtaaa atgagggggt tgttctcatc tctgaggctt tgtgtcgctc tcaaagcctg
361 ctagcctcgg gttctaggac tctgttggga tcgtgtgtga tgttttctgc tgagcgacgg
421 cagcctgtgt cctcgggggg aaagagggca ggcgctccaa agctcctgcg ctctgtggct
481 ccccctccct cgcagcccca agccccaggt gtgccggccg ccctgagccc ctccagcacc
541 tcccggaggc gcctgcaaga cacctaaggt ccccgcctcc ctcctctccc ccccgccaca
601 cccctacccc cggcaggcga cgtccccgcc cctcgaccat ggcctggtga agaagccggc
661 caggcccgat cagccccatc cccgccgcac gagcggcgcc tgcggacagc tcctggggcc
721 ccggccttgt cactccggag gcgggaggct ccggggggtc gggctgggaa gatcgagccg
781 gaggccgcta ggctcccagg ccccggccga ggctgcgcgg ccgcacggtg ggcaggctcg
841 ggtgttccgg caaactgccg ggtccccatc ttcaaaagag aggaggccct ttctccagct
901 tcctctgcgg gagcccgacc cagccccatc ccgccacccc cgggctgcac ctcggcccct
961 ccccggcccg cgcccctgcc cggggcgggc caggaacctc ggcccgcgcc gctggggact
1021 ttggagcgga ggaggaagcg cggcggggcg ggggcggggg tgtgtcgggt tttataaccc
1081 gcagggcggc cgcggcgcag gagaaggggc agagccgagc accgcgcacc gcgtcatggg
1141 g
//
LOCUS MURU3REG 492 bp ss-RNA VRL 04-JUL-1990
DEFINITION Murine retrovirus SL3-2 U3 region.
ACCESSION M34383
KEYWORDS long terminal repeat.
SOURCE Murine retrovirus SL3-2, cDNA to viral RNA.
ORGANISM Murine retrovirus SL3-2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses.
REFERENCE 1 (bases 1 to 492)
AUTHORS Dai,H.Y., Etzerodt,M., Baekgaard,A.J., Lovmand,S., Jorgensen,P.,
Kjeldgaard,N.O. and Pedersen,F.S.
TITLE Multiple sequence elements in the U3 region of the leukemogenic
murine retrovirus SL3-2 contribute to cell-dependent gene
expression
JOURNAL Virology 175, 581-585 (1990)
STANDARD simple staff_review
BASE COUNT 158 a 136 c 114 g 84 t
ORIGIN
1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc
61 atgggaaaaa taccagagct gatgttctca gaaaaacaag aacaaggaag tacagagagg
121 ctaaaaagta ccgggactag ggccaagaac agatggtccc cagaccgcta actgcagtaa
181 cgacaggata tctgtggtta agcactaggg ccccggccca gggccaagaa cagatggtcc
241 ccagaccgct aacgacagga tatctgtggt taagcactag ggccccggcc cagggccaag
301 aacagatggt ccccagaaat agctaaaaca acaacagttt caagagaccc agaaactgtc
361 tcaaggttcc ccagatgacc ggggatcaac cccaagcctc atttaaacta accaatcagc
421 tcgcttctcg cttctgtacc cgcgcttatt gctgcccagc tctataaaaa gggtaagaac
481 cccacactcg gc
//
LOCUS MUSACEX 1299 bp ds-DNA ROD 04-JUL-1990
DEFINITION Mouse angiotensin-converting enzyme (ACE) gene, 5' end.
ACCESSION M34433
KEYWORDS angiotensin-converting enzyme.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1299)
AUTHORS Shai,S.-Y., Langford,K.G., Martin,B.M. and Bernstein,K.E.
TITLE Genomic DNA 5' to the mouse and human angiotensin-converting enzyme
genes contains two distinct regions of conserved sequence
JOURNAL Biochem. Biophys. Res. Commun. 167, 1128-1133 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1294 > 1299 angiotensin-converting enzyme
BASE COUNT 262 a 393 c 369 g 275 t
ORIGIN
1 ggatcctctt tctgaactcc ataggtacca ctctcagaat cccctcctaa ccctccacca
61 gtacagcctg cttcctccag aaacacaaca gctcacacaa acccacagcc ctcaaatcct
121 gggagcccaa gcaatatgga ccacctaagt tctcagcagg tggccccttg ctccttcctc
181 ctttctccat ctctacccgt cccctgggac ctctttggga gcaagtcacc tgacaggcca
241 gcacagtctc actctgagtc cttgcaggtc acacaatctc ccatgtgaac tccatcaccc
301 catctggcaa ggacaagggt ccgcccacct ctttgggggc acagccctgg tgctcagagg
361 ttaacttcac aggctgctga tgcatttaga ggacctgatt ctccttcccc caggcctgtc
421 actttgaggg acgggtggag gagacaggga aatgactggg agtgtggaga cctgaggtga
481 cttgaagcca atctgtctca ggggtaattt ctccattttt aaaaagcagg gggttgttgt
541 aatctctaag gcacattttt ttttagcttt cagagcgtgc aactgcgagg ttctagggct
601 ctgttgggac cacacatgat ggttcttgtg agttgaacag ctgtgtatcc aaagagtagg
661 ggcaggcatg ccagtgctga tggcctagtc tatggctccc cctcatagag cagaatagac
721 tctatttcga ggcatcccaa caaccgaagc ccaaggtgaa ctggttgctg acatctgcag
781 cacctagcca aggcgcctgt caggacagct aaacttcccg cgcccacgca ggcgacccct
841 ttcctccatg acctggcaac cttaagtcca tcccggtggc accagcctgc cacccccgcc
901 cccttgtcac tcaagagccc ggaaaggctt ggagggttat gccaagcctg ctgggctctg
961 tggcctgaga ctacgtggtg gcggccagag ggcacggttg ggccgcagca gtgtttgcag
1021 cgggcaggga cccttttcta gcttcctctg agagagcccg atctagcccc atttctccat
1081 ccaggctgct ccctggcccc acagggctct ccttgcccag ggcgggcggg gaatcagccg
1141 ggttcgcgct gccggggact ttggagcgag gaggaagcgc ggaggggcgg ggaggtgggg
1201 gtgtgtcggg aggcggtggt ggctgggttt tataatccgc agggcggtcg cggcacgaga
1261 gaaggggcag cgccgcaccg cgcgcaccgc gccatgggg
//
LOCUS MUSOCT3 1324 bp ss-mRNA ROD 04-JUL-1990
DEFINITION Mouse octamer binding transcription factor (oct-3) mRNA,
complete cds.
ACCESSION M34381
KEYWORDS octamer binding transcription factor.
SOURCE Mouse embryo carcinoma cell line P19, cDNA to mRNA, clone
lambda-C1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1324)
AUTHORS Okamoto,K., Okazawa,H., Okuda,A., Sakai,M., Muramatsu,M. and
Hamada,H.
TITLE A novel octamer binding transcription factor is differentially
expressed in mouse embryonic cells
JOURNAL Cell 60, 461-472 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 50 1183 octamer binding transcription factor (oct-3)
mRNA 1 1324 oct-3 mRNA
BASE COUNT 297 a 361 c 400 g 266 t
ORIGIN
1 gtgagccgtc tttccaccag gcccccggct cggggtgccc accttcccca tggctggaca
61 cctggcttca gacttcgcct cctcaccccc accaggtggg ggtgatgggt cagcagggct
121 ggagccgggc tgggtggatt ctcgaacctg gctaagcttc caagggcctc caggtgggcc
181 tggaatcgga ccaggctcag aggtattggg gatctcccca tgtccgcccg catacgagtt
241 ctgcggaggg atggcatact gtggacctca ggttggactg ggcctagtcc cccaagttgg
301 cgtggagact ttgcagcctg agggccaggc aggagcacga gtggaaagca actcagaggg
361 aacctcctct gagccctgtg ccgaccgccc caatgccgtg aagttggaga aggtggaacc
421 aactcccgag gagtcccagg acatgaaagc cctgcagaag gagctagaac agtttgccaa
481 gctgctgaag cagaagagga tcaccttggg gtacacccag gccgacgtgg ggctcaccct
541 gggcgttctc tttggaaagg tgttcagcca gaccaccatc tgtcgcttcg aggccttgca
601 gctcagcctt aagaacatgt gtaagctgcg gcccctgctg gagaagtggg tggaggaagc
661 cgacaacaat gagaaccttc aggagatatg caaatcggag accctggtgc aggcccggaa
721 gagaaagcga actagcattg agaaccgtgt gaggtggagt ctggagacca tgtttctgaa
781 gtgcccgaag ccctccctac agcagatcac tcacatcgcc aatcagcttg ggctagagaa
841 ggatgtggtt cgagtatggt tctgtaaccg gcgccagaag ggcaaaagat caagtattga
901 gtattcccaa cgagaagagt atgaggctac aggacacctt tcccaggggg ggctgtatcc
961 tttcctctgc ccccaggtcc ccactttggc accccaggct atggaagccc ccacttcacc
1021 acactctact cagtcccttt tcctgagggc gaggcctttc cctctgttcc cgtcactgct
1081 ctgggctctc ccatgcattc aaactgaggc accagccctc cctggggatg ctgtgagcca
1141 aggcaaggga ggtagacaag agaacctgga gctttggggt taaattcttt tactgaggag
1201 ggattaaaag cacaacaggg gtggggggtg ggatggggaa agaagctcag tgatgctgtt
1261 gatcaggagc ctggcctgtc tgtcactcat cattttgttc ttaaataaag actggacaca
1321 cagt
//
LOCUS PP1SIM 1900 bp ds-DNA PHG 04-JUL-1990
DEFINITION Bacteriophage P1 sim region proteins, complete cds.
ACCESSION M34382
KEYWORDS sim region.
SOURCE Bacteriophage P1 viral DNA.
ORGANISM Bacteriophage P1
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 1900)
AUTHORS Maillou,J. and Dreiseikelmann,B.
TITLE The sim gene of Escherichia coli phage P1: Nucleotide sequence and
purification of the processed protein
JOURNAL Virology 175, 500-507 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 470 718 sim region ORF 1
pept 791 1570 sim region ORF 2
BASE COUNT 636 a 323 c 360 g 581 t
ORIGIN
1 ccccatcacg ccccaccata attcgggcgt aacgcggttt acggacacaa aaataccgca
61 atatcggaaa tctgcggttg tccgcactaa cattcaggct gtcaaaaccg gtcgcagaat
121 ttgctacgac ggtggaacta taagcctgaa cgattaaaag gtcaatatga tgcgaaaaga
181 tagcattcgc gacttaaaaa tacaaattta ttagagcatt atttgtttaa taaatacaca
241 gttggatcta ataacctctt ttttttaaag gcgaaaatat gtaccctaaa tgagttataa
301 ggcaggtgag gttataatga gaaaactatt actaccgtta ttatttatgg ctgggactgt
361 taatgcagca tcaagcgtaa aggagatttg taccgattat acgaaatacc ttgggcacgt
421 ttacgccttt gcgatcaatg actattaatc cagtgctagg agtggattta tgctgattcg
481 tttgttttta gtgctttcct ttttaacatt taatgttttt gctgatgaag ttgacttttc
541 gaaggtagat tgcaattcag tggaaacaag aaaagctctt attgaagaat ataacgaaat
601 attatcgtca tatggaataa cagtggttga ttcttataat caaaaaacta ttcagaaagg
661 aataaataaa ctggtctgtt atggggttta ccaatattca gatggctctt cggagtagtt
721 tatttataag catacccaaa tgtctgtgaa ttaattagtg agtttaaacc gattaatgag
781 tgaatggaaa atgaaattat ttaatgtaat aacattttgt tgtgctattt ttgctggaag
841 cgcgatagct gataataaat tgccagattg gctttctacc tcaaagaaag attatgattt
901 agtaagggca ttctatttgt ctggatttgc ttcgaaagca atgaacaatc aatttggtta
961 tcatttgcca tctgagttgg ttaatgattt taaagataat gaatttgctg ctcaggaaaa
1021 atggaacaca attccaattg tgtatggtga aataaaatcc ataagaatgg tgaataataa
1081 accaattgta gaattattta ctccagggga aaatgcaacg cctataaatt atatcaaatt
1141 gaaaatattg gattcaaagc aagactctct gttaaaacta aaaaaagggg atgatatata
1201 tgcagtgtgc tccggtgcta attttagctt agtgccaatt ctgagcaact gcactccagc
1261 aacagacgtc attgatgctg cactctcttt ttctggtgaa tatatgttcc ctgcttttga
1321 ttctttttca cctactaagc aaaacgtcaa atatatattc acaaatcaag atcctgttca
1381 gatgatgaat tttatagggt acctatcctt agtcgataca acgaaggata agaataaaat
1441 ggatatggtt cgtaagtgta cgccttggaa gccggaatgt tcacaacaat tcgttgatgt
1501 aatggaagga tttgatagca tcatgtataa atatgaggga gaatttaaaa actacataga
1561 attaaaatag tgtttgaaag gatagtcaat ttaagtttta acaccaacgc ccattaaagg
1621 gcttttattg ttttactcaa aacaacctga ttagtgtata accatatcat atccctcatt
1681 ccacctacac tgattacccc cagacaacaa tattcctact caatgaacaa atgactactc
1741 gtagaatcgg ttaacacacc agattctacg aggtttcaat gacaccacga caattactcg
1801 aagacgtcaa aacccgcttc acacctttga ttgcggatga accggcctta ctggaatccc
1861 tgctaagaaa agcattggga acctaccagg atagggcggg
//
LOCUS PSEPST1 1446 bp ds-DNA BCT 04-JUL-1990
DEFINITION Plasmid pPST1 (from Pseudomonas stutzeri) ORF1 gene, complete cds.
ACCESSION M34385
KEYWORDS .
SOURCE Plasmid pPST1 (from Pseudomonas stutzeri, strain MO-19) DNA.
ORGANISM Pseudomonas stutzeri
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1446)
AUTHORS Fujita,M., Kubota,M., Futai,M. and Amemura,A.
TITLE Identification and DNA sequencing of a new plasmid (pPST1) in
Pseudomonas stutzeri MO-19
JOURNAL Plasmid 22, 271-274 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 576 1034 ORF1
BASE COUNT 267 a 446 c 425 g 308 t
ORIGIN
1 gcatgccatc gacccccgtc aataccttga gcacatgctg cgtccagcgc gtagccatcc
61 acatctgctc cgggctggcg taggcgtaaa cctgcttggc cttggcgttc tgcggcttga
121 gttccagttc tgcccgaacg tggtcaggat cgcccccggt ctgcttgccc ttttcgtaga
181 tgcgcttcat ggctgccgag gtgcgcgacc cgatgtagag cgtccgcccg tcttgctcgc
241 ggtgagagtc gcctacatgc gtgaccttga ggccgaactt gtctgccgtc tccagaccca
301 gcgcgtagag agagtcccac gcccccggct cgcagtagtc gatagtcacg tcagccctga
361 gtaggttgtg ccccctgaac tcatcccgaa cgacagcagc aaaggcgggt gcccggtcgc
421 cgcttgcaga ggcccatacg cgcgtcccga cggtgttacc gccccactgc acccgtgcaa
481 gcacagaatc gcctctgacg acgttatagg cccgttcata gccatttttc ggggtggtcg
541 gcgttacgtc cgaacgactc gaaagagccg gccagatgat caaggaaccc ggtcgggttt
601 ggcctcgatg ctggcggtgt agtagtcgaa cctcattttt cacccccaga ggctctagag
661 cgcttcgcgg ggtttttttg gctcggtgtg taggggggtt aggtgttgtg acccccgtag
721 ttacctatgc gggggttggg cggtctgcaa acatgatcca tgagagcgac gcaaaggccc
781 gagcatctcg ttcatgtgag cctctccaga ctcaaggcct agagaccatc ccggccctag
841 attcaacgtg ccagcgcccg ggctttctgg caggcccgga gaaccctttg ttggacaggg
901 ttaaccatag tcaggaactc cgagacaaga cccgacatcg tgacgccctg aacctcagaa
961 agacccctga tcgtgtcgta cacgtcctga tcgagcgtaa ccgtgattct cggtttctgt
1021 gtggccattg gtgatgcact ccgggcaaag tgatgcaccg aatgtacgca gccgaaggcg
1081 aaaggcaacc cgtgcgaatt gtttacatgt gcgcatgagc gcatgagcgc atgtgatttg
1141 tgtgcatgtg atttgtatgc atgtgcgcat actggcgcgc tacgcttgcc cgccgagccg
1201 agttgccggc tcgtaccggc gctacgctcc cggcccgatc cgggcacctc gacccggccc
1261 gacgaatttg tccgcgccgg tggacttatt tgtccgacac tagccgagca acgaacagcc
1321 cgaacgactc aatatcgccg cccagtgcct ccagcttgcg gcaatagttg ccgtactgct
1381 tagccatgaa ctcaagagcg cgctcatcat cggtaagctg gcggatcgtt ccaggcggtg
1441 ctggct
//
LOCUS BNACRUCA 1686 bp ss-mRNA PLN 04-JUL-1990
DEFINITION B.napus cruciferin mRNA, 3' end.
ACCESSION J05233
KEYWORDS 12S storage globulin; cruciferin; seed development protein.
SOURCE B.napus (strain Svaloefs Karat) seed, cDNA to mRNA, clone pCRU1.
ORGANISM Brassica napus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 1686)
AUTHORS Roedin,J., Ericson,M.L., Josefsson,L.-G. and Rask,L.
TITLE Characterization of a cDNA clone encoding a Brassica napus 12 S
protein (cruciferin) subunit: Relationship between precursors and
mature chains
JOURNAL J. Biol. Chem. 265, 2720-2723 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 1521 cruciferin precursor (AA at 1)
sigp < 1 60 cruciferin signal peptide
matp 61 948 cruciferin alpha-subunit
matp 949 1518 cruciferin beta-subunit
signal 1543 1548 polyA signal
signal 1671 1676 polyA signal
BASE COUNT 462 a 455 c 415 g 354 t
ORIGIN
1 ccttgcgaga cagccgtcgc aacgttcggg gttctcctcg tcctcaacgg ctgtctcgca
61 aggcagtcgc taggggttcc tcctcagcta gggaacgcgt gtaacctcga taacttagac
121 gttctccagc ctaccgaaac tatcaagagc gaggctggtc gggtcgagta ctgggatcac
181 aacaatcctc agatccgatg tgctggtgtc tctgtctctc gtgttataat cgaacaaggc
241 ggtctctacc ttcctacctt cttcagctcc cccaaaattt caatcgttgt tcaaggaatg
301 ggtattagcg gaagagtggt ccctggatgc gcggaaacct tcatggactc gcagcctatg
361 caaggacaac aacaaggtca accatggcag ggacaacaag gacaacaggg tcagcaggga
421 caacaaggtc aacagggtca gcagggacaa caaggtcaac agggtcagca gggtcaacag
481 ggacagcagg gtcagcagca gcaagggttc cgtgacatgc accagaaggt cgaacatgtt
541 cgacatggag acatcattgc cattactgca ggctcttccc attggatcta caacaccggt
601 gaccagccac ttgtcattat ctgccttctc gacattgcca actaccaaaa ccaactcgac
661 cgcaacccaa gaacgttccg tctggccgga aacaacccac agggcggttc ccagcagcag
721 cagcaacaac aacagaacat gttgagcggg ttcgaccctc aggtcctagc ccaggcattg
781 aaaatcgacg ttaggttggc tcaggagctt cagaaccaac aagacagcag aggaaacatc
841 gttcgtgtta agggaccttt ccaggttgtg aggccgcctc ttagacagcc atacgagagt
901 gagcagtgga gacacccccg tggcccacca caaagcccac aagacaacgg cttggaggag
961 actatctgca gcatgaggac ccacgagaac attgatgacc cagcccgtgc tgacgtgtat
1021 aagcccaacc tcggccgtgt gactagcgct aacagctaca ctttacccat cttgcagtat
1081 atcagactca gcgccacccg tggcattctc cagggtaatg cgatggtgct tccgaaatac
1141 aacatgaacg cgaacgagat cttgtactgc actcaaggac aagcaaggat tcaagtggtg
1201 aacgacaacg gacagaacgt gctggaccag caggtgcaga agggacagct cgtggtcatc
1261 ccacaaggat tcgcctatgt tgtccagtcc caccaaaaca acttcgaatg gatttctttc
1321 aagacaaacg ctaacgcgat ggtcagcact ttggccggta gaacctcggc cttgagggca
1381 ttgccactag aggtcataac caacgctttc caaatttctc tcgaggaagc tagaaggatc
1441 aagttcaaca cgcttgagac cactttgact cgtgcgcgcg gtggacaacc ccagttgatc
1501 gaggagatag tcgaggctta agttaaaacg tttactttta ctaataaagt agtgacatgg
1561 ttactattgt aatggtcagt ttgtaatcat gtccactcta agtttttaac gtatgtgtaa
1621 aatatgtgtc taaagaacac gccggcacgt ctcttgtatg taacccttct aataaatacc
1681 cccccc
//
LOCUS BS2ORIA 2279 bp ds-DNA BCT 04-JUL-1990
DEFINITION Plasmid pBS2 origin of replication.
ACCESSION M34386
KEYWORDS origin of replication.
SOURCE Plasmid pBS2 DNA.
ORGANISM Plasmid pBS2
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 2279)
AUTHORS Darabi,A., Forough,R., Bhardwaj,G., Watabe,M., Goodarzi,G.,
Gross,S.C. and Watabe,K.
TITLE Identification and nucleotide sequence of the minimal replicon of
the low-copy-number plasmid pBS2
JOURNAL Plasmid 22, 281-286 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 195 470 ORF1
pept 1139 1948 ORF2
binding 169 177 ribosomal binding site (put.)
binding 1110 1117 ribosomal binding site (put.)
orgrpl 865 1008 origin of replication DNA
BASE COUNT 699 a 343 c 562 g 675 t
ORIGIN 1 bp upstream of HindIII site.
1 aagctttgtt gaatatgggg aatagttata accagatggg aagccttttt tcggctgtac
61 cttattatca caaggctata aaggcagcta aaataagcgg agcgaaggaa attacccagg
121 cttattatga tttagctctg attcatttta ggaacaatga aaagattgaa ggacgcggat
181 tctttggaaa agcaatggag caagctgaag aatttaacga tagtctgttt caggatttat
241 taaatgttct aaaggcatta tttattgaga ctggaagcag gcaaaaggtc atgaatgccc
301 ttgaggcgtt gagaacggga caaggctatc catattttga agagctagct ttaatcgctg
361 cagaatttta tactatggat aagcgcatgg aagattctat atatttttac aatgagatgg
421 tgtgcgctca aagacaaatt caaaggggcg atttcctgta tgaagtttaa aggtcttttt
481 tcagcggtcc ttattgtaag cctgttggtt ggagcggggt attctttcgt gcaccatgat
541 gaggtttcag ttgcttcaag gaatgcaaca tgattaagaa taaagaatcg agatttacca
601 cttaaagata tggaaaacag atcgttattg atctgttttt ttctttgtgt gctgaccatg
661 cgggcatgat gcggagataa cgctgggtgc tgcggagata atgcgggcat gggtgcgctg
721 atcgcggaga ctgaaaaggc aaaatttcaa aaaaatctcc cccctacggg ggggaagaat
781 ggttttgatc ttttgttttg ggttttaaaa aagccggctg ttttcagccg gtactttttc
841 gattttggcg aagccgaaat cgggtctttt cttatcttga tactatatag aaacaacatc
901 atttttcaaa atcagttcaa agtcttgtgt gtcaagggtt agatcggttt ttgacaggta
961 aaaactcctt ctgctattat taaggtgtcg aatcaaaata atagaatgct agagaactag
1021 ctcagaagga gtttttttgt tgatttattc atctgaaaac gattatagca tcctcgaaga
1081 taaaaccgca acaggtaaaa agcgggattg gaaggggaaa aagagacgta cgaacctcat
1141 ggcggagcat tacgaagcat tagagagtaa gattggggca ccttactatg gcaaaaaggc
1201 tgaaaaacta attagttgtg cagagtatct ttcgtttaag agagacccgg agacgggcaa
1261 gttaaaactg tatcaagccc atttttgtaa agtgaggtta tgcccgatgt gtgcgtggcg
1321 caggtcgtta aaaattgctt atcacaataa gttgatcgta gaggaagcca atcggcagta
1381 cggctgcgga tggatttttc tcacgctgac ggttcggaat gtaaagggag aacggctgaa
1441 gccacaaatt tctgagatga tggaaggctt taggaaactg ttccagtaca aaaaagtaaa
1501 aacttcggtt cttggatttt tcagagcttt agagattacc aaaaatcatg aagaagatac
1561 ataccatcct cattttcatg tgttgatacc agtaaggaaa aattattttg ggaaaaacta
1621 tattaagcag gcggagtgga cgagcctttg gaaaaaggcg atgaaattgg attacactcc
1681 gattgttgat attcgtcgag tgaaaggtaa agccaagatt gacgctgaac tgattgaaaa
1741 cgatgtgcgg gaagcaatga tggagcaaaa agctgttctc gaaatttcta aatatccggt
1801 taaggatacg gatgttgtgc gcggcaataa ggtgactgaa gacaatctga acacggtgct
1861 ttacttggat gatgcgttgg cagctcgaag gttaattgga tacggtggca ttttgaagga
1921 gatacataaa gagctgaatc ttcggtgatg cggaggacgg cgatctggtc aagattgagg
1981 aagatgatga cgaggttgca aatggtgcat tgaggttatg gcttattggc atcctggcat
2041 taaaaattac ataatcaaat aaaaaaagca gacctttaag gcgtgctttt tttaactaac
2101 ccatttgtat tgtgttgaaa tatgttttgt tcgtttttta tctttgtagc tttttatgat
2161 taattgtcta tgataaccta gggaatcttt acattgtctg tgttgaatag ttacataaac
2221 tggtttgctc ttaggatagt ttaaatatac tgttgtcatt gctccgaaaa caaaagctt
//
LOCUS ECOCYSBA 1407 bp ds-DNA BCT 04-JUL-1990
DEFINITION E.coli cysteine regulatory protein (cysB) gene, complete cds.
ACCESSION M34332
KEYWORDS cysB gene; cysteine regulatory protein.
SOURCE E.coli (mutant strain CB64) DNA, clone pLB1.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1407)
AUTHORS Tei,H., Watanabe,K., Murata,K. and Kimura,A.
TITLE Analysis of the Escherichia coli K-12 cysB gene and its product
using the method of gene fusion
JOURNAL Biochem. Biophys. Res. Commun. 167, 962-969 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 349 1323 cysteine regulatory protein (cysB)
BASE COUNT 378 a 312 c 326 g 391 t
ORIGIN
1 ggatccggaa ggtaataaga ccatggttcg ctttagccgt aaaaccaaac agcaatatgt
61 ctcttcggaa aaagacggaa aggcgactgg ctggtcagca ttttatgttg atggcaaatg
121 ggttgaagga aaaaaataac ctttaattct gtcaggtttt tataaacaaa gggtcgcgaa
181 agcggccctt ttttattgca tattattttt tcttcacacc tatacactaa ggctataaat
241 gatatagtgg ttatagttag cacctttttt attattaaat cgtattagtc acccgccagg
301 tgtgacgaaa aaacgatgtt ctgatggcgt ctaagtggat ggtttaacat gaaattacaa
361 caacttcgct atattgttga ggtggtcaat cataacctga atgtctcatc aacagcggag
421 ggactttaca catcacaacc cgggatcagt aaacaagtca gaatgctgga agacgagcta
481 ggcattcaaa ttttttcccg aagcggcaag cacctgacgc aggtaacgcc agcagggcaa
541 gaaataattc gtatcgctcg cgaagtcctg tcgaaagtcg atgccataaa atcggttgcc
601 ggagagcaca cctggccgga taaaggttca ctgtatatcg ccaccacgca tacccaggca
661 cgctacgcat taccaaacgt catcaaaggc tttattgagc gttatcctcg cgtttctttg
721 catatgcacc agggctcgcc gacacaaatt gctgatgccg tctctaaagg caatgctgat
781 ttcgctatcg ccacagaagc gctgcatctg tatgaagatt tagtgatgtt accgtgctac
841 cactggaatc gggctattgt agtcactccg gatcacccgc tggcaggcaa aaaagccatt
901 accattgaag aactggcgca atatccgttg gtgacatata ccttcggctt taccggacgt
961 tcagaactgg atactgcctt taatcgcgca gggttaacgc cgcgtatcgt tttcacggca
1021 acggatgctg acgtcattaa aacttacgtc cggttagggc tgggggtagg ggtcattgcc
1081 agcatggcgg tggatccggt cgccgatccc gaccttgtgc gtgttgatgc tcacgatatc
1141 ttcagccaca gtacaaccaa aattggtttt cgccgtagta ctttcttgcg cagttatatg
1201 tatgatttca ttcagcgttt tgcaccgcat ttaacgcgtg atgtcgttga tgcggctgtc
1261 gcattgcgct ctaatgaaga aattgaggtc atgtttaaag atataaaact gccggaaaaa
1321 taatttcttg cgttattttc ggcacctttt atgtagcgaa ggtgccggaa tatattctct
1381 tttgttactt atttctatac cgtgcac
//
LOCUS HUMARHIII 60 bp ds-DNA BAD 04-JUL-1990
DEFINITION Synthetic DNA encoding angiogenin/RNase hybrid (ARH-III), partial
cds.
ACCESSION M34329
KEYWORDS .
SOURCE Human DNA, clone pAng2.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 60)
AUTHORS Bond,M.D. and Vallee,B.L.
TITLE Replacement of residues 8-22 of angiogenin with 7-21 of RNase a
selectively affects protein synthesis inhibition and angiogenesis
JOURNAL Biochemistry 29, 3341-3349 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 60 angiogenin/RNase protein (AA 4-23) (AA at 1)
BASE COUNT 17 a 18 c 13 g 12 t
ORIGIN
1 tcgaggtata caaaattcga acgccagcac atggactctt cgacaagcgc tgcctccgat
//
LOCUS HUMIGIIDO 1914 bp ss-mRNA PRI 04-JUL-1990
DEFINITION Human interferon-gamma-inducible indoleamine 2,3-dioxygenase (IDO)
mRNA, complete cds.
ACCESSION M34455
KEYWORDS indoleamine 2,3-dioxygenase.
SOURCE Human diploid fibroblast cell line FS-4, cDNA to mRNA, clone C42.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1914)
AUTHORS Dai,W. and Gupta,S.L.
TITLE Molecular cloning, sequencing and expression of human interferon-
gamma-inducible indoleamine 2,3-dioxygenase cDNA
JOURNAL Biochem. Biophys. Res. Commun. 168, 1-8 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 640 1851 indoleamine 2,3-dioxygenase (IDO) (EC
1.13.11.17)
BASE COUNT 563 a 396 c 427 g 528 t
ORIGIN
1 tgagaagggc aaatgctatc attggaaaaa ctgacaaaag tcccaatagg aaaaataagg
61 aagtggagag ttactatgtt tctaattttt catgtgcttc tatttttttc ctacttcaga
121 gccattgact aatagttgag tataacacag gttgtgtttc cgggctgctg aaacatgaca
181 ctaatatttt caaagaactg tggaagccta aaaggaagcc aatgagaaat aactaaatga
241 gagtttagga ctgcagcctt cattttcatt caaagattta aaagtttcca taaagtaaaa
301 tgttcttctc cggccacctg ttttcatagt tctgtgtttt ccttcaggcc tttctggctt
361 cctatatggc agtaagaaaa tgatgtgctt aatgattaca aatttcatat ggaatacgaa
421 ctttcagttt gtacatatga tgcacagaga tgcttttgtg gttttattgg ttttcatatt
481 acaaacaaag aaactagaaa atgaaaccat tccaaaagtg gaagtaattt ctcactgccc
541 ctgtgataaa ctgtggtcac tggctgtggc agcaactatt ataagatgct ctgaaactct
601 tcagacactg agggggcacc agaggagcag actacaagaa tggcacacgc tatggaaaac
661 tcctggacaa tcagtaaaga gtaccatatt gatgaagaag tgggctttgc tctgccaaat
721 ccacaggaaa atctacctga tttttataat gactggatgt tcattgctaa acatctgcct
781 gatctcatag agtctggcca gcttcgagaa agagttgaga agttaaacat gctcagcatt
841 gatcatctca cagaccacaa gtcacagcgc cttgcacgtc tagttctggg atgcatcacc
901 atggcatatg tgtggggcaa aggtcatgga gatgtccgta aggtcttgcc aagaaatatt
961 gctgttcctt actgccaact ctccaagaaa ctggaactgc ctcctatttt ggtttatgca
1021 gactgtgtct tggcaaactg gaagaaaaag gatcctaata agcccctgac ttatgagaac
1081 atggacgttt tgttctcatt tcgtgatgga gactgcagta aaggattctt cctggtctct
1141 ctattggtgg aaatagcagc tgcttctgca atcaaagtaa ttcctactgt attcaaggca
1201 atgcaaatgc aagaacggga cactttgcta aaggcgctgt tggaaatagc ttcttgcttg
1261 gagaaagccc ttcaagtgtt tcaccaaatc cacgatcatg tgaacccaaa agcatttttc
1321 agtgttcttc gcatatattt gtctggctgg aaaggcaacc cccagctatc agacggtctg
1381 gtgtatgaag ggttctggga agacccaaag gagtttgcag ggggcagtgc aggccaaagc
1441 agcgtctttc agtgctttga cgtcctgctg ggcatccagc agactgctgg tggaggacat
1501 gctgctcagt tcctccagga catgagaaga tatatgccac cagctcacag gaacttcctg
1561 tgctcattag agtcaaatcc ctcagtccgt gagtttgtcc tttcaaaagg tgatgctggc
1621 ctgcgggaag cttatgacgc ctgtgtgaaa gctctggtct ccctgaggag ctaccatctg
1681 caaatcgtga ctaagtacat cctgattcct gcaagccagc agccaaagga gaataagacc
1741 tctgaagacc cttcaaaact ggaagccaaa ggaactggag gcactgattt aatgaatttc
1801 ctgaagactg tgagaagtac aactgagaaa tcccttttga aggaaggtta atgtaaccca
1861 acaagagcac attttatcat agcagagaca tctgtatgca ttcctgtcat tacc
//
LOCUS RATRPL35AA 451 bp ss-mRNA ROD 04-JUL-1990
DEFINITION Rat 60S ribosomal subunit protein L35 mRNA, complete cds.
ACCESSION M34331 X51705
KEYWORDS ribosomal protein.
SOURCE Rat, cDNA to mRNA, clone pL35-[6,9].
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 451)
AUTHORS Suzuki,K., Olvera,J. and Wool,I.G.
TITLE The primary structure of rat ribosomal protein L35
JOURNAL Biochem. Biophys. Res. Commun. 167, 1377-1382 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 48 419 ribosomal protein L35
mRNA < 1 451 ribosomal protein L35 mRNA
BASE COUNT 138 a 126 c 121 g 66 t
ORIGIN
1 tcctctttct ctcgaaccgg gcggcctcat ccgcgaacag cgcagcaatg gccaagatta
61 aggctcggga cctgcgcggc aagaagaagg aggagctgtt gaaacaactg gatgatctga
121 aggtggaact gtcccagctt cgcgtggcca aagtgacagg cggcgccgcg tccaagctct
181 ccaagatacg agtcgtacgc aaatccatcg cccgtgtcct cactgtcatt aatcagactc
241 aaaaggaaaa cctcaggaaa ttctacaagg gaaagaagta caagcccctg gacctgcgac
301 ccaagaagac aagagccatg cgccgccggc tcaccaagca tgaagagaag ctgaagacca
361 agaagcagca gcggaaggag cggctgtacc cactgcgcaa gtacgcagtc aaggcctgag
421 acgacgacaa taacaataaa gtccaaaact g
//
LOCUS SCMIMP23A 1058 bp ss-mRNA INV 04-JUL-1990
DEFINITION S.mansoni 23 kDa integral membrane protein (Sm23) mRNA, complete
cds.
ACCESSION M34453
KEYWORDS integral membrane protein.
SOURCE S.mansoni (strain Puerto Rican), cDNA to mRNA, clone 3.7.2.
ORGANISM Schistosoma mansoni
Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda;
Digenea; Strigeidida; Strigeata; Schistosomatoidea;
Schistosomatidae.
REFERENCE 1 (bases 1 to 1058)
AUTHORS Wright,M.D., Henkle,K.J. and Mitchell,G.F.
TITLE An immunogenic Mr 23,000 integral membrane protein of Schistosoma
mansoni worms that closely resembles a human tumor-associated
antigen
JOURNAL J. Immunol. 144, 3195-3200 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 16 672 23 kDa integral membrane protein (Sm23)
mRNA < 1 1058 23 kDa integral membrane protein mRNA
BASE COUNT 269 a 189 c 219 g 381 t
ORIGIN
1 gtcccgtttg aaagtatggc aacgttgggt actggtatgc gttgtctaaa aagttgcgta
61 ttcgttttga acatcatctg tctgttatgt tccctggtat taattggggc tggtgcctat
121 gtggaagtca agtttagtca atatggagac aatttacaca aagtctggca ggcagcacct
181 atcgctatta ttgttgtcgg agtaataatc ctgatagtga gctttctggg ttgttgtgga
241 gccataaagg aaaacgtctg catgctgtat atgtatgcgt tcttcctcgt cgtacttctg
301 attgctgagt tggccgctgc catagttgca gttgtttaca aagataggat cgattcagaa
361 attgatgcat taatgactgg tgctctggat aaaccgacaa aagagataac tgaattcatg
421 aatttgattc aatcatcatt ccattgctgt ggagccaaag gccctgatga ttatcgtggt
481 aacgtgccag cctcgtgcaa agaagagaat ttgacctata ccgaaggctg tgtatctgtc
541 tttggagcat tcttaaaacg caacctggtg attgttgcct gtgttgcatt tggtgtatgc
601 tttttccagt tgttaagcat tgtcatagcc tgctgtttgg gtcgccaaat aaaggagtat
661 gagaatgtgt aaacactgca aagaaaacag tatgatgttt atcaatcccg ttctgttttc
721 cctctggctt ttatgaaatg ttgctttttt attgcctaga taattgtgcc ttggctaata
781 atcgtgtact cgacttcgtt tacgatataa ttattatact taaatatgat actgtcattc
841 tcattgtttc acttatcatt cttataccaa atgagctgca ttgattccct ggtatttacc
901 tacaaactga tgacaaatca tcttcatttt tatggtgtgt cttataacat ttctcgttta
961 tataatttta agtctttggt ataccttttg ttttacagac cgcttatttt gccgccacag
1021 cattagacat ttcttgtata aattaattcc gttaactt
//
LOCUS STMMELCLAC 63 bp ds-DNA BAD 04-JUL-1990
DEFINITION S.antibioticus melC-lacZ translational fusion protein mRNA, 5' end.
ACCESSION M34505
KEYWORDS .
SOURCE S.antibioticus DNA, clones pIF41[3,5].
ORGANISM Streptomyces antibioticus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 63)
AUTHORS Tseng,H.-C., Lin,C.-K., Hsu,B.-J., Leu,W.-M., Lee,Y.-H.W.,
Chiou,S.-J., Hu,N.-T. and Chen,C.W.
TITLE The melanin operon of Streptomyces antibioticus: Expression and use
as a marker in gram-negative bacteria
JOURNAL Gene 86, 123-128 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 63 lacZ-melC fusion protein
pept 49 > 63 melC protein
BASE COUNT 16 a 20 c 15 g 12 t
ORIGIN
1 atgaccatga ttacgccaag ctcgcgaggc ctcgagatct atcgatgcat gccggaactc
61 acc
//
LOCUS STMMELCTAC 101 bp ds-DNA BAD 04-JUL-1990
DEFINITION S.antibioticus tac promoter and melC gene, 5' end.
ACCESSION M34506
KEYWORDS .
SOURCE S.antibioticus DNA, clone pMELA1.
ORGANISM Streptomyces antibioticus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 101)
AUTHORS Tseng,H.-C., Lin,C.-K., Hsu,B.-J., Leu,W.-M., Lee,Y.-H.W.,
Chiou,S.-J., Hu,N.-T. and Chen,C.W.
TITLE The melanin operon of Streptomyces antibioticus: Expression and use
as a marker in gram-negative bacteria
JOURNAL Gene 86, 123-128 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 87 > 101 melC protein
BASE COUNT 30 a 23 c 23 g 25 t
ORIGIN
1 ccctgttgac aattaatcat cggctcgtat aatgtgtgga attgtgagcg gataacaatt
61 tcacacagga aacaggatca gcttgcatgc cggaactcac c
//
LOCUS SUSSPERA 2150 bp ss-mRNA INV 04-JUL-1990
DEFINITION S.purpuratus speract egg protein mRNA, complete cds.
ACCESSION J02896
KEYWORDS egg protein; speract.
SOURCE Sea urchin (library lambda-Zap), cDNA to mRNA.
ORGANISM Strongylocentrotus purpuratus
Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa;
Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE 1 (bases 1 to 2150)
AUTHORS Ramarao,C.S., Burks,D.J. and Garbers,D.L.
TITLE A single mRNA encodes multiple copies of the egg peptide speract
JOURNAL Biochemistry 29, 3383-3388 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 98 988 speract protein precursor
pept 206 988 speract protein precursor
matp 500 529 speract protein
matp 767 796 speract protein
matp 800 829 speract protein
matp 833 862 speract protein
matp 866 895 speract protein
matp 500 529 speract protein homolog (put.)
matp 668 697 speract protein homolog (put.)
matp 701 730 speract protein homolog (put.)
matp 734 763 speract protein homolog (put.)
matp 899 928 speract protein homolog (put.)
matp 932 961 speract protein homolog (put.)
signal 1320 1325 polyA signal
signal 1463 1468 polyA signal
BASE COUNT 656 a 442 c 437 g 615 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcggct actcgacttt acgctagatc tctaccgagc tttctcgtcg aatagaacaa
61 ccgaccgacc cacgtcgttg aagatgaagt tcatcgtatg cctcctggcc ctcgtggcgt
121 agcttcgggc aagcctgtca tcagcggtca ggaccagaag tacacccttg ctgattccag
181 caaggatcta gagggcattg ctcacatggc tatcgtcgac tcactttccc ctctacacat
241 ctcgctctcc tctttggaat ctgcgtggaa caacctcctc aacattgctt ctcaggaaga
301 ctactctatc ccggaacttt ccatacccaa aatcgacgtg aagagcatct tgtcatgcaa
361 gcccaagtat tctcccaagt accctgtggt cttgcagtac atctctgatc attacgtcca
421 ggtccaggac catatcgcca acgccaagga acttacagaa ggactgaaat tcgtctcgca
481 gctcatcatg tacaagaaga tcgaccacga taccctggcc tctgtctcta aaatgttgag
541 caattacttg actgactacg catcgacaat ctcatcactt aaatcagtcg tctgtcaaga
601 tcaaacagcg ccatctcatc ctatggacga gagttacatg gacacacccc tgtcaatgat
661 cttgaaaggc accatgccta ctggcgctgg agtagacaaa ggcttcgcac ttggtggtgg
721 tggagttggt aaaggattca acttgaacgg cggtggagtt ggcaaaggct ttgacttgaa
781 cggcggtgga gttggcaaag gattcgactt gaacggcggt ggagttggca aaggctttga
841 cttgaacggc ggtggagttg gcaaaggctt tgacttgaac ggcggtggag ttggcaaagg
901 tttcgctttg ggcggtggtg gagttggcaa aggattcagc ctcaccggag gaggcgtcgg
961 cagggaagtc gaaatcaaag actggtaatc gacgttctgg cgctcaatag atctgacatc
1021 aaatgccatg cgcccgtaca ggaatttcat aattgccatt tttttttaca ctgatatcaa
1081 tactttggtc gtgcctgtcc aattatacaa atatgtatta aataatagca tcaaatgcaa
1141 aaaaaaaaaa aaaaaaaaaa gggtcttgtt ttcttgagct tgcaatgatc aaccatcatc
1201 cagtgtcgtc atgatggtct gcgtcacaag agaggcgata agcaaaattt aagactcaag
1261 atggtatgtt aaactttgat cttaaggacg ttttctttgt ttcctcatca aatattgaaa
1321 ataaattttt aacaaaaaaa attacatatg cctgtaaaat ggctatttca gcatttttct
1381 ctgttttcta cgattcatat ttctttttca ttggaaaaac tagaaagaat atataaaata
1441 taaaatattt tcaaaatagt ttaataaatt atttccaaaa gacaactttg tcaaaaagaa
1501 ttgaaaaaag aagaaaatga attgaaagca aaattttctt ttcaaaatgt cagcaactga
1561 tagtgatagc cgcaggtatt acatcaggca aatcaaacag gtaaaggtca tacaatgtac
1621 cttctagatg ttactttaca agattttaga tctactttgg tatcttcttt cctttttcct
1681 ttgattgtaa actcctttca agtaatacga attaatacca taagttgcac ctgcctagcc
1741 tatacagtcg acactgccca caaacaaatc gagaagtata gaataacgca acataaaaaa
1801 ggaaagattt tggtagctgt gctcttttcc tcttttctta tctttcgaat gtttactctc
1861 ctgtactcat ttaaaatggt aaattaaaaa gagagccaga tttttatttg gaaggttgca
1921 aatcgattga cagataggtc atcagtgtag agctttgata caagtgacaa aaacggtcca
1981 tacgcttgtt gagcaatatg gttacttcat gaaggccttg tatatttcac ataatgataa
2041 caaatactag aagtcaatga tgcattgctg caaagaattt tgataacgtg actttctttg
2101 tagtctggtg cttggtaaaa tccagagacg caatgaaaat acttgaattc
//
LOCUS TETTCBP23A 751 bp ss-mRNA INV 04-JUL-1990
DEFINITION T.thermophila calcium-binding protein (TCBP-23) mRNA, complete cds.
ACCESSION J05227
KEYWORDS calcium-binding protein.
SOURCE T.thermophila (strain B1868), cDNA to mRNA, clone TCBP-23.
ORGANISM Tetrahymena thermophila
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE 1 (bases 1 to 751)
AUTHORS Takemasa,T., Takagi,T., Kobayashi,T., Konishi,K. and Watanabe,Y.
TITLE The third calmodulin family protein in Tetrahymena: Cloning of the
cDNA for Tetrahymena calcium-binding protein of 23 kDa (TCBP-23)
JOURNAL J. Biol. Chem. 265, 2514-2517 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 28 651 calcium-binding protein (TCBP-23)
signal 661 666 polyA signal
signal 676 681 polyA signal
BASE COUNT 249 a 120 c 152 g 230 t
ORIGIN
1 gtagaaagca aacaaacact aaaagaaatg gaacaccaaa tcatcaccta aaacgtctac
61 gctcctgata ctgaagccaa gcttgatgtt gctagaaagc tttttgctta gtttgattct
121 aacaagaacg gtactttaga tcctagcgaa gttgctggac tcatcaagac tacttttgaa
181 aatatgggtg ttaaggacta cagcgtcact gctgatgatg tcaagcttta catgaagagt
241 gttgatgttg ataacaacgg tcttgtttcc tactctgaat acgaagaata cgtcattgct
301 tgcctcaaga aagctggctt cgactgtgaa gttaagcaaa aggtaaaaag atctgccaaa
361 aagagagacg ctgctactga aatgaagttg gacgttgcca gaagactctt cgctaagtac
421 gactctgata agagtggtta attagaagaa aaggaagttt atggtgtcat tactgaaacc
481 tataagcaaa tgggtatgga ttacaagccc actgaagctg atgttaagct ttggatgtcc
541 atgactgata ctgataagaa tggaactgtc tctattgttg aatatgaaga tttcgtcatt
601 tctggtctta agaaggctgg tttcatggtc aaggaattca ctcaagcttg atcttatttt
661 aataaacata tttcaaataa aaattgaact gtatttgtgt atttcatgtc atttcttaaa
721 attttaaatg atttttttaa tatcgttaaa a
//
LOCUS TRNRESRA 480 bp ds-DNA BCT 04-JUL-1990
DEFINITION Transposon Tn2501 putative res region.
ACCESSION M34377
KEYWORDS .
SOURCE Transposon Tn2501 DNA, clone pTM41.
ORGANISM Transposon Tn2501
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 480)
AUTHORS Michiels,T. and Cornelis,G.
TITLE Site-specific recombinations between direct and inverted res sites
of Tn2501
JOURNAL Plasmid 22, 249-255 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
site 285 310 inverted repeat region I
site 325 356 inverted repeat region II
site 366 387 inverted repeat region III
BASE COUNT 125 a 110 c 110 g 135 t
ORIGIN
1 gcgattctgg atatgacgct actgtgaggg acattgcttt tgtccggaat aaatccaggt
61 ccacgcagat aacagagtaa tacagcaaag cccaggcgat ttgctggccg ccggtgttta
121 ttaatgagtg caatatcctg ttcattcaga aaacacattc gggtcagaat aatttcatca
181 tctggaataa ccagcaagcg ctcctgctct tcactgctca gtatctgtcg ccgtggcata
241 aatgcttccc tcgcgtatag tttccatatg ttatggattg gctatgaccg aaaccggtcg
301 gtttcgtaca ccttctaata tatctgtccg aaagtaagtg aaattcattt cgtacaggcg
361 ctataattcg gacatccatt tcgtacggaa agtttcctat gtcacgagtt tttgcttact
421 gccgggtttc gactcttgag caaaccaccg agaatcagcg aagggaaatt gaagcggcgg
//
LOCUS HUMALRMB 1354 bp ss-mRNA PRI 04-JUL-1990
DEFINITION Human aldose reductase mRNA, complete cds.
ACCESSION J05474
KEYWORDS aldose reductase.
SOURCE Human adult and fetus retina, and muscle, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1354)
AUTHORS Nishimura,C., Matsuura,Y., Kokai,Y., Akera,T., Carper,D.,
Morjana,N., Lyons,C. and Flynn,T.G.
TITLE Cloning and expression of human aldose reductase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Nishimura, 26-APR-1990.
FEATURES from to/span description
pept 33 983 aldose reductase (EC 1.1.1.21)
mRNA < 1 1354 aldose reductase mRNA
BASE COUNT 320 a 352 c 365 g 317 t
ORIGIN
1 cggtacgtgc ggccttgggg agcgcagcag ccatggcaag ccgtctcctg ctcaacaacg
61 gcgccaagat gcccatcctg gggttgggta cctggaagtc ccctccaggg caggtgactg
121 aggccgtgaa ggtggccatt gacgtcgggt accgccacat cgactgtgcc catgtgtacc
181 agaatgagaa tgaggtgggg gtggccattc aggagaagct cagggagcag gtggtgaagc
241 gtgaggagct cttcatcgtc agcaagctgt ggtgcacgta ccatgagaag ggcctggtga
301 aaggagcctg ccagaagaca ctcagcgacc tgaagctgga ctacctggac ctctacctta
361 ttcactggcc gactggcttt aagcctggga aggaattttt cccattggat gagtcgggca
421 atgtggttcc cagtgacacc aacattctgg acacgtgggc ggccatggaa gagctggtgg
481 atgaagggct ggtgaaagct attggcatct ccaacttcaa ccatctccag gtggagatga
541 tcttaaacaa acctggcttg aagtataagc ctgcagttaa ccagattgag tgccacccat
601 atctcactca ggagaagtta atccagtact gccagtccaa aggcatcgtg gtgaccgcct
661 acagccccct cggctctcct gacaggccct gggccaagcc cgaggaccct tctctcctgg
721 aggatcccag gatcaaggcg atcgcagcca agcacaataa aactacagcc caggtcctga
781 tccggttccc catgcagagg aacttggtgg tgatccccaa gtctgtgaca ccagaacgca
841 ttgctgagaa ctttaaggtc tttgactttg aactgagcag ccaggatatg accaccttac
901 tcagctacaa caggaactgg agggtctgtg ccttgttgag ctgtacctcc cacaaggatt
961 accccttcca tgaagagttt tgaagctgtg gttgcctgct cgtccccaag tgacctatac
1021 ctgtgtttct tgcctcattt ttttccttgc aaatgtagta tggcctgtgt cactcagcag
1081 tgggacagca acctgtagag tggccagcga gggcgtgtct agcttgatgt tggatctcaa
1141 gagccctgtc agtagagtag aagtctcttc cagtttgctt tgcccttctt tctaccctgc
1201 tggggaaagt acaacctgaa tacccttttc tgaccaaaga gaagcaaaat ctaccaggtc
1261 aaaatagtgc cactaacggt tgagttttga ctgcttggaa ctggaatcct ttcagcaaga
1321 cttctctttg cctcaaataa aaagtgcttt tgtg
//
LOCUS CHTHYPABX 2468 bp ds-DNA BCT 04-JUL-1990
DEFINITION C.psittaci hypA and hypB protein genes, complete cds.
ACCESSION M25101
KEYWORDS .
SOURCE C.psittaci (strain GPIC) elementary body DNA, clone pGP57.
ORGANISM Chlamydia psittaci
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE 1 (bases 1 to 2468)
AUTHORS Morrison,P.R., Belland,R.J., Lyng,K. and Caldwell,H.D.
TITLE Chlamydial disease pathogenesis: The chlamydial hypersensitivity
antigen is a stress response protein
JOURNAL J. Exp. Med. 170, 1271-1283 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.P.Morrison, 02-JUN-1989.
FEATURES from to/span description
pept 314 622 hypA protein
pept 673 2307 hypB protein
signal 83 88 -35 region
signal 120 127 -10 region
BASE COUNT 855 a 462 c 514 g 637 t
ORIGIN
1 gaattcttaa caaagaagat aacgctctcg aatcgtacat gaacttctta aaaagtggtg
61 gctccgactt cccattagaa atcttgaaga agtccggatt ggatatggca accagcgagc
121 ccatatacaa ggccttttcc ttcatagaga gaaaaattca agagttatca tctttaattt
181 aaaacaacta aagaaaagta gcacttgata ttattaagtg ctaaaatcat tgccaaaaaa
241 cgagagactt tggtatcgtt cctgagaaac ggcaaagtct cttttagaac aagaaacaca
301 aggagcttat aacatgtcag atcaagcaac gacccttagg attaagcccc tgggcgatag
361 aattttagtg aaaagagaag aagaagattc tacagcgcgc ggcggcatca ttttacctga
421 tacagcaaag aaaaaacagg atcgagcaga ggtattagtc ctaggcactg gaaaacgaga
481 taaagatggc aacgtcctac cttttgaagt taccgtgggt gatactgttt taatagataa
541 atacgcggga caagaactta ccgttgatgg tgaggagtac gtcattgttc aggaaagcga
601 agttatggca gttctcaagt aagagaaatc attatttata gattgcaaaa agttaaggag
661 cacaaaaaaa caatggcagc aaaaaatatt aaatataacg aagacgccag aaaaaaaatc
721 cataaaggag ttaaaaccct tgcagaagct gtaaaagtaa ccttaggtcc taaaggccgt
781 catgtggtta tcgataaaag ctttggttct cctcaagtta ccaaagatgg cgtaactgtc
841 gctaaagaaa ttgagctcga agacaagcat gagaacatgg gagctcaaat ggtaaaagaa
901 gtcgctagca aaactgcaga taaagctggt gatggaacta caacagctac tgttcttgca
961 gaagctatct acagtgaagg attgagaaac gtaactgcag gcgccaatcc tatggacctc
1021 aaaagaggca ttgataaggc agtaaaagtc gttgtcgatg aaatcaaaaa aattagtaaa
1081 cccgtacaac atcacaaaga aatagctcaa gtagcgacta tttctgcaaa taatgatgct
1141 gaaatcggta atcttatcgc cgaagccatg gaaaaagttg gcaaaaacgg ctctattact
1201 gttgaagaag ctaaaggttt cgaaactgtc ctcgacgttg tcgaaggtat gaatttcaac
1261 cgcggatacc tatccagcta cttctctaca aatcctgaaa cacaagaatg tgttttagaa
1321 gaagctctcg tgcttatcta tgataaaaaa atttccggaa tcaaagattt tctaccagtt
1381 ttacaacaag tagcagaatc aggacgtccc ctacttatca ttgctgaaga tatcgaagga
1441 gaagctttag ctactttagt agtaaacaga ctacgtgctg gattcagagt gtgtgcagta
1501 aaagctcctg gatttggtga tagaagaaaa gctatgttag aagacatcgc tattttaact
1561 ggtggtcaac tcatcagcga agagcttggc atgaagcttg agaacacaac tctagctatg
1621 ttaggaaaag ctaaaaaagt catcgtttcc aaagaagata caacaattgt tgaaggtctt
1681 ggcagcaaag aagatattga atctcgctgc gaaagtatca aaaaacaaat cgaagacagt
1741 acttctgatt acgacaaaga aaaactccaa gaacgtttag ctaaactttc cggaggcgta
1801 gctgtaatcc gtgtaggagc tgctacagaa atcgaaatga aagagaaaaa agacagagta
1861 gatgatgctc agcatgcaac tcttgctgca gttgaagaag gtattctacc tggcggtggt
1921 acagctttag ttcgctgcat ccctacttta gaagctttca ttcctattct tacaaatgaa
1981 gatgagcaaa tcggagcacg tattgttctc aaagcattat ccgctccatt aaagcaaatt
2041 gcagccaatg ctggtaaaga aggcgctatc atctgtcaac aagtgctttc tcgctcctct
2101 agcgaaggct atgatgcttt acgcgatgct tacaccgaca tgattgaggc aggaattctc
2161 gatccaacta aagttacacg ttgtgcttta gaaagcgcag cttctgtagc tgggcttcta
2221 ttaacaacag aagctttaat tgccgatatt cctgaagaga aatcctcttc tgctcccgca
2281 atgccaggcg caggaatgga ttattaatcc ttaatttaga gagcattttc tctaatatta
2341 taaggtctcc tttcatccat cttgagaaga aagggggcct tttttatttt ctaatatttc
2401 tttcttcatc tatgttggaa accaagataa atcatattct catcatgcat gtttaaactt
2461 ttaaagaa
//
LOCUS TRBPGKM 3306 bp ds-DNA INV 04-JUL-1990
DEFINITION T.brucei phosphoglycerate kinase isozyme-8 (cPGK-8, gPGK) genes,
complete cds.
ACCESSION M33775
KEYWORDS phosphoglycerate kinase.
SOURCE T.brucei (strain EATRO 164) DNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 3306)
AUTHORS Alexander,K., Parail,A.C. and Parsons,M.
TITLE An allele of Trypanosoma brucei cytoplasmic phosphoglycerate kinase
is a mosaic of other alleles and genes
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Authorin submission for [1] kindly submitted by M.Parsons,
18-APR-1990.
Author address: M.Parsons
Seattle Biomedical Research Institute
4 Nickerson Street
Seattle, WA 98109-1651
(206) 284-8846
FEATURES from to/span description
pept 175 1437 phosphoglycerate kinase (cPGK-8; E.C. 2.7.2.3)
pept 1792 3114 phosphoglycerate kinase (gPGK; E.C. 2.7.2.3)
allele 406 406 c in allele 8; g in allele 2
allele 466 467 cg in allele 8; gc in allele 2
allele 603 604 ga in allele 8; gaaga in allele 2
allele 640 640 g in allele 8; t in allele 2
allele 729 730 tc in allele 8; ca in allele 2
allele 783 783 g in allele 8; a in allele 2
allele 840 840 a in allele 8; c in allele 2
allele 944 950 aatcgaa in allele 8; atccatt in allele 2
allele 1018 1018 g in allele 8; t in allele 2
allele 1107 1107 g in allele 8; a in allele 2
allele 1425 1428 cgaa in allele 8; tgat in allele 2
allele 1439 1457 attcatcctgttacgggcc in allele 8;
gtacatcaggcgaagggtt in allele 2
allele 1483 1483 t in allele 8; g in allele 2
allele 1512 1512 t in allele 8; c in allele 2
allele 1537 1540 ttta in allele 8; ta in allele 2
allele 2015 2015 a in allele 8; g in allele 2
allele 2083 2084 cg in allele 8; gc in allele 2
allele 2136 2136 t in allele 8; c in allele 2
allele 2151 2151 t in allele 8; c in allele 2
allele 2196 2199 cttt in allele 8; gttc in allele 2
allele 2268 2268 c in allele 8; c in allele 2
allele 2400 2400 g in allele 8; a in allele 2
allele 2919 2919 c in allele 8; t in allele 2
allele 3031 3031 g in allele 8; a in allele 2
allele 3057 3057 g in allele 8; t in allele 2
allele 3196 3197 tg in allele 8; gtttg in allele 2
allele 3248 3248 t in allele 8; c in allele 2
allele 3257 3257 g in allele 8; t in allele 2
allele 3273 3274 ag in allele 8; agag in allele 2
allele 3284 3286 ttg in allele 8; tg in allele 2
allele 3297 3299 aat in allele 8; at in allele 2
BASE COUNT 863 a 659 c 833 g 951 t
ORIGIN
1 aaaccgaatt cccttcccca agtctcgcag tcacttcttt tcaacgtttt ctcacttagc
61 caacatactt gtattttttg tttcaaaaca ttaaaaaatt gtaacaaggg agtttcttat
121 ttttttgaaa aaactatata tatcgatata tacttatctg atcacaaatc aaatatgtca
181 ctaaaagaga ggaagagcat taacgaatgc gatcttaagg gaaagaaggt tcttatccgt
241 gttgacttta atgttcctct ggatgatggt aatattacca acgactaccg aatccgatca
301 gctctgccag ccgtccaaaa agttctcaca gaaggcggca gctgtgttct catgagccac
361 ctcgggaggc cgaaaggcgt ttctatggct gaaggcaaag aactgcggag cgctggcggt
421 attcccgggt tcgagcagaa ggcaacactc aaaccggtag ccaagcgcct cagcgaattg
481 ttatcgagac ccgtcacatt cgcacctgac tgtctgaacg ctgcagatgt cgtctctaag
541 atgtctccgg gcgatgttgt tctgcttgaa aatgtacggt tctacaaaga agagggcagc
601 aagagcactg aggaacgtga agccatggcc aagatccttg cgtcatatgg tgatgtttac
661 atcagtgatg cttttggtac agctcaccgt gacagtgcta ccatgaccgg aattccaaag
721 attttgggtc acggtgctgc cggttatttg atggagaagg agatttcata cttcgctaag
781 gtgcttggta acccgccgcg tccgctggtt gctatcgttg gtggagcgaa agtgagcgaa
841 aagatccaac ttctggataa catgttgcag cgcatcgatt atctcttaat tggtggtgca
901 atggcataca catttctgaa ggctcagggt tacagcattg gaaaatcgaa gtgcgaggaa
961 agtaaacttg aatttgctcg atccctgctg aagaaggcgg aggaccgcaa ggtgcaggtt
1021 attcttccaa ttgatcatgt ttgccacacg gaattcaaag ctgtggattc tccattgata
1081 actgaggatc aaaacatccc tgaagggcat atggctctgg atattggtcc caagactatt
1141 gaaaaatatg ttcagacgat tgggaagtgt aagagcgcca tttggaacgg tcccatgggt
1201 gtatttgaaa tggttcctta ttccaaaggt acatttgcaa ttgcgaaagc catgggtcga
1261 ggaactcacg agcatggact catgagtatc atcggtggtg gtgacagcgc aagtgcagct
1321 gagttgagcg gtgaggcgaa gcgcatgtct catgtttcaa cgggtggtgg tgcgtctttg
1381 gaactcctcg agggcaaaac gcttcccggt gttacagtat tggacgaaaa ggagtaatat
1441 tcatcctgtt acgggcctgt tttatggaat tgtgtttttt agtccttttt atttgttggt
1501 taggtattgg ttcgtacgtg actattattt ttttttttta ggataacatt tatgtttttt
1561 ctctactcat tttatttttt ggtactatgt atcgattgca cagtaatatt tcaatcgcta
1621 tgatatttct atctttttga tcgttctact gtgtaacttt gtgtttatct ttgttacttc
1681 actctttttt cactcaaatc gtttgggctg caggctcagc tctttggtga tatcaaagca
1741 taattgctgc ggagatacgt ttttccacct aataagtaat tgtgatacaa gatgaccctt
1801 aacgagaaga agagcattaa tgaatgcgat cttaagggaa agaaggttct tatccgtgtt
1861 gactttaatg ttcccgtgaa aaacggtaag atcaccaacg actaccgaat ccgatcagct
1921 ctgccaacgc tcaagaaggt tctcacagaa ggcggcagct gtgttctcat gagccacctc
1981 gggaggccga aaggtattcc catggcgcaa gctgacaaaa tacggagcac tggcggtgtt
2041 cccgggttcc aacagaaggc aacactcaaa ccggtagcca agcgcctcag cgaactgcta
2101 ttgaggcccg tcacattcgc acctgactgc ctgaatgctg cagatgtcgt ttctaagatg
2161 tctccgggcg atgttgttct gcttgaaaat gtacgctttt acaaagaaga gggcagcaag
2221 aaggcaaaag accgtgaagc catggccaag atccttgcgt catatggcga tgtttacatc
2281 agtgatgctt ttggtacagc tcaccgtgac agtgctacca tgaccggaat tccaaagatt
2341 ttgggcaacg gtgctgccgg ttatttgatg gagaaggaga tttcatactt cgctaaggtg
2401 cttggtaacc cgccgcgtcc gctggttgct atcgttggtg gagcgaaagt gagcgacaag
2461 atccaacttc tggataacat gttgcagcgc atcgattatc tcttaattgg tggtgcaatg
2521 gcatacacat ttctgaaggc tcagggttac agcattggaa aatcgaagtg cgaggaaagt
2581 aaacttgaat ttgctcgatc cctgctgaag aaggcggagg accgcaaggt gcaggttatt
2641 cttccaattg atcatgtttg ccacacggaa ttcaaagctg tggattctcc attgataact
2701 gaggatcaaa acatccctga agggcatatg gctctggata ttggtcccaa gactattgaa
2761 aaatatgttc agacgattgg gaagtgtaag agcgccattt ggaacggtcc catgggtgta
2821 tttgaaatgg ttccttattc caaaggtaca tttgcaattg cgaaagccat gggtcgagga
2881 actcacgagc atggactcat gagtatcatc ggtggtggcg acagcgcaag tgcagctgag
2941 ttgagcggtg aggcgaagcg catgtctcat gtttcaactg gtggtggtgc gtctttggaa
3001 ctcctcgagg gcaaaacgct tcccggcgtt gcagtattgg acgaaaagtc ggcggtggtg
3061 tcgtatgcct ctgcaggtac tggaactctt tctaaccggt ggagctctct ttaaacttcc
3121 agaaaaaata tatttctgca aaatactttt ggaagtttgt cttgtcttta tagatgaagg
3181 atttgtttct ttttttgtga tgttttcaag gttaattagt tttgggggtt cgttatctta
3241 attattttgg tgggtgggag taaataaagc agaggtaaat tttttggtga cacaaaaatt
3301 gggaag
//GenBank-Updates@genbank.bio.net (07/06/90)
LOCUS MUSLRPA 3070 bp ss-mRNA ROD 06-JUL-1990
DEFINITION Mouse leukocyte common antigen-related phosphatase mRNA, complete
cds.
ACCESSION M36033 M33671
KEYWORDS leukocyte common antigen-related phosphatase;
protein-tyrosine-phosphate phosphohydrolase; tyrosine phosphatase.
SOURCE Mouse (strain (C57BL/6 X DBA/2)F1) pre-B cell line 70Z/3, cDNA to
mRNA, clones lambda-mLRP-[B20,C26].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3070)
AUTHORS Matthews,R.J., Cahir,E.D. and Thomas,M.L.
TITLE Identification of an additional member of the
protein-tyrosine-phosphatase family: Evidence for alternative
splicing in the tyrosine phosphatase domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4444-4448 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.L.Thomas, 11-APR-1990.
FEATURES from to/span description
pept 26 2515 leukocyte common antigen-related phosphatase
precursor (EC 3.13.48)
sigp 26 82 leukocyte common antigen-related phosphatase
signal peptide
matp 83 2512 leukocyte common antigen-related phosphatase
mRNA < 1 3070 leukocyte common antigen-related phosphatase
mRNA
site 827 936 gg in short clone lambda-mLRP-B20
BASE COUNT 874 a 754 c 732 g 710 t
ORIGIN
1 ccgcccagcg ccgggctcgg tcagcatgga ttcctggttc attcttgtcc tgtttggcag
61 tggtctaata catgttagtg ccaacaatgc tactacagtt tcaccttctt taggaacgac
121 aagattaatt aaaacatcaa caacagaatt ggctaaggaa gagaataaaa cctcaaattc
181 aacctcttca gtaatttctc tttctgtggc accaacattc agcccaaacc tgactctgga
241 gcccacctat gtgactactg ttaattcttc acactctgac aatgggacca ggagggcagc
301 cagcacggaa tctggaggca ctaccatttc cccgaacgga agctggctta ttgagaacca
361 gttcacggat gccataacag aaccctggga ggggaactcc agcactgcag caaccactcc
421 agaaaccttc cccccggcag atgagacacc aattattgcg gtgatggtgg ccctgtcctc
481 tctgctagta atcgtgttta ttatcatagt tctgtacatg ttaaggttta agaaatacaa
541 gcaagctggg agtcattcca actctttccg cctgtcaaat ggccgcacgg aggatgtgga
601 gccccaaagt gtaccacttc tggccaggtc cccaagcacc aacaggaagt acccaccact
661 gcctgtggac aagctggaag aggagattaa ccggagaatg gctgatgaca ataagatctt
721 cagagaagaa ttcaacgctc tccctgcttg tcctatccag gccacctgtg aggctgcctc
781 caaggaagaa aacaaggaaa aaaaccgcta tgtaaacatc ctgccctttc tctctttagc
841 tgtgagcaag gatgcagtga aagcactgaa caaaaccact ccattgttag aaagaaggtt
901 tattgggaaa tcaaactcca gaggctgtct ctcagatgac cactctagag tgcacctgac
961 acctgttgaa ggggtcccag attctgatta catcaacgct tcattcatta atggctacca
1021 ggaaaagaac aaattcatcg ctgcacaagg accaaaagaa gaaacagtga atgacttctg
1081 gagaatgata tgggaacaaa acacagctac tattgtcatg gtgaccaacc tgaaggagag
1141 aaaggagtgt aaatgtgccc aatactggcc agaccaaggc tgctggacct atgggaatgt
1201 ccgtgtgtct gtcgaggatg tgactgttct ggtggactac acagtacgga aattctgcat
1261 ccagcaggtg ggcgacgtga ccaacaggaa accacagcgc ctcatcactc agttccactt
1321 caccagctgg ccagactttg gggtgccttt caccccaatt ggcatgctca agttcctcaa
1381 gaaggtgaag gcctgtaacc ctcagtacgc aggggctatc gtggtccact gcagtgcagg
1441 tgtagggcgc actggcacct ttgttgtcat cgatgccatg ctggacatga tgcattcgga
1501 acgcaaagtg gatgtatacg ggtttgtgag ccggatccgg gcccagcgct gccagatggt
1561 acagacagac atgcagtacg tcttcatata ccaggccctt ctggagcatt atctgtatgg
1621 ggacacagaa ctggaagtga cttctctaga aacccaccta caaaaaattt ataacaagat
1681 cccagggacg agcaacaacg ggttagagga ggagtttaag aaattaactt caatcaaaat
1741 ccagaatgac aagatgcgca cgggaaacct tccagccaac atgaagaaga accgggtttt
1801 acagatcatt ccatatgaat ttaacagagt gatcattcca gtcaaacgag gcgaagagaa
1861 cacagactat gtgaacgcat ccttcattga tggataccgg cagaaagact cctacattgc
1921 cagccagggc cctcttctcc acacgattga ggacttctgg cgaatgatct gggagtggaa
1981 gtcctgttct atcgtaatgc tgacagaact ggaagagaga ggccaggaga agtgtgccca
2041 gtactggcca tctgatggcc tggtgtccta tggagacatc acagttgagc tgaagaagga
2101 ggaggaatgt gaaagctaca ctgtccgaga cctcctggtc accaacacca gggagaacaa
2161 gagtcggcaa atccggcagt tccacttcca cggctggcct gaggtgggca tccccagcga
2221 cggcaagggc atgatcaaca tcattgcagc agtgcagaag cagcagcagc agtcggggaa
2281 ccatcccatc actgtgcact gcagtgccgg ggcaggacgg acaggaacct tctgtgcctt
2341 gagcacagtc ctggaacgtg tgaaagcaga aggaatttta gatgtcttcc aaactgtcaa
2401 gagcctgcgg ctgcagaggc cacacatggt ccagacactg gaacagtatg aattctgcta
2461 caaggtggta caggagtaca ttgacgcctt ttcagattat gccaacttca agtgacaggt
2521 gacaaggccc acagacagga gaattgcctt taatattttg taatattctg tttttgttaa
2581 tatacccaaa attgtatata tcttataact gttttagaaa tggcacatag gcttctatta
2641 cctgttaggt ggagattttg tatgtaaatg tgttagcact gatagtcctt ttccagtgtt
2701 ttattgggaa attaaatagt gtgatatttg ggttgatata atgaaatcct cagcctggaa
2761 actgggccag attgttcctt gcttcaaata tcttttccta aagaagataa acctaagact
2821 cattccaggt agctcagtgc caactaaaac aaagcacaaa gttctcagag ctcttgagga
2881 aatggttgtc tccctgtccc caggcaggcc tcttcccctc cctgtcctgt aaatatccct
2941 cccctctcca gtccaccctc atctcccacc aagatcagcc acctcaggca tggggagtaa
3001 tgagaccaga gcgcctctct ggcaccacag cagggatcgt caggtaataa acactcttga
3061 ttccctgagg
//
LOCUS MUSLRPB 144 bp ds-DNA ROD 06-JUL-1990
DEFINITION Mouse leukocyte common antigen-related phosphatase gene, exon N.
ACCESSION M36034 M33671
KEYWORDS leukocyte common antigen-related phosphatase;
protein-tyrosine-phosphate phosphohydrolase; tyrosine phosphatase.
SOURCE Mouse (strain (C57BL/6 X DBA/2)F1) pre-B cell line 70Z/3 DNA, clone
lambda-mLRP-B20.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 144)
AUTHORS Matthews,R.J., Cahir,E.D. and Thomas,M.L.
TITLE Identification of an additional member of the
protein-tyrosine-phosphatase family: Evidence for alternative
splicing in the tyrosine phosphatase domain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4444-4448 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept / 25 / 132 leukocyte common antigen-related phosphatase,
exon N (EC 3.13.48) (AA at 27)
IVS < 1 24 leukocyte common antigen-related phosphatase
intron X
IVS 133 > 144 leukocyte common antigen-related phosphatase
intron X+1
BASE COUNT 50 a 30 c 30 g 34 t
ORIGIN
1 gcttactcaa acgatcatac aaagttctct ctttagctgt gagcaaggat gcagtgaaag
61 cactgaacaa aaccactcca ttgttagaaa gaaggtttat tgggaaatca aactccagag
121 gctgtctctc aggtcagaga aaac
//
LOCUS HUMACOLA1 34 bp ds-DNA PRI 06-JUL-1990
DEFINITION Human alpha-1 type IX collagen (COL9A1) gene, exon 1.
ACCESSION M34534
KEYWORDS alpha-1 type IX collagen.
SEGMENT 1 of 4
SOURCE Human DNA, clone YMh304.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 34)
AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
TITLE The alpha-1 (IX) collagen gene gives rise to two different
transcripts in mouse embryonic and human fetal RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
B.R.Olsen, 15-FEB-1990.
FEATURES from to/span description
pept 19 / 32 alpha-1 type IX collagen precursor, exon 1
/hgml_locus_uid="LG0170H" /nomgen="COL9A1"
/map="6q12-q14"
pep$ 19 / 32 alpha-1 type IX collagen precursor, exon 1
sigp 19 / 32 alpha-1 type IX collagen signal peptide
sig$ 19 / 32 alpha-1 type IX collagen signal peptide
IVS 33 > 34 COL9A1 intron A
BASE COUNT 14 a 5 c 9 g 6 t
ORIGIN
1 agaaaatcaa ctgggaaaat gaagacctgc tggt
//
LOCUS HUMACOLA2 88 bp ds-DNA PRI 06-JUL-1990
DEFINITION Human alpha-1 type IX collagen (COL9A1) gene, exon 6.
ACCESSION M32135
KEYWORDS alpha-1 type IX collagen.
SEGMENT 2 of 4
SOURCE Human DNA, clone YMh617.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 88)
AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
TITLE The alpha-1 (IX) collagen gene gives rise to two different
transcripts in mouse embryonic and human fetal RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
B.R.Olsen, 15-FEB-1990.
FEATURES from to/span description
pept / 3 / 86 alpha-1 type IX collagen, exon 6 (AA at 3)
/hgml_locus_uid="LG0170H" /nomgen="COL9A1"
/map="6q12-q14"
pep$ / 3 + 86 alpha-1 type IX collagen, exon 6 (AA at 3)
IVS < 1 2 COL9A1 intron E
IVS 87 > 88 COL9A1 intron F
BASE COUNT 23 a 24 c 22 g 19 t
ORIGIN Unknown number of bp after segment 1.
1 agtttgaact tcaatggatg ctgatccatt gtgaccccct gcggcccagg acagaaactt
61 gccatgagct gccagccaga ataacggt
//
LOCUS HUMACOLA3 74 bp ds-DNA PRI 06-JUL-1990
DEFINITION Human alpha-1 type IX collagen (COL9A1) gene, exon 1*.
ACCESSION M32133
KEYWORDS alpha-1 type IX collagen.
SEGMENT 3 of 4
SOURCE Human DNA, clone Ymh617.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 74)
AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
TITLE The alpha-1 (IX) collagen gene gives rise to two different
transcripts in mouse embryonic and human fetal RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
B.R.Olsen, 15-FEB-1990.
FEATURES from to/span description
pept < 1 + 72 alpha-1 type IX collagen, exon 1* (AA at 1)
/hgml_locus_uid="LG0170H" /nomgen="COL9A1"
/map="6q12-q14"
IVS 73 > 74 COL9A1 intron G*
BASE COUNT 5 a 21 c 32 g 16 t
ORIGIN Unknown number of bp after segment 2.
1 atggcctgga ctgcgcggga ccgcggggcc ctggggctgc tgctgttggg gctctgcttg
61 tgcgcggctc aagt
//
LOCUS HUMACOLA4 25 bp ds-DNA PRI 06-JUL-1990
DEFINITION Human alpha-1 type IX collagen (COL9A1) gene, exon 7.
ACCESSION M32137
KEYWORDS alpha-1 type IX collagen.
SEGMENT 4 of 4
SOURCE Human DNA, clone YMh617.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 25)
AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
TITLE The alpha-1 (IX) collagen gene gives rise to two different
transcripts in mouse embryonic and human fetal RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
B.R.Olsen, 15-FEB-1990.
FEATURES from to/span description
pept + 3 / 23 alpha-1 type IX collagen, exon 7
/hgml_locus_uid="LG0170H" /nomgen="COL9A1"
/map="6q12-q14"
pep$ + 3 / 23 alpha-1 type IX collagen (COL9A1), exon 7
IVS < 1 2 COL9A1 intron G*
IVS 24 > 25 COL9A1 intron H*
IVS < 1 2 COL9A1 intron F
IVS 24 > 25 COL9A1 intron G
BASE COUNT 7 a 10 c 7 g 1 t
ORIGIN Unknown number of bp after segment 3.
1 agcccagcca gaccaccgac gaggt
//
LOCUS MUSACOLA1 34 bp ds-DNA ROD 06-JUL-1990
DEFINITION Mouse alpha-1 type IX collagen (COL9A1) gene, exon 1.
ACCESSION M34533
KEYWORDS alpha-1 type IX collagen.
SEGMENT 1 of 4
SOURCE Mouse DNA, clone YMm513.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 34)
AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
TITLE The alpha-1 (IX) collagen gene gives rise to two different
transcripts in mouse embryonic and human fetal RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
B.R.Olsen, 15-FEB-1990.
FEATURES from to/span description
pept 19 / 32 alpha-1 type IX collagen (COL9A1) precursor,
exon 1
pep$ 19 / 32 alpha-1 type IX collagen precursor, exon 1
sigp 19 / 32 alpha-1 type IX collagen signal peptide
sig$ 19 / 32 alpha-1 type IX collagen signal peptide
IVS 33 > 34 COL9A1 intron A
BASE COUNT 13 a 6 c 9 g 6 t
ORIGIN
1 agagaaccca ctgggaaaat gaagaacttc tggt
//
LOCUS MUSACOLA2 88 bp ds-DNA ROD 06-JUL-1990
DEFINITION Mouse alpha-1 type IX collagen (COL9A1), exon 6.
ACCESSION M32134
KEYWORDS alpha-1 type IX collagen.
SEGMENT 2 of 4
SOURCE Mouse DNA, clones YMm[413,911,341].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 88)
AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
TITLE The alpha-1 (IX) collagen gene gives rise to two different
transcripts in mouse embryonic and human fetal RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
B.R.Olsen, 15-FEB-1990.
FEATURES from to/span description
pept / 3 / 86 alpha-1 type IX collagen (COL9A1), exon 6 (AA
at 3)
pep$ / 3 + 86 alpha-1 type IX collagen, exon 6 (AA at 3)
IVS < 1 2 COL9A1 intron E
IVS 87 > 88 COL9A1 intron F
BASE COUNT 25 a 22 c 21 g 20 t
ORIGIN Unknown number of bp after segment 1.
1 agtttgaact ccagtggatg ctgattcatt gtgaccccct gagacccagg agagaaacct
61 gtcatgagct gccaatcaga atcacagt
//
LOCUS MUSACOLA3 80 bp ds-DNA ROD 06-JUL-1990
DEFINITION Mouse alpha-1 type IX collagen (COL9A1) gene, exon 1*.
ACCESSION M32132
KEYWORDS alpha-1 type IX collagen.
SEGMENT 3 of 4
SOURCE Mouse DNA, clones YMm[413,911,341].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 80)
AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
TITLE The alpha-1 (IX) collagen gene gives rise to two different
transcripts in mouse embryonic and human fetal RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
B.R.Olsen, 15-FEB-1990.
FEATURES from to/span description
pept < 1 + 78 alpha-1 type IX collagen (COL9A1), exon 1* (AA
at 1)
IVS 79 > 80 COL9A1 intron F*
BASE COUNT 8 a 19 c 31 g 22 t
ORIGIN Unknown number of bp after segment 2.
1 atggcctggg ctgcctgggg acgaggagtg cttgggctgt cactgatgct gtctgggctc
61 cgcttgtgtg ctgctcaagt
//
LOCUS MUSACOLA4 25 bp ds-DNA ROD 06-JUL-1990
DEFINITION Mouse alpha-1 type IX collagen (COL9A1) gene, exon 7.
ACCESSION M32136
KEYWORDS alpha-1 type IX collagen.
SEGMENT 4 of 4
SOURCE Mouse DNA, clones YMm[413,911,341].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 25)
AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
TITLE The alpha-1 (IX) collagen gene gives rise to two different
transcripts in mouse embryonic and human fetal RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
B.R.Olsen, 15-FEB-1990.
FEATURES from to/span description
pept + 3 / 23 alpha-1 type IX collagen (COL9A1), exon 7
pep$ + 3 / 23 alpha-1 type IX collagen, exon 7
IVS < 1 2 COL9A1 intron G*
IVS 24 > 25 COL9A1 intron H*
IVS < 1 2 COL9A1 intron F
IVS 24 > 25 COL9A1 intron G
BASE COUNT 8 a 7 c 7 g 3 t
ORIGIN Unknown number of bp after segment 3.
1 agaccagcca gaccactgat gaggt
//GenBank-Updates@genbank.bio.net (07/07/90)
LOCUS TRBMVAT5A 1544 bp ss-mRNA INV 07-JUL-1990
DEFINITION T.brucei variant surface glycoprotein (MVAT5) mRNA, complete cds.
ACCESSION M33825
KEYWORDS variant surface glycoprotein.
SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1544)
AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E.
TITLE Sequences of three VSG mRNAs expressed in a mixed population of
Trypanosoma brucei rhodesiense
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Donelson, 23-APR-1990.
BASE COUNT 497 a 396 c 366 g 285 t
ORIGIN
1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt
61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt
121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa
181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg
241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt
301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa
361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac
421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg
481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc
541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg
601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca
661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca
721 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca
781 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat
841 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt
901 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac
961 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa
1021 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac
1081 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc
1141 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc
1201 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat
1261 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca
1321 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc
1381 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta
1441 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa
1501 gaatttttgc tactttaaaa acttctgata tattttaaca ccta
//
LOCUS TRBWRATATA 1544 bp ss-mRNA INV 07-JUL-1990
DEFINITION T.brucei variant surface glycoprotein (WRATat A) mRNA, complete
cds.
ACCESSION M33823
KEYWORDS variant surface glycoprotein.
SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1544)
AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E.
TITLE Sequences of three VSG mRNAs expressed in a mixed population of
Trypanosoma brucei rhodesiense
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Donelson, 23-APR-1990.
FEATURES from to/span description
pept 42 1457 variant surface glycoprotein (WRATat A)
mRNA < 1 1544 WRATat A mRNA
BASE COUNT 545 a 387 c 345 g 267 t
ORIGIN
1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc
61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt
121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac
181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg
241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg
301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc
361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa
421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct
481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca
541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg
601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac
661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca
721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa
781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac
841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg
901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt
961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg
1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca
1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg
1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc
1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag
1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt
1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga
1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg
1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta
1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg
//
LOCUS TRBWRATATB 1585 bp ss-mRNA INV 07-JUL-1990
DEFINITION T.brucei variant surface glycoprotein (WRATat B) mRNA, complete
cds.
ACCESSION M33824
KEYWORDS variant surface glycoprotein.
SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1585)
AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E.
TITLE Sequences of three VSG mRNAs expressed in a mixed population of
Trypanosoma brucei rhodesiense
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Donelson, 23-APR-1990.
BASE COUNT 535 a 384 c 376 g 290 t
ORIGIN
1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact
61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga
121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa
181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac
241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga
301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat
361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga
421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc
481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca
541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc
601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg
661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag
721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa
781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga
841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact
901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca
961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca
1021 cgacctcata acagctaaat ccggtacgac ataagcggcg caatcactcg gctaaagcaa
1081 ctacaaatcg ccaaagcaaa gctaaaagca agacggctac tgctgcaaaa cagggaacgg
1141 cagcaaacgc gacttatggc gctagcagac aagatgcaag aattgtacca agaggcctta
1201 catgacgagg ttcaactcag gaaggaagcg cagaacaaac ctcaagaaac accagattct
1261 gacaagcaaa aagcatgcga gaaatatcac aacaagtcaa aggaatgcaa agaaaatggt
1321 tgccaatgga gtggaactga agaaaccact aggaaagtgc gaagctaaac ccaaagcagg
1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac
1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt
1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact
1561 tctggtatat tttaacacct ttaaa
//
LOCUS HUMMHB17W 3520 bp ds-DNA PRI 07-JUL-1990
DEFINITION Human MHC class I HLA-Bw58 gene, complete cds.
ACCESSION M11799
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human B lymphoblastoid cell line WT49 DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3520)
AUTHORS Ways,J.P., Coppin,H.L. and Parham,P.
TITLE The complete primary structure of HLA-Bw58
JOURNAL J. Biol. Chem. 260, 11924-11933 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept 88 160 HLA-Bw58 of MHC, exon 1 /nomgen="HLA-B"
/map="6p21.3" /hgml_locus_uid="LL0111R"
290 559 HLA-Bw58 of MHC, exon 2
806 1081 HLA-Bw58 of MHC, exon 3
1657 1932 HLA-Bw58 of MHC, exon 4
2037 2153 HLA-Bw58 of MHC, exon 5
2595 2627 HLA-Bw58 of MHC, exon 6
2734 2777 HLA-Bw58 of MHC, exon 7
pre-msg 67 3386 HLA-Bw58 of MHC, mRNA and introns
IVS 161 289 HLA-Bw58 intron A
IVS 560 805 HLA-Bw58 intron B
IVS 1082 1656 HLA-Bw58 intron C
IVS 1933 2036 HLA-Bw58 intron D
IVS 2154 2594 HLA-Bw58 intron E
IVS 2628 2733 HLA-Bw58 intron F
BASE COUNT 665 a 981 c 1099 g 775 t
ORIGIN Chromosome 6p21.3.
1 tctagagaag ccaatcagtg tcgccggggt cccagttcta aagtccccac gcacccaccc
61 ggactcagaa tctcctcaga cgccgagatg cgggtcacgg cgccccgaac cgtcctcctg
121 ctgctctggg gggcagtggc cctgaccgag acctgggccg gtgagtgcgg ggtcgggagg
181 gaaatggcct ctgtggggag gagcgagggg accgcaggcg ggggcgcagg acctgaggag
241 ccgcgccggg aggagggtcg ggcgggtctc agcccctcct cgcccccagg ctcccactcc
301 atgaggtatt tctacaccgc catgtcccgg cccggccgcg gggagccccg cttcatcgca
361 gtgggctacg tggacgacac ccagttcgtg aggttcgaca gcgacgccgc gagtccgagg
421 acggagcccc gggcgccatg gatagagcag gaggggccgg agtattggga cggggagaca
481 cggaacatga aggcctccgc gcagacttac cgagagaacc tgcggatcgc gctccgctac
541 tacaaccaga gcgaggccgg tgagtgaccc cggcccgggg cgcaggtcac gactccccat
601 cccccacgta cggcccgggg tcgccccgag tctccgggtc cgagatccgc ctccctgagg
661 ccgcgggacc cgcccagacc ctcgaccggc gagagcccca ggcgcgttta cccggtttca
721 ttttcagttg aggccaaaat ccccgcgggt tggtcggggc ggggcggggc tcgggggacg
781 gggctgaccg cggggccggg gccagggtct cacatcatcc agaggatgta tggctgcgac
841 ctggggcccg acgggcgcct cctccgcggg catgaccagt ccgcctacga cggcaaggat
901 tacatcgccc tgaacgagga cctgagctcc tggaccgcgg cggacaccgc ggctcagatc
961 acccagcgca agtgggaggc ggcccgtgtg gcggagcagc tgagagccta cctggagggc
1021 ctgtgcgtgg agtggctccg cagatacctg gagaacggga aggagacgct gcagcgcgcg
1081 ggtaccaggg gcagtgggga gccttcccca tctcctatag gtcgccgggg atggcctccc
1141 acgagaagag gaggaaaatg ggatcagcgc tagaatgtcg ccctcccttg aatggagaat
1201 ggcatgagtt ttcctgagtt tcctctgagg gccccctctt ctctctagga caattaaggg
1261 atgacgtctc tgaggaaatg gaggggaaga cagtccctag aatactgatc aggggtcccc
1321 tttgacccct gcagcagcct tgggaaccgt gacttttcct ctcaggcctt gttctctgcc
1381 tcacactcag tgtgtttggg gctctgattc cagcacttct gagtcacttt acctccactc
1441 agatcaggag cagaagtccc tgttccccgc tcagagactc gaactttcca atgaatagga
1501 gattatccca ggtgcctgcg tccaggctgg tgtctgggtt ctgtgcccct tccccacacc
1561 aggtgtcctg tccattctca ggctggtcac atgggtggtc ctagggtgtc ccatgagaga
1621 tgcaaagcgc ctgaattttc tgactcttcc catcagaccc cccaaagaca cacgtgaccc
1681 accaccccgt ctctgaccat gaggccaccc tgaggtgctg ggccctgggc ttctaccctg
1741 cggagatcac actgacctgg cagcgggatg gcgaggacca aactcaggac actgagcttg
1801 tggagaccag accagcagga gatagaacct tccagaagtg ggcagctgtg gtggtgcctt
1861 ctggagaaga gcagagatac acatgccatg tacagcatga ggggctgccg aagcccctca
1921 ccctgagatg gggtaaggag ggggatgagg ggtcatatct cttctcaggg aaagcaggag
1981 cccttctgga gcccttcagc agggtcaggg cccctcgtct tcccctcctt tcccagagcc
2041 atcttcccag tccaccatcc ccatcgtggg cattgttgct ggcctggctg tcctagcagt
2101 tgtggtcatc ggagctgtgg tcgctactgt gatgtgtagg aggaagagct caggtaggga
2161 aggggtgagg ggtggggtct gggttttctt gtcccactgg gggtttcaag ccccaggtag
2221 aagtgttccc tccctcatta ctgggaagca gcatccacac aggggctaac gcagcctggg
2281 accctgtgtg ccagcactta ctcttttgtg cagcacatgt gacaatgaag gacggatgta
2341 tcaccttgat ggttgtggtg ttggggtcct gatttcagca ttcatgagtc aggggaaggt
2401 ccctgctaag gacagacctt aggagggcag ttggtccagg acccacactt gctttcctcg
2461 tgtttcctga tcctgccttg ggtctgtagt catacttctg gaaattcctt ttgggtccaa
2521 gacgaggagg ttcctctaag atcttaaggc cctgcttcct cccagtcccc tcacaggaca
2581 ttttcttccc acaggtggaa aaggagggag ctactctcag gctgcgtgta agtggtgggg
2641 gtgggagtgt ggaggagctc acccacccca taattcctcc tgtcccacgt ctcctgcggg
2701 ctctgaccag gtcctgtttt tgttctactc cagccagcga cagtgcccag ggctctgatg
2761 tgtctctcac agcttgaaaa ggtgagattc ttggggtcta gagtgggtcg ggtggcgggt
2821 ctgggggtgg gtggggcaga ggggaaaggc ctgggtaatg gggattcttt gattgggatg
2881 tttcgcgtgt gtggtgggct gtttacagtg tcatcgctta ccatgactaa ccagaatttg
2941 ttcatgactg ttgttttctg tagcctgaga cagctgtctt gtgagggact gagatgcagg
3001 atttcttcac gcctcccctt tgtgacttca agagcctctg gcatctcttt ctgcaaaggc
3061 acctgaatgt gtctgcgtcc ctgttagcat aatgtgagga ggtggagaca cagcccaccc
3121 ttgtgtccac tgtgacccct gttcccatgc tgacctgtgt ttcctcccca gtcatctttc
3181 ctgttccaga gaggtggggc tggatgtctc catctctgtc tcaactttac gtgcactgag
3241 ctgcaacttc ttacttccct actgaaaata agaatctgaa tataaatttg ttttctcaaa
3301 tatttgctat gagaggttga tggattaatt aaataagtca attcctggaa tttgagagag
3361 caaataaaga cctgagaacc ttccagaatc tgcatgttcg ctgtgctgag tctgttgcag
3421 gtggggtgtg gagaaggctg tggggggccg agtgtggacg gggcctgtgc ccatttggtg
3481 ttgagtccat catgggcttt atgtggttag tcctcagctg
//
LOCUS HUMTGFB1B 5075 bp ss-mRNA PRI 07-JUL-1990
DEFINITION Human transforming growth factor-beta 1 binding protein mRNA,
complete cds.
ACCESSION M34057
KEYWORDS transforming growth factor-beta 1 binding protein.
SOURCE Human fibroblast, cDNA to mRNA, clone BPA13.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 5075)
AUTHORS Kanzaki,T., Olofsson,A., Moren,A., Wernstedt,C., Hellman,U.,
Miyazono,K., Claesson-Welsh,L. and Heldin,C.-H.
TITLE TGF-beta-1 binding protein: A component of the large latent complex
of TGF-beta-1 with multiple repeat sequences
JOURNAL Cell 61, 1051-1061 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.-H.Heldin, 08-MAY-1990.
FEATURES from to/span description
pept 91 4275 transforming growth factor-beta 1 binding
protein precursor
sigp 91 150 transforming growth factor-beta 1 binding
protein signal peptide
matp 151 4272 transforming growth factor-beta 1 binding
protein
BASE COUNT 1353 a 1122 c 1252 g 1348 t
ORIGIN
1 tgaattcggc acgaggtggg ctctctcttt tcccctcttg ctcctttctt ttcttttttt
61 ctgttttttt aaaccttcca aggcaagttc atggatacta agctgatgtg tttgttgttc
121 tttttctccc tgcctccgct cctagtgagt aaccacactg gccgcatcaa ggtggtcttt
181 actccgagca tctgtaaagt gacctgcacc aagggcagct gtcagaacag ctgtgagaag
241 gggaacacca ccactctcat tagtgagaat ggtcatgctg ccgacaccct gacggccacg
301 aacttccgag tggtaatttg ccatcttcca tgtatgaatg gtggccagtg cagttcaagg
361 gacaaatgtc agtgccctcc aaatttcaca ggaaaacttt gtcagatccc agtccatggt
421 gccagcgtgc ctaaacttta tcagcattcc cagcagccag gcaaggcgtt ggggacgcat
481 gtcatccatt caacacatac cttgcctctg accgtgacta gccagcaagg agtcaaagtg
541 aaatttcctc ctaacatagt caatatccat gtgaaacatc ctcctgaagc ttccgtccag
601 atacatcagg tttcaagaat tgatggccca acaggccaga agacaaaaga agctcaacca
661 ggccaatccc aagtctcgta ccaagggctt cctgtccaga agacccagac catacattcc
721 acatactccc accagcaggt cattcctcac gtctaccccg tggctgctaa gacacagctt
781 ggccggtgct tccaggaaac cattgggtca cagtgtggca aagcgctccc tggcctttca
841 aagcaagagg actgctgtgg aactgtgggt acctcctggg gctttaacaa atgccagaaa
901 tgccccaaga aaccatctta tcatggatac aaccaaatga tggaatgcct accgggttat
961 aagcgggtta acaacacctt ttgccaagat attaatgaat gtcagctaca aggtgtatgc
1021 cctaatggtg agtgtttgaa taccatgggc agctatcgat gtacctgcaa aataggattt
1081 gggccggatc ctaccttttc aagttgtgtt cctgatcccc ctgtgatctc ggaagagaaa
1141 gggccctgtt accgacttgt cagttctgga agacagtgta tgtaccctct gtctgttcac
1201 ctcaccaagc agctctgctg ttgtagtgtg ggcaaggctg ggccacactg tgagaaatgt
1261 ccccttccag gcacagctgc ttttaaggaa atctgtcctg gtggaatggg ttatacggtt
1321 tctggcgttc atagacgcag gccaatccat caccatgtag gtaaaggacc tgtatttgtc
1381 aagccaaaga acactcaacc tgttgctaaa agtactcatc ctccacctct cccagccaag
1441 gaagagccag tggaggccct gaccttctcc cgggaacacg gggccaggag tgcggagcca
1501 gaagtggcaa ctgcaccccc tgaaaaggaa ataccttcat tggatcaaga gaaaaccaaa
1561 cttgagcctg gtcaacccca gctgtctcca ggcatttccg ctattcatct gcatccacag
1621 tttccagtag tgattgaaaa aacatcacct cctgtgcctg ttgaagtagc tcctgaagct
1681 tctacgtcta gtgccagcca agtgattgct cctactcaag tgacagaaat caatgaatgt
1741 actgtgaacc ctgatatctg tggagcagga cactgcatta acctaccagt gagatatacc
1801 tgtatatgct acgagggcta caggttcagt gaacaacaga ggaaatgtgt ggatattgat
1861 gagtgtactc aggtccaaca cctctgctcc cagggccgct gtgaaaacac cgagggaagt
1921 ttcttgtgca tttgcccagc aggatttatg gccagtgagg agggtactaa ctgcatagat
1981 gttgacgaat gcctgaggcc ggacgtctgt ggggaggggc actgtgtcaa tactgtgggg
2041 gccttccggt gtgaatactg tgacagcggg taccgcatga ctcagagagg ccgttgtgag
2101 gatattgatg aatgtttgaa tccaagcact tgtccagatg agcagtgtgt gaattctcct
2161 ggatcttacc agtgcgttcc ctgcacagaa ggattccgag gctggaatgg acagtgcctt
2221 gatgtggacg agtgcctgga accaaacgtc tgcgcaaatg gtgattgttc caaccttgaa
2281 ggctcctaca tgtgttcatg ccacaaaggc tatacccgga ctccggacca caagcactgt
2341 agagatattg atgaatgtca gcaagggaat ctatgtgtaa acgggcagtg caaaaatacc
2401 gagggctcct tcaggtgcac ctgtggacag gggtaccagc tgtcggcagc taaagaccag
2461 tgtgaagaca ttgatgaatg ccagcaccgt catctctgtg ctcatgggca gtgcaggaac
2521 actgagggct cttttcaatg tgtgtgtgac cagggttaca gagcatctgg gcttggagac
2581 cactgtgaag atatcaatga atgcttggag gacaagagtg tttgccagag aggagactgc
2641 attaatactg cagggtccta tgattgtact tgtccggatg gatttcagct agatgacaat
2701 aaaacatgtc aagatattaa tgaatgtgaa catccagggc tctgtggtcc gcaaggggag
2761 tgcctaaaca cagagggttc tttccattgt gtctgccagc agggtttctc aatctctgca
2821 gatggccgta cgtgtgaaga tattgatgaa tgtgtaaaca acactgtttg tgacagtcac
2881 gggttttgtg acaatacagc tggctccttc cgctgcctct gttatcaggg ctttcaagcc
2941 ccacaggatg ggcaagggtg tgtggatgtg aatgaatgtg aactgctcag tggggtgtgt
3001 ggtgaagcct tctgtgaaaa cgtggaaggg tccttcctgt gcgtgtgtgc tgatgaaaac
3061 caagagtaca gccccatgac tgggcagtgc cgctcccgga cctccacaga tttagatgta
3121 gatgtagatc aacccaaaga agaaaagaaa gaatgctact ataatctcaa tgacgccagt
3181 ctctgtgata atgtgttggc ccccaatgtc acgaaacaag aatgctgctg tacatcaggc
3241 gcgggatggg gagataactg cgaaatcttc ccctgcccgg tcttgggaac tgctgagttc
3301 actgaaatgt gtcccaaagg gaaaggtttt gtgcctgctg gagaatcatc ttctgaagct
3361 ggtggtgaga actataaaga tgcagatgaa tgcctacttt ttggacaaga aatctgcaaa
3421 aatggtttct gtttgaacac tcggcctggg tatgaatgct actgtaagca agggacgtac
3481 tatgatcctg tgaaactgca gtgctttgat atggatgaat gtcaagaccc cagtagttgt
3541 attgatggcc agtgtgttaa tacagagggc tcttacaact gcttctgtac tcaccccatg
3601 gtcctggatg cgtcagaaaa aagatgtata cgaccggctg agtcaaacga acaaatagaa
3661 gaaactgatg tctaccaaga tttgtgctgg gaacatctga gtgatgaata cgtgtgtagc
3721 cggcctcttg tgggcaagca gacaacgtac actgagtgct gctgtctgta tggagaggcc
3781 tggggcatgc agtgtgccct ctgccccctg aaggattcag atgactatgc tcagctgtgt
3841 aacatccccg tgacgggacg ccggcagcca tatggacggg acgccttggt tgacttcagt
3901 gaacagtata ctccagaagc cgatccctac ttcatccaag accgttttct aaatagcttt
3961 gaggagttac aggctgagga atgcggcatc ctcaatggat gtgaaaatgg tcgctgtgtg
4021 agggtccagg aaggttacac ctgcgattgc ttggatgggt atcacttgga tactgccaag
4081 atgacctgtt tcgatgtaaa tgaatgcgat gagttgaaca accggatgtc tctctgcaag
4141 aatgccaagt gcattaacac cgatggttcc tacaagtgtt tgtgtctgcc aggctacgtg
4201 ccttctgaca agccaaacta ctgcactccg ttgaataccg ccttgaattt agagaaagac
4261 agtgacctgg agtgaaacag aatctacata acctaagccc atatactctg cactgtgtaa
4321 aggaaaaggg agaaatgtat tatacttgag acattgcacc taccccggaa ggctggaaat
4381 acagaaacag catggagttg caagtcctct gaagacaatg agaggattta ggatgagccc
4441 gataggtgtg gcagaccaaa tggacatttc tctaaaaaac cagtatatat agtctgttca
4501 tatgtaaaat tcaatggaag agaggtggaa cagtgctgtt attttaaaca gaaggttgta
4561 ttattatgtt gttttgtttt ttttactatt gcttgattaa atttggcatt taaatagtgg
4621 tggaaatatt tttatataat tttcattttt tggttgtgca gttccttggc tactgttttt
4681 cttttacttc agttttttaa aaatctcaaa tgaaaaagtc ttcgatacaa tattgttaag
4741 ctgtattata agtattgtta cacagggtta tgcaattccc ggcctggagc atttttgaaa
4801 ttcaaattgt ctgtcctgtg gagcaggcag tgattttgtt ccaaaacttt gtatacacat
4861 ttggagaaaa gtactttata ttttcagtgt tttgtctgat tttaatgtcc gttcttagcc
4921 aagctgctag caggtgttaa ttggatccct ttccttcact gaaatggaag agtttataag
4981 cttacgttag tattgtaata tgtaaagtaa gcccaacaaa aatttttaaa aatttgatga
5041 tccccaatat atctaccatt gtatgttaaa taaat
//
LOCUS MUSPLGN 2720 bp ss-mRNA ROD 07-JUL-1990
DEFINITION Mouse plasminogen mRNA, complete cds.
ACCESSION J04766
KEYWORDS plasminogen.
SOURCE Mouse cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2720)
AUTHORS Degen,S.J.
TITLE Characterization of the cDNA coding for mouse plasminogen and
localization of the gene to mouse chromosome 17
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly
submitted by S.J.Degen, 08-MAY-1990.
FEATURES from to/span description
pept 25 2463 plasminogen precursor
sigp 25 81 plasminogen signal peptide
matp 82 2460 plasminogen
signal 2699 2704 poly-A signal
BASE COUNT 731 a 707 c 710 g 572 t
ORIGIN Chromosome 17.
1 ccggtgctgt tggccagtcc caacatggac cataaggaag taatccttct gtttctcttg
61 cttctgaaac caggacaagg ggactcgctg gatggctaca taagcacaca aggggcttca
121 ctgttcagtc tcaccaagaa gcagctcgca gcaggaggtg tctcggactg tttggccaaa
181 tgtgaagggg aaacagactt tgtctgcagg tcattccagt accacagcaa agagcagcaa
241 tgcgtgatca tggcggagaa cagcaagact tcctccatca tccggatgag agacgtcatc
301 ttattcgaaa agagagtgta tctgtcagaa tgtaagaccg gcatcggcaa cggctacaga
361 ggaaccatgt ccaggacaaa gagtggtgtt gcctgtcaaa agtggggtgc cacgttcccc
421 cacgtaccca actactctcc cagtacacat cccaatgagg gactagaaga gaactactgt
481 aggaacccag acaatgatga acaagggcct tggtgctaca ctacagatcc ggacaagaga
541 tatgactact gcaacattcc tgaatgtgaa gaggaatgca tgtactgcag tggagaaaag
601 tatgagggca aaatctccaa gaccatgtct ggacttgact gccaggcctg ggattctcag
661 agcccacatg ctcatggata catccctgcc aaatttccaa gcaagaacct gaagatgaat
721 tattgccaca accctgacgg ggagccaagg ccctggtgct tcacaacaga ccccaccaaa
781 cgctgggaat actgtgacat cccccgctgc acaacacccc cgcccccacc cagcccaacc
841 taccaatgtc tgaaaggaag aggtgaaaat taccgaggga ccgtgtctgt caccgtgtct
901 gggaaaacct gtcagcgctg gagtgagcaa acccctcata ggcacaacag gacaccagaa
961 aatttcccct gcaaaaatct ggaagagaac tactgccgga acccagatgg agaaactgct
1021 ccctggtgct ataccactga cagccagctg aggtgggagt actgtgagat tccatcctgc
1081 gagtcctcag catcaccaga ccagtcagat tcctcagttc caccagagga gcaaacacct
1141 gtggtccagg aatgctacca gagcgatggg cagagctatc ggggtacatc gtccactacc
1201 atcacaggga agaagtgcca gtcctgggca gctatgtttc cacacaggca ttcgaagacc
1261 ccagagaact tcccagatgc tggcttggag atgaactact gcaggaaccc ggatggtgac
1321 aagggccctt ggtgctacac cactgacccg agcgtcaggt gggaatactg caacctgaag
1381 cggtgctcag agacaggagg gagtgttgtg gaattgccca cagtttccca ggaaccaagt
1441 gggccgagcg actctgagac agactgcatg tatgggaatg gcaaagacta tcggggcaaa
1501 acggccgtca ctgcagctgg caccccctgc cagggatggg ctgcccagga gccccacagg
1561 cacagcatct tcaccccaca gacaaaccca cgggcagatc tggaaaagaa ctactgccga
1621 aacccagatg gggatgtgaa tggtccttgg tgctatacaa caaaccccag aaaactttat
1681 gactattgtg acatccccct gtgtgcatca gcatcatcct ttgagtgcgg gaaacctcag
1741 gtggaaccga agaaatgccc tgggagggtg gtgggtggct gcgtggccaa ccctcactcc
1801 tggccctggc aaatcagcct tagaacaaga tttaccggac agcacttctg tggcggtact
1861 ttaatagccc cagagtgggt tctgactgct gcccactgtt tggagaaatc ttcaagacct
1921 gaattctaca aggttatcct gggtgcgcac gaagaatata tccgtgggtt ggatgttcag
1981 gaaatatcag tagccaaact gatcttggag cccaacaacc gtgacattgc cctgctgaaa
2041 ctaagccgcc cagccaccat cacggataaa gtcattccag cttgtctgcc atctccaaat
2101 tacatggttg ctgaccggac aatatgttac atcaccggct ggggagagac tcaagggact
2161 ttcggtgccg gtcgtctcaa ggaggctcag ctgcctgtga ttgagaacaa ggtgtgcaac
2221 cgcgtcgagt atctgaacaa cagagtcaaa tccacggagc tctgtgccgg gcaactggct
2281 ggtggcgtcg acagctgcca aggcgacagt ggaggacctc tggtttgctt cgagaaggac
2341 aagtacattt tacaaggagt cacttcttgg ggtcttggct gtgctcgccc caataagcct
2401 ggtgtctacg ttcgtgtctc acggtttgtt gattggattg aaagggagat gaggaataac
2461 tgactaggtg gaaggccgag caaaacctct gcttactaaa gcttactgaa tatggggaga
2521 gggcttaggg tgtttggaaa aactgacagt aatcaaactg ggacactaca ctgaaccaca
2581 gcttcctgtc gcccctcagc ccctcccctt tttttgtatt attgtgggta aaattttcct
2641 gtctgtggac ttctggattt tgtgacaata gaccatcact gctgtgacct ttgttgaaaa
2701 taaactcgat acttactttg
//
LOCUS HUMMHEA 4938 bp ds-DNA PRI 07-JUL-1990
DEFINITION Human MHC class I lymphocyte antigen (HLA-E) (HLA-6.2) gene,
complete cds.
ACCESSION M21533
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; lymphocyte antigen;
major histocompatibility complex.
SOURCE Human lymphoblastoid (cell line B-LCL) DNA, clone HLA-6.2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4938)
AUTHORS Koller,B.H., Geraghty,D.E., Shimizu,Y., DeMars,R. and Orr,H.T.
TITLE A novel HLA class I gene expressed in resting T lymphocytes
JOURNAL J. Immunol. 141, 897-904 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
H.T.Orr, 23-NOV-1988.
FEATURES from to/span description
pept 705 768 MHC HLA-E precursor, exon 1 /nomgen="HLA-E"
/map="6p21.3" /hgml_locus_uid="LT0100B"
899 1168 MHC HLA-E precursor, exon 2
1411 1686 MHC HLA-E precursor, exon 3
2308 2583 MHC HLA-E precursor, exon 4
2708 2824 MHC HLA-E precursor, exon 5
3575 3607 MHC HLA-E precursor, exon 6
3712 3752 MHC HLA-E precursor, exon 7
sigp 704 768 MHC HLA-E signal peptide
matp 899 1168 MHC HLA-E
1411 1686 MHC HLA-E
2308 2583 MHC HLA-E
2708 2825 MHC HLA-E
3576 3607 MHC HLA-E
3712 3749 MHC HLA-E
pre-msg < 705 4479 MHC HLA-E mRNA and introns
IVS 769 898 MHC HLA-E, intron A
IVS 1169 1410 MHC HLA-E, intron B
IVS 1687 2307 MHC HLA-E, intron C
IVS 2584 2707 MHC HLA-E, intron D
IVS 2825 3574 MHC HLA-E, intron E
IVS 3608 3711 MHC HLA-E, intron F
IVS 3755 3919 MHC HLA-E, intron G
BASE COUNT 1065 a 1332 c 1380 g 1161 t
ORIGIN Chromosome 6p21.3.
1 ggcatgatgg acgcgattag ccggtaacgg atacatgaag catgaggtct ttgaagtcat
61 tggcggctct ctggtgtacg agtctcagct ggttgactac aaggcactct gaagcagatc
121 ctgttttttg ttttgttttg ttttttgttc tgtttttccc cctagacatc tcactctgct
181 gcccaggctg gagtgcagtg gtgtgatctc ggctcactgc aaccatccat cctcctgggt
241 tcaagtagat tctccatgcc tcagcctccc aagtagccag gattacagcc atgcgccacc
301 acgccggcta attttttgga cttttagtag agacagggtt tctccatatt ggtcgggctg
361 gtctcgaact cccaacctca ggtgatcagc ccgccttggc ctcccaaagt gctgagatta
421 caggcgtgag ccaccgcgcc cagccaggac taatttctaa gagtgtgcag agataccgaa
481 acctaaaagt ttaagaactg ctgattgctg ggaaactctg cagtttcccg ttcctctcgt
541 aacctggtca tgtgtccttc ttcctggata ctcatgacgc agactcagtt ctcattccca
601 atgggtgtcg ggtttctaga gaagccaatc agcgtcgcca cgactcccga ctataaagtc
661 cccatccgga ctcaagaagt tctcaggact cagaggctgg gatcatggta gatggaaccc
721 tccttttact cctctcggag gccctggccc ttacccagac ctgggcgggt gagtgcgggg
781 tcgggatgga aacggcctct accgggagta gagaggggcc ggcccggcgg gggcgaagga
841 ctcggggagc cgcgccggga ggagggtcgg gccgatctca gcccctcctc gcccccaggc
901 tcccactcct tgaagtattt ccacacttcc gtgtcccggc ccggccgcgg ggagccccgc
961 ttcatctctg tgggctacgt ggacgacacc cagttcgtgc gcttcgacaa cgacgccgcg
1021 agtccgagga tggtgccgcg ggcgccgtgg atggagcagg aggggtcaga gtattgggac
1081 cgggagacac ggagcgccag ggacaccgca cagattttcc gagtgaacct gcggacgctg
1141 cggcgctact acaatcagag cgaggccggt gagtgacccc ggccaggagc aggtcacgac
1201 ccctccccat cccccacgga cggcgcgggt cccctcgaat cttcgggtcc cagattcacc
1261 ccaaggctgc ggaacccgcc cagaccctag accggggaga gtctcaggcg cctttacccg
1321 gttctttttc agtttaggcc aaaatgccca cagggtggtg gcgacggggg cggggcttgg
1381 tgggcgggac tgactaaggg gcggggccag ggtctcacac cctgcagtgg atgcatggct
1441 gcgagctggg gcccgacagg cgcttcctcc gcgggtatga acagttcgcc tacgacggca
1501 aggattatct caccctgaat gaggacctgc gctcctggac cgcggtggac acggcggctc
1561 agatctccga gcaaaagtca aatgatgcct ctgaggcgga gcaccagaga gcctacctgg
1621 aagacacatg cgtggagtgg ctccacaaat acctggagaa ggggaaggag acgctgcttc
1681 acctgggtaa gagggtccac agggctactc tcccatctcc ttcttgggct aggactgtgc
1741 ccacagctga cagacctcaa acagtagaag aaacagggat ggaggccaga ataccactcc
1801 tcccttggat caggagaggg agctgtcacc tgaggtacag gagatcctat accacagagt
1861 gactctctta aagggccaga cctctctcag gggcaattaa ggaatctagt ctcgctggag
1921 attccatcct tcagatgaac tgatgagcag ttctctttga ctcccagtat taggaatcac
1981 gggggagttt ctctcgtgcc tgattctcag ccccacacca agagtttttg gaggtctgac
2041 tccagctttt ctcagtcact cagcatccac acaggccagg accagaaatc ccttttcacc
2101 ttctaccctg ggctagctca tcccgattct agaactttcc aaggaataag aggctatccc
2161 agatccctaa gtccaggctg gtgtcaaggt tttgtcctct tctcctacta taattgtcct
2221 cttccttctc aggatggtca catgggtgct gctggagtgt cccatgagag atacaaagtg
2281 cctgaatttt ctgactcttc ccctcagagc ccccaaagac acacgtgact caccacccca
2341 tctctgacca tgaggccacc ctgaggtgct gggccctggg cttctaccct gcggagatca
2401 cactgacctg gcagcaggat ggggagggcc atacccagga cacggagctc gtggagacca
2461 ggcctgcagg ggatggaacc ttccagaagt gggcagctgt ggtggtgcct tctggagagg
2521 agcagagata cacgtgccat gtgcagcatg aggggctacc cgagcccgtc accctgagat
2581 ggagtaagga gggggatggg aggtcatgtc tcttctcagg gaaagcggga gcccttctgg
2641 agcccttccg cagggtcagg gctgaggcct gggggtcagg gccccttacg ttcccctctt
2701 ttcccagagc cggcttccca gcccaccatc cccatcgtgg gcatcattgc tggcctggtt
2761 ctccttggat ctgtggtctc tggagctgtg gttgctgctg tgatatggag gaagaagagc
2821 tcaggtgggg aagggagaag ggtggggtct gagttttctt gtcccactgg gtgtttcaag
2881 ccctaggtaa aagtgtgtcc tgcctcgtta ctgggaagca ccatccacac acacgagcct
2941 acccagcctg gggccctgtg tgccagcacc tactcttttt ttttgagacg gagtcttggc
3001 tctgtcaccc aggctggagt gcaatggcgt ggtttcagct cactgcaacc tccgcctccc
3061 aggttcaagc aattctcctg cctcagcctc cctagtagct gggactacac atgcgtgcca
3121 ccacacctgg ctaatttttt tttttgtatt tttagtggag atggggtttc actatgttgg
3181 ccaggctggt ctcgaactcc tgactttgtg atctgcctgc ctcggcctcc caaagtgctg
3241 ggattacagt cgtgagccac cgcacccagc cgcacctact cttttgtaaa gcacctgtga
3301 caatgaagga cagatttatc accttgacga ttgtggtgat ggggacctga tcccagcagt
3361 cacaggtcac aggggaaggt ccctgctgaa gacagacctc agaagggcag ttgatccagg
3421 acccacacct gctttcttca cgtttcctga tcctgccctg ggtctgcagt cacagttcag
3481 gaaacttctc tgggatccaa aactaggagg ttcctctagg accttatggc cctgcctcct
3541 ccctggcccc tcacaggaca ttttcttcca acaggtggaa aaggagggag ctactctaag
3601 gctgagtgta agtgcggggc gggagcgtgg aggagctcgc ccaccctata attcctcctg
3661 caccacatct cctgtgggct ctgaccaggt cttgtttttg ttctacccca gggagcgaca
3721 gtgcccaggg gtctgagtct cacagcttgt aaaggtgaga ttctgggggt ctgaagtggg
3781 tggagggtgg ggcagagggg acaggactgg gttgtgggga ttttttgatt cagaattttt
3841 gagtgtgtgg tgggctgttc agagtgtcat cacttaccgt gactgacctg aatttgttca
3901 tgactatttt cttctgtagc ctgagacagc tgccttgtgt gcgactgaga tgcacagctg
3961 ccttgtgtgc gactgagatg caggatttcc tcacgcctcc cctatgtgtc ttaggggact
4021 ctggcttctc tttttgcaag ggcctctgaa tctgtctgtg tccctgttag cacaatgtga
4081 ggaggtagag aaacagtcca cctctgtgtc taccatgacc cccttcctca cactgacctg
4141 tgttccttcc ctgttctctt ttctattaaa aataagaacc tgggcagagt gcggcagctc
4201 atgcctgtaa tcccagcact tagggaggcc gaggagggca gatcacgagg tcaggagatc
4261 gaaaccatcc tggctaacac ggtgaaaccc cgtctctact aaaaaataca aaaaattagc
4321 tgggcgcaga ggcacgggcc tgtagtccca gctactcagg aggcggaggc aggagaatgg
4381 cgtcaacccg ggaggcggag gttgcagtga gccaggattg tgcgactgca ctccagcctg
4441 ggtgacaggg tgaaacgcca tctcaaaaaa taaaaattaa aaaataaaaa aagaacctgg
4501 atctcaattt aatttttcat attcttgcaa tgaaatggac ttgaggaagc taagatcata
4561 gctagaaata cagataattc cacagcacat ctctagcaaa tttagcctat tcctattctc
4621 tagcctattc cttaccacct gtaatcttga ccatatacct tggagttgaa tattgttttc
4681 atactgctgt ggtttgaatg ttccctccaa cactcatgtt gagacttaat ccctaatgtg
4741 gcaatactga aaggtggggc ctttgagatg tgattggatc gtaaggctgt gccttcattc
4801 atgggttaat ggattaatgg gttatcacag gaatgggact ggtggcttta taagaagagg
4861 aaaagagaac tgagctagca tgcccagccc acagagagcc tccactagag tgatgctaag
4921 tggaaatgtg aggtgcag
//
LOCUS HUMGSTPI1 1724 bp ds-DNA PRI 07-JUL-1990
DEFINITION Human glutathione S-transferase pi (GST-pi) gene, exons 1-5.
ACCESSION X08094 M33082
KEYWORDS glutathione S-transferase pi.
SEGMENT 1 of 3
SOURCE Human, cell line HPB-ALL, lymphoid DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1724)
AUTHORS Cowell,I.G.
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 1724)
AUTHORS Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B.
TITLE The structure of the human glutathione S-transferase pi gene
JOURNAL Biochem. J. 255, 79-83 (1988)
STANDARD simple staff_entry
COMMENT See X08094 - X08096 for GST pi gene sequence; see X06547 for GST
mRNA sequence.
EMBL features not translated to GenBank features:
key from to description
SITE 116 123 TRE-box
SITE 129 134 Sp1 binding site
SITE 138 143 Sp1 binding site
PRM 154 157 TATA-box
MSG 182 214 Exon 1
MSG 499 534 Exon 2
MSG 819 925 Exon 3
MSG 1042 1129 Exon 4
MSG 1488 1591 Exon 5
[1] Author address Cowell I.G., University College and Middlesex,
School of Medicine, Windeyer Building, Cleveland Street, London,
W1P 6DB, United Kingdom.
Submitted (13-JUL-1988) on tape to the EMBL data library by:
FEATURES from to/span description
pept 214 214 glutathione S-transferase pi, exon 1
/hgml_locus_uid="LQ0085R" /map="11q13-qter"
/nomgen="GST3"
499 534 glutathione S-transferase pi, exon 2
819 925 glutathione S-transferase pi, exon 3
1042 1129 glutathione S-transferase pi, exon 4
1488 + 1591 glutathione S-transferase pi, exon 5
pre-msg 182 > 1724 GST3 mRNA and introns
IVS 215 498 GST3 intron A
IVS 535 818 GST3 intron B
IVS 926 1041 GST3 intron C
IVS 1130 1487 GST3 intron D
IVS 1592 > 1724 GST3 intron E
revision 1532 1541 atgaatgac in [1]; gtgaatgacg in [2]
revision 1707 1707 n in [1]; g in [2]
BASE COUNT 272 a 598 c 541 g 313 t
ORIGIN
1 tcgctgcgac tccgggactc cagggcgccc ctctcggccg acgcccgggg tgcagcggcg
61 ccgggctggg gccggcggga gtccgcggga ccctccagaa gagcggccgg cgccgtgact
121 cagcactggg gcggagcggg gcgggaccac ccttataagg ctcggaggcc gcgaggcctt
181 cgctggagtt tcgccgccgc agtcttcgcc accagtgagt acgcgcggcc cgcgtccccg
241 gggatggggc tcagagctcc cagcatgggg ccaacccgca gcatcaggcc cgggctcccg
301 gcaggctcct cgcccacctc gagacccggg acggggccta ggggacccag gacgtcccca
361 gtggccgtta gcggctttca gggggcccgg agcgcctcgg ggagggatgg gaccccgggg
421 gcgggagggg ggggcagctc cactcaccgc gccttggcat cctccccggg ctccacaaat
481 tttctttgtt cgctgcagtg ccgccctaca ccgtggtcta tttcccagtt cgaggtagga
541 gcatgtgtct ggcaggaagg gaggcaggcc tggctgcagc ccacagcccc tcgcccaccc
601 ggagagatcc gaaccccctt atccctccgt cgtgtggctt ttaccccggg cctccttcct
661 gttccccgcc tctcccgcca tgcctgctcc ccgccccagt gttgtgtaaa atcttggagg
721 aacctgtttc ctgttccctc ctgcactcct gaccctcccg ggttgctgcg aggcggagtg
781 gcccggtccc cacatctcgt acttctccct ccccgcaggc cgctgcgcgg ccctgcgcat
841 gctgctggca gatcagggcc agagctggaa ggaggaggtg gtgaccgtgg agacgtggca
901 ggagggctca ctcaaagcct cctgcgtaag tgaccatgcc cgggcaaggg gagggggtgc
961 tgggccttag ggggctgtgc actaggatcg ggggacgccc aagctcagtg ccccttccct
1021 gagccatgcc tcccccaaca gctatacggg cagctcccca agttccagga cggagacctc
1081 accctgtacc agtccaatac catcctgcgt cacctgggcc gcacccttgg tgagtcttaa
1141 cctcaagtcc aggagtcagg caagcctctg cccccggagc ccttttgttt aaatcaggcc
1201 ccgcagccct ctggagggga ggaaactgag acccactgag gttacgtagt ttgccaagtc
1261 aagcctggtt gtttgcaatc cttgcctgtg ccaggctggc tcccaggtgt caggtgagct
1321 ctgagcacct ggtgtgtggc agtctctcat ccttccacgc acatcctctt cccctcctcc
1381 caggctgggg ctcatcagac agaaccccct tggcctggcc catccccaat gactgtgtgt
1441 gatcagcgcc cagtcaaccc ggcctgctcc cctccaccca accccagggc tctatgggaa
1501 ggaccagcag gaggcagccc tggtggacat ggtgaatgac ggcgtggagg acctccgctg
1561 caaatacatc tccctcatct acaccaacta tgtgtgagca tctgcaccag gttgggcact
1621 gggggctgaa caaagaaagg ggcttcttgt gccctcaccc cccttacccc tcaggtggct
1681 tgggctgacc ccttcttggg tcagggggca ggggctgggt cagg
//
LOCUS HUMGSTPI2 591 bp ds-DNA PRI 07-JUL-1990
DEFINITION Human glutathione S-transferase pi (GST-pi) gene, exon 6.
ACCESSION X08095 M33083
KEYWORDS glutathione S-transferase pi.
SEGMENT 2 of 3
SOURCE Human, cell line HPB-ALL, lymphoid DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 591)
AUTHORS Cowell,I.G.
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 591)
AUTHORS Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B.
TITLE The structure of the human glutathione S-transferase pi gene
JOURNAL Biochem. J. 255, 79-83 (1988)
STANDARD simple staff_entry
COMMENT See X08094 - X08096 for GST pi gene sequence; see X06547 for GST
mRNA sequence.
EMBL features not translated to GenBank features:
key from to description
TRANSCR <1 >591 primary transcript
MSG 470 577 Exon 6
[1] Author address Cowell I.G., University College and Middlesex,
School of Medicine, Windeyer Building, Cleveland Street, London,
W1P 6DB, United Kingdom.
Submitted (13-JUL-1988) on tape to the EMBL data library by:
FEATURES from to/span description
pept + 470 + 577 glutathione S-transferase pi, exon 6
/hgml_locus_uid="LQ0085R" /map="11q13-qter"
/nomgen="GST3"
IVS < 1 469 GST3 intron E
IVS 578 > 591 GST3 intron F
BASE COUNT 131 a 111 c 225 g 124 t
ORIGIN About 300 bp after segment 1.
1 tgggaccagt cagcagaggc agcgtgtgtg cgcgtgcgtg tgcgtgtgtg tgcgtgtgtg
61 tgtgtacgct tgcatttgtg tcgggtgggt aaggagatag agatgggcgg gcagtaggcc
121 caggtcccga aggccttgaa cccactggtt tggagtctcc taagggcaat gggggccatt
181 gagaagtctg aacagggctg tgtctgaatg tgaggtctag aaggatcctc cagagaagcc
241 agctctaaag cttttgcaat catctggtga gagaacccag caaggatgga caggcagaat
301 ggaatagaga tgagttggca gctgaagtgg acaggatttg gtactagcct ggttgtgggg
361 agcaagcaga ggagaatctg ggactctggt ggtctggcct ggggcagacg ggggtgtctc
421 aggggctggg agggatgaga gtaggatgat acatggtggt gtctggcagg aggcgggcaa
481 ggatgactat gtgaaggcac tgcccgggca actgaagcct tttgagaccc tgctgtccca
541 gaaccaggga ggcaagacct tcattgtggg agaccaggtg agcatctggc c
//
LOCUS HUMGSTPI3 499 bp ds-DNA PRI 07-JUL-1990
DEFINITION Human glutathione S-transferase pi (GST-pi) gene, exon 7.
ACCESSION X08096 M33084
KEYWORDS glutathione S-transferase pi.
SEGMENT 3 of 3
SOURCE Human, cell line HPB-ALL, lymphoid DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 499)
AUTHORS Cowell,I.G.
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 499)
AUTHORS Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B.
TITLE The structure of the human glutathione S-transferase pi gene
JOURNAL Biochem. J. 255, 79-83 (1988)
STANDARD simple staff_entry
COMMENT See X08094 - X08096 for GST pi gene sequence; see X06547 for GST
mRNA sequence.
EMBL features not translated to GenBank features:
key from to description
TRANSCR <1 ? primary transcript
MSG 17 ? Exon 7
[1] Author address Cowell I.G., University College and Middlesex,
School of Medicine, Windeyer Building, Cleveland Street, London,
W1P 6DB, United Kingdom.
Submitted (13-JUL-1988) on tape to the EMBL data library by:
FEATURES from to/span description
pept + 17 205 glutathione S-transferase pi exon 7
/hgml_locus_uid="LQ0085R" /map="11q13-qter"
/nomgen="GST3"
IVS < 1 16 GST3 intron F
signal 260 265 poly-A signal
revision 127 128 tc in [1]; cg in [2]
BASE COUNT 114 a 140 c 141 g 104 t
ORIGIN About 200 bp after segment 2.
1 ctggcctgcc ctgcagatct ccttcgctga ctacaacctg ctggacttgc tgctgatcca
61 tgaggtccta gcccctggct gcctggatgc gttccccctg ctctcagcat atgtggggcg
121 cctcagcgcc cggcccaagc tcaaggcctt cctggcctcc cctgagtacg tgaacctccc
181 catcaatggc aacgggaaac agtgagggtt ggggggactc tgagcgggag gcagagtttg
241 ccttcctttc tccaggacca ataaatttct aagagaggct actatgagca ctgtgtttcc
301 tgggacgggc ttagggcgtt ctcagcctcg aggtcggtgg gagggcagag cagaggacta
361 gaaaacagct cctccagcca ccgtcagtgg cttctggagc cctcagcctg gctgtgttta
421 ctcaaccatc acaaactaga agaggaagaa aaaaaaagag agagagtaca agagaaataa
481 caatggtcct gtcctggag
//GenBank-Updates@genbank.bio.net (07/10/90)
LOCUS HUMCDR34 2412 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene,
complete cds.
ACCESSION M31423
KEYWORDS cerebellar-degeneration-related antigen.
SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2412)
AUTHORS Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A.,
Chaganti,R.S.K., Posner,J.B. and Old,L.J.
TITLE Cerebellar degeneration-related antigen: A highly conserved
neuroectodermal marker mapped to chromosomes X in human and mouse
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
Y.-T.Chen, 17-JAN-1990
FEATURES from to/span description
pept 503 1174 cerebellar-degeneration-related antigen
(CDR34)
BASE COUNT 743 a 334 c 669 g 666 t
ORIGIN
1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct
61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat
121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg
181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga
241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg
301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag
361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg
421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga
481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt
541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt
601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga
661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat
721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt
781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga
841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct
901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt
961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga
1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt
1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat
1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg
1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa
1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac
1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa
1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt
1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact
1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat
1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct
1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga
1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct
1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac
1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg
1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata
1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa
1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc
2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt
2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta
2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga
2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta
2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat
2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt
2401 gagatgctgc ag
//
LOCUS RATHGF 2485 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Rat hepatocyte growth factor mRNA, complete cds.
ACCESSION M32987
KEYWORDS hepatocyte growth factor.
SOURCE Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2485)
AUTHORS Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M.,
Shimizu,S. and Nakamura,T.
TITLE Deduced primary structure of rat hepatocyte growth factor and
expression of the mRNA in rat tissues
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by T.Nakamura, 26-MAR-1990.
FEATURES from to/span description
pept 143 2329 hepatocyte growth factor
BASE COUNT 750 a 533 c 577 g 625 t
ORIGIN
1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc
61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga
121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct
181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca
241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa
301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa
361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc
421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt
481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa
541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg
601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct
661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag
721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat
781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg
841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga
901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta
961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc
1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg
1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca
1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa
1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat
1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta
1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg
1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc
1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg
1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga
1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca
1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa
1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc
1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat
1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct
1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct
1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac
1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg
2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt
2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga
2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt
2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc
2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa
2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa
2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca
2461 ttaatatatg tggcgttttc tgttg
//
LOCUS HUMINSGS 351 bp ds-DNA SYN 10-JUL-1990
DEFINITION Human (synthetic) insulin gene, complete cds.
ACCESSION J02547 M25881
KEYWORDS artificial gene; insulin.
SOURCE Synthetic human DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 79 to 351)
AUTHORS Brousseau,R., Scarpulla,R., Sung,W., Hsiung,H.M., Narang,S.A. and
Wu,R.
TITLE Synthesis of a human insulin gene: V. Enzymatic assembly, cloning
and characterization of the human proinsulin DNA
JOURNAL Gene 17, 279-289 (1982)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 351)
AUTHORS Georges,F., Brousseau,R., Michniewicz,J., Prefontaine,G.,
Stawinski,J., Sung,W., Wu,R. and Narang,S.A.
TITLE Synthesis of a human insulin gene: VII. Synthesis of
preproinsulin-like human DNA, its cloning and expression in M13
bacteriophage
JOURNAL Gene 27, 201-211 (1984)
STANDARD full staff_review
REFERENCE 3 (bases 1 to 351)
AUTHORS Narang,S.A., Brousseau,R., Georges,F., Michniewicz,J.,
Prefontaine,G., Stawinski,J. and Sung,W.
TITLE The human preproinsulin gene: synthesis, cloning, gene
modification, and expression studies
JOURNAL Can. J. Biochem. 62, 209-216 (1984)
STANDARD full staff_review
COMMENT In places where the human insulin amino acid sequence is identical
to the rat insulin amino acid sequence, the synthetic sequence
follows the published nucleotide sequence for rat (see separate
entry).
FEATURES from to/span description
pept 6 350 synthetic preproinsulin
sigp 6 77 synthetic insulin signal peptide
matp 90 179 synthetic insulin B-chain
matp 186 278 synthetic insulin C-chain
matp 285 347 synthetic insulin A-chain
BASE COUNT 65 a 93 c 100 g 93 t
ORIGIN 78 bp upstream of EcoRI site.
1 aattcatggg cctatggatc cgtctactgc ctctgatcgc gctgctgatc ctctggggac
61 cggatccagc tgcggccgaa ttccggatgt ttgtcaatca gcacctttgt ggttctcacc
121 tggtggaggc tctgtacctg gtgtgtgggg aacgtggttt cttctacaca cccaagaccc
181 gtcgtgaagc tgaagacctt caagtgggtc aagttgaact tggtgggggt cctggtgcgg
241 gttctcttca acctttggct ctcgagggat cacttcaaaa gcgtggcatt gtggagcagt
301 gctgcaccag catctgctcc ctctaccaac tggagaacta ctgcaactga g
//
LOCUS TRFRRECF 212 bp ss-rRNA RNA 10-JUL-1990
DEFINITION Trypanosomatid (C.fasciculata) small rRNA e from the large
ribosomal subunit.
ACCESSION K02691 M25882
KEYWORDS ribosomal RNA.
SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA.
ORGANISM Crithidia fasciculata
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W.
TITLE Primary structures of four novel small ribosomal RNAs from
Crithidia fasciculata
JOURNAL Can. J. Biochem. 61, 38-45 (1983)
STANDARD full staff_review
COMMENT The large subunit of the ribosome of C.fasciculata contains six
small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i)
are found in ribosomes of other organisms. rRNAs e,f,g, and j were
analyzed by [1]. In rRNA e the number of "g" residues at positions
91-93 and 123-124 were ambiguous. At positions 77-81 three "c"s
and two "t"s were found, but the order was unclear. Positions 116
and 119 gave strong "t" bands but also consistently gave weak bands
in the "u-2" track. [1] is not sure of the reason: sequencing
artifact or an indication of cistron heterogeneity. No evidence of
this heterogeneity was found in chemical gels.
FEATURES from to/span description
rRNA 1 212 ribosomal RNA e
modified 125 125 p (putative)
BASE COUNT 46 a 53 c 51 g 62 t
ORIGIN 5' end of mature rRNA e.
1 tagtggaaat gcgaaacact tgccaggtga caaatcaatc ctcccacggt gagctttctt
61 ttcaccataa tccacatctc cggctttgct gggcttgggc ctttttactt ctcgcgttgt
121 tcggtgcggg ggcccaagat tgaaaaatgc agctctccct acgtactgtc attgttgtga
181 gttctgcgca ttaaagcaaa aacctggggt gt
//
LOCUS TRFRRFCF 183 bp ss-rRNA RNA 10-JUL-1990
DEFINITION Trypanosomatid (C.fasciculata) small rRNA f from the large
ribosomal subunit.
ACCESSION K02692 M25883
KEYWORDS ribosomal RNA.
SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA.
ORGANISM Crithidia fasciculata
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 183)
AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W.
TITLE Primary structures of four novel small ribosomal RNAs from
Crithidia fasciculata
JOURNAL Can. J. Biochem. 61, 38-45 (1983)
STANDARD full staff_review
COMMENT The large subunit of the ribosome of C.fasciculata contains six
small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i)
are found in ribosomes of other organisms. rRNAs e,f,g, and j are
reported by [1].
FEATURES from to/span description
rRNA 1 183 ribosomal RNA f
BASE COUNT 41 a 49 c 57 g 36 t
ORIGIN 5' end of mature rRNA f.
1 gtgagattgt gaagggatct cgcaggcatc gtgagggaag tatggggtag tacgagagga
61 actcccatgc cgtgcctcta gtttctgggg tttgtcgaac ggcaagtgcc ccgaagccat
121 cgcacggtgg ttctcggctg aacgcctcta agccagaagc caatcccaag accagatgcc
181 ccc
//
LOCUS TRFRRGCF 136 bp ss-rRNA RNA 10-JUL-1990
DEFINITION Trypanosomatid (C.fasciculata) small rRNA g from the large
ribosomal subunit.
ACCESSION K02693 M25884
KEYWORDS ribosomal RNA.
SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA.
ORGANISM Crithidia fasciculata
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 136)
AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W.
TITLE Primary structures of four novel small ribosomal RNAs from
Crithidia fasciculata
JOURNAL Can. J. Biochem. 61, 38-45 (1983)
STANDARD full staff_review
COMMENT The large subunit of the ribosome of C.fasciculata contains six
small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i)
are found in the ribosomes of other organisms. rRNAs e,f,g, and j
are reported by [1]. There was some question whether rRNA g
contained 135 or 136 bp, starting with base 1 or 2 in the sequence
presented below.
FEATURES from to/span description
rRNA 1 136 ribosomal RNA g
BASE COUNT 31 a 37 c 40 g 28 t
ORIGIN 5' end of mature rRNA g.
1 acaacgtccc tctccaaacg agagaatatg catgggctgg catgagcggc atgcttcact
61 ccggtggggc tcgaggggca cttacgtccc gaggcgctga accttgaggc ctgaaatttc
121 atgctctggg actaaa
//
LOCUS TRFRRJCF 73 bp ss-rRNA RNA 10-JUL-1990
DEFINITION Trypanosomatid (C.fasciculata) small rRNA j from the large
ribosomal subunit.
ACCESSION K02694 M25885
KEYWORDS ribosomal RNA.
SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA.
ORGANISM Crithidia fasciculata
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 73)
AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W.
TITLE Primary structures of four novel small ribosomal RNAs from
Crithidia fasciculata
JOURNAL Can. J. Biochem. 61, 38-45 (1983)
STANDARD full staff_review
COMMENT The large subunit of the ribosome of C.fasciculata contains six
small rRNAs (designated e,f,g,h,i,j) when normally only two (h,i)
are found in the ribosomes of other organisms. rRNAs e,f,g, and j
are reported by [1]. There was some question whether rRNA j
contained 72 or 73 bp, starting with base 1 or 2 in the sequence
presented below.
FEATURES from to/span description
rRNA 1 73 ribosomal RNA j
BASE COUNT 17 a 23 c 14 g 19 t
ORIGIN 5' end of mature rRNA j.
1 tcatcgaatc gccacctaca cgactggagc ttgctccctc gtcggcctct agtatattca
61 tgatcacaag gta
//
LOCUS YSCRGEA 1798 bp ds-DNA PLN 10-JUL-1990
DEFINITION Yeast (S.cerevisiae) 18S ribosomal RNA gene.
ACCESSION J01353 M27607
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE Yeast (S.cerevisiae + D4) DNA, clones pY1rA3 and prYC.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1798)
AUTHORS Rubtsov,P.M., Musakhanov,M.M., Zakharyev,V.M., Krayev,A.S.,
Skryabin,K.G. and Bayev,A.A.
TITLE The structure of the yeast ribosomal RNA genes. I. The complete
nucleotide sequence of the 18S ribosomal RNA gene from
Saccharomyces cerevisiae
JOURNAL Nucleic Acids Res. 8, 5779-5794 (1980)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1798; revises [1])
AUTHORS Mankin,A.S., Skryabin,K.G. and Rubtsov,P.M.
TITLE Identification of ten additional nucleotides in the primary
structure of yeast 18S rRNA
JOURNAL Gene 44, 143-143 (1986)
STANDARD full staff_review
FEATURES from to/span description
rRNA 1 1798 18S ribosomal RNA
revision 943 943 a in [2]; g in [1]
revision 962 962 a in [2]; g in [1]
revision 982 983 ag in [2]; ga in [1]
revision 988 999 tcgaagatgatc in [2]; tc in [1]
revision 1002 1002 g in [2]; a in [1]
revision 1122 1123 ag in [2]; aag in [1]
revision 1742 1742 a in [2]; g in [1]
BASE COUNT 480 a 348 c 459 g 511 t
ORIGIN 9 bp upstream of Sau3A site.
1 tatctggttg atcctgccag tagtcatatg cttgtctcaa agattaagcc atgcatgtct
61 aagtataagc aatttataca gtgaaactgc gaatggctca ttaaatcagt tatcgtttat
121 ttgatagttc ctttactaca tggtataacc gtggtaattc tagagctaat acatgcttaa
181 aatctcgacc ctttggaaga gatgtattta ttagataaaa aatcaatgtc ttcggactct
241 ttgatgattc ataataactt ttcgaatcgc atggccttgt gctggcgatg gttcattcaa
301 atttctgccc tatcaacttt cgatggtagg atagtggcct accatggttt caacgggtaa
361 cggggaataa gggttcgatt ccggagaggg agcctgagaa acggctacca catccaagga
421 aggcagcagg cgcgcaaatt acccaatcct aattcaggga ggtagtgaca ataaataacg
481 atacagggcc cattcgggtc ttgtaattgg aatgagtaca atgtaaatac cttaacgagg
541 aacaattgga gggcaagtct ggtgccagca gccgcggtaa ttccagctcc aatagcgtat
601 attaaagttg ttgcagttaa aaagctcgta gttgaacttt gggcccggtt ggccggtccg
661 attttttcgt gtactggatt tccaacgggg cctttccttc tggctaacct tgagtccttg
721 tggctcttgg cgaaccagga cttttacttt gaaaaaatta gagtgttcaa agcaggcgta
781 ttgctcgaat atattagcat ggaataatag aataggacgt ttggttctat tttgttggtt
841 tctaggacca tcgtaatgat taatagggac ggtcgggggc atcggtattc aattgtcgag
901 gtgaaattct tggatttatt gaagactaac tactgcgaaa gcatttgcca aggacgtttt
961 cattaatcaa gaacgaaagt taggggatcg aagatgatct ggtaccgtcg tagtcttaac
1021 cataaactat gccgactaga tcgggtggtg tttttttaat gacccactcg gtaccttacg
1081 agaaatcaaa gtctttgggt tctgggggga gtatggtcgc aaggctgaaa cttaaaggaa
1141 ttgacggaag ggcaccacta ggagtggagc ctgcggctaa tttgactcaa cacggggaaa
1201 ctcaccaggt ccagacacaa taaggattga cagattgaga gctctttctt gattttgtgg
1261 gtggtggtgc atggccgttt ctcagttggt ggagtgattt gtctgcttaa ttgcgataac
1321 gaacgagacc ttaacctact aaatagtggt gctagcattt gctggttatc cacttcttag
1381 agggactatc ggtttcaagc cgatggaagt ttgaggcaat aacaggtctg tgatgccctt
1441 agaacgttct gggccgcacg cgcgctacac tgacggagcc agcgagtcta accttggccg
1501 agaggtcttg gtaatcttgt gaaactccgt cgtgctgggg atagagcatt gtaattattg
1561 ctcttcaacg aggaattcct agtaagcgca agtcatcagc ttgcgttgat tacgtccctg
1621 ccctttgtac acaccgcccg tcgctagtac cgattgaatg gcttagtgag gcctcaggat
1681 ctgcttagag aagggggcaa ctccatctca gagcggagaa tttggacaaa cttggtcatt
1741 tagaggaact aaaagtcgta acaaggtttc cgtaggtgaa cctgcggaag gatcatta
//
LOCUS DROSHA1A 1473 bp ss-mRNA INV 10-JUL-1990
DEFINITION D.melanogaster Sha12 protein mRNA, complete cds.
ACCESSION M32660
KEYWORDS .
SOURCE D.melanogaster, cDNA to mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1473)
AUTHORS Butler,A., Wei,A. and Salkoff,L.
TITLE Shal, Shab, and Shaw: Three genes encoding potassium channels in
Drosophila
JOURNAL Nucleic Acids Res. 18, 2173-2174 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1473)
AUTHORS Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L.
TITLE Diverse K+ currents expressed by a Drosophila extended gene family
which is conserved in mouse
JOURNAL Science 248, 599-603 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Salkoff, 07-MAR-1990.
FEATURES from to/span description
pept 1 1473 Sha12 protein
BASE COUNT 302 a 434 c 432 g 305 t
ORIGIN Chromosome 3 left arm at locus 76B.
1 atggcctcgg tcgccgcttg gctgcccttc gcccgggcgg cggccatcgg gtgggtgccg
61 atagccaccc acccactgcc accgcccccg atgcccaagg atcgccgcaa aacggacgac
121 gagaagctcc tgatcaacgt ctccgggcgg cgcttcgaga cgtggcggaa tactttggag
181 aagtatccgg acaccctttt aggttccaat gaaagggagt tcttctacga cgaggactgc
241 aaagaatact tcttcgatcg ggacccggac atcttccggc acatactgaa ctactaccgg
301 acgggcaagc tgcactaccc gaagcacgaa tgcctcacca gctacgacga ggagctggcc
361 ttctttggaa taatgccgga tgtcattggc gattgctgct acgaggacta ccgggaccgg
421 aagcgggaga acgcggagcg gctgatggac gacaagctgt cggagaacgg ggatcagaat
481 ctgcagcagc tgaccaacat gcgccagaag atgtggcggg ccttcgagaa tccgcacacg
541 tcgacgagcg ccctggtgtt ctactatgtt acgggtttct tcatcgccgt ctccgtgatg
601 gccaacgtgg tggagacggt gccgtgtggc caccggccgg gcagagcggg aactctgccc
661 tgcggcgagc gctacaagat cgtcttcttc tgcctggata ccgcctgcgt gatgatcttt
721 acggcggagt acctacttcg actcttcgcc gcccccgatc gctgcaagtt cgtgcgctcg
781 gtgatgagca ttattgatgt ggtggccatt atgccgtact acattggcct cgggatcacc
841 gacaacgacg acgtgagcgg tgctttcgtc acgctgcgcg tgttccgtgt cttccgcata
901 ttcaagttct cgcgccactc gcaaggactt cggatcctcg gctacacgct caagtcctgc
961 gccagcgaac tgggcttcct tgtcttctcg ctggccatgg ccattatcat ctttgccacc
1021 gtcatgttct acgccgagaa gaacgtcaat ggcaccaact tcacatcgat tccggcggcc
1081 ttctggtata ccatcgtcac aatgacgacg ctgggatatg gcgacatggt gccagagaca
1141 atagctggca aaattgtggg cggcgtctgc tcgcttagcg gtgtgctggt catcgcctta
1201 cctgtacctg ttatcgtatc gaactttagt agaatctatc accagaacca gcgagcggac
1261 aagcgcaagg cgcagcggaa agctcgcctg gcgcgcatcc gcattgccaa ggcctcgtcc
1321 ggagccgcct ttgttagcaa gaagaaggcc gccgaggccc ggtgggctgc ccaggagtcg
1381 ggcatcgagc tggatgacaa ctatcgggac gaggacatct tcgagctgca gcaccatcat
1441 ttgctgcgat gtctggagaa gacaacgatg tag
//
LOCUS DROSHABA 2778 bp ss-mRNA INV 10-JUL-1990
DEFINITION D.melanogaster Shab11 protein mRNA, complete cds.
ACCESSION M32659
KEYWORDS .
SOURCE D.melanogaster, cDNA to mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2778)
AUTHORS Butler,A., Wei,A. and Salkoff,L.
TITLE Shal, Shab, and Shaw: Three genes encoding potassium channels in
Drosophila
JOURNAL Nucleic Acids Res. 18, 2173-2174 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2778)
AUTHORS Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L.
TITLE Diverse K+ currents expressed by a Drosophila extended gene family
which is conserved in mouse
JOURNAL Science 248, 599-603 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Salkoff, 07-MAR-1990.
FEATURES from to/span description
pept 1 2775 Shab11 protein
BASE COUNT 679 a 784 c 788 g 527 t
ORIGIN
1 atggtcgggc aattgcaagg tggacaggct gctggccagc aacagcaaca gcaacaagcg
61 actcagcaac agcaacactc gaagcagcag ctgcaacagc agcagcagca acagcagcaa
121 ctgcaactca agcagcatca gcagcagcaa caggacatcc tgtatcagca acataacgag
181 gcaattgcaa ttgcacgcgg actgcaggct gcaacacctg ccgacatcgg cgataatcag
241 ccgtactacg atacaagcgg taatgtcgat tgggagcggg cgatgggagc cggtggagct
301 ggtgcatatg gtggcatcgg catcggatct ctaccagcag ctggcggtgc tgcttatcac
361 cttgggccag ctaatcccgc aggcctcgtt tctcgtcact tggattacgg tgatggcggc
421 caccttgctg gcccatccgc cggtcttcct gctggagctg tgggatcagg agcaggagcg
481 ggagccggtg cgggagcatc agtcacggga tcaggatcag gagcagggac aggaacagga
541 accggagccg gatctggatc gggcagtgga gcagcaggca aggaagttcg ctacgcccct
601 ttcccagtcg catcaccaac gcactcgatt cccacaacct cccagcagat cgttggcggc
661 gtcggtggcg tgggcgtcgg tggtgccagc agccagtcga tttcgggcgg tgtacccacc
721 cacagccaga gcaacaccac cggcgctctg cagcggacac attccagatc catgtcctcc
781 ataccgccgc ccgagccgtt catgatagcc cagtcgaagg cggtcaacag ccgcgtgtcc
841 atcaacgtgg gcggggtgag gcacgaggtc ctgtggagga cgctggagcg gctgccccac
901 acgcggctcg ggcggctggg ggagtgcacc acccacgagg ccatcgtgga gctgtgcgac
961 gactactcgc tggcggacaa cgagtacttc ttcgaccgac atccgaagag cttcagctcc
1021 atcctgaact tctatcgcac cggcaagctg cacatcgtcg acgagatgtg cgtgctcgcg
1081 tttggtgatg acctggagta ctggggcgtc gacgaactgt acctggagtc ctgctgccag
1141 cacaagtacc accagcgcaa ggagaacgtt cacgaggaga tgcgtaagga ggccgagtcc
1201 ctgcggcagc gcgacgagga ggaattcggc gaaggtaaat tctccgagta ccagaagtat
1261 ctgtgggagc tcctcgagaa gcctaacact agtttcgccg cccgggttat cgcagtgata
1321 tccatactat tcatagtcct gtctaccata gccctgacgt tgaacaccct accacaacta
1381 caacacattg acaacggtac accacaggat aatccgcaat tggcaatggt tgaggccgtg
1441 tgtatcacgt ggttcactct agagtacata cttaggttta gctcctcgcc ggacaagtgg
1501 aagttcttta agggcggcct taacataatc gatctattgg caatactccc atactttgtt
1561 tcgttatttc tattggaaac gaataagaat gcaacggacc agttccagga tgtgcgtcgg
1621 gtggtgcagg tctttcgcat catgcgcatc ctgcgggtcc ttaagctggc ccgtcactca
1681 acgggcctgc agtcgttagg ctttacgctg cgtaactcat ataaggaact cggtctacta
1741 atgctgttcc tggccatggg cgttctcata ttttcttcgc tggcatattt tgccgaaaag
1801 gatgaaaagg atacaaaatt cgtttcaata ccggaagcat tttggtgggc gggtattaca
1861 atgacaactg ttggctacgg ggacatctgt cccacaactg cactgggaaa ggttattggt
1921 actgtgtgtt gcatatgcgg tgttctggtg gtcgctttgc ctattcccat catcgttaac
1981 aattttgctg aattttataa gaatcagatg cgccgcgaaa aggccctcaa gcgtcgcgag
2041 gcactcgatc gtgccaagcg cgagggcagc attgtctcct tccatcatat caatctgaaa
2101 gatgccttcg ccaagtccat ggatctcatc gatgtgattg tcgacacagg aaagcaaaca
2161 aatgtcgtgc atccgaaggg taaaagacaa agcaccccca atataggcag gcagaccctc
2221 gatgtgcaaa gcgccccagg ccacaatctc tcgcaaacgg acggcaacag caccgaaggc
2281 gagtctacca gcggacgcaa tccggccacc accggaaccg gatgctataa gaattacgac
2341 cacgtagcca acctgcgcaa ctccaacctg cacaaccgac gcggatccag ctctgagcag
2401 gatgcagtgc cgccctacag cttcgacaat cccaatgccc gccagacctc aatgatggcc
2461 atggagagct atcggcgcga cgaacaggca ctgctgcagc aacagcaaca gcagcagcaa
2521 cagatgttgc agatgcaaca gattcagcag aaggccccga acggaaatgg aggtgcaacc
2581 ggaggaggag tggccaacaa cctggccatg gtggccgcat caagtgccgg aacagccgtg
2641 gccaccgcca ccaatgccag taatgccagc aataccgccc ccgggtcaga gggcgccgag
2701 ggaggcgtga tggagatggg ggcggtgtcg atgacgacaa cctttcccag gccaagggac
2761 tgcccatcca gatgatga
//
LOCUS DROSHAWA 1497 bp ss-mRNA INV 10-JUL-1990
DEFINITION D.melanogaster Shaw2 protein mRNA, complete cds.
ACCESSION M32661
KEYWORDS .
SOURCE D.melanogaster, cDNA to mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1497)
AUTHORS Butler,A., Wei,A. and Salkoff,L.
TITLE Shal, Shab, and Shaw: Three genes encoding potassium channels in
Drosophila
JOURNAL Nucleic Acids Res. 18, 2173-2174 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1497)
AUTHORS Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L.
TITLE Diverse K+ currents expressed by a Drosophila extended gene family
which is conserved in mouse
JOURNAL Science 248, 599-603 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Salkoff, 07-MAR-1990.
FEATURES from to/span description
pept 1 1497 Shaw2 protein
BASE COUNT 344 a 426 c 390 g 337 t
ORIGIN Chromosome 2 left arm at locus 24B-C.
1 atgaatctga tcaacatgga ctcggaaaac agggtggtgc tcaatgtggg tggcattagg
61 cacgaaacct acaaggccac gctgaagaag attccggcta cgcgattatc gcgattaaca
121 gaggcgctgg ccaactatga tccgatactg aatgagtact tctttgatcg gcatccgggc
181 gtcttcgcac aagtgctcaa ctattacaga actggaaagc tgcattatcc cacggatgtg
241 tgcggtccgc tgtttgagga ggaattggag ttctggggcc tagactcgaa ccaagtggag
301 ccctgctgtt ggatgaccta cacacagcat cgcgacaccc aggaaaccct agccgtactc
361 gatcgtctcg atctggatac ggaaaaaccg tccgaagagg aattggcacg caaattcggc
421 ttcgaggagg actactacaa aggcacaata tcctggtggc aggaaatgaa gccgcgcatt
481 tggtccttgt tcgatgagcc ctacagttcc aatgcagcca agactattgg cgtggtttcg
541 gtgttcttca tctgcatttc gatcctgtcg ttctgcctga agacccatcc cgatatgcgg
601 gtgcccatcg tccggaacat tacagtgaaa actgcgaatg gaagtaatgg ctggtttttg
661 gacaaaacgc agaccaatgc gcacatagcc ttcttctata tcgaatgcgt gtgcaatgcc
721 tggtttacct ttgaaatatt ggtgcgcttt atctcatcgc cgaacaagtg ggagttcatc
781 aagtcatctg ttaacatcat agactacata gcgacgctta gtttttatat cgatctagtg
841 cttcagcggt tcgcatcgca cctggagaac gctgacatcc tcgagttctt ctcgatcatc
901 cgcatcatgc gtctgttcaa gctgacgcgc cactcgtccg gactgaagat cctgatccag
961 acgttccggg cctcggccaa ggagctgacc ctgctggtgt tcttcctcgt cctgggcatc
1021 gtgatcttcg ccagccttgt ctactacgcg gagcgcatcc agcccaatcc gcacaacgac
1081 ttcaacagca taccgctggg cctgtggtgg gccctggtca caatgaccac cgtcggctac
1141 ggcgacatgg cccccaaaac ctacattggc atgttcgtgg gtgccctctg cgccctggcc
1201 ggcgtactaa ccatcgcact gccagtgccc gtcatcgtca gcaacttcgc catgtactac
1261 tcgcacacgc aggccagggc caaactgcca aagaagcgga gacgagtgct tcccgtcgag
1321 cagccgcgcc agcccagact gccaggtgcc cctggtggtg tcagtggttg cggcaccccg
1381 ggctcgggtc cccactccgg tccgatggga tccggcggaa ctggaccacg tcgcatgaac
1441 aataaaacaa aggacctggt cagccccaag tcagatatgg ccttcagttt cgactaa
//
LOCUS SUVSATA 332 bp ss-RNA VRL 10-JUL-1990
DEFINITION Subterranean clover mottle virus satellite RNA (virusoid) sequence.
ACCESSION M33000
KEYWORDS .
SOURCE Subterranean clover mottle virus (isolated from Trifolium
subterraneum) satellite RNA.
ORGANISM Subterranean clover mottle virus
Viridae; ss-RNA nonenveloped viruses;
Velvet tobacco mottle virus group.
REFERENCE 1 (bases 1 to 332)
AUTHORS Davies,C., Haseloff,J. and Symons,R.H.
TITLE Structure, self-cleavage, and replication of two viroid-like
satellite RNAs (virusoids) of subterranean clover mottle virus
JOURNAL Virology 177, 216-224 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.H.Symons, 20-MAR-1990.
FEATURES from to/span description
site 62 63 self-cleavage site
site 1 120 high sequence homology with virusoid of
subterranean clover mottle virus
site 232 332 high sequence homology with virusoid of
subterranean clover mottle virus
BASE COUNT 77 a 93 c 80 g 82 t
ORIGIN
1 agaggcatac cctcctcgcg gattttgaag gtgttctagc tacccaagta ttccacgctg
61 tctgtacttg tatcagtaca ctgacgagtc cctaaaggac gaaacagcgc accgcaatct
121 acgtataccc cgattcgact tgcttggagc aagcgttcga cagagtgccg cgcctggaat
181 gacgcggttc tggccacact cacccgggag gccatcgggc ggattatact agttgtcaag
241 gacctgtcgt tagttctact atacattact acactacgtg ttacttgtta ggtggcccca
301 cctcactttc gtgaaggcta gagaacgtcc ac
//
LOCUS SUVSATB 388 bp ss-RNA VRL 10-JUL-1990
DEFINITION Subterranean clover mottle virus satellite RNA (virusoid) sequence.
ACCESSION M33001
KEYWORDS .
SOURCE Subterranean clover mottle virus (isolated from Trifolium
subterraneum) satellite RNA.
ORGANISM Subterranean clover mottle virus
Viridae; ss-RNA nonenveloped viruses;
Velvet tobacco mottle virus group.
REFERENCE 1 (bases 1 to 388)
AUTHORS Davies,C., Haseloff,J. and Symons,R.H.
TITLE Structure, self-cleavage, and replication of two viroid-like
satellite RNAs (virusoids) of subterranean clover mottle virus
JOURNAL Virology 177, 216-224 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.H.Symons, 20-MAR-1990.
FEATURES from to/span description
site 63 64 self-cleavage site
site 1 120 high sequence homology with virusoid of
subterranean clover mottle virus
site 286 388 high sequence homology with virusoid of
subterranean clover mottle virus
BASE COUNT 97 a 106 c 91 g 94 t
ORIGIN
1 agaggcatac cctcctcgcg gattttgaag gtgtttcagc tacccaaagt attccacgct
61 gtctgtactt atatcagtac actgacgagt ccctaaagga cgaaacagcg caccgcaact
121 tggccagacc tcgccaatca cccccacacc aagccaaaaa ccggtcccca acgcagttta
181 gtatcaagtc gtcgcatcca cgctcccgag ggaggaagtt tgcgccttga ggttctgcac
241 ggtcgtggta acaggaaaag tgttggaatg tttgaaggtc ttgcggttgt caaggaccaa
301 gtcgttagtg ttactatata ttactaccct acgtgttact ttgttaggtg gccccacctc
361 actttcgtga aggctaggaa acgtccac
//
LOCUS BOVCYP4SC 1073 bp ss-mRNA MAM 10-JUL-1990
DEFINITION Bovine cytochrome P450-scc mRNA fragment.
ACCESSION M25920
KEYWORDS cytochrome P450-scc.
SOURCE Bovine adrenal gland, cDNA to mRNA, clone pBA644.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1073)
AUTHORS Chung,B.-C., Matteson,K.J., Morin,J.E., Mellon,S.H. and Miller,W.L.
TITLE An approach to the molecular biology of congenital adrenal
hyperplasia
JOURNAL Ann. N.Y. Acad. Sci. 458, 238-251 (1985)
STANDARD simple staff_entry
COMMENT The coding region for cytochrome P450-scc was not indicated in [1].
BASE COUNT 259 a 243 c 193 g 378 t
ORIGIN
1 taagtctgaa ttttgcaata aggaactcat gatttgaatt acagtcagct cccattcctg
61 tttttgctga ctatatagag ccttctccat ttttggctgc aaaacatata atcagtctga
121 tttggtattt atcattttgt gacataatgt gtaagagtgc ctcgtctgtt tggaaaaggt
181 agtttctatg accagtgtgt ctcttggcaa actctgttaa cctttgtctc accacttcat
241 tttgtattcc aaggcctttg tttctctgtt tctccaggta tctcttgact tcctactttt
301 accttccaat cctctaggat gaaaaggaca tctttttttt tttttttggt gtagttctag
361 aaggtcttca tagaaagggt caacttcaac ttcttaggca tcagtggtta gggcatatac
421 ttggattact gtaatgttaa atggtttgct ttggaaacta accaagatca ttctgttgct
481 tttgagattg cacccaaata ctgcattttg gactcttctg tttactatga ggactactcc
541 atttaatcta aaggattctt aggccacaat agtagatata atggtcatct gaattattat
601 aaatttatca attttcttcc attttagttc actgaattct aacttattga tgcttcattc
661 ttgccatctc ctgcttgacc atgtttttta ccttgattca tggacctgac attccaggtt
721 cctatgcaat attattctgt atagtgtcag acttactttc accaccagac atatccacaa
781 ctgtatatca tttccgtttt ggcccagctg cttcactttt tctggaacta ttcatatctg
841 ccctccactc tttcccaata gcatattgga cacattctcg aacacaggga gccgggggac
901 aggtgctggt ttcttctggc acacctgggg cagctgaaca cagtgttgac tggcagacac
961 agccccacac caaacgctcg ctaacactga cactgttccc gtgatggcca gggagccccc
1021 tccccaaaaa cctgctcctg gaagctggca ggatttgtgc cattcataag ggt
//
LOCUS BOVCYPC21 920 bp ss-mRNA MAM 10-JUL-1990
DEFINITION Bovine cytochrome P450-c21 mRNA fragment.
ACCESSION M25921
KEYWORDS cytochrome P450-c21.
SOURCE Bovine adrenal gland, cDNA to mRNA, clone pBA4.8.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 920)
AUTHORS Chung,B.-C., Matteson,K.J., Morin,J.E., Mellon,S.H. and Miller,W.L.
TITLE An approach to the molecular biology of congenital adrenal
hyperplasia
JOURNAL Ann. N.Y. Acad. Sci. 458, 238-251 (1985)
STANDARD simple staff_entry
COMMENT The coding region for cytochrome P450-c21 was not indicated in [1].
BASE COUNT 185 a 278 c 203 g 254 t
ORIGIN
1 gttcagatgc tgtgtcccat tgggaaagtt cagcaggtta ccagggccac ggcctcagtc
61 atcctcagaa tcgctgtccc tcttggcagg gacagagcac cgcaccgcag acagcagcac
121 gtcttccacg ggcttcttgg gattctcctc caggctcgtc ttgatggctc cagactcaga
181 gcaacttcca ctccaactcg tccaaagtca ggttcatgcc accaaacacc agaggtccgg
241 ataactgagc cttgatgtca ccttcaaggt acacaaatac cgtggcagat tcctatcagg
301 gtaactgggt atgcaggtgg ttgaaatggc tttgataaac ttgacatcag gaaacttcct
361 ggcgaggtgc actcaagtgc tgatttatca gggcacagag gggaatccct tgtttgtaaa
421 ggtgcaggat gacccataag ccctcaccag ctttggtaac ttcttgaaca taatcctttc
481 cagagatttc caaaacctct ccaaatttgt tcttcagttg ggtcgctttc cattcggcca
541 gcctttgctg cctgtacatt tcaattgcac gttcgtcttc ctcattaaat tcgtcttcat
601 tatcctccag ttcttccaaa gtcatgtctt catatgtttt cacaatggac tgctggagga
661 tccgctgctc ctcttcttct gcctccttct ccagatcttt caaatcttcc tttgaaggca
721 agatgccttt tttgcgtaag atgtcattcc actcggtgtc tgcgttgggg tcctgcattt
781 tctgtcaaat cgctagggcc ctgccggcca cagccacccg gcccgtgagc tctctaccgc
841 gcacgcaggc gccactcgcc tcctctccca gcctgccctg agatctcgtc cgcccgttgg
901 ccctccttct cttggcgccg
//
LOCUS MUSINT4 3000 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse proto-oncogene Wnt-4 protein mRNA, complete cds.
ACCESSION M32502
KEYWORDS Wnt protein; proto-oncogene.
SOURCE Mouse (strain BALB/c) 8.5 day old embryo, cDNA to mRNA, (library of
B.Hogan).
REFERENCE 1 (bases 1 to 3000)
AUTHORS Roelink,H., Wagenaar,E., Lopes da Silva,S. and Nusse,R.
TITLE Wnt-3, a gene activated by proviral insertion in mouse mammary
tumors, is homologous to int-1/Wnt-1 and is normally expressed in
mouse embryos and adult brain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 4519, 4523 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
H. Roelink, 03-MAR-1990.
FEATURES from to/span description
pept 46 1113 Wnt-4 protein
BASE COUNT 703 a 789 c 787 g 721 t
ORIGIN Chromosome 11.
1 cctcttcatg atcgccggca aacttcctcc tcggcgctgc ttctaatgga gccccacctg
61 ctcgggctgc tactcggcct cctgctcagt ggcaccaggg tcctcgctgg ctacccaatt
121 tggtggtccc tggccctggg ccagcagtac acatctctgg cctcccagcc tctgctctgc
181 ggctccatcc caggcctggt ccccaagcaa ctgcgcttct gccgcaatta catcgagatc
241 atgcccagcg tagcagaagg tgtgaagctg ggcatccagg agtgccagca tcagttccgg
301 ggccgccggt ggaactgtac caccatagat gacagcctgg ccatctttgg gcctgtcttg
361 gacaaagcca cccgtgaatc ggccttcgtg catgccatcg cctcggctgg tgtcgccttc
421 gcagtcacac gctcctgcgc tgagggaacc tccaccatct gcggctgtga ctcacatcat
481 aaggggccac ctggagaagg ctggaagtgg ggcggctgca gcgaggacgc cgacttcggg
541 gtgctggtgt cccgggaatt tgcggatgcg cgggagaaca ggccagatgc ccgctcagct
601 atgaacaagc acaacaatga agcaggccga acgaccatcc tggaccacat gcacctaaag
661 tgtaaatgcc acgggttgtc cggcagctgc gaggtgaaga cctgctggtg ggcccagccc
721 gacttccgtg ccattggcga cttcctcaag gacaagtacg acagtgcctc cgagatggtg
781 gtggagaaac accgtgagtc ccgaggctgg gtggagaccc tgcgggctaa gtacgcgctc
841 ttcaagccac ccaccgagag ggacctggtc tactacgaga actcccccaa cttttgtgag
901 cccaacccag agacgggctc ctttggtacc agggaccgga cttgcaatgt cacctcccac
961 ggcatcgatg gctgcgatct gctgtgctgt ggccggggcc acaacacgag gacggagaaa
1021 cggaaggaga aatgccattg cgtcttccac tggtgctgct atgtcagctg ccaagagtgt
1081 attcgcatct acgatgtgca cacctgcaag tagtgagcca gggcactggg aaggggtaga
1141 ttgtgcggct ggatccattc atcgaagtcc catgagaagc aggatctaga tccaggccag
1201 ccttcggcac tggccagcaa ggagcatgga ctgttgccag ctgcatgtga taaacgacct
1261 ggacccagcc ggcctcggac ggacgggcgg cttctttctc aactaacgtc tctccccctg
1321 ctctggatgg tgtacggctt tacagagggg ctttctttat ggttttacca gggtctgctg
1381 gggacagact cgaggcttac ctttgcacat gttaaagaaa ataaaaatga aaaaaaaaaa
1441 tctaccgcaa cagaacaggc tgggctagtg tgagctcttg gcctggtggg aaggacaaga
1501 ccatggcgag attctgtgtc caagctgcct ctactcgtga cattccaaga tgcctctgag
1561 gtgggaactg tgaagtagga cagagccccg cagtcccctc ttgtccgtcg actcccattt
1621 aaattggaca taccttgtcg ttctgagaaa agccatagat aggtgtagct gggatgtagt
1681 gatggggagg cccctggcca acagtgggag caagatcttg agttttgaag acctcagagt
1741 tctgggcggc ctgggaagcc atctgcagaa cagagttcct tgtgggctcc tgttttcgct
1801 agccctgttc tgccctggag cgacagtcag atctccacgc ccctttctgt tgttctacag
1861 tgtccacctt tactacgcgt tttttttttt tttttcatga tgaccttgta aataggtcag
1921 atgtggaggc aggtctcttc tggctccatc caccacaccc agaaagaatg ggctgctctg
1981 cccttctcag ccttgctaac cagcagacac cgaggagagc agcggggcac cttagagagc
2041 aatctaaaca tggttggcag gtggggaggg taaagagtcc cacttccttt gtgttagaag
2101 gcagactacc ctgcgtcctt ttctcccatt ggctgaagta accagaaaga caagagatcc
2161 ttaacaagcc cttcttccca cttgtaaaag ggatagccta tctcagttcc caaggatctg
2221 gattagatag atattcaaaa gaggcaagca gcgaatggag gcagctccca gctctgttcc
2281 cgacgcatga tggtactggc tgggtttagt aaggtgggtg gggctgcacg gatcaatcca
2341 tcaactccgt cttaaggaga atcagaaaga ggagataaaa tgggggaatg gggcagaaca
2401 aagaatttgt cctttcccgc ttctgtctag ggtctgctaa tgctggcttg acgaggggtc
2461 agccacttct ttcctgttgt gcagttggct tgccaagcag gctccagtag gcccttgcct
2521 gcactctcta ccatgtgacc atgagcactg ctctagggac acctcccatc ccttcctagc
2581 accccaaatg ccccttccca tctctccttc cagaagttgg aaatcaagtc aactggataa
2641 cgcttgtgtg agacacttga gcagaacgga tacaacaatt tacaagtctc ttcatatcta
2701 tgtattctat attaaaagtg ataaagtcat gtttccgggg cgtattcaag tagctgacaa
2761 gtaattattt aataatagta catgagcgca ttgtaattat cctcgccata gtcaggtaat
2821 agcatccaat gggaggtccc taccaacctg ctgtatccaa agttttgtaa aaagttgtag
2881 aagttgttga tctttttgat tttatattca aaaagtctct ttttataaat attatttatt
2941 atacaatgta tatacctttg agttaactaa gattatatat tatataaata tatatatatt
//
LOCUS DRONCDA 2294 bp ss-mRNA INV 10-JUL-1990
DEFINITION D.melanogaster non-claret disjunctional protein (ncd) mRNA,
complete cds.
ACCESSION M33932
KEYWORDS non-claret disjunctional protein.
SOURCE D.melanogaster (strain dp cl cn bw) 0-4 hr embryo, cDNA to mRNA,
clone pNB40.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2294)
AUTHORS Mcdonald,H.B. and Goldstein,L.S.B.
TITLE Identification and characterization of a gene encoding a
kinesin-like protein in Drosophila
JOURNAL Cell 61, 991-1000 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.B.Mcdonald, 30-APR-1990.
FEATURES from to/span description
pept 111 2168 non-claret disjunctional protein
site 180 319 alpha helical domain
BASE COUNT 622 a 661 c 603 g 408 t
ORIGIN 1 bp upstream of EcoRI site; chromosome 3 map position 99BC.
1 gaattgataa aatcggttgc aaggaggcag acgtatcttc taagttaggc acaacacagt
61 tggcgatgga atcccggcta ccgaaaccgt cgggcctgaa gaaaccccaa atgccgatta
121 aaaccgtgct gcccacagat cgaattcgcg caggattggg aggtggagcc gctggagcag
181 gcgccttcaa tgtcaatgcc aaccagacat actgcggcaa cttattgccg cccctctcaa
241 gggacctcaa caatctgccc caggtgctgg agcgtcgcgg aggaggagca cgtgccgcct
301 ccccagagcc catgaagttg ggccaccggg ccaagctgag acgtagccgt agcgcttgcg
361 acatcaacga actgcgtggt aacaagcgca ctgcggctgc tccttcattg cccagcattc
421 ccagcaaagt atcccgcctg ggcggtgcac tcactgtttc cagccagcga ctagtgcgtc
481 ctgcggcgcc ttcgtcaata acagcaacag ctgtcaaaag accaccagta acgcgtcctg
541 ctccacgggc tgcaggagga gcagccgcca agaaaccagc aggaacagga gcagcagctt
601 cgtcaggagc cgcggctgct gctcccaagc gcatcgctcc ctacgacttc aaggcccgct
661 tccacgatct gctagagaag cacaaggtgc ttaagacaaa gtacgaaaag caaacagagg
721 acatgggcga gctggagtcc atgcctcagc aactggagga gacgcagaac aagcttatcg
781 agacggagtc ctcgctgaag aacacccaga gcgacaacga gtgtcttcag aggcaggtga
841 agcagcatac cgccaaaatt gaaacaatca catcgacgct gggcaggacc aaagaggagc
901 tatccgagct gcaagcaata catgagaaag taaaaacgga gcatgctgct ctaagcacag
961 aagtggtgca tctgcgccag cgcaccgagg aactcctgcg ctgcaatgag cagcaggccg
1021 ccgagctgga gacctgcaaa gagcagctct tccagtcgaa catggagcgc aaagagctgc
1081 acaacacggt catggacctg cgcggcaaca tccgggtctt ctgtcgaata cgaccgccgc
1141 tggagtccga ggagaaccgt atgtgttgca cctggaccta tcacgacgag tccaccgtgg
1201 agctgcagag cattgacgca caggccaaaa gcaagatggg ccagcagatc ttctcattcg
1261 accaggtctt ccacccgctc tcctcgcagt cggacatctt cgagatggtc tcgccgctca
1321 tccagtcggc cctggatggc tacaatatct gcatctttgc ctacggacag acgggcagtg
1381 gcaagaccta cacaatggac ggagtgccgg agagtgtggg cgtcataccg cgcacggtgg
1441 atctgctctt cgactccatc cggggatatc gcaacttggg ctgggagtac gagatcaagg
1501 ccacctttct ggagatctac aacgaggtgc tctacgatct gctgagcaac gagcagaagg
1561 acatggagat tcgaatggcc aagaacaaca agaacgacat ctacgtgtcc aacataacgg
1621 aggagacggt tctggatcca aatcacctgc gccacctcat gcacacggcc aagatgaacc
1681 gtgccaccgc ctcgacagct ggcaacgagc gctcctctcg ttcccacgcg gttaccaagc
1741 ttgagctcat cggacgccat gccgaaaagc aagagatctc cgtgggttcc ataaacctgg
1801 tggatttggc cggctctgag tctcccaaga cgagcacccg gatgaccgag acaaagaaca
1861 tcaatcgctc gctatcggag ctcaccaacg taatcctggc gctgctgcag aagcaggacc
1921 acatcccgta caggaactcc aagctgacgc acctgctgat gccctcgctg ggcggcaact
1981 cgaaaacgct tatgttcatc aacgtctcgc cgttccaaga ctgtttccaa gagtccgtca
2041 agtcgctgcg cttcgcggcc tccgtaaact cctgcaaaat gaccaaggcc aagcggaatc
2101 gctacctgaa caactcggtg gccaacagca gcacacagag caacaacagc ggcagtttcg
2161 ataaataaag aatgcattct gagcccagtt ttaacaattt tcaaatttct aacctgttat
2221 tgcttaattt atgtgtgttt acttttagtg caaataaact aataaagtgc tggaaaaaaa
2281 aaaaaaaaaa aaaa
//
LOCUS YSCVPS1A 2457 bp ds-DNA PLN 10-JUL-1990
DEFINITION S.cerevisiae GTP-binding protein (VPS1) gene, complete cds.
ACCESSION M33315
KEYWORDS GTP-binding protein.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2457)
AUTHORS Rothman,J.H., Raymond,C.K., Gilbert,T., O'Hara,P.J. and
Stevens,T.H.
TITLE A putative GTP binding protein homologous to interferon-inducible
Mx proteins performs an essential function in yeast protein sorting
JOURNAL Cell 61, 1063-1074 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.O'Hara, 02-APR-1990.
FEATURES from to/span description
pept 318 2432 GTP-binding protein (VPS1)
BASE COUNT 812 a 454 c 496 g 695 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcgata gatacttgaa tcctctaata gtcgaaaaat gctcgagggt aaaccacttg
61 tgcgcttgga ctggcctagt ttccaaaacc aatgttctaa tggattgatt tcttccccaa
121 acattattaa gtggccgggt cacccaaaga cttgggcgcc gttgattcgc gtcgctttgc
181 catcaagaga acaacatatc ttccaagaca gaccgagata attcatctat ttactcctaa
241 aaaagaatta gagaggcctt ttatagcacc aaaataagga ccgtacgaaa actgcacatt
301 ttatattatc agatatcatg gatgagcatt taatttctac tattaacaag cttcaggacg
361 ctttggcgcc cttaggagga ggatctcaat ctcctattga tttaccacag atcaatgttg
421 tcggttccca gtcgtcagga aagtcgtccg ttttggagaa cattgttggt agggatttct
481 tgccaagagg tactggtatt gtcaccagga gacctttagt gttacaattg attaatagga
541 gaccaaaaaa gtcagaacat gctaaagtaa accaaactgc taatgaattg attgacttga
601 acatcaacga tgatgacaag aaaaaggatg aatcaggaaa gcaccaggaa gagggacaat
661 ctgaagacaa taaagaggaa tggggtgaat ttttgcattt acctggtaag aagttttata
721 attttgacga aattagaaag gaaatcgtca aagaaactga caaagtgaca ggtgccaatt
781 caggtatttc ttctgtgccc attaacttga gaatttattc tccgcatgtt cttactttga
841 cgttagtgga tttgcctggg ttgacgaagg ttcccgtagg tgaccaacct cctgatattg
901 aaagacaaat taaggacatg ttgttaaagt atatttcgaa accaaacgct atcatattat
961 ctgttaatgc cgctaacacc gatttagcca acagcgatgg tttgaagctg gctagagagg
1021 tcgatccaga aggaacgaga actattggtg tcttgacaaa agtcgatttg atggatcaag
1081 gtacagatgt catagatatt ttggctggaa gagtcattcc tttgagatat ggttatatcc
1141 cagttatcaa tagaggtcaa aaggatattg aacacaaaaa aacaatcaga gaagcccttg
1201 aaaacgaaag aaaatttttt gagaaccatc cctcttacag ttctaaagct cattactgtg
1261 gtacaccata tttggctaaa aagttaaact caatcttatt acaccacatt aggcaaactc
1321 tgccagaaat caaagcgaaa atcgaagcca cattgaaaaa atatcaaaac gaacttataa
1381 acttgggccc agaaactatg gattcagcta gttcggttgt tttgagcatg attactgatt
1441 tttccaatga atatgccggt atcttggacg gtgaggcgaa ggagctttcc agtcaggaac
1501 tttctggtgg tgctagaatt tcttacgtat tccatgaaac tttcaaaaat ggtgtagact
1561 ctttggatcc attcgaccag atcaaagatt ctgatatcag aaccattatg tacaatagtt
1621 caggttctgc cccatctttg tttgtcggta ccgaagcttt tgaagtttta gttaaacagc
1681 aaattagaag atttgaagaa ccatctctac gtttagttac tctggtgttt gatgaacttg
1741 ttcgtatgct aaaacagatt atttcacaac caaagtactc aaggtatcct gctctaagag
1801 aagcgatttc taatcagttc attcagttct taaaggatgc tactattcct acgaatgagt
1861 ttgttgtcga tatcatcaaa gctgaacaaa cttacatcaa tacagcccat cccgaccttt
1921 tgaagggttc tcaagcaatg gttatggtgg aagaaaaatt acatcctcgc caagtcgctg
1981 ttgacccaaa gacgggtaaa ccattaccaa cccaaccatc gtctagtaag gcgccagtta
2041 tggaagagaa atcaggattt tttggtgggt tcttctccac taaaaacaag aagaaattgg
2101 cagctttgga atccccacct cctgttttaa aagctactgg ccaaatgaca gagagggaaa
2161 caatggaaac agaagtaatc aagttgttga ttagtagtta tttctctatt gtcaaaagaa
2221 ccattgccga tattatacca aaggctttga tgcttaaatt gattgtgaaa agtaaaactg
2281 atattcagaa agttttactc gaaaaacttt acggaaagca agatattgaa gaattaacga
2341 aagaaaacga cataaccatt caaagaagaa aagaatgtaa gaagatggtc gagatattga
2401 gaaacgctag tcaaatcgtc tcctctgttt aggttttcct catctatacc ggtcgac
//
LOCUS R75RELAX 99 bp ds-DNA BCT 10-JUL-1990
DEFINITION Plasmid R751 relaxation region.
ACCESSION M33118
KEYWORDS .
SOURCE Plasmid R751 DNA.
ORGANISM Plasmid R751
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 99)
AUTHORS Pansegrau,W., Ziegelin,G. and Lanka,E.
TITLE The origin of conjugative IncP plasmid transfer: Interaction with
plasmid-encoded products and the nucleotide sequence at the
relaxation site
JOURNAL Biochim. Biophys. Acta 951, 365-374 (1988)
STANDARD simple staff_entry
BASE COUNT 32 a 26 c 22 g 19 t
ORIGIN
1 gaataaggga cagtgaagat agataaccgg ctcgccggtt agctaacttc acacatcctg
61 cccgccttac ggcgttaata acaccaagga aagtctaca
//
LOCUS RP4RELAX 99 bp ds-DNA BCT 10-JUL-1990
DEFINITION Plasmid RP4 relaxation region.
ACCESSION M33117
KEYWORDS .
SOURCE Plasmid RP4 DNA.
ORGANISM Plasmid RP4
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 99)
AUTHORS Pansegrau,W., Ziegelin,G. and Lanka,E.
TITLE The origin of conjugative IncP plasmid transfer: Interaction with
plasmid-encoded products and the nucleotide sequence at the
relaxation site
JOURNAL Biochim. Biophys. Acta 951, 365-374 (1988)
STANDARD simple staff_entry
BASE COUNT 27 a 28 c 28 g 16 t
ORIGIN
1 gaataaggga cagtgaagaa ggaacacccg ctcgcgggtg ggcctacttc acctatcctg
61 cccggctgac gccgttggat acaccaagga aagtctaca
//
LOCUS ACCTRPF 1466 bp ds-DNA BCT 10-JUL-1990
DEFINITION A.calcoaceticus 5'-phosphoribosyl anthranilate isomerase (trpF)
and tryptophan synthase (trpB) genes, complete cds and 5' end.
ACCESSION M34485
KEYWORDS 5'-phosphoribosyl anthranilate isomerase; tryptophan synthase.
SOURCE A.calcoaceticus DNA.
ORGANISM Acinetobacter calcoaceticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1466)
AUTHORS Ross,C.M., Kaplan,J.B., Winkler,M.E. and Nichols,B.P.
TITLE An evolutionary comparison of Acinetobacter calcoaceticus trpF with
trpF genes of several organisms
JOURNAL Mol. Biol. Evol. 7, 74-81 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 506 1147 5'-phosphoribosyl anthranilate isomerase
pept 1149 > 1466 tryptophan synthase (trpB) (gtg start codon)
BASE COUNT 430 a 297 c 334 g 405 t
ORIGIN
1 gatcaagttt agttgcatct gttgaatcat cagcaaaaac agttgttgaa gaaaacccca
61 ttgcaattgc aatcgccccc actaaacggg taggctgaaa agaaatagac atgtattgtg
121 ctccatacat tcaccccacg tgaatgattg agtggataga tgtaacaagc aggtctccgg
181 actcaaatgg catctcaaaa agagacaagc atattcacct tcccacatct atgcatgcag
241 tggcgtaagt ctaaatgact tttttaatat ggtttacatt tttaccgttg cgggggcagc
301 actggatttg caccagtttc cctaaagcga atgcttttaa cttgttacga attgtgtaaa
361 gtataaagtc tgagcgaaga ttaaacaatc tgaatacgat caaattcgtt caactttgac
421 gcaaagcaca aaaattgcat tacaatactt agcccaatga tggatagatc ggctgtctgt
481 caggcaatac aatgagcttc tttctatgcg aacgcgcgca aaaatttgcg gtattacccg
541 ttcccaagat gtccaagcag cagtaagtgc aggtgcagat gccattggac tggttttttt
601 cccaccaagt cctcgacatg tttctatagc gcaagcgcaa gcattgctcc agcatattcc
661 cgcttatgtt caggtggttg gtttatttgt gaatgcaact gcggatcaaa tcaaatcagt
721 gcttgattgt gtggctttgg atgtattaca actacatggc gatgaaacgc ctgagcaatg
781 tcaagagatt gctctgcagt gcaagcgtcg ctggtataaa gccattcaag ttaaaccaga
841 gcttgatgta gttgatgaag ttcagcgtta tcaggccgct ggtgcaagtg cggtattgct
901 ggatgcgtgg catccagagc tcaaaggtgg aactggtcat caatttgatt ggtcgaagtt
961 tcccaagctg gatattccac ttattcttgc aggcggttta acgcctgaaa atgttgtaga
1021 tgccattcaa accacacacg cttttgcagt ggatgtgagc ggaggggtag aggccgcaaa
1081 aggtattaaa gataaacaac tcatcgaacg atttatgcaa ggagtccaat gtggatcagc
1141 aaaataacgt gattgactat acgcaatatc cagatgctcg tgggcatttt ggtattcatg
1201 gcggacgttt tgtatcagaa acacttatgg cggcacttga agatttagaa aatctttaca
1261 accgcatgaa aaatgacgaa cagtttctgg cagaatttga ccgcgatctt gcctattatg
1321 taggtcgtcc tagtccactt tattatgctg aacgatggtc aaagaagctc ggtggtgcgc
1381 aaatttactt aaaacgtgaa gacctgaatc atacaggttc acacaaagtt aataacacca
1441 ttggtcaggc attattggcc aagctt
//
LOCUS BCIGLCA 2316 bp ds-DNA BCT 10-JUL-1990
DEFINITION B.circulans beta-1,3-glucanase A1 (glcA) gene, complete cds.
ACCESSION M34503
KEYWORDS beta-1,3-glucanase.
SOURCE B.circulans (strain WL-12) DNA, clone pNT003.
ORGANISM Bacillus circulans
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2316)
AUTHORS Yahata,N., Watanabe,T., Nakamura,Y., Yamamoto,Y., Kamimiya,S. and
Tanaka,H.
TITLE Structure of the gene encoding beta-1,3-glucanase A-1 of Bacillus
circulans WL-12
JOURNAL Gene 86, 113-117 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 241 2289 beta-1,3-glucanase A1 (glcA)
BASE COUNT 705 a 489 c 538 g 584 t
ORIGIN
1 ggaaattcaa cccacagagt atcgacaaat gatgcgccaa aacgtagaac gtgaagtaca
61 ataccacagt acaaatatat aaattgaatc aaaacccaaa aaattgggat ataacaaaaa
121 taattgtacc ttttcagcag attatcctat tcgatagaat aaagatattc ccccatgtaa
181 gcgatttcct ttatacgcat agattgggag aaactattat cctatcaaag gagggcaatt
241 atgaaaccat ctcactttac ggagaaacgg tttatgaaaa aggtacttgg tttgttctta
301 gtggttgtga tgctggctag tgttggcgtg ttgccaactt caaaagttca agcagctggg
361 accacagtta cctcaatgga gtacttctca ccagcagatg gacctgttat ttcaaaatct
421 ggcgttggca aagccagcta cggatttgtt atgcctaagt tcaatggagg ctccgctacg
481 tggaacgatg tttacagtga cgtgggtgtc aatgtgaaag tgggtaacaa ctgggttgat
541 attgatcaag ccggaggtta tatctataac caaaactggg ggcactggag cgatggcggt
601 ttcaatggct attggttcac cctttccgca acaaccgaaa ttcaactgta ctccaaagcg
661 aatggtgtta agcttgaata tcaacttgta ttccaaaaca ttaacaaaac aaccatcaca
721 gcgatgaatc cgacacaagg gccgcaaatt acagcaagtt tcacaggcgg tgcaggcttt
781 acatatccaa cgttcaacaa tgattctgcg gtaacctatg aagccgtagc ggatgatttg
841 aaggtgtatg taaaacctgt aaacagcagc tcatggattg atattgacaa taatgcagcc
901 agcggctgga tttatgatca caacttcggc caattcaccg acggtggagg aggttactgg
961 tttaacgtaa cggaatcgat caacgtcaaa ttggaatcaa agacttcttc ggctaacctt
1021 gtttatacaa ttacgtttaa tgaacctaca agaaattcat atgtcattac gccatacgaa
1081 ggaacaacct tcacagcaga tgcgaatggt tccattggaa tcccgcttcc caaaattgat
1141 gggggtgcgc caatcgccaa agaactgggc aatttcgtat atcagattaa catcaatggg
1201 caatgggtgg atttgagtaa ctccagtcag agcaagtttg catactcggc taatggctac
1261 aacaatatgt ctgatgccaa ccagtggggg tactgggccg attatatcta tggcctttgg
1321 ttccagccaa tccaggaaaa tatgcaaatc cgtatcggat atccgctgaa cggacaggcg
1381 ggtggaaata ttggcaacaa cttcgtcaac tataccttca tcggtaatcc aaatgctccg
1441 cgtccggatg tatccgatca agaggatatc tcgatcggaa caccaactga cccggctatt
1501 gcgggcatga atcttatctg gcaggatgaa tttaacggaa ctacactgga tacaagtaaa
1561 tggaactatg aaacaggtta ttatctcaat aacgatcccg ctacttgggg atggggaaat
1621 gcagagttgc agcactacac aaacagcaca caaaatgtat atgtacagga cgggaagctg
1681 aatatcaaag ccatgaacga tagcaaatct ttcccgcagg atccgaatcg gtatgcacag
1741 tattcttcag gtaagattaa caccaaggat aaactctcct tgaagtacgg cagagtagat
1801 tttcgtgcca agcttcctac aggggatggc gtttggccag cgctgtggat gcttccaaaa
1861 gattctgtat atggcacatg ggctgcatcg ggtgaaatcg atgttatgga agcaagagga
1921 cgtcttccag ggtctgtaag cggtaccata cactttggcg gacaatggcc cgtgaaccag
1981 tcttcgggtg gcgattatca cttcccagaa gggcaaactt ttgccaatga ttatcatgta
2041 tactcggtag tctgggaaga ggacaatatt aaatggtatg tcgacggcaa gtttttctat
2101 aaagtcacta accagcagtg gtattccaca gctgcaccga ataatccgaa tgctcctttc
2161 gatgagccgt tctacctcat tatgaacttg gcagtcggcg gaaacttcga cggaggccgt
2221 actccgaacg cgtccgatat cccggcaact atgcaagtgg attatgtacg tgtgtataaa
2281 gaacagtaat aaaacagccg tttccgcgat tggggt
//
LOCUS CHKAGLOB 1737 bp ds-DNA VRT 10-JUL-1990
DEFINITION Chicken alpha-globin gene, alpha-5HR DNA fragment.
ACCESSION M34465
KEYWORDS alpha-globin.
SOURCE Chicken fibroblast DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 1737)
AUTHORS Kalandadze,A.G., Bushara,S.A., Vassetzky,Y.S.Jr. and Razin,S.V.
TITLE Characterization of DNA pattern in the site of permanent attachment
to the nuclear matrix located in the vicinity of replication origin
JOURNAL Biochem. Biophys. Res. Commun. 168, 9-15 (1990)
STANDARD simple staff_review
BASE COUNT 356 a 589 c 447 g 345 t
ORIGIN
1 gcggcacggg gcggccccgg gcccggcgcg cacttactgg ccttggcggc ggggtgctcg
61 gcgccgcgct ggaaggggaa gcggaagagc agcttgttgc cgcggctgcc cgagctcaca
121 aggataacgc tgatggggct ggtgctctcg cccatgccgc cgcgccacag cgagcaccgg
181 gcgggcaacg acggacgcgg ctccgcggaa ggcggcccgg cccgcgcgac ttccgcttcc
241 gcgcctccgc cgccgccgcc ggttcccccg ggccgcggcc gagcggcggg gcggagctgc
301 gggcacagcg ctccccgggc aggtcgcgct cagaggccgg gccgccgctt cagcgccgtg
361 ccctcagtgc ggcccagcgc cgtgcccgca gcgctgccca cacgccctcg gggtgcccca
421 cggctgctgc ttgctcccgg tgcccgccgt tcctcccagc acctcgcagt gcagccgtgc
481 ctgaagtgca gcccagcacc tcacacctca gccccgggct cccagtacga ccagcaggtc
541 acgttggagt ctcttgtcct caagactgcg cagtgtctca cctttgagcc ttgtgccccc
601 cattcagccc agcacatcac actgtagccc ttacaccctc accacagcac agcacctcac
661 gttcaggccc cagcacgtca agatggagcc ctgtgccccc agacagccag catggaacca
721 tcaaatcctt agagttggaa gatgtctgaa tccttgtgcc cccagttcag cccggcacct
781 ctcacacccc actcaacact cttcagccaa gagcctacag ctcaacccag cacctcacgc
841 cacccagcag cactcccgcc atcagcccag tgcccccagt ccggatcggt acctctcatg
901 cccatgcaca gtgcaccaga tcagcctagc accactagtt cattccagca cctcacgtgc
961 ccacagccaa ccactccagc acccccggtg ccctagtcac acctctccgc tgcctcaagg
1021 ttcattccca cctcttccca catcccctca caccccctca ttattttcat gtctcgcaat
1081 ctcctttggt cacttggagt cattcagtta tgacaactcc agaactagaa gctgctggcc
1141 agcagcaagt gccacaaact gtgttccccc ggcagctctt ctggctcatt tgtcttattg
1201 tgtgtccagc tgagatcaga aagctatcgg caattatgtc agaggatggc ccagtttttc
1261 acatagattt gtctgtattt gatagcaata tttagtattt ggtgctccga gtatccccac
1321 tctggatttt tctctgcaag attcttccct tggacttcag gcagagaagg ggactgaaag
1381 ggagatgagc acccgcagtg agggcttaat ctgcacggcc attctctgca aggcaggtga
1441 taacaactga agcaagagaa gctgtcattg aggggagaga gttgttggtg agcgattaaa
1501 gagcagtcac attatcacag cagagcattc atcgtggccc agtgctgggg agctacgtta
1561 gaattgccca gtgtgtctgc ttcccagcat aactatgcat tcttcaatta aaaaactgca
1621 ggcatgtttg ccatttccag ctctcggaga tgagttaaag caaagctctg gaaacctgca
1681 agctctctga gtgctagtag aatgaaatga aagaataaag ccagatatag attctgc
//
LOCUS HUMPDHBA 1484 bp ss-mRNA PRI 10-JUL-1990
DEFINITION Human pyruvate dehydrogenase beta-subunit mRNA, complete cds.
ACCESSION M34479
KEYWORDS pyruvate dehydrogenase.
SOURCE Human liver, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1484)
AUTHORS Ho,L. and Patel,M.S.
TITLE Cloning and cDNA sequence of the beta-subunit component of human
pyruvate dehydrogenase complex
JOURNAL Gene 86, 297-302 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 1080 pyruvate dehydrogenase beta-subunit precursor
/hgml_locus_uid="LU0223C" /nomgen="PDHB"
/map="unassigned"
sigp 1 90 pyruvate dehydrogenase beta-subunit signal
peptide
matp 91 1077 pyruvate dehydrogenase beta-subunit
mRNA < 1 1484 pyruvate dehydrogenase beta-subunit mRNA
BASE COUNT 414 a 287 c 369 g 414 t
ORIGIN
1 atggcggcgg tgtctggctt ggtgcggaga ccccttcggg aggtctccgg gctgctgaag
61 aggcgctttc actggaccgc gccggctgcg ctgcaggtga cagttcgtga tgctataaat
121 cagggtatgg atgaggagct ggaaagagat gagaaggtat ttctgcttgg agaagaagtt
181 gcccagtatg atggggcata caaggttagt cgagggctgt ggaagaaata tggagacaag
241 aggattattg acactcccat atcagagatg ggctttgctg gaattgctgt aggtgcagct
301 atggctgggt tgcggcccat ttgtgaattt atgaccttca atttctccat gcaagccatt
361 gaccaggtta taaactcagc tgccaagacc tactacatgt ctggtggcct tcagcctgtg
421 cctatagtct tcaggggacc caatggtgcc tcagcaggtg tagctgccca gcactcacag
481 tgctttgctg cctggtatgg gcactgccca ggcttaaagg tggtcagtcc ctggaattca
541 gaggatgcta aaggacttat taaatcagcc attcgggata acaatccagt ggtggtgcta
601 gagaatgaat tgatgtatgg ggttcctttt gaatttcctc cggaagctca gtcaaaagat
661 tttctgattc ctattggaaa agccaaaata gaaaggcaag gaacacatat aactgtggtt
721 tcccattcaa gacctgtggg ccactgctta gaagctgcag cagtgctatc taaagaagga
781 gttgaatgtg aggtgataaa tatgcgtacc attagaccaa tggacatgga aaccatagaa
841 gccagtgtca tgaagacaaa tcatcttgta actgtggaag gaggctggcc acagtttgga
901 gtaggagctg aaatctgtgc caggatcatg gaaggtcctg cgttcaattt cctggatgct
961 cctgctgttc gtgtcactgg tgctgatgtc cctatgcctt atgcaaagat tctagaggac
1021 aactctatac ctcaggtcaa agacatcata tttgcaataa agaaaacatt aaatatttag
1081 tttggacttg aatatcaagt cgttgaaatt tatttgaaat acttgctggc actgcacctg
1141 gatttgtact gcaagacctg actattcata aaggaaaacg atttctaaag caacagcagg
1201 tatttttgta cagggaagtt taaatgtgtt tgtgtatgga aaactctcca ctctcctccc
1261 ctagatgcca tgcttccttt tgtctgttac ggttgccatg ttctttgaat aacaaattat
1321 atcacatttt atcctctctc accacaagga caaagtatgg atgtggcaga gtcctgatga
1381 aagatgtatc caaacaagat aacttatatg tataaaatta aagcatataa tacacattta
1441 ctgttagttt gttttgataa ggaataaagg aatttctaac atga
//
LOCUS LEIGP63A 3047 bp ds-DNA INV 10-JUL-1990
DEFINITION L.chagasi major surface glycoprotein (gp63) gene, complete cds.
ACCESSION M28527
KEYWORDS glycoprotein; protease; surface antigen.
SOURCE L.chagasi (isolate MHOM/BR/82/BA-2C1a) DNA, clones pLc63-[1 and 2].
ORGANISM Leishmania chagasi
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 3047)
AUTHORS Miller,R.A., Reed,S.G. and Parsons,M.
TITLE Leishmania gp63 molecule implicated in cellular adhesion lacks an
Arg-Gly-Asp sequence
JOURNAL Mol. Biochem. Parasitol. 39, 267-274 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by M.Parsons, 03-OCT-1989.
FEATURES from to/span description
pept 496 2295 gp63 protein
BASE COUNT 423 a 1195 c 925 g 504 t
ORIGIN
1 ggtacctccc ccaccccggc cctccggccc cgcgcccccg cctctgtgct gtgccgtgcc
61 ctggactccc tctcctccac ctctcctcgc ttctgtcgct ccgcctcccc gagcgacccg
121 cggcgccgcg cggtgcgtgt ctggtgcggc gagtggcggg gtgccgtccc ccctcgctgc
181 ggcacccctc cccgcgccac cacggaggca cccgtgagca cgccaacaga ccaacgcact
241 cacgtcccca tcgtcctccc ccctccccgc accagcaccg acgtgctctc cgctctccct
301 ccctcaccac ctcccctcgc accctccctt gccttctccc tgtcccctcc ctccccagat
361 ccgccaacgc atccgatccc gctacacccc cctctccccc gcccacacgc acgcgcacac
421 cgccgtgcac aagccctcgc cctcgccctc gccaccacac cccactgccc acagcgcccc
481 cgcgcctgca gagccatgtc cgtcgacagc agcagcacgc accggcaccg cagcgtcgcc
541 gcgcgcctgg tgcgcctcgc ggctgccggc gccgcagtca tcgctgctgt cggcaccgcg
601 gccgcgtggg cacacgccgg tgcggtgcag caccgctgca tccacgacgc gatgcaggca
661 cgcgtgcggc agtcggtggc gcgccaccac acggcccccg gcgccgtgtc cgcggtgggc
721 ctgccgtacg ttactctcga caccgcggcc gccgccgatc gccggccggg cagcgcgccc
781 acagtcgtgc gcgccgcgaa ctggggcgcg ctgcgcatcg ccgtctccac cgaggacctc
841 accgaccccg cctaccactg cgctcgcgtc gggcagcaca tcaagaggcg acttggcggc
901 gtcgacatat gcacggccga ggacatcctc accgacgaga agcgcgacat cctggtcaag
961 cacctcatcc cgcaggcgct gcagctgcac acggagcggc tgaaggtgcg gcaggtgcag
1021 gacaagtgga aggtgacggg catgggcgac gatgtgtgca gcgacttcaa ggtgccgccg
1081 gcgcacatca ccgatggcct gagcaacacc gacttcgtga tgtacgtcgc ctccgtgccg
1141 agcgaggagg gtgtgctggc gtgggccacg acctgccagg tgttctctga cggccatcca
1201 gccgtgggcg tcatcaacat ccccgcggcg aacattgcgt cgcggtacga ccagctggtg
1261 acgcgtgtcg tcacgcacga gatggcgcac gcgctcggct tcagcgtcgg cttcttcgaa
1321 ggcgcccgca tcctggagag catttcgaac gttcggcaca aggacttcga tgttcccgtg
1381 atcaacagca gcacggcggt ggcgaaggcg cgcgagcagt acggctgcga caccttggag
1441 tatctggaga tcgaggacca gggcggtgcg ggctccgccg ggtcgcacat caagatgcgc
1501 aacgcgcagg acgagctcat ggcgcctgcc gcagctgccg ggtactacag cgccctgacc
1561 atggccatct tccaggacct cggcttctac caggcggact tcagcaaggc cgaggtgatg
1621 ccgtggggcc ggaacgccgg ctgcgccttc ctcagcgaga agtgcatgga gcggaacatc
1681 acgaagtggc cggcgatgtt ctgcaatgag aacgaggtga ctatgcgctg ccccaccagt
1741 cgtctcagcc ttggaaagtg cggtgttacc cgtcacccgg accttccgcc gtactggcag
1801 tacttcacgg acccgtccct cgccggcatc tccgccttca tggactgctg ccctgtcgtg
1861 gagccctacg gtgatggcag ctgcgcacag cgtgcgtctg aagcgggcgc accattcaaa
1921 ggcttcaacg tcttctccga cgcggcgcgc tgcatcgatg gcgccttcag gccgaagacg
1981 agtcacggca taatcaagtc gtacgccgga ctgtgcgcca acgtgcggtg cgacacggcc
2041 acgcgcacgt acagcgtgca ggtgcacggc ggcagcggct acgccaactg cacgccgggc
2101 ctcagagttg agctgagcac cgtgagcagc gccttcgagg agggcggcta catcacgtgc
2161 ccgccgtacg tggaggtgtg ccagggcaac gtgcaggctg ccaaggacgg cggcaacgcc
2221 gcggctggtc gccgtggtcc gcgcgccgcg gcgacggcgc tgctggtggc cgcgctgctg
2281 gccgtggcgc tctagacggt ggataggacg ggtgctgatg gcgtgtcccc tgctcccccc
2341 tccctccctc cctctcgttg tctctcggaa gagctccacg ctgtcctttc atctcctcgc
2401 ctgttctacg cttgcttcgc tgcgccgctg caccgggccg gtcctcgccg accctcgcct
2461 gccctctccc cctcctctct cccgccaccc caccccgctc cccgctgcgc acggtgcctg
2521 tgcgcttgga gagaggtgca gcagcgcgcg ggagctgagg gagggagggg gtgtcgtgcg
2581 cgggtgcgca tgccttcttt cacttcctta tttgtcttct atttgttccc tgcggcaccc
2641 gcacaccccc acccgctggc ggccatccgc ggcatccgcg ggtgcgtgcg cggtgtgtct
2701 gccttctctc tcctcctttc gctctgtttc cctgtcctcg gactccccgg cgccagcgtg
2761 agctccgcag tcaccgccca cccggcgctc cggcgcggtc agcgccaccc caccccaccc
2821 cctctccccc attcgtgcgt gtctcttctc gctttttctg tttcctcttg tagcagggcg
2881 cgccgcgttg tgggagcggt ggcggcctct gcgcgcggac ggcatgcagg tcggccggga
2941 gagtctcccg ccagcgcccg cgcagcgcag agccgtcgcc cacccaccgt ctcctcccac
3001 cttcgcatgc cgccgcacta ggtgcacgtc gtcggcacga ccaccga
//
LOCUS PFATUBB 2833 bp ds-DNA INV 10-JUL-1990
DEFINITION P.falciparum beta-tubulin gene, complete cds.
ACCESSION M28398
KEYWORDS tubulin.
SOURCE P.falciparum (Brazilian strain 7G8, isolate 78G) nonsynchronous
blood stage DNA, clone 768.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 2833)
AUTHORS Sen,K. and Godson,G.N.
TITLE Isolation of alpha and beta-tubulin genes of Plasmodium falciparum
using a single oligonucleotide probe
JOURNAL Mol. Biochem. Parasitol. 39, 173-182 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.N.Godson, 27-SEP-1989.
FEATURES from to/span description
pept 654 749 beta-tubulin, exon 1
1112 2064 beta-tubulin, exon 2
2228 2516 beta-tubulin, exon 3
IVS 750 1111 TUBB intron A
IVS 2065 2227 TUBB intron B
BASE COUNT 1018 a 311 c 455 g 1049 t
ORIGIN
1 aattcctagt ttatttaatt taaaaattaa aagatcgaat gctcaacatt ttaaaaagaa
61 atctgtgaaa catatcttaa caagaaatgg tgtaacaaaa gaaacaatat taaatgataa
121 attaccaaag ataaatgatg aaattgacag aacatataat ggacacaaaa tggatgaaaa
181 tttacaggat aaacaaaaaa ggaatcatgg agtaaatata aaattaataa atgaatatga
241 aaatatcatg tgaagaataa attctcaaaa tcattgattg tatgacaaga ttcaagaatt
301 ggttatataa aaatatattt aggaaaagta attttgggtc atatgtatca acatttacag
361 gtgtatttgg aggtgctgca gctgttagct gtttctgcca taagtggagc ttgtataact
421 aaatttagtg ttacattggt tccggtattt gcatgttttg ggggtgtctt tgcgattatt
481 ataatattat taatattagg aacatggatg cttgttacat ggttatggca acacaaagaa
541 gtagtatttt tttttttttt taatttttac ttaatatatc ctcttacaat ataaaatatt
601 tatatattta aaaaaaaaag aaaaaatttt ctttgagatt attttattaa agaatgagag
661 aaattgttca tattcaagct ggccaatgtg gaaatcaaat aggtgcaaag ttttgggaag
721 tcatttctga tgagcatgga atagatccag taagtttaaa aaaaaaatat atttatttat
781 atgaatctgt aaacatatgt atatttatat atatatatat atatatatgg aagaataatt
841 ttgtgtgtat aatttggggt ccttcccctt tattgtattc tataaatgcc tcctttatat
901 tgataataat ttatatatgt aaacctttaa tgacgaggct tatatataaa aaccttagat
961 attataaata aatgtatatt atgtacatat gacgatatcg ctctctctat atatatatat
1021 atatatatat atatatattt atttatttat atatttattt atttatttat ttatttattt
1081 tttttttttt tttttatttt atttttttta gagtggtacc tatagtgggg acagtgactt
1141 acagttagaa agagttgacg ttttttacaa cgaagcaaca ggaggtagat atgttccaag
1201 agctatattg atggacttgg aacctggtac tatggatagt gttcgtgctg gcccctttgg
1261 tcaattattt cgtccagata attttgtgtt tggtcaaaca ggtgcaggaa ataattgggc
1321 taaaggacat tatactgaag gtgctgaatt gatagatgca gttttagatg tgcttagaaa
1381 agaagcagaa ggttgtgatt gtttacaagg atttcagatt actcattcat taggtggtgg
1441 tacaggtagt ggtatgggta ctttgttgat tagtaaaata agagaggagt atcctgatcg
1501 tattatggaa acattttctg tatttccatc accaaaagtt tctgatactg ttgttgaacc
1561 atataatgct acattatcag tccatcagtt ggttgaaaat gctgatgaag ttcaagttat
1621 cgataatgaa gctttatatg acatatgttt taggactctt aaattaacaa caccaacata
1681 tggagattta aatcaccttg tatcagctgc aatgtcaggt gtaacctgtt cgttaagatt
1741 tcctggtcaa cttaacagtg acttaagaaa attagctgtt aatttgatcc cattcccacg
1801 tttacatttc tttatgtacg ggtttgctcc tttaactagt agaggcagtc aacaatacag
1861 agccttaact gtgccggagt taacacaaca aatgttcgac gcaaaaaata tgatgtgcac
1921 aagtgatcca agacatggaa gatatttaac ggcatgtgct atgtttagag gaagaatgtc
1981 cacaaaggaa gttgacgaac aaatgttaaa cgttcaaaat aaaaactcat cttattttgt
2041 cgaatggatt cctcacaaca caaagtaaga aggaacaatt gatactagta tgcatgtttt
2101 tttgtttata tgtatttata tatatatata tatatatgta ttcatttata tattttgaaa
2161 tatacatttt acatataaat tttttttttt tctttttctt tttttttttt tttgtttttt
2221 tctttagatc aagtgtttgt gatattccac cattgggatt aaaaatggct gttacttttg
2281 taggaaactc aaccgccatt caagaaatgt ttaaaagagt ttctgatcaa tttactgcta
2341 tgtttagaag aaaagccttt ttgcactggt acaccggaga aggtatggac gagatggaat
2401 ttacagaagc tgaatcaaat atgaatgatt tagtttcaga atatcaacaa tatcaagatg
2461 ctacagcaga agaggaagga gaatttgaag aagaagaagg agacgtagaa gcctaaatct
2521 atttatattt atgaaaatat atacatatta tatatatatg tatatgtaat taacaagaat
2581 aaaaaataaa aaataaaaaa aaaataaaat aaaaaaataa aaatacataa taaaaaagta
2641 taaaataaat atctaatcat taattatata taacaatata atttaactct tttttttttt
2701 attattattg aagttatgtt cgggtatata taacatatat ataaattata tatatgttgc
2761 agtttctttt tttttttttt tttttttttt tcttatcatt tgattttaca ctcacatata
2821 tatgacatat ata
//
LOCUS RATADOME1 2513 bp ds-DNA ROD 10-JUL-1990
DEFINITION Rat S-adenosylmethionine decarboxylase pseudogene, complete cds.
ACCESSION M34463
KEYWORDS S-adenosylmethionine decarboxylase; pseudogene.
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2513)
AUTHORS Pulkka,A., Keraenen,M.-R., Salmela,A., Salmikangas,P.,
Ihalainen,R. and Pajunen,A.
TITLE Nucleotide sequence of rat S-adenosylmethionine decarboxylase cDNA.
Comparison with an intronless rat pseudogene
JOURNAL Gene 86, 193-199 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps 436 1379 pseudo-S-adenosylmethionine decarboxylase
BASE COUNT 771 a 478 c 518 g 746 t
ORIGIN
1 tctactaaac atgataaaga atttaagaaa tccatctctt cacttccagt ctatatatct
61 ttgagatgct attcaggata ctgagttaaa aaataagatt aggcttacac agcatggcgc
121 ggaacattag ctaactctca ctcaactctg acaagaaagc agcagactac atgagactga
181 actgtatctg cctttagttc caacagactc acgttcaact tttcttcacg aaaacagcca
241 gggaaatttt attagtcctt ttttaaaaat agttaatata aaattataac aacaacagca
301 gcagcaacaa caacaaggac cctgaactta gtaacacacg tggaacaaac cgtagcagcg
361 actggagcag tgggagaaga gatttaattt aggtgatttt tttggatttg ttggttgttg
421 gtcagcctca cagtgatgga agttgcacat atttttttga agggactgag aagctgctag
481 aggtctggtt ttccagacag cagtccgacg ccagccaggg acatggggat cttcatacca
541 tcccaagatc tcagtgggat gtgcttttga ggatgtgcag tcctcaacca taagtacgac
601 aaagatgcac aagcaggaag cttacacact cagtgagagt agcatgttta tacatttcat
661 gtgatactac cctcttactg aaagctctgg tttccaggtt gaagctcgct agggattacc
721 gtgggttaga ctcaattctt ttattctcat aagaatttca tgaagccctc tctccaaggg
781 tacccacacc gaaatttcca cgaagaaatc gaatttctta atgcagtttt cccaaatgga
841 gcagcatatt gtatgggaca aacgaattct gactgttggt acttatatac ttggatctcc
901 agagagccga gtcatcaaac agtcagatca accctgggaa ttctgatgag tgagcttgac
961 ctagcagtta cggaccagtt ctattgctgc aaaggatgtc actcgtgaga gtgaattcat
1021 gacctgatat caggtcattg atgacacact gtttaatcct tgcagcttct tgatgaatgg
1081 aatgaaatcg attggactag tcacatcgct ccagaagcag agttctctta tgttagcttt
1141 gaaacaaacc taagtgagac atcctatgac agcccgatca ggaaagttgg gaaattcgtc
1201 aagccaggaa aatttgtgac caccttgttt gttaatcaga gttctaaatg tcgcacaggc
1261 cattcttcat cccagaagat tgacggtttt aaacatcatg attgccaaag tgctatgctc
1321 aacgaagata aatgcaatat tgaatgtatc aaatgaaaag aattcagtct ctggtggagg
1381 gggattggag caaggatgaa tcagcccact aaagaaaact ccatggaaaa gacaggctat
1441 gcagtgcact ttaatcagct tcacacggtg cctaccatgc cttcactaac taaccaagta
1501 gtgatagaaa tgtccactaa gtcaaagcag aaatgtaata ctaagcattc tgacctcagt
1561 aagcaccacc attgccacca ttgccaccaa tttttactaa aggaaatttt gaatcaaatg
1621 aggatctgta gtttccgtct gttctgaggt cggctgttct ctttggtctt cgtttcacca
1681 tggcgctcag atgatcaaat gagtagctgc cagagggagg aatctccagg ttacttagcc
1741 tggagaatgg atgaatggat gaaacagcac aatattatga ctgtttagaa atacaggctt
1801 tcaagagtcg gcatgttagt ggcatttgta gatactgtgg aatttaagca gcaaagaaca
1861 aattggacta aatttcctat taattgccct cccactgttt cttggtagtt tctggactgg
1921 cacatcgatg tttttttttt ttttttcctt ccatatttaa aatgaagcac ttttttagca
1981 tttctaagca aagaatgcac ttggtttgta atcaagtagt tggaacgctg tctgaatgtt
2041 tactttatac accatgctga ttgaacgctt cattgaggaa gctttcagtc agttattggt
2101 ctgattctgt aatgagcaca gcacgtggtt tgaattgcca tttggaggac cagtgcttat
2161 ttaggctgga tcgcgtaaac cggtagattt tagcttgagg tttgattccc tcaccttata
2221 aaattaagaa ttctaatgtt gaaaattgca taggtttgtg tgaaacaaag cccagaagag
2281 aaactgtagg tagactagta atcttgtgta attataggtg agaagtttta gtgccgtaat
2341 ttctttgttg gcgttggact tttatcagct gaaatgtatt tctgtaccac aatgtaagct
2401 tcaataaagt ttgcttaatt gtctagtaac attaaaaaat ataagattaa tagaattgat
2461 ctcaacagta aggaaacaaa actaccttta ttattacata acataatctt tca
//
LOCUS RATADOMET 3102 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Rat S-adenosylmethionine decarboxylase mRNA, complete cds.
ACCESSION M34464 M21155 J04048 M21783
KEYWORDS AdoMet decarboxylase; S-adenosylmethionine decarboxylase.
SOURCE Rat prostate, cDNA to mRNA, clone pSAMr1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 232 to 1821)
AUTHORS Pajunen,A., Crozat,A., Janne,O.A., Ihalainen,R., Laitinen,P.H.,
Stanley,B., Madhubala,R. and Pegg,A.E.
TITLE Structure and regulation of mammalian S-adenosylmethionine
decarboxylase
JOURNAL J. Biol. Chem. 263, 17040-17049 (1988)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 3102)
AUTHORS Pulkka,A., Keraenen,M.-R., Salmela,A., Salmikangas,P.,
Ihalainen,R. and Pajunen,A.
TITLE Nucleotide sequence of rat S-adenosylmethionine decarboxylase cDNA.
Comparison with an intronless rat pseudogene
JOURNAL Gene 86, 193-199 (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by A.Crozat, 27-OCT-1988.
FEATURES from to/span description
pept 273 1274 S-adenosylmethionine decarboxylase (EC
4.1.1.50)
mRNA 1 3102 S-adenosylmethionine decarboxylase mRNA
BASE COUNT 835 a 650 c 724 g 893 t
ORIGIN
1 cggggaaagc agcggactac aagagactga actgtatctg cctctatttc caacggactc
61 acgttcaact ttcgctcacg aaaatagccg ggaaaatttt attagtcctt tttttaaaaa
121 aagttaatat aaaattatag caaaaaaaaa aaaaggaacc tgaactttag taacacagct
181 ggaacaatcc gcagcggcgg caggagcggc gggagaagag tttaatttag ttgattttct
241 gtggttgttg gttgttcgct agtctcacgg tgatggaagc tgcacatttt ttcgaaggga
301 ccgagaaact gctggaggtc tggttctcca gacagcagtc cgacgcaagc cagggatctg
361 gggaccttcg taccatccca agatccgagt gggatgtcct tctgaaggat gtgcagtgct
421 caatcataag tgtgacaaag actgacaagc aggaagctta tgtactcagt gagagtagca
481 tgtttgtctc caagagacgt ttcattttga agacatgtgg taccaccctc ttactgaaag
541 cactggttcc cctgttgaag cttgctaggg actacagtgg gtttgactcg attcaaagct
601 tcttttattc tcgtaagaat ttcatgaagc cttctcacca agggtaccca caccggaatt
661 tccaggaaga aatcgagttt cttaatgcaa ttttcccaaa cggagcagga tattgtatgg
721 gacgtatgaa ttctgactgt tggtacctgt acactttgga tctcccagag agccgagtaa
781 tcaatcagcc agatcaaacc ctggaaattc tgatgagtga gcttgaccca gcagttatgg
841 accagttcta catgaaagat ggtgttactg caaaggatgt cactcgtgag agtggaattc
901 gtgacctgat accaggttct gtcattgatg ccacactgtt caatccttgt ggctactcaa
961 tgaatggaat gaaatcggat ggaacatatt ggactattca catcactcca gaaccagaat
1021 tttcttatgt tagctttgaa acaaacctaa gtcagacctc ctatgatgac ctgatcagga
1081 aagttgtgga agtcttcaag ccaggaaaat ttgtgaccac cttgtttgtt aatcagagtt
1141 ctaagtgtcg cacagtgctt tcttcgcccc agaagattga cggtttcaaa cgtcttgatt
1201 gccagagcgc tatgttcaac gattacaatt ttgtttttac cagttttgct aagaaacagc
1261 aacaacagag ttgattagga aaaatgaaaa agaaaaaacg caaaaagaga agacacacag
1321 gaggtggtgg ctgctttcta gatgttgatc ctgggggcca tgctgaccgt gaccaccacc
1381 ttgtagctgc agaaagccct aggtgtaatg atagtgtaat cattttgaag tgtatgcatt
1441 attatatcaa ggagttagat atcttgcatg aatgctctct tctgtgttta ggtgttctat
1501 gccactcttg ctgtggaact gaagtgcatg tagaaaagaa ctctgactgt atgaatcttt
1561 acgacacttg tgaaaacgat tcgacttggt ttatgcacag cgtaatattt ctgcaggcat
1621 cgtccaaaat cccccacaga caaggctttc gtccccatta gatgcggcct cagctgacca
1681 ttggcgactg ttctatttgc tgccagagtt tttacatcca gttacctcca ctttctagag
1741 catattctct actaatgttc aaaaccgatt tctacttcat acgggtgtct tatgcaatgg
1801 caattaaagt tttcttccac aagttgagtc tttgtaagga aatgattcca gttgcttgtt
1861 ttgtgttcta ctgttttagt aattgctcct gcatttatag tcctatggtt tttcactacc
1921 cctgatgaag caatacacgg tcacactgtg ggcttacatt gtaatcttca ccccagatgg
1981 gagctcagag acggtccctt gctcattttt ccctaagatg tagaatgtgg ccttgctatt
2041 ggcatgccct tctgtggaag ataaatgatg gaagtgaaag tatcccgggg gtgagcaagg
2101 agaaccaccc catggcagtg atgggcttgg cagtgcactc cgagctctca cagtggagtg
2161 cccaccatgc cttcactaac tcactgagca gtgataggat gcccaccaag tcagagcaga
2221 aatctaaccc taaggattct cacctcggta agtgccgcca ttgccaccac tttactaaag
2281 gaagtttccg ctcagaggag agtctgtact tcccgcctgt cctaatgtca gctgttctct
2341 ctggtctttc accatggcgt tcagatgctc aaatgaatgg ctgatcggcc gcagggagga
2401 ctctccgggt tactgggcct ggagaatgga gaaacaggca cggtattctg acagttaatg
2461 gcaccagaga tgcgggcttt caagagctgg cctgttagtg gcatttttaa gcagaaaaga
2521 gcaaactaga cgaagttccc tatttattgc cctcccactg tttccttggc agtttctgga
2581 ctggcgcaat gatgccttgt tccttccgta tttataacga agctaaaaag cgtttctaag
2641 catggagtct acttggtttg aaatcaagtg gttggaacac tgtctggatt tttactttac
2701 gcagtgttga ttgaacgctt cgttggggaa gccttcagtc cgcttcatcg gtctgttctg
2761 taatgagcac agcacaccta gtttgaattg ctgtttggag ggccagtgct tatttgagct
2821 gggtcttgta acccagtaga ttttggcttg aggtctgact cccccatctt acgaaattaa
2881 gaattctaat gttggaaatt gcatagggtt tgcgtggaaa aaagcccagg gaaaaaaaaa
2941 aaaaaacaga aggcggacta gtgatctagt gtgattacag gcggggaagt tttggtgcca
3001 taatttcttt gttggtgttg gacttttaat cagctgaaat gtatttctgt accacaatgt
3061 aagcttcaat aaaagtttgc ttaattgtct agtaacatcc ag
//
LOCUS HUMCD38 1407 bp ss-mRNA PRI 10-JUL-1990
DEFINITION Human lymphocyte differentiation antigen CD38 mRNA, complete cds.
ACCESSION M34461
KEYWORDS cell surface glycoprotein; lymphocyte differentiation antigen CD38;
membrane glycoprotein.
SOURCE Human PHA-treated peripheral blood cell, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1407)
AUTHORS Jackson,D.G. and Bell,J.I.
TITLE Isolation of a cDNA encoding the human CD38 (T10) molecule, a cell
surface glycoprotein with an unusual discontinuous pattern of
expression during lymphocyte differentiation
JOURNAL J. Immunol. 144, 2811-2815 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 70 972 lymphocyte differentiation antigen CD38
/hgml_locus_uid="LZ0047A" /nomgen="CD38"
/map="4"
BASE COUNT 381 a 332 c 326 g 368 t
ORIGIN
1 ctaaagctct cttgctgcct agcctcctgc cggcctcatc ttcgcccagc caaccccgcc
61 tggagcccta tggccaactg cgagttcagc ccggtgtccg gggacaaacc ctgctgccgg
121 ctctctagga gagcccaact ctgtcttggc gtcagtatcc tggtcctgat cctcgtcgtg
181 gtgctcgcgg tggtcgtccc gaggtggcgc cagacgtgga gcggtccggg caccaccaag
241 cgctttcccg agaccgtcct ggcgcgatgc gtcaagtaca ctgaaattca tcctgagatg
301 agacatgtag actgccaaag tgtatgggat gctttcaagg gtgcatttat ttcaaaacat
361 ccttgcaaca ttactgaaga agactatcag ccactaatga agttgggaac tcagaccgta
421 ccttgcaaca agattcttct ttggagcaga ataaaagatc tggcccatca gttcacacag
481 gtccagcggg acatgttcac cctggaggac acgctgctag gctaccttgc tgatgacctc
541 acatggtgtg gtgaattcaa cacttccaaa ataaactatc aatcttgccc agactggaga
601 aaggactgca gcaacaaccc tgtttcagta ttctggaaaa cggtttcccg caggtttgca
661 gaagctgcct gtgatgtggt ccatgtgatg ctcaatggat cccgcagtaa aatctttgac
721 aaaaacagca cttttgggag tgtggaagtc cataatttgc aaccagagaa ggttcagaca
781 ctagaggcct gggtgataca tggtggaaga gaagattcca gagacttatg ccaggatccc
841 accataaaag agctggaatc gattataagc aaaaggaata ttcaattttc ctgcaagaat
901 atctacagac ctgacaagtt tcttcagtgt gtgaaaaatc ctgaggattc atcttgcaca
961 tctgagatct gagccagtcg ctgtggttgt tttagctcct tgactccttg tggtttatgt
1021 catcatacat gactcagcat acctgctggt gcagagctga agattttgga gggtcctcca
1081 caataaggtc aatgccagag acggaagcct ttttccccaa agtcttaaaa taacttatat
1141 catcagcata cctttattgt gatctatcaa tagtcaagaa aaattattgt ataagattag
1201 aatgaaaatt gtatgttaag ttacttcctt tagagcacaa tggatctcga gggatcttcc
1261 atacctacca gttctgcgcc tgcgagtcgc ggccgcatct agaggatctt tgtgaaggaa
1321 ccttacttct gtggtgtgac ataattggac aaactaccta tagagattta aagctctaag
1381 gtaaatataa aatttttaag tgtataa
//
LOCUS MUSCD28 1492 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse glycoprotein CD28 mRNA, complete cds.
ACCESSION M34563
KEYWORDS glycoprotein CD28.
SOURCE Mouse lymphoma T cell line EL4, cDNA to mRNA, clone lambda-SSD1.5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1492)
AUTHORS Gross,J.A., St John,T. and Allison,J.P.
TITLE The murine homologue of the T lymphocyte antigen CD28: Molecular
cloning and cell surface expression
JOURNAL J. Immunol. 144, 3201-3210 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 57 713 glycoprotein CD28 precursor
sigp 57 113 glycoprotein CD28 signal peptide
matp 114 710 glycoprotein CD28
BASE COUNT 401 a 355 c 332 g 404 t
ORIGIN
1 acacactctg ccttgctcac agaggagggg ctgcagccct ggccctcatc agaacaatga
61 cactcaggct gctgttcttg gctctcaact tcttctcagt tcaagtaaca gaaaacaaga
121 ttttggtaaa gcagtcgccc ctgcttgtgg tagatagcaa cgaggtcagc ctcagctgca
181 ggtattccta caaccttctc gcaaaggaat tccgggcatc cctgtacaag ggcgtgaaca
241 gcgacgtgga agtctgtgtc gggaatggga attttaccta tcagccccag tttcgctcga
301 atgccgagtt caactgcgac ggggatttcg acaacgaaac agtgacgttc cgtctctgga
361 atctgcacgt caatcacaca gatatttact tctgcaaaat tgagttcatg taccctccgc
421 cttacctaga caacgagagg agcaatggaa ctattattca cataaaagag aaacatcttt
481 gtcatactca gtcatctcct aagctgtttt gggcactggt cgtggttgct ggagtcctgt
541 tttgttatgg cttgctagtg acagtggctc tttgtgttat ctggacaaat agtagaagga
601 acagactcct tcaagtgact accatgaaca tgactccccg gaggcctggg ctcactcgaa
661 agccttacca gccctacgcc cctgccagag actttgcagc gtaccgcccc tgacagggac
721 ccctatccag aagcccgccg gctggtaccc gtctacctgc tcatcatcac tgctctggat
781 aggaaaggac agcctcatct tcagccggcc actttggacc tctactgggc caccaatgcc
841 aactatttta gagtgtctag atctaacatc atgatcatct tgagactctg gaatgaatga
901 cagaagcttc tatggcagga taaagtctgt gtggcttgac ccaaactcaa gcttaataca
961 tttattgact tgattgggga agttagagta gagcaatcaa aaagatcatt cattcagcct
1021 tgggaagtca atttgcaggc tcctggatga gccctgcccc gttttcactt gccagcacat
1081 ttcagtcatg tggtgtgata gccaaagatg ttttggacag agaagaaagg atagaaaaac
1141 cttctctttg gctaagttgg tgtttggggt ggggataggt tagagtatag tacttaacta
1201 tttgaaaaat aatgaaaaca cttttttcac tcatgaaatg agccacttag ctcctaaata
1261 gtgttttcct gttagtttag aaagttgtgg acatattttt ttaatgattt ctgaccattt
1321 ttaatcacat tgactcatgg aatggcctca aagcaccccc cagtgcttct ttcctcattc
1381 ccggtcatgg gaactcagta ttattaatag tcacaacatg atttcagaac tagatagccc
1441 tcccacacca agaagaatgt gagaggaagt aaggtcactt tatgtaaaaa cg
//
LOCUS MUSIGHAAU 294 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 123) mRNA V region, partial
cds.
ACCESSION M34523
KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene;
variable region.
SOURCE Mouse (BALB/c), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 294)
AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
TITLE V-H and V-L gene usage by murine IgG antibodies that bind
autologous insulin
JOURNAL J. Immunol. 144, 3091-3098 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 294 Ig gamma-chain V-region (AA at 1)
BASE COUNT 83 a 61 c 77 g 73 t
ORIGIN
1 caggtccagc tgcagcagtc tgggccagag gtggtgaggc ctggggtctc agtgaagatt
61 tcctgcaagg gttccgacta cacattcact gattatgcta tgcactgggt gaagcagagt
121 catgcaaaga gtctagagtg gattggagtt attagtactt acaatggtaa tacaaactac
181 aaccagaagt ttaagggcaa ggccacaatg actgtagaca aatcctccag cacagcctat
241 atggaacttg ccagattgac atctgaggat tctgccatgt attactgtgt acgt
//
LOCUS MUSIGHAAV 294 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 126) mRNA V region, partial
cds.
ACCESSION M34524
KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene;
variable region.
SOURCE Mouse (BALB/c), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 294)
AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
TITLE V-H and V-L gene usage by murine IgG antibodies that bind
autologous insulin
JOURNAL J. Immunol. 144, 3091-3098 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 294 Ig gamma-chain V-region (AA at 1)
BASE COUNT 83 a 69 c 77 g 65 t
ORIGIN
1 gaggtccagc tgcaacagtc tggacctgag ctggtgaagc ctggggcttc agtgaagata
61 tcctgcaaga cttctggata cacattcact gaatacacca tgcactgggt gaagcagagc
121 catggaaaga gccttgagtg gattggaggt attaatccta acaatggtgg ttctaactac
181 aaccagaagt tcaagggcaa ggccacattg actgtagaca agtcctccag cacagcctac
241 atggagctcc gcagcctgac atctgaggat tctgcagtct attactgtgc aaga
//
LOCUS MUSIGHAAW 294 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 125) mRNA V region, partial
cds.
ACCESSION M34525
KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene;
variable region.
SOURCE Mouse (strain Balb/c), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 294)
AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
TITLE V-H and V-L gene usage by murine IgG antibodies that bind
autologous insulin
JOURNAL J. Immunol. 144, 3091-3098 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 294 Ig gamma-chain V-region (AA at 1)
BASE COUNT 83 a 66 c 74 g 71 t
ORIGIN
1 cagatccagt tggtgcagtc tggacctgaa ctgaagaagc ctggagagac agtcaagatc
61 tcctgcaagg cttctggtta taccttcaca gactattcaa tgcactgggt gaagcaggct
121 ccaggaaagg gtttaaagtg gatggactgg ataaacactg agactggtgt gccaacatat
181 gcagatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat
241 ttgcagatca acgacctcaa aaatgaggac acggctacat atttctgtac taga
//
LOCUS MUSIGHAAX 294 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 127) mRNA V region, partial
cds.
ACCESSION M34526
KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene;
variable region.
SOURCE Mouse (strain Balb/c), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 294)
AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
TITLE V-H and V-L gene usage by murine IgG antibodies that bind
autologous insulin
JOURNAL J. Immunol. 144, 3091-3098 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 294 Ig gamma-chain V-region (AA at 1)
BASE COUNT 81 a 80 c 61 g 72 t
ORIGIN
1 gatgtgcagc ttcaggaggt aggacctgac ctggtgaaac cttctcagtc actttcactc
61 acctgcactg tcactggcta ctccatcacc agtggttata gctggcactg gatccggcag
121 tttccaggaa acaaactgga atggatgggc tacatacact acagtgatag ctctaactac
181 aacccatctc tcaaaagtcg aatctctatc actcgagaca catccaagaa ccagttcttc
241 ctgcagttga attctgtgac tactgaggac acagccacat attactgtgc aagg
//
LOCUS MUSIGKABI 300 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse Ig kappa-chain (anti-insulin Ab 123) mRNA V region, partial
cds.
ACCESSION M34527
KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain Balb/c), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
TITLE V-H and V-L gene usage by murine IgG antibodies that bind
autologous insulin
JOURNAL J. Immunol. 144, 3091-3098 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 300 Ig kappa-chain V-region (AA at 1)
BASE COUNT 74 a 86 c 70 g 70 t
ORIGIN
1 caaattgttc tcacccagtc tccagcaatc atgtctgcat ctccagggga gaaggtcacc
61 atgacctgca gtgccagctc aagtgtaagt tacatgcact ggtaccagca gaagtcaggc
121 acctccccca aaagatggat ttatgacaca tccaaactgg cttctggagt ccctgctcgc
181 ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagcagcat ggaggctgaa
241 gatgctgcca cttattactg ccagcagtgg agtagtaaac cacccatcac gttcggtgct
//
LOCUS MUSIGKABJ 300 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse Ig kappa-chain (anti-insulin Ab 126) mRNA V region, partial
cds.
ACCESSION M34528
KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain Balb/c), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
TITLE V-H and V-L gene usage by murine IgG antibodies that bind
autologous insulin
JOURNAL J. Immunol. 144, 3091-3098 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 300 Ig kappa-chain V-region (AA at 1)
BASE COUNT 77 a 80 c 66 g 77 t
ORIGIN
1 gatattgtgc taactcagtc tccagccacc ctgtctgtga ctccaggaga tagcgtcagt
61 ctttcctgca gggccagcca aagtattagc aacaacctac actggtatca acaaaaatca
121 catgagtctc caaggcttct catcaagtat gcttcccagt ccatctctgg gatcccctcc
181 aggttcagtg gcagtggatc agggacagat ttcactctca gtatcaacag tgtggagact
241 gaagattttg gaatgtattt ctgtcaacag agtaacagct ggcctcacac gttcggctcg
//
LOCUS MUSIGKABK 312 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse Ig kappa-chain (anti-insulin Ab 127) mRNA V region, partial
cds.
ACCESSION M34529
KEYWORDS immunoglobulin heavy-chain; kappa-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain Balb/c), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
TITLE V-H and V-L gene usage by murine IgG antibodies that bind
autologous insulin
JOURNAL J. Immunol. 144, 3091-3098 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 Ig kappa-chain V-region (AA at 1)
BASE COUNT 78 a 81 c 77 g 76 t
ORIGIN
1 gacattgtgc tgacccaatc tccagcttct ttggctgtgt ctctagggca gagggccacc
61 atatcctgca gagccagtga aagtgttgat agttatggca atagttttat gcactggtac
121 cagcagaaac caggacagcc acccaaactc ctcatctatc gtgcatccaa cctagaatct
181 gggatccctg ccaggttcag tggcagtggg tctaggacag acttcaccct caccattaat
241 cctgtggagg ctgatgatgt tgcaagctat tactgtcagc aaagtaatga ggaacctccc
301 acgttcggag gg
//
LOCUS MUSIGKABL 312 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse Ig kappa-chain mRNA V region, partial cds.
ACCESSION M34530
KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain Balb/c), cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
TITLE V-H and V-L gene usage by murine IgG antibodies that bind
autologous insulin
JOURNAL J. Immunol. 144, 3091-3098 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 Ig kappa-chain V-region (AA at 1)
BASE COUNT 71 a 96 c 69 g 76 t
ORIGIN
1 caaattgttc tcacccagtc tccaacaatc atgtctgcat ctctagggga acgggtcacc
61 atgacctgca ctgccagctc aagtgtaagt tccagttact tgcactggta ccagcagaag
121 ccaggatcct cccccaaact ctggatttat agtacatcca acctggcttc tggagtccca
181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgccag cagtatcatc gttccccacc cacgttcggt
301 gctgggacca ag
//
LOCUS HUMINSR01 2085 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 1.
ACCESSION M23100 M32822
KEYWORDS Alu repetitive sequence; insulin receptor.
SEGMENT 1 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1933)
AUTHORS Seino,S., Seino,M., Nishi,S. and Bell,G.I.
TITLE Structure of the human insulin receptor gene and characterization
of its promoter
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 114-118 (1989)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 2085)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Bell, 14-MAR-1990.
FEATURES from to/span description
pept 1824 + 1923 human insulin receptor precursor, exon 1
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
sigp 1824 1904 human insulin receptor signal peptide
matp 1905 + 1923 human insulin receptor
pre-msg 1541 > 2085 hINSR mRNA and introns (alt.)
pre-msg 1542 > 2085 hINSR mRNA and introns (alt.)
pre-msg 1548 > 2085 hINSR mRNA and introns (alt.)
IVS 1924 > 2085 hINSR intron A
rpt < 1 76 Alu repeat
BASE COUNT 417 a 631 c 702 g 335 t
ORIGIN Chromosome 19p13.3-13.2.
1 agatctggcc attgcactcc agcctgggca acagagaaaa actccatcta aaaaaaaaaa
61 aaaaaaaaaa aaaaaacaga gagagagaga gagagagaga gaaggaaacg gaactggggg
121 gaggatttgc aaaaatatgg ttagggatgg cacttcagag atgaagccat cctggagtgt
181 tacgggcaag ggaaatgctg gggcaaagcc ccagaggcag gaataggttt ggcctgttgc
241 atgaacagtg ggtccagctc ctagcaaact gtttattgaa tgaaagaaga atgaatgcct
301 tgggtctagg gttgtgctgg gcgctttctt aagttttctt tcccgggtac ctccccagaa
361 ctggcatgca ggtattatta aacccattac acaagtgaaa ctggcccaga gacagaaaag
421 tccctggtcc aagaccacac aggagtgagg ggtggaggaa ccctcctccc attgagttct
481 ggctttccta tactgaaagc cccttcctct cctgcagtaa ggtaggtgga accgctgtcc
541 cgccttgttg gtgaatgtcg ttgctagact tcagacacat acaggctggt ctgctgaaaa
601 tcagagatgt ccacctgcgc cctattcgag gtctccggcg tcttctttgg cgtcgtcttt
661 gccctttcag aagcgtctgc acatttttcc aggtgtcatt tctccaactt gaacacaggg
721 agcgcactgg gcacgcgggc acgtggctgt ccccaggggc ctggcttggg tctcgcccct
781 gggccggggc gcacgcgcgg gcgggacatc tgggggcgcc cacgcgctct gggacgagtg
841 tcgctggcca ggcccggact gaggaaaggc gagtgagaca ctactcgcct ggggtgcaaa
901 atttaaggga gtgaaaaaaa aaaaaaaaga aagaaaccaa aaccacctcg agtcaccaaa
961 ataaacattt taatgcagta ttttttaaaa aatcaacagg aatcctccaa agcccactat
1021 gaacaaaata gcaaaatggt agagaaagga tctgtgccgc tgcgtcgggc ctgtggggcg
1081 cctccggggg tctgaaactg gaggagactc ggggctgtag ggcgcgcgga tctggggcgc
1141 gccctcggtc ccggcgcgcc cagggcctcc cgcgcggggc ccggcacagg gaggcgggga
1201 ggcgggcggg gcggggcggg accgggcggc acctccctcc cctgcaagct ttccctccct
1261 ctcctgggcc tctcccgggc gcagagtccc ttcctaggcc agatccgcgc cgccttttcc
1321 cgcggcccgc acggggccca gctgacgggc cgcgttgttt acgggccgga gcagccctct
1381 ctcccgccgc ccgcccgcca cccgccagcc caggtgcccg cccgccagtc agctagtccg
1441 tcggtccgcg cgtccctctg tcccggagcc cgcagatcgc gacccagagc gcgcggggcc
1501 gagagccgag agacagtccc gggcgcagcg cggagctccg ggccccgaga tcctgggacg
1561 gggcccgggc cgcagcggcc ggggggtcgg ggccaccacc gcaagggcct ccgctcagta
1621 tttgtagctg gcgaagccgc gcgcgccctt cccggggctg cctctgggcc ctccccggca
1681 ggggggctgc ggcccgcggg tcgcgggcgt ggaagagaag gacgcgcggc ccccagcgcc
1741 tcttgggtgg ccgcctcgga gcatgacccc cgcgggccag cgccgcgcgc tctgatccga
1801 ggagaccccg cgctcccgca gccatgggca ccgggggccg gcggggagcg gcggccgcgc
1861 cgctgctggt ggcggtggcc gcgctgctac tgggcgccgc gggccacctg taccccggag
1921 agggtgagtc tgggggcgcg ggcgtgggcg gggagcgccg cgatggggag aggaccccac
1981 ccaagccaaa atcgatcccc cgcttgtgga ctgagaaccc tccccagggg cggggggcgg
2041 tggccaggac ggtagctcct gcatcgcgta gggggagcgg gaagc
//
LOCUS HUMINSR02 928 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 2.
ACCESSION M32823
KEYWORDS insulin receptor.
SEGMENT 2 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 928)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 174 + 725 human insulin receptor precursor, exon 2
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 174 + 725 human insulin receptor
pre-msg < 1 > 927 hINSR mRNA and introns
IVS < 1 173 hINSR intron A
IVS 726 > 927 hINSR intron B
BASE COUNT 218 a 234 c 237 g 239 t
ORIGIN About 25.0 kbp downstream of segment 1.
1 tactttacag agaaagctac tcatcccggc tggctgcaga gtttacaggg cccgggatga
61 aaacacaggg cccaggtttc ctgtccatga agccggctct gcccctgatc cttctgatgc
121 atccaccgtg cgtctgctca cctgtcttgc tttctgttca ttttctcttg tagtgtgtcc
181 cggcatggat atccggaaca acctcactag gttgcatgag ctggagaatt gctctgtcat
241 cgaaggacac ttgcagatac tcttgatgtt caaaacgagg cccgaagatt tccgagacct
301 cagtttcccc aaactcatca tgatcactga ttacttgctg ctcttccggg tctatgggct
361 cgagagcctg aaggacctgt tccccaacct cacggtcatc cggggatcac gactgttctt
421 taactacgcg ctggtcatct tcgagatggt tcacctcaag gaactcggcc tctacaacct
481 gatgaacatc acccggggtt ctgtccgcat cgagaagaac aatgagctct gttacttggc
541 cactatcgac tggtcccgta tcctggattc cgtggaggat aattacatcg tgttgaacaa
601 agatgacaac gaggagtgtg gagacatctg tccgggtacc gcgaagggca agaccaactg
661 ccccgccacc gtcatcaacg ggcagtttgt cgaacgatgt tggactcata gtcactgcca
721 gaaaggtacg ccggggatac agggttctaa gcagtgtctc gtgccttgtt ctagaaagct
781 taaaatgttt tatggcttaa aaatgttaaa tggtcattag gtaggggccg gggaatagtg
841 ggtggtggca ttcactagcc cagggagtgg cagacatttt ctgtaaagac tcagatagta
901 gatacttcag attttgcagg ccatatgg
//
LOCUS HUMINSR03 639 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 3.
ACCESSION M32824
KEYWORDS insulin receptor.
SEGMENT 3 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 639)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 114 + 435 human insulin receptor precursor, exon 3
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 114 + 435 human insulin receptor
pre-msg < 1 > 639 hINSR mRNA and introns
IVS < 1 113 hINSR intron B
IVS 436 > 639 hINSR intron C
BASE COUNT 134 a 171 c 163 g 171 t
ORIGIN About 25.0 kbp downstream of segment 2.
1 gatccagaat tgctgcatat gcagacagga attggacaaa gccatttatt tatttattta
61 tttatttatt tatttattta tttatttccc tctctctctc tctctctctc cagtttgccc
121 gaccatctgt aagtcacacg gctgcaccgc cgaaggcctc tgttgccaca gcgagtgcct
181 gggcaactgt tctcagcccg acgaccccac caagtgcgtg gcctgccgca acttctacct
241 ggacggcagg tgtgtggaga cctgcccgcc cccgtactac cacttccagg actggcgctg
301 tgtgaacttc agcttctgcc aggacctgca ccacaaatgc aagaactcgc ggaggcaggg
361 ctgccaccaa tacgtcattc acaacaacaa gtgcatccct gagtgtccct ccgggtacac
421 gatgaattcc agcaagtgag ttctggatgt gggtctgggg ggcagccgag aggagaagga
481 acgtggggtt ggttgtgacg atgccgcttg ttaaaactgt gtgcaaaccc agggttaatt
541 ggctatgagt gaggtctctg ctctcagatg ctacttttgc accctgtttt ggtcctgggc
601 ttgggagtgg gagttgacta cctttttctc taaaggacc
//
LOCUS HUMINSR04 663 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 4.
ACCESSION M32825
KEYWORDS insulin receptor.
SEGMENT 4 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 663)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 318 + 466 human insulin receptor precursor, exon 4
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 318 + 466 human insulin receptor
pre-msg < 1 > 663 hINSR mRNA and introns
IVS < 1 317 hINSR intron C
IVS 467 > 663 hINSR intron D
BASE COUNT 159 a 195 c 171 g 138 t
ORIGIN About 15.0 kbp downstream of segment 3.
1 ccaacatggt aaccccgtct ctactcaaaa atacaaaaat tagccaggca cggtggcggg
61 cacctataat cccagctact gtggaggctg aggcaggaga atctcttgaa cccagaaggc
121 agaggttgca gtgagctgag atcgcaccac tgcactccag cctgggcaac agagcgagac
181 tctgtcacac aaacacacac acacacacaa agaaatacca tatcaggcag aaagatgcct
241 gagatgtctg aaggaccttg gataccgtga cacccccctc ccctttctct ttctctctct
301 ctctgctccg tccttagctt gctgtgcacc ccatgcctgg gtccctgtcc caaggtgtgc
361 cacctcctag aaggcgagaa gaccatcgac tcggtgacgt ctgcccagga gctccgagga
421 tgcaccgtca tcaacgggag tctgatcatc aacattcgag gaggcagtga gtgtctctgt
481 gtgggcgtcg ggggtgcctg ttgggctcca tgtccctctg agctgtgagc ggggaagaaa
541 agcagtgcag accctgctgc gtgctcctac agcactttta ggatggtcgt tcagtggctc
601 ccccatggat agaaccatgc tgggagtctg cctcaaaacc tgaaatgaac agctcagtct
661 tcc
//
LOCUS HUMINSR05 410 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 5.
ACCESSION M32826
KEYWORDS insulin receptor.
SEGMENT 5 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 410)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 188 + 332 human insulin receptor precursor, exon 5
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 188 + 332 human insulin receptor
pre-msg < 1 > 410 hINSR mRNA and introns
IVS < 1 187 hINSR intron D
IVS 333 > 410 hINSR intron E
BASE COUNT 105 a 80 c 100 g 125 t
ORIGIN About 3.0 kbp downstream of segment 4.
1 gggcagaagt atgcttgacc catttaagga atgctaagga cttcagattg tgttctaagc
61 atgatgagtt ttgagctggg tatgtccagt catttgcagc ctgagggtta tcttctcacc
121 atggagaatc atgagaagat tgaaatatgt ctatagaaac ccactggata ttctctcctt
181 tccttagaca atctggcagc tgagctagaa gccaacctcg gcctcattga agaaatttca
241 gggtatctaa aaatccgccg atcctacgct ctggtgtcac tttccttctt ccggaagtta
301 cgtctgattc gaggagagac cttggaaatt gggtacgtgg gcctgattgt gtgtatggcc
361 tgagtgctaa ctaggaagtt cgtgtattag aacaacttaa ggattttttt
//
LOCUS HUMINSR06 554 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 6.
ACCESSION M32827
KEYWORDS insulin receptor.
SEGMENT 6 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 554)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 189 + 403 human insulin receptor precursor, exon 6
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 189 + 403 human insulin receptor
pre-msg < 1 > 554 hINSR mRNA and introns
IVS < 1 188 hINSR intron E
IVS 404 > 554 hINSR intron F
BASE COUNT 154 a 129 c 130 g 141 t
ORIGIN About 1.0 kbp downstream of segment 5.
1 ggccatgaaa acttcctcaa cttcctctgt tatccacatt caacaaatat gtgttgagta
61 tgtgccaagc aagtggagag gattaggcac gtagcactga acaagatcaa ctccgagcat
121 ggccacacca tcttggagtt gtagaagacc agccgttgaa tgactagatg tgtgtgtttt
181 ttccatagga actactcctt ctatgccttg gacaaccaga acctaaggca gctctgggac
241 tggagcaaac acaacctcac catcactcag gggaaactct tcttccacta taaccccaaa
301 ctctgcttgt cagaaatcca caagatggaa gaagtttcag gaaccaaggg gcgccaggag
361 agaaacgaca ttgccctgaa gaccaatggg gaccaggcat cctgtaagtc actggtcccc
421 aacctttttg gcacgaggga ccggtttagt ggaagatggt ttttccatgg actggtggtg
481 ggtggggatg gtttcagcat gattcaagtg cattacattt actatgcact ttattcctat
541 tatgattaca ttgt
//
LOCUS HUMINSR07 592 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 7.
ACCESSION M32828
KEYWORDS insulin receptor.
SEGMENT 7 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 592)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 277 + 403 human insulin receptor precursor, exon 7
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 277 + 403 human insulin receptor
pre-msg < 1 > 592 hINSR mRNA and introns
IVS < 1 276 hINSR intron F
IVS 404 > 592 hINSR intron G
BASE COUNT 125 a 144 c 144 g 179 t
ORIGIN About 1.0 kbp downstream of segment 6.
1 ttgcgcgggt acagactgcg cttattcagt tgactgtctg gctgagtcaa gtcattggct
61 tacgtgagtg tgagtggcca agttgcaaaa ctggctctta cctttgaatc ttcccccatt
121 catactcagc caggcacatg gggaggagac ccttaaggga atagcagcat cacctctgcc
181 ttctcacggt ccctccagga agtgtggggg tcccaggctt tggtctgaaa ctacactgaa
241 atagctcatt tttgcctttt gttttaactt ttccaggtga aaatgagtta cttaaatttt
301 cttacattcg gacatctttt gacaagatct tgctgagatg ggagccgtac tggccccccg
361 acttccgaga cctcttgggg ttcatgctgt tctacaaaga ggcgtaagta gaagagttag
421 agagacgctg aggaggcgag ggctggctgg ctctgtgctt gctacgtttg tgctccaatc
481 tgcccctctt gggttcctgt ctatctccct cctcctcctg gaataaatat cttaggttcc
541 tttttacaat ctcaccagtc gatggcatgc aaagtcaata gtgtctgctt tt
//
LOCUS HUMINSR08 401 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 8.
ACCESSION M32829
KEYWORDS insulin receptor.
SEGMENT 8 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 401)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 124 + 374 human insulin receptor precursor, exon 8
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 124 + 374 human insulin receptor
pre-msg < 1 > 401 hINSR mRNA and introns
IVS < 1 123 hINSR intron G
IVS 375 > 401 hINSR intron H
BASE COUNT 90 a 98 c 112 g 101 t
ORIGIN About 3.0 kbp downstream of segment 7.
1 cattagattg ttgggtgagt aacatgtgac cctatgggat gtaacttccc aggcctcatc
61 tgcacggcac tcagtgtgac ggtcttgtaa gggtaactgc cttctgctgt tttgtcttga
121 aagcccttat cagaatgtga cggagttcga tgggcaggat gcgtgtggtt ccaacagttg
181 gacggtggta gacattgacc cacccctgag gtccaacgac cccaaatcac agaaccaccc
241 agggtggctg atgcggggtc tcaagccctg gacccagtat gccatctttg tgaagaccct
301 ggtcaccttt tcggatgaac gccggaccta tggggccaag agtgacatca tttatgtcca
361 gacagatgcc accagtgagt gtgtcttggg aatgtgaatt c
//
LOCUS HUMINSR09 420 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 9.
ACCESSION M32830
KEYWORDS insulin receptor.
SEGMENT 9 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 420)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 106 + 273 human insulin receptor precursor, exon 9
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 106 + 273 human insulin receptor
pre-msg < 1 > 420 hINSR mRNA and introns
IVS < 1 105 hINSR intron H
IVS 274 > 420 hINSR intron I
BASE COUNT 85 a 125 c 94 g 116 t
ORIGIN About 3.0 kbp downstream of segment 8.
1 ggtgccctca tgatgtcttt aacttgtgtg tcccccgcca tcctcccacc agctttcttt
61 gcacactgtt tctcatgatg gacccgtttc ctttctccct ggcagacccc tctgtgcccc
121 tggatccaat ctcagtgtct aactcatcat cccagattat tctgaagtgg aaaccaccct
181 ccgaccccaa tggcaacatc acccactacc tggttttctg ggagaggcag gcggaagaca
241 gtgagctgtt cgagctggat tattgcctca aaggtgagtg caggcagctg tgctaggatc
301 ggtggggttt gcacacgtgt gtctgatgca ctttgcttca cctctaggga agcagctatc
361 tcttcctgtg tctcagtgtc ggaaggcaca cacacacact ccattctatc tcatatgaaa
//
LOCUS HUMINSR10 517 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 10.
ACCESSION M32831
KEYWORDS insulin receptor.
SEGMENT 10 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 517)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 187 + 388 human insulin receptor precursor, exon 10
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 187 + 388 human insulin receptor
pre-msg < 1 > 517 hINSR mRNA and introns
IVS < 1 186 hINSR intron I
IVS 389 > 517 hINSR intron J
BASE COUNT 83 a 88 c 194 g 152 t
ORIGIN About 11.0 kbp downstream of segment 9.
1 tttgtggtgt gtgtatgtgt ggtgtgttgt gtgatgtgtg tggtgtgtgt gtgggggggt
61 gtgtggtgtg tgtatgtgtg gtgtgtgtgg tgtgtgtgtg tggtgtgtgt gtgtgggggg
121 ggtgtgtgtg tgtatgtgtg ttcagccgca gagacttgag cccccctttt ctgtttcttt
181 ctccagggct gaagctgccc tcgaggacct ggtctccacc attcgagtct gaagattctc
241 agaagcacaa ccagagtgag tatgaggatt cggccggcga atgctgctcc tgtccaaaga
301 cagactctca gatcctgaag gagctggagg agtcctcgtt taggaagacg tttgaggatt
361 acctgcacaa cgtggttttc gtccccaggt caggacttgg cgctgggctc tcttagtggg
421 tgccaattgg cttggtgttg gtggaaggtc attacttagg gaccgagagg tagtgggagg
481 gagagacggc agaaccctgg gtggagtctg aatggag
//
LOCUS HUMINSR11 343 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 11.
ACCESSION M32832
KEYWORDS insulin receptor.
SEGMENT 11 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 343)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 123 + 158 human insulin receptor precursor, exon 11
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 123 + 158 human insulin receptor
pre-msg < 1 > 343 hINSR mRNA and introns
IVS < 1 122 hINSR intron J
IVS 159 > 343 hINSR intron K
BASE COUNT 68 a 97 c 98 g 80 t
ORIGIN About 2.0 kbp downstream of segment 10.
1 tggtccaggg tcaaagccag ggtgccctta ctcggacaca tgtggcctcc aagtgtcaga
61 gcccagtggt ctgtctaatg aagttccctc tgtcctcaaa ggcgttggtt ttgtttccac
121 agaaaaacct cttcaggcac tggtgccgag gaccctaggt atgactcacc tgtgcgaccc
181 ctggtgcctg ctccgcgcag ggccggcggc gtgccaggca gatgcctcgg agaacccagg
241 ggtttctctg gctttttgca tgcggcgggc agctgtgctg gagagcagat gcttcaccaa
301 ttcagaaatc caatgccttc actctgaaat gaaatctggg cat
//
LOCUS HUMINSR12 719 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 12.
ACCESSION M32833
KEYWORDS insulin receptor.
SEGMENT 12 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 719)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 161 + 435 human insulin receptor precursor, exon 12
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 161 + 435 human insulin receptor
pre-msg < 1 > 719 hINSR mRNA and introns
IVS < 1 160 hINSR intron K
IVS 436 > 719 hINSR intron L
BASE COUNT 137 a 198 c 195 g 189 t
ORIGIN About 8.0 kbp downstream of segment 11.
1 ggtcattcct ggcagtctgt attgtaatcc atgttcccca ttgctgcacc ctcctgcgct
61 ctgatctttc ttcttaatca agccttttat tctccagtgt cactttttta aaaaaaatga
121 tggtgatggt gtcatcatac atgtcctact gtcgttccag gccatctcgg aaacgcaggt
181 cccttggcga tgttgggaat gtgacggtgg ccgtgcccac ggtggcagct ttccccaaca
241 cttcctcgac cagcgtgccc acgagtccgg aggagcacag gccttttgag aaggtggtga
301 acaaggagtc gctggtcatc tccggcttgc gacacttcac gggctatcgc atcgagctgc
361 aggcttgcaa ccaggacacc cctgaggaac ggtgcagtgt ggcagcctac gtcagtgcga
421 ggaccatgcc tgaaggtagg gctgctggtc cggggtccga gtgtcatggg tgggacatca
481 aggctgactt tttgtttgag acggagcctt gctctgtcgc ccaggctgga gtacagtggt
541 gcgacctcag ctcactccag cctctgccac ctatgtcaag tgattccctg cttcagcctc
601 ccaagtagct gggactacag gtgtctgcca ccacgcccag ctaatttttg tatttttagt
661 agagatgggg tttcaccata ttgcccaggc tggtcttgaa ctcctgggct caagtgatc
//
LOCUS HUMINSR13 439 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 13.
ACCESSION M32834
KEYWORDS insulin receptor.
SEGMENT 13 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 439)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 93 + 232 human insulin receptor precursor, exon 13
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 93 + 232 human insulin receptor
pre-msg < 1 > 439 hINSR mRNA and introns
IVS < 1 92 hINSR intron L
IVS 233 > 439 hINSR intron M
BASE COUNT 98 a 114 c 105 g 122 t
ORIGIN About 1.0 kbp downstream of segment 12.
1 gtcaccagcc caaggttgca ccatggacag gtggcagaag tgggatctca tccaagagtt
61 acatccctgc ctctcacttc ctctccttac agccaaggct gatgacattg ttggccctgt
121 gacgcatgaa atctttgaga acaacgtcgt ccacttgatg tggcaggagc cgaaggagcc
181 caatggtctg atcgtgctgt atgaagtgag ttatcggcga tatggtgatg aggtaaggcc
241 cttgactctt gggcatgccc ctgcaccact tcagcatgcc ccttcagagt tgcacttggt
301 acctccttcc tctgctgaaa ttttgattcc agtgcttctc tcatcaggta ctgtgctatt
361 agtacttaaa gccttgatac ctgacttcgc aggaagatgg gtcagaaatg ccaatctacc
421 agcttgttac ttttcttag
//
LOCUS HUMINSR14 386 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 14.
ACCESSION M32835
KEYWORDS insulin receptor.
SEGMENT 14 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 386)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 85 + 244 human insulin receptor precursor, exon 14
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 85 + 244 human insulin receptor
pre-msg < 1 > 386 hINSR mRNA and introns
IVS < 1 84 hINSR intron M
IVS 245 > 386 hINSR intron N
BASE COUNT 62 a 123 c 115 g 86 t
ORIGIN About 6.0 kbp downstream of segment 13.
1 tggctgtgag ctccctgcga ggggtggaca ctcccagatg tgcaaagctc agccaccctc
61 cttctcctcc tctcttcctc ccaggagctg catctctgcg tctcccgcaa gcacttcgct
121 ctggaacggg gctgcaggct gcgtgggctg tcaccgggga actacagcgt gcgaatccgg
181 gccacctccc ttgcgggcaa cggctcttgg acggaaccca cctatttcta cgtgacagac
241 tattgtaagt ctccatggca gcctcagctg actggggctg tgcttagcac tgagcatggt
301 gggacattgc aggggatgac ttggagaggc cgcagtgctg gccctggcct tgactctcag
361 gcctatcagc tgctgcggtg cttgcc
//
LOCUS HUMINSR15 429 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 15.
ACCESSION M32836
KEYWORDS insulin receptor.
SEGMENT 15 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 429)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 92 + 194 human insulin receptor precursor, exon 15
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 92 + 194 human insulin receptor
pre-msg < 1 > 428 hINSR mRNA and introns
IVS < 1 91 hINSR intron N
IVS 195 > 428 hINSR intron O
BASE COUNT 117 a 67 c 82 g 163 t
ORIGIN About 3.0 kbp downstream of segment 14.
1 cccacccatt ccaggagtgg atgtgatttt tgatgtgaac tttgttggaa acacattgat
61 atgaaacata tattttctta ttctatttca gtagacgtcc cgtcaaatat tgcaaaaatt
121 atcatcggcc ccctcatctt tgtctttctc ttcagtgttg tgattggaag tatttatcta
181 ttcctgagaa agaggtgagt tcagtgagtt cagtggtgtg ctgggaacag ttggttctct
241 gggggaaaac atgccttgat ataggtatag gcatatttaa gtttattatg aattttgctg
301 atataggatg tgtaacatgc aatttacaga taattgtcat aatatgatat acacaactct
361 ttattgtaaa ttccctctag acagttgatt ctcacagaat gtttttattg attttttttt
421 ttgcccaaa
//
LOCUS HUMINSR16 480 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 16.
ACCESSION M32837
KEYWORDS insulin receptor.
SEGMENT 16 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 480)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 261 + 328 human insulin receptor precursor, exon 16
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 261 + 328 human insulin receptor
pre-msg < 1 > 480 hINSR mRNA and introns
IVS < 1 260 hINSR intron O
IVS 329 > 480 hINSR intron P
BASE COUNT 123 a 131 c 109 g 117 t
ORIGIN About 2.0 kbp downstream of segment 15.
1 aaaaacaaaa acaaaaacaa aacaaaaaaa aaaccaccca gggagggatg agtgctccca
61 tgttgatgca cttacatacc tgtctgatgg gcttccattc aaaacataaa ggtcccccat
121 ccctgcccta gactgcatct aggattatgg ggattctgct ggtaagggct gccatttgcc
181 ttggggagtc ttgtatgaaa cacctttctg cagagtccca tgagaatctc aagctaacgt
241 gcctcgtttt cctcctccag gcagccagat gggccgctgg gaccgcttta cgcttcttca
301 aaccctgagt atctcagtgc cagtgatggt gagtaccatc ccttccctgt gggtggccag
361 aaccctactc atcagcttcc tttgccttca ccattgagtg agagtgaagg atgggttccc
421 cagggaggcc aagaaaagcc ctcttattca tttgagcttg ccaaactgcc cttgctgcag
//
LOCUS HUMINSR17 485 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 17.
ACCESSION M32838
KEYWORDS insulin receptor.
SEGMENT 17 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 485)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 136 + 380 human insulin receptor precursor, exon 17
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 136 + 380 human insulin receptor
pre-msg < 1 > 485 hINSR mRNA and introns
IVS < 1 135 hINSR intron P
IVS 381 > 485 hINSR intron Q
BASE COUNT 96 a 119 c 162 g 108 t
ORIGIN About 1.0 kbp downstream of segment 16.
1 cccggcatgg gtcctggatc acagaactca tttcatgagt gttttcgagg gggtttgggt
61 gagggcttgg gtggaaggtg gctgcagacc cccaagggat cctccaagga tgctgtgtag
121 ataagtaaga agtagtgttt ccatgctctg tgtacgtgcc ggacgagtgg gaggtgtctc
181 gagagaagat caccctcctt cgagagctgg ggcagggctc cttcggcatg gtgtatgagg
241 gcaatgccag ggacatcatc aagggtgagg cagagacccg cgtggcggtg aagacggtca
301 acgagtcagc cagtctccga gagcggattg agttcctcaa tgaggcctcg gtcatgaagg
361 gcttcacctg ccatcacgtg gtgagtccag tgggggtggg acatgggctg gctttcctga
421 cccttccctt tctctgcctc ctcctcctgc acagagcgac agaggacaca gggtgtatcc
481 tccta
//
LOCUS HUMINSR18 287 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 18.
ACCESSION M32839
KEYWORDS insulin receptor.
SEGMENT 18 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 287)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 117 + 227 human insulin receptor precursor, exon 18
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 117 + 227 human insulin receptor
pre-msg < 1 > 287 hINSR mRNA and introns
IVS < 1 116 hINSR intron Q
IVS 228 > 287 hINSR intron R
BASE COUNT 51 a 85 c 98 g 53 t
ORIGIN About 2.0 kbp downstream of segment 17.
1 acgctgcatc caggccacag ggtgctgtgt gtgacataga caccagggag ggaggagaac
61 cctggtgagt cgaatcacgg accctcctcc aagaaccctg gttgcttgct ctgcaggtgc
121 gcctcctggg agtggtgtcc aagggccagc ccacgctggt ggtgatggag ctgatggctc
181 acggagacct gaagagctac ctccgttctc tgcggccaga ggctgaggta agctgcttcg
241 ggggacccag cggggtactc ggtggagcac ccgctcctgg cctcctc
//
LOCUS HUMINSR19 322 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 19.
ACCESSION M32840
KEYWORDS insulin receptor.
SEGMENT 19 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 322)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 45 + 204 human insulin receptor precursor, exon 19
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 45 + 204 human insulin receptor
pre-msg < 1 > 322 hINSR mRNA and introns
IVS < 1 44 hINSR intron R
IVS 205 > 322 hINSR intron S
BASE COUNT 81 a 76 c 79 g 86 t
ORIGIN About 0.5 kbp downstream of segment 18.
1 gatcccagtg ctgctgaaac accaaccccg tgtttctgtt ttagaataat cctggccgcc
61 ctccccctac ccttcaagag atgattcaga tggcggcaga gattgctgac gggatggcct
121 acctgaacgc caagaagttt gtgcatcggg acctggcagc gagaaactgc atggtcgccc
181 atgattttac tgtcaaaatt ggaggttcgt ctggctttct gctttgaaaa cataacgacc
241 caggccaggt ttgatttcag aaggaagttg tctataatga gccgttaagt cttttctgat
301 aatataaagg ggcaagtact tc
//
LOCUS HUMINSR20 288 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 20.
ACCESSION M32841
KEYWORDS insulin receptor.
SEGMENT 20 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 288)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 115 + 244 human insulin receptor precursor, exon 20
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 115 + 244 human insulin receptor
pre-msg < 1 > 288 hINSR mRNA and introns
IVS < 1 114 hINSR intron S
IVS 245 > 288 hINSR intron T
BASE COUNT 61 a 55 c 102 g 70 t
ORIGIN About 0.5 kbp downstream of segment 19.
1 gacgtgggcc aggtgaaccc ctcttagggc tctgtgagag gtggggcagt caaggtggca
61 gatgctagga ccaaggctga aggttaagag cgtgtgaacc ttttgtgttg tcagactttg
121 gaatgaccag agacatctat gaaacggatt actaccggaa agggggcaag ggtctgctcc
181 ctgtacggtg gatggcaccg gagtccctga aggatggggt cttcaccact tcttctgaca
241 tgtggtgagt tgtgtgtgga tgggtggatg gacgctgggc ttgaattc
//
LOCUS HUMINSR21 407 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 21.
ACCESSION M32842
KEYWORDS insulin receptor.
SEGMENT 21 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 407)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 101 + 235 human insulin receptor precursor, exon 21
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 101 + 235 human insulin receptor
pre-msg < 1 > 407 hINSR mRNA and introns
IVS < 1 100 hINSR intron T
IVS 236 > 407 hINSR intron U
BASE COUNT 73 a 75 c 118 g 141 t
ORIGIN About 1.0 kbp downstream of segment 20.
1 ttgcgtgtgt gtgtgcgttt gcgtgtgtgt gtttgcgcgc gcgcgtgtgt gtgtgtgtct
61 aaatggcttc tttgttacta ctatcaactg tcatcggcag gtcctttggc gtggtccttt
121 gggaaatcac cagcttggca gaacagcctt accaaggcct gtctaatgaa caggtgttga
181 aatttgtcat ggatggaggg tatctggatc aacccgacaa ctgtccagag agagtgtaag
241 tgtagaaagg gtttaaggtg tgtgaggtgt tcgttgaaag ggtattgccc tttacacgtg
301 tgcttggttt tgcctttcct atgtctacac gctcaccgtg tttgcatgct gtatgttaca
361 ggtgtgtttg tgtttgcata gcttgtcttt acatgcatgc ttgcatt
//
LOCUS HUMINSR22 873 bp ds-DNA PRI 10-JUL-1990
DEFINITION Human insulin receptor (hINSR) gene, exon 22.
ACCESSION M32972
KEYWORDS insulin receptor.
SEGMENT 22 of 22
SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 873)
AUTHORS Seino,S., Seino,M. and Bell,G.I.
TITLE Human insulin-receptor gene: Partial sequence and amplification of
exons by polymerase chain reaction
JOURNAL Diabetes 39, 123-128 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.I.Bell, 14-MAR-1990.
FEATURES from to/span description
pept + 83 437 human insulin receptor precursor, exon 22
/hgml_locus_uid="LG0007M" /nomgen="INSR"
/map="19p13.3-p13.2"
matp + 83 434 human insulin receptor
pre-msg < 1 873 hINSR mRNA and introns
IVS < 1 82 hINSR intron U
BASE COUNT 199 a 217 c 234 g 223 t
ORIGIN About 2.0 kbp downstream of segment 21.
1 ctgcagggac aagagtgggg gtttgggagg atgcgtggca gggcccccag actcacccag
61 gacgtgtcct tctgccccgc agcactgacc tcatgcgcat gtgctggcaa ttcaacccca
121 agatgaggcc aaccttcctg gagattgtca acctgctcaa ggacgacctg caccccagct
181 ttccagaggt gtcgttcttc cacagcgagg agaacaaggc tcccgagagt gaggagctgg
241 agatggagtt tgaggacatg gagaatgtgc ccctggaccg ttcctcgcac tgtcagaggg
301 aggaggcggg gggccgggat ggagggtcct cgctgggttt caagcggagc tacgaggaac
361 acatccctta cacacacatg aacggaggca agaaaaacgg gcggattctg accttgcctc
421 ggtccaatcc ttcctaacag tgcctaccgt ggcgggggcg ggcaggggtt cccattttcg
481 ctttcctctg gtttgaaagc ctctggaaaa ctcaggattc tcacgactct accatgtcca
541 gtggagttca gagatcgttc ctatacattt ctgttcatct taaggtggac tcgtttggtt
601 accaatttaa ctagtcctgc agaggattta actgtgaacc tggagggcaa ggggtttcca
661 cagttgctgc tcctttgggg caacgacggt ttcaaaccag gattttgtgt tttttcgttc
721 cccccacccg cccccagcag atggaaagaa agcacctgtt tttacaaatt cttttttttt
781 tttttttttt tttttttttg ctggtgtctg agcttcagta taaaagacaa aacttcctgt
841 ttgtggaaca aaatttcgaa agaaaaaacc aaa
//
LOCUS BT1NAMTA 1091 bp ds-DNA PHG 10-JUL-1990
DEFINITION Bacteriophage T1 DNA N-6-adenine-methyltransferase (M.T1) gene,
complete cds.
ACCESSION J05393
KEYWORDS DNA N-6-adenine-methyltransferase.
SOURCE Bacteriophage T1 DNA.
ORGANISM Bacteriophage T1
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 1091)
AUTHORS Schneider-Scherzer,E., Auer,B., de Groot,E.J. and Schweiger,M.
TITLE Primary structure of a DNA (N-6-adenine)-methyltransferase from
Escherichia coli virus T1
JOURNAL J. Biol. Chem. 265, 6086-6091 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 171 824 DNA N-6-adenine-methyltransferase (M.T1)
pept 824 1072 pot. protein HP 83
binding 161 164 ribosomal binding site (put.)
binding 813 816 ribosomal binding site (put.)
signal 141 146 TATA box
BASE COUNT 345 a 205 c 266 g 275 t
ORIGIN
1 aaaagggaag tttctcaaaa aggtccggga gcgtggcggc ttctctgccg tcgcatacgg
61 attcgggcaa ttcaagatcg caatttacga aatgatgaaa tagcactttt tgttaaaact
121 gccgggatgg aatctggcat tattatctca ccaaaacgag aggaataaaa atgaaagact
181 ttaatgatat cgaaactatc gactttgcag aaactggttg ctcattcact cgcgaagcaa
241 tagcatcagg cggttattat caggcattga aaacgccaac ctgtaaagag atttcagggc
301 gtcgatacaa ggggacaaat acccctgacg ctgttcgtga tttatggtca actccgcgag
361 aggttattgc ataccttgag ggtcgttatg ggaaatatga tctcgacgct gcggcaagcg
421 aagaaaataa agtttgcgag aagttttact ctcaggaaac aaactgctta aaacgttggt
481 ggggaaagaa taagcacgtt tggttaaatc ctccttatag ccgacctgat atatttgtca
541 actctactgc gtggtttact gaagcgcggc agaacgcagc tgaaataatc tggattgaag
601 cggacttgac tgaggatatt gacggcaatg aatacgcacg atccggtcgc ctggctttca
661 tatccggtga aactggaaag gccgtagacg gtaataacaa aggttcggta atttttatta
721 tgcgcgaact taaagaaggt gaggtgcaac agactcacta catcccaatc acaagcattt
781 gcccttcggt gaaaaacaaa cgagcaaagg tgaggaaagt atgatgagcg aaaaaatggt
841 tcctgttaaa ttaactgagc aaggtttatg gctactttat cgagctacgt gctgcgaaat
901 tatggagcga aacggattga ctcaggatgt tattggttgc gatctgtggg agttcactag
961 ttctcttgat atgcttttcg atgagataaa aaatgaatac atagagaact ggccttcaat
1021 catacagaaa gacgtggaag aacttaaagc tgatacaatc gtacagcact aattgctaaa
1081 actacccggc g
//
LOCUS STVBLSG 1130 bp ds-DNA BCT 10-JUL-1990
DEFINITION Streptoverticillum sp. blasticidin S-acetyltransferase (bls) gene,
complete cds.
ACCESSION M34537
KEYWORDS blasticidin S-acetyltransferase.
SOURCE Streptoverticillum sp. (strain JCM4673) DNA.
ORGANISM Streptoverticillum sp.
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 1130)
AUTHORS Perez-Gonzalez,J.A., Ruiz,D., Esteban,J.A. and Jimenez,A.
TITLE Cloning and characterization of the gene encoding a blasticidin S
acetyltransferase from Streptoverticillum sp
JOURNAL Gene 86, 129-134 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 147 557 blasticidin S-acetyltransferase (bls) (147
could be 225)
binding 136 139 ribosomal binding site (put.)
BASE COUNT 151 a 420 c 394 g 165 t
ORIGIN
1 gatcagcgcc ggcccccacc ggcactgtgc atcagcgtac ggccggggta cgacaacgga
61 agcggattgg caaaactgcc tggccccggt gtttatggtg agctttatgt tcagtattga
121 ggcggtgaac gacccggaac gacgcgatgt tgtccttgcc acggttgcag accgtcaacg
181 acgaacgttc gcccgccctg cgggcgttgc ggcgcacgcc ggtgatggag gcgcggccgc
241 tggaggtgta cgccacgtac gcctgcggcg agcgcgggga gctggcgggc gggctcgtcg
301 gtcatgtgca gtggcaatgg ctgcacgtgg acctgctgtg ggtggacgcg ggggcccgcg
361 gggcggggct gggctcgcgg ttgatcgcgc gggcggaggc ccgcgcccgg gaggagttcg
421 gctgcatcgg cagccaggtg gagacctggg acttccaggc gccggggttc taccagcggg
481 tggggtatcg cctcgcggcg agcatcccgg actatccgcc cgggatcacg agccacctgc
541 tggtgaagga gctttgaggc gccccgtcag gggcgcgggg ccgttactcc ggggctgcgc
601 cccggacccc cgggtggcgc gtcgactgcg ggccggtggg ggcttgtcgc gcagttcccc
661 gcgcccctta cggggcgcct ggtcgcgccc acgcggcgga gccgcatatc gagcacagcc
721 ccgcgcccct tacggggcgc tgctctaggc cacccgccgt gccccctccc ccgccgccgt
781 gccgaacagt cgtgccgtcc ccagtgcctc ggtgaccacc ttggtcaccc tttcctcatc
841 tgccccatcc accaaggcga ttgccgagcc gccgaagccg ccgcccgtca tccgggcccc
901 cagggccccc gccttcaccg ccgtctccac caccacgtcc aattccgcac aggacacccg
961 gaagtcgtcg cgcagcgagg cgtgcccctc cgtcagcagt gggcccacag ccctcgcatc
1021 ccccgcggcc agcagggccg cgacccgctc cacccggtcg ttctccgtca ccacgtgacg
1081 gaccaaggcg cgctccgcgg caggcaactc acccagtgcc gcctgcagac
//
LOCUS HUMGAPDH 1268 bp ss-mRNA PRI 10-JUL-1990
DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) mRNA,
complete cds.
ACCESSION M33197
KEYWORDS glyceraldehyde-3-phosphate dehydrogenase.
SOURCE Human lung cancer cell, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1268)
AUTHORS Tokunaga,K., Nakamura,Y., Sakata,K., Fujimori,K., Ohkubo,M.,
Sawada,K. and Sakiyama,S.
TITLE Enhanced expression of a glyceraldehyde-3-phosphate dehydrogenase
gene in human lung cancers
JOURNAL Cancer Res. 47, 5616-5619 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 61 1068 glyceraldehyde-3-phosphate dehydrogenase (EC
1.2.1.12) /hgml_locus_uid="LM0055R"
/nomgen="GAPD" /map="12p13"
mRNA < 1 1268 GAPDH mRNA
BASE COUNT 295 a 385 c 326 g 262 t
ORIGIN
1 gttcgacagt cagccgcatc ttcttttgcg tcgccagccg agccacatcg ctcagacacc
61 atggggaagg tgaaggtcgg agtcaacgga tttggtcgta ttgggcgcct ggtcaccagg
121 gctgctttta actctggtaa agtggatatt gttgccatca atgacccctt cattgacctc
181 aactacatgg tttacatgtt ccaatatgat tccacccatg gcaaattcca tggcaccgtc
241 aaggctgaga acgggaagct tgtcatcaat ggaaatccca tcaccatctt ccaggagcga
301 gatccctcca aaatcaagtg gggcgatgct ggcgctgagt acgtcgtgga gtccactggc
361 gtcttcacca ccatggagaa ggctggggct catttgcagg ggggagccaa aagggtcatc
421 atctctgccc cctctgctga tgcccccatg ttcgtcatgg gtgtgaacca tgagaagtat
481 gacaacagcc tcaagatcat cagcaatgcc tcctgcacca ccaactgctt agcacccctg
541 gccaaggtca tccatgacaa ctttggtatc gtggaaggac tcatgaccac agtccatgcc
601 atcactgcca cccagaagac tgtggatggc ccctccggga aactgtggcg tgatggccgc
661 ggggctctcc agaacatcat ccctgcctct actggcgctg ccaaggctgt gggcaaggtc
721 atccctgagc tgaacgggaa gctcactggc atggccttcc gtgtccccac tgccaacgtg
781 tcagtggtgg acctgacctg ccgtctagaa aaacctgcca aatatgatga catcaagaag
841 gtggtgaagc aggcgtcgga gggccccctc aagggcatcc tgggctacac tgagcaccag
901 gtggtctcct ctgacttcaa cagcgacacc cactcctcca cctttgacgc tggggctggc
961 attgccctca acgaccactt tgtcaagctc atttcctggt atgacaacga atttggctac
1021 agcaacaggg tggtggacct catggcccac atggcctcca aggagtaaga cccctggacc
1081 accagcccca gcaagagcac aagaggaaga gagagaccct cactgctggg gagtccctgc
1141 cacactcagt cccccaccac actgaatctc ccctcctcac agttgccatg tagacccctt
1201 gaagagggga ggggcctagg gagccgcacc ttgtcatgta ccatcaataa agtaccctgt
1261 gctcaacc
//
LOCUS MUSMK2P 728 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Mouse retinoic acid-responsive protein (MK) mRNA, complete cds.
ACCESSION M35833 J05447
KEYWORDS MK protein; retinoic acid-responsive protein.
SOURCE Mouse (strain BALB/c) adult liver, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (sites)
AUTHORS Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T.
TITLE Structure of a retinoic acid-responsive gene, MK, which is
transiently activated during the differentiation of embryonal
carcinoma cells and the mid-gestation period of mouse embryogenesis
JOURNAL J. Biol. Chem. 265, 9441-9443 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 728; for [1])
AUTHORS Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
M.Shyuichiro, 20-APR-1990, for release after publication.
FEATURES from to/span description
pept 44 466 retinoic acid-responsive protein MK precursor
sigp 44 109 retinoic acid-responsive protein MK signal
peptide
matp 110 463 retinoic acid-responsive protein MK
mRNA 1 728 MK2 mRNA
BASE COUNT 184 a 211 c 206 g 127 t
ORIGIN
1 caggccggag cgggagggag cgaagcatcg agcagtgagc gagatgcagc accgaggctt
61 cttccttctc gcccttcttg ccctcttggt ggtcacgtcc gcggtggcca aaaaaaaaga
121 gaaggtgaag aagggcagcg agtgttcgga gtggacctgg gggccctgca cccccagcag
181 caaggactgc ggcatgggct tccgcgaggg tacctgtggg gcccagaccc agcgcgtcca
241 ttgcaaggtg ccctgcaact ggaagaagga atttggagcc gactgcaaat acaagtttga
301 gagctggggg gcgtgtgatg ggagcactgg caccaaagcc cgccaaggga ccctgaagaa
361 ggcgcggtac aatgcccagt gccaggagac catccgcgtg actaagccct gcacctccaa
421 gaccaagtca aagaccaaag ccaagaaagg aaaaggaaag gactaagtca ggaggccaga
481 gagcctccgg cctcgcctgg agcctgaacg gagccctcct ctcccacagg cccaagatat
541 aacccaccag tgccttttgt cttcctgtca gctctgtcaa tcacgcctgt cctctcacgc
601 ccacaccaag tgcccaaagt ggggagggac aagagattct ggaaagtgag cctccccata
661 ccctcttttg ttctccccac cctgatactt gttattaaga aatgaataaa ataaactcac
721 ttttttcc
//
LOCUS MUSMKPG 2929 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse retinoic acid-responsive protein (MK) gene, complete cds.
ACCESSION M34094 J05447
KEYWORDS MK protein; alternative splicing; retinoic acid-responsive protein.
SOURCE Mouse (strain BALB/c) adult liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2929)
AUTHORS Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T.
TITLE Structure of a retinoic acid-responsive gene, MK, which is
transiently activated during the differentiation of embryonal
carcinoma cells and the mid-gestation period of mouse embryogenesis
JOURNAL J. Biol. Chem. 265, 9441-9443 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
M.Shyuichiro, 20-APR-1990, for release after publication.
FEATURES from to/span description
pept 1298 1373 retinoic acid-responsive protein (MK)
precursor, exon 1
1491 1649 retinoic acid-responsive protein (MK)
precursor, exon 2
1766 1927 retinoic acid-responsive protein (MK)
precursor, exon 3
2631 2656 retinoic acid-responsive protein (MK)
precursor, exon 4
sigp 1298 1363 retinoic acid-responsive protein (MK) signal
peptide
matp 1364 1373 retinoic acid-responsive protein (MK)
1491 1649 retinoic acid-responsive protein (MK)
1766 1927 retinoic acid-responsive protein (MK)
2631 2653 retinoic acid-responsive protein (MK)
pre-msg 463 2918 MK3 mRNA and introns (minor alt.)
pre-msg 1007 2918 MK2 mRNA and introns (major alt.)
pre-msg 1048 2918 MK1 mRNA and introns (minor alt.)
IVS 816 1296 MK3 intron A
IVS 1052 1296 MK2 intron A
IVS 1374 1490 MK1 intron A, and MK2 and MK3 intron B
IVS 1650 1765 MK1 intron B, and MK2 and MK3 intron C
IVS 1928 2630 MK1 intron C, and MK2 and MK3 intron D
signal 85 91 GC box
signal 143 149 GC box
signal 274 280 GC box
signal 598 604 GC box
signal 852 858 GC box
signal 910 916 GC box
signal 939 945 GC box
BASE COUNT 660 a 771 c 930 g 568 t
ORIGIN
1 tggccaccaa catctcagat cacttcggga gatgggtctg ccccgatcct gacctctgcc
61 tagggcctta ggctcacagc gcctggggcg gagctgattt tcccgctcct gcagggatga
121 taacaatgaa agtaaaagag gtggggcggg ggccaggctt gggttctttg gtcttttggc
181 cctgtgccct ggagcagtcc cctccccctg gcttgtactg gggggggggg gggggatctg
241 cttgaggtga gcctgaggcc ccagggtcag gggtgggcgg ttatcacctc cgggggaagc
301 ccggtctgga acttctcaga cagctcttgt cagcgacaag atttaccaaa ctcatttcta
361 tgtgcttccc catccccccc aacgcccttc cctcctcctc ctcccccaaa cctgcactag
421 aaaaaggctc tcgagccttg ctcacccgga gccatctgag gtcccaggta cccagctccc
481 tgccacatca gagacccttc ttgcactctg agtgaactga ttaaaaaaaa aaaaaaaaaa
541 aaaaaaccaa gccggaggtg agccgggcct cgaagggaag gttcgcgggt gcggtggccg
601 ccccgagcct gtgacaccag gacatactcc cggggcccgc ggtgggcaag cgaagtggtg
661 acctgagagc tgacaggctg cgagagggaa aagtatagac aggcctagac caggggaagg
721 ggaggggata gagagctggg cctgctacga ggggacctga gccagaagcg cactggtaaa
781 accgaactcc aggaccagag acccagagat cagaggtgag aggcacagac gcgggagtcc
841 cggctcggcg aggggcggga gtggaggcgg ggactagggg ggtctgggga ggtgcgggtt
901 tggggggagg gggcgggtcc ttccacggga tggggggagg ggcgggggcc catgtgaccg
961 gctcagaccg gttctggaga caaaaggggc cttagcggcc ttagcgggac aggccggagc
1021 gggagggagc gaagcatcga gcagtgagcg agtgagcgca cgcagtggct gtggccccag
1081 tcccttcagg cggctgctct gccaccaagg gggctgaggt gggggtgggg gtacgctgag
1141 acatcggttc caagtcctcc ctccgtctcc cccttgtcgg tccgacgttt tgggcctgga
1201 aagtgggaca agtcagtcaa gggtgggagg tccttcccgc ggttcctagc ggagaagaga
1261 ctaggcgaga aactctaacc caggttttac ccctaggatg cagcaccgag gcttcttcct
1321 tctcgccctt cttgccctct tggtggtcac gtccgcggtg gccaaaaaaa aaggtgatgg
1381 gataggatgg gctcaggagt aaaagctggg gtgggcaggt gaggcaggcc gtgtgaccaa
1441 gtgctggtcc ggcacgccat gtccttaact ttgttccttg cgccctgtag agaaggtgaa
1501 gaagggcagc gagtgttcgg agtggacctg ggggccctgc acccccagca gcaaggactg
1561 cggcatgggc ttccgcgagg gtacctgtgg ggcccagacc cagcgcgtcc attgcaaggt
1621 gccctgcaac tggaagaagg aatttggagg tgaggtggcg cgcgggagga gggcgggaag
1681 ccagagggta tgtccttata aaccggaggc agggaggaca tccacaaccc tcctgtctct
1741 caccgtgggg ccactctccc atcagccgac tgcaaataca agtttgagag ctggggggcg
1801 tgtgatggga gcactggcac caaagcccgc caagggaccc tgaagaaggc gcggtacaat
1861 gcccagtgcc aggagaccat ccgcgtgact aagccctgca cctccaagac caagtcaaag
1921 accaaaggtc agcgaatatg gtggggttgt gggccaggct actccatgct ctgtctctgc
1981 agagcagtct taaagttagg aatgggcagg cacttgaggg ccactctcag gagatgctaa
2041 accctctgcc caagtaggaa ctactctttc tgttggatca tccgacctgg gttcctggga
2101 aaggcttgtc tttgtcaact gaggaaggtg gggtgggatc agggaggagt taactctgcg
2161 cttaaaacta tggaaaggcc tgtcccaaag gtacatgctg ctacctgact cccaacagct
2221 attgaggcca gcagggcaga ggtgactctg cccatttccc cggtgaggaa cttggagtac
2281 tctgatccta gatgaaaata gaaagttgaa agtcaggctt ggtagctcgt gcctgtaaaa
2341 agcggcactt caggactgag gcagtaacac tgccttgagt tcaaggttac agactgagag
2401 acttgagagt ctgtctttaa aggggggggg ggggcgcgag ggttaaaaag ttgaacgaat
2461 aaagaaagat ttcatatcac atggctgccc tttcccacca cttccaggtg aactggtcag
2521 tcaccactag ggggcaggat tttctctcct tgatggacat gtctgcgttg tctggtgagt
2581 ccgagctagg tcacccaccg cactaatgca tctccgttat tgttttccag ccaagaaagg
2641 aaaaggaaag gactaagtca ggaggccaga gagcctccgg cctcgcctgg agcctgaacg
2701 gagccctcct ctcccacagg cccaagatat aacccaccag tgccttttgt cttcctgtca
2761 gctctgtcaa tcacgcctgt cctctcacgc ccacaccaag tgcccaaagt ggggagggac
2821 aagagattct ggaaagtgag cctccccata ccctcttttg ttctccccac cctgatactt
2881 gttattaaga aatgaataaa ataaactcac ttttttccaa taaaagctt
//
LOCUS MUSCRRY01 676 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 1.
ACCESSION M34164
KEYWORDS complement receptor.
SEGMENT 1 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 676)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 414 + 531 complement receptor (Crry; liver) precursor,
exon 1
sigp 414 530 complement receptor (liver) signal peptide
(put.)
matp 531 + 531 complement receptor (liver)
pep$ 414 + 660 complement receptor (spleen) precursor, exon 1
sigp 414 530 complement receptor (spleen) signal peptide
(put.)
matp 531 + 660 complement receptor (spleen)
IVS 532 > 676 Crry intron A
IVS 661 > 676 Crry intron A'
BASE COUNT 148 a 162 c 193 g 173 t
ORIGIN
1 atccgaattc atcataagga aataggttct tactgtatac tagacagggt atgcaactgt
61 cagctcactg ttgcagatta gggttaggct ccacccttgc agatttttaa aaggagtaag
121 gccgggctat atgccaaacc gagttcccat aatgccttgt tttctttgga gtcgaaggtt
181 cctgcaagtg gaaaacttcc tggagctgac ctactaggta ttgaaccagt ttctgcattg
241 ctgaatcaat ctcccaaggg taattccaca gaaatcccag gggcttggag taaacaagac
301 cgcgcctagc ccagctagag gaagttttat tccggaaccc agcgccattt ctgggtggga
361 ctgctttcta caccatttgc cgtaaaacgt tgtttgagaa cggtgtgagg ggaatggagg
421 tctcttctcg gagttcagag cctctggatc cggtgtggct ccttgtagcc ttcggccggg
481 gaggagtcaa gctagaagtt ttgctgctgt tcttgctgcc atttactttg ggtgagctgc
541 ggggaggcct ggggaagcac ggacacacgg ttcaccggga acccgcggta aataggctct
601 gcgcagactc caaacgctgg tctgggctgc ctgtgagtgc tcagcgcccc tttcccatgg
661 gtgagcgtgg ggcgcc
//
LOCUS MUSCRRY02 200 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 2.
ACCESSION M34165
KEYWORDS complement receptor.
SEGMENT 2 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 200)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 190 complement receptor (Crry; liver) precursor,
exon 2
matp 11 + 190 complement receptor (liver)
pep$ + 11 + 190 complement receptor (spleen) precursor, exon 2
matp + 11 + 190 complement receptor (spleen)
IVS < 1 10 Crry intron A
IVS < 1 10 Crry intron A'
IVS 191 > 200 Crry intron B
BASE COUNT 60 a 51 c 35 g 54 t
ORIGIN Undetermined number of base pairs after segment 1.
1 cattcaacag gtcactgccc agccccatca cagcttcctt ctgccaaacc tataaatcta
61 actgatgaat ccatgtttcc cattggaaca tatttgttgt atgaatgtct cccaggatat
121 atcaagaggc agttctctat cacctgcaaa caagactcaa cctggacgag tgctgaagat
181 aagtgtatac gtgagtaact
//
LOCUS MUSCRRY03 120 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 3.
ACCESSION M34166
KEYWORDS complement receptor.
SEGMENT 3 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 120)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 110 complement receptor (Crry; liver) precursor,
exon 3
matp + 11 + 110 complement receptor (liver)
pep$ + 11 + 110 complement receptor (spleen) precursor, exon 3
matp + 11 + 110 complement receptor (spleen)
IVS < 1 10 Crry intron B
IVS 111 > 120 Crry intron C
BASE COUNT 36 a 21 c 25 g 38 t
ORIGIN Undetermined number of base pairs after segment 2.
1 tttttcatag gaaaacaatg taaaactcct tcagatcctg agaatggctt ggtacatgta
61 cacacaggca ttcagtttgg atcccgtatt aattatactt gtaatcaagg gtgagttggc
//
LOCUS MUSCRRY04 104 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 4.
ACCESSION M34167
KEYWORDS complement receptor.
SEGMENT 4 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 104)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 96 complement receptor (Crry; liver) precursor,
exon 4
matp + 11 + 96 complement receptor (liver)
pep$ + 11 + 96 complement receptor (spleen) precursor, exon 4
matp + 11 + 96 complement receptor (spleen)
IVS < 1 10 Crry intron C
IVS 97 > 104 Crry intron D
BASE COUNT 20 a 20 c 27 g 37 t
ORIGIN Undetermined number of base pairs after segment 3.
1 ctgtgtgtag ataccgcctc attggttcct cctctgctgt atgtgtcatc actgatcaaa
61 gtgttgattg ggatactgag gcacctattt gtgagtgtaa gttg
//
LOCUS MUSCRRY05 422 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 5.
ACCESSION M34168
KEYWORDS complement receptor.
SEGMENT 5 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 422)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 412 complement receptor (Crry; liver) precursor,
exon 5
matp + 11 + 412 complement receptor (liver)
pep$ + 11 + 412 complement receptor (spleen) precursor, exon 5
matp + 11 + 412 complement receptor (spleen)
IVS < 1 10 Crry intron D
IVS 413 > 422 Crry intron E
BASE COUNT 106 a 100 c 104 g 112 t
ORIGIN Undetermined number of base pairs after segment 4.
1 ctttgcccag ggattccttg tgagataccc ccaggcattc ccaatggaga tttcttcagt
61 tcaaccagag aagactttca ttatggaatg gtggttacct accgctgcaa cactgatgcg
121 agagggaagg cgctctttaa cctggtgggt gagccctcct tatactgtac cagcaacgat
181 ggtgaaattg gagtctggag cggccctcct cctcagtgca ttgaactcaa caaatgtact
241 cctcctccct atgttgaaaa tgcagtcatg ctgtctgaga acagaagctt gttttcctta
301 agggatattg tggagtttag atgtcaccct ggctttatca tgaaaggagc cagcagtgtg
361 cattgtcagt ccctaaacaa atgggagcca gagttaccaa gctgcttcaa gggtaagctc
421 ga
//
LOCUS MUSCRRY06 206 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 6.
ACCESSION M34169
KEYWORDS complement receptor.
SEGMENT 6 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 206)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 196 complement receptor (Crry; liver) precursor,
exon 6
matp + 11 + 196 complement receptor (liver)
pep$ + 11 + 196 complement receptor (spleen) precursor, exon 6
matp + 11 + 196 complement receptor (spleen)
IVS < 1 10 Crry intron E
IVS 197 > 206 Crry intron F
BASE COUNT 61 a 33 c 58 g 54 t
ORIGIN Undetermined number of base pairs after segment 5.
1 ctaattgcag gagtgatatg tcgtctccct caggagatga gtggattcca gaaggggttg
61 ggaatgaaaa aagaatatta ttatggagag aatgtaacct tggaatgtga ggatgggtat
121 actctagaag gcagttctca aagccagtgc cagtctgatg gcagctggaa tcctcttctg
181 gccaaatgtg tatctcgtaa gtacaa
//
LOCUS MUSCRRY07 44 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 7.
ACCESSION M34170
KEYWORDS complement receptor.
SEGMENT 7 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 44)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 34 complement receptor (Crry; liver) precursor,
exon 7
matp + 11 + 34 complement receptor (liver)
pep$ + 11 + 34 complement receptor (spleen) precursor, exon 7
matp + 11 + 34 complement receptor (spleen)
IVS < 1 10 Crry intron F
IVS 35 > 44 Crry intron G
BASE COUNT 8 a 8 c 10 g 18 t
ORIGIN Undetermined number of base pairs after segment 6.
1 tctctttcag gctcaatcag tggtctaatt gttggtaagt tctg
//
LOCUS MUSCRRY08 96 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 8.
ACCESSION M34171
KEYWORDS complement receptor.
SEGMENT 8 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 96)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 86 complement receptor (Crry; liver) precursor,
exon 8
matp + 11 + 86 complement receptor (liver)
pep$ + 11 + 86 complement receptor (spleen) precursor, exon 8
matp + 11 + 86 complement receptor (spleen)
IVS < 1 10 Crry intron G
IVS 87 > 96 Crry intron H
BASE COUNT 27 a 10 c 19 g 40 t
ORIGIN Undetermined number of base pairs after segment 7.
1 tcctgtttag gaattttcat tgggataatc gtctttattt tagtcatcat tgttttcatt
61 tggatgattc tgaagtataa aaaacggtga gtaaag
//
LOCUS MUSCRRY09 125 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 9.
ACCESSION M34172
KEYWORDS complement receptor.
SEGMENT 9 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 125)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 115 complement receptor (Crry; liver) precursor,
exon 9
matp + 11 + 115 complement receptor (liver)
pep$ + 11 + 115 complement receptor (spleen) precursor, exon 9
matp + 11 + 115 complement receptor (spleen)
IVS < 1 10 Crry intron H
IVS 116 > 125 Crry intron I
BASE COUNT 46 a 24 c 25 g 30 t
ORIGIN Undetermined number of base pairs after segment 8.
1 taccaattag caataccaca gatgaaaagt ataaagaagt gggtattcat ttaaattata
61 aagaagacag ctgtgtccgc cttcagtctc tgctcacaag tcaggagaac agcaggtaca
121 tatgc
//
LOCUS MUSCRRY10 128 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry) gene, exon 10.
ACCESSION M34173
KEYWORDS complement receptor.
SEGMENT 10 of 10
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 128)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 56 complement receptor (Crry; liver) precursor,
exon 10
matp + 11 53 complement receptor (liver)
pep$ + 11 56 complement receptor (spleen) precursor, exon
10
matp + 11 53 complement receptor (spleen)
IVS < 1 10 Crry intron I
BASE COUNT 41 a 30 c 20 g 37 t
ORIGIN Undetermined number of base pairs after segment 9.
1 tttgctgaag taccactagc ccagcacgga attcactcac tcaagaagtc tcctaaatag
61 cagcaacgtg aaatgagaac atgctctgtc tgtatcactt ttaaaataaa ctgtttcctt
121 ttaagatc
//
LOCUS MUSCRRYPS 1272 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse complement receptor (Crry-ps) pseudogene DNA fragment.
ACCESSION M34174
KEYWORDS complement receptor; pseudogene.
SOURCE Mouse (strain Balb/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1272)
AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
TITLE The murine complement receptor gene family: III. The genomic and
transcriptional complexity of the Crry and Crry-ps genes
JOURNAL J. Immunol. 144, 1988-1996 (1990)
STANDARD simple staff_review
BASE COUNT 377 a 256 c 276 g 363 t
ORIGIN
1 tgcccagccc catcacagct tccttctgcc aaacctataa atctaactga tgaatccatg
61 tttcccattg gaacatctgt gaaatatgaa tgtctcccag gatatatcaa gaggcagttc
121 tctatcacct gcaaacaaga ctcaacctgg acgagtgctg aagataagtg tatacgaaaa
181 caatgtaaaa ctcctttaga tcctcagaat ggcttggtac atgtacacac aggcattcag
241 tttggatccc gtattaatta tacttgtaat aaaggatacc gcctcattgg ttcctcctct
301 gctgtatgtg tcatcactga tcaaagtgtt gattgggata ctgaggcacc tatttgtgag
361 tggattcctt gtgatatacc cccaggcatt cccaatggag atttcttcag ttcaactaga
421 gaagactttc attatggaat ggtggttacc taccgctgca acactgatgc gagagggaag
481 gcgctcttta acctggtggt tatactgtac cagcaacgat ggtgaaattg gagtctggag
541 tggccctcct cctcagtgca ttggattcaa caaatgtact cctcctccct atgttgaaaa
601 tgcagtcatg ctgtctgaga acagaagctt gttttcctta agggatattg tggagtttag
661 atgtcaccct ggctttatca tgaaaggagc cagcagtgtg cattgtcagt ccctaaacaa
721 atgggagcca gagttaccaa gctgcttcaa gggagtgata tgtcgtctcc ctcaggagat
781 gagtggattc cagaaggggt tgggaatgaa aaaagaatat tattatggag agaatgtaac
841 cttggaatgc gaggatgggt atactctaga aggcagttct caaagccagt gtcagtctga
901 tggcagctgg aatcctcttc tggccaaaag tgtatcgcgc tcaatcagtg gtctaattgt
961 tggaattttc attgggatga tcatctttat tttattcatc attgttttca tttggatgat
1021 tctgaagtat aaaaaacgca ataccacaga tgaaaagtat aaagaagtgg gtattcattt
1081 aaattataaa ggagacagct gtgtctgcct tcagtctctg ctcacaagtc aggagaacag
1141 cactaccact agcccagcac agaattcact cgctcaagaa gtctcctaaa tagcagcaac
1201 gtgaaatgag aacatgtctt tctgtatcat ttttaaaata aactatttct tttaagaaaa
1261 aaaagaaaga aa
//
LOCUS BSURGRRNB 7430 bp ds-DNA BCT 10-JUL-1990
DEFINITION B.subtilis rrnB operon with 23S rRNA, 16SrRNA, 5S rRNA and tRNA
gene cluster: Val-, Thr-, Lys-, Leu-cug-, Gly-ggc-, Leu-uua-, Arg-,
Pro-, Ala-, Met-, Ile-, Ser-uca-, Met-f-, Asp-, Phe-, His-,
Gly-gga-, Ile-, Asn-, Ser-agc- and Glu-tRNA.
ACCESSION K00637 M10606 X00007
KEYWORDS 23S ribosomal RNA; 5S ribosomal RNA; ribosomal RNA; transfer RNA;
transfer RNA-Ala; transfer RNA-Arg; transfer RNA-Asn;
transfer RNA-Asp; transfer RNA-Glu; transfer RNA-Gly;
transfer RNA-His; transfer RNA-Ile; transfer RNA-Leu;
transfer RNA-Lys; transfer RNA-Met; transfer RNA-Phe;
transfer RNA-Pro; transfer RNA-Ser; transfer RNA-Thr;
transfer RNA-Val.
SOURCE B.subtilis 168 DNA, library of Ferrari et al, clone pBC204 [1];
clone pGS227 [2]; clone pGS332 [3].
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 4897 to 7430)
AUTHORS Green,C.J. and Vold,B.S.
TITLE Sequence analysis of a cluster of twenty-one tRNA genes in Bacillus
subtilis
JOURNAL Nucleic Acids Res. 11, 5763-5774 (1983)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 1168)
AUTHORS Stewart,G.C. and Bott,K.
TITLE DNA sequence of the tandem ribosomal RNA promoter for B.subtilis
operon rrnB
JOURNAL Nucleic Acids Res. 11, 6289-6300 (1983)
STANDARD simple staff_review
REFERENCE 3 (bases 1 to 7430; revises [1],[2])
AUTHORS Green,C.J., Stewart,G.C., Hollis,M.A., Vold,B.S. and Bott,K.F.
TITLE Nucleotide sequence of the Bacillus subtilis ribosomal RNA operon,
rrnB
JOURNAL Gene 37, 261-266 (1985)
STANDARD simple staff_review
REFERENCE 4 (sites for [1],[2] and [3])
AUTHORS Su,S.L. and Dubnau,D.
TITLE Binding of Bacillus subtilis ermC' methyltransferase to 23S rRNA
JOURNAL Biochemistry 29, 6033-6042 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and sequence in computer readable form for [1],[2],[3]
kindly provided by K.F.Bott, 26-DEC-1985.
The RNAs, encoded by the sequence presented below, are probably
transcribed as one polycistronic unit, including the tRNA region,
because there are no obvious terminator stem loop structures until
after the end of the tRNA region at positions 7245-7272 and
7392-7413 [3].
[1] notes that though the Ile-tRNA-nau sequence has the methionine
anticodon "cau", it is highly homologous to Ile-tRNA-gau; the "c"
in the wobble position may be post-transcriptionally modified to
recognize "aua" codons.
Promoter P1 is located at positions 184-189 (-35 region) and
207-213 (-10 region), and P2 at 276-281 (-35 region) and 299-304
(-10 region). A third promoter region could be at positions
5517-5522. A potential stem-loop structure, necessary for
processing of the mature 16S rRNA, is found at positions 327-360
[2].
FEATURES from to/span description
rRNA 485 2034 16S rRNA
rRNA 2203 5129 23S rRNA
rRNA 5185 5300 5S rRNA
tRNA 5322 5397 Val-tRNA
tRNA 5430 5504 Thr-tRNA
tRNA 5543 5618 Lys-tRNA
tRNA 5629 5715 Leu-tRNA-cug
tRNA 5721 5795 Gly-tRNA-ggc
tRNA 5810 5895 Leu-tRNA-uua
tRNA 5905 5981 Arg-tRNA
tRNA 5997 6073 Pro-tRNA
tRNA 6079 6151 Ala-tRNA
tRNA 6172 6248 Met-tRNA
tRNA 6251 6327 Ile-tRNA-nau
tRNA 6334 6425 Ser-tRNA-uca
tRNA 6443 6519 Met-tRNA-f
tRNA 6531 6607 Asp-tRNA
tRNA 6620 6695 Phe-tRNA
tRNA 6712 6788 His-tRNA
tRNA 6799 6872 Gly-tRNA-gga
tRNA 6888 6964 Ile-tRNA-gau
tRNA 6975 7049 Asn-tRNA
tRNA 7053 7143 Ser-tRNA-agc
tRNA 7169 7240 Glu-tRNA
revision 504 504 c in [3]; t in [2]
revision 571 573 tcc in [3]; tc in [2]
revision 5029 5031 gga in [3]; ga in [1]
anticdn 5355 5357 Val-tRNA anticodon tac
anticdn 5463 5465 Thr-tRNA anticodon tgt
anticdn 5576 5578 Lys-tRNA anticodon ttt
anticdn 5663 5665 Leu-tRNA-cug anticodon cag
anticdn 5753 5755 Gly-tRNA-ggc anticodon gcc
anticdn 5844 5846 Leu-tRNA-uua anticodon taa
anticdn 5939 5941 Arg-tRNA anticodon acg
anticdn 6031 6033 Pro-tRNA anticodon tgg
anticdn 6112 6114 Ala-tRNA anticodon tgc
revision 6165 6167 act in [3]; at in [1]
anticdn 6206 6208 Met-tRNA anticodon cat
anticdn 6285 6287 Ile-tRNA-nau anticodon cat
anticdn 6370 6372 Ser-tRNA-uca anticodon tga
anticdn 6477 6479 Met-tRNA-f anticodon cat
anticdn 6565 6567 Asp-tRNA anticodon gtc
anticdn 6653 6655 Phe-tRNA anticodon gaa
anticdn 6746 6748 His-tRNA anticodon gtg
anticdn 6831 6833 Gly-tRNA-gga anticodon tcc
anticdn 6922 6924 Ile-tRNA-gau anticodon gat
anticdn 7007 7009 Asn-tRNA anticodon gtt
anticdn 7087 7089 Ser-tRNA-agc anticodon gct
anticdn 7202 7204 Glu-tRNA anticodon ttc
BASE COUNT 1906 a 1694 c 2125 g 1705 t
ORIGIN 65 bp upstream of MboI site; 280 degrees on the B.subtilis map.
1 ctttaatgct ccccttgtgg tcatcagtat ttagttcgtt tcacatacaa gaaaacgaaa
61 aaaacaacaa gatcacatga ctgatgtata tgttctttta agaaacttat atgatacaca
121 cgctttagaa atcatggcga ggattatagt ttatttgttt tatagatttt ttttaaaaaa
181 ctattgcaat aaataaatac aggtgttata ttattaaacg tcgctgatgc acagcggaca
241 caactagatg cttcaaaaca acttgaaaaa agttgttgac aaaaaagaag ctgaatgtta
301 tattagtaaa gctgcttcat tgagaagtaa cgaaatgatc tttgaaaact aaacaagaca
361 aaacgtacct gttaattcag tttttaaaaa tcgcactgcg atgtgcgtat catcaaacag
421 ggcctgcacg acgcaggtca cacaggtgtc gccgcaggat gcggtgaact taacctgtga
481 tccatttatc ggagagtttg atcctggctc aggacgaacg ctggcggcgt gcctaataca
541 tgcaagtcga gcggacaggt gggagcttgc tccgatgtta gcggcggacg ggtgagtaac
601 acgtgggtaa cctgcctgta agactgggat aactccggga aaccggggct aataccggat
661 ggttgtttga accgcatggt tcaaacataa aaggtggctt cggctaccac ttacagatgg
721 acccgcggcg cattagctag ttggtgaggt aacggctcac caaggcaacg atgcgtagcc
781 gacctgagag ggtgatcggc cacactggga ctgagacacg gcccagactc ctacgggagg
841 cagcagtagg gaatcttccg caatggacga aagtctgacg gagcaacgcc gcgtgagtga
901 tgaaggtttt cggatcgtaa agctctgttg ttagggaaga acaagtaccg ttcgaacagg
961 gcggtacctt gacggtacct aaccagaaag ccacggctaa ctacgtgcca gcagccgcgg
1021 taatacgtag gtggcaagcg ttttccggaa ttattgggcg taaagggctc gcaggcggtt
1081 tcttaagtct gatgtgaaag cccccggctc aaccggggag ggtcattgga aactggggaa
1141 cttgagtgca gaagaggaga gtggaattcc acgttgtagc ggtgaaatgc gtagagatgt
1201 ggaggaacac cagtggcgaa ggcgactctc tggtctgtaa ctgacgctga ggagcgaaag
1261 cgtggggagc gaacaggatt agataccctg gtagtccacg ccgtaaacga tgagtgctaa
1321 gtgttagggg gtttccgccc cttagtgctg cagctaacgc attgagcact ccgcctgggg
1381 agtacggtcg caagactgaa actcaaagga attgacgggg gcccgcacaa gcggtggagc
1441 atgtggttta attcgaagca acgcgaagaa ccttactagg tcttgacatc ctctgacaat
1501 cctagagata ggacgtcccc ttcggggcag agtgacaggt ggtgcatggt tgtcgtcagc
1561 tcgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccttgat cttagttgcc
1621 agcattcagt tgggcactct aaggtgactg ccggtgacaa accggaggaa ggtggggatg
1681 acgtcaaatc atcatgcccc ttatgacttg ggctacacac gtgctacaat ggacagaaca
1741 aagggcagcg aaccgcgagg ttaagccaat cccacaaatc tgttctcagt tcggatcgca
1801 gtctgcaact cgactgcgtg aagctggaat cgctagtaat cgcggatcag catgccgcgg
1861 tgaatacgtt cccgggcctt gtacacaccg cccgtcacac cacgagagtt tgtaacaccc
1921 gaagtcggtg aggtaacctt ttaggagcca gccgccgaag gtgggacaga tgattggggt
1981 gaagtcgtaa caaggtagcc gtatcggaag gtgcggctgg atcacctcct ttctaaggat
2041 attatacgga atataagacc caaggtctta taaacagaac gttccctgtc ttgtttagtt
2101 ttgaaggatc attccttcga aacgtgttct ttgaaaacta gataacagta gacatcacat
2161 tcaattagta acacaagata tcacatagtg attcttttta acggttaagt tagaaagggc
2221 gcacggtgga tgccttggca ctaggagccg atgaaggacg ggacgaacac cgatatgctt
2281 cggggagctg taagcaagct ttgatccgga gatttccgaa tggggaaacc caccactcgt
2341 aatggagtgg tatccatatc tgaattcata ggatatgaga aggcagaccc ggggaactga
2401 aacatctaag tacccggaga agagaaagca aatgcgattc cctgagtagc ggcgacgaac
2461 acgggatcag cccaaaccaa gaggcttgcc tctgtggttg taggacactc tgtacggagt
2521 tacaaaagaa cgaggtagat gaagaggtct ggaaagggcc cgccatagga ggtaacagcc
2581 ctgtagtcaa aacttcgttc tctcctgagt ggatcctgag tacggcggaa cacgtgaaat
2641 tccgtcggaa tccgggagga ccatctccca aggctaaata ctccctagtg accgatagtg
2701 aaccagtacc gtgagggaaa ggtgaaaagc accccggaag gggagtgaaa gagatcctga
2761 aaccgtgtgc ctacaagtag tcagagcccg ttaacggtga tggcgtgcct tttgtagaat
2821 gaaccggcga gttacgatcc cgtgcaaggt taagcagaag atgcggagcc gcagcgaaag
2881 cgagtctgaa tagggcgcat gagtacgtgg tcgtagaccc gaaaccaggt gatctaccca
2941 tgtccagggt gaagttcagg taacactgaa tggaggcccg aacccacgca cgttgaaaag
3001 tgcggggatg aggtgtgggt aggggtgaaa tgccaatcga acctggagat agctggttct
3061 ctccgaaata gctttagggc tagcctcaag gtaagagtct tggaggtaga gcactgattg
3121 gactaggggc cctcaccggg ttaccgaatt cagtcaaact ccgaatgcca atgacttatc
3181 cttgggagtc agactgcgag tgataagatc cgtagtcgaa agggaaacag cccagaccgc
3241 cagctaaggt cccaaagtat acgttaagtg gaaaaggatg tggagttgct tagacaacca
3301 ggatgttggc ttagaagcag ccaccattta aagagtgcgt aatagctcac tggtcgagtg
3361 actctgcgcc gaaaatgtac cggggctaaa cgtatcaccg aagctgcgga ctgttcttcg
3421 aacagtggta ggagagcgtt ctaagggctg tgaagccaga ccggaaggac tggtggacgg
3481 cttagaagtg agaatgccgg tatgagtagc gaaaagaggg gtgagaatcc ctccaccgaa
3541 tgcctaaggg ttcctgagga aggctcgtcc gctcagggtt agtcgggacc taagccgagg
3601 ccgaaaggcg taggcgatgg acaacaggtt gatattcctg taccacctcc tcaccatttg
3661 agcaatgggg ggtcgcagga ggatagggta agcgcggtat tggatatccg cgtccaagca
3721 gttaggctgg gaaataggca aatccgtttc ccataaggct gagctgtgat ggcgagcgaa
3781 atatagtagc gaagttcctg attccacact gccaagaaaa gcctctagcg aggtgagagg
3841 tgcccgtacc gcaaaccgtc acaggtaggc gaggagagaa tcctaaggtg atcgagagaa
3901 ctctcgttaa ggaactcggc aaaatgaccc cgtaacttcg ggagaagggg tgctctgtta
3961 gggtgcaagc ccgagagagc cgcagtgaat aggcccaggc gactgtttag caaaaacaca
4021 ggtctctgcg aagccgtaag gcgaagtata ggggctgacg cctgcccggt gctggaaggt
4081 taagaggagc gcttagcgta agcgaaggtg cgaattgaag ccccagtaaa cggcggccgt
4141 aactataacg gtcctaaggt agcgaaattc cttgtcgggt aagttccgac ccgcacgaaa
4201 ggcgcaacga tctgggcgct gtctcaacga gagactcggt gaaattatag tacctgtgaa
4261 gatgcaggtt acccgcgaca ggacggaaag accccgtgga gctttactgc agcctgatat
4321 tgaatgttgg tacagcttgt acaggatagg taggagcctt ggaaaccgga gcgccagctt
4381 cggtggaggc atcggtggga tactaccctg gctgtattga ccttctaacc ccccgccctt
4441 atcgggcggg gagacagtgt caggtgggca gtttgactgg ggcggtcgcc tcctaaaagg
4501 taacggaggc gcccaaaggt tccctcagaa tggttggaaa tcattcgcag agtgtaaagg
4561 cacaagggag cttgactgcg agacctacaa gtcgagcagg gacgaaagtc gggcttagtg
4621 atccggtggt tccgcatgga agggccatcg ctcaacggat aaaagctacc ccggggataa
4681 caggcttatc tcccccaaga gctccacatc gacggggagg tttggcacct cgatgtcggc
4741 tcatcgcatc ctggggctgt agtcggtccc aagggttggg ctgttcgccc attaaagcgg
4801 tacgcgagct gggttcagaa cgtcgtgaga cagttcggtc cctatccgtc gcgggcgctg
4861 gaaatttgag aggagctgtc cttagtacga gaggaccggg atggacgcac cgctggtgta
4921 ccagttgttc tgccaagggc atcgctgggt agctatgtgc ggacgggata agtgctgaaa
4981 gcatctaagc atgaagcccc cctcaagatg agatttccca ttccgcaagg aagtaagatc
5041 cctgaaagat gatcaggttg ataggtctga ggtggaagtg tggcaacaca tggagctgac
5101 agatactaat cgatcgagga cttaaccata tttttgaatg atgtcacacc tgttatctag
5161 ttttgagaga acactctcaa tttgtttggt ggcgatagcg aagaggtcac acccgttccc
5221 ataccgaaca cggaagttaa gctcttcagc gccgatggta gtcgggggtt tccccctgtg
5281 agagtaggac gccgccaagc aattgcacgt tagtgcaata tggaggatta gctcagctgg
5341 gagagcatct gccttacaag cagagggtcg gcggttcgag cccgtcatcc tccaccattt
5401 ttcattatac atatcggttt tacatatatg ccggtgtagc tcaattggta gagcaactga
5461 cttgtaatca gtaggttggg ggttcaagtc ctcttgccgg caccactttt atatgatata
5521 atattcaagt ctattgtaag aagagccatt agctcagttg gtagagcatc tgacttttaa
5581 tcagagggtc gaaggttcga gtccttcatg gctcaccatt tacatgttgc ggatgtggcg
5641 gaattggcag acgcgctaga atcaggctct agtgtcttta cagacgtggg ggttcaagtc
5701 ccttcatccg caccatttct gcggaagtag ttcagtggta gaacaccacc ttgccaaggt
5761 gggggtcgcg ggttcgaatc ccgtcttccg ctccaactat accatccacg ccggggtggt
5821 ggaattggca gacacacagg acttaaaatc ctgcggtagg tgactaccgt gccggttcaa
5881 gtccggccct cggcattaag ttttgcgccc gtagctcaat tggatagagc gtttgactac
5941 ggatcaaaag gttaggggtt cgactcctct cgggcgcgcc atgatctata tgaaatcggg
6001 aagtagctca gcttggtaga gcacatggtt tgggaccatg gggtcgcagg ttcgaatcct
6061 gtcttcccga ccattcttgg ggccttagct cagctgggag agcgcctgct ttgcacgcag
6121 gaggtcagcg gttcgatccc gctaggctcc acttgatttc aaaaactatt tggcggtgta
6181 gctcagctgg ctagagcgta cggttcatac ccgtgaggtc gggggttcga tcccctccgc
6241 cgctaccaat ggacctttag ctcagttggt tagagcagac ggctcataac cgtccggtcg
6301 taggttcgag tcctacaagg tccaccacta tacggaggaa tacccaagtc tggctgaagg
6361 gatcggtctt gaaaaccgac agggtgtcaa agcccgcggg ggttcgaatc cctcttcctc
6421 cgccatacat attcctaatc atcgcggggt ggagcagttc ggtagctcgt cgggctcata
6481 acccgaaggt cgcaggttca aatcctgccc ccgcaaccaa attttaaaat ggtccggtag
6541 ttcagttggt tagaatgcct gcctgtcacg caggaggtcg cgggttcgag tcccgtccgg
6601 accgccattt aaatacttag gctcggtagc tcagttggta gagcaacgga ctgaaaatcc
6661 gtgtgtcggc ggttcgattc cgtcccgagc caccatttat caatatgctt tggcggttgt
6721 ggcgaagtgg ttaacgcacc agattgtggc tctggcattc gtgggttcga ttcccatcaa
6781 tcgccccaaa taaaaattgc gggtgtagtt tagtggtaaa acctcagcct tccaagctga
6841 tgtcgtgggt tcgattccca tcacccgctc catttctata tcgtcatggg cctgtagctc
6901 agctggttag agcgcacgcc tgataagcgt gaggtcgatg gttcgagtcc attcaggccc
6961 accatgactt ttgttccaca gtagctcagt ggtagagcta tcggctgtta accgatcggt
7021 cgcaggttcg aatcctgcct gtggagccaa atggagaagt actcaagtgg ctgaagaggc
7081 gcccctgcta agggtgtagg tcgtgtaagc ggcgcgaggg ttcaaatccc tccttctccg
7141 ccatatgatt acagatatca taattatcgg cccgttggtc aagcggttaa gacaccgccc
7201 tttcacggcg gtaacacggg ttcgaatccc gtacgggtca tcccagaagc cttgcatatc
7261 ctgcaaggtt tttttgtttt tataaatcat gtatatgtct tagattttgt tctttatttt
7321 aaaaacagac tacaaaaatc tccatatatt tcgtttttct tcagaaaatg aagttaattg
7381 tctataagta taagccgttt cagggaaagg gctttttttt atttcttcga
//
LOCUS ECOAROCX 1690 bp ds-DNA BCT 10-JUL-1990
DEFINITION E.coli chorismate synthase (aroC) gene, complete cds.
ACCESSION M33021
KEYWORDS aroC gene; chorismate synthase.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1690)
AUTHORS White,P.J., Millar,G. and Coggins,J.R.
TITLE The overexpression, purification and complete amino acid sequence
of chorismate synthase from Escherichia coli K12 and its comparison
with the enzyme from Neurospora crassa
JOURNAL Biochem. J. 251, 313-322 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 492 1562 chorismate synthase (EC 4.6.1.4)
BASE COUNT 403 a 467 c 466 g 354 t
ORIGIN
1 gtcgacgcgg tggatatctc tccagacgcg ctggcggttg ctgaacagaa catcgaagaa
61 cacggtctga tccacaacgt cattccgatt cgttccgatc tgttccgcga cttgccgaaa
121 gtgcagtacg acctgattgt cactaacccg ccgtatgtcg atgcgaagat atgtccgacc
181 tgccaaacaa taccgccacg agccggaact gggcctggca tctggcactg acggcctgaa
241 actgacgcgt cgcattctcg gtaacgcggc agattacctt gctgatgatg gcgtgttgat
301 ttgtgaagtc ggcaacagca tggtacatct tatggaacaa tatccggatg ttccgttcac
361 ctggctggag tttgataacg gcggcgatgg tgtgtttatg ctcaccaaag agcagcttat
421 tgccgcacga gaacatttcg cgatttataa agattaagta aacacgcaaa cacaacaata
481 acggagccgt gatggctgga aacacaattg gacaactctt tcgcgtaacc accttcggcg
541 aatcgcacgg gctggcgctc ggctccatcg tcgatggtgt tccgccagcc attccgctga
601 cggaagcgga cctgcaacat gacctcgacc gtcgtcgccc tgggacatcg cgctatacca
661 cccagcgccg cgagccggat caggtcaaaa ttctctccgg tgtttttgaa ggcgttacta
721 ccggcaccag cattggcttg ttgatcgaaa acactgacca gcgctctcag gattacagtg
781 cgattaagga cgttttccgt ccaggccatg ccgattacac ctacgaacaa aaatacggtc
841 tgcgcgatta tcgcggcggt ggacgttctt ccgcccgcga aaccgccatg cgcgtggcgg
901 caggagctat tgccaaaaaa tatctcgccg agaaatttgg tattgaaatc cgtggctgcc
961 tgacccagat gggcgacatt ccgctggata tcaaagactg gtcgcaggtc gagcaaaatc
1021 cgtttttttg cccggacccc gacaaaatcg acgcgttaga cgagttgatg cgtgcgctga
1081 aaaaagaggg cgactccatc ggcgctaaag tcaccgttgt tgccagtggc gttcctgccg
1141 gacttggcga gccggtcttt gaccgcctgg atgctgacat cgcccatgcg ctgatgagca
1201 tcaacgcggt gaaaggcgtg gaaattggcg acggctttga cgtggtggcg ctgcgcggca
1261 gccagaaccc cgatgaaatc accaaagacg gtttccagag caaccatgcg ggcggcattc
1321 tcggcggtat cagcagcggg cagcaaatca ttgcccatat ggcgctgaaa ccgacctcca
1381 gcattaccgt gccgggtcgt accattaacc gctttggcca agaagttgag atgatcacca
1441 aaggccgtca cgatccctgt gtcgggatcc gcgcagtgcc gatcgcagaa gcgaatgctg
1501 gcgatcgttt taatggatca cctgttacgg caacgggcgc aaaatgccga tgtgaagact
1561 gatattccac gctggtaaaa aatgaataaa accgcgattg cgctgctggc tctgcttgcc
1621 agtagcgcca gcctggcagc gacggcgtgg caaaaaataa cccaacctgt gccgggtagc
1681 gccaaatcga
//
LOCUS PFAMSA2 819 bp ds-DNA INV 10-JUL-1990
DEFINITION P.falciparum 45 kD merozoite surface antigen (MSA 2) gene,
complete cds.
ACCESSION M28891
KEYWORDS integral membrane protein; surface antigen.
SOURCE P.falciparum DNA, clone 3D7.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 819)
AUTHORS Smythe,J.A., Peterson,M.G., Coppel,R.L., Saul,A.J., Kemp,D.J. and
Anders,R.F.
TITLE Structural diversity in the 45-kilodalton merozoite surface antigen
of Plasmodium falciparum
JOURNAL Mol. Biochem. Parasitol. 39, 227-234 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable copy of sequence [1] kindly
provided by J.A. Smythe, 06-OCT-1989.
FEATURES from to/span description
pept 1 819 45 kD merozoite surface antigen precursor
sigp 1 60 45 kD merozoite surface antigen signal peptide
matp 61 819 45,000 merozoite surface antigen
rpt 157 228 12 base repeat
rpt 301 321 9 base repeat
BASE COUNT 304 a 157 c 143 g 215 t
ORIGIN
1 atgaaggtaa ttaaaacatt gtctattata aatttcttta tttttgttac ctttaatatt
61 aaaaatgaaa gtaaatatag caacacattc ataaacaatg cttataatat gagtataagg
121 agaagtatgg cagaaagtaa gccttctact ggtgctggtg gtactgctgg tggtagtgct
181 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg ttctggtgat
241 ggtaatggtg cagatgctga gggaagttca agtactcccg ctactaccac aactaccaaa
301 actaccacaa ctaccacaac tactaatgat gcagaagcat ctaccagtac ctcttcagaa
361 aatccaaatc ataaaaatgc cgaaacaaat ccaaaaggta aaggagaagt tcaagaacca
421 aatcaagcaa ataaagaaac tcaaaataac tcaaatgttc aacaagactc tcaaactaaa
481 tcaaatgttc cacccactca agatgcagac actaaaagtc ctactgcaca acctgaacaa
541 gctgaaaatt ctgctccaac agccgaacaa actgaatccc ccgaattaca atctgcacca
601 gagaataaag gtacaggaca acatggacat atgcatggtt ctagaaataa tcatccacaa
661 aatacttctg atagtcaaaa agaatgtacc gatggtaaca aagaaaactg tggagcagca
721 acatccctct taaataactc tagtaatatt gcttcaataa ataaatttgt tgttttaatt
781 tcagcaacac ttgttttatc ttttgccata ttcatataa
//
LOCUS PFAMSA2X 864 bp ds-DNA INV 10-JUL-1990
DEFINITION P.falciparum 45,000 merozoite surface antigen (MSA2) gene,
complete cds.
ACCESSION M28892
KEYWORDS integral membrane protein; surface antigen.
SOURCE P.falciparum (isolate Indochina 1) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 864)
AUTHORS Smythe,J.A., Peterson,M.G., Coppel,R.L., Saul,A.J., Kemp,D.J. and
Anders,R.F.
TITLE Structural diversity in the 45-kilodalton merozoite surface antigen
of Plasmodium falciparum
JOURNAL Mol. Biochem. Parasitol. 39, 227-234 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable copy of sequence [1] kindly
provided by J.A. Smythe, 06-OCT-1989.
FEATURES from to/span description
pept 1 864 45 kD merozoite surface antigen precursor
sigp 1 60 45 kD merozoite surface antigen signal peptide
matp 61 864 45 kD merozoite surface antigen
rpt 169 312 12 base repeat sequence
rpt 379 397 9 base repeat sequence
BASE COUNT 288 a 157 c 179 g 240 t
ORIGIN
1 atgaaggtaa ttaaaacatt gtctattata aatttcttta tttttgttac ctttaatatt
61 aaaaatgaaa gtaaatatag caacacattc ataaacaatg cttataatat gagtataagg
121 agaagtatga cagaaagtaa tcctcctact ggtgctagtg gtagtgctgg tggtagtgct
181 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg tggtagtgct
241 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg tggtagtgct
301 ggtggtagtg ctggttctgg tgatggtaat ggtgctaatc ctggtgcaga tgctgagaga
361 agtccaagta ctcccgctac taccacaact accacaacta ctaatgatgc agaagcatct
421 accagtacct cttcagaaaa tccaaatcat aataatgccg aaacaaatca agcaaataaa
481 gaaactcaaa ataactcaaa cgttcaacaa gactctcaaa ctaaatcaaa tgttccaccc
541 actcaagatg cagacactag aagtcctact gcacaacctg aacaagctga aaattctgct
601 ccaacagccg aacaaactga atcccccgaa ttacaatctg caccagagaa taaaggtaca
661 ggacaacatg gacatatgca tggttctaga aataatcatc cacaaaatac ttctgatagt
721 caaaaagaat gtaccgatgg taacaaagaa aactgtggag cagcaacatc cctcttaaat
781 aactctagta atattgcttc aataaataaa tttgttgttt taatttcagc aacacttgtt
841 ttatcttttg ccatattcat ataa
//
LOCUS XELRASX 1143 bp ss-mRNA VRT 10-JUL-1990
DEFINITION X.laevis ras protein mRNA, complete cds.
ACCESSION M34657
KEYWORDS ras protein.
SOURCE X.laevis defolliculated oocyte, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1143)
AUTHORS Andeol,Y., Gusse,M. and Mechali,M.
TITLE Characterization and expression of a Xenopus ras during oogenesis
and development
JOURNAL Dev. Biol. 139, 24-34 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 196 756 ras protein
mRNA < 1 1143 ras protein mRNA
BASE COUNT 346 a 275 c 279 g 243 t
ORIGIN
1 gaattcgcca gtgttacaga atgggagttc tgaggcgctg tgactaatcc cccccacccc
61 cgcatattgg ggaaatccac cggcgggcag aaagccagag ggagaactaa ggggggccaa
121 accaaaggaa aacgcaggag ccaaagcctc cagaaacaca gggatccgtg acgagcccga
181 gtcggtgctg gtgaaatgac ggagtacaaa ctggtggtgg ttggtgctgg aggcgtgggg
241 aagagcgcac tcacaatcca gctcattcag aaccattttg tggacgagta tgatcctact
301 attgaggact cgtataggaa gcaggtggtg atagacgggg agacctgcct cctagatatc
361 ctggacactg cggggcaaga ggaatacagc gctatgaggg atcagtacat gcgcacggga
421 gaaggctttc tctgtgtctt tgctattaac aacacaaagt ccttcgagga cgtccatcat
481 tacagggaac agattaacag agttaaagat tccgatgacg ttcccatggt gttagttggt
541 aacaaatgcg acctcccatc ccggactgtg gacacaaagc aagcgcagga actggcaaag
601 agctatggta ttccttttat agagacctct gccaaaacta gacagggagt cgaagacgcc
661 ttctataccc tagtccgtga aatccgcaag cacaaggaga agatcagcaa cgggaaaaag
721 aaaaagtcct ccaaaaggaa gtgtgtcgtt ctttaacgtg ccaacctgcc cccccctgcc
781 atcctcgtgg atcagagaaa accgtgccat cacacacctg aagtcaaaga aaaaaaaagt
841 gtggactttt gtcgttgctg tggaaaccat tgaattgcca tgaaatttaa aaaaaaaacc
901 aaaacattga ccacttattt taacacaacc gataaatggc acaggctgtg ccccaatcgt
961 gtatatattc ttcatgaaca aactgtttta tcagaaagac agatgcaata gccccttctt
1021 tttaccccaa ttaaccctcc tcctggtttc tatttctccc tggaaaagac gttggtcgac
1081 cagaggggaa gaacctgccc aggcctttct tacagcccca tttgaataaa gattgaaaca
1141 ctc
//
LOCUS HUMSPTB 6765 bp ss-mRNA PRI 10-JUL-1990
DEFINITION Human beta-spectrin (SPTB) mRNA, complete cds.
ACCESSION J05500
KEYWORDS beta-spectrin; spectrin.
SOURCE Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and
V252.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 6765)
AUTHORS Winkelmann,J.C., Chang,J.G., Tse,W.T., Marchesi,V.T. and
Forget,B.G.
TITLE Full length sequence of the cDNA for human erythroid beta-spectrin
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Winkelmann, 08-MAY-1990.
FEATURES from to/span description
pept 96 6509 beta-spectrin /nomgen="SPTB" /map="14"
/hgml_locus_uid="LS0033T"
mRNA < 1 6765 SPTB mRNA
signal 6716 6722 poly-A signal
BASE COUNT 1627 a 1822 c 2144 g 1172 t
ORIGIN Chromosome 14q23-q24.
1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag
61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa
121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg
181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg
241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg
301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca
361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc
421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac
481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc
541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag
601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag
661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta
721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca
781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc
841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct
901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc
961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg
1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca
1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca
1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca
1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag
1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg
1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc
1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata
1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga
1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg
1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc
1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg
1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg
1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga
1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg
1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc
1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg
1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg
2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc
2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc
2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt
2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga
2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc
2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg
2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc
2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat
2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgtcaggc ctgcgggagc
2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg
2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga
2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca
2761 ggttcgacat cctggaccag gagatgaaga ccttcatgac tcagattgat ggtgtgaacc
2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc
2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg
2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt
3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag
3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg
3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga
3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat
3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg
3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca
3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg
3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc
3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg
3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg
3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat
3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga
3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg
3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg
3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc
3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct
3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc
4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc
4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc
4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa
4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg
4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcgatcagac gacccgggca
4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga
4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg
4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa
4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg
4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta
4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg
4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg
4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc
4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt
4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct
4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc
4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg
5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg
5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca
5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa
5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc
5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca
5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg
5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc
5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga
5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca
5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg
5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg
5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc
5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct
5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc
5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca
5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc
6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt
6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc
6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg
6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca
6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg
6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga
6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg
6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac
6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg
6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg
6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag
6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa
6721 attttattgt ataaatatat caccatttac atgaggggaa acact
//
LOCUS STYEUTBC 2526 bp ds-DNA BCT 10-JUL-1990
DEFINITION S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes,
complete cds.
ACCESSION J05518
KEYWORDS ethanolamine ammonia-lyase.
SOURCE S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2526)
AUTHORS Faust,L.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M.
TITLE Cloning, sequencing and expression of the genes encoding the
alcohol-dependent ethanolamine ammonia-lyase of Salmonella
typhimurium
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.M.Babior, 08-MAY-1990.
FEATURES from to/span description
pept 141 1499 ethanolamine ammonia-lyase (eutB)
pept 1518 2378 ethanolamine ammonia-lyase (eutC)
binding 130 133 ribosome binding site
binding 1507 1510 ribosome binding site
BASE COUNT 563 a 687 c 779 g 497 t
ORIGIN
1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga
61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt
121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt
181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg
241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa
301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc
361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca
421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc
481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc
541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc
601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg
661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc
721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga
781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc
841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct
901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt
961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct
1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg
1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc
1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga
1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc
1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc
1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg
1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg
1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat
1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg
1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc
1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg
1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc
1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg
1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg
1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc
1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg
1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga
2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact
2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca
2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg
2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc
2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg
2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga
2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg
2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt
2521 gcctcc
//
LOCUS XELPCNA 1018 bp ss-mRNA VRT 10-JUL-1990
DEFINITION X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete
cds.
ACCESSION M34080
KEYWORDS nuclear protein; proliferating cell nuclear antigen.
SOURCE X.laevis oocyte, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1018)
AUTHORS Leibovici,M., Gusse,M., Bravo,R. and Mechali,M.
TITLE Characterization and developmental expression of Xenopus
proliferating cell nuclear antigen (PCNA)
JOURNAL Dev. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Leibovici, 08-MAY-1990.
FEATURES from to/span description
pept 28 813 proliferating cell nuclear antigen (PCNA)
mRNA < 1 1018 PCNA mRNA
BASE COUNT 284 a 223 c 237 g 274 t
ORIGIN
1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc
61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca
121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact
181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag
241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg
301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt
361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa
421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc
481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca
541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa
601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat
661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca
721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg
781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc
841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa
901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa
961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac
//
LOCUS YSCAAC2A 1333 bp ds-DNA PLN 10-JUL-1990
DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete
cds.
ACCESSION M34076 J05542
KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE S.cerevisiae (strain W303-1B) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1333)
AUTHORS Kolarov,J., Kolarova,N. and Nelson,N.
TITLE A third ADP/ATP-translocator in yeast
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Nelson, 08-MAY-1990.
FEATURES from to/span description
pept 235 1158 ADP/ATP-translocator protein (AAC2)
BASE COUNT 388 a 209 c 301 g 435 t
ORIGIN
1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc
61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg
121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa
181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt
241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt
301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat
361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt
421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat
481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg
541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct
601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc
661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg
721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc
781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct
841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg
901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga
961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa
1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg
1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc
1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc
1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag
1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc
1321 agccactgtt gtt
//
LOCUS YSCAAC3 1308 bp ds-DNA PLN 10-JUL-1990
DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete
cds.
ACCESSION M34075 J05542
KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE S.cerevisiae (strain W303-1B) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1308)
AUTHORS Kolarov,J., Kolarova,N. and Nelson,N.
TITLE A third ADP/ATP-translocator in yeast
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Nelson, 08-MAY-1990.
FEATURES from to/span description
pept 78 1034 ADP/ATP-translocator protein (AAC3)
BASE COUNT 353 a 228 c 263 g 464 t
ORIGIN
1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata
61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc
121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg
181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag
241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca
301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta
361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt
421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg
481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat
541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg
601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac
661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga
721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt
781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa
841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga
901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag
961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta
1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc
1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata
1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa
1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc
1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt
//
LOCUS ECODKSA 1273 bp ds-DNA BCT 10-JUL-1990
DEFINITION E.coli dnaK suppressor (dksA) gene, complete cds.
ACCESSION M34945
KEYWORDS dnaK suppressor.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1273)
AUTHORS Kang,P.J. and Craig,E.A.
TITLE Identification and characterization of a new Escherichia coli gene
that is a dosage-dependent suppressor of a dnaK deletion mutation
JOURNAL J. Bacteriol. 172, 2055-2064 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 229 441 ORF 1
pept 619 1074 dnaK suppressor (dksA)
BASE COUNT 343 a 301 c 333 g 296 t
ORIGIN
1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa
61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc
121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata
181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg
241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt
301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg
361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa
421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg
481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg
541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc
601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt
661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa
721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa
781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac
841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac
901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc
961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg
1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag
1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga
1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct
1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt
1261 acgcatagaa gat
//
LOCUS STFCYCLI 2180 bp ss-mRNA INV 10-JUL-1990
DEFINITION Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds.
ACCESSION M33880
KEYWORDS cyclin B.
SOURCE Starfish (A.pectinifera) egg, cDNA to mRNA, clone
lambda-gt10-cyc10.
ORGANISM Asterina pectinifera
Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa;
Stelleroidea; Asteroidea; Spinulosida; Asterinidae.
REFERENCE 1 (bases 1 to 2180)
AUTHORS Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T.
TITLE The starfish egg mRNA responsible for meiosis reinitiation encodes
cyclin
JOURNAL Dev. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by T.Kishimoto, 11-MAY-1990.
FEATURES from to/span description
pept 126 1313 cyclin B (CYC)
mRNA < 126 2180 cyclin B mRNA
BASE COUNT 650 a 440 c 482 g 608 t
ORIGIN
1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg
61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga
121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt
181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc
241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag
301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca
361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa
421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct
481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac
541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt
601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc
661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc
721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc
781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg
841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt
901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca
961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca
1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag
1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc
1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat
1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca
1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg
1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt
1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt
1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg
1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa
1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt
1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga
1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc
1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca
1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac
1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat
1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag
1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat
2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg
2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt
2161 gtaataaagt taatggaaat
//
LOCUS CHKMTTGHA 90 bp ds-DNA ORG 10-JUL-1990
DEFINITION Chicken mitochondrial His-tRNA gene.
ACCESSION M34496 M34497
KEYWORDS transfer RNA-His.
SOURCE Chicken (strain white leghorn) liver mitochondrial DNA.
ORGANISM Mitochondrion Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae; Gallus gallus.
REFERENCE 1 (bases 1 to 90)
AUTHORS L'Abbe,D., Lang,B.F., Desjardins,P. and Morais,R.
TITLE Histidine tRNA from chicken mitochondria has an uncoded 5'-terminal
guanylate residue
JOURNAL J. Biol. Chem. 265, 2988-2992 (1990)
STANDARD simple staff_entry
COMMENT the "n"s in the tRNA sequence are probably modified bases.
FEATURES from to/span description
tRNA 11 79 His-tRNA
anticdn 41 43 His-tRNA anticodon gtg
variant 10 10 t in DNA, n in tRNA
variant 18 18 t in DNA; n in tRNA
variant 19 19 a in DNA; n in tRNA
variant 26 26 c in DNA; n in tRNA
variant 27 27 c in DNA; n in tRNA
variant 35 35 t in DNA; n in tRNA
variant 45 45 t in DNA; n in tRNA
BASE COUNT 27 a 22 c 17 g 24 t
ORIGIN
1 acccctctat gcaaacatag tttaacccaa acattagatt gtgattctaa aaataggagt
61 ttaaccctcc ttgttcgccg aggggaggcc
//
LOCUS DDISAS1A 2145 bp ss-mRNA INV 10-JUL-1990
DEFINITION D.discoideum GTP-binding protein (SAS1) gene, complete cds.
ACCESSION M34456
KEYWORDS GTP-binding protein.
SOURCE D.discoideum, cDNA to mRNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 2145)
AUTHORS Saxe,S.A. and Kimmel,A.R.
TITLE SAS1 and SAS2, GTP-binding protein genes in Dictyostelium
discoideum with sequence similarities to essential genes in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 2367-2378 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1095 1706 GTP-binding protein (SAS1)
BASE COUNT 929 a 249 c 238 g 727 t 2 others
ORIGIN
1 gggaattatt aggacatcag gtttaaaacc tattcagaca ccagaataca atttgaattg
61 agcggcaacg ttcctttcac tctgcactac atcagcatta ttagagagaa aggttgaaaa
121 acctctatcg aaggtggtgg aattgctgag aagtaacagc aataaataaa acattcaaac
181 cgatagatga gaggttcaaa atccatctag ttagtagggc taaaaaacta caaatcataa
241 acccgatccg atacctaaga ctcctttttt tttttttttt tttttaataa atcaaataat
301 cacatgacct tggagtcttg gtctgcccac gaatttaaag tgcaaagttt attttattta
361 aactgggtgc atgcaaacat tactctatcg accgatttat ccaattttaa tactaaaatc
421 ttaaaaacca gaaagaanna ataataataa taataataat aataataata ataataataa
481 taataataat aataataata ataataataa taataataat aataataata ataataataa
541 taataataat aataataata ataataacaa ccttatttga aaattcaaat taaaaaaaaa
601 agaaatagct ttacatttta aaattaaaat tcataaataa aaccattata aaaatattga
661 agtatatcaa taggtttaat ttaattattg tttatttaat aaaaaaaaaa aaaaaaaaaa
721 aaaattattt aatcggttca atttaacttt ttcgaagaat tatttttttt aagaaaacat
781 ttcaacccaa aaaaataaaa aaaataaaaa aataaaaatt taaatcgaat ggttgaaatg
841 ttttcttaaa aaaacaaaaa ttaaaataaa ttttattttt tttgaattaa atttcaattc
901 agcaattcaa taattttaac gttttcactt catcaaaaat tataaataga atattaaaca
961 caacacaaca caactatcca aactaaaaca attaaaatca aaactctaat tttttataaa
1021 aatttattta ttttctcatc tcaataaaaa catttaaaaa cataattggt aatatagata
1081 tttttttcaa aataatgact tctccagcaa caaataaacc agcagcctac gattttttag
1141 ttaaattact tttaattggg gatagtggtg taggaaagtc atgtctttta ttacgttttt
1201 ctgatggttc tttcacacca agtttcatcg ctactattgg tatcgatttc aaaattcgta
1261 caattgaatt agagggtaaa agaattaaat tacaaatttg ggacactgca ggtcaagaaa
1321 gattcagaac tatcactaca gcatactatc gcggtgctat gggtatccta ttggtttatg
1381 atgtcactga tgaaaaatct tttggtagca ttagaaattg gattagaaat atcgagcaac
1441 atgcttcaga ctcagttaat aaaatgttaa tcggtaataa atgtgatatg accgaaaaga
1501 aagttgttga tagctcaaga ggtaaatcac ttgcagacga atatggtatt aaatttttag
1561 aaacttctgc caaaaacagt gtaaatgtag aggaagcctt tattggttta gcaaaagata
1621 ttaaaaaacg tatgattgat acaccaaatg atcctgatca taccatatgc attactccaa
1681 acaataagaa aaatacttgt tgttaaattg gggccatttt aattttcaca ttattagatg
1741 aaaaaaaaaa aaaaaaaaaa ctaaaattaa aagtaaaaaa cacttttttt tatttaaaaa
1801 tattattttt cattagtcat gaatggttac gtctaaacga tctaatattt ctctatagta
1861 gtgaattatt gcttcatgaa ttttagtgaa aagtttagct taataataat aataataata
1921 ataataataa taataataat aataataata ataataataa ataataataa caattttaaa
1981 attaaatatc caatgttgaa tattttaagt caaaaataat aataataatt ggaatgtatt
2041 ttaaaattaa aattcataaa taaactatta attattgttt attgccttta atggctaacc
2101 tattttttat agtttaaaaa taatttataa ttaatttttt taaat
//
LOCUS DDISAS2A 989 bp ds-DNA INV 10-JUL-1990
DEFINITION D.discoideum GTP-binding protein (SAS2) gene, complete cds.
ACCESSION M34457
KEYWORDS GTP-binding protein.
SOURCE D.discoideum DNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 989)
AUTHORS Saxe,S.A. and Kimmel,A.R.
TITLE SAS1 and SAS2, GTP-binding protein genes in Dictyostelium
discoideum with sequence similarities to essential genes in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 2367-2378 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 49 675 GTP-binding protein (SAS2)
BASE COUNT 421 a 141 c 142 g 285 t
ORIGIN
1 atcaatcaat aaactacaaa tttataatat agatattttt tcgaaataat gacttctcca
61 gcaacaaata aatcagcagc ctacgattat ttaattaaat tacttttaat cggtgatagt
121 ggtgtaggta aatcatgtct tttattacgt ttttctgaag attctttcac accaagtttc
181 atcactacta ttggtatcga tttcaaaatt cgtacaattg aattggaagg taaaagaatt
241 aaattacaaa tttgggatac tgcaggtcaa gaaagattca gaactatcac tacagcatac
301 tatcgtggtg ctatgggtat cctattggtt tatgatgtca ctgatgaaaa atcttttggt
361 aacattagaa attggattag aaatatcgag caacatgcta cagactctgt taataaaatg
421 ttaatcggta ataaatgtga tatggctgaa aagaaagttg ttgatagctc aagaggtaaa
481 tcacttgcag acgaatatgg tattaaattt ttagaaacct cagccaaaaa cagtataaat
541 gtagaggaag cctttattag tttagcaaaa gatattaaaa aacgtatgat tgatacacca
601 aatgaacaac cacaagttgt tcaaccaggt acaaatcttg gtgcaaataa caataagaaa
661 aaagcttgtt gttaaattgg gtgctatttt aattttcaca ttatattatt agataaaaat
721 aaaaaaaaaa aaaaaaatct taaaaaaaaa aaaaaaagtc atcaaaatta ttcacctaaa
781 aaaataacat ataaaccctg ggtttcaagg cagaggatga ttcacttaca acaacaacaa
841 caacaaccaa caacaacaac aacaaccaac aacaacaact aacaacaaca acaaataata
901 ataataataa aaataataat aataaatccc caagttgtga agttgtgttg aaattaataa
961 gagtgggagg tttatatcgc ataaataac
//
LOCUS HUMLAMBA 2850 bp ss-mRNA PRI 10-JUL-1990
DEFINITION Human lamin B mRNA, complete cds.
ACCESSION M34458
KEYWORDS intermediate filament; lamin B.
SOURCE Human T-cell line MOLT-4, cDNA to mRNA, clone LAM-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2850)
AUTHORS Pollard,K.M., Chan,E.K.L., Grant,B.J., Sullivan,K.F., Tan,E.M. and
Glass,C.A.
TITLE In vitro posttranslational modification of lamin B cloned from a
human T-cell line
JOURNAL Mol. Cell. Biol. 10, 2164-2175 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 342 2102 lamin B
mRNA < 342 2850 lamin B mRNA
signal 2834 2839 polyA signal
BASE COUNT 776 a 614 c 748 g 712 t
ORIGIN
1 cgcgagcagg agacggcggc gggcgaaccc tgctgggcct ccagtcaccc tcgtcttgca
61 ttttcccgcg tgcgtgtgtg agtgggtgtg tgtgttttct tacaaagggt atttcgcgat
121 cgatcgattg attcgtagtt cccccccgcg cgcctttgcc ctttgtgctg taatcgagct
181 cccgccatcc caggtgcttc tccgttcctc taaacgccag cgtctggacg tgagcgcagg
241 tcgccggttt gtgccttcgg tccccgcttc gccccctgcc gtcccctcct tatcacggtc
301 ccgctcgcgg cctcgccgcc ccgctgtctc cgccgcccgc catggcgact gcgacccccg
361 tgccgccgcg gatgggcagc cgcgctggcg gccccaccac gccgctgagc cccacgcgcc
421 tgtcgcggct ccaggagaag gaggagctgc gcgagctcaa tgaccggctg gcggtgtaca
481 tcgacaaggt gcgcagcctg gagacggaga acagcgcgct gcagctgcag gtgacggagc
541 gcgaggaggt gcgcggccgt gagctcaccg gcctcaaggc gctctacgag accgagctgg
601 ccgacgcgcg acgcgcgctc gacgacacgg cccgcgagcg cgccaagctg cagatcgagc
661 tgggcaagtg caaggcggaa cacgaccagc tgctcctcaa ctatgctaag aaggaatctg
721 atcttaatgg cgcccagatc aagcttcgag aatatgaagc agcactgaat tcgaaagatg
781 cagctcttgc tactgcactt ggtgacaaaa aaagtttaga gggagatttg gaggatctga
841 aggatcagat tgcccagttg gaagcctcct tagctgcagc caaaaaacag ttagcagatg
901 aaactttact taaagtagat ttggagaatc gttgtcagag ccttactgag gacttggagt
961 ttcgcaaaag catgtatgaa gaggagatta acgagaccag aaggaagcat gaaacgcgct
1021 tggtagaggt ggattctggg cgtcaaattg agtatgagta caagctggcg caagcccttc
1081 atgagatgag agagcaacat gatgcccaag tgaggctgta taaggaggag ctggagcaga
1141 cttaccatgc caaacttgag aatgccagac tgtcatcaga gatgaatact tctactgtca
1201 acagtgccag ggaagaactg atggaaagcc gcatgagaat tgagagcctt tcatcccagc
1261 tttctaatct acagaaagag tctagagcat gtttggaaag gattcaagaa ttagaggact
1321 tgcttgctaa agaaaaagac aactctcgtc gcatgctgac agacaaagag agagagatgg
1381 cggaaataag ggatcaaatg cagcaacagc tgaatgacta tgaacagctt cttgatgtaa
1441 agttagccct ggacatggaa atcagtgctt acaggaaact cttagaaggc gaagaagaga
1501 ggttgaagct gtctccaagc ccttcttccc gtgtgacagt atcccgagca tcctcaagtc
1561 gtagtgtacg tacaactaga ggaaagcgga agagggttga tgtggaagaa tcagaggcga
1621 gtagtagtgt tagcatctct cattccgcct cagccactgg aaatgtttgc atcgaagaaa
1681 ttgatgttga tgggaaattt atccgcttga agaacacttc tgaacaggat caaccaatgg
1741 gaggctggga gatgatcaga aaaattggag acacatcagt cagttataaa tatacctcaa
1801 gatatgtgct gaaggcaggc cagactgtta caatttgggc tgcaaacgct ggtgtcacag
1861 ccagcccccc aactgacctc atctggaaga accagaactc gtggggcact ggcgaagatg
1921 tgaaggttat attgaaaaat tctcagggag aggaggttgc tcaaagaagt acagtcttta
1981 aaacaaccat acctgaagaa gaggaggagg aggaagaagc agctggagtg gttgttgagg
2041 aagaactttt ccaccagcag ggaaccccaa gagcatccaa tagaagctgt gcaattatgt
2101 aaaattttca actgtcttcc tcaaaataaa gaagtatggt aatctttacc tgtatacagt
2161 gcagagcctt ctcagaagca cagaatattt ttatatttcc tttatgtgaa tttttaagct
2221 gcaaatctga tggccttaat ttcctttttg acactgaaag ttttgtaaaa gaaatcatgt
2281 ccatacactt tgttgcaaga tgtgaattat tgacactgaa cttaataact gtgtactgtt
2341 cggaaggggt tcctcaaatt ttttgacttt ttttgtatgt gtgttttttc ttttttttta
2401 agttcttatg aggaggggag ggtaaataaa ccactgtgcg tcttggtgta atttgaagat
2461 tgccccatct agactagcaa tctcttcatt attctctgct atatataaaa cggtgctgtg
2521 agggagggga aaagcatttt tcaatatatt gaacttttgt actgaatttt tttgtaataa
2581 gcaatcaagg ttataatttt ttttaaaata gaaattttgt aagaaggcaa tattaaccta
2641 atcaccatgt aagcactctg gatgatggat tccacaaaac ttggttttat ggttacttct
2701 tctcttagat tcttaattca tgaggagggt gggggaggga ggtggaggga gggaagggtt
2761 tctctattaa aatgcattcg ttgtgttttt taagatagtg taacttgctt aaatttctta
2821 tgtgacatta acaaataaaa aagctctttt
//
LOCUS VIBANGRA 4379 bp ds-DNA BCT 10-JUL-1990
DEFINITION V.anguillarum trans-acting transcriptional activator (angR),
S-acyl fatty acid synthesis thioesterase-like protein genes,
complete cds, and outer membrane protien (omp), 3' end.
ACCESSION M34504
KEYWORDS S-acyl fatty acid synthesis thioesterase-like protein;
outer membrane protein; trans-acting transcriptional activatior.
SOURCE V.anguillarum DNA, clone pJHC-A103.
ORGANISM Vibrio anguillarum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 4379)
AUTHORS Farrell,D.H., Mikesell,P., Actis,L.A. and Crosa,J.H.
TITLE A regulatory gene, angR, of the iron uptake system of Vibrio
anguillarum: Similarity with phage P22 cro and regulation by iron
JOURNAL Gene 86, 45-51 (1990)
STANDARD simple staff_entry
COMMENT Fur protein is a product of the ferric uptake regulatory gene
(fur).
FEATURES from to/span description
pept < 1 275 outer membrane receptor protein
pept 361 3507 trans-acting transcriptional activator (angR)
pept 3504 4262 S-acyl fatty acid synthase thioesterase-like
protein (ORF6)
binding 154 158 Fur binding site
binding 348 353 ribosome binding site (put.)
binding 3488 3493 ribosome binding site (put.)
signal 34 39 -35a region (put.)
signal 63 68 -10a region (put.)
signal 109 114 -35b region (put.)
signal 126 131 -10b region (put.)
signal 3387 3392 -35c region (put.)
signal 3405 3410 -10c region (put.)
signal 3443 3448 -35d region (put.)
signal 3456 3461 -10d region (put.)
BASE COUNT 1320 a 914 c 902 g 1243 t
ORIGIN
1 ggaacctacc agtgatgcgt caacttactc ttattggtca agcaaattac atgtcagagc
61 aatatattga tgcacaaaac actcaatcac tgtctgcaca gactattttt gatttaggtg
121 ctcgctataa ctctaccatc gccaatcaaa gtgtcatttg gcgtcttgcg gtcaacaacg
181 taaccgatga agcatattgg actaccaccc attacgctag ccttgcgttg ggtgcccctc
241 gtacggtgat gctatctgct acagcggatt tttaatctcg gtcaattttg cccttgacct
301 ttctggttaa gggcattcgt cttccccttc cccccatttg gctttttatg agaatttaga
361 atgaatcaaa atgaacatcc cttcgctttc cctgagacaa aattaccttt aacctccaat
421 caaaattggc agttatcaac ccaaagacag cgtactgaaa aaaaatcgat taccaatttt
481 acgtatcagg aatttgatta cgaaaacatt tcgagggaca cattagaacg ctgcctcaca
541 acaataatta agcatcaccc aatattcgga gctaagctca gtgacgactt ctacctccat
601 tttccgagca aaactcacat tgaaaccttt gcagttaatg acttaagtaa tgccttaaaa
661 caagatattg ataaacagtt ggccgatacg cgttctgcag taacgaaaag ccgctcacaa
721 gcgataatct ctatcatgtt tagtatattg ccaaaaaaca taatcaggct tcatgtacgc
781 ttcaactcag ttgttgtaga taatccaagt gttacgcttt tttttgagca gcttactcag
841 ttattatcgg gaagtcccct ttctttttta aatcaagaac agactatctc cgcatacaat
901 cacaaagtta ataatgagtt gcttagtgtt gatcttgagt ccgcaagatg gaatgaatat
961 attctaacac tacctagttc agcaaacctt cccacaattt gtgaacccga aaaactggat
1021 gaaaccgata tcactcgcag gtgcattaca ctgtcacaaa ggaagtggca gcagttggtt
1081 actgttagca aaaaacataa tgtcacaccg gagataactc ttgccagtat attttcgacc
1141 gttttatcac tctgggggca tcaaaaatac ctcatgatga gatttgatat caccaaaatc
1201 aatgactaca cgggcatcat aggccagttt accgaacctt tattagtggg tatgtccggc
1261 tttgagcaga gctttctttc tcttgttaaa aacaaccaaa aaaagttcga agaagcttat
1321 cattatgacg ttaaagtacc tgtttttcag tgtgttaata aattatctaa tatttcggat
1381 tctcaccgtt atcctgctaa tatcactttt tctagcgagc ttttaaacac aaaccatagc
1441 aaaaaagctg tatggggatg tcgtcaatca gccaatactt ggctttcttt acatgctgta
1501 atcgagcaag aacaacttgt cttacaatgg gacagccaag acgcaatctt cccaaaagac
1561 atgatcaaag atatgttaca tagttacacc gatttattag acttactcag ccaaaaagat
1621 gtcaactggg cacagccttt accaactttg ctgccaaaac atcaggagtc catacgcaat
1681 aaaataaatc aacagggaga cctagaacta actaaagaac tcctccatca gcgttttttt
1741 aaaaacgtag agtccacccc taatgctctt gcgattatcc atggtcaaga gtcattagat
1801 tatataactt tagcaagcta cgccaagagt tgtgcgggtg cactaaccga agctggagta
1861 aaatcaggag accgcgttgc tgtcactatg aataaaggca ttggtcaaat agtggcagta
1921 ttgggaatat tatatgctgg ggctatttat gttcctgtct ctctagatca accacaagaa
1981 aggcgggaaa gtatttatca aggtgctgga attaacgtta ttcttattaa cgaatcagat
2041 agtaaaaatt ccccttcaaa tgatcttttc tttttcctgg actggcaaac agcgataaag
2101 agtgagccaa tgcgtagccc tcaagatgtc gcgccaagtc aaccagccta tattatctac
2161 acatcaggct caacaggaac ccctaaggga gtggtgattt ctcaccaagg cgctcttaat
2221 acatgtatcg cgatcaatcg acgttatcaa attgggaaaa atgatcgagt attggctctt
2281 tcagcactac attttgacct ttcggtatac gacatctttg gcctactttc tgccggcggc
2341 actatcgtat tagtcagtga gcttgaaaga cgtgacccga ttgcttggtg tcaagcaatt
2401 gaggagcata atgtcaccat gtggaatagc gtcccagcat tatttgatat gttattaact
2461 tacgctactt gctttaactc tatcgctccc tcaaaactcc gtttaaccat gctttcggga
2521 gactggattg gattagattt accgcagcgt tatcgcaatt atcgtgtaga tggccaattt
2581 attgcgatgg gaggagccac cgaagcatcg atatggtcaa acgtctttga cgtagagaaa
2641 gttccgatgg agtggcgctc tatcccttat ggctatcctc tacctagaca acaatatcga
2701 gttgtcgatg acttggggcg agattgccca gattgggtag ctggcgaact ttggattggt
2761 ggtgacggta tcgcactggg gtattttgac gatgaattga aaacgcaagc tcagttttta
2821 catattgatg gccatgcttg gtatcgtact ggtgacatgg gctgttattg gccagatggt
2881 actcttgagt tcttggggcg aagagacaag caggtcaaag taggaggtta cagaattgag
2941 ttgggagaaa tcgaagttgc actcaataat ataccggggg tgcagcgtgc ggttgctatc
3001 gcagtgggca ataaagacaa aactctagca gcattcatcg ttatggattc ggagcaagca
3061 ccaatagtta cagcgccgtt ggatgcagaa gaagttcaac ttttgttgaa caaacaactg
3121 cctaactaca tggttcccaa acgcataatt ttccttgaaa ccttccccct aaccgctaat
3181 ggtaaagtcg atcataaagc tctaactcga atgactaacc gagaaaagaa aacatctcaa
3241 agcataaata aacctattat tactgcgagt gaagatagag tagccaaaat ttggaatgac
3301 gttcttggtc ctacagaact ctataaatcg agtgatttct ttttgtcggg aggagatgca
3361 tacaacgcaa tagaggtagt caaacgttgt cataaagctg gctatctaat caagctatca
3421 atgttgtacc gttattctac gattgaagct ttcgctatta tcatggaccg ttgtcgatta
3481 gcacctcagg aagaggctga gttatgagcc ctttaatcaa acttgcagcc tcttcgaggc
3541 tgcatgatgc aactcattat gttttatgcc cttttgcagg aggtggtagt ggtgcattta
3601 gacactggcg tacattatcc cttgaaaatg aagtgatttc ggtaatgctt tatcctggta
3661 gagaatttcg tatagacgac ccaacagtca taaacatcgg cacattagca gaagaaatga
3721 tccaagcttt aaaaacctgt aatcaacgaa tagaagatac gatcattgtc ggtcatagta
3781 tgggcgcgca agtggcgtat gaagcaagta aaaaactagt aaatcagggg ctatttctga
3841 aagggctgat catctctggt tgtcaagctc ctcatatcaa agggcgaagg ttactaggtg
3901 aatgcgatga taaaaccttt attcataatc tagtcgagat tggagggtgt gatccaagtt
3961 tagctaaaag tccagagtgg tggccgatat ttctgccagc tttgagggcg gactttacgg
4021 ctacagaaca gtatattttc acatcacttc caaatgataa ggaaggcctt cctatcccaa
4081 ctctattgat ttcaggtgat caagatagag aagctaactt ttcagaaata gaagagtgga
4141 aactttggtg taataaagtc gttgatcatt tagtggtcga gggcgggcat ttctatataa
4201 cagagcaacc tcaaatgatg cttgaatgca tccgggcttt atcaaccgaa acgactgcct
4261 aatactaagg ttcggttgat agatttttag acaaacaact tcaaacgaca agggtatgca
4321 tttaagcaat gcataccctg ggcttttcga tcaacactat tacttggttt ccggaattc
//
LOCUS VIBLUXABG 3200 bp ds-DNA BCT 10-JUL-1990
DEFINITION P.leiognathi luciferase alpha (luxA), beta (luxB) subunit, and
gamma protein (luxG) genes, complete cds.
ACCESSION M34564
KEYWORDS gamma protein; luciferase.
SOURCE P.leiognathi (strain 554) DNA, clone pPHL[6,11,12].
ORGANISM Photobacterium leiognathi
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 3200)
AUTHORS Illarionov,B.A., Blinov,V.M., Donchenko,A.P., Protopopova,M.V.,
Karginov,V.A., Mertvetsov,N.P. and Gitelson,J.I.
TITLE Isolation of bioluminescent functions from Photobacterium
leiognathi: Analysis of luxA, luxB, luxG and neighboring genes
JOURNAL Gene 86, 89-94 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 145 ORF1
pept 182 1246 luciferase alpha-subunit (luxA)
pept 1295 2272 luciferase beta-subunit (luxB)
pept 2293 2979 gamma protein (luxG)
pept 3081 > 3200 ORF2
binding 171 174 ribosomal binding site (put.)
binding 1284 1287 ribosomal binding site (put.)
binding 2281 2284 ribosomal binding site (put.)
binding 3072 3075 ribosomal binding site (put.)
BASE COUNT 1092 a 616 c 586 g 906 t
ORIGIN
1 tcgagcagcc attggcttag acagtgaagt gattgattta gttgatgata ttagtgagcc
61 aaactttgaa gatctcacca ttattacagt taatgaacgt cgtttgaaaa ataaaattga
121 aaacgaaatg ttcgctagcg cttaaaccaa tacctattca agtcatcaaa aggaaaagat
181 aatgaaattt ggcaatattt gtttctcata ccagccacca ggtgaatctc ataaagaagt
241 catggatcgc tttgttcgtc ttggcgttgc ttcagaagaa ttaaacttcg acaccttctg
301 gacacttgag caccacttca ctgaattcgg cctaacaggt aacttatatg ttgcttgtgc
361 caatattctt ggtcgtacca aaaaacttaa cgtcggcaca atgggtatcg tactaccaac
421 agctcaccct gctcgccaaa tggaagatct actgctactg gatcaaatgt caaaaggacg
481 ttttaacttt ggtgtagtac gtggtctata ccataaagat ttccgggtat ttggtgttac
541 gatggaagat tctcgttcga tcactgaaga tttccataaa atgatcatgg acggctctaa
601 atcaggcgtt ttacacactg atggtaaaaa cattgaattc ccagatgtaa atgtctatcc
661 agaggcctac ctagacaaga tccctacttg tatgacagcg gaatctgcgg cgacaacgac
721 ctggctagca gaacgtggtt tgccaatggt actgagctgg atcatcacca ccagcgagaa
781 aaaagcacag atggaactat acaatgaaat tgcagctgag catgggcacg atattcacaa
841 tatcgaccac agcatgacct tcatctgttc cgttaatgaa gatccagaaa aagcagaaag
901 tgtctgccgt gacttcctat caaactggta cgagtcctac accaatgcga ccaatatctt
961 taaagacagt aaccaaactc gtggttatga ctatcacaaa ggtcaatggc gtgactttgt
1021 actacaaggc cataccgata cccgtcgtcg tcttgattac agtaataacc taaaccctgt
1081 tggtacacct gaaaaatgta ttgaaattat ccagcgagat atcgatgcaa cagggatcaa
1141 caacatcacc cttggttttg aagcaaacgg ttctgagcaa gaaatcatcg catcgatgga
1201 acgcttcatg acacaagtgg cgccatacct aaaagatccg aaataaactg ccacattaaa
1261 gccattgaat taaattataa ataaggaaaa aaacatgaat tttggattat tctttctgaa
1321 ctttcagctc aaaggtatga catctgaagc agtactagac aacatgatcg atactattgc
1381 tttggttgat aaagacgagt accacttcaa aaccgcattt gtgaacgaac accatttttc
1441 taaaaacggt atcgttgggg cacctatgac agctgcaagt tttctactag gtttaactga
1501 acgccttcat attggttcat tgaatcaagt gatcaccact caccacccag tccgtattgc
1561 agaagaagct agcttacttg atcaaatgtc agatgggcgt tttattcttg ggttaagtga
1621 ttgtgttagt gatttcgaga tggacttctt taaacgccaa cgagatagcc aacaacaaca
1681 attcgaagcc tgttacgaaa ttctaaatga cggtatcact accaactact gttatgcgaa
1741 taatgacttt tataacttcc caaaaatctc tatcaaccca cactgtatta gtaaagaaaa
1801 cctaaaacag tatattttag cgaccagcat gggcgtggtg gaatgggctg cgaaaaaagg
1861 gttaccactg acttaccgct ggagtgatac gctggcagaa aaagaaaatt actatcaacg
1921 ttatttaact gtcgccgctg aaaataatgt cgacattact catgttgatc accaattccc
1981 attacttgtt aacattaatc cggatcgtga tattgctaaa caagaaatgc gtgactatat
2041 ccgtggttat attgctgaag cttacccaaa tacagatcaa gaagaaaaaa ttgaagagct
2101 aattaagcaa catgcggttg gtacagaaga tgaatattat gaatcatcta aatatgcttt
2161 agaaaaaaca ggttcaaaga atgtattgct atcttttgaa tcaatgaaaa ataaagccgc
2221 tgtcatcgac cttattaata tggttaatga aaaaatcaag aaaaatctat aataaataac
2281 aggataataa aaatgacaaa atggaattat ggcgtcttct tccttaattt ttaccatgta
2341 ggacagcaag agccatcatt aaccatgagc aatgcgttag aaacattacg tattatagat
2401 gaagatacat ctatctatga tgttgttgca tttagcgaac accacataga taaaagctac
2461 aatgatgaaa cgaaattagc gccatttgtt agccttggca aacaaattca tattttagcc
2521 accagccctg aaacggttgt aaaagcggct aaatatggga tgccactact gtttaaatgg
2581 gatgatagtc aacaaaagcg tatcgaatta ttaaaccatt accaagcagc tgcggctaaa
2641 tttaatgtcg atattgcagg tgttcgtcat cgattaatgt tatttgtcaa tgttaatgac
2701 aacccaacgc aagccaaagc tgagcttagc atttacttag aagattacct ctcttacacc
2761 caagcagaaa catccattga tgaaatcatc aatagcaatg ctgcaggcaa cttcgatacg
2821 tgtttacatc acgttgctga aatggctcaa ggtttaaata ataaagtcga tttcttattt
2881 tgctttgaat cgatgaaaga tcaagagaat aaaaaatcac taatgattaa ctttgataaa
2941 cgcgttatta attatagaaa agaacacaac cttaactaat tcagttaagt caatttaaat
3001 taaaacttcg tcaatcattg tcattattaa tggcagtgtg gcttcttacg ctgccattaa
3061 attttttatt aaggtgtaat atgactactt tattagatat tgatactaac gatattattg
3121 ttagttcaga actcgatgat attattttct catcatcacc gtttacatta acctttgatg
3181 agcaagaaaa attaaagcaa
//
LOCUS YSCSLP1A 3456 bp ss-mRNA PLN 10-JUL-1990
DEFINITION S.cerevisiae vacuolar function expression protein (SLP1) gene,
complete cds.
ACCESSION M34474
KEYWORDS .
SOURCE S.cerevisiae, cDNA to mRNA, clone pYKK101.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3456)
AUTHORS Wada,Y., Kitamoto,K., Kanbe,T., Tanaka,K. and Anraku,Y.
TITLE The SLP1 gene of Saccharomyces cerevisiae is essential for vacuolar
morphogenesis and function
JOURNAL Mol. Cell. Biol. 10, 2214-2223 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 692 2767 SLP1 protein
signal 536 544 TATA box
BASE COUNT 1136 a 588 c 691 g 1041 t
ORIGIN
1 ctgcagctaa tcacgtgctc acatctttac tcaatgagat tgatggtgtt gaagagttaa
61 agggtgtagt tattgtagcg gcgacgaata gacctgatga aatagatgct gctcttctaa
121 ggcctggtag gttagataga cacatttacg ttggccctcc agacgtaaac gcccgcttgg
181 aaatcttaaa gaagtgcaca aagaaattta atacagaaga gtctggagtc gatcttcatg
241 aattggcaga ccgtacagaa ggttattccg gagctgaagt tgtgctgctt tgtcaagaag
301 cgggcttggc tgccataatg gaagatttag atgtcgcaaa agtggaatta cgtcattttg
361 agaaagcttt taaaggaatt gctaggggca ttactccaga aatgctctct tattatgaag
421 agtttgctct aagaagcggt tcatcttcgt aagcttgttc atagtcaatt cttttccttt
481 gtgtgctcaa taatagtaga tagaaattat actgaactcc ggtcattttg tataatatat
541 taatcacttc acacgaacat acataaataa aatatcataa aggttagcaa attggaacta
601 gttatatgtt aattagttaa aagatagaaa attcgagaaa ggaagaaaaa gctgatattg
661 cccatctcca actttatcaa atcatttcac gatgaataga ttttggaata ctaagaaatt
721 ttcattaaca aatgccgatg gactatgtgc taccttaaat gagatatctc aaaatgatga
781 agttcttgtg gttcaaccaa gtgtattgcc agtactcaat agtttgctaa ctttccaaga
841 tttgactcaa tcaactcctg taaggaaaat tacgttactc gatgatcagc taagtgacga
901 tttaccgagt gccttaggca gcgttccgca aatggatctt atttttctta ttgatgtcag
961 aacatctctc cgactccctc cacaactgct tgatgctgct caaaagcaca atttatcatc
1021 tttgcatata atatactgtc gatggaaacc gtctttccaa aatactttgg aggatacaga
1081 gcaatggcaa aaggatggtt tcgatttgaa ttcaaaaaaa acacatttcc ctaacgtcat
1141 tgaatctcag ttaaaggagc tatcgaacga atataccctt tacccttggg atctcttgcc
1201 cttcccacag attgatgaaa atgttctatt gactcattcc ctttataaca tggaaaatgt
1261 aaacatgtat tatcccaact tacgttcttt gcagagtgcc acagagtcaa tactggttga
1321 tgatatggtc aattcgttgc agagcttgat ttttgaaact aatagtatca taacaaatgt
1381 tgtgtcgata ggtaatctgt ctaagagatg tagccatctt ttgaagaaac gaatcgatga
1441 gcatcaaaca gagaatgatt tattcatcaa gggtacgctt tatggtgaac gaaccaactg
1501 tggactagaa atggacttga ttatcttgga aaggaatacc gatcctataa cgccattgtt
1561 gacacaactt acgtatgcag gaatactaga tgatctatat gaattcaatt ctggcataaa
1621 gataaaggag aaagacatga acttcaatta taaggaagat aaaatatgga atgatttgaa
1681 atttttaaat tttgggtcga ttgggccgca gttaaataaa ttggcaaagg aactacaaac
1741 gcaatatgat acaaggcata aagccgagag cgtacatgaa atcaaagaat tcgttgattc
1801 cttaggttct ttgcaacaaa ggcaagcttt tttgaaaaat cacacaacct tatcatccga
1861 cgttttgaaa gtggtagaga ctgaagagta cggatctttc aataaaatct tagagttaga
1921 gctggaaatt ttgatgggaa atacacttaa taacgacatt gaagatatta tactcgagtt
1981 gcagtaccag tacgaggttg atcaaaagaa gattctcaga ttaatctgtt tattgtctct
2041 ttgtaaaaat tcacttcgag aaaaggatta tgaatatcta agaaccttta tgatcgactc
2101 ttggggcatt gaaaaatgct ttcaacttga atcattggct gagttaggat ttttcactag
2161 caaaacggga aaaactgatt tgcatattac aacaagtaag tcaacaagat tacagaaaga
2221 ataccgttat atttcacaat ggttcaatac agtacccata gaagacgagc atgctgccga
2281 taaaatcaca aatgagaacg atgacttctc ggaagccact tttgcttaca gtggtgtagt
2341 gcccttgaca atgagactgg ttcagatgtt atatgatagg tctatcttgt tccataatta
2401 ttcctcgcag cagcctttta tactgtcaag agaacctaga gtttctcaaa cggaggattt
2461 aattgaacag ttatatggag actcacatgc gatcgaagag agtatatggg tcccgggaac
2521 cattacaaaa aagatcaatg caagcatcaa gagcaataat agacggtcca tagacggatc
2581 taatgggaca tttcatgctg cagaggatat tgcactcgta gtattcctcg gaggtgtaac
2641 aatgggtgaa atagctataa tgaagcattt gcaaaaaata ctaggtaaaa aaggtatcaa
2701 taaaaggttt atcatcatcg ccgatggctt gatcaatggc acaaggatca tgaactctat
2761 atcttaatta ttatatgata gatttgttaa ttttttgtat atgcaaatgt gcttttttca
2821 ccaaacggtt tgcaccaatc atacgagaga agtgttcggt gtttacggaa aagctagggg
2881 actaagaaaa attgaaaata aaggctgaca gcagtagaaa ccattgtgct ggcttagtga
2941 tttataagaa tggttaatta gttttgtatc ctttattttc tagatagagc cacagagcaa
3001 actaaacaga aaagttatcc atttccatta cgcaatgttg tgccaacaga tgattagaac
3061 gacagctaag agaagtagca atatcatgac cagacctatt atcatgaaga ggtcagtaca
3121 cttcaaagac ggtgtgtatg aaaatatccc attcaaagtc aaaggaagaa agacacctta
3181 cgccttatct catttcgggt tcttcgctat tggatttgct gttccatttg ttgcctgcta
3241 tgttcaattg aaaaagtcag gtgcttttta aaacaccccc ctaagttgaa ggatagatgt
3301 gtgtacatag cgtgcttggt tgagacgttt tagagtgtgt tctttgctat tcctaggtgc
3361 gcatatcatc gttttattta tttgtacaat tttcttttca tatattcata atcctctcct
3421 tgtgccttcg tattgagacg gcgggaaaga aggatc
//
LOCUS CHKMHBLBA 2405 bp ds-DNA VRT 10-JUL-1990
DEFINITION Chicken MHC class II B-LBII-beta gene, complete cds.
ACCESSION M29763
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Chicken (haplotype B12) DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 2405)
AUTHORS Zoorob,R., Behar,G., Kroemer,G. and Auffray,C.
TITLE Organization of a functional chicken class II B gene
JOURNAL Immunogenetics 31, 179-187 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 16; 2332 to 2405)
AUTHORS Zoorob,R., Behar,G., Kroemer,G. and Auffray,C.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Zoorob, 06-NOV-1989, for release after publication.
Author address: R.Zoorob
Institut d'Embryologie Cellulaire et Moleculaire du CNRS
et du College de France
49bis av. de la Belle Gabrielle
F-94736 Nogent sur Marne
France
FEATURES from to/span description
pept 828 918 MHC B-LII-beta chain, exon 1
1127 1396 MHC B-LII-beta chain, exon 2
1483 1764 MHC B-LII-beta chain, exon 3
1847 1957 MHC B-LII-beta chain, exon 4
2049 2072 MHC B-LII-beta chain, exon 5
2175 2188 MHC B-LII-beta chain, exon 6
pre-msg 808 2331 MHC B-LII-beta chain mRNA and intron
IVS 919 1126 MHC B-LII-beta chain intron A
IVS 1397 1482 MHC B-LII-beta chain intron B
IVS 1765 1846 MHC B-LII-beta chain intron C
IVS 1958 2048 MHC B-LII-beta chain intron D
IVS 2073 2174 MHC B-LII-beta chain intron E
BASE COUNT 380 a 728 c 902 g 395 t
ORIGIN
1 ggatccatgg gtgacgtaag gatgaggttc cagcacatat tggacccttc tgcgtttgca
61 tggagggatc ttcgggggat ctttgtgatc ttcagtgatt ttcagtggtc tttggtggtc
121 ttcagtgctc ttcgttggtc tttgacaaag atgcagagga gcaccgctcc cagacggacc
181 ccccggggac cccatttgtc gccatcccca ctgggacatg cagccattga ccacagccct
241 ccggctgcga ccacccaact gattccttat ccaaagtcca ctctttgcac acttacctcc
301 aatttagtga taaggatgtg gcgtgggacc gtcccaatgg ccgcacacaa gtccaggtag
361 atgatatggg atgaccatga agggatcaca gagaggaaca cggggtgacc acgaggagca
421 acgaaggaaa cgctgagtga ccacgggcag aaaatggtgt gaccattagg ggacaacgag
481 agggaacaga agtagtaagg agtgagaatg gggtgacaaa gaggtgacca tggcataact
541 ttgataagac cattgggtga ccgcagggtg atggccatac catggggtga gcactggatg
601 accatggagg tcattggagg accatcgggt gggacgaggg ccgtggggac acccgtgggg
661 cggtgggacg ggggcagagt gtcagaagga gccccgcggc gcagaactct gcctggagac
721 gggtgacgcc gcccggcgcc gccgccgctc attggccctc cccgcccggc cccgggctcg
781 cggctggcgc ggggtgccgg gtcccccatc gtccggcggc agcagccatg gggagcgggc
841 gcgtcccggc ggcgggggcc gtgctggtgg cactgctggc gctgggagcc cggccggccg
901 ccggcacgcg gccctcgggt gagctcggag ccgcggcgcg gggacggcgc tgcgtccccc
961 ccggagaaac ccccggagcc cttctggccg tgcgcagcgc tcggggctgc ggggggacgg
1021 agggcggggg ggggcggcgg agccgtgggg ggcagcgggg ccggggaggg ggcggggggt
1081 gtggcggggg gcggctgtgt gccctgaccg tgccctctgc ccgcagcgtt cttcttctgc
1141 ggtgcgatat ccgagtgcca ctacctgaac ggcaccgagc gggtgaggta tctgcaaagg
1201 tacatctaca accggcagca gttcacgcac ttcgacagcg acgtggggaa atttgtggcc
1261 gattcaccgc tgggtgagcc gcaagctgaa tactggaaca gcaacgccga gcttctggag
1321 aaccgaatga atgaagtgga caggttctgc cggcacaact acgggggtgt ggagtccttc
1381 acggtgcaga ggagcggtga gtgccgcggg gcgcagcgcg gacggacggg caggcgccgc
1441 gctctggcgg tcggtccgca gcgctccccc cgtgccccgc agtggagccc aaggtgaggg
1501 tctcggcgct gcagtcgggc tccctgcccg aaaccgaccg tctggcgtgc tacgtgacgg
1561 gcttctaccc gccggagatc gaggtgaagt ggttcctgaa cgggcgggag gagacggagc
1621 gcgtggtgtc cacggacgtg atgcagaacg gggactggac gtaccaggtg ctggtggtgc
1681 tggagaccgt cccgcggcgc ggggacagct acgtgtgccg ggtggagcac gccagcctgc
1741 ggcagcccat cagccaggcg tggggtaagg cccccgggcc ctgccccgcc gcggggggag
1801 cgggagcgcg gcccgccgcg ctgagccgcc gccttcgtcc ccgcagagcc gccggcggac
1861 gcgggcagga gcaagctgct gacgggcgtg gggggcttcg tgctggggct cgtcttcctg
1921 gcgctggggc tcttcgtgtt cctgcgcggt cagaaaggtg agcgctgggg aggggggctg
1981 cgcggggggg gtcgggagcg gggggtgggg ggcagcgtcc gcgctgacct cgtctcgctg
2041 tgtttcaggg cgccccgtcg ccgccgctcc aggtaacgtc ccgttcccat tcccgttccc
2101 gttcccgttc ccgttccgcg ctgcgagcgg ccccgatccc ggcgcggggc tcagctctgc
2161 ccgtctcccc gcagggatgc tgaattagct gctgccccgc cgagccgctg cacccgcacc
2221 ccccgctctc ccggccgtcg cctcggctct ccctcgggct gccaccgcgt ccgttggaga
2281 tgtcgccacg atgcacgctt cgtccccatc ctaataaacg cgctgacttt gaccccgctg
2341 ttcgctgccc gtgaatcatt ggggactttc cgtcgcgtgg gaggagggga gggaagtgaa
2401 agctt
//
LOCUS CHKMHBLIIB 444 bp ds-DNA VRT 10-JUL-1990
DEFINITION Chicken MHC class II B-LBIII-beta gene, exon 1.
ACCESSION M29764
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Chicken (haplotype B12) DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 444)
AUTHORS Zoorob,R., Behar,G., Kroemer,G. and Auffray,C.
TITLE Organization of a functional chicken class II B gene
JOURNAL Immunogenetics 31, 179-187 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Zoorob, 06-NOV-1989, for release after publication.
FEATURES from to/span description
pept 125 / 215 MHC B-LIII-beta chain, exon 1
pre-msg 105 > 444 MHC B-LIII-beta chain mRNA and intron
IVS 216 > 444 MHC B-LIII-beta chain intron A
BASE COUNT 50 a 156 c 176 g 62 t
ORIGIN
1 ctgatcgggg tacccgcaac ggagatctgc ctggagacgg gtgatgccgc ccagcccagg
61 cactcactgc tccagagcag cggcgcgggc tgccggcacc cttcctcctc ctccggcagc
121 agccatgggg agcggccgtg tcctggtggc cggggccgtg ctggtagcac tggtggcgct
181 gggagcacgg caggccgccg gcacgcggcc ctcaggtgag ctcggagtcc cggtgtgggg
241 atggtgcagg gtggtccctc ccggtgtctc ccggcgccca ccccagcccc gtgcgcagcg
301 ctcggagctc cgcggctcag gatgccggcg acagcgcgtc cgcagccgtc gtgggcgtgg
361 ggggcacggg acggagcgcg gacgggagtg gctttcgggt ctgccgaggg gcagctggct
421 cctgacggtg ccccctcccc gcag
//
LOCUS RATLY6A 1221 bp ds-DNA ROD 10-JUL-1990
DEFINITION Rat Ly6-A antigen gene, exon 2.
ACCESSION M30692
KEYWORDS antigen.
SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA and DNA,
(library of Clontech), clone RK6.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1221)
AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U.
TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated
from rat kidney
JOURNAL Immunogenetics 31, 104-111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Friedman, 14-DEC-1989, for release after publication.
FEATURES from to/span description
pept / 497 834 Ly6-A antigen, exon 2 (put.) (AA at 499)
IVS < 1 496 Ly6-A intron A (no splice consensus)
signal 1204 1209 poly-A signal
BASE COUNT 286 a 315 c 314 g 306 t
ORIGIN
1 gtagtccggc tgctggctga gttgtaaggc aggagggagg ctgggtgtgt tttgtcttgc
61 atgtagccct ctctgcagag ggcctggctt cactcacaca agcctggtaa catctggtac
121 atcgaactct aagaatcggc aagcccactg ctgccgtctc cttaagagtt catttaggga
181 gtctgtcagg aacttgggca ggagtccaca ctaagggaag cttacttccc aaacagtggt
241 gctgggtgga aagtggagga ctcatgagaa cccctagttt aagactttta gagaagcagt
301 ctgaagcact gtggagatgt ggtcccatcg ccatcctgga gtagggataa ttttgcccag
361 gagccccagc aatgggtcag aggagcaaaa cgacgacagc tgtaagtggt ctcagaagat
421 gctagaggaa acagaagatg aactggcagc tgagacttgg cggtaactta ctggcttcga
481 cactatgcgt gttactctca gggcctaaac tgctacaatt gcacgatgat cccatttggt
541 aatacctgct catcaactgc tacctgcccc taccctgatg gagtctgtgc tattcaggtg
601 gcagaagttg ttatgagctc tgtaagacag aaagtaaagg accatatttg ccttcccgtc
661 tgcccaacga gtcctcaaac aaccgagatc ctgggtactg ttgtcgacat gaagatttcc
721 tgttgcaata cagatctttg caacgcagca gggcccactg gaggcagcac ctggaccatg
781 gcaagggtgc ttctgttcag cctgggctca ttcctcctgc agaccttgct gtaatggctc
841 ctccaaggcc ccgccaccct tgtcctttta tcctcatgtg taatcactcc tccctggagc
901 cctctagtga taaattctga gtaatagaaa ctctgaggtg ggggtagggt gtggaacacc
961 ttgtttcaac tctatagccc ctgctgggta ggtgccccac tcccctctct agggctttca
1021 gatatgtact tcctggaatg ccattatgtt gtggtttgct gctcttggcc ctggaggcat
1081 gtggacagca cggggaagag acagaaaccc aaggcactgt gtgaccacct ccatccatac
1141 ataaaaatct ggggtcctgc agggttccca cacatgcctc tcaacatccc cctatttgag
1201 tccaataaac tctctgttct c
//
LOCUS RATLY6B 905 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Rat Ly6-B antigen mRNA, complete cds.
ACCESSION M30689
KEYWORDS antigen.
SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA, (library of
Clontech), clone RK10.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 905)
AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U.
TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated
from rat kidney
JOURNAL Immunogenetics 31, 104-111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Friedman, 14-DEC-1989, for release after publication.
FEATURES from to/span description
pept 103 510 Ly6-B antigen (put.)
signal 878 882 poly-A signal
signal 599 604 poly-A signal
BASE COUNT 206 a 248 c 213 g 238 t
ORIGIN
1 ctcttgctct cctccagcca caagtggtct cagaagatgc tagaatgtag aggaaacaga
61 agatgaactg gcaggttttg cctgtgcgcc ccttctcaga ggatgaacag atcttgtgct
121 atgaagtcct gtgtgctcat ccttctcctg gccctactgt gtgcagaaag agctcagggg
181 ctaaactgct acaattgcac gatgatccca tttggtaata cctgctcatc aactgctacc
241 tgcccctacc ctgatggagt ctgcactatt caggtggcag aagttgttgt gagctctgta
301 agactgaaag taaagagcaa tctctgcctt cccggctgcc ccaagagtcc tcaaacacct
361 gaggtcctcg gtaccgttgt ccatgtgaat actgactgtt gcaatacaga tctttgcaac
421 gcagcaggtc ccactggagg cagcacgtgg accatggcag gggtgcttct gttcatcctg
481 ggctcagtcc tcctgcagac cttgctgtga tggaccctcc aaggccctgc cacccttgtc
541 cttttatcct tatgtgtaat cactccttcc tggagccctc tagtgataaa ttctgagtaa
601 taaaaattca gaggggggat tgagtgtgga acaccttgtt gcaactctat agccactgct
661 ggataggttc cccactcccc tctctagggc tttcagatat gtacttccta gaatgccatt
721 gtgttttggt ttgctgctct tggccctgga ggcaggggac agcacgggga agaggcagaa
781 acccaaggca ctgtgacacc acctccatcc atacataaaa atctggggtt ctgcagggtt
841 cccacacatg cctctgaaca tccccctatt tgagtccaat aaactctctg ttctcccacg
901 gaatt
//
LOCUS RATLY6C 931 bp ds-DNA ROD 10-JUL-1990
DEFINITION Rat Ly6-C antigen gene, complete cds.
ACCESSION M30690
KEYWORDS antigen.
SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA and DNA,
(library of Clontech), clone RK3.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 931)
AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U.
TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated
from rat kidney
JOURNAL Immunogenetics 31, 104-111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Friedman, 14-DEC-1989, for release after publication.
FEATURES from to/span description
pept 76 262 Ly6C antigen, exon 1 (put.)
342 559 Ly6C antigen, exon 2 (put.)
IVS 263 341 Ly6C antigen intron A (no splice consensus)
signal 927 931 poly-A signal
BASE COUNT 202 a 257 c 224 g 248 t
ORIGIN
1 gccctgggac gtaattggaa gtctattaac tggctccaat ttccaaggtt ttctctgtgc
61 accccttctc tgaggatgaa cagttcttgc gctatgaagt cctgtatgct catctttttc
121 ctggccctac tgtgtgcaga aagagctcag ggcctaaagt gctacagttg catagaagtc
181 ccacttaatg ctaactgctc aacagctacc tgcccctact ctgatggagt gtgtgtttct
241 caggtgttag aagctgtaga gggtctccta gatgcaactt cccagggaac tgcaagagtc
301 tgagaggctg gttgcccttt ttgctctgcc actgagtgat cgctctgtaa gacggacagc
361 aaagagcaat ctctgccttc caatctgccc caagtttcct caaagaaccg agatcctggg
421 taccgttgtc tacacgaagg tttcctgttg caatacagat ctttgcaatg cagcaggtcc
481 cactggaggc agcacctgga ccgtggcagg ggtgcttctg ttcagcctgg gctcagtcct
541 cctggagacc ttgctgtgat ggcccctcca aggccccgcc acccttgtcc ttttagcctc
601 atgtgtaatc actcctctga agccctctag tgataaattc tgagtaatag aaactcccag
661 gtgggggtag ggtgtggaac accttgattc aactctatag cccctgctgg gtaggtgccc
721 cactcccctc tctaggactt tcagatctgt acttcctgga atgccattgt gttgtggttt
781 gctgctcttg gccctggagg cacatggaca gcacagggaa gaggcagaaa cccaaggcac
841 tgtgacacca cccccatcca tacataaaaa tctggggttc tgcagggttc ccacacatgc
901 ctctcaaggt tcccctattt tagtccaata a
//
LOCUS RATLY6CA 783 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Rat Ly6-C antigen mRNA, exon 2.
ACCESSION M30691
KEYWORDS antigen.
SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA, (library of
Clontech), clone RK11.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 783)
AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U.
TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated
from rat kidney
JOURNAL Immunogenetics 31, 104-111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Friedman, 14-DEC-1989, for release after publication.
FEATURES from to/span description
pept / 1 403 Ly6-C antigen, exon 2 (put.) (AA at 2)
signal 771 783 poly-A signal
BASE COUNT 173 a 219 c 187 g 204 t
ORIGIN
1 gaacagttct tgcgctatga agtcctgtat gctcatcttt ttcctggccc tactgtgtgc
61 agaaagagct cagggcctaa agtgctacag ttgcatagaa gtcccactta atgctaactg
121 ctcaacagct acctgcccct actctgatgg agtgtgtgtt tctcaggtgt tagaagctgt
181 agagggctct gtaagacgga cagcaaagag caatctctgc cttccaatct gccccaagtt
241 tcctcaaaga accgagatcc tgggtaccgt tgtctacacg aaggtttcct gttgcaatac
301 agatctttgc aatgcagcag gtcccactgg aggcagcacc tggaccgtgg caggggtgct
361 tctgttcagc ctgggctcag tcctcctgga gaccttgctg tgatggcccc tccaaggccc
421 cgccaccctt gtccttttag cctcatgtgt aatcactcct ctgaagccct ctagtgataa
481 attctgagta atagaaactc ccaggtgggg gtagggtgtg gaacaccttg attcaactct
541 atagcccctg ctgggtaggt gccccactcc cctctctagg actttcagat ctgtacttcc
601 tggaatgcca ttgtgttgtg gtttgctgct cttggccctg gaggcacatg gacagcacag
661 ggaagaggca gaaacccaag gcactgtgac accaccccca tccatacata aaaatctggg
721 gttctgcagg gttcccacac atgcctctca aggttcccct attttagtcc aataaactct
781 ctg
//
LOCUS RATTAG1 5040 bp ss-mRNA ROD 10-JUL-1990
DEFINITION Rat axonal glycoprotein (TAG-1), mRNA, complete cds.
ACCESSION M31725
KEYWORDS glycoprotein.
SOURCE Rat 13 day old embryo spinal cord axon, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5040)
AUTHORS Furley,A.J., Morton,S.B., Manalo,D., Karagogeos,D., Dodd,J. and
Jessell,T.M.
TITLE The axonal glycoprotein TAG-1 is an immunoglobulin superfamily
member with neurite outgrowth-promoting activity
JOURNAL Cell 61, 157-170 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Furley,A.J.W., 30-JAN-1990, for release after publication.
FEATURES from to/span description
pept 224 3346 axonal glycoprotein (TAG-1) precursor
sigp 224 313 axonal glycoprotein signal peptide
matp 314 3343 axonal glycoprotein
BASE COUNT 1144 a 1486 c 1397 g 1013 t
ORIGIN
1 gaattcccgc ccgctgccgc cacgccagga cagccagtgg ctaaggccgg cggggcaagc
61 agccctgagg ctggcagcag ggtctgctca ccaggcggcc gcagcagtgc cccagccaac
121 acccttcccg cactctaggt gtgcctgagt ctccagttga ttctcccgga gcggagctgc
181 ggctcctctc ttttggactc tgcctctgcc tgaaagaccc accatgggga cacacgccag
241 gaaaaaggca agcttgctgc tgctggtgct ggccacagtg gccctggtct cctctccagg
301 atggagtttt gcccagggaa ccccagctac ctttggaccc atcttcgaag agcaacccat
361 tggcctgcta ttcccagagg agtctgcaga ggatcaggtg acactggcgt gccgtgcccg
421 tgctagccct ccagccacct acaggtggaa gatgaatggc acagatatga acctggaacc
481 tggctcccgt caccagctga tggggggcaa cctggtcatc atgagcccca ccaagacaca
541 ggatgctggt gtctaccagt gcctagcctc caacccagta ggcactgtgg tcagcaagga
601 ggctgtcctc cgctttggct ttctacagga attctccaag gaggagagag accctgtgaa
661 aacccatgag ggctggggag tgatgctgcc ctgtaacccg cctgcccatt acccaggttt
721 gtcctaccgc tggctcctca acgagttccc caacttcatc ccaacggatg ggcgacactt
781 cgtgtcccag actacaggaa acctgtacat cgcccggacc aatgcctcag acctgggcaa
841 ctactcttgt ttggctacca gccacatgga cttttccacc aagagtgtct tcagcaaatt
901 tgcgcagctc aacctggctg cggaagatcc ccgactcttc gctcccagta tcaaagctcg
961 gttccccccg gagacctacg cactagttgg gcagcaagtc accctggagt gctttgcctt
1021 tgggaacccg gttccccgga tcaagtggcg caaagtggat ggttccttgt cccctcagtg
1081 ggccacagct gagcccaccc tgcagatccc cagcgtgagc tttgaagacg agggtaccta
1141 tgaatgtgag gcagagaact ccaagggtcg tgacaccgtc cagggacgca tcatcgtgca
1201 agctcagcct gagtggctaa aggtgatctc agacacagag gccgacattg gctccaactt
1261 acgttggggc tgtgcagcag caggcaaacc ccggcccatg gtgcgctggc tgagaaacgg
1321 ggaacctctg gcctcccaga accgggtgga ggtcttggct ggggacctgc gattctctaa
1381 gctgagcctg gaggactctg gcatgtacca gtgtgtggct gaaaacaagc atggcaccat
1441 ctatgccagt gctgagctgg ctgtacaagc tctggcccca gacttcaggc agaaccctgt
1501 gagacggctg atccctgcag ctcgaggcgg agagatcagc atcctgtgcc agcctcgcgc
1561 agccccaaaa gctacaatac tttggagcaa gggtactgag attttgggga acagtaccag
1621 agtgactgtc acttccgatg gcaccttgat catcagaaac atcagccgat ccgatgaagg
1681 caaatatacc tgctttgctg agaacttcat gggcaaagcc aacagtaccg ggatcctgtc
1741 cgtgcgcgat gcaaccaaga tcaccctggc tccctccagt gctgacatca acgtgggtga
1801 caacctgacc ctacaatgtc atgcctcgca cgaccccact atggacctca cgttcacctg
1861 gaccctggat gatttcccta ttgactttga taagcctgga ggtcactacc ggagagccag
1921 tgcgaaggag accattgggg acctgactat cctcaacgcc cacgtacgcc atggagggaa
1981 gtacacatgc atggcccaga ctgtggtaga tggtacatcc aaggaggcca cagtcctggt
2041 ccgaggtccc ccaggtcccc cagggggtgt ggtggtgaga gacatcggag acaccaccgt
2101 tcagcttagc tggagtcgtg gctttgacaa ccacagcccc attgccaagt acacgctgca
2161 agctcgtact ccaccctcgg ggaaatggaa gcaggttcgg accaatcctg tgaatatcga
2221 gggtaatgcc gagactgccc aggtgctggg tctcatgcct tggatggact atgagtttcg
2281 ggtttcagct agcaacatct tgggcactgg ggagcccagc gggccctcca gcaaaatccg
2341 cactaaggaa gcagtcccct cagtggcacc atcgggactc agtggagggg gaggagcccc
2401 tggagagctc atcatcaact ggactcccgt gtcacgggag taccagaacg gagacggctt
2461 cggctacctg ctgtccttcc gcaggcaagg cagctccagc tggcagactg cccgggtgcc
2521 tggcgctgat gcgcagtact tcgtctacgg caatgacagc atccagccct acacaccctt
2581 tgaggtcaag atccgaagct acaatcgccg gggggatggg cccgagagcc tcactgcgtt
2641 agtgtactca gcagaggaag agcccagggt ggcccctgcc aaggtctggg ccaaggggtc
2701 ctcatcttca gagatgaacg tgagctggga gcctgtgcta caagacatga acggcattct
2761 cctgggatat gagattcgct actggaaagc cggggacaac gaagcagccg ctgaccgagt
2821 gaggacagca gggctagaca ccagtgcccg agtcactggc ctgaacccca acaccaaata
2881 ccacgtaact gtgagggcct acaaccgggc cggcactgga cccgctagcc cttcagctga
2941 tgccatgacc gtgaagcccc cgccacggag acctcctggc aacatctcct ggactttctc
3001 aagctccagt ctcagcctta agtgggaccc tgtggttcct ctccgaaatg aatctacggt
3061 cactggctac aagatgctgt atcagaatga tttgcaccca actcctacgc tccacctcac
3121 cagcaagaac tggatagaaa taccagtacc cgaagacatt ggccacgctc tggtacagat
3181 tcgaaccaca gggcctggag gggatgggat ccccgcagaa gtccacattg tgagaaatgg
3241 aggcacaagc atgatggtgg agagcgccgc cgcccgccct gcccatcccg gacctgcgtt
3301 ctcctgcatg gtgatattga tgctcgctgg ctaccagaag ctctgatctc aacactgccc
3361 gccacgccca agctggacac ccaccctaac agacacagcg gctgaccaca gctccctttc
3421 gtccaaggtg gtccaacact gtgcctgagc gtggttggct tagacaccta ctcccaacag
3481 taccctttat gtaggaggta ggatattcct attctgccac aggatagaac catgcgagga
3541 aattttcttt aagtcaagag gcactgggca gtgacttcca tgataatagt actaggccta
3601 atgcctggac cccttggggt cttggtcgaa aggaacgggc ctttgattaa gcagatggtc
3661 ctttggggcc acaagtggca ctgccatctg agatcagagt accaggccca gcaggaacat
3721 gggcagcagt ggggtattgt tttccctcta tgaagcagag ggacctcttc tagtcctcac
3781 tggagaagca ccatggttgg tcccgacacg gtcttccatg actccctggc ttcctcggta
3841 gccaaggaca aggccctggg ttactgggga tagaagctca aaagggttga gaggctaccc
3901 cacccgatgg aaaggggcac cagcctaagc ccattggcca tcctggtggc actgccctct
3961 cagccagcac tgccaagcca atcctgtcgt cctccagatg gaatggtgga gtgacagagc
4021 cacttcaggt ggctatgtga ctaaagggct tgcctcgagg agttgccttg cctcatcaag
4081 atgcttcctt catggaccct ccagggtacg ggcaggagat gtccatctga acgctactct
4141 cttcccttca gctctgctgc aaacttgtgc ctgcctccac ctcccacaac tgcaggcccc
4201 agaaatcagc tctcaacaca gcatccattc tttgtcctgg gatagagagg catccgagaa
4261 gggccagcat caaagtggcc ctgcctgctt ccaggaatat cctccatcac ctggccacac
4321 ctgctcccca gaactgcctg gactactctc ttcagtcccc acaagaaaaa gggttaataa
4381 gggggggggg ggtggcctgc cttgagttct gggtagttac cagggataga ccagactacg
4441 ggagctgaag aagccttata acttgactta tccgtaccct acacttaaca gacgaggaaa
4501 tggaggtgca gaagggttag ggacttcttg ggggtcacat ggtctgtaag gacaaggcat
4561 ggtcagcaca gggtctcctc cccacctgtg ggaggctcta tagagagagg gaggatgttg
4621 agcagtcaca gcctgtcctc taggactctg gaggactctg gaggaggagc cctctgcttc
4681 aagaggttct ggctggtgag atggacaaat gagctccaac caaggcatag gcagattcca
4741 ggagtcaatg gcctggggca gccttctgct gggaactcgg cagggagcac tgtctggaag
4801 cctctcgggc ttgctcattt caagaagagg ccaaagcaag gacagagttc cttagacgag
4861 gaccctgcag cagcacgacc agaaaacccc agtgtccacg ccctcagccc acgggggcag
4921 cagagcaggc atttcaagat gcacttgccc tgctgctcct taggccattt ctgtagttta
4981 cagttagagc tctattttgt tatgggtttt taaacttcaa gccttgctct gtttttctgg
//
LOCUS MUSADAM01 2308 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 1 (non-coding).
ACCESSION M34242 J04767
KEYWORDS adenosine deaminase.
SEGMENT 1 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2308)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pre-msg 885 > 2308 adenosine deaminase (ADA) mRNA and introns
IVS 1008 > 2308 ADA intron A
BASE COUNT 479 a 615 c 698 g 516 t
ORIGIN Chromosome 2.
1 cccacctcaa ggtgcgcaca agttacttaa ggaacttgct acaatatagc cctgctcccg
61 cccccaaaat cccaccaaac ctagagtatg gttctaaaca gctcacctgt taagtctcct
121 tggccaatcc tctagaagtt gaccatagta tgaagttttc tgcagcgtag tttttttctg
181 cccccctttc actactgtgt ctgagcacat gtgctgtgct ttgtagctga aactggcttt
241 attgctgcag aaaccagtcc actgtattta cccacagcac tgatgtgagc attctaaata
301 catctcgatg cgtgggcata tttatccagc gtaactgccc caggagagat gaactgtgtg
361 ttcctgtcca ccccctgtat cagcacctga gactagtctc agagtctctc tcacacacaa
421 cagtgttctc tgcatcccac ccgccctcac ctggtgaact ccggcagtcg ccgctaaatc
481 tccctaatta cacacttctt ctgccttgtg attctgcaac aagtgggtct atccctcaaa
541 atccagcccc ataaggcttc aggactgtgt ggctccagct tcagcctgca caaagtaggc
601 gcccaagcaa cactggaagc ctcggtactg aaggggcccg gaaggggcag gtgagacatt
661 ggagtcacgt ctgcaggggg ctcacctggg agcttcctag ggtgtagcca gcagggaagg
721 tctggggttc agaattccgg gaaatgcgcg ccagagttgc aggcgggggg gggggggggg
781 ggggggcggg gccgtggctc cggaaggcgg ggtctctctg tgggcgtagc gtgggcgggg
841 ctgtgcgggg cagcccggta aaaaagagcg tggcgggccg cggtctctga gagccatcgg
901 gaagcgaccc tgccagcgag ccaacgcaga cccagagagc ttcggcggag agaaccggga
961 acacgctcgg aaccatggcc cagacacccg cattcaacaa acccaaagta agcaccgagg
1021 ggctccgttg ccagggttct gtcgggctgt cccggggctt agcggggccc acctttggcg
1081 cctttaacct agaagcatgg agtggcaggg ggactcccgc aggcatctcc cctcgaccca
1141 ggccttagct tgcttccggg atgtcgagcg agagacgatg tggcagggag tgtccagaag
1201 ggctccgttg ccagggttct gtcgggctgt cccggggctt agcggggccc acctttggcg
1261 cctttaacct agaagcatgg agtggcaggg ggactcccgc aggcatctcc cctcgaccca
1321 ggccttagct tgcttccggg atgtcgagcg agagacgatg tggcagggag tgtccagaac
1381 ctgggggtgt ctctggtcgg ccttcgggtt cggctgctgt ctatgcgaac ctgggagtgc
1441 ctccagtcgg ccttcgggtt cggctgctgc ctatgccctg tgccctggag gtctcagcct
1501 cgctgtctgc caatgggcat ccagtgcggc ggggctgcac agctgtgtgg gactgggcta
1561 ggacctgggt gtctgagccc cagtagaatg gggcccaggg tctctagctg ttaaatgttc
1621 agtgtatggc tttatactta agtgttatga ttactttctg ggcaacaggt aacctaggtt
1681 tgtgggtgcg cccgtgggaa aatctatgat ccaaaccaga aaaggaaggg atagaggctt
1741 cagggtgcca ggaggaaccc ctacacatac tgaccgtttg gccatatggg tttatttggg
1801 atgaagtttt agcccattga ccccagagga gaacccttta tctgtctttc tgcaagctgt
1861 ggcttcttgg aaacagggag actccaggtc cccaaggcca gatttgcagc ccttacagat
1921 tctgtctagt cagccaggca aattgaactg gtcagcagaa gtgtgggact gagaactcag
1981 ggggagggat cagagacagt cacccttaga cttacccctc caagaaacag atgctgagtg
2041 gggggcgggg tggcagacgt atgaatcccg tgtgcatgtt gtgtcatata tgcgtgcatg
2101 gagggagcgg gagggaagat gggcagtggg cctgtattcc atgcacttac catagggaac
2161 acactctgcc cctctagcta gaggctagaa gggcagggca agtcttccta cccaaccaat
2221 gcctgctgca catcttgtct ggtggctcct gaccacagtt ggtgctctta gacatcaaag
2281 ggtgagtttt cttttgatgg tctgaatt
//
LOCUS MUSADAM02 207 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 2 (non-coding).
ACCESSION M34243 J04767
KEYWORDS adenosine deaminase.
SEGMENT 2 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 207)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pre-msg < 1 > 207 adenosine deaminase (ADA) mRNA and introns
IVS < 1 78 ADA intron A
IVS 141 > 207 ADA intron B
BASE COUNT 39 a 58 c 54 g 56 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 1.
1 gctcctcggg ctctgtggtg gcttctgagg tgtcctctgg ctctgtggta tctcacgctc
61 tttttctgtc ccttgcaggt agagttacac gtccacctgg atggagccat caagccagaa
121 accatcttat actttggcaa gtaagtccaa ggacaaccac agaccttccc aggattgcag
181 agcgtgtaca gctcttcttg gggggcc
//
LOCUS MUSADAM03 382 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 3 (first expressed
exon).
ACCESSION M34244 J04767
KEYWORDS adenosine deaminase.
SEGMENT 3 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 382)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pept 235 + 299 adenosine deaminase (ADA, EC 3.5.4.4), exon 3
(first expressed exon)
IVS < 1 176 ADA intron B
IVS 300 > 382 ADA intron C
BASE COUNT 86 a 117 c 94 g 85 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 2.
1 aacacacaca tgcctgatgc cagcaaagga ggcctgaagg cattggtacc cctggaatta
61 gagttacagc tggtcatggg cctccatgtg ggtctcgtct tctgcaagaa cagccagtgt
121 gctcttaccc accaagccct ggtgcagccc ctcacccttg actttatttt taggaggaag
181 agaggcatcg ccctcccggc agatacagtg gaggagctgc gcaacattat cggcatggac
241 aagcccctct cgctcccagg cttcctggcc aagtttgact actacatgcc tgtgattgcg
301 taagttgctc cccaaccctt gtgccccaca gtagcatcca tccctataac caaggtcagg
361 cctgagctgc tgctgtacaa gg
//
LOCUS MUSADAM04 346 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 4.
ACCESSION M34245 J04767
KEYWORDS adenosine deaminase.
SEGMENT 4 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 346)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pept + 93 + 227 adenosine deaminase (ADA), exon 4
IVS < 1 92 ADA intron C
IVS 228 > 346 ADA intron D
BASE COUNT 77 a 85 c 111 g 73 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 3.
1 acagttgtag ttacctcgtt ggctactaga cgtcccaagg agctgagaaa ggttgccaac
61 ctgtgttctt cttcccttcc caggggctgc agagaggcca tcaagaggat cgcctacgag
121 tttgtggaga tgaaggcaaa ggagggcgtg gtctatgtgg aagtgcgcta tagcccacac
181 ctgctggcca attccaaggt ggacccaatg ccctggaacc agactgagtg agtgacatca
241 ctggaggggg ctgtgctgag cggggctctg agctgaggat ggagtgctta gagccctggc
301 ctggtccatg gactcagagc gactcagctc agtcctaagt gcacga
//
LOCUS MUSADAM05 385 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 5.
ACCESSION M34246 J04767
KEYWORDS adenosine deaminase.
SEGMENT 5 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 385)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pept + 115 + 230 adenosine deaminase (ADA), exon 5
IVS < 1 114 ADA intron D
IVS 231 > 385 ADA intron E
BASE COUNT 84 a 115 c 94 g 92 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 4.
1 tctccatcta gaaatagaag ggcagagaga catcactaca tccctgctcc agttccatgg
61 ctgcccatgg tcttcccttg gcctaaagtc ctccctcttc ctctctccac acagagggga
121 cgtcacccct gatgacgttg tggatcttgt gaaccagggc ctgcaggagg ggaggcaagc
181 atttggcatc aaggtccggt ccattctgtg ctgcatgcgc caccagccca gtgagtaccg
241 ccgcaccctg ctggctgcct ggcctataac aaggtggacc gactatccag cgtccccacc
301 tcgtatttct agagttttct aaaaaacacc tgtgaacttt tggtgactct ggtgagtcct
361 taacaggaaa ttgggacttg cacag
//
LOCUS MUSADAM06 189 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 6.
ACCESSION M34247 J04767
KEYWORDS adenosine deaminase.
SEGMENT 6 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 189)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pept + 18 + 145 adenosine deaminase (ADA), exon 6
IVS < 1 17 ADA intron E
IVS 146 > 189 ADA intron F
BASE COUNT 37 a 39 c 75 g 38 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 5.
1 ggcccgtgcc cctgcaggct ggtcccttga ggtgttggag ctgtgtaaga agtacaatca
61 gaagaccgtg gtggctatgg acttggctgg ggatgagacc attgaaggaa gtagcctctt
121 cccaggccac gtggaagcct atgaggtggg cctgagaagg ggagggtggc cctgggggag
181 cttgggtag
//
LOCUS MUSADAM07 307 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exons 7 and 8.
ACCESSION M34248 J04767
KEYWORDS adenosine deaminase.
SEGMENT 7 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 307)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pept + 11 82 adenosine deaminase (ADA), exon 7
164 + 265 adenosine deaminase, exon 8
IVS < 1 10 ADA intron F
IVS 83 163 ADA intron G
IVS 266 > 307 ADA intron H
BASE COUNT 68 a 86 c 85 g 68 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 6.
1 tcccttccag ggcgcagtaa agaatggcat tcatcggacc gtccacgctg gcgaggtggg
61 ctctcctgag gttgtgcgtg aggtaaggag ccagtgaccc cgggcctctt cttcctgatt
121 ctgttcctgt ccctggactc acctcctctc tgcttctcca caggctgtgg acatcctcaa
181 gacagagagg gtgggacatg gttatcacac catcgaggat gaagctctct acaacagact
241 actgaaagaa aacatgcact ttgaggtgag acgccaaggc agagagagtg agctctggct
301 accccgt
//
LOCUS MUSADAM08 249 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 9.
ACCESSION M34249 J04767
KEYWORDS adenosine deaminase.
SEGMENT 8 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 249)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pept + 112 + 176 adenosine deaminase (ADA), exon 9
IVS < 1 111 ADA intron H
IVS 177 > 249 ADA intron I
BASE COUNT 56 a 77 c 60 g 56 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 7.
1 ctgaggcaat gaagcacaaa gctatccaga atagaacctc agctgggctc agccctgacc
61 agtctggccc cggccactat gccagccagc cacacatcct gccccttgca ggtctgcccc
121 tggtccagct acctcacagg cgcctgggat cccaaaacga cgcatgcggt tgttcggtga
181 gatctggttc cgggacccat tttgttttga ttccggaatt cacctatagt gagtcgtata
241 aattcgtaa
//
LOCUS MUSADAM09 340 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 10.
ACCESSION M34250 J04767
KEYWORDS adenosine deaminase.
SEGMENT 9 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 340)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pept + 141 + 270 adenosine deaminase (ADA), exon 10
IVS < 1 140 ADA intron I
IVS 271 > 340 ADA intron J
BASE COUNT 81 a 77 c 92 g 90 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 8.
1 ttaatacgag aatgcaaccc tttgtgttgt ctaaggttgt ataaagatgg aagagggagg
61 tggtggaagg gcagtgatgg ttcttggagt gaagaggctc tctctctctc tcttttcttc
121 ctgcctggcc cctcccccag cttcaagaat gataaggcca actactcact caacacagac
181 gaccccctca tcttcaagtc caccctagac actgactacc agatgaccaa gaaagacatg
241 ggcttcactg aggaggagtt caagcgactg gtgagtatgt gtgagctatg agcctgacac
301 tggcccaggt gtgtgtgtgt gtgtatatgt gtgtgtgtgt
//
LOCUS MUSADAM10 279 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 11.
ACCESSION M34251 J04767
KEYWORDS adenosine deaminase.
SEGMENT 10 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 279)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pept + 88 171 adenosine deaminase (ADA), exon 11
IVS < 1 87 ADA intron J
IVS 188 > 279 ADA intron K
BASE COUNT 67 a 88 c 61 g 63 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 9.
1 ggatctgttt cccccactat gatgcccttg cccttgctaa cagggctgct tccttccttg
61 tcctgactcc atgtttcccc cttctagaac atcaacgcag cgaagtcaag cttcctccca
121 gaggaagaga agaaggaact tctggaacgg ctctacagag aataccaata gccaccacag
181 actgacggta cgcttgtgca gggcgcaata accaccccac cacactgtcc tccttaactc
241 tgtgcgattg tggcagaagt cttgggcagg agcacacct
//
LOCUS MUSADAM11 442 bp ds-DNA ROD 10-JUL-1990
DEFINITION Mouse adenosine deaminase (ADA) gene, exon 12 (non-coding).
ACCESSION M34252 J04767
KEYWORDS adenosine deaminase.
SEGMENT 11 of 11
SOURCE Mouse lung fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 442)
AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
Chinsky,J.M., Martin,B.D. and Kellems,R.E.
TITLE Structural and functional analysis of the murine adenosine
deaminase gene
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES from to/span description
pre-msg < 1 > 287 adenosine deaminase (ADA) mRNA and introns
IVS < 1 69 ADA intron K
signal 282 287 poly-A signal
BASE COUNT 97 a 111 c 114 g 120 t
ORIGIN Chromosome 2; undetermined number of base pairs after segment 10.
1 ttctgtgctt ctaccatgcc ttacatgtca tgagacctga cctttctatt tctctgactt
61 gaccagcagg gcgggtcccc tgaagatggc aaggccactt ctctgagcct catcctgtgg
121 ataaagtctt tacaactctg acatattgac cttcattcct tccagacctt ggagaggcca
181 ggtctgtcct ctgattggat atcctggcta ggtcccaggg gacttgacaa tcatgcacat
241 gaattgaaaa ccttccttct aaagctaaaa ttatggtgtt caataaagca gctggtgact
301 ggtatcttgc agcacatggt gaatacggtc tcggggctgc tggctaggat gctaagaaag
361 gaggagcctg ggccctacgc tgagtgtcag gtctggggag ctagggtctc ttccgcaggt
421 cgactctaga gatccccggg ct
//
LOCUS TRBMVAT5A 1664 bp ss-mRNA INV 10-JUL-1990
DEFINITION T.brucei MVAT5-like variant surface glycoprotein mRNA, complete
cds.
ACCESSION M33825
KEYWORDS variant surface glycoprotein.
SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1664)
AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E.
TITLE Sequences of three VSG mRNAs expressed in a mixed population of
Trypanosoma brucei rhodesiense
JOURNAL Biochem. Biophys. Res. Commun. 169, 730-736 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Donelson, 23-APR-1990.
FEATURES from to/span description
pept 43 1596 MVAT5-like variant surface glycoprotein
BASE COUNT 533 a 435 c 395 g 301 t
ORIGIN
1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt
61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt
121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa
181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg
241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt
301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa
361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac
421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg
481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc
541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg
601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca
661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca
721 gcaaacttcg gccagtcgat ccctttcgca ggcggctatc taacaaggaa cccgacagcc
781 aacacagcca gcagcgccga cggtacggac tttgtaagca accccgaaga cagcaagata
841 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca
901 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat
961 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt
1021 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac
1081 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa
1141 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac
1201 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc
1261 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc
1321 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat
1381 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca
1441 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc
1501 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta
1561 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa
1621 gaatttttgc tactttaaaa acttctgata tattttaaca ccta
//
LOCUS TRBWRATATA 1544 bp ss-mRNA INV 10-JUL-1990
DEFINITION T.brucei WRATat A variant surface glycoprotein mRNA, complete
cds.
ACCESSION M33823
KEYWORDS variant surface glycoprotein.
SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1544)
AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E.
TITLE Sequences of three VSG mRNAs expressed in a mixed population of
Trypanosoma brucei rhodesiense
JOURNAL Biochem. Biophys. Res. Commun. 169, 730-736 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Donelson, 23-APR-1990.
FEATURES from to/span description
pept 42 1457 WRATat A variant surface glycoprotein
mRNA < 1 1544 WRATat A mRNA
BASE COUNT 545 a 387 c 345 g 267 t
ORIGIN
1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc
61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt
121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac
181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg
241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg
301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc
361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa
421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct
481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca
541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg
601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac
661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca
721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa
781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac
841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg
901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt
961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg
1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca
1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg
1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc
1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag
1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt
1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga
1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg
1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta
1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg
//
LOCUS TRBWRATATB 1585 bp ss-mRNA INV 10-JUL-1990
DEFINITION T.brucei WRATat B variant surface glycoprotein mRNA, complete
cds.
ACCESSION M33824
KEYWORDS variant surface glycoprotein.
SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 1585)
AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E.
TITLE Sequences of three VSG mRNAs expressed in a mixed population of
Trypanosoma brucei rhodesiense
JOURNAL Biochem. Biophys. Res. Commun. 169, 730-736 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Donelson, 23-APR-1990.
FEATURES from to/span description
pept 41 1504 WRATat B variant surface glycoprotein
BASE COUNT 535 a 384 c 376 g 290 t
ORIGIN
1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact
61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga
121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa
181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac
241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga
301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat
361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga
421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc
481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca
541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc
601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg
661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag
721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa
781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga
841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact
901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca
961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca
1021 cgacctcata acagctaaat ccggtacgac actaagcggc gcaatcactc ggctaaagca
1081 actacaaatc gccaaagcaa agctaaaagc aagacggcta ctgctgcaaa acagggaacg
1141 gcagcaaacg cgacttatgg cgctagcaga caagatgcaa gaattgtacc aagaggcctt
1201 acatgacgag gttcaactca ggaaggaagc gcagaacaaa cctcaagaaa caccagattc
1261 tgacaagcaa aaagcatgcg agaaatatca caacaagtca aaggaatgca aagaaaatgg
1321 ttgccaatgg agtggaactg aagaaaccat aggaaagtgc gaagctaaac ccaaagcagg
1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac
1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt
1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact
1561 tctggtatat tttaacacct ttaaa
//GenBank-Updates@genbank.bio.net (07/11/90)
LOCUS CDCXYNAB 6067 bp ds-DNA BCT 11-JUL-1990
DEFINITION C.saccharolyticum xylanase A (XynA), beta-xylosidase (XynB) and
acetyl esterase (XynC) genes, complete cds.
ACCESSION M34459
KEYWORDS acetyl esterase; beta-xylosidase; xylanase.
SOURCE C.saccharolyticum DNA, clone pNZ1400.
ORGANISM Caldocellum saccharolyticum
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 6067)
AUTHORS Luethi,E., Love,D.R., McAnulty,J., Wallace,C., Caughey,P.A.,
Saul,D. and Bergquist,P.L.
TITLE Cloning, sequence analysis, and expression of genes encoding
xylan-degrading enzymes from the thermophile "Caldocellum
saccharolyticum"
JOURNAL Appl. Environ. Microbiol. 56, 1017-1024 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 195 1223 xylanase/beta-xylosidase (XynA) precursor
sigp 195 293 xylanase/beta-xylosidase signal peptide
matp 294 1220 xylanase/beta-xylosidae
pept 1257 2057 acetyl esterase (XynC)
pept 2198 2491 ORF 3
pept 2491 3429 ORF 4
pept 3445 4911 beta-xylosidase (XynB) (3445 could be 3463)
pept 5439 > 6067 ORF 6
BASE COUNT 2230 a 787 c 1243 g 1807 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccccgc aaagcctaaa ataagtacat ttagaatgat ggcagaaaat ggttatatta
61 cccttgaatt tacgttaagt aaaaatgctg tggtgctttt tgaggtaagc aaggttgtag
121 atgagtcaga tacttatata ggacttgacg atagtaaaat accaggttat tagttgcttt
181 ataaaataaa aggaatgagg tgtttaattg tgtgcgaaaa tttagagatg ctaaacttat
241 cattagcaaa aacatacaaa gattacttta aaataggtgc tgcagtaact gcgaaagatt
301 tagaaggagt tcatagggat attcttttga agcattttaa tagcctcaca ccagaaaatg
361 ccatgaagtt tgaaaatatt catccagaag agcagagata taattttgaa gaggttgcca
421 ggataaaaga gtttgcaatt aaaaatgaca tgaagttaag aggacataca tttgtttggc
481 ataatcaaac tccggggtgg gtgtttttag ataagaatgg ggaagaagcc tcaaaagagt
541 tagttattga aaggttaaga gagcatataa aaactttgtg tgagagatac aaggatgtag
601 tatatgcgtg ggatgtggtg aacgaagcag tagaagataa aacagaaaag cttttgcgag
661 aatcaaactg gagaaaaatt attggagatg attatattaa aattgctttt gagatagcaa
721 gagaatatgc aggagatgca aagttatttt ataacgatta taacaatgaa atgccttata
781 aattagaaaa aacctacaaa gttctaaaag agcttttaga aagaggtact ccaatagatg
841 gaattggtat acaagcacac tggaatatat gggataaaaa tcttgttagt aatttaaaaa
901 aggctataga agtatatgct tccttaggtt tagaaattca tattacagaa cttgacattt
961 cagtatttga gtttgaagat aagaggactg acttgtttga accaaccccg gaaatgcttg
1021 aactacaagc aaaagtatat gaagatgtat ttgcagtttt tcgagaatat aaagatgtaa
1081 taacttctgt tacattatgg ggtattagcg acagacacac atggaaagat aacttccctg
1141 taaagggtcg aaaagattgg cctctcttat tcgacgtaaa tggaaaacca aaagaagcct
1201 tgtacaggat attaagattt taaagatttt ttaacgaaga aaggggttct tttaatatgg
1261 ctatcatgca aatcaacttt tattcaaaga tgttgaaaaa gaacacaaca attttggcca
1321 ttttacccgt agataaacca gataagaaat tccagaaaga tgttgatagt gaaaatttga
1381 aaaccttata tcttttgcat ggttatgctg gtaactacat ggattggttg tgtggagccc
1441 gaattgttga attatcaatg cgatataatg ttgctgtgtt tctgccatca ggtgaaaata
1501 gtttttattt agatgatgaa gaaaaggaag aatattttgg tgaatttgtg ggaaatgaaa
1561 ttatagaatt tacaagaagc gtttttccta ttcctcaaaa aagggaaaaa acttttattg
1621 gcggtttatc aatgggaggt tacggtgctc ttagaaatgg gcttaaatat aacaagaatt
1681 ttgtaggtat aatagcttta tcatcagcac taataattca taagattgca ggtattccta
1741 aggattatag gaatgcttat gcaagttata actattatag acgagtgttt ggagacctaa
1801 actctttaat aggtagcgat aaagacataa atgccttagt tactaagcta aaacaagaaa
1861 aaggtagtat tccaaaaata tacatggcat gcggcagaga tgacttttta gttcaagaaa
1921 acagagattt atttaatttt ttgaaaaatg aaggtataga cgtggtttat gaggaagacg
1981 aaggtggaca tgactgggat ttttggaaca aatatattgc aaatgctttt gagtggatga
2041 gtaaggtttc tgattaagtc ttcacgtacc ctgttttaag ttttacaaat agatttgtgg
2101 ggtgaatagg tttttttaac actattttat taaggaagag gatgaaaaat aaaaaaagtg
2161 gacaaatttc ttgttaattg taattacatg cattgcaatg gttttctttt ttacatcgtg
2221 tactattcag tctgctatag agcagaagaa aactgttgag gaaatcttgg gaaaaatagg
2281 tgagagtgag gacaaaacaa attcaagggg gcaaccagca acaatgaaag aggatgaagt
2341 tgaagataat cctttaaaag atgtatataa agattatttc ctggttggag cagcaattaa
2401 tggctattct gttgaaactg ctgctatcaa tcatcctggt atggctgcaa ttttgaaaaa
2461 aactttaaca gtacaaccct atctaatttg atgaaacaac aatacctttt agattatgaa
2521 gctacaaaag caagtaaaaa tggaatgcca gtgtgtaaat ttgacagctg cattcctgct
2581 ttacaatttt gtaaggaaaa tggcataaaa atgagaggac atgtgttagt atggcataat
2641 cagacaccag aatggttttt ccacaaagac tatgatgtat cgaaaccact tgtagatgct
2701 gctactatgg aacgccggtt ggaaagttat atcaaacagg taattgaatt ttgtcaaaaa
2761 aattatcccg gtgtagtcta ttgctgggat gttgttaacg aagctatact tgatgatggt
2821 tcatggagag aaatcaataa taattggtat accattatga aagaaaagta tgtggaaaag
2881 gcattttatt atgcaagaaa atatgccaaa aaagatgttg ccctgtttta caatgattac
2941 aatgtttttc tccctgcaaa gagagaagca atttataatc ttgctcagaa acttaaagaa
3001 aaaggattga ttgacgggtt gggtcttcaa cctacagtag gcttgaatta tcctgaatta
3061 gattctgatg atatagattc attcaaaacg acattagaaa catttgcaaa acttggctta
3121 caaattcata ttactgagtt aaattttgaa ataaagggag atgagagcaa tcgtactcct
3181 gaaaatctca aaaaacaagc agataggtat tacgaaatga tgaagttatt attgaaggaa
3241 gatactgata atggtgggcc ttgcaacata acttgtgtta ctgtttttgg tatctgtgac
3301 gattatccac tatataaaaa ttttaagcag tgcatgtatc tttgggataa aaattgcaat
3361 cctaaaccat gtttttattc atttctccaa gcaggtttag actggaaagc atctttatta
3421 agcaaataag aatgaacaac acttatggag aggaggaaaa taatgaaaat aactattaat
3481 tatggaaaga gacttgggaa aataaacaaa ttttgggcaa aatgtgttgg aagctgtcat
3541 gctacaactg cgttaagaga agactggcga aagcaattaa aaaaatgtcg tgacgaactt
3601 ggttttgagt atattcgatt tcatggttgg ttgaatgatg atatgagtgt ttgttttaga
3661 aatgatgatg ggctactttc attctcattc ttcaacatag attctataat tgattttctt
3721 ttggagatag gtatgaaacc atttattgaa ctgagcttta tgccagaagc gttagcgtca
3781 ggtacaaaga cagttttcca ttacaaagga aatataacac cgccgaaatc ttatgaagaa
3841 tggggtcagc tgattgagga gttagcaagg catcttatta gcagatatgg gaaaaatgaa
3901 gtaagagaat ggttttttga ggtatggaac gaaccaaatc taaaggattt cttctgggca
3961 ggaacaatgg aagaatattt taagctttac aaatatgctg cttttgcaat aaagaaagtg
4021 gactctgaac taagggtagg tggaccagct actgcaatcg atgcatggat acctgaacta
4081 aaagattttt gtacaaaaaa tggtgttcca atagatttta tttcaacgca tcaatatcca
4141 acagatttag cattcagtac aagctcaaat atggaagagg ctatggcaaa agcaaagaga
4201 ggtgaattag cagagagggt aaaaaaggct ttagaggaag catatccatt gcctgtttac
4261 tacactgaat ggaataactc tccaagtcct cgagacccat atcacgacat accttacgat
4321 gctgctttta ttgtaaaaac aataattgac attatagatt taccacttgg gtgttattct
4381 tattggacat ttacagatat ctttgaagaa tgtggacaga gttctttacc ttttcatggg
4441 ggattcgggc ttctaaatat tcatggtata ccaaaaccat cctatagagc atttcaaatt
4501 ttagataaac taaacggtga gaggattgag atagagtttg aagataaaag cccaaccatt
4561 gattgtatag ctgtccagaa tgagagagag ataatacttg tgatctcaaa ccataatgtt
4621 ccgctgtctc ctattgatac cgaaaatata aaagttgttt taaaaggtat tgagaattgc
4681 cgagaagttt ttgttgagag aatagatgaa tataatgcca atccaaaaag agtatggctt
4741 gaaatgggca gtcctgcgta tctcaataga gaacagattg aggagttgat aaaagcatca
4801 gaactaaaga aagagaaagt ttcatggggg attgtgaata ataatgaaat tacatttgat
4861 ttaagtgttt tacctcactc agttgtggct gttacaatta agaatggtta gtgaaatgtt
4921 aagagagaaa agcaattttg tatatctctt ttaattttta cctttgacac atcaaacaat
4981 ctaaattaaa attaaagtat agtgttttgc atactcaaca tagtataaat tatataaggg
5041 taacattaat accctttttg tttttgtaag ggggtgtttt tgtggcaaag cacacgcaaa
5101 aaggtaaatc agctgccaca gccgccgtgt cagacaaaga aaaagcaagg tttgttccta
5161 aaaatattca agctgagata aaagaaaaga ttaaagacac tggtgaaaaa gtagcaaagg
5221 ctgagggtaa ggacaaagca cttttacagt taaagctgga gagcaacaaa aaggttgata
5281 agaaaaaatt caaaaaggat agaagtgttg agaggaataa aacttcatta aatagatttt
5341 taagtttaga taaaattaaa tccctatatt caaaagagat acataataaa ctttcacaca
5401 tctttgaaga tgcagtttct gaggtttata gaattttaat ggggctaaag tatatcaaaa
5461 aggcgccaaa ttacaccgaa attgttctga aggcaaagat attttcaacc ttgattttga
5521 tgattgtaat attattttta atcaacaaaa tgccttctac atacaaaaaa gcgtatgcag
5581 ttgttttgaa caatcagatt gtagggtatg tgaaggacaa gactgaagca caaaaccttc
5641 ttacccagat taaaaaagaa gtagaggaaa gacacaatac agacagtttc attttacaaa
5701 gtaagcttca actaaagagc attgagcctg gtcaatatcg tgagacaagg gttgatgagc
5761 tgaaaaatac tatcatagaa aaggggaagg tccttgtaaa aaggtatgct atttttgtta
5821 attcaaaacc atattttgta tttgaaaatc cacaaactcc aaataatatt cttaacaagc
5881 taaaaaaggt ctattataat gacaaggcat cacaggcaaa attcttagag aaggtagaaa
5941 taaaaccagt ttatgtctca ccagctatta aagtagctga tgaagctact gccttaacaa
6001 agattatgtt tgggaaagac caggtaatag aatatacagt caaggaagga gatactcttt
6061 gggatcc
//
LOCUS PFAAMA1 2307 bp ds-DNA INV 11-JUL-1990
DEFINITION P.fragile apical membrane antigen 1 (AMA1/AG352) gene, complete
cds.
ACCESSION M29898
KEYWORDS apical membrane antigen.
SOURCE P.fragile (Nilgiri strain) DNA, from Macaca mulatta, clone
AG352VATV1.
ORGANISM Plasmodium fragile
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 2250)
AUTHORS Peterson,M.G., Nguyen-Dinh,P., Marshall,V.M., Elliott,J.F.,
Collins,W.E., Anders,R.F. and Kemp,D.J.
TITLE Apical membrane antigen of Plasmodium fragile
JOURNAL Mol. Biochem. Parasitol. 39, 279-284 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 2251 to 2307)
AUTHORS Peterson,M.G., Nguyen-Dinh,P., Marshall,V.M., Elliott,J.F.,
Collins,W.E., Anders,R.F. and Kemp,D.J.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by V.Marshall, 15-NOV-1989.
FEATURES from to/span description
pept 275 1963 apical membrane antigen 1 (AMA1/AG352)
precursor
sigp 284 322 put. apical membrane antigen 1 signal peptide
matp 323 1960 apical membrane antigen 1
BASE COUNT 794 a 445 c 499 g 569 t
ORIGIN
1 taagttccct ttctacaccc ggatgcctcc tagagcaaat aggagtttca agcgtttaca
61 tgtaatttac caagcgtttg taattttgca actttgcaat ttttctactg cgcaagtttg
121 taaccgtgaa gctgctcacc tgtgtgacgg ccaattttta ccaacggtta aacctgttag
181 tggctatttt tttctcgccc ccctcctgat tgatgtgcag agggagagaa ccaaatagct
241 gcctttttct tgagtcacaa tttaacaaca caatatgaat aaaatatact gcatactgtt
301 tttaagtgcc cagtgccttg tgcacatggg taagtgcgag ccaaaccaga agccgagcag
361 gctgacccgc agcgctaaaa acgttttgtt ggaacaggag cctatggttg agagaagtac
421 acgaatgagt aacccatgga aagcattcat ggaaaagtac gatatcgaaa aaacacacag
481 ttctggtatt cgagtagatt taggggaaga tgcagaagtg ggaaattcca gctatagaat
541 accagcagga aaatgtcctg tttttggaaa gggtatcgtt atacagaatt ctgaggttag
601 tttcttaaca cctgtagcta caggcaatca aaagttgaag gatggaggtt tcgcctttcc
661 acaagcaaat gatcatattt cccctatatc cataaaaaac cttagagaaa ggtataaaga
721 gaatccagat ttgatgaagc taaacgattt agctttgtgt aaaactcatg cagccagctt
781 tgtaatggaa atggataaaa attcgtccta tagacaccca gctgtatatg atgaagataa
841 aaaaatatgt tacatgttgt atttatcagc gcaagaaaat atgggtccaa gatactgtag
901 taaagatgca gaaaataaag atgctatgtt ttgcttcaag ccagataaaa atgaaacatt
961 tgaccatctt gcctatttaa gcaaaaatgt ggttaatgat tggcaaaaca aatgcccccg
1021 taaaaattta ggaaattcta aatttggatt atgggtggat ggaaactgtg aagaaatccc
1081 atacgttcaa gacgtgcagg caaaggatct acgcgaatgt aacagaatcg ttttcgaagc
1141 tagcgcttca gatcaaccaa ctcagtacga agaagaacta accgattatc aaaaaataca
1201 agaaggcttt agacaaaacg atcagggtat gattaaaagt gcttttcttc cagtaggtgc
1261 attcaactcg gacaatttta agagtaaagg aagaggatat aactgggcaa atttcgatac
1321 tgaaaataag gtttgttacc tttttaatgc caaacccact tgcctcatta atgacaaaaa
1381 ctttatcgca acaacagcgt tatctcatcc ccaagaagta gacaatgagt ttccatgcag
1441 catatacaaa gatgaaatgg aaagggaaat gaggaaagaa tcgaggaaca tgagtctgta
1501 caatgttgat aaggcacgga ttgttctgcc aaggatattt atctccaacg ataaggacag
1561 tctcaaatgt ccatgcgcac cagaacacat taccaacagt acctgcaact tttacgtttg
1621 taactgtgta gagaaaaggg cagaaattaa agaaaataac gaagtggcca taaaggaaga
1681 atttaagcaa gattaccaat acgcgcaagg tgaatccaaa aatcagatgc tcctaattat
1741 tatcggaata actggaggtg tgtgtgtggt cgcactggct tccatgtttt acttcaggaa
1801 gaaagctcac aatgataagt atgacaagat ggagcaggca gacgggtacg ggaaacccac
1861 caccaggaaa gacgagatgc tcgaccccga ggcgtccttc tggggtgaag aaaagcgggc
1921 ctcccacacc acccctgtgc tgatggagaa gccttactac tgagcgggga agcaaccgaa
1981 ttggtgaggg cctctttggt cgtaaacaaa gtgggggtgc ctcacaatgc atattttcaa
2041 cccgcgtcat gtaaaaaaga aaaacgagac acacccagct ggccaacaaa ttgcccacaa
2101 gggaggagaa atggagcaag ctaaaattgg gctattgtca tcatcaccag ttaccgagga
2161 aatgaaaaca acaacaaaaa aaaacgtaac acatggtaaa gtaactgatt ggttaagcaa
2221 agccgagtga aaatttaccc cacttgcgat ttaaaagcat gatttgcctc caccaaatgg
2281 acctctccac tattaatatt accggag
//
LOCUS RICAAMYA 1553 bp ss-mRNA PLN 11-JUL-1990
DEFINITION Rice alpha-amylase mRNA, complete cds, clone pOS103.
ACCESSION M24286
KEYWORDS 1,4-alpha-D-glucan glucanohydrolase; alpha-amylase.
SOURCE Rice (strain M202), cDNA to mRNA, clone pOS103.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1553)
AUTHORS O'Neil,S.D., Kumagai,M.H., Majumdar,A., Huang,N., Sutliff,T.D. and
Rodriguez,R.L.
TITLE The alpha-amylase genes in Oryza sativa: Characterization of cDNA
clones and mRNA expression during seed germination
JOURNAL Mol. Gen. Genet. 221, 235-244 (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.H.Kumagai, 25-APR-1989.
Author address: M.H.Kumagi
UC Davis, Dept. of Genetics,
Davis, Ca. 95616
FEATURES from to/span description
pept 34 1338 alpha-amylase (EC 3.2.1.1)
BASE COUNT 340 a 486 c 446 g 281 t
ORIGIN
1 atcaatcatc catctccgaa gtgtgtctgc agcatgcagg tgctgaacac catggtgaac
61 aaacacttct tgtccctttc ggtcctcatc gtcctccttg gcctctcctc caacttgaca
121 gccgggcaag tcctgtttca gggattcaac tgggagtcgt ggaaggagaa tggcgggtgg
181 tacaacttcc tgatgggcaa ggtggacgac atcgccgcag ccggcatcac ccacgtctgg
241 ctccctccgc cgtctcactc tgtcggcgag caaggctaca tgcctgggcg gctgtacgat
301 ctggacgcgt ctaagtacgg caacgaggcg cagctcaagt cgctgatcga ggcgttccat
361 ggcaagggcg tccaggtgat cgccgacatc gtcatcaacc accgcacggc ggagcacaag
421 gacggccgcg gcatctactg cctcttcgag ggcgggacgc ccgactcccg cctcgactgg
481 ggcccgcaca tgatctgccg cgacgacccc tacggcgatg gcaccggcaa cccggacacc
541 ggcgccgact tcgccgccgc gccggacatc gaccacctca acaagcgcgt ccagcgggag
601 ctcattggct ggctcgactg gctcaagatg gacatcggct tcgacgcgtg gcgcctcgac
661 ttcgccaagg gctactccgc cgacatggca aagatctaca tcgacgccac cgagccgagc
721 ttcgccgtgg ccgagatatg gacgtccatg gcgaacggcg gggacggcaa gccgaactac
781 gaccagaacg cgcaccggca ggagctggtc aactgggtcg atcgtgtcgg cggcgccaac
841 agcaacggca cggcgttcga cttcaccacc aagggcatcc tcaacgtcgc cgtggagggc
901 gagctgtggc gcctccgcgg cgaggacggc aaggcgcccg gcatgatcgg gtggtggccg
961 gccaaggcga cgaccttcgt cgacaaccac gacaccggct cgacgcagca cctgtggccg
1021 ttcccctccg acaaggtcat gcagggctac gcatacatcc tcacccaccc cggcaaccca
1081 tgcatcttct acgaccattt cttcgattgg ggtctcaagg aggagatcga gcgcctggtg
1141 tcaatcagaa accggcaggg gatccacccg gcgagcgagc tgcgcatcat ggaagctgac
1201 agcgatctct acctcgcgga gatcgatggc aaggtgatca caaagattgg accaagatac
1261 gacgtcgaac acctcatccc cgaaggcttc caggtcgtcg cgcacggtga tggctacgca
1321 atctgggaga aaatctgagc gcacgatgac gagactctca gtttagcaga tttaacctgc
1381 gatttttacc ctgaccggta tacgtatata cgtgccggca acgagctgta tccgatccga
1441 attacggatg caattgtcca cgaagtactt cctccgtaaa taaagtagga tcagggacat
1501 acatttgtat ggttttacga ataatgctat gcaataaaat ttgcactgct taa
//
LOCUS RICAAMYB 1682 bp ss-mRNA PLN 11-JUL-1990
DEFINITION Rice alpha-amylase mRNA, complete cds, clone pOS137.
ACCESSION M24287
KEYWORDS 1,4-alpha-D-glucan glucanohydrolase; alpha-amylase.
SOURCE Rice (strain M202), cDNA to mRNA, clone pOS137.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1682)
AUTHORS O'Neil,S.D., Kumagai,M.H., Majumdar,A., Huang,N., Sutliff,T.D. and
Rodriguez,R.L.
TITLE The alpha-amylase genes in Oryza sativa: Characterization of cDNA
clones and mRNA expression during seed germination
JOURNAL Mol. Gen. Genet. 221, 235-244 (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.H.Kumagai, 25-APR-1989.
Author address: M.H.Kumagi
UC Davis, Dept. of Genetics,
Davis, Ca. 95616
FEATURES from to/span description
pept 78 1382 alpha-amylase (EC 3.2.1.1)
BASE COUNT 355 a 491 c 519 g 317 t
ORIGIN
1 atccatcatc tacaagagat cgatcagtag tggttagcag caactcacta tcgaacacgg
61 tttcagctta cacagatatg aagaacacca gcagcttgtg tttgctgctc ctcgtggtgc
121 tctgcagctt gacctgtaac tcgggtcaag cacaggtcct cttccagggt ttcaactggg
181 agtcgtggaa gcagcagggt ggctggtaca acatgttgaa aggccaagtc gacgacatcg
241 ccaaggccgg ggtcacccac gtctggctgc cgccgccgtc gcactccgtg gcgcgagggt
301 acatgccggg gcgtctctac gacctggacg cgtccaagta cggcacggcg gcggagctca
361 agtcgctgat cgcggcgttc cacgggaagg gcgtccagtg cgtcgccgac gtcgtgatca
421 accaccggtg cgccgagaag aaggacgccc gcggcgtgta ctgcgtgttc gagggcggga
481 cgcgcgaccg cctcgactgg ggccccggca tgatctgcag cgacgacacg cagtactccg
541 acggcacggg ccaccgcgac accggcgagg ggttcggcgc ggcgcccgac atcgaccacc
601 tcaacccgcg cgtccagcgg gagctcaccg actggctcaa ctggctcaag tccgacgtcg
661 gcttcgacgg ctggcgcctc gacttcgcca agggatactc cacggacatc gctaagatgt
721 acgtcgagag ctgcaagccg ggcttcgtcg tcgccgagat atggaactcg ctgagctaca
781 acggcgacgg caagccggcg gccaaccagg accagggccg gcaggagctg gtgaactggg
841 tgaacgccgt cggcgggccg gcgatgacgt tcgacttcac caccaagggc ctcctgcagg
901 cgggcgtcca gggcgagctg tggcggctgc gcgacggcaa cggcaaggcg cccggcatga
961 tcgggtggct gccagagaag gccgtcacgt tcgtcgacaa ccacgacacc ggctcgacgc
1021 agaagctttg gccgttcccc tccgacaagg tcatgcaggg ctacgcctac atcctcaccc
1081 accccggagt cccctgcatc ttctacgacc acatgttcga ctggaacctg aagcaggaga
1141 taaccgcgct ggcggcgatc agggagagga acggcatcaa cgccgggagc aagctccgga
1201 tcgtcgtcgc cgacgccgac gcatacgtcg ccgtcgtcga cgagaaggtc atggtgaaga
1261 tcgggacgag gtacgacgtg ggcaacgcgg tgccgtcgga tttccatcag acggtgcacg
1321 gcaaggacta cagcgtctgg gagaaggggt ccctccgcgt cccggcgggg cggcacctat
1381 agcgggctca agccctaaac tgaacgggat agtcatgctc aaaccagttt ctacacggca
1441 agaatttact gattcttata ctttttcagt caattaaatt atggttttta tatatgtaat
1501 tttgtatccg attgtagcgt tcgaataagt aggcaggctc tctagcctct aggttaattg
1561 cgggcatatg tagcttgcca gttaattgtg tttgtatcac gcagtttgta accgttggtg
1621 catatatatg tcaggttcag gatgcagtaa aaaatcatac tgcaccgatc agtgagtttt
1681 ta
//
LOCUS HUMCEAPX 494 bp ss-mRNA PRI 11-JUL-1990
DEFINITION Human cell adhesion protein (SQM1) mRNA, complete cds.
ACCESSION M33374
KEYWORDS cell adhesion protein.
SOURCE Human squamous carcinoma cell line SCC25, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 494)
AUTHORS Wong,Y.-C., Tsao,S.-W., Kakefuda,M. and Bernal,S.D.
TITLE cDNA cloning of a novel cell adhesion protein expressed in human
squamous carcinoma cells
JOURNAL Biochem. Biophys. Res. Commun. 166, 984-992 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 36 443 cell adhesion protein (SQM1)
mRNA < 1 494 SQM1 mRNA
BASE COUNT 107 a 154 c 163 g 70 t
ORIGIN
1 ccctcggtgc tgcagggatc tgcaggactg cagccatggg ggcgcacctg gtccggcgct
61 acctgggcga tgcttcggtg gagcccgacc ccctgcagat gccaaccttc ccgccagact
121 acggcttccc cgaacgcaag gagcgcgaga tggtggccac acagcaggag atgatggacg
181 cgagtgaggc tcagctgcgg gactactgcg cccaccacct catccggctg ctcaagtgca
241 agcgtgacag cttcccaagt tgctggcctg caagcaggaa gcggcacgac tcgggactac
301 tgcgcaccgc aagctatgtg atgcgcatga aggagtttga gcgggacgag ggctgctcca
361 gcggaagaag cggcgggaga agaaggcggc aaatctgcaa aggccaggga cccggggaag
421 tggaccccaa ggtggccctg taggggtgca ccccccaccc tatggaccag tcaaataaaa
481 ccttcaggcc cctc
//
LOCUS REOCEAP1 1463 bp ds-RNA VRL 11-JUL-1990
DEFINITION Reovirus sp. (serotype ST1) sigma-1 protein gene, complete cds.
ACCESSION M32860
KEYWORDS cell attachment protein; sigma-1 protein.
SOURCE Reovirus sp. (serotype ST1), cDNA to viral RNA.
ORGANISM Reovirus sp.
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 1463)
AUTHORS Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K.
TITLE Identification of conserved domains in the cell attachment proteins
of the three serotypes of Reovirus
JOURNAL Virology 174, 399-409 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 14 1426 sigma-1 protein
BASE COUNT 426 a 291 c 369 g 377 t
ORIGIN
1 gctattcgcg cctatggatg catctctcat tacagagata cggaaaatag tactccaact
61 atctgtatca agcaatggct cccagtcaaa agaaatcgag gaaatcaaga aacaagtcca
121 ggtcaacgtt gatgatatca gggctgccaa tattaaactc gacggacttg gaagacagat
181 tgctgacatc agcaatagca tctcaaccat tgagtcaaga ttgggtgaga tggataatcg
241 acttgtgggt atctcgagtc aggtcacgca attatctaac tcagttagcc agaacactca
301 gagcatatcc tcattgggtg acagaatcaa tgctgtcgaa ccacgagttg acagtctgga
361 tacggtcacg tctaatctca ctggacgaac atccactttg gaggcagatg ttggaagctt
421 acggacagaa ctagcagcgc taacaacacg ggtgacaact gaggttacaa ggttagatgg
481 tctaatcaat agtggccaga attcgattgg tgagctatcc acaagactat ccaatgtgga
541 gacgtctatg gtgacgacgg ctggacgggg actgcagaaa aacggaaaca ccttgaacgt
601 cattgtaggt aatggaatgt ggtttaatag ttctaatcaa ttgcagctcg acctttcggg
661 gcaatcaaaa ggggtgggat ttgtcggcac aggaatggtg gttaagattg atactaatta
721 ttttgcttac aatagtaatg gagagattac attggtgagt caaatcaatg aattgccatc
781 gcgcgtatca acactggaat cagcgaaaat cgattcagtt ttacctccat taaccgtacg
841 cgaagcgagc ggcgtacgta ccctgagctt tggttatgat acgagcgatt ttacaatcat
901 caactccgta ctgtcgttac ggtcacgttt gactcttccg acatacaggt accctctgga
961 gctcgacaca gcaaataata gagtgcaggt ggcagatcgt tttggcatgc gcacgggtac
1021 ttggacggga caattgcaat atcagcaccc acaattgagt tggagagcaa atgtcacttt
1081 gaatttgatg aaggtggatg attggttggt gttgagcttt tctcagatga cgactaactc
1141 aataatggca gatgggaaat ttgtgattaa ttttgtgtct gggttatctt ctggatggca
1201 gacgggggat actgaaccat cgtcaactat tgatccattg tctacgacat ttgccgcggt
1261 ccaatttcta aataacggtc aacgcattga tgcgtttagg atcatgggag tatcggaatg
1321 gacggatgga gaattagaga ttaagaatta tggtggcaca tacaccggtc atactcaagt
1381 atattgggct ccgtggacga tcatgtatcc atgcaatgtg aggtgaatct agcgcgaacc
1441 ctcggcacaa ggggtcaatc atc
//
LOCUS REOCEAP2 1440 bp ss-RNA VRL 11-JUL-1990
DEFINITION Reovirus sp. (serotype ST2) sigma-1 protein gene, complete cds.
ACCESSION M32861
KEYWORDS cell attachment protein; sigma-1 protein.
SOURCE Reovirus sp. (serotpe ST2), cDNA to viral RNA.
ORGANISM Reovirus sp.
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 1440)
AUTHORS Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K.
TITLE Identification of conserved domains in the cell attachment proteins
of the three serotypes of reovirus
JOURNAL Virology 174, 399-409 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 14 1402 sigma-1 protein
BASE COUNT 384 a 316 c 381 g 359 t
ORIGIN
1 gctattcgca ctcatgtcgg atctagtgca gctcataaga agggagatct tactgttaac
61 tgggaatgga gaatcagcca actcgaaaca cgagatcgag gaaattaaga aacaaattaa
121 agacatctct gctgatgtca acaggatcag taacatcgtt gattcaatcc aaggacaact
181 gggtggatta tctgtacgcg tgtcagccat tgaatcggga gttagtgaga acggcaatcg
241 aattgataga ctcgagcgag atgtctccgg catatcggct agcgttagcg gaatcgattc
301 gcgtttatcc gagctgggtg accgagtcaa tgttgcagaa cagcgaattg gccagttgga
361 tacagtcacg gataatctcc ttgagcgagc atcaagactg gaaactgaag tatcagccat
421 tactaatgac cttggatcat tgaatacgag gctgacgact gaattgaacg atgtccgcca
481 aactattgct gcgatagaca cgcgtctcac gacactggag accgatgccg tgacgtcggt
541 tggtcaaggg cttcagaaga ctgggaactc gattaaggtt attgtgggta cggggatgtg
601 gttcgaccgc aataatgttc tgcagttatt cttatcgaac cagcagaaag ggttgggatt
661 catagacaat ggaatggtag tgaaaataga tacccagtat ttcagcttcg atagcaatgg
721 caacataact ctgaacaaca acataagtgg tctgccggcg cgaacaggtt ccctcgaggc
781 atctcgtatc gatgtggtag cgccaccgct tgtgatacag tctactggta gcactcggct
841 actgcgtctc atgtacgagg ctgtggactt cgtggttact aacaacgttc tcacactgag
901 aaatcgatcg gtcacgccaa cattcaagtt tcctctggag ttgaatagtg ctgataactc
961 agtgagcatt catagaaatt accgcattag acttgggcaa tggtcaggtc aattggaata
1021 tcacacgccg agtttgcgtt ggaatgctcc cgtcacggtt aatttgatgc gagtagacga
1081 ttggctcatt ttgagtttta ctcggttttc gacgagcggc atcttagcgt caggaaagtt
1141 tgtattgaac ttcgtaactg gtttgtctcc agggtgggcg actgggagta ccgagccctc
1201 gacaactact aacccactgt caacgacgtt tgctgcaatt cagttcatca atgggtcatc
1261 tcgcgtagac gcctttagaa tcttgggagt cgcagagtgg aatgccgggg aactagagat
1321 cacgaatcat ggcggaacat atacagcgca taccaatgtc gactgggcgc cgatgaccat
1381 tatgtaccca tgtctgggct gaggatccgg gtgctccact cggcacagtg gcgactcatc
//
LOCUS REOCEAP3 1416 bp ss-RNA BAD 11-JUL-1990
DEFINITION Reovirus sp. (serotype ST3) sigma-1 protein gene, complete cds.
ACCESSION M32862
KEYWORDS cell attachment protein; sigma-1 protein.
SOURCE Reovirus sp. (serotype ST3) viral DNA.
ORGANISM Reovirus sp.
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 1416)
AUTHORS Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K.
TITLE Identification of conserved domains in the cell attachment proteins
of the three serotypes of reovirus
JOURNAL Virology 174, 399-409 (1990)
STANDARD simple staff_review
COMMENT Secondary reference. Please see: Proc. Natl. Acad. Sci. U.S.A.
82, 24-28 (1985), accession m10262.
FEATURES from to/span description
pept 13 1380 sigma-1 protein
BASE COUNT 376 a 301 c 365 g 374 t
ORIGIN
1 gctattggtc ggatggatcc tcgcctacgt gaagaagtag tacggctgat aatcgcatta
61 acgagtgata atggagcatc actgtcaaaa gggcttgaat caagggtctc ggcgctcgag
121 aagacgtctc aaatacactc tgatactatc ctccggatca cccagggact cgatgatgca
181 aacaaacgaa tcatcgctct tgagcaaagt cgggatgact tggttgcatc agtcagtgat
241 gctcaacttg caatctccag attggaaagc tctatcggag ccctccaaac agttgtcaat
301 ggacttgatt cgagtgttac ccagttgggt gctcgagtgg gacaacttga gacaggactt
361 gcagacgtac gcgttgatca cgacaatctc gttgcgagag tggatactgc agaacgtaac
421 attggatcat tgaccactga gctatcaact ctgacgttac gagtaacatc catacaagcg
481 gatttcgaat ctaggatatc cacgttagag cgcacggcgg tcactagcgc gggagctccc
541 ctctcaatcc gtaataaccg tatgaccatg ggattaaatg atggactcac gttgtcaggg
601 aataatctcg ccatccgatt gccaggaaat acgggtctga atattcaaaa tggtggactt
661 cagtttcgat ttaatactga tcaattccag atagttaata ataacttgac tctcaagacg
721 actgtgtttg attctatcaa ctcaaggata ggcgcaactg agcaaagtta cgtggcgtcg
781 gcagtgactc ccttgagatt aaacagtagc acgaaggtgc tggatatgct aatagacagt
841 tcaacacttg aaattaattc tagtggacag ctaactgtta gatcgacatc cccgaatttg
901 aggtatccga tagctgatgt tagcggcggt atcggaatga gtccaaatta taggtttagg
961 cagagcatgt ggataggaat tgtctcctat tctggtagtg ggctgaattg gagggtacag
1021 gtgaactccg acatttttat tgtagatgat tacatacata tatgtcttcc agcttttgac
1081 ggtttctcta tagctgacgg tggagatcta tcgttgaact ttgttaccgg attgttacca
1141 ccgttactta caggagacac tgagcccgct tttcataatg acgtggtcac atatggagca
1201 cagactgtag ctatagggtt gtcgtcgggt ggtgcgcctc agtatatgag taagaatctg
1261 tgggtggagc agtggcagga tggagtactt cggttacgtg ttgagggggg tggctcaatt
1321 acgcactcaa acagtaagtg gcctgccatg accgtttcgt acccgcgtag tttcacgtga
1381 ggatcagacc accccgcggc actggggcat ttcatc
//
LOCUS RATGLYSN 2386 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Rat glycogen synthase mRNA, complete cds.
ACCESSION J05446
KEYWORDS UDP glucose:glycogen 4-alpha-D-glucosyltransferase;
glycogen synthase.
SOURCE Rat adult liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2386)
AUTHORS Bai,G., Zhang,Z., Werner,R., Nuttall,F.Q., Tan,A.W.H. and
Lee,E.Y.C.
TITLE The primary structure of rat liver glycogen synthase deduced by
cDNA cloning: Absence of phosphorylation sites 1a and 1b
JOURNAL J. Biol. Chem. 265, 7843-7848 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Y.C.Lee, 16-MAR-1990.
FEATURES from to/span description
pept 46 2157 glycogen synthase (EC 2.4.1.11)
mRNA < 1 2386 glycogen synthase mRNA
signal 2365 2370 poly-A signal
BASE COUNT 657 a 581 c 579 g 569 t
ORIGIN
1 ctgcaaccgg tccccttcgg caccagacac acagctggac gaagaatgct caggggccgc
61 tccttgtctg tgacgtccct tggtgggctc cctgcatggg aagctgaaag actccccgtg
121 gaagacttat tgctttttga agtttcctgg gaagtgacca acaaagttgg gggcatctgt
181 actgtgatcc agagcaaagc caaaaccaca gccaatgaat ggggagagaa ttacttcctg
241 ataggtccgt attttgagca taatgtgaag actcaagtag agccatgcag gcccgccaac
301 gatgccgtca gaaaagctgt ggatgccatg aacaaacatg gctgccaggt gcattttgga
361 agatggctga tagaagggag tccgtatgtg gtgctttttg acatcagctc ctcagtgtgg
421 aacctggaca ggtggaaggg agacttctgg gaagcatgtg gcgttggcat ccctcacgac
481 gaccgagaag ccaatgacat gctcatattt gggtctttaa ctgcctggtt cttaaaggag
541 gtgacggacc atgcagacgg taaacacgtc attgcccaat tccatgaatg gcaggctgga
601 actgggctga tcctttctcg tgccaggaaa ctccccatcg ccacaatatt tacaacccat
661 gccacactgc tggggcggta tctctgtgca gcaaatattg acttctacaa ccagcttgat
721 aagttcaaca tagacaaaga ggccggggag aggcagattt atcaccgcta ctgcatggag
781 cgggcttccg tgcactgtgc gcacgtgttt accacagtgt cagaaatcac agccatcgag
841 gcggacgaca tgctgaagag gaagcctgat gtggtgactc caaacggctt gaacgttaag
901 aagttttctg cggtgcacga atttcaaaat ctccatgcca catacaaggc caggatacag
961 gattttgttc gaggtcattt ctatggccac ctggacttcg atcttgaaaa gacgttattt
1021 cttttcattg ctgggaggta tgagttctcc aacaagggag cagacatctt cctagaatcc
1081 ttatccaggc tcaatttcct cctaaggatg cataagagta acgtcactgt ggtagtgttt
1141 ttcatcatgc ctgccaagac aaacaatttc aacgtggaaa ccctgaaggg ccaggcggtg
1201 cggaaacagc tgtgggacac tgtgcactgt atgaaggaaa agtttggcaa gaaactctac
1261 gatgggttat taagaggaga aatacccgac atgaatagta ttttggatcg agatgactta
1321 acaattatga aaagagccat tttttcaact cagagacact ctttgcctcc tgtgaccact
1381 cacaatatga tcgacgattc cacggatccc atcctcagca ccattcgacg aattggactt
1441 ttcaacaatc gcacagacag agtcaaggtg attttacacc cagaattcct gtcctccacc
1501 agccccctac taccaatgga ttatgaagag tttgtccgag gctgtcacct tggggtattt
1561 ccatcatact atgagccctg gggttacacg ccagccgaat gcacagtgat gggcatcccc
1621 agtgtgacta cgaacctctc tggtttcggg tgtttcatgc aggagcatgt ggctgaccct
1681 accgcgtacg gtatttatat cgtcgacagc gtccgctctc cagatgattc ttgcaaccag
1741 ctgactcagt ttctctatgg gttctgtaaa cagtcccgcc gccaaagaat catccagagg
1801 aaccgcaccg agaggctctc agatcttctg gactggagat acctgggcag atattaccag
1861 catgccagac atctgacact gagcagggct tttccagaca aattctacct ggagcccaca
1921 tccccaccaa cgacggatgg ctttaagtat cccaggccct cctcagtacc accttcccca
1981 tcaggatccc agacttcaag tcctcagagc agcgatgtgg aaaacgaagg ggatgaggat
2041 gagagatatg atgaggaaga ggaggctgag agggaccggc taaacatcaa gtcaccattt
2101 tccctgaacc acatcccaaa ggggaagaaa aagcttcatg gagaatataa gaactgagct
2161 caaatgaaat gattccaaat ccacaagaaa atgagctgag cccaagtcca tccctgatgc
2221 ataccgacag atatttacag aatgacgtcg gaaatctaga atctgtgtcc agatcactga
2281 tagtaacttg tagccaccga catgtgtcac cgtactgtga tggtactttt gttgtctaat
2341 tggaaatttc aatctgttat tgataataaa ttaccaaatc taaatg
//
LOCUS RABCYP2C16 2006 bp ss-mRNA MAM 11-JUL-1990
DEFINITION Rabbit cytochrome P450IIC16 (CYP2C16) mRNA, complete cds.
ACCESSION M29968
KEYWORDS cytochrome P450; monooxygenase.
SOURCE Rabbit (strain New Zealand White) adult liver, cDNA to mRNA.
ORGANISM Oryctolagus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 2006)
AUTHORS Hassett,C. and Omiecinski,C.J.
TITLE Sequence and gene expression of rabbit cytochrome P450 IIC16:
Comparison ti highly related family members
JOURNAL Nucleic Acids Res. 18, 1429-1434 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Hassett, 20-NOV-1989.
Author Address [1]: C.Hasset
University of Washington
Department of Environmental Health SC-34
Seattle, WA 98195
FEATURES from to/span description
pept 66 1529 cytochrome P450IIC16 (CYP2C16)
mRNA < 1 2006 CYP2C16 mRNA
signal 1985 1989 poly-A signal
BASE COUNT 524 a 474 c 472 g 536 t
ORIGIN
1 cggcatcggt accaaataag atagacagtg ctactcagaa atccaagaaa atggtggaag
61 aaataatgga tccagttgtg gtcctggtgt tgggtctctg ctgtttgctt ctcctttcac
121 actggaagca aaattccggg agggggaagc tccctcccgg ccccactcct ttccccatta
181 ttggaaatat tctccagata gatgctaagg acatcagcaa atccctaact aagttctcag
241 aacgctatgg ccccgtgttc actgtgtatc tgggcatgaa gcccgctgta gtgctgcatg
301 gataccaggc agtgaaggag gccctggttg atcttggaga ggagtttgct ggaagaggca
361 gttttcctat gcttgataaa gttagtaagg gactcggaat cgttttcacc aatggaaaga
421 gatggaaaga gatccggcgc ttctcgctca tgaccctgcg gaatttcggg atggggaaga
481 ggagcattga ggaccgagtt caagaggagg cccgctgcct ggtggaggag ctgagaaaaa
541 ccaacgcctc accctgtgat cccaccttta tcctgggctg tgctccctgc aatgtgatct
601 gctccattat tttccataat cgctttgatt ataaagatga ggagtttctt aaactattgg
661 aaaaattcaa tgaaaatgtt aggattctga gttctccatg gttgcaggtc tgcaataatt
721 tccctgctct tattgattac ttaccaggaa gtcataagac cttactaaag aattctgatt
781 atgtgaaaaa ttttattatg gagaaagtga aggaacacca aaaattcctg gatgttaaca
841 atcctcggga ctttatagat tgtttcttga tcaaaatgga gcaggaaaac catttggagt
901 tcactcttga aagcttggta accactgtgt ttgatttgtt tggagctggg actgagacaa
961 cgagcacaac gctgagatac tccctcctgc tcctgctgaa gcaccccgag gtcgcagata
1021 aagtgcagga ggagattgag cgtgtgattg gcaggcaccg gagcccctgc atgcaggaca
1081 ggagccgcat gccttacaca gatgccgtaa tacatgagat ccagagattc attgacctgg
1141 tccccaataa tctgccccac acagtgaccc gtgacattaa attcagaaac tactttatcc
1201 ccaagggtac ggacatcatg acatcactga catccgtgct acatgatgaa aaagcatttc
1261 ctaacccaaa ggtatttgac cctggacact ttctggatga gagtggcaac ttcaagaaga
1321 gtgactactt catgcctttc tcagcaggaa aacggatctg tgtgggagag gccctggccc
1381 gcatggagct gtttttgttc ctgacctcca ttttgcagaa ctttaaactg caatctctgg
1441 ttgagccaaa ggacctggac atcactgcag ttctcaatgg atttgtttct gtgccacctt
1501 cgttccagct ctgcttcatt cctgtttgaa aaggagcaga ctggcttcta ctgtgccatc
1561 atttcaaagg cattgcccat caccttactg catttgagac acttctttaa cttttctcac
1621 atcttactat tcccttaaga tctagtgaaa acctaacttc tgtgggtgat cccctgagac
1681 tgcctgccct gaccatgcaa gaggtagaga gggcatggca agccatgctc ctgggaggga
1741 ccccacagcc tggctgctgg caggtggcgg gacccaggca catttctctc cattcctgcc
1801 tgtcaggtaa actgctccta gctgtgtcca aagcccatca agaaagctac cgtaggctat
1861 gtgaccttca agatgattgt aggagcatat cagtaccaat attgcctcta tcctatagaa
1921 ttagtactgc cctgaattag ttacaccctt tctgcctgcc ctttagaaag tgtgcatgct
1981 cattaataaa gtggatgcat tcactg
//
LOCUS HUMGAPA 4307 bp ss-mRNA PRI 11-JUL-1990
DEFINITION Human GTPase-activating protein ras p21 (GAP) mRNA, complete cds.
ACCESSION M23379
KEYWORDS GTPase-activating protein.
SOURCE Human placenta, cDNA to mRNA, clone 101.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4307)
AUTHORS Trahey,M., Wong,G., Halenbeck,R., Rubinfeld,B., Martin,G.A.,
Ladner,M., Long,C.M., Crosier,W.J., Watt,K., Koths,K. and
McCormick,F.
TITLE Molecular cloning of two types of GAP complementary DNA from human
placenta
JOURNAL Science 242, 1697-1700 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by C.M.Long, 31-MAR-1989. For sequence of clone 16 refer to
M23612.
FEATURES from to/span description
pept 119 3262 ras p21 GTP-ase-activating protein (GAP)
BASE COUNT 1280 a 856 c 957 g 1214 t
ORIGIN
1 cctcagcctg gggagctgaa ggggagacgc gtctgggtgg ggctgctcgg agcccgggcc
61 tggtggcccc tggggctccc gggcgggcag ggtagggcag agtagagcgg gcttcaacat
121 gatggcggcc gaggccggca gtgaggaggg cggcccggta acagccggag ctggaggagg
181 cggcgcggca gcgggctcca gtgcctatcc cgcagtgtgt cgggtgaaga tacccgcggc
241 cctgcctgtg gcagccgccc cctatcctgg gctggtggag accggagtgg ctggaactct
301 gggtggcgga gccgctttgg ggtcagagtt cctaggagcc gggtctgtgg caggggcact
361 ggggggagct ggactgacag ggggaggtac tgctgctggc gtagctggtg ctgctgctgg
421 cgtggccggt gctgctgttg ctggacctag tggagacatg gctctcacca aactgcccac
481 ttcgttgctt gctgagactc tcgggccagg cggcggtttt ccccctctgc cccctccccc
541 ttacctgccc cctttggggg cgggcctcgg gacagtggac gaaggtgact ctctggatgg
601 accagaatac gaggaggaag aggtggccat accgttgacc gctcctccaa ctaaccagtg
661 gtatcacgga aaacttgaca gaacgatagc agaagaacgc ctcaggcagg cagggaagtc
721 tggcagttat cttataagag agagtgatcg gaggccaggg tcctttgtac tttcatttct
781 tagccagatg aatgttgtca accattttag gattattgct atgtgtggag attactacat
841 tggtggaaga cgtttttctt cactgtcaga cctaataggt tattacagtc atgtttcttg
901 tttgcttaaa ggagaaaaat tactttaccc agttgcacca ccagagccag tagaagatag
961 aaggcgtgta cgagctattc taccttacac aaaagtacca gacactgatg aaataagttt
1021 cttaaaagga gatatgttca ttgttcataa tgaattagaa gatggatgga tgtgggttac
1081 aaatttaaga acagatgaac aaggccttat tgttgaagac ctagtagaag aggtgggccg
1141 ggaagaagat ccacatgaag gaaaaatatg gttccatggg aagatttcca aacaggaagc
1201 ttataattta ctaatgacag ttggtcaagt ctgcagtttt cttgtgaggc cctcagataa
1261 tactcctggc gattattcac tttatttccg gaccaatgaa aatattcagc gatttaaaat
1321 atgtccaacg ccaaacaatc agtttatgat gggaggccgg tattataaca gcattgggga
1381 catcatagat cactatcgaa aagaacagat tgttgaagga tattatctta aggaacctgt
1441 accaatgcag gatcaagaac aagtactcaa tgacacagtg gatggcaagg aaatctataa
1501 taccatccgt cgtaaaacaa aggatgcctt ttataaaaac attgttaaga aaggttatct
1561 tctgaaaaag ggcaaaggaa aacgttggaa aaatttatat tttatcttag agggtagtga
1621 tgcccaactt atttattttg aaagcgaaaa acgagctacc aaaccaaaag gattaataga
1681 tctcagtgta tgttctgtct atgtcgttca tgatagtctc tttggcaggc caaactgttt
1741 tcagatagta gttcagcact ttagtgaaga acattacatc ttttactttg caggagaaac
1801 tccagaacaa gcagaggatt ggatgaaagg tctgcaggca ttttgcaatt tacggaaaag
1861 tagtccaggg acatccaata aacgccttcg tcaggtcagc agccttgttt tacatattga
1921 agaagcccat aaactcccag taaaacattt tactaatcca tattgtaaca tctacctgaa
1981 tagtgtccaa gtagcaaaaa ctcatgcaag ggaagggcaa aacccagtat ggtcagaaga
2041 gtttgtcttt gatgatcttc ctcctgacat caatagattt gaaataactc ttagtaataa
2101 aacaaagaaa agcaaagatc ctgatatctt atttatgcgc tgccagttga gccgattaca
2161 gaaagggcat gccacagatg aatggtttct gctcagctcc catataccat taaaaggtat
2221 tgaaccaggg tccctgcgtg ttcgagcacg atactctatg gaaaaaatca tgccagaaga
2281 agagtacagt gaatttaaag agcttatact gcaaaaggaa cttcatgtag tctatgcttt
2341 atcacatgta tgtggacaag accgaacact actggccagc atcctactga ggatttttct
2401 tcacgaaaag cttgaatcgt tgttgttatg cacactaaat gacagagaaa taagcatgga
2461 agatgaagcc actaccctat ttcgagccac aacacttgca agcaccttga tggagcagta
2521 tatgaaagcc actgctacac agtttgttca tcatgctttg aaagactcta ttttaaagat
2581 aatggaaagc aagcagtctt gtgagttaag tccatcaaag ttagaaaaaa atgaagatgt
2641 gaacactaat ttaacacacc tattgaacat actttcagag cttgtggaga aaatattcat
2701 ggcttcagaa atacttccac cgacattgag atatatttat gggtgtttac agaaatctgt
2761 tcagcataag tggcctacaa ataccaccat gagaacaaga gttgttagtg gttttgtttt
2821 tcttcgactc atctgtcctg ccatcctgaa tccacggatg ttcaatatca tctcagattc
2881 tccatctcct attgctgcaa gaacactgat attagtggct aaatctgtgc agaacttagc
2941 aaatcttgtg gaatttggag ctaaggagcc ctacatggaa ggtgtcaatc cattcatcaa
3001 aagcaacaaa catcgtatga tcatgttttt agatgaactt gggaatgtac ctgaacttcc
3061 ggacactaca gagcattcta gaacggacct gtcccgtgat ttagcagcat tgcatgagat
3121 ttgcgtggct cattcagatg aacttcgaac gctcagtaat gagcgtggtg cacagcagca
3181 cgtattgaaa aagcttctgg ctataacaga actgcttcaa caaaaacaaa accagtatac
3241 aaaaaccaat gatgtcaggt agcagccttc gccccagtgt tctgcatgga ttcagcatgt
3301 ccaacatggt aattcacttc agtttaatgt ctcctttgct cttgccaaaa aatagcacac
3361 ttttccacat tccagtgatg tgtgagctat gcaaacaaaa tccaagattc tgctggtgaa
3421 taactatgcc agcaaccttg taagctatct gtgcaggata tttgcactat ttccacatgg
3481 aatcaatctt taacaacctc tgagccttgg tgtacagacc acctttcaca aaacgaaatg
3541 ctatgactgt atcttgatat ctcgaacttt caaaatatat tttcagtaca cccagttgcc
3601 aaagttttgc tgtctcttag agaaagaact atgaaatcaa ctgacaagaa acacattctt
3661 attgacaatt gtgtataact ggattgcaga ctgttcttac tgtaactact tcctgattag
3721 gaatatgacc atttgactgt tcaatgatta tttgtattta cagtttccag agtttgtcat
3781 tataatagga acaatctttg ctgtatactt ttaaaaaata ctctgctatt tctcttgctg
3841 gaactgttga aagaaaatat atagaatgat ctattgctca tcagctttat tttttaaaca
3901 tacgacttat tttgttgaaa ttgtcaaaga ctgtatttag atctcataat gctttgttaa
3961 atgtttacaa gtaaatagtt tgaattcagt aaatattatt ggttgttgta ttgatcaatg
4021 catgttaccc attcaaccat tttatagact accaatttct tttatgttaa ctagaatgct
4081 tttgttaaaa gttatttgtt cattatttgt gctacccctt tgattatgca gacaacctca
4141 tcagctgcct aacttatcca tctttgaact tctgactact tgttgtatct gctggatatt
4201 tagttcaact gtatagtttt atttacttct gtatgtgtat ttttgtgaag tattcacaaa
4261 ggttaagtta aaataaaacc aagggatatc ttgcaaaaaa aaaaaaa
//
LOCUS HUMGAPB 3456 bp ss-mRNA PRI 11-JUL-1990
DEFINITION Human GTPase-activating protein ras p21 (GAP) mRNA, complete cds.
ACCESSION M23612
KEYWORDS GTP-ase-activating protein.
SOURCE Human placenta, cDNA to mRNA, clone 16.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3456)
AUTHORS Trahey,M., Wong,G., Halenbeck,R., Rubinfeld,B., Martin,G.A.,
Ladner,M., Long,C.M., Crosier,W.J., Watt,K., Koths,K. and
McCormick,F.
TITLE Molecular cloning of two types of GAP complementary DNA from human
placenta
JOURNAL Science 242, 1697-1700 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by C.M.Long, 31-MAR-1989. For sequence of clone 101 refer to
M23379.
FEATURES from to/span description
pept 100 2712 ras p21 GTP-ase-activating protein (GAP)
site 49 51 5' in frame termination codon
BASE COUNT 1134 a 640 c 687 g 995 t
ORIGIN
1 ggaagaggtg gccataccgt tgaccgctcc tccaactaac cagtaagtta agactgctgt
61 tcaggaattt gggaagctgg ctccagaaaa gaagtggaaa tgaaggggtg gtatcacgga
121 aaacttgaca gaacgatagc agaagaacgc ctcaggcagg cagggaagtc tggcagttat
181 cttataagag agagtgatcg gaggccaggg tcctttgtac tttcatttct tagccagatg
241 aatgttgtca accattttag gattattgct atgtgtggag attactacat tggtggaaga
301 cgtttttctt cactgtcaga cctaataggt tattacagtc atgtttcttg tttgcttaaa
361 ggagaaaaat tactttaccc agttgcacca ccagagccag tagaagatag aaggcgtgta
421 cgagctattc taccttacac aaaagtacca gacactgatg aaataagttt cttaaaagga
481 gatatgttca ttgttcataa tgaattagaa gatggatgga tgtgggttac aaatttaaga
541 acagatgaac aaggccttat tgttgaagac ctagtagaag aggtgggccg ggaagaagat
601 ccacatgaag gaaaaatatg gttccatggg aagatttcca aacaggaagc ttataattta
661 ctaatgacag ttggtcaagt ctgcagtttt cttgtgaggc cctcagataa tactcctggc
721 gattattcac tttatttccg gaccaatgaa aatattcagc gatttaaaat atgtccaacg
781 ccaaacaatc agtttatgat gggaggccgg tattataaca gcattgggga catcatagat
841 cactatcgaa aagaacagat tgttgaagga tattatctta aggaacctgt accaatgcag
901 gatcaagaac aagtactcaa tgacacagtg gatggcaagg aaatctataa taccatccgt
961 cgtaaaacaa aggatgcctt ttataaaaac attgttaaga aaggttatct tctgaaaaag
1021 ggcaaaggaa aacgttggaa aaatttatat tttatcttag agggtagtga tgcccaactt
1081 atttattttg aaagcgaaaa acgagctacc aaaccaaaag gattaataga tctcagtgta
1141 tgttctgtct atgtcgttca tgatagtctc tttggcaggc caaactgttt tcagatagta
1201 gttcagcact ttagtgaaga acattacatc ttttactttg caggagaaac tccagaacaa
1261 gcagaggatt ggatgaaagg tctgcaggca ttttgcaatt tacggaaaag tagtccaggg
1321 acatccaata aacgccttcg tcaggtcagc agccttgttt tacatattga agaagcccat
1381 aaactcccag taaaacattt tactaatcca tattgtaaca tctacctgaa tagtgtccaa
1441 gtagcaaaaa ctcatgcaag ggaagggcaa aacccagtat ggtcagaaga gtttgtcttt
1501 gatgatcttc ctcctgacat caatagattt gaaataactc ttagtaataa aacaaagaaa
1561 agcaaagatc ctgatatctt atttatgcgc tgccagttga gccgattaca gaaagggcat
1621 gccacagatg aatggtttct gctcagctcc catataccat taaaaggtat tgaaccaggg
1681 tccctgcgtg ttcgagcacg atactctatg gaaaaaatca tgccagaaga agagtacagt
1741 gaatttaaag agcttatact gcaaaaggaa cttcatgtag tctatgcttt atcacatgta
1801 tgtggacaag accgaacact actggccagc atcctactga ggatttttct tcacgaaaag
1861 cttgaatcgt tgttgttatg cacactaaat gacagagaaa taagcatgga agatgaagcc
1921 actaccctat ttcgagccac aacacttgca agcaccttga tggagcagta tatgaaagcc
1981 actgctacac agtttgttca tcatgctttg aaagactcta ttttaaagat aatggaaagc
2041 aagcagtctt gtgagttaag tccatcaaag ttagaaaaaa atgaagatgt gaacactaat
2101 ttaacacacc tattgaacat actttcagag cttgtggaga aaatattcat ggcttcagaa
2161 atacttccac cgacattgag atatatttat gggtgtttac agaaatctgt tcagcataag
2221 tggcctacaa ataccaccat gagaacaaga gttgttagtg gttttgtttt tcttcgactc
2281 atctgtcctg ccatcctgaa tccacggatg ttcaatatca tctcagattc tccatctcct
2341 attgctgcaa gaacactgat attagtggct aaatctgtgc agaacttagc aaatcttgtg
2401 gaatttggag ctaaggagcc ctacatggaa ggtgtcaatc cattcatcaa aagcaacaaa
2461 catcgtatga tcatgttttt agatgaactt gggaatgtac ctgaacttcc ggacactaca
2521 gagcattcta gaacggacct gtcccgtgat ttagcagcat tgcatgagat ttgcgtggct
2581 cattcagatg aacttcgaac gctcagtaat gagcgtggtg cacagcagca cgtattgaaa
2641 aagcttctgg ctataacaga actgcttcaa caaaaacaaa accagtatac aaaaaccaat
2701 gatgtcaggt agcagccttc gccccagtgt tctgcatgga ttcagcatgt ccaacatggt
2761 aattcacttc agtttaatgt ctcctttgct cttgccaaaa aatagcacac ttttccacat
2821 tccagtgatg tgtgagctat gcaaacaaaa tccaagattc tgctggtgaa taactatgcc
2881 agcaaccttg taagctatct gtgcaggata tttgcactat ttccacatgg aatcaatctt
2941 taacaacctc tgagccttgg tgtacagacc acctttcaca aaacgaaatg ctatgactgt
3001 atcttgatat ctcgaacttt caaaatatat tttcagtaca cccagttgcc aaagttttgc
3061 tgtctcttag agaaagaact atgaaatcaa ctgacaagaa acacattctt attgacaatt
3121 gtgtataact ggattgcaga ctgttcttac tgtaactact tcctgattag gaatatgacc
3181 atttgactgt tcaatgatta tttgtattta cagtttccag agtttgtcat tataatagga
3241 acaatctttg ctgtatactt ttaaaaaata ctctgctatt tctcttgctg gaactgttga
3301 aagaaaatat atagaatgat ctattgctca tcagctttat tttttaaaca tacgacttat
3361 tttgttgaaa ttgtcaaaga ctgtatttag atctcataat gctttgttaa atgtttacaa
3421 gtaaatagtt tgaattcagt aaatattaaa aaaaaa
//
LOCUS YSCSDH 1665 bp ds-DNA PLN 11-JUL-1990
DEFINITION S.cerevisiae succinate dehydrogenase iron-protein subunit (SDH)
gene, complete cds.
ACCESSION J05487
KEYWORDS succinate dehydrogenase iron-protein subunit.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Lombardo,A., Carine,K. and Scheffler,I.E.
TITLE Cloning and characterization of the iron-sulfur subunit gene of
succinate dehydrogenase from Saccharomyces cerevisiae
JOURNAL J. Biol. Chem. 265, 10419-10423 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.E.Scheffler, 13-APR-1990.
FEATURES from to/span description
pept 738 1538 succinate dehydrogenase iron-protein subunit
(SDH) (EC 1.3.99.1)
signal 585 589 CAAT box
signal 616 622 TATA box
BASE COUNT 532 a 349 c 345 g 439 t
ORIGIN Chromosome VII.
1 atcttacaag taacttaagt caaggcgtga aaagtaccac cactgtgctt gacatgcaaa
61 agggttgcag agtgcgtcta ccaaggtacg tggaccatga tcaaatcatc aagccttatg
121 atctacgtga ggcccaagga caatactggc tcaagaccgt gaatggagga gtattatgaa
181 tgatgaaatc ctgtcgcacg tatattgcca ggcaaagaac tagcagtaat tgtgtcatgt
241 cagcacattg ctgaggtgca aatggccacc caagagctta ttggagcaca ggatatcttc
301 atcagggaat tacattggaa agatccggtc ttcaaattaa ctcaagtctc aatccgaata
361 cttcattccc atcagcgatc ctgaagaacg tcggtccttg tacaggaaca tcgccattgc
421 tgttagagaa tacaataagt actgtgaagc tatcctatga tcacatatga aagtatatac
481 ccgcttttgt acactatgta gctataattc aatcgtatta ttgtacgtcc gcacgaccat
541 gccttagaaa tatccgcagc gcgcaaaagg cggcctcgca ttggcccaat tagctccggt
601 gtaaaaaggg caaactatat aagggattaa tgactttcta tgagaatgcc aaaaaatgtt
661 aggctaaagg aagggattga aaggaatata gttgagctat actttcttga aatactggag
721 tatacatatt tatagggatg ttgaacgtgc tattgagaag gaaggccttt tgtttggtga
781 cgaagaaggg tatggctact gccacaacag ctgcagctac gcataccccc agattgaaaa
841 cttttaaagt ttacagatgg aatccagacg agccaagtgc taaacctcat ttacagtcat
901 atcaagtgga tctgaatgac tgtgggccca tggtacttga tgcgctgtta aagatcaaag
961 acgaacagga ttctacccta acttttagaa gatcatgtag agaaggtatc tgcggttcat
1021 gtgccatgaa cattggcggt agaaacacgc tagcttgtat atgtaagatc gaccagaacg
1081 aatccaaaca actcaagatc tatccattac cccacatgtt tattgtcaaa gatttggtac
1141 ctgatttaac taacttctac caacaataca aatctatcca accttactta cagagatcat
1201 cgtttccaaa ggatggaacg gaagtgctac aaagtattga agatcgtaag aaactggatg
1261 gtctttacga atgtattctg tgtgcatgct gctctacttc atgtccatcg tactggtgga
1321 accaagaaca gtatttgggc cctgccgtgc taatgcaagc ctaccgttgg ctaattgact
1381 ctagagacca agctacaaag acaagaaagg ccatgctaaa caactccatg tcattgtaca
1441 gatgtcacac catcatgaac tgtactagaa cttgtccaaa gggcttgaat cctggtttgg
1501 ctattgctga aattaagaaa tctttggcat ttgcctagac tatcagaaaa acagctagcc
1561 ccgaagaact cagaagcctc tcaaatgatt ttggcactaa taaaagcacc aactattatt
1621 attattattt tcaaggacga aactcaccat tctcacacat tcctt
//
LOCUS BOVPDEAP 585 bp ss-mRNA MAM 11-JUL-1990
DEFINITION Bovine cone photoreceptor cyclic nucleotide phosphodiesterase
alpha'-subunit (PDE), partial cds.
ACCESSION M33140 M29465
KEYWORDS cone photoreceptor cyclic nucleotide phosphodiesterase.
SOURCE Bovine dark-adapted frozen retina, cDNA to mRNA, clone BC-alpha-1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 585)
AUTHORS Charbonneau,H., Prusti,R.K., LeTrong,H., Sonnenburg,W.K.,
Mullaney,P.J., Walsh,K.A. and Beavo,J.A.
TITLE Identification of a noncatalytic cGMP-binding domain conserved in
both the cGMP-stimulated and photoreceptor cyclic nucleotide
phosphodiesterases
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 288-292 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 585 cone photoreceptor cyclic nucleotide
phosphodiesterase alpha'-subunit (AA at 1)
BASE COUNT 198 a 115 c 126 g 146 t
ORIGIN
1 agagaagtca tcttttataa aatcatcgat tacattttac atggaaaaga agagatcaaa
61 gtcattccga cacctcccat ggaccactgg actctcatta gtgggttgcc aacatatgtt
121 gctgaaaatg gatttatctg caacatgctg aacgccccgg cggatgaata cttcacgttt
181 cagaaaggac ctgtagatga aactggctgg gtcattaaaa atgtcttgtc cctgcctatt
241 gtcaacaaaa aggaagacat cgtgggcgta gctacatttt acaacaggaa ggatggaaag
301 ccttttgatg aatatgatga gcacatcgct gagactctca cacagtttct tggatggtct
361 ctcttaaata ctgacaccta tgagaaaatg aataagctgg agaacagaaa ggacatagcc
421 caggaaatgc tcatgaacca caccaaggct acacctgatg agatcaagtc tattttgaaa
481 tttaaagaga agttaaatat agatgtaatt gaagactgtg aagaaaaaca gcttgtcaca
541 attttgaagg aggacctgcc agacccacgg actgcagacc tgtat
//
LOCUS CHKG1CLSE 240 bp ds-DNA VRT 11-JUL-1990
DEFINITION Chicken delta-1-crystallin gene, intron 3 lens-specific enhancer
cor segments B3 and B4.
ACCESSION M33954
KEYWORDS delta-1-crystallin.
SOURCE Chicken DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 240)
AUTHORS Goto,K., Okada,T.S. and Kondoh,H.
TITLE Functional cooperation of lens-specific and nonspecific elements in
the delta-1-crystallin enhancer
JOURNAL Mol. Cell. Biol. 10, 958-964 (1990)
STANDARD simple staff_review
FEATURES from to/span description
site 17 116 core segment B3
site 112 235 core segment B4
BASE COUNT 61 a 52 c 53 g 74 t
ORIGIN
1 gtcagtgagg tgtgctcagc atgacctgcc ctcccaccct cttcagactg aacattcctg
61 aggaattgtt tcagtatgaa ttaggaatat tctttttcca atggcacttg ggatcccttt
121 gtgtctggct gcctgagtta gtagaagaca atgcacaata ttgtataggg gtgaagaaga
181 gtcagccact aagcactttt tctgaaatat tcattgttgt tgctcaccta ccatggacaa
//
LOCUS CHKOVAL 9206 bp ds-DNA VRT 11-JUL-1990
DEFINITION Chicken ovalbumin gene, complete cds.
ACCESSION J00895
KEYWORDS ovalbumin.
SOURCE Chicken oviduct DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1343 to 8906)
AUTHORS Woo,S.L.C., Beattie,W.G., Catterall,J.F., Dugaiczyk,A., Staden,R.,
Brownlee,G.G. and O'Malley,B.W.
TITLE Complete nucleotide sequence of the chicken chromosomal ovalbumin
gene and its biological significance
JOURNAL Biochemistry 20, 6437-6446 (1981)
STANDARD full staff_review
REFERENCE 2 (bases 1043 to 1562; 2675 to 4732; 8885 to 9206)
AUTHORS Benoist,C., O'Hare,K., Breathnach,R. and Chambon,P.
TITLE The ovalbumin gene-sequence of putative control regions
JOURNAL Nucleic Acids Res. 8, 127-142 (1980)
STANDARD full staff_review
REFERENCE 3 (bases 1357 to 1389; 2941 to 3052; and ivs junctions)
AUTHORS Breathnach,R., Benoist,C., O'Hare,K., Gannon,F. and Chambon,P.
TITLE Ovalbumin gene: evidence for a leader sequence in mRNA and DNA
sequences at the exon-intron boundaries
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 4853-4857 (1978)
STANDARD full staff_review
REFERENCE 4 (bases 1282 to 1420; 2952 to 2999)
AUTHORS Gannon,F., O'Hare,K., Perrin,F., LePennec,J.P., Benoist,C.,
Cochet,M., Breathnach,R., Royal,A., Garapin,A., Cami,B. and
Chambon,P.
TITLE Organisation and sequences at the 5' end of a cloned complete
ovalbumin gene
JOURNAL Nature 278, 428-434 (1979)
STANDARD full staff_review
REFERENCE 5 (bases 1343 to 8906; exons only)
AUTHORS McReynolds,L., O'Malley,B.W., Nisbet,A.D., Fothergill,J.E.,
Givol,D., Fields,S., Robertson,M. and Brownlee,G.G.
TITLE Sequence of chicken ovalbumin mRNA
JOURNAL Nature 273, 723-728 (1978)
STANDARD full staff_review
REFERENCE 6 (bases 2675 to 5042)
AUTHORS Robertson,M.a., Staden,R., Tanaka,Y., Catterall,J.F.,
O'Malley,B.W. and Brownlee,G.G.
TITLE Sequence of three introns in the chick ovalbumin gene
JOURNAL Nature 278, 370-372 (1979)
STANDARD full staff_review
REFERENCE 7 (bases 1 to 1042)
AUTHORS Heilig,R., Muraskowsky,R. and Mandel,J.L.
TITLE The ovalbumin gene family: The 5' end region of the X and Y genes
JOURNAL J. Mol. Biol. 156, 1-19 (1982)
STANDARD full staff_review
REFERENCE 8 (bases 5576 to 5624)
AUTHORS Lai,E.C., Woo,S.L.C., Dugaiczyk,A. and O'Malley,B.W.
TITLE The ovalbumin gene: Alleles created by mutations in the intervening
sequences of the natural gene
JOURNAL Cell 16, 201-211 (1979)
STANDARD simple staff_entry
REFERENCE 9 (bases 443 to 611)
AUTHORS Schweers,L.A., Frank,D.E., Weigel,N.L. and Sanders,M.M.
TITLE The steroid-dependent regulatory element in the ovalbumin gene does
not function as a typical steroid response element
JOURNAL J. Biol. Chem. 265, 7590-7595 (1990)
STANDARD simple staff_entry
COMMENT Eight exons reported. Sequence homologies with adenovirus early 1a,
adenovirus major late, chicken conalbumin, chkx and chky genes
noted for 5' flanking sequence.
OV1.3 is identical to OV1.8 except that intron E is shorter by 522
nucleotides [8].
FEATURES from to/span description
pept 2996 3163 ovalbumin, exon 1
3415 3465 ovalbumin, exon 2
4047 4175 ovalbumin, exon 3
4576 4693 ovalbumin, exon 4
5652 5794 ovalbumin, exon 5
6126 6281 ovalbumin, exon 6
7864 8259 ovalbumin, exon 7
pre-msg 1343 8906 oval mRNA and introns
IVS 1390 2978 oval intron A
IVS 3164 3414 oval intron B
IVS 3466 4046 oval intron C
IVS 4176 4575 oval intron D
IVS 4694 5651 oval intron E
IVS 5795 6125 oval intron F
IVS 6282 7863 oval intron G
allele 1282 1282 g may be c [1]
allele 1309 1309 g may be a [1]
allele 1376 1376 g may be c [1],[7]
allele 1385 1385 g may be a [1],[7]
allele 1393 1393 a may be g [1]
allele 3010 3010 t may be c [1]
allele 3154 3154 a may be g [1]
allele 5747 5747 g may be a [1]
allele 8032 8032 can be g [5]
conflict 1471 1471 c in [1]; t in [2],[7]
conflict 1523 1523 a in [1]; t in [2],[7]
conflict 1538 1539 at in [1]; a in [2]
conflict 2681 2681 t in [1],[6]; g in [2]
conflict 2696 2696 a in [1],[6]; g in [2]
conflict 3552 3552 a in [1],[6]; g in [2]
conflict 3683 3683 c in [1],[6]; t in [2]
conflict 3934 3934 a in [1],[6]; c in [2]
conflict 3960 3960 a in [1],[6]; g in [2]
conflict 3994 3994 t in [1],[6]; c in [2]
conflict 4207 4208 tt in [1],[6]; ttt in [2]
conflict 4396 4396 t in [1],[6]; c in [2]
conflict 5636 5636 c in [1]; g in [3]
conflict 6100 6100 t in [1]; tc in [3]
conflict 6119 6119 t in [1]; tg in [3]
conflict 8492 8505 gactcacagtactg in [1]; g in [5]
site 443 611 steroid-dependent regulatory element [9]
BASE COUNT 2994 a 1749 c 1721 g 2742 t
ORIGIN 5 bp upstream of PstI site.
1 ctgcagactg acatgcattt cataggtaga gataacattt actgggaagc acatctatca
61 tcataaaaag caggcaagat tttcagactt tcttagtggc tgaaatagaa gcaaaagacg
121 tgattaaaaa caaaatgaaa caaaaaaaat cagttgatac ctgtggtgta gacatccagc
181 aaaaaaatat tatttgcact accatcttgt cttaagtcct cagacttggc aaggagaatg
241 tagatttcta cagtatatat gttttcacaa aaggaaggag agaaacaaaa gaaaatggca
301 ctgactaaac ttcagctagt ggtataggaa agtaattctg cttaacagag attgcagtga
361 tctctatgta tgtcctgaag aattatgttg tacttttttc ccccattttt aaatcaaaca
421 gtgctttaca gaggtcagaa tggtttcttt actgtttgtc aattctatta tttcaataca
481 gaacaatagc ttctataact gaaatatatt tgctattgta tattatgatt gtccctcgaa
541 ccatgaacac tcctccagct gaatttcaca attcctctgt catctgccag gccattaagt
601 tattcatgga agatctttga ggaacactgc aagttcatat cataaacaca tttgaaattg
661 agtattgttt tgcattgtat ggagctatgt tttgctgtat cctcagaaaa aaagtttgtt
721 ataaagcatt cacacccata aaaagataga tttaaatatt ccagctatag gaaagaaagt
781 gcgtctgctc ttcactctag tctcagttgg ctccttcaca tgcatgcttc tttatttctc
841 ctattttgtc aagaaaataa taggtcacgt cttgttctca cttatgtcct gcctagcatg
901 gctcagatgc acgttgtaga tacaagaagg atcaaatgaa acagacttct ggtctgttac
961 tacaaccata gtaataagca cactaactaa taattgctaa ttatgttttc catctctaag
1021 gttcccacat ttttctgttt tcttaaagat cccattatct ggttgtaact gaagctcaat
1081 ggaacatgag caatatttcc cagtcttctc tcccatccaa cagtcctgat ggattagcag
1141 aacaggcaga aaacacattg ttacccagaa ttaaaaacta atatttgctc tccattcaat
1201 ccaaaatgga cctattgaaa ctaaaatcta acccaatccc attaaatgat ttctatggcg
1261 tcaaaggtca aacttctgaa gggaacctgt gggtgggtca caattcaggc tatatattcc
1321 ccagggctca gccagtgtct gtacatacag ctagaaagct gtattgcctt tagcagtcaa
1381 gctcgaaagg taagcaactc tctggaatta ccttctctct atattagctc ttacttgcac
1441 ctaaacttta aaaaattaac aattattgtg ctatgtgttg tatctttaag ggtgaagtac
1501 ctgcgtgata ccccctataa aaacttctca cctgtgtatg cattctgcac tattttatta
1561 tgtgtaaaag ctttgtgttt gttttcagga ggcttattct ttgtgcttaa aatatgtttt
1621 taatttcaga acatcttatc ctgtcgttca ctatctgata tgctttgcag tttgcttgat
1681 taacttctag ccctacagag tgcacagaga gcaaaatcat ggtgttcagt gaattctggg
1741 gagttatttt aatgtgaaaa ttctctagaa gtttaattcc tgcaaagtgc agctgctgat
1801 cactacacaa gataaaaatg tggggggtgc ataaacgtat attcttacaa taatagatac
1861 atgtgaactt atatacagaa aagaaaatga gaaaaatgtg tgtgtgtata ctcacacacg
1921 tggtcagtaa aaacttttga ggggtttaat acagaaaatc caatcctgag gccccagcac
1981 tcagtacgca tataaagggc tgggctctga aggacttctg actttcacag attatataaa
2041 tctcaggaaa gcaactagat tcatgctggc tccaaaagct gtgctttata taagcacact
2101 ggctatacaa tagttgtaca gttcagctct ttataataga aacagacaga acaagtataa
2161 atcttctatt ggtctatgtc atgaacaaga attcattcag tggctctgtt ttatagtaaa
2221 cattgctatt ttatcatgtc tgcatttctc ttctgtctga atgtcaccac taaaatttaa
2281 ctccacagaa agtttatact acagtacaca tgcatatctt tgagcaaagc aaaccatacc
2341 tgaaagtgca atagagcaga atatgaatta catgcgtgtc tttctcctag actacatgac
2401 cccatataaa ttacattact tatctattct gccatcacca aaacaaaggt aaaaatactt
2461 ttgaagatct actcatagca agtagtgtgc aacaaacaga tatttctcta catttatttt
2521 tagggaataa aaataagaaa taaaatagtc agcaagcctc tgctttctca tatatctgtc
2581 caaacctaaa gtttactgaa atttgctctt tgaatttcca gttttgcaag cctatcagat
2641 tgtgttttaa tcagaggtac tgaaaagtat caatgaattc tagctttcac tgaacaaaaa
2701 tatgtagagg caactggctt ctgggacagt ttgctaccca aaagacaact gaatgcaaat
2761 acataaatag atttatgaat atggttttga acatgcacat gagaggtgga tatagcaaca
2821 gacacattac cacagaatta ctttaaaact acttgttaac atttaattgc ctaaaaactg
2881 ctcgtaattt actgttgtag cctaccatag agtaccctgc atggtactat gtacagcatt
2941 ccatccttac attttcactg ttctgctgtt tgctctagac aactcagagt tcaccatggg
3001 ctccatcggt gcagcaagca tggaattttg ttttgatgta ttcaaggagc tcaaagtcca
3061 ccatgccaat gagaacatct tctactgccc cattgccatc atgtcagctc tagccatggt
3121 atacctgggt gcaaaagaca gcaccaggac acaaataaat aaggtgagcc tacagttaaa
3181 gattaaaacc tttgccctgc tcaatggagc cacagcactt aattgtatga taatgtccct
3241 tggaaactgc atagctcaga ggctgaaaat ctgaaaccag agttatctaa aagtgtggcc
3301 acctccaact cccagagtgt tacccaaatg cactagctag aaatcttgaa actggattgc
3361 ataacttctt tttgtcataa ccattatttc agctactatt attttcaatt acaggttgtt
3421 cgctttgata aacttccagg attcggagac agtattgaag ctcaggtaca gaaataattt
3481 cacctccttc tctatgtccc tttcctctgg aagcaaaata cagcagatga agcaatctct
3541 tagctgttcc aagccctctc tgatgagcag ctagtgctct gcatccagca gttgggagaa
3601 cactgttcat aagaacagag aaaaagaagg aagtaacagg ggattcagaa caaacagaag
3661 ataaaactca ggacaaaaat accgtgtgaa tgaggaaact tgtggatatt tgtacgctta
3721 agcaagacag ctagatgatt ctggataaat gggtctggtt ggaaaagaag gaaagcctgg
3781 ctgatctgct ggagctagat tattgcagca ggtaggcagg agttccctag agaaaagtat
3841 gagggaatta cagaagaaaa acagcacaaa attgtaaata ttggaaaagg accacatcag
3901 tgtagttact agcagtaaga cagacaggat gaaaaatagt tttgtaaaca gaagtatcta
3961 actactttac tctgttcata cactacgtaa aacttactaa gtaataaaac tagaataaca
4021 acatctttct ttctctttgt attcagtgtg gcacatctgt aaacgttcac tcttcactta
4081 gagacatcct caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta
4141 gactttatgc tgaagagaga tacccaatcc tgccagtaag ttgctctaaa atctgatctg
4201 agtgtattcc atgccaaagc tctaccattc tgtaatgcaa aaacagtcag agttccacat
4261 gtttcactaa gaaaatttct ttttctcttg tttttacaaa tgaaagagag gacaaataac
4321 atttctctat caccgacctg aaactctaca gtcttcagag aatgaatggc ttgctaaaag
4381 aatgtcaaat cttactatac agctatttca tattacacta ctaaatacac tataaggcat
4441 agcatgtagt aatacagtgt aaaatagctt tttacactac tatattatta atatctgtta
4501 attccagtct tgcatttcac atttgcaaaa cgttttgaaa ttcgtatctg aaagctgaat
4561 actcttgctt tacaggaata cttgcagtgt gtgaaggaac tgtatagagg aggcttggaa
4621 cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc ctgggtagaa
4681 agtcagacaa atggtaaggt agaacatgct ttgtacatag tgagagttgg ttcaccctaa
4741 tactgagaac ttggatatag ctcagccagc gtgctttgcg ttcaagctta ccagagctgt
4801 tgtatgcctg ttaagcaggg catacagtca tgaggctctt gaaaaatctt aacagacaaa
4861 gggcaatgga aaatcggagt taagggatgg tagggataaa atgcatagaa agaggtacca
4921 caattttgat ttttgcccta atgcctctct gcgtggttcc tcaatttttc tacttcattc
4981 ctcatctcct cagagcattc ctttccctca tgcttgaaac acagatgaaa gactgtgaat
5041 tctaactgag atgaaaacat ccacaaccac acaacctctg gtgtggagtc acattctgtg
5101 aaggcaaaaa ctaggccacg taatctatgc gtgcaagcta cgcgtaagct atgtgtgtga
5161 caggacaatg tgaggaacat actatgtgca caaggactgc agaataaaca ggagcaaagt
5221 ttttgaagaa aacagagtaa aatcctgttt tcctcttttg ttacattctt tacatatatc
5281 tcaaatttcc tctttggtta gaagcaagta atatttatgt ttcttggtac tgtttgggtt
5341 gaagaccatt ctgggataag agaaattcca gtggttcttc ccctaatcat aaaatgtcag
5401 gtttagtttt tttgtaacac agaaatctct tcatctttta tcttttgttg tgattcttga
5461 tagagagaga aacaagactt actgacaata gcagcaagaa aatcaatctt ggaagaacaa
5521 gattgcaatt gcaaaaacaa accaatgtcc ttgcccctac atcctcttcc ccataaattc
5581 tacattctct atctaccttg tgcttgccaa catgatatac gtaaactctc ttttcctatt
5641 cattcttaaa ggaattatca gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc
5701 aatggttctg gttaatgcca ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga
5761 agacacacaa gcaatgcctt tcagagtgac tgaggtatat gggcatacct tagagatgta
5821 atctagaatt tatgaagaga gtagacatgt tgttatatga acactgcatt agcgtatctg
5881 ctcatttgtc tgcatctctt tcagacactg tgttaaaagc agggaatttt ccttatgtct
5941 ctctcgtcac aatattcctg acattgcaaa gctcctgaga aataacttca gattccactt
6001 ttcctaggaa ggcttctgga tgagaactaa tcatcttaac tgtaactaga catttctgca
6061 tccaagaata atctttgtta aaactatatt ctctctctct tttttttttt tttttggttc
6121 tccagcaaga aagcaaacct gtgcagatga tgtaccagat tggtttattt agagtggcat
6181 caatggcttc tgagaaaatg aagatcctgg agcttccatt tgccagtggg acaatgagca
6241 tgttggtgct gttgcctgat gaagtctcag gccttgagca ggtatggccc tagaagttgg
6301 cttcagaata ttaaaaacac atggaaattt agctgttgta aagctctttt caacacagtt
6361 atcctaaaac atttaaccag cacaaatttc atcatgattc aatatgtgat tgttgcatag
6421 aagtgtagat ttgtcccact gggtcctgca atagcccatg ctgagcatgg cttgctgaaa
6481 gaactgcttt agagggtgaa aagtttgaca cagcagacaa gatgattctc acctaagcag
6541 ctgttactgt agtggcttga actctaaagg tcttgtatct ccattcctgt gcactgagga
6601 gcttcttgga aagttcatat aaggtttact agttctaact attatctcat ttggtggcac
6661 tcaatgtgct ttgttcacgt cttcataaat taatctatct aaaaattgga tgtggttaaa
6721 gcaatttcag aaataacatg tacataatgt acaattattg atatgaacag aacacaggca
6781 tagcatattg taattaggag gactgtagtt attttgaata ggaaacacaa tgtaataaat
6841 gagaattcat tgaaatgtta gtatgctaac tcaatctaaa ttataaagat aaagaggcat
6901 ttaatcacag ctagatttcc atcacttgtg acagacaggc atatgaatga ttatgtacag
6961 ctctaggaaa aaaagtatgt aggaaaacta gtacattttg attagaaagt ctgaaaatga
7021 ggtgccttga tcaaagagaa tacgtgtgtt tgagaaaaaa aaagtttgga tagaggtggt
7081 aagagagaat atattgaaat ggtgtttcta caaactgcca tggccagatt tgtgtaagag
7141 acattcagta agtaggcaag gaaagaaata ttactaggta caaagcaaca tcagtaatac
7201 caaaagaaac caattattcc agatgccaat ctcgtaatag ggttaagaga tttccacccc
7261 tctagtggtc accagtgcaa ccagtaactt tgctaattta cattttcttt ttttaaatgg
7321 cagatatagc tttgaactga gtgatcatga actggtactg tgtaatagat gaagacatac
7381 ttgacgacta aacttctgat ttttaaaaac tcaaattctc ttgaaagatc agttcccagt
7441 ctagtaacag ctgatagttt aagtatcagt aattggctac cattaacaac tggctcctga
7501 gaggtcttaa atgtagagac agctttaaac tcaaaagcac agagtgattt ttagaataga
7561 tttcccaagc aaagaaaata aacagggagg agctttaagg gagtagccat ctcattatta
7621 ttattattta aagaaatggc agcaagccta caaaagaaaa ataagacaga gcagagaaga
7681 aagagtcatg gtatgctttt ctatcttagc aaaattaatc tctacatgcc taggaaaaag
7741 ccatgacaag agcaatcagt tcaaaaggtg tatgcaaaaa accacataat agtaactagt
7801 actgcattgc caggaaggaa gttatgtcgc cattccatgg atctcattct catttccttg
7861 cagcttgaga gtataatcaa ctttgaaaaa ctgactgaat ggaccagttc taatgttatg
7921 gaagagagga agatcaaagt gtacttacct cgcatgaaga tggaggaaaa atacaacctc
7981 acatctgtct taatggctat gggcattact gacgtgttta gctcttcagc caatctgtct
8041 ggcatctcct cagcagagag cctgaagata tctcaagctg tccatgcagc acatgcagaa
8101 atcaatgaag caggcagaga ggtggtaggg tcagcagagg ctggagtgga tgctgcaagc
8161 gtctctgaag aatttagggc tgaccatcca ttcctcttct gtatcaagca catcgcaacc
8221 aacgccgttc tcttctttgg cagatgtgtt tccccttaaa aagaagaaag ctgaaaaact
8281 ctgtcccttc caacaagacc cagagcactg tagtatcagg ggtaaaatga aaagtatgtt
8341 atctgctgca tccagacttc ataaaagctg gagcttaatc tagaaaaaaa atcagaaaga
8401 aattacactg tgagaacagg tgcaattcac ttttccttta cacagagtaa tactggtaac
8461 tcatggatga aggcttaagg gaatgaaatt ggactcacag tactgagtca tcacactgaa
8521 aaatgcaacc tgatacatca gcagaaggtt tatgggggaa aaatgcagcc ttccaattaa
8581 gccagatatc tgtatgacca agctgctcca gaattagtca ctcaaaatct ctcagattaa
8641 attatcaact gtcaccaacc attcctatgc tgacaaggca attgcttgtt ctctgtgttc
8701 ctgatactac aaggctcttc ctgacttcct aaagatgcat tataaaaatc ttataattca
8761 catttctccc taaactttga ctcaatcatg gtatgttggc aaatatggta tattactatt
8821 caaattgttt tccttgtacc catatgtaat gggtcttgtg aatgtgctct tttgttcctt
8881 taatcataat aaaaacatgt ttaagcaaac acttttcact tgtagtattt gaagtacagc
8941 aaggttgtgt agcagggaaa gaatgacatg cagaggaata agtatggaca cacaggctag
9001 cagcgactgt agaacaagta ctagtgggtg agaagttgaa caagagtccc ctacaagcaa
9061 cttaatctaa taagctagtg gtctacatca gctaaaagag catagtgagg gatgaaattg
9121 gttctccttt ctaagcatca cctgggacaa ctcatctgga gcagtgtgtc caatctgccg
9181 ctgccctgat ctcggctggg gtgatg
//
LOCUS PMUCEN 150 bp ds-DNA PHG 11-JUL-1990
DEFINITION Bacteriophage Mu wild type DNA fragment with a gyrase cleavage
site.
ACCESSION M32302
KEYWORDS .
SOURCE Bacteriophage Mu (wild type) DNA.
ORGANISM Bacteriophage mu
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 150)
AUTHORS Pato,M., Howe,M. and Higgins,P.
TITLE DNA gyrase binds to a centrally located replication enhancer (CEN)
in the bacteriophage Mu genome
JOURNAL Unpublished (1990) In Press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by M.L.Pato, 23-FEB-1990.
FEATURES from to/span description
site 49 50 DNA gyrase cleavage site in complimentary
strand
mut 44 44 a in wt; g in Mu nuB103
mut 50 50 g in wt; c in Mu nuB1
BASE COUNT 40 a 38 c 27 g 45 t
ORIGIN Map position at 18.0 kb.
1 acgcgtcagc gccgctctga ggcaataaac agaatcaggc ataaaatcag ccgcacagat
61 tttttaaaac gcgccacggg atttttaaac cggtatttaa cggtgtatga atcccgtttt
121 atcttccttt cactttcttt ctccagtact
//
LOCUS RATRNRTR 2577 bp ds-DNA ROD 11-JUL-1990
DEFINITION Rat snRNP-associated polypeptide N, complete cds.
ACCESSION J05497
KEYWORDS snRNP-associated polypeptide N.
SOURCE Rat male adult (Fisher) DNA, clones rgV and rgIII2.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2577)
AUTHORS Schmauss,C. and Lerner,M.R.
TITLE The closely related small nuclear ribonucleoprotein polypeptides N
and B/B' are distinguishable by antibodies as well as by
differences in their mRNAs and gene structures
JOURNAL J. Biol. Chem. 265, 10733-10739 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by C.Schmauss, 13-APR-1990.
FEATURES from to/span description
pept 1314 2036 snRNP-associated polypeptide N
signal 774 779 TATA box
signal 698 702 CAAT box
BASE COUNT 822 a 537 c 581 g 637 t
ORIGIN
1 taactagaga actgagaaca gaatccctgt tagaggaatt agagaccaaa ttaaaagagg
61 tgaaggggct tgcaacccca ttagaacaac aatgccaacc aaccagagct cccagggact
121 aaaccactac ccaaagacta cacatggctc caactgcata tgtagcagag gatggcctag
181 ttgggcacaa tggaaagaga agcccttgga cctgcccagg ctggacccac cagtgtagag
241 taatgtctga ggggtagaag caggtggttg ggaatgggaa tacccttata tgtgaaggga
301 agcaggatga ggtagggaac ttatgttcgt aaaccaggaa agggaataac gtttgaaatg
361 taaataaaaa tatatccagt gaaaaaaaaa ctgaagtcta taataaaagc ttttaatcct
421 ctcagccctt aataaaagtt aattatatta cttatgttaa aaaaaacata aaacagcatg
481 gtattgtact tttttttttc agacaaaagg tctatggcac acagccaaat cagtgacctc
541 tggggcacaa tttccagaaa tcaacatcct agagttgacc tctggcttcc atgtatacgt
601 gcatgcacac acacatatgc atatacatac aaaattacat atatgcagtt gtctaaatca
661 tatgaagaat ttcaagttgt tttaagttta atatcagcaa atacatgcaa atgtgattat
721 aagaagctgg atggaatcct gagttgttga ctaaagagct aagaaggggc aattataaaa
781 caaaaatgac acatgaaatt ccacccgagg ttagaaataa ttaaagaagg ccattgcggc
841 aagtctagca cagagagtag agggtgctgg aggatgacag acggttggtt ctgaggaggg
901 attttgcaac gaatggagcg aggaagggat cgtttacact tgagaagaac tactgaacag
961 cacgtcccag agattgaggt ccaggtcaaa cgtagaagga cagcctcact gagcaaccaa
1021 gagtgtcact tgtacccacg gcattctcag caacagcaaa ttcctgtggt ggatttccag
1081 gcagaactga gacaggcgtt cttagctgag acaccaagag gtggttaaag cagtattgga
1141 acttcaaggt ggtggaagtc aacaaacaca ggacctatcc actgattgtg aaactttggt
1201 caagcttaca ctgtgttaat aaccctgcat caaaccttta tttattgccc ttccccaagt
1261 tttaaggatc ttgtaatttt agtgttgaca actgctattg tggaacagca atcatgactg
1321 tgggtaagag tagcaagatg ctgcagcata ttgactatag aatgagatgt atcctgcaag
1381 atggaagatt cttcattggc acctttaagg cttttgacaa gcatatgaat ttgatcctct
1441 gtgattgtga tgagttcagg aagatcaagc caaagaatgc aaaacagcca gaacgtgaag
1501 aaaaacgggt tttgggtctg gtcttgctac gtggagagaa cttggtttcc atgacagtgg
1561 agggtccacc tcctaaagat actggcattg ctcgtgtgcc acttgctagt gctgcaggtg
1621 gccctggtgt tggaagagca gctggcagag gagtaccagc aggtgtacct attccccaag
1681 ctcctgctgg attagcaggc cctgtccgag gagtgggagg cccatcccag caggtcatga
1741 ccccacaggg aagaggcact gttgcagctg ctgctgttgc tgctactgct agcattgcag
1801 gagccccaac ccagtacccg ccaggacggg gaactccacc tccacctgta ggcagagcaa
1861 ccccacctcc aggcattatg gctcctccac ctggaatgag accacccatg ggcccaccaa
1921 ttggacttcc ccctgctcaa gggagaccta taggcatgcc ccctccagga atgagactcc
1981 ctcctccagg aattagaggc ccacctcccc caggaatgcg tccaccaaga ccctaagata
2041 cagttgataa atctcagccc ttctctttcc ctacaatgct tcttgtgaaa ttgtgtcgcc
2101 tgcaagcttt tgacccctct tactgcatta actatagata ataaatacat agcgcaattg
2161 aattgaaaaa aaaagaaata attaaagaaa gtaagtcaca atgactattt gctattgaca
2221 ttttttttaa atgcccgaat gagagccagt ggagacgata gaaagtccag aagaagctaa
2281 gataatttca aaacacataa tgtcagtaga acgagggaag gtaagaaccc acagaacaca
2341 agaaaccact catgaaactc ctcacacaca ggaagaaaag gaagaatgta atttttaaaa
2401 aaaaagttat agtcaagtta aactatattt tctcattggt ttttttttgt gactttgtat
2461 ttatttttat gtttctttgt gtatattgta catgtctcag tcaaaggcca acggtgagtg
2521 ttttcctcta aaaaacctta ttgtttaaga cagggtctct tcctgagctc agaattc
//
LOCUS WUCSSP 1323 bp ds-DNA INV 11-JUL-1990
DEFINITION W.bancrofti species specific DNA fragment.
ACCESSION M27140
KEYWORDS .
SOURCE W.bancrofti DNA, clone IWb35.
ORGANISM Wuchereria bancrofti
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 1323)
AUTHORS Dissanayake,S. and Piessens,W.F.
TITLE Cloning and characterization of a Wuchereria bancrofti-specific DNA
sequence
JOURNAL Mol. Biochem. Parasitol. 39, 147-150 (1990)
STANDARD simple staff_entry
BASE COUNT 399 a 204 c 251 g 469 t
ORIGIN 1288 bp upstream of SacI site.
1 gatctctgtt tcattatacc gagtaaatat tggagaaaag aaaaatttgt tcaacgtgtt
61 aaagattaac ttgctttcta tataatggaa acattttgca tattggatta gtcagtaaat
121 taataatgga caattgtgat aagtaaaact aaaaagacat cgtcactctc ttccttatta
181 tagcatttcc ttgcttaaaa ccacttgcga cgtcactttt tgttataaat catatggtga
241 atacttttcc tcatttaaga tcgtttatta gcttttgcat tacaaattgt tcattttagt
301 tgtgaacgca ttttgtacat ttaaatgctt gctttagaat tttaggtttc aactggtacg
361 tttatgccgt ttatatgaaa ttatgggata acaaagaaaa ataaagataa agaagtaaaa
421 attcgaatga ttaaatgaat tattagtacc ctgattgcta tagccctttt ctacgttttg
481 gcaagaagtc ccaaattggt tctcactttt cagaatgaaa atttttagtt gtttatagcg
541 ccaaaagaaa tgattaacag cagtttggct ttgtggacgg aatgatatgc ttttctgcat
601 acctttcata aattggaaaa aacaaaataa tttggctaag agtgaatgga gtattcgttc
661 gtttgtgata ttttcaatgt ttgttgatgt atattcgaag cgtctctgct cactactgtc
721 aaaccctttt taagaacgtt gcttctacgg tcactgggca gctactacgt attgagtgag
781 cgatatgaaa agaatataca gtatctaatg actgccaatg tcaaataaat ttttgtatcg
841 tcactcagcg gtcacaaatg tttcataaat atttcacatg cattctattt taggttcaaa
901 tatgctttta aaattctgct aaatttgcaa actaacgaga ttttgtttgg cagctcttct
961 tatgataacg cagttcaatc ctggtggtga agaatttgcc acagtcttcg cattttggat
1021 gaggttcatg cgtttgtttg tgtttgtgaa atgttgattt atggtcgaat gtccgcccac
1081 aaccgggtac tttgcattca tagatgaaag gctgaccgtg tgtttcctat gtgttatata
1141 ttcgttgtaa ttgttcgtga tcaataggaa acaactggca ggatggcaga ttttaataca
1201 accatatcaa taattatatt aaatgtaaat gttctagctg ggtagagtgg cgtgcatctg
1261 tagtctcggc cacttggaag actgagctca ggaagattac ttgcacccag gagcttgagg
1321 agc
//
LOCUS YSCHXT2 2890 bp ds-DNA PLN 11-JUL-1990
DEFINITION S.cerevisiae high affinity hexose transporter-2 (HXT2) gene,
complete cds.
ACCESSION M33270
KEYWORDS high affinity hexose transporter-2.
SOURCE S.cerevisiae (isogenic strain to S288C) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2890)
AUTHORS Kruckeberg,A.L. and Bisson,L.F.
TITLE The HXT2 gene of Saccharomyces cerevisiae is required for high
affinity glucose transport
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.L.Kruckeberg, 26-MAR-1990.
FEATURES from to/span description
pept 818 2443 high affinity hexose transporter-2
BASE COUNT 808 a 571 c 535 g 976 t
ORIGIN
1 aaaaagaaat attattcatt actatcaaga taccgtagaa aagaaaaaga accggggatg
61 aataataaca aaacgggctg ctttttcttt ttctctttct ttttcatttg gtccctctcc
121 actctttctc cacgtggctt tgcttcccgt atttttcttc gtcagagaga ctacatgata
181 gtccaaagaa aagaaacagg ggggacgaag aagaggagag gaaaaaccaa aatataattt
241 tccgtgaaat agattctttt tctccactgc acgacttctt ctcctcccac aaaaaatgac
301 gcctcataga cagccccgca gcttcacttt taagtttctt tttctcctca cggcgcaacc
361 gctaacttaa gctaatcctt atgaatccgg agaaaagcgg ggtcttttaa ctcaataaaa
421 ttttccgaaa tcctttttcc tacgcgtttt cttcgggaac tagataggtg gctcttccac
481 ctgtttttcc atcattttag tttttcgcaa gccatgcgtg ccttttcgtt tttgcgatgg
541 cgaacgaggg ctggaaaaat taacggtacg ccgcctaacg atagtaatag gccacgcaac
601 tggcgtggac gacaacaata agtcgcccat tttttatgtt ttcaaaacct agcaaccccc
661 accaaacttg tcatcgttcc cggattcaca aatgatataa aaagcgatta caattctaca
721 ttctaaccag atttgagatt tcctctttct caattcctct tatattagat tataagaaca
781 acaaattaaa ttacaaaaag acttataaag caacataatg tctgaattcg ctactagccg
841 cgttgaaagt ggctctcaac aaacttctat ccactctact ccgatagtgc agaaattaga
901 gacggatgaa tctcctattc aaaccaaatc tgaatacact aacgctgaac tcccagcaaa
961 gccaatcgcc gcatattgga ctgttatctg tttatgtcta atgattgcat ttggtgggtt
1021 tgtctttggt tgggatactg gtaccatctc tggttttgtt aatcaaaccg atttcaaaag
1081 aagatttggt caaatgaaat ctgatggtac ctattatctt tcggacgtcc ggactggttt
1141 gatcgttggt atcttcaata ttggttgtgc ctttggtggg ttaaccttag gacgtctggg
1201 tgatatgtat ggacgtagaa ttggtttgat gtgcgtcgtt ctggtataca tcgttggtat
1261 tgtgattcaa attgcttcta gtgacaaatg gtaccaatat ttcattggta gaattatctc
1321 tggtatgggt gtcggtggta ttgctgtcct atctccaact ttgatttccg aaacagcacc
1381 aaaacacatt agaggtacct gtgtttcttt ctatcagtta atgatcactc taggtatttt
1441 cttaggttac tgtaccaact atggtactaa agactactcc aattcagttc aatggagagt
1501 gcctttgggt ttgaactttg ccttcgctat tttcatgatc gctggtatgc taatggttcc
1561 agaatctcca agattcttag tcgaaaaagg cagatacgaa gacgctaaac gttctttggc
1621 aaaatctaac aaagtcacca ttgaagatcc aagtattgtt gctgaaatgg atacaattat
1681 ggccaacgtt gaaactgaaa gattagccgg taacgcttct tggggtgagt tattctccaa
1741 caaaggtgct attttacctc gtgtgattat gggtattatg attcaatcct tacaacaatt
1801 aactggtaac aattacttct tctattatgg tactactatt ttcaacgccg tcggtatgaa
1861 agattctttc caaacttcca tcgttttagg tatagtcaac ttcgcatcca ctttcgtggc
1921 cttatacact gttgataaat ttggtcgtcg taagtgtcta ttgggtggtt ctgcttccat
1981 ggccatttgt tttgttatct tctctactgt cggtgtcaca agcttatatc caaatggtaa
2041 agatcaacca tcttccaagg ctgccggtaa cgtcatgatt gtctttacct gtttattcat
2101 tttcttcttc gctattagtt gggccccaat tgcctacgtt attgttgccg aatcctatcc
2161 tttgcgtgtc aaaaatcgtg ctatggctat tgctgttggt gccaactgga tttggggttt
2221 cttgattggt ttcttcactc ccttcattac aagtgcaatt ggattttcat acgggtatgt
2281 cttcatgggc tgtttggtat tttcattctt ctacgtgttt ttctttgtct gtgaaaccaa
2341 gggcttaaca ttagaggaag ttaatgaaat gtatgttgaa ggtgtcaaac catggaaatc
2401 tggtagctgg atctcaaaag aaaaaagagt ttccgaggaa taagagatta tacttaaact
2461 agcactgatt tttttaaggc taatggctac taatacttta atagatgatc ttcatacttt
2521 tttatttaac gatttttaat gatgttttta tttgtaccac tcatttatct agattttttt
2581 aatactgatc aaatcttacg gactcgacgt taaaaagttc ctacatacgt ctggtacttg
2641 aaacgctgct tcgaggtatt gacactataa gaatacgatc caaatactta caccgcatgt
2701 aaaaatatgc cgacaatatg aatacttgtt gatgaatgat atttgatttt aatccggcaa
2761 tttacctcct ttatataatc caataattgt tgataattag tggttaggtt gcagtactaa
2821 taagaattaa gacaaatatt cttctactat ataaaaggtg caaacaaaac acacgccgat
2881 cggccatact
//
LOCUS RATGAH 1003 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Rat L-glutamine amidohydrolase mRNA, 3' end.
ACCESSION J05499
KEYWORDS L-glutamine amidohydrolase.
SOURCE Rat (strain Sprague-Dawley) liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1003)
AUTHORS Smith,E.M. and Watford,M.
TITLE Molecular cloning of a cDNA for rat hepatic glutaminase: Sequence
similarity to kidney-type glutaminase
JOURNAL J. Biol. Chem. 265, 10631-10636 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Waterford, 08-MAY-1990.
FEATURES from to/span description
pept < 1 660 L-glutamine amidohydrolase (EC 3.5.1.2)
BASE COUNT 270 a 243 c 258 g 232 t
ORIGIN
1 ttccggatgt gtctgtcacc cccgttagac aagctgggga acagccacag gggcatcagc
61 ttctgccaga agttggtgtc tctgtttaac ttccacaact acgacaacct gcggcactgt
121 gctcggaagt tagacccacg gagggaaggg ggggaagttc ggaacaagac cgtggtgaac
181 ctgttatttg ctgcatatag tggagatgtc tcagctcttc gaaggtttgc cttgtctgcc
241 gtggatatgg agcagaagga ctatgattcc cgcacagccc tacatgtggc ggcagcggaa
301 ggacacattg acgttgtcaa gtttctgatc gaggcttgca aagtgaatcc ttttgtcaag
361 gacaggtggg gcaacattcc cctggatgat gccctgcagt tcaatcacct ggaggtggtc
421 aaactgcttc aggattacca tgactcctac atgctgtctg agactcaagc tgaggtacag
481 ctgagactct gtcaaaagag aactgagaga gcatgtgtga gcacaggcca gggcagcccg
541 tgctcaagaa aaagcatgag cgggccacaa tttaacccaa ggccaccaaa aatactattg
601 caagctgctt cagtgggatc aacacagcca tctggtgaca caggccagtg ttttctgtga
661 gaatcaaaat gccccattcc ctcatcggac agcacagaga aaagcttcag tggacacctg
721 agcagagcta gccacggaga cctcaaggta tagcttaagt gacatcctcc accagaaagt
781 agcccaggct tttacccagg tccccatttc aacttccttg gagagcgtct agctacatgc
841 atatgtatct gtcacagagc aagagaggtg ggtgagagcc caatcacctg gctttagaaa
901 tctgcagaga tctgtccatc ttagccaaga catgctgcta ctgctgacag gagttttata
961 gacaaagtat tttgtgttca aataaacttt aattaccgga att
//
LOCUS CEACAEVA 264 bp ds-DNA VRL 11-JUL-1990
DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene,
complete cds.
ACCESSION M34092
KEYWORDS tat protein.
SOURCE Caprine arthritis-encephalitis lentivirus (strain Cork) DNA, from
goat synovial membrane, clone pCol.9.
ORGANISM Caprine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
Robinson,M.M. and Cheevers,W.P.
TITLE Genetic structure of the pol-env region of the Caprine arthritis-
encephalitis lentivirus genome: Possible role in trans-activation
of the viral long terminal repeat
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.P.Cheevers, 08-MAY-1990.
Author address: W.P.Cheevers
Washington State University
Dept Veterinary Microbiology
Pullman, WA 94164-7040
email: b384@wsuvmsl.bitnet
FEATURES from to/span description
pept 1 264 tat protein
BASE COUNT 97 a 40 c 80 g 47 t
ORIGIN
1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta
61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg
121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg
181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta
241 tgtaacccgg ggtggggaac ataa
//
LOCUS CEACAEVB 264 bp ds-DNA VRL 11-JUL-1990
DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene,
complete cds.
ACCESSION M34093
KEYWORDS tat protein.
SOURCE Caprine arthritis-encephalitis lentivirus (strain G63) DNA, from
goat synovial membrane, clone pC63-49.
ORGANISM Caprine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
Robinson,M.M. and Cheevers,W.P.
TITLE Genetic structure of the pol-env region of the Caprine arthritis-
encephalitis lentivirus genome: Possible role in trans-activation
of the viral long terminal repeat
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.P.Cheevers, 08-MAY-1990.
Author address: W.P.Cheevers
Washington State University
Dept Veterinary Microbiology
Pullman, WA 94164-7040
email: b384@wsuvmsl.bitnet
FEATURES from to/span description
pept 1 264 tat protein
BASE COUNT 99 a 43 c 75 g 47 t
ORIGIN
1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata
61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg
121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg
181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta
241 tgtaacccgg ggtgggaaac ataa
//
LOCUS FLAPR834HA 540 bp ss-RNA VRL 11-JUL-1990
DEFINITION Influenza A/PR/8/34, hemagglutinin (seg 4) gene. partial cds.
ACCESSION M34335
KEYWORDS glycoprotein; hemagglutinin.
SOURCE Influenza A/PR/8/34 RNA, passed in bovine MBDK cells, originally
from human.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 540)
AUTHORS Bressoud,A., Whitcomb,J., Pourzand,C., Haller,O. and Cerutti,P.
TITLE Rapid detection of influenza virus H1 by the polymerase chain
reaction
JOURNAL Biochem. Biophys. Res. Commun. 167, 425-430 (1990)
STANDARD simple staff_review
COMMENT Sequence reported is + strand.
FEATURES from to/span description
pept < 1 > 540 hemagglutinin (AA at 2)
BASE COUNT 179 a 121 c 123 g 117 t
ORIGIN
1 cctactggtc ctgttatgtg cacttgcagc tgcagatgca gacacaatat gtataggcta
61 ccatgcgaac aattcaaccg acactgttga cacagtactc gagaagaatg tgacagtgac
121 acactctgtt aacctgctcg aagacagcca caacggaaaa ctatgtagat taaaaggaat
181 agccccacta caattgggga aatgtaacat cgccggatgg ctcttgggaa acccagaatg
241 cgacccactg cttccagtga gatcatggtc ctacattgta gaaacaccaa actctgagaa
301 tggaatatgt tatccaggag atttcatcga ctatgaggag ctgagggagc aattgagctc
361 agtgtcatca ttcgaaagat tcgaaatatt tcccaaagaa agctcatggc ccaaccacaa
421 cacaaacgga gtaacggcag catgctccca tgaggggaaa agcagttttt acagaaattt
481 gctatggctg acggagaagg agggctcata cccaaagctg aaaaattctt atgtgaacaa
//
LOCUS HUMGPPSBAA 355 bp ds-DNA PRI 11-JUL-1990
DEFINITION Human pregnancy-specific beta-1 glycoprotein C-D gene, intron C1.
ACCESSION M34422
KEYWORDS beta-1 glycoprotein.
SOURCE Human placenta, clone PS-beta-G C.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 355)
AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
TITLE structure, evolution and chromosomal localization of the human
pregnancy-specific beta-1 glycoprotein gene family
JOURNAL Genomics 6, 579-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
IVS 1 355 pregnancy-specific beta-1 glycoprotein intron
C1
BASE COUNT 110 a 64 c 62 g 119 t
ORIGIN Chromosome 19.
1 gtaagtggat cccagcatcg ttggcaatag ggttttaggt ggagtctatc tggcattcag
61 agaagagtca ggaaaacaat tgtattccca gcctgtgtcc catgggcaca agcaaatccc
121 aaattctcct cctgaaccct ccaaatttgt ctaagaactt cgaaaacttt aacaaacagg
181 ctgatatctt cataatattc ccagcctaga ccaagcagga agaacattga tttcattgaa
241 ataattgata ataatgaaga taatgttttt atgattttta tttgaaaatt tgctgattct
301 ttaaatggtt tgttttctac attgatggaa tttttctctt ttaatctatc tacag
//
LOCUS HUMGPPSBD 1418 bp ss-mRNA PRI 11-JUL-1990
DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION M34421
KEYWORDS beta-1 glycoprotein.
SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G B.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1418)
AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
TITLE Structure, evolution and chromosomal localization of the human
pregnancy-specific beta-1 glycoprotein gene family
JOURNAL Genomics 6, 579-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 79 1359 pregnancy-specific beta-1 glycoprotein
precursor /hgml_locus_uid="LG0073P"
/nomgen="PSBG1" /map="19q13.1-q13.2"
sigp 79 180 pregnancy-specific beta-1 glycoprotein signal
peptide
matp 181 1356 pregnancy-specific beta-1 glycoprotein
mRNA 1 1418 pregnancy-specific beta-1 glycoprotein mRNA
BASE COUNT 408 a 398 c 296 g 316 t
ORIGIN Chromosome 19.
1 cagctgacag ccgtgctcag acagcttctg gatcctaggc tcatctccac agaggagaac
61 acgcaggcag cagagaccat ggggcccctc ccagcccctt cctgcacaca gcgcatcacc
121 tggaaggggc tcctgctcac agcatcactt ttaaacttct ggaacccgcc caccactgcc
181 gaagtcacga ttgaagccca gccacccaaa gtttctgagg ggaaggatgt tcttctactt
241 gtccacaatt tgccccagaa tcttcctggc tacttctggt acaaagggga aatgacggac
301 ctctaccatt acattatatc gtatatagtt gatggtaaaa taattatata tgggcctgca
361 tacagtggaa gagaaacagt atattccaac gcatccctgc tgatccagaa tgtcacccgg
421 aaggatgcag gaacctacac cttacacatc ataaagcgag gtgatgagac tagagaagaa
481 attcgacatt tcaccttcac cttatacttg gagactccca agccctacat ctccagcagc
541 aacttaaacc ccagggaggc catggaggct gtgcgcttaa tctgtgatcc tgagactctg
601 gacgcaagct acctatggtg gatgaatggt cagagcctcc ctgtgactca caggttgcag
661 ctgtccaaaa ccaacaggac cctctatcta tttggtgtca caaagtatat tgcaggaccc
721 tatgaatgtg aaatacggaa cccagtgagt gccagtcgca gtgacccagt caccctgaat
781 ctcctcccga agctgcccat cccctacatc accatcaaca acttaaaccc cagggagaat
841 aaggatgtct tagccttcac ctgtgaacct aagagtgaga actacaccta catttggtgg
901 ctaaacggtc agagcctccc cgtcagtccc ggggtaaagc gacccattga aaacaggata
961 ctcattctac ccagtgtcac gagaaatgaa acaggaccct atcaatgtga aatacgggac
1021 cgatatggtg gcctccgcag taacccagtc atcctaaatg tcctctatgg tccagacctc
1081 cccagaattt acccttcatt cacctattac cgttcaggag aaaacctcga cttgtcctgc
1141 ttcacggaat ctaacccacc ggcagagtat ttttggacaa ttaatgggaa gtttcagcaa
1201 tcaggacaaa agctctttat cccccaaatt actagaaatc atagcgggct ctatgcttgc
1261 tctgttcata actcagccac tggcaaggaa atctccaaat ccatgacagt caaagtctct
1321 ggtccctgcc atggagacct gacagagtct cagtcatgac tgcaacaact gagacactga
1381 gaaaaagaac aggctgatac cttcatgaaa ttcaagac
//
LOCUS HUMGPPSBE 1856 bp ss-mRNA PRI 11-JUL-1990
DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION M34420
KEYWORDS beta-1 glycoprotein.
SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G A.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1856)
AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
TITLE Structure, evolution and chromosomal localization of the human
pregnancy-specific beta-1 glycoprotein gene family
JOURNAL Genomics 6, 579-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 82 1368 pregnancy-specific beta-1 glycoprotein
precursor /hgml_locus_uid="LG0073P"
/nomgen="PSBG1" /map="19q13.1-q13.2"
sigp 82 183 pregnancy-specific beta-1 glycoprotein signal
peptide
matp 184 1365 pregnancy-specific beta-1 glycoprotein
mRNA 1 1856 pregnancy-specific beta-1 glycoprotein mRNA
BASE COUNT 544 a 476 c 364 g 472 t
ORIGIN Chromosome 19.
1 gcacagctga gagccatgct caggaagttt ctggatccta ggctcagctc cacagaggag
61 aacacgcagg cagcagagac catggggccc ctctcagccc ctccctgcac acagcgcatc
121 acctggaagg ggctcctgct cacagcatca cttttaaact tctggaaccc gcctaccact
181 gcccaagtca cgattgaagc cgagccaacc aaagtttcca aggggaagga cgttcttcta
241 cttgtccaca atttgcccca gaatcttgct ggctacatct ggtacaaagg gcaaatgaag
301 gacctctacc attacattac atcatacgta gtagatggtc aaataattat atatgggcct
361 gcatacagtg gacgagaaac agtatattcc aatgcatccc tgctgatcca gaatgtcacc
421 cgggaggacg caggatccta caccttacac atcgtaaagc gaggtgatgg gactagagga
481 gaaactggac atttcacctt caccttatac ctggagactc ccaagccctc catctccagc
541 agcaacttat accccaggga ggacatggag gctgtgagct taacctgtga tcctgagact
601 ccggacgcaa gctacctgtg gtggatgaat ggtcagagcc tccctatgac tcacagcttg
661 cagttgtcca aaaacaaaag gaccctcttt ctatttggtg tcacaaagta cactgcagga
721 ccctatgaat gtgaaatacg gaacccagtg agtgccagcc gcagtgaccc agtcaccctg
781 aatctcctcc cgaagctgcc caagccctac atcaccatca acaacttaaa ccccagggag
841 aataaggatg tcttagcctt cacctgtgaa cctaagagtg agaactacac ctacatttgg
901 tggctaaatg gtcagagcct cccggtcagt cccagggtaa agcgacccat tgaaaacagg
961 atcctcattc tacccagtgt cacgagaaat gaaacaggac cctatcaatg tgaaatacag
1021 gaccgatatg gtggcatccg cagttaccca gtcaccctga atgtcctcta tggtccagac
1081 ctccccagaa tttacccttc attcacctat taccattcag gagaaaacct ctacttgtcc
1141 tgcttcgcgg actctaaccc accagcagaa tattcttgga caattaatgg gaagtttcag
1201 ctatcaggac aaaagctctt tatcccccag attactacaa agcatagcgg gctctatgct
1261 tgctctgttc gtaactcagc cactggcatg gaaagctcca aatccatgac agtcaaagtc
1321 tctgctcctt caggaacagg acatcttcct ggccttaatc cattatagca gccgtgatgt
1381 catttctgta tttcaggaag actggcagac agttgctttc attcttcctc aaagtattta
1441 ccatcagcta cagtccaaaa ttgctttttg ttcaaggaga tttatgaaaa gactctgaca
1501 aggactcttg aatacaagtt cctgataact tcaagatcat accactggac taagaacttt
1561 caaaatttta atgaacaggc tgatacttca tgaaattcaa gacaaagaaa aaaacccaat
1621 tttattggac taaatagtca aaacaatgtt ttcataattt tctatttgaa aatgtgctga
1681 ttctttgaat gttttattct ccagatttat gcactttttt tcttcagcaa ttggtaaagt
1741 atacttttgt aaacaaaaat tgaaacattt gcttttgctc cctaagtgcc ccagaattgg
1801 gaaactattc aggagtattc atatgtttat ggtaataaag ttatctgcac aagttc
//
LOCUS HUMGPPSBF 2004 bp ss-mRNA PRI 11-JUL-1990
DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION M23575
KEYWORDS beta-1 glycoprotein.
SOURCE Human placenta, cDNA to mRNA, clone pSP1-i.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2004)
AUTHORS Rooney,B.C., Horne,C.H.W. and Hardman,N.
TITLE Molecular cloning of a cDNA for human pregnancy-specific
beta-1-glycoprotein: Homology with human carcinoembryonic antigen
and related proteins
JOURNAL Gene 71, 439-449 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 232 1518 pregnancy-specific beta-1 glycoprotein
precursor /hgml_locus_uid="LG0073P"
/nomgen="PSBG1" /map="19q13.1-q13.2"
sigp 232 333 pregnancy-specific beta-1 glycoprotein signal
peptide
matp 334 1515 pregnancy-specific beta-1 glycoprotein
BASE COUNT 579 a 510 c 406 g 509 t
ORIGIN
1 gggcgggaca actggtctga gtactatggc tgattttcgc tgtctggcat tgagaagcca
61 cacgcccctt ttgcttagga ggcctctctg ctggaggatg acgatggcat ggtttatcta
121 aggccactga caagtcatca atataggaca gcacagctga gagccatgct caggaagttt
181 ctggatccta ggctcagctc cacagaggag aacacgcagg cagcagagac catggggccc
241 ctctcagccc ctccctgcac acagcgcatc acctggaagg ggctcctgct cacagcatca
301 cttttaaact tctggaaccc gcctaccact gcccaagtca cgattgaagc cgagccaacc
361 aaagtttcca aggggaagga cgttcttcta cttgtccaca atttgcccca gaatcttgct
421 ggctacatct ggtacaaagg gcaaatgaag gacctctacc attacattac atcatacgta
481 gtagatggtc aaataattat atatgggcct gcatacagtg gacgagaaac agtatattcc
541 aatgcatccc tgctgatcca gaatgtcacc cgggaggacg caggatccta caccttacac
601 atcgtaaagc gaggtgatgg gactagagga gaaactggac atttcacctt caccttatac
661 ctggagactc ccaagccctc catctccagc agcaacttat accccaggga ggacatggag
721 gctgtgagct taacctgtga tcctgagact ccggacgcaa gctacctgtg gtggatgaat
781 ggtcagagcc tccctatgac tcacagcttg cagttgtcca aaaacaaaag gaccctcttt
841 ctatttggtg tcacaaagta cactgcagga ccctatgaat gtgaaatacg gaacccagtg
901 agtgccagcc gcagtgaccc agtcaccctg aatctcctcc cgaagctgcc caagccctac
961 atcaccatca acaacttaaa ccccagggag aataaggatg tcttagcctt cacctgtgaa
1021 cctaagagtg agaactacac ctacatttgg tggctaaatg gtcagagcct cccggtcagt
1081 cccagggtaa agcgacccat tgaaaacagg atcctcattc tacccagtgt cacgagaaat
1141 gaaacaggac cctatcaatg tgaaatacag gaccgatatg gtggcatccg cagttaccca
1201 gtcaccctga atgtcctcta tggtccagac ctccccagaa tttacccttc attcacctat
1261 taccattcag gagaaaacct ctacttgtcc tgcttcgcgg actctaaccc accagcagaa
1321 tattcttgga caattaatgg gaagtttcag ctatcaggac aaaagctctt tatcccccag
1381 attactacaa agcatagcgg gctctatgct tgctctgttc gtaactcagc cactggcatg
1441 gaaagctcca aatccatgac agtcaaagtc tctgctcctt caggaacagg acatcttcct
1501 ggccttaatc cattatagca gccgtgatgt catttctgta tttcaggaag actggcagac
1561 agttgctttc attcttcctc aaagtattta ccatcagcta cagtccaaaa ttgctttttg
1621 ttcaaggaga tttatgaaaa gactctgaca aggactcttg aatacaagtt cctgataact
1681 tcaagatcat acatggacta agaactttca aaattttaat gaacaggctg atacttcatg
1741 aaattcaaga caaagaaaaa aacccaattt tattggacta aatagtcaaa acaatgtttt
1801 cataattttc tatttgaaaa tgtgctgatt ctttgaatgt tttattctcc agatttatgc
1861 actttttttc ttcagcaatt ggtaaagtat acttttgtaa acaaaaattg aaacatttgc
1921 ttttgctccc taagtgcccc agaattggga aactattcag gagtattcat atgtttatgg
1981 taataaagtt atctgcacaa accc
//
LOCUS HUMLEUELA 920 bp ss-mRNA PRI 11-JUL-1990
DEFINITION Human elastase/medullasin mRNA, complete cds.
ACCESSION M34379
KEYWORDS elastase; medullasin.
SOURCE Human leukemic cell line ML3, cDNA to mRNA, clone pSRHLE.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 920)
AUTHORS Okano,K., Aoki,Y., Shimizu,H. and Naruto,M.
TITLE Functional expression of human leukocyte elastase (HLE)/medullasin
in eukaryotic cells
JOURNAL Biochem. Biophys. Res. Commun. 167, 1326-1332 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept 39 842 elastase/medullasin precursor (EC 3.4.21.37)
sigp 39 119 elastase/medullasin signal peptide
matp 126 839 elastase/medullasin
BASE COUNT 141 a 329 c 287 g 163 t
ORIGIN
1 gcacggaggg gcagagaccc cggagcccca gccccaccat gaccctcggc cgccgactcg
61 cgtgtctttt cctcgcctgt gtcctgccgg ccttgctgct ggggggcacc gcgctggcct
121 cggagattgt ggggggccgg cgagcgcggc cccacgcgtg gcccttcatg gtgtccctgc
181 agctgcgcgg aggccacttc tgcggcgcca ccctgattgc gcccaacttc gtcatgtcgg
241 ccgcgcactg cgtggcgaat gtaaacgtcc gcgcggtgcg ggtggtcctg ggagcccata
301 acctctcgcg gcgggagccc acccggcagg tgttcgccgt gcagcgcatc ttcgaaaacg
361 gctacgaccc cgtaaacttg ctcaacgaca tcgtgattct ccagctcaac gggtcggcca
421 ccatcaacgc caacgtgcag gtggcccagc tgccggctca gggacgccgc ctgggcaacg
481 gggtgcagtg cctggccatg ggctggggcc ttctgggcag gaaccgtggg atcgccagcg
541 tcctgcagga gctcaacgtg acggtggtga cgtccctctg ccgtcgcagc aacgtctgca
601 ctctcgtgag gggccggcag gccggcgtct gtttcgggga ctccggcagc cccttggtct
661 gcaacgggct aatccacgga attgcctcct tcgtccgggg aggctgcgcc tcagggctct
721 accccgatgc ctttgccccg gtggcacagt ttgtaaactg gatcgactct atcatccaac
781 gctccgagga caacccctgt ccccaccccc gggacccgga cccggccagc aggacccact
841 gagaagggct gcccgggtca cctcagctgc ccacacccac actctccagc atctggcaca
901 ataaacattc tctgttttgt
//
LOCUS MSGIS6110 1360 bp ds-DNA BCT 11-JUL-1990
DEFINITION M.tuberculosis-50 complex IS6110 insertion sequence-like element.
ACCESSION M29899
KEYWORDS insertion sequence.
SOURCE M.tuberculosis (strain H37RV) DNA (cosmid library pHC79), clone
I21.
ORGANISM Mycobacterium tuberculosis
Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae.
REFERENCE 1 (bases 1 to 1360)
AUTHORS Thierry,D., Cave,M.D., Eisenach,K.D., Crawford,J.T., Bates,J.H.,
Gicquel,B. and Guesdon,J.L.
TITLE IS6110 an IS-like element of Mycobacterium tuberculosis-50 complex
JOURNAL Nucleic Acids Res. 18, 188-188 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Thierry 15-NOV-1989.
BASE COUNT 269 a 439 c 432 g 220 t
ORIGIN
1 cgatgaaccg ccccggcatg tccggagact ccagttcttg gaaaggatgg ggtcatgtca
61 ggtggttcat cgaggaggta cccgccggag ctgcgtgagc gggcggtgcg gatggtcgca
121 gagatccgcg gtcagcacga ttcggagtgg gcagcgatca gtgaggtcgc ccgtctactt
181 ggtgttggct gcgcggagac ggtgcgtaag tgggtgcgcc aggcgcaggt cgatgccggc
241 gcacggcccg ggaccacgac cgaagaatcc gctgagctga agcgcttagc ggcgggacaa
301 cgccgaattg cgaagggcga acgcgatttt aaagaccgcg tcggctttct tcgcggccga
361 gctcgaccgg ccagcacgct aattaacggt tcatcgccga tcatcagggc caccgcgagg
421 gccccgatgg tttgcggtgg ggtgtcgagt cgatctgcac acagctgacc gagctgggtg
481 tgccgatcgc cccatcgacc tactacgacc acatcaaccg ggagcccagc cgccgcgagc
541 tgcgcgatgg cgaactcaag gagcacatca gccgcgtcca cgccgccaac tacggtgttt
601 acggtgcccg caaagtgtgg ctaaccctga accgtgaggg catcgaggtg gccagatgca
661 ccgtcgaacg gctgatgacc aaactcggcc tgtccgggac cacccgcggc aaagcccgca
721 ggaccacgat cgctgatccg gccacagccc gtcccgccga tctcgtccag cgccgcttcg
781 gaccaccagc acctaaccgg ctgtgggtag cagacctcac ctatgtgtcg acctgggcag
841 ggttcgccta cgtggccttt gtcaccgacg cctacgtcgc aggatcctgg gctggcgggt
901 cgcttccacg atggccacct ccatggtcct cgacgcgatc gagcaagcca tctggacccg
961 ccaacaagaa ggcgtactcg acctgaaaga cgttatccac catacggata ggggatctca
1021 gtacacatcg atccggttca gcgagcggct cgccgaggca ggcatccaac cgtcggtcgg
1081 agcggtcgga agctcctatg acaatgcact agccgagacg atcaacggcc tatacaagac
1141 cgagctgatc aaacccggca agccctggcg gtccatcgag gatgtcgagt tggccaccgc
1201 gcgctgggtc gactggttca accatcgccg cctctaccag tactgcggcg acgtcccgcc
1261 ggtcgaactc gaggctgcct actacgctca acgccagaga ccagccgccg gctgaggtct
1321 cagatcagag agtctccgga ctcaccgggg cggttcacga
//
LOCUS MUSIGHAAT 348 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Mouse Ig J558 family active H-chain mRNA V-J3 region
from hybridoma CE5, partial cds.
ACCESSION M34119
KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene.
SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
TITLE Identical D region sequences expressed by murine monoclonal
antibodies specific for a human tumor-associated antigen
JOURNAL J. Immunol. 144, 1965-1968 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 348 Ig H-chain V-J3 region (AA at 1)
recomb 294 295 J-region end/D-region start
recomb 303 304 D-region end/J-region start
BASE COUNT 89 a 82 c 97 g 80 t
ORIGIN Chromosome 12.
1 caggttcagc tgcagcagtc tggagctgaa ctgatgaagc ctggggcctc agtgaagata
61 tcctgcaagg ctactggcta cacattcagt aagtactgga tagagtgggt aaagcagagg
121 cctggacatg gccttgagtg gattggagag attttacctg gaagtggtag tactaaccat
181 gatgagaagt tcaagggcaa ggccacattc actgcagata catcctccaa cacagcctac
241 atgcaactca gcagcctgac atctgaggac tctgccgtct attactgtgc aagagacggt
301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS MUSIGKCSR 321 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma GA733,
partial cds.
ACCESSION M34120
KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE Mouse (Balb/c) hybridoma GA733, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
TITLE Identical D region sequences expressed by murine monoclonal
antibodies specific for a human tumor-associated antigen
JOURNAL J. Immunol. 144, 1965-1968 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 321 Ig kappa-chain (AA at 1)
BASE COUNT 87 a 83 c 74 g 77 t
ORIGIN Chromosome 6.
1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagtgtcagc
61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
121 ggacaatctc ctaaactact gatttactcg gcatccgacc ggtacactgg agtccctgat
181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
241 gaagacctgg cagtttatta ctgtcaccaa cattatatta ctcctcggac gttcggtgga
301 ggcaccaaac tggaaatcaa a
//
LOCUS MUSIGKCSS 321 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma
C017-1A, partial cds.
ACCESSION M34121
KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE Mouse (Balb/c) hybridoma C017-1A, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
TITLE Identical D region sequences expressed by murine monoclonal
antibodies specific for a human tumor-associated antigen
JOURNAL J. Immunol. 144, 1965-1968 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 321 Ig kappa-chain (AA at 1)
BASE COUNT 89 a 77 c 78 g 77 t
ORIGIN Chromosome 6.
1 aacattgtaa tgacccaatc tcccaaatcc atgtccatgt cagtaggaga gagggtcacc
61 ttgacctgca aggccagtga gaatgtggtt acttatgttt cctggtatca acagaaacca
121 gagcaatctc ctaaactctt gatttacggg gcctccaacc ggtacactgg ggtccccgat
181 cgcttcacag gtagtggatc tgcaacagat ttcactctga ccattagtag tgtgcaagct
241 gaagaccttg cagattatca ctgtggacag ggttacagct atccgtacac gttcggaggg
301 gggaccaagc tggaaataaa a
//
LOCUS MUSIGKCST 318 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma
CE5, partial cds.
ACCESSION M34122
KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
TITLE Identical D region sequences expressed by murine monoclonal
antibodies specific for a human tumor-associated antigen
JOURNAL J. Immunol. 144, 1965-1968 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 318 Ig kappa-chain (AA at 1)
BASE COUNT 84 a 78 c 80 g 76 t
ORIGIN Chromosome 6.
1 gacattgtga tgacccagtc tcagaaattc atgtccacat cagtaggaga cagggtcggc
61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaatca
121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgag
181 cgcttcgctg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
241 gaagacctgg cagtttatta ctgtcatcaa cattatagta ctcggacgtt cggtggaggc
301 accaagctgg aaatcaaa
//
LOCUS PSERRSAA 1517 bp ss-rRNA RNA 11-JUL-1990
DEFINITION P.aeruginosa 16S ribosomal RNA.
ACCESSION M34133
KEYWORDS 16S ribosomal RNA; ribosomal RNA; small subunit ribosomal RNA.
SOURCE P.aeruginosa (strain 25330) ribosomal RNA.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1517)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept of Microbiology
407 S. Goodwin Avenue
Urbana, IL 61801
email: carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1517 16S rRNA
BASE COUNT 379 a 334 c 470 g 301 t 33 others
ORIGIN
1 ntactgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa
61 gtcgagcgga tgaagggagc ttgctcctgg attcagcggc ggacgggtga gtaatgccta
121 ggaatctgcc tgatagtggg ggataacgtc cggaaacggg cgctaatacc gcatacgtcc
181 tgagggagaa agggggggat cctcggacct cacgctatcn gatgagccta ggtcggatta
241 gctagttggt ggggtaaagg cctaccaagg cgacgatccg taactggtct gagaggacga
301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaata
361 ttggacaatg ggcgaaagcc ngatccagcc atgccgcgtg tgtgaagaag gtcttcggat
421 tgtaaagcac tttaagttgg gaggaagggc agtaagttaa taccttgctg ttttgacgtt
481 accaacagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac gaagggtgcg
541 agcgttaatc ggaattactg ggcgtaaagc gcgcgtaggt ggttcagcaa gttggatgtg
601 aaatccccgg gctcaacctg ggaactgcat ccnaaactac tgagctagag tacggtagag
661 ggtggtggaa tttcctgtgt agcggtgaaa tgcgtagata taggaaggaa caccagtggc
721 gaaggcgacc acctggactg atactgacac tgaggtgcga aagcgtgggg agcaaacagg
781 attagatacc ctggtagtcc acgccgtaaa cgatgtcgac tagccgttgg gatccttgag
841 atcttagtgg cgcagctaac gcgataagtc gaccgcctgg ggagtacggc cgcaaggtta
901 aaactcaaat gaattgacgg gggcnngcac aagcggtgga gcatgtggtt taattcgaag
961 caacgcgaag aaccttacct ggccttgaca tgctgagaac tttccagaga tggattggtg
1021 ccttcgggaa ctcagacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt
1081 gggttaagtc ccgtaacgag cgcaaccctt gtccttagtt accagcacct cgggtgggca
1141 ctctaaggag actgccggtg acaaaccgga ggaaggtggg gatgacgtca agtcatcatg
1201 gcccttacgg cnagggctac acacgtgcta caatggtcgg tacaaagggt tgcgaagccg
1261 cgaggtggag ctaatcccat aaaaccgatc gtagtccgga tcgcagtctg caactcgact
1321 gcgtgaagtc ggaatcgcta gtaatcgtga atcagaatgt cacggtgaat acgttcccgg
1381 gccttgtaca caccgcccgt cacaccatgg gagtgggttg ctccagaagt agctagtcta
1441 accgcaaggg ggacggttac cacggagtga ttcatgnnnn nnnnnnnnnn gtaacaagnn
1501 nnnnnnnnnn gaacctg
//
LOCUS RATNESTIN 5946 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Rat nestin mRNA, complete cds.
ACCESSION M34384
KEYWORDS intermediate filament protein; nestin.
SOURCE Rat (strain E15) embryo central nervous system, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5946)
AUTHORS Lendahl,U., Zimmerman,L.B. and McKay,R.D.G.
TITLE CNS stem cells express a new class of intermediate filament protein
JOURNAL Cell 60, 585-595 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 128 5545 nestin (128 could be 161)
mRNA 1 5495 nestin mRNA
BASE COUNT 1600 a 1401 c 1871 g 1074 t
ORIGIN
1 tgctggagtt ctccgcttcc gctgggtcac tgtcgccgct acttcttttc aacccctaaa
61 agctccacgg gccactccct tctctagtgc tccacgtccg cttgccctcg ggggccagac
121 cagcgacatg gagggttgcg tcggggaaga atcttttcag atgtgggagc tcaatcgacg
181 cctggaggcc tacctgaccc gggtcaagac gctagaagag caaaaccagc tgctcagcgc
241 cgagcttggg ggactccggg cgcagtccgg agacacctcc tggagagccc gagccgatga
301 cgagctggca tccctgcgga tcctcgtcga tcagcgctgg cgggagaagc tcgaggctga
361 ggtgcagcgc gacaaccttg cggaagagct ggagagcgtg gcgggccggt gccagcaggt
421 gcggcttgct cgggagcgga ccgtccagga ggccgcctgc agccggcgcg cactcgaggc
481 ggagaagaat gcgcggggct ggctgagcac ccaggcggcc gagctggagc gggagttaga
541 ggctctgcga gccgcgcacg aggaggagcg cgcacacctg aacgcccagg ccgcctgtgc
601 gcctcgccgg ccccccgcac cgccccaccg gatccccggt ccggcccccg aagtcgagga
661 tctggccagg cgactaggcg aagtgtggcg cggggcggtg cgtgactacc aggagcgcgt
721 ggctcacatg gaaagctcgc tgggtcaggc acgcgagcgg ctgagccaag ccgtgcgggg
781 cgctcgggag tgtcgcttag aggtgcaaca gctgcaggct gatcgcgaca gcctccagga
841 gcgcagagaa gcgctggaac agagattgga aggccgctgg caggaccggc tgcaggccac
901 tgataagttc cagctggctg tggaagccct ggagcaggag aagcaaggtc tacagagtca
961 gatcgctcag atcctggaag gtgggcagca actggcacac ctcaagatgt cccttagtct
1021 ggaggtggct acatacagga ctctgctgga ggctgagaac tctcggttgc agacacctgg
1081 acgaggttcc caggcttctc ttggctttct ggaccccaag ctgaagccga atttccttgg
1141 gataccagag gaccagtacc tgggatctgt gctccctgcc ctcagcccca catccttccc
1201 ttcccccttg cctaataccc ttgagactcc tgtgacagcc ttcctgaaga ctcaggagtt
1261 ccttcaggcc agaaccccca ccttggccag cactcccatc ccacctatat ctgaggctcc
1321 ctgtcctcca aatgcagagg tgagagccca ggaggtccct ctttctctgc tccagacaca
1381 ggctccagag cccctttggc tgaaggccac agtgcctagt tcttctgcta tcctcccaga
1441 actagaggaa cctgggggca agcagcaggg tcacttccct gatgatctga cctccttagc
1501 cacaaacctc aaccctcacc accctacttt agaggctaaa gatggagaat ccagtgagtc
1561 tagagtttct agcatattcc aggaagatga ggggcaaatc tgggaactgg tagagaaaga
1621 agcagatata gaggtaaaag tagaaaacag ctcagcccag aaaacacaag aaagtggtct
1681 ggacacagaa gaaacccagg attcccaggg acctttgcag aaggaaacac tgaaggctct
1741 aggagaggag ccactgatgt ctctgaaaat ccagaactat gagacagcag ggaaagagaa
1801 ttgcaattct tctacagaag gccacctggg aacactagaa ggcccagaaa aagaaaagca
1861 aataccacta aagtctttag aagaaaagaa tgtagagtca gagaaaactc tagaaaatgg
1921 ggttcctgta ctatctgagc ttttaggaaa agaagacaca agaacagagg atcaagaatt
1981 aatgtctcct aaaggtacac taaagagatt ttcatctcta ggaaaggaaa gtcaagaagt
2041 agtgaggcct tcaaaagagg ggaacctaga atcatggaca gcttttaaag aggagagcca
2101 acacccactg ggatttccag gagctgagga ccagatgctt gagagactgg tagagaaaga
2161 ggatcagagc ttcccaaggt ctccagagga agaggaccag gaggcatgta gacctctgca
2221 gaaagagaat caggaaccac tagggtatga agaagcagag ggccagatac ttgagagact
2281 gatagaaaaa gagagtcagg agtccctgag gtctccagaa gaagaggacc aggaggcagg
2341 tagatctctg cagaaagaga atcaggagcc actagggtat gaagaagcag aggaccagat
2401 gcttgagaga ctgatagaaa aagagagtca ggagtccctg aagtctccag aagaaaacca
2461 gaggattggg aagcctctag aaagagagaa tcagaaatct ctgaggtatc ttgaagaaaa
2521 ccaggagact tttgtaccac tagaaagcag gaaccagagg ccactgagat ctctagaagt
2581 agaagaggag gagcagagaa ttgtgaaacc tctagaaaaa gtgagtcagg attccctcgg
2641 atctctagca gaagagaatg tgcagccact gaggtatctg gaagaagatg actgcataaa
2701 taagagcctt ctagaagaca agactcacaa gtccttgggg tctcttgaag atagaaatgg
2761 ggatagcatt attataccac aagaaagtga gacccaggtt tcattgaggc ctccagaaga
2821 ggaggaccag aggattgtga accatctaga aaaagaaagt caggagttct cgaggtcttc
2881 agaagaagaa gagcaggtga tggagagatc tctagaagga gagaaccatg aatcactgag
2941 ttctgtagaa aaagaggacc agatggttga gagccaacta gagaaagaga gtcaggactc
3001 agggaagtct cttgaagatg agagccagga gacctttgga cctctggaaa aagagaatgc
3061 agagtccctg agatctctag caggacagga ccaagaggaa cagaagcttg aacaagagac
3121 ccaacaaaca ctgagggctg tagggaatga gcagatggca gtgagcccac cagaaaaggt
3181 ggatccagag ttaccgaagc ctcttggaaa tgaccaggaa atagctagat ctcttggaaa
3241 agagaatcaa gagtcactag tgtcactgaa agaaaaaggt atagagacag tgaagtcttt
3301 agaaacagag atcatagaac cactggagac tgcagaagag gacctggaaa gaaggaagtc
3361 tatagatact caggagccat tgtggtctac tgaagtggct agagagacag tagaacctcc
3421 agaagatgag cccccaggat cgctagggtc tgtggatgag aaccgagaga cactgacatc
3481 ccttgaaaag gagagtcaag aactgagctc tctgggcaag tggaacgtag agaccagggt
3541 agaggacagt cagcagtgcc tgcaagtaga agagggtctg caggaggaac agcaccaaga
3601 gtctctgaga gaggtgaagc aggagctgcc tagctctgga aatcaacagc ggtgggagga
3661 tgtggtggag ggcaaagcag tgggtcagga agcacctctg gcaaccacag gagtgggaac
3721 tgaggataag gcagagttgc atctgagggg gcaaggtgga gaggaagaag ctgcagcaga
3781 gggagagctg ttgcaggata ttgtggggga ggcctggagt ctggggagct ctgagcccaa
3841 ggagcagagg gtccctgctg aggccctcga caacctggaa ggaggggcct tagaggtccc
3901 agttgctcag tcaatgccag aggtgacaga gcgagatgag gatagagccc aagcaggtga
3961 acaagactcc atagaggtga cccttgggtt agaggctgcc agaactggac tggaactcga
4021 gcaggaagtg gtagggctag aggacccaag gcattttgcc agggaggagg ccattccccc
4081 atccctgggg gaggaaagtg tgaaggcaaa gatagctcag ggcttggaag ggcctggaaa
4141 ggaaccaaaa gaggcaggtg ctctggactc ggggatcctt gaattgccca agactagcag
4201 cgaggctctg gaatgccagg gccatgaaga gtctgagtcc atggagggct gggaagaaga
4261 ggaggcctca ctggagactt cagatcatga gggcagtgat gcccctcagc ccaggccccc
4321 agaaacagaa gaagatgagg gtgcacaggc agcactgaca gcccctggtc ccaagctctt
4381 ggaaccctgt tcacccatcc caatcctgac agatgcccat gagctgcagc cccaggctga
4441 ggggatccag gaggctggct ggcagccaga agctgggtct gaagcactag aaagggtaga
4501 aaatgagcca gagtttggtc ttggggagat cccggagggc ctccaggatt gggaagaggg
4561 cagagaagaa agcgaggcag atgatctagg ggaaactctc cctgactcta ctcccctggg
4621 cctctacctg aggtcccctg cttctccaaa gtgggatctg gctggagaac agaggctttc
4681 ccctcaaggg gatgccggga aggaagactg gggtcctgct gtccccgctg cccagggcct
4741 cagtggtcca ccggaagagg aggaggagca aggccatggc tctgacctat catctgagga
4801 gtttgaggac ctagggactg aggcctctct tcttccaggg gttcccaagg aggtggcaga
4861 tcacgtgggc caagtgcccc cggtactgca gcctgcatgc tgggatcagg gtggggaatc
4921 tgatgggttt gctgatgagg aagaaagtgg ggaggaggga gaggaagaag atgctgatga
4981 ggaaggagca gagtcaggag ctcagtggtg ggggtcaggg gcctctggtg gaggctgcaa
5041 ggtccaggat attgcccaaa gaggagaccc ggtacaggag tctgtgggtg tcagtggtct
5101 ctgggatgat ggcttgagag gtgctgcagc taatgttcct gccctagaga tggtatctca
5161 ggacagtgct gagccttctg ggtcagagga gtctgagtct gcttccttgg agggggagga
5221 aggtcaagtg actgaccatt tagatgctcc ccaggaggtg accagcatgg tcccgggggt
5281 aggagatgcc tttgacattg gtggccagag ccccaacttg gactcagaac aagtgaatgg
5341 gaaaatggag aatggactag aacaggctga ggggcaggtg gtcctggatg gggacgagga
5401 tcaagaactc ctattacagg gacaggaggt gggtgctcta aaggttcctt tggtagcatc
5461 tcctgtgcat ctaggcccaa gccagcccct gaagttcact ctgagtgggg tagatgggga
5521 ttcctggtcc tcaggggaag actagaaact gcccctctgg ctctgaggat gtactggtgg
5581 ggatgtccct ccctgctctg ggtgaccact cttagctttg ataacttgac ccatggtatt
5641 tgtcctggag agttgtggct gggctgagca agggaggtga gatcctcctg aaggctcagg
5701 agttccaggc ctatagttct accccctctt tcttctgtgg ctcacctgct ggaagaggcc
5761 tgggcccaga gctttcccac aaggctgttc tggccacagc ttgctagcct tgcctaccac
5821 ctgcacaagg tctggtctgg tgtatgacca ggggagctga gggcagcatt tatctgaccc
5881 ttcatctcag cctgctgaga gcttgttcct ctcttcctcc ctgaataaag ccgtatccct
5941 acctac
//
LOCUS SYNCMPA 1885 bp ds-DNA BCT 11-JUL-1990
DEFINITION Synechococcus sp. 42-kD membrane protein (cmpA) gene, complete cds.
ACCESSION M32999
KEYWORDS membrane protein.
SOURCE Synechococcus sp. (strain PCC 7942) DNA.
ORGANISM Synechococcus sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Chroococcales.
REFERENCE 1 (bases 1 to 1885)
AUTHORS Omata,T., Carlson,T.J., Ogawa,T. and Pierce,J.
TITLE Sequencing and modification of the gene encoding the 42 kilodalton
protein in the cytoplasmic membrane of Synechococcus PCC 7942
JOURNAL Plant Physiol. 93, 305-311 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Pierce, 20-MAR-1990.
FEATURES from to/span description
pept 141 1493 49-kD membrane protein (cmpA)
BASE COUNT 452 a 508 c 487 g 438 t
ORIGIN
1 ggttatcagc cttatcggtc tggaataacc agttggccta aagtcatgca gacagagcgt
61 ttctgcgcct ctcgtgaagc aattcgcaca acttgtccat ctttagaggc atctcctgtt
121 gtgggatgta ggggagacgt atgaacgaat ttcaaccagt caatcgtcgt cagtttctgt
181 tcacgctcgg agcaaccgct gctagcgcta ttttgctgaa gggttgcggt aatcctcctt
241 ccagtagcgg cggcgggact tctagtacaa ctcagccaac tgctgcaggg gcgagtgatc
301 tggaagtcaa gacaatcaaa ttgggctaca tccccatctt tgaagcggct ccactgatca
361 ttggccgcga aaaaggcttt tttgccaaat atggcttgga tgttgaagtc tcgaaacaag
421 ccagctgggc agctgctcgc gataacgtca ttctcggttc tgctggtggc ggcatcgatg
481 gcggtcagtg gcaaatgccg atgcctgcct tgctaacgga aggtgcgatc agcaacggtc
541 aaaaagttcc catgtatgtc ttggcttgct tgagcaccca aggcaatggc atcgctgttt
601 ccaatcagct caaggcccaa aatctgggct tgaagctagc gcccaaccgc gactttatcc
661 tcaactaccc gcaaactagc ggccggaagt tcaaagcatc ctacaccttc ccgaacgcca
721 accaagactt ctggattcgc tattggtttg cagctggcgg tatcgatcct gataaagaca
781 ttgaactctt gaccgttccc agcgcagaaa ctctacaaaa tatgcgcaat ggcacgatcg
841 attgcttcag taccggcgat ccctggccgt cgcggattgc caaagatgac atcggctatc
901 aagctgcgct gacaggtcaa atgtggcctt accaccccga ggaattcttg gcgctgcgag
961 cagactgggt agacaaacat ccgaaagcta cgctcgcctt gctgatgggc ttgatggaag
1021 cgcagcaatg gtgcgatcag aaagcaaatc gggcagagat ggccaagatc ctctccggtc
1081 gcaacttctt taacgtgccg gtttcgatcc tgcagccgat tctggaaggt caaatcaaag
1141 ttggagcaga cggaaaagat ctcaacaact ttgatgccgg cccgctcttc tggaagagtc
1201 cgcgcggcag tgtctcctat ccctacaaag ggctcaccct ctggttcttg gtggagtcga
1261 tccgctgggg cttcaacaag caagtgctac ctgacattgc agccgcccag aaactcaacg
1321 atcgcgtgac tcgtgaagac ctctggcaag aggcagccaa gaaattaggg gtgcccgctg
1381 cggatatccc aaccggatcg actcgcggta ccgagacctt ctttgatggc atcacctaca
1441 acccagacag tccgcaagct tatctccaaa gcttgaagat taaacgggca taagtagggg
1501 cttcaatcat caaccttagt tcagtcacta tcaggagata gacagaccat ggttactgca
1561 cgggaaacaa gacgaaacgg aagtcgtcct tctggcttaa aaaaatggcg tcagaaactc
1621 gatggcatct tgctaccgct agcaggaatt ttgggtttcc tcatcatttg gcagatcttt
1681 tctagcacgg gcaacccgct tgcccggccc tgctcagtct cttcacagaa gagagaacac
1741 gcgagttgct gccctatccc ttcttggatc gcggcgggct tgataaaggt ctgttctggc
1801 agacgtatcg cttagttctg acgcgggtgg cccagggctt ttcgatccgc agccatcatc
1861 ggcatcggca tttccgttgg aattc
//
LOCUS ECOOXYR 1264 bp ds-DNA BCT 11-JUL-1990
DEFINITION E.coli oxyR regulatory protein gene, complete cds.
ACCESSION J04553
KEYWORDS oxyR gene; regulatory protein.
SOURCE E.coli (strain K12, CSH50) DNA, clones pAQ17 and pMomR1200.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1264)
AUTHORS Christman,M.F., Storz,G. and Ames,B.N.
TITLE Oxyr, a positive regulator of hydrogen peroxide-inducible genes in
Escherichia coli and Salmonella typhimurium, is homologous to a
family of bacterial regulatory proteins
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3484-3488 (1989)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
G.Storz, 23-MAY-1989.
FEATURES from to/span description
pept 203 1120 oxyR protein
mRNA 170 > 1264 oxyR mRNA
signal 133 138 -35 region
signal 157 162 -10 region
binding 191 195 ribosome binding site
BASE COUNT 289 a 308 c 368 g 299 t
ORIGIN 89.6 minutes on K12 map.
1 ggatcctgga gatccgcaaa agttcacgtt ggctttagtt attcgagttg agaaactctc
61 gaaacgggca gtgacttcaa gggttaaaag aggtgccgct ccgtttctgt gagcaattat
121 cagtcagaat gcttgatagg gataatcgtt cattgctatt ctacctatcg ccatgaacta
181 tcgtggcgat ggaggatgga taatgaatat tcgtgatctt gagtacctgg tggcattggc
241 tgaacaccgc cattttcggc gtgcggcaga ttcctgccac gttagccagc cgacgcttag
301 cgggcaaatt cgtaagctgg aagatgagct gggcgtgatg ttgctggagc ggaccagccg
361 taaagtgttg ttcacccagg cgggaatgct gctggtggat caggcgcgta ccgtgctgcg
421 tgaggtgaaa gtccttaaag agatggcaag ccagcagggc gagacgatgt ccggaccgct
481 gcacattggt ttgattccca cagttggacc gtacctgcta ccgcatatta tccctatgct
541 gcaccagacc tttccaaagc tggaaatgta tctgcatgaa gcacagaccc accagttact
601 ggcgcaactg gacagcggca aactcgattg cgtgatcctc gcgctggtga aagagagcga
661 acgattcatt gaagtgccgt tgtttgatga gccaatgttg ctggctatct atgaagatca
721 cccgtgggcg aaccgcgaat gcgtaccgat ggccgatctg gcaggggaaa aactgctgat
781 gctggaagat ggtcactgtt tgcgcgatca ggcaatgggt ttctgttttg aagccggggc
841 ggatgaagat acacacttcc gcgcgaccag cctggaaact ctgcgcaaca tggtggcggc
901 aggtagcggg atcactttac tgccagcgct ggctgtgccg ccggagcgca aacgcgatgg
961 ggttgtttat ctgccgtgca ttaagccgga accacgccgc actattggcc tggtttatcg
1021 tcctggctca ccgctgcgca gccgctatga gcagctggca gaggccatcc gcgcaagaat
1081 ggatggccat ttcgataaag ttttaaaaca ggcggtttaa accgtttaac gcagctaccc
1141 gatagcttcc gccatcgtcg ggtagttaaa ggtggtgttg acgaagtact caatagtgtt
1201 gccgccacct ttctgttcca taatcgcctg accgatatga ataatttcgg cgagcgcgct
1261 cgcc
//
LOCUS CLLRRE 1860 bp ss-rRNA RNA 11-JUL-1990
DEFINITION C.sapidus 18S rRNA, 3' end.
ACCESSION M34360
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE C.sapidus rRNA.
ORGANISM Callinectes sapidus
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura;
Brachyrhyncha; Portunoidea; Portunidae.
REFERENCE 1 (bases 1 to 1860)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-113 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1860 18S ribosoma RNA
BASE COUNT 359 a 375 c 421 g 367 t 338 others
ORIGIN
1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct
61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt
121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta
181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc
241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc
301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc
361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac
421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac
481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat
541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct
601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg
661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc
721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata
841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc
901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc
961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg
1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt
1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt
1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn
1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac
1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg
1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn
1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct
1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc
1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga
1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag
1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc
1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc
1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn
1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
//
LOCUS HUMNCSRC 138 bp ds-DNA PRI 11-JUL-1990
DEFINITION Human membrane-associated tyrosine protein kinase (C-SRC) gene,
exons 3, 4, NI, and NII, partial cds.
ACCESSION M34469
KEYWORDS membrane-associated tryosine protein kinase.
SOURCE Human adult brain DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 138)
AUTHORS Pyper,J.M. and Bolen,J.B.
TITLE Identification of a novel neuronal C-SRC exon expressed in human
brain
JOURNAL Mol. Cell. Biol. 10, 2035-2040 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 29 membrane-associated tyrosine protein kinase
(C-SRC), exon 3 (AA at 1)
30 47 membrane-associated tyrosine protein kinase,
exon NI
48 80 membrane-associated tyrosine protein kinase,
exon NII
81 > 136 membrane-associated tyrosine protein kinase,
exon 4
variant 117 117 a or g
variant 135 135 c or t
BASE COUNT 36 a 40 c 40 g 22 t
ORIGIN
1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc
61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga
121 cagacaggct acatcccc
//
LOCUS MHVAPEPA 1000 bp ss-RNA VRL 11-JUL-1990
DEFINITION Murine coronavirus peplomer (S) protein gene.
ACCESSION M34435
KEYWORDS peplomer protein.
SOURCE Murine (strain JHM-DL) RNA.
ORGANISM Murine coronavirus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 1000)
AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C.
TITLE A clustering of RNA recombination sites adjacent to a hypervariable
region of the peplomer gene of Murine coronavirus
JOURNAL Virology 175, 548-555 (1990)
STANDARD simple staff_review
BASE COUNT 258 a 192 c 227 g 317 t 6 others
ORIGIN
1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag
61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa
121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct
181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa
241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta
301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt
361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca
421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac
481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg
541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga
601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg
661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt
721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt
781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa
841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa
901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag
961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa
//
LOCUS MHVAPEPB 843 bp ss-RNA VRL 11-JUL-1990
DEFINITION Murine coronavirus peplomer (S) protein gene.
ACCESSION M34436
KEYWORDS peplomer protein.
SOURCE Murine (strain A59) RNA.
ORGANISM Murine coronavirus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 843)
AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C.
TITLE A clustering of RNA recombination sites adjacent to a hypervariable
region of the peplomer gene of Murine coronavirus
JOURNAL Virology 175, 548-555 (1990)
STANDARD simple staff_review
FEATURES from to/span description
site 400 401 deletion site
BASE COUNT 227 a 153 c 183 g 280 t
ORIGIN
1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag
61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa
121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct
181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa
241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta
301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt
361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt
421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc
481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc
541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat
601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt
661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc
721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact
781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc
841 gaa
//
LOCUS MHVSP 3780 bp ss-mRNA VRL 11-JUL-1990
DEFINITION Mouse hepatitis virus surface protein S mRNA, complete cds.
ACCESSION X04797 M34437
KEYWORDS glycoprotein; membrane glycoprotein; surface glycoprotein;
surface projection glycoprotein.
SOURCE Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones
pJMS1010, pJS112, and pJS92.
ORGANISM Murine hepatitis virus A59
Unclassified.
REFERENCE 1 (bases 1 to 3780)
AUTHORS Schmidt,I., Skinner,M. and Siddell,S.
TITLE Nucleotide Sequence of the Gene Encoding the Surface Projection
Glycoprotein of Coronavirus MHV-JHM
JOURNAL J. Gen. Virol. 68, 47-56 (1987)
STANDARD simple automatic
COMMENT EMBL features not translated to GenBank features:
key from to description
SITE 24 30 put. region of homology preceeding
mRNA 5' initiation codons
SITE 121 129 pot. N-glycosylation site
SITE 208 216 pot. N-glycosylation site
SITE 430 438 pot. N-glycosylation site
SITE 604 612 pot. N-glycosylation site
SITE 1099 1107 pot. N-glycosylation site
SITE 1333 1341 pot. N-glycosylation site
SITE 1636 1644 pot. N-glycosylation site
SITE 1732 1740 pot. N-glycosylation site
SITE 1756 1764 pot. N-glycosylation site
SITE 1825 1833 pot. N-glycosylation site
SITE 1900 1914 put. proteolytic cleavage site
SITE 1972 1980 pot. N-glycosylation site
SITE 2023 2031 pot. N-glycosylation site
SITE 2293 2301 pot. N-glycosylation site
SITE 2440 2448 pot. N-glycosylation site
SITE 3139 3147 pot. N-glycosylation site
SITE 3301 3309 pot. N-glycosylation site
SITE 3331 3339 pot. N-glycosylation site
SITE 3388 3396 pot. N-glycosylation site
SITE 3436 3444 pot. N-glycosylation site
SITE 3499 3507 pot. N-glycosylation site
SITE 3553 3654 pot. transmembrane domain
SITE 3622 3675 cysteine-rich region
SITE 3655 3660 charge cluster
SITE 3715 3723 pot. N-glycosylation site
FEATURES from to/span description
pept 31 3738 surface protein S precursor
sigp 31 60 surface protein S signal peptide
matp 61 3736 surface protein S
BASE COUNT 1030 a 718 c 800 g 1232 t
ORIGIN
1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc
61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat
121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact
181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct
241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg
301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac
361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt
421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct
481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat
541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag
601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt
661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg
721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct
781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt
841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt
901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt
961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa
1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact
1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg
1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt
1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt
1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt
1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat
1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat
1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt
1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct
1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc
1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct
1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt
1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat
1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc
1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt
1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc
1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat
2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt
2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat
2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct
2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct
2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac
2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag
2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa
2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg
2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct
2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta
2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac
2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag
2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg
2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca
2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc
2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc
3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata
3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca
3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg
3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat
3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg
3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt
3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc
3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag
3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa
3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt
3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa
3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt
3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt
//
LOCUS MUSIGLAZ 713 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds.
ACCESSION M34598 M29013 J03562
KEYWORDS constant region; immunoglobulin; immunoglobulin light chain;
joining exon; lambda-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone
Y31.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 713)
AUTHORS Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A.
TITLE Structure of a third murine immunoglobulin lambda light chain
variable region that is expressed in laboratory mice
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987)
STANDARD simple staff_entry
REFERENCE 2 (bases 2 to 450)
AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and
is conserved in mammalian species
JOURNAL J. Immunol. 144, 2816-2820 (1990)
STANDARD simple staff_review
REFERENCE 3 (bases 266 to 429)
AUTHORS Sanchez,P. and Cazenave,P.-A.
TITLE A new variable region in mouse immunoglobulin lambda light chains
JOURNAL J. Exp. Med. 166, 265-270 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 12 > 713 Ig lambda chain precursor V-x,J-2,C-2 region
sigp 12 68 Ig lambda chain signal peptide
matp 69 > 713 Ig lambda chain
recomb 379 380 V-region end/J2-region start
recomb 414 415 J2-region end/C2-region start
BASE COUNT 175 a 190 c 159 g 189 t
ORIGIN
1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt
61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag
121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc
181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca
241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta
301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa
361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc
421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca
481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga
541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca
601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca
661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct
//
LOCUS MUSIGLVD 681 bp ds-DNA ROD 11-JUL-1990
DEFINITION Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds.
ACCESSION M34597
KEYWORDS constant region; germline; immunoglobulin light chain;
joining exon; lambda-immunoglobulin; variable region.
SOURCE Mouse (strain Balb/c AnPt) liver DNA, clone 30X2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 681)
AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and
is conserved in mammalian species
JOURNAL J. Immunol. 144, 2816-2820 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 59 104 Ig lambda-chain precursor Vx-J2-C2 region,
exon 1
209 531 Ig lambda-chain precursor Vx-J2-C2 region,
exon 2
sigp 59 104 Ig lambda-chain signal peptide
209 219 Ig lambda-chain signal peptide
matp 220 528 Ig lambda-chain
IVS 105 208 Ig lambda-chain Vx-J2-C2 region intron A
recomb 530 531 Vx-region end/J2-region start
recomb 565 566 J2-region end/C2-region start
BASE COUNT 179 a 156 c 136 g 210 t
ORIGIN Chromosome 16.
1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat
61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt
121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc
181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca
241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag
301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg
361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct
421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa
481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa
541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta
601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa
661 tcctcttgga cttcgtaggg c
//
LOCUS PAERRE 1877 bp ss-rRNA RNA 11-JUL-1990
DEFINITION P.kadiakensis 18S rRNA, 3' end.
ACCESSION M34359
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.kadiakensis rRNA.
ORGANISM Palaemonetes kadiakensis
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea;
Palaemonoidea; Palaemonidae.
REFERENCE 1 (bases 1 to 1877)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-113 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1877 18S ribosoma RNA
BASE COUNT 339 a 321 c 399 g 313 t 505 others
ORIGIN
1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt
61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt
121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac
181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg
241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca
301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct
361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna
421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga
481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac
541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn
601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac
841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna
901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta
961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta
1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa
1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt
1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn
1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc
1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt
1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn
1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn
1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg
1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna
1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg
1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca
1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact
1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag
1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn
1861 nnnnnnnnnn nnnnnnn
//
LOCUS PBESVBRA 584 bp ds-DNA INV 11-JUL-1990
DEFINITION P.berghei telomeric repeat region subfragment alpha DNA.
ACCESSION M34601
KEYWORDS .
SOURCE P.berghei DNA, clone pTel.1.
ORGANISM Plasmodium berghei
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 584)
AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
TITLE Organization of subtelomeric repeats in Plasmodium berghei
JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt 461 541 27 bp repeats
BASE COUNT 205 a 41 c 85 g 251 t 2 others
ORIGIN
1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac
61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc
121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata
181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat
241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat
301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa
361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt
421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg
481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg
541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga
//
LOCUS PBESVBRB 593 bp ds-DNA INV 11-JUL-1990
DEFINITION P.berghei telomeric repeat region subfragment a DNA.
ACCESSION M34602
KEYWORDS .
SOURCE P.berghei DNA, clone pTel.1.
ORGANISM Plasmodium berghei
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 593)
AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
TITLE Organization of subtelomeric repeats in Plasmodium berghei
JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt 484 510 27 bp repeat motif
BASE COUNT 209 a 40 c 97 g 247 t
ORIGIN
1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca
61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac
121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa
181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt
241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat
301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat
361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat
421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg
481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt
541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg
//
LOCUS PEURRE 1902 bp ss-rRNA RNA 11-JUL-1990
DEFINITION P.aztecus 18S rRNA, 3' end.
ACCESSION M34362
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.aztecus rRNA.
ORGANISM Penaeus aztecus
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea;
Penaeidae.
REFERENCE 1 (bases 1 to 1902)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1902 18S ribosoma RNA
BASE COUNT 340 a 369 c 427 g 332 t 434 others
ORIGIN
1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt
61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt
121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg
181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc
241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg
301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc
361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn
421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag
481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga
541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga
601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg
661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc
721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn
781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn
841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg
901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg
961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna
1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn
1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg
1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa
1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn
1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg
1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac
1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta
1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct
1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg
1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact
1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg
1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt
1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn
//
LOCUS POCRRE 1874 bp ss-rRNA RNA 11-JUL-1990
DEFINITION P.ascensionis 18S rRNA, 3' end.
ACCESSION M34358
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.ascensionis rRNA.
ORGANISM Procaris ascensionis
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea;
Procaridoidea; Procarididae.
REFERENCE 1 (bases 1 to 1874)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1874 18S ribosoma RNA
BASE COUNT 298 a 291 c 331 g 266 t 688 others
ORIGIN
1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct
61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt
121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac
181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt
241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac
301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn
361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc
481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa
541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn
601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt
781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg
841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat
901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat
961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga
1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac
1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt
1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn
1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca
1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt
1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta
1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct
1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag
1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca
1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn
1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg
1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg
1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1861 nnnnnnnnnn nnnn
//
LOCUS PRARRE 1869 bp ss-rRNA RNA 11-JUL-1990
DEFINITION P.leonensis 18S rRNA, 3' end.
ACCESSION M34363
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.leonensis rRNA.
ORGANISM Procambarus leonensis
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea;
Astacoidea; Cambaridae.
REFERENCE 1 (bases 1 to 1869)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1869 18S ribosoma RNA
BASE COUNT 409 a 412 c 459 g 419 t 170 others
ORIGIN
1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt
61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt
121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat
181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt
241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc
301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca
361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg
421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt
481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact
541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt
601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag
661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc
721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn
781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa
841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg
901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac
961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg
1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg
1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg
1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn
1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa
1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta
1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact
1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg
1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg
1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac
1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa
1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn
1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt
1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa
1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn
1861 nnnnnnnnn
//
LOCUS PVIC1RPTA 711 bp ds-DNA INV 11-JUL-1990
DEFINITION P.vivax circumsporozoite protein gene, partial cds.
ACCESSION M28745 M25758
KEYWORDS circumsporozoite protein.
SOURCE P.vivax sporozoite (isolate VK247) sporozoite DNA.
ORGANISM Plasmodium vivax
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 711)
AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
Waters,A.P. and Prasittisuk,C.
TITLE Circumsporozoite protein heterogeneity in the human malaria
parasite Plasmodium vivax
JOURNAL Science 245, 973-976 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Lanar, 25-JUN-1989.
FEATURES from to/span description
pept < 1 > 711 circumsporozoite protein (AA at 1)
site 1 15 region 1
site 697 711 region 2
site 502 597 post repeat variable region
site 598 696 post repeat constant region
BASE COUNT 261 a 151 c 226 g 73 t
ORIGIN
1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat
61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca
121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca
181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga
241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc
301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa
361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga
421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat
481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca
541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga
601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta
661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c
//
LOCUS PVIC1RPTB 657 bp ds-DNA INV 11-JUL-1990
DEFINITION P.vivax circumsporozoite protein gene, partial cds.
ACCESSION M28746 M25759
KEYWORDS circumsporozoite protein.
SOURCE P.vivax sporozoite (isolate VK210) DNA.
ORGANISM Plasmodium vivax
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 657)
AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
Waters,A.P. and Prasittisuk,C.
TITLE Circumsporozoite protein heterogeneity in the human malaria
parasite Plasmodium vivax
JOURNAL Science 245, 973-976 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Lanar, 25-JUN-1989.
FEATURES from to/span description
pept < 1 > 657 circumsporozoite protein (AA at 1)
site 1 15 region 1
site 643 657 region 2
rpt 16 477 27 bp repeats
site 478 544 3' post repeat variable region
site 545 642 3' post repeat constant region
BASE COUNT 229 a 135 c 224 g 69 t
ORIGIN
1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat
61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa
121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca
181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat
241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca
301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga
361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca
421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga
481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca
541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa
601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc
//
LOCUS SHV2A 554 bp ss-RNA VRL 11-JUL-1990
DEFINITION Simian hepatitis A virus segment 2A-encoded protein mRNA,
partial cds.
ACCESSION M34085
KEYWORDS .
SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA.
ORGANISM Simian hepatitis A virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Picornaviridae.
REFERENCE 1 (bases 1 to 554)
AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M.
TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and
genetic comparison with human HAV
JOURNAL Unpublished (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by E.A.Brown, 04-MAY-1990.
Author address: E.A.Brown
547 Burnett-Womack
CB# 7030
Chapel Hill, NC 27599
FEATURES from to/span description
pept < 1 > 554 segment 2A-encoded protein (AA at 1)
BASE COUNT 184 a 71 c 138 g 161 t
ORIGIN Segment 2A; map position 3108-3662.
1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg
61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag
121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa
181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg
241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg
301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg
361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat
421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt
481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg
541 tcaggaatac agga
//
LOCUS SHVVP1CP 2373 bp ss-RNA VRL 11-JUL-1990
DEFINITION Simian hepatitis A virus capsid protein VP1 mRNA, partial cds.
ACCESSION M34084
KEYWORDS capsid protein VP1.
SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed
in cwll line BS-C-1.
ORGANISM Simian hepatitis A virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Picornaviridae.
REFERENCE 1 (bases 1 to 2373)
AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M.
TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and
genetic comparison with human HAV
JOURNAL J. Virol. 63, 4932-4937 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by E.A.Brown, 04-MAY-1990.
FEATURES from to/span description
pept < 1 > 2373 capsid protein VP1 (AA at 1)
BASE COUNT 661 a 447 c 491 g 774 t
ORIGIN Segment P1; map position 735-3107.
1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg
61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact
121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca
181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga
241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa
301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg
361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca
421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct
481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc
541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat
601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct
661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca
721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt
781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat
841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca
901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag
961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa
1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg
1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt
1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc
1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg
1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag
1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat
1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct
1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct
1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag
1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta
1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca
1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg
1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt
1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg
1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg
1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca
1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg
2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat
2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc
2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg
2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg
2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat
2341 gatcctcgaa ctgaagagga tcgtaaattt gaa
//
LOCUS STNRRE 1885 bp ss-rRNA RNA 11-JUL-1990
DEFINITION S.hispidus 18S rRNA, 3' end.
ACCESSION M34361
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE S.hispidus rRNA.
ORGANISM Stenopus hispidus
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea;
Stenopodidae.
REFERENCE 1 (bases 1 to 1885)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1885 18S ribosoma RNA
BASE COUNT 327 a 336 c 430 g 368 t 424 others
ORIGIN
1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt
61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact
121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn
181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta
241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag
301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt
361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga
421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn
481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag
541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag
601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn
661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc
721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg
841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg
901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg
961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt
1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt
1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg
1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat
1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg
1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt
1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga
1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng
1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt
1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc
1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg
1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg
1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn
1861 nnnnnnnnnn nnnnnnnnnn nnnnn
//
LOCUS MSQMUD76A 124 bp ds-DNA BAD 11-JUL-1990
DEFINITION A.dirus DNA probe pMU-D76.
ACCESSION M34656
KEYWORDS .
SOURCE A.dirus (Strain D) wild-caught female DNA, clone pMU-D76.
ORGANISM Anopheles dirus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 124)
AUTHORS Panyim,O., Yasothornsrikul,S., Tungpradubkul,S., Baimai,V.,
Rosenberg,R., Andre,R.G. and Green,C.A.
TITLE Identification of isomorphic malaria vectors using a DNA probe
JOURNAL Am. J. Trop. Med. Hyg. 38, 47-49 (1988)
STANDARD simple staff_review
BASE COUNT 32 a 33 c 34 g 25 t
ORIGIN
1 gatctgcact cggcgtgaat ttggttacca tcgaatgtgc ggaaaaagtt ttaccccgtg
61 cgcagtgcgg aacacgccag acttgttaca cacggaaacg gaccacgaac gtgttacgcg
121 cacg
//
LOCUS ACCCITSYN 1895 bp ds-DNA BCT 11-JUL-1990
DEFINITION A.anitratum citrate synthase gene, complete cds.
ACCESSION M33037
KEYWORDS citrate synthase.
SOURCE A.anitratum DNA, clone pLJD1.
ORGANISM Acinetobacter anitratum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1895)
AUTHORS Donald,L.J. and Duckworth,H.W.
TITLE Expression and base sequence of the citrate synthase gene of
Acinetobacter anitratum
JOURNAL Biochem. Cell Biol. 65, 930-938 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 264 1538 citrate synthase precursor
matp 267 1535 citrate synthase
signal 120 125 -35 signal
signal 141 146 -10 signal
signal 1560 1585 pot. transcription termination signal
BASE COUNT 534 a 406 c 357 g 598 t
ORIGIN
1 gtactcaacg cttaattttt ttctgcacgt tcttcttgaa ttgacttatg ataccatccc
61 gatgcagtga ttttactgac tttttttgct cgggtcttga tgactaactc tctgtgggaa
121 cgtcattttt tatccataag tataattgac aaaatttcag tactcactaa tcttatagca
181 aattttgaca ccgtctgatt cgcacatgag aaaattagga tttcgagtca gataatcatt
241 caccaggaca ggagatctat tgaatgtctg aagcaactgg caaaaaagcc gtattacatc
301 ttgatggcaa agaaattgaa ttaccaattt acagtggcac attaggtccc gatgtaatcg
361 acgttaaaga tgtattggcc tcaggtcact ttacttttga tcctggtttt atggcgacag
421 cttcatgcga gtctaaaatc acatttatcg atggtgacaa aggtatttta ttacaccgcg
481 gttacccgat tgaccagtta gcgactcaag cagactacct tgaaacttgt tatttattat
541 taaatggcga gttaccaact gctgaacaaa aagttgagtt cgatgcgaaa gttcgtgctc
601 atactatggt tcatgatcaa gttagccgtt tcttcaatgg tttccgtcgt gatgctcacc
661 ctatggcaat catggttggt gtagtaggcg cattatctgc tttctatcac aacaaccttg
721 acattgaaga catcaaccac cgcgaaatta ctgcgattcg tttgattgct aaaattccaa
781 cgcttgctgc ttggagctac aaatatactg taggtcagcc attcatctat ccacgtaatg
841 acttaaatta cgcggaaaac ttcttacaca tgatgtttgc aactcctgca gaccgtgact
901 acaaagtaaa ccctgttctt gctcgtgcaa tggatcgtat ctttacgctt cacgctgacc
961 acgaacaaaa cgcgtctact tctacagttc gtcttgctgg ttctactggt gcgaatccat
1021 atgcgtgtat ctctgctggt atctctgctc tttggggtcc tgcacacggt ggtgcgaacg
1081 aagcagttct taaaatgctt gatgaaatcg gtagcgttga aaatgttgct gagttcatgg
1141 aaaaagttaa acgcaaagaa gttaaactta tgggcttcgg tcaccgcgtt tacaaaaact
1201 tcgatccacg cgctaaagtg atgaagcaaa cttgtgacga agttcttgaa gcattaggta
1261 tcaatgatcc tcaattagcg cttgctatgg aacttgaacg tattgcattg aacgacccgt
1321 actttgttga acgtaaactt taccctaacg tagacttcta ctctggtatc atccttaaag
1381 cgattggtat cccaacagaa atgtttaccg ttatcttcgc tcttgcacgt acagttggct
1441 ggatcagtca ctggttagaa atgcacagcg gtccttacaa aattggtcgt cctcgtcagc
1501 tttacactgg tgaagtgcaa cgtgacatca agcgttaata ttcgaaagaa tattaatgta
1561 aaaagctgcc taatggcagt tttttttata aataagtttt aaaagttatt cttcttcaaa
1621 catatttaat aagtgatgac taataccatc agctcttagc caagccaact cataacttgc
1681 ttcggccaaa gctaaaatac gtctttcaaa ctcagtccat acttgtttaa cttgcgcttc
1741 tgaatcccta aaccactgtc atagctaaat gcttattctt ttcacatatt tttaaggcat
1801 ggtagagttt agccctttac tcgccccttc attaacctga cacgtttacc taatataaat
1861 ccttctacat gctgtagact gggaacatag gtacc
//
LOCUS ECOGUAC 1991 bp ds-DNA BCT 11-JUL-1990
DEFINITION E.coli GMP reductase (guaC) gene, complete cds.
ACCESSION M33020
KEYWORDS GMP reductase.
SOURCE E.coli (strain K12) DNA, clone pDS89.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1991)
AUTHORS Andrews,S.C. and Guest,J.R.
TITLE Nucleotide sequence of the gene encoding the GMP reductase of
Escherichia coli K12
JOURNAL Biochem. J. 255, 35-43 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 210 1250 GMP reductase (guaC) (E.C. 1.6.6.8)
mRNA 25 < 1250 guaC mRNA (put.)
mRNA 90 < 1250 guaC mRNA (put.)
mRNA 126 < 1250 guaC mRNA (put.)
BASE COUNT 493 a 494 c 508 g 496 t
ORIGIN
1 gaattcatca tgattatcaa aacgttaaaa atgagtgcac gaaagcgaaa ttgatgaaac
61 gttcgctcac tatttaccag gtaaatttat gggattgtag cgtaaaaaaa gacaatttcg
121 cagtcttgcg ccgcattgat tagtgcgtat gatagcgtca ctggagttgc gctcttaccc
181 ttatagccat taaccccagg aatccgcaca tgcgtattga agaagatctg aagttaggtt
241 ttaaagacgt tctcatccgc cctaaacgct ccactcttaa aagccgttcc gatgttgaac
301 tggaacgtca attcaccttc aaacattcag gtcagagctg gtccggcgtg ccgattatcg
361 ccgcaaatat ggacaccgta ggcacatttt ctatggcctc tgcgctggct tcttttgata
421 ttttgactgc tgtgcataaa cactattctg tcgaagagtg gcaagcgttt atcaacaatt
481 cttccgctga tgtgctgaaa catgtgatgg tttctaccgg tacgtctgat gcggatttcg
541 aaaaaactaa acagattctc gacctgaacc cggcattaaa cttcgtttgt attgacgtgg
601 cgaatggtta ttccgaacac ttcgtgcagt tcgttgcgaa agcgcgtgaa gcgtggccga
661 ccaaaaccat ttgtgctggt aacgtagtga ctggtgaaat gtgtgaggag cttatcctct
721 caggtgccga tatcgttaaa gttggcattg gcccaggttc tgtttgtaca actcgcgtca
781 aaacaggcgt cggttatccg caactttctg cggtaatcga atgtgccgat gctgcgcacg
841 gtctgggcgg aatgatcgtc agcgatggtg gctgcaccac gccgggcgat gtggcgaaag
901 cctttgcgcg tgccgatttc gtcatgcttg gcggcatgct ggcgggccac gaagagagcg
961 gcggtcgcat cgttgaggag aacggcgaga aatttatgct gttctacggc atgagctccg
1021 agtctgcgat gaaacgtcac gttggcggcg ttgcggaata tcgcgcagca gaaggtaaaa
1081 ccgttaagct gccgctgcga ggcccggttg aaaataccgc gcgagatatt ttgggcggcc
1141 tgcgttcagc ttgtacatac gttggggctt cacgcctgaa agagctgacc aagcgcacca
1201 cgtttattcg tgtgcaggaa caagaaaacc gcatcttcaa caacctgtaa tctcccaacg
1261 ctggcgtgga gcaacacgcc acggttatcc catcccactc atcgcatcgc ctaaatggaa
1321 aattggcaga tacattgcca ccaccagcgt accaataatt cctcccgtta tgatcagcaa
1381 cgcggttcag taaggctgcg aggttatccg ccagcgccat tgtgttttcc cgatgatgat
1441 gggcgaggtt gtctaacatg agatccagag agccggatgc ctctcctgtt ctcactaatt
1501 gcaaacagag cgggctaaac tcaccggtat tttttagcgc cagccagatg ggttgaccgt
1561 tactgatatc gtgctggatt tgtgtcagaa gttgcaccca gtacgggcag cgcattgttt
1621 ctctgacgct ctctacgccc tgtaaaaaag taatgcctgc actttgtgtc agcgccagaa
1681 tcgtaaagat ctgcgtgagt ttttgtcccc gcatcagtga acccataatc gggatgcgta
1741 acagcaattt ctgccgcact ataagccagg tcggtcggcg catcagcaac ttattggcta
1801 tcgccagcag aaagccgaac acaccagcag ccagctccat tcgccactaa agtctgccag
1861 cgtcatgatc ccctgcgtta gtgccggtag tggggtgttg aaggtcttat agatagcggc
1921 aaactccggc agacacaaaa tgcagcattg ccacaaccac catgattagc catcgctaaa
1981 atgatgatgg g
//
LOCUS HUMDKERB 8815 bp ds-DNA PRI 11-JUL-1990
DEFINITION Human cytokeratin 8 (CK8) gene, complete cds.
ACCESSION M34482
KEYWORDS cytokeratin 8.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 8815)
AUTHORS Krauss,S. and Franke,W.W.
TITLE Organization and sequence of the human gene encoding cytockeratin 8
JOURNAL Gene 86, 241-249 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1113 1436 cytokeratin 8, exon 1
3972 4180 cytokeratin 8, exon 2
4809 4869 cytokeratin 8, exon 3
5344 5439 cytokeratin 8, exon 4
5958 6248 cytokeratin 8, exon 5
7113 7333 cytokeratin 8, exon 6
7492 7550 cytokeratin 8, exon 7
8380 8567 cytokeratin 8, exon 8
IVS 1437 3971 CK8 intron A
IVS 4181 4808 CK8 intron B
IVS 4870 5343 CK8 intron C
IVS 5440 5957 CK8 intron D
IVS 6249 7112 CK8 intron E
IVS 7334 7491 CK8 intron F
IVS 7551 8379 CK8 intron G
signal 1007 1011 TATA box
signal 8779 8784 poly-A signal
BASE COUNT 1868 a 2324 c 2481 g 2142 t
ORIGIN
1 tcaacggatc tcgctctttt ttttctttgg agatggaatc tcgctctgtc gcccaggctg
61 gagtgcagtg gcaagtctca gctcactgca actctgcctc ccgggttcaa gtgattctcc
121 tgcctcagcc tcctgagtag ctgggattac accatggcca gctaattttt gtatttttag
181 tagagatggg gtttcaccat gttggtcagg cttgtcttga actcctgacc tcgtgatccg
241 cctacctcag cctcccaaag tgctgggatt acaggcgtgc acagcgtgcc ctggccttgg
301 atctcttttt atcttgcacc ttcagatgta gagggacgac agccactgtg tgtgtatgtg
361 tatgtgtgtg tgtgtgtgtg tgtgtgcgcg tgtgatgttt attcactcat ttatttattc
421 attcattcat tccacaaata tctacccaga ccctcttggc actgcaccag gtcgtagggg
481 tagaacagta acctggaaag atgaggcaaa tggttgattt cagattcaag gctttggact
541 ccagctgttc tgtcatccag ctcaggcagg ccctcataat cgcttcaatc agggagaaca
601 caggagagtt tctctggggt gtcggcagct cagaggagac ccaaatacta ggagacccct
661 tttcccatgc ttcccagtcc tccagtttat ttcccccagg aaggagggag acaagaccca
721 gagtcagggt tgtagtggct gggcggccca ggcaagtctg cttgttacac gacttgtgcc
781 aggacaggat ttcttccagt ttcatattca ctgaactgcc ttttcctggg tttctggggg
841 tggtgctgga gtgggctcca gggttggaac gggcccttgc gacgcgtctc tgctgccccc
901 acctgagtct gccccgaggt ggcaggtgac gggttcacgc gacgcctctg gcctagccac
961 tcaggtacga ggcctttccc ccactccccg gggctgggat ctcttttata aaaggccatt
1021 cctgagagct ctcctcacca agaagcagct tctccgctcc ttctaggatc tccgcctggt
1081 tcggcccgcc tgcctccact cctgcctcta ccatgtccat cagggtgacc cagaagtcct
1141 acaaggtgtc cacctctggc ccccgggcct tcagcagccg ctcctacacg agtgggcccg
1201 gttcccgcat cagctcctcg agcttctccc gagtgggcag cagcaacttt cgcggtggcc
1261 tgggcggcgg ctatggtggg gccagcggca tgggaggcat caccgcagtt acggtcaacc
1321 agagcctgct gagccccctt gtcctggagg tggaccccaa catccaggcc gtgcgcaccc
1381 aggagaagga gcagatcaag accctcaaca acaagtttgc ctccttcata gacaaggtga
1441 gggtcccctg cgtggctgac tgtgccccgc agcccctttc tcctggtagt cccggtccct
1501 atgcacatct ccagccccca gctggcgtcc tgctgggcct cacccgccct gggcacactc
1561 tcccttccat cctccgacct cacccctccc gtgcaccttg gtttgggctg ggtgagggtg
1621 gggagagggt ctggacagcc gggatgaatc ctggggcttc cttcttccct tttaaactgg
1681 agggtcttgg aagagagaga caacttaagg gtacagccta gttcccacca cccctctcta
1741 caaatcccgt tcttcctcag gtcattctgt cccaaattat aaaaaataat agcggttatt
1801 gttctcaccc caacccagtt ctgaccgtct tttaacgtat gcctgcggca gtcccagctg
1861 ttcgggacta ccctcctcca ggttcgcctc ttcgccagca ctacccaagg ctccccagtg
1921 gtgcctttgt gatttttttt ctttcttttt tttacatagg ggtttggtgt gattctagca
1981 ttctaggaga aggaagtggg tgtctcggtt caaacgggca aatattgatt gaggcctttg
2041 gccgccggag gcctgagtgc gggggtcaca gaatgagtca tacggcccct ggcccggcag
2101 cgtgggcggg gccgagggcg gggtgagggc tgcgggcagc agtctgcggg acgctctcct
2161 ccactggcgg agctcggcgt cgggggcggt gtgggtgggg tggggtgggg tggggtgggc
2221 tggggtgggg tggaggaggc gagggcctgg cctcggaaag cccatgcagg attcaaagtc
2281 tcctgggacg ccgcccgggg tttacgtcct gttaagttta tggcttcaga taacgcggtc
2341 gcccaccaac gcccctcgcc cattcagccc gtgtcccttt ctcggcgtcc tgtccctgct
2401 gcccccagcc tcggctccac tttccacaca gcaggagcca gggccgggtt ttgcagcctg
2461 ggactccgct gcctgagccc cggcccccgg cggccccgag gattgggccc ttcacgctga
2521 ctggctcctg ggaggcattg tgggaacggg aggagggaaa tcctggggca gagtaagccg
2581 ggaggaaccg gagccccagg aacccagtgg tcgggggccc tcgctgtcca agcgcctgga
2641 cttgacttgt tgactgcgtt ttgctagccc tggggtcctt atagagagca gctaagcata
2701 ggctttggaa tctgaattct tggtctgcac tcgtctgccg gttcctggtt atggactccc
2761 ttgccaagtc ttatttcctc atctataaaa tgaatatgag agcccctaaa tccatatagc
2821 aaaagttttt gccttattca aacttacata tgtaaagagt tcagcagtgc ttggcccaca
2881 ttccattagg ataagatgtt ataatcactt ttttttaaaa aataattttg gggcagaatg
2941 actggggaag aaagcgattt gcagagagtg gtggagggaa ctaggctgta cccttaaaag
3001 atttctgtcc cctccagttt agaaggagtt acaagttttt ttgtttgttt gagacagagt
3061 tactctgtgc ccaggctgga gtgcagtggt gtgatctcag ctcactgcaa cgctccgctt
3121 cctgggttca agcgattctc ctgcctcagc caccgagtag ctgggactac aagtgcgtgc
3181 acagcccggt taattttgta attattgtag gcaaggttca atatgttggc aggctggtct
3241 cgaactctga cttcagaaat ccgcctgcct tgaccaccca aagtgctgga attacagcgt
3301 gagcctccac gcccggcctc tttttcaatc ttaacatctt tagaaaggtt ggctattttt
3361 ggccgggcgc gggcttacgc ctataatccc agcactttgg gaggccaagg cgggccaatc
3421 acaaggtcag gagttcgaga ccatcctgcc taagacggtg aaaccctgtc tctactaaaa
3481 atacaaaaaa attagtgggg cgtggtggca cgcacggctg cctgtagccc cagccactcg
3541 ggaggctgag gcaggggcag gagaatggca tgaacttggg aggcggagct tgcagtgagc
3601 tgagatcttg cactgcactc tagcctgggc cggagactcc caaagaaagc ttggctattt
3661 ttattgatgt gtaatataca acctatgtaa atgaagttag gcctattggt ttgcaaatgc
3721 agctttaaca taattacctt acctgtctcc ttcccctacc caatgctgag ggacattgct
3781 ccccacctca ccatcatgcc atgctttctc cccctggtca taggtgatct ttccagaaca
3841 gctaaccagg tgcctggggt ctggagactt actgcttgag gagtgaatta agagaaaaga
3901 ctgcttgctt tcctccagac tttgagccct ggcctgatgt agaccttttt gctctctcct
3961 ccttcgtata ggtacggttc ctggagcagc agaacaagat gctggagacc aagtggagcc
4021 tcctgcagca gcagaagacg gctcgaagca acatggacaa catgttcgag agctacatca
4081 acaaccttag gcggcagctg gagactctgg gccaggagaa gctgaagctg gaggcggagc
4141 ttggcaacat gcaggggctg gtggaggact tcaagaacaa gtgagcaact ccaccctcca
4201 cccaactgaa gtcacctgct ctcctccacc ccttgacctt gggactaagt ccatggccct
4261 ctgttgtggg aagtgcagtc ctatctaatt agggtgacca cctgatgagg tttctcggac
4321 agtctgtgtt tatgccaggt tctagcacat tgttgatagt acccacccct ttcaatctaa
4381 ctgtctggat ttgaagaaca aattatgtgt caatgttgac atggtaaacc tgagacggga
4441 gagataggca gcctgtgggc ctcacttttg tacttaacat tctggcccct ctttagtctt
4501 gacccttgac ctctagcaaa ctctagaaag ttctgtctga ggtctcatgt caggccctgc
4561 tgttaacact ctcaaggtgt ccaatccgat gtgtattcat ggatttggag agagatttcc
4621 tgcttcccac gggctaaggg aggggtgagg gtggagaggg cagctgggga aggcagaagg
4681 accagccttc tcatatcctc atctctgtga actgaatttc ctgatttcac aacgcccctg
4741 tctcccaaaa gaccaagggc aacctccctt ttgccttcat cctctaattg taagtctttt
4801 cctcacaggt atgaggatga gatcaataag cgtacagaga tggagaacga atttgtcctc
4861 atcaagaagg tgagggagtc tcccttctcc tatctggaca ctggaggctg gggctcagag
4921 actcagacca agaagctttc tgggttttgt ccctaaatat tcctaagtag tgggacaaac
4981 tcatttatgt aaacatttgg gtgcacagaa aggtagacaa ggatggagtg gtaggtgcat
5041 ttggacagaa ctcttgacat cggtgttggg acatggttca gaaaacagag cagtagaact
5101 ggagatctgg ctctagaagg ctccctagag aaggaggtgg aagagggtgt gttgcaggaa
5161 gcagaggtga aggtgtgtgg gctgagaatg cacatgtgat gggcagaggc tgggctggaa
5221 gatcaatcca caaagtggca actagaaagt cctgtgacca ggccattggg tggaccttgg
5281 gagccccttg gttggggttg ggtgtggaaa cccagctcag gctcccctct cctcatcccc
5341 caggatgtgg atgaagctta catgaacaag gtagagctgg agtctcgcct ggaagggctg
5401 accgacgaga tcaacttcct caggcagcta tatgaagagg tatgttcctg gtcgcaggag
5461 agtgagggtc cccagccttg tcagcgcctc caccctgaga ctcaaccaga ggctcctccc
5521 agcccccagc acactaataa gacaaaggac cccactgctg actaattaca gccaccaata
5581 tttgctcggc tagtatttat tgggtctata tgttctgtcc ctcgcatgag gtgagtcatt
5641 accccatttc acagacgaga aagtgggctc agagaagtga aataacgtat ccaaggtcat
5701 catagggtgt ggtgattcag cagcaactct gtccccaaag cccttgttcc taatctttga
5761 gctgcattgg atccctctgt gcacctagta ttggtgaccc agttcctttt tcaggaactt
5821 tgcccctctc cctgaccctg actcccacct gctcctctcc tctgctgccc ctgtcttata
5881 cctaagaaag gctgttgtgg aaaagggggc tcctgtgtgc agagacaggg cctcaccact
5941 tgccctcttc cccacaggag atccgggagc tgcagtccca gatctcggac acatctgtgg
6001 tgctgtccat ggacaacagc cgctccctgg acatggacag catcattgct gaggtcaagg
6061 cacagtacga ggatattgcc aaccgcagcc gggctgaggc tgagagcatg taccagatca
6121 agtatgagga gctgcagagc ctggctggga agcacgggga tgacctgcgg cgcacaaaga
6181 ctgagatctc tgagatgaac cggaacatca gccggctcca ggctgagatt gagggcctca
6241 aaggccaggt atgggccggg ttgggggtgg gagggttcct tggacacaat cctggtgaga
6301 ggagataatg taggaagagt gaagtttctg ggagtcgggg aaggaatcct agaccagggt
6361 tcaggagttg gaggggcagc cacagttcag cttctcagtc tgcttctgag aagcaaaggg
6421 atgcagggaa ggtcccttgg gccaggacag aggtgaaagg ggactggggc aggtatgttg
6481 gggactcgtg atacatgctc caagcctgct ttaatcagtc atatgcatca ggggtaaggt
6541 tgagctctgc tgctttaagg aaagtctaga acccagggat ctagtccagt tagggtaggg
6601 ggaccttaca gtgtcgcagg tcgagaaggg tgtggagggg aagcacctgg aaactgctca
6661 tgtctccctg atctgcttcc ttagtctcgt ttatttattt atttattttt gagacagagt
6721 cttgctctgt cgcccaggct ggagtgcagt ggcgtgatct cggctcactg caagctccgc
6781 ctcctgggtt cacactattc tcctgactca gcctcctgag tagctgggac tacaggcgcc
6841 cgcaccaggc tggctaattt tttttgtatt tttgctagag acggggtttc actgtgttag
6901 ccaggactcg tcgatctcct gaccttgtga tctgcccgcc tcgcctccca aagtgctggg
6961 attacaggca tgagcactgt gcccggccct tagtctcatt aattgagctg gggagtcagc
7021 ctagtgtgtg gaggacctga gggagggtgg acgcacggag gaagagaagg catacccaac
7081 ctgacctact tacctgtccc ctacccacag agagggcttc cctggaggcc gccattgcag
7141 atgccgagca gcgtggagag ctggccatta aggatgccaa cgccaagttg tccgagctgg
7201 aggccgccct gcagcgggcc aagcaggaca tggcgcggca gctgcgtgag taccaggagc
7261 tgatgaacgt caagctggcc ctggacatcg agatcgccac ctacaggaag ctgctggagg
7321 gcgaggagag ccggtgggtg tgggtacctc tgaccggacc tgcttcccta tccctgggac
7381 ctggggtggg gacggtggga gccccctgaa gccccttgga cttggggtcc tgttgttctg
7441 ggccaagaag ggctaggagt tggtcctgac accccatttg acagggtaca ggctggagtc
7501 tgggatgcag aacatgagta ttcatacgaa gaccaccggc ggctatgcag gtggtgtccc
7561 agggccctgg atgagggcgg gaggcagggc cagggaggct cagctccagg gagggggctg
7621 tgctcagtcg ctcacagtga cctcagcctg agcactcatg ttcttgggag aatcctaggg
7681 tggggaggca catattcagg gaactccagt aataacttta ttacttagta acttcatatt
7741 agaagataca ccaataacca tagctgtgtg ccaggcactt gcgtaagtat cctacaggtt
7801 ttatgtgatt tattttattt attaatttaa tttaattttt ttgagacgaa gtctcgctgt
7861 caccaagctg agtgcagtgc tgatctcagc tcactgtaac ctcacctcct gggttcaaga
7921 gattctcctc cgtcaggcct cccaagtagc tgggactaca ggcgcatacc accatgccca
7981 tgctaatttt tgtattttta gtagagacgg ggtttcactg tgttgggcag gctggtctcg
8041 aactcctgac cttgtgatca gtgctgggat tacaggcatg agacactggg cctggctgta
8101 atttattttt tatatgacac ctgtaaacgt cttcagttga ggaaggctga ggtgcagcta
8161 aatgtccaag ctgacacagg ctatatatat ggcagctgtt ttccaccctg ctcctggttt
8221 tccctgacag ttctggagta gtgaaccatg caatcactga tcaggagagc tgggttaacc
8281 tccatccctg gggctatgtt gggaatgagc agggagaagg gcatggagcc tgccatggtg
8341 ggcttctgta ctcatgtggc tacctctgtc cctcaccagg tggtctgagc tcggcctatg
8401 ggggctcaca agccggcctc agctacagcc tgggctccag ctttggctct ggcgcgggct
8461 ccagctcctt cagccgcacc agctcctcca gggccgtggt tgtgaagaag atcgagacac
8521 gtgatgggaa gctggtgtct gagtcctctg acgtcctgcc caagtgaaca gctgcggcag
8581 cccctcccag cctacccctc ctgcgctgcc ccagagcctg ggaaggaggc cgctatgcag
8641 ggtagcactg ggaacaggag acccacctga ggctcagccc tagccctcag cccacctggg
8701 gagtttacta cctggggacc ccccttgccc atgcctccag ctacaaaaca attcaattgc
8761 tttttttttt tggtccaaaa taaaacctca gctagctcgc cgaatgtcct tgctt
//
LOCUS HUMSRU30S 179 bp ss-RNA RNA 11-JUL-1990
DEFINITION Human 30S small nuclear ribonucleotide protein pre-mRNA complex,
exons 1 and 2 (partial).
ACCESSION M34493
KEYWORDS small nuclear ribonucleoprotein.
SOURCE Human Hela cell pre-mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 179)
AUTHORS Pruzan,R., Furneaux,H., Lassota,P., Hong,G.Y. and Hurwitz,J.
TITLE Assemblage of the prespliceosome complex with separated fractions
isolated from Hela cells
JOURNAL J. Biol. Chem. 265, 2804-2813 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 55 small nuclear ribonucleotide protein (snRNP),
exon 1 (AA at 2)
142 > 179 small nuclear ribonucleotide protein, exon 2
pre-msg < 1 > 179 snRNP pre-mRNA complex
IVS 56 141 30S small nuclear RNA intron A
BASE COUNT 30 a 48 c 46 g 55 t
ORIGIN
1 aatacacgga attcactctc ttccgcatcg ctgtctgcga gggccagctg ttggggtgag
61 tgtgacctgc acgtctaggg cgcagtagtc cagggtttcc ttgatgatgt catacttatc
121 ctgtcccttt tttttccaca gctcgcggtt gaggacaaac tcttcgcggt ctttccagt
//
LOCUS K5TPA1PRO 307 bp ds-DNA PHG 11-JUL-1990
DEFINITION Bacteriophage BK5-T promotor DNA.
ACCESSION M34486
KEYWORDS .
SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1266.
ORGANISM Bacteriophage BK5-T
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 307)
AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
TITLE Molecular characterization of promoters of the Lactococcus lactis
subsp. cremoris temperate bacteriophage BK5-T and identification of
a phage gene implicated in the regulation of promoter activity
JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 304 > 307 bacteriophage BK5-2 mRNA
BASE COUNT 80 a 56 c 43 g 128 t
ORIGIN
1 gatcaaggtg tgtaggtgta atctctagct taggaacgct tttgatacag aacgtgtgat
61 tgtccgtttt taactttctt gttttgtcat cttcataaac tcacaaagtt tatttttgga
121 acaaattttt cttttttatc gtatgacgta acttttttca tttggtccat cataagcttt
181 tttaatattg tcagcttttg ctttttcgac gttctctacc gacgctttca aaatctttaa
241 tgaaaaaaac cgtaaccatc gaatttttct tccatatttt caaagaatcc gttactatct
301 aacgatc
//
LOCUS K5TPA3PRO 182 bp ds-DNA PHG 11-JUL-1990
DEFINITION Bacteriophage BK5-T promotor Pa3 DNA.
ACCESSION M34488
KEYWORDS .
SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1268.
ORGANISM Bacteriophage BK5-T
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 182)
AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
TITLE Molecular characterization of promoters of the Lactococcus lactis
subsp. cremoris temperate bacteriophage BK5-T and identification of
a phage gene implicated in the regulation of promoter activity
JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 169 > 182 promotor region mRNA
BASE COUNT 57 a 21 c 24 g 80 t
ORIGIN
1 ttttcagaat atgaagttaa aagttctcta atatttttat ccgttaaaga gtatcctata
61 aataaaattg gggattctgt taagtttgac aatatttttc gcatttacta atgctaattt
121 agattcatta tttttataat cctcactagt tatacatata gtatttgggt ttttgactga
181 tc
//
LOCUS K5TPF1PRO 177 bp ds-DNA PHG 11-JUL-1990
DEFINITION Bacteriophage BK5-T promotor Pf1 DNA.
ACCESSION M34490
KEYWORDS .
SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1262.
ORGANISM Bacteriophage BK5-T
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 177)
AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
TITLE Molecular characterization of promoters of the Lactococcus lactis
subsp. cremoris temperate bacteriophage BK5-T and identification of
a phage gene implicated in the regulation of promoter activity
JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 149 > 177 ORF
mRNA 110 > 177 ORF mRNA
BASE COUNT 63 a 29 c 37 g 48 t
ORIGIN
1 cctttattct tcgtgcaagg aggcgcaaga tggtcaaaac ttacaaaccg attgatttta
61 acagaaaatg taagattgga gttactaaaa cagtaactta ctccaactgg aggtaagatt
121 gaaaaaattg acccaggaac ggttttaaat gttcgatttc gcggctaaaa tgagatc
//
LOCUS K5TPF2PRO 1209 bp ds-DNA PHG 11-JUL-1990
DEFINITION Bacteriophage BK5-T promotor Pf2 and an ORF, partial cds.
ACCESSION M34487
KEYWORDS .
SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1261.
ORGANISM Bacteriophage BK5-T
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 1209)
AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
TITLE Molecular characterization of promoters of the Lactococcus lactis
subsp. cremoris temperate bacteriophage BK5-T and identification of
a phage gene implicated in the regulation of promoter activity
JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 249 > 1209 bacteriophage BK5-2 mRNA
BASE COUNT 377 a 213 c 235 g 384 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattctgaa tatggttcgt aaccctatgg catttctcaa tactctttca tctaaaactg
61 aaactagcgg aagtgatagt gctgctggac ttactattcc gcaagatatc cgtactatga
121 ttaacacatt ggttcgccaa tatgactcac tacaacaata tgtacgtgtt gagagtgttt
181 ctacttcaaa cggtagtcgt gtatatgaaa aatggactga tgtaactccg ttgactgtaa
241 tggatgcaga agatggaaaa attcctgatc ttgataatcc acgtttggac aattattaaa
301 tacttgatta aacgttatgc gggaatcatc aatgccaact aatacattgc ttaaagatac
361 agcagaaaat attcttgcat ggttatcaag ctggattgct aagaaagtgg ttgtgactcg
421 taaccaagcg attattgcag caatgggtac agttcctaaa aaaccaacaa tcgctaaatt
481 tgatgatgtt attactatga ttaatacatc tgttgatcct gcgattatcg ccacttcaag
541 tcttttgact aaccagtcag ggttgaataa acttgctttg gttaaaactg ctgaaggtaa
601 atatttgctc gaaccagacc caacaaaacc taattcatat ctaattaaag gtaaaaaagt
661 tattgttgtt gcagatcgct ggcttccaaa tagtggatca acagtttatc cactttacta
721 tggagatatg tcgcaagcta ttacattgtt tgaccgtgaa aacatgtcat tacttccaac
781 aaatattggt gctggtgcat ttgaaactga tactactaaa attcgtgtaa tcgatcgctt
841 cgatgttaaa actgctgact cagaagcttt agttgctggt tcacttactg caattgcaga
901 ccaagtaggt aattttactg caggaaagta ggtaatttat gacagtaact gttgatgact
961 tactagatca gttatcagaa gatgatgatc gcaaaccgca acttcaaatt tatttgatac
1021 agcaaaagca tatgtgaaaa atgcagtgag ttctgataca gttgatgctc catttttcag
1081 tgtagaaaac gtttatccga tttatgatgt agctgttctt agctattcta tggatttgtg
1141 gattaatcgt tctacgacta tgccgcctac tacggctgta gatcacatgg ttggtcagtt
1201 gagaggcct
//
LOCUS K5TPG2PRO 195 bp ds-DNA PHG 11-JUL-1990
DEFINITION Bacteriophage BK5-T promotor Pg2 DNA.
ACCESSION M34489
KEYWORDS .
SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1265.
ORGANISM Bacteriophage BK5-T
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 195)
AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
TITLE Molecular characterization of promoters of the Lactococcus lactis
subsp. cremoris temperate bacteriophage BK5-T and identification of
a phage gene implicated in the regulation of promoter activity
JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 154 > 195 promotor region mRNA
BASE COUNT 71 a 28 c 40 g 56 t
ORIGIN
1 agagatttac gaaaagttga gtgctttagc tgaaattgat agacttttcc attggtctag
61 ccatttacat caagaacgat tacaatttgt tagtaaatat ccaaatgtta tggaaaaata
121 cagacaagca aactaaggag ggtatattga atgaccgaca aactaatatc gctggtcatc
181 aaagtgtgtg actgg
//
LOCUS MUSH2A 1805 bp ds-DNA ROD 11-JUL-1990
DEFINITION Mouse (H-2a haplotype) DNA fragment.
ACCESSION D90007
KEYWORDS .
SOURCE Mouse (strain B10.A, haplotype H-2a) DNA, clone B10.A.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1805)
AUTHORS Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T.,
Steinmetz,M. and Moriwaki,K.
TITLE Recombinational hotspot specific to female meiosis in the mouse
major histocompatibility complex
JOURNAL Immunogenetics 31, 79-88 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Toshihiko
Shiroishi National Institute of Genetics Yata-1111 Mishima,
Shizuoka 411 Japan Phone: 0559-75-0771
FEATURES from to/span description
site 1 288 MT-consensus
rpt 1227 1242 TCTG repeat
LTR 1533 1568 Xenotropic solitary LTR sequence
BASE COUNT 391 a 400 c 409 g 605 t
ORIGIN Chromosome 17.
1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt
61 ggcgccattc ctgggctggc agtcctggtt tctataagaa agcaggctga gcaagtgatg
121 aggacgcccc tccatggcct ctgcatcagc tcctgcctcc agattcctgt cctgatttct
181 tcggtgacta acagctatgt ggaagtgtaa acaggatgaa cgctttcctt cccaggtagc
241 tttggtcctg gggtttcatt gcagtaatag taaccctaga tgggacaaga ctttgatcaa
301 gtgttccctt tcattgtccc cttcctgtag acatgacttc tcttcctata gacagtctct
361 cctctgcttt cctggacatg taattttttt ttttgagaca aggtcattct tgttgtctat
421 tcttgactgg ctttgaattc agaatctgca ggctctgcct ctctggtaac atgtaacatt
481 ttccatatgt aacattttta ccagccattt cccagtaaat gagttacttc atttgaggtt
541 ttgtcttaaa tccccgtgag caatgttttg ttagtttcca aagcacgagg attctaagtg
601 tctatttgtt gctaagttgc caggctgtta cagagcacag tttctgggac cctggctctc
661 tgaaactgac tagggattgc tttagtataa acataaacca ctgggactct ggctctttga
721 aactgactag ggattgcttt agtacaagta taaaccactc agtcctggtc ctacttggct
781 tcaaaagttg aatatcgctt ttggtatttg agatggagat ttaaagatgg aattttatta
841 gtcttctgcc tggttttctt tctttctttg ctcttactgc cttgtggctc agaaccagct
901 gttgcctgtt tgatagtttg tgaccaatac ctgtactgtt aaattggcca tttgagaact
961 caaaaagtcc caacttgtag tgttttcggt ttccatggtc ttagatattt ccactgcaga
1021 caacatcaag ttgccagtgg ttaacaactg tctttcagaa ctctcaagta tttcggtggg
1081 tctgccagcc cttgtaacgt agcgccacgt ggtatatgct tatttgtctg tctgtctgtc
1141 tgttgtgcaa gatgcctgtg tgccctgagg tcagaggaca gcttcaaggg ctctccattc
1201 ttccctgacc acgtggatcc agggaataga actttgacca ttacccacgg gccatgttat
1261 ttcttgacag ttctgttgta catttgtttt agtctttggc tttatttatt tttctcaccc
1321 tcagtttccc tttgtctcag atgctttttt ttttttttta aatcttgcct tgggagatgt
1381 ttcaaactct tggaacgaat gatacagttg tttgattgat agaacgaagc cttccagtgt
1441 gaatgcgttt gcatttcagc ttgttgctgg ctggctgtgt ggtgctggtt cagacatgtc
1501 acaggcttga ggtgttaagg ctaactgagt tcggagagtc cccacctgac cccttctccg
1561 ttcccctcac cagggagacc tccctcctgg ctgcagttga gcagggtgca ccggggctgg
1621 tttcagggca ggctggtagt cttctgactc tgctcactgg ccactttcag ttcctgcttt
1681 ctgaatccta tccagagttc tcagtggtca tcagactctg gagaggacga ggggaagggg
1741 tgggctctta aactatcatt tatatttaaa aaaaattaaa caacagagtt agaagcagat
1801 ccagg
//
LOCUS MUSH2B 1634 bp ds-DNA ROD 11-JUL-1990
DEFINITION Mouse (H-2b haplotype) DNA fragment.
ACCESSION D90008
KEYWORDS .
SOURCE Mouse (strain C57BL/10, haplotype H-2b) DNA, clone B10.30.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1634)
AUTHORS Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T.,
Steinmetz,M. and Moriwaki,K.
TITLE Recombinational hotspot specific to female meiosis in the mouse
major histocompatibility complex
JOURNAL Immunogenetics 31, 79-88 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Toshihiko
Shiroishi National Institute of Genetics Yata-1111 Mishima,
Shizuoka 411 Japan Phone: 0559-75-0771
FEATURES from to/span description
site 1 285 MT-consensus
rpt 1128 1143 TCTG repeat
LTR 1534 1569 Xenotropic solitary LTR sequence
BASE COUNT 340 a 373 c 367 g 554 t
ORIGIN Chromosome 17.
1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt
61 ggcgccattc ctgggctggc agtcctggct tctataagaa agcaggctga gcaagtgatg
121 acgcccctcc atggcctctg catcagctcc tgcctccaga ttcctgtcct gatttcttcg
181 gtgactaaca gctatgtgga agtgtaaaca ggatgaacgc tttccttccc aggtagcttt
241 ggtcctgggg tttcattgca gtaatagtaa ccctagatgg gacaagactt tgatcaagcg
301 ttccctttca ttgtcccctt cctgtagaca tgacttctct tcctatagac agtctcccct
361 ctgctttcct ggacacggaa tttttttttt tttttttttg agacaaggtc tttcttgtct
421 attctcgact ggctttgaat tcagaatctg cagctctgcc tctctagtaa catgtagcat
481 tttccatatg taacattttt accagccatt tcccagtaaa tgagttactt catttggggt
541 tttatcctaa atccccgtga gcaatgtttt gttagtttcc aaagcacgag gattctaagt
601 gtctatttgt tgccaagttg ccaggctgtt acagagcaca gtttctggga ccctggctct
661 ctgaaactga ctagggattg ctttagtata aacataaacc actgggactc tggctctttg
721 aaactgacta gggattgctt tagtacaagt ataaaccact cagtcctggt cctacttggc
781 ttcaaaagtt gaatatcgca tttggtattt gagatggaga tttaaagacg gaattttatt
841 agtcttctgc ctggttttct ttctttcttt gctcttactg ccttgtggct cagaaccagc
901 tgttgcctgt ttgatagttt gtgaccaata cctgtactgt taaattggcc atttgagaac
961 tcaaaaagtc ccaacttgta gtgttttcgg tttccatggt cttagatatt tccactgcag
1021 acaacatcaa gttgccagtg gttaacaact gtctttcaga actctcaagt gtttcggtgg
1081 gtctgccagc ccttgtaacg tagcgccacg tggtatatgc ttatttgtct gtctgtctgt
1141 ctgttgtgca agatgcctgt gtgccctgag gtcagaggac agcttcaagg gctctgcatt
1201 cttccctgac cacgtggatc cagggaatag aactttgacc attacccacg ggccatgtta
1261 tttcttgaca gttctgttgt acatttgttt tagtctttgg ctttatttat ttttctcacc
1321 ctcagtttcc ctttgtctca gatgcttttt tttttttttt aatcttgcct ctgggagatg
1381 tttcaaactc ttggaacgaa tgatacagtt gtttgattga tagaacgaag ccttccagtg
1441 tgaatgcgtt tgcatttcag cttgttgctg gctggctgtg tggtgctggt tcagacatgt
1501 cacaggcttg aggtgttaag gctaactgag ttcggagagt ccccacctga ccccttctcc
1561 gttcccctca ccagggagac ctccctcctg gctgcagttg agcagggtgc accggggctg
1621 gtttcagggc atgc
//
LOCUS MUSH2WM7 1630 bp ds-DNA ROD 11-JUL-1990
DEFINITION Mouse (H-2wm7 haplotype) DNA fragment.
ACCESSION D90009
KEYWORDS .
SOURCE Mouse (strain B10.MOL-SGR, haplotype H-2wm7) DNA, clone SGR.31.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1630)
AUTHORS Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T.,
Steinmetz,M. and Moriwaki,K.
TITLE Recombinational hotspot specific to female meiosis in the mouse
major histocompatibility complex
JOURNAL Immunogenetics 31, 79-88 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Toshihiko
Shiroishi National Institute of Genetics Yata-1111 Mishima,
Shizuoka 411 Japan Phone: 0559-75-0771
FEATURES from to/span description
site 1 288 MT-consensus
rpt 1126 1141 TCTG repeat
LTR 1533 1568 Xenotropic solitary LTR sequence
BASE COUNT 342 a 363 c 370 g 555 t
ORIGIN Chromosome 17.
1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt
61 ggcgccattc ctgggctggc agtcctggtt tctataagaa agcaggctga gcaagtgatg
121 aggacgcccc tccatggcct ctgcatcagc tcctgcctcc agattcctgt cctgatttct
181 tcggtgacta acagctatgt ggaagtgtaa acaggatgaa cgctttcctt cccaggtagc
241 tttggtcctg gggtttcatt gcagtaatag taaccctaga tgggacaaga ctttgatcaa
301 gtgttccctt tcattgtccc cttcctgtag acatgacttc tcttcctata gacagtctct
361 cctctgcttt actggacatg taattttttt tttgagacaa ggtcattctt gttgtctatt
421 cttgactggc tttgaattca gaatctgcag gctctgcctc tctggtaaca tgtaacattt
481 tccatatgta acatttttac cagccatttc ccagtaaatg agttacttca tttgaggttt
541 tgtcttaaat ccccgtgagc aatgttttgt tagtttccaa agcacgagga ttctaagtgt
601 ctatttgttg ctaagttgcc aggctgttac agagcacagt ttctgggacc ctggctctct
661 gaaactgact agggattgct ttagtataaa cataaaccac tgggactctg gctctttgaa
721 actgactagg gattgcttta gtacaagtat aaaccactca gtcctggtcc tacttggctt
781 caaaagttga atatcgcttt tggtatttga gatggagatt taaagatgga attttattag
841 tcttctgcct ggttttcttt ctttctttgc tcttactgcc ttgtggctca gaaccagctg
901 ttgcctgttt gatagtttgt gaccaatacc tgtactgtta aattggccat ttgagaactc
961 aaaaagtccc aacttgtagt gttttcggtt tccatggtct tagatatttc cactgcagac
1021 aacatcaagt tgccagtggt taacaactgt ctttcagaac tctcaagtgt ttcggtgggt
1081 ctgccagccc ttgtaacgta gcgccacgtg gtatatgctt atttgtctgt ctgtctgtct
1141 gttgtgcaag atgccggtgt gccctgaggt cagaggacag cttcaagggc tctgcattct
1201 tccctgacca cgtggatcca gggaacagaa ctttgaccat tatccacggg ccatgttatt
1261 tcttgacagt tctgttgtac atttgtttta gtctttggct ttatttattt ttctcaccct
1321 cagtttccct ttgtctcaga tgcttttttt ttttttttta atcttgcctc tgggagatgt
1381 ttcaaactct tggaacgaat gatacagttg tttgattgat agaacgaagc cttccagtgt
1441 gaatgcgttt gcatttcagc ttgttgctgg ctggctgtgt ggtgctggtt cagacatgtc
1501 acaggcttga ggtgttaagg ctaactgagt tcggagagtc cccacctgac cccttctccg
1561 ttcccctcac cagggagacc tccctcctgg ctgcagttga gcagggtgca ccggggctgg
1621 tttcagggca
//
LOCUS MUSMHH2IE 576 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Mouse MHC class II I-E-beta-1 (haplotype H2b/K) gene, partial cds.
ACCESSION M28408
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane glycoprotein; major histocompatibility complex.
SOURCE Mouse (strain B10 (3R)) adult spleen (haplotype H2b/k), cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 576)
AUTHORS Gorski,J. and Hayes,C.E.
TITLE The I-J-disparate mouse strains B10.A(3R) and B10.A(5R) have
identical I-E beta sequences
JOURNAL Immunogenetics 39, 127-129 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted J.Gorski, 27-SEP-1989.
The sequence for Mouse (strain B10.A (5R)) is identical to [1].
FEATURES from to/span description
pept < 1 > 576 H2-I-E-beta (AA at 1)
BASE COUNT 140 a 149 c 178 g 109 t
ORIGIN Chromosome 17.
1 gtcagagact ccagaccatg gtttttggaa tactgtaaat ctgagtgtca tttctacaac
61 gggacgcagc gcgtgcggct tctggaaaga tacttctaca acctggagga gaacctgcgc
121 ttcgacagcg acgtgggcga gttccgcgcg gtgaccgagc tggggcggcc agacgccgag
181 aactggaaca gccagccgga gttcctggag caaaagcggg ccgaggtgga cacggtgtgc
241 agacacaact atgagatctc ggataaattc cttgtgcggc ggagagttga gcctacggtg
301 actgtgtacc ccacaaagac gcagcccctg gaacaccaca acctcctggt ctgctctgtg
361 agtgacttct accctggcaa cattgaagtc agatggttcc ggaatggcaa ggaggagaaa
421 acaggaattg tgtccacggg cctggtccga aatggagact ggaccttcca gacactggtg
481 atgctggaga cggttcctca gagtggagag gtttacacct gccaggtgga gcatcccagc
541 ctgaccgacc ctgtcacggt cgagtggaaa gcacac
//
LOCUS RATFAPS 1271 bp ss-mRNA ROD 11-JUL-1990
DEFINITION Rat testis-specific farnesyl pyrophosphate synthetase mRNA,
complete cds.
ACCESSION M34477
KEYWORDS farnesyl pyrophosphate synthetase.
SOURCE Rat adult (Sprague-Dawley), cDNA to mRNA, clone TF1.4.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1271)
AUTHORS Teruya,J.H., Kutsunai,S.Y., Spear,D.H., Edwards,P.A. and
Clarke,C.F.
TITLE Testis-specific transcriptional initiation sites of rat farnesyl
pyrophosphate synthetase mRNA
JOURNAL Mol. Cell. Biol. 10, 2315-2326 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 158 1219 farnesyl pyrophosphate synthetase
mRNA 1 1271 farnesyl pyrophosphate synthetase mRNA
BASE COUNT 329 a 319 c 347 g 276 t
ORIGIN
1 ttatatttgg gttctgccta ctgagccggg agtctgggaa ctacaactcc cagagtgctg
61 agcggatgca cgctctgctt ttaggtgtaa gccgcaaaca tcttggaccc cgggagaatc
121 cgcgttgaag cacagagcat ttagctcctc tgtcagaatg aatggggacc agaaactgga
181 tgttcataac caagaaaagc agaatttcat ccagcacttc tcccagattg tcaaggtgct
241 gactgaggat gaactgggac acccagagaa gggagatgct attacccgga tcaaagaggt
301 cctggagtac aacactgtag gaggcaagta caatcggggt ctgacggtgg tacagacctt
361 ccaggaactg gtggaaccaa ggaaacagga tgctgagagc ctacagcggg ccctgacggt
421 gggctggtgt gtagaactgc tccaggcttt cttcctcgtg ttagatgaca tcatggactc
481 ttcccacact cgccgggggc agatctgctg gtatcagaag ccgggcatag gcttggatgc
541 catcaacgat gctctgcttc tggaagccgc tatctaccgc ctgcttaagt tctactgcag
601 ggagcagccc tactacctca acctgctgga gctctttcta cagagttcct atcagactga
661 gatcgggcag actctcgacc tcatcacagc accccagggc caagtggatc ttggtagata
721 cactgaaaag aggtacaaat ctatcgtcaa gtacaagaca gctttctact ctttctacct
781 gcctatcgcg gctgccatgt acatggctgg aattgatggg gagaaggaac acgctaatgc
841 cctgaagatc ctgctggaga tgggcgagtt cttccagatc caggacgact accttgatct
901 ctttggagac cccagtgtga ccggaaaggt cggcactgac atccaggaca acaaatgcag
961 ctggctggtg gttcagtgtc tgctacgagc cactcctcag cagcgccaga tcttagagga
1021 gaattatggg cagaaggacc cagaaaaagt ggcgcgggtg aaagcactgt acgaggagct
1081 ggatctgcgg agtgtgttct tcaagtacga ggaagacagt tacaaccgcc tcaagagtct
1141 catagagcag tgctccgcgc ccctgccccc atccatcttc ctggaactag caaacaagat
1201 ctacaagcgg agaaagtaac ctcgaattgt agaggctgcg agggaggggt ctcaataaat
1261 tattgttcaa c
//
LOCUS TTHRPEGL 2340 bp ds-DNA BCT 11-JUL-1990
DEFINITION Thermus thermophilus trpL, anthranilate synthase I and II (trpE and
trpG) genes, complete cds.
ACCESSION X07744
KEYWORDS anthranilate synthase I; anthranilate synthase II; trpE gene;
trpG gene; trpL gene.
SOURCE Thermus thermophilus (strain HB8 (ATCC 27634) DNA.
ORGANISM Thermus thermophilus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 2340)
AUTHORS Sato,S., Nakada,Y., Kanaya,S. and Tanaka,T.
TITLE Molecular cloning and nucleotide sequence of Thermus thermophilus
HB8 trpE and trpG
JOURNAL Biochim. Biophys. Acta 950, 303-312 (1988)
STANDARD simple automatic
COMMENT EMBL features not translated to GenBank features:
key from to description
PRM 37 42 pot. -35 region
PRM 60 65 pot. -10 region [1]
Author address: Sato S., Mitsubishi Kasei, Institute of Life
Sciences, 11, Minamiooya Machida-Shi, Tokyo, Japan.
Submitted (24-MAY-1988) on tape to the EMBL data library.
FEATURES from to/span description
pept 72 107 trpL protein
pept 169 1557 anthranilate synthase I (trpE) (EC 4.1.3.27)
pept 1603 2217 anthranilate synthase II (trpG)
BASE COUNT 351 a 764 c 855 g 370 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccgggc cctggagggg cggccccttt agcccctgga cagggccccc gtgtcccgct
61 atcctgaggc catggccctt ccctccgccc tctggtggcc cggctaggcc ccggggcggg
121 aggcctttcc ccggggcaca ccccggggct ttgtttttgg gggacggcat ggagcggatc
181 cgaccttacc gcaaaacctt cctcgcggac ctggagaccc cggtgaccgc ctacctgaag
241 cttgccgaga aggctccggt gagcttcctt ttggagtcgg tggagcgggg gcgccaaagc
301 cgcttctcca tcgtgggggt gggggcgcgg cgcaccttcc gcctgaagga cggggtcttc
361 acggtgaacg gggagcgggt ggaaacccgt gatcccttgc gcgccctcta cgagagggtc
421 tacgccccct tggagcgcca ccccgacctc ccccccttct tcggcggggt ggtgggctac
481 gccgcctacg acctcgtccg ctactacgaa aggcttccga gcctcaagcc cgacgacctc
541 ggcctccccg acctcctctt cgtggagccc gaggtggtgg ccgtctttga ccacctgaag
601 aacctcctcc acctcgtggc cccagggagg gaccccgagg aggcggaggc ccgcctcttt
661 tgggcggaga ggcggctcaa gggccccttg cccggggtgc cgggggagag ggcggggggg
721 agggcccgct tccaggcgga cttttcccgg gaggcctacc tggaggcggt gaggagggcc
781 ctggactaca tccgggcggg ggacatcttc caggtggtcc tctccttgag gctctcctcc
841 cccctcaccg tccacccctt cgccctctac cgggcgctga ggagcgtgaa cccgagcccc
901 tacatgggct acctggacct gggggaggtg gtcttggtct cggcgagccc ggaaagcctc
961 ctccgctcgg acggccgaag ggtggtcacc cggcccatcg cgggcacgag gccgaggggg
1021 aaggacgagg aggaggacaa aaggcttgcc gaggagctcc ttagggacga gaaggaggtc
1081 gcggagcacg tgatgcttct ggacctctcc cgcaacgaca tcggccgggt cgccgccttc
1141 ggcacggtgc gggtcctcga gcccctccac gtggagcact actcccacgt gatgcacctg
1201 gtctccacgg tggagggcat cttggccgag gggaagaccc ccctggacgc cctggccagc
1261 gtgctgccca tggggacggt ctccggggcc ccgaagatcc gggccatgga gatcattgaa
1321 gaactggagc cccaccgccg ggggccctac gggggaagct tcggctacct cgcctacgac
1381 ggggccatgg acatggccct caccctgcgc accttcgtgg tggcgaaggg gtggatgcac
1441 gtccaggcgg gggcggggat cgtggcggac tcggtgccgg agagggagta cgaggagtgc
1501 tggaacaagg cgcgggcgct cctcaaggcg gtggagatgg cggaggcggg gctgtgatcc
1561 caccccatgc cggcaggggc ccggtaagga ggcctggtag gcatggctgc taacggagcg
1621 aaggggagaa aggttatgag ggtcttggtg gtggacaact acgacagctt cacctacaac
1681 ctggtgcagt acctggggga gctcggggcg gagcccatcg tgtggcggaa cgaccgcttc
1741 cggctggagg aggtggaggc cctggacccg gaccggatcc tcatcagccc ggggccttgc
1801 accccctttg aggcggggct ttccgtcccc ttggtccagc gctacgcccc ccgctacccc
1861 atcctggggg tctgcctcgg acaccaggcc atcggggcgg ccttcggggg gaaggtggtc
1921 cccgcccccg tcctcatgca cggcaaggtg agccccatcc accacgacgg caccggggtc
1981 ttccgggggc tagatagccc cttccccgcc acccgctacc actccctggc ggtggtggag
2041 gtgccggagg ccctcgtggt gaacgcctgg gcggaggagg cgggggggcg gacggtgatg
2101 ggcttccgcc accgggacta ccccacccac ggggtgcagt tccacccgga aagctacctt
2161 acggaggcgg gtaaactcat cctcaagaac ttcctggagg acccatggac gcggtgaaga
2221 aggccattct gggcgaggtt ttggaggaag aggaggccta cgaggtcatg cgggccctga
2281 tggcggggga ggtctccccg gtgcgggcgg cggggctttt ggtggccttg agcctgaggg
//
LOCUS XELGBBBLI 6777 bp ds-DNA VRT 11-JUL-1990
DEFINITION X.laevis beta-L-I globin gene, upstream region.
ACCESSION M34470
KEYWORDS beta-L-I.
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 6777)
AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
Knoechel,W.
TITLE Sequence analysis of the upstream regions of Xenopus laevis beta-
globin genes and arrangement of repetitive elements within the
globin gene clusters
JOURNAL Mol. Biol. Rep. 14, 17-26 (1990)
STANDARD simple staff_review
BASE COUNT 2121 a 1247 c 1228 g 2181 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcaaag cttttttttt tattaaacag ttttattgta ttttcaaacg aaaaacaagc
61 agaggtaaga cagtcaacag gttaacatta atgctgcgtg aagggtacta tacattgtgt
121 ttacattaca acttgttgga cattgatatg tcacttctgt gaatttgaag ctttacattt
181 aactaaaatt tgtgatggaa tgtctaacct gcatcccagt ccaaagaaat ttcaaagtag
241 aagatgacat aatgttggta gagatagtga tgagcggatt ttttgccagg tatggatttt
301 ggcaaaattc cgcgcttctt cgtctgcaat tttttttccc aaaactgcag caaaaatcca
361 ccataaccaa aaagtcacaa agacaaaatt gtcgcagaga caagaaagtc acagtaagac
421 ttgatgctcc tgattcactt gcactgacca caccactgta ttaaagggca gagaggggac
481 tataatgcag agacccatgg tccaggctcc tatgaccatg gggtctgctg tatagctgag
541 ctagttacac cagtaaggcc aaaacaaaat ggacttgcat gctggaataa tatgcaaaac
601 tgctgcagtg cctgttttta atctgctggg taaagagtgc aattacaaat gttaggataa
661 ttgcttatca tactctgctg catacactta ggggcccatt tacttagctc gagtgaagga
721 atagaggaaa aaaacttaga atttcgattg ttttttttgg ctacttcgac ttcgacctta
781 gacttcgaat cgaacgattc aaactaaaaa tcgtttgact attcgaccat tcgatagtca
841 aagtactgtc tctttaagaa aaaactttga ccacctagtt cgccacatta aagctaccga
901 agtcaatgtt agcctatggg gaaggtcccc atatgctttg ctagcttttt ttggtcaaaa
961 ataaaccatt cgatcgatgg attaaaatcc ttcgaatcga tcgaacgaat aatgctaaat
1021 cctttgactt cgatattcga actcgaagga tttaacttcg acagtcgaaa atcgagggtt
1081 aattaaccct cgatattcga ccttaagtaa atttgcactt attattgcaa atatttgggt
1141 ccatgacaga gtcatctgta tataatgtga aattacaaat actggtgcct cccctgtttt
1201 actttgctct atgtgagaaa aataatggag tcagtgccat acatatcctt gtgtgtatgg
1261 tggaaattgt agatgtcttg ggggcaaatt tactaaaggg cgaagtggct aacgctaggg
1321 aaaattcgcc agcgttacgt caatttgcca cttcgacaat ttagtttacg gttaccatgg
1381 cgaaaattcg ctagcaatgt aaatagacca gcgcaacttc acaccctaac gctggcgaag
1441 tcaggatgcc cacattcccc ctacatttcc taacatatgg cacctgaatt atactagggg
1501 cacatgtgta gggctttttt taagtttccc tgggcctctg tagtgttatg tatttgctgc
1561 agcaatatac atgtatacaa atttccaatc ggtagcgtaa cctcgaaccg ctgatcgtaa
1621 catcactagc gcaacttcgc aaatgattgg taacttgtgt gcaacttcgg atcttcgtga
1681 atttgcgcag ccactgcgaa gctatgcctg gcgaagtgcg gcgaatgcaa gtctcgggat
1741 ctccgcaggt aagtaaattt gccccatggt cagaggcaag gccagattat gtactaggtg
1801 acctaagaat caatactgtc cattctaaaa gtgcaagttc ataagtgccc gcaactacag
1861 aaacaatagg ggagaactaa caatctgttg taaacaacat tacaaggttg gctccctcat
1921 tgtttatatt atagctgtat aactgtaatg atgagtacga tctaagatat aatgaatctt
1981 attgcaggca aaacaatcct gttgattaat taatgcttaa attatcagaa attacaaaaa
2041 cctcaggtcc tgtgcattct ggataacagg tcccatacct gtactaaaac atgggaccag
2101 ggtgtctgca ttgatcaatc acctctttta tgattgtttt gggccatcac tctacttcaa
2161 gatgctgatg atatattacc aataaatgtt atattatata cttaaaaatc ttaattgaat
2221 taatatagtc aaatccttga tggagacaga cctagtagta tcatggataa taaaactagc
2281 aacagcaagc attggcccga cttgccatct tggagtcttg aaggaatctt ccacctttga
2341 ggaaaattgg agacagcttg tctatttttc aacctcttct aatatctaat tgaagaagat
2401 ccttacatac tgtatgtggt ggaaaatgca tgtttcttta aagatatgct gattgttgca
2461 ccaatctttg ctcaaagatc ttataagaaa tctttaagca tgactgtctg caactatgac
2521 tattataaaa tcctttccat gtagagtttt catccttttt gtgggtcaaa ggctgcccct
2581 cagcaatatc aggggaatga aattaaagtc acaaagagca aaacaattcg caccaatagg
2641 actaaaaatc cacatctcgc aatgcaatat tgttccttaa actgttattg taattgcgaa
2701 ttttaattgg ccattgcgga ttttaattgc gcactcttaa gaagtgcttg aagttgtcgt
2761 aatcttttgg agcaaacata acgacttttt cattaagagg tttaattaca ttgacgcatt
2821 ggcgcaaact ataaaatttg caaatggtct tccactgtcg gaagtggtcg caaaacagtt
2881 tctgggctcg caaaagctat attaaatttg cgaaagcaaa atgtgttcgc gcaaaggtat
2941 aacttttgca ttgcgaatag ttttccgtta gcaactttta ttgcattccc ctgtaaatat
3001 ctaataagca tggcctcgag cccaaaagac acctttttag gtaaagaaat aaatggggat
3061 ttcattctat aagtaattga atttgcacta aatattagta agtcggtttc ttgccctact
3121 ccaaccaaac tcaagaactt tcatttatta aagcacaaga aactctaact cacatattaa
3181 caaatagtta tagttggtca aattgtagct cagttaaggg tatattatat atttctgttt
3241 gttcgggtgt ggtgggccag tttttgaaac agtcaactgt tttacttaca gcagatgtcc
3301 aggtggcttg ccatatcttt gtcaaaaaca aatatattgt cagtattgtt ttttcaacat
3361 ctgccttagt tagataagaa ttgacaatat agaaccaagg gtatctaaaa atgctgctct
3421 gccttgcaat ctaatggtgg gtgggtccaa tgatttgtgt atttgcctga aaaaaaggga
3481 atattgttct ctcccttacc ttttttccca aagaaattgt ttcttttaat gtgtccaaaa
3541 tacagcaact tcagtcttgt gatttgagct tcaagtgaga tcagagacat gatttgctca
3601 gggatccatt tgtttgtgtt cctttcttcc acagtattct caaaagtctt tactaaaacc
3661 aaaatttatt agtacatttc cttgtactgc cacatttaca tctattaaga gtgacatcaa
3721 atactataac tggacaattc ccaaagtaac ctcagtacat gttaaaatat cgttgacgtc
3781 ttccatgtct cattctaagt gtcaatctgc tacttgacta taagattttt gttgtttata
3841 agtgacccag taaggcaaaa gctatacata actagctgcc cacaaactgg ccaatataaa
3901 gggagaagga aaattgttgt tcccactgga gttgttcccc tggttgggga aaaaatacta
3961 ttttgtatac aaaatgctgt tctggggtca ccaggagaga gcttctgatg ttcagggcca
4021 ggtagtgaca taagcctgag aataagactt aggggcacat tcaagctcgg gtgaatgaat
4081 agagggaaaa aaactcctcg actatcgaat tggcgtaaat tcgcctgagt agaatgattc
4141 aaatagattg agcgaaaaaa cgctgcgact attcgcccat cgatagtcga agtattgtct
4201 cttttaaaaa tcatttgact gcctacttcg ccagataaaa cctaccgaat tgctttaaaa
4261 gcctatggga aagtcccata ggcttctttt ctacgttttt gatcgaataa aaaggcattc
4321 gatcgaatat tcgatcgaat gaaaatcctt cgattgaata ttcgatcgtg cccattcgat
4381 tattcgccag cgcgtaaatt tgcccgaatt ccctattcga ttccattctc cagtcgaatt
4441 tcgagggatt taacccctcg aaattcgacc cttgatacat ctgcccctta gtgtgccaac
4501 ttgctcattg tgtgcatgtg tgtgacatgc cataaggctc tcttattaag cgcatgtatg
4561 tgatgaaaca taaccatccc cactgggagc tccttcatgg tttagcagaa tagcgctcac
4621 taccagcttt ttattcaaaa actgatattg tttccctcaa ccagagtata agctctatta
4681 gcttgcacca tcagtggggg aatttttttt cccctattag gtttccttta agctgcaaac
4741 ttgacctctc cttcccatct gcagtatatt gaccaatata agggaccaac cccacagtaa
4801 gatatctatt gtgtatgttt caaaatccca ttaggtaagg acagtacatt tatgtggtcc
4861 ctataggccc tcattatgat ctaattattg ggtcaatccg tcgtttttgg tacagtggtg
4921 ccagccttga actagagtgg taaaagaggg ctttgttggc tctttgagca tatcatagag
4981 ccttcagcaa aagttcactt tttaaatgta caccaatgaa tggagatttt tgaggccccc
5041 aaaattgtat tgctgtagat cctgcaacag ccaatgatcc ctttatctgc tctgaaatct
5101 tttttgtcgc tgctgctgct actggttaaa tacagtatag ttgaaaaaat ataggctttg
5161 agaataaaac ctgatgttca tttgcttttt aattattact ttacatcccc tttaaaaata
5221 tatacacatc actattccat gcattacact catttttaat tagacaaatc tataagaaat
5281 tctgcgagat gacacttttc atgataagca ttttgtaaaa ttgtaatatg ttcagttttt
5341 ttttttaaaa gttcaatgcc acactttatt tcaaaatgta ttaaggtgca gtaattatat
5401 taaataaatg tattgtaggg tacatgaata tatgtaacat ttaaaatgtg tgtttatgca
5461 cttctttcaa gtacagtaca tttgcactgt gatcaaatat taatttgaac tttaacagtc
5521 ctatctctac acctttatct tgtcctgggg atcagtctgt tttttagtga tatcttgtaa
5581 cacagaactt taaacaaaag ggctccgttt tgcacgtaga cctgtttgtg aatccatggc
5641 aattctgcca cctaaagcat acataacatt tagcatcttt ttttggtgtt ttttagacag
5701 atgatggtat agccatttgt gcaaataaaa tcagatattt tatcccaaat tatttgtgct
5761 gttagttgta tagggtttca acaaaatatc ttatttatca tttagagcaa atacttatgt
5821 gttacagtat ctgcaagtag tcaagtttga gcttaaaatt cccataattc ataattaagg
5881 ggatggctta gtataaaaaa acgtggaaaa aaaaacgtgt acagttatgc ttttatattg
5941 ccttgtaagt tcttttttat actattatta ttttaatgac cacgttttga attattgcat
6001 ggatttatga aaaccagttt aattgcaaag aggctcctaa aaattattta ttataagtta
6061 aaatttagta tatgcgtgca tgtatatgta acaatgcact ctcatatcta gtaaaaatca
6121 aagttgaagt aaagtgtata actaagtttg acctttctca ggcattaatg atcccagagg
6181 aaggccacac tatgtgacca aaacattgga ctacatttat taaatacatt taccttgatt
6241 tcttcaacac aatttgaaag ttcctccatg agctaatata aatttataaa gagagagagt
6301 gagtaaaaca tttttatcag aaaacagtgg cagagtaaat tctttcatac ttacaaaaga
6361 gtgctactat gcgcaacatt aacttgacat ttttgaattg tacctaatgc aattcatgat
6421 atttaaattg aatacattaa ttttaattat ttaattgtcc tgaaatctct acaggttcaa
6481 aaaaataatt ccatttatta catttatttt gtacacttaa ttatctactg ttaagtgtca
6541 caattgccct catttgatgt gggtttaagt ttcatgttgt tataaagaat caactttaca
6601 atttaagaac tatatggcat tccacatata caaaagatat attagcttaa ggttaaaaat
6661 ttattttgaa ggcaataggg tggggtggag gaaaaaaaat atgacacagc agaaatgcac
6721 aatgggtgtg actcagcatg gccatataaa gcaaggccaa caactcaaag gaacagc
//
LOCUS XELHBBBAI 2027 bp ds-DNA VRT 11-JUL-1990
DEFINITION X.laevis beta-A-I globin gene, upstream region.
ACCESSION M34471
KEYWORDS beta-A-I.
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2027)
AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
Knoechel,W.
TITLE Sequence analysis of the upstream regions of Xenopus laevis beta-
globin genes and arrangement of repetitive elements within the
globin gene clusters
JOURNAL Mol. Biol. Rep. 14, 17-26 (1990)
STANDARD simple staff_review
BASE COUNT 792 a 266 c 284 g 685 t
ORIGIN 1 bp upstream of BglII site.
1 agatcttgat acgttaactt tactagaaaa taatttaaac cccaatagcc tggttttgct
61 tccaatatgg tttaattata ccttagtttt caggataatg gatctttctg taatttggat
121 cttcatgcct taactgtacc agaaaatcat ttaaacttta aataaaccca atttgcttcc
181 agtacagttt aattatatct tagtttggat aagtacaagg tactgtttta ttattacagt
241 gaaaaaggta atcattttaa aaaaaaaata tatattattt ggataaaatg gagtctatgt
301 gtgatggcct ttccgtaatt ctcggtttct ggcaaacgga tctcatacct gtaataggta
361 tataaaaaac acacattaaa aaatactaca tatatattta tattcttttt tttttttaaa
421 gtgtgtaaat tcatgtcttt aaaataataa aatgtattta tatatatata tatatatata
481 tatatatata tatatatata tatatatata tatatatata tacttcaaca aaaaatttgc
541 caaattcata catacaaaaa aaataaaata ataattttaa ataattgaat ctgtctagct
601 gtttatattc tctgctctgc tggatctgac tcctgaaaaa atgtgcagaa gccatttgat
661 ttacagagct ggaggagaat ggctacatta gtttaaaagc cagaaccagg agaggatgca
721 ggcaacaaaa atggatacac acaaattaac gtctattaca attatattta caaataacct
781 taaagccaac ttttttaaaa attattatat attgtaaagt tgcttagaaa ccaatttttt
841 acttataggg agcaaaaaat agggagatcc tgtaaaacag aagctgcacc aaacatagat
901 caagctatcg agctttccat acgtatacat ttatttgaaa ggcactgtta aggagccacg
961 gtgctgtaca gtgcataaaa gtacaatata tatatataaa agtatacaca gggaagacaa
1021 atcacacaat gaatatacac agagctcata tcagaacaaa cagcttaagt gctttgtggt
1081 aagagacaca gtgggaagga ggtccctgtc ccgtagagct tacagtctta cagagctcag
1141 ttcaatccat atgactcaac catttaatta ataaaataat ttgcctttta atcattaatt
1201 aattccacac ttccatgtat aaatggaata tatatgtaag atttatatgt aatagctata
1261 tatgtaagat ttgatatttt ttttgtagga ataaaatgaa aatcaggcaa ataaaaaaca
1321 acatatatat gtttaaaaac ggtgttaatt tctatgcaac atgacatgaa aaagactttt
1381 caatattttt acatatgtat acataagata tatgattgaa ccatttgaat aataaaatag
1441 cttttgcctt accatcatta attattctac catactgtat gtaaaaagca ttgctatatg
1501 taagatttga ttatattttg ttgtaggaat aaaatgaatt ccaggcatat aaaaacacat
1561 ttataaaaaa catttataaa aaacactaca tatacatata catatatata tatatatata
1621 tatatatata tatatatatg tatatatata agtttaaaaa gtgtgttaat ttataatgtc
1681 tttctggaaa tagaatttca cacttcattg tatacaaaat tattaatatt tgtaatattt
1741 gattatatta tgttgtaggg ataaaatgaa taccaggcat ataaaaacac actttaaaaa
1801 aaaaaaaata catagataat aatttaataa tttgtattta ttttttctta atattctagc
1861 tctgctgtaa taaaaaaaac atgcatctaa aagtggtgcc aaatgggagg gtacaaatgg
1921 gctgggcaaa tgtaacgtgt gcttatccta gccaatcaac aggcagagtg gaaaggggca
1981 gtgcatcctt acagctacat aaagtctgat ggatggagaa ttagagc
//
LOCUS XELHBBBLII 910 bp ds-DNA VRT 11-JUL-1990
DEFINITION X.laevis beta-L-II globin gene, upstream region.
ACCESSION M34472
KEYWORDS beta-L-II.
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 910)
AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
Knoechel,W.
TITLE Sequence analysis of the upstream regions of Xenopus laevis beta-
globin genes and arrangement of repetitive elements within the
globin gene clusters
JOURNAL Mol. Biol. Rep. 14, 17-26 (1990)
STANDARD simple staff_review
BASE COUNT 322 a 152 c 128 g 308 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcccttg tctggaaacc agttatccag agggctccaa attatggaaa ggccatctcc
61 catagactca attttaatca aattattatt attttttttt tacaaattaa tgcctttttc
121 aatgtattaa taaaacagta ccttgatccc aaaattggag gcaaaacaat ccagtttgtt
181 ttatttaatg tttaaatatt ttttttaata ttgttttgat ccaaattaca gaaaggcccc
241 ttatccagaa aaacctccat ttaggataag gataacaggt ccaatacatt cataccctgt
301 acaaatctat gctatgttta attacttata aatagatcca catttcaatg gatatttcta
361 gaatatcgta ataacggtat atacttgttc aaagacaaac acatttaatg acctatgcct
421 aactggaata acagtcaagg aaatttaatg gaataatagg tatttcggag ctttccattt
481 attaacccta caaacaacta gttgttgttt caggaaacag cagtagttct atttggctta
541 catcttgaac aaaagcaaag ttgctatagt tttctttttc gtgtaaggaa agaaatgact
601 tgtgtcttta tctctacatt aaaaatgtat ctgccacaca gaatactttc tttttttaac
661 ttatctatag ataacgtatg tgcacccaaa ttgtagctgt gttacatcag cataattaag
721 tgcacacatg aagaaaaaaa atgacagatt gacaaaatgt tatattatat ggtaaggtct
781 cttggataat agcccttatc agtcataact ggttacaaat acagaaaaaa tgaggtgaca
841 cagcataaat gatatgaata cgtcactaac ttacacccct ataaatcaca aggttaaaat
901 attttttttt
//
LOCUS CLONEUR 4835 bp ds-DNA BCT 11-JUL-1990
DEFINITION C.botulinum neurotoxin gene, complete cds.
ACCESSION M30196
KEYWORDS neurotoxin.
SOURCE C.botulinum (strain 62A, subtype A) DNA.
ORGANISM Clostridium botulinum
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 4835)
AUTHORS Binz,T., Kurazono,H., Wille,M., Frevert,J., Wernars,K. and
Niemann,H.
TITLE The complete sequence of the botulinum type A neurotoxin and its
comparison with other Clostridial neurotoxins
JOURNAL J. Biol. Chem. 265, 9153-9158 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Niemann, 29-NOV-1989.
FEATURES from to/span description
pept 358 4248 neurotoxin
mRNA 240 > 4835 neurotoxin mRNA
signal 344 349 ribosome binding site
site 4400 4432 potential terminator
BASE COUNT 1934 a 517 c 756 g 1628 t
ORIGIN
1 aagcttctaa atttaaatta ttaagtataa atccaaataa acaatatgtt caaaaacttg
61 atgaggtaat aatttctgta ttagataata tggaaaaata tatagatata tctgaagata
121 atagattgca actaatagat aacaaaaata acgcaaagaa gatgataatt agtaatgata
181 tatttatttc caattgttta accctatctt ataacggtaa atatatatgt ttatctatga
241 aagatgaaaa ccataattgg atgatatgta ataatgatat gtcaaagtat ttgtatttat
301 ggtcatttaa ataattaata atttaattaa ttttaaatat tataagaggt gttaaatatg
361 ccatttgtta ataaacaatt taattataaa gatcctgtaa atggtgttga tattgcttat
421 ataaaaattc caaatgcagg acaaatgcaa ccagtaaaag cttttaaaat tcataataaa
481 atatgggtta ttccagaaag agatacattt acaaatcctg aagaaggaga tttaaatcca
541 ccaccagaag caaaacaagt tccagtttca tattatgatt caacatattt aagtacagat
601 aatgaaaaag ataattattt aaagggagtt acaaaattat ttgagagaat ttattcaact
661 gatcttggaa gaatgttgtt aacatcaata gtaaggggaa taccattttg gggtggaagt
721 acaatagata cagaattaaa agttattgat actaattgta ttaatgtgat acaaccagat
781 ggtagttata gatcagaaga acttaatcta gtaataatag gaccctcagc tgatattata
841 cagtttgaat gtaaaagctt tggacatgaa gttttgaatc ttacgcgaaa tggttatggc
901 tctactcaat acattagatt tagcccagat tttacatttg gttttgagga gtcacttgaa
961 gttgatacaa atcctctttt aggtgcaggc aaatttgcta cagatccagc agtaacatta
1021 gcacatgaac ttatacatgc tggacataga ttatatggaa tagcaattaa tccaaatagg
1081 gtttttaaag taaatactaa tgcctattat gaaatgagtg ggttagaagt aagctttgag
1141 gaacttagaa catttggggg acatgatgca aagtttatag atagtttaca ggaaaacgaa
1201 tttcgtctat attattataa taagtttaaa gatatagcaa gtacacttaa taaagctaaa
1261 tcaatagtag gtactactgc ttcattacag tatatgaaaa atgtttttaa agagaaatat
1321 ctcctatctg aagatacatc tggaaaattt tcggtagata aattaaaatt tgataagtta
1381 tacaaaatgt taacagagat ttacacagag gataattttg ttaagttttt taaagtactt
1441 aacagaaaaa catatttgaa ttttgataaa gccgtattta agataaatat agtacctaag
1501 gtaaattaca caatatatga tggatttaat ttaagaaata caaatttagc agcaaacttt
1561 aatggtcaaa atacagaaat taataatatg aattttacta aactaaaaaa ttttactgga
1621 ttgtttgaat tttataagtt gctatgtgta agagggataa taacttctaa aactaaatca
1681 ttagataaag gatacaataa ggcattaaat gatttatgta tcaaagttaa taattgggac
1741 ttgtttttta gtccttcaga agataatttt actaatgatc taaataaagg agaagaaatt
1801 acatctgata ctaatataga agcagcagaa gaaaatatta gtttagattt aatacaacaa
1861 tattatttaa cctttaattt tgataatgaa cctgaaaata tttcaataga aaatctttca
1921 agtgacatta taggccaatt agaacttatg cctaatatag aaagatttcc taatggaaaa
1981 aagtatgagt tagataaata tactatgttc cattatcttc gtgctcaaga atttgaacat
2041 ggtaaatcta ggattgcttt aacaaattct gttaacgaag cattattaaa tcctagtcgt
2101 gtttatacat ttttttcttc agactatgta aagaaagtta ataaagctac ggaggcagct
2161 atgtttttag gctgggtaga acaattagta tatgatttta ccgatgaaac tagcgaagta
2221 agtactacgg ataaaattgc ggatataact ataattattc catatatagg acctgcttta
2281 aatataggta atatgttata taaagatgat tttgtaggtg ctttaatatt ttcaggagct
2341 gttattctgt tagaatttat accagagatt gcaatacctg tattaggtac ttttgcactt
2401 gtatcatata ttgcgaataa ggttctaacc gttcaaacaa tagataatgc tttaagtaaa
2461 agaaatgaaa aatgggatga ggtctataaa tatatagtaa caaattggtt agcaaaggtt
2521 aatacacaga ttgatctaat aagaaaaaaa atgaaagaag ctttagaaaa tcaagcagaa
2581 gcaacaaagg ctataataaa ctatcagtat aatcaatata ctgaggaaga gaaaaataat
2641 attaatttta atattgatga tttaagttcg aaacttaatg agtctataaa taaagctatg
2701 attaatataa ataaattttt gaatcaatgc tctgtttcat atttaatgaa ttctatgatc
2761 ccttatggtg ttaaacggtt agaagatttt gatgctagtc ttaaagatgc attattaaag
2821 tatatatatg ataatagagg aactttaatt ggtcaagtag atagattaaa agataaagtt
2881 aataatacac ttagtacaga tatacctttt cagctttcca aatacgtaga taatcaaaga
2941 ttattatcta catttactga atatattaag aatattatta atacttctat attgaattta
3001 agatatgaaa gtaatcattt aatagactta tctaggtatg catcaaaaat aaatattggt
3061 agtaaagtaa attttgatcc aatagataaa aatcaaattc aattatttaa tttagaaagt
3121 agtaaaattg aggtaatttt aaaaaatgct attgtatata atagtatgta tgaaaatttt
3181 agtactagct tttggataag aattcctaag tattttaaca gtataagtct aaataatgaa
3241 tatacaataa taaattgtat ggaaaataat tcaggatgga aagtatcact taattatggt
3301 gaaataatct ggactttaca ggatactcag gaaataaaac aaagagtagt ttttaaatac
3361 agtcaaatga ttaatatatc agattatata aacagatgga tttttgtaac tatcactaat
3421 aatagattaa ataactctaa aatttatata aatggaagat taatagatca aaaaccaatt
3481 tcaaatttag gtaatattca tgctagtaat aatataatgt ttaaattaga tggttgtaga
3541 gatacacata gatatatttg gataaaatat tttaatcttt ttgataagga attaaatgaa
3601 aaagaaatca aagatttata tgataatcaa tcaaattcag gtattttaaa agacttttgg
3661 ggtgattatt tacaatatga taaaccatac tatatgttaa atttatatga tccaaataaa
3721 tatgtcgatg taaataatgt aggtattaga ggttatatgt atcttaaagg gcctagaggt
3781 agcgtaatga ctacaaacat ttatttaaat tcaagtttgt atagggggac aaaatttatt
3841 ataaaaaaat atgcttctgg aaataaagat aatattgtta gaaataatga tcgtgtatat
3901 attaatgtag tagttaaaaa taaagaatat aggttagcta ctaatgcatc acaggcaggc
3961 gtagaaaaaa tactaagtgc attagaaata cctgatgtag gaaatctaag tcaagtagta
4021 gtaatgaagt caaaaaatga tcaaggaata acaaataaat gcaaaatgaa tttacaagat
4081 aataatggga atgatatagg ctttatagga tttcatcagt ttaataatat agctaaacta
4141 gtagcaagta attggtataa tagacaaata gaaagatcta gtaggacttt gggttgctca
4201 tgggaattta ttcctgtaga tgatggatgg ggagaaaggc cactgtaatt aatctcaaac
4261 tacatgagtc tgtcaagaat tttctgtaaa catccataaa aattttaaaa ttaatatgtt
4321 taagaataac tagatatgag tattgtttga actgcccctg tcaagtagac aggtaaaaaa
4381 ataaaaatta agatactatg gtctgatttc gatattctat cggagtcaga ccttttaact
4441 tttcttgtat cctttttgta ttgtaaaact ctatgtattc atcaattgca agttccaatt
4501 agtcaaaatt atgaaacttt ctaagataat acatttctga ttttataatt tcccaaaatc
4561 cttccatagg accattatca atacatctac caactcgaga catactttga gttgcgccta
4621 tctcattaag tttattcttg aaagatttac ttgtatattg aaaaccgcta tcactgtgaa
4681 aaagtggact agcatcagga ttggaggtaa ctgctttatc aaaggtttca aagacaagga
4741 cgttgttatt tgattttcca agtacatagg aaataatgct attatcatgc aaatcaagta
4801 tttcactcaa gtacgccttt gtttcgtctg ttaac
//GenBank-Updates@genbank.bio.net (07/12/90)
LOCUS BOVB1A 781 bp ss-mRNA MAM 12-JUL-1990
DEFINITION Cow beta-crystallin (p-Beta 25/23) mRNA, complete cds.
ACCESSION M33010
KEYWORDS beta-crystallin; crystallin.
SOURCE Cow lens cortex, cDNA to mRNA, clone p-Beta 25/23.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 781)
AUTHORS Gorin,M.B. and Horwitz,J.
TITLE Cloning and characterization of a cow beta crystallin cDNA
JOURNAL Curr. Eye Res. 3, 939-948 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 6 653 beta-crystallin
BASE COUNT 217 a 185 c 187 g 192 t
ORIGIN
1 tccagatgga gacccagact gtgcagcagg agctgaaatc ccttccaacc accaagatgg
61 ctcaaactaa ccccatgccg gggtctgtgg ggccatggaa gattaccatc tatgaccagg
121 agaacttcca gggcaagaga atggaattca ccagctcctg cccaaatgtc tctgagcgca
181 attttgacaa cgtccggtct ctcaaggtgg aatgtggcgc ctgggttggt tatgagcata
241 ccagcttctg tgggcaacag tttgtcctgg agagaggaga gtaccctcgc tgggatgcct
301 ggagcgggag taatgcctat cacattgagc gcctcatgtc cttccgcccc atctgttcag
361 ctaatcataa ggagtctaag attacaattt ttgagaaaga aaatttcatt ggacgccaat
421 gggaaatctg tgatgactac ccctccttgc aagccatggg ttggcccaac aacgaagttg
481 gctctatgaa gatacaatgt ggagcctggg tttgctacca gtatcctggg taccgtggct
541 atcagtatat cttggaatgt gaccatcatg gaggagacta caaacactgg agagagtggg
601 gttctcatgc ccagacttcc cagattcaat ccattcgccg tatccaacag tagtggatta
661 aaagctccaa gtaagaattc ctcaagcatg agaccttcct aaacaatcta gaataaaata
721 tatgttctgc tgatattgct tccaaatgtt agctgctgaa atccacaata aatgtcatta
781 a
//
LOCUS CFICENB 439 bp ds-DNA BCT 12-JUL-1990
DEFINITION C.fimi endoglucanase B (cenB) gene, 5' end.
ACCESSION M33026
KEYWORDS endoglucanase; endoglucanase B.
SOURCE C.fimi DNA.
ORGANISM Cellulomonas fimi
Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE 1 (bases 1 to 439)
AUTHORS Owolabi,J.B., Beguin,P., Kilburn,D.G., Miller,R.C.Jr. and
Warren,R.A.J.
TITLE Expression in Escherichia coli of the Cellulomonas fimi structural
gene for endoglucanase B
JOURNAL Appl. Environ. Microbiol. 54, 518-523 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 275 > 439 endoglucanase B (cenB) precursor
sigp 275 373 endoglucanase B signal peptide
matp 374 > 436 endoglucanase B
BASE COUNT 58 a 173 c 154 g 54 t
ORIGIN
1 ggatcccgcg cccggcgcga gcccgcaacc cacgcgccca cggatcgggc ctcacgagcc
61 cgacgttggc ggccgggccg gggggcgacc tcgagaccga ggagcccccg cgtgaggcga
121 cgttggccgc gcacgccgct ggtgagcggg ctgaatcgtt tagggcgttg acctgcggac
181 ggacccgtct ggacgatgcg ccaggcgtcg tgcgggtgcg actgcggaca gcacgggtcg
241 ccgaccacca ctcccgtgcc cggaagagga ccccatgctc cgccaagtcc cacgcacgct
301 cgtcgcgggt ggctccgccc tcgccgtcgc cgtcggggtg ctcgtcgccc cgctcgcgac
361 cggcgcggcc gccgcgccca cctacaacta cgccgaggcc ctgcagaagt cgatgttctt
421 ctaccaggcg cacggctcc
//
LOCUS RATLACTAS 250 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Rat lactase-phlorizin hydrolase mRNA, partial cds.
ACCESSION M34730
KEYWORDS lactase; lactase-phlorizin hydrolase.
SOURCE Rat (strain Sprague-Dawley CD) newborn, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 250)
AUTHORS Bueller,H.A., Kothe,M.J.C., Goldman,D.A., Grubman,S.A., Sasak,W.V.,
Matsudaira,P.T., Montgomery,R.K. and Grand,R.J.
TITLE Coordinate expression of lactate-phlorizin hydrolase mRNA and
enzyme levels in rat intestine during development
JOURNAL J. Biol. Chem. 265, 6978-6983 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 250 lactase-phlorizin hydrolase (AA at 2)
BASE COUNT 66 a 62 c 66 g 56 t
ORIGIN
1 agaaaggatc ttctaccaca aaacctatat caacgaggct ctgaaagcct acaagctgga
61 tggtgtggac cttcgagggt actctgcctg gacgctgatg gacgacttcg agtggctgct
121 tggctacacc atgagatttg gattgtatca cgttgacttt aatcatgtga gcagacctcg
181 cacagcaaga gcctcagcca gatactatgc agaggtcatt gccaacaatg gcatgcccct
241 ggccgggaag
//
LOCUS BOVARRB 1945 bp ss-mRNA MAM 12-JUL-1990
DEFINITION Cow beta-arrestin mRNA, complete cds.
ACCESSION M33601
KEYWORDS beta-arrestin; inhibitor.
SOURCE Cow adult brain cortex, cDNA to mRNA, clone pBARRESTIN-1/1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (sites)
AUTHORS Lohse,M.J., Benovic,J.L., Codina,J., Caron,M.G. and Lefkowitz,R.J.
TITLE Beta-arrestin: A protein that regulates beta-adrenergic receptor
function
JOURNAL Science 248, 1547-1550 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1945; for [1])
AUTHORS Lohse,M.J., Benovic,J.L., Codina,J., Caron,M.G. and Lefkowitz,R.J.
JOURNAL Unpublished (1990) See COMMENT for author address.
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.J.Lohse, 06-APR-1990, for release after publication.
Author address [1]:
M.J.Lohse
Howard Hughes Medical Institute, Box 3821,
Duke University Medical center
Durham, NC 27710
FEATURES from to/span description
pept 97 1353 beta-arrestin
BASE COUNT 419 a 590 c 556 g 380 t
ORIGIN
1 gttccgggaa ccggctggcc cgcgcccctc ctgtcggccg gggattttcc agcctgggcg
61 ctgacgccgc ggacctcccc gcggccgcct cggaccatgg gcgacaaagg gacgcgggtg
121 ttcaagaagg cgagccccaa tggaaagctc accgtctatc tgggaaagcg ggactttgtg
181 gaccacatcg acctcgtgga gcccgtggat ggagtggttc ttgtggatcc ggagtatctc
241 aaggagagga gagtctatgt gacgctgacc tgcgccttcc gctacggccg ggaggacctg
301 gatgtcctgg gcctgacctt tcgcaaggac ctgtttgtgg ccaacgtgca gtctttcccg
361 ccggcccctg aggacaagaa gcccctgacg cggctgcagg agcgcctcat caagaagctg
421 ggcgagcatg cctacccttt cacctttgag atccctccga acctcccatg ctctgtgact
481 ttgcagccgg gacctgaaga tacagggaag gcctgcggtg tggactacga agtgaaagcc
541 ttctgtgcgg agaacctgga ggagaagatc cacaagcgga attctgtgcg cctggtcatc
601 cggaaggttc agtatgcccc agagaggcct ggcccccagc ccacggccga gaccaccagg
661 cagttcctca tgtcagacaa gcccttgcat ctggaggcct ccctggacaa ggagatctac
721 taccacggag aacccatcag tgtcaacgtc catgtcacca acaacaccaa caagacggtg
781 aagaagatca agatctcggt gcgccagtat gcagacatct gtctgttcaa cacagcccag
841 tacaagtgcc ctgtggccat ggaagaggct gatgacacag tggcacccag ctctacgttc
901 tgcaaggtct acacgctgac ccccttcctg gccaacaatc gagagaagcg gggcctcgcc
961 ctggacggga agctcaaaca cgaggacacg aacctggcct ccagcaccct gttgagggaa
1021 ggagccaacc gggagatcct gggcatcatt gtttcctaca aagtgaaagt gaagctggtg
1081 gtgtctcgtg gcggcctgtt gggagatctt gcatccagtg atgtggccgt ggaactgcct
1141 ttcaccctaa tgcaccccaa gcccaaagag gaacccccac accgggaagt tccagagcac
1201 gagacgccgg tagataccaa tctcatagaa cttgacacca acgatgacga cattgtgttt
1261 gaggactttg cccgccagag actaaaaggc atgaaggatg acaaggagga agaggaggat
1321 ggtaccggct ctccgcggct caacgacaga tagactgggg ctgccctccc tccgggcagc
1381 tccaggtcca ctctcatgca ctaggatgct tgttcgtctt cttcctgtcc tggctccccc
1441 tcccctttgt tcttccagtt tctaccaggg ggccccagcg gtcttccagg tcacggtggc
1501 gaacccctgg cctcaggatt ggcccccatc accatgccaa cagggccaca ggcagcaccc
1561 tcaccctctc actgcatcac ttctccattc cccctctttt cctattgacc cccagacagg
1621 ccagcacagc tctggccttc ggatttgact cgggatgggg agcagaaagg ggaagatggg
1681 gcacaagggc ttggcgaggt ggggatgggg gctcaagacg cgtgagagga tgtggccact
1741 gtcccaggtg atgaatacag ttctggcagc taaaacatga ccgctttgaa ggccaccctc
1801 ctctggctgg gaggggacag acccatggat agattgtcca cacagatttg ctcgaagttc
1861 agacctacca aacagctgtc ttcttcttcc ctcgtccctg ccccctgttc ctctgtggct
1921 gacagtgacc ttggtgaagg tttgt
//
LOCUS BBVRNA3 389 bp ss-RNA VRL 12-JUL-1990
DEFINITION Black beetle virus RNA3 proteins B1 and B2 genes, complete cds.
ACCESSION M33065
KEYWORDS .
SOURCE Black beetle virus.
ORGANISM Black beetle virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Nodaviridae.
REFERENCE 1 (bases 1 to 389)
AUTHORS Guarino,L.A., Ghosh,A., Dasmahapatra,B., Dasgupta,R. and
Kaesberg,P.
TITLE Sequence of the black beetle virus subgenomic RNA and its location
in the viral genome
JOURNAL Virology 139, 199-203 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 10 318 B1 protein
pept 20 340 B2 protein
BASE COUNT 118 a 120 c 98 g 53 t
ORIGIN
1 tcgttaccaa tgttaaacga tgccaagcaa actcgcgcta atccaggaac ttcccgaccc
61 cattcaaacg gcggtggaag cagccatggg aatgagctac caagacgcac cgaacaacgt
121 gcgcagggac ctcgacaacc tgcacgcttg cctaaacaag gcaaaactaa cggtaagtcg
181 gatggtaaca tcactgctgg agaaacccag cgtggtggca tacctagagg gaaaggcccc
241 cgaggaggca aaaccaacac tcgaagaacg cctccgaaag ctggagctca gccacagcct
301 tccaacaacc ggaagtgacc ccccacccgc aaaactgtag gtggctctta ggagcaccca
361 cacccgttct agcccgaaag ggcagaggt
//
LOCUS MUSURNAA 54 bp ss-uRNA ROD 12-JUL-1990
DEFINITION Mouse small nuclear RNA.
ACCESSION M34036
KEYWORDS small nuclear RNA.
SOURCE Mouse plasmacytoma cell line P301 small nuclear RNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 54)
AUTHORS Chernokhvostov,V.V. and Georgiev,G.P.
TITLE Complexes of nuclear matrix DNA with proteins tightly bound to DNA
contain a specific small-size RNA of a novel type
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by V.V.Chernokhvostov, 04-MAY-1990.
or address:V.V.Chernokhvostov . of Molecular Biology, USSR Acad.
Sci.
Vavilova str., 32
117984, Moscow
USSR
FEATURES from to/span description
uRNA 1 54 small nuclear RNA
BASE COUNT 19 a 12 c 13 g 10 t
ORIGIN
1 agaagacacc ctgatttaac ttctggtatc ggaagatgca agagccgaac caga
//
LOCUS RATCYP2A1 18820 bp ds-DNA ROD 12-JUL-1990
DEFINITION Rat hepatic steroid hydroxylase IIA1 (CYP2A1) gene, complete cds.
ACCESSION M33312
KEYWORDS B2 repetitive sequence; LINE repetitive sequence; cytochrome P450;
hepatic steroid hydroxylase IIA1.
SOURCE Rat (strain Sprague Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 18820)
AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J.
TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2
genes and regional localization of the CYP2A gene subfamily on
mouse chromosome 7
JOURNAL Biochemistry 29, 1329-1341 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 4573 4749 hepatic steroid hydroxylase IIA1 (CYP2A1),
exon 1
5050 5212 hepatic steroid hydroxylase IIA1, exon 2
7638 7787 hepatic steroid hydroxylase IIA1, exon 3
8005 8165 hepatic steroid hydroxylase IIA1, exon 4
9386 9562 hepatic steroid hydroxylase IIA1, exon 5
12760 12898 hepatic steroid hydroxylase IIA1, exon 6
13340 13527 hepatic steroid hydroxylase IIA1, exon 7
13960 14101 hepatic steroid hydroxylase IIA1, exon 8
17010 17191 hepatic steroid hydroxylase IIA1, exon 9
pre-msg 4545 17380 CYP2A1 mRNA and introns
IVS 4750 5049 CYP2A1 intron A
IVS 5213 7637 CYP2A1 intron B
IVS 7788 8004 CYP2A1 intron C
IVS 8166 9385 CYP2A1 intron D
IVS 9563 12759 CYP2A1 intron E
IVS 12899 13339 CYP2A1 intron F
IVS 13528 13959 CYP2A1 intron G
IVS 14102 17009 CYP2A1 intron H
rpt 3750 4026 B2 repeat
BASE COUNT 5274 a 4402 c 3695 g 5449 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattctagt acggtagccc tggctttcat caactagtta gtgccaaata tttgagaaaa
61 gttacaggtt caagctaata aaagttgcag agagtataaa agaatgcaga ttagacaaga
121 aaaaattaat tagagccctt ctagccaaca aagcctcaga tccaggagaa aagactacca
181 tagaaatggc caaaggctta tttatcaaag aaactgggct cagtggcagc aggatgacca
241 ccttgcctgt gtttattgtt gccacagcac tgttggataa agatgcaaat aaattaactt
301 tgggacagaa gttgatcatg actgctcctc cccctgcccc cgcaattctg attgaggctc
361 agtaatgcct acatgcttca ttatcacact ttactaatca gccctggctg agatattttc
421 cagccacctg tttccctgaa ccctgcgact cttccaccca accctgactt gggctgtcca
481 cttcatcaat tcgatgaggt tcaggcccag atacacaata ccagacctta cttgaggaac
541 tctcatccat cagaaacaga gcatacctgg ttcacggaca gaagtagctt catccataag
601 ggtcagagga gaacaggggc agcaataaca acagaaggaa aggtaatctg gactcagtct
661 cttccttcca ggcacttcaa ctcaaaagaa caaactaaga acattaacac aagtcctcat
721 catgggaaaa ggactggctg ttagcatctg cagggacagc cagatatgca tttacaacta
781 ctcatgtgca cggagccatt tacaaggaac aacagctcct aacagcagaa ggaaaaactg
841 tcagaaataa agataaaact ctacaggaca tcaaaaagaa accaatggct agagataaca
901 gtctgactga taaggcttct aagaaggtag ccttaaagga aacagccaac actagattgg
961 ccattgtcct acctgaacca cctagagtaa ctgataaatg cagaaaaaga aattaaatgg
1021 gctggtgatg gtcaactgaa ggtaagcgaa tattgcccac ttctcagaaa gatccacagt
1081 cactcacttg ggagtaaaat gaatgacaaa ctttaaagtt tgccagctga ccaacactca
1141 caggaagccc aaacatccaa attcctgact gcgagttaag agacttggag cctactggga
1201 aaattgattt ttttcagaaa tcaagcaaga aagatatggc tcaaaatatc tgctgatatt
1261 tgtagacatt ttttttcgag atagatggat agatgtacta ggagagatct ttccgatgtt
1321 tggagcacct aaggtaacag gatcagacaa tgggcctgtc ttcatatctc aggtaagtca
1381 gggacttgct aagatcttgg ggactaattg gaaactccat tgttcatatc atccccagag
1441 ttcagggcag gtagaaagga tgaatagaac tctaaaagag accttaacag aattagcctt
1501 ggagactggt ggggactggg tgaggctctt tccctttgcc ctatattagg tgtacatggc
1561 attctagctc ccattgtatc tagcctacag ttggtagcta ttacagaact gaaaaatgat
1621 aatttaagat ttaaggtcag agctaccaaa taggctcatg aatttgtttg ggcctaaatt
1681 atgtaccttc tgtgaagcag gcctggttcc agaaccacac aagtcaaaag agactgggtc
1741 tctatgaaga gatttcacca aggtgcgact aaacccatgt gaaaatggca attcatcatc
1801 ctgttgatca tgatcaccac ctgggtgtac aacaaccaca ccagaccagt tcctccaatg
1861 aagaactctg cctggctcca gctgtaccaa aatagagggt tcaaaaggac accaagccct
1921 tcaagttaaa gttgactcag tctcagtcct gagtctcttg cccctgctaa ctctatgtct
1981 atatatactg tatgtcttag atccccccct gttaggaagg taccctagct ggatccttga
2041 taattttact tttatttctg acttttggcc cctgtatttt aagttgctta gtagtttata
2101 agagaattca gtcaagttaa ttatcttaag gcaacactat ctacagctgg aagcagggaa
2161 gcaagcatat gagttagaag actataagct tcaagatcaa agctatgcta aaagaaaagg
2221 ggggaatgaa aagccagagt tggggtcaat ctgaggccaa tgagaaaaac ccaccattaa
2281 catccaagca cagaacgacc cttctcttcc agaaagagta aagctagttt agttcctgga
2341 acagctacaa gccaaactgt tgaacaaagc cacatgtaac tccccatcca acctccagaa
2401 agtcccagaa tggcacactg accacaagtc attttggagg ttacttcacc ccactaatag
2461 tagtactctt cctagttact gttgtgcaaa ttctgcccca attgtttgta aggtatatac
2521 agacccagtt agagtctgct cagggtcttc tctttctgaa agggagtcaa ccccgacgca
2581 ttaaaataaa gctagtcttg gttttgcatt gattagcacc tccttgagtc tcactcaagg
2641 ggtcccggaa agggtcagat tagacctcat atacctctga gcacagcttg tatggtgact
2701 aagatacagg atacccacag gctgggatta gagagtttaa accaaagatc tttcatccat
2761 gtgctccatg cctgccctgt gcccaggggg aaacatggat tctaattaca gaagcctccc
2821 taaggatctt aatgggaacc aagtaggaga cttttccagt tagaagcctt ctgacaactg
2881 gggtttcccc atattggtag tttaggttgt tatttcacaa aactacaatt ccttcaccaa
2941 ctggagttct gagttattct cctctagtct ggaaaatgat ctgctaaaat atagctgtgg
3001 ttttctaccc ttttcaaagc catacataga cagggaaggt tgcccatcct tccctgaagt
3061 tgaagatcct tttagaagtc aatgcaccca tcagtggtga taaatgcctt taatcccagt
3121 atgcagcaaa ctctgtgagt ttgacgccaa attggtctac agagtgtgtt ctagaacagt
3181 cagagctaaa gagagaaaca ctctgtggaa aagaaagaaa gaaaaaagaa aggaaggaaa
3241 gaaggaagga aggaaggaag gaaggaagga aggaaggaag gaaggaaaaa gaaaggaagg
3301 aaggaaaaag gtacagagag agggaaagag ggagggagaa aaataacata tatgaagaca
3361 cagtacagga ccaatctggg ctcaggtgcc cactttagtc tcctactgga attttcatcc
3421 acttgtacca gaaactcagc acccacagat ccttcttgcc atgtgacctt ccagtccata
3481 gtttggaatc tttcctgttt tccttactaa tatttttctc ctaataaaaa gactaaacca
3541 tctagactct aggactccag agatgactct gtgggtaaga gcacttgttg ctcttgtaga
3601 agacccaggt tttattccta gaacccacat ggtggcttac aaccatgtgt gaccccattt
3661 ccaaaggatt cctctaaatc ttttggcatc tttggacagt gtgcccaatt gttacccaga
3721 cttaaatgga agagaaacct tcatattaca taaaaaatta acacaataag atccataaag
3781 aaatatataa aagaagataa tcttttttaa aaaggataca ttggatacat tgccaggcct
3841 ggagagatgg ctcagtggtt aagagcactg acagctcttc cagaggtcct gagttcaaat
3901 tcccagcaac cacatggtga ctcacaatta tctgtaatgg gattcaatgc ccactactgg
3961 tgtgtctgaa gacagtgaca gtgtactcat atacatgaaa gaaataaatg aatcttgaga
4021 acaacattta atgcctgaag ccatgtttcc tgtattgttc cagtcaaatc taagaatgtg
4081 aattctatca cagaccacaa catttacatc tatgagggct ttcttcatga gctcaaccat
4141 acaaatatag atttttagtt ctagatttga tctggtggac ccagaaatgg acagcctcct
4201 gataatagcc acagtcccca atacagcacc aatctcatca tagcatttga agagtgtatt
4261 atatgttggc ctgttcacct tgtccactaa aaccctcagc ttggtccacc aaagcctctt
4321 tgactgcatt gtatcaacac accaaagcac accgaggctt taagaatttt gaagtaagcc
4381 tgccacccag aggtcttcct atttgcccat gttgtgggtg ttgcaacaaa gacagggtca
4441 gtgttaggag ataggattgg agggtaaaag actcaactag acaaacagga gcaaaggcca
4501 tcctgtgtcc ctgggagtat aaaggtacta tctcagcctt ggctatcagc ctgtcaatcc
4561 tcactggcca ctatgctgga cacaggactg cttctggtgg tcatattggc ctccctgagc
4621 gtcatgctct tggtgtccct ctggcagcag aaaatcaggg ggagattgcc tccaggaccc
4681 actcctttgc ctttcattgg aaattatctg cagctgaata caaaagacgt atacagttcc
4741 atcacacagg tatcactgga tgaggggatg gatgggacat gggagcacaa gaggctgtga
4801 tgttttgcat gttttgtggc agaagattca tagaggaatc caaagtcttg tattagtgga
4861 gtttagaaag ataaggagct atttcaagtc tttggtttgt tgtttgttgt ttgttttgtt
4921 tgttgtttgt ttctttgttt ctttgttttt tcaatcattt atttgtagag taacacataa
4981 tctgacctct gtgtactggt ccagttcagt gaataagtca tctaacagcc cccatctacc
5041 ccacatcagc tcagtgagcg ctatggtcct gtgttcacca tccaccttgg gcctcgccgg
5101 gttgtggtgc tttatggata cgatgcagtc aaagaggctt tggtggacca agctgaggag
5161 ttcagtggac gaggcgaaca ggctacctac aatacactct tcaaaggcta tggtgaggag
5221 gataccacat tggggaacat gcccaaggac atttgttggc gtcatttaag tagccttcat
5281 actaactcat ctctccctca aggctgtaca gagttctctg aatttctctc catatccatg
5341 ttgaatgttg gctctcattg tgaccctccc tagcatttct gagattgaaa acagactttt
5401 gcaaattctg tgggttcttt cttccatcct tctctaccgt tttcttccgc cctttctacc
5461 acctatcact agataggaaa gaaaaggaga tagaggtgaa aggggacatt actgttagat
5521 tatttcctgc tgattaggag tgacgagctc cttagggaaa gttttatctt ctctgtcagg
5581 atatctaatt tcttcttgtt gttatttctt tacataagac tacttaacaa atcacaagca
5641 acagcaacta accaatagcc aaaaccaatt tctcagggtc cttgcattta cacaaccttg
5701 aggagtccca gtatcctgag tgtcacacac tctcagaaac tatctgcagc tggcaaaatc
5761 ataacctcct gctttggaca acctgaacca gccccatatg ccatacctgg gagtaaacag
5821 aaacatattt ctataatagt tctgtatttt tcaaagaaat caaatttctt actacatctg
5881 gccattgctg ctcttctctc tctctctctc tctctctctc tctctctctc tctcacacac
5941 acacacacac acacacacac aaacacacac acacaaacac acacacacaa acacacacac
6001 acaaacacac acacacacaa acacacacac acaaacacac acacacgcac gcacacacac
6061 acaacctctc ggcattctcc tagatggatg actccttttt aatttagctg atatttttat
6121 ccttcttaaa catttatcca cacacagagc atcagttgca ggtctcaggc attcactcct
6181 gatgcctctg gattggtttt ttagattctt tgttcttact tttccatcta tgggtgctgg
6241 gctctcaagc acatctctgc acagtgtgtg tgcctggtgc ccatggaagc aaaaagatgg
6301 agtcagatct cctgaactcc aggggttccc tgagttccag ggttatgagc tgccaggtga
6361 gtgctggggt acaagcacag gtcctctgca aggtcagcca gtgctcttga gtgcagagcc
6421 agctttgctg ccccccactg cctatatttt taaatgctgt tttacatact ccatgtgttg
6481 tccctaagat gtgtataatg cttatagaac gtcacagtct ggtaagtgct ggccaaagct
6541 acagaagtat aaaatggcct tgaacagcaa aacactggtt ataagcaaga aaggtcaaaa
6601 taaagagaaa atccacaaag agccaaatat ctttataaca ttaattctgt agttaaaatt
6661 taacacagag agtgtatctc gttccttgaa gaactgaagg acacacaaat gactacttct
6721 acctagggtc aaaatatagc ggtgactaca gctcaagaca cacaaaacca gagtcaagaa
6781 tcagggagtg gtaataaaat aataaaaaat cctggctcag ggtttcttcc cacctttccc
6841 tgatgaaagg cacacacagc ctttatattt tagtctgcct tatgcagcac aatagctggg
6901 cagctgccta ccctccatgc tgttagaatc cattttccta ttgaaagccc caagttaata
6961 ctttacaagt ttctttatac catatttgct attcttgacc caactgagga gcccttttgg
7021 ccacactgtc ttggcccata gcacatggtg tctctccttc taccttctgc tctttcttct
7081 tccatggctt ccacagaggc tcctcaatcc cattctcctt cctcatgctc tctagcccca
7141 gaaaactaag caccacaagt ctcttctccc agctattagc tgctgacatc tttatttacc
7201 aatcagaatg aactgcgggc aggatcactc agacaaacta cagactccaa atcttagagg
7261 ccaacactta ctgttatagg aaacaataaa agacaaaaac ctcaacacca gggtatgttt
7321 ctgggtaggc tgtccttgct ttaatgggga tttgctgttt tcagaaaatg ctcaatattg
7381 attgattttg ccatttccag gaccctttgc tgcattctgt ctgtaagtct ctttttattt
7441 gcctggctga cttgtttcaa ctttctttct ctgactgtgt ctgatgcaca gtctgtgttt
7501 gtgtcttttg tgtccttgcc atttctatcc aactttgtct cttttctttc ccccttagaa
7561 cccctttcca gggtgggcct catccatcct cagcctcagt ctacttctcc tgacccctta
7621 tatttatatc tctacaggcg tggcattcag cagtggggag cgggcaaaac aactcaggcg
7681 cctctctata gccacattga gagattttgg tgtgggcaag cgtggtgtag aggagcgtat
7741 cctggaggag gcaggctatt tgatcaagat gttgcagggc acttgtggta agcaagagac
7801 cattaagtgt ttgggcaaga gaaagaacat ccctgacacc tagaccctat gggttgtgga
7861 taagaagggc ggggaagacc gcctaccaaa ccatccccag aatctggtgc tgagagattg
7921 gtgcctcact ccaattccca caccatctgc taactcttct ccctcataat gccaatgtct
7981 tccaaacaat gtcacccctc tcaggagccc ccattgaccc caccatctac ctgagcaaaa
8041 cagtctccaa tgttattagc tccattgtct tcggggaacg cttcgactat gaggacacgg
8101 agttcctgtc actgctgcag atgatgggtc aaatgaacag atttgcagct tcacccacag
8161 ggcaggtaac agatccagct ctgccaattg tccttatagt gtcccacatt gaccatacca
8221 acaaagggca aggaccaccc tgactctcat ggctacaaac aaaagctccc ctcaaaaaca
8281 gaagctcccc tcaaaaccag cctttacttc agaaaactga acctttacat cagagcccac
8341 agaagctatc cagtgctcac aatctaatgt cctctggata tctcagtagc ctgagaacac
8401 agccctctgc ttgactctct tccctgggca ggtttctcca gcttaacctc taataaatcc
8461 tctatgtggt cctcctgaaa atttagacaa ctgcccaagg gatacaagtg accacctctg
8521 gccccctcct ccaatcctga acacctacct agttctgcaa aactgtggtc agtaaagcta
8581 ttcagtccat acacccagtt ctccccaaag atcccactga cacaatggca caaaagtcac
8641 ctgttgtctc aggtaaattc aggaatgagt agacaggcac ctcaaccaag gcaaccaagc
8701 acagacctct ggatggactg tttccccaaa cacccatatg tctcccagct acacacaacc
8761 cacatcaaga caatatctga caggtgtgtc tcacacctta taacctgaac caccccacca
8821 tgaagacctg actatgtgaa aaaccgattc taatctcaaa caaatatcaa gacatctaat
8881 cttagccctc tcaaatgccc aaacatatag atacttgatt cactgcgaca ctcatgtcct
8941 gaatactaga aacctggagt aatggtctga tccaaaaatc agttaaataa ctgaatgtct
9001 actaatgttc ccttttgatc cagttcattg ggattgtaag acaatgacct tcattcttta
9061 aatcacctag aaaactgtgg tctctggggc ctctgacagt tcagtggttt aagagcatgc
9121 actgctcatc ctgaggaccg agttcagttc ccactaccta tgctgaacat ttcaaaactc
9181 tatgggagta cacctgcacc gtgcacataa ttaaaagtaa aatattcaaa cgaatataaa
9241 gagttctttc aagagtggag gtgctgtttg ttgcaattca tcctaacata aatacatgaa
9301 cacctggatg gatcccttga gactcgaccc actcccacgg gtgttgccac tgacaagcct
9361 tttcttttct cctcccaccc cccagctcta tgacatgttc cattcagtga tgaagtacct
9421 gcctggacca cagcaacaga tcatcaaggt tactcagaaa ctggaagact tcatgataga
9481 gaaagtgagg cagaaccata gtaccctgga ccccaattcc ccaaggaact tcattgactc
9541 ctttctcatc cgcatgcaag aggtgatccc aatcatggtg gatggaatgt ctaaaacagg
9601 gcagctctaa atcatcctag aaaaggagga ggaatatagg cccattaagt gcccatgatt
9661 ctcctcacag tcccggttat agttaaacct cactctttca cctgttgagc cttatccaag
9721 ccagggtatg ggttagcaaa ttaccatgac aaccgatatt ccagtgttcc cctatgagac
9781 actgttttca gtgttcaact acttagcatg cactgaagct actgtcgaag accctgtgga
9841 gcctaaactt cgcaaagagg gaaagtgtgc ccagacttgc atgctgactt tatggagaca
9901 gaaaactata cagccttgcc tctatggctc tcaggctttt actattagcc acatggtctc
9961 tagcatttca tatctctgtt aggaaataca catcagtaca catcagtggc ctaagacctg
10021 ggtttttttt tcttttgtct gttctagtaa tttttttatt gtttttcatt tttgtgtttt
10081 tttcttttat tggatttttt atttctattt cagatattat cccctttctt ggtttccctt
10141 ccagaaacct gctatctcct catgcttcta tgaggattct ctcccaccca cacaacactc
10201 cctgccacct ccctgtgctg acattcccct acactggggc atcgagccca gacaggacca
10261 agggtctctc ctcccattga tacccaacaa ggccatcctc tgttatacat atggctgaag
10321 caataggtac atccctgtgt actcttggga tggtttagtc actgggagct ctggtgggtc
10381 tggttggtta atattgttgt tcttcttata gggtggcaaa ccccttcagc tccttcagtc
10441 ctttctctaa ctcctccata tgggaccatt ttctcagttc aatggttgac tgcaagcatc
10501 tgcctctgta attgtcacgc tctgcagagc ctctcaggag acagctatat gaggatcctg
10561 tcaacatata tttcttggca tccacaatat tgtgtgagtt tagaggatgt caatgggatg
10621 aatccacctg tagggcagtc tctgaatggc ctttccttca gactctgctc caaactttgt
10681 ctttgtattt ccttctttga gtatttttgt tccccctttc aagaaggact gaagcatact
10741 cacttgagtc tttcttcttc ttgagtttca tgtggtctct gaattctatc ttgggtattc
10801 caagtttttg gactaatatt tacttctcag tgagtgcata ccatgtgttg ggttacctca
10861 cttaggatga tattttttag ttccatccat ttgcctaaga atttcatgaa gtcattattt
10921 ttaatagcag tgtagtactc cattgtgtaa atttactata ttttttgtat atatttctct
10981 gttgaagaac atctagtttc tttccagctt ctggctatta taaataaggc tgttatgaac
11041 atagtggaga gtgtgtcttt gttatatgtt ggagcatctt ttgagtatat gcccaggaat
11101 ggtatagctg agtcctcaca taatactatg tccaattttc tgaggaacct ccaggatgat
11161 atccagagtg gttgtatcaa attacaatcc accaacaatg gaggagtgtt actctttctc
11221 cacatcctta ccagcatctg ttgtcacctt cgtttttgac ctttgccatt ctaactggtg
11281 tgaggtggaa tctcagagtt gttttgattt gcatttccct gatgactaag gaggttgaac
11341 atttctttag gtacttctca accatattcc taagctgaga attctttgct tagctcttta
11401 ctccattttt aatggggtta tttgattctc tggagtctaa cttcttgagt tctttgtata
11461 tatttaacat tagccctcta tcggatgtgg gattggtaaa gatcttttcc caatctgttg
11521 gttgtcgatt tgtcctaatg acagtgtcct ttgccttaca gaagctttgc aactttatga
11581 agtagtattt gtcaattctt gatcttagag cataagccat tggtgttttg tttaggaaac
11641 tctccctggt gcccatgtgt tcaagaccct ttcccacttt ctgttctatt agttccagtg
11701 tatctggttt tattttagtt taattttatt tttcttggat aattatgtat tacacatcaa
11761 atgttattcc ctttgtcccc tctctcatat ccccttcccc tccctctgcc tctatgggga
11821 tgctaccacc cccatccacc cactcccacc tcaaccccct agcattccct tacattgaga
11881 aaaagagcct tcactagacc aagggctttt cctcctattg atgctggaca atgccatcct
11941 ttgctacata tgcagctgaa gccacgggtc cttccatggg tacgctttgg ttggtggttt
12001 aggccctggg agctctcgtg gagtctggtt ggttagttga tattattctt ccatccctaa
12061 aatgaatgac agtcacctag acagagaaat gagcaaagct tctcatgcaa acccaagact
12121 gctaacacag cctggagatc tttttccaac gattggtctg gaccctatga gaactagatc
12181 caaaggaaat tgcagaagtg ctgcctattg catccctctc ctccatgagg aacttaatcc
12241 acagttgacg gctgtttaga gacgatgaaa taatattcct ttgcagtgtg gctactagta
12301 aattgacctt tctcaagtaa agaacccctc gcccatatgc atgcagccac acctaattat
12361 aagcagttac ccacaacacc cccaacaaac aggaaaatag gaaggagact tattaggaat
12421 aagaaatggt tcaaaaaaat ggaaagtaga aaataataga ggggaatacg tttaaagtgc
12481 atttcatgta tacgtctgaa aaataaggac tcaaggttca gtgggtatgg aaggggattc
12541 atctgggagg gtttggagga ggggtatgaa tatattcaca atacaataaa tgaaattctc
12601 aaagaattaa taaaattatt tataaaagaa ttactagaaa tgtttcagaa aattaaaacc
12661 cttaatgttc cccaaggatg acaaaatgat agatttatgc cctctctctc tctctctctc
12721 tctctctctc tctctctctc tctctctctc tctctgcagg agaaaaatgg caattcagag
12781 ttccacatga agaacctagt gatgacaaca ctaagcctct tctttgctgg gtctgagaca
12841 gtcagctcca cactacgcta cggcttcctt ctactcatga agcatccaga tgtggagggt
12901 gaggctggct atgtggcagg gaagttggga accgcagact ctccaactgc ttacaaccta
12961 acaatgaccc tcacttctcc caggttcctg gatgctcagt catgctcagc tatgcagaga
13021 caggggcata ttaaatgcat aaacacagtt ctcacaaact taaaatatta gacattccca
13081 aattgatttc actctgactt ccagatctct gctctctgtt ctcttccctg actcctgctt
13141 cttctcccca ccatgattct gtcacgaaaa ggataaaatg accctgtcca gcatttaggt
13201 atggatatat gtttaaatgg tttaaatgca tgttatttac agagacatgt aatacatgca
13261 gtggtacaca tgtgaactat tccacctgct ttgaggcctc tggattttta aaaatacccc
13321 atctccgctt gtctttcagc caaggtccat gaggaaattg agcaggtgat cggcaggaac
13381 cgacagcctc agtatgagga ccacatgaag atgccctaca cccaggctgt gatcaatgag
13441 atccaaagat tttctaactt ggctcccttg ggcattcctc gaaggattat caagaacaca
13501 accttccgtg gcttcttcct ccccaaggta gcagccatgc ccatccagga ggggcctcca
13561 gcccacttac tgatgcttca gggcttcttt ccatctgtag ctatctaact ccactctaat
13621 tcctccaacc aaagaattca tccacatgtc cccaaattct tgtccagctg ctttgaactc
13681 cattttctat ctactcttct gccttgctac cttccaatct ctcaactcct gggctagagg
13741 caaaggcctg ctgtcacact aacaccctat cttagcacat gatcccctgg agctcaaatc
13801 tccaattgct gatggcacat atcgtagccc ctcaaatctc ctattcccta atgccttttc
13861 ctgaggagac ctccaactct gtgccttgca gttgtctata tttggacatc ctttctccat
13921 caacccatct tctaaaatct cctttcttcc ctcttccagg gcaccgatgt gttccctata
13981 ttaggttctc tgatgacaga cccaaagttc ttccctagcc ccaaagactt cgacccccag
14041 aacttcctgg atgacaaggg acagttgaag aaaaatgctg ctttcctccc tttctccact
14101 ggtaaggaga cagtgggtta ttgaactact gttcacacca acatgggtag cacatgccag
14161 cttccctgtc tgtgatgctg cctagaatca ggctaaccag gtatagcccc tgcacctccc
14221 aagcaccaga catgctggat gcaggtgaga ggatccctgg gaccagtgat ctgtgtcaga
14281 gaccggggag gggttgggaa taccaacttt cctaggtgat gctcatgcaa gcaatttctt
14341 cacactcttt ctaatgcagc ttttaaataa ttgtttgttt ttctttattt tttaagtaat
14401 ttatttaatg tgcaatggtg tgaggttgtc agatgccttg gaactgaact tatagatgat
14461 tatgagctgc catgtggctg ctgagaattg aaccttggat cttcagaaga acagacagtg
14521 ctcttaacca atgagccatc tcccagcccc atcttcagac tcttaaaagt gggataacaa
14581 ccaggtggta taggtgcatg cctttaacca cagtactggt ggatatctga gttcaacacc
14641 agcctgggac tatagagtga gttacaggac aacccaggct acatggagga aaccatgact
14701 tcaaaaacta aaaataaata aataataggt aggtagatag atagatagat agatacatac
14761 atacatagat acatagatac atagatacat agatacatag atacatagat acatagatac
14821 atagatacat agatacatag atagatgcat agatagatac atagatagat agagacatag
14881 atagatgcat agatagatac atagatagat agatgcatag atagatagat acatagatag
14941 aaagatgcat gtatacatac atgcatgcat acatagataa atagatgact cataaaaaat
15001 taaaagaata aaaaaataaa caaggccaca gcagagcatc tacatttgag aggataatta
15061 ataattgata gaggaagcat ctgtactcca tattgctcca gcctaaaatg agttgtccca
15121 cgttgtgtgt agggacacca gggttttaag agggttagga gcctttccta atgatccctc
15181 atgctccagt atagcagccc cttctccttt tttttttctt tttttcttta ttaacttgag
15241 tatttcttat taacatttcg agtgttattc cctttcccgg tttccaggcc aacatccccc
15301 taatccctcc ccctcccctt ctttatgggt gttcccctcc ccaccctccc cccattgccg
15361 cgctcccccc aacaatcaca ttcacagggg gttcagtctt agcaggacca aggacttccc
15421 cttccattgg tgctcttact aggctattca ttgctaccta tgaggttgga gtccagggtc
15481 agtccatgta tagtctttag gtagtggctt agtccctgga agctctggtt ggttggcatt
15541 gttgttcata tggggtttcg agtcccttca agctcttcca gttctttctc tgattccttc
15601 aacgggggtc ctattctccc acccttcccc cactgccgcc ctccccccaa caatcacgtt
15661 cactggggct gaaccccatt tttaataggg ttatttgtct ccctgcggtc taacttcttg
15721 agttctttgt atattttgga tataagccct ctatctgttg taggattggt aaagatcttt
15781 tcccaatctg ttggttgccg ttttgtccta accacagtgt ctttgcctta cagaagcttt
15841 gcagttttat gagatcccat ttgtcgattc ttgatcttag agcataagcc attggtgttt
15901 tgttcaggaa attttctcca gtgcccatgt gttcaagatg cttccccact ttttttccta
15961 ttagtttgag tgtatctggt ttgatgtgga ggtccttgat ccacttggac ttaagctttg
16021 tacagcgtga taagcatgga tcaatctgca ttcttctaca tgttgacctc cagttgaacc
16081 agcaccattt gctgaaaatg ctatcttttt tccattgaat ggttttggcc cctttgtcaa
16141 aaatcaagtg accataggta ggtgggttca tttctgagtc ttcaattcta ttccattgat
16201 ctatctgtct gtctctgtac caataccatg cagtttttat cactattgct ctgtaatact
16261 gcttgagttc agggatagtg attccccctg aagtcctttt attgttgagg atagttttag
16321 ctatcctggg ttttttgtta ttctagatga atttgcaaat tgttctgtct aactctttga
16381 agaattggat tggtattttg atggggattg cattgaatct gtagatcgct tttggtaaaa
16441 tggtcatttt tactagatta atcctgccaa tccatgaaca tgggagatct ttccatcttc
16501 tgaggtcttc ttcaatttct ttcttcagcg tcttgaagtt cttattgtac agatctttta
16561 cttgcctggt taaagtcaca ccaaggtatt ttatattatt tgggactatt atgaagggta
16621 tcgtttccct aatttctttc tcggcttgtt tctcttttgt gtagaggaag gcaactgatt
16681 tatttgagtt aattttatac ccagccactt tgctgaagtt gtttatcagc tttagtagtt
16741 ctctggtgga acttttggga tcacttaaat acactatcat gtcatctgca aatagtgata
16801 ttttgacttc ttcttttcca atctttatcc ccttgatctc cttttgttgt ctgattgctc
16861 tggcttgaac ttcaagaact atattgaata agtagggaga gagtgcagcc ccttctcttt
16921 aagagaacac agctttgcac ttggcactga ggcaaggcag cggtgagagc ttccttccca
16981 actgtgctcc ttccctctct cctcttcagg gaagcgattc tgcttgggag atggcctggc
17041 taagatggag ctcttcctgc tgctcaccac tattttacag aacttccgtt tcaagttccc
17101 aatgaaacta gaagacatca acgagtcccc caaacccttg gggtttacca ggatcatacc
17161 aaagtacacc atgagcttca tgcccatctg attctgagtt gaatcaaggt ggggcaagag
17221 ggagagagag cctgaagtgg ggccagggtg caggtggaga gaacagggga ggtgaagatg
17281 agggttaaga agggaccaca cccatggaag aaacacaaaa gacttctcac tttggtaaaa
17341 ttgtaacagt cctaataaaa agaaagaaat actcagtggg cagcagtaac aacaactgag
17401 actcatgggg caaaggtggc tcacctctgc agaagctgtc ctgtccttct ctcagtcctc
17461 tacacaagag cagcatgtcc ccaagtccaa cgtacaggtt gcaaagatgg aacttacaaa
17521 tttgaaccta aactgaggtg gaaaaaactc aagttagcta ggattgatgt tttggactct
17581 atcaccagca ttcaggaggg agggaacatg gctctctacc atgtctgcca ggactacaca
17641 gtgagagctt atctcaaaag aaaaaagaaa aaaagaaaaa aatttatata tatatatata
17701 tatatgtata tatatgtata tatatatatg gagagagaga gagagagaga gagagagaga
17761 gagagagaga gagagagaga gagtttgcat tgtacatgat cagggaaata ataaaaacta
17821 gtttgacagt cacataccag tgggttctaa tttatcaaac tccaccccca cccccactgc
17881 cactgctgcc ctatgaagga actgaacaga agcttaactt tccttgggcc atttcgacag
17941 ctgttgtgtc atcaaggctt ctgttttcct atggagacac tacacatggg acagagagga
18001 taacagggag ctcatgactg agagaccttc aggccaaagc acttgaacct ttgtttatcc
18061 tgtttattct gaattttctg cttctgggct ctcatttccc caccattaaa atgagaatat
18121 caatatttac agctgcactg catctctttt tggagtgatt cctggtaact aagaaataag
18181 tagaaaatgg aaggatgaaa tccaccagga ggtttgagta aattccactg tgggaaacac
18241 aggggactgt gggatggcaa ggatgagagc tggaaagaat gcaaggccac actatgtctc
18301 atgcatattt tatatctttt ttatattctt tatatctttg tagtgttttt attagcctac
18361 aaagaaatac atttctcact ggcaacttct tacatatata tcactaccta tgttctcatt
18421 cactttcctt cgctggtctt ggcctcttcg caaaattatt caccggtaat ttattcacac
18481 tttctaattt ttgagcatgg tgcattccag taagatttaa tctctgtggc catggtgttt
18541 cacagctctg taacactgaa gcacattcat catcaactgc actgaagtca tcaacttaag
18601 aagcaaagga ggattcttct ggtctccatc tgcgcccaga gctaagtctg ccccacaacc
18661 ctccagattc aaaacctccc cagacagagc tagtcctcca ggagtgctct cactactaag
18721 gccacaagtg agaccccatt tcccttcaat accgatccaa agaggagccc accagatacc
18781 aggtaccaaa gttaaatgag gatccgttga cctgcaggtc
//
LOCUS RATCYP2A21 7247 bp ds-DNA ROD 12-JUL-1990
DEFINITION Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 1 and 2.
ACCESSION M33313
KEYWORDS LINE repetitive sequence; cytochrome P450;
hepatic steroid hydroxylase IIA2.
SEGMENT 1 of 3
SOURCE Rat (strain Sprague Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 7247)
AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J.
TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2
genes and regional localization of the CYP2A gene subfamily on
mouse chromosome 7
JOURNAL Biochemistry 29, 1329-1341 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 5556 5732 hepatic steroid hydroxylase IIA2 (CYP2A2),
exon 1
6198 + 6360 hepatic steroid hydroxylase IIA2, exon 2
pre-msg 5528 > 7247 CYP2A2 mRNA and introns
IVS 5733 6197 CYP2A2 intron A
IVS 6361 > 7247 CYP2A2 intron B
rpt < 1 649 LINE repeat
rpt 1120 2122 LINE repeat
rpt 5852 6037 dre repeat
rpt 5852 5863 5' direct repeat
rpt 6026 6037 3' direct repeat
BASE COUNT 2306 a 1563 c 1530 g 1848 t
ORIGIN
1 aactatcctc aacaataaaa ggacttctca gggaatcact atccctgaac tcaagcagta
61 ttacagagca atagtgatta aaaactgcat ggtattggta cagagacata cagatagacc
121 aatggaatag aactgaagac ccaaaaatga acccaagcac ctatggtcac ttgatttttg
181 acaaaggaac caaaaccatc caatggaaaa aagatagcat tttcagcaaa tggtgctggt
241 tcaactggag gtcagtatgt agaagaatgc agatcaatac attcttatca ccctgtacaa
301 agcttaagtc caaatggatc aaggacctcc acatcaaacc agatacactc aaactaatag
361 gagaaaaagt ggggaagcat ctcgaacaca tgggcactgg agaaaaatcc ctgaacaaaa
421 taccagtggc ctatgctcta agatcaagaa tcgacaaatg ggatttcata aaactacaaa
481 gcttctgtaa ggccaaggac actgttgtta ggacaaaacg gcaaccaaca gattgtgaaa
541 acatctttac caatcctaaa actgatagag gctcatatcc aaaatataca aagaactcat
601 gaagttagag tgcagggaga caaataaccc tattaaaaaa tggggttcat gggtgtagat
661 ctctcctgag agacacaccc agaatacagc atattcatat gcgaatgcca gcagcaatcc
721 actgaactga gaatgggacc cccgttgaag gaatcagaga aaggactgga agagcttgaa
781 ggggctcgag accccatatg aacaataatg tcaaccaacc agagcttcca gggactaagc
841 tattacccaa agactgtaca tggagtgacc ctgggctcca actgcataag tagcaatgaa
901 tagcctagta agagcacagt ggaaagggaa gcccttagtc ctgccaagac tgaaccccca
961 gtgaatgtga ttgttggggg gaggacagta atgggtggag gatggggagg ggaacaccaa
1021 tatagagggg agggggagga gttaggggga atgttggcct ggaaactggg aaagggagta
1081 acaatcgaaa tgtaaataag aaatactcaa gttgataaag ataaaaaaaa agtgaggttc
1141 agagctaaac aatgaattca cagctgagga atgccaaatg gctgagaagc accaaagaaa
1201 tgttcaacat ctttagtcat aagggaaatg caaatcaaaa caaccctgag attctacctc
1261 acaccagtca gaatggctaa gatcaaaaac tcaggtgaca ccaaatgctg gcgaggatgt
1321 ggagaaagag gaatactcct ccattgttgg taggattgca gactgctaca accattctgg
1381 aaatcagtct ggaggttcct cataaaattg gacatagatc tacctgagga cccagctcta
1441 cctctcttgg gcatataccc aaaagatgca ccaacatata acaaagacac atgctccact
1501 gtgttcatag cagccttatt tataatgggc agaagctgga aagaacccag atgcccttca
1561 acagaggaat ggatacagaa aatgtagtac atctacacaa tggaatacta ctccgctatc
1621 aaaaacaatg actttatgat attcataggc aaatggatgg aactcgaaaa tatcatcctt
1681 agtgaggtaa cccaatcaca gaaaaacata catggcatgc actcattggt aagtggatat
1741 tagcccaaat gctcaaatta ccctagatgc acagaacaca tgaaactcaa gaaggatgac
1801 caaaatgcgg atgcttcact ccttctttaa aacaggaaca agaataccct tgggagagga
1861 tagggaggca aagtttagaa cagaggcaga acgaacaccc attcagagcc tgcccacatg
1921 tggcccatac atatatagcc accaaactag ataagatgga tgaagcaaag aagtgcaggc
1981 tgacaggaga tctatgtaga tagatctctc ctgaaagaca cagccagaat acagcaaata
2041 cataggcgaa taccagcagc aaaccactga actgagaatg ggaccctgtt gaaggaatta
2101 gagaaaggac tgaatgttgt tgtaaaaata taaaaataaa gagtaatgtt ggtcttttac
2161 cccgctaggt atcttggcgg aaacacatcc cagccacgca ctttcctaca ctcaaaccct
2221 cacataaaag aacacacaac acaataatct ttgacccaat tggtaagata taattgccta
2281 cttaaacata caaagcccgg taccatccat cccttgagaa cattaataac aatttgtaaa
2341 tacacagagc agaatcttaa catcaccagc tatcttgtcc tgccacggct tctccgcccc
2401 tctctccctc ctgtctcttc ctctctccct tagtctcctc ctcttcctta aaacttctct
2461 cccgcccatc cttccttctc ctccaatgac aggcctcctt ctatcctgta cctgcccctc
2521 accagtactt tacaaattca gtggagaggt ggttctggtg aagtcacctg agttctgagt
2581 ccttgactag gcagctgtcc ttggggcagt ggaattagca tcaaaataca gtaacttcag
2641 ggcaaaccag aataactgaa agagcttgaa ggggcttgaa accccatatg aacaacaatg
2701 tcaaccaacc agagcttcca gggactaaga ctatacaagg actgaccctg ggctccaact
2761 gcataggtag caatgaatag cctagtaagg ccaccagtgg aaggggaagc ccttggtact
2821 gccaagactg aacccccagt gaatgtgatt gttgggggag gacggtaatg gggcgaggat
2881 ggggagggga acaccaatat agaggggagg gggagaggga gggggatgtt ggcctgaaaa
2941 cctggaaagg gaataacaat tgtaatgtaa ataagaaatg gctcagtggt taggagcact
3001 gactgctctc ccataggttc tgagttcaaa tcccagcaac cacatggtgg ctcacagcca
3061 tttgtatggg atccgattcc ctcttctggt gtgtctgaag acagcaacat tgtacttata
3121 aatgaataaa caaataaata aatctttaaa aaaaagaaat acccaattta ataaagatgg
3181 agaacaaaaa acaagaagat acattgctag ggctagagac atggctcagc agttaagagc
3241 actgactgct cttccagagg tcctgagttc aattcccaga accacatgat ggctcacaac
3301 aatctgcaat gggattcaaa gatcacttct ggtgtgtcta aagacagtga caatgtactc
3361 atatacatga aagaaagaat gaaatcttta aaactttcaa aagctgaaga catgctccct
3421 atattattcc aggcaaatcg aagaatttga attctatcac aaactacaat actcacatca
3481 atgagggttc ttttcatgtg ctcaaccaca caaatgtaga tttttagtta tggatttgat
3541 ctggggaacc tagacatgga cagtctccag ataatgccca cagttaccaa tacagcatcc
3601 ctctcaacat agcctttgaa gagtgtgcta taggttggcc tgttcacctc atccacttaa
3661 ctcctcagct tggtctccca aagcctcttt gactgcatcc cattcataaa ggaccacaac
3721 ccagtgaggc tttaagaatt ttgaagtact ggcagcagcc tatgccctgg ggacccctga
3781 gcatctcacc agttccaggt cggagactcg gctacatacg atggcaccga acccagatac
3841 tcactggaaa ggaccgtacc tggtgctgct gaccaccctg acagccatca actctcagcc
3901 ctcaccagcc gtgtactagc tgttggggct gagagctggg acctagagct gggaccagtt
3961 cttcaaaaag ctccctagac ttaatttcat gtttgccccg ggttttatca agataggtgt
4021 ggggataggc ttgatttcta ttacaaatga tgtaacattg catatgttag tactcctaac
4081 acttcttggg actgtgcctc agggatcaca atctgtataa gtttagaagt tctaaaagct
4141 agtcatgacc ttggtgtgta ggtttagata gtgtccagat tggaatcctg atgctaaaga
4201 cttagtaaga cacaaaaaaa ggagttgaga attacttagg gctaaggcta tctaggtgct
4261 gcaagggcag cacaaggaca tctgctgttg caatgcaagg cttatagaga attcagaact
4321 gccatttagg agtaattaaa gactccatga ataaacttag agaaaggtta gacaaaaggc
4381 agacagagaa gcgcatcagg gatggtttga gagctggttt agtagatctc cttggatgac
4441 tactctggta ttttccctta tgggaccctt cttagttttg cttctgcttc tgattatagg
4501 tccatgtgtg ttagagaaac tagttaatag gtttgactcc tacaaaaaga tagagacgct
4561 caacaaggtt ggtttgagtc ttggttcact cggtctccct ggatgactac cctactctct
4621 gctatatggc tgggccatta ctaataattt tcttggtttt agtttttgga ccctgcgtga
4681 caaacaggtt aattgctttt gttacaaatc gagtgagtgc tgtgcggttg gttctgagac
4741 aacagtacca gtcagttagg acaactggtg agaccaaata agagacttga tatcaaaatt
4801 ctaagattag aattacttag tagaagaaga ggggaatgaa aggaaaatta tacagattta
4861 aggtttaaaa atatgaagtt aaaagagtat gtttcaactc aggactaaac actgtgaaaa
4921 gcaagtccag gcagccccgc cctgccgcta gaactaacag accataaaag gaaaggaatg
4981 cagaacagac caggagtacc ggatctgact cacaggccac ctggcaggaa gagataagcc
5041 cccagccccc gacatccagg acgccccaaa cctgccaatg tgtgtagcta taccttatta
5101 cctcatcatg tgaaatagcc aatcatatgt gaacatgtct atgtgcctcg tttgaatcca
5161 ccaatccccg taactatgca tctgcttctg tacgcccgct tctgcttccc caatccctat
5221 aaaagcccca tgctggagct gctgggcgcg caagtcctcc gaagagactg tgtgcctgca
5281 ggtacctgtg ttttccaata aaccctcttg ctgattgcaa aaaaaaaaaa aaaaaaaaaa
5341 aaaaaaaaaa gaattttgaa gtaagcctgc cacctttctt cctatttgcc catgttgtgg
5401 gtgttgcaac aaagactggg tcaatgttag aaaatagggt tgggaggcaa aagactcaac
5461 tagacaaaca ggagcaaagg ccatcctgtg tccctgggag tataaaggta ctatctcagc
5521 cttggctatc agtctgtcca tcctcactgg ctactatgct ggacacagga ctgctcctgg
5581 tggtcatact ggcctcccta agtgtcatgt tcttggtgtc cctctggcag cagaaaatca
5641 gggagagatt gcctccagga cccactcctt tgcctttcat tggaaattat ctgcagctga
5701 atatgaaaga cgtatacagt tccatcacac aggtatcact ggatgagggg atggatggga
5761 catgggagtc caagaagctg ggttgttttg catgttttgt ggcagaagat tcatagagta
5821 aatccaaagt cttgtattca tggagtttag aaagataagg agcgggctgg agagatggct
5881 cagcggttaa gagcaccatg tgctcttcca aaggtcctga gttcaaatcc cagtaaccac
5941 atggtggctc acaaccatct ataatgagat ctggtgccct cttcttgtat tcttaatcat
6001 aataaataaa taaatctaaa aaaataagat aaggagctat ttcaactctt tggtttgttg
6061 tttgcattcg tttgtttgtt tgtttgtttg tttgtttttc aatcatttat ttgtagaata
6121 acacataatc tgacctctgt gtactggtct agttcagtga ataagtcatc taacagcctc
6181 catctaccca acatcagctc agtgagcgct atggtcctgt gttcaccatt caccttgggc
6241 ctcgacggat tgttgtgctt tatggatacg atgcagtcaa agaggctttg gtggaccaag
6301 ctgaggagtt cagtggacgt ggcgaactgc ctacctttaa tatactcttc aaaggctatg
6361 gtgaggagga taccacattg gggagcatgc ccaaggacat ttgttggcct catttaagta
6421 gccttcatcc taactcatct ttcccctcaa ggctgtacat agtcctctga tttttctctc
6481 catattcaag ttgaatgttg cttcttattg tgacccttcc tagtctttct atgattctct
6541 gtgggtgctt cctttcattc ttcttcaccc ttttcttcca ttctttaacc ctcataatac
6601 taggtaggag ataaaaagag atagaggaaa aaggggacac tattgttaga ctacttcctt
6661 ctgagaggta atgagttcct tagggcaagt ttgatcatct cagtcaggat atctaatttc
6721 ttcttcctgt tgttactttg cacaaggcga cttaacaaag cacagccaac agcaaccaac
6781 caacaaccaa aaccaatctc tcaaggccct tgcattaaaa taacctctga ggaatcccca
6841 gtatcctaag ggtcacactc tcagaaacta tctgcagtag gcaaaatcat acccctgcta
6901 gagcacaaaa taaatcatag gtctctgctt tggacaatct gattcatccc catattgcat
6961 acctggaatt aaaaaaacat attcctataa tatttctgta tttgtcaaaa aaaaacaaaa
7021 ttcttttttt tttatcttta agtaatactc caactttatt gaataaagga ataaatggag
7081 ttttcaagtt ttcccatcat ggttattttt aaagccacct gatacatgac agtacttatc
7141 aaaacaagat gtttatctat ttttgtcatt tgtatttttg cttaatttta tattcataat
7201 atatttaaat taactaatag ttcatggtaa cacttggcca cacaggt
//
LOCUS RATCYP2A22 4753 bp ds-DNA ROD 12-JUL-1990
DEFINITION Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 3,4 and
5.
ACCESSION M33325
KEYWORDS cytochrome P450; hepatic steroid hydroxylase IIA2.
SEGMENT 2 of 3
SOURCE Rat (strain Sprague Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4753)
AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J.
TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2
genes and regional localization of the CYP2A gene subfamily on
mouse chromosome 7
JOURNAL Biochemistry 29, 1329-1341 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 1815 1964 hepatic steroid hydroxylase IIA2 (CYP2A2),
exon 3
2182 2342 hepatic steroid hydroxylase IIA2, exon 4
3567 + 3743 hepatic steroid hydroxylase IIA2, exon 5
IVS < 1 1814 CYP2A2 intron B
IVS 1965 2181 CYP2A2 intron C
IVS 2343 3566 CYP2A2 intron D
IVS 3744 > 4753 CYP2A2 intron E
BASE COUNT 1258 a 1204 c 911 g 1380 t
ORIGIN About 5 kb after segment 1.
1 aatcaatagt ttttaagcta ctaacccttt ctagagatga tgaaaataga aaactggaag
61 aatgcctagg tagcaaatga ccttggaagt tagggactaa aaatttaagt ccacatctgt
121 gcaagataaa aattaactct tagtttgcat aagctcttat ttttttcata agtcttattt
181 gtttttttat ctttattaac ttgagtattt cttatttaca tttcgattat tattcccctt
241 cccagtttct gggtggatga ctccttttta acttagctga tatttttatt cttcttaaac
301 atttatccac acacagagca tcagtcgcag gtctgaggca caccctgcta gtgcctctgg
361 attgttttta aagatcattt gctcttactt ttctatctat gggtgttttg cttatgtgta
421 tatgtgtaca caagtctggt gcccatggaa gcaaaaagat ggagtcagat ctcctgaact
481 ccaggggttc catgagttct ataagctgtc aggcgagtgc tggggttcaa gcacaggtcc
541 tctgcaaggt cagccagtgc tcttgagtgc agagccagct ttgctgtcca tccccccgcc
601 cccgcgcatg tatttttaaa tgttgtttta catatgtcat gtgttgtccc taagatgtgt
661 ataatgctta tagaacatta cagtctggta agtgctggcc aaagttacag aagtataaaa
721 tggccttgag cagcaaaaca ttggttataa gcaagaaagt tcaaaataaa gagaaaatcc
781 acaaagagcc aaatatcttt ataacattaa ttctgtggtt gcgatttaac accaaggggg
841 tatctgtttc cctgaactaa ggggcacaga aatggctact actacttagg gtcaaaatag
901 tgactacagc tcaggacaca taagcaaaac cagagccaaa gaccagggag tggtaataaa
961 ataataaaaa atcctggctc agggattcgt cccacctttc cctggtgaaa gacacacaca
1021 gcctttatat tttagtctgc cttatgcagc acaatagctg ggcagctgcc taccctccat
1081 gttgttagaa tccatttccc tatcaatagc cttgagttga tactttacaa atttccatat
1141 tccatttttg ctgttcttaa cccaatttaa cagccttctg ggccacaatc tcttggccct
1201 tagcacatgg tatctctcct ttgcccttct tctctttctt cttccttggc ttccacggaa
1261 gctcctcggt cccattctcc ttcctcatgc tctagccaag gaaacctaaa cccctcctat
1321 gtcccttctc cccagctatt agctgctggc atctttattt accaaccaaa gtaaatgggg
1381 gcagagtccc ccaggctaag ggcagattcc aaatcttaga aggcagcacg aagcagtata
1441 gtaaacagta aaagaaaaaa acgcaacacc agagtacgtt tctatgtatg ctgtccttgc
1501 tttaatgtgg agtttctgtt ttcagaaaat gctcaaattt ggttctttta gccatgtcag
1561 cgacctggag cagcattctg agtctctctg cttctgtctg taactctctg tttccttgcc
1621 tggctgactt gttccaactt tcttactctg actgtgtctg ctgcagagcc tctgttcgtt
1681 tcttcagtgt tcttgccatc tcaatcccat ctttgtctct tttctttcct ctaagaaggc
1741 ctttccagca tgggcctggg ccttcctcag cctcagacta cctcacccca acacccatgt
1801 tcatgtctct acaggttttt cattgagcaa tgtggaacag gccaagcgta tcaggcgctt
1861 caccatagcc acattgagag attttggtgt gggcaagcgt gatgtacagg agtgtatcct
1921 ggaggaggca ggctatttga tcaagacgtt gcagggcact tgtggtaagc aagagaccat
1981 taagtgtttg ggcaagagaa agaacatccc tgacacctag accctatggg ttgtggagaa
2041 ggaggacggc gaagaccgcc taccaaacca tctccagaat ctggtgctga gagattggtg
2101 cctcactcca attcccacac catctgctaa ctcttctccc tcataatgcg aatgtcatcc
2161 aaacaatgtc acccctctca ggagccccca ttgacccttc catctacctg agcaaaacag
2221 tctccaatgt cattaactcc attgtcttcg ggaaccgctt cgactatgag gacaaagagt
2281 tcttgtcact gttggagatg atcgatgaaa tgaatatatt tgcagcctca gccacagggc
2341 aggtaaaaga ttccagctct gccaattgtg cttataatgt cctacattgg ccataccgac
2401 aaagggcaag gactacccca acgctcatgt ccacaaacat tcccctcaaa aacagaagct
2461 cccctcaaaa ccaaccttta ccttcagaaa actgaacctt tacatcagag cccacaggag
2521 ctatccagtg ctcacaatct aatgacctct ggatatctca agggcctgag aacaaagccc
2581 tctgcttggc tctcttccct gggcaggttt cccccgctta aattctgaca gatcctctgt
2641 gtggtcgtcc tgaaagttga gacacctgcc caagggagac aagtgatcac ctcaggcccc
2701 ctcctccaat cctgagcacc tacctggttc tgcaaaacta tggccagtaa agtcattcac
2761 actggacaca ctgctctccc aaaagatctc actggcacca tgacacgaga gtcacctgct
2821 tgtctcaggt aaattcagga atgagtagac aggaacctca accaaggcaa ccaagcacag
2881 acctctagat ggactgtttc cccaaacacc catacgactg ccaaccagcc acacacagtc
2941 caattcaaaa aggtctgaca ggtgtgtccc acaccttata acccgaacca tcttatcctg
3001 aatactttac tatgtggaaa acagattcta atctcaaaca aatatcaaga gatctaaatt
3061 cagccttctt tggtgcccaa acatctaaat acttgagtca ctgtgataac cctggcctga
3121 acacaggaaa cctggattaa tggtctaatc aaaaaatcaa ttgaatagtt gaatgtctgc
3181 taatgtcccc ttttgatcca gctcatccag attgtaggac aatgaccctc attctttaaa
3241 tcaactagaa aattgcagtc tctggggctt cagactgttc agtagtttaa gagcatgtac
3301 tgctcatcct gaggacctga gttcagttcc cagtacgtat gctggacatt gcacagctca
3361 aggggagtac acctgcactc gtgcacataa ttaaaagtaa aatattcaaa tgaatataaa
3421 gagttctttc aagagtggag gtgctgtttg ttgcaattca tcctaacata aatacatgaa
3481 cacctggatg aatgacttaa tacaagtgcc actcccactc aatgttgcca ctgacaagcc
3541 ttttcttttc tcctcccacc ccccagctct atgacatgtt ccattcagtg atgaagtacc
3601 tgcctggacc acagcaacag atcatcaagg ttactcagaa actggaagac ttcatgatag
3661 agaaagtgag gcagaaccat agtaccctgg accccaattc cccaaggaac ttcattgact
3721 cctttctcat ccgcatgcaa gaggtgatcc caatcatggt ggatggaatg tctaagactg
3781 agcagctgga aatcacccta gaaaaggagg aggaatataa gcccattaag tgcccatgat
3841 tctcctcaca gtcccggtta tagttaaacc tcactctttc acctgttgag ccttatccaa
3901 gccagggtat gggttagcaa attaccatga caaccgatat tccagtgttc ccctatgaga
3961 cactgttttc agtattcaac tacttagcat gcactgaagc aactgtcgaa gaccctgtgg
4021 agcctaaatt tcgcaaggag ggaaagtgtg cccagacttg catgctaact tcatgcagac
4081 agaaaactgc ttgcctctat ggctctcagg attttactat tagccacctg gactctagca
4141 tttcatatct ctgttagaaa atacatatca atacacaacc ctgaactggg caacctgggt
4201 tgttgtattt tttcttctat tatctgctct agtaattatg tattgttttt tattttaatg
4261 ttgtttttct tttttttttc atctttatta aattgaagat ttcttattta catttaaatt
4321 gttattcccc ttcccggttt ccaggccaac attctctaac ccctcccctt ccccttctat
4381 atgggcttcc ccttcatatc ctccccccat taccaccctt cccccaacaa tcacgttcac
4441 tgggtgttca gtcttggcag gacccggggc ttccccttcc actggtgctc ttacaagcct
4501 cattgcttcc tatgaggttg gagcccaggg tcagtccatg tgtagtcgtc gggtagtggc
4561 ttagtccctg gaagctctgg ttgcttagca ttgttgttca tatagggtct cgaccccttc
4621 aagctcttac actcctttcg ctgattcctt caacgggggt cccgttctca gttcagtggt
4681 ttgctcctgg catttgccta tgtatttgct gtattctggc tgtgtctctc aggagagatc
4741 cgttgacctg cag
//
LOCUS RATCYP2A23 5080 bp ds-DNA ROD 12-JUL-1990
DEFINITION Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 6,7,8 and
9.
ACCESSION M34392
KEYWORDS cytochrome P450; hepatic steroid hydroxylase IIA2.
SEGMENT 3 of 3
SOURCE Rat (strain Sprague Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5080)
AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J.
TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2
genes and regional localization of the CYP2A gene subfamily on
mouse chromosome 7
JOURNAL Biochemistry 29, 1329-1341 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 1322 1460 hepatic steroid hydroxylase IIA2 (CYP2A2),
exon 6
1876 2063 hepatic steroid hydroxylase IIA2, exon 7
2496 2637 hepatic steroid hydroxylase IIA2, exon 8
3636 3817 hepatic steroid hydroxylase IIA2, exon 9
pre-msg < 1 4588 CYP2A2 mRNA and introns
IVS < 1 1321 CYP2A2 intron E
IVS 1461 1875 CYP2A2 intron F
IVS 2064 2495 CYP2A2 intron G
IVS 2638 3635 CYP2A2 intron H
BASE COUNT 1470 a 1191 c 970 g 1449 t
ORIGIN About 15 kb after segment 1.
1 gaattctttg tatatattgg acaatagccc tctatcagat gtacaattgg taaagagctt
61 ttcccaatct gttggttgtc gttttgtcct aataacagtg tcctttgcct tacagaagct
121 ttgcaatttt atgaagtccc atttgttgat tcttgatctt agagcataag cccttggtgt
181 tctcttcagg aaattctccc atgtgccctg tgttcaaggc tcttacccgc tttctcttct
241 attagtttca gtgcatctgg ttttatttta attttgtttt atttttcttg tatatttttg
301 tacttacact tcaaatgcta tctcctttgt acattctctg atatctcctc cctgtcccca
361 tgcttctatg aggatgctct cacttccacc cacccactcc cacctcaatg ccttgacatt
421 cacctacatt ggggaaatgg gcctttactg gaccaaggac ttttcctcct attaatgatg
481 gacaatgcca tcctctgcta tatatacagc tgaagccatg cttccctcca tttgtactct
541 ttggttgggg gtttagtctc tgggagctct gagggaagag tctggttggt tgataatttt
601 gctcttccag ccatgaaatg aaagacagtc acctatacag agaaacaagc aaagcttctc
661 ctgcaaacca aagattccaa acacaacctg gacattgctt ttccaaccat tggtctggac
721 actttgagaa ctagatacaa agaaaattcc agaagtgctg ccacttgggt ccatttctga
781 ggaatttaat ccacagttga tggctgctta gagatgatga aatcatattc ctttgcagtg
841 tggctactag taaattgccc tttctcaagt gaagaaccac tcacccatat gcatgcagcc
901 acacctaatt ataagcagat ctccccccaa ataaaaacag gaaaatatga ggaagactta
961 ttagaaatta gaaatggttc aataaaataa aaatagagat aatggagggg aatatgttta
1021 aggtgcattt cacatatatg tctgaaaaat gaagactcaa gattcagtgg gtatggaatg
1081 ggattcatct gggagggctt gagggagggg tgtgaatgta ttcacagtac aataaatgaa
1141 attctcaaag aaataataaa aatatttata caataatgac tagaaatgtt ttagaaaatt
1201 aaaaccctta gtgttcccca aaaggagtac aaaatgataa atagatttgc gttctctctc
1261 tctctgtctc tgtctctgtc tctgtctctc tgtctctctc tctctctctc tcccccccca
1321 ggagaaatat gttaattcag aattccacat gaacaaccta gtgatgtcat cattaggcct
1381 cctctttgct gggactgggt cagtcagctc cacgctatac catggtttcc tgctactcat
1441 gaagcatcca gatgtggaag gtgaggctgg ctgtgtggca aggaagttgg gaaccccaga
1501 ttctccaacc tgacaatgac cctcacctct cccagatccc tggatgctca gacatcctga
1561 ctatgcagac acagaggcat attaaatgca taaacagagt actaagttaa aatattaaac
1621 attctgaaat tgatttccca ctgactgcca gatccctgtt ctctgttccc tgacttctcc
1681 ttctccccac catgatttgg tcatgaaaag gataaaatga tcctggccag catttaggta
1741 tggatgtatg tatagatggt ctaaatgcat gtttacagag acatgtaata catacagtgg
1801 tacacatgtg aactattcca catgctttga ggtctctgga tttttagaaa cagcccatct
1861 tcctttgtct tccagccaag gtccatgagg aaattgagcg agtgatcggc aggaaccgac
1921 agcctcagta tgaggaccac atgaagatgc cctacaccca ggctgtgatc aatgagatcc
1981 aaagattttc taacttggct cccttgggca ttcctcgaag gattatcaag aacacaacct
2041 tccgtggctt cttcctcccc aaggtgcagc caggcccacc caagtagggg cctccaaccc
2101 actccctgat gcttcagggc ctctttccat ctacagccat ctaactcaac tctaattcct
2161 ccaaccaaag aattcaccca catgtcccca acttcttgtc acactgcttt gaactccaag
2221 ttctatctga tcttctgcct tactactatc caatctctca actcctgggc taacacacta
2281 acacattatc tcagaacatg attcccctag agctcaaatc tccaatttct ggtggcacgc
2341 atcacagccc ctcaaaactc ctattcccta atgccctttc ctcaggagac ccccaactct
2401 gtgcctttcc gttctcttca tttggacact agcaccactt ggggtccttt ctccatcaac
2461 ccatcttctc aaatttcctt tctttcctct tccagggcac cgatgtattc cctataatag
2521 gttctctgat gacagaacca aagttcttcc ctaaccacaa agacttcaac ccccagcact
2581 tcctggatga caagggacag ttgaagaaga atgctgcatt tctccctttt tccattggta
2641 aggagacagt gggttattag accactgctc ataccaacag ggataactca tgccagttcc
2701 catctctgtg attctgccta gcatcaggct aaccaggtac aatccctgca cctcccaagc
2761 accacgactc aggtcaaagt atcaatgaga tcagtgatct ctttcagaga ctgggaagcg
2821 gttcagaaca ccaaatttcc caggtcatgc tcatgcaagc aatttcttca tactcttttt
2881 aaagcagttt taaatgattt ttttgttatt ttttaataat tcatctaatg tgcattggtg
2941 tgaggttgtc agattcatta gaactggact tatagacatt ttatctgcca tgtgggtgct
3001 gagaattgaa ccttggttct tcagaagagc agacagtgct cttaaccagt gagccatctc
3061 ccagccccat attcaaattt taaaagggga taacaaccag gtggtggtgg tacatgtctt
3121 taaacccagt actcaagaag cagaagcagg tggatatcta agttcaatgc cagctggatc
3181 tatagagtaa gttagaagaa aacccagact aaatggagga aaccctgact taaaaaacta
3241 aaaataaata aataatagat agatagatgc atgcatgtat acatacatat atgcatacct
3301 acatgcatgc atacatagat acatagatga ctcagagata attagatgaa taaataaata
3361 aacaagacca cagcaggcat ccacatctga gaataaaatt aataattggt agaggaagca
3421 tctggactcc atattgcttc agcctacaat gagttgcccc actttgtgtg tagggacact
3481 ggggttctga gagggttagg aacctttcct aatgatcact catgctccag gttagcaccc
3541 cttttcccta agagaacaag gctgctcact gggtactgag ggaaagaagt gagatcttgc
3601 tccaagtctg tgctccttac ttctctcctc tttaggaaag cgattctgct tgggagatag
3661 cctggctaaa atggagctct tcctgctgct caccaccatc ttgcagaact tccgttttaa
3721 gttcccaatg aatctagaag acatcaacga gtaccccagt cccatagggt ttaccaggat
3781 cataccaaat tacaccatga gcttcatgcc catctgattc tgagttgaat caaggtgggg
3841 caagagggag ggagagcctg aagtggggcc agggtgcagg tggagagaac agagaagatg
3901 aagatgaggg ttaagaaggg accacaccca tggaagaaac acaaaagact tctcagtttg
3961 gtaaaattgt aacagtccta ataaaaagaa agaaacaccc agtaggcagc agtaacaaca
4021 actgagactc atggggcaaa ggtggctcac ctctgcagaa gctgtcctgc ccttctctca
4081 ctcagtcctc tacacaagag cagcatgtcc ccaagcccaa cgtacaggtt caaaagatag
4141 aacttaaaaa atttgaacct aaactgaggt ggaaaagaca cagttagcta ggattgacac
4201 attggactct atcaccagca ttcaggaggg agggaacatg gctccctagg aggcctgcca
4261 gaattacaaa gtgaaactca tctcaaaaaa ggaacaacag aaaataaaat ttcaaattga
4321 tttctcttag accataagag tccagatctg tatccaaagc tatttggtta tattttttgt
4381 tattgttgtt ttgtttacac attgtgtttt tctttcggtt tgtaagtctg tttgggatat
4441 ttaatttaca tttactgatt agtgtgggtg gtagggcata ccatggctca aatgtggaaa
4501 ccaaagaaaa gcttttggaa gtgtcatctc ccttacaata cgtgtgtcca agaactcaaa
4561 ttcagacaat aaagcttgat agcaagcact tctacctact gagacatcta actggccaat
4621 ttagggagtt tattttaatt tatttactta ctaatttata tgaatataag tcctctatct
4681 gcatggccac ctgcgtggca gacgaaggca tcagatcact ttacagaagg ttgagtccac
4741 ccagtggtgg atggaaattg aactcaggac ttctagaagc cgtcaaattt tgagccacct
4801 cttcaacccc ttaaacaagt ttcttaaggt caccctttcc tcaaatgaaa caacaaggac
4861 ttggaatatt ttaacataac ctgagtcctc ctacctgagg tgttgtttct acaagcctgg
4921 caggcaactg atctacctcc aacatacact ttccaacagt cttgctttct catccacacc
4981 ttaatcacct gacacctgtt ggcctcagcc cctgtgccag gtaagtccat tttgtctgac
5041 tcagtcagtc tgggagacaa aaatcccttt gacagaattc
//
LOCUS ECOUGRE 108 bp ds-DNA SYN 12-JUL-1990
DEFINITION Synthetic uteroglobin (UG) mRNA expressed in E.coli, 5' end.
ACCESSION M34596
KEYWORDS uteroglobin.
SOURCE E.coli DNA, clone pLE103-1.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 108)
AUTHORS Miele,L., Cordella-Miele,E. and Mukherjee,A.B.
TITLE High level bacterial expression of uteroglobun, a dimeric
eukaryotic protein with two interchain disulfide bridges, in its
natural quaternary structure
JOURNAL J. Biol. Chem. 265, 6427-6435 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 91 > 108 synthetic uteroglobin
binding 77 81 ribosomal binding site (put.)
signal 10 26 phi-10 promoter
BASE COUNT 37 a 23 c 19 g 29 t
ORIGIN
1 gatccaaatt aatacgactc actataggga gaccacaacg gtttccctct agaaataatt
61 ttgtttaact ttaagaagga gatatacacc atggctgcag ccaagctt
//
LOCUS HCVCG3PE 12283 bp ss-RNA VRL 12-JUL-1990
DEFINITION Hog cholera virus polyprotein mRNA, complete cds.
ACCESSION M31768
KEYWORDS envelope glycoprotein E1.
SOURCE Hog cholera virus (strain Brescia), cDNA to viral RNA, passed in
SK-6 cells.
ORGANISM Hog cholera virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Togaviridae; Mucosal disease virus group.
REFERENCE 1 (bases 1 to 12283)
AUTHORS Moormann,R.J.M., Warmerdam,P.A.M., van der Meer,B., Schaper,W.M.M.,
Wensvoort,G. and Hulst,M.M.
TITLE Molecular cloning and nucleotide sequence of Hog cholera virus
strain Brescia and location in the genome of the sequence encoding
envelope protein E1
JOURNAL Virology 177, 184-198 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Moorman,R.J.M., 01-FEB-1990, for release after publication.
FEATURES from to/span description
pept 361 12057 hog cholera virus protein precursor
matp 2428 3538 envelope glycoprotein E1 (put.)
BASE COUNT 3850 a 2559 c 3182 g 2692 t
ORIGIN
1 agttcattct cgtgtacatg attggacaaa tcaaaatctc aatttggttc agggcctccc
61 tccagcgacg gccgagctgg gctagccatg cccacagtag gactagcaaa cggagggact
121 agccgtagtg gcgagctccc tgggtggtct aagtcctgag tacaggacag tcgtcagtag
181 ttcgacgtga gcagaagccc acctcgagat gctatgtgga cgagggcatg cccaagacac
241 accttaacct agcgggggtc gttagggtga aatcacacca tgtgatggga gtacgacctg
301 atagggtgct gcagaggccc actattaggc tagtataaaa atctctgctg tacatggcac
361 atggagttga atcattttga acttttatac aaaacaaaca aacaaaaacc aatgggagtg
421 gaggaaccgg tatacgatgt aacggggaga ccattgtttg gagacccaag tgaggtacac
481 ccacaatcaa cattgaagct accacatgat agggggagag gcaacatcaa aacaacactg
541 aagaatctac ctaggagagg tgactgcagg agtggcaacc acctaggccc ggttagtggg
601 atatatgtaa agcccggccc tgtcttttat caggactaca tgggcccagt ctatcataga
661 gcccctctgg agttttttga cgaagcacag ttctgtgagg tgaccaaaag gataggtagg
721 gtgacaggta gtgacggaaa gctttaccat atatacgtgt gcatcgatgg ttgcatcctg
781 ctgaagctag ccaagagggg cgagccaaga accctgaagt ggattagaaa tctcaccgac
841 tgtccattgt gggttaccag ttgttctgat gatggtgcaa gtgcaagtaa agagaagaaa
901 ccagatagga tcaacaaggg taaattaaag atagccccaa aagagcatga gaaggacagc
961 aggactaagc cacctgatgc tacgattgta gtggaaggag taaaatacca ggtcaaaaag
1021 aaaggtaaag ttaagggaaa gaatacccaa gacggcctgt accacaacaa gaataaacca
1081 ccagaatcta ggaagaaatt agaaaaagcc ctattggcat gggcagtgat agcaattatg
1141 ttataccaac ctgttgcagc cgaaaatata actcaatgga acctgagaga caacggtacc
1201 aatggtatcc agcacgctat gtaccttaga ggagtcagca gaagcttgca tgggatctgg
1261 ccagaaaaaa tatgcaaagg agtccccacc tacctggcca cagacacgga actgagagaa
1321 atacagggaa tgatggtagc cagcgagggg acaaactata cgtgctgtaa gttacagaga
1381 catgaatgga acaaacatgg atggtgtaac tggtataaca tagacccctg gatacagtta
1441 atgaatagaa cccaagcaaa cttggcagaa ggccctccga gcaaggagtg cgccgtgact
1501 tgcaggtacg ataaaaatgc tgacattaac gtggtcaccc aggccagaaa caggccaacc
1561 accctaactg gctgcaagaa agggaccaat ttttcttttg cgggtacagt tatagagggc
1621 ccatgtaatt tcaacgtttc tgtcgaggat atcttatatg gggatcatga gtgtggcagt
1681 ctactccagg atacggctct atacctagta gatggaatga ccaacactat agagagagcc
1741 aggcagggag ccgcgagggt gacatcttgg ctagggaggc aactccgcat tgccgggaag
1801 aggttggagg gcagaagcaa aacctggttt ggtgcctatg ccctatcacc ttattgtaat
1861 gtgacaacga aaatagggta catatggtac actaacaact gtaccccggc ttgcctcccc
1921 aaaaatacaa agataatagg ccccggtaaa tttgacacta acgcggaaga cggaaagatt
1981 ctccatgaga tggggggcca cctatcagaa tttctgctgc tctctctggt cgttctgtct
2041 gacttcgccc ctgaaacagc cagcgcgtta tacctcattt tgcactacgt gatccctcaa
2101 tcccatgaag aacctgaagg ctgtgacaca aaccagctga atttaacagt ggaactcagg
2161 actgaagacg tgataccatc atcagtctgg aatgttggca aatatgtgtg tgttagacca
2221 gactggtggc catatgaaac caaggtggct ttgttatttg aagaggcagg acaggtcgta
2281 aagttagcct tgcgggcact gagggattta accagggtct ggaatagcgc atcaaccacg
2341 gcattcctca tctgcttgat aaaagtatta agaggacagg tcgtgcaagg tgtgatatgg
2401 ctgttactgg taactggggc acaaggccgg ctagcctgca aggaagatca caggtacgct
2461 atatcaacaa ccaatgagat agggctacat ggggccgaag gtctcactac cacctggaaa
2521 gaatacaacc acaatttgca actggatgat gggaccgtca aggccatctg catggcaggt
2581 tcctttaaag tcacagcact taatgtggtt agtaggaggt atctggcatc attacataag
2641 gacgctttac ccacttccgt gacattcgag ctcctgttcg acgggaccag cccattgacc
2701 gaggaaatgg gagatgactt cgggttcgga ctgtgtccgt atgatacgag ccctgtagtc
2761 aagggaaaat acaacacaac cttgttgaat ggtagtgcat tctacctagt ttgcccaata
2821 gggtggacgg gtgttataga gtgcacggca gtgagcccga caactctgag aacagaagtg
2881 gtaaagacct tcagaagaga gaaacccttt ccgtacagaa gggattgtgt gaccactaca
2941 gtggaaaatg aagatctatt ctactgtaaa tgggggggca attggacatg tgtgaaaggt
3001 gaaccagtga cctacacggg ggggccagta aaacaatgca gatggtgtgg cttcgacttc
3061 aatgagcctg acggactccc acactacccc ataggtaagt gcattttggc aaatgagaca
3121 ggttacagaa tagtggattc aacggactgt aacagagatg gcgttgtaat cagcacagag
3181 gggagtcatg agtgcttgat tggtaacaca actgtcaagg tgcatgcatt agatgaaaga
3241 ctaggcccta tgccatgcag gcctaaggag atcgtctcta gtgcgggacc tgtaaggaaa
3301 acttcctgta cattcaacta cgcaaaaact ctgaggaaca ggtattatga gcccagggac
3361 agctatttcc aacaatatat gctcaagggc gagtatcagt actggtttga tctggatgtg
3421 accgaccgcc actcagatta cttcgcagaa ttcattgtct tggtggtggt ggcactgttg
3481 ggaggaagat atgtcctgtg gctaatagtg acctacatag ttctaacaga acaactcgcc
3541 gctggtctac agttaggcca gggtgaggta gtgttaatag ggaacttaat cacccacaca
3601 gatattgagg ttgtagtata tttcttactg ctctatttgg tcatgagaga tgagcctata
3661 aagaaatgga tactactgct gttccatgct atgaccaaca atccagttaa gaccataaca
3721 gtggcactgc tcatggttag cggggttgcc aagggtggaa agatagatgg tggttggcag
3781 cggctgccgg agaccaactt tgatatccaa ctcgcgctga cagttatagt agtcgctgtg
3841 atgttgctgg caaagaaaga tccgactacc gtccccttgg ttataacggt ggcaaccctg
3901 agaacggcta agataactaa tggacttagt acagatctag ccatagctac agtgtcaaca
3961 gctttgctaa cctggaccta cattagtgac tattataaat acaagacctt gctacagtac
4021 cttattagca cagtgacagg tatcttcttg ataagggtac tgaagggggt aggtgagtta
4081 gatttacaca ccccaacctt accatcttac agacccctct tcttcatcct cgtgtacctc
4141 atttccactg cagtggtaac aagatggaat ctggacatag ccggattgct gctgcagtgt
4201 gtcccaaccc ttttaatggt tttcacgatg tgggcagaca tccttaccct gatcctcata
4261 ctgcctactt acgagttgac aaaactatat tacctcaagg aagtgaagat tggggcagaa
4321 aggggctggt tgtggaagac caacttcaag agggtaaatg acatatacga agttgaccaa
4381 gctggtgagg gggtgtacct tttcccatca aaacaaaaga caggtacaat aacaggtact
4441 atgttgccac tgatcaaagc catactcata agttgcatca gcaataagtg gcaatttata
4501 tatctattgt acttgatatt cgaagtgtct tactaccttc acaagaagat catagatgaa
4561 atagcaggag ggaccaactt catctcgaga cttgtagccg ctctgatcga agccaattgg
4621 gcctttgaca acgaagaagt tagaggttta aagaagttct tcttgctgtc tagtagggtt
4681 aaagaactga tcatcaaaca caaagtgagg aatgaagtga tggtccactg gtttggcgac
4741 gaagaggtct atgggatgcc gaagctggtt ggcttagtca aggcagcaac actgagtaaa
4801 aataaacatt gtattttgtg caccgtctgt gaaaacagag agtggagagg agaaacctgc
4861 ccaaaatgcg gccgttttgg gccaccagtg acctgtggca tgaccctagc cgactttgaa
4921 gaaaaacact ataagaggat tttctttaga gaggatcaat cagaagggcc ggttagggag
4981 gagtatgcag ggtatctgca atatagagcc agagggcaat tattcctgag gaatctcccg
5041 gtgctagcaa caaaagtcaa gatgctcctg gtcggaaatc ttgggacgga ggtgggggat
5101 ttggaacacc ttggctgggt gctcagaggg cctgccgttt gcaagaaggt taccgaacat
5161 gagaaatgca ccacatccat aatggacaaa ttaactgctt tcttcggtgt tatgccaagg
5221 ggcaccacac ctagagcccc tgtgagattc cccacctctc tcttaaagat aagaaggggg
5281 ctggaaactg gctgggcgta cacacaccaa ggtggcatca gttcagtgga ccatgtcact
5341 tgtgggaaag acttactggt atgtgacact atgggccgga caagggttgt ttgccaatca
5401 aataacaaga tgacagacga gtccgagtat ggagttaaaa ctgactccgg atgcccggag
5461 ggagctaggt gttacgtgtt caaccgagag gcagttaata tatccgggac taaaggagct
5521 atggtccact tacaaaaaac tggaggagaa ttcacctgtg tgacagcatc agggactccg
5581 gccttctttg atctcaagaa cctcaaaggc tggtcagggc taccgatatt tgaggcatca
5641 agtggaagag tagtcggcag ggttaaggtc gggaagaatg aggactctaa accaaccaag
5701 cttatgagtg gaatacaaac agtctccaaa agtaccacag acttgacaga aatggtaaag
5761 aaaataacaa ccatgaacag gggagaattc agacaaataa cccttgccac aggtgccgga
5821 aaaaccacgg aactccctag atcagtcata gaagagatag gaaggcataa gagggtcttg
5881 gtcttgatcc ctctgagggc ggcagcagag tcagtatacc aatatatgag acaaaaacac
5941 ccaagcatag cattcaactt gaggataggg gagatgaagg aaggggacat ggccacaggg
6001 ataacctatg cctcatatgg ttacttctgt cagatgccac aacctaagct gcgagccgcg
6061 atggttgagt actccttcat attccttgat gagtaccact gttccacccc cgaacaattg
6121 gctatcatgg gaaagatcca cagattttca gagaacctgc gggtagtagc catgaccgca
6181 acaccagcag gcacggtaac aactacaggg caaaaacacc ctatagaaga atacatagcc
6241 ccagaagtga tgaaggggga agacttaggt ccagagtact tggacatagc tggactaaag
6301 ataccagtag aggagatgaa gagtaacatg ctggtctttg tgcccacaag gaacatggct
6361 gtagagacgg caaagaaact gaaagctaag ggttataact caggctacta ttatagtgga
6421 gaggatccat ctaacctgag ggtggtaaca tcacagtccc cgtacgtggt ggtagcaacc
6481 aacgcaatag aatcaggtgt tactctccca gacttggatg tggtcgtcga cacagggctt
6541 aagtgtgaaa agaggatacg gctgtcacct aagatgccct tcatagtgac gggcctgaag
6601 agaatggctg tcacgattgg ggaacaagcc cagagaaggg ggagagttgg gagagtgaag
6661 cctgggagat actacaggag tcaagaaacc cccgttggtt ccaaagatta ccattacgac
6721 ctactgcaag cacagaggta cggtatagaa gatgggataa acatcaccaa atcttttaga
6781 gagatgaatt atgattggag cctttatgag gaggatagtc tgatgattac acaattggaa
6841 atcctcaaca atctgttgat atcagaagag ctaccaatgg cagtaaaaaa tataatggcc
6901 aggactgacc acccagaacc aatccaactg gcgtacaaca gctacgaaac gcaggtgcca
6961 gttctattcc caaaaataaa aaatggagag gtgactgaca gttacgataa ctataccttc
7021 ctcaacgcaa gaaagctggg ggatgatgta ccaccctacg tgtatgccac agaggatgag
7081 gacttagcgg tagagctgct gggcttagac tggccggacc ctgggaacca aggaaccgtg
7141 gaggctggta gagcactaaa acaagtagtt ggtctatcaa cagctgagaa cgccctgtta
7201 gtagctttat tcggctatgt aggatatcag gcactctcaa agaggcatat accagtagtc
7261 acagacatat attcaattga agatcacagg ttggaagaca ccacacacct acagtatgcc
7321 ccgaatgcta tcaagacgga ggggaaggag acagaattga aggagctagc tcagggggat
7381 gtgcagagat gtatggaagc tatgactaat tatgcaagag atggcatcca attcatgaag
7441 tctcaggcac tgaaagtgaa agaaaccccc acttacaaag agacaatgga caccgtggcg
7501 gactatgtaa agaagttcat ggaggcactg gcggacagca aagaagacat cataaaatat
7561 gggttgtggg ggacgcacac agccttatat aagagcatcg gtgctaggct tgggaacgag
7621 actgcgttcg ctaccctggt cgtgaaatgg ctggcatttg ggggagaatc aatagcagac
7681 catgtcaaac aagcggccac agacttggtc gtttactata tcatcaacag acctcagttc
7741 ccaggagaca cggagacaca acaggaagga aggaaatttg tagccagcct actggtctca
7801 gccctggcta cttacactta caaaagctgg aattacaata atctgtccaa gatagttgaa
7861 ccggctttgg ctactctgcc ctatgccgcc acagctctca agctattcgc ccccactcga
7921 ttggagagcg ttgtcatact gagtaccgca atctacaaaa cctacctatc aatcaggcgc
7981 ggaaaaagcg atggtttgct aggcacaggg gttagtgcgg ctatggaaat catgtcacaa
8041 aacccagtat ctgtgggtat agcggtcatg ctaggggtgg gggccgtagc ggcccacaat
8101 gcaatcgaag ccagtgagca gaagagaaca ctactcatga aagtttttgt aaagaacttc
8161 ttggatcagg cagccactga tgaattagtc aaggagagcc ctgagaaaat aataatggct
8221 ttgtttgaag cagtgcagac agtcggcaac cctcttagac tggtatacca cgtttacgga
8281 gttttttaca aagggtggga ggcaaaagag ttggcccaaa ggacagccgg taggaatctt
8341 ttcactttga taatgtttga ggctgtggaa ctactgggag tagatagcga aggaaagatc
8401 cgccagctat caagcaatta catactagag ctcctgtata agttccgtga cagtatcaag
8461 tccagcgtga ggcagatggc aatcagctgg gcccctgccc cttttagttg tgattggaca
8521 ccgacggatg acagaatagg gcttccccaa gataatttcc tccgagtgga gacaaaatgc
8581 ccctgtggtt acaagatgaa agcagttaag aattgtgctg gggagttgag actcttagag
8641 gaggaaggct catttctctg caggaataaa ttcgggagag gttcacggaa ctacagggtg
8701 acaaaatact atgatgacaa tctatcagaa ataaagccag tgataagaat ggaaggacat
8761 gtggaactct actacaaggg agccactatt aaactggatt tcaacaacag taaaacaata
8821 ttggcaaccg ataaatggga ggtcgatcac tccactctgg tcagggtgct caagaggcac
8881 acaggggctg gatattgtgg ggcatacctg ggtgagaaac cgaaccacaa acatctgata
8941 gagagggact gcgcaaccat caccaaagat aaggtttgtt ttctcaagat gaagagaggg
9001 tgtgcattta cttatgactt atcccttcac aaccttaccc ggctgattga attggtacac
9061 aagaataact tggaagacaa agagattcct gccgttacgg tcacaacctg gctggcttac
9121 acatttgtaa atgaagatat agggaccata aaaccagcct tcggggagaa aataacacca
9181 gagatgcagg aggagataac cttgcagcct gctgtattgg tggatgcaac tgacgtgacc
9241 gtgaccgtgg taggggaaac ccctactatg actacagggg agaccccaac aacgttcacc
9301 agctcaggtc cagacccgaa aggccaacaa gttttaaaac tgggtgtagg tgaaggccaa
9361 taccccggga ctaatccaca gagagcaagc ctgcacgaag ccatacaaag cgcagatgaa
9421 aggccctctg tgctgatatt ggggtctgat aaagccacct ctaatagagt gaaaactgta
9481 aagaatgtga aggtatacag aggcagggac ccactagaag tgagagatat gatgaggagg
9541 ggaaagatcc tagtcatagc cctgtctagg gttgataatg ctctattgaa atttgtagat
9601 tacaaaggca cctttttaac tagagagacc ctggaggcat taagtttggg taggccaaaa
9661 aagaaaaaca taaccaaggc agaagcacag tggttgctgc gcctcgaaga ccaaatggaa
9721 gagctacccg attggttcgc agccggggaa cccatttttc tagaggccaa tattaaacat
9781 gacaggtatc atctggtagg ggatatagct actatcaaag agaaagccaa acaattgggg
9841 gctacagact ctacaaagat atccaaggag gttggtgcaa aagtatattc tatgaaattg
9901 agtaattggg tgatgcaaga agaaaacaaa cagagcaact tgaccccctt atttgaagag
9961 ctcctacagc agtgtccacc cggaggccaa aacaaaactg cacatatggt ctctgcttac
10021 caactagctc aagggaactg gatgccaacc agctgccatg tttttatggg gaccatatct
10081 gccagaagga ctaagaccca tccatatgaa gcatatgtca agttaaggga gttggtagag
10141 gaacacaaga tgaaaacatt gtgtcccgga tcaagtctgc gtaacgacaa tgaatgggta
10201 attggcaaga tcaaatacca gggcaacctg aggaccaaac acatgttgaa ccccggcaag
10261 gtggcagagc aactgcacag agaaggacac agacacaatg tgtataacaa gacaataggc
10321 tcagtgatga cagctactgg catcaggttg gagaagttgc ccgtggttag ggcccagaca
10381 gacacaacca acttccacca agcaataagg gataagatag acaaggaaga gaatctacag
10441 accccgggtt tacataagaa actaatggaa gttttcaatg cattgaaacg acccgagtta
10501 gagtcctcct atgacgctgt ggaatgggag gaattggaga gaggaataaa cagaaagggt
10561 gctgctggtt tctttgaacg caaaaacata ggggagatat tggattcaga gaaaattaaa
10621 gtagaagaga ttattgacaa tctgaaaaag ggtagaaata tcaaatacta tgaaaccgca
10681 atcccaaaaa atgaaaagag ggatgtcaat gatgactgga ccgcaggtga ctttgtggac
10741 gagaagaaac ccagagtcat acaataccct gaagcaaaaa caaggctggc catcaccaag
10801 gtgatgtata agtgggtgaa gcagaagcca gtagtcatac ccgggtatga agggaagaca
10861 cctctgttcc aaatttttga caaagtaaag aaggaatggg atcaattcca aaatccagtg
10921 gcagtgagct tcgacactaa ggcgtgggac acccaggtga ccacaaatga tctggagctg
10981 ataaaggaca tacaaaagta ctacttcaag aagaaatggc ataaatttat tgacaccctg
11041 actatgcata tgtcagaagt acccgtaatc actgctgatg gggaggtgta tataaggaaa
11101 gggcaaagag gtagtggaca gcccgacaca agcgcaggca acagcatgct aaatgtgtta
11161 acaatggttt atgccttctg cgaggccaca ggggtaccct acaagagttt tgacagggtg
11221 gcaaaaattc atgtgtgtgg ggacgatggt ttcctgatca cagagagagc tctcggcgag
11281 aaattcgcaa gcaagggagt ccaaatcctg tatgaagctg ggaagcccca gaagatcact
11341 gaaggggaca aaatgaaagt ggcctaccaa tttgctgata ttgagttttg ctcccataca
11401 ccaatacaag taaggtggtc agataacact tctagctaca tgccagggag aaatacaacc
11461 acaatcctgg ctaaaatggc cacaaggtta gattccagtg gtgagagggg taccatagcg
11521 tacgagaaag cagtagcatt cagcttcctg ctaatgtatt cctggaaccc actaatcaga
11581 aggatttgct tattggtact atcaactgaa ctgcaagtga aaccagggaa gtcaaccact
11641 tactattatg aaggggaccc gatatctgcc tacaaggaag tcatcggcca caatcttttc
11701 gatctcaaga gaacaagctt cgagaagctg gccaagttaa atctcagcat gtccgtactc
11761 ggggcctgga ctagacacac cagcaaaaga ctactacaag actgtgtcaa tatgggtgtt
11821 aaagagggca actggttagt caatgcagac agactggtga gtagtaagac tggaaatagg
11881 tatgtacctg gagaaggcca caccctgcaa gggagacatt atgaagaact ggcgttggca
11941 agaaaacaga tcaacagctt ccaagggaca gacaggtaca atctaggccc aatagtcaac
12001 atggtgttaa ggaggctgag agtcatgatg atgaccctga tagggagagg ggtatgagtg
12061 cgggtgaccc gcgatctgga cccgtcagta ggaccctatt gtagataaca ctaatttttt
12121 atttatttag atattactat ttatttattt atttatttat tgaatgagta agaactggta
12181 caaactacct catgttacca cactacactc attttaacag cactttagct ggaaggaaaa
12241 ttcctgacgt ccacagttgg actaaggtaa tttctaacgg ccc
//
LOCUS HUMC6A2A1 2159 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human alpha-2 collagen type VI, alpha-2 collagen type VI-a, and
alpha-2 collagen type VI-a' gene, exons 6,5,4 and 3.
ACCESSION M34571
KEYWORDS alpha-2 collagen type VI; alternative splice.
SEGMENT 1 of 3
SOURCE Human leukocyte DNA, clone D1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2159)
AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L.
TITLE Alternative splicing of the human alpha-2(VI) collagen gene
generates multiple mRNA transcripts which predict three protein
variants with distinct carboxyl termini
JOURNAL J. Biol. Chem. 265, 6473-6480 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 75 120 alpha-2 collagen type VI, exon 6
273 425 alpha-2 collagen type VI, exon 5
590 1042 alpha-2 collagen type VI, exon 4
1307 + 1345 alpha-2 collagen type VI-a, exon 3
pept / 75 120 alpha-2 collagen type VI-a, exon 6
273 425 alpha-2 collagen type VI-a, exon 5
590 1042 alpha-2 collagen type VI-a, exon 4
1307 + 1345 alpha-2 collagen type VI-a', exon 3
pept / 75 120 alpha-2 collagen type VI-a', exon 5
273 425 alpha-2 collagen type VI-a', exon 4
590 1042 alpha-2 collagen type VI-a', exon 3
1307 + 1345 alpha-2 collagen type VI-a', exon 2
pre-msg < 1 > 2159 alpha-2cVI mRNA and introns
IVS < 1 74 intron E
IVS 121 272 intron D
IVS 426 589 intron C
IVS 1043 1306 intron B
IVS 1346 > 2159 intron A
BASE COUNT 391 a 653 c 749 g 366 t
ORIGIN
1 tgtccggacc ccagccagac tgctgtgaac tcttctgggc ccggggactg ccctgcctgc
61 cgtgtgcatt gcaggagtgt gacgtcatga cctacgtgag ggagacctgc gggtgctgcg
121 gtgaggcact gcccacggca gggtcggggc ccatgcaccg ggtggagggc gggagtgcag
181 cagggctggg tcatcgctgg gtcctgcatg tgcacgtgac cctagggtct gaggtctccc
241 ggtacccccc gatgaccctg ccaccccccc agactgtgag aagcgctgtg gcgccctgga
301 cgtggtcttc gtcatcgaca gctccgagag cattgggtac accaacttca cactggagaa
361 gaacttcgtc atcaacgtgg tcaacaggct gggtgccatc gctaaggacc ccaagtccga
421 gacaggtcac ggggcagggc gggtgcagca ttgcgggggg ccgcgggcgc gtgggaggcg
481 atgagatggg agaagtccag acgcgtccct ccaacgaggg cctctgcatg gctggggatg
541 ccccagaccc cgaggcctct ggcaacgacc tcacgcgtgc ggcttgcagg gacgcgtgtg
601 ggcgtggtgc agtacagcca cgagggcacc tttgaggcca tccagctgga cgacgaacat
661 atcgactccc tgtcgagctt caaggaggct gtcaagaacc tcgagtggat tgcaggtggc
721 acctggacac cctcagccct caagtttgcc tacgaccgcc tcatcaagga gagccggcgc
781 cagaagacac gtgtgtttgc ggtggtcatc acggacgggc gccacgaccc tcgggacgat
841 gacctcaact tgcgggcgct gtgcgaccgc gacgtcacag tgacggccat cggcatcggg
901 gacatgttcc acgagaagca cgagagtgaa aacctctact ccatcgcctg cgacaagcca
961 cagcaggtgc gcaacatgac gctgttctcc gacctggtcg ctgagaagtt catcgatgac
1021 atggaggacg tcctctgccc gggtgtacgt gtgggcgcgg ggcagtcagg ccgaggagca
1081 gcaggcccca gccgcgtcta gcgtgaccgc cagggacacc cctcacctga gggacgaatg
1141 tgcagcccaa ggatcttggg ctgtgggtgg gaaggggtcg gcctctcggg ctgcagggca
1201 gacgcgccag ctcgaccctg agcctgtcta ggcagatcag tgaacggccg ctgagggttc
1261 gctagggact gaccctggcc tggccggcct ctctcctctc ttccagaccc tcagatcgtg
1321 tgcccagacc ttccctgcca aacaggtaat gcagggacct gagccaccac cccagactag
1381 caaagcagcc ctggtgtcct tcctcctcga gggccgggct gggggagggg ccgtgcaggg
1441 acccgggggc ggcggacgac tgcggaggct gctccttagg gagatggccc caggatggca
1501 gcacagggga ggaggggctt ggggaaggca ggctcccagg aacgcaggaa cagcatcacg
1561 aggccatgag gtgggtgctg ctagcctggc gctgtgctcg gcatgtggcc actggtcttg
1621 aaggcccacc atgggcttgc agtctccctc agctgccgcc cagctcccat gggctggccg
1681 tgcatgtgcc accggaggaa gccctggatc agtgagtgaa accatcccgg ggtggaagca
1741 ctgacacccc ccagcaccag caggtcttgc tccaaccctg gcctgcctcg atcgagctgc
1801 agctgcggct ctcatctctg ggagtggggg agcccatgtc cggatgattg gcccagcgtg
1861 gtgtgaagct ggagctgggg gtgccgttca gctgctgctg gactggtgct gcccccatgg
1921 tgcactgctg caaccgttgc tgggcccaca ggaggtcccc gggggcggtt atgtagctga
1981 gtccccctca ttgagccgtc cccttccagg agtgtgaggg tggggatgcc atggagacag
2041 ggtgggaggg tccagactga gaggaccaca gggtaggaaa cctccaaggg tctgctggta
2101 ctaagtcagc ccttctcagc actcgggatc gcgatgtgcg atcgagagtc catggggag
//
LOCUS HUMC6A2A2 1348 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human alpha-2 collagen type VI and alpha-2 collagen type VI-a gene,
exons 2a and 2b.
ACCESSION M34572
KEYWORDS alpha-2 collagen type VI; alternative splice.
SEGMENT 2 of 3
SOURCE Human leukocyte DNA, clone D1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1348)
AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L.
TITLE Alternative splicing of the human alpha-2(VI) collagen gene
generates multiple mRNA transcripts which predict three protein
variants with distinct carboxyl termini
JOURNAL J. Biol. Chem. 265, 6473-6480 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 437 462 alpha-2 collagen type VI, exon 2a
pept + 730 1025 alpha-2 collagen type VI-a, exon 2b
pre-msg < 1 1336 alpha-2cVI mRNA and introns
IVS < 1 436 intron A (alt. splice site)
IVS < 1 729 intron A (alt. splice site)
signal 1285 1291 polyA signal
BASE COUNT 238 a 457 c 398 g 255 t
ORIGIN About 1.0 kbp downstream from segment 1.
1 tctggctact ggtgacacac tgctgtgcct gccctggcct tctccagaca gccctgtcca
61 cccaaagccc agccaccctg gcctgcagca ggcctgtgga gttctcagtt gcgtggggac
121 cagagggtgc tggagaaaca aaccagacgc agctgaaggc agtcagggca gggcgcaatc
181 agcgataaga gctgcatagg ggccacagcg taacctgagc tccagtcggt ggaaagaaaa
241 ggcagagacg ttgcagaggc caggtctgct caggggaaga cagttctggg tgtagaggac
301 tcacatccca gagaggctga ggaagggttt accacgcaag cttctcattc gggactcttg
361 aggggtggct ggggtcttcc tggcgacggg ctgcggcact gaagccctac tggagtttgg
421 cctgtctccg gcacaggttt ggacggagct gttttgtgct gaaaggtttt ctcggggtcc
481 gtggtgtccc ccaaaggtgc caccgtgcgg gtctcctagc tccctgccag cttcctgtcc
541 ctgtgctcac tgcccccacg cctcctgcca aggccgagcc acacacccgc tccacctgca
601 tttcctctac cgactcgcca gcccaaatgc cgctcttcac tctggcctcg ctgagcggct
661 gcccgaggag gagctctagg ccgacgccca ccgcaggcct tacagtcgtc tctggacgct
721 cccttgcaga tgcaccgtgg cctggcggcg agcccccggt caccttcctc cgcacggaag
781 aggggccgga cgccaccttc cccaggacca ttcccctgat ccaacagttg ctaaacgcca
841 cggagctcac gcaggacccg gccgcctact cccagctggt ggccgtgctg gtctacaccg
901 ccgagcgggc caagttcgcc accggggtag agcggcagga ctggatggag ctgttcattg
961 acacctttaa gctggtgcac agggacatcg tgggggaccc cgagaccgcg ctggccctct
1021 gctaaagccc gggcacccgc ccagccgggc tgggccctcc ctgccacact agcttcccag
1081 ggctgccccc gacaggctgg ctctcagtgg aggccgagag atctggaatc ggggtcagcg
1141 gggctacagt ccttccaggg gctctggggc agctcccagc ctcttcccat gctggtggcc
1201 accgtgtccc ttgctgcggc tgcatcttcc agtctctcct ccgtcttcca gtggccgctc
1261 tctttataag aaccctggtc attgaattta aggcccaccc caagtccaga atgacctcgc
1321 aagaccctta actcactccc gtctgcag
//
LOCUS HUMC6A2A3 1174 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human alpha-2 collagen type VI-a' gene, exon 1.
ACCESSION M34573
KEYWORDS alpha-2 collagen type VI; alternative splice.
SEGMENT 3 of 3
SOURCE Human leukocyte DNA, clone D1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1174)
AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L.
TITLE Alternative splicing of the human alpha-2(VI) collagen gene
generates multiple mRNA transcripts which predict three protein
variants with distinct carboxyl termini
JOURNAL J. Biol. Chem. 265, 6473-6480 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 140 738 alpha-2 collagen type VI-a', exon 1
pre-msg < 1 1028 alpha-2cVI mRNA and introns
IVS < 1 139 intron A (alt. splice site)
signal 1010 1015 polyA signal
BASE COUNT 189 a 439 c 364 g 182 t
ORIGIN About 2.1 kbp downstream of segment 2.
1 ctgcagaaac gccccgcaga gcccagtggt ctgtgaggtt gcaggcaggg tgcgaatgga
61 agggacaggt gcggggctgg cacctgcccg gtcctgccca cctctcctcc gcccagcccg
121 cacctgcggt ctcccacaga gctgtccgtg gcacagtgca cgcagcggcc cgtggacatc
181 gtcttcctgc tggacggctc cgagcggctg ggtgagcaga acttccacaa ggcccggcgc
241 ttcgtggagc aggtggcgcg gcggctgacg ctggcccgga gggacgacga ccctctcaac
301 gcacgcgtgg cgctgctgca gtttggtggc cccggcgagc agcaggtggc cttcccgctg
361 agccacaacc tcactgccat ccacgaggcg ctggagacca cacaatacct gaactccttc
421 tcgcacgtgg gcgcaggcgt ggtgcacgcc atcaatgcca tcgtgcgcag cccgcgtggc
481 ggggcccgga ggcacgcaga gctgtccttc gtgttcctca cggacggcgt cacgggcaac
541 gacagtctgc acgagtcggc gcactccatg cgcaacgaga acgtggtacc caccgtgctg
601 gccttgggca gcgacgtgga catggacgtg ctcaccacgc tcagcctggg tgaccgcgcc
661 gccgtgttcc acgagaagga ctatgacagc ctggcgcaac ccggcttctt cgaccgcttc
721 atccgctgga tctgctagcg ccgccgcccg ggccccgcag tcgagggtcg tgagcccacc
781 ccgtccatgg tgctaagcgg gcccgggtcc cacacggcca gcaccgctgc tcactcggac
841 gacgccctgg gcctgcacct ctccagctcc tcccacgggg tccccgtagc cccggccccc
901 gcccagcccc aggtctcccc aggccctccg caggctgccc ggcctccctc cccctgcagc
961 catcccaagg ctcctgacct acctggcccc tgagctctgg agcaagccca ataaaggctt
1021 tgaacccatt gcgtgcctgc gagcttctgt gcgcaggaga gacctcaaag gtgtcttgtg
1081 gccaggaggg aaacactgca gctgtcgctc gcccaccagg gtcaatggct cccccgggcc
1141 cagcctgacc tcctaggaca tcaactgcag gtgc
//
LOCUS HUMC6A2AA 888 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human alpha-2 collagen type VI mRNA, 3' end.
ACCESSION M34570
KEYWORDS alpha-2 collagen type VI.
SOURCE Human fibroblast, cDNA to mRNA, clone F221.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 888)
AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L.
TITLE Alternative splicing of the human alpha-2(VI) collagen gene
generates multiple mRNA transcripts which predict three protein
variants with distinct carboxyl termini
JOURNAL J. Biol. Chem. 265, 6473-6480 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 54 alpha-2 collagen type VI
BASE COUNT 136 a 329 c 249 g 174 t
ORIGIN
1 gtgtgcccag accttccctg ccaaacaggt ttggacggag ctgttttgtg ctgaaaggtt
61 ttctcggggt ccgtggtgtc ccccaaaggt gccaccgtgc gggtctccta gctccctgcc
121 agcttcctgt ccctgtgctc actgccccca cgcctcctgc caaggccgag ccacacaccc
181 gctccacctg catttcctct accgactcgc cagcccaaat gccgctcttc actctggcct
241 cgctgagcgg ctgcccgagg aggagctcta ggccgacgcc caccgcaggc cttacagtct
301 tctctggacg ctcccttgca gatgcaccgt ggcctggcgg cgagcccccg gtcaccttcc
361 tccgcacgga agaggggccg gacgccacct tccccaggac cattcccctg atccaacagt
421 tgctaaacgc cacggagctc acgcaggacc cggccgccta ctcccagctg gtggccgtgc
481 tggtctacac cgccgagcgg gccaagttcg ccaccggggt agagcggcag gactggatgg
541 agctgttcat tgacaccttt aagctggtgc acagggacat cgtgggggac cccgagaccg
601 cgctggccct ctgctaaagc ccgggcaccc gcccagccgg gctgggccct ccctgccaca
661 ctagcttccc agggctgccc ccgacaggct ggctctcagt ggaggcccag agatctggaa
721 tcggggtcag cggggctaca gtccttccag gggctctggg gcagctccca gcctcttccc
781 atgctggtgg ccaccgtgtc ccttgctgcg gctgcatctt ccagtctctc ctccgtcttc
841 cagtggccgc tctctttata agaaccctgg tcattgaatt taaggccc
//
LOCUS PPH47CG 7726 bp ds-DNA VRL 12-JUL-1990
DEFINITION Human papillomavirus type 47 (HPV-47) +-sense strand.
ACCESSION M32305
KEYWORDS .
SOURCE Human papillomavirus type 47 DNA, clone pTZ18R.
ORGANISM Human papillomavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Papillomavirus.
REFERENCE 1 (bases 1 to 7726)
AUTHORS Kiyono,T., Adachi,A. and Ishibashi,M.
TITLE Genome organization and taxonomic position of human papillomavirus
type 47 inferred from its DNA sequence
JOURNAL Virology 177, 401-405 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
T.Kiyono, 23-FEB-1990, for release after publication.
FEATURES from to/span description
pept 966 981 E1/E4 fusion protein, exon 1
3324 4000 E1/E4 fusion protein, exon 2
pept 208 678 ORF E6
pept 668 979 ORF E7
pept 966 2783 ORF E1
pept 2725 4245 ORF E2
pept 3086 4000 ORF E4
pept 4334 5890 ORF L2
pept 5903 7447 ORF L1
pre-msg 198 4465 HPV-47-1 mRNA and intron
IVS 982 3323 HPV-47-1 intron
pre-msg < 1 4465 HPV-47-2 mRNA and intron
IVS 1359 2677 HPV-47-2 intron
signal 4424 4429 polyA signal
BASE COUNT 2369 a 1517 c 1727 g 2113 t
ORIGIN
1 aacggtaagt ttgcattaat gtaccaggtg cggtacagat catttcacaa tggatattat
61 tgttgccaac taccatagtc ataatcaagt tcttgcctgt atcgttttcg taccttacct
121 acagtatttt atattaatat ataaataaat aaatatataa atgtgtattt atttctcagg
181 ctcagttctt tgcaattatt aagacaaatg gctcagaagg ctttggaaca gactacagtt
241 aaagaggaaa agctagaact acctactact attagaggct tagctcaatt gttagacata
301 cctttagtag attgtttgct accttgcaac ttttgtggca gatttcttga ctatttagaa
361 gtttgtgaat ttgattataa aaagcttact ttaatttgga aagactacag tgtttatgcc
421 tgctgccgtt tgtgctgctc agcaactgcc acatatgaat ttaatgtttt ttatcaacaa
481 acagtgttag gtagagatat tgagctagct acaggccttt ccatttttga gattgacata
541 aggtgtcata cctgcctgtc atttcttgac attattgaaa agttagatag ctgtggaaga
601 ggacttccct ttcacaaagt aagaaacgcc tggaagggtg tttgtaggca gtgtaagcat
661 ttttacaatg attggtaaag aggtcaccgt gcgagatatt gttctggagt taagtgaggt
721 tcaacctgaa gtattaccag ttgacctgtt ttgcgacgag gaattaccaa atgaacaaca
781 ggcggaggag gagctagaca tcgacagagt cgttttcaaa gtgattgcac cgtgcggttg
841 cagctgctgc gaggtcaagc ttcgcatttt tgtgaacgca acaaaccgtg gcatcaggac
901 atttcaggaa cttttgactg gtgatctgca gctcctctgc ccagagtgcc gtgggaactg
961 caaacatggc ggattctaaa ggtagtacat ctaaagaagg gtttggtgat tggtgtattt
1021 tggaagctga ctgtagtgat gttgaggatg atttgggaca attatttgag agagatacag
1081 actcagatat ctcggacctg ttagacaatt gtgacctgga tcagggcaat tcacgggaac
1141 tatttcatca acaggagtgt aagcaaagcg aggagcaatt acaaaaacta aaacgaaagt
1201 atcttagtcc aaaagctgtc gcgcagctta gtccgcgtct tgagtcaatt tcattgtcac
1261 ctcagcagaa atccaagaga aggctctttg cagagcaaga cagcggactc gagttaacct
1321 ttaacaatga agctgaagat gttactcctg aggtggaggt accggctata gactctcggc
1381 cggatgatga tgagggagga tcaggggatg tagatattca ttatacagca ttgttgcgtt
1441 ccagcaacca aaaggccaca ttactggcaa aattcaaaca agcgtttggg gtaggcttta
1501 atgaattgac aagacaattc aaaagctaca aaacctgctg taatcattgg gttgtatccg
1561 tatatgcagt ccatgatgat ctatttgaaa gctcaaagca gctgttgcaa cagcattgtg
1621 actatatatg ggtccgtggg atagatgcaa tgtcattata tctattgtgt tttaaggcgg
1681 gaaaaaatcg tgggacagtt cataagctaa ttaccacaat gttaaatgtg catgagcaac
1741 agatattgtc tgagcctcca aagttaagaa atacagctgc tgcattattt tggtacaaag
1801 gatgtatggg acctggagtg ttcacccacg gtccttaccc tgaatggatt gcacaattaa
1861 ccattttggg ccataagagt gctgaggcaa gtgcgtttga tctgtcagtc atggttcaat
1921 gggcatttga taacaatctg tttgaggagg cagacattgc atacggatat gcaagactgg
1981 caccagagga tagcaatgca gttgcatggc ttgcacataa taaccaagct aaatatgtta
2041 gagaatgtgc tatgatggtt cgatactaca aaaaggggca aatgagagat atgagcatgt
2101 ctgagtggat atatacaagg atacatgaag tagagggaga aggacagtgg tctagcattg
2161 ttaaattttt aagatatcaa gaaataaatt ttatttcatt tttggctgct ttaaaagatt
2221 tattacattc agtacctaaa cgcaattgta ttttattcca tggccctcca aatacaggaa
2281 agtcatcgtt tggaatgtcc ttaataaaag ttctaagggg gagagtatta tcatttgtaa
2341 actccaaaag tcagttttgg ttgcagcctc ttggagaatg taaaatagca ttattagatg
2401 atgttacaga tccatgttgg gtgtatatgg atcaatattt aagaaatggg ttagatgggc
2461 attttgtgtc tttggattgt aaatatagag cacccatgca aacaaagttt ccacctttaa
2521 tacttacatc taatattaat gtacatgcag agaccaatta tagataccta catagtagaa
2581 ttaagggttt tgaatttaaa aatccatttc ctatgaaagc agataataca cctcaatttg
2641 agttaactga ccaaagctgg aaatcttttt ttacaaggct ttggacacac ttagacctga
2701 gtgaccaaga agacgagggc gaacatggag aatctcagcg agcgtttcaa tgctctgcaa
2761 gaacagctaa tgaacattta tgaagctgca gaacagacat taaaggcaca aattttacat
2821 tggcagacat tgcgaaaaga agctgtgaca ctctactttg ctaggcagaa aggcataaat
2881 aggttgggat accaaccagt gcctgcatta gcaatatctg aggcaagggc caaagaggct
2941 atatatatgg tgttgcagtt agagtcgcta caaaaatcag cgtttgcttt ggagccttgg
3001 accttagtgg acactagtac agagactttt aagagtgctc cagaaaatca ttttaaaaag
3061 gggcctgtac ctgtggaggt gatatatgac aaagatgaag caaatgctaa tttgtatact
3121 atgtggacat ttgtgtatta catggattca gatgatgtgt ggcataagac aacaagtggg
3181 gtcaatcaaa ctggcattta ctacctatat ggaacattta aacactatta tgtgttattt
3241 gctgatgatg caaagagata tagtgctact ggagaatggg aagttaaagt taataaggaa
3301 actgtgttta ctcctgtcac tagctccaca ccaccagggt caccaggagg acaaacagac
3361 ccagacacct cctccaagac ccccaccacc accacagccg ccactgacac ctcgcccaga
3421 cgccaatcca tcaataaaca gtcacaacaa accgaaacca aacgaagagg gtacggacgg
3481 agaccatcaa gcagaacaag gcgaccgcaa acgcaccaaa ggcgatccag atccagatcc
3541 cggtcgcggt ccagttctca aacccactct tccaccacca ccaccaccac cacctacagg
3601 tccaggtcta cgtcgctcaa caagactcgt gctcgttcca ggtcaaggtc cacctccaga
3661 tctaccagca ccaccagtag aaggggaggt agagggtcat ccacaaggca aagatcgcga
3721 tcaccctcca cctacacctc aaaacggtca cgggaaggaa acacaagggg cagagggagg
3781 gggagacaag ggagagcagg gagcagtggg gggagagagc agcgacggag aaggagatca
3841 ttctcaacct cccctgactc ctccaaacga gtcagacggg agtctcctaa ataccgtggc
3901 gtgtctccta gcgaggtggg aaagcaactt cgatcagttg gtgcaaaaca ttcagggcga
3961 cttggaaggt tattggagga agctagggac cccccagtaa ttcttgtgcg aggggacgca
4021 aacacattaa aatgctttcg caacagagca aggaacaaat atagagggct ttttagatca
4081 ttcagcacta cattttcctg ggtagctgga gatagcattg agcgtctagg caggtccaga
4141 atgctcatta gcttttcctg cctcactcag agaagggatt ttgatgatgc tgtcaaatat
4201 ccaaaaggag tcgagtggtc atatggtagt cttgatagcc tttaacaagc attaacgctg
4261 ctttgctact aactgctatt aacaaccaca gctttttttt tacgtttttt tattttactg
4321 attttgtact gcaatggcgc gtgctagaag ggtcaaacgt gactctgtaa cacatatata
4381 tcagacctgc aaacaggcag gcacttgccc ctcggacgtt gttaataaag ttgagcaaac
4441 aacagttgct gacaatattt tgaaatatgg cagtgctggt gtcttttttg gaggccttgg
4501 cataggaaca ggccgaggga ctgggggtgc tactgggtac gtgccacttg gggaaggtcc
4561 tggtgtccgt gtgggaggaa ccccaacggt tgtaaggcct tctcttgttc ctgaagcaat
4621 tggaccagtt gatattttac ccattgacac aatcgcacct gtcgagccta ctgcttcatc
4681 tttagtccca ttaacagagt cgtctggtgc tgatttactt cccggtgaag ttgaaactat
4741 agccgaaata catcctattc ctgaaggtcc gacaatcgac tcccctgtag tcaccacaac
4801 gacaggttcc agtgctgttc tggaagtggc tccagaacct gtacccccta cacgtgttag
4861 aattgctaga acacaatatc ataatccctc ttttcagata ctcactgaat caacacctgc
4921 gcagggcgag agttctcttg ctgaccatat tttggtcacc tcagggtctg gtggacaaag
4981 gataggcggt gatataacag acgaaattga acttactgag tttccaagca gatatacatt
5041 tgaaatagaa gaacccaccc ctccacgaaa aagtagcaca ccattacaaa ctgtagcctc
5101 tgcagtaagg cgacggggct tctcattaac aaatagaaga ttggtacaac aagtagctgt
5161 agacaatcct ttatttttaa gtcaaccttc taagatggta agattctcat ttgacaatcc
5221 agcttttgaa gaagaggtta ccaatatttt tgaacaggat gttaacagct ttgaagaacc
5281 tccagacagg gattttcttg atattaaaca attgggccgt cctcaatatt ctacaacacc
5341 agcaggttat attagggtaa gcagactagg aactcgaggc accattcgca ctcgttctgg
5401 tgcacaaata ggttctcagg tacactttta tagagattta agttctataa atactgagga
5461 tccaatagaa ctacagcttt tagggcagca ttctggagat gctactattg ttcaaggtcc
5521 tgtagaaagc acatttatag atatggacat tgctgaaaac cctttatctg aaacaataga
5581 tgcttcatct aatgatttac ttttggatga gactgtggag gattttagtg ggtcccaatt
5641 agtaattgga aatcgaagga gtacaacatc atatactgtt cccagatttg agactactag
5701 aagtagttcc tattatgttc aagacacaga tggttattat gttgcttacc cagagtcacg
5761 ggacactatt gatattattt accctacacc tgaattacct gtagttgtca ttcacaccca
5821 tgacaattct ggagactttt acttacatcc tagtcttaga aggcgtaagc gtaaaagaaa
5881 atatttgtga tttgcattgc agatggcagt gtggcactcg gctaacggta aagtatacct
5941 tcctccatca acaccagtgg ccagggttca aagcacggat gaatacatac aaaggactaa
6001 tatctattat catgcaaata ctgaccgcct tttaacagta ggacatccat atttcaatgt
6061 atacaataat aatggaacta cattagaggt tccaaaagta tcaggtaatc agcatagggt
6121 gtttcgctta aaattgccag atcctaatag atttgctcta gcggacatgt ctgtatacaa
6181 ccctgacaaa gaacgcttgg tgtgggcctg caggggtcta gaaattggaa ggggtcaacc
6241 tttaggtgtt ggcagtactg gtcacccata ttttaataag gtaaaagata cagaaaacag
6301 taattcctat atcacaaact caaaagatga cagacaagac acctcttttg atcctaaaca
6361 aatacagatg tttattgtgg gctgcactcc atgtattggc gaacactggg ataaggcaga
6421 gccttgtggg gaacagcaaa ctggtctttg tcctcctatt gaattaaaaa acacatacat
6481 tcaggatggc gacatggcag acattggttt tggcaacatt aatttcaagg ccttacaaca
6541 cagtaggtct gatgttagtc ttgacattgt aaatgaaact tgcaagtacc cggattttct
6601 caaaatgcaa aatgatgttt atggggatgc ttgctttttt tatgctcgta gagagcaatg
6661 ttatgccaga catttttttg ttagaggggg aaaaacaggt gatgacatac caggagcaca
6721 ggttggcaat ggtaatatga aaaatcaatt ttacattcct ggtgctacgg gtcaggctca
6781 gagcactata ggtaatgcca tgtatttccc aactgtcagt ggctcactag tctctagtga
6841 tgctcaactg tttaacaggc cattctggct ccaaagggct cagggtcata ataatggcat
6901 tctgtgggct aatcaaatgt ttgtcacagt tgtagacaac acaagaaata caaatttcag
6961 catctctgtt tactctcagg caggggacat aaaggatata caggattata atgcagacaa
7021 ttttagagag tatcaaagac atgtggagga atatgaaatt tctgtaatat tacaattgtg
7081 caaagttcct ttaaaagcag aagttttagc acaaattaat gccatgaatt cgtctctttt
7141 agaggaatgg cagttaggat ttgtgcctac tccagacaac cctattcagg atacatatag
7201 atatctagaa tctttggcca ctaggtgtcc tgaaaagtct cctccaaaag agaaggttga
7261 cccctacaaa ggtttaaact tttgggatgt cgatatgaca gagcgccttt ccctggattt
7321 agatcaatat tcattaggta gaaagttctt attccaggct ggattacagc agacgaccgt
7381 aaacggtaca aaaacaactc cttacagggg gtccatcaga ggaacaaagc gcaaacgaaa
7441 aaattgaaga tgaccgtttt cggtacagat tgtttaactt ttacacagta ttcaaggaat
7501 gtctgtttac tgtgactaag tgtaactctg ccaaagaaac aaccgcaccc ggtacacgta
7561 ttcagcttgt tgccaaaaca gataagcttg gcagtcagaa cacaccgtgt tcgtcgcaac
7621 acgctcggat taggtcttct gccaaaagaa atttaatctt gttatcgttt ttggcgatca
7681 catttggcac cgcgggcagc tgttttggca ctacaagaca accgtt
//
LOCUS RUBCG 9755 bp ss-RNA VRL 12-JUL-1990
DEFINITION Rubella virus complete genome encoding nonstructural protein,
capsid protein, glycoproteins E1 and E2, complete cds.
ACCESSION M15240 M18901 M32735
KEYWORDS C protein; glycoprotein; glycoprotein E1; glycoprotein E2;
hemagglutinin.
SOURCE Rubella virus (strain Therien) cDNA to genomic RNA and cDNA to
mRNA, clones pRUB1025[1010,1012,1002,1006,1015,1001].
ORGANISM Rubella virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Rubivirus.
REFERENCE 1 (bases 8155 to 9754)
AUTHORS Frey,T.K., Marr,L.D., Hemphill,M.L. and Dominguez,G.
TITLE Molecular cloning and sequencing of the region of the rubella virus
genome coding for glycoprotein E1
JOURNAL Virology 154, 228-232 (1986)
STANDARD full staff_review
REFERENCE 2 (bases 5917 to 9754; revises [1])
AUTHORS Frey,T.K. and Marr,L.D.
JOURNAL Unpublished (1987)
STANDARD full staff_review
REFERENCE 3 (bases 5247 to 8366)
AUTHORS Frey,T.K. and Marr,L.D.
TITLE Sequence of the region coding for virion proteins C and E2 and the
carboxy terminus of the nonstructural proteins of rubella virus:
comparison with alphaviruses
JOURNAL Gene 62, 85-99 (1988)
STANDARD full staff_review
REFERENCE 4 (bases 1 to 9755)
AUTHORS Domminguez,G., Wang,C.-Y. and Frey,T.K.
TITLE Sequence of the genome RNA of rubella virus: Evidence for genetic
rearrangement during togavirus evolution
JOURNAL Virology 177, 225-258 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable copy of sequence in [2] kindly
provided by T.K.Frey, 01-JUN-1987.
Draft entry and computer-readable sequence for [4] kindly submitted
by G.Dominguez, 09-MAR-1990, for release after publication.
Glycoprotein E1 contains the viral hemagglutinin activity.
Multiple copies of the C protein comprise the nucleocapsid.
FEATURES from to/span description
pept 39 6656 nonstructural polyprotein precursor
pept 6505 9696 structural polyprotein precursor
matp 6505 7404 capsid protein (C)
matp 7405 8250 glycoprotein E2
matp 8251 9693 glycoprotein E1
mRNA 6428 9755 subgenomic RNA
BASE COUNT 1457 a 3781 c 3007 g 1510 t
ORIGIN
1 atggaagcta tcggacctcg cttaggactc ccattcccat ggagaaactc ctagatgagg
61 ttcttgcccc cggtgggcct tataacttaa ccgtcggcag ttgggtaaga gaccacgtcc
121 gatcaattgt cgagggcgcg tgggaagtgc gcgatgttgt taccgctgcc caaaagcggg
181 ccatcgtagc cgtgataccc agacctgtgt tcacgcagat gcaggtcagt gatcacccag
241 cactccacgc aatttcgcgg tatacccgcc gccattggat cgagtggggc cctaaagaag
301 ccctacacgt cctcatcgac ccaagcccgg gcctgctccg cgaggtcgct cgcgttgagc
361 gccgctgggt cgcactgtgc ctccacagga cggcacgcaa actcgccacc gccctggccg
421 agacggccag cgaggcgtgg cacgctgact acgtgtgcgc gctgcgtggc gcaccgagcg
481 gccccttcta cgtccaccct gaggacgtcc cgcacggcgg tcgcgccgtg gcggacagat
541 gcttgctcta ctacacaccc atgcagatgt gcgagctgat gcgtaccatt gacgccaccc
601 tgctcgtggc ggttgacttg tggccggtcg cccttgcggc ccacgtcggc gacgactggg
661 acgacctggg cattgcctgg catctcgacc atgacggcgg ttgccccgcc gattgccgcg
721 gagccggcgc tgggcccacg cccggctaca cccgcccctg caccacacgc atctaccaag
781 tcctgccgga caccgcccac cccgggcgcc tctaccggtg cgggccccgc ctgtggacgc
841 gcgattgcgc cgtggccgaa ctctcatggg aggttgccca acactgcggg caccaggcgc
901 gcgtgcgcgc cgtgcgatgc accctcccta tccgccacgt gcgcagcctc caacccagcg
961 cgcgggtccg actcccggac ctcgtccatc tcgccgaggt gggccggtgg cggtggttca
1021 gcctcccccg ccccgtgttc cagcgcatgc tgtcctactg caagaccctg agccccgacg
1081 cgtactacag cgagcgcgtg ttcaagttca agaacgccct gtgccacagc atcacgctcg
1141 cgggcaatgt gctgcaagag gggtggaagg gcacgtgcgc cgaggaagac gcgctgtgcg
1201 catacgtagc cttccgcgcg tggcagtcta acgccaggtt ggcggggatt atgaaaggcg
1261 cgaagtgcgc cgccgactct ttgagcgtgg ccggctggct ggacaccatt tgggacgcca
1321 ttaagcggtt cctcggtagc gtgcccctcg ccgagcgcat ggaggagtgg gaacaggacg
1381 ccgcggtcgc cgccttcgac cgcggccccc tcgaggacgg cgggcgccac ttggacaccg
1441 tgcaaccccc aaaatcgccg ccccgccctg agatcgccgc gacctggatc gtccacgcag
1501 ccagcgaaga ccgccattgc gcgtgcgctc cccgctgcga cgtcccgcgc gaacgtcctt
1561 ccgcgcccgc cggccagccg gatgacgagg cgctcatccc gccgtggctg ttcgccgagc
1621 gccgtgccct ccgctgccgc gagtgggatt tcgaggctct ccgcgcgcgc gccgatacgg
1681 cggccgcgcc cgccccgccg gctccacgcc ccgcgcggta ccccaccgtg ctctaccgcc
1741 accccgccca ccacggcccg tggctcaccc ttgacgagcc gggcgaggct gacgcggccc
1801 tggtcttatg cgacccactt ggccagccgc tccggggccc tgaacgccac ttcgccgccg
1861 gcgcgcatat gtgcgcgcag gcgcgggggc tccaggcttt tgtccgtgtc gtgcctccac
1921 ccgagcgccc ctgggccgac gggggcgcca gagcgtgggc gaagttcttc cgcggctgcg
1981 cctgggcgca gcgcttgctc ggcgagccag cagttatgca cctcccatac accgatggcg
2041 acgtgccaca gctgatcgca ctggctttgc gcacgctggc ccaacagggg gccgccttgg
2101 cactctcggt gcgtgacctg cccgggggtg cagcgttcga cgcaaacgcg gtcaccgccg
2161 ccgtgcgcgc tggcccccgc cagtccgcgg ccgcgtcacc gccacccggc gaccccccgc
2221 cgccgcgccg cgcacggcga tcgcaacggc actcggacgc tcgcggcact ccgccccccg
2281 cgcctgcgcg cgacccgccg ccgcccgccc ccagcccgcc cgcgccaccc cgcgctggtg
2341 acccggtccc tcccattccc gcggggccgg cggatcgcgc gcgtgacgcc gagctggagg
2401 tcgcctgcga gccgagcggc ccccccacgt caaccagggc agacccagac agcgacatcg
2461 ttgaaagtta cgcccgcgcc gccggacccg tgcacctccg agtccgcgac atcatggacc
2521 caccgcccgg ctgcaaggtc gtggtcaacg ccgccaacga ggggctactg gccggctctg
2581 gcgtgtgcgg tgccatcttt gccaacgcca cggcggccct cgctgcaaac tgccggcgcc
2641 tcgccccatg ccccaccggc gaggcagtgg cgacacccgg ccacggctgc gggtacaccc
2701 acatcatcca cgccgtcgcg ccgcggcgtc ctcgggaccc cgccgccctc gaggagggcg
2761 aagcgctgct cgagcgcgcc taccgcagca tcgtcgcgct agccgccgcg cgtcggtggg
2821 cgtgtgtcgc gtgccccctc ctcggcgctg gcgtctacgg ctggtctgct gcggagtccc
2881 tccgagccgc gctcgcggct acgcgcaccg agcccgtcga gcgcgtgagc ctgcacatct
2941 gccaccccga ccgcgccacg ctgacgcacg cctccgtgct cgtcggcgcg gggctcgctg
3001 ccaggcgcgt cagtcctcct ccgaccgagc ccctcgcatc ttgccccgcc ggtgacccgg
3061 gccgaccggc tcagcgcagc gcgtcgcccc cagcgacccc ccttggggat gccaccgcgc
3121 ccgagccccg cggatgccag gggtgcgaac tctgccggta cacgcgcgtc accaatgacc
3181 gcgcctatgt caacctgtgg ctcgagcgcg accgcggcgc caccagctgg gccatgcgca
3241 ttcccgaggt ggttgtctac gggccggagc acctcgccac gcattttcca ttaaaccact
3301 acagtgtgct caagcccgcg gaggtcaggc ccccgcgagg catgtgcggg agtgacatgt
3361 ggcgctgccg cggctggcat ggcatgccgc aggtgcggtg caccccctcc aacgctcacg
3421 ccgccctgtg ccgcacaggc gtgccccctc gggcgagcac gcgaggcggc gagctagacc
3481 caaacacctg ctggctccgc gccgccgcca acgttgcgca ggctgcgcgc gcctgcggcg
3541 cctacacgag tgccgggtgc cccaagtgcg cctacggccg cgccctgagc gaagcccgca
3601 ctcatgagga cttcgccgcg ctgagccagc ggtggagcgc gagccacgcc gatgcctccc
3661 ctgacggcac cggagatccc ctcgaccccc tgatggagac cgtgggatgc gcctgttcgc
3721 gcgtgtgggt cggctccgag catgaggccc cgcccgacca cctcctggtg tcccttcacc
3781 gtgccccaaa tggtccgtgg ggcgtagtgc tcgaggtgcg tgcgcgcccc gaggggggca
3841 accccaccgg ccacttcgtc tgcgcggtcg gcggcggccc acgccgcgtc tcggaccgcc
3901 cccacctctg gcttgcggtc cccctgtctc ggggcggtgg cacctgtgcc gcgaccgacg
3961 aggggctggc ccaggcgtac tacgacgacc tcgaggtgcg ccgcctcggg gatgacgcca
4021 tggcccgggc ggccctcgca tcagtccaac gccctcgcaa aggcccttac aatatcaggg
4081 tatggaacat ggccgcaggc gctggcaaga ctacccgcat cctcgctgcc ttcacgcgcg
4141 aagaccttta cgtctgcccc accaatgcgc tcctgcacga gatccaggcc aaactccgcg
4201 cgcgcgatat cgacatcaag aacgccgcca cctacgagcg ccggctgacg aaaccgctcg
4261 ccgcctaccg ccgcatctac atcgatgagg cgttcactct cggcggcgag tactgcgcgt
4321 tcgttgccag ccaaaccacc gcggaggtga tctgcgtcgg tgatcgggac cagtgcggcc
4381 cacactacgc caataactgc cgcacccccg tccctgaccg ctggcctacc gagcgctcgc
4441 gccacacttg gcgcttcccc gactgctggg cggcccgcct gcgcgcgggg ctcgattatg
4501 acatcgaggg cgagcgcacc ggcaccttcg cctgcaacct ttgggacggc cgccaggtcg
4561 accttcacct cgccttctcg cgcgaaaccg tgcgccgcct tcacgaggct ggcatacgcg
4621 catacaccgt gcgcgaggcc cagggtatga gcgtcggcac cgcctgcatc catgtaggca
4681 gagacggcac ggacgttgcc ctggcgctga cacgcgacct cgccatcgtc agcctgaccc
4741 gggcctccga cgcactctac ctccacgagc tcgaggacgg ctcactgcgc gctgcggggc
4801 tcagcgcgtt cctcgacgcc ggggcactgg cggagctcaa ggaggttccc gctggcattg
4861 accgcgttgt cgccgtcgag caggcaccac caccgttgcc gcccgccgac ggcatccccg
4921 aggcccaaga cgtgccgccc ttctgccccc gcactctgga ggagctcgtc ttcggccgtg
4981 ccggccaccc ccattacgcg gacctcaacc gcgtgactga gggcgaacga gaagtgcggt
5041 acatgcgcat ctcgcgtcac ctgctcaaca agaatcacac cgagatgccc ggaacggaac
5101 gcgttctcag tgccgtttgc gccgtgcggc gctaccgcgc gggcgaggat gggtcgaccc
5161 tccgcactgc tgtggcccgc cagcacccgc gcccttttcg ccagatccca cccccgcgcg
5221 tcactgctgg ggtcgcccag gagtggcgca tgacgtactt gcgggaacgg atcgacctca
5281 ctgatgtcta cacgcagatg ggcgtggccg cgcgggagct caccgaccgc tacgcgcgcc
5341 gctatcctga gatcttcgcc ggcatgtgta ccgcccagag cctgagcgtc cccgccttcc
5401 tcaaagccac cttgaagtgc gtagacgccg ccctcggccc cagggacacc gaggactgcc
5461 acgccgctca ggggaaagcc ggccttgaga tccgggcgtg ggccaaggag tgggttcagg
5521 ttatgtcccc gcatttccgc gcgatccaga agatcatcat gcgcgccttg cgcccgcaat
5581 tccttgtggc cgctggccat acggagcccg aggtcgatgc gtggtggcag gcccattaca
5641 ccaccaacgc catcgaggtc gacttcactg agttcgacat gaaccagacc ctcgctactc
5701 gggacgtcga gctcgagatt agcgccgctc tcttgggcct cccttgcgcc gaagactacc
5761 gcgcgctccg cgccggcagc tactgcaccc tgcgcgaact gggctccact gagaccggct
5821 gcgagcgcac aagcggcgag cccgccacgc tgctgcacaa caccaccgtg gccatgtgca
5881 tggccatgcg catggtcccc aaaggcgtgc gctgggccgg gattttccag ggtgacgata
5941 tggtcatctt cctccccgag ggcgcgcgca gcgcggcact caagtggacc cccgccgagg
6001 tgggcttgtt tggcttccac atcccggtga agcacgtgag cacccctacc cccagcttct
6061 gcgggcacgt cggcaccgcg gccggcctct tccatgatgt catgcaccag gcgatcaagg
6121 tgctttgccg ccgtttcgac ccagacgtgc ttgaagaaca gcaggtggcc ctcctcgacc
6181 gcctccgggg ggtctacgcg gctctgcctg acaccgttgc cgccaatgct gcgtactacg
6241 actacagcgc ggagcgcgtc ctcgctatcg tgcgcgaact taccgcgtac gcgggggcgc
6301 ggcctcgacc acccggccac catcggcgcg ctcgaggaga ttcagacccc ctacgcgcgc
6361 gccaatctcc acgacgccga ctaacgcccc tgtacgtggg gcctttaatc ttacctactc
6421 taaccaggtc atcacccacc gttgtttcgc cgcatctggt gggtacccaa cttttgccat
6481 tcgggagagc cccagggtgc ccgaatggct tctactaccc ccatcaccat ggaggacctc
6541 cagaaggccc tcgaggcaca atcccgcgcc ctgcgcgcgg aactcgccgc cggcgcctcg
6601 cagtcgcgcc ggccgcggcc gccgcgacag cgcgactcca gcacctccgg agatgactcc
6661 ggccgtgact ccggagggcc ccgccgccgc cgcggcaacc ggggccgtgg ccagcgcagg
6721 gactggtcca gggccccgcc ccccccggag gagcggcaag aaactcgctc ccagactccg
6781 gccccgaagc catcgcgggc gccgccacaa cagcctcaac ccccgcgcat gcaaaccggg
6841 cgtgggggct ctgccccgcg ccccgagctg gggccaccga ccaacccgtt ccaagcagcc
6901 gtggcgcgtg gcctgcgccc gcctctccac gaccctgaca ccgaggcacc caccgaggcc
6961 tgcgtgacct cgtggctttg gagcgagggc gaaggcgcgg tcttttaccg cgtcgacctg
7021 catttcacca acctgggcac ccccccactc gacgaggacg gccgctggga ccctgcgctc
7081 atgtacaacc cttgcgggcc cgagccgccc gctcacgtcg tccgcgcgta caatcaacct
7141 gccggcgacg tcaggggcgt ttggggtaaa ggcgagcgca cctacgccga gcaggacttc
7201 cgcgtcggcg gcacgcgctg gcaccgactg ctgcgcatgc cagtgcgcgg cctcgacggc
7261 gacagcgccc cgcttccccc ccacaccacc gagcgcattg agacccgctc ggcgcgccat
7321 ccttggcgca tccgcttcgg tgccccccag gccttccttg ccgggctctt gctcgccacg
7381 gtcgccgttg gcaccgcgcg cgccgggctc cagccccgcg ctgatatggc ggcacctcct
7441 acgctgccgc agcccccctg tgcgcacggg cagcattacg gccaccacca ccatcagctg
7501 ccgttcctcg ggcacgacgg ccatcatggc ggcaccttgc gcgtcggcca gcattaccga
7561 aacgccagcg acgtgctgcc cggccactgg ctccaaggcg gctggggttg ctacaacctg
7621 agcgactggc accagggcac tcatgtctgt cataccaagc acatggactt ctggtgtgtg
7681 gagcacgacc gaccgccgcc cgcgaccccg acgcctctca ccaccgcggc gaactccacg
7741 accgccgcca cccccgccac tgcgccggcc ccctgccacg ccggcctcaa tgacagctgc
7801 ggcggcttct tgtctgggtg cgggccgatg cgcctgcgcc acggcgctga cacccggtgc
7861 ggtcggttga tctgcgggct gtccaccacc gcccagtacc cgcctacccg gtttggctgc
7921 gctatgcggt ggggccttcc cccctgggaa ctggtcgtcc ttaccgcccg ccccgaagac
7981 ggctggactt gccgcggcgt gcccgcccat ccaggcgccc gctgccccga actggtgagc
8041 cccatgggac gcgcgacttg ctccccagcc tcggccctct ggctcgccac agcgaacgcg
8101 ctgtctcttg atcacgccct cgcggccttc gtcctgctgg tcccgtgggt cctgatattt
8161 atggtgtgcc gccgcgcctg tcgccgccgc ggcgccgccg ccgccctcac cgcggtcgtc
8221 ctgcaggggt acaacccccc cgcctatggc gaggaggctt tcacctacct ctgcactgca
8281 ccggggtgcg ccactcaagc acctgtcccc gtgcgcctcg ctggcgtccg ttttgagtcc
8341 aagattgtgg acggcggctg ctttgcccca tgggacctcg aggccactgg agcctgcatt
8401 tgcgagatcc ccactgatgt ctcgtgcgag ggcttggggg cctgggtacc cgcagcccct
8461 tgcgcgcgca tctggaatgg cacacagcgc gcgtgcacct tctgggctgt caacgcctac
8521 tcctctggcg ggtacgcgca gctggcctct tacttcaacc ctggcggcag ctactacaag
8581 cagtaccacc ctaccgcgtg cgaggttgaa cctgccttcg gacacagcga cgcggcctgc
8641 tggggcttcc ccaccgacac cgtgatgagc gtgttcgccc ttgctagcta cgtccagcac
8701 cctcacaaga ccgtccgggt caagttccat acagagacca ggaccgtctg gcaactctcc
8761 gttgccggcg tgtcgtgcaa cgtcaccact gaacacccgt tctgcaacac gccgcacgga
8821 caactcgagg tccaggtccc gcccgacccc ggggacctgg ttgagtacat tatgaattac
8881 accggcaatc agcagtcccg gtggggcctc gggagcccga attgccacgg ccccgattgg
8941 gcctccccgg tttgccaacg ccattcccct gactgctcgc ggcttgtggg ggccacgcca
9001 gagcgccccc ggctgcgcct ggtcgacgcc gacgaccccc tgctgcgcac tgcccctgga
9061 cccggcgagg tgtgggtcac gcctgtcata ggctctcagg cgcgcaagtg cggactccac
9121 atacgcgctg gaccgtacgg ccatgctacc gtcgaaatgc ccgagtggat ccacgcccac
9181 accaccagcg acccctggca tccaccgggc cccttggggc tgaagttcaa gacagttcgc
9241 ccggtggccc tgccacgcac gttagcgcca ccccgcaatg tgcgtgtgac cgggtgctac
9301 cagtgcggta cccccgcgct ggtggaaggc cttgcccccg ggggaggcaa ttgccatctc
9361 accgtcaatg gcgaggacct cggcgccgtc ccccctggga agttcgtcac cgccgccctc
9421 ctcaacaccc ccccgcccta ccaagtcagc tgcgggggcg agagcgatcg cgcgaccgcg
9481 cgggtcatcg accccgccgc gcaatcgttt accggcgtgg tgtatggcac acacaccact
9541 gctgtgtcgg agacccggca gacctgggcg gagtgggctg ctgcccattg gtggcagctc
9601 actctgggcg ccatttgcgc cctcccactc gctggcttac tcgcttgctg tgccaaatgc
9661 ttgtactact tgcgcggcgc tatagcgcct cgctagtggg cccccgcgcg aaacccgcac
9721 taggccacta gatccccgca cctgttgctg tatag
//
LOCUS XEL68KSA 2009 bp ss-mRNA VRT 12-JUL-1990
DEFINITION X.laevis 68 kDa serum albumin mRNA, complete cds.
ACCESSION M18350
KEYWORDS serum albumin.
SOURCE X.laevis adult liver hepatocyte (lambda-ZAP library), cDNA to mRNA,
clone pX1A14.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2009)
AUTHORS Moskaitis,J.E., Sargent,T.D., Smith,L.H.Jr., Pastori,R.L. and
Schoenberg,D.R.
TITLE Xenopus laevis serum albumin: Sequence of the cDNAs encoding the 68
and 74 kDa peptides, relationship of the 74 kDa albumin to
alpha-fetoprotein, and the regulation of albumin gene expression by
thyroid hormone during development
JOURNAL Mol. Endocrinol. 3, 464-473 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1842 to 2009)
AUTHORS Schoenberg,D.R., Moskaitis,J.E., Smith,L.H. and Pastori,R.L.
TITLE Extranuclear estrogen-regulated destabilization of Xenopus laevis
serum albumin mRNA
JOURNAL Mol. Endocrinol. 3, 805-814 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by D.Schoenberg, 14-NOV-1988. Draft entry and computer-readable
sequence for [2] kindly provided by D.Schoenberg, 18-DEC-1989.
FEATURES from to/span description
pept 39 1859 68 kDa serum albumin precursor
sigp 39 110 68 kDa serum albumin signal peptide
matp 111 1856 68 kDa serum albumin
signal 1986 1994 poly-A signal
site 1 1 cap site
BASE COUNT 699 a 388 c 393 g 529 t
ORIGIN 87 bp upstream of HinfI site.
1 aggcttctca gaggtcccca cccaatacat ctccagtcat gaagtggatc accctcattt
61 gtctgttaat tagctccact ttaatagaat caagaataat tttcaaaaga gatacagatg
121 tagaccatca caagcatatt gctgacatgt acaatttatt gactgagcgg accttcaaag
181 gacttacatt ggctattgtc tcacagaatc tccagaaatg ttcattggag gagctgtcta
241 aactggtgaa tgaaattaat gactttgcca aatcctgtac aggaaacgac aaaactcctg
301 agtgtgaaaa acccataggc accctgtttt atgacaaact ctgcgcagat ccaaaagtgg
361 gtgttaatta tgagtggagc aaagagtgct gttctaagca agatccagag agagcacagt
421 gcttcagggc acatagagtt tttgaacata atccagtaag gcctaaacct gaggaaactt
481 gtgcattatt caaagaacac cctgatgatc ttctctcagc attcatacat gaagaggcga
541 gaaaccatcc agacctttat cccccagcag tactattatt aacacagcaa tatggcaaac
601 ttgttgaaca ttgttgtgaa gaagaagaca aggataaatg ctttgcagaa aagatgaagg
661 aactgatgaa acacagtcat tctattgaag ataagcaaaa acatttctgc tggattgtaa
721 ataattatcc tgaaagagtt attaaagcac taaatttggc cagagtgagc cacagatatc
781 ctaagcctga tttcaagctt gcccataaat ttaccgagga gactacacac ttcattaagg
841 attgttgtca tggggacatg tttgaatgca tgacagagag gctggagctt tctgagcata
901 cctgtcaaca taaagatgag ttatcaacaa aacttgaaaa atgctgtaac ttacctttgc
961 ttgagcgtac atactgcatt gtcaccttgg aaaatgatga cgttcctgct gaattatcaa
1021 agccaattac agaatttaca gaggaccctc atgtttgtga gaagtatgct gagaataaaa
1081 gtttcttaga gatatctcca tggcagagtc aagaaacacc agaattgtct gaacaattcc
1141 ttttgcaatc tgcaaaagaa tatgaatctt tgctgaacaa gtgctgcttt tcagacaatc
1201 ctcctgaatg ctacaaggat ggagctgaca gatttatgaa tgaagccaag gagagatttg
1261 catatttgaa acaaaactgt gatatcttgc atgaacatgg agaatatctc tttgaaaatg
1321 aattgctcat aagatacaca aagaaaatgc cccaagtgtc agatgaaaca ttgattggaa
1381 tagcacacca aatggcagat attggtgagc actgctgtgc cgtacctgaa aatcaaagga
1441 tgccatgtgc agaaggagac cttaccattc tcattggaaa aatgtgtgaa aggcaaaaga
1501 agacatttat aaataaccac gttgctcatt gctgcactga ctcatattct gggatgcgtt
1561 catgctttac tgctcttggt ccagatgagg actatgtacc acccccagtt actgatgaca
1621 catttcactt tgacgacaag atatgcactg ctaatgataa agaaaaacag catatcaaac
1681 agaaattcct tgtgaagctg attaaagtta gtcctaaatt ggaaaaaaat cacattgatg
1741 aatggctgct ggaattcctt aagatggtac agaaatgctg tactgcagat gaacaccagc
1801 catgttttga tacagagaaa ccagtactga ttgaacactg tcaaaaactc catccataag
1861 agtccataag agcaaagacc agtcttcaaa ctcactgagg aacaccttcc atctctcaaa
1921 cacaagaaaa aaaagttcct tcagctgaaa agagcatttg cttagagcat tcaactgtgt
1981 gttgtaataa ataaagcatt ttaaaaaat
//
LOCUS XEL74KSA 1957 bp ss-mRNA VRT 12-JUL-1990
DEFINITION Xenopus laevis 74 kDa serum albumin mRNA, complete cds.
ACCESSION M21442
KEYWORDS serum albumin.
SOURCE X.laevis adult liver hepatocyte, cDNA to mRNA, clone pX1A74.1.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1957)
AUTHORS Moskaitis,J.E., Sargent,T.D., Smith,L.H.Jr., Pastori,R.L. and
Schoenberg,D.R.
TITLE Xenopus laevis serum albumin: Sequence of the complementary
deoxyribonucleic acids encoding the 68- and 74-kilodalton peptides
and the regulation of albumin gene expression by thyroid hormone
during development
JOURNAL Mol. Endocrinol. 3, 464-473 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1801 to 1957)
AUTHORS Schoenberg,D.R., Moskaitis,J.E., Smith,L.H. and Pastori,R.L.
TITLE Extranuclear estrogen-regulated destabilization of Xenopus laevis
serum albumin mRNA
JOURNAL Mol. Endocrinol. 3, 805-814 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by D.Schoenberg, 14-NOV-1988. Draft entry and computer-readable
sequence for [2] kindly provided by D.Schoenberg, 18-DEC-1989.
FEATURES from to/span description
pept < 1 1818 74 kDa serum albumin (AA at 1)
sigp < 1 66 74 kDa serum albumin signal peptide
matp 67 1815 74 kDa serum albumin
signal 1935 1944 poly-A signal
BASE COUNT 667 a 383 c 382 g 525 t
ORIGIN
1 tggatcaccc tgatttgtct gttaattagc tcctctttca ttgaatcaag gatacttttc
61 aaaagagata cagatgcaga ccatcacaag catattgctg atgtatacac cgcattgact
121 gagcggacct tcaaaggact tacattggct attgtctctc agaatctcca gaaatgttcg
181 ttggaggagt tatctaagct ggtgaatgaa ataaatgact ttgccaaatc ctgtattaat
241 gacaaaactc ctgagtgtga aaaaccagtg ggcaccctgt tttttgacaa actctgtgca
301 gatccagcag tgggtgttaa ttatgagtgg agcaaagagt gctgtgccaa gcaagatcca
361 gagagggctc agtgcttcaa ggcgcacaga gatcatgaac atacttcaat aaagcctgaa
421 cctgaggaaa cctgcaaatt actcaaagaa caccctgatg atcttctctc agcgttcatt
481 catgaagagg caagaaacca tccagacctt tatccaccag cagtattagc attaaccaag
541 caatatcaca aacttgctga acattgttgt gaagaagaag acaaggaaaa atgcttctca
601 gaaaagatga agcaacttat gaaacaatct cattccattg aagataagca acatcatttc
661 tgctggattc tggataattt tcctgaaaaa gttcttaaag cactaaattt ggccagagtg
721 agccacagat atcctaaagc tgaattcaag cttgcccata attttactga ggaggttaca
781 cactttatta aagattgttg ccatgacgac atgtttgaat gcatgactga gaggctggag
841 cttactgagc atacctgtca acataaagat gagttatcat caaaacttga aaaatgctgt
901 aatatacctt tgcttgagcg tacatactgc attgtcacct tggaaaatga tgacgttcct
961 gctgaattgt ctcagccaat tacagaattt acagaggacc ctcatgtgtg tgagaagtat
1021 gctgagaata acgaagtttt cttaggaaga tatctccatg ctgtgtcaag aaaacaccag
1081 gaattgtctg aacaattcct tttgcaatct gcaaaagaat atgaatcttt gctgaacaag
1141 tgctgcaaaa cagacaatcc tcctgaatgc tacaaggatg gagctgacag atttatgaat
1201 gaagccaagg agagatttgc atatttgaaa caaaactgtg atatcttgca tgaacatgga
1261 gaatatctct ttgaaaatga attgctcata agatacacaa agaaaatgcc ccaagtgtca
1321 gatgaaacat tgattggaat agcacaccaa atggcagata ttggtgagca ctgctgtgcc
1381 gtacctgaaa atcaaaggat gccatgtgca gaaggagacc ttaccattct cattggaaaa
1441 atgtgtgaaa ggcaaaagaa gacatttata aataaccacg ttgctcattg ctgcactgac
1501 tcatattctg ggatgcgttc atgctttact gctcttggtc cagatgagga ctatgtacca
1561 cccccagtta ctgatgacac atttcacttt gacgacaaga tatgcactgc taatgataaa
1621 gaaaaacagc atatcaaaca gaaattcctt gtgaagctga ttaaagttag tcctaaattg
1681 gaaaaaaatc acattgatga atgttctgct gaattcctta agatggtaca gaaatgctgt
1741 actgcagatg aacaccagcc atgttttgat acagagaaac cagtactgat tgaacactgt
1801 caaaaactcc atccataaga gtccattaga gcaaaggcca gccttcaaac tcactgagga
1861 acatcttcca tctctcacat gaaaaaagtt tcctccatct gaaaagaaaa tttgttcatt
1921 caactgtctg ttgaaataaa taaagcgttt aaaatat
//
LOCUS MUSHOX28A 216 bp ds-DNA ROD 12-JUL-1990
DEFINITION Mouse homeobox protein gene Hox-2.8, partial cds.
ACCESSION M34004
KEYWORDS homeobox protein.
SOURCE Mouse (strain CBA) DNA, clone YNOTHOX-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 216)
AUTHORS Rubock,M.J., Larin,Z., Cook,M., Papalopulu,N., Krumlauf,R. and
Lehrach,H.
TITLE A yeast artificial chromosome containing the mouse homeobox cluster
Hox-2
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4751-4755 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Krumlauf, 04-MAY-1990, for release after publication.
FEATURES from to/span description
pept < 1 > 216 Hox-2.8 homeobox protein (AA at 1)
site 34 216 homeobox Hox-2
BASE COUNT 51 a 68 c 66 g 31 t
ORIGIN Chromosome 11D.
1 ggccccggat tgccagaatg cggcggcagc ggctcccgca gactgcgcac ggcctacacc
61 aacacgcaac tgctggagct ggagaaggag ttccacttca ataagtacct gtgccggccg
121 cgtcgcgtcg agatcgctgc cttgctggac ctcaccgaaa ggcaggtcaa agtctggttc
181 cagaaccgac gcatgaaaca caagcggcag acggag
//
LOCUS MUSHOX29A 183 bp ds-DNA ROD 12-JUL-1990
DEFINITION Mouse homeobox protein gene Hox-2.9, partial cds.
ACCESSION M34005
KEYWORDS homeobox protein.
SOURCE Mouse (strain CBA) DNA, clone YNOTHOX-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 183)
AUTHORS Rubock,M.J., Larin,Z., Cook,M., Papalopulu,N., Krumlauf,R. and
Lehrach,H.
TITLE A yeast artificial chromosome containing the mouse homeobox cluster
Hox-2
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4751-4755 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Krumlauf, 04-MAY-1990, for release after publication.
FEATURES from to/span description
pept < 1 > 183 Hox-2.9 homeobox protein (AA at 1)
site 1 183 homeobox Hox-2
BASE COUNT 50 a 53 c 54 g 26 t
ORIGIN Chromosome 11D.
1 cccggcggtc tccgcacaaa cttcaccacg cgccagctga cggagctgga gaaggaattt
61 catttcaaca aatacctgag ccgtgcccgg agggtggaga tcgccgccac cctggagctc
121 aatgaaacgc aggtgaagat ctggttccag aaccggcgca tgaagcagaa gaaacgcgag
181 cga
//
LOCUS BCCIPMD 1101 bp ds-DNA BCT 12-JUL-1990
DEFINITION B.coagulans 3-isopropylmalate dehydrogenase gene, complete cds.
ACCESSION M33099
KEYWORDS 3-isopropylmalate dehydrogenase.
SOURCE B.coagulans (ATCC 7051) DNA.
ORGANISM Bacillus coagulans
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1101)
AUTHORS Sekiguchi,T., Ortega-Cesena,J., Nosoh,Y., Ohashi,S., Tsuda,K. and
Kanaya,S.
TITLE DNA and amino-acid sequences of 3-isopropylmalate dehydrogenase of
Bacillus coagulans. Comparison with the enzymes of Saccharomyces
cerevisiae and Thermus thermophilus
JOURNAL Biochim. Biophys. Acta 867, 36-44 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 1101 3-isopropylmalate dehydrogenase
BASE COUNT 288 a 255 c 328 g 230 t
ORIGIN
1 atgaaaatga aactggccgt actgcccggc gatgggatcg ggccggaagt gatggatgca
61 gcgatccgcg ttttaaaaac agtgttggac aatgacgggc atgaagccgt ttttgaaaat
121 gcgctgattg ggggcgccgc cattgatgaa gcggggacgc ccctaccgga agaaacgctt
181 gacatttgcc gcaggagcga tgccattttg ctcggcgcgg taggggggcc gaaatgggat
241 cataacccgg cttccctccg cccggaaaaa ggcctgctcg ggctccggaa agaaatgggg
301 ctgtttgcga acctgcgccc ggttaaagca tatgccacac ttttaaacgc atcgccttta
361 aaacgggaac gtgtggaaaa cgtcgatctt gttattgtcc gcgaactgac gggcggcctc
421 tattttgggc gcccgagtga aaggcgcggg ccgggcgaga atgaagtggt agacacgctt
481 gcctatacaa gggaagagat tgaaagaatt attgagaaag cattccagct tgcccaaatc
541 agaagaaaaa aactggcatc cgtcgataag gcgaatgtgc tggaatcaag cagaatgtgg
601 cgcgaaattg cggaagaaac cgcgaaaaag tatccggacg tggaattgag ccatatgctt
661 gtcgactcaa cttcgatgca gctgattgca aatccgggcc aatttgatgt cattgtaaca
721 gagaatatgt tcggcgatat tttaagcgat gaagcgtccg tgattaccgg cagcctcggc
781 atgttgccat ccgcaagcct ccgttccgac cggttcggca tgtatgaacc ggtccacggc
841 tccgcgccgg atattgccgg gcagggaaaa gccaacccgc tcgggacagt gctgtcagcg
901 gctttgatgc tccgttattc gttcgggctt gagaaagaag cggcggccat tgaaaaagca
961 gtggatgatg tgcttcaaga cggctattgt acaggcgatt tgcaggtggc aaacggaaaa
1021 gtggtcagta caattgagct cacagaccgg ctgatcgaaa aattaaataa cagcgcagcc
1081 ggtccgcgca tttttcaata a
//
LOCUS DROSGS3A 151 bp ds-DNA SYN 12-JUL-1990
DEFINITION D.melanogaster synthetic Sgs-3 glue protein gene/Adh gene, 5'
flank.
ACCESSION M34726
KEYWORDS alcohol dehydrogenase; glue protein.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 151)
AUTHORS Roark,M., Raghavan,K.V., Todo,T., Mayeda,C.A. and Meyerowitz,E.M.
TITLE Cooperative enhancement at the Drosophila Sgs-3 locus
JOURNAL Dev. Biol. 139, 121-133 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 131 > 151 synthetic Sgs-3 glue protein mRNA
recomb 130 131 Adh DNA end/Sgs-3 synthetic DNA start
BASE COUNT 51 a 25 c 36 g 39 t
ORIGIN
1 gtcgacccaa aagtatcaaa caaaggggag aaggcttgtg tttgcataat cgaaatactg
61 actccatttt tagaattgca gtttcagtga aagcgtacct ataaaaaggt gaggtatccg
121 caagaaaagt atcagtttgt ggtaccgagc t
//
LOCUS MZESOD3A 1037 bp ss-mRNA PLN 12-JUL-1990
DEFINITION Z.mays manganese superoxide dismutase (SOD-3) mRNA, complete cds.
ACCESSION M33119
KEYWORDS manganese superoxide dismutase; superoxide dismutase.
SOURCE Z.mays (strain W64A), cDNA to mRNA, clone pSod3.1c.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1037)
AUTHORS White,J.A. and Scandalios,J.G.
TITLE Isolation and characterization of a cDNA for mitochondrial
manganese superoxide dismutase (SOD-3) of maize and its relation to
other manganese superoxide dismutases
JOURNAL Biochim. Biophys. Acta 951, 61-70 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 46 753 manganese superoxide dismutase (SOD-3) (EC
1.15.1.1)
BASE COUNT 237 a 259 c 296 g 245 t
ORIGIN
1 gaattccacg cacccaggag atacagcgag cgagcgacca aagccatggc tctccgcacc
61 ctggcatcga agaaggtcct atccttcccg ttcggcggcg cgggccggcc gttggcggcg
121 gcggcgtctg cgaggggggt gacgacggtc acactccccg acctctccta cgacttcggc
181 gcgctggaac cggccatctc gggggagatc atgcgcttgc accaccaaaa gcaccacgcc
241 acctacgtcg ccaactacaa caaggcgctg gagcagcttg aaactgccgt ctccaagggc
301 gacgcctccg ctgtcgtcca gctgcaggcg gcgatcaagt tcaacggcgg cggtcatgtg
361 aaccattcaa tcttctggaa gaacctcaag cccattagcg aaggtggcgg ggagccgcct
421 catgggaaac ttggctgggc catcgatgag gattttggtt cgtttgaggc acttgtaaag
481 aagatgaatg cagaaggcgc tgctttccaa gggtctggat gggtgtggtt agctttggat
541 aaagaggcaa aaaaggtttc agttgaaaca acagctaatc aggatcctct ggtgactaaa
601 ggtgcaagct tggttccgct gttggggatt gatgtctggg aacatgcata ctacctgcag
661 tacaagaatg ttaggccgga ttacctgaac aacatctgga aggtgatgaa ctggaaatat
721 gctggagagg tgtacgaaaa tgttcttgct tgaattgtct taacggacaa tacacatctg
781 cgcgcgcggg tttcggctgt ttgatcatgt gaaataaaga tggacctgtc tagcggctgg
841 accttgtgta catttcactg agatagacta atggacggcc tgccgatttt gttcgtcctg
901 cttgcgtgct actctgtctc tgctcctagt ttttggcatc atgtttatgt tgagcaaggt
961 gatgcccaag ggaagccatt cccactcttg tctccattaa taaaatcagc tgagcttccg
1021 atgtttgctt ggaattc
//
LOCUS RATA2UGLBA 300 bp ds-DNA ROD 12-JUL-1990
DEFINITION Rat alpha-2u-globulin gene, 5' end.
ACCESSION M33213
KEYWORDS alpha-2u-globulin.
SOURCE Rat DNA, clone 91.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Kurtz,D.T., McCullough,L., Bishop,D.K. and Manos,M.M.
TITLE DNA sequences required for hormonal induction of rat
alpha-2u-globulin genes
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 985-988 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept 292 > 300 alpha-2u-globulin
mRNA 233 > 300 alpha-2u-globulin mRNA
BASE COUNT 108 a 58 c 76 g 58 t
ORIGIN
1 acccactaat ttttcgtggg aatatgtttt gcgaaatgta tgagtgatag aatcaatcca
61 taggagatga catcgccaag tttcaaaagg gcaggaacaa tcgtggcttc acatcagtac
121 atggaaaaca ttccacaaag cctgagaaga atggaaggcc catatgagaa ggaaaaaaaa
181 acaccgaaac ccagagagag tataaagacg agcaaagtgc tggaggtgga gtgtgggcac
241 catcagcaga gggattgtcc cgacagagag gcaattctat tccctaccaa catgaagctg
//
LOCUS SHRRGBA 409 bp ds-DNA INV 12-JUL-1990
DEFINITION Brine shrimp 5.8S ribosomal RNA gene.
ACCESSION M33097
KEYWORDS 5.8S ribosomal RNA; ribosomal RNA.
SOURCE Brine shrimp DNA.
ORGANISM Artemia sp.
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Branchiopoda; Sarsostraca; Anostraca; Artimiidae.
REFERENCE 1 (bases 1 to 409)
AUTHORS Vaughn,J.C., Sperbeck,S.J. and Hughes,M.J.
TITLE Molecular cloning and characterization of ribosomal RNA genes from
the brine shrimp: Nucleotide sequence analysis and evolution of the
5.8 S rRNA gene region and its flanking nucleotides
JOURNAL Biochim. Biophys. Acta 783, 144-151 (1984)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 177 339 5.8S ribosomal RNA
BASE COUNT 104 a 82 c 116 g 107 t
ORIGIN
1 ggtgaaaaat agtcatattg gggacgagag tggcttcttg tgattcaagg atcatggata
61 ccactccgcg agactaaagg gagtgaaggt gagcttgccc caacagagca tggcttgagg
121 tgtgcaaggg tgcaattgca ttggccttgt ttgagggaga atttgaaaca ttcaatagaa
181 tgacccttga ggatggatca cttggctcac attacgaaga cgaacgcagc tagacgcgtg
241 attccatgcg aactgcagga cacatggaac gtctatattt tgaacgcaaa ttgcatgtcc
301 agcctttgag cttggactac gtctggctga gagacggatg tttttatcat tcggtcatct
361 gggtataccg tcactgcgag gctccttgct tctatagggc cgttgatcg
//
LOCUS GLATSAA 3053 bp ss-mRNA INV 12-JUL-1990
DEFINITION G.lamblia trophozoite surface antigen (TSA 417) mRNA, complete cds.
ACCESSION M33641
KEYWORDS major surface protein; trophozoite surface antigen.
SOURCE G.lamblia (strain WB, ATCC 30957), cDNA to mRNA, clone pFDG417.
ORGANISM Giardia lamblia
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE 1 (bases 1 to 3053)
AUTHORS Gillin,F.D., Hagblom,P., Harwood,J., Aley,S.B., Reiner,D.S.,
McCaffery,M., So,M. and Guiney,D.
TITLE Isolation and expression of the gene for a major surface protein of
Giardia lamblia
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4463-4467 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by S.B.Aley, 08-APR-1990.
FEATURES from to/span description
pept 205 2346 trophozoite surface antigen protein precursor
(TSA 417)
sigp 205 255 trophozoite surface antigen protein signal
peptide
matp 256 2343 trophozoite surface antigen
binding 188 193 ribosomal binding site (put.)
signal 67 72 TATA box
signal 2359 2364 misc. signal
signal 2831 2837 polyA signal
BASE COUNT 785 a 777 c 865 g 626 t
ORIGIN
1 gaattcttac gctatgtacg gcttatattg acaggattgc tacaggctat gaatactatg
61 ctagagtata aacatgtatc cacggcgatc tgggggtctt ctcggagact agtggccagt
121 taccatggac acgcaagaag ctgtctgtgg tagcctggcc ccgggctttg cgttggaagc
181 gccacccagc aggtcggcgg cctaatgttc ggcagatttt tgctcgcgat cgtcatcctt
241 cagctggcac ggacagcctg cacccaagaa gctgacgatg gaaagtgtaa aacgtgtggc
301 gtcaccattg gtcaagacac ttggtgctct gagtgcaacg gagcaaacta cgcccccgtg
361 aacggccagt gtgtagacgt caacgctgag gggccaagca aaacgctttg tccgcaacat
421 agcgcaggga agtgcacgca gtgcggaggc aactcattca tgtacaagga cggctgttat
481 tccagcggag aaggccttcc tggacacagc ctgtgcttaa gttccgacgg agatggcgta
541 tgcaccgagg cggccccggg gtactttgct ccggtgggag cggcgaacac tgaacagtct
601 gtgatcgcat gtggcgatac aactggagta acaatagcag ctggcggaaa cacatacaag
661 ggcattgctg actgcgcaga atgcagcgcc cctgacgcaa cagccggcgc tgaggccggc
721 aaggttgcaa cgtgtaccaa gtgtggagtc agtaagtatc tcaaggataa cgtgtgcgta
781 gataaagccc aatgtaattc tggtagcact aataagttcg ttgcagttga tgattctgag
841 aatggcaaca agtgtgtttc ttgcagcgat aacctcaatg gtggcgttgc caattgcgac
901 acctgtagct acgatgagca atctaagaag atcaagtgta caaaatgcac cgataacaac
961 tacctgaaaa ccacaagcga aggcacgtcg tgcgtacaaa aagaccaatg caaagacggc
1021 ttcttcccca aggatgacag cagtgcagga aataaatgcc tcccttgtaa tgacagcacc
1081 gacggaattg ccaattgcgc cacgtgtgct ctggttagtg gccgatcagg ggctgccctc
1141 gttacatgct ccgcctgcac ggatggatac aagcctagtg ccgacaaaac tacgtgcgag
1201 gcggtaagca actgcaagac ccccggatgc aaggcgtgca gcaacgaagg aaaggagaac
1261 gaggtctgca cagactgtga tggtagcaca tacctcacgc cgacaagcca gtgcatagac
1321 agctgcgcta agattggaaa ctactatgga gccaccgaag gagcaaagaa actctgtaaa
1381 gagtgcactg cggctaactg caagacttgc gatgatcagg ggcagtgcca agcatgcaac
1441 gacgggttct ataaaaacgg cgacgcgtgc tctccgtgcc acgaaagctg caagacatgc
1501 agcgcaggca ctgccagcga ctgcaccgag tgtcccaccg gaaaagcact caggtacggg
1561 gacgacggta ctaagggcac gtgcggagaa ggctgcacaa cgggcacagg agcaggagca
1621 tgcaagacgt gtgggctcac tatcgatggc gctagctact gctctgagtg cgccacaacg
1681 acagaatatc ctcaaaatgg cgtctgtgca ccaaaggcta gccgcgccac acctacgtgc
1741 aacgactcgc ctattcagaa tggtgtttgt ggaacgtgtg ccgataacta ctttaagatg
1801 aacggagggt gctatgaaac agtcaagtat cccggtaaga cggtttgcat tagtgcacca
1861 aatggtggta cgtgtcaaaa agctgcagat ggttacaagt tggattcagg tacccttaca
1921 gtttgttctg aagggtgtaa ggaatgtgct agcagtaccg actgtactac gtgtctggac
1981 ggatatgtaa agagtgcaag tgcgtgcaca aagtgtgacg ctagctgcga aacatgtaat
2041 ggagcagcta caacatgtaa ggcgtgtgct acgggatact acaagaccgc atcaggagaa
2101 ggtgcgtgca cgtcttgtga aagtgatagc aacggagtca ctggtattaa gggctgccta
2161 aactgcgccc ctccgcccaa caataaaggt tccgtcctct gctacctcat aaaggatagc
2221 ggtagcacca acaagagcgg gctctccact ggtgccatag cgggtatctc cgtcgctgtc
2281 atcgttgttg tcggcggcct catcggcttc ctctgctggt ggttcctctg cagggggaag
2341 gcgtagatgt acttagatag taaaccgtca tcgatgggtc tgctcggtgt ctgttcctgc
2401 tagcacagac agcagggtct cagccagtgc accaagcatc aggcgtgtgg atgaatgttt
2461 ggcttatcca gtagcgccct tgcgtgtcca cgggctcaca tgtgaccaac agtgctgtac
2521 aggtaggtag agaccagacc acggatccca tgcactgaat gcaactcctt tgcagccgtg
2581 atgggtcagt tgtggcaatt tataagacaa aacgagggcc ctgtccatcg cacagtccct
2641 tgcagcgctt ccagacgcgg agctggcggc ggtcctgcac tacctcgccg agttccgtgg
2701 gccagaggtc ttcggggact gccttcagac cttgctcagc tcgacgagcc ccgggacgag
2761 accctcaggc ttgcggcaca gaaagacata cgcggcttcc tcgaaaggat cgacagaaag
2821 gactcagctc aataaatgcc actcttcacg tcctcgttcc gcggtacatc gtgtagctgt
2881 acatccagtg gaacttttcg actcagaagt gaagttgatg agctctgtgg tagataactt
2941 ctcatggcct ggagtggcag acactgtgag cagctgattg gcatgcaatt cacaccctag
3001 acgcggtgga gagatacccc cgtccatcca ttacaaacaa gtcccacaag ctt
//
LOCUS HUMPLG01 1272 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 1.
ACCESSION M33272 J05286
KEYWORDS plasminogen.
SEGMENT 1 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1272)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept 1077 + 1125 plasminogen precursor, exon 1
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
sigp 1077 + 1125 plasminogen signal peptide
pre-msg 957 > 1272 PLG mRNA and introns
IVS 1126 > 1272 PLG intron A
BASE COUNT 391 a 231 c 269 g 381 t
ORIGIN
1 gaattccgca gacattccac ccaagaccat tgggctccca cctctactct tttgccagtt
61 aatgaatagg caggaatttc actgcctgga aagaggaaca atgctttctg gtccttattt
121 cacatctaaa atagagaggt caattgattt attcctaaat atctttgaac actaaaatag
181 aagttttaca gcatatatac tacctggttg ctctagactt aagccaggga aaagtacaga
241 ttcaacattt aaaattgaga tagacgcttt ccacttaatg ctaccagtct tgctttattt
301 catgagaatg agaatataat aatatggcat acgttcattt gggggaaaga ttgatgtctt
361 ataacataat ttataattac agaaaacatg tgagttcact gggaataaat aaattttgaa
421 gataataaga tactttcact tatgtcataa tttctatgtc atttggtgta ggatgtagag
481 atattaacgt ttacacctaa ctcaagtttg tcatctaaga cctgaaaggg ttttgtctat
541 cagctgcacc cctgggtaga gacacaacct tggggaaggc ctcagcccca tccctcgtac
601 agcaggaatg agaacagccc tgcctgttgg gaagcttgag ggaggctatg gacgtgcagc
661 gcttggcaga aggtctcgtc atggaaggtt ccagcaaatg tgagatactt ttatgatttc
721 attttctcca aaagaaaggg aataagagaa gaggggagga aataagacta attgcgagag
781 ataaagtaca agggtgaggg aaggaataag gagacatgac ggcagcgtgg agcagccgag
841 gggggagatt gctttcacca cttcccagca tctattgcag attccaccct caaacatttt
901 gtaaggactc tttattcaag gtaacgtttg aaccctgctg agccagtggc atgggtctct
961 gagagaatca ttaacttaat ttgactatct ggtttgtgga tgcgtttact ctcatgtaag
1021 tcaacaacat cctgggattg ggacccactt tctgggcact gctggccagt cccaaaatgg
1081 aacataagga agtggttctt ctacttcttt tatttctgaa atcaggtaag acatagtttt
1141 tttaaattat aataattatt ttttctccca caatgtagta aaaatacata tgccatggct
1201 ttatgtgcaa ttcatttaat ttttgattca tgaaacttcc agttgaaaat cttgtataag
1261 attgaggaat tc
//
LOCUS HUMPLG02 161 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, intron A (partial).
ACCESSION M33273 J05286
KEYWORDS plasminogen.
SEGMENT 2 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 161)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
FEATURES from to/span description
IVS < 1 > 161 plasminogen intron A /hgml_locus_uid="LW0013Z"
/nomgen="PLG" /map="6q26-q27"
BASE COUNT 52 a 46 c 20 g 43 t
ORIGIN Unknown number of base pairs after segment 1.
1 gaattcaccc atttaggcat acaatccaat ggatttcaag atattgagag ttgtgcagcc
61 accatcagaa taaattttaa aactattcat acccccaaaa acgcactcca ctctccttag
121 ctgttacccc aatctgcagc ttctggcaac cactaatcta c
//
LOCUS HUMPLG03 376 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 2.
ACCESSION M33274 J05286
KEYWORDS plasminogen.
SEGMENT 3 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 376)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 196 + 331 plasminogen (PLG) precursor, exon 2
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
sigp 196 203 plasminogen signal peptide
matp 204 + 331 plasminogen
IVS < 1 195 PLG intron A
IVS 332 > 376 PLG intron B
BASE COUNT 114 a 74 c 78 g 110 t
ORIGIN Unknown number of base pairs after segment 2.
1 tctttattta tgtccaaatg cccgactgtg tgttcttaac taaacatttt gattcatagc
61 tacccattct acttccagta aacagaaagt tttatttggt taatgctaac caaatagatt
121 aaaaggaagt catgacaatt agacattgac attgatttac tgaccattta ttccacttgg
181 atctcccacc tctaggtcaa ggagagcctc tggatgacta tgtgaatacc cagggggctt
241 cactgttcag tgtcactaag aagcagctgg gagcaggaag tatagaagaa tgtgcagcaa
301 aatgtgagga ggacgaagaa ttcacctgca ggtatttcca ttgtcgttgc acctacgcag
361 gaatctgtaa ttcaga
//
LOCUS HUMPLG04 291 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 3.
ACCESSION M33275 J05286
KEYWORDS plasminogen.
SEGMENT 4 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 46 + 152 plasminogen (PLG) precursor, exon 3
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 46 + 152 plasminogen
IVS < 1 45 PLG intron B
IVS 153 > 291 PLG intron C
BASE COUNT 88 a 60 c 51 g 92 t
ORIGIN Unknown number of base pairs after segment 3.
1 taaataaaga aaaatactta ttggatttcc tgcttcgttc tgcagggcat tccaatatca
61 cagtaaagag caacaatgtg tgataatggc tgaaaacagg aagtcctcca taatcattag
121 gatgagagat gtagttttat ttgaaaagaa aggtgagtac attttcttcc tcctcctcct
181 actgtcctcc ccatcctccc actcttcctc tttctctatt ctatctttaa tttatgagac
241 cagaggagga aggcactatc gtgttataaa actgaattct gagttaggac a
//
LOCUS HUMPLG05 69 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, intron C (partial).
ACCESSION M33276 J05286
KEYWORDS plasminogen.
SEGMENT 5 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 69)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
FEATURES from to/span description
IVS < 1 > 69 plasminogen intron C /hgml_locus_uid="LW0013Z"
/nomgen="PLG" /map="6q26-q27"
BASE COUNT 26 a 9 c 13 g 21 t
ORIGIN Unknown number of base pairs after segment 4.
1 aagtgcagat taaatctaaa ctttatctgg tgaagttatt agttcttaca agtagcaagc
61 aaacggtaa
//
LOCUS HUMPLG06 57 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, intron C (partial).
ACCESSION M33277 J05286
KEYWORDS plasminogen.
SEGMENT 6 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 57)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
FEATURES from to/span description
IVS < 1 > 57 plasminogen intron C /hgml_locus_uid="LW0013Z"
/nomgen="PLG" /map="6q26-q27"
BASE COUNT 18 a 12 c 7 g 20 t
ORIGIN Unknown number of base pairs after segment 5.
1 agtgcaacat ctacaataat tactttcctt atttttgaag tggaccatat ctcgaca
//
LOCUS HUMPLG07 341 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 4.
ACCESSION M33278 J05286
KEYWORDS plasminogen.
SEGMENT 7 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 341)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 167 + 281 plasminogen (PLG) precursor, exon 4
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 167 + 281 plasminogen
IVS < 1 166 PLG intron C
IVS 282 > 341 PLG intron D
BASE COUNT 89 a 73 c 77 g 102 t
ORIGIN Unknown number of base pairs after segment 6.
1 tggctcagtt tactgcagcc tttttgcaga tgcaaaagat gatcttttag aaagcagaaa
61 cagggggtct ggtgcatgag atctttttct caacgtgact atgctgtgca gaccttcatg
121 tggtgtcttg tgaaagactt tgaccactgt gtggacttcc cttcagtgta tctctcagag
181 tgcaagactg ggaatggaaa gaattacaga gggacgatgt ccaaaacaaa aaatggcatc
241 acctgtcaaa aatggagttc cacttctccc cacagaccta ggtaagacat tccctttcat
301 ctttgtgttc atctactgta aagttgtccc tctgtgtctg t
//
LOCUS HUMPLG08 354 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 5.
ACCESSION M33279 J05286
KEYWORDS plasminogen.
SEGMENT 8 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 69 + 208 plasminogen (PLG) precursor, exon 5
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 69 + 208 plasminogen
IVS < 1 68 PLG intron D
IVS 209 > 354 PLG intron E
BASE COUNT 101 a 83 c 72 g 98 t
ORIGIN Unknown number of base pairs after segment 7.
1 ttctgccttg ctaatagcaa gctgattttt agaatatagt ctaagtgctt cttttccatc
61 ctccccagat tctcacctgc tacacacccc tcagagggac tggaggagaa ctactgcagg
121 aatccagaca acgatccgca ggggccctgg tgctatacta ctgatccaga aaagagatat
181 gactactgcg acattcttga gtgtgaaggt caggagtggt tctagaaaat gttttcattt
241 ctgcccttca cctgtaaaat aatttgttgt aaagcccctt cccacaggga tgttattaat
301 aattgagtaa cgtattcacc tctgggaaag aagcaaaacc ccagaattaa cctg
//
LOCUS HUMPLG09 206 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 6.
ACCESSION M33280 J05286
KEYWORDS plasminogen.
SEGMENT 9 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 206)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 56 + 176 plasminogen (PLG) precursor, exon 6
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 56 + 176 plasminogen
IVS < 1 55 PLG intron E
IVS 177 > 206 PLG intron F
BASE COUNT 51 a 57 c 39 g 59 t
ORIGIN Unknown number of base pairs after segment 8.
1 ttcatccatt tcagttttct tcttcctctc tgtccttcct tcccactctg tccagaggaa
61 tgtatgcatt gcagtggaga aaactatgac ggcaaaattt ccaagaccat gtctggactg
121 gaatgccagg cctgggactc tcagagccca cacgctcatg gatacattcc ttccaagtaa
181 gtctcactgg gaaaaacatt ccatgt
//
LOCUS HUMPLG10 100 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, intron F (partial).
ACCESSION M33281 J05286
KEYWORDS plasminogen.
SEGMENT 10 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
FEATURES from to/span description
IVS < 1 > 100 plasminogen intron F /hgml_locus_uid="LW0013Z"
/nomgen="PLG" /map="6q26-q27"
BASE COUNT 29 a 17 c 26 g 28 t
ORIGIN Unknown number of base pairs after segment 9.
1 ccaaaatgat aaggtcactg attctgttga gtgattttta cacatgtaaa ctgttagaaa
61 aacagtgctt ggcagccggg catggtggca catgctgtag
//
LOCUS HUMPLG11 247 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 7.
ACCESSION M33282 J05286
KEYWORDS plasminogen.
SEGMENT 11 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 247)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 68 + 186 plasminogen (PLG) precursor, exon 7
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 68 + 186 plasminogen
IVS < 1 67 PLG intron F
IVS 187 > 247 PLG intron G
BASE COUNT 70 a 63 c 51 g 63 t
ORIGIN Unknown number of base pairs after segment 10.
1 cttgaaaaag agtcttatcc atgaatgtaa atgttcagtg ctactaaaat ctttcttgtc
61 cattcagatt tccaaacaag aacctgaaga agaattactg tcgtaacccc gatagggagc
121 tgcggccttg gtgtttcacc accgacccca acaagcgctg ggaactttgc gacatccccc
181 gctgcagtga gtatgatgca cacccagatt ccaggatttg gacctgccct gttcttgaaa
241 tcaaaag
//
LOCUS HUMPLG12 244 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 8.
ACCESSION M33283 J05286
KEYWORDS plasminogen.
SEGMENT 12 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 244)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 47 + 209 plasminogen (PLG) precursor, exon 8
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 47 + 209 plasminogen
IVS < 1 46 PLG intron G
IVS 210 > 244 PLG intron H
BASE COUNT 68 a 72 c 45 g 59 t
ORIGIN Unknown number of base pairs after segment 11.
1 ctcaaaaaat atatatattc attgtaactt attttgccca ttcaagcaac acctccacca
61 tcttctggtc ccacctacca gtgtctgaag ggaacaggtg aaaactatcg cgggaatgtg
121 gctgttaccg tgtccgggca cacctgtcag cactggagtg cacagacccc tcacacacat
181 aacaggacac cagaaaactt tccctgcaag taagtcccct ccagtctcat tctgctgcta
241 tgga
//
LOCUS HUMPLG13 217 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 9.
ACCESSION M33284 J05286
KEYWORDS plasminogen.
SEGMENT 13 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 217)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 36 + 181 plasminogen (PLG) precursor, exon 9
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 36 + 181 plasminogen
IVS < 1 35 PLG intron H
IVS 182 > 217 PLG intron I
BASE COUNT 64 a 56 c 52 g 45 t
ORIGIN Unknown number of base pairs after segment 12.
1 ttggaaagct aaactcacaa tcacttcttt ttcagaaatt tggatgaaaa ctactgccgc
61 aatcctgacg gaaaaagggc cccatggtgc catacaacca acagccaagt gcggtgggag
121 tactgtaaga taccgtcctg tgactcctcc ccagtatcca cggaacaatt ggctcccaca
181 ggtaagcaag ggtatgggag cttactgagg gcccaag
//
LOCUS HUMPLG14 409 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 10.
ACCESSION M33285 J05286
KEYWORDS plasminogen.
SEGMENT 14 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 409)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 132 + 291 plasminogen (PLG) precursor, exon 10
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 132 + 291 plasminogen
IVS < 1 131 PLG intron I
IVS 292 > 409 PLG intron J
BASE COUNT 116 a 102 c 73 g 118 t
ORIGIN Unknown number of base pairs after segment 13.
1 tctgtctgct aatacagaaa agagaacagt cataattctc agaggctacc gtactgtttt
61 tgtcataaat tgcttcatgc ttcttttttt tcagtaattg ttaagcttga tttcttttat
121 tttaatttca gcaccacctg agctaacccc tgtggtccag gactgctacc atggtgatgg
181 acagagctac cgaggcacat cctccaccac caccacagga aagaagtgtc agtcttggtc
241 atctatgaca ccacaccggc accagaagac cccagaaaac tacccaaatg cgtatgtctt
301 tgatttttac tgtaagaggg gcatcagcca actgaaattt ctgttaaaag agccatgctt
361 catgcttcaa gccaacttcc taggaccaaa tttctcttag acccagaat
//
LOCUS HUMPLG15 266 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 11.
ACCESSION M33286 J05286
KEYWORDS plasminogen.
SEGMENT 15 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 266)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 60 + 241 plasminogen (PLG) precursor, exon 11
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 60 + 241 plasminogen
IVS < 1 59 PLG intron J
IVS 242 > 266 PLG intron K
BASE COUNT 66 a 68 c 65 g 67 t
ORIGIN Unknown number of base pairs after segment 14.
1 ctgggtgccc ctgaatattc tcccacctct tgtgacctgt attgttttgg aatttccagt
61 ggcctgacaa tgaactactg caggaatcca gatgccgata aaggcccctg gtgttttacc
121 acagacccca gcgtcaggtg ggagtactgc aacctgaaaa aatgctcagg aacagaagcg
181 agtgttgtag cacctccgcc tgttgtcctg cttccaaatg tagagactcc ttccgaagaa
241 ggtaagaaat ctgtggctgg acatct
//
LOCUS HUMPLG16 224 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 12.
ACCESSION M33287 J05286
KEYWORDS plasminogen.
SEGMENT 16 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 224)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 26 + 174 plasminogen (PLG) precursor, exon 12
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 26 + 174 plasminogen
IVS < 1 25 PLG intron K
IVS 175 > 224 PLG intron L
BASE COUNT 57 a 56 c 56 g 55 t
ORIGIN Unknown number of base pairs after segment 15.
1 aatcatccat tttttccctg tacagactgt atgtttggga atgggaaagg ataccgaggc
61 aagagggcga ccactgttac tgggacgcca tgccaggact gggctgccca ggagccccat
121 agacacagca ttttcactcc agagacaaat ccacgggcgg gtctggaaaa aaatgtaagc
181 cactttgatt tggactcttt ggccttttgc tcaccaatct ttgc
//
LOCUS HUMPLG17 223 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 13.
ACCESSION M33288 J05286
KEYWORDS plasminogen.
SEGMENT 17 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 223)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 31 + 124 plasminogen (PLG) precursor, exon 13
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 31 + 124 plasminogen
IVS < 1 30 PLG intron L
IVS 125 > 223 PLG intron M
BASE COUNT 56 a 44 c 58 g 65 t
ORIGIN Unknown number of base pairs after segment 16.
1 gctggagctt acatgccttc ttgttttcag tactgccgta accctgatgg tgatgtaggt
61 ggtccctggt gctacacgac aaatccaaga aaactttacg actactgtga tgtccctcag
121 tgtggtaggt tgccttcttt ttggtaagga aactgcttac ttaatatgga tttgcaacaa
181 aaaaggaaaa gggcttctga gcagactgct tctggggagg aga
//
LOCUS HUMPLG18 296 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 14.
ACCESSION M33289 J05286
KEYWORDS plasminogen.
SEGMENT 18 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 296)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 126 + 246 plasminogen (PLG) precursor, exon 14
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 126 + 246 plasminogen
IVS < 1 125 PLG intron M
IVS 247 > 296 PLG intron N
BASE COUNT 74 a 72 c 69 g 81 t
ORIGIN Unknown number of base pairs after segment 17.
1 atgattttac tatttagttc ggcctttaag atgtcaaaaa ctcagtgctt ggaatttgtc
61 tcgaattaca ccacaaaatt gctaccttgt ctcaaatggg atttctttcc caccttgtgc
121 cacagcggcc ccttcatttg attgtgggaa gcctcaagtg gagccgaaga aatgtcctgg
181 aagggttgta ggggggtgtg tggcccaccc acattcctgg ccctggcaag tcagtcttag
241 aacaaggtaa gaacaggccc agaaacgatt tatactgtcc ctccacgtaa gccctg
//
LOCUS HUMPLG19 361 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 15.
ACCESSION M33290 J05286
KEYWORDS plasminogen.
SEGMENT 19 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 361)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 66 + 140 plasminogen (PLG) precursor, exon 15
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 66 + 140 plasminogen
IVS < 1 65 PLG intron N
IVS 141 > 361 PLG intron O
BASE COUNT 93 a 77 c 80 g 111 t
ORIGIN Unknown number of base pairs after segment 18.
1 ttctgtacaa tggagcagaa caaagtatca atttaactaa aatttgaact aaatcctctt
61 tccaggtttg gaatgcactt ctgtggaggc accttgatat ccccagagtg ggtgttgact
121 gctgcccact gcttggagaa gtatgtttag gggacaattg acatgaagtc ttgtcttaaa
181 tactttttct gtccttcttt tcctcctttc ctcctttcct ttctcactct tcctcccttc
241 cttctctggc tgtgacacta gggaccaggc cagggcaatt ggataagaga gaagggaagg
301 gtttctagaa agaaactgca gaggaaagac acagtacaga tgattttgtg ggcctgaata
361 a
//
LOCUS HUMPLG20 331 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 16.
ACCESSION M34272 J05286
KEYWORDS plasminogen.
SEGMENT 20 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 331)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 41 + 181 plasminogen (PLG) precursor, exon 16
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 41 + 181 plasminogen
IVS < 1 40 PLG intron O
IVS 182 > 331 PLG intron P
BASE COUNT 80 a 81 c 79 g 91 t
ORIGIN Unknown number of base pairs after segment 19.
1 ctggaccata ttttcctctt gacatcctca tcttttctag gtccccaagg ccttcatcct
61 acaaggtcat cctgggtgca caccaagaag tgaatctcga accgcatgtt caggaaatag
121 aagtgtctag gctgttcttg gagcccacac gaaaagatat tgccttgcta aagctaagca
181 ggtactcgtt cacctgtggt cttcacccca cgctggtgaa gatatttgct ttatgtctgg
241 gttttatggg ccatggcact gcatggcagt ggggaggaac tgtctatcac atgaaaggct
301 caagggcttt ggggacagca tcaatcttca a
//
LOCUS HUMPLG21 251 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 17.
ACCESSION M34273 J05286
KEYWORDS plasminogen.
SEGMENT 21 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 251)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 41 + 147 plasminogen (PLG) precursor, exon 17
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 41 + 147 plasminogen
IVS < 1 40 PLG intron P
IVS 148 > 251 PLG intron Q
BASE COUNT 71 a 61 c 51 g 68 t
ORIGIN Unknown number of base pairs after segment 20.
1 gcagagcagt caaacataac tgctgatgct tttctttcag tcctgccgtc atcactgaca
61 aagtaatccc agcttgtctg ccatccccaa attatgtggt cgctgaccgg accgaatgtt
121 tcatcactgg ctggggagaa acccaaggtg agataaattc cattgcccac ataacgaatt
181 ggttttgacc tacagtccat gtgacaaaat gatcattttg gagaaagctg tgcaaattcc
241 tatccatgaa t
//
LOCUS HUMPLG22 101 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, intron Q (partial).
ACCESSION M34274 J05286
KEYWORDS plasminogen.
SEGMENT 22 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 101)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
FEATURES from to/span description
IVS < 1 > 101 plasminogen intron Q /hgml_locus_uid="LW0013Z"
/nomgen="PLG" /map="6q26-q27"
BASE COUNT 24 a 34 c 24 g 19 t
ORIGIN Unknown number of base pairs after segment 21.
1 agaagggtgc tccctcacac aactacagca gtccaggtga tgcacccact gcccaatgct
61 tggtagtcaa gaggagcttc ctccctgcag ctctgcccag a
//
LOCUS HUMPLG23 254 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 18.
ACCESSION M34275 J05286
KEYWORDS plasminogen.
SEGMENT 23 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 254)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 69 + 214 plasminogen (PLG) precursor, exon 18
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 69 + 214 plasminogen
IVS < 1 68 PLG intron Q
IVS 215 > 254 PLG intron R
BASE COUNT 61 a 54 c 63 g 76 t
ORIGIN Unknown number of base pairs after segment 22.
1 tgttctggaa tatcctcctg aatgtgtttt gggtgcagtt gccatttctt tcatcttttt
61 aaacacaggt acttttggag ctggccttct caaggaagcc cagctccctg tgattgagaa
121 taaagtgtgc aatcgctatg agtttctgaa tggaagagtc caatccaccg aactctgtgc
181 tgggcatttg gccggaggca ctgacagttg ccaggtaagc aaagatcaag agaccaaagt
241 tagtcttgtg ctct
//
LOCUS HUMPLG24 1236 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human plasminogen gene, exon 19.
ACCESSION M34276 J05286
KEYWORDS plasminogen.
SEGMENT 24 of 24
SOURCE Human leukocyte and lung fibroblast DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1236)
AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
TITLE Characterization of the gene for human plasminogen, a key proenzyme
in the fibrinolytic system
JOURNAL J. Biol. Chem. 265, 6104-6111 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES from to/span description
pept + 41 202 plasminogen (PLG) precursor, exon 19
/hgml_locus_uid="LW0013Z" /nomgen="PLG"
/map="6q26-q27"
matp + 41 199 plasminogen
pre-msg < 1 427 PLG mRNA and introns (alt.)
pre-msg < 1 458 PLG mRNA and introns (alt.)
pre-msg < 1 1184 PLG mRNA and introns (alt.)
IVS < 1 40 PLG intron R
BASE COUNT 365 a 233 c 297 g 341 t
ORIGIN Unknown number of base pairs after segment 23.
1 agcctaaccc tcacatgcat ttttctctcc ctctgtatag ggtgacagtg gagggcctct
61 ggtttgcttc gagaaggaca aatacatttt acaaggagtc acttcttggg gtcttggctg
121 tgcacgcccc aataagcctg gtgtctatgt tcgtgtttca aggtttgtta cttggattga
181 gggagtgatg agaaataatt aattggacgg gagacagagt gacgcactga ctcacctaga
241 ggctgggacg tgggtaggga tttagcatgc tggaaataac tggcagtaat caaacgaaga
301 cactgtcccc agctaccagc tacgccaaac ctcggcattt tttgtgttat tttctgactg
361 ctggattctg tagtaaggtg acatagctat gacatttgtt aaaaataaac tctgtactta
421 actttgattt gagtaaattt tggttttggt cttcaacatt ttcatgctct ttgttcaccc
481 caccaatttt aaatgggcag atggggggat ttagctgctt ttgataagga acagctgcac
541 aaaggactga gcaggctgca aggtcacaga ggggagagcc aagaagttgt ccacgcattt
601 acctcatcag ctaacgaggg cttgacatgc atttttactg tctttattcc tgacactgag
661 atgaatgttt tcaaagctgc aacatgcatg gggagtcatg cgaaccgatt ctgttattgg
721 gaatgaaatc tgtcaccgac tgcttgactt gagcccaggg gacacagagc agagagctgt
781 atatgatgga gtgaaccggt ccatggatgt gtaacacaag accaactgag agtctgaatg
841 ttattctggg gcacacgtga gtctaggatt ggtgccaaga gcatgtaaat gaacaacaag
901 caaatattga aggtggacca cttatttccc attgctaatt gcctgcccgg ttttgaaaca
961 gtctgcagta cacacggtga caggagaatg acctgtggga gagatacatg tttagaagga
1021 agagaaagga caaaggcaca cgttttacca tttaaaatat tgttaccaaa caaaaatatc
1081 cattcaaaat acaatttaac aatgcaacag tcatcttaca gcagagaaat gcagagaaaa
1141 gcaaaactgc aagtgactgt gaataaaggg tgaatgtagt ctcaaatcct caaagagctg
1201 tgtttatttc attgacaaat agattatttg tattca
//
LOCUS PARGANTI1 162 bp ds-DNA INV 12-JUL-1990
DEFINITION P.primaurelia G surface antigen gene, 5' end.
ACCESSION M11194
KEYWORDS G surface antigen; surface antigen.
SEGMENT 1 of 2
SOURCE P.primaurelia macronucleus DNA.
ORGANISM Paramecium primaurelia
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae.
REFERENCE 1 (bases 1 to 162)
AUTHORS Meyer,E., Caron,F. and Baroin,A.
TITLE Macronuclear structure of the G surface antigen gene of Paramecium
primaurelia and direct expression of its repeated epitopes in
Escherichia coli
JOURNAL Mol. Cell. Biol. 5, 2414-2422 (1985)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
F.Caron, 24-OCT-1985.
FEATURES from to/span description
pept 19 > 162 G surface antigen
BASE COUNT 55 a 23 c 25 g 59 t
ORIGIN
1 tgaattttaa tacttttaat gaataataaa ttcatcatat tctcattgtt gcttgcttta
61 gtagcaagtc aaacatacag tttaacatca tgcacatgtg cataattgtt atcagaagga
121 gattgcatca aaaatgtttc acttggatgt tcatgggata ca
//
LOCUS PARGANTI2 798 bp ds-DNA INV 12-JUL-1990
DEFINITION P.primaurelia G surface antigen gene, partial cds.
ACCESSION M11193
KEYWORDS G surface antigen; surface antigen.
SEGMENT 2 of 2
SOURCE P.primaurelia macronucleus DNA.
ORGANISM Paramecium primaurelia
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae.
REFERENCE 1 (bases 1 to 798)
AUTHORS Meyer,E., Caron,F. and Baroin,A.
TITLE Macronuclear structure of the G surface antigen gene of Paramecium
primaurelia and direct expression of its repeated epitopes in
Escherichia coli
JOURNAL Mol. Cell. Biol. 5, 2414-2422 (1985)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
F.Caron, 24-OCT-1985.
FEATURES from to/span description
pept < 1 > 798 G surface antigen (AA at 1)
rpt 1 222 direct repeat 1
rpt 223 444 direct repeat 2
rpt 445 666 direct repeat 3
rpt 667 > 798 direct repeat 4
BASE COUNT 262 a 153 c 177 g 206 t
ORIGIN
1 tgtgcttcaa ttactggaac aggattaacc actgctattt gtggaactta tgatgcaggt
61 tgtgtggcaa atgttaacgg aacagcttgt taagaaaaat tagcaacatg tgatttgtat
121 ttaactcaaa actcttgttc tacctcggca gctgcagcaa cagcagataa atgtgcatgg
181 agtggaaccg cttgccttgc agttacaact gttggtaccc attgtgctta tgttactgga
241 actggactta ctgatttaat atgtgcagca tataatgcaa attgtacagc taataaagct
301 ggaacagcat gtcaggagaa aaaggctact tgcaatttat acacaacaga agccacctgt
361 tcaacatcag cagctgcagc aacagcagat aaatgcgcat ggagtggagc agcttgcctt
421 gcagtaacaa ctgttgctac agagtgtgct tatgttactg gaactggact tactgattta
481 atatgtgcag catataatgc aaattgtaca gctaataaag ctggaacagc atgtcaggag
541 aaaaaggcta cttgcaattt atacacaaca gaagccacct gttcaacatc agcagctgca
601 gcaacagcag ataaatgcgc atggagtgga gcagcttgcc ttgcagtaac aactgttgct
661 acagagtgtg cttatgttac tggaactgga ctaacaaatg caatatgtgc agcatataat
721 gcaaattgta cagctaataa agctggaaca gcatgtcagg agaaaaaggc tacttgcaat
781 ttatacacaa cagaagcc
//
LOCUS BOVCASA 1123 bp ss-mRNA MAM 12-JUL-1990
DEFINITION Bovine alpha-s1-casein mRNA, complete cds.
ACCESSION M33123
KEYWORDS alpha-s1-casein.
SOURCE Bovine (strain Holstein) lactating mammary gland, cDNA to mRNA,
clone p-alpha-s1 C228.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1123)
AUTHORS Nagao,M., Maki,M., Sasaki,R. and Chiba,R.
TITLE Isolation and sequence analysis of bovine alpha-s1-casein cDNA
clone
JOURNAL Agric. Biol. Chem. 48, 1663-1667 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 64 708 alpha-s1-casein precursor
sigp 64 108 alpha-s1-casein signal peptide
matp 109 705 alpha-s1-casein
mRNA < 1 1123 alpha-s1-casein mRNA
BASE COUNT 331 a 247 c 223 g 322 t
ORIGIN
1 tcacttcgac catcaaccca gcttgctgtt cttcccagtc ttgggttcaa gatcttgaca
61 accatgaaac ttctcatcct tacctgtctt gtggctgttg ctcttgccag gcccaaacat
121 cctatcaagc accaaggact ccctcaagaa gtcctcaatg aaaatttact caggtttttt
181 gtggcacctt ttccagaagt gtttggaaag gagaaggtca atgaactgag caaggatatt
241 gggagtgaat caactgagga tcaagccatg gaagatatta agcaaatgga agctgaaagc
301 atttcgtcaa gtgaggaaat tgttcccaat agtgttgagc agaagcacat tcaaaaggaa
361 gatgtgccct ctgagcgtta cctgggttat ctggaacagc ttctcagact gaaaaaatac
421 aaagtacccc agctggaaat tgttcccaat agtgctgagg aacgacttca cagtatgaaa
481 gagggaatcc atgcccaaca gaaagaacct atgataggag tgaatcagga actggcctac
541 ttctaccctg agcttttcag acaattctac cagctggatg cctatccatc tggtgcttgg
601 tattacgttc cactaggcac acaatacact gatgccccat cattctctga catccctaat
661 cccattggct ctgagaacag tgaaaagact actatgccac tgtggtgaag agtcaagtga
721 attctgaggg actccacagt tatggtcttt gatgggtctg aaaattccat gctctacatg
781 tcgcctcatc tacatgtcaa accattcatc caaaggcttc aactgctgtt ttagaacagg
841 gcaatctcaa actgaggcac tccttgatgc tctactgtat tttagatagt gtaacatcct
901 taagtgaaat tgtcctaaca gcttgttacc taaattccag tagtatcatg ctggtataaa
961 ggccactgag tcaaagggaa ttaaagtctt cattaaattt ctgtatggaa aatgttttaa
1021 aagcctttga atcacttctc ctgtaagtgc catcatatca aataattgtg tgcattaact
1081 gagattttgt ctttcttctt ttcaataaat tacattttaa ggc
//
LOCUS BPHINTXIS 1741 bp ds-DNA PHG 12-JUL-1990
DEFINITION Bacteriophage phi-11 integrase (int) and excisionase (xis) genes,
complete cds.
ACCESSION M34832
KEYWORDS excisionase; integrase.
SOURCE Bacteriophage phi-11 DNA.
ORGANISM Bacteriophage phi-11
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 1741)
AUTHORS Ye,Z.-H., Buranen,S.L. and Lee,C.Y.
TITLE Sequence analysis and comparison of int and xis genes from
Staphylococcal bacteriophages l54a and phi-11
JOURNAL J. Bacteriol. 172, 2568-2575 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 267 67 (c) excisionase (xis)
pept 379 1425 integrase (int)
BASE COUNT 650 a 264 c 263 g 564 t
ORIGIN
1 cctatgccag caccagtgaa actctattat gcatggtatt aaaatcgaag agtacaattc
61 gataattcaa acattatttg acgaaatagc taagctgtct aatgtatata agtctcttaa
121 taaacagtaa gcaaaatcgg attcttcatt acataccgaa tattcatcat aaacactgac
181 tgcatcttct aagacatttt ttaaaattct aatgtcttca ttcgttaaaa ctaattcatt
241 gaaattatga ttgtttttaa atgtcataac atcacctact ttttatttta ttatatcaca
301 tttagtacct agtactaaat ttcgggtagc ccgcctaccc ttattatttt ttgccaattt
361 tgaggaggga gaagcaaaat gccagtatat aaggatgata atacaggtaa atggtatttt
421 tccattagat ataaagatgt atacggtaat aacaaacgaa aaatgaagcg tgggtttgaa
481 cgtaagaaag atgccaaact agctgaaagc gaatttatac aaaatgttaa atatggatac
541 tcggacaatc aaccctttga atatatattt tttgatcgtt taaaaaatga aaatctttct
601 gcacgctcaa tagaaaagcg aactacagaa tataatactc acataaaaga aaggttcgga
661 aatatcccta ttggcaaaat cactactacg caatgtactg ctttcaggaa ttatttgtta
721 aacgatgcag gtctttctgt tgactatgca cgatctgtgt gggcaggttt taaagcagtt
781 atcaattacg ccaaaaagca ttacaagctc ttatacgacc ccacattatc ggtaactcct
841 attcccagaa caaaaccaca agctaaattt atcactcgtg aagaatttga tgaaaaagta
901 gaacaaatca caaatgatac ttctcgtcag ctaactagac tgttatttta ttctggtctt
961 agaataggag aagctttagc tttgcagtgg aaagattacg ataaaataaa aggcgaaatt
1021 gacgtaaata agaaaatcaa tttaagtaat agaaaaattg aatataatct aaaaaaagaa
1081 agctctaaag ggataatacc tgtaccaaat ttaattagag agatgcttaa aaacatgtat
1141 aatgaatctt ctaaaagata taaatatttt gacgaaaact attttatatt cgggggttta
1201 gaacctatta gatacgttac ttattcgtat cattttaaat ctgtattccc gaatctaaaa
1261 atacaccatt taagacactc gtacgctagc tatttaatta ataatggtgt agatatgtat
1321 ttattaatgg aattaatgag gcattctaac attacagaaa caattcaaac gtactctcat
1381 ttatatactg ataaaaaaca tcaagctatg agcatatttg attaaacggt atcaaattgg
1441 tatcaaataa caattaagga gtttataaaa tgcgtaataa caagcctaaa ataagtattc
1501 aaaacgaccc atgggaagtg aaatttatat acatttaaat ttcatgagac aataaacgtt
1561 gatttaatgc gtttttttgc cttttttatt ttccttattt tttctgtttt acaacaaaat
1621 ggtatcaaaa atggtatcat ttgtagttat tttagcttca catattaaaa caaccacact
1681 cctaaattaa taggtggtgt ggttttgttg gttgtgtggg gataaaaata accgcatcag
1741 t
//
LOCUS BSTNPRAS 3510 bp ds-DNA BCT 12-JUL-1990
DEFINITION B.stearothermophilus neutral protease (nprS), and transcriptional
activator (nprA) genes, complete cds.
ACCESSION M34237
KEYWORDS neutral protease; transcriptional activator.
SOURCE B.stearothermophilus (strain TELNE) DNA, clone pSP53.
ORGANISM Bacillus stearothermophilus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 3510)
AUTHORS Nishiya,Y. and Imanaka,T.
TITLE Cloning and nucleotide sequences of the neutral protease gene and
its transcriptional activator gene from Bacillus stearothermophilus
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by Y.Nishiya, 11-MAY-1990.
FEATURES from to/span description
pept 181 1401 transcriptional activator (nprA)
pept 1750 3405 neutral protease (nprS) precursor
sigp 1570 2274 neutral protease (nprS) signal and propeptide
matp 2275 3219 neutral protease (nprS)
BASE COUNT 1177 a 581 c 737 g 1015 t
ORIGIN
1 tacggtcttc agacatttct attcctatag cccaaatgag tagttccttt tggaggagaa
61 aatgtgtata atttttagta aatttatatt agtaaaaaat taagaaggag taggtattat
121 ttgaagattg gtgatcgctt aaaattttcc cgtatcaaac ataagttaac gcaagaggaa
181 gtggctgacg gaattatttc cgtatcatat ttatcaaaaa ttgaaaacaa tcaagtggtt
241 ccaagtgaag aagtgcttcg cctcctttgt caacggttgg gaatcaacaa tatcctgaaa
301 aatagacaag atgaattaac aagtaaattg ttattatggt acaaaacgat tacggataaa
361 aaccgacagg aagcagcccg gatgtacgag gaaatcaaac gaactttcga tgacgtccag
421 ggggcggaat ccatcgctta ctttctgttg tttgaaatgc gctatcactt gttattaaaa
481 gatattcata ctgtcgaagc gttgttgatc aaattaaggg aattgtatga cacctttgat
541 gatgtgatga agtattatta ttataaattt ttaggtctac tttactattg caaggaaaaa
601 tatgaagatg ctttggaata ttataaaaag gcggagcagc gatttcgaag ccaatcattt
661 gaaaaatggg aagaagctga tttgcattat ttactagcgc ttgtttatag ccggctctgg
721 agaatattag gctgtattaa ctatgcgcag catgctttag cgatttacca atccgaatac
781 gatttaaagc gaagcgctga atgccacatt ttacttggta tttgttacag aaggtacgga
841 gaagtagatc aagcgatcga atgctattca ttggcccata aaattgccca aatcattaat
901 gataccgaat tattaggtac gattgagcat aacctaggct acttaatgtc aatgaaacat
961 gagcattatg aagccattca gcattataag aagagtttgc tgtataagcg aaactcttca
1021 ttacaagcta gatttattac gttgttttct ctcatcaaag aatattatgt ttccaaaaac
1081 tataaaaaag cattagccaa tgtagaggaa agtttgcagc ttctcaagag ggaaaaagat
1141 gggatgacaa cgtattatga atattatctt catttcacag tttatcaata tttactatca
1201 gaagatattt cggaaaatga atttgaaaca tttatgaaag atcgagtgct cccttatttt
1261 caaaggttta aaaaatatga agatgttgca caatacgctg aatacttggc aatctattac
1321 gagaaacgtc ataagtataa actagcaagc aaattctata aaatgagtta tcaatttcta
1381 aaaaatatga taaatattta ggagggattt ttttgaaaaa gcttttatta ggaatcatga
1441 cgtttggtat tatgagttta cttgttctca ttggtagtga ccaagaacca aaatatgtgg
1501 caaaagacga acatccgcct ccaaccatca tcattgcagc gaaagatgaa catccaccag
1561 caacgattat ttgaagagga ataagcaaaa agacagctag ttttctagct gtcttttttc
1621 atgcatagga aaatgtgaaa aaaacgtagg gaattatcaa ctatatcaga ctctattttt
1681 cccaatacaa aatactgtaa aatattgtgt ttaatattct aaatacaaag aataaaggag
1741 gatgaaaaaa tgaaaaggaa aatgaaaatg aaattagtac gttttggtct tgcagcagga
1801 ctagcggccc aagtattttt tttaccttac aatgcgctgg cttcaacgga acacgttaca
1861 tggaaccaac aatttcaaac ccctcaattc atctccggtg atctgctgaa agtgaatggc
1921 acatccccag aagaactcgt ctatcaatat gttgaaaaaa acgaaaacaa gtttaaattt
1981 catgaaaacg ctaaggatac tctacaattg aaagaaaaga aaaatgataa ccttggtttt
2041 acgtttatgc gcttccaaca aacgtataaa gggattcctg tgtttggagc agtagtaact
2101 gcgcacgtga aagatggcac gctgacggcg ctatcaggga cactgattcc gaatttggac
2161 acgaaaggat ccttaaaaag cgggaagaaa ttgagtgaga aacaagcgcg tgacattgct
2221 gaaaaagatt tagtggcaaa tgtaacaaag gaagtaccgg aatatgaaca gggaaaagac
2281 accgagtttg ttgtttatgt caatggggac gaggcttctt tagcgtacgt tgtcaattta
2341 aactttttaa ctcctgaacc aggaaactgg ctgtatatca ttgatgccgt agacggaaaa
2401 attttaaata aatttaacca acttgacgcc gcaaaaccag gtgatgtgaa gtcgataaca
2461 ggaacatcaa ctgtcggagt gggaagagga gtacttggtg atcaaaaaaa tattaataca
2521 acctactcta cgtactacta tttacaagat aatacgcgtg gaaatgggat tttcacgtat
2581 gatgcgaaat accgtacgac attgccggga agcttatggg cagatgcaga taaccaattt
2641 tttgcgagct atgatgctcc agcggttgat gctcattatt acgctggtgt gacatatgac
2701 tactataaaa atgttcataa ccgtctcagt tacgacggaa ataatgcagc tattagatca
2761 tccgttcatt atagccaagg ctataataac gcattttgga acggttcgca aatggtgtat
2821 ggcgatggtg atggtcaaac atttattcca ctttctggtg gtattgatgt ggtcgcacat
2881 gagttaacgc atgcggtaac cgattataca gccggactca tttatcaaaa cgaatctggt
2941 gcaattaatg aggcaatatc tgatattttt ggaacgttag tcgaatttta cgctaacaaa
3001 aatccagatt gggaaattgg agaggatgtg tatacacctg gtatttcagg ggattcgctc
3061 cgttcgatgt ccgatccggc aaagtatggt gatccagatc actattcaaa gcgctataca
3121 ggcacgcaag ataatggcgg ggttcatatc aatagcggaa ttatcaacaa agccgcttat
3181 ttgattagcc aaggcggtac gcattacggt gtgagtgttg tcggaatcgg acgcgataaa
3241 ttggggaaaa ttttctatcg tgcattaacg caatatttaa caccaacgtc caactttagc
3301 caacttcgtg ctgccgctgt tcaatcagcc actgacttgt acggttcgac aagccaggaa
3361 gtcgcttctg tgaagcaggc ctttgatgcg gtaggggtga aataaagtgg tatctcatca
3421 gtgggggatt ttttcctcca ctgatgtttt gtttgtgatc ttttaatgat gtattggggt
3481 gcaaaatgcc caaaggctta taatgttgat
//
LOCUS HSEGP14 3347 bp ds-DNA VRL 12-JUL-1990
DEFINITION Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete
cds.
ACCESSION M34861
KEYWORDS glycoprotein 14.
SOURCE Equine herpesvirus type 1 DNA.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 3347)
AUTHORS Guo,P.
TITLE Characterization of the gene and an antigenic determinant of equine
herpesvirus type-1 glycoprotein 14 with homology to gB-equivalent
glycoproteins of other herpesviruses
JOURNAL Gene 87, 249-255 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 300 3239 glycoprotein 14 (gp14)
BASE COUNT 885 a 891 c 851 g 720 t
ORIGIN
1 tacaacggtt gaaacgtggt gtacgcatct caagagacta gctcgtttat gataactgcg
61 gctaaaggtg aattggtcaa ttagcgaagt ttcaaaggtt ttattgcttt gaagggagtg
121 acaggtgtga cggccacgca gcggctggcg tgaaatatat cggggagctc atcctagccg
181 ccgcagtatt ctcctcggtt ttccactgtg gagaggtgcc tcctgcgcgc agatcgtacc
241 tacccggact ccgcgccaca gtgctgcgtg agcggcattt acataaccta cgaggcgtca
301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg
361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct
421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa
481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct
541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt
601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac
661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc
721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg
781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct
841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat
901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga
961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata
1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt
1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg
1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg
1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt
1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc
1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag
1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact
1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg
1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tcttacttta
1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag
1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta
1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc
1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg
1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca
1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa
1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg
1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac
2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg
2101 cgtggtgtcc cctccaaaac aaagagcggc ccctatggaa cgaaatggtg aagattaccc
2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg
2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc
2281 gctcgatgga cagtaacact tgctactccc gcccccccgt aacatttaca attactaaga
2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg
2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag
2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg
2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc
2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac
2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg
2701 acaataccgc agtgattatc aggggatcgc cagctttttc aagggcctgg gtaaagtggg
2761 ggaggccgtg ggaacgctcg ttctcggcgc gcggcgctgt tgtttcaacc gtatctggaa
2821 tagcttgctt tttaaacaac ccatttgggg ggctagccat cggcctgctg gtaatcgccg
2881 gcctggtagc tgcgtttttt gcttacagat atgtaatgca gatccgcagt aaccccatga
2941 aagctctata ccccataaca acaaaggcct tgaaaaacaa agccaaaact tcctacggcc
3001 agaacgagga ggacgatggg agcgactttg atgaggccaa gcttgaagag gctcgcgaaa
3061 tgatcaaata catgtctatg gtttcggccc tggaaaagca ggaaaagaaa gctataaaga
3121 aaaacagtgg ggttggcctg atcgccagta acgtctcaaa gctggccctg cgaaggcgcg
3181 gtcccaaata tacccgactc caacagaacg ataccatgga aaatgaaaaa atggtttaaa
3241 catgtttaat aaatattatg acacgtactc aaagtgtgac ctcatatttg cataaccact
3301 tctagttccg gcccaaggat atttaagcct agtatctccg ccgaagg
//
LOCUS HUMHBGAA 1227 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human A-gamma-globin gene, 3' end.
ACCESSION M33200
KEYWORDS A-gamma-globin.
SOURCE Human (hereditary persistence of fetal hemoglobin individual II-1)
DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1227)
AUTHORS Gelinas,R.E., Rixon,M., Magis,W. and Stamatoyannopoulos,G.
TITLE Gamma gene promoter and enhancer structure in Seattle variant of
hereditary persistence of fetal hemoglobin
JOURNAL Blood 71, 1108-1112 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 3 A-gamma-globin (AA at 1)
/hgml_locus_uid="LK0092S" /nomgen="HBG1"
/map="11p15.5"
mut 794 794 t in wt; c in mutant
mut 970 970 c in wt; a in mutant
mut 1186 1186 a in wt; g in mutant
BASE COUNT 366 a 204 c 294 g 363 t
ORIGIN
1 tgagcctctt gcccatgatt cagagctttc aaggataggc tttattctgc aagcaataca
61 aataataaat ctattctgct gagagatcac acatgatttt cttcagctct tttttttaca
121 tctttttaaa tatatgagcc acaaagggtt tatattgagg gaagtgtgta tgtgtatttc
181 tgcatgcctg tttgtgtttg tggtgtgtgc atgctcctca tttattttta tatgagatgt
241 gcattttgtt gagcaaataa aagcagtaaa gacacttgta cacgggagtt ctgcaagtgg
301 gagtaaatgg tgtaggagaa atccggtggg aagaaagacc tctataggac aggacttctc
361 agaaacagat gttttggaag agatgggaaa aggttcagtg aagacctggg ggctggattg
421 attgcagctg agtagcaagg atggttctta atgaagggaa agtgttccaa gctttaggaa
481 ttcaaggttt agtcaggtgt agcaattcta ttttattagg aggaatacta tttctaatgg
541 cacttagctt ttcacagccc ttgtggatgc ctaagaaagt gaaattaatc ccatgccctc
601 aagtgtgcag attggtcaca gcatttcaag ggagagacct cattgtaaga ctctggggga
661 ggtggggact taggtgtaag aaatgaatca gcagaggctc acaagtcagc atgagcatgt
721 tatgtctgag aaacagacca gcactgtgag atcaaaatgt agtgggaaga atttgtacaa
781 cattaattgg aaggtttact taatggaatt tttgtatagt tggatgttag tgcatctcta
841 taagtaagag tttaatatga tggtgttacg gacctaatgt ttgtgtctcc tcaaaattca
901 catgctgaat ccccaactcc caactgacct tatctgtggg ggaggctttt gaaaagtaat
961 taggtttagc tgagctcata agagcagatc cccatcataa aattattttc cttatcagaa
1021 gcagagagac aagccatttc tctttcctcc cggtgaggac acagtgagaa gtccgccatc
1081 tgcaatccag gaagagaacc ctgaccacga gtcagccttc agaaatgtga gaaaaaactc
1141 tgttgttgaa gccacccagt cttttgtatt ttgttatagc accttacact gagtaaggca
1201 gatgaagaag gagaaaaaaa taagctt
//
LOCUS HUMHBQ1A 1114 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human theta-1-globin gene, complete cds.
ACCESSION M33022
KEYWORDS theta-1-globin.
SOURCE Human black female with alpha-thal-2 heterozygosity white cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1114)
AUTHORS Gonzalez-Redondo,J.M., Han,I.S., Gu,Y.-C. and Huisman,T.H.J.
TITLE Nucleotide sequence of the human theta-1-globin gene
JOURNAL Biochem. Genet. 26, 207-211 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 359 453 theta-1-globin, exon 1
/hgml_locus_uid="LV0155X" /nomgen="HBQ1"
/map="16p13.3"
538 742 theta-1-globin, exon 2
852 980 theta-1-globin, exon 3
IVS 454 537 theta-1-globin intron A
IVS 743 851 theta-1-globin intron B
BASE COUNT 166 a 386 c 393 g 169 t
ORIGIN
1 atcccagtta ctcgggaggc tgaggcagga gaatcgtttg aacccgggag gcggaggttg
61 cagtgagccg gaatggcgcc actgcactca ccgcacccgg ccaatttttg tgtttttagt
121 agagactaaa taccatatag tgaacaccta agacgggggg ccttggatcc agggcgattc
181 agagggcccc ggtcggagct gtcggagatt gagcgcgcgc ggtcccggga tctccgacga
241 ggccctggac ccccgggcgg cgaagctgcg gcgcggcgcc ccctggaggc cgcgggaccc
301 ctggccggtc cgcgcaggcg cagcggggtc gcagggcgcg gcgggttcca gcggggggat
361 ggcgctgtcc gcggaggacc gggcgctggt gcgcgccctg tggaagaagc tgggcagcaa
421 cgtcggcgtc tacacgacag aggccctgga aaggtgcggc aggctgggcg cccccgcccc
481 caggggccct ccctccccaa gccccccgga cgcgcctcac ccacgttcct ctcgcaggac
541 cttcctggct ttccccgcca cgaagaccta cttctcccac ctggacctga gccccggctc
601 ctcacaagtc agagcccacg gccagaaggt ggcggacgcg ctgagcctcg ccgtggagcg
661 cctggacgac ctaccccacg cgctgtccgc gctgagccac ctgcacgcgt gccagctgcg
721 agtggacccg gccagcttcc aggtgagcgg ctgccgtgct gggcccctgt ccccgggagg
781 gccccggcgg ggtgggtgcg gggggcgtgc ggggcgggtg caggcgagtg agccttgagc
841 gctcgccgca gctcctgggc cactgcctgc tggtaaccct cgcccggcac taccccggag
901 acttcagccc cgcgctgcag gcgtcgctgg acaagttcct gagccacgtt atctcggcgc
961 tggtttccga gtaccgctga actgtgggtg ggtggccgcg ggatccccag gcgaccttcc
1021 ccgtgtttga gtaaagcctc tcccaggagc agccttcttg ccgtgctctc tcgaggtcag
1081 gacgcgagag gaaggcgccg cccctcccca agga
//
LOCUS HUMITIH1A 1149 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human inter-alpha-trypsin inhibitor heavy chain mRNA, partial cds.
ACCESSION M33033
KEYWORDS inter-alpha-trypsin inhibitor heavy chain.
SOURCE Human, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1149)
AUTHORS Salier,J.-P., Diarra-Mehrpour,M., Sesbouee,R., Bourguignon,J. and
Martin,J.-P.
TITLE Human inter-alpha-trypsin inhibitor: Isolation and characterization
of heavy (H) chain cDNA clones coding for a 383 amino-acid sequence
of the H chain
JOURNAL Biol. Chem. Hoppe-Seyler 369, 15-18 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 1149 inter-alpha-trypsin inhibitor heavy chain (AA
at 1) /hgml_locus_uid="LE0221G"
/nomgen="ITIH1" /map="3p21.2-p21.1"
BASE COUNT 332 a 292 c 253 g 272 t
ORIGIN
1 ggaggcacaa acatcaacga agcactccta cgggcaatct tcattttgaa tgaagccaat
61 aacttgggac tgttagaccc caactccgtc tcgctgatca ttttggtttc tgatggagat
121 ccaacagtgg gcgaactaaa actgtcaaaa attcagaaaa acgttaagga gaacatccaa
181 gacaatatct ccttgttcag tttgggcatg ggatttgatg tggactatga ttttttgaag
241 agactgtcca atgaaaacca tggaattgca caaaggattt atggaaacca ggacacgtct
301 tcccagctta agaaattcta caaccaggtc tccactccat tgctccggaa tgttcagttc
361 aactatcccc atacatcagt cacggacgtc actcaaaaca atttccataa ctactttgga
421 ggctcagaga ttgtggtggc aggaaaattt gaccctgcta aattggatca aatagagagc
481 gttatcacgg cgacttcggc taacacgcag ttagtcttgg agaccctggc ccagatggac
541 gacttgcagg attttctatc gaaagacaag catgcagatc ccgatttcac caggaaactg
601 tgggcctatc taaccatcaa ccaactgcta gctgaacgaa gcctggctcc tacagctgcc
661 gccaagagaa gaattacaag atcgatcctg cagatgtctc tagaccacca cattgtgact
721 ccgctgacct cgctggtgat cgagaacgag gctggggatg agcgcatgct ggcggatgcc
781 ccaccgcagg atccctcctg ctgctcaggg gccctgtatt acggcagcaa agtggttcca
841 gattccaccc cgtcttgggc caatccttca gcaacgcccg tgatctccat gctggcacaa
901 ggatctcagg tgctagagtc cacgccaccc ccacatgtga tgagagttga aaatgaccca
961 cattccatca tttatctacc aaaaagccaa aagaacattt gtttcaatat tgactcagaa
1021 cctggaaaaa tcctcgacct ggcttctgac ccagaatcag gaattgtagt caacggtcag
1081 cttgttggtg ccaagaagcc caacaatgga aaactaagca cctattttgg aaaactggga
1141 ttttatttc
//
LOCUS HUMPTHROM 327 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human thrombin mRNA, 5'end.
ACCESSION M33031
KEYWORDS serine protease; thrombin.
SOURCE Human, cDNA to mRNA, clone pIIH13.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 327)
AUTHORS MacGillivray,R.T.A., Irwin,D.M., Guinto,E.R. and Stone,J.C.
TITLE Recombinant genetic approaches to functional mapping of thrombin
JOURNAL Ann. N.Y. Acad. Sci. 485, 73-79 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 28 > 327 thrombin precursor /hgml_locus_uid="LD0134L"
/nomgen="F2" /map="11p11-q12"
sigp 28 156 thrombin signal peptide
matp 157 > 327 prothrombin
BASE COUNT 60 a 97 c 109 g 61 t
ORIGIN
1 ccgtagtgac ccaggagctg acacactatg gcccgcatcc gaggcttgca gctgcctggc
61 tgcctggccc tggctgccct gtgtagcctt gtgcacagcc agcatgtgtt cctggctcct
121 cagcaagcac ggtcgctgct ccagcgggtc cggcgagcca acaccttctt ggaggaggtg
181 cgcaagggca acctggagcg agagtgcgtg gaggagacgt gcagctacga ggaggccttc
241 gaggctctgg agtcctccac ggctacggat gtgttctggg ccaagtacac agcttgtgag
301 acagcgagga cgcctcgaga taagctt
//
LOCUS MUSCC3A 312 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Mouse complement component C3 mRNA, partial cds.
ACCESSION M33032
KEYWORDS complement component C3.
SOURCE Mouse liver, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Fey,G.H., Wiebauer,K. and Domdey,H.
TITLE Amino acid sequences of mouse complement C3 derived from nucleotide
sequences of cloned cDNA
JOURNAL Ann. N.Y. Acad. Sci. 421, 307-312 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 complement component C3 precursor (AA at 1)
matp < 1 27 complement component C3-beta subunit (AA at 1)
matp 40 273 complement component C3-alpha subunit
matp 274 > 312 complement component C3-alpha' subunit
BASE COUNT 90 a 77 c 90 g 55 t
ORIGIN
1 gatcttgagt gcaccaagcc agcagcccgc cgccgtcgct cagtacagtt gatggaaaga
61 aggatggaca aagctggtca gtacactgac aagggtcttc ggaagtgttg tgaggatggt
121 atgcgggata tccctatgag atacagctgc cagcgccggg cacgcctcat cacccagggc
181 gagaactgca taaaggcctt catagactgc tgcaaccaca tcaccaagct gcgtgaacaa
241 cacagaagag accacgtgct gggcctggcc aggagtgaat tggaggaaga cataattcca
301 gaagaagata tt
//
LOCUS MUSN038A 1260 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Mouse nucleolar protein N038 mRNA, complete cds.
ACCESSION M33212
KEYWORDS nucleolar protein N038.
SOURCE Mouse teratocarcinoma stem cell line F9, cDNA to mRNA, clone
lambda-FML-185.19.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1260)
AUTHORS Schmidt-Zachmann,M.S. and Franke,W.W.
TITLE DNA cloning and amino acid sequence determination of a major
constituent protein of mammalian nucleoli: Correspondence of the
nucleoplasmin-related protein N038 to mammalian protein B23
JOURNAL Chromosoma 96, 417-426 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 79 957 nucleolar protein N038
mRNA < 1 1260 nucleolar protein N038 mRNA
BASE COUNT 419 a 214 c 306 g 321 t
ORIGIN
1 ggcgcgtctg ttctgtggaa caggaggcag ttgttttccg tccggcttct cccacaccga
61 agtgcgcgcc tccacctcat ggaagactcg atggatatgg acatgagtcc tcttaggcct
121 cagaactacc ttttcggctg tgaactaaag gctgacaaag actatcactt taaagtggat
181 aatgatgaaa atgagcacca gttgtcatta agaacggtca gtttaggagc aggggcaaaa
241 gatgagttac acatcgtaga ggcagaagca atgaactatg aaggcagtcc aattaaagta
301 acactggcaa ctttgaaaat gtctgtacaa ccaacagttt ccctaggggg ctttgaaatt
361 acaccacctg tggtcttacg gttgaagtgt ggttcagggc ctgtgcacat tagtggacag
421 catctagtag ctgtagagga agatgcagag tctgaagatg aagatgagga ggacgtaaaa
481 ctcttaggca tgtctggaaa gcgatctgct cctggaggtg gtaacaaggt tccacagaaa
541 aaagtaaaac ttgatgaaga tgatgaggac gatgatgagg acgatgagga tgatgaggat
601 gatgatgatg atgattttga tgaagaggaa actgaagaaa aggtcccagt gaagaaatct
661 gtacgagata ccccagccaa aaatgcacaa aaatcaaacc aaaatggaaa agacttaaaa
721 ccatcaacac cgagatcaaa gggtcaagag tccttcaaaa aacaggaaaa gactcctaaa
781 acaccaaaag gacctagttc tgtagaagac attaaggcaa aaatgcaagc aagtatagaa
841 aaaggcggtt ctcttcccaa agtggaagcc aagttcatta attatgtgaa gaattgtttc
901 cggatgactg accaggaggc tattcaagat ctctggcagt ggaggaaatc tctttaagaa
961 aagggtttaa acagtttgaa atattctgtc ttcatttctg taatagttaa tatctggctg
1021 tcctttttat aatgcaaagt gagaactttc cctactgtgt ttgataaatg ttgtccaggt
1081 tcacttgcca agaatgtgtt gtctaaaatg cctgtttagt tttcaaggat ggaactccac
1141 cctttacttg gttttaagta tgtatggaat gttatgatag gacatagtaa tagtggtcag
1201 atgtggaaat ggtagggaga caaatataca tgtgaaataa actcagtatt ttaataaagt
//
LOCUS RATPOS 1804 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Rat type-2A protein phosphatase catalytic subunit mRNA, complete
cds.
ACCESSION M33114
KEYWORDS type-2A protein phosphatase catalytic subunit.
SOURCE Rat liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1804)
AUTHORS Kitagawa,Y., Tahira,T., Ikeda,I., Kikuchi,K., Tsuiki,S.,
Sugimura,T. and Nagao,M.
TITLE Molecular cloning of cDNA for the catalytic subunit of rat liver
type 2A protein phosphatase, and detection of high levels of
expression of the gene in normal and cancer cells
JOURNAL Biochim. Biophys. Acta 951, 123-129 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 114 1043 type-2A protein phosphatase catalytic subunit
mRNA < 1 1804 type-2A protein phosphatase catalytic subunit
mRNA
BASE COUNT 482 a 391 c 437 g 494 t
ORIGIN
1 ctggggccgc aggaagcacc ccggggagcg gcggcggcgt gtgcgtgtgg cccgggtgcg
61 ggcggcggcg cgggagcagc gcagagcggc agccggttcg ggcgggcggc atcatggacg
121 agaagttgtt caccaaggag ctggaccagt ggatcgagca gctgaacgag tgcaagcagc
181 tctccgagtc ccaggtcaag agcctctgcg agaaggctaa agaaatcctg acaaaagaat
241 ctaatgttca ggaggttcga tgtccagtca ctgtgtgtgg agatgtgcat gggcaatttc
301 atgacctcat ggaactcttt agaattggtg gtaaatcacc agatacaaat tacttgttta
361 tgggagacta tgtggacaga ggatattact cagttgaaac agttacactg cttgtagctc
421 ttaaggttcg ttaccgagag cgtatcacca tactccgagg gaatcacgag agcagacaga
481 tcacacaagt ttatggtttc tacgatgagt gtttaaggaa atacggaaat gcaaatgttt
541 ggaaatactt cacagacctt tttgactacc ttcctctcac tgccttggtg gatgggcaga
601 tcttctgtct acatggtggt ctttcaccat ccatagacac actggatcac atccgagcac
661 ttgatcgcct acaagaagtt cctcatgagg gtccaatgtg tgacttgctg tggtcagatc
721 cagatgaccg tggtggctgg gggatatctc ctcggggagc tggttatacc tttggccaag
781 atatttctga gacatttaat catgccaatg gcctcacgtt ggtgtccaga gctcaccagc
841 tggtgatgga gggatataac tggtgccatg accggaatgt agtaacaatt ttcagtgctc
901 caaactattg ctatcgttgt ggtaaccaag ctgcaatcat ggaacttgat gacactctta
961 agtattcttt cttgcagttc gatccagcac ctcgtagagg cgagccacat gtcactcgtc
1021 gtaccccaga ctacttcctg taatgaaagt ttaaccttgt acagtattgc catgaacacc
1081 gtctgttgac ctaatggaat cgggaagagc agcagtaact ccaaagtgtc agaaatagtt
1141 aacattcaaa cttgtttcca cacggaccaa aagatgtgcc atataaaata caaagcctct
1201 tgtcatcaac agccgtgacc actttagaat gaaccagttc attgcatgct gacgcgacat
1261 tgttggtcaa gaatccagtt tctggcatag cgctatttgt agttactttt gctttcttga
1321 gagactgcag atctaggatg taacattaac acctgtgagt ccagttgact tccacttagc
1381 tgtagcttac tcagcatgac tgtagatgag gatagcaaac aatcattgga gcttaatgaa
1441 catttttaaa tgagtaccaa ggcctcccct cttgttgtgt tctttcaggg atactattaa
1501 tttaattgta tgatttctct gcactcagtt tctcccttct caaatctcgg ccccgcgttg
1561 ttctttgtta ctgtcagaaa acctggtgag ttgttttgaa cagaactgtc tccctcctgt
1621 aagatgatgt actgcacaag tcaccgcagt gttttcataa taaacttgag aactgagaaa
1681 gtcaggtttg aattgtatca gtgggcacga ctggtgctgt ttattaaaca agataaatct
1741 attgatcaat ttcagaattt gtagaattcc aggtaaagaa aaataaagat caaggccact
1801 atat
//
LOCUS RATSCP2 1409 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Rat sterol carrier protein-2 (SCP-2)mRNA, complete cds.
ACCESSION M34728
KEYWORDS sterol carrier protein-2.
SOURCE Rat liver, cDNA to mRNA, clone SP43.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1409)
AUTHORS Billheimer,J.T., Strehl,L.L., Davis,G.L., Strauss,J.F.III. and
Davis,L.G.
TITLE Characterization of a cDNA encoding rat sterol carrier protein-2
JOURNAL DNA Cell Biol. 9, 159-165 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 307 1128 sterol carrier protein-2 (SCP-2)
mRNA < 1 1409 sterol carrier protein-2 mRNA
BASE COUNT 387 a 300 c 390 g 332 t
ORIGIN 2 bp upstream of EcoRI site.
1 ggaattccga acaaaggttg aacactttgc aaaaattgga tggaaaaatc ataaacactc
61 agttaataac ccgtattccc agttccaaga tgaatacagc ttagatgaga taatgaaatc
121 aaggccagtt ttcgattttc tgactgtctt acaatgctgt cccacctcag atggtgccgc
181 agcagcaatt gtgtctagtg aggagtttgt gcagaagcat ggcctgcagt ccaaagctgt
241 ggaaattgtg gcacaggaga tggtgactga catgcccagt acatttgaag aaaaagtgtt
301 attaaaatgg ttggctatga tatgagtaaa gaagctgcca ggaagtgcta tgagaagtcc
361 ggcctgggtc ccagtgatgt cgacgtgata gagcttcacg attgcttctc taccaatgaa
421 ctcctgactt atgaagcact ggggctctgt ccagaaggac aaggtggagc actggtggac
481 agaggggaca acacttacgg aggaaagtgg gtcataaacc ctagtggagg cctcatctcc
541 aagggacacc cactgggtgc cacaggtctg gctcagtgcg cggagctctg ctggcagctg
601 agaggcgaag ccggaaagag gcaggttcct ggggcaaagg tggctctgca gcacaattta
661 ggccttggag gagctgctgt tgtcaccctc tacagaatgg gttttcccga agctgccagc
721 tccttcagaa cgcaccagat ttcagctgct cccaccagct ctgcagggga tggattcaag
781 gcaaatctca tttttaagga aatcgagaag aagcttgaag aggaagggga agagttcgtg
841 aagaaaatcg gtggcatttt tgccttcaaa gtgaaggatg gccccggggg caaagaagct
901 acgtgggtgg tggacgtgaa gaacggcaaa ggatcggtgc ttccggattc agataagaag
961 gctgactgca caatcaccat ggctgactca gacttgctgg ctttgatgac tggtaaaatg
1021 aaccctcagt cggccttctt tcaaggtaaa ctgaaaattg ccggtaacat gggcctggcc
1081 atgaaactgc aaagcctgca gcttcagccg gacaaagcta agctgtgaag agtccctttg
1141 gcaacctcag gacatcaaga tgagatgtgt ggatacgtag aaatccacgt ctccctgtca
1201 ggacttagac tgacacttcc tgaatagcat gagatagatt tcttgctagg tggctatggc
1261 caattgtatt tcccccaagc tgggggtgca aagggcctcc caggctacac tgctgctttg
1321 aggacttgca ttctactgtg cttcatgaag ctactatgtt aatgatggtt tggggtaaac
1381 ttgagtttca gaataaagtt cagaatagt
//
LOCUS SYNPSBAII 556 bp ds-DNA BCT 12-JUL-1990
DEFINITION Synechococcus sp. photosystem II D1 protein (psbAII) gene, 5' end.
ACCESSION M34833
KEYWORDS D1 protein; photosystem II.
SOURCE Synechococcus (strain PCC 7942) DNA.
ORGANISM Synechococcus sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Chroococcales.
REFERENCE 1 (bases 1 to 556)
AUTHORS Bustos,S.A., Schaefer,M.R. and Golden,S.S.
TITLE Different and rapid responses of four cyanobacterial psbA
transcripts to changes in light intensity
JOURNAL J. Bacteriol. 172, 1998-2004 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 81 425 ORF1
pept 527 > 556 photosystem II D1 protein (psbAII)
mRNA 59 > 556 psbAII mRNA (alt.)
mRNA 478 > 556 psbAII mRNA (alt.)
BASE COUNT 136 a 145 c 142 g 133 t
ORIGIN
1 ttccgtgacg gctactgcca gcatgccgag cctgatgtgt gacacctaag atcactccag
61 ttctctttgg aaactggctg atgagtgaag acaccatctt tggcaagatc atccggcgcg
121 agattccagc agacattgtt tatgaagatg atctctgtct ggcttttcga gatgtggcac
181 cccaagcgcc ggttcacatt ctggtgattc ccaagcaacc aattgccaac cttttggaag
241 cgacagcaga acatcaagcg ctgctgggtc atttgttgct gactgtaaag gcgatcgcgg
301 cccaagaagg actcaccgag ggctaccgca ccgtgattaa cacgggccct gcgggtgggc
361 aaaccgttta ccacctgcat attcacttac tgggcgggcg atcgctggct tggccgcccg
421 gctgagaaaa gtctgaaagt tctttacaaa actcaatctg cttgttagat tttactcacg
481 aggctattaa gtctcgtaaa tagttcaact aaggactcat cgcaaaatga cgactgcatt
541 gcagcggcgc gagagc
//
LOCUS ABCAARAA 1624 bp ds-DNA BCT 12-JUL-1990
DEFINITION A.aceti acetic acid resistance protein (aarA) gene, complete cds.
ACCESSION M34830
KEYWORDS acetic acid resistance protein.
SOURCE A.aceti (strain 10-8) DNA, clone pAR1611.
ORGANISM Acetobacter aceti
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Azotobacteraceae.
REFERENCE 1 (bases 1 to 1624)
AUTHORS Fukaya,M., Takemura,H., Okumura,H., Kawamura,Y., Horinouchi,S. and
Beppu,T.
TITLE Cloning of genes responsible for acetic acid resistance in
acetobacter aceti
JOURNAL J. Bacteriol. 172, 2096-2104 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 185 1495 acetic acid resistance protein (aarA)
signal 1508 1545 transcription termination signal
binding 171 176 ribosomal binding site (put.)
BASE COUNT 400 a 446 c 404 g 374 t
ORIGIN
1 gcatgcattt gcacacattc gcgcgaccct aagcccaaaa aactgtggtt ttccaagcat
61 actcctttcc gataacgctt cgtttatcgc tggcaacctt ccggtttcct tttgaatgag
121 tgacaaagtg tgacgagcag gccgcagcag cgaccgtggc ccaaccatgc agaaggaaac
181 actaatgagc gcgtcgcaga aagaaggtaa gctatctacc gctaccattt cggttgatgg
241 aaaatccgcc gaaatgcctg tgctttcagg cactctggga ccggatgtta tcgacatccg
301 caaacttccg gcgcaactgg gcgttttcac gtttgaccca ggttacgggg aaacagcggc
361 ctgcaacagc aaaatcacct ttattgatgg tgataaaggc gttctgctgc accgtggtta
421 ccctattgcg cagctggacg aaaatgcttc ctacgaagaa gttatttatc tgcttttgaa
481 tggcgaactg cccaacaagg tgcagtacga caccttcacc aacaccctta caaaccatac
541 gctgctgcac gagcagatcc gtaacttctt taacggcttc cggcgtgatg cccacccaat
601 ggccattctg tgtggtacgg ttggggcttt gtctgccttc tacccagatg ccaacgatat
661 tgccattccc gccaatcggg atctggccgc catgcggctg attgccaaaa tcccaaccat
721 tgcggcatgg gcttacaaat acacgcaggg tgaagccttt atctacccgc ggaatgatct
781 gaactacgca gaaaacttcc tgtccatgat gttcgcgcgc atgtccgaac cttacaaggt
841 caaccctgtt ctggcccgcg ccatgaaccg gattctgatt ctgcatgccg atcatgagca
901 gaatgcctct acctccaccg tacgtctggc tggttctaca ggggccaatc cgtttgcctg
961 tattgctgcg ggcattgccg ctctgtgggg acctgcacat ggtggcgcaa acgaagctgt
1021 gctgaaaatg ctggcccgta ttggcaagaa agaaaatatt cctgccttta tcgcacaggt
1081 gaaggacaag aacagcggcg taaagctgat gggctttggc caccgcgttt acaagaactt
1141 cgacccacgt gcgaagatca tgcagcagac ctgccacgaa gtgctgacag aacttggcat
1201 taaggatgat ccgctgctgg atctggcggt tgagctggaa aagattgctc tgagcgatga
1261 ttacttcgtg cagcgcaaac tttacccgaa tgtggatttc tactctggca tcattctcaa
1321 ggccatgggc atccccacca gtatgtttac tgtgctgttt gccgtagccc gcaccaccgg
1381 ctgggtgagc cagtggaagg aaatgattga agaaccgggc cagcgtatca gccgccctcg
1441 ccagctttat attggcgcac cgcagcgtga ctatgtgccg cttgccaaac gctaaaacag
1501 actaacccaa aaagccgact tcccgtaagg aaagtcggct ttttgtttgc acgctgtttc
1561 caaaaaaata gggcggcaga gcgaataaac gctacctagc cttcaggcat aaaaaaacgc
1621 atgc
//
LOCUS BOVBADPTA 708 bp ss-mRNA MAM 12-JUL-1990
DEFINITION Cow beta adaptin mRNA, partial cds.
ACCESSION M34177 J05273
KEYWORDS beta adaptin.
SOURCE Cow brain, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 708)
AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and
Parham,P.
TITLE Conservation and diversity in families of coated vesicle adaptins
JOURNAL J. Biol. Chem. 265, 4814-4820 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 708 beta adaptin (AA at 1)
BASE COUNT 198 a 159 c 175 g 176 t
ORIGIN
1 gctgtgaaga aagtgattgc tgctatgact gtggggaaag acgttagctc tctctttcca
61 gatgtagtga actgtatgca gacggataat ctggaactga agaagcttgt gtatctctac
121 ttgatgaact atgccaagag tcagccagac atggccatca tggctgtcaa cagctttgtg
181 aaggattgtg aagatcccaa tcctctgatt cgagctttgg cagtcagaac catggggtgc
241 atccgggtgg acaagataac agagtatctc tgtgagcccc tccgcaagtg cttaaaggat
301 gaagatccct acgtccggaa gacagcagca gtctgcgtgg caaaactcca tgacatcaat
361 gcccagatgg tggaagatca gggatttctg gattctctgc gggatctcat agcagattca
421 aatccaatgg tggtggctaa tgctgtagca gcactatctg aaatcagtga atctcacccc
481 aacagcaact tactcgatct gaatccacag aacattaata agctactgac agccctgaat
541 gagtgcaccg aatggggcca gattttcatc ctggactgct tatctaatta caatcctaaa
601 gatgaccggg aggctcagag catctgtgag cgggtaactc cccggttatc tcatgccaac
661 tcagcagtgg tgctttcagc agtaaaagtc ctaatgaaat ttttggaa
//
LOCUS BSUSENSA 1773 bp ds-DNA BCT 12-JUL-1990
DEFINITION B.subtilis transcription regulatory protein (senS) gene, complete
cds.
ACCESSION M34826 M30611
KEYWORDS transcription regulatory protein.
SOURCE B.subtilis (strain DB2) DNA, clone pWL[77,80].
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1773)
AUTHORS Wang,L.-F. and Doi,R.H.
TITLE Complex character of senS, a novel gene regulating expression of
extracellular-protein genes of Bacillus subtilis
JOURNAL J. Bacteriol. 172, 1939-1947 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.H.Doi, 11-DEC-1989, for release after publication.
FEATURES from to/span description
pept 1486 1683 transcription regulatory protein senS
binding 1470 1477 ribosomal binding site (put.)
site 1455 1474 transcription termination signal
site 1671 1698 rho-independent transcription terminator
BASE COUNT 460 a 397 c 388 g 528 t
ORIGIN
1 agttcttgga aattctgatt ttcgatatct ggcgaattta cgtagtctcc catcgtttct
61 ttcgaaaggg acgttctcag cccctcaatc cagcggacat tttgtctttt ttctccaggg
121 gatgtccagt ttgttaagta ttcctgggcg atgattgcgt cacgataata aaatgccgtt
181 tggtcgggag cgacccgtcc ggctgccccg ccgagtgctt gctgccagac actggcgttt
241 tgattcggag cgtgctctaa aaagtgtttt attgttgaga tcgcacgttc tgataatggc
301 ttttcaatga aagagccgga gcgtttcatt ttttgaggct gattgcctcc cgggctgtta
361 aaaaaggtta ccgcttcaat gaatggcgtt gtttttacca ttccgcttga cggacttcct
421 gctttcaata aaggctttaa cagttttttt aactctgttt ttggcccgac aaattggccg
481 agggcttcta tgcggtttac ttctttaggc caaaactcta ttgatgatgt aagccggtca
541 tctgtatacg gggcccagtt ctgccacgtg ttatatactt cctcaaaatc atcccatccc
601 catgtaatag aaaaaatcga cacttgagag atgggcactg ctttaaatgt catggaggtg
661 actatgccga aattgcctcc tccgcctccc tgagacgccc aaaatgtgga tgatttgaac
721 agctgactgt aatcagatca gcgccctctt tttcgtctgc tacgatcatc tcaagctgca
781 cgaggctgtc gcaagtaaga ccggcagccc ttgttaaaag tccaattccc cctccgagag
841 ttaaacctgt gagccctaca ttagcaatgg tgcctgcggg aagcgtcagg ccgtattgcc
901 agagtgtccg atagacttct cccaattcag cccccgcttc aatataggcc agctttttat
961 cctgattcac agttattttt ttcatctcgc ttaaatcaat aacaagaccg ttatttaaaa
1021 gggaaaagtt ctcatagctg tgtctgccgc ctctaatacg gaaaggcaca cggttttcac
1081 gcgcccattt cagcgcattg agtgcatcct gtttgttttg gcaaaacaca atgatgtcag
1141 atcctttcta agcttaggtt aatattggtt cttgcttcgt tatagtccgg atcatcccgt
1201 gtcacgatac gtccggtcaa ttttgtcttt tccacactcc cacatctctt tctctcgtat
1261 tctagtttct ctagcttatg cgtcagggga aaagagtgta taaggaaaaa gcggggatgc
1321 aatctgatac agtgtcaaca ccctcaaaaa atagttgaca ggtcggtatt gtatgaatta
1381 acatggtcag tacaaatttt tcaaatttat cgcgctgatc ggaacaccga aggctcttat
1441 cgtttagata agggcctttt ttgtatgaaa aaggggggat tattgatggg agtcaaaaaa
1501 gaaaagggga gaaaacgatt caggaagcga aaaacctacg ggaatcagat tttgccgctt
1561 gagctgctga ttgaaaaaaa caaacgagag attataaaca gcgcggaact catggaagaa
1621 atttatatga agattgatga gaagcatacg caatgtgtaa ctaaatataa aaaaacccgc
1681 tgactacaac gggtttttgc atttctccat taagaatctt ttttaatcgg caatccaagg
1741 ccttctgcca cgcgttttcc gtattcagga tcc
//
LOCUS CHITDNA 176 bp ds-DNA INV 12-JUL-1990
DEFINITION C.thummi telomeric DNA.
ACCESSION M33211
KEYWORDS telomeric DNA.
SOURCE C.thummi heat-shocked larvae, cDNA to mRNA, clone lambda-Cth5.
ORGANISM Chironomus thummi
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Chironomidae.
REFERENCE 1 (bases 1 to 176)
AUTHORS Carmona,M.J., Morcillo,G., Galler,R., Martinez-Salas,E.,
de la Campa,A.G., Diez,J.L. and Edstroem,J.E.
TITLE Cloning and molecular characterization of a telomeric sequence from
a temperature-induced Balbiani ring
JOURNAL Chromosoma 92, 108-115 (1985)
STANDARD simple staff_entry
BASE COUNT 63 a 34 c 27 g 52 t
ORIGIN Chromosome III.
1 aattctagaa aaatcgagtt ttttcgaaaa catgaaaatt ttttttctct catcctagaa
61 caagtgtttt agacctcaaa acagatgtga acataaaagt gatgtattga caaaagttgc
121 tccaaactga gatgcatcca acgtgatatc gatatcccat gtacccccct atggaa
//
LOCUS ECOSUHBA 1017 bp ds-DNA BCT 12-JUL-1990
DEFINITION E.coli extragenic suppressor (suhB) gene, complete cds.
ACCESSION M34828
KEYWORDS extragenic suppressor; suhB gene.
SOURCE E.coli DNA, clone pRY61.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1017)
AUTHORS Yano,R., Nagai,H., Shiba,K. and Yura,T.
TITLE A mutation that enhances synthesis of sigma-32 and suppresses
temperature-sensitive growth of the rpoH15 mutant of Escherichia
coli
JOURNAL J. Bacteriol. 172, 2124-2130 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 194 997 suhB protein
signal 122 127 -35 region
signal 145 150 -10 region
BASE COUNT 244 a 278 c 259 g 236 t
ORIGIN 55 min on K12 map.
1 catggcacgg gcaacagaac ccatattgcc ggtgtgtgac gtctccacca gcacaattcg
61 aatattttgc agcattgtct ttcttcatct aaagattatt cacgcatctt atcataaaac
121 gaagacagat gccgatctcg ctgctatact ctgcgccgtt ttcccgttct ttaacatcca
181 gtgagagaga ccgatgcatc cgatgctgaa catcgccgtg cgcgcagcgc gcaaggcggg
241 taatttaatt gccaaaaact atgaaacccc ggacgctgta gaagcgagcc agaaaggcag
301 taacgatttc gtgaccaacg tagataaagc tgccgaagcg gtgattatcg acacgattcg
361 taaatcttac ccacagcaca ccatcatcac cgaagaaagc ggtgaacttg aaggtactga
421 tcaggatgtt caatgggtta tcgatccact ggatggcact accaacttta tcaaacgtct
481 gccgcacttc gcggtatcta tcgctgttcg tatcaaaggc cgcaccgaag ttgctgtggt
541 atacgatcct atgcgtaacg aactgttcac cgccactcgc ggtcagggcg cacagctgaa
601 cggctaccga ctgctcggca gcaccgctcg cgatctcgac ggtactattc tggcgaccgg
661 cttcccgttc aaagcaaaac agtacgccac tacctacatc aacatcgtcg gcaaactgtt
721 caacgaatgt gcagacttcc gtcgtaccgg ttctgcggcg ctggatctgg cttacgtcgc
781 tgcgggtcgt gttgacggtt tctttgaaat cggtctgcgc ccgtgggact tcgccgcagg
841 cgagctgctg gttcgtgaag cgggcggcat cgtcagcgac ttcaccggtg gtcataacta
901 catgctgacc ggtaacatcg ttgctggtaa cccgcgcgtt gttaaagcca tgctggcgaa
961 catgcgtgac gagttaagcg acgctctgaa gcgttaatga ctcaggcggg tgatatc
//
LOCUS HUMBADPTA 5701 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human beta adaptin mRNA, complete cds.
ACCESSION M34175 J05273
KEYWORDS beta adaptin.
SOURCE Human fibroblast, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 5701)
AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and
Parham,P.
TITLE Conservation and diversity in families of coated vesicle adaptins
JOURNAL J. Biol. Chem. 265, 4814-4820 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 178 2991 beta adaptin
mRNA < 1 5701 beta adaptin mRNA
signal 5683 5688 polyA signal
BASE COUNT 1528 a 1373 c 1284 g 1516 t
ORIGIN
1 ctgcccacca tctttgtccc tggcaaagtg ggttttgcgc agtggcttag acctagaaaa
61 gaatcgtgac gggcaggaaa ccattacacc accacctggg ctgtgctctc cggctcccgc
121 cgccaccccc gccctcgcct tcgcctccgc tccggtgcac attaaagatc caaagtcatg
181 actgactcca agtatttcac aaccaataaa aaaggagaaa tatttgaact aaaagctgaa
241 ctcaacaatg aaaagaaaga aaagagaaag gaggctgtga agaaagtgat tgctgctatg
301 accgtgggga aggatgttag ttctctcttt ccagacgtag tgaactgtat gcagactgac
361 aatctggaac taaagaagct tgtgtatctc tacttgatga actacgccaa gagtcagcca
421 gacatggcca tcatggctgt aaacagcttt gtgaaggact gtgaagatcc taatcctttg
481 attcgagcct tggcagtcag aaccatgggg tgcatccggg tagacaaaat tacagaatat
541 ctctgtgagc cgctccgcaa gtgcttgaag gatgaggatc cctatgttcg gaaaacagca
601 gcagtctgcg tggcaaaact ccatgatatc aatgcccaaa tggtggaaga tcagggattt
661 ctggattctc tacgggatct catagcagat tcaaatccaa tggtggtggc taatgccgta
721 gcggcattat ctgaaatcag tgagtctcac ccaaacagca acttacttga tctgaaccca
781 cagaacatta ataagctgct gacagccctg aatgaatgca ctgaatgggg ccagattttc
841 atcctggact gcctgtctaa ttacaaccct aaagatgatc gggaggctca gagcatctgt
901 gagcgggtaa ctccccggct atcccatgcc aactcagcag tggtgctttc agcggtaaaa
961 gtcctaatga agtttctaga attgttacct aaggattctg actactacaa tatgctgctg
1021 aagaagttag cccctccact tgtcactttg ctgtctgggg agccagaagt gcagtatgtc
1081 gccctgagga acatcaactt aattgtccag aaaaggcctg aaatcttgaa gcaggaaatc
1141 aaagtcttct ttgtgaagta caatgatccc atctatgtta aactagagaa gttggacatc
1201 atgattcgtt tggcatctca agccaacatt gctcaggttc tggcagaact gaaagaatat
1261 gctacagagg tggatgttga ctttgttcga aaagctgtgc gggccattgg acggtgtgcc
1321 atcaaggtgg agcaatctgc agagcgctgt gtaagcacat tgcttgatct aatccagacc
1381 aaagtgaatt atgtggtcca agaagcaatt gttgtcatca gggacatctt ccgcaaatac
1441 cccaacaagt atgaaagtat catcgccact ctgtgtgaga acttagactc gctggatgag
1501 ccagatgctc gagcagctat gatttggatt gtgggagaat atgctgaaag aattgacaat
1561 gcagatgagt tactagaaag cttcctggag ggttttcacg atgaaagcac ccaggtgcag
1621 ctcactctgc ttactgccat agtgaagctg tttctcaaga aaccatcaga aacacaggag
1681 ctagtccagc aggtcttgag tttggcaaca caggattctg ataatcctga ccttcgagac
1741 cggggctata tttattggcg ccttctctca actgaccctg ttacagctaa agaagtagtc
1801 ttgtctgaga agccactgat ctctgaggag acggacctta ttgagccaac tctgctggat
1861 gagctaatct gccacattgg ttctttggcc tctgtgtatc ataagcctcc caatgctttt
1921 gtggaaggaa gtcatggaat tcatcgtaaa cacttgccaa ttcatcatgg gagcactgat
1981 gcaggtgaca gccctgttgg cactaccact gcaacgaacc tggaacagcc tcaggttatc
2041 ccctctcaag gtgatcttct aggggatctt ttaaaccttg acctcggtcc cccagtcaat
2101 gtgccacagg tgtcctccat gcagatggga gcagtggatc tcctaggagg aggactagat
2161 agtctggtgg gacaatcctt catcccatca tcggtgcctg caacctttgc tccttcacct
2221 acacctgctg tggtcagcag tggactgaat gacctgtttg aactctccac agggataggc
2281 atggcacctg gtggatatgt ggctcctaag gctgtctggc tacctgcagt aaaggctaaa
2341 ggcttggaga tttccggaac atttactcac cgccaagggc acatctatat ggaaatgaac
2401 ttcaccaata aagctctgca gcacatgaca gattttgcaa tccagtttaa caaaaatagc
2461 tttggtgtca tccccagcac tcctctggcc atccatacac cactgatgcc aaaccagagc
2521 attgatgtct ccctgcctct caataccttg ggcccagtca tgaagatgga acctctgaat
2581 aacctccagg tggctgtgaa aaacaatatc gatgtcttct acttcagctg cctcatccca
2641 ctcaatgtgc tttttgtaga agatggcaaa atggagcgcc aggtcttcct tgcaacatgg
2701 aaggatattc ccaatgaaaa tgaacttcag tttcagatta aggaatgtca tttaaatgct
2761 gacactgttt ccagcaagtt gcaaaacaac aatgtttata ctattgccaa gaggaatgtg
2821 gaagggcagg acatgctgta ccaatccctg aagctcacta atggcatttg gattttggcc
2881 gaactacgta tccagccagg aaaccccaat tacacgctgt cactgaagtg tagagctcct
2941 gaagtctctc aatacatcta tcaggtctac gacagcattt tgaaaaacta acaagactgg
3001 tccagtaccc ttcaaccatg ctgtgatcgg tgcaagtcaa gaactcttaa ctggaagaaa
3061 ttgtattgct gcgtagaatc tgaacacact gaggccacct agcaaggtag taactagtct
3121 aacctgtgct aacattaggg cacaacctgt tggatagttt tagcttcctg tgaacatttg
3181 taaccactgc ttcagtcacc tcccacctct tgccacctgc tgctgctatc tgtccttact
3241 tgtgggcttc tccatgctgt gccaatggct ggctttttct acaccctctt ttgagtgtag
3301 tttggtattt tgtaattgag agctcatttc aaaagcagaa aaagacaaca aatattaaag
3361 caaggaaaag tgtaactgaa acactgcact ttactgtttt atacttttgt acatatgaga
3421 aatcaaggga ttagtgcaac cagtagaagg cattgaaatg actgtcatta accacacagt
3481 cctggaggca gagatgcagt tacctaccct agcttttgat gggttctctt acctgtagta
3541 gccttatccc tggtcatttg gattttcagt ttgctttttt ctttttttcc cctccaaact
3601 ccttttcctt ggccaagcct tcatgcttcc ccctttccat attataatct catttgattg
3661 ctctgcagtt gggaacggtg atcttcttga atgatgtttc agtgtgcaaa aactatagag
3721 cctgtcagca ccaaagctga cagaagttat accttactcc tttcctttcc cctgaacaaa
3781 cctgctaatc ccactaattc aggaatttga gtagagatgg ggaacaagaa cccagatgct
3841 gtcccctcac cccctctcct gtatttctca ggtccagttc aaatctaaaa ttctactttt
3901 agagttgaaa cagagtaata acttatctaa ccctcttttc ctacaaagga gaaagataaa
3961 aggcacaaag gttaccgcca aggcccgtca gctgtgtagt ggcaaagccg agaccgagtc
4021 tcctaagtcc ccgtcagtgt ggttttcacc acaggactgt ctcttgtcgt tttcccctaa
4081 tgccttctcc tgccttttct gtgcctagtt tttggctctt cacatattcc atattgattt
4141 tgacgctctg tatattggca tcaggtggca gctgaatatc ttttgaatta ctcgaaggta
4201 aagccagatg ccagaatgaa ggtgtagcca gtgtttccca tatgcccctg gagccccact
4261 tattgaggcc agcagaatag gtgcagagat gaagtgagct tagagatgtt gcaaatgctc
4321 tttatccctt cagctctctg atctgctctt tcttcatgat acttagtctg cagggcatat
4381 taagatcatc ccagaggttc aggcagttcc tgtcatctct gaaaagactg ggggatatga
4441 aatcttcccc ctaccccact taatgcgttg gatatgattt ttcaaagaat gcttcatgcc
4501 caaaatacca gcctgtttag cagtgttaca ctgtttgatc tgcgggcact tgttgcattg
4561 cctggcaccc aatattcagg gtccatgact aagactggtc ttctcagatg ccctgcttaa
4621 atcaggggca cttcaggctc cacaggcgtc atgttggact gagacctaac tcactggact
4681 cagaggagga atcgtggaaa acaagagcaa aactacccca cacccctatt tcatgtctga
4741 aataaccctg tttcatacca gttgcaaagc ttgtggggag cggtcccaca aagcactttc
4801 ttaaaccttg agaatctcca agagaaaaat atttggggaa ggagggagga aatatgtccc
4861 ttgcacacca cccctgaagc acatggcagt aggaaacagc ataggattgt atgtgggagg
4921 tggataggtc ggtgatgtgt ggagcggaaa agcaggttgg taaagttccc ttcttgggac
4981 ttattcctgg agtcagtgga tacaagtagt gcagaaggtt cacactgcaa atagtgttct
5041 catctcaaag caaactatca ttccagaagg aaaagtgtgt cagggcaagc agacaacaca
5101 atttcctatc agaatatgtc cctcaacccc cgaaacaagg cttctctcag cctccccacc
5161 agtgatggat aacagctcct attctcagct gacctgactg agccaaccca tgaactcttc
5221 actccttggg gaagccacct cccatcacac ccctgagcag agttagggag gaattctact
5281 tcccataaaa ggacctctcc tgagaggcaa aacctgttgc ctccaccacg gcttccctct
5341 tggctcattc caagcttggc caaattgggg aagtgggatg gaggttgccc tgcatccccc
5401 ctcctctgcc tgagtgtgtc tttgtaatgt cagctggcat catacaaaga gcaggagaag
5461 caaacaccca gaactctttt gctggtcaga gattccctga gtgtctgtcc tcacccaagc
5521 ctgctctgtg tctgtgttgt gaagcttgag actctggaaa gaaatgggga gggggggcag
5581 gggaaatgtt gccctaagaa tgcttctcat tcctctgttc ttattgggtc ctgtttttcg
5641 ggagggtggg ggttggggga agcttgacct tgtgtcttcg tcaataaact cacatttaca
5701 c
//
LOCUS HUMCD59A 1671 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human lymphocytic antigen CD59/MEM43 mRNA, complete cds.
ACCESSION M34671 X15861
KEYWORDS CD59 antigen; cell surface antigen; integral membrane protein.
SOURCE Human peripheral blood monocyte, cDNA to mRNA, clone R18..
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 108 to 443)
AUTHORS Sawada,R., Ohashi,K., Okano,K., Hattori,M., Minato,N. and Naruto,M.
TITLE Complementary DNA sequence and deduced peptide sequence for
CD59/MEM43 antigen, the human homologue of murine lymphocyte
antigen Ly-6c
JOURNAL Nucleic Acids Res. 17, 6728-6728 (1989)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 1671)
AUTHORS Sawada,R., Ohashi,K., Anaguchi,H., Okazaki,H., Hattori,M.,
Minato,N. and Naruto,M.
TITLE Isolation and expression of the full-length cDNA encoding CD59
antigen of human lymphocytes
JOURNAL DNA 9, 213-220 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer readable copy for sequence [1] kindly
provided by Naruto,M., 17-JUL-1989.
[1] Author address: Naruto,M.
Basic Research Laboratories
Toray Industries Inc
1111 Tebiro Kamakura 248, Japan.
FEATURES from to/span description
pept 30 416 antigen CD59 precursor (CD59)
/hgml_locus_uid="LY0169B" /nomgen="CD59"
/map="11pter-p13"
sigp 30 104 CD59 signal peptide
matp 105 413 CD59 protein
mRNA < 1 1671 CD59 mRNA
signal 527 532 polyA signal
BASE COUNT 434 a 347 c 390 g 500 t
ORIGIN
1 ggcgccgcca ggttctgtgg acaatcacaa tgggaatcca aggagggtct gtcctgttcg
61 ggctgctgct cgtcctggct gtcttctgcc attcaggtca tagcctgcag tgctacaact
121 gtcctaaccc aactgctgac tgcaaaacag ccgtcaattg ttcatctgat tttgatgcgt
181 gtctcattac caaagctggg ttacaagtgt ataacaagtg ttggaagttt gagcattgca
241 atttcaacga cgtcacaacc cgcttgaggg aaaatgagct aacgtactac tgctgcaaga
301 aggacctgtg taactttaac gaacagcttg aaaatggtgg gacatcctta tcagagaaaa
361 cagttcttct gctggtgact ccatttctgg cagcagcctg gagccttcat ccctaagtca
421 acaccaggag agcttctccc aaactccccg ttcctgcgta gtccgctttc tcttgctgcc
481 acattctaaa ggcttgatat tttccaaatg gatcctgttg ggaaagaata aaattagctt
541 gagcaacctg gctaagatag aggggctctg ggagactttg aagaccagtc ctgtttgcag
601 ggaagcccca cttgaaggaa gaagtctaag agtgaagtag gtgtgacttg aactagattg
661 catgcttcct cctttgctct tgggaagacc agctttgcag tgacagcttg agtgggttct
721 ctgcagccct cagattattt ttcctctggc tccttggatg tagtcagtta gcatcattag
781 tacatctttg gagggtgggg caggagtata tgagcatcct ctctcacatg gaacgctttc
841 ataaacttca gggatcccgt gttgccatgg aggcatgcca aatgttccat atgtgggtgt
901 cagtcaggga caacaagatc cttaatgcag agctagagga cttctggcag ggaagtgggg
961 aagtgttcca gatagcaggg catgaaaact tagagaggta caagtggctg aaaatcgagt
1021 ttttcctctg tctttaaatt ttatatgggc tttgttatct tccactggaa aagtgtaata
1081 gcatacatca atggtgtgtt aaagctattt ccttgccttt ttttattgga atggtaggat
1141 atcttggctt tgccacacac agttacagag tgaacactct actacatgtg actggcagta
1201 ttaagtgtgc ttattttaaa tgttactggt agaaaggcag ttcaggtatg tgtgtatata
1261 gtatgaatgc agtggggaca ccctttgtgg ttacagtttg agacttccaa aggtcatcct
1321 taataacaac agatctgcag gggtatgttt taccatctgc atccagcctc ctgctaactc
1381 ctagctgact cagcatagat tgtataaaat acctttgtaa cggctcttag cacactcaca
1441 gatgtttgag gctttcagaa gctcttctaa aaaatgatac acacctttca caagggcaaa
1501 ctttttcctt ttccctgtgt attctagtga atgaatctca agattcagta gacctaatga
1561 catttgtatt ttatgatctt ggctgtattt aatggcatag gctgactttt gcagatggag
1621 gaatttcttg attaatgttg aaaaaaaacc cttgattata ctctgttgga c
//
LOCUS HUMKER19PA 1586 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human keratin K19 pseudogene.
ACCESSION M33101
KEYWORDS keratin K19; pseudogene.
SOURCE Human, cDNA to mRNA, clone IF7.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1586)
AUTHORS Savtchenko,E.S., Schiff,T.A., Jiang,C.-K., Freedberg,I.M. and
Blumenberg,M.
TITLE Embryonic expression of the human 40-kD keratin: Evidence from a
processed pseudogene sequence
JOURNAL Am. J. Hum. Genet. 43, 630-637 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept.ps 120 1310 keratin 19 pseudogene
signal 1412 1418 polyA signal
BASE COUNT 377 a 405 c 470 g 334 t
ORIGIN
1 attgataaac atataatctg atatttatgt aaagtagcta ttttttaaaa aaagtatggc
61 tcctccctcg aatcgcagcc tctgggacca gggtcgctcc atccgtcgtc cgcctcgcca
121 tgacttccta cacgtatcgc cagtcgtagg ccaagtagtc cttctggggc ctgggtggtg
181 gctccgtgag ttttgtggca gaggttgcct ttcgcgcgct cagcatgcac tgggcctctg
241 gaggccgcgg cgtgtccgtg tcctccgccc gcttcgtgtc tgtcctcgtc ctccttgggg
301 ggctacggcg gcgtcttggc cgtgtcctac gggctgctgg cgggcaacga gaagctcaat
361 atgcagaacc tcagcgaccc tctggcctcc tacctggaca aggtgggcgc cctggaggac
421 gccaacggca aactggaggt gaagatccgc gactggtacc agaagcaggg gcccgggcct
481 cccgtgacta cagccactct acaagactat ccaggacctg cggtacaaga ttcttggtgc
541 caccattgag aactccagga ttgtcctgga gatcgacaac gcccgtctgg ctgcagatga
601 cttccgaacc aagagtgaga cggagcaggc tctgcgcatg agcggaggcc gacatcaacg
661 gcctgcgcag ggtgctggac gagctgaccc tggccattac cgacctggag atgcagatct
721 aaggcctgaa ggaagagctg gcctacctga agaagaacca tgagaaggaa atcagtgggc
781 tgaggggcca agtgggaggc caggtcagtg gggaggtgga ttcggctcag ggcacctatc
841 tcgccaagat cctgagttac atgcgaacgc aatacgaggt catggcggac aacaactgga
901 aggatgctga agcctggttc accagccgga ctgaagaatt gaaccgggag gtcgctggcc
961 acacagatca gctccagatg agccggtcca aggtcgctga cctgcggcgc accctccagg
1021 gtcttgagct ggagctgcag tcacggctga gcatgaaagc cgccttggaa gccacactgg
1081 cagaaacgga ggcgcgcttt ggagtccact tggcgcagat ccagccgctg atcaactgta
1141 ttgaagccca gctgggcgat gtgcgagctg atagtgagcg gcagaatcag gattaacagc
1201 agttcatgga catcaagtcg cggctggagc aggagatctc cacctaccgc agcctgctcg
1261 agggccagaa ggatcactac aacaacctgt ccgcctccaa ggtcctctga ggcagcaggc
1321 taaggggctt ctactgtcct ttggagggtg tctcctgggt agggggatgg gaaggaaggg
1381 acccttaccc cctgctcttc ccctgatctg ccaataaaat tttatggtcc aaggggaaaa
1441 aaaaaaaaaa aaaaaatata tatatatata tatatatata tatatatgtg tgtgtgtgtg
1501 tgtgtgtgtg tgtatatata cgtgtgtgtg tatatatata tatatgaaaa acaatacatg
1561 ctcgttgtag aaatgtggaa acatgg
//
LOCUS HUMLOX15A 2671 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human 15-lipoxygenase mRNA, complete cds.
ACCESSION M23892
KEYWORDS 15-lipoxygenase.
SOURCE Human reticulocyte, cDNA to mRNA, clone 15LOX.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2671)
AUTHORS Sigal,E., Craik,C.S., Highland,E., Grunberger,D., Costello,L.L.,
Dixon,R.A.F. and Nadel,J.A.
TITLE Molecular cloning and primary structure of human 15-lipoxygenase
JOURNAL Biochem. Biophys. Res. Commun. 157, 457-464 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 4 1992 15-lipoxygenase
mRNA < 1 2671 15-lipoxygenase mRNA
BASE COUNT 580 a 743 c 718 g 630 t
ORIGIN
1 aagatgggtc tctaccgcat ccgcgtgtcc actggggcct cgctctatgc cggttccaac
61 aaccaggtgc agctgtggct ggtcggccag cacggggagg cggcgctcgg gaagcgactg
121 tggcccgcac ggggcaagga gacagaactc aaggtggaag taccggagta tctggggccg
181 ctgctgtttg tgaaactgcg caaacggcac ctccttaagg acgacgcctg gttctgcaac
241 tggatctctg tgcagggccc cggagccggg gacgaggtca ggttcccttg ttaccgctgg
301 gtggagggca acggcgtcct gagcctgcct gaaggcaccg gccgcactgt gggcgaggac
361 cctcagggcc tgttccagaa acaccgggaa gaagagctgg aagagagaag gaagttgtac
421 cggtggggaa actggaagga cgggttaatt ctgaatatgg ctggggccaa actatatgac
481 ctccctgtgg atgagcgatt tctggaagac aagagagttg actttgaggt ttcgctggcc
541 aaggggctgg ccgacctcgc tatcaaagac tctctaaatg ttctgacttg ctggaaggat
601 ctagatgact tcaaccggat tttctggtgt ggtcagagca agctggctga gcgcgtgcgg
661 gactcctgga aggaagatgc cttatttggg taccagtttc ttaatggcgc caaccccgtg
721 gtgctgaggc gctctgctca ccttcctgct cgcctagtgt tccctccagg catggaggaa
781 ctgcaggccc agctggagaa ggagctggag ggaggcacac tgttcgaagc tgacttctcc
841 ctgctggatg ggatcaaggc caacgtcatt ctctgtagcc agcagcacct ggctgcccct
901 ctagtcatgc tgaaattgca gcctgatggg aaactcttgc ccatggtcat ccagctccag
961 ctgccccgca caggatcccc accacctccc cttttcttgc ctacggatcc cccaatggcc
1021 tggcttctgg ccaaatgctg ggtgcgcagc tctgacttcc agctccatga gctgcagtct
1081 catcttctga ggggacactt gatggctgag gtcattgttg tggccaccat gaggtgcctg
1141 ccgtcgatac atcctatctt caagcttata attccccacc tgcgatacac cctggaaatt
1201 aacgtccggg ccaggactgg gctggtctct gacatgggaa ttttcgacca gataatgagc
1261 actggtgggg gaggccacgt gcagctgctc aagcaagctg gagccttcct aacctacagc
1321 tccttctgtc cccctgatga cttggccgac cgggggctcc tgggagtgaa gtcttccttc
1381 tatgcccaag atgcgctgcg gctctgggaa atcatctatc ggtatgtgga aggaatcgtg
1441 agtctccact ataagacaga cgtggctgtg aaagacgacc cagagctgca gacctggtgt
1501 cgagagatca ctgaaatcgg gctgcaaggg gcccaggacc gagggtttcc tgtctcttta
1561 caggctcggg accaggtttg ccactttgtc accatgtgta tcttcacctg caccggccaa
1621 cacgcctctg tgcacctggg ccagctggac tggtactctt gggtgcctaa tgcaccctgc
1681 acgatgcggc tgcccccgcc aaccaccaag gatgcaacgc tggagacagt gatggcgaca
1741 ctgcccaact tccaccaggc ttctctccag atgtccatca cttggcagct gggcagacgc
1801 cagcccgtta tggtggctgt gggccagcat gaggaggagt atttttcggg ccctgagcct
1861 aaggctgtgc tgaagaagtt cagggaggag ctggctgccc tggataagga aattgagatc
1921 cggaatgcaa agctggacat gccctacgag tacctgcggc ccagcgtggt ggaaaacagt
1981 gtggccatct aagcgtcgcc accctttggt tatttcagcc cccatcaccc aagccacaag
2041 ctgacccctt cgtggttata gccctgccct cccaagtccc accctcttcc catgtcccac
2101 cctccctaga ggggcacctt ttcatggtct ctgcacccag tgaacacatt ttactctaga
2161 ggcatcacct gggaccttac tcctctttcc ttccttcctc ctttcctatc ttccttcctc
2221 tctctcttcc tctttcttca ttcagatcta tatggcaaat agccacaatt atataaatca
2281 tttcaagact agaatagggg gatataatac atattactcc acacctttta tgaatcaaat
2341 atgatttttt tgttgttgtt aagacagagt ctcactttga cacccaggct ggagtgcagt
2401 ggtgccatca ccacggctca ctgcagcctc agcgtcctgg gctcaaatga tcctcccacc
2461 tcagcctcct gagtagctgg gactacaggc tcatgccatc atgcccagct aatatttttt
2521 tattttcgtg gagacggggc ctcactatgt tgcctaggct ggaaatagga ttttgaaccc
2581 aaattgagtt taacaataat aaaaagttgt tttacgctaa agatggaaaa gaactaggac
2641 tgaactattt taaataaaat attggcaaaa g
//
LOCUS MUSBPGALA 334 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Mouse beta-galactoside-binding lectin (L-14.5) mRNA, 5' end.
ACCESSION M33214
KEYWORDS beta-galactoside-binding lectin.
SOURCE Mouse (strain C57BL/6) 12 day old embryo melanoma cell line
UV-2237-IP, cDNA to mRNA, clone L3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 334)
AUTHORS Raz,A., Carmi,P. and Pazerini,G.
TITLE Expression of two different endogenous galactoside-binding lectins
sharing sequence homology
JOURNAL Cancer Res. 48, 645-649 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 16 > 334 14 kDa beta-galactoside-binding lectin
(L-14.5)
BASE COUNT 86 a 94 c 91 g 63 t
ORIGIN
1 gaattgggta caatcatggc ctgtggtctg gtggatcagc aagctgaatc tcaaactggg
61 gcaatgtctc aaagttcggg gcagaggtgg acctcggacg acaggagctt tgtgctgacc
121 ctgggaaaag acagcaacaa ccgttgccta cacttcaatc ctcgcttcaa tgcccatgga
181 gacgccaaca ccattctgtg taacaccaag gaagatggga cctggggaac cgaacaccgg
241 gaacctgcct tccccttcca gcccgggagc atcacagagt gtgcatgcac ctttgaccag
301 gctgacctga ccatgcaagc tgccagacgg acat
//
LOCUS MUSBPGALB 621 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Mouse beta-galactoside-binding lectin (L-34) mRNA, 3' end.
ACCESSION M33215
KEYWORDS beta-galactoside-binding lectin.
SOURCE Mouse (strain C57BL/6) 12 day old embryo melanoma cell line
UV-2237-IP3, cDNA to mRNA, clone M5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 621)
AUTHORS Raz,A., Carmi,P. and Pazerini,G.
TITLE Expression of two different endogenous galactoside-binding lectins
sharing sequence homology
JOURNAL Cancer Res. 48, 645-649 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 420 34 kDa beta-galactoside-binding lectin (L-34)
BASE COUNT 164 a 171 c 154 g 132 t
ORIGIN
1 cccagggcaa cctggggcac ctggggccat ccccagtgct cctggaggct atcctgctgc
61 tggcccttat ggtgtccccg ctggaccact gacgtgccct atgacctgcc cttgcctgga
121 ggagtcatgc cccgcatgct gatcacaatc atgggcacag tgaaacccaa cgcaaacagg
181 attgttctag atttcaggag agggaatgat gttgccttcc actttaaccc ccgcttcaat
241 gagaacaaca gaagactcat tgtgtgtaac acgaagcagg acaataactg gggaaaggaa
301 gaaagacagt cagccttccc ctttgagagt ggaaaaccat tcaaaataca agtcctggtt
361 gcagctgacc attcaggttg cggtcacgat gctcactact gcagtacaac catcggatga
421 agaacctccg ggaaatcagc caactggcga tcagtggtga cataaccctg caccagcgct
481 gaaccagcgc catgatctaa gccagaaggg gcggcaccga aaccggccct gtgtgcctta
541 ggagtgggaa actttgcatt tctctctcct tatccttctt gtaagacatc catttaataa
601 agtctcatgc tgagagaaaa g
//
LOCUS MUSP32A 1510 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Mouse tumor-induced 32 kD protein (p32) mRNA, complete cds.
ACCESSION M33203
KEYWORDS tumor-induced protein.
SOURCE Mouse (strain BALB/c) fibroblast cell line 3T3 A31, cDNA to mRNA,
clone pMp32S.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1510)
AUTHORS Kageyama,H., Hiwasa,T., Tokunaga,K. and Sakiyama,S.
TITLE Isolation and characterization of a complementary DNA clone for a
M-r 32,000 protein which is induced with tumor promoters in BALB/c
3T3 cells
JOURNAL Cancer Res. 48, 4795-4798 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 75 944 32 kD protein (p32)
mRNA < 1 1510 p32 mRNA
signal 1491 1496 poly-A signal
BASE COUNT 352 a 420 c 381 g 357 t
ORIGIN
1 ccgcgcagag ccgtctcgag catagcccgg agcctgaatc gagcagaacc agcctgaact
61 agcccagtcc ggtgatggag cgtccacagc ccgacagcat gccccaggat ttgtctgagg
121 ccttgaagga ggccaccaag gaggtacaca tccaagccga gaatgctgag ttcatgaaga
181 actttcagaa gggtcaggtg tccagagaag gctttaagct ggtgatggct tccttgtacc
241 atatctacac ggccctggaa gaggagatag agcgcaacaa gcagaaccca gtctatgccc
301 cactctactt ccctgaggag ctgcaccgaa gggctgccct ggagcaggac atggccttct
361 ggtatgggcc tcactggcag gaaatcatcc cttgcacgcc agccacacag cactatgtaa
421 agcgtctcca cgaggtgggg cgcactcacc ctgagctgct ggtggcccac gcatataccc
481 gctacctggg tgacctctca gggggtcagg tcctgaagaa gattgcacag aaggccatgg
541 ccttgcccag ctctggggag ggcctggctt tttttacctt cccgaacatc gacagcccca
601 ccaagttcaa acagctctat cgtgctcgaa tgaacactct ggagatgaca cctgaggtca
661 agcacagggt gacagaagag gctaagaccg ccttcctgct caacattgag ctgtttgagg
721 agctgcaggt gatgctgaca gaggaacaca aagaccagag tccctcacag atggcgtcac
781 ttcgtcagag gcctgctagc ctggtgcaag atactgcccc tgcagagaca ccccgaggga
841 aaccccagat cagcactagc tcatcccaga caccgctcct ccagtgggtc ctcactctca
901 gcttcctgtt ggcaacagtg gcagtgggaa tttatgccat gtaaatgcaa tactggcccc
961 caggggctgt gaactctgtc caatgtggcc ttctctctgt aagggagaat cttgcctggc
1021 tctcttctct tgggcctcta agaaagcttt tggggtccct agcccactcc ctgtgtttcc
1081 tttctctctg gaatggaggg agatacctga cacagttccc tcaccaaaag cacatccagc
1141 cagtggcctg aactttgaaa ccagcagccc caaatcctgc agcagagccc caaaactggc
1201 ctgtaaaagc agctgttctg agcccagtgc ccatggttgt aagcatccat gttgactgac
1261 cacgactgct gtcccccagt gccatggcca ctttgatatc cgtttccaga catttctgtc
1321 tcgtatttct gtcttgtttt ttattatttc cccagttcta ccagagtaat ggtattttgt
1381 tgttttgttt tgtcttgttt ttcctaacaa agtggggcta tcttttgagg ggtgggtggg
1441 aaagaattat ttaatagttg taaccttggt ctctaacttc tgtgtgaaat aataaatggc
1501 attatctaac
//
LOCUS PASLKTCABD 7742 bp ds-DNA BCT 12-JUL-1990
DEFINITION P.haemolytica leukotoxin gene cluster, complete cds.
ACCESSION M24197 M34943 M34944
KEYWORDS LktA membrane protein; cytolysin; hemolysin; leukotoxin.
SOURCE P.haemolytica (strain PHL101) DNA, clones lambda-sh132 and pSH224.
ORGANISM Pasteurella haemolytica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 7742)
AUTHORS Highlander,S.K., Chidambaram,M., Engler,M.J. and Weinstock,G.M.
TITLE DNA sequence of the Pasteurella haemolytica leukotoxin gene cluster
JOURNAL DNA 8, 15-28 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 46 to 276 and 3576 to 3813)
AUTHORS Highlander,S.K., Engler,M.J. and Weinstock,G.M.
TITLE secretion and expression of the Pasteurella haemolytica leukotoxin
JOURNAL J. Bacteriol. 172, 2343-2350 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.K.Highlander, 24-APR-1989.
FEATURES from to/span description
pept 216 719 leukotoxin (LktC)
pept 735 3596 leukotoxin membrane protein (LktA)
pept 3670 5796 leukotoxin (LktB)
pept 5808 7244 leukotoxin (LktD)
mRNA 186 > 3596 lktCA mRNA
signal 173 178 -10 region promoter
binding 205 209 lktC ribosomal binding site (put.)
binding 722 726 lktA ribosomal binding site (put.)
binding 3660 3665 lktB ribosomal binding site (put.)
binding 5797 5802 lktD ribosomal binding site (put.)
BASE COUNT 2573 a 1357 c 1568 g 2244 t
ORIGIN
1 taatattaca atgtaattat tttgtttaat ttccctacat tttgtataac tttaaaacac
61 tcctttttct cttctgatta tataaaagac aaaaaataca atttaagcta caaaaaacaa
121 caaaaaacaa caaaaaacac gacaataaga tcgagtaatg attatattat gttataattt
181 ttgacctaat ttagaataat tatcgagtgc aaattatgaa tcaatcttat tttaacttac
241 taggaaacat tacttggcta tggatgaact cctccctcca caaagaatgg agctgtgaac
301 tactagcacg caatgtgatt cctgcaattg aaaatgaaca atatatgcta cttatagata
361 acggtattcc gatcgcttat tgtagttggg cagatttaaa ccttgagact gaggtgaaat
421 atattaagga tattaattcg ttaacaccag aagaatggca gtctggtgac agacgctgga
481 ttattgattg ggtagcacca ttcggacatt ctcaattact ttataaaaaa atgtgtcaga
541 aataccctga tatgatcgtc agatctatac gcttttatcc aaagcagaaa gaattaggca
601 aaattgccta ctttaaagga ggtaaattag ataaaaaaac agcaaaaaaa cgttttgata
661 catatcaaga agagctggca acacgactta aaaatgaatt taattttatt aaaaaataga
721 aggagacatc ccttatggga actagactta caaccctatc aaatgggcta aaaaacactt
781 taacggcaac caaaagtggc ttacataaag ccggtcaatc attaacccaa gccggcagtt
841 ctttaaaaac tggggcaaaa aaaattatcc tctatattcc ccaaaattac caatatgata
901 ctgaacaagg taatggttta caggatttag tcaaagcggc cgaagagttg gggattgagg
961 tacaaagaga agaacgcaat aatattgcaa cagctcaaac cagtttaggc acgattcaaa
1021 ccgctattgg cttaactgag cgtggcattg tgttatccgc tccacaaatt gataaattgc
1081 tacagaaaac taaagcaggc caagcattag gttctgccga aagcattgta caaaatgcaa
1141 ataaagccaa aactgtatta tctggcattc aatctatttt aggctcagta ttggctggaa
1201 tggatttaga tgaggcctta cagaataaca gcaaccaaca tgctcttgct aaagctggct
1261 tggagctaac aaattcatta attgaaaata ttgctaattc agtaaaaaca cttgacgaat
1321 ttggtgagca aattagtcaa tttggttcaa aactacaaaa tatcaaaggc ttagggactt
1381 taggagacaa actcaaaaat atcggtggac ttgataaagc tggccttggt ttagatgtta
1441 tctcagggct attatcgggc gcaacagctg cacttgtact tgcagataaa aatgcttcaa
1501 cagctaaaaa agtgggtgcg ggttttgaat tggcaaacca agttgttggt aatattacca
1561 aagccgtttc ttcttacatt ttagcccaac gtgttgcagc aggtttatct tcaactgggc
1621 ctgtggctgc tttaattgct tctactgttt ctcttgcgat tagcccatta gcatttgccg
1681 gtattgccga taaatttaat catgcaaaaa gtttagagag ttatgccgaa cgctttaaaa
1741 aattaggcta tgacggagat aatttattag cagaatatca gcggggaaca gggactattg
1801 atgcatcggt tactgcaatt aataccgcat tggccgctat tgctggtggt gtgtctgctg
1861 ctgcagccgg ctcggttatt gcttcaccga ttgccttatt agtatctggg attaccggtg
1921 taatttctac gattctgcaa tattctaaac aagcaatgtt gagcacgttg caaataaaaa
1981 ttcataacaa aattgtagaa tgggaaaaaa ataatcacgg taagaactac tttgaaaatg
2041 gttacgatgc ccgttatctt gcgaatttac aagataatat gaaattctta ctgaacttaa
2101 acaaagagtt acaggcagaa cgtgtcatcg ctattactca gcagcaatgg gataacaaca
2161 ttggtgattt agctggtatt agccgtttag gtgaaaaagt ccttagtggt aaagcctatg
2221 tggatgcgtt tgaagaaggc aaacacatta aagccgataa attagtacag ttggattcgg
2281 caaacggtat tattgatgtg agtaattcgg gtaaagcgaa aactcagcat atcttattca
2341 gaacgccatt attgacgccg ggaacagagc atcgtgaacg cgtacaaaca ggtaaatatg
2401 aatatattac caagctcaat attaaccgtg tagatagctg gaaaattaca gatggtgcag
2461 caagttctac ctttgattta actaacgttg ttcagcgtat tggtattgaa ttagacaatg
2521 ctggaaatgt aactaaaacc aaagaaacaa aaattattgc caaacttggt gaaggtgatg
2581 acaacgtatt tgttggttct ggtacgacgg aaattgatgg cggtgaaggt tacgaccgag
2641 ttcactatag ccgtggaaac tatggtgctt taactattga tgcaaccaaa gagaccgagc
2701 aaggtagtta taccgtaaat cgtttcgtag aaaccggtaa agcactacac gaagtgactt
2761 caacccatac cgcattagtg ggcaaccgtg aagaaaaaat agaatatcgt catagcaata
2821 accagcacca tgccggttat tacaccaaag ataccttgaa agctgttgaa gaaattatcg
2881 gtacatcaca taacgatatc tttaaaggta gtaagttcaa tgatgccttt aacggtggtg
2941 atggtgtcga tactatttac ggtaacgacg gcaatgaccg cttatttggt ggtaaaggcg
3001 atgatattct cgatggtgga aatggtgatg attttatcga tggcggtaaa ggcaacgacc
3061 tattacacgg tggcaagggc gatgatattt tcgttcaccg taaaggcgat ggtaatgata
3121 ttattaccga ttctgacggc aatgataaat tatcattctc tgattcgaac ttaaaagatt
3181 taacatttga aaaagttaaa cataatcttg tcatcacgaa tagcaaaaaa gagaaagtga
3241 ccattcaaaa ctggttccga gaggctgatt ttgctaaaga agtgcctaat tataaagcaa
3301 ctaaagatga gaaaatcgaa gaaatcatcg gtcaaaatgg cgagcggatc acctcaaagc
3361 aagttgatga tcttatcgca aaaggtaacg gcaaaattac ccaagatgag ctatcaaaag
3421 ttgttgataa ctatgaattg ctcaaacata gcaaaaatgt gacaaacagc ttagataagt
3481 taatctcatc tgtaagtgca tttacctcgt ctaatgattc gagaaatgta ttagtggctc
3541 caacttcaat gttggatcaa agtttatctt ctcttcaatt tgctagagca gcttaatttt
3601 taatgattgg caactctata ttgtttcaca cattatagat tgccgtttta ttttataaaa
3661 ggagacaata tggaagctaa ccatcaaagg aatgatcttg gtttagttgc cctcactatg
3721 ttggcacaat accataatat ttcgcttaat ccggaagaaa taaaacataa atttgatctt
3781 gacggaaaag ggctttcttt aactgcttgg cttttagctg caaaatcgtt agcgttgaaa
3841 gcgaaacaca ttaaaaaaga gatttcccgc ttacacttgg tgaatttacc ggcattagtt
3901 tggcaagata acggtaaaca ttttttattg gtaaaagtgg ataccgataa taaccgctat
3961 ttaacttaca atttggaaca agatgctcca caaattctgt caacagacga atttgaagcc
4021 tgctatcaag ggcagttaat tttggtcacg tccagagctt ccgtagtagg tcaattagca
4081 aagttcgatt tcacctggtt tattccggcg gtgatcaaat accgaaaaat ctttctagaa
4141 accttgattg tttcgatctt tttgcaaatt tttgccctaa ttacaccgct attcttccaa
4201 gttgttatgg ataaagtact ggtgcatcga ggtttttcaa ccttgaatat cattacggtt
4261 gccttagcta ttgtgatcat ctttgaaatt gtactaagtg gtttgagaac ctatgttttt
4321 tctcatagca ctagccgtat tgatgttgaa ttaggcgcta aattatttcg acatttatta
4381 tcactaccca tttcttattt tgaaaacaga cgagttggag atacagtcgc tagggttaga
4441 gaattagatc aaattcgtaa tttccttacc ggacaagcat taacctcggt gttagatctc
4501 ttattctctt ttatcttttt tgccgtaatg tggtattaca gcccaaaatt aaccttggta
4561 attcttggtt cattgccctg ctatatttta tggtcaattt ttattagtcc gattttaaga
4621 cggcgtttag atgagaaatt tgcccgaagt gctgataacc aagcattctt agttgagtcg
4681 gtaacagcca tcaatatgat taaagcgatg gcggttgctc cacaaatgac ggatacatgg
4741 gataaacagc tggcaagcta tgttttcatc agtttccgtg tcaccgtatt agcaaccatt
4801 gggcaacaag gtgtacaact tattcaaaaa accgttatgg tgattaacct ttggttaggg
4861 gcacacttag ttatttcagg cgatctgagt attgggcaat taattgcctt taatatgcta
4921 tcagggcaag tgattgcacc ggtgattcgg ctggctcagc tctggcaaga tttccaacaa
4981 gttgggattt ccgtcactcg cttaggtgat gttttaaact ctccaaccga acaatatcaa
5041 ggcaaattat cactaccaga aataaaaggc gatatctcat ttaaaaatat ccgctttaga
5101 tataaaccag atgcaccaac tattttaaat aatgtgaatt tagaaattag gcaaggagaa
5161 gtgattggga ttgttggacg ttccggttca ggcaaaagta ctctgactaa attactgcaa
5221 cgtttttata ttcctgaaaa tgggcaggtt ttgattgatg gacatgatct agccttagct
5281 gatccaaact ggctacgccg tcaaataggt gtagtgctgc aagataatgt gttattaaac
5341 cgcagtatcc gagaaaatat tgcgctatca gatccaggaa tgccaatgga gcgagtaatt
5401 tatgcagcaa aattagcagg ggctcacgat tttatttcag aattgcgtga aggttatacc
5461 accattgtgg gtgaacaagg agcggggctt tcaggcgggc aacgccaacg gattgcgatt
5521 gctcgagctt tggtaaacaa cccgaaaatc ctgatttttg atgaggcaac cagtgccctc
5581 gattacgaat ctgagcatat tattatgcaa aatatgcaaa aaatatgcca aggcagaacc
5641 gtgattttga ttgcacatcg tttatcgacc gtcaaaaatg cggatcgaat tattgtgatg
5701 gaaaaggggg aaattgttga gcaaggcaag caccacgaat tactgcaaaa cagtaacgga
5761 ctttattcct acttacacca attacaactt aattaagaag gaaaacaatg aaaatatggc
5821 ttagtggtat ttatgaattt ttcctacgct ataaaaacat ttgggcagaa gtatggaaaa
5881 ttcgtaaaga attagaccac ccaaacagaa aaaaagacga aagtgaattt ttaccggcac
5941 atttagaact gattgaaacc ccggtttcta aaaaaccacg tctaattgct tatttgatta
6001 tgctattttt agttgtggca attgtgcttg ccagtgtaag caaagttgaa attgtggcga
6061 ctgctcccgg taaattaact tttagtggca gaagtaaaga aattaaaccg attgaaaacg
6121 ccattgtaca agaaattttc gttaaagatg ggcagtttgt ggaaaaaggg caattattag
6181 tcagcttaac tgcattgggt tctgatgcag atatcaaaaa gaccatggct tcactttctt
6241 tagctaaact ggagacctat cgctaccaaa ctttgcttac tgccattgaa aaagagtcct
6301 tgccggtgat tgatttatct agaaccgaat ttaaagattc atcggaagaa gatcgactac
6361 gtattaaaca cttaattgag gagcaataca ccacttggca aaaacaaaaa acacagaaaa
6421 ctttagcgta taagcgtaaa gaggctgaaa aacaaacaat atttgcctat gtccgtaaat
6481 atgaaggtgc aacacgtatt gaacaagaaa aattaaaaga ctttaaggca ctttataaac
6541 agaagtcttt atctaagcac gaacttcttg cgcaagaaaa taaattaatt gaggctcaga
6601 atgcagtagc tgtttatcgc tcaaaattaa atgaattaga aaatgatcta ctcaatgtaa
6661 aagaagaact tgaattgatc acgcaattct ttaaaagcga tgtgttggaa aaattaaagc
6721 aacatattga aaatgaacgc caacttcggc tcgagttaga aaaaaataat caacgcagac
6781 aggcctcgat gatcagagca ccggtttccg gtacggttca gcaactgaaa attcacacta
6841 taggtggtgt tgttacgact gctgaaacct tgatgatcat tgtgccggaa gacgatgtgt
6901 tagaggccac cgctctggtt ccaaacaaag atatcggctt tgttgcagca gggcaggagg
6961 tgattattaa agtggaaact ttcccttata cacgctatgg ttatctaact ggtcgaatta
7021 aacatattag cccggatgcg attgaacaac ctaatgtagg cttagttttt aatgcaacta
7081 tagctataga taggaagaat ctaacatcgc ctgatgggcg aaaaattgat ttgagttcag
7141 gtatgacaat aactgctgaa atcaaaaccg gtgaacggag tgtaatgagt tatttactca
7201 gcccattaga agaatctgtc acagaaagtt taagggaacg ctaatcgaac caaaacaaag
7261 ccataaaagc cattttgagc ttttatggct ttatttttta gtccacaagc ggacaaaaaa
7321 gcccaatttt ttacactttt ataacaaatt gttctaacta aaaattacta attcttttct
7381 tttatagcga tctctatttc atttcattaa cattgactag aagggattat gagcctaagc
7441 attacgaatc tttctcttgg ctaccgcaaa aatcagcaaa ggcttatttg aaaagcacgg
7501 tgtcgaggtg gaaaaaccgg tgatgtttcg cagctgggct cagttggtgg aagcttttta
7561 agtggcaatg tgaacgtggt gcatctgctt tcgcctatga gtttgtgggc gaaatatgga
7621 gcaaatgctc cggtgaaagc ggtaatgtgg aatcacttgg caggttcggc tttaacggtt
7681 cgccctgaaa tcaacagtat tgccgaactc tccggcaaaa cggtagaact tccgttttgg
7741 ta
//
LOCUS RATBADPTA 3477 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Rat beta adaptin mRNA, complete cds.
ACCESSION M34176 J05273
KEYWORDS beta adaptin.
SOURCE Rat lymphocyte, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3477)
AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and
Parham,P.
TITLE Conservation and diversity in families of coated vesicle adaptins
JOURNAL J. Biol. Chem. 265, 4814-4820 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 72 2885 beta adaptin
mRNA < 1 3477 beta adaptin mRNA
BASE COUNT 914 a 851 c 822 g 890 t
ORIGIN
1 cggggctgtg ctctctgact gccgccgcca ccccgcccct tgcctccggt tcacgctgaa
61 gatccagaat catgactgac tccaagtact tcacaaccaa taagaaggga gaaatctttg
121 aattaaaagc tgaactcaac aatgaaaaga aagaaaagag gaaggaggct gtgaagaaag
181 tgattgctgc tatgactgtg gggaaagacg ttagctctct cttcccagat gtggtgaact
241 gtatgcagac tgacaacctg gaactaaaga agcttgtgta cctctatctg atgaactatg
301 ccaagagtca gccagacatg gccatcatgg ctgtcaacag ctttgtgaag gattgtgaag
361 accccaatcc tttgattcga gccttggcag ttagaaccat gggatgcatc cgggtggaca
421 agattacaga gtatctctgt gaacccctcc gcaagtgctt gaaggatgaa gacccctatg
481 ttcggaaaac agcagcagta tgcgtggcaa aactccatga tatcaatgcc cagatggtgg
541 aagatcaggg atttctggat tctctgcggg atctcatagc agattcaaac ccaatggtgg
601 tggctaatgc tgtagcagca ttgtctgaga tcagtgagtc tcacccaaac agcaacttac
661 ttgatctgaa ccctcagaat atcaataagc tgctcacagc cctgaatgag tgcactgagt
721 ggggccagat tttcatcttg gactgcctgt ctaattacaa ccctaaagat gaccgggaag
781 ctcagagcat ctgtgagcga gtgacgcctc ggctctctca tgccaattct gcagtggtgc
841 tttcagcagt aaaagttctg atgaagtttc tagagttgtt acccaaggac tctgactact
901 acaatatgct gctaaagaag ctagcgcctc cacttgtcac tttgctctct ggggagccag
961 aagtgcagta tgttgccctg aggaacatca acctaattgt ccagaaaagg cctgaaatct
1021 tgaagcagga aatcaaggtc ttctttgtga agtacaatga tcctatctat gttaaactag
1081 agaagttaga catcatgatt cgtcttgcat cccaagccaa cattgctcag gttctggcag
1141 aactgaagga atatgccact gaagttgatg tggactttgt tcgcaaagct gtgagggcca
1201 ttggacggtg tgccatcaaa gtggagcaat cagcagaacg ctgtgtgagc acactgcttg
1261 atctaatcca gaccaaagta aattatgtgg tccaagaggc aattgttgtc atcagggaca
1321 tcttccgaaa ataccccaac aagtatgaga gcattatcgc cacgctgtgt gagaacttgg
1381 actccctgga tgaacccgat gcccgagcgg ctatgatttg gattgtagga gagtatgctg
1441 aaagaatcga taatgccgat gagttactag agagcttcct ggaaggtttt catgatgaaa
1501 gcacccaggt gcagctcacg ttgcttaccg ccatagtgaa actgtttctc aagaagccat
1561 cagaaacaca ggagctggtc caacaggtct tgagcttggc cacacaggat tctgataatc
1621 ctgaccttcg agatcggggt tatatttatt ggcgccttct ttcaactgac cctgtgacag
1681 ccaaagaagt agtgttgtct gagaagccat tgatctctga ggaaacagac ctcattgaac
1741 ctaccctcct ggatgagctc atctgccaca ttggttcttt ggcctccgtg taccataaac
1801 ctccgaatgc ttttgtggaa gggagccatg gcattcatcg caaacacttg ccaattcacc
1861 atgggagcac tgatgcaggt gatagccctg ttggcaccac cactgcaacc aacctggaac
1921 agcctcaggt catcccctct caaggtgacc ttctggggga tcttttaaat cttgacctgg
1981 gtcccccagt gaatgtcccg caagtgtcct ccatgcagat gggagcagtg gatcttttag
2041 gaggaggact ggatagcctg gtaggacagt ccttcatccc gtcatcagtg cctgcaacct
2101 tcgctccttc acctactcct gctgtggtca gcagtggtct gaatgacctg tttgagcttt
2161 ccactgggat aggcatggca cctggcggat atgtggctcc taaggcagtc tggctacctg
2221 ctgtaaaggc taaaggcttg gagatttcgg ggacgtttac tcaccgccaa gggcacatct
2281 atatggaaat gaacttcacc aacaaagctc tgcagcacat gacggatttt gccatccagt
2341 ttaacaagaa tagcttcggt gtcatcccga gcactccctt ggccatacat actccgctga
2401 tgccaaacca gagcattgat gtgtctctgc ctctcaacac cttgggccca gtcatgaaga
2461 tggagcctct gaataacttg caggtggctg ttaaaaacaa tattgatgtc ttctacttca
2521 gctgcctcat cccactcaat gtgctttttg tagaagatgg caaaatggaa cgccaggtct
2581 tccttgcgac gtggaaggat attcccaatg aaaatgagct ccaatttcag attaaggagt
2641 gtcatttaaa cgctgacaca gtttccagca agttgcaaaa caacaatgtt tacactatcg
2701 ccaagaggaa tgtggagggg caggacatgc tgtaccagtc cctgaagctc actaatggca
2761 tttggatttt ggcagagctg cggatccagc caggaaaccc caattatacg ctgtcgctga
2821 agtgtagagc ccctgaagtc tctcagtaca tctatcaggt ctacgacagc attttgaaaa
2881 actaataaat gggtccagtc agcctgtaat cagtgcaagc cacgaactct taactgaaag
2941 acactgtatt gttgtgtaga gcctgaaccc aaaccctgcg gtacccaccc cggtagtggc
3001 cagtcatttt gtgctgatat tagcactcac cccattggta ggttagcttc ccgtgacatc
3061 tccttccact atcgcccacc tctgccacct gccgctgctc tctgtcctta gttgtgagtt
3121 cctctgtgct gtgccaatgg ctagcctttt ctacaccctc ttttgagtgt agtttgatat
3181 tttgtaatcg aaagctcatt tcacaagcag aaaaaggcaa caagttaatt agagcgagga
3241 agagtgtcac tgaaacatac actgcacctt attgttttat atttttgtac agatgagata
3301 gatattgagg tagaacgctg agtagaaagg gtgactgacc ctcctcagac acagtcttat
3361 tggagacata tggccctggc cccttctggg caaggagagg cgaccccact cctggtcttt
3421 tgcattttca ccttggccac gccttccagc tctcttatgc ccatgctctc tcatttg
//
LOCUS RATPSPB 1620 bp ss-mRNA ROD 12-JUL-1990
DEFINITION Rat pulmonary surfactant-associated glycoprotein A (SP-A) mRNA,
complete cds.
ACCESSION M33201
KEYWORDS pulmonary surfactant protein A.
SOURCE Rat fetal lung, cDNA to mRNA, clone SP-A [0.9, 1.6].
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1620)
AUTHORS Fisher,J.H., Emrie,P.A., Shannon,J., Sano,K., Hattler,B. and
Mason,R.J.
TITLE Rat pulmonary surfactant protein A is expressed as two differently
sized mRNA species which arise from differential polyadenylation of
one transcript
JOURNAL Biochim. Biophys. Acta 950, 338-345 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 56 802 pulmonary surfactant protein A (56 could be
29)
BASE COUNT 425 a 400 c 396 g 399 t
ORIGIN
1 cagatatcca cacagcctgc aggtctgtat gtggaagcca ctggggatag tagccatgtc
61 actgtgttct ttggccttca ccctcttctt gactgttgtc gctggtatca agtgcaatgt
121 gacagacgtt tgtgctggaa gccctgggat ccctggagct cctggaaacc atggtctgcc
181 tggcagagac gggagagacg gtgtcaaagg agaccctgga cctccaggtc ccatgggccc
241 tcctggagga atgccaggtc ttcctggacg cgatgggctg cccggaggac ctggtgcacc
301 tggaggacgt ggagacaagg gagagcctgg agaaaggggc ctgccaggat ttccagctta
361 cctggatgag gagctccaga ctgaactcta tgagatcaaa catcagattc tgcaaacaat
421 gggagtcctc agcttgcaag gatccatgct gtcagtgggg gataaagtct tttccaccaa
481 tgggcagtca gtcaactttg ataccattaa agagatgtgt accagagcag gaggcaacat
541 tgctgtcccg aggactcctg aggagaacga ggccattgca agtattgcga agaagtacaa
601 caactatgtc tacttgggca tgattgaaga ccagactcct ggagacttcc actacctgga
661 tggggcttct gtgaactaca ccaactggta cccaggagaa cccaggggtc agggcaaaga
721 aaagtgtgta gaaatgtata cagatgggac atggaatgat aggggctgcc tgcagtaccg
781 gctggctgtt tgtgaatttt gatcaagcaa ttagacgaaa agatgaaccc tcacactgcc
841 tctatcctga tgattcatct ggtctgtaaa accctgcaac tacctttact tgtggccttc
901 agtaattaga agcatctttt gtcacccccg ctcccacata gttcccaaac acttctccat
961 attcattagc aatcctgagt gtttccctag agtcccatct gagcgttcat tcaaggtagc
1021 cattgtaaac cttggccttg accatgagat ggatagatac ttcctttttc ctcactttat
1081 ccagtcttca tttataaatg gtggccatga agacccagca tggaaggacc ctctaactaa
1141 gtgctgccct ctgacctttc cacccttctg tagctcggtg tcccaggatt tagaagtcca
1201 ggttaaacat aggggatttc tgggaaagcc tagtatgtgg gtgcaggcca cattcatgcc
1261 atctgtatcc atggctttca aggcaaacat tgtctctaag aagccagaga accaggagaa
1321 ccaggtagga ccaggtagta ctgggggaac ataaactcac ttggtttggc atgtatggct
1381 cctccttggg tctggaggtg ccatcttgac cttgaactaa cagcagccac cctgggtttt
1441 gagagaacga ccttcccagc ccagacccca actcaagtaa tttcctgcta acagacacag
1501 cctcagttca ctttacatca ctgaggcatt catgatacga actgcaatct gttttctcct
1561 ctcgtgagtt caatcagcta ttcattaaag tcaactgcat tcaaaaaaaa aaaaaaaaaa
//
LOCUS FSBCRYGM1 613 bp ss-mRNA VRT 12-JUL-1990
DEFINITION Carp gamma-crystallin (gamma-m1) mRNA, complete cds.
ACCESSION X12902 M33115
KEYWORDS crystallin; gamma-crystallin.
SOURCE Cyprinus carpio lens, cDNA to mRNA.
ORGANISM Cyprinus carpio
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes;
Actinopterygii; Cypriniformes; Cyprinoidei; Cyprinidae.
REFERENCE 1 (bases 1 to 613)
AUTHORS Chang,T., Jiang,Y.-J., Chiou,S.-H. and Chang,W.-C.
TITLE Carp gamma-crystallins with high methionine content: Cloning and
sequencing of the complementary DNA
JOURNAL Biochim. Biophys. Acta 951, 226-229 (1988)
STANDARD simple staff_review
COMMENT [1] Author address Chang W.-G., Institute of Biological Chemistry,
Academia Sinica, P.O. Box 23-106, Taipeh 10098, Taiwan R.O.C..
Submitted (09-SEP-1988) on tape to the EMBL data library.
FEATURES from to/span description
pept 34 570 gamma-crystallin (gamma-m1)
BASE COUNT 166 a 133 c 167 g 147 t
ORIGIN
1 ctgaagcact gagataaaca accctctacc atcatgggca agatcatctt ctacgaggac
61 aggaacttcc agggccgcag ctatgactgc atgagcgact gctctgatat ctcctcttac
121 ctcagccgcg ttggttcaat cagggtggag agtggttgtt tcatggtcta tgagcgcaac
181 agctacatgg ggaaccagtt cttcctgagg aggggcgagt accatgatat gcagcgcatg
241 atgagcatgg gcatgatgtt tgacactatc agatcctgcc gcatgattcc tccatacagg
301 ggttcctaca gaatgaggat ctacgagagg gacaccttcg gaggacagat gcacgaggtg
361 atggatgact gtgacaacat catggaacgt taccgtatgt ctgactggca gtcttgtcat
421 gtgatggacg gccactggct cttctatgag cagccacact acagaggcag aatgtggtac
481 ttcaggcctg gagagtacag gagcttcaga gatatgggat acagcaacat gagattcatg
541 agcatgaggc gtatcactga tatgtgttaa actgctagaa tatagaagga attaaagtgt
601 tattctcaga act
//
LOCUS FSBCRYGM2 554 bp ss-mRNA VRT 12-JUL-1990
DEFINITION Carp gamma-crystallin (gamma-m2) mRNA, complete cds.
ACCESSION X12903 M33116
KEYWORDS crystallin; gamma-crystallin.
SOURCE Cyprinus carpio lens, cDNA to mRNA.
ORGANISM Cyprinus carpio
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes;
Actinopterygii; Cypriniformes; Cyprinoidei; Cyprinidae.
REFERENCE 1 (bases 1 to 554)
AUTHORS Chang,T., Jiang,Y.-J., Chiou,S.-H. and Chang,W.-C.
TITLE Carp gamma-crystallins with high methionine content: Cloning and
sequencing of the complementary DNA
JOURNAL Biochim. Biophys. Acta 951, 226-229 (1988)
STANDARD simple staff_review
COMMENT [1] Author address Chang W.-G., Institute of Biological Chemistry,
Academia Sinica, P.O. Box 23-106, Taipeh 10098, Taiwan R.O.C..
Submitted (09-SEP-1988) on tape to the EMBL data library.
FEATURES from to/span description
pept 7 528 gamma-crystallin gamma-m2 (AA 1 - 173)
BASE COUNT 142 a 124 c 150 g 138 t
ORIGIN
1 tggcccatga aggtcacctt ttatgaggac aggaacttcc agggtcgctc ttatgactgt
61 atgagcgact gtgccgattt ctcctcctac atgagccgct gtcactcttg cagagtgcac
121 agcggatgct ggatgatgta cgatcaaccc aactacatgg gaaatcagta tttctttagg
181 aggggagagt atgctgatta catgtctatg tttggaatga gcaactgcat caggtcctgc
241 cgtatgatcc ctatgcacag gggatcctac agaatgagga tctacgagag ggagaacttc
301 atgggccaga tgtacgaaat ggccgatgac tgtgacagta tcatggaccg ttaccgcatg
361 cctcactgcc agtcctgcca tgtgatggac ggccactggc tcatgtatga gcagccccac
421 tacagaggca ggatgtggta cttcaggcct ggagagtaca ggagcttcag caatatgggt
481 ggaatgagat tcatgagcat gaggcgtatc atggactcct ggtactagag tttatattaa
541 taaaataact cctc
//
LOCUS HUMIL2A1 940 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human interleukin 2 gene, exons 1 and 2.
ACCESSION M33199
KEYWORDS interleukin; interleukin 2.
SEGMENT 1 of 2
SOURCE Human DNA, clones Lm HIG[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 940)
AUTHORS Nishino,N., Obaru,K., Maeda,S., Shimada,K. and Onoue,K.
TITLE Organization of the DNA regions flanking the human interleukin 2
gene
JOURNAL Biomed. Res. 6, 197-205 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept 629 775 interleukin 2, exon 1 /nomgen="IL2"
/map="4q26-q27" /hgml_locus_uid="LT0164X"
865 / 924 interleukin 2, exon 2
IVS 776 864 IL2 intron A
IVS 925 > 940 IL2 intron B
BASE COUNT 313 a 181 c 137 g 309 t
ORIGIN
1 cttcaactca ataagcattt taagtattct aatcttagta tttctctagc tgacatgtaa
61 gaagcaatct atcttattgt atgcaattag ctcattgtgt ggataaaaag gtaaaaccat
121 tctgaaacag gaaaccaata cacttcctgt ttaatcaaca aatctaaaca tttattcttt
181 tcatctgttt actcttgctc ttgtccacca caatatgcta ttcacatgtt cagtgtagtt
241 ttatgacaaa gaaaattttc tgagttactt ttgtatcccc acccccttaa agaaaggagg
301 aaaaactgtt tcatacagaa ggcgttaatt gcatgaatta gagctatcac ctaagtgtgg
361 gctaatgtaa caaagaggga tttcacctac atccattcag tcagtctttg ggggtttaaa
421 gaaattccaa agagtcatca gaagaggaaa aatgaaggta atgttttttc agacaggtaa
481 agtctttgaa aatatgtgta atatgtaaaa cattttgaca cccccataat atttttccag
541 aattaacagt ataaattgca tctcttgttc aagagttccc tatcactctc tttaatcact
601 actcacagta acctcaactc ctgccacaat gtacaggatg caactcctgt cttgcattgc
661 actaagtctt gcacttgtca caaacagtgc acctacttca agttctacaa agaaaacaca
721 gctacaactg gagcatttac ttctggattt acagatgatt ttgaatggaa ttaatgtaag
781 tatatttcct ttcttactaa aattattaca tttagtaatc tagctggaga tcatttctta
841 taacaatgca ttatactttc ttagaattac aagaatccca aactcaccag gatgctcaca
901 tttaagtttt acatgcccaa gaaggtaagt acaatatttt
//
LOCUS HUMIL2A2 569 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human interleukin 2 (IL-2) gene, 3' flank.
ACCESSION M33198
KEYWORDS Alu repetitive sequence; interleukin; interleukin 2.
SEGMENT 2 of 2
SOURCE Human DNA, clones Lm HIG[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 569)
AUTHORS Nishino,N., Obaru,K., Maeda,S., Shimada,K. and Onoue,K.
TITLE Organization of the DNA regions flanking the human interleukin 2
gene
JOURNAL Biomed. Res. 6, 197-205 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rpt 136 449 Alu-repeat /nomgen="IL2" /map="4q26-q27"
/hgml_locus_uid="LT0164X"
rpt 130 135 5' insertion target sequence
rpt 450 455 3' insertion target sequence
BASE COUNT 204 a 131 c 96 g 138 t
ORIGIN Unknown number of bp after segment 1.
1 agcttcaata agatccaatg aatattctag attctatttg tcttctgaag acagcttaat
61 ctaatttaga taaaaataac atcatccaga gcctctacac tatttcagac acatgtagca
121 tcagcttaaa aattatgaaa cctactggct aacacgtgaa accttgtcac taccaaaaat
181 acaaaaaaaa aaaaattagc tgagtgtggt ggcgggcgcg tagtcccagc tactcaggag
241 gctgaggcag gagaatggcg tgaacttggt aggcagagct gcagtgagcc aagatcgtgc
301 cattgcactc cagcctgggt gacagagcaa gactccatct caaaaaaaaa aaaaaaaaaa
361 aagagacctg ctaacacaca cacacacaca cacacacaca ctctctctct ctctctctct
421 ctctctctct ctctctctct ctctctctca aattaagttg ggcggcaagg ggaaacaata
481 aacatctcca acataggatt caagtgtagt tataagatac agactttaac taatataata
541 tgttcaagaa aataaagcat catatctag
//
LOCUS TOMCPKA 103 bp ss-rRNA ORG 12-JUL-1990
DEFINITION Tomato chloroplast 4.5S ribosomal RNA.
ACCESSION M33098
KEYWORDS 4.5S ribosomal RNA; ribosomal RNA.
SOURCE Tomato (strain Mill) chloroplast ribosomal RNA.
ORGANISM Chloroplast Lycopersicon esculentum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae; Lycopersicon esculentum.
REFERENCE 1 (bases 1 to 103)
AUTHORS Zhen-Qi,C., Xiao,X. and E,-Sheng.W.
TITLE The nucleotide sequence of 4.5 S rRNA from tomato chloroplasts
JOURNAL Biochim. Biophys. Acta 866, 89-91 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 103 4.5S ribosomal RNA
BASE COUNT 29 a 19 c 32 g 23 t
ORIGIN
1 gaaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagtg
61 atgtatgcag ctgaggcatc ctaacagatc ggtagacttg aac
//
LOCUS EBOMAY 157 bp ss-RNA VRL 12-JUL-1990
DEFINITION Ebola virus 3' proximal protein gene, 5' end.
ACCESSION M33062
KEYWORDS .
SOURCE Ebola virus (strain MAY; Zaire 1976) RNA.
ORGANISM Ebola virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae.
REFERENCE 1 (bases 1 to 157)
AUTHORS Kiley,M.P., Wilusz,J., McCormick,J.B. and Keene,J.D.
TITLE Conservation of the 3' terminal nucleotide sequences of Ebola and
Marburg virus
JOURNAL Virology 149, 251-254 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 53 > 157 3'proximal protein
BASE COUNT 56 a 22 c 31 g 48 t
ORIGIN
1 gggcacacaa aaagaaagaa gaatttttag gatcttttgt gtgcgaataa ctatgaggaa
61 gattaataat ttcctctcat tgaaatttga tgatcggaat ttgaaattga aattgttgat
121 ctgtaatcac accgttgatt cagagccaca cacaagt
//
LOCUS ECOBISCASD 3337 bp ds-DNA BCT 12-JUL-1990
DEFINITION E.coli biotin sulfoxide reductase (bisC) gene, complete cds.
ACCESSION M34827
KEYWORDS biotin sulfoxide reductase; bisC gene.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 3337)
AUTHORS Pierson,D.E. and Campbell,A.
TITLE Cloning and nucleotide sequence of bisC, the structural gene for
biotin sulfoxide reductase in Escherichia coli
JOURNAL J. Bacteriol. 172, 2194-2198 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 577 2757 biotin sulfoxide reductase (bisC)
BASE COUNT 764 a 856 c 927 g 790 t
ORIGIN
1 tatccccgct gcgggttacg ctaacaccag tgccgcgcat tttgtcgcgc agttcgcttc
61 ctgcacatcc atgtaataac caacgccgcc gcccagagct gcgcctgctg ctgcgccaat
121 cagcgcgcct ttaccgcgat ctttcttcga agaagagagc gcaccaatac ccgcgcccac
181 gagagagccc agacctgcgc cgatagcaga tttacctgct tcgcgttcgc cggtgtaagg
241 gttagttgtg cagccagata ccgccagagc gccactcact acggcggcaa taagataaac
301 acgtttcttc attgttaatc cttaataacc tttttattct ttgccacggg ttccgtggcg
361 ggagattatg ccgcgtgaac atgaagatta ttcctgggaa tactcggaaa tttgtaagta
421 atatttaact gctcaataca tctaaccttt caggagtctt cggtttggcc aactcatcct
481 cacgatattc cgttctgact gccgccattg ggggcccatg ctggttgaaa ccgacggcga
541 aaccgtgttt agctgcgtgg cgcgttagcc acaggaatgg aaaactcctt gcagagcgcg
601 gttcgcgacc aggttcacag caatacgcgg gtacgatttc caatggtgcg aaaaggcttt
661 cttgcgtcac cggaaaaccc gcaaggcatt cgtgggcagg atgaatttgt tcgcgtgagt
721 tgggatgagg cgctggatct tattcaccaa caacataaac gcattcgtga ggcttatggt
781 ccggcatcga tttttgctgg ttcctacggc tggcgttcaa acggcgtgct gcataaggcc
841 tcgacattat tacaacgcta tatggcgctg gcaggcggtt ataccgggca tctgggggat
901 tattcgaccg gcgcggcaca ggcgatcatg ccgtatgtcg tgggtggtag tgaagtttat
961 caacagcaga ccagttggcc gctggtgctg gaacatagcg atgtcgtggt gctgtggagt
1021 gctaacccac tcaatacgct gaaaattgcg tggaatgcat ccgatgagca ggggctttct
1081 tacttttctg cactgcgtga cagcgggaaa aagctgatct gcattgatcc aatgcgatcg
1141 gaaaccgtcg atttctttgg cgataaaatg gagtgggtgg caccgcacat gggcaccgat
1201 gttgcgctga tgctggggat cgcccatacg ctggtggaaa atggttggca cgacgaagcg
1261 tttctggcgc gttgcaccac aggttatgcc gtcttcgcct cttatttgct gggcgagagt
1321 gacggaatag cgaaaaccgc cgaatgggca gcagagattt gtggtgttgg cgcagcgaaa
1381 atccgcgagc tggcggctat tttccaccaa aataccacca tgctgatggc aggctgggga
1441 atgcagcgcc aacagtttgg tgagcaaaaa cactggatga tcgtcacgct ggcagcaatg
1501 ttggggcaaa tcggcacacc cggcggcggt tttggtcttt cttaccattt tgccaatggt
1561 ggtaacccca cgcggcgttc tgcggtgctc tcttccatgc agggcagctt gccgggtggc
1621 tgcgatgcgg tggataaaat ccctgttgcc cgcattgttg aagcactgga aaaccctggt
1681 ggcgcatatc aacacaacgg tatgaaccga catttcccgg atattcgttt tatctggtgg
1741 gcgggcggtg ccaactttac tcatcatcag gataccaatc gcctgatccg tgcctggcaa
1801 aaaccggagc tggtggtgat ctctgaatgc ttctggacgg cggcggcaaa acacgcggat
1861 atcgttctgc ctgcgactac ctcttttgag cgtaatgatc tcaccatgac cggtgattac
1921 agtaatcagc atctggtgcc gatgaagcaa gtggtgccgc cacgctatga agcgcgtaat
1981 gattttgatg tttttgccga gttaagtgaa cgctgggaga agggcggtta tgcacgtttt
2041 acggaaggaa aaagtgagct gcaatggctg gaaacgtttt ataacgttgc ccgacagcgc
2101 ggggcaagcc agcaggttga attgccgcca tttgctgagt tctggcaagc caaccagtta
2161 attgagatgc cggaaaaccc ggacagcgag cggtttattc gcttcgctgc atttttgccg
2221 cgatccgctg gcgatccgtt aaaaacgcag cgcaagattg aaatcttctc acagcgtatt
2281 gccgattacg gttacccgga ttgccctggg catccaatgt ggctggagcc ggacgaatgg
2341 cagggcaatg ccgaaccaga acagttgcag gtactttctg cccatccggc gcaccgcctg
2401 cacagccagc tgaattacag ttctctgcgc gaattgtacg cggtggcaaa tcgtgagcct
2461 gtcaccattc atcctgacga tgcccaggag cgcggcatac aagatggcga tactgttcgg
2521 ttgtggaacg cacgcgggca aattcttgcc ggagcggtca ttagcgaggg aattaaacct
2581 ggcgtgattt gcattcacga aggggcatgg ccggatctgg atttaaccgc tgacggtatt
2641 tgtaaaaacg gcgcagtgaa cgtgctgacc aaagatctcc ccagctcgcg gctgggaatg
2701 gctgtgcggg taatacggcg ctggcatggc tggaaaaata caacggtccg gaactgacac
2761 ttacagcgtt tgaaccaccg gccagctcat aatccatgtg ggtagttggg tttcatcctg
2821 ccatgcgcaa tcgacaatgt gaaaaccctg tgcctggtaa aaatttatcg ccggttgatt
2881 tttttgataa acctccagca tcaggtgggg atggcgctgc tgcacatact gcatcagcgc
2941 cttaccaata ccgcgcctga cggccttcgg tgcgacaaac atcgctgcca gaaatcggcc
3001 ttccataatg ctgacaaaac cgagaagctt accgtcttct tcccagaccc agttttgcgc
3061 gttggcaaga taggcatccg caccagcgga atgcagtcac gccagtaatt cgcttttata
3121 aagggatgcc cccaggttgt actttccagc cacagttcga ggatcgcggg gagttctgaa
3181 cgttgcgctt cccgaatcat ggtttatttc ccggatagca acagcagcca accacatgat
3241 cattcaccag cccacatgcc tgcataaagg gagtaacaga ttgtggtgcc gacaaactta
3301 aaaccacgtt ttttcagtgc cttagatagg gcgttcg
//
LOCUS HUMARXA 1335 bp ss-mRNA PRI 12-JUL-1990
DEFINITION Human aldose reductase mRNA, complete cds.
ACCESSION M34720
KEYWORDS aldehyde reductase; aldose reductase.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1335)
AUTHORS Grundmann,U., Bohn,H., Obermeier,R. and Amann,E.
TITLE Cloning and prokaryotic expression of a biologically active human
placental aldose reductase
JOURNAL DNA Cell Biol. 9, 149-157 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 14 964 aldose reductase (EC 1.1.1.21)
mRNA < 1 1335 aldose reductase mRNA
BASE COUNT 319 a 347 c 356 g 313 t
ORIGIN
1 gagcgcagca gccatggcaa gccgtctcct gctcaacaac ggcgccaaga tgcccatcct
61 ggggttgggt acctggaagt cccctccagg gcaggtgact gaggccgtga aggtggccat
121 tgacgtcggg taccgccaca tcgactgtgc ccatgtgtac cagaatgaga atgaggtggg
181 ggtggccatt caggagaagc tcagggagca ggtggtgaag cgtgaggagc tcttcatcgt
241 cagcaagctg tggtgcacgt accatgagaa gggcctggtg aaaggagcct gccagaagac
301 actcagcgac ctgaagctgg actacctgga cctctacctt attcactggc cgactggctt
361 taagcctggg aaggaatttt tcccattgga tgagtcgggc aatgtggttc ccagtgacac
421 caacattctg gacacgtggg cggccatgga agagctggtg gatgaagggc tggtgaaagc
481 tattggcatc tccaacttca accatctcca ggtggagatg atcttaaaca aacctggctt
541 gaagtataag cctgcagtta accagattga gtgccaccca tatctcactc aggagaagtt
601 aatccagtac tgccagtcca aaggcatcgt ggtgaccgcc tacagccccc tcggctctcc
661 tgacaggccc tgggccaagc ccgaggaccc ttctctcctg gaggatccca ggatcaaggc
721 gatcgcagcc aagcacaata aaactacagc ccaggtcctg atccggttcc ccatgcagag
781 gaacttggtg gtgatcccca agtctgtgac accagaacgc attgctgaga actttaaggt
841 ctttgacttt gaactgagca gccaggatat gaccacctta ctcagctaca acaggaactg
901 gagggtctgt gccttgttga gctgtacctc ccacaaggat taccccttcc atgaagagtt
961 ttgaagctgt ggttgcctgc tcgtccccaa gtgacctata cctgtgtttc ttgcctcatt
1021 tttttccttg caaatgtagt atggcctgtg tcactcagca gtgggacagc aacctgtaga
1081 gtggccagcg agggcgtgtc tagcttgatg ttggatctca agagccctgt cagtagagta
1141 gaagtctctt ccagtttgct ttgcccttct ttctaccctg ctggggaaag tacaacctga
1201 ataccctttt ctgaccaaag agaagcaaaa tctaccaggt caaaatagtg ccactaacgg
1261 ttgagttttg actgcttgga actggaatcc tttcagcaag acttctcttt gcctcaaata
1321 aaaagtgctt ttgtg
//
LOCUS HUMARXB 652 bp ds-DNA PRI 12-JUL-1990
DEFINITION Human aldose reductase gene, partial cds.
ACCESSION M34721
KEYWORDS aldehyde reductase; aldose reductase.
SOURCE Human placenta DNA, clone lambda-gt11-10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 652)
AUTHORS Grundmann,U., Bohn,H., Obermeier,R. and Amann,E.
TITLE Cloning and prokaryotic expression of a biologically active human
placental aldose reductase
JOURNAL DNA Cell Biol. 9, 149-157 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 27 aldose reductase, exon X (AA at 1) (EC
1.1.1.21)
626 > 652 aldose reductase, exon X+1
IVS 28 625 aldose reductase intron X
BASE COUNT 142 a 164 c 184 g 162 t
ORIGIN
1 gccaagcaca ataaaactac agcccaggta cagccacttc aggtgttgct gaccgtccac
61 aactgcctgc attcctgaca gtcctgttag ccaagaggag gaagtgactg agcctgttac
121 accctcacag gaagtatggt taggggtcct caagtacaga gtggaaaggg cacagatcgg
181 ggttttagaa gactctggca tgggctctta gattaatagt gcctgccccc actactgcaa
241 gggtgactgc cacgagggcc agcgcttgtt cattcatgtg gaacctcatc tgtacaaatg
301 taagagctct tagccgtgca gggaatgttc tttctcctga gtggtagtgt gcatttctag
361 ccagtggagg gcctcatgtg gtctcatgat atgcctgaga cactgaagcg tgtggcacag
421 tggctagcgc aggactctgg agtcagatct ggacctgaat gcgtcgccta cctgttgcta
481 gctgtgacct gacatcttgg agcccctctc tgatcacctg tggagttcta gcacgtcctt
541 ctgcaggttg tgtgtgtgag agactgagat gatgggtgcg agtgcctggc atgtatacac
601 actcactgtc tccttgggct cacaggtcct gatccggttc cccatgcaga gg
//
LOCUS MRV3TERM 59 bp ss-RNA VRL 12-JUL-1990
DEFINITION Marburg virus 3'terminal region of genome.
ACCESSION M36065
KEYWORDS .
SOURCE Marburg virus RNA.
ORGANISM Marburg virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae.
REFERENCE 1 (bases 1 to 59)
AUTHORS Kiley,M.P., Wilusz,J., McCormick,J.B. and Keene,J.D.
TITLE Conservation of the 3' terminal nucleotide sequences of Ebola and
Marburg virus
JOURNAL Virology 149, 251-254 (1986)
STANDARD simple staff_review
BASE COUNT 15 a 10 c 5 g 29 t
ORIGIN
1 tctgtgtgtt ttgttctcta ctactaaaac acatagtata tttatttctt cttataatc
//
LOCUS RATQRED1 431 bp ds-DNA ROD 12-JUL-1990
DEFINITION Rat quinone reductase gene, exon 1.
ACCESSION M33038
KEYWORDS quinone reductase.
SEGMENT 1 of 2
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 431)
AUTHORS Bayney,R.M. and Pickett,C.B.
TITLE Rat liver NAD(P)H:quinone reductase: Isolation of a quinone
reductase structural gene and prediction of the NH2 terminal
sequence of the protein by double-stranded sequencing of exons 1
and 2
JOURNAL Arch. Biochem. Biophys. 260, 847-850 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 244 + 250 quinone reductase, exon 1
pre-msg 170 > 431 quinone reductase mRNA and introns
IVS 251 > 431 quinone reductase intron A
BASE COUNT 91 a 125 c 119 g 96 t
ORIGIN
1 taacttggta tcctcccccc agcgcctctg ggctggcaat ccagccccgc cctcgctggc
61 tgccctgcac agtgggctgg gccggaaaag caagatataa agcctgaaag tgctcagtac
121 agctcgcact agcctaggct gtggcacgca ggatctttcc gaagcatttc agggtcgtcc
181 tggcaaccag ctgctcagcc aatcagcgct tgacactacg atccgccccc aacttctgga
241 gccatggcgg gtgagtatgg ctccaactcc agcctaattc atcctgagga ggatgtaggg
301 gcttgctatg gggtttgttc cttgcctcga agttgaaaag tgtagagatt aggatcctgg
361 atgagcctcg gtgagtcccc ggaaggagag cttcttctca gaaccatagg tgcagattat
421 tctgcagccc c
//
LOCUS RATQRED2 410 bp ds-DNA ROD 12-JUL-1990
DEFINITION Rat quinone reductase gene, exon 2.
ACCESSION M33039
KEYWORDS quinone reductase.
SEGMENT 2 of 2
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 410)
AUTHORS Bayney,R.M. and Pickett,C.B.
TITLE Rat liver NAD(P)H:quinone reductase: Isolation of a quinone
reductase structural gene and prediction of the NH2 terminal
sequence of the protein by double-stranded sequencing of exons 1
and 2
JOURNAL Arch. Biochem. Biophys. 260, 847-850 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 108 / 272 quinone reductase, exon 2
IVS < 1 107 quinone reductase intron A
IVS 273 > 410 quinone reductase intron B
BASE COUNT 103 a 111 c 102 g 94 t
ORIGIN
1 agaaactaag gtggggaacg tgtctggtcc caagcacttt tagattaggg actcacccgt
61 cctgtttgga ttttctttcc tcacctcctc acgtacgcct taaacagtga gaagagccct
121 gattgtattg gcccacgcag agaggacatc attcaactat gccatgaagg aggctgctgt
181 ggaggctctg aagaagaaag gatgggaggt ggtcgaatct gacctctatg ctatgaactt
241 taaccccctc atttccagaa acgacatcac aggtaagaat cgtctccctc cactgacagt
301 ggaccacgtg acccagcctc agcccctctt gcctcccaac aggggagccg aaggactcgg
361 agaactttca gtaccctgtt gagtcatctc tggcgtataa ggaaggccgc
//
LOCUS RHAFIXA 1040 bp ds-DNA BCT 12-JUL-1990
DEFINITION A.caulinodans nitrogen fixation protein (nifO and fixA) genes,
complete cds and 5'end.
ACCESSION M35122
KEYWORDS nitrogen fixation protein.
SOURCE A.caulinodans (strain ORS571) DNA.
ORGANISM Azorhizobium caulinodans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 1040)
AUTHORS Kaminski,P.A., Norel,F., Desnoues,N., Kush,A., Salzano,G. and
Elmerich,C.
TITLE Characterization of the fixABC region of Azorhizobium caulinodans
ORS571 and identification of a new nitrogen fixation gene
JOURNAL Mol. Gen. Genet. 214, 496-502 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 328 657 nitrogen fixation protein (nifO)
pept 939 > 1040 nitrogen fixation protein (fixA)
BASE COUNT 181 a 349 c 328 g 182 t
ORIGIN
1 gagctcggcc tctatgacat cgacgccagc gcggtgaacg tcgcgcacgt gcccgtcatt
61 ccggacgaga acgaggtgag cggcgtcgat atcgtcgtcc gcctgcgtcg cacgggccgc
121 tgagggacgc tccgcctgtc gccttcgggg cacccgcatc cgcgtagcag cgcggccgcc
181 tcccgcggac tggccatcgc cagtactggc acgggcattg cttggacctc atccgtgccc
241 cgacatcggg gcaacgggtt cgcccgccaa agcgaccgga tgagttcacc tcatccgatt
301 acgcaccaga ctttcaggag acggagcatg gcgaccgccg gcggcatcct cgatcagctc
361 aacaaggcat ccagcgcgga agacttcttc gcgctgctcg aggtcgatta cgatccccaa
421 gtggtgaatg tggtgcgcct gcatatcctg cggcgcatgg gccagtatct ggtcagcgag
481 aatttcgaag gccaggcgga tgacgccatc cgcgcccggt gcaaagaggt gctggaacag
541 gcctatgcgg acttcctcgc ctcctcgccc ttgcaggagc gggtgttcaa ggtgctgaag
601 gaggccgccc agccgccgaa gcccaagccg atggtatcgc tcaccgttct caagtgacgt
661 tccccccctc ccgcgtcctt caaggcggcc tgcacccggc aggccgccct tcgcgtttca
721 gggcgcgggc gggtggtgag gggccacggg caagacgcgc ctgtcgcatt ccgacgcggg
781 tggcggacgt tcctgtcggc ggcggagccg gggcggaaag cgcattgtgg catgccagac
841 agccctttga tttcatgcgc gttttcgggc tgaaagacag ttggtacgac acttgctcat
901 tcctccccaa gagcccaacc gttccgggag cgaacgcaat gcacatcgtc gtctgcatca
961 agcaggttcc tgactccgcg cagatccgcg tgcaccccgt gacgaacacc atcatgcgtc
1021 agggtgtgcc cacgatcatc
//
LOCUS STMPPG 200 bp ds-DNA BCT 12-JUL-1990
DEFINITION S.griseus brown pigment production gene, 5' flank.
ACCESSION M35117
KEYWORDS brown pigment production protein.
SOURCE S.griseus (strain TK21) DNA, clone pARC1.
ORGANISM Streptomyces griseus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 200)
AUTHORS Horinouchi,S., Nishiyama,M., Nakamura,A. and Beppu,T.
TITLE Construction and characterization of multicopy expression-vectors
on Streptomyces spp
JOURNAL Mol. Gen. Genet. 210, 468-475 (1987)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 112 > 200 brown pigment production protein mRNA
BASE COUNT 22 a 79 c 71 g 28 t
ORIGIN
1 gatcgtccat ggtggccatc ccaccatccg ccgcgccggg gcggcgagcg cgtttcgctg
61 ggcggacacg ctccccttgc cggtgctagc gcgaccgcgc tagcgtggtc gggtgcccaa
121 gatccgtatg acgcccctga ccgaccggcg ttcggccggt tcctgaagca cgcccccgac
181 cgcgcggccg gccgggccgg
//
LOCUS SYNGPCNA 111 bp ds-DNA SYN 12-JUL-1990
DEFINITION Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end.
ACCESSION M35111
KEYWORDS nucleoprotein.
SOURCE Synthetic DNA, clone pACRP1-LCM WE N.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 111)
AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
TITLE Expression of the S-coded genes of lymphocytic choriomeningitis
arenavirus using a baculovirus vector
JOURNAL J. Gen. Virol. 67, 1515-1529 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 92 > 111 nucleoprotein
BASE COUNT 37 a 17 c 21 g 36 t
ORIGIN
1 tggagataat taaaatgata accatctcgc aaataaaccg gatcctaggc atttgattgc
61 gcttttattt ggaaattcat tgtgtgacaa aatgtctttg tccaaagaag t
//
LOCUS SYNGPCNB 111 bp ds-DNA SYN 12-JUL-1990
DEFINITION Lymphocytic choriomeningitis virus glycoprotein precursor gene,
5' end.
ACCESSION M35112
SOURCE Synthetic DNA, clone pACRP1-LCM WE G.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 111)
AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
TITLE Expression of the S-coded genes of lymphocytic choriomeningitis
arenavirus using a baculovirus vector
JOURNAL J. Gen. Virol. 67, 1515-1529 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 109 > 111 glycoprotein precursor
BASE COUNT 33 a 20 c 25 g 33 t
ORIGIN
1 tggagataat taaaatgata accatctcgc aaataaaccg gatcctaggc tttttggatt
61 gcgctttcct ttaggacaac tgggtgctgg attctatcca gtaaaaggat g
//
LOCUS SYNGPCNC 131 bp ds-DNA SYN 12-JUL-1990
DEFINITION Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end.
ACCESSION M35113
KEYWORDS nucleoprotein.
SOURCE Synthetic DNA, clone pACRP5-LCM WE N.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 131)
AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
TITLE Expression of the S-coded genes of lymphocytic choriomeningitis
arenavirus using a baculovirus vector
JOURNAL J. Gen. Virol. 67, 1515-1529 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 112 > 131 nucleoprotein
BASE COUNT 41 a 19 c 24 g 47 t
ORIGIN
1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgccg
61 gatcctaggc atttgattgc gcttttattt ggaaattcat tgtgtgacaa aatgtctttg
121 tccaaagaag t
//
LOCUS SYNGPCND 131 bp ds-DNA SYN 12-JUL-1990
DEFINITION Lymphocytic choriomeningitis virus glycoprotein precursor gene,
5' end.
ACCESSION M35114
KEYWORDS glycoprotein precursor.
SOURCE Synthetic DNA, clone pACRP5-LCM WE G.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 131)
AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
TITLE Expression of the S-coded genes of lymphocytic choriomeningitis
arenavirus using a baculovirus vector
JOURNAL J. Gen. Virol. 67, 1515-1529 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 129 > 131 glycoprotein precursor
BASE COUNT 37 a 22 c 28 g 44 t
ORIGIN
1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgccg
61 gatcctaggc tttttggatt gcgctttcct ttaggacaac tgggtgctgg attctatcca
121 gtaaaaggat g
//
LOCUS SYNGPCNE 155 bp ds-DNA SYN 12-JUL-1990
DEFINITION Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end.
ACCESSION M35115
KEYWORDS nucleoprotein.
SOURCE Synthetic DNA, clone pACRP6-LCM WE N.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 155)
AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
TITLE Expression of the S-coded genes of lymphocytic choriomeningitis
arenavirus using a baculovirus vector
JOURNAL J. Gen. Virol. 67, 1515-1529 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 136 > 155 nucleoprotein
BASE COUNT 53 a 22 c 26 g 54 t
ORIGIN
1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgtaa
61 cagttttgta ataaaaaaac cccggatcct aggcatttga ttgcgctttt atttggaaat
121 tcattgtgtg acaaaatgtc tttgtccaaa gaagt
//
LOCUS SYNGPCNF 155 bp ds-DNA SYN 12-JUL-1990
DEFINITION Lymphocytic choriomeningitis virus glycoprotein precursor gene,
5' end.
ACCESSION M35116
KEYWORDS glycoprotein precursor.
SOURCE Synthetic DNA, clone pACRP6-LCM WE G.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 155)
AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
TITLE Expression of the S-coded genes of lymphocytic choriomeningitis
arenavirus using a baculovirus vector
JOURNAL J. Gen. Virol. 67, 1515-1529 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 153 > 155 glycoprotein precursor
BASE COUNT 49 a 25 c 30 g 51 t
ORIGIN
1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgtaa
61 cagttttgta ataaaaaaac cccggatcct aggctttttg gattgcgctt tcctttagga
121 caactgggtg ctggattcta tccagtaaaa ggatg
//
LOCUS XELD7 1051 bp ss-mRNA VRT 12-JUL-1990
DEFINITION X.laevis pot. developmental protein (D7) mRNA, complete cds.
ACCESSION M35119
KEYWORDS developmental protein D7.
SOURCE X.laevis, cDNA to mRNA, clones D7.1 and D7.0.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1051)
AUTHORS Smith,R.C., Dworkin,M.B. and Dworkin-Rastl,E.
TITLE Destruction of a translationally controlled mRNA in Xenopus oocytes
delays progesterone-induced maturation
JOURNAL Genes Dev. 2, 1296-1306 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 27 863 pot. developmental protein D7
BASE COUNT 318 a 240 c 222 g 271 t
ORIGIN
1 gaaaaccggg acgtttgggc tgcaatatgg aatttgatga gctgatgcag tgcccatatg
61 acaaaaatca tatgattcgg cccagccggt ttccctacca ccttgttaaa tgcagagaga
121 ataatcgtgc agcagctaaa attctagcaa cttgcccata taatgcccgc cacagagtcc
181 ctaaacagga gcttgatctg cacatggcca gctgtgaata cagggtgacc atggagccca
241 tttctgctgc attttcacat cagaaggtgg agacctcaac atggcaaagc cctccttgtg
301 aagaggtctg ggaaactgac gaagatcccg tgtcaaggcc aaagcccttt attttaaatg
361 attttactcc ttctcagcct tttaatatgt cagaaggtga tggaaatatg ccgtatactg
421 gaataagcag caactacaga cctgaagtcc aacctatgaa ttcagtcatg caagtaaagc
481 aaaatcaacc tgaacctgag ccttttacct ccagtgagcg aaactatgat ccacgatcca
541 aggaaccacc caatccaaag caacctgcag tgaatggcta caaacctgca actacaaata
601 caaacccatg gtgcaggcaa acgggaggat cgaggggagc tgctcctcca aagttgggtg
661 ctaaatcctc agatgagggg ccaagaaata aggaatttcc cactccaaag gcgaacttga
721 tgaatgagta cgtacctgta gcagcaaatg caaatccatg gtgcaggcaa ccaggagggt
781 ccagtgctgc ttcagaacct ttgggtgttg actccttcga tgagtggcca tgccttggac
841 gccagccatg ggttagaaag taaatcttca ctttaaaaac aggactttca tctgaacctg
901 ttcctgactt gtccaactcc tggattttta aaatttgttg tgaagttgcc atttagtatt
961 tttgtacaaa attttaacag ccttcatttt tacatattaa gctttttatc acaaatataa
1021 tactaattta cttgaatgtt atttgttaac c
//
LOCUS YSCNUP1 4986 bp ds-DNA PLN 12-JUL-1990
DEFINITION S.cerevisiae nucleoporin (NUP1) gene, complete cds..
ACCESSION M33632
KEYWORDS nuclear pore complex protein; nucleoporin.
SOURCE S.cerevisiae (strain S288C) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 4986)
AUTHORS Davis,L.I. and Fink,G.R.
TITLE The NUP1 gene encodes an essential component of the yeast nuclear
pore complex
JOURNAL Cell 61, 965-978 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by L.I.Davis, 06-APR-1990, for release after publication.
FEATURES from to/span description
pept 1001 4231 nucleoporin (NUP1) (put.)
BASE COUNT 1590 a 1036 c 991 g 1369 t
ORIGIN Chromosome XV; 14 cm prox. to ADE2.
1 gaattcatca gtgaactctt catcattcaa aaacacccaa tcatagttga acttggagtt
61 aaatctatct tccacggatt taatagactc agccaacgaa tatagatctc tattacgcac
121 tagagtgaca aaagtggcct tttcccgagg accggtatac tttggtagtt ccactttcga
181 gtatttatag cccgagccgg aagatagcac ggtactgtac tttgggcttg gggcagagcc
241 tctcgcatac tgagctccat gaaagaacac atacacagta aacacggcga ccagaagaag
301 tcctaatttt ttgtaaacag gctgcttgct agctgggatc ataatcttcg ccattttggt
361 tattgactct atcccttaaa aactcttctg atggagtact ttacttcgat tgcttaacga
421 aatctttgtg aggaaaaaga tatctcttaa aattagaaag tacaatagtc tagcgtatta
481 tactaagaat ctgcaaaaaa gaagcaagaa ggcaccacct attatagacc tttgacacga
541 agtctctctg gagtgctttg gcctacgtgt gcggtactcg tttacatggg acaaccacgg
601 tttttttttt ggtgttactg gaggtataca gtgcgtatat ccacttgtac gacaagagat
661 ttacactaca ccgcgtaaag aaaacgccga caccaaatat aagtcacgtg tatgcaaagc
721 ctattttatg ccctaatttt caagccccgg tttttacgcc ctagttttta tatttagggt
781 ttgtcgttgc acgtgatcaa tggttcgtat tatgtgacat tgaaatgctt tttcatttta
841 atttttttct ttgacgaaat ttcgtaatgt caagaaacac ttaaagaaaa taagtgatga
901 ggaactcaat aaggacacta cgtagcggtg caaatacgat aggatattag cctcgaaagg
961 gttataggga cagagagtga gcgacaattt ttagtcattc atgtcttcaa acacttcttc
1021 tgtgatgtct tctccacgtg tcgaaaagag atcgttttct tccactttaa aatcattctt
1081 cacaaacccc aataaaaaac ggccatcgag caagaaagtt ttcagttcaa acctctcata
1141 cgcgaatcat ttggaggaat cagatgttga agacacactg catgttaata agagaaagag
1201 ggtgtccggt acatcacagc atagcgacag cttaactcag aacaacaata atgcgccaat
1261 tataatatat ggaaccgaaa acactgagag accgccgctt ttgccaattt tgcccattca
1321 aagactgagg ttattaaggg agaagcagag ggtgagaaat atgcgtgagc ttggattaat
1381 tcaatcaact gaatttccat ctattacatc gtcggttata ttgggctctc aaagtaaaag
1441 cgatgaggga ggatcgtacc tatgcacatc atctactcct tcccctatta aaaacggttc
1501 ttgcactagg cagttggccg gaaaaagcgg tgaagacacc aatgttggac tacccattct
1561 caaatcattg aaaaatagat ccaatagaaa aaggtttcat agtcagtcaa aggggaccgt
1621 gtggtcagca aattttgaat atgatttgtc agaatatgac gctatacaaa aaaaggataa
1681 caaggataag gaaggtaacg ctggcggtga tcagaagaca agcgagaata gaaataatat
1741 taagagtagt atttcaaatg gcaatctggc tacaggccct aacctgacaa gcgaaattga
1801 agacctacgt gcagacatca actctaatag gttatcgaat cctcaaaaaa atctactttt
1861 aaaaggacca gcttccacag ttgcaaaaac tgcccctatt caggagagct ttgttcccaa
1921 ttcagagcgc tctggtacgc ctacgttaaa gaaaaatatt gagcccaaaa aggacaaaga
1981 aagtattgtt ttgcccaccg taggttttga ctttatcaag gacaatgaga ctccatctaa
2041 gaaaacttct cctaaggcaa cttcttctgc aggtgcagtc tttaaatcga gtgtagaaat
2101 gggaaaaacc gataagtcaa cgaaaactgc cgaggcgcct accttatcat tcaattttag
2161 ccaaaaggct aataaaacta aggctgtcga caatactgtc ccttccacaa ctttattcaa
2221 ttttggtggt aaatcagata ccgttacttc tgccagtcaa ccttttaaat ttggaaagac
2281 atccgaaaaa agtgaaaatc atacagaatc agacgcgcct ccaaaatcaa ctgctccaat
2341 attttctttt ggtaaacaag aagagaatgg tgatgaaggt gatgatgaaa atgagcccaa
2401 aagaaaaagg cgtttacctg ttagcgagga tacaaacacc aagcctttat tcgatttcgg
2461 caagaccggt gatcaaaagg agaccaaaaa gggagagtca gaaaaggacg catcagggaa
2521 accaagcttt gtctttggtg caagtgataa gcaagctgaa ggtacaccat tatttacatt
2581 cggaaaaaaa gctgatgtaa caagcaatat tgactcctct gcacaattta cctttggtaa
2641 agccgccacc gcgaaagaaa cacacaccaa accatctgag acacctgcca caatagtcaa
2701 gaagcctact tttacttttg ggcagtcaac aagtgaaaat aagatctctg agggaagtgc
2761 gaaacctaca ttctctttct ctaagtcaga ggaggaacgt aagagtagtc caatttcaaa
2821 cgaagcagct aaaccctcgt tttcgtttcc gggcaagcct gttgatgttc aagcaccgac
2881 ggatgataag actctcaagc caactttttc ttttactgaa cctgctcaaa aagattcatc
2941 tgttgtttcg gaacctaaaa agccctcctt tacgtttgcg tcttcaaaaa cctcacaacc
3001 aaagccattg ttttcatttg gtaagtcaga tgcagctaaa gaaccaccag gctctaacac
3061 ctcattttct ttcactaaac ctcctgctaa tgagacagat aaaagaccta caccgccatc
3121 tttcaccttt ggcggttcca caacaaataa tacaacaacc actagcacaa aaccatcttt
3181 tagttttggg gctcccgagt cgatgaagtc gacagcaagt acagcggcag caaatacgga
3241 gaagctatca aatggctttt cctttacaaa gttcaatcac aataaagaaa agtcaaactc
3301 tccaacttct ttcttcgatg gttctgcttc ctcaacgccg attcctgtct tgggtaagcc
3361 aacagacgct actggtaata caacatctaa atctgcattt tcattcggta ctgctaacac
3421 caatggtacc aatgcctcag caaactccac atcattctcg tttaacgccc ctgctactgg
3481 taacggcaca actactactt ccaatacctc aggaaccaat atagcgggta catttaacgt
3541 aggaaaaccg gatcaaagta tcgctagtgg caatacgaat ggagcgggct cggcatttgg
3601 cttttcgagc tcaggaacag cagcaactgg tgcagcttct aatcaatctt catttaattt
3661 tggaaacaat ggtgcagggg gtctcaatcc ttttacatca gcaacttcgt caactaatgc
3721 taatgctggt ttattcaata aacctccttc cacgaatgca caaaatgtca atgttccctc
3781 tgcttttaat tttacgggaa ataattcaac gcctggtggc ggctctgtat ttaatatgaa
3841 cggcaacact aatgctaata cggtgtttgc cggctctaat aaccaaccac atcaatcgca
3901 aaccccatct ttcaatacaa acagctcatt cacgccatca acagttccta atattaattt
3961 tagcggattg aatggcggaa ttactaatac cgcgaccaat gcattaaggc caagtgatat
4021 atttggtgcg aatgctgcct ctggttccaa ttcaaacgta acaaatccat catccatttt
4081 tgggggggca ggtggtgtgc cgacaacttc ttttgggcag ccgcagtcag cccctaatca
4141 gatggggatg ggaacaaata atggcatgag catgggcggt ggtgttatgg cgaacagaaa
4201 gattgcaaga atgaggcact ctaaaaggta aatatcggtt atgtaattag gtattgtgtt
4261 gcttttctga aggatatata ttcctatatt ccttcaactt atacactgca tatgaaactt
4321 cttgagaagt tttaaataat tggtcttttt tttaatcggg tacaggcgag ataatggata
4381 cctgttctag ggtaggcaaa tttggaggca agagcccaaa attgttttgt tttgacctgg
4441 tttctctcag ttaacatatt atgaatagca tcttacaaat aatttacgag cgaaactttc
4501 gatctgcaaa gcgctccttt tttctttccg aaaaaataca gtaaaataag gtctagtaca
4561 cagcctgatc tacaattagt gttcttttct gtttacttcg atcagctgtt tagctttgcc
4621 acttttctag tagctccaca cgctgtgaaa gtaacgagac atagaagtca cggatggatt
4681 tgaaaaggga ttggttacga tggaaaatta caataggtag tgggcctggc agtattgtgc
4741 tggatttccc atcatttttg gtagggtgcg tattcacaac aatgatgggg ccaatattac
4801 agaagcttat aggtaaactt ttggtaggtt taataacggt atgcaaattc ttagttatta
4861 ttggaagcat tgtcttcgtt atcggagttg catcaaagaa atacacatat gacgatttca
4921 aagtaagtat aaaaagatca ggagagccag gggaatcaca tgatatgagg actgaaccaa
4981 aaagaa
//GenBank-Updates@genbank.bio.net (07/13/90)
LOCUS TMGMPCPA 1413 bp ss-RNA VRL 13-JUL-1990
DEFINITION Tobacco mild green mosaic virus movement and coat protein genes,
complete cds.
ACCESSION M34236
KEYWORDS coat protein; movement protein.
SOURCE Tobacco mild green mosaic virus (strain PV 228), cDNA to viral RNA.
ORGANISM Tobacco mild green mosaic virus
Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses;
Tobamovirus.
REFERENCE 1 (bases 1 to 1413)
AUTHORS Nejidat,A., Cellier,F., Holt,C.A., Gafny,R., Eggenberger,A.L. and
Beachy,R.N.
TITLE Examination of movement function complementation between two
tobamoviruses in a recombinant virus
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Nejidat, 11-MAY-1990.
FEATURES from to/span description
pept 51 851 movement protein
pept 857 1336 coat protein
BASE COUNT 456 a 214 c 331 g 412 t
ORIGIN
1 tgttaacaaa tttttgtgtg ataaattttt atttagaact ttgtttttaa atggctgtta
61 gtctcagaga tactgtcaaa attagcgagt tcattaatct ttcgaaacag gatgagatac
121 ttccggcatt catgactaag gtcaagagtg ttagaatatc gactgtggac aagattatgg
181 ctgttaagaa tgatagtctt tctgatgtag atttacttaa aggtgttaag ttagttaaga
241 atggttatgt atgcttagct ggtttggtag tgtctgggga gtggaatctc ccggataact
301 gccgtggtgg tgtcagtgtt tgtattgtag acaagagaat gaaaagaagt aatgaggcaa
361 cgctgggtgc ttatcacgcc cctgcttgca aaaagaattt ttcctttaag ctaatcccta
421 attattcaat aacatccgag gatgctgaga agaacccgtg gcaagtgtta gtgaatatca
481 aaggagtggc tatggaagaa ggatactgtc ctttatcttt ggagttcgtt tcaatttgtg
541 tagtacataa aaataatgta aagaaaggtt tgagggaacg tattttgaga gtaacagacg
601 actcgccaat tgaactcact gaaaaagttg ttgaggagtt cgtggatgaa gtaccaatgg
661 ctgtgaaact cgaaaggttc cggaaaacaa aaaagggaaa aaaaagaaaa aaagaaaaaa
721 aaaagagagt ggtaggtaat agtgttaata ataagaaaat aaataatagt ggtaagaagg
781 gtttgaaagt tgaggaaatt gaggataatg taagtgatga cgagtctatc gcgtcatcga
841 gtacgtttta atcaacatgc cttatacaat caactctccg agccaatttg tttacttaag
901 ttccgcttat gcagatcctg tgcagctgat caatctgtgt acaaatgcat tgggtaacca
961 gtttcaaacg caacaagcta ggacaacagt ccaacagcaa tttgcggatg cctggaaacc
1021 tgtgcctagt atgacagtga gatttcctgc atcggatttc tatgtgtata gatataattc
1081 gacgcttgat ccgttgatca cggcgttatt aaatagtttt gatactagaa atagaataat
1141 agaggttgat aatcaacccg caccgaatac tactgaaatc gttaacgcga ctcagagggt
1201 agacgatgcg actgtagcta taagggcttc aatcaataat ttagctaatg aactggttcg
1261 tggaactggc atgttcaatc aagcaggctt tgagactgct agtggacttg tctggaccac
1321 aactccggct acttagctat tgttgtgaga tttcctaaaa taaagtcgct gatactttaa
1381 attcagggtg gctgatacca aaatcagcag tgg
//
LOCUS CLOFTHFSA 1965 bp ds-DNA BCT 13-JUL-1990
DEFINITION C.thermoaceticum formyltetrahydrofolate synthetase (FTHFS) gene,
complete cds.
ACCESSION J02911
KEYWORDS forymltetrahydrofolate synthetase.
SOURCE C.thermoaceticum DNA.
ORGANISM Clostridium thermoaceticum
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1965)
AUTHORS Lovell,C.R., Przybyla,A. and Ljungdahl,L.G.
TITLE Primary structure of the thermostable formyltetrahydrofolate
synthetase from Clostridium thermoaceticum
JOURNAL Biochemistry 29, 5687-5694 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Lovell, 13-APR-1990, for release after publication.
FEATURES from to/span description
pept 56 1735 formyltetrahydrofolate synthetase (FTHFS) (ttg
start codon) (EC 6.3.4.3)
mRNA < 1 1760 FTHFS mRNA
binding 41 44 ribosomal binding site (put.)
signal 1761 1785 pot. transcription termination signal
BASE COUNT 459 a 576 c 532 g 398 t
ORIGIN
1 aatatgtgac gtgcaaccgg atgttcactt tttcgaatga gaggagtgga ttacattgtc
61 caaggtaccc agtgatattg agattgccca ggcagccaaa atgaaaccgg tcatggaact
121 ggcccgggga ctgggcatcc aagaggacga ggtcgagctt tatggtaagt acaaggccaa
181 gatctccctc gatgtctatc gtcgcctcaa agacaagcct gacgggaaac taatcctggt
241 aaccgccatt acccctactc cggccggcga agggaaaact actaccagtg tcggtctcac
301 cgatgccctg gctcgcctgg ggaaaagggt gatggtctgc ctgcgggagc cctccctggg
361 acccagcttt ggtatcaaag gcggtgccgc cggcggtggt tatgcccagg tagtacccat
421 ggaagatatc aacctgcact tcaccggcga tatccacgcc gtcacctatg cccacaacct
481 gctggcggcc atggtggata accacctgca gcagggtaac gtcctgaata ttgatccccg
541 taccatcacc tggcgccggg tcatcgacct taatgaccgg gctctgagga acatagtcat
601 cggcctgggt ggcaaagcca acggcgtacc gcgggagaca gggtttgaca tctccgttgc
661 ctcggaggtt atggcctgcc tgtgcctggc cagcgacctc atggatctca aggaacgttt
721 cagccgcatt gttgtcggct acacctatga cggcaaaccg gtcaccgccg gcgatctgga
781 ggcccagggt tccatggctc ttctcatgaa ggacgccatt aaacccaacc tggtccaaac
841 cctggagaat acgccggcct ttatccacgg tggtcccttc gccaatatcg cccacggttg
901 caacagcatt atcgcaacca agacggccct gaaactggcg gattatgtcg tgacggaagc
961 cggtttcggt gccgacctgg gtgccgagaa gttctatgac gttaaatgcc gttatgccgg
1021 ctttaaaccc gatgccacag tcatcgtggc taccgtccgc gccctcaaga tgcacggcgg
1081 cgtacccaaa tcagacctgg ccactgaaaa cctggaagcc ctgcgggaag gctttgccaa
1141 cctggagaaa cacatcgaaa atatcggcaa gttcggcgta ccggcagtcg tggccatcaa
1201 tgccttcccc accgataccg aggccgagct aaatctcctc tacgagttgt gcgccaaagc
1261 tggggccgaa gttgccctct cggaagtctg ggctaagggc ggcgaaggcg gtctggaact
1321 tgcccggaag gtgttgcaga ccctggagag caggccatcc aacttccatg tcctctacaa
1381 cctggacctg agtattaaag acaaaattgc caaaatcgcc accgagatct acggggccga
1441 cggcgtcaac tatacggccg aagccgacaa agctatccag cgttatgaat ccctgggcta
1501 cggcaacctg ccggtggtca tggccaagac ccaatactcc ttttccgatg acatgaccaa
1561 gctcgggcgg ccgcggaact ttaccatcac cgtgcgcgag gtgcgcctct cggccggagg
1621 caggcttatc gtccccatca ccggcgccat aatgaccatg cccgggctgc ccaaacgccc
1681 ggcggcctgc aacatcgaca tcgatgccga cggcgtcatt accggtcttt tctagtatgc
1741 ctggagtaaa gcctatccct accccactcc tccagggagt ggggtttttg ctgtctatat
1801 gagaatgata taactaagaa gctggttttc caagccgtac ctatcatagg ggctgcgcca
1861 ccgccatgaa ccacgaaaat gcaaaagata tcattcatgt tagaactctg ttaccgagag
1921 gaacgattat gtctggcatg acgagataga gtataccgta gcgat
//
LOCUS HUMGLIBSA 176 bp ds-DNA PRI 13-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region A1.
ACCESSION M34816
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone A1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 176)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 31 53 human zinc finger protein binding site
binding 94 116 human zinc finger protein binding site
binding 157 176 human zinc finger protein binding site
BASE COUNT 68 a 50 c 36 g 22 t
ORIGIN
1 aagaacacag gaaccaactg caagagctcc cgaagaccac ccacaatgat ggttgtatgt
61 cagaaggaca caggaaccaa ctgcaagagc tcccaaagac cacccacaat gatggttgta
121 tgtcagaagg acacaggaac caactgcaag agctcccaaa aaccacccac aatgat
//
LOCUS HUMGLIBSB 166 bp ds-DNA PRI 13-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region A2.
ACCESSION M34817
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone A2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 166)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 27 49 human zinc finger protein binding site
binding 90 112 human zinc finger protein binding site
BASE COUNT 60 a 47 c 37 g 22 t
ORIGIN
1 acacaggaac caactgcaag agctcccgaa gaccacccac aatgatggtg gtatgttgaa
61 aggacacagg aaccaactgc aagagctccc aaagaccacc cacaatgatg gttgtgtgtc
121 aaaaggacac aggaaccaac tgtaagagct cctgaagacc acccac
//
LOCUS HUMGLIBSC 313 bp ds-DNA PRI 13-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region A3.
ACCESSION M34818
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone A3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 313)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 1 23 human zinc finger protein binding site
binding 64 86 human zinc finger protein binding site
binding 127 149 human zinc finger protein binding site
binding 190 212 human zinc finger protein binding site
binding 253 275 human zinc finger protein binding site
BASE COUNT 110 a 79 c 75 g 49 t
ORIGIN
1 caaagaccac ccacaatgat ggttgtatgt cagaaggaca caggaaccaa ctgcaagagc
61 tcccaaaaac cacccacaat gatggtggta tgtcagatgg acacaggaac caactgcaag
121 agctcccgaa gaccacccac aatgatggtg gtatgttgaa aggacacagg aaccaactgc
181 aagagctccc aaagaccacc cacgatgatg gttgtgtgtc aaaaggacac aggaaccaac
241 tgtaagagct cctgaagacc acccacaatg atggttttat gtcagaagga tacaggaacc
301 agctgcaagg gct
//
LOCUS HUMGLIBSD 184 bp ds-DNA PRI 13-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region A4.
ACCESSION M34819
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone A4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 184)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 32 54 human zinc finger protein binding site
binding 95 117 human zinc finger protein binding site
binding 158 180 human zinc finger protein binding site
BASE COUNT 67 a 46 c 41 g 30 t
ORIGIN
1 aaaagacaca ggaaccaact gcaagagctc ccaaagacca cccacaatga tggttgtgtg
61 tcaaaaggac acaggaacca actgtaagag ctcctgaaga ccacccacaa tgatggtttt
121 atgtcagaag gatacaggaa ccagctgcaa gagctcctga agaccaccca caatgatggt
181 tgta
//
LOCUS HUMGLIBSE 62 bp ds-DNA PRI 13-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region B1.
ACCESSION M34820
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone B1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 62)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 11 33 human zinc finger protein binding site
BASE COUNT 18 a 26 c 11 g 7 t
ORIGIN
1 aaaccagacg cgtggaccac ccaagacgaa attcacactc gaccctagcc ccgccagccc
61 tt
//
LOCUS HUMGLIBSF 67 bp ds-DNA PRI 13-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region C1.
ACCESSION M34821
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone C1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 67)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 16 29 human zinc finger protein binding site
BASE COUNT 15 a 26 c 15 g 11 t
ORIGIN
1 agacccaggc acacagatag accacccagc ttcaggtggg ggaccaccct tgcatcccct
61 ctctgct
//
LOCUS MLVU3RA 383 bp ss-RNA VRL 13-JUL-1990
DEFINITION Murine leukemia virus CWM-T-15.4 U3 region genomic RNA.
ACCESSION M34722
KEYWORDS U3 region; promoter region.
SOURCE Murine leukemia virus (strain CWM-T-15.4), cDNA to genomic RNA,
from mice (strain CWB).
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 383)
AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs
of CWD mice
JOURNAL Virology 176, 166-177 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
signal 112 178 MuLV promoter region
binding 131 144 CCAAT-binding transcription factor
binding 147 152 Moloney leukemia virus binding protein b
binding 156 167 enhancer core binding protein
binding 350 362 CCAAT-binding transcription factor
binding 363 372 glucocorticoid response element
BASE COUNT 127 a 105 c 84 g 67 t
ORIGIN
1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat cttgcaaggc
61 atggaaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc
121 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccctagc
181 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg
241 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc
301 gcttattgct gcccagctct ataaaaaggg taaaaacccc acactcggcc ggcccagggc
361 caagaacaga tggtccccag aaa
//
LOCUS MLVU3RB 348 bp ss-RNA VRL 13-JUL-1990
DEFINITION Murine leukemia virus CWM-T-15 U3 region genomic RNA.
ACCESSION M34723
KEYWORDS U3 region; promoter region.
SOURCE Murine leukemia virus (strain CWM-T-15), cDNA to genomic RNA, from
mice (strain CWB).
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 348)
AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs
of CWD mice
JOURNAL Virology 176, 166-177 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
signal 77 177 MuLV promoter region
binding 96 109 CCAAT-binding transcription factor
binding 112 117 Moloney leukemia virus binding protein b
binding 121 132 enhancer core binding protein
binding 142 154 CCAAT-binding transcription factor
binding 155 164 glucocorticoid response element
BASE COUNT 115 a 97 c 76 g 60 t
ORIGIN
1 ctgcagtaac gccatcttgc aaggcatgga aaaataccag agctgatgtt ctcagaaaaa
61 caagaacaag gaagtacaga gaggctggaa agtaccggga ctagggccaa acaggatatc
121 tgtggtcaaa cacctgggcc ccggcccagg gccaagaaca gatggtcccc agaaatagct
181 aaaacaacaa cagtttcaag agacccaaaa actgtctcaa ggttccccag atgaccgggg
241 atcaacccca agcctcattt aaactaacca atcagctcgc ttctcgcttc tgtacccgcg
301 cttattgctg cccagctcta taaaaagggt aaaaacccca cactcggc
//
LOCUS MLVU3RC 445 bp ss-RNA VRL 13-JUL-1990
DEFINITION Murine leukemia virus CWM-T-25a U3 region genomic RNA.
ACCESSION M34724
KEYWORDS U3 region; promoter region.
SOURCE Murine leukemia virus (strain CWN-T-25a), cDNA to genomic RNA, from
mice (strain CWB).
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 445)
AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs
of CWD mice
JOURNAL Virology 176, 166-177 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
signal 77 274 MuLV promoter region
binding 96 109 CCAAT-binding transcription factor
binding 168 179 CCAAT-binding transcription factor
binding 238 252 CCAAT-binding transcription factor
binding 138 143 Moloney leukemia virus binding protein b
binding 209 214 Moloney leukemia virus binding protein b
binding 147 158 enhancer core binding protein
binding 218 229 enhancer core binding protein
binding 180 189 glucocorticoid response element
binding 252 263 glucocorticoid response element
BASE COUNT 142 a 126 c 104 g 73 t
ORIGIN
1 ctgcagtaac gccattttgc aaggcatgaa aaagtaccag agctgaagtt ctcagaaaaa
61 caagaacaag gaagtacaga gaggctaaaa agtacccggc ccagggccaa gaacagatgg
121 tcccagaccg ctaacgacag gatatctgtg gttaagcact agggccccgc ccagggccaa
181 gaacagatgg tccccagacc gctaacgaca ggatatctgt ggttaagcac tagggccccg
241 gcccagggcc aagaacagat ggtccccaga aatagctaaa acaacaacag tttcaagaga
301 cgcagaaact gtctcaaggt tccccagatg accggggatc aaccccaagc ctcatttaaa
361 ctaaccaatc agctcgcttc tcgcttctgt acccgcgctt attgctgccc agctctataa
421 aaagggtaag aaccccacac tcggc
//
LOCUS MLVU3RD 409 bp ss-RNA VRL 13-JUL-1990
DEFINITION Murine leukemia virus CWM-T-25b U3 region genomic RNA.
ACCESSION M34725
KEYWORDS U3 region; promoter region.
SOURCE Murine leukemia virus (strain CWN-T-25b), cDNA to genomic RNA, from
mice (strain CWB).
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 409)
AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs
of CWD mice
JOURNAL Virology 176, 166-177 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
signal 112 238 MuLV promoter region
binding 131 144 CCAAT-binding transcription factor
binding 174 179 Moloney leukemia virus binding protein b
binding 183 194 enhancer core binding protein
binding 204 215 CCAAT-binding transcription factor
binding 216 225 glucocorticoid response element
BASE COUNT 133 a 114 c 91 g 71 t
ORIGIN
1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc
61 atgggaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc
121 taaaaggtac ccggcccagg gccaagaaca gatggtcccc agaccgctaa cgacaggata
181 tctgtggtta agcactaggg ccccggccag ggccaagaac agatggtccc cagaaatagc
241 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg
301 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc
361 gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggc
//
LOCUS TTHISOR 6434 bp ds-DNA BCT 13-JUL-1990
DEFINITION T.thermophilus insertion sequences Is1000A and Is1000B.
ACCESSION M33159
KEYWORDS insertion sequence.
SOURCE T.thermophilus (strain HB8, ATCC 27634), clones pNZ1557 and 1559.
ORGANISM Thermus thermophilus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 6434)
AUTHORS Ashby,M.K. and Bergquist,P.L.
TITLE Cloning and sequence of IS1000 from Thermus thermophilus HB8
JOURNAL Plasmid (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.K.Ashby, 22-MAR-1990.
FEATURES from to/span description
pept 666 106 (c) ORFA
pept < 706 1347 ORFB
pept < 1251 1961 ORFC (AA at 1251)
pept < 2007 3026 ORFD (AA at 2007)
pept 3711 4664 ORF 1A
pept 5173 6126 ORF 1B
trns 3516 4711 IS1000A
trns 4978 6173 IS1000B
BASE COUNT 1060 a 2367 c 2037 g 970 t
ORIGIN
1 ctcgccgccc tcctcctcgg cgggggccaa gcccacctgc tcttcgccct cgccctcagc
61 ctcggcctcc tttggccccc ccggtgagct cggccaggac cgccttcaag gccctttcca
121 ggcctcgagg ctccccgtgt tctccaagac ccaggtcgcc cgcttgcgct tctcctcctc
181 gggcatctgg gcccgctccc gggccaggac ctcctcccgg gaaagccccg agcgcgccat
241 cacccgcctc acccgctcct ccaggggggc cgccaccaag agggtcccgt ggagcctccc
301 ctcccacccc ttctcaaaga ggagggggat ctccagaaag acgagggggg cctctaggcg
361 ggaaagctcc tccataagaa gccgcctgac ctccgggtgg accacggcct cgagggcctt
421 gagcctctcc gggtcggaaa agacgagccg ggcgagggcc ctccggtcca gcctcccccc
481 caccaccgcc tcggggaaga ggcgcttcag ctcctcctcc ttgttctccc gggcccgggc
541 ggcgagcgcg tccaggtcca agacggggta gccccaggac ctgaggaggg cggccacggt
601 gctcttgccg ctgccgatgt tcccggtgat gccgataata atggggtgct tcgcctcgtg
661 gcccatatgg acttcccctt ttataccccc aaggacgcct tccccgtggg cggggcggtg
721 cgggacctcc tcctgggaag aaggccaacg gacctggact acgccgcctt ggaccccgaa
781 ggggcggccg aggaggcaaa gaggcgcctc gggggaagcc tcttccccct ggaccccaag
841 cggggccact accgcctggt ggtgggggaa aggaccctgg acttcacccc cttggagggg
901 aggctggagg aggacctcct ccggcgggac taccgggtga acgccctcct ctggaagggg
961 ggcgccgtct tcggcctaaa gggggtggag gaggacctga ggcggcgcct cctcgtcccc
1021 gtgcgggagg aaaacctcta ccaggaccac ctgcgaagcc tccggggggt ccgcctcgcc
1081 gccaccttgg gcttcgggct tccccagagg acccgggagg cccttggccg ccacgcccgg
1141 ttcctccagg cccaccccga ggccctcccc gcgcgggaaa gggtcaagga ggagcttgcc
1201 cggctcctcc tctccccgag ggcggcttcg gcctgcgcct tctggagagg gtggggctcc
1261 ttggggtcta cctccccgag ctcgccctcc tcgtgggcct ccatcagggc ggggtgcacc
1321 accttcccgc ctgggaccac accctgagcg cggtcttcca cctcctctgg ctctggcccg
1381 aggcccccct cgaggcccgc ctcgccgccc tcttccacga cgtgggcaag cccctcaccc
1441 gccgctttga ccccgaggtg ggccgcttcc gcttcctggg gcacgccgag gtgggggcgg
1501 agatcgcccg ggcaagcctc ttttggctcc gcttccccaa ggaggtggtg gagcgggcgg
1561 cgggcctggt gcgccgccac atggaccgcc tccccgagga gaggaaggcc ctgcgccgct
1621 tcttcctgag gcgccaggac ctcctcccgg acctcgtcta cctcatggcg gcggaccgcc
1681 tggccacgag gggcgtggag cgcgaggcct gggaggtgct cgggcgctac gaagaggtcc
1741 tgaaagaccc cctgccgcaa aggcccctcc tctccgggga ggaggtgatg gcccttctgg
1801 gcctccaaga aggccccgag gtgggaaggg ccctaaaggc cctcctcgag gcccaggccg
1861 aaggccgggt ggggaccaag gaggaggccc gggcctttct cctatattgg aggggtggaa
1921 gggaggctca ggcttcggga accccagatc acccccattg aggggggctt cctggtctcc
1981 gaccctacgg ggtctacgag aagcccctgg ccctcaccga gggcgggctt ttcctcctct
2041 ccctcatgga ggggaggacc ctggaggagg tgcaggagga ggtgttcaag cgccacgggg
2101 tcctggtgcc gaagaaggag ctggaagacc tggcgaaggc cctggaggag gcgggcctcc
2161 tcctcacgga gaaggtggag gcgaggctca aagaggagga ggaaaagctc aagcgggagc
2221 ggcccatgcg cctcgccggg ctctcctatc ccgagggcga gcgggaggcc cgggccttcc
2281 tcgaggcctt ccgggcgagc tacccggggg aaggcgagga ggcccgggtc ctcctcatgc
2341 cccacctgga gccctcccgc gtccccgagg tctacggggc ggccctcgcc gccctggaga
2401 agaccccgcc ccctgagcgc atctacctgg tgggggtcgc ccacaggccg ctgaaggaga
2461 aggccgccgc ccttcccgtg cccttccaga cccccttcgg ccccgccctg ccggaccttc
2521 ccgccctcca ggccttagac gccctccttc cctttgagct cttcaacacg cccctcgcct
2581 tccgggagga gcacagcctg gagcttcccc tcttcttcct gaaggggcgg ttccccgagg
2641 cccgcgtcct ccccctcctc gtggccaggc gaagcccgga gctcggggag gccctgaagg
2701 tggtcctgcg ggacttcccg ggccttctcg tcctcgccgt ggacctctcc cacgtggggc
2761 cccgcttcgg ggacaccccc ctcacccgca ccctggcgga ggaggcgagg aggcgggacc
2821 tgggcttcct ggaaaggctc gcggaagggg agccggaggc cgccctcgcc ttcctggggg
2881 ccaaccccac ccgcatagac ggggtggagg tggtggcaag cctcctcccc ctcctccggg
2941 aaaggaaggg gaaggtcctg gcccaccgcc tggacctcga ggcccccacc ctaagcgccg
3001 tgggggcggg caccctggtc ctctaaagag ccccagattt ggaacgacgc gaaggcccgg
3061 gcgccgaacc gcccgggccc cttttctggg agcgcccctt tgggaaaggg cccggaaggt
3121 gggtgggcag gcgcgccggg gcctgcgtga aagcctccta gcgcgagggg tacaggcctt
3181 cacccacgac aaaggggtcg cttgatgggc aagacagtaa tccccttacg gggctcaatc
3241 ccttgcaacc cttaggagct acctttttgc gcttcccgtg acgagcgtcg caatcccctt
3301 acggggctca atcccttgca acttccttct aaggaggaat gggcatcatt cacaatcagc
3361 aagtcgcaat ccccttacgg ggctcaatcc cttgcaacgg attcgcctcg gcccgcctag
3421 ggactggatc tatgaccccg tcgcaatccc cttacggggc tcaatccctt gcaaccattg
3481 gtggaggcgc gtgataagcg agtggaggtg aggtggtcgc atgtatcttg ccccctgagg
3541 gaggccccac cgccccggtt gccgggccac cacgcccgcg ggattgatcg gtagccaccc
3601 accccggcct ccccgccgca cagtatcccg gtcgtcaaag accgcatcca catggaaggc
3661 ccacagggtc gggtggccac ctccctcagg ttaccaaaag gagagagccc atgaccttcg
3721 ccggcattga cgtcagcaaa acccacctgg acctggccct cgtctccaac tcccccaaac
3781 ccacccgcct ccgcttcccc aactcccctg aaggccgtca agccctcctc gccgccctcg
3841 cccaccacaa ccccgcctgg gtcgccctgg agcccacggg cgcttaccac ctccccctcc
3901 tcaagctcct ggcagaaaac cgcctccagg tggccctggt caacccctac catctcgccg
3961 ccttccgcaa ggccaaggga gaacgccaga agaccgaccg ccaagacgcc ctcctcctcg
4021 cccgctacgc ccaggtctac cacggggagc tctgggccta caccctcccc ccagaagccc
4081 tccgtgagct caaagccctg gtgggctacc gggaggactt ggccgggcgg gaaagggcca
4141 tcctcaacca gatggaggcg gcggagtggg cggggagcaa ggaggtcctc gccctcctcc
4201 aaaaggagct ggcctgcgtg aaggggcttc tcggggaggt ggaggccagg atccaggccc
4261 tcctcgccac cctccccgag gccgaggtcc tgatggccct gcccggggtg gggccccagg
4321 tggcggcagc ggtgctggcc ctcctgcccc cagagctctg gggccgggcg aagagggcgg
4381 cctcctacgc ggggctcatc cccgagcggg aggagtcggg aaagagcgtg gagaggagtc
4441 ggctctccaa aaaagggcct cccctcctgc ggcgaaagct ctacatgggc gccctggtgg
4501 cggtgcgcca tgacccggag atgcgggcct tctaccaccg cctgctctcg cggggaaaga
4561 gaaagaagca ggcgttggtg gccgtggccc acaagctcct caggcggatg atgggaaggc
4621 tcagggagta ctacgcgacc cagctagatc aaggggtcgc ttgacaggca agacagtatc
4681 aatcccctta cggggctcaa tcccttgcaa ctccgacggc aagatcatcg gggtgcggcg
4741 gctcctcgcc gtcgcaatcc ccttacgggg ctcaatccct tgcaacgcca gtctatgacc
4801 aagcccctag gctccttcac tttgaagtgt cgcaatcccc ttacggggct caatcccttg
4861 caacacctca tgggggagta agataggcgg gaaaggaggg aggtcgcaat ccccttacgg
4921 ggctcaatcc cttgcaacca cgtgaagacg tccctggcct cggtcgtgag ggacgtggtc
4981 gcatgtatct tgccccctga gggaggcccc accgccccgg ttgccgggcc accacgcccg
5041 cgggattgat cggtagccac ccaccccggc ctccccgccg cacagtatcc cggtcgtcaa
5101 agaccgcatc cacatggaag gcccacaggg tcgggtggcc acctccctca ggttaccaaa
5161 aggagagagc ccatgacctt cgccggcatt gacgtcagca aaacccacct ggacctggcc
5221 ctcgtctcca actcccccaa acccacccgc ctccgcttcc ccaactcccc tgaaggccgt
5281 caagccctcc tcgccgccct cgcccaccac aaccccgcct gggtcgccct ggagcccacc
5341 agcgcctacc acctccccct cctcaagctc ctggcagaaa accgcctcca ggtggccctg
5401 gtcaacccct accacctcgc cgccttccgc aaggccaagg gagaacgcca gaagaccgac
5461 cgccaagacg ccctcctcct cgcccgctac gcccaggtct accacgaaga cctccgggcc
5521 tacaccctac ccccagaaac cctccgggag ctcaaagccc tggtgggcta ccgggaggac
5581 ttggccgggc gggaaaggac catcctcaac cagatggagg cggcggagtg ggcggggagc
5641 aaggaggtcc tcgccctcct ccagaaggag ctggcctgcg tgaaggggct tctcggggag
5701 gtggaggcca ggatccaggc cctcctcgcc accctccccg aggccgaggt cctgatggcc
5761 ctgcccgggg tggggcccca ggtggcggca gcggtgctgg ccctcctgcc cccagagctc
5821 tggggccggg cgaagagggc ggcctcctac gcggggctca tccccgagcg ggaggagtcg
5881 ggaaagagcg tggagaggag tcggctctcc aaaaaagggc ctcccctcct gcggcgaaag
5941 ctctacatgg gcgccctggt ggcggtgcgc catgacccgg agatgcgggc cttctaccac
6001 cgcctgctct cgcggggaaa gagaaagaag caggcgttgg tggccgtggc ccacaagctc
6061 ctcaggcgga tgatgggaag gctcagggag tactacgcaa cccagctaga tcaaggggtc
6121 gcttgacagg caagacagta tcaatcccct tacggggctc aatcccttgc aacgaacctc
6181 ctcgggccca ccgagggggt aatcctcgca gagtcgcaat ccccttacgg ggctcaatcc
6241 cttgcaacct gaaccagaag gccctctggg cggcggcgta cgccgtggcg tcgcaatccc
6301 cttacggggc tcaatccctt gcaacgcgat tggcgtcgcc ctccccgtta tcgcttacga
6361 gtcgcatgta tcttgccccc tgagggaggc cccaccgccc cggttgccgg gccaccacgc
6421 ccgcgggatt gatc
//
LOCUS MLUDNAA 4171 bp ds-DNA BCT 13-JUL-1990
DEFINITION M.luteus ribonuclease P (rnpA), 50S ribosomal subunit protein L34
(rpmH), DNA biosynthesis initiation protein (dnaA), and DNA
polymerase III beta subunit (dnaN) genes, complete cds.
ACCESSION M34006
KEYWORDS 50S ribosomal subunit protein L34;
DNA biosynthesis initiation protein;
DNA polymerase III beta subunit; ribonuclease P.
SOURCE M.luteus (strain IF03333) DNA.
ORGANISM Micrococcus luteus
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Micrococcaceae.
REFERENCE 1 (bases 1 to 4171)
AUTHORS Fujita,M.Q., Yoshikawa,H. and Ogasawara,N.
TITLE Structure of the dnaA region of Micrococcus luteus: Conservation
and variations among eubacteria
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by N.Ogasawara, 05-MAY-1990.
FEATURES from to/span description
pept 435 37 (c) ribonuclease P (rnpA)
pept 601 464 (c) 50S ribosomal subunit protein L34 (rpmH)
pept 944 2491 DNA biosynthesis initiation protein (dnaA)
pept 3242 > 4171 DNA polymerase III beta subunit (dnaN)
signal 843 848 dnaA -35 signal
signal 866 871 dnaA -10 signal
BASE COUNT 672 a 1558 c 1361 g 580 t
ORIGIN
1 ctcgagcacg acgaagggcg agggcgtgac ggtcatctac gcgtgctccg aggcggcggg
61 caggtgcggc ttcagggcct tgccgagcgc gccgacggtc tcgcggcgca gcagcgcgta
121 gtccgcctcg gcggcggcag gcagcgcgcg gacctgcacc agaaccggca ggtcgcgcag
181 cgggggcagg cgcatctgct ccgcgacgac ggcgcgcagc cgccgcttca cccggttgcg
241 ggtcaccgcg ttccccacgg ccttggacac gacgaacccg gcccgcggcc gaggggcgct
301 cggcgacgtg gaccgggtct gatcggggtc cgtggcgacg ctcaccacga cggtgcgccg
361 acccgcgcgg gtgccggtgc gaccgaggtg acggaactcc gcgggcgtgc ggacgcgacg
421 gtcacggggc agcacggggc gcgccgtccc tggaccgagg tcctcaggcg gacagctcgg
481 cgcggttctt gccgcggcga gcggacagga tggcgcggcc ggcgcgggtg cgcatgcggg
541 cacggaagcc gtgcttgcgg gcacgacggc ggttgttggg ctggaacgtg cgcttggtca
601 ctgtgatctc tccacgacgt gttgtgtgcc ttcgaccgtc gacctgagca ggtggaccag
661 ggggaaggac cggtgaggct ctgacatgaa atccccggcg ggcacgatca accgacaccc
721 gggggaccag acaactctag tggccggggt acgcaggggt caaacgggtg cgcgcggatc
781 ccggacgccc gtgggcgccc gggacggtcg tccacaggat gataggggtt gtccaccgtc
841 tgtgcacaac cctgtggaca accgttagac tccgagcacc gtcgtccgtc tcgcgcagcc
901 gcccccgtcc ccggggacgt ctgccccagg aaaggaagcc tccgtggtgg cagaccaggc
961 cgtgctcagc tcgtggcgct ccgtcgtggg ctccctcgag gacgacgcgc gggtcagcgc
1021 ccgcctcatg ggcttcgtct acctggccca gccgcagggc ctcatcggca atacgctcct
1081 gctggccgtg ccgaacgaga ccacccgcga gacgctccag ggcacccagg tggccgacgc
1141 cctcacggac gccctgaccc aggagttccg cgaggagatc ctgctggcca tctcgatcga
1201 cgcgaacctg cagccgccgc ggaccccctc ctccgaggcc cgccgctcct ccctcgccgg
1261 cgggccgtcc ggcgccgcgg cgccggacgt cgagctgccc ccggccgcga ccgccgccac
1321 ctcccgccgc gccgtcgccg aggagctgcc gggtttccgc atcgagccgc cggccgacgt
1381 cgtgcccgcc gcgaacgcgg ccccgaacgg gaacggcaag ccgacccccg cgccgccgtc
1441 gacgtccgcg gagaccagcc gcctcaacga ccgctaccac ttcgagacgt tcgtgatcgg
1501 ctcgtccaac cggttcgcgc atgcggccgc gaacgcggtg gccgaggcgc cggcgaaggc
1561 gtacaacccg ctgttcatct acggcgagtc cggcctgggc aagacgcacc tgctgcacgc
1621 gatcgggcac tacgcccgcc gcctctaccc gggcttgcgg gtgcggtacg tgaactcgga
1681 ggagttcacc aacgacttca tcaactccat ccgccacgac gagggcgcct cgttcaagca
1741 ggtctaccgc aacgtggaca tcctgctgat cgacgacatc cagttcctgg cggacaagga
1801 ggcgacggtc gaggagttct tccacacctt caacacgctc tacaacaaca acaagcaggt
1861 ggtcatcacc tcggacctgc cgcccaagca gctctccggg ttcgaggacc ggctgcgctc
1921 ccgcttcgag tgggggctga tcacggacat ccagccgccg gacctcgaga cgcgcatcgc
1981 gatcctccgg aagaaggcgg aggccgaggg gctcgtggcc ccgccggagg cgctggagta
2041 catcgcctcg cgcatctcca cgaacatccg cgagctcgag ggtgcgctga tccgcgtgac
2101 cgcgttcgcc tcgctcaacc ggcagaccgt ggacatcgag ctggccgagc atgtgctcaa
2161 ggacctgatc acggacgaga cggcgcacga gatcacgccg gagctgatcc tgcacgccac
2221 gggggagtac ttcaatctca ccctcgagga gctgaccagc aagtcccgca cccgcacgct
2281 ggtgacggcg cggcagatcg ccatgtacct gctgcgcgag ctgaccgaga tgtcgctgcc
2341 caagatcgga caggtcctgg gcggccgcga ccacaccacc gtcatccacg cggaccgcaa
2401 gatccgcgag ctgatggccg agcggcggac gatctacaac caggtcaccg agctcaccaa
2461 cgagatcaag cggaaacagc gcggcgcctg agccacccgc cgtcccaccc tcgagtcccg
2521 gttcgaacct tcgaccggga ctcgagcctt cgtccacatt gtccaccggt gtggacaact
2581 ctgtggagaa cccccgtcgc tgtggacaac ctccgcccct gtggatggcg gaatcaggcc
2641 gctcacgccg cgtccacacc cctgggggcg acgacggtgc acaggctggt gaatcacacg
2701 gatgccattc ggcggtccac acgtatccac agatctccac agccgaaaac cgcatgatct
2761 cgcgggttcc gacgggtggg gacgcaccct gtggacagga atgtgggcaa ctggggggtg
2821 gatccgggga cgagtctgga caactccggt cgaacctgtg cagagcaggg ggaggacctc
2881 cggggctcgt ccacaccgga tgcacacccc gaacgagcgg gcgtccacac ggttccccca
2941 accggatccg catgctgacg cggcagaacg gcgttctcca caggatccac acccgctacc
3001 actccttctg atcccccttc agagattccg ccaagcaaca tcatctccgc cgcctcgcac
3061 cgagctgagg gccccgagag cccgggctgc gcccgagcac tcagaatcct tggaatcaca
3121 cgcttgtcat tcgcttccga cctgacaccc gccgcgcgga ccgcctatgc tgaggcccgc
3181 gtgtcgccgt tcccccgggc ccgcaacccc cgtcatcacc cccagagagg cagtgcagct
3241 cgtgaagttc accgtcgaac gcgacatcct caccgacgcc gtctcctggg ccgcccgctc
3301 cctgtccccg cgcccgccgg tgccggtgct ctcgggtctg ctgatcacgg ccgaggccgg
3361 cgtcgtgtcc atcgcgagct tcgactacga gacctccgcg cgcctggaga tcgaggcgga
3421 cgtcgagacc gcgggccagg tgctcgtctc cggccgtctg ctgaacgaca tcgtccgctc
3481 cctgccgcag gcccaggtca cggtggagct cgacggcggc aaggtgctcg tcacctgccg
3541 atcctcccgc ttctccctgg ccacgatgcc cgtgggcgac tacccggccc tgcccgagct
3601 gccggccccc gcaggcaccg tggacggggc cgcgttcgcc cacgccgtcg cccaggtcac
3661 cgtcgccgcc tccaaggacg acaccctgcc gatcctcacc gccgtgaagg tggagatcga
3721 gggcgacacc ataaccttcc tggccacgga ccgctaccgg ctggccatga aggagatccg
3781 ctggaccccg gcggacccgt ccatctcgac gtcgctgctc atcaaggcgc gcacgctcac
3841 cgaggtcgcc aagtccctcg gctccggcgg cgacctggag atcctcctcg gtcagaccgc
3901 cgacctcgtg ggcttcgcct ccggtggccg ccgcaccacc tccgtgctcg tggacggcga
3961 gtaccccaag atccgctcgc tcttccccga gtccagcccc atccaggccg tggtggacac
4021 cgccgccctg gtcgaggcgt cccgccgcgt ggccctcgtg gccgagcgca acaccgccct
4081 gcgcatggtg ttcacggagg gccaggtcac cctggacgcc ggcaccggcg atgacgcgag
4141 cgccaacgag tccgtgccgt gcacgctcga g
//
LOCUS CE1CG 6646 bp ds-DNA circular BCT 13-JUL-1990
DEFINITION Plasmid ColE1, complete genome.
ACCESSION J01566 M33100
KEYWORDS cea gene; colicin; colicin E1; colicin E1 immunity;
complete genome; kil gene; origin of replication.
SOURCE Plasmid ColE1 DNA.
ORGANISM Plasmid Colicin E1
Prokaryota; Bacteria.
REFERENCE 1 (bases 1008 to 1370)
AUTHORS Tomizawa,J.-I., Ohmori,H. and Bird,R.E.
TITLE Origin of replication of colicin E1 plasmid DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 74, 1865-1869 (1977)
STANDARD full staff_review
REFERENCE 2 (bases 1410 to 1599)
AUTHORS Bastia,D.
TITLE Determination of restriction sites and the nucleotide sequence
surrounding the relaxation site of ColE1
JOURNAL J. Mol. Biol. 124, 601-639 (1978)
STANDARD full staff_review
REFERENCE 3 (bases 1 to 1613; 6577 to 6646)
AUTHORS Oka,A., Nomura,N., Morita,M., Sugisaki,H., Sugimoto,K. and
Takanami,M.
TITLE Nucleotide sequence of small ColE1 derivatives: Structure of the
regions essential for autonomous replication and colicin E1
immunity
JOURNAL Mol. Gen. Genet. 172, 151-159 (1979)
STANDARD full staff_review
REFERENCE 4 (bases 623 to 832)
AUTHORS Morita,M. and Oka,A.
TITLE The structure of a transcriptional unit on colicin E1 plasmid
JOURNAL Eur. J. Biochem. 97, 435-443 (1979)
STANDARD full staff_review
REFERENCE 5 (bases 1 to 192; 384 to 519; 6644 to 6646)
AUTHORS Patient,R.K.
TITLE Characterization of in vitro transcription initiation and
termination sites in Col E1 DNA
JOURNAL Nucleic Acids Res. 6, 2647-2665 (1979)
STANDARD full staff_review
REFERENCE 6 (bases 4783 to 5353)
AUTHORS Ebina,Y., Kishi,F., Miki,T., Kagamiyama,H., Nakazawa,T. and
Nakazawa,A.
TITLE The nucleotide sequence surrounding the promoter region of colicin
E1 gene
JOURNAL Gene 15, 119-126 (1981)
STANDARD full staff_review
REFERENCE 7 (bases 1 to 6646)
AUTHORS Chan,P.T., Ohmori,H., Tomizawa,J.-I. and Lebowitz,J.
TITLE Nucleotide sequence and gene organization of ColE1 DNA
JOURNAL J. Biol. Chem. 260, 8925-8935 (1985)
STANDARD full staff_review
REFERENCE 8 (bases 57 to 123)
AUTHORS Zhiang,S., Yan,L. and Zubay,G.
TITLE Regulation of gene expression in plasmid ColE1: Delayed expression
of the kil gene
JOURNAL J. Bacteriol. 170, 5460-5467 (1988)
STANDARD simple staff_entry
REFERENCE 9 (bases 639 to 753)
AUTHORS Tomizawa,J.-I.
TITLE Control of ColE1 plasmid replication: The process of binding of RNA
I to the primer transcript
JOURNAL Cell 38, 861-870 (1984)
STANDARD full staff_review
REFERENCE 10 (bases 5046 to 5110)
AUTHORS Parker,R.C.
TITLE Mitomycin C-induced bidirectional transcription from the colicin E1
promoter region in plasmid ColE1
JOURNAL Biochim. Biophys. Acta 868, 39-44 (1986)
STANDARD full staff_entry
COMMENT The L (light) strand is shown. This sequence was kindly supplied
over computer network by J.Lebowitz (04-APR-1985). The open
reading frames noted by [7] are named in FEATURES by the molecular
mass of their predicted products or by the names given by [7].
FEATURES from to/span description
pept 400 59 (c) colicin E1 immunity protein (imm)
pept 448 585 kil protein
pept 1609 1746 mob1 protein (gtg start codon)
pept 1803 1612 (c) RNA I inhibition modulator protein (rom; gtg
start codon)
pept 4362 3943 (c) entry exclusion protein 2 (exc2)
pept 4862 4443 (c) entry exclusion protein 1 (exc1)
pept 5140 62 colicin E1 protein (cea)
RNA 641 > 641 II RNA
RNA 751 645 (c) I RNA ((3' end +/- 1 bp) [4]
mRNA 1449 > 3936 mob mRNA (5' end putative) [5],[7]
mRNA 5056 > 585 colE1-kil mRNA [7]
mRNA 409 < 59 (c) imm mRNA [5],[7]
mRNA 4887 < 3943 (c) exc mRNA [5],[7]
binding 5063 5093 LexA binding site [7]
ORF 605 895 10.2 kd ORF [7] (ttg start codon)
ORF 1241 870 (c) 13.8 kd ORF [7] (gtg start codon)
ORF 1189 1350 6.0 kd ORF [7]
ORF 1408 1238 (c) 6.4 kd ORF [7]
ORF 1347 1526 6.9 kd ORF [7] (gtg start codon)
ORF 1709 1500 (c) 7.6 kd ORF [7]
ORF 1609 1746 mob1 ORF (gtg start codon)
ORF 1843 2190 mob2 ORF
ORF 2180 3733 mob3 ORF
ORF 2214 2399 mob4 ORF (gtg start codon)
ORF 2868 3386 mob6 ORF
ORF 3393 3626 mob7 ORF (gtg start codon)
ORF 3657 3818 mob8 ORF
ORF 3766 3936 mob9 ORF (gtg start codon)
ORF 4258 4647 14.5 kd ORF [7]
ORF 5435 5653 8.8 kd ORF [7]
ORF 5852 6013 6.7 kd ORF [7]
conflict 83 84 aa in [3],[7]; a in [5]
conflict 99 100 aa in [3],[7]; a in [5]
conflict 135 135 a in [3],[7]; g in [5]
conflict 144 144 a in [3],[7]; g in [5]
conflict 174 174 a in [3],[7]; g in [5]
conflict 484 485 aa in [3],[7]; a in [5]
conflict 491 492 tt in [3],[7]; t in [5]
conflict 506 512 ctaacta in [3],[7]; caca in [5]
conflict 517 519 cct in [3],[7]; cgg in [5]
conflict 528 528 c in [7]; g in [3]
conflict 652 653 aa in [3],[7]; aaa in [4]
conflict 1111 1111 g in [7]; t in [1],[3]
orgrpl 1197 1197 replication origin (+/- 1 bp) [1]
conflict 1366 1366 a in [3],[7]; g in [1]
conflict 1416 1417 cc in [3],[7]; c in [2]
conflict 1454 1454 c in [3],[7]; t in [2]
cutss 1466 1465 (c) relaxation (nic) cut site [2]
conflict 1581 1582 aa in [3],[7]; a in [2]
conflict 4784 4787 ctcc in [7]; cgc in [6]
BASE COUNT 1760 a 1421 c 1798 g 1667 t
ORIGIN 2 bp downstream of EcoRI site.
1 ttctatgctc ctatattgat aagaataaac ttaatactat aaatgaggtg ttagggattt
61 aattattctt tattgatata aaaagtccta gcaatccaaa tgggattgct aggaccaaac
121 aaagtagatt atatagcata aataggttta attttgctac gggggcgtta tttaggtttt
181 ttcttctttc gaaaaaatct ttctttatga agttaaaagc tatgtattca atagcatatt
241 ttgaatatgg acatagaata gtgcttatca ctattgcata tagcatctta tctgacacaa
301 ggaaataata cccttcgctg ttttttgtta taaggtatat atatataagt gtgcagtaca
361 ggccaaataa aatatttttt atgtagtatc ttaagctcat aaattaaacc tcgccatata
421 ttcttttcat tttataagga tcgagttatg aggaaaagat tttttgtggg aatattcgcg
481 ataaacctcc ttgttggatg tcaggctaac tatatacctg atgttcaggg agggaccatc
541 gcaccatcct cctcttctaa actgacgggg atcgcggttc agtagaaaag attaaaggat
601 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc
661 taccaacggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg
721 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag tcgggccact
781 acttcaagaa ctctgtagca ccgtttgtgc catcatcgct ctgctaatcc ggttaccagt
841 ggctgctgcc agtggcgtta aggcgtgcct taccgggttg gactcaagac gatagttacc
901 ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg
961 aacgacctac accgaactga gataccaaca gcgtgagcta tgagaaagcg ccacgcttcc
1021 cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac
1081 gagggagctt ccagggggaa acgcctggta gctttatagt cctgtcgggt ttcgccacct
1141 ctgacttgag cgtctatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc
1201 ctgctacgtg gccttcttcc tgttcctggt cttttgctca catgttcttt ccggccttat
1261 cccctgattc tgtggataac tgtgttaccg tttttgtgtg agtcagtacc gctcgccgca
1321 gtcgaacgac cgagcgtagc gagtcagtga gcgaggaagc ggaaaagcgc ctggacgtgc
1381 attttctcct tacgcatctg tgcggcattt cacacccggc atggcgtact tttcatacaa
1441 tccgcactga tgccgcatgg ttaagccagt atacactccg ctatcgctac gtgactgggt
1501 cagggctgcg ccccgacacc cgctaaaacc tgctgacgcg ccctgacggg cttgtcagct
1561 cccggcatcc gctcacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt
1621 ttcaccgtca tccccgaaac gtgcgaggca gctgcggtaa agctcatcgg cgtggtcgtg
1681 aagcgattca caaatatcgg cctgttcatc tgcgtccagt tcgttgagct tctccagcag
1741 cgttaatgtc tggcttctga taaagcgggc catgttaagg gcggtttttt cctgtttagt
1801 cactgatgcc tccgtgtaag ggggatttct gttcatgggg taatgatacc gatgaaacgc
1861 gagaggatgc tcacaatacg ggttactgat gatgaacatg cccggttact tgaacgctgt
1921 gagggtaaac aactggcggt atggatgcgg cgggtctgcc tgggggagcc ggttgcccgt
1981 tccggaaaac tgccgacact ggcaccgccg ttactgcgtc agctggccgc catcggaaat
2041 aacctgaatc agacagcccg taaggtgaac agcgggcagt ggtcttccgg tgaccgggtt
2101 caggtggtgg ccgcactgat ggccatcggg gatgagctgc gccggctgcg tctggctgtc
2161 agggaacagg gggcgcggga tgatagttaa atttcatgcc aggggaaaag gtggtggcag
2221 tggtccggtt gattacctgc tggggaggga gcgtaaccgc gaaggcgcaa cggtgcttca
2281 gggtataccg gaagaagtcc gggaactcat cgatgccacg ccatttgcga agaaatacac
2341 gtccggtgtt ctgtcgttcg cggagaagga gctgccgccg ggaggacgtg aaaaagtgat
2401 ggcgagcttt gagcgtgtac tgatgcccgg tctcgaaaaa aatcagtaca gcatcctgtg
2461 ggtggagcac caggacaagg gacggcttga gctgaatttt gtcattccga acatggagct
2521 acagaccgga aaacgcctcc agccgtacta cgaccgcgca gacaggccta gaattgatgc
2581 ttggcagacg ctgttaaatc accattacgg gctgcatgac ccgaacgccc cggagaaccg
2641 caggacgctg acactccctg ataacctgcc tgaaacgaaa caggcgcttg ctgagggcgt
2701 cacgcgaggt atagatgcac tttaccatgc cggagagata aaaggccgtc aggatgtgat
2761 tcaggcgctc actgaggcgg ggctggaagt ggtcagggtg acgcgaagca gtatcagcat
2821 tgcagatccg aacggcggga agaatatcag gctgaaagga gcattttatg agcaatcttt
2881 tgcagacggg cgcggagttc gagaaaaagc tgaaagagag agccgaatct acagagaaaa
2941 tgctgaacaa cgagttcagg aggctcggcg aatctgtaag cgaggctgtg acatcaaacg
3001 agacgaaaat cagagacgct atagccctgt tcacagcctc gacagaggaa tcgctggaaa
3061 aacaccggga aggggtgaaa gaggcgatga tgcagcacag gagggacgtg ttaaagctgg
3121 cagggaatac gggcatgatg ttactgggga tagtctttct cctgtttacc gcgagtggcg
3181 ggacgctctg gtatcttgga gggaggatac aggcgaacct ggaagaaatc aggaagcagg
3241 aagagacatt gcagaaactg aacgcgaaga catggggcgt ggagtttgtg caggacggga
3301 acaggaaatt ccttgtcctt ccgtacggga aatcagcgga ggtgattccc tttcagggga
3361 aagagtgggt acatctgaag gagtgacaca gagtgacaga gctggaaaca catttgctga
3421 acgccttaga gcagctgcaa caggactata tgcagcggct gagcgaatgg gagagcgcct
3481 tcgtggaatt gcagaagatg ttttcgctta cgcaacggga caacgcgatg ctgaacgagc
3541 gggtcatgca gttgagtcag caggtgcaac acttgagcga gcagacagaa cgcttgagcc
3601 agttatacag cgagaactgg agataagaga ggaacggctg atacaggagc gcgaacatgt
3661 gttatcccta gaacgggagc gtcagccgga aatacaggaa cgcacgctgg atggcccttc
3721 gctgggatgg tgaaaccatg aaaaatggca gcttcagtgg attaagtggg ggtaatgtgg
3781 cctgtaccct ctggttgcat aggtattcat acggttaaaa tttatcaggc gcgatcgcgc
3841 agtttttagg gtggtttgtt gccattttta cctgtctgct gccgtgatcg cgctgaacgc
3901 gttttagcgg tgcgtacaat taagggatta tggtaaatcc acttactgtc tgccctcgta
3961 gccatcgaga taaaccgcag aaatcgtgtc agccagcagc ctggattttt tcggggtaag
4021 gtctttgctg cggtaatcag tacccgcaaa ttgagttgtg ccccggagtg agtttaaaaa
4081 ttcttggctg tgaaactgct gaatgcgttg ttgcacatca ctccgtgatt ttcctgtcgc
4141 tttgtcttta gctcccatat cccagaactg ccgaaaaaaa ggcaccatca ttcgtgttgt
4201 gtcggctttt tgggtggaaa agttgggatc aaaaccgtca tcagaagcat atacgtaatg
4261 ccgggtatgt tgttccacgc tgccctttgg ggtacaggct gtgagagcta ctgttgctgt
4321 tatcatggcg ataagacaaa gtttgtttcc ggtacgtctc atgagtgtct ccctacctta
4381 aagtatttat tctcggtgag ttattatccg aagactattt tatcatcagt ctctgacagc
4441 gttcagcgac cgaacaacct ggaccagaat ccccgtttct tttccagttc cagtgcctgc
4501 ctgagctggg ctatctcatt ttgtagctgt tcccgttctg cttcctggcg tctgcgatcc
4561 atatcctgtg cctgtttatc ctcaagcatc agcgtcaggc attgtttcag ctcattcagt
4621 tcccggagaa tgcgttctgt ctgctgatca tgtggatttt ctgcatgtcc ctcactgtgc
4681 ctttctggtg tctcattctg ctttaattcg ccgtatgccc ggatcagttc actggtttca
4741 aactcccgtc gtccgtccct gccggtacgg taactgacaa ggcctccgcg catatcacgg
4801 taaaactgtg aacgcgatct gcctgtcatt tttagtgcgt cccggagtgt gtgccatgcc
4861 ataaagtgac agtgtcccat agatgtctca tctcatagtt tcagtaaaac ataatgaggt
4921 ctgagaacgg taatgtttgt gctggttttt gtggcatcgg gcgagaatag cgcgtggtgt
4981 gaaagactgt ttttttgatc gttttcacaa aaatggaagt ccacagtctt gacagggaaa
5041 atgcagcggc gtagctttta tgctgtatat aaaaccagtg gttatatgta cagtatttat
5101 ttttaactta ttgttttaaa agtcaaagag gattttataa tggaaaccgc ggtagcgtac
5161 tataaagatg gtgttcctta tgatgataag ggacaggtaa ttattactct tttgaatggt
5221 actcctgacg ggagtggctc tggcggcgga ggtggaaaag gaggcagtaa aagtgaaagt
5281 tctgcagcta ttcatgcaac tgctaaatgg tctactgctc aattaaagaa aacacaggca
5341 gagcaggctg cccgggcaaa agctgcagcg gaagcacagg cgaaagcaaa ggcaaacagg
5401 gatgcgctga ctcagcgcct gaaggatatc gtgaatgagg ctcttcgtca caatgcctca
5461 cgtacgcctt cagcaacaga gcttgctcat gctaataatg cagctatgca ggcggaagca
5521 gagcgtttgc gccttgcgaa agcagaagaa aaagcccgta aagaagcgga agcagcagaa
5581 aaggcttttc aggaagcaga acaacgacgt aaagagattg aacgggagaa ggctgaaaca
5641 gaacgccagt tgaaactggc tgaagctgaa gagaaacgac tggctgcatt gagtgaagaa
5701 gctaaagctg ttgagatcgc ccaaaaaaaa ctttctgctg cacaatctga agtggtgaaa
5761 atggatggag agattaagac tctcaattct cgtttaagct ccagtatcca tgcccgtgat
5821 gcagaaatga aaacgctcgc tggaaaacga aatgaactgg ctcaggcatc cgctaaatat
5881 aaagaactgg atgagctggt caaaaaacta tcaccaagag ccaatgatcc gcttcagaac
5941 cgtccttttt ttgaagcaac cagacgacgg gttggggccg gtaagattag agaagaaaaa
6001 caaaaacagg taacagcatc agaaacacgt attaaccgga taaatgctga tataactcag
6061 atccagaagg ctatttctca ggtcagtaat aatcgtaatg ccggtatcgc tcgtgttcat
6121 gaagctgaag aaaatttgaa aaaagcacag aataatctcc ttaattcaca gattaaggat
6181 gctgttgatg caacagttag cttttatcaa acgctgactg aaaaatatgg tgaaaaatat
6241 tcgaaaatgg cacaggaact tgctgataag tctaaaggta agaaaatcgg caatgtgaat
6301 gaagctctcg ctgcttttga aaaatacaag gatgttttaa ataagaaatt cagcaaagcc
6361 gatcgtgatg ctatttttaa tgcgttggca tcggtgaagt atgatgactg ggctaaacat
6421 ttagatcagt ttgccaagta cttgaagatt acggggcatg tttcttttgg atatgatgtg
6481 gtatctgata tcctaaaaat taaggataca ggtgactgga agccactatt tcttacatta
6541 gagaagaaag ctgcagatgc aggggtgagt tatgttgttg ctttactttt tagcttgctt
6601 gctggaacta cattaggtat ttggggtatt gctattgtta caggaa
//
LOCUS COGTUB1A 2672 bp ds-DNA PLN 13-JUL-1990
DEFINITION C.graminicola beta-tubulin-1 (TUB1) gene, complete cds.
ACCESSION M34491
KEYWORDS beta-tubulin-1.
SOURCE C.graminicola DNA.
ORGANISM Colletotrichum graminicola
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Diaporthales; Diaporthaceae.
REFERENCE 1 (bases 1 to 2672)
AUTHORS Panaccione,D.G. and Hanau,R.M.
TITLE Characterization of two divergent beta-tubulin genes from
Colletotrichum graminicola
JOURNAL Gene 86, 163-170 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 662 673 beta-tubulin-1 (TUB1), exon 1
740 763 beta-tubulin-1, exon 2
818 843 beta-tubulin-1, exon 3
905 946 beta-tubulin-1, exon 4
1014 1068 beta-tubulin-1, exon 5
1124 1908 beta-tubulin-1, exon 6
1962 2355 beta-tubulin-1, exon 7
IVS 674 739 TUB1 intron A
IVS 764 817 TUB1 intron B
IVS 844 904 TUB1 intron C
IVS 947 1013 TUB1 intron D
IVS 1069 1123 TUB1 intron E
IVS 1909 1961 TUB1 intron F
BASE COUNT 632 a 754 c 660 g 626 t
ORIGIN
1 ggatccggct cggcgttacg gactggaaac ccaaagggca acaatgagta gagcacttag
61 caaccgtgca acaacaacac catagccagg cagctgtgta tccaggggaa cgaacccatt
121 ggtccaaggt gttaccaagc ctctaggcaa taatctccca gttacggcca tacaaacctg
181 attcatgatg cctagaattc tgagcaaagc aaaacagagc atccatgtac caatcatttc
241 tgtggatata tactcaacgc ggactagctt ggtctctttt gggcgccgcc tccgagagaa
301 tgggcatcag ccataggtcc ctttcttttt tccttttccc atccccctcc cctccctacc
361 catcactctc tcttccaaag agtaagcaaa actgtgctgc cggagtaggc agccgtcgaa
421 tcagaaaggc agccattcaa cgatgcgcaa acatgtgggg attggaaatc cagaccgcaa
481 agttacactc tgctcgggcc tatccgatac gcagcgataa gttgtcggtc cttggatagg
541 ggatttgttg tgatttacag gcatatgtat ctataagatg accacctgtc ttgagtagcg
601 gggaaaatct ctgctttcca ccatcaactt ttgacaatcc acaccattcc caaagttcat
661 aatgcgtgag attgtgagtt gtaccttgtc cacgtaccta ttgaggtatt gattctcagc
721 tgactcgggg atccaacaga tacacctcca aaccggccaa tgtgtacgta tctttcgctg
781 ccgaatggat gagccctgac tgacgaagaa tctgtagggc aaccaagtcg gtacagcatt
841 ctggtatgag gatgggaccg tcgatgcacc ttcaagagat gagactgatg ttggtcacta
901 ccaggcaaac cattcacggc gagcacgggc ttgatcaaga tggagtgtga gtctcaccgc
961 catgtaacgc gacgtcacct cgctcatcga gcgtcaagtt gacattttca tagattccgc
1021 ggctctgatg agcagcagtc cgagaggctg agcgtctact tcaccgaggt aactcgatac
1081 agtcgtcgcg cacttctccc tattcctttc tgacacgatg caggctgcca aacaaaagta
1141 cgttcctcgt gcagtcctgg ttgaccttga gcccgcaacc atggatgcaa tccgttccgg
1201 tccccttggt gacttcttcc gtcccgacaa catggtctac ggacagtccg gcgcgggtaa
1261 caactgggca aagggccact acaccgaagg cgccgagctg gttgaccaag tcctcgacgt
1321 cgtgcgccgc gaagccgaag cctgcgacag tctgcagggt ttccagatca cgcactccct
1381 cggtggcggc accggctctg gcatgggcac cctcctcatc gcaaaggtcc gcgaggaatt
1441 ccctgatcgt atgatggcga ccttctctgt cctgccgtct gccaaggtct ctgaggtcgt
1501 tgtcgagccg tacaatgcga ccttgtcgat tcatcagctc gtcgaaaact cggacgagac
1561 gttctgtatc gacaacgagg ccctgtatga tatctgccgc cgcaccctga agcaagcgca
1621 cccgtcctac ggacatctca accacctggt gtccagggtc atgtccggct tgacgacggg
1681 cttccgcttc ccggggcaat tgaacgccga cctccgcaag ctggccgtga acctcgtccc
1741 cttccccaga cttcacttct tcaccgtcgg attcgcgccg ctcacctcct cggcctcctt
1801 ctcaaacctg ggtatcgccg agctcacgca gcagatgttc gacccgaaga acgtcatgtt
1861 ggcgtctgat ttccgtgacg gacgcttctt gacttgctcc accatgttgt gagttggtcc
1921 cctcatgtcg cttgatgaat tctttgctaa cgtaccttta gccgtggaaa ggtgtccatg
1981 aagcaggtcg aggaacagat ccaggccatc aagaacaaga actcagccaa cttcgtggag
2041 tggatcccga ataacatcca gactgctcat tgctctgtgc cgcccaaggg actggacgta
2101 tcatcgacgt tcattggaaa ctccaccgca atccagaaca gtttccgcag agtcggcgac
2161 cagttcagcc tcatgttccg gagaaaggct ttcttgcatt ggtacactgg agagggcatg
2221 gatgagatgg aattcactga ggccgagtct aacatgaacg atctggtatc cgagtaccag
2281 cagtaccagg atgcaggcat ggatgacgag tacggcgaag agtacgaaga cgaggctcct
2341 gctgaggagg aatgagaagg caagacaact cttcgatgtc cttcccgagg tgaagttact
2401 ttatattcaa ggtagttgat ggtctttgaa ccttttccaa tcactgtaca cacccaatcc
2461 accatcatag tccgagcagc tcacttgaag ttattttccc tttcgactga gaaggcaata
2521 gagttgttct atctcgttgc ttgtgcagac aacgaagtgg atcgttacgc aacccctagt
2581 cttgacttgc aaatcatatc gtcccaagat ccagatggct attagttcat ggatcggtac
2641 ccttgtagat ccacctgaag tggctccccg gg
//
LOCUS COGTUB1B 2614 bp ds-DNA PLN 13-JUL-1990
DEFINITION C.graminicola beta-tubulin-2 (TUB2) gene, complete cds.
ACCESSION M34492
KEYWORDS beta-tubulin-2.
SOURCE C.graminicola DNA.
ORGANISM Colletotrichum graminicola
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Diaporthales; Diaporthaceae.
REFERENCE 1 (bases 1 to 2614)
AUTHORS Panaccione,D.G. and Hanau,R.M.
TITLE Characterization of two divergent beta-tubulin genes from
Colletotrichum graminicola
JOURNAL Gene 86, 163-170 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 441 452 beta-tubulin-2 (TUB2), exon 1
655 678 beta-tubulin-2, exon 2
738 763 beta-tubulin-2, exon 3
840 881 beta-tubulin-2, exon 4
942 996 beta-tubulin-2, exon 5
1068 1858 beta-tubulin-2, exon 6
1917 2310 beta-tubulin-2, exon 7
IVS 453 654 TUB2 intron A
IVS 679 737 TUB2 intron B
IVS 764 839 TUB2 intron C
IVS 882 941 TUB2 intron D
IVS 997 1067 TUB2 intron E
IVS 1859 1916 TUB2 intron F
BASE COUNT 561 a 802 c 607 g 644 t
ORIGIN
1 cccgggtctt ggatcggaga gaacagagga gcagaatagg gattattggg taggcaggta
61 cgtgcccagg taacacgtcg tgcctgccaa tcaacgccga tctgagtcct cgacttgttg
121 cgacccacca tcaaatagtg cctgggaatg gtctttcccc aatcaggatc gtgcacggat
181 cccaatagta aacaagcgag ctgcaccctt ttcctctctg gcctgtctct gggcacctgg
241 cctgtggcgg tgagcgaaat cacgtttacc cgcaaaacaa aaatcaacaa cctttcccct
301 acctaaccac accttaatct catccacttt ccaaccacgt ccaccttgga agcttcgcgc
361 agctctcaag cacctcccac tcgtcctctc ctttgctcca tcgtcggcct agtcagctca
421 agagcttttc tccattcaca atgcgtgaga ttgtaagtcc ttcccctcaa tcattcgtaa
481 caaataaacc tgcgaccgac gcgtttggcg acgaatcgtc ggccttgccc ctgaacgtac
541 cccgccgaca tttccaccca acactggtcc tcaccgaaga cgaccacgat tgccatcacc
601 gacagtatgc accttgggga tatatcgctg accattgatt gttatactcg ataggttcac
661 ctccagaccg gccagtgcgt aagtcttctc tgatcccaac caacaatcca aggtgcgggg
721 ctaacttctt tgaatagggt aaccagattg gtgctgcctt ttggtgcgta gccagaccga
781 catcatcgac ttcggcgaga ttggcctcga aaagacattg gatattaata cgggcacagg
841 caaaacatct ctggcgagca cggcctcgac agcaatggcg tgtatgttgc caacctccag
901 atctggccac ttcctcgagt tcaccgctaa tttctcaaca gttacaatgg cacctctgag
961 ctccagctcg agcgcatgag cgtctacttc aacgaagttt gttatcctag ccccccccag
1021 gaagcagaca aacctattga tgaatactga ccttgtcacg tacccaggct tccggcaaca
1081 agtatgtccc tcgcgccgtc ctcgtcgact tggagcccgg taccatggat gctgttcgcg
1141 ccggcccctt cggccagctt ttccgccccg acaacttcgt ctttggccag tccggtgccg
1201 gcaacaactg ggccaagggt cactacaccg aaggagctga gcttgtcgac caggtccttg
1261 acgtcgtccg ccgcgaggct gagggctgcg actgccttca gggcttccag attacccact
1321 cccttggtgg aggtactggt gccggtatgg gtactctgtt gatctccaag attcgcgagg
1381 agttccccga ccgcatgatg gctacctttt ccgtcgttcc ctcccctaag gtttccgaca
1441 ccgttgttga gccctacaac gccactctct ccgtccacca gctggttgag aactccgatg
1501 agaccttctg cattgataac gaggctctct acgacatctg catgcgtact cttaagctct
1561 ctaacccctc gtacggcgac ctgaaccatc tcgtctctgc cgtcatgtcc ggtgtcacta
1621 cctgcctgcg tttccctggt cagctgaact ctgacctgcg taagctggcc gtcaacatgg
1681 ttcctttccc ccgtcttcac ttcttcatgg tcggattcgc tcccctgacc agccgtggtg
1741 cccactcttt ccgcgctgtc agcgttcctg agctcaccca gcagatgttc gaccccaaga
1801 acatgatggc tgcctctgac ttccgcaacg gtcgctacct gacctgctct gccatcttgt
1861 aagtgtcatt tccagccaac ctacaatgtc agcacttgct aacagctgcc tctcagccgt
1921 ggtaaggtcg ccatgaagga cgtcgaggac cagatgcgca acgtcctgaa caagaactct
1981 tcttacttcg tcgagtggat ccccaacaat gtccagaccg ccctctgctc cattcctccc
2041 cgcggcctca agatgtcctt cacctttgtc ggtaactcta ccgccatcca ggagctcttc
2101 aagcgtgtcg gtgagcagtt cactgccatg ttccgtcgca aggctttctt gcattggtac
2161 actggtgagg gtatggacga gatggagttc actgaggctg agtccaacat gaacgacttg
2221 gtctctgagt accagcagta ccaggacgct ggtgttgatg aggaggagga ggagtacgag
2281 gatgacgccc ccctggagga ggaggtttaa gcgttgtctg aaaatgctgt gccaccttgg
2341 ccatgtcttc acccaacccg tctgcggtgg catttcgctt cactattcca gctctgcact
2401 ggaaatgggc ttctagatat acctctctta gtagttcgcc tggcgtatca aaatgagtac
2461 gaagaatcag agattactct gtacaaatta ttggcaacat caaatgcata gttttatggc
2521 aattgcgaca cctctaatct tgccagagtt caagtattcg tatcttcttt cgtgactgac
2581 agattatcta tgtttaaacg tggacaccct cgag
//
LOCUS DDICNPA 6372 bp ds-DNA INV 13-JUL-1990
DEFINITION D.discoideum cyclic nucleotide phosphodiesterase gene, complete
cds.
ACCESSION M23449
KEYWORDS cyclic nucleotide phosphodiesterase.
SOURCE D.discoideum (strain NC4; cell line AX3-K) DNA, clones pGP-1 and
pC34.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 6372)
AUTHORS Podgorski,G.J., Franke,J., Faure,M. and Kessin,R.H.
TITLE The cyclic nucleotide phosphodiesterase gene of Dictyostelium
discoideum utilizes alternate promoters and splicing for the
synthesis of multiple mRNAs
JOURNAL Mol. Cell. Biol. (1989) In press
STANDARD full staff_review
REFERENCE 2 (bases 3380 to 4379)
AUTHORS Faure,M., Franke,J., Hall,A.L., Podgorski,G.J. and Kessin,R.H.
TITLE The cyclic nucleotide phosphodiesterase gene of Dictyostelium
discoideum contains three promoters specific for growth,
aggregation, and late development
JOURNAL Mol. Cell. Biol. 10, 1921-1930 (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by R.H.Kessin, 23-MAR-1989.
FEATURES from to/span description
pept 4127 4146 cyclic nucleotide phosphodiesterase, first
expressed exon (EC 3.1.4.17)
4297 5635 cyclic nucleotide phosphodiesterase, exon 2
pre-msg 1163 6060 CNP mRNA and introns
IVS 1780 2266 CNP intron A (no splice consensus at 2266)
IVS 2377 4077 CNP intron B
IVS 4147 4296 CNP intron C
signal 6015 6020 poly-A signal
BASE COUNT 2615 a 750 c 504 g 2503 t
ORIGIN
1 tgatcatttt gtaaatcttt caattaaatt tcaacttaca cattaataat aataaaaata
61 ataataataa taatatcaga cattcacagg atttttataa atccaatttt atttttcatt
121 gatgtcgttt tattaattga cattcaaatt gtgaataact gacaataaat aaaacagttc
181 atatatttat ttgtttgaaa aaaaaaaaaa aaaaaaaaaa tcaaagttgc atgtgcgagt
241 gtaaattggg gtttttttta taaatctaaa ttagttattt tacaatttta gaaccactac
301 caaattacaa aaaaaaaaaa aaaaaaacca ctcatttatt tttttttttt tttttttttt
361 ttttatcaca tagattccaa taaaattgtg aaacataaag aggaaaacga taatatggat
421 tattagaaaa taatcaatac tttttttttt tttttttttt tgtgttttat tggtttttaa
481 ttaaaagttt tgggggtttt ttattttaat tttttttttt tttgaaaaac atactcactc
541 acacaaaacg caatcaataa atctattttt tgaaaataaa aaaaatcttt tttttttttt
601 tttttttttt ttttctagtt ttcaaaaaat tcatttaatc gcattacatt ataaaataaa
661 aaaattcggt gtcttaaatc tcccaaaatc ttgaatgatt tgtgtgtgtt aaaataataa
721 tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaatt ccaactttta gatccccaat
781 cacataggtg tgatatatct aagttattga aaattgtgtt gatattgttt ttcttacttt
841 tttttttttt ttaaaataac cactttactt tctaaatatt gcatcgcaat gggggtatta
901 aaagttttgt tatttttatt tacacatctt gatttcaaga taaaaatttt tagtgtaaat
961 atatcaaaca ccatagagac ttttcacatt atcaagtgtt tttgactctc tatcacactc
1021 acaaatctga tttttatttt tatttttatt tttttttttt ttttatttta tttttttttt
1081 attttatttt atttttctcc aaccacaatt gatttcaaaa aaaaaaaagt ctataaaatc
1141 aaaaaaaaat ttttttttta attaatcaaa aacaaaaaca aaaaacaaaa ataaataaaa
1201 aaaaaaaaaa aaaacttgta acattaataa taataatttg taattttaaa taccaacttc
1261 aattccatat tacttttttt gtatcattat ttatatatct caatctattt taaatcacca
1321 ttgtatcact actaaaaaaa aaaaaaaaaa aaaaatcaaa aatcaaaaat ccaaatttat
1381 aaaaaaaaaa aaaaaaatca aaattgtaaa aaaataacaa tatcaccctt ttttctgtat
1441 tattacttgt atttgtaaca ctataaatta aataaataaa aatttaattt tgtatcatta
1501 ccacactatt aatttttttt ttacaatttt tgatcaataa taaatcgtta aaaaaaaaaa
1561 aaaaaaaaat ttttaataat aaaataaatt aataccaaaa ataatcaacc aaccaaccaa
1621 aaattattta attatttatt aaccatttta tcataattaa taaataaaaa aaaaaaaaaa
1681 ataaaaaaaa aacatcaaaa aaaaaaaata aaaaaacaat accatatcaa attatctctc
1741 gttctgtact attactcctt aaaaaattta ttaaaaaagg tatgtaaata atataattat
1801 tattatttta tttattatta tgattccatt taaatgtttc aaaatttggt ttttttattt
1861 tttccttatt ttcgttcgct ttgaccaatc attttttttt attattttaa tttttaatta
1921 tttttcgggt tcattccaac atgacaatgg aaaaaattaa aaaagggtat aggaaaagcg
1981 caaatcaaaa ggaaaaaaaa aaatatcttg cgtttgccaa tttttatttt tttttatttt
2041 tttttttatt tttttttttt ttgaaatcaa tcaaaatttc caaccttata ataatccaat
2101 ttaatcacaa atccagtgat tttatcgttg tgtgtgtgtg gtctgatttt aaaatcacca
2161 actgtttttt tttataatta tttttttttt attttatttt tcaatttctt ttcttttttt
2221 tttttttatt ttttaatatc ttttattttt tttttttttt ttttttttat tattattaat
2281 aatcataaaa ctgtaatagt acttaaattg taacaaaata aaaatataaa aataataata
2341 ataatttaac aataagaata gcaacaatta ttcattgttt gtatttaatc aatatttaga
2401 atttttatta tttttttttt atatttatta tttattatct atctagaata caagtagaaa
2461 atataattat ataattttat aatacccatt ataatatatt ttattttatt ttattttatt
2521 ttattttttt ttctcctaaa atcacccaaa caccccacac acaattttat gtaatattca
2581 ataacaacat tgattttcac accactcact gcccaattca attcaaacga tagtgagtaa
2641 ttaattttgt gtttagatgg tagatagaga tgatgtattt tattttttta tatcatccca
2701 agttccatat aaatcaacat tttatgatgt ttatttgttt ttcaatataa catttttatt
2761 taataaaaaa atttaattaa atcatatata acactactta ataaaaacat caataatcac
2821 accacccact accaataata aaaaataaat aaaaaaaata aataaataaa aaattaatta
2881 aatgtttata tttttataac accacacata cgcccacact ccctctttct ccccactact
2941 aaacttgata tttttattgt tttttttttt atacaataaa acattgattg ttatattttt
3001 aattttatga ttaaaaaaaa aaaaaaaaaa caaaaacaaa aactatatat aaaaataccc
3061 aaaaataaaa tatttttaag ggtagagtga taataattaa tcatacacaa aataatgaaa
3121 gtcacaagta gttataaaaa aataaaaaaa aaaaaaaaaa aaaggaaaaa aaaaaaaagt
3181 aataataata ataagaataa taaaaataat aaaaataatt taaaccctat atattgacat
3241 aatggatagc tctataatgt ataatagtag tatatttatt ttaaattata tataaaaata
3301 atttacgatt tcaaagatcg aaaccaatta taacactact gttatgaaac aaatgtgtat
3361 attttacact aacctcgtcg atatctattg tgtgtaacaa ttaggggttt tattatttcc
3421 ttttttttta aaaaaatttt tttttttttt ttatttacta ctatttttaa aaaaaaaaaa
3481 aaaaaaaaaa aaaaaaaaaa gataatttat tttgatcttt tatgtgttga tacacctttt
3541 catgtttaca cacaaacaaa aaacactcga taaattttat tattattttt tatttgtttt
3601 taatttaatt gatattaaaa aaaaaaaaat taaaaaaaaa aaaaaaaaaa aattttaagt
3661 gatatataat ttttaaaatt ttttatttat ttattttttt aaaaaattaa tttttcaaac
3721 aataatacat tcaaaattaa aacaataata acagattcaa aatatcattt ggttttgttt
3781 tttttttttt ttttatttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaattata
3841 catataaata tttataacaa taacccacaa acgccacaca ctcacttatt ttttttaatt
3901 ttaaatttta tttattttat ttatttattt atttatttaa tcaaaaaaaa aaaaaaaaaa
3961 aaaactaaaa ttaaaattac taacttattt ttttttttat aaatatatat aaaaaaaaaa
4021 aaaaaataaa attttttttt tttccaatat ctttatcttt tttttttatt ataaaagaag
4081 aaagaattaa tatagttcga ataactacaa aaaatataca aaaaaaatgg cattaaataa
4141 aaaattgtaa gtggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaattta aaaatagaaa
4201 ttgatttgtt gtttatacta attttttttt tttttttttg gaattttttt tttggaaatt
4261 tttatttttt attttttttt tttttcccaa aaatagaatt agtttattac ttttaatttt
4321 tataatttta aatattgtaa attcccatca acaagaagat tgtgatgatg acgatgaaga
4381 tataggaatt tcagcagaga gatcagaaag aagatcagtt aaaaacagta acgatggtag
4441 taatttttat aatttaaatg attattatac accagagaat tggaatcatt atagtggatc
4501 atttgcaact aaagattgta gggatgcatc atacataacc ataccattgg gtacaactgg
4561 tggattggat gaaggtaatc tctcatcatt cctcttgaca aagaaaggta gtaatctttt
4621 cattgcattg gatgcaggta cagtatggca aggtgttaga agattaacca ccttcaaata
4681 tttcaataca ctcttcaata ttacctatcc atcatgggca gttttaccag agcaaagaac
4741 ctcatggttc ctcaagaaac acgttatgag ttatttcatt ggtcacagtc atttagatca
4801 tgtaggtggt ttaattttag tttcaccaga ggattattta gcaaagaatt ggatcgatgt
4861 tcaaccacca atcaacaatg gtatcatggg tctcattcgt aaattagggt tcaaaccaac
4921 tgatttcacc tcttcaagta tcctccaaaa gaaaaccatc atgggtttac catcaaccat
4981 caattcaatc tctactaatc ttttcaataa tcaagtttgg ccaaatcttc catcatttgg
5041 tcgttatcaa tactttagtt tagcaagtgg cattgaatat ccattcacag agttggtccc
5101 atacaatgcc accactatgt ctttggtcgc caatgaattc cctttctctg tcaaagttaa
5161 accatttgaa ttgtgtcacg ataatttaat atcaacttct ttcctcttca ctgatagtat
5221 ctctggtgaa caaatcgctt tcttctctga tactggtgtt ccatcctctg tcgcttgtga
5281 ttgggaaggt aaaatctatg ccgtctggaa acaaattaaa atcgataaat taaaagctat
5341 ctacatcgaa acttctttcc caaataatac tccagactct gctatgtttg gtcatctccg
5401 tccaagagat gtcatgaaat taatggatca actcttggtt caatccattc aaacctctcc
5461 accaatgact aatctcaaac atgttaaatt aatcatcgaa cacattaaac cacaagttgc
5521 tgaagatcca aatggttgga caactcaaag agtaatctat caacaattaa aagaagcaaa
5581 taataatggt gttagaatta ttattccaaa tcaaggtgat ccaatttgta tttaaataat
5641 taattaatat aatatataat aattaaacat atttatttat ttattccacc aaaaaaaaaa
5701 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa attaaaaaaa aaaaaaaaaa aaaaaaatta
5761 aataataata ataactaaaa tagaaaacaa taccaaaaaa aaaaacaaaa aaaaaaaaaa
5821 aaaccaatcg acaatttttt tttttttttt tctctaaaac aaatacataa aaaaaatcat
5881 cttataaata actcctcctc ctttcccctc cttctataat tattcagttt cattaattgt
5941 atagattttt tttttttaag tcaaccaacc tccttttttt ttaaaccaca aaaaaaaaaa
6001 aaaaaaaaaa aaaaaataaa ccattttatt tttttttttc caaaaaaatt ttccagtttt
6061 tttttttttt tattttagaa acattttatt tttactttta aaaaatatgg gtgtttattt
6121 tgagaaatat attttttatt taataacaac accctttaag gaaagataga taataattaa
6181 ttatattttt attttacata atgcattttt tttttttttt ttgggtttta ttttattatt
6241 ttttttttta ttgttttaat attattattt ttttttcata aaactttttg agaatcacta
6301 tggaagtcac ttaaattatt attttctctt tcgttatcat cttcttggaa atcaacttgt
6361 tcatcttgat ca
//
LOCUS HUMCPGISL 2520 bp ds-DNA PRI 13-JUL-1990
DEFINITION Human nested gene protein gene, complete cds.
ACCESSION M34677
KEYWORDS coagulation factor VIII; nested gene protein.
SOURCE Human 4X B cell line (ACTT GM1202) DNA, clone p542.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2520)
AUTHORS Levinson,B., Kenwrick,S., Lakich,D., Hammonds,G.Jr. and
Gitschier,J.
TITLE A transcribed gene in an intron of the human factor VIII gene
JOURNAL Genomics 7, 1-11 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 585 1268 CpG island protein, exon 1
1271 1681 CpG island protein, exon 2
IVS < 1 > 2520 coagulation factor VIII, intron 22
mRNA 528 2265 CpG island protein mRNA (5' end = end of cDNA)
binding 523 532 Sp1 binding site
binding 543 552 AP4 binding site
BASE COUNT 418 a 843 c 761 g 498 t
ORIGIN Chromosome X.
1 cccgggaaag tcctcagcat gatggtgttc gttggggcgg gtgggcctca agaccatgac
61 tgcagtgttg cgacaggacg gacttctaag gcaaccgggg tggggacgga gaggtggacc
121 cggtgcttgg caaggggccc tggaccggaa gacggaggac tgagtagtcg gggcgggggg
181 ggggggggac aacagtctag aaatgcgcag acatggaatt agggtcacga gtttcgtttg
241 ggtcatgttc agatggagag gctgtgggat gcctttagca acctagacca cttgcattgc
301 agccgagcta gcagcagctg gtgccccgcc aggccccgcg taccttccca gggtcttgga
361 tccgcatccc cgagaccctc gatttgcata cgccgctcac agcccggcca ggccccgcct
421 ccgatcccgc tctttgcacc tcatcagggc aaggggcagg ctcgggctgg ggagccggcg
481 cgggcacggc ggcagcccgc cgagggggac gcagggcacg tcgccccgcc ccggcccgcc
541 cgccagctgc ggcggcgcgt cgggaagtgc ttggggcggc gagcatggcg gcagcggctg
601 caggcctggg cggcggcggc gccggcccgg gacccgaggc cggggacttc ctggcccgct
661 accggctggt atcgaacaag ctgaagaagc ggttcctgcg gaagccgaac gtggcggagg
721 ccggcgagca gttcggacag ctgggccggg agctgcgcgc ccaggagtgt ctgccctacg
781 cggcctggtg ccagctggcg gtggcgcgct gccagcaggc gctcttccac gggcccgggg
841 aggcgctggc cctcaccgag gccgcccgcc tcttcctgcg gcaggagcgc gacgcgccag
901 cgctggtctg ccccgccgcc tacggggagc cgctgcaggc cgccgccagc gccctgggcg
961 ccgcggtgcg tctgcacctc gagctgggcc agccggccgc cgccgccgcc ctctgcctcg
1021 agctggccgc cgccctgcgc gacctgggcc agccggccgc cgccgccggt cacttccagc
1081 gcgccgccca gctccagctg ccccagctgc ccctggccgc gctgcaggcg cttggcgagg
1141 ccgcctcctg ccagctgctg gcgcgcgact acaccggcgc cctggcggtc ttcacgcgca
1201 tgcagcgcct ggcgcgggag cacggcagcc acccggtgca gtcactgccg ccgcccccgc
1261 cgccggcacc ccagcccggg cccggggcga cgcccgccct accggccgcg ctgcttcctc
1321 cgaactccgg ctcggcggcg ccctctcccg cgccctgggc gccttctcgg acgtgctggt
1381 ccgctgcgag gtgtcccgcg tgctgctgct gctcctcctg caaccaccgc ccgccaagct
1441 gctgccggag cacgcccaga ccctggagaa gtactcctgg gaggcttttg acagccacgg
1501 gcaggagagc agcggccagc ttcccgagga gctctttctg ctgctccagt ctttggtcat
1561 ggctacccac gaaaaggaca cggaagccat caagtcgctg caggtggaga tgtggccact
1621 gttgactgct gagcagaacc acctccttca cctcgttctg caagaaacca tctcccccta
1681 ggacagggag tctgatccat cccattcacc cagtgacttc tttttgccca ggcctggact
1741 ttttgcatca gtcacgttaa ccagatgact ttgcctgtta ccaaacctca tgcatccacg
1801 tttgcgtctg gggaggaata aaaagacatc gttcccgctt ctgcgttttg ttattcctac
1861 tgccgccata ggaattattt cgttggctga acgttaccag caccccgaga acacattttg
1921 atagaatcag agtagaggac atggctgtct tctaaaaagc cacgacatga aaatgacaat
1981 ccctttcgtc tccttcctcc gctgcttcca cctaacgcag cctcctgcct ccgcctttgt
2041 ttcatagtga ggattttatt ttgcacggcg ccctccctct aaatacctac cctagatgat
2101 ttcatcctgc ccctcacttc ttaacatatt cctgtgtctt tgtaatggcc aaatttctcc
2161 ttccactcgt ccgcacggta cgtcttcatg ggagtcattt tattccttac agcttccttg
2221 taaccacagc cctaaatcca tgataaagct actcttgtac tcttcactca tgtgtggttt
2281 tggtgtccgt cgtctataaa atgtgactct cttacctact tgtcaggagg ccagcagcag
2341 aaccagatag ctgtttgggc tccaggatgt attcgttcct ttgaaatttg cgctatgcct
2401 cctatgtggg caacgaacct gcagagatca tctatgatgt cctccgtact cgggtgactg
2461 caatgcagat acatccagtg cagcgggtct ggtgccgatg catcgatttg cattgaattc
//
LOCUS HUMIGHCY 372 bp ss-mRNA PRI 13-JUL-1990
DEFINITION Human Ig active mu-chain mRNA Vv-D-J6-region, partial cds.
ACCESSION M34678
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; variable region.
SOURCE Human CD5+ EBV transformed B-cell, cDNA to mRNA, clone EBV 383.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 372)
AUTHORS van der Heijden,R.W.J., Bunschoten,H., Pascual,V.,
Uytdehaag,F.G.C.M., Osterhaus,A.D.M.E. and Capra,J.D.
TITLE Nucleotide sequence of a human monoclonal anti-idiotypic antibody
specific for a rabies virus-neutralizing monoclonal idiotypic
antibody reveals extensive somatic variability suggestive of an
antigen-driven immune response
JOURNAL J. Immunol. 144, 2835-2839 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 372 Ig mu-cahin Vv-D-J6-region (AA at 1)
/hgml_locus_uid="LW0029F" /nomgen="IGHM"
/map="14q32.33"
recomb 294 295 Vv-region end/D-region start
recomb 316 317 D-region end/J6-region start
BASE COUNT 84 a 107 c 108 g 73 t
ORIGIN
1 gaggtgcagc tggtgcagtc tggagcagaa gtgaaaaagc cgggggagtc tctgaagatc
61 tcctgtaagg gttctggata cagctttagc acctcctgga tcggctgggt gcgccagatg
121 cccgggaaag gcctcgagtg gctggggacc atccatcctg gtgactctga cacaagaaat
181 agcccctcct tccaaggcca ggtcaccatc tcagtcgaca agtccatcag cacggcctcc
241 ctgcagtggg gcagcctgaa ggcctcggac accggtatat attactgtgc gagactgata
301 acaggtcctc gacactacta ctactacgct atggacgtct ggggccaagg gaccacggtc
361 accgtctcct ca
//
LOCUS HUMIGHCZ 306 bp ds-DNA PRI 13-JUL-1990
DEFINITION Human Ig expressed mu-chain gene Vv-region.
ACCESSION M34680
KEYWORDS immunoglobulin heavy chain; mu-immunoglobulin; variable region.
SOURCE Human CD5+ EBV transformed B-cell DNA, clone EBV 383ex.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 306)
AUTHORS van der Heijden,R.W.J., Bunschoten,H., Pascual,V.,
Uytdehaag,F.G.C.M., Osterhaus,A.D.M.E. and Capra,J.D.
TITLE Nucleotide sequence of a human monoclonal anti-idiotypic antibody
specific for a rabies virus-neutralizing monoclonal idiotypic
antibody reveals extensive somatic variability suggestive of an
antigen-driven immune response
JOURNAL J. Immunol. 144, 2835-2839 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 305 Ig mu-chain Vv-region precursor (AA at 1)
/hgml_locus_uid="LW0029F" /nomgen="IGHM"
/map="14q32.33"
sigp < 1 12 Ig mu-chain Vv-region signal peptide
matp 13 > 305 Ig mu-chain Vv-region
BASE COUNT 66 a 81 c 97 g 62 t
ORIGIN
1 ggagtctgtg ccgaggtgca gctggtgcag tctggagcag aggtgaaaaa gcccggggag
61 tctctgaaga tctcctgtaa gggttctgga tacagcttta gcaccttctg gatcggctgg
121 gtgcgccaga tgcccgggaa aggcctggag tggctgggga ccatccatcc tggtgactct
181 gacacaagaa atagcccgtc cttccaaggc caggtcacca tctcagtcga caagtccatc
241 agcacggcct tcctgcagtg gggcagcctg aaggcctcgg acaccggtat atattactgt
301 gcgaga
//
LOCUS HUMIGHVV 306 bp ds-DNA PRI 13-JUL-1990
DEFINITION Human Ig germline mu-chain gene Vv-region.
ACCESSION M34679
KEYWORDS immunoglobulin heavy chain; mu-immunoglobulin; variable region.
SOURCE Human CD5+ EBV transformed B-cell DNA, clone 251.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 306)
AUTHORS van der Heijden,R.W.J., Bunschoten H Pascual,V.,
Uytdehaag,F.G.C.M., Osterhaus,J.D.M.E. and Capra,J.D.
TITLE Nucleotide sequence of a human monoclonal anti-idiotypic antibody
specific for a rabies virus-neutralizing monoclonal idiotypic anti-
body reveals extensive somatic variability suggestive of an
antigen-driven immune response
JOURNAL J. Immunol. 144, 2835-2839 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 305 Ig mu-chain Vv-region precursor (AA at 1)
/hgml_locus_uid="LW0029F" /nomgen="IGHM"
/map="14q32.33"
sigp < 1 12 Ig mu-chain Vv-region signal peptide
matp 13 > 305 Ig mu-chain Vv-region
BASE COUNT 67 a 83 c 95 g 61 t
ORIGIN
1 ggagtctgtg ccgaggtgca gctggtgcag tctggagcag aggtgaaaaa gcccggggag
61 tctctgaaga tctcctgtaa gggttctgga tacagcttta ccagctactg gatcggctgg
121 gtgcgccaga tgcccgggaa aggcctggag tggatgggga tcatctatcc tggtgactct
181 gataccagat acagcccgtc cttccaaggc caggtcacca tctcagccga caagtccatc
241 agcaccgcct acctgcagtg gagcagcctg aaggcctcgg acaccgccat gtattactgt
301 gcgaga
//
LOCUS MUSPLII 874 bp ss-mRNA ROD 13-JUL-1990
DEFINITION Mouse placental lactogen II (PL-II) mRNA, complete cds.
ACCESSION M14647
KEYWORDS placental lactogen II.
SOURCE Mouse (strain Balb/c) placenta, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 874)
AUTHORS Jackson,L.L., Colosi,P., Talamantes,F. and Linzer,D.I.H.
TITLE Molecular cloning of mouse placental lactogen cDNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 8496-8500 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 60 728 placental lactogen II (mPL-II) precursor
sigp 60 152 placental lactogen II (mPL-II) signal peptide
matp 153 725 placental lactogen II (mPL-II)
mRNA < 1 874 mPL-II mRNA
BASE COUNT 234 a 212 c 190 g 238 t
ORIGIN
1 agcactcggg gaacagcagc cttctggtgt tgtcagcact tcaagagaac tcctcagaga
61 tgaagctgtc tttgagtcaa ccatgctcct tctctggggc actcctgttg ctggcagtgt
121 caaacctact tgtctgggag aaggtgactt ccttaccaaa ttatcgatta cccactgaaa
181 gcctatacca acgtgtgatt gtggtgtcac acaacgccca tgatcttgct tcaaaagcct
241 tcatggaatt tgaaatgaag tttggtagga cagcttggac ctatggcctg atgttaagcc
301 cctgccacac tgctgcaatc cttactccag aaaacagcga gcaagtccac cagacaacat
361 cggaagacct tctgaaagtg tccatcacta ttttacaagc ctgggaagag cctctgaaac
421 acatggtggc agcagtggct gctcttccac atgtacctga tactctgctg tcaagaacaa
481 aggagttgga ggaaagaatt caagggcttc tggaaggact gaagatcata ttcaataggg
541 tttacccagg agctgttgca agtgactata ctttctggtc tgcatggtca gatttgcagt
601 catctgatga atccactaag aacagtgctc ttagaacctt atggcggtgc gtgcgcaggg
661 atacacataa agttgacaat tacctcaagg tcctgaagtg tcgtgatgtt cataacaaca
721 actgctgaac atctatccct cttctctgtc tcttagaagg tccctcgtga tctagacctt
781 caaagcacca atgaatttcc ccttctttgg tgcctttcta gattgagttg tcatcttacc
841 cccaaataaa ctgattcttt gcaaatgcta aatt
//
LOCUS MUSRARGA 2715 bp ss-mRNA ROD 13-JUL-1990
DEFINITION Mouse retinoic acid receptor gamma (mRAR-gamma-A) mRNA, complete
cds.
ACCESSION M34476
KEYWORDS retinoic acid receptor gamma.
SOURCE Mouse embryo, cDNA to mRNA, clone lambda-mE13.20.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2715)
AUTHORS Giguere,V., Shago,M., Zirngibl,R., Tate,P., Rossant,J. and
Varmuza,S.
TITLE Identification of a novel isoform of the retinoic acid receptor
gamma expressed in the mouse embryo
JOURNAL Mol. Cell. Biol. 10, 2335-2340 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 253 1629 retinoic acid receptor gamma
mRNA 1 2715 RAR-gamma-A mRNA
BASE COUNT 578 a 839 c 741 g 557 t
ORIGIN
1 cggggctctg cggccttcct tcccccgccc tcccctccag cagtttccac caggtccctc
61 acctcagcct ggcccagtat gtaggaggga atctctgcag aggccagagg gatccttgga
121 aacccactgg acagaccagg cagggtgggc acggagcctc ccaggccagg gcagtgggca
181 tgggcggggg ctgtagctga agaccacccc cgcctgctgc agagtccaag ggattcccac
241 gccgcagcta ccatggccac caataaggag agactctttg cgcccggtgc cctggggcct
301 ggatctggtt acccaggagc aggcttccca ttcgccttcc caggtgcact cagagggtcg
361 ccaccatttg agatgctgag ccctagcttc cggggcctgg gccagcctga cctccccaag
421 gagatggctt ctctctcggt ggagacacag agcaccagct cggaggagat ggtacccagc
481 tctccctcac ccccaccacc tcctcgggtc tataagccat gctttgtatg caatgacaag
541 tcttctggct accactatgg ggtcagctcc tgtgaaggct gcaagggctt cttcagacgc
601 agcattcaga aaaacatggt gtatacatgt caccgtgaca aaaactgtat catcaacaag
661 gtcaccagaa atcgatgcca gtactgcagg ctacaaaagt gtttcgaagt gggcatgtcc
721 aaggaagctg taaggaacga tcgaaacaag aagaaaaagg aggtaaaaga ggagggctcg
781 cccgacagct atgaactgag tccacagtta gaggaactca tcaccaaggt cagcaaagcc
841 caccaggaga cttttccctc actctgccag ctgggcaagt acaccacgaa ctccagtgca
901 gatcaccggg tgcagctgga cctggggctg tgggacaagt tcagcgagct ggccaccaaa
961 tgcatcatca agattgtgga gtttgcgaag cggctgcctg gttttacagg gctcagcatt
1021 gccgaccaga tcacgctgct caaggctgct tgtctggaca tcctaatgct gcggatctgt
1081 acaaggtata ccccagagca ggacactatg acattctcgg atgggctgac cctgaaccga
1141 acccagatgc acaatgctgg ctttgggccc cttacagacc tcgtctttgc ctttgccggg
1201 cagctgctgc ccctggagat ggatgacacc gagactgggc tacttagtgc tatctgcctc
1261 atctgtggag accgaatgga cctggaagag cccgagaagg tggacaagct gcaggagccc
1321 ctgctggaag ccctgaggct ctatgcccgg cgacgagacc cagccaaacc ctacatgttc
1381 ccaaggatgc tgatgaaaat caccgacctc cggggcatca gcactaaggg agcagaaagg
1441 gctataaccc tgaagatgga gattccaggc ccgatgccac ccctgatccg agagatgctg
1501 gagaacccgg agatgtttga ggacgactcc tcgaagcctg gcccccaccc caaggcttcc
1561 agtgaggacg aagctccagg gggccagggc aaaaggggcc aaagtcccca acctgaccag
1621 gggccctgac ctaccccgtt gtggggttgg gccccaggca gcagactgac catttcccag
1681 ataccgccag tgactggggg aggacctgcc ccgcccactc tccacccttt taatgagctc
1741 gttattttgc caaagtttct aggggtgcct gtgttcatcc ctgtcctgtt ctaactggct
1801 ccctctacag tcccgcgggg atctgctgca cacctaccag aagagctgga gaaggggcga
1861 gcctgggtct agactctaaa atctcagcac tgcctttcgg attcaggcca caggctcccg
1921 acgcaagagg aagccccgcc ttcccatagc cttttcctct gccaggtgct tgggcctctg
1981 ggagcaaaca ggaacactag agaccaaaag gggggccccc cccgagggga gggctgagcc
2041 acccttgtgc cccaccctga gcgcctgact ctgtgtgatg aatctgccgc gctcagtgcc
2101 cccggtgccc catcctaatc ctcctgccgg gtgtaggggc aggctggggc ctgcgttttc
2161 tagggtgggg cctgagggga agggagaatg tgagcccgag cccgtctgta cctctgctga
2221 agtacagcac tcatcctggg ccctccatta tataagcctc tcccttactt ctgtctcgtg
2281 ccttggcctc ccgccctcat ctcagccatc aggcagggcc ctcctacact acagaggggc
2341 caggggtccc tctcctagtg ccttctaccc ttgatccccc cagagcagct tggccccggg
2401 gaggggagat gctgcttagc tgatcccacc ttgacccaga ggaagcctct atttatttat
2461 tagcttttgt ttacaccctg gaaatgaccc cttcctccag gggtcttggg agggggagcc
2521 cagggcctct gtgacccctt cctttctttc cccaagcccc cagtttatat ttagctgcca
2581 aataagattc ctgcaggctc ccccatttct ctggggggtt agggtgctgt cccctcccct
2641 ctgtttatat ctcctacccc gctgttcgca tattgctgag ttttctattt ttgcaaaata
2701 aagtgatgga aactc
//
LOCUS MUSRARGB 2496 bp ss-mRNA ROD 13-JUL-1990
DEFINITION Mouse retinoic acid receptor gamma (RAR-gamma-B) mRNA, complete
cds.
ACCESSION M34475
KEYWORDS retinoic acid receptor gamma.
SOURCE Mouse embryo, cDNA to mRNA, clone lambda-mE7.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2496)
AUTHORS Giguere,V., Shago,M., Zirngibl,R., Tate,P., Rossant,J. and
Varmuza,S.
TITLE Identification of a novel isoform of the retinoic acid receptor
gamma expressed in the mouse embryo
JOURNAL Mol. Cell. Biol. 10, 2335-2340 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 67 1410 retinoic acid receptor gamma
mRNA 1 2496 RAR-gamma-B mRNA
BASE COUNT 533 a 751 c 684 g 528 t
ORIGIN
1 aggaggggga atgggctgcg ggtccttggt gttctagcac ccagtttctc tccaagccgg
61 gtcgcgatgt acgactgcat ggaatcgttt gtcccgggtc cgcgacggct gtatggggcg
121 gccgggcccg gggccggctt actacgcaga gccactggca gctcctgctt cgccggactt
181 gagtcttttg cctgggcaca gcctgccagt ctacaatcgg tggagacaca gagcaccagc
241 tcggaggaga tggtacccag ctctccctca cccccaccac ctcctcgggt ctataagcca
301 tgctttgtat gcaatgacaa gtcttctggc taccactatg gggtcagctc ctgtgaaggc
361 tgcaagggct tcttcagacg cagcattcag aaaaacatgg tgtatacatg tcaccgtgac
421 aaaaactgta tcatcaacaa ggtcaccaga aatcgatgcc agtactgcag gctacaaaag
481 tgtttcgaag tgggcatgtc caaggaagct gtaaggaacg atcgaaacaa gaagaaaaag
541 gaggtaaaag aggagggctc gcccgacagc tatgaactga gtccacagtt agaggaactc
601 atcaccaagg tcagcaaagc ccaccaggag acttttccct cactctgcca gctgggcaag
661 tacaccacga actccagtgc agatcaccgg gtgcagctgg acctggggct gtgggacaag
721 ttcagcgagc tggccaccaa atgcatcatc aagattgtgg agtttgcgaa gcggctgcct
781 ggttttacag ggctcagcat tgccgaccag atcacgctgc tcaaggctgc ttgtctggac
841 atcctaatgc tgcggatctg tacaaggtat accccagagc aggacactat gacattctcg
901 gatgggctga ccctgaaccg aacccagatg cacaatgctg gctttgggcc ccttacagac
961 ctcgtctttg cctttgccgg gcagctgctg cccctggaga tggatgacac cgagactggg
1021 ctacttagtg ctatctgcct catctgtgga gaccgaatgg acctggaaga gcccgagaag
1081 gtggacaagc tgcaggagcc cctgctggaa gccctgaggc tctatgcccg gcgacgagac
1141 ccagccaaac cctacatgtt cccaaggatg ctgatgaaaa tcaccgacct ccggggcatc
1201 agcactaagg gagcagaaag ggctataacc ctgaagatgg agattccagg cccgatgcca
1261 cccctgatcc gagagatgct ggagaacccg gagatgtttg aggacgactc ctcgaagcct
1321 ggcccccacc ccaaggcttc cagtgaggac gaagctccag ggggccaggg caaaaggggc
1381 caaagtcccc aacctgacca ggggccctga cctaccccgt tgtggggttg ggccccaggc
1441 agcagactga ccatttccca gataccgcca gtgactgggg gaggacctgc cccgcccact
1501 ctccaccctt ttaatgagct cgttattttg ccaaagtttc taggggtgcc tgtgttcatc
1561 cctgtcctgt tctaactggc tccctctaca gtcccgcggg gatctgctgc acacctacca
1621 gaagagctgg agaaggggcg agcctgggtc tagactctaa aatctcagca ctgcctttcg
1681 gattcaggcc acaggctccc gacgcaagag gaagccccgc cttcccatag ccttttcctc
1741 tgccaggtgc ttgggcctct gggagcaaac aggaacacta gagaccaaaa ggggggcccc
1801 ccccgagggg agggctgagc cacccttgtg ccccaccctg agcgcctgac tctgtgtgat
1861 gaatctgccg cgctcagtgc ccccggtgcc ccatcctaat cctcctgccg ggtgtagggg
1921 caggctgggg cctgcgtttt ctagggtggg gcctgagggg aagggagaat gtgagcccga
1981 gcccgtctgt acctctgctg aagtacagca ctcatcctgg gccctccatt atataagcct
2041 ctcccttact tctgtctcgt gccttggcct cccgccctca tctcagccat caggcagggc
2101 cctcctacac tacagagggg ccaggggtcc ctctcctagt gccttctacc cttgatcccc
2161 ccagagcagc ttggccccgg ggaggggaga tgctgcttag ctgatcccac cttgacccag
2221 aggaagcctc tatttattta ttagcttttg tttacaccct ggaaatgacc ccttcctcca
2281 ggggtcttgg gagggggagc ccagggcctc tgtgacccct tcctttcttt ccccaagccc
2341 ccagtttata tttagctgcc aaataagatt cctgcaggct cccccatttc tctggggggt
2401 tagggtgctg tcccctcccc tctgtttata tctcctaccc cgctgttcgc atattgctga
2461 gttttctatt tttgcaaaat aaagtgatgg aaactc
//GenBank-Updates@genbank.bio.net (07/14/90)
LOCUS RATCYP7 3535 bp ss-mRNA ROD 14-JUL-1990
DEFINITION Rat cholesterol 7-alpha-hydroxylase (CYP7) mRNA, complete cds.
ACCESSION J05430
KEYWORDS cholesterol 7-alpha-hydroxylase.
SOURCE Rat (Sprague Dawley) adult male liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3535)
AUTHORS Jelinek,D.F., Andersson,S., Slaughter,C.A. and Russell,D.W.
TITLE Cloning and regulation of cholesterol 7-alpha-hydroxylase, the rate
-limiting enzyme in bile acid biosynthesis
JOURNAL J. Biol. Chem. 265, 8190-8197 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W.Russell, 07-MAR-1990, for release after publication.
FEATURES from to/span description
pept 30 1541 cholesterol 7-alpha hydroxylase (CYP7) (EC
1.14.13.17)
rpt 2195 2313 Alu repeat
BASE COUNT 1035 a 705 c 696 g 1099 t
ORIGIN
1 cgctttggaa attttcctgc ttttgcaaaa tgatgactat ttctttgatt tggggaattg
61 ccgtgttggt gagctgttgc atatggttta ttgttggaat aaggagaagg aaagctggtg
121 aacctccttt ggagaacggg ttgattccgt acctgggctg tgctctgaaa tttggatcta
181 atcctcttga gttcctaaga gctaatcaaa ggaagcatgg tcacgttttt acctgcaaac
241 tgatggggaa atatgtccat ttcatcacaa actccctgtc ataccacaaa gtcttatgtc
301 atggaaaata ttttgactgg aaaaaatttc attacactac ttctgcgaag gcatttggac
361 acagaagcat tgacccaaat gatggaaata ccacggaaaa tataaacaac acttttacca
421 aaaccctcca gggagatgct ctgtgttcac tttctgaagc catgatgcaa aacctccaat
481 ctgtcatgag acctcctggc cttcctaaat caaagagcaa tgcctgggtc acggaaggga
541 tgtatgcctt ctgttaccga gtgatgtttg aagctggcta tctaacactg tttggcagag
601 atatttcaaa gacagacaca caaaaagcac ttattctaaa caaccttgac aacttcaaac
661 aatttgacca agtctttccg gcactggtgg caggccttcc tattcacttg ttcaagaccg
721 cacataaagc tcgggaaaag ctggctgagg gattgaagca caagaacctg tgtgtgaggg
781 accaggtctc tgaactgatc cgtctacgta tgtttctcaa tgacacgctc tccacctttg
841 acgacatgga gaaggccaag acgcacctcg ctattctctg ggcatctcaa gcaaacacca
901 ttcctgcaac cttttggagc ttatttcaaa tgatcaggag tcctgaagca atgaaagcag
961 cctctgaaga agtgagtgga gctttacaga gtgctggcca agagctcagc tctggaggga
1021 gtgccattta cttggatcaa gtgcaactga atgacctgcc ggtactagac agcatcatca
1081 aggaggctct gaggctttcc agtgcatcct tgaatatccg cacagctaag gaggacttca
1141 ctctccatct tgaggacggt tcctataaca tccgaaaaga tgacatgata gctctttatc
1201 cacagttaat gcacttggat cctgaaatct acccagaccc tttgactttc aaatatgacc
1261 ggtaccttga tgaaagcggg aaagcaaaga ccaccttcta cagtaatgga aacaagctga
1321 agtgtttcta catgcccttc ggatcaggcg cgacaatatg tcctggaaga ctctttgccg
1381 tccaagaaat caagcagttt ttgatcctga tgctctcctg ctttgaactg gagtttgtgg
1441 agagccaagt caagtgtccc cctctagacc agtcccgggc aggcttggga attttgccac
1501 cactacatga tattgagttt aaatataaac tgaaacactg atacgtggtt ggaagaagcg
1561 aacactggat gatgtcactt ggcggctgag agtcatcact aaacaggcct tcgggaccaa
1621 tgctcactga tgcgccctag cgactggatt agtgggaaga actttgttct cgctgcccac
1681 attcctgggt gttcacatag ctggggccag agcttcatca ctttcagaaa gcaatgtctt
1741 ttgtatttat tttcaaaatg aagatattcc aattggcagg atatttttcc taaggaaatt
1801 gctttatatt tttatgaaaa ctaccaatta attatgaaag ggcttgaaat tcacgtttta
1861 gtgaaattac tgatttttca ctagtaaggt tcttcaggtg tgaaactgta ttataaaaat
1921 gttgtaatgg gtcacactgt gctttgcata aaggtaaagg aaactatgtt tcagcctttt
1981 ctgtgtctat gagcttcgaa aataatctta ctgttctaga aacactgggg aggtttcgac
2041 atgctctcgc tatattttat tttactgttg ctagaaattt tcattccagt tttcaactac
2101 cttatctttc ccccattttg acatgcatgc caatgagaag agtatttttt aggaattaac
2161 aaggcacctc ccagaaccct accctgagac ttttaagcct ttaatcccag cactcgagaa
2221 gtagagccag gcagatctct gagtctgagg ttattctggt ctacatcagc tccagacaag
2281 ccaggactac agaatgggat cttgtctaaa aaatacagct aatctttatg tcataactga
2341 ttatgaatca acctaaaaga taaattttca atcaggactc agagaaaatg agcaattaaa
2401 aaacttagct ctgaggtatg tggaattcat taagtacaag ttgacattac atgttcttta
2461 aaaatagttt atgttttatc tctaaatgcc ctgcagatga agaataataa tgaaaagttg
2521 aataatactg tttaaacact aagtgcaata atgctttggt aatgtacttt aagagaatca
2581 ttagccgtgc cagttttact aaaatatatt tatatgtaaa ttatatttat ctttttctta
2641 taccataaat ataaaaatat tgcaacattt agtaatttta aaattatata cctttcagaa
2701 aatgatgtat gatgtttgta tgttttttaa ctttgaacag aacatttaaa ttattcatct
2761 acggtgattt ttatcttatt tatttctttt tgtctcattc atatcttgaa gaaatccaaa
2821 aatatctgaa ggaatcgctc actcaaatgt ctccctatgg ttacagaaaa attcaatacc
2881 atgtttttgt cctcggggac tgaagcaggg tgtcgtgggt gcgagcagag gctcctgctg
2941 cagcgagctt tatccacggg actccttaaa cttttaaaat cttatcacta ttatcatgca
3001 tttattacct aagtaggata tttccctttc ctttttcatt tcagcagagt cccttagcaa
3061 cccaggctga ctgggaccct ccatgtagct taagctgtga actcactgta cttcctgttt
3121 tcacttattt taggaagtaa ttttccctat cagaaatttt aattgtttag atgatgtata
3181 agagtaacac aattctgtta tatactaatc tgtagtaaac taaatttgtt cttagaacaa
3241 gtttgatgac tctcaaattg aatgtatcca tacatctttc catggcttct tgaatgccca
3301 tttctcatac acagaatgat gggtttcacg gtgatgtctt cctttcatgt ctttattctt
3361 gtgcggtgat ggttggcaaa tgatacccat ggagcaaggt tactcttcct atttctgtgc
3421 agcctaagtg ttaagaataa tttttaaata cttggaggga aggcacattt tgtgtcatat
3481 gtgaagtgac atgtgacaca cagactagca aatccatgag taaaatttta ttggg
//
LOCUS PDECYT550 1047 bp ds-DNA BCT 14-JUL-1990
DEFINITION P.denitrificans cytochrome c550 gene, complete cds, and
iso-cytochrome oxidase subunit I (iso-COI) gene, 5' end.
ACCESSION M27304
KEYWORDS cytochrome c550; cytochrome oxidase.
SOURCE P.denitrificans (strain NCIB 8944) DNA, clone pRS59.
ORGANISM Paracoccus denitrificans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1047)
AUTHORS Van Spanning,R.J.M., Wansell,C., Harms,N., Oltmann,L.F. and
Stouthamer,A.H.
TITLE Mutagenesis of the gene encoding cytochrome c-550 of Paracoccus
denitrificans and analysis of the resultant physiological effects
JOURNAL J. Bacteriol. 172, 986-996 (1990)
STANDARD full staff_review
REFERENCE 2 (sites; amino acid correction for [1])
AUTHORS Van Spanning,R.J.M., Wansell,C., Harms,N., Oltmann,L.F. and
Stouthamer,A.H.
TITLE Errata: Mutagenesis of the gene encoding cytochrome c-550 of
Paracoccus denitrificans and analysis of the resultant
physiological effects
JOURNAL J. Bacteriol. 172, 3534-3534 (1990)
STANDARD full staff_review
COMMENT Printed sequence for [1] kindly submitted by R.J.M.Van Spanning,
25-AUG-1989.
FEATURES from to/span description
pept 235 702 cytochrome c550 precursor
pept 874 > 1047 iso-cytochrome oxidase subunit I
sigp 235 294 signal peptide cytochrome c550
matp 295 699 cytochrome c550
ORF < 1 90 open reading frame (AA at 1)
rpt 734 743 inverted repeat copy A
rpt 757 766 inverted repeat copy B
BASE COUNT 215 a 340 c 309 g 183 t
ORIGIN 1 bp upstream of SalI site.
1 gtcgacatgg gcctgccggt cggcatggat gacctggacc tggccctgcg ccgcagcttc
61 gcacggaact ttccgccgct cgccggctga ttttcggccc tgcgacatat tctccctcag
121 ccgttctacg acgtttttcc tgtattctgc ccgcttggca catgatagcc ctgtcaatcg
181 gggaagcgga gaggtccgtt gcccggacat gagcgatacc caagaggaaa cgcgatgaag
241 atcagcatct atgccactct cgccgccatc accctcgccc tgcccgctgc ggcccaggat
301 ggcgacgccg ccaaaggcga gaaagaattc aacaagtgca aggcttgcca catgatccag
361 gcgccggacg gcaccgacat catcaagggc ggcaagaccg ggcccaacct ttacggcgtc
421 gtcggccgca agatcgcctc ggaggagggc ttcaaatacg gcgaaggcat cctcgaggtc
481 gccgaaaaga accccgacct gacctggacc gaggccgacc tgatcgaata cgtcaccgac
541 cccaagccct ggctggtcaa gatgaccgac gacaagggcg ccaagaccaa gatgaccttc
601 aagatgggca agaaccaggc cgacgtggtg gccttcctgg cccagaactc gcccgatgcg
661 ggcggcgacg gcgaggctgc ggccgagggc gaatcgaact gatcccgacg cctccgagat
721 gtctcgacgg ccccgcccct gcgtgcgaca aaagcgcgca ggggcgcgac ttttagtcct
781 ttcttcgcat tgcgaggggt aggattgtgg tctagaaaca ggcgagtccg tcggcctttg
841 cgccggccag ccgcatgtct agggagtcca cgcatggcag acgcagccgt tcacggccac
901 ggtgaccatc atgacacccg cgggttcttc acccgctggt tcatgtcaac aaaccacaag
961 gatatcggta tcctttacct gttcacggcc ggcatcgtcg gcctgatctc ggtatgcttc
1021 accgtctata tgcggatgga actgcag
//
LOCUS ECOSPEC 2330 bp ds-DNA BCT 14-JUL-1990
DEFINITION E.coli ornithine decarboxylase (speC) and cyclic AMP receptor
protein genes, complete cds.
ACCESSION M33766
KEYWORDS cyclic AMP receptor protein; ornithine decarboxylase; speC gene.
SOURCE E.coli (strain K12) DNA, clone pLC20-5.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2330)
AUTHORS Barroso,L., Moore,R., Wright,J., Patel,T. and Boyle,S.M.
TITLE Analysis and sequence of the speC (ornithine decarboxylase) gene of
Escherichia coli
JOURNAL Unpublished (1990)
STANDARD unannotated staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.M.Boyle, 22-MAY-1990.
Author address: S.M.Boyle
Virginia Polytechnic Institute and State University
College of Veterinary Medicine, VPI & SU
Blacksburg, VA 24061
FEATURES from to/span description
pept 86 2281 ornithine decarboxylase (speC) (EC 4.1.1.17)
pept 719 630 (c) cyclic AMP receptor protein (CRP)
mRNA 15 2289 speC mRNA (alt., primer extension)
mRNA 16 2289 speC mRNA (alt., S1 mapping)
mRNA 27 2289 speC mRNA (alt., primer extension)
mRNA 30 2289 speC mRNA (alt., S1 mapping)
mRNA 41 2289 speC mRNA (alt., S1 mapping)
mRNA 42 2289 speC mRNA (alt., primer extension)
signal 14 19 -10 region
signal 733 728 (c) -10 region
signal 755 750 (c) -35 region
signal 2289 2320 rho-independent transcription termination
signal
binding 364 367 pyridoxal-PO4 binding site
binding 66 72 GTP phosphoryl binding site
binding 263 266 GTP phosphoryl binding site
binding 379 382 GTP guanine binding site
BASE COUNT 508 a 546 c 664 g 612 t
ORIGIN
1 ctgcaggaca acagtaaaat cagagcgttt ctgcttttac tgatgtctgg cggtcggagc
61 tggtgaccag tttgacccat atctcatggg gcagggtttt ccaccttgtc cggtattctt
121 acttccccga aacgggtttg cgcttatgaa atcaatgaat attgccgcca gtagtgaact
181 ggtatcccga ctttcttctc atcgtcgcgt ggtggcgttg ggagatactg attttacgga
241 cgtcgcggca gtcgtcatta ccgctgcgga tagtcgcagt ggcattcttg cgttgcttaa
301 gcgcaccggt tttcatctac cggtgttttt gtattccgaa catgctgttg aattacctgc
361 gggcgttacg gcggtaatca acggcaacga gcagcagtgg ctggagctgg aatccgcagc
421 ctgtcagtat gaagagaatt tgctgccacc gttttatgac acgctgacgc agtacgttga
481 gatgggcaac agcacctttg cttgccctgg acatcaacat ggtgcgtttt ttaaaaagca
541 tcctgccgga cgccattttt acgatttctt tggtgagaac gtctttcgcg ccgatatgtg
601 taacgctgac gtaaaattgg gcgatctgct tattcatgaa ggatcggcga aagatgcgca
661 gaaattcgca gccaaagtct ttcatgccga taaaacctat tttgtgctga acggcacatc
721 ggcagcgaat aaagtggtga cgaatgcgct gttaacgcgt ggcgatctgg tgctcttcga
781 ccgtaacaac cataagtcga atcatcacgg cgcgctgatt caggcggggg cgacgccggt
841 ctatctggaa gcttcacgca acccgtttgg tttcattggc ggtattgatg cgcactgttt
901 taatgaagag tatctgcgcc agcaaattcg cgacgttgcg ccagaaaaag ccgacctgcc
961 gcgcccgtat cgcctggcga ttattcagct gggaacctat gacggcactg tctataacgc
1021 ccgtcaggtg atcgataccg ttgggcatct gtgtgattac attctgtttg attccgcgtg
1081 ggtcggttat gaacaattta tcccgatgat ggcggatagc tcgccgctgc tgttagaact
1141 taacgaaaac gatccgggga tctttgtgac tcagtcggtg cacaaacagc aggcgggatt
1201 ctcacagacg tcgcagatcc ataaaaaaga taaccatatc cgcggacagg cgcgtttttg
1261 cccgcataag cggttgaata acgcctttat gctccatgct tctaccagcc ctttctatcc
1321 gctgtttgct gcactggatg ttaacgccaa aattcatgaa ggggagagtg ggcgtcggct
1381 gtgggctgag tgtgttgaga tagggattga agcgcgcaag gctattcttg cgcgctgtaa
1441 gctgttccgc ccgtttatcc cgcccgttgt tgatggcaaa ttgtggcagg attatccgac
1501 atcagtgtta gccagcgacc gccgtttttt cagttttgag ccgggggcga agtggcacgg
1561 ctttgaagga tatgccgcgg atcagtattt tgttgatccg tgcaagctgt tactcactac
1621 accaggtatc gatgccgaaa ccggcgaata tagcgacttt ggcgttccgg cgacgattct
1681 ggcgcactat ctgcgtgaga acggcattgt gccggagaag tgcgatctca actccattct
1741 gtttttatta actccggcgg aaagccacga gaagctggca caactggtgg cgatgctggc
1801 gcaatttgaa cagcatattg aggatgactc gccgctggtt gaggtgttgc cgagcgttta
1861 taacaagtat ccggtgcgct atcgcgacta caccctgcgc cagttgtgtc aggagatgca
1921 cgatctgtat gtcagtttcg acgtcaaaga cctacaaaaa gcgatgttcc gccagcagag
1981 tttcccgtca gtggtgatga acccccagga tgcgcatagc gcttatattc gcggtgacgt
2041 ggagttggtg cggattcgtg atgccgaagg gcgaattgcg gcagaagggg cgttgcctta
2101 tccacctggc gtgctttgcg tggtacccgg ggaagtctgg ggtggggcgg ttcaacgtta
2161 tttccttgca ctggaagaag gggtgaattt gttgccggga ttttcgccgg agctgcaagg
2221 tgtttatagc gaaaccgatg cggatggcgt gaaacggttg tacggttatg tgttgaagta
2281 agaataaaaa aaacgggtca ccttctggcg acccgttttt ctttgcgtaa
//
LOCUS HUMLPLAS 557 bp ss-mRNA PRI 14-JUL-1990
DEFINITION Human L-plastin mRNA, 5' end.
ACCESSION M34426
KEYWORDS L-plastin.
SOURCE Human, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 557)
AUTHORS Lin,C.-S., Aebersold,R.H. and Leavitt,J.
TITLE Correction of the N-terminal sequences of the human plastin
isoforms by using anchored polymerase chain reaction:
Identification of a potential calcium-binding domain
JOURNAL Mol. Cell. Biol. 10, 1818-1821 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 138 > 557 L-plastin
BASE COUNT 174 a 111 c 133 g 139 t
ORIGIN
1 acttcctgcc ttgtgaccac acacccaggc ttgacaaagc tgttctgcag atcagaaaga
61 aggggttcct ggtcatacac cagtactacc aaggacagct tttttcctgc aagatctgtt
121 acctaaagca ataaaaaatg gccagaggat cagtgtccga tgaggaaatg atggagctca
181 gagaagcttt tgccaaagtt gatactgatg gcaatggata catcagcttc aatgagttga
241 atgacttgtt caaggctgct tgcttgcctt tgcctgggta tagagtacga gaaattacag
301 aaaacctgat ggctacaggt gatctggacc aagatggaag gatcagcttt gatgagttta
361 tcaagatttt ccatggccta aaaagcacag atgttgccaa gacctttaga aaagcaatca
421 ataagaagga agggatttgt gcaatcggtg gtacttcaga gcagtctagc gttggcaccc
481 aacactccta ttcagaggaa gaaaagtatg cctttgtcaa ctggataaac aaagccctgg
541 aaaatgatcc tgattgt
//
LOCUS HUMTPLAS 516 bp ss-mRNA PRI 14-JUL-1990
DEFINITION Human T-plastin mRNA, 5' end.
ACCESSION M34427
KEYWORDS T-plastin.
SOURCE Human, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 516)
AUTHORS Lin,C.-S., Aebersold,R.H. and Leavitt,J.
TITLE Correction of the N-terminal sequences of the human plastin
isoforms by using anchored polymerase chain reaction:
Identification of a potential calcium-binding domain
JOURNAL Mol. Cell. Biol. 10, 1818-1821 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 88 > 516 T-plastin
BASE COUNT 169 a 85 c 125 g 137 t
ORIGIN
1 aaagattccg aggtgcagaa gttgtctgag tgggttggtc ggcggcagtc gggccagacc
61 caggactctg cgactttaca tctttaaatg gatgagatgg ctaccactca gatttccaaa
121 gatgagcttg atgaactcaa agaggccttt gcaaaagttg atctcaacag caacggattc
181 atttgtgact atgaacttca tgagctcttc aaggaagcta atatgccatt accaggatat
241 aaagtgagag aaattattca gaaactcatg ctggatggtg acaggaataa agatgggaaa
301 ataagttttg acgaatttgt ttatattttt caagaggtaa aaagtagtga tattgccaag
361 accttccgca aagcaatcaa caggaaagaa ggtatttgtg ctctgggtgg aacttcagag
421 ttgtccagcg aaggaacaca gcattcttac tcagaggaag aaaaatatgc ttttgttaac
481 tggataaaca aagctttgga aaatgatcct gattgt
//
LOCUS MUSICSBP 2544 bp ss-mRNA ROD 14-JUL-1990
DEFINITION Mouse interferon consensus sequence binding protein mRNA, complete
cds.
ACCESSION M32489
KEYWORDS DNA binding protein; interferon consensus sequence binding protein.
SOURCE Mouse (strain Balb/c; haplotype H-2d) neonatal spleen and lung,
cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2544)
AUTHORS Driggers,P.H., Ennist,D.L., Gleason,S.L., Mak W,-H., Marks,M.S.,
Levi,B.-Z., Flanagan,J.R., Appella,E. and Ozato,K.
TITLE An interferon gamma-regulated protein that binds the
interferon-inducible enhancer element of major histocompatibility
complex class I genes
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3743-3747 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Ozato, 23-MAR-1990, for release after publication.
FEATURES from to/span description
pept 21 1295 interferon consensus sequence binding protein
binding 21 362 DNA binding site
BASE COUNT 584 a 680 c 739 g 541 t
ORIGIN
1 gcgtgggaac cggcggcagg atgtgtgacc ggaacggcgg gcggcggctg cggcagtggc
61 tgatcgaaca gatcgacagc agcatgtacc cggggctgat ctgggaaaat gatgagaaga
121 ccatgttccg tatcccctgg aagcatgccg gcaagcagga ttacaatcag gaggtggatg
181 cttccatctt caaggcctgg gcagttttta aagggaagtt taaagaggga gacaaagctg
241 aaccagccac gtggaagacg aggttacgct gtgctctgaa caagagccca gattttgaag
301 aagtgactga ccggtcccag ctggacattt ctgagccata taaagtttac cgaattgtcc
361 ccgaggaaga acaaaaatgc aagctgggcg tggcacctgc aggctgcatg agcgaagttc
421 ctgagatgga gtgtggccgc tcagagattg aggagctgat caaggaacct tctgtggatg
481 agtacatggg tatgaccaag aggagcccat ccccaccaga ggcctgcagg agccagatcc
541 tccctgactg gtgggtccag cagcccagtg caggcctgcc actggtgacc ggatatgccg
601 cctatgacac acaccattca gctttctccc agatggtcat cagcttctac tacgggggca
661 agctggtggg ccaggccacc accacctgcc ttgaaggctg ccgtctctcc ctgagccagc
721 cggggctgcc taagttgtat gggccggatg gcctggaacc cgtgtgcttt ccgacggccg
781 acaccatccc cagtgagcgg cagaggcagg tgacccggaa gctgtttggg cacctggaac
841 gtggcgtgct actgcacagc aaccgcaagg gcgtgttcgt gaagcggctg tgccagggcc
901 gcgtgttctg cagcggcaac gcggtggtgt gcaagggcag gcccaacaag ctggagcggg
961 acgaggtggt gcaggtcttt gacaccaacc agttcatccg agagctgcag caattctacg
1021 ccacccagag ccgcctacct gacagcaggg tggtcctgtg cttcggggag gagtttccgg
1081 acactgtgcc cttgcgctcc aaactcattc tggtgcaggt agagcagctg tatgccaggc
1141 agctggtgga ggaagcgggc aagagctgcg gtgctggctc cctgatgcca gccctggagg
1201 agccccagcc ggaccaggct ttccgcatgt ttccggatat ctgtacctca caccagagac
1261 ccttttttag agaaaatcaa cagatcaccg tctaagcctc agtccgggca ccccacctcg
1321 cctgagctca agcttcaaga gtctgtgact aagagaattc cgaaaggatg tggagccctc
1381 tgactggggt gggcgggtgt cctccaaggg gcctccggaa gcccacagag ggatgcgctc
1441 ctgctcaggc aggtgtcaga agcttgcagg ggctgtggcc gcaacctgtg attaaagcat
1501 tccttttcct tacactattt cacccctcac cactaatggc tggcctttct gtgtgctgag
1561 gtctttcgac agttcaaatc atctggtggc agcagactcg cctttgccct tctgcggccg
1621 agggcggaga tttatgactt tctctgcttg gttggagaag aagaatcttt actattcagc
1681 ttcttttctt ttggccagaa ctctgaaaaa aaaaaaaaac tcttttttaa gaccaatatt
1741 tgtattctca caggctcagc tgtcaatcac ttgagacctt ccctgtaaag tggggcagat
1801 tttaaatatg ggtgtagata ctcgttgcag ccttcgcagg aattttggtt gtggttcatt
1861 gattcacaca gactctgtgt cagctgacag ggctgtgtgg ggcatcaaag gaggaccagg
1921 cactgtggag aagacccatt cactggcatc tcacccttcc ttgtccagct ccatacccag
1981 tcctaagacc cagtgaaaag ccacgtccaa actgtgctct gggctcatca gtgcccaccc
2041 acgtaccagg gaaaggcaca cacccatccc agtgggcaca gagcggaatg tccccctacc
2101 gcaccatttg cgcccccaat ctggctgtcc aacctagttt gtaagtaatc taaatcagtg
2161 actatagccc cgcctaaggg acacttcccg gaggagggag ccgctgaaaa ggagttagtt
2221 tgagggtcag tacacaacag gggcagaaag ccaagcagat gtgggggcag ggaggtcatc
2281 atctgctttt gtctgagaga aggagagctt ctccgtttgt tcaactttgt aacaagctgg
2341 gttacatgct ccacgcagct agagaagcct aggtgctctg cattccctgg ggaactgcag
2401 gaaagcctta cctgctgact gttgctctgg ggaaaagcct gagggtccag agcagctaca
2461 agctacaggc cataccttac aacctgaaaa gctaaggacc acggtgacct tcccggctac
2521 tgtgtgaagg tgctgggtgg ggcc
//
LOCUS DDIUMPCMP 694 bp ss-mRNA INV 14-JUL-1990
DEFINITION D.discoideum UMP-CMP kinase mRNA, complete cds.
ACCESSION M34568
KEYWORDS UMP-CMP kinase.
SOURCE D.discoideum, cDNA to mRNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 694)
AUTHORS Wiesmueller,L., Noegel,A.A., Barzu,O., Gerisch,G. and Schleicher,M.
TITLE cDNA-derived sequence of UMP-CMP kinase from Dictyostelium
discoideum and expression of the enzyme in Escherichia coli
JOURNAL J. Biol. Chem. 265, 6339-6345 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 54 638 UMP-CMP kinase
mRNA < 1 694 UMP-CMP kinase mRNA
BASE COUNT 287 a 74 c 110 g 223 t
ORIGIN
1 tttaaatata taaattttgt aaaacaactt caaatttttt aaattcaaga atgatggaaa
61 aaagtaaacc aaatgttgta tttgttttag gtggaccagg ttcaggtaaa ggtactcaat
121 gcgccaacat tgtcagagat tttggatggg tacatttatc agcaggtgat ttattacgtc
181 aagaacaaca atctggttca aaagatggtg aaatgattgc aacaatgatc aaaaatggtg
241 aaatcgttcc atcaattgta actgtaaaat tattaaagaa tgctattgat gctaatcaag
301 gtaaaaactt tttagttgat ggtttcccac gtaatgaaga aaataataat tcttgggaag
361 aaaatatgaa agattttgtt gatacaaaat ttgttttatt ctttgattgt ccagaagaag
421 ttatgactca aagattatta aagagaggtg aatcaagtgg tagatcagat gataatattg
481 aatcaattaa aaagagattc aatacattca acgttcaaac taaattagtt atcgatcatt
541 ataataaatt tgacaaagtt aaaatcattc cagcaaatag agatgttaat gaagtttata
601 atgatgttga aaaccttttt aaatcaatgg gtttttaaaa aaaaaaaaaa aaaaaccaat
661 aaattttaaa attactaaat aataaatata cccc
//
LOCUS HUMBGALXX 2409 bp ss-mRNA PRI 14-JUL-1990
DEFINITION Human beta-galactosidase (GLB1) mRNA, complete cds.
ACCESSION M34423
KEYWORDS beta-galactosidase.
SOURCE Human testis, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2409)
AUTHORS Yamamoto,Y., Hake,C.A., Martin,B.M., Kretz,K.A.,
Ahern-Rindell,A.J., Naylor,S.L., Mudd,M. and O'Brien,J.S.
TITLE Isolation, characterization, and mapping of a human acid beta-
galactosidase cDNA
JOURNAL DNA 9, 119-127 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 61 2094 beta-galactosidase precursor (EC 3.2.1.23)
/nomgen="GLB1" /map="3pter-p21"
/hgml_locus_uid="LA0059G "
sigp 61 129 beta-galactosidase signal peptide (put.)
matp 130 2091 beta-galactosidase
BASE COUNT 566 a 635 c 613 g 595 t
ORIGIN
1 gaattcgggc gcgaagcggc cggcctgggc gccgactgca gagccgggag gctggtggtc
61 atgccggggt tcctggttcg catcctcctt ctgctgctgg ttctgctgct tctgggccct
121 acgcgcggct tgcgcaatgc cacccagagg atgtttgaaa ttgactatag ccgggactcc
181 ttcctcaagg atggccagcc atttcgctac atctcaggaa gcattcacta ctcccgtgtg
241 ccccgcttct actggaagga ccggctgctg aagatgaaga tggctgggct gaacgccatc
301 cagacgtatg tgccctggaa ctttcatgag ccctggccag gacagtacca gttttctgag
361 gaccatgatg tggaatattt tcttcggctg gctcatgagc tgggactgct ggttatcctg
421 aggcccgggc cctacatctg tgcagagtgg gaaatgggag gattacctgc ttggctgcta
481 gagaaagagt ctattcttct ccgctcctcc gacccagatt acctggcagc tgtggacaag
541 tggttgggag tccttctgcc caagatgaag cctctcctct atcagaatgg agggccagtt
601 ataacagtgc aggttgaaaa tgaatatggc agctactttg cctgtgattt tgactacctg
661 cgcttcctgc agaagcgctt tcgccaccat ctgggggatg atgtggttct gtttaccact
721 gatggagcac ataaaacatt cctgaaatgt ggggccctgc agggcctcta caccacggtg
781 gactttggaa caggcagcaa catcacagat gctttcctaa gccagaggaa gtgtgagccc
841 aaaggaccct tgatcaattc tgaattctat actggctggc tagatcactg gggccaacct
901 cactccacaa tcaagaccga agcagtggct tcctccctct atgatatact tgcccgtggg
961 gcgagtgtga acttgtacat gtttataggt gggaccaatt ttgcctattg gaatggggcc
1021 aactcaccct atgcagcaca gcccaccagc tacgactatg atgccccact gagtgaggct
1081 ggggacctca ctgagaagta ttttgctctg cgaaacatca tccagaagtt tgaaaaagta
1141 ccagaaggtc ctatccctcc atctacacca aagtttgcat atggaaaggt cactttggaa
1201 aagttaaaga cagtgggagc agctctggac attctgtgtc cctctgggcc catcaaaagc
1261 ctttatccct tgacatttat ccaggtgaaa cagcattatg ggtttgtgct gtaccggaca
1321 acacttcctc aagattgcag caacccagca cctctctctt cacccctcaa tggagtccac
1381 gatcgagcat atgttgctgt ggatgggatc ccccagggag tccttgagcg aaacaatgtg
1441 atcactctga acataacagg gaaagctgga gccactctgg accttctggt agagaacatg
1501 ggacgtgtga actatggtgc atatatcaac gattttaagg gtttggtttc taacctgact
1561 ctcagttcca atatcctcac ggactggacg atctttccac tggacactga ggatgcagtg
1621 cgcagccacc tggggggctg gggacaccgt gacagtggcc accatgatga agcctgggcc
1681 cacaactcat ccaactacac gctcccggcc ttttatatgg ggaacttctc cattcccagt
1741 gggatcccag acttgcccca ggacaccttt atccagtttc ctggatggac caagggccag
1801 gtctggatta atggctttaa ccttggccgc tattggccag cccggggccc tcagttgacc
1861 ttgtttgtgc cccagcacat cctgatgacc tcggccccaa acaccatcac cgtgctggaa
1921 ctggagtggg caccctgcag cagtgatgat ccagaactat gtgctgtgac gttcgtggac
1981 aggccagtta ttggctcatc tgtgacctac gatcatccct ccaaacctgt tgaaaaaaga
2041 ctcatgcccc cacccccgca aaaaaacaaa gattcatggc tggaccatgt atgatgatga
2101 aagcctgtgt ctttgaggga ttctaccctg aacatacctc acagatcctc cctgtcatgc
2161 cacatttcac tgattggaat gtggaaatgg aaaaggaatt taggatgtgc attttcacct
2221 gaggtttccc tgcatccctg cagtgccaaa gccccacctt cagggaccac ctggaatgtg
2281 tgaggggctg acagcacagt aacgtgcata catatctgca gggctggaat ggaagcttta
2341 aaggtggtag tgatttttat tttggaagaa tcatgttacc tttttgttaa ataaaatttg
2401 cccgaattc
//
LOCUS HUMGAA 3441 bp ss-mRNA PRI 14-JUL-1990
DEFINITION Human acid alpha-glucosidase (GAA) mRNA, complete cds.
ACCESSION M34424
KEYWORDS acid alpha-glucosidase; acid maltase.
SOURCE Human, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3441)
AUTHORS Martiniuk,F., Mehler,M., Tzall,S., Meredith,G. and Hirschhorn,R.
TITLE Sequence of the cDNA and 5'-flanking region for human acid alpha-
glucosidase, detection of an intron in the 5' untranslated leader
sequence, definition of 18-bp polymorphisms, and differences with
previous cDNA and amino acid sequences
JOURNAL DNA 9, 85-94 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 33 2891 acid alpha-glucosidase /nomgen="GAA"
/map="17q23" /hgml_locus_uid="LZ0096H"
BASE COUNT 612 a 1170 c 1027 g 632 t
ORIGIN
1 gcctgtagga gctgtccagg ccatctccaa ccatgggagt gaggcacccg ccctgctccc
61 accggctcct ggccgtctgc gccctcgtgt ccttggcaac cgctgcactc ctggggcaca
121 tcctactcca tgatttcctg ctggttcccc gagagctgag tggctcctcc ccagtcctgg
181 aggagactca cccagctcac cagcagggag ccagcagacc agggccccgg gatgcccagg
241 cacaccccgg ccgtcccaga gcagtgccca cacagtgcga cgtccccccc aacagccgct
301 tcgattgcgc ccctgacaag gccatcaccc aggaacagtg cgaggcccgc ggctgctgct
361 acatccctgc aaagcagggg ctgcagggag cccagatggg gcagccctgg tgcttcttcc
421 cacccagcta ccccagctac aagctggaga acctgagctc ctctgaaatg ggctacacgg
481 ccaccctgac ccgtaccacc cccaccttct tccccaagga catcctgacc ctgcggctgg
541 acgtgatgat ggagactgag aaccgcctcc acttcacgat caaagatcca gctaacaggc
601 gctacgaggt gcccttggag accccgcatg tccacagccg ggcaccgtcc ccactctaca
661 gcgtggagtt ctccgaggag cccttcgggg tgatcgtgcg ccggcagctg gacggccgcg
721 tgctgctgaa cacgacggtg gcgcccctgt tctttgcgga ccagttcctt cagctgtcca
781 cctcgctgcc ctcgcagtat atcacaggcc tcgccgagca cctcagtccc ctgatgctca
841 gcaccagctg gaccaggatc accctgtgga accgggacct tgcgcccacg cccggtgcga
901 acctctacgg gtctcaccct ttctacctgg cgctggagga cggcgggtcg gcacacgggg
961 tgttcctgct aaacagcaat gccatggatg tggtcctgca gccgagccct gcccttagct
1021 ggaggtcgac aggtgggatc ctggatgtct acatcttcct gggcccagag cccaagagcg
1081 tggtgcagca gtacctggac gttgtgggat acccgttcat gccgccatac tggggcctgg
1141 gcttccacct gtgccgctgg ggctactcct ccaccgctat cacccgccag gtggtggaga
1201 acatgaccag ggcccacttc cccctggacg tccagtggaa cgacctggac tacatggact
1261 cccggaggga cttcacgttc aacaaggatg gcttccggga cttcccggcc atggtgcagg
1321 agctgcacca gggcggccgg cgctacatga tgatcgtgga tcctgccatc agcagctcgg
1381 gccctgccgg gagctacagg ccctacgacg agggtctgcg gaggggggtt ttcatcacca
1441 acgagaccgg ccagccgctg attgggaagg tatggcccgg gtccactgcc ttccccgact
1501 tcaccaaccc cacagccctg gcctggtggg aggacatggt ggctgagttc catgaccagg
1561 tgcccttcga cggcatgtgg attgacatga acgagccttc caacttcatc aggggctctg
1621 aggacggctg ccccaacaat gagctggaga acccacccta cgtgcctggg gtggttgggg
1681 ggaccctcca ggcggccacc atctgtgcct ccagccacca gtttctctcc acacactaca
1741 acctgcacaa cctctacggc ctgaccgaag ccatcgcctc ccacagggcg ctggtgaagg
1801 ctcgggggac acgcccattt gtgatctccc gctcgacctt tgctggccac ggccgatacg
1861 ccggccactg gacgggggac gtgtggagct cctgggagca gctcgcctcc tccgtgccag
1921 aaatcctgca gtttaacctg ctgggggtgc ctctggtcgg ggccgacgtc tgcggcttcc
1981 tgggcaacac ctcagaggag ctgtgtgtgc gctggaccca gctgggggcc ttctacccct
2041 tcatgcggaa ccacaacagc ctgctcagtc tgccccagga gccgtacagc ttcagcgagc
2101 cggcccagca ggccatgagg aaggccctca ccctgcgcta cgcactcctc ccccacctct
2161 acacactgtt ccaccaggcc cacgtcgcgg gggagaccgt ggcccggccc ctcttcctgg
2221 agttccccaa ggactctagc acctggactg tggaccacca gctcctgtgg ggggaggccc
2281 tgctcatcac cccagtgctc caggccggga aggccgaagt gactggctac ttccccttgg
2341 gcacatggta cgacctgcag acggtgccaa tagaggccct tggcagcctc ccacccccac
2401 ctgcagctcc ccgtgagcca gccatccaca gcgaggggca gtgggtgacg ctgccggccc
2461 ccctggacac catcaacgtc cacctccggg ctgggtacat catccccctg cagggccctg
2521 gcctcacaac cacagagtcc cgccagcagc ccatggccct ggctgtggcc ctgaccaagg
2581 gtggggaggc ccgaggggag ctgttctggg acgatggaga gagcctggaa gtgctggagc
2641 gaggggccta cacacaggtc atcttcctgg ccaggaataa cacgatcgtg aatgagctgg
2701 tacgtgtgac cagtgaggga gctggcctgc agctgcagaa ggtgactgtc ctgggcgtgg
2761 ccacggcgcc ccagcaggtc ctctccaacg gtgtccctgt ctccaacttc acctacagcc
2821 ccgacaccaa ggtcctggac atctgtgtct cgctgttgat gggagagcag tttctcgtca
2881 gctggtgtta gccgggcgga gtgtgttagt ctctccagag ggaggctggt tccccaggga
2941 agcagagcct gtgtgcgggc agcagctgtg tgcgggcctg ggggttgcat gtgtcacctg
3001 gagctgggca ctaaccattc caagccgccg catcgcttgt ttccacctcc tgggccgggg
3061 ctctggcccc caacgtgtct aggagagctt tctccctaga tcgcactgtg ggccggggcc
3121 tggagggctg ctctgtgtta ataagattgt aaggtttgcc ctcctcacct gttgccggca
3181 tgcgggtagt attagccacc cccctccatc tgttcccagc accggagaag ggggtgctca
3241 ggtggaggtg tggggtatgc acctgagctc ctgcttcgcg cctgctgctc tgccccaacg
3301 cgaccgctgc ccggctgccc agagggctgg atgcctgccg gtccccgagc aagcctggga
3361 actcaggaaa attcacagga cttgggagat tctaaatctt aagtgcaatt atttttaata
3421 aaaggggcat ttggaatcaa a
//
LOCUS HUMGAAX 298 bp ds-DNA PRI 14-JUL-1990
DEFINITION Human acid alpha-glucosidase (GAA) gene, 5' flank.
ACCESSION M34425
KEYWORDS acid alpha-glucosidase; acid maltase.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 298)
AUTHORS Martiniuk,F., Mehler,M., Tzall,S., Meredith,G. and Hirschhorn,R.
TITLE Sequence of the cDNA and 5'-flanking region for human acid alpha-
glucosidase, detection of an intron in the 5' untranslated leader
sequence, definition of 18-bp polymorphisms, and differences with
previous cDNA and amino acid sequences
JOURNAL DNA 9, 85-94 (1990)
STANDARD simple staff_review
FEATURES from to/span description
IVS 274 > 298 acid alpha-glucosidase, intron A
/hgml_locus_uid="LZ0096H" /nomgen="GAA"
/map="17q23"
BASE COUNT 38 a 88 c 128 g 44 t
ORIGIN
1 cgtgcggagg tgagccgggc cggggctgcg gggcttccct gagcgcgggc cgggtcggtg
61 gggcggtcgg ctgcccgcgc ggcctctcag tgggaaagct gaggttgtcg ccggggccgc
121 gggtggaggt cggggatgag gcagcaggta ggacagtgac ctcggtgacg cgaaggaccc
181 cggccacctc taggttctcc tcgtccgccc gttgttcagc gagggaggct ctgggcctgc
241 cgcagctgac ggggaaactg aggcacggag cgggtgagac acctgacgtc tgccccgc
//
LOCUS HUMMAK 171 bp ds-DNA PRI 14-JUL-1990
DEFINITION Human male germ cell-associated kinase (mak) gene, exon N.
ACCESSION M35863 M34569
KEYWORDS kinase.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 171)
AUTHORS Matsushime,H., Jinno,A., Takagi,N. and Shibuya,M.
TITLE A novel mammalian protein kinase gene (mak) is highly expressed in
testicular germ cells at and after meiosis
JOURNAL Mol. Cell. Biol. 10, 2261-2268 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 25 / 157 male germ cell-associated kinase (mak), exon N
(AA at 27)
IVS < 1 24 mak intron X
IVS 158 > 171 mak intron X+1
BASE COUNT 48 a 30 c 38 g 55 t
ORIGIN
1 tttttttctc cgtatatcat caaggctttt ttcataggga catgaaacca gaaaacttgc
61 tttgtatggg tccagagctt gtgaaaattg ctgattttgg acttgcaaga gaattaaggt
121 cacagccacc atacactgac tatgtatcta ccagatggtg agtaggggtt a
//
LOCUS RATMAK 2372 bp ss-mRNA ROD 14-JUL-1990
DEFINITION Rat male germ cell-associated kinase (mak) mRNA, complete cds.
ACCESSION M35862 M34569
KEYWORDS kinase.
SOURCE Rat, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2372)
AUTHORS Matsushime,H., Jinno,A., Takagi,N. and Shibuya,M.
TITLE A novel mammalian protein kinase gene (mak) is highly expressed in
testicular germ cells at and after meiosis
JOURNAL Mol. Cell. Biol. 10, 2261-2268 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 237 2105 male germ cell-associated kinase (mak)
mRNA < 1 2372 mak mRNA
BASE COUNT 683 a 575 c 574 g 540 t
ORIGIN
1 tctgaatatg tggagactat tatagcaaaa tggactcagt gtctctttgc cagattcctt
61 attgctgtat cccatgagcc tacctgacct tgaggaagag ccgtgtttaa tgatgacaag
121 cctgtctccc gtgtagtgac aaaaaggagg aggaccacaa ggaaagacgg atttagattt
181 ctactcagca aggaagcgag ctctgtcagc aacctctctg gattaccctt cccaagatga
241 accgatatac aaccatgagg cagctggggg acggcacgta cgggagtgtg cttatgggca
301 agagcaatga gtctggggag ctggtggcaa tcaagaggat gaagagaaag ttctattctt
361 gggatgagtg tatgaacttg cgagaagtta agtccctgaa gaaactcaat catgccaatg
421 tgattaaact aaaagaagtt atcagagaaa atgaccatct ttattttata tttgaatata
481 tgaaagaaaa cctctatcag ctaatgaaag acagaaacaa actgttccct gagtcagtca
541 tcagaaatat tatgtatcaa atactacagg ggctggcatt tatccacaaa catggctttt
601 ttcacaggga catgaaacct gagaatttgc tttgcatggg tccagagctg gtgaagattg
661 ctgattttgg acttgcgaga gaattaagat cacagccacc atatactgac tatgtgtcta
721 ccagatggta ccgtgctcct gaagttttgc taaggtcttc agtgtacagc tctcccattg
781 acgtgtgggc cgtgggaagt ataatggccg agctatatac gtttagaccg cttttcccag
841 ggaccagtga agttgatgag atctttaaaa tttgccaagt gttagggact cccaagaaaa
901 gtgactggcc ggaggggtac cagctggcat cctccatgaa cttccgcttt ccccagtgca
961 ttcctataaa cctgaaaact ctcattccca atgccagtag tgaggctatt cagcttatga
1021 cagaaatgct taactgggat ccaaagaaac ggccaactgc aagccaggca ctgaagcacc
1081 catattttca agtcggtcag gtattgggcc cttctgcaca ccatctggat gcaaaacaga
1141 ctttgcacaa gcagctgcag cctccagagc caaagccatc ttcctctgaa cgggatccta
1201 agcctttgcc aaacatcctt gatcagcctg ccgggcagcc ccagccaaaa cagggccacc
1261 aaccactgca ggccattcag ccaccacaga acacagtggt tcagccacct ccaaagcagc
1321 agggtcacca taagcaacca caaacgatgt ttccaagtat cgtcaaaacc ataccaacga
1381 atccagtcag cacagtaggc cataaggggg cccggagacg gtggggtcag acagtcttca
1441 agtctggaga cagctgtgac aacatcgagg actgcgactt gggagcctcc cactccaaga
1501 agccgagcat ggatgccttc aaggaaaaaa agaagaagga gtctccattt cggtttccag
1561 aagcaggact cccagtctcc aaccacttga agggggaaaa tagaaattta catgcatcct
1621 taaaatctga cacaaacttg tcaactgctt caaccgctaa gcagtactat ttgaaacaat
1681 caagatacct tccgggtgtg aaccccaaga acgtgtcttt ggtagctggc ggcaaggata
1741 taaattcaca ctcttggaat aatcagctat ttcctaagtc tctgggatcc atgggggcgg
1801 acctcgcttt caagaggagt aacgcagcag ggaaccttgg gagctacagc gcttacagcc
1861 agacgggatg cgtgccttcc tttctcaaga aagaagtggg atcagctggc cagaggatcc
1921 acttggcgcc tctgggtgcg tcggctgccg actatacctg gagcacaaaa actggccgag
1981 gccagttttc aggacgaact tacaatccca cagccaaaaa tctcaatatt gtgaaccgca
2041 cacagccagt cccctcggtg cacgggagga cagactgggt ggctaagtac ggaggccacc
2101 ggtaggagaa gtggaagcct gaagcattgc tccgtagagg acaatcacgc cccttgatcc
2161 tgggagacgt ctacagagtc tatttctacc gagttccaca ggacgcacac aaccgtgggc
2221 gcctcagaga ccggaagtca gctcccgatt tcttcctttt ccggaaatgc aatgcatttt
2281 cttaacttgt ggccagcagt gctgatgcag caccgtgctg aaacttttga gcgggacttt
2341 taaagagtat tgaataaatt atttgccaaa gt
//
LOCUS BNRCFIA 1930 bp ds-DNA BCT 14-JUL-1990
DEFINITION B.fragilis imipenem-cefoxitin-hydrolyzing enzyme (cfiA) gene,
complete cds.
ACCESSION M34831
KEYWORDS imipenem-cefoxitin-hydrolyzing enzyme.
SOURCE B.fragilis (strain TAL2480) DNA.
ORGANISM Bacteroides fragilis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Anaerobic gram-negative straight, curved and helical rods;
Bacteroidaceae.
REFERENCE 1 (bases 1 to 1930)
AUTHORS Thompson,J.S. and Malamy,M.H.
TITLE Sequencing the gene for an imipenem-cefoxitin-hydrolyzing enzyme
(cfiA) from Bacteroides fragilis TAL2480 reveals strong similarity
between cfiA and Bacillus cereus beta-lactamase II
JOURNAL J. Bacteriol. 172, 2584-2593 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 533 1282 imipenem-cefoxitin-hydrolyzing enzyme
precursor
sigp 533 586 imipenem-cefoxitin-hydrolyzing enzyme signal
peptide
matp 587 1279 imipenem-cefoxitin-hydrolyzing enzyme
BASE COUNT 563 a 435 c 434 g 498 t
ORIGIN
1 gatcaataag cagttcatct tgtctgcagt attgaaatca taggagttgt ccagaagcag
61 atttataggt gatgttctta caagtcagtt cttcgatagc acgcaatatg gtgtctgcgc
121 tgcaagtgcg aagagttgga tgaagagaca aatgtttcat caggtgagtt gtaacatcct
181 caatacatga gccgccacaa agatatacgc acatcagaga gcgtagaatt tcgctatatt
241 gataaccaaa catagtgcat ctcaatccca aggtggaatc tatggtttga gctaaaagag
301 catcaaattg ctccataata gaaaaaattc ctccaaaagg agtgagtttc tcagatttta
361 tttgtacttt tgccatgtca tattagagtt ttgcttgtct tcttttcgca acactaaggt
421 aagtgaaaat tctgacatgg caaaatcctg ggcaactttt tgttgctcag gcacttataa
481 ataatgttaa actatagtgt tgcggaatta aggcatcata taaaagaata aaatgaaaac
541 agtatttatc cttatctcca tgcttttccc tgtcgcagtt atggcacaga aaagcgtaaa
601 aatatccgat gacatcagta tcacccaact ctcggacaaa gtgtacactt atgtatccct
661 cgccgaaatc gaaggatggg gtatggtacc ttccaacggg atgattgtta tcaacaacca
721 ccaggcagcg ttgctggaca caccgatcaa tgacgcacaa acggaaatgc tggtcaactg
781 ggtgacagac tctttgcatg ccaaagtcac cacgtttatc ccgaaccact ggcacggcga
841 ttgtattggc ggactgggtt acctgcaaag gaaaggtgtc caatcatacg cgaaccagat
901 gacgatagac ctcgccaagg aaaaagggtt gcccgtaccg gaacatggat tcaccgattc
961 actgaccgtc agcttggacg gcatgcctct ccaatgttat tatttaggag gcggacatgc
1021 gaccgacaat atcgtggttt ggctgccgac agagaatatc ctttttggcg gatgtatgct
1081 taaagacaac caggcgacaa gcatcggcaa catctcggac gcggacgtga cggcatggcc
1141 gaaaactctc gataaggtaa aagccaagtt cccctcggcc cgttacgtcg tgcccggaca
1201 tggcgactat ggcggaaccg aactgataga gcataccaag cagatcgtga accaatatat
1261 agaaagcact tcaaagccat agcccgaatt gtaggcatct acccggtatt ccatgaacga
1321 agaggcgaaa gcttttatcc cgtccacaga atgtaacccg gccacaaacg agacataaaa
1381 atatcaattg aacaacacaa cagatccttg tttcatctac ggcgtcgcag ttaggtcttt
1441 ttttatcaat tgttaagcgt ttcttcccga ttattagcga tctttgccat caaaaaataa
1501 acgatatgaa ttggattatc ctaacgattg ccggatgctt tgaggtgctg ttcactttct
1561 gcatggggaa agtgaaagaa acaaccggaa cgaaatgtac ctgtggttca ccggattcct
1621 gatcgcaact tttataagta tgttactact ggtgaaagcc acccagaatc tgcccatcgg
1681 cacagcttat gccgtgtgga caggaatagg cgctgtgggt acggtgctcg taggtatcct
1741 tgtgtttagc gaacccgtca gttttggaga ctcttcttta tagtgatgct gatcagttca
1801 ataatcgggc tgaagatggt ttcgcattga gggggaaatt catccctcag cacctccggc
1861 aattccgcca acgaacggat catcagatac tcctccgacg atgaatctcc tcgtgttcac
1921 atcgctcagc
//
LOCUS CLTSTRRA 142 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION M34684
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-A.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 142)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt 36 77 STRR1 sequence
rpt 78 119 STRR1 sequence
BASE COUNT 28 a 64 c 19 g 31 t
ORIGIN
1 tatttattcc catcacccaa tccccaatcc ccagtcccca ttacccctta tcccagaggg
61 gccccgagtt ccccaatccc cattacccct tatcccagag ggtccccgag ttccccagtc
121 cccagtcccc aatcctgact gg
//
LOCUS CLTSTRRB 73 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION M34685
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-B.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 73)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 19 a 33 c 5 g 16 t
ORIGIN
1 ggagttctcc aatccccaat ccccagtccc caatccccat tccccattcc ccattcccct
61 caaactaaag aat
//
LOCUS CLTSTRRC 64 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION M34686
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-C.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 64)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 16 a 26 c 2 g 20 t
ORIGIN
1 tttttccaaa tccccaatcc ctagtcccca atccctagtc cccaatccct aatcatttca
61 attc
//
LOCUS CLTSTRRD 49 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION M34687
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-C.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 49)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 9 a 23 c 5 g 12 t
ORIGIN
1 ttcctctgct tccccagtcc ccagtcccca gtccccaccc ataaaagtt
//
LOCUS CLTSTRRE 74 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION M34688
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-E.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 74)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 6 a 33 c 3 g 32 t
ORIGIN
1 aggttttgta ttttcccttt cccctttcac cttttcccct tcccctttca cctttcccct
61 ttccccaccc ctat
//
LOCUS CLTSTRRF 75 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION M34689
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-A.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 14 a 8 c 15 g 38 t
ORIGIN
1 ctggtttgtt ttgttattgg tcattggtca tttgtcattg gtcatttgtc attagttatt
61 ggtaattcat atcaa
//
LOCUS CLTSTRRG 64 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION M34690
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-B.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 64)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 10 a 10 c 11 g 33 t
ORIGIN
1 tttttgttgt ttgtcatttg tcattggtca tttgtcatta gtcctttgtc attcaaaccg
61 agtt
//
LOCUS CLTSTRRH 77 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION M34691
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-C.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 77)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 18 a 6 c 17 g 36 t
ORIGIN
1 ccaatttgta ttagtcattt gttattagtc atttgtcatt tgtcatttgg ggtttgtaat
61 tggtaatagg aggaatt
//
LOCUS CLTSTRRI 85 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION M34692
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-D.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 85)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 25 a 0 c 25 g 35 t
ORIGIN
1 atggggtaat tggtaattgg taattggtaa ttggtaattg gtaattggta attggtaatt
61 ggtaattggt aattggtaat agatt
//
LOCUS CLTSTRRJ 80 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION M34693
KEYWORDS s.
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-E.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 80)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 13 a 13 c 17 g 37 t
ORIGIN
1 tcccatgtca tttgtcattt gtcattggtt attggttatt ggtcatttgt catttggggt
61 ttggtaattg acctcaactc
//
LOCUS CLTSTRRK 76 bp ds-DNA BCT 14-JUL-1990
DEFINITION Callothrix sp. short tandem repeat family 3 (STRR3).
ACCESSION M34694
KEYWORDS .
SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 3.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
TITLE Highly repetitive DNA sequences in Cyanobacterial genomes
JOURNAL J. Bacteriol. 172, 2755-2761 (1990)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 25 a 15 c 15 g 21 t
ORIGIN
1 ttttaagtct gtgaacagtc aacagtcaac agtcaacagt taacagttga aaggacacag
61 cagtgctatg tccttt
//
LOCUS CPACM559B 1076 bp ds-DNA ORG 14-JUL-1990
DEFINITION C.paradoxa cyanelle cytochrome b-559 (psbE, psbF, psbI, and psbJ)
genes, complete cds.
ACCESSION M35129
KEYWORDS cytochrome b-559.
SOURCE C.paradoxa cyanelle DNA.
ORGANISM Cyanelle Cyanophora paradoxa
Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae;
Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa.
REFERENCE 1 (bases 1 to 1076)
AUTHORS Cantrell,A. and Bryant,D.A.
TITLE Nucleotide sequence of the genes encoding cytochrome b-559 from the
cyanelle genome of Cyanophora paradoxa
JOURNAL Photosyn. Res. 16, 65-81 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 246 470 cytochrome b-559 apoprotein (psbE)
pept 510 638 cytochrome b-559 apoprotein (psbF)
pept 648 764 cytochrome b-559 apoprotein (psbI)
pept 787 909 cytochrome b-559 apoprotein (psbJ)
signal 926 951 pot. transcription termination signal
BASE COUNT 367 a 142 c 160 g 407 t
ORIGIN
1 tctttaagta gtcagatatc tggaaatact tggaaaaaag aaacaagtac agacaatatt
61 ccttcaaatt tctataaaat tagttttata gataaagaag taggtttata ctaggaaatc
121 agggaacact tttacgatat gtatcattat aatagaataa ttaatatctt ataatttgaa
181 taaaatcttt tattatacat aaaagcatca tatatatata tatataattt ttaggagtga
241 tttatatgtc tggaggaact actggcgaac gcccattttc tgacattgtt actagtattc
301 gttattgggt tattcatact gtaactattc cattctttat tgttgcaggt tggctttttg
361 taagtactgg tttagcttat gatgtatttg gtactccaag accagatgaa tacttcactg
421 aagaacgtca agaagtacca attattaatc aacgtttttc aactaattaa tcgttattaa
481 tttatagttt ctaaaagagg aaaaaggaaa tgaataatcc taatcaaccg gtttcttatc
541 caatttttac agttagatgg ttagcaattc atgctattgg aattccagct gtatttttta
601 ttggatctat tactgcaatg caatttattc aacgatagga gatatatatg gttagccaaa
661 atcctaatag acaaaaagtt gaattaaatc gtacttccct attctgggga ttacttttaa
721 ttttcgtatt agcaatttta ttctctagct acatttttaa ctaaatttct tcaataagga
781 gtctttatgg caaatactgg tggacgcatt cctttatggc ttgttgctac agttgcaggt
841 ttagcagcta ttggtgtact aggaatcttt ttctatggtg gttactctgg tttaggttcc
901 tctatttaat aaaatcaatt taagattttt tgtaataaat taattaaaaa aaaatagaga
961 gttaaatttt tttaactctc tattctttag acatttataa ctaataaaag aaattaagca
1021 atttcatctg tttcaatata aacaaataat aatgccataa cattgctggt aagaca
//
LOCUS KVSM121M 612 bp ds-RNA VRL 14-JUL-1990
DEFINITION Killer virus of S.cerevisiae (class M1, type 1), M1-2 region mRNA
fragment.
ACCESSION M35150
KEYWORDS .
SOURCE Killer virus of S.cerevisiae (strain A364A x S7) class M1 type 1,
cDNA to viral RNA, clones pEH-[2,3].
ORGANISM Killer virus of S.cerevisiae
Viridae; ds-RNA nonenveloped viruses.
REFERENCE 1 (bases 1 to 612)
AUTHORS Georgopoulos,D.E., Hannig,E.M. and Leibowitz,M.J.
TITLE Sequence of the M1-2 region of killer virus double-stranded RNA
JOURNAL Basic Life Sci. 40, 203-213 (1986)
STANDARD simple staff_review
BASE COUNT 205 a 140 c 141 g 125 t 1 others
ORIGIN
1 aaagaaagaa cnaaaagaaa cagaaaaaga gagaacagga caacaaacgc aacaaaacac
61 aaacacaagc acactcacct tgagtctaac tggtggcacg cagcatatct caccctgaga
121 ctaactggcg gcaggcgacc gtgagcatac agcatgcccc actcgattcg agacgcgatt
181 cgcgctcgta ggtatcgagc ggctacgttg agctattatg gcagtgacat gcgattcgcg
241 cactgccaag atcagctcag caaagttaag accagtatcg gatatggtag actactacaa
301 ttcgcacagg tatgagattc tcagtctagt gtatggatga gtagttgagc caatgaatct
361 agggtttaaa ttactatgca ttgacatata acaggtacaa gcgtagataa tacttactag
421 gccccagccg gtacaccctg tattgaataa atacgactat ttggccaggt ctggacgggg
481 cagtcgaatt actaggttga gcacacacac gtgaatcaca caacataaca gtgtaggaac
541 ataatgtgcc attcgtagtc tgagacgccg ctagcctggt ttaatgcaac agcatagaag
601 aaacacacat ca
//
LOCUS PIGHMG2 1153 bp ss-mRNA MAM 14-JUL-1990
DEFINITION Pig non-histone chromosomal protein (HMG2) mRNA, complete cds.
ACCESSION J02895
KEYWORDS non-histone chromosomal protein.
SOURCE Pig thymus, cDNA to mRNA, clone pCD-MK02.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1153)
AUTHORS Shirakawa,H., Tsuda,K.-i. and Yoshida,M.
TITLE Primary structure of non-histone chromosomal protein HMG2 revealed
by the nucleotide sequence
JOURNAL Biochemistry 29, 4419-4423 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by M.Yoshida, 07-DEC-1989.
FEATURES from to/span description
pept 144 776 non-histone protein HMG2 precursor
matp 147 773 non-histone protein HMG2
mRNA < 1 1153 HMG2 mRNA
BASE COUNT 361 a 224 c 301 g 267 t
ORIGIN 53 bp upstream of HaeIII site.
1 acaaaccagt tcacgccgga gccgcaggta ggcagcgtcg cggtcggacc cggccgctgc
61 gcgggagcct gaggagaagc taccaccagg caagagaccc ctccgggccc cggtggacgc
121 gtcgtcgccg gcgccgcagc accatgggga agggcgaccc caacaagccg cggggcaaga
181 tgtcctcgta cgccttcttc gtgcagacct gccgggagga gcacaagaag aaacaccccg
241 attcctcggt caacttcgcc gagttctcca agaagtgctc cgagcgatgg aagactatgt
301 ctgccaagga aaagtccaag tttgaagata tggcaaaaag tgacaaagct cgctatgacc
361 gggagatgaa aaattacgtc cctcccaagg gtgacaagaa gggcaagaaa aaagatccca
421 atgctcccaa aaggcctcca tctgccttct tcctgttttg ctctgaacat cgcccaaaga
481 tcaaaagtga acaccctggc ttatccattg gggatactgc aaagaaattg ggtgaaatgt
541 ggtctgagca gtcagccaaa gataaacaac cgtatgaaca gaaagcagct aagctaaagg
601 aaaaatatga aaaggatatt gctgcatacc gtgccaaggg taagggtgaa gcaggaaaga
661 agggccctgg caggccaaca ggctctaaga agaagaatga accagaagat gaggaggaag
721 aggaagagga ggaagaagat gaagacgagg aggaagaaga tgaagatgag gaataaatgg
781 ctatcctgta atgttgtgtg tggagtgtgt gtgtgtgctc aggcaattat tttgctaaga
841 atgtgaattc aagtgcagct caatattagc ttcagtataa aaactgtaca gatttttgta
901 tagctaataa gattctttgt agagaaaata ctttttttaa aagtgcaggt tgcagctttt
961 tgaggggcta ctacatacag ttagatttta aagcttctga tgttgaatgt ttctaaatat
1021 ttaatggttt ctttaatttc ttgtgtatgg taacagcaaa ctcgtagaaa taagtatcaa
1081 tagcaaattt tgggttttct agaatgttgc attttgtttt ttaaaaaaat tttgtaataa
1141 aattatgtat att
//
LOCUS PSEHUTH 2249 bp ds-DNA BCT 14-JUL-1990
DEFINITION P.putida histidine ammonia-lyase (hutH) gene, complete cds.
ACCESSION M35140
KEYWORDS histidase; histidine ammonia-lyase.
SOURCE P.putida (ATCC 12633) DNA, clone pMC4.
ORGANISM Pseudomonas putida
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 2249)
AUTHORS Consevage,M.W. and Phillips,A.T.
TITLE Sequence analysis of the hutH gene encoding histidine ammonia-lyase
in Pseudomonas putida
JOURNAL J. Bacteriol. 172, 2224-2229 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 316 1848 histidine ammonia-lyase (hutH) precursor (gtg
start codon (E.C. 4.3.1.3)
matp 319 1845 histidine ammonia-lyase (hutH)
BASE COUNT 392 a 777 c 699 g 381 t
ORIGIN
1 ccatggtggt ggcgtgggca tgggcttctc tcagcactcg ggcatggtca tcgtctgcga
61 cggcaccgat gaagccgccg agcgcatcgc ccgtgtactg accaacgacc cagggactgg
121 cgtcatgcgc cacgccgatg ccggttatga catcgccatc gactgcgcca aggagcaggg
181 cctggacctg ccgatgatca ctggctgatt gccacgcttt ggatcgcacc ccgccagggg
241 tgatactgaa caacaagaag gagcgcgcag gcacccacga acggcctgcg gctccacgcg
301 atttggagta gtaacgtgac cgaactcacc ctcaagcccg gcaccctgac cctggcccag
361 ctgcgcgcga tccatgccgc gcccgtgcgc ctgcaactgg atgccagcgc cgcgccggcc
421 atcgatgcca gcgtcgcctg tgtcgagcag atcattgccg aagaccgcac tgcctacggc
481 atcaacaccg gtttcggcct gctggcctcg acccgcatcg ccagccacga cctggaaaac
541 ctgcagcgct cgctggtgct gtcccacgcc gctggcattg gcgcgccgct ggatgacgat
601 ctggtgcggt tgatcatggt gctgaaaatc aacagcctca gccgtggctt ctccggcatt
661 cgccgcaaag tcatcgatgc gctgatcgcc ctggtcaacg ccgaagtcta cccgcacatc
721 ccgctgaaag gctcggtggg tgcttccggc gacctggcgc cgctggcgac catgtcgctg
781 gtgctgctgg gcgaaggcaa agcccgctac aagggccagt ggctgtcggc caccgaagcc
841 ctggcggttg ccggcctcga accgctgacc ctggctgcca aagagggcct ggccctgctc
901 aacggcaccc aggcgtccac cgcctatgcc ctgcgtggcc tgttctatgc cgaagacttg
961 tacgctgcgg ctatcgcctg cggcggcctg agcgtcgaag ccgtactggg ctcgcgttcg
1021 ccgttcgatg cgcgtattca cgaagcgcgt ggccagcgcg gccagatcga cactgccgcg
1081 tgcttccgcg acctgctggg cgattccagc gaagtgtcgc tgtcgcacaa gaactgcgac
1141 aaggtccagg acccgtactc gctgcgctgc cagccgcagg tcatgggcgc gtgcctgacc
1201 cagctgcgcc aggctgccga ggtgctgggc atcgaagcca acgccgtgtc ggacaacccg
1261 ctggtgttcg ctgccgaggg tgacgtgatt tccggtggca acttccacgc agaaccggtg
1321 gccatggccg ccgacaacct ggccctggcc atcgccgaaa tcggttcgct cagcgagcgc
1381 cgcatctcgc tgatgatgga caagcacatg tcccagctgc cgccgttcct ggtggaaaac
1441 ggtggggtca actccggctt catgatcgcc caggtcaccg ctgccgctct ggccagcgag
1501 aacaaggccc tgtcgcaccc gcacagcgtc gacagcctgc cgacctcggc caaccaggaa
1561 gaccacgtat cgatggcccc ggctgccggc aagcgcctgt gggaaatggc cgaaaacacc
1621 cgtggcgtgc ctgccatcga atggctgggc gcatgccagg gcctggacct gcgcaaaggc
1681 ctgaagactt cggccaagct ggagaaagca cgccaggcgc tgcgcagcga agtggcgcac
1741 tacgaccgtg accgtttctt cgcgccggac atcgaaaagg ctgtggaact gttggccaag
1801 ggtagtttga ccggtttgct gccggcaggt gtgctgccaa gcctgtaatg ccccctgggg
1861 cgctgcgcct tcgcagcaca aggctgctcc tgcaaaagcg cacgcgccct tgtacggaca
1921 gcctgtgctg cgatgggctg catgccagcc ccctgacctc acgaccgcag gacgattcaa
1981 ggacgccgac acgcaacaag ctcaaggtct caagtgcggg ctaaatgccc gccacatccg
2041 tttcatggcc ctcggttctg ccatcggaac cgggctgttc tacggctccg cctcagccat
2101 ccagatggcc ggcccggccg tgctgctggc ttacctgatc ggcggtgccg ctgtgttcat
2161 ggtcatgcgc gccctcggcg aaatggccgt gcacaacccg gtggccggct ccttcggcca
2221 ctacgccagt acctacctcg gccccatgg
//
LOCUS RATGABAAD 1799 bp ss-mRNA ROD 14-JUL-1990
DEFINITION Rat GABA-A receptor delta subunit mRNA, complete cds.
ACCESSION M35162
KEYWORDS GABA-A receptor delta subunit; gamma-aminobutyric acid receptor.
SOURCE Rat adult brain, cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1799)
AUTHORS Zhao,Z.-Y. and Joho,R.H.
TITLE Isolation of distantly related members in a multigene family using
the polymerase chain reaction technique
JOURNAL Biochem. Biophys. Res. Commun. 167, 174-182 (1990)
STANDARD simple staff_review
REFERENCE 2 (sites; corrections to figure 3 for [1])
AUTHORS Zhao,Z.-Y. and Joho,R.H.
TITLE Isolation of distantly related members in a multigene family using
the polymerase chain reaction technique
JOURNAL Biochem. Biophys. Res. Commun. 168, 887-887 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 50 1399 GABA-A receptor delta subunit
BASE COUNT 385 a 512 c 500 g 402 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccctg gggctagccg gacctgtccc gcgcacagcc cgcaaggcca tggacgttct
61 gggctggctg ctgctgccgc tccttctgct gtgcacgcag ccgcaccatg gcgccagagc
121 aatgaatgac attggggact acgtgggctc caacctggag atatcctggc tccccaacct
181 ggatggacta atggagggct acgcccgaaa cttccgacca ggcattggag gtcctccagt
241 gaatgtggcg cttgccctag aggtggccag cattgaccac atctcagaag caaatatgga
301 atacaccatg acagtgttcc tgcaccagag ctggcgagac agcaggctgt cctacaacca
361 taccaacgag accctgggcc tggatagccg cttcgtggac aagctgtggc tccctgacac
421 cttcattgtg aatgccaagt ctgcctggtt ccatgatgtg accgtggaaa acaagcttat
481 ccgcctacag cccgacggtg tgattttata cagcatccgc atcacctcca cagtggcctg
541 tgacatggac cttgccaagt accccatgga cgagcaggag tgcatgctgg acctggagag
601 ctatggctac tcttctgagg acattgtcta ttattggtca gaaaaccagg agcagatcca
661 cgggctggac aggctgcaac tggcccagtt cactatcacc agttaccgct tcaccacgga
721 gctgatgaac ttcaaatcag ctggccagtt ccctcgactc agcttacact tccagcttcg
781 gaggaaccgg ggtgtctaca tcatccagtc ttacatgccc tctgtcctcc tggttgccat
841 gtcctgggtc tccttctgga ttagccaagc agcagtgcct gccagagtat ctctaggcat
901 caccactgtg ctgacaatga ccacactcat ggttagtgcc cgctcctccc tcccgcgggc
961 ttctgctatc aaggctctgg atgtgtattt ctggatctgc tatgtcttcg tgtttgctgc
1021 cctggtggag tatgcatttg cccacttcaa tgctgactac aggaagaaac ggaaagccaa
1081 ggtcaaggtc acgaagccaa gggcagagat ggacgtgagg aacgccattg tcctcttctc
1141 cctctctgct gctggggtca gccaggagtt ggctatctcc cgccgtcaag gccgggtccc
1201 tgggaacctc atgggttcct ataggtctgt agaagtggag gcaaagaagg agggggggtc
1261 ccgcccagga ggcccaggag gcatccgttc cagactcaaa cccatcgatg cagacaccat
1321 cgacatctat gcccgcgctg tgttcccggc agcctttgca gcagtcaaca tcatctactg
1381 ggcggcgtat accatgtgac ggcagtgcgt acaccacatg acggctttac atgtaccacg
1441 tgcggtggtt gccacaaact cctggaggaa cagccctctc cgtggatccc aaagtgggac
1501 agagaggagc gtccgctccc tgggctttac ctcaattttg ctctaccagc ctgcacttcc
1561 tgtagatgga gcaggatggg ctttgcacag agggccatta gttggcctgt ccctgaagct
1621 gagcctgcac taaggtgccc ttgggagctg tgaagtttta gactcaggat gggcctgatt
1681 tctgggcctt tgctctgtgg gatcaggatc agagagaaag agagtagggg tggatgacca
1741 ctctcatggt gaactggagg agaagggcat tatgaggcct ctttggtccc agggaattc
//
LOCUS RPVCTSI 2195 bp ds-DNA VRL 14-JUL-1990
DEFINITION Raccoonpox virus Sal1 end fragment DNA.
ACCESSION M23018
KEYWORDS .
SOURCE Raccoonpox virus (strain V71-I-85A) DNA, passed in human tk- 143
cells.
ORGANISM Raccoonpox virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 2195)
AUTHORS Parsons,B.L. and Pickup,D.J.
TITLE Tandemly repeated sequences are present at the ends of the DNA of
Raccoonpox virus
JOURNAL Virology 161, 45-53 (1987)
STANDARD simple staff_review
BASE COUNT 880 a 225 c 373 g 717 t
ORIGIN
1 ataattaatt taattttaaa tattttattt aatgtctaga aaaaaatgtg taacccagac
61 tgtagagaac tctctaatgt ggtgaaaaaa tcaatccctt aatcgagact atcagaaaga
121 gacgaaacat tttttagtga gacattagaa agagaaagag acgaaacatt ttttagtgag
181 acattagaaa gagaaagaga cgaaacattt ttagtgagac attagaaaga gaaatatttt
241 tttgtaaaac ttttttatga ctccattaga aagagatgaa acatttttag tgagacatta
301 gaaagagaaa tatttttttg taaaactttt ttatgactcc attagaaaga gacgaaacat
361 ttttagtgag acattagaaa gagaaatatt tttttgtaaa acttttttat gactccatta
421 gaaagagacg aaacattttt agtgagacat tagaaagaga aatatttttt tgtaaaacgt
481 tttcttaggt caaaagtcaa tgttttaaaa tatggactag aatatgtctg tgtaaaaact
541 tgtgaaattt tgtaccaaag aaaaacgtga gcagtatccc ctacatggat tttactagat
601 catttatgta acaaaaaatt atattattcg tgtacgttaa ttataaaaat attgtatgat
661 atatcgaggt ccattagcaa aaggtgttga tagacaatta atcgtgtgtg atgtaaccat
721 gattattaca gcagaaagag taagagtatt ttttgtaaaa cttttttatg actccattag
781 aaagagaaag agacgaaaca tttttagtga gacattagaa agagacgaaa catttttttt
841 gtaaaacttt tttatgactc cattagaaag agaaagagac gaaacatttt tagtgagaca
901 ttagaaagag aaatattttt tgtaaaactt tttttatgac tccattagaa agagaaatat
961 tttttgtaaa acttttttat gactccatta gaaagagacg aaacattttt attgagacat
1021 tagaaagaga aagagacgaa acattttttt gtaaaacttt tttatgactc cattagaaag
1081 agaaagagac gaaacatttt tagtgataca ttagaaagag aaatattttt tgtaaaactt
1141 ttttatgact ccattagaaa gagatgaaac atttttagtg agacattaga aagagaaaga
1201 gacgaaacat tttttttgta aaactttttt tatgactcca ttagaaagag aaagagacga
1261 aacattttta gtgagacatt agaaagataa agagacgaaa catttttttt gtaaaacttt
1321 ttttatgact ccattagaaa gagaaagaga cgaaacattt ttagtgagac attagaaaga
1381 gacgaaacat ttttagtgag acattagaaa gataaagaga cgaaacattt tttttgtaaa
1441 acttttttat gactccatta gaaagagaaa gagacgaaac atttttagtg agacattaga
1501 aagagacgaa acattttttt tgtaaaactt tttttatgac tccattagaa agagacgaaa
1561 catttttagt gagacattag aaagataaag agacgaaaca ttttttttgt aaaacttttt
1621 tatgactcca ttagaaagag aaagagacga aacattttta gtgagacatt agaaagagaa
1681 atattttttg taaaactttt ttatgactcc attagaaaga gatgaaacat ttttagtgag
1741 acattagaaa gagaaatatt tttttgtaaa acttttttat gactccatta gaaagagatg
1801 aaacattttt agtgagacat tagaaagaga aagagacgaa acattttttt ttgtaaaact
1861 tttttatgac tccattagaa agagatgaaa catttttagt gagacattag aaagagaaag
1921 agacgaaaca tttttttttt gtaaaacttt ttatgactcc attagaaaga gatgaaacat
1981 ttttagtgag acattagaaa gagaaagagt gacacgataa aacttgttac ttattgttag
2041 agaaaaagaa tcgactttaa tgcgaataca tttttaaatt tcaaaccgga aataagtgtc
2101 cggtacactt ttaaaattaa acgtatgtcg gaatccacaa ctaatcctga ggattacacg
2161 attcagacac acgatttaag ttttgtgtcg tcgac
//
LOCUS STYFLICDA 612 bp ds-DNA BCT 14-JUL-1990
DEFINITION S.muenchen phase 1 flagellin (antigen-determining region) (fliCd)
gene, partial cds.
ACCESSION M34824
KEYWORDS phase 1 flagellin antigen d.
SOURCE S.muenchen (strain RKS3121) DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 612)
AUTHORS Smith,N.H., Beltran,P. and Selander,R.K.
TITLE Recombination of Salmonella phase 1 flagellin genes generates new
serovars
JOURNAL J. Bacteriol. 172, 2209-2216 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 612 phase 1 flagellin (fliCd) (AA at 1)
BASE COUNT 207 a 114 c 137 g 154 t
ORIGIN
1 attgatttaa aagaaattag ctctaaaaca ctgggacttg ataagcttaa tgtccaggat
61 gcctacaccc cgaaagaaac tgctgtaacc gttgataaaa ctacctataa aaatggtaca
121 gatactatta cagcccagag caatactgat atccaaactg caattggcgg tggtgcaacg
181 ggggttactg gggctgatat caaatttaaa gatggtcaat actatttaga tgttaaaggc
241 ggtgcttctg ctggtgttta taaagccact tatgatgaaa ctacaaagaa agttaatatt
301 gatacgactg ataaaactcc gttagcaact gcggaagcta cagctattcg gggaacggcc
361 actataaccc acaaccaaat tgctgaagta acaaaagagg gtgttgatac gaccacagtt
421 gcggctcaac ttgctgctgc aggggttact ggtgccgata aggacaatac tagccttgta
481 aaactatcgt ttgaggataa aaacggtaag gttattgatg gtggctatgc agtgaaaatg
541 ggcgacgatt tctatgccgc tacatatgat gagaaacagg tacaattact gctaaacaac
601 cactatacag at
//
LOCUS STYFLICI 585 bp ds-DNA BCT 14-JUL-1990
DEFINITION S.typhimurium phase 1 flagellin (antigen-determining region)
(fliCi) gene, partial cds.
ACCESSION M34822
KEYWORDS phase 1 flagellin antigen i.
SOURCE S.typhimurium (strain RKS3333) DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 585)
AUTHORS Smith,N.H., Beltran,P. and Selander,R.K.
TITLE Recombination of Salmonella phase 1 flagellin genes generates new
serovars
JOURNAL J. Bacteriol. 172, 2209-2216 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 585 phase 1 flagellin (fliCi) (AA at 1)
BASE COUNT 180 a 108 c 142 g 155 t
ORIGIN
1 atcgatctga agcagatcaa ctctcagacc ctgggtctgg atacgctgaa tctgcaacaa
61 aaatataagg tcagcgatac ggctgcaact gttacaggat atgccgatac tacgattgct
121 ttagacaata gtacttttaa agcctcggct actggtcttg gtggtactga ccagaaaatt
181 gatggcgatt taaaatttga tgatacgact ggaaaatatt acgccaaagt taccgttacg
241 gggggaactg gtaaagatgg ctattatgaa gtttccgttg ataagacgaa cggtgaggtg
301 actcttgctg gcggtgcgac ttccccgctt acaggtggac tacctgcgac agcaactgag
361 gatgtgaaaa atgtacaagt tgcaaatgct gatttgacag aggctaaagc cgcattgaca
421 gcagcaggtg ttaccggcac agcatctgtt gttaagatgt cttatactga taataacggt
481 aaaactattg atggtggttt agcagttaag gtaggcgatg attactattc tgcaactcaa
541 aataaagatg gttccataag tattaatact acgaaataca ctgca
//
LOCUS STYFLICR 579 bp ds-DNA BCT 14-JUL-1990
DEFINITION S.heidelberg phase 1 flagellin (antigen-determining region) (fliCr)
gene, partial cds.
ACCESSION M34823
KEYWORDS phase 1 flagellin antigen r.
SOURCE S.heidelberg (strains RKS539) DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 579)
AUTHORS Smith,N.H., Beltran,P. and Selander,R.K.
TITLE Recombination of Salmonella phase 1 flagellin genes generates new
serovars
JOURNAL J. Bacteriol. 172, 2209-2216 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 579 phase 1 flagellin (fliCr) (AA at 1)
BASE COUNT 181 a 118 c 126 g 154 t
ORIGIN
1 atcgatctga agcagatcaa ctctcagacc ctgggtctgg atacgctgaa tgttcaacaa
61 aaatataagg tcagcgatac ggctgcaact gtcactggct atacagattc tgctactgct
121 attgacaaat ctacgtttgc tgcatcagca actaccttag gtggtactcc tgctattact
181 ggtgatctga agtttgatga tactactgga aaatattacg ctgatgtttc aggtactacg
241 gctaaagatg gtgtttatga agtaacagtt gcagccgatg gaaaagtcac tttaactggc
301 acaccaacag gaccaattac tgctggcttc ccttcaactg caacaaaaga tgttaaacaa
361 actcagcaag aaaacgctga tttgacagag gccaaagccg cattgacagc agcgggtgtt
421 gcagcggccg gcacagcatc tgttgttaag atgtcttata ctgataataa cggtaaaact
481 attgatggtg gtttagcagt taaggtaggc gatgattact attctgcaac tcaaaataaa
541 gatggttcca taagtattaa tactacgaaa tacactgca
//
LOCUS HUMFGF1A 3343 bp ss-mRNA PRI 14-JUL-1990
DEFINITION Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds.
ACCESSION M34641
KEYWORDS FGF receptor-1; fibroblast growth factor receptor;
transmembrane glycoprotein; tyrosine kinase.
SOURCE Human teratocarcinoma cell line Tera-2, clone 13, cDNA to mRNA,
clone 1:1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3343)
AUTHORS Wennstroem,S., Sandstroem,C. and Claesson-Welsh,L.
TITLE cDNA cloning and expression of a human FGF receptor which binds
acidic and basic FGF
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by L.Claesson-Welsh, 25-MAY-1990.
Author address: L.Claesson-Welsh
Ludwig Institute for Cancer Research
Biomedical Center
Box 595
S-751 24 Uppsala
SWEDEN
FEATURES from to/span description
pept 10 2472 FGF receptor-1 precursor
sigp 10 72 FGF receptor-1 signal peptide
matp 73 2469 FGF receptor-1
BASE COUNT 766 a 957 c 911 g 709 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca
61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct
121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg
181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac
241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc
301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat
361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag
421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa
481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct
541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct
601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct
661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc
721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca
781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg
841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag
901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc
961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat
1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg
1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc
1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg
1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag
1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct
1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg
1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc
1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg
1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac
1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg
1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc
1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca
1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac
1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc
1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata
1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac
1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac
2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc
2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg
2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca
2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc
2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc
2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag
2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc
2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc
2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg
2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct
2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc
2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg
2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc
2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc
2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc
2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac
3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc
3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat
3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc
3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga
3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt
3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc
//
LOCUS BSUSPCR 2852 bp ds-DNA BCT 14-JUL-1990
DEFINITION B.subtilis spectinomycin resistance (spc) genes, complete cds.
ACCESSION M31102
KEYWORDS spectinomycin resistance.
SOURCE B.subtilis (strain 1A241) DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2852)
AUTHORS Yoshikawa,H. and Doi,R.H.
TITLE The nucleotide sequence of the Bacillus subtilis spectinomycin
resistance gene region
JOURNAL Nucleic Acids Res. 18, 1647-1647 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.H.Doi, 04-JAN-1990.
FEATURES from to/span description
pept 52 549 spc ORF1
pept 563 742 spc ORF2
pept 773 1213 spc ORF3
pept 1215 2450 spc ORF4 (ttg start codon)
pept 2505 > 2852 spc ORF5
BASE COUNT 830 a 535 c 673 g 814 t
ORIGIN
1 tgatgcgctc gtgaagctgg acttaaattt taataaaaga aggagggaca catgattatg
61 cgtcgtattg acccaagcaa attagagtta gaagaacgct tagttacggt taaccgcgta
121 gcgaaaggtg gtcgtcgttt ccgcttcgca gctctagtcg ttgtcggtga caaaaacgga
181 cacgtaggat tcggtactgg taaagcacaa gaagtaccag aagcgattcg caaagctgtt
241 gaagatgcga aaaagaattt gattgaagta ccaatggttg gaactacaat tccacacgaa
301 atcatcggac gtttcggtgc aggtaacatc ttgttaaaac ctgcttctga aggtactgga
361 gttatcgctg gaggccctgt acgtgcggta cttgagctag ctggtgtagc tgatatcctt
421 tctaagtctt taggttctaa cacaccgatc aacatgattc gtgcaacact tcaaggttta
481 agtgaactta aacgtgctga agacgttgcg aagcttcgtg gaaaatctgt agaagaactg
541 ttaggataag gagggaacat taatggctaa attagaaatt accctcaaac gaagtgtaat
601 cggtcgcccg gaagaccaac gcgttactgt aagaacactt ggtttaaaga aaacaaacca
661 aactgttgtt catgaagaca acgctgcgat ccgtggtatg attaataaag tatctcattt
721 agtttctgtt aaagaacaat aaaaatattt gatcggataa ggaggtgtcc taatgaaact
781 tcatgaatta aaaccttcag aaggttcacg caaaacgcgt aatcgcgtag gtcgtggtat
841 tggttctggc aacggtaaaa cagctggtaa aggtcacaaa ggtcaaaacg ctcgttctgg
901 cggcggtgta cgccctggat tcgagggggg acaaatgcct ttattccaac gtcttcctaa
961 acgtggtttc actaacatca accgcaagga atacgctgta gtcaacctag acaaattgaa
1021 cggttttgca gaaggaacgg aagtcactcc tgaacttctt ctagagactg gtgttattag
1081 caaacttaat gcaggagtaa agattcttgg caacggtaaa ttagagaaaa aattaactgt
1141 aaaagccaat aaattctctg cttctgcgaa agaagctgtt gaagctgctg gcggtacagc
1201 tgaggtgatc taacttgttt aaaacaatct ccaactttat gcgtgtgagt gatatcagga
1261 ataaaatcat attcacttta ctcatgctta tcgtctttcg cataggtgcg tttattcctg
1321 tgccttacgt taacgctgaa gcgttacagg cacagtctca aatgggtgtt tttgatctcc
1381 ttaatacatt tggcggcggt gcgctttacc aattttccat tttcgcaatg ggaattactc
1441 cttatatcac ggcttcgatc atcattcagc tgcttcagat ggatgtggta ccgaagttta
1501 ccgagtggtc taagcaaggt gaagttggcc gccgtaaatt agctcagttc acaaggtact
1561 ttacgattgt gcttggtttc atccaagcgt taggtatgtc atatggattc aacaatctgg
1621 caaacggtat gctgatcgaa aaatccggtg tatcgacata tcttatcatt gctttagtgc
1681 tcactggcgg aactgccttt ttaatgtggc ttggggaaca aattacttct catggagtag
1741 gcaacggaat atcgatcatt atcttcgcgg ggattgtgtc tagtattcca aaaacaattg
1801 ggcaaatata tgagactcaa tttgtcggca gcaacgatca gttgtttatt catattgtga
1861 aagtcgcact tcttgtgatt gcgattttag cagttattgt tggagttatt ttcattcagc
1921 gaggtcagtc tacacacctt ccattgaaag tgaatcctgc aggggttatt ccggtaatct
1981 ttgcggttgc gtttttgata acgccgcgga cgatcgcgtc attctttgga acaaacgatg
2041 tgacaaagtg gattcaaaac aactttgata atacgcatcc ggtgggtatg gcgatatatg
2101 ttgcgttgat tattgccttt acgtactttt atgcttttgt acaggtaaac cctgaacaaa
2161 tggctgataa ccttaaaaaa cagggtggct atatcccggg ggttcgtcca gggaaaatga
2221 ctcaagatag aattacgagc attttgtatc gacttacgtt tgtgggttct atattcttag
2281 ccgtgatttc cattcttcct atctttttca ttcaattcgc tggattgcct caaagtgcac
2341 aaattggcgg aacatctttg ttaattgttg tcggggtagc cttggagaca atgaaacaac
2401 tagaaagcca gttggtgaaa cgaaactacc gtggatttat gaaaaactag aggaaatgga
2461 tttatccatt ccctcttaat aaagagagga cgggggaaat ccgaatgaac ttagtcttaa
2521 tggggcttcc tggtgccggt aaaggcacac agggcgaacg aattgttgag gattatggga
2581 ttcctcatat ctcaacagga gatatgttcc gtgctgctat gaaagaagaa acaccactcg
2641 gactcgaagc aaaatcttat attgataagg gagagctcgt acctgacgaa gtcacaatcg
2701 gaattgtaaa agagagactt ggcaaagatg attgtgaaag aggttttctt ctggacggat
2761 ttccgcgaac agtcgctcaa gccgaagctc ttgaagaaat tcttgaggaa tacggcaagc
2821 cgattgatta tgtcattaac attgaagtcg ac
//
LOCUS RATHDNFNT 1020 bp ss-mRNA ROD 14-JUL-1990
DEFINITION Rat neurotrophin-3 (HDNF/NT-3) mRNA, complete cds.
ACCESSION M34643
KEYWORDS neurotrophic factor; neurotrophin-3.
SOURCE Rat adult (strain Sprague-Dawley) hippocampus, cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1020)
AUTHORS Ernfors,P., Ibanez,C.F., Ebendal,T., Olson,L. and Persson,H.
TITLE Molecular cloning and neurotrophic activities of a protein with
structural similarities to b-nerve growth factor: Developmental and
topographical expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.P.Ernfors, 18-MAY-1990.
FEATURES from to/span description
pept 73 849 neurotrophin-3 (HDNF/NT-3) precursor
sigp 73 120 neurotrophic factor/neurotrophin-3 (HDNF/NT-3)
signal peptide
matp 490 846 neurotrophic factor/neurotrophin-3
BASE COUNT 320 a 229 c 226 g 245 t
ORIGIN
1 gtcgacgtcc ctggaaatag tcatacggat gccatggtta cttctgccac gatcttacag
61 gtgaacaagg tgatgtccat cttgttttat gtgatatttc ttgcttatct ccgtggcatc
121 caaggcaaca acatggatca aaggagtttg ccagaagact ctctcaattc cctcattatc
181 aagttgatcc aggcggatat cttgaaaaac aagctctcca agcagatggt agatgttaag
241 gaaaattacc agagcaccct gcccaaagca gaggcaccca gagaaccaga gcagggagag
301 gccaccaggt cagaattcca gccgatgatt gcaacagaca cagaactact acggcaacag
361 agacgctaca attcaccccg ggtcctgctg agtgacagca cccctttgga gccccctccc
421 ttatatctaa tggaagatta tgtgggcaac ccggtggtaa ccaatagaac atcaccacgg
481 aggaaacgct atgcagagca taagagtcac cgaggagagt actcagtgtg tgacagtgag
541 agcctgtggg tgaccgacaa gtcctcagcc attgacattc ggggacacca ggttacagtg
601 ttgggagaga tcaaaaccgg caactctcct gtgaaacaat atttttatga aacgaggtgt
661 aaagaagcca ggccagtcaa aaacggttgc agggggattg atgacaaaca ctggaactct
721 cagtgcaaaa cgtcgcaaac ctacgtccga gcactgactt cagaaaacaa caaactcgta
781 ggctggcgct ggatacgaat agacacttcc tgtgtgtgtg ccttgtcaag aaaaatcgga
841 agaacatgaa ttggcatctg tccccacata taaattatta ctttaaatta tatgatatgc
901 atgtagcata taaatgttta tattgttttt atatattata agttgacctt tatttattaa
961 acttcagcaa cccttacagt atataagctt ttttcataat cgggctgctc aaaaaaaaaa
//
LOCUS ABCRRAA 118 bp ss-rRNA RNA 14-JUL-1990
DEFINITION Acetobacter sp. (strain MB 58) 5S ribosomal RNA, complete sequence.
ACCESSION M34766
KEYWORDS 5S ribosomal RNA.
SOURCE Acetobacter sp. (strain MB 58) rRNA.
ORGANISM Acetobacter sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Azotobacteraceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 118 5S ribosomal RNA
BASE COUNT 27 a 40 c 32 g 17 t 2 others
ORIGIN
1 gatctggtgg ccatggcggg agcaaatcag ccgatcccat cccgaactcg gccgtcaaat
61 gccccagcgc ccatgatact ctgcctcaag gcacggaaaa gtcggtcgcc gccagayy
//
LOCUS ABCRRAB 118 bp ss-rRNA RNA 14-JUL-1990
DEFINITION Acetobacter sp. (strain 914) 5S ribosomal RNA, complete sequence.
ACCESSION M34767
KEYWORDS 5S ribosomal RNA.
SOURCE Acetobacter sp. (strain 914) rRNA.
ORGANISM Acetobacter sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Azotobacteraceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 118 5S ribosomal RNA
BASE COUNT 27 a 42 c 29 g 18 t 2 others
ORIGIN
1 nntctgatgg ccatggcggg agcaaatcac ccgatcccat cccgaactcg gccgtcaaat
61 gccccagcgc ccatgatact ctgcctcaag gcacggaaaa gtcggtcgcc gccagatc
//
LOCUS ANCRRAA 117 bp ss-rRNA RNA 14-JUL-1990
DEFINITION A.aquaticus 5S ribosomal RNA, complete sequence.
ACCESSION M34775
KEYWORDS 5S ribosomal RNA.
SOURCE A.aquaticus (strain Ma) rRNA.
ORGANISM Ancylobacter aquaticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Nonprosthecate bacteria.
REFERENCE 1 (bases 1 to 117)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 117 5S ribosomal RNA
BASE COUNT 23 a 40 c 32 g 22 t
ORIGIN
1 cctggtggcc attgcgagga gccccaaccc gatcccatcc cgaactcggc cgttaaactc
61 ctcagcgccg atggtactat gtctcaagac ctgggagagt aggtcgctgc caggcct
//
LOCUS BLTRRAA 118 bp ss-rRNA RNA 14-JUL-1990
DEFINITION B.viscosus 5S ribosomal RNA, complete sequence.
ACCESSION M34774
KEYWORDS 5S ribosomal RNA.
SOURCE B.viscosus (strain D7) rRNA.
ORGANISM Blastobacter viscosus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Nonprosthecate bacteria.
REFERENCE 1 (bases 1 to 118)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 118 5S ribosomal RNA
BASE COUNT 23 a 38 c 35 g 22 t
ORIGIN
1 gcctggtggc tatggcgagg agtccgaacc cgatcccatc ccgaactcgg ccgttaaact
61 cctcagcgcc aatggtactt cgtctcaaga cgcgggagag taggtcgctg ccaggcct
//
LOCUS HYVRRAA 119 bp ss-rRNA RNA 14-JUL-1990
DEFINITION H.vulgare 5S ribosomal RNA, complete sequence.
ACCESSION M34768
KEYWORDS 5S ribosomal RNA.
SOURCE H.vulgare (strain NP-160) rRNA.
ORGANISM Hyphomicrobium vulgare
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Prosthecate bacteria.
REFERENCE 1 (bases 1 to 119)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 119 5S ribosomal RNA
BASE COUNT 22 a 34 c 37 g 26 t
ORIGIN
1 gacctggtga ttatggcggg tggtctgcac ccgatctcat tccgaactcg gccgtgaaac
61 cgcccagcgc caatggtact tcgtcttaag acgcgggaga gtaggtcgtc gccaggtct
//
LOCUS HYVRRAB 119 bp ss-rRNA RNA 14-JUL-1990
DEFINITION Hyphomicrobium sp. 5S ribosomal RNA, complete sequence.
ACCESSION M34769
KEYWORDS 5S ribosomal RNA.
SOURCE Hyphomicrobium sp. (strain G10) rRNA.
ORGANISM Hyphomicrobium sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Prosthecate bacteria.
REFERENCE 1 (bases 1 to 119)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 119 5S ribosomal RNA
BASE COUNT 20 a 34 c 39 g 26 t
ORIGIN
1 gacctggtga ttatggcggg gtggctgcac ccgatcccat tccgaactcg gccgtgaaac
61 gcccctgcgc cgatggtact tcgtcttaag acgcgggaga gtaggtcgtt gccaggtct
//
LOCUS MAGRRAA 119 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.agile 5S ribosomal RNA, complete sequence.
ACCESSION M34759
KEYWORDS 5S ribosomal RNA.
SOURCE M.agile (strain A20) rRNA.
ORGANISM Methylomonas agile
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Methylococcaceae.
REFERENCE 1 (bases 1 to 119)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 119 5S ribosomal RNA
BASE COUNT 29 a 29 c 36 g 22 t 3 others
ORIGIN
1 gcttggtggc catagcgagc gtgaaccacc cgatcccatc ccgaactcgg aagtgaaacc
61 gcttagcgcc gatgatagtg cgacaggttg tcgtgcgaaa gtagggaatt gtmaagcnn
//
LOCUS MAGRRAB 119 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.rubra 5S ribosomal RNA, complete sequence.
ACCESSION M34760
KEYWORDS 5S ribosomal RNA.
SOURCE M.rubra (strain 15S) rRNA.
ORGANISM Methylomonas rubra
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Methylococcaceae.
REFERENCE 1 (bases 1 to 119)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 119 5S ribosomal RNA
BASE COUNT 30 a 31 c 33 g 23 t 2 others
ORIGIN
1 gcttggtgac catagcaagc gtgaaccacc cgatcccatc ccgaactcgg aagtgaaacc
61 gcttagcgcc gatgatagtg tggcaggttg ccatgtgaaa gtaggtcatc gcmaagctn
//
LOCUS MAGRRAC 117 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.methanica 5S ribosomal RNA, complete sequence.
ACCESSION M34761
KEYWORDS 5S ribosomal RNA.
SOURCE M.methanica (strain 12) rRNA.
ORGANISM Methylomonas methanica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Methylococcaceae.
REFERENCE 1 (bases 1 to 117)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 117 5S ribosomal RNA
BASE COUNT 27 a 30 c 32 g 24 t 4 others
ORIGIN
1 ncttggtgac catagcgagc gtgraccacc cgatcccatc ccgaactcgg aagtgaaacc
61 gctttgcgcc gatgatagtg tggcagtttg ccatgtgaaa gtaggtcatc gcmaagn
//
LOCUS MLCRRAA 118 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.capsulatus (strain ATCC 19069) 5S ribosomal RNA, complete
sequence.
ACCESSION M34758
KEYWORDS 5S ribosomal RNA.
SOURCE M.capsulatus (strain ATCC 19069) rRNA.
ORGANISM Methylococcus capsulatus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Methylococcaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 118 5S ribosomal RNA
BASE COUNT 30 a 36 c 36 g 16 t
ORIGIN
1 gcctggcggc cagagcgagc gggaaccacc cgatcccatc ccgacctcgg aagtgaaacc
61 gcttagcgcc gatgatagtg caagaatacc tgtgcgaaag tagggaactg ccaggctc
//
LOCUS MLCRRAB 117 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.capsulatus (strain Y) 5S ribosomal RNA, complete sequence.
ACCESSION M34762
KEYWORDS 5S ribosomal RNA.
SOURCE M.capsulatus (strain Y) rRNA.
ORGANISM Methylococcus capsulatus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Methylococcaceae.
REFERENCE 1 (bases 1 to 117)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 117 5S ribosomal RNA
BASE COUNT 31 a 28 c 31 g 23 t 4 others
ORIGIN
1 nyttggtgac catagcgagc gtgacccacc cgatcccatc ccgaactcgg aagtgaaacc
61 gcttagtgcc gatgatagtg tggcaatttg ccatgcgaaa gtagggaatt gcmaaan
//
LOCUS MTBRRAA 115 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.organophilum 5S ribosomal RNA, complete sequence.
ACCESSION M34770
KEYWORDS 5S ribosomal RNA.
SOURCE M.organophilum (strain NP-220) rRNA.
ORGANISM Methylobacterium organophilum
Prokaryota; Bacteria; Methylmonadaceae.
REFERENCE 1 (bases 1 to 115)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 115 5S ribosomal RNA
BASE COUNT 23 a 41 c 34 g 17 t
ORIGIN
1 ccggcggcct gagcggtgtg ccagaacccg atcccatctc gaactcggcc gttaaacaca
61 ccagcgccca tggtactgtg ctccaagaca cgggagagtc ggtcgccgcc aggct
//
LOCUS MTBRRAB 118 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.extorquens 5S ribosomal RNA, complete sequence.
ACCESSION M34771
KEYWORDS 5S ribosomal RNA.
SOURCE M.extorquens (strain AM1) rRNA.
ORGANISM Methylobacterium extorquens
Prokaryota; Bacteria; Methylmonadaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 118 5S ribosomal RNA
BASE COUNT 24 a 39 c 37 g 18 t
ORIGIN
1 gcctggcggc ctgagcggtg tggaagaacc cgatcccatc tcgaactcgg ccgttaaaca
61 caccagcgcc gatggtactg tgtctcaaga cacgggagag tcggtcgccg ccaggccc
//
LOCUS MYLRRAA 119 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.trichosporium 5S ribosomal RNA, complete sequence.
ACCESSION M34763
KEYWORDS 5S ribosomal RNA.
SOURCE M.trichosporium (strain 4E) rRNA.
ORGANISM Methylosinus trichosporium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Methylococcaceae.
REFERENCE 1 (bases 1 to 119)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 119 5S ribosomal RNA
BASE COUNT 28 a 29 c 33 g 24 t 5 others
ORIGIN
1 nncttggtga ctatagtgag cgtgaaccac ccgatcccat cccgaactcg gaagtgaaac
61 cgcttagcgc cgatgatagt gtggccgttt gccatgcgaa agtagggcat tgcmaagyn
//
LOCUS MYMRRAA 115 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.methylotrophus 5S ribosomal RNA, complete sequence.
ACCESSION M34765
KEYWORDS 5S ribosomal RNA.
SOURCE M.methylotrophus (strain NCIB 10515) rRNA.
ORGANISM Methylophilus methylotrophus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Methylococcaceae.
REFERENCE 1 (bases 1 to 115)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 115 5S ribosomal RNA
BASE COUNT 29 a 33 c 28 g 25 t
ORIGIN
1 cttggggaca atagcggttt ggaaccaccc cttcccatct cgaacagggc cgtgaaacga
61 acttgcgcca atgatagtgt actcttcgta tgcgaaagta ggtcatcccc aagcc
//
LOCUS MYPRRAA 119 bp ss-rRNA RNA 14-JUL-1990
DEFINITION M.parvus 5S ribosomal RNA, complete sequence.
ACCESSION M34764
KEYWORDS 5S ribosomal RNA.
SOURCE M.parvus (strain 492) rRNA.
ORGANISM Methylocystis parvus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria.
REFERENCE 1 (bases 1 to 119)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 119 5S ribosomal RNA
BASE COUNT 28 a 32 c 33 g 22 t 4 others
ORIGIN
1 nncttggtga ccatagcgag cgtgaaccac ccgatcccat cccgaactcg gaagtgaaac
61 cgcttagcgc cgatgatagt gtggccgttt gccatgcgaa agtagggcat tgcmaagcn
//
LOCUS PSERRAA 118 bp ss-rRNA RNA 14-JUL-1990
DEFINITION P.stutzeri 5S ribosomal RNA, complete sequence.
ACCESSION M34776
KEYWORDS 5S ribosomal RNA.
SOURCE P.stutzeri (strain 8) rRNA.
ORGANISM Pseudomonas stutzeri
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 118 5S ribosomal RNA
BASE COUNT 23 a 35 c 35 g 25 t
ORIGIN
1 gcctggtggc tatggcgagg aggcctaacc cgatcccatc ccgaactcgg ccgttaaact
61 cctcagtgcc gatggtacta tgtcttaaga cctgggagag taggtcgtcg ccaggcct
//
LOCUS RECRRAA 116 bp ss-rRNA RNA 14-JUL-1990
DEFINITION R.vacuolatum 5S ribosomal RNA, complete sequence.
ACCESSION M34773
KEYWORDS 5S ribosomal RNA.
SOURCE R.vacuolatum (strain Rv) rRNA.
ORGANISM Renobacter vacuolatum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Nonprosthecate bacteria.
REFERENCE 1 (bases 1 to 116)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 116 5S ribosomal RNA
BASE COUNT 23 a 36 c 32 g 25 t
ORIGIN
1 ctggtggcca ttgcgaggag ccccaattcg atctcactcc gaactcggcc gttaaactcc
61 tcagcgccga tggtactatg tctcaagacc tgggagagta ggtcgctgcc aggcct
//
LOCUS TUMRRAA 116 bp ss-rRNA RNA 14-JUL-1990
DEFINITION T.mutans 5S ribosomal RNA, complete sequence.
ACCESSION M34772
KEYWORDS 5S ribosomal RNA.
SOURCE T.mutans (strain U2) rRNA.
ORGANISM Tuberoidobacter mutans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Nonprosthecate bacteria.
REFERENCE 1 (bases 1 to 116)
AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
sequencing
JOURNAL J. Gen. Microbiol. 136, 441-446 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 116 5S ribosomal RNA
BASE COUNT 23 a 36 c 33 g 24 t
ORIGIN
1 ctggtggcca ttgcgaggag cctgaacccg actccactcc gaactcggcc gttaaactcc
61 tcagcgccga tggtactatg tctcaagacc tgggagagta ggtcgctgcc aggtct
//
LOCUS ANAPETF1A 1682 bp ds-DNA BCT 14-JUL-1990
DEFINITION A.variabilis ferredoxin (petF1) gene, complete cds.
ACCESSION M35126
KEYWORDS ferredoxin.
SOURCE A.variabilis (ATCC 29413; PCC 7937) DNA, clone pVA1.
ORGANISM Anabaena variabilis
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Nostocaceae.
REFERENCE 1 (bases 1 to 1682)
AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M.,
Van Arkel,G. and Weisbeek,P.
TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and
Synechococcus sp. PCC 7942: Structure and regulation
JOURNAL Photosyn. Res. 18, 179-204 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 794 1093 ferredoxin (petF1)
binding 782 785 ribosomal binding site (put.)
BASE COUNT 480 a 353 c 322 g 527 t
ORIGIN
1 aagcttgatg taagccacct aaaagttgaa tgtgttcgac gtggagtaat ttttgctgtg
61 ctttggctaa ggcaatcagt tcttctgctt cagttaaatc tacagataaa ggatattcta
121 caattacgtg tttgccggca gttagagcag cacggcgatc gcaccatgat cacgattgat
181 ggtacaaata accactaaat ctatatcttc tcgttctact aactgttgcc agccagttat
241 cgcttcagcc tggtaatctt gagccaaagc ctgggttcgc tccagtgtac tacccgctat
301 ggcgactagg tgcgatcgct tatcctccaa aaaggcttct gcccgcagtt ttgctgcata
361 ccctgtccaa ctatgccaat acgtattgtt gcttgtgcca aagctacctc cgactcctat
421 ataatcctca ccagtcctgc taccaagcac agtaataaat ttttattttt caaaaaaagc
481 taggttaacg cagtaccagc ctaactgaag gttatccctg ttccagaact tttttatcat
541 attcttgatt caagccataa atttttctaa taactgtggg tttaacgtct ttaaagtata
601 aaaaaaattt attgttatct gctaactaaa tttcattact aatcgtggtg gatttcaagt
661 tatatacttt gatttttccc gtagtatcag aattgaacta aatttaattc tgtggctaat
721 cccctgagaa tagccgctaa gttctgcttt agcataactt atactgccga ttacaaaaga
781 gaggattacg gaaatggcaa cttttaaagt tacattgatc aacgaagcag aaggaaccag
841 caacacaatt gacgttcctg atgatgagta tattttagac gctgccgaag aacagggtta
901 tgacctaccc ttttcctgtc gtgcaggtgc ttgctccacc tgcgccggta aactagtatc
961 cggtactgtt gaccagtctg accaatcatt cttggatgac gatcaaatcg aagctggata
1021 tgtattgacc tgtgttgctt atccaacctc tgatgtaacc atccaaaccc acaaagaaga
1081 agacctctac taagagtcaa gttagcttta aaaatagagg ctagaggcta gcagctacag
1141 actagtccct agccttttta ttttttttca ttagggaaca gtctattggg actgggtgta
1201 agggtgtaag ggtatagggg tgtaatagtt tcaaacattt atactctttt caacccttga
1261 tatttcgttt tcatccgtaa gtcccacaaa aaataaatta tccaaaattg atggtttggt
1321 agggtgcgtc aatagaaatc atttctgagt gtatttaggc tctatcgcac tgacggacac
1381 tacattttgg atattttttg atctgaaagt cccttaaaat ttgatttctt cattcaggct
1441 gtgaagttgt gtaccaggat aataaaattg gagaattttg ctgttagacc aacctagttt
1501 agctaaagtt tgagcgccag tttgacttaa gcccactcca tgtcctaaac caccaccaat
1561 aaaagcgtat cccacaactc ttctttgcct ttgttgagag gtagtatgta aaaaagcgtg
1621 cttcttggcg cagcaaaggc actacgcact tcatctttgt gtagggtaaa aatgcccata
1681 tc
//
LOCUS ANIPETF1A 1074 bp ds-DNA BCT 14-JUL-1990
DEFINITION A.nidulans ferredoxin (petF1) gene, complete cds.
ACCESSION M35127
KEYWORDS ferredoxin.
SOURCE A.nidulans (strain R2, PCC7942) DNA, clone pA612.
ORGANISM Anacystis nidulans
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria.
REFERENCE 1 (bases 1 to 1074)
AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M.,
Van Arkel,G. and Weisbeek,P.
TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and
Synechococcus sp. PCC 7942: Structure and regulation
JOURNAL Photosyn. Res. 18, 179-204 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 504 803 ferredoxin (petF1)
binding 493 496 ribosomal binding site (put.)
BASE COUNT 234 a 298 c 281 g 261 t
ORIGIN
1 ctgcagccgc tgcatgcgca gggcgttcga tttgagggct ttttgtggaa catcttggcg
61 cacatcatcg aagctctaac accgaggctg tctgagttgg ctagtcctgg cagctgggca
121 atctttagcg gcttgctaac cagtcaagcc gacactgtca gcgtcacttt ggaagagtac
181 ttttgggtga tccgcgatcg cgccagtcag ggagattggt gtcgtttggt cgcggatttt
241 cggccggaac gataaatctc actaatgctt agcttagagg gcttactggg agcgggccga
301 gtttgagccg tgattacccc tacgaacttt ccggccacgc tccattgctt agacataaaa
361 ttcccttatg tctagactgg cgattgatag catttctcgc ggcgcagttc gccctttggc
421 aacccatagt atcaatggga aaggtacggg caggctgtca atcgatgagc tctgccaccc
481 caaaagcgat agaggacacg ctcatggcaa cctacaaggt tacgctcgtc aatgctgccg
541 aaggcttgaa caccacgatc gacgtggctg acgataccta catcttggac gccgctgaag
601 agcaaggcat tgacctgcct tactcctgcc gtgctggtgc ttgctcgacc tgtgctggca
661 aagtcgtctc tggtaccgtc gaccaatcgg atcaatcctt cttggatgac gaccaaattg
721 cagcaggctt tgtcctgacc tgcgtcgcct atccgacctc cgatgtgacg atcgaaaccc
781 acaaagaaga agacctctac taagtcttgc ttcgactgct tcaatcctta gaattcaaat
841 caaattgcgg cttccaaatt gggagtcgct tttttatcgc cgtagtcagc agcgatcgcc
901 tcactaccgt ccagaaacag cctgcgatcg ccctgactta acgcccctcc cctagcaatc
961 ctgactcggt agagttcagg ccaaggactc aaggttgagg cgatcgttca gcaacgtctt
1021 gctctaggcg tccggcgtga ctgttgtggg gaagaggcga tcgccttcct gcag
//
LOCUS ASAASPPA 1764 bp ds-DNA PLN 14-JUL-1990
DEFINITION A.awamori aspergillopepsin A (pepA) gene, complete cds.
ACCESSION M34454
KEYWORDS aspartic proteinase; aspergillopepsin.
SOURCE A.awamori (NRRL3112 mutant strain UVK143f) DNA.
ORGANISM Aspergillus awamori
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 1764)
AUTHORS Berka,R.M., Ward,M., Wilson,L.J., Hayenga,K.J., Kodama,K.H.,
Carlomagno,L.P. and Thompson,S.A.
TITLE Molecular cloning and deletion of the gene encoding
aspergillopepsin A from Aspergillus awamori
JOURNAL Gene 86, 153-162 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 177 496 aspergillopepsin (pepA), exon 1
548 825 aspergillopepsin, exon 2
878 1126 aspergillopepsin, exon 3
1186 1523 aspergillopepsin, exon 4
sigp 177 308 aspergillopepsin signal peptide
matp 455 496 aspergillopepsin
548 825 aspergillopepsin
878 1126 aspergillopepsin
1186 1523 aspergillopepsin
pre-msg 124 1764 pepA mRNA and introns (put.)
pre-msg 125 1764 pepA mRNA and introns (put.)
pre-msg 127 1764 pepA mRNA and introns (put.)
IVS 497 547 pepA intron A
IVS 826 877 pepA intron B
IVS 1127 1185 pepA intron C
signal 54 58 TATA box
BASE COUNT 387 a 515 c 445 g 417 t
ORIGIN
1 gtcgacttgg atgatggaga gttgatcgag gtcaatgagg agaggcttgc aagtataaga
61 agagactgct cgaccagcag aatggatctt cttgttcatc aaccaagagt ccaaggcttc
121 tttgtctggt tctatctctt ctccgaactc tcttgcttga cattctcgtg gtcaaaatgg
181 tcgtcttcag caaaaccgct gccctcgttc tgggtctgtc ctccgccgtc tctgcggcgc
241 cggctcctac tcgcaagggc ttcaccatca accagattgc ccggcctgcc aacaagaccc
301 gcaccatcaa cctgccaggc atgtacgccc gttccctggc caagtttggc ggtacggtgc
361 cccagagcgt gaaggaggct gccagcaagg gtagtgccgt gaccacgccc cagaacaatg
421 acgaggagta cctgactccc gtcactgtcg gaaagtccac cctccatctg gactttgaca
481 ccggatctgc agatctgtaa gcttccctgc tcgggtgttc gggcaaatcg tgactaacct
541 ggactagctg ggtcttctcg gacgagctcc cttcctcgga gcagaccggt cacgatctgt
601 acacgcctag ctccagcgcg accaagctga gcggctacac ttgggacatc tcctacggtg
661 acggcagctc ggccagcgga gacgtgtacc gggatactgt cactgtcggc ggtgtcacca
721 ccaacaagca ggctgttgaa gcagccagca agatcagctc cgagttcgtt cagaacacgg
781 ccaatgacgg ccttttggga ctggccttta gctccatcaa cactggtgag tcaatcctac
841 atcagccggg ttgacctacc tgctgaccga tagacagtcc agccaaaggc gcagacaacc
901 ttcttcgaca ccgtcaagtc ccagctggac tctccccttt tcgccgtgca gctgaagcac
961 gacgcccccg gtgtttacga ctttggctac atcgatgact ccaagtacac cggttctatc
1021 acctacacgg atgccgatag ctcccagggt tactggggct tcagcaccga cggctacagt
1081 atcggtgacg gcagctccag ctccagcggc ttcagcgcca ttgctggtaa gaaccgcctt
1141 catttaacac acaacttgtc cacctcttta ctaactagtg tatagacacc ggtaccaccc
1201 tcatcctcct cgatgacgaa atcgtctccg cctactacga gcaggtttct ggcgcctcag
1261 gagagacgga agccggtggc tacgttttct cttgctcgac caacccccct gacttcactg
1321 tcgtgattgg cgactacaag gccgttgttc cgggcaagta catcaactac gctcccatct
1381 cgactggcag ctccacctgc tttggcggta tccagagcaa cagcggtctg ggactgtcca
1441 tcctgggtga tgttttcttg aagagccagt acgtggtctt caactctgag ggccctaagc
1501 tgggattcgc cgctcaggct tagattatcc actgaagtgg agtctatgat ctgctgattg
1561 atccctcgac gatgaactac atgtggaaat gcatagcaga cgagggtgat ggtgatgatg
1621 ttgatttgat gatgacccgt acatacttga tgaagctcgg tacatatgca atgtgactgt
1681 atctatgtga tgatatatgt atcatctcat aacttttggc tatgagtgca ggataacact
1741 gaaccagtag tagtactttc cact
//
LOCUS BSUPSIF2A 2758 bp ds-DNA BCT 14-JUL-1990
DEFINITION B.subtilis protein synthesis initiation factor 2 (infB) gene,
complete cds.
ACCESSION M34836
KEYWORDS protein synthesis initiation factor 2.
SOURCE B.subtilis (strain RS410) DNA, clones lambda-JET[1,2],pUK, and
pJET1.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2758)
AUTHORS Shazand,K., Tucker,J., Chiang,R., Stansmore,K.,
Sperling-Petersen,H.U., Grunberg-Manago,M., Rabinowitz,J.C. and
Leighton,T.
TITLE Isolation and molecular genetic characterization of the Bacillus
subtilis gene (infB) encoding protein synthesis initiation factor 2
JOURNAL J. Bacteriol. 172, 2675-2687 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 381 2531 protein synthesis initiation factor 2 (infB)
binding 369 375 ribosomal binding site (put.)
BASE COUNT 917 a 535 c 662 g 644 t
ORIGIN
1 atcgatgacc agattttcga tgaattgctg gaactggcgg aaaaggtgaa aaaataaaat
61 gtctggaatg gaatggtttc ccttgctggg tctggccaat cgagctcgta aggtcgtgtc
121 aggcgaagac ttggtaataa aagaaatcag gaatgcgcgt gcaaagcttg tcctgcttac
181 agaggatgca tcatctaaca cagcaaaaaa agtaactgac aagtgcaatt attataaagt
241 cccttataaa aaagtcgaga gtcgcgcggt tcttggacgc tctatcggta aagaagcccg
301 tgtcgttgtc gccgtcactg accaaggttt tgcgaataag ctgatcagct tgctcgatta
361 atatttttgg gggtgaacga atggctaaaa tgagagtata cgaatatgca aaagcgttaa
421 atgtttcaag taaggaaatt ttgaccgcac tgaagaacat ggatttagaa gtgaataatc
481 acatggccat gcttgaagaa aaggccatta aaaagctaga tgccaaatat aaaaaaggcg
541 gcgcacgtgc taaatctcaa aagccagcag aaacgaacaa aaacaaacag ccacaagggg
601 ttaatcagca atcagctgga aatcaaccaa ataaaattcg agacggaaag aagaatgacg
661 tgcagaataa tcaatttaac aaaaacaaga agaataacaa caacaaaaaa aataaacgca
721 acaacaacaa taataaaaac caacatcagc aaaagcctgt aaagccgaaa aaagagcttc
781 ctgagaaaat tacattctct ggcactttaa cagttggcgc acttgctgaa gagcttggca
841 aagagccttc agaactcatt aaaaagctga tgcttcttgg cgtaatggca accattaacc
901 aagagcttga taaagacaca atcgaactca ttgcatcaga atatggtgtt gaaacagaag
961 aggtcattgt gcttgaagaa acagagctgg aaaaatacga agagcctgat aatgaagagg
1021 atcttgaaat tcgtcctcct gtcgtgacaa tcatgggcca cgttgaccat gggaaaacaa
1081 cgcttcttga cagcatccgt aaaacaaagg ttgttgaagg ggaagcaggc ggaatcactc
1141 agcatatcgg cgcttatcaa attgaagaga acggcaagaa aatcacgttc ttggatacac
1201 cgggccacgc cgcatttaca acaatgcgcg cacgcggtgc agaagtaact gatattacga
1261 ttcttgtcgt agctgccgat gacggtgtca tgccgcaaac agttgaagcg attaaccatg
1321 caaaagcagc agaggttcca atcatcgttg ctgtgaataa aatagataaa gaatccgcaa
1381 accctgaccg tgtaatgcaa gaactgacgg aatacggact tgtccctgaa gcttggggcg
1441 gagaaaccat ttttgtgccg ctttccgctt taactggaaa aggcattgat gagctcgttg
1501 aaatgatttt gcttgtcagt gaggtagagg aactgaaagc gaatccgaat cgtcaggcaa
1561 aaggaacggt tattgaagct gaactcgata aaggcagagg gtcagtcgcg acgttgctcg
1621 tacagactgg aacactgcat gtcggtgatc cgatcgtagt cggcaataca tttggccgtg
1681 tccgtgcaat ggtcaacgac attggccgcc gtgtgaaaac tgccggcccg tcaactccgg
1741 ttgaaattac cggtttgaat gatgtccctc aagcgggaga ccaattcctt gtctttaagg
1801 atgaaaaaac agctcgttct gtcggtgaag cccgtgcttc gaaacagctt gaagagcagc
1861 gcagcgataa agcgaagctc agtcttgatg atttatttga gcaaattaag caaggtgatg
1921 taaaagacat caacctcatc gtaaaagctg acgttcaagg atctgctgaa gctttaacgg
1981 ctgcacttca aaaaattgaa gtagaaggcg ttaaagtgaa aatcatccat acaggcgttg
2041 gtgcgattac tgaatcagac attatcttgg catctgcttc caatgcaatt gttatcgggt
2101 ttaatgtgag accggacgga aatgctaaga gtacggctga agctgaaaat gtagatattc
2161 gacttcaccg tatcatttac aaagtaatcg acgagattga agctgccatg aaaggtatgc
2221 ttgatcctga atatgaagaa aaagtaattg gtcaagtaga agtacgccaa acattcaaag
2281 tatctaaaat cggtacaatt gccggcggat atgttactga aggaaccatt acgcgcgaca
2341 gcggcctccg tttaattcgt gacggcgtcg tcatctttga aggcgaagta gatgttctga
2401 aacgctttaa agacgatgtg aaagaagttt cacaaggcta tgaatgtggt attacaatta
2461 agaaatacaa tgacattcgt gaaggtgaca tccttgaagc gtttgtcatg caagaaattg
2521 aaagaacgtg atcggatttg cggagtgtga atgcatcatt tatgatgcag gatcgctaaa
2581 agaaaagcgt gccgttctga agcggatttt aaccagggtt caaaacaagt tcaatgtttc
2641 gatttcggag attggctatc aggacacctg gcaaagaacc agcttcggaa tcgccgctgt
2701 ttcttcctct cgcgttcaaa cagaaaaaga actgcagcgc gtcctagcgt ttatcgat
//
LOCUS CIBRCRA 1845 bp ds-DNA BCT 14-JUL-1990
DEFINITION Plasmid Colicin Ib-P9 replication control region replicon initiator
protein (repZ) gene, complete cds.
ACCESSION M34837
KEYWORDS replicon initiator protein.
SOURCE Plasmid Colicin Ib-P9 (from E.coli) DNA, clone pCH10-delta-1.
ORGANISM Plasmid Colicin Ib-P9
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1845)
AUTHORS Hama,C., Takizawa,T., Moriwaki,H., Urasaki,Y. and Mizobuchi,K.
TITLE Organization of the replication control region of plasmid Co1Ib-P9
JOURNAL J. Bacteriol. 172, 1983-1991 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 455 1486 repZ protein
mRNA 157 1762 repZ protein mRNA (alt.)
mRNA 158 1762 repZ protein mRNA (alt.)
mRNA 363 < 1 (c) inc RNA
binding 442 448 ribosomal binding site (put.)
signal 377 372 (c) inc -10 region
signal 400 395 (c) inc -35 region
mut 353 353 c in wild type; t in inc3
mut 374 374 t in wild type; c in inc2
mut 400 400 a in wild type; c in inc1
BASE COUNT 505 a 467 c 490 g 383 t
ORIGIN
1 gatctaaagc agaaaaatct gcttttaaaa atagatttta tttttttgac gcaggtcaag
61 attaacttat tagaggtatc agtgaggagg cactggaaga gaagagatcg ttgtaatgct
121 tttcaaatta acgtaaagcg ggtatatttc ggctgttatt agctgtgcag agggtggcac
181 tctgtggagc aaagcggcga aagccggacg gcagaatgcg ccataaggca ttcaggagag
241 atggcatgta cgggcagtaa gtcagaagac tgaagatgtt ccggaagcca taaaaggaaa
301 acccccacta tctttcttac gaacttggcg gaaagacgaa agatagtggg ggcctcacag
361 aatacgggta aagtataatg aaaccgtacc agagattcaa ccctgtgcag tgtataaata
421 cacggcacaa tcgctccgcc ataagcgaca gcttgtggca ggtctgaaga atactccata
481 taacgcagta cactggagtc agttagcacc cgaagagcag atccgtttct gggaagacta
541 tgaagcggga agggcgacca ctttcctggt tgaaccggaa aggaagcgca cgaagcgtcg
601 tcgcggtgag cactccacca aacccaaatg cgaaaatccg tcctggtatc gtcctgagcg
661 ctataaagcg ctgaaagggc agctcgggca cgcctataac cgtctggtga aaaaggaccc
721 ggtgaccggc gagcagagcc tgcgcatgcg catgtcgcgg catccttttt atgtacagaa
781 acggacgttc gtcggtcgca aatacgcttt ccggccggaa aaacaacgtc tcctcgatgc
841 catctggccg gttctggtca gcttcagtga tgccggcaca cacaccgtgg ggatgagcgt
901 cactcggctg gctgaagaaa tcagcccgaa agacagcgag ggacatgtta ttccggaact
961 ggaagtgacg gtctcccgtc tttcccgtct gctggccgaa caggtgcgct ttggtgtgct
1021 gggtgtgtca gaggaaacaa tgtgggaccg tgaacatcgt cagcgtctgc cacgttacgt
1081 ctggataaca ccggcaggct ggcagatgct gggcgtcgac atggtaaaac ttcacgaaca
1141 gcagcagaaa cgactgcgtg aaagtgaaat ccgccagcag ctcattcggg aaggtgttct
1201 gcgtgaggat gaagatatct ccgttcatgc ggccagaaaa cgctggtatc tgcagcgcag
1261 ccaggatgcc ctgaagaagc gccgagaaaa agccgcagcc agcaagcgtg ccaaccgcct
1321 gaagaaatta cctgttgacc agcagattta tgagatggct gagtatctcc ggaagcgtct
1381 gccgccggat gaagcctatt tctgttccga tgaccatctg aagcgactgg ccatcaggga
1441 gttgcgtcag cttgaactga cgctggctgc cccgccaccg cactagacag catcattccc
1501 tcagcactga atcatcgcca gcccctccgg ggctttcggc gcagattccg ctcagcccaa
1561 aatccttagt agtcacctta aatcccctca gaggggcata tctgcccata aaaccacgca
1621 tcagtcatca gaacgtggcc acgttgtttc agttatccac ataaatccgc aaacaaagaa
1681 ttttaagaag ctgcaaacct gaaacagcaa acctgcaata tattcttaac cccatcattt
1741 aatcccctgc gttgcttcgc cgcagggaaa gtctttatct ctgaaaccac tgtgaacaaa
1801 tacaaaaagg ccttcgcttg cagcgggcaa ggccgcgccg gaatt
//
LOCUS ECOHEMBA 1085 bp ds-DNA BCT 14-JUL-1990
DEFINITION E.coli porphobilinogen synthase (hemB) gene, complete cds.
ACCESSION M35121
KEYWORDS 5-aminolevulinic acid dehydratase; hemB gene;
porphobilinogen synthase.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1085)
AUTHORS Echelard,Y., Dymetryszyn,J., Drolet,M. and Sasarman,A.
TITLE Nucleotide sequence of the hemB gene of Escherichia coli K12
JOURNAL Mol. Gen. Genet. 214, 503-508 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 80 1054 porphobilinogen synthase (hemB)
signal 15 20 hemB -35 region
signal 41 46 hemB -10 region
BASE COUNT 254 a 277 c 301 g 253 t
ORIGIN
1 tcgcgacaac tttcgtaaaa catccctacc ctgcttcagg tatactatgc ccctcgattc
61 cacaaacatc aggcagacca tgacagactt aatccaacgc cctcgtcgcc tgcgcaaatc
121 tcctgcgctg cgcgtatgtt tgaagagaca acacttagcc ttaacgacct ggtgttgccg
181 atctttgttg aagaagaaat tgacggacta caaagccgtt gaagccatgc caggcgtgat
241 gcgcattcca gagaaacatc tggcacgcga aattgaacgc atcgccaacg ccggtattcg
301 ttccgtgatg acttttggca tctctcacca taccgatgaa accggcagcg atgcctggcg
361 ggaagatgga ctggtggcgc gtatgtcgcg catctgcaag cagaccgtgc cagaaatgat
421 cgttatgtca gacacctgct tctgtgaata cacttctcac ggtcactgcg gtgtgctgtg
481 cgagcatggc gtcgacaacg acgcgactct ggaaaattta ggcaagcaag ccgtggttgc
541 agctgctgca ggtgcagact tcatcgcccc ttccgccgcg atggacggcc aggtacaggc
601 gattcgtcag gcgctggacg ctgcgggatt taaagatacg gcgattatgt cgtattcgac
661 caagttcgcc tcctcctttt atggcccgtt ccgtgaagct gccggaagcg cattaaaagg
721 cgaccgcaaa agctatcaga tgaacccaat gaaccctcgt gaggcgattc gtgaatcact
781 gctggatgaa gcccagggcg cagactgcct gatggttaaa cctgctggag cgtacctcga
841 catcgtgcgt gagctgcgtg aacgtactga attgccgatt ggcgcgtatc aggtgagcgg
901 tgagtatgcg atgattaagt tcgccgcgct ggcgggtgct atagatgaag agaaagtcgt
961 gctcgaaagc ttaggttcga ttaagcgtgc gggtgcggat ctgattttca gctactttgc
1021 gctggatttg gctgagaaga agattctgcg ttaattttgt ttcctctcag aaagggggag
1081 ttaac
//
LOCUS ECOSTMA 1418 bp ds-DNA BCT 14-JUL-1990
DEFINITION E.coli F factor (stm) gene, complete cds.
ACCESSION M35124
KEYWORDS F factor; stm gene.
SOURCE E.coli DNA, clones pEG[157,159,175,176,220].
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1418)
AUTHORS Golub,E.I. and Panzer,H.A.
TITLE The F factor of Escherichia coli carries a locus of stable plasmid
inheritance stm, similar to the parB locus of plasmid RI
JOURNAL Mol. Gen. Genet. 214, 353-357 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 830 1009 F factor (stm)
signal 1013 1039 pot. transcription termination signal
signal 1074 1105 pot. transcription termination signal
BASE COUNT 279 a 364 c 429 g 345 t 1 others
ORIGIN
1 atcgctgtct gacctgcgat ttttcatgcc gtccctgacc gcagaagaac tgcacggcaa
61 ccgtcttcag tggctgtatg gcatcgatgt gctgattgaa acacagggcg aggtctgcct
121 ccttcccctt cccggtgatg cagcagagcg gtcgtttccg tcggtgcggt ttcgtgtcag
181 ggagcgcagt cgccataaat ccgcgctggt catgcagaag tacagccgcc agcaggcacg
241 ggaggcagag cagaaagccc gtgcgtacca ggcgctggtc gctcaggctg aaatcgaact
301 ggcattccat tcgccggaaa cagtcggaag ctggcatgca cgctggagtg accgggtggc
361 agagcatgac ctggaaaccc tgttctggca gtggggcgag cgttttccgt cactggccgg
421 aatggaacgc tggcagtggc aggatatgcc gttcgtggca ggttattgcg gaacgcagtc
481 ttgcagcaag agaggccggt cacgctgtcc gtgagatgag cgtgatgtgc cgaacaaact
541 ccggaggcag cgtgatgcgg gcaacaatca cacagattac ccgtaaacag cctgaatgag
601 cgggttattt tcaggaaaag tatgtgtggt cagcgtgcag ggatatggct atgatgtgcc
661 cggcgcttga ggctttctgc ctcatgacgt gaaggtggtt tgttgccgtg ttgtgtggca
721 gaaggacaaa agccccatag ttaatttttc attaacccac gaggcccctg catgcttaga
781 caacatcagg atagcctctt actgcgcttt cgacgaggag aagaaggcca tgaaactacc
841 acgcagctct cttgtctggt gtgtgttgat cgtgtgtctc acactgttga tattcactta
901 tctgacacga aaatcgctgt gcgagattcg ttacagagac ggatacaggg aggtngcggc
961 cggatacagg gaggtggcgg ctttcatggc ttacgaatcc ggtaagtagc aacctggagg
1021 cgggcgcagg cttttcaggg ctgatgctgg tctgactgca ctgatgcctt tataaagggg
1081 ctgctggttc gccggtggcc cctttctcct tgctggcatt gtacgggcat gaacaaccag
1141 actgccagga aggccgtttc cctctgggaa cggggaagaa cgtgcgcctg cggccgttct
1201 tttcccccgc cttctctgtt tatgacctgt cagaaatcac atcaccttcc tttgattatt
1261 ctctttttct ctgttctgac tctgattact gctgccctga ttcctgtctg ttttgttgtt
1321 ctcctggcag tgtcttttcc gggcgtcctg gcggggcggc tggcgtgaac ggagccagcg
1381 aagctgtctc tggcgggctt ccatccctga cgggaaac
//
LOCUS HUMGPIIBA 3333 bp ss-mRNA PRI 14-JUL-1990
DEFINITION Human platelet glycoprotein IIb (GPIIb) mRNA, complete cds.
ACCESSION M34480
KEYWORDS platelet glycoprotein IIb.
SOURCE Human megakaryocytes, cDNA to mRNA, clone IIb[3,4].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3333)
AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and
Marguerie,G.
TITLE GPIIb and GPIIIa amino acid sequences deduced from human
megakaryocyte cDNAs
JOURNAL Mol. Biol. Rep. 14, 27-33 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 33 3152 platelet glycoprotein IIb (GPIIb)
mRNA 1 3333 GPIIb mRNA
BASE COUNT 626 a 998 c 1040 g 669 t
ORIGIN
1 attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag
61 ccctctggct tctggagtgg gtgctgctgc tcttgggagc ttgtgctgcc cctccagcct
121 gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt
181 ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg
241 ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga
301 gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg
361 taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca
421 gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga
481 ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc
541 gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg
601 attttagctg ggacaagcgt tactgtgaag cgggcttcag ctcggtggtc actcaggccg
661 gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc
721 cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc
781 agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg
841 tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca
901 cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc
961 ggctgcgcgc agagcagatg gcgtcgtatt ttgggcattc agtcgctgtc actgacgtca
1021 acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggac agccgggcag
1081 accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg
1141 cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg
1201 ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc
1261 cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc
1321 tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct
1381 tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag
1441 cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc
1501 agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca
1561 agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc
1621 ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc
1681 ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctggacctg gatctgggcg
1741 gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc
1801 gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg
1861 gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg
1921 tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga
1981 cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca
2041 acgagggcga gggggcctat gaagcagagc tggcggtgca cctgccccag ggcgcccact
2101 acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg
2161 agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga
2221 taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct
2281 tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg
2341 acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc
2401 tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca
2461 aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc
2521 tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac
2581 agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact
2641 gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca
2701 gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa
2761 gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc
2821 agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc
2881 tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg
2941 tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct
3001 tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc
3061 tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggcaca
3121 ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga
3181 gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt
3241 ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc
3301 ccctcctgct gcctaataaa gagactgagc cct
//
LOCUS HUMGPIIIAA 4487 bp ss-mRNA PRI 14-JUL-1990
DEFINITION Human platelet glycoprotein IIIa (GPIIIa) mRNA, complete cds.
ACCESSION M35999
KEYWORDS platelet glycoprotein IIIa.
SOURCE Human megakaryocytes, cDNA to mRNA, clone IIIa[1,2,3].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4487)
AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and
Marguerie,G.
TITLE GPIIb and GPIIIa amino acid sequences deduced from human
megakaryocyte cDNAs
JOURNAL Mol. Biol. Rep. 14, 27-33 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 19 2385 plate glycoprotein IIIa (GPIIIa)
mRNA < 1 2603 GPIIIa mRNA
BASE COUNT 1046 a 1090 c 1186 g 1165 t
ORIGIN
1 ccgcgggagg cggacgagat gcgagcgcgg ccgcggcccc ggccgctctg ggtgactgtg
61 ctggcgctgg gggcgctggc gggcgttggc gtaggagggc ccaacatctg taccacgcga
121 ggtgtgagct cctgccagca gtgcctggct gtgagcccca tgtgtgcctg gtgctctgat
181 gaggccctgc ctctgggctc acctcgctgt gacctgaagg agaatctgct gaaggataac
241 tgtgccccag aatccatcga gttcccagtg agtgaggccc gagtactaga ggacaggccc
301 ctcagcgaca agggctctgg agacagctcc caggtcactc aagtcagtcc ccagaggatt
361 gcactccggc tccggccaga tgattcgaag aatttctcca tccaagtgcg gcaggtggag
421 gattaccctg tggacatcta ctacttgatg gacctgtctt actccatgaa ggatgatctg
481 tggagcatcc agaacctggg taccaagctg gccacccaga tgcgaaagct caccagtaac
541 ctgcggattg gcttcggggc atttgtggac aagcctgtgt caccatacat gtatatctcc
601 ccaccagagg ccctcgaaaa cccctgctat gatatgaaga ccacctgctt gcccatgttt
661 ggctacaaac acgtgctgac gctaactgac caggtgaccc gcttcaatga ggaagtgaag
721 aagcagagtg tgtcacggaa ccgagatgcc ccagagggtg gctttgatgc catcatgcag
781 gctacagtct gtgatgaaaa gattggctgg aggaatgatg catcccactt gctggtgttt
841 accactgatg ccaagactca tatagcattg gacggaaggc tggcaggcat tgtccagcct
901 aatgacgggc agtgtcatgt tggtagtgac aatcattact ctgcctccac taccatggat
961 tatccctctt tggggctgat gactgagaag ctatcccaga aaaacatcaa tttgatcttt
1021 gcagtgactg aaaatgtagt caatctctat cagaactata gtgagctcat cccagggacc
1081 acagttgggg ttctgtccat ggattccagc aatgtcctcc agctcattgt tgatgcttat
1141 gggaaaatcc gttctaaagt cgagctggaa gtgcgtgacc tccctgaaga gttgtctcta
1201 tccttcaatg ccacctgcct caacaatgag gtcatccctg gcctcaagtc ttgtatggga
1261 ctcaagattg gagacacggt gagcttcagc attgaggcca aggtgcgagg ctgtccccag
1321 gagaaggaga agtcctttac cataaagccc gtgggcttca aggacagcct gatcgtccag
1381 gtcacctttg attgtgactg tgcctgccag gcccaagctg aacctaatag ccatcgctgc
1441 aacaatggca atgggacctt tgagtgtggg gtatgccgtt gtgggcctgg ctggctggga
1501 tcccagtgtg agtgctcaga ggaggactat cgcccttccc agcaggacga gtgcagcccc
1561 cgggagggtc agcccgtctg cagccagcgg ggcgagtgcc tctgtggtca atgtgtctgc
1621 cacagcagtg actttggcaa gatcacgggc aagtactgcg agtgtgacga cttctcctgt
1681 gtccgctaca agggggagat gtgctcaggc catggccagt gcagctgtgg ggactgcctg
1741 tgtgactccg actggaccgg ctactactgc aactgtacca cgcgtactga cacctgcatg
1801 tccagcaatg ggctgctgtg cagcggccgc ggcaagtgtg aatgtggcag ctgtgtctgt
1861 atccagccgg gctcctatgg ggacacctgt gagaagtgcc ccacctgccc agatgcctgc
1921 acctttaaga aagaatgtgt ggagtgtaag aagtttgacc ggggagccct acatgacgaa
1981 aatacctgca accgttactg ccgtgacgag attgagtcag tgaaagagct taaggacact
2041 ggcaaggatg cagtgaattg tacctataag aatgaggatg actgtgtcgt cagattccag
2101 tactatgaag attctagtgg aaagtccatc ctgtatgtgg tagaagagcc agagtgtccc
2161 aagggccctg acatcctggt ggtcctgctc tcagtgatgg gggccattct gctcattggc
2221 cttgccgccc tgctcatctg gaaactcctc atcaccatcc acgaccgaaa agaattcgct
2281 aaatttgagg aagaacgcgc cagagcaaaa tgggacacag ccaacaaccc actgtataaa
2341 gaggccacgt ctaccttcac caatatcacg taccggggca cttaatgata agcagtcatc
2401 ctcagatcat tatcagcctg tgccaggatt gcaggagtcc ctgccatcat gtttacagag
2461 gacagtattt gtggggaggg atttcggggc tcagagtggg gtaggttggg agaatgtcag
2521 tatgtggaag tgtgggtctg tgtgtgtgta tgtgggggtc tgtgtgttta tgtgtgtgtg
2581 ttgtgtgtgg gagtgtgtaa tttaaaattg tgatgtgtcc tgataagctg agctccttag
2641 cctttgtccc agaatgcctc ctgcagggat tcttcctgct tagcttgagg gtgactatgg
2701 agctgagcag gtgttcttca ttacctcagt gagaagccag ctttcctcat caggccattg
2761 tccctgaaga gaagggcagg gctgaggcct ctcattccag aggaagggac accaagcctt
2821 ggctctaccc tgagttcata aatttatggt tctcaggcct gactctcagc agctatggta
2881 ggaactgctg gcttggcagc ccgggtcatc tgtacctctg cctcctttcc cctccctcag
2941 gccgaaggag gagtcaggga gagctgaact attagagctg cctgtgcctt ttgccatccc
3001 ctcaacccag ctatggttct ctcgcaaggg aagtccttgc aagctaattc tttgacctgt
3061 tgggagtgag gatgtctggg ccactcaggg gtcattcatg gcctggggga tgtaccagca
3121 tctcccagtt cataatcaca acccttcaaa gatttgcctt attggcagct ctactctgga
3181 ggtttgttta gaagaagtgt gtcaccctta ggccagcacc atctctttac ctcctaattc
3241 cacaccctca ctgctgtaga catttgctat gacctgggga tgtctctcat gaccaaatgc
3301 ttttcctcaa agggagagag tgctattgta gagccagagg tctggcccta tgcttccggc
3361 ctcctgtccc tcatccatag cacctccaca tacctggccc tgagccttgg tgtgctgtat
3421 ccatccatgg ggctgattgt atttaccttc tacctcttgg ctgccttgtg aaggaattat
3481 tcccatgagt tggctgggaa taagtgccag gatggaatga tgggtcagtt gtatcagcac
3541 gtgtggcctg ttcttctatg ggttacaacc tcatttaact cagtctttaa tctgagaggc
3601 cacagtgcaa ttttatttta tttttctcat gatgaggttt tcttaactta aaagaacatg
3661 tatataaaca tgcttgcatt atatttgtaa atttatgtgt atggcaaaga aggagagcat
3721 aggaaaccac acagacttgg gcagggtaca gacactccca cttggcatca ttcacagcaa
3781 gtcactggcc agtggctgga tctgtgaggg gctctctcat gatagaaggc tatggggata
3841 gatgtgtgga cacattggac ctttcctgag gaagagggac tgttcttttg tcccagaaaa
3901 gcagtggctc cattggtgtt gacatacatc caacattaaa agccaccccc aaatgcccaa
3961 gaaaaaaaga aagacttatc aacatttgtt ccatgagcag aaaactggag ctctggcctc
4021 agtgttacag ctaaataatc tttaattaag gcaagtcact ttcttcttct taaagctgtt
4081 tctagtttga gaaatgatgg gattttagca gccagtcttg aaggtctctt tcagtatcaa
4141 cattctaaga tgctgggact tactgtgtca tcaaatgtgc ggttaagatt ctctgggata
4201 ttgatactgt ttgtgttttt agttgggaga tctgagagac ctggctttgg caagagcaga
4261 tgtcattcca tatcaccttt ctcaatgaaa gtctcattct atcctctctc caaacccgtt
4321 ttccaacatt tgttaatagt tacgtctctc ctgatgtagc acttaagctt catttagtta
4381 ttatttcttt cttcactttg cacacatttg catccacata ttagggaagg aataagtagc
4441 tgcaaactat ctattcctgt attattgtgt taacattgag ataaacc
//
LOCUS MUSCDX1A 1597 bp ss-mRNA ROD 14-JUL-1990
DEFINITION Mouse Cdx-1 homeobox protein mRNA, complete cds.
ACCESSION M35125
KEYWORDS Cdx-1 homeobox protein.
SOURCE Mouse 8.5 day embryo, cDNA to mRNA, clone BH8.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1597)
AUTHORS Duprey,P., Chowdhury,K., Dressler,G.R., Balling,R., Simon,D.,
Guenet,J.-L. and Gruss,P.
TITLE A mouse gene homologous to the Drosophila gene caudal is expressed
in epithelial cells from the embryonic intestine
JOURNAL Genes Dev. 2, 1647-1654 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 158 736 Cdx-1 homeobox protein
site 389 571 Cdx-1 homeobox
BASE COUNT 326 a 523 c 444 g 304 t
ORIGIN
1 gccgcgcccg ctgggccctc cgacctatgc gccccctggc ccggcgcccg cgcccgcagt
61 accccgactt cgcgggttac acgcacgtgg agccggcgcc cgcgccctcc gacctgggct
121 gcgcttccct gcgcccaagg acgactgggc agctgctatg gcccgggccc cacggcctca
181 gcccgcagcc cggccccgct ggccttcggg ccctccggac tttagcccgg tgcccgcgcc
241 tcccgggcct ggtcccggca tcctagcgca gtccctcggg gctccgggcg caccgtcctc
301 gccaggagcg ccaaggcgga cgccctacga atggatgcgg cgcagcgtgg cggctgcagg
361 cggctgtggc agcggtaaga cccgaaccaa ggacaagtca cgtgtggtct acacagacca
421 ccaacgccta gagctggaaa aggagtttca ctacagccgg tacatcacta tccggcgcaa
481 gtccgagctg gctgctaacc tggggctcac agagcggcag gtaaagatct ggttccagaa
541 ccgccgggcc aaggagcgca aagtaaacaa gaagaaacag cagcagcagc agcccctgcc
601 tcccacacag ctgcccctgc ccctggatgg cactcccaca ccatcagggc cacccctagg
661 aagtctatgc cctactaatg ctggccttct gggcaccccc tccccagtgc ccgtcaagga
721 ggagtttcta ccctagcccc ttgcagcctg gggtctaggg atctagggac ttgaatgctg
781 ggcagctggc cttttcgggg cccaaggacg tgttctgagt gcagccctgc cttggccctt
841 tggggtcact gtggacaaac tgcctaccta ggacaagtta gcttgccctc ttcctgcctt
901 ccgttggctg ggctgtgtgg tgagcctgtt ggaccaagga gctcttccag ctcttgtgtc
961 ctagggcaca cagggtgatg gggaagccca attagaagtc tcaatcacct ggacatttcc
1021 caaagattca gaggctcaac gagctgtcaa gacaagattg aggcttcagc gtctcctcca
1081 agggttccag agtgaggtgg gaggctggta tctggaccag actggctctg gagagtaggc
1141 ggccattgaa acaaggtggt acaccacagt ctagactccc atgtgaaggg aagatgctat
1201 ctgcccacct gtctcctctt ccagcctcac tttcacctgc cctagcaccc catgctctgg
1261 ccctccccag cctggagcta caaagccatc aggactgaca tgaggtagag gctgggcaat
1321 tggccactgc tcttgagaga gaactcgccc ccaccttagt gtccagggag gcctgcggat
1381 ggagagaagg cactatgagt ctacctacct gcacatgccc accttcccac aggctgccca
1441 tcacacccca tttagccccc acaccatggg ggactcgcag ctgtacaggg cttatgtagg
1501 agtgtaatct cttggatgca gcttcagaaa taagttttcc cccttttaaa caatttataa
1561 aaatcataca aggcattaaa gaatttttaa aaagttt
//
LOCUS MUSHSP70B 3518 bp ds-DNA ROD 14-JUL-1990
DEFINITION Mouse heat shock protein 70.1 (hsp70.1) gene, complete cds.
ACCESSION M35021
KEYWORDS heat shock protein 70.1.
SOURCE Mouse (strain AJ) kidney DNA, clone pM[1.2,2.3].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3518)
AUTHORS Hunt,C. and Calderwood,S.
TITLE Characterization and sequence of a mouse hsp70 gene and its
expression in mouse cell lines
JOURNAL Gene 87, 199-204 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 806 2734 heat shock protein 70.1 (hsp70.1)
signal 507 512 inverted CCAAT box
signal 547 552 TATA box
BASE COUNT 827 a 993 c 1014 g 684 t
ORIGIN
1 gatctcttct atttccctat tcaaacctaa aatgaagagg gagggggaga catggacaag
61 caagcattcc acaggcgccc ctgcccaacg ctgtcactca aaccaggacc caatcacaga
121 ctttttagcc aagccttatc ccgcctctct tgagaaactt tctgcgtccg ccatcctgta
181 ggaaggattt gtacacttta aactccctcc ctggtctgag tcccacactc tcaccaccca
241 gcaccttcag gagctgaccc ttaacagctt cacccacagg gaccccgaag ttgcgtcgcc
301 tccgcaacag tgtcaatagc agcaccagca cttccccaca ccctccccct caggaatccg
361 tactctctag cgaaccccag aaacctctgg agagttctgg acaagggcgg aacccacaac
421 tccgattact caagggaggc ggggaagctc caccagacgc gaaactgctg gaagattcct
481 ggccccaagg cctcctccgg ctcgctgatt ggcccagcgg agagtgggcg gggccggtga
541 agactcctta aaggcgcagg gcggcgagca gggcaccaga cgctgacagc tactcagaat
601 caaatctggt tccatccaga gacaagcgaa gacaagagaa gcagagcgag cggcgcgttc
661 ccgatcctcg gccaggacca gccttcccca gagcatccac gccgcggagc gcaaccttcc
721 caggagcatc cctgccgcgg agcgcaactt tccccggagc atccacgccg cggagcgcag
781 ccttccagaa gcagagcgcg gcgccatggc caagaacacg gcgatcggca tcgacctggg
841 caccacctac tcgtgcgtgg gcgtgttcca gcacggcaag gtggagatca tcgccaacga
901 ccagggcaac cgcacgaccc ccagctacgt ggccttcacc gacaccgagc gcctcatcgg
961 ggacgccgcc aagaaccagg tggcgctgaa cccgcagaac accgtgttcg acgcgaagcg
1021 gctgatcggc cgcaagttcg gcgatgcggt ggtgcagtcc gacatgaagc actggccctt
1081 ccaggtggtg aacgacggcg acaagcccaa ggtgcaggtg aactacaagg gcgagagccg
1141 gtcgttcttc ccggaggaga tctcgtccat ggtgctgacg aagatgaagg agatcgctga
1201 ggcgtacctg ggccacccgg tgaccaacgc ggtgatcacg gtgcccgcct acttcaacga
1261 ctctcagcgg caggccacca aggacgcggg cgtgatcgcc ggtctaaacg tgctgcggat
1321 catcaacgag cccacggcgg ccgccatcgc ctacgggctg gaccggaccg gcaagggcga
1381 gcgcaacgtg ctcatcttcg acctgggggg cggcacgttc gacgtgtcca tcctgacgat
1441 cgacgacggc atcttcgagg tgaaggccac ggcgggcgac acgcacctgg gaggggagga
1501 cttcgacaac cggctggtga gccacttcgt ggaggagttc aagaggaagc acaagaagga
1561 catcagccag aacaagcgcg cggtgcggcg gctgcgcacg gcgtgtgaga gggccaagag
1621 gacgctgtcg tccagcaccc aggccagcct ggagatcgac tctctgttcg agggcatcga
1681 cttctacaca tccatcacgc gggcgcggtt cgaagagctg tgctcggacc tgttccgcgg
1741 cacgctggag cccgtggaga aggccctgcg cgacgccaag atggacaagg cgcagatcca
1801 cgacctggtg ctggtgggcg gctcgacgcg catccccaag gtgcagaagc tgctgcagga
1861 cttcttcaac gggcgcgacc tgaacaagag catcaacccg gacgaggcgg tggcctacgg
1921 ggcggcggtg caggcggcca tcctgatggg ggacaagtcg gagaacgtgc aggacctgct
1981 gctgctggac gtggcgccgc tgtcgctggg cctggagact gcgggcggcg tgatgacggc
2041 gctcatcaag cgcaactcca ccatccccac caagcagacg cagaccttca ccacctactc
2101 ggacaaccag cccggggtgc tgatccaggt gtacgagggc gagagggcca tgacgcgcga
2161 caacaacctg ctggggcgct tcgagctgag cggcatcccg ccggcgccca ggggcgtgcc
2221 gcagatcgag gtgaccttcg acatcgacgc caacggcatc ctgaacgtca cggccaccga
2281 caagagcacc ggcaaggcca acaagatcac catcaccaac gacaagggcc gcctgagcaa
2341 ggaggagatc gagcgcatgg tgcaggaggc cgagcgctac aaggccgagg acgaggtgca
2401 gcgcgacagg gtggccgcca agaacgcgct cgagtcctat gccttcaaca tgaagagcgc
2461 cgtggaggac gagggtctca agggcaagct cagcgaggct gacaagaaga aggtgctgga
2521 caagtgccag gaggtcatct cctggctgga ctccaacacg ctggccgaca aggaggagtt
2581 cgtgcacaag cgggaggagc tggagcgggt gtgcagcccc atcatcagtg ggctgtacca
2641 gggtgcgggt gctcctgggg ctgggggctt cggggcccag gcgccgccga aaggagcctc
2701 tggctcagga cccaccatcg aggaggtgga ttagaggcct ctgctggctc tcccggtgtg
2761 gtctagaaaa cagactcttt gcacttgata gctgcttggg caccgattac tgtcaaggtt
2821 atttaaagtc ttcttcatgg ttcagtttaa agttacagtc tttcttaagg taattgcgtt
2881 gactgttaaa ttttgtatgc atatatatat atatatatat atatatatat atattcaaat
2941 atattcaaag taatgttggg agcagcactg tgcactgtac caggggatta tgttttatag
3001 ctaatgatgt gtaaagtcta aagatttttt tgtaattttt atatcagtgt tccagtagcc
3061 tgggaagaca tatagtctag ctgcccagtt ccctggagat ggtcatctct aagacaaagt
3121 gtcttaaaca aacgtcttgg cactgtgtac tacataactt tactcttttg tacttaaaac
3181 tttatctgct tgtccatgtt aaggttttgt ggtataacca gtatgttctt tgcatttaat
3241 ctaagtaggt taaagatggt gtatccttcc tgcatacatg tctacactgc caccctgtgt
3301 acattttttt ctttgcatca ctacaaacta atgaaaaaaa cttttatgac ttaaatattc
3361 aaaataaaag gttacaagta tattttgtct gtttgtatgt tggaagggct aatggattct
3421 gggcttctgt ggatttctta agtttttttt aagatttatt attatatgtg aacacattgt
3481 agctatcttc agacacacca gaaaagggca tcagatct
//
LOCUS RDVS10 1319 bp ds-RNA VRL 14-JUL-1990
DEFINITION Rice dwarf virus segment S10 mRNA, encoding an ORF, complete cds.
ACCESSION D00241
KEYWORDS S10 RNA.
SOURCE Rice dwarf virus, cDNA to viral RNA.
ORGANISM Rice dwarf virus
Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus.
REFERENCE 1 (bases 1 to 1319)
AUTHORS Omura,T., Minobe,Y. and Tsuchizaki,T.
TITLE Nucleotide sequence of segment S10 of the rice dwarf virus genome
JOURNAL J. Gen. Virol. 69, 227-231 (1988)
STANDARD simple automatic
COMMENT Three contiguous termination codons were detected at the third
triplet downstream from the first stop codon.
Corresponds to DDBJ (rel3.0) entry <RDVS1O>(D00241) dated
20-JUN-1988
FEATURES from to/span description
pept 27 1088 ORF
BASE COUNT 354 a 279 c 284 g 402 t
ORIGIN 5' end of the plus-sense strand of segment S10.
1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc
61 atcatgagct cctttgtgct cacgaaggac caagtattat ttccaagttt gatgcaatta
121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag
181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact
241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg
301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt
361 ttatctttat ccctgagact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc
421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt
481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc
541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag
601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat
661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac
721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt
781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc
841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc
901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca
961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta
1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca
1081 gttcctgatc gcactgatga tagtgctttt catttatcaa aatattacgg tggggtcatc
1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac
1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccataggtg ggttagagtc
1261 gtgtgcgcac gcacttagtt ctactaatgc gtgcccccac actgcgcagg gattctgat
//
LOCUS RDVS10A 1320 bp ds-RNA VRL 14-JUL-1990
DEFINITION Rice dwarf virus segment 10 mRNA, encoding an ORF, complete cds.
ACCESSION M35118
KEYWORDS .
SOURCE Rice dwarf virus, cDNA to viral RNA, clones pRD[545,112,367].
ORGANISM Rice dwarf virus
Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus.
REFERENCE 1 (bases 1 to 1320)
AUTHORS Uyeda,I., Matsumura,T., Sano,T., Ohshima,K. and Shikata,E.
TITLE Nucleotide sequence of rice dwarf virus genome segment 10
JOURNAL Proc. Jpn. Acad. 63, 227-230 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 27 1088 ORF
BASE COUNT 357 a 280 c 282 g 401 t
ORIGIN
1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc
61 atcatgagct cctttctgct cacgaaggac caagtattat ttccaagttt gatgcaatta
121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag
181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact
241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg
301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt
361 ttatctttat ccctgaaact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc
421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt
481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc
541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag
601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat
661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac
721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt
781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc
841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc
901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca
961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta
1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca
1081 gttcctaatc gcactgatga tagtgcttat catttatcaa aatattacgg tggggtcatc
1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac
1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccatagggt gggttagagt
1261 cgtgtgcgca cgccacttag ttctactaat gcgtgccccc acatgcgcag ggattctgat
//
LOCUS SRVAGSM 3651 bp ss-RNA VRL 14-JUL-1990
DEFINITION Sapporo rat virus mRNA M segment, encoding an ORF, complete cds.
ACCESSION M34882
KEYWORDS .
SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero
E6 cell line C1008.
ORGANISM Sapporo rat virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Bunyaviridae; Hantavirus.
REFERENCE 1 (bases 1 to 3651)
AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and
Schmaljohn,C.S.
TITLE Coding properties of the S and the M genome segments of Sapporo rat
virus: Comparison to other causative agents of hemorrhagic fever
with renal syndrome
JOURNAL Virology 176, 114-125 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 47 3448 M segment ORF
BASE COUNT 1119 a 692 c 755 g 1085 t
ORIGIN
1 tagtagtaga ctccgcaaga aacagcagtt aaagaacaat aggatcatgt ggagtttgct
61 attactggcc gctttagttg gccaaggctt tgcattaaaa aatgtatttg acatgagaat
121 tcagttgccc cactcagtca actttgggga aacaagtgtg tcaggctata cagaatttcc
181 cccactctca ttacaggagg cagaacagct agtgccagag agctcatgca acatggacaa
241 ccaccagtca ctctcaacaa taaataaatt aaccaaggtc atatggcgga aaaaagcaaa
301 tcaggaatca gcaaaccaga attcatttga agttgtggaa agtgaagtca gctttaaagg
361 gttgtgtatg ttaaagcata gaatggttga agaatcatat agaaatagga gatcagtaat
421 ctgttatgat ctagcctgta atagtacatt ctgtaaacca actgtttata tgattgttcc
481 tatacatgct tgcaacatga tgaaaagctg tttgattggc cttggcccct acagaatcca
541 ggttgtctat gaaaggacat actgcactac gggtatattg acagaaggaa aatgctttgt
601 ccctgacaag gctgttgtca gtgcattgaa aagaggcatg tatgctatag caagcataga
661 gacaatctgc ttttttattc atcagaaagg gaatacatat aagatagtga ctgccattac
721 atcagcaatg ggctccaaat gtaataatac agatactaaa gttcaaggat attatatctg
781 tattattggt ggaaactccg cccctgtata tgcccctgct ggtgaagact tcagagcaat
841 ggaggttttt tctgggatta ttacatcacc acatggagaa gaccatgacc tacccggcga
901 agaaatcgca acgtaccaga tttcagggca gatagaggca aaaatccctc atacagtgag
961 ctccaaaaac ttaaaattga ctgcttttgc aggtattcca tcatactcat caactagtat
1021 attggctgct tcagaagatg gtcgtttcat atttagtcct ggtttatttc ctaacctaaa
1081 tcagtcagtc tgtgacaaca atgcactccc tttaatctgg aggggcctaa ttgatttaac
1141 gggatactat gaggcagtcc acccttgcaa tgtgttctgt gtcttatcag gaccaggtgc
1201 ttcatgtgag gccttttcag aaggaggtat tttcaatatt acttctccaa tgtgtctggt
1261 gtctaagcaa aatagattta gagcagctga gcagcagatt agctttgtct gccaaagagt
1321 tgatatggat attatagtgt actgtaatgg tcagaaaaaa acaatcctaa caaaaacatt
1381 agttataggc caatgtattt atactattac aagtctcttt tcactgttac caggggttgc
1441 ccattctatt gctattgagt tgtgtgttcc agggtttcat ggctgggcca cagctgcact
1501 tttgattaca ttctgcttcg gctgggtatt gattcctgca tgtacattag ctattctttt
1561 agtccttaag ttctttgcaa atatccttca tacaagcaat caagagaacc gattcaaagc
1621 cattctacgg aaaataaagg aggagtttga aaaaacaaag ggttccatgg tttgtgagat
1681 ctgtaagtat gagtgtgaaa cattaaagga attgaaggca cataacctat catgtgttca
1741 aggagagtgc ccatattgct ttacccactg tgaaccgaca gaaactgcaa ttcaggcaca
1801 ttacaaagtt tgtcaagcca cccaccgatt cagagaagat ttaaaaaaga ctgtaactcc
1861 tcaaaatatt gggccaggct gttaccgaac actaaatctt tttaggtata aaagtaggtg
1921 ttatattctg acaatgtgga ctcttcttct cattattgaa tccatcctct gggcagcaag
1981 tgcagcagaa atcccccttg tccctctctg gacagataat gctcatggcg ttgggagtgt
2041 tcctatgcat acggatcttg aattagactt ctctttgcca tccagttcta agtacacata
2101 caaaagacat ctcacaaacc cagttaatga ccaacagagt gtctcattgc atatagaaat
2161 tgaaagtcaa ggcattggtg ctgctgttca tcatcttgga cattggtatg atgcaagatt
2221 gaatctaaaa acctcatttc attgttatgg tgcctgcaca aaatatcaat acccatggca
2281 cactgcaaaa tgccattttg agaaagatta tgagtatgaa aatagctggg cttgcaaccc
2341 cccagattgc ccaggggttg gtacaggttg tactgcttgt ggattatatc tagatcaatt
2401 gaagccggta ggaacagcct ttaaaattat aagtgtaaga tacagtagaa aagtgtgcgt
2461 gcagtttggt gaagaacacc tttgtaaaac aattgatatg aatgattgct ttgtgactag
2521 gcatgccaaa atatgtataa ttgggactgt atctaagttt tctcaaggtg acactctact
2581 atttctgggg cccatggaag gaggtggtat aatctttaaa cactggtgta catctacctg
2641 tcactttgga gaccctggtg atgtcatggg tccaaaagat aaaccattta tttgccctga
2701 attcccaggg caatttagga aaaaatgtaa ctttgccaca actccagttt gtgaatatga
2761 tggaaacatt atatcaggct ataagaaagt acttgcaaca attgattctt tccaatcatt
2821 taacacaagc aatatacact tcactgatga gagaattgaa tggagagacc ctgatggcat
2881 gcttcgggat catattaata ttgttatttc taaagatatt gattttgaaa atttggctga
2941 gaatccttgt aaagtagggc tccaggcagc aaacatagaa ggtgcctggg gttcaggtgt
3001 cgggtttaca ctcacatgca aggtgtctct cacagaatgc ccaacatttc ttacatcaat
3061 aaaggcctgt gacatggcaa tttgttatgg tgcagaaagt gtgacactct cacgaggaca
3121 aaatactgtc aaaattaccg ggaaaggtgg ccatagtggt tcttcattca aatgctgtca
3181 tgggaaagaa tgttcatcaa ctggcctcca agccagtgca ccacatctgg ataaggtaaa
3241 tggtatctct gagttagaaa acgagaaagt ttatgatgac ggtgcacctg aatgtggcat
3301 tacttgttgg tttaaaaaat caggtgaatg ggttatgggt ataatcaatg ggaactgggt
3361 tgtcctaatt gtcttgtgtg tactgctgct cttttctctt atcctgttga gcatcttgtg
3421 tcctgttaga aagcataaaa aatcataaat cccacctaac aatcttcaca tcatgtatcg
3481 attttcaaac actttatcat ttagaactta acttggcact actatctgat aactgacttt
3541 catttttatt tttatatgga ttaattacta aaaaaaatac tctcttctat ctcccaatct
3601 tttattgatt caccggggtg ctgtcttgac atctggcggc gtctactact a
//
LOCUS SRVAGSS 1769 bp ss-RNA VRL 14-JUL-1990
DEFINITION Sapporo rat virus mRNA S segment, encoding an ORF, complete cds.
ACCESSION M34881
KEYWORDS .
SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero
E6 cell line C1008.
ORGANISM Sapporo rat virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Bunyaviridae; Hantavirus.
REFERENCE 1 (bases 1 to 1769)
AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and
Schmaljohn,C.S.
TITLE Coding properties of the S and the M genome segments of Sapporo rat
virus: Comparison to other causative agents of hemorrhagic fever
with renal syndrome
JOURNAL Virology 176, 114-125 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 43 1332 S segment ORF
BASE COUNT 569 a 331 c 410 g 459 t
ORIGIN
1 tagtagtaga ctccctaaag agctactaca ctaacaagaa aaatggcaac tatggaagaa
61 atccagagag aaatcagtgc tcacgagggg cagcttgtga tagcacgcca gaaggtcaag
121 gatgcagaaa agcagtatga gaaggatcct gatgacttaa acaagagggc actgcatgat
181 cgggagagtg tcgcagcttc aatacaatca aaaattgatg aattgaagcg ccaacttgcc
241 gacagattgc agcagggaag aacatccggg caggaccggg atcctacagg ggtagagcca
301 ggtgatcatc ttaaggaaag atcagcacta agctacggga atacactgga cctgaatagt
361 cttgacattg atgaacctac aggacagaca gctgattggc tgaccataat tgtctatctg
421 acatcattcg tggtcccgat catcttgaag gcactgtaca tgttaacaac acgaggtagg
481 cagacttcaa aggacaacaa ggggatgagg atcagattca aggatgacag ctcatatgag
541 gatgtcaatg gaatcagaaa gcccaaacat ctgtatgtgt caatgccaaa cgcccaatcc
601 agcatgaagg ctgaagagat aacaccagga agattccgca ctgcagtatg tggactatat
661 cctgcacaga taaaggcaag gaatatggta agccctgtca tgagtgtagt tgggttcttg
721 gcactggcaa aagactggac atcgagaatt gaagaatggc tcggtgcacc ctgcaaattc
781 atggcggagt ctcttattgc cgggagttta tctgggaatc ctgtgaatcg tgactatatc
841 agacagagac aaggtgcact tgcagggatg gagccaaagg aatttcaagc cctcaggcaa
901 cattcaaagg atgctggatg tacactagtt gaacatattg agtcaccatc atcaatatgg
961 gtgtttgctg gggcccctga taggtgtcca ccaacatgct tgtttgtcgg agggatggct
1021 gaattaggtg ccttcttttc tatacttcag gatatgagga acacaatcat ggcttcaaaa
1081 actgtgggca cagctgatga aaagcttcga aagaaatcat cattctatca atcatacctc
1141 agacgcacac aatcaatggg aatacaactg gaccagagga taattgttat gtttatggtt
1201 gcctggggaa aggaggcagt ggacaacttt catctcggtg atgacatgga tccagagctt
1261 cgtagcctgg ctcagatctt gattgaccag aaagtgaagg aaatctcaaa ccaggaacct
1321 atgaaattat aagtacataa atatataatc aatactaact ataggttaag aaatactaat
1381 cattagttaa taagaatata gatttattga ataatcatat taaataatta ggtaagttaa
1441 ctagtattta gttaagttag ctaattgatt tatatgattg tcacaattaa atgtaatcat
1501 aagcacaatc actgccatgt ataatcacgg gtatacgggt ggttttcata tggggaacag
1561 ggtgggctta gggccaggtc accttaagtg accttttttt gtatatatgg atgtagattt
1621 caattgatcg aatactaatc ctactgtcct cttttctttt cctttctcct tctttactaa
1681 caacaacaaa ctacctcaca ccttaatata tactacttta ttaagttgtt aagttgtgtc
1741 tttttgggga gtaagggagt ctactacta
//
LOCUS TRNSTIIA 1315 bp ds-DNA BCT 14-JUL-1990
DEFINITION Transposon Tn4521 (from E.coli) heat-stable toxin II gene,
complete cds.
ACCESSION M35123
KEYWORDS heat-stable toxin II; insertion sequence; transposase.
SOURCE Transposon Tn4521 (from Escherichia coli) DNA.
ORGANISM Transposon Tn4521
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1315)
AUTHORS Hu,S.T. and Lee,C.H.
TITLE Characterization of the transposon carrying the STII gene of
enterotoxigenic Escherichia coli
JOURNAL Mol. Gen. Genet. 214, 490-495 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 879 400 (c) heat-stable toxin II (STII)
trns 88 607 insertion sequence IS2 (bases 17-536)
trns 608 970 insertion sequence IS2 (bases 969-1327)
BASE COUNT 313 a 349 c 320 g 333 t
ORIGIN
1 gccgccggca gatggagctg gatattctaa aaaagacccg ggcatcagta tcagtcacct
61 gaacaacaga gaaaaaacaa agcttgcttt ccagacacct gttatcactt aacccattac
121 tggcccgctg ccgtagatat tcccgtggcg agcgataacc cagtgcacta tgcggatgcc
181 attcgttata atgctcgaac gcctctgcaa ggttctttgc tgccgttaac ccgtctggtt
241 tgggcatgat actgatgtaa tcacgcttta tcgttttcac gaagctctct gctatgccgt
301 tactctccgg actccgcacc gccgtgttct tcggttcaag tcccaacatc cgggcgaact
361 ggcgtgtttc attagcccgg tagcatgaac cattatccgt cagccattcc actggcgata
421 tcggaagctc gctgccgaag gggcgtacca ccgctcccag catgacgtcc tgtactgttt
481 cactgtcgaa gccgcccgta gtaaccgccc agtgcagtgc ctcacggtca cagcagtcca
541 gcgcgaacgt gactcgcagt ttttctccgt tatcacagcg gaactcgaac ccgtcagagc
601 accatcgttt tttttgccga gcaggcgctg gagttcttta atctgcttca tggcagcagc
661 aagttcagag gcaggaacga cctgttctcc tgcggccaca gcagtaagac ttcccttctg
721 gtattgcttg cgccagagaa ataactggct ggctgccgca ccgtgttgcc gggtaacaag
781 ggagcccgtc attcccggtt caaagctctg ctgaacaata gcgatctttt cctgtgtggt
841 acgccgtctg cgtttctccg gccctaagac atcactcatc tgctctccaa tgactagtct
901 aaaaactagt attaagacta tcacttaaat aagtgatatt ggttgtctgg agattcaggg
961 ggccagtcta cggcctatgt cggtctggtg tcaaaacagt caggctccgg agataaagtt
1021 cgactactgg gaataagcaa gtgtggtgat acttatctcc gaacattatt tattcatgga
1081 gcaagggcgg cagcattgct ggtcaaagaa ctagggccat ggatacacaa cttaagatac
1141 aacgactgtc tggtgtttca gggggccagc tcaaacattt ggcaaagata tacgtcttct
1201 ttacaatccg gaggtgttaa cggctctact tgtatcagct acgttctggt tatacgtaat
1261 agcataagaa tggtcgtgcg catacacgct ctggaaagca gaactagctg gacga
//
LOCUS AVOCYP 1500 bp ss-mRNA PLN 14-JUL-1990
DEFINITION Avocado cytochrome P-450LXXIA1 (cyp71A1) mRNA, complete cds.
ACCESSION M32885
KEYWORDS cytochrome P-450; cytochrome P-450LXXIA1.
SOURCE Avocado (strain Hass) ripe fruit mesocarp, cDNA to mRNA, clone
pAVOd8 and pAVOc8, and DNA (bases 1-16).
ORGANISM Persea americana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Magnoliidae; Laurales; Lauraceae.
REFERENCE 1 (bases 1 to 1500)
AUTHORS Bozak,K.R., Yu,H., Sirevag,R. and Christoffersen,R.E.
TITLE Sequence analysis of ripening-related cytochrome P-450 cDNAs from
avocado fruit
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3904-3908 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.E.Christoffersen, 16-MAR-1990, for release after publication.
FEATURES from to/span description
pept 21 1436 cytochrome P-450LXXIA1 (cyp71A1)
BASE COUNT 412 a 343 c 354 g 391 t
ORIGIN
1 tcactctaga gtaataatcc atggctatct tagtgtctct gctcttctta gccattgctc
61 tcaccttctt cctcctaaaa ctcaacgaaa aacgagagaa gaaaccgaac ctaccccctt
121 ctcctcccaa ccttcccatc atcggaaacc tccatcagct cggtaatctc ccacaccgct
181 ctcttcgctc ccttgcaaac gaactcggac ccttgatact cctccatctg ggtcacatcc
241 ccactctcat agtctccacc gctgagattg ccgaagagat cttgaaaacc catgatctca
301 tctttgctag ccgaccatcc acaactgcag ctcgacgaat cttctatgac tgcaccgacg
361 tggcgttctc tccctacggc gaatattgga ggcaagtaag gaagatctgt gtactcgagc
421 tccttagcat aaagagagtc aactcctacc gttcgatcag ggaagaagag gtgggcctca
481 tgatggagag gatctctcaa tcttgctcga caggtgaagc tgttaatcta tcagagctgt
541 tgctattgct atcaagcggc acaataacaa gggttgcttt tgggaagaag tacgaaggag
601 aagaagaaag gaagaacaag tttgcggatc ttgcaactga gttgacaact cttatgggag
661 ctttcttcgt gggagactac tttccttcgt ttgcatgggt tgatgttcta acagggatgg
721 atgcgaggtt gaagagaaat catggtgaat tggatgcttt tgtggatcac gtaattgatg
781 accatctcct tagtagaaaa gcgaacggct cagatggagt ggagcagaaa gatttagtgg
841 atgtgctgct acatctgcag aaggattcct cactcggcgt ccatctcaac agaaataacc
901 ttaaagccgt catcttggac atgttctctg gtggaactga tacgacggct gtgaccttag
961 aatgggctat ggcagagctt ataaagcatc ccgatgtgat ggagaaagcc caacaagagg
1021 taagaagagt tgtggggaaa aaagcaaagg tggaagaaga agatcttcat cagttgcact
1081 acttaaaact aatcatcaaa gagactctgc gcttgcatcc tgttgctcca ttattagttc
1141 cacgcgagtc cacaagggat gtagttataa ggggctatca tattcctgca aagacaagag
1201 tctttattaa tgcatgggca ataggaagag accccaagtc atgggagaat gctgaggaat
1261 ttctcccaga gagatttgtc aataattctg ttgatttcaa ggggcaagat tttcaactta
1321 ttccttttgg agcagggagg aggggctgtc cggggattgc atttggcatc agttcagttg
1381 agatttctct tgccaatctc ttgtactggt tcaactggga attacctggg atctaaccaa
1441 agaagatctg gacacgctcg aagccgttgg gataaccgtt cacatgaagt ttcctctgca
//
LOCUS ECODGTP 2760 bp ds-DNA BCT 14-JUL-1990
DEFINITION E.coli dGTP triphosphohydrolase (dgt) gene, complete cds,
and periplasmic protease (htrA) gene, 5'end.
ACCESSION M31772
KEYWORDS dGTP triphosphohydrolase; dGTPase;
deoxyguanosine triphosphate triphosphohydrolase; dgt gene;
htrA gene; periplasmic protease.
SOURCE E.coli (strain K12, isolate HR42) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2760)
AUTHORS Wurgler,S.M. and Richardson,C.C.
TITLE Structure and regulation of the gene for dGTP triphosphohydrolase
from Escherichia coli
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2740-2744 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.M. Wurgler, 01-FEB-1990, for release after publication.
FEATURES from to/span description
pept 1066 2583 dGTP triphosphohydrolase (dgt) (EC 3.1.5.1)
pept 2713 > 2760 periplasmic protease (hrtA)
pept 958 323 (c) ORF
signal 991 996 -35 region
signal 1014 1019 -10 region
site 1025 1046 regulatory signal
binding 1054 1065 ribosome binding site
signal 2590 2630 pot. transcription termination signal
BASE COUNT 711 a 665 c 706 g 678 t
ORIGIN 3.6 minutes on K12 map.
1 cagctggtca acctgccgct cggcattacc tccacgccag gcaatcacca gatcgggttt
61 cagcgcgaca atgcgttcca gattcatccc ctgccaggcg gaaacctgct caatcttttg
121 cgcttgtgga ggatagtcgg aatagctgct gaccccaacc ggcgtgatcc cggcggcaaa
181 ggcaagttca gtgttggcgg gagaaagcgt gatgacgcgc ggcgcggcgt tgagccacag
241 tggcgcaaga aaagacatgg ggcgaccagc gccctgaaca gtgacttagc catgtgcaag
301 tttctgcacc agtgactaac catcaggctg gactgtttag cggcaacagc aggaactcat
361 cgaagctaag atgagactgt tgatcggcca cgtcggagat ggcgcgtacg acaacaaacg
421 ggacgttgaa attgtggcag acatgggcga ttgccgtcgc ttccatctct acagcaatgg
481 cctgtgggaa gttgtggcgg attttcgcca gaccaacaga accgttgatg aaagcgtcgc
541 cgctaacaat caggccacgt acagcgttaa gattcagttc ggcaatgcag gcctcagcgg
601 cagcgatcag tttatcgtca gctttaaagc ctgccggaca gcctggtaac tgaccgtatt
661 cataaccaaa tgccgtgaca tccgcgtcgt gataacgtgc ttcgtccgag acaacgatat
721 cgcccacttt caacgttggt gccaggccac cggcagaacc ggtgttaata atcacatctg
781 gcttgcagtg ttccaacagc aaagtggcac ccagcgccgc agcgacttta ccgatgcccg
841 atttcagaag cgcaacctcg gttccattca gttggccggt atagatttcg caaccgccga
901 gactgatagt ttgacggttt tcgattttgt cacgcagcag cgtaacttct tcttccattg
961 caccaatgat gccgattttc atagatttac tcgcgataag cccgatttga aggcatagtt
1021 taccatgcgc ttacggggaa gcgtatttct cacgcgggag aggacatggc acagattgat
1081 ttccgaaaaa aaataaactg gcatcgtcgt taccgttcac cgcagggcgt taaaaccgaa
1141 catgagatcc tgcggatctt cgagagcgat cgcgggcgta tcatcaactc tccggcaatt
1201 cgtcgtctgc aacaaaagac ccaggttttt ccactggagc gcaatgccgc cgtgcgcacg
1261 cgtcttaccc actcgatgga agtccagcag gtggggcgct acatcgccaa agaaatttta
1321 agccgtctga aagagcttaa attactggaa gcatacggcc tggatgaact gaccggtccc
1381 tttgaaagca ttgttgagat gtcatgcctg atgcacgata tcggcaatcc gccgtttggt
1441 cattttggcg aagcggcgat aaatgactgg tttcgccaac gtttgcaccc ggaagatgcc
1501 gaaagccagc ctctgactga cgatcgctgc agcgtggcgg cactacgttt acgggacggg
1561 gaagaaccgc ttaacgagct gcggcgcaag attcgtcagg acttatgtca ttttgagggg
1621 aatgcacaag gcattcgcct ggtgcataca ttgatgcgga tgaatctcac ctgggcacag
1681 gttggcggta ttttaaaata tacccgtccg gcgtggtggc gtggcgaaac gcctgagaca
1741 catcactatt taatgaaaaa gccgggttat tatctttctg aagaagccta tattgcccgg
1801 ttgcgtaaag aacttaattt ggcgctttac agtcgttttc cattaacgtg gattatggaa
1861 gctgccgacg acatctccta ttgtgtggca gaccttgaag atgcggtaga gaaaagaata
1921 tttaccgttg agcagcttta tcatcatttg cacgaagcgt ggggccagca tgagaaaggt
1981 tcgctctttt cgctggtggt tgaaaatgcc tgggaaaaat cacgctcaaa tagtttaagc
2041 cgcagtacgg aagatcagtt ttttatgtat ttacgggtaa acaccctaaa taaactggta
2101 ccctacgcgg cacaacgatt tattgataat ctgcctgcga ttttcgccgg aacgtttaat
2161 catgcattat tggaagatgc cagcgaatgc agcgatcttc ttaagctata taaaaatgtc
2221 gctgtaaaac atgtgtttag ccatccagat gtcgagcggc ttgaattgca gggctatcgg
2281 gtcattagcg gattattaga gatttatcgt cctttattaa gcctgtcgtt atcagacttt
2341 actgaactgg tagaaaaaga acgggtgaaa cgtttcccta ttgaatcgcg cttattccac
2401 aaactctcga cgcgccatcg gctggcctat gtcgaggctg tcagtaaatt accgtcagat
2461 tctcctgagt ttccgctatg ggaatattat taccgttgcc gcctgctgca ggattatatc
2521 agcggtatga ccgacctcta tgcgtgggat gaataccgac gtctgatggc cgtagaacaa
2581 taaccaggct tttgtaaaga cgaacaataa atttttacct tttgcagaaa ctttagttcg
2641 gaacttcagg ctataaaacg aatctgaaga acacagcaat tttgcgttat ctgttaatcg
2701 agactgaaat acatgaaaaa aaccacatta gcactgagtc gactggctct gagtttaggt
//
LOCUS RATHMGCOA 1994 bp ss-mRNA ROD 14-JUL-1990
DEFINITION Rat mitochondrial 3-hydroxy-3-methylglutaryl-CoA synthase mRNA,
complete cds.
ACCESSION M33648
KEYWORDS 3-hydroxy-3-methylglutaryl-CoA synthase.
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
lambda-cMS1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1994)
AUTHORS Ayte,J., Gil-Gomez,G., Haro,D., Marrero,P.F. and Hegardt,F.G.
TITLE Rat mitochondrial and cytosolic 3-hydroxy-3-methylglutaryl-CoA
synthases are encoded by two different genes
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3874-3878 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.G.Hegardt, 06-APR-1990, for release after publication.
FEATURES from to/span description
pept 50 1576 3-hydroxy-3-methylglutaryl-CoA synthase
precursor (EC 4.1.3.5)
sigp 50 160 3-hydroxy-3-methylglutaryl-CoA synthase signal
peptide
matp 161 1573 3-hydroxy-3-methylglutaryl-CoA synthase
mRNA 1 1994 3-hydroxy-3-methylglutaryl-CoA synthase mRNA
BASE COUNT 493 a 563 c 505 g 433 t
ORIGIN
1 atctctccca ggggctgtgg actgctggct ttctgttgat accttagaga tgcagcggct
61 tttggctcca gcaaggcggg tcctgcaagt gaagagagtc atgcaggaat cttcgctctc
121 acccgctcac ctgctccccg cagcccagca gaggttttct acaatccctc ctgctcccct
181 ggccaaaact gatacatggc caaaagatgt gggcatcctt gccctggagg tctactttcc
241 agcccaatat gtggaccaaa ctgacctgga gaagttcaac aatgtggaag cagggaagta
301 cacagtgggc ttgggccaga cccgtatggg cttctgttcg gtccaggagg acatcaactc
361 cttgtgcctc acagtggtgc agaggctgat ggaacgcaca aagctgccat gggatgccgt
421 aggccgcctg gaagtgggca cggaaaccat cattgacaag tccaaggctg tcaagacagt
481 gctcatggag ctcttccagg attcaggcaa cactgacatc gagggcatag ataccaccaa
541 cgcctgctat ggtggcactg cctccctctt caacgctgcc aactggatgg agtccagcta
601 ctgggatggt cgctatgccc tggtggtctg tggtgatatc gcagtctacc caagtggtaa
661 cccccgcccc acaggtggtg ccggggctgt ggcaatgctg attgggccca aggccccgct
721 agtcctggaa caagggctga ggggaaccca catggagaac gcctatgact tctacaaacc
781 aaacttggcc tcagagtatc cactggtgga tgggaagctg tctatccagt gctacctgcg
841 ggccttggac cgatgctatg cagcttaccg caggaaaatc cagaatcagt ggaagcaagc
901 tggaaacaac cagcctttca ccctcgatga cgtgcaatat atgatcttcc acacaccctt
961 ttgcaagatg gtccagaaat ccctagctcg gctgatgttc aatgacttcc tgtcatctag
1021 cagtgacaag cagaacaact tatacaaggg tctagaggcc ttcaagggtc taaagctgga
1081 agaaacctac accaacaagg atgttgacaa ggctctgctg aaggcctccc tggacatgtt
1141 caacaagaaa accaaggcct ccctttacct ctccacaaac aatgggaaca tgtacacctc
1201 gtccctctac gggtgcctgg cctcacttct ctcccaccac tctgcccaag aattggccgg
1261 ctccaggatt ggagccttct cctacggctc aggcttagca gcaagtttct tctcatttcg
1321 agtgtccaag gacgcttccc caggttcccc tctggagaag ctggtgtcta gtgtgtcaga
1381 tctgcccaaa cgtctagact cccggagacg catgtcccct gaggaattca cagaaataat
1441 gaatcagaga gagcaatttt accacaaggt gaacttctct ccccctggtg acacaagcaa
1501 cctcttccca ggcacttggt accttgaacg agtggatgag atgcaccgca gaaaatatgc
1561 ccggcgtccc gtctaaggag accaatccat acaaccattc cccggggaaa gaatgtgagc
1621 agagccgtta cccaaacggc ttccacttaa aattccaccc acagcagtga acggtgaata
1681 gacacagcga ccccatagga tctgctccgc ggtgaagggc ctccctctgt ggatcctggg
1741 tgaccctccc tgaagcagtg agcaccacag gttctgctgt ggaccagagc ccccctgtgg
1801 agagggagaa agaaagggga gccgctgacc tgcagggata cagaccttcc ccacagcctg
1861 gcagccgccc gtttgttgca gcttattatc agactgtggg ctatcatagt tcatgctcgt
1921 ttcttaaagt ttcccgagaa tttctaaaat tttgtatcta aacttttaat atggcgatta
1981 aaaggagaga agga
//
LOCUS ECOSLTOXI 1761 bp ds-DNA BCT 14-JUL-1990
DEFINITION E.coli Shiga-like toxin II (SLTIIva) gene, complete cds.
ACCESSION M29153
KEYWORDS Shiga-like toxin II.
SOURCE E.coli (serotype O128:B12, isolate H.I.8) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1761)
AUTHORS Gannon,V.P.J., Teerling,C., Masri,S.A. and Gyles,C.L.
TITLE Molecular cloning and nucleotide sequence of another variant of the
Escherichia coli Shiga-like toxin II family
JOURNAL J. Gen. Microbiol. 136, 1125-1135 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by V.P.J.Gannon, 20-SEP-1989, for release after publication.
FEATURES from to/span description
pept 254 1213 Shiga-like toxin II (SLTIIva)-A subunit
precursor
sigp 254 319 Shiga-like toxin II-A subunit signal peptide
matp 320 1210 Shiga-like toxin II-A subunit
pept 1226 1489 Shiga-like toxin II-B subunit precursor
sigp 1226 1282 Shiga-like toxin II-B subunit signal peptide
matp 1283 1486 Shiga-like toxin II-B subunit
signal 61 66 -35 region
signal 82 87 -10 region
binding 241 247 A subunit binding site
binding 1215 1221 B subunit binding site
signal 1659 1265 rho-independent transcription termination
signal
BASE COUNT 487 a 316 c 396 g 562 t
ORIGIN
1 atatgtgaga gattctctat tctataaaaa tataaaccat atacccgtaa tcttaaataa
61 ttgatactat tttccggaag ctatagtgca tcattatctg ctttctgttc ctgataaaca
121 gggaagtaat acattgtgca gtcatttcct gtgaggggat ttgtttgtaa taaatttctg
181 ctaaaaacat tccggacttc ttgcgaggta ttattcctgg cgctgccggt tcagactgat
241 ggagtgttaa tatatgcgac atatattatt aaagctggtg ttgttttttt gtgtttgctt
301 gtcttcagca tcttatgcag atgagtttac tgtggatttc tcttcgcaaa agagctatgt
361 tgattcattg aatagtataa ggtcggcaat atccactcca cttggaaata tatctcaggg
421 tggtgtttct gtttcagtaa ttaatcatgt tctaggcgga aactatatat cattgaatgt
481 tagaggcctt gatccatata gcgagagatt taaccacctc cgtttaataa tggaacggaa
541 taacttatat gttgcaggct ttattaatac tgaaacgaat atcttttaca gattctccga
601 tttctcacat atttcagtgc ctgatgtgat aactgtttcc atgacgacgg acagcagtta
661 ttcatcatta cagcgaatcg cagatctgga acgtacaggg atgcagattg ggcgtcattc
721 actggttggt tcatatctgg atttaatgga gttcagagga cgttccatga cccgcgcatc
781 atccagagct atgctgcgtt ttgtcacagt gatagcagaa gctctgcgat tcagacaaat
841 acagcgggga ttccgaccgg cgctgtctga ggcatctccg ctttatacaa tgacggctca
901 ggatgttgac cttaccctga actggggaag aataagtaat gttcttccag agtacagagg
961 agaggaaggg gtaagaatcg gtaggatatc ttttaatagt ctttctgcga ttctcggaag
1021 tgttgcggtc atccttaatt gccactcaac cggaagttat tcagttcgtt ccgtgagcca
1081 aaaacagaaa acagaatgcc agattgttgg agacagggcg gccattaaag taaataatgt
1141 tttgtgggaa gcgaatacaa tcgctgcttt attaaatcgc aagcctcagg atcttactga
1201 accaaaccaa taacaggggg tgaatatgaa gaagatgatt attgcagttt tattcggtct
1261 cttttctgct aattccatgg cggcggattg tgctgtagga aaaattgagt tttccaagta
1321 taatgaggat gataccttta ctgtgaaggt gtcaggaaga gaatactgga cgaacagatg
1381 gaatttgcag ccattgttac aaagtgctca gctgacaggg atgactgtaa caatcatatc
1441 taatacctgc agttcaggct caggctttgc ccaggtgaag tttaactgag aatctacggt
1501 ttatttatgc gcgtcttttg tttctggacg cagatattat tagtgttgtg gatgctgatt
1561 aaatttggtc agtgttttcg ttaaagtcat ataaatacag gggcgtcacg cccctttttg
1621 gtctgtagtt gggtgaagat catcacttta ctttgctcaa ataactcagc acttgccggt
1681 ttacccttaa ggcaatacct acgttccata aaagaagcat caattaggat gctcgtatac
1741 ttgttatgtg ctttaaatgt t
//
LOCUS RATCHOL7H 3545 bp ss-mRNA ROD 14-JUL-1990
DEFINITION Rat cholesterol 7-alpha-hydroxylase mRNA, complete cds.
ACCESSION J05460
KEYWORDS cholesterol 7-alpha-hydroxylase; cholesterol 7-alpha-monooxygenase.
SOURCE Rat (strain Wistar King) male 8 week old liver, cDNA to mRNA, clone
p7-alpha-11.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3545)
AUTHORS Noshiro,N., Nishimoto,M. and Okuda,K.
TITLE Rat liver cholesterol 7-alpha-hydroxylase: Pretranslational
regulation for circadian rhythm
JOURNAL J. Biol. Chem. 265, 10036-10041 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.Noshiro, 30-MAR-1990, for release after publication.
FEATURES from to/span description
pept 36 1547 cholesterol 7-alpha-hydroxylase (EC
1.14.13.17)
BASE COUNT 1035 a 710 c 697 g 1103 t
ORIGIN
1 ggtctcccct ttggaaattt tcctgctttt gcaaaatgat gactatttct ttgatttggg
61 gaattgccgt gttggtgagc tgttgcatat ggtttattgt tggaataagg agaaggaaag
121 ctggtgaacc tcctttggag aacgggttga ttccgtacct gggctgtgct ctgaaatttg
181 gatctaatcc tcttgagttc ctaagagcta atcaaaggaa gcatggtcac gtttttacct
241 gcaaactgat ggggaaatat gtccatttca tcacaaactc cctgtcatac cacaaagtct
301 tatgtcatgg aaaatatttt gactggaaaa aatttcatta cactacttct gcgaaggcat
361 ttggacacag aagcattgac ccaaatgatg gaaataccac ggaaaatata aacaacactt
421 ttaccaaaac cctccaggga gatgctctgt gttcactttc tgaagccatg atgcaaaacc
481 tccaatctgt catgagacct cctggccttc ctaaatcaaa gagcaatgcc tgggtcacgg
541 aagggatgta tgccttctgt taccgagtga tgtttgaagc tggctatcta acactgtttg
601 gcagagatat ttcaaagaca gacacacaaa aagcacttat tctaaacaac cttgacaact
661 tcaaacaatt tgaccaagtc tttccggcac tggtggcagg ccttcctatt cacttgttca
721 agaccgcaca taaagctcgg gaaaagctgg ctgagggatt gaagcacaag aacctgtgtg
781 tgagggacca ggtctctgaa ctgatccgtc tacgtatgtt tctcaatgac acgctctcca
841 cctttgacga catggagaag gccaagacgc acctcgctat tctctgggca tctcaagcaa
901 acaccattcc tgcaaccttt tggagcttat ttcaaatgat caggagtcct gaagcaatga
961 aagcagcctc tgaagaagtg agtggagctt tacagagtgc tggccaagag ctcagctctg
1021 gagggagtgc catttacttg gatcaagtgc aactgaatga cctgccggta ctagacagca
1081 tcatcaagga ggctctgagg ctttccagtg catccttgaa tatccgcaca gctaaggagg
1141 acttcactct ccatcttgag gacggttcct ataacatccg aaaagatgac atgatagctc
1201 tttatccaca gttaatgcac ttggatcctg aaatctaccc agaccctttg actttcaaat
1261 atgaccggta ccttgatgaa agcgggaaag caaagaccac cttctacagt aatggaaaca
1321 agctgaagtg tttctacatg cccttcggat caggcgcgac aatatgtcct ggaagactct
1381 ttgccgtcca agaaatcaag cagtttttga tcctgatgct ctcctgcttt gaactggagt
1441 ttgtggagag ccaagtcaag tgtccccctc tagaccagtc ccgggcaggc ttgggaattt
1501 tgccaccact acatgatatt gagtttaaat ataaactgaa acactgatac gtggttggaa
1561 gaagcgaaca ctggatgatg tcacttggcg gctgagagtc atcactaaac aggccttcgg
1621 gaccaatgct cactgatgcg ccctagcgac tggattagtg ggaagaactt tgttctcgct
1681 gcccacattc ctgggtgttc acatagctgg ggccagagct tcatcacttt cagaaagcaa
1741 tgtcttttgt atttattttc aaaatgaaga tattccaatt ggcaggatat ttttcctaag
1801 gaaattgctt tatattttta tgaaaactac caattaatta tgaaagggct tgaaattcac
1861 gttttagtga aattactgat ttttcactag taaggttctt caggtgtgaa actgtattat
1921 aaaaatgttg taatgggtca cactgtgctt tgcataaagg taaaggaaac tatgtttcag
1981 ccttttctgt gtctatgagc ttcgaaaata atcttactgt tctagaaaca ctggggaggt
2041 ttcgacatgc tctcgctata ttttatttta ctgttgctag aaattttcat tccagttttc
2101 aactacctta tctttccccc attttgacat gcatgccaat gagaagagta ttttttagga
2161 attaacaagg cacctcccag aaccctaccc tgagactttt aagcctttaa tcccagcact
2221 cgagaagtag agccaggcag atctctgagt ctgaggttat tctggtctac atcagctcca
2281 gacaagccag gactacagaa tgggatcttg tctaaaaaat acagctaatc tttatgtcat
2341 aactgattat gaatcaacct aaaagataaa ttttcaatca ggactcagag aaaatgagca
2401 attaaaaaac ttagctctga ggtatgtgga attcattaag tacaagttga cattacatgt
2461 tctttaaaaa tagtttatgt tttatctcta aatgccctgc agatgaagaa taataatgaa
2521 aagttgaata atactgttta aacactaagt gcaataatgc tttggtaatg tactttaaga
2581 gaatcattag ccgtgccagt tttactaaaa tatatttata tgtaaattat atttatcttt
2641 ttcttatacc ataaatataa aaatattgca acatttagta attttaaaat tatatacctt
2701 tcagaaaatg atgtatgaat gtttgtatgt tttttaactt tgaacagaac atttaaatta
2761 ttcatctacg gtgattttta tcttatttat ttctttttgt ctcattcata tcttgaagaa
2821 atccaaaaat atctgaagga atcgctcact caaatgtctc cctatggtta cagaaaaatt
2881 caataccatg tttttgtcct cggggactga agcagggtgt cgtgggtgcc gagcagaggc
2941 tcctgctgca gcgagcttta tccacgggac tccttaaact tttaaaatct tatcactatt
3001 atcatgcatt tattacctaa gtaggatatt tccctttcct ttttcatttc agccgagtcc
3061 cttagcaacc caggctgact gggaccctcc atgtagctta agctgtgaac tcactgtact
3121 tcctgttttc acttatttta ggaagtaatt ttccctatca gaaattttaa ttgtttagat
3181 gatgtataag agtaacacaa ttctgttata tactaatctg tagtaaacta aatttgttct
3241 tagaacaagt ttgatgactc tcaaattgaa tgtatccata catctttcca tggcttcttg
3301 aatgcccatt tctcatacac agaatgatgg gtttcacggt gatgtcttcc tttcatgtct
3361 ttattcttgt gcggtgatgg ttggcaaatg atacccatgg agcaaggtta ctcttcctat
3421 ttctgtgcag cctaagtgtt aagaataatt tttaaatact tggagggaag gcacattttg
3481 tgtcatatgt gaagtgacat gtgacacaca gactagcaaa tccttgagta aaattttatt
3541 gggat
//
LOCUS HUMCTLA1A 4751 bp ds-DNA PRI 14-JUL-1990
DEFINITION Human granzyme B (CTLA-1) gene, complete cds.
ACCESSION M28879
KEYWORDS cytotoxic T-lymphocyte-associated serine esterase 1; granzyme B.
SOURCE Human (patient B) blood DNA, clone lambda-B30.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4751)
AUTHORS Haddad,P., Clement,M.-V., Bernard,O., Larsen,C.-J., Degos,L.,
Sasportes,M. and Mathieu-Mahul,D.
TITLE Structural organization of the hCTLA-1 gene encoding human granzyme
B
JOURNAL Gene 87, 265-271 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Haddad, 11-MAY-1990, for release after publication.
FEATURES from to/span description
pept 1161 1215 granzyme B, exon 1 /hgml_locus_uid="LQ0170R"
/nomgen="CTLA1" /map="14q11-q13"
2256 2403 granzyme B, exon 2
2857 2992 granzyme B, exon 3
3200 3460 granzyme B, exon 4
4105 4248 granzyme B, exon 5
IVS 1216 2255 CTLA-1 intron A
IVS 2404 2856 CTLA-1 intron B
IVS 2993 3199 CTLA-1 intron C
IVS 3461 4104 CTLA-1 intron D
signal 4338 4343 poly-A signal
BASE COUNT 1211 a 1292 c 1119 g 1129 t
ORIGIN
1 gaattctata ttttgagata taccattcct catagaaaaa tttcctcaca gaaaatataa
61 aggtggaaac aaatcacaag aatcgaacca tgtagagaga cttagttgtc ttttaacaga
121 attgggcacg ggctgttcag aaacaacaat ctttcacatc cattataatg atagcattag
181 tgtagtttgt ttagcaaatg tttactgcga gcctgttatg tgctgagcct gctatgtaag
241 aagtgtggct ctctggacag gagacagaat actaaacaac acaactactg atctttggct
301 gcctggcatg cttcctcact tcatatggta tcagcaattt agcaccacaa acgtccttta
361 gagaccagcc ctttctcatt cttggttcta gtggcttgag tagactgacc ccactaccca
421 agtggatttg actcctagca attcattaat ctagcccata aatgtcaagt acaggacttt
481 attgaagcat tcagaaagag gaatagggga tgttagaatc tctagaaagg aagctatgat
541 aataaatggg ttgctagatg ggtctagtag atggtggcca tgctttgtta ctgccttgtg
601 tattgtgcta ccatagccct ccccaaactg tactctggct cctggcattt ccgtctcttc
661 aaccagatgg tcagctctct aagtgaagga gacacatctc caacatgctt ggttctagca
721 caacagaagg gctcaaacac atacctgcta aagaaactat cctgatggat ttagcagcat
781 ggccatgagg cattggcggt tctatcactg ggaactcagg tttctggtgc tccagtacct
841 ctactggctg ataccacatc ctacagttca cttcataggc ttgggttcct gctctgggct
901 gaataggtgg tccactctga gtcatcagct gtggtgatga tgtggtcact gcatgattct
961 cacacaagca cccagaggac gtcatcaggc agaggcagtg ggggtgggca gcatttacag
1021 aaaatctgtg atgagacacc acaaaaccag aggggaacat gaagtcactg agcctgctcc
1081 acctctttcc tctcccaaga gctaaaagag agcaaggagg aaacaacagc agctccaacc
1141 agggcagcct tcctgagaag atgcaaccaa tcctgcttct gctggccttc ctcctgctgc
1201 ccagggcaga tgcaggtgag tgaccgtctt ccaacctcgg ggcccaaccc atcccacagg
1261 tctcctgccc tttctccaca ttcctgatcc atctatctac caggaatgtt ctgaactcca
1321 gctcccattc taccaagacc ccccaagtgt gatgctggat aagctatcag caggaatggc
1381 agagcagcag gccattctca agaagagcca gtgggtacta tcccttcccc agagcccacc
1441 tttgtcacct ggagagtagg actttcctag aagtaaatgg cagaggatgg gaaactagaa
1501 aagagaaata ttaaattatt ctagagtagg cctggcttct gtttctggga taagacaggt
1561 gcttctctca ctgtacttag gagagaaacc cagagctcag ctgacagcag aattggtaca
1621 atcactgtcc tcagaacact gttaatgtgt ttgctcagtc ccattctcca actctgcttt
1681 tcttccctgg cctttggtgg ctcccctctt tccaaggatg aggcactacg gcaggcccca
1741 gcttccctgc tttctagaat tccaccagca ctgctctacc agccctcatc cagaggctaa
1801 ctggagccag tccatcatgc agccatgaac atttactggg cacccactac atgtcaggct
1861 ctaggaaaca ggatatgaca gtatctagat ccctccactt acaccctggc cattagaaag
1921 cagcactatc ctagacacca caggactcat aagggtcttg gaaactcacc tgaaacaaag
1981 caaagtcagg agaggaatga tcaggagcct ctgggatttc actgtcccta agacaggtat
2041 gctcgccttc aactacatat ggaagaaaga tttacagacc aaagtctgct gttcttccct
2101 ttttcagagc aggaaattga agccccttcc tccaggccac tcccaactcc aggctatccc
2161 aggctcccaa atgcccagga gttctggagc cactaagcag gtgcccaccc agcagattcc
2221 atgggtgccc acaagcagac agacttttcc ttcaggggag atcatcgggg gacatgaggc
2281 caagccccac tcccgcccct acatggctta tcttatgatc tgggatcaga agtctctgaa
2341 gaggtgcggt ggcttcctga tacaagacga cttcgtgctg acagctgctc actgttgggg
2401 aaggtgagga gcagaaaaca gcccacaccc tcctggaaac actccacaga gacccctgcc
2461 ttcttcccaa ggagctccct gggctcctgt gaacacacat gccaggaggt ctccttagag
2521 ggtgagaaaa gggcagttaa gtttgtggag agaggggaag gttggttcca gaggtgctgc
2581 tgaagtaaga aacagcagag tgaccaagcc tgccatattt agaactgggg gcatactttg
2641 gcatagaata caaactgaag caattccacc tgtgtttcta gggggaaccg aaccctgaga
2701 aacctggtgc aattaccaga attccaattc ctggggaccg actgtcctta atttcccctc
2761 agctgcagcc ctgccccagc tgtcacctgc tcttcactgt ctctgggctg tatacactgt
2821 gactccaccc ccatcctcac tctgctctct gtgcagctcc ataaatgtca ccttgggggc
2881 ccacaatatc aaggaacagg agccgaccca gcagtttatc cctgtgaaaa gagccatccc
2941 ccatccagcc tataatccta agaacttctc caatgacatc atgctactgc aggtgaggca
3001 cactcctgcc actcttgctc ttcttggtcc agttggttcc actccccctg gaatgccggc
3061 ccttccctcc tttccatcct ggcctcttgg ttagttccta tgcctcagag gagagaggga
3121 agattgtgca gccccatcac tgtgtcgggg cccagaagtt cgttggctga cctggacttt
3181 cttgcctctt ccccaccagc tggagagaaa ggccaagcgg accagagctg tgcagcccct
3241 caggctacct agcaacaagg cccaggtgaa gccagggcag acatgcagtg tggccggctg
3301 ggggcagacg gcccccctgg gaaaacactc acacacacta caagaggtga agatgacagt
3361 gcaggaagat cgaaagtgcg aatctgactt acgccattat tacgacagta ccattgagtt
3421 gtgcgtgggg gacccagaga ttaaaaagac ttcctttaag gtaagactat gcacctgcct
3481 ggattggctc ttgggagaaa gatgtttggg gaatatctga gacctggaga ctcaagtagt
3541 gggggactcc ttcacccact agactgtgat atttctctct ggaaagagaa aaggggacta
3601 gactgagctg gggagaaatt agggcctctg caaacttacc aagaggcctt atggtggatg
3661 gtgccttctt tggaaggatg aatttgcaac actccaccca ctccaggtca cagatattag
3721 gaaactgtgc ccatgggggt gcagctaatt ataaccaggt gtgtcttcag aggctggtac
3781 ccaacgtggt taatgggctg gtcctccatg gtggacatca gccctccttg cccacttctg
3841 ggtccttaaa cagccaacgg tcccacatac ctccgatctc aggatctggg ggacatgacg
3901 gaggctggcc cctgggatga ggtgaagcag taacaatgtc cagggccaga gcttggcagc
3961 tggggccacc agcggcctgc cctgccctct ggtctcccac atgtaggctg tgcaagttgg
4021 ccttttctaa aagggggctt gagatggaag agagggcagg acccggagga gcatcagctc
4081 agtccttcca ctctctattc acagggggac tctggaggcc ctcttgtgtg taacaaggtg
4141 gcccagggca ttgtctccta tggacgaaac aatggcatgc ctccacgagc ctgcaccaaa
4201 gtctcaagct ttgtacactg gataaagaaa accatgaaac gctactaact acaggaagca
4261 aactaagccc ccgctgtaat gaaacacctt ctctggagcc aagtccagat ttacactggg
4321 agaggtgcca gcaactgaat aaatacctct tagctgagtg gaaagctggt ttcttgttta
4381 ttcattgacc ctcattctca ggcaccacat ctgcgctatg caggccaatg acacaatttt
4441 gctgttttct gctttctcct ctcccctcac cccttgccac ctccccaaac ccccacatga
4501 agctgatact cagctccttc ctatccacac cagtttctcc agggcctgcc ttctgccaag
4561 gctgaagctg agcaccatca ggagacaaca tggaccactt tggtcctggg gctttgggta
4621 aacttcttac ctccttctcc agtgttacat tgacagagaa aaaagggata ataccatggg
4681 acctaactcc tcatcccact ggggctcctc attctcccct gggcttagtt tctctaccct
4741 cctctgagct c
//GenBank-Updates@genbank.bio.net (07/17/90)
LOCUS BPEP69OMP 3000 bp ds-DNA BCT 17-JUL-1990
DEFINITION B.pertussis omp69A gene for P.69 outer membrane protein, complete
cds.
ACCESSION J04560
KEYWORDS membrane protein; outer membrane protein.
SOURCE B.pertussis (CN2992) DNA.
ORGANISM Bordetella pertussis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Alcaligenaceae.
REFERENCE 1 (bases 1 to 3000)
AUTHORS Charles,I.G., Dougan,G., Pickard,D., Chatfield,S., Smith,M.,
Novotny,P., Morrissey,P. and Fairweather,N.F.
TITLE Molecular cloning and characterization of protective outer membrane
protein P.69 from Bordetella pertussis
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3554-3558 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by N.F.Fairweather 26-JUL-1989.
FEATURES from to/span description
pept 145 2877 P.69 outer membrane protein precursor
sigp 145 246 P.69 outer membrane protein signal peptide
matp 247 2277 P.69 outer membrane protein
BASE COUNT 453 a 1019 c 1096 g 432 t
ORIGIN 2 bp upstream of ClaI site.
1 atcgatgata cgtcgctgta acacgacaaa tagcgtgcgt tgcagcggtt ctggatggcg
61 ttattcgtac ttttgctgcg cccattcttc cctgttccat cgcggtgcgg gcatggcggg
121 cgtctgctct ccacctggca tccaatgaac atgtctctgt cacgcattgt caaggcggcg
181 cccctgcgcc gcaccacgct ggccatggcg ctgggcgcgc tgggcgccgc cccggcggcg
241 catgccgact ggaacaacca gtccatcgtc aagaccggtg agcgccagca tggcatccat
301 atccagggct ccgacccggg cggcgtacgg accgccagcg gaaccaccat caaggtaagc
361 ggccgtcagg cccagggcat cctgctagaa aatcccgcgg ccgagctgca gttccggaac
421 ggcagtgtca cgtcgtcggg acagttgtcc gacgatggca tccggcgctt tctgggcacc
481 gtcaccgtca aggccggcaa gctggtcgcc gatcacgcca cgctggccaa cgttggcgac
541 acctgggacg acgacggcat cgcgctctat gtggccggcg aacaggccca ggccagcatc
601 gccgacagca ccctgcaggg cgctggcggc gtgcagatcg agcgcggcgc caatgtcacg
661 gtccaacgca gcgccatcgt cgacgggggc ttgcatatcg gcgccctgca gtcattgcag
721 ccggaagacc ttccgcccag ccgggtggtg ctgcgcgaca ccaacgtgac cgccgtgccc
781 gccagcggcg cgcccgcggc ggtgtctgtg ttgggggcca gtgagcttac gctcgacggc
841 gggcacatca ccggcgggcg ggcagcgggg gtggcggcca tgcaaggggc ggtcgtgcat
901 ctgcagcgcg cgacgatacg gcgcggggac gcgcttgccg gcggtgcggt tcccggcggt
961 gcggttcccg gtggtgcggt tcccggcggc ttcggtcccg gcggcttcgg tcccgtcctc
1021 gacggctggt atggcgtgga cgtatcgggc tccagcgtgg agctcgccca gtcgatcgtc
1081 gaggcgccgg agctgggcgc cgcaatccgg gtgggccgcg gcgccagggt gacggtgccg
1141 ggcggcagct tgtccgcacc gcacggcaat gtcatcgaga ccggcggcgc gcgtcgcttt
1201 gcgcctcaag ccgcgcccct gtcgatcacc ttgcaggccg gcgcgcatgc ccaggggaaa
1261 gcgctgctgt accgggtcct gccggagccc gtgaagctga cgctgaccgg gggcgccgat
1321 gcgcagggcg acatcgtcgc gacggagctg ccctccattc ccggcacgtc gatcgggccg
1381 ctcgacgtgg cgctggccag ccaggcccga tggacgggcg ctacccgcgc ggtcgactcg
1441 ctgtccatcg acaacgccac ctgggtcatg acggacaact cgaacgtcgg tgcgctacgg
1501 ctggccagcg acggcagcgt cgatttccag cagccggccg aagctgggcg gttcaaggtc
1561 ctgacggtca atacgctggc gggttcgggg ctgttccgca tgaatgtctt cgcggacctg
1621 gggctgagcg acaagctggt cgtcatgcag gacgccagcg gccagcacag gctgtgggtc
1681 cgcaacagcg gcagcgagcc ggccagcgcc aacaccctgc tgctggtgca gacgccacta
1741 ggcagcgcgg cgacctttac ccttgccaac aaggacggca aggtcgatat cggtacctat
1801 cgctatcgat tggccgccaa cggcaatggg cagtggagcc tggtgggcgc gaaggcgccg
1861 ccggcgccca agcccgcgcc gcagccgggt ccccagccgc cgcagccgcc gcagccgcag
1921 ccggaagcgc cggcgccgca accgccggcg ggcagggagt tgtccgccgc cgccaacgcg
1981 gcggtcaaca cgggtggggt gggcctggcc agcacgctct ggtacgccga aagcaatgcg
2041 ttgtccaagc gcctgggcga gttgcgcctg aatccggacg ccggcggcgc ctggggccgc
2101 ggcttcgcgc aacgccagca gctggacaac cgcgccgggc ggcgcttcga ccagaaggtg
2161 gccggcttcg agctgggcgc cgaccacgcg gtggcggtgg ccggcggacg ctggcacctg
2221 ggcgggctgg ccggctatac gcgcggcgac cgcggcttca ccggcgacgg cggcggccac
2281 accgacagcg tgcatgtcgg gggctatgcc acatatatcg ccgacagcgg tttctacctg
2341 gacgcgacgc tgcgcgccag ccgcctggag aatgacttca aggtggcggg cagcgacggg
2401 tacgcggtca agggcaagta ccgcacccat ggggtgggcg cctcgctcga ggcgggccgg
2461 cgctttaccc atgccgacgg ctggttcctc gagccgcagg ccgagctggc ggtattccgg
2521 gccggcggcg gtgcgtaccg cgcggccaac ggcctgcggg tgcgcgacga aggcggcagc
2581 tcggtgctgg gtcgcctggg cctggaggtc ggcaagcgca tcgaactggc aggcggcagg
2641 caggtgcagc catacatcaa ggccagcgtg ctgcaggagt tcgacggcgc gggtacggta
2701 cacaccaacg gcatcgcgca ccgcaccgaa ctgcgcggca cgcgcgccga actgggcctg
2761 ggcatggccg ccgcgctggg ccgcggccac agcctgtatg cctcgtacga gtactccaag
2821 ggcccgaagc tggccatgcc gtggaccttc cacgcgggct accggtacag ctggtaaagc
2881 gaggagggtc tatcccccgc ggaggagttt ttcctggagc ttggccggtg ccagtctcca
2941 ggctcaggcg gccagggcct gcgggccggg caggccgcgc tggtgctggc cgaaccattg
//
LOCUS ATHTRPB 4225 bp ds-DNA PLN 17-JUL-1990
DEFINITION A.thaliana tryptophan synthase beta subunit (trpB) gene,
complete cds.
ACCESSION M23872
KEYWORDS tryptophan synthase.
SOURCE A.thaliana (strain Landsberg erecta) DNA, clone lambda-M1-g, and
cDNA to mRNA (strain Columbia), clone lambda-CD-2.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 882)
AUTHORS Berlyn,M.B.
JOURNAL Unpublished (1989) Yale Univ, 165 Prospect, New Haven CT, 06511.
STANDARD full staff_review
REFERENCE 2 (bases 883 to 4225)
AUTHORS Berlyn,M.B., Last,R.L. and Fink,G.R.
TITLE A gene encoding the tryptophan synthase beta subunit of Arabidopsis
thaliana
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 4604-4608 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by M.B.Berlyn, 18-APR-1989.
FEATURES from to/span description
pept 1517 1858 tryptophan synthase beta subunit, exon 1 (EC
4.2.1.20)
2109 2465 tryptophan synthase beta subunit, exon 2
2549 2991 tryptophan synthase beta subunit, exon 3
3084 3178 tryptophan synthase beta subunit, exon 4
3275 3450 tryptophan synthase beta subunit, exon 5
IVS 1859 2108 tryptophan synthase intron A
IVS 2466 2548 tryptophan synthase intron B
IVS 2992 3083 tryptophan synthase intron C
IVS 3179 3274 tryptophan synthase intron D
site 3643 3645 poly-A addition site
BASE COUNT 1252 a 733 c 817 g 1423 t
ORIGIN
1 gaattctttc atatctcctg caaagttctt gatatcaata ctccagcagt aactaagact
61 tagactcttg agcgtaggag agtttgataa caaagactcg gcctctgtga gcttgatcca
121 accaatagag agctttctag gcaatcccga gtttttgaac ttggagggat caagcccaca
181 cgcgtaaatc tttagtgatt cgagatttgt gtttaaaatc cgaattaaaa cctaatcaaa
241 ttaaaactaa accaaaccaa atacaatcca aaattaaact aattttggtt gagtttggtt
301 atagttttac taaatccaaa ttaacagaac ataaccaaac ccgaagattt ttagagtctt
361 tagaatttta aggtgatttt agataaaaga gattaaacac aaatctcgaa aactaaagaa
421 agagtttttg aaaattttta agtgttttca tgtaaagtgg atttctctgt gttttctgca
481 ttctgcggat tataactcct atgttttttt tctccgtcaa ttatatgtgt ttattttctc
541 tattttcttt tatttttatt tttattctct atattagggt ttagtttatg aaaacttttt
601 gttatctata taggcttggg ggatgtattt aaattagaat ttaaagtgat ttgagttctt
661 tgagttttta aataatttta acgattttaa aaaagttcgt atgatttttg taaaatctat
721 taaaatctca ccttaaatca tgggatttgg atttctgtat tttgaactaa gaaaatcctc
781 tcaaatcctc caaaatcatt aaaattcaaa tccacaaatt gttctgaata acagtgaatt
841 ttaaggtgga ttttgaaata attagttcaa taacactgaa tttcatgaga ttttttaaaa
901 tacatgtttg aataacatat gatttataaa ttctacacaa atcttttaaa attctaattt
961 caatacattg tttttgaaag tgttattgac tcttgccaat atagtatccc aattcccaac
1021 ttgtgtttca ttttttcatc tatctaataa acaattagat gaacacaaaa aaatattggt
1081 aggtgatggc tcaattggat atgtttttga aaaccatgtg ttaaaaactt aaaatactat
1141 ccaacttacc ccagtcctac caactttttt tttcttctct tggtctgctt acatgtgtct
1201 gcttatatct ccaaaaggaa atagatatat aaaaattcaa atttaaatat ttgcgatttg
1261 ttaaatttta atcaatattt aatttttgtt tttttttgtt tttttttatg aagacaacaa
1321 ataaccaaat ttatcaaatc tgatcaaagc agatttagga ttttacaaat atattttttt
1381 aatatgaatt ttgtggtcag attttgacca attctctttg aaaaaaaaaa aaatctatct
1441 ataaaaacat gtgttacttt gaaaggatat ttcaaggaga agaatatatt tgactcagag
1501 agagattcat tcagtaatgg cagcctcagg cacctctgct actttcagag cctccgtttc
1561 ttcagctcct tcctcttctt cccaattgac ccatttgaaa tcacccttca aagctgtcaa
1621 atatacgcct ctgccatcgt ctcgctccaa gtcatcatcc ttctccgtct cctgcaccat
1681 cgccaaggac ccgcctgttc tcatggccgc cggatctgac ccggccctgt ggcaacgacc
1741 cgattcgttc ggtcggtttg ggaagtttgg tgggaagtat gtacctgaaa cccttatgca
1801 cgctctatct gagcttgaat ccgctttcta tgctcttgcc accgacgatg atttccaggt
1861 agagaccttg ttcttcgtga ttagataaaa atgagacatt ttgcgttgtg ggtcgttgga
1921 attgtagcgt tatcatttgc gggttgtgat aaagttcgga tcttgttgat gggatatgtt
1981 gattctaggg catttgaatg aaattggttc tgttcattta tgaaatggtt agaaatatgg
2041 ttttggttcc tctggttcaa tgtctcattc atgtcctttg tgtatttgtg tgaatgaatc
2101 tttgttagag agagttggct ggaatcttga aggactatgt gggtagagaa agtcctctgt
2161 attttgcaga gaggcttacg gagcattaca ggcgcgagaa tggcgaaggg cctcttatat
2221 acttgaagag agaagacttg aatcacacag gagctcacaa gattaacaac gctgtggctc
2281 aggctcttct tgctaagcgg ttggggaaga agaggattat tgctgagaca ggagcgggtc
2341 aacatggtgt ggctacagct actgtgtgtg cccgttttgg tttggagtgt attatctata
2401 tgggtgctca agatatggag agacaagcac tcaatgtgtt cagaatgcga cttcttggtg
2461 ccgaggtgat gcttaggatt catttccatc tcttttaaat gtgataaact ttgttactga
2521 accttttttt gggtgtctgg tgctataggt gagaggagtc cactctggaa cagcgacatt
2581 gaaggatgcg acatctgaag cgataagaga ttgggtgaca aatgtggaga ctacccatta
2641 catattggga tctgtggcgg gtcctcatcc ttaccccatg atggtcagag actttcacgc
2701 tgtgattggt aaagaaacaa ggaaacaagc gttggagaaa tggggcggga agccggatgt
2761 cttggtggct tgtgtcggtg gtggttcaaa cgctatggga ctcttccatg aatttgtgaa
2821 tgacacagag gtccggatga tcggtgtgga agcagcggga ttcggattgg acagtggcaa
2881 acacgctgct acattgacaa agggagatgt tggtgtcctc catggagcta tgagttactt
2941 gctgcaagat gatgatggac aaatcattga accacactcc atcagtgcag ggtaagtgaa
3001 attctctatt cagatccggt aaccataata agtctactca gttagccacg gctgcatgaa
3061 catggtttga ttttctctta cagattggac taccctggag tcggacccga gcacagtttc
3121 tttaaagaca tgggacgggc tgaatactat agcataaccg acgaagaagc gttggaaggt
3181 aatgtaccga tgcatagtct ttagttctgt atttgaaaat ctgttctttg tttcataact
3241 tacctttgca ttccctttgg tttgtggtct gtagcgttca agagagtgtc gcggttagag
3301 ggaatcattc cagcactgga gacctcacac gcactagctt acctcgagaa gctatgtccc
3361 acattatcag acgggacgag agtggtcttg aacttcagcg gaagaggaga taaagatgtt
3421 cagacagtgg ctaaatatct tgatgtttga agagagaaac acttatgttt gtttttcagt
3481 tattcagaca gaatgaaatc ctttttgtac ttgtaatctg tttgtacttt acttctctct
3541 ggtcagtacg tttccaacaa aattggaaat agaaaaagat tacgaagaga acgaaatttg
3601 aataaatgta ttatcaataa aataaaaaac atttcatatt ggaaatagtc ttatgttcct
3661 caatctgctt tctttttcgt tttcaaaacg aaatcctact tttgcagata caaacatgag
3721 atatgatctc agaactctaa tagcttattt atctcaaatt tcatacgaga tgttacattg
3781 gaaagaggtc caaattgcta gaagacatat atattcgcat atagtatacc aatctgccaa
3841 agttatgttc cccttgtgaa ttgtgatcct taagatttaa gaaaaaatag aagtggtgac
3901 tttgatgtat gaaaaataat aatataagca aagttgcaaa atccaataga gaaagaatct
3961 tcttttagta tatccttcac caaacatatc tctctgcctc ttctgaaatc ccaagagaca
4021 tgtctctctc tctcttttcc aagtccgaac caaatctaat gtatcaacat tacttcttct
4081 gaagtcccaa gagacattgc tctctctctt ttccaagtct gaaccaaatc taatgcatca
4141 acattacttt tgtaataaaa ttcaataagg attagctgat tactgttgtc actctttggt
4201 ttcttttcct ttgttaatag aattc
//
LOCUS HS5E1P 2590 bp ds-DNA VRL 17-JUL-1990
DEFINITION Murine cytomegalovirus early transcription unit e1 protein gene,
complete cds.
ACCESSION M35146
KEYWORDS .
SOURCE Murine cytomegalovirus (salivary strand strain Smith; ATCC VR-194)
DNA, passed in mouse (BALB/c) embryonal fibroblast cell line B25.
ORGANISM Murine cytomegalovirus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Betaherpesvirinae.
REFERENCE 1 (bases 1 to 2590)
AUTHORS Buehler,B., Keil,G.M., Weiland,F. and Koszinowski,U.H.
TITLE Characterization of the murine cytomegalovirus early transcription
unit e1 that is induced by immediate-early proteins
JOURNAL J. Virol. 64, 1907-1919 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 163 955 early transcription unit e1, exon 1
1049 1225 early transcription unit e1, exon 2
1552 1574 early transcription unit e1, exon 3
pre-msg 44 2548 e1 mRNA and introns
IVS 956 1048 e1 intron A
IVS 1226 1551 e1 intron B
signal 16 20 TATA signal
signal 2548 2553 Poly-A signal
BASE COUNT 564 a 732 c 751 g 543 t
ORIGIN
1 gcgtgatgca gactttataa atcgcaagcc ggagcggatc ctaggctcat ttcttctgcg
61 atcttcgcat ttagcagacc aaatgctgat agttcctgcg tcgcggtaga ttacgtgccc
121 acttttctcg tcgcgaccgg tgaaaagacc ttcgttcgga ccatggccgc gccagatcga
181 cgcggatcgc ccatcgtttc gagacgacgt tacttcacct tccggaactc gagtcggacg
241 ctgcatcaga atgtcacccg catgttcgac ctccgtcagt ggacctacga gagcgcccgc
301 gttttggact gcacgaacgg cgagggccgc accgagaact gggggcccgg ctggctgtgc
361 gcgaccatca tgcagtctcc ggacagcggc tcgtcgtccg gatcggcggc ccagggatgc
421 atgtctctgg acatcacctc ggacgacgga ccggagaagg tcatgaacat gttccaccgc
481 gggagcatcg tctgtaacaa gaccgtctct tcggtcgcga cgccgccggg gtccgacggc
541 acgggcgcca gcctcttgac tctggtggcg gacggcagct tgctgcaagt catcctggtc
601 gaacactcgc cgggcgccca ccgcgccgaa gccgacgatg tcgccgagag cgtcggcaac
661 gccgcggtgg ccgccgccgt gcagtcggcc acttcgatgg gtagctcgag ctccgtcggc
721 ggcgccggtg gcggcggctt caattcttcg gaagaacgtc gccgcgataa agactcgtcc
781 aaatctcacg atgacgaccg acgtaagaga ttcgaactct gcaacggtac ccttcatagg
841 gagcggacct cgggtagctc cggatcgagc ggcggcggcg gcggcagcgg cagcggaggc
901 agcagcggcc tcagtacaaa gcagaaggag cgcagcagga ggctcgaaga ggaatgtaag
961 tagttctgat cggggtttta ctggtcgctt tccccctcga ccttcggtga accgtcacgg
1021 gactaatgtt gttactctgt tcggaaaggt tctccacgca gcggggggga gcccaagcgc
1081 cagaagaccc accacgatcc gagaccggag agggatctgg aaccgcctag gagcagtacc
1141 actgtcgatg gtaatgcaga cgaagctgga gcggtctcct ttttgaacag ttatgctgct
1201 agttcgctgt ccgccgtcag cgatggtgcg ttaccaccct cctctccggt gactatagag
1261 acaccaaggt ttaatgactt gatagaatcc gaggaggaag acgatggaac tcggtgttcg
1321 tctgccgatg gtacggcgac ggtaaccgag accggggcgt ctgccgagat cgagcaaacg
1381 ttcaccgacg gtgggctagc atgcatgagg tccttaattg actcgcgggc cctcatggat
1441 ctcgccttta cgagccgcat gacggcggtg aagtcgggat ctgagggagc gggcaacacc
1501 tgtcctgggg aacagaggct ttctaactgc aatcttcctc cgaattcaca ggaggacaac
1561 aacccgcgtt ttgaaggcgt ctttaattcc ctgatgagac tcttagacga gtataaagac
1621 aagagcacca gctcgagttc cgccgccgta ccgggatcca gttgtacagg caccacgcag
1681 acagagggga tgcctcacca caggggtatc ttcgacaggg acacatccgc cgctcagcag
1741 aaacctgtat gtgagatacg gcccttcatc gacatagctc gctgtgtcga gccggcggcg
1801 gcgccgaccg cttcgcgctc cagagccacc acggccgccg ctcgcaggcg gggaagaggc
1861 tcgcagcctc ggaggaacag tcgggtcgcg gcgcgcgtcg cccaggtcga tccgaccgac
1921 ccacccgcag ctgctgctgc cgccgccgcc ggtcccacca cgcggggaag gagcaggagt
1981 aggaggggaa gacgaggccc agacctcacg gaggacgggt tggagattgt cgagacgggc
2041 gccaccgccg gcactctggc cgtttccgag gaggaaaccg cgatggccgc cgctatgtta
2101 gaggatatgg tcgacttgga caatgtgttc gatgatctta attgatctca tgaaagatgg
2161 ataaagaaat gatgctctgt aaactctaac tgactgttgt tatcgatcgc ctcgtctcgc
2221 tcgggtcttc tgccgatgct gccgtctgta agcacgcaag caagcactcg ctctatagca
2281 gtagaatatg tatggatgat atacttactg ttgttgttca cttactatta tcacctatat
2341 aaaacttgta ttcatgatac ttaccactga aaaaattgtg tatgtctttt tatacttttg
2401 accaccgtta ttgatgataa gtatactctt ctaactgtgt gtgtctctgt ctaccgggcc
2461 gatggagata ttatccctgg tccccctctg tctgtgctaa caacataaca tccatgtaaa
2521 taaaattaat atttttaata ttttatcaat aaaaaccaca catttgttac aataaaccgc
2581 gctcgccttt
//
LOCUS HSE14GP 3351 bp ds-DNA VRL 17-JUL-1990
DEFINITION Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete
cds.
ACCESSION M35145
KEYWORDS glycoprotein 14.
SOURCE Equine herpesvirus type 1 DNA.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 3351)
AUTHORS Guo,P., Goebel,S., Perkus,M.E., Taylor,J., Norton,E., Allen,G.,
Languet,B., Desmettre,P. and Paoletti,E.
TITLE Coexpression by vaccinia virus recombinants of equine herpesvirus 1
glycoproteins gp13 and gp14 results in potentiated immunity
JOURNAL J. Virol. 64, 2399-2406 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 300 3242 glycoprotein 14 (gp14) precursor
sigp 300 554 glycoprotein 14 (gp14) signal peptide
matp 555 3239 glycoprotein 14 (gp14)
signal 3251 3256 Poly-A signal
BASE COUNT 882 a 892 c 858 g 719 t
ORIGIN
1 aacgttgggt tgttaccgca tctcaaggag gaactagctc ggtttatgat tactgcggct
61 aaaggtaatt ggtcaattag cgagtttcaa aggttttatt gctttgaggg agtgacaggt
121 gtgacggcca cgcagcggct ggcgtggaaa tatatcgggg agctcatcct agccgccgca
181 gtattctcct cggttttcca ctgtggagag gtgcgcctcc tgcgcgcaga tcgtacctac
241 ccggactcca gcggcgcaca gcgctgcgtg agcggcattt acataaccta cgaggcgtca
301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg
361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct
421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa
481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct
541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt
601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac
661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc
721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg
781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct
841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat
901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga
961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata
1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt
1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg
1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg
1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt
1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc
1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag
1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact
1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg
1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tctacgttta
1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag
1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta
1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc
1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg
1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca
1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa
1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg
1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac
2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg
2101 cgtggtgtac cctccaaaac aaagagcgga ccctatggaa cgaaatggtg aagattaacc
2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg
2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc
2281 gctcgatgga cagtaacacg tgctactccc gcccccccgt aacatttaca attactaaga
2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg
2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag
2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg
2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc
2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac
2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg
2701 acaataccgc agtgattatg caggggatcg ccagcttttt caagggcctg ggtaaagtgg
2761 gggaggccgt gggaacgctc gttctcggcg ccgccggcgc tgttgtttca accgtatctg
2821 gaatagcttc gtttttaaac aacccatttg gggggctagc catcggcctg ctggtaatcg
2881 ccggcctggt agctgcgttt tttgcttaca gatatgtaat gcagatccgc agtaacccca
2941 tgaaagctct ataccccata acaacaaagg ccttgaaaaa caaagccaaa acttcctacg
3001 gccagaacga ggaggacgat gggagcgact ttgatgaggc caagcttgaa gaggctcgcg
3061 aaatgatcaa atacatgtct atggtttcgg ccctggaaaa gcaggaaaag aaagctataa
3121 agaaaaacag tggggttggc ctgatcgcca gtaacgtctc aaagctggcc ctgcgaaggc
3181 gcggtcccaa atatacccga ctccaacaga acgataccat ggaaaatgaa aaaatggttt
3241 aaacatgttt aataaatatt atgacacgta ctcaaagtgt gacctcatat ttgcataacc
3301 actttctagt tccggcccca aggatattta agcctagtat ctccgccgag g
//
LOCUS MLV3ENV 232 bp ds-DNA VRL 17-JUL-1990
DEFINITION Murine leukemia virus p15-gag protein gene, complete cds.
ACCESSION M35144
KEYWORDS p15-gag protein.
SOURCE Murine leukemia virus (strain Emv-3) DNA.
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 232)
AUTHORS Mercer,J.A., Lee,K.H., Nexo,B.A., Jenkins,N.A. and Copeland,N.G.
TITLE Mechanism of chemical activation of expression of the endogenous
ecotropic murine leukemia provirus Emv-3
JOURNAL J. Virol. 64, 2245-2249 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 17 > 232 p15-gag protein
BASE COUNT 51 a 57 c 53 g 50 t 21 others
ORIGIN
1 gaccgcttct aaaaacatgg gaccgaccgt aaccacccct ctgagtctga ccctagaaca
61 ctgggaagat gtccagcgca tcgcgtccaa tcagtccgta gatgtcaaga agagacgctg
121 ggtcaccttc tgctctgccg agtggccaac tttcggtgta gggtggccac aagatggtac
181 ttttaatnnn nnnnnnnnnn nnnnnnnnaa atctaaggtg ttctctcctg gt
//
LOCUS CTRTPDC 1740 bp ss-mRNA PLN 17-JUL-1990
DEFINITION C.roseus tryptophan decarboxylase mRNA, complete cds.
ACCESSION M25151 J04521
KEYWORDS decarboxylase; tryptophan decarboxylase.
SOURCE C.roseus leaf, cDNA to mRNA.
ORGANISM Catharanthus roseus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Gentianales; Apocynaceae.
REFERENCE 1 (bases 1 to 1740)
AUTHORS De Luca,V., Marineau,C. and Brisson,N.
TITLE Molecular cloning and analysis of cDNA encoding a plant tryptophan
decarboxylase: Comparison with animal dopa decarboxylases
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2582-2586 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Brisson, 01-JUN-1989
FEATURES from to/span description
pept 70 1572 tryptophan decarboxylase (EC 4.1.1.28)
mRNA 1 1740 tryptophan decarboxylase mRNA
BASE COUNT 505 a 355 c 359 g 521 t
ORIGIN Unreported.
1 ctctctctct ctctctctaa gactttctct ctctacacat acacctacac cagaaaaaag
61 aaaaaaataa tgggcagcat tgattcaaca aatgtagcca tgtccaattc tccagttgga
121 gaatttaagc cacttgaagc tgaggaattc cgaaaacaag cccatcgtat ggtagatttc
181 atagccgatt attacaaaaa tgtggaaaca tatccggtcc ttagcgaagt cgaacctgga
241 tatctccgaa aacgtatccc cgaaaccgct ccttacctcc ccgaaccact tgacgacatc
301 atgaaagata ttcagaagga tattatccca ggaatgacaa attggatgag ccctaatttt
361 tatgcatttt ttcctgccac tgttagttca gctgcctttt taggagaaat gttgtctact
421 gccctaaatt cagtaggctt tacttgggtt tcttcaccag ccgccaccga attagaaatg
481 attgttatgg attggttggc tcagatcctt aaactcccca aatctttcat gttttcaggt
541 accggtggcg gcgtcatcca aaacaccact agcgagtcca ttctttgtac aatcattgcc
601 gcccgggaaa gggccctgga gaagctcggt cccgatagta ttggaaaact tgtctgttac
661 ggatccgatc aaacccatac catgttcccc aaaacttgca aattggcggg aatttatccg
721 aataatatta ggttaatacc tacgaccgtc gaaacggatt tcggcatctc acctcaagtt
781 ctacgaaaaa tggtcgagga tgacgtggcg gccggatatg taccgctgtt cttatgcgct
841 accctgggta ccacctcgac cacggctacc gatcctgtgg actcactttc tgaaatcgct
901 aacgagtttg gtatttggat ccacgtggat gctgcttatg cgggaagcgc ctgtatatgt
961 cccgagttta gacattactt ggatggaatc gaacgagttg actcactgag tctgagtcca
1021 cacaaatggc tactcgctta cttagattgc acttgcttgt gggtcaagca accacatttg
1081 ttactaaggg cactcactac gaatcctgag tatttaaaaa ataaacagag tgatttagac
1141 aaagttgtgg acttcaaaaa ttggcaaatc gcaacgggac gaaaatttcg gtcgctgaaa
1201 ctttggctca ttttacgtag ctatggagtt gttaatttac agagtcatat tcgttctgac
1261 gtcgcaatgg gcaaaatgtt cgaagaatgg gttagatcag actccagatt cgaaattgtg
1321 gtaccgagaa acttttctct tgtttgtttt agattaaaac ctgacgtttc gagtttacat
1381 gtagaagaag tgaataagaa acttttggac atgcttaact cgacgggacg agtttatatg
1441 actcatacta ttgtgggagg catatacatg ctaagactgg ctgttggctc atcgctaact
1501 gaagaacatc atgtacgccg tgtttgggat ttgattcaaa aattaaccga tgatttgctc
1561 aaagaagctt gatgaataag taagggtttt tttttaattt ttttttaaat tttatatttg
1621 ctgattgttt gaagagttta aaaataaagt gatttgtaaa ggtttattgt actcaaacaa
1681 tcatgcaatt aattatatgt attaattatg acatgagaat aaaatagaat ttgtgtgtgc
//
LOCUS WHTAMYA 2397 bp ds-DNA PLN 17-JUL-1990
DEFINITION Wheat alpha-amylase gene, complete cds.
ACCESSION M16991
KEYWORDS alpha-amylase.
SOURCE Wheat (strain Chinese Spring) distal half-grain DNA, clone
lambda-Amy3/33.
ORGANISM Triticum aestivum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 2397)
AUTHORS Baulcombe,D.C., Huttly,A.K., Martienssen,R.A., Barker,R.F. and
Jarvis,M.G.
TITLE A novel wheat alpha-amylase gene (alpha-Amy3)
JOURNAL Mol. Gen. Genet. 209, 33-40 (1987)
STANDARD full staff_review
COMMENT Draft entry and printed copy of sequence for [1] kindly provided by
D.C.Baulcombe, 13-AUG-1987.
The region bounded by nucleotides 975 and 1032 might be a signal
peptide.
FEATURES from to/span description
pept 963 1052 alpha-amylase, exon 1
1138 2028 alpha-amylase, exon 2
2133 2393 alpha-amylase, exon 3
pre-msg 908 > 2397 alpha-amylase mRNA and introns (5' end +/- 3
bp)
IVS 1053 1137 alpha-amylase intron A
IVS 2029 2132 alpha-amylase intron B
BASE COUNT 613 a 615 c 599 g 570 t
ORIGIN 874 bp upstream of SstI site
1 aacattcaca atgcaaaatc tatatcatca gatccatcat gaaatatatt ttcatattat
61 atattttttg gtattgtagc tgcttttttc cttgaataca cacgagtgtg atatatatat
121 ttttggtatt gtagctgctt taattatata tatatatata tatatataca tatatatata
181 tatatataat taaagaagat gatgggctaa gaagcccttg gtgttgtaga tgttgatatt
241 ttttgatata attttggtca aactttataa agtttggatt ttcagaaatc taacacagag
301 agtaaaaaga aacggaagga gtatataata agagttctag cagccgctcc tccaactgca
361 cccaactgca ccctcgcctg agccggagcg gcgaggggga ggctaggtcg ccgcgccacc
421 atgtaaaaag attctttttt tatctttacc gactctagcc tcaccttgta cacacaaaaa
481 caagccaatc gaggtttttt tctagaatca acatcaaacc atccggtttt atgtgaaccg
541 cccggttcat cggttttttt tcaaaaaacg tctgattcga ccgggttttc ctggttggac
601 tcaatcggtc caccagactt aaagaactgc aggaatcgcc ggttccggtt ttcaaaacta
661 tgagcagagt gcttgtctag tgcagatata catggagagt aattaaacgg cagtaattaa
721 tagacaaaac tttagctcct aatccatttt atctcagaag ccagcggtcc ttatccatgc
781 ggatatcttt ggagaaaaat taaggagaat tgatagtttt tactaggcac ctactaatct
841 ctaagctcca agtacctata aatacaagcg agctcaagca ttgccaaacc aaggagtagt
901 cgtcggtcca ggagtccaga tttcatagga agtagaggcg agctgcactg cagactgtag
961 ctatgggcaa gcactctgct actctctgtg gccttctggt tgttgtgctt tgtctcgcct
1021 ccagcttagc acaggctcaa attcttttcc aggtgagaaa atgcatgcat cttcaagttt
1081 catttgcacc cagaatcttt catatgatga aatgtgcttt ggtgtttcgt tggccagggg
1141 tttaattggg aatcgtggaa gacacaaggc gggtggtaca agttcatgca agggaaggtg
1201 gaggagatcg ccagcaccgg cgccacccac gtctggctcc ccccgccgtc gcaatccgtg
1261 tcgccggagg gctatctgcc ggggcagctc tacaacctca actccaagta cggcagcggg
1321 gccgacctca agtcgctgat ccaggcgttc cgcggcaaga acatctcgtg cgtcgccgac
1381 atcgtcatca accaccgctg cgccgacaag aaggacggcc gcggcgtcta ctgcatcttc
1441 gagggcggga cgtccgacaa ccgcctcgac tggggccccg acgagatctg cagcgacgac
1501 accaagtact ccaacggccg cggccacagg gacaccggcg gtggcttcga cgccgcgccc
1561 gacatcgacc acctcaaccc gcgggtccag agggagctct ccgcctggct caactggctc
1621 aaaaccgacc tcggcttcga cggatggcgc ctcgacttcg ccaagggata ctccgcggcc
1681 atggccaaga tctacgtcga caacagcaag ccggcgttcg tcgttggcga gctctacgac
1741 cgggaccgac agctgctcgc gaactgggtg cggggcgtcg gcgggccggc cacggcgttt
1801 gacttcccca ccaagggcgt tctccaggag gccgtccagg gcgatctggg gaggatgcgc
1861 ggcagcgacg gcaaggcgcc cggtatgatt gggtggatgc cggagaagac cgtcacgttc
1921 atcgacaacc acgacactgg gtcgacgcag aggctgtggc cattcccctc ggacaaggtc
1981 atgcagggct acgcctacat cctcacacac ccgggcatac catgcatcgt aagtagtagc
2041 acactacaca acctcaccat aacatttcgc atcaaacgta ccccacgatg tttgtgatct
2101 gaacttacaa ctacttggtt ttgcgcgcgc agttctacga ccatgtgttc gactggaaac
2161 tgaagcagga gatcaccgca ctggctacgg tcaggtcaag gaacgggatc catccgggta
2221 gcacactgga tatcctcaaa gccgagggcg atctctatgt tgccaagatt gggggcaagg
2281 ttataaccaa gatcgggtca agatacaata ttggcgacaa tgtgatcccc tcgggtttca
2341 agattgcagc taaaggcaac aactattgtg tctgggagaa gagcggcctc tgaattc
//
LOCUS STAFEMA 3446 bp ds-DNA BCT 17-JUL-1990
DEFINITION S.aureus factor essential for expression of methicillin resistance
(femA) gene, complete cds, and trpA gene, 3' end.
ACCESSION M23918
KEYWORDS femA protein.
SOURCE S.aureus (strain NCTC 8325; organism BB270) DNA.
ORGANISM Staphylococcus aureus
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Micrococcaceae.
REFERENCE 1 (bases 1 to 3446)
AUTHORS Berger-Baechi,B., Barberis-Maino,L., Straessle,A. and Kayser,F.H.
TITLE FemA, a host mediated factor essential for methicillin resistance
in Staphylococcus aureus: Molecular cloning and characterization
JOURNAL J Antimicrob Chemother 23, 671-673 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by B.Berger-Baechi, 11-APR-1989.
FEATURES from to/span description
pept < 1 294 trpA protein (AA at 1)
pept 577 1878 FemA protein
pept 1897 3156 ORF 419 protein
pre-msg 514 > 3446 FemA mRNA
rpt 361 390 large inverted repeat, left end
rpt 405 434 large inverted repeat, right end
rpt 416 423 small inverted repeat, left end
rpt 430 437 small inverted repeat, right end
BASE COUNT 1281 a 448 c 617 g 1100 t
ORIGIN 3 bp upstream of EcoRV site; segment 18 of S.aureus chromosome.
1 gatatcgtat cccatgcgga aggctttatt tatactgtga cgatgaatgc gacaacaggg
61 caaaacggtg cgtttcatcc agaattaaaa cgaaaaattg agtcaattaa agcgatagcc
121 aatgtgccag ttgtcgcagg atttggtata agaacaccac aacatgttgc agatataaaa
181 gaggttgcag atggcattgt cattggtagc gaaatcgtta agcgatttaa atctaacacg
241 cgtgaggaaa tcattaaata tttacaatct atccaacaaa cattgaataa ttaagtttac
301 ttgatttaaa aaaattaggc gaatactgtt tgaaaaagtg aaaaacggtg aattataaaa
361 ttgaatacaa tttcaaaaaa agtaatatga gcaaacccaa acgttcatat tacttttttt
421 gaaattgtat tcaaaaatct aaatattact ataaaagtat acgcaattaa agcgtttatg
481 ttttagtttt aacattaact attgtatact tatttagatt agatttatta tttttgacat
541 ttgcagaggg gaaatagaaa aactgcaaat acggaaatga aattaattaa cgagagacaa
601 ataggagtaa tgataatgaa gtttacaaat ttaacagcta aagagtttgg tgcctttaca
661 gatagcatgc catacagtca tttcacgcaa actgttggcc actatgagtt aaagcttgct
721 gaaggttatg aaacacattt agtgggaata aaaaacaata ataacgaggt cattgcagct
781 tgcttactta ctgctgtacc tgttatgaaa gtgttcaagt atttttattc aaatcgcggt
841 ccagtgattg attatgaaaa tcaagaactc gtacactttt tctttaatga attatcaaaa
901 tatgttaaaa aacatcgttg tctataccta catatcgatc catatttacc atatcaatac
961 ttgaatcatg atggcgagat tacaggtaat gctggtaatg attggttctt tgataaaatg
1021 agtaacttag gatttgaaca tactggattc cataaaggat ttgatcctgt gctacaaatt
1081 cgttatcact cagtgttaga tttaaaagat aaaacagcag atgacatcat taaaaatatg
1141 gatggactta gaaaaagaaa cacgaaaaaa gttaaaaaga atggtgttaa agtaagattt
1201 ttatctgaag aagaactacc aatttttaga tcatttatgg aagatacgtc agaatcaaaa
1261 gcttttgctg atcgtgatga caaattttac tacaatcgct taaaatatta caaagaccgt
1321 gtgttagtac ctttagcgta tatcaacttt gatgaatata ttaaagaact aaacgaagag
1381 cgtgatattt taaataaaga tttaaataaa gcgttaaagg atattgaaaa acgtcctgaa
1441 aataaaaaag cacataacaa gcgagataac ttacaacaac aacttgatgc aaatgagcaa
1501 aagattgaag aaggtaaacg tctacaagaa gaacatggta atgaattacc tatctctgct
1561 ggtttcttct ttatcaatcc atttgaagtt gtttattatg ctggtggtac atcaaatgca
1621 ttccgtcatt ttgccggaag ttatgcagtg caatgggaaa tgattaatta tgcattaaat
1681 catggcattg accgttataa tttctatggt gttagtggta aatttacaga agatgctgaa
1741 gatgctggtg tagttaaatt caaaaaaggt tacaatgctg aaattattga atatgttggt
1801 gactttatta aaccaattaa taaacctgtt tacgcagcat ataccgcact taaaaaagtt
1861 aaagacagaa ttttttagga agggaattat caaaacatga aatttacaga gttaactgtt
1921 accgaatttg acaactttgt acaaaatcca tcattggaaa gtcattattt ccaagtaaaa
1981 gaaaatatag ttacccgtga gaatgatggc tttgaagtag ttttattagg tattaaagac
2041 gacaataaca aagtaattgc agcaagcctt ttctctaaaa ttcctactat gggaagttat
2101 gtttactatt cgaatcgtgg tccagtaatg gatttttcag atttaggatt agttgattat
2161 tatttaaaag agttagataa atatttacag caacatcaat gtttatatgt taaattagat
2221 ccgtattggt tatatcatct atatgataaa gatatcgtgc catttgaagg tcgcgagaaa
2281 aatgatgccc tagtaaactt gtttaaatca catggttacg agcatcatgg ctttacaact
2341 gagtatgata catcgagcca agtacgatgg atgggcgtat taaaccttga aggtaaaaca
2401 cccgaaacat tgaaaaagac atttgatagt caacgtaaac gtaatattaa taaagcgata
2461 aactatggtg ttaaagtcag attccttgaa cgtgatgagt tcaatctttt cttagattta
2521 tatcgtgaaa ctgaagagcg tgctggattt gtatcaaaaa cagatgatta tttttataac
2581 tttattgaca catatggaga taaagtatta gtaccattag catatattga ccttgatgaa
2641 tatgtgttaa agttgcaaca ggaattgaat gacaaagaaa atcgtcgtga tcaaatgatg
2701 gcgaaagaaa acaaatcaga taagcaaatg aagaaaattg cagaattaga taagcaaatt
2761 gatcatgatc agcatgaatt attgaatgca agtgaattga gcaaaacgga cggcccaatt
2821 ctaaaccttg cttctggcgt ttattttgca aatgcatatg aagtgaatta tttctctggt
2881 ggttcatcag aaaaatataa tcaatttatg ggaccataca tgatgcattg gtttatgatt
2941 aactattgct tcgataatgg ctatgatcgt tataatttct atggtttatc aggtgatttt
3001 acggaaaaca gtgaagatta tggcgtatac cgctttaaac gtggatttaa tgtacaaatc
3061 gaagaattaa taggggattt ctataaacca attcataaag tgaaatattg gttgttcaca
3121 acattggata aattacgtaa aaaattaaag aaatagatgt aaatgatgtt agggcttata
3181 gttattgata ctataggctc ttttttatat gttttttaat aaagcctcga ataatgatat
3241 attaaatgac aagtttaaca agtgattgca ataaatataa atatgaattg cacaacctat
3301 ttgtaaatga ttgagttgag tatacttttt aattcattca atatgactca aaacaagcca
3361 atatgtatta tatttttggt gaataaacct tagcataaca atttatgtct gattatatat
3421 gtgtgtgttt gcaagtgtgt gatatc
//
LOCUS WTVPNS11A 1172 bp ds-RNA VRL 17-JUL-1990
DEFINITION Wound Tumor Virus genomic segment S10 encoding Pns11 nonstructural
polypeptide, complete cds.
ACCESSION M24114 J04352
KEYWORDS polypeptide Pns11.
SOURCE Wound Tumor Virus, cDNA to viral RNA.
ORGANISM Wound tumor virus
Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus.
REFERENCE 1 (bases 1 to 1172)
AUTHORS Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L.
TITLE Complete nucleotide sequence of wound tumor virus genomic segments
encoding nonstructural polypeptides
JOURNAL Virology 171, 222-228 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by D.L.Nuss, 20-APR-1989.
FEATURES from to/span description
pept 25 1068 polypeptide Pns11
BASE COUNT 359 a 222 c 236 g 355 t
ORIGIN
1 ggtatttttg ttggatctgc tgtgatggat gcttctgtcg acaggatcac aaatctacat
61 tttgaaatcc tcgctaaagc gggtggacat gaaatacacc aaaaatatga agcaattagg
121 aagttgaacc ttacgggtga ttcttcaaag agtaatatct ctgtatctgc aaggtcagca
181 atattgaaat gggctgacgc taaacaagga tacatagcca gccaactgga cgatcgagat
241 tatggcgacc tcatcgccaa agctgtaatc tttgtcccga tgtctgttat aacaggaggg
301 aaaaatccga aggatttaat accttacgga gttgtcgctg cagtgttaat ttttgttcct
361 gaaacgttga cactactgga tgagatagtt attaatttaa tgcatgataa gaaacctttg
421 tcatcaattt tattgacaaa gatactcaga gacatgaaaa ttgatgtctg tggtagtaat
481 tttgattcat tttattactg cccaatttcg aggtataata gacatattat taagctagct
541 ggtgcattac ctcaaatgcc gactagcgtt agattatcag ttaatgattt agctcgagta
601 gctatttccg aagtacataa ccagcttatt tcggataaac agatgttttt caaactacca
661 actggttttt cgccaaaagt gcattgttta aaagtacttt gcactactga aatggaaatt
721 tttcagaaat gggttagaac gttcatgtct gatagaccga acgaattcat ctattctgac
781 cagtttaata tattatctag gaccacctac ttttcttcag atgatccttt ttcatttttt
841 actctttgga ggggatggtc aacctacaaa gaaatactct ctcaagatca agcttcaagc
901 tttcttgaag ccataggaag tggaaagccg ttgagaagtt caatcgctac ttttccatca
961 atgtttgatg aaggtgctat ttacatcaga tatgaatgga taactccaaa agattcagca
1021 aattcgaaga aggctggctc ttcagcaccg agcgcgccta aaatgtagaa gatttgtcct
1081 tgagattcgc atctggagca gcccggccta gtcacccgga gccagatcga gctaaaactc
1141 gagttgaatt ccccagctcc aacaaacatg at
//
LOCUS WTVPNS4A 2565 bp ds-RNA VRL 17-JUL-1990
DEFINITION Wound Tumor Virus genomic segment S4 encoding Pns4 nonstructural
polypeptide, complete cds.
ACCESSION M24117 J04352
KEYWORDS polypeptide Pns4.
SOURCE Wound Tumor Virus, cDNA to viral RNA.
ORGANISM Wound tumor virus
Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus.
REFERENCE 1 (bases 1 to 2565)
AUTHORS Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L.
TITLE Complete nucleotide sequence of wound tumor virus genomic segments
encoding nonstructural polypeptides
JOURNAL Virology 171, 222-228 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by D.L.Nuss, 20-APR-1989.
FEATURES from to/span description
pept 64 2262 polypeptide Pns4
BASE COUNT 809 a 463 c 625 g 668 t
ORIGIN
1 ggtattgatc ccactttcta gatcagtgag ctagtgaagt aaggttcaaa actaatcatc
61 gccatgaagc aagtagccgg gtttattggt atgagtcaaa agaacaaagg aattcagcaa
121 aaccagtggc attccggtcc gccacaggga ctgctttcag gacaaacgaa agctgaacaa
181 ggtaccagca gtcagcaagc tggagtgaat caaggagaga attccaaatc tggggcggta
241 gtccagagga aaatgccaat gaggagagac gttaattctg ccgcacaacg tcaacatgta
301 aggaggttgg tgtacacacc atactttcca aatgaaacat ggaatatatc aacattcaaa
361 aatggagcta aagacgtaga acggtcggtt atatcgacgc tgaacaacgt cgctgagaat
421 agagtcatgg acaactgtac ttccagagtc atttttgaga tgactcaaat acaatttgag
481 tcacttccag atataataag aaatgaattt actagagttg gggatgacgc cttgaaatgg
541 gcggtccctg aagatttgaa atctgcagat cttgaccata tgatggtcgt gaaattatca
601 acagaaggca cgatataccc aacgacgctt atttttccag gtggatgtag tgggatggct
661 aaattgaaat cagtctactc attcttggaa agtcagctag aaagaatagt cacaccgaca
721 ccctcggtgt ctttgaagta tgttacaagc tgggctgagc acttatttga tctgtgctcc
781 ggacagctga tcaattcaca aaacgagcgt gtagacaaac tgcttggata catgatctgg
841 gatattgaaa aagctatcac tttgactgat caagtgatcg catgttacaa ccaccctgaa
901 gttgtgctga ggaggctagg ggcctcggat attgcatgcg ctgtgttagc tggggagtct
961 gtggtgaagc tcacccgcct agcgttatcc aagtctcctg ttgatggctg ctcatgctgt
1021 agaatattgg agctgattct aaatttacca tcacgtaagc ccaacgataa ggtgccgcag
1081 gtaccattgg acatactatt cgcatctgtc tatagatacg tatcagccat gtgcatgggg
1141 cgagtgctga atgggcgcat tgacgcatcg gggatacagt cgactgatca tgcgactgct
1201 tcgatcaagc tgaatgatat aattgtaaat gatcttgagt taaggagtat gggtgttgat
1261 aaaacgtcat cattcagagg aacgcaatct atgagggcgt tctacgttcc tgagaattta
1321 gcaggctcta tactagatag gattaacgtg ttggtcatgc gccattttgg aatcctgcat
1381 atgtggggtt ttaatggtgt agtacttcaa aatcaagagg ggtactgtga ctatcacatt
1441 ataacaggat taaaccactt aacaactata acgcatacta actccatggt tgcagtacat
1501 tggggtactg aatcaaggat ggataatata tttgaaataa aagcaagaac tcttccaacc
1561 gcatccgaga cgatgataac attgatcgaa aatgccttga aggagcaact gacatcaata
1621 gtgaaagacg gcctcaggaa gggagtaagt ttttcagtga aaagaaatat aaatgactct
1681 cgctttggat ttgagactaa ttcatcccca gcgatatttc tgaaactcag agacatgcta
1741 aaaagagcta agcctttctc agacctgtta tccttggcat tgtcaaaggt aataaaaaaa
1801 gagaacgcta tgatacaacg gtcaatcacc acagtggaag tggccgttgc cattaaaatg
1861 aaagtctatg ggttggatga gtatgtcagc ctgatgaaag ttgagaagaa ggaagtggaa
1921 agtggaagcc ttccacttca agaatttctg aaattaaaat caaatgcagc aggggcacag
1981 tcatcgactg ttgcagttaa gatgaaagaa gaagaagtca atagcaaagc ctattgcctg
2041 atatcagaaa ctattgttgt aaacatggac gcagtgagaa gtgcatgtgg agtagtgcaa
2101 tcagagaatc tggtcataaa gtcagaactt tcaggaccag aattgagtga aagtgtaact
2161 tcagggttga tggagttact aggaagaaat gccggaccaa gtaaatcttg ggctgaccaa
2221 gttgaagaag cggaaaatga agaggaaaaa caaaaggagt gagtgtgctc catttctgga
2281 tcggagtgtc ttgcgaggct cgggtctata gatttataaa catgtattat aattaatatt
2341 attagtggta atatattgca tttgtaaata gtgtgcttat ctcattaggc ctgccattag
2401 ggcggggtcg tgatgaaccc tcaggactat accacgtccc attttagctc gctatagcaa
2461 atttgagtat caacataagt gactggagtt taaaccctct tagctttatt cttgagcgaa
2521 gctataatga cgatggttgg tctagggctg atggatcatc atgat
//
LOCUS WTVPNS7A 1700 bp ds-RNA VRL 17-JUL-1990
DEFINITION Wound Tumor Virus genomic segment S6 encoding Pns7 nonstructural
polypeptide, complete cds.
ACCESSION M24116 J04352
KEYWORDS polypeptide Pns7.
SOURCE Wound Tumor Virus, cDNA to viral RNA.
ORGANISM Wound tumor virus
Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus.
REFERENCE 1 (bases 1 to 1700)
AUTHORS Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L.
TITLE Complete nucleotide sequence of wound tumor virus genomic segments
encoding nonstructural polypeptides
JOURNAL Virology 171, 222-228 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by D.L.Nuss, 20-APR-1989.
FEATURES from to/span description
pept 45 1607 polypeptide Pns7.
BASE COUNT 551 a 298 c 338 g 513 t
ORIGIN
1 ggtattttct cctgaaatac ttgacctggc tagggttcgg tgaaatgaag atttccgact
61 tctgtttcgc ctctgcaaat gatggatcat acactttaaa agcattttcg gaacttaatg
121 aatacaagga tgtagtgaaa cttgtctctg atgaaaagat tggagttgga tttcactgct
181 acaatttagg tctgatgaat atagttgaag atttttccgg aaatcttgac aacgaatctt
241 acttgaccag taaggtaggc aaaagaatgg cttccgagct agtcactgca tactcaaaat
301 ttgggtctac tagcagccgg accttacata gctcattaaa tctaccagtc gtaaatataa
361 cttcattacc aacatcccaa gctaaagatc tcaaaccgaa tcattcattg gatgataagg
421 gatctatgct acggactcaa atacattcaa ttctgactgg aaacggtccg ctaactataa
481 aacgaaggat agatgcgttt tattatagcg cctcaagtat attcactaga cacatgactt
541 cgaaatatgc taatcctgga tcaaatgtgc ctcaaaggtt ttcatttata ccagattgtg
601 caatgaataa gaaaccaacg ctatttctag agaatagaga taacgaactc caagattcca
661 tgaccatcat gttaatgttg ggacaggtct tttcggatgc tttaacatat tatctaaatg
721 cctcaatact atacggtatc ctaggtagaa ttgaatccaa agtgcaggtt gatttaccag
781 ccatcactct tgaatccgtt catgttacaa ataatttgga aatctctccg gccgcatttg
841 ctctaatagc gtcagtctgg cttgataaag ctgaaatatt atcaaaatta aatgcgatag
901 acttcatagt gtctcctgaa gataatgaag atagaatatc aaatttactg aaactgatgc
961 tgccagttca gtctaataat ataacggttg aaaaatctga cactaggttc tcggtaaccc
1021 attcagatgg atttatgagg tattatatgt gtttttctaa acacgaattt gattatggag
1081 accaccttga gtcttttggc atcccagtgc tgcgtgtaag actaggtaaa ccaatatcaa
1141 atgaactgaa caagccgatg ctggtaatgt tcaagaagca tgaatcaata tctagcataa
1201 atgttagata tcaagtaaga ggtggaagta tacctaaatt caggacctca gaatttagaa
1261 gagatatagg aatgttagtt gccaattctc ggtttatggc gactgatatt actttgatat
1321 tgtctacatt ttatccattc actcaggaaa cggataaatt atttattgaa caacacataa
1381 aagaaatttt tcttgatatg tatccatgga ttgataaact aacttcagcg gacgctaaat
1441 ctgaggttaa tatatcatat gggaatttag tgctatactc atatggagag ttagtcaaaa
1501 atagtatatt tattgcaatg atggacaatt gcaaggacgc ccggaatagt ttttctcggg
1561 ctgatatgag agagatccaa gcatttgtag cagcgtttac acagtgagta agattttttc
1621 atgaacctgg agcttgcgag gccaccagca gctaaaaccc gtaggtggag gttgtataca
1681 tggtgagtgg ggagcatgat
//
LOCUS ATUVIRD 3662 bp ds-DNA BCT 17-JUL-1990
DEFINITION A.tumefaciens virD1/D2 protein genes, complete cds.
ACCESSION M33673
KEYWORDS virulence protein.
SOURCE A.tumefaciens plasmid pTiC58 DNA, from Nicotiana tabacum.
ORGANISM Agrobacterium tumefaciens
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 3662)
AUTHORS Wang,K., Herrera-Estrella,A.H. and Van Montagu,M.
TITLE Overexpression of virD1/D2 genes in A.tumefaciens enhances
T-complex formation and plant transformation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by A.Herrera-Estrella, 11-APR-1990.
Author address [1]: A.Herrera-Estrella
Rijksuniversiteit Gent
Department of Genetics
Ledeganckstraat 35, Ghent, East Flanders
Belgium B-9000
e-mail [1]: jecop@gengenp.rug.ac.be
FEATURES from to/span description
pept 648 1091 unknown protein 1
pept 1125 2468 unknown protein 2
pept 2483 3079 unknown protein 3
ORF 3118 > 3662 unknown protein 4
BASE COUNT 870 a 975 c 1042 g 774 t 1 others
ORIGIN
1 gtcgacaatg cttcatcgat atcgagcgga gttaacatgg tcgggatcag aagcaggttt
61 gagctggcaa tgatcgtgtt gttgagttcg ctcgaaccac catgcgtatc ggccagcgca
121 taatcaaatc cttggagttc ggcgtcctca taggccgcct caaggagtgc catttcctcg
181 gcggcgtaga cttcgcagaa ggagccctag gtattgctgc gaagggcgtt ttctttccat
241 cgcgttagtg gtcggttttc atcagcgtcg aagagagcca atcgtttgcc gtcacttgca
301 aaggcggcgc aaagacccat gagtgccgtg gttttgccgg ctcctccctt gaaggagcaa
361 aatgtcagaa gtttcatgtc cttatcctgt cgattttgtg aagcggaagt gtgtctgtac
421 ttttatttgt gtgtatgatt ttgcgataat tcataagtaa tgtagtaatt acctgatttt
481 atatttcaat tttattgtaa tataatttca attgtaataa tataaaaata aatatccctt
541 atgtgttctt gatttcgttt tgtatatggc tagattccca tctgccacga cgaggaaatg
601 ctacggcggg gcaagttcag atctttccgt cttctatgga ggaagctatg tcgcaaggca
661 gtaggcccac ctcaagtgac attgccgtca accagcgcga atgcgtgaag gttgaaggct
721 tcaaggtcgt cagtacccga ttaagatcgg ccgaatatga gagtttttct catcaggcac
781 gcttgctggg cctctccgac agcatggcca tacgggttgc ggtgcgccgc attggtggct
841 ttcttgaaat cgacgcagag actcgtcata ggatggaggc catactacaa tccataggaa
901 cactctcaag caacattgcc gcgctgctat ctgcctatgc cgaaaatccg acaatggatt
961 tggaggcttt gcgagctgaa cgtatcgcct tcggtaaatc tttcgctgac ctcgacggct
1021 tgctccgttc cattttgtcc gtatcacggc ggcggatcga cggttgctcg ctgctgaaag
1081 acgccttgta gcactgacgt agcacttggc ggggaacata ttcgatgccc gatcgagctc
1141 aagttatcat tcgcattgtg ccgggaggtg gcaccaagac ccttcaacaa attatcaatc
1201 agttggagta tctatcccgg aagggcaggc tggagctgca gcgttcagcc cgacatctcg
1261 atattcccct gccaccggat caaatccacg aacttgcccg aagctgggtt caagagactg
1321 gaacttatga cgaaagtcag ccagacgagg aaaggcaaca ggagttgacc acccatatta
1381 ttgtaagctt ccccgccggt acaagccagg tagcggctta tgcggcgagc cgggagtggg
1441 cagccgagat gtttgggtca ggcgcagggg ggggccgata caactatctt acggccttcc
1501 acatcgatcg cgaccaccca catctgcatg tcgtcgtcaa tcggcgcgaa cttttaggac
1561 acggctggct gaagatatct cggcgccatc cccaactgaa ttacgacgcc ctgcgcataa
1621 agatggccga gatttcactt cgtcatggca ttgccctcga tgcgagccga cgagcagaac
1681 gtggcatcac cgagcggccg atcacttatg cccaatatcg gcgccttgag cgggagcagg
1741 ctcgccaaat ccgtttcgaa gacgcggatt tggaacagtc gtcgccgcaa ggagatcatc
1801 cagaattcag ccaacctttc gatacatccc catttgaagc atccgcgggc ggaccggagg
1861 acatgcctcg gcccaacaat cggcagaatg agtcgcaagt tcatctccag gagccagctg
1921 gtgtcagcaa cgaagccggt gtccttgtgc gggttgcatt ggagacggag cgccttgctc
1981 aaccattcgt ttccgaaacc attctcgcgg acgacatagg gagcggctct tcgcgtgttg
2041 ccgagggccg tgtggagagc gcaaaccgca ctcccgatat tcctcgcgca gcaactgaag
2101 ctgccacgca cacgacacac gaccggcagc ggcgtgcaaa gcgtcctcat gatgacgacg
2161 gagggccgag tggagcaaaa cgtgtgacat tggaaggcat cgcggttggg ccccagcgaa
2221 cggccggcga acaggatggc agtagtggcc ccttagaacg gcaagctgga acgtctcggc
2281 catctccacc gacggccacg acgcgggcca gcaccgcaac cgattcattg tctgctacag
2341 cccacctcca gcaacggaga ggtgtccttt caaagcgtcc gcgtgaagat gatgatggag
2401 aaccgagtga acgcaaacgc gagagagatg agcgcagcaa ggacgggcgt gggggaaata
2461 ggagatagag aaggaccgag tgatggcaaa tgaagagttc accagacact atgcgtggcc
2521 cgttcctgtg gcttcgaatg atgaagggcg tgggaccgcg cggatcccca tccaggcaca
2581 atcaatcgtt gctggagaag acggtcggga cacttcggtc ccaacggctt tgtcgcgacc
2641 gccaattgaa gatatgccgc acggcgtcca agaaacatcg gcgagtggcg gacgactggg
2701 agcggcccgt ctgcgggatt ccgtaatccc gccaggaata tccgaagccc gcacggacct
2761 atccgcaatt ttgcggaaaa aaagcggttc tttccgcacc ggtatgcagt atctgcgtgg
2821 gcttgaacgg gaaaattttg ataaacaaga cagggaagcc agtgcgttgc cagatttaag
2881 tgcaaggggc ataaagcgac cgcgcgaaat tgagtatccc ggcaatgcaa gcggattaac
2941 cataaagagg caagacggct taggcataga gatcaatact atctcggcat cttcgcccgt
3001 gaaccggggc cgcgcattcg tcgaactggc aaggcgcgcc ggaaccgggc gtgtacaatg
3061 ttcagccatc ggcagataga gcacagaact ctgcgcagga aagttcgacg tttcccgatg
3121 gcacttctgt ttccgctctc tattcaggcc cactcgcaga atggttcgag agggatactg
3181 gcagcgagac gaccaggaat tcaggcaaca ccatttcgtc accgctccga gggctggagg
3241 agttcggcga ttccgcggac agccggtatc ttgggcgcga agctcagagt ctttcagtta
3301 ccgtaacaac gcctaattcg aatgctgagg caagttctca tagcgcacac actgaaactc
3361 tcgacgatgt cagcagtgac cgctcgagcg aacaggggag gggtcccctt ggcgccgcaa
3421 tccttggatc ccatcatgac cttttcnccg cgtgcgcaga agttatcaca aacaaaccgc
3481 gattcccctg aactgaccga cgccgaccta gcaaaagtcg atgcggtgtt cgaatctctc
3541 tccaaggggc ccccggcggg ggagagcgcc gcaccggact ttcgtgagcg cggaccaggc
3601 agcgctttcc agaaagaagg cgtgtccgac agggcgaacg gggtgcccac taactgggag
3661 gt
//
LOCUS CHKFIBAB 2730 bp ss-mRNA VRT 17-JUL-1990
DEFINITION Chicken alpha-fibrinogen mRNA, 3' end, and ORF, complete cds.
ACCESSION M34096
KEYWORDS alpha-fibrinogen; fibrinogen.
SOURCE Chicken (strain White Leghorn) liver parenchymal cell line, cDNA to
mRNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 2730)
AUTHORS Weissbach,L. and Grieninger,G.
TITLE Bipartite mRNA for chicken alpha-fibrinogen potentially encodes an
amino acid sequence homologous to beta- and gamma-fibrinogens
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5198-5202 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Grieninger, 08-MAY-1990.
FEATURES from to/span description
pept < 1 1518 alpha-fibrinogen (AA at 1)
ORF 1830 2582 ORF
mRNA < 1 2730 alpha-fibrinogen mRNA
BASE COUNT 825 a 574 c 678 g 653 t
ORIGIN
1 acgatactct gtgtgttgct ctgcctcaat ttagcttggg ctcaggatgg gaagaccacc
61 tttgaaaagg agggcggagg cgggcgtggc cccaggatcc tagagaacat gcacgagtcg
121 tcctgcaagt atgagaagaa ctggcctatc tgtgtggatg atgactgggg tacaaaatgt
181 ccatcatgct gcagaatgca aggaataatt gacgacacag accagaatta cagtcaaaga
241 atagacaata ttaggcagca actcgcagac agtcaaaaca agtataaaac atccaatcgg
301 gtaattgtgg aaaccataaa tatactaaag cctggtctgg agggtgccca gcagctcgat
361 gagaattacg gtcatgtgtc aacagaactg aggaggagaa ttgtgacatt aaagcagcga
421 gttgccactc aagtgaacag aattaaagct ctgcagaaca gcatccagga gcaggtggtg
481 gaaatgaagc gcttggaggt ggacattgat attaagatac gagcttgcaa agggagctgt
541 gctagaagtt ttgattacca ggtggacaaa gaaggctatg acaacatcca gaagcatctt
601 acccaggcca gctccattga tatgcaccca gactttcaaa caaccacctt gagcacactg
661 aaaatgaggc ctcttaagga ctcgaatgtt cctgaacatt ttaagctcaa gccttcacca
721 gaaatgcaag ctatgagtgc atttaataac atcaaacaga tgcaagtggt attagaaaga
781 ccagaaacag accacgtggc agaagcaagg ggggattctt caccttccca cactggcaaa
841 ttaattactt cttctcacag gagagagagc cctagcctgg tagacaaaac ctcctctgcc
901 tcctctgtcc atagatgcac cagaactgtc accaagaaag tcattagtgg ccctgatggt
961 cctagagaag aaatagttga aaaaatggtt tcctctgatg gctcagactg ctcccatcta
1021 caaggaggac gagaaggaag cacgtaccat tttagtggga caggtgactt tcacaagcta
1081 gacaggcttc tgccagatct agagtcattc ttcacccatg actctgtatc cactagtagt
1141 aggcactcaa ttggctcaag caccagcagc catgtgactg gggcaggcag cagccaccta
1201 ggcactggag ggaaagacaa attcacagat ttaggggagg aggaagaaga tgactttgga
1261 ggacttcagc catctggatt tgctgctggc agtgcgagtc actccaagac tgtactgacc
1321 agctcctctt ctagtttcaa caagggaggc tccacttttg aaaccaagtc actaaagacc
1381 cgtgaaacat ctgagcagct aggtggggtg caacatgatc agagtgcaga ggacacccca
1441 gactttaagg cgcgcagctt cagaccagca gcaatgagta caaggagatc ctacaatggg
1501 aaaggtacac agaagtagta attgaggtag tggagccaaa ttcaattcat aaccaaactg
1561 acagaatgtt tttagatact gcagtataat agtgtgatga taaaatgcca gaactgtgtc
1621 tgtttgttgc cactttggag ataaaagaga actatatcac taatagtcag gtattgcata
1681 gacaattagc aatataagat ctgtgtaagg ttttcctctc tcaattcttt ttaatgttgc
1741 ctgtcactaa accttatgtt gtccaagaca catttaacaa attttcctca gatctattgc
1801 tgcttggatt cctctgaagt ttttattaag ctttactgat aacttctgtc aaaccagatc
1861 aacttttttt tttagactgc gatgatatcc gccagaaaca cacttctggt gccaaaagtg
1921 gcattttcaa aatcaagcca gagggatcca ataaggtttt gtcagtttat tgcgaccaag
1981 agaccacttt gggaggatgg ctattgatcc aacagagaat ggatggatca gtgaatttta
2041 accgtacgtg gcaagactac aggagaggtt tcggcagcgt ggatggcaaa gggcaaggag
2101 agttgtggct gggcaatgag aatatccact tgctgactca gaacgatact ctgcttcgag
2161 tagagttaga ggactgggat ggaaatgctg catatgcaga gtacatcgtg caggtaggga
2221 ctgaagcaga aggctatgcc ctgactgtgt cctcctatga ggggactgct ggggatgctc
2281 tggttgctgg ctggctggag gagggctctg agtacacatc ccatgcccag atgcagttca
2341 gcacttttga ccgggaccag gaccactggg aggagagctg tgcagaggtg tacgggggtg
2401 gctggtggta caacagctgc caggcagcca acctcaatgg catttactat ccagggggcc
2461 attatgaccc caggtacaat gttccatatg agatcgagaa tggagtagtc tggataccat
2521 ttagagcctc cgattattcc ctcaaagttg ttagaatgaa aatcagaccc ctggaaaccc
2581 tgtagaagga cgggcattta atatatgtgt cacaactaaa agttggaaac atcttttata
2641 tacctgtgtg cacgatgtac ctttacccta tgaatttgaa ggcagcatgg cacatctatg
2701 gcatacaaaa taaacggatt aattattaac
//
LOCUS ECOHYA 6023 bp ds-DNA BCT 17-JUL-1990
DEFINITION E.coli hya operon encoding hydrogenase isozyme 1 large and small
subunit (hyaB and A) genes and hyaC-F protein genes, complete cds.
ACCESSION M34825
KEYWORDS hyaA gene; hyaB gen; hyaC gene; hyaD gene; hyaE gene; hyaF gene;
hydrogenase.
SOURCE E.coli (strain LE392) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 6023)
AUTHORS Menon,N.K., Robbins,J., Peck,H.D.Jr., Chatelus,C.Y., Choi,E.-S. and
Przybyla,A.E.
TITLE Cloning and sequencing of a putative Escherichia coli [NiFe]
hydrogenase-1 operon containing six open reading frames
JOURNAL J. Bacteriol. 172, 1969-1977 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 188 1306 hydrogenase-1 small subunit (hyaA) precursor
sigp 188 322 hydrogenase-1 small subunit (hyaA) signal
peptide
matp 323 1303 hydrogenase-1 small subunit (hyaA)
pept 1303 3096 hydrogenase-1 large subunit (hyaB)
pept 3115 3822 hyaC protein
pept 3819 4406 hyaD protein
pept 4403 4801 hyaE protein
pept 4798 5655 hyaF protein
signal 5657 5683 pot. transcription termination signal
BASE COUNT 1334 a 1607 c 1720 g 1362 t
ORIGIN
1 ttttgtgcaa aagtttcact acgctttatt aacaatactt tctggcgacg tgcgccagtg
61 cagaaggatg agctttcgtt ttcagcatct cacgtgaagc gatggtttgc cttgctacag
121 ggacgtcgct tgccgaccat aagcgcccgg tgtcctgccg gtgtcgcaag gaggagagac
181 gtgcgatatg aataacgagg aaacatttta ccaggccatg cggcgtcagg gcgttacccg
241 gcgcagcttt ctcaaatatt gtagtctggc tgccacgtcg ctgggattag gcgcgggaat
301 ggcaccaaag attgcctggg cgctggagaa caaaccgcgc attccggtgg tatggatcca
361 cggtctggaa tgcacctgct gtaccgaatc ttttatccgc tccgctcacc cactggcgaa
421 ggacgtcatc ctttccctga tttccctcga ttacgacgat actttgatgg ctgccgccgg
481 aacccaggcg gaagaagtct ttgaagacat catcacgcaa tacaatggca aatatatcct
541 cgcagtagaa ggtaatccgc cgctgggcga gcaggggatg ttctgtatca gcagcggtcg
601 accgtttatt gagaaactca aacgtgccgc tgccggagcc agcgcgatta tcgcctgggg
661 aacctgcgcg tcctggggct gcgtgcaggc cgcgcgaccc aatccgacgc aggcaacgcc
721 tatcgacaaa gtcatcaccg acaaacccat tatcaaagta cctggctgcc cgccgatccc
781 ggatgtgatg agcgccatca ttacttacat ggtgaccttt gatcgcttgc cagatgtcga
841 cagaatgggc cgtccgctga tgttctatgg tcagcgaatc cacgataaat gctatcgccg
901 cgcccacttc gacgccggag agttcgtcca gagttgggat gatgacgctg cccgcaaagg
961 ttactgcctg tacaaaatgg gctgcaaagg gcctaccacc tataacgcct gttcctccac
1021 acgctggaat gatggcgttt ctttcccaat ccagtctggt cacggctgcc tgggctgtgc
1081 ggaaaatggt ttctgggatc gcggttcgtt ctacagccgc gtggtcgata ttccgcaaat
1141 gggtactcat tccaccgccg ataccgtcgg tttaaccgcg cttggcgtgg tggcagcggc
1201 tgttggtgtg cacgcagtcg ccagcgccgt tgaccagcgc agacgtcata accagcaacc
1261 tacagaaacc gaacatcagc caggcaatga ggataaacag gcatgagcac tcagtacgaa
1321 actcagggat acaccatcaa taatgccgga cgccgcctgg tggtcgaccc gattacgcgc
1381 atcgaaggcc acatgcgctg cgaagtgaat attaacgatc agaatgtgat caccaatgcc
1441 gtctcctgcg gcaccatctt tcgcgggctg gagatcatcc tacaagggcg cgacccgcgc
1501 gatgcgcggg cgttcgttga acgtatctgc ggcgtctgta ctggcgtaca cgccctggct
1561 tcggtttacg ccatcgaaga tgctatcggt attaaagtgc cggacaacgc caatatcatc
1621 cgcaacatta tgctggcaac gctctggtgc cacgatcatc tggtgcactt ctatcagctt
1681 gccgggatgg actggatcga tgtgttagat gcgctgaaag ccgacccgcg gaaaacctcc
1741 gaactggcgc aaagtctctc ctcttggccg aaatcatccc ctggctattt cttcgacgta
1801 caaaaccgcc tgaaaaaatt tgttgaaggc gggcagttgg ggatcttccg caatggctac
1861 tgggggcacc cgcagtacaa actgccgcca gaagctaacc tgatgggctt tgcccactat
1921 ctcgaagctc tcgatttcca gcgtgaaatt gtcaaaatcc acgcggtctt tggcggtaaa
1981 aacccgcatc caaactggat tgtcggcggg atgccttgcg ccatcaacat tgacgaaagc
2041 ggcgcggtcg gggcagtcaa tatggaacgc ctgaacctgg tgcagtcaat tatcacccgc
2101 acggcggact tcattaacaa cgtgatgatc cccgacgcct tagccatcgg tcagttcaac
2161 aaaccgtgga gcgaaatcgg cactggtctt tctgataaat gcgttctcag ctacggcgca
2221 ttcccggata ttgccaacga ctttggcgag aaaagtctgc tgatgcctgg cggcgcggtg
2281 attaacggcg acttcaacaa tgtgctgcca gtggatttgg ttgatccgca gcaggtgcag
2341 gagtttgtcg accacgcctg gtatcgatat cccaacgatc aggtcgggcg tcatccgttc
2401 gatggcatca ccgacccgtg gtacaacccc ggcgatgtca aaggcagcga taccaacatt
2461 cagcagctga atgaacagga acgctactcg tggatcaaag cgccacgctg gcgcggtaac
2521 gcgatggaag tggggccgct ggcgcgcacg ttaatcgctt atcacaaagg cgatgctgcg
2581 accgttgagt cggtcgatcg catgatgtcg gcgttgaacc tgccgctttc cggtatccag
2641 tcaacgttag gccgcatttt gtgccgcgcg cacgaagcgc agtgggccgc aggtaagttg
2701 cagtatttct tcgacaagct gatgaccaac ctgaaaaacg gcaatctcgc cactgcttcc
2761 acggaaaaat gggaacctgc aacctggccg acagagtgcc gtggtgtcgg ttttaccgaa
2821 gcgccgcgcg gggcgttagg ccactgggcc gccattcgcg atggcaagat tgatctctac
2881 cagtgcgtgg tgccgaccac ctggaacgcc agcccgcgcg atcccaaagg gcagattggc
2941 gcttatgaag cggcgctgat gaacaccaaa atggcgatcc ccgagcaacc gctggagatc
3001 ctgcgtactc tgcacagctt tgacccgtgc ctcgcctgtt caacacacgt gctgggcgac
3061 gacggtagcg agctgatctc cgtgcaggtg cgttaacagc gaaggagaat catcatgcaa
3121 cagaaaagcg acaacgttgt cagccactat gtctttgaag cgccagtgcg catctggcac
3181 tggttgacgg tgttatgcat ggcggtgttg atggtcaccg gatactttat cggcaagccg
3241 ctaccttccg tcagcggcga ggcgacgtat ctgttctata tgggctacat caggttaatt
3301 cacttcagcg ccgggatggt ttttaccgtg gttttgctga tgcggatcta ctgggctttt
3361 gttggcaatc gatactcccg cgagctgttt atcgtgccgg tatggcgtaa aagctggtgg
3421 cagggcgtgt ggtatgaaat ccgctggtat ctgtttctgg caaaacgtcc gagtgccgat
3481 ataggccata atcccatcgc ccaggcggcg atgttcggct atttcctgat gtcggtcttt
3541 atgatcatca ctggttttgc gctgtacagc gaacacagcc agtacgctat ttttgcgccg
3601 ttccgttatg tggtggaatt tttctactgg acgggtggca actcaatgga cattcacagc
3661 tggcatcggc tggggatgtg gctgattggc gcgtttgtga tcggtcatgt ctacatggcg
3721 ctgcgtgaag acatcatgtc cgacgacacg gtgatctcca ccatggtcaa cggctaccgt
3781 agccacaaat ttggcaaaat aagtaacaag gagcgttcat gagcgagcaa cgcgtggtgg
3841 tcatggggct gggcaacctg ctgtgggccg atgaaggctt cggcgtgcgg gtggcggaac
3901 ggctgtatgc ccattaccac tggcccgagt atgtggagat tgtcgatggc ggtactcagg
3961 gactgaactt gctggggtat gtcgaaagcg ccagccatct gttgattctc gatgccattg
4021 actacgggct ggaacctgga acgctgcgaa cctatgccgg agaacgcatt ccggcttatc
4081 tcagcgcgaa gaaaatgagc ctgcatcaga acagtttctc cgaagtgttg gcgctggcgg
4141 atatccgcgg acatctgcca gcacatattg ccctcgtcgg tctgcaaccc gcaatgctcg
4201 acgactacgg cggtagcctg agcgaactgg cacgggagca actgcccgct gcggaacagg
4261 cggcgctggc gcagcttgct gcgtggggaa ttgtgccgca accggctaat gaatcgcgct
4321 gtctcaatta tgactgtctg tcgatggaaa attacgaagg cgttcgcttg cgccagtacc
4381 ggatgacaca ggaggagcag ggatgagcaa cgacacgcca tttgatgcgt tgtggcaacg
4441 aatgctggcg cgcggctgga cgccagtcag tgaatcccgt cttgacgact ggcttacgca
4501 agcgccagac ggcgtggtgt tattaagcag tgacccgaaa cgcacgccag aggtcagcga
4561 taatccggta atgattggcg aattactgcg cgagtttccc gactatacat ggcaggtggc
4621 gattgctgac cttgagcaga gcgaagccat cggcgatcgc tttggcgtct ttcgctttcc
4681 tgccacttta gtgtttaccg gcggaaacta tcgcggcgtg ctgaatggta ttcacccgtg
4741 ggcggaactg ataaacctga tgcgcgggct tgtcgaaccg cagcaggagc gtgcctcatg
4801 agcgaaactt ttttccatct gctggggcca ggaacgcaac cgaacgatga cagtttcagc
4861 atgaatccac tgccgatcac ctgtcaggtg aatgatgaac cgagtatggc ggccctggag
4921 caatgtgctc acagcccgca ggtgattgcg ctgttaaacg agttacaaca tcaactaagc
4981 gaacgccaac cgccgttggg cgaggtgctg gcagtcgatc tgttaaatct caacgccgac
5041 gatcgtcact ttatcaatac gcttctcggg gaaggggaag tgtcagtgcg cattcagcag
5101 gctgacgaca gtgaaagtga aatacaggag gcgatcttct gcggattatg gcgggtgcgc
5161 agacgtcgcg gcgaaaagtt gctggaggac aaactggagg ctggctgcgc gccgctggcg
5221 ttgtggcagg cggcaacgca aaatctcttg ccgacagatt cgctgttacc gccgcccatt
5281 gatggcctga tgaatggcct accgttggcg catgagttac tggcacatgt acgtaacccc
5341 gacgcgcagc cgcacagcat taatctgacg caattaccca tcagcgaggc tgatcggctt
5401 tttctctcac gtctctgtgg gccgggaaat attcagattc gtaccattgg ctatggcgag
5461 agctatatca acgccacggg gttacgccat gtctggcatt tacgctgtac ggacacctta
5521 aaaggcccgt tactggaaag ttatgaaatc tgcccaatac cggaagtggt gctggcagcg
5581 ccagaagatt tggtcgactc tgcgcagcgg cttagcgagg tatgtcagtg gctggcggaa
5641 gctgcaccga cgtaaaaaga cggtaagtat cgctttcagt cttatgaata tcgcaatcgg
5701 cgaatacctc tggtcgtaga gtttcaggat aaagagggag atctaccatt atcgggttat
5761 ttttctctct tcgcctacag gagtgcgcat gtgggatgtc attgatttat cgcgctggca
5821 gtttgctctg accgcgctgt atcacttttt atttgtaccc cttaccctgg gctgattttt
5881 ttgctggcta ttatggaaac catttacgtg gtcaccggca aaaacaatct accggcaaaa
5941 caatctaccg cgatatgacg cgcttctggg gtaagctctt cggtatcaat tttgctcttg
6001 gcgtggctac ccggcctaac cat
//
LOCUS HUMFDXC 1174 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human ferredoxin mRNA, complete cds.
ACCESSION M34788
KEYWORDS ferredoxin.
SOURCE Human, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1174)
AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
TITLE Structure, sequence, chromosomal location, and evolution of the
human ferredoxin gene family
JOURNAL DNA Cell Biol. 9, 205-212 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 19 573 ferredoxin
BASE COUNT 337 a 211 c 267 g 359 t
ORIGIN Chromosome 11q13-qter.
1 agttcccgac cgcgggcgat ggctgccgct gggggcgccc ggctgctgcg cgccgcttct
61 gctgtcctcg gcggcccggc cggccggtgg ctgcaccacg ctgggtcccg cgctggatcc
121 agcggcctgc tgaggaaccg ggggccgggc ggtagcgcgg aggcgagccg gtcgctgagc
181 gtgtcggcgc gggcccggag cagctcagaa gataaaataa cagtccactt tataaaccgt
241 gatggtgaaa cattaacaac caaaggaaaa gttggtgatt ctctgctaga tgttgtggtt
301 gaaaataatc tagatattga tggctttggt gcatgtgagg gaaccctggc ttgttcaacc
361 tgtcacctca tctttgaaga tcacatatat gagaagttag atgcaatcac tgatgaggag
421 aatgacatgc tcgatctggc atatggacta acagacagat cacggttggg ctgccaaatc
481 tgtttgacaa aatctatgga caatatgact gttcgagtgc ctgaaacagt ggctgatgcc
541 agacaatcca ttgatgtggg caagacctcc tgaactagaa caaataggaa tattttcatg
601 gaattttacc tatttttata attattattt cttaaagtga ttaaatgaga acatggatga
661 gtggacttca tattatgact agctttacta ttttaattca ccttgcataa ctactgaatt
721 ttgtcattct tgaaagtatg caatttttat tttggttata ttacaaaaat gtcaatcaaa
781 tattaaaaaa tagttaatgt gatagaaaaa ccttacatat ttttttctta tgtttgttta
841 gcgactttag caaaatgttt tcatataatc tcatctgttt acctagaaga taggttaagg
901 aaatatatta ttattcctgt ttgatgtggg tgaaggcaga gatctaacct ggcttgttta
961 gggccatacc actaattaga aaatctgtgc tagaacctgt gtcttattcc tataagctat
1021 gtgttcagac tgaaactgga gaaattatga ctattttatt tatagtagta gttaaatctg
1081 aatgtgtatg gacaaaaata tttaattgct gagtaaactg cttaacttca aagatagtta
1141 ttgaccttat aaataaatat ttcaaaattt tgat
//
LOCUS HUMFDXPSA 676 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human ferredoxin pseudogene, complete cds.
ACCESSION M34787
KEYWORDS ferredoxin; pseudogene.
SOURCE Human DNA, clone h2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 676)
AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
TITLE Structure, sequence, chromosomal location, and evolution of the
human ferredoxin gene family
JOURNAL DNA Cell Biol. 9, 205-212 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps 19 569 pseudo-ferredoxin
BASE COUNT 171 a 149 c 192 g 164 t
ORIGIN Chromosome 20.
1 agttcccgac cgcgggcgat ggctgccgct gggggcgccc ggctgctgcg cgccgcttct
61 gctgtcctcg gcgacccggc cggccggtgg ctgcaccacg ccgggtcccg cgctggagcc
121 agcggcctgc tgaggagccg gggaccgggc cggagcgcgg aggcaagccg gccgctgagc
181 gtgtcggcgg gggcgcggag cagctcagaa gataaagcga cagtccactt tataaactgt
241 gatggtgaaa cattaacaac caaaggaaaa gttggtgatt ctctgctaga cgttgtggtt
301 gaaaataatc cagatattga tggctttggt gcatgtgagg gaactctaac ttgtttaacc
361 tgtcatctca tctttgaaga tcacatatat gagaagttag atgcaatcac tgatgaggag
421 aatcacatgc tcgatctggc atatggacta acagatcaca gttgggctgc caaatctgtt
481 tgacaaaatc tatggacaat atgactgttc gagtgcctga aacagtggct gatgccagac
541 aatccattga tgtgggcaag acctcctgaa ctagaacaaa taggaatatt ttcatggaat
601 tttacctatt tttataatta ttatttaagt gataaaatga gtacatggat gagtggattc
661 tgtattatga ctagct
//
LOCUS HUMFDXPSB 976 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human ferredoxin pseudogene, complete cds.
ACCESSION M34789
KEYWORDS ferredoxin; pseudogene.
SOURCE Human DNA, clone h3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 976)
AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
TITLE Structure, sequence, chromosomal location, and evolution of the
human ferredoxin gene family
JOURNAL DNA Cell Biol. 9, 205-212 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps < 1 386 pseudo-ferredoxin
BASE COUNT 340 a 156 c 168 g 312 t
ORIGIN Chromosome 21.
1 ttcagtgaag ggcccagagc agctcggaag ataaaataac agtccacttt ataaaccaac
61 agtgaaacat taacaaacaa aggaaaagtt gatgactctt tgctagatgt tgtggttgaa
121 aataatgtag atattgatgg gtttgttgca tggcagggaa ccttggctct gcttgtcacc
181 tcatttttga aaagcacata tttgagaagt tagacacaat cactgacaag gagattgaca
241 tgcttgatct ggcatatgga ctgacagaca ggtcagagtt gtattgccaa atctgtttga
301 caaaatctat gggccatatg actgtctgag tacctgatgg agtggccagt gccagacaat
361 ccactgatat gggcaagacc acctaagcag aataaatagc aatgctttca ctaaatttta
421 cctattttat aattatttct tatataatta aataaaaaca tacatgaatg aacttattat
481 tatgaccaac tttactgttt taattcacct tgtataacta ctgaatttta tagttctgaa
541 aatacgcaat ttttatattt gttacattag aaaaaggtca gttaaatatt agaaaataat
601 taatatgaca ataccttaca aattttgcct tacgtttgtt tagcaacttt tagcaaaatg
661 ttttcacatg gccttatgtc tgcttaccta gaaagtaaat tttttaaaaa agatattatt
721 atccctgttt tatgtgagtg aaggcaaaga cctaaaatgg cttgttaagg gccatccaac
781 taattagaaa atatgtacac acctgcatct tgttactata aggtatatgt tagactgaaa
841 ctggagaaat tatgactatc ttacagcagc aattagatct gactgcatat ggacaaaaac
901 attaattgct cagtgaactg cttaactgac agttattaac cttacacata aagatttcaa
961 aattttgatt cagtga
//
LOCUS HUMFRDXA1 350 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human ferredoxin gene, exon 3.
ACCESSION M34783
KEYWORDS ferredoxin.
SEGMENT 1 of 2
SOURCE Human DNA, clone h4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 350)
AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
TITLE Structure, sequence, chromosomal location, and evolution of the
human ferredoxin gene family
JOURNAL DNA Cell Biol. 9, 205-212 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 45 + 174 ferredoxin, exon 3
IVS < 1 44 ferredoxin intron B
IVS 175 > 350 ferredoxin intron C
BASE COUNT 99 a 56 c 78 g 117 t
ORIGIN Chromosome 11q13-qter.
1 ctgatgtaga agggactatg ttcagtgttt gttgcttttg tcaggtgcat gtgagggaac
61 cctggcttgt tcaacctgtc acctcatctt tgaagatcac atatatgaga agttagatgc
121 aatcactgat gaggagaatg acatgctcga tctggcatat ggactaacag acaggtaaga
181 tttttggact gcttcaattg taataataat ctgggaacat agatgttgta ttatgttgtc
241 tatgtaagac cagacccata aatgtaatag tgttctacca ggttaaataa cagtcacaga
301 ttttgagtat cagatatgtg gcatttctct ttgttttcac tgttctacag
//
LOCUS HUMFRDXA2 1260 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human ferredoxin gene, exon 4.
ACCESSION M34785
KEYWORDS ferredoxin.
SEGMENT 2 of 2
SOURCE Human DNA, clone h4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1260)
AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
TITLE Structure, sequence, chromosomal location, and evolution of the
human ferredoxin gene family
JOURNAL DNA Cell Biol. 9, 205-212 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 33 147 ferredoxin, exon 4
IVS < 1 32 ferredoxin intron C
BASE COUNT 410 a 196 c 241 g 413 t
ORIGIN About 5.3 kb after segment 1; chromosome 11q13-qter.
1 actaaaccat accttccccc ttttccatac agatcacggt tgggctgcca aatctgtttg
61 acaaaatcta tggacaatat gactgttcga gtgcctgaaa cagtggctga tgccagacaa
121 tccattgatg tgggcaagac ctcctgaact agaacaaata ggaatatttt catggaattt
181 tacctatttt tataattatt atttcttaaa gtgattaaat gagaacatgg atgagtggac
241 ttcatattat gactagcttt actattttaa ttcaccttgc ataactactg aattttgtca
301 ttcttgaaag tatgcaattt ttattttggt tatattacaa aaatgtcaat caaatattaa
361 aaaatagtta atgtgataga aaaaccttac atattttttt cttatgtttg tttagcgact
421 ttagcaaaat gttttcatat aatctcatct gtttacctag aagataggtt aaggaaatat
481 attattattc ctgtttgatg tgggtgaagg cagagatcta acctggcttg tttagggcca
541 taccactaat tagaaaatct gtactagaac ctgtgtctta ttcctacaag ctatgtgttc
601 agactgaaac tggagaaatt atgactattt tatttatagt agtagttaaa tctgaatgtg
661 tatggacaaa aatatttaat tgctcagtaa actgcttaac ttcaaagata gttattaacc
721 ttataaataa atatttcaaa attttgattc ggaagactaa gtctggacgt agacattata
781 atgctatcaa agaagtttga tctctgtttt gactaaacta gaggaaaaat gattggatgt
841 gtttattctt ttctaagcag aatggtttaa ctttgtactc tttgaaaaat aatgctgatt
901 tataaatctc tgcctataac agaatggaaa ccttatgaat gaattgtgtt tctctgtcct
961 gagctggaga agggaatgag caggctgaca cgttgcacag ccccaggtgg cgccattctc
1021 tcacgcaagg atggggctgc agggtgagca gcgtgggctg cagtgtgtca gtcccaggag
1081 tgagggagtg gcaagcacca cagattacca cgtatgtgtg gaagacattc gtacccttat
1141 ctttactata aataaattca taaaagttaa caaaggggta cacagtatgg tctttggaaa
1201 tataataaaa catcaactaa cttggactaa ttgtgaggaa gagcagaaca aattagtaga
//
LOCUS HUMFRDXB1 350 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human ferredoxin gene, exon 3.
ACCESSION M34784
KEYWORDS ferredoxin.
SEGMENT 1 of 2
SOURCE Human DNA, clone h1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 350)
AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
TITLE Structure, sequence, chromosomal location, and evolution of the
human ferredoxin gene family
JOURNAL DNA Cell Biol. 9, 205-212 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 45 + 174 ferredoxin, exon 3
IVS < 1 44 ferredoxin intron B
IVS 175 > 350 ferredoxin intron C
BASE COUNT 99 a 56 c 78 g 117 t
ORIGIN Chromosome 11q13-qter.
1 ctgatgtaga agggactatg ttcagtgttt gttgcttttg tcaggtgcat gtgagggaac
61 cctggcttgt tcaacctgtc acctcatctt tgaagatcac atatatgaga agttagatgc
121 aatcactgat gaggagaatg acatgctcga tctggcatat ggactaacag acaggtaaga
181 tttttggact gcttcaattg taataataat ctgggaacat agatgttgta ttatgttgtc
241 tatgtaagac cagacccata aatgtaatag tgttctacca ggttaaataa cagtcacaga
301 ttttgagtat cagatatgtg gcatttctct ttgttttcac tgttctacag
//
LOCUS HUMFRDXB2 1269 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human ferredoxin gene, exon 4.
ACCESSION M34786
KEYWORDS ferredoxin.
SEGMENT 2 of 2
SOURCE Human DNA, clone h1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1269)
AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
TITLE Structure, sequence, chromosomal location, and evolution of the
human ferredoxin gene family
JOURNAL DNA Cell Biol. 9, 205-212 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 32 146 ferredoxin, exon 4
IVS < 1 31 ferredoxin intron C
BASE COUNT 407 a 197 c 246 g 419 t
ORIGIN About 5.3 kb after segment 1; chromosome 11q13-qter.
1 actaaaccat accttcccct tttccataca gatcacggtt gggctgccaa atctgtttga
61 caaaatctat ggacaatatg actgttcgag tgcctgaaac agtggctgat gccagacaat
121 ccattgatgt gggcaagacc tcctgaacta gaacaaatag gaatattttc atggaatttt
181 acctattttt ataattatta tttcttaaag tgattaaatg agaacatgga tgagtggact
241 tcatattatg actagcttta ctattttaat tcaccttgca taactactga attttgtcat
301 tcttgaaagt atgcaatttt tattttggtt atattacaaa aatgtcaatc aaatattaaa
361 aaatagttaa tgtgatagaa aaaccttaca tatttttttc ttatgtttgt ttagcgactt
421 tagcaaaatg ttttcatata atctcatctg tttacctaga agataggtta aggaaatata
481 ttattattcc tgtttgatgt gggtgaaggc agagatctaa cctggcttgt ttagggccat
541 accactaatt agaaaatctg tgctagaacc tgtgtcttat tcctataagc tatgtgttca
601 gactgaaact ggagaaatta tgactatttt atttatagta gtagttaaat ctgaatgtgt
661 atggacaaaa atatttaatt gctgagtaaa ctgcttaact tcaaagatag ttattgacct
721 tataaataaa tatttcaaaa ttttgattcg gaagactaag tctggacgta gacattataa
781 tgctatcaaa gaagtttgat ctctgttttg actaaactag aggaaaaatg attggatgtg
841 tttattcttt tctaagcaga atggtttaac tttgtactct ttgaaaaata atgctgattt
901 ataaatctct gcctataaca gaatggaaac cttatgaatt gtttgtgttt ctctgtcctg
961 agctcggtac ccctggagaa gggaatgagc aggctgacac gttgcacagc cccaggtggc
1021 gccattctct cacgcaagga tggggctgca gggtgagcag cgtgggctgc agtgtgtcag
1081 tcccaggagt gagggagtgg caagcaccac agattaccac gtatgtgtgg aagacattcg
1141 tactcttatc tttactataa ataaattcat aaaagttaac aaaggggtac acagtatggt
1201 ctttggaaat ataataaaac atcaactaac ttggactaat tgtgaggaag agcagaacaa
1261 attagtaga
//
LOCUS RATC1H4SY 3146 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Rat cytoplasmic C-1-tetrahydrofolate synthase mRNA, complete cds.
ACCESSION J05519
KEYWORDS C-1-tetrahydrofolate synthase.
SOURCE Rat adult liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3146)
AUTHORS Thigpen,A.E., West,M.G. and Appling,D.R.
TITLE Rat C-1-tetrahydrofolate synthase: cDNA isolation, tissue-specific
levels of the mRNA, and expression of the protein in yeast
JOURNAL J. Biol. Chem. 265, 7907-7913 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.R.Appling, 18-MAY-1990.
FEATURES from to/span description
pept 121 2928 C-1-tetrahydrofolate synthase
mRNA < 1 3146 C-1-tetrahydrofolate synthase mRNA
signal 3138 3143 poly-A signal
BASE COUNT 845 a 759 c 845 g 697 t
ORIGIN
1 cgcactgagt ttccccggga gatccctgag cacgtgggtt gtgctgtccg gtaagcgcac
61 ggagggactg gaacctgaat tttgctggtg ttcttcgtgg gtcgcagact agctaaggcc
121 atggcaccag cgggaatcct gaatggaaaa gtggtctccg cgcaaatcag gaatctactg
181 aagacccagt ttactcagat gcaggagcag gtacctggct tcacccctgg cctggcaatc
241 ctgcaggttg gcgacagaga tgattccaat ctttatataa atgtgaagct gaaggctgct
301 caagagatcg ggatcaaagc cactcacatt aaattaccaa gaacttccac agagtcggag
361 gtgttaaaat acgtcatctc cctgaatgag gatgccactg tgcacggatt catagtacag
421 ctgcctttag actccgagaa ttccattaac acagaggcag tcatcaatgc cattgcccct
481 gagaaggatg tggacgggtt gacgagcatc aatgccggga agcttgccag aggtgatcta
541 aaggactgct tcattccgtg cacacccaaa ggatgcttgg aactcatcaa agagacagga
601 gtgcagattg caggaaggca cgccgtggtg gtcgggcgaa gtaaaatagt tggtgcaccc
661 atgcacgacc tgcttctgtg gaacaatgcc acggtgacca cctgccactc caagactgcc
721 gatctggaca aggaggtaaa taaaggagac attctggtgg ttgcaacagg acagcctgag
781 atggtgaaag gggagtggat caagcctgga gctgtggtca tagactgtgg gatcaattat
841 gttccagatg atacaaaacc aaatggaagg aaagttgtgg gtgacgtagc atatgacgag
901 gccaaggaga aggcgagctt tatcacgcct gtccccggtg gcgtggggcc catgaccgtg
961 gcaatgctga tgcagagcac agtagagagc gcacagcgct tcctgaagaa atttaagcca
1021 gggaagtgga caattcagta taacaagctg aacctcaaga cgcctgtacc aagtgacatt
1081 gctatatcac gatcttgcaa acccaagctc atcggtaacc tggcccgaga aattgggcta
1141 ctcactgagg aggtggaatt gtatggagaa acaaaggcca aggtcttact gtcagcacta
1201 gatcgcctga agcatcagcc agatgggaaa tacgttgtgg tgactggaat tactccaaca
1261 cccctgggag aagggaagag cacaaccacc atcgggcttg tgcaagccct tggcgcccac
1321 ctgcatcaga atgtttttgc gtgtgtgcga cagccttctc agggccccac ctttgggata
1381 aaaggtggcg ctgcaggagg cggctattcc caggtcattc ctatggaaga gtttaatctc
1441 cacctcactg gtgacatcca cgccatcact gccgctaata accttgtggc tgctgctatt
1501 gacgctcgga tatttcatga gctgacccag acagacaagg ctctctttaa tcgtctggta
1561 ccctcagtaa atggagtaag aaagttctct gacatccaaa tccgcaggtt acggaggcta
1621 ggcattgaga agactgaccc cgctgcgctg acagatgatg agataaacag atttgcaaga
1681 ctagatattg atccagaaac cataacatgg cagagagtgc tggatactaa cgatagattc
1741 ctgaggaaga tcaccattgg acaggctcca accgagaaag gccacacacg cacggcccag
1801 tttgatatct ctgtggccag tgaaatcatg gcagtcctgg ccctcactag ttctctggaa
1861 gacatgagag caaggctggg caaaatggtg gtagcgtcca gtaagaaagg ggagcctatc
1921 agctgtgaag acctgggcgt gagcggggcg ctgacggtgc tcatgaagga tgcgatcaag
1981 cccaatctca tgcagacctt agagggcacg ccggtatttg tccatgctgg gccttttgcc
2041 aacattgcac atgggaattc ctccatcatt gcagaccgga ttgcactcaa gctggttggc
2101 cctgagggct ttgtagtgac agaagcagga ttcggagcag acataggaat ggaaaagttc
2161 ttcaacatca agtgccggta ttctggtctc cagcctcatg tggtggttct tgttgccact
2221 gtcagggctc ttaagatgca cgggggtggc cccacggtca ccgctggact gcctcttccc
2281 aaggcttaca cagaagagga cctggacctg gtggaaaagg gcttcagtaa cttgaggaaa
2341 cagatagaaa atgctagaat gtttggagtg cctgtcgtgg tggccatgaa tgcattcaag
2401 acagatacag atactgagct ggacctcatc ggccgcctct ccagagagca tggggctttc
2461 gatgctgtca agtgcaccca ctgggcagaa gggggccagg gagccttagc cctggctcag
2521 gctgtccaga gagcttcaca ggcccccagc agcttccagc tcctctatga cctcaagctc
2581 tcagttgaag ataaaatcag gattattgca cagaagatct acggggccga tgacatcgaa
2641 ttgctccctg aagcacaaaa caaagcagaa atctacacaa agcagggctt tgggaatcta
2701 cccatctgca tggccaaaac acacttgtct ttatctcaca acccagagca aaaaggggtg
2761 cctactggct tcgttctgcc catccgggac atccgggcca gcgttggggc tggtttcctg
2821 taccctttag taggaacgat gagcacaatg cctggactcc ctactcgacc ctgtttttat
2881 gatatcgatt tggaccctga aactgaacaa gtgaatggat tgttttaaac agatcttcca
2941 tctccaagag gctactctgt ctggccagtg tctgttcagg cccactgagg aagtgtgcag
3001 agaagtcttg aaagtctgtg cctgccctga agagcttcag aaatagtgga agtttctcta
3061 aagcctttca tagccttaat tcagatcatg tataaattaa cataaatcat gcctatttac
3121 atagtaaaag tccacagaat aaatga
//
LOCUS NGOARGF 1260 bp ds-DNA BCT 17-JUL-1990
DEFINITION N.gonorrhoeae ornithine transcarbamoylase (argF) gene, complete
cds.
ACCESSION M34930
KEYWORDS ornithine transcarbamoylase.
SOURCE N.gonorrhoeae (strain CDC 50) DNA.
ORGANISM Neisseria gonorrhoeae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1260)
AUTHORS Martin,P.R., Cooperider,J.W. and Mulks,M.H.
TITLE Sequence of the argF gene encoding ornithine transcarbamoylase from
Neisseria gonorrhoeae
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.H.Mulks, 06-JUN-1990.
Author address [1]: M.H.Hulks
Michigan State University
Department of Microbiology and Public Health
367 Giltner Hall, East Lansing, MI 43324
e-mail [1]: 21667CFM@MSU.bitnet
FEATURES from to/span description
pept 141 1136 ornithine transcarbamoylase (argF) (EC
2.1.3.3)
binding 128 133 ribosome binding site
signal 1172 1204 transcription termination signal
site 39 48 DNA uptake sequence
site 54 63 DNA uptake sequence
site 713 722 DNA uptake sequence
site 1175 1184 DNA uptake sequence
site 1192 1201 DNA uptake sequence
BASE COUNT 350 a 350 c 319 g 241 t
ORIGIN
1 tccccgtcat ccgtatggag taagggattg accgcaatgc cgtctgaaca accttcagac
61 ggcattgcaa cattccgcta acccttcttt ccgcaaacgc tgcaaatacg gcgttcacgc
121 cccacataaa ggaaacgaca gtgaacctga aaaaccgcca ttttctgaaa cttttggact
181 tcacgccgga agaaatcacc gcctacctcg accttgccgc cgagttgaaa gacgccaaaa
241 aggcagggcg cgagattcag cggatgaaag ggaaaaacat cgccctgatt tttgaaaaaa
301 catccacgcg cacacgctgt gcgtttgaag tcgccgcacg cgaccaaggc gcggatcgaa
361 cctatctgga accgtccgcc agccaaatcg ggcacaaaga aagcatcaaa gacacggcgc
421 gcgtcttagg cagaatgtac gatgccatcg aatatcgcgg cttcgctcag gaaactgtcg
481 aagaattggc aaaatatgcg ggcgtacccg tgttcaacgg gctgaccaac gagttccatc
541 ccacacaaat gcttgccgac gcactgacta tgcgcgaaca cagcggcaaa cctttgaacc
601 aaaccgcgtt tgcctacgtc ggcgacgcgc gttacaacat gggcaattcc ctgctgattt
661 taggggcaaa attggggatg gacgtgcgta tcggcgcacc tcaaagcctg tggccgtctg
721 aaggcattat cgccgccgca cacgccgccg ccaaagaaac cggtgcaaaa attaccctga
781 ccgaaaacgc gcatgaagcc gtcaaaggtg tcggtttcat tcatactgac gtatgggtca
841 gcatgggcga gccgaaagaa gtctggcagg aacgcatcga tttgctgaaa gattaccgcg
901 ttacgcccga actgatggcg gcatcgggca atccgcaagt caaattcatg cactgcctgc
961 ccgccttcca caaccgcgaa accaaagtcg gcgaatggat ttacgaaacc ttcgggctga
1021 acggtgtgga agttacagaa gaagtattcg aaagtccggc cggcatcgtg ttcgatcagg
1081 cggaaaaccg tatgcacacg attaaagcgg taatggtcgc ggctctgggc gactgacaga
1141 actgtgcctg tttaaattca tccgcaacac agataccgtc tgaacacgat gttcagacgg
1201 tatccatata acaaactgcc tacacgatgt gtaggcagtc ccgtttgaaa acaatcagtt
//
LOCUS PFACSPA 1218 bp ds-DNA INV 17-JUL-1990
DEFINITION P.falciparum CS gene encoding circumsporozoite protein, complete
cds.
ACCESSION M22982 J04650
KEYWORDS circumsporozoite protein.
SOURCE P.falciparum (strain NF54) DNA, clone 1.3A.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 1218)
AUTHORS Caspers,P., Gentz,R., Matile,H., Pink,J.R. and Sinigaglia,F.
TITLE The circumsporozoite protein gene from NF54, a Plasmodium
falciparum isolate used in malaria vaccine trials
JOURNAL Mol. Biochem. Parasitol. 35, 185-190 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
provided by P.Caspers, 14-MAR-1989.
FEATURES from to/span description
pept 1 1218 circumsporozoite protein
BASE COUNT 531 a 259 c 168 g 260 t
ORIGIN
1 atgatgagaa aattagctat tttatctgtt tcttcctttt tatttgttga ggccttattc
61 caggaatacc agtgctatgg aagttcgtca aacacaaggg ttctaaatga attaaattat
121 gataatgcag gcactaattt atataatgaa ttagaaatga attattatgg gaaacaggaa
181 aattggtata gtcttaaaaa aaatagtaga tcacttggag aaaatgatga tggaaataac
241 gaagacaacg agaaattaag gaaaccaaaa cataaaaaat taaagcaacc agcggatggt
301 aatcctgatc caaatgcaaa cccaaatgta gatcccaatg ccaacccaaa tgtagatcca
361 aatgcaaacc caaatgtaga tccaaatgca aacccaaatg caaacccaaa tgcaaaccca
421 aatgcaaacc caaatgcaaa cccaaatgca aacccaaatg caaacccaaa tgcaaaccca
481 aatgcaaacc caaatgcaaa cccaaatgca aacccaaatg caaacccaaa tgcaaaccca
541 aatgcaaacc caaatgcaaa cccaaatgca aaccccaatg caaatcctaa tgcaaaccca
601 aatgcaaacc caaacgtaga tcctaatgca aatccaaatg caaacccaaa cgcaaacccc
661 aatgcaaatc ctaatgcaaa ccccaatgca aatcctaatg caaatcctaa tgccaatcca
721 aatgcaaatc caaatgcaaa cccaaacgca aaccccaatg caaatcctaa tgccaatcca
781 aatgcaaatc caaatgcaaa cccaaatgca aacccaaatg caaaccccaa tgcaaatcct
841 aataaaaaca atcaaggtaa tggacaaggt cacaatatgc caaatgaccc aaaccgaaat
901 gtagatgaaa atgctaatgc caacagtgct gtaaaaaata ataataacga agaaccaagt
961 gataagcaca taaaagaata tttaaacaaa atacaaaatt ctctttcaac tgaatggtcc
1021 ccatgtagtg taacttgtgg aaatggtatt caagttagaa taaagcctgg ctctgctaat
1081 aaacctaaag acgaattaga ttatgcaaat gatattgaaa aaaaaatttg taaaatggaa
1141 aaatgttcca gtgtgtttaa tgtcgtaaat agttcaatag gattaataat ggtattatcc
1201 ttcttgttcc ttaattag
//
LOCUS NPA25K 900 bp ds-DNA VRL 17-JUL-1990
DEFINITION Autographa californica nuclear polyhedrosis virus 25K protein gene,
complete cds.
ACCESSION M22537 M23428
KEYWORDS 25K protein.
SOURCE Autographa californica nuclear polyhedrosis virus DNA, clone
HindIII-I.
ORGANISM Autographa californica nuclear polyhedrosis virus
Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus;
Nuclear polyhedrosis viruses.
REFERENCE 1 (bases 1 to 900)
AUTHORS Beames,B. and Summers,M.D.
TITLE Location and nucleotide sequence of the 25K protein missing from
baculovirus few polyhedra (FP) mutants
JOURNAL Virology 168, 344-353 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.D.Beames, 10-FEB-1989.
The 25K gene product is not needed for replication of the virus in
tissue culture. It is believed to be important in the viral
occlusion process but this has not yet been experimentally shown.
Disruption of the gene leads to few polyhedra (FP) phenotype.
FEATURES from to/span description
pept 115 753 25K protein
recomb 63 64 IFP1.6 insertion site
recomb 586 587 IFP2.2 insertion site
BASE COUNT 334 a 151 c 165 g 250 t
ORIGIN Map position 36.6 to 37.5.
1 aagaaaacat gattacgtct gtacacgaac gcgtattaac gcagagtgca aagtataaga
61 gggttaaaaa atatatttta cgcaccatat acgcatcggg ttgatatagt taatatggat
121 caatttgaac agttgattaa cgtgtctctg ctcaagtcgt tgatcaaaac gcaaatcgac
181 gaaaatgtgt cggacaatat caagtcgatg agcgaaaaac taaaaaggct agaatacgac
241 aatctcacag acagcgttga gatatacggt attcacgaca gcaggctgaa taataaaaaa
301 attagaaact attatttaaa aaagatttgc gctttactcg acctaaactt taaacacgtc
361 atagaatctt cgtttgacaa aaaccacatt gtggccaagc tgtgtgacgc gacgcgcgct
421 aaagaatggc aaaccaagtc gcgcgagcgt cgacttaaaa attttaattt aaacattaat
481 tacgacgggc ccgtaaaaat atttgtggcc gccacagcgg agcaaaagct gttacttaaa
541 aaaactcgcg acgctctgtt gccgttttac aaatacattt ccatttgtaa aaacggtgtt
601 atggtgagac gcgacgaaaa gagtcgcgtg tttattgtta aaaatgagca aaatatagaa
661 tatctcaaag ccaacaaata ttacgctttt catagcgata gcgtcgataa ttttgagtct
721 gaaaacgata gcgaaaaaat gcttcaaatt taattaaatt taattaataa ttaattttag
781 tatcaagttg aatctcgtaa tgtaagaaaa aaaacactaa tattggttgt gattcttgcc
841 agacgtccat tttatgtgat aaataaaaca caaacacaaa gtttttatat tgtcttttat
//
LOCUS SHPHSKB3C 413 bp ds-DNA MAM 17-JUL-1990
DEFINITION Sheep BIIIB4 high sulfur keratin gene, 3' end.
ACCESSION M21101 J03062
KEYWORDS BIIIB high-sulphur wool keratin.
SOURCE Sheep (Dorset horn strain) DNA, clone pSWK31.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 413)
AUTHORS Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E.
TITLE The keratin BIIIB gene family: Isolation of cDNA clones and
structure of a gene and a related pseudogene
JOURNAL Genomics 4, 182-191 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
M.J.Frenkel, 24-OCT-1989.
FEATURES from to/span description
pept < 1 51 BIIIB4 high-sulfur keratin (AA at 1)
BASE COUNT 112 a 99 c 84 g 118 t
ORIGIN 15 bp upstream of PvuII site.
1 actacctaca ctcagcccag ctgtgagccc tgcatcccaa gctgctgctg accgacggct
61 acctcaccca gtgcctgaca gagtcaaccc agaagcttta gtgctcacct gtctcagtac
121 ctgcaactaa ttatgtctcc gctttcaaag ttggaacaag gcattatcac agacaaccct
181 cacaaaaaac aaaccaagag actttcgatg gccgtgtagt ggacatcagt gaacaaggac
241 agctggagta ggtagatgcc tacaggtttc ccagagttgt tcagttcgtt agtgttaaat
301 tgtatctttc tttggtgctt tgggaaattc gtttccagtc ttgaatggta tctttctgga
361 aattaaggaa cttcttcatg attattcaat aaagtttaca tctctggtat aat
//
LOCUS SHPHSKB3D 1080 bp ss-mRNA MAM 17-JUL-1990
DEFINITION Sheep BIIIB3 high-sulfur keratin pseudogene mRNA.
ACCESSION M21102 J03062
KEYWORDS BIIIB high-sulphur wool keratin.
SOURCE Sheep (Dorset horn strain), cDNA to mRNA, clone psi-BIIIB3A.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1080)
AUTHORS Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E.
TITLE The keratin BIIIB gene family: Isolation of cDNA clones and
structure of a gene and a related pseudogene
JOURNAL Genomics 4, 182-191 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
M.J.Frenkel, 24-OCT-1989.
FEATURES from to/span description
pept.ps / 191 459 BIIIB3 high-sulfur keratin
BASE COUNT 310 a 226 c 195 g 349 t
ORIGIN 74 bp upstream of AvaII site.
1 aaatgtcaaa ttgattataa aaattttagt ggaaaacggt ggcataaata aaaaagagag
61 gaaaactttt ggtggaccaa cacctgatga agcggatata taaagagccc caaagtgaga
121 ggagacattt cgtcctggat agttcgtctt tcactccaag atgccaaaag aaaccaaggt
181 tcccggtgcc acggcttgct gagctcccct ctgctgcagt gttcccacca gccccgccac
241 tactatctgc tcctctgaca aattctgggc atgagtctgc ctgcccagct cctgctcaca
301 tatggtttgg ttactggaga caacctgctg tgacagctgc ccaaagcctg cacattcctc
361 aaccctatgt gccaacctgc ttcctgctca actctaccca gcccaccctg gaccaggaaa
421 ccatcagcct cacagccctg tatcccaagc tgctgctaac tgatgtttga ctcactcagt
481 gcctggcaat aacacagaag ctatctgttt ggcattcact tgctcagtat tttatcacat
541 attgaggtag acccaatggc atagatatgg aaggcctact ttttatctta atggaaagaa
601 aataaataaa ctttatgctt atttggctaa caattttttc atttgggcag gtgaatgtca
661 tctatgtcaa aatagtatta aagtctataa gacttcagac cctattttat tggtcatatt
721 gcttcctgga ccccatttat tgtattcggg attttcatag aggagaaata attttgatgg
781 ttttcctaat aagctatttc tgtggcaaaa aagagaaaga gatgtgaaat tttgttgagt
841 gctttttctg tatctatgga agtaattatt tagttttttt tatttaatct tttacatttt
901 aaagtttata actatgttca ttttgattga ttttaaaaat gttaaaccaa ccttcttttc
961 ctacgaaacc cagattgata taatgtattt tccctttcat acatgattga ctttgatttg
1021 ctacattaaa aacatttatt tatttggcta cactggatct tagttgcagc agatgagatc
//
LOCUS SHPHSKB3E 1047 bp ss-mRNA MAM 17-JUL-1990
DEFINITION Sheep BIIIB4 high-sulfur keratin mRNA, complete cds.
ACCESSION M21103 J03062
KEYWORDS BIIIB high-sulphur wool keratin.
SOURCE Sheep (Dorset horn strain), cDNA to mRNA, clone BIIIB4.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1047)
AUTHORS Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E.
TITLE The keratin BIIIB gene family: Isolation of cDNA clones and
structure of a gene and a related pseudogene
JOURNAL Genomics 4, 182-191 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
M.J.Frenkel, 24-OCT-1989.
FEATURES from to/span description
pept 289 585 BIIIB4 high-sulfur keratin
BASE COUNT 298 a 294 c 193 g 262 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttgtgt tccagtgaca aatagcaaga gtaatttgtt cagatttagc agtgaactca
61 aacaaaggtt aattaggaaa ataaagactc tttacaagca tctaaatgtc aaaccagtca
121 aaggataatg ggtacactaa cataaacaca caggaaagaa gccttccggt tggaccaaca
181 ccactggaga gggtatataa gagcctcaga gcagaaagag agattctcac gccaggagac
241 tcgtcttctt taaaaccaaa tcaataaaaa acacagcttc ccaacaccat ggcttgctgt
301 gcccgcctct gctgcagcgt ccccaccacc cccgccacca ccatctgctc ctctgacaaa
361 ttctgcagat gtggagtctg tctgcccagc acctgcccac acacagtctg gttcctggag
421 ccaacctgct gtgacaaccg ccccccacct tgccacattc ctcagccctc tgtgcccacc
481 tgcttcctgc tcaactcttc ccagcccacc ccaggcctgg aaagcatcaa cctcacaacc
541 tacactcagc ccagctgtga gccctgcatc ccaagctgct gctgaccgac ggctgcctca
601 cccactgcct gacagagtca acccagaagc tttagtgctc acctgtctca gtacctgcaa
661 ctaattatgt ctccgctttc aaagttggaa caaggcatta tcacagacaa ccctcacaaa
721 aaacaaacca agagactttc gatggccgtg tagtggacat cagtgaacaa ggacagctgg
781 agtaggtaga tgcctacagg tttcccagcg ttgttcagtt ccttcgtgtt aaattgtatc
841 tttctttggt gctttgggaa ttctgtttcc agtcttgaat cgtatctttc tggaaattga
901 ggagcttctt catgattatt ctaataaagt ttccatctct ggcataacat aaatgtctat
961 aggtatttcc atttattttt gttaacacat caaatctata tcttaactcc aaaggacacc
1021 attttaacaa actgagaaca tctgcag
//
LOCUS HUMRARG2 250 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human retinoic acid receptor gamma 2 mRNA, 5' end.
ACCESSION M32074
KEYWORDS retinoic acid receptor gamma.
SOURCE Human fetal skin, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 250)
AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
Leroy,P., Staub,A. and Chambon,P.
TITLE Murine isoforms of retinoic acid receptor gamma with specific
pattern of expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
Chambon,P., 13-FEB-1990, for release after publication.
FEATURES from to/span description
pept 62 > 250 retinoic acid receptor gamma 2
mRNA 1 > 250 retinoic acid receptor gamma 2 mRNA
BASE COUNT 38 a 77 c 87 g 48 t
ORIGIN Chromosome 12.
1 ggggaatcgg ctgcgggtcc ttggtgtttc cagcacccag tttcccttaa gccgggtcgc
61 gatgtacgac tgtatggaaa cgtttgcccc gggtccgcga cggctgtacg gggcggccgg
121 gcccggggcc ggcttgctgc gcagagccac cggcggctcc tgtttcgccg gacttgaatc
181 ttttgcctgg ccgcaacccg ccagcctgca atcggtggag acacagagca ccagctcaga
241 ggagatggtg
//
LOCUS MUSRARG1 620 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse retinoic acid receptor gamma 1 mRNA, 5' end.
ACCESSION M32068
KEYWORDS retinoic acid receptor gamma.
SOURCE Mouse F9 cell, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 620)
AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
Leroy,P., Staub,A. and Chambon,P.
TITLE Murine isoforms of retinoic acid receptor gamma with specific
pattern of expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
Chambon,P., 13-FEB-1990, for release after publication.
FEATURES from to/span description
pept 399 > 620 retinoic acid receptor gamma 1
mRNA 1 > 620 retinoic acid receptor gamma 1 mRNA
BASE COUNT 108 a 217 c 192 g 102 t 1 others
ORIGIN Chromosome 15.
1 attttggggc ggtcacgtgg gcgggcgggc tccgagaggc cccgggattg tcccagccta
61 gagccgtgcc cccctgaagc cccccattac cgcgagtcac taacaccgcg gtgctccatc
121 cccgagaccg cccgacgccg ggacctgggg ctctgcggcc ttccttcccc cgccctcccc
181 tccagcagtt tccaccaggt ccctcacctc agcctggccc agtatgtagg agggactctc
241 tgcangaggc cagagggatc cttggaaacc cactggacag accaggcagg gtgggcacgg
301 agcctcccag gccagggcag tgggcatggg cgggggctgt agctgaagac cacccccgcc
361 tgctgcagag tccaagggat tcccacgccg cagctaccat ggccaccaat aaggagagac
421 tctttgcgcc cggtgccctg gggcctggat ctggttaccc aggagcaggc ttcccattcg
481 ccttcccagg tgcactcaga gggtcgccac catttgagat gctgagccct agcttccggg
541 gcctgggcca gcctgacctc cccaaggaga tggcttctct ctcggtggag acacagagca
601 ccagctcgga ggagatggta
//
LOCUS MUSRARG2 479 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse retinoic acid receptor gamma 2 mRNA, 5' end.
ACCESSION M32069
KEYWORDS retinoic acid receptor gamma.
SOURCE Mouse F9 L cell, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 479)
AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
Leroy,P., Staub,A. and Chambon,P.
TITLE Murine isoforms of retinoic acid receptor gamma with specific
pattern of expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
P.Chambon, 13-FEB-1990, for release after publication.
FEATURES from to/span description
pept 291 > 479 retinoic acid receptor gamma 2
mRNA 1 > 479 retinoic acid receptor gamma 2 mRNA
BASE COUNT 75 a 163 c 143 g 98 t
ORIGIN Chromosome 15.
1 tctaggcttt gcttttacaa aaggtctcca cagagctagc tgcctaggca ccctaagtga
61 ggcagactga ggcagaggct acagcttcgc cttcgcgccc cgcccagttc ttttcccttg
121 ctggggatcc ccctcctccc cgcccatccc ttacccccca tgcatctcag cgccctacgc
181 tagccctccc cctcccccct ccaggagcgg ggcgctgcgg ggggaggagg gggaatgggc
241 tgcgggtcct tggtgttcta gcacccagtt tctctccaag ccgggtcgcg atgtacgact
301 gcatggaatc gtttgtcccg ggtccgcgac ggctgtatgg ggcggccggg cccggggccg
361 gcttactacg cagagccact ggcagctcct gcttcgccgg acttgagtct tttgcctggg
421 cacagcctgc cagtctacaa tcggtggaga cacagagcac cagctcggag gagatggta
//
LOCUS MUSRARG3 646 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse retinoic acid receptor gamma 3 mRNA, 5' end.
ACCESSION M32070
KEYWORDS retinoic acid receptor gamma.
SOURCE Mouse 11.5 day embryo F9 L cell, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 646)
AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
Leroy,P., Staub,A. and Chambon,P.
TITLE Murine isoforms of retinoic acid receptor gamma with specific
pattern of expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.Chambon, 13-FEB-1990, for release after publication.
FEATURES from to/span description
pept 496 > 646 retinoic acid receptor gamma 3
mRNA < 1 > 646 retinoic acid receptor gamma 3 mRNA
BASE COUNT 151 a 150 c 184 g 161 t
ORIGIN Chromosome 15.
1 tccccacccc caccgccacc aactgcttcc agggaactct gggattacta gagtcaggag
61 tgagccctaa ccttttcagt tctatgcccc tccccgcccc ctttaaaaat gtgtatggtg
121 ttctgtctat atgtattttc gcgtgccatt tttgtgcctg gtgcccatgg aagatggaag
181 agggaaccgg ttctgttaga actggcgatt acaaatggtt gtaaactacc atgtagatgc
241 tgggaatgga accctggtcc tctgggagag cagccactgc tcttacctgc tgagtcccaa
301 ccaatcttca actttatgga gcagaagcag agaagttaac tccttggcct ggaatttgca
361 gaagtgaacg ttaatgtaga agagttggct ttatgggggt ggggatgggg tagggggcag
421 tggtggggcc tgaaatccca acaagctaca aagagtggtg gtctgggctt tccagggagt
481 acctgttaag ggcttatgca caagggtgac aacagcggtc accagcaggt cccaagaaag
541 agaggccatg ggatgagggt gcttctgctc agcttctgct tatcttctca tgctgctttt
601 cattcagcgg tggagacaca gagcaccagc tcggaggaga tggtac
//
LOCUS MUSRARG4 309 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse retinoic acid receptor gamma 4 mRNA, 5' end.
ACCESSION M32071
KEYWORDS retinoic acid receptor gamma.
SOURCE Mouse 9.5 day embryo, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 309)
AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
Leroy,P., Staub,A. and Chambon,P.
TITLE Murine isoforms of retinoic acid receptor gamma with specific
pattern of expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.Chambon, 13-FEB-1990, for release after publication.
FEATURES from to/span description
pept 300 > 309 retinoic acid receptor gamma 4
mRNA < 1 > 309 retinoic acid receptor gamma 4 mRNA
BASE COUNT 77 a 81 c 100 g 51 t
ORIGIN Chromosome 15.
1 gatgacaatc gtaaggcccc caaatatcca tcttccccct ctctgtaatg ggagatgcca
61 ggcttggggc ccaggaaccc ctttccagag cggcaataaa aagctgggga tggagaggca
121 gtaaaaataa agtccagatg agaagcggct ttgaagcctg ggccacagtg cctggcgggg
181 gtgggctgtt ggaaggggct gagagaggag cctgggacac cccccccccc aaatttactg
241 ggggttggca ccgcactcgc agtctagcgg tggagacaca gagcaccagc tcggaggaga
301 tggtaccca
//
LOCUS MUSRARG5 111 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse retinoic acid receptor gamma 5 mRNA, 5' end.
ACCESSION M32072
KEYWORDS retinoic acid receptor gamma.
SOURCE Mouse F9 cell, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 111)
AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
Leroy,P., Staub,A. and Chambon,P.
TITLE Murine isoforms of retinoic acid receptor gamma with specific
pattern of expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
Chambon,P., 13-FEB-1990, for release after publication.
FEATURES from to/span description
pept 102 > 111 retinoic acid receptor gamma 5
mRNA 1 > 111 retinoic acid receptor gamma 5 mRNA
BASE COUNT 23 a 26 c 37 g 25 t
ORIGIN Chromosome 15.
1 tgaagcttct ccttgggggc tggattttgg attgaagagt cagcccctgg caccttgggg
61 atatccttgc ggtggagaca cagagcacca gctcggagga gatggtaccc a
//
LOCUS MUSRARG6 420 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse retinoic acid receptor gamma 6 mRNA, 5' end.
ACCESSION M32073
KEYWORDS retinoic acid receptor gamma.
SOURCE Mouse 9.5 day embryo F9, P19, and L cells, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 420)
AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
Leroy,P., Staub,A. and Chambon,P.
TITLE Murine isoforms of retinoic acid receptor gamma with specific
pattern of expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
Chambon,P., 13-FEB-1990, for release after publication.
FEATURES from to/span description
pept 407 > 420 retinoic acid receptor gamma 6
mRNA 1 > 420 retinoic acid receptor gamma 6 mRNA
BASE COUNT 84 a 121 c 128 g 87 t
ORIGIN Chromosome 15.
1 tgaggcagcg ctctgtgggt agggagggcg tcagtgcagg agagagtacc acccacgctc
61 aggcctgtgg ggaccccagg atgggctgaa agtgagggcc ggaaaggcct tccaggcttc
121 cccaaacctc ccagcaccta ccattcaggc aaccccaccc ccagtcttga ataaactccc
181 tgcacccttc cgcccccttt cttttgaggg ggaatctaac tccagcagga ttcttatgct
241 aattgggtgc gtggggggtg gggtgggtgg agaaggcttc cctcttgtaa ggtggggagc
301 tggtctggaa cccccaacct ataggctctt ctgtcctctc actaccttgg gtctcagtat
361 ggacttgtga ccagcggtgg agacacagag caccagctcg gaggagatgg tacccagctc
//
LOCUS PMGMTF1 190 bp ds-DNA ORG 17-JUL-1990
DEFINITION P.magellanicus mitochondrial fragment I DNA.
ACCESSION M34468
KEYWORDS .
SEGMENT 1 of 3
SOURCE P.magellanicus mitochondrial DNA.
ORGANISM Mitochondrion Placopecten magellanicus
Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia;
Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea;
Pectinidae; Placopecten magellanicus.
REFERENCE 1 (bases 1 to 190)
AUTHORS La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E.
TITLE Molecular characterization of a repeat element causing large-scale
size variation in the mitochondrial DNA of the sea scallop
Placopecten magellanicus
JOURNAL Mol. Biol. Evol. 7, 45-64 (1990)
STANDARD simple staff_review
BASE COUNT 66 a 63 c 14 g 47 t
ORIGIN
1 ctttcaaaca caaaaaacac aaaaagctcg ccaataaact ttccaatttt ttaacactac
61 agctaagttt tccccattct ttcacctcaa gcgaagggaa atctatatcc ccactcccaa
121 caatctactg gcatactact tatactacca aagaaactca tctacccccc atccacccca
181 cattggtacc
//
LOCUS PMGMTF2 1449 bp ds-DNA ORG 17-JUL-1990
DEFINITION P.magellanicus mitochondrial fragment II DNA.
ACCESSION M34466
KEYWORDS .
SEGMENT 2 of 3
SOURCE P.magellanicus mitochondrial DNA.
ORGANISM Mitochondrion Placopecten magellanicus
Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia;
Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea;
Pectinidae; Placopecten magellanicus.
REFERENCE 1 (bases 1 to 1449)
AUTHORS La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E.
TITLE Molecular characterization of a repeat element causing large-scale
size variation in the mitochondrial DNA of the sea scallop
Placopecten magellanicus
JOURNAL Mol. Biol. Evol. 7, 45-64 (1990)
STANDARD simple staff_review
BASE COUNT 405 a 403 c 193 g 448 t
ORIGIN About 500 bp after segment 1.
1 ctgcagcttt ttttttcctt ccccgaaatt ttacccccca ataaaaatgg ataaaaaccc
61 cctccaaaaa ataagaaaaa accggaaaac gcaattttcg tgtaaaaaaa aggtgttttt
121 cgaaaaaaaa aaataaaatc tgtacctcgc cgaccaaaaa actttttcat tttttcaaca
181 aatttttcaa ctttttaatc atcaaaccga gcacttttta taaacataaa ccacgaccaa
241 ctgaagggaa aaaaaacact ttattttggg cacttttctc aaaaaagctt cagctagaat
301 tgccagaaat ccttgcccaa attcgacaaa ttttaggtac ctttgttttt actgcgaagg
361 taaagttaaa cttcgcccaa tcccctctac cagcgagggg ggaataatcc ccactttcag
421 gctataagcc taacgttctt actacactac ctcgaccccc cattcctact gcccagcccc
481 ttctctcact aaaccatcca ccccccactt tccaattttt ttaacactac agctaagttt
541 tccccattct ttcacctcaa gcgagaaggg aaatctatat ccccactccc aacaatctac
601 tggcatacta cttatactac caaaagaaac tcatctaccc cccatccacc ccacattggt
661 accagaacgt tcctacaaac cctttcttcg ctataggggg aaggaacctt cttccttccc
721 ctcggacccc ttcctcgccc cctcccccct cccatcccca acccagctat cctttcccct
781 cccccctagc ccccctacac gcttaactgg tagtcccccc ccttcctccc cccctccctg
841 tgcgggtcaa cgaacctgtc tctactccgc tcttgatcaa acctcgaaaa aaaaaatctt
901 tttttttttt tttttttcgg tctccaagga ctccgtggcc gagaccagga ttctaccctg
961 tagggaaagt ggagggtaag gggttaaata tttatcccgg gcggggggaa cccggaatta
1021 tcctttataa ccaatttcaa cccaatttat ggctccgtac caaaacgcta tttgccccac
1081 ccccctctga ggctttttga ttcagaaaac acgagattta gtgatatttt tcataatatg
1141 gcacaagcaa ttcactgaag gcctaaattt ccgaccctgg aacaattttt ttcaaaaatt
1201 tgacatctaa aagtcacctg aatttacagc cttttttttg attttcaaat ttttgccgtt
1261 tttggacccc gttttttttg actttttttt tgtgtttttt tgtgtttttt tttcaccctt
1321 ttttttgcca aaaattacca tttttgcttg gctacaataa gtttacaaag ccggctataa
1381 aaaattttta gaaaaacccg aaaatatcag ttttgagttt tgaaatgaac tttttttttt
1441 tgactgcag
//
LOCUS PMGMTF3 746 bp ds-DNA ORG 17-JUL-1990
DEFINITION P.magellanicus mitochondrial fragment III DNA.
ACCESSION M34467
KEYWORDS .
SEGMENT 3 of 3
SOURCE P.magellanicus mitochondrial DNA.
ORGANISM Mitochondrion Placopecten magellanicus
Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia;
Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea;
Pectinidae; Placopecten magellanicus.
REFERENCE 1 (bases 1 to 746)
AUTHORS La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E.
TITLE Molecular characterization of a repeat element causing large-scale
size variation in the mitochondrial DNA of the sea scallop
Placopecten magellanicus
JOURNAL Mol. Biol. Evol. 7, 45-64 (1990)
STANDARD simple staff_review
BASE COUNT 260 a 213 c 99 g 174 t
ORIGIN About 500 bp after segment 2.
1 ctgcagcttt ttttttcctt ccccgaaatt ttacccccca ataaaaatgg ataaaaaccc
61 cctccaaaaa ataagaaaaa accggaaaac gcaattttcg tgtaaaaaaa aggtgttttt
121 cgaaaaaaaa aaataaaatc tgtacctcgc cgaccaaaaa actttttcat tttttcaaca
181 aatttttcaa ctttttaatc atcaaaccga gcacttttta taaacataaa ccacgaccaa
241 ctgaagggaa aaaaaacact ttattttggg cacttttctc aaaaaagctt cagctagaat
301 tgccagaaat ccttgcccaa attcgacaaa ttttaggtac ctttgttttt actgcgaagg
361 taaagttaaa cttcgcccaa tcccctctac cagcgagggg ggaataatcc ccactttcag
421 gctataagcc taacgttctt actacactac ctcgaccccc cattcctact gcccagcccc
481 ttctctcact aaaccatcca ccccccactt tcctccggga aaaaaaaaaa aaagaaggga
541 tctcatataa ccccatctta ccgctccggc acaaccaata gacgattcac aagtagagcc
601 cagccctcct tgcgacccta accctgaaag taccgaagag acccgcaacc atcacaaaca
661 aaaacatccg gcctaaagga accaaacccc aacaccagtc cgaggggtta tcgtttcaat
721 accgcatgca tagaacaaac ctgcag
//
LOCUS HUMCREB 1026 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human active transcription factor CREB mRNA, complete cds.
ACCESSION M34356
KEYWORDS active transcription factor CREB; alternative splicing.
SOURCE Human T-cell, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1026)
AUTHORS Berkowitz,L.A. and Gilman,M.Z.
TITLE Two distinct forms of active transcription factor CREB
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Z.Gilman, 16-MAY-1990.
FEATURES from to/span description
pept 1 1026 active transcription factor CREB-B
pept 1 261 active transcription factor CREB-A
304 1026 active transcription factor CREB-A
BASE COUNT 332 a 249 c 233 g 212 t
ORIGIN
1 atgaccatgg aatctggagc cgagaaccag cagagtggag atgcagctgt aacagaagct
61 gaaaaccaac aaatgacagt tcaagcccag ccacagattg ccacattagc ccaggtatct
121 atgccagcag ctcatgcaac atcatctgct cccaccgtaa ctctagtgca gctgcccaat
181 gggcagacag ttcaagtcca tggagtcatt caggcggccc agccatcagt tattcagtct
241 ccacaagtcc aaacagttca gtcttcctgt aaggacttaa aaagactttt ctccggaaca
301 cagatttcaa ctattgcaga aagtgaagat tcacaggagt cagtggatag tgtaactgat
361 tcccaaaagc gaagggaaat tctttcaagg aggccttcct acaggaaaat tttgaatgac
421 ttatcttctg atgcaccagg agtgccaagg attgaagaag agaagtctga agaggagact
481 tcagcacctg ccatcaccac tgtaacggtg ccaactccaa tttaccaaac tagcagtgga
541 cagtatattg ccattaccca gggaggagca atacagctgg ctaacaatgg taccgatggg
601 gtacagggcc tgcaaacatt aaccatgacc aatgcagcag ccactcagcc gggtactacc
661 attctacagt atgcacagac cactgatgga cagcagatct tagtgcccag caaccaagtt
721 gttgttcaag ctgcctctgg agacgtacaa acataccaga ttcgcacagc acccactagc
781 actattgccc ctggagttgt tatggcatcc tccccagcac ttcctacaca gcctgctgaa
841 gaagcagcac gaaagagaga ggtccgtcta atgaagaaca gggaagcagc tcgagagtgt
901 cgtagaaaga agaaagaata tgtgaaatgt ttagaaaaca gagtggcagt gcttgaaaat
961 caaaacaaga cattgattga ggagctaaaa gcacttaagg acctttactg ccacaaatca
1021 gattaa
//
LOCUS PFAATUBII 3706 bp ds-DNA INV 17-JUL-1990
DEFINITION P.falciparum alpha-tubulin II gene, complete cds.
ACCESSION M34390
KEYWORDS alpha-tubulin II.
SOURCE P.falciparum haploid intraerythrocytic cell line K1 (Thailand) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 2779)
AUTHORS Holloway,S.P., Gerousis,M., Delves,C.J., Sims,P.F.G.,
Scaife,J.G. and Hyde,J.E.
TITLE The tubulin genes of the human malaria parasite Plasmodium
falciparum: Their chromosomal location and sequence analysis of the
alpha-tubulin II gene
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 2780 to 3706)
AUTHORS Holloway,S.P., Gerousis,M., Delves,C.J., Sims,P.F.G.,
Scaife,J.G. and Hyde,J.E.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Hyde, 22-MAY-1990.
FEATURES from to/span description
pept 749 810 alpha-tubulin II, exon 1
1022 1142 alpha-tubulin II, exon 2
1262 2431 alpha-tubulin II, exon 3
IVS 811 1021 alpha-tubulin II intron A
IVS 1143 1261 alpha-tubulin II intron B
BASE COUNT 1390 a 347 c 502 g 1467 t
ORIGIN Chromosome 4.
1 gaattcctta tatatgtact cgttttatat ttttttccta taatttcaac atattaaaaa
61 gtactgaata taacattata tatttacata ttcaagttat atttattttc tttttctttt
121 ttcttgatat attatattat ttgattattt ttttttcttt taatatgtgg cttatattta
181 caaaaatagt aatatatata tgttaatatg taaagaaaca gaaatgccat tgtatgaaat
241 ataaaagttt caattttttt aaaaaatata aaaaaagaaa gaaaataaca aataaaaata
301 aatatattat atatatgtgt ttatattttt atttttatat ataattatat tttttcttta
361 tgatatatat tattatatct ttataaagaa gacataaaag aaacagattg attgaagttt
421 ttcttgcttt aatataagaa agtaatttca taatgaaaat aatatatata tatgtatatg
481 atattttaat ttataattat aatgagtttg taaaaatata ttttatataa gtttatatgt
541 attttttata tatatatata tatatatata tggctttgga taatataata tatatatata
601 aataaataaa tatatatata tatatatata tatatatgta tatttattta catatttatt
661 taatataggt acaaacatag acctacatat ttttatattt atttatttta ttttatatta
721 acatttttta tctaacgttt tacgaacgat gagagaagtc attagtattc atgttggaca
781 ggctggtatt caaataggaa atgcttgctg gtaaaaaaaa aaaaaaaaaa aaaaaaaaaa
841 aaaaaatata tatatatata tataaatgca tgtgtgttgt atatttttct tcttcatttt
901 tatttttaaa gagatacata atgtttaatg tattattcaa tgtgttgtgg taaaaacatg
961 tgtataaatt ataattaaaa atatacatat atatatatat atatatattt tttttttata
1021 gggaattgtt ttgccttgaa catggaattc aaccggatgg gcagatgcca agtgaccaag
1081 tcgttgctgg tggtgatgat gcctttaata catttttctc agaaacggga gctggaaaac
1141 atgtaataat aaaaaaaaaa aaaaaaaaat atatacatat atatatatat atatatatat
1201 atttatttat ttatttattt acttatacat gtgtgcttct ttttgtcgtt actcatttta
1261 ggtaccacgt tgtgtgttcg ttgatttaga acccaccgtc gttgacgaag ttcgaacagg
1321 aacgtatcgt cagctgtttc accctgaaca actaatatct ggaaaagagg atgcagcaaa
1381 taatttcgca aggggacatt ataccatagg aaaagaaatt gttgatgtat gtttggatag
1441 ggttcgaaag ttggctgata attgcactgg attacaagga tttttgatgt ttaatgcagt
1501 aggtggaggt acaggtagtg gtcttggttg tttattatta gaaaggttgg caatagatta
1561 tggaaagaaa tcaaaattaa atttttgttc gtggccatct cctcaagtat cgacagctgt
1621 tgtagagcct tataattctg tattatcaac acattcattg ttagaacata cagatgtggc
1681 aattatgctc gataacgaag caatatatga tatatgtaag aaaaatttag atatagaaag
1741 gccaacctat actaacttga atagattgat tgctcaagtt atctcttcat taacagcatc
1801 tttaagattt gatggtgctt tgaatgttga tgtaacagaa tttcagacta atttagtacc
1861 atatcctaga attcacttta tgttatcatc atatgctcca atcataagtg ctgagaaggc
1921 atatcacgag caattgtcgg tttctgaaat aacgaattct gcctttgagc ctgcatctat
1981 gatggcaaag tgtgatccca gacatggaaa atatatggct tgttgtttaa tgtatagagg
2041 agatgtagta ccaaaggatg ttaatgctgc cgtcgcaact attaagacta agagatctat
2101 acaattcgtt gattggtgtc ctacgggatt taaatgtgga atcaattatc agccccctac
2161 cgtggttcca ggaggagatt tagcaaaagt tatgagagct gtttgcatga tcagcaactc
2221 aacagcaatt gccgaagtat tctcacgaat ggaccaaaaa tttgatttga tgtatgcgaa
2281 aagggctttc gttcattggt atgtaggtga aggtatggaa gaaggagaat ttagtgaagc
2341 tagagaggat ttggctgcct tggaaaagga ttatgaagag gtaggaatag aatcgaatga
2401 tggagaggga gaagatgagg gatatgaatg aacataaata taaatggata aataaataaa
2461 tatatatata tatatatata tatatatata tatatatatg tatatctact tataattatt
2521 tttttttttt atctcatacg tcctatattc ccaaaatatt atatatatgt tcataataat
2581 tatatggaac atatatatcc tcttatcata ttgtatattt attcgttgta aaaaataaaa
2641 aaaaatatac aagaatatta atattttttt ctatgtattt acaccttcat tagtttaata
2701 ttcaagaaaa ataatactta ctagaaataa ataaaatata tatatatata tatatgtatg
2761 tatatatgtg tgtatgtacc attaattagt aagtagtaat ttattatgca atttatttat
2821 tatatatata tatatatata tatatatata gtatcaaatt aaatatacga caattcgtaa
2881 gtacattctt gaaaaaataa atgtatagat ataatattat tttttttata caaatttata
2941 tttgtagaat atatattttt ttccctttat tattatgcat tatataataa tatatacatg
3001 ataatatgca tacatataca tattatatat atatatgtat gtatgtatat atgtcgtgtg
3061 ataatttatc tttatgctaa taatacatta cacatataaa tacttattta tatctatata
3121 tatttttttt atatacgaca ataatttatg taaacattaa gttacatatt aaagttcgat
3181 taataaatat ataataattt tttttatttt attatatcta tttttccaat gtataaataa
3241 tatatgaaat atactatatg ctatataata tgagcacaca taagatttgg tgttatataa
3301 atgtagaagt aatattaata gtatcagacg aaaagaacat tttttttttt tttttttttt
3361 tttttttggt ccttttaaat tgatataaaa aactaaactt tagagtttct ttaaaatgag
3421 acatttttta caacttaata taatttaatt tttttttttt tttttttcat atttattaga
3481 aatgaatata taggttaaca catatatatt atatatatat atattaacat gaaaaggtta
3541 aaaaaaaata tggaaaatac aaacatataa ttatatacat atatattata tatatgtatg
3601 tatctttttt atgttttaat ttggtcacgt ggttaatttt atatgatgct aaaaaaattt
3661 tcttccattt aagaaacagg aattaacaaa aaatttaagg gaattc
//
LOCUS XELFIMB1 1341 bp ss-mRNA VRT 17-JUL-1990
DEFINITION X.laevis integumentary mucin B.1 (FIM-B.1) mRNA, complete cds.
ACCESSION J02910
KEYWORDS integumentary mucin; mucin.
SOURCE X.laevis skin, cDNA to mRNA, clones pREP1218 and pXGL-3'-1.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1341)
AUTHORS Probst,J.C., Gertzen,E.-M. and Hoffmann,W.
TITLE An integumentary mucin (FIM-B.1) from Xenopus laevis homologous
with von Willebrand factor
JOURNAL Biochemistry 29, 6240-6244 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Hoffmann, 26-APR-1990.
FEATURES from to/span description
pept < 1 1197 integumentary mucin (FIM-B.1; AA at 1)
mRNA < 1 1341 FIM-B.1 mRNA
signal 1319 1324 poly-A signal
rpt 1 279 repeat sequence
BASE COUNT 472 a 290 c 268 g 311 t
ORIGIN
1 cctgcacctt cagaaacaac aggagagtca acacctgcac cttcagaaac aacaggagag
61 tcaacacctg caccttcaga aacaacagga gagtcaacac ctgcaccttc agaaacaaca
121 ggagagtcaa cacctgcacc ttcagaaaca acaggagagt caacacctgc accttcagaa
181 acaacagtac catcaggaga gtcaacacct gcaccttcag aaacaacagt accatcagta
241 ccatcaggag agtcaacacc tgcaccttca gaaacaacag agctgagaat catcccacca
301 gaagtatcta cagttgcagt gccagtaaca actggacaga taactcctgc agtaacaact
361 gaacacagca cagaagaaat actgacactt cctccgccag tggtaggacc agttcttcca
421 gcaaagccaa cagtagatat ctccaaatat acaaacacta caactactaa atcgactgtc
481 cctacaacta ccattcctcc taaagctact tgctgtggat cgtctggaga gtcagtacaa
541 gcaggtcata tgtggcaaac tggttgtgat gtgtgcacat gcaatggaac ctctggaaaa
601 acacaatgtg caccacgcca gtgtgaaaag gagataatct gtaagagcga cgaaaggagg
661 gtactgagaa aacctgggaa atcatgctgt ggatattgcg agccgctgac atgtaaacat
721 aatggaacag aatataagct tggggctacc tttatagaca aatcgaatcc ttgtatcact
781 tatagatgtg acgcatcagg cttgacagtg aatgttaaat catgcccaaa tgaacaagtt
841 tgcagtaaga gtgaaagaac ctatgattca gatggctgtt gtttctcatg tgacacatca
901 tgtaaaccag tccctgcgac agtgggtatc caaggggagt acgattatca gaatgaaaaa
961 acaaactgct ctgctaatat cataatggca aaatgttctg gtcaatgcca acataaattg
1021 acctatgata caattgacaa taaagttgtt accaaatgtc ggtgttgtaa ggcagaccgt
1081 gttgaaccaa gaaaagctca tctggtgtgt gataatggaa agaaaaaaat atataaatac
1141 aaacatatca catcttgcaa atgcacatct tgtaccgctt acaatataag actgtgaaga
1201 ctttcatgca tttgctaact gtcagctttt gcatgtacaa taaaataaac atgggggata
1261 tattggaggc agagatatct gttatggtac ataaagtatt ttttgtctgt tattgaaaaa
1321 taaatggttt ttttcagctg t
//
LOCUS YSCANBI 2156 bp ds-DNA PLN 17-JUL-1990
DEFINITION S.cerevisiae ANB1 locus encoding protein synthesis initiation
factor (eIF-4D) gene, complete cds.
ACCESSION J05455
KEYWORDS protein synthesis initiation factor eIF-4D.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2156)
AUTHORS Mehta,K.D., Leung,D., Lefebvre,L. and Smith,M.
TITLE The ANB1 locus of Saccharomyces cerevisiae encodes the protein
synthesis initiation factor eIF-4D
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Dr. K.D.Mehta, 16-MAR-1990.
FEATURES from to/span description
pept 569 1042 protein synthesis initiation factor (eIF-4D)
mRNA 516 > 1042 protein synthesis initiation factor mRNA
(alt.)
mRNA 526 > 1042 protein synthesis initiation factor mRNA
(alt.)
mRNA 540 > 1042 protein synthesis initiation factor mRNA
(alt.)
signal 395 400 TATA box
BASE COUNT 602 a 422 c 369 g 763 t
ORIGIN 3 bp upstream of SmaI site.
1 cccgggaatt ttagattcag gtaggaaatt gattacataa atactgttac cctgaatcat
61 attcgacgat gtcgtctcac acggaaatat aattcatttc ttggttttcc aaaaaaattt
121 tcattttttt tcactttttt gtttcgtcct cctttttttt tttttttatt ttttttcctg
181 tgttcacctt tttttttttt ttttcagttt acaactttct gcattctttt ctgtgttttt
241 tttttttcgt ttttccattg ttcgttcgtt gcctgttttt ttgccctatt gttctcgagc
301 ctaaaaattt tttcctttcc tgctttcctt tcttcgttca aagtttccta ttccattgtt
361 ctcttcggta aactcattgt tgtcggaact cagatatatt caggtcaatt tactgtactt
421 caattgactt ttttcttgaa atttcaactt gccttttcaa cttgttcttc ttttttaatc
481 ttattctaca ctttagttcc cttaccttgt tcctaattat tgtctagcaa aaagaaaaca
541 tacacctatt tcattcacac actaaaacat gtctgacgaa gaacacacct ttgaaaatgc
601 tgacgctggt gcctccgcca cctacccaat gcaatgttct gccttgagaa agaatggttt
661 cgttgtcatc aaaggtagac catgtaagat tgtcgacatg tccacttcca agaccggtaa
721 gcacggtcac gccaaggtcc atttggttac ccttgatatc ttcactggta agaagttgga
781 agatttgtct ccatccactc acaacttgga agttccattt gtcaagagaa gcgaatacca
841 attgttggat attgatgatg gctacttgtc cttgatgacc atggacggtg aaaccaaaga
901 cgatgtcaag gcaccagaag gtgaattggg tgacagcatg caagctgctt tcgatgaagg
961 taaggatttg atggtcacta tcatttctgc catgggtgaa gaagctgcca tctccttcaa
1021 ggaagctcca agatctgatt agatatttta gatttctata gcttacaaaa tacgcgatgt
1081 ttttttgatc tagaaatctt atttatacat tttatataat attataataa tcataatcat
1141 aattattcat cctaactaac ttcgttctgt ttcagctacg aagttgcctt cattatcggt
1201 acgttcgtgt taattggtgt ttgataccat agtcacttat gacgacttac agaacttttt
1261 tttttgacgg atattttccc ctcaatatgc cgaggatgat aactttataa attgccttct
1321 ttcggttcct tttttctttt taatttagct gacgggaaca tgtgtatata ccccagaaga
1381 atttaaacag aaattttttt gagctatttt acgaaaatat taaaaactag ggctacagat
1441 atcagagaaa aaattactta tatgatgaca ataacttaat atcacagtca ctatcgttct
1501 cgcctttata tgcgcaatga cagcattcga cttttcctta gaagggatcc gatcaacgaa
1561 acttcaaaaa gagtagaagt cgaaatattt tttgcacgac gagggtgaca aaccgtgcta
1621 tgtccacaaa tgaggccctg ctttcccaag ataaattgcg taaacatata attgttacaa
1681 ggatccgatc aacgaaactt caaaaagagt agaagtcgaa atattttttg cacgacgagg
1741 gtgacaaacc gtgctatgtc cacaaatgag gccctgcttt cccaagataa attgcgtaaa
1801 catataattg ttacaaatcc aaggctctgt gatttcttca ccaagtgtag ttgtgatgta
1861 tagtgtgtag gataggtaat gaccggtaaa tcaagctggt aaaaagtcct aataacaagc
1921 ccgaatctgg aataatggta atattcccat gccaggtgga cagtgttgtc cttacacaaa
1981 ctcaagagat taaaggcaac ttggccgagt ggttaaggcg aaagattaga aatcttttgg
2041 gctttgcccg cgcaggttcg agtcctgcag ttgtcgttat tttttatttc cctttctttt
2101 cgtacataca caatgcacag tcttcaaagt aaaataccaa cgtatgtatc aagatc
//
LOCUS BSUBMRX 1402 bp ds-DNA BCT 17-JUL-1990
DEFINITION B.subtilis multidrug-resistant protein (BMR) gene, complete cds.
ACCESSION M33768
KEYWORDS multidrug-resistance.
SOURCE B.subtilis DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1402)
AUTHORS Neyfakh,A.A., Bidnenko,V.E. and Chen,L.B.
TITLE Efflux-mediated multidrug resistance in bacteria: Similarities and
dissimilarities with mammalian system
JOURNAL Cell (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.A.Neyfahh, 20-APR-1990.
FEATURES from to/span description
pept 195 1364 multidrug-resistance protein (BMR)
signal 130 135 -35 promoter
signal 155 160 -10 promoter
BASE COUNT 353 a 300 c 323 g 426 t
ORIGIN About 216 min. on B.subtilis map.
1 ggcggatatg gatggagaag tttatacaca tacacccgca gtcattcagg tcctgccaca
61 gcatatcgac atgcttgttc ccgcaaatga ataactgtca taaggcttta gaaagatttt
121 gcaaatccgt tgactctccc ctaggaggag gtcttacagt ataagggata cacacttgaa
181 aaaggggaag tcatatggag aagaaaaata ttaccttaac tatattatta accaatttat
241 ttattgcttt tttggggatc gggcttgtga ttccagtaac gccgaccatt atgaatgaat
301 tgcatttatc ggggaccgcg gtcggctata tggttgcctg cttcgctatt acacagctca
361 ttgtctcacc aatagccgga cgatgggttg atcgcttcgg gcgcaagatc atgatcgtaa
421 tcggcctgtt gttctttagt gtgtcggagt ttttgttcgg cattggaaaa acagttgaga
481 tgttatttat cacccgtatg ctgggcggta tcagcgcacc gttcattatg cccggggtca
541 cggcttttat tgcagatatc acgaccatta aaacacggcc aaaagcgctc ggttatatgt
601 cagccgctat ttcaacagga tttattatcg gccccggcat cgggggattt ttagcagaag
661 tccattcccg gctgcctttt ttctttgcgg cagcttttgc actgttagca gccattttat
721 caatcctcac gctgcgcgag ccggaacgaa accctgaaaa tcaggaaata aaaggacaga
781 agacaggctt taaacgaatt tttgccccca tgtatttcat agcttttctc attatcttaa
841 tttcgtcttt tggtttagca tcatttgaat ctttatttgc attattcgtg gatcataaat
901 tcggatttac ggccagcgac attgccatta tgattacagg aggagcgatt gttggcgcca
961 ttacgcaagt cgtcttattc gaccgcttca caagatggtt tggcgaaatt catttaattc
1021 ggtacagctt aattctctcg acgagtctgg tattcttgct gacaacggta cattcatatg
1081 ttgcgattct gctggtgaca gtcaccgtat ttgtcggatt tgatctcatg cggcctgcgg
1141 taacgactta cctgtcaaag attgcgggaa atgaacaggg gtttgccggc ggtatgaatt
1201 caatgtttac aagtatcggc aatgtattcg ggcctattat cggcggaatg ctgttcgata
1261 tagatgtaaa ctatcctttc tactttgcaa cggtcacctt agccataggg attgcactga
1321 ccattgcttg gaaagcgcct gcacatctta aagccagcac gtgataagaa gcgcattctt
1381 tgtgtactgc aaagaatgcg ct
//
LOCUS ECOMOR 1500 bp ds-DNA BCT 17-JUL-1990
DEFINITION E.coli morphology and auto-aggregation control protein (mor)
gene, complete cds.
ACCESSION M34102
KEYWORDS auto-aggregation control protein; morphology control protein.
SOURCE E.coli (strain K12) DNA, clones pSRW[220,226].
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1500)
AUTHORS Warne,S.R., Varley,J.M., Boulnois,G.J. and Norton,M.G.
TITLE Identification and characterization of a gene that controls colony
morphology and auto-aggregation in Escherichia coli K12
JOURNAL J. Gen. Microbiol. 136, 455-462 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 203 1120 morphology control protein (put.)
signal 133 138 -35 region
signal 157 162 -10 region
rpt 1135 1164 region of dyad symmetry
rpt 1167 1211 region of dyad symmetry
BASE COUNT 338 a 364 c 428 g 370 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcctgga gatccgcaaa agttcacgtt ggctttagtt attcgagttg agaaactctc
61 gaaacgggca gtgacttcaa gggttaaaag aggtgccgct ccgtttctgt gagcaattat
121 cagtcagaat gcttgatagg gataatcgtt cattgctatt ctacctatcg ccatgaacta
181 tcgtggcgat ggaggatgga taatgaatat tcgtgatctt gagtacctgg tggcattggc
241 tgaacaccgc cattttcggc gtgcggcaga ttcctgccac gttagccagc cgacgcttag
301 cgggcaaatt cgtaagctgg aagatgagct gggcgtgatg ttgctggagc ggaccagccg
361 taaagtgttg ttcacccagg cgggaatgct gctggtggat caggcgcgta ccgtgctgcg
421 tgaggtgaaa gtccttaaag agatggcaag ccagcagggc gagacgatgt ccggaccgct
481 gcacattggt ttgattccca cagttggacc gtacctgcta ccgcatatta tccctatgct
541 gcaccagacc tttccaaagc tggaaatgta tctgcatgaa gcacagaccc accagttact
601 ggcgcaactg gacagcggca aactcgattg cgtgatcctc gcgctggtga aagagagcga
661 agcattcatt gaagtgccgt tgtttgatga gccaatgttg ctggctatct atgaagatca
721 cccgtgggcg aaccgcgaat gcgtaccgat ggccgatctg gcaggggaaa aactgctgat
781 gctggaagat ggtcactgtt tgcgcgatca ggcaatgggt ttctgttttg aagccggggc
841 ggatgaagat acacacttcc gcgcgaccag cctggaaact ctgcgcaaca tggtggcggc
901 aggtagcggg atcactttac tgccagcgct ggctgtgccg ccggaggcca aacgcgatgg
961 ggttgtttat ctgccgtgca ttaagccgga accacgccgc actattggcc tggtttatcg
1021 tcctggctca ccgctgcgca gccgctatga gcagctggca gaggccatcc gcgcaagaat
1081 ggatggccat ttcgataaag ttttaaaaca ggcggtttaa accgtttaac gcagctaccc
1141 gatagcttcg ccatcgtcgg gtagttaaag gtggtgttga cgaagtactc aatagtgttg
1201 ccgccacctt tctgttccat aatcgcctga ccgatatgaa taatttcggc agcgcgctcg
1261 ccaaagcagt gaatacccag aatctctttt gtttcccgat ggaacaaaat tttcagcgtg
1321 cccacgttca tgccgacgat ttgtgcgcgt gccagatgtt taaactgggc gcggcccact
1381 tcatatggca ctttcattgc ggtcagctgc tgttcggttt tgcccacaga gctgatttcc
1441 gggatggtgt aaataccggt agggatatct tcaatcagat gtgcggtggc ttcgcctttt
//
LOCUS ECOTGMETY 800 bp ds-DNA BCT 17-JUL-1990
DEFINITION E.coli Met-tRNA-f2 (metY) gene, 5' end.
ACCESSION M28401
KEYWORDS transfer RNA-Met.
SOURCE E.coli (strain K12) DNA, clones p50L and p50.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 800)
AUTHORS Granston,A.E., Tompson,D.L. and Friedman,D.I.
TITLE Identification of a second promoter for the metY-nusA-infB operon
of Escherichia coli
JOURNAL J. Bacteriol. 172, 2336-2342 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.E.Granston, 27-SEP-1989.
FEATURES from to/span description
tRNA 682 758 Met-tRNA-f2 (metY)
anticdn 716 718 Met-tRNA anticodon atg
signal 560 565 -35 region promoter P1
signal 584 589 -10 region promoter P1
signal 638 643 -35 region promoter P0
signal 661 666 -10 region promoter P0
mRNA 596 > 800 P1-associated mRNA
BASE COUNT 210 a 191 c 190 g 209 t
ORIGIN 5 bp upstream of KpnI site.
1 ggtaccagtc acggcgcggc cagcgcgtcg ttttgaaata ggtcaggccg ccggtggtgt
61 tatgaaatgc gccacactga atagcggcaa taccttcggc caccagttgt ttgcggcagt
121 cgatcagacg tgcgttctcc gcgccgtatt ccatggcacg acgagggatc gcatcataat
181 cctcttcgtc tggctggccc aggtttgcag tatatgcata aggaaccgct cccttttgtc
241 gcatccacag cagtgcggca ctggtgtcca gaccgccaga aaaagcgata ccaatacgtt
301 gacctaccgg gagatgcttg agaatcgtcg tcataaaata acaccctgct taattaactg
361 atgatgagcc tggatttccg ctctcactga atttttatgc aaaataaatg agttttcatt
421 taatcatctt ttatcggaga caggaagagt ttagtgtgtt ttttgtaaaa taatgcgctt
481 aagggagagc aggagaaggc aaaagtattc aacaaatgaa agtgaactgg atattcattc
541 acatgattag caataaacgt tgacaaaatg tggcgtggat cactataatg cctgcagatt
601 ttacgtcccg tctcggtaca ccaaatccca gcagtatttg cattttttac ccaaaacgag
661 tagaatttgc cacgtttcag gcgcggggtg gagcagcctg gtagctcgtc gggctcataa
721 cccgaagatc gtcggttcaa atccggcccc cgcaaccact ttcccttaga gtcctttttc
781 aaatatactg tgaagacttc
//
LOCUS RAT5HT1A 2314 bp ds-DNA ROD 17-JUL-1990
DEFINITION Rat 5-hydroxytryptamine-1a receptor (5-HT-1a) gene, complete cds.
ACCESSION J05276
KEYWORDS 5-hydroxytryptamine-1a receptor.
SOURCE Rat DNA, clone D.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2314)
AUTHORS Albert,P.R., Zhou,Q.-Y., Van Tol,H.H.M., Bunzow,J.R. and Civelli,O.
TITLE Cloning, functional expression, and mRNA tissue distribution of the
rat 5-hydroxytryptamine-1a receptor gene
JOURNAL J. Biol. Chem. 265, 5825-5832 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 123 1391 5-hydroxytryptamine-1a receptor (15-HT-1a)
BASE COUNT 528 a 617 c 591 g 578 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcccctc cctctgaaac tccccagaaa agaaggaaga ccctaggggg aggggcgagg
61 gagatcccct tgcagttctt tcccccctcc cttcccccac cccagcggga gtgcaggcag
121 gcatggatgt gttcagtttt ggccagggca acaacaccac agcgtcccag gagccctttg
181 ggaccggcgg caacgttact agcatctccg acgtgacctt cagctaccaa gtgatcacct
241 ctctgttgct gggtactctc attttctgcg cggtgctcgg caatgcctgc gtggttgctg
301 ccatcgccct ggagcgctcc ctccagaatg tggccaacta tctcatcggc tccttggcgg
361 ttactgatct catggtatca gtgctggtgc tgcccatggc tgctctgtac caggtgctca
421 acaagtggac cctgggccag gtcacctgcg acctgtttat cgccctggat gtactgtgct
481 gcacctcgtc catcctgcac ctgtgcgcca tcgcgctaga caggtactgg gctatcaccg
541 accctataga ctatgtgaac aaaaggacgc cccggcgcgc cgctgcgctg atctcgctca
601 cttggctcat tggctttctc atctccatcc cgcccatgct gggctggcgc accccggaag
661 accgatcgga ccccgacgcg tgcaccatca gcaaggacca cggctacacc atctactcca
721 ctttcggcgc tttctatatc ccgctgttgc tcatgctggt tctctacggg cgcatcttca
781 gagccgcacg cttccgaatc cgcaagactg tcaggaaggt ggaaaagaag ggagcaggca
841 ccagcttagg aacttcgtcg gcaccgcccc ccaagaagag cctgaacgga cagccaggta
901 gtggggactg gaggcgctgt gctgagaaca gggcggtggg gactccgtgc actaatgggg
961 cagtgaggca gggtgacgac gaggccaccc tggaggtgat cgaggtgcac cgagtgggca
1021 actccaaaga gcaccttcct ctgcccagcg agtcaggatc taactcctat gcccccgcct
1081 gcttggagag aaaaaatgag cgcaatgctg aagcaaagcg caagatggcc ttggcccgtg
1141 aaaggaagac ggtgaagact ctgggcatca tcatgggcac cttcatcctc tgctggctgc
1201 cgtttttcat tgtggctctg gtcctgcctt tctgtgaaag cagctgccac atgcctgcgt
1261 tgttgggtgc cataattaac tggctaggct actccaactc cctgctcaat ccggttattt
1321 atgcttattt caacaaagac tttcaaaacg cttttaagaa gataatcaag tgcaagttct
1381 gccgccgatg atgatgatga tggtggtggt aatagtggtg actatggaga aggaggagga
1441 ggaggaagag gaggaggaag aagaggaaga ggaggaagag tgtagggttt acaggaccct
1501 ccccattcac tcagtaccct agccctggaa gcaacaccta agataacttg cttttcctag
1561 aacatgcagc ttctcagcac atggctttca gaccttgttc tctctatatc tcgtttcaag
1621 gggacagtac tctttgcaga aatcttagag aaccactagt atctgtgtga aaagttattg
1681 gcttggtctt tggccctagg atcagttttg gtagttgtaa taacatcctc cctattcaaa
1741 atcccaattc ttcacgatgg aattttaaac cttggtgatt caaggtagaa agccgaaaca
1801 gaaaatctaa actgtcctga catctgccag gcccactaga actgtcactc tctgccctac
1861 ttctggaact ttgaggatca agactgctct gtctctcact ccctcttgct ctagaaacat
1921 ctgttccctt tgggtgtcaa tctacactgt tgaactggtc tccatgctgg tctggaaaag
1981 cagttggaag taactactct gggttatgtg ctctggaaat ttttgaatag caaaagatcc
2041 cccccacccc tgcacaacac ccactttctg tatttttttt aaattttttt ttaatttttg
2101 aactcaaaat caagtggcaa aagtgtctgt tggagtggac ttataagtag gtctggctcg
2161 taggaagagc caggaaggtc tgtggatggt ttcttgtcct ggtgttaaca ctcaccaaag
2221 aaactgacaa tcagactatt tggaaggaag agttttattc ctacagtttg ttggcctttc
2281 tgtaagttct ggcaaggctc tgtgtcttct gcag
//
LOCUS RATFCGRE 1341 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Rat Fc-gamma receptor mRNA, complete cds.
ACCESSION M32062
KEYWORDS Fc-gamma receptor.
SOURCE Rat (strain Fisher 344) natural killer cell, cDNA to mRNA, clone
6a6.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1341)
AUTHORS Zeger,D.L., Hogarth,P.M. and Sears,D.W.
TITLE Characterization and expression of an Fc-gamma receptor cDNA cloned
from rat natural killer cells
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3425-3429 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.W. Sears, 12-FEB-1990.
FEATURES from to/span description
pept 50 853 Fc-gamma receptor
BASE COUNT 374 a 346 c 319 g 302 t
ORIGIN
1 ccagcaattt catctcctag acctcatcag actctgaccc agttcttgaa tgactttgga
61 gacccagatg tttcagaatg cacattctgg aagccaatgg ctactcccac cactgacaat
121 gttgctgctg tttgcttttg cagacaggca gacgggagat cttctgaagg ctgtggtgaa
181 acgtgatccc ccatggatcc aggtgctcaa ggacgacact gtgacgctga cgtgcgaagg
241 gacccacaat cctggaaact cttctaccca gtggttccac aaccagagct ccacctgggg
301 ccaggtccaa gccagctaca cgtttaaggc cacagtcaat gacagtggag aataccggtg
361 ccgaatggcg cacaccagcc tcagcgaccc catacatctg gaagtgattt ctgactggct
421 gctgctccag acccctcaac tggtgtttga ggaaggggaa accatcacat taaggtgcca
481 tagctggaag aacaaacagc tgactaaggt cttgttgttc cagaatggaa aacctgtgag
541 gtattattat caaagcagca atttttcaat ccccaaagcc aaccacagtc acagtgggaa
601 ctactactgc aaagcatatc taggaaggac aatgcatgtg tccaagcctg tcaccatcac
661 tgtccaaggt tcagcaaccg cgtccaccag ctctctagtg tggttccatg ccgctttctg
721 cctagtgatg tgcctcctgt ttgcagtgga caccggcctg tatttctgtg tacggagaaa
781 tcttcaaacc tcgggggagg actggaggaa atccctgtca gtcggaaagt acaaggctcc
841 acaggacaaa tgacatccca tcgtatggct aaacagcggc agcttctttt cagccacacc
901 gccgcttcat ctacagcctt cctttgaaag caacttacca ccaggctgga tatttggtcc
961 tgcaatcaca actgctagga ggctgaggca ggaggatcac caaaggctgc ccgggtttta
1021 gagaagagag agtgcaagtc tatcctggat aacccaataa gaccctgggt ttagatggct
1081 tatcaggaaa gagcacccat tgctaagctc acagacaaga ccacagattg atccctaagg
1141 acacagggca aagggggaga acaaattcct gaaagttgtc ctctgacctc cacacgtcca
1201 ccatggcaga tgcacacaat aaacaaatta aaatgtaata tatatatttt aaacaagagg
1261 tggggaaagg ctgggggttg gtggcacagt ttttatccag gacacatgat gctctggatt
1321 ttgacctcct ataataaaac g
//
LOCUS YEPYOPYER 1527 bp ds-DNA BCT 17-JUL-1990
DEFINITION Y.pestis virulence determinant (yopE) and yopE regulator (yerA)
genes, complete cds.
ACCESSION M34279
KEYWORDS virulence detrminant; yopE regulator.
SOURCE Y.pestis DNA, clone pYV019.
ORGANISM Yersinia pestis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1527)
AUTHORS Forsberg,A. and Wolf-Watz,H.
TITLE Genetic analysis of the yopE region of Yersinia spp.:
Identification of a novel conserved locus, yerA, regulating yopE
expression
JOURNAL J. Bacteriol. 172, 1547-1555 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 476 84 (c) yopE regulator (yerA)
pept 670 1329 virulence determinant (yopE)
BASE COUNT 426 a 329 c 339 g 433 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccgaca ttactaagtg agcgttgtat ctaatcctgg gggcaggtca ttccgtataa
61 taaggcaaca accaaaaatc tactcaacta aatgaccgtg gtggtgagat tagtgatgag
121 gtttgtagcc gttcagcccc ctgcaccagc atctcaagct gagtatatag tgagttatta
181 tccaggctgt tcaatggttg tcgattccat aacactgggt gccccccaac ctcgtcccag
241 gataagatgg gttttaatat atcttgactg aatatattat ggctaagtaa ggtttccttt
301 tcatcattat tgtcaagaga aggtagggta aacattaata tttgcccgac aggatgctct
361 gttatatggc aggcgaattc cccaactttg acaccgataa ccggttcaat agtatctgga
421 atagacaacg aaagttgttg aaataattga gtgatagctt gttcaaatga atacattatg
481 atctcataat agttagataa aatatcaact taaccaaagc actctcggca gaccatcaat
541 tttagcctat aatttttagt ttttgttttg tctaatataa caacaaaaac agcagcgatt
601 ttttatatag ccatcggcta ttttcccact aagataacct tgttttaata gccaaggtaa
661 taaatagtca tgaaaatatc atcatttatt tctacatcac tgcccctgcc gacatctgtg
721 tcaggatcta gcagcgtagg agaaatgtct gggcgctcag tctcacagca aacaagtgat
781 caatatgcaa acaatctggc cgggcgcact gaaagccctc agggttccag cttagccagc
841 cgtatcattg agaggttatc atcagtggcc cactctgtga ttgggtttat ccaacgcatg
901 ttctcggagg ggagccataa accggtggtg acaccagcac ccacacctgc acaaatgcca
961 agtcctacgt ctttcagtga cagtatcaag caacttgctg ctgagacgct gccaaaatac
1021 atgcagcagt tgaatagctt ggatgcagag atgctgcaga aaaatcatga tcagttcgct
1081 acgggcagcg gccctcttcg tggcagtatc actcaatgcc aagggctgat gcagttttgt
1141 ggtggggaat tgcaagctga ggccagtgcc atcttaaaca cgcctgtttg tggtattccc
1201 ttctcgcagt ggggaactat tggtggggcg gccagcgcgt acgtcgccag tggcgttgat
1261 ctaacgcagg cagcaaatga gatcaaaggg ctggcgcaac agatgcagaa attactgtca
1321 ttgatgtgat atggataaaa acaaggggat agtgtttccc cctttttcta tcaatattgc
1381 gaatatcttc gtccctgatc tttcaggggc gaatcgtttt ttagcatgct cattgttaga
1441 atttctgact tatctctctt ctgtattact actcatgctc tggaaaatcc tgaacatgta
1501 tatctatgga ttgatgcagc actcgag
//
LOCUS YEPYOPYER1 1762 bp ds-DNA BCT 17-JUL-1990
DEFINITION Y.enterocolitica virulence determinant (yopE) and yopE regulator
(yerA) genes, complete cds.
ACCESSION M34278
KEYWORDS virulence determinant; yopE regulator.
SEGMENT 1 of 2
SOURCE Y.enterocolitica DNA, clone pYV8081.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1762)
AUTHORS Forsberg,A. and Wolf-Watz,H.
TITLE Genetic analysis of the yopE region of Yersinia spp.:
Identification of a novel conserved locus, yerA, regulating yopE
expression
JOURNAL J. Bacteriol. 172, 1547-1555 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 609 217 (c) yopE regulator (yerA)
pept 803 1462 virulence determinant (yopE)
rpt 1468 > 1762 insertion sequence
rpt 1464 1467 5' insertion target sequence
BASE COUNT 497 a 395 c 382 g 488 t
ORIGIN
1 ccggtactcc tcattaccaa tgaggaggcg gtgattccac caagtgaatg ggcttaatac
61 aagtctttta cattaaaaat tgttaaaaaa cgggagataa aagtcaacac tccaacttgg
121 tgttaacttt tactgagcga aatctgatat tgctggcacc acaaatttat aggttatcgc
181 tatttccgta taataaggca acaaccaaaa atctactcaa ctaaatgacc gtggtggtga
241 gattagtgat gaggtttgta gccgttcagc cccctgtacc agcatttcaa gctgagtata
301 tagcgagtta ttatccaggt tgttcaatgg ttggcgattc cataacactg ggtgcccccc
361 gacctcgtcc caagataaga tgggttttaa tatatcttga ctgaatatat tatggctaag
421 taaggtttcc ttttcattat tattgtcaag agaaggcagg gtaaacatta atatttgccc
481 gacaggatgc tctgttatat ggcaggcgaa ttccccaact ttgacaccga taaccggttc
541 aatagtatct ggaatagaca gcgaaagttg ttgaaataat tgagtgatag cttgttcaaa
601 tgaatacatt atgatctcct aatagttaga taaaatatca agttaaccaa agcactctcg
661 gcagaccatc aatttgagcc tataattttt agtttttatt ttgtctaata taacaacaaa
721 aacagcagcg attttttata tagccaccgg ctattttccc actaagataa ccttgtttta
781 atagccaagg gaataaatag tcatgaaaat atcatcattt atttctacat cactgcccct
841 gccgacatca gtgtcaggat ctagcagcgt aggagaaatg tctgggcgct cagtctcaca
901 gcaaaaaagt gagcaatatg caaacaatct ggctgggcgc actgaaagcc ctcagggttc
961 cagcttagcc agccgtatca ctgagaagtt atcatcaatg gcccactctg cgattgaatt
1021 tatcaaacgc atgttctcgg aggggagcca taaaccggtg gtgacaccag cacccacgcc
1081 tgcacaaatg ccaagtccta cgtctttcag tgatagtatc aagcaacttg ctgctgagac
1141 gctgccaaaa tacatgcagc agttgagtag cttggatgca gagacgctgc agaaaaatca
1201 tgatcagttc gccacgggca gcggccctct tcgtggcagt atcactcaat gccaagggct
1261 gatgcagttt tgtggtgggg aattgcaagc tgaggccagt gccatcttaa acacgcctgt
1321 ttgtggtatt cccttctcgc agtggggaac tattggtggg gcggccagcg cgtacgtcgc
1381 cagtggcgtt gatctaacgc aggcagcaaa tgagctcaaa gggctggcgc aacagatgca
1441 tcaattactg tcattgatgt gatatggtga ctagtcctgc tataggttga cacttttcag
1501 ccttaaaacg cctgatgaac ttcatcgggc gttttgtatt ttaacgacag atgcggtcgc
1561 cgcgtgttgt agatttccac tgactccctg accatctttc ttgcctggcc aatatcttcc
1621 ggcttaacca gcagatattc catttttaat atcccattga cccgctccgc cagcgcattc
1681 tgataacaat catatccatc cgtcatcgaa cagatgacac cgtgacgttg atgtaacgcc
1741 tgatattctg ttgaacagta ct
//
LOCUS YEPYOPYER2 546 bp ds-DNA BCT 17-JUL-1990
DEFINITION Y.enterocolitica DNA with insertion element, 3' end.
ACCESSION M34280
KEYWORDS insertion element.
SEGMENT 2 of 2
SOURCE Y.enterocolitica DNA, clone pYV8081.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 546)
AUTHORS Forsberg,A. and Wolf-Watz,H.
TITLE Genetic analysis of the yopE region of Yersinia spp.:
Identification of a novel conserved locus, yerA, regulating yopE
expression
JOURNAL J. Bacteriol. 172, 1547-1555 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt < 1 242 insertion element
rpt 243 246 3' insertion target sequence
BASE COUNT 144 a 119 c 109 g 174 t
ORIGIN About 650 bp after segment 1.
1 gatccgtgac ccagaaagac aaatccgtca ttaacttgcg tagccacttc aggacggtag
61 aacggccttg aataccgttg cggtcttgcg ctgtcggtaa gtcatttccc ttttttcgac
121 ctgttcgaca acggccaatt taaaggatag aggataatgc cgttgcgtgc gtttaacata
181 ttggttcatc acattttcct aagtacatga gttaaatgtg tcaacgctat ttaggacggg
241 tcatggataa aaacaagggg atagtgtttc cccctttttc tctcaatatt gcgaatatct
301 tcgtccctga tctttcaggg gcgaatcgtt ttttagcatg ctcattgtta gaatttctga
361 cttatctctc ttctgtatta ctactcatgc tctggaaaat cctgggcgtc tatattctat
421 ggattgatgc aacactcgag aaatcaaaat atcattgcta agcgttatat agtatatacc
481 gtgcttttta tatgaaaacg gcgcatgtca gagcaaatcc agttacactc agcccctaac
541 tctaga
//
LOCUS HUMCDR34 2412 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene,
complete cds.
ACCESSION M31423
KEYWORDS cerebellar-degeneration-related antigen.
SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2412)
AUTHORS Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A.,
Chaganti,R.S.K., Posner,J.B. and Old,L.J.
TITLE Cerebellar degeneration-related antigen: A highly conserved
neuroectodermal marker mapped to chromosomes X in human and mouse
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
Y.-T.Chen, 17-JAN-1990
FEATURES from to/span description
pept 503 1174 cerebellar-degeneration-related antigen
(CDR34)
BASE COUNT 743 a 334 c 669 g 666 t
ORIGIN
1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct
61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat
121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg
181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga
241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg
301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag
361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg
421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga
481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt
541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt
601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga
661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat
721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt
781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga
841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct
901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt
961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga
1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt
1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat
1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg
1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa
1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac
1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa
1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt
1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact
1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat
1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct
1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga
1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct
1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac
1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg
1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata
1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa
1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc
2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt
2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta
2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga
2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta
2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat
2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt
2401 gagatgctgc ag
//
LOCUS RATHGF 2485 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Rat hepatocyte growth factor mRNA, complete cds.
ACCESSION M32987
KEYWORDS hepatocyte growth factor.
SOURCE Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2485)
AUTHORS Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M.,
Shimizu,S. and Nakamura,T.
TITLE Deduced primary structure of rat hepatocyte growth factor and
expression of the mRNA in rat tissues
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Nakamura, 26-MAR-1990.
FEATURES from to/span description
pept 143 2329 hepatocyte growth factor
BASE COUNT 750 a 533 c 577 g 625 t
ORIGIN
1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc
61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga
121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct
181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca
241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa
301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa
361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc
421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt
481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa
541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg
601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct
661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag
721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat
781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg
841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga
901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta
961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc
1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg
1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca
1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa
1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat
1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta
1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg
1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc
1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg
1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga
1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca
1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa
1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc
1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat
1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct
1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct
1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac
1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg
2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt
2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga
2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt
2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc
2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa
2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa
2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca
2461 ttaatatatg tggcgttttc tgttg
//
LOCUS MUSIMPD 1707 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse IMP dehydrogenase mRNA, complete cds.
ACCESSION M33934
KEYWORDS IMP dehydrogenase.
SOURCE Mouse, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1707)
AUTHORS Tiedemann,A.A. and Smith,J.M.
TITLE Isolation and DNA sequence of a cDNA encoding mouse IMP
dehydrogenase
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.M.Smith 27-APR-1990.
Author address:
Dr. J.M. Smith
Seattle Biomedical Research Institute
4 Nickerson St.
Seattle, WA 98109
FEATURES from to/span description
pept 67 1611 IMP dehydrogenase (EC 1.2.1.14)
BASE COUNT 457 a 408 c 451 g 391 t
ORIGIN
1 cctctgcggc gcggtcctcg gagcggcgcg gttctcggag ccacgcgtct gtcttcctcc
61 gtggtcatgg cggactacct gattagcgga ggcacctctt acgtgccgga cgacgggctc
121 acagcgcagc agctcttcaa ctgcggggac ggcctcacct acaatgattt tctcattctt
181 cctgggtata tcgacttcac tgcagatcag gtggacttga cgtctgctct aactaagaag
241 attacactaa agaccccatt ggtttcctca cccatggaca ctgtcacaga ggctggaatg
301 gccatcgcga tggcgcttac aggaggtatt ggtttcatcc accacaactg cacacctgaa
361 ttccaggcca atgaagttcg gaaagtgaag aaatacgaac agggattcat cactgacccc
421 gtggtcctta gccccaagga tcgtgtacgc gatgtttttg aggccaaagc caggcatggc
481 ttctgtggta tccccatcac agatacaggc cggatgggga gtcgattggt gggcatcatc
541 tcctcaaggg acattgattt cctcaaggag gaagagcatg accggttctt ggaagagatc
601 atgactaaga gggaagattt ggtggtcgcc cctgccggcg tcactctgaa agaggcaaat
661 gagattctgc agcgaagtaa aaagggaaag ttgcccattg tgaatgaaaa tgatgagctg
721 gtagccatca ttgcccggac agacctaaag aagaatcgtg attaccccct ggcctccaaa
781 gatgccaaga agcaactgct gtgtggggca gccattggca ctcatgagga tgacaagtat
841 aggctggact tactggccct tgctggtgtg gatgtagtgg ttttggactc ttcccaggga
901 aactccatct tccaaatcaa tatgatcaaa tacatcaagg agaagtatcc cagtctacag
961 gtcattggag gcaatgtagt cactgctgcg caagccaaga acctcataga tgcaggtgta
1021 gatgctttgc gagtcggcat gggaagtggt tccatctgca tcacccagga agtgttggcc
1081 tgtgggcggc cccaagccac agcagtgtac aaggtctctg agtatgcccg tcgctttggt
1141 gttcctgtta ttgctgatgg aggaatccaa aatgtgggtc atattgccaa agctttggct
1201 cttggggctt ccacagtcat gatgggctcc ctcctggctg ccaccactga ggcccctggc
1261 gagtacttct tctcagatgg gatccggctg aagaaatacc gaggtatggg ttctcttgat
1321 gccatggaca aacatctcag cagccagaac cgatacttca gtgaagctga caaaatcaaa
1381 gtggcccaag gagtttcagg ggcagtgcag gacaaggggt ctatccacaa gttcgttcct
1441 tacctgattg ctggcatcca gcattcctgt caagacattg gtgccaagag tttaacccaa
1501 gtcagagcca tgacgtactc gggggagctt aaatttgaga agaggacatc ctctgctcag
1561 gtggaaggtg gcgtccacag cctccattcg tacgagaaac ggcttttctg aaaacagatc
1621 cagtatatgc cttgaatttt tcaataaaag tttgggaaaa aaaaagtgaa aaaaaaaaaa
1681 aaaaaaaaaa aaaaaaaaaa aaaaaaa
//
LOCUS NEUPSAS 3681 bp ds-DNA PLN 17-JUL-1990
DEFINITION N.crassa arginine-specific carbamoyl phosphate synthetase (arg-2)
small subunit gene, complete cds.
ACCESSION J05512
KEYWORDS carbamoyl phosphate synthetase.
SOURCE N.crassa (strain 74-OR23-IVA)asexual phase DNA, clone pAR231, and
cDNA to mRNA, clone pAB1.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 3681)
AUTHORS Orbach,M.J., Sachs,M.S. and Yanofsky,C.
TITLE The Neurospora crassa arg-2 locus: Structure and expression of the
gene encoding the small subunit of arginine-specific carbamoyl
phosphate synthetase
JOURNAL J. Biol. Chem. 265, 10981-10987 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.S.Sachs, 07-MAY-1990.
FEATURES from to/span description
pept 1451 1630 arginine-specific carbamoyl phosphate
synthetase small subunit, exon 1 (arg-2) (EC
6.4.5.5)
1687 2868 arginine-specific carbamoyl phosphate
synthetase small subunit, exon 2
pept 870 902 ORF, exon 1
1270 1282 ORF, exon 2
1359 1387 ORF, exon 3
pre-msg 752 3082 arg-2 mRNA and introns
pre-msg 809 3082 arg-2 mRNA and introns (alt.)
IVS 1631 1686 arg-2 intron A
IVS 903 1269 ORF intron A
IVS 1283 1358 ORF intron B
signal 161 166 TGACTC box
signal 568 573 TGACTC box
signal 1008 1013 TGACTC box
signal 1307 1312 TGACTC box
BASE COUNT 760 a 1093 c 863 g 965 t
ORIGIN
1 ggatccagac gctatatttg attgttccct cttctgtcag cgtgcggcct ccgttgtccg
61 tgcagccgat tacgacggcc ggctctggcg caacctgggt ttgggagctt cagaagcttg
121 aaagcgaggc gaatcgcgac tgtacaagtc cccgcaactt gagtcaccga aagcgcgggg
181 tatccaccat ctttatctgc ggcggtaggt aggcggggac tttgcggggg ctttgcggga
241 cccgaccact ccaactccga gattcgttgg gggtgccccg ggcaggagat gccgagctgg
301 agtcgcgttg tttgtgaatg tcgtggctgg ggttgtgggc ataggttgcg atgtgtagtg
361 tacgtagtac atagtaggta tgttgaacac cggatgggtg attccagttg tgaggctccg
421 ggactagtag gggtggtaac cgtgatggtc aaagggcatg tggaacttgc agcgtcccaa
481 cgaagtccag gtccccagtc atcccaacgc cgcgatcaat cacccctcgg tccccgtaaa
541 tcggcctttg ggactgccta gcagcgggag tcaccgagtc ctggaccaat caggtcggcg
601 ctgaccctgt cactgcggca gaaaaaagac aaagagctca aaaaattcta ccagatccaa
661 tcaaatgtcg ccctttttct atttgcctca cccactttcg caccctcttt tatctccaag
721 tttctcccct cgacactaca actcactccg tcatcaactc ttgcccatct acaaaacaca
781 caataacccc aatcgcttcc cccgactttc ggactctttc gtccaacctt ctttcttcag
841 ctgaacttgt cttgtcgcaa tctgccctta tgaacgggcg cccgtcagtc ttcacctctc
901 aggtacgata acgcgatagc ccctccatcg tgctggtatc cagccaccat ccacccagtc
961 catcaaagaa gttcccccgg atgagcaatg aagcccccat accagcctga ctcgcgatgc
1021 tgtatcggag tagttgggac gacccgccag taacccacca acctgggtgc ttggcatcat
1081 gccgctctga gttggttttg ggagatgtgg ccagcggtga tccccaaagt tcagctcttc
1141 acggattccg attgcatatc tctcaagatc gcgcagacag ccatcctggg ggagacttct
1201 tctcctcgct tctgccgctt gccgtctgtt tgtagtgtct gatcaaaact aattctgttg
1261 cctttgtagg attacctctc aggtccgcat ccccgccgac tcccgttgac tcaccctttt
1321 ccggctggaa tactgacgat gaattttgtg tgttacagac catctgtgga gagcccttaa
1381 cgcataagag cctctcatca cccagcagcc gtaccaatca ccaccgcacc ccatcaccat
1441 tcaagtcaag atgttctctc gcttggccgc tcgtctcccc aaggcttctg ccctcaatgg
1501 cgtcgccgcc cgtcaggttc gcaacctgag ccagcccgcc attaccggca gcaagggcag
1561 gaacatgccc gcccgtgagc cgcgcactac tgccgctgcc accggcgccg aggccacctt
1621 cactatcagg gtgcgtacct acaactagtc ttctctcgtg gctaaaccta acattttctc
1681 gctcaggatg gccccgtttt ccagggtacc gccttcggtg ccaacaccaa catttctggt
1741 gaagccgttt tcaccacctc ccttgttggt taccccgagt cgatgaccga tccctcgtac
1801 cgcggccaga ttttggtctt cacccagccc ttgattggca actacggtgt cccttcgaat
1861 gagcgtgacg agttcaacct cctaagtact tcgagtctcc ccacatccag tgcgccggta
1921 tcgttgtctc cgatgtcgct acccagtaca gccactggac tgctgttcag agcttgggcg
1981 agtggtgcgc cagcgagggc attcccgcca ctctccggtg ttgacactcg tgcgattgtc
2041 acctacctcc gtgagcaggg ttcctctctc gccaggatct ccattggcga cgagtacgac
2101 gccgatgagg atgagggctt cattgaccct ggccagatca accttgtcaa gcgcgtgtcc
2161 accaaggctc cctttgtggt taccaacccc aacgccaagt tccacgtcgc tctgatcgac
2221 tgcggtgtca aggagaacat ccttcgtagc ttggtcagcc gtggcgcctc cgtcaccgtc
2281 ttcccctaca attaccccat ccacaaggtt gccgagaact tcgatggtgt cttcatttcc
2341 aacggccctg gtgatcccac ccactgccag gagactgttt acaacctcgc caagcttatg
2401 gagacttccc ctattcccat catgggtatc tgccttggtc accagcttct tgctctcgct
2461 gttggtgcca agaccatcaa gctaagtatg gtaatccgtg cccacaacat cccggctctt
2521 gacttgacca ctggtcagtg ccacatcacc tcgcagaacc acggttacgc cgtcgatatc
2581 agcaccctcc ctagcgactt caaggagtat ttcgtcaacc ttaacgacgg ctccaacgag
2641 ggtatgatgc acaagacccg ccccatcttc tctacccagt tccaccccga ggctaagggt
2701 ggtcccatgg actcttccta cctctttgac aagtacatgg agaatgtcga gcttttcaag
2761 agcaactccc aggtctaccg tgacaacagg cctactcagt tcatgattga cattctcagc
2821 aaggagcgtg tcggcgttga gcccactcct ctttctaacg ctgcttaacg gacgagaatg
2881 tctttgcatt ttcttagtga acggtgtttg tcattggtct tgggttattg ttttccagca
2941 caaaggatat gggttttgta aaagggactt tcttgggcga cgtcctctcg ttattaatgg
3001 tttagggtgt atatgatatc ttttgttact acttttgaat gcagaatcag ataagtttag
3061 tcggcattag gctgatatgg cgcagagagg acaatatcta gttgatacat acccatgtgc
3121 tatcacgtct aaacgtcttc cagtcgttgt attttaccca ttcctggtga ttgccgtatg
3181 tcttccagtc cagtacctcc cctaattcgg tgcccttcaa cattcagggt tatgttgaca
3241 gcgtgctgta taagcatggt tgccggggaa catggtgtca ttgactggta agcagggcag
3301 cctctggttc tgtgacacct ggttttccct ggtcttgatt tcaaactatt atcaaacaaa
3361 tcagcgtcgt cttagtagat ggcaggtatc ttagaacaga aagtagttat tcttcgtcgg
3421 atacgactcc aaaatcatat acgaacaagt ggatgcttta tactatgctt ctttttactg
3481 taatcttggt ctcgctcttc gcttagcctt cttcacctgg taaccggtat aggtaccttc
3541 gcaccctcaa ttggaaggat gcatcaaccc ctccatgcgc tgcttatcct gctatctatc
3601 catacatttc tatgttttta cagtgtagaa tatttcttgt gtatcagggt ccgtcccgcg
3661 atgctacccc ctgtcctgca g
//
LOCUS HUMOSTRO 1425 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human osteopontin mRNA, complete cds.
ACCESSION J04765
KEYWORDS osteopontin; phosphoprotein.
SOURCE Human adult osteoblast, cDNA to mRNA, clone HOP-10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1425)
AUTHORS Young,M.F., Fisher,L.W., McBride,O.W. and Termine,J.D.
TITLE cDNA cloning, chromosomal location and RFLP analysis of human
osteopontin
JOURNAL Genomics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Young, 04-MAY-1990.
FEATURES from to/span description
pept 68 1012 osteopontin precursor
/hgml_locus_uid="LK0223T" /nomgen="OPN"
/map="unassigned"
sigp 68 115 osteopontin signal peptide
matp 176 1012 osteopontin
mRNA < 1 1425 osteopontin mRNA
BASE COUNT 449 a 293 c 296 g 387 t
ORIGIN Chromosome 4q13.
1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact
61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc
121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga
181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca
241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc
301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga
361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca
421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac
481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg
541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga
601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt
661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga
721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca
781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc
841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt
901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct
961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa
1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa
1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg
1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa
1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg
1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact
1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag
1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag
//
LOCUS MUS25OAS 1412 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse 2'-5' oligo A synthetase mRNA, complete cds.
ACCESSION M33863
KEYWORDS 2'-5' oligo A synthetase.
SOURCE Mouse cell line L-929, cDNA to mRNA, clone L3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1412)
AUTHORS Coccia,E.M., Nissim,A. and Chebat,J.
TITLE A full-length murine 2-5A synthetase cDNA transfected in NIH-3T3
cells impairs EMCV but not VSV replication
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.M.Coccia, 26-APR-1990.
FEATURES from to/span description
pept 36 1139 2'-5' oligo A synthetase
signal 1389 1394 poly-A signal
BASE COUNT 340 a 371 c 390 g 311 t
ORIGIN
1 ccaggctggg agacccagga agctccagac ttagcatgga gcacggactc aggagcatcc
61 cagcctggac gctggacaag ttcatagagg attacctcct tcccgacacc acctttggtg
121 ctgatgtcaa atcagccgtc aatgtcgtgt gtgatttcct gaaggagaga tgcttccaag
181 gtgctgccca cccagtgagg gtctccaagg tggtgaaggg tggctcctca ggcaaaggca
241 ccacactcaa gggcaggtca gacgctgacc tggtggtgtt ccttaacaat ctcaccagct
301 ttgaggatca gttaaaccga cggggagagt tcatcaagga aattaagaaa cagctgtacg
361 aggttcagca tgagagacgt tttagagtca agtttgaggt ccagagttca tggtggccca
421 acgcccggtc tctgagcttc aagctgagcg ccccccatct gcatcaggag gtggagtttg
481 atgtgctgcc agcctttgat gtcctgggtc atgttaatac ttccagcaag cctgatccca
541 gaatctatgc catcctcatc gaggaatgta cctccctggg gaaggatggc gagttctcta
601 cctgcttcac ggagctccag cggaacttcc tgaagcagcg cccaaccaag ctgaagagtc
661 tcatccgcct ggtcaagcac tggtaccaac tgtgtaagga gaagctgggg aagccattgc
721 ctccacagta cgccctagag ttgctcactg tctttgcctg ggaacaaggg aatggatgtt
781 atgagttcaa cacagcccag ggcttccgga ccgtcttgga actggtcatc aattatcagc
841 atcttcgaat ctactggaca aagtattatg actttcaaca ccaggaggtc tccaaatacc
901 tgcacagaca gctcagaaaa gccaggcctg tgatcctgga cccagctgac ccaacaggga
961 atgtggccgg tgggaaccca gagggctgga ggcggttggc tgaagaggct gatgtgtggc
1021 tatggtaccc atgttttatt aaaaaggatg gttcccgagt gagctcctgg gatgtgccga
1081 cggtggttcc tgtacctttt gagcaggtag aagagaactg gacatgtatc ctgctgtgag
1141 cacagcagca cctgcccagg agactgctgg tcaggggcat ttgctgctct gctgcaggcc
1201 catgacccag tgagggaggg ccccacctgg catcagactc cgtgcttctg atgcctgcca
1261 gccatgtttg actcctgtcc aatcacagcc agccttcctc aacagattca gaaggagagg
1321 aaagaacaca cgcttggtgt ccatctgtcc acctgttgga aggttctgtc tgacaaagtc
1381 tgatcaacaa taaaccacag caggtgccgt ca
//
LOCUS ACARGA 1212 bp ds-DNA INV 17-JUL-1990
DEFINITION A.castellanii 5S RNA.
ACCESSION M34003
KEYWORDS 5S RNA.
SOURCE A.castellanii (strain Neff) trophozoite DNA, clone pAc5S.1.
ORGANISM Acanthamoeba castellanii
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina;
Acanthamoebidae.
REFERENCE 1 (bases 1 to 1212)
AUTHORS Zwick,M.G., Wiggs,M. and Paule,M.R.
TITLE Cloning, sequencing and genomic organization of a 5S RNA gene from
the eukaryotic protist Acanthamoeba castellanii
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Paule, 04-MAY-1990.
Author address: M.R.Paule
Colorado State University, Dept. Biochemistry,
Fort Collins, CO 80523
FEATURES from to/span description
RNA 489 610 5S RNA precursor
RNA 489 607 5S RNA
BASE COUNT 195 a 374 c 329 g 314 t
ORIGIN
1 ctgcaggaga ttcgtcgggc tcgtgccgcg atcgtcttga tgatcgagac cgacggcccg
61 ttcctcactc cgcgcacctt gcgccttgcc cgcgccgcaa cgagccgtct tacctcaccc
121 acgtcctaca gagtacttac atcctccaat gtgttgtgtg cgtgtcatgt gcccaatgta
181 tgtggtgtgt gcggtgtgtg cggtgtgtga ggtgtgtggt gtgcgcgtgt catgtctgtt
241 acacgttctg ttgaccaaca ttttttatgg tgtgtacagc ggtcgccgac tgcaccggac
301 gatcggtaga ggaggtggcc ggtgaaacca cgcagaccgc ggtgcgcttc ttcggtctgg
361 accaaatccg taaggagtga atgtagcaca cccctcccct ccatgtcatt gaagcccaag
421 ttagtgtgca atttaaatga tggtgtgacc aagagggatc acatgggcga agtccctggt
481 aggctgatgg atacggccat actgcgcaga aagcaccgct tcccatccga acagcgaagt
541 taagctgcgc caggcggtgt tagtactggg gtgggcgacc acccgggaat ccaccgtgcc
601 gtatcctcct ttttgcttct tcactctcgc cactattgct tcaggtcgcc tcatcgcctg
661 ccacagctga ctacaaactg ggtgagccaa acctgccttt tgcactggtc cagtgtacaa
721 ggctggaaca actttgtttc acacttgctg aggtcaatca atccagcctt gttggttgag
781 gccacaggag ggatctgcca cctttcacac catctcgtcg agaaggctgc ttgacgcttc
841 ttcgctgtcc ttcggggctg atgtctcgac tgtgtgtccg tcttcatcgc gattgtcact
901 tgcgttgttg ttgttgcggc tttcgtccct gtcctcatcg ccgtcgtcgt cttcatcgct
961 gttgtcgcgc gactcactcc gcaaaccgtc tggccgtgtg ctgctcctcc tcttcatcgt
1021 tgctgtcatc tcggtcgctg cggccacgcc tcgaagaaga cagcccgccc tcttcggccg
1081 cctcggtccc gcccctcgca aagcggatct cgcctcgtcg gctgccgctg gccgtcgctc
1141 cgcctggcgt cgttgtcttc cctcgcatcg tcctcaaact tgtcgtgcgt ccacttgtcg
1201 gcaaatgtcg ac
//
LOCUS ENTOMPX 664 bp ds-DNA BCT 17-JUL-1990
DEFINITION E.cloacae outer membrane protein X (OmpX) gene, complete cds.
ACCESSION M33878
KEYWORDS outer membrane protein X.
SOURCE E.cloacae (strain 2249-1) DNA.
ORGANISM Enterobacter cloacae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 664)
AUTHORS Stoorvogel,J., van Bussel,M.J.A.W.M., Tommassen,J. and
van de Klundert,J.A.M.
TITLE Molecular characterization of an Enterobacter cloacae outer
membrane protein OmpX
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Stoorvogel, 26-APR-1990.
Author address: J.Stoorvogel
Dept of Medical Microbiol, Univ Hospital,
P.O.Box 9600, 2300 RC Leiden,
THE NETHERLANDS
FEATURES from to/span description
pept 83 601 outer membrane protein X precursor
sigp 83 151 outer membrane protein X signal peptide
matp 152 598 outer membrane protein X
signal 18 25 -35 region
signal 43 48 -10 region
binding 74 77 ribosome binding site
signal 620 649 transcription termination signal
BASE COUNT 171 a 156 c 162 g 175 t
ORIGIN
1 atcgtgaatg acaaagagtt tcacagattt ttttgatata tttaaaactt acggacttac
61 ttgaagcaca tttgaggtgg ttatgaaaaa aattgcatgt ctttcagcac tggcagctgt
121 tctggctgtt tccgcaggta ccgctgtagc ggcaacttct actgtaactg gtggttacgc
181 tcagagcgat atgcagggcg tgatgaacaa aaccaacggt ttcaacctga agtaccgtta
241 cgagcaagac aacaacccgc tgggtgtgat cggttctttc acttacaccg agaaagatcg
301 tactgaaaat ggctcttaca ataaaggtca gtactacggc atcaccgcgg gtcctgctta
361 ccgcctgaat gactgggcaa gcatctacgg tgttgtaggt gttggctacg gtaaattcca
421 gcagaccgaa aaccagggtc tgaaccgtac tgcaagcaac agcgactacg gtttctccta
481 tggcgcaggt atgcagttca acccaatcga aaacgttgct ctggacttct cctatgagca
541 gagccgtatc cgcaacgttg acgttggcac ctggatcgcg ggcgtaggtt accgcttcta
601 atcacttcgg tgagccagta aaaaatccgc ccaatgtggc ggattttttt tgtttgtacc
661 ctct
//
LOCUS NEUMPPX 2038 bp ss-mRNA PLN 17-JUL-1990
DEFINITION N.crassa matrix processing peptidase (MPP) mRNA, complete cds.
ACCESSION J05484
KEYWORDS matrix processing peptidase.
SOURCE N.crassa, cDNA to mRNA.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 2038)
AUTHORS Schneider,H., Arretz,M., Wachter,E. and Neupert,W.
TITLE Matrix processing peptidase of mitochondria: Structure-function
relationships
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Neupert, 17-APR-1990.
FEATURES from to/span description
pept 41 1774 matrix processing peptidase
BASE COUNT 439 a 626 c 552 g 421 t
ORIGIN
1 cccacattac gctgccgcat cacaattcct tgttgcagcc atgctgaatc gcttccggcc
61 agcgcggcta gtagcccaat cctccagatg cttgcccttg acgagggcgc gggcaggtcc
121 cttgcccgtt aacaatgcca ggactttggc tacgagagcc gctgctgtca acaccaagga
181 accgaccgaa cgcgacaaca tcaccactct ctccaatggt gtccgtgtcg cttccgagga
241 ccttcccgat gccttctccg gtgtaggtgt ctacatcgac gcggggtccc gatatgagaa
301 cgactatgtc cggggtgcca gtcacatcat ggaccggcta gccttcaagt ctacaagtgc
361 gaggactgcg gacgaaatgc tcgaaactgt tgagaagctc ggtggtaaca ttcagtgcgc
421 ttcttcgcgc gagtctatga tgtaccaggc ggccaccttc aacaaggcta ttcccaccgc
481 tgttgagctc atggccgaga ccatccgcga tcccaagctt acggacgagg agctggaggg
541 acagatcatg acggcgcaat atgaggtcaa cgagatctgg tccaaggccg aactgatcct
601 gcccgagttg gtgcacatgg ctgccttcaa ggacaacact cttggcaacc cgttgctttg
661 tcccaaggag aggttggatt acatcaaccg ggatgtcatc caaacatacc gcgacgcttt
721 ctacaggccc gagcgccttg ttgttgcctt tgctggtgtg cctcatgaga gggccgtcaa
781 gctcgcagag aagtactttg gtgatatgaa ggcctccgat gctcccggtc tctcgaggac
841 aggttccgaa acctccgtcg actcgctagt gtccgagtcc agcgaggcct cgagtgaatc
901 ttcatcatcc tcctcggact cttccgagtc gagtggcggg ctgctctcca agcttttctc
961 tcccaaggcc aagaaagcca cccccaaccc cttcctcacc cgggtaccta ttagcaccga
1021 agacttgact cggcctgctc actacacagg cggtttcctc accctcccat cacagccccc
1081 accgctcaac cccaaccttc ccacatttac tcacatacag ctcgccttcg agggcctcgc
1141 catctcggac gacgacatct acgccctcgc caccctgcag accctcctcg gcggcggcgg
1201 ctccttctct gccggcggtc ccggcaaggg catgtactcg cgtctctaca ctaacgttct
1261 caaccagcac ggctgggttg agtcctgcgt ggccttcaac cactcataca cggactcggg
1321 tctcttcggc atcgccgcct cgtgctaccc gggtcgcacc ctgcccatgc tccaggtcat
1381 gtgccgcgag ctgcacgccc tcaccaccga ccatggctac tcggccctgg gcgagctcga
1441 ggtttcgcgc gccaagaacc agctccgcag cagcctcctg atgaacctcg agagccgcat
1501 ggtcgagctc gaggatctgg gccgccaagt tcaggttcac ggtcgcaaga tcccggtccg
1561 cgagatgacg cgccgtatca acgagctgac ggtcaaggac ctccgaaggg tcgctaagcg
1621 cgtggttggt ggcatggcga ataacgccgg ccagggaagc ggtgcgccga cggtggtgct
1681 gcaggaggcg acggtgcaag gactcaagac tacggagctg gggtgggatc agatccagga
1741 tacaattgct cagtggaagc tcggtagacg gtaaacgttt gtcaagggga aaaaaagagt
1801 agggcgtgga gaagttatgt aagaggagcg ctgtattgaa cttggcgaca cgcacacacc
1861 ggaacgataa aggcgtttta ggttccccac gagcataggg aagaggctag atggttgctc
1921 tgtacaatcg caacttttct tggtgagtta tacaagatgt gtccaggtac atctttgcct
1981 taccatactg tacgatagca atgaagattt tctgatatat caaaagtcaa aagtcaaa
//
LOCUS XELXOTCH 9163 bp ss-mRNA VRT 17-JUL-1990
DEFINITION X.laevis Xotch protein mRNA, complete cds.
ACCESSION M33874
KEYWORDS Xotch.
SOURCE X.laevis, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 9163)
AUTHORS Coffman,C., Harris,W. and Kintner,C.
TITLE Xotch, the Xenopus homolog of the Drosophila notch gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Coffman, 26-APR-1990.
Author address: C.R.Coffman
University of California
Dept. of Biology-B022,
La Jolla, CA 92093
FEATURES from to/span description
pept 346 7917 Xotch protein
mRNA < 1 9163 Xotch mRNA
signal 9146 9151 poly-A signal
rpt 409 4632 EGF-like repeats
rpt 4684 5025 lin-12/Notch repeats
rpt 5953 6591 cdc-10/SWI-6 repeats
BASE COUNT 2499 a 2192 c 2188 g 2284 t
ORIGIN
1 gaattccgga cgcctggtgg aactaaacag actggaatgc actggaacac ggaagcctat
61 gcggggatac attgtagcgc tgagcccttc cgctgtaaca ttgtagcgct gagccgctca
121 tacactgtag cgctgtaact tcaggctgtg cgcaaactga gtgcgctatt ctctctccaa
181 tctggaaact ctgtcgcttt tggaccaggg attgcactga cttggattat cttcaccttt
241 ctctaaggga tttatagatt tttgccttgg gacctttatc ctccccttgt cctgggacag
301 ggaccttcat caggatctat gttcctatag tgagactgca ggaaaatgga tcggataggg
361 ctggctgtgc tgctctgctc cctgcctgtc ctcacacaag gtcttcgatg cactcagact
421 gcagaaatgt gcctgaacgg tgggaggtgc gaaatgaccc ctggtggaac tggggtgtgc
481 ctatgcggta atttgtattt cggtgaaagg tgccagttcc ccaatccctg caccataaag
541 aatcagtgta tgaactttgg aacctgcgag cctgtgctac aggggaacgc catagacttc
601 atctgccact gccccgtggg tttcaccgac aaagtgtgcc tgactcccgt ggataacgcc
661 tgcgtcaaca acccgtgccg gaatggagga acctgtgaac ttttgaatag cgtgactgaa
721 tacaaatgca ggtgtccccc aggatggaca ggggattcgt gccagcaggc agatccatgt
781 gcctctaatc catgtgccaa tggtgggaag tgtttgccct ttgagatcca gtatatctgc
841 aagtgtcccc ctggattcca tggagctacc tgcaagcagg acattaatga gtgcagccaa
901 aatccctgca agaatggcgg ccagtgcatc aatgagtttg gatcataccg ctgcacctgt
961 cagaacaggt ttactggaag aaactgtgat gagccatatg tcccctgcaa cccatcgccg
1021 tgcttgaatg gagggacctg ccgccaaact gatgacacct cctatgattg cacttgtctt
1081 ccaggcttta gtggacaaaa ttgtgaagag aacattgacg actgcccgag caacaactgc
1141 cgcaatggtg ggacctgtgt ggatggagtc aacacctata actgccagtg tccacctgac
1201 tggacaggcc aatactgcac ggaggatgtg gacgaatgcc aactgatgcc aaacgcttgc
1261 caaaatgggg gaacctgcca caacacctac ggtggataca actgtgtttg tgtcaacggc
1321 tggaccggag aggactgcag cgagaacatt gacgattgtg ccaacgccgc ctgccatagt
1381 ggtgccacct gtcatgatcg cgtggcatct ttctactgcg agtgcccaca cggccggaca
1441 ggtctccttt gccacctgga taatgcctgt atcagcaatc cctgcaacga aggctccaac
1501 tgtgacacca acccagtcaa cgggaaagcc atttgcacct gcccacctgg gtacactggt
1561 cctgcttgca acaatgatgt ggatgagtgc tctctgggtg ccaatccatg tgagcatggt
1621 gggagatgta ccaacaccct gggttccttc cagtgcaact gcccccaggg atatgcagga
1681 ccccgctgcg agattgatgt caatgaatgt ctttccaacc cgtgccaaaa tgattccaca
1741 tgcctggacc agattggaga gtttcagtgt atatgtatgc caggttacga aggcctttac
1801 tgtgaaacca acatagatga atgtgccagc aacccctgcc tacataatgg caagtgcatt
1861 gataaaatca acgagttccg ctgcgactgt cccacaggct tcagtggaaa cctctgccag
1921 catgactttg atgagtgtac cagcactccg tgcaaaaatg gagccaaatg cttggatggt
1981 ccaaattctt acacctgcca atgtacagaa ggtttcaccg ggcgccactg cgaacaagac
2041 atcaatgaat gcattcctga tccgtgccac tacggcacct gcaaagacgg cattgctacc
2101 ttcacatgcc tgtgccggcc tgggtacacg gggcgtctgt gtgataatga catcaatgag
2161 tgcctgagca agccgtgcct aaacgggggg cagtgtacag accgagagaa tggatacata
2221 tgcacttgtc cgaaaggaac cacaggggtg aattgcgaaa ccaaaataga tgactgtgcc
2281 agtaacctgt gcgataatgg aaagtgcatt gacaagattg atggctacga gtgcacctgc
2341 gagcctggtt acacaggcaa gttgtgcaat attaacatta acgaatgtga cagcaatcca
2401 tgccgcaatg gagggacttg caaagaccag atcaatggct tcacatgtgt gtgcccagat
2461 ggctaccatg accacatgtg cctgtctgaa gtaaatgaat gcaacagcaa tccatgtatc
2521 cacggggcgt gccatgatgg agtcaatggg tataaatgtg actgtgaagc aggatggagc
2581 ggtagcaatt gtgacatcaa taacaatgag tgtgaatcca acccgtgtat gaatggtggt
2641 acctgcaaag acatgaccgg agcatacatc tgtacttgca aagcaggatt tagtgggcca
2701 aactgccaaa ctaacatcaa cgaatgttcc tctaacccct gcttgaacca tggaacgtgt
2761 attgatgatg ttgctggtta caaatgcaac tgcatgttac cttatacagg tgccatttgt
2821 gaggcagttt tggctccgtg tgcaggcagt ccttgtaaaa atggtggcag atgtaaagaa
2881 tcagaagact ttgaaacctt ctcttgtgaa tgtccacccg ggtggcaagg gcaaacatgt
2941 gagattgaca tgaatgaatg cgtaaatcgc ccttgccgta acggtgcaac gtgtcaaaac
3001 accaatggca gctacaagtg caactgcaaa cctggttaca ctgggcggaa ctgcgagatg
3061 gacattgatg actgccaacc aaatccttgt cacaatggag gatcctgttc tgacggcatt
3121 aatatgtttt tctgtaactg cccggccggg ttccgggggc ctaagtgcga agaggacatt
3181 aacgagtgtg ccagtaaccc gtgcaaaaat ggagccaact gcactgattg tgtaaacagc
3241 tacacatgca cctgccagcc tggatttagt ggcatccact gtgaaagcaa cacccctgac
3301 tgcaccgaga gctcgtgctt taatggtggc acctgcattg atgggatcaa tacgtttaca
3361 tgccagtgcc cacctggatt taccggaagc tactgccaac atgatatcaa tgaatgtgac
3421 tccaagcctt gtttgaacgg aggcacctgc caggacagtt acgggacata caagtgcacc
3481 tgtccgcaag gatacactgg gctgaactgc cagaacctgg tgcgttggtg tgactcttct
3541 ccctgtaaaa atggaggaaa gtgctggcag acaaacaact tttatcgttg tgagtgcaag
3601 agtggctgga ctggtgttta ctgtgatgtc cccagtgtct cctgcgaggt tgctgctaag
3661 cagcaaggtg tggatatcgt tcatctgtgt cgcaattctg gcatgtgtgt ggacactgga
3721 aacacacatt tctgccgctg tcaggctggc tacacaggca gttactgcga ggaacaggtg
3781 gacgaatgct cgccgaatcc ttgtcagaac ggtgccacat gcacagacta cttaggtggc
3841 tactcttgcg agtgtgttgc cggctaccat ggagtaaact gctccgagga gatcaatgag
3901 tgtttgtccc acccgtgcca gaatggagga acctgtattg atctcattaa cacctacaaa
3961 tgctcctgcc ccagaggaac tcaaggtgtg cactgtgaaa ttaatgtgga tgattgcacg
4021 cccttctacg actccttcac cttggaaccc aaatgcttca acaatggcaa atgtatagac
4081 cgcgtgggtg gctataactg catctgccct ccaggctttg taggggaacg ctgtgaaggc
4141 gatgtgaacg agtgcctgtc taatccctgt gactcccgtg gcacccagaa ctgcattcaa
4201 ctggtgaatg attaccggtg tgagtgccgg cagggattca caggaaggcg ctgtgaatct
4261 gttgtggatg gttgtaaggg aatgccctgc cgaaacggtg gaacatgtgc tgtcgccagc
4321 aatacagaac gtggatttat ttgcaaatgc cctcctgggt ttgacggagc cacttgcgag
4381 tatgattcac gaacctgcag caaccttcgc tgccagaatg gcggcacttg catttcggtg
4441 ctgacgagtt ccaaatgcgt gtgctcggaa ggctatactg gcgccacatg tcagtaccct
4501 gttatcagtc cttgtgcttc ccacccttgt tacaatggag gaacctgcca gttctttgct
4561 gaggaacctt tcttccagtg cttctgtcca aaaaacttta acgggctctt ctgccatatc
4621 ttggattatg agtttcccgg gggactaggc aagaatatta ctcctcctga taatgacgat
4681 atctgcgaaa acgagcagtg ctctgagctg gctgacaaca aagtctgcaa cgccaactgc
4741 aacaaccatg cttgcggatg ggatggagga gactgctctc tcaacttcaa tgacccatgg
4801 aagaactgca cccagtcgct gcagtgctgg aaatatttca atgatggcaa atgcgattct
4861 cagtgcaaca acactgggtg tctgtacgac gggtttgact gccagaaagt tgaggttcag
4921 tgcaaccctt tgtatgacca gtactgcaag gatcactttc aagatggcca ctgcgaccaa
4981 ggctgtaaca atgcagagtg cgaatgggat ggcctggact gcgccaacat gccagagaac
5041 ctggctgaag gcaccctggt gttggtcgtc ctcatgcccc ctgaaaggct taagaacaac
5101 tctgtcaatt tcctccgtga gctgagccga gtgctccaca caaatgtggt gttcaagaaa
5161 gacagcaaag gcgaatacaa gatctaccct tactatggca atgaggaaga acttaaaaaa
5221 catcacatca agaggtcaac tgattactgg tcggatgctc cttctgccat cttcagcaca
5281 atgaaagaat ccattttact aggaaggcat cgtagggagt tggatgaaat ggaagttagg
5341 gggtcaatcg tatacttgga aatagacaac cgccagtgtt acaaatcctc ctctcagtgc
5401 ttcaacagtg ctacagatgt tgccgcattc cttggggctt tggcctccct tgggagtctg
5461 gacacccttt cctataagat tgaggcagtg aaaagtgaga acatggagac ccccaagccc
5521 tctactctct atcccatgtt atccatgttg gtcattccgt tattaatcat cttcgtcttc
5581 atgatggtca tcgttaataa gaagcgtcgc cgtgaacacg acagctttgg ttccccgacg
5641 gctttattcc aaaagaaccc agcaaaaaga aacggcgaga cgccttggga ggattctgtt
5701 ggcttaaagc ccattaagaa tatgacagat ggctctttta tggacgataa tcagaatgaa
5761 tggggagatg aggagaccct ggaaaacaag aggttcaggt ttgaagagca agtgattctc
5821 ccagaacttg ttgacgacaa aactgacccc cgacagtgga cacggcagca ccttgacgcc
5881 gctgatctgc gcatttcatc catggcccca acgccgccgc agggagagat tgaggccgac
5941 tgtatggatg tcaatgtccg tggccctgat ggcttcaccc cacttatgat tgcctcctgt
6001 agcggaggtg gattagagac tggaaacagc gaagaggaag aggatgcttc agctaatatg
6061 atttccgact tcattgggca gggtgcccaa ctgcataacc aaaccgaccg cactggcgag
6121 acagcacttc atctggccgc aagatacgca cgtgctgatg cagccaagcg cctgttggaa
6181 tcgagtgcgg acgccaacgt ccaggataac atgggcagga ctcctctcca tgcagcagtg
6241 gcggctgatg ctcagggcgt attccagatt ctcattcgga accgagcgac agacttagac
6301 gcccgcatgt ttgatggcac tacccctctg atcctggccg ctcggctggc cgtggaaggg
6361 atggtggagg agcttatcaa tgctcatgca gatgtcaacg ctgttgatga atttggaaaa
6421 tctgctttgc attgggcagc ggctgtgaat aacgttgatg ctgcagctgt gcttctcaag
6481 aatagtgcaa ataaggacat gcaaaacaac aaggaagaga catccctgtt cttggccgca
6541 agagaaggca gctacgaaac tgccaaagtc cttttggatc actacgccaa ccgtgacatc
6601 acagaccaca tggatcggct gcctcgtgac atcgcccaag aacgcatgca ccacgacatt
6661 gttcacctgc tggatgaata taaccttgtg aagagcccaa cgctgcacaa tggtccgttg
6721 ggagcaacga cattatcacc tcccatctgc tcccctaatg gttacatggg gaacatgaag
6781 ccttctgttc agagcaagaa agcccgcaag cccagtatca aaggtaatgg ctgcaaagag
6841 gccaaagagc tgaaagccag aaggaaaaaa tctcaagatg ggaaaacaac tctcttggat
6901 tctggcagtt ctggagtgtt gtccccagtg gactccctgg agtcaacaca tggatactta
6961 tcagatgtgt cttctcctcc gctgatgacc tctccgtttc agcagtctcc atccatgcct
7021 ctgaaccact tgacaagcat gccagagtcc cagcttggca tgaatcacat aaacatggcc
7081 accaagcagg aaatggcagc aggttccaac agaatggctt ttgatgccat ggtgccacgt
7141 ctgacccatc tcaatgcctc aagccctaat accatcatga gcaatggatc catgcatttc
7201 actgtgggag gagctccgac tatgaacagc caatgtgact ggttagctag gctgcagaat
7261 gggatggtcc agaatcagta tgacccaatc agaaatggca tccaacaagg caatgctcaa
7321 caagctcaag ctcttcagca tggccttatg acctcgctcc ataatggtct gccagcaaca
7381 actctctccc aaatgatgac ctatcaggcc atgcccaaca caaggctagc caatcagcca
7441 catctaatgc aagcccagca aatgcaacag cagcaaaact tgcagttgca ccagagcatg
7501 cagcaacaac atcacaattc cagcacgacc tctactcaca tcaactcacc attctgcagc
7561 agtgacataa gccagacgga cctgcagcaa atgtcaagca acaacattca ttcagtaatg
7621 ccccaggaca ctcagatatt tgctgcatct ctgccttcca atcttacgca gtctatgaca
7681 actgcacaat ttttaacccc cccttcccag catagctact cctccccaat ggacaataca
7741 ccaagccatc aactacaagt accagaccac ccgttcctga cgccttctcc tgagtcacct
7801 gaccagtggt caagctcctc ccctcattcc aatatgtctg actggtcaga aggaatatca
7861 agtcctccca cgagtatgca gcctcagcgc acccacatac ctgaagcttt caagtaaaaa
7921 aaaaaaaagt ttaaaaaaat gtaaaatatt tttaaagaca ctgagagaga ctttaagaga
7981 ctgaaggaaa tttttatatg gtttttatac ttaaaataac agaacatttg aattttctag
8041 tatttattta tatatacgtt tgacctaaaa cactgccctt ttatttataa gctttttttc
8101 atttttagga caaaatattt taacttcttg ccttgaaagt ttttcagttc taaatcttat
8161 gaaattggtt cctgcctggt attgaaaacg gcaatgtatt tattttttat ttacctgaat
8221 agtatacagg aacaaaccac tggggtgggg gggttatcgg gatgtgtatt tagcagaaaa
8281 aagattttct ataaaatgaa atctttcagg ttttcattta tagcactaaa aagattccag
8341 tattaatttt aaattaaatc atgaagaaga tgctccgaat acctctcgct gtcagggagg
8401 tgcccaatag gttttttttt tttgtttttt ttttggaaaa cctgaaactc ttggtaagta
8461 cagaaagaaa gcataaaata caggagggca cgcctaggtc tgatgcattc tcaatgggaa
8521 taaaatcctg aaaagtgact cattttaaaa aattatattt gttagaaaaa acaaactgcc
8581 attttgaatc ccttgtgtcg catggggtat tggtcagaaa atgcatgcaa tttttggtac
8641 aagcccgtgg catattttct acagtttttt tttttttttt tttttttttg taaatatcta
8701 tggtggattt tgtttcaaaa atctatttgt atatcctgat ttatattgct tatccgagta
8761 ggagtgcagg ctggatggaa ccatgccgaa gacaaatcgc agattggaca ttgtttttta
8821 tttttttgcc tacagagact atttgcacat gaacaacaac aaaaaaaaga agatatattt
8881 aaatataaaa ttttatacaa acgaaaaaac aaaaaacaaa aactgtcgat ttgcctctag
8941 ttccacttct tctgtgatac gatgcagttg ggcagttcat tccttatgcc gtactgtaca
9001 aagatctgta gaatttagtt gtttttcctg aacaaataaa tatgtaatgt atcgagcaga
9061 aatgtagttt acaaaaaaaa ctatttgttg ttgttgctgt taacacttct gtaaacaaat
9121 tttttttggt ttgtttattt gttgtaataa aattgtacaa aat
//
LOCUS TRCTC26 638 bp ss-mRNA INV 17-JUL-1990
DEFINITION T.cruzi AMP-inducible protein (TC26) mRNA, partial cds.
ACCESSION M34062
KEYWORDS AMP-inducible protein.
SOURCE T.cruzi (strain X10/4) trypomastigote, cDNA to mRNA, clone G26.
ORGANISM Trypanosoma cruzi
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 638)
AUTHORS Heath,S., Hieny,S. and Sher,A.
TITLE A cyclic AMP inducible gene expressed during the development of
infective stages of Trypanosoma cruzi
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Heath, 08-MAY-1990.
FEATURES from to/span description
pept < 1 > 638 AMP-inducible protein (TC26) (AA at 1)
BASE COUNT 154 a 174 c 168 g 142 t
ORIGIN
1 accccaccag accaactggt aatggtagcg accggcgctc agcttggaat tccgtgcgct
61 gcttctagcc ctagagaagc tgatgatccc ccgtatccga cacaggcgta caaccctgct
121 tgtggttacg gacagtcagt ctcttctagc ggctctaaac aagggcccgc tcaatcagac
181 agactggacg gaggatcaga tctggcggcg tctcttgaca ctgacgtgtg ctggctggtc
241 gtgcacctgc agttttgtta cggacattgt ggagtacatg ctaacgagct tgcagatcag
301 tatgcgagga actatggaaa gtggacaata cacggagcaa ggaatcgcac ctttatggca
361 tacggatctg ctgacgtgtt ttactaccca gctcaccaac aagtggcgta ctaccattcg
421 tcaagacact catcgctacc tgctttgcgg cacaaggcca tcagatctcg cggtaaggac
481 ctgatcactc aggaagttct acaccgtcag aactggttca cctcgcaagg gcaaggtgcg
541 gggaatctga gctctggggc cgactatctg ggccgtgaga gattgcacga accaatgccg
601 tttctgcaac atctcaccgg aacagtctgc atatatat
//
LOCUS DROANNIX 1104 bp ss-mRNA INV 17-JUL-1990
DEFINITION D.melanogaster annexin IX mRNA, 3' end.
ACCESSION M34068 J05501
KEYWORDS annexin IX.
SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-6.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1104)
AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and
Suedhof,T.C.
TITLE Two novel annexins from Drosophila melanogaster: Cloning,
characterization and differential expression in development
JOURNAL J. Biol. Chem. 265, 11382-11388 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Suedhof, 04-MAY-1990.
FEATURES from to/span description
pept < 1 891 annexin IX (AA at 1)
signal 1091 1096 poly-A signal
BASE COUNT 281 a 300 c 306 g 217 t
ORIGIN
1 attctgcgca aggcgatgaa gggcttcggc accgacgaga aggccatcat cgagatcctg
61 gccaggcgtg gcatcgtcca gcgtttggag atcgctgagg cgttcaagac ctcgtacggc
121 aaggatctga tctcggacct caagtccgag ctgggcggca agttcgagga tgttatcctg
181 gctctgatga cgccgctgcc ccagttctat gcccaggagc tgcacgacgc catctcggga
241 ctgggaaccg acgaggaggc catcatcgag atcctctgca cgctgtccaa ctacggcatt
301 aagaccattg cccagttcta cgagcagagc ttcggcaagt ccctagagtc cgacctaaag
361 ggcgacacca gtggccactt caagcggctg tgcgtctcgc tcgtccaggg caaccgggat
421 gagaaccagg gcgtggacga ggccgcggcc atcgccgatg cccaggctct gcacgacgcc
481 ggtgagggac agtggggcac agatgagtcc accttcaact cgatcctgat cacccgctcc
541 taccagcagc tgcgccagat cttcctcgaa tacgagaatc tgtcgggcaa cgacatcgag
601 aaggccatca agcgggagtt tagcggctcc gtggagaagg gtttcctggc catcgtcaag
661 tgctgcaagt ccaagatcga ctacttttcg gagcgcctgc acgactccat ggccggcttg
721 ggcaccaagg acaagacgct gatccgcatc atcgtcagcc ggtcggagat cgatctgggt
781 gacatcaagg aggcattcca gaacaagtac ggcaagagct tggagtcctg gatcaaggag
841 gatgccgaga ccgatattgg atacgtcctg gtcactctta cggcttggta gacggaagca
901 gccggaatat ccgaatatct atgagcaata ccccactgtt caagtagaaa atgccaaaaa
961 aaaaaacgtt gcatttcccc aaaaaaaagt ataacaaaag cgaagaacaa atggagttgg
1021 tctatataca gtagttgtga tgtgttctaa aaatccaatc tacaaaacgc ttagtatttt
1081 ccctctgtgc aataatcgga attc
//
LOCUS DROANNX 1192 bp ss-mRNA INV 17-JUL-1990
DEFINITION D.melanogaster annexin X mRNA, complete cds.
ACCESSION M34069 J05501
KEYWORDS annexin X.
SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-16.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1192)
AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and
Suedhof,T.C.
TITLE Two novel annexins from Drosophila melanogaster: Cloning,
characterization and differential expression in development
JOURNAL J. Biol. Chem. 265, 11382-11388 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Suedhof , 04-MAY-1990.
FEATURES from to/span description
pept 91 1056 annexin X
signal 1175 1184 poly-A signal
BASE COUNT 271 a 348 c 363 g 210 t
ORIGIN Chromosome 93B or 19A-4,7.
1 gaattccaaa agtcccagga gaaagactga ttcgtgtgaa gtcgtctact gaagagccac
61 aaggaaccca aggaatcttc cagctgcata atggaataca aacccgtgcc cacggttaag
121 gacgcagctc ccttcgacgc ctcccaggac gcccaggtgc tgcgggcggc gatgaaggga
181 ttcggcaccg acgagcagga aatcatcgac gtgctcgtcg gcaggagcaa ccagcagagg
241 cagacgatca aggcggttta cgaagcggag ttcgagcgcg acctggtgga cgatcttaag
301 gacgagctgg gaggcaagtt cgaggacgtg atcgtgggtc taatgatgcc accagtggag
361 tacctgtgca agcaactgca cgccgccatg gcgggcatcg gaaccgagga ggccacgctc
421 gtcgagatcc tgtgcaccaa gaccaacgag gagatggccc agatcgtggc cgtctacgag
481 gagcgctacc agcgcccgct ggccgagcag atgtgcagcg agacctccgg ctttttccgc
541 cgcctgctca cgctgatcgt gaccggagta cgtgacggac tggacacgcc cgtcgacgtc
601 ggtcaggcca aggagcaggc cgcccagctc tactcggccg gcgaggccaa gctgggaacg
661 gacgaggagg tcttcaaccg gatcatgtcg cacgccagct tcccgcagct gcgacttgtc
721 ttcgaggagt acaaggtgct ctccgggcag accatcgagc aggccatcaa gcacgagatg
781 tccgacgagc tgcacgaggc catgatggcc atagttgagt gcgtccagtc accggcggcc
841 ttcttcgcca accgcctcta caaggccatg aatggcgccg gcaccgatga cgccacgctc
901 atccgcatca tcgtcagccg ctcggagatc gacctggaga ccattaagca ggagttcgag
961 cggatctaca accgtacgct gcacagcgcc gtggtggacg cggagacctc tggtgactac
1021 aagcgggccc tgacagccct acttggatcc gcctaggccc gaggatgtgg cagctggtcc
1081 gcccaatatt ttattcgtgt taatagcttt gatcgtagtg tgccttttag gaaaatcgct
1141 tttaatgtcg tctgcgcatg cgcacactgt tggcaataaa taaacggaat tc
//
LOCUS ECODKSA 1273 bp ds-DNA BCT 17-JUL-1990
DEFINITION E.coli dnaK suppressor (dksA) gene, complete cds.
ACCESSION M34945
KEYWORDS dnaK suppressor.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1273)
AUTHORS Kang,P.J. and Craig,E.A.
TITLE Identification and characterization of a new Escherichia coli gene
that is a dosage-dependent suppressor of a dnaK deletion mutation
JOURNAL J. Bacteriol. 172, 2055-2064 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 229 441 ORF 1
pept 619 1074 dnaK suppressor (dksA)
BASE COUNT 343 a 301 c 333 g 296 t
ORIGIN
1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa
61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc
121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata
181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg
241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt
301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg
361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa
421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg
481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg
541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc
601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt
661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa
721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa
781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac
841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac
901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc
961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg
1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag
1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga
1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct
1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt
1261 acgcatagaa gat
//
LOCUS STFCYCLI 2180 bp ss-mRNA INV 17-JUL-1990
DEFINITION Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds.
ACCESSION M33880
KEYWORDS cyclin B.
SOURCE Starfish (A.pectinifera) egg, cDNA to mRNA, clone
lambda-gt10-cyc10.
ORGANISM Asterina pectinifera
Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa;
Stelleroidea; Asteroidea; Spinulosida; Asterinidae.
REFERENCE 1 (bases 1 to 2180)
AUTHORS Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T.
TITLE The starfish egg mRNA responsible for meiosis reinitiation encodes
cyclin
JOURNAL Dev. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Kishimoto, 11-MAY-1990.
FEATURES from to/span description
pept 126 1313 cyclin B (CYC)
mRNA < 126 2180 cyclin B mRNA
BASE COUNT 650 a 440 c 482 g 608 t
ORIGIN
1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg
61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga
121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt
181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc
241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag
301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca
361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa
421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct
481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac
541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt
601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc
661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc
721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc
781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg
841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt
901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca
961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca
1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag
1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc
1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat
1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca
1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg
1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt
1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt
1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg
1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa
1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt
1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga
1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc
1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca
1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac
1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat
1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag
1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat
2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg
2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt
2161 gtaataaagt taatggaaat
//
LOCUS MUSH2A1X 929 bp ds-DNA ROD 17-JUL-1990
DEFINITION Mouse histone H2A.1 gene, complete cds.
ACCESSION M33988
KEYWORDS histone H2A.1.
SOURCE Mouse (BALB/c) DNA, clone 53.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 929)
AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and
Sch|mperli,D.
TITLE Structure of a mouse histone gene cluster
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Sch|mperli, 02-MAY-1990.
Author address: Dr.Schmperli
Zoologisches Institut, Baltzerstrasse 4
CH-3012 Bern
SWITZERLAND
FEATURES from to/span description
pept 164 556 histone H2A.1
mRNA 119 624 histone H2A.1 mRNA
BASE COUNT 238 a 247 c 222 g 222 t
ORIGIN
1 gagctcaaat tctggctttc tattgggtac gatatattaa ccaatgggag aaacacaaac
61 agaatacctc cagttagtat aaatgcttgc tgttcagttg cagaatttac tatatattct
121 tttcctttct ctgctttgcc tttactgata cttaaacgca tacatgtctg gacgcggaaa
181 gcaagggtgc aaggcccgcg ctaaggccaa gacccgctcc tcccgggccg gcctgcagtt
241 ccccgtgggc cgcgtgcacc ggctgctccg caagggcaac tactcggagc gcgtgggcgc
301 cggcgccccg gtgtacctgg cggctgtgct ggagtacctg acggccgaga tcctggagct
361 ggcgggcaat gcggcccgcg acaacaagaa gacgcgcatc atcccgcgcc acctgcagct
421 ggccatccgc aacgacgagg agctcaacaa gctgctgggc cgcgtgacca tcgcgcaggg
481 cggcgtcctg cccaacatcc aggccgtgct gctgcccaag aagaccgaga gccaccacaa
541 ggccaagggg aagtaatctg gcgattgtct gtactgccca gttgaaagtt aaccaaaaca
601 aaggctcttt tcagagccac ccacatcttt ccataaaatg agctgccacc tcgtgaaacg
661 ttcttccact acagttttta tactacatat gaaaaagtta cgaagtagct ttcaatctta
721 gtaaattgat tttaatactg ttagtccctg cgataaatct tacgaccttc cttagtttga
781 gtcaaaagtg tgtaagagat gaaaccttta gaacatacta taaattttta gtagaaattt
841 ggcacccagg tttgtcattc acgtcacgat tgtctagagc ataatggtag taagggctaa
901 gggccattaa atcccacttc catagtttc
//
LOCUS MUSH2A1XX 693 bp ds-DNA ROD 17-JUL-1990
DEFINITION Mouse histone H3.2 gene, complete cds.
ACCESSION M33989
KEYWORDS histone H3.2.
SOURCE Mouse (strain BALB/c) DNA, clone 53.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 693)
AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and
Sch|mperli,D.
TITLE Structure of a mouse histone gene cluster
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Sch|mperil, 02-MAY-1990.
Author address: D.Schlmperil
Zoologisches Institut, Baltzerstrasse 4
CH-3012 Bern
SWITZERLAND
FEATURES from to/span description
pept 151 561 histone H3.2
mRNA 116 611 histone H3.2 mRNA
BASE COUNT 159 a 206 c 184 g 144 t
ORIGIN
1 caggatttag aagcagaggc tgaccaatcc caacaaagcg cgggcccttt gaatgttctt
61 cggtccaata gcggatagtc tgattgtata aaaggtggac agcgccttgc agctcactat
121 agtgtcagtc tattttccct tgttaaagtc atggctcgta ctaagcagac cgctcgtaag
181 tccaccggtg gcaaggcccc gcgtaagcag ctggccacca aggccgcccg caagagcgcc
241 ccggccaccg gcggcgtgaa gaagcctcac cgctaccgtc ccggcaccgt ggcgctgcgc
301 gagatccggc gctaccagaa gtcgaccgag ctgctgatcc gcaagctgcc gttccagcgc
361 ctggtgcgcg agatcgcgca ggacttcaag accgacctgc gcttccagag ctcggctgtc
421 atggctctgc aggaggcgag cgaggcctac cttgtgggtc tgtttgagga caccaacctg
481 tgcgccatcc acgccaagcg tgtcaccatc atgcccaagg acatccagct cgcccgccgc
541 attcgtgggg agagggctta aacgaaacaa agtgcaaact ggaacccaaa ggctcttttc
601 agagccaccc acagtttctg taaaagcagc tgttactttt ctactgtctt ggaaactttg
661 tttttcaaag atttatctag ttacaaagta tac
//
LOCUS RATGGSLEC 1358 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Rat Gal/GalNAc-specific lectin mRNA, complete cds.
ACCESSION J05495
KEYWORDS Gal/GalNAc-specific lectin.
SOURCE Rat (strain Wistar) adult peritoneal macrophage, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1358)
AUTHORS Kurata,M.Ii., Itoh,H., Yamashina,N. and Kawasaki,I.T.
TITLE Molecular cloning and sequence analysis of cDNA encoding the
macrophage lectin specific for galactose and N-acetylgalactosamine
JOURNAL J. Biol. Chem. 265, 11295-11298 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Kawasaki, 20-APR-1990, for release after publication.
FEATURES from to/span description
pept 172 1092 Gal/GalNAc-specific lectin precursor
sigp 172 195 Gal/GalNAc-specific lectin signal peptide
matp 196 1089 Gal/GalNAc-specific lectin
signal 1330 1335 poly-A signal
BASE COUNT 340 a 325 c 365 g 328 t
ORIGIN
1 ccacctctgg ccactgctgc acagggaagc cacgatttca cacagttcca ggacttacct
61 cagcttcaca ctgcttctga cactctcctt ggtcccagct cagtatctgt ctggattctg
121 gtgtctcggt ttcagttcag atagccctag aggccttgtg tttgtgcaag catgacaatg
181 gcatatgaaa acttccagaa cttggggagc gaagagaaaa accaagaggc tggtaaagct
241 cctccccagt ccttcctgtg caatatcctc tcttggaccc acctcctcct gttctccctg
301 ggccttagcc tcctgctgct ggtggttatc tccgtgattg gatcccaaaa ttcccagtta
361 aggagggacc tagaaaccct gagaaccact ttagacaaca ccacctccaa caccaaggct
421 gaactacagg ccctggcctc caggggtgac agcttgcaaa caggaatcaa ttctctgaaa
481 gtggaggtgg atgatcatgg gcaggaactg caggcaggcc gaggcttgag ccagaaggtg
541 gcttctctgg agagcacagt ggagaagaag gagcagactc tcagaacaga tctatctgaa
601 ataaccgatc gtgtgcaaca gctggggaag gacttgaaga ccctgacatg tcagctggcc
661 agcctcaaga acaacggctc agcagtggcc tgctgccccc ttcactggat ggagcatgaa
721 ggcagctgct actggttctc tcagtctggg aagccgtggc ctgaagctga caagtactgc
781 cagctggaga attctaacct agtggtggtc aactccctgg cggagcagaa ttttctacag
841 actcacatgg gcagtgtggt cacttggatt ggcctaacgg accaaaatgg gccctggcga
901 tgggtggacg ggaccgacta tgagaaaggc tttacgcact gggccccaaa gcagccagat
961 aactggtatg gacacgggct gggaggaggc gaggactgtg cacacttcac ctcagatggt
1021 cgctggaatg atgacgtttg ccagaggccc taccgctggg tctgtgagat gaagctggcc
1081 aaggacagct aggagcactc tccaagtaat ttatttcttc agtggtttta gctggggaag
1141 cctggagttt ggactcctcc tatcccattt caatgttatt tgggactttt ctttttgtca
1201 aaaaatttaa gtcaaggaga aaggattgtg tttgagatgg tagtgtggtg tccggaagaa
1261 tggggttatt gaaatctgtg gtattttgta cagagtgccg cttattgtag gactttttaa
1321 tgtaacaaaa ataaaatact gttggtttta aaaaaaac
//
LOCUS MLUDNAA 4171 bp ds-DNA BCT 17-JUL-1990
DEFINITION M.luteus ribonuclease P (rnpA), 50S ribosomal subunit protein L34
(rpmH), DNA biosynthesis initiation protein (dnaA), and DNA
polymerase III beta subunit (dnaN) genes, complete cds.
ACCESSION M34006
KEYWORDS 50S ribosomal subunit protein L34;
DNA biosynthesis initiation protein;
DNA polymerase III beta subunit; ribonuclease P.
SOURCE M.luteus (strain IF03333) DNA.
ORGANISM Micrococcus luteus
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Micrococcaceae.
REFERENCE 1 (bases 1 to 4171)
AUTHORS Fujita,M.Q., Yoshikawa,H. and Ogasawara,N.
TITLE Structure of the dnaA region of Micrococcus luteus: Conservation
and variations among eubacteria
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Ogasawara, 05-MAY-1990.
FEATURES from to/span description
pept 435 37 (c) ribonuclease P (rnpA)
pept 601 464 (c) 50S ribosomal subunit protein L34 (rpmH)
pept 944 2491 DNA biosynthesis initiation protein (dnaA)
pept 3242 > 4171 DNA polymerase III beta subunit (dnaN)
signal 843 848 dnaA -35 signal
signal 866 871 dnaA -10 signal
BASE COUNT 672 a 1558 c 1361 g 580 t
ORIGIN
1 ctcgagcacg acgaagggcg agggcgtgac ggtcatctac gcgtgctccg aggcggcggg
61 caggtgcggc ttcagggcct tgccgagcgc gccgacggtc tcgcggcgca gcagcgcgta
121 gtccgcctcg gcggcggcag gcagcgcgcg gacctgcacc agaaccggca ggtcgcgcag
181 cgggggcagg cgcatctgct ccgcgacgac ggcgcgcagc cgccgcttca cccggttgcg
241 ggtcaccgcg ttccccacgg ccttggacac gacgaacccg gcccgcggcc gaggggcgct
301 cggcgacgtg gaccgggtct gatcggggtc cgtggcgacg ctcaccacga cggtgcgccg
361 acccgcgcgg gtgccggtgc gaccgaggtg acggaactcc gcgggcgtgc ggacgcgacg
421 gtcacggggc agcacggggc gcgccgtccc tggaccgagg tcctcaggcg gacagctcgg
481 cgcggttctt gccgcggcga gcggacagga tggcgcggcc ggcgcgggtg cgcatgcggg
541 cacggaagcc gtgcttgcgg gcacgacggc ggttgttggg ctggaacgtg cgcttggtca
601 ctgtgatctc tccacgacgt gttgtgtgcc ttcgaccgtc gacctgagca ggtggaccag
661 ggggaaggac cggtgaggct ctgacatgaa atccccggcg ggcacgatca accgacaccc
721 gggggaccag acaactctag tggccggggt acgcaggggt caaacgggtg cgcgcggatc
781 ccggacgccc gtgggcgccc gggacggtcg tccacaggat gataggggtt gtccaccgtc
841 tgtgcacaac cctgtggaca accgttagac tccgagcacc gtcgtccgtc tcgcgcagcc
901 gcccccgtcc ccggggacgt ctgccccagg aaaggaagcc tccgtggtgg cagaccaggc
961 cgtgctcagc tcgtggcgct ccgtcgtggg ctccctcgag gacgacgcgc gggtcagcgc
1021 ccgcctcatg ggcttcgtct acctggccca gccgcagggc ctcatcggca atacgctcct
1081 gctggccgtg ccgaacgaga ccacccgcga gacgctccag ggcacccagg tggccgacgc
1141 cctcacggac gccctgaccc aggagttccg cgaggagatc ctgctggcca tctcgatcga
1201 cgcgaacctg cagccgccgc ggaccccctc ctccgaggcc cgccgctcct ccctcgccgg
1261 cgggccgtcc ggcgccgcgg cgccggacgt cgagctgccc ccggccgcga ccgccgccac
1321 ctcccgccgc gccgtcgccg aggagctgcc gggtttccgc atcgagccgc cggccgacgt
1381 cgtgcccgcc gcgaacgcgg ccccgaacgg gaacggcaag ccgacccccg cgccgccgtc
1441 gacgtccgcg gagaccagcc gcctcaacga ccgctaccac ttcgagacgt tcgtgatcgg
1501 ctcgtccaac cggttcgcgc atgcggccgc gaacgcggtg gccgaggcgc cggcgaaggc
1561 gtacaacccg ctgttcatct acggcgagtc cggcctgggc aagacgcacc tgctgcacgc
1621 gatcgggcac tacgcccgcc gcctctaccc gggcttgcgg gtgcggtacg tgaactcgga
1681 ggagttcacc aacgacttca tcaactccat ccgccacgac gagggcgcct cgttcaagca
1741 ggtctaccgc aacgtggaca tcctgctgat cgacgacatc cagttcctgg cggacaagga
1801 ggcgacggtc gaggagttct tccacacctt caacacgctc tacaacaaca acaagcaggt
1861 ggtcatcacc tcggacctgc cgcccaagca gctctccggg ttcgaggacc ggctgcgctc
1921 ccgcttcgag tgggggctga tcacggacat ccagccgccg gacctcgaga cgcgcatcgc
1981 gatcctccgg aagaaggcgg aggccgaggg gctcgtggcc ccgccggagg cgctggagta
2041 catcgcctcg cgcatctcca cgaacatccg cgagctcgag ggtgcgctga tccgcgtgac
2101 cgcgttcgcc tcgctcaacc ggcagaccgt ggacatcgag ctggccgagc atgtgctcaa
2161 ggacctgatc acggacgaga cggcgcacga gatcacgccg gagctgatcc tgcacgccac
2221 gggggagtac ttcaatctca ccctcgagga gctgaccagc aagtcccgca cccgcacgct
2281 ggtgacggcg cggcagatcg ccatgtacct gctgcgcgag ctgaccgaga tgtcgctgcc
2341 caagatcgga caggtcctgg gcggccgcga ccacaccacc gtcatccacg cggaccgcaa
2401 gatccgcgag ctgatggccg agcggcggac gatctacaac caggtcaccg agctcaccaa
2461 cgagatcaag cggaaacagc gcggcgcctg agccacccgc cgtcccaccc tcgagtcccg
2521 gttcgaacct tcgaccggga ctcgagcctt cgtccacatt gtccaccggt gtggacaact
2581 ctgtggagaa cccccgtcgc tgtggacaac ctccgcccct gtggatggcg gaatcaggcc
2641 gctcacgccg cgtccacacc cctgggggcg acgacggtgc acaggctggt gaatcacacg
2701 gatgccattc ggcggtccac acgtatccac agatctccac agccgaaaac cgcatgatct
2761 cgcgggttcc gacgggtggg gacgcaccct gtggacagga atgtgggcaa ctggggggtg
2821 gatccgggga cgagtctgga caactccggt cgaacctgtg cagagcaggg ggaggacctc
2881 cggggctcgt ccacaccgga tgcacacccc gaacgagcgg gcgtccacac ggttccccca
2941 accggatccg catgctgacg cggcagaacg gcgttctcca caggatccac acccgctacc
3001 actccttctg atcccccttc agagattccg ccaagcaaca tcatctccgc cgcctcgcac
3061 cgagctgagg gccccgagag cccgggctgc gcccgagcac tcagaatcct tggaatcaca
3121 cgcttgtcat tcgcttccga cctgacaccc gccgcgcgga ccgcctatgc tgaggcccgc
3181 gtgtcgccgt tcccccgggc ccgcaacccc cgtcatcacc cccagagagg cagtgcagct
3241 cgtgaagttc accgtcgaac gcgacatcct caccgacgcc gtctcctggg ccgcccgctc
3301 cctgtccccg cgcccgccgg tgccggtgct ctcgggtctg ctgatcacgg ccgaggccgg
3361 cgtcgtgtcc atcgcgagct tcgactacga gacctccgcg cgcctggaga tcgaggcgga
3421 cgtcgagacc gcgggccagg tgctcgtctc cggccgtctg ctgaacgaca tcgtccgctc
3481 cctgccgcag gcccaggtca cggtggagct cgacggcggc aaggtgctcg tcacctgccg
3541 atcctcccgc ttctccctgg ccacgatgcc cgtgggcgac tacccggccc tgcccgagct
3601 gccggccccc gcaggcaccg tggacggggc cgcgttcgcc cacgccgtcg cccaggtcac
3661 cgtcgccgcc tccaaggacg acaccctgcc gatcctcacc gccgtgaagg tggagatcga
3721 gggcgacacc ataaccttcc tggccacgga ccgctaccgg ctggccatga aggagatccg
3781 ctggaccccg gcggacccgt ccatctcgac gtcgctgctc atcaaggcgc gcacgctcac
3841 cgaggtcgcc aagtccctcg gctccggcgg cgacctggag atcctcctcg gtcagaccgc
3901 cgacctcgtg ggcttcgcct ccggtggccg ccgcaccacc tccgtgctcg tggacggcga
3961 gtaccccaag atccgctcgc tcttccccga gtccagcccc atccaggccg tggtggacac
4021 cgccgccctg gtcgaggcgt cccgccgcgt ggccctcgtg gccgagcgca acaccgccct
4081 gcgcatggtg ttcacggagg gccaggtcac cctggacgcc ggcaccggcg atgacgcgag
4141 cgccaacgag tccgtgccgt gcacgctcga g
//
LOCUS HUMLYAM1 1192 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 1.
ACCESSION M32406
KEYWORDS leukocyte adhesion molecule-1.
SEGMENT 1 of 9
SOURCE Human mature lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1192)
AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.-J., Rambaldi,A., Spertini,O.,
Griffin,J. and Tedder,T.F.
TITLE Structure of the gene encoding the human leukocyte adhesion
molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
JOURNAL J. Biol. Chem. 265, 7760-7767 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Tedder, 01-MAR-1990.
FEATURES from to/span description
pept 1040 + 1042 leukocyte adhesion molecule-1 (LAM-1), exon 1
mRNA 931 > 1192 leukocyte adhesion molecule-1 mRNA
IVS 1043 > 1192 leukocyte adhesion molecule-1 intron A
BASE COUNT 410 a 185 c 326 g 271 t
ORIGIN Chromosome 1q22-25.
1 agagagctgt tataaagatt aaacaatata ataaatatgg cgcgtgagct tcagagtttt
61 tgttgttgtt attattattt tcctaaaaat gcaaatctga tttgcatttg actcattgac
121 tcacatcagt gggtcttcct ttttattgtc cttcatcata tgggtcctaa tttcacatgc
181 agtcttataa aaccatctca ttttatagtc caagaatatt aaaggtactt gtaggctccc
241 aaacctacac ggtgaaaagc tagagagcat gggctctctt caggggttaa cttcaggaag
301 tgccactaac aaggacgtcc actaggtggt gagcaaggaa agacggaggt gaaggaaccg
361 aaacgagtca agtccactgc ttagctctac tgaagttttg caaacatcat aaatatgtct
421 gaaatgcagt tttgatttgt agtatttgca atttccaagg gccatttacc acaggtagcc
481 aagagttagt ttagcattta tgaaaaagat aggggagggt ggtggttaag aaggaggtgg
541 aggagagagt gaaggaggaa gaggagaaca agaaccaaac aaaaacaaga acaagaacaa
601 gtagaagaag aggagcaggg aggaaaaaga agaggaagaa gaacagcaac aacaatgagt
661 gaaggaggag gagggtaagg aaagatgcat aggagaatgg aaggaaggat agaaaggagg
721 gaaggaagag agaatctagt cacattactt tctgatcagc agttcatttt tgtctcagtg
781 ggaggcaata gaggccagtc taggaaaggg gtggggaaag aggaaagaga agtgcaggag
841 gaaggggagg cccaagggga ggaggaggag gatgtgagac tgggttagag aaatgaaaga
901 aagcaaggct ttctgttgac attcagtgca gtctacctgc agcacagcac actccctttg
961 ggcaaggacc tgagaccctt gtgctaagtc aagaggctca atgggctgca gaagaactag
1021 agaaggacca agcaaagcca tggtgagcct ttcagcctaa aagacgttta gatgctcaga
1081 tagaaactct tggggttgta gaggcaggtg gcaaggatag gaatcacccc atttcaattc
1141 tggttttaaa taatatagaa actaaacatt ttctcagacc ctcaaaaaaa gt
//
LOCUS HUMLYAM2 363 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 2.
ACCESSION M32407
KEYWORDS leukocyte adhesion molecule-1.
SEGMENT 2 of 9
SOURCE Human mature lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 363)
AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
Griffin,J. and Tedder,T.F.
TITLE Structure of the gene encoding the human leukocyte adhesion
molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
JOURNAL J. Biol. Chem. 265, 7760-7767 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Tedder, 01-MAR-1990.
FEATURES from to/span description
pept + 229 + 310 leukocyte adhesion molecule-1 (LAM-1), exon 2
mRNA < 1 > 363 leukocyte adhesion molecule-1 mRNA
IVS < 1 228 leukocyte adhesion molecule-1 intron A
IVS 311 > 363 leukocyte adhesion molecule-1 intron B
BASE COUNT 115 a 51 c 69 g 128 t
ORIGIN Chromosome 1q22-25.
1 cactgagact aagcgtaaaa taaatagaac aaacaaactg tgcatcagtt ctgatgtaaa
61 tttgaagtaa ttttcatcta tgtctgagaa acctgttacc tcagacaggg ttagtagaca
121 tatgtgtttt attctgatta ttaagaaagt tgtaagcacc acctcaaagg ctataaatgt
181 gtggtttaag ggtatacatc taaatataat tttgtatttc atttgcagat atttccatgg
241 aaatgtcaga gcacccagag ggacttatgg aacatcttca agttgtgggg gtggacaatg
301 ctctgttgtg gtatgttatg atatttatat atcactaagt ctattttact tatattcatt
361 ttt
//
LOCUS HUMLYAM3 531 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 3.
ACCESSION M32408
KEYWORDS leukocyte adhesion molecule-1.
SEGMENT 3 of 9
SOURCE Human mature lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 531)
AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
Griffin,J. and Tedder,T.F.
TITLE Structure of the gene encoding the human leukocyte adhesion
molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
JOURNAL J. Biol. Chem. 265, 7760-7767 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Tedder, 01-MAR-1990.
FEATURES from to/span description
pept + 73 + 459 leukocyte adhesion molecule-1 (LAM-1), exon 3
mRNA < 1 > 531 leukocyte adhesion molecule-1 mRNA
IVS < 1 72 leukocyte adhesion molecule-1 intron B
IVS 460 > 531 leukocyte adhesion molecule-1 intron C
BASE COUNT 172 a 106 c 138 g 115 t
ORIGIN Chromosome 1q22-25.
1 ctggagtagt gctaggttct ttttagctgt aacattatgt aagtctgcat aggtcacact
61 gatgtcttgc agatttcctg gcacatcatg gaaccgactg ctggacttac cattattctg
121 aaaaacccat gaactggcaa agggctagaa gattctgccg agacaattac acagatttag
181 ttgccataca aaacaaggcg gaaattgagt atctggagaa gactctgcct ttcagtcgtt
241 cttactactg gataggaatc cggaagatag gaggaatatg gacgtgggtg ggaaccaaca
301 aatctctcac tgaagaagca gagaactggg gagatggtga gcccaacaac aagaagaaca
361 aggaggactg cgtggagatc tatatcaaga gaaacaaaga tgcaggcaaa tggaacgatg
421 acgcctgcca caaactaaag gcagccctct gttacacagg tagggagtga caagacggct
481 atgctgcctc agactcagga agggccacgg ttaagagaat actcagattt a
//
LOCUS HUMLYAM4 832 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 4.
ACCESSION M32409
KEYWORDS leukocyte adhesion molecule-1.
SEGMENT 4 of 9
SOURCE Human mature lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 832)
AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
Griffin,J. and Tedder,T.F.
TITLE Structure of the gene encoding the human leukocyte adhesion
molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
JOURNAL J. Biol. Chem. 265, 7760-7767 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Tedder, 01-MAR-1990.
FEATURES from to/span description
pept + 161 + 268 leukocyte adhesion molecule-1 (LAM-1), exon 4
mRNA < 1 > 832 leukocyte adhesion molecule-1 mRNA
IVS < 1 160 leukocyte adhesion molecule-1 intron C
IVS 269 > 832 leukocyte adhesion molecule-1 intron D
BASE COUNT 241 a 163 c 150 g 278 t
ORIGIN Chromosome 1q22-25.
1 aaaattttag ccatatgatt tttatgctat gaatttacca aataaacctt tcctgattat
61 ttaaatcatc tcagacaaaa ggttatctat gtctaaagaa atgactttga gtactaaaat
121 gtaatcacat taaaatattt tttttctgac ctccttaaag cttcttgcca gccctggtca
181 tgcagtggcc atggagaatg tgtagaaatc atcaataatt acacctgcaa ctgtgatgtg
241 gggtactatg ggccccagtg tcagtttggt aagtctcttt cctttctttg cttcttctta
301 ggtaaagtca caggaatcat tatagcttat catgaagctg gttggaacaa aatgatacta
361 gccactctga gaaatgggaa gttttgatca gaaagctctg ctttcacaat attgttacct
421 ttccgtaaag atttcataag tcagcatgaa gtttcgattc acttctcaac aagtcttttt
481 gagtaccaca agaagcacag tgttgggata aagctgtcag ggttacaata aggaattagc
541 atggtagatt cccgctctca agaagctcac gatctaatga gcttgttaga ttaattagaa
601 ctctaaggtc tggaagaaac tatgccattt atcattagga ggctgagtta cccagaaagt
661 atcttgcttt ttccttctag tagttccttt ccttcttgca gttctccaca cttaacacat
721 gtgctctgta gcacactgac tttgctggtg gccttctctc tcattttgca catggccaaa
781 aaacatgtca tctttaagac attgttcaaa gacagtttct tctaggaagc tt
//
LOCUS HUMLYAM5 712 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 5.
ACCESSION M32410
KEYWORDS leukocyte adhesion molecule-1.
SEGMENT 5 of 9
SOURCE Human mature lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 712)
AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
Griffin,J. and Tedder,T.F.
TITLE Structure of the gene encoding the human leukocyte adhesion
molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
JOURNAL J. Biol. Chem. 265, 7760-7767 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Tedder, 01-MAR-1990.
FEATURES from to/span description
pept + 377 + 562 leukocyte adhesion molecule-1 (LAM-1), exon 5
mRNA < 1 > 712 leukocyte adhesion molecule-1 mRNA
IVS < 1 376 leukocyte adhesion molecule-1 intron D
IVS 563 > 712 leukocyte adhesion molecule-1 intron E
BASE COUNT 200 a 145 c 139 g 228 t
ORIGIN Chromosome 1q22-25.
1 ctctgatgtg atagttattt cccgactaag ctggtcattc ccagttacac ctatttggct
61 ttaaggattc tcactacaga taatactgaa gataataata tgaagactag ctaatgttta
121 cttagaattt ctgatgagtc aggctttgtt ctaacgtcct tgacttatgc taattgaatt
181 acatttagtt tccatatcaa tttgataaag ataacacaat ttcattattc ctcttatata
241 gatgaagaaa ctgaagttgg agggttcaag taaccttgtt taaaggcaca tggttatcaa
301 gtggcagggc taggattcaa atccaggcgt cagttcctct taactcttcc ccatactgtt
361 tctttcccta ttgaagtgat tcagtgtgag cctttggagg ccccagagct gggtaccatg
421 gactgtactc accctttggg aaacttcagc ttcagctcac agtgtgcctt cagctgctct
481 gaaggaacaa acttaactgg gattgaagaa accacctgtg gaccatttgg aaactggtca
541 tctccagaac caacctgtca aggtgagtaa cttcagacta gaggttttgt catgcaatcc
601 tgggcttaca gtcagaacat tcagtagaag tttgctgaga agtcaaactt aggatcctaa
661 tttaacctaa cttttgttta acctactgtg atgtttctca aaggacttat tc
//
LOCUS HUMLYAM6 451 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 6.
ACCESSION M32411
KEYWORDS leukocyte adhesion molecule-1.
SEGMENT 6 of 9
SOURCE Human mature lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 451)
AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
Griffin,J. and Tedder,T.F.
TITLE Structure of the gene encoding the human leukocyte adhesion
molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
JOURNAL J. Biol. Chem. 265, 7760-7767 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Tedder, 01-MAR-1990.
FEATURES from to/span description
pept + 240 + 425 leukocyte adhesion molecule-1 (LAM-1), exon 6
mRNA < 1 > 451 leukocyte adhesion molecule-1 mRNA
IVS < 1 239 leukocyte adhesion molecule-1 intron E
IVS 426 > 451 leukocyte adhesion molecule-1 intron F
BASE COUNT 121 a 107 c 89 g 134 t
ORIGIN Chromosome 1q22-25.
1 gagggtcacc ttagctaggg cagcagcctg gagtagctac tcctctcccc acagctttca
61 atgcttcctt gccttcatct ctcattcacc acccaccatc attctcaaga aaataaagcc
121 tggaagcaat atcacaagta atgtagtcag gcagctttgg ctaaaaatcc aaagctcaag
181 ggagggtctc tactcagaaa tactgttttg tctttttttt tttttctttt tcattgaagt
241 gattcagtgt gagcctctat cagcaccaga tttggggatc atgaactgta gccatcccct
301 ggccagcttc agctttacct ctgcatgtac cttcatctgc tcagaaggaa ctgagttaat
361 tgggaagaag aaaaccattt gtgaatcatc tggaatctgg tcaaatccta gtccaatatg
421 tcaaagtgag taagtttgtc ctggaactga a
//
LOCUS HUMLYAM7 543 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 7.
ACCESSION M32412
KEYWORDS leukocyte adhesion molecule-1.
SEGMENT 7 of 9
SOURCE Human mature lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 543)
AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
Griffin,J. and Tedder,T.F.
TITLE Structure of the gene encoding the human leukocyte adhesion
molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
JOURNAL J. Biol. Chem. 265, 7760-7767 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Tedder, 01-MAR-1990.
FEATURES from to/span description
pept + 300 + 428 leukocyte adhesion molecule-1 (LAM-1), exon 7
mRNA < 1 > 543 leukocyte adhesion molecule-1 mRNA
IVS < 1 299 leukocyte adhesion molecule-1 intron F
IVS 429 > 543 leukocyte adhesion molecule-1 intron G
BASE COUNT 174 a 99 c 122 g 148 t
ORIGIN Chromosome 1q22-25.
1 tatcagaact aagaaagctt gggctgcagg tcgactctag gtgcattttc aggaactcta
61 tgaaccacaa atctgggcat tgagattctg taggcattag actagcaagg ctggtcagtc
121 tttgcctatg ctgtagactc atcaggggcc ttcccatgcc agtttcctca tctgtcaaat
181 ggcatcattt gggctactac tgggagatgt aaggaggaaa aaagtcaaat atcatgagat
241 agactaagga aataatgctg gtggtctcat gctatgtgcc ttactgattt ctctttcaga
301 attggacaaa agtttctcaa tgattaagga gggtgattat aaccccctct tcattccagt
361 ggcagtcatg gttactgcat tctctgggtt ggcatttatc atttggctgg caaggagatt
421 aaaaaaaggt atgtgagttt aacttcacat gaaaagaaca caactttaaa gtgaaaaaga
481 aaaaaaaaag aaacccacag gaaattaaat gtgatagatc aacacaagca ggatgccaag
541 ctt
//
LOCUS HUMLYAM8 524 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 8.
ACCESSION M32413
KEYWORDS leukocyte adhesion molecule-1.
SEGMENT 8 of 9
SOURCE Human mature lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 524)
AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
Griffin,J. and Tedder,T.F.
TITLE Structure of the gene encoding the human leukocyte adhesion
molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
JOURNAL J. Biol. Chem. 265, 7760-7767 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Tedder, 01-MAR-1990.
FEATURES from to/span description
pept + 380 + 398 leukocyte adhesion molecule-1 (LAM-1), exon 8
mRNA < 1 > 524 leukocyte adhesion molecule-1 mRNA
IVS < 1 379 leukocyte adhesion molecule-1 intron G
IVS 399 > 524 leukocyte adhesion molecule-1 intron H
BASE COUNT 146 a 104 c 103 g 171 t
ORIGIN Chromosome 1q22-25.
1 tagtttacag tattagcagc tgtccctcaa ggaagaatct gcaggtagat gagatgcaga
61 ttgggtggga taaacacttg aatgacatat tgggtcttgc caccaggcaa tttagcaatt
121 ctgtcttctt gagtagcacg gagatggaat ggaacctcag gaggcatctg catcaacatg
181 tctgttctgt attagtgtct accactgttt attaagccag ttcctcaaat ctcctttgac
241 acagataggg tccacctaac aaatacctaa tatacttcaa aagacagttt tgagagtggg
301 agtcttcctt ctcccttact tgaaaaactt taaattgtct aatttttgct aatgcctttt
361 tctctatttt ctatttcagg caagaaatcc aagagaaggt aagttttatt agtggcgagg
421 agtttccaca tctgctgatt cattctctac ttcttaagtt acttctgctc tagctagaca
481 catacccata gtagttatta ctgggtctat caatgacaga tagg
//
LOCUS HUMLYAM9 1696 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 9.
ACCESSION M32414
KEYWORDS leukocyte adhesion molecule-1.
SEGMENT 9 of 9
SOURCE Human mature lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1696)
AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
Griffin,J. and Tedder,T.F.
TITLE Structure of the gene encoding the human leukocyte adhesion
molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
JOURNAL J. Biol. Chem. 265, 7760-7767 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Tedder, 01-MAR-1990.
FEATURES from to/span description
pept + 310 328 leukocyte adhesion molecule-1 (LAM-1), exon 9
mRNA < 1 1414 leukocyte adhesion molecule-1 mRNA
IVS < 1 309 leukocyte adhesion molecule-1 intron H
BASE COUNT 443 a 387 c 292 g 574 t
ORIGIN Chromosome 1q22-25.
1 ataagcatca ctaaagagct tgttaggggt gcagaatctc aggctccact cagacctact
61 gaatcagagt ctgcatttta acaccatctc tgagtggtaa ggacatgaaa atctgagaag
121 tgctgctact agggtttgct tacatttgtt catcttcaga ggttcctaaa gcctggcctc
181 ttgtctgaga tttccagctg aaagcatttc cttgctcctc ttctcatctc taatgaatat
241 ttacctttac tactaacact ccaagttttg caatttttaa actcttatta tcttttgttt
301 ttctttcagt atgaatgacc catattaaat cgcccttggt gaaagaaaat tcttggaata
361 ctaaaaatca tgagatcctt taaatccttc catgaaacgt tttgtgtggt ggcacctcct
421 acgtcaaaca tgaagtgtgt ttccttcagt gcatctggga agatttctac ctgaccaaca
481 gttccttcag cttccatttc acccctcatt tatccctcaa cccccagccc acaggtgttt
541 atacagctca gctttttgtc ttttctgagg agaaacaaat aagaccataa agggaaagga
601 ttcatgtgga atataaagat ggctgacttt gctctttctt gactcttgtt ttcagtttca
661 attcagtgct gtacttgatg acagacactt ctaaatgaag tgcaaatttg atacatatgt
721 gaatatggac tcagttttct tgcagatcaa atttcgcgtc gtcttctgta tacgtggagg
781 tacactctat gaagtcaaaa gtctacgctc tcctttcttt ctaactccag tgaagtaatg
841 gggtcctgct caagttgaaa gagtcctatt tgcactgtag cctcgccgtc tgtgaattgg
901 accatcctat ttaactggct tcagcctccc caccttcttc agccacctct ctttttcagt
961 tggctgactt ccacacctag catctcatga gtgccaagca aaaggagaga agagagaaat
1021 agcctgcgct gttttttagt ttgggggttt tgctgtttcc ttttatgaga cccattccta
1081 tttcttatag tcaatgtttc ttttatcacg atattattag taagaaaaca tcactgaaat
1141 gctagctgca actgacatct ctttgatgtc atatggaaga gttaaaacag gtggagaaat
1201 tccttgattc acaatgaaat gctctccttt cccctgcccc cagacctttt atccacttac
1261 ctagattcta catattcttt aaatttcatc tcaggcctcc ctcaacccca ccacttcttt
1321 tataactagt cctttactaa tccaacccat gatgagctcc tcttcctggc ttcttactga
1381 aaggttaccc tgtaacatgc aattttgcat ttgaataaag cctgcttttt aagtgttaac
1441 tagtttgcct agtttgttat tttgaaaatt gatcatatgt tttgttttct ccccagtgag
1501 ttacatgctc cttcagggca gagtttgtgt cagatccctg gagtatctag tgcattactt
1561 gacactcaat aaatgaatgt tcaaataaat cagaaagagc atacagtgca ctgctgatat
1621 aagtttcagc atccctcttt ctctatggca tctgatgacc tgggtcagat atcacctaat
1681 gtcaacagct gaattc
//
LOCUS RATALADPS 1612 bp ds-DNA ROD 17-JUL-1990
DEFINITION Rat delta-aminolevulinate dehydratase processed pseudogene,
complete cds.
ACCESSION J04764
KEYWORDS delta-aminolevulinate dehydratase; pseudogene.
SOURCE Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1612)
AUTHORS Bishop,T.R., Frelin,L.P. and Boyer,S.H.
TITLE Isolation and characterization of a rat delta-aminolevulinate
dehydratase processed pseudogene
JOURNAL Genomics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.R.Bishop, 20-APR-1990.
FEATURES from to/span description
pept.ps 271 1264 pseudo-delta-aminolevulinate dehydratase
rpt 101 108 8 bp flanking direct repeat
rpt 1385 1392 8 bp flanking direct repeat
BASE COUNT 404 a 411 c 418 g 379 t
ORIGIN
1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt
61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt
121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca
181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta
241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc
301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa
361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct
421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga
481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca
541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg
601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat
661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg
721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat
781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac
841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat
901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga
961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca
1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct
1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa
1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc
1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga
1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa
1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga
1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta
1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt
1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg
1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at
//
LOCUS BOVPDEAP 585 bp ss-mRNA MAM 17-JUL-1990
DEFINITION Bovine cone photoreceptor cyclic nucleotide phosphodiesterase
alpha'-subunit (PDE), partial cds.
ACCESSION M33140 M29465
KEYWORDS cone photoreceptor cyclic nucleotide phosphodiesterase.
SOURCE Bovine dark-adapted frozen retina, cDNA to mRNA, clone BC-alpha-1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 585)
AUTHORS Charbonneau,H., Prusti,R.K., LeTrong,H., Sonnenburg,W.K.,
Mullaney,P.J., Walsh,K.A. and Beavo,J.A.
TITLE Identification of a noncatalytic cGMP-binding domain conserved in
both the cGMP-stimulated and photoreceptor cyclic nucleotide
phosphodiesterases
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 288-292 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 585 cone photoreceptor cyclic nucleotide
phosphodiesterase alpha'-subunit (AA at 1)
BASE COUNT 198 a 115 c 126 g 146 t
ORIGIN
1 agagaagtca tcttttataa aatcatcgat tacattttac atggaaaaga agagatcaaa
61 gtcattccga cacctcccat ggaccactgg actctcatta gtgggttgcc aacatatgtt
121 gctgaaaatg gatttatctg caacatgctg aacgccccgg cggatgaata cttcacgttt
181 cagaaaggac ctgtagatga aactggctgg gtcattaaaa atgtcttgtc cctgcctatt
241 gtcaacaaaa aggaagacat cgtgggcgta gctacatttt acaacaggaa ggatggaaag
301 ccttttgatg aatatgatga gcacatcgct gagactctca cacagtttct tggatggtct
361 ctcttaaata ctgacaccta tgagaaaatg aataagctgg agaacagaaa ggacatagcc
421 caggaaatgc tcatgaacca caccaaggct acacctgatg agatcaagtc tattttgaaa
481 tttaaagaga agttaaatat agatgtaatt gaagactgtg aagaaaaaca gcttgtcaca
541 attttgaagg aggacctgcc agacccacgg actgcagacc tgtat
//
LOCUS CHKG1CLSE 240 bp ds-DNA VRT 17-JUL-1990
DEFINITION Chicken delta-1-crystallin gene, intron 3 lens-specific enhancer
cor segments B3 and B4.
ACCESSION M33954
KEYWORDS delta-1-crystallin.
SOURCE Chicken DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 240)
AUTHORS Goto,K., Okada,T.S. and Kondoh,H.
TITLE Functional cooperation of lens-specific and nonspecific elements in
the delta-1-crystallin enhancer
JOURNAL Mol. Cell. Biol. 10, 958-964 (1990)
STANDARD simple staff_review
FEATURES from to/span description
site 17 116 core segment B3
site 112 235 core segment B4
BASE COUNT 61 a 52 c 53 g 74 t
ORIGIN
1 gtcagtgagg tgtgctcagc atgacctgcc ctcccaccct cttcagactg aacattcctg
61 aggaattgtt tcagtatgaa ttaggaatat tctttttcca atggcacttg ggatcccttt
121 gtgtctggct gcctgagtta gtagaagaca atgcacaata ttgtataggg gtgaagaaga
181 gtcagccact aagcactttt tctgaaatat tcattgttgt tgctcaccta ccatggacaa
//
LOCUS CHKOVAL 9206 bp ds-DNA VRT 17-JUL-1990
DEFINITION Chicken ovalbumin gene, complete cds.
ACCESSION J00895
KEYWORDS ovalbumin.
SOURCE Chicken oviduct DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1343 to 8906)
AUTHORS Woo,S.L.C., Beattie,W.G., Catterall,J.F., Dugaiczyk,A., Staden,R.,
Brownlee,G.G. and O'Malley,B.W.
TITLE Complete nucleotide sequence of the chicken chromosomal ovalbumin
gene and its biological significance
JOURNAL Biochemistry 20, 6437-6446 (1981)
STANDARD full staff_review
REFERENCE 2 (bases 1043 to 1562; 2675 to 4732; 8885 to 9206)
AUTHORS Benoist,C., O'Hare,K., Breathnach,R. and Chambon,P.
TITLE The ovalbumin gene-sequence of putative control regions
JOURNAL Nucleic Acids Res. 8, 127-142 (1980)
STANDARD full staff_review
REFERENCE 3 (bases 1357 to 1389; 2941 to 3052; and ivs junctions)
AUTHORS Breathnach,R., Benoist,C., O'Hare,K., Gannon,F. and Chambon,P.
TITLE Ovalbumin gene: evidence for a leader sequence in mRNA and DNA
sequences at the exon-intron boundaries
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 4853-4857 (1978)
STANDARD full staff_review
REFERENCE 4 (bases 1282 to 1420; 2952 to 2999)
AUTHORS Gannon,F., O'Hare,K., Perrin,F., LePennec,J.P., Benoist,C.,
Cochet,M., Breathnach,R., Royal,A., Garapin,A., Cami,B. and
Chambon,P.
TITLE Organisation and sequences at the 5' end of a cloned complete
ovalbumin gene
JOURNAL Nature 278, 428-434 (1979)
STANDARD full staff_review
REFERENCE 5 (bases 1343 to 8906; exons only)
AUTHORS McReynolds,L., O'Malley,B.W., Nisbet,A.D., Fothergill,J.E.,
Givol,D., Fields,S., Robertson,M. and Brownlee,G.G.
TITLE Sequence of chicken ovalbumin mRNA
JOURNAL Nature 273, 723-728 (1978)
STANDARD full staff_review
REFERENCE 6 (bases 2675 to 5042)
AUTHORS Robertson,M.a., Staden,R., Tanaka,Y., Catterall,J.F.,
O'Malley,B.W. and Brownlee,G.G.
TITLE Sequence of three introns in the chick ovalbumin gene
JOURNAL Nature 278, 370-372 (1979)
STANDARD full staff_review
REFERENCE 7 (bases 1 to 1042)
AUTHORS Heilig,R., Muraskowsky,R. and Mandel,J.L.
TITLE The ovalbumin gene family: The 5' end region of the X and Y genes
JOURNAL J. Mol. Biol. 156, 1-19 (1982)
STANDARD full staff_review
REFERENCE 8 (bases 5576 to 5624)
AUTHORS Lai,E.C., Woo,S.L.C., Dugaiczyk,A. and O'Malley,B.W.
TITLE The ovalbumin gene: Alleles created by mutations in the intervening
sequences of the natural gene
JOURNAL Cell 16, 201-211 (1979)
STANDARD simple staff_entry
REFERENCE 9 (bases 443 to 611)
AUTHORS Schweers,L.A., Frank,D.E., Weigel,N.L. and Sanders,M.M.
TITLE The steroid-dependent regulatory element in the ovalbumin gene does
not function as a typical steroid response element
JOURNAL J. Biol. Chem. 265, 7590-7595 (1990)
STANDARD simple staff_entry
COMMENT Eight exons reported. Sequence homologies with adenovirus early 1a,
adenovirus major late, chicken conalbumin, chkx and chky genes
noted for 5' flanking sequence.
OV1.3 is identical to OV1.8 except that intron E is shorter by 522
nucleotides [8].
FEATURES from to/span description
pept 2996 3163 ovalbumin, exon 1
3415 3465 ovalbumin, exon 2
4047 4175 ovalbumin, exon 3
4576 4693 ovalbumin, exon 4
5652 5794 ovalbumin, exon 5
6126 6281 ovalbumin, exon 6
7864 8259 ovalbumin, exon 7
pre-msg 1343 8906 oval mRNA and introns
IVS 1390 2978 oval intron A
IVS 3164 3414 oval intron B
IVS 3466 4046 oval intron C
IVS 4176 4575 oval intron D
IVS 4694 5651 oval intron E
IVS 5795 6125 oval intron F
IVS 6282 7863 oval intron G
allele 1282 1282 g may be c [1]
allele 1309 1309 g may be a [1]
allele 1376 1376 g may be c [1],[7]
allele 1385 1385 g may be a [1],[7]
allele 1393 1393 a may be g [1]
allele 3010 3010 t may be c [1]
allele 3154 3154 a may be g [1]
allele 5747 5747 g may be a [1]
allele 8032 8032 can be g [5]
conflict 1471 1471 c in [1]; t in [2],[7]
conflict 1523 1523 a in [1]; t in [2],[7]
conflict 1538 1539 at in [1]; a in [2]
conflict 2681 2681 t in [1],[6]; g in [2]
conflict 2696 2696 a in [1],[6]; g in [2]
conflict 3552 3552 a in [1],[6]; g in [2]
conflict 3683 3683 c in [1],[6]; t in [2]
conflict 3934 3934 a in [1],[6]; c in [2]
conflict 3960 3960 a in [1],[6]; g in [2]
conflict 3994 3994 t in [1],[6]; c in [2]
conflict 4207 4208 tt in [1],[6]; ttt in [2]
conflict 4396 4396 t in [1],[6]; c in [2]
conflict 5636 5636 c in [1]; g in [3]
conflict 6100 6100 t in [1]; tc in [3]
conflict 6119 6119 t in [1]; tg in [3]
conflict 8492 8505 gactcacagtactg in [1]; g in [5]
site 443 611 steroid-dependent regulatory element [9]
BASE COUNT 2994 a 1749 c 1721 g 2742 t
ORIGIN 5 bp upstream of PstI site.
1 ctgcagactg acatgcattt cataggtaga gataacattt actgggaagc acatctatca
61 tcataaaaag caggcaagat tttcagactt tcttagtggc tgaaatagaa gcaaaagacg
121 tgattaaaaa caaaatgaaa caaaaaaaat cagttgatac ctgtggtgta gacatccagc
181 aaaaaaatat tatttgcact accatcttgt cttaagtcct cagacttggc aaggagaatg
241 tagatttcta cagtatatat gttttcacaa aaggaaggag agaaacaaaa gaaaatggca
301 ctgactaaac ttcagctagt ggtataggaa agtaattctg cttaacagag attgcagtga
361 tctctatgta tgtcctgaag aattatgttg tacttttttc ccccattttt aaatcaaaca
421 gtgctttaca gaggtcagaa tggtttcttt actgtttgtc aattctatta tttcaataca
481 gaacaatagc ttctataact gaaatatatt tgctattgta tattatgatt gtccctcgaa
541 ccatgaacac tcctccagct gaatttcaca attcctctgt catctgccag gccattaagt
601 tattcatgga agatctttga ggaacactgc aagttcatat cataaacaca tttgaaattg
661 agtattgttt tgcattgtat ggagctatgt tttgctgtat cctcagaaaa aaagtttgtt
721 ataaagcatt cacacccata aaaagataga tttaaatatt ccagctatag gaaagaaagt
781 gcgtctgctc ttcactctag tctcagttgg ctccttcaca tgcatgcttc tttatttctc
841 ctattttgtc aagaaaataa taggtcacgt cttgttctca cttatgtcct gcctagcatg
901 gctcagatgc acgttgtaga tacaagaagg atcaaatgaa acagacttct ggtctgttac
961 tacaaccata gtaataagca cactaactaa taattgctaa ttatgttttc catctctaag
1021 gttcccacat ttttctgttt tcttaaagat cccattatct ggttgtaact gaagctcaat
1081 ggaacatgag caatatttcc cagtcttctc tcccatccaa cagtcctgat ggattagcag
1141 aacaggcaga aaacacattg ttacccagaa ttaaaaacta atatttgctc tccattcaat
1201 ccaaaatgga cctattgaaa ctaaaatcta acccaatccc attaaatgat ttctatggcg
1261 tcaaaggtca aacttctgaa gggaacctgt gggtgggtca caattcaggc tatatattcc
1321 ccagggctca gccagtgtct gtacatacag ctagaaagct gtattgcctt tagcagtcaa
1381 gctcgaaagg taagcaactc tctggaatta ccttctctct atattagctc ttacttgcac
1441 ctaaacttta aaaaattaac aattattgtg ctatgtgttg tatctttaag ggtgaagtac
1501 ctgcgtgata ccccctataa aaacttctca cctgtgtatg cattctgcac tattttatta
1561 tgtgtaaaag ctttgtgttt gttttcagga ggcttattct ttgtgcttaa aatatgtttt
1621 taatttcaga acatcttatc ctgtcgttca ctatctgata tgctttgcag tttgcttgat
1681 taacttctag ccctacagag tgcacagaga gcaaaatcat ggtgttcagt gaattctggg
1741 gagttatttt aatgtgaaaa ttctctagaa gtttaattcc tgcaaagtgc agctgctgat
1801 cactacacaa gataaaaatg tggggggtgc ataaacgtat attcttacaa taatagatac
1861 atgtgaactt atatacagaa aagaaaatga gaaaaatgtg tgtgtgtata ctcacacacg
1921 tggtcagtaa aaacttttga ggggtttaat acagaaaatc caatcctgag gccccagcac
1981 tcagtacgca tataaagggc tgggctctga aggacttctg actttcacag attatataaa
2041 tctcaggaaa gcaactagat tcatgctggc tccaaaagct gtgctttata taagcacact
2101 ggctatacaa tagttgtaca gttcagctct ttataataga aacagacaga acaagtataa
2161 atcttctatt ggtctatgtc atgaacaaga attcattcag tggctctgtt ttatagtaaa
2221 cattgctatt ttatcatgtc tgcatttctc ttctgtctga atgtcaccac taaaatttaa
2281 ctccacagaa agtttatact acagtacaca tgcatatctt tgagcaaagc aaaccatacc
2341 tgaaagtgca atagagcaga atatgaatta catgcgtgtc tttctcctag actacatgac
2401 cccatataaa ttacattact tatctattct gccatcacca aaacaaaggt aaaaatactt
2461 ttgaagatct actcatagca agtagtgtgc aacaaacaga tatttctcta catttatttt
2521 tagggaataa aaataagaaa taaaatagtc agcaagcctc tgctttctca tatatctgtc
2581 caaacctaaa gtttactgaa atttgctctt tgaatttcca gttttgcaag cctatcagat
2641 tgtgttttaa tcagaggtac tgaaaagtat caatgaattc tagctttcac tgaacaaaaa
2701 tatgtagagg caactggctt ctgggacagt ttgctaccca aaagacaact gaatgcaaat
2761 acataaatag atttatgaat atggttttga acatgcacat gagaggtgga tatagcaaca
2821 gacacattac cacagaatta ctttaaaact acttgttaac atttaattgc ctaaaaactg
2881 ctcgtaattt actgttgtag cctaccatag agtaccctgc atggtactat gtacagcatt
2941 ccatccttac attttcactg ttctgctgtt tgctctagac aactcagagt tcaccatggg
3001 ctccatcggt gcagcaagca tggaattttg ttttgatgta ttcaaggagc tcaaagtcca
3061 ccatgccaat gagaacatct tctactgccc cattgccatc atgtcagctc tagccatggt
3121 atacctgggt gcaaaagaca gcaccaggac acaaataaat aaggtgagcc tacagttaaa
3181 gattaaaacc tttgccctgc tcaatggagc cacagcactt aattgtatga taatgtccct
3241 tggaaactgc atagctcaga ggctgaaaat ctgaaaccag agttatctaa aagtgtggcc
3301 acctccaact cccagagtgt tacccaaatg cactagctag aaatcttgaa actggattgc
3361 ataacttctt tttgtcataa ccattatttc agctactatt attttcaatt acaggttgtt
3421 cgctttgata aacttccagg attcggagac agtattgaag ctcaggtaca gaaataattt
3481 cacctccttc tctatgtccc tttcctctgg aagcaaaata cagcagatga agcaatctct
3541 tagctgttcc aagccctctc tgatgagcag ctagtgctct gcatccagca gttgggagaa
3601 cactgttcat aagaacagag aaaaagaagg aagtaacagg ggattcagaa caaacagaag
3661 ataaaactca ggacaaaaat accgtgtgaa tgaggaaact tgtggatatt tgtacgctta
3721 agcaagacag ctagatgatt ctggataaat gggtctggtt ggaaaagaag gaaagcctgg
3781 ctgatctgct ggagctagat tattgcagca ggtaggcagg agttccctag agaaaagtat
3841 gagggaatta cagaagaaaa acagcacaaa attgtaaata ttggaaaagg accacatcag
3901 tgtagttact agcagtaaga cagacaggat gaaaaatagt tttgtaaaca gaagtatcta
3961 actactttac tctgttcata cactacgtaa aacttactaa gtaataaaac tagaataaca
4021 acatctttct ttctctttgt attcagtgtg gcacatctgt aaacgttcac tcttcactta
4081 gagacatcct caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta
4141 gactttatgc tgaagagaga tacccaatcc tgccagtaag ttgctctaaa atctgatctg
4201 agtgtattcc atgccaaagc tctaccattc tgtaatgcaa aaacagtcag agttccacat
4261 gtttcactaa gaaaatttct ttttctcttg tttttacaaa tgaaagagag gacaaataac
4321 atttctctat caccgacctg aaactctaca gtcttcagag aatgaatggc ttgctaaaag
4381 aatgtcaaat cttactatac agctatttca tattacacta ctaaatacac tataaggcat
4441 agcatgtagt aatacagtgt aaaatagctt tttacactac tatattatta atatctgtta
4501 attccagtct tgcatttcac atttgcaaaa cgttttgaaa ttcgtatctg aaagctgaat
4561 actcttgctt tacaggaata cttgcagtgt gtgaaggaac tgtatagagg aggcttggaa
4621 cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc ctgggtagaa
4681 agtcagacaa atggtaaggt agaacatgct ttgtacatag tgagagttgg ttcaccctaa
4741 tactgagaac ttggatatag ctcagccagc gtgctttgcg ttcaagctta ccagagctgt
4801 tgtatgcctg ttaagcaggg catacagtca tgaggctctt gaaaaatctt aacagacaaa
4861 gggcaatgga aaatcggagt taagggatgg tagggataaa atgcatagaa agaggtacca
4921 caattttgat ttttgcccta atgcctctct gcgtggttcc tcaatttttc tacttcattc
4981 ctcatctcct cagagcattc ctttccctca tgcttgaaac acagatgaaa gactgtgaat
5041 tctaactgag atgaaaacat ccacaaccac acaacctctg gtgtggagtc acattctgtg
5101 aaggcaaaaa ctaggccacg taatctatgc gtgcaagcta cgcgtaagct atgtgtgtga
5161 caggacaatg tgaggaacat actatgtgca caaggactgc agaataaaca ggagcaaagt
5221 ttttgaagaa aacagagtaa aatcctgttt tcctcttttg ttacattctt tacatatatc
5281 tcaaatttcc tctttggtta gaagcaagta atatttatgt ttcttggtac tgtttgggtt
5341 gaagaccatt ctgggataag agaaattcca gtggttcttc ccctaatcat aaaatgtcag
5401 gtttagtttt tttgtaacac agaaatctct tcatctttta tcttttgttg tgattcttga
5461 tagagagaga aacaagactt actgacaata gcagcaagaa aatcaatctt ggaagaacaa
5521 gattgcaatt gcaaaaacaa accaatgtcc ttgcccctac atcctcttcc ccataaattc
5581 tacattctct atctaccttg tgcttgccaa catgatatac gtaaactctc ttttcctatt
5641 cattcttaaa ggaattatca gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc
5701 aatggttctg gttaatgcca ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga
5761 agacacacaa gcaatgcctt tcagagtgac tgaggtatat gggcatacct tagagatgta
5821 atctagaatt tatgaagaga gtagacatgt tgttatatga acactgcatt agcgtatctg
5881 ctcatttgtc tgcatctctt tcagacactg tgttaaaagc agggaatttt ccttatgtct
5941 ctctcgtcac aatattcctg acattgcaaa gctcctgaga aataacttca gattccactt
6001 ttcctaggaa ggcttctgga tgagaactaa tcatcttaac tgtaactaga catttctgca
6061 tccaagaata atctttgtta aaactatatt ctctctctct tttttttttt tttttggttc
6121 tccagcaaga aagcaaacct gtgcagatga tgtaccagat tggtttattt agagtggcat
6181 caatggcttc tgagaaaatg aagatcctgg agcttccatt tgccagtggg acaatgagca
6241 tgttggtgct gttgcctgat gaagtctcag gccttgagca ggtatggccc tagaagttgg
6301 cttcagaata ttaaaaacac atggaaattt agctgttgta aagctctttt caacacagtt
6361 atcctaaaac atttaaccag cacaaatttc atcatgattc aatatgtgat tgttgcatag
6421 aagtgtagat ttgtcccact gggtcctgca atagcccatg ctgagcatgg cttgctgaaa
6481 gaactgcttt agagggtgaa aagtttgaca cagcagacaa gatgattctc acctaagcag
6541 ctgttactgt agtggcttga actctaaagg tcttgtatct ccattcctgt gcactgagga
6601 gcttcttgga aagttcatat aaggtttact agttctaact attatctcat ttggtggcac
6661 tcaatgtgct ttgttcacgt cttcataaat taatctatct aaaaattgga tgtggttaaa
6721 gcaatttcag aaataacatg tacataatgt acaattattg atatgaacag aacacaggca
6781 tagcatattg taattaggag gactgtagtt attttgaata ggaaacacaa tgtaataaat
6841 gagaattcat tgaaatgtta gtatgctaac tcaatctaaa ttataaagat aaagaggcat
6901 ttaatcacag ctagatttcc atcacttgtg acagacaggc atatgaatga ttatgtacag
6961 ctctaggaaa aaaagtatgt aggaaaacta gtacattttg attagaaagt ctgaaaatga
7021 ggtgccttga tcaaagagaa tacgtgtgtt tgagaaaaaa aaagtttgga tagaggtggt
7081 aagagagaat atattgaaat ggtgtttcta caaactgcca tggccagatt tgtgtaagag
7141 acattcagta agtaggcaag gaaagaaata ttactaggta caaagcaaca tcagtaatac
7201 caaaagaaac caattattcc agatgccaat ctcgtaatag ggttaagaga tttccacccc
7261 tctagtggtc accagtgcaa ccagtaactt tgctaattta cattttcttt ttttaaatgg
7321 cagatatagc tttgaactga gtgatcatga actggtactg tgtaatagat gaagacatac
7381 ttgacgacta aacttctgat ttttaaaaac tcaaattctc ttgaaagatc agttcccagt
7441 ctagtaacag ctgatagttt aagtatcagt aattggctac cattaacaac tggctcctga
7501 gaggtcttaa atgtagagac agctttaaac tcaaaagcac agagtgattt ttagaataga
7561 tttcccaagc aaagaaaata aacagggagg agctttaagg gagtagccat ctcattatta
7621 ttattattta aagaaatggc agcaagccta caaaagaaaa ataagacaga gcagagaaga
7681 aagagtcatg gtatgctttt ctatcttagc aaaattaatc tctacatgcc taggaaaaag
7741 ccatgacaag agcaatcagt tcaaaaggtg tatgcaaaaa accacataat agtaactagt
7801 actgcattgc caggaaggaa gttatgtcgc cattccatgg atctcattct catttccttg
7861 cagcttgaga gtataatcaa ctttgaaaaa ctgactgaat ggaccagttc taatgttatg
7921 gaagagagga agatcaaagt gtacttacct cgcatgaaga tggaggaaaa atacaacctc
7981 acatctgtct taatggctat gggcattact gacgtgttta gctcttcagc caatctgtct
8041 ggcatctcct cagcagagag cctgaagata tctcaagctg tccatgcagc acatgcagaa
8101 atcaatgaag caggcagaga ggtggtaggg tcagcagagg ctggagtgga tgctgcaagc
8161 gtctctgaag aatttagggc tgaccatcca ttcctcttct gtatcaagca catcgcaacc
8221 aacgccgttc tcttctttgg cagatgtgtt tccccttaaa aagaagaaag ctgaaaaact
8281 ctgtcccttc caacaagacc cagagcactg tagtatcagg ggtaaaatga aaagtatgtt
8341 atctgctgca tccagacttc ataaaagctg gagcttaatc tagaaaaaaa atcagaaaga
8401 aattacactg tgagaacagg tgcaattcac ttttccttta cacagagtaa tactggtaac
8461 tcatggatga aggcttaagg gaatgaaatt ggactcacag tactgagtca tcacactgaa
8521 aaatgcaacc tgatacatca gcagaaggtt tatgggggaa aaatgcagcc ttccaattaa
8581 gccagatatc tgtatgacca agctgctcca gaattagtca ctcaaaatct ctcagattaa
8641 attatcaact gtcaccaacc attcctatgc tgacaaggca attgcttgtt ctctgtgttc
8701 ctgatactac aaggctcttc ctgacttcct aaagatgcat tataaaaatc ttataattca
8761 catttctccc taaactttga ctcaatcatg gtatgttggc aaatatggta tattactatt
8821 caaattgttt tccttgtacc catatgtaat gggtcttgtg aatgtgctct tttgttcctt
8881 taatcataat aaaaacatgt ttaagcaaac acttttcact tgtagtattt gaagtacagc
8941 aaggttgtgt agcagggaaa gaatgacatg cagaggaata agtatggaca cacaggctag
9001 cagcgactgt agaacaagta ctagtgggtg agaagttgaa caagagtccc ctacaagcaa
9061 cttaatctaa taagctagtg gtctacatca gctaaaagag catagtgagg gatgaaattg
9121 gttctccttt ctaagcatca cctgggacaa ctcatctgga gcagtgtgtc caatctgccg
9181 ctgccctgat ctcggctggg gtgatg
//
LOCUS PMUCEN 150 bp ds-DNA PHG 17-JUL-1990
DEFINITION Bacteriophage Mu wild type DNA fragment with a gyrase cleavage
site.
ACCESSION M32302
KEYWORDS .
SOURCE Bacteriophage Mu (wild type) DNA.
ORGANISM Bacteriophage mu
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 150)
AUTHORS Pato,M., Howe,M. and Higgins,P.
TITLE DNA gyrase binds to a centrally located replication enhancer (CEN)
in the bacteriophage Mu genome
JOURNAL Unpublished (1990) In Press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.L.Pato, 23-FEB-1990.
FEATURES from to/span description
site 49 50 DNA gyrase cleavage site in complimentary
strand
mut 44 44 a in wt; g in Mu nuB103
mut 50 50 g in wt; c in Mu nuB1
BASE COUNT 40 a 38 c 27 g 45 t
ORIGIN Map position at 18.0 kb.
1 acgcgtcagc gccgctctga ggcaataaac agaatcaggc ataaaatcag ccgcacagat
61 tttttaaaac gcgccacggg atttttaaac cggtatttaa cggtgtatga atcccgtttt
121 atcttccttt cactttcttt ctccagtact
//
LOCUS RATRNRTR 2577 bp ds-DNA ROD 17-JUL-1990
DEFINITION Rat snRNP-associated polypeptide N, complete cds.
ACCESSION J05497
KEYWORDS snRNP-associated polypeptide N.
SOURCE Rat male adult (Fisher) DNA, clones rgV and rgIII2.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2577)
AUTHORS Schmauss,C. and Lerner,M.R.
TITLE The closely related small nuclear ribonucleoprotein polypeptides N
and B/B' are distinguishable by antibodies as well as by
differences in their mRNAs and gene structures
JOURNAL J. Biol. Chem. 265, 10733-10739 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Schmauss, 13-APR-1990.
FEATURES from to/span description
pept 1314 2036 snRNP-associated polypeptide N
signal 774 779 TATA box
signal 698 702 CAAT box
BASE COUNT 822 a 537 c 581 g 637 t
ORIGIN
1 taactagaga actgagaaca gaatccctgt tagaggaatt agagaccaaa ttaaaagagg
61 tgaaggggct tgcaacccca ttagaacaac aatgccaacc aaccagagct cccagggact
121 aaaccactac ccaaagacta cacatggctc caactgcata tgtagcagag gatggcctag
181 ttgggcacaa tggaaagaga agcccttgga cctgcccagg ctggacccac cagtgtagag
241 taatgtctga ggggtagaag caggtggttg ggaatgggaa tacccttata tgtgaaggga
301 agcaggatga ggtagggaac ttatgttcgt aaaccaggaa agggaataac gtttgaaatg
361 taaataaaaa tatatccagt gaaaaaaaaa ctgaagtcta taataaaagc ttttaatcct
421 ctcagccctt aataaaagtt aattatatta cttatgttaa aaaaaacata aaacagcatg
481 gtattgtact tttttttttc agacaaaagg tctatggcac acagccaaat cagtgacctc
541 tggggcacaa tttccagaaa tcaacatcct agagttgacc tctggcttcc atgtatacgt
601 gcatgcacac acacatatgc atatacatac aaaattacat atatgcagtt gtctaaatca
661 tatgaagaat ttcaagttgt tttaagttta atatcagcaa atacatgcaa atgtgattat
721 aagaagctgg atggaatcct gagttgttga ctaaagagct aagaaggggc aattataaaa
781 caaaaatgac acatgaaatt ccacccgagg ttagaaataa ttaaagaagg ccattgcggc
841 aagtctagca cagagagtag agggtgctgg aggatgacag acggttggtt ctgaggaggg
901 attttgcaac gaatggagcg aggaagggat cgtttacact tgagaagaac tactgaacag
961 cacgtcccag agattgaggt ccaggtcaaa cgtagaagga cagcctcact gagcaaccaa
1021 gagtgtcact tgtacccacg gcattctcag caacagcaaa ttcctgtggt ggatttccag
1081 gcagaactga gacaggcgtt cttagctgag acaccaagag gtggttaaag cagtattgga
1141 acttcaaggt ggtggaagtc aacaaacaca ggacctatcc actgattgtg aaactttggt
1201 caagcttaca ctgtgttaat aaccctgcat caaaccttta tttattgccc ttccccaagt
1261 tttaaggatc ttgtaatttt agtgttgaca actgctattg tggaacagca atcatgactg
1321 tgggtaagag tagcaagatg ctgcagcata ttgactatag aatgagatgt atcctgcaag
1381 atggaagatt cttcattggc acctttaagg cttttgacaa gcatatgaat ttgatcctct
1441 gtgattgtga tgagttcagg aagatcaagc caaagaatgc aaaacagcca gaacgtgaag
1501 aaaaacgggt tttgggtctg gtcttgctac gtggagagaa cttggtttcc atgacagtgg
1561 agggtccacc tcctaaagat actggcattg ctcgtgtgcc acttgctagt gctgcaggtg
1621 gccctggtgt tggaagagca gctggcagag gagtaccagc aggtgtacct attccccaag
1681 ctcctgctgg attagcaggc cctgtccgag gagtgggagg cccatcccag caggtcatga
1741 ccccacaggg aagaggcact gttgcagctg ctgctgttgc tgctactgct agcattgcag
1801 gagccccaac ccagtacccg ccaggacggg gaactccacc tccacctgta ggcagagcaa
1861 ccccacctcc aggcattatg gctcctccac ctggaatgag accacccatg ggcccaccaa
1921 ttggacttcc ccctgctcaa gggagaccta taggcatgcc ccctccagga atgagactcc
1981 ctcctccagg aattagaggc ccacctcccc caggaatgcg tccaccaaga ccctaagata
2041 cagttgataa atctcagccc ttctctttcc ctacaatgct tcttgtgaaa ttgtgtcgcc
2101 tgcaagcttt tgacccctct tactgcatta actatagata ataaatacat agcgcaattg
2161 aattgaaaaa aaaagaaata attaaagaaa gtaagtcaca atgactattt gctattgaca
2221 ttttttttaa atgcccgaat gagagccagt ggagacgata gaaagtccag aagaagctaa
2281 gataatttca aaacacataa tgtcagtaga acgagggaag gtaagaaccc acagaacaca
2341 agaaaccact catgaaactc ctcacacaca ggaagaaaag gaagaatgta atttttaaaa
2401 aaaaagttat agtcaagtta aactatattt tctcattggt ttttttttgt gactttgtat
2461 ttatttttat gtttctttgt gtatattgta catgtctcag tcaaaggcca acggtgagtg
2521 ttttcctcta aaaaacctta ttgtttaaga cagggtctct tcctgagctc agaattc
//
LOCUS WUCSSP 1323 bp ds-DNA INV 17-JUL-1990
DEFINITION W.bancrofti species specific DNA fragment.
ACCESSION M27140
KEYWORDS .
SOURCE W.bancrofti DNA, clone IWb35.
ORGANISM Wuchereria bancrofti
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 1323)
AUTHORS Dissanayake,S. and Piessens,W.F.
TITLE Cloning and characterization of a Wuchereria bancrofti-specific DNA
sequence
JOURNAL Mol. Biochem. Parasitol. 39, 147-150 (1990)
STANDARD simple staff_entry
BASE COUNT 399 a 204 c 251 g 469 t
ORIGIN 1288 bp upstream of SacI site.
1 gatctctgtt tcattatacc gagtaaatat tggagaaaag aaaaatttgt tcaacgtgtt
61 aaagattaac ttgctttcta tataatggaa acattttgca tattggatta gtcagtaaat
121 taataatgga caattgtgat aagtaaaact aaaaagacat cgtcactctc ttccttatta
181 tagcatttcc ttgcttaaaa ccacttgcga cgtcactttt tgttataaat catatggtga
241 atacttttcc tcatttaaga tcgtttatta gcttttgcat tacaaattgt tcattttagt
301 tgtgaacgca ttttgtacat ttaaatgctt gctttagaat tttaggtttc aactggtacg
361 tttatgccgt ttatatgaaa ttatgggata acaaagaaaa ataaagataa agaagtaaaa
421 attcgaatga ttaaatgaat tattagtacc ctgattgcta tagccctttt ctacgttttg
481 gcaagaagtc ccaaattggt tctcactttt cagaatgaaa atttttagtt gtttatagcg
541 ccaaaagaaa tgattaacag cagtttggct ttgtggacgg aatgatatgc ttttctgcat
601 acctttcata aattggaaaa aacaaaataa tttggctaag agtgaatgga gtattcgttc
661 gtttgtgata ttttcaatgt ttgttgatgt atattcgaag cgtctctgct cactactgtc
721 aaaccctttt taagaacgtt gcttctacgg tcactgggca gctactacgt attgagtgag
781 cgatatgaaa agaatataca gtatctaatg actgccaatg tcaaataaat ttttgtatcg
841 tcactcagcg gtcacaaatg tttcataaat atttcacatg cattctattt taggttcaaa
901 tatgctttta aaattctgct aaatttgcaa actaacgaga ttttgtttgg cagctcttct
961 tatgataacg cagttcaatc ctggtggtga agaatttgcc acagtcttcg cattttggat
1021 gaggttcatg cgtttgtttg tgtttgtgaa atgttgattt atggtcgaat gtccgcccac
1081 aaccgggtac tttgcattca tagatgaaag gctgaccgtg tgtttcctat gtgttatata
1141 ttcgttgtaa ttgttcgtga tcaataggaa acaactggca ggatggcaga ttttaataca
1201 accatatcaa taattatatt aaatgtaaat gttctagctg ggtagagtgg cgtgcatctg
1261 tagtctcggc cacttggaag actgagctca ggaagattac ttgcacccag gagcttgagg
1321 agc
//
LOCUS YSCHXT2 2890 bp ds-DNA PLN 17-JUL-1990
DEFINITION S.cerevisiae high affinity hexose transporter-2 (HXT2) gene,
complete cds.
ACCESSION M33270
KEYWORDS high affinity hexose transporter-2.
SOURCE S.cerevisiae (isogenic strain to S288C) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2890)
AUTHORS Kruckeberg,A.L. and Bisson,L.F.
TITLE The HXT2 gene of Saccharomyces cerevisiae is required for high
affinity glucose transport
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.L.Kruckeberg, 26-MAR-1990.
FEATURES from to/span description
pept 818 2443 high affinity hexose transporter-2
BASE COUNT 808 a 571 c 535 g 976 t
ORIGIN
1 aaaaagaaat attattcatt actatcaaga taccgtagaa aagaaaaaga accggggatg
61 aataataaca aaacgggctg ctttttcttt ttctctttct ttttcatttg gtccctctcc
121 actctttctc cacgtggctt tgcttcccgt atttttcttc gtcagagaga ctacatgata
181 gtccaaagaa aagaaacagg ggggacgaag aagaggagag gaaaaaccaa aatataattt
241 tccgtgaaat agattctttt tctccactgc acgacttctt ctcctcccac aaaaaatgac
301 gcctcataga cagccccgca gcttcacttt taagtttctt tttctcctca cggcgcaacc
361 gctaacttaa gctaatcctt atgaatccgg agaaaagcgg ggtcttttaa ctcaataaaa
421 ttttccgaaa tcctttttcc tacgcgtttt cttcgggaac tagataggtg gctcttccac
481 ctgtttttcc atcattttag tttttcgcaa gccatgcgtg ccttttcgtt tttgcgatgg
541 cgaacgaggg ctggaaaaat taacggtacg ccgcctaacg atagtaatag gccacgcaac
601 tggcgtggac gacaacaata agtcgcccat tttttatgtt ttcaaaacct agcaaccccc
661 accaaacttg tcatcgttcc cggattcaca aatgatataa aaagcgatta caattctaca
721 ttctaaccag atttgagatt tcctctttct caattcctct tatattagat tataagaaca
781 acaaattaaa ttacaaaaag acttataaag caacataatg tctgaattcg ctactagccg
841 cgttgaaagt ggctctcaac aaacttctat ccactctact ccgatagtgc agaaattaga
901 gacggatgaa tctcctattc aaaccaaatc tgaatacact aacgctgaac tcccagcaaa
961 gccaatcgcc gcatattgga ctgttatctg tttatgtcta atgattgcat ttggtgggtt
1021 tgtctttggt tgggatactg gtaccatctc tggttttgtt aatcaaaccg atttcaaaag
1081 aagatttggt caaatgaaat ctgatggtac ctattatctt tcggacgtcc ggactggttt
1141 gatcgttggt atcttcaata ttggttgtgc ctttggtggg ttaaccttag gacgtctggg
1201 tgatatgtat ggacgtagaa ttggtttgat gtgcgtcgtt ctggtataca tcgttggtat
1261 tgtgattcaa attgcttcta gtgacaaatg gtaccaatat ttcattggta gaattatctc
1321 tggtatgggt gtcggtggta ttgctgtcct atctccaact ttgatttccg aaacagcacc
1381 aaaacacatt agaggtacct gtgtttcttt ctatcagtta atgatcactc taggtatttt
1441 cttaggttac tgtaccaact atggtactaa agactactcc aattcagttc aatggagagt
1501 gcctttgggt ttgaactttg ccttcgctat tttcatgatc gctggtatgc taatggttcc
1561 agaatctcca agattcttag tcgaaaaagg cagatacgaa gacgctaaac gttctttggc
1621 aaaatctaac aaagtcacca ttgaagatcc aagtattgtt gctgaaatgg atacaattat
1681 ggccaacgtt gaaactgaaa gattagccgg taacgcttct tggggtgagt tattctccaa
1741 caaaggtgct attttacctc gtgtgattat gggtattatg attcaatcct tacaacaatt
1801 aactggtaac aattacttct tctattatgg tactactatt ttcaacgccg tcggtatgaa
1861 agattctttc caaacttcca tcgttttagg tatagtcaac ttcgcatcca ctttcgtggc
1921 cttatacact gttgataaat ttggtcgtcg taagtgtcta ttgggtggtt ctgcttccat
1981 ggccatttgt tttgttatct tctctactgt cggtgtcaca agcttatatc caaatggtaa
2041 agatcaacca tcttccaagg ctgccggtaa cgtcatgatt gtctttacct gtttattcat
2101 tttcttcttc gctattagtt gggccccaat tgcctacgtt attgttgccg aatcctatcc
2161 tttgcgtgtc aaaaatcgtg ctatggctat tgctgttggt gccaactgga tttggggttt
2221 cttgattggt ttcttcactc ccttcattac aagtgcaatt ggattttcat acgggtatgt
2281 cttcatgggc tgtttggtat tttcattctt ctacgtgttt ttctttgtct gtgaaaccaa
2341 gggcttaaca ttagaggaag ttaatgaaat gtatgttgaa ggtgtcaaac catggaaatc
2401 tggtagctgg atctcaaaag aaaaaagagt ttccgaggaa taagagatta tacttaaact
2461 agcactgatt tttttaaggc taatggctac taatacttta atagatgatc ttcatacttt
2521 tttatttaac gatttttaat gatgttttta tttgtaccac tcatttatct agattttttt
2581 aatactgatc aaatcttacg gactcgacgt taaaaagttc ctacatacgt ctggtacttg
2641 aaacgctgct tcgaggtatt gacactataa gaatacgatc caaatactta caccgcatgt
2701 aaaaatatgc cgacaatatg aatacttgtt gatgaatgat atttgatttt aatccggcaa
2761 tttacctcct ttatataatc caataattgt tgataattag tggttaggtt gcagtactaa
2821 taagaattaa gacaaatatt cttctactat ataaaaggtg caaacaaaac acacgccgat
2881 cggccatact
//
LOCUS CHKARCASR 2392 bp ss-mRNA VRT 17-JUL-1990
DEFINITION Chicken 5-aminoimidazole ribonucleotide (AIR) carboxylase-5-
aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR)
synthetase, complete cds.
ACCESSION M31764
KEYWORDS 5-aminoimidazole ribonucleotide carboxylase;
5-aminoimidazole-4-N-succinocarboxamide ribonucleotide synthetase.
SOURCE Chicken liver, cDNA to mRNA, clone pZD1.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 2392)
AUTHORS Chen,Z., Dixon,J.E. and Zalkin,H.
TITLE Cloning of a chicken liver cDNA encoding 5-aminoimidazole
ribonucleotide carboxylase and
5-aminoimidazole-4-n-succinocarboxamide ribonucleotide synthetase
by functional complementation of Escherichia coli pur mutants
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3097-3101 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Zalkin, 01-FEB-1990.
FEATURES from to/span description
pept 59 1339 5-aminoimidazole ribonucleotide (AIR)
carboxylase-5-
aminoimidazole-4-N-succinocarboxamide
ribonucleotide (SAICAR) synthetase
matp 59 835 SAICAR synthetase domain
matp 836 1336 AIR carboxylase domain
site 2341 2347 poly-A signal
BASE COUNT 696 a 466 c 527 g 702 t 1 others
ORIGIN
1 ggcgctcagt ttctttcctg tgccttaggt gcctgtcgcc gccgccgctg tcgccgccat
61 ggcccccgcc gcctcagaac tgaaacttgg taaaaaagtt aatgagggta aaacgaaaga
121 agtgtatgag ctgccggata tcccgggatg cgttctgatg cagtccaagg accaaataac
181 agcggggaat gccgctagga aggaccggat ggaggggaag gctgccatct ccaacaccac
241 caccagctgc gtgttccagc tgctgcagga agcaggaatc aaaacggctt ttgtcaggaa
301 acagagtgac acagctttca tagcagctca ctgtgaaatg atcccaattg aatgggtctg
361 cagaagaatt gctactggct ctttcctcaa aagaaaccct ggtgtcaaag aaggctataa
421 gttttaccca cctaaaattg agatgtttta caaggatgat gccaataatg atccacagtg
481 gtctgaggag cagctaattg aagcaaaatt ctcttttgct ggacttacta ttggcaagac
541 tgaagtggat attatggctc gttctactca agctattttt gagatcctgg aaaaatcatg
601 gcagccccaa aactgcactc tggtggacct gaagattgaa tttggtgtta atattttgac
661 caaagaaatt gttcttgctg atgttattga taatgattca tggagactgt ggccatcggg
721 agacagaagc cagcagaagg acaaacagtc ctatcgagat ctgaaggaag tgactcctga
781 agcattgcag atggttaaga gaaactttga atgggttgca gaaagagtag agttgcttct
841 gaaaacaaag agccaaggta gagttgtggt attgatggga tctacttctg accttggcca
901 ctgtgagaaa ataaagaagg catgtgcaac ctttggaatt ccttgtgagt taagagtaac
961 ctccgctcac aaagggccag atgaaactct gaggatcaaa gcagaatatg aaggagatgg
1021 aatcccgact gtgtttgttg cagtagctgg cagaagcaat ggtttagggc cagtaatgtc
1081 tggtaacact gcttaccctg ttgtcaactg tcctcccctc tcatctgact ggggtgctca
1141 ggatgtgtgg tcctctctca gactgcccag tggtcttggc tgtcctacta ctctgtcacc
1201 tgaaggagct gctcagtttg ctgcccagat ttttggttta aacaaccact tggtatgggc
1261 caaactgcga tcaaacatgt taaatacatg gatctctttg aagcaggctg acaaaaaatt
1321 gcgggagtgc accttgtaag tccaaccaac aagtaactcc tcctcgctac acaaagatag
1381 tggcgtgcat atgcatttgt atcaggattt gcagttggat gatgtgagac atcttttctt
1441 acagaaggaa gcatcttcag agcagcagct gagtctgcta tgtcctcatt ccttttcttg
1501 tgtatgtttt tggttttttt agtatagcca cttagtaaaa gatgagaagg taatatttta
1561 gttatccttc tgttgatcca aaatagtgct gttctgctct ttagtgtata ctgttaacta
1621 ctgatgatgg ttatgtcaac acaatgattg atttggtgtg atttaaatag aaaccccctt
1681 ttttttcccc ctgtaacagt aacattcttc catagtaata atctatgtcc tgaagcttat
1741 taactcaatg ggactgctaa ctaaaactag tgctgatgta gtgctgtaat aatggtatgc
1801 tttcttttag caataaaatg gcaacatatg ctgtattaat aagaaaatag gctaagaatg
1861 agtaaaagcc tgaaagaaat aactgcctta tttaaaagaa aaacatatat tttttcatgt
1921 gtatgtgtgt atgtatatac accccatcag taactttacc acatcatttg agtgagttaa
1981 cagttctctt gtttgtgaga agctctaatt ttgaatggaa ccataactag gcttgttgcc
2041 tacttgctgt ataactcttg actggtccag tcttttgtcc ttcaaatagt agacagatct
2101 actgtgtatg tatgtgatgc tgtcaaatta tccttataca catcaatgta tctatattcc
2161 ctcactctgt ttctatggga attaaactga actgaaggtg ggaataaaca gtaaagcaac
2221 agaatacacg ggcaagccta agcatgactg tcttgntctt tgcacacaac attctttaaa
2281 gtaggtactg attccaagaa atatgcaatt actcaatgta tgtacaattt atgcttttat
2341 aattaaataa actatttaat gcaaggaatt cgatatcaag cttatcgata cc
//
LOCUS ECORAFRO 1289 bp ds-DNA BCT 17-JUL-1990
DEFINITION E.coli raffinose repressor (rafR) gene, complete cds, and
alpha-galactosidase (rafA) gene, 5' end.
ACCESSION M29849
KEYWORDS alpha-galactosidase; raffinose repressor.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1289)
AUTHORS Aslanidis,C. and Schmitt,R.
TITLE Regulatory elements of the raffinose operon: Nucleotide sequences
of operator and repressor genes
JOURNAL J. Bacteriol. 172, 2178-2180 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Schmitt, 13-NOV-1989.
FEATURES from to/span description
pept 166 1176 raffinose repressor
pept 1270 > 1289 alpha-galactosidase
binding 156 161 ribosome binding site
binding 1258 1263 ribosome binding site
signal 120 126 -35 region
signal 144 149 -10 region
signal 1212 1216 -35 region
signal 1235 1240 -10 region
site 1196 1213 raf operator
site 1217 1234 raf operator
BASE COUNT 317 a 283 c 326 g 363 t
ORIGIN
1 aagcttatta tgcttccatc ggaaacaatg atctggcagc ctgagttcac agataaaata
61 tctctccagg aaacccgggg ctgttcatca tgcaagtctg tcgattactg gctttggtga
121 cggaattttc tggatttccg gcttagaacc acagcaggag ataatatgtc acttaaagcg
181 attgccacga cactcggtat ttctgtcacc actgtcagtc gggctcttgg aggcttttca
241 gatgtggctg cttctacccg tgagcgcgtg gaagcggaag cacgtcgacg aggttaccgc
301 cctaatacac aggcaagaag actcaaaacc ggtaaaaccg atgctatcgg tctggtttat
361 cctgaaaatg atgtgccgtt taacagcggt gtttttatgg atatggtcag ttgcatcagc
421 agggaacttg cttatcatga tattgactta ctgctgatcg ctgatgatga gcatgcagac
481 tgccacagct atatgcggct tgttgaaagt cgcagaattg atgctcttat cattgcacat
541 actctggatg acgatccccg tatcacacat cttcataaag caggtattcc gtttctggct
601 cttggacggg taccgcaggg cttgccctgt gcgtggtttg actttgataa tcatgccgga
661 acctggcagg caacccagaa gctgattgct ttgggacata agagtattgc gctgttgagc
721 gagaacactt cacattctta tgttattgca agacgtcagg gatggcttga tgcactgcat
781 gagcatggac tgaaagatcc attgttgcgg ctggtttctc ccacgcgacg agcgggctat
841 ctggctgtga tggagttaat gtcattaccg gcgccaccaa cagctattat tactgacaat
901 gacctgagtg gagatggtgc ggctatggcg ctgcagttga gagggcgtct ttcagggaaa
961 gaagctgtat ctctggttgt atatgatggt ttgcctcagg acagcattat tgagctggat
1021 gtggctgctg ttattcagtc aacacgaagt ctcgttggtc gtcagatttc tgacatggtg
1081 tatcagataa tcaatggtgc atcaccagaa tcactgcaga taacctggac accgatattt
1141 taccctggta gcacggttca ttctccttcc ttctgatttt ttatccagat cacacaaccg
1201 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg
1261 agatgtctaa tgatttcaaa gtactgcag
//
LOCUS HSEDILSTD 351 bp ds-DNA VRL 17-JUL-1990
DEFINITION Equine herpesvirus type 1 subfragments from near the left terminus
of STD DNA.
ACCESSION M34145
KEYWORDS defective interfering particle.
SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone Sma2.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 351)
AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
O'Callaghan,D.J.
TITLE Identification of the site of recombination in the generation of
the genome of DI particles of Equine herpesvirus type 1
JOURNAL Virology 175, 448-455 (1990)
STANDARD simple staff_review
BASE COUNT 55 a 117 c 127 g 52 t
ORIGIN
1 cgccgctcac gtcacactcc tccccaaaga gggtattcgg ggggtcgctc ggcggctcca
61 ggaaccgctg ctcgtaccgg tcctcggagt tcagtgcccg cagcgtggcg cgcagcggga
121 agctctgctc gggcagcagc tgcaggaggg actgccagca ctcgagtgtg gggcgcgtgt
181 gcgtgtaccc catggcgtaa aaatccagca gcgccctgcg cagggtgcgc gagtcctcgg
241 tcacgtggat cgaggaggtg gatatccccc tgaacaagcg gttgacgtcc cccgcgaggc
301 ggttcacatc ggggcgccac ggcggggcgc agaacgcccc tggcccccgg g
//
LOCUS HSEDISA 633 bp ds-DNA VRL 17-JUL-1990
DEFINITION Equine herpesvirus type 1 defective interfering particle DNA.
ACCESSION M34143
KEYWORDS defective interfering particle.
SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone pHSII.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 633)
AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
O'Callaghan,D.J.
TITLE Identification of the site of recombination in the generation of
the genome of DI particles of Equine herpesvirus type 1
JOURNAL Virology 175, 448-455 (1990)
STANDARD simple staff_entry
BASE COUNT 124 a 184 c 202 g 123 t
ORIGIN
1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac
61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa
121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt
181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa
241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc acgtcacact
301 cctccccaaa gagggtattc ggggggtcgc tcggcggctc caggaaccgc tgctcgtacc
361 ggtcctcgga gttcagtgcc cgcagcgtgg cgcgcagcgg gaagctctgc tcgggcagca
421 gctgcaggag ggactgccag cactcgagtg tggggcgcgt gtgcgtgtac cccatggcgt
481 aaaaatccag cagcgccctg cgcagggtgc gcgagtcctc ggtcacgtgg atcgaggagg
541 tggatatccc cctgaacaag cggttgacgt cccccgcgag gcggttcaca tcggggcgcc
601 acggcggggc gcagaacgcc cctggccccc ggg
//
LOCUS HSEDISTD 290 bp ds-DNA VRL 17-JUL-1990
DEFINITION Equine herpesvirus type 1 subfragments from the inverted repeat
of STD DNA.
ACCESSION M34144
KEYWORDS defective interfering particle.
SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone xbaG..
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 290)
AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
O'Callaghan,D.J.
TITLE Identification of the site of recombination in the generation of
the genome of DI particles of Equine herpesvirus type 1
JOURNAL Virology 175, 448-455 (1990)
STANDARD simple staff_review
BASE COUNT 69 a 72 c 77 g 72 t
ORIGIN
1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac
61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa
121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt
181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa
241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc
//
LOCUS HUMAR 1230 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human amphiregulin (AR) mRNA, complete cds.
ACCESSION M30704
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SOURCE Human cell line MCF-7 breast carcinoma, cDNA to mRNA, clones
lambda-AR1 and lambda-AR2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1230)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept 210 968 amphiregulin precursor
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
sigp 210 266 amphiregulin signal peptide
matp 510 962 amphiregulin (alt.)
matp 528 962 amphiregulin (alt.)
mRNA 1 1230 amphiregulin mRNA
BASE COUNT 375 a 273 c 285 g 297 t
ORIGIN
1 agacgttcgc acacctgggt gccagcgccc cagaggtccc gggacagccc gaggcgccgc
61 gcccgccgcc ccgagctccc caagccttcg agagcggcgc acactcccgg tctccactcg
121 ctcttccaac acccgctcgt tttgcggcag ctcgtgtccc agagaccgag ttgccccaga
181 gaccgagacg ccgccgctgc gaaggaccaa tgagagcccc gctgctaccg ccggcgccgg
241 tggtgctgtc gctcttgata ctcggctcag gccattatgc tgctggattg gacctcaatg
301 acacctactc tgggaagcgt gaaccatttt ctggggacca cagtgctgat ggatttgagg
361 ttacctcaag aagtgagatg tcttcaggga gtgagatttc ccctgtgagt gaaatgcctt
421 ctagtagtga accgtcctcg ggagccgact atgactactc agaagagtat gataacgaac
481 cacaaatacc tggctatatt gtcgatgatt cagtcagagt tgaacaggta gttaagcccc
541 cccaaaacaa gacggaaagt gaaaatactt cagataaacc caaaagaaag aaaaagggag
601 gcaaaaatgg aaaaaataga agaaacagaa agaagaaaaa tccatgtaat gcagaatttc
661 aaaatttctg cattcacgga gaatgcaaat atatagagca cctggaagca gtaacatgca
721 aatgtcagca agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca
781 tgattgacag tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg
841 tgatcctcac agctgttgct gttattacag tccagcttag aagacaatac gtcaggaaat
901 atgaaggaga agctgaggaa cgaaagaaac ttcgacaaga gaatggaaat gtacatgcta
961 tagcataact gaagataaaa ttacaggata tcacattgga gtcactgcca agtcatagcc
1021 ataaatgatg agtcggtcct ctttccagtg gatcataaga caatggaccc tttttgttat
1081 gatggtttta aactttcaat tgtcactttt tatgctattt ctgtatataa aggtgcacga
1141 aggtaaaaag tattttttca agttgtaaat aatttattta atatttaatg gaagtgtatt
1201 tattttacag ctcattaaac ttttttaacc
//
LOCUS HUMAR1 968 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human amphiregulin (AR) gene, exon 1.
ACCESSION M30698
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 1 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 968)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept 859 + 919 amphiregulin precursor, exon 1
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
sigp 859 915 amphiregulin signal peptide
pre-msg 649 > 968 amphiregulin mRNA and introns
IVS 920 > 968 amphiregulin intron A
BASE COUNT 206 a 280 c 257 g 225 t
ORIGIN
1 gaattcatat ccacctggct ttgaacatta tcggctgtga gatggtgtag gtaaaatttt
61 aagtgcataa tttggcaata ataaatcatc aataaatatt aatgttgatg aggcccctgg
121 gccacataaa gaaataggga gtgaggggat ttgaaattct ggccacttca cagaaatggg
181 tgggaagggg ctcttgattg agatagaagc ccatcctaca tgaagcaatt cctcattgag
241 ttctctcgtc ctttatcctt gttggaaaca tcaggcaaag tcactcttgg tcttaaagta
301 cttttacatc taaatacgga actcttctat ttaatccctg tctgttgtag atgttaagta
361 tacaaagagg ttgtcagagt ttgaaacatc tggacttctg tcaggtacta gctccggaac
421 tccagtcctg ctcgccctca aaaacggctt gcagctagag gtttaagttc cacttcctct
481 cagcgaatcc ttacgcacga gggaggcggg gcgtgtgtcc tccgcgcgtg gttttcgggt
541 agcaccttct ggggcgccgc ctgcctccac ccacggccgg gccttgacgt catgggctgc
601 ggccccctcc cggctgagcc tataaagcgg caggtgcgcg ccgccctaca gacgttcgca
661 cacctgggtg ccagcgcccc agaggtcccg ggacagcccg aggcgccgcg cccgccgccc
721 cgagctcccc aagccttcga gagcggcgca cactcccggt ctccactcgc tcttccaaca
781 cccgctcgtt ttgcggcagc tcgtgtccca gagaccgagt tgccccagag accgagacgc
841 cgccgctgcg aaggaccaat gagagccccg ctgctaccgc cggcgccggt ggtgctgtcg
901 ctcttgatac tcggctcagg tgaggattca acggcgctga actgctgggc tctcctccca
961 tggcaggt
//
LOCUS HUMAR2 348 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human amphiregulin (AR) gene, exon 2.
ACCESSION M30699
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 2 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept + 51 + 299 amphiregulin precursor, exon 2
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
matp 290 + 299 amphiregulin
pre-msg < 1 > 348 amphiregulin mRNA and introns
IVS < 1 50 amphiregulin intron A
IVS 300 > 348 amphiregulin intron B
BASE COUNT 88 a 75 c 86 g 99 t
ORIGIN
1 agcaccctac tttacctttt cgttttcttc ctttattccc tcccctgcag gccattatgc
61 tgctggattg gacctcaatg acacctactc tgggaagcgt gaaccatttt ctggggacca
121 cagtgctgat ggatttgagg ttacctcaag aagtgagatg tcttcaggga gtgagatttc
181 ccctgtgagt gaaatgcctt ctagtagtga accgtcctcg ggagccgact atgactactc
241 agaagagtat gataacgaac cacaaatacc tggctatatt gtcgatgatt cagtcagagg
301 tgagtagggg ataaagcaaa aatatggcct gtgagatgtg ggtttata
//
LOCUS HUMAR3 302 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human amphiregulin (AR) gene, exon 3.
ACCESSION M30700
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 3 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 302)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept + 52 + 252 amphiregulin precursor, exon 3
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
matp + 52 + 252 amphiregulin
matp 60 + 252 amphiregulin (alt.)
pre-msg < 1 > 302 amphiregulin mRNA and introns
IVS < 1 51 amphiregulin intron B
IVS 253 > 302 amphiregulin intron C
BASE COUNT 125 a 46 c 54 g 77 t
ORIGIN
1 aattatattc aagtttgaga gactcttgtc aataaatctt ttctttttta gttgaacagg
61 tagttaagcc cccccaaaac aagacggaaa gtgaaaatac ttcagataaa cccaaaagaa
121 agaaaaaggg aggcaaaaat ggaaaaaata gaagaaacag aaagaagaaa aatccatgta
181 atgcagaatt tcaaaatttc tgcattcacg gagaatgcaa atatatagag cacctggaag
241 cagtaacatg cagtaagttt tcctaaagca tatagatttt tgtatttcta gcaccatgtc
301 tg
//
LOCUS HUMAR4 254 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human amphiregulin (AR) gene, exon 4.
ACCESSION M30701
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 4 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 254)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept + 51 + 204 amphiregulin precursor, exon 4
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
matp + 51 91 amphiregulin (both alt.)
pre-msg < 1 > 254 amphiregulin mRNA and introns
IVS < 1 50 amphiregulin intron C
IVS 205 > 254 amphiregulin intron D
BASE COUNT 81 a 42 c 53 g 78 t
ORIGIN
1 cacaccgcac gtgagtgtga ttataatttt taaatgtgaa ttgcttgcag aatgtcagca
61 agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca tgattgacag
121 tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg tgatcctcac
181 agctgttgct gttattacag tccagtaagt atgacataac ttacaaattc ttaataaaat
241 aatgggaggt taat
//
LOCUS HUMAR5 217 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human amphiregulin (AR) gene, exon 5.
ACCESSION M30702
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 5 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 217)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pept + 50 143 amphiregulin precursor, exon 5
/hgml_locus_uid="LW0168B" /nomgen="AREG"
/map="4q13-q21"
pre-msg < 1 > 217 amphiregulin mRNA and introns
IVS < 1 49 amphiregulin intron D
BASE COUNT 89 a 26 c 41 g 61 t
ORIGIN
1 tatagatgaa tagaaccttg ataacattag aatgccttgt tctctgaagg cttagaagac
61 aatacgtcag gaaatatgaa ggagaagctg aggaacgaaa gaaacttcga caagagaatg
121 gaaatgtaca tgctatagca taactgaaga taaaattaca ggtttgagtt ttaaaatata
181 tctttagatc atatcctata attttgaaaa atttaac
//
LOCUS HUMAR6 354 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human amphiregulin (AR) gene, 3' flanking region.
ACCESSION M30703
KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT 6 of 6
SOURCE Human breast carcinoma cell line MCF-7 DNA, clones
lambda-ARH[6,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
Disteche,C.M., Todaro,G.J. and Shoyab,M.
TITLE The amphiregulin gene encodes a novel epidermal growth factor-
related protein with tumor-inhibitory activity
JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.Plowman, 18-DEC-1989.
FEATURES from to/span description
pre-msg < 1 294 amphiregulon mRNA and intron
BASE COUNT 104 a 48 c 52 g 150 t
ORIGIN
1 gtaacatttt gttttatttt attattttat tttattttat tttctcacag gatatcacat
61 tggagtcact gccaagtcat agccataaat gatgagtcgg tcctctttcc agtggatcat
121 aagacaatgg accctttttg ttatgatggt tttaaacttt caattgtcac tttttatgct
181 atttctgtat ataaaggtgc acgaaggtaa aaagtatttt ttcaagttgt aaataattta
241 tttaatattt aatggaagtg tatttatttt acagctcatt aaactttttt aaccaaacaa
301 attgagagtt tgaatattag ttctgatatt gcaagactcc agtgtacttt tctc
//
LOCUS HUMROSSAA 1890 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human Ro ribonucleoprotein autoantigen (Ro/SS-A), complete cds.
ACCESSION M32294
KEYWORDS Ro ribonucleoprotein autoantigen.
SOURCE Human T-cell line Jurkat, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1890)
AUTHORS McCauliffe,D.P., Lux,F.A., Lieu,T.-S., Sanz,I., Hanke,J.,
Newkirk,M.M., Bachinski,L.L., Itoh,Y., Siciliano,M.J., Reichlin,M.,
Sontheimer,R.D. and Capra,J.D.
TITLE Molecular cloning, expression and Chromosome 19 localization of a
human Ro/SS-A auto antigen
JOURNAL J. Clin. Invest. 85, 1379-1391 (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
D.P.McCauliffe, 23-FEB-1990.
FEATURES from to/span description
pept 67 1320 Ro ribonucleoprotein autoantigen (Ro/SS-A)
precursor
sigp 67 117 Ro ribonucleoprotein autoantigen signal
peptide
matp 118 1317 Ro ribonucleoprotein autoantigen
site 1881 1886 poly-A signal (put.)
BASE COUNT 464 a 497 c 520 g 409 t
ORIGIN Chromosome 19.
1 ccgtactgca gagccgctgc cggagggtcg ttttaaaggg ccgcgttgcc gccccctcgg
61 cccgccatgc tgctatccgt gccgctgctg ctcggcctcc tcggcctggc cgtcgccgag
121 cccgccgtct acttcaagga gcagtttctg gacggagacg ggtggacttc ccgctggatc
181 gaatccaaac acaagtcaga ttttggcaaa ttcgttctca gttccggcaa gttctacggt
241 gacgaggaga aagataaagg tttgcagaca agccaggatg cacgctttta tgctctgtcg
301 gccagtttcg agcctttcag caacaaaggc cagacgctgg tggtgcagtt cacggtgaaa
361 catgagcaga acatcgactg tgggggcggc tatgtgaagc tgtttcctaa tagtttggac
421 cagacagaca tgcacggaga ctcagaatac aacatcatgt ttggtcccga catctgtggc
481 cctggcacca agaaggttca tgtcatcttc aactacaagg gcaagaacgt gctgatcaac
541 aaggacatcc gttgcaagga tgatgagttt acacacctgt acacactgat tgtgcggcca
601 gacaacacct atgaggtgaa gattgacaac agccaggtgg agtccggctc cttggaagac
661 gattgggact tcctgccacc caagaagata aaggatcctg atgcttcaaa accggaagac
721 tgggatgagc gggccaagat cgatgatccc acagactcca agcctgagga ctgggacaag
781 cccgagcata tccctgaccc tgatgctaag aagcccgagg actgggatga agagatggac
841 ggagagtggg aacccccagt gattcagaac cctgagtaca agggtgagtg gaagccccgg
901 cagatcgaca acccagatta caagggcact tggatccacc cagaaattga caaccccgag
961 tattctcccg atcccagtat ctatgcctat gataactttg gcgtgctggg cctggacctc
1021 tggcaggtca agtctggcac catctttgac aacttcctca tcaccaacga tgaggcatac
1081 gctgaggagt ttggcaacga gacgtggggc gtaacaaagg cagcagagaa acaaatgaag
1141 gacaaacagg acgaggagca gaggcttaag gaggaggaag aagacaagaa acgcaaagag
1201 gaggaggagg cagaggacaa ggaggatgat gaggacaaag atgaggatga ggaggatgag
1261 gaggacaagg aggaagatga ggaggaagat gtccccggcc aggccaagga cgagctgtag
1321 agaggcctgc ctccagggct ggactgaggc ctgagcgctc ctgccgcaga gcttgccgcg
1381 ccaaataatg tctctgtgag actcgagaac tttcattttt ttccaggctg gttcggattt
1441 ggggtggatt ttggttttgt tcccctcctc cactctcccc caccccctcc ccgccctttt
1501 tttttttttt tttaaactgg tattttatcc tttgattctc cttcagccct cacccctggt
1561 tctcatcttt cttgatcaac atcttttctt gcctctgtgc cccttctctc atctcttagc
1621 tcccctccaa cctggggggc agtggtgtgg agaagccaca ggcctgagat ttcatctgct
1681 ctccttcctg gagcccagag gagggcagca gaagggggtg gtgtctccaa ccccccagca
1741 ctgaggaaga acggggctct tctcatttca cccctccctt tctcccctgc ccccaggact
1801 gggccacttc tgggtggggc agtgggtccc agattggctc acactgagaa tgtaagaact
1861 acaaacaaaa tttctattaa attaaatttt
//
LOCUS PRD1XII 1701 bp ds-DNA PHG 17-JUL-1990
DEFINITION Bacteriophage PRD1 early protein genes XII and XIX, complete cds.
ACCESSION M30146
KEYWORDS early protein XII; early protein XIX.
SOURCE Bacteriophage PRD1 (Viradea strain) DNA.
ORGANISM Bacteriophage PRD1
Viridae; ds-DNA nonenveloped viruses; Tectiviridae.
REFERENCE 1 (bases 1 to 1701)
AUTHORS Gerendasy,D. and Ito,J.
TITLE Nucleotide sequence and transcription of the right early region of
bacteriophage PRD1
JOURNAL J. Bacteriol. 172, 1889-1898 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Gerendasy, 27-NOV-1989.
FEATURES from to/span description
pept 242 724 early protein XII
pept 797 955 early protein XIX (put.)
mRNA 179 732 early protein XII mRNA
mRNA 181 1100 early protein XIX mRNA
binding 230 233 ribosome binding site
binding 785 788 ribosome binding site
rpt 1 111 inverted terminal repeat
site 145 172 right early promoter
BASE COUNT 419 a 447 c 390 g 445 t
ORIGIN
1 gggggatacg tgcccctccc cacctacccg cgcccctaac atttttattt ccgtctgtca
61 ataccccctg catccgatag gcccgaacta tcacaaacgg aaaagcgata ggaataaaat
121 tttgatttat tttcaccata actattgcac gtcctaaaaa tgtgcgtata atttcaatca
181 tcgaaagggc aaacacaaaa cccccgccga taacttccac ttaactttaa aggtaactat
241 catggaaatc gtaagcaagc tgactctgaa aaccattggc gctcaaccca agccacatag
301 cgtaaaagaa aataccgcgc tggcttccat ctatggccgc gttcgcggta agaaagttgg
361 tcaatccacc tttggcgact tcatcaagtt tgaaggtgaa tttgaaggcg tgaatatcgc
421 cactggtgaa gtgttccgtt ccggtgcgct gattctgccg aaggtactgg aaagcctgct
481 tgccggtgcc gtggatggtg aaaacacggt tgattttgcg gttgaaattt gggccaagcc
541 ttccgaaaag ggcaacactg gttatgaata cggtgtcaag ccgctgattg aacccgccgc
601 atcggatgaa ctggccgcgc ttcgcaatca ggttaaggcc gcgctgcctg cccctgccgc
661 tgccggtgaa gccgctgccg aagccaagcc cgccgccaag gccaaggcca aggccgaagc
721 ctaaacagcg caccacggcc ccggctgata ccagcgcggg gccattcttt acccttcaaa
781 ttcaaaggta acaaacatgg aaaagcaaac cgaaaatacc cgccctgaat gcccgaaggc
841 attttatttt gtatccattc ccggcgactt tgggcaaacc ccttttgctt catcgcttat
901 gtatggtagc accgcgcttg ccgctgttat caagtcaagg gcgctatccg cgtagtaagc
961 gaacattttg atttgcggtt tgctgataac ggttttacgc ctgctggcgt tacgcaagct
1021 gaatggcttg gcaagctgat tactgaaact ttcggctttc gcttggaact gtttctttaa
1081 accagttttt tatttcttga atttgaaggc cgcatgattc ccttgcggcc ttttcttttg
1141 cgtatgcttc caccaaatcc cgccacgtta taaacggata aacaggcgcg gcgcattcag
1201 ttagcaggct tgacggcgga accaaataaa ccgtttcaat tttcggggct gttgtcgaac
1261 aagccgcgaa catcggcagg cataggacta tcaagaattt gtttagtttc atcgcttacc
1321 cttcccaatc cgctaattgt tcggctaagt ttgccgctga taattgtctg gccttcctga
1381 attgccgcga ttgctttcgc atcgcttgcg cgtactgatt gaagggtttc caccgccttg
1441 gcgtttgcgg tcgccttttc ttcaatggct ttaatttcat tttgaagccc cgttattttt
1501 ttagccgcta aaattgaata tccaaagtta gcaagcatta aaagcgcaac cgcaatagcg
1561 gccacgcttt tcaggttaga caaaacccat ttaatcattg aaggtacgcc ccggcttgtc
1621 attgtcggaa tctatcggcc ccggctgatt gataggggca atcttgcttt gcacaaattg
1681 cgagaaaacc gccttgacaa a
//
LOCUS PSEALALPQ 3141 bp ds-DNA BCT 17-JUL-1990
DEFINITION P.aeruginosa exopolysaccharide alginate regulatory protein (algP
and algQ) genes, complete cds.
ACCESSION M32077
KEYWORDS alginate regulatory protein P; alginate regulatory protein Q.
SOURCE P.aeruginosa (strain PAO, isolate PA02003) DNA, from patient
PA02003.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 3141)
AUTHORS Konyecsni,W.M. and Deretic,V.
TITLE DNA sequence and expression analysis of algP and algQ, components
of the multigene system transcriptionally regulating mucoidy in
Pseudomonas aeruginosa: algP contains multiple direct repeats
JOURNAL J. Bacteriol. 172, 2511-2520 (1990)
STANDARD full staff_review
COMMENT draft entry and computer-readable sequence [1] kindly submitted by
V.Deretic, 16-FEB-1990.
FEATURES from to/span description
pept 781 1263 alginate regulatory protein Q
pept 2079 3137 alginate regulatory protein P (put.)
mRNA 661 1310 algQ mRNA
mRNA 2054 3141 algP mRNA
BASE COUNT 607 a 1026 c 1009 g 499 t
ORIGIN 1 bp upstream of BamHI site; map position 9 min.
1 ggatccgcca tgcccctggc cagcccccgt cagcttttcc ttctcgcgtt cctggcctgc
61 gtcgccatca tgggcggggc gctgtacctg gaacatgtgg ttggcctgga ggcctgcccg
121 ctgtgcgtcg tgcagcggat cttcttcatc ctgatcggcc tgacctgcct tgctggcgcg
181 atccaggggc ccggcctgcg tgggcggcgt atctactccg tgctggtgtt cctgctcgct
241 ctcggcggcg gggccacggc cgcccgccag gtatggttgc agaccgttcc gctggaccaa
301 ctgccggcct gcctgcccag cctcgactac atgatgcagg cgcttccctt ccaggaagtg
361 atccgcctgg tcctgcatgg caccgcggat tgtgcccagg tgagctggac gctattcacc
421 ctgagcattc ccgaatggag cctgctggcg ttcgttgcct atctcggctt ctccatcgtg
481 cagttcctcc gacgtgcctg atgcgggctg tcgcgtggtc tttcgtcagc tctcgagcgt
541 cggccgtatc gatttggcga aaaccagagc ggatgcggga taccgatcaa acatccgtat
601 gaatttttct gtacgggcgt cttgatgggg tcatcccggg ggcataatct ggctttccat
661 agcgccggag tccgcccctt gcacaggcgt tctcccacag agcgcagaga gtgcgtcagc
721 aatgcgtctc ggagcggacg atgattctcc gggacgtccg aagagggaag cgaggacaac
781 atgctcgaaa gctgccgtaa tgcccaagag cggtgggggg gtgtgcacca actgatcgat
841 cgctggctgc atgagcgcca gcaactggtg caggccttcg atgccctgag cggtattcag
901 gcaccggcgc ccaacgccga agaactccag catttctgcc aattgctgct ggactatgta
961 tctgccggac acttcgaggt ctacgagcaa ctgacggcgg aaggcaaggc cttcggcgat
1021 cagcgcggcc tggagctggc caagcagatc ttcccccggc tggaagccat caccgaatcc
1081 gcgctgaact tcaacgaccg ctgcgacaac ggcgattgcc gtgaaggagc ctgcctcatc
1141 gcggagctga aggtcctgcg gcaacagttg cacgaacgct tcgagctgga ggactgcctc
1201 atcgaagtcc tgcacaacgc ccacagccag agcggcgcag agggttcggc agtaccggtc
1261 tgattcaccg gcgagtcgtt acgaaacggc gcgtctccat gcgggagagc gccgtttttg
1321 catttcagcg gaagcccagc aggtcgatct cgaagaccag cggagcgtcg ggcgggatca
1381 ggtcgccagc gccttcgtga ccataggctt gcgccgaagg aatcaccact cgccagcgcg
1441 cgccgacagg catcgctcgc aacgctgtcc gccatccctc tatcacactg tccagggcga
1501 accactcggc gctttcgctc tggtcgaaca cttgaccgtc ggcaagcagg ccgcggtaac
1561 ggacgtgtac ctgggttgcc gcgccgatgc cgttaccttg gccacgacgt aattcgctga
1621 cgaggacgcc gccggtcaat tcgcggacgc cgaagcgggc tttttcgttg gccaggaagc
1681 gggcctcggc gggtgtggtc cgcggtgttt ccgtcgtggc gttctcgtgt tgcagcagaa
1741 gttgttcgat gcgttcgggg gggatttcaa gtgcctcgcc gcgataggcc tggcgcagtc
1801 cgagaagtag ctcggatagc tctaggccgg gcatttcctg ctgtaggcgc atacccaggc
1861 gggcgcctac ggcataggcc agttcgtcct tgggaggcgc ggcgaaaagc ggcagcgcag
1921 tagaaaaaga ccgatgaaaa ggcggcgcgg catgcggttt ctccaaacga atcggcgggc
1981 aatcctggcg cacgtcgctt gtagcacatc gaacccgttg gcgagagggg gtgtgcgggt
2041 ctagtatggg cgcaaccacg tccgcctgga ggcacgtcat gtcggccaac aagaagcccg
2101 tcaccacccc cttgcacctg ttgcagcaac tttcccacag ccttgtcgag cacctggaag
2161 atgcgtgcaa acaagcgctg gtcgattcgg aaaagctcct ggccaaactt gaaaagcaac
2221 gtggcaaagc ccaggaaaag ctgcacaagg ctcgcaccaa gctgcaggat gctgccaagg
2281 ccggcaagac caaggcacag gccaaggcgc gcgagaccat cagcgacctg gaagaggcgt
2341 tggataccct gaaggcccgg caggcggaca cccgtaccta catcgtcggc ctcaagcgtg
2401 acgtacagga aagcctcaag ctggcgcagg gtgtcggcaa ggtgaaggaa gctgctggca
2461 aggctctgga gagccgcaag gcgaaacccg cgaccaaacc tgctgcgaag gcggcagcca
2521 agcctgcggt gaaaaccgta gcggcgaagc ctgcggccaa gccggctgcg aagcctgctg
2581 cgaaaccggc ggccaagcct gcgacgaaaa ccgcggcagg caagcccgca gccaagccga
2641 cggcgaagcc tgctgcgaaa ccggcggcca agcccgcggc gaaaaccgca gccgcgaagc
2701 ccgcagccaa gccggcggcg aagcctgtgg cgaaaccggc ggccaagcct gcggcgaaaa
2761 ccgcagccgc gaagcccgcc gccaagccgg cagcgaagcc tgtcgcgaaa ccgacggcca
2821 agcccgcggc gaaaaccgca gccgcgaagc ccgcagccaa gccagctgcg aagcctgcgg
2881 cgaaaccggc ggccaagcct gtggcgaaat ccgcggccgc gaagcctgca gccaagccgg
2941 ctgcgaagcc tgcggcgaaa ccggcggcca agcctgcggc gaaacccgta gccgcgaaac
3001 ctgccgcaac caagcccgcc accgctcctg ctgcgaagcc tgcggcgact cccagcgccc
3061 cggcagccgc ctccagcgct gcttcggcaa cgcctgccgc gggcagcaac ggcgccgccc
3121 cgaccagcgc ctcctaaggc g
//
LOCUS STYAROF 1235 bp ds-DNA BCT 17-JUL-1990
DEFINITION S.typhimurium tyrosine sensitive 3-deoxy-d-arabino-heptulosonate
7-phosphate synthase (aroF) gene, complete cds.
ACCESSION M31302
KEYWORDS 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase.
SOURCE S.typhimurium (strain LT2) DNA, clone pGM59.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1235)
AUTHORS Muday,G.K. and Herrmann,K.M.
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 165 to 1235)
AUTHORS Muday,G.K. and Herrmann,K.M.
TITLE Regulation of the Salmonella typhimurium aroF gene in Escherichia
coli
JOURNAL J. Bacteriol. 172, 2259-2266 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.M.Herrmann, 13-JAN-1990.
Author address: K.M.Herrmann
Purdue University
Biochemistry department
West Lafayette IN, 47907
(317) 494-1653
FEATURES from to/span description
pept 165 1235 3-deoxy-d-arabino-heptulosonate 7-phosphate
synthase (E.C. 4.1.2.15)
site 1 20 operator aroFo2
site 52 71 operator aroFo1
site 69 113 promoter
site 75 94 TyrR box-1
BASE COUNT 309 a 286 c 343 g 297 t
ORIGIN
1 gtgtaaagtt tttgatacga aatgagggag attaatcgac gattgcgcat ggtgtaaagt
61 ttattttacg aaatatggat tgaaatcttt actttttgtg gtatggttta aacatcctcg
121 tggaggatca actatcgcaa acaagaaaag acaggatcgc catcatgcaa aaagacgcgc
181 tgaataacgt acgtatcacc gatgaacagg tattaatgac gccggagcag cttaaagcgg
241 cctttccctt gagcctggcg caggaagccc agatagcgca gtcccgggga atcatttctg
301 acattattgc cgggcgcgat ccgcgtctct tggtggtatg cggtccttgt tctattcacg
361 atcctgaaac cgctctggaa tatgcccgtc gatttaaagc ccttgccgca gaggtcagcg
421 atagcctcta tctggtaatg cgcgtctatt ttgaaaagcc gcgaactacc gtcggctgga
481 aagggctgat taacgatcct cacatggatg gctcatttga tgtggaagcc gggttgaaaa
541 tagcgcgtca gctactggtg gaactggtga atatggggtt gccattggcg accgaagcgt
601 tggatccgaa cagcccgcaa tacctgggcg atctgtttag ctggtcggcg ataggggcgc
661 gcacaaccga atcgcaaacc caccgcgaaa tggcgtctgg tctttctatg ccggtcgcgt
721 ttaaaaacgg cacggatggc agcctggcga cagcgattaa cgccatgcgc gccgctgcgc
781 aacctcatcg ttttgttggc attaaccagg ccggtcaggt tgcgttattg caaacccagg
841 gaaatccgca tggccatgtg attctgcgtg gcggcaaagc gccgaactat agcccggcag
901 atgtcgctca gtgtgaaaaa gagatggaac aggcgggact acgtccttcg ctgatggtag
961 attgcagtca tggtaactcc aataaagatt atcgccgcca gccagccgtt gccgaatctg
1021 tggttgcgca gattaaagat ggcaatcgtt caatcattgg cttaatgatt gaaagtaata
1081 ttcatgaggg taaccagtct tccgaacagc cgcgcagcga aatgaagtat ggcgtttccg
1141 tcaccgatgc ttgtattagc tgggagatga ccgatgccct gttacgtgaa attcataaag
1201 atttgagcgg ccagctggcg gtgcgcgtcg cataa
//
LOCUS TRBANTAT 18598 bp ds-DNA INV 17-JUL-1990
DEFINITION T.brucei expression site variant specific antigen 4 - 7 genes,
complete cds.
ACCESSION M33720
KEYWORDS expression site variant specific antigen.
SOURCE T.brucei (strain EATRO 1125), cDNA to mRNA, clone AnTat 1.3A.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 18598)
AUTHORS Pays,E., Tebabi,P., Pays,A., Coquelet,H., Revelard,P.,
Salmon,D. and Steinert,M.
TITLE The genes and transcripts of an antigen gene expression site from
T.brucei
JOURNAL Cell 57, 835-845 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Pays, 20-JAN-1989.
FEATURES from to/span description
pept 1174 2196 expression site variant specific antigen 7
pept 3528 4733 expression site variant specific antigen 6
pept 8519 9772 expression site variant specific antigen 5
pept 14450 18157 expression site variant specific antigen 4
mRNA 986 2676 ESAG-7 mRNA
mRNA 3483 5056 ESAG-6 mRNA
mRNA 8260 11304 ESAG-5 mRNA
trns 5305 5808 rime retroposon
BASE COUNT 5298 a 3468 c 4707 g 5125 t
ORIGIN
1 gcatgcgaag agcaggggtg caacggaaga gtgaaaggat tagaggaaat agtggaatta
61 gcatgattgg agattgtgag ggttaggaaa aacgtgacga gggaaatggt aactggacat
121 cgtgcgcgag gatggcgaag gcttgaagcg tgtagagagc gtgtgtaaat ggcccaaacg
181 gtgtgggtgg ggtaataacg cagagtgctg aggtgtttga caagttcccg cgaggtgctg
241 gggaggagga agcaagacag taaaatcgtc gtgagacgtg gctgtgctcg ggaatccgta
301 caaatatact gaaaattatt gatggttgat atgcgtggga acgacgcagg atgatgctgc
361 atcgccaaga tgcttcggga gaacaggaaa ataacgacga atcaacactg gtgagttgat
421 actataaaaa tcgcgtgggc tttgctgctt cgctttcata aatgccacct gcaatgaatt
481 tggagggtgg aggtgagctg ccctgtgtat gggtgggtat tgcagatgca gcggcgcttt
541 tgggtgattt ttggtttaca gtttgctgtt gttgggaagg ttggtggtgg caacaaaatt
601 ggagaaaatg atttcatctg gaaacctact tcaaaacccc ataaacaatc ggacctcgaa
661 gtgcaagtgt ctcccagaac gtaaacaagg ccttgaagac gaccagcaga tgaagcacag
721 caacagcgac agaggtggac tgagacgaaa gtataccata tgactatgcc acaagaattg
781 atgggtgatt ctgccaacca cggacgcagt caaaaacata gaaggagctg tagtgaatat
841 aaaaccactc atattttcta caaatattag agctttatta cttaagaacc tgttttagca
901 ccaaggttgc gtgtgtattg ccgttgctgt tacttattgt tgagttggta tttgaggaaa
961 atgttttttc aggtgcgtcg ttgttatata cggctgtgag ttggtgatgt ttatttgaat
1021 gatagcttga taggataaga tgaaataata ctaagcatgt attcttctgg cgtgtcgccg
1081 aaatggtaat ctatgttacc catagcacga aaaaattttt ttactcagta acgaagcgga
1141 aagttttaaa aagcataatt taaaaaagtg ataatgatga agttttggtt tgtgctgttg
1201 gctcttttgg gaaaagaaac acatgcatat tatgaaaata aaaggaatgc attaaatgca
1261 accgccgcta ataaagtgtg tgggctatcg acctatctta aaggaatagc gcacagagta
1321 aacagcgaga gtgccgtggt tacggaaaaa ctatcagatt tgaaaatgag aagcatccag
1381 ttgcagttat cagtaatgcg gaacagagtg ccttctggcg agcaggattg taaagacatc
1441 aggacactct tgaaaacagt attgaggaat gagtttacat tccagcagga gttggaggaa
1501 atgaggaacg catccgcgtt agcagcagct gcagctggga tagcagctgg aagactggaa
1561 gagtggattt ttgtatttgc acaggcagca ggcgggtcat cacagttttg cataagtgtg
1621 gggaagcata tcccggctga gcacggcaac ttgcaagaat gttttgatgg aataattgga
1681 cctgaaaccc tttacaaaat tgaggattca cgtgtaaaag agtcggcgca gaaaagcttg
1741 caactccatg aagtgttatc atccatttcc ttcagtagct tgggtgcgga aagtattgtt
1801 gagcaaggag agaacagagg atgtaaccta atgcggacgg ctgacggagg tttgctgaaa
1861 gatatttgtt tgaattgcaa cttcacatgg ggtggtggag tgatgaattt cggatcctgt
1921 gtggcgggga acctaaaaat aaaaggggga gaatacggtg atgtcagttc ccacgatgtg
1981 gtacggtgga ccgaagatcc cagtaaagtg agcatattta aggatgtcat ccgcctgttt
2041 gcgcgcttcc aagaagcaaa aaatgcagtg atgaataaaa taaaaactac tgtggatgaa
2101 ttggcgaaat gcatagggca gaaagaggtt gagctcacta atgatcagct ttacgaggaa
2161 ttcgaggcaa tacaaaagta cctagggtct ctgtgaaatg gagtaaaagc gaattcaact
2221 atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg gcagcagtac
2281 atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga gtgctgtgaa
2341 ggaaggatgc gacagaagct gcgctgctta gcgtgaaaga ttatggtaat ggagggttgt
2401 gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg ggcaaaagta
2461 acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa ggaaagatgt
2521 gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg agatgtaagg
2581 ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa tttagttaca
2641 acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt ctactgttta
2701 atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc gaatttcact
2761 tctccagaat ctttatttta gtgcgctata gatgatagtc caaagagaag gaattacaat
2821 tattataacc catccaaagc tatgtcagag actcctcttc tccgagtgat tcttttctat
2881 gtcacttgtg gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg aaatggggcg
2941 tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt atgatgcata
3001 gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta ggtggaagcg
3061 gctaatgatt tgtcggcacc accaacagag gaagttaatt gttgacgtag ccgtttgtga
3121 atgagataga aggttattgg gtggaacatt atcttggaaa gagtttttct tccactaaaa
3181 atgtgtagag tggtggtttg gagaaaatgg tttagatatt gaggaggaaa caaggatttg
3241 aggtgcaagt tgatgcgctg taagatttgt attgccgttg ctgttactta ttgttgagtt
3301 ggtatttgag gaaaatgttt tttcaggtgt gtcgttgtta tatacggctg tgagttggtg
3361 atgtttattt gaatgatagc ttgataggat aagatgaaat aatactaagc atgtattctt
3421 ctggcgtgtc gccgaaatgg taatctatgt tacccatagc acgaaaaaat ttttttactc
3481 agtaacgaag cggaaagttt taaaaagcat aatttaaaaa agtgataatg atgaagtttt
3541 ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa aataaaagga
3601 atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgacctat cttaaaggaa
3661 tagcgcacag agtaaacagc gagagtgccg tggttacgga aaaactatca gatttgaaaa
3721 tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct ggcgagcagg
3781 attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt acattccagc
3841 aggagttgga ggaaatgagg aacgcatccg cgttagcagc agctgcagct gggatagcag
3901 ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg tcatcacagt
3961 tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa gaatgttttg
4021 atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta aaagagtcgg
4081 cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat agcttgggtg
4141 cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga acagatactg
4201 acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt ggtggagtga
4261 tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa tacggtgatg
4321 ctacagagta cggtgcagta cggtggaccg aagatcccag taaagtgagc atatttaagg
4381 atgtcatccg cctgtttgcg cgcttccaag aagcaaaaaa tgaagtgatg aataaaataa
4441 aaactactgt ggatgaattg gcgaaatgca tagggcagaa agaggttgag ctcactgatg
4501 atcagcttta cgaggaattt atctgggaga ccatacacag attggagctg tcaaagagag
4561 tgagtgaaca actctcttta ggtgaagaag aggagacgat cctgaaatct aactatactg
4621 cagaaccggt ccgtggacct ttcacggtag cggggtccaa cgcggcagca gtacatttga
4681 gtgtttctac cgctgcactt tgtttttcag ttttattgtt gggagtgctg tgaaggaagg
4741 atgcgacaga agctgcgctg cttagcgtga aagattatgg taatggaggg ttgtgaaaga
4801 ttgggggaac aaaaacctat ttcttttatt tgggggaaca aatgggcaaa agtaacgtaa
4861 gtttccagtg ggagtggtat gtgtgtgtgt atggggctgg ctaaggaaag atgtgagttc
4921 ggcatgtggt atgtacaagc tacgaaaacg tgtgaaacaa aacgagatgt aaggggaaaa
4981 tgtaacaacc aactatgtta aatttcagga gactattttt caaatttagt tacaacaaag
5041 taaatgtcaa ataatgccaa ctataaagga aaactagact aattctactg tttaatgtga
5101 cgagaattgt ggcgtgaaac ctagtaactg cttatcatgg cgtcgaattt cacttctcca
5161 gaatctttat tttagtgcgc tatagatgat agtccaaaga gaaggaatta caattattat
5221 aacccatcca aagctatgtc agagactcct cttctccgag tgattctttt ctatgtctct
5281 tgtggaaggt tgatattttt gctaccccct ggcgatgccg gccacctcaa cgtggtgcca
5341 gggtccagtg ccccgtatca ttgggggaag ccaagagcca gcagcgttcc tttcatgggg
5401 aacactgctg tgctccggct acggcatcat acagcacagg gatcagcagc gtcttgctgg
5461 gacaccgttt ttcatttgtc ggtccctggg cacgtgccag cgtgccatca gcagtatcat
5521 ccgcgctaag atgctgctgt ccggtgatgt ggacctccaa aaaaggattg ccatttggca
5581 tcttttggag agtccagggt gggaggcttc tcgccccatc tgctgtattc cgttcaactg
5641 cggagctaca acaaaaaatt atagagggtg tgttaggatg aatgaaaaag ggagactctg
5701 ccacagtcgc cagaccgata gcatctcagg gctctacggt gttggctgat ggccgcgcca
5761 gaggggggga aactctcacg aaggcacgaa gaaaattcaa aaaaaaattt ttgctaccac
5821 aggctgtaag gataacgctg gagatggagc cgactgtggt ttttcttgtg aatgcgtatc
5881 ttcgtaaccg gttgttaaga agggatcgga cgaaggaatg tgtctgagta ggaggtaatg
5941 tcgatgtttt ctttccattg gatgtagtaa taatctccaa aatttgttgg aaacaatata
6001 atattattgc aataaatact tgaaaagtta aattattgat ggattgactt gagtatcatg
6061 attccaatga acactctcat gatgcttaga atagcaaacg tatgaacaat caacgtagcg
6121 tcatgtagac atggacattt tttgtaatga tacgactgct attgcacatc tgaaaatttc
6181 tatgacggac cagtgccgtt aatcaccgta caattgaatt caggatgaaa aatgataaat
6241 cagtgctcta atgtatgatg cagctgtcaa ggcaatgttt ctgcacaaaa agagtttaac
6301 acactgccat ttgcatgcga ataaaacaat gagtatatga agcggaggat cgcatggcaa
6361 tgataaaatc gaacactgtg gctgcgagac gcagggccac ccggaagcac cgggtaacaa
6421 agcgagttca ggagtcgtcg aaagtggcaa aagtcagcac caaaccaagt tgctttttta
6481 aaaagatcag caaacgtatt tctcactgac ttatacagca ttcaaaggaa atgaacttag
6541 cagcaggaca agggtgcaaa ctaaggcatc gattctgtat ggctgcaaaa gtgaagacat
6601 caagcaaaca aaggtaactt atttaccctt ataggagcag tggaaaaagc agcggactcg
6661 tatggagata acctgacgaa aatactacca gttttagtca caggaagcga tatcctgtca
6721 aaatgccggc agccactcaa tggcggtcaa aacttgatta aacagttgaa aaaaacagcg
6781 gaatcaaaag gaaccttgcc cgtgctcacg gtggtagata aagcaacgaa aaatgtaaag
6841 gtagaattta acggtaacaa ttcgaacaaa accgtactat cgtgcaaaaa tatttcagca
6901 caaaatcccg acaaactttg cggcgacaac acagcagaca acatggccaa gtgcacgaaa
6961 gcttccaaag cagcttttgg cgctgatggt gtatatttca aaagaaaaaa gggcaaacct
7021 tgactcagca gctataataa agtcagctag aaccggcgaa acagcggaag cgctgagcaa
7081 acttaaaaca ctggacatta tcggtgattt tatattgcgt ttatttgtcg aaaagctcat
7141 ggacatgacc gtggatgagt acacgactca aagagagact aacgacaaaa aacgctgcag
7201 tggcaaaagt agcgtgcctg tacaggggac ggagaacttg ataataatgg taatattaat
7261 cttagaagaa tgttgtgagt atgtgtcact tttacagtga tagttgtgaa atattcggaa
7321 tgacagttgt aatggaggtg aaagaagctg ttaggttgaa tttggggaac catctatgta
7381 tacaaagaag cacaagggtt gtgatagctt atttatatga tggagaagcc atgcatggta
7441 aaaaaaaaca acaagtgata tgtcggtgtt tgtggctaat gtagcataaa gactaacagc
7501 tacagcagtg catcaaccga tggagagacg ctttcagaat tttcaaatgt aagaaagtca
7561 gtaaattaca aagaagtggt tgagttgata ctcagaaaag tgggactaac aaactgttga
7621 tgaatgaagg tgatgccata cgtaccaggt acacaggtca gcaatatgat gctcactgaa
7681 aaagagagat ggtgtgctac tggcgggatg tcgctggaaa agtcttagca agtgtagttg
7741 cagcaccgag cacacacact catgggagtt atcaaagtat gaaaacgcaa aggaagctga
7801 tgtgtgtgtg tgtgtttttg tgtgtgtttg caggaaacaa atgtatcagc ttcgacatct
7861 ttccgcaaga agtagaagtt tcagttgtgg tcactgacgc attacgtgga ttgttgggtt
7921 gaaacaacac aaatgtggca accatttagg atgttttggc gctttggcga cggttgtact
7981 gcagtttgta tgccccttca agaaacaagg aatggtgtga aagcgttgtt tttttttaag
8041 tcaagtatgt gcattcaaga aatcatatca cagtgttacg cagtgctttg tcgtcatggg
8101 aggaccttcc tgcattgaga gtgcatgtaa ggaaacggca acaggatgct atggctccga
8161 atttttttta agtatctgct tttcagacca cttgcgttga tcgacttcag ttatgttaca
8221 ctccgcacat cttttttgct atgttttttt ctactttagt tgcacttagt gtagccaaca
8281 ttaacttaaa gtacaaactc tatcctaagc aggattcggt gttatttcta ttcgacgatg
8341 tcattggtat cgagcgcacc cgttttggtg ttcatttccg ttgtaacact tctgtgttgt
8401 atcatcggct ctgggcaacg gtctagtagt ccctttacgg aaccggcact caaaccaggg
8461 ggcattaagg ttgctattca agaggcggct gttacgccat tgttaccagt acttgcgtat
8521 gagttcgaaa ggtttatgga gaatatcacc attcccgagc aaaagggtta accgtgtgag
8581 tgtagaagag acttattttc gtaacgtgac ggtcggaagc gcgacagtta aattcgaaga
8641 accaaataaa atcattttga agttttggaa cgtgagcgct actgtgccgt ttacgcggtt
8701 tgtataccac tcgttttggt gttacctgta tccttgcagt gggtcggcac aggcggaaat
8761 acgcaacggc agtgtggcac tgtggctcga cgtgtccgca ggaagggggg gactactaga
8821 catacatgta gggagttcgg aaattggcat gcgtgaccca ctgataactc ttatcggtga
8881 agggaaatca aagatgccta agtggcttgg cgggagagtt aaagatatgt atgataagga
8941 tgtgcttccc aagcttgggc atcatataat tactgcggtg aatcaaattt tagccaacaa
9001 agcaaaggag atatttcata tgtttccaat tgtctttgtc aacagttcta aaattgaata
9061 tggacaaatg agacttgaac tggtcgtgct tccagatgcc gcagacaaat taatgcttac
9121 ggagaatgtt tttttcccgc ggcacgcttt tccaaacttt cctgttgctg ttgtttcttc
9181 gtttacggct ttgaataata tgcttcgttt gatgattaaa agtggccact tggtggtgcg
9241 tgtgcccttt ccgctgaggt atgttatctc cagcaacgcg ctcagcgtca gttggaccgt
9301 cttttgctct ggatgtgcct ctgaggctac attcgaacta aaaacagccc cgtggttgaa
9361 gtccctaaat aagaaactgt ttacatttaa ctatcgggat gtcgaagttg ccgtggacct
9421 gctgccaagg ggtggtgccc ccatttcctt gtttagtatg ttaatgaatg tgtctgccca
9481 ggcggcgcat atcgctcttg ttgacggtgc cacccatgcg aatttggatt ctgttgatac
9541 taatgtgagt gtcacatcct cacgtatcga tggcctggac tcaagtacaa tgaacacaaa
9601 aattcgtgat ttgatcaact tgtcttggat caacctgaat gttacatata catttcctgc
9661 tcccttcgac ttgtgtacga aacacgtgaa tattacaagt gaatgctatg tagcgggatt
9721 taaccttgtg cgagctcttg gttcactcag tatactccct cacctccgat gaaggattgc
9781 gacattcctg gtcactttgt aaaacacact tccatgcgct tatgtggaag ggggtaacac
9841 tcaaacctat cttgctttcc cctgtttacg cacaacttca catacgcact ttctctttta
9901 cgagatgttt gcatgccgca acaaccaccc tgcgcaatcg acgagaagct atccgtgttt
9961 acgggtgatc cttcgaatta tggcggagga gtaaaaacct catgatatgt gaagtgctgt
10021 ggctgaagaa ggaaataaga taaaaagggt atttttcatt gacccgacca ttactggttt
10081 tgactttttg cccgaggcga gaaggcaaat ccattgttga ttggctcatg aatgtttccg
10141 tgactatacg cgccaagcct tggatatgtc gtgggctttc ggcatatgtt gataatataa
10201 cgactgccag ttggctgagt gacggctcaa agtgttagag gcgagtgtca atggaagata
10261 tatctattga tttggtaaat aatattggat tcgagattaa ctgttcaagt gtgctacaac
10321 accagatagc atgttcgtag tgtaggggtg gcaatagaag tgatggattt atcacgacta
10381 attgttctat tggcaacgct tgtgtctcgt gactgctatt ggttgattgc tgatgtttcg
10441 atttgagttg tttcgtggcg gcacgaaaag tatcatcatg tgaagtaggt taaggggaaa
10501 agaaacagat atataaagag cctttcgtgt tgctcccaag gaattgtata aatctctgag
10561 cgagatttgt gctcggtaat gggttcaaga gtgtgtatat acaaatacta tattaagagt
10621 ggtgattgag accaagtgga gggtcgaaag agatccactt aaagccatgg aaagtgtgcg
10681 gttgtagagg gtagaaaact aagaaaaggc aaatgataaa tagtattata ggaaatgcag
10741 agtgatagca agagagaggg agatggagta accaatggtt taaaaacaag catgacgatg
10801 ggcagtatta tacccataag gatcaaagag ggtcaaaaaa atacaaaatg ggttcagtgc
10861 acaagagatg tgaagatgct acaactgata cggagaagcc tagaatgtga aaattcagaa
10921 aataatacta tatgaaaata gtgagagtga caactcacaa tacaaaacaa actgatacgg
10981 gagaagatta tgctaaacat aaaagtgaaa aaagctagaa tataagctga taaaacacca
11041 taaactacat tgatttgagt ttcatgatat aaaacgtgga tgctctattc agcagaggat
11101 tctagtaaat tttttaatat atcttaaagt aatagtgaac gtagaataaa tttctgttcc
11161 aatacacaga cgagaaaaat aaagaattga acgggtagca aatttaagta acttgcgaca
11221 atatttttgc tgaagctaac ttctaaaaca aaaatgcaaa tacgtgttca aaaatatagt
11281 gaggaggata atcttttaat tacaaacgca tacatcatga taatttcaag gcattgtttg
11341 catatgtaaa cattgcttta ttaactgagt gcagttttct ggtaacctat ttcttttcat
11401 agttttcttt cataaccctc tctaatattt taaggtatat taggaaatga aatttgttac
11461 cttgtgaact ttactattac caattatatc acatttagtg tgtttaagtg caatatacct
11521 tcaagatgaa gaagtaagtg acatgacaaa tatgacacat aatggtacta agaaacccag
11581 agactttgtg gttgtcgttc aaactcatgc aacacaagga tggtgtaggc tcatattgag
11641 tgcgaaagta agccatgtgg aaattattac aattgaaatg ggaggtttgt acagtcatgt
11701 caaaaggcct aaatgggtta tggagttttt agagaaacag aatgcaagtg atgaggatat
11761 tttaatagct aaggatggat ctgatgttat agttggtgat ggaaataagt atgagagttc
11821 tgtggaatat tttgtgtaaa aaagcaccct acagtaagga caaattttat gaagaggata
11881 ttcagaaaaa ggatataggg taatgttatt gatttttgag atggccacaa aagtggagga
11941 aaagaaaagt ctaccaactc tactaaaatg gcctttaact caaaaagcat acttaaatgg
12001 aggtggctta attgccagag tttgggcatt tagaaggttt gaatatgcat ttggagagtt
12061 actaaaggga agtgaggagt ggtggtcaga gcggagtatt tacaaacctt tgttaatatg
12121 gagtgcaatt caggaagaag ctgttggaca gagatttgtt ttgaagagag gaatgattgg
12181 attggattac gaagagagat ttttttctat agaaagtagt ggggtaattg gagaagcccc
12241 attcattcat tttcctggac agcctattgc ttgggaggag aaagcgagac tgattgtgaa
12301 aaatcttagt tgatataaaa ggttgaaagg aagtgaattt aattcgaagg atatggggaa
12361 aagtagaaac ttatgtagta gatggagaga attgggaatt tcgatatgaa agaatttgtg
12421 gggacgcggt caaggagaat gacttgttta aggcaaagaa actgtaaggg atacaaaact
12481 aaagattttc ttgcggctga gtcagtagtc taagcaaatg tgggaccaac acgcatggcc
12541 aagatggccg cacaagtcta tgttgacatg aaaatcctgg ggcagcaaca aagagaagct
12601 ggaagacaaa cctcaggggt accctaacta ttcggctgaa atggtatgtc gtgacggcaa
12661 tatcgatatt ggtttcgggg gtggtaacac gatggacgca aaatttagaa gcttctatta
12721 cctctccata aaccatgatc tcaaagctag agaagcacaa aacaataaca caactagtga
12781 cactatgcgt tttttatgca cctgcacaga agaagtaacc gacgcttgga ccgaagatgc
12841 gaatacatat tcggcatctg cagtataaaa ggtaattcag ctgctagaag gaaaaaattc
12901 ctgaagaccc cgctgtgcca gagatatgta aacaactgtt tgaaatacat gcactcaata
12961 tgtcactatc agacaagacc tggcaggaca agttcataaa ggctcaacga agagccactg
13021 cagtggggtg acgaagccaa caaacataag accactaata cggagtggca cgaagtgtgg
13081 cagcgctggg tctaagccag gcaaagacta aaaaataacg aatactgagg gcgataaagt
13141 aaggaaaaat gccgcctcta aatggcaaag ttcaaaagcg gcacaggccc gagtcaagct
13201 aaactctttg ctgatccatg cggaagcctt gaacgctgag ctggaaaaaa agggcaaata
13261 atggctaacg gcaaaaaaca atagaagtgg acgttcttgc agccctctac ggaacgccca
13321 cggcaaaagc agcgtttggg agcagtggct tattgtcatc atcatttttt tgttgttttc
13381 gccactagtg gttttctttg tttgcgtatt tgatatctcc ttcgtcacta ttcttatctt
13441 tatcgtctgg aatatgggtg agatatcatt cccggtgtgt tttgcagtca tatggcgtaa
13501 catccacctg tgttgatatt ctttacagca tatcttacat gcttctcttt ccaacagtgc
13561 ttcctgtagt ggtgtgcttg gctccccttt actcttctcc ggtatgcact tatatctggt
13621 gagccaggtt ttgcttttta tagacctctt ggaagcagag acatttgtat gcttctttta
13681 ctttctcatg ttcaacttcg ctgcttattt tcctgcattt ccgtttgttg tgtcacttca
13741 gtttgctttt ctacgatgtt taaggcgtat atgttcctag tcaactagcc atacgtggca
13801 tactgctttg ggcaacattg ccacttgtag acttttttac cgaattatct tcttctcgct
13861 tccttcctct tttgacgctc tgtccttact actggagttt taggaggctg ttgtggcatg
13921 tgccttgtgc ttcttttatg accaacaagt gttattttta cagtcgtttt tgcactggtc
13981 gtgtgtggcg aaaaatacgc ccccgctcct ccaaccaatt agtggcaaag gggcggctcc
14041 gttgaggcag cactgcttgt cctcttaatg gtacattgtt gttttaccta tcgctggttc
14101 actccaccgc acgcggtgca taccttggga gttctgtact gttttggggg tggttttgtg
14161 cattgtaaca gccgcctctg acacaacaag cacagtttca cgcctgcgtt actggcattt
14221 tatgcctgtg gtggaacaac gtgtgtgttg acgtccgtaa atcttaatat ggcttgttgc
14281 gtgctaccgt ttatattttt cccccttcga tatgggcatc ccgttcttgt cgttacgtgg
14341 ttcccagttt cacctaacaa cccttgccct ctcttaccca gcagccgctc cctttcgttc
14401 gttccagtgt aacaaccact ttcacgcgta gactatcaaa atcactgcga tgaatatgct
14461 tcacttgagc gaccgcaatg cctcactcgc accgagtggc ggggaacatt cccttccgac
14521 gggaggagct gtgtgccgtg acgccatgga catacttccc gtgatactga gagctccggt
14581 cgcgttgtta cttctgctcg tcgtcctgcc gcaattgtca gtgggcgccg aggctaatgt
14641 gacagtcaag gttctctccg ccacgtggaa ctggtacatg ccgcggaagt atgtaaccgc
14701 gattaatgct ggcttcaacg cctcactgaa gtcacgaaac tggacagtcg ctggaagcgt
14761 caacgttcag gtagtttacc ccagcaacct tgacctgatg cccgaagatt tcattaaaaa
14821 gcagttggaa ttggaaactg accagaataa gattgttata gtgtatggac ccctcggtga
14881 caaaagtgta atgcactcga ttccacattt gatgaatcac agagtagtgg cctttgggtt
14941 gataactggg tccactttta tccgacaatg gaatccgtac ctttactttc tgcgtgccga
15001 ccccgctgct gaaacattgg ttctcattcg gtattcgcta tgccagttgc gcgtgctgcg
15061 cttgggcttc atgtaccttc agggtgttca ttatggcgac gaggagtatg cattgacagt
15121 gaacgtgatg tctcggatgg gttatgaact gcatggtgtg ttcaccgtga tgagtccaga
15181 cggtaaaccc gctcctgatg ctgagtttaa agaggtgttt gagcgcttcg ccactgccct
15241 tcctcaggcc attattgtct ttggtgcgcc agtggatgat accgcgaagt tcttgatgat
15301 gatggcagtg gatgagcgaa tcgctcggtc gtatattctt agtccttcat ctgtccagtt
15361 gtcactcata gaaatgtggc aactcgctct ggaagctgca ggggcctcat tcgcgcccgg
15421 acagctactt ttcacaggca ccaatccgct tgcgaaggac agccagtaca cagcaatcaa
15481 gcgttttcaa gaagttatga gtgaatacct aaaggctcac gtcggcgaaa ccaacatcac
15541 tgaagcagat tatttcctga cccacgatct tgagggagag ttgatggtgt atgggtggat
15601 ttctggggag gtgttatctc aggcacttag caaccttgaa tggctcaagg accgcgctac
15661 atttgtcagg tcgctatacg accaacggcg ctacgtcatc aatgacattg tgatcggtga
15721 ctatggcggt acgtgtgagg gggacgcggc caagcatggc gccacctgtg agtgtaacca
15781 aggcagcaag gcggtgtatg tgaaagagat gttggaaaat gggcagaaga cgtcggtgcg
15841 aagtggcttc acagttctga aagcttcgct gtgctatacg gattcttcgg agcttcacgg
15901 gccactggat ggactcgtcg ttttcatgaa agacgacgac atagcgtcga aagcagctgc
15961 gctgtggcag aaggggacct cccatcttgt cggcaagggt gatttgggat actcggacag
16021 gttctttctg cacgcattta atactaccat agctgaggca gcgaacgatc ttcggcgtga
16081 ccaaggtgaa agaatcgtaa ccgctgtgtt tggccccgtg accgaggcga tgttggacac
16141 gccgaatata acctttatcg atcctctgga acttaaacca cggctgaata agttcagaag
16201 gaatgtgatt catctctcgc ccaccctgga acaacaactt tacgtccttt catcgtacct
16261 cgcgggcgcc ggggtcggta atgttgatgc cgtgatctgc agcaacgaag cggatggaat
16321 tgcagatttt ctgcggagtt cactaactga attcgctgtc tcgctacgtt ccgctgtcat
16381 acgcgaagat ggggaagatg ttggtaaata ccttccgatg agtgggaccg tgtttgtcat
16441 tggcctttcc gttccagatg tgaaggaaat cgcgaggaaa cttgaagaac gcaacgattt
16501 gcgtgtgatt gttctcttcg gggaattttc attcctgtat gatctgtttg ccacggccct
16561 caataacacc gcaggcgccg cacgtcttgt gtttgccacc agcttgccac attggggtga
16621 tactgaaacc tcatcgaaga ctgctcaact gttccatgat gttgaaaaag actcacggtt
16681 atggacaccc ctatcagttc tggcattcgc aaccggtcga ttgatgcggg taatacttct
16741 tcatgtagaa gaaatgagtc cagaaacgtt ggtaaatttc ttctataccg attcttccat
16801 cgtctccgat gacatgcgct acggtgtatt cgacgacaca aaatgcgtcg acaccgcaaa
16861 taaactttcg aagaatggtt gtgcctcaaa ctatggtgca acgcagatat cggtgtggtc
16921 tatggcccgt gctctgaacg cttctattcc tccgcttaca aatcccatga caccgtcaat
16981 gactttcaga aactctaacg cgggaaggat ctccggggcg tcgctggtgg gcatcatcat
17041 cggtggtgcc cttgctttgt ttcttgtggt ggctctgggc gtggttccct actttttcct
17101 gcgcaacacc gtgataacaa tctgcaccaa agatgacaga cccgtaacgc taatatttac
17161 tgacattgag agcagtacgg cgttgtgggc tgcacaccct gaggtgatgc ctgatgccgt
17221 tgcgacacat caccgcttaa ttcgcacatt gatttccaag tatgaatgct acgaagtcaa
17281 gactgttgga gattctttta tgatagcgag taagagtcct ttcgctgccg tccaactcgc
17341 acaggaacta cagctgtgtt tcttgcacca tgactgggga acaaatgcga ttgatgaatc
17401 ctaccagcag tttgagcagc agcgtgcgga agatgacagt gattacacac cgccaactgc
17461 tcggttggac cctaaagtgt acagtcgttt gtggaatggc cttcgagtgc gtgttggaat
17521 ccacaccggg ctgtgcgaca tccgacgcga tgaagtaacg aagggatatg actattatgg
17581 acgtacatcc aatatggcag caaggacgga gagtgtagca aatggtggtc aggtgctgat
17641 gacgcatgcg gcgtacatgt cgctgtcagc tgaggagcgt cagcaaattg atgtcactgc
17701 acttggtgat gtgccgctac gtggtgtgcc aaagcctgtg gaaatgtacc ggttgaatgc
17761 cgtgcctggt cgtacctttt ccgttctccg tcttgagctt gagcttctta acgatgatga
17821 ggatcagact acaacttcct gtagtgacca cagttcttca cggacagatc tgagtgtagc
17881 tgcacaaact attgctgctt cccttcagtc gctcttgggt accttcaccc ctgctcagcg
17941 ccagaaggca ttaataccct tttgcgaacg gtggcgtgtt ccgcttcctc aaaaggtcgg
18001 gaacgtgtgg gacgatgacg gttgtcagga agttgtacgg cgtgttgctg caaaggtggg
18061 ccgtgttatg gattttggaa cgaggaagcc ctccagcagt gtgacatcgt ggaaaggggt
18121 ggaagtgtct tctcaggtgg aggagcggct gctataatgc ttgcttcatc ctcaagttcc
18181 tcctgcgttg atgggcattg tgttaccgta caattgattg acttggaaaa tgacagcgct
18241 accacctaat attaccctca tatcattgct gtatttgcgc tctgctattg aagtgaatga
18301 atgagttatt ttatatgatg tgtgcaatct tttctttata ttcaattcgt ataaagtgga
18361 cattataact ttccctcccc attcttgtgc ctgaaatgat cttctcaaac ggtaaatgct
18421 ttaaatacat ttgtacatga agatggtttt tgtttctgac atggctactc tccttttgag
18481 aactgtgttt gaagattgta cattttcttg tgtttgatgt gcctttgaat gtgagtatcc
18541 tcgccacttt tgataatctc cctcgtttaa ataatatatt tattgactga ataagctt
//
LOCUS JS3PRSBS 81 bp ds-DNA BCT 17-JUL-1990
DEFINITION Plasmid pJS37 promoter region DNA.
ACCESSION M34567
KEYWORDS .
SOURCE Plasmid pJS37 DNA, from B.subtilis (strain MB11).
ORGANISM Plasmid pJS37
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 81)
AUTHORS Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M.
TITLE Comparative expression of the pC194 cat gene in Streptococcus
pneumoniae, Bacillus subtilis and Escherichia coli
JOURNAL Gene 86, 71-79 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 61 81 leader peptide (put.)
mRNA 37 > 81 cat mRNA
mRNA 64 > 81 cat mRNA
binding 46 51 ribosomal binding site (put.)
signal 34 62 promoter region 1
signal 5 34 promoter region 2
signal 57 62 P1 -10 region
signal 34 39 P1 -35 region
signal 29 34 P2 -10 region
signal 5 10 P2 -35 region
BASE COUNT 36 a 5 c 11 g 29 t
ORIGIN
1 atttttaaaa tatatattta tgttacagta atattgactt ttaaaaaagg attgattcta
61 atgaagaaag cagacaagta a
//
LOCUS JS3PRSEC 62 bp ds-DNA BCT 17-JUL-1990
DEFINITION Plasmid pJS37 promoter region DNA.
ACCESSION M34566
KEYWORDS .
SOURCE Plasmid pJS37 DNA, from E.coli (strain C600).
ORGANISM Plasmid pJS37
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 62)
AUTHORS Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M.
TITLE Comparative expression of the pC194 cat gene in Streptococcus
pneumoniae, Bacillus subtilis and Escherichia coli
JOURNAL Gene 86, 71-79 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 52 > 62 cat mRNA
signal 21 43 promoter region PT
signal 43 48 PT -10 region
signal 21 26 PT -35 region
BASE COUNT 23 a 3 c 14 g 22 t
ORIGIN
1 ccgatgatga agaaaagaat ttgaaactta gtttatatgt ggtaaaatgt tttaattggg
61 at
//
LOCUS JS3PRSSP 109 bp ds-DNA BCT 17-JUL-1990
DEFINITION Plasmid pJS37 promoter region DNA.
ACCESSION M34565
KEYWORDS .
SOURCE Plasmid pJS37 DNA, from S.pneumoniae (strain 708).
ORGANISM Plasmid pJS37
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 109)
AUTHORS Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M.
TITLE Comparative expression of the pC194 cat gene in Streptococcus
pneumoniae, Bacillus subtilis and Escherichia coli
JOURNAL Gene 86, 71-79 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 33 53 leader peptide (put.)
signal < 1 6 promoter region 1
signal 6 34 promoter region 2
binding 18 23 ribosomal binding site (put.)
BASE COUNT 47 a 13 c 16 g 33 t
ORIGIN
1 taatattgac ttttaaaaaa ggattgattc taatgaagaa agcagacaag taagcctcct
61 aaattcactt tagataaaaa tttaggaggc atatcaaatg aactttaat
//
LOCUS HUMTNC1 1085 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human fast skeletal muscle troponin C gene, exon 1.
ACCESSION M33771 J05502
KEYWORDS troponin C.
SEGMENT 1 of 2
SOURCE Human HeLa cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1085)
AUTHORS Gahlmann,R. and Kedes,L.
TITLE Cloning, structural analysis and expression of the human
fast-twitch skeletal muscle troponin C gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Gahlmann, 20-APR-1990.
FEATURES from to/span description
pept 1033 + 1035 troponin C, exon 1
pre-msg 968 > 1085 troponin C mRNA and introns (alt.)
pre-msg 969 > 1085 troponin C mRNA and introns (alt.)
IVS 1036 > 1085 troponin C intron A
BASE COUNT 266 a 276 c 306 g 237 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct
61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc
121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa
181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa
241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc
301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac
361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc
421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca
481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga
541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt
601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc
661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga
721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc
781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt
841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt
901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag
961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg
1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg
1081 tggac
//
LOCUS HUMTNC2 1773 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human fast skeletal muscle troponin C gene, exons 2-6.
ACCESSION M33772 J05502
KEYWORDS troponin C.
SEGMENT 2 of 2
SOURCE Human HeLa cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1773)
AUTHORS Gahlmann,R. and Kedes,L.
TITLE Cloning, structural analysis and expression of the human
fast-twitch skeletal muscle troponin C gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Gahlmann, 20-APR-1990.
FEATURES from to/span description
pept + 51 102 troponin C, exon 2
247 390 troponin C, exon 3
479 593 troponin C, exon 4
758 894 troponin C, exon 5
1506 1537 troponin C, exon 6
pre-msg < 1 1668 troponin C mRNA and introns
IVS < 1 50 troponin C intron A
IVS 103 246 troponin C intron B
IVS 391 478 troponin C intron C
IVS 594 757 troponin C intron D
IVS 895 1505 troponin C intron E
BASE COUNT 358 a 455 c 628 g 331 t 1 others
ORIGIN About 2.5 kb after segment 1.
1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc
61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag
121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg
181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc
241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg
301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg
361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat
421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc
481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac
541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct
601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag
661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc
721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc
781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg
841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg
901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct
961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc
1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta
1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat
1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac
1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc
1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa
1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa
1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg
1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc
1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca
1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg
1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc
1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc
1741 ggctggcacc acccggagca cccttgggga tcc
//
LOCUS HUMMHDQ3L 967 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment,
long terminal repeat region.
ACCESSION M33841
KEYWORDS major histocompatibility complex.
SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 967)
AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S.
TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ
locus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.S.Lee, 25-APR-1990.
FEATURES from to/span description
rpt 1 7 inverted repeat A
rpt 961 967 inverted repeat B
rpt 560 810 R region
signal 792 797 poly-A signal
site 74 79 1/2 GRE
site 80 87 enhancer core
site 530 536 TATA box
BASE COUNT 254 a 235 c 222 g 256 t
ORIGIN Chromosome 6p21.3.
1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct
121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa
181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc
301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt
361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa
421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa
481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg
541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg
601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc
661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg
721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc
781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga
841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct
901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac
961 ccctaca
//
LOCUS HUMMHDQ5L 960 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment,
long terminal repeat region.
ACCESSION M33842
KEYWORDS major histocompatibility complex.
SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 960)
AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S.
TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ
locus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.S.Lee, 25-APR-1990.
FEATURES from to/span description
rpt 1 7 inverted repeat A
rpt 954 960 inverted repeat B
rpt 555 803 R region
signal 785 790 poly-A signal
site 74 79 1/2 GRE
site 80 87 enhancer core
site 525 531 TATA box
BASE COUNT 250 a 232 c 219 g 259 t
ORIGIN Chromosome 6p21.3.
1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct
121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa
181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag
301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa
361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt
421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa
481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt
541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca
601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac
661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct
721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat
781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg
841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa
901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca
//
LOCUS SYNLACZA 6476 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD1.27.
ACCESSION M34296
KEYWORDS lacZ.
SOURCE Cloning vector pPD1.27.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6476)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3500 3501 E.coli lacZ end/SV40 start
recomb 3634 3635 SV40 end/synthetic start
recomb 3657 3658 synthetic end/pUC19 start
recomb 4076 4077 pUC19 end/C.elegans sup-7 start
recomb 4442 4443 C.elegans sup-7 end/pUC19 start
BASE COUNT 1560 a 1646 c 1676 g 1594 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc agctgagcgc cggtcgctac cattaccagt tggtctggtg tcaaaaataa
3241 taataaccgg gcaggccatg tctgcccgta tttcgcgtaa ggaaatccat tatgtactat
3301 ttaaaaaaca caaacttttg gatgttcggt ttattctttt tcttttactt ttttatcatg
3361 ggagcctact tcccgttttt cccgatttgg ctacatgaca tcaaccatat cagcaaaagt
3421 gatacgggta ttatttttgc cgctatttct ctgttctcgc tattattcca accgctgttt
3481 ggtctgcttt ctgacaaact cggaacttgt ttattgcagc ttataatggt tacaaataaa
3541 gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt
3601 tgtccaaact catcaatgta tcttatcatg tctggatcga caaagtcaaa gcggccgcct
3661 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct
3721 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc
3781 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt
3841 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa
3901 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
3961 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
4021 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa
4081 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc
4141 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt
4201 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt
4261 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt
4321 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac
4381 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg
4441 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt
4501 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc
4561 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat
4621 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct
4681 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca
4741 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg
4801 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa
4861 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg
4921 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga
4981 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg
5041 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt
5101 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg
5161 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc
5221 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca
5281 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc
5341 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat
5401 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc
5461 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg
5521 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct
5581 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct
5641 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct
5701 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg
5761 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc
5821 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga
5881 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg
5941 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
6001 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg
6061 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg
6121 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat
6181 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc
6241 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc
6301 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa
6361 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc
6421 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS SYNLACZB 6096 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD8.02.
ACCESSION M34297
KEYWORDS lacZ.
SOURCE Cloning vector pPD8.02.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6096)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 3275 3276 synthetic end/unknown DNA start
recomb 3696 3697 unknown DNA end/C.elegans sup-7 end
recomb 4062 4063 C.elegans sup-7 end/pUC19 start
BASE COUNT 1458 a 1571 c 1616 g 1451 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggcctact agtcggccgt acgggccctt aaggccgcct gatgcggtat tttctcctta
3301 cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg
3361 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt
3421 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc
3481 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat
3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg
3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc
3661 tcatgagaca ataaccctga taaatgcttc aataatacaa ttttcagaat acgttttttg
3721 tgggcttggg tatattgttt ttaatgttat acttgcagtc gtgaaatttg attttcaaat
3781 ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt gtcaaaaacc aatttcaaca
3841 aattttcgtg tgagaaatac attaccagaa ggcatttttt cacacgatta gcattttgga
3901 ctactttatt aaatttttgc gtgtaatttt gaattaaatt gtattatatt actacttaaa
3961 aaacaaaaaa tttgaccact gagcggatcg aacgcccaac ctttcgatct agagtcgaac
4021 gcgctaccat tgcgccaagc agtcatgtta ttctctcttg tcattgaaaa aggaagagta
4081 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg
4141 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac
4201 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg
4261 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc
4321 gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg
4381 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat
4441 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg
4501 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg
4561 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc
4621 ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt
4681 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct
4741 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc
4801 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca
4861 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct
4921 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt
4981 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga
5041 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca
5101 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac
5161 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg
5221 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag
5281 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac
5341 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt
5401 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg
5461 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc
5521 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc
5581 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc
5641 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa
5701 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt
5761 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg
5821 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag
5881 agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc
5941 acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc
6001 tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa
6061 ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS SYNLACZC 7376 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD8.33.
ACCESSION M34298
KEYWORDS lacZ.
SOURCE Cloning vector pPD8.33.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 7376)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 3244 3245 synthetic end/unknown DNA start
recomb 3524 3425 unknown DNA end/C.elegans sup-7 end
recomb 4555 4556 synthetic end/pUC19 start
recomb 4976 4977 pUC19 end/C.elegans sup-7 start
recomb 5342 5343 C.elegans sup-7 end/pUC19 start
BASE COUNT 1863 a 1793 c 1814 g 1906 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat
3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat
3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac
3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg
3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca
3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca
3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc
3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt
3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg
3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa
3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga
3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa
3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa
4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg
4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg
4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa
4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa
4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc
4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta
4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa
4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca
4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt aaggccgcct
4561 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct
4621 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc
4681 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt
4741 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa
4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa
4981 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc
5041 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt
5101 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt
5161 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt
5221 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac
5281 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg
5341 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt
5401 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc
5461 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat
5521 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct
5581 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca
5641 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg
5701 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa
5761 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg
5821 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga
5881 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg
5941 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt
6001 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg
6061 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc
6121 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca
6181 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc
6241 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat
6301 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc
6361 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg
6421 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct
6481 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct
6541 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct
6601 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg
6661 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc
6721 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga
6781 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg
6841 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
6901 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg
6961 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg
7021 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat
7081 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc
7141 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc
7201 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa
7261 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc
7321 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS SYNLACZD 5730 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD16.43.
ACCESSION M34299
KEYWORDS lacZ.
SOURCE Cloning vector pPD16.43.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 5730)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 3264 3265 synthetic end/unknown DNA start
recomb 3513 3514 unknown DNA end/synthetic start
recomb 3524 3525 synthetic end/pUC19 start
BASE COUNT 1348 a 1488 c 1580 g 1314 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggcctact agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga
3301 aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg
3361 gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa
3421 ctatgcggca tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca
3481 cagatgcgta aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat
3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg
3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc
3661 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta
3721 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg
3781 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg
3841 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac
3901 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg
3961 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt
4021 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg
4081 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac
4141 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt
4201 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag
4261 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc
4321 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc
4381 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta
4441 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg
4501 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga
4561 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac
4621 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa
4681 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat
4741 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc
4801 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg
4861 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc
4921 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg
4981 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg
5041 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa
5101 cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg
5161 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga
5221 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct
5281 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca
5341 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc
5401 ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg
5461 ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc
5521 caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca
5581 ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc
5641 attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga
5701 gcggataaca atttcacaca ggaaacagct
//
LOCUS SYNLACZE 7010 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD16.51.
ACCESSION M34300
KEYWORDS lacZ.
SOURCE Cloning vector pPD16.51.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 7010)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 4544 4545 synthetic end/pUC19 start
recomb 4793 4794 pUC19 end/synthetic start
recomb 3244 3245 synthetic end/unknown DNA start
recomb 4524 4525 unknown DNA end/synthetic start
recomb 4804 4805 synthetic end/pUC19 start
BASE COUNT 1753 a 1710 c 1778 g 1769 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat
3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat
3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac
3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg
3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca
3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca
3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc
3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt
3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg
3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa
3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga
3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa
3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa
4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg
4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg
4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa
4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa
4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc
4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta
4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa
4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca
4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt tcgtctcgcg
4561 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct
4621 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc
4681 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat
4741 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcggccttaa
4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg
4981 aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc
5041 attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga
5101 tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga
5161 gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg
5221 cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc
5281 tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac
5341 agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact
5401 tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca
5461 tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg
5521 tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact
5581 acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg
5641 accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg
5701 tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat
5761 cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc
5821 tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat
5881 actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt
5941 tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc
6001 cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt
6061 gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac
6121 tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt
6181 gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct
6241 gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga
6301 ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac
6361 acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagcattg
6421 agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt
6481 cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc
6541 tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg
6601 gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc
6661 ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc
6721 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag
6781 cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca
6841 ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat
6901 taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg
6961 tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct
//
LOCUS SYNLACZF 7088 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD18.32.
ACCESSION M34301
KEYWORDS lacZ.
SOURCE Cloning vector pPD18.32.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 7088)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 180 181 synthetic end/E.coli trpS start
recomb 264 265 E.coli trpS end/synthetic start
recomb 268 269 synthetic end/E.coli lacZ start
recomb 3262 3263 E.coli lacZ end/synthetic start
recomb 4622 4623 synthetic end/pUC19 start
recomb 4871 4872 pUC19 end/synthetic start
recomb 3322 3323 synthetic end/unknown DNA start
recomb 4602 4603 unknown DNA end/synthetic start
recomb 4882 4883 synthetic end/pUC19 start
BASE COUNT 1780 a 1724 c 1797 g 1787 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct
3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc
3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc
3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata
3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta
3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac
3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg
3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa
3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc
3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt
3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg
3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta
4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttaaa
4081 attaaacatt tttctaaatt ttatatgatt tcttttaaat ttgcaaaaat tacttaaatt
4141 tgaattcccg cgcaaatgag tgacttcatt ttctgcatta ttgtgttttc cggctatatt
4201 aataggtatt tgtttgtgtt tttctttatt ttatgattcg aactccaatt tgtaaatttt
4261 cgaacatatt tccctaaaga aaaaatatga ttaatctgga aaaattggaa aattattttt
4321 caaataaaaa acaaagaaaa aaatgaagaa aaacctatta gtttggccat aaaacgcaaa
4381 aatgtcgaaa atgacgtcac tcatctgcgc gggaaatcaa gaataattcg gcctttttta
4441 tttttttgga aaatcgtaaa acatttagaa aaatttttta atagttatag tgggactgta
4501 ttctgtcatt tagggcaaaa gccagagacg ctactccacc gttaacatga attatgaata
4561 ttattgcgac aagacccaaa cattgataaa ccgcaaatct agcctactag tcggccgtac
4621 gggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc
4681 ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc
4741 gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc agagcagatt
4801 gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac
4861 cgcatcaggc ggccttaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat
4921 aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat
4981 ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata
5041 aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct
5101 tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa
5161 agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa
5221 cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt
5281 taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg
5341 tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca
5401 tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa
5461 cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt
5521 gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc
5581 cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa
5641 actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga
5701 ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc
5761 tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga
5821 tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga
5881 acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga
5941 ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat
6001 ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt
6061 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct
6121 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc
6181 ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc
6241 aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc
6301 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc
6361 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg
6421 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata
6481 cctacagcgt gagcattgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta
6541 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc
6601 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg
6661 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt
6721 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt
6781 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga
6841 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc
6901 cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg
6961 cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca
7021 ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg
7081 aaacagct
//
LOCUS SYNLACZG 6563 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD21.28.
ACCESSION M34302
KEYWORDS lacZ.
SOURCE Cloning vector pPD21.28.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6563)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 180 181 synthetic end/E.coli trpS start
recomb 264 265 E.coli trpS end/synthetic start
recomb 268 269 synthetic end/E.coli lacZ start
recomb 3262 3263 E.coli lacZ end/synthetic start
recomb 4097 4098 synthetic end/pUC19 start
recomb 4346 4347 pUC19 end/synthetic start
recomb 3322 3323 synthetic end/unknown DNA start
recomb 4077 4078 unknown DNA end/synthetic start
recomb 4357 4358 synthetic end/pUC19 start
BASE COUNT 1587 a 1650 c 1723 g 1603 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct
3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc
3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc
3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata
3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta
3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac
3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg
3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa
3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc
3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt
3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg
3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta
4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttcct
4081 actagtcggc cgtacgggcc ctttcgtctc gcgcgtttcg gtgatgacgg tgaaaacctc
4141 tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga
4201 caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct taactatgcg
4261 gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc
4321 gtaaggagaa aataccgcat caggcggcct taagggcctc gtgatacgcc tatttttata
4381 ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt
4441 gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag
4501 acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca
4561 tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc
4621 agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat
4681 cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc
4741 aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg
4801 gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc
4861 agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat
4921 aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga
4981 gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc
5041 ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc
5101 aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt
5161 aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc
5221 tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc
5281 agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca
5341 ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca
5401 ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt
5461 ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta
5521 acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg
5581 agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc
5641 ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag
5701 cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa
5761 gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc
5821 cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc
5881 gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta
5941 caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc ccgaagggag
6001 aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct
6061 tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga
6121 gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc
6181 ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt
6241 atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg
6301 cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg
6361 caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc
6421 cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc
6481 accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata
6541 acaatttcac acaggaaaca gct
//
LOCUS SYNLACZH 6562 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD22.04.
ACCESSION M34303
KEYWORDS lacZ.
SOURCE Cloning vector pPD22.04.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6562)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 179 180 synthetic end/E.coli trpS start
recomb 263 264 E.coli trpS end/synthetic start
recomb 267 268 synthetic end/E.coli lacZ start
recomb 3261 3262 E.coli lacZ end/synthetic start
recomb 4096 4097 synthetic end/pUC19 start
recomb 4345 4346 pUC19 end/synthetic start
recomb 3321 3322 synthetic end/unknown DNA start
recomb 4076 4077 unknown DNA end/synthetic start
recomb 4356 4357 synthetic end/pUC19 start
BASE COUNT 1587 a 1650 c 1722 g 1603 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt gagggtaccg agctcagaaa aaatgactgc tccaaagaag aagcgtaagg
181 taccggtggg tgaagaccag aaacagcacc tcgaactgag ccgcgatatt gcccagcgtt
241 tcaacgcgct gtatggcgag atcgatcccg tcgttttaca acgtcgtgac tgggaaaacc
301 ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata
361 gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc
421 gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag tgcgatcttc
481 ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac gatgcgccca
541 tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc acggagaatc
601 cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag gaaggccaga
661 cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac gggcgctggg
721 tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca tttttacgcg
781 ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt tatctggaag
841 atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg cataaaccga
901 ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc agccgcgctg
961 tactggaggc tgaagttcag atgtgcggcg agttgcgtga ctacctacgg gtaacagttt
1021 ctttatggca gggtgaaacg caggtcgcca gcggcaccgc gcctttcggc ggtgaaatta
1081 tcgatgagcg tggtggttat gccgatcgcg tcacactacg tctgaacgtc gaaaacccga
1141 aactgtggag cgccgaaatc ccgaatctct atcgtgcggt ggttgaactg cacaccgccg
1201 acggcacgct gattgaagca gaagcctgcg atgtcggttt ccgcgaggtg cggattgaaa
1261 atggtctgct gctgctgaac ggcaagccgt tgctgattcg aggcgttaac cgtcacgagc
1321 atcatcctct gcatggtcag gtcatggatg agcagacgat ggtgcaggat atcctgctga
1381 tgaagcagaa caactttaac gccgtgcgct gttcgcatta tccgaaccat ccgctgtggt
1441 acacgctgtg cgaccgctac ggcctgtatg tggtggatga agccaatatt gaaacccacg
1501 gcatggtgcc aatgaatcgt ctgaccgatg atccgcgctg gctaccggcg atgagcgaac
1561 gcgtaacgcg aatggtgcag cgcgatcgta atcacccgag tgtgatcatc tggtcgctgg
1621 ggaatgaatc aggccacggc gctaatcacg acgcgctgta tcgctggatc aaatctgtcg
1681 atccttcccg cccggtgcag tatgaaggcg gcggagccga caccacggcc accgatatta
1741 tttgcccgat gtacgcgcgc gtggatgaag accagccctt cccggctgtg ccgaaatggt
1801 ccatcaaaaa atggctttcg ctacctggag agacgcgccc gctgatcctt tgcgaatacg
1861 cccacgcgat gggtaacagt cttggcggtt tcgctaaata ctggcaggcg tttcgtcagt
1921 atccccgttt acagggcggc ttcgtctggg actgggtgga tcagtcgctg attaaatatg
1981 atgaaaacgg caacccgtgg tcggcttacg gcggtgattt tggcgatacg ccgaacgatc
2041 gccagttctg tatgaacggt ctggtctttg ccgaccgcac gccgcatcca gcgctgacgg
2101 aagcaaaaca ccagcagcag tttttccagt tccgtttatc cgggcaaacc atcgaagtga
2161 ccagcgaata cctgttccgt catagcgata acgagctcct gcactggatg gtggcgctgg
2221 atggtaagcc gctggcaagc ggtgaagtgc ctctggatgt cgctccacaa ggtaaacagt
2281 tgattgaact gcctgaacta ccgcagccgg agagcgccgg gcaactctgg ctcacagtac
2341 gcgtagtgca accgaacgcg accgcatggt cagaagccgg gcacatcagc gcctggcagc
2401 agtggcgtct ggcggaaaac ctcagtgtga cgctccccgc cgcgtcccac gccatcccgc
2461 atctgaccac cagcgaaatg gatttttgca tcgagctggg taataagcgt tggcaattta
2521 accgccagtc aggctttctt tcacagatgt ggattggcga taaaaaacaa ctgctgacgc
2581 cgctgcgcga tcagttcacc cgtgcaccgc tggataacga cattggcgta agtgaagcga
2641 cccgcattga ccctaacgcc tgggtcgaac gctggaaggc ggcgggccat taccaggccg
2701 aagcagcgtt gttgcagtgc acggcagata cacttgctga tgcggtgctg attacgaccg
2761 ctcacgcgtg gcagcatcag gggaaaacct tatttatcag ccggaaaacc taccggattg
2821 atggtagtgg tcaaatggcg attaccgttg atgttgaagt ggcgagcgat acaccgcatc
2881 cggcgcggat tggcctgaac tgccagctgg cgcaggtagc agagcgggta aactggctcg
2941 gattagggcc gcaagaaaac tatcccgacc gccttactgc cgcctgtttt gaccgctggg
3001 atctgccatt gtcagacatg tataccccgt acgtcttccc gagcgaaaac ggtctgcgct
3061 gcgggacgcg cgaattgaat tatggcccac accagtggcg cggcgacttc cagttcaaca
3121 tcagccgcta cagtcaacag caactgatgg aaaccagcca tcgccatctg ctgcacgcgg
3181 aagaaggcac atggctgaat atcgacggtt tccatatggg gattggtggc gacgactcct
3241 ggagcccgtc agtatcggcg gaattccaac tgagcgccgg tcgctaccat taccaacttg
3301 tctggtgtca aaaataatag gggccgctgt catcagatcg ccatctcgcg cccgtgcctc
3361 tgacttctaa gtccaattac tcttcaacat ccctacatgc tctttctccc tgtgctccca
3421 ccccctattt ttgttattat caaaaaaact tcttcttaat ttctttgttt tttagcttct
3481 tttaagtcac ctctaacaat gaaattgtgt agattcaaaa atagaattaa ttcgtaataa
3541 aaagtcgaaa aaaattgtgc tccctccccc cattaataat aattctatcc caaaatctac
3601 acaatgttct gtgtacactt cttatgtttt ttttacttct gataaatttt ttttgaaaca
3661 tcatagaaaa aaccgcacac aaaatacctt atcatatgtt acgtttcagt ttatgaccgc
3721 aatttttatt tcttcgcacg tctgggcctc tcatgacgtc aaatcatgct catcgtgaaa
3781 aagttttgga gtatttttgg aatttttcaa tcaagtgaaa gtttatgaaa ttaattttcc
3841 tgcttttgct ttttgggggt ttcccctatt gtttgtcaag agtttcgagg acggcgtttt
3901 tcttgctaaa atcacaagta ttgatgagca cgatgcaaga aagatcggaa gaaggtttgg
3961 gtttgaggct cagtggaagg tgagtagaag ttgataattt gaaagtggag tagtgtctat
4021 ggggtttttg ccttaaatga cagaatacat tcccaatata ccaaacataa ctgtttccta
4081 ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct
4141 gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac
4201 aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg
4261 catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg
4321 taaggagaaa ataccgcatc aggcggcctt aagggcctcg tgatacgcct atttttatag
4381 gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg
4441 cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga
4501 caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat
4561 ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca
4621 gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc
4681 gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca
4741 atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg
4801 caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca
4861 gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata
4921 accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag
4981 ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg
5041 gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca
5101 acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta
5161 atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct
5221 ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca
5281 gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag
5341 gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat
5401 tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt
5461 taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa
5521 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga
5581 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg
5641 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc
5701 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag
5761 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc
5821 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg
5881 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac
5941 accgaactga gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga
6001 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt
6061 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag
6121 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg
6181 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta
6241 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc
6301 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc
6361 aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc
6421 gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca
6481 ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa
6541 caatttcaca caggaaacag ct
//
LOCUS SYNLACZI 6567 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD22.11.
ACCESSION M34304
KEYWORDS lacZ.
SOURCE Cloning vector pPD22.11.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6567)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 184 185 synthetic end/E.coli trpS start
recomb 268 269 E.coli trpS end/synthetic start
recomb 272 273 synthetic end/E.coli lacZ start
recomb 3266 3267 E.coli lacZ end/synthetic start
recomb 4101 4102 synthetic end/pUC19 start
recomb 4350 4351 pUC19 end/synthetic start
recomb 3326 3327 synthetic end/unknown DNA start
recomb 4081 4082 unknown DNA end/synthetic start
recomb 4361 4362 synthetic end/pUC19 start
BASE COUNT 1587 a 1651 c 1724 g 1605 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt gcttggaggg taccgagctc agaaaaaatg actgctccaa agaagaagcg
181 taaggtaccg gtgggtgaag accagaaaca gcacctcgaa ctgagccgcg atattgccca
241 gcgtttcaac gcgctgtatg gcgagatcga tcccgtcgtt ttacaacgtc gtgactggga
301 aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg
361 taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga
421 atggcgcttt gcctggtttc cggcaccaga agcggtgccg gaaagctggc tggagtgcga
481 tcttcctgag gccgatactg tcgtcgtccc ctcaaactgg cagatgcacg gttacgatgc
541 gcccatctac accaacgtaa cctatcccat tacggtcaat ccgccgtttg ttcccacgga
601 gaatccgacg ggttgttact cgctcacatt taatgttgat gaaagctggc tacaggaagg
661 ccagacgcga attatttttg atggcgttaa ctcggcgttt catctgtggt gcaacgggcg
721 ctgggtcggt tacggccagg acagtcgttt gccgtctgaa tttgacctga gcgcattttt
781 acgcgccgga gaaaaccgcc tcgcggtgat ggtgctgcgt tggagtgacg gcagttatct
841 ggaagatcag gatatgtggc ggatgagcgg cattttccgt gacgtctcgt tgctgcataa
901 accgactaca caaatcagcg atttccatgt tgccactcgc tttaatgatg atttcagccg
961 cgctgtactg gaggctgaag ttcagatgtg cggcgagttg cgtgactacc tacgggtaac
1021 agtttcttta tggcagggtg aaacgcaggt cgccagcggc accgcgcctt tcggcggtga
1081 aattatcgat gagcgtggtg gttatgccga tcgcgtcaca ctacgtctga acgtcgaaaa
1141 cccgaaactg tggagcgccg aaatcccgaa tctctatcgt gcggtggttg aactgcacac
1201 cgccgacggc acgctgattg aagcagaagc ctgcgatgtc ggtttccgcg aggtgcggat
1261 tgaaaatggt ctgctgctgc tgaacggcaa gccgttgctg attcgaggcg ttaaccgtca
1321 cgagcatcat cctctgcatg gtcaggtcat ggatgagcag acgatggtgc aggatatcct
1381 gctgatgaag cagaacaact ttaacgccgt gcgctgttcg cattatccga accatccgct
1441 gtggtacacg ctgtgcgacc gctacggcct gtatgtggtg gatgaagcca atattgaaac
1501 ccacggcatg gtgccaatga atcgtctgac cgatgatccg cgctggctac cggcgatgag
1561 cgaacgcgta acgcgaatgg tgcagcgcga tcgtaatcac ccgagtgtga tcatctggtc
1621 gctggggaat gaatcaggcc acggcgctaa tcacgacgcg ctgtatcgct ggatcaaatc
1681 tgtcgatcct tcccgcccgg tgcagtatga aggcggcgga gccgacacca cggccaccga
1741 tattatttgc ccgatgtacg cgcgcgtgga tgaagaccag cccttcccgg ctgtgccgaa
1801 atggtccatc aaaaaatggc tttcgctacc tggagagacg cgcccgctga tcctttgcga
1861 atacgcccac gcgatgggta acagtcttgg cggtttcgct aaatactggc aggcgtttcg
1921 tcagtatccc cgtttacagg gcggcttcgt ctgggactgg gtggatcagt cgctgattaa
1981 atatgatgaa aacggcaacc cgtggtcggc ttacggcggt gattttggcg atacgccgaa
2041 cgatcgccag ttctgtatga acggtctggt ctttgccgac cgcacgccgc atccagcgct
2101 gacggaagca aaacaccagc agcagttttt ccagttccgt ttatccgggc aaaccatcga
2161 agtgaccagc gaatacctgt tccgtcatag cgataacgag ctcctgcact ggatggtggc
2221 gctggatggt aagccgctgg caagcggtga agtgcctctg gatgtcgctc cacaaggtaa
2281 acagttgatt gaactgcctg aactaccgca gccggagagc gccgggcaac tctggctcac
2341 agtacgcgta gtgcaaccga acgcgaccgc atggtcagaa gccgggcaca tcagcgcctg
2401 gcagcagtgg cgtctggcgg aaaacctcag tgtgacgctc cccgccgcgt cccacgccat
2461 cccgcatctg accaccagcg aaatggattt ttgcatcgag ctgggtaata agcgttggca
2521 atttaaccgc cagtcaggct ttctttcaca gatgtggatt ggcgataaaa aacaactgct
2581 gacgccgctg cgcgatcagt tcacccgtgc accgctggat aacgacattg gcgtaagtga
2641 agcgacccgc attgacccta acgcctgggt cgaacgctgg aaggcggcgg gccattacca
2701 ggccgaagca gcgttgttgc agtgcacggc agatacactt gctgatgcgg tgctgattac
2761 gaccgctcac gcgtggcagc atcaggggaa aaccttattt atcagccgga aaacctaccg
2821 gattgatggt agtggtcaaa tggcgattac cgttgatgtt gaagtggcga gcgatacacc
2881 gcatccggcg cggattggcc tgaactgcca gctggcgcag gtagcagagc gggtaaactg
2941 gctcggatta gggccgcaag aaaactatcc cgaccgcctt actgccgcct gttttgaccg
3001 ctgggatctg ccattgtcag acatgtatac cccgtacgtc ttcccgagcg aaaacggtct
3061 gcgctgcggg acgcgcgaat tgaattatgg cccacaccag tggcgcggcg acttccagtt
3121 caacatcagc cgctacagtc aacagcaact gatggaaacc agccatcgcc atctgctgca
3181 cgcggaagaa ggcacatggc tgaatatcga cggtttccat atggggattg gtggcgacga
3241 ctcctggagc ccgtcagtat cggcggaatt ccaactgagc gccggtcgct accattacca
3301 acttgtctgg tgtcaaaaat aataggggcc gctgtcatca gatcgccatc tcgcgcccgt
3361 gcctctgact tctaagtcca attactcttc aacatcccta catgctcttt ctccctgtgc
3421 tcccaccccc tatttttgtt attatcaaaa aaacttcttc ttaatttctt tgttttttag
3481 cttcttttaa gtcacctcta acaatgaaat tgtgtagatt caaaaataga attaattcgt
3541 aataaaaagt cgaaaaaaat tgtgctccct ccccccatta ataataattc tatcccaaaa
3601 tctacacaat gttctgtgta cacttcttat gtttttttta cttctgataa attttttttg
3661 aaacatcata gaaaaaaccg cacacaaaat accttatcat atgttacgtt tcagtttatg
3721 accgcaattt ttatttcttc gcacgtctgg gcctctcatg acgtcaaatc atgctcatcg
3781 tgaaaaagtt ttggagtatt tttggaattt ttcaatcaag tgaaagttta tgaaattaat
3841 tttcctgctt ttgctttttg ggggtttccc ctattgtttg tcaagagttt cgaggacggc
3901 gtttttcttg ctaaaatcac aagtattgat gagcacgatg caagaaagat cggaagaagg
3961 tttgggtttg aggctcagtg gaaggtgagt agaagttgat aatttgaaag tggagtagtg
4021 tctatggggt ttttgcctta aatgacagaa tacattccca atataccaaa cataactgtt
4081 tcctactagt cggccgtacg ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa
4141 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag
4201 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta
4261 tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag
4321 atgcgtaagg agaaaatacc gcatcaggcg gccttaaggg cctcgtgata cgcctatttt
4381 tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa
4441 atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca
4501 tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc
4561 aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc
4621 acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt
4681 acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt
4741 ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg
4801 ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact
4861 caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg
4921 ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga
4981 aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg
5041 aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa
5101 tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac
5161 aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc
5221 cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca
5281 ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga
5341 gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta
5401 agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc
5461 atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc
5521 cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt
5581 cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac
5641 cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct
5701 tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact
5761 tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg
5821 ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata
5881 aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga
5941 cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag
6001 ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg
6061 agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac
6121 ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca
6181 acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg
6241 cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc
6301 gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa
6361 tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt
6421 ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt
6481 aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg
6541 gataacaatt tcacacagga aacagct
//
LOCUS SYNLACZJ 7242 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD26.77.
ACCESSION M34305
KEYWORDS lacZ.
SOURCE Cloning vector pPD26.77.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 7242)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 102 103 synthetic end/E.coli trpS start
recomb 186 187 E.coli trpS end/synthetic start
recomb 190 191 synthetic end/E.coli lacZ start
recomb 3184 3185 E.coli lacZ end/synthetic start
recomb 4776 4777 synthetic end/pUC19 start
recomb 5025 5026 pUC19 end/synthetic start
recomb 3244 3245 synthetic end/unknown DNA start
recomb 4756 4757 unknown DNA end/synthetic start
recomb 5036 5037 synthetic end/pUC19 start
BASE COUNT 1780 a 1801 c 1846 g 1815 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
3241 taggcgaaac aaatcatctg acaccaccac cgtctgatgg atcgttctca tctccgtctc
3301 cacattatta tccgacgact acatcgacac cgaatcgaat ggaaacaagt ccggagtaca
3361 tgtttaacca tgaaatggtg ggtagatgat tattaaaatg tttaagaaaa ttaaataatt
3421 tgttttaggc accaccggtc aatgcgatgt ggtatactac accacctcct tatcaagatc
3481 caaactatcg tcatgtgcct ccaaatactg catttcaaaa tgcagagcaa atgaatggct
3541 ccttctactg ttaatctatt taattcatta atttttcatt tattgactgt atcccggatg
3601 tttcttgtcc tcccaacata tctcctaact gctcggttca ttttaaatat gctcatctca
3661 ctacatcacc cagacactgg tccccacaga gttttttgta tactatttcg ggtcattttt
3721 cttattctag actaatattg taagctataa gttgtagaat aattattgat ccaaatcaga
3781 ttaagagtat aagctttgtt ttttctcctt ttctttataa cttgttacaa tttttgaaat
3841 tccctttttt gacaggcttt tattacactg taactgtgtt tcttatcttg caaacattta
3901 atgaattgta attctttagt atcttgaggg ctttttgttt ttcgaattat tgaagctcaa
3961 agttccagtt ttactacgat ccagcgaatt ctcctcattt cgatccgatg caattgactt
4021 cagatcaata ttggttgcct gaaagaaata attgtgagca tttttgtcaa aaaacagaga
4081 actcaccatt ctcgaggctc ccgttccagg agcagtactt ggtgatggac acgtagattg
4141 attaaaccaa accaaaggtt ctttcagagt caacttacag cctcgagcgt agtccgtgat
4201 agcttctcgc agaacactga aaattggaaa tttattggaa taaaaacttt ttctgcactt
4261 tatagaataa aaaaatcatg aatttacccg aatttaacct ccgaatcgta gaccaaattg
4321 tccaagtaga tggaaatcac cttgaacatc ggatgttttt catatgctga aaataaatta
4381 atgaatttat gtaatttttt aaataattac ttttcaattt ggtgaacaat tcctgcttct
4441 ttgcataggc atctggacga gtgagtcctt tccaatcaat caatgtggtg tcgacctcga
4501 gggggggccc ggtacccagc ttttgttccc tttagtgagg gttaattccg agcttggcgt
4561 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca
4621 taggagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagg taactcacat
4681 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt
4741 aatgaatcgg ccaacgccta ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg
4801 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta
4861 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg
4921 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg
4981 tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcggcctt aagggcctcg
5041 tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg
5101 gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa
5161 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga
5221 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc
5281 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg
5341 gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc
5401 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat
5461 tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg
5521 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag
5581 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa
5641 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc
5701 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca
5761 cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc
5821 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc
5881 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg
5941 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta
6001 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag
6061 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga
6121 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc
6181 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa
6241 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa
6301 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc
6361 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt
6421 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc
6481 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac
6541 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca
6601 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcat tgagaaagcg
6661 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag
6721 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt
6781 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat
6841 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc
6901 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt
6961 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag
7021 cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca
7081 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga
7141 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt
7201 gtggaattgt gagcggataa caatttcaca caggaaacag ct
//
LOCUS SYNLACZK 6620 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD34.110.
ACCESSION M34306
KEYWORDS lacZ.
SOURCE Cloning vector pPD34.110.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6620)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 237 238 synthetic end/E.coli trpS start
recomb 321 322 E.coli trpS end/synthetic start
recomb 325 326 synthetic end/E.coli lacZ start
recomb 3319 3320 E.coli lacZ end/synthetic start
recomb 4154 4155 synthetic end/pUC19 start
recomb 4403 4404 pUC19 end/synthetic start
recomb 3379 3380 synthetic end/unknown DNA start
recomb 4134 4135 unknown DNA end/synthetic start
recomb 4414 4415 synthetic end/pUC19 start
BASE COUNT 1592 a 1665 c 1731 g 1632 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt ggagggtacc tcgagaaagc tggcaaaggg ctcttgtcct gctaatcgta
181 ctactcttca tcgtcatctt cgttattact gttttgttcg tcataagatc taacaaggta
241 ccggtgggtg aagaccagaa acagcacctc gaactgagcc gcgatattgc ccagcgtttc
301 aacgcgctgt atggcgagat cgatcccgtc gttttacaac gtcgtgactg ggaaaaccct
361 ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc
421 gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc
481 tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct
541 gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc
601 tacaccaacg taacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg
661 acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg
721 cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc
781 ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc
841 ggagaaaacc gcctcgcggt gatggtgctg cgttggagtg acggcagtta tctggaagat
901 caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact
961 acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta
1021 ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct
1081 ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc
1141 gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa
1201 ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac
1261 ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat
1321 ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat
1381 catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg
1441 aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac
1501 acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc
1561 atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc
1621 gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg
1681 aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat
1741 ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt
1801 tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc
1861 atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc
1921 cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat
1981 ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat
2041 gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc
2101 cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa
2161 gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc
2221 agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat
2281 ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg
2341 attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc
2401 gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag
2461 tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat
2521 ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac
2581 cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg
2641 ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc
2701 cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa
2761 gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct
2821 cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat
2881 ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg
2941 gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga
3001 ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat
3061 ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc
3121 gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc
3181 agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa
3241 gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg
3301 agcccgtcag tatcggcgga attccaactg agcgccggtc gctaccatta ccaacttgtc
3361 tggtgtcaaa aataataggg gccgctgtca tcagatcgcc atctcgcgcc cgtgcctctg
3421 acttctaagt ccaattactc ttcaacatcc ctacatgctc tttctccctg tgctcccacc
3481 ccctattttt gttattatca aaaaaacttc ttcttaattt ctttgttttt tagcttcttt
3541 taagtcacct ctaacaatga aattgtgtag attcaaaaat agaattaatt cgtaataaaa
3601 agtcgaaaaa aattgtgctc cctcccccca ttaataataa ttctatccca aaatctacac
3661 aatgttctgt gtacacttct tatgtttttt ttacttctga taaatttttt ttgaaacatc
3721 atagaaaaaa ccgcacacaa aataccttat catatgttac gtttcagttt atgaccgcaa
3781 tttttatttc ttcgcacgtc tgggcctctc atgacgtcaa atcatgctca tcgtgaaaaa
3841 gttttggagt atttttggaa tttttcaatc aagtgaaagt ttatgaaatt aattttcctg
3901 cttttgcttt ttgggggttt cccctattgt ttgtcaagag tttcgaggac ggcgtttttc
3961 ttgctaaaat cacaagtatt gatgagcacg atgcaagaaa gatcggaaga aggtttgggt
4021 ttgaggctca gtggaaggtg agtagaagtt gataatttga aagtggagta gtgtctatgg
4081 ggtttttgcc ttaaatgaca gaatacattc ccaatatacc aaacataact gtttcctact
4141 agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga
4201 cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa
4261 gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca
4321 tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta
4381 aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat ttttataggt
4441 taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg
4501 cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca
4561 ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt
4621 ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga
4681 aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga
4741 actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat
4801 gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca
4861 agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt
4921 cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac
4981 catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct
5041 aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga
5101 gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac
5161 aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat
5221 agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg
5281 ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc
5341 actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc
5401 aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg
5461 gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta
5521 atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg
5581 tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga
5641 tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt
5701 ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag
5761 agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa
5821 ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag
5881 tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca
5941 gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac
6001 cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg aagggagaaa
6061 ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc
6121 agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg
6181 tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc
6241 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc
6301 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag
6361 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa
6421 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga
6481 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc
6541 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca
6601 atttcacaca ggaaacagct
//
LOCUS SYNLACZL 5808 bp ds-DNA SYN 17-JUL-1990
DEFINITION Cloning vector pPD16.01.
ACCESSION M34307
KEYWORDS lacZ.
SOURCE Cloning vector pPD16.01.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 5808)
AUTHORS Fire,A.Z., Harrison,S. and Dixon,D.
TITLE A modular set of lac-Z fusion vectors for studying gene expression
in C.elegans
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Z.Fire, 11-MAY-1990.
Author address: A.Z.Fire
Carnegie Inst of Washington
Dept Embryology
115 West Univ Parkway
Baltimore, MD 21210
email: AZF@JHUIGF.BITNET
FEATURES from to/span description
recomb 57 58 pUC19 end/synthetic start
recomb 180 181 synthetic end/E.coli trpS start
recomb 264 265 E.coli trpS end/synthetic start
recomb 268 269 synthetic end/E.coli lacZ start
recomb 3262 3263 E.coli lacZ end/synthetic start
recomb 3342 3343 synthetic end/unknown DNA start
recomb 3591 3592 unknown DNA end/synthetic start
recomb 3602 3603 synthetic end/pUC19 start
BASE COUNT 1375 a 1502 c 1599 g 1332 t
ORIGIN
1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
3301 gtctggtgtc aaaaataata ggcctactag tcggccgtac gggccctttc gtctcgcgcg
3361 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg
3421 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg
3481 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat
3541 gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc ggccttaagg
3601 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt
3661 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac
3721 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa
3781 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat
3841 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc
3901 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga
3961 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg
4021 cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc
4081 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag
4141 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc
4201 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg
4261 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg
4321 acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac
4381 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac
4441 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg
4501 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg
4561 tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg
4621 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac
4681 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg
4741 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg
4801 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc
4861 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc
4921 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt
4981 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc
5041 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact
5101 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac
5161 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag
5221 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg
5281 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg
5341 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga
5401 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt
5461 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct
5521 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg
5581 aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt
5641 aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta
5701 atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta
5761 tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct
//
LOCUS MUSTCAXL 331 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma LD1.
ACCESSION M34194
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 331)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 331 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 71 a 86 c 89 g 85 t
ORIGIN
1 cagtcagtga cgcagcccga tgctcgtgtc actgtctctg aaggagcctc tctgcagctg
61 agatgcaagt attcctcctc tgtgacacct tatctgttct ggtatgtcct gtacccgcgg
121 caggggctgc agctgctcct caagtactat tccggagacc cagtggttca aggagtgaat
181 ggctttgagg ctgagttcag caagagtaac tcttccttcc acctgcggaa agcctccgtg
241 cactggagcg actcggctgt gtacttctgt gctgtgagca tggatggaaa tgagaaaata
301 acttttgggg ctggaaccaa actcaccatt a
//
LOCUS MUSTCAXM 334 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma LD3.
ACCESSION M34196
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 334)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 334 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 97 a 84 c 75 g 78 t
ORIGIN
1 cagcaggtga gacaaagtcc ccaatctctg acagtctggg aaggagagac agcaattctg
61 aactgcagtt atgaggacag cacttttgac tacttcccat ggtaccgtct gttccctggg
121 gaaagccctg cactcctgat agccatacgt ccagtgtcca ataaaaagga agatggacga
181 ttcacaatct tcttcaataa aagggagaaa aagctctcct tgcacatcac agactctcag
241 cctggagact cagctaccta cttctgtgca gcaagaagta caggctttgc aagtgcgctg
301 acatttggat ctggcacaaa agtcattgtt ctac
//
LOCUS MUSTCAXN 327 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-14.
ACCESSION M34198
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 327)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 327 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 92 a 80 c 85 g 70 t
ORIGIN
1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata
61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga
121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg
181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa
241 gagtcagact cggctgtgta ctactgtgct ctgagtgatc agcgggggaa gcttatcttt
301 ggacagggaa ccaagttatc tatcaag
//
LOCUS MUSTCAXO 324 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-27.
ACCESSION M34200
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 324 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 92 a 80 c 83 g 69 t
ORIGIN
1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata
61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga
121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg
181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa
241 gagtcagact cggctgtgta ctactgtgct ctgaggagca actatcagtt gatctggggc
301 tctgggacca agctaattat aaag
//
LOCUS MUSTCAXP 297 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-6.
ACCESSION M34202
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 297)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 297 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 85 a 73 c 74 g 65 t
ORIGIN
1 gtgacattat ctgaaggaac ttctctgact gtgaactgtt cctatgaaac caaacagtac
61 ccaaccctgt tctggtatgt gcagtatccc ggagaaggtc cacagctcct ctttaaagtc
121 ccaaaggcca acgagaaggg aagcagcaga gggtttgaag ccacatacaa taaagaagcc
181 acctccttcc acttgcagaa agcctcagtg caagagtcag actcggctgt gtactactgt
241 gctctgagtg atcgggggac caatacaggc aaattaacct ttggggatgg gaccgtg
//
LOCUS MUSTCAXQ 193 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, T-cell clone V2.1.
ACCESSION M34204
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 193)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 193 T-cell receptor alpha-chain (AA at 2)
BASE COUNT 56 a 45 c 49 g 43 t
ORIGIN
1 cctctttaaa gtcccaaagg ccaacgagaa gggaagcagc agagggtttg aagccacata
61 caataaagaa gccacctcct tccacttgca gaaagcctca gtgcaagagt cagactcggc
121 tgtgtactac tgtgctctga gtggaggcaa taataagctg acttttggtc aaggaaccgt
181 tctgagtgtt ctg
//
LOCUS MUSTCAXR 333 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-33.
ACCESSION M34206
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 333)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 333 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 84 a 96 c 78 g 74 t 1 others
ORIGIN
1 gactccgtga cccagacaga aggcctggtc actgtcaccg aggggttgcc tgtgaagctg
61 aactgcacct atcagactac ttatttaact attgcctttt tctggtatgt gcaatatctc
121 aacgaagccc ctcaggtact cctgcggagc tccacagaca acaagaggac cgagcaccaa
181 gggttccacg ccactctcna taagagcagc agctccttcc atctgcagaa gtcctcagcg
241 cagctgtcag actctgccct gtactactgt gctctgagga atacaggagg tgcagataga
301 ctcacctttg ggaaaggaac tcagctgatc atc
//
LOCUS MUSTCAXS 339 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma MT1-7.
ACCESSION M34208
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 339)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 339 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 86 a 103 c 75 g 75 t
ORIGIN
1 gactcagtga cccagacaga aggcctggtc actctcaccg aggggttgcc tgtgatgctg
61 aactgcacct atcagactgc ttactcaact ttccttttct ggtatgtgca acatctcaat
121 gaagccccta aactactcct gaagagctcc acagacaaca agaggaccga gcaccaaggg
181 ttccacgcca ctctccataa gagcagcagc tccttccatc tgcagaagtc ctcagcgcag
241 ctgtcagact ctgccctgta ctactgtgct ctgagtgata agactggagc taacactgga
301 aagctcacgt ttggacacgg caccatcctt agggtccat
//
LOCUS MUSTCAXT 342 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma P1F12C4.
ACCESSION M34210
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 342)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 342 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 80 a 102 c 79 g 81 t
ORIGIN
1 gactccgtga cccagacaga aggcctggtc actctcaacg aggggttgcc tgtgatgctg
61 aactgcacct atcagactat ttactcaaat gctttccttt tctggtatgt gcactatctc
121 aatgaatccc cttggctact cctgcggagc tccacagaca acaagaggac cgagcaccaa
181 gggttccacg ccactctcca taagagcagc agctccttcc atctgcagaa gtcctcagcg
241 cagctgtcag actctgccct gtactactgt gctttgagtg agaggtctgg agctaacact
301 ggaaagctca cgtttggaca cggcaccatc cttagggtcc at
//
LOCUS MUSTCAXU 324 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma P1D3A6.
ACCESSION M34212
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 324 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 82 a 80 c 81 g 81 t
ORIGIN
1 cagcaagtgc agcagagccc cgcgtccttg gttctgcagg agggggagaa tgcagagctg
61 cagtgtaact tttccacatc tttgaacagt atgcagtggt tttaccaacg tcctgaggga
121 agtctcgtca gcctgttcta caatccttct gggacaaagc agagtgggag actgacatcc
181 acaacagtca tcaaagaacg tcgcagctct ttgcacattt cctcctccca gatcacagac
241 tcaggcactt atctctgtgc tatggaggct actggaggca ataataagct gacttttggt
301 caaggaaccg ttctgagtgt tata
//
LOCUS MUSTCAXV 210 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma 1E1O.
ACCESSION M34214
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 210)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 210 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 54 a 53 c 50 g 53 t
ORIGIN
1 gggggaagtc tcgtcagcct gttctacaat ccttctggga caaagcagag tgggagactg
61 acatccacta cagtcatcaa agaacgtcgc agctctttgc acatttcctc ctcccagaca
121 acagactcag gcacttatct ctgtgctatg gcggctactg gaggcaataa taagctgact
181 tttggtcaag gaaccgttct gagtgttata
//
LOCUS MUSTCAXW 234 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma 7/6AH1.
ACCESSION M34216
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 234)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 234 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 55 a 59 c 58 g 62 t
ORIGIN
1 atgcagtggt tttatcaacg tcctggggga agtctcgtca gcctgttcta caatccttct
61 gggacaaagc agagtgggag actgacatcc actacagtca tcaaagaacg tcgcagctct
121 ttgcacattt cctcctccca gacaacagac tcaggcactt atctctgtgc tatgggtgta
181 tctggtagct tcaataagtt gacctttgga gcagggacca gactggctgt gtgc
//
LOCUS MUSTCAXX 312 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region,
partial cds, from hybridoma 2B11.
ACCESSION M34218
KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 312 T-cell receptor alpha-chain (AA at 1)
BASE COUNT 89 a 79 c 79 g 64 t 1 others
ORIGIN
1 aatccgtggg ccctgagngt ccacgagggt gaaagtgtca cggtgaattg tagttacaag
61 acatccataa ctgccctaca gtggtacaga cagaagtcag gcgaaggccc tgcccagcta
121 atcttaatac gttcaaatga gagagagaag cgcaatggaa gactcagagc cacccttgac
181 acctccagcc agagcagctc cttgtccatc actgctactc ggtgtgaaga caccgctgtg
241 tacttctgtg ctactgagac aggcaatact agaaaacaca tctttgggct ggggacaact
301 ttgcaagtgc aa
//
LOCUS MUSTCBYAO 153 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma LD1.
ACCESSION M34195
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 153)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 153 T-cell receptor beta-chain (AA at 1)
recomb 122 123 V-region end/J-region start
BASE COUNT 39 a 37 c 36 g 41 t
ORIGIN
1 caaataggag atgtccctga tgggtacaag gccaccagaa caacgcaaga agacttcttc
61 ctcctgctgg aattggcttc tccctctcag acatctttgt acttctgtgc cagcagtgta
121 ggttctggaa atacgctcta ttttggagaa gga
//
LOCUS MUSTCBYAP 111 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma LD3.
ACCESSION M34197
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 111)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 111 T-cell receptor beta-chain (AA at 1)
recomb 73 74 V-region end/J-region start
BASE COUNT 18 a 37 c 25 g 31 t
ORIGIN
1 ttcctcctgc tggaattggc ttctccctct cagacatctt tgtacttctg tgccgcgtcc
61 ccgacaggga acaccgacta caccttcggc tcagggacca ggcttttggt a
//
LOCUS MUSTCBYAQ 321 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-14.
ACCESSION M34199
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 321 T-cell receptor beta-chain (AA at 1)
recomb 292 293 V-region end/J-region start
BASE COUNT 106 a 81 c 61 g 73 t
ORIGIN
1 gacccgaaaa ttatccagaa accaaaatat ctggtggcag tcacagggag cgaaaaaatc
61 ctgatatgcg aacagtatct aggccacaat gctatgtatt ggtatagaca aagtgctaag
121 aagcctctag agttcatgtt ttcctacagc tatcaaaaac ttatggacaa tcagactgcc
181 tcaagtcgct tccaacctca aagttcaaag aaaaaccatt tagaccttca gatcacagct
241 ctaaagcctg atgactcggc cacatacttc tgtgccagca gccccaagac acgtcaaaac
301 accttgtact ttggtgcggg c
//
LOCUS MUSTCBYAR 210 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-27.
ACCESSION M34201
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 210)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 210 T-cell receptor beta-chain (AA at 1)
recomb 167 168 V-region end/J-region start
BASE COUNT 57 a 56 c 43 g 54 t
ORIGIN
1 aagattatgt ttagctacaa taataagcaa ctcattgtaa acgaaacagt tccaaggcgc
61 ttctcacctc agtcttcaga taaagctcat ttgaatcttc gaatcaagtc tgtagagccg
121 gaggactctg ctgtgtatct ctgtgccagc agctatcgga caccccccta tgctgagcag
181 ttcttcggac cagggacacg actcaccgtc
//
LOCUS MUSTCBYAS 255 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-6.
ACCESSION M34203
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 255)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 255 T-cell receptor beta-chain (AA at 1)
recomb 219 220 V-region end/J-region start
BASE COUNT 57 a 68 c 77 g 53 t
ORIGIN
1 cagtatccct ggatgagctg gtatcagcag gatctccaaa agcaactaca gtggctgttc
61 actctgcgga gtcctgggga caaagaggtc aaatctcttc ccggtgctga ttacctggcc
121 acacgggtca ctgatacgga gctgaggctg caagtggcca acatgagcca gggcagaacc
181 ttgtactgca cctgcagtgc ggggactggg ggggctacta acaccttgta ctttggtgcg
241 ggcacccgac tatcg
//
LOCUS MUSTCBYAT 213 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, T-cell clone V2.1.
ACCESSION M34205
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 213)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 213 T-cell receptor beta-chain (AA at 1)
recomb 171 172 V-region end/J-region start
BASE COUNT 48 a 59 c 58 g 48 t
ORIGIN
1 ctgaggctga tccattattc atatggtgct ggcagcactg agaaaggaga tatccctgat
61 ggatacaagg cctccagacc aagccaagag aacttctccc tcattctgga gttggctacc
121 ccctctcaga catcagtgta cttctgtgcc agcggtggcg gccgggggag ttatgctgag
181 cagttcttcg gaccagggac acgactcacc gtc
//
LOCUS MUSTCBYAU 207 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-33.
ACCESSION M34207
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 207)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 207 T-cell receptor beta-chain (AA at 1)
recomb 163 164 V-region end/J-region start
BASE COUNT 65 a 54 c 36 g 52 t
ORIGIN
1 ctagagttca tgttttccta cagctatcaa aaacttatgg acaatcagac tgcctcaagt
61 cgcttccaac ctcaaagttc aaagaaaaac catttagacc ttcagatcac agctctaaag
121 cctgatgact cggccacata cttctgtgcc agcagcaaaa gggccaacga aagattattt
181 ttcggtcatg gaaccaagct gtctgtc
//
LOCUS MUSTCBYAV 156 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma MT1-7.
ACCESSION M34209
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 156)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 156 T-cell receptor beta-chain (AA at 1)
recomb 118 119 V-region end/J-region start
BASE COUNT 30 a 48 c 40 g 38 t
ORIGIN
1 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg
61 gcttctccct ctcagacatc tttgtacttc tgtgccagca gtgtccgggt ctgggggcct
121 gaacagtact tcggtcccgg caccaggctc acggtt
//
LOCUS MUSTCBYAW 132 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma P1F12C4.
ACCESSION M34211
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 132)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 132 T-cell receptor beta-chain (AA at 1)
recomb 94 95 V-region end/J-region start
BASE COUNT 35 a 39 c 28 g 30 t
ORIGIN
1 ccaagccaag agaacttctc cctcattctg gagttggcta ccccctctca gacatcagtg
61 tacttctgtg ccagcggtgc cagacaggca aacacagaag tcttctttgg taaaggaacc
121 agactcacag tt
//
LOCUS MUSTCBYAX 303 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma P1D3A6.
ACCESSION M34213
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 303)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 303 T-cell receptor beta-chain (AA at 1)
recomb 259 260 V-region end/J-region start
BASE COUNT 80 a 72 c 79 g 72 t
ORIGIN
1 aaggtgacag taacaggagg aaacgtgaca ttgagctgtc gccagactaa tagccacaac
61 tacatgtact ggtatcggca ggacactggg catgggctga ggctgatcca ttactcatat
121 ggtgctggca accttcaaat aggagatgtc cctgatgggt acaaggccac cagaacaacg
181 caagaagact tcttcctcct gctggaattg gcttctccct ctcagacatc tttgtacttc
241 tgtgccagca gtgcaggagc tggaaatacg ctctattttg gagaaggaag ccggctcatt
301 gtt
//
LOCUS MUSTCBYAY 159 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma 1E1O.
ACCESSION M34215
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 159)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 159 T-cell receptor beta-chain (AA at 1)
recomb 113 114 V-region end/J-region start
BASE COUNT 35 a 40 c 39 g 45 t
ORIGIN
1 gatgtccctg atgggtacaa ggccaccaga acaacgcaag aagacttctt cctcctgctg
61 gaattggctt ctccctctca gacatctttg tacttctgtg ccagcagtgt gggttctgga
121 aatacgctct attttggaga aggaagccgg ctcattgtt
//
LOCUS MUSTCBYAZ 321 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma 7/6AH1.
ACCESSION M34217
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 321 T-cell receptor beta-chain (AA at 1)
recomb 275 276 V-region end/J-region start
BASE COUNT 86 a 78 c 81 g 76 t
ORIGIN
1 acccaaagcc ctagaaacaa ggtgacagta acaggaggaa acgtgacatt gagctgtcgc
61 cagactaata gccacaacta catgtactgg tatcggcagg acactgggca tgggctgagg
121 ctgatccatt actcatatgg tgctggcaac cttcaaatag gagatgtccc tgatgggtac
181 aaggccacca gaacaacgca agaagacttc ttcctcctgc tggaattggc ttctccctct
241 cagacatctt tgtacttctg tgccagcagt gtgggttctg gaaatacgct ctattttgga
301 gaaggaagcc ggctcattgt t
//
LOCUS MUSTCBYBA 339 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region,
partial cds, from hybridoma 2B11.
ACCESSION M34219
KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon;
variable region.
SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 339)
AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC
complex in the same common orientation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.H.Taylor, 16-MAY-1990.
Author address: A.H.Taylor
Wistar Institute of Anatomy and Biology
3601 Spruce Street
Philadelphia, PA 19104
Tel: (215) 898-3839
FEATURES from to/span description
pept < 1 > 339 T-cell receptor beta-chain (AA at 1)
recomb 294 295 V-region end/J-region start
BASE COUNT 90 a 88 c 88 g 73 t
ORIGIN
1 gaggctgcag tcacccaaag ccctagaaac aaggtgacag taacaggagg aaacgtgaca
61 ttgagctgtc gccagactaa tagccacaac tacatgtact ggtatcggca ggacactggg
121 catgggctga ggctgatcca ttactcatat ggtgctggca accttcaaat aggagatgtc
181 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg
241 gcttctccct ctcagacatc tttgtacttc tgtgccagca ggagacaggg gcctagtcaa
301 aacaccttgt actttggtgc gggcacccga ctatcggtg
//
LOCUS CEACAEVA 264 bp ds-DNA VRL 17-JUL-1990
DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene,
complete cds.
ACCESSION M34092
KEYWORDS tat protein.
SOURCE Caprine arthritis-encephalitis lentivirus (strain Cork) DNA, from
goat synovial membrane, clone pCol.9.
ORGANISM Caprine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
Robinson,M.M. and Cheevers,W.P.
TITLE Genetic structure of the pol-env region of the Caprine arthritis-
encephalitis lentivirus genome: Possible role in trans-activation
of the viral long terminal repeat
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.P.Cheevers, 08-MAY-1990.
Author address: W.P.Cheevers
Washington State University
Dept Veterinary Microbiology
Pullman, WA 94164-7040
email: b384@wsuvmsl.bitnet
FEATURES from to/span description
pept 1 264 tat protein
BASE COUNT 97 a 40 c 80 g 47 t
ORIGIN
1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta
61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg
121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg
181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta
241 tgtaacccgg ggtggggaac ataa
//
LOCUS CEACAEVB 264 bp ds-DNA VRL 17-JUL-1990
DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene,
complete cds.
ACCESSION M34093
KEYWORDS tat protein.
SOURCE Caprine arthritis-encephalitis lentivirus (strain G63) DNA, from
goat synovial membrane, clone pC63-49.
ORGANISM Caprine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
Robinson,M.M. and Cheevers,W.P.
TITLE Genetic structure of the pol-env region of the Caprine arthritis-
encephalitis lentivirus genome: Possible role in trans-activation
of the viral long terminal repeat
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.P.Cheevers, 08-MAY-1990.
Author address: W.P.Cheevers
Washington State University
Dept Veterinary Microbiology
Pullman, WA 94164-7040
email: b384@wsuvmsl.bitnet
FEATURES from to/span description
pept 1 264 tat protein
BASE COUNT 99 a 43 c 75 g 47 t
ORIGIN
1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata
61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg
121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg
181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta
241 tgtaacccgg ggtgggaaac ataa
//
LOCUS FLAPR834HA 540 bp ss-RNA VRL 17-JUL-1990
DEFINITION Influenza A/PR/8/34, hemagglutinin (seg 4) gene. partial cds.
ACCESSION M34335
KEYWORDS glycoprotein; hemagglutinin.
SOURCE Influenza A/PR/8/34 RNA, passed in bovine MBDK cells, originally
from human.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 540)
AUTHORS Bressoud,A., Whitcomb,J., Pourzand,C., Haller,O. and Cerutti,P.
TITLE Rapid detection of influenza virus H1 by the polymerase chain
reaction
JOURNAL Biochem. Biophys. Res. Commun. 167, 425-430 (1990)
STANDARD simple staff_review
COMMENT Sequence reported is + strand.
FEATURES from to/span description
pept < 1 > 540 hemagglutinin (AA at 2)
BASE COUNT 179 a 121 c 123 g 117 t
ORIGIN
1 cctactggtc ctgttatgtg cacttgcagc tgcagatgca gacacaatat gtataggcta
61 ccatgcgaac aattcaaccg acactgttga cacagtactc gagaagaatg tgacagtgac
121 acactctgtt aacctgctcg aagacagcca caacggaaaa ctatgtagat taaaaggaat
181 agccccacta caattgggga aatgtaacat cgccggatgg ctcttgggaa acccagaatg
241 cgacccactg cttccagtga gatcatggtc ctacattgta gaaacaccaa actctgagaa
301 tggaatatgt tatccaggag atttcatcga ctatgaggag ctgagggagc aattgagctc
361 agtgtcatca ttcgaaagat tcgaaatatt tcccaaagaa agctcatggc ccaaccacaa
421 cacaaacgga gtaacggcag catgctccca tgaggggaaa agcagttttt acagaaattt
481 gctatggctg acggagaagg agggctcata cccaaagctg aaaaattctt atgtgaacaa
//
LOCUS HUMGPPSBAA 355 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human pregnancy-specific beta-1 glycoprotein C-D gene, intron C1.
ACCESSION M34422
KEYWORDS beta-1 glycoprotein.
SOURCE Human placenta, clone PS-beta-G C.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 355)
AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
TITLE structure, evolution and chromosomal localization of the human
pregnancy-specific beta-1 glycoprotein gene family
JOURNAL Genomics 6, 579-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
IVS 1 355 pregnancy-specific beta-1 glycoprotein intron
C1
BASE COUNT 110 a 64 c 62 g 119 t
ORIGIN Chromosome 19.
1 gtaagtggat cccagcatcg ttggcaatag ggttttaggt ggagtctatc tggcattcag
61 agaagagtca ggaaaacaat tgtattccca gcctgtgtcc catgggcaca agcaaatccc
121 aaattctcct cctgaaccct ccaaatttgt ctaagaactt cgaaaacttt aacaaacagg
181 ctgatatctt cataatattc ccagcctaga ccaagcagga agaacattga tttcattgaa
241 ataattgata ataatgaaga taatgttttt atgattttta tttgaaaatt tgctgattct
301 ttaaatggtt tgttttctac attgatggaa tttttctctt ttaatctatc tacag
//
LOCUS HUMGPPSBD 1418 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION M34421
KEYWORDS beta-1 glycoprotein.
SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G B.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1418)
AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
TITLE Structure, evolution and chromosomal localization of the human
pregnancy-specific beta-1 glycoprotein gene family
JOURNAL Genomics 6, 579-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 79 1359 pregnancy-specific beta-1 glycoprotein
precursor /hgml_locus_uid="LG0073P"
/nomgen="PSBG1" /map="19q13.1-q13.2"
sigp 79 180 pregnancy-specific beta-1 glycoprotein signal
peptide
matp 181 1356 pregnancy-specific beta-1 glycoprotein
mRNA 1 1418 pregnancy-specific beta-1 glycoprotein mRNA
BASE COUNT 408 a 398 c 296 g 316 t
ORIGIN Chromosome 19.
1 cagctgacag ccgtgctcag acagcttctg gatcctaggc tcatctccac agaggagaac
61 acgcaggcag cagagaccat ggggcccctc ccagcccctt cctgcacaca gcgcatcacc
121 tggaaggggc tcctgctcac agcatcactt ttaaacttct ggaacccgcc caccactgcc
181 gaagtcacga ttgaagccca gccacccaaa gtttctgagg ggaaggatgt tcttctactt
241 gtccacaatt tgccccagaa tcttcctggc tacttctggt acaaagggga aatgacggac
301 ctctaccatt acattatatc gtatatagtt gatggtaaaa taattatata tgggcctgca
361 tacagtggaa gagaaacagt atattccaac gcatccctgc tgatccagaa tgtcacccgg
421 aaggatgcag gaacctacac cttacacatc ataaagcgag gtgatgagac tagagaagaa
481 attcgacatt tcaccttcac cttatacttg gagactccca agccctacat ctccagcagc
541 aacttaaacc ccagggaggc catggaggct gtgcgcttaa tctgtgatcc tgagactctg
601 gacgcaagct acctatggtg gatgaatggt cagagcctcc ctgtgactca caggttgcag
661 ctgtccaaaa ccaacaggac cctctatcta tttggtgtca caaagtatat tgcaggaccc
721 tatgaatgtg aaatacggaa cccagtgagt gccagtcgca gtgacccagt caccctgaat
781 ctcctcccga agctgcccat cccctacatc accatcaaca acttaaaccc cagggagaat
841 aaggatgtct tagccttcac ctgtgaacct aagagtgaga actacaccta catttggtgg
901 ctaaacggtc agagcctccc cgtcagtccc ggggtaaagc gacccattga aaacaggata
961 ctcattctac ccagtgtcac gagaaatgaa acaggaccct atcaatgtga aatacgggac
1021 cgatatggtg gcctccgcag taacccagtc atcctaaatg tcctctatgg tccagacctc
1081 cccagaattt acccttcatt cacctattac cgttcaggag aaaacctcga cttgtcctgc
1141 ttcacggaat ctaacccacc ggcagagtat ttttggacaa ttaatgggaa gtttcagcaa
1201 tcaggacaaa agctctttat cccccaaatt actagaaatc atagcgggct ctatgcttgc
1261 tctgttcata actcagccac tggcaaggaa atctccaaat ccatgacagt caaagtctct
1321 ggtccctgcc atggagacct gacagagtct cagtcatgac tgcaacaact gagacactga
1381 gaaaaagaac aggctgatac cttcatgaaa ttcaagac
//
LOCUS HUMGPPSBE 1856 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION M34420
KEYWORDS beta-1 glycoprotein.
SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G A.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1856)
AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
TITLE Structure, evolution and chromosomal localization of the human
pregnancy-specific beta-1 glycoprotein gene family
JOURNAL Genomics 6, 579-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 82 1368 pregnancy-specific beta-1 glycoprotein
precursor /hgml_locus_uid="LG0073P"
/nomgen="PSBG1" /map="19q13.1-q13.2"
sigp 82 183 pregnancy-specific beta-1 glycoprotein signal
peptide
matp 184 1365 pregnancy-specific beta-1 glycoprotein
mRNA 1 1856 pregnancy-specific beta-1 glycoprotein mRNA
BASE COUNT 544 a 476 c 364 g 472 t
ORIGIN Chromosome 19.
1 gcacagctga gagccatgct caggaagttt ctggatccta ggctcagctc cacagaggag
61 aacacgcagg cagcagagac catggggccc ctctcagccc ctccctgcac acagcgcatc
121 acctggaagg ggctcctgct cacagcatca cttttaaact tctggaaccc gcctaccact
181 gcccaagtca cgattgaagc cgagccaacc aaagtttcca aggggaagga cgttcttcta
241 cttgtccaca atttgcccca gaatcttgct ggctacatct ggtacaaagg gcaaatgaag
301 gacctctacc attacattac atcatacgta gtagatggtc aaataattat atatgggcct
361 gcatacagtg gacgagaaac agtatattcc aatgcatccc tgctgatcca gaatgtcacc
421 cgggaggacg caggatccta caccttacac atcgtaaagc gaggtgatgg gactagagga
481 gaaactggac atttcacctt caccttatac ctggagactc ccaagccctc catctccagc
541 agcaacttat accccaggga ggacatggag gctgtgagct taacctgtga tcctgagact
601 ccggacgcaa gctacctgtg gtggatgaat ggtcagagcc tccctatgac tcacagcttg
661 cagttgtcca aaaacaaaag gaccctcttt ctatttggtg tcacaaagta cactgcagga
721 ccctatgaat gtgaaatacg gaacccagtg agtgccagcc gcagtgaccc agtcaccctg
781 aatctcctcc cgaagctgcc caagccctac atcaccatca acaacttaaa ccccagggag
841 aataaggatg tcttagcctt cacctgtgaa cctaagagtg agaactacac ctacatttgg
901 tggctaaatg gtcagagcct cccggtcagt cccagggtaa agcgacccat tgaaaacagg
961 atcctcattc tacccagtgt cacgagaaat gaaacaggac cctatcaatg tgaaatacag
1021 gaccgatatg gtggcatccg cagttaccca gtcaccctga atgtcctcta tggtccagac
1081 ctccccagaa tttacccttc attcacctat taccattcag gagaaaacct ctacttgtcc
1141 tgcttcgcgg actctaaccc accagcagaa tattcttgga caattaatgg gaagtttcag
1201 ctatcaggac aaaagctctt tatcccccag attactacaa agcatagcgg gctctatgct
1261 tgctctgttc gtaactcagc cactggcatg gaaagctcca aatccatgac agtcaaagtc
1321 tctgctcctt caggaacagg acatcttcct ggccttaatc cattatagca gccgtgatgt
1381 catttctgta tttcaggaag actggcagac agttgctttc attcttcctc aaagtattta
1441 ccatcagcta cagtccaaaa ttgctttttg ttcaaggaga tttatgaaaa gactctgaca
1501 aggactcttg aatacaagtt cctgataact tcaagatcat accactggac taagaacttt
1561 caaaatttta atgaacaggc tgatacttca tgaaattcaa gacaaagaaa aaaacccaat
1621 tttattggac taaatagtca aaacaatgtt ttcataattt tctatttgaa aatgtgctga
1681 ttctttgaat gttttattct ccagatttat gcactttttt tcttcagcaa ttggtaaagt
1741 atacttttgt aaacaaaaat tgaaacattt gcttttgctc cctaagtgcc ccagaattgg
1801 gaaactattc aggagtattc atatgtttat ggtaataaag ttatctgcac aagttc
//
LOCUS HUMGPPSBF 2004 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION M23575
KEYWORDS beta-1 glycoprotein.
SOURCE Human placenta, cDNA to mRNA, clone pSP1-i.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2004)
AUTHORS Rooney,B.C., Horne,C.H.W. and Hardman,N.
TITLE Molecular cloning of a cDNA for human pregnancy-specific
beta-1-glycoprotein: Homology with human carcinoembryonic antigen
and related proteins
JOURNAL Gene 71, 439-449 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 232 1518 pregnancy-specific beta-1 glycoprotein
precursor /hgml_locus_uid="LG0073P"
/nomgen="PSBG1" /map="19q13.1-q13.2"
sigp 232 333 pregnancy-specific beta-1 glycoprotein signal
peptide
matp 334 1515 pregnancy-specific beta-1 glycoprotein
BASE COUNT 579 a 510 c 406 g 509 t
ORIGIN
1 gggcgggaca actggtctga gtactatggc tgattttcgc tgtctggcat tgagaagcca
61 cacgcccctt ttgcttagga ggcctctctg ctggaggatg acgatggcat ggtttatcta
121 aggccactga caagtcatca atataggaca gcacagctga gagccatgct caggaagttt
181 ctggatccta ggctcagctc cacagaggag aacacgcagg cagcagagac catggggccc
241 ctctcagccc ctccctgcac acagcgcatc acctggaagg ggctcctgct cacagcatca
301 cttttaaact tctggaaccc gcctaccact gcccaagtca cgattgaagc cgagccaacc
361 aaagtttcca aggggaagga cgttcttcta cttgtccaca atttgcccca gaatcttgct
421 ggctacatct ggtacaaagg gcaaatgaag gacctctacc attacattac atcatacgta
481 gtagatggtc aaataattat atatgggcct gcatacagtg gacgagaaac agtatattcc
541 aatgcatccc tgctgatcca gaatgtcacc cgggaggacg caggatccta caccttacac
601 atcgtaaagc gaggtgatgg gactagagga gaaactggac atttcacctt caccttatac
661 ctggagactc ccaagccctc catctccagc agcaacttat accccaggga ggacatggag
721 gctgtgagct taacctgtga tcctgagact ccggacgcaa gctacctgtg gtggatgaat
781 ggtcagagcc tccctatgac tcacagcttg cagttgtcca aaaacaaaag gaccctcttt
841 ctatttggtg tcacaaagta cactgcagga ccctatgaat gtgaaatacg gaacccagtg
901 agtgccagcc gcagtgaccc agtcaccctg aatctcctcc cgaagctgcc caagccctac
961 atcaccatca acaacttaaa ccccagggag aataaggatg tcttagcctt cacctgtgaa
1021 cctaagagtg agaactacac ctacatttgg tggctaaatg gtcagagcct cccggtcagt
1081 cccagggtaa agcgacccat tgaaaacagg atcctcattc tacccagtgt cacgagaaat
1141 gaaacaggac cctatcaatg tgaaatacag gaccgatatg gtggcatccg cagttaccca
1201 gtcaccctga atgtcctcta tggtccagac ctccccagaa tttacccttc attcacctat
1261 taccattcag gagaaaacct ctacttgtcc tgcttcgcgg actctaaccc accagcagaa
1321 tattcttgga caattaatgg gaagtttcag ctatcaggac aaaagctctt tatcccccag
1381 attactacaa agcatagcgg gctctatgct tgctctgttc gtaactcagc cactggcatg
1441 gaaagctcca aatccatgac agtcaaagtc tctgctcctt caggaacagg acatcttcct
1501 ggccttaatc cattatagca gccgtgatgt catttctgta tttcaggaag actggcagac
1561 agttgctttc attcttcctc aaagtattta ccatcagcta cagtccaaaa ttgctttttg
1621 ttcaaggaga tttatgaaaa gactctgaca aggactcttg aatacaagtt cctgataact
1681 tcaagatcat acatggacta agaactttca aaattttaat gaacaggctg atacttcatg
1741 aaattcaaga caaagaaaaa aacccaattt tattggacta aatagtcaaa acaatgtttt
1801 cataattttc tatttgaaaa tgtgctgatt ctttgaatgt tttattctcc agatttatgc
1861 actttttttc ttcagcaatt ggtaaagtat acttttgtaa acaaaaattg aaacatttgc
1921 ttttgctccc taagtgcccc agaattggga aactattcag gagtattcat atgtttatgg
1981 taataaagtt atctgcacaa accc
//
LOCUS HUMLEUELA 920 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human elastase/medullasin mRNA, complete cds.
ACCESSION M34379
KEYWORDS elastase; medullasin.
SOURCE Human leukemic cell line ML3, cDNA to mRNA, clone pSRHLE.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 920)
AUTHORS Okano,K., Aoki,Y., Shimizu,H. and Naruto,M.
TITLE Functional expression of human leukocyte elastase (HLE)/medullasin
in eukaryotic cells
JOURNAL Biochem. Biophys. Res. Commun. 167, 1326-1332 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept 39 842 elastase/medullasin precursor (EC 3.4.21.37)
sigp 39 119 elastase/medullasin signal peptide
matp 126 839 elastase/medullasin
BASE COUNT 141 a 329 c 287 g 163 t
ORIGIN
1 gcacggaggg gcagagaccc cggagcccca gccccaccat gaccctcggc cgccgactcg
61 cgtgtctttt cctcgcctgt gtcctgccgg ccttgctgct ggggggcacc gcgctggcct
121 cggagattgt ggggggccgg cgagcgcggc cccacgcgtg gcccttcatg gtgtccctgc
181 agctgcgcgg aggccacttc tgcggcgcca ccctgattgc gcccaacttc gtcatgtcgg
241 ccgcgcactg cgtggcgaat gtaaacgtcc gcgcggtgcg ggtggtcctg ggagcccata
301 acctctcgcg gcgggagccc acccggcagg tgttcgccgt gcagcgcatc ttcgaaaacg
361 gctacgaccc cgtaaacttg ctcaacgaca tcgtgattct ccagctcaac gggtcggcca
421 ccatcaacgc caacgtgcag gtggcccagc tgccggctca gggacgccgc ctgggcaacg
481 gggtgcagtg cctggccatg ggctggggcc ttctgggcag gaaccgtggg atcgccagcg
541 tcctgcagga gctcaacgtg acggtggtga cgtccctctg ccgtcgcagc aacgtctgca
601 ctctcgtgag gggccggcag gccggcgtct gtttcgggga ctccggcagc cccttggtct
661 gcaacgggct aatccacgga attgcctcct tcgtccgggg aggctgcgcc tcagggctct
721 accccgatgc ctttgccccg gtggcacagt ttgtaaactg gatcgactct atcatccaac
781 gctccgagga caacccctgt ccccaccccc gggacccgga cccggccagc aggacccact
841 gagaagggct gcccgggtca cctcagctgc ccacacccac actctccagc atctggcaca
901 ataaacattc tctgttttgt
//
LOCUS MSGIS6110 1360 bp ds-DNA BCT 17-JUL-1990
DEFINITION M.tuberculosis-50 complex IS6110 insertion sequence-like element.
ACCESSION M29899
KEYWORDS insertion sequence.
SOURCE M.tuberculosis (strain H37RV) DNA (cosmid library pHC79), clone
I21.
ORGANISM Mycobacterium tuberculosis
Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae.
REFERENCE 1 (bases 1 to 1360)
AUTHORS Thierry,D., Cave,M.D., Eisenach,K.D., Crawford,J.T., Bates,J.H.,
Gicquel,B. and Guesdon,J.L.
TITLE IS6110 an IS-like element of Mycobacterium tuberculosis-50 complex
JOURNAL Nucleic Acids Res. 18, 188-188 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Thierry 15-NOV-1989.
BASE COUNT 269 a 439 c 432 g 220 t
ORIGIN
1 cgatgaaccg ccccggcatg tccggagact ccagttcttg gaaaggatgg ggtcatgtca
61 ggtggttcat cgaggaggta cccgccggag ctgcgtgagc gggcggtgcg gatggtcgca
121 gagatccgcg gtcagcacga ttcggagtgg gcagcgatca gtgaggtcgc ccgtctactt
181 ggtgttggct gcgcggagac ggtgcgtaag tgggtgcgcc aggcgcaggt cgatgccggc
241 gcacggcccg ggaccacgac cgaagaatcc gctgagctga agcgcttagc ggcgggacaa
301 cgccgaattg cgaagggcga acgcgatttt aaagaccgcg tcggctttct tcgcggccga
361 gctcgaccgg ccagcacgct aattaacggt tcatcgccga tcatcagggc caccgcgagg
421 gccccgatgg tttgcggtgg ggtgtcgagt cgatctgcac acagctgacc gagctgggtg
481 tgccgatcgc cccatcgacc tactacgacc acatcaaccg ggagcccagc cgccgcgagc
541 tgcgcgatgg cgaactcaag gagcacatca gccgcgtcca cgccgccaac tacggtgttt
601 acggtgcccg caaagtgtgg ctaaccctga accgtgaggg catcgaggtg gccagatgca
661 ccgtcgaacg gctgatgacc aaactcggcc tgtccgggac cacccgcggc aaagcccgca
721 ggaccacgat cgctgatccg gccacagccc gtcccgccga tctcgtccag cgccgcttcg
781 gaccaccagc acctaaccgg ctgtgggtag cagacctcac ctatgtgtcg acctgggcag
841 ggttcgccta cgtggccttt gtcaccgacg cctacgtcgc aggatcctgg gctggcgggt
901 cgcttccacg atggccacct ccatggtcct cgacgcgatc gagcaagcca tctggacccg
961 ccaacaagaa ggcgtactcg acctgaaaga cgttatccac catacggata ggggatctca
1021 gtacacatcg atccggttca gcgagcggct cgccgaggca ggcatccaac cgtcggtcgg
1081 agcggtcgga agctcctatg acaatgcact agccgagacg atcaacggcc tatacaagac
1141 cgagctgatc aaacccggca agccctggcg gtccatcgag gatgtcgagt tggccaccgc
1201 gcgctgggtc gactggttca accatcgccg cctctaccag tactgcggcg acgtcccgcc
1261 ggtcgaactc gaggctgcct actacgctca acgccagaga ccagccgccg gctgaggtct
1321 cagatcagag agtctccgga ctcaccgggg cggttcacga
//
LOCUS MUSIGHAAT 348 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig J558 family active H-chain mRNA V-J3 region
from hybridoma CE5, partial cds.
ACCESSION M34119
KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene.
SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
TITLE Identical D region sequences expressed by murine monoclonal
antibodies specific for a human tumor-associated antigen
JOURNAL J. Immunol. 144, 1965-1968 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 348 Ig H-chain V-J3 region (AA at 1)
recomb 294 295 J-region end/D-region start
recomb 303 304 D-region end/J-region start
BASE COUNT 89 a 82 c 97 g 80 t
ORIGIN Chromosome 12.
1 caggttcagc tgcagcagtc tggagctgaa ctgatgaagc ctggggcctc agtgaagata
61 tcctgcaagg ctactggcta cacattcagt aagtactgga tagagtgggt aaagcagagg
121 cctggacatg gccttgagtg gattggagag attttacctg gaagtggtag tactaaccat
181 gatgagaagt tcaagggcaa ggccacattc actgcagata catcctccaa cacagcctac
241 atgcaactca gcagcctgac atctgaggac tctgccgtct attactgtgc aagagacggt
301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS MUSIGKCSR 321 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma GA733,
partial cds.
ACCESSION M34120
KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE Mouse (Balb/c) hybridoma GA733, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
TITLE Identical D region sequences expressed by murine monoclonal
antibodies specific for a human tumor-associated antigen
JOURNAL J. Immunol. 144, 1965-1968 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 321 Ig kappa-chain (AA at 1)
BASE COUNT 87 a 83 c 74 g 77 t
ORIGIN Chromosome 6.
1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagtgtcagc
61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
121 ggacaatctc ctaaactact gatttactcg gcatccgacc ggtacactgg agtccctgat
181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
241 gaagacctgg cagtttatta ctgtcaccaa cattatatta ctcctcggac gttcggtgga
301 ggcaccaaac tggaaatcaa a
//
LOCUS MUSIGKCSS 321 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma
C017-1A, partial cds.
ACCESSION M34121
KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE Mouse (Balb/c) hybridoma C017-1A, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
TITLE Identical D region sequences expressed by murine monoclonal
antibodies specific for a human tumor-associated antigen
JOURNAL J. Immunol. 144, 1965-1968 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 321 Ig kappa-chain (AA at 1)
BASE COUNT 89 a 77 c 78 g 77 t
ORIGIN Chromosome 6.
1 aacattgtaa tgacccaatc tcccaaatcc atgtccatgt cagtaggaga gagggtcacc
61 ttgacctgca aggccagtga gaatgtggtt acttatgttt cctggtatca acagaaacca
121 gagcaatctc ctaaactctt gatttacggg gcctccaacc ggtacactgg ggtccccgat
181 cgcttcacag gtagtggatc tgcaacagat ttcactctga ccattagtag tgtgcaagct
241 gaagaccttg cagattatca ctgtggacag ggttacagct atccgtacac gttcggaggg
301 gggaccaagc tggaaataaa a
//
LOCUS MUSIGKCST 318 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma
CE5, partial cds.
ACCESSION M34122
KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
TITLE Identical D region sequences expressed by murine monoclonal
antibodies specific for a human tumor-associated antigen
JOURNAL J. Immunol. 144, 1965-1968 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 318 Ig kappa-chain (AA at 1)
BASE COUNT 84 a 78 c 80 g 76 t
ORIGIN Chromosome 6.
1 gacattgtga tgacccagtc tcagaaattc atgtccacat cagtaggaga cagggtcggc
61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaatca
121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgag
181 cgcttcgctg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
241 gaagacctgg cagtttatta ctgtcatcaa cattatagta ctcggacgtt cggtggaggc
301 accaagctgg aaatcaaa
//
LOCUS PSERRSAA 1517 bp ss-rRNA RNA 17-JUL-1990
DEFINITION P.aeruginosa 16S ribosomal RNA.
ACCESSION M34133
KEYWORDS 16S ribosomal RNA; ribosomal RNA; small subunit ribosomal RNA.
SOURCE P.aeruginosa (strain 25330) ribosomal RNA.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1517)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept of Microbiology
407 S. Goodwin Avenue
Urbana, IL 61801
email: carl@ninja.life.uiuc.edu
FEATURES from to/span description
rRNA 1 1517 16S rRNA
BASE COUNT 379 a 334 c 470 g 301 t 33 others
ORIGIN
1 ntactgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa
61 gtcgagcgga tgaagggagc ttgctcctgg attcagcggc ggacgggtga gtaatgccta
121 ggaatctgcc tgatagtggg ggataacgtc cggaaacggg cgctaatacc gcatacgtcc
181 tgagggagaa agggggggat cctcggacct cacgctatcn gatgagccta ggtcggatta
241 gctagttggt ggggtaaagg cctaccaagg cgacgatccg taactggtct gagaggacga
301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaata
361 ttggacaatg ggcgaaagcc ngatccagcc atgccgcgtg tgtgaagaag gtcttcggat
421 tgtaaagcac tttaagttgg gaggaagggc agtaagttaa taccttgctg ttttgacgtt
481 accaacagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac gaagggtgcg
541 agcgttaatc ggaattactg ggcgtaaagc gcgcgtaggt ggttcagcaa gttggatgtg
601 aaatccccgg gctcaacctg ggaactgcat ccnaaactac tgagctagag tacggtagag
661 ggtggtggaa tttcctgtgt agcggtgaaa tgcgtagata taggaaggaa caccagtggc
721 gaaggcgacc acctggactg atactgacac tgaggtgcga aagcgtgggg agcaaacagg
781 attagatacc ctggtagtcc acgccgtaaa cgatgtcgac tagccgttgg gatccttgag
841 atcttagtgg cgcagctaac gcgataagtc gaccgcctgg ggagtacggc cgcaaggtta
901 aaactcaaat gaattgacgg gggcnngcac aagcggtgga gcatgtggtt taattcgaag
961 caacgcgaag aaccttacct ggccttgaca tgctgagaac tttccagaga tggattggtg
1021 ccttcgggaa ctcagacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt
1081 gggttaagtc ccgtaacgag cgcaaccctt gtccttagtt accagcacct cgggtgggca
1141 ctctaaggag actgccggtg acaaaccgga ggaaggtggg gatgacgtca agtcatcatg
1201 gcccttacgg cnagggctac acacgtgcta caatggtcgg tacaaagggt tgcgaagccg
1261 cgaggtggag ctaatcccat aaaaccgatc gtagtccgga tcgcagtctg caactcgact
1321 gcgtgaagtc ggaatcgcta gtaatcgtga atcagaatgt cacggtgaat acgttcccgg
1381 gccttgtaca caccgcccgt cacaccatgg gagtgggttg ctccagaagt agctagtcta
1441 accgcaaggg ggacggttac cacggagtga ttcatgnnnn nnnnnnnnnn gtaacaagnn
1501 nnnnnnnnnn gaacctg
//
LOCUS RATNESTIN 5946 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Rat nestin mRNA, complete cds.
ACCESSION M34384
KEYWORDS intermediate filament protein; nestin.
SOURCE Rat (strain E15) embryo central nervous system, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5946)
AUTHORS Lendahl,U., Zimmerman,L.B. and McKay,R.D.G.
TITLE CNS stem cells express a new class of intermediate filament protein
JOURNAL Cell 60, 585-595 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 128 5545 nestin (128 could be 161)
mRNA 1 5495 nestin mRNA
BASE COUNT 1600 a 1401 c 1871 g 1074 t
ORIGIN
1 tgctggagtt ctccgcttcc gctgggtcac tgtcgccgct acttcttttc aacccctaaa
61 agctccacgg gccactccct tctctagtgc tccacgtccg cttgccctcg ggggccagac
121 cagcgacatg gagggttgcg tcggggaaga atcttttcag atgtgggagc tcaatcgacg
181 cctggaggcc tacctgaccc gggtcaagac gctagaagag caaaaccagc tgctcagcgc
241 cgagcttggg ggactccggg cgcagtccgg agacacctcc tggagagccc gagccgatga
301 cgagctggca tccctgcgga tcctcgtcga tcagcgctgg cgggagaagc tcgaggctga
361 ggtgcagcgc gacaaccttg cggaagagct ggagagcgtg gcgggccggt gccagcaggt
421 gcggcttgct cgggagcgga ccgtccagga ggccgcctgc agccggcgcg cactcgaggc
481 ggagaagaat gcgcggggct ggctgagcac ccaggcggcc gagctggagc gggagttaga
541 ggctctgcga gccgcgcacg aggaggagcg cgcacacctg aacgcccagg ccgcctgtgc
601 gcctcgccgg ccccccgcac cgccccaccg gatccccggt ccggcccccg aagtcgagga
661 tctggccagg cgactaggcg aagtgtggcg cggggcggtg cgtgactacc aggagcgcgt
721 ggctcacatg gaaagctcgc tgggtcaggc acgcgagcgg ctgagccaag ccgtgcgggg
781 cgctcgggag tgtcgcttag aggtgcaaca gctgcaggct gatcgcgaca gcctccagga
841 gcgcagagaa gcgctggaac agagattgga aggccgctgg caggaccggc tgcaggccac
901 tgataagttc cagctggctg tggaagccct ggagcaggag aagcaaggtc tacagagtca
961 gatcgctcag atcctggaag gtgggcagca actggcacac ctcaagatgt cccttagtct
1021 ggaggtggct acatacagga ctctgctgga ggctgagaac tctcggttgc agacacctgg
1081 acgaggttcc caggcttctc ttggctttct ggaccccaag ctgaagccga atttccttgg
1141 gataccagag gaccagtacc tgggatctgt gctccctgcc ctcagcccca catccttccc
1201 ttcccccttg cctaataccc ttgagactcc tgtgacagcc ttcctgaaga ctcaggagtt
1261 ccttcaggcc agaaccccca ccttggccag cactcccatc ccacctatat ctgaggctcc
1321 ctgtcctcca aatgcagagg tgagagccca ggaggtccct ctttctctgc tccagacaca
1381 ggctccagag cccctttggc tgaaggccac agtgcctagt tcttctgcta tcctcccaga
1441 actagaggaa cctgggggca agcagcaggg tcacttccct gatgatctga cctccttagc
1501 cacaaacctc aaccctcacc accctacttt agaggctaaa gatggagaat ccagtgagtc
1561 tagagtttct agcatattcc aggaagatga ggggcaaatc tgggaactgg tagagaaaga
1621 agcagatata gaggtaaaag tagaaaacag ctcagcccag aaaacacaag aaagtggtct
1681 ggacacagaa gaaacccagg attcccaggg acctttgcag aaggaaacac tgaaggctct
1741 aggagaggag ccactgatgt ctctgaaaat ccagaactat gagacagcag ggaaagagaa
1801 ttgcaattct tctacagaag gccacctggg aacactagaa ggcccagaaa aagaaaagca
1861 aataccacta aagtctttag aagaaaagaa tgtagagtca gagaaaactc tagaaaatgg
1921 ggttcctgta ctatctgagc ttttaggaaa agaagacaca agaacagagg atcaagaatt
1981 aatgtctcct aaaggtacac taaagagatt ttcatctcta ggaaaggaaa gtcaagaagt
2041 agtgaggcct tcaaaagagg ggaacctaga atcatggaca gcttttaaag aggagagcca
2101 acacccactg ggatttccag gagctgagga ccagatgctt gagagactgg tagagaaaga
2161 ggatcagagc ttcccaaggt ctccagagga agaggaccag gaggcatgta gacctctgca
2221 gaaagagaat caggaaccac tagggtatga agaagcagag ggccagatac ttgagagact
2281 gatagaaaaa gagagtcagg agtccctgag gtctccagaa gaagaggacc aggaggcagg
2341 tagatctctg cagaaagaga atcaggagcc actagggtat gaagaagcag aggaccagat
2401 gcttgagaga ctgatagaaa aagagagtca ggagtccctg aagtctccag aagaaaacca
2461 gaggattggg aagcctctag aaagagagaa tcagaaatct ctgaggtatc ttgaagaaaa
2521 ccaggagact tttgtaccac tagaaagcag gaaccagagg ccactgagat ctctagaagt
2581 agaagaggag gagcagagaa ttgtgaaacc tctagaaaaa gtgagtcagg attccctcgg
2641 atctctagca gaagagaatg tgcagccact gaggtatctg gaagaagatg actgcataaa
2701 taagagcctt ctagaagaca agactcacaa gtccttgggg tctcttgaag atagaaatgg
2761 ggatagcatt attataccac aagaaagtga gacccaggtt tcattgaggc ctccagaaga
2821 ggaggaccag aggattgtga accatctaga aaaagaaagt caggagttct cgaggtcttc
2881 agaagaagaa gagcaggtga tggagagatc tctagaagga gagaaccatg aatcactgag
2941 ttctgtagaa aaagaggacc agatggttga gagccaacta gagaaagaga gtcaggactc
3001 agggaagtct cttgaagatg agagccagga gacctttgga cctctggaaa aagagaatgc
3061 agagtccctg agatctctag caggacagga ccaagaggaa cagaagcttg aacaagagac
3121 ccaacaaaca ctgagggctg tagggaatga gcagatggca gtgagcccac cagaaaaggt
3181 ggatccagag ttaccgaagc ctcttggaaa tgaccaggaa atagctagat ctcttggaaa
3241 agagaatcaa gagtcactag tgtcactgaa agaaaaaggt atagagacag tgaagtcttt
3301 agaaacagag atcatagaac cactggagac tgcagaagag gacctggaaa gaaggaagtc
3361 tatagatact caggagccat tgtggtctac tgaagtggct agagagacag tagaacctcc
3421 agaagatgag cccccaggat cgctagggtc tgtggatgag aaccgagaga cactgacatc
3481 ccttgaaaag gagagtcaag aactgagctc tctgggcaag tggaacgtag agaccagggt
3541 agaggacagt cagcagtgcc tgcaagtaga agagggtctg caggaggaac agcaccaaga
3601 gtctctgaga gaggtgaagc aggagctgcc tagctctgga aatcaacagc ggtgggagga
3661 tgtggtggag ggcaaagcag tgggtcagga agcacctctg gcaaccacag gagtgggaac
3721 tgaggataag gcagagttgc atctgagggg gcaaggtgga gaggaagaag ctgcagcaga
3781 gggagagctg ttgcaggata ttgtggggga ggcctggagt ctggggagct ctgagcccaa
3841 ggagcagagg gtccctgctg aggccctcga caacctggaa ggaggggcct tagaggtccc
3901 agttgctcag tcaatgccag aggtgacaga gcgagatgag gatagagccc aagcaggtga
3961 acaagactcc atagaggtga cccttgggtt agaggctgcc agaactggac tggaactcga
4021 gcaggaagtg gtagggctag aggacccaag gcattttgcc agggaggagg ccattccccc
4081 atccctgggg gaggaaagtg tgaaggcaaa gatagctcag ggcttggaag ggcctggaaa
4141 ggaaccaaaa gaggcaggtg ctctggactc ggggatcctt gaattgccca agactagcag
4201 cgaggctctg gaatgccagg gccatgaaga gtctgagtcc atggagggct gggaagaaga
4261 ggaggcctca ctggagactt cagatcatga gggcagtgat gcccctcagc ccaggccccc
4321 agaaacagaa gaagatgagg gtgcacaggc agcactgaca gcccctggtc ccaagctctt
4381 ggaaccctgt tcacccatcc caatcctgac agatgcccat gagctgcagc cccaggctga
4441 ggggatccag gaggctggct ggcagccaga agctgggtct gaagcactag aaagggtaga
4501 aaatgagcca gagtttggtc ttggggagat cccggagggc ctccaggatt gggaagaggg
4561 cagagaagaa agcgaggcag atgatctagg ggaaactctc cctgactcta ctcccctggg
4621 cctctacctg aggtcccctg cttctccaaa gtgggatctg gctggagaac agaggctttc
4681 ccctcaaggg gatgccggga aggaagactg gggtcctgct gtccccgctg cccagggcct
4741 cagtggtcca ccggaagagg aggaggagca aggccatggc tctgacctat catctgagga
4801 gtttgaggac ctagggactg aggcctctct tcttccaggg gttcccaagg aggtggcaga
4861 tcacgtgggc caagtgcccc cggtactgca gcctgcatgc tgggatcagg gtggggaatc
4921 tgatgggttt gctgatgagg aagaaagtgg ggaggaggga gaggaagaag atgctgatga
4981 ggaaggagca gagtcaggag ctcagtggtg ggggtcaggg gcctctggtg gaggctgcaa
5041 ggtccaggat attgcccaaa gaggagaccc ggtacaggag tctgtgggtg tcagtggtct
5101 ctgggatgat ggcttgagag gtgctgcagc taatgttcct gccctagaga tggtatctca
5161 ggacagtgct gagccttctg ggtcagagga gtctgagtct gcttccttgg agggggagga
5221 aggtcaagtg actgaccatt tagatgctcc ccaggaggtg accagcatgg tcccgggggt
5281 aggagatgcc tttgacattg gtggccagag ccccaacttg gactcagaac aagtgaatgg
5341 gaaaatggag aatggactag aacaggctga ggggcaggtg gtcctggatg gggacgagga
5401 tcaagaactc ctattacagg gacaggaggt gggtgctcta aaggttcctt tggtagcatc
5461 tcctgtgcat ctaggcccaa gccagcccct gaagttcact ctgagtgggg tagatgggga
5521 ttcctggtcc tcaggggaag actagaaact gcccctctgg ctctgaggat gtactggtgg
5581 ggatgtccct ccctgctctg ggtgaccact cttagctttg ataacttgac ccatggtatt
5641 tgtcctggag agttgtggct gggctgagca agggaggtga gatcctcctg aaggctcagg
5701 agttccaggc ctatagttct accccctctt tcttctgtgg ctcacctgct ggaagaggcc
5761 tgggcccaga gctttcccac aaggctgttc tggccacagc ttgctagcct tgcctaccac
5821 ctgcacaagg tctggtctgg tgtatgacca ggggagctga gggcagcatt tatctgaccc
5881 ttcatctcag cctgctgaga gcttgttcct ctcttcctcc ctgaataaag ccgtatccct
5941 acctac
//
LOCUS CLLRRE 1860 bp ss-rRNA RNA 17-JUL-1990
DEFINITION C.sapidus 18S rRNA, 3' end.
ACCESSION M34360
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE C.sapidus rRNA.
ORGANISM Callinectes sapidus
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura;
Brachyrhyncha; Portunoidea; Portunidae.
REFERENCE 1 (bases 1 to 1860)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-113 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1860 18S ribosoma RNA
BASE COUNT 359 a 375 c 421 g 367 t 338 others
ORIGIN
1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct
61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt
121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta
181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc
241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc
301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc
361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac
421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac
481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat
541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct
601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg
661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc
721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata
841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc
901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc
961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg
1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt
1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt
1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn
1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac
1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg
1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn
1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct
1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc
1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga
1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag
1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc
1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc
1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn
1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
//
LOCUS HUMNCSRC 138 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human membrane-associated tyrosine protein kinase (C-SRC) gene,
exons 3, 4, NI, and NII, partial cds.
ACCESSION M34469
KEYWORDS membrane-associated tryosine protein kinase.
SOURCE Human adult brain DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 138)
AUTHORS Pyper,J.M. and Bolen,J.B.
TITLE Identification of a novel neuronal C-SRC exon expressed in human
brain
JOURNAL Mol. Cell. Biol. 10, 2035-2040 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 29 membrane-associated tyrosine protein kinase
(C-SRC), exon 3 (AA at 1)
30 47 membrane-associated tyrosine protein kinase,
exon NI
48 80 membrane-associated tyrosine protein kinase,
exon NII
81 > 136 membrane-associated tyrosine protein kinase,
exon 4
variant 117 117 a or g
variant 135 135 c or t
BASE COUNT 36 a 40 c 40 g 22 t
ORIGIN
1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc
61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga
121 cagacaggct acatcccc
//
LOCUS MHVAPEPA 1000 bp ss-RNA VRL 17-JUL-1990
DEFINITION Murine coronavirus peplomer (S) protein gene.
ACCESSION M34435
KEYWORDS peplomer protein.
SOURCE Murine (strain JHM-DL) RNA.
ORGANISM Murine coronavirus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 1000)
AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C.
TITLE A clustering of RNA recombination sites adjacent to a hypervariable
region of the peplomer gene of Murine coronavirus
JOURNAL Virology 175, 548-555 (1990)
STANDARD simple staff_review
BASE COUNT 258 a 192 c 227 g 317 t 6 others
ORIGIN
1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag
61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa
121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct
181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa
241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta
301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt
361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca
421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac
481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg
541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga
601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg
661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt
721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt
781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa
841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa
901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag
961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa
//
LOCUS MHVAPEPB 843 bp ss-RNA VRL 17-JUL-1990
DEFINITION Murine coronavirus peplomer (S) protein gene.
ACCESSION M34436
KEYWORDS peplomer protein.
SOURCE Murine (strain A59) RNA.
ORGANISM Murine coronavirus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 843)
AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C.
TITLE A clustering of RNA recombination sites adjacent to a hypervariable
region of the peplomer gene of Murine coronavirus
JOURNAL Virology 175, 548-555 (1990)
STANDARD simple staff_review
FEATURES from to/span description
site 400 401 deletion site
BASE COUNT 227 a 153 c 183 g 280 t
ORIGIN
1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag
61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa
121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct
181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa
241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta
301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt
361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt
421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc
481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc
541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat
601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt
661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc
721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact
781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc
841 gaa
//
LOCUS MHVSP 3780 bp ss-mRNA VRL 17-JUL-1990
DEFINITION Mouse hepatitis virus surface protein S mRNA, complete cds.
ACCESSION X04797 M34437
KEYWORDS glycoprotein; membrane glycoprotein; surface glycoprotein;
surface projection glycoprotein.
SOURCE Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones
pJMS1010, pJS112, and pJS92.
ORGANISM Murine hepatitis virus A59
Unclassified.
REFERENCE 1 (bases 1 to 3780)
AUTHORS Schmidt,I., Skinner,M. and Siddell,S.
TITLE Nucleotide Sequence of the Gene Encoding the Surface Projection
Glycoprotein of Coronavirus MHV-JHM
JOURNAL J. Gen. Virol. 68, 47-56 (1987)
STANDARD simple automatic
COMMENT EMBL features not translated to GenBank features:
key from to description
SITE 24 30 put. region of homology preceeding
mRNA 5' initiation codons
SITE 121 129 pot. N-glycosylation site
SITE 208 216 pot. N-glycosylation site
SITE 430 438 pot. N-glycosylation site
SITE 604 612 pot. N-glycosylation site
SITE 1099 1107 pot. N-glycosylation site
SITE 1333 1341 pot. N-glycosylation site
SITE 1636 1644 pot. N-glycosylation site
SITE 1732 1740 pot. N-glycosylation site
SITE 1756 1764 pot. N-glycosylation site
SITE 1825 1833 pot. N-glycosylation site
SITE 1900 1914 put. proteolytic cleavage site
SITE 1972 1980 pot. N-glycosylation site
SITE 2023 2031 pot. N-glycosylation site
SITE 2293 2301 pot. N-glycosylation site
SITE 2440 2448 pot. N-glycosylation site
SITE 3139 3147 pot. N-glycosylation site
SITE 3301 3309 pot. N-glycosylation site
SITE 3331 3339 pot. N-glycosylation site
SITE 3388 3396 pot. N-glycosylation site
SITE 3436 3444 pot. N-glycosylation site
SITE 3499 3507 pot. N-glycosylation site
SITE 3553 3654 pot. transmembrane domain
SITE 3622 3675 cysteine-rich region
SITE 3655 3660 charge cluster
SITE 3715 3723 pot. N-glycosylation site
FEATURES from to/span description
pept 31 3738 surface protein S precursor
sigp 31 60 surface protein S signal peptide
matp 61 3736 surface protein S
BASE COUNT 1030 a 718 c 800 g 1232 t
ORIGIN
1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc
61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat
121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact
181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct
241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg
301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac
361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt
421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct
481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat
541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag
601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt
661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg
721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct
781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt
841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt
901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt
961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa
1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact
1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg
1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt
1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt
1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt
1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat
1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat
1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt
1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct
1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc
1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct
1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt
1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat
1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc
1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt
1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc
1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat
2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt
2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat
2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct
2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct
2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac
2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag
2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa
2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg
2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct
2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta
2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac
2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag
2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg
2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca
2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc
2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc
3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata
3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca
3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg
3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat
3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg
3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt
3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc
3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag
3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa
3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt
3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa
3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt
3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt
//
LOCUS MUSIGLAZ 713 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds.
ACCESSION M34598 M29013 J03562
KEYWORDS constant region; immunoglobulin; immunoglobulin light chain;
joining exon; lambda-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone
Y31.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 713)
AUTHORS Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A.
TITLE Structure of a third murine immunoglobulin lambda light chain
variable region that is expressed in laboratory mice
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987)
STANDARD simple staff_entry
REFERENCE 2 (bases 2 to 450)
AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and
is conserved in mammalian species
JOURNAL J. Immunol. 144, 2816-2820 (1990)
STANDARD simple staff_review
REFERENCE 3 (bases 266 to 429)
AUTHORS Sanchez,P. and Cazenave,P.-A.
TITLE A new variable region in mouse immunoglobulin lambda light chains
JOURNAL J. Exp. Med. 166, 265-270 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 12 > 713 Ig lambda chain precursor V-x,J-2,C-2 region
sigp 12 68 Ig lambda chain signal peptide
matp 69 > 713 Ig lambda chain
recomb 379 380 V-region end/J2-region start
recomb 414 415 J2-region end/C2-region start
BASE COUNT 175 a 190 c 159 g 189 t
ORIGIN
1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt
61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag
121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc
181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca
241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta
301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa
361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc
421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca
481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga
541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca
601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca
661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct
//
LOCUS MUSIGLVD 681 bp ds-DNA ROD 17-JUL-1990
DEFINITION Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds.
ACCESSION M34597
KEYWORDS constant region; germline; immunoglobulin light chain;
joining exon; lambda-immunoglobulin; variable region.
SOURCE Mouse (strain Balb/c AnPt) liver DNA, clone 30X2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 681)
AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and
is conserved in mammalian species
JOURNAL J. Immunol. 144, 2816-2820 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 59 104 Ig lambda-chain precursor Vx-J2-C2 region,
exon 1
209 531 Ig lambda-chain precursor Vx-J2-C2 region,
exon 2
sigp 59 104 Ig lambda-chain signal peptide
209 219 Ig lambda-chain signal peptide
matp 220 528 Ig lambda-chain
IVS 105 208 Ig lambda-chain Vx-J2-C2 region intron A
recomb 530 531 Vx-region end/J2-region start
recomb 565 566 J2-region end/C2-region start
BASE COUNT 179 a 156 c 136 g 210 t
ORIGIN Chromosome 16.
1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat
61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt
121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc
181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca
241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag
301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg
361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct
421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa
481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa
541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta
601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa
661 tcctcttgga cttcgtaggg c
//
LOCUS PAERRE 1877 bp ss-rRNA RNA 17-JUL-1990
DEFINITION P.kadiakensis 18S rRNA, 3' end.
ACCESSION M34359
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.kadiakensis rRNA.
ORGANISM Palaemonetes kadiakensis
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea;
Palaemonoidea; Palaemonidae.
REFERENCE 1 (bases 1 to 1877)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-113 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1877 18S ribosoma RNA
BASE COUNT 339 a 321 c 399 g 313 t 505 others
ORIGIN
1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt
61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt
121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac
181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg
241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca
301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct
361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna
421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga
481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac
541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn
601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac
841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna
901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta
961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta
1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa
1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt
1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn
1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc
1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt
1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn
1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn
1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg
1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna
1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg
1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca
1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact
1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag
1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn
1861 nnnnnnnnnn nnnnnnn
//
LOCUS PBESVBRA 584 bp ds-DNA INV 17-JUL-1990
DEFINITION P.berghei telomeric repeat region subfragment alpha DNA.
ACCESSION M34601
KEYWORDS .
SOURCE P.berghei DNA, clone pTel.1.
ORGANISM Plasmodium berghei
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 584)
AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
TITLE Organization of subtelomeric repeats in Plasmodium berghei
JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt 461 541 27 bp repeats
BASE COUNT 205 a 41 c 85 g 251 t 2 others
ORIGIN
1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac
61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc
121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata
181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat
241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat
301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa
361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt
421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg
481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg
541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga
//
LOCUS PBESVBRB 593 bp ds-DNA INV 17-JUL-1990
DEFINITION P.berghei telomeric repeat region subfragment a DNA.
ACCESSION M34602
KEYWORDS .
SOURCE P.berghei DNA, clone pTel.1.
ORGANISM Plasmodium berghei
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 593)
AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
TITLE Organization of subtelomeric repeats in Plasmodium berghei
JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt 484 510 27 bp repeat motif
BASE COUNT 209 a 40 c 97 g 247 t
ORIGIN
1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca
61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac
121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa
181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt
241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat
301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat
361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat
421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg
481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt
541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg
//
LOCUS PEURRE 1902 bp ss-rRNA RNA 17-JUL-1990
DEFINITION P.aztecus 18S rRNA, 3' end.
ACCESSION M34362
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.aztecus rRNA.
ORGANISM Penaeus aztecus
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea;
Penaeidae.
REFERENCE 1 (bases 1 to 1902)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1902 18S ribosoma RNA
BASE COUNT 340 a 369 c 427 g 332 t 434 others
ORIGIN
1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt
61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt
121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg
181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc
241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg
301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc
361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn
421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag
481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga
541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga
601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg
661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc
721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn
781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn
841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg
901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg
961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna
1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn
1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg
1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa
1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn
1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg
1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac
1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta
1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct
1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg
1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact
1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg
1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt
1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn
//
LOCUS POCRRE 1874 bp ss-rRNA RNA 17-JUL-1990
DEFINITION P.ascensionis 18S rRNA, 3' end.
ACCESSION M34358
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.ascensionis rRNA.
ORGANISM Procaris ascensionis
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea;
Procaridoidea; Procarididae.
REFERENCE 1 (bases 1 to 1874)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1874 18S ribosoma RNA
BASE COUNT 298 a 291 c 331 g 266 t 688 others
ORIGIN
1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct
61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt
121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac
181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt
241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac
301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn
361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc
481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa
541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn
601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt
781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg
841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat
901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat
961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga
1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac
1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt
1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn
1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca
1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt
1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta
1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct
1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag
1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca
1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn
1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg
1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg
1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1861 nnnnnnnnnn nnnn
//
LOCUS PRARRE 1869 bp ss-rRNA RNA 17-JUL-1990
DEFINITION P.leonensis 18S rRNA, 3' end.
ACCESSION M34363
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.leonensis rRNA.
ORGANISM Procambarus leonensis
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea;
Astacoidea; Cambaridae.
REFERENCE 1 (bases 1 to 1869)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1869 18S ribosoma RNA
BASE COUNT 409 a 412 c 459 g 419 t 170 others
ORIGIN
1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt
61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt
121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat
181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt
241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc
301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca
361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg
421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt
481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact
541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt
601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag
661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc
721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn
781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa
841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg
901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac
961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg
1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg
1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg
1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn
1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa
1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta
1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact
1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg
1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg
1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac
1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa
1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn
1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt
1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa
1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn
1861 nnnnnnnnn
//
LOCUS PVIC1RPTA 711 bp ds-DNA INV 17-JUL-1990
DEFINITION P.vivax circumsporozoite protein gene, partial cds.
ACCESSION M28745 M25758
KEYWORDS circumsporozoite protein.
SOURCE P.vivax sporozoite (isolate VK247) sporozoite DNA.
ORGANISM Plasmodium vivax
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 711)
AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
Waters,A.P. and Prasittisuk,C.
TITLE Circumsporozoite protein heterogeneity in the human malaria
parasite Plasmodium vivax
JOURNAL Science 245, 973-976 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Lanar, 25-JUN-1989.
FEATURES from to/span description
pept < 1 > 711 circumsporozoite protein (AA at 1)
site 1 15 region 1
site 697 711 region 2
site 502 597 post repeat variable region
site 598 696 post repeat constant region
BASE COUNT 261 a 151 c 226 g 73 t
ORIGIN
1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat
61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca
121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca
181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga
241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc
301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa
361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga
421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat
481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca
541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga
601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta
661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c
//
LOCUS PVIC1RPTB 657 bp ds-DNA INV 17-JUL-1990
DEFINITION P.vivax circumsporozoite protein gene, partial cds.
ACCESSION M28746 M25759
KEYWORDS circumsporozoite protein.
SOURCE P.vivax sporozoite (isolate VK210) DNA.
ORGANISM Plasmodium vivax
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 657)
AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
Waters,A.P. and Prasittisuk,C.
TITLE Circumsporozoite protein heterogeneity in the human malaria
parasite Plasmodium vivax
JOURNAL Science 245, 973-976 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Lanar, 25-JUN-1989.
FEATURES from to/span description
pept < 1 > 657 circumsporozoite protein (AA at 1)
site 1 15 region 1
site 643 657 region 2
rpt 16 477 27 bp repeats
site 478 544 3' post repeat variable region
site 545 642 3' post repeat constant region
BASE COUNT 229 a 135 c 224 g 69 t
ORIGIN
1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat
61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa
121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca
181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat
241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca
301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga
361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca
421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga
481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca
541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa
601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc
//
LOCUS SHV2A 554 bp ss-RNA VRL 17-JUL-1990
DEFINITION Simian hepatitis A virus segment 2A-encoded protein mRNA,
partial cds.
ACCESSION M34085
KEYWORDS .
SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA.
ORGANISM Simian hepatitis A virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Picornaviridae.
REFERENCE 1 (bases 1 to 554)
AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M.
TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and
genetic comparison with human HAV
JOURNAL Unpublished (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.A.Brown, 04-MAY-1990.
Author address: E.A.Brown
547 Burnett-Womack
CB# 7030
Chapel Hill, NC 27599
FEATURES from to/span description
pept < 1 > 554 segment 2A-encoded protein (AA at 1)
BASE COUNT 184 a 71 c 138 g 161 t
ORIGIN Segment 2A; map position 3108-3662.
1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg
61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag
121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa
181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg
241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg
301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg
361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat
421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt
481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg
541 tcaggaatac agga
//
LOCUS SHVVP1CP 2373 bp ss-RNA VRL 17-JUL-1990
DEFINITION Simian hepatitis A virus capsid protein VP1 mRNA, partial cds.
ACCESSION M34084
KEYWORDS capsid protein VP1.
SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed
in cwll line BS-C-1.
ORGANISM Simian hepatitis A virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Picornaviridae.
REFERENCE 1 (bases 1 to 2373)
AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M.
TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and
genetic comparison with human HAV
JOURNAL J. Virol. 63, 4932-4937 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.A.Brown, 04-MAY-1990.
FEATURES from to/span description
pept < 1 > 2373 capsid protein VP1 (AA at 1)
BASE COUNT 661 a 447 c 491 g 774 t
ORIGIN Segment P1; map position 735-3107.
1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg
61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact
121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca
181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga
241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa
301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg
361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca
421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct
481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc
541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat
601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct
661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca
721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt
781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat
841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca
901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag
961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa
1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg
1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt
1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc
1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg
1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag
1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat
1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct
1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct
1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag
1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta
1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca
1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg
1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt
1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg
1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg
1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca
1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg
2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat
2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc
2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg
2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg
2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat
2341 gatcctcgaa ctgaagagga tcgtaaattt gaa
//
LOCUS STNRRE 1885 bp ss-rRNA RNA 17-JUL-1990
DEFINITION S.hispidus 18S rRNA, 3' end.
ACCESSION M34361
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE S.hispidus rRNA.
ORGANISM Stenopus hispidus
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea;
Stenopodidae.
REFERENCE 1 (bases 1 to 1885)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustraceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1885 18S ribosoma RNA
BASE COUNT 327 a 336 c 430 g 368 t 424 others
ORIGIN
1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt
61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact
121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn
181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta
241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag
301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt
361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga
421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn
481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag
541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag
601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn
661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc
721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg
841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg
901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg
961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt
1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt
1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg
1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat
1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg
1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt
1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga
1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng
1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt
1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc
1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg
1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg
1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn
1861 nnnnnnnnnn nnnnnnnnnn nnnnn
//
LOCUS SIVGAGFA1 541 bp ss-RNA VRL 17-JUL-1990
DEFINITION Simian immunodeficiency virus gag protein (smmgagFA1) gene, partial
cds.
ACCESSION M33754
KEYWORDS gag protein.
SOURCE Simian immunodeficiency virus (isolate smm) RNA, from Cercocebus
atys peripheral blood mononuclear cells.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 541)
AUTHORS Villinger,F., Powell,J.D., Jehuda-Cohen,T., Neckelmann,N.,
Vuchetich,M., De,B.K., Folks,T.M., McClure,H.M. and Ansari,A.A.
TITLE Detection of occult SIVsmm infection in asymptomatic seronegative
nonhuman primates and evidence for variation in SIV gag sequence
between endogenous and in vitro propagated virus
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.J.Villinger, 18-APR-1990.
Author address: F.J.Villinger
Emory University,
Department of Biology
Winship, Cancer Center
Atlanta GA, 30322
Ph: (404) 248-4555.
FEATURES from to/span description
pept < 1 > 541 gag protein (AA at 1)
BASE COUNT 191 a 109 c 126 g 115 t
ORIGIN
1 tttcaagcgc tatcagaagg ctgcactccc tatgacatca atcaaatgct aaattgtgta
61 ggagaacatc aggcagccat gcaaattatt agagagatta taaatgaaga agctgctgat
121 tgggatctac aacacccgca gccaggtcca ataccagcag tacaactcag agagccaaga
181 ggatcagata ttgcaggaac cactagtaca gtggacgagc aaattcagtg gatgtatagg
241 caacaaaatc ctgtaccagt gggaaatatt tacaggagat ggatccaact tggattacaa
301 aagtgtgtca ggatgtataa tccaacaaac attctggacg taaaacaggg gccgaaagag
361 ccattccaga gctatgtaga caggttctac aaaagcttaa gagcagaaca gacagatcca
421 gcagtgaaga attggatgac ccaaacactg ctgattcaaa atgctaaccc agattgtaaa
481 ttggtgctca agggtctggg tatgaatccc actttagaag aaatgctgac agcctgtcag
541 g
//
LOCUS SIVGAGFA2 335 bp ss-RNA VRL 17-JUL-1990
DEFINITION Simian immunodeficiency virus gag protein (smmgagFA2) gene, partial
cds.
ACCESSION M33755
KEYWORDS gag protein.
SOURCE Simian immunodeficiency virus (isolate smm) RNA, from Cercocebus
atys peripheral blood mononuclear cells.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 335)
AUTHORS Villinger,F., Powell,J.D., Jehuda-Cohen,T., Neckelmann,N.,
Vuchetich,M., De,B.K., Folks,T.M., McClure,H.M. and Ansari,A.A.
TITLE Detection of occult SIVsmm infection in asymptomatic seronegative
nonhuman primates and evidence for variation in SIV gag sequence
between endogenous and in vitro propagated virus
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.J.Villinger, 18-APR-1990.
Author address: F.J.Villinger
Emory University,
Department of Biology
Winship, Cancer Center
Atlanta GA, 30322
FEATURES from to/span description
pept < 1 > 335 gag protein (AA at 1)
BASE COUNT 121 a 70 c 72 g 72 t
ORIGIN
1 tttcaagcgc tatcagaagg ctgcactccc tatgacatca atcaaatgct aaattgtgta
61 ggagaacatc aggcagccat gcaaattatt agagagatta taaataaaga agctgctgat
121 tgggatttac aacacccgca gccaggtcca ataccagcag gacaactcag agagccaaga
181 ggatcagata ttgcaggaac cactagtaca gtggacgagc aaattcagtg gatgtatagg
241 caacaaaatc ctgtaccagt cggaaacatt tacaggagat ggatccaact tgcattacaa
301 aagtgtgtca ggatgtacaa tccaacaaac attct
//
LOCUS NEUAMTR 5928 bp ds-DNA PLN 17-JUL-1990
DEFINITION N.crassa mating type protein gene, complete cds.
ACCESSION M33876
KEYWORDS mating type protein.
SOURCE N.crassa (strain 74-ORS-A) DNA.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 5928)
AUTHORS Grotelueschen,J., Metzenberg,R.L. and Glass,N.L.
TITLE The Neurospora crassa A mating type region
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.L.Glass, 26-APR-1990.
FEATURES from to/span description
pept 4121 4283 mating type protein, exon 1
4343 5046 mating type protein, exon 2
IVS 4284 4342 mating type protein intron A
signal 3858 3906 GC signal
signal 3945 3949 CAAT box
signal 5319 5326 poly-A signal
BASE COUNT 1470 a 1459 c 1526 g 1473 t
ORIGIN
1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg
61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga
121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct
181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct
241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag
301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa
361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga
421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa
481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg
541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg
601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg
661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc
721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg
781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc
841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg
901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc
961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat
1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa
1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc
1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg
1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt
1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa
1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc
1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca
1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat
1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac
1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca
1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta
1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga
1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt
1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa
1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac
1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc
1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc
2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg
2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga
2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg
2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca
2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca
2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc
2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca
2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat
2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag
2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc
2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca
2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg
2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag
2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg
2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata
2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg
3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca
3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca
3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt
3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt
3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca
3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag
3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac
3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta
3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca
3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac
3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg
3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca
3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt
3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt
3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc
3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa
3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc
4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt
4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat
4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc
4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt
4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat
4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag
4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat
4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg
4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg
4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg
4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag
4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat
4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt
4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct
4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc
4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat
4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg
5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca
5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc
5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg
5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg
5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg
5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg
5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca
5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa
5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct
5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg
5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag
5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg
5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc
5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc
5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac
//
LOCUS HUMNCADH 3451 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human N-cadherin mRNA, complete cds.
ACCESSION M34064
KEYWORDS N-cadherin; cell adhesion molecule; transmembrane protein.
SOURCE Human muscle, cDNA to mRNA, clones lambda-[4-10,1-5,13,14].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3451)
AUTHORS Walsh,F.S., Barton,C.H., Putt,W., Moore,S.E., Kesell,D.,
Spurr,N. and Goodfellow,P.N.
TITLE The N-cadherin gene maps to human Chromosome 18 and is not linked
to the E-cadherin gene
JOURNAL J. Neurochem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.H.Barton, 08-MAY-1990.
FEATURES from to/span description
pept < 1 2247 N-cadherin (AA at 1)
BASE COUNT 1041 a 703 c 724 g 983 t
ORIGIN Chromosome 18.
1 gactgggtca tccctccaat caacttgcca gaaaactcca ggggaccttt tcctcaagag
61 cttgtcagga tcaggtctga tagagataaa aacctttcac tgcggatacg tgtaactggg
121 ccaggagctg accagcctcc aactggtatc ttcattctca accccatctc gggtcagctg
181 tcggtgacaa agcccctgga tcgccagcag aatgcccggt ttcatttagg ggcacatgca
241 gtagatatta atggaaatca agtggagacc cccattgaca ttgtcatcaa tgttattgac
301 atgaatgaca acagacctga gttcttacac caggtttgga atgggacagt tcctgaggga
361 tcaaagcctg gaacatatgt gatgaccgta acagcaattg atgctgacga tcccaatgcc
421 ctcaatggga tgttgaggta cagaatcgtg tctcaggctc caagcacccc ttcacccaac
481 atgtttacaa tcaacaatga gactggtgac atcatcacag tggcagctgg acttgatcga
541 gaaaaagtgc aacagtatac gttaataatt caagctacag acatggaagg caatcccaca
601 tatggccttt caaacacagc cacggccgtc atcacagtga cagatgtcaa tgacaatcct
661 ccagagttta ctgccatgac gttttatggt gaagttcctg agaacagggt agacatcata
721 gtagctaatc taactgtgac cgataaggat caaccccata caccagcctg gaacgcagtg
781 tacagaatca gtggcggaga tcctactgga cggttcgcca tccagaccga cccaaacagc
841 aacgacgggt tagtcaccgt ggtcaaacca atcgactttg aaacaaatag gatgtttgtc
901 cttactgttg ctgcagaaaa tcaagtgcca ttagccaagg gaattcagca cccgcctcag
961 tcaactgcaa ccgtgtctgt tacagttatt gacgtaaatg aaaaccctta ttttgccccc
1021 aatcctaaga tcattcgcca agaagaaggg cttcatgccg gtaccatgtt gacaacattc
1081 actgctcagg acccagatcg atatatgcag caaaaatatt taagatacac taaattatct
1141 gatcctgcca attggctaaa aatagatcct gtgaatggac aaataactac aattgctgtt
1201 ttggaccgag aatcaccaaa tgtgaaaaac aatatatata atgctacttt ccttgcttct
1261 gacaatggaa ttcctcctat gagtggaaca ggaacgctgc agatctattt acttgatatt
1321 aatgacaatg cccctcaagt gttacctcaa gaggcagaga cttgcgaaac tccagacccc
1381 aattcaatta atattacagc acttgattat gacattgatc caaatgctgg accatttgct
1441 tttgatcttc ctttatctcc agtgactatt aagagaaatt ggaccatcac tcggcttaat
1501 ggtgattttg ctcagcttaa tttaaagata aaatttcttg aagctggtat ctatgaagtt
1561 cccatcataa tcacagattc gggtaatcct cccaaatcaa atatttccat cctgcgcgtg
1621 aaggtttgcc agtgtgactc caacggggac tgcacagatg tggacaggat tgtgggtgcg
1681 gggcttggca ccggtgccat cattgccatc ctgctctgca tcatcatcct gcttatcctt
1741 gtgctgatgt ttgtggtatg gatgaaacgc cgggataaag aacgccaggc caaacaactt
1801 ttaattgatc cagaagatga tgtaagagat aacattttaa aatatgatga agaaggtgga
1861 ggagaagaag accaggacta tgacttgagc cagctgcagc agcctgacac tgtggagcct
1921 gatgccatca agcctgtggg aatccgacga atggatgaaa gacccatcca cgccgagccc
1981 cagtatccgg tccgatctgc agccccacac cctggagaca ttggggactt cattaatgag
2041 ggccttaaag cggctgacaa tgaccccaca gctccaccat atgactccct gttagtgttt
2101 gactatgaag gcagtggctc cactgctggg tccttgagct cccttaattc ctcaagtagt
2161 ggtggtgagc aggactatga ttacctgaac gactgggggc cacggttcaa gaaacttgct
2221 gacatgtatg gtggaggtga tgactgaact tcagggtgaa cttggttttt ggacaagtac
2281 aaacaatttc aactgatatt cccaaaaagc attcagaagc taggctttaa ctttgtagtc
2341 tactagcaca gtgcctgctg gaggctttgg cataggctgc aaaccaattt gggctcagag
2401 ggaatatcag tgatccatac tgtttggaaa aacactgagc tcagttacac ttgaatttta
2461 cagtacagaa gcactgggat tttatgtgcc tttttgtacc tttttcagat tggaattagt
2521 tttctgttta aggctttaat ggtactgatt tctgaaacga taagtaaaag acaaaatatt
2581 ttgtggtggg agcagtaagt taaaccatga tatgcttcaa cacgcttttg ttacattgca
2641 tttgctttta ttaaaataca aaattaaaca aacaaaaaaa ctcatggagc gattttatta
2701 tcttggggga tgagaccatg agattggaaa atgtacatta cttctagttt tagactttag
2761 tttgtttttt ttttttttca ctaaaatctt aaaacttact cagctggttg caaataaagg
2821 gagttttcat atcaccaatt tgtagcaaaa ttgaattttt tcataaacta gaatgttaga
2881 cacattttgg tcttaatcca tgtacacctt tttatttctg tatttttcca cttcactgta
2941 aaaatagtat gtgtacataa tgttttattg gcatacgtct atggagaagt gcagaaactt
3001 cagaacatgt gtatgtatta tttggactat ggattcaggt tttttgcatg tttatatctt
3061 tcgttatgga taaagtattt acaaaacagt gacatttgat tcaattgttg agctgtagtt
3121 agaatactca atttttaatt tttttaattt ttttattttt tattttcttt ttggtttggg
3181 gagggagaaa agttcttagc acaaatgttt tacataattt gtaccaaaaa aaaaaaaaaa
3241 ggaaaggaaa gaaaggggtg gcctgacact ggtggcacta ctaagtgtgt gtttttttaa
3301 aaaaaaaatg gaaaaaaaaa agcctttaaa ctggagagac ttctgacaac agctttgcct
3361 ctgtattgtg taccagaata taaatgatac acctctgacc ccagcgttct gaataaaatg
3421 ctaattttgg ataacaaaaa aaggggaatt c
//
LOCUS HUMFGF2H 3365 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form
mRNA, complete cds.
ACCESSION M34185
KEYWORDS FGF receptor; fibroblast growth factor receptor;
transmembrane tyrosine kinase.
SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
clone h2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3365)
AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
TITLE Diverse forms of a receptor for acidic and basic fibroblast growth
factors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Johnson, 10-MAY-1990.
Author address: D.E.Johnson
University of California San Francisco
4th and Parnassus
Howard Hughes Medical Institute
San Francisco, CA 94143
(415) 476-4297
FEATURES from to/span description
pept 256 2457 fibroblast growth factor receptor (FGFr)
transmembrane form
BASE COUNT 786 a 962 c 917 g 700 t
ORIGIN
1 gcaccgagcg ccgccgggag tcgagcgccg gccgcggagc tcttgcgacc ccgccaggac
61 ccgaacagag cccgggggcg gcgggccgga gccggggacg cgggcacacg cccgctcgca
121 caagccacgg cggactctcc cgaggcggaa cctccacgcc gagcgagggt cagtttgaaa
181 aggaggatcg agctcactgt ggagtatcca tggagatgtg gagccttgtc accaacctct
241 aactgcagaa ctgggatgtg gagctggaag tgcctcctct tctgggctgt gctggtcaca
301 gccacactct gcaccgctag gccgtccccg accttgcctg aacaagatgc tctcccctcc
361 tcggaggatg atgatgatga tgatgactcc tcttcagagg agaaagaaac agataacacc
421 aaaccaaacc gtatgcccgt agctccatat tggacatccc cagaaaagat ggaaaagaaa
481 ttgcatgcag tgccggctgc caagacagtg aagttcaaat gcccttccag tgggacccca
541 aaccccacac tgcgctggtt gaaaaatggc aaagaattca aacctgacca cagaattgga
601 ggctacaagg tccgttatgc cacctggagc atcataatgg actctgtggt gccctctgac
661 aagggcaact acacctgcat tgtggagaat gagtacggca gcatcaacca cacataccag
721 ctggatgtcg tggagcggtc ccctcaccgg cccatcctgc aagcagggtt gcccgccaac
781 aaaacagtgg ccctgggtag caacgtggag ttcatgtgta aggtgtacag tgacccgcag
841 ccgcacatcc agtggctaaa gcacatcgag gtgaatggga gcaagattgg cccagacaac
901 ctgccttatg tccagatctt gaagactgct ggagttaata ccaccgacaa agagatggag
961 gtgcttcact taagaaatgt ctcctttgag gacgcagggg agtatacgtg cttggcgggt
1021 aactctatcg gactctccca tcactctgca tggttgaccg ttctggaagc cctggaagag
1081 aggccggcag tgatgacctc gcccctgtac ctggagatca tcatctattg cacaggggcc
1141 ttcctcatct cctgcatggt ggggtcggtc atcgtctaca agatgaagag tggtaccaag
1201 aagagtgact tccacagcca gatggctgtg cacaagctgg ccaagagcat ccctctgcgc
1261 agacaggtaa cagtgtctgc tgactccagt gcatccatga actctggggt tcttctggtt
1321 cggccatcac ggctctcctc cagtgggact cccatgctag caggggtctc tgagtatgag
1381 cttcccgaag accctcgctg ggagctgcct cgggacagac tggtcttagg caaacccctg
1441 ggagagggct gctttgggca ggtggtgttg gcagaggcta tcgggctgga caaggacaaa
1501 cccaaccgtg tgaccaaagt ggctgtgaag atgttgaagt cggacgcaac agagaaagac
1561 ttgtcagacc tgatctcaga aatggagatg atgaagatga tcgggaagca taagaatatc
1621 atcaacctgc tgggggcctg cacgcaggat ggtcccttgt atgtcatcgt ggagtatgcc
1681 tccaagggca acctgcggga gtacctgcag gcccggaggc ccccagggct ggaatactgc
1741 tacaacccca gccacaaccc agaggagcag ctctcctcca aggacctggt gtcctgcgcc
1801 taccaggtgg cccgaggcat ggagtatctg gcctccaaga agtgcataca ccgagacctg
1861 gcagccagga atgtcctggt gacagaggac aatgtgatga agatagcaga ctttggcctc
1921 gcacgggaca ttcaccacat cgactactat aaaaagacaa ccaacggccg actgcctgtg
1981 aagtggatgg cacccgaggc attatttgac cggatctaca cccaccagag tgatgtgtgg
2041 tctttcgggg tgctcctgtg ggagatcttc actctgggcg gctccccata ccccggtgtg
2101 cctgtggagg aacttttcaa gctgctgaag gagggtcacc gcatggacaa gcccagtaac
2161 tgcaccaacg agctgtacat gatgatgcgg gactgctggc atgcagtgcc ctcacagaga
2221 cccaccttca agcagctggt ggaagacctg gaccgcatcg tggccttgac ctccaaccag
2281 gagtacctgg acctgtccat gcccctggac cagtactccc ccagctttcc cgacacccgg
2341 agctctacgt gctcctcagg ggaggattcc gtcttctctc atgagccgct gcccgaggag
2401 ccctgcctgc cccgacaccc agcccagctt gccaatggcg gactcaaacg ccgctgactg
2461 ccacccacac gccctcccca gactccaccg tcagctgtaa ccctcaccca cagcccctgc
2521 tgggcccacc acctgtccgt ccctgtcccc tttcctgctg gcaggagccg gctgcctacc
2581 aggggccttc ctgtgtggcc tgccttcacc ccactcagct cacctctccc tccacctcct
2641 ctccacctgc tggtgagagg tggcaaagag gcagatcttt gctgccagcc acttcatccc
2701 ctcccagatg ttggaccaac acccctccct gccaccaggc actgcctgga gggcagggag
2761 tgggagccaa tgaacaggca tgcaagtgag agcttcctga gctttctcct gtcggtttgg
2821 tctgttttgc cttcacccat aagcccctcg cactctggtg gcaggtgcct tgtcctcagg
2881 gctacagcag tagggaggtc agtgcttcgt gcctcgattg aaggtgacct ctgccccaga
2941 taggtggtgc cagtggctta ttaattccga tactagtttg ctttgctgac caaatgcctg
3001 gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc ctggggccca
3061 gccccaaact gggggctctg tatatagcta tgaagaaaac acaaagtgta taaatctgag
3121 tatatattta catgtctttt taaaagggtc gttaccagag atttacccat cgggtaagat
3181 gctcctggtg gctgggaggc atcagttgct atatattaaa aacaaaaaag aaaaaaaagg
3241 aaaacgtttt taaaaaggtc atatattttt tgctactttt gctgttttat ttttttaaat
3301 tatgttctaa acctattttc agtttaggtc cctcaataaa aattgctgct gcttcaaaaa
3361 aaaaa
//
LOCUS HUMFGF3H 3503 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form
mRNA, complete cds.
ACCESSION M34186
KEYWORDS FGF receptor; fibroblast growth factor receptor;
transmembrane tyrosine kinase.
SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
clone h3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3503)
AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
TITLE Diverse forms of a receptor for acidic and basic fibroblast growth
factors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Johnson, 10-MAY-1990.
Author address: D.E.Johnson
University of California San Francisco
4th and Parnassus
Howard Hughes Medical Institute
San Francisco, CA 94143
(415) 476-4297
FEATURES from to/span description
pept 527 2722 fibroblast growth factor receptor (FGFr)
transmembrane form
BASE COUNT 777 a 1044 c 1005 g 677 t
ORIGIN
1 gcggaaccca aggacttttc tccggtccga gctcggggcg ccccgcaccg ggacggtacc
61 cgtgctgcag tcgggcacgc cgcgggcccg ccgggggcct ccgcagggcg atggagccgg
121 tctgcaagga aagtgaggcg ccgccgctgc gttctggagg aggggggcac aaggtctgga
181 gaccccgggt ggcggacggg agccctcccc ccgccccgcc tccggggcac cagctccggc
241 tccattgttc ccgcccgggc tggaggcgcc gagcaccgag cgccgccggg agtcgagcgc
301 cggccgcgga gtcttgcgac cccgccagga cccgaacaga gcccgggggc ggcgggccgg
361 agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc ccgaggcgga
421 acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg tggagtatcc
481 atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt ggagctggaa
541 gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta ggccgtcccc
601 gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc
661 ctcttcagag gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac
721 atccccagaa aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt
781 caaatgccct tccagtggga ccccaaaccc cacactgcgc tggttggaaa atggcaaaga
841 attcaaacct gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat
901 aatggactct gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta
961 cggcagcatc aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat
1021 cctgcaagca gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat
1081 gtgtaaggtg tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa
1141 tgggagcaag attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt
1201 taataccacc gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc
1261 aggggagtat acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt
1321 gaccgttctg gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga
1381 gatcatcatc tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt
1441 ctacaagatg aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa
1501 gctggccaag agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc
1561 catgaactct ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat
1621 gctagcaggg gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga
1681 cagactggtc ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga
1741 ggctatcggg ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt
1801 gaagtcggac gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa
1861 gatgatcggg aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc
1921 cttgtatgtc atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg
1981 gaggccccca gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc
2041 ctccaaggac ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc
2101 caagaagtgc atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt
2161 gatgaagata gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa
2221 gacaaccaac ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat
2281 ctacacccac cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct
2341 gggcggctcc ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg
2401 tcaccgcatg gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg
2461 ctggcatgca gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg
2521 catcgtggcc ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta
2581 ctcccccagc tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt
2641 ctctcatgag ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa
2701 tggcggactc aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc
2761 tgtaaccctc acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc
2821 tgctggcagg agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact
2881 cagctcacct ctccctccac ctcctctcca cctgctggtg agaggtggca aagaggcaga
2941 tcttttcact gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc
3001 accaggcact gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc
3061 ttcctgagct ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac
3121 tctggtggca ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc
3181 tcgattgaag gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac
3241 tagtttgctt tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg
3301 ggggcagtgt tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga
3361 agaaaacaca aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt
3421 accagagatt tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata
3481 tattaaaaac aaaaaaaaaa aaa
//
LOCUS HUMFGF4H 2283 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form
mRNA, complete cds.
ACCESSION M34187
KEYWORDS FGF receptor; fibroblast growth factor receptor;
transmembrane tyrosine kinase.
SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
clone h4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2283)
AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
TITLE Diverse forms of a receptor for acidic and basic fibroblast growth
factors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Johnson, 10-MAY-1990.
Author address: D.E.Johnson
University of California San Francisco
4th and Parnassus
Howard Hughes Medical Institute
San Francisco, CA 94143
(415) 476-4297
FEATURES from to/span description
pept 417 1325 fibroblast growth factor receptor (FGFr)
transmembrane form
BASE COUNT 566 a 621 c 612 g 484 t
ORIGIN
1 ggagcccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga ggggggcaca
61 aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct ccggggcacc
121 agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc gccgccggga
181 gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga gcccgggggc
241 ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc
301 ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg
361 tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt
421 ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta
481 ggccgtcccc gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg
541 atgatgactc ctcttcagag gagaaagaaa cagataacac caaaccaaac cgtatgcccg
601 tagctccata ttggacatcc ccagaaaaga tggaaaagaa attgcatgca gtgccggctg
661 ccaagacagt gaagttcaaa tgcccttcca gtgggacccc aaaccccaca ctgcgctggt
721 tgaaaaatgg caaagaattc aaacctgacc acagaattgg aggctacaag gtccgttatg
781 ccacctggag catcataatg gactctgtgg tgccctctga caagggcaac tacacctgca
841 ttgtggagaa tgagtacggc agcatcaacc acacatacca gctggatgtc gtggagcggt
901 cccctcaccg gcccatcctg caagcagggt tgcccgccaa caaaacagtg gccctgggta
961 gcaacgtgga gttcatgtgt aaggtgtaca gtgacccgca gccgcacatc cagtggctaa
1021 agcacatcga ggtgaatggg agcaagattg gcccagacaa cctgccttat gtccagatct
1081 tgaaggtaat catggcacca gtcttcgtgg gccagtctac tgggaaggag accactgtct
1141 cgggggctca agttcctgtg ggcaggctca gttgcccccg aatgggatca ttcctcacgc
1201 ttcaggcaca cacactccat ctcagtaggg atctagccac atcccccagg actagtaaca
1261 gaggtcacaa agtggaggtg agctgggaac agagggctgc agggatgggt ggtgctggtc
1321 tgtaataagc tttgagagca acgtcactgg ggctttgggg tcagctacac aaggaaggca
1381 tttggacccc tgccttttca ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt
1441 cttagccctg tgttctgagt tacgtacgat ctttctggca aatggggtgc atgataagag
1501 catctcttac gaagagttgg aaaaacaaat gccatatata aattctaagc catatgagga
1561 cgaggagtaa tggcattttc ttcctttttc ctctcactcc cagacattca ttgtccctga
1621 atgctccatt aatccaggga aggtaattgc ctaaatctcc agtggatctc gcaacaggaa
1681 ggaaccagaa gctgggaaag ttgtttacct ctttgtccca gagttagacc tcatcctccc
1741 ctagcttagc tgtctcagag atatactggc cctcccttct cttctctttg ctgctggtgc
1801 taaaactgct ctgtaggtca ttggccactg tctccactca caacccctgc tccagtcctg
1861 gagggagtgg gttaaacaca aatagaacat tccatttgaa gcagtgattc tttttttttt
1921 tttttttttt taatcaaatg ctttggactt ttgaagtcca cttgttctgt acttgtaaaa
1981 gggaaagaag gccgggcgca gtcgtcacgc ctgtaatccc agcactttag atcacttgag
2041 gtcaggagtt tgagaccagc ccggccaaca tggtgaaacc ccatctctac taaaaataca
2101 aaaattagct gtgcatagtg gttggcacct gtagtcccag ctactcagga ggctgaggca
2161 agctaactgc ttgaacccag aaggcagagg ttgcagtgag ctgagatcac gccactgcac
2221 tccagcctgg gtgacagagt gagtgagact ctgcgttaaa aaaaaaaaaa aaaaaaaaaa
2281 aaa
//
LOCUS HUMFGF5H 1625 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human fibroblast growth factor receptor (FGFr) secreted form mRNA,
complete cds.
ACCESSION M34188
KEYWORDS FGF receptor; fibroblast growth factor receptor.
SOURCE Human female placenta endothelial cell line HUVEC, cDNA to mRNA,
clone h5.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1625)
AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
TITLE Diverse forms of a receptor for acidic and basic fibroblast growth
factors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Johnson, 10-MAY-1990.
Author address: D.E.Johnson
University of California San Francisco
4th and Parnassus
Howard Hughes Medical Institute
San Francisco, CA 94143
(415) 476-4297
FEATURES from to/span description
pept 523 1425 fibroblast growth factor receptor (FGFr)
transmembrane form
BASE COUNT 368 a 480 c 489 g 288 t
ORIGIN
1 cggaacccaa ggacttttct ccggtccgag ctcggggcgc cccgcaggcg acggtacccg
61 tgctgcagtc gggcacgccg cgggcccggg gcctccgcag ggcgatggag cccggtctgc
121 aaggaaagtg aggcgccgcc gctgcgttct ggaggagggg ggcacaaggt ctggagaccc
181 cgggtggcgg acgggagccc tccccccgcc ccgcctccgg ggcaccagct ccggctccat
241 tgttcccgcc cgggctggag gcgccgagca ccgagcgccg ccgggagtcg agcgccggcc
301 gcggagctct tgcgaccccg ccaggacccg aacagagccc gggggcggcg ggccggagcc
361 ggggacgcgg gcacacgccc gctcgcacaa gccacggcgg actctcccga ggcggaacct
421 ccacgccgag cgagggtcag tttgaaaagg aggatcgagc tcactgtgga gtatccatgg
481 agatgtggag ccttgtcacc aacctctaac tgcagaactg ggatgtggag ctggaagtgc
541 ctcctcttct gggctgtgct ggtcacagcc acactctgca ccgctaggcc gtccccgacc
601 ttgcctgaac aagatgctct cccctcctcg gaggatgatg atgatgatga tgactcctct
661 tcagaggaga aagaaacaga taacaccaaa ccaaaccccg tagctccata ttggacatcc
721 ccagaaaaga tggaaaagaa attgcatgca gtgccggctg ccaagacagt gaagttcaaa
781 tgcccttcca gtgggacccc aaaccccaca ctgcgctggt tgaaaaatgg caaagaattc
841 aaacctgacc acagaattgg aggctacaag gtccgttatg ccacctggag catcataatg
901 gactctgtgg tgccctctga caagggcaac tacacctgca ttgtggagaa tgagtacggc
961 agcatcaacc acacatacca gctggatgtc gtggagcggt cccctcaccg gcccatcctg
1021 caagcagggt tgcccgccaa caaaacagtg gccctgggta gcaacgtgga gttcatgtgt
1081 aaggtgtaca gtgacccgca gccgcacatc cagtggctaa agcacatcga ggtgaatggg
1141 agcaagattg gcccagacaa cctgccttat gtccagatct tgaaggtaat catggcacca
1201 gtcttcgtgg gccagtctac tgggaaggag accactgtct cgggggctca agttcctgtg
1261 ggcaggctca gttgcccccg aatgggatca ttcctcacgc ttcaggcaca cacactccat
1321 ctcagtaggg atctagccac atcccccagg actagtaaca gaggtcacaa agtggaggtg
1381 agctgggaac agagggctgc agggatgggt ggtgctggtc tgtaataagc tttgagagca
1441 acgtcactgg ggctttgggg tcagctacac aaggaaggca tttggacccc tgccttttca
1501 ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt cttagccctg tgtcctgagt
1561 tacgtacgat ctttctggca aatggggtgc atgataagag catctcttac gaagagttgg
1621 aaaaa
//
LOCUS HUMINTB1A 1146 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human integrin beta-1 subunit mRNA, 3' end (cytoplasmic domain).
ACCESSION M34189
KEYWORDS integrin; integrin beta-1 subunit cytoplasmic domain.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1146)
AUTHORS Altruda,F., Cervella,P., Tarone,G., Botta,C., Balzac,F.,
Stefanuto,G. and Silengo,L.
TITLE A human integrin beta-1 subunit with a unique cytoplasmic domain
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Altruda, 10-MAY-1990.
FEATURES from to/span description
pept < 1 117 integrin beta-1 subunit (AA at 1) (cytoplasmic
domain)
BASE COUNT 373 a 184 c 222 g 367 t
ORIGIN
1 aagcttttaa tgataattca tgacagaagg gagtttgcta aatttgaaaa ggagaaaatg
61 aatgccaaat gggacacggt aagttacaaa acatccaaaa agcaaagtgg cttataaagt
121 aaatgtaata ctcctaagac ttatgtatta gctgtcaggc tgattattaa agtcctttct
181 aagtatttta ttcccccaaa agtttcttac tcaaggaatt tgcatttagt gaaaaacaga
241 aagcatccta aatatatccc attgaaacaa aacattgatt ataagcatgt atattctggt
301 tcatgtggcc gatattttta tttctttaat gattttgatc ctaaatctgc cttttcatct
361 aatgtgaagt agaatcctaa ataatgttat ctgtgtagca agctattcaa tgggaaagct
421 gcttctttct ttaaaacaaa caaacaaaaa aaaccttcag tggaaagcca aattccaaaa
481 ggttatatac caagcttgtc caactcgcag ctcgtcggcc aggacatgca gcccagaata
541 gctttgaatg tggccccaac acaaatttgt aaactttctt agaaattgta attattatta
601 ttattttttt ttggtaactt tttttaaagc tcatcagcta tcgttagtgt attttatgtg
661 tggcccaaga cagttcttct tcttgccagt gtggcccagg gaagccaaaa gattggacac
721 ccctgctata tactatatga ttccatttag aggacattct ggaaaagcaa aactgtaggg
781 gcaaaaatca gtggttgcta ggggctggaa tgggggaaag tgttgaccac agaggggcgt
841 aagggatctt ccttgggatg acttgattgt gggtggattt atgtatttga aaactcacag
901 aactatgtac tttaaaaaga tgtatgttcc tctatgaaaa ttatatctca gtaaactttg
961 gcttataaaa atcttaaaag ccctaagtga ccgaaaggtt atgttagcat tgagtgcttt
1021 gaaatatgga gtcagagggt ggggtaacca aatgttggcc tttgtgtatt catcttttga
1081 tacaagaaag caatgccaat cttcagtatt tttaaattgt aaatgaattt tgtagttccc
1141 gaattc
//
LOCUS RATIRF1A 2078 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Rat interferon regulatory factor 1 (IRF-1) mRNA, complete cds.
ACCESSION M34253
KEYWORDS interferon regulatory factor 1; transcription factor.
SOURCE Rat cell line Nb2-11c T-cell, cDNA to mRNA, clones 25,4b.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2078)
AUTHORS Yu-Lee,L.-Y., Hrachovy,J.A., Stevens,A.M. and Schwarz,L.A.
TITLE Interferon regulatory factor 1 is an immediate-early gene under
transcriptional regulation by prolactin in Nb2 T cells
JOURNAL Mol. Cell. Biol. 3087, 3094 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.-Y.Yu-Lee, 11-MAY-1990.
FEATURES from to/span description
pept 198 1184 interferon regulatory factor 1 (IRF-1)
BASE COUNT 542 a 544 c 540 g 452 t
ORIGIN
1 ctcgacgaag gagtaggacg agctctcact gtctgagcca aaccgaaccg ggccgagctg
61 agccgaggtc agcggtggcc agaggaaccc agcatctcgg gcatcattcg ctccgtgcac
121 gcatcgtgta cctacaccgc aactccgtgc ctcattcccg ggtaccctct gtgactcgct
181 cctgcagcaa agccaccatg cctatcactc ggatgcgaat gagaccctgg ctagagatgc
241 agattaattc caaccaaatt ccagggctga gctggatcaa taaagaagag atgatcttcc
301 agatcccatg gaagcatgct gccttgcacg gttgggatat caacaaggat gcctgtctgt
361 tccggagctg ggccattcac acaggccgat acaaagctgg ggaaaaagag ccagatccca
421 agacttggaa ggcaaacttc cggtgtgcca tgaactccct accagacatc gaggaagtga
481 aggaccagag caggaacaag ggcagctctg ctgtacgcgt gtaccggatg ctgccacccc
541 tcaccaagaa ccagaggaaa gagagaaagt ccaagtccag ccgtgacact aagagcaaaa
601 ccaagaggaa gctgtgcgga gattctagcc ctgacacctt atctgacgga ctgagcagct
661 ctactctgcc tgatgaccac agcagttaca cagctcaggg atacctgggt caggacttgg
721 acatggacag ggacattacc ccagctctgt caccgtgcgt cgtcagcagc agtctctctg
781 agtggcatat gcagatggac atcatgccag acagcaccac tgatctgtac aacttgcagg
841 tgtcgcccat gccctccacc tctgaagctg caacagatga ggatgaggaa gggaagttac
901 ctgaggacat catgaagctc tttgaacagt ctgagtggca gccgacgcac gtggatggca
961 agggatactt gctcaatgaa ccaggagccc aactctctac tgtctatgga gacttcagct
1021 gcaaggagga accagagatc gacagccctg gaggggacat cgagataggc atacagcgtg
1081 tcttcacaga gatgaagaat atggaccccg tcatgtggat ggacaccctg ctgggcaact
1141 ctaccaggcc gccctccatt caggctattc cttgtgcacc ataatttggg tccctgaccc
1201 gttcttgccc tcctgagtga gctaggtcca gcatcatggt ggctgtgata caacataaag
1261 ctaaacttcc gtggacccct tgatgtggca aaacataatc ccattgccaa gcagggaagg
1321 gaccaaacca tcctccttgg gtcagtggac tgactcttca gagcttagga ggcagggtct
1381 aagtttttca agctggtcct gactcctagg aagatggatt ggcgttctga ggttagtgtg
1441 aggcagagga cctggacgga agttaccttc tagctctttg aaagcttcat tgcttagaga
1501 gggtctcacc actgggctgg cctgggggat agaccagcgc ccacagaaga gcattgcact
1561 ggccttaggg ctggctccac actgggagac aattgcacta agtcctattc ccaaagaact
1621 gctgcccttc ccaaccgagc cctgggatgg ttctagagcc agtgaaatgt gaaggaaaaa
1681 atggggtcct gtgagggttg tctcccttag cctcagaggg attctgcctc actccctgct
1741 ccagctgtgg ggctcaggaa aaaaaaatgg cactttctct gtggactttg ccacatttct
1801 gatcagaagt gtacactaac atttctccca agtcttggcc tttgcattta tttatatagt
1861 gccttgccct gtgcctgctg tctctcctca ggcctcagca gtcctcagca ggcccaggga
1921 gggggttgtg agcgccttgg cgtgactctg aacattggaa acgccaccta actactaagt
1981 tgtgtctgat ctcgtgtgga tctgtgtaaa tatgtatatt catcttttta taaaaaccta
2041 agttgtttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS ACCRRSAA 1536 bp ss-rRNA RNA 17-JUL-1990
DEFINITION A.calcoaceticus 16S ribosomal RNA.
ACCESSION M34139
KEYWORDS 16S ribosomal RNA.
SOURCE A.calcoaceticus (strain 33604) ribosomal RNA.
ORGANISM Acinetobacter calcoaceticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1536)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept. Microbiology
131 Burrill Hall
407 S. Goodwin Ave.
Urbana, IL 61801
(217) 333-9369
FEATURES from to/span description
rRNA 1 1536 16S ribosomal RNA
BASE COUNT 387 a 329 c 460 g 330 t 30 others
ORIGIN
1 ttaactgaag agtttgatca tggctcagat tgaacgctgg cggcaggctt aacacatgca
61 agtcgagcgg ggaaggttgc ttcggtaact gactagcggc ggacgggtga gtaatgctta
121 ggaatctgcc atttagtggg ggacaacatt ccgaanggaa tgctaatacc gcatacgtcc
181 tacaggagaa agcaggggat ctccggacct tgcgctaaat gatgagccta agtcggatta
241 gctagttggt ggggtaaagg cctaccaagg cgacgatctg tagcgggtct gagaggatga
301 tccgccacac tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata
361 ttggacaatg ggcgcaagcc ngatccagcc atgccgcgtg tgtgaagaag gccttttggt
421 tgtaaagcac tttaagcgag gaggaggctc tcttagttaa tacctaagat gagtggacgt
481 tactcgcaga ataagcaccg gctaactctg tgccagcagc cgcggtaata cagagngtgc
541 gagcgttaat cggatttact gggcgtaaag cgtgcgtagg cggcttttta agtcggatgt
601 gaaatccccg agcttaactt gggaattgca ttcgatactg ggaagctaga gtatgggaga
661 ggatggtaga attccaggtg tagcggtgaa atgcgtagag atctggagga ataccgatgg
721 cgaaggcagc catctggcct aatactgacg ctgaggtacg naagcatggg gagcaaacag
781 gattagatac cctggtagtc catgccgtaa acgatgtcta ctagccgttg gggcctttga
841 ggctttagtg gcgcagctaa cgcgataagt agactgcctg gggagtacgg tcgcaagact
901 aaaactcaaa tgaattgacg ggggcncgca caagcggtgg agcatgtggt ttaattcgat
961 gcaacgcgaa gaaccttacc tggccttgac atactagaaa ctttccagag atggattggt
1021 gccttcggga atctagatac aggtgctgca tggctgtcgt cagctcgtgt cgtgagatgt
1081 tgggttaagt cccgcaacga gcgcaaccct tttccttact tgccagcatt tcggatggga
1141 actttaagga tactgccagt gacaaactgg aggaaggcgg ggacgacgtc aagtcatcat
1201 ggcccttacg gctagggcta cacacgtgct acaatggtcg gtacaaaggg ttgctaccta
1261 gcgataggat gctaatctca aaaagccgat cgtagttcgg attggagtct gcaactcgac
1321 tccatgaagt cggaatcgct agtaatcgcg gatcagaatg ccgcggtgaa tacgttcccg
1381 ggccttgtac acaccgcccg tcacaccatg ggagtttgtt gcaccagaag tagctagcct
1441 aactgcaaag agggcggtta ccacggtgtg gccgatgact agggnnnnnn ngtaacaagn
1501 nnnnnnnnnn ngaacctgnn nnnngatcac ctcctt
//
LOCUS BDERRSAA 1553 bp ss-rRNA RNA 17-JUL-1990
DEFINITION B.stolpii 16S ribosomal RNA.
ACCESSION M34125
KEYWORDS 16S ribosomal RNA.
SOURCE B.stolpii (strain uki-2) ribosomal RNA.
ORGANISM Bdellovibrio stolpii
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 1 to 1553)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept Microbiology, 131
131 Burrill Hall
407 S. Goodwin Ave.
Urbana, IL 61801
(217) 333-9369
FEATURES from to/span description
rRNA 1 1553 16S ribosomal RNA
BASE COUNT 381 a 320 c 429 g 316 t 107 others
ORIGIN
1 agcatnnaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa
61 gtcgaacgtg aaagtccttc gggatgagta aagtggcgca cgggtgagta acacgtaggt
121 gacctgcctt ttagagggga ataaccagaa gaaattttgg ctaatgccgc atacgaagca
181 cggttttaag actgtgcttg aaagaatgcc tctgcatatg ngcattcgct attagatggg
241 cctgcgggac attagctagt tggtggggta aaggcctacc aaggcgacga tgtctatccg
301 gtctgagagg atgatcggac acactggaac tgagacacgg tccagactcc tacgggaggc
361 agcagtgggg aatattgcgc aatgggggaa accctgacgc agcaacgccg cgtgagtgag
421 gaaggacttc ggtctgtaaa gctctgttaa tgtggaaaaa tggcagttgg tctaataggc
481 cnattgtttg atggtacaca tagaggaagc accggctaac ttcgtgccag cagccgcggt
541 aatacgaagg gtgcnagcgt tgttcggatt tattgggcgt aaagcgcgcg taggcggacc
601 tgcaagtcag atgtgaaatc tcggggctca acctcgaaac tgcgtctgaa actacaggtc
661 tagaatctcg gagggggaag gggaatatcg catgtagggg taaaatccgt agatatgcga
721 tggaacacca gaggcgaagg cgccttcctg gacgagtatt gacgctgagg cncnnaagcg
781 tggggatcaa acaggattag ataccctggt agtccacgct gtaaacgatg aacactagat
841 attggaggat ttgacccctt cagtgtcgta gctaacgcgt caagtgttcc gcctgggaag
901 tacggtcgca agactaaaac tcaaaggaat tgnnnnnnnn nngcacaagn nnnngattat
961 gnngtttaat tcgnngcaac gcgcagaacc ttacctaggc ttgaaatcct acgaatccct
1021 tttaaacgag ggagtgctct tcggagaatg tagtgacagg cgctgcatgg ctgtcgtcag
1081 ctcgtgtcgt gagatgttgg gttaagtctc gcaacgagcg caacccccat ttttagttgc
1141 cagcattaag ttgggcactc tagaaagact gcntgggcta accaggagga aggtggggat
1201 gacgtcaagt cctcatggcc cttatgtcta gggctacaca cgtaatacaa tggtcggtac
1261 aaagggatgc gaactcgcga gggggagcca atctcaaaaa accgatctca gtccggattg
1321 gagtctgcaa ctcgactcca tgaagttgga atcgcgagta atcgcggatc agcacgccgc
1381 ggtgaatacg ttcccgggcc ttgtacacac cgcccgtcac accatgggag ttgtttttac
1441 ctgaagnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1501 nnnnnnngta acaagnnnnn nnnnnnngaa cctgnnnnnn gatcacctcc ttt
//
LOCUS PLTRRSAA 1525 bp ss-rRNA RNA 17-JUL-1990
DEFINITION P.staleyi 16S ribosomal RNA.
ACCESSION M34126
KEYWORDS 16S ribosomal RNA.
SOURCE P.staleyi (strain ATCC 27377) ribosomal RNA.
ORGANISM Planctomyces staleyi
Prokaryota; Bacteria; Eubacteriomycetes; Eubacteriales.
REFERENCE 1 (bases 1 to 1525)
AUTHORS Woese,C.R. and Oyalzu,H.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 15-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept Microbiology
131 Burrill Hall
407 S. Goodwin Ave.
Urbana, IL 61801
(217) 333-9369
FEATURES from to/span description
rRNA 1 1525 16S ribosomal RNA
BASE COUNT 376 a 348 c 482 g 315 t 4 others
ORIGIN
1 caattgaaga gtttgatcct ggctcagaat gaacgttggc ggcatggatt aggcatgcaa
61 gtcgtgcgcg atatgtagca atacatggag agcggcgaaa gggagagtaa tacgtaggaa
121 cctaccttcg ggtctgggat agcggcggga aactgccggt aataccagat gatgtttccg
181 aaccaaaggt gtgattccgc ctgaagaggg gcctacgtcg tattagctag ttggtagggt
241 aatggcctac caaggcaaag atgcgtatgg ggtgtgagag catgccccca ctcactggga
301 ctgagacact gcccagacac ctacgggtgg ctgcagtcga gaatcttcgg caatgggcga
361 aagcctgacc gagcgatgcc gcgtgcggga tgaaggcctt cgggttgtaa accgctgtcg
421 taggggatga agtgctaggg ggttctccct ctagtttgag ctgaacctag gaggaagggc
481 cggctaatct cgtgccanna gccgcggtaa tacgagaggc ccaaacgtta ttcggattta
541 ctgggcttaa agagttcgta ggcggtcttg taagtggggt gtgaaatccc tcggctcaac
601 cgaggaactg cgctccaaac tacaagactt gagggggata gaggtaagcg gaactgatgg
661 tggagcggtg aaatgcgttg atatcatcag gaacaccgga ggcgaaggcg gcttactggg
721 tcctttctga cgctgaggaa cgaaagctag gggagcaaac gggattagat accccggtag
781 tcctagccgt aaacgatgag cactggaccg gagctctgca cagggtttcg gtcgtagcga
841 aagtgttaag tgctccgcct ggggagtatg gtcgcaaggc tgaaactcaa aggaattgac
901 gggggctcac acaagcggtg gaggatgtgg cttaattcga ggctacgcga agaaccttat
961 cctagtcttg acatgcttag gaatcttcct gaaagggagg agtgctcgca agagagcctt
1021 tgcacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgtcgggt taagtccctt
1081 aacgagcgaa acccttgtcc ttagttacca gcgcgtcatg gcggggactc taaggagact
1141 gccggtgtta aaccggagga aggtggggat gacgtcaagt cctcatggcc tttatgatta
1201 gggctgcaca cgtcctacaa tggtgcacac aaagcgacgc aaactcgtga gagccagcta
1261 atcgcaaaaa atgtacctca gttcggattg caggctgcaa ctcgcctgca tgaagctgga
1321 atcgctagta atcgcgggtc agcataccgc ggtgaatntg ttcctgagcc ttgtacacac
1381 cgcccntcaa gccacgaaag tgggggggac ccaacagcgc tgccgtaacc gcaaggaaca
1441 aggcgcctaa ggtcaactcc gtgattggga ctaagtcgta acaaggtagc cgtaggggaa
1501 cctgcggctg gatcacctcc tttct
//
LOCUS RDCRRSAA 1478 bp ss-rRNA BCT 17-JUL-1990
DEFINITION R.purpureus 16S ribosomal RNA.
ACCESSION M34132
KEYWORDS 16S ribosomal RNA.
SOURCE R.purpureus (strain 6770) ribosomal RNA.
ORGANISM Rhodocyclus purpureus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 1478)
AUTHORS Woese,C.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Woese, 09-MAY-1990.
Author address: C.R.Woese
University of Illinois
Dept. Microbiology
131 Burrill Hall
407 S. Goodwin Ave.
Urbana, IL 61801
(217) 333-9369
FEATURES from to/span description
rRNA 1 1478 16S ribosomal RNA
BASE COUNT 367 a 345 c 470 g 289 t 7 others
ORIGIN
1 tgaactgaag agtttgatcc tggctcagat tgaacgctgg cggcatgcct tacacatgca
61 agtcgaacgg taacgggncc ttcgggcgcc gaacgagtgg cgaacgggtg agtaatgcat
121 cggaacatgc cctgaagtgg gggataacgt agcgaaagtt acgctaatac cgcatattct
181 gtgagcagga aagcagggga ccttcgggcc ttgcgctttg ggagtggccg atgtcggatt
241 agctagttgg tggggtaaaa gcctaccaag gcaacgatcc gtagcgggtc tgagaggatg
301 atccgccaca ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat
361 tttggacaat gggcgaaagc ctgatccagc catgccgcgt gagtgaagaa ggccttcggg
421 ttgtaaagct ctttcggcgg ggaagaaatc gggtttccta atacggaacc cggatgacgg
481 tacccgaaga agaagcaccg gctaactacg tgccagcagc cgcggtaata cgtagggtgc
541 nagcgttaat cggaattact gggcgtaaag cgtgcgcagg cggttgtgta agacagacgt
601 gaaatccccg ggctcaacct gggaactgcg tttgtgactg cacagctaga gtacggcaga
661 ggggggtgga attccacgtg tagcagtgaa atgcgtagag atgtggagga acaccgatgg
721 cgaaggcagc cccctgggcc aatactgacg ctcatgcacg naagcgtggg gagcaaacag
781 gattagatac cctggtagtc cacgccctaa acgatgtcaa ctaggtgttg gtggggttaa
841 acccattagt gccgtagcta acgcgtgaag ttgaccgcct ggggagtacg gcggcaaggt
901 taaaactcaa aggaattgac gggganccgc acaagcggtg gatgatgtgg attaattcga
961 tgcaacgcga aaaaccttac ctacccttga catgtcagga atcctgagga gactcgggag
1021 tgcccgaaag ggnacctgaa cacaggtgct gcatggcngt cgtcagctcg tgtcgtgaga
1081 tgttgggtta agtcccgcaa cgagcgcaac ccttgtcatt aattgccatc attcagttgg
1141 gcactttaat gaaactgccg gtgacaaacc ggaggaaggt ggggatgacg tcaagtcctc
1201 atggccctta tgggtagggc ttcacacgtc atacaatggt cggtccatag ggttgcnaac
1261 ccgcgagggg gagctaatcc cagaaagccg atcgtagtcc ggattgcagt ctgcaactcg
1321 actgcatgaa gtcggaatcg ctagtaatcg cggatcagca tgtcgcggtg aatacgttcc
1381 cgggtcttgt acacaccgcc cgtcacacca tgggagcggg ttctgccaga agtagttagc
1441 ctaaccgcaa ggagggcgat taccacggca gcgttcgt
//
LOCUS RATSVPIIA 4161 bp ds-DNA ROD 17-JUL-1990
DEFINITION Rat seminal vesicle secretion II protein (SVS II) gene, complete
cds.
ACCESSION J05443
KEYWORDS seminal vesicle secretion II protein.
SOURCE Rat (strain CHARLES RIVER) male seminal vesicle epithelial cell
DNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4161)
AUTHORS Harris,S.E., Harris,M.A., Johnson,C.M., Bean,M.F., Dodd,J.G.,
Matusik,R.J., Carr,S.A. and Crabb,J.W.
TITLE Structural characterization of the rat seminal vesicle secretion II
protein and gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.E.Harris, 25-APR-1990.
FEATURES from to/span description
pept 2065 2137 seminal vesicle secretion II protein (SVS II)
precursor, exon 1
2377 3548 seminal vesicle secretion II protein
precursor, exon 2
sigp 2065 2130 seminal vesicle secretion II protein signal
peptide
matp 2131 3545 seminal vesicle secretion II protein
signal 1934 1946 CAAT box
signal 2013 2020 TATA box
BASE COUNT 1243 a 902 c 822 g 1194 t
ORIGIN
1 tttcgatcca atgtgtggat tactcaccaa gtgtctgtct ttctttcttt ctctctttct
61 ctctttcttc ctcccttcct tccttccttc cttccttcct tccttccttc cttccttcct
121 ttctgttcaa ttgctcgttt ctcccttcat ctctcgccag tataccgcac actcaaactt
181 aaattttcat ttcaatgcgt tctcttctgg cacgtgcagc ataattacac tcatgattgt
241 caactccgtg atctgtttgc acaccttacc ccccccccca aggttttatc tgcatttaaa
301 aaaaagggat tatcaagaaa tttattctta attcagaaat gtgatcaaag ttgtcagatc
361 cgttctttac tgcctcctgt tggaaaaaaa aaatatccag ttcctggatt tttctaaaac
421 acagaaaaga gacctgggac aggggtatag gattgagcag gcatggtgag caattttata
481 ctgaatagat tcattgtgac ttaccggtct cctgagggaa ataatcactt ttcccaggta
541 gagagcagcc tagcaagaga tcagagtgca agcataaaac ccatgtgctt tataagtgta
601 tttattttat gcattttctg tttataagga catgagtgga ctttttattt gtcccttcca
661 tacaggacta cctagactat tgggatggga tgactgaaaa tatgttttca agtagacttc
721 cttccggaac taccttcata tggttctgaa ggcaaagtgg aacactgcac gggtgtcctc
781 ttctcccaag aacttggcca tggcgtcgtc gttttgagtc tatgtctgag ccacgaatgc
841 cataacagcc cttcctgtta ctctcacagt ggcacagagc tgtttctaaa caagaaggaa
901 gtcttccatc ttgtgtcagg atgctaatga cgtcaccaat ggcagtaagt gttcaccaca
961 gcccgttgct aaggcaatta tgttatccct cctgtcagag tttcctgtat taaaatatac
1021 tgagtttaat tttatgtcgg attccatgac atacattcag caaggaaacc aacagtatct
1081 tttgttcttt caacagtgat ttcctgtcac catttaactg ttgtctcgcc cccattcttt
1141 aaaatgtctc tgcacctcac cttgcctccc agatacactc ccaaactcat ttccctggac
1201 acacttgaaa tgttgctgct agcaagccac agctaccacg tcttctctgt cagggttcta
1261 gacaactcat ctaaagcagc accaggtctc tcttagaaat cagacatcgg atgtcatggt
1321 catagtatac ctcacagcta ctttggacat tcatgggccc agtattattt tccagggctg
1381 aggtttaact caagagcctc atgctcacat ggctggtggt ctggccacac agctatgact
1441 cgtctccatt tattcttcaa acttttattc ggagctccgt tgtgttctgt tgtctcctgt
1501 gcctttctat atgtgtgact gctcctttgc ctgtaaatga gaagctatgt caaattcaac
1561 gtaaaaaagg caacttcatg ggcttctgtg agatagcatg ctaaacagtg tcagctccac
1621 tacactgtga ccaggaaaat ttgatcaggc cctggttact ctcggagcat aaaagaagaa
1681 aaaaaaatct cttccccgct ctactctgga ttttgtttga aaataaaagg tccaatctgt
1741 ccttataaaa catgcataga ataaatatac tagaaaacac actttgtttg caaagggtat
1801 gtgataaagt cagagggttg ataaagattt gctgaggctt atgacataga aaaggtccct
1861 gacattgcat ccctgtgcaa agtacctggg aacattacca atgtccccaa ctgtgcagag
1921 gggaggaagt tgacatttag agataatttt tttaaaaaag caggcagtgc ttttgtagtg
1981 tcagttatat ctgtaataca tccagctaga gatatataaa tgtgaaagtc agctcagctc
2041 tcagtgaagg tccttcttga caagatgaag tcctctgtct tcattctatc tctgttcctc
2101 cttctggaaa gacaggcagc tgtggttgga cagtatggtg agtagggaga tggtgactag
2161 agggaaagtc actcagggag aatgttttta agggtgctct gggagtagca gatcctttca
2221 taggggaatt tttttttaaa tgagacctaa ttcttctcta ctgaaaacca aaacccttgt
2281 gggaacatca atggttttat gaggaaattt tggaaatgag acttggaagg actgtgcaga
2341 tcatgtaact taaaccttcc tcctctcaat taccaggtgg gacaaaaggt cacttccaga
2401 gcagctcatc agggtttatg cttggtcaga aaggccacct caattttggg ctcaaaggag
2461 gaagtgagga agcagctgaa gaaagcattt tcatgcaatc acaacaccag atgttcggcc
2521 aggatggtgg tgacatggcg cagacaagtg tttcacaaga gcatacaggt gtaaaggggg
2581 ccgcgatttg tcgtaaagga caagtatccc aattgaaatc ccaagaatcc caaataaaat
2641 cctttagaca agtaaaatcc agtggacagc tgaaatctgg aggatcccaa ttaaaatcct
2701 ttggacaagt gaaatccagt gagtcccaat taaaatcctt tggccaagtg aaagccagtg
2761 ggtcccaatt aaaatccttc ggacaagtga aagccagtgg gtcccaatta aaatcctatg
2821 gacaaatgaa atccagtggg tcccaagtga aatcctttgg acaaatgaaa tccagtgggt
2881 cccaagtaaa atcctttgga caaatgaaag ccagtgagtc ccaaataaaa tcctttggac
2941 aaagaaaatc ccaaggtggt caactacaat cctatggcca aatgaaatcc tatgggcaga
3001 cgaaatccct agaatcccag gccaaatcct tcggacaagt aaagtcccaa agtggccaaa
3061 tgaaatcctc ctatggtcag agaaaatcct atggtgaaga gactcaactg aagtctttcg
3121 accaagatgc ccaactaaaa tcctatggtc aacaaaaatc ccaaaaacaa tcctccttta
3181 gccaagtaaa atctcaaagt gcccaactaa agtcctttgg ccaacaaaaa tccctcaaag
3241 ggttttctca acaaactcaa cagaaaggat ttgccatgga tgaagatttg tcacaagtgc
3301 ggaaacaatt tgacgatgat gacctctctg tacaacagaa gtctacccaa cagatgaaaa
3361 cagaggaaga cttatcccaa tttggacaac aacgacaatt tggacaagaa cgctcccaat
3421 cctataaagg atatcttgca caatacagaa agaaattaca ggaacaacaa caacagaaaa
3481 attttaatca ggataacttt tttacaaagg gaggggcagg cctatatcag gctcaactta
3541 agggataaca tattcactga gcaactgaag accaagatca atgtcaaggt atgttccacc
3601 aagtaggaag atattatcca aatttacttg tggtatatag gaatcctgga tccattatgg
3661 attgataccc atttgttact atcagtagaa gtattgttac acacttttag aaggatgaag
3721 aacagaccct ggtaaaatga gtccttgtag agtaaaggca gagtaagcaa gctaagttaa
3781 caattggtcc tgaattacta cattcaggga gcacttttca gtgcttctct gagcacagac
3841 agtttatatt attaaatgtg taccacctat gcaatcatat ttaacatttc atgatggaat
3901 cttatttgtt cttacacttt gacttgataa aaaaaaaaga ttggtttctt gcttatattg
3961 gtataaggtg gtattgcagc tgagctcttt ctctacacca gtgcgtgttc ttgagtcccc
4021 tgggacctct gctttccatc acaatccatg gggttaagga ttagctgctt ttccatcaga
4081 tggaagattt ggttacaaag atctctgcct ggagcagaca ctatattcag ttgtatgtcc
4141 aatggtgacc ctgttgaatt c
//
LOCUS CREAPCYN 577 bp ss-mRNA PLN 17-JUL-1990
DEFINITION C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds.
ACCESSION J05524
KEYWORDS apoplastocyanin.
SOURCE C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone
PC6-2.
ORGANISM Chlamydomonas reinhardtii
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Volvocales; Chlamydomonadaceae.
REFERENCE 1 (bases 1 to 577)
AUTHORS Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and
Bogorad,L.
TITLE Isolation and characterization of a complementary DNA clone for an
algal pre-apoplastocyanin
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Hill, 11-MAY-1990.
FEATURES from to/span description
pept 22 459 apoplastocyanin (PC6-2) precursor
sigp 22 162 apoplastocyanin signal peptide
matp 163 456 apoplastocyanin
BASE COUNT 91 a 189 c 171 g 126 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc
61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt
121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg
181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag
241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc
301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc
361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac
421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat
481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac
541 gcagagagcg tgtagctctg taactttttc ggaattc
//
LOCUS MHVNSGII 870 bp ss-RNA VRL 17-JUL-1990
DEFINITION Murine hepatitis virus non-structural protein gene-2 (NS2).
ACCESSION M34035
KEYWORDS non structural protein.
SOURCE Murine hepatitis virus (strain MHV-JHM), cDNA to viral RNA.
ORGANISM Murine hepatitis virus A59
Unclassified.
REFERENCE 1 (bases 1 to 870)
AUTHORS Schwarz,B., Routledge,E. and Siddell,S.G.
TITLE The coronavirus MHV 30 kDa non-structural protein NS2 is not
essential for virus replication in transformed murine cells
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.G.Siddell, 04-MAY-1990.
Author address: S.G.Siddell
Inst of Virology
Univ of Wuerzburg
Versbacherstrasse 7
8700 Wuerzburg
FEATURES from to/span description
pept 40 837 non-structural protein-2 (NS2)
BASE COUNT 269 a 147 c 187 g 267 t
ORIGIN
1 gcgatagcct agtaaatgtt aaataaatct atacttgtca tggctgcgag aatggccttt
61 gctgacaagc ctaatcattt tataaacttt cctctagccc aatttagtgg ctttatgggt
121 aagtatttaa agcttcagtc tcaacttgtg gaaatgggtt tggactgtaa attacaaaag
181 gtaccacatg ttagtattac cctgcttgac attaaagcag accaatacaa acaggtggaa
241 tttgcaatac aagaaataat agatgatctg gcggcatatg agggagatat tgtctttgac
301 aaccctcata tgcttggcag atgtcttgtt cttgatgtta aaggatttga agagttgcat
361 gaagatattg ttgaaattct ccgcagaagg ggttgcactg cagatcaatc cagacaatgg
421 attccgcact gcactgtggc ccaatttgat gaagaaaaag aaataaaaga aatgcaattc
481 tattttaaat tgcccttcta tctcaagcat aacaacctac ttacggatgc taggcttgag
541 cttgtgaaga taggttcttc caaagtaggt gggttttatt gtagtgaact aagtatttgg
601 tgtggtgaga gactttgtta caagccccca acccccaaat tcagtgatat atttggctat
661 tgctgcatag ataaaatacg tggtgattta gaaataggag acctaccgcc agatgatgag
721 gaagcgtggg ccgagctaag ttaccactat caaagaaaca cctacttctt cagacatgtg
781 cacgataata gtatctattt tcgtaccgta tgtagaatga agggttgtat gtgttgattt
841 gtttttacac tattagtgta ataaacttat
//
LOCUS HUMFGF1A 3343 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds.
ACCESSION M34641
KEYWORDS FGF receptor-1; fibroblast growth factor receptor;
transmembrane glycoprotein; tyrosine kinase.
SOURCE Human teratocarcinoma cell line Tera-2, clone 13, cDNA to mRNA,
clone 1:1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3343)
AUTHORS Wennstroem,S., Sandstroem,C. and Claesson-Welsh,L.
TITLE cDNA cloning and expression of a human FGF receptor which binds
acidic and basic FGF
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Claesson-Welsh, 25-MAY-1990.
Author address: L.Claesson-Welsh
Ludwig Institute for Cancer Research
Biomedical Center
Box 595
S-751 24 Uppsala
SWEDEN
FEATURES from to/span description
pept 10 2472 FGF receptor-1 precursor
sigp 10 72 FGF receptor-1 signal peptide
matp 73 2469 FGF receptor-1
BASE COUNT 766 a 957 c 911 g 709 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca
61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct
121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg
181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac
241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc
301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat
361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag
421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa
481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct
541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct
601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct
661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc
721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca
781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg
841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag
901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc
961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat
1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg
1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc
1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg
1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag
1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct
1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg
1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc
1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg
1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac
1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg
1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc
1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca
1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac
1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc
1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata
1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac
1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac
2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc
2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg
2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca
2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc
2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc
2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag
2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc
2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc
2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg
2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct
2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc
2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg
2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc
2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc
2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc
2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac
3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc
3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat
3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc
3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga
3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt
3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc
//
LOCUS ECOFADA 1594 bp ds-DNA BCT 17-JUL-1990
DEFINITION E.coli 3-ketoacyl-coenzyme A thiolase (fadA) DNA.
ACCESSION J05498
KEYWORDS 3-ketoacyl-coenzyme A thiolase.
SOURCE E.coli (strain K12) DNA, clone pK52.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1594)
AUTHORS Yang,S.-Y., Yang,X.-Y.H., Healy-Louie,G., Schulz,H. and Elzinga,M.
TITLE Nucleotide sequence of the fadA gene: Primary structure of 3-
ketoacyl-coenzyme A thiolase from Escherichia coli and the
structural organization of the fadAB operon
JOURNAL J. Biol. Chem. 265, 10424-10429 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-Y.Yang, 11-MAY-1990.
FEATURES from to/span description
pept 110 1276 3-ketoacyl-coenzyme A thiolase (fadA)
precursor (E.C. 2.3.1.16)
sigp 110 139 3-ketoacyl-coenzyme A thiolase signal peptide
matp 140 1273 3-ketoacyl-coenzyme A thiolase
BASE COUNT 329 a 425 c 489 g 351 t
ORIGIN 86 min on K12 map.
1 tgaagtgccg gaaggtctgc gtaataaagc gcgtcataac gaaccgtact atcctccggt
61 tgagccagcc cgtccggttg gcgacctgaa aacggcttaa ggagtcacaa tggaacaggt
121 tgtcattgtc gatgcaattc gcaccccgat gggccgttcg aagggcggtg cttttcgtaa
181 cgtgcgtgca gaagatctct ccgctcattt aatgcgtagc ttgctggcgc gtaacccggc
241 gctggaagcg gcggccctcg acgatattta ctggggttgt gtgcagcaga cgctggagca
301 gggttttaat atcgcccgta acgcggcgct gctggcagaa gtaccacact ctgtcccggc
361 ggttaccgtt aatcgcttgt gtggttcatc catgcaggca ctgcatgacg cagcacgaat
421 gatcatgact ggcgatgcgc aggcatgtct ggttggcggc gtggagcata tgggccatgt
481 gccgatgagt cacggcgtcg attttcaccc cggcctgagc cgcaatgtcg ccaaagcggc
541 gggcatgatg ggcttaacgg cagaaatgct ggcgcgtatg cacggtatca gccgtgaaat
601 gcaggatgcc tttgccgcgc ggtcacacgc ccgcgcctgg gccgccacgc agtcggccgc
661 atttaaaaat gaaatcatcc cgaccggtgg tcacgatgcc gacggcgtcc tgaagcagtt
721 taattacgac gaagtgattc gcccggaaac caccgtggaa gccctcgcca cgctgcgtcc
781 ggcgtttgat ccagtaaacg gtatggtaac ggcgggcaca tcttctgcac tttccgatgg
841 cgcagctgcc atgctggtga tgagtgaaag ccgcgcccat gaattaggtc ttaagccgcg
901 cgctcgtgtg cgttcgatgg cggtcgttgg ttgtgaccca tcgattatgg gttacggccc
961 ggttccggcc tcgaaactgg cgctgaaaaa agcggggctt tctgccagcg atatcggcgt
1021 gtttgaaatg aacgaagcct ttgccgcgca gatcctgcca tgtattaaag atctgggact
1081 aattgagcag attgacgaga agatcaacct caacggtggc gcgatcgcgc tgggtcatcc
1141 gctgggttgt tccggtgcgc gtatcagcac cacgctgctg aatctgatgg aacgcaaaga
1201 cgttcagttt ggtctggcgg acggatgtgt atcgggtctg ggtcagggta ttgcgacggt
1261 gtttgagcgg gtttaaccaa cttgccggat tgcggccgtt gaacggcctt atccggtcta
1321 ccgatccggc accaatgtag gcctgataag acgcgccagc gtcgcatcag gcatcatgca
1381 ccaattgccg gatgcggcgt aaacgcctta tccggctacg gttcggacta tttgtaggca
1441 cgataagacg cgacagcgtc gcatcaggca tcatgcacca attgccggat gcggcgtaaa
1501 cgccttatcc ggcctacggt tcggactatt tgtaggcacg ataagacgcg acagcgtcgc
1561 atcgggcatg gtttgccaga tacgtcatag atct
//
LOCUS HSSCLS 1645 bp ds-DNA VRL 17-JUL-1990
DEFINITION Herpesvirus saimiri collagen-like protein mRNA, complete cds.
ACCESSION M31964
KEYWORDS collagen-like protein.
SOURCE Herpesvirus saimiri (strain 484-77), cDNA to mRNA.
ORGANISM Herpesvirus saimiri
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 1645)
AUTHORS Geck,P., Whitaker,S.A., Medveczky,M.M. and Medveczky,P.G.
TITLE Expression of collagen-like sequences by a tumor virus, herpesvirus
saimiri
JOURNAL J. Virol. 64, 3509-3515 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P. Medveczky, 07-FEB-1990.
FEATURES from to/span description
pept 101 400 collagen-like protein
pept 580 1224 ORF2
mRNA 56 1248 collagen-like protein mRNA
BASE COUNT 545 a 363 c 393 g 344 t
ORIGIN Map position 98-100.
1 gagtttccaa aatgtagtaa gctaacatat aaatatcagc agaaaataga gtaacacaag
61 aaacagctaa caagagcaac acaaacacgc ttgaaagaaa atggcaagcg agccaaacct
121 aagatatcca actgaagaaa caggagatcg aggacctcaa ggacctccag gacctccagg
181 accccaagga cctccaggac cccaaggacc tccaggaccc caaggacctc caggacccca
241 aggacctcca ggaccccaag gacctccagg accccaagga cctccaggac ccccaggacc
301 tccaggacct tcagactcca ggattgtttg taacaaactt attgcttgga atcataattt
361 tactcttatt aattatagtt gcgatcttac tggtgtctaa attagtagta aactaagaac
421 agagcaagca gtatagaaga gagtaattat gctagcttaa tttcatctag tagtcagttg
481 tgtagaactt tttattagta aatctagaag tgttaagaaa aaggtataga aagtagtgac
541 ctttattagc tgcatcttca tatctctaaa ttagtaagta tggaaaacca gagagaagaa
601 atagaactaa cagaaattcc agagactgag aaaaaaagaa cagctgaaga aaaattatta
661 tcatgctctg ctgaaactgc tgaagaaaag gtatcattat gctctgaaga aactacagac
721 acgtcaagca gttctagcag tgagcaaaca cctgctccta tagaagtaaa cgttaacatt
781 caaacgagca catacttgcc tcagaacgcg gcaacaaact tgaacagttt gtacacaagt
841 tttgaagacg caagggcaca aggaaaagga ctggtgcgac acaattcgga cgatttaaaa
901 agttttttag aaaaataccc accagattat agaaaaccta aacgggactt gtcagaatca
961 tgggatccag gtatgccgaa accaactttg cctcctaggc ctgcgaacct aggtgcaagc
1021 caggcaagta ctgtaagacg gcatgtaaga gaacaaaatt ttaaacagct tagagaacgt
1081 aaagctaatg aagggaaaat agtaaaagat ttgaaaagac ttgagtacaa agttaatata
1141 attctttgtt tagtagttgt tatattagct ataatattat tactaactgg gctaagtata
1201 ctgttcataa gaataaaaag ttaatttaac aaagaattta acaaatatgt gtgtgcatgg
1261 tcattaaaag aagcgggccc caagcagtgg gcgctgggca gctggaaaaa tgtcctgggc
1321 gtggctagag cgagctcggg gggcaaagcg agcagcgagc ggctgcggag ggagggggaa
1381 gggattgagc tgccccgccc ctctgggctg agattccaga gcccctgttg tgtccccagg
1441 ccaccgcagg cccggggctg tgcccctgtg cttggagcca gtccccagag ccctggctag
1501 gccgaggcgc tgctagagta agggtcccac aactgttcag gcccccccag ggggagagcc
1561 cagcgcagcg ctctgccggg tagggggggg gtagctggct ccccccccac ctctgcgtga
1621 agcagagtgc atgcagcccc tgcag
//
LOCUS MUSAGP1A 777 bp ss-mRNA ROD 17-JUL-1990
DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA.
ACCESSION M34648 J05488
KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein;
non-specific immunosuppressant; transmembrane protein;
tyrosine kinase.
SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-1A.
ORGANISM Mus caroli
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 777)
AUTHORS Prowse,K.R. and Baumann,H.
TITLE Molecular characterization and acute-phase expression of the
multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
Differences in glucocorticoid stimulation and regulatory elements
between the rat and mouse AGP genes
JOURNAL J. Biol. Chem. 265, 10201-10209 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.R.Prowse, 18-MAY-1990.
FEATURES from to/span description
pept 39 662 alpha-1-acid glycoprotein (AGP) precursor
sigp 39 92 alpha-1-acid glycoprotein (AGP) signal peptide
matp 93 659 alpha-1-acid glycoprotein (AGP)
signal 747 752 AGP poly-A signal
BASE COUNT 225 a 189 c 186 g 177 t
ORIGIN Chromosome 4.
1 atcttttcca agccctggtg cctctgagtg tcctaagcat ggcactgcac atgattcttg
61 tcatgttgag cctcctgccg ctgttggaag ctcagaaccc agaacatgtc aacatcacta
121 taggcgaacc tatcaccaat gagaccctga gctggctctc tgacaaatgg tttttcattg
181 gtgcggctgt cctaaaccct gattaccggc aggaaattca aaagatgcag atggtatttt
241 ttaacattac ccccaacttg ataaatgaca cgatggagct tcgagagtat cacaccatag
301 atgaccactg tgtctataac tctacacatc taggaatcca gagagagaat gggaccctct
361 ccaagtatgt aggaggagta aaaatctttg cagacctgat agtgctgagg aaacatgggg
421 ccttcatgct tgcctttgac ctgaaggatg agaagaaacg gggactgtcc ctcaatgcca
481 aaaggccaga tatcaccccg gagctgcggg aagtattcca gaaggctgtc aaacacgtgg
541 gcatggatga atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg
601 agaagaagca gcttgagctg gagaaggaga ccaagaaaga tcctgaggaa ggccaggcat
661 gaactcagct ctctgaactc catgggctgt ccccaggctc accaaacccc accccctcct
721 gtgcactttg attctgtctc tgcaacaata aaggtttgct gacacagtca aaaaccg
//
LOCUS MUSAGP1X 778 bp ss-mRNA ROD 17-JUL-1990
DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA.
ACCESSION M34647 J05488
KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein;
non-specific immunosuppressant; transmembrane protein;
tyrosine kinase.
SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-1X.
ORGANISM Mus caroli
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 778)
AUTHORS Prowse,K.R. and Baumann,H.
TITLE Molecular characterization and acute-phase expression of the
multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
Differences in glucocorticoid stimulation and regulatory elements
between the rat and mouse AGP genes
JOURNAL J. Biol. Chem. 265, 10201-10209 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.R.Prowse, 18-MAY-1990.
FEATURES from to/span description
pept 39 662 alpha-1-acid glycoprotein (AGP) precursor
sigp 39 92 alpha-1-acid glycoprotein (AGP) signal peptide
matp 93 659 alpha-1-acid glycoprotein (AGP)
signal 747 752 AGP poly-A signal
BASE COUNT 224 a 191 c 187 g 176 t
ORIGIN Chromosome 4.
1 atctcttcca ggccctggtg cctctgagtg tcctaagcat ggcactgcac atgattcttg
61 tcatgttgag cctcctgccg ctgttggaag ctcagaaccc agaacatgtc aacatcacta
121 taggcgaacc tatcaccaat gagaccctag gctggctatc tgacaaatgg tttttcattg
181 gtgcggctgt cctaaaccct gattaccggc aggaaattca aaagatgcag atggtatttt
241 ttaacattac ccccaacttg ataaatgaca cgatggagct tcgagagtat cacaccatag
301 atgaccactg tgtctataac tccacacatc taggaatcca gagagagaat gggaccctct
361 ccaagtatgt aggaggagta aaaatctttg cagacctgat agtgctgagg aaacatgggg
421 ccttcatgct tgcctttgac ctgaaggatg agaagaaacg gggactgtcc ctcaatgcca
481 aaaggccaga tatcaccccg gagctgcggg aagtattcca gaaggctgtc aaacacgtgg
541 gcatggatga atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg
601 agaagaagca gcttgagctg gagaaggaga ccaagaaaga tcctgaggaa ggccaggcat
661 gaactcagct ctctgaactc catgggctgt ccccaggctc accaaacccc accccctcct
721 gtgcactttg attctgtctc tgcaacaata aaggtttgct gacactgtca aaaacccg
//
LOCUS MUSAGP2AX 765 bp ss-mRNA ROD 17-JUL-1990
DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA.
ACCESSION M34649 J05488
KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein;
non-specific immunosuppressant; transmembrane protein;
tyrosine kinase.
SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clones lambda-[2A,2X].
ORGANISM Mus caroli
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 765)
AUTHORS Prowse,K.R. and Baumann,H.
TITLE Molecular characterization and acute-phase expression of the
multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
Differences in glucocorticoid stimulation and regulatory elements
between the rat and mouse AGP genes
JOURNAL J. Biol. Chem. 265, 10201-10209 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.R.Prowse, 18-MAY-1990.
FEATURES from to/span description
pept 29 652 alpha-1-acid glycoprotein (AGP) precursor
sigp 29 82 alpha-1-acid glycoprotein (AGP) signal peptide
matp 83 649 alpha-1-acid glycoprotein (AGP)
signal 735 740 AGP poly-A signal
BASE COUNT 226 a 185 c 185 g 169 t
ORIGIN Chromosome 4.
1 gggcctggtg cctctgagtg ccctcagcat ggcactgcac acggttctta tcatgttgag
61 cctcctgccg atgttggaag ctcagaaccc agaacatgcc aacatcacta taggcgaacc
121 tatcaccaat gagaccctag gctggctctc tgacaaatgg tttttcatgg gcgcagcttt
181 cagaaaactc gagtacaggc aggcaattca aatgatgcag actgaatttt tttaccttac
241 caccaacttg ataaacgaca caatagagct tcgggagtct caaacaatag gtgaccagtg
301 tgtctataac tccacccatc taggattcca gagagaaaat gggaccttct ccaagtatga
361 aggaggagta gaaacctttg cccacctgat agtgctaagg aaacatgggg ccttcatgct
421 tgcctttgat ctcaatgatg agaagaaacg gggactgtcc ctctatgcca aaaggccaga
481 tatgaccctg gagctacggg aagtattcca gaaggctgtc aaacacgtgg gcatggatga
541 atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg agaagaagca
601 gcttgagttg gggaaggaga ccaagaaaga tcctgaggaa ggccaggcat gaactcagct
661 ctgaactcca tgggctatcc ccaggctcac caaaccccac cccctcctgt gcactttgat
721 tctgtctctg caacaataaa gatttgctga aagagtcaaa aaccg
//
LOCUS MUSAGP3X 766 bp ss-mRNA ROD 17-JUL-1990
DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA.
ACCESSION M34646 J05488
KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein;
non-specific immunosuppressant; transmembrane protein;
tyrosine kinase.
SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-3X.
ORGANISM Mus caroli
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 766)
AUTHORS Prowse,K.R. and Baumann,H.
TITLE Molecular characterization and acute-phase expression of the
multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
Differences in glucocorticoid stimulation and regulatory elements
between the rat and mouse AGP genes
JOURNAL J. Biol. Chem. 265, 10201-10209 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.R.Prowse, 18-MAY-1990.
FEATURES from to/span description
pept 31 654 alpha-1-acid glycoprotein (AGP) precursor
sigp 31 84 alpha-1-acid glycoprotein (AGP) signal peptide
matp 85 651 alpha-1-acid glycoprotein (AGP)
signal 737 742 AGP poly-A signal
BASE COUNT 228 a 186 c 185 g 167 t
ORIGIN Chromosome 4.
1 cggggcctgg tgcctctgag tgccctcagc atggcactgc acacggttct tatcatgttg
61 agcctcctgc cgatgttgga agctcagaac ccagaacatg ccaacatcac tataggcgaa
121 cctatcacca atgagaccct aggctggctc tctgacaaat ggtttttcat gggcgcacgt
181 ttcagaaaac tcgagtacag gcaggcaatt caaatgatgc agactgaatt tttttacctt
241 accaccaact tgataaacga cacaatagag cttcgggagt ctcaaacaat aggtgaccag
301 tgtgtctata actccaccca tctaggattc cagagagaaa atgggacctt ctccaagtat
361 gaaggaggag tagaaacctt tgcccacctg atagtgctaa ggaaacatgg ggccttcatg
421 cttgcctttg acctcaatga tgagaagaaa cggggactgt ccctctatgc caaaaggcca
481 gatatgaccc tggagctacg ggaagtattc cagaaggctg tcaaacacgt gggcatggat
541 gaatcagaaa tcatatttgt cgactggaaa aaggataagt gtggtcagca ggagaagaag
601 cagcttgagt tggggaagga gaccaagaaa gatcctgagg aaggccaggc atgaactcag
661 ctctgaactc catgggctat ccccaggctc accaaacccc accccctcct gtgcactttg
721 attctgtctc tgcaacaata aagatttgct gaaagaaaaa aaaccg
//
LOCUS MUSAGP7A 1348 bp ds-DNA ROD 17-JUL-1990
DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) gene 7, 5' flank and
exon 1.
ACCESSION M34645 J05488
KEYWORDS alpha-1-acid glycoprotein; transmembrane protein; tyrosine kinase.
SOURCE M.caroli adult hepatocyte DNA, clones lambda-[11B,27A].
ORGANISM Mus caroli
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1348)
AUTHORS Prowse,K.R. and Baumann,H.
TITLE Molecular characterization and acute-phase expression of the
multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
Differences in glucocorticoid stimulation and regulatory elements
between the rat and mouse AGP genes
JOURNAL J. Biol. Chem. 265, 10201-10209 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.R.Prowse, 18-MAY-1990.
FEATURES from to/span description
pept 1232 / 1348 alpha-1-acid glycoprotein (AGP), exon 1
mRNA 1194 > 1348 AGP mRNA
signal 1161 1166 TATA box
BASE COUNT 312 a 316 c 356 g 364 t
ORIGIN Chromosome 4.
1 gaattcacac aggccttgct catctggcct ctgcaggcaa aggacaggag aagcagggga
61 ggcacatgtg agcagaagcc tgggcagact ctggaatggt atgcagcaag taccctgggg
121 agaccttagc tggatgtcca cttccctgcc tgcacccttg cccttagaat ccacaggctc
181 tggatacaca gcacagccac aggaagcttt ggggatttct tttccctatc ttaagtttat
241 tgttttcttc atttggttgt ataacaaaaa gagattttta aatttgctaa tgggtgcaag
301 tgtgagtgtg tgtgtgtgtg tgtgtgtgtt tctgtgtgtg tgtgtgtgtg tgtgtgtgtg
361 tgtgtgtgtg tgtgtgtgtg agagagagag agagagagga agggacagag tgaaatagat
421 catgacactg tgagcatgtt ctaacacgtt tgctgtattg atgttttgaa gcagctcact
481 gtgatctcaa gctcccagca atgtctctgt gcctcagcgg gagtcaccag gcctgtccct
541 gcaatacttc tagagattgt tgattatgta gacaaagatc aggtaggaac tagaaccaga
601 tattgactgg atgctgggca cagaccctgg gttgggcata cactgtccta ggcagatcct
661 ttcctgctgt agatactggg agctttgctg aactcgatgg tcaactcaga ttcacccctt
721 tttgggcact tggctgcctt caggctgggt aggagtccgc aggaacatca cactcctttg
781 aaaaccatcc atctttattc ttggccctca acttgagccc ctaagtgtct tctatgttca
841 ctatgaaccc tgacctggga ccccttctta tcacgcttcg ggggggtgga gttgatgtag
901 tgtaggtttc actcctgtta gacagcttca tgggataaga gagaaagggg accgctgtct
961 gggacctaag tatcatcagg ctagcccttt acctaccttg accatgaatc agccactctg
1021 gtgtaggcag aagccgactg agggagctgc acaaacctgg cttgagggaa catttggggc
1081 aagacattta tcaagtgctg gtgactttgt ggcactgctc taacgccctg ctgcagtcac
1141 atgccctccc cacatcctgt tataaaagtc actgcaccct ccaggcacca gttatctctt
1201 cccagcactg gtgcctctga gtgccgtcag catggagcta cacacagttc ttgtgatgtt
1261 gagcctcctg ctgctgttgg aagctcagaa cccagaacat gccaacatca acataggcaa
1321 acctatcaac aatgagaccc tgagctgg
//
LOCUS MUSAGP8A 2094 bp ds-DNA ROD 17-JUL-1990
DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) gene 8, 5' flank and
exon 1.
ACCESSION M34644 J05488
KEYWORDS alpha-1-acid glycoprotein; transmembrane protein; tyrosine kinase.
SOURCE M.caroli adult hepatocyte DNA, clones lambda-[15A,11B].
ORGANISM Mus caroli
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2094)
AUTHORS Prowse,K.R. and Baumann,H.
TITLE Molecular characterization and acute-phase expression of the
multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
Differences in glucocorticoid stimulation and regulatory elements
between the rat and mouse AGP genes
JOURNAL J. Biol. Chem. 265, 10201-10209 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.R.Prowse, 18-MAY-1990.
FEATURES from to/span description
pept 1978 / 2094 alpha-1-acid glycoprotein (AGP), exon 1
mRNA 1940 > 2094 AGP mRNA
signal 1907 1912 TATA box
BASE COUNT 554 a 502 c 464 g 574 t
ORIGIN Chromosome 4.
1 gaattcacac aggccttgct catctggcct ctgcaggcaa aggacaggag aagcagggga
61 ggcacatgtg agcagaagcc tgggcagact ctggaatggt atgcagcaag taccctgggg
121 agaccttagc tggtgtccac tccctgcctg cacccttgcc cttagaatcc acaggctctg
181 gatgcacagc acagccacag gaagctttgg agatatctaa tccgtttctt aagtttatta
241 ttttccatca tttgctttta taacaaaaaa tagatatttt atatctgcca gtgtgtgcaa
301 atatgtttat gtgcacgtgt acgtgtgcgc ctgtgtgtaa gagacataac agtgacaaag
361 aggcacaaac agagagatta tgacttctgg gcactgctaa cttgggacag aacatatatc
421 attggtgctt ttgggctcct catggttatg ctgtttctcc tctgggaaga gagaaacttt
481 cttaagactg tataaataca acaaaacaca tatgttttgg gtctccagct cagcaacttc
541 tcctaggccc tgtacagccc actccatggt cagggggctc tcaggaccat ctgtggcccc
601 ttccttttgc cctaggtggt cactgtcact gacctgatta atatcattac aattagtctg
661 atcctctaga acgccctgta aacaggctca cacaacccca cctctgggcg atctgctggc
721 aataatgttg tcgagctgca tttgtatggt caaatctatc agtctttcat ttctctctgt
781 tgctaagcaa caccatgctt gcatgaaatg tcacaaccca tttctcattt tcctggtggt
841 aggcacttaa gagcttggag tttgtggcta ctttgaataa tgccattaga acatttacac
901 acatgatact ggaggtgact catgttttgt tttcctttct gttggtagac taatttttaa
961 aggcaatatg ttgctctgac acttataaag aggggaagca taaaatacaa aatgccacag
1021 gataaaagaa ggttctcgtg aaattagaac acagtaagtg aatgacaatt atttatcatt
1081 aaaacaaaaa ccctgaatac acagaaacat gagaaattgt ggaggacaca ggaagtagat
1141 tactcaactg tgagcgtgtt ctcatatgtt tggtgctttt gatgttttga aagagctcac
1201 ggtgatctca agctcctagc aatgactccg agcctcagtg agagtcacca ggcctgtccc
1261 tagcaataca tttcgagagt gttgattatg tagacaaaaa tcaggctata tctatcaagc
1321 aggaaccaga accagatatt gactggttgc tgtgcacaga ccctggattg ggcacacact
1381 attctagaca gatcctttcc tgctgtagat acagggagct ttgctgaact acattttcaa
1441 ctcagattca tccctctttt ttgggcattt ggctgcctct agcgtgcgta ggggtcccca
1501 gaaacatcac actcctttgg aaactaatcc atctttgtcc ttggccctta acttgagccc
1561 ctaaatgact tctaagttca ctatgaaccc tgaccaggac cccttcatag tccccttggt
1621 gataaataga ataggtctca ctcctgctag gcagcttcat gggataagac agatatggga
1681 ccatgactgg gatctaagta ttatcaggct agcccagtat ccaccttgac catgaatcag
1741 ccactctggt gtaggtcagg agcctgtgtc ggggccggct ggctgaggga gctgcacaaa
1801 gctggcttga gggaacattt tgcgcaagac atttcccaag tgctggggag attgtgccag
1861 ggctctagag gccctgctgc agtcccatgc cctccccaca ttctgttata aaagccactg
1921 caccctccat ccaccaatta tctcttccaa gccctggtgc ctctgagtgt cctaagcatg
1981 gcactgcaca tgattcttgt catgttgagc ctcctgccgc tgttggaagc tcagaaccca
2041 gaacatgtca acatcactat aggcgaacct atcaccaatg agaccctagg ctgg
//
LOCUS MUSMK1 933 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse retinoic acid-responsive protein (MK-1) mRNA, complete cds.
ACCESSION M19662 M34326 J05473
KEYWORDS DNA-binding protein; regulatory protein;
retanoic acid-responsive protein; secreted protein.
SOURCE Mouse (strain 129/Sv) HM-1 embryonal carcinoma, cDNA to mRNA, clone
pO1-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 33 to 932)
AUTHORS Kadomatsu,K., Tomomura,M. and Muramatsu,T.
TITLE cDNA cloning and sequencing of a new gene intensely expressed in
early differentiation stages of embryonal carcinoma cells and in
mid-gestation period of mouse embryogenesis
JOURNAL Biochem. Biophys. Res. Commun. 151, 1312-1318 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 933)
AUTHORS Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T.
TITLE A retinoic acid responsive gene, MK, found in the teratocarcinoma
system: Heterogeneity of the transcript and the nature of the
translation
JOURNAL J. Biol. Chem. 265, 10765-10770 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Kadomatsu, 08-JUL-1988. Draft entry and computer-readable
sequence for [2] kindly submitted by M.Tomomura and T.Muramatsu,
16-MAY-1990.
The expression of MK-1 mRNA increases in the early stages of
retinoic acid-induced differentiation of embryonal carcinoma cells.
The putative peptide product may act as a DNA-binding protein
regulating gene expression.
FEATURES from to/span description
pept 249 671 retanoic acid-responsive protein (MK-1)
precursor
sigp 249 314 retanoic acid-responsive protein signal
peptide
matp 315 668 retanoic acid-responsive protein
mRNA < 1 910 MK-1 mRNA
BASE COUNT 216 a 272 c 270 g 175 t
ORIGIN
1 gagtgagcgc acgcagtggc tgtggcccca gtcccttcag gcggctgctc tgccaccaag
61 ggggctgagg tgggggtggg ggtacgctga gacatcggtt ccaagtcctc cctccgtctc
121 ccccttgtcg gtccgacgtt ttgggcctgg aaagtgggac aagtcagtca agggtgggag
181 gtccttcccg cggttcctag cggagaagag actaggcgag aaactctaac ccaggtttta
241 cccctaggat gcagcaccga ggcttcttcc ttctcgccct tcttgccctc ttggtggtca
301 cgtccgcggt ggccaaaaaa aaagagaagg tgaagaaggg cagcgagtgt tcggagtgga
361 cctgggggcc ctgcaccccc agcagcaagg actgcggcat gggcttccgc gagggtacct
421 gtggggccca gacccagcgc gtccattgca aggtgccctg caactggaag aaggaatttg
481 gagccgactg caaatacaag tttgagagct ggggggcgtg tgatgggagc actggcacca
541 aagcccgcca agggaccctg aagaaggcgc ggtacaatgc ccagtgccag gagaccatcc
601 gcgtgactaa gccctgcacc tccaagacca agtcaaagac caaagccaag aaaggaaaag
661 gaaaggacta agtcaggagg ccagagagcc tccggcctcg cctggagcct gaacggagcc
721 ctcctctccc acaggcccaa gatataaccc accagtgcct tttgtcttcc tgtcagctct
781 gtcaatcacg cctgtcctct cacgcccaca ccaagtgccc aaagtgggga gggacaagag
841 attctggaaa gtgagcctcc ccataccctc ttttgttctc cccaccctga tacttgttat
901 taagaaatga ataaaataaa ctcacttttt tcc
//
LOCUS MUSMK2 730 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse retinoic acid-responsive protein (MK-2) mRNA, complete cds.
ACCESSION M34327 J05473
KEYWORDS DNA-binding protein; regulatory protein;
retanoic acid-responsive protein; secreted protein.
SOURCE Mouse HM-1 embryonal carcinoma, cDNA to mRNA, clone pB2-25.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 730)
AUTHORS Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T.
TITLE A retinoic acid responsive gene, MK, found in the teratocarcinoma
system: Heterogeneity of the transcript and the nature of the
translation
JOURNAL J. Biol. Chem. 265, 10765-10770 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Tomomura and T.Muramatsu, 16-MAY-1990.
FEATURES from to/span description
pept 44 466 retanoic acid-responsive protein (MK-2)
precursor
sigp 44 109 retanoic acid-responsive protein signal
peptide
matp 110 463 retanoic acid-responsive protein
mRNA < 1 705 MK-2 mRNA
BASE COUNT 186 a 211 c 206 g 127 t
ORIGIN
1 caggccggag cgggagggag cgaagcatcg agcagtgagc gagatgcagc accgaggctt
61 cttccttctc gcccttcttg ccctcttggt ggtcacgtcc gcggtggcca aaaaaaaaga
121 gaaggtgaag aagggcagcg agtgttcgga gtggacctgg gggccctgca cccccagcag
181 caaggactgc ggcatgggct tccgcgaggg tacctgtggg gcccagaccc agcgcgtcca
241 ttgcaaggtg ccctgcaact ggaagaagga atttggagcc gactgcaaat acaagtttga
301 gagctggggg gcgtgtgatg ggagcactgg caccaaagcc cgccaaggga ccctgaagaa
361 ggcgcggtac aatgcccagt gccaggagac catccgcgtg actaagccct gcacctccaa
421 gaccaagtca aagaccaaag ccaagaaagg aaaaggaaag gactaagtca ggaggccaga
481 gagcctccgg cctcgcctgg agcctgaacg gagccctcct ctcccacagg cccaagatat
541 aacccaccag tgccttttgt cttcctgtca gctctgtcaa tcacgcctgt cctctcacgc
601 ccacaccaag tgcccaaagt ggggagggac aagagattct ggaaagtgag cctccccata
661 ccctcttttg ttctccccac cctgatactt gttattaaga aatgaataaa ataaactcac
721 ttttttccaa
//
LOCUS MUSMK3 1039 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse retinoic acid-responsive protein (MK-3) mRNA, complete cds.
ACCESSION M34328 J05473
KEYWORDS DNA-binding protein; regulatory protein;
retanoic acid-responsive protein; secreted protein.
SOURCE Mouse HM-1 embryonal carcinoma, cDNA to mRNA, clones pB3-30 and
FK-3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1039)
AUTHORS Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T.
TITLE A retinoic acid responsive gene, MK, found in the teratocarcinoma
system: Heterogeneity of the transcript and the nature of the
translation
JOURNAL J. Biol. Chem. 265, 10765-10770 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Tomomura and T.Muramatsu, 16-MAY-1990.
FEATURES from to/span description
pept 355 777 retanoic acid-responsive protein (MK-3)
precursor
sigp 355 420 retanoic acid-responsive protein signal
peptide
matp 421 774 retanoic acid-responsive protein
mRNA < 1 1016 MK-3 mRNA
BASE COUNT 278 a 292 c 302 g 167 t
ORIGIN
1 cccaggtacc cagctccctg ccacatcaga gacccttctt gcactctgag tgaactgatt
61 aaaaaaaaaa aaaaaaaaaa aaaaccaagc cggaggtgag ccgggcctcg aagggaaggt
121 tcgcgggtgc ggtggccgcc ccgagcctgt gacaccagga catactcccg gggcccgcgg
181 tgggcaagcg aagtggtgac ctgagagctg acaggctgcg agagggaaaa gtatagacag
241 gcctagacca ggggaagggg aggggataga gagctgggcc tgctacgagg ggacctgagc
301 cagaagcgca ctggtaaaac cgaactccag gaccagagac ccagagatca gaggatgcag
361 caccgaggct tcttccttct cgcccttctt gccctcttgg tggtcacgtc cgcggtggcc
421 aaaaaaaaag agaaggtgaa gaagggcagc gagtgttcgg agtggacctg ggggccctgc
481 acccccagca gcaaggactg cggcatgggc ttccgcgagg gtacctgtgg ggcccagacc
541 cagcgcgtcc attgcaaggt gccctgcaac tggaagaagg aatttggagc cgactgcaaa
601 tacaagtttg agagctgggg ggcgtgtgat gggagcactg gcaccaaagc ccgccaaggg
661 accctgaaga aggcgcggta caatgcccag tgccaggaga ccatccgcgt gactaagccc
721 tgcacctcca agaccaagtc aaagaccaaa gccaagaaag gaaaaggaaa ggactaagtc
781 aggaggccag agagcctccg gcctcgcctg gagcctgaac ggagccctcc tctcccacag
841 gcccaagata taacccacca gtgccttttg tcttcctgtc agctctgtca atcacgcctg
901 tcctctcacg cccacaccaa gtgcccaaag tggggaggga caagagattc tggaaagtga
961 gcctccccat accctctttt gttctcccca ccctgatact tgttattaag aaatgaataa
1021 aataaactca cttttttcc
//
LOCUS YSCVPS33P 2695 bp ds-DNA PLN 17-JUL-1990
DEFINITION S.cerevisiae vacuolar biogenesis and vacuolar protein sorting
protein (VPS33/SLP1), complete cds.
ACCESSION M34638
KEYWORDS vacular biogenesis protein; vacuolar protein sorting protein.
SOURCE S.cerevisiae (strain GRF88) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2695)
AUTHORS Banta,L.M., Vida,T.A., Herman,P.K. and Emr,S.D.
TITLE Characterization of the yeast Vps33p, a protein required for
vacuolar protein sorting and vacuole biogenesis
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.M.Banta, 18-MAY-1990.
Author address: L.M.Banta
California Insitute of Technology
147-75 Div of Biology
Pasadena, CA 91125
FEATURES from to/span description
pept 401 2476 Vps33p protein
signal 245 252 TATA box
binding 480 498 ATP-binding domain A (put.)
binding 661 679 ATP-binding domain B (put.)
BASE COUNT 916 a 445 c 522 g 812 t
ORIGIN
1 gtcaagaagc gggcttggct gccataatgg aagatttaga tgtcgcaaaa gtggaattac
61 gtcattttga gaaagctttt aaaggaattg ctaggggcat tactccagaa atgctctctt
121 attatgaaga gtttgctcta agaagcggtt catcttcgta agcttgttca tagtcaattc
181 ttttcctttg tgtgctcaat aatagtagat agaaattata ctgaactccg gtcattttgt
241 ataatatatt aatcacttca cacgaacata cataaataaa atatcataaa ggttagcaaa
301 ttggaactag ttatatgtta attagttaaa agatagaaaa ttcgagaaag gaagaaaaag
361 ctgatattgc ccatctccaa ctttatcaaa tcatttcacg atgaatagat tttggaatac
421 taagaaattt tcattaacaa atgccgatgg actatgtgct accttaaatg agatatctca
481 aaatgatgaa gttcttgtgg ttcaaccaag tgtattgcca gtactcaata gtttgctaac
541 tttccaagat ttgactcaat caactcctgt aaggaaaatt acgttactcg atgatcagct
601 aagtgacgat ttaccgagtg ccttaggcag cgttccgcaa atggatctta tttttcttat
661 tgatgtcaga acatctctcc gactccctcc acaactgctt gatgctgctc aaaagcacaa
721 tttatcatct ttgcatataa tatactgtcg atggaaaccg tctttccaaa atactttgga
781 ggatacagag caatggcaaa aggatggttt cgatttgaat tcaaaaaaaa cacatttccc
841 taacgtcatt gaatctcagt taaaggagct atcgaacgaa tatacccttt acccttggga
901 tctcttgccc ttcccacaga ttgatgaaaa tgttctattg actcattccc tttataacat
961 ggaaaatgta aacatgtatt atcccaactt acgttctttg cagagtgcca cagagtcaat
1021 actggttgat gatatggtca attcgttgca gagcttgatt tttgaaacta atagtatcat
1081 aacaaatgtt gtgtcgatag gtaatctgtc taagagatgt agccatcttt tgaagaaacg
1141 aatcgatgag catcaaacag agaatgattt attcatcaag ggtacgcttt atggtgaacg
1201 aaccaactgt ggactagaaa tggacttgat tatcttggaa aggaataccg atcctataac
1261 gccattgttg acacaactta cgtatgcagg aatactagat gatctatatg aattcaattc
1321 tggcataaag ataaaggaga aagacatgaa cttcaattat aaggaagata aaatatggaa
1381 tgatttgaaa tttttaaatt ttgggtcgat tgggccgcag ttaaataaat tggcaaagga
1441 actacaaacg caatatgata caaggcataa agccgagagc gtacatgaaa tcaaagaatt
1501 cgttgattcc ttaggttctt tgcaacaaag gcaagctttt ttgaaaaatc acacaacctt
1561 atcatccgac gttttgaaag tggtagagac tgaagagtac ggatctttca ataaaatctt
1621 agagttagag ctggaaattt tgatgggaaa tacacttaat aacgacattg aagatattat
1681 actcgagttg cagtaccagt acgaggttga tcaaaagaag attctcagat taatctgttt
1741 attgtctctt tgtaaaaatt cacttcgaga aaaggattat gaatatctaa gaacctttat
1801 gatcgactct tggggcattg aaaaatgctt tcaacttgaa tcattggctg agttaggatt
1861 tttcactagc aaaacgggaa aaactgattt gcatattaca acaagtaagt caacaagatt
1921 acagaaagaa taccgttata tttcacaatg gttcaataca gtacccatag aagacgagca
1981 tgctgccgat aaaatcacaa atgagaacga tgacttctcg gaagccactt ttgcttacag
2041 tggtgtagtg cccttgacaa tgagactggt tcagatgtta tatgataggt ctatcttgtt
2101 ccataattat tcctcgcagc agccttttat actgtcaaga gaacctagag tttctcaaac
2161 ggaggattta attgaacagt tatatggaga ctcacatgcg atcgaagaga gtatatgggt
2221 cccgggaacc attacaaaaa agatcaatgc aagcatcaag agcaataata gacggtccat
2281 agacggatct aatgggacat ttcatgctgc agaggatatt gcactcgtag tattcctcgg
2341 aggtgtaaca atgggtgaaa tagctataat gaagcatttg caaaaaatac taggtaaaaa
2401 aggtatcaat aaaaggttta tcatcatcgc cgatggcttg atcaatggca caaggatcat
2461 gaactctata tcttaattat tatatgatag atttgttaat tttttgtata tgcaaatgtg
2521 ctttttttca ccaaacggtt tgcaccaatc atacgagaga agtgttcggt gtttacggaa
2581 aagctagggg actaagaaaa attgaaaata aaggctgaca gcagtagaaa ccattgtgct
2641 ggcttagtga tttataagaa tggttaatta gttttgtatc ctttaatttt ctaga
//
LOCUS BOVCASA 1123 bp ss-mRNA MAM 17-JUL-1990
DEFINITION Bovine alpha-s1-casein mRNA, complete cds.
ACCESSION M33123
KEYWORDS alpha-s1-casein.
SOURCE Bovine (strain Holstein) lactating mammary gland, cDNA to mRNA,
clone p-alpha-s1 C228.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1123)
AUTHORS Nagao,M., Maki,M., Sasaki,R. and Chiba,R.
TITLE Isolation and sequence analysis of bovine alpha-s1-casein cDNA
clone
JOURNAL Agric. Biol. Chem. 48, 1663-1667 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 64 708 alpha-s1-casein precursor
sigp 64 108 alpha-s1-casein signal peptide
matp 109 705 alpha-s1-casein
mRNA < 1 1123 alpha-s1-casein mRNA
BASE COUNT 331 a 247 c 223 g 322 t
ORIGIN
1 tcacttcgac catcaaccca gcttgctgtt cttcccagtc ttgggttcaa gatcttgaca
61 accatgaaac ttctcatcct tacctgtctt gtggctgttg ctcttgccag gcccaaacat
121 cctatcaagc accaaggact ccctcaagaa gtcctcaatg aaaatttact caggtttttt
181 gtggcacctt ttccagaagt gtttggaaag gagaaggtca atgaactgag caaggatatt
241 gggagtgaat caactgagga tcaagccatg gaagatatta agcaaatgga agctgaaagc
301 atttcgtcaa gtgaggaaat tgttcccaat agtgttgagc agaagcacat tcaaaaggaa
361 gatgtgccct ctgagcgtta cctgggttat ctggaacagc ttctcagact gaaaaaatac
421 aaagtacccc agctggaaat tgttcccaat agtgctgagg aacgacttca cagtatgaaa
481 gagggaatcc atgcccaaca gaaagaacct atgataggag tgaatcagga actggcctac
541 ttctaccctg agcttttcag acaattctac cagctggatg cctatccatc tggtgcttgg
601 tattacgttc cactaggcac acaatacact gatgccccat cattctctga catccctaat
661 cccattggct ctgagaacag tgaaaagact actatgccac tgtggtgaag agtcaagtga
721 attctgaggg actccacagt tatggtcttt gatgggtctg aaaattccat gctctacatg
781 tcgcctcatc tacatgtcaa accattcatc caaaggcttc aactgctgtt ttagaacagg
841 gcaatctcaa actgaggcac tccttgatgc tctactgtat tttagatagt gtaacatcct
901 taagtgaaat tgtcctaaca gcttgttacc taaattccag tagtatcatg ctggtataaa
961 ggccactgag tcaaagggaa ttaaagtctt cattaaattt ctgtatggaa aatgttttaa
1021 aagcctttga atcacttctc ctgtaagtgc catcatatca aataattgtg tgcattaact
1081 gagattttgt ctttcttctt ttcaataaat tacattttaa ggc
//
LOCUS BPHINTXIS 1741 bp ds-DNA PHG 17-JUL-1990
DEFINITION Bacteriophage phi-11 integrase (int) and excisionase (xis) genes,
complete cds.
ACCESSION M34832
KEYWORDS excisionase; integrase.
SOURCE Bacteriophage phi-11 DNA.
ORGANISM Bacteriophage phi-11
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 1741)
AUTHORS Ye,Z.-H., Buranen,S.L. and Lee,C.Y.
TITLE Sequence analysis and comparison of int and xis genes from
Staphylococcal bacteriophages l54a and phi-11
JOURNAL J. Bacteriol. 172, 2568-2575 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 267 67 (c) excisionase (xis)
pept 379 1425 integrase (int)
BASE COUNT 650 a 264 c 263 g 564 t
ORIGIN
1 cctatgccag caccagtgaa actctattat gcatggtatt aaaatcgaag agtacaattc
61 gataattcaa acattatttg acgaaatagc taagctgtct aatgtatata agtctcttaa
121 taaacagtaa gcaaaatcgg attcttcatt acataccgaa tattcatcat aaacactgac
181 tgcatcttct aagacatttt ttaaaattct aatgtcttca ttcgttaaaa ctaattcatt
241 gaaattatga ttgtttttaa atgtcataac atcacctact ttttatttta ttatatcaca
301 tttagtacct agtactaaat ttcgggtagc ccgcctaccc ttattatttt ttgccaattt
361 tgaggaggga gaagcaaaat gccagtatat aaggatgata atacaggtaa atggtatttt
421 tccattagat ataaagatgt atacggtaat aacaaacgaa aaatgaagcg tgggtttgaa
481 cgtaagaaag atgccaaact agctgaaagc gaatttatac aaaatgttaa atatggatac
541 tcggacaatc aaccctttga atatatattt tttgatcgtt taaaaaatga aaatctttct
601 gcacgctcaa tagaaaagcg aactacagaa tataatactc acataaaaga aaggttcgga
661 aatatcccta ttggcaaaat cactactacg caatgtactg ctttcaggaa ttatttgtta
721 aacgatgcag gtctttctgt tgactatgca cgatctgtgt gggcaggttt taaagcagtt
781 atcaattacg ccaaaaagca ttacaagctc ttatacgacc ccacattatc ggtaactcct
841 attcccagaa caaaaccaca agctaaattt atcactcgtg aagaatttga tgaaaaagta
901 gaacaaatca caaatgatac ttctcgtcag ctaactagac tgttatttta ttctggtctt
961 agaataggag aagctttagc tttgcagtgg aaagattacg ataaaataaa aggcgaaatt
1021 gacgtaaata agaaaatcaa tttaagtaat agaaaaattg aatataatct aaaaaaagaa
1081 agctctaaag ggataatacc tgtaccaaat ttaattagag agatgcttaa aaacatgtat
1141 aatgaatctt ctaaaagata taaatatttt gacgaaaact attttatatt cgggggttta
1201 gaacctatta gatacgttac ttattcgtat cattttaaat ctgtattccc gaatctaaaa
1261 atacaccatt taagacactc gtacgctagc tatttaatta ataatggtgt agatatgtat
1321 ttattaatgg aattaatgag gcattctaac attacagaaa caattcaaac gtactctcat
1381 ttatatactg ataaaaaaca tcaagctatg agcatatttg attaaacggt atcaaattgg
1441 tatcaaataa caattaagga gtttataaaa tgcgtaataa caagcctaaa ataagtattc
1501 aaaacgaccc atgggaagtg aaatttatat acatttaaat ttcatgagac aataaacgtt
1561 gatttaatgc gtttttttgc cttttttatt ttccttattt tttctgtttt acaacaaaat
1621 ggtatcaaaa atggtatcat ttgtagttat tttagcttca catattaaaa caaccacact
1681 cctaaattaa taggtggtgt ggttttgttg gttgtgtggg gataaaaata accgcatcag
1741 t
//
LOCUS BSTNPRAS 3510 bp ds-DNA BCT 17-JUL-1990
DEFINITION B.stearothermophilus neutral protease (nprS), and transcriptional
activator (nprA) genes, complete cds.
ACCESSION M34237
KEYWORDS neutral protease; transcriptional activator.
SOURCE B.stearothermophilus (strain TELNE) DNA, clone pSP53.
ORGANISM Bacillus stearothermophilus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 3510)
AUTHORS Nishiya,Y. and Imanaka,T.
TITLE Cloning and nucleotide sequences of the neutral protease gene and
its transcriptional activator gene from Bacillus stearothermophilus
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Nishiya, 11-MAY-1990.
FEATURES from to/span description
pept 181 1401 transcriptional activator (nprA)
pept 1750 3405 neutral protease (nprS) precursor
sigp 1570 2274 neutral protease (nprS) signal and propeptide
matp 2275 3219 neutral protease (nprS)
BASE COUNT 1177 a 581 c 737 g 1015 t
ORIGIN
1 tacggtcttc agacatttct attcctatag cccaaatgag tagttccttt tggaggagaa
61 aatgtgtata atttttagta aatttatatt agtaaaaaat taagaaggag taggtattat
121 ttgaagattg gtgatcgctt aaaattttcc cgtatcaaac ataagttaac gcaagaggaa
181 gtggctgacg gaattatttc cgtatcatat ttatcaaaaa ttgaaaacaa tcaagtggtt
241 ccaagtgaag aagtgcttcg cctcctttgt caacggttgg gaatcaacaa tatcctgaaa
301 aatagacaag atgaattaac aagtaaattg ttattatggt acaaaacgat tacggataaa
361 aaccgacagg aagcagcccg gatgtacgag gaaatcaaac gaactttcga tgacgtccag
421 ggggcggaat ccatcgctta ctttctgttg tttgaaatgc gctatcactt gttattaaaa
481 gatattcata ctgtcgaagc gttgttgatc aaattaaggg aattgtatga cacctttgat
541 gatgtgatga agtattatta ttataaattt ttaggtctac tttactattg caaggaaaaa
601 tatgaagatg ctttggaata ttataaaaag gcggagcagc gatttcgaag ccaatcattt
661 gaaaaatggg aagaagctga tttgcattat ttactagcgc ttgtttatag ccggctctgg
721 agaatattag gctgtattaa ctatgcgcag catgctttag cgatttacca atccgaatac
781 gatttaaagc gaagcgctga atgccacatt ttacttggta tttgttacag aaggtacgga
841 gaagtagatc aagcgatcga atgctattca ttggcccata aaattgccca aatcattaat
901 gataccgaat tattaggtac gattgagcat aacctaggct acttaatgtc aatgaaacat
961 gagcattatg aagccattca gcattataag aagagtttgc tgtataagcg aaactcttca
1021 ttacaagcta gatttattac gttgttttct ctcatcaaag aatattatgt ttccaaaaac
1081 tataaaaaag cattagccaa tgtagaggaa agtttgcagc ttctcaagag ggaaaaagat
1141 gggatgacaa cgtattatga atattatctt catttcacag tttatcaata tttactatca
1201 gaagatattt cggaaaatga atttgaaaca tttatgaaag atcgagtgct cccttatttt
1261 caaaggttta aaaaatatga agatgttgca caatacgctg aatacttggc aatctattac
1321 gagaaacgtc ataagtataa actagcaagc aaattctata aaatgagtta tcaatttcta
1381 aaaaatatga taaatattta ggagggattt ttttgaaaaa gcttttatta ggaatcatga
1441 cgtttggtat tatgagttta cttgttctca ttggtagtga ccaagaacca aaatatgtgg
1501 caaaagacga acatccgcct ccaaccatca tcattgcagc gaaagatgaa catccaccag
1561 caacgattat ttgaagagga ataagcaaaa agacagctag ttttctagct gtcttttttc
1621 atgcatagga aaatgtgaaa aaaacgtagg gaattatcaa ctatatcaga ctctattttt
1681 cccaatacaa aatactgtaa aatattgtgt ttaatattct aaatacaaag aataaaggag
1741 gatgaaaaaa tgaaaaggaa aatgaaaatg aaattagtac gttttggtct tgcagcagga
1801 ctagcggccc aagtattttt tttaccttac aatgcgctgg cttcaacgga acacgttaca
1861 tggaaccaac aatttcaaac ccctcaattc atctccggtg atctgctgaa agtgaatggc
1921 acatccccag aagaactcgt ctatcaatat gttgaaaaaa acgaaaacaa gtttaaattt
1981 catgaaaacg ctaaggatac tctacaattg aaagaaaaga aaaatgataa ccttggtttt
2041 acgtttatgc gcttccaaca aacgtataaa gggattcctg tgtttggagc agtagtaact
2101 gcgcacgtga aagatggcac gctgacggcg ctatcaggga cactgattcc gaatttggac
2161 acgaaaggat ccttaaaaag cgggaagaaa ttgagtgaga aacaagcgcg tgacattgct
2221 gaaaaagatt tagtggcaaa tgtaacaaag gaagtaccgg aatatgaaca gggaaaagac
2281 accgagtttg ttgtttatgt caatggggac gaggcttctt tagcgtacgt tgtcaattta
2341 aactttttaa ctcctgaacc aggaaactgg ctgtatatca ttgatgccgt agacggaaaa
2401 attttaaata aatttaacca acttgacgcc gcaaaaccag gtgatgtgaa gtcgataaca
2461 ggaacatcaa ctgtcggagt gggaagagga gtacttggtg atcaaaaaaa tattaataca
2521 acctactcta cgtactacta tttacaagat aatacgcgtg gaaatgggat tttcacgtat
2581 gatgcgaaat accgtacgac attgccggga agcttatggg cagatgcaga taaccaattt
2641 tttgcgagct atgatgctcc agcggttgat gctcattatt acgctggtgt gacatatgac
2701 tactataaaa atgttcataa ccgtctcagt tacgacggaa ataatgcagc tattagatca
2761 tccgttcatt atagccaagg ctataataac gcattttgga acggttcgca aatggtgtat
2821 ggcgatggtg atggtcaaac atttattcca ctttctggtg gtattgatgt ggtcgcacat
2881 gagttaacgc atgcggtaac cgattataca gccggactca tttatcaaaa cgaatctggt
2941 gcaattaatg aggcaatatc tgatattttt ggaacgttag tcgaatttta cgctaacaaa
3001 aatccagatt gggaaattgg agaggatgtg tatacacctg gtatttcagg ggattcgctc
3061 cgttcgatgt ccgatccggc aaagtatggt gatccagatc actattcaaa gcgctataca
3121 ggcacgcaag ataatggcgg ggttcatatc aatagcggaa ttatcaacaa agccgcttat
3181 ttgattagcc aaggcggtac gcattacggt gtgagtgttg tcggaatcgg acgcgataaa
3241 ttggggaaaa ttttctatcg tgcattaacg caatatttaa caccaacgtc caactttagc
3301 caacttcgtg ctgccgctgt tcaatcagcc actgacttgt acggttcgac aagccaggaa
3361 gtcgcttctg tgaagcaggc ctttgatgcg gtaggggtga aataaagtgg tatctcatca
3421 gtgggggatt ttttcctcca ctgatgtttt gtttgtgatc ttttaatgat gtattggggt
3481 gcaaaatgcc caaaggctta taatgttgat
//
LOCUS HSEGP14 3347 bp ds-DNA VRL 17-JUL-1990
DEFINITION Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete
cds.
ACCESSION M34861
KEYWORDS glycoprotein 14.
SOURCE Equine herpesvirus type 1 DNA.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 3347)
AUTHORS Guo,P.
TITLE Characterization of the gene and an antigenic determinant of equine
herpesvirus type-1 glycoprotein 14 with homology to gB-equivalent
glycoproteins of other herpesviruses
JOURNAL Gene 87, 249-255 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 300 3239 glycoprotein 14 (gp14)
BASE COUNT 885 a 891 c 851 g 720 t
ORIGIN
1 tacaacggtt gaaacgtggt gtacgcatct caagagacta gctcgtttat gataactgcg
61 gctaaaggtg aattggtcaa ttagcgaagt ttcaaaggtt ttattgcttt gaagggagtg
121 acaggtgtga cggccacgca gcggctggcg tgaaatatat cggggagctc atcctagccg
181 ccgcagtatt ctcctcggtt ttccactgtg gagaggtgcc tcctgcgcgc agatcgtacc
241 tacccggact ccgcgccaca gtgctgcgtg agcggcattt acataaccta cgaggcgtca
301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg
361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct
421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa
481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct
541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt
601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac
661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc
721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg
781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct
841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat
901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga
961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata
1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt
1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg
1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg
1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt
1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc
1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag
1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact
1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg
1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tcttacttta
1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag
1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta
1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc
1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg
1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca
1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa
1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg
1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac
2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg
2101 cgtggtgtcc cctccaaaac aaagagcggc ccctatggaa cgaaatggtg aagattaccc
2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg
2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc
2281 gctcgatgga cagtaacact tgctactccc gcccccccgt aacatttaca attactaaga
2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg
2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag
2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg
2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc
2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac
2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg
2701 acaataccgc agtgattatc aggggatcgc cagctttttc aagggcctgg gtaaagtggg
2761 ggaggccgtg ggaacgctcg ttctcggcgc gcggcgctgt tgtttcaacc gtatctggaa
2821 tagcttgctt tttaaacaac ccatttgggg ggctagccat cggcctgctg gtaatcgccg
2881 gcctggtagc tgcgtttttt gcttacagat atgtaatgca gatccgcagt aaccccatga
2941 aagctctata ccccataaca acaaaggcct tgaaaaacaa agccaaaact tcctacggcc
3001 agaacgagga ggacgatggg agcgactttg atgaggccaa gcttgaagag gctcgcgaaa
3061 tgatcaaata catgtctatg gtttcggccc tggaaaagca ggaaaagaaa gctataaaga
3121 aaaacagtgg ggttggcctg atcgccagta acgtctcaaa gctggccctg cgaaggcgcg
3181 gtcccaaata tacccgactc caacagaacg ataccatgga aaatgaaaaa atggtttaaa
3241 catgtttaat aaatattatg acacgtactc aaagtgtgac ctcatatttg cataaccact
3301 tctagttccg gcccaaggat atttaagcct agtatctccg ccgaagg
//
LOCUS HUMHBGAA 1227 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human A-gamma-globin gene, 3' end.
ACCESSION M33200
KEYWORDS A-gamma-globin.
SOURCE Human (hereditary persistence of fetal hemoglobin individual II-1)
DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1227)
AUTHORS Gelinas,R.E., Rixon,M., Magis,W. and Stamatoyannopoulos,G.
TITLE Gamma gene promoter and enhancer structure in Seattle variant of
hereditary persistence of fetal hemoglobin
JOURNAL Blood 71, 1108-1112 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 3 A-gamma-globin (AA at 1)
/hgml_locus_uid="LK0092S" /nomgen="HBG1"
/map="11p15.5"
mut 794 794 t in wt; c in mutant
mut 970 970 c in wt; a in mutant
mut 1186 1186 a in wt; g in mutant
BASE COUNT 366 a 204 c 294 g 363 t
ORIGIN
1 tgagcctctt gcccatgatt cagagctttc aaggataggc tttattctgc aagcaataca
61 aataataaat ctattctgct gagagatcac acatgatttt cttcagctct tttttttaca
121 tctttttaaa tatatgagcc acaaagggtt tatattgagg gaagtgtgta tgtgtatttc
181 tgcatgcctg tttgtgtttg tggtgtgtgc atgctcctca tttattttta tatgagatgt
241 gcattttgtt gagcaaataa aagcagtaaa gacacttgta cacgggagtt ctgcaagtgg
301 gagtaaatgg tgtaggagaa atccggtggg aagaaagacc tctataggac aggacttctc
361 agaaacagat gttttggaag agatgggaaa aggttcagtg aagacctggg ggctggattg
421 attgcagctg agtagcaagg atggttctta atgaagggaa agtgttccaa gctttaggaa
481 ttcaaggttt agtcaggtgt agcaattcta ttttattagg aggaatacta tttctaatgg
541 cacttagctt ttcacagccc ttgtggatgc ctaagaaagt gaaattaatc ccatgccctc
601 aagtgtgcag attggtcaca gcatttcaag ggagagacct cattgtaaga ctctggggga
661 ggtggggact taggtgtaag aaatgaatca gcagaggctc acaagtcagc atgagcatgt
721 tatgtctgag aaacagacca gcactgtgag atcaaaatgt agtgggaaga atttgtacaa
781 cattaattgg aaggtttact taatggaatt tttgtatagt tggatgttag tgcatctcta
841 taagtaagag tttaatatga tggtgttacg gacctaatgt ttgtgtctcc tcaaaattca
901 catgctgaat ccccaactcc caactgacct tatctgtggg ggaggctttt gaaaagtaat
961 taggtttagc tgagctcata agagcagatc cccatcataa aattattttc cttatcagaa
1021 gcagagagac aagccatttc tctttcctcc cggtgaggac acagtgagaa gtccgccatc
1081 tgcaatccag gaagagaacc ctgaccacga gtcagccttc agaaatgtga gaaaaaactc
1141 tgttgttgaa gccacccagt cttttgtatt ttgttatagc accttacact gagtaaggca
1201 gatgaagaag gagaaaaaaa taagctt
//
LOCUS HUMHBQ1A 1114 bp ds-DNA PRI 17-JUL-1990
DEFINITION Human theta-1-globin gene, complete cds.
ACCESSION M33022
KEYWORDS theta-1-globin.
SOURCE Human black female with alpha-thal-2 heterozygosity white cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1114)
AUTHORS Gonzalez-Redondo,J.M., Han,I.S., Gu,Y.-C. and Huisman,T.H.J.
TITLE Nucleotide sequence of the human theta-1-globin gene
JOURNAL Biochem. Genet. 26, 207-211 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 359 453 theta-1-globin, exon 1
/hgml_locus_uid="LV0155X" /nomgen="HBQ1"
/map="16p13.3"
538 742 theta-1-globin, exon 2
852 980 theta-1-globin, exon 3
IVS 454 537 theta-1-globin intron A
IVS 743 851 theta-1-globin intron B
BASE COUNT 166 a 386 c 393 g 169 t
ORIGIN
1 atcccagtta ctcgggaggc tgaggcagga gaatcgtttg aacccgggag gcggaggttg
61 cagtgagccg gaatggcgcc actgcactca ccgcacccgg ccaatttttg tgtttttagt
121 agagactaaa taccatatag tgaacaccta agacgggggg ccttggatcc agggcgattc
181 agagggcccc ggtcggagct gtcggagatt gagcgcgcgc ggtcccggga tctccgacga
241 ggccctggac ccccgggcgg cgaagctgcg gcgcggcgcc ccctggaggc cgcgggaccc
301 ctggccggtc cgcgcaggcg cagcggggtc gcagggcgcg gcgggttcca gcggggggat
361 ggcgctgtcc gcggaggacc gggcgctggt gcgcgccctg tggaagaagc tgggcagcaa
421 cgtcggcgtc tacacgacag aggccctgga aaggtgcggc aggctgggcg cccccgcccc
481 caggggccct ccctccccaa gccccccgga cgcgcctcac ccacgttcct ctcgcaggac
541 cttcctggct ttccccgcca cgaagaccta cttctcccac ctggacctga gccccggctc
601 ctcacaagtc agagcccacg gccagaaggt ggcggacgcg ctgagcctcg ccgtggagcg
661 cctggacgac ctaccccacg cgctgtccgc gctgagccac ctgcacgcgt gccagctgcg
721 agtggacccg gccagcttcc aggtgagcgg ctgccgtgct gggcccctgt ccccgggagg
781 gccccggcgg ggtgggtgcg gggggcgtgc ggggcgggtg caggcgagtg agccttgagc
841 gctcgccgca gctcctgggc cactgcctgc tggtaaccct cgcccggcac taccccggag
901 acttcagccc cgcgctgcag gcgtcgctgg acaagttcct gagccacgtt atctcggcgc
961 tggtttccga gtaccgctga actgtgggtg ggtggccgcg ggatccccag gcgaccttcc
1021 ccgtgtttga gtaaagcctc tcccaggagc agccttcttg ccgtgctctc tcgaggtcag
1081 gacgcgagag gaaggcgccg cccctcccca agga
//
LOCUS HUMITIH1A 1149 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human inter-alpha-trypsin inhibitor heavy chain mRNA, partial cds.
ACCESSION M33033
KEYWORDS inter-alpha-trypsin inhibitor heavy chain.
SOURCE Human, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1149)
AUTHORS Salier,J.-P., Diarra-Mehrpour,M., Sesbouee,R., Bourguignon,J. and
Martin,J.-P.
TITLE Human inter-alpha-trypsin inhibitor: Isolation and characterization
of heavy (H) chain cDNA clones coding for a 383 amino-acid sequence
of the H chain
JOURNAL Biol. Chem. Hoppe-Seyler 369, 15-18 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 1149 inter-alpha-trypsin inhibitor heavy chain (AA
at 1) /hgml_locus_uid="LE0221G"
/nomgen="ITIH1" /map="3p21.2-p21.1"
BASE COUNT 332 a 292 c 253 g 272 t
ORIGIN
1 ggaggcacaa acatcaacga agcactccta cgggcaatct tcattttgaa tgaagccaat
61 aacttgggac tgttagaccc caactccgtc tcgctgatca ttttggtttc tgatggagat
121 ccaacagtgg gcgaactaaa actgtcaaaa attcagaaaa acgttaagga gaacatccaa
181 gacaatatct ccttgttcag tttgggcatg ggatttgatg tggactatga ttttttgaag
241 agactgtcca atgaaaacca tggaattgca caaaggattt atggaaacca ggacacgtct
301 tcccagctta agaaattcta caaccaggtc tccactccat tgctccggaa tgttcagttc
361 aactatcccc atacatcagt cacggacgtc actcaaaaca atttccataa ctactttgga
421 ggctcagaga ttgtggtggc aggaaaattt gaccctgcta aattggatca aatagagagc
481 gttatcacgg cgacttcggc taacacgcag ttagtcttgg agaccctggc ccagatggac
541 gacttgcagg attttctatc gaaagacaag catgcagatc ccgatttcac caggaaactg
601 tgggcctatc taaccatcaa ccaactgcta gctgaacgaa gcctggctcc tacagctgcc
661 gccaagagaa gaattacaag atcgatcctg cagatgtctc tagaccacca cattgtgact
721 ccgctgacct cgctggtgat cgagaacgag gctggggatg agcgcatgct ggcggatgcc
781 ccaccgcagg atccctcctg ctgctcaggg gccctgtatt acggcagcaa agtggttcca
841 gattccaccc cgtcttgggc caatccttca gcaacgcccg tgatctccat gctggcacaa
901 ggatctcagg tgctagagtc cacgccaccc ccacatgtga tgagagttga aaatgaccca
961 cattccatca tttatctacc aaaaagccaa aagaacattt gtttcaatat tgactcagaa
1021 cctggaaaaa tcctcgacct ggcttctgac ccagaatcag gaattgtagt caacggtcag
1081 cttgttggtg ccaagaagcc caacaatgga aaactaagca cctattttgg aaaactggga
1141 ttttatttc
//
LOCUS HUMPTHROM 327 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human thrombin mRNA, 5'end.
ACCESSION M33031
KEYWORDS serine protease; thrombin.
SOURCE Human, cDNA to mRNA, clone pIIH13.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 327)
AUTHORS MacGillivray,R.T.A., Irwin,D.M., Guinto,E.R. and Stone,J.C.
TITLE Recombinant genetic approaches to functional mapping of thrombin
JOURNAL Ann. N.Y. Acad. Sci. 485, 73-79 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 28 > 327 thrombin precursor /hgml_locus_uid="LD0134L"
/nomgen="F2" /map="11p11-q12"
sigp 28 156 thrombin signal peptide
matp 157 > 327 prothrombin
BASE COUNT 60 a 97 c 109 g 61 t
ORIGIN
1 ccgtagtgac ccaggagctg acacactatg gcccgcatcc gaggcttgca gctgcctggc
61 tgcctggccc tggctgccct gtgtagcctt gtgcacagcc agcatgtgtt cctggctcct
121 cagcaagcac ggtcgctgct ccagcgggtc cggcgagcca acaccttctt ggaggaggtg
181 cgcaagggca acctggagcg agagtgcgtg gaggagacgt gcagctacga ggaggccttc
241 gaggctctgg agtcctccac ggctacggat gtgttctggg ccaagtacac agcttgtgag
301 acagcgagga cgcctcgaga taagctt
//
LOCUS MUSCC3A 312 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse complement component C3 mRNA, partial cds.
ACCESSION M33032
KEYWORDS complement component C3.
SOURCE Mouse liver, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Fey,G.H., Wiebauer,K. and Domdey,H.
TITLE Amino acid sequences of mouse complement C3 derived from nucleotide
sequences of cloned cDNA
JOURNAL Ann. N.Y. Acad. Sci. 421, 307-312 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 complement component C3 precursor (AA at 1)
matp < 1 27 complement component C3-beta subunit (AA at 1)
matp 40 273 complement component C3-alpha subunit
matp 274 > 312 complement component C3-alpha' subunit
BASE COUNT 90 a 77 c 90 g 55 t
ORIGIN
1 gatcttgagt gcaccaagcc agcagcccgc cgccgtcgct cagtacagtt gatggaaaga
61 aggatggaca aagctggtca gtacactgac aagggtcttc ggaagtgttg tgaggatggt
121 atgcgggata tccctatgag atacagctgc cagcgccggg cacgcctcat cacccagggc
181 gagaactgca taaaggcctt catagactgc tgcaaccaca tcaccaagct gcgtgaacaa
241 cacagaagag accacgtgct gggcctggcc aggagtgaat tggaggaaga cataattcca
301 gaagaagata tt
//
LOCUS MUSN038A 1260 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse nucleolar protein N038 mRNA, complete cds.
ACCESSION M33212
KEYWORDS nucleolar protein N038.
SOURCE Mouse teratocarcinoma stem cell line F9, cDNA to mRNA, clone
lambda-FML-185.19.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1260)
AUTHORS Schmidt-Zachmann,M.S. and Franke,W.W.
TITLE DNA cloning and amino acid sequence determination of a major
constituent protein of mammalian nucleoli: Correspondence of the
nucleoplasmin-related protein N038 to mammalian protein B23
JOURNAL Chromosoma 96, 417-426 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 79 957 nucleolar protein N038
mRNA < 1 1260 nucleolar protein N038 mRNA
BASE COUNT 419 a 214 c 306 g 321 t
ORIGIN
1 ggcgcgtctg ttctgtggaa caggaggcag ttgttttccg tccggcttct cccacaccga
61 agtgcgcgcc tccacctcat ggaagactcg atggatatgg acatgagtcc tcttaggcct
121 cagaactacc ttttcggctg tgaactaaag gctgacaaag actatcactt taaagtggat
181 aatgatgaaa atgagcacca gttgtcatta agaacggtca gtttaggagc aggggcaaaa
241 gatgagttac acatcgtaga ggcagaagca atgaactatg aaggcagtcc aattaaagta
301 acactggcaa ctttgaaaat gtctgtacaa ccaacagttt ccctaggggg ctttgaaatt
361 acaccacctg tggtcttacg gttgaagtgt ggttcagggc ctgtgcacat tagtggacag
421 catctagtag ctgtagagga agatgcagag tctgaagatg aagatgagga ggacgtaaaa
481 ctcttaggca tgtctggaaa gcgatctgct cctggaggtg gtaacaaggt tccacagaaa
541 aaagtaaaac ttgatgaaga tgatgaggac gatgatgagg acgatgagga tgatgaggat
601 gatgatgatg atgattttga tgaagaggaa actgaagaaa aggtcccagt gaagaaatct
661 gtacgagata ccccagccaa aaatgcacaa aaatcaaacc aaaatggaaa agacttaaaa
721 ccatcaacac cgagatcaaa gggtcaagag tccttcaaaa aacaggaaaa gactcctaaa
781 acaccaaaag gacctagttc tgtagaagac attaaggcaa aaatgcaagc aagtatagaa
841 aaaggcggtt ctcttcccaa agtggaagcc aagttcatta attatgtgaa gaattgtttc
901 cggatgactg accaggaggc tattcaagat ctctggcagt ggaggaaatc tctttaagaa
961 aagggtttaa acagtttgaa atattctgtc ttcatttctg taatagttaa tatctggctg
1021 tcctttttat aatgcaaagt gagaactttc cctactgtgt ttgataaatg ttgtccaggt
1081 tcacttgcca agaatgtgtt gtctaaaatg cctgtttagt tttcaaggat ggaactccac
1141 cctttacttg gttttaagta tgtatggaat gttatgatag gacatagtaa tagtggtcag
1201 atgtggaaat ggtagggaga caaatataca tgtgaaataa actcagtatt ttaataaagt
//
LOCUS RATPOS 1804 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Rat type-2A protein phosphatase catalytic subunit mRNA, complete
cds.
ACCESSION M33114
KEYWORDS type-2A protein phosphatase catalytic subunit.
SOURCE Rat liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1804)
AUTHORS Kitagawa,Y., Tahira,T., Ikeda,I., Kikuchi,K., Tsuiki,S.,
Sugimura,T. and Nagao,M.
TITLE Molecular cloning of cDNA for the catalytic subunit of rat liver
type 2A protein phosphatase, and detection of high levels of
expression of the gene in normal and cancer cells
JOURNAL Biochim. Biophys. Acta 951, 123-129 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 114 1043 type-2A protein phosphatase catalytic subunit
mRNA < 1 1804 type-2A protein phosphatase catalytic subunit
mRNA
BASE COUNT 482 a 391 c 437 g 494 t
ORIGIN
1 ctggggccgc aggaagcacc ccggggagcg gcggcggcgt gtgcgtgtgg cccgggtgcg
61 ggcggcggcg cgggagcagc gcagagcggc agccggttcg ggcgggcggc atcatggacg
121 agaagttgtt caccaaggag ctggaccagt ggatcgagca gctgaacgag tgcaagcagc
181 tctccgagtc ccaggtcaag agcctctgcg agaaggctaa agaaatcctg acaaaagaat
241 ctaatgttca ggaggttcga tgtccagtca ctgtgtgtgg agatgtgcat gggcaatttc
301 atgacctcat ggaactcttt agaattggtg gtaaatcacc agatacaaat tacttgttta
361 tgggagacta tgtggacaga ggatattact cagttgaaac agttacactg cttgtagctc
421 ttaaggttcg ttaccgagag cgtatcacca tactccgagg gaatcacgag agcagacaga
481 tcacacaagt ttatggtttc tacgatgagt gtttaaggaa atacggaaat gcaaatgttt
541 ggaaatactt cacagacctt tttgactacc ttcctctcac tgccttggtg gatgggcaga
601 tcttctgtct acatggtggt ctttcaccat ccatagacac actggatcac atccgagcac
661 ttgatcgcct acaagaagtt cctcatgagg gtccaatgtg tgacttgctg tggtcagatc
721 cagatgaccg tggtggctgg gggatatctc ctcggggagc tggttatacc tttggccaag
781 atatttctga gacatttaat catgccaatg gcctcacgtt ggtgtccaga gctcaccagc
841 tggtgatgga gggatataac tggtgccatg accggaatgt agtaacaatt ttcagtgctc
901 caaactattg ctatcgttgt ggtaaccaag ctgcaatcat ggaacttgat gacactctta
961 agtattcttt cttgcagttc gatccagcac ctcgtagagg cgagccacat gtcactcgtc
1021 gtaccccaga ctacttcctg taatgaaagt ttaaccttgt acagtattgc catgaacacc
1081 gtctgttgac ctaatggaat cgggaagagc agcagtaact ccaaagtgtc agaaatagtt
1141 aacattcaaa cttgtttcca cacggaccaa aagatgtgcc atataaaata caaagcctct
1201 tgtcatcaac agccgtgacc actttagaat gaaccagttc attgcatgct gacgcgacat
1261 tgttggtcaa gaatccagtt tctggcatag cgctatttgt agttactttt gctttcttga
1321 gagactgcag atctaggatg taacattaac acctgtgagt ccagttgact tccacttagc
1381 tgtagcttac tcagcatgac tgtagatgag gatagcaaac aatcattgga gcttaatgaa
1441 catttttaaa tgagtaccaa ggcctcccct cttgttgtgt tctttcaggg atactattaa
1501 tttaattgta tgatttctct gcactcagtt tctcccttct caaatctcgg ccccgcgttg
1561 ttctttgtta ctgtcagaaa acctggtgag ttgttttgaa cagaactgtc tccctcctgt
1621 aagatgatgt actgcacaag tcaccgcagt gttttcataa taaacttgag aactgagaaa
1681 gtcaggtttg aattgtatca gtgggcacga ctggtgctgt ttattaaaca agataaatct
1741 attgatcaat ttcagaattt gtagaattcc aggtaaagaa aaataaagat caaggccact
1801 atat
//
LOCUS RATSCP2 1409 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Rat sterol carrier protein-2 (SCP-2)mRNA, complete cds.
ACCESSION M34728
KEYWORDS sterol carrier protein-2.
SOURCE Rat liver, cDNA to mRNA, clone SP43.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1409)
AUTHORS Billheimer,J.T., Strehl,L.L., Davis,G.L., Strauss,J.F.III. and
Davis,L.G.
TITLE Characterization of a cDNA encoding rat sterol carrier protein-2
JOURNAL DNA Cell Biol. 9, 159-165 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 307 1128 sterol carrier protein-2 (SCP-2)
mRNA < 1 1409 sterol carrier protein-2 mRNA
BASE COUNT 387 a 300 c 390 g 332 t
ORIGIN 2 bp upstream of EcoRI site.
1 ggaattccga acaaaggttg aacactttgc aaaaattgga tggaaaaatc ataaacactc
61 agttaataac ccgtattccc agttccaaga tgaatacagc ttagatgaga taatgaaatc
121 aaggccagtt ttcgattttc tgactgtctt acaatgctgt cccacctcag atggtgccgc
181 agcagcaatt gtgtctagtg aggagtttgt gcagaagcat ggcctgcagt ccaaagctgt
241 ggaaattgtg gcacaggaga tggtgactga catgcccagt acatttgaag aaaaagtgtt
301 attaaaatgg ttggctatga tatgagtaaa gaagctgcca ggaagtgcta tgagaagtcc
361 ggcctgggtc ccagtgatgt cgacgtgata gagcttcacg attgcttctc taccaatgaa
421 ctcctgactt atgaagcact ggggctctgt ccagaaggac aaggtggagc actggtggac
481 agaggggaca acacttacgg aggaaagtgg gtcataaacc ctagtggagg cctcatctcc
541 aagggacacc cactgggtgc cacaggtctg gctcagtgcg cggagctctg ctggcagctg
601 agaggcgaag ccggaaagag gcaggttcct ggggcaaagg tggctctgca gcacaattta
661 ggccttggag gagctgctgt tgtcaccctc tacagaatgg gttttcccga agctgccagc
721 tccttcagaa cgcaccagat ttcagctgct cccaccagct ctgcagggga tggattcaag
781 gcaaatctca tttttaagga aatcgagaag aagcttgaag aggaagggga agagttcgtg
841 aagaaaatcg gtggcatttt tgccttcaaa gtgaaggatg gccccggggg caaagaagct
901 acgtgggtgg tggacgtgaa gaacggcaaa ggatcggtgc ttccggattc agataagaag
961 gctgactgca caatcaccat ggctgactca gacttgctgg ctttgatgac tggtaaaatg
1021 aaccctcagt cggccttctt tcaaggtaaa ctgaaaattg ccggtaacat gggcctggcc
1081 atgaaactgc aaagcctgca gcttcagccg gacaaagcta agctgtgaag agtccctttg
1141 gcaacctcag gacatcaaga tgagatgtgt ggatacgtag aaatccacgt ctccctgtca
1201 ggacttagac tgacacttcc tgaatagcat gagatagatt tcttgctagg tggctatggc
1261 caattgtatt tcccccaagc tgggggtgca aagggcctcc caggctacac tgctgctttg
1321 aggacttgca ttctactgtg cttcatgaag ctactatgtt aatgatggtt tggggtaaac
1381 ttgagtttca gaataaagtt cagaatagt
//
LOCUS SYNPSBAII 556 bp ds-DNA BCT 17-JUL-1990
DEFINITION Synechococcus sp. photosystem II D1 protein (psbAII) gene, 5' end.
ACCESSION M34833
KEYWORDS D1 protein; photosystem II.
SOURCE Synechococcus (strain PCC 7942) DNA.
ORGANISM Synechococcus sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Chroococcales.
REFERENCE 1 (bases 1 to 556)
AUTHORS Bustos,S.A., Schaefer,M.R. and Golden,S.S.
TITLE Different and rapid responses of four cyanobacterial psbA
transcripts to changes in light intensity
JOURNAL J. Bacteriol. 172, 1998-2004 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 81 425 ORF1
pept 527 > 556 photosystem II D1 protein (psbAII)
mRNA 59 > 556 psbAII mRNA (alt.)
mRNA 478 > 556 psbAII mRNA (alt.)
BASE COUNT 136 a 145 c 142 g 133 t
ORIGIN
1 ttccgtgacg gctactgcca gcatgccgag cctgatgtgt gacacctaag atcactccag
61 ttctctttgg aaactggctg atgagtgaag acaccatctt tggcaagatc atccggcgcg
121 agattccagc agacattgtt tatgaagatg atctctgtct ggcttttcga gatgtggcac
181 cccaagcgcc ggttcacatt ctggtgattc ccaagcaacc aattgccaac cttttggaag
241 cgacagcaga acatcaagcg ctgctgggtc atttgttgct gactgtaaag gcgatcgcgg
301 cccaagaagg actcaccgag ggctaccgca ccgtgattaa cacgggccct gcgggtgggc
361 aaaccgttta ccacctgcat attcacttac tgggcgggcg atcgctggct tggccgcccg
421 gctgagaaaa gtctgaaagt tctttacaaa actcaatctg cttgttagat tttactcacg
481 aggctattaa gtctcgtaaa tagttcaact aaggactcat cgcaaaatga cgactgcatt
541 gcagcggcgc gagagc
//
LOCUS ACCPCACGH 2391 bp ds-DNA BCT 17-JUL-1990
DEFINITION A.calcoaceticus gamma-carboxymuconolactone decarboxylase (pcaC) and
protocatechuate 3,4-dioxygenase alpha and beta subunit (pcaGH)
genes, complete cds.
ACCESSION M33798
KEYWORDS gamma-carboxymuconolactone decarboxylase;
protocatechuate 3,4-dioxygenase.
SOURCE A.calcoaceticus DNA, clone pZR2.
ORGANISM Acinetobacter calcoaceticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 2391)
AUTHORS Hartnett,C., Neidle,E.L., Ngai,K.-L. and Ornston,N.
TITLE DNA sequences of genes encoding Acinetobacter calcoaceticus
protocatechuate 3,4-dioxygenase: Evidence indicating shuffling of
genes and of DNA sequences within genes during their evolutionary
divergence
JOURNAL J. Bacteriol. 172, 956-966 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 324 728 gamma-carboxymuconolactone decarboxylase
(pcaC) (EC 4.1.1.44)
pept 750 1463 protocatechuate 3,4-dioxygenase, alpha subunit
(pcaH) (EC 1.13.11.3)
pept 1491 2120 protocatechuate 3,4-dioxygenase, beta subunit
(pcaG) (EC 1.13.11.3)
ORF < 1 203 ORF (AA at 3)
binding 312 315 ribosomal binding site (put.)
binding 737 740 ribosomal binding site (put.)
binding 1478 1482 ribosomal binding site (put.)
BASE COUNT 670 a 463 c 533 g 725 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttatca aacccaacgt tacttgctct gtttcatttt atgtgcaggt attgccgtca
61 atggtgctca atcttcaatg ccagtattga gtgcgcgttt ctatccaacg caatgtcgtg
121 caacaggtgt ggcatggatg tctggaattg gccgttttgg tcgccgtatt tggtgcatgg
181 attggtgcag ttttattagg taataactgg tcatttacca tgatcctgag catgttgata
241 tcctgctcgt gccgctgcaa ttgcaatttt tgttaaatcc cttgttggca catacagatg
301 caacttgatg tgaggtttct cttatgaatg atgaacaacg ttataaacaa gggctggaag
361 tccgtacaga agttctgggt gaaaagcatg ttaaccgttc acttgagaac ctaaacgact
421 tcaatcagga ttttcaaaac tttattagcc gttttgcatg gggagaggtt tggtctcgtc
481 ctggacttcc tcgtcatacg cgtagtttgg tcacaattgc agtgttgttg gcgcttgggc
541 gtgaagatga gctgcgtatc gatttacgtg cctgtttcaa caatggtgtg accaaagatg
601 aattaaaaga gttgattcta cattgttcac tttatgcagg cttacccgct tcaaatgctg
661 cgatgcatat ggcagaagag gtctttaagg acttgggaat tgcacctgaa aaagtaaaca
721 aggattaatg gatttagagg gagtagaaaa tgtctcaaat tatttgggga gcttacgctc
781 agtcgaatac cgaagatcat ccgccagctt atcgaccagg ttataaaacc agtgtgttac
841 gctcgcctaa gaatcgactc atttctattg cagaaacttt aagtgaagtc actgcaccac
901 attttagtgc tgataaattt ggtccaaaag acaatgactt aatcttgaac tatgccaaag
961 atggtttgcc gattggtgaa cgtgtcattg tgcatggtta tgtacgtgac cagtttggtc
1021 gacctgtaaa aaatgcactt gttgaagtat ggcaagccaa tgcatctggt cgttatcgtc
1081 atccaaatga tcaatatatc ggtgccatgg atcctaattt cggtggttgt ggccgtatgt
1141 tgactgatga taatggttat tatgtattcc gtacgattaa accaggtcca tacccatggc
1201 gtaaccgtat caatgaatgg cgtcctgctc acattcactt ctctttaatt gcagatggtt
1261 gggcacaacg tttaatttcg cagttctatt ttgaaggcga tacattgatt gattcttgcc
1321 caatcctgaa aaccattcct tctgagcaac aacgtcgtgc gctaattgca ttagaagaca
1381 agagcaattt cattgaagct gacagccgtt gttatcgctt tgacattacg cttcgtggtc
1441 gcgcggactt acttcgaaaa tgacttaact taatggatgg agtaaaaaac atgaatggtt
1501 ggaattttca ggaactcaaa gaaaccccat ctcaaacagg tggtccatat gtccatattg
1561 gtcttttgcc aaaacaagcc aatatcgaag tatttgaaca caatttaggt aacaaccttg
1621 tacaggacaa tacgcaaggt caacgcatcc gattagaagg tcaagtgttt gatgggctta
1681 gtttacctct acgtgacgta ctgattgaaa tctggcaagc agataccaat ggcgtttacc
1741 caagtcaagc tgatactcaa ggtaaacaag tcgatccgaa ctttttgggt tggggccgta
1801 caggtgcaga ttttggtaca ggcttctgga gttttaatac cattaagcca ggtgctgttc
1861 caggccgtaa aggttcaact caagcaccac atatttcact gattattttt gcgcgtggta
1921 tcaacattgg tcttcacact cgtgtctatt ttgatgacga agcagaagca aatgcaaaag
1981 atcctgtttt aaacagtatc gagtgggcga cacgtcgtca aacactggtt gccaaacgtg
2041 aagagcgtga tggtgaagtg gtttatcgtt ttgatattcg tattcaaggt gaaaacgaaa
2101 cggtattttt cgatatctaa atttttgact atggaacggg ccgtatatct ttcggtatac
2161 agcctccata attttacaag gaaacgtaaa tgttcaatac aaacaaactg gttaaacaac
2221 tttgtctaag tggtgcgttg ttgatgtcat tgcctatatt gtcaacgacc tatgcggctg
2281 aaaatactgt accagcatct aaatcaactt atgttgtaaa aaatttaaac attggtgatt
2341 tacctgttaa aaccctcgtg ccaatcactg ctaaaacaag agagcaagct t
//
LOCUS HAMSAA3A 530 bp ss-mRNA ROD 17-JUL-1990
DEFINITION M.auratus serum amyloid A (SAA3) mRNA, complete cds.
ACCESSION M33431
KEYWORDS serum amyloid A protein.
SOURCE M.auratus (strain LVG) embryo fibroblast, cDNA to mRNA, clone
pAA825.
ORGANISM Mesocricetus auratus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 530)
AUTHORS Gervais,C. and Suh,M.
TITLE Serum amyloid A protein-related mRNA expression in herpes simplex
virus type 2 transformed hamster cells
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Suh, 30-MAR-1990.
FEATURES from to/span description
pept 42 410 serum amyloid A
signal 510 515 polyA signal
BASE COUNT 147 a 122 c 140 g 121 t
ORIGIN
1 ggggacctct agagtcactc accaggcatc tcgaaagcag gatgaagcct ttccttgcca
61 tcattttttg cttcttgatc ctgggagttg acagccaaag gtggttccag ttcatgaaag
121 aagctggcca agggagtaca gacatgtgga gagcttactc tgacatgaga gaggccaact
181 ggaaaaactc agataaatac ttccatgccc gggggaacta tgatgctgcc aaaaggggac
241 caggaggagc ctgggctgct aaagtgatca gtgatgccag agagggtatt cagaggttca
301 caggacgtgg agcagcggac tcaagagctg accagtttgc caacaaatgg ggccggagtg
361 gtaaagaccc caaccacttt cgacctgctg gcctgcctag taaatactga gtttcatctt
421 catgctattc ccagccatgc accccaaggt tgcatttact agattaagtt agttcctgag
481 ctctggacgt gtaagcaccc gaaatttaga ataaatacct gtgaaatgct
//
LOCUS HEIBEXBA 795 bp ds-DNA BCT 17-JUL-1990
DEFINITION H.influenza (strain RM153) capsulation protein (bexB) gene, 5' end.
ACCESSION M33787
KEYWORDS capsulation protein.
SOURCE H.influenza (type b, strain RM153) DNA.
ORGANISM Haemophilus influenzae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 795)
AUTHORS Kroll,J.S. and Moxon,E.R.
TITLE Capsulation in distantly related strains of Haemophilus influenzae
type b: genetic drift and gene transfer at the capsulation locus
JOURNAL J. Bacteriol. 172, 1374-1379 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 795 capsulation protein (bexB)
BASE COUNT 188 a 133 c 177 g 297 t
ORIGIN
1 atgcaatatg gtgataaaac aacatttaaa cagtcattag ctattcaagg acgggtgatt
61 aacgcgttgc ttatgcgtga aataatcact cgttatgggc gtcaaaatat tggctttttt
121 tggctatttg ttgagccttt gctcatgact ttttttattg taatgatgtg gaaatttatc
181 cgtgctgata aattttctac tttaaatatg attgcctttg tgatgacggg gtatccaatg
241 gcaatgatgt ggcgaaatgc ttcaaaccgt gcgattggtt ccatttctgc aaacttgagt
301 ttactttatc accgaaatgt tcgagtgcta gataccattt ttactcgcgt gttgcttgag
361 gttgctggtg cttctattgc tcaaattctt tttatggcaa tcttagtcat gattgattgg
421 attgatgcac cgcatgatgt gttttatatg ctcattgcat ggttcctcat ggcaatgttt
481 gcttttggct tggggttaat catttgtgcg attgctcagc aatttgatgt gtttggtaaa
541 atttggggaa cactcagttt cgttttatta ccaatttctg gcgcattctt cttcgtccat
601 aaccttccag cacaggcaca atctattgca ctttggtttc cgatgattca tggtacagaa
661 atgttccgcc atggctattt tggcgataca gtggttactt acgaaagtat aggctttttg
721 gttgtgagtg atctagcctt gttgttactg ggcttagtga tggttaaaaa ctttagtaaa
781 ggagttgagc cacaa
//
LOCUS HEIBEXBB 795 bp ds-DNA BCT 17-JUL-1990
DEFINITION H.influenza (strain RM926) capsulation protein (bexB) gene, 5' end.
ACCESSION M33788
KEYWORDS capsulation protein.
SOURCE H.influenza (type b, strain RM926 ATCC 8468) DNA, clones
lambda-926.[8,11,14].
ORGANISM Haemophilus influenzae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 795)
AUTHORS Kroll,J.S. and Moxon,E.R.
TITLE Capsulation in distantly related strains of Haemophilus influenzae
type b: genetic drift and gene transfer at the capsulation locus
JOURNAL J. Bacteriol. 172, 1374-1379 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 795 bexB protein
BASE COUNT 188 a 128 c 178 g 301 t
ORIGIN
1 atgcaatatg gtgatcaaac aacgtttaaa cagtcattag cgattcaagg aagagtgatt
61 aatgcattac ttatgcgtga aataatcact cgttatgggc gcaaaaatat tggctttcta
121 tggctatttg ttgagccttt actcatgact ttttttattg tgatgatgtg gaaatttatt
181 cgtgctgata aattttcaac tttaaatatg attgcttttg tcatgaccgg gtatcctatg
241 gcgatgatgt ggcgaaatgc ttctaatcga gctattggtt cgatttctgc aaacttgagt
301 ttactttatc accgaaatgt tcgagtgcta gatactatct ttactcgagt attgcttgag
361 gttgcaggtg cttcgattgc tcaaatcctt tttatggcag ttttagtgct tattggttgg
421 attgatgcac cacgtgatgt gttttatatg cttatggcat ggtttctcat ggctatgttt
481 gcttttgcct taggccttat tatttgtgcg gtagctcaac aatttgatgt gttcggtaaa
541 atttggggaa ccttaagttt cgtattgtta ccaatttctg gtgcattttt ctttgttcac
601 aaccttccat cacaggctca atctatcgcg ctttggctcc caatgattca tggtacggaa
661 atgtttcgtc atggctactt tggggatacg gtcgttactt atgagagtat cggattttta
721 gtcgtaagtg atttggcctt attgctaatg ggcttggtga tggttaaaaa ctttagtaag
781 gggattgagc cacaa
//
LOCUS PSEARCDA 1502 bp ds-DNA BCT 17-JUL-1990
DEFINITION P.aeruginosa membrane protein arcD (complete cds) and arcA (5' end)
genes.
ACCESSION M33223
KEYWORDS membrane protein.
SOURCE P.aeruginosa (strain PAO1) DNA, clone pME183.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1502)
AUTHORS Luethi,E., Baur,H., Gamper,M., Brunner,F., Villeval,D.,
Mercenier,A. and Haas,D.
TITLE The arc operon for anaerobic arginine catabolism in Pseudomonas
aeruginosa contains an additional gene, arcD, encoding a membrane
protein
JOURNAL Gene 87, 37-43 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Haas, 26-MAR-1990.
FEATURES from to/span description
pept 24 1472 membrane protein (arcD)
pept 1494 > 1502 membrane protein (arcA)
binding 8 15 ribosome binding site (put.)
BASE COUNT 247 a 507 c 443 g 305 t
ORIGIN
1 attcctatag ggagattccg aaaatgtccc aagaatccag ccaaaaactc cgactaggag
61 cgctaacggc tctggttgtc ggctcgatga tcggaggggg catcttctcc ctcccgcaaa
121 acatggcagc cagcgccgat gtcggcgccg tcctgatcgg atgggcgatc accgccgtcg
181 gcatgctcac cctcgctttc gtcttccaga ccctcgccaa ccgcaagccc gaactcgacg
241 gcggcgtgta cgcctacgcc aaggccggct tcggcgacta catgggtttc tcctcggcct
301 ggggctactg gatcagcgcc tggctgggca acgtcggcta cttcgtcctg ttgttcagca
361 ccctcggcta cttcttcccg atcttcggca agggcgacac cgtcgcggcg atcgtctgcg
421 cctcggtcct gctctgggcc ctgcacttcc tggtgctgcg cgggatcaag gaggcggcgt
481 tcatcaacac cgtcaccacc gtggccaagg tcgtgccgct gttcctgttc atcctgatct
541 gcctgttcgc cttcaagctg gacatcttca ccgccgacat ctggggcaag agcaacccgg
601 acctgggcag cgtgatgaac caggtgcgca acatgatgct ggtcaccgtc tgggtgttca
661 tcggcatcga gggcgcgagc atcttctcct cccgcgcgga aaaacgttcc gacgtcggca
721 aggccaccgt gatcggcttc atcaccgtcc tgctcctgct ggtgctggtc aacgtgctgt
781 ccatgggcgt gatgacccag ccggaactgg ccaagctgca gaacccgtcg atggcgctgg
841 tactcgagca tgtggtcggc cactggggcg ccgtgctgat cagcgtcggc ctgctgatct
901 cgctgctggg cgcgctgctc tcctgggtgc tgctgtgcgc cgagatcatg ttcgccgccg
961 ccaaggacca caccatgccg gagttcctgc gccgcgagaa cgccaaccag gtgccggcca
1021 acgccctgtg gctgaccaac atctgcgtac aggtgttcct ggtggtggtg ttcttcacct
1081 cgggcgaccc ggacggcatg gacccgtaca ccaagatgct gctcctggcc acctcgatga
1141 tcctgatccc gtacttctgg tccgccgcct atggcctgct gctgaccctg aagggcgaga
1201 cctacgagaa cgatgcccgc gaacgcagca aggacctggt catcgccggc atcgccgtgg
1261 cctacgcggt ctggctgctc tacgcggggg gcctgaagta cctgctgctg tccgccctgc
1321 tgtatgcgcc cggcgcgatc cttttcgcca aggccaagca cgaggtcggc cagccgatct
1381 tcaccggaat cgaaaaactg atcttcgccg cagtcgtcat tggcgccctg gtggctgcct
1441 acggcctcta cgacggtttc ctcaccctct gattactcca ctggagacgt gaaatgagca
1501 cg
//
LOCUS TOMCD4A 5429 bp ds-DNA PLN 17-JUL-1990
DEFINITION Tomato ATP-dependent protease (CD4A) gene, complete cds.
ACCESSION M32603
KEYWORDS ATP-dependent protease.
SOURCE Tomato (L.esculentum) DNA.
ORGANISM Lycopersicon esculentum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 5429)
AUTHORS Gottesman,S. and Pichersky,E.
TITLE A conserved regulatory subunit for ATP-dependent protease in
prokaryotes and eukaryotes
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Pichersky,E., 08-MAR-1990.
FEATURES from to/span description
pept 787 1357 ATP-dependent protease (CD4A), exon 1
1461 1576 ATP-dependent protease (CD4A), exon 2
1828 1950 ATP-dependent protease (CD4A), exon 3
2034 2219 ATP-dependent protease (CD4A), exon 4
2853 3068 ATP-dependent protease (CD4A), exon 5
3248 3465 ATP-dependent protease (CD4A), exon 6
3768 3843 ATP-dependent protease (CD4A), exon 7
3926 4015 ATP-dependent protease (CD4A), exon 8
4125 5309 ATP-dependent protease (CD4A), exon 9
pre-msg 1 5429 CD4A mRNA and introns
IVS 1358 1460 CD4A intron A
IVS 1577 1827 CD4A intron B (no splice consensus)
IVS 1951 2033 CD4A intron C
IVS 2220 2852 CD4A intron D
IVS 3069 3247 CD4A intron E
IVS 3466 3767 CD4A intron F
IVS 3844 3925 CD4A intron G (no splice consensus)
IVS 4016 4124 CD4A intron H
BASE COUNT 1521 a 972 c 1221 g 1715 t
ORIGIN
1 tcgaagttgt aaaggttggt cttttgcttc tacttgtttg gactagtttc tttccaagta
61 atggggttat taagttggta ttgttgtgga ttttgacagg atatgggggt atgatgtagc
121 tctcatgttt atattttcaa ttaagcaatg tagttctggt tgactagttg tggtgctaga
181 attcccaatt aacctcttca ctgaattact cctagtgctg aataaaactg ccttatctca
241 agggtgtggg ttgttgttgt attctagagg gtttgtttat ataatcccat tggaaccaag
301 ttaacctatg caataggaat tatatattca agattctgct agtttagaaa ctaaagtcct
361 tgcagcttgg ctgctttatc ctgtatgaaa tagttaccat cctgtggttg catagttttc
421 caggtcttct tgttttgagg ctttattatt cattttactg atgatataac ttttatgatg
481 aaatttgaat ttagttttaa gcttatgcgc taggtgtatg ttgctcaaca tgcattcgca
541 gagttttttg aagtatggca ttctcataat gatcctattt gtctacatta tgtcatctaa
601 agaaattaac cttcaacggt taatggttta caatggtcca aaaaaagtag ccttcaatgg
661 ttattagtga tggttttgtt gatgtctgct aactatatgt atatgttatg taatatgatc
721 cttggtactt cttccattgc cactaatgat cttttcttat atgctatcat gtcaggttgc
781 aggtccatga tggctagagc tttagttcag tcaacaaaca ttctaccttc agttgctggt
841 gaaagggctg gacaattcaa tggatcccgg aaagatcaaa gaactgttag aatgctatgt
901 aatgtaaaat gttgttcctc aaggctgaac aattttgcag gactgcgagg gtgcaacgca
961 ttagatacac tactagtaaa atctggagaa actctccatt cgaaagtggc agctgcaact
1021 tttgtcagac ggccacgagg gtgccgattt gtcccaaaag caatgtttga gcggttcact
1081 gagaaagcaa taaaagtcat tatgcttgcc caagaagagg ccagacgact aggtcacaac
1141 tttgttggga cagaacagat cctgctgggt cttattggtg agggaactgg tattgctgct
1201 aaggttctta aatccatggg aattaatttg aaagatgctc gtgtggaagt agagaagatc
1261 attggcaggg gtagtggatt tattgcagtg gaaatccctt ttactcctcg tgcaaagcgt
1321 gttctagaac tatctctgga ggaagcccgc caactaggta cccttatcta tcatttcagc
1381 cttttgttgg atggcctaac tgtcatgcga actcaaaatt catttctttg ttgcatctgt
1441 tgattggttt tgtgatgcag ggcataatta tattggctcc gagcacttgt tacttggact
1501 gttgcgcgaa ggtgaaggtg tggctgcccg tgttcttgaa aacttgggtg ctgaccccac
1561 taacattcgc acacaggcaa gtgattttcc tttctcaaac tggtttgaaa tgcttttctt
1621 gaagccaagg gtctaatcag aaacagcctc tctacctcca caagtagggg taaggtctgc
1681 atacatctta ccttcccaga tgccacctgt gggattacac tgggtatgtt gttgttgttg
1741 tttacttctg atgggggtga gggtatttta aattgcttga cttcattgta atcagtttct
1801 cttttttctt gtaaaataaa taaataggtg atacggatgg ttggcgagag tagtgaggct
1861 gttggtgcta gtgttggagg tggaacttct ggcctaaaga tgcctacatt ggaggagtac
1921 ggcaccaatt tgacaaagtt agctgaagag gtacatagaa agataagtcc gtcataatgt
1981 ggcaagtgat tggctctctt caattatttg atatggtcat attatatttt caggggaaac
2041 tagatcctgt agttggaaga caggctcaaa tcgagcgtgt tactcaaatc ttgggtcgcc
2101 gtaccaaaaa caacccttgc cttattggag aaccaggtgt tggaaaaact gctattgccg
2161 aagggctagc tcaaagaatt gctaatggtg atgtccctga aacaatagaa ggaaagaagg
2221 tcacaggggt gtatgataaa ttgcataaca tgttttatgc ttgagctttt tttctcttta
2281 tcaatttcat ttcaactgtg tggcagatcc agaagctgtg ctacattctt agatagaaag
2341 acttcaaatc atcattttta taatgtctta gaatttgatt ttatacagag ggtaattttt
2401 taaattggaa gtaaatattc tgcagacaaa tatatataca caaatattgt tatctgatcc
2461 cccatataac acccccaccc cccaccccaa aagcctctag gccaaaaatt tacgcgattc
2521 taagagtatc caatcaatat ttttggtagg cgtgccaaaa atttacagaa acatttgaat
2581 gtcccactaa cttgtacgaa caggttttca aatttcaacc gccttgagcc actacatatt
2641 cttcattgtc tgatatatgg actcaaggca tatatcaggt ttagttccct cttacttcaa
2701 ttctagtcta actgaagtca tagggaagat gaacctattt atctaaatct ttttatatgc
2761 aactattttg atgaggtata ccctcctttt tcaaaaaatg tgtatatcct tgtcattata
2821 cgagtgtatg acttgctttg ggtgatggtc aggtcataac acttgatatg ggattgcttg
2881 ttgctgggac aaaataccgt ggagagtttg aggaaagact gaagaaacta atggaggaaa
2941 ttaaacaaag tgatgaaata atactgttta ttgatgaagt acacacattg attggagctg
3001 gagctgcaga gggagcaatt gatgctgcaa acatcttgaa acctgccctg gctagaggcg
3061 agctacaggt tcgttatcgt ttcgacctgt agtatgttgt tccttggaat ttcaatgagc
3121 actatgaaat ctaccttaaa tttggtgttt aatcattatc ttggtccgag tacatcttgg
3181 taaaactaca aaagaatttc aaagttggtg ttaattaaga agtattttgc ttggtttcac
3241 ttgtcagtgt attggagcca caaccctgga tgagtacaga aagcacattg agaaagaccc
3301 tgcactagag agaaggttcc agccagttaa agtccctgaa ccttctgtgg atgaaaccat
3361 acagatcttg aaagggcttc gggagaggta tgagattcat cacaagctcc attacactga
3421 tgaggcaata gaggccgctg ccaagctttc tcaccagtac atcaggtatt attattaatc
3481 atatattcat tttccaaagt atcttccgta gtcaagttgt tggtacattt tgactttggg
3541 atttcacggc aaactaattt attcttgtat gatttaagat atatccttct caataataaa
3601 actcttagga taagcggctt ttcttttgta tcagtgtctg aataggctga cgcacggttt
3661 atctcctgca aattgcacat ttatctccac tccaaaacaa tcatcttttt ccgtgattta
3721 tctgaagtaa agcattaata tgtttttgag ttgaaccttt attccagtga ccgctttctg
3781 cctgataaag caattgactt gattgatgaa gctggttccc gtgttcgact tcgtcatgca
3841 caggcataga gtactctttt tgtcttactc tgtcctgcca tttcttccta agggagtcat
3901 gacccgttat attgatacta tctagctccc tgaggaagca agagagctgg aaaaggagct
3961 tcgccagatt acaaaggaga aaaatgaagc tgtccgcggt caagattttg agaaggtact
4021 ggtttcctct cgtcctgtaa aaaaattcct ctggactgat tatttcttag aaaacattga
4081 gattttcagg tgtgcaattt taatcctctt tatgcattgg acaggctggg gagttgcgtg
4141 atagagaaat ggatcttaag gcacaaatct cagccttgat agacaaaaac aaagagaaga
4201 gcaaggcaga atctgaggca ggagatgcag caggtcccat agtgacagag gcagatattc
4261 agcacattgt ctcttcctgg accggcattc ctgtagagaa ggtctcaact gacgaatctg
4321 atcgcctcct aaaaatggaa gaaacacttc atacccgagt cattggccag gatgaagcag
4381 taaaagccat tagtcgcgct atcagacgtg cccgtgttgg cctcaagaat cccaaccgac
4441 ctattgccag tttcatcttt tctggtccaa ctggtgttgg gaagtcagaa ctggcaaagt
4501 ctttagcaac gtactacttt ggttctgaag aagcaatgat tcggcttgat atgagtgagt
4561 ttatggaaag acacactgtg tctaaactca tcggttcacc ccctgggtat gttggttaca
4621 ctgaaggtgg tcaattaact gaagctgtta ggcgtcgacc ttatacagtt gtgctctttg
4681 atgagattga gaaggctcat cctgatgtct tcaacatgat gcttcaaatt cttgaagatg
4741 gaagattgac agacagcaaa ggcagaactg tcgacttcaa gaatacactc ctcatcatga
4801 catcgaatgt tggaagcagt gtgatagaga aaggaggtcg tcgtataggt tttgatctag
4861 attttgacga gaaggatagt agttacaacc gtatcaagag cttggtgact gaggagttga
4921 aacagtactt caggccagag tttttaaaca gattgagtga gatgattgta ttccgtcagc
4981 tcactaagtt ggaggtgaag gagattgccg atatcatgct taaggaggtc tttgtgaggt
5041 tgaagaataa ggagatagaa ctccaagtga cagagaggtt tagagatagg gtagttgatg
5101 aaggatataa cccaagctat ggagctagac cattgaggag agctattatg agactgctag
5161 aggatagtat ggccgagaag atgcttgcag gtgagatcaa agaaggtgat tcagtaattg
5221 tggatgtgga ttcagatggc aacgtcaccg ttctcaatgg cactagtggt gctccctcag
5281 attcagctcc tgagcctatc cttgtgtaga tcaattcttc atttagctct acaaatttgt
5341 tcttcgtaat gttgcttttg gatgtcttgg ctaagcaaca tgtgagtaat tttttgggtg
5401 acaggctgat atattcttcc aagttgact
//
LOCUS TOMCD4B 5846 bp ds-DNA PLN 17-JUL-1990
DEFINITION Tomato ATP-dependent protease (CD4B) gene, complete cds.
ACCESSION M32604
KEYWORDS ATP-dependent protease.
SOURCE Tomato (L.esculentum) DNA.
ORGANISM Lycopersicon esculentum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 5846)
AUTHORS Gottesman,S. and Pichersky,E.
TITLE A conserved regulatory subunit for ATP-dependent protease in
prokaryotes and eukaryotes
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Pichersky,E., 08-MAR-1990.
FEATURES from to/span description
pept 1111 1675 ATP-dependent protease (CD4B), exon 1
1896 2011 ATP-dependent protease (CD4B), exon 2
2133 2255 ATP-dependent protease (CD4B), exon 3
2326 2511 ATP-dependent protease (CD4B), exon 4
3020 3235 ATP-dependent protease (CD4B), exon 5
3445 3662 ATP-dependent protease (CD4B), exon 6
3978 4053 ATP-dependent protease (CD4B), exon 7
4138 4227 ATP-dependent protease (CD4B), exon 8
4354 5535 ATP-dependent protease (CD4B), exon 9
pre-msg 1 5846 CD4B mRNA and introns
IVS 1676 1895 CD4B intron A
IVS 2012 2132 CD4B intron B (no splice consensus)
IVS 2256 2325 CD4B intron C
IVS 2512 3019 CD4B intron D
IVS 3236 3444 CD4B intron E
IVS 3663 3977 CD4B intron F
IVS 4054 4137 CD4B intron G (no splice consensus)
IVS 4228 4353 CD4B intron H
BASE COUNT 1592 a 1067 c 1283 g 1904 t
ORIGIN
1 aagcttctgt agcccaaagt tttgtccttt cacttcttgt ctactcctca tactgtctct
61 tccaaattca acagttgcca ctgtcgattt ttcagcagta aatttacctt gctcaatatc
121 aactccccat agcccaccca aacttctctt catttttata ataatgacgt tgcatgtttc
181 atacctcgcc taggaatgtg ttcatcgaat ttccgtcatt gtcacatcat atctgttctt
241 gtcctttact ctttttcctt ttctttcctg tactctcgtt cgtgttgtta ggatttgtac
301 tgaaaccgtt gggttatcag ttttacagac ctttaagcag tctctctagg gtttactaag
361 agctggaaat ttgaataatc tgatcaaaat tgctgaggtt cttttaatcc ttctaaagat
421 ctctgtagca gagtggtgaa aagcggaaca gggggaatac tcggtacaaa tggtgttgtg
481 ttcaattgca aagaccaaac ctcccattct aatgtgttgc aacaaactat taggggagtt
541 ttaaggaaga gaaaatgggg gataatgcta cacaatgcta acctattata atcatctatt
601 tagtttgcaa accccacata tctctgtgtg tattgtccta tttgttcaat ctattcatga
661 aaccttggct actcaatcct ggattgcgca cgtactactc agcagttgcg tagtgcccta
721 gttcttatat cttggagggt agactgaatt tggtttatgt caatgaatca tgtttcacac
781 agatggttac atattttatt acatagaatc tatatgtcat gcatcgactc ctgtcgagta
841 tggatacttc ctacttcttc ttggaactct ctgattgagt tatctttctg ctaatggact
901 gtgcctcact atttttatct tcctctggat tggaaaatat ggacattcac ttgttatcaa
961 tttttgaatt aataatttgt tgatatgttg gctgattgat gtgtcattga cttcatatac
1021 tgttatgggt ttgtcatgtg gaagatttga tattttcgat tatctaagga gagctttttt
1081 gattattttc ttatcctggg caggtccatc atggctagag ctctagttca gtcaacaagc
1141 atcccatcgt cagttgctgg tgaaaggaca accaaattca atggatctgg gaaaacaaaa
1201 agagctgtta caatgctatg caatgcacaa tcatcttcac ttacactgag ggattttaca
1261 ggattgcgag gatgcaacgc aatagataca ctagttagat ctggagaaac tctccaatcc
1321 aaagtagctg ctgcaactta tgtcagacgg ccacgaggtt gccgatttgt accaaaagca
1381 atgtttgagc gcttcacaga gaaggcgata aaagtcatta tgcttgcaca agaagaggcc
1441 agacgacttg gtcacaattt tgttggcact gagcagatct tgttgggtct aattggtgag
1501 ggaactggta ttgctgccaa ggttcttaaa tcgatgggga tcaatttgaa agatgctcgt
1561 gtggaagtgg aaaagataat tggaaggggt agtgggtttg ttgctgttga gatccctttt
1621 acccctcgtg ccaagcgtgt tctggaactc tctctggagg aagcccgcca gctaggtatc
1681 cttttttgat tgtctgtatt tgttgtatta cctttgggac atgtctaaaa gtggttctcc
1741 ggttgagtct tattctcatg ttgctcgttg acacctttgg tttttcccaa aattcattga
1801 ttttctttct ctttgcttgc ctttctggtc ttctacatgc accagcatga ttatttgtct
1861 aaatatttat tccatttata aactggttgg ttcagggcat aactatattg gttcggagca
1921 cttgctactt ggattgctac gtgaaggtga aggtgtggct gcccgtgttc ttgaaaactt
1981 gggtgctgac cccagcaaca tccgcactca ggcaagtagt agatctcaca tgcttcctct
2041 gggtagtcta atattgtgct tttgtgggaa gttggatttt ttggtttcat catgaccagt
2101 gttcattggt tttattcttc ttctaaaaat aggtgatccg gatggttggt gagagtaatg
2161 aggctgttgg tgctagcgtt ggaggtggaa cttctggcca aaaaatgcca acactggagg
2221 agtatggaac aaatttgaca aaattggcgg aagaggtaat ctaaaaaact atatttgtga
2281 tgatatgtga acagttatga ataattgtcc atttttggtt tgtagggaaa attggacccc
2341 gttgttggaa gacagccgca aattgaacgg gtcactcaaa tcttgggtcg gcggactaag
2401 aacaaccctt gtcttattgg tgaaccaggt gttggcaaaa cagctattgc tgagggtttg
2461 gcacaaagaa ttgcaaatgg tgatgtccct gaaacaattg aggggaagaa ggtacgtcat
2521 gtggatagct tgcagaatgt gatgtttacc tgtgcccttt cttctcccat ttgttgtatc
2581 atatggtgaa cccagaagct gagcagggtt atgaggtgca gtctagtggt attttcgatt
2641 cacaaggatt gggttcaaat gaatgtgcat tctatctcca ctaaattgaa atatgtcaaa
2701 ttatattttc agtgtctgtg ttctcttgaa tttttttata tataaattct tccgtgctac
2761 ttgttctctt ggcatatttc tttttagtag attgatttgt cgatctatgc ttataactaa
2821 ttgagaaaaa acgattagta tcccttgttg aacttaattt cttagctcaa actatcaaac
2881 tgaagtgatt tcaagtgaga aaggtccacc acctattctt gtgaatctaa tccatctgaa
2941 actttgtcaa atgtttcgtg ctatgatata gctacctttc tcgttgttcc taagagtcgc
3001 tttttttggt gataattagg tgataactct tgatatggga ttgcttgttg ctggaacaaa
3061 ataccgtgga gagtttgagg aaaggctaaa aaagttgatg gaggaaatta agcagagtga
3121 tgaaataata ctatttatcg atgaagtgca cacattgatt ggagctggag cagcagaggg
3181 ggccatcgat gctgcaaaca tcttgaaacc tgccctagct cgaggtgaac tacaggtaag
3241 ttcctccttc gcttgttctt tcctttctgt tctttctaga aattaaaaat gagaactaag
3301 aatatttctg ttaacaactc aatgttttag aagatatatc attgttttaa catgcatttt
3361 accgcctcat aagaagttaa aatgactgct cagttaaagc aggagaaaaa acttctacat
3421 gtgtatttac ttggcttctt gcagtgtatc ggagctacta cactggatga atacagaaag
3481 catattgaga aagatcctgc actagagagg agattccaac cagttaaggt ccctgaacct
3541 actgttgatg aaactataca gattctgaaa gggcttcgtg agaggtatga aattcatcac
3601 aaacttcgtt acactgacga agacttagtg gctgctgctc agctctcata ccagtacatc
3661 aggtatctaa tctctaccat tttcctatgt cttcttgtac gtttttactc ttttctcgtc
3721 ctgaattgtt gatgcattgg taccttgaag aaatatgttt gctttaagac ggtagacctg
3781 aagttcaact tattttatga tagttaagtg tgataataga agttggttta cttttgttta
3841 tcagcatcta attaggtttt tatgcttatc tctagaaaag attgtcctag attccacgat
3901 atagattttc gtataaagtt tttttttcct tgttctgcta agtgatacat cttttcactt
3961 gcgttttcac atttcagtga ccgatttctg cctgacaaag caattgattt gattgatgaa
4021 gctggttctc gtgttcgact tcgccatgct caggcatgaa atgctctctt agtctctctt
4081 ttaatctgtc attgccctac tatttgttga gtgacaatgc tctgttgttg cctccagctc
4141 cccgaggaag caaaagagct cgagaaagaa cttcgtcaga taacaaagga gaagaatgaa
4201 gctgttcgtg gtcaagattt cgaaaaggta ctgagtttgt catgtacatc taaagaaccc
4261 ctcaaaaatg gttgttatat agcccaacta ttttcttatc aagtattata attaaagaat
4321 tttctaatga catggttttg gattaatgga caggctgggg aattacgtga tagagaaatg
4381 gatctgaagg cacagatcac agccctcata gacaaaaaca aagaggtgag caaggctgaa
4441 agtgaggctg cagatacagg tccacttgtg acagaagcag atattcagca catcgtctct
4501 tcttggactg gtatccctgt tgagaaggtc tccactgacg aatctgaccg tctcctaaaa
4561 atggaagaaa cacttcacac acgaatcatt ggccaggatg aagctgtcaa agccattagt
4621 cgcgctattc gacgtgcacg agttgggctc aagaatccca acagacctat tgctagtttc
4681 atcttttccg gtcccactgg tgttgggaaa tcagaactgg caaaggcact ggctgcatat
4741 tactttggtt ctgaagaagc aatgatccgg cttgatatga gtgagttcat ggaaagacac
4801 actgtctcta aactcattgg atccccccct ggttatgttg gttacacaga gggtggtcaa
4861 ttgactgaag ctgttaggcg tcgaccttat acagttgtgc tctttgatga gattgagaag
4921 gctcatcctg atgtcttcaa catgatgctt caaattcttg aagatggaag attgacagac
4981 agcaagggta gaactgtaga tttcaagaac acacttctca tcatgacatc aaatgttgga
5041 agtagtgtaa ttgagaaagg tggccgtcgt ataggttttg atctcgacct tgatgagaag
5101 gatagcagtt acaatcgcat caagagctta gtgacagaag aattgaagca gtacttcagg
5161 ccagagtttt tgaacagatt ggatgagatg attgtattcc gtcagctcac taagttagag
5221 gttaaggaga tagctgatat catgcttaag gaggtctttg agaggttgaa agttaaggag
5281 atagaacttc aagtgacaga gaggtttaga gacagggtgg ttgacgaggg atacaaccct
5341 agctatggag cacgacctct gagaagagct attatgagac ttttagagga cagcatggct
5401 gagaaaatgc ttgcaaatga gattaaagaa ggtgattcag ttatcgtgga cgttgattca
5461 gatggcaatg tgactgtcct caatggaagt agtggtactc cctcggatcc ggctcctgag
5521 cctatccccg tttagatcag gtgatcacat tttagctcaa cagtctctgt gtggtctttt
5581 aagagttcat ttgtgttggc cactaagctt tgtggggctt atgaagcaac ttttgagttt
5641 atctctggga atgctcaggt aaatcttgcc acgtcgagaa ggcagcatcc tttctatgtt
5701 gatcttaatg attttacaga agtatactgg attaagattg tagttggtca caagaaagta
5761 ttgtattaac aagacactcc ttatcaaaaa catatctagt ttattgtagg cgacagttga
5821 atcgttaatc ctcgtattgt ctatcc
//
LOCUS PLYORIA 375 bp ds-DNA VRL 17-JUL-1990
DEFINITION Human polyomavirus BK (strain WWT) origin of replication.
ACCESSION M34048
KEYWORDS origin of replication.
SOURCE Polyomavirus BK (strain WWT) DNA, from human urine.
ORGANISM Polyomavirus BK
Unclassified.
REFERENCE 1 (bases 1 to 375)
AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P.,
Subramani,S. and Traavik,T.
TITLE At least two type of control regions can be found among naturally
occurring BK virus strains
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Johansen, 07-MAY-1990.
University of Tromso
P.O. Box 977
N-9001, Tromso, NORWAY
e-mail: terje.johansen@esuit.uit.no
FEATURES from to/span description
orgrpl 97 119 polyomavirus origin of replication
site 1 375 regulatory region
site 143 210 P block
site 211 249 Q block
site 250 302 R block
BASE COUNT 123 a 86 c 95 g 71 t
ORIGIN
1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt
301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta
361 aactggacaa aggcc
//
LOCUS PLYORIB 424 bp ds-DNA VRL 17-JUL-1990
DEFINITION Human polyomavirus BK (strain UT) origin of replication.
ACCESSION M34049
KEYWORDS origin of replication.
SOURCE Human polyomavirus BK (strain UT) DNA, from human urine.
ORGANISM Polyomavirus BK
Unclassified.
REFERENCE 1 (bases 1 to 424)
AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P.,
Subramani,S. and Traavik,T.
TITLE At least two type of control regions can be found among naturally
occurring BK virus strains
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Johansen, 07-MAY-1990.
University of Tromso
P.O. Box 977
N-9001, Tromso, NORWAY
e-mail: terje.johansen@esuit.uit.no
FEATURES from to/span description
orgrpl 97 119 polyomavirus origin of replication
site 1 424 regulatory region
site 143 210 P block
site 211 249 Q block
site 250 261 R block
site 263 315 P block
site 316 349 Q block
site 350 361 R block
BASE COUNT 134 a 103 c 105 g 82 t
ORIGIN
1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa
301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc
361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa
421 ggcc
//
LOCUS ECOCLPPA 1236 bp ds-DNA BCT 17-JUL-1990
DEFINITION Escherichia coli ATP-dependent clp protease proteolytic component
(clpP) gene, complete cds.
ACCESSION J05534
KEYWORDS ATP-dependent protease.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1236)
AUTHORS Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J.,
Bowers,B. and Gottesman,S.
TITLE Sequence and structure of ClpP, the proteolytic component of the
ATP-dependent Clp protease of E.coli
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Gottesman, 23-APR-1990.
FEATURES from to/span description
pept 378 1001 ATP-dependent protease (clpP)
mRNA 279 > 1236 clpP mRNA
pept < 1 133 ORF
signal 244 249 -35 region
signal 267 272 -10 region
signal 165 193 transcription termination signal
binding 367 372 ribosome binding site (put.)
BASE COUNT 329 a 291 c 318 g 298 t
ORIGIN
1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga
61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa
121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc
181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg
241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac
301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt
361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg
421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt
481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta
541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc
601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc
661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt
721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga
781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg
841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat
901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg
961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg
1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt
1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc
1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg
1201 cgcaagctga ttgccggtcc atccgtgtat atctgc
//
LOCUS LACNISC 1222 bp ds-DNA BCT 17-JUL-1990
DEFINITION L.lactis insertion sequence IS981.
ACCESSION M33933
KEYWORDS insertion sequence.
SOURCE L.lactis (strain LM0230) DNA, clone pKM40.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 1222)
AUTHORS Polzin,K.M. and McKay,L.L.
TITLE Identification, DNA sequence and distribution of IS981, a new, high
copy number insertion sequence in the lactococci related to the
IS2/IS3 family of IS-elements
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.M.Polzin 27-APR-1990.
University of Minnesota
Dept. of Food Science
1334 Eckles Avenue, St. Paul, MN 55108
e-mail: kayla@molbio.cbs.umn.edu
FEATURES from to/span description
pept 58 318 ORF1
pept 330 1169 ORF2
binding 44 50 ribosomal binding site (put.)
rpt 1 40 5' inverted repeat
rpt 1183 1222 3' inverted repeat
signal 33 38 -10 region
signal 269 274 -35 region
signal 294 299 -10 region
BASE COUNT 442 a 233 c 206 g 341 t
ORIGIN 132 bp upstream of PvuII site.
1 gatgttatcc ttaaatctta gagtcactat tgtataattt agacaaagga caaaaacatg
61 caaaaacgct actcaaaaga atttaaagaa acccttatcg ccttctatca ttctggtcaa
121 tccgtcaccc agctgtctaa agaatatgac gtggcccctg caacaattta taaatggata
181 gacctctact ctaaatctaa tgaaagctcc gtctctaaag ctgattttct agaattaaaa
241 agacaactgg ctaaagttaa ggaagaacga gacatcttaa aaaaagtatt gaccatattc
301 gccgagaaaa agaagtgagt gctgcggata tggctcaaac catacaaact ttagcactca
361 atgtcagact aagctgtcaa ctccttgatg ttcctgaatc aagttattat gaacggatta
421 accgacaccc atctaaaact caattaagga gacaatacct gtcactcaaa atttctcaac
481 tcttcaatgc taaccgagga atctatggtg ctcctaaaat tcatcatctt ctacttaaac
541 aaggggaaaa agtcgggtta aaactggtac agaagctaat gaagcaactt caactcaagt
601 ctgtagtcat taagaaattt aagcctggat actcactaag tgatcacatc aatcgaaaaa
661 atctcataca gactgaacct acaaagaaaa ataaggtttg gtcaaccgac attacttata
721 ttcctactca acaaggatgg gcttatctct caaccattat ggatcgttat actaaaaaag
781 tcattgcttg ggatttgggc aagcgaatga ctgtagaatt agtgcaaaga actttaaata
841 aggccattaa atcacaagac tatccagaag ctgttattct tcattctgac caaggaagcc
901 agtatacgag tctagagtat gaagagttgc ttaagtatta tgggatgact cactctttca
961 gtcgaagggg atacccttat cataatgcca gtcttgaatc ttggcatgga catttaaaaa
1021 gagagtgggt gtaccaattt aaatataaga actttgaaga agcctatcag agtattttct
1081 ggtacatcga agccttttat aattcaaaac gaatccatca aagtttaggg tatcttacac
1141 ctaatcaatt tgaaaaggta agtgcttaaa ataaatagat taaaattcta cgtttgttac
1201 tctaaaaact tgacttaacg tc
//
LOCUS RATSHIIA 3064 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Rat K+ channel protein (KShIIA) mRNA, complete cds.
ACCESSION M34052
KEYWORDS K+ channel protein.
SOURCE Rat brain, cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3064)
AUTHORS McCormack,T., Vega-Saenz de Miera,E.V. and Bernardo,R.
TITLE Molecular cloning of a member of a novel class of K+ channel genes
in mammals
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Bernardo, 07-MAY-1990.
FEATURES from to/span description
pept 80 1771 K+ channel protein (KShIIa)
BASE COUNT 777 a 718 c 729 g 840 t
ORIGIN
1 gccacggtga acatctggcc cacgagagct ttaacttggt gctgtgttcg ccttccctag
61 tcatgtctga gccacagaga tgggcaagat cgagaacaac gagagggtga tcctcaatgt
121 cggaggcacc aggcacgaaa cctaccgcag cactctcaag acccttcctg gaactcgcct
181 ggcccttctc gcctcctctg aacctcaggg cgactgcctg actgctgcgg gtgacaagct
241 gcagccgctg ccccctccgc tgtctccacc gccgcgaccg cctcccttgt cccctgtccc
301 cagcggctgc ttcgagggcg gcgcaggcaa ctgcagttcg cacggtggca atggcagcga
361 ccaccctggg ggaggccgcg aattcttctt cgatcgccac ccaggagtct tcgcctatgt
421 gctcaactac taccgcacgg gcaagctgca ctgccccgcc gacgtgtgtg gaccgctctt
481 cgaggaagag ctggcattct ggggcatcga tgagaccgac gtggagccct gctgctggat
541 gacctacagg cagcaccggg acgcggagga ggccctggat atcttcgaga cacccgacct
601 catcggaggc gaccctggtg atgatgagga cctagggggc aagagactgg gcattgagga
661 tgctgcgggg ctgggaggac ccgatggcaa gtctggccgc tggaggaagc tgcagcctcg
721 catgtgggct ctctttgagg acccctattc atccagagcc gctaggttta ttgcttttgc
781 ttctctgttc ttcattttgg tttccatcac aaccttttgc ctggagacac acgaagcttt
841 caatattgtt aaaaacaaga cagagccagt catcaacggc accagcgctg ttctccagta
901 tgaaatcgaa acggatcctg ccttgacata tgtggaagga gtgtgtgtgg tgtggtttac
961 ttttgaattt ttagtccgta ttgttttctc gcccaataaa cttgagttca tcaaaaatct
1021 attgaacatc attgactttg tggccatcct ccccttctac ttagaggtgg gactcagcgg
1081 gctgtcttcc aaagcggcta aagatgtgct cggctttctc agggtggtta ggtttgtgag
1141 gatcctgaga atcttcaagc ttacccgcca tttcgtaggt ctgagagtgc tcggacacac
1201 tcttcgtgcg agcaccaatg aatttttgtt gctgatcatc tttctggctc tgggagtttt
1261 gatattcgct acgatgatct actacgctga gcgagtaggg gctcaaccta atgatccctc
1321 agcgagtgag cacacacagt tcaaaaacat ccccattggt ttctggtggg ctgtggtgac
1381 catgactacc ttaggctatg gggatatgta cccccaaaca tggtcaggga tgttggtggg
1441 ggccttgtgt gctctggctg gagtgctgac catagctatg cctgtgcccg tcattgtcaa
1501 caattttggg atgtactact ccttggcaat ggcgaagcag aaacttccaa gaaaaagaaa
1561 gaagcacatt cctcctgccc ctctggcaag ctcacctaca ttttgcaaga cagaattaaa
1621 catggcttgt aacagtaccc agagtgacac atgtctgggc aaagaaaacc ggcttctgga
1681 acataacaga tcagtgttat caggtgacga cagtacagga agtgagccgc cattatcacc
1741 ttccggaaag gctccccatc agacgctcta gtaccagaga caaaaacaga agaggggaaa
1801 catgtttcct gttgacgaca ggtgattaca cgtgcgcttc tgatggagga atcaggaaag
1861 ataactgcaa agatgttgtc attactggtt acacgcaagc cgaggccaga tctcttactt
1921 aatgacttgg gaaaggcaca aaacatgaaa gaaagtgttg tacagaattt atcatggatt
1981 tttgcctgct gaaaatggga cattgagatt tagccattca aggattgtac tggaaacttc
2041 tgctactgaa tgtgaccgga tgtgaccggt ttgtgttcag aagagttccc gcattctctg
2101 aggcatttaa agcttgtaaa gaaatggtgg ctggaactga aatgggcttt gccaaggaaa
2161 tattctgctt gtcgacagac cagagtccac aaaacaactg taaataccaa cgtgtgcatg
2221 ggtccaccct tctcacagtc tcatagcaca gggagccaca ttcattgtca gcatctgtga
2281 agcttcaaga aatgcccaca aggctctgaa tggctcctct cgggctcatg gtggttttgc
2341 tgcaaattac agagggcagt gaatccaatg tagtataggt ctgttttcat ttcttttatt
2401 tgtacattac gtgttgtatc tacaaacacc gattgcctca ttttcttgca aaccacttat
2461 atatccgctt gtgaactatg ttgtaaactt ttagtgtcta catagaaatc taaccatttc
2521 ttagtcatgt attcatttag ttctggaacg ctgttactat ctagaagaca gtcatcacca
2581 gtattcatcg tataagtctt ttgtcattgt tacctcaatt ataaatatta aaaatataaa
2641 ttctggcaat gagaatattt ttttattaaa tgatcaagga aaatgtcagt atatagtaga
2701 atattatcaa attatatcct aaaatgtcta ttttgcataa aagagatatt cttcaatcga
2761 ttcctttttt tcgtgagctt tgtggctaat gaagcgtgtg tttgtcttca tcactgctgt
2821 agttgaaact gtgtaagagt ttttcatctt gcttaatcaa tatttccaga atctcttagt
2881 tcccctggga ttctgaatat actctatagc ctaattataa acccctgtat cgtgtacctt
2941 ttgtgaccat ctcaaggtgc atgcctaatc gttgtgataa gccaatggac atgttactaa
3001 ctgaaatgaa gaataaaagt caaacgatct ggggataaac ttgaatccta tctgattaaa
3061 tgat
//
LOCUS HUMROS1 7375 bp ss-mRNA PRI 17-JUL-1990
DEFINITION Human transmembrane tyrosine-specific protein kinase (ROS1) mRNA,
complete cds.
ACCESSION M34353
KEYWORDS oncogene; transmembrane tyrosine-specific protein kinase.
SOURCE Human glioblastoma cell line SW-1088, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 7375)
AUTHORS Birchmeier,C., O'Neil,K., Riggs,M. and Wigler,M.
TITLE Characterization of ROS1 cDNA from a human glioblastoma cell line
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4799-4803 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.H.Wigler, 15-MAY-1990.
FEATURES from to/span description
pept 207 7250 transmembrane tyrosine-specific protein kinase
precursor /hgml_locus_uid="LN0124W"
/nomgen="ROS1" /map="6q21-q22"
sigp 207 314 transmembrane tyrosine-specific protein kinase
signal peptide
matp 208 7247 transmembrane tyrosine-specific protein kinase
BASE COUNT 2203 a 1496 c 1605 g 2071 t
ORIGIN
1 ccgcattcaa gctttcaagc attcaaaggt ctaaatgaaa aaggctaagt attatttcaa
61 aaggcaagta tatcctaata tagcaaaaca aacaaagcaa aatccatcag ctactcctcc
121 aattgaagtg atgaagccca aataattcat atagcaaaat ggagaaaatt agaccggcca
181 tctaaaaatc tgccattggt gaagtgatga agaacattta ctgtcttatt ccgaagcttg
241 tcaattttgc aactcttggc tgcctatgga tttctgtggt gcagtgtaca gttttaaata
301 gctgcctaaa gtcgtgtgta actaatctgg gccagcagct tgaccttggc acaccacata
361 atctgagtga accgtgtatc caaggatgtc acttttggaa ctctgtagat cagaaaaact
421 gtgctttaaa gtgtcgggag tcgtgtgagg ttggctgtag cagcgcggaa ggtgcatatg
481 aagaggaagt actggaaaat gcagacctac caactgctcc ctttgcttct tccattggaa
541 gccacaatat gacattacga tggaaatctg caaacttctc tggagtaaaa tacatcattc
601 agtggaaata tgcacaactt ctgggaagct ggacttatac taagactgtg tccagaccgt
661 cctatgtggt caagcccctg caccccttca ctgagtacat tttccgagtg gtttggatct
721 tcacagcgca gctgcagctc tactcccctc caagtcccag ttacaggact catcctcatg
781 gagttcctga aactgcacct ttgattagga atattgagag ctcaagtccc gacactgtgg
841 aagtcagctg ggatccacct caattcccag gtggacctat tttgggttat aacttaaggc
901 tgatcagcaa aaatcaaaaa ttagatgcag ggacacagag aaccagtttc cagttttact
961 ccactttacc aaatactatc tacaggtttt ctattgcagc agtaaatgaa gttggtgagg
1021 gtccagaagc agaatctagt attaccactt catcttcagc agttcaacaa gaggaacagt
1081 ggctcttttt atccagaaaa acttctctaa gaaagagatc tttaaaacat ttagtagatg
1141 aagcacattg ccttcggttg gatgctatat accataatat tacaggaata tctgttgatg
1201 tccaccagca aattgtttat ttctctgaag gaactctcat atgggcgaag aaggctgcca
1261 acatgtctga tgtatctgac ctgagaattt tttacagagg ttcaggatta atttcttcta
1321 tctccataga ttggctttat caaagaatgt atttcatcat ggatgaactg gtatgtgtct
1381 gtgatttaga gaactgctca aacatcgagg aaattactcc accctctatt agtgcacctc
1441 aaaaaattgt ggctgattca tacaatgggt atgtctttta cctcctgaga gatggcattt
1501 atagagcaga ccttcctgta ccatctggcc ggtgtgcaga agctgtgcgt attgtggaga
1561 gttgcacgtt aaaggacttt gcaatcaagc cacaagccaa gcgaatcatt tacttcaatg
1621 acactgccca agtcttcatg tcaacatttc tggatggctc tgcttcccat ctcatcctac
1681 ctcgcatccc ctttgctgat gtgaaaagtt ttgcttgtga aaacaatgac tttcttgtca
1741 cagatggcaa ggtcattttc caacaggatg ctttgtcttt taatgaattc atcgtgggat
1801 gtgacctgag tcacatagaa gaatttgggt ttggtaactt ggtcatcttt ggctcatcct
1861 cccagctgca ccctctgcca ggccgcccgc aggagctttc ggtgctgttt ggctctcacc
1921 aggctcttgt tcaatggaag cctcctgccc ttgccatagg agccaatgtc atcctgatca
1981 gtgatattat tgaactcttt gaattaggcc cttctgcctg gcagaactgg acctatgagg
2041 tgaaagtatc cacccaagac cctcctgaag tcactcatat tttcttgaac ataagtggaa
2101 ccatgctgaa tgtacctgag ctgcagagtg ctatgaaata caaggtttct gtgagagcaa
2161 gttctccaaa gaggccaggc ccctggtcag agccctcagt gggtactacc ctggtgccag
2221 ctagtgaacc accatttatc atggctgtga aagaagatgg gctttggagt aaaccattaa
2281 atagctttgg cccaggagag ttcttatcct ctgatatagg aaatgtgtca gacatggatt
2341 ggtataacaa cagcctctac tacagtgaca cgaaaggcga cgtttttgtg tggctgctga
2401 atgggacgga tatctcagag aattatcacc tacccagcat tgcaggagca ggggctttag
2461 cttttgagtg gctgggtcac tttctctact gggctggaaa gacatatgtg atacaaaggc
2521 agtctgtgtt gacgggacac acagacattg ttacccacgt gaagctattg gtgaatgaca
2581 tggtggtgga ttcagttggt ggatatctct actggaccac actctattca gtggaaagca
2641 ccagactaaa tggggaaagt tcccttgtac tacagacaca gccttggttt tctgggaaaa
2701 aggtaattgc tctaacttta gacctcagtg atgggctcct gtattggttg gttcaagaca
2761 gtcaatgtat tcacctgtac acagctgttc ttcggggaca gagcactggg gataccacca
2821 tcacagaatt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactata
2881 gtggtcggct gttctggatc aatggcttta ggattatcac aactcaagaa ataggtcaga
2941 aaaccagtgt ctctgttttg gaaccagcca gatttaatca gttcacaatt attcagacat
3001 cccttaagcc cctgccaggg aacttttcct ttacccctaa ggttattcca gattctgttc
3061 aagagtcttc atttaggatt gaaggaaatg cttcaagttt tcaaatcctg tggaatggtc
3121 cccctgcggt agactggggt gtagttttct acagtgtaga atttagtgct cattctaagt
3181 tcttggctag tgaacaacac tctttacctg tatttactgt ggaaggactg gaaccttatg
3241 ccttatttaa tctttctgtc actccttata cctactgggg aaagggcccc aaaacatctc
3301 tgtcacttcg agcacctgaa acagttccat cagcaccaga gaaccccaga atatttatat
3361 taccaagtgg aaaatgctgc aacaagaatg aagttgtggt ggaatttagg tggaacaaac
3421 ctaagcatga aaatggggtg ttaacaaaat ttgaaatttt ctacaatata tccaatcaaa
3481 gtattacaaa caaaacatgt gaagactgga ttgctgtcaa tgtcactccc tcagtgatgt
3541 cttttcaact tgaaggcatg agtcccagat gctttattgc cttccaggtt agggccttta
3601 catctaaggg gccaggacca tatgctgacg ttgtaaagtc tacaacatca gaaatcaacc
3661 catttcctca cctcataact cttcttggta acaagatagt ttttttagat atggatcaaa
3721 atcaagttgt gtggacgttt tcagcagaaa gagttatcag tgccgtttgc tacacagctg
3781 ataatgagat gggatattat gctgaagggg actcactctt tcttctgcac ttgcacaatc
3841 gctctagctc tgagcttttc caagattcac tggtttttga tatcacagtt attacaattg
3901 actggatttc aaggcacctc tactttgcac tgaaagaatc acaaaatgga atgcaagtat
3961 ttgatgttga tcttgaacac aaggtgaaat atcccagaga ggtgaagatt cacaatagga
4021 attcaacaat aatttctttt tctgtatatc ctcttttaag tcgcttgtat tggacagaag
4081 tttccaattt tggctaccag atgttctact acagtattat cagtcacacc ttgcaccgaa
4141 ttctgcaacc cacagctaca aaccaacaaa acaaaaggaa tcaatgttct tgtaatgtga
4201 ctgaatttga gttaagtgga gcaatggcta ttgatacctc taacctagag aaaccattga
4261 tatactttgc caaagcacaa gagatctggg caatggatct ggaaggctgt cagtgttgga
4321 gagttatcac agtacctgct atgctcgcag gaaaaaccct tgttagctta actgtggatg
4381 gagatcttat atactggatc atcacagcaa aggacagcac acagatttat caggcaaaga
4441 aaggaaatgg ggccatcgtt tcccaggtga aggccctaag gagtaggcat atcttggctt
4501 acagttcagt tatgcagcct tttccagata aagcgtttct gtctctagct tcagacactg
4561 tggaaccaac tatacttaat gccactaaca ctagcctcac aatcagatta cctctggcca
4621 agacaaacct cacatggtat ggcatcacca gccctactcc aacatacctg gtttattatg
4681 cagaagttaa tgacaggaaa aacagctctg acttgaaata tagaattctg gaatttcagg
4741 acagtatagc tcttattgaa gatttacaac cattttcaac atacatgata cagatagctg
4801 taaaaaatta ttattcagat cctttggaac atttaccacc aggaaaagag atttggggaa
4861 aaactaaaaa tggagtacca gaggcagtgc agctcattaa tacaactgtg cggtcagaca
4921 ccagcctcat tatatcttgg agagaatctc acaagccaaa tggacctaaa gaatcagtcc
4981 gttatcagtt ggcaatctca cacctggccc taattcctga aactcctcta agacaaagtg
5041 aatttccaaa tggaaggctc actctccttg ttactagact gtctggtgga aatatttatg
5101 tgttaaaggt tcttgcctgc cactctgagg aaatgtggtg tacagagagt catcctgtca
5161 ctgtggaaat gtttaacaca ccagagaaac cttattcctt ggttccagag aacactagtt
5221 tgcaatttaa ttggaaggct ccattgaatg ttaacctcat cagattttgg gttgagctac
5281 agaagtggaa atacaatgag ttttaccatg ttaaaacttc atgcagccaa ggtcctgctt
5341 atgtctgtaa tatcacaaat ctacaacctt atacttcata taatgtcaga gtagtggtgg
5401 tttataagac gggagaaaat agcacctcac ttccagaaag ctttaagaca aaagctggag
5461 tcccaaataa accaggcatt cccaaattac tagaagggag taaaaattca atacagtggg
5521 agaaagctga agataatgga tgtagaatta catactatat ccttgagata agaaagagca
5581 cttcaaataa tttacagaac cagaatttaa ggtggaagat gacatttaat ggatcctgca
5641 gtagtgtttg cacatggaag tccaaaaacc tgaaaggaat atttcagttc agagtagtag
5701 ctgcaaataa tctagggttt ggtgaatata gtggaatcag tgagaatatt atattagttg
5761 gagatgattt ttggatacca gaaacaagtt tcatacttac tattatagtt ggaatatttc
5821 tggttgttac aatcccactg acctttgtct ggcatagaag attaaagaat caaaaaagtg
5881 ccaaggaagg ggtgacagtg cttataaacg aagacaaaga gttggctgag ctgcgaggtc
5941 tggcagccgg agtaggcctg gctaatgcct gctatgcaat acatactctt ccaacccaag
6001 aggagattga aaatcttcct gccttccctc gggaaaaact gactctgcgt ctcttgctgg
6061 gaagtggagc ctttggagaa gtgtatgaag gaacagcagt ggacatctta ggagttggaa
6121 gtggagaaat caaagtagca gtgaagactt tgaagaaggg ttccacagac caggagaaga
6181 ttgaattcct gaaggaggca catctgatga gcaaatttaa tcatcccaac attctgaagc
6241 agcttggagt ttgtctgctg aatgaacccc aatacattat cctggaactg atggagggag
6301 gagaccttct tacttatttg cgtaaagccc ggatggcaac gttttatggt cctttactca
6361 ccttggttga ccttgtagac ctgtgtgtag atatttcaaa aggctgtgtc tacttggaac
6421 ggatgcattt cattcacagg gatctggcag ctcgaaattg ccttgtttcc gtgaaagact
6481 ataccagtcc acggatagtg aagattggag actttggact cgccagagac atctataaaa
6541 atgattacta tagaaagaga ggggaaggcc tgctcccagt tcggtggatg gctccagaaa
6601 gtttgatgga tggaatcttc actactcaat ctgatgtatg gtcttttgga attctgattt
6661 gggagatttt aactcttggt catcagcctt atccagctca ttccaacctt gatgtgttaa
6721 actatgtgca aacaggaggg agactggagc caccaagaaa ttgtcctgat gatctgtgga
6781 atttaatgac ccagtgctgg gctcaagaac ccgaccaaag acctactttt catagaattc
6841 agaaccaact tcagttattc agaaattttt tcttaaatag catttatcag tgcagagatg
6901 aagcaaacaa cagtggagtc ataaatgaaa gctttgaagg tgaagatggc gatgtgattt
6961 gtttgaattc agatgacatt atgccagttg ttttaatgga aacgaagaac cgagaagggt
7021 taaactatat ggtacttgct acagaatgtg gccaaggtga agaaaagtct gagggtcctc
7081 taggctccca ggaatctgaa tcttgtggtc tgaggaaaga agagaaggaa ccacatgcag
7141 acaaagattt ctgccaagaa aaacaagtgg cttactgccc ttctggcaag cctgaaggcc
7201 tgaactatgc ctgtctcact cacagtggat atggagatgg gtctgattaa tagcgttgtt
7261 tgggaaatag agagttgaga taaacactct cattcagtag ttactgaaag aaaactctgc
7321 tagaatgata aatgtcatgg tggtctataa ctccaaataa acaatgcaac gttcc
//
LOCUS MBOPIVML 969 bp ds-DNA BCT 17-JUL-1990
DEFINITION M.lacunata pilin gene inverting protein (PivML) gene, complete cds.
ACCESSION M34367
KEYWORDS pilin gene inverting protein.
SOURCE M.lacunata (ATCC 17956) DNA, clone pMxL1.
ORGANISM Moraxella lacunata
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 969)
AUTHORS Marrs,C.F., Rozsa,F.W., Hackel,M., Stevens,S.P. and Glasgow,A.C.
TITLE Identification, cloning and sequencing of Piv, a new gene involved
in inverting the pilin gene of M.lacunata
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.F.Marrs, 17-MAY-1990.
FEATURES from to/span description
pept 1 969 pilin gene inverting protein (PivML)
BASE COUNT 318 a 236 c 165 g 250 t
ORIGIN
1 atgtctaaaa cttacattgg gattgatatc gccaaaaaca catttgatgc ctgctttatt
61 acccataaca cttggcaaaa ctgtacattc acaaataacc aacagggttt tatagagcta
121 acattgtgga ttcaagcaca tcattataac acatctacct tacatttgat tattgaggca
181 acgggggctt attgggaaaa attagctcat tgggcaatct ctcatcatca taaggtaagt
241 attgtcaacc ccttatatat tcatgcatac gccaaaagcc taggcatacg caccaagaca
301 gataagcagg atgccattct cttggcacgc tatggtgcaa aagaaaatcc acctttatgg
361 cagcctaaat cagacaacga aataaaactg actgcccttc taaaacaacg tgaacaccac
421 aaacggcaac tcatcaaaga acgcactcga caggaagctc tgtctattta tgtgaaatcg
481 tatactgatg acaacattcg tcattggagt gacagcatca cacagctcga tcatcaaatt
541 tggcaactca tcaattgcac gcccgagcta aattaccgtg ccagcctact tgccacaata
601 cctggaattg gcaaaaagac actgccacat ctactggttg tcattggaga tggttcatca
661 tttcaaagtg ccaaacatct agcaagctat gctgggcttg cccctcgtca tcatcaatca
721 ggcattagta ttcacaagca gtcttccatt ggatttagtg gacaaaaaga actacgttca
781 gcccttttta tgcccgctgt catagtcagc tttggtcgat atcccgcttt tcaaaagttc
841 gtcaaacgca tggaacaaaa aggcaaaacc aaaaaacaaa ttattatcgc catcatgcgc
901 aaattactca ccatcagcta tgccgttatt cgccaaaatc gcccttttga caaacgcatt
961 catgaataa
//
LOCUS STYMKAD 915 bp ds-DNA BCT 17-JUL-1990
DEFINITION Plasmid pEX102 mouse killing agent D (mkaD) gene, complete cds.
ACCESSION M34355
KEYWORDS mouse killing agent D.
SOURCE Plasmid pEX102 (from Salmonella typhimurium, strain TML R66) DNA.
ORGANISM Plasmid pEX102
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 915)
AUTHORS Taira,S. and Rhen,M.
TITLE Nucleotide sequence of mkaD, a virulence associated gene of
Salmonella typhimurium containing variable and constant region
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.P.Taira, 16-MAY-1990.
FEATURES from to/span description
pept 132 857 mouse killing agent D (mkaD)
BASE COUNT 273 a 181 c 213 g 248 t
ORIGIN
1 ggaccgaaat atcatttttt cgcctgataa aaaatgaggt tttctggata actaatcgtt
61 ttattaaaaa aaaactgaga atttatatct aataatatgg cgatatatcc atatcgcaaa
121 ggagatttcc catgcccata aataggccta atctaaatct aaacatccct cctttgaata
181 ttgtagctgc ttatgatggg gcggaaatac catctacaaa taagcacctg aaaaataatt
241 tcaactcctt gcacaaccaa atgcggaaga tgccggtatc ccactttaaa gaggcgctgg
301 atgtgcctga ctattcaggg atgcgccaga gtggtttctt tgctatgagc caaggttttc
361 agctgaataa ccatggttac gatgttttca tccatgctcg tcgagaatca cctcagtctc
421 agggcaaatt tgccggtgac aagttccaca tcagtgtgct cagggatatg gtgccacaag
481 catttcaagc gctgtccgga ttgctgtttt cagaggacag tccggtagat aagtggaaag
541 tgaccgatat ggagaaggtc gttcaacaag cccgtgttag cctgggcgct cagttcacgt
601 tgtatataaa accagaccag gaaaattcgc agtacagtgc gtcgtttctc cacaagacac
661 ggcaatttat agagtgtctg gaatccagac tatccgaaaa tggggttatt tcaggacagt
721 gtcctgagtc agacgttcat cctgaaaatt ggaaatatct cagttatcgt aatgaactac
781 gaagtgggcg tgatggtggc gaaatgcaga gacaggcttt acgtgaggaa ccgttttatc
841 gtttgatgac agagtaagta tgggtttggg gagcaacgga acagtaaacg ccgttaaaca
901 gctattttaa atgct
//
LOCUS TOBEXT 3994 bp ds-DNA PLN 17-JUL-1990
DEFINITION N.plumbaginifolia extensin (ext) gene, complete cds.
ACCESSION M34371
KEYWORDS cell wall protein; extensin.
SOURCE N.plumbaginifolia leaf DNA.
ORGANISM Nicotiana plumbaginifolia
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 3994)
AUTHORS De Loose,M.R.R., Gheyson,G., Tire,C., Gielen,J., Villaroel,R.,
Genetello,C., Van Montagu,M., Depicker,A. and Inze,D.
TITLE The extensin signal peptide allows secretion of heterologous
proteins from protoplasts
JOURNAL Unpublished (1990)
STANDARD unannotated staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.R.De Loose, 18-MAY-1990.
Author address: M.R.R.De Loose
Laboratorium voor Genetika, Wetenschappen,
K.L.Ledeganckstraat, Gent,
Oost-Vlaanderen, BELGIUM
FEATURES from to/span description
pept 1590 2840 extensin precursor
sigp 1590 1652 extensin signal peptide (could end at 1667)
matp 1653 2837 extensin (could start at 1668)
mRNA 1560 3342 extensin mRNA
rpt 1740 1824 tandem 85 bp repeats
rpt 1740 1758 19 bp repeats
signal 1532 1537 TATA box
signal 3323 3329 poly-A signal
BASE COUNT 1264 a 992 c 550 g 1188 t
ORIGIN
1 cgcccgggga tcctcctgga tgccattttt cctttgcttt tggatacctt ttcgcaactt
61 tagtccattg cttgaacatg gttttttcta cctcttgtct gttcccatga tgataaacta
121 tgatagctaa cattttcagt aatattggat tgaattagat atactttgaa ttattgagct
181 acaaaattcg ttgaatattt tgccctatcc cccacccaaa agaaaataat tcataccttt
241 atttggtaat aaaagtgggt cacatgcaca gtttttctcg tcttctctag attaaactct
301 ttggaaatga cctactgaaa atactacaca taaaattccc cactcccctc caagttgggg
361 gagggtgggg gttggtgggt tttttttccc cttgaatcca ttcagccggc ttatgcctcc
421 cgaggtggtg ggaggtttaa tttgaaccct taccctatag ttaatatcga aatagataat
481 acaaaggagg ggaacataaa accaaaacct ccgacaaaac atcaaagtgt aggagttcat
541 gatatacgct tgattgaaag tgcacaatgg tctttgcata ctatcaaagt atcaattggt
601 tcttgaatta tatctcttaa tataatgtat tgtgtttaat ttctctacta tcccctattt
661 tataagacta aaagatcctg acatgcttct tgaacacatg tgaaggttag ttaactggtc
721 agaagtacac aagaattaat ttgtacacct atgtttctca ccagaatgcc cataaattac
781 atcaattctt caacatcaag agatcgaaaa acttaacaat tgttctaata gttgaactga
841 gtcgtccgat ccatatccga ttcttcacta gaagcattaa tcatatagag caatttaact
901 taatttacta tattggtaat catttacata gttaagttat aaaaccttgg agcgacacaa
961 ggatttacac taatcatgat tgaatattaa ctttactcaa tttatcaatg tctgatacaa
1021 gtgaattaat ttctctcttt gtgatttcgg tagaaaatta atgtcaagtt tcaaagtttt
1081 tctttttgaa attagtcata catgtgaata gaacattaat ttaagttaaa gaatcatata
1141 aaaaagttag ccgctttgat ttctgtaact gaaatcgtgc agagatgagg catccacact
1201 tgtttttcaa agcttcagta ctatttaaaa acaaagacag taaaaaggag actttttctt
1261 tgacattatt gcatcagaaa tagtatagct gccataatag tttattcctt tgcttagctt
1321 gcagcctcta tcaaacaaaa aaatacacca gctcaagtca atttgagccg acaacatgac
1381 aaaaccaaat caaatatgca tttttttttt tttttttttt tttttttttt tttttttttt
1441 tttacacttt ggtaggtgtt aagtaatcta gtgagacttt taccttcatt tatgaaaatc
1501 ttgaaaaggg taattgtcta attgaaagct atataaaggg gtcgaagtga agcttaagag
1561 gacaacaact tttctcattt gtttcaaaga tgggaaaaat ggcttctcta tttgccacat
1621 ttttagtggt tttagtgtca cttagcttag cttctgaaag ctcagcaaat tatcaatact
1681 catctccacc accacctaag aagccatacc acccttcacc aacaccatat tatcctgcac
1741 cagtttataa atctccacca ccaccaattc cggtttataa gtcaccacca ccacccaaga
1801 agccatacta ccctccacac accccagttt acaaatctcc accaccacca actccagttt
1861 acaagtcgcc accaccaccc aagaagccac actaccctcc acacacaccc gtttacaagt
1921 ctccaccacc acctactcca gtttacaagt caccaccatc acccaagaag ccacactatc
1981 ctccacacac tccggtttac aagtcgccac caccaccaac tcccgtttac aagtcgccac
2041 caccacccaa gaagccacac tatccaccac acaccccggt ttacaagtcg ccaccaccac
2101 caactcccgt ttacaagtcg ccaccaccac caaagaagcc acactatcct ccacacacac
2161 cagtttacaa gtcaccacca ccacccaaga agccatacta tccaccacac accccagttt
2221 acaagtcgcc accaccacca actccagttt acaagtcacc accaccacct aagaagccat
2281 accacccttc accaacaccg tatcatcctt caccagttta caagtcccct ccaccaccaa
2341 ctccagttta caagtcacca ccaccaccta agaagccata ccacccttca ccaacaccgt
2401 atcatccatc accagtttac aagtcccctc caccaccaac tccagtttac aagtcaccac
2461 caccacctaa gaagccatac cacccttcac caacaccgta tcatccatca ccagtttaca
2521 agtcccctcc accaccaact ccagtttaca agtcaccacc accacctaag aagccatacc
2581 acccttcacc aacaccgtat catcctgcac cagtttacaa gtccccacca ccaccaactc
2641 cagtttacaa gtcaccaccg ccaccagtga agccatacca tccttcacca acaccatacc
2701 atcctgcacc agtttataag tctccaccac caccaactcc tgtttacaag tcacctccac
2761 caccaactcc agtctacaag tctccaccac cacaccaccc ctatgtttac gcttctcctc
2821 ctcctcccta ccattactaa gaagtgacat cacaaagctg aaggtaatta aggcaaatta
2881 attaaaacat tttcaaaatg atatgtcagg aatgcaaaga aaattgcata actttgttta
2941 tgcaaaacat taatataata taaagaactg cgctctctgc ttattaacta tttttttagc
3001 ttcatttatt agttactagc tgttcagaat tttcagtatc ttttgatatt acaaagaacc
3061 taatcagaca atgtatattc ttatgcagga aaagcagaat gttgagctaa aagaaaggct
3121 ttttccattt tcgagaggca acgagaagaa aagaagaaga aaaaatagag taaataataa
3181 gccccacggg aggcgaagtt cttttgtagc ttcatgttgt ctaagctatt gatattgttt
3241 gtaccctata ttttatttct ctgtctttgt gtatgttttg ttcaatttcg atctccttgc
3301 aaaatgcaga gattatgagc tgaataaagt aagttaatat tattatacgt gttaattttc
3361 tactcctctc tctagccagc cttttgtttt ctctttttct tatttcattt tctttatatc
3421 aaaccatttt aggagagggc caaggagtga tccagtaaaa cacgaagatt agaagaaact
3481 tcccttcttt ttttttatct gaaaaaaatt taaagtcgag atttatctct ttttgtaatg
3541 gaatcatttc cacagttatg actattttca aattgatatc gccagaccta acaaattaaa
3601 ttactccttc cgctcatctg gccgtacact tctgctccat ctggccgtac agttctgctc
3661 catctggcag tgtacggaga atggataact taatattcag tatgaatttg gacataaaat
3721 cttctaagat tgctagtgaa ttttgtgagc gttattgtgt taaaacaaaa gacagaaaat
3781 tattagaaga attaaagaaa tatgaaaata gtatagaatc agaaatattt cgagtccaca
3841 agttttttgt gtgttcttaa ggaatgtttt aacccctcac acgttgccaa ggtaatggat
3901 taaatcctcc caagataaaa cggaataaac cttcctgcaa cagtggcaat acaaactgca
3961 ggtcgactct agggatcccc gggcgagcga attc
//
LOCUS MUSIGHABN 336 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34574
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 336)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 336 Ig heavy-chain V-D-J region (AA at 1)
recomb 276 277 V-region DNA end/D-region DNA start
recomb 285 286 D-region DNA end/J-region DNA start
BASE COUNT 86 a 76 c 95 g 79 t
ORIGIN Chromosome 12.
1 actggaggag gcttggtgca acctgggggg tcacggggac tctcttgtga aggctcaggg
61 ttcactttta gtggcttctg gatgagctgg gttcgacaga cacctgggaa gaccctggag
121 tggattggag acattaattc tgatggcagt gcaataaact acgcaccatc cataaaggat
181 cgattcacta tcttcagaga caatgacaag agcaccctgt acctgcagat gagcaatgtg
241 cgatcggagg acacagccac gtatttctgt atgagatata gtaactactg gtacttcgat
301 gtctggggca caaggaccac ggtcaccgtc tcctca
//
LOCUS MUSIGHABO 357 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34575
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH9.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 357)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 357 Ig heavy-chain V-D-J-region (AA at 1)
recomb 294 295 V-region DNA end/D-region DNA start
recomb 306 307 D-region DNA end/J-region DNA start
BASE COUNT 89 a 77 c 106 g 85 t
ORIGIN Chromosome 12.
1 gaagtgcagc tgttggagac tggaggaggc ttggtgcaac ctggggggtc acggggactc
61 tcttgtgaag gctcagggtt cacttttagt ggcttctgga tgagctgggt tcgacagaca
121 cctgggaaga ccctggagtg gattggagac attaattctg atggcagtgc aataaactac
181 gcagcatcca taaaggatcg attcactatc ttcagagaca atgacaagag caccctgtac
241 ctgcagatga gcaatgtgcg atcggaggac acagccacgt atttctgtat gagatatgat
301 ggttactact ggtacttcga tgtctggggc acaaggacca cggtcaccgt ctcctca
//
LOCUS MUSIGHABP 354 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34576
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH34.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 354 Ig heavy-chain V-D-J-region (AA at 1)
recomb 294 295 V-region DNA end/D-region DNA start
recomb 303 304 D-region DNA end/J-region DNA start
BASE COUNT 88 a 78 c 104 g 83 t 1 others
ORIGIN Chromosome 12.
1 gaantgcagc tgttggagac tggaggaggc ttggtgcaac ctggggggtc acggggactc
61 tcttgtgaag gctcagggtt cacttttagt ggcttctgga tgagctgggt tcgacagaca
121 cctgggaaga ccctggagtg gattggagac attaattctg atggcagtgc aataaactac
181 gcaccatcca taaaggatcg attcactatc ttcagagaca atgacaagag caccctgtac
241 ctgcagatga gcaatgtgcg atcggaggac acagccacgt atttctgtat gagatatggt
301 aactactggt acttcgatgt ctggggcaca gggaccacgg tcaccgtctc ctca
//
LOCUS MUSIGHABQ 366 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34577
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 366)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 366 Ig heavy chain V-D-J-region (AA at 1)
recomb 300 301 V-region DNA end/D-region DNA start
recomb 318 319 D-region DNA end/J-region DNA start
BASE COUNT 89 a 84 c 97 g 96 t
ORIGIN Chromosome 12.
1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagtctc
61 tcctgtgcag cttctggatt caccttcact gattactaca tgagctgggt ccgccagcct
121 ccagggaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca
181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc
241 ctctatcttc aaatgaatgc cctgagagct gaggacagtg ccacttatta ctgtgcaaga
301 tattactacg gtagtagcta tgctatggac tactggggtc aaggaacctc agtcaccgtc
361 tcctca
//
LOCUS MUSIGHABR 360 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34578
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 360)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 360 Ig heavy chain V-D-J-region (AA at 1)
recomb 300 301 V-region DNA end/D-region DNA start
recomb 315 316 D-region DNA end/J-region DNA start
BASE COUNT 85 a 87 c 93 g 95 t
ORIGIN Chromosome 12.
1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagtctc
61 tcctgtgcag cttctggatt caccttcact gattactaca tgagctgggt ccgccagcct
121 ccagggaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca
181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc
241 ctctatcttc aaatgaatgc cctgagagct gaggacagtg ccacttatta ctgtgcaaga
301 tccctctatg atggttactt tgactactgg ggccaaggca ccactctcac agtctcctca
//
LOCUS MUSIGHABS 360 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34579
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 360)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 360 Ig heavy chain V-D-J-region (AA at 1)
recomb 294 295 V-region DNA end/D-region DNA start
recomb 309 310 D-region DNA end/J-region DNA start
BASE COUNT 90 a 93 c 93 g 83 t 1 others
ORIGIN Chromosome 12.
1 caggtccaac tgcagcngcc tggggctgag ctggtgaggc ctgggtcttc agtgaagctg
61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcattgggt gaagcagagg
121 cctatacaag gccttgaatg gattggtaac attgaccctt ctgatagtga aactcactac
181 aatcaaaagt tcaaggacaa ggccacattg actgtagaca aatcctccag cacagcctac
241 atgcagctca gcagcctgac atctgaggac tctgcggtct attactgtgc aagagggatt
301 tatagtaact actggtactt cgatgtctgg ggcacaggga ccacggtcac cgtctcctca
//
LOCUS MUSIGHABT 357 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34580
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH31.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 357)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 357 Ig heavy chain V-D-J-region (AA at 1)
recomb 294 295 V-region DNA end/D-region DNA start
recomb 306 307 D-region DNA end/J-region DNA start
BASE COUNT 90 a 94 c 89 g 82 t 2 others
ORIGIN Chromosome 12.
1 caggtccaac tgcancancc tggggctgag ctggtgaggc ctgggtcttc agtgaagctg
61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcattgggt gaagcagagg
121 cctatacaag gccttgaatg gattggtaac attgaccctt ctgatagtga aactcactac
181 aatcaaaagt tcaaggacaa ggccacattg actgtagaca aatcctccag cacagcctac
241 atgcagctca gcagcctgac atctgaggac tctgcggtct attactgtgc aagatactat
301 ggtaactact ggtacttcga tgtctggggc acagggacca cggtcaccgt ctcctca
//
LOCUS MUSIGHABU 366 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34581
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH12.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 366)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 366 Ig heavy chain V-D-J-region (AA at 1)
recomb 294 295 V-region DNA end/D-region DNA start
recomb 321 322 D-region DNA end/J-region DNA start
BASE COUNT 95 a 93 c 96 g 81 t 1 others
ORIGIN Chromosome 12.
1 caggtccaac tgcagcagcc tgggactgaa ctggtnaagc ctggggcttc agtgaagctg
61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg
121 cctggacaag gccttgagtg gattggaaat attaatccta gcaatggtgg tactaactac
181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aatcctccag cacagcctac
241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagagattac
301 tacggtagta gctgggggta ctactttgac tactggggcc aaggcaccac tctcacagtc
361 tcctca
//
LOCUS MUSIGHABV 321 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34582
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH27.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 321 Ig heavy chain V-D-J-region (AA at 1)
recomb 258 259 V-region DNA end/D-region DNA start
recomb 270 271 D-region DNA end/J-region DNA start
BASE COUNT 81 a 90 c 63 g 81 t 6 others
ORIGIN Chromosome 12.
1 aaaccctnac agtnactctt nnttacctgc tctattactg ntttccccat caccagtggt
61 tactactgga tctggatccg tcagtcacct gggaaacccc tagaatggat ggggtacatc
121 actcatagtg gggaaacttt ctacaaccca tctctccaga gccccatctc cattactaga
181 gaaacgtcaa agaaccagtt cttcctccaa ttgaactctg tgaccacaga ggacacagcc
241 atgtattact gtncaggaga ctatgatggt tactggtact tcgatgtctg gggcacaggg
301 accacggtca ccgtctcctc a
//
LOCUS MUSIGHABW 336 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION M34583
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH32.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 336)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 336 Ig heavy chain V-D-J-region (AA at 1)
recomb 273 277 V-region DNA end/D-region DNA start
recomb 285 286 D-region DNA end/J-region DNA start
BASE COUNT 82 a 99 c 71 g 82 t 2 others
ORIGIN Chromosome 12.
1 ggacctggcc tggtgaaacc ctcacagtna ctcttcctta cctgctctat tactggtttc
61 cccatcacca gtggttacta ctggatctgg atccgtcagt cacctgggaa acccctagaa
121 tggatggggt acatcactca tagtggggaa actttctaca acccatctct ccagagcccc
181 atctccatta ctagagaaac gtcaaagaac cagttcttcc tccaattgaa ctctgtgacc
241 acagaggaca cagccatgta ttactgtnca ggagacaccg atggttactg gtacttcgat
301 gtctggggca cagggaccac ggtcaccgtc tcctca
//
LOCUS MUSIGKABQ 306 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION M34586
KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 306)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 306 Ig kappa-chain V-J-region (AA at 1)
recomb 284 285 V-region DNA end/J-region DNA start
BASE COUNT 75 a 85 c 77 g 69 t
ORIGIN Chromosome 6.
1 caaattgttc tcacccagtc tccagcaatc atgtctgcat ctccagggga gaaggtcacc
61 atgacctgca gtgccagctc aagtgtaagt tacatgcact ggtaccagca gaagtcaggc
121 acctccccca aaagatggat ttatgacaca tccaaactgg cttctggagt ccctgctcgc
181 ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagcagcat ggaggctgaa
241 gatgctgcca cttattactg ccagcagtgg agtagtaacc cgtggacgtt cggtggaggc
301 accaag
//
LOCUS MUSIGKABR 339 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION M34588
KEYWORDS immunoglobulin heavy-chain; joining exon;
kappa-immunoglobulin: processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 339)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 339 Ig kappa-chain V-J-region (AA at 1)
recomb 302 303 V-region DNA end/J-region DNA start
BASE COUNT 89 a 81 c 81 g 86 t 2 others
ORIGIN Chromosome 6.
1 gatgttttga tgacccaaac tccactctcc ctncctgtca gtcttggaga tcaagcctcc
61 atctcttgca gatctagtca gagcattgta catagtaatg gaaacaccta tttagaatgg
121 tacctgcaga aaccaggcca gtctccaaag ctcctnatct acaaagtttc caaccgattt
181 tctggggtcc cagacaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc
241 agcagagtgg aggctgagga tctgggagtt tattactgct ttcaaggttc acatgttccg
301 tggacgttcg gtggaggcac caagctggaa atcaaacgg
//
LOCUS MUSIGKABS 339 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION M34589
KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH31.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 339)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 339 Ig kappa-chain V-J-region (AA at 1)
recomb 302 303 V-region DNA end/J-region DNA start
BASE COUNT 89 a 79 c 82 g 86 t 3 others
ORIGIN Chromosome 6.
1 gatgttttga tgacncaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc
61 atctcttgca gatctagtca gagcattgta catagtaatg gaaacaccta tttagaatgg
121 tacctgcaga aaccaggcca gtctccaaag ctcctnatct acaaagtttc caaccgattt
181 tctggggtcc caganaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc
241 agcagagtgg aggctgagga tctgggagtt tattactgct ttcaaggttc acatgttccg
301 tggacgttcg gtggaggcac caagctggaa atcaaacgg
//
LOCUS MUSIGKABT 323 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION M34590
KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH12.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 323)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 323 Ig kappa-chain V-J-region (AA at 1)
recomb 288 289 V-region DNA end/J-region DNA start
BASE COUNT 96 a 81 c 70 g 76 t
ORIGIN Chromosome 6.
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctccgac gttcggtgga
301 ggcaccaagc tggaaatcaa acg
//
LOCUS MUSIGKABU 339 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION M34591
KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH27.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 339)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 339 Ig kappa-chain V-J-region (AA at 1)
recomb 290 291 V-region DNA end/J-region DNA start
BASE COUNT 84 a 91 c 81 g 82 t 1 others
ORIGIN Chromosome 6.
1 gagaatgtgc tcacccagtc tccaaccacc atggctgcat ctcccggnga gaagatcact
61 atcacctgca gtgccagctc aagtataagt tccaattact tgcattggta tcagcagaag
121 ccaggattct cccctaaact cttgatttat aggacatcca atctggcttc tggagtccca
181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaattgg caccatggag
241 gctgaagatg ttgccactta ctactgccag cagggtagta gtataccacg cacgttcggc
301 tcggggacaa agttggaaat aaaacgggct gatgctgca
//
LOCUS MUSIGKABV 327 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION M34592
KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH32.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 327)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 327 Ig kappa-chain V-J-region (AA at 1)
recomb 290 291 V-region DNA end/J-region DNA start
BASE COUNT 82 a 87 c 80 g 78 t
ORIGIN Chromosome 6.
1 gagaatgtgc tcacccagtc tccaaccacc atggctgcat ctcccgggga gaagatcact
61 atcacctgca gtgccagctc aagtataagt tccaattact tgcattggta tcagcagaag
121 ccaggattct cccctaaact cttgatttat aggacatcca atctggcttc tggagtccca
181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaattgg caccatggag
241 gctgaagatg ttgccactta ctactgccag cagggtagta gtataccgta cacgttcgga
301 ggggggacca agctggaaat aaaacgg
//
LOCUS MUSIGKABW 324 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION M34593
KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH34.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 324 Ig kappa-chain V-J-region (AA at 1)
recomb 287 288 V-region DNA end/J-region DNA start
BASE COUNT 96 a 76 c 80 g 72 t
ORIGIN Chromosome 6.
1 gacatcaaga tgacccagtc tccatcctcc atgtatgcat cgctgggaga gagagtcact
61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca
121 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca
181 agattcagtg gcagtggatc tgggcaagat tattctctaa ccatcagcag cctggagtct
241 gacgatacag caacttatta ctgtctacag catggtgaga gcccgtacac gttcggaggg
301 gggaccaagc tggaaataaa acgt
//
LOCUS MUSIGLBLP 349 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig lambda-chain mRNA V-J region, partial cds.
ACCESSION M34587
KEYWORDS immunoglobulin heavy-chain; joining exon; lambda-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH9.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 349; revises [1])
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 349 Ig lambda-chain V-J-region (AA at 1)
recomb 293 294 V-region DNA end/J-region DNA start
BASE COUNT 87 a 94 c 81 g 86 t 1 others
ORIGIN Chromosome 16.
1 caggctgttg tgactcagga atctgcactc accacatcac ctggtggaac agtcatactc
61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa
121 aaaccagatc atttattcac tggtctaata ggtggtacca gcaaccgagc tccaggtgtt
181 cctgtcagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca
241 cagactgagg atgatgcaat gtatttctgt gctctatggt acagcaccca ttatgttttc
301 ggcggtggaa ccaaggtcac tgtcctaggt cagcccaagt ccantccca
//
LOCUS MUSIGLBN 330 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig lambda-chain mRNA V-J region, partial cds.
ACCESSION M34584
KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 330)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 330; revises [1])
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 330 Ig lambda-chain V-J-region (AA at 1)
recomb 291 292 V-region DNA end/J-region DNA start
BASE COUNT 87 a 87 c 79 g 77 t
ORIGIN Chromosome 16.
1 caggctgttg tgactcagga atctgcactc accacatcac ctggtgaaac agtcacactc
61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa
121 aaaccagatc atttattcac tggtctaata ggtggtacca acaaccgagc tccaggtgtt
181 cctgccagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca
241 cagactgagg atgaggcaat atatttctgt gctctatggt acagcaacca ctgggtgttc
301 ggtggaggaa ccaaactgac tgtcctaggc
//
LOCUS MUSIGLBO 330 bp ss-mRNA ROD 17-JUL-1990
DEFINITION Mouse Ig lambda-chain mRNA V-J region, partial cds.
ACCESSION M34585
KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
to mRNA, clone CH2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 330)
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
TITLE Restricted Ig variable region gene expression among Ly-1+ B cell
lymphomas
JOURNAL J. Immunol. 141, 2788-2796 (1988)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 330; revises [1])
AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by C.A.Pennell, 20-APR-1990.
FEATURES from to/span description
pept < 1 > 330 Ig lambda-chain V-J-region (AA at 1)
recomb 291 292 V-region DNA end/J-region DNA start
BASE COUNT 87 a 86 c 78 g 79 t
ORIGIN Chromosome 16.
1 caggctgttg tgactcagga atctgcactc accacatcac ctggtgaaac agtcacactc
61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa
121 aaaccagatc atttattcac tggtctaata ggtggtacca acaaccgagc tccaggtgtt
181 cctgccagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca
241 cagactgagg atgaggcaat atatttctgt gctctatggt acagcaacca tttggtgttc
301 ggtggaggaa ccaaactgac tgtcctaggc
//GenBank-Updates@genbank.bio.net (07/18/90)
LOCUS HUMELA2 906 bp ss-mRNA PRI 18-JUL-1990
DEFINITION Human elastase 2 mRNA, complete cds.
ACCESSION M16631
KEYWORDS elastase.
SOURCE Human pancreas, cDNA to mRNA (library of R.Weiss), clones
hpe2-lambda-[4,10].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 906)
AUTHORS Fletcher,T.S., Shen,W.-F. and Largman,C.
TITLE Primary structure of human pancreatic elastase 2 determined by
sequence analysis of the cloned mRNA
JOURNAL Biochemistry 26, 7256-7261 (1987)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by T.S.Fletcher, 05-AUG-1987.
A poly-adenylation signal is located at positions 891-896.
FEATURES from to/span description
pept 22 831 elastase 2 precursor
sigp 22 69 elastase 2 signal peptide
matp 70 828 elastase 2
BASE COUNT 197 a 274 c 253 g 182 t
ORIGIN 161 bp upstream of PstI site.
1 aaacagtccc agggacacac catgataagg acgctgctgc tgtccacttt ggtggctgga
61 gccctcagtt gtggggaccc cacttaccca ccttatgtga ctagggtggt tggcggtgaa
121 gaagcgaggc ccaacagctg gccctggcag gtctccctgc agtacagctc caatggcaag
181 tggtaccaca cctgcggagg gtccctgata gccaacagct gggtcctgac ggctgcccac
241 tgcatcagct cctccaggac ctaccgcgtg gggctgggcc ggcacaacct ctacgttgcg
301 gagtccggct cgctggcagt cagtgtctct aagattgtgg tgcacaagga ctggaactcc
361 aaccaaatct ccaaagggaa cgacattgcc ctgctcaaac tggctaaccc cgtctccctc
421 accgacaaga tccagctggc ctgcctccct cctgccggca ccattctacc caacaactac
481 ccctgctacg tcacgggctg gggaaggctg cagaccaacg gggctgttcc tgatgtcctg
541 cagcagggcc ggttgctggt tgtggactat gccacctgct ccagctctgc ctggtggggc
601 agcagcgtga aaaccagtat gatctgtgct gggggtgatg gcgtgatctc cagctgcaac
661 ggagactctg gcgggccact gaactgtcag gcgtctgacg gccggtggca ggtgcacggc
721 atcgtcagct tcgggtctcg cctcggctgc aactactacc acaagccctc cgtcttcacg
781 cgggtctcca attacatcga ctggatcaat tcggtgattg caaataacta accaaaagaa
841 gtccctggga ctgtttcaga cttggaaagg tcacagaagg aaaataatat aataaagtga
901 caactc
//
LOCUS CLOFTHFSA 1965 bp ds-DNA BCT 18-JUL-1990
DEFINITION C.thermoaceticum formyltetrahydrofolate synthetase (FTHFS) gene,
complete cds.
ACCESSION J02911
KEYWORDS formyltetrahydrofolate synthetase.
SOURCE C.thermoaceticum DNA.
ORGANISM Clostridium thermoaceticum
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1965)
AUTHORS Lovell,C.R., Przybyla,A. and Ljungdahl,L.G.
TITLE Primary structure of the thermostable formyltetrahydrofolate
synthetase from Clostridium thermoaceticum
JOURNAL Biochemistry 29, 5687-5694 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.R.Lovell, 13-APR-1990, for release after publication.
FEATURES from to/span description
pept 56 1735 formyltetrahydrofolate synthetase (FTHFS) (ttg
start codon) (EC 6.3.4.3)
mRNA < 1 1760 FTHFS mRNA
binding 41 44 ribosomal binding site (put.)
signal 1761 1785 pot. transcription termination signal
BASE COUNT 459 a 576 c 532 g 398 t
ORIGIN
1 aatatgtgac gtgcaaccgg atgttcactt tttcgaatga gaggagtgga ttacattgtc
61 caaggtaccc agtgatattg agattgccca ggcagccaaa atgaaaccgg tcatggaact
121 ggcccgggga ctgggcatcc aagaggacga ggtcgagctt tatggtaagt acaaggccaa
181 gatctccctc gatgtctatc gtcgcctcaa agacaagcct gacgggaaac taatcctggt
241 aaccgccatt acccctactc cggccggcga agggaaaact actaccagtg tcggtctcac
301 cgatgccctg gctcgcctgg ggaaaagggt gatggtctgc ctgcgggagc cctccctggg
361 acccagcttt ggtatcaaag gcggtgccgc cggcggtggt tatgcccagg tagtacccat
421 ggaagatatc aacctgcact tcaccggcga tatccacgcc gtcacctatg cccacaacct
481 gctggcggcc atggtggata accacctgca gcagggtaac gtcctgaata ttgatccccg
541 taccatcacc tggcgccggg tcatcgacct taatgaccgg gctctgagga acatagtcat
601 cggcctgggt ggcaaagcca acggcgtacc gcgggagaca gggtttgaca tctccgttgc
661 ctcggaggtt atggcctgcc tgtgcctggc cagcgacctc atggatctca aggaacgttt
721 cagccgcatt gttgtcggct acacctatga cggcaaaccg gtcaccgccg gcgatctgga
781 ggcccagggt tccatggctc ttctcatgaa ggacgccatt aaacccaacc tggtccaaac
841 cctggagaat acgccggcct ttatccacgg tggtcccttc gccaatatcg cccacggttg
901 caacagcatt atcgcaacca agacggccct gaaactggcg gattatgtcg tgacggaagc
961 cggtttcggt gccgacctgg gtgccgagaa gttctatgac gttaaatgcc gttatgccgg
1021 ctttaaaccc gatgccacag tcatcgtggc taccgtccgc gccctcaaga tgcacggcgg
1081 cgtacccaaa tcagacctgg ccactgaaaa cctggaagcc ctgcgggaag gctttgccaa
1141 cctggagaaa cacatcgaaa atatcggcaa gttcggcgta ccggcagtcg tggccatcaa
1201 tgccttcccc accgataccg aggccgagct aaatctcctc tacgagttgt gcgccaaagc
1261 tggggccgaa gttgccctct cggaagtctg ggctaagggc ggcgaaggcg gtctggaact
1321 tgcccggaag gtgttgcaga ccctggagag caggccatcc aacttccatg tcctctacaa
1381 cctggacctg agtattaaag acaaaattgc caaaatcgcc accgagatct acggggccga
1441 cggcgtcaac tatacggccg aagccgacaa agctatccag cgttatgaat ccctgggcta
1501 cggcaacctg ccggtggtca tggccaagac ccaatactcc ttttccgatg acatgaccaa
1561 gctcgggcgg ccgcggaact ttaccatcac cgtgcgcgag gtgcgcctct cggccggagg
1621 caggcttatc gtccccatca ccggcgccat aatgaccatg cccgggctgc ccaaacgccc
1681 ggcggcctgc aacatcgaca tcgatgccga cggcgtcatt accggtcttt tctagtatgc
1741 ctggagtaaa gcctatccct accccactcc tccagggagt ggggtttttg ctgtctatat
1801 gagaatgata taactaagaa gctggttttc caagccgtac ctatcatagg ggctgcgcca
1861 ccgccatgaa ccacgaaaat gcaaaagata tcattcatgt tagaactctg ttaccgagag
1921 gaacgattat gtctggcatg acgagataga gtataccgta gcgat
//
LOCUS HUMGLIBSA 176 bp ds-DNA PRI 18-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region A1.
ACCESSION M34816
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone A1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 176)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 31 53 human zinc finger protein binding site
binding 94 116 human zinc finger protein binding site
binding 157 176 human zinc finger protein binding site
BASE COUNT 68 a 50 c 36 g 22 t
ORIGIN
1 aagaacacag gaaccaactg caagagctcc cgaagaccac ccacaatgat ggttgtatgt
61 cagaaggaca caggaaccaa ctgcaagagc tcccaaagac cacccacaat gatggttgta
121 tgtcagaagg acacaggaac caactgcaag agctcccaaa aaccacccac aatgat
//
LOCUS HUMGLIBSB 166 bp ds-DNA PRI 18-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region A2.
ACCESSION M34817
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone A2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 166)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 27 49 human zinc finger protein binding site
binding 90 112 human zinc finger protein binding site
BASE COUNT 60 a 47 c 37 g 22 t
ORIGIN
1 acacaggaac caactgcaag agctcccgaa gaccacccac aatgatggtg gtatgttgaa
61 aggacacagg aaccaactgc aagagctccc aaagaccacc cacaatgatg gttgtgtgtc
121 aaaaggacac aggaaccaac tgtaagagct cctgaagacc acccac
//
LOCUS HUMGLIBSC 313 bp ds-DNA PRI 18-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region A3.
ACCESSION M34818
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone A3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 313)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 1 23 human zinc finger protein binding site
binding 64 86 human zinc finger protein binding site
binding 127 149 human zinc finger protein binding site
binding 190 212 human zinc finger protein binding site
binding 253 275 human zinc finger protein binding site
BASE COUNT 110 a 79 c 75 g 49 t
ORIGIN
1 caaagaccac ccacaatgat ggttgtatgt cagaaggaca caggaaccaa ctgcaagagc
61 tcccaaaaac cacccacaat gatggtggta tgtcagatgg acacaggaac caactgcaag
121 agctcccgaa gaccacccac aatgatggtg gtatgttgaa aggacacagg aaccaactgc
181 aagagctccc aaagaccacc cacgatgatg gttgtgtgtc aaaaggacac aggaaccaac
241 tgtaagagct cctgaagacc acccacaatg atggttttat gtcagaagga tacaggaacc
301 agctgcaagg gct
//
LOCUS HUMGLIBSD 184 bp ds-DNA PRI 18-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region A4.
ACCESSION M34819
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone A4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 184)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 32 54 human zinc finger protein binding site
binding 95 117 human zinc finger protein binding site
binding 158 180 human zinc finger protein binding site
BASE COUNT 67 a 46 c 41 g 30 t
ORIGIN
1 aaaagacaca ggaaccaact gcaagagctc ccaaagacca cccacaatga tggttgtgtg
61 tcaaaaggac acaggaacca actgtaagag ctcctgaaga ccacccacaa tgatggtttt
121 atgtcagaag gatacaggaa ccagctgcaa gagctcctga agaccaccca caatgatggt
181 tgta
//
LOCUS HUMGLIBSE 62 bp ds-DNA PRI 18-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region B1.
ACCESSION M34820
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone B1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 62)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 11 33 human zinc finger protein binding site
BASE COUNT 18 a 26 c 11 g 7 t
ORIGIN
1 aaaccagacg cgtggaccac ccaagacgaa attcacactc gaccctagcc ccgccagccc
61 tt
//
LOCUS HUMGLIBSF 67 bp ds-DNA PRI 18-JUL-1990
DEFINITION Human zinc finger protein (GLI) DNA binding region C1.
ACCESSION M34821
KEYWORDS GLI protein binding site.
SOURCE Human DNA, clone C1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 67)
AUTHORS Kinzler,K.W. and Vogelstein,B.
TITLE The GLI gene encodes a nuclear protein which binds specific
sequences in the human genome
JOURNAL Mol. Cell. Biol. 10, 634-642 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
binding 16 29 human zinc finger protein binding site
BASE COUNT 15 a 26 c 15 g 11 t
ORIGIN
1 agacccaggc acacagatag accacccagc ttcaggtggg ggaccaccct tgcatcccct
61 ctctgct
//
LOCUS MLVU3RA 383 bp ss-RNA VRL 18-JUL-1990
DEFINITION Murine leukemia virus CWM-T-15.4 U3 region genomic RNA.
ACCESSION M34722
KEYWORDS U3 region; promoter region.
SOURCE Murine leukemia virus (strain CWM-T-15.4), cDNA to genomic RNA,
from mice (strain CWB).
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 383)
AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs
of CWD mice
JOURNAL Virology 176, 166-177 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
signal 112 178 MuLV promoter region
binding 131 144 CCAAT-binding transcription factor
binding 147 152 Moloney leukemia virus binding protein b
binding 156 167 enhancer core binding protein
binding 350 362 CCAAT-binding transcription factor
binding 363 372 glucocorticoid response element
BASE COUNT 127 a 105 c 84 g 67 t
ORIGIN
1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat cttgcaaggc
61 atggaaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc
121 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccctagc
181 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg
241 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc
301 gcttattgct gcccagctct ataaaaaggg taaaaacccc acactcggcc ggcccagggc
361 caagaacaga tggtccccag aaa
//
LOCUS MLVU3RB 348 bp ss-RNA VRL 18-JUL-1990
DEFINITION Murine leukemia virus CWM-T-15 U3 region genomic RNA.
ACCESSION M34723
KEYWORDS U3 region; promoter region.
SOURCE Murine leukemia virus (strain CWM-T-15), cDNA to genomic RNA, from
mice (strain CWB).
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 348)
AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs
of CWD mice
JOURNAL Virology 176, 166-177 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
signal 77 177 MuLV promoter region
binding 96 109 CCAAT-binding transcription factor
binding 112 117 Moloney leukemia virus binding protein b
binding 121 132 enhancer core binding protein
binding 142 154 CCAAT-binding transcription factor
binding 155 164 glucocorticoid response element
BASE COUNT 115 a 97 c 76 g 60 t
ORIGIN
1 ctgcagtaac gccatcttgc aaggcatgga aaaataccag agctgatgtt ctcagaaaaa
61 caagaacaag gaagtacaga gaggctggaa agtaccggga ctagggccaa acaggatatc
121 tgtggtcaaa cacctgggcc ccggcccagg gccaagaaca gatggtcccc agaaatagct
181 aaaacaacaa cagtttcaag agacccaaaa actgtctcaa ggttccccag atgaccgggg
241 atcaacccca agcctcattt aaactaacca atcagctcgc ttctcgcttc tgtacccgcg
301 cttattgctg cccagctcta taaaaagggt aaaaacccca cactcggc
//
LOCUS MLVU3RC 445 bp ss-RNA VRL 18-JUL-1990
DEFINITION Murine leukemia virus CWM-T-25a U3 region genomic RNA.
ACCESSION M34724
KEYWORDS U3 region; promoter region.
SOURCE Murine leukemia virus (strain CWN-T-25a), cDNA to genomic RNA, from
mice (strain CWB).
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 445)
AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs
of CWD mice
JOURNAL Virology 176, 166-177 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
signal 77 274 MuLV promoter region
binding 96 109 CCAAT-binding transcription factor
binding 168 179 CCAAT-binding transcription factor
binding 238 252 CCAAT-binding transcription factor
binding 138 143 Moloney leukemia virus binding protein b
binding 209 214 Moloney leukemia virus binding protein b
binding 147 158 enhancer core binding protein
binding 218 229 enhancer core binding protein
binding 180 189 glucocorticoid response element
binding 252 263 glucocorticoid response element
BASE COUNT 142 a 126 c 104 g 73 t
ORIGIN
1 ctgcagtaac gccattttgc aaggcatgaa aaagtaccag agctgaagtt ctcagaaaaa
61 caagaacaag gaagtacaga gaggctaaaa agtacccggc ccagggccaa gaacagatgg
121 tcccagaccg ctaacgacag gatatctgtg gttaagcact agggccccgc ccagggccaa
181 gaacagatgg tccccagacc gctaacgaca ggatatctgt ggttaagcac tagggccccg
241 gcccagggcc aagaacagat ggtccccaga aatagctaaa acaacaacag tttcaagaga
301 cgcagaaact gtctcaaggt tccccagatg accggggatc aaccccaagc ctcatttaaa
361 ctaaccaatc agctcgcttc tcgcttctgt acccgcgctt attgctgccc agctctataa
421 aaagggtaag aaccccacac tcggc
//
LOCUS MLVU3RD 409 bp ss-RNA VRL 18-JUL-1990
DEFINITION Murine leukemia virus CWM-T-25b U3 region genomic RNA.
ACCESSION M34725
KEYWORDS U3 region; promoter region.
SOURCE Murine leukemia virus (strain CWN-T-25b), cDNA to genomic RNA, from
mice (strain CWB).
ORGANISM Murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 409)
AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs
of CWD mice
JOURNAL Virology 176, 166-177 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
signal 112 238 MuLV promoter region
binding 131 144 CCAAT-binding transcription factor
binding 174 179 Moloney leukemia virus binding protein b
binding 183 194 enhancer core binding protein
binding 204 215 CCAAT-binding transcription factor
binding 216 225 glucocorticoid response element
BASE COUNT 133 a 114 c 91 g 71 t
ORIGIN
1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc
61 atgggaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc
121 taaaaggtac ccggcccagg gccaagaaca gatggtcccc agaccgctaa cgacaggata
181 tctgtggtta agcactaggg ccccggccag ggccaagaac agatggtccc cagaaatagc
241 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg
301 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc
361 gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggc
//
LOCUS SIVPBJA 125 bp ss-RNA VRL 18-JUL-1990
DEFINITION Simian immunodeficiency virus upstream enhancer region.
ACCESSION M31325
KEYWORDS .
SOURCE Simian (mangabey) immunodeficiency virus (isolate PBj14), clone
4.41.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 125)
AUTHORS Dewhurst,S., Embretson,J.E., Anderson,D.C., Mullins,J.I. and
Fultz,P.N.
TITLE Sequence analysis and acute pathogenicity of molecularly cloned SIV
JOURNAL Nature 345, 636-640 (1990)
STANDARD simple staff_review
BASE COUNT 40 a 25 c 38 g 22 t
ORIGIN
1 gaagaaaaca agctaagaca gcagggactt tccacaaaga cagcagggac tttccacaaa
61 ggggatgtta cggggaggta ctggggagga gctggctgga acgcccacgt attctctgta
121 taaat
//
LOCUS CPARRAA 119 bp ss-rRNA RNA 18-JUL-1990
DEFINITION C.paradoxa cellular 5S ribosomal RNA.
ACCESSION M33029
KEYWORDS 5S ribosomal rRNA.
SOURCE C.paradoxa rRNA.
ORGANISM Cyanophora paradoxa
Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae;
Cryptomonadales; Kathablepharidaceae.
REFERENCE 1 (bases 1 to 119)
AUTHORS Maxwell,E.S., Liu,J. and Shively,J.M.
TITLE Nucleotide sequence of Cyanophora paradoxa cellular and cyanelle-
associated 5S ribosomal RNAs
JOURNAL Ann. N.Y. Acad. Sci. 503, 559-561 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 119 5S ribosomal RNA
BASE COUNT 25 a 35 c 37 g 22 t
ORIGIN
1 gtgtacggct atactaccgg aaaagcgccc gttcccgtcc gattacgaga gcctagcccg
61 gtcaggcccg actagtacta gggtggggga ccacctggga acatcgggtg ctgtacact
//
LOCUS CPARRAB 118 bp ss-rRNA RNA 18-JUL-1990
DEFINITION C.paradoxa cyanellar 5S ribosomal RNA.
ACCESSION M33030
KEYWORDS 5S ribosomal rRNA.
SOURCE C.paradoxa cyanellar rRNA.
ORGANISM Cyanelle Cyanophora paradoxa
Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae;
Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa.
REFERENCE 1 (bases 1 to 118)
AUTHORS Maxwell,E.S., Liu,J. and Shively,J.M.
TITLE Nucleotide sequence of Cyanophora paradoxa cellular and cyanelle-
associated 5S ribosomal RNAs
JOURNAL Ann. N.Y. Acad. Sci. 503, 559-561 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 118 5S ribosomal RNA
BASE COUNT 31 a 26 c 31 g 30 t
ORIGIN
1 cctggtgctt attgcgtggt ggaaccacac cgatccatac cgaactcggt tgtgaaacat
61 tactgcggcg acaatacttg aagggtaact ttctggaaag atagcttagt gccaggat
//
LOCUS SNDZSTR 15384 bp ss-RNA VRL 18-JUL-1990
DEFINITION Sendai virus nucleoprotein (NP), phosphoprotein (P), matrix protein
(M), F protein, hemagglutinin-neuraminidase (HN), and L protein
genes, complete cds.
ACCESSION M30202 M30203 M30204 M19661
KEYWORDS F protein; L protein; hemagglutinin-neuraminidase; matrix protein;
nucleoprotein; phosphoprotein.
SOURCE Sendai virus (Z strain, F1-R mutant, and ts-f1 mutant) RNA.
ORGANISM Parainfluenza virus type 1
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Paramyxoviridae; Paramyxovirus.
REFERENCE 1 (sites)
AUTHORS Tashiro,M., Pritzer,E., Khoshnan,M.A., Yamakawa,M., Kuroda,K.,
Klenk,H.-D., Rott,R. and Seto,J.T.
TITLE Characterization of a pantropic variant of Sendai virus derived
from a host range mutant
JOURNAL Virology 165, 577-583 (1988)
STANDARD full staff_review
REFERENCE 2 (bases 4866 to 6560; for [1])
AUTHORS Tashiro,M., Pritzer,E., Khoshnan,M.A., Yamakawa,M., Kuroda,K.,
Klenk,H.-D., Rott,R. and Seto,J.T.
JOURNAL Unpublished (1988)
STANDARD full staff_review
REFERENCE 3 (bases 1 to 15384)
AUTHORS Middleton,Y., Tashiro,M., Thai,T., Oh,J., Seymour,J., Pritzer,E.,
Klenk,H.-D., Rott,R. and Seto,J.T.
TITLE Nucleotide sequence analyses of the genes encoding the HN, M, NP,
P, and L proteins of two host range mutants of Sendai virus
JOURNAL Unpublished (1989) California State University at Los Angeles,
Dept. of Microbiology, Los Angeles, CA 90032-8745
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] and [2] kindly
submitted by J.T.Seto, 08-JUL-1988, for [3], 27-NOV-1989.
Author Address: J.T.Seto
California State University at Los Angeles
Department of Microbiology
Los Angeles, CA 90032-8745
FEATURES from to/span description
pept 120 1694 nucleoprotein (NP)
pept 1844 3550 phosphoprotein (P)
pept 3669 4715 matrix protein (M)
pept 4866 6563 F protein
pept 6693 8420 hemagglutinin-neuraminidase (HN)
pept 8556 15242 L protein
variant 310 310 c in strain Z and F1-R mutant; t in ts-f1
mutant
variant 350 350 c in strain Z and F1-R mutant; t in ts-f1
mutant
variant 1580 1580 t in strain Z and F1-R mutant; c in ts-f1
mutant
variant 1586 1586 g in strain Z and ts-f1 mutant; a in F1-R
mutant
variant 2307 2307 g in strain Z; a in F1-R mutant and ts-fl
mutant
variant 4020 4020 a in strain Z and F1-R mutant; c in ts-fl
mutant
variant 4051 4051 g in strain Z and F1-R mutant; a in ts-fl
mutant
variant 4297 4297 c in strain Z and F1-R mutant; t in ts-fl
mutant
variant 5053 5053 t in strain Z and F1-R mutant; g in ts-fl
mutant
variant 5176 5176 g in strain Z and F1-R mutant; a in ts-fl
mutant
variant 5208 5208 c in strain Z and F1-R mutant; t in ts-fl
mutant
variant 5212 5212 a in strain Z and F1-R mutant; g in ts-fl
mutant
variant 5700 5700 a in strain Z and F1-R mutant; g in ts-fl
mutant
variant 5708 5708 t in strain Z and F1-R mutant; c in ts-fl
mutant
variant 6530 6530 a in strain Z and F1-R mutant; c in ts-fl
mutant
variant 10429 10429 g in strain Z; a in F1-R mutant and ts-fl
mutant
variant 15262 15262 c in strain Z and F1-R mutant; a in ts-fl
mutant
BASE COUNT 4595 a 3329 c 3756 g 3704 t
ORIGIN
1 accaaacaag agaaaaaaca tgtatgggat atgtaatgaa gttatacagg attttagggt
61 caaagtatcc accctgagga gcaggttcca gaccctttgc tttgctgcca aagttcacga
121 tggccgggtt gttgagcacc ttcgatacat ttagctctag gaggagcgaa agtattaata
181 agtcgggagg aggtgctgtt atccccggcc agaggagcac agtctcagtg ttcgtactag
241 gcttaagtgt gactgatgat gcagacaagt tattcattgc aactaccttc ctagctcact
301 cattggacac agataagcag cactctcaga gaggagggtt cctcgtctcc ctgcttgcca
361 tggcttacag tagtccagaa ttgtacttga caacaaacgg agtaaacgcc gatgtcaaat
421 atgtgatcta caacatagag aaagacccta agaggacgaa gacagacgga ttcattgtga
481 agacgagaga tatggaatat gagaggacca cagaatggct gtttggacct atggtcaaca
541 agagcccact cttccagggt caacgggatg ctgcagaccc tgacacactc cttcaaatct
601 atgggtatcc tgcatgccta ggagcaataa ttgtccaagt ctggattgtg ctggtgaagg
661 ccatcacaag cagcgccggc ttaaggaaag ggttcttcaa caggttagag gcgttcagac
721 aagacggcac cgtgaaaggt gccttagttt tcactgggga gacagttgag gggataggct
781 cggttatgag atctcagcaa agccttgtat ctctcatggt tgagaccctt gtgactatga
841 atactgcaag atctgatctc accacattag agaagaacat ccagatagtt gggaactaca
901 tccgagatgc agggctggct tccttcatga acactattaa atatggggtg gagacaaaga
961 tggcagctct aacgttgtca aacctgaggc ccgatattaa taagcttaga agcctcatag
1021 acacctacct gtcaaaaggc cccagagctc cctttatctg tatcctcaag gaccctgttc
1081 atggtgaatt tgctccaggc aattatcctg cactatggag ttacgccatg ggagtcgccg
1141 tcgtacagaa caaggcaatg cagcagtacg tcacagggag gacatacctt gatatggaaa
1201 tgttcttact aggacaagcc gtggcaaagg atgctgaatc gaagatcacg agtgccttgg
1261 aagatgagtt aggagtgacg gatacagcca aggggaggct cagacatcat ctggcaaact
1321 tgtccggtgg ggatggtgct taccgcaaac caacaggcgg tggtgcaatt gaggtagctc
1381 tagacaatgc cgatatcgac ctagaaacaa aagctcatgc ggaccaggac gctaggggtt
1441 ggggtggaga tagtggtgaa agatgggcac gtcaggtgag tggtggccac tttgtcacac
1501 tacatggggc tgaacggtta gaggaggaaa ccaatgatga ggatgtatca gacatagaga
1561 gaagaatagc catgagactt gcagagagac ggcaagagga ttctgcaacc catggagatg
1621 aaggccgcaa taacggtgtc gatcacgacg aagatgacga taccgcagca gtagctgggg
1681 taggaggaat ctaggatcat acgaggcttc aaggtacttg atccgtagta agaaaaactt
1741 agggtgaaag ttcatccacc gatcggctca ggcaaggcca cacccaaccc caccgaccac
1801 acccagcagt cgagacagcc acggcttcgg ctacacttac cgcatggatc aagatgcctt
1861 cattcttaaa gaagattctg aagttgagag ggaggcgcca ggaggacgag agtcgctctc
1921 ggatgttatc ggattcctcg atgctgtcct gtcgagtgaa ccaactgaca tcggagggga
1981 cagaagctgg ctccacaaca ccatcaacac tccccaagga ccaggctctg ctcatagagc
2041 caaaagtgag ggcgaaggag aagtctcaac accgtcgacc caagataatc gatcaggtga
2101 ggagagtaga gtctctggga gaacaagcaa gccagaggca gaagcacatg ctggaaacct
2161 tgataaacaa aatatacacc gggcctttgg gggaagaact ggtacaaact ctgtatctca
2221 ggatctgggc gatggaggag actccggaat ccttgaaaat cctccaaatg agagaggata
2281 tccgagatca ggtattgaag atgaaagcag agagatggct gcgcaccctg ataagagggg
2341 agaagaccaa gctgaaggac ttccagaaga ggtacgagga ggtacatccc tacctgatga
2401 aggagaaggt ggagcaagta ataatggaag aagcatggag cctggcagct cacatagtgc
2461 aagagtaact ggggtcctgg tgattcctag ccccgaactt gaagaggctg tgctacggag
2521 gaacaaaaga agacctacca acagtgggtc caaacctctt actccagcaa ccgtgcctgg
2581 cacccggtcc ccaccgctga atcgttacaa cagcacaggg tcaccaccag gaaaaccccc
2641 atctacacag gatgagcaca tcaactctgg ggacaccccc gccgtcaggg tcaaagaccg
2701 gaaaccacca atagggaccc gctctgtctc agattgtcca gccaacggcc gcccaatcca
2761 cccgggtcta gagaccgact caacaaaaaa gggcatagga gagaacacat catctatgaa
2821 agagatggct acattgttga cgagtcttgg tgtaatccag tctgctcaag aattcgaatc
2881 atcccgagac gcgagttatg tgtttgcaag acgtgcccta aagtctgcaa actatgcaga
2941 gatgacattc aatgtatgcg gcctgatcct ttctgccgag aaatcttccg ctcgtaaggt
3001 agatgagaac aaacaactgc tcaaacagat ccaagagagc gtggagtcat tccgggatat
3061 ttataagaga ttctctgagt atcagaaaga acagaactca ttgctgatgt ccaacctatc
3121 tacacttcat atcatcacag atagaggtgg caagactgac aacacagact cccttacaag
3181 gtccccctcc gtttttgcaa aatcaaaaga gaacaagact aaggctacca ggtttgaccc
3241 atctatggag accctagaag atatgaagta caaaccggac ctaatccgag aggatgaatt
3301 tagagatgag atccgcaacc cggtgtacca agagagggac acagaaccta gggcctcaaa
3361 cgcatcacgt ctcctcccct ccaaagagaa gcccacaatg cactctctca ggctcgtcat
3421 agagagcagt cccctaagca gagctgagaa agcagcatat gtgaaatcat tatccaagtg
3481 caagacagac caagaggtta aggcagtcat ggaactcgta gaagaggaca tagagtcact
3541 gaccaactag atcccgggtg aggcatccta ccatcctcag tcatagagag atccaatcta
3601 ccatcagcat cagccagtaa agattaagaa aaacttaggg tgaaagaaat ttcacctaac
3661 acggcgcaat ggcagatatc tatagattcc ctaagttctc atatgaggat aacggtactg
3721 tggagcccct gcctctgaga actggtccgg ataagaaagc catcccccac atcaggattg
3781 tcaaggtagg agtccctcct aaacatggag tgagatacct agatttattg ctcttgggtt
3841 tctttgagac accgaaacaa acaaccaatc tagggagcgt atctgacttg acagagccga
3901 ccagctactc aatatgcggc tccgggtcgt tacccatagg tgtggccaaa tactacggga
3961 ctgatcagga actcttaaag gcctgcaccg atctcagaat tacggtgagg aggactgtta
4021 gagcaggaga gatgatcgta tacatggtgg gttcgattgg tgctccactc ctaccatggt
4081 caggcaggct gagacaggga atgatattta atgcaaacaa ggtcgcacta gctccccaat
4141 gcctccctgt ggacaaggac ataagactca gagtggtgtt tgtcaatggg acatctctag
4201 gggcaatcac catagccaag atcccaaaga cccttgcaga ccttgcattg cccaactcta
4261 tatccgttaa tttactggtg acactcaaga ccgggacctc cacagaacaa aagggggtac
4321 tcccagtact tgatgatcaa ggggagaaaa agctcaattt tatggtgcac ctcgggttga
4381 tcaggagaaa ggtcgggaag atatactctg ttgagtactg caagagcaag attgagagaa
4441 tgcggctgat tttctcactt gggttaatcg gcggtataag cttccatgtt caggttattg
4501 ggacactatc taagacattc atgagtcagc tcgcatggaa gagggcagtc tgcttcccat
4561 taatggatgt gaatccccat atgaacatgg tgatttgggc ggcatctgta gaaatcacag
4621 gcgtcgatgc ggtgttccaa ccggccatcc ctcgtgattt ccgctactac cctaatgttg
4681 tggctaagaa catcggaagg atcagaaagc tgtaaatgtg cacccatcag agacctgcga
4741 caatgcccca agcagacacc acctggcagt cggagccacc gggtcactcc ttgtcttaaa
4801 taagaaaaac ttagggataa agtcccttgt gagtgcttgg ttgcaaaact ctccccttgg
4861 gaaacatgac agcatatatc cagagatcac agtgcatctc aacatcacta ctggttgttc
4921 tcaccacatt ggtctcgtgt cagattccca gggataggct ctctaacata ggggtcatag
4981 tcgatgaagg gaaatcactg aagatagctg gatcccacga atcgaggtac atagtactga
5041 gtctagttcc ggtggtagac tttgagaatg ggtgcggaac agcccaggtt atccagtaca
5101 agagcctact gaacaggctg ttaatcccat tgagggatgc cttagatctt caggaggctc
5161 tgataactgt caccagtgat acgacacaaa atgccggtgc tccacagccg aaattcttcg
5221 gtgctgtgat tggtactatc gcacttggag tggcgacatc agcacaaatc accgcaggga
5281 ttgcactagc cgaagcgagg gaggccaaaa gagacatagc gctcatcaaa gaatcgatga
5341 caaaaacaca caagtctata gaactgctgc aaaacgctgt gggggaacaa attcttgctc
5401 taaagacact ccaggatttc gtgaatgatg agatcaaacc cgcaataagc gaattaggct
5461 gtgagactgc tgccttaaga ctgggtataa aattgacaca gcattactcc gagctgttaa
5521 ctgcgttcgg ctcgaatttc ggaaccatcg gagagaagag cctcacgctg caggcgctgt
5581 cttcacttta ctctgctaac attactgaga ttatgaccac aatcaggaca gggcagtcta
5641 acatctatga tgtcatttat acagaacaga tcaaaggaac ggtgatagat gtggatctaa
5701 agagatatat ggtcaccctg tctgtgaaga tccctattct ttctgaagtc ccaggtgtgc
5761 tcatacacaa ggcatcatct atttcttaca acatagacgg ggaggaatgg tatgtgactg
5821 tccccagcca tatactcagt cgtgcttctt tcttaggggg tgcagacata accgattgtg
5881 ttgagtccag attgacctat atatgcccca gggatcccgc acaactgata cctgacagcc
5941 agcaaaagtg tatcctgggg gacacaacaa ggtgtcctgt cacaaaagtt gtggacagcc
6001 ttatccccaa gtttgctttt gtgaatgggg gcgttgttgc taactgcata gcatccacat
6061 gtacctgcgg gacaggccga agaccaatca gtcaggatcg ctctaaaggt gtagtattcc
6121 taacccatga caactgtggt cttataggtg tcaatggggt agaattgtat gctaaccgga
6181 gagggcacga tgccacttgg ggggtccaga acttgacagt cggtcctgca attgctatca
6241 gacccgttga tatttctctc aaccttgctg atgctacgaa tttcttgcaa gactctaagg
6301 ctgagcttga gaaagcacgg aaaatcctct cggaggtagg tagatggtac aactcaagag
6361 agactgtgat tacgatcata gtagttatgg tcgtaatatt ggtggtcatt atagtgatca
6421 tcatcgtgct ttatagactc agaaggtcaa tgctaatggg taatccagat gaccgtatac
6481 cgagggacac atacacatta gagccgaaga tcagacatat gtacacaaaa ggtgggtttg
6541 atgcaatggc tgagaaaaga tgatcacgac cattatcaga tgtcttgtaa agcaggcatg
6601 gtattcgttg agatctgtat ataataagaa aaacttaggg tgaaagtgag gtcgcgcggt
6661 actttagctt tcacctcaaa caagcacaga tcatggatgg tgataggggc aaacgtgact
6721 cgtactggtc tacttctcct agtggtagca ctacaaaatt agcatcaggt tgggagaggt
6781 caagtaaagt tgacacatgg ttgctgattc tctcattcac ccagtgggct ttgtcaattg
6841 ccacagtgat catctgtatc ataatttctg ctagacaagg gtatagtatg aaagagtact
6901 caatgactgt agaggcattg aacatgagca gcagggaggt gaaagagtca cttaccagtc
6961 taataaggca agaggttata gcaagggctg tcaacattca gagctctgtg caaaccggaa
7021 tcccagtctt gttgaacaaa aacagcaggg atgtcatcca gatgattgat aagtcgtgca
7081 gcagacaaga gctcactcag cactgtgaga gtacgatcgc agtccaccat gccgagggaa
7141 ttgccccact tgagccacat agtttctgga gatgccctgt cggagaaccg tatcttagct
7201 cagatcctga aatctcattg ctgcctggtc cgagcttgtt atctggttct acaacgatct
7261 ctggatgtgt taggctccct tcactctcaa ttggcgaggc aatctatgcc tattcatcaa
7321 atctcattac acaaggttgt gctgacatag ggaaatcata tcaggtcctg cagctagggt
7381 acatatcact caattcagat atgttccctg atcttaaccc cgtagtgtcc cacacttatg
7441 acatcaacga caatcggaaa tcatgctctg tggtggcaac cgggactagg ggttatcagc
7501 tttgctccat gccgactgta gacgaaagaa ccgactactc tagtgatggt atcgaggatc
7561 tggtccttga tgtcctggat ctcaaaggga gaactaagtc tcaccggtat cgcaacagcg
7621 aggtagatct tgatcacccg ttctctgcac tataccccag tgtaggcaac ggcattgcaa
7681 cagaaggctc attgatattt cttgggtatg gtggactaac cacccctctg cagggtgata
7741 caaaatgtag gacccaagga tgccaacagg tgtcgcaaga cacatgcaat gaggctctga
7801 aaattacatg gctaggaggg aaacaggtgg tcagcgtgat catccaggtc aatgactatc
7861 tctcagagag gccaaagata agagtcacaa ccattccaat cactgaaaac tatctcgggg
7921 cggaaggtag attattaaaa ttgggtgatc gggtgtacat ctatacaaga tcatcaggct
7981 ggcactctca actgcagata ggagtacttg atgtcagcca ccctttgact atcaactgga
8041 cacctcatga agccttgtct agaccaggaa ataaagagtg caattggtac aataagtgtc
8101 cgaaggaatg catatcaggc gtatacactg atgcttatcc attgtcccct gatgcagcta
8161 acgtcgctac cgtcacgcta tatgccaata catcgcgtgt caacccaaca atcatgtatt
8221 ctaacactac taacattata aatatgttaa ggataaagga tgttcaatta gaggctgcat
8281 ataccacgac atcgtgtatc acgcattttg gtaaaggcta ctgctttcac atcatcgaga
8341 tcaatcagaa gagcctgaat accttacagc cgatgctctt taagactagc atccctaaat
8401 tatgcaaggc cgagtcttaa atttaactga ctagcaggct tgtcggcctt gctgacacta
8461 gagtcatctc cgaacatcca caatatctct cagtctctta cgtctctcac agtattaaga
8521 aaaacccagg gtgaatggga agcttgccat aggtcatgga tgggcaggag tcctcccaaa
8581 acccttctga catactctat ccagaatgcc acctgaactc tcccatagtc agggggaaga
8641 tagcacagtt gcacgtcttg ttagatgtga accagcccta cagactgaag gacgacagca
8701 taataaatat tacaaagcac aaaattagga acggaggatt gtcccctcgt caaattaaga
8761 tcaggtctct gggtaaggct cttcaacgca caataaagga tttagaccga tacacgtttg
8821 aaccgtaccc aacctactct cacgaattac ttaggcttga tataccagag atatgtgaca
8881 aaatccgatc cgtcttcgcg gtctcggatc ggctgaccag ggagttatct agtgggttcc
8941 aggatctttg gttgaatatc ttcaagcaac taggcaatat agaaggaaga gaggggtacg
9001 atccgttgca ggatatcggc accatcccgg agataactga taagtacagc aggaatagat
9061 ggtataggcc attcctaact tggttcagca tcaaatatga catgcggtgg atgcagaaga
9121 ccagaccggg gggacccctt gatacctcta attcacataa cctcctagaa tgcaaatcat
9181 acactctagt aacatacgga gatcttgtca tgatactgaa caagttgaca ttgacagggt
9241 atatcctaac ccctgagctg gtcttgatgt attgtgatgt tgtagaagga aggtggaata
9301 tgtctgctgc agggcatcta gataagaagt ccattgggat aacaagcaaa ggtgaggaat
9361 tatgggaact agtggattcc ctcttctcaa gtcttggaga ggaaatatac aatgtcatcg
9421 cactattgga gcccctatca cttgctctca tacaactaaa tgatcctgtt atacctctac
9481 gtggggcatt tatgaggcat gtgttgacag agctacagac tgttttaaca agtagagacg
9541 tgtacacaga tgctgaagca gacactattg tggagtcgtt actcgccatt ttccatggaa
9601 cctctattga tgagaaagca gagatctttt ccttctttag gacatttggc caccccagct
9661 tagaggctgt cactgccgcc gacaaggtaa gggcccatat gtatgcacaa aaggcaataa
9721 agcttaagac cctatacgag tgtcatgcag ttttttgcac tatcatcata aatgggtata
9781 gagagaggca tggcggacag tggcccccct gtgacttccc tgatcacgtg tgtctagaac
9841 taaggaacgc tcaagggtcc aatacggcaa tctcttatga atgtgctgta gacaactata
9901 caagtttcat aggcttcaag tttcggaagt ttatagaacc acaactagat gaagatctca
9961 caatatatat gaaagacaaa gcactatccc ccaggaagga ggcatgggac tctgtatacc
10021 cggatagtaa tctgtactat aaagccccag agtctgaaga gacccggcgg cttattgaag
10081 tgttcataaa tgatgagaat ttcaacccag aagaaattat caattatgtg gagtcaggag
10141 attggttgaa agacgaggag ttcaacatct cgtacagtct caaagagaaa gagatcaagc
10201 aagagggtcg tctattcgca aaaatgactt ataagatgcg agccgtacag gtgctggcag
10261 agacactact ggctaaagga ataggagagc tattcaggga aaatgggatg gttaagggag
10321 agatagacct acttaaaaga ttgactactc tttctgtctc aggcgtcccc aggactgatt
10381 cagtgtacaa taactctaaa tcatcagaga agagaaacga aggcatggga aataagaact
10441 ctggggggta ctgggacgaa aagaagaggt ccagacatga attcaaggca acagattcat
10501 caacagacgg ctatgaaaca ttaagttgct tcctcacaac agacctcaag aaatactgct
10561 taaactggag atttgagagt actgcattgt ttggtcagag atgcaacgag atatttggct
10621 tcaagacctt ctttaactgg atgcatccag tccttgaaag gtgtacaata tatgttggag
10681 atccttactg tccagtcgcc gaccggatgc atcgacaact ccaggatcat gcagactctg
10741 gcattttcat acataatcct agggggggca tagaaggtta ctgccagaag ctgtggacct
10801 taatctcaat gagtgcaatc cacctagcag ctgtgagagt gggtgtcagg gtctctgcaa
10861 tggttcaggg tgacaatcaa gctatagccg tgacatcaag agtacctgta gctcagactt
10921 acaagcagaa gaaaaatcat gtctatgagg agatcaccaa atatttcggt gctctaagac
10981 acgtcatgtt tgatgtaggg cacgagctaa aattgaacga gaccatcatt agtagcaaga
11041 tgtttgtcta tagtaaaagg atatactatg atgggaagat tttaccacag tgcctgaaag
11101 ccttgaccaa gtgtgtattc tggtccgaga cactggtaga tgaaaacaga tctgcttgtt
11161 cgaacatctc aacatccata gcaaaagcta tcgaaaatgg gtattctcct atactaggct
11221 actgcattgc gttgtataag acctgtcagc aggtgtgcat atcactaggg atgactataa
11281 atccaactat cagcccgacc gtaagagatc aatactttaa gggtaagaat tggctgagat
11341 gtgcagtgtt gattccagca aatgttggag gattcaacta catgtctaca tctagatgct
11401 ttgttagaaa tattggagac cccgcagtag cagccctagc tgatctcaaa agattcatca
11461 gagcggatct gttagacaag caggtattat acagggtcat gaatcaagaa cccggtgact
11521 ctagttttct agattgggct tcagaccctt attcgtgtaa cctcccgcat tctcagagta
11581 taactacgat tataaagaat atcactgcta gatctgtgct gcaggaatcc ccgaatcctc
11641 tactgtctgg tctcttcacc gagactagtg gagaagagga tctcaacctg gcctcgttcc
11701 ttatggaccg gaaagtcatc ctgccgagag tggctcatga gatcctgggt aattccttaa
11761 ctggagttag ggaggcgatt gcagggatgc ttgatacgac caagtctcta gtgagagcca
11821 gcgttaggaa aggaggatta tcatatggga tattgaggag gcttgtcaat tatgatctat
11881 tgcagtacga gacactgact agaactctca ggaaaccggt gaaagacaac atcgaatatg
11941 agtatatgtg ttcagttgag ctagctgtcg gtctaaggca gaaaatgtgg atccacctga
12001 cttacgggag acccatacat gggctagaaa caccagaccc tttagagctc ttgaggggaa
12061 tatttatcga aggttcagag gtgtgcaagc tttgcaggtc tgaaggagca gaccccatct
12121 atacatggtt ctatcttcct gacaatatag acctggacac gcttacaaac ggaagtccgg
12181 ctataagaat cccctatttt ggatcagcca ctgatgaaag gtcggaagcc caactcgggt
12241 atgtaagaaa tctaagcaaa cccgcaaagg cggccatccg gatagctatg gtgtatacgt
12301 gggcctacgg gactgatgag atatcgtgga tggaagccgc tcttatagcc caaacaagag
12361 ctaatctgag cttagagaat ctaaagctgc tgactcctgt ttcaacctcc actaatctat
12421 ctcataggtt gaaagatacg gcaacccaga tgaagttctc tagtgcaaca ctagtccgtg
12481 caagtcggtt cataacaata tcaaatgata acatggcact caaagaagca ggggagtcga
12541 aggatactaa tctcgtgtat cagcagatta tgctaactgg gctaagcttg ttcgagttca
12601 atatgagata taagaaaggt tccttaggga agccactgat attgcactta catcttaata
12661 acgggtgctg tataatggag tccccacagg aggcgaatat ccccccaagg tccacattag
12721 atttagagat tacacaagag aacaataaat tgatctatga tcctgatcca ctcaaggatg
12781 tggaccttga gctatttagc aaggtcagag atgttgtaca cacagttgac atgacttatt
12841 ggtcagatga tgaagttatc agagcaacca gtatctgtac tgcaatgacg atagctgata
12901 caatgtctca attagataga gacaacttaa aagagatgat cgcactagta aatgacgatg
12961 atgtcaacag cttgattact gagtttatgg tgattgatgt tcctttattt tgctcaacgt
13021 tcgggggtat tctagtcaat cagtttgcat actcactcta cggcttaaac atcagaggaa
13081 gggaagaaat atggggacat gtagtccgga ttcttaaaga tacctcccac gcagttttaa
13141 aagtcttatc taatgctcta tctcatccca aaatcttcaa acgattctgg aatgcaggtg
13201 tcgtggaacc tgtgtatggg cctaacctct caaatcagga taagatactc ttggccctct
13261 ctgtctgtga atattctgtg gatctattca tgcacgattg gcaagggggt gtaccgcttg
13321 agatctttat ctgtgacaat gacccagatg tggccgacat gaggaggtcc tctttcttgg
13381 caagacatct tgcataccta tgcagcttgg cagagatatc tagggatggg ccaagattag
13441 aatcaatgaa ctctctagag aggctcgagt cactaaagag ttacctggaa ctcacatttc
13501 ttgatgaccc ggtactgagg tacagtcagt tgactggcct agtcatcaaa gtattcccat
13561 ctactttgac ctatatccgg aagtcatcta taaaagtgtt aaggacaaga ggtataggag
13621 tccctgaagt cttagaagat tgggatcccg aggcagataa tgcactgtta gatggtatcg
13681 cggcagaaat acaacagaat attcctttgg gacatcagac tagagcccct ttttgggggt
13741 tgagagtatc caagtcacag gtactgcgtc tccgggggta caaggagatc acaagaggtg
13801 agataggcag atcaggtgtt ggtctgacgt taccattcga tggaagatat ctatctcacc
13861 agctgaggct ctttggcatc aacagtacta gctgcttgaa agcacttgaa cttacctacc
13921 tattgagccc cttagttgac aaggataaag ataggctata tttaggggaa ggagctgggg
13981 ccatgctttc ctgttatgac gctactcttg gcccatgcat caactattat aactcagggg
14041 tatactcttg tgatgtcaat gggcagagag agttaaatat atatcctgct gaggtggcac
14101 tagtgggaaa gaaattaaac aatgttacta gtctgggtca aagagttaaa gtgttattca
14161 acgggaatcc tggctcgaca tggattggga atgatgagtg tgaggctttg atttggaatg
14221 aattacagaa tagctcgata ggcctagtcc actgtgacat ggagggagga gatcataagg
14281 atgatcaagt tgtactgcat gagcattaca gtgtaatccg gatcgcgtat ctggtggggg
14341 atcgagacgt tgtgcttata agcaagattg ctcccaggct gggcacggat tggaccaggc
14401 agctcagcct atatctgaga tactgggacg aggttaacct aatagtgctt aaaacatcta
14461 accctgcttc cacagagatg tatctcctat cgaggcaccc caaatctgac attatagagg
14521 acagcaagac agtgttagct agtctcctcc ctttgtcaaa agaagatagc atcaagatag
14581 aaaagtggat cttaatagag aaggcaaagg ctcacgaatg ggttactcgg gaattgagag
14641 aaggaagctc ttcatcaggg atgcttagac cttaccatca agcactgcag acgtttggct
14701 ttgaaccaaa cttgtataaa ttgagcagag atttcttgtc caccatgaac atagctgata
14761 cacacaactg catgatagct ttcaacaggg ttttgaagga tacaatcttc gaatgggcta
14821 gaataactga gtcagataaa aggcttaaac taactggtaa gtatgacctg tatcctgtga
14881 gagattcagg caagttgaag acaatttcta gaagacttgt gctatcttgg atatctttat
14941 ctatgtccac aagattggta actgggtcat tccctgacca gaagtttgaa gcaagacttc
15001 aattgggaat agtttcatta tcatcccgtg aaatcaggaa cctgagggtt atcacaaaaa
15061 ctttattaga caggtttgag gatattatac atagtataac gtatagattc ctcaccaaag
15121 aaataaagat tttgatgaag attttagggg cagtcaagat gttcggggcc aggcaaaatg
15181 aatacacgac cgtgattgat gatggatcac tgggtgatat cgagccatat gacagctcgt
15241 aataattagt ccctatcgtg ccgaacgatc gaagctccgc ggtacctgga agtcttggac
15301 ttgtccatat gacaatagta agaaaaactt acaagaagac aagaaaattt aaaaggatac
15361 atatctctta aactcttgtc tggt
//
LOCUS HUMKRASPO 111 bp ds-DNA PRI 18-JUL-1990
DEFINITION Human K-ras proto-oncogene, exon 1.
ACCESSION M34904
KEYWORDS K-ras proto-oncogene.
SOURCE Human normal fetal liver DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 111)
AUTHORS Santos,E., Martin-Zanca,D., Reddy,E.P., Pierotti,M.A.,
Della Porta,G. and Barbacid,M.
TITLE Malignant activation of a K-ras oncogene in lung carcinoma but not
in normal tissue of the same patient
JOURNAL Science 223, 661-664 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 / 111 K-ras p21 protein, exon 1
BASE COUNT 35 a 16 c 30 g 30 t
ORIGIN
1 atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg
61 atacagctaa ttcagaatca ttttgtggac gaatatgatc caacaataga g
//
LOCUS HUMGLYE 498 bp ss-mRNA PRI 18-JUL-1990
DEFINITION Human glycophorin E mRNA, complete cds.
ACCESSION M29610 J05183 M29620
KEYWORDS glycophorin.
SOURCE Human erythroleukemia proerythroblast cell line K562, cDNA to mRNA,
clone pHGpE-7.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 498)
AUTHORS Kudo,S. and Fukuda,M.
TITLE Identification of a novel human glycophorin, glycophorin E, by
isolation of genomic clones and complementary DNA clones utilizing
polymerase chain
JOURNAL J. Biol. Chem. 265, 1102-1110 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Kudo 31-OCT-1989.
FEATURES from to/span description
pept 57 293 glycophorin E
signal 470 481 polyA signal
BASE COUNT 128 a 107 c 94 g 169 t
ORIGIN
1 agttgtcttt ggtagttttt ttgcactaac ttcaggagcc agctcgtgat ctcaggatgt
61 atggaaaaat aatctttgta ttactattgt cagaaattgt gagcatatca gcatcaagta
121 ccactggtgt ggcaatgcac acttcaacct cttcttcagt cacaaagagt tacatctcat
181 cacagacaaa tgggataaca ctcattaatt ggtgggcgat ggctcgtgtt atttttgagg
241 tgatgcttgt tgttgttgga atgatcatct taatttctta ctgtattcga tgactgataa
301 aggcatgagg atgtggcctg catgctgcct gatcttgcct agaaccagct gcacctgctg
361 ttctcttgtt atgcaaactg gctgcacctg ctattccttt gcttatgccc caacccttgg
421 ctatcctaac tccctgttct cctgcctatt actgtattct ctacttctaa ataaaaataa
481 aacaaaatac aaattatt
//
LOCUS BAMSYNAMY 85 bp ds-DNA SYN 18-JUL-1990
DEFINITION Synthetic B.amyloliquefaciens alpha-amylase gene promoter region.
ACCESSION M34931
KEYWORDS .
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 85)
AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
TITLE Two tandemly located promoters, artificially constructed, are
active in a Bacillus subtilis alpha-amylase secretion vector
JOURNAL J. Biochem. 99, 1181-1190 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 72 > 85 synthetic alpha-amylase
BASE COUNT 34 a 8 c 19 g 24 t
ORIGIN
1 actcggggtt gttattattt tatcgatatg taaaatataa tttctagaag aaaagaaggt
61 ggagaggaaa catgatccaa aaacg
//
LOCUS PMP5END 139 bp ss-RNA VRL 18-JUL-1990
DEFINITION Papaya mosaic virus, 5' end.
ACCESSION M35054
KEYWORDS .
SOURCE Papaya mosaic virus RNA.
ORGANISM Papaya mosaic potexvirus
Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses;
Potexvirus.
REFERENCE 1 (bases 1 to 139)
AUTHORS Lok,S. and Abouhaidar,M.G.
TITLE The nucleotide sequence of the 5' end of papaya mosaic virus RNA:
Site of in vitro assembly initiation
JOURNAL Virology 153, 289-296 (1986)
STANDARD simple staff_entry
BASE COUNT 58 a 23 c 30 g 27 t 1 others
ORIGIN
1 ggaaaagaaa cacaaagcaa agcaaagcaa agcaactcaa ataaancata tttggtcaag
61 gcacttggta atcaaacggg cacaacccta gattaatgat atggcaaatt tgaggagtgt
121 tttcgaacag ttgaacgag
//
LOCUS MZEEMA 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Z.mexicana 180-bp knob-specific repeat region.
ACCESSION M32533
KEYWORDS .
SOURCE Z.mexicana (Teosinte) 6-day-old seedling DNA, clone pEm1.
ORGANISM Zea mexicana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
COMMENT Zea mexicana is also known as Zea mays subspecies mexicana.
BASE COUNT 58 a 38 c 39 g 45 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccattttc aataatatcg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat
//
LOCUS MZEEMB 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Z.mexicana 180-bp knob-specific repeat region.
ACCESSION M32534
KEYWORDS .
SOURCE Z.mexicana (Teosinte) 6-day-old seedling DNA, clone pEm2.
ORGANISM Zea mexicana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
COMMENT Zea mexicana is also known as Zea mays subspecies mexicana.
BASE COUNT 60 a 40 c 38 g 42 t
ORIGIN
1 ggccacacaa accccatttt tgtcgaaaat agccatgaac gaccattttc aataatatcg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gatctacgac aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat
//
LOCUS MZEZMA 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32521
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm4-5.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 58 a 40 c 39 g 43 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat
//
LOCUS MZEZMB 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32522
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm4-21.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 58 a 40 c 39 g 43 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat
//
LOCUS MZEZMC 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32523
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm4-10.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 58 a 41 c 38 g 43 t
ORIGIN
1 ggccacacaa cccccatttt tttcgaaaat agccatgaac gaccattttc aataataccg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gacgatcaat
//
LOCUS MZEZMD 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32524
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm4-25.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 58 a 39 c 40 g 43 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcgtctggtc gatgatcaat
//
LOCUS MZEZME 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32525
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm2-3(I).
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 59 a 40 c 38 g 43 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctagtc gatgatcaat
//
LOCUS MZEZMF 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32526
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm4-7.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 59 a 40 c 40 g 41 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aaaaataccg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctgtggtc gacgatcaat
//
LOCUS MZEZMG 179 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32527
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm4-26.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 179)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 59 a 38 c 38 g 44 t
ORIGIN
1 ggccacaaac ccccattttt gtcgaaaata gccatgaacg accattttca ataataccga
61 aggctaacac ctatggattt ttgaccaaga aatggtctcc accagaaatc caagaatgtg
121 atctatggca aggaaacata tgtggggtga ggtgtatgag cctctagtcg atgatcaat
//
LOCUS MZEZMH 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32528
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm4-14.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 61 a 39 c 38 g 42 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccatttac aataataccg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaaaaatgt
121 gatctatgga aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat
//
LOCUS MZEZMI 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32529
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm4-13.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 61 a 40 c 36 g 43 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat aaccatgaac gaccattttc aataatatcg
61 aaggctaaca cctatggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 aatctacgac aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat
//
LOCUS MZEZMJ 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32530
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm4-1.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 59 a 35 c 38 g 48 t
ORIGIN
1 ggccacacaa cccgcatttt ttttgaaaat agccatgaat gaccattttc aataatatta
61 gaggctaaca cctacgagtt tttaaccaag aaatggtctc caccagaaat tcaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gacgatcaat
//
LOCUS MZEZMK 179 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32531
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm2-1(I).
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 179)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 62 a 39 c 37 g 41 t
ORIGIN
1 ggccacacaa cccccatttt ttcaaaaata gccatgaacg accattttca aaaatacgaa
61 aggctaacac gtacggattt ttgaccatga aatggtctcc accagaaatc cgaaaatgag
121 atctacgaca aggaaacata tgtggggtga ggtgtatgag cctctggtcg atgatcaat
//
LOCUS MZEZML 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION Maize 180-bp knob-specific repeat region.
ACCESSION M32532
KEYWORDS .
SOURCE Maize 6-day-old seedling DNA, clone pZm2-1(II).
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 59 a 34 c 40 g 47 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccattttc aaaaatacga
61 aaggctaaca cgtacggatt tttgaccatg aaatggtttt caccagaaat tcaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat
//
LOCUS TRD2AA 177 bp ds-DNA PLN 18-JUL-1990
DEFINITION T.dactyloides 180-bp knob-specific repeat region.
ACCESSION M32535
KEYWORDS .
SOURCE T.dactyloides 6-day-old seedling DNA, clone 2.
ORGANISM Tripsacum dactyloides
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 177)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 60 a 38 c 40 g 39 t
ORIGIN
1 ggccacacaa cccccattta tgtcggaaat agcgatgaac gaccattttc aataatacga
61 aggtaacacc tacgaaattt tgaccaagat aggtctcaac cagaaatcca agaatgtgac
121 ctatggcaag gaaacatatg tggggtgagg tgtacgagcc tctggtcgat gatcaat
//
LOCUS TRD2AB 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION T.dactyloides 180-bp knob-specific repeat region.
ACCESSION M32536
KEYWORDS .
SOURCE T.dactyloides 6-day-old seedling DNA, clone 4.
ORGANISM Tripsacum dactyloides
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 59 a 40 c 40 g 41 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agcgatgaac gaccattttc aataatacca
61 aaggctaaca cctatggatt ttggaccaag aaatggtctc caccagaaat ccaagaatgt
121 gacctatggc aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat
//
LOCUS TRD2AC 180 bp ds-DNA PLN 18-JUL-1990
DEFINITION T.dactyloides 180-bp knob-specific repeat region.
ACCESSION M32537
KEYWORDS .
SOURCE T.dactyloides 6-day-old seedling DNA, clone 7.
ORGANISM Tripsacum dactyloides
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 60 a 41 c 37 g 42 t
ORIGIN
1 ggccacacaa cccccatttt tgtttaaaat agccaagaac gaccattttc aataataccg
61 aatgctaaca cctatggatt attgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gacctatggc aaggaaacat atgtggggtg aggtgtacga gcctccggtc gatgatcaat
//
LOCUS TRD2AD 179 bp ds-DNA PLN 18-JUL-1990
DEFINITION T.dactyloides 180-bp knob-specific repeat region.
ACCESSION M32538
KEYWORDS .
SOURCE T.dactyloides 6-day-old seedling DNA, clone 8.
ORGANISM Tripsacum dactyloides
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 179)
AUTHORS Dennis,E.S. and Peacock,W.J.
TITLE Knob heterochromatin homology in maize and its relatives
JOURNAL J. Mol. Evol. 20, 341-350 (1984)
STANDARD simple staff_review
BASE COUNT 58 a 39 c 38 g 44 t
ORIGIN
1 ggccacacaa cccccatttt ttttgaaaat agccatgaac gaccattttc aataataccg
61 aaggtaacac ctacggattt ttgaccaaga aatggtctcc accagaaatc caagaatgtg
121 acctatggca aggaaacata tgtggggtga ggtgtatgag cctctggtcg atgatcaat
//
LOCUS VSVNNJ 1329 bp ss-mRNA VRL 18-JUL-1990
DEFINITION Vesicular stomatitis virus (New Jersey) nucleocapsid protein mRNA,
complete cds.
ACCESSION K02379 M35062
KEYWORDS nucleocapsid protein; protein N.
SOURCE VSV (New Jersey serotype; Ogden strain), cDNA to mRNA, clones 53
and 77.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Banerjee,A.K., Rhodes,D.P. and Gill,D.S.
TITLE Complete nucleotide sequence of the mRNA coding for the N protein
of vesicular stomatitis virus (New Jersey serotype)
JOURNAL Virology 137, 432-438 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 1234 to 1329)
AUTHORS Rhodes,D.P. and Banerjee,A.K.
TITLE Poly(A)-adjacent sequence of the 14.5 S mRNA of vesicular
stomatitis virus (New Jersey serotype)
JOURNAL Virology 105, 297-300 (1980)
STANDARD simple staff_entry
FEATURES from to/span description
pept 14 1282 nucleocapsid protein
mRNA 1 1329 N mRNA
conflict 1264 1264 a in [1]; g in [2]
BASE COUNT 443 a 271 c 281 g 334 t
ORIGIN 182 bp upstream of BstNI site.
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcca taattcagcc
61 taaactaccg gccaatgagg atccggtaga gtatccagct gattacttca aaaataacac
121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttaaggg cctatgtcta
181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc
241 gttaaaggga gtagaaggaa cattagacag agactgggtt agctttggaa gaacgatcgg
301 aaagagggaa gagaatgtca aaatcttcga tctagtcaaa gttgaggaac tgaagacagc
361 tcttcctgat gggaaatcgg atccagatcg atcagctgaa gatgataaat ggcttcctat
421 ttacatccta ggcctatatc gagtgggtag atccaaagtt acagattaca gaaagaaact
481 cctggacggt ctagaaaatc agtgtagagt ggcatcaacc agatttgaaa gccttgttga
541 agatggtctt gacttctttg acatctggga aaatgatccc aacttcacca aaatagttgc
601 tgcagtggat atgttttttc acatgttcaa gaagcatgaa cgtgctccaa tcagatatgg
661 aaccatagtc tcacgattca aagactgtgc agcacttgct acatttggac atctcagcaa
721 ggttagcgga ctttcaattg aagacctaac aacatgggtt ctgaatcgag aagtcgcaga
781 cgaactatgc caaatgatgt acccaggcca agaaattgac aaggcagatt catacatgcc
841 atatatgatt gatttcgggt tatcccaaaa atctccttat tcatcagtga agaatccagc
901 tttccacttc tggggtcaac tggccgcact cttgctaaga tcaactcgag caaaaaatgc
961 tcggcaaccc gatgacattg agtacacttc gctgacttgt gcaagtttac tactgtcatt
1021 tgcagttgga tcttcagcag acattgaaca acaattctac atcggagagg acaaatacac
1081 aacagagaaa gatgacagtc tcaagaaatc tgatgtccca ccgaaaggaa gaaatgttgt
1141 agactggctc ggttggtatg atgacaatgg agggaaaccc acaccggata tgctcaactt
1201 cgcacgaaga gcagtcagtt cgttgcagtc actccgcgag aaaacaattg gcaaatacgc
1261 aaaagtagaa tttgacaaat gaccaggttt gacatatcaa tatcaatatt atattctgtg
1321 ctatatatg
//
LOCUS TNCPOLY 3660 bp ss-RNA VRL 18-JUL-1990
DEFINITION Tobacco necrosis virus RNA-dependent RNA polymerase and coat
protein genes, complete cds.
ACCESSION M33002
KEYWORDS RNA-dependent RNA polymerase; coat protein; polymerase.
SOURCE Tobacco necrosis virus (strain A, isolate TNV-A-FM1B) Wk6 cell
line, cDNA to viral RNA, clones pFM2[0,1,2,3].
ORGANISM Tobacco necrosis virus
Viridae; ss-RNA nonenveloped viruses.
REFERENCE 1 (bases 1 to 3660)
AUTHORS Meulewaeter,F., Seurinck,J. and van Emmelo,J.
TITLE Genome structure of tobacco necrosis virus strain A
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Meulewaeter, 20-MAR-1990, for release after publication.
FEATURES from to/span description
pept 60 665 RNA-dependent RNA polymerase (put.)
/transl_except=(666..668)
669 2234 RNA-dependent RNA polymerase
pept 2218 2436 ORF 3
pept 2440 2610 ORF 4
pept 2613 3443 coat protein
pept 3467 3646 ORF 6
site 666 668 readthrough codon
BASE COUNT 1034 a 890 c 895 g 841 t
ORIGIN
1 agtattcata ccaagaatac caaataggtg caaggcctta ctcagctaaa gagtctaaaa
61 tggagctacc aaaccaacac aagcaaacgg ccgccgaggg tttcgtatct ttcctaaact
121 ggctatgcaa cccatggaga cgacagcgaa cagtcaacgc tgcagttgcg ttccaaaaag
181 atcttctcgc cattgaggat tccgagcatt tggatgacat caatgagtgt ttcgaggagt
241 ctgctggggc acaatctcag cgaactaagg ttgtcgccga cggagcatat gcccccgcaa
301 aatccaacag gacccgccga gttcgtaagc agaagaagca caagtttgta aaatatcttg
361 tcaacgaagc tcgtgccgag tttggattgc ccaaaccaac tgaggcaaac agacttatgg
421 tccaacattt cttgctcaga gtgtgcaagg attggggcgt tgttactgcc cacgtacacg
481 gcaatgttgc actagctttg ccactggtgt tcatcccaac ggaagatgat ctgctatcac
541 gagcattgat gaacacacat gctactagag ccgctgtacg aggcatggac aatgtccaag
601 gggaggggtg gtggaacaat aggttgggga ttgggggcca ggtcggactg gccttccggt
661 ccaaataggg gtgccttgaa aggaggccag gattctccac gtccgtttcg cgtggggaac
721 atcctgatct ggtggtcata ccatcagggc gccctgagaa acagcgtcag ttgttacgct
781 atagtggtat aggcggccat ttattaatcg gcatccacaa caactctctt tccaacctgc
841 gtaggggctt gatggaaaga gtattctatg tcgaggggcc caatgggctt caagacgccc
901 ctaagcccgt caagggagct tttcgaaccc ttgataagtt tcgtgatctc tatactaaaa
961 atagttggcg tcatacccct gtaactagtg aacaattcct aatgaattac acgggcagga
1021 aactgactat ttacagagag gcggttgata gtttgtcgca tcaacccctt agctcacgag
1081 atgcgaaact aaagacattc gtgaaggccg aaaaattaaa tctttctaag aagcctgacc
1141 ctgctcccag ggtcatccaa cctagatcgc ctcggtataa cgtttgtttg ggcaggtacc
1201 tccgacatta tgagcatcac gcgtttaaaa ccattgccaa gtgctttggg gaaatcacgg
1261 tcttcaaagg gtttactctg gagcaacaag gggaaatcat gcgctcgaag tggaataaat
1321 atgttaatcc cgtcgcagtc ggactcgacg ccagtcgttt cgaccaacac gtgtctgttg
1381 aagcactcga gtatgagcat gaattttacc tcagagacta cccaaatgat aaacagctaa
1441 aatggctgct aaagcagcaa ttgtgcaacg taggaacggc attcgccagt gacggcatta
1501 taaaatacaa gaagaagggt tgtagaatga gcggagacat gaacacgagc ttgggcaact
1561 gcattctaat gtgcgccatg gtctacgggt tgaaagaaca cttaaacatc aatttgtccc
1621 ttgcaaataa tggggatgac tgcgtcattg tctgtgagaa agcggattta aagaaattga
1681 caagcagcat cgagccatat ttcaagcagt ttggattcaa gatggaagtg gaaaaacccg
1741 tggatatatt tgagcgcata gaattttgcc aaacccaacc tgtgttcgat ggatcccagt
1801 acatcatggt acgcaaacct tctgtggtaa catctaaaga cgtcactagc cttatcccat
1861 gtcaaacgaa agcacaatac gcagaatggc tgcaagctgt aggtgagtgt ggcatgagca
1921 ttaacggtgg gattcctgtc atgcagaatt tctaccaaaa gctccaaact ggcatccgcc
1981 gcacaaaatt caccaagacc ggcgagttcc agacgaacgg attggggtat cactctagat
2041 atatgcatag agtggcccgg gttccttcgc ctgaaacccg tttatccttc tatctagctt
2101 tcggtatcac accagacctc caagaagcat tggagatctt ctatgatacc cacaagcttg
2161 agttggatga tgttatccca actgatacct accaagtgtc aggagagcat ttgatcaatg
2221 gattaccaaa ctgatgtaac ggaggacaat gtgcaaatac gcggtcgggc taggagcgtt
2281 gagggtaaga aacacaatgg ttcgggatta actggcgtta agcgtcacgc ggtgagcgaa
2341 acatctcaga aatcacagca aggtactggc aatggaacta tgaccaatat agccgaagaa
2401 cagaccatta ccgtgacata caactttaac ttttaagtta tggctgcgtg tcgctgttgt
2461 gatacttcac caggtattac actattccct tactttgcaa ttctcatcct tatattggca
2521 atacttgttg tagggactcc caatcaacaa tatcaccatt ctccaagcac ttacgagtac
2581 aagactcaac acatttcgat cgcaaaatag acatggcagg aaagaagaac aacaacaacg
2641 gtcagtatat aatactgcgt actccagagc aacaggtgga gatagaccag cgcaacgccc
2701 gtcgtgctca aatgggtcgc atgaagaagg ctagacagcc cgttcagcga tacttacagc
2761 aacacgggtt gcgaaacgga ttgtccggta gagggggcta catagtggct cccacctccg
2821 ggggggttgt cactcgaccc atagtgccga aattctccaa caggggagat tccactatag
2881 tccgtaacac tgagattttg aacaaccaaa tcttagcggc gctaggcgca ttcaatacaa
2941 caaactccgc actgattgca gcagcaccat catggctggc tagcatcgct gatctttaca
3001 gtaaatacag atggctctca tgtgagatca tctacattcc aaaatgcccc accaccacca
3061 gtggatcaat tgccatggct ttcacatacg acagaaatga cgctgcaccc accgcaaggg
3121 ctcagctgtc acaatcttac aaggccatca attttccacc gtatgcggga tacgacggag
3181 cagcatattt gaattcgaac cagggagctg ggtcagccat cgccgttcaa cttgatgtta
3241 ccaagttgga caagccatgg taccccacta tctcctctgc cggcttcggg gcgctcagcg
3301 tcctcgatca gaaccaattc tgccccgcgt cccttgtggt cgctagcgat gggggacccg
3361 ctactgctac tccagcaggg gaccttttca tcaagtacgt gattgagttc attgaaccaa
3421 tcaacccaac aatgaacgtc tagttctttg tactgtaact tggctaatgc ctaaggtgga
3481 gtcacaccat tggagacgga gacggatcct gggaaacagg cttgacgggc ggggggtggt
3541 gcccccgacg acgcatcact ccggatacca atggtacacc actatggcag ggtctgccaa
3601 ggtcttgtgc accaagaacc cctggaaacg ggggggaggg gggtagcaca tatcatccag
//
LOCUS BLYPROZ 75 bp ss-mRNA PLN 18-JUL-1990
DEFINITION Barley protein Z mRNA, partial cds.
ACCESSION M35065
KEYWORDS albumin; protein Z.
SOURCE Barley, cDNA to mRNA, clone paz1-1.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Rasmussen,S.K., Hopp,H.E., Brandt,A., Svendsen,I. and Hejgaard,J.
TITLE A cDNA clone for protein Z, a major barley endosperm albumin
JOURNAL Carlsberg Res. Commun. 49, 385-390 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 75 protein Z (AA at 1)
BASE COUNT 12 a 19 c 19 g 25 t
ORIGIN
1 cccctgaagg tggatttggt ggattttgtc gcgaatcacc ctttcctctt ccttattcgg
61 gaagacattg ccggt
//
LOCUS BSURPOF 3078 bp ds-DNA BCT 18-JUL-1990
DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB),
complete cds.
ACCESSION M34995
KEYWORDS RNA polymerase; minor sigma-37 factor.
SOURCE B.subtilis (strains 168 and Marburg), clones
lambda-[21,gt11,1,11,21,31].
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 368 to 2297)
AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W.
TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus
subtilis RNA polymerase: Isolation, nucleotide sequence,
chromosomal locus, and cryptic function
JOURNAL J. Bacteriol. 169, 771-778 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 3078)
AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W.
TITLE Similar organization of the sigB and spoIIA operons encoding
alternate sigma factors of Bacillus subtilis RNA polymerase
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.W.Price, 18-FEB-1987. Draft entry and computer-readable
sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990.
FEATURES from to/span description
pept 200 529 ORF V
pept 526 1008 ORF W
pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg
start codon)
pept 1762 2361 ORF X
mRNA 131 2368 sigB operon mRNA
signal 131 168 sigB operon promoter
signal 2368 2401 sigB transcription termination signal
binding 188 194 ORF V ribosomal binding site
binding 511 519 ORF W ribosomal binding site
binding 954 961 sigB ribosomal binding site
binding 1750 1753 ORF X ribosomal binding site
BASE COUNT 999 a 575 c 755 g 749 t
ORIGIN 40 degrees on B.subtilis map.
1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt
61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt
121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg
181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat
241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct
301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat
361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg
421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt
481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct
541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg
601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc
661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg
721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa
781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac
841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat
901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg
961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga
1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg
1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca
1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc
1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg
1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc
1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga
1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg
1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac
1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca
1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct
1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat
1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga
1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta
1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct
1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac
1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc
1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg
2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt
2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc
2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata
2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta
2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat
2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt
2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg
2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac
2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga
2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac
2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca
2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag
2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa
2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt
2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt
2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc
3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg
3061 caaactgctg cgaagctt
//
LOCUS HUMERPR 1624 bp ss-mRNA PRI 18-JUL-1990
DEFINITION Human erythropoietin receptor mRNA, complete cds.
ACCESSION M34986
KEYWORDS erythropoietin receptor.
SOURCE Human cell line OCI-M1, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1624)
AUTHORS Ehrenman,K.St. and St John,T.
TITLE Sequence of the human erythropoietin gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.St.John, 07-JUN-1990.
Author address: T.St.John
Hutchinson Cancer Res Center
1124 Columbia St.
Seattle, WA
Email: TSTJOHN@FHCRCVAX
FEATURES from to/span description
pept 1 1527 erythropoietin receptor precursor
sigp 1 72 erythropoietin receptor signal peptide
matp 73 1524 erythropoietin receptor
BASE COUNT 276 a 537 c 498 g 313 t
ORIGIN
1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct
61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc
121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg
181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc
241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt
301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta
361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat
421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc
481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac
541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag
601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc
661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg
721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg
781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag
841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac
901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc
961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg
1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc
1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg
1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc
1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct
1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca
1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct
1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta
1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg
1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata
1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc
1621 agga
//
LOCUS HUMGFP40H 4379 bp ds-DNA PRI 18-JUL-1990
DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete
cds.
ACCESSION M30135
KEYWORDS T-cell growth factor; cytokine; mast cell growth factor;
megakeryoblastic leukemia cell growth factor.
SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4379)
AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
Van Snick,J.
TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic
organization, and comparison with the mouse gene
JOURNAL J. Immunol. 144, 4235-4241 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Van Snick, 12-DEC-1989.
FEATURES from to/span description
pept 796 909 P40 T-cell and mast cell growth factor
precursor, exon 1
1022 1057 P40 T-cell and mast cell growth factor
precursor, exon 2
1140 1172 P40 T-cell and mast cell growth factor
precursor, exon 3
2461 2592 P40 T-cell and mast cell growth factor
precursor, exon 4
4096 4215 P40 T-cell and mast cell growth factor
precursor, exon 5
sigp 796 849 P40 T-cell and mast cell growth factor signal
peptide
matp 850 909 P40 T-cell and mast cell growth factor
1022 1057 P40 T-cell and mast cell growth factor
1140 1172 P40 T-cell and mast cell growth factor
2461 2592 P40 T-cell and mast cell growth factor
4096 4212 P40 T-cell and mast cell growth factor
IVS 910 1021 hP40 intron A
IVS 1058 1139 hP40 intron B
IVS 1173 2460 hP40 intron C
IVS 2593 4095 hP40 intron D
signal 742 747 TATA box
signal 4354 4359 polyA signal
rpt 2965 3265 Alu repeat
binding 370 376 AP-2 binding site
binding 624 629 IRF-1 binding site
binding 627 633 AP-1 binding site
BASE COUNT 1208 a 921 c 904 g 1346 t
ORIGIN
1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt
61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc
121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca
181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct
241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact
301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc
361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag
421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa
481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt
541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt
601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa
661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt
721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc
781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc
841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac
901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat
961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta
1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac
1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg
1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg
1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat
1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt
1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt
1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag
1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa
1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc
1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg
1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc
1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc
1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat
1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac
1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc
1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg
1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt
2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc
2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga
2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc
2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa
2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa
2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt
2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag
2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg
2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac
2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt
2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc
2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc
2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct
2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag
2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa
2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg
3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg
3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat
3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca
3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc
3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt
3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac
3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg
3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc
3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt
3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca
3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca
3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact
3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga
3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt
3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc
3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc
3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg
4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa
4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac
4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg
4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc
4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga
4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa
//
LOCUS MEFDNABP 1710 bp ds-DNA BCT 18-JUL-1990
DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds.
ACCESSION M34778
KEYWORDS DNA binding protein HMfB.
SOURCE M.fervidus DNA.
ORGANISM Methanothermus fervidus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanothermaceae.
REFERENCE 1 (sites; for [2])
AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
TITLE DNA binding protein HMf, isolated from the hypothermophilic
archaeal Methanothermus fervidus is most closely related to
histones
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1710)
AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Sandman, 01-JUN-1990.
Author address: K.Sandman
Department of Microbiology
Ohio State University
484 W. 12th Ave.
Columbus, Ohio 43210
E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU
FEATURES from to/span description
pept 578 787 DNA binding protein HMfB
pept 1662 805 (c) ORF285
signal 450 457 promoter
signal 523 530 promoter
signal 790 814 transcription termination signal
BASE COUNT 619 a 263 c 265 g 561 t 2 others
ORIGIN 1 bp upstream of HindIII site.
1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa
61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag
121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag
181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg
241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat
301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat
361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa
421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt
481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt
541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat
601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct
661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag
721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa
781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat
841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc
901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt
961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat
1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa
1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg
1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata
1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc
1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc
1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc
1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat
1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc
1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt
1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt
1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa
1681 gttttgtttt acccattgct tctaaatatt
//
LOCUS MUSINT1A 5607 bp ds-DNA ROD 18-JUL-1990
DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds.
ACCESSION K02593 M34750
KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene.
SOURCE Mouse (BALB/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1098 to 5606)
AUTHORS Van Ooyen,A. and Nusse,R.
TITLE Structure and nucleotide sequence of the putative mammary oncogene
int-1; Proviral insertions leave the protein-encoding domain intact
JOURNAL Cell 39, 233-240 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 5607)
AUTHORS Nusse,R., Theunisseen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A.,
Otte,A., Schuuring,E. and Van Ooyen,A.
TITLE The promoter of the Wnt-1 (int-1) oncogene and its mechanism of
activation by insertion of proviral DNA of the mouse mammary tumor
virus
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [2] kindly submitted
by R.Nusse, 31-MAY-1990.
FEATURES from to/span description
pept 1779 1882 Wnt-1 protein (exon 1)
2452 2705 Wnt-1 protein (exon 2)
3279 3544 Wnt-1 protein (exon 3)
4002 4490 Wnt-1 protein (exon 4)
IVS 1883 2451 Wnt-1 cds intron A
IVS 2706 3278 Wnt-1 cds intron B
IVS 3545 4001 Wnt-1 cds intron C
site 1133 1134 MMTV insertion site in tumor 17 [1]
site 1394 1395 MMTV insertion site in tumor 35 [1]
site 1631 1632 MMTV insertion site in tumor 102 [1]
site 4495 4496 MMTV insertion site in tumor 53 [1]
mRNA 1428 5398 Wnt-1 mRNA (alt.)
mRNA 1595 5398 Wnt-1 mRNA (alt.)
signal 1562 1566 TATA box
signal 1573 1577 TATA box
BASE COUNT 1188 a 1681 c 1519 g 1219 t
ORIGIN Chromosome 15.
1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc
61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac
121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac
181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa
241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg
301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg
361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac
421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct
481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact
541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac
601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag
661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg
721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga
781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca
841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact
901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc
961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca
1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg
1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga
1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg
1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg
1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc
1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc
1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg
1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt
1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa
1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag
1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact
1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt
1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc
1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg
1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg
1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag
1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct
2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc
2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct
2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt
2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg
2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg
2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg
2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt
2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga
2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat
2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg
2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa
2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc
2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag
2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg
2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg
2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc
3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag
3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag
3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact
3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac
3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat
3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc
3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg
3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc
3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg
3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg
3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag
3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag
3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt
3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca
3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac
3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt
3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc
4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc
4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc
4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg
4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat
4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt
4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc
4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg
4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc
4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg
4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc
4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga
4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc
4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac
4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac
4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc
4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa
4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc
5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc
5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac
5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga
5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac
5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct
5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat
5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca
5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga
5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat
5581 gaccctaaat gagaccaact agggatc
//
LOCUS MUSP40M 3808 bp ds-DNA ROD 18-JUL-1990
DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete
cds.
ACCESSION M30136
KEYWORDS T-cell growth factor; cytokine; mast cell growth factor.
SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210
DNA, clone lambda-S40.1a.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3808)
AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
Van Snick,J.
TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic
organization, and comparison with the mouse gene
JOURNAL J. Immunol. 144, 4235-4241 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Van Snick, 21-NOV-1989.
FEATURES from to/span description
pept 644 757 P40 T-cell and mast cell growth factor
precursor, exon 1
846 881 P40 T-cell and mast cell growth factor
precursor, exon 2
965 997 P40 T-cell and mast cell growth factor
precursor, exon 3
2184 2315 P40 T-cell and mast cell growth factor
precursor, exon 4
3393 3512 P40 T-cell and mast cell growth factor
precursor, exon 5
sigp 644 697 P40 T-cell and mast cell growth factor signal
peptide
matp 698 757 P40 T-cell and mast cell growth factor
846 881 P40 T-cell and mast cell growth factor
965 997 P40 T-cell and mast cell growth factor
2184 2315 P40 T-cell and mast cell growth factor
3393 3509 P40 T-cell and mast cell growth factor
IVS 758 845 mP40 intron A
IVS 882 964 mP40 intron B
IVS 998 2183 mP40 intron C
IVS 2316 3392 mP40 intron D
signal 589 594 TATA box
binding 227 233 AP-2 binding site
binding 475 480 IRF-1 binding site
binding 478 484 AP-1 binding site
BASE COUNT 1012 a 903 c 869 g 1024 t
ORIGIN Chromosome 13.
1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca
61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt
121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag
181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc
241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga
301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt
361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt
421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga
481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg
541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta
601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat
661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg
721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga
781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac
841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag
901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt
961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc
1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct
1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat
1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga
1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa
1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac
1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac
1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc
1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca
1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta
1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac
1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc
1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct
1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta
1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat
1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg
1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc
1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg
2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa
2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt
2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt
2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa
2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca
2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc
2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag
2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg
2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct
2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag
2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg
2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag
2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat
2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc
2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct
2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc
3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga
3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg
3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga
3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa
3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt
3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc
3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc
3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga
3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg
3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag
3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga
3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac
3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat
3781 aggtaggatg gagcaaagac ttggtaat
//
LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 18-JUL-1990
DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete
cds.
ACCESSION M29938
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone
ZQ30-1.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1105)
AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
LeGuern,C.
TITLE Class II genes of miniature swine: III. Characterization of an
expressed pig class II gene homologous to HLA-DQA
JOURNAL Immunogenetics 31, 52-56 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept 13 777 MHC SLA-DQ alpha chain precursor
sigp 13 81 MHC SLA-DQ alpha chain signal peptide
matp 82 774 MHC SLA-DQ alpha chain
mRNA < 1 1105 MHC SLA-DQ mRNA
BASE COUNT 254 a 290 c 255 g 306 t
ORIGIN
1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc
61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta
121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag
181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa
241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg
301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact
361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac
421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt
481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc
541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat
601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag
661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc
721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc
781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa
841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc
901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg
961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc
1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
1081 aaacaataaa tcacttcatg aggtt
//
LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 18-JUL-1990
DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete
cds.
ACCESSION M29939
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1174)
AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
LeGuern,C.
TITLE Class II genes of miniature swine: III. Characterization of an
expressed pig class II gene homologous to HLA-DQA
JOURNAL Immunogenetics 31, 52-56 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept 69 836 MHC SLA-DQ alpha chain precursor
sigp 69 137 MHC SLA-DQ alpha chain signal peptide
matp 138 833 MHC SLA-DQ alpha chain
BASE COUNT 273 a 313 c 266 g 322 t
ORIGIN
1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg
61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga
121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg
181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat
241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta
301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca
361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt
421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca
481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt
541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca
601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata
661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga
721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct
781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca
841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac
901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct
961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga
1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc
1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
1141 aaacaataaa tcacttcatg aggtttttgt actg
//
LOCUS PIGMHDQBA 212 bp ds-DNA MAM 18-JUL-1990
DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end.
ACCESSION M29944
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F.,
Gustafsson,K. and LeGuern,C.
TITLE Class II genes of miniature swine: II. Molecular identification and
characterization of B (beta) genes from SLA-c haplotype
JOURNAL Immunogenetics 31, 1-6 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept < 1 12 MHC SLA-DO beta chain
BASE COUNT 38 a 58 c 52 g 64 t
ORIGIN
1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg
61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc
121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct
181 cttacccttc tgcctttctt cctcctactt gt
//
LOCUS PP2LEND 266 bp ds-DNA PHG 18-JUL-1990
DEFINITION Bacteriophage P2 left end DNA.
ACCESSION M34756
KEYWORDS .
SOURCE Bacteriophage P2 DNA.
ORGANISM Bacteriophage P2
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 266)
AUTHORS Ziermann,R. and Calendar,R.
TITLE Characterization of the cos sites of Bacteriophages P2 and P4
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Ziermann, 05-JUN-1990.
Author address: R.Ziermann
University of California
Dept. Mol. Cell Biology
Berkeley, CA 94720
Email: calendar@garnet.berkeley.edu
FEATURES from to/span description
site 1 19 cohesive end of phage DNA
site 20 266 left end of phage DNA
BASE COUNT 41 a 58 c 99 g 68 t
ORIGIN Map position 0.0-0.8%.
1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct
61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt
121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg
181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc
241 gagccagccg tttatctcgc ggatcc
//
LOCUS RCABCHH 314 bp ds-DNA BCT 18-JUL-1990
DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and
F108) genes, 3' end, and 5' end respectively.
ACCESSION M34843
KEYWORDS .
SOURCE R.capsulata DNA.
ORGANISM Rhodobacter capsulatus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 314)
AUTHORS Yang,Z. and Bauer,C.E.
TITLE Rhodobacter capsulatus genes involved in early steps of the
bacteriochlorophyll biosynthetic pathway
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.E.Bauer, 04-JUN-1990.
FEATURES from to/span description
pept < 1 79 bacteriochlorophyll biosynthesis protein
(bchH) (AA at 2)
pept 108 > 314 bacteriochlorophyll biosynthesis protein
(F108)
BASE COUNT 59 a 87 c 119 g 49 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga
61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg
121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg
181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg
241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca
301 agcgcgtgct gcag
//
LOCUS TREANTAT 999 bp ds-DNA ORG 18-JUL-1990
DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle.
ACCESSION M34848
KEYWORDS .
SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA.
ORGANISM Kinetoplast Trypanosoma evansi
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Trypanosoma evansi.
REFERENCE 1 (bases 1 to 999)
AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
Steinert,M. and Hamers,R.
TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Bajyana Songa, 05-JUN-1990.
FEATURES from to/span description
site 367 516 kinetoplast minicircle common region (ORI)
BASE COUNT 391 a 78 c 203 g 327 t
ORIGIN
1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS VIBSCRRE 1524 bp ds-DNA BCT 18-JUL-1990
DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds,
and sucrose transport (scrA) protein, 5' end.
ACCESSION M35009
KEYWORDS Scr repressor protein; sucrose transport protein.
SOURCE V.alginolyticus DNA.
ORGANISM Vibrio alginolyticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 1524)
AUTHORS Blatch,G.L. and Woods,D.R.
TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr
repressor-encoding gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.L.Blatch, 10-JUN-1990.
Author address: G.L.Blatch
University of Cape Town
Dept. Microbiology
Private Bag
Rondebosch
7700 SOUTH AFRICA
Email: bltgre01.uctvax@f4.n494.z5.fidonet.org
FEATURES from to/span description
pept 36 < 1 (c) sucrose transport protein (scrA)
pept 271 555 Scr repressor protein (scrR)
pept 791 1090 ORF5
binding 264 267 scrR ribosomal binding site
binding 780 785 ORF5 ribosomal binding site
signal 178 183 scrA -10 region
signal 203 208 scrA -35 region
signal 190 195 scrR -10 region
signal 162 167 scrR -10 region
signal 762 767 ORF5 -35 region
signal 742 747 ORF5 -35 region
site 78 91 repressor operator site (put.)
site 229 242 repressor operator site (put.)
site 12 71 ScrR helix-turn-helix motif
BASE COUNT 466 a 344 c 321 g 393 t
ORIGIN
1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc
61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg
121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt
181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc
241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg
301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa
361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc
421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg
481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt
541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga
601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt
661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac
721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag
781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt
841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga
901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg
961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga
1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta
1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc
1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt
1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt
1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat
1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct
1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca
1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa
1501 gccaattgtg atattgaaga agct
//
LOCUS XELNRP1A 2431 bp ss-mRNA VRT 18-JUL-1990
DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA,
complete cds.
ACCESSION M34894
KEYWORDS RNA-binding protein; nervous system-specific protein.
SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2431)
AUTHORS Richter,K., Good,P.J. and Dawid,I.B.
TITLE A developmentally regulated, nervous system-specific gene in
Xenopus encodes a putative RNA-binding protein
JOURNAL New Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.Good, 05-JUN-1990.
FEATURES from to/span description
pept 74 1117 pot. RNA-binding protein (nrp-1B)
mRNA < 1 2431 nrp-1B mRNA
signal 2407 2418 poly-A signal
site 22 27 RNP2 consensus sequence
site 58 68 RNP1 consensus sequence
site 111 116 RNP2 consensus sequence
site 147 157 RNP1 consensus sequence
site 921 941 alternative exon (put.)
BASE COUNT 663 a 522 c 543 g 703 t
ORIGIN
1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt
61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc
121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact
181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt
241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa
301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt
361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt
421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga
481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac
541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa
601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt
661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat
721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc
781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc
841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc
901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag
961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag
1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc
1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct
1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct
1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga
1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat
1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg
1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat
1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta
1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa
1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat
1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac
1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg
1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc
1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat
1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc
1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga
1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct
2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg
2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc
2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat
2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg
2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca
2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata
2401 gttcaaaata aactatgaat tttattcatt t
//
LOCUS XELNRP1B 2324 bp ss-mRNA VRT 18-JUL-1990
DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA,
complete cds.
ACCESSION M34895
KEYWORDS RNA-binding protein; nervous system-specific protein.
SOURCE X.laevis stage 24, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2324)
AUTHORS Richter,K., Good,P.J. and Dawid,I.B.
TITLE A developmentally regulated, nervous system-specific gene in
Xenopus encodes a putative RNA-binding protein
JOURNAL New Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.Good, 05-JUN-1990.
FEATURES from to/span description
pept 29 1051 pot. RNA-binding protein (nrp-1B)
mRNA < 1 2324 nrp-1B mRNA
signal 2306 2311 poly-A signal
site 22 27 RNP2 consensus sequence
site 58 68 RNP1 consensus sequence
site 111 116 RNP2 consensus sequence
site 147 157 RNP1 consensus sequence
BASE COUNT 647 a 467 c 520 g 690 t
ORIGIN
1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc
61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac
121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt
181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca
241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga
301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat
361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca
421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg
481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt
541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc
601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat
661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag
721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac
781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg
841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc
901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat
961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat
1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg
1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct
1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat
1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta
1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa
1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa
1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct
1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa
1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc
1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact
1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg
1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt
1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct
1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag
1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag
1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt
1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta
2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac
2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt
2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat
2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa
2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc
//
LOCUS YSJSACDHY 2569 bp ds-DNA PLN 18-JUL-1990
DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds.
ACCESSION M34929
KEYWORDS saccharopine dehydrogenase.
SOURCE Y.lipolytica (strain W29) DNA.
ORGANISM Yarrowia lipolytica
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2569)
AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and
Gaillardin,C.
TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia
lipolytica
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.M.Fournier, 06-JUN-1990.
ORF1 is probably not translated, since a lacZ fusion/expression
experiment did not produce results.
FEATURES from to/span description
pept 649 1929 ORF1
pept 1764 655 (c) saccharopine dehydrogenase
mRNA 582 2060 ORF1 mRNA (alt.)
mRNA 598 2060 ORF1 mRNA (alt.)
mRNA 599 2060 ORF1 mRNA (alt.)
mRNA 602 2060 ORF1 mRNA (alt.)
mRNA 606 2060 ORF1 mRNA (alt.)
mRNA 609 2060 ORF1 mRNA (alt.)
mRNA 1807 639 (c) ORF2 mRNA (alt.)
mRNA 1809 639 (c) ORF2 mRNA (alt.)
mRNA 1811 639 (c) ORF2 mRNA (alt.)
mRNA 1812 639 (c) ORF2 mRNA (alt.)
mRNA 1814 639 (c) ORF2 mRNA (alt.)
signal 521 528 TATA box
signal 565 575 TATA box
BASE COUNT 668 a 573 c 666 g 662 t
ORIGIN
1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag
61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt
121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca
181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa
241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag
301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc
361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc
421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt
481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca
541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt
601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc
661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt
721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc
781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc
841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac
901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt
961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca
1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt
1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc
1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt
1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata
1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc
1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc
1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct
1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg
1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg
1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc
1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt
1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag
1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg
1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag
1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg
1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga
1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa
2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca
2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg
2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc
2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct
2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag
2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga
2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat
2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat
2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc
//
LOCUS MVISAA1A 772 bp ss-mRNA MAM 18-JUL-1990
DEFINITION Mink serum amyloid A1 mRNA, complete cds.
ACCESSION M34953 J05445
KEYWORDS serum amyloid A1.
SOURCE Mink liver, cDNA to mRNA, clone pmiSAA1.
ORGANISM Mustela vison
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Musteloidea; Mustelidae.
REFERENCE 1 (bases 1 to 772)
AUTHORS Marhaug,G., Husby,G. and Dowton,S.B.
TITLE Mink serum amyloid A protein--expression and primary structure
based on cDNA sequences
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.O.Marhaug, 07-JUN-1990.
FEATURES from to/span description
pept 29 418 serum amyloid A1
mRNA < 1 772 serum amyloid A1
BASE COUNT 176 a 205 c 200 g 191 t
ORIGIN
1 cagctctgct ccaccagaac tcagcaccat gaagcttttc acgggcctca tcttctgctc
61 cttggtcctg ggggtcagca gtcaatggta ttcattcatt ggtgaggctg ctcaaggagc
121 ttgggacatg tacagagcct actctgacat gatagaagcc aagtacaaaa attcggacaa
181 atacttccat gcccgcggga actatgatgc tgcacaaagg ggccctgggg gtgcgtgggc
241 tgctaaagtc atcagtgacg ccagagagcg ttctcagaga atcacagacc ttatcaagta
301 tggagacagc ggccacggag tggaggactc gaaggctgac caagctgcca acgaatgggg
361 ccggagtggc aaagacccca accacttccg accccctggc ctgcctgaca agtactgagc
421 tttcccttgg ctctgccctg gggagatggg ctgtgaggcc cctgaaggca agaacagttg
481 ctgagttaga gttcctgaat tgtataccct tcctactgaa tactttaaag agcacattaa
541 aatgtttagt aaatgcttgt gaatgggcac ctggatggcc cagttggtta agcggctgtc
601 ttcggctcat gtcatgatcc tagggtcctg ggaccgagtt ctccatgggg ctccttcctg
661 ggcagggagt ctgcttctct gcctgcctct ctgcctctgc ctgccactct gcctgcttgt
721 gtgctctctc tccctctttc tctgacaaat aaataaataa ataaaatctt tt
//
LOCUS MVISAA2A 552 bp ss-mRNA MAM 18-JUL-1990
DEFINITION Mink serum amyloid A2 mRNA, complete cds.
ACCESSION M34954 J05445
KEYWORDS serum amyloid A2.
SOURCE Mink liver, cDNA to mRNA, clone pmiSAA2.
ORGANISM Mustela vison
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Musteloidea; Mustelidae.
REFERENCE 1 (bases 1 to 552)
AUTHORS Marhaug,G., Husby,G. and Dowton,S.B.
TITLE Mink serum amyloid a protein - expression and primary structure
based on cDNA sequences
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.O.Marhaug, 07-JUN-1990.
FEATURES from to/span description
pept 31 420 serum amyloid A2 precursor
sigp 31 84 serum amyloid A2 signal peptide
matp 85 417 serum amyloid A2
mRNA < 1 552 serum amyloid A2 mRNA
BASE COUNT 129 a 144 c 161 g 118 t
ORIGIN
1 agcagctctg ctccaccaga actcagcacc atgaagcttt tcacgggcct catcttctgc
61 tccttggtcc tgggggtcag cagtcaatgg tattcattca ttggtgaggc tgttcaaggg
121 gcttgggaca tgtacagagc ctactctgac atgagagaag ccaattacaa aaattcggac
181 aaatacttcc atgcccgtgg gaactatgat gctgcacaaa ggggccctgg gggcgcctgg
241 gctgctaaag tcatcagtga cgccagagag cgttctcaga gagtcacaga ccttttcaag
301 tatggagaca gcggccacgg agtggaggac tcgaaggctg accaggctgc caacgaatgg
361 ggccggagtg gcaaagaccc caaccacttc cgaccctctg gcttgcctga caagtactga
421 gcttcccctt ggctctgtcc tggggggatg ggctgtgagg accctgggga cagggacacc
481 cagtcattga gttctctgtc cccagaggct agtggagggc acctaatagg tgtctaataa
541 atgcttactg ag
//
LOCUS TFERUBISCO 2172 bp ds-DNA BCT 18-JUL-1990
DEFINITION Pot. Thiobacillus sp. ribulose 1,5 bisphosphate carboxylase
(RUBISCO) large and small subunit genes, complete cds.
ACCESSION M34536
KEYWORDS ribulose 1,5 bisphosphate carboxylase.
SOURCE Pot. Thiobacillus sp. DNA, clones pAH5.0 and pAH196.
ORGANISM Thiobacillus sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Colorless sulfur bacteria.
REFERENCE 1 (bases 1 to 2172)
AUTHORS Stein,J.L., Haygood,M. and Felbeck,H.
TITLE Nucleotide sequence and expression of a deep sea ribulose 1,5
bisphosphate carboxylase gene cloned from a chemoautotrophic
bacterial endosymbiont
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.L.Stein, 23-MAY-1990.
Author address: J.L.Stein
Scripps Institution of Oceanography
UC San Diego, A-002
A-002, Marine Biol Res Div.
La Jolla, CA 92093
The taxonomy of this organism could not be determined. It can only
live as a symbiont of Alvinoconcha hessleri, a gastropod.
FEATURES from to/span description
pept 201 1616 ribulose 1,5 bisphosphate carboxylase large
subunit (EC 4.1.1.39)
pept 1685 2050 ribulose 1,5 bisphosphate carboxylase small
subunit
signal 19 24 pot. -35 region
signal 43 49 pot. -10 region
binding 188 194 ribosome binding site
binding 1673 1676 ribosome binding site
BASE COUNT 441 a 741 c 641 g 349 t
ORIGIN
1 atccttatcc tgaccctcaa tacataatcg tgagcttatg gctataatca gaaagtctga
61 gttcacttta tagatgacct ctcccatgct cactctcaat cctgagtgat tcactcggat
121 tggacccgat aaccgtgcgg gctgtcggac gctaggcgtc cggtgcccct cgaacaccga
181 gcctctggag aggacaagca atggcgaaga aatacgatgc gggcgtaaaa gactaccgcg
241 aaacctactg gatgccggac tacacgccga aggagaccga cctgctggcg tgcttcaaga
301 tcattccgca gccgggcgtg ccgcgcgagg aagcccgggc cgccgtggcc gccgagtcgt
361 ccaccggcac ctggaccacc gtgtggaccg acctgctgac cgacctcgac cactacaagg
421 gccgtgccta cgccatcgag gacgtgccgg gcgacgagga ggccttctat gccttcatcg
481 cctaccccat cgacctgttc gaggaaggct cggtggtcaa cgtgtttacc tctctggtgg
541 gcaacgtgtt cggcttcaag gccatccgcg ccctgcgtct ggaggacgtg cgcttcccca
601 tcgcatacgt catgacctgc aacggtccgc cgcacggcat ccaggtcgag cgcgacatca
661 tgaacaagta cggccgtccg ctgttgggct gcaccatcaa gcccaagctg ggcctgtcgg
721 ccaagaacta cggtcgcgcc gtgtacgaat gcctgcgcgg cggcctggac ttcaccaagg
781 acgacgagaa cgtcaactcc cagcccttca tgcgctggcg gcacggtttc gacttcgtga
841 tggaggccat cgagaaggcc gagcgcgaga ccggcgagcg caagggtcac tacctgaacg
901 tcaccgcgcc caccccggac gagatgttca agcgtgccga gtacgccaag gagatcggcg
961 cgccgatcat catgcacgac tacatcaccg gcggcttcac cgccaacacc ggcctggcgc
1021 agtggtgccg tgacaacggc gtgctgctgc acatccaccg cgccatgcac gccgtgctcg
1081 accgcaaccc gcaccacggc atccacttcc gcgtgctgac caagatcctg cgcctgtccg
1141 gcggcgacca cctgcacacc ggcaccgtgg tcggcaagct ggagggcgac cgcgaggcga
1201 ccctcggctg gatcgacctg ctgcgcgagt cctacatcaa ggaagaccgt tcgcgcggca
1261 tcttcttcga ccaggactgg ggctccatgc ccggcgtctt tgcggcctgc tccggcggta
1321 tccacgtctg gcacatgccg gcgctggtca ccatcttcgg cgagcatgcc gtgctccagt
1381 tcggtggcgg caccctgggt cacccctggg gcaacgccgg cgccgcggcc aaccgggtgg
1441 cgctggaagc ctgcgtcgag gctcgcaacg agggccacga gctggagaaa gagggcaagg
1501 acatcctcat ccaggccgcc aagcacagcc ccgaactcaa gaccgcgatg gagacctgga
1561 aggaaatcaa gttcgagttc gacaccgtgg acaaactgga cgtggcgcac aaataagcgc
1621 cccgccaccc cgggccggtc gtggctggcc gcgccacgca tccccaattt tcgaggtatc
1681 aacaatgagc gaaatccaag actacaactc gagcgtcagc gatccgtcga gccgcaagtt
1741 cgagaccttc tcctacctgc ccgaactcgg cgtggaaaag atccgcaagc aggtcgagta
1801 catcgtcagc aagggctgga acccggccgt cgagcacacc gagccggaga acgccttcga
1861 ccactactgg tacatgtgga agctgccgat gttcggcgaa accgacgtgg acgccatcct
1921 ggccgaggcc gaggcatgcc acaaggcgca tccctcgcat cacgtgcgcc tgatcggcta
1981 cgacaactat gcccagtcgc aaggcactgc catggtgatc ttccgcggcc ccatcagcgc
2041 caagtgctga gccaccgggg ccgtaaacac gcgactgccg aggtctgccg gcccccgtcg
2101 catggcgacc cccgagggca tccgccaggt gcgtcacctc gcgatgttct tccgaacatt
2161 ccaataagcc ag
//
LOCUS LSVGPCN 3417 bp ss-RNA VRL 18-JUL-1990
DEFINITION Lassa virus glycoprotein (GPC) and nucleoprotein (N) genes,
complete cds.
ACCESSION M36544 M33879
KEYWORDS glycoprotein; nucleoprotein.
SOURCE Lassa virus (strain GA391), cDNA to viral RNA.
ORGANISM Lassa virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Arenaviridae.
REFERENCE 1 (bases 1 to 3417)
AUTHORS Wilson,S.M. and Clegg,J.C.S.
TITLE Sequence analysis of the S RNA of Lassa virus
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by J.C.S.Clegg, 26-APR-1990.
Author address:J.C.S.Clegg
PHLS Centre for Applied Microbiology and Research
Porton Down, Salisbury SP4 OJG
UNITED KINGDOM
FEATURES from to/span description
pept 71 1543 glycoprotein (GPC)
pept 3315 1603 (c) nucleoprotein (N)
BASE COUNT 944 a 801 c 706 g 966 t
ORIGIN
1 gcaccgggga tcctaggcat ttaggattgc gcattttaaa acctcctttt tggaaagtgt
61 cgcaatcagg atgggacaga ttgtgacatt cttccaagaa gttcctcatg ttattgagga
121 agtgatgaat attgtcctta ttgcactatc catcctagca attctgaagg gactatacaa
181 tgttgccacg tgtggcttga tagggcttgt cacattcctt ctactttcag gaaggtcatg
241 ctcactgatc tacaaaggga cttacgagct gcaaaccctt gagttgaata tggagactct
301 taatatgacc atgccgctat cctgcacaaa gaacaacagt catcattata taagggtggg
361 gaatgagaca ggacttgagc tcaccttgac caacaccagc attcttaatc acaaattctg
421 taacctctct gatgcccaca aaaggaatct ttatgatcac agcctcatga gtatcatctc
481 tacctttcat ctgtccattc ccaacttcaa tcaatatgag gcaatgagct gcgatttcaa
541 tggggggaaa atcactgtgc aatacaacct gagtcatagc ttcgcagtgg atgcagcagg
601 tcactgcggc acacttgcaa atggtgtctt acaaacattt atgagaatgg cttggggagg
661 gagttatatt gctcttgatt ctggacgcgg taactgggac tgtataatga ccagttacca
721 atatctaatc attcagaata caacctggga tgaccactgc caattttcca gaccatcacc
781 tattggctac cttgggcttc tctcacaaag aactagagac atatacatta gcagaaggtt
841 gttggggaca ttcacctgga cactatcaga ctcagaggga aatgaaacac cagggggata
901 ctgccttact agatggatgt tgattgaggc cgaattaaag tgctttggaa acactgcagt
961 agccaagtgc aatgagaaac atgatgaaga attttgtgac atgctaaggt tgttcgattt
1021 caacaaacag gccataagga ggctcaaaac agaggcccaa atgagcatac agctgatcaa
1081 caaggctgtc aatgctttaa taaatgatca gctcatcatg aagaaccact tgagagacat
1141 catgggcata ccatattgta actacagcag atattggtac cttaaccaca catcaacagg
1201 aaagacctca ctaccaaggt gttggcttat ctcaaatgga tcatatctca atgagaccaa
1261 gttttcagat gacatcgaac aacaagctga caacatgata acagagatgc tacaaaagga
1321 atacatagat agacagggca aaactccact ggggttagtt gacctatttg tttttagcac
1381 aagtttctat ctgataagca tctttctcca cctggtcaaa ataccaaccc atagacacat
1441 tgtaggtaaa ccttgcccaa aaccccacag gctgaaccac atgggcatct gctcctgtgg
1501 tctatacaaa cagccaggtg tgcctgtcag atggaagagg tgaaatccca cagggccccc
1561 gtgacccacc gccaattggc ggtgggtcac gggggcgtcc atctacagga cgactttagg
1621 tgttgaagtt ctgaacacca tgtctctagg gagcacaact ctcaggactt ttgcatcaag
1681 tcctcctgaa actgctgcat caaacataat gcaatccatc agtgcacaat gaggagttat
1741 ttcctcttta ccacctctct tctttttctc cacaactacc ccagtgtgca tgtgacatag
1801 atccttgaat tgatcccaaa cagcattctc aaactttctt gaatctgctt tactaagagt
1861 gatgtcaatc agttttatgt ctctcctccc ttgtgactca aggagttttc tgatatcctc
1921 tgatccttgg caagtgatga ccatgttccg aggaagggct tctatcactg cactggttaa
1981 cccaggttgg gcagcaaaca aatcagtcac atccctacca tgtgaatact ttgcatcttg
2041 tttgaattgc ttcaaatctg ttggctccct aaagaaatgt acatagcaac ccgagctcgg
2101 ttgataaagg gctatctcaa ctgggtcttc tggtcttcct tcaatatcca tccaggtttt
2161 tgcgttggga atcaagttga agcacttgaa atctttgaga gtcatcaact gagagtagat
2221 taatcctgca gcaaagcctg cagactgtaa tgacttgttg gaccccccat ttccaatttt
2281 ctggggcttg ttgtcttgct caaggtccac cacagtattt tcccatgctc ttcccacaat
2341 cgaggtcctt gatgcaatat aggaccatcc gtctcctgag agacagatct tgtataggat
2401 gttctcataa gggttccttt cacccggtgt gtctgataca aacattccca gactcttctt
2461 aactttcaag attgacttca agataccatc catggtctga ggtgaaacct taatagtctc
2521 taacatgtta ccaccatcaa gcatgcaggc ccctgctttg acagcagcac ccaaactgaa
2581 attgtaacca gagatgttga gtgaactttt cttagtgtca atcatattca gaattggatg
2641 actctgagac aatctgtcga gatcagatga gttggggtat ttggctgtgt aaatcagccc
2701 taaatctgtc aaagcttgaa cggcatcatt caggtccact tgcccctgtt tggtcaggca
2761 tgctaaagtt aggcttggca ttgttccgaa ctgattgttg agtaactctg catttttgac
2821 atcccaaact ctcacaatgc cattactggc accctttccc cctccactta ctccagtcat
2881 gccaatcatg ttcaaaaggg ctctcctttg atcaagctgt tgtgaactca aattccccat
2941 ataaactcct gaactcaaag gcctttctgt tctggtgact tttgatttca gtttttctaa
3001 atcagcggcc aggattagta gatcgtctga acttaaggtt ccaactctta agacactttt
3061 ctgctgtgtg gatttgagct caacaagatt gttgactgct tgattgagat ctctcagtcg
3121 ttttaggtcg ccatcatctc ttttctgctt gcgcatcaat ctctgaacat tactgacctc
3181 ggagaagtca agaccatgaa ggagagcttg agcgtcttta actacctgca actttatgtt
3241 ggaacagtag ccagatagtt cccttcttag ggattgagtc cacaagcatg acctcacttc
3301 cttggaagca ctcattgtcg tgatggttgt ctgacccttg agtgggtctt gaatgtggtc
3361 actccaaagg tttgattagt gcaaagcgca atccaatagc ctaggatcca ctgtgcg
//
LOCUS MOPGPCNC 3419 bp ss-RNA VRL 18-JUL-1990
DEFINITION Mopeia virus glycoprotein (GPC) and nucleocapsid protein genes,
complete cds.
ACCESSION M33879
KEYWORDS glycoprotein; nucleocapsid protein; small RNA segment.
SOURCE Mopeia virus (strain 800150), cDNA to viral RNA.
ORGANISM Mopeia virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Arenaviridae.
REFERENCE 1 (bases 1 to 3419)
AUTHORS Wilson,S.M. and Clegg,J.C.S.
TITLE Sequence analysis of the S RNA of the African arenavirus Mopeia: An
unusual secondary structure feature in the intergenic region
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by J.C.S.Clegg, 26-APR-1990.
Author address: J.C.S.Clegg
PHLS Centre for Applied Microbiology and Research
Porton Down, Salisbury SP4 OJG,
UNITED KINGDOM
FEATURES from to/span description
pept 47 1516 glycoprotein
pept 3352 1640 (c) nucleocapsid protein
BASE COUNT 875 a 819 c 736 g 989 t
ORIGIN
1 gtggatccta ggctttttgg ttgcgcattt ctagagcatc tcggagatgg ggcagatagt
61 caccttcttt caagaggtgc cacacatcct tgaagaagtg atgaacattg tgctgatgac
121 cctctcaatc ttggccatcc taaagggcat ctacaatgtg atgacctgtg gaatcatcgg
181 tttgataaca tttttgtttt tgtgtgggag atcatgctca agcatctata aggacaacta
241 tgagttcttc tctctcgacc tcgacatgtc ttcactgaat gcaacgatgc ctctctcctg
301 ctcaaagaac aactcccatc actacatcca agttgggaat gagacaggcc tagagctgac
361 attgacaaac actagcataa ttaatcataa gttttgcaac ctgtctgatg ctcacaggag
421 gaatctttat gacaaagcac ttatgtcaat cttgacaaca ttccacctaa gcattccaga
481 ttttaaccaa tatgaggcaa tgtcatgtga ttttaacgga gggaagatct cagttcaata
541 caacctgtcc cactcaaatt atgtggatgc cggcaaccac tgtggcacta ttgcaaatgg
601 cattatggat gtttttagaa gaatgtattg gagcacctcc ctttcggttg cctctgacat
661 aagtgggact caatgcatac agaccgatta taagtatttg attattcaga acacatcatg
721 ggaggatcat tgcatgttct caagaccctc acccatggga tttttaagcc ttctgtcaca
781 gagaaccagg aatttctaca tctcaagaag acttttgggt ctttttacat ggactttgag
841 tgactcggaa gggaacgaca tgccaggtgg ttactgtctc acaaggtcta tgctaatagg
901 gcttgatctg aaatgctttg ggaacactgc cattgcgaag tgtaatcagg cacatgatga
961 agaattctgt gacatgctcc gcctttttga ttttaataaa caagcaatta gcaaactgag
1021 gtctgaagtt cagcagagca taaatttgat aaataaagct gtcaacgccc tcatcaatga
1081 tcaactggtc atgaggaacc atttgagaga tctaatgggt attccttatt gcaactactc
1141 taagttttgg tacttgaatg atactaggac agggagaaca tccctcccaa agtgttggct
1201 tgtgaccaat gggtcatacc taaatgaaac ccagttctca acagaaattg agcaagaagc
1261 caataacatg ttcactgaca tgttgaggaa ggagtatgag aaaaggcaga gcacaacacc
1321 tctggggtta gtagatcttt ttgttttctc cactagcttt tatttgatct ccgtgttcct
1381 ccacctaatc aaaattccaa cacatagaca cataaaaggc aagccttgcc cgaaaccaca
1441 cagactcaat cacatggcaa tctgttcctg tggcttctat aagcaaccag gtctccccac
1501 acaatggaaa aggtgaagaa caggggcacc cccgagaccc accgccttcg gcggtgggtc
1561 tcgggggcct ggagtgactg ttccttgatg gcctccacga cccaccgcca gaggcggtgg
1621 gtcgtggagg tgtcccttct tacaggacaa ctctgggagg acctgttcta aacaccaggt
1681 ctctactcaa gactggtctg ggggttggga tttgtggact ccctatgact gctgcttcaa
1741 acatgagaca atccagcaat gcacagtgtg gtgtcacttc ttctttgcca cctctcttct
1801 tcttttgcgt tacaatcccc gtatgcattc tacaaaggtg tttgtattca tcccagatct
1861 tatcctcaaa ttttcttgca tcgtctttct gcatggacac atcaatcagt tttatgtccc
1921 tcctgttctg ggagtcaaga agctttctga tgtcatcagc accttgacac gacaagacca
1981 tgttctttgg caggctctca attacagcac ttgtcaaccc cggctgaact gagaataggt
2041 cttggatgtc aatgccatga gagtgtttgg agtcttgttt gaattgttta atgtctgttg
2101 gttccctgta aaaatgaata tactgaccat tattaggttg atagatagct atctccacgg
2161 ggtcttcagg tctcccttct atgtcaatcc atgtcttagc atttggatcc aattgcaaca
2221 tgctgtcttt gagttccatt gtctgagagt aggtcaaacc agcaggcatg ccttgggcct
2281 gagcgagtct gttggagtta cctttgtcag gctttgcatt ctgttgcaca tcaccactca
2341 gatcaacagt tgtgttatcc caagccctgc cgacgatcga tgttcttgat gctatgtaag
2401 gccaaccctc tccagaaaga catagtttgt agaggagatt ttcataaggg ttcctttccc
2461 ctggcgtgtc agagacaaac attcccaaag aacgcttgac tttcaaggca gccttcaaaa
2521 ttccttccaa attggatttt gttactttga tggtctccag catgtttccg ccgtctatta
2581 gacaagcacc agctttcacc gcagctgaca ggctaaaatt gtaccctgat atgtttatgg
2641 cacttttctt gggctcaatg atccccaaga ttgggtgggt ctgagagagt ttgtctaagt
2701 cagacatgtt tggatacttt gcagtgtaca caagccccaa gtctgacaac gattggatca
2761 catcattcag atctgcctgc ccttgtttgc acatgcaagc aatcgtcagg cttggcattg
2821 tcccaaactg attgtttaaa agctctgggt ttcgaacatc ccacactcta acaatgccat
2881 ctccgagagt gtttcctgcc cggaacccgc cacccattcc gaccatctgt agcaaaaccc
2941 tcctctgttc tagctgctga gctgttaggt tgcccatata gactccagca gcaagaggcc
3001 tctcacctct gatgactttt gctttgagtc tgtccaggtc ggcagcaagg acgaggaggt
3061 catcagaggt tagtgtcccc actctcaaaa cattcttctg ttggactgac tttaactcaa
3121 ctagattgtt cactgcctgg tttaggtccc tcaatctttt caggtcagag tcatccctct
3181 tctcctttct catcaacctt tgaacattgg caacttcaga gaagtccagc ccatgaagaa
3241 gtgcttgagc atccttgatg acttggatct ttatgttgga gcagtaccct gagagttctc
3301 tcctcaggct ctgtgtccac aagaaggact tcacctcctt tgaattggac attgttggag
3361 gcttttcggt gcagagctga ttccaaaggc gcaatcaatt agcctaggat ccactgtgc
//
LOCUS HIV2D194 9399 bp ss-RNA VRL 18-JUL-1990
DEFINITION Human immunodeficiency virus type 2 (HIV-2), complete proviral
genome.
ACCESSION J04542
KEYWORDS .
SOURCE Human immunodeficiency virus type 2, isolate D194.
ORGANISM Human immunodeficiency virus type 2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (sites; for [2])
AUTHORS Kuehnel,H., von Briesen,H., Dietrich,U., Adamski,M., Mix,D.,
Biesert,L., Kreutz,R., Immelmann,A., Henco,K., Meichsner,C.,
Andreesen,R., Gelderblom,H. and Ruebsamen-Waigmann,H.
TITLE Molecular cloning of two West African human immunodeficiency virus
type 2 isolates that replicate well in macrophages: A Gambian
isolate, from a patient with neurologic acquired immunodeficiency
syndrome, and a highly divergent Ghanian isolate
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2383-2387 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 9399)
AUTHORS Kuehnel,H., von Briesen,H., Dietrich,U., Adamski,M., Mix,D.,
Biesert,L., Kreutz,R., Immelmann,A., Henco,K., Meichsner,C.,
Andreesen,R., Gelderblom,H. and Ruebsamen-Waigmann,H.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT The gag coding sequence appears to be distorted at the 5' end by a
frameshift; the env cds has a premature stop codon. The 5' ends of
the pol and gag genes have not been determined.
FEATURES from to/span description
pept < 851 2053 gag polyprotein (AA at 851)
pept < 1657 4878 pol polyprotein (AA at 1657)
pept 4808 5455 vif protein
pept 5283 5621 vpX protein
pept 5621 5938 vpR protein
pept 5784 6079 tat protein, exon 2 (first expressed exon)
8212 8308 tat protein, exon 3 (AA at 8213)
pept 6010 6079 rev protein, exon 2 (first expressed exon)
8212 8453 rev protein, exon 3 (AA at 8214)
pept 6086 6886 env ORF
pept 8471 9244 nef protein
LTR 1 239 5' LTR
LTR 8856 9399 3' LTR
rpt 1 113 R repeat 5' copy
binding 243 259 primer (Lys-tRNA) binding site
BASE COUNT 3206 a 1924 c 2357 g 1911 t 1 others
ORIGIN
1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
61 gctccacgct tgcttgctta aagacctctt aataaagctg ccagttagaa gcaagttaag
121 tgtgtgttcc catctctcct agtcgccgcc tggtcattcg gtgttcatct gagtaacaag
181 accctggtct gttaggaccc ttcccgcttt gagaatccaa ggcaggaaaa tccctagcag
241 gttggcgccc gaacagggac ttgaaagagg actgagaagc cctggaacac ggctgagtga
301 aggcagtaag ggcggcagga acaaaccacg acggagtgct cctagaaaag cgcgggccga
361 ggtaccgaag cggcgtgtgg agcgggagtg aaagaggcct ccgggtgaag gtaagtacct
421 acaccgaaaa ctgtagccag aaaaggcttg ttatcctacc tttagacagg tagaagattg
481 tgggagatgg gcgcgagaaa ctccgtcttg agagggaaaa aagcagacga attagaaaaa
541 gttaggttac ggcccaacgg aaagaaaaga tacaggttaa aacatgttgt gtgggcagcg
601 aatgaattag acagattcgg attggcagag agcctgttgg aatcaaaaga aggttgccaa
661 aagattctta aagttttaga accattagta ccaacagggt cagaaaattt aaaaagcctt
721 tttaataccg tctgcgtcat ttggtgcttg cacgcagaag agaaagtgaa agatactgaa
781 gaagcaaaga aactagcaca gagacatcta gtggcagaaa caggaactgc agagaaaatg
841 ccaaatataa gtagaccaac agcaccacct agtgggaaag ggaggaaact tccccgtgca
901 acaggcaggc ggcaactata tccatgtgcc gctgagcccc cgaactctaa atgcttgggt
961 aaaattagta gaggaaaaga agttcggggc agaagtagtg ccaggatttc aggcactctc
1021 agaaggctgc acgccctatg atatcaatca aatgcttaat tgtgtgggcg atcaccaagc
1081 agctatgcaa ataatcagag aaattattaa tgaggaagca gcagattggg atgcgcagca
1141 cccaatacca ggccccttac cagcagggca gcttagagac ccaagggggt ctgacatagc
1201 aggaacaaca agcacagtag atgaacagat ccagtggatg tataggcaac caaatcccgt
1261 gccggtaggg aacatctaca ggagatggat ccagataggg ctacagaaat gtgtcaggat
1321 gtacaaccca actaacatct tagatgtgaa gcagggacca aaagaatcgt tccagagcta
1381 tgtagacaga ttctacaaaa gcctaagggc agaacaaaca gacccggctg taaaaaattg
1441 gatgacccaa acgctgctaa tacagaatgc caacccagac tgcaagttag tattaaaagg
1501 actagggatg aatcccaccc tagaggagat gctgactgcc tgccagggag taggcggacc
1561 aagccagaaa gccagactaa tggctgaagc cctaaaggag gctttgacgc cagcccctat
1621 cccatttgca gcagcccaac aaagaagggc aattaggtgt tggaattgtg gaaaggaggg
1681 acactcggcg aaacagtgcc gagcacccag aagacagggc tgctggaagt gtggcaagtc
1741 aggacacatc atggcaaact gcccggaaag acaggcaggt tttttaggga tgggcccacg
1801 gggaaagcag ccccgcaact tccccgcggc ccaagctcct caggggctga taccaacagc
1861 acccccaata gatccagcag tggacctgtt ggagaaatat atgcagcaag ggagaaagca
1921 gagagagcag agggagagac catacaagga ggtgacggag gacttactgc acctcgagca
1981 gggagagacg ccccacagag gggcgacaga ggacttgcta cacctcaatt ctctctttgg
2041 aaaagaccag tagtcacagc attcatcgag gatcagccgg tagaagtctt actagacaca
2101 ggagctgatg actcaatagt agcaggaata gagttagggg acaattacac tccaaaaata
2161 gtggggggaa tagggggatt cataaatacc aaagaatata aaaatgtaga aataaaggta
2221 ctaaataaaa gagtaagagc caccataatg acaggagata ccccaatcaa catttttggc
2281 agaaatattc tggcaacctt aggcatgtca ttaaacctac cagtcgccaa gttagaccca
2341 ataaaagtaa cattgaagcc agggaaagat ggaccaaggc tgaaacaatg gcccctaaca
2401 aaagaaaaaa tagaagcact aaaagaaatt tgtgaaaaaa tggaaaggga gggccaacta
2461 gaagaagcac ctccaactaa tccttataat acccccacat ttgcaattaa gaaaaaggac
2521 aagaacaaat ggagaatgct aatagatttt agagaactaa acagggtgac tcaagatttc
2581 acagaaattc agctaggaat tccacacccg gcaggattag ccaaaaagaa aaggattact
2641 gtactagatg taggggatgc ctacttttcc ataccactac atgaagattt taggcaatat
2701 actgcattta ccctaccatc agtaaacaat gcagagccag aaaaaagata tgtatataag
2761 gtcttaccac aaggatggaa aggatcacca gcaatctttc aattcatgat gaggcaaatc
2821 ttagaacctt tcagaaaagc aaacccagac gtcattctca tccaatacat ggatgatatc
2881 ttaatagcta gtgacaggac gggtttagag catgacaaag tagtcctgca actaaaagaa
2941 cttctgaatg gcctagggtt ctctacccca gatgagaagt tccaaaagga ccctccgttt
3001 caatggatgg gctatgaatt gtggccaact aaatggaaac tgcagaaaat acaattacct
3061 cagaaagaaa tatggacagt caatgacatc caaaaactag taggagtttt gaactgggcg
3121 gcgcagatct atccagggat aaaaaccaag catttatgta aattgattag aggaaaaatg
3181 acactcacag aggaagtaca gtggacagag ttagcagagg cagaactaga agaaaacaaa
3241 attatcttaa gtcaggaaca agagggatcc tactatcagg aagaagaaga actagaagca
3301 acagtcatca aaagccaaga caatcagtgg gcatacaaaa tacaccaggg agagagggtt
3361 ctaaaagtag gaaagtatgc gaagataaaa aatactcata ccaatggggt cagactacta
3421 gcacaagtag tccaaaaaat aggaaaggaa gcactggtca tttggggacg agtgccaaaa
3481 tttcacctac cggtagagag agacacctgg gagcaatggt gggataacta ctggcaagta
3541 acatgggtcc cagagtggga cttcgtatct accccaccac tggtcaggtt gacatttaac
3601 ttggtaggag atcctatacc aggcacagag accttttaca cagatggatc atgcaataga
3661 cagtcaaaag aaggaaaagc aggatatgta acagatagag ggagagacag ggtaagagta
3721 ttagagcaaa catccaatca gcaagcagaa ctagaagcct ttgcgatggc actggcagac
3781 tcaggtccca aggttaatat catagtagac tcacagtatg taatggggat agtagcaggc
3841 caaccaacag agtcagaaaa tagaatagta aaccaaatca ttgaggacat gataaagaaa
3901 gaagcagtct atgttgcatg ggtcccagcc cataaaggca taggaggaaa ccaggaagta
3961 gaccatttag taagtcaggg catcagacaa gtattattcc tggaaaagat agagcccgct
4021 caagaagaac acgaaaaata tcatagcaat ataaaagaac taacccataa atttggaata
4081 ccccaactag tggcaagaca gatagtaaac acatgtgccc aatgccaaca gaaaggagaa
4141 gccatacatg ggcaagtaaa tgcagaaata ggcgtttggc aaatggactg cacacactta
4201 gaaggaaaaa tcattatagt agcagtgcat gttgcaagtg gattcataga agcagaagtc
4261 atcccacagg aatcaggaag gcagacagca ctcttcctat taaaactggc cagtaggtgg
4321 ccaataacac acttgcacac agacaatggc cccaacttca cttcacagga agtgaagatg
4381 gtggcatggt ggataggtat agagcaatcc tttggagtac cttacaatcc acaaagccag
4441 ggagtagtag aagcaatgaa tcaccaccta aaaaatcaga taagtagaat tagagaacag
4501 gcaaatacaa tagaaacaat agtactaatg gcagttcatt gcatgaattt taaaagaagg
4561 ggaggaatag gggatatgac cccagcagaa agactaatta acatgatcac cacagaacaa
4621 gaaatacaat tcctccaaag aaaaaattca aattttaaaa aattccaggt ctattacaga
4681 gaaggcagag atcagctgtg gaaaggacct ggagagctac tgtggaaggg agacggagca
4741 gtcatagtca aggtaggggc ggacataaaa gtagtaccaa gaaggaaggc caagattatc
4801 agggactatg gaggaaggca agaactggat agtagttccc acctggaggg tgccagggag
4861 gatggagagg tggcatagcc ttgtcaagca cctgaagtac agaacaaaag acttagagga
4921 ggtgcgctat gttccccatc acaaggtagg atgggcatgg tggacttgca gcagggtaat
4981 attcccacta gaaggagaaa gtcatctaga gatacaggca tattggaacc taacaccaga
5041 aaaaggatgg ctctcctctc attcagtaag gttaacctgg tatacagaaa agttctggac
5101 agatgttacc ccagactgtg cagactccct aatacacagc acttatttct cttgctttac
5161 ggcaggtgaa gtaagaagag ccatcagagg ggaaaagtta ttgtcctgct gcaactaccc
5221 ccaagctcat aaagcacagg taccatcact tcaataccta gccctagtgg tagtgcaaca
5281 aaatggcaga ccccagagaa agggtgccgc caggaaacag tggagaagag accattggag
5341 aggccttcga gtggctagac aggactatag aagccttaaa cagggaggca gtgaaccatc
5401 tgccccgaga gctcattttc caggtgtggc aaaggtcctg ggcatattgg catgatgaac
5461 aagggatgtc aacaagttac acaaagtata gatatttgtg cataatgcag aaagctgtgt
5521 atatacattt caagaagggg tgcacttgcc tggggagagg acatggcccg ggaggatgga
5581 gaccaggacc tccccctcct ccccctccag gtctagtcta atgactgaag caccaacaga
5641 gtttccccca gaagatggga ccccacggag agagctaggg agtacctggg taatagaaac
5701 tctgaaggaa atcaaggaag aagccttaaa acattttgat ccctgcttgc taattgctct
5761 tggcaactat atctataata gacatggaga cacccttgaa ggagccagag agctcattag
5821 agtcctacaa cgagccctct tcgtgcacat cagagcggga tgtgaccgct caagaaaggg
5881 ccaaacaagg agaagagctc cttgcccagc tgcaccgacc cctagaggca tgcactaact
5941 catgctattg taagcagtgc agttaccatt gccagctgtg tttcttgaaa aaagggctcg
6001 ggatatggta tgcgcgacag ggcagacgaa gaaggactcc aagaaaaact aagactcatc
6061 cgcctcctgc atcagataag taagtatgga gcctggtagg aatcagctgc ttgttgccat
6121 tttattaact agtgcttgct taatatattg caaacaatat gtgactgttt tctatggcat
6181 acccgcgtgg agaaatgcat ctattcccct attttgtgca accaaaaata gagatacttg
6241 ggggaccatc cagtgcttgc cagacaatga tgattatcag gaaataacct taaatgtgac
6301 agaagctttt gatgcatggg ataatacagt aacagaacaa gcaatagaag atgtctggag
6361 actgtttgag acatcaataa aaccatgtgt caagttgacg cccctatgtg tggcgatgaa
6421 ttgtaatata acttcaggga ctaccgcgac cccgagtcca ccaaacatta caataataga
6481 tgaaaattct acctgtatag gcgacaacaa ctgcacagga ttagggaaag aagaggtggt
6541 tgagtgtgag ttcaatatga cggggctaga acaagataag aaaaggaagt ataatgacgc
6601 atggtactca agagatgtgg tttgtgacaa gacaaacgga acaggcacat gttacatgag
6661 acattgcaac acatcagtca tcaaagagtc atgtgacaag cactattggg atgctatgaa
6721 gtttagatac tgtgcaccac cgggttttgc cctactaaga tgcaatgata ccaactattc
6781 aggctttgaa cctaagtgct ctaaagtagt agctgcttca tgcacaagga tgatggaaac
6841 gcaaacttct acttggtttg gctttaatgg cactagagca gaatagaaca tatatctatt
6901 ggcatggtaa gataatagga ctatcattag cttaaacagt attataatct cacaatgcat
6961 tgtaagagac caggaaataa gacagttgta ccaataacac ttatgtcagg gcgaaggttt
7021 cactctcggc cagtctacaa caaaaaacct gggcaggcat ggtgttggtt tcaaggcaac
7081 tggatagaag ccatgcggga ggtgaagcaa acccttgcaa aacatcccag gtacggagga
7141 acaaatgata caggaaaaat taactttacg aagccaggaa taggttcaga cccagaagtg
7201 acatacatgt ggactaactg cagaggagaa tttctctact gtaatatgac ttggttcctc
7261 aattgggtag aaaataagac gaaccaaaca cacggcaact atgcgccatg ccatataagg
7321 cagataatta acacctggca taaggtaggg acaaatgtat atttgcctcc tagggaaggg
7381 gagttgacct gcaattcaac agtaaccagc ataattgcta acattgactc agatggaaat
7441 cagaccaaca ttacctttag tgcagaagtg gcagaactgt accgattaga attgggggac
7501 tacaaattga tagaagtaac accaattccg ttcgcaccta caaaagagaa aagatattcc
7561 tcggctccag tgaggaacaa aagaggtgtg ttcgtgctag ggttcttggg ttttctcgca
7621 gcagcaggtt ctgcaatggg cggcncgtcc ttgacgctgt cggctcagtc ccggacttta
7681 ctggccggga tagtgcagca acagcaacag ctgttggacg tggtcaagag acaacaagaa
7741 atgttgcgat tgaccgtctg gggaacgaaa aatctccagg caagagtcac tgctatcgag
7801 aaatacttaa aggaccaggc acagctaaat tcatggggat gtgcgtttag gcaggtctgc
7861 cacactactg taccatgggt aaatgactcc ttaacacctg actggaacaa tatgacatgg
7921 caggaatggg aaaaacgagt ccactaccta gaggcaaata tcagtcaaag tttagaacag
7981 gcacaaattc aacaagaaaa gaatatgtat gaactacaaa aactaaatag ctgggatgtc
8041 tttggcaact ggtttgattt gacctcctgg atcaaatata ttcaatatgg agtttatata
8101 gtagtaggaa taataggttt aagaatagcc atatatatag tgcaattgtt aagtagactt
8161 agaaagggct ataggcctgt tttctcctcc ccccccggtt atctccaaca gatccatatc
8221 cacacggaca ggggacagcc agccaacgaa gaaacagaag aagacgccgg agacgacagt
8281 ggtttcggct tgtggccttg gccactaaac tacatacaat tcctgatcca cctactgact
8341 cgcctcttga ccgggctata caacagctgc aggggcttac tatccaagaa ctccccgacc
8401 cgccgactga tctcccagag tctaacagca atcagggact ggctgagact taaggcggcc
8461 tacctgcaat atgggtgcga gtggatccaa gaagcgttcc gagcattcgc aaggactgcg
8521 agagagacta ttgcgggcgc gtggaggggg ttatgtgaag cagcgcaacg catcgggagg
8581 ggaatcctcg cagtcccaag aaggatcagg cagggagcag aaatcgccct cctgtgaggg
8641 acagcagtat cagcagggag agttcatgaa caccccatgg agaaccccag cagcaatagg
8701 gcagaaaaat tcatataagc agcaaaatat ggatgatgta gattctgatg atgatgacct
8761 agtgggagtt cctgttatgc caagagtacc gctgagagaa atgacctata aactggcaat
8821 agatatgtca cattttataa aagaaaaagg aggactggaa gggatatttt acagtaggga
8881 gagacataga atcctagact tgttcctaga aaaggaggaa gggataatac cagattggca
8941 gaattatact catgggccag gaacaaggta cccaatgtac ttcgggtggc tgtggaaact
9001 agtaccagta gacatctcac aagaggcaga ggaagtagag accaactgct tagtacaccc
9061 agcacaaaca agcagatatg atgacgagca tggggagaca ctagtttggc ggtttgaccc
9121 catgctggcc tatagttaca aggccttcat tctgcaccca gaagaatttg ggcacaagtc
9181 aggattgcca gagaaagagt ggaaggcaaa actgaaagca agagggatac catatagtga
9241 ataacaggaa caaccatact tggtcagggc aggaaatagc tactaagaac agctgagact
9301 gcagggactt tccagaaggg gctgtaacca agggagggac atgggaggag ctggtgtcat
9361 attctctgta taaatgtacc cgcttcttgc attgtattc
//
LOCUS ANAHETA 2608 bp ds-DNA BCT 18-JUL-1990
DEFINITION Anabaena sp. heterocyst differentiation protein (hetA) gene,
complete cds.
ACCESSION M31722
KEYWORDS differentiation protein; heterocyst differentiation protein.
SOURCE Anabaena sp. (strain PCC 7120) DNA.
ORGANISM Anabaena sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Nostocaceae.
REFERENCE 1 (bases 54 to 2608)
AUTHORS Holland,D. and Wolk,C.P.
TITLE Identification and characterization of hetA, a gene that acts early
in the process of morphological differentiation of heterocysts
JOURNAL J. Bacteriol. 172, 3131-3137 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2608)
AUTHORS Holland,D. and Wolk,C.P.
JOURNAL Unpublished (1990) MSU-DOE Plant Res. Lab., M.S.U., Lansing, MI
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Wolk,C.P., 30-JAN-1990.
FEATURES from to/span description
pept 198 2021 heterocyst differentiation protein (hetA)
ORF 2077 > 2608 unidentified reading frame (ORF2)
rpt 1973 2000 direct repeat
rpt 2075 2158 direct repeat
BASE COUNT 793 a 510 c 513 g 792 t
ORIGIN
1 taaaaatcct atataggact catatttgat ttctgaaaaa ccaaattggc tagaagcttg
61 ctatacagag ctttgtttct cagtatacgt agcaaaattc aagtacgatt cctatagatg
121 aattactaca ctttagttaa aaattaggtt gtcacggttc aaaaaattgt cttagaaaat
181 atacaacgca tttaaaaatg ccaaaatcac cacataaatt atttaaagct aatagcttct
241 ggaaagagaa caatttaata ttgcgagaaa taaaacattt tcgcaaaata gctatattag
301 ccgtaatatt ttcattttta gctgccagtt ttgaaggtgt tagtattggt tttttactgt
361 catttttaca aaaattgact agtcctaatg atccaattca aacaggaatt agctgggttg
421 acatgatttt ggctgctgat gcttggccga ttccccccat atacaggata tctctactaa
481 ttttgttgag tacctggatg cgtgctacct tcaattattt tggcggagta tacactgaat
541 cagcccaact taatttagca gatcgcttac ataagcaaat ttttgagcaa ttacaagctc
601 tgaggttgag ttactttgct caaactcgtt cgggtgaact cattaatacg attactacag
661 aaattgaaag aatcaaacag ggttttagtg ggttagcttt tgtgttaacc agaatcatga
721 ccgtttgcgt ttactttgtt gtgatgtttt ctatatcatg gcaactttcg attatttctg
781 tgctgatatt tttactgtta gcagtgggat tatctacgct gaataagcga gtcagagaaa
841 ctagctttgg catttctcat gctaatgctc aatttacggc tgtagctgta gagttcatca
901 atgggattcg gacaattcaa gcttttggga cacaagaatt tgaacggcaa cgtttctata
961 aagcgagtac taatcaactg aatgcagcta tcaaagttgt tttagcttgg acactagtaa
1021 aacctatagc tgaaggtata gctactacgg ttttaattag tttgattgtg atttccttcg
1081 cgacttttac attaccagtt gcctcattat tgacattctt ctttgtttta gtcagggtga
1141 ttcctaatat tcaagatatt aatggtacgg ttgcttttct gagtacttta caagggtcat
1201 cagaaaacat taaaaatatt ttgcagacga ataataaacc ttacttgaaa aatggcaaac
1261 ttcatttcca agggttgaag cgttcaattg atttagtatc cgtagatttt ggttatactg
1321 ctgataattt agtgttgaat aatatcactc tgacaattga gcgtggtaaa acaacagctc
1381 tggtgggtgc atcgggcgct ggtaaaacca cattagctga tttaattccc cgattttacg
1441 atccgacaga gggacagatt ttagtagatg ggcttgatgt acagtacttt gaaatcaatt
1501 ccctccgccg caaaatggct gtagttagtc aagatacatt tattttcaac acttctatta
1561 gagacaatat cgcctacggt acatctgggg cgagtgaagc ggaaattaga gaagtagcgc
1621 ggctagcaaa tgcgttgcaa tttatcgaag aaatgcccga agggtttgat actaagttag
1681 gcgatcgcgg tgtccgttta tctggaggac agagacaacg gattgcgatc gctcgtgcat
1741 tactccgaga tcccgaaatc ctcattcttg acgaagccac cagcgcccta gattcagtct
1801 ccgagcgatt aattcaggag tctatagaaa aactttccgt gggtagaaca gtaattgcga
1861 tcgctcacag actctccaca attgccaaag cagataaggt tgtggtgatg gaacaagggc
1921 gaattgttga gcagggaaat tatcaagaac ttctagaaca acgcggaaag ctctggaaat
1981 atcaccagat gcaacacgaa tcaggacaga ctaattcgta atatcaattc aaaattcaaa
2041 attcaaaatt caaaattagg gaagccgagc agaatcatgg ttttggggta tgtatctgtc
2101 ccattctttt ttcaaatcgg tataactccc caatccccaa tccccaatct ccagtcccca
2161 atccccaatc cccaatcccc aatccccaat ccccagtccc caatccccaa tcccatgaaa
2221 atttccgtca tcatctcgaa ttacaactat gctcgttatc tttctagagc aatcaactct
2281 gttctcgctc aaactcactc agacattgaa atcgttatcg tagatgatgg ttctacagat
2341 aacagccgtg atgttattac ccaactgcaa gaacaagcac cggataaaat caagcccatc
2401 tttcaagcaa atcaaggaca gggaggcgct ttcaatgcgg ggtttgcggc ggcgactggc
2461 gaagtcgtag cttttcttga tgcagacgat gtgtggaaac ctcataaatt acagcgtatt
2521 gttgaggtat ttcagacatc ggatgtagtt ggtgtcatgc accatttgga tatcatagat
2581 ggcaatgaca aaacaattga tcaagctt
//
LOCUS BOVTRLA 85 bp ss-tRNA RNA 18-JUL-1990
DEFINITION Bovine amber suppressor Leu-tRNA-cag.
ACCESSION M35066
KEYWORDS transfer RNA-Leu.
SOURCE Calf liver tRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 85)
AUTHORS Valle,R.P.C., Morch,M.-D. and Haenni,A.-L.
TITLE Novel amber suppressor tRNAs of mammalian origin
JOURNAL EMBO J. 6, 3049-3055 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
tRNA 1 85 Leu-tRNA
anticdn 35 37 Leu-tRNA anticodon cag
modified 6 6 m2g
modified 10 10 m2g
modified 12 12 ac4c
modified 17 17 gm
modified 19 19 d
modified 21 21 p
modified 27 27 m22g
modified 33 33 um
modified 38 38 m1g
modified 39 39 p
modified 45 45 um
modified 47 47 p
modified 50 50 m3c
modified 57 57 m5c
modified 64 64 p
modified 67 67 m1a
BASE COUNT 14 a 25 c 28 g 18 t
ORIGIN 5' end of mature tRNA.
1 gtcaggatgg ccgagcggtc taaggcgctg cgttcaggtc gcagtctccc tggaggcgtg
61 ggttcgaatc ccacttctga cacca
//
LOCUS BOVTRLB 86 bp ss-tRNA RNA 18-JUL-1990
DEFINITION Bovine amber suppressor Leu-tRNA-caa.
ACCESSION M36600
KEYWORDS transfer RNA-Leu.
SOURCE Calf liver tRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 86)
AUTHORS Valle,R.P.C., Morch,M.-D. and Haenni,A.-L.
TITLE Novel amber suppressor tRNAs of mammalian origin
JOURNAL EMBO J. 6, 3049-3055 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
tRNA 1 86 Leu-tRNA
anticdn 35 37 Leu-tRNA anticodon caa
modified 6 6 m2g
modified 10 10 m2g
modified 12 12 ac4c
modified 19 19 d
modified 21 21 p
modified 27 27 m22g
modified 33 33 um
modified 38 38 m1g
modified 39 39 p
modified 45 45 um
modified 47 47 p
modified 50 50 m3c
modified 58 58 m5c
modified 65 65 p
modified 68 68 m1a
BASE COUNT 16 a 24 c 28 g 18 t
ORIGIN 5' end of mature tRNA.
1 gtcaggatgg ccgagcggtc taaggcgctg cgttcaagtc gcagtctcgc atggaggcgt
61 gggttcgaat cccacttctg acacca
//
LOCUS BRPMBOI 62 bp ds-DNA INV 18-JUL-1990
DEFINITION B.malayi MboI repeat consensus sequence DNA.
ACCESSION M34369
KEYWORDS .
SOURCE B.malayi (strain sub-periodic) DNA.
ORGANISM Brugia malayi
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Gnathostomatidae.
REFERENCE 1 (bases 1 to 62)
AUTHORS Natarajan,S., Werner,C., Cameron,M. and Rajan,T.V.
TITLE Isolation and characterization of a repetitive DNA element from the
genome of the human filarial parasite, Brugia malayi
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.Natarajan, 18-MAY-1990.
author address: S.Natarajan
University of Connecticut
Dept. of Pathology
Farmington, CT 06032
BASE COUNT 24 a 12 c 4 g 22 t
ORIGIN
1 ccatttctct acagatataa caatatcact agaagacatt ttgattaatt cattaactca
61 ta
//
LOCUS DUKARGIN 1568 bp ss-mRNA VRT 18-JUL-1990
DEFINITION Duck argininosuccinate lyase/delta 2-crystallin mRNA, complete cds.
ACCESSION M35132
KEYWORDS argininosuccinate lyase/delta 2-crystallin.
SOURCE Duck 14-day old embryo lens, cDNA to mRNA, clone DC5-1.
ORGANISM Anas platyrhynchos
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Anseriformes; Anatidae.
REFERENCE 1 (bases 1 to 1568)
AUTHORS Wistow,G. and Piatigorsky,J.
TITLE Gene conversion and splice-site slippage in the argininosuccinate
lyase/delta-crystallins of the duck lens. Members of an enzyme
superfamily
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.J.Wistow, 26-MAR-1990.
FEATURES from to/span description
pept 74 1480 argininosuccinate lyase/delta 2-crystallin
signal 1538 1543 poly-A signal
site 86 91 6 inserted bases at splice site slip
BASE COUNT 389 a 398 c 439 g 342 t
ORIGIN
1 ccggggctgc actgcggaca cggttgcacc aggtgctggg atcgccgcac acgcgagcaa
61 aacgtcctct gagatggcat ctgaggcgag gggagataaa ctttggggag gaagattcag
121 tggaagcaca gatcccatca tggagaagct caactcttcc attgcctatg accagaggct
181 gtctgaagtt gatatccagg ggagcatggc ttatgccaaa gctctggaga aggctgggat
241 cctaactaag actgagctgg agaagatcct gagtggcctg gaaaagatct ctgaggaatg
301 gtctaaggga gtctttgtgg tgaaacagag cgatgaggat atccacactg ccaacgaacg
361 cagactgaag gagctgattg gtgatatcgc tggaaagctg cacaccggaa gaagcaggaa
421 tgatcaggtt gtgactgact tgaagctgtt catgaagaat tccctctcta tcatctccac
481 tcacctgctg cagctcatca agaccctggt ggaacgtgct gccatagaaa ttgatgttat
541 cttgcctggc tacacccacc tgcagaaggc ccagcccatc agatggagcc agttcttgct
601 cagccatgct gttgccctga cccgtgactc tgagcgcctg ggagaggtga agaagaggat
661 caacgtcctg cccctgggaa gcggtgctct ggctggcaac cccctggaca ttgacagaga
721 gatgctgcgt agcgagctgg agtttgcttc catcagcctg aacagcatgg atgccattag
781 tgagagagac tttgtggtgg aattcctctt ccttgccacc ctgctgatga tccaccttag
841 caagatggct gaagatctca taatctacag caccagcgag tttggcttcc tgaccctttc
901 tgatgccttc agcactggca gcagcctgat gcctcagaag aagaaccctg atagcctgga
961 actgatccgc agcaaggctg gacgtgtgtt tggacggctg gcttcaattc tcatggttct
1021 caaaggactt ccaagcacct acaacaagga cctgcaggag gacaaggagg ctgtctttga
1081 tgttgtggac accctgactg ctgtgctcca ggttgccact ggggtgatct ctaccctcca
1141 gatcagcaag gagaacatgg agaaggctct gacccctgag atgctggcta ctgacctggc
1201 tctctacttg gttcgcaaag gagtgccgtt cagacaagcc cacactgctt cggggaaggc
1261 cgtccacctt gccgagacca aaggcatcac catcaataaa ctcagcctgg aggacctgaa
1321 gagcatcagc ccccagttca gcagcgatgt ctcccaggtc ttcaactttg tcaacagcgt
1381 ggagcaatac actgccctgg cgggtaccgc caagagcagc gtgactaccc agatcgagca
1441 gctgagggag ctgatgaaga agcagaagga acaagcttag agtgtgggga gatttcccgt
1501 ggatgtagtg ttttgcttat cacactaatg cagagttaat aaacactgtg gtgtattgta
1561 gttcactc
//
LOCUS DUKARGINX 1562 bp ss-mRNA VRT 18-JUL-1990
DEFINITION Duck argininosuccinate lyase/delta 1-crystallin mRNA, complete cds.
ACCESSION M35133
KEYWORDS argininosuccinate lyase/delta 1-crystallin.
SOURCE Duck 14-day old embryo lens, cDNA to mRNA, clone DC6-2.
ORGANISM Anas platyrhynchos
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Anseriformes; Anatidae.
REFERENCE 1 (bases 1 to 1562)
AUTHORS Wistow,G. and Piatigorsky,J.
TITLE Gene conversion and splice-site slippage in the argininosuccinate
lyase/delta-crystallins of the duck lens. Members of an enzyme
superfamily
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.J.Wistow, 26-MAR-1990.
FEATURES from to/span description
pept 71 1471 argininosuccinate lyase/delta 1-crystallin
signal 1529 1534 poly-A signal
BASE COUNT 393 a 400 c 427 g 342 t
ORIGIN
1 gggctccact gcagacacgg ttgtaccagg tgccgggatc gccgcacacc cgagcaaaac
61 gtcgtctgag atggcatccg agggggataa acttatggga ggaaggtttg tcggaagcac
121 ggatcccatc atgcagatgc tcagcacttc catatccact gaacagaggc tgtctgaggt
181 tgatatccag gcaagcattg cttatgccaa agccctggag aaggctggga tcctaactaa
241 gactgaactg gagaagatcc tgagtggcct ggaaaagatc tctgaggaat tgtctaaggg
301 agtcattgta gtgacccaaa gcgatgagga tatccaaact gccaatgaac gcagactgaa
361 ggagctgatt ggtgatattg ctggaaagct gcacactgga agaagcagga acgaacaggt
421 tgtgactgac ttgaagctgt tcatgaagaa ttccctctct atcatctcca ctcacctgct
481 gcagctcatc aagaccctgg tggaacgtgc tgccatagaa attgatgtta tcttgcctgg
541 ctacacccac ctgcagaagg cccagcccat cagatggagc cagttcttgc tcagccatgc
601 tgttgccctg acccgtgact ctgagcgcct gggagaggtg aagaagagga tcaacgtcct
661 gcccctggga agcggtgctc tggctggcaa ccccctggac attgacagag agatgctgcg
721 tagcgagctg gagtttgctt ccatcagcct gaacagcatg gatgccatta gtgagagaga
781 ctttgtggtg gaattcctct ctgttgccac cctgctgctg atccacctta gcaagatggc
841 tgaagatctc atcatctaca gcaccagcga gtttggcttc ctgacccttt ctgatgcctt
901 cagcactggc agcagcctga tgcctcagaa gaagaaccct gatagcctgg aactgatccg
961 cagcaaggct ggacgtgtgt ttggacggct ggcttcaatt ctcatggttc tcaaaggact
1021 tccaagcacc tacaacaagg acctgcagga agacaaggag gctgtcattg atgttgtgga
1081 caccctgact gctgtgctcc aggttgccac tggggtgatc tctaccctcc agatcagcaa
1141 ggagaacatg gagaaggctc tgacccctga gatgctggct actgacctgg ctctctactt
1201 ggttcgcaaa ggaatgccat tcagacaagc ccacactgct tcggggaagg ccgtccacct
1261 tgccgagacc aaaggcattg ccatcaataa tctcaccctg gaggacctga agagcatcag
1321 ccccctgttc agcagcgatg tgtcccaggt cttcaacttt gtcaacagcg tggagcagta
1381 cactgccctg ggcggtaccg ccaagagcag cgtgactacc cagattgagc agctgaggga
1441 gctgatgaag aagcagaagg aacaagctta gagtgtgggg agatttcccg tggatgcagc
1501 attgtgctta tcacactaat gcagagttaa taaacactgt ggtgtattgt agttcactca
1561 aa
//
LOCUS VACFGENE 6185 bp ds-DNA VRL 18-JUL-1990
DEFINITION Vaccinia virus F1L, F2L, F3L, F4L, F5L, F6L, F7L, F8L, and F9L
mRNAs, complete cds.
ACCESSION M34368
KEYWORDS .
SOURCE Vaccinia virus (strain WR), cDNA to viral mRNA, clone HindIII F.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 6185)
AUTHORS Roseman,N.A. and Slabaugh,M.B.
TITLE The vaccinia virus HindIII D fragment: Nucleotide sequence of the
left 6.2-kb
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.A.Roseman, 17-MAY-1990.
Author address: N.A.Roseman
Oregon State University
Biochemistry and Biophysics
Corvallis, Oregon 97331
FEATURES from to/span description
pept 119 757 F9L protein
pept 817 1014 F8L protein
pept 1162 1404 F7L protein
pept 1420 1644 F6L protein
pept 1674 2642 F5L protein
pept 2674 3633 F4L protein
pept 3644 5086 F3L protein
pept 5110 5553 F2L protein
pept 5565 > 6185 F1L protein
BASE COUNT 2175 a 946 c 1138 g 1926 t
ORIGIN
1 gaattcatca tgtgtaccaa aacagactgt gataaatata gattaaaggt ttccattctt
61 cacccaatta gtttcttgga aaaatttatt atgagagaca ttttctcaga ctggataaat
121 ggcggaaact aaagagttta aaactttgta taatcttttt atagatagtt atttacaaaa
181 attagctcaa cattctatcc ctactaatgt cacttgtgct attcatatag gagaggttat
241 aggacagttt aaaaattgcg cgctccgaat aactaacaaa tgcatgagta attctcgact
301 tagtttcaca ctcatggttg aatcatttat tgaagtgatt tcattgcttc cggaaaagga
361 tagaagacgt atcgctgaag aaataggaat agatctagac gatgtaccta gtgcggtatc
421 caagctagaa aagaactgta atgcgtatgc ggaggttaat aatattatag atatacagaa
481 attagatatc ggagaatgtt cggctccgcc cggtcaacat atgcttttac agatagttaa
541 tacaggatcc gcggaacgaa attgtggttt acagacaatt gttaagtcct taaataaaat
601 atacgttcca cctattatcg aaaaccgatt gccgtattac gatccgtggt ttctagtggg
661 tgtagcaatt attctagtta tttttactgt agctatttgt tctattagac gaaatctggc
721 tcttaaatac agatacggaa cgtttttata cgtttaatta ataaaaaaat ttaattacaa
781 ggtatataat agtactccat ctacgcaatc gcgataatgg agggatctaa acgcaaacac
841 gacagtcggc gactacaaca agaacaggag cagcttcgtc cacgtacacc gccatcatat
901 gaagaaattg caaaatatgg acactcattt aacgtgaaaa gatttacgaa tgaagaaatg
961 tgtcttaaga atgattatcc acgaattata tcatataatc ctccaccaaa atagagtata
1021 tatatatcat catttcatga tgtatactac tgacatagtt tcaatgtgaa cttttcactt
1081 tcttgccggt tatgaagaat attttttatt ttaatggtca ttactaatcg tatattataa
1141 ttgaaaatgg attagtttaa tatgacgctc gtcatgggat cctgctgtgg tagattctgt
1201 gacgctaaga ataagaataa gaaggaagat gtagaagagg gaagagaagg atgttacaat
1261 tataagaacc ttaatgatct ggatgaatcc gaagcacgtg tagaatttgg accattatat
1321 atgataaatg aagaaaaatc agacataaat acattggata taaaaagaag atatagacac
1381 acgatagagt ctgtatattt ctaaaagttt ttataaaaaa tgagtaaaat actcacgttt
1441 gttaaaaata agataattga cttgattaat aatgaccaaa ttaaatattc tagagttata
1501 atgatagaag agtccgatag tcttttaccg gttgatgagg tgcatgctaa ccacggattt
1561 gactgtgtgg agatgataga tgaaaatata agcaatgaga atatcgaaca gtataaaacc
1621 gaatcttttt ttacaataaa ttgaaatcaa aacatttatt aaaccgcatc aagatgggta
1681 cgaacggcgt tagagtattt gtcattttat atttgttggc tgtatgcgga tgtatcgaat
1741 acgacgtaga cgataatgta catatttgta cccacactaa cgtgtcacat attaatcaca
1801 ctagttggta ttataatgat aaggttatag cgctagccac cgaggataaa acttctggtt
1861 atatatcatc attcataaaa cgcgttaata tctcattaac ttgtttaaat atatcgagtt
1921 tgcggtacga agattctggt acatacaaag gagtatcgca tctaaaagat ggagtcatcg
1981 ttacaactac tatgaatata tctgtaaagg ctaatatcat tgacttgact ggtagagtgc
2041 gttatctaac cagaaattat tgcgaagtta aaatacgatg cgaaataaca tctttcgcgc
2101 ttaatggttc tactacacca ccacatatga tattaggaac agtagataaa tggaaatatc
2161 ttccatttcc tacagatgat tatagatacg taggggaact gaaaagatat atatctggaa
2221 acccatatcc aacagagtcg ctagcgttag aaattagctc gacgtttaat cggtttacta
2281 tcgttaaaaa tttgaacgat gacgagtttt cttgttatct gttttcacaa aattatagtt
2341 tccataaaat gttgaatgtg cgtaacattt gtgaatccga atgggaggcg ttaaataata
2401 ataacgataa ttcatcctcc atgcccgctt cccacaacaa tctcgcaaac gatttatcta
2461 gtatgatgtc acaattacaa aatgataatg atgataataa cgattattca gcacccatga
2521 atgtcgataa tttaattatg atagtactaa taacaatgct atcaataata cttgtaatta
2581 ttgtagtgat tgcggcgata tcgatgtaca aaagatccaa gtacaggcat atagataact
2641 gaaaaaaaat ttattgttat tgttaattta gttatggaac ccatccttgc accaaatcca
2701 aatagatttg ttattttccc aatccaatat tatgacatct ggaacatgta taaaaaggca
2761 gaggcatcat tttggacagt ggaagaagta gatatatcta aagatatcaa tgattggaat
2821 aaactaacac cagacgaaaa atattttata aaacatgtat tggcgttttt tgcagccagt
2881 gacggaatag tgaatgaaaa tttggcggaa cgattttgta cagaagtaca gattaccgag
2941 gctagatgtt tctacggatt tcagatggcc attgaaaaca ttcattcgga aatgtatagt
3001 cttttgatcg atacttatgt taaagatagt aatgaaaaaa actatctctt taatgccata
3061 gaaacgatgc cttgtgtaaa aaagaaggcc gattgggctc aaaagtggat acatgacagc
3121 gccggttatg gagagagact tattgccttt gctgcagtag aaggaatctt cttttctgga
3181 tcattcgctt ccatattttg gcttaaaaag cgtggcctaa tgcccggact cacgttttcc
3241 aacgaactaa ttagtagaga cgagggtctg cactgcgatt tcgcatgttt gatgtttaaa
3301 catttattgc atccaccgag tgaagaaacc gttagatcta ttataacaga tgcggtatcc
3361 attgaacaag aatttcttac tgcggctctt ccagttaaac ttataggaat gaattgtgaa
3421 atgatgaaaa catatataga attcgtcgcg gatagattga tttctgaatt gggatttaaa
3481 aaaatttata atgttaccaa tccgtttgat ttcatggaaa atatatcatt ggaaggaaaa
3541 actaattttt tcgaaaaacg tgtgggtgaa taccaaaaaa tgggagttat gtctcaagaa
3601 gataatcatt tttctttaga tgttgacttt taaagaaaca taaatgccga tatttgttaa
3661 tactgtgtac tgtaagaata tattagcatt gtctatgact aagaaattca aaacaattat
3721 tgatgctata ggtggcaata taatagtcaa ttctacgata ttgaaaaagt tatctcctta
3781 ctttcgcaca catttacgtc aaaaatacac gaaaaataaa gatccagtta cttgggtttg
3841 tctagacctt gacattcaca gtttaacttc tatagttatt tactcgtata ctggaaaggt
3901 atatatagat agtcataacg tcgtcaattt attacgtgct tctatattaa cctctgtaga
3961 atttatcatc tacacttgta taaactttat cttacgagat tttagaaagg aatattgtgt
4021 cgagtgttac atgatgggta tagaatacgg actatccaat ctcttatgtc atactaaaaa
4081 ctttattgcc aaacactttt tggaactgga agatgacatc atagacaatt ttgattatct
4141 atctatgaaa cttattctag aaagcgatga actaaatgtt ccagatgagg attatgtagt
4201 tgattttgtc attaagtggt atataaagcg aagaaataaa ttaggaaatc tgctactcct
4261 tatcaaaaat gtaatcaggt caaattatct ttctcccaga ggtataaata atgtaaaatg
4321 gatactagac tgtaccaaaa tatttcattg tgataaacaa ccacgcaaat catacaagta
4381 tccattcata gagtatccta tgaacatgga tcaaattata gatatattcc atatgtgtac
4441 aagtactcat gttggagaag tagtatatct catcggtgga tggatgaaca atgaaataca
4501 taacaatgct atagcggtaa attatatatc aaacaattgg attccaattc ctccgatgaa
4561 tagccccaga ctgtatgcta gcgggatacc cgctaacaat aaattatacg tagtaggagg
4621 tctaccaaat cccacatctg ttgagcgttg gttccacggg gatgctgctt gggttaatat
4681 gccgagtctt ctgaaaccta gatgtaatcc agcagtggca tccataaaca atgttatata
4741 cgtaatggga ggacattctg aaactgatac aactacagaa tatttgctac ccaatcatga
4801 tcagtggcag tttggaccat ccacttatta tcctcattat aaatcatgcg cgttagtgtt
4861 cggtagaagg ttattcttgg ttggtagaaa tgcggaattt tattgtgaat ccagcaatac
4921 atggactctg atagatgatc ctatttatcc gagggataat ccagaattga tcatagtgga
4981 taataaactg ctattgatag gaggatttta tcgtgaatcg tatatagata ctatagaagt
5041 gtacaatcat cacacttatt catggaatat atgggatggt aaataatttt gaaataaaat
5101 attagtttta tgttcaacat gaatattaac tcaccagtta gatttgttaa ggaaactaac
5161 agagctaaat ctcctactag gcaatcacct tacgccgccg gatatgattt atatagcgct
5221 tacgattata ctatccctcc aggagaacga cagttaatta agacagatat tagtatgtcc
5281 atgcctaagt tctgctatgg tagaatagct cctaggtctg gtctgtccct aaaaggcatt
5341 gatataggag gcggtgtaat agacgaagat tataggggaa acataggagt cattcttatt
5401 aataatggaa aatgtacgtt taatgtaaat actggagata gaatagctca gctaatctat
5461 caacgtatat attatccaga actggaagaa gtacaatctc tagatagtac aaatagagga
5521 gatcaagggt ttggatcaac aggacttaga taataaacaa tagtatgttg tcgatgttta
5581 tgtgtaataa tatcgtagat tatgtagatg atatagataa tggtatagta caggatatag
5641 aagatgaggc tagcaataat gttgatcacg actatgtata tccacttcca gaaaatatgg
5701 tatatagatt tgacaagtcc actaacatac tcgattatct atcaacggaa cgggaccatg
5761 taatgatggc tgttcgatac tatatgagta aacaacgttt agacgacttg tatagacagt
5821 tgcccacaaa gactagatca tatatagata ttatcaacat atattgtgat aaagttagta
5881 atgattataa tagggacatg aatatcatgt atgatatggc atctacaaaa tcatttacag
5941 tttatgacat aaataacgaa gttaatacta tactaatgga taacaagggg ttgggtgtaa
6001 gattggcgac aatttcattt ataaccgaat tgggtagacg atgtatgaac ccagtagaaa
6061 cgataaaaat gtttactcta ctatcgcata ctatatgcga tgattatttt gtagattata
6121 taacggacat ttcaccacca gataatacca tccctaacac tagcacgcgt gaatatctaa
6181 agctt
//
LOCUS BSUCCCA 1172 bp ds-DNA BCT 18-JUL-1990
DEFINITION B.subtilis 13 kD cytochrome c-550 (cccA) gene, complete cds.
ACCESSION J05569
KEYWORDS cytochrome c-550.
SOURCE B.subtilis (strain w168) DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1172)
AUTHORS von Wachenfeldt,C. and Hederstedt,L.
TITLE Bacillus subtilis 13 kDa cytochrome c-550 encoded by cccA, consists
of a membrane-anchor and a heme-domain
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.O.S.von Wachenfeldt, 12-JUN-1990.
FEATURES from to/span description
pept 542 904 cytochrome c-550 (cccA)
BASE COUNT 400 a 170 c 264 g 338 t
ORIGIN 223 min on B.subtilis map.
1 cagctggaag atgtgcttga tacgttaact gatcgtgaag aaaatgtatt gcgtcttcga
61 ttcggtcttg atgacggccg tacaagaaca ttagaagagg tcggcaaagt atttggagta
121 acgagagagc gtattcgaca aatcgaagcc aaagcgttgc ggaaactaag acatcctagc
181 agaagtaaac gtttgaaaga tttccttgaa taagatggaa cgggtcttga agatccgttc
241 ttcttttttt aaaaagatat atggataata tgcctttatt ttactgaaaa atgatgtcat
301 ttgcaaatga acattgtggt gaaaaatttc aaaatctaat tccatatttt ctattgtaag
361 cgtatacaat acattataca atagaataaa aaggatatta gagatttagg catgtttcta
421 ttcaattttg tataaagtgt gaataaaaaa cttttgtata gcaatccatt tactttttgt
481 aaaaataagt tagaattaga agtgtttaca tagggggaag gatcaaaaag ggggaaggga
541 aatgaaatgg aacccgctta ttccattttt gctgatcgct gttttaggaa ttggtctaac
601 tttcttttta tcagtaaaag gacttgatga ctctcgggag attgcgagcg gaggagaaag
661 caaatctgct gaaaagaaag atgcaaacgc ttcaccagaa gaaatttaca aggcaaactg
721 catcgcatgc cacggtgaaa attatgaagg tgtttcgggt cctagcttaa aaggtgtagg
781 ggataagaaa gatgtcgctg aaattaaaac aaagatcgaa aaaggcggta atggaatgcc
841 ttccggtctt gttccggccg ataagctgga tgatatggca gagtgggtgt caaaaattaa
901 ataaaagaac tatttttctc tttatacatt cacaggccgc cgagatgttc tcgacagcct
961 tttatttttg actgctgcgt gcagacccgc atcaaaagaa ataatttcaa aatcatttgc
1021 ttcagcttgg attccgggct gattttttat atgatagtaa aggaaaaaat gattgagaaa
1081 agttgtgatg acaagtgaac gaattaaaat tatctaaacg attgcaaaca gtagcggagt
1141 acataccgaa tggagcggtg atggccgata tc
//
LOCUS ECOGACAR 1630 bp ds-DNA BCT 18-JUL-1990
DEFINITION E.coli glutamate and aspartate carrier (gltP) gene, complete cds.
ACCESSION M32488
KEYWORDS glutamate and aspartate carrier.
SOURCE E.coli (strain K12-BK9MDG) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1630)
AUTHORS Wallace,B., Yang,Y.-J., Hong,J. and Lum,D.
TITLE Cloning and sequencing of a gene encoding a glutamate and aspartate
carrier of E.coli K-12
JOURNAL J. Bacteriol. 172, 3214-3220 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
Y. Hong, 02-MAR-1990.
FEATURES from to/span description
pept 151 1338 glutamate and aspartate carrier (gltP)
BASE COUNT 330 a 391 c 472 g 437 t
ORIGIN
1 caagggttgc gcaacatacc gcgcaaatga tactgatcat aagcgttaaa aaaatctaca
61 aaccaacgca acacaattca tgccctggca gtatgtcacg ttctcgcgtt tctgaacggg
121 gaacggcgct ccattgagga agtcattcat atgaaaaata taaatttcag cctggcctgg
181 cagattctgt ttgctatggt gctgggcatt ctcctgggaa gctacctgca ctaccatagc
241 gacagccgcg actggctggt cgtcaatttg ctctctccgg cgggtgatat cttcatccat
301 ctgattaaaa tgattgttgt gccgattgtg atctccacgc tggtggtggg tatcgcgggt
361 gttggtgatg ccaaacagct cgggcgtatt ggcgcgaaaa ccattatcta cttcgaggtg
421 atcaccaccg tcgccatcat tttgggatca ctctggcgaa ccgtcttcca gcccggtgcc
481 ggggtggata tgtcgcagtt ggcgaccgtc gatatctcga aatatcagag cactacggaa
541 gcggtacaaa gcagttccca cggcattatg ggcacgattt tgtcgctggt gccgacgaac
601 attgtggcgt cgatggcgaa aggcgaaatg ctgccgatca tctttttctc ggtgctgttt
661 ggtctggggc tttcttccct gcccgcgacg catcgtgaac cgctggtgac cgtgttccgc
721 tccatctctg aaaccatgtt taaagtgact cacatggtga tgcgttatgc accggtgggt
781 gtgtttgcgc tgattgcggt gacggtggct aactttggtt tctcgtctct gtggccactg
841 gcgaaactgg tgctgctggt gcatttcgcc attctgttct tcgcgctggt agtgctggga
901 attgtggcgc gcctgtgcgg gttaagcgtc tggatcctga ttcgtattct gaaagatgag
961 ctgattctgg cgtactccac tgccagctct gaaagcgtcg tgccgcgaca tatgagaaga
1021 tggaagccta cggagcaccg gtgtcgatca ccagtttcgt ggtgccgacc ggttaacctc
1081 ttttaccttg atggttcgac gctgtatcaa agtattgccg ctatcttcat cgcgcagttg
1141 tatggcattg acctgtccat ctggcaggaa atcattctgg tcgtgacgct gatggtgacc
1201 tcgaaaggga ttgctggcgt gcctggcgtg tcgtttgtgg tgttgctggc aacgctgggt
1261 agcgtaggta tccgctggaa ggtctggcgt ttattgctgg tgttgaccgt atcctcgaca
1321 tggcgcgtac tgccgtgaac gtggtgggta atgcgctggc ggtgctggtg attgccaagt
1381 gggaacacaa atttgaccgt aagaaagcgc tggcttatga gcgtgaagtg ctgggcaaat
1441 ttgataaaac tgcggatcaa taattgaaga ttgccgggga tatccacccg gcaatgtgtg
1501 aatgcctgat gcgacgcttg ccgcgtctta tcaggcctac gccagacagc gcaatagcct
1561 gatttagcgt gattttgtag gtcggataag gcgtttatgc cgcatccgac atcaacgcct
1621 gatgcgacgc
//
LOCUS ECOLYSU 3034 bp ds-DNA BCT 18-JUL-1990
DEFINITION E.coli lysyl-tRNA synthetase (lysU) gene, complete cds.
ACCESSION M30630
KEYWORDS lysyl-tRNA synthetase.
SOURCE E.coli (strain K12) isolate CS520 DNA, clone pLC4-5.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 3034)
AUTHORS Clark,R.L. and Neidhardt,F.C.
TITLE Roles of the two lysyl-tRNA synthetases of Escherichia coli:
Analysis of nucleotide sequences and mutant behavior
JOURNAL J. Bacteriol. 172, 3237-3243 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.L.Clark, 12-DEC-1989.
FEATURES from to/span description
pept 316 1821 lysyl-tRNA synthetase (lysU) (E.C. 6.1.1.6)
signal 194 199 -35 signal
signal 216 221 -10 signal
BASE COUNT 850 a 654 c 665 g 865 t
ORIGIN 93.55 min on K12 map.
1 cgatcatcgt cgtattggcc tttgctaccc gttttctgtt tagcacgccg acgaatatga
61 tacaggagag caacgattaa tgtcgtaaaa acaattggtt atgaataaat taactttcag
121 tttataacac taacaaatcg agattaataa gtcatttaaa ccattttgat ggttatttat
181 tagtgatatc aacttgaggt aagcgttagt ttcgataaga taaactgagt tactaatagt
241 cgaggcagat aatacagtgt accgatctgt ctctttatct acgctaaatt gaaagctgga
301 tttagaggaa ccaaaatgtc tgaacaagaa acacggggag ccaatgaggc tattgatttt
361 aacgatgaac tgagaaatcg ccgcgaaaaa ctggcggcac tacgtcagca aggtgtggcg
421 tttcccaatg attttcgccg cgaccatacc tctgaccagt tgcacgaaga gtttgatgcg
481 aaggataacc aggaactgga atccttaaac attgaagtct cggttgctgg ccgaatgatg
541 acccgtcgta tcatggggaa agcctccttt gtaacgttgc aggatgtcgg tggccgtatt
601 caactgtacg ttgcaagaga tagcctgcca gaaggtgttt ataacgatca gtttaaaaaa
661 tgggatctgg gtgacattat cggtgccggt acgctgttta agacgcaaac gggtgagctt
721 tccattcact gtactgagct gcgcctgctg actaaagcac tacgtccttt accagataaa
781 ttccatggtc tgcaggatca ggaagtccgt tatcgtcaac gttatctgga cctcatcgct
841 aacgataaat cccgtcaaac gtttgttgtc cgttcaaaaa ttctggccgc tatccgtcaa
901 ttcatggtcg cgcgcggctt tatggaagta gaaaccccga tgatgcaggt aattccaggt
961 ggggcatctg ctcgcccgtt tattacccat cataatgctc tggatttaga tatgtatgct
1021 cgtatcgcgc cggagctgta tctgaaacgt ctggttgtag gcggttttga acgggtattc
1081 gaacacgtaa ctcgtaatga aggtattcgt cgccataatc ctgagttcac aatgatggaa
1141 ctctacatgg cgtatgcgga ttaccacgat ttgattgaac tgacagagtc actgttccgc
1201 accctggcac aagaggttct gggtaccact aaagtcactt atggcgagca tgtgtttgat
1261 ttcggcaaac cgtttgaaaa actcaccatg cgcgaagcaa tcaaaaaata tcgtccagaa
1321 accgatatgg ccgacctgga taattttgat gctcgtaaag cattagctga atctatcggt
1381 attacggtag agaaaagctg ggggttggga cgtagtgtca cagagatctt tgatgaagtg
1441 gtagaaggac atgtgattca gccatccttt attacggaat atccggcaga agtgtccccg
1501 ctggcacgtc gtaatgatgt taacccggaa atcaccgacc gttttgaatt cttcatcggt
1561 ggtcgtgaaa tcggtaatgg ttttagcgaa ttaaacgacg cagaagatca ggctgaacgt
1621 ttccaggaac aggttaatgc taaagctgca ggtgacgacg aagccatgtt ctatgacgaa
1681 gattacgtga ctgcgctgga atatggtctg ccgccaaccg ctggtctggg tattggtatc
1741 gaccgaatga ttatgctgtt tactaacagc catactattc gcgacgttat tctcttcccg
1801 gcgatgcgcc cacagaaata aatttcactt taatgaacga agcagtcagg cgactgcttc
1861 gttcatcgtt ccgcaaatcc agttttgagg agagacgctt cgtttcacga taaccagcgg
1921 gtccactcaa ctacagctcc taaaagagag aaaaaagtaa ctcgacaggc agactcattg
1981 gccaggttat tccgaccagg aatgcactaa gtaaacgaat atgtgttttc taatgataga
2041 gataaaccag agtaatcgca ggttattcac gccgaatacg ttcacacatc acataaaaaa
2101 ttaatatttt agagaatagt gccatttttt atgttcttct tatccttcct ggctacttgt
2161 aaaactgacg gaaatagtca tcaactgtca taatgttgag gtcaaaagca atatccggta
2221 agtctaagca tttgagcctg attaaatctt cgatttcact tgcggctaac gatgtcgtta
2281 ttgcgaagga gttttcattc aattgtcgtg gtatcccata ttcatcagcc acgatagagg
2341 agaaattcat atcatacata catcaatgtt cttaatctta agaagagtgg ggaataagat
2401 gatggataat caaaagaaac aacataagta ttgaacttgt ttttacctcc ataatatgac
2461 tcctatcaat attaaacgct ctggggacgc aaaacttatt aaccccgatg atgattctca
2521 tataggagat attctttttt ttgctcagta aaggggatgg acttgggaca aaaaaacgcg
2581 ctctctgccc attctttgtg catcccccta cccttcagat gctctccttg catcacatcg
2641 cggagagaat gaaaaatatc gtgactctgt actctcccat caactgcgaa tatcatcata
2701 ctcccgcgtt ttatcaaatt catgtttcgc aaatgggcat aatgggataa tttttcgttt
2761 ttcccgacgc attttttcca cgactttcgc aaccagctgt taccaatccc cgttgccttt
2821 caggctttca tcgacatcgt atgttcgata actcgctaaa tctccgtcgg tcggcacaaa
2881 gacaatttca gcgatttgct tgccttgttt gtcattaatg taaaatttat tgtggccttc
2941 gcgtatttcc attttaccct cgcttattta tgacggtatt tcagcgcgcc gctcgggcac
3001 gtatcatcac tttaaccaca gtggcgacgt cgac
//
LOCUS PSEBCD 4007 bp ds-DNA BCT 18-JUL-1990
DEFINITION P.aeruginosa pilin biogenesis protein (pilB, pilC and pilD) genes,
complete cds.
ACCESSION M32066
KEYWORDS pilin biogenesis protein.
SOURCE P.aeruginosa (strain PAK) DNA.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 4007)
AUTHORS Nunn,D., Bergman,S. and Lory,S.
TITLE Products of three accessory genes, pilB, pilC and pilD are required
for biogenesis of Pseudomonas aeruginosa pili
JOURNAL J. Bacteriol. 172, 2911-2919 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.N.Nunn, 12-FEB-1990.
FEATURES from to/span description
pept 199 1899 pilin biogenesis protein (pilB)
pept 1903 3123 pilin biogenesis protein (pilC)
pept 3127 3999 pilin biogenesis protein (pilD)
mRNA 8 < 1 (c) pilA mRNA
BASE COUNT 882 a 1244 c 1110 g 771 t
ORIGIN
1 cgcctaaccc taccaagcac ttaccatgcc aaataccgtt cacatagcga gcaaggccgt
61 cttatgccct tcccgaggag actcaaactt aaaacggaca ggatgtgaca ctttttgaca
121 gccggactcc cccgatttgg cagaccctgt tatctgcgtt ataaggcact tcaatagccc
181 tctgcgattt ccttctccat gaacgacagc atccaactga gcggtctgtc ccgacagctc
241 gtccaagcca atctactcga cgaaaagacc gccctgcagg cccagacgca ggcgcagcgc
301 aacaagctgt cgctggtgac gcacctggtg cagaacaagc tggtaagcgg cctggccctg
361 gccgaattgt ccgccgagca gttcggcatc gcctattgcg acctgaacag cctggacagg
421 gaaagctttc cgcgggacgc catcagcgaa aagctggtcc gccagcaccg ggtcattccc
481 ctgtggcggc ggggcaacaa gttgttcgta ggcatctccg acgccgccaa ccaccaggcc
541 atcaacgacg tccagttcag cactggcctg acgaccgagg ccattctggt cgaggacgac
601 aagctcggcc ttgccatcga caagctattc gagaatgcaa ccgacggctt ggcaggtctc
661 gatgacgtgg acctggaagg actggatgta ggcgtcaagg aaacgtccgg acaagaagat
721 actggcgcgg aggcagacga cgctcccgtc gtccgcttcg tcaacaagat gctgctggac
781 gctatcaaag gcgggtcttc cgacttgcac ttcgagccct acgagaagat ctaccgggtg
841 cgcttccgta ccgacggcat gcttcacgaa gtggccaagc cgccgatcca gttggccagt
901 cgtatctctg ctcgtctcaa ggtaatggcc ggcctggata tctccgaacg acgcaagccg
961 caggacggac ggatcaagat gcgcgtgtcg aagaccaagt ccatcgactt ccgcgtcaac
1021 accctgccga ccctgtgggg cgagaagatc gtgatgcgga tcctcgactc ctccagcgcg
1081 cagatgggca tcgacgccct gggctacgag gaggaccaga aggaactcta cctggccgca
1141 ctcaagcagc cgcagggcat gatcctggtc accggcccca ccggctcggg caagacggta
1201 tcgctgtaca ccggcctgaa catcctcaat accaccgaca tcaacatttc caccgccgaa
1261 gacccggtgg agatcaacct ggaaggcatc aaccaggtca acgtcaatcc gcgccagggc
1321 atggacttct cccaggcgct gcgcgccttc ctgcgccagg acccggacgt gatcatggtc
1381 ggcgagatcc gcgacctgga gaccgccgag atcgccatca aggcggcgca gaccgggcat
1441 atggtgatgt ccaccctgca caccaacagc gccgccgaga ccctgacccg cctgctgaac
1501 atgggcgtac cggcgttcaa cctggcgacc tcggtgaacc tgatcatcgc ccagcgcctt
1561 gcgcgaaaac tctgttcgca ctgcaagaaa gagcacgacg taccgaagga aaccctgctt
1621 cacgaaggtt tcccagaaga gttgatcggc accttcaagc tgtattcacc agtgggctgc
1681 gaccattgca agaacggtta caagggccgt gtcggtattt atgaagtagt taaaaacacc
1741 ccggccctgc agcggattat catggaggaa ggcaactcca tcgagatcgc cgagcaagcc
1801 cgcaaagaag gcttcaacga tctgcgcacc tcaggcctgc tgaaagccat gcaggggatc
1861 accagcctgg aggaagtcaa ccgcgtgacc aaggattaat ccatggcgga caaagcgtta
1921 aaaaccagcg ttttcatctg ggagggcacc gacaagaaag gcgccaaggt caagggcgaa
1981 ctgaccgggc agaatcccat gctggtgaaa gcccatctgc gcaagcaagg catcaatccg
2041 ctcaaggtac gcaagaaagg tatctccctg ctgggcgcag gcaagaaagt gaaacccatg
2101 gacatcgccc tgttcacccg gcagatggcg accatgatgg gcgctggcgt tcccctcctg
2161 caatcgttcg acatcatcgg cgagggcttc gacaacccca acatgcgcaa gcttgtggat
2221 gaaatcaaac aggaagtttc ctcaggtaac agcctagcca actccttgag aaaaaagccc
2281 cagtattttg acgagcttta ttgcaacctg gtagatgcag gggaacagtc tggcgccttg
2341 gaaaaccttc tcgatcgggt ggcaacctat aaagaaaaga cggaatcact gaaagccaag
2401 atcaaaaagg cgatgaccta tcccattgcc gtcatcattg tcgcactgat tgtatctgcg
2461 atcctcctga ttaaagtggt tccacaattt cagtcggtct ttgaaggttt cggcgcggaa
2521 cttcccgcct ttacccagat gattgtcaat ctatcggagt tcatgcagga gtggtggttc
2581 ttcatcatac tggcgatagc gatatttggc tttgcattca aagaattgca taaacgctca
2641 caaaaattcc gtgacacact cgacagaacg atcctcaaac ttcccatttt cggaggcatc
2701 gtctacaaat ctgcggtcgc ccgttatgca cggaccttgt ccacgacctt cgccgcgggt
2761 gttcccctgg tcgatgcgct cgactccgtc tccggagcga ccggcaatat cgtgttcaag
2821 aacgcggtca gcaagatcaa gcaagacgtt tccaccggca tgcagctcaa cttctccatg
2881 cgcaccacca gcgtctttcc caacatggcg atccagatga ccgccatcgg cgaggagtcc
2941 ggttcgctcg atgagatgct gagcaaagtc gccagctact acgaagagga agtcgacaac
3001 gccgtggaca acctcaccac gctcatggaa ccgatgatca tggccgttct cggcgtactg
3061 gttggcggtc tgatcgtggc catgtacctt ccgatcttcc aactcggcaa cgtcgtcgga
3121 taactgatgc ccctcctcga ctacctggcc agccatccgc tggcctttgt cttgtgcgcc
3181 atcctccttg gcctgctggt cggcagcttc ctcaacgtcg tcgtgcatcg cctgccgaag
3241 atgatggagc gcaactggaa ggccgaagcc cgcgaagcgc tcgggctgga gccggaaccc
3301 aagcaggcga cctacaacct ggtgctgccg aattcggcgt gcccgcgctg cggccacgag
3361 atccgcccct gggaaaacat cccgctggtc agctacctgg cgctgggcgg caagtgctcg
3421 tcctgcaagg ccgccatcgg caaacgctac cccctggtgg aactggccac cgcgctgctt
3481 tccggctacg tcgcctggca tttcggcttc acctggcagg cgggcgcgat gctcctgctg
3541 acctggggtc tgctggcgat gagcctgatc gatgccgatc accaactgct gccggacgtg
3601 ctggtgctgc cgctgctctg gctgggcctg atcgccaacc atttcggcct cttcgccagc
3661 ctcgacgatg cgctgttcgg cgcggtgttc ggctacctga gcctgtggtc ggtgttctgg
3721 ctgttcaagc tggtgaccgg caaggagggc atgggctacg gcgacttcaa gctgctggcc
3781 atgctcggtg cctggggtgg ctggcagatc ctgccgctga ccatcctgct gtcgtcgctg
3841 gtcggggcga tcctcggggt gatcatgctg cgcctgcgca acgccgaaag cggcacgccg
3901 atccccttcg gtccctatct ggctattgcc gggtggattg ctttgctctg gggtgatcaa
3961 ataacccgga cctatctgca attcgccgga ttcaaatgac ccaaccc
//
LOCUS RADHRPER 955 bp ds-DNA SYN 18-JUL-1990
DEFINITION Synthetic horseradish peroxidase isoenzyme C (HRP-C) subunit
alpha-1 DNA, complete cds.
ACCESSION J05552
KEYWORDS peroxidase.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 955)
AUTHORS Smith,A.T., Santama,N., Dacey,S., Edwards,M., Bray,R.C.,
Thornely,R.N.F. and Burke,J.F.
TITLE Expression of a synthetic gene for horseradish peroxidase C in
E.coli and folding and activation of the recombinant enzyme with
calcium and heme
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.T.Smith, 13-JUN-1990.
FEATURES from to/span description
pept 11 940 synthetic horseradish peroxidase isoenzyme C
(HRP-C) subunit alpha-1 (E.C. 1.11.1.7)
BASE COUNT 246 a 269 c 221 g 219 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttaacc atgcagttaa cccctacatt ctacgacaat agctgtccca acgtgtccaa
61 catcgttcgc gacacaatcg tcaacgagct cagatccgat cccaggatcg ctgcttcaat
121 attacgtctg cacttccatg actgcttcgt gaatggttgc gacgctagca tattactgga
181 caacaccacc agtttccgca ctgaaaagga tgcattcggg aacgctaaca gcgccagggg
241 ctttccagtg atcgatcgca tgaaggctgc cgttgagtca gcatgcccac gaacagtcag
301 ttgtgcagac ctgctgacta tagctgcgca acagagcgtg actcttgcag gcggaccgtc
361 ctggagagtg ccgctcggtc gacgtgactc cctacaggca ttcctagatc tggccaacgc
421 caacttgcct gctccattct tcaccctgcc ccagctgaag gatagcttta gaaacgtggg
481 tctgaatcgc tcgagtgacc ttgtggctct gtccggagga cacacatttg gaaagaacca
541 gtgtaggttc atcatggata ggctctacaa tttcagcaac actgggttac ctgaccccac
601 gctgaacact acgtatctcc agacactgag aggcttgtgc ccactgaatg gcaacctcag
661 tgcactagtg gactttgatc tgcggacccc aaccatcttc gataacaagt actatgtgaa
721 tctagaggag cagaaaggcc tgatacagag tgatcaagaa ctgtttagca gtccaaacgc
781 cactgacacc atcccactgg tgagaagttt tgctaactct actcaaacct tctttaacgc
841 cttcgtggaa gccatggacc gtatgggtaa cattacccct ctgacgggta cccaaggcca
901 gattcgtctg aactgcagag tggtcaacag caactcttaa taaggatccg aattc
//
LOCUS RK2KORC 1701 bp ds-DNA BCT 18-JUL-1990
DEFINITION Plasmid RK2 kil regulatory protein (korC) (complete cds), kleA
(5' end), and Tn1 bla protein (3' end) genes.
ACCESSION M32794
KEYWORDS Tn1 bla protein; kil regulatory protein; kil-kor regulon;
kleA protein; korC protein.
SOURCE Plasmid RK2 DNA, passed in E.coli MV10.
ORGANISM Plasmid RK2
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1701)
AUTHORS Kornacki,J.A., Burlage,R.S. and Figurski,D.H.
TITLE The kil-kor regulon of broad host-range Plasmid RK2: Nucleotide
sequence, polypeptide product and expression of regulatory gene
korC
JOURNAL J. Bacteriol. 172, 3040-3050 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by D.H.Figurski, 13-MAR-1990.
FEATURES from to/span description
pept < 1 320 Tn1 bla protein
pept 1169 1426 korC protein
pept 1652 > 1701 kleA protein
pept.ps 468 1146 interrupted ORFX
trns < 1 467 Tn1 remnant
mRNA 1153 1509 korC mRNA
mRNA 1637 > 1701 kleA mRNA
binding 1153 1161 korC ribosome binding site
binding 1637 1641 kleA ribosome binding site
signal 1568 1597 kleA promoter
signal 1509 1526 pot. transcription termination signal
site 1459 1471 KorB box
site 1552 1562 KorA box
site 1577 1593 KorC box
BASE COUNT 359 a 481 c 523 g 338 t
ORIGIN Map position 4.3'-6.0'.
1 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt
61 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct
121 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtggatctc
181 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca
241 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct
301 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt
361 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga
421 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccttt tatttccgaa
481 acgggctatc ggtcccatta cgaccatgcg cgcggcggca tgacggtgga ccaggtagcg
541 gacggggttt tgcgcgccct gttgcgctcg catcgccggt atctggacgc ccgcgaccag
601 gaccgccttg ccgacgagcc actgccggcg tggctcgcgg gtatcacgcc accgccgcgc
661 cgcgtgcggg ccgtggtcga ggactggcgg aagccggacg aactgccgcc gggcttcgcc
721 tgggttgatg ccgtgctgcc ggcgcaccag gcattcatcg cccgcaaatg ggcggccagc
781 gccaaggcga agctcgcggc cgcgcgtgcg aaagctcagg agccggccgg gcagcgcagg
841 gagccggtta caccggccaa gccggagccg gagccggcca aggacgagga cgcgccggcg
901 tggccggcga cgttctttcc tggcctgcgg tgcgagattg tgagcgtgca tcacccggtt
961 tttgccaagg aaatcgggaa gcacgtcatc attacgaaga tcagtcccga aacgcgacag
1021 gtgtgggcgc acgacgacaa gccgccgcgc tatcgcatca atcgcaacgg tagaaaggtt
1081 tgtgaatatg acccgcgttg catcgagtcg tgctacggct acgaccaatt gcgggccgcc
1141 atttgataac gattaggaga aaagcagcat gagcgacgtg aatatccggc ttgagtgcct
1201 gcgcccggcg gaacgctggg tgcagccgac cggcgcagaa atccgggaag tcttgcactt
1261 ggccggcctc accggcggac aggctgcgcg catcttgggc ttgggtgcca agggcgaccg
1321 cacggtgcgg cgttgggttg gcgaggattc gccgatcccc tatgccgcct gggcgatcct
1381 ttgcgatcta gcggggattg gggcgatctg gaaaggccag ggctgacgcc cgcaggactt
1441 tacgccaagg gagaggactt tagcggctaa aaccgccaac cctgatcgtt gcgatatgct
1501 gccagcagcc cggcccgcaa gccgggtttt ttcgtgcgcc cgccgcaaat tgtttagcta
1561 aatttccttg actatctagg gcataatgcc ctaatatagc aatccaaggc cgggcacttc
1621 gcccaggtca gcaaccggag gatcaacccc gatgaaaagc aagatcatgt cttggctcga
1681 tgagctgccc ggcgcggccg c
//
LOCUS STMSUAC 1380 bp ds-DNA BCT 18-JUL-1990
DEFINITION S.griseolus herbicide-inducible cytochrome P-450-SU1 (suaC) gene,
complete cds.
ACCESSION M32238
KEYWORDS cytochrome P-450-SU1.
SOURCE S.griseolus (strain ATCC 11796) DNA, subclone pCAO302.
ORGANISM Streptomyces griseolus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 1380)
AUTHORS Omer,C.A., Lenstra,R., Litle,P.J., Dean,C., Tepperman,J.M.,
Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
TITLE Genes for two herbicide-inducible cytochromes P-450 from
Streptomyces griseolus
JOURNAL J. Bacteriol. 172, 3335-3345 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Omer, 23-FEB-1990.
FEATURES from to/span description
pept 128 1348 cytochrome P-450-SU1 (suaC)
binding 117 120 ribosomal binding site
BASE COUNT 236 a 503 c 445 g 196 t
ORIGIN
1 gcggacaggg ggactcctga agatgtctga tagaggccgt tgcgttctct acgggggcaa
61 gtctatgctc cgaaatagag aacatggcgt tctttaaagg tgagaattct tgaatcggag
121 tggaccgatg accgataccg ccacgacgcc ccagaccacg gacgcacccg ccttcccgag
181 caaccggagc tgtccctacc agttaccgga cggctacgcc cagctccggg acacccccgg
241 ccccctgcac cgggtgacgc tctacgacgg ccgtcaggcg tgggtggtga ccaagcacga
301 ggccgcgcgc aaactgctcg gcgacccccg gctgtcctcc aaccggacgg acgacaactt
361 ccccgccacg tcaccgcgct tcgaggccgt ccgggagagc ccgcaggcgt tcatcggcct
421 ggacccgccc gagcacggca cccggcggcg gatgacgatc agcgagttca ccgtcaagcg
481 gatcaagggc atgcgccccg aggtcgagga ggtggtgcac ggcttcctcg acgagatgct
541 ggccgccggc ccgaccgccg acctggtcag tcagttcgcg ctgccggtgc cctccatggt
601 gatctgccga ctcctcggcg tgccctacgc cgaccacgag ttcttccagg acgcgagcaa
661 gcggctggtg cagtccacgg acgcgcagag cgcgctcacc gcgcggaacg acctcgcggg
721 ttacctggac ggcctcatca cccagttcca gaccgaaccg ggcgcgggcc tggtgggcgc
781 tctggtcgcc gaccagctgg ccaacggcga gatcgaccgt gaggaactga tctccaccgc
841 gatgctgctc ctcatcgccg gccacgagac cacggcctcg atgacctccc tcagcgtgat
901 caccctgctg gaccaccccg agcagtacgc cgccctgcgc gccgaccgca gcctcgtgcc
961 cggcgcggtg gaggaactgc tccgctacct cgccatcgcc gacatcgcgg gcggccgcgt
1021 cgccacggcg gacatcgagg tcgaggggca cctcatccgg gccggcgagg gcgtgatcgt
1081 cgtcaactcg atagccaacc gggacggcac ggtgtacgag gacccggacg ccctcgacat
1141 ccaccgctcc gcgcgccacc acctcgcctt cggcttcggc gtgcaccagt gcctgggcca
1201 gaacctcgcc cggctggagc tggaggtcat cctcaacgcc ctcatggacc gcgtcccgac
1261 gctgcgactg gccgtccccg tcgagcagtt ggtgctgcgg ccgggtacga cgatccaggg
1321 cgtcaacgaa ctcccggtca cctggtgacg ggggagaggg gcaaggacat gaccatgcgg
//
LOCUS STMSUBC 1441 bp ds-DNA BCT 18-JUL-1990
DEFINITION S.griseolus cytochrome P-450-SU2 (subC) gene, complete cds.
ACCESSION M32239
KEYWORDS cytochrome P-450-SU2.
SOURCE S.griseolus (strain ATCC 11796) DNA, subclone pCAO304.
ORGANISM Streptomyces griseolus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 1441)
AUTHORS Omer,C.A., Lenstra,R., Litle,P.J., Dean,C., Tepperman,J.M.,
Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
TITLE Genes for two herbicide-inducible cytochromes P-450 from
Streptomyces griseolus
JOURNAL J. Bacteriol. 172, 3335-3345 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.A.Omer, 23-FEB-1990.
FEATURES from to/span description
pept 195 1406 cytochrome P-450-SU2 (subC)
binding 182 186 ribosomal binding site
BASE COUNT 207 a 605 c 432 g 197 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccggcc accgcccgac ccgtccgcac tccgccccgc cgaccgtcgt ccatccgccc
61 ctgcggccat gcggtttgag ccaacctcgg tgctgccgcg atctgccctt ccctcccccg
121 ccgggcctgc gttagcgtga cgacatctta attacctaag ttaggtaatt agctcacgcg
181 gaaggaccgg ccgcatgacg accgcagaac gcaccgctcc ccccgacgcc ctcaccgtcc
241 cggccagccg cgcccccggc tgccccttcg accccgcgcc cgacgtcacc gaggcggccc
301 gcaccgaacc ggtcacccgg gccaccctct gggacggctc ctcctgctgg ctggtgacgc
361 gccatcagga cgtccgcgcg gtcctcggcg acccgcgctt cagcgccgac gcccaccgca
421 ccggcttccc cttcctgacc gccggcggcc gcgagatcat cggcaccaac ccgaccttcc
481 tgcgcatgga cgacccggag cacgcccgac tgcgccggat gctcaccgcc gacttcatcg
541 tcaagaaggt cgaggcgatg cgccccgagg tgcagcgcct cgccgacgac ctggtcgacc
601 ggatgaccac cggacgcacc tccgccgacc tggtcaccga gttcgcgctg ccgctgccgt
661 ccctggtgat ctgcctgctg ctcggcgtcc cctacgagga ccacgcgttc ttccaggagc
721 gcagccgggt cctgctcacc ctgcggtcca ctcccgagga agtccgggcc gcccaggacg
781 agttgctgga gtacctcgcc cggctcgccc ggaccaagcg ggagcggccg gacgacgcca
841 tcatcagccg cctggtcgcc cgcggcgagc tcgacgacac ccagatcgcc accatgggac
901 gcctgttgct ggtcgccggc cacgagacga ccgccaacat gaccgcgctc tccaccctcg
961 tgctgctgcg caaccccgac caactcgccc ggctgcgcgc cgaacccgcg ctcgtcaagg
1021 gcgccgtcga ggagctgctg cgctacctga cgatcgtgca caacggcgtt ccccggatcg
1081 ccaccgagga cgtgctcatc ggcggccgca ccatcgccgc cggcgagggc gtcctgtgca
1141 tgatcagctc cgccaaccgg gacgccgagg tgttccccgg cggcgacgac ctcgacgtgg
1201 cccgcgacgc ccgccgccac gtggccttcg gcttcggcgt ccaccagtgc ctgggacagc
1261 cgttggccag ggtggagctc cagatcgcca tcgaaacgct gctgcgccgc ctgccggacc
1321 tgcggctggc cgtgccccac gaggagatcc cgttccgcgg cgacatggcg atctacgggg
1381 tccactcgct gccgatcgcc tggtagcccg ggcgccccca ccaccgacca ccacgcaccc
1441 a
//GenBank-Updates@genbank.bio.net (07/19/90)
LOCUS HUMPPPB1A 3215 bp ss-mRNA PRI 19-JUL-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) mRNA, complete
cds.
ACCESSION M33689
KEYWORDS protein phosphotyrosyl phosphatase.
SOURCE Human placenta, cDNA to mRNA, (library of Clontech), clone
lambda-16-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3215)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept 73 1380 protein phosphotyrosyl phosphatase 1B (EC
3.1.3.48)
BASE COUNT 818 a 828 c 801 g 768 t
ORIGIN Chromosome 20q13.1-q13.2.
1 gcgcgacgcg gcctagagcg gcagacggcg cagtgggccg agaaggaggc gcagcagccg
61 ccctggcccg tcatggagat ggaaaaggag ttcgagcaga tcgacaagtc cgggagctgg
121 gcggccattt accaggatat ccgacatgaa gccagtgact tcccatgtag agtggccaag
181 cttcctaaga acaaaaaccg aaataggtac agagacgtca gtccctttga ccatagtcgg
241 attaaactac atcaagaaga taatgactat atcaacgcta gtttgataaa aatggaagaa
301 gcccaaagga gttacattct tacccagggc cctttgccta acacatgcgg tcacttttgg
361 gagatggtgt gggagcagaa aagcaggggt gtcgtcatgc tcaacagagt gatggagaaa
421 ggttcgttaa aatgcgcaca atactggcca caaaaagaag aaaaagagat gatctttgaa
481 gacacaaatt tgaaattaac attgatctct gaagatatca agtcatatta tacagtgcga
541 cagctagaat tggaaaacct tacaacccaa gaaactcgag agatcttaca tttccactat
601 accacatggc ctgactttgg agtccctgaa tcaccagcct cattcttgaa ctttcttttc
661 aaagtccgag agtcagggtc actcagcccg gagcacgggc ccgttgtggt gcactgcagt
721 gcaggcatcg gcaggtctgg aaccttctgt ctggctgata cctgcctctt gctgatggac
781 aagaggaaag acccttcttc cgttgatatc aagaaagtgc tgttagaaat gaggaagttt
841 cggatggggc tgatccagac agccgaccag ctgcgcttct cctacctggc tgtgatcgaa
901 ggtgccaaat tcatcatggg ggactcttcc gtgcaggatc agtggaagga gctttcccac
961 gaggacctgg agcccccacc cgagcatatc cccccacctc cccggccacc caaacgaatc
1021 ctggagccac acaatgggaa atgcagggag ttcttcccaa atcaccagtg ggtgaaggaa
1081 gagacccagg aggataaaga ctgccccatc aaggaagaaa aaggaagccc cttaaatgcc
1141 gcaccctacg gcatcgaaag catgagtcaa gacactgaag ttagaagtcg ggtcgtgggg
1201 ggaagtcttc gaggtgccca ggctgcctcc ccagccaaag gggagccgtc actgcccgag
1261 aaggacgagg accatgcact gagttactgg aagcccttcc tggtcaacat gtgcgtggct
1321 acggtcctca cggccggcgc ttacctctgc tacaggttcc tgttcaacag caacacatag
1381 cctgaccctc ctccactcca cctccaccca ctgtccgcct ctgcccgcag agcccacgcc
1441 cgactagcag gcatgccgcg gtaggtaagg gccgccggac cgcgtagaga gccgggcccc
1501 ggacggacgt tggttctgca ctaaaaccca tcttccccgg atgtgtgtct cacccctcat
1561 ccttttactt tttgcccctt ccactttgag taccaaatcc acaagccatt ttttgaggag
1621 agtgaaagag agtaccatgc tggcggcgca gagggaaggg gcctacaccc gtcttggggc
1681 tcgccccacc cagggctccc tcctggagca tcccaggcgg gcggcacgcc agacagcccc
1741 ccccttgaat ctgcagggag caactctcca ctccatattt atttaaacaa ttttttcccc
1801 aaaggcatcc atagtgcact agcattttct tgaaccaata atgtattaaa attttttgat
1861 gtcagccttg catcaagggc tttatcaaaa agtacaataa taaatcctca ggtagtactg
1921 ggaatggaag gctttgccat gggcctgctg cgtcagacca gtactgggaa ggaggacggt
1981 tgtaagcagt tgttatttag tgatattgtg ggtaacgtga gaagatagaa caatgctata
2041 atatataatg aacacgtggg tatttaataa gaaacatgat gtgagattac tttgtcccgc
2101 ttattctgct ccctgttatc tgctagatct agttctcaat cactgctccc ccgtgtgtat
2161 tagaatgcat gtaaggtctt cttgtgtcct gatgaaaaat atgtgcttga aatgagaaac
2221 tttgatctct gcttactaat gtgccccatg tccaagtcca acctgcctgt gcatgacctg
2281 atcattacat ggctgtggtt cctaagcctg ttgctgaagt cattgtcgct cagcaatagg
2341 gtgcagtttt ccaggaatag gcatttgcct aattcctggc atgacactct agtgacttcc
2401 tggtgaggcc cagcctgtcc tggtacagca gggtcttgct gtaactcaga cattccaagg
2461 gtatgggaag ccatattcac acctcacgct ctggacatga tttagggaag cagggacacc
2521 ccccgccccc cacctttggg atcagcctcc gccattccaa gtcgacactc ttcttgagca
2581 gaccgtgatt tggaagagag gcacctgctg gaaaccacac ttcttgaaac agcctgggtg
2641 acggtccttt aggcagcctg ccgccgtctc tgtcccggtt caccttgccg agagaggcgc
2701 gtctgcccca ccctcaaacc ctgtggggcc tgatggtgct cacgactctt cctgcaaagg
2761 gaactgaaga cctccacatt aagtggcttt ttaacatgaa aaacacggca gctgtagctc
2821 ccgagctact ctcttgccag cattttcaca ttttgccttt ctcgtggtag aagccagtac
2881 agagaaattc tgtggtggga acattcgagg tgtcaccctg cagagctatg gtgaggtgtg
2941 gataaggctt aggtgccagg ctgtaagcat tctgagctgg cttgttgttt ttaagtcctg
3001 tatatgtatg tagtagtttg ggtgtgtata tatagtagca tttcaaaatg gacgtactgg
3061 tttaacctcc tatccttgga gagcagctgg ctctccacct tgttacacat tatgttagag
3121 aggtagcgag ctgctctgct atgtccttaa gccaatattt actcatcagg tcattatttt
3181 ttacaatggc catggaataa accattttta caaaa
//
LOCUS HUMPPPB1A1 276 bp ds-DNA PRI 19-JUL-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x.
ACCESSION M33688
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 1 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 276)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept / 34 + 243 protein phosphotyrosyl phosphatase 1B, exon x
(EC 3.1.3.48) (AA at 34)
pre-msg < 1 > 276 PTP1B mRNA and introns
IVS < 1 33 PTP1B intron x-1
IVS 244 > 276 PTP1B intron x
BASE COUNT 56 a 77 c 69 g 74 t
ORIGIN Chromosome 20q13.1-q13.2.
1 ctttagaatc tactagatga ttttctcttt cagacccaag aaactcgaga gatcttacat
61 ttccactata ccacatggcc tgactttgga gtccctgaat caccagcctc attcttgaac
121 tttcttttca aagtccgaga gtcagggtca ctcagcccgg agcacgggcc cgttgtggtg
181 cactgcagtg caggcatcgg caggtctgga accttctgtc tggctgatac ctgcctcttg
241 ctggtaagga ggcctcgcgg gtgccctggg gagctc
//
LOCUS HUMPPPB1A2 453 bp ds-DNA PRI 19-JUL-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+1.
ACCESSION M33687
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 2 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 453)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept + 236 + 397 protein phosphotyrosyl phosphatase 1B, exon
x+1 (EC 3.1.3.48)
pre-msg < 1 > 453 PTP1B mRNA and introns
IVS < 1 235 PTP1B intron x
IVS 398 > 453 PTP1B intron x+1
BASE COUNT 104 a 118 c 111 g 120 t
ORIGIN Chromosome 20q13.1-q13.2.
1 ggggaggtcc cagactctta accagatctc ttgtgaatgc attgcctcag ggaggcacca
61 agcctttcat gaggacctgt ccccctgacc cagacacctc ccacccagcc ccacctccaa
121 cactagggat cacatttcag catgagattg ggaggggaca gacatctaac ggtgttatta
181 acgttgccct tgagaattgg acctggctga cttatatctc ctctctggct ttcagatgga
241 caagaggaaa gacccttctt ccgttgatat caagaaagtg ctgttagaaa tgaggaagtt
301 tcggatgggg ctgatccaga cagccgacca gctgcgcttc tcctacctgg ctgtgatcga
361 aggtgccaaa ttcatcatgg gggactcttc cgtgcaggtc agcattgcct ttgtttgaat
421 ccaggtgtga ccattttaac ttttttgtct ttg
//
LOCUS HUMPPPB1A3 426 bp ds-DNA PRI 19-JUL-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+2.
ACCESSION M33686
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 3 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 426)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept + 46 + 269 protein phosphotyrosyl phosphatase 1B, exon
x+2 (EC 3.1.3.48)
pre-msg < 1 > 426 PTP1B mRNA and introns
IVS < 1 45 PTP1B intron x+1
IVS 270 > 426 PTP1B intron x+2
BASE COUNT 115 a 122 c 109 g 80 t
ORIGIN Chromosome 20q13.1-q13.2.
1 gaagtgaaca ctaatagact tccttcctct tgctgctctt tcaaggatca gtggaaggag
61 ctttcccacg aggacctgga gcccccaccc gagcatatcc ccccacctcc ccggccaccc
121 aaacgaatcc tggagccaca caatgggaaa tgcagggagt tcttcccaaa tcaccagtgg
181 gtgaaggaag agacccagga ggataaagac tgccccatca aggaagaaaa aggaagcccc
241 ttaaatgccg caccctacgg catcgaaagg taatatattg ggtccagctt gttggggtga
301 ggggaaatga cttctgttct agaaacacac gctggtactg aaaccctgtg atgcagcctc
361 tgttggcaag cagcgcttcg catccttggg aacagggcgc tggaccaaca cccactccac
421 tggtgg
//
LOCUS HUMPPPB1A4 732 bp ds-DNA PRI 19-JUL-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+3.
ACCESSION M33685
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 4 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 732)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept + 402 + 597 protein phosphotyrosyl phosphatase 1B, exon
x+3 (EC 3.1.3.48)
pre-msg < 1 > 732 PTP1B mRNA and introns
IVS < 1 401 PTP1B intron x+2
IVS 598 > 732 PTP1B intron x+3
BASE COUNT 164 a 193 c 205 g 170 t
ORIGIN Chromosome 20q13.1-q13.2.
1 tctgtagctc taaagaatga gatctggtgt actgatgtgg ccagacattg caattgcagt
61 acatgagaag gcaaatcata cagtagtgtg tacaccagtg agtcctccag ccagataaat
121 cctcacagtg accagtcgcc caggcacctt gtgaacccta ccctgggtgt gggtgctatc
181 tgaagtacct gggggagggg gtgacaagtg gacttcaggc tgatgtggcc ctggcctggc
241 cctccctcca agcagagggg gctggcacgc tggaaggtta acatcatcca actctgtcta
301 cacgtggctt gttttttcct agaattcctg ccacaatagc agcatccttg ccattcattt
361 tctccaaagt gagtacccat ctctgccctc tgattcctca gcatgagtca agacactgaa
421 gttagaagtc gggtcgtggg gggaagtctt cgaggtgccc aggctgcctc cccagccaaa
481 ggggagccgt cactgcccga gaaggacgag gaccatgcac tgagttactg gaagcccttc
541 ctggtcaaca tgtgcgtggc tacggtcctc acggccggcg cttacctctg ctacagggta
601 tgtttccact gacagacgcg ctgggcagat gctcgtgtgc agagagcact ggccgctagc
661 ccgatggtag gattcagttc tgtggtgcat ctgagccagt ctcagaagaa acagatcaag
721 gttttaagtc tg
//
LOCUS HUMPPPB1A5 365 bp ds-DNA PRI 19-JUL-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+4.
ACCESSION M33684
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 5 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 365)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept + 266 289 protein phosphotyrosyl phosphatase 1B, exon
x+4 (EC 3.1.3.48)
pre-msg < 1 > 365 PTP1B mRNA and introns
IVS < 1 265 PTP1B intron x+3
BASE COUNT 80 a 101 c 88 g 96 t
ORIGIN Chromosome 20q13.1-q13.2.
1 tacctcctaa gacttttacg gttttaaata ttttacctct ttccaggtgg catctgagta
61 catcagatgg ttttgcaaaa tgcaaacaat tttttccttg gggatgattt ttggggagag
121 ggggctactg taaaaaataa aaccaaaacc ccctttgctc cctcggaggt tgaagttgcc
181 ggggggtgtg gccggggtca tgcatgaggc gacagcactg caggtgcggg tctgggctca
241 tctgaactgt ttggtttcat tccagttcct gttcaacagc aacacatagc ctgaccctcc
301 tccactccac ctccacccac tgtccgcctc tgcccgcaga gcccacgccc gactagcagg
361 catgc
//
LOCUS HB3HBLA 1319 bp ds-DNA PHG 19-JUL-1990
DEFINITION Bacteriophage HB-3 amidase (hbl) gene, complete cds.
ACCESSION M34652
KEYWORDS amidase.
SOURCE Bacteriophage HB-3 (host Streptococcus pneumoniae) DNA.
ORGANISM Bacteriophage HB-3
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 1319)
AUTHORS Romero,A., Lopez,R. and Garcia,P.
TITLE Sequence of the Streptococcus pneumoniae bacteriophage HB-3 amidase
reveals high homology with the major host autolysin
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Romero, 25-MAY-1990.
Consejo superior de Investigaciones Clentificas
Centro de Investigaciones Biologicas
Velazquez 144 Madrid, 28006 SPAIN
FEATURES from to/span description
pept 298 1254 amidase (hbl)
binding 283 289 ribosomal binding site (put.)
site 1285 1305 transcription stop sequence
BASE COUNT 412 a 268 c 334 g 305 t
ORIGIN
1 aagcttttta acagtagcag taggcggtat tgtaaaagca gtaaaagatt atcttttgcg
61 taaaggcgga gagaaagcgg tgatcatcgc tgaaattcta gctaaaatgc agttcatgcc
121 gttgagcaag tagcttcaga gactggctat aagggcgaag aaaagctgga gcaggctcgt
181 gctaaagtcc gtgctgagct tacaaaatac aatattagca tgactgacaa aaacttagac
241 accttcgtag agtcagcagt gaagcagatg aatgacgcat ggaaaggacg atagggaatg
301 gatatcgata gaaacagact acgtacaggc ttgccccagg ttggggtgca gccttatcga
361 caagtacatg ctcactcaac aggtaaccgc aactcaaccg tacagaatga agcggattat
421 cactggcgga aagacccaga attaggtttt ttctcgcacg ttgttgggaa ctttcgcatc
481 atgcaggtcg gacctgtgaa caacggaagt tgggatgttg ggggcggttg gaatgctgag
541 acctatgcag cggttgaact gattgaaagc cattcaacta aggaagagtt tatggctgac
601 tatcgcctct atatcgaatt gctacgcaat ctagcggacg aagcaggctt gccgaagact
661 cttgatacag acgacttggc aggtatcaag acgcatgaat actgtaccaa taaccaacca
721 aacaaccact cagaccatgt ggatccatat ccatatcttg caagttgggg cattagccgt
781 gaacagttta agcaagacat cgaaaacggc ttgagcgctg caacaggctg gcagaaaaat
841 ggcactggct actggtacgt acattcagac ggctcttatt caaaagataa gtttgagaaa
901 atcaacggta cctggtatta tttcgatggc tcaggctata tgctttcaga ccgctggaag
961 aagcacacag acggtaattg gtactacttt gaccaatcag gcgaaatggc cacaggctgg
1021 aagaaaatcg ctgacaagtg gtactatttt gatgtagaag gtgccatgaa gacaggctgg
1081 gtcaagtaca aggacacttg gtactactta gacgctaaag aaggcgccat ggtatcaaat
1141 gcctttatcc agtcagcgga cggaacaggc tggtactacc tcaaaccaga cggaacactg
1201 gcagacaagc cagagttcac agtagagcca gatggcttga ttacagttaa ataaatagaa
1261 aggaaacttt ctaaattgtt cttcaccgca gctcaggctt acggtttttt tgttttaaa
//
LOCUS FIBGLUC 1426 bp ds-DNA BCT 19-JUL-1990
DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene,
complete cds.
ACCESSION M33676
KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase.
SOURCE F.succinogenes (strain S85) DNA, clone PJI5.
ORGANISM Fibrobacter succinogenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1426)
AUTHORS Teather,R.M. and Erfle,J.D.
TITLE DNA sequence of a Fibrobacter succinogenes mixed linkage
beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene
JOURNAL J. Bacteriol. 172, 3837-3841 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.M.Teather, 11-APR-1990.
FEATURES from to/span description
pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase
precursor (EC 3.2.1.73)
sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase
signal peptide
matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase
binding 132 137 ribosome binding site
signal 62 66 -35 region
signal 85 90 -10 region
BASE COUNT 371 a 346 c 335 g 374 t
ORIGIN
1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt
61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt
121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc
181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt
241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg
301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc
361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc
421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct
481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac
541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac
601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt
661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat
721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat
781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt
841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc
901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa
961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct
1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac
1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt
1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg
1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt
1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa
1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt
1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa
//
LOCUS PP1BOFFO 931 bp ds-DNA PHG 19-JUL-1990
DEFINITION Bacteriophage P1 regulatory protein (bof) gene, complete cds.
ACCESSION M33224
KEYWORDS regulatory protein.
SOURCE Bacteriophage P1 viral DNA.
ORGANISM Bacteriophage P1
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 931)
AUTHORS Schaefer,T.S. and Hays,J.B.
TITLE The bof gene of bacteriophage P1. DNA sequence and evidence for
roles in regulation of phage c1 and ref genes
JOURNAL J. Bacteriol. 172, 3269-3277 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.B.Hayes, 26-MAR-1990.
FEATURES from to/span description
pept > 544 789 Bof regulatory protein
site 541 543 potential ttg start codon for Bof
BASE COUNT 260 a 211 c 243 g 217 t
ORIGIN Map position 9-10.
1 gggtaactgg tggattatcg agacaaaaca caacgtggcg gacgttctgg ccgtcatcca
61 acaatacgca taacaggagc gcccggttcg cgctgcgcat aatatggcca cactatctga
121 tacaataaaa ccgaataaaa catatcttga ggcggtactg cgtacggcat tattaggaaa
181 gacagaagac gaatacgttg atttcttcct gtcagggcta cgcgggcgat tactgaaaaa
241 tccccgcctg taccgcagct atggcccata ctggcggaaa ttaaaaaatt attactggag
301 cgacggttat ggtaatttcg gtcgtctcgt tgaccgtgac gttcgcaaat tttaccgtta
361 tgaccgcccg gcgctaacac tcatagccgc gacgctctac agccatgagc gttttgataa
421 tggtcagata tactcagcct ggcatttact gccagtccct gaagaagttg acgaccagga
481 ctatgagttt gagtcttacg atttggaagt tgaagccttg gcacaggctg gagagaaaac
541 ttgaaaaagc gatactacac agtaaagcat gggacgctac gagcattaca agagtttgct
601 gacaagcata acgttgaggt gcgcagggaa gggggaagta aagctctgcg catgtaccgt
661 ccggacggga aatggcggac ggtcgtcgat ttcaaaacaa acagtgttcc ccagggcgtc
721 cgtgaccggg cattcgaaga atgggagcag atcatcatag ataatgcatt gcttctcaat
781 gcggattaaa cttccccaaa ttagggctgt ttgctcaccg agcatcgctc aaagaagcac
841 gattcttcaa acatatagat agtgatagtg ccacaacttc tggctctaac gggctgggga
901 ggcggcgctt tgttgctaaa tgatctggtt t
//
LOCUS STRTEE6 2508 bp ds-DNA BCT 19-JUL-1990
DEFINITION S.pyogenes trypsin-resistant surface T6 protein (tee6) gene,
complete cds.
ACCESSION M32978
KEYWORDS surface protein; trypsin-resistant surface T6 protein.
SOURCE S.pyogenes (strain D471, sub-species M-type 6) DNA.
ORGANISM Streptococcus pyogenes
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Streptococcaceae.
REFERENCE 1 (bases 1 to 2508)
AUTHORS Schneewind,O., Jones,K.F. and Fischetti,V.A.
TITLE Sequence and structural characteristics of the trypsin-resistant T6
surface protein of group A streptococci
JOURNAL J. Bacteriol. 172, 3310-3317 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.Schneewind, 18-MAR-1990.
FEATURES from to/span description
pept 719 2332 trypsin-resistant surface T6 protein (tee6)
precursor
sigp 719 784 trypsin-resistant surface T6 protein signal
peptide
matp 785 2329 trypsin-resistant surface T6 protein
binding 706 709 ribosome binding site
signal 2353 2382 transcription terminator
BASE COUNT 929 a 364 c 480 g 735 t
ORIGIN
1 aagcttcaga tgaagcctat gagaagtata aggataacga aggaagatat agcgaaatgg
61 gagattccga tactgattat ggaaccaacc aaactagttc tggaaaaggt ggtttgcctt
121 ctaattcaga tgcttcggtt aattatatgg cagatggtcg tgaacagaaa ttaccttata
181 agcacccagt gattcaggtc aaaacagtac caatcacgtt taccaaagta gatgctgaca
241 acaaccagaa aaaacttgca ggtgttgagt ttgaactccg taaagaggac aagaagatcg
301 tctgggaaaa gggaacaaca ggttcaaatg gccaactcaa ctttaagtac cttcaaaaag
361 gcaaaaccta ttatctgtat gagacgaagg caaaacttgg atacactctt ccagaaaatc
421 catgggaagt tgccgttgct aacaacggtg atataaaagt aaaacacccg attgaaggtg
481 aattgaagtc aaaagatggc tcttacatga ttaaaaatta taagatttat cagttgccat
541 cgtctggggg aagaggaagt caaattttca ttatagttgg tagcatgaca gcaactgtag
601 cattattatt ttatagacgc caacacagga aaaagcaata ttaaattaat gatcatattt
661 attgacaaac aggagagaaa cagtgagaga gaagatatta ataacagcaa aaaaactaat
721 gctagcttgt ttagctatct tagcggtagt agggcttgga atgacaagag tatcagcttt
781 atcaaaagat gatactgcac aactaaagat aacaaatatt gaaggtgggc caacagtaac
841 actttataaa ataggagaag gtgtttacaa cactaatggt gattctttta ttaactttaa
901 atatgctgag ggggtttctt taactgaaac aggacctaca tcacaagaaa ttactactat
961 tgcaaatggt attaatacgg gtaaaataaa gccttttagt actgaaaacg ttagtatttc
1021 taatggaaca gcaacttata atgcgagagg tgcatctgtt tatattgcat tattaacagg
1081 tgcgacagat ggccgtacct acaatcctat tttattagct gcatcttata atggtgaggg
1141 aaatttagtt actaaaaata ttgattccaa atctaattat ttatatggac aaacaagtgt
1201 tgcaaaatca tcattaccat ctattacaaa gaaagtaacc gggacaatag atgacgtgaa
1261 taaaaagact acctcgttag gaagtgtatt gtcttattcg ctgacatttg aattaccaag
1321 ttataccaaa gaagcagtca ataaaacagt atatgtttct gataatatgt cggaaggtct
1381 tacttttaac tttaatagtc ttacagtaga atggaaaggt aagatggcta atattactga
1441 agatggttca gtaatggtag aaaatacaaa aatcggaata gctaaggagg ttaataacgg
1501 ttttaattta agttttattt atgatagttt agaatctata tcaccaaata taagttataa
1561 agctgttgta aacaataaag ctattgttgg tgaagagggt aatcctaata aagctgaatt
1621 cttctattca aataatccaa caaaaggtaa tacatacgat aatttagata ggaagcctga
1681 taaagggaat ggtattacat ccaaagaaga ttctaaaatt gtttatactt atcaaatagc
1741 gtttagaaaa gttgatagtg ttagtaagac cccacttatt ggtgcaattt ttggagttta
1801 tgatactagt aataaattaa ttgatattgt tacaaccaat aaaaatggat atgctatttc
1861 aacacaagta tcttcaggaa aatataaaat taaggaatta aaagctccta aaggttattc
1921 attgaataca gaaacttatg aaattacggc aaattgggta actgctacag tcaagacaag
1981 tgctaattca aaaagtacta cttatacatc tgataaaaat aaggcgacag ataattcaga
2041 gcaagtagga tggttaaaaa atggtatatt ctattctata gatagtagac ctacaggaaa
2101 tgatgttaaa gaggcttata ttgaatctac taaggcttta actgatggaa caactttctc
2161 aaaatcgaat gaaggttcag gtacagtatt attagaaact gacatcccta acaccaagct
2221 aggtgaatta ccttcgacag gtagcattgg tacttacctc tttaaagcta ttggttcggc
2281 tgctatgatt ggtgcaattg gtatttatat tgttaaacgt cgtaaagctt aatgctttca
2341 aaagtcgaaa tcaatcgaga ctgtctttat gcggtctcga tttttaatga taaggaactg
2401 ctatgacaga aagactaaaa aatctaggga tactcttatt atttttattg ggaacagcca
2461 tttttcttta ccctacgcta agtagtcagt ggaatgccta tcgtgatc
//
LOCUS HALHPA 1317 bp ds-DNA BCT 19-JUL-1990
DEFINITION H.volcanii histidinol-phosphate-aminotransferase (hisC) gene,
complete cds.
ACCESSION M33161
KEYWORDS histidinol-phosphate-aminotransferase.
SOURCE H.volcanii (strain DSM 3757) cell line WFD 18 DNA, clone 477.
ORGANISM Halobacterium volcanii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 1317)
AUTHORS Conover,R.K. and Doolittle,W.F.
TITLE Characterization of a gene involved in histidine biosynthesis in
Halobacterium (Haloferax) volcanii: Isolation and rapid mapping by
transformation of an auxotroph with cosmid DNA
JOURNAL J. Bacteriol. 172, 3244-3249 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.K.Conover, 22-MAR-1990.
FEATURES from to/span description
pept 121 1206 histidinol-phosphate-aminotransferase (hisC)
(EC 2.6.1.9)
BASE COUNT 234 a 504 c 410 g 169 t
ORIGIN
1 agtcgttcgg gcggcgctcg gctgacggcc gtcggtcgtc gcgtccccaa cccgaccccc
61 taccgccacg tccgacccgg agtacgcacc cttaagaacc gcgacccgca ttttccgacc
121 atgcaaccac gggacctctc cgcgcacgct ccctacgtac ccggccgcgg gacagaggag
181 gtcgcccgcg aactcggaat ggaccccgag gacctgacga aactctcctc gaacgagaac
241 ccccacggcc cgagtccgaa ggcggtcgcc gccatcgaag acgccgcgcc gaccgtgagc
301 gtctacccga agaccgccca cacggacctg accgaacgcc tcgccgacaa gtggggcctc
361 gcacccgaac aggtgtgggt gtctcccggc gcggacggct ctatcgacta cctgacccgc
421 gcggtgctcg aaccggacga ccggattctc gaacccgcgc ccggcttttc gtactactcg
481 atgagcgccc gctaccacca cggcgacgcc gtccagtacg aggtgtcgaa ggacgacgac
541 ttcgaacaga ccgccgacct cgtcctcgac gcctacgacg gcgagcgcat ggtctacctc
601 acaacgccgc acaaccccac cggttccgtg ctcccgcggg aggaactcgt cgaactggcc
661 gagtcggtcg aagagcacac gctcctcgtc gtcgacgagg cctacggcga gttcgccgag
721 gagccgtcgg ccatcgacct cttgtcggag tacgacaacg tcgcggccct gcggacgttc
781 tcgaaggcgt acgggctggc cggcctccgc atcggctacg cctgcgtgcc cgaggcgtgg
841 gccgacgcct acgcccgcgt gaacacgccg ttcgccgcca gcgaggtcgc ctgccgcgcc
901 gcgctcgccg cgctcgacga cgaggaacac gtcgagaaat ccgtcgagtc ggcccggtgg
961 tcccgcgact atctccgcga acacctcgac gcgccgacgt gggaaagcga gggcaacttc
1021 gtcctcgtcg aggtcggcga cgccacggcc gtcaccgagg ccgcccagcg cgagggcgtc
1081 atcgtccgcg actgcgggag cttcggcctg ccggagtgca tccgcgtctc ctgcggcacg
1141 gaaacccaga ccaagcgcgc cgtggacgtg ctcaaccgca tcgtctcgga ggtgccgacg
1201 gcgtgagaga cgacgacacc ggcacgcccg gcaccggaaa gaccacggcg accgagccgg
1261 tcgccgccga cctcgacctc gacgtggtcc acctcaaccg actcgtgaaa gacgagg
//
LOCUS BOVGOA 472 bp ss-mRNA MAM 19-JUL-1990
DEFINITION B.taurus go-alpha mRNA, 3' end.
ACCESSION J02900
KEYWORDS go-alpha.
SOURCE B.taurus retina, cDNA to mRNA, clone GO3.1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (sites)
AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M.,
Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M.
TITLE Multiple forms of go-alpha mRNA: Analysis of the 3'-untranslated
regions
JOURNAL Biochemistry 29, 5069-5076 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 472)
AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M.,
Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by S.R.Price, 12-JUN-1990.
FEATURES from to/span description
pept < 1 3 go-alpha (AA at 1)
BASE COUNT 130 a 133 c 88 g 121 t
ORIGIN
1 tgacctcttg tcctgtatag caacctattt ggtaatgatt ccagcactca cagaaaagct
61 tgcacacata cacacacacc ccacccctcc ccactaacaa atgcaagttg gtaaacaaat
121 tccaaaaagg cataacaaac cttatatata tagacaaata tatattaaag ttttttagtc
181 tgtactagaa agagcttcag acagaactga ccaccattcc attgctcatc aatttcctgg
241 gacagcacct gagcgtgcgc ttacgcgcgt acacacacat agacacgcac tgcgatacaa
301 gtcctgattt gggagtccgt ccttttaaaa acagccacat gctttcacgc tctgagaccc
361 acccgtttct gtgagcaggg ggagggcaag gaaagccctg gcctcagtcc agccttttct
421 ctgcttccac ctgctcaggc tgtgtgctct tggttctgtc ctgcacttgt gt
//
LOCUS CAJCAT 1334 bp ds-DNA BCT 19-JUL-1990
DEFINITION C.coli plasmid C-589 chloramphenicol acetyltransferase (cat) gene,
complete cds.
ACCESSION M35190
KEYWORDS chloramphenicol acetyltransferase.
SOURCE C.coli plasmid C-589 DNA.
ORGANISM Campylobacter coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 1 to 1334)
AUTHORS Wang,Y. and Taylor,D.E.
TITLE Chloramphenicol resistance in Campylobacter coli, nucleotide
sequence, expression and cloning vector construction
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Taylor, 15-JUN-1990.
FEATURES from to/span description
pept 309 932 chloramphenicol acetyltransferase (cat)
mRNA 277 > 932 chloramphenicol acetyltransferase mRNA
signal 242 271 promoter
binding 297 301 ribosome binding site
signal 960 1006 transcriptional termination signal
BASE COUNT 433 a 232 c 282 g 387 t
ORIGIN
1 attcccacaa cgccggaaac aagccgtgcc acgagcttat aataaaagag ggaagagaag
61 cgtatttttc ctcacttccg gtgaaggata tcgagaaaaa tctaaatgat aacggaattc
121 cgtcgtcggt atcgtatgga gcggacaacg agtaaaagag tgaccgccga gataacccat
181 tgctcggcgg tgttcctttc caagttaatt gcgtgatata gattgaaaag tggatagatt
241 tatgatatag tggatagatt tatgatataa tgagttatca acaaatcgga atttacggag
301 gataaatgat gcaattcaca aagattgata taaataattg gacacgaaaa gagtatttcg
361 accactattt tggcaatacg ccctgcacat atagtatgac ggtaaaactc gatatttcta
421 agttgaaaaa ggatggaaaa aagttatacc caactctttt atatggagtt acaacgatca
481 tcaatcgaca tgaagagttc aggaccgcat tagatgaaaa cggacaggta ggcgtttttt
541 cagaaatgct gccttgctac acagtttttc ataaggaaac tgaaaccttt tcgagtattt
601 ggactgagtt tacagcagac tatactgagt ttcttcagaa ctatcaaaag gatatagacg
661 cttttggtga acgaatggga atgtccgcaa agcctaatcc tccggaaaac actttccctg
721 tttctatgat accgtggaca agctttgaag gctttaactt aaatctaaaa aaaggatatg
781 actatctact gccgatattt acgtttggga agtattatga ggagggcgga aaatactata
841 ttcccttatc gattcaagtg catcatgccg tttgtgacgg ctttcatgtt tgccgttttt
901 tggatgaatt acaagacttg ctgaataaat aaaatcccag tttgtcgcac tgataaaaac
961 cctttaggaa ctaaagggcg cacttctata ctctctgtcg agagtagtgc gtcctgcgga
1021 gcttcattcc cggtcagcgc gcttatcaat atatctatag aatgggcaaa gcataaaaac
1081 ttgcatggac taatgcttga aacccaggac aataacctta tagcttgtaa attctatcat
1141 aattgtggtt tcaaaatcgg ctccgtcgat actatgttat acgccaactt tgaaaacaac
1201 tttgaaaaag ctgttttctg gtatttaagg ttttagaatg caaggaacag tgaattggag
1261 ttcgtcttgt tattaattag cttcttgggg tatctttaaa tactgtagaa agaggaagga
1321 aataataaat ggct
//
LOCUS CLOCBA 5120 bp ds-DNA BCT 19-JUL-1990
DEFINITION C.acetobutylicum beta-D-galactosidase (cbgA) and
beta-D-galactosidase regulatory protein (cbgR) genes, complete cds.
ACCESSION M35107
KEYWORDS beta-D-galactosidase; beta-D-galactosidase regulatory protein.
SOURCE C.acetobutylicum (strain NCIB2951) DNA.
ORGANISM Clostridium acetobutylicum
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1260 to 5120)
AUTHORS Hancock,K.R., Rockman,E., Pearce,L., Maddox,I.S. and Scott,D.B.
TITLE Clostridium acetobutylicum beta-galactosidase gene, cbgA, is
positively regulated in Escherichia coli by a novel regulatory
gene, cbgR
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 5120)
AUTHORS Scott,D.B., Hancock,K.R., Pearce,L. and Maddox,I.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by D.B.Scott, 11-JUN-1990.
Author address:D.B.Scott:
Molecular Genetics Unit
Department of Microbiology and Genetics
Massey University
Palmerston North, New Zealand
E-mail:D.B.Scott@massey.ac.nz
FEATURES from to/span description
pept 1560 4253 beta-D-galactosidase (cbgA)
pept 4500 4805 beta-D-galactosidase regulatory protein (cbgR)
BASE COUNT 1921 a 683 c 876 g 1640 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccttt tcatatatat ctttaatatt tctactggaa tagaagaggt tgctcaatac
61 aaaaaatgct tctttaaaac tatttgaaac tacttctgaa atattttcta gcttactaaa
121 tagagaatta taatttttat catcaaaatt tagaattaca actatgattt cgttttcaat
181 attagcaatt tgtatattat aattgctatt taatccgtct aaagaaaatt ctttgccgat
241 ttctgaaatt gtaaaatcaa taatttcatg gcgtttgcta taattatcat atatttcttt
301 gcgtttaaac caaataagca aaatgattga aaagtaaata tgtatcaaag tagttaaagt
361 caggatcatg tcaaaacctg atataaggcg atttaaggcg ctattagtga gacttaaaga
421 gtttccttct aaagtatttc ttttcatttt tattgaaatc ttttttagag tacttaataa
481 ctcagaagga tttagagaag gttttaaaat ataatcaaca gcaccatttt gaaaagatga
541 tttaacatat tcaaaatcgc tataactact taagatgata attcttatct taggatattt
601 gtcctgcaca aatttagcta attcaacccc atttatttgg ggcattacaa catcagaaat
661 tataatgtca ggaatatcct tttttatcat ttccagagct tcttgaccat tagaagcctg
721 tcctataatt tgaaagcctt ctttttccca atcaatcata tgagttatgc cttgccgcat
781 aataaattca tcatcaacaa ctaaattttt actatattcg ttcaatagta tagcacccct
841 tattctaaaa ttaccacaac atagataaat attgcttaat actattatac cttatagatt
901 tattgtatgt atctgtatac gttacgttaa ttcatctaca aatttatatg agttttggtt
961 gcacttttag agaaaatctt tttgtctatg gtcttattgt cctataatgg tcaaatcatc
1021 tttaccaaag tctcttgatt taaagagata aaaacaccac tgatccatta ttcctcattt
1081 tggtaatgaa cctatgcggt tgaagatatt aatcagatgt ctaaatactt tagaaaaaaa
1141 gacctttact aatatcttca atatttacac ccctattcta aaattaccac aagatagata
1201 aatattgctt aatactgatt ataccttata gattaaaggt tttcaattaa acaataaatt
1261 actttagtaa agtttagtaa aatataattg attttttact aaaaagataa taaaatgaaa
1321 ctataaattt agttaatagc ataaatctaa catcagaaga taggataaat taaagaagta
1381 atgtaattga ttacgaaaca aaatctcata ttaatattag cccataattt ttttattctc
1441 atatatgttt aagtattaat taaatgtgac tttataaaaa ggttgcattt agttaatacg
1501 attaacaact ttaatttaaa aaagcaataa ctctacaaag tgaaagtgag ggggtaagta
1561 tgattaataa taaaccgtca ttagattggc tagaaaatcc ggaaatattt agagttaata
1621 gaatagatgc tcattctgat acttggtttt atgaaaaatt tgaggatgtt aaattagaag
1681 acaccatgcc tcttaagcaa aatttaaatg gaaaatggag attttcatat agtgaaaatt
1741 catcattaag aattaaagag ttttataagg atgagtttga cgtaagttgg attgattata
1801 ttgaagttcc aggtcatatt cagcttcaag gatatgataa atgtcaatat attaatacta
1861 tgtatccttg ggaaggtcac gatgaattaa gaccacctca tatttcaaaa acatataatc
1921 cggtgggaag ctatgtaaca ttttttgaag ttaaagatga actcaaaaat aagcagactt
1981 ttatttcttt tcaaggtgtt gaaacagcat tttacgtatg ggtaaatgga gaatttgtag
2041 gatatagcga agatacattt acaccatcag aatttgatat tactgattat ttaagagagg
2101 gagaaaataa acttgcagtt gaggtttata aaaggagtag cgcaagttgg atagaagatc
2161 aagatttctg gagattttca ggcatcttta gagatgtata tttatatgca gttccagaaa
2221 ctcatgtaaa tgatatattt ataaaaacag atttatatga cgatttcaaa aacgcaaagt
2281 taaatgctga acttaaaatg attggaaatt cagaaacaac agttgaaaca tatttagaag
2341 ataaagaagg aaataaaata gctatatctg aaaagattcc gttctctgat gagttgactt
2401 tatatttaga tgcgcaaaat ataaacctat ggagtgcaga agagcctaac ttatatacac
2461 tttatatttt agtgaataaa aaagatggta atttaattga ggttgtaact caaaagatag
2521 ggtttaggca ctttgaaatg aaggataaaa ttatgtgtct aaaatggaaa cgtattatct
2581 ttaaaggcgt aaaccgtcac gaatttagcg caagacgtgg acgctcaatt acgaaagagg
2641 acatgttgtg ggatattaag ttcttgaaac aacacaatat taatgctgtt agaacatcac
2701 attatccaaa tcaaagttta tggtacagac tttgcgatga atacgggatt tatttaatag
2761 atgaaacaaa tttagaaagc catggttcat ggcaaaagat ggggcagatt gaaccatcat
2821 ggaatgtgcc aggaagtctt ccacagtggc aggcagcagt tttagatcga gcatcatcaa
2881 tggttgaaag agataaaaat catccatctg tacttatttg gtcatgtggt aatgaatcct
2941 atgcgggtga agatatttat cagatgtcta aatactttag aaaaaaagat ccttcacgtt
3001 tagtgcacta tgaaggggta actagatgca gagaatttat gacacgacga catgaaagta
3061 gaatgtatgc aaaggcagca gaaatagaag aatatcttaa tgataatccg aagaaacctt
3121 atatacagct gcgatacatg cactcaatgg gtaactcaac tggtggaatg atgaaataca
3181 cagaacttga agataaatat ttgatgtatc aaggtggatt catttgggat tacggcgatc
3241 aggcgttgta tagaaaactt ccagatggaa aagaagttct agcttatgga ggagacttta
3301 cagatcgtcc aacagactat aatttctctg gaaatggttt gatttatgca gatagaacta
3361 tatcacctaa agcacaggaa gttaagtatc tatatcaaaa cgtaaaatta gaaccagatg
3421 aaaaaggggt gactattaag aatcaaaatc tttttgttaa tactgataaa tatgatttat
3481 actatatcgt tgaaagagat ggaaaactaa taaaagatgg ttatctaaat gtatctgtag
3541 ctccagatga agaaaaatat atagaacttc caataggaaa ttacaatttt cctgaagaaa
3601 ttgtacttac aacctcatta agattagcac aagctacact ttgggcagaa aaaggatatg
3661 aaatagcatt tggacaaaag gttattaaag aaaaatcaga tatgaataat cataattcag
3721 agtctaaaat gaagatcatt catggagatg taaacatagg ggttcacgga aaagatttca
3781 aggctatatt ctctaaacaa gagggaggaa tcgtatcctt gagatataat aataaggagt
3841 ttataacgag aacgccaaaa actttctatt ggagagcaac aacagataat gatagaggaa
3901 atagacatga atttagatgc agtcaatggc tggctgctac tatggggcag aagtatgtgg
3961 atttttcagt tgaggaattt gatgagaaga ttacattata ttatacttat caattgccaa
4021 cagtgccatc tactaatgtt aagataactt atgaagtatc tggagaagga ataattaaag
4081 taaatgttaa gtataaagga gttagcggat tacctgaatt gcctgtacta ggaatggatt
4141 ttaaattatt agccgaattt aattcattta gctggtatgg aatggggcca gaagaaaact
4201 atatagacag atgtgaaggt gcaaaacttg gaatatatga gagtacacaa tagaaaatct
4261 atcaaggtat ttagtaccac aagaatgtgg taacaggata ggaactagat gggtagtagt
4321 taaaaatcat aagaatgaag gtcttaaatt tacttatgtt aaagttccat ttgaatttag
4381 tgttttacca tacagcagca tggaattaga aaattcactt catatagaag aattaccatc
4441 tgttaatttt acacattgtg aatataatag gtaaacaaat gggtgttggc ggagatgcaa
4501 tgctggggag caccatgata cctaaattct gtatagattc aagtaaggat ttagaatata
4561 gttttataat ttctaaaatt atactacgca catatgggaa ctatagatat ccaaaacaaa
4621 acttagactt atgcaataat ttacgaaagg acaggtactc tgttgtttcg gttactaaga
4681 ataagttgag gctttctaac atcataagtt gcaccatttc agcatgctcc cgagacaagc
4741 tcgtgacaag caaaaatgga acaacttatg atgaagaaat gcctgcaaca tattctttaa
4801 tgtaacactg cacaaaagag tacctgtcct ttctgatata gcagattttt caagctataa
4861 gtatatctca cgaaatcata aatattttga ttccgaaaag ctatgaaaat atcgctgaag
4921 gttctaagca gctggttgtg tgcaccttag catgctccaa ctttcagttt gacaagctaa
4981 aatggaacaa tctacagctc aagaaacttt aacagctcat tttcaaatgt tttctacaca
5041 aatatattta tatttctagt gaagatatga aattaaattt ttagcgactt tgtaaatatg
5101 ttaatctaat atacgaattc
//
LOCUS ECOPNCB 1490 bp ds-DNA BCT 19-JUL-1990
DEFINITION E.coli nicotinic acid phosphoribosyl transferase (pncB) gene,
complete cds.
ACCESSION J05568
KEYWORDS nicotinic acid phosphoribosyl transferase.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1490)
AUTHORS Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J.,
Meesters,H.A.R. and Witholt,B.
TITLE Variation of cofactor levels in Escherichia coli: Sequence analysis
and expression of the pncB gene encoding nicotinic acid
phosphoribosyl transferase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1490; revises [1])
AUTHORS Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J.,
Meesters,H.A.R. and Witholt,B.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by P.Terpstra, 31-MAY-1990.
FEATURES from to/span description
pept 216 1418 nicotinic acid phosphoribosyl transferase
(pncB) (EC 2.4.2.11)
mRNA 158 > 1490 nicotinic acid phosphoribosyl transferase mRNA
signal 124 129 -35 region
signal 146 151 -10 region
rpt 170 185 inverted repeat
binding 197 202 ribosome binding site
signal 1426 1450 rho-independent transcription termination
signal
revision 56 57 gc in [2]; cg in [1]
revision 191 191 t in [2]; tt in [1]
BASE COUNT 348 a 374 c 364 g 404 t
ORIGIN
1 tgttgcgtaa tgcgtatgca gaatcttcat cttttcaggt acaaacgcct ttattgctac
61 atttttataa catacagcgc gtaatgccat cgaccagaaa ggtggcatat ggtgtgatcg
121 gggttcaata aattgcgaaa caaggtatac tccagcagtt cctgaagatg tttattgtac
181 taaacgctcc tgtacgagga cgctactgcg cacctatgac acaattcgct tctcctgttc
241 tgcactcgtt gctggataca gatgcttata agttgcatat gcagcaagcc gtgtttcatc
301 actattacga tgtgcatgtc gcggcggagt ttcgttgccg aggtgacgat ctgctgggta
361 tttatgccga tgctattcgt gaacaggttc aggcgatgca gcacctgcgc ctgcaggatg
421 atgaatatca gtggctttct gccctgcctt tctttaaggc cgactatctt aactggttac
481 gcgagttccg ctttaacccg gaacaagtca ccgtgtccaa cgataatggc aagctggata
541 ttcgtttaag cggcccgtgg cgtgaagtca tcctctggga agttcctttg ctggcggtta
601 tcagtgaaat ggtacatcgc tatcgctcac cgcaggccga cgttgcgcaa gccctcgaca
661 cgctggaaag caaattagtc gacttctcgg cgttaaccgc cggtcttgat atgtcgcgct
721 tccatctgat ggattttggc acccgtcgcc gtttttctcg cgaagtacaa gaaaccatcg
781 ttaagcgtct gcaacaggaa tcctggtttg tgggcaccag caactacgat ctggcgcgtc
841 ggctttccct cacgccgatg ggaacacagg cacacgaatg gttccaggca catcagcaaa
901 tcagcccgga tctagccaac agccagcgag ctgcacttgc tgcctggctg gaagagtatc
961 ccgaccaact tggcattgca ttaaccgact gcatcactat ggatgctttc ctgcgtgatt
1021 tcggtgtcga gttcgctagt cggtatcagg gcctgcgtca tgactctggc gacccggttg
1081 aatggggtga aaaagccatt gcacattatg aaaagctggg aattgatcca cagagtaaaa
1141 cgctggtttt ctctgacaat ctggatttac gcaaagcggt tgagctatac cgccacttct
1201 cttcccgcgt gcaattaagt tttggtattg ggactcgcct gacctgcgat atcccccagg
1261 taaaacccct gaatattgtc attaagttgg tagagtgtaa cggtaaaccg gtggcgaaac
1321 tttctgacag ccctggcaaa actatctgcc atgataaagc gtttgttcgg gcgctgcgca
1381 aagcgttcga ccttccgcat attaaaaaag ccagttaata tcatcaggga gctaatcggc
1441 tccctttttt tacctttaat tccgaaatct ttcgctgcat ttgcgaattc
//
LOCUS NEUCCON13 2728 bp ds-DNA PLN 19-JUL-1990
DEFINITION N.crassa conidiation-specific protein (con-13) gene, complete cds.
ACCESSION M35120
KEYWORDS conidiation-specific protein.
SOURCE N.crassa (strain 74-OR23-1A) DNA, clone pCon10a.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 2728)
AUTHORS Hager,K.M. and Yanofsky,C.
TITLE Genes expressed during conidiation in Neurospora crassa: Molecular
characterization of con-13
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.M.Hager, 12-JUN-1990.
Author address:K.M.Hager:
Dept. of Physiology
UCLA Medical School
10833 Le Conte Avenue
Los Angeles, CA 90024-1751
E-mail:COTRAN%VXBIO.SPAN@STAR.STANFORD.EDU
FEATURES from to/span description
pept 1009 1275 conidiation-specific protein (con-13), exon 1
1333 1847 conidiation-specific protein, exon 2
1910 2150 conidiation-specific protein, exon 3
pre-msg 922 2367 con-13 mRNA and introns (alt.)
pre-msg 927 2367 con-13 mRNA and introns (alt.)
pre-msg 936 2367 con-13 mRNA and intron (alt.)
pre-msg 946 2367 con-13 mRNA and intron (alt.)
IVS 1276 1332 con-13 intron A
IVS 1848 1909 con-13 intron B
site 2364 2367 polyadenylation site
BASE COUNT 653 a 695 c 720 g 660 t
ORIGIN Linkage group IV.
1 gatctcatca tctgaaacgc cgcctgagtc aatgactctt ggcaatcggg ctctgcgtcc
61 ggctagatag acagcgtccc actgatacag acttggtaag ctgccacagt tgccaagttt
121 ttatatcgat tattctttga acttccaagg acagtcttca agggcgcttt ctgtctcagc
181 atcgggagat atgacgcccg tggttcgtat accaatggtt cggcactaag gcgctgcatt
241 tgactcggag atattgacgc ctgccccctt ttgagaggag actgagtgag cgaggcccaa
301 tactatcacc acagttgcgg ttagctgccg agacttatcg gtcaacaccg aaatattggc
361 ccagaagggc aacaaaacgg gctgtcgatg gcttgcaacc attgatatcc ctgattgcca
421 ttcctacact accgcccatt cttcattcaa acctgactct cttactccct ttacagtcta
481 gcagatctgg acgtacctgc atgtaatgcg gccaacgggg ctggtaagct gaacacacca
541 ttcggagcgg ctggcaagtc tgtcatgccc gatcgacagc acatgtacta gactatctta
601 agcctagttc cgtgttcaga aacatccggt ttgattgcga atcaacagta cattgatgtt
661 catccaccgg actctaaacc gatcagctaa ttgttggcgg agcggagttc atcgcgggcg
721 taggaaacaa ggttgatgtt acccgtaaat ggaaatcgtg cttcgctcac ggcgttgctc
781 cgaagtaggg tgaagaggtc cgttggctgt gatggtttgc gctggtgtgt gtcaacgctt
841 agtgatgctg gtgatccaac tccgatccaa atgacaaagc aatgcatata agaaggactg
901 ggcatcacca acagcgcaac ggcggcagac acgaagccct agctcgacaa gcagccttca
961 taccccgacc aaaaagtcac acttgtcgta ccgtaacctc gtcgcaagat gccccaggct
1021 catttcttcg cgttgctgct tgcagccgtt gtaccggccg ttttggcgga cggtcccccg
1081 gaatcgatgg gcgagaagtt cagcggcctc aacgttctgg atgggaacgg cggacttcaa
1141 agtttgaccc cgacacccta caccataagt caatggcctt ggggtactgt acccaagctg
1201 tgctatgaca cgtctgtcaa caacaagtac tgcaacccgt acgatctcga agtatacgat
1261 gtcagataca cggatgtagg taaaagactt gcctcggatt cggaacctgt gcttacctta
1321 acttgacaat agtgccccat tcccaccacc gtctgccgat gcaagaactc acctatggcc
1381 atagacacca ttgcgcagcg tgtcggccaa ctccctgtca aggctcgcca gtataatggc
1441 tatgtgtcca gctttgcggg agacatgtgc tcagcctaca gcgatagctt caacaactac
1501 ttctttggcg actgcggcaa ttccgagtcc gtcttcttcc atgagctcag ccacaacctt
1561 gaccgtcacg ttgcaggggc gtccatcaac gattggtact ccctttcgca agactggaag
1621 gataccgttg ccaaggacac ttgcgtcgca gaccactatt ccaaggccag ctggctcgag
1681 gcatatgccc aggtgggagt catggctgga tacgatgcta cggtacagtc tatctatacc
1741 caaaatgtcg gctgtatggt caatcaggtc aagaaggtgg ttggacagtt gaacagtgtc
1801 tggcgtaaac agcctgggca gatgtgcgat cgttactgga tcaaggagta agtttctttc
1861 aacaagaccc attttcttga tgaccctgtg ctgaccggaa tgtaaacagc accacggttt
1921 gcatgggacc tgatgcggaa gccagtggcc actgtcaagc atccaaagct gatgtcgcgg
1981 cggagtctgg tggtgtaaac ccagtgttgc cggacgggca gcagaagaag cacgacgcct
2041 tggtcaagga gcttcagcgt cacgccgagg ccgcggccgg catttcttcc ggaaaaccgg
2101 cggccgatag aaagaccaag ggtaagaagg gtaccaaatt cagggtctga agcgggaact
2161 atgatcgatt ccaggtcctg ggctctagct gtgagttcag tcagggtgtt gaggaagttg
2221 cgaggcctca gttgtgagcg acgtcatcaa accgtctcct tttgggataa tgataacctt
2281 ttatttctgg ataactggga caggttaggc tgtctttgtc gatagactag gtacgtaaga
2341 attgatttga tgcttgttcg atgcttttaa gttgttgtcg cttgtggttg cgaggtagtc
2401 ggcaggtttg tttggataga cgggagacgc ccactcgcac ccagggcgat gaataacgaa
2461 ggccgatggc tctttccatg tgggaaatac acaagtctgg cattgtccac ttgtttgtct
2521 tcgagcgggg ttacgatttc tgtcaagccc tttgctcctt tcttccgaga acaaaggaag
2581 ttttcgatcc agatcgccaa catccgaaaa gggaggaata gttcgatcga tgtaccttga
2641 cggctcggcc atcgatctga tctgcatttc ccactctgga ttccagggga agggtcatat
2701 gatggaaacg agatcgaaac ccattgag
//
LOCUS VVUVVHAB 2237 bp ds-DNA BCT 19-JUL-1990
DEFINITION V.vulnificus cytolysin (vvhA) and vvhB gene (pot.), complete cds.
ACCESSION M34670
KEYWORDS cytolysin; cytotoxin; hemolysin; toxin.
SOURCE V.vulnificus (strain EDL174) DNA, clone pCVD702.
ORGANISM Vibrio vulnificus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 2237)
AUTHORS Yamamoto,K., Wright,A.C., Kaper,J.B. and Morris,J.G.
TITLE The cytolysin gene of Vibrio vulnificus: Sequence and relationship
to Vibrio cholerae El Tor hemolysin
JOURNAL Infect. Immun. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.B.Kaper, 29-MAY-1990.
FEATURES from to/span description
pept 745 2160 cytolysin (vvhA) precursor
sigp 745 804 cytolysin signal peptide
matp 805 2157 cytolysin
pept 237 743 pot. cytolysin (vvhB)
signal 55 60 -35 region
signal 80 85 -10 region
signal 87 92 -35 region
signal 110 115 -10 region
signal 184 189 -35 region
signal 206 211 -10 region
signal 2185 2219 transcription termination signal
binding 54 69 CRP binding site
binding 59 74 Fur binding site
binding 185 199 Fur binding site
binding 226 231 vvhB ribosome binding site
binding 730 735 vvhA ribosome binding site
BASE COUNT 639 a 498 c 509 g 591 t
ORIGIN
1 tatattagat cacttttaaa acaataatag atcagatatt aatctgttga ttttgtgata
61 atgagccaaa aaatactttt attttattta tatgaaatat tttcaggatt attaataaat
121 agccaacagg attttggtgc atatctattc tcaaggacga accaaacaat ctccatacaa
181 atattaatgt tatggagaaa ataacaataa taacccttac tcgtaatgag gaatctatgc
241 ttaataacaa aaatagaaat gtaggacgcc ttaccctact ctgctgtttg tttgcggcga
301 atacttttgc tgatgttcaa attttgggca gcgaaagtga gctttcacaa accattgccg
361 atcagtacca acaaaatgtc acgctgttta acggccagct aaacagtaat gatgtgttgt
421 atgtcaatgt aggaacagca accgatgacg aaatcactca agcaaaaagt catatcatct
481 ccggtagcac cgtggtgatt gatttgactc aaattgctgg tgacgacgca aggcttgatt
541 ggagccaaaa actcactggt ttaggactgt cagcgcctgt tgtggttacg ggggtttatc
601 aaggcgacgc cttagtcaat gcgattgtca gcgatgtcac cgacgagaat gacaacccaa
661 tcaacgatcc ccaagccgag ttagagagcg ttaaactttc tctcactcat gccctagacc
721 gcttccaatc tgagggaaaa taagatgaaa aaaatgactc tgtttaccct ttctctttta
781 cgtaccgcgg tacaggttgg cgcacaagaa tatgtgccga ttgttgagaa acctatttac
841 atcaccagct caaagattaa gtgtgtgttg cacacaagcg gtgatttcaa cgccacacga
901 gactggtgta atgcgggtgc ttccatcgat gttcgcgtca atgtggcaca aatgcgctcg
961 gtacaatcgg caacgtcaga tggttttact cctgacgcca aaattgtccg tttcaccgtc
1021 gatgccgaca agcctggcac gggtattcat ttggttaacg agctacagca agatcacagc
1081 tggttccaga gttgggcaaa ccgccgcact tacattggtc cattcgccag cagttacgac
1141 ctttgggtga aacccgtttc tggttacaca ccgaaaaaag cccgtgacct accgcagaat
1201 gagaacaaaa actaccaaca ccgcgatact tacggttact ccatcggtat taacggcaaa
1261 gtaggtgcgg aagtgaacaa agacggcccg aaagtgggtg gcgaagtcag tggctcattt
1321 acctacaact actcgaagac cttggtgttt gatacaaaag actatcgcat caacaaccgt
1381 tcatcattga gtgattttga tatttcattc gagcgtgaat ttggggaatg tgatgaactg
1441 cgccgccaag agcttggatg ctatttcacc gccgctcact ggggcagtgg ctgggtattt
1501 gataagacga agttcaaccc tatctcttat tccaacttca aaccgaacta tgacgttttg
1561 tacgaagcgc ccgtgtctga aactggcgta acggattttg agatgggcgt gaaactcaac
1621 tatcgtgcac gctttggtac cgttcttcct tcagcgctgt tttcggttta cggctctgcg
1681 ggctcgtcaa ccaacagcag tactgtgaaa caacgtattc gcatcgactg gaatcaccca
1741 ctgtttgaag cggaacgaca cgttacactg cagtcactga gcaacaacga tctctgcctg
1801 gatgtttatg gtgagaacgg tgacaaaacg gttgcgggtg gttcggttaa cggctggagc
1861 tgtcacggca gttggaacca agtttggggc ctagataaag aagaacgtta tcgtagccga
1921 gtggcatccg atcgttgttt gaccgtaaac gcagacaaaa cgctcacagt cgaacagtgt
1981 ggtgcgaact tagcacagaa atggtattgg gaaggcgata agctcattag ccgctatgtt
2041 gatggcagta atactcgcta ccttctaaac attgttggtg gtcgtaatgt tcaagtaacc
2101 cctgaaaatg aagcaaatca ggcgcgttgg aaacccacat tacaacaagt caaactctag
2161 gctctgttga ccttagcgat atccaaacgc tccctgtata ctagggagcg tttttcttta
2221 ttcgccatct attcgtc
//
LOCUS CHKMTPEPCK 3571 bp ss-mRNA ORG 19-JUL-1990
DEFINITION Chicken mitochondrial phosphoenolpyruvate carboxykinase (PEPCK-M)
mRNA, complete cds.
ACCESSION J05419
KEYWORDS phosphoenolpyruvate carboxykinase.
SOURCE Chicken 3-day old liver mitochondrion, cDNA to mRNA.
ORGANISM Mitochondrion Gallus domesticus
Unclassified.
REFERENCE 1 (bases 1 to 3571)
AUTHORS Weldon,S.L., Rando,A., Matathias,A.S., Hod,Y., Kalonick,P.A.,
Savon,S., Cook,J.S. and Hanson,R.W.
TITLE Mitochondrial phosphoenolpyruvate carboxykinase from the chicken:
Comparison of the cDNA and protein sequences with the cytosolic
isozyme
JOURNAL J. Biol. Chem. 265, 7308-7317 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.L.Weldon, 17-MAY-1990.
FEATURES from to/span description
pept 28 1950 phosphoenolpyruvate carboxykinase precursor
(EC4.1.1.32)
matp 28 126 phosphoenolpyruvate carboxykinase signal
peptide
matp 127 1947 phosphoenolpyruvate carboxykinase
rpt 2025 2123 large repeat copy A
rpt 2124 2223 large repeat copy B
rpt 2224 2316 large repeat copy C
rpt 2317 2416 large repeat copy D
rpt 2417 2508 large repeat copy E
rpt 2519 2543 small repeat copy A
rpt 2547 2577 small repeat copy B
rpt 2580 2611 small repeat copy C
rpt 2745 2777 small repeat copy D
rpt 2780 2811 small repeat copy E
rpt 2847 2878 small repeat copy F
rpt 2040 2050 GCCAAGATGGC 11 bp repeat
rpt 2105 2115 GCCAAGATGGC 11 bp repeat
rpt 2205 2215 GCCAAGATGGC 11 bp repeat
rpt 2298 2308 GCCAAGATGGC 11 bp repeat
rpt 2072 2082 TCCAAGATGGC 11 bp repeat
rpt 2139 2149 TCCAAGATGGC 11 bp repeat
rpt 2265 2275 TCCAAGATGGC 11 bp repeat
rpt 2332 2342 TCCAAGATGGC 11 bp repeat
rpt 2424 2434 TCCAAGATGGC 11 bp repeat
rpt 2524 2534 TCCAAGATGGC 11 bp repeat
rpt 2558 2568 TCCAAGATGGC 11 bp repeat
rpt 2657 2667 TCCAAGATGGC 11 bp repeat
rpt 2691 2701 TCCAAGATGGC 11 bp repeat
rpt 2724 2734 TCCAAGATGGC 11 bp repeat
rpt 2758 2768 TCCAAGATGGC 11 bp repeat
rpt 2792 2802 TCCAAGATGGC 11 bp repeat
rpt 2825 2835 TCCAAGATGGC 11 bp repeat
rpt 2859 2869 TCCAAGATGGC 11 bp repeat
rpt 2926 2936 TCCAAGATGGC 11 bp repeat
rpt 2983 2993 TCCAAGATGGC 11 bp repeat
rpt 3023 3033 TCCAAGATGGC 11 bp repeat
rpt 3057 3067 TCCAAGATGGC 11 bp repeat
rpt 3114 3124 TCCAAGATGGC 11 bp repeat
rpt 3234 3244 TCCAAGATGGC 11 bp repeat
BASE COUNT 589 a 1077 c 1197 g 708 t
ORIGIN
1 tcctcgccta tactgggaca atttataatg ttttggttaa gagggggggc gcagagttgt
61 aggggggggg aaactgagga cagaatgcag cgcgggatgt ggggcgtggg cctggcccgg
121 cgcaggctga gcacgtcgct gtcggcgctg ccggcggccg cgcgggattt cgtggaggag
181 gcggtccggc tgtgcaggcc cagggaggtt ctgctgtgcg atgggtccga ggaggagggg
241 aaggagctgc tcagagggct gcaggacgac ggggtgctgc atccgctgcc caaatacgac
301 aactgctggt tggctcgcac cgacccccgg gacgtggctc gggtgcaaag caagacggtg
361 ttggtaaccc ccgaacagag cgacgccgtc cccccacccc ccccatccgg gtccccccaa
421 ttggggaact ggatgagccc caatgctttc caggcagctg tgcaggagcg tttccccgga
481 tgcatggcag gccgccccct ctacgtcatc ccattcagca tgggcccccc cacgtccccc
541 ttggccaaac tgggggttca ggtgaccgac tccccctacg tggtgctctc catgcgcatt
601 atgacccgcg tgggccccgc ggtgctgcag cgcctcgacg acgacttcgt ccgctgcctc
661 cactctgtgg ggcggcctct gcccctcacc gagcccctgg tgagctcgtg gccgtgcgac
721 cggtcccgtg tcctggttgc ccacatcccc tcggagcgcc ggatcgtctc cttcggttcg
781 ggatacggcg gcaattcgct gctgggcaag aagtgcttcg cgctggccat cgcgtcccgc
841 atggcccagc agcagggctg gctggccgag cacatgctga ttttgggggt gacgtccccc
901 agcggtgaga agcgttacat ggcggcggcc tttcccagcg cctgcgggaa aaccaacctg
961 gccatgatga cccccagcct gccgggttgg cggatccact gcgttgggga cgacattgcg
1021 tggatgaagt tcgatgatcg agggcgcctc cgcgccatca accccgagcg tggctttttt
1081 ggggtggccc cggggacgtc gtcgcgcacc aaccccaacg ccatggccac catcgcccgc
1141 aacaccatct tcaccaacgt ggggctgcga agcgatggcg gcgtctactg ggacggcctg
1201 gatgagccca cggagcccgg ggtcacctac acctcctggc tgggcaagcc gtggaagcac
1261 ggtgaccccg agccgtgcgc ccaccccaac tcccgtttct gcgccccggc cgatcagtgc
1321 cccattatgg acccgcgttg ggacgacccg gaaggagttc ccatcgacgc catcatcttc
1381 ggggggcgcc gaccccgcgg agtgccgttg gtggtggagg cctttgggtg gcgccacgga
1441 gttttcatgg gcagcgcaat gaggagcgaa gccaccgccg ccgccgagca caaaggcggc
1501 cgtttgatgc acgacccctt cgccatgagc ccctttttcg gctacaacgc ggggcgttac
1561 ctggaacatt ggctgtctac gggtctccgg agcaacgccc gcctcccccg tctgttccac
1621 gttaattggt tcctccgaga taacgaaggt cgcttcgttt ggcccggctt cggtcacaac
1681 gcccgcgtct tggcttggat cttcgggagg atccagggga gggacactgc ccggcccacc
1741 cccatcggtt gggtacccaa agaaggggat ttggacctgg gggggctgcc gggggtcgat
1801 tactcccaac tgttccctat ggagaagggc ttttgggagg aggagtgcag gcagctgagg
1861 gagtattacg gggagaactt cggggccgat ctgcccaggg atgtcatggc ggagctggag
1921 ggcctggagg agagggtgag gaagatgtga ggggtcgggg tggggctgag ggaaaggatg
1981 gggggaggtt gggggggctg tggggggcga ggtgggggct ggcggtgggg gttggtgagg
2041 ccaagatggc ccatcggtat gggttggccg ttccaagatg gctgccgccg ctatgagttg
2101 gtcagccaag atggccgccg acagtgtggg ttggtgggtc caagatggct gccatcggta
2161 tgggttggcc gttccaagat gctgccgccg ctacgagttg gtcagccaag atggccgccg
2221 acagcgtggg tccaagaagg ccgccatcat tacgggttgg ccgttccaag atggctgccg
2281 ccactacgag ttggtcagcc aagatggcca ccgacagtgt gggttggtgg gtccaagatg
2341 gctgccatca gtatgggttg gccattccaa gatcgtgccg ccgctacgag ttggtcagcc
2401 aagatggctg ccgacagcgt gggtccaaga tggccgccat cattacgggt tggccgttcc
2461 aagatcgtgc cgccactacg agttggtcag ccaagatggc caccgacagc gttggttggt
2521 gggtccaaga tggctgccat cattgtgggt tggccgttcc aagatggccg ccatcactgt
2581 gggttggccg ttccaaggtg gctgccatct ttgtgggtcg gtgggcccat gatggctgcc
2641 atcgtgggtt ggctgttcca agatggctgc cagcagcgtg agatgactgt tccaagatgg
2701 ctgccaccac tatcagttgg ccatccaaga tggccgccaa cagcgtgggt tggtgggtcc
2761 aagatggccg ccatcactgt gggttggccg ttccaagatg gctgccgcca ctatgagttg
2821 gccatccaag atggctgcca gcaggatggg ttggtgggtc caagatggct gccaccataa
2881 tgcattggcc agacaagatg gccaccagca gcatgggatt gccgatccaa gatggccgcc
2941 ctacctggga aggagccccc tgcctgctca ttggctgagc gctccaagat ggctgccatt
3001 ccacgtcctc gttggttgac catccaagat ggctgccacc cccacagagt ggccgatcca
3061 agatggccgc cccgcctggg agggatcctc ctgccctctc attggctgag cgttccaaga
3121 tggctgccat tccacgtcct cattggttga ccatccacga tggctgctgc cttcctctcc
3181 attggctacc catctaagat ggctgctctc ctttgtcctg attggctggc caccccaaga
3241 tggctgctcg tgcccatcct ggctgctcat tggttcctgc agagctgtgg tgcctcccaa
3301 ttggtcgggg ccatttgata gtgggacttc tgggcgccat cttggagtga cgtcacactg
3361 tgagcaacgc tgcgttccta ctggcttgcc gcagcctccc atgaccaatg gctgtgtccg
3421 cttggttgcg aacgccctcg cctaatcaca gcgtcccgtt ggccgagcgg agcgtcctga
3481 ttggccgagc tcttcccctt gtccaaacgg cagcttccca ttggctgtgc tcatctcaat
3541 ggcctatcag agccgcccgt ggacctcaga a
//
LOCUS HUMPANMU 4139 bp ss-mRNA PRI 19-JUL-1990
DEFINITION Human pancreatic mucin mRNA, complete cds.
ACCESSION J05582
KEYWORDS pancreatic mucin; tumor-associated antigen.
SOURCE Human pancreatic tumor cell line HPAF-CD11, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4139)
AUTHORS Lan,M., Batra,S., Qi W,-N., Metzgar,R. and Hollingsworth,M.
TITLE Cloning and sequencing of a human pancreatic tumor mucin
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.A.Hollingsworth, 08-JUN-1990.
FEATURES from to/span description
pept 74 3841 pancreatic mucin precursor
sigp 74 136 pancreatic mucin signal peptide
matp 137 3838 pancreatic mucin
mRNA < 1 4139 pancreatic mucin mRNA
rpt 453 2880 tandem repeat
rpt 299 452 5' degenerate tandem repeat
rpt 2881 2957 3' degenerate tandem repeat
signal 4118 4123 poly-A signal
BASE COUNT 632 a 1910 c 1055 g 542 t
ORIGIN
1 ccgctccacc tctcaagcag ccagcgcctg cctgaatctg ttctgccccc tccccaccca
61 tttcaccacc accatgacac cgggcaccca gtctcctttc ttcctgctgc tgctcctcac
121 agtgcttaca gttgttacag gttctggtca tgcaagctct accccaggtg gagaaaagga
181 gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg ctgtgagtat
241 gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca ctcagggaca
301 ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca cctggggaca
361 ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc cgccagccca
421 cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc ccccagccca
481 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
541 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
601 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
661 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
721 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
781 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
841 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
901 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
961 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1021 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1081 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1141 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1201 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1261 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1321 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1381 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1441 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1501 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1561 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1621 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1681 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1741 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1801 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1861 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1921 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
1981 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2041 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2101 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2161 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2221 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2281 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2341 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2401 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2461 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2521 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2581 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2641 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2701 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2761 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2821 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
2881 tggtgtcacc tcggccccgg acaacaggcc cgccttgggc tccaccgccc ctccagtcca
2941 caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg tgcacaacgg
3001 cacctctgcc agggctacca caaccccagc cagcaagagc actccattct caattcccag
3061 ccaccactct gatactccta ccacccttgc cagccatagc accaagactg atgccagtag
3121 cactcaccat agctcggtac ctcctctcac ctcctccaat cacagcactt ctccccagtt
3181 gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc agtttaattc
3241 ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca tttctgaaat
3301 gtttttgcag atttataaac aagggggttt tctgggcctc tccaatatta agttcaggcc
3361 aggatctgtg gtggtacaat tgactctggc cttccgagaa ggtaccatca atgtccacga
3421 cgtggagaca cagttcaatc agtataaaac ggaagcagcc tctcgatata acctgacgat
3481 ctcagacgtc agcgtgagtg atgtgccatt tcctttctct gcccagtctg gggctggggt
3541 gccaggctgg ggcatcgcgc tgctggtgct ggtctgtgtt ctggttgcgc tggccattgt
3601 ctatctcatt gccttggctg tctgtcagtg ccgccgaaag aactacgggc agctggacat
3661 ctttccagcc cgggatacct accatcctat gagcgagtac cccacctacc acacccatgg
3721 gcgctatgtg ccccctagca gtaccgatcg tagcccctat gagaaggttt ctgcaggtaa
3781 cggtggcagc agcctctctt acacaaaccc agcagtggca gccgcttctg ccaacttgta
3841 gggcacgtcg ccgctgagct gagtggccag ccagtgccat tccactccac tcaggttctt
3901 caggccagag cccctgcacc ctgtttgggc tggtgagctg ggagttcagg tgggctgctc
3961 acagcctcct tcagaggccc caccaatttc tcggacactt ctcagtgtgt ggaagctcat
4021 gtgggcccct gaggctcatg cctgggaagt gttgtggggg ctcccaggag gactggccca
4081 gagagccctg agatagcggg gatcctgaac tggactgaat aaaacgtggt ctcccactg
//
LOCUS DOGSRP9A 1271 bp ss-mRNA MAM 19-JUL-1990
DEFINITION C.lupus signal recognition particle 9 protein (SRP9) mRNA,
complete cds.
ACCESSION M34952
KEYWORDS signal recognition particle protein.
SOURCE C.lupus (strain Madin-Darby) kidney, cDNA to mRNA.
ORGANISM Canis lupus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 1271)
AUTHORS Strub,K. and Walter,P.
TITLE Assembly of the alu domain of the signal recognition particle
(SRP): Dimerization of the two protein components is required for
efficient binding to SRP RNA
JOURNAL Mol. Cell. Biol. 10, 777-784 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Strub, 07-JUN-1990.
FEATURES from to/span description
pept 59 319 signal recognition particle 9 protein
BASE COUNT 372 a 232 c 254 g 413 t
ORIGIN
1 gcccacctac cacctacctc gggcggccag aaaccgatgc ggggggccca gcggcaagat
61 ggcgcagtac cagacttggg aggagttcag ccgcgcggcc gagaaactct acctcgccga
121 ccctatgaag gcacgtgtgg ttctcaaata taggcattct gatgggagtt tgtgtattaa
181 agtaacagat gatttagttt gtttggtgta tagaacagac caagcccaag atgtaaagaa
241 gattgagaaa ttccacagtc aactaatgcg actcatggta gccaaggaat cccgcagtgt
301 tgccatggaa acggactgac gggtttgaaa tgaagatcct tcatgttctt aggagtaaat
361 atcttttgaa tcagaaaaag tgttgggaaa gaaaatatgt aactaagtgg gctcttcaga
421 agtggggaga tcattttttg tactttgttt tttaatgttt actttagaga gctaggaacg
481 tacatgcttt cggtgaaagc ctttatttat ttttggaaat tcagtaaaag gcagttcttc
541 cttaaattta gttaatctgt ctttaaaaga aaattaaatt taaccatttt gctggattgt
601 tgtatttctt ttggagcata aaatttgtgc tattgatgac caacaaacaa acataaaata
661 tagtaattgg aattacctgt gcacagcagt gtacctatgt ataatatagt aattagtctc
721 agttctatct aaaagtaatc atggaaatga gtatgcttta cctaaaactt ttccaaactt
781 aaactgtatt tttgaatgta aggaatttgt agtatcgtta gcttgttgag cagggacttg
841 ctttaatcta gtttccagtg ctcaaaaaca actgcattta cttgaagtgc atgaacagat
901 gatcactagt ggactgaacc accatattac gcaagtattt gcctgcagat ttcccatcta
961 tattttctca gaagggctaa agattatttg aactgttaaa tctttgccat atgtctgtgc
1021 cactcctgcc tgtttctccc tgtacttaac caaggtgttg aacatgactg tcacaactgt
1081 tagttaaatc tttgcatatg tctgtgccac tcctgcctgt ttctccctgt acttaaccaa
1141 ggtgttgaac atgactgtca caactgttat ttttttcatt aagtcagaag gatatcattt
1201 gatatttatc atataattgt aacctcagtt ttaccatctc aatgtaatgt tcacatgttg
1261 ttcctacatt a
//
LOCUS PCHPMMMSA 6409 bp ss-mRNA INV 19-JUL-1990
DEFINITION P.chabaudi major merozoite surface antigen mRNA, complete cds.
ACCESSION M34947
KEYWORDS major merozoite surface antigen; surface antigen.
SOURCE P.chabaudi chabaudi (strain IPP-C1), cDNA to mRNA, and DNA, clone
IPP-C1/C.
ORGANISM Plasmodium chabaudi
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 6409)
AUTHORS Deleersnijder,W., Hendrix,D., Bendahman,N., Hanegreefs,J.,
Brijs,L., Hamers-Casterman,C. and Hamers,R.
TITLE Molecular cloning and sequence analysis of the gene encoding the
major surface antigen of Plasmodium chabaudi chabaudi
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Deleersnijder, 06-JUN-1990.
FEATURES from to/span description
pept 667 6024 major merozoite surface antigen precursor
sigp 667 723 major merozoite surface antigen signal peptide
matp 724 6021 major merozoite surface antigen
rpt 324 365 tandem repeat region
BASE COUNT 2681 a 991 c 922 g 1814 t 1 others
ORIGIN
1 tctagataat atattttttg tatgcatgct aaaattaatt atacatatat taaatagatt
61 tgtgcgaatc tttatgtgtg caagttattt tttttaataa taattatcca tataccacat
121 tatttatttg tgtaccgtta aatatttatt ttctaagcga tttttctcct taaattatat
181 tttttntgat catttttttt ttttttttgg aaaatcggga gcataaaaaa tatatattac
241 actttataaa ttttttatac acatttgttt attttatttt atatatattt tttaacacat
301 ttttattttg aaatgatatg atcaattata aaaaaacaat aacataataa tagtaataat
361 ttttttttgt acgatatata aaattatgca tttttatttt tatagtaagt taaaaagtgt
421 attatatgta cgtattttgt ttaacagaac ggaaattaga aaaaacacaa taaaacttat
481 atatatatgt gtaattagtg tatgtgtata tatttgtcaa cattataaat gatataattg
541 aacttcaata tttattttta cacaaattag tactaatata aaatgcaaaa gtaatgtacc
601 tttgtgtgta ttaattttag cattataatt tattccactc tgtatattag ttaagtttcg
661 ttgaaaatga aggcgatcgg acttttgttt tctttcgttt tttttgctat atattgcaaa
721 tctgaaacaa taggagttta caatgatctc gttcataagt tagaaaagtt agaagaatta
781 tcagtagaag gattagaact atttcaaaaa agtcaagtaa ttgtaaatgc acaatcacca
841 gaaacacctg ttgatccatt tacaaaccct gaatttgcac aaaagttaca accatttatt
901 ttaaaatttg aagaattagg atttacagaa caaacagagt tagtcaattt aataaaaact
961 ttaggcccaa ataaatatgg actaaaatat ttaattgaaa gtaaagaaga atttaacgaa
1021 ttaatgcacg caataaattt ttactatgac gtgcttagag ataaattgaa tgatatgtgt
1081 gcaaataact attgtgaaat tcctgaacat cttaaaatta atgttgaaga aatcgaaatg
1141 cttaagaaag ttgtcttagg ctatagaaaa ccaattgaaa atattcaaga tgatcttgta
1201 aaattagaag aatatattgc aagaaataaa gcaactgctg aaaccttaaa cactcttatt
1261 actgaagaaa caaaaaaaat aacacctgaa gaagaaacag attgcaacga tactaattgc
1321 gacaatacta aatatggaaa gaaaaaagca atatatcaag ctatgtacaa tgttatattt
1381 tacaaaaagc aattagctga aataaaaaaa gtcatcgaag tcttagaaaa gagagttgct
1441 acattaaaga agaacgaagc cataaaacca ttgttacaac aaatcgaagc tatcagaggt
1501 ccacctgctg tcactgaagg acaaatagct acagaaggaa gcagcgaaga aacaaaacaa
1561 aatagtacag aatcatctaa cacaaaaacg actactactg acaaagctgt tacaacccaa
1621 accgctacta aagcaactgg tacagaaaca aatactggta cagaaacaaa tactggcaca
1681 gaaacaaata ctgccacagg aacaactact gccacaggaa caactactgc cacaggaaca
1741 cctactgtca ctgaaccagt tcaagtgcca gccgttcaag ttcttacaga agaagaaaaa
1801 gcaaaaaaaa tagctgaact ttatgctcaa attaaagaaa ttgcaaaaac tataaaattc
1861 aatttagacg gaatatttgt cgatccagtt gaattagaat attacaaaaa agaaaaaaaa
1921 aatgaaagct gccattcaac ttcatcttgc cacaaaaata aaacacctga aactgtaata
1981 ccattaaatg tacgttatcc aaatggtatt agctacccat taactgaaga agttgtttac
2041 agcaaaattg ctcataatgc cgctgaaaca acttatggtg atttaacaaa tgtcgataat
2101 acagccataa cagaagattt aaccacaaat gaacaagcaa gaaaaaattt aattaaagct
2161 attaaaaaga aaatcgaagc agaagaacaa aaattagtag aattaaaaga tgattatgat
2221 actaaacttg cagcatttaa tggacaaaaa actccattca aagaagcagc taaaaaattt
2281 tatgaatcca aatttagaaa taaattgact actgacattt ttgacgattt taaaacaaaa
2341 agaactgaat atatgaacaa gaaagctgca ttagtaggtt gtgaatatgg aaatactcaa
2401 caactcatta ataaattaaa taaacaactt aattatttac aagattatgg attaagaaaa
2461 gaaatagtta acactgaaat tgaatatttt tcaaacaaaa aatcagaatt acaatataat
2521 attaatagat tagcaaatgc tgttcaagca aaacaaaata tattagttgc atcaaaacat
2581 attccacttt caacacttgt tgaattacaa atccaaaaat ctttattaac aaaactaatt
2641 gaacaattaa ataaaactga attttcttta aataaagctc acttaaaaga caagatatac
2701 gttccacaaa catatggtaa agaaggaaaa ccagaaccat actacttaat agctataaaa
2761 aaagaaattg acagacttgc caaatttatt cctaaaattg atgatatgat tgagaaagag
2821 aaacaaaaaa tggaacaaga acatgtagct accggagaat ctgaacaagc ctcttctgcc
2881 tctggtactg gatcatccac agaaaccaca tcacaaacag caccagccgt tccagctgca
2941 cccgcaccag cagaaaaggc aaaagaagga acagaatcaa cagaagaaac cccagcagca
3001 tcaaaaccag ccgaaggtgc agcatccaca ggtgcaacca ccccaacaga acaagaagct
3061 gcaccaacag aacaagaagc acaacctgca gcacctgaaa caccagcaga ggtaccagca
3121 ccaaccacgc ctgcagctcc agcaactcca gccgcaccag cagcacccgc aaaaccagtt
3181 atgacaaaat tatattacct tgaaaaatta aagaaatttt tagcattctc atatgcatgc
3241 cataaatatg ttttattaca aaactctacc ataaacaaag atgctttaag caaatatgct
3301 cttacaccag aagaagataa aataagaaca ttaaagagat gcagtgaatt agatgtatta
3361 ttagctattc aaaataatat gcctactatg tattcacttt atgaaaatgt agttgatggt
3421 ttacaaaaca tttacactga attatatgaa aaagaaatga tgtatcatat atataactta
3481 aaagataaaa acccagctgt taaagcttta ttagtaaaag ctggcgtcat tgatccagaa
3541 ccagtagccc caacaccagc agtaccagca ccagaaactg caccagaaac tgcaccagaa
3601 actgcaccag aaacaccagc acaagaagct ccacaacaac cagaatcggc acaagcacca
3661 gaagcagcaa ctgaaacaac aacaccagcc gaatcggcat caacagaacc aacaccaaaa
3721 gcacctacag caacacccac atctgaaaca gtaacacaag aaggaacaac accagcagca
3781 ccaaaagcac aagaaggagc atcatcatca gcaccagcac aaccagcccc agcaaaacca
3841 gcacctgcac aaacagtaac agggcaatca acaaacgttg aaggaagtac tcaagtaaga
3901 gcagaaagtg aagacgaaat gtttgtcgat gattttgaag tagacaattt ttacaaatct
3961 tacttacaac aagttgatgg aaataatact caattcatag attttataaa atctaaaaaa
4021 gaattaatca atgcattgac ccctgaaaaa gttaaccaat tatatcttga tattgcacac
4081 ttaaaggaat tatcagaaca ttactataat cgttattata aatataaatt aaaattagaa
4141 agattatatc aaaaacatga acaaattgaa gcagctaacc aaaaagttaa agaaattagc
4201 gtattaaaat cccgattatt aaaaagaaaa aaatatatta atggtacatt ttatgtatta
4261 tctggttttg caaatttctt taacaagaga agagaagctg aaaagcaata tgtagataac
4321 gcaataaaaa atactgatat gttattgaaa tactacaaag ctcgtagtaa atattttact
4381 tctgaagctg ttcctttaaa aacattaact aaaacatcaa ttgacagaga agccaactac
4441 ttgaaaatcg aaaaattcag agcatacagc cgattagaat taagattaaa gaaaaatatt
4501 aacttaggaa aagaaagaat tacatatgta tctggtggtt tacaccatgt atttgaagaa
4561 tttaaagaac ttttaaaaaa taaaggttat accggaaaaa ctaaccctga aaatgctcct
4621 gaagttatca aggcattcga acaatataaa gaattacttc caaagggagc aacaactcca
4681 gctccagtag ttgcacctgt agttgctcca gccccagcta cagcagcccc agcagctgac
4741 gcaccagtac cagcagccgc agccgcagcc gcatcaggat caggatcagc agccacaaca
4801 gaaggagaag ccgctacaac agtagttgca agcagcgata atgatgatga tgacgatgat
4861 gatatggatc aaattgcaaa tgctcaatcc acagacgaag aagtaaaaga tattcttgat
4921 gcatttaaaa gtgaaaatga atatatatac acaaagagct taggtaacac atataaatca
4981 tttaaaaaac acatgttaaa agaattttca atgattaaag aagacataat gactggatta
5041 aactataaat tagaaaaaag aaatgatttc cttgatgtat taagctatga attagcttta
5101 ttcaaagata taaataccaa caaatttgtt gttaaaaacc cataccaatt attagataat
5161 gataagaaag acaaacaaat gataaactta aaatatgcca ttaaaggtgt aactgaagat
5221 atcgaaacag ctactgatgg aattgaattc tttaacaaaa tgattgaatt atacaaacct
5281 caattaaacg cagttaatga acaaattgct gccataggaa cagaacctac cgatgccgaa
5341 aaaaagaaat acgctccaat ctttgaagat cttaaaggat tatatgaaac catattgaac
5401 ggagcagaag aattttcaga attattacaa cacaaacttg aaaactataa aattgaaaaa
5461 gctggatttg acattttaat ggcaaattta gaaacataca taagaattga cgaaaaactt
5521 gaagacttcg tagaaagtgc agaaaaaaat aaacacattg cctcaatagc tttaaataac
5581 ttaaacaaat ctggtttagt aactgaaggt gaatcaaaga aaatattagc aaaaatgctt
5641 aacatggatg ccatggattt attaggtata ggttctaatc atgtatgtat tagtacaagt
5701 actcctgaca atgctggatg ctttagatat gatgatggta cagaagaatg gagatgttta
5761 ttaggtttca aaaaagatga tgatggtaat agatgtgtag cagatgatgc tcctgtttgt
5821 aataacaaca atggtggatg tgataaaaat gctgattgta gagaagtaga aaatacagat
5881 agggatcctt ccaaaaaaat tgtatgtact tgtaaagaac caaacccaaa tgcatattat
5941 gctggtgtat tctgtagttc ttccggattt atgggattat caattttatt gatcatcaca
6001 ttaattgtat ttaatttatt ttaaataaat gattaaaata tttgttgcat tttatatttt
6061 tcctatatat attttaaaag ttgtataata catttgaaat atatattttg gcataaattg
6121 tatatttttt attatataaa aaaatatata tatataattt ttaataaaca tttttaaata
6181 aacgtacatg tgttttagta taggaaattt tgtatgactt taaaatatga tgatactatt
6241 ttttttaaat gtatagtaaa ttaatttatt tttatttttt atacaatata ttgtatgtgt
6301 gttctttatt actattattt tataagtata taaaataaag ctattttttt ttttttttta
6361 acttcaaaca tatttagtaa cttttttatt taaagaatag ccggaattc
//
LOCUS SHPMHCA 588 bp ss-mRNA MAM 19-JUL-1990
DEFINITION Sheep MHC class I protein gene, 3' end.
ACCESSION M34672
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Sheep 8-week old, cDNA to mRNA, clone SC17.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 588)
AUTHORS Grossberger,D., Hein,W. and Marcuz,A.
TITLE Class I major histocompatibility complex cDNA clones from sheep
thymus: Alternative splicing could make a long cytoplasmic tail
JOURNAL Immunogenetics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Grossberger, 30-MAY-1990.
FEATURES from to/span description
pept < 1 287 MHC protein (AA at 3)
BASE COUNT 120 a 161 c 157 g 150 t
ORIGIN
1 ccaggaagtg ggcggccctg gtggttcctt ctggagagga gcacacatac acgtgccgtg
61 tgcagcacga ggggcttcag gagcctaccc tgagatggga acctcctcag acctccttcc
121 tcaccattgg catcattggt ctggatctcc tcgtggttgc tgtggtggct ggagctgtga
181 gctggatgaa gaagctctca ggtgaaaaaa gacggacgta cacacaggct gcaagcagtg
241 acagtgccca gggctctgat gtgtctctca cggtccctaa agtgtgaaac gctgccttgt
301 gggactgagt gatgctgcat cccgcaatgt gacgtcagat cctggacccc tctttctcgg
361 ctgcatccga atgtgtctgt gctcctagta gcataacatg aggagctggg gagactggtc
421 acccctgccc accacacccc cttctccgct gacctgtgtt ctcctccctg atacactgtc
481 ctgttccagc agagacaggg ctgggccgtg tcatcgctgt ctttgcttca tatgcactta
541 gtaatgatgt cttatttcat ctttgaaaat aaaatctgta tatatatc
//
LOCUS SHPMHCB 841 bp ss-mRNA MAM 19-JUL-1990
DEFINITION Sheep MHC class I protein gene, 3' end.
ACCESSION M34673
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Sheep 8-week old, cDNA to mRNA, clone SCI89.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 841)
AUTHORS Grossberger,D., Hein,W. and Marcuz,A.
TITLE Class I major histocompatibility complex cDNA clones from sheep
thymus: Alternative splicing could make a long cytoplasmic tail
JOURNAL Immunogenetics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Grossberger, 30-MAY-1990.
FEATURES from to/span description
pept < 1 543 MHC protein (AA at 1)
BASE COUNT 179 a 235 c 238 g 189 t
ORIGIN
1 gaggactacc tggagggccg gtgcgtggag tggctccgca gatacctgga gaccgggaag
61 gacacgctgc tgccgcagac ccttccaaag gcacatgtga cccgacaccc catctctgag
121 cgtgaggtac ccttgaggtg ctgggccctg ggcttctacc ctgaggagat ctcactgacc
181 tggcagcgca atggggagga ccagacccag gacatggagc tcgtggagac caggccttca
241 ggagatggaa ccttccagaa gagggcggcc ctggtggtgc cttctgaaga ggagcagaga
301 tacacgtgcc atgtgcagca cgaggggctt caggagctca ccctgagatg ggaacctcct
361 cagacctcct tcctcaccaa gggcatcatt gttggcctgg ttctcctcgt gctggctgtg
421 gtggctggag ctgtgatctg gaggaagaag tgctcaggtg aaaaaagagg cacctatacc
481 caggcttcaa acaatgacat gtgcccaggc tctgatgtgt ctctcacagt tcctaaagtg
541 tgagacgctg ccttgtggga ctgagtgatg ctgtatccca ctatgtgatg tcagatccct
601 gacccctctt tctgcagctg catctgaacg ttgtctgtgc tccatgtagc ataacgtgag
661 gagctgggga gattggtcac ccctgcccac cacaccccct cccgcctgga cctatgtctc
721 ctccctgata cactgtccta atccagcaga gagggcctgg ctgtctccat ccctgtcttg
781 cttcatgtgc actgagtaat gatgtcttat acccttattg aaaataaaat ctgtatatat
841 g
//
LOCUS SHPMHCC 995 bp ss-mRNA MAM 19-JUL-1990
DEFINITION Sheep MHC class I protein gene, 3' end.
ACCESSION M34674
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Sheep 8-week old, cDNA to mRNA, clone PSCI16.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 995)
AUTHORS Grossberger,D., Hein,W. and Marcuz,A.
TITLE Class I major histocompatibility complex cDNA clones from sheep
thymus: Alternative splicing could make a long cytoplasmic tail
JOURNAL Immunogenetics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Grossberger, 30-MAY-1990.
FEATURES from to/span description
pept < 1 537 MHC protein (AA at 1)
site 466 626 unspliced intron
BASE COUNT 203 a 248 c 288 g 254 t 2 others
ORIGIN
1 gaccctccaa aggcacatgt ggcccatcac cccatctctg accgtgaggt caccctgagg
61 tgctgggccc tgggcttcta ccctgaggag atctcactga cctggcagcg tgacggggag
121 gaccagactc aggacatgga gtttgtggag accaggcctt caggggatgg aaccttccag
181 aagtgggcgg ccctggtggt gccttctgga gaggagcaga gatacacgtg ccgtgtgcag
241 cacgaggggc ttcaggagcc cctcaccctg agatgggaat ctcctcagcc ctccgtcctc
301 accatgggca tcattgttgg cctggttctc ctcgtggtgg ctgtggtggc tggagctgtg
361 atctggatga agaagcgctc aggtgaaaaa ggacggatct acacccaggc tgcaagcatg
421 tacagtgccc agggctctga tgtgtctctc acggttccta aaggtgaggc cctggagtgt
481 ctagattgga aggagcattg gggcagaggg gacacactgg gtggcggggg tctctgagtg
541 ggacatgtga gcatgtcggg ggctgtggag aatatcagcc cttacatgac tgacctgaac
601 tggctcctga ttcttttctc tcacagtgtg agacagctgc cttgtgggga ctgagtgatg
661 cttggtccca ctttgtgatg tcagatcgcc ggacccctct ttcttcagct gcatctgaat
721 gtgtctgtgc tcctattagc ataacatgag aagttgggga gactggtcac ccttgcccac
781 tgtacgctgt ccccaccctg acctgtgttc tcctccctga tccaccatcc tgttcagcga
841 gacgggctgg gccatcttca ttgctatctt tgcttcacat gcactgagta atgatgtctt
901 atttccttat tgaaaataaa ttctgtatat atatgaatct attttttcta attggtgcca
961 tgaaagggnn ttggataata aaatgagaat tcgat
//
LOCUS SHPMHCD 1050 bp ss-mRNA MAM 19-JUL-1990
DEFINITION Sheep MHC class I protein gene, 3' end.
ACCESSION M34675
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Sheep 8-week old, cDNA to mRNA, clone PSCI11.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1050)
AUTHORS Grossberger,D., Hein,W. and Marcuz,A.
TITLE Class I major histocompatibility complex cDNA clones from sheep
thymus: Alternative splicing could make a long cytoplasmic tail
JOURNAL Immunogenetics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Grossberger, 30-MAY-1990.
FEATURES from to/span description
pept < 1 747 MHC protein (AA at 1)
BASE COUNT 216 a 292 c 315 g 227 t
ORIGIN
1 ggcgggtctc acaccatcca ggcgatgtac ggctgcgaag tgggacctga cgggcgtctc
61 ctccgcgggt atgagcagtt cgcctacgaa ggcagagatt acctcgccct gaacgaggac
121 ctgcgctcct ggaccgcggc ggacacggcg gctcagatca ccaagcgcaa gtgggaggcg
181 gcaggtgagg cggcgcgtgt gaggatctac ctggagggca cgtgcgtgga gtggctccgc
241 agacacctgg agaccgggaa ggacacgctg ctgcccgcag accctccaaa ggcacatgtg
301 acccaacacc ccatcactga gcgtgaggtc accctgaggt gctgggcctt gggcttctac
361 cctgaggaga tctcactaac ctggcagcac aatgaggagg accagaccca ggacatggag
421 cttgtgaaga ccaggccttc aggggatgga accttccaga agtgggcagc cctggtggtg
481 ccttctggaa aggagcagag atacacgtgc cgtgtgcagc acgaggggct tcaggagccc
541 ctcaccctga gatgggcacc tcctcagacc tccttcctca ccatgggcat cattgttggc
601 ctggttctcc tcgtggtgac tgtggtggct ggagctgtga tctggaggaa gaagcgctca
661 ggtgaaaaaa gacagaccta tacccaggct gcaagcagtg acagtgccca gggctctgat
721 gtgtctctta tggttcctaa agtgtgagac agctgccttg tggggactga gtgatgcttg
781 gtcccattct gtgacatcag atcttgggac ccctctttct gcaggggcat ctgaatgtgt
841 ctgtgctcct attagtataa catgaggagt tggggagact ggtcacccct gcccactgca
901 caccgtcccc accctgacct gtgttctcct tcctgatcca ctgtcctgtt gcagcagaga
961 cgcctgggcc ctctccatca ctgtctttgc ttcatatgca ctgagtaatg atgtgttatt
1021 tcctttttga aaataaaatc tgtatatatg
//
LOCUS SHPMHCE 1396 bp ss-mRNA MAM 19-JUL-1990
DEFINITION Sheep MHC class I protein gene, complete cds.
ACCESSION M34676
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Sheep 8-week old, cDNA to mRNA, clone PSCI12.
ORGANISM Ovis aries
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1396)
AUTHORS Grossberger,D., Hein,W. and Marcuz,A.
TITLE Class I major histocompatibility complex cDNA clones from sheep
thymus: Alternative splicing could make a long cytoplasmic tail
JOURNAL Immunogenetics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Grossberger, 30-MAY-1990.
FEATURES from to/span description
pept 4 1110 MHC protein precursor
sigp 4 89 MHC protein signal peptide
matp 90 1107 MHC protein
BASE COUNT 278 a 403 c 434 g 281 t
ORIGIN
1 cccatgacca gaggattgcg agtaatgggg ccgcgaaccc tcctgttgct gctctcggga
61 gtcctggtcc tgaccgagat ccgggcgggc ccccactcca tgaggtattt cagcaccgcc
121 gtgtcccgcg ccggcgccgg ggagccccgg tacctggaag tcggctacgt ggacgacacg
181 cagttcgtgc ggttcgacag cgacgccccg gatccgaaga tggagcagag ggagccgtgg
241 atgaagcagg tggggccgga gtattgggat cggaacacgc gaaatcccaa gggcaacgca
301 cagactttcc gagtgggcct gaccatcctg cgcggctact acaaccagag cgagaccggg
361 tctcacacct ggcagtgtat gtacggctgc gacgtggggc cggacgggcg tctcctccgc
421 gggttcatgc agttcggcta cgacggcaga gattacatcg ccctgaacga ggacctgcgc
481 tcctggaccg cggcggacac ggcggctcag gtcacccagc gcaagtggga gaaggaaggt
541 gcggcggacc actacaggaa ctacgtggag ggcacgtgcg tggagtgcgt gcgcagatac
601 ctggagatcg ggaaggaaca gctgcagcga gcagaccctc caaaggcaca tgtgacccat
661 caccccatct ctggccatga tgtcaccctg aggtgctggg ccctgggctt ctaccctgag
721 gagatctcac tgacctggca gcgcaatggg gaggaccagt tgcaggacat ggagcttgtg
781 gagactaggc cttcagggga tggaaccttc cagaagtggg cggcccttgg tggtgcttct
841 ggagaggagc agagatacac gtgccatgtg cagcatgagg ggcttcagga gcccctcacc
901 ctgagatggg aacctcctca gacctccttc ctcacttcct caatgggcat cattgttggc
961 ctggttctcc tcgtcatggt ggctgtggtg gctgcagctg tgatctggag gaagaagtgc
1021 tcaggtgaaa aaagagggac ctatacccag gcttcaagca atgacagtgc ccagggttct
1081 gatgtgtctc tcacggttca taaagtgtga gacagtgatg ctgcatcccg ctatgtgcca
1141 tcagatcccc ggacccctct ttctgaagct gcatctgcac gtgtctgtgc tcctagtagc
1201 ataacgtgag gagttgggga gaccgttcac ccctgcccac cgcgccccct cctgccctga
1261 cctgtgttct cctccctgat ccactgtcct gttccagcag cagacagggc tgggccgtct
1321 ccatccctgt ctttgcttcg tatgcactga gtaatgatgt cttatttcct tattgaaaat
1381 aaaatctgta tgtatg
//
LOCUS YSPNMT1A 3787 bp ds-DNA PLN 19-JUL-1990
DEFINITION S.pombe no message in thiamine protein (nmt1) gene, complete cds.
ACCESSION J05493
KEYWORDS .
SOURCE S.pombe DNA.
ORGANISM Schizosaccharomyces pombe
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3787)
AUTHORS Maundrell,K.
TITLE nmt1 of fission yeast: A highly transcribed gene completely
repressed by thiamine
JOURNAL J. Biol. Chem. 265, 10857-10864 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Maundrell, 07-JUN-1990.
FEATURES from to/span description
pept 1499 2539 no message in thiamine protein (nmt1)
mRNA 1430 2681 nmt1 mRNA
signal 1396 1403 TATA box
BASE COUNT 1218 a 657 c 690 g 1222 t
ORIGIN
1 ggctcattta taatctagca ctttatacct tttacctgac tgttgggttg tttatctgac
61 ctcataaaag aaagtgtcgt tttggaaaaa ttagcattac attgagggtc ctccgctaat
121 gctcctgcga aaaatgattt taattttgga tgttttttca gaaataaaat gaaaattagc
181 ttgatataat atcaaccggc agcgagtaat agatttaaaa taaatttgat taattaaaaa
241 aatttgttgt tttaagcaag ccattttgct aaaatcaaag gtaatggaag agtatttccg
301 aaaaatctca acacatgtga atgatcagaa aattatcgcc ataaaagaca gaataagtca
361 tcagcggttg tttcatttcc tatatttttt ttttattttt ttatttttta ataagggaaa
421 atttaacgtc taaggataca gaagattgtt agcacattaa agtaataaag gcttaagtag
481 taagtgcctt agcatgttat tgtatttcaa aggacataat ctaaaataat aacaatatca
541 tttctcacaa gttattcaat tttctttttt ttttctaata atatcaagaa tgtattattt
601 gtttgacata agtcaactaa tttatttaat atgctggatt aatcttgcag acatgtaaat
661 taacaagttt tagtcaaata acgttgaagt ttcaatgaac tcaaataatt tctctttttt
721 tttatataac catatgtcta atctgattta tattttccgc aggatcaact gaagttatga
781 catttggatt ggatcactta taaccttggt cgccaaataa tacaaaaatc agcgttataa
841 aacaaagaag gtttttgtta agaaattaat cctctttctt gataagaaag ttgaaccgaa
901 attgcagata ctgatatatg aaaataatac ccacaatttt gggaatagcg caagcctcaa
961 tttaaacaat aggtgaggac acatgataat gacctcaatg attgttagaa gaaaagagcc
1021 tcattacaaa atcgaaaaat gaatggttgg gtacaagttt ccaaaacatg gtaaagtgga
1081 ctttgcgtat gagacgtaaa tagaaaaaaa cacttgttat atgttttcta gaattattgt
1141 tgtctcttta tggttggatg atgcaaaata gtaatttcgg ttagttgctg taaaacacca
1201 cgagacaaat agatatggat atttattaaa tcaggaaaaa cgtaactctc ggctactgga
1261 tggttcagtc acccaacgat tactggggag agaaaacagg gcaaaagcaa agcttaaagg
1321 aatccgattg tcattcggca atgtgcagcg aaactaaaaa ccggataatg gacctgttaa
1381 tcgaaacatt gaagatatat aaaggaagag gaatcctggc atatcatcaa ttgaataagt
1441 tgaattaatt atttcaatct cattctcact ttctgactta tagtcgcttt gttaaatcat
1501 gtctactaac aagatcactt tcctcacaaa ctgggaggcc actccttacc atttgcccat
1561 ctttcttgct caaactcgcg gatactatga gcgtgaaggg attgaggttg ctattctcga
1621 gcctaccaac ccttccgacg ttacagcatt gattggttct ggtaaggttg acatgggatt
1681 aaaggccatg atccatactt tagctgctaa ggctcgcgga taccctgtca ccagttttgg
1741 atctttgtta aatgagcctt tcactggctt aattactttg aagggtaatg gcatcaacga
1801 cttcaaggac attaaaggaa agcgtattgg ctacgttggt gagtttggaa agatccaact
1861 cgatgacttg tgcagcaagt tcggtttgtc tccttctgat tatactgcta ttcgctgtgg
1921 tatgaacatt gcccctgcca tcatcaatgg tgaaatcgat ggcggcattg gcattgaatg
1981 catgcaacaa gtcgagcttg agcgctggtg cgtctcccaa ggccgcccaa ggtctgatgt
2041 ccaaatgttg cgtattgatc gattagccaa cttaggttgc tgctgtttct gtaccatttt
2101 gtatattgca catgatgaat tcattgctaa acatcccgac aagatcaagg ccttcttacg
2161 tgctatccat tctgctactt tggatatgct taaagatcct gtccaaacct acaaggagta
2221 cattcacttc aagcgtgaaa tgggatccga acttcatcgg gaacaatttg aacgttgctt
2281 tgcatatttc tcacatgaca tctctaacgt ccccagagat tggaacaagg ttaccaatta
2341 ttccaagcgt ttgggcatca tcccccaaga ttttgagccc aactgtacta acggttactt
2401 gacctgggaa cttgaccccg atgagaagga tcccatgggc aaacaagaag ccattgccga
2461 gatccaagat gaaattaagc aaaagggagg tgtcttcagc ggcaactcac ttcgttatgt
2521 cgagcctgcc aacctttaaa aggaatgtct cccttgccag tactgctagg gtttttcttt
2581 caaactatgg aagcccattc aagctgcata ttacgatttt gtttttcgct tttagaaagt
2641 ggtttagatg agataataga aaaattcttg atctccgaca acgagtactt ttattttttt
2701 tgctaatcac tttactcaat attagctcga aatcgtagaa acgtagacgg gtgcgggata
2761 ccgagtggtg tagttaagaa tttttataaa ccacgtggcc caaaaatatg aacccaaaac
2821 gtttatacat gagtatactt taagaaggct ataccccttc gtgttagatg tagttttagc
2881 tacccaaccc gagtctatga gcttgacttc agatgtagaa ggcattaaat cgttttgaat
2941 attaattaaa aaacgatgaa aattaaatat ttaaaagcaa tcatacgctg aaaatttagt
3001 gctgtggcta atccttcaac atggaaatgc cataaaagtg actttgacaa aaaaaaaagt
3061 atatacaggt agtaaactca tctacttcat tgactttgtt tacagcatgt ggaaggagga
3121 atatttattg ctaaatcgta gtttaacatt caataagtaa tactattgaa attcgacaag
3181 attggccgca tggatgaaaa agaggcattt tgctttggga gaattagttc aaattagaac
3241 tgaaaaaaaa aactttacga ggcaaaaatg tcggattgag atcgtaaaag ttcgctcgtc
3301 gtcttttgct ttgtgattgt tttcatggat acatcttgct ggatatttaa attttagtac
3361 tatgtataag atattctata aatgttttat cacccaaacc tgttagcgcc ttcttaattc
3421 tattcaatct ggcttttgct ctgagactac ttcttggact ttcactactt gttagttata
3481 cggaatttgt gtaattagaa gtgaaataat cctttctatt agtaatgcaa acaaaaatca
3541 attggaaagc aaatttacac atacttgctg tatcgccttc gactatcttt tcattgcata
3601 ccatgatttt agacgtttat acttaagcaa ttaaaaggtt ttgattcaat cataaacata
3661 attatccttg ataaaaaaag aattatacac attgttctct ttatttgact tcgaactgtt
3721 taacatcgaa acggtcagat gatacaccca ttcctccaat gtaatccctg gcttcttggg
3781 caagctt
//
LOCUS CP7CPL 1470 bp ds-DNA PHG 19-JUL-1990
DEFINITION Bacteriophage Cp-7 muramidase (cpl7) gene.
ACCESSION M34779
KEYWORDS muramidase.
SOURCE Bacteriophage Cp-7 [from S.pneumoniae] DNA.
ORGANISM Bacteriophage Cp-7
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 1470)
AUTHORS Garcia,P., Garcia,J.L., Garcia,E., Sanchez-Puelles,J.M. and
Lopez,R.
TITLE Modular organization of the lytic enzymes of Streptococcus
pneumoniae and its bacteriophages
JOURNAL Gene 86, 81-88 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 286 1314 muramidase
BASE COUNT 483 a 270 c 318 g 399 t
ORIGIN
1 cagctggaca ggcttaaaag gagttatcaa acataccctt acattcattt tttactactt
61 tgtagcggta ttcttgacct atattcacgc tatggcagtc ggtcagattt tgctggttat
121 cattaactta tactatgctt tgtcaatcat ggaaaatctt gctgttatgg gtgtatttat
181 tcccaagttt atgacggcaa gggtgcaaga agagttacag aaatacacag cacaactaga
241 cgcagggaaa gacctgctag aagaatttaa aggagaaaag aaataatggt taagaaaaat
301 gatttatttg tagacgttgc aagccatcaa ggctacgaca tttcaggaat tttagaagaa
361 gcagggacaa caaacacaat tattaaagtg tcagaaagta caagctattt aaacccttgc
421 ttgtctgctc aagtgagcca gtcaaatcct atcgggtttt atcattttgc ttgctttggt
481 ggaaatgaag aagaagcaga agcagaagca cgctatttcc ttgataacgt gcctacacaa
541 gttaaatacc ttgtactaga ttatgaagac catgcaagcg caagcgtaca aagaaacact
601 accgcgtgct tacgctttat gcaaatgatc gcagaagctg gatatacacc tatttattat
661 agttacaaac cgtttacgct tgataatgtg gactatcagc agattttagc acagttccct
721 aattctctat ggattgcagg ctatggctta aatgatggta cagctaactt tgaatacttt
781 ccaagcatgg acggtatcag atggtggcaa tattctagta acccgtttga caagaatatt
841 gtactgttag atgatgagaa agaagataat ataaacaatg aaaacactct aaaaagcctt
901 accacagtag ccaacgaggt cattcaggga ctttggggca acggtcaaga acgttatgac
961 agtttagcga atcgagggta tgacccccaa gcggttcaag acaaagtgaa tgaaatctta
1021 aacgctagag aaattgcaga ccttaccaca gtagccaacg aggtcattca gggactttgg
1081 ggcaacggtc aagaacgtta tgacagttta gcgaatcgag ggtatgaccc ccaagcggtt
1141 caagacaaag tgaatgaaat cttaaacgct agagaaattg cagaccttac cacagtagcc
1201 aacgaggtca ttcagggact ttggggcaac ggtcaagaac gttatgacag tttagcgaat
1261 cgagggtatg acccccaagc ggttcaagac aaagtgaatg aattactttc ataacaagta
1321 aaagctagta gaaattttct actagctatt tttatattct gctatgattt tataggcgtc
1381 ctcatctggg ttatccagag caatggagca aatggcagac aggacagctg ttcatctgat
1441 tgtatttctg taaatagtga ttttctagct
//
LOCUS CP9CPL 1253 bp ds-DNA PHG 19-JUL-1990
DEFINITION Bacteriophage Cp-9 muramidase (cpl9) gene.
ACCESSION M34780
KEYWORDS muramidase.
SOURCE Bacteriophage Cp-9 [from S.pneumoniae] DNA.
ORGANISM Bacteriophage Cp-9
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 1253)
AUTHORS Garcia,P., Garcia,J.L., Garcia,E., Sanchez-Puelles,J.M. and
Lopez,R.
TITLE Modular organization of the lytic enzymes of Streptococcus
pneumoniae and its bacteriophages
JOURNAL Gene 86, 81-88 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 52 1071 muramidase
BASE COUNT 403 a 213 c 294 g 343 t
ORIGIN
1 agtagacgca ggaaaagacc tgctagaaga atttaaagga gaaaagaaat aatggttaag
61 aaaaatgatt tatttataga cgtatcaagc cacaacggtt acgatataac aggaatttta
121 gagcagatgg gaacaacaaa cacgattgtt aaaatctcag aaagtacgac ctatttaaac
181 ccttgcttgt ctgctcaagt ggaacagtct acccctattg gcttttatca cttcgcacgc
241 tttggcggag acgtagcaga agctgaaaga gaagcgcagt ttttccttga caacgtgcct
301 acacaagtta aataccttgt attggactat gaagacgacc caagcggaaa cgcacaagcc
361 aacactaacg catgcttacg ctttatgcag atgattgcag acgctggata tacacctatt
421 tattatagtt ataaaccttt cacgcttgat aatgtggact atcagcagat tttagcacag
481 ttccctaatt ctctctggat tgcagggtat ggcttgaatg atggaaacgc tgattttgaa
541 tattttccat ctatggacgg gataagatgg tggcagtatt ctagtaaccc gtttgacaag
601 aatattgtac tgttagacga tgaagaagac gaaaagccaa agactgctgg aacgtggaaa
661 caagacagta agggctggtg gttcagacgc aataacggta gtttccctta taataaatgg
721 gaaaaaatcg ggggtgtgtg gtactacttc gatagtaaag gatattgctt aacgagcgaa
781 tggctcaaag ataatgaaaa atggtactac ctcaaggaca acggcgctat ggtgactggt
841 tgggtgctag tcgggtcaga gtggtattat atggacgatt caggtgcaat ggttactggt
901 tgggtcaaat acaagaataa ctggtactat atgacaaatg aacgtggtaa catggtttct
961 aatgaattta ttaaatctgg aaaaggctgg tatttcatga acacaaacgg agagcttgca
1021 gacaatccaa gctttacaaa agaaccagac ggacttataa cggtagcata aaaagaaaag
1081 ctagtagaaa ctttctacta gctgttttta tattctgcaa tgattttata agcgtcttcg
1141 tctgggttgt ccagagcgat ggagcagatg gcagacagaa ccgctgttca tctgattgta
1201 tttctgtagg tagtgatttt ctaggctgtt atgttgctga tgtgctttat acc
//
LOCUS YSCTY31A 5510 bp ds-DNA PLN 19-JUL-1990
DEFINITION S.cerevisiae Ty3-1 retrotransposon integrase gene, complete cds,
and Cys-tRNA gene.
ACCESSION M34549
KEYWORDS integrase; transfer RNA-Cys; transposable element; transposon.
SOURCE S.cerevisiae (strain AB950) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 5510)
AUTHORS Hansen,L.J. and Sandmeyer,S.B.
TITLE Characterization of a transpositionally active Ty3 element and
identification of the Ty3 integrase protein
JOURNAL J. Virol. 64, 2599-2607 (1990)
STANDARD unannotated staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.B.Sandmeyer, 24-MAY-1990.
FEATURES from to/span description
pept 536 1408 integrase
tRNA 105 31 (c) Cys-tRNA
mRNA 343 > 5510 integrase mRNA
site 121 462 5' sigma element
site 5132 5471 3' sigma element
rpt 121 128 5' inverted terminal repeat
rpt 455 462 3' inverted terminal repeat
rpt 5132 5139 5' inverted terminal repeat
rpt 5464 5471 3' inverted terminal repeat
rpt 116 120 5' insertion target sequence
rpt 463 467 3' insertion target sequence
rpt 5127 5131 5' insertion target sequence
rpt 5472 5476 3' insertion target sequence
BASE COUNT 1955 a 1306 c 919 g 1330 t
ORIGIN
1 aactttcatg gaaggaccac ctagttaata aaaagctcgc actcaggatc gaactaagga
61 ccaacagatt tgcaatctgc tgcgctacca ctgcgccata cgagcttgat tttctgaaag
121 tgttgtatct caaaatgaga tatgtcagta tgacaatacg tcaccctgaa cgttcataaa
181 acacatatga aacaacctta taacaaaacg aacaacatga gacaaaaccc gaccttccct
241 agctgaacta cccaaagtat aaatgcctga acaattagtt tagatccgag attccgcgct
301 tccaccactt agtatgattc atattttata taatatataa gataagtaac attccgtgaa
361 ttaatctgat aaactgtttt gacaactggt tacttcccta agactgttta tattaggatt
421 gtcaagacac tccggtatta ctcgagcccg taatacaaca cctggtagcg ttaaaggtta
481 ctaattgttc aaacgaacca tcgaaaagcc gaacctagct acaccacacc ccagtatgag
541 ctttatggat caaatcccag gaggaggaaa ttatccaaaa ctcccagtag aatgccttcc
601 taacttcccg atccaaccat ctttgacctt cagaggtaga aatgactcgc ataaactgaa
661 aaactttatc tccgaaataa tgttaaacat gtctatgata tcttggccga atgatgccag
721 tcgtattgtg tactgcagaa gacatttatt aaaccccgct gctcagtggg ctaatgactt
781 tgtacaagaa caaggtatac ttgaaataac attcgacaca ttcatacaag gattatatca
841 gcatttctat aagccaccag atatcaataa aatctttaat gcaatcacgc aactttccga
901 agctaaactt ggtattgagc gtctcaacca acgattcaga aagatttggg acagaatgcc
961 accagacttc atgaccgaaa aagctgccat aatgacatat actaggctat tgacaaagga
1021 aacctataat attgtcagaa tgcacaaacc agagacatta aaagacgcca tggaagaggc
1081 ttaccagaca actgcactaa ctgaaagatt cttcccagga ttcgaacttg atgctgatgg
1141 agacactatc atcggtgcca caacccactt acaagaagaa tacgactctg actatgattc
1201 agaagataat ctgacccaga atggatacgt ccataccgta aggacaagaa gatcttacaa
1261 taaaccaatg tcaaatcatc gaaacaggag aaataacaac ccatctagag aagaatgtat
1321 aaaaaatcgg ctatgcttct attgtaagaa agagggacat cgcctgaacg aatgtagagc
1381 acgtaaggcg agttctaacc gatcttgaac tcgaatcaaa agaccaacaa actcctttta
1441 tcaaaacctt accaattgta cactatatcg ccatccccga gatggacaat accgccgaaa
1501 aaaccataaa aatacaaaac acgaaagtaa aaaccctgtt tgacagtgga tcacccacgt
1561 catttatccg aagagatatt gtagaacttc tcaaatacga aatctacgag acccctccac
1621 tccgttttag aggattcgta gccaccaaat ccgccgttac atccgaagca gtcaccattg
1681 acctcaaaat caatgacctg catataactt tagccgcgta catactggat aacatggact
1741 accaattgtt aattggaaat ccaatcttac gccgctaccc gaaaatcctg cacacagtac
1801 tgaataccag agagagcccc gactccttaa agcccaagac ttatcgctcc gaaaccgtta
1861 ataacgttag aacctactcc gctggtaatc gtggtaaccc cagaaacata aaactgtctt
1921 ttgcccccac cattctcgaa gcaactgacc cgaaatccgc tggtaatcgt ggtgactcca
1981 gaaccaaaac cctgtctctt gcaaccacta ctcctgcagc aattgacccg cttacgaccc
2041 ttgataaccc aggtagtact caaagtacat ttgcgcaatt cccgatacct gaagaagcga
2101 gcatcctaga agaggatgga aaatactcca acgttgtctc aaccattcag agtgtagaac
2161 ctaatgctac tgatcacagc aataaggaca ccttttgcac tttgccagtt tggttacaac
2221 agaagtatag agagatcata cgtaatgatc tcccaccaag acctgccgac attaataaca
2281 tccccgtaaa acatgatatt gaaattaaac ctggcgcaag actacctcga ctacagccat
2341 accatgttac agaaaagaac gaacaagaaa tcaacaaaat agttcaaaaa ctgctcgata
2401 acaagttcat tgttccctca aagtcgcctt gcagctcccc tgtagtcctc gtcccgaaga
2461 aagacggtac cttccgactc tgcgtcgatt accgcaccct gaacaaagct accatctccg
2521 acccattccc attacccaga atcgacaacc tattgagccg tattggaaat gcccagatat
2581 ttaccacgct agatttgcat agtggttacc accagatccc gatggaaccc aaagaccgct
2641 acaaaaccgc ctttgtcaca ccatccggta agtatgaata taccgtcatg ccatttggct
2701 tagtcaatgc acctagtaca ttcgcaagat acatggctga tacatttaga gacctgagat
2761 tcgtcaatgt ttaccttgat gatatattaa tattctccga atctccagaa gaacattgga
2821 aacatttaga cacggtacta gaaagattaa agaacgagaa cctcattgtt aagaagaaaa
2881 aatgtaaatt tgcatctgaa gaaactgagt ttttaggcta tagtattgga atccagaaaa
2941 tagctccact acagcacaaa tgtgcagcaa tccgagactt tccgacgcct aaaacagtaa
3001 aacaagcaca gagattttta ggaatgatta attactacag acgattcatt ccaaattgct
3061 ccaagattgc acagccaatc caactgttta tttgtgacaa aagtcaatgg acagaaaaac
3121 aagacaaggc aattgataaa ctaaaagacg ccttgtgtaa ctcccccgtc ctagtaccat
3181 tcaacaacaa agcaaactac cgacttacaa cagacgcctc aaaagacggc attggtgctg
3241 ttctagaaga agtcgacaac aagaacaaac ttgttggtgt cgtcggttac ttctctaaat
3301 ccttagagag tgcccagaaa aactatcctg ctggcgaatt agaactactt ggaattatca
3361 aagcactcca ccacttccga tatatgcttc acggaaagca tttcacgtta agaacagacc
3421 acattagttt gttatcatta caaaacaaga acgaacccgc acgacgcgtg caacgctggt
3481 tagatgacct agccacatat gacttcacct tagaatacct agctggaccc aagaacgttg
3541 tcgcagatgc catatcccgt gccgtatata ctataacccc cgaaacatcc cgacctatcg
3601 acacagaaag ctggaaatct tactacaaat cagacccatt atgtagtgct gtcttaattc
3661 atatgaaaga attgacacaa cacaacgtca cacctgaaga tatgtcagcc ttccgtagtt
3721 accagaagaa actcgaacta tcagagacct tccgaaagaa ttattcccta gaagacgaaa
3781 tgatctatta ccaagaccga ctagtagtac caataaaaca acagaacgca gttatgagac
3841 tatatcatga ccatacctta tttggaggac attttggtgt aacagtgacc cttgcgaaaa
3901 tcagcccaat ttactattgg ccaaaattac aacattcgat catacaatac atcaggacct
3961 gcgtacaatg tcaactaata aaatcacacc gaccacgctt acatggacta ttacaaccac
4021 tccctatagc agaaggaaga tggcttgata tatcaatgga ttttgtgaca ggattacccc
4081 cgacatcaaa taacttgaat atgatcctcg tcgtagttga tcgtttttcg aaacgcgctc
4141 acttcatagc tacaaggaaa accttagacg caacacaact aatagatcta ctctttcgat
4201 acattttttc atatcatggt tttcccagga caataaccag tgatagagat gtccgtatga
4261 ccgccgacaa atatcaagaa ctcacgaaaa gactaggaat aaaatcgaca atgtcttccg
4321 cgaaccaccc ccaaacagat ggacaatccg aacgaacgat acagacatta aacaggttac
4381 taagagccta tgcttcaacc aatattcaga attggcatgt atatttacca caaatcgaat
4441 ttgtttacaa ttctacacct actagaacac ttggaaaatc accatttgaa attgatttag
4501 gatatttacc gaatacccct gctattaagt cagatgacga agtcaacgca agaagtttta
4561 ctgccgtaga acttgccaaa cacctcaaag cccttaccat ccaaacgaag gaacagctag
4621 aacacgctca aatcgaaatg gaaactaata acaatcaaag acgtaaaccc ttattgttaa
4681 acataggaga tcacgtatta gtgcatagag atgcatactt caagaaaggt gcttatatga
4741 aagtacaaca aatatacgtc ggaccatttc gagttgtcaa gaaaataaac gataacgcct
4801 acgaactaga tttaaactct cacaagaaaa agcacagagt tattaatgta caattcctga
4861 aaaagtttgt ataccgtcca gacgcgtacc caaagaataa accaatcagc tccactgaaa
4921 gaattaagag agcacacgaa gttactgcac tcataggaat agatactaca cacaaaactt
4981 acttatgtca catgcaagat gtagacccaa cactttcagt agaatactca gaagctgaat
5041 tttgccaaat tcccgaaaga acacgaagat caatattagc caactttaga caactctacg
5101 aaacacaaga caaccctgag agagaggaag atgttgtatc tcaaaatgag atatgtcagt
5161 atgacaatac gtcaccctga acgttcataa aacacatatg aaacaacctt ataacaaaac
5221 gaacaacatg agacaaaacc cgaccttccc tagctgaact acccaaagta taaatgcctg
5281 aacaattagt ttagatccga gattccgcgc ttccaccact tagtatgatt catattttat
5341 ataatatata agataagtaa cattccgtga attaatctga taaactgttt tgacaactgg
5401 ttacttccct aagactgttt atattaggat tgtcaagaca ctccggtatt actcgagccc
5461 gtaatacaac agaaagttcc attttggatg ctctatttat gggaatatga
//GenBank-Updates@genbank.bio.net (07/20/90)
LOCUS SYNM13MP18 7249 bp ds-DNA SYN 20-JUL-1990
DEFINITION Cloning vector M13mp18 DNA.
ACCESSION M11454
KEYWORDS .
SOURCE Synthetic DNA.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 7249)
AUTHORS Yanisch-Perron,C., Vieira,J. and Messing,J.
TITLE Improved M13 phage cloning vectors and host strains: Nucleotide
sequences of the M13mp18 and pUC19 vectors
JOURNAL Gene 33, 103-119 (1985)
STANDARD simple staff_entry
BASE COUNT 1768 a 1538 c 1533 g 2410 t
ORIGIN
1 aatgctacta ctattagtag aattgatgcc accttttcag ctcgcgcccc aaatgaaaat
61 atagctaaac aggttattga ccatttgcga aatgtatcta atggtcaaac taaatctact
121 cgttcgcaga attgggaatc aactgttaca tggaatgaaa cttccagaca ccgtacttta
181 gttgcatatt taaaacatgt tgagctacag caccagattc agcaattaag ctctaagcca
241 tccgcaaaaa tgacctctta tcaaaaggag caattaaagg tactctctaa tcctgacctg
301 ttggagtttg cttccggtct ggttcgcttt gaagctcgaa ttaaaacgcg atatttgaag
361 tctttcgggc ttcctcttaa tctttttgat gcaatccgct ttgcttctga ctataatagt
421 cagggtaaag acctgatttt tgatttatgg tcattctcgt tttctgaact gtttaaagca
481 tttgaggggg attcaatgaa tatttatgac gattccgcag tattggacgc tatccagtct
541 aaacatttta ctattacccc ctctggcaaa acttcttttg caaaagcctc tcgctatttt
601 ggtttttatc gtcgtctggt aaacgagggt tatgatagtg ttgctcttac tatgcctcgt
661 aattcctttt ggcgttatgt atctgcatta gttgaatgtg gtattcctaa atctcaactg
721 atgaatcttt ctacctgtaa taatgttgtt ccgttagttc gttttattaa cgtagatttt
781 tcttcccaac gtcctgactg gtataatgag ccagttctta aaatcgcata aggtaattca
841 caatgattaa agttgaaatt aaaccatctc aagcccaatt tactactcgt tctggtgttc
901 tcgtcagggc aagccttatt cactgaatga gcagctttgt tacgttgatt tgggtaatga
961 atatccggtt cttgtcaaga ttactcttga tgaaggtcag ccagcctatg cgcctggtct
1021 gtacaccgtt catctgtcct ctttcaaagt tggtcagttc ggttccctta tgattgaccg
1081 tctgcgcctc gttccggcta agtaacatgg agcaggtcgc ggatttcgac acaatttatc
1141 aggcgatgat acaaatctcc gttgtacttt gtttcgcgct tggtataatc gctgggggtc
1201 aaagatgagt gttttagtgt attctttcgc ctctttcgtt ttaggttggt gccttcgtag
1261 tggcattacg tattttaccc gtttaatgga aacttcctca tgaaaaagtc tttagtcctc
1321 aaagcctctg tagccgttgc taccctcgtt ccgatgctgt ctttcgctgc tgagggtgac
1381 gatcccgcaa aagcggcctt taactccctg caagcctcag cgaccgaata tatcggttat
1441 gcgtgggcga tggttgttgt cattgtcggc gcaactatcg gtatcaagct gtttaagaaa
1501 ttcacctcga aagcaagctg ataaaccgat acaattaaag gctccttttg gagccttttt
1561 ttttggagat tttcaacgtg aaaaaattat tattcgcaat tcctttagtt gttcctttct
1621 attctcactc cgctgaaact gttgaaagtt gtttagcaaa accccataca gaaaattcat
1681 ttactaacgt ctggaaagac gacaaaactt tagatcgtta cgctaactat gagggttgtc
1741 tgtggaatgc tacaggcgtt gtagtttgta ctggtgacga aactcagtgt tacggtacat
1801 gggttcctat tgggcttgct atccctgaaa atgagggtgg tggctctgag ggtggcggtt
1861 ctgagggtgg cggttctgag ggtggcggta ctaaacctcc tgagtacggt gatacaccta
1921 ttccgggcta tacttatatc aaccctctcg acggcactta tccgcctggt actgagcaaa
1981 accccgctaa tcctaatcct tctcttgagg agtctcagcc tcttaatact ttcatgtttc
2041 agaataatag gttccgaaat aggcaggggg cattaactgt ttatacgggc actgttactc
2101 aaggcactga ccccgttaaa acttattacc agtacactcc tgtatcatca aaagccatgt
2161 atgacgctta ctggaacggt aaattcagag actgcgcttt ccattctggc tttaatgaag
2221 atccattcgt ttgtgaatat caaggccaat cgtctgacct gcctcaacct cctgtcaatg
2281 ctggcggcgg ctctggtggt ggttctggtg gcggctctga gggtggtggc tctgagggtg
2341 gcggttctga gggtggcggc tctgagggag gcggttccgg tggtggctct ggttccggtg
2401 attttgatta tgaaaagatg gcaaacgcta ataagggggc tatgaccgaa aatgccgatg
2461 aaaacgcgct acagtctgac gctaaaggca aacttgattc tgtcgctact gattacggtg
2521 ctgctatcga tggtttcatt ggtgacgttt ccggccttgc taatggtaat ggtgctactg
2581 gtgattttgc tggctctaat tcccaaatgg ctcaagtcgg tgacggtgat aattcacctt
2641 taatgaataa tttccgtcaa tatttacctt ccctccctca atcggttgaa tgtcgccctt
2701 ttgtctttag cgctggtaaa ccatatgaat tttctattga ttgtgacaaa ataaacttat
2761 tccgtggtgt ctttgcgttt cttttatatg ttgccacctt tatgtatgta ttttctacgt
2821 ttgctaacat actgcgtaat aaggagtctt aatcatgcca gttcttttgg gtattccgtt
2881 attattgcgt ttcctcggtt tccttctggt aactttgttc ggctatctgc ttacttttct
2941 taaaaagggc ttcggtaaga tagctattgc tatttcattg tttcttgctc ttattattgg
3001 gcttaactca attcttgtgg gttatctctc tgatattagc gctcaattac cctctgactt
3061 tgttcagggt gttcagttaa ttctcccgtc taatgcgctt ccctgttttt atgttattct
3121 ctctgtaaag gctgctattt tcatttttga cgttaaacaa aaaatcgttt cttatttgga
3181 ttgggataaa taatatggct gtttattttg taactggcaa attaggctct ggaaagacgc
3241 tcgttagcgt tggtaagatt caggataaaa ttgtagctgg gtgcaaaata gcaactaatc
3301 ttgatttaag gcttcaaaac ctcccgcaag tcgggaggtt cgctaaaacg cctcgcgttc
3361 ttagaatacc ggataagcct tctatatctg atttgcttgc tattgggcgc ggtaatgatt
3421 cctacgatga aaataaaaac ggcttgcttg ttctcgatga gtgcggtact tggtttaata
3481 cccgttcttg gaatgataag gaaagacagc cgattattga ttggtttcta catgctcgta
3541 aattaggatg ggatattatt tttcttgttc aggacttatc tattgttgat aaacaggcgc
3601 gttctgcatt agctgaacat gttgtttatt gtcgtcgtct ggacagaatt actttacctt
3661 ttgtcggtac tttatattct cttattactg gctcgaaaat gcctctgcct aaattacatg
3721 ttggcgttgt taaatatggc gattctcaat taagccctac tgttgagcgt tggctttata
3781 ctggtaagaa tttgtataac gcatatgata ctaaacaggc tttttctagt aattatgatt
3841 ccggtgttta ttcttattta acgccttatt tatcacacgg tcggtatttc aaaccattaa
3901 atttaggtca gaagatgaaa ttaactaaaa tatatttgaa aaagttttct cgcgttcttt
3961 gtcttgcgat tggatttgca tcagcattta catatagtta tataacccaa cctaagccgg
4021 aggttaaaaa ggtagtctct cagacctatg attttgataa attcactatt gactcttctc
4081 agcgtcttaa tctaagctat cgctatgttt tcaaggattc taagggaaaa ttaattaata
4141 gcgacgattt acagaagcaa ggttattcac tcacatatat tgatttatgt actgtttcca
4201 ttaaaaaagg taattcaaat gaaattgtta aatgtaatta attttgtttt cttgatgttt
4261 gtttcatcat cttcttttgc tcaggtaatt gaaatgaata attcgcctct gcgcgatttt
4321 gtaacttggt attcaaagca atcaggcgaa tccgttattg tttctcccga tgtaaaaggt
4381 actgttactg tatattcatc tgacgttaaa cctgaaaatc tacgcaattt ctttatttct
4441 gttttacgtg ctaataattt tgatatggtt ggttcaattc cttccataat tcagaagtat
4501 aatccaaaca atcaggatta tattgatgaa ttgccatcat ctgataatca ggaatatgat
4561 gataattccg ctccttctgg tggtttcttt gttccgcaaa atgataatgt tactcaaact
4621 tttaaaatta ataacgttcg ggcaaaggat ttaatacgag ttgtcgaatt gtttgtaaag
4681 tctaatactt ctaaatcctc aaatgtatta tctattgacg gctctaatct attagttgtt
4741 agtgcaccta aagatatttt agataacctt cctcaattcc tttctactgt tgatttgcca
4801 actgaccaga tattgattga gggtttgata tttgaggttc agcaaggtga tgctttagat
4861 ttttcatttg ctgctggctc tcagcgtggc actgttgcag gcggtgttaa tactgaccgc
4921 ctcacctctg ttttatcttc tgctggtggt tcgttcggta tttttaatgg cgatgtttta
4981 gggctatcag ttcgcgcatt aaagactaat agccattcaa aaatattgtc tgtgccacgt
5041 attcttacgc tttcaggtca gaagggttct atctctgttg gccagaatgt cccttttatt
5101 actggtcgtg tgactggtga atctgccaat gtaaataatc catttcagac gattgagcgt
5161 caaaatgtag gtatttccat gagcgttttt cctgttgcaa tggctggcgg taatattgtt
5221 ctggatatta ccagcaaggc cgatagtttg agttcttcta ctcaggcaag tgatgttatt
5281 actaatcaaa gaagtattgc tacaacggtt aatttgcgtg atggacagac tcttttactc
5341 ggtggcctca ctgattataa aaacacttct caagattctg gcgtaccgtt cctgtctaaa
5401 atccctttaa tcggcctcct gtttagctcc cgctctgatt ccaacgagga aagcacgtta
5461 tacgtgctcg tcaaagcaac catagtacgc gccctgtagc ggcgcattaa gcgcggcggg
5521 tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt
5581 cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg
5641 ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga
5701 tttgggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac
5761 gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc
5821 tatctcgggc tattcttttg atttataagg gattttgccg atttcggaac caccatcaaa
5881 caggattttc gcctgctggg gcaaaccagc gtggaccgct tgctgcaact ctctcagggc
5941 caggcggtga agggcaatca gctgttgccc gtctcgctgg tgaaaagaaa aaccaccctg
6001 gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca
6061 cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct
6121 cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat
6181 tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcgagct
6241 cggtacccgg ggatcctcta gagtcgacct gcaggcatgc aagcttggca ctggccgtcg
6301 ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac
6361 atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac
6421 agttgcgcag cctgaatggc gaatggcgct ttgcctggtt tccggcacca gaagcggtgc
6481 cggaaagctg gctggagtgc gatcttcctg aggccgatac ggtcgtcgtc ccctcaaact
6541 ggcagatgca cggttacgat gcgcccatct acaccaacgt aacctatccc attacggtca
6601 atccgccgtt tgttcccacg gagaatccga cgggttgtta ctcgctcaca tttaatgttg
6661 atgaaagctg gctacaggaa ggccagacgc gaattatttt tgatggcgtt cctattggtt
6721 aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat taacgtttac
6781 aatttaaata tttgcttata caatcttcct gtttttgggg cttttctgat tatcaaccgg
6841 ggtacatatg attgacatgc tagttttacg attaccgttc atcgattctc ttgtttgctc
6901 cagactctca ggcaatgacc tgatagcctt tgtagatctc tcaaaaatag ctaccctctc
6961 cggcattaat ttatcagcta gaacggttga atatcatatt gatggtgatt tgactgtctc
7021 cggcctttct cacccttttg aatctttacc tacacattac tcaggcattg catttaaaat
7081 atatgagggt tctaaaaatt tttatccttg cgttgaaata aaggcttctc ccgcaaaagt
7141 attacagggt cataatgttt ttggtacaac cgatttagct ttatgctctg aggctttatt
7201 gcttaatttt gctaattctt tgccttgcct gtatgattta ttggatgtt
//
LOCUS RATCDOL 1458 bp ss-mRNA ROD 20-JUL-1990
DEFINITION Rat cysteine dioxygenase mRNA, complete cds.
ACCESSION M35266
KEYWORDS cysteine dioxygenase.
SOURCE Rat liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1458)
AUTHORS Hosokawa,Y., Matsumoto,A., Oka,J., Itakura,H. and Yamaguchi,K.
TITLE Isolation and characterization of a cDNA for rat liver cysteine
dioxygenase
JOURNAL Biochem. Biophys. Res. Commun. 168, 473-478 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 190 792 cysteine dioxygenase (EC 1.13.11.20)
BASE COUNT 396 a 325 c 343 g 394 t
ORIGIN
1 gagggtttag gctggtctcc ggtgacctcc tagtcctaaa tcttgatacc cttgcaagag
61 ctttgagcgt gtggggtccc gggcgttcgg ggtcccgggt gtgtgcggtt tgtatagcct
121 gaagccgggg tcctccgcgc tcgcgtcctc cgcagctgga ctgaagagac gcgtcccagc
181 cctgcgggga tggaacggac cgagctgctg aagccccgga ccctggccga cctcatccga
241 atcttgcatg agctcttcgc cggggacgaa gtcaatgtgg aggaggtgca ggctgtgctg
301 gaagcctacg agagcaatcc tgccgagtgg gctttgtatg ccaaattcga tcaatacagg
361 tatacccgaa accttgtgga tcaaggaaat gggaagttta atctgatgat tctgtgctgg
421 ggtgaagggc atggcagcag tattcacgat cacacggact cccactgctt tttgaagctg
481 ctgcaaggaa atctaaagga gacattgttt gactggcctg acaagaaatc caacgagatg
541 atcaagaagt ctgaaagaac tttgagggaa aatcagtgtg cctacattaa tgattctatt
601 ggcttacatc gagtagagaa cgtcagccac acagagcctg ctgtgagcct tcacttgtac
661 agtccacctt tcgatacatg ccatgccttt gaccaacgaa cagggcataa aaacaaagtc
721 accatgacat tccacagcaa atttggaatc agaactccat ttacaacttc aggttcactg
781 gagaacaact aagacctgcc aagcctttca aagttttgct tctgggtcgt tggaatgttt
841 taccttggat aagagaggcc acccatcatt tgctgtccag ttatacattt taataagtcc
901 atgctcagtg tgtatactaa ggaagcaaac catcccctga gctatgcagg agaaaaatcc
961 cactaaagaa aaagtcactt gatttttaat agccaaatca ccttgctccc agttcttctg
1021 tcttctaact ccatggaaat tctattggga gttctcagtg gggttttttt tcaaccttag
1081 gaaagcactt ctggtctctg aactctaata atcaataagt aaaaatgaag aaaccacaag
1141 ctatcacatg tctgttttca tacctggaag tctaagtgtg gaaatcttta atttactttg
1201 tatgttctta atgtttgaca agaatttttt taaatcttgg ttttcagttt tttcaaccct
1261 gtttgacaaa ttcctatgct gtggagacta gggatgcaga tagcagtttg gtgtttggta
1321 gtgaacagca gtggggccag aaatgtgcat gtatccagac ctcctgcaaa taaaaactga
1381 aactcatgtg taatgtgtgc caccacctta agctgccacc aaaattgcca aacgacttta
1441 ataaaactgg atttgaga
//
LOCUS HUMMYOHP 6024 bp ss-mRNA PRI 20-JUL-1990
DEFINITION Human perinatal myosin heavy chain mRNA, complete cds.
ACCESSION M36769 M35250 Y00821
KEYWORDS myosin; myosin heavy chain.
SOURCE human fetal muscle, cDNA to mRNA, clone pSMHCP.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 6024)
AUTHORS Leinwand,L.A.
TITLE ;
JOURNAL Unpublished (1989) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 2651 to 6024)
AUTHORS Feghali,R. and Leinwand,L.A.
TITLE Molecular genetic characterization of a developmentally regulated
human perinatal myosin heavy chain
JOURNAL J. Cell Biol. 108, 1791-1797 (1989)
STANDARD simple automatic
REFERENCE 3 (bases 1 to 2650)
AUTHORS Karsch-Mizrachi,I., Feghali,R., Shows,T.B. and Leinwand,L.A.
TITLE Generation of a full-length human perinatal myosin heavy chain cDNA
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Author address: L.A.Leinwand Albert Einstein College of Medicine
1300 Morris Park Avenue Bronx, N.Y. 10461, USA.
Submitted (12-APR-1989) on tape to the EMBL data library. Draft
entry and computer-readable sequence for [3] kindly submitted by
I.Karsch-Mizrachi, 18-JUN-1990.
FEATURES from to/span description
pept 74 5887 myosin heavy chain
BASE COUNT 1961 a 1300 c 1586 g 1177 t
ORIGIN
1 gtggaacact tctgaacctg catttttatc tggaactcca gaagcagaat cctttgctaa
61 ataaatcgca gccatgagtg cgagctcaga cgctgagatg gctgtttttg gcgaagctgc
121 tccctacctt cgaaaatcag aaaaggagcg gattgaggcc caaaacaagc cgtttgatgc
181 taaaacatct gtctttgtgg cggagcccaa ggaatcctat gtgaagagca ctatacaaag
241 caaagaagga gggaaagtaa ccgtaaagac tgaaggtgga gcaactctaa ctgtcaggga
301 agaccaagtc ttccctatga accctccgaa atatgacaaa attgaggaca tggccatgat
361 gactcatcta cacgagcctg gagtgctgta caacctcaaa gagcgctatg cagcctggat
421 gatctacacc tactcaggcc tcttctgtgt caccgtcaac ccctacaagt ggctgccggt
481 gtacaagccc gaggtggtgg ctgcctacag aggcaaaaag cgccaggagg ccccgcccca
541 catcttctcc atctctgaca atgcctatca gttcatgttg actgatcgag agaatcagtc
601 catcctgatc accggagaat ctggtgccgg aaagactgtg aacaccaagc gtgtcatcca
661 atactttgca acaattgcag ttactggaga gaagaagaag gatgaatctg gcaaaatgca
721 ggggactctg gaagatcaaa tcatcagcgc caatccccta ctggaggcct ttggcaatgc
781 caaaaccgtg aggaatgaca actcctctcg ctttggtaaa ttcattagaa tccactttgg
841 tactacaggg aagctggcat ctgctgatat agaaacatat cttttagaaa agtccagagt
901 tactttccag ctaaaggcgg aaagaagcta ccatattttt tatcagatca cttccaataa
961 gaagccagat ctaattgaaa tgctcctgat caccaccaac ccatatgact atgccttcgt
1021 cagtcagggg gagatcacag ttcccagtat tgatgaccaa gaagagttga tggccactga
1081 tagtgccatt gacatcctgg gcttcactcc tgaagagaaa gtgtccatct ataaactcac
1141 aggggctgtg atgcattatg ggaacatgaa attcaagcaa aagcagcgtg aggagcaagc
1201 tgagccagat ggcacagaag tcgctgacaa ggcagcctat ctccagagtc tgaactctgc
1261 agacctactc aaagccctct gctaccctag ggtcaaggtt ggcaatgagt atgtcaccaa
1321 aggccagact gtgcagcagg tgtacaatgc cgtgggtgct ctggccaaag ccgtctacga
1381 gaagatgttc ctgtggatgg tcacccgcat caaccagcag ctggacacca agcagcccag
1441 gcagtacttc atcggggtct tggacattgc tggctttgaa atctttgatt ttaacagcct
1501 ggagcagctg tgcatcaact tcaccaacga gaaactgcaa cagtttttca accaccacat
1561 gtttgtgcta gagcaggagg agtacaagaa ggaaggcatc gagtggacgt tcattgactt
1621 tgggatggac ctggctgcct gcattgagct cattgagaag ccactgggca tcttctccat
1681 cctggaagag gagtgcatgt tccctaaggc cacggacacc tccttcaaga acaagctgta
1741 tgaccagcac ctgggcaagt ctgccaactt ccagaagccc aaggtggtca aaggcaaggc
1801 tgaggcccac ttctctctga ttcactatgc tggcactgtg gactacaaca ttactggctg
1861 gctggacaaa aataaggacc ccctgaatga tactgtggtt gggctgtacc agaagtctgc
1921 aatgaagact ctagccagtc tcttttccac gtatgctagt gctgaagcag atagcagcgc
1981 gaagaaaggt gctaagaaaa agggctcttc tttccagact gtgtctgccc ttttcaggga
2041 aaatttaaat aaattgatga cgaatctgag gagcacacac cctcacttcg tacggtgtat
2101 cattcccaat gaaaccaaaa ctcctggggc aatggaacat gaacttgtgt tgcaccagct
2161 gaggtgtaat ggtgtgctgg aaggcatccg catctgtagg aaaggattcc caagcagaat
2221 cttatatggt gatttcaaac aaagatacaa ggttttaaat gcaagtgcta ttccagaggg
2281 acagttcatt gacagcaaga aggcttctga gaaacttctt gcatctattg atattgatca
2341 tactcaatat aaatttggac ataccaaggt tttcttcaaa gctggacttc tgggtcttct
2401 ggaagaaatg agagatgaaa aattagccca aattataaca agaacacaag ctgtctgtag
2461 gggattccta atgagggtag aatatcagaa gatgttgcaa aggagagaag cacttttctg
2521 catccagtat aatgtccgtg ccttcatgaa cgtcaagcac tggccctgga tgaaactctt
2581 tttcaagatt aagcccctcc tcaagagtgc agagaccgag aaagagatgg ccaccatgaa
2641 ggaagaattc cagaaaacca aagatgaact cgccaagtca gaggcaaaac ggaaggagct
2701 agaggaaaaa atggtcactc tcttaaaaga gaaaaatgac ctgcaactcc aggttcaatc
2761 tgaagcagat agcttggctg atgcagagga aaggtgtgag caactgatta aaaacaaaat
2821 ccaacttgag gccaaaatca aagaggtgac tgaaagagct gaggaggagg aagagatcaa
2881 tgctgagctg acagccaaga agagaaaact ggaggatgaa tgttcagaac tcaagaaaga
2941 cattgatgac cttgagctga cactggccaa ggttgagaag cagaaacatg ccacggagaa
3001 caaggtgaaa aatcttacag aagagatggc aggcctggat gaaaccattg caaaactgtc
3061 caaggagaag aaggctctcc aagagaccca ccagcagacc ctggatgacc tgcaggcaga
3121 ggaggacaaa gtcaacatcc tgaccaaagc taaaaccaag ctagaacagc aagtggatga
3181 tcttgaaggg tctctggaac aagaaaagaa gcttcgaatg gatctagaaa gagcaaagcg
3241 gaaactggag ggtgacctca aattggccca agaatccaca atggatatgg aaaatgacaa
3301 acagcaactt gatgaaaagc ttgaaaagaa agaatttgaa atcagcaatt tgataagcaa
3361 aattgaagat gagcaagctg tagaaattca actacagaag aagatcaaag agttgcaggc
3421 ccgcattgag gagctggggg aagaaatcga ggcagagagg gcgtcccgag ccaaagcgga
3481 gaagcagcgc tctgacctct cccgggaact ggaggagatc agcgagaggc tggaagaagc
3541 cggtggggca acttctgctc aggtggaatt gaacaagaag cgggaggctg agtttcagaa
3601 actgcgcagg gacctggagg aggccaccct gcagcatgaa gctatggtgg ctgctcttcg
3661 gaagaagcac gcagacagta tggctgagct tggggagcag attgacaact tgcagcgggt
3721 caaacagaag ctggagaagg agaagagtga gctgaagatg gagactgatg acctcagcag
3781 taacgcagag gccatttcca aagccaaggg ccaccttgaa aagatgtgcc gctctctaga
3841 agatcaagtg agtgggctta agaccaagga agaggagcag cagcggctga tcaatgacct
3901 cacagcacag agagcgcgcc tgcagacaga agcgggtgaa tattctcgac aattagatga
3961 gcaagatgct ttagtctctc agctttcaag gagcaagcaa gcatctactc agcagattga
4021 agagctgaaa catcaactag aggaagaaac taaagccaag aacgccctgg cacacgccct
4081 gcagtcctcc cgccatgact gcgacctgct gcgggaacag tatgaggaag agcaggaagg
4141 caaagctgag ctgcagaggg cgctgtccaa ggccaacagt gaggttgccc agtggagaac
4201 caaatacgag acggatgcca tccagcgcac agaggagctg gaggaggcca agaaaaagtt
4261 ggcccagcgc ctgcaagaag ctgaggaaca tgtagaagct gtgaacgcca aatgtgcttc
4321 ccttgagaag acgaagcagc ggctccagaa tgaagttgaa gacctcatgc ttgatgtgga
4381 aaggtctaat gcagcctgtg cagcccttga taagaagcaa aggaactttg acaaggtcct
4441 atcagaatgg aagcagaagt atgaggaaac tcaggctgaa cttgaggcct cccagaagga
4501 gtcacgttct cttagcactg agctgttcaa ggtgaagaat gtctatgagg aatccctgga
4561 tcaactcgaa acgctaagaa gagcacataa gaacttgcaa caggagattt ctgacctcac
4621 tgagcagatt gcagagggag gaaagcaaat tcatgaattg gagaaaataa agaagcaagt
4681 agaacaagag aaatgtgaaa ttcaggctgc tttagaggaa gcagaggcat ctcttgaaca
4741 tgaagaagga aagattctgc gtatccagct tgagttaaac caagtcaagt ctgaagttga
4801 tagaaaaatc gcagaaaagg atgaggaaat tgaccagctg aagagaaacc acactagagt
4861 cgtggagaca atgcagagca cgctggatgc agagattaga agcagaaatg atgctctgag
4921 agtcaagaag aaaatggaag gagatctgaa tgaaatggaa atccagctga accatgccaa
4981 tcgcttagct gcagagagtt taaggaacta caggaacacc caaggaatcc tgaaggaaac
5041 ccagctccac ctggatgatg ctctccgggg ccaggaggac ctcaaggaac agctggcaat
5101 tgtggagcgc agagccaacc tgctgcaggc tgagatcgag gagctgtggg ccactctgga
5161 acagacagag agaagcagga aaatcgccga acaggagctc ctggatgcca gtgagcgtgt
5221 ccagctcctc cacacccaga ataccagtct cattaacacc aagaagaaat tagaaaatga
5281 cgtttcccaa ctccaaagtg aagtggaaga agtaatccaa gaatcacgca atgcagaaga
5341 gaaagccaag aaggccatca ctgatgctgc catgatggct gaggagctga agaaggaaca
5401 ggacaccagc gcccacctgg agcggatgaa gaagaacctg gagcagacgg tgaaggacct
5461 gcagcatcgt ctagatgagg ccgagcagct ggcgctgaag ggtgggaaga agcagatcca
5521 gaaactggag gccagggtac gtgagcttga aggagaggtt gaaaatgaac agaaacgtaa
5581 tgcagaggct gttaaaggtt tacggaaaca tgaccgacga gtaaaagaac tcacctacca
5641 gactgaagaa gatcgcaaga atgttctcag gctgcaggac ttggtagata aattacaggc
5701 gaaggtgaaa tcatacaaga gacaagctga ggaggctgag gaacaatcca atgctaatct
5761 atctaaattc cgcaaactcc agcatgagct ggaggaggcc gaggaacggg ctgacattgc
5821 tgagtcccag gtcaacaaat tgcgagtgaa gagccgagag gttcacacaa aaatcagtgc
5881 agagtaaaca cacctgcctg atgctatcaa gaggctgaag aaaggcacaa aatgtgctat
5941 ttttggtcac ttgctttatg acgtttattt tcctgttaaa gctgaataaa taaaaactac
6001 agtaaatgta tacattaaaa aaaa
//
LOCUS PT4MRH 1266 bp ds-DNA PHG 20-JUL-1990
DEFINITION Bacteriophage T4 rpoH modulating protein (mrh) gene, complete cds.
ACCESSION M30001
KEYWORDS Mrh protein; rpoH modulating protein.
SOURCE Bacteriophage T4 (strain GT7) DNA, clone pMF28,h37.
ORGANISM Bacteriophage T4
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 1266)
AUTHORS Frazier,M.W. and Mosig,G.
TITLE The bacteriophage T4 gene mrh whose product inhibits late T4 gene
expression in an E.coli rpoH (sigma-32) mutant
JOURNAL Gene 88, 7-14 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.W.Frazier, 20-NOV-1989.
FEATURES from to/span description
pept 782 1132 rpoH modulating protein
ORF 96 302 mrh ORF soc-1
site 39 68 early promoter 14.95
BASE COUNT 444 a 167 c 234 g 421 t
ORIGIN
1 ctcaaggact ccttcgggag tcctttttca tttaaatggt ttactttcca aaatgagtat
61 ggtataatag aaattatctt atagaggaga gtactatgtt aaatcgttgg attaaaccaa
121 atgaagattt ggatattatc atttcacgac atgtaatgaa gaaatatgaa ctacaaccat
181 ggtctacaga agttgttgtg cattcattta tgatgtacgc agatggttct gtcgaattta
241 atgtagagat tcgatatgat tatggtgaga agcaagtcga attcaaaaga ggctttttgt
301 aatgtttatc tttaattggt ttaaaagttt ctttacggat tttttctcta caactctggg
361 gaaggtgtag ttcctatttc aaatgactac cttcctttaa ctgtagttga atatgtttat
421 atgggagatg gaacagtaga agcagttact atgacttatg aagaagccca agaatattat
481 aaaaatcctt ggcgctggtc aacacctact acatcatcta acacacagaa tacacagtct
541 agttctgatt catatgatac taatgttcct gttcatgtat gggcaggtga ttcatgtgga
601 agttcttgtg attctagttg ttcatctaca tcttgtgatt gaggaaaatt atggaagcaa
661 ttttgtttga aatgtctatc aatagtatta gtatggcgtt tgtcaaggat gttccaatta
721 ctgtagcagt aatgattgat aaaagacaat aatatgtatc tagtagaaga ttttatttca
781 aatgccaatt ccagaagatg ttgaaataaa acttaaaaag atcggaagga ttgaaactgt
841 acaaaattct ccatttatgg caattcaagc atttactaaa tctaactata ttgatgttcg
901 tgaagcatat tataataata aaccgttatc cttctattca tatgattcaa tatatgattg
961 gaaaatagat aaaggaaata aatttataat tacggacgaa agtgcgctat catattttat
1021 tacctcttta tggaataatt taaatccaaa tttgctaaaa attcataaat ttgacgatgc
1081 tcctactatt gttttaggta aaacgaatga aagttctgaa gaagatgctt gaatggttca
1141 atagaccaaa ctcaatgtat attgatgacg gttgggttga acaagcaaat aaagaaatca
1201 gaatgaatca gaagaagaat ggatgaaatc aatgattatg ctgagaaaga aaagaaacta
1261 gaacgc
//
LOCUS BOVPER 1295 bp ss-mRNA MAM 20-JUL-1990
DEFINITION Bovine peripherin mRNA, complete cds.
ACCESSION J02884
KEYWORDS peripherin; photoreceptor protein.
SOURCE Bovine retina, cDNA to mRNA, clones lambda.[5,4,8,17].
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1295)
AUTHORS Connell,G.J. and Molday,R.S.
TITLE Molecular cloning, primary structure and orientation of the
vertebrate photoreceptor protein peripherin in the rod outer
segment disk membrane
JOURNAL Biochemistry 29, 4691-4698 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Connell, 20-OCT-1989.
FEATURES from to/span description
pept < 1 1023 peripherin
BASE COUNT 272 a 368 c 379 g 276 t
ORIGIN
1 aaatttgacc agaagaagcg ggtcaagttg gcccaagggc tctggctcat gaactggttc
61 tccgtgttgg ctggtatcat catcttcggc ttagggctgt tcctgaagat tgaactccgg
121 aagagaagcg atgtgatgaa caattctgag agccattttg tgcccaattc cttgatcggg
181 gtgggggtgc tgtcctgtgt cttcaattct ctggctggca agatctgtta cgacgccctg
241 gaccctgcca agtacgccaa gtggaagccc tggctgaagc cgtacctggc cgtgtgtgtc
301 ctcttcaacg tggtcctctt cctggtggcc ctctgctgct tcctcctgcg gggctcgctg
361 gagagtacgc tggcccacgg actcaagaac ggcatgaaat tctatcggga cacggacacc
421 ccaggccggt gtttcatgaa gaagaccatc gacatgctgc agatcgagtt caagtgctgc
481 ggcaacaacg gctttcggga ctggtttgag attcagtgga tcagcaaccg ctatctggat
541 ttttcctcca aagaagtcaa agatcgcatc aagagcaatg tggacgggcg gtacctggtg
601 gacggtgtcc ccttcagctg ctgcaacccc aactcaccgc ggccctgcat ccagtaccag
661 ctcaccaaca actctgcgca ctacagctac gatcaccaga cggaggagct caacctgtgg
721 ctgcgtggct gcagggccgc cctgctgagc tattacagca acctcatgaa tactacaggc
781 gctgtgacgc tcctcgtttg gctctttgag gtgaccatca ctgttgggct acgctacctg
841 cacacggcgc tggaaggcat ggccaacccc gaagaccctg agtgcgagag tgagggctgg
901 cttctggaga agagcgtgcc ggagacctgg aaggcctttc tggagagtgt gaagaagctg
961 ggcaagggca accaggtgga agccgagggc gaggacgcag gccaggcccc ggcggcaggc
1021 tgacggccct gcggccccct cccctctgca cactgaaaag tagtggactc caggaacttc
1081 ggataccccc cggatcccat cagaatctcc caaagagggc ggccgtctca cagaggctct
1141 tctggatgtg ggatttaata ttcagggccc ctaaaaactt ctgtggaatg aatgtctcac
1201 tgtgagtgca tgagtcctgc tcagccccac cgccctggat gcaccttcac ggtgggctca
1261 tggagggcct ctcagcaggt gactctgtgg tcaca
//
LOCUS HAMRPS14 5300 bp ds-DNA ROD 20-JUL-1990
DEFINITION C.griseus ribosomal protein S14 (RPS14) gene, complete cds.
ACCESSION M35008
KEYWORDS ribosomal protein S14.
SOURCE C.griseus (haplotype EmtB wild type) ovary transformed fibroblast
cell line CHO DNA.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 5300)
AUTHORS Diaz,J.-J., Rhoads,D. and Roufa,D.J.
TITLE Genetic analysis of a vital mammalian housekeeping locus using CHO
cells that express a transfected mutant allele
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.J.Roufa, 08-JUN-1990.
Author address: D.J.Roufa
Kansas State University
Division of Biology
Ackert Hall
Manhattan, Kansas 66506
FEATURES from to/span description
pept 1977 2125 ribosomal protein S14, exon 2 (first expressed
exon)
2576 2737 ribosomal protein S14, exon 3
4074 4150 ribosomal protein S14, exon 4
4724 4791 ribosomal protein S14, exon 5
pre-msg 179 4841 ribosomal protein S14 mRNA and introns
IVS 220 1974 ribosomal protein S14 intron A
IVS 2126 2575 ribosomal protein S14 intron B
IVS 2738 4073 ribosomal protein S14 intron C
IVS 4151 4723 ribosomal protein S14 intron D
signal 4819 4824 poly-A signal
BASE COUNT 1315 a 1121 c 1415 g 1449 t
ORIGIN
1 agatctggtc acagcgacac gcagcacccg gagtccttaa taagccatct tggtcgccgg
61 caggcttagc ctccttggaa accgcaatag aggcttcttc cgggagccgg gattccctcc
121 gtgacctcag aggtacactt ccggaatacg gaagtgaccc gtctgcccgc cccttccttc
181 tctctctttc cggtggagaa gtctggagac gacgttcagg tagggaggcc cggcggccca
241 caatcgtgcg gcatcctctg gccaggggcg tctgcgggct tggggtcgcg gtttgtgcgc
301 ggtaaccgag ccaggtgcgt gatggtgacg ggcggctgtc cgctcgccgc gatccacccg
361 gcctggggct gcgccgtgtc cgggaggcgg ccttctggcc tgagatcgga aggtctgagg
421 tggaagtgcg gggcctgcct gggccttgcc gtcacgccac agtcgtcgcc cgagcgttcg
481 gggagcgtgt ccctagtcgc gggtgggccg tcaccggcac cccattagat tcttaggggc
541 cgtcgacgac ctgcacctaa cctcatgccc ctcatgtgga aaatacgagc agcgttcgta
601 tctgcccctt ctgtaaaccg aagcgtaaaa tgggatgtca gaaaacctgg tcacgcagga
661 ttagagtgct tctaacccat ggtggttacc cctccgagtc ggacacattg cctgttgggg
721 tgccccctag tagagtagga ctgatgccat atgtttacat ttctctgaat agccctctcg
781 tctccactga ggacggcaaa gctacactga acagatgtca tgtcttatgg aatcgtgttt
841 gttagcctaa tttgagggta tgagcaggtg actacatcct ggacctgccc ttgtgctttt
901 ggcaaataag tcccaaatag tgtgtgaggg aggcactaca gagatcccag gtgtggttga
961 aacttttaag gcctaggagg aactttgtgg agtaggagca tcttggttct aggaggagta
1021 taagacaggc tttggtgcag tattgtcctg gacagactta gcctcgggtt ggtcaaaggc
1081 atgcctgcag ctttggcaaa aaaaaaaaaa gcctctttgc ttgtgtattt tccctgggtc
1141 ctgtgactgt acttacatta ttgactaggc aggaaggggt tacatgtcag gtgtatttgc
1201 agttcataac aatagcaaaa ttagagttat gaagtagtag caatgacttt ttgtgtgggg
1261 ggcaccacaa tatgagaatt gtactaaagg gtcacagcgt taggttgaga accaaagggt
1321 agactcactt gatgctgaac actgaatttt aggtatggat atctttacca ttaaacttcc
1381 tggaaggcat accctgacca tcctgggact tgctacgtag accataacta cgatatcctc
1441 ctgcctgtga ctcccaggtg ctggggctaa agttatacac aagcacacat tgctgtggca
1501 gtgcttttta atgtttatta ctagtaaagg gtgaaagatt tggttctaaa cagtgagagg
1561 ctgtcagatt aagaattgtg tgtggacagg gagtccgtta atctgcagtt ggcttcagtt
1621 gtgtcatgtt aatcaaataa acttcccagc ttaaggagag ggagagggga gttgtgaatg
1681 gggctttgcc ccaacacctg agttcagcat cctctatttt ggataactac ttgtgaaagt
1741 ttgaggtgca tattgttcat tgctggaatc ttaggctcat cctgaaaatg cttggagctt
1801 ttgtttagaa attgtctcct ctcttggcgc ctccctttta gttacaactc agtaattccc
1861 caagtacaaa tatgttggat attagagagc caaaggagct tgggttttcc tttagtaggc
1921 taaagtattt ggggatgttt tgtctgttaa catggaggtt ctccttccac tcagaaatgg
1981 cacctcgcaa gggaaaggaa aagaaggaag aacaggtcat cagcctcgga cctcaagtgg
2041 ctgaaggaga gaatgtattt ggtgtctgcc acatttttgc atctttcaac gacacctttg
2101 tccatgttac tgatctttct ggcaagtgag tacttgggga catgaagaca taaaaccaag
2161 cctcacaggg actgaggtgg gagaaaaagg cataggagtt tatgggattc cctctaaaga
2221 attactgatt ttcttatttt cccataaatt ataagagaaa acagttgaat gtagctgagg
2281 attcatccac agaaaatcac cgcaaatagg aagtggaagc agaggatcag aaactcaatt
2341 atcctcagct acacgcacat gtaaaatata tgcattatgg aagtgattat ggggataaaa
2401 tgcaaaacta attgttgtat taacagggag tttgtctaga tagggtcctt ggacccctgt
2461 aattatacac aactcctgtc ttgttcagaa ccaggatgtg tgcttgttgg ggttcagcaa
2521 gtgtttcctg tgtccttgtt ggaaaatgaa ggcatggatc tgcttttgtc tctagggaaa
2581 ctatctgccg ggtgactggt ggaatgaagg taaaggctga cagagatgag tcttctcctt
2641 atgcagccat gttggctgcc caggatgttg cccagaggtg caaggagctg ggcatcactg
2701 ccctgcatat caaactccga gctacgggag gaaacaggta tggaagtcta gggctggaca
2761 tttgtaagtt cactgggctt gggggtggat ctgagatttg agtctctgct gtgtctgaga
2821 gcaagccgtt ttaggaatgt atgtatgttc ttcgtcaggc atggtagctg atacctgtaa
2881 tcccagcact ccggaaccaa tataggagga ggatctaaag tttggtgcta acctgggcta
2941 cacagtgagc ctcaaaaata agatagtttg ttcctttatg tttgtccctg tagagtgctg
3001 cttttatttt ttttttttta aagcatacat gagactgact cttccccagc accaggtaga
3061 ctgggtgtgg tagtatacac ctgttgtctc agaaagtgga ggtgaggaga gctagagttt
3121 caagttggtt attggttgca tattgagtag gtggccagct tgagctgaac agtgatgctt
3181 aaatgatgaa ctgggggtgc atagtccaga tagagacaaa acttcaagag aagcatcttg
3241 gggtgagaaa gaagttgggc tttacaggga tgattggtgt aagatatgat ctaagaagca
3301 gggatggagc ctctggggtc ttaagtttta caaaacagag caagggcttg ggcatgaaaa
3361 ctagaatagg tggctactga gtggaaagat actgcatagg aagccctaag gaagttacat
3421 actggatgta ggagctggga agtaggcatc acagcaaggg gaggacagcc acaggccagg
3481 tgactatccc ctcccccccc cacacacaca catgtacact tttaaaggat ttgtaggcat
3541 ggtggctcac acctttaatc ccagcatctc ttggggggcg tccccccaca aaagtgggaa
3601 aaattacatg cgtatgttgc ctgtgtgcat gcatgcgtgc attatcggtt ccctgtggag
3661 gacagaagag gaccatgaat ctcttggaac tggtgttgct gagccacctc tccaggcctc
3721 agttagcagt tttgaaaggg tttggttttg ttttgttttt tcgagacagg gtttcttctg
3781 tgtagctttg gagcctatcc tggcactggc tttggagacc aggctggcct caaactcaca
3841 gagatccgac tgcctcccga gtgctgggat taaaggtgtg caccaacgca ggatttggtt
3901 ttgttgtgag attgtataat atatgaaaac cgtagcagcc tacaaagtct agaattttgt
3961 gtaatcccct cacagtgtgc agatctttaa ctatagcccc accaccattc agaagtgaat
4021 taaaaccatg tgaacattcc cactgacttg accaacaatc cttcattttt taggaccaag
4081 acccctggac ctggagccca gtcagccctc agagctcttg ctcgttcagg gatgaagatt
4141 gggcggattg gtaagtgacc cttctagcta acattgggtt tattttgaag cattgacccc
4201 ttccaaacta tgtgatgtgc atattggttc aagcttgtct ggtgactggt gcagttaact
4261 tttgttaacc tgtaaaagac attgagagcg ttgatttctg taaatgcccc ttaactgtta
4321 tattttccca tttgcaattg ggaagactta aaaaacactt gtgaatctgt cctagcagcc
4381 tttttccctg ctagctaaat gttaagggta tgcccttggg acagtccaag tgaataattg
4441 cacacacctg gcctaaagag caaacagcca caggtgctca gtgtaggggc tctgagttag
4501 aaacttgaag gagtccttcc cctctgatgt ttgtgtgcct tctcagagcc tgtgttgtcc
4561 tcgtccaatg tgggttgaga tgaggaaatg acaggtgagg tgggcccttg tttattgacc
4621 actggtgata catttgggga agaataaagt ctgctagtta gtctaatggc tagggaggct
4681 gtgtttatgt agctatgggc aatgacattt tttcttcttg cagaggatgt cacccccatc
4741 ccctctgaca gcacccgaag gaagggtggt cgtcgtggtc gccgtctgtg aacaggactt
4801 ctcaaattat tttctgttaa taaattgctt tgtataagct attttggttc tgatgtttgt
4861 ttgttttgag gcaggatctc tctccatgtc tatgtagaag agtctggcct ccaactcaga
4921 tccacatacc tgtctctgcc tcccaagtac tggggataaa ggaatgcacc accacacccc
4981 ggtgacctta ggggccaccc agggcaacac caaggcagtg ttttgagggg atgttaagat
5041 tgcatgtagg agcttctaat gtaggttggg ggttggcaaa cctgggtttg caggctggct
5101 ttctgagaat gggaacctga gagaggagat gctgaggctg ccactctggc attaatttgc
5161 cagccagtat tttcaaagtg gaactgttag tagcccatgc tgtcatggga cagctaaaca
5221 ccacacacac acacacccca agcagggtag tagagatcac aagttggaag ccagtttaat
5281 atggtctact tggggaattc
//
LOCUS GPIFCGR 1494 bp ss-mRNA ROD 20-JUL-1990
DEFINITION Guinea pig Fc-gamma-1/gamma-2 receptor mRNA, complete cds.
ACCESSION M35272
KEYWORDS Fc-gamma receptor; Fc-gamma-1/gamma-2 receptor.
SOURCE Guinea pig peritoneal macrophage, cDNA to mRNA, clone D-3.
ORGANISM Cavia sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Hystricomorpha; Caviidae; Caviinae.
REFERENCE 1 (bases 1 to 1494)
AUTHORS Tominaga,M., Sakata,A., Ohmura,T., Yamashita,T., Koyama,J. and
Onoue,K.
TITLE The structure and expression of the guinea pig Fc receptor for IgG1
and IgG2 (Fc-gamma-1/gamma-2-R)
JOURNAL Biochem. Biophys. Res. Commun. 168, 683-689 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 100 972 Fc-gamma-1/gamma-2 receptor precursor
sigp 100 147 Fc-gamma-1/gamma-2 receptor signal peptide
matp 148 969 Fc-gamma-1/gamma-2 receptor
mRNA < 1 1494 Fc-gamma-1/gamma-2 receptor mRNA
BASE COUNT 401 a 369 c 348 g 376 t
ORIGIN
1 cagcttgctg cagaagtgat ggcgatccct tcgttcttac ctgtccttgg caccaagagt
61 caccgagctg actataagcc cttgcagact ttgagccaca tgctgctgtg gataactgtg
121 ctattcctgg ctcctgttgc tgggacctct gcagaccccc cgaaggctgt ggtgagactg
181 gagcccccat ggatccaggt gctccgggga gaccgcgtga ctctgacctg cgagggtgcc
241 cccagccctg ggaaccactc cacccagtgg ctccacaacg ggagactcat ccctacccag
301 gtcctgccca gctaccggtt cactgccaag ggcaatgaca gcggagagta caggtgccag
361 gtcctgccca gctaccggtt cactgccaag ggcaatgaca gcggagagta caggtgccag
421 gcgggcggga ccagtctcag cgaccctgtg cgtctggacg tgatttctga ctggctggtg
481 ctccagactt ctcaactgat tttccaggag ggggacgtca tcgtgctgcg gtgccacagc
541 tggaataact ggcctttggc caaggtcaca ttctaccaca atggggtagc caagaaatat
601 ttctctatca gtaaaaattt ctccatccca caagcaaacc acagtcacag tggtgcttac
661 aactgcacgg gattaatagg aaggacatct cacacatcac cgcctgtgac catcactgtc
721 caagggccca agtcaagcga ctcttcaatg gtggtgataa ttgtggctgc agtcattggg
781 attgctacag cggccattgt tgttgctgta gtagctatca tctgcctcaa gaaaaagcag
841 cctccagcca atctctctga tcctgaggag gtcgctaaat ctgaggttga aaatacaatc
901 acctattcac ttctcaagca cccggaagct caggatgatg acacagagca tgactatcag
961 aaccacattt aatctccatt atctggccct gggatttggg ggagaaaaat caagaagtga
1021 agatctgcta tctccaggcc taaggttccc ttggagaggt cgagaggatg ctgaagttca
1081 aagaaggagc aggatttttc cagagtcctg tatgtgagtc ctaaagttct ttggcctgac
1141 actaacagaa aatatgaact ctgaaggctg gctgattctg tgcctcagca cttccctaca
1201 tcagggctgt tatacagccc cacagccaac aaaatgataa aattaatatt gctaagagat
1261 tttaacaaca tgtgacatgc ctacattatg ggtaacatga gaaaaattac ataagtatat
1321 atgatttcag aagtgataaa atcaactaac atctaccaac atattaaaaa tgattgtttc
1381 agggtgatag aattatcagt ggtttttgtt ctttcttatt ttcctacaaa tctataagtt
1441 tattttccta taaatcctat aaatcatgta ctgtatttgt aataaaatat tatg
//
LOCUS NPOCAP 2104 bp ds-DNA VRL 20-JUL-1990
DEFINITION Orgyia pseudotsugata polyhedrosis virus capsid protein (P39)
gene, complete cds.
ACCESSION M21042
KEYWORDS capsid protein; structural protein.
SOURCE Orgyia pseudotsugata polyhedrosis virus DNA, clone cosmid #58.
ORGANISM Orgyia pseudotsugata polyhedrosis virus
Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus;
Nuclear polyhedrosis viruses.
REFERENCE 1 (bases 1 to 2104)
AUTHORS Blissard,G.W., Quant-Russell,R.L., Rohrmann,G.F. and Beaudreau,G.S.
TITLE Nucleotide sequence, transcriptional mapping, and temporal
expression of the gene encoding p39, a major structural protein of
the multicapsid nuclear polyhedrosis virus of Orgyia pseudotsugata
JOURNAL Virology 168, 354-362 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by G.W.Blissard, 03-NOV-1988.
The p39 protein is found in both the PDV and BV virion phenotypes;
P39 mRNAs are approximately 2600 nt in length, are expressed late
(24-48 hpi), and have two major transcription initiation sites. At
each major transcription start site, two transcripts are initiated
within the ATAAG consensus. Transcripts initiate at positions 2
and 3 of the ATAAG consensus.
FEATURES from to/span description
pept 124 1179 capsid protein
mRNA 27 > 1179 capsid protein mRNA (alt.)
mRNA 28 > 1179 capsid protein mRNA (alt.)
mRNA 75 > 1179 capsid protein mRNA (alt.)
mRNA 76 > 1176 capsid protein mRNA (alt.)
BASE COUNT 495 a 606 c 586 g 417 t
ORIGIN Map position 57-58.
1 ccggcgcagc gcaccgtttc tttatataag aatctaaaat tagatacaag aggtcttggc
61 taaaattgat tgtataagag atttcttgct ctatcaaaac gtcggccccc atcgccgcag
121 aaaatggcgc tcgtgtcacc cggggtgtcg tcgcgacggt cgaccaacca ctgcatcttt
181 ggcgcgattg aaccgttcga ctcgtgcgtc acgtacaggt cgccatgctc gtcggacgcg
241 tcggtggacg acggctggtt catctgcgac taccacctga agttgcgctt caaaatgtcc
301 aaaatggtgc tgcccattta cgacgaagac gacaaccagt acaagcgcac catagcgcgg
361 catttggtgg ggcacaaaga aaggggcgta aagcgcattc tggtgcccac cagagccaac
421 tacatgaccg tgttcaattt gccgggcatg atgctggccg agcagctgat ttttcatttg
481 atctacgaca accgtttgga agtcaatcgc atttgcgcct cgttgaaaaa caacgaaaac
541 ttcattgaca acacgtacag cgttgtggag agcgtgtaca gcgcgacgcg caacattctg
601 tcgctcacag acccgcaagc gtactgctcg cgcgtggcca acgacgacgt gcgctttttc
661 gacgcgaacg tagtcgacaa caactaccag gcgggcaacg gcgacaccgt gtttaacaac
721 atgcccgggt ttttgcgcaa cctaatcagg cgcgccgtgg cgcccgaaac gctgcagatc
781 gacagcgaag acctgcgcct gcgaaactgc aacacctgtg tgattaacaa caccggcttg
841 gtagcaaccg tgacaaacac cgagctgtac aacccggtgc gcagctcgga catcattaaa
901 acggggccca atcgcttgca gattagaaac gttttgaagt ttgagggcga cacacgcgcg
961 ttggagcgca ccttgggccg ctacgaggaa taccccatgt acgtgccctt gtttttgggc
1021 taccagctgg tcaacctgca aaacgacatt ttgcgcgcca acaacttttt acccgcgccg
1081 tttggcgtgc cgcaggccgt aaacaacctg gaggcccaag cgcccgcagc gcccgcaccc
1141 gcagcgcccg cgcccgcgcc cgcagcgccc gttgtttaat tccgttgcca tggacacggt
1201 gaggctgcag tgccacaaat ttgctgctcg gtgggcgaaa tcaaaaatta ttttttgcag
1261 cccgtcgacg caatcaccat tttgcccatt gtggagctgc acacgtgccg gcaccaattg
1321 tgcgtcatgt gcgtgcgcaa aattgcgcag cgcggccgag acaagcgcgt cgagtgcccc
1381 atgtgccgcc gcaaaaacgc gcactttaac gtgtacagcg tgaaccgtaa ttcggtggat
1441 gtgttgcgct gcagcgtggc cgacgtgcgc gagcacgggc gctttggcgg attggcggac
1501 gccgcgtcgc tggctcgtgg gctgtttgag cccagcttgc ttgaagccga gcccgcgccg
1561 gacaactcgt ttggcccaaa cgagttacaa cttgtgctaa aacgtctaaa ggcgcaaatt
1621 gaggcgcaaa caagaactaa ttatgatttg caactgcaag caactgcttt ggagcgaaca
1681 atcgaagaag caaacgaccg cttgggtaaa agccgcggcg actacagcga cgcgtgcaag
1741 ctgatggacg agcttcgcgg cgacaggctg cgcgcggagc gcgcggtcaa ggcgctcgcc
1801 gacgcgcacg cccagtgggc ggacaagaac gccaaaatgc ggcgcgagaa cgacaggttg
1861 acaaatgaaa acattggttt gattcgggac aacaatttat ttaaacaaaa cactgcacgc
1921 aaacgtaaaa tcgctccgta accgcctttg tttgcgcgtc aaagtccacc tcgtagacgt
1981 gtcgctcgtt tacgctgccg cagcggtccc agttgtccgt cgtgcaccac gccgcattgt
2041 gtcggcggca caaatacagc gcttcgttga attgcgccgg cggcagattg aagcggccga
2101 gctc
//
LOCUS RATPPTA 981 bp ss-mRNA ROD 20-JUL-1990
DEFINITION Rat alpha-tachykinin mRNA, complete cds.
ACCESSION M34184
KEYWORDS alpha-tachykinin; substance P.
SOURCE Rat (Sprague-Dawley) rostral portion of the caudate putamen
complex, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 981)
AUTHORS Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D.
TITLE Three rat preprotachykinin mRNAs encode the neuropeptides substance
P and neurokinin A
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Krause, 11-MAY-1990.
FEATURES from to/span description
pept 100 438 preprotachykinin
sigp 100 156 tachykinin signal peptide (156 could be 171)
matp 271 303 substance P
mRNA < 1 981 tachykinin mRNA
BASE COUNT 271 a 203 c 234 g 273 t
ORIGIN
1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggagcgccca gcaagtgcgc
61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg
121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat
181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gccggagccc
241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta
301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt
361 tatgggcatg gtcagatctc tcacaaaatg gcttatgaaa gaagcgcaat gcagaactac
421 gaaagaaggc gtaaataaac cctgtaacgc actatctatt catctccatc tgtgtccgcg
481 agcagtgagc ggtaaaataa aaatgtgcgc tatgaggaat gattatttat ttaatatcaa
541 atgttgttat gagtgaaaaa ctcaaaaaag tgtttatttt ttcatattgt gccaataagc
601 attgtaattc taatgtggtg acctcctcag acagaagtag aaattagttg taacttcagc
661 aaagcacagt gttgatggag ttgtacaagt ttgccagcga tgcaagtctc caaagacaga
721 aaggctgctg tgaggcagtg caggcggctg ctgctggagg cagagaaact cctgtgtgtc
781 ttgcgcttcc cttggttgct tttatcctaa tgatgtactg agagtttggt atctgactct
841 atttgtatcc tagcagcatg tttcctgtgt tgtgactata tagagatgtt tttaaaagtt
901 tcaatgtact tctctggtct tcagtcattg tatgatgtgt tgtgatagct accattttaa
961 ataaaagaat gtatcttcag g
//
LOCUS RATPPTB 1035 bp ss-mRNA ROD 20-JUL-1990
DEFINITION Rat beta-tachykinin mRNA, complete cds.
ACCESSION M15191
KEYWORDS neurokinin A; substance P; tachykinin.
SOURCE Rat (Sprague-Dawley) rostral portion of the caudate putamen
complex, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1035)
AUTHORS Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D.
TITLE Three rat preprotachykinin mRNAs encode the neuropeptides substance
P and neurokinin A
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 100 492 beta-preprotachykinin
sigp 100 156 beta tachykinin signal peptide (156 could be
171)
matp 271 303 substance P
matp 391 420 neurokinin A
mRNA < 1 1035 beta-preprotachykinin mRNA
BASE COUNT 289 a 212 c 245 g 289 t
ORIGIN
1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc
61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg
121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat
181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gcccgagccc
241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta
301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt
361 tatgggcatg gtcagatctc tcacaaaagg cataaaacag attcctttgt tggactaatg
421 ggcaaaagag ctttaaattc tgtggcttat gaaagaagcg caatgcagaa ctacgaaaga
481 aggcgtaaat aaaccctgta acgcactatc tattcatctc catctgtgtc cgcgagcagt
541 gagcggtaaa ataaaaatgt gcgctatgag gaatgattat ttatttaata tcaaatgttg
601 ttatgagtga aaaactcaaa aaagtgttta ttttttcata ttgtgccaat aagcattgta
661 attctaatgt ggtgacctcc tcagacagaa gtagaaatta gttgtaactt cagcaaagca
721 cagtgttgat ggagttgtac aagtttgcca gcgatgcaag tctccaaaga cagaaaggct
781 gctgtgaggc agtgcaggcg gctgctgctg gaggcagaga aactcctgtg tgtcttgcgc
841 ttcccttggt tgcttttatc ctaatgatgt actgagagtt tggtatctga ctctatttgt
901 atcctagcag catgtttcct gtgttgtgac tatatagaga tgtttttaaa agtttcaatg
961 tacttctctg gtcttcagtc attgtatgat gtgttgtgat agctaccatt ttaaataaaa
1021 gaatgtatct tcagg
//
LOCUS RATPPTG 990 bp ss-mRNA ROD 20-JUL-1990
DEFINITION Rat gamma-tachykinin mRNA, complete cds.
ACCESSION M34183
KEYWORDS gamma-tachykinin; neurokinin; substance P.
SOURCE Rat (Sprague-Dawley) rostral portion of the caudate putamen
complex, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 990)
AUTHORS Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D.
TITLE Three rat preprotachykinin mRNAs encode the neuropeptides substance
P and neurokinin A
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Krause, 11-MAY-1990.
FEATURES from to/span description
pept 100 447 preprotachykinin
sigp 100 156 tachykinin signal peptide (156 could be 171)
matp 271 303 substance P
matp 346 375 neurokinin
mRNA < 1 990 tachykinin mRNA
BASE COUNT 275 a 202 c 238 g 275 t
ORIGIN
1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc
61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg
121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat
181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gccggagccc
241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta
301 atgggcaaac gggatgctgg gcatggtcag atctctcaca aaaggcataa aacagattcc
361 tttgttggac taatgggcaa aagagcttta aattctgtgg cttatgaaag aagcgcaatg
421 cagaactacg aaagaaggcg taaataaacc ctgtaacgca ctatctattc atctccatct
481 gtgtccgcga gcagtgagcg gtaaaataaa aatgtgcgct atgaggaatg attatttatt
541 taatatcaaa tgttgttatg agtgaaaaac tcaaaaaagt gtttattttt tcatattgtg
601 ccaataagca ttgtaattct aatgtggtga cctcctcaga cagaagtaga aattagttgt
661 aacttcagca aagcacagtg ttgatggagt tgtacaagtt tgccagcgat gcaagtctcc
721 aaagacagaa aggctgctgt gaggcagtgc aggcggctgc tgctggaggc agagaaactc
781 ctgtgtgtct tgcgcttccc ttggttgctt ttatcctaat gatgtactga gagtttggta
841 tctgactcta tttgtatcct agcagcatgt ttcctgtgtt gtgactatat agagatgttt
901 ttaaaagttt caatgtactt ctctggtctt cagtcattgt atgatgtgtt gtgatagcta
961 ccattttaaa taaaagaatg tatcttcagg
//GenBank-Updates@genbank.bio.net (07/21/90)
LOCUS BSUGLPKD 4259 bp ds-DNA BCT 21-JUL-1990
DEFINITION B.subtilis glycerol kinase (glpK) and glycerol-3-phosphate
dehydrogenase (glpD) genes, complete cds.
ACCESSION M34393
KEYWORDS glycerol kinase; glycerol-3- phosphate dehydrogenase.
SOURCE B.subtilis (starin W168) DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 4259)
AUTHORS Holmberg,C., Beijer,L., Rutberg,B. and Rutberg,L.
TITLE Glycerol catabolism in Bacillus subtilis: Nucleotide sequence of
the genes encoding glycerol kinase (glpK) and glycerol-3- phosphate
dehydrogenase (glpD)
JOURNAL Unpublished (1990) University of Lund, Dept. of Microbiology,
Solvegatan 21, S-223 62 Lund , SWEDEN
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.M.E.Holmberg, 22-MAY-1990.
FEATURES from to/span description
pept < 1 679 ORF 1 (AA at 2)
pept 698 2188 glycerol kinase (glpK) (EC 2.7.1.30)
pept 2329 3996 glycerol-3-phosphate dehydrogenase (glpD) (EC
1.1.99.5)
pept 4128 > 4259 ORF 4
binding 682 690 ribosome binding site
binding 2315 2321 ribosome binding site
binding 4116 4121 ribosome binding site
signal 2243 2303 transcription termination signal
signal 4000 4033 transcription termination signal
signal 4067 4072 -35 region
signal 4090 4095 -10 region
BASE COUNT 1269 a 853 c 1092 g 1045 t
ORIGIN
1 ccatggcgga tacgcggttg gcggcatcag cggtgcccat ttgaatccgg cgctaacgat
61 agcgcttgca tttgtaggag attttccttg gaaagaggtt ccggtttata ttgcggcgca
121 aatgatcgga gcgatcatcg gtgcggtgat tatttatctg cattacctcc cgcactggaa
181 gtcaacggat gatcccgctg ccaagctggg tgttttctca acagggccca gcattccgca
241 tacatttgca aacgttttaa gcgaagtgat tgggacattt gtccttgtac ttggaatctt
301 ggccataggt gcaaatcaat ttacagaagg acttaatcct ttaatcgtcg gtttcctcat
361 tgtagcaatc ggtatttctc tgggaggcac caccggctat gctatcaatc ctgcacgtga
421 cttaggtccg cggatcgccc acgcttttct tccgattccg gggaagggct catcaaactg
481 gaaatacgca tgggttccgg tagtcggccc gattttaggc ggatcattcg gcggtgtatt
541 ttacaacgct gcatttaaag ggcacatcac aagcagcttc tggattgtaa gcgttatatt
601 ggttgtggta ttgttaggac tctatgttta tacgaaatca cattctgcta aaacattatc
661 aaattctaaa tatatttaat caaaggggag acatcttatg gaaacgtaca ttttatcctt
721 agatcagggg acgacaagtt caagagcgat tctgtttaat aaagaaggca aaattgtcca
781 ctctgctcaa aaggaattta cacaatactt cccgcatcct ggctgggttg agcataatgc
841 caatgaaatt tggggctctg tcctcgcggt tatcgcctca gtcatctctg aatcaggaat
901 cagcgcttct caaattgccg gcatcggcat cacgaaccag cgcgagacga cggttgtgtg
961 ggataaagat acaggaagtc ctgtctataa tgcaatcgtt tggcagtcca gacagacgtc
1021 cggcatttgt gaggaacttc gtgaaaaagg atataatgat aaattcagag aaaaaacagg
1081 gcttttaatc gatccttact tctccggcac gaaggtgaag tggattttag acaatgtgga
1141 aggcgcaaga gaaaaagcgg aaaaaggcga gctgctgttt ggaacgattg atacgtggct
1201 catttggaaa atgtcaggcg gaaaagcgca tgtgaccgat tactccaatg cctcaagaac
1261 actgatgttt aatatttacg atttaaaatg ggacgatcaa ctgctcgaca ttctaggcgt
1321 accgaaatcc atgctccctg aagtgaagcc gtcctctcat gtgtatgcgg agactgttga
1381 ttatcacttc ttcggaaaaa atatcccgat tgctggagcg gcaggcgacc agcagtccgc
1441 attgttcggc caggcatgct ttgaagaagg catggggaaa aacacttacg gcacaggatg
1501 tttcatgctg atgaataccg gggaaaaagc aattaagtcc gaacatgggc ttttgacaac
1561 aatcgcttgg ggcattgacg gaaaagtgaa ctatgcgtta gaagggagca tttttgtcgc
1621 aggctctgcc atccagtggc ttagagacgg tttgagaatg ttccaggatt catcgctaag
1681 cgaatcttat gcagaaaaag tggattcaac tgacggcgtg tatgttgttc cagcatttgt
1741 cggactggga acgccttact gggacagcga tgtgcgcggt tcggttttcg gcctgacaag
1801 agggacaaca aaagagcact ttatccgtgc gacactggag tcattggctt atcagaccaa
1861 agatgtgctt gacgcaatgg aagcagattc aaacatttca ttaaagacgc tccgtgtaga
1921 cggaggagct gtaaaaaaca atttcctaat gcagttccaa ggagacctgt tgaatgttcc
1981 tgtggagcgc ccggaaatta atgaaacgac tgcacttggc gcggcttatt tggcgggtat
2041 cgctgtggga ttctggaagg accgttctga aatcgcgaac cagtggaatc tggataaacg
2101 gtttgagcct gaattggaag aagaaaaacg aaatgagctg tataaaggct ggcaaaaagc
2161 cgtgaaagca gctatggctt ttaaataaag taatactatg gtataatggt tacaagttaa
2221 taagaacggt cctgagatga ggagagacca cagcaccaaa gtgtaagcat gcactttggc
2281 tgttgtggtc tctttttcta tttaccgtga caacaaggag gaaacgtaat gatgaatcat
2341 caattttcaa gtcttgaaag agatcgcatg ctgacagaca tgacgaaaaa aacatatgac
2401 ctatttatta tcggaggagg aattacagga gccggaacag ctcttgacgc ggcatcaagg
2461 ggaatgaagg tcgcactaag cgaaatgcag gactttgcgg cgggaacatc aagccgatca
2521 acgaagctgg tacatggcgg cttgcgctat ttaaaacaat ttgaagtgaa aatggtcgct
2581 gaggtaggaa aagagcgggc gattgtgtat gaaaacggcc cgcatgtcac tacaccggaa
2641 tggatgctgc ttccgtttca taaaggcggc acattcggtt catttacaac atctattggg
2701 ttaagggttt atgacttcct tgcaggtgtg aaaaagtcag aacgaagaag catgctttca
2761 gcaaaagaaa cgctgcaaaa agagcctttg gtgaaaaaag acggcttaaa aggcggcggg
2821 tactatgtgg aataccgcac tgacgatgcg agactgacca tcgaagtcat gaaggaagcg
2881 gttaaattcg gggcagagcc tgtgaattac tccaaagtga aggagcttct ttacgaaaaa
2941 ggcaaagccg tcggcgtatt aattgaagat gtgctgacaa agaaagaata taaagtgtat
3001 gcgaaaaaaa ttgtcaatgc tacaggccct tgggtcgatc agctcagaga aaaagaccat
3061 tcgaaaaacg gaaagcattt gcagcataca aaaggcattc accttgtatt tgaccagtct
3121 gtctttccgc tgaaacaggc tgtatatttc gatacacctg atggccggat ggtatttgcg
3181 attcctcgtg aaggcaaaac atacgtggga acaacagaca ctgtttacaa agaggcgctg
3241 gagcatccgc ggatgacaac ggaggatcgt gattatgtca tcaaatcaat caattacatg
3301 ttcccggaac tgaatatcac tgcgaatgac atcgaatcca gctgggcggg gctgcgtccg
3361 ctgattcatg aagaaggcaa agatccttct gaaatttcac ggaaagacga gatttggaca
3421 tctgactcag gcctgatcac cattgccggc ggaaaactga ctggatacag aaaaatggcg
3481 gacgacatcg ttgatcttgt ccgtgatcgc ttaaaagaag agggcgaaaa ggattttgga
3541 ccatgtaaaa cgaaaaacat gccaatctct ggcgggcacg tcggcggttc gaaaaatctt
3601 atgtccttcg ttaccgcgaa aacaaaagaa ggaattgcag ccggtttatc agaaaaagac
3661 gcaaaacagc ttgcgatcag atacggctct aacgtagatc gcgtctttga ccgggtagaa
3721 gcgctgaaag atgaggccgc gaaacgcaac attccggttc atattcttgc tgaggcagaa
3781 tacagtatag aagaagagat gactgcaacc cctgctgact tctttgtccg cagaacggga
3841 cgtttatttt ttgatatcaa ttgggtaaga acatataaag atgccgttat tgattttatg
3901 agcgagcgat tccaatggga tgagcaggcg aaaaacaaac atacagaaaa cctcaacaag
3961 cttttacacg atgcggtcgt accgcttgag caataaatca taacgggctg tctgcagccc
4021 gttatttctt tttacgtgcc gaaaggggga gatctcaggt tatcaagtga cagaggcacc
4081 cgcttcatat aaaataatgg cataagctga tacataggag gacgaatatg acttggagaa
4141 agagctatga acgctggaaa cagacagaac atttagatct ggaattaaaa gagcgcctta
4201 ttgaattaga gggagatgaa caggcccttg aggactgttt ctataaagac cttgaattc
//
LOCUS CLCMTPLAS 379 bp ds-DNA PLN 21-JUL-1990
DEFINITION C.purpurea mitochondrial plasmid terminal 5' inverted repeat (TIR).
ACCESSION M35177
KEYWORDS .
SOURCE C.purpurea (strain K) mitochondrial plasmid pClK1 DNA.
ORGANISM Claviceps purpurea
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Clavicipitales; Clavicipitaceae.
REFERENCE 1 (bases 1 to 379)
AUTHORS Duevell,A., Hessberg-Stutzke,H., Oeser,B.,
Rogmann-Backwinkel,P. and Tudzynski,P.
TITLE Structural and functional analysis of mitochondrial plasmids in
Claviceps purpurea
JOURNAL Mol. Gen. Genet. 214, 128-134 (1988)
STANDARD simple staff_review
BASE COUNT 97 a 76 c 64 g 142 t
ORIGIN
1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt
61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact
121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa
181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac
241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta
301 aactttccgc taaaacccgc acgtcttcgt ctgcatggta tgggcgttag attctattct
361 acaatgtctt ccgccaaaa
//
LOCUS CLCMTPLASX 379 bp ds-DNA PLN 21-JUL-1990
DEFINITION C.purpurea mitochondrial plasmid 3' terminal inverted repeat (TIR).
ACCESSION M35178
KEYWORDS .
SOURCE C.purpurea (strain K) mitochondrial plasmid pClK1 DNA.
ORGANISM Claviceps purpurea
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Clavicipitales; Clavicipitaceae.
REFERENCE 1 (bases 1 to 379)
AUTHORS Duevell,A., Hessberg-Stutzke,H., Oeser,B.,
Rogmann-Backwinkel,P. and Tudzynski,P.
TITLE Structural and functional analysis of mitochondrial plasmids in
Claviceps purpurea
JOURNAL Mol. Gen. Genet. 214, 128-134 (1988)
STANDARD simple staff_review
BASE COUNT 105 a 75 c 62 g 137 t
ORIGIN
1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt
61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact
121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa
181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac
241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta
301 aactttccgc taaaacccgc acgtcttact ttagctcaac aaagaaagcg tagtgaactt
361 aaaaaagcac gtcttgctc
//
LOCUS DDISP60 328 bp ds-DNA INV 21-JUL-1990
DEFINITION D.discoideum spore coat 60 (sp60) gene, 5' flank.
ACCESSION M34546
KEYWORDS spore coat 60.
SOURCE D.discoideum (strain Ax4) DNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 328)
AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F.
TITLE Coordinate regulation of the spore coat genes in Dictyosteiium
discoideum
JOURNAL Dev. Genet. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly
submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT 168 a 15 c 11 g 134 t
ORIGIN
1 aaaacataga acaaatttaa ataattattt tttattaaat tgtattattt ttttttatta
61 tttatttatt ttttattttc aaaaaaaaaa aaaaaaagtt aacagacaaa aagtataatc
121 tatttaatta ttcaaaaaaa aaaatattaa atcattgtag tattttgttc atattcaaat
181 taaatatcaa tacaattaaa aaaagttaaa atgtaattat taatatatat atatatttat
241 caatatataa tttaatataa aaaaaaaaaa gaaatttaat aaacataaat atttataaaa
301 tcataaattt aaaataaata attttttt
//
LOCUS DDISP70 563 bp ds-DNA INV 21-JUL-1990
DEFINITION D.discoideum spore coat 70 (sp70) gene, 5' flank.
ACCESSION M34547
KEYWORDS spore coat 70.
SOURCE D.discoideum (strain Ax4) DNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 563)
AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F.
TITLE Coordinate regulation of the spore coat genes in Dictyosteiium
discoideum
JOURNAL Dev. Genet. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT 207 a 73 c 47 g 236 t
ORIGIN
1 gaagtgtgag ttgcgcagtg tgttagtgtg ggtttaatga gagtgttttt tttttttttt
61 tttttttttt tgaacgtaca tatttatttg tttttattaa tatttcataa atgaatttgt
121 aaatataaaa ataaattgtt tttttttatt ttattttttt gattatgaca ataatttcaa
181 taaaatgtaa aatatgataa aaataatttt aattattgaa acacacacac tcactcactt
241 tttcccacta gtattacact ttccaatgaa aaataaatta atattgaaaa agaaggaaat
301 ggaaaaaaat gaaaaataaa aataattaaa taaacaataa tattacaaac acactatttc
361 tttctcaccc acacaaccaa tatactaaca cttctcatac acccaccaac acacctacac
421 ccagttcttc acattcatac acaaagttta atattacact ttaaaaaaca ttcatattaa
481 tgatttttat gatttattta tttattaatt tatttaattg tgtattttga ttattgctct
541 ctacttttta tatatttttt ttc
//
LOCUS DDISP96 682 bp ds-DNA INV 21-JUL-1990
DEFINITION D.discoideum spore coat 96 (sp96) gene, 5' flank.
ACCESSION M34548
KEYWORDS spore coat 96.
SOURCE D.discoideum (strain Ax4) DNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 682)
AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F.
TITLE Coordinate regulation of the spore coat genes in Dictyosteiium
discoideum
JOURNAL Dev. Genet. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT 321 a 37 c 63 g 261 t
ORIGIN
1 tgaaaaaaaa aaaattttgt gtgtgatagt ttgtcaaaaa tcatttgaaa ttaaaatcat
61 attataatat atattaaaaa aatgttgtat ctttaatttt ttttcatact agtgtgttaa
121 atgaattaaa gtgtggtaat gaataaatga aaaaaaaaaa aaaattaaag tgagagagtg
181 tgtaaaagtg gtaatgaatg tgtgtttgag tgtgttaaaa atgtgtttac aaaatataat
241 aatttaaatt cattcatatt caactatgga gaaagcgtgt tttcaaacac aataaatgtg
301 taatgaaaaa tacaattaaa taattattta attttttgta tgattaaaaa aaaaagagta
361 aattagtagt gccaccctaa aacaataata ataataatta aataatatta atatttattt
421 aattttttta tttaattttt tttcaaaaca attcataata taaatttatt cttttttttt
481 tttttttttt tttttaaaaa tattcaactc agtaataata ataataaaaa taataacaat
541 aataaggaat tataatcaaa taataatata gctataatct ataattacta tagtaaaatt
601 acgtatacat atataaataa ttattattat cattgttttt aaaaataaat aaataaataa
661 taaaaaaaaa aaaaaaaaaa aa
//
LOCUS GLAA2GIA 1870 bp ds-DNA INV 21-JUL-1990
DEFINITION Giardia lamblia alpha-2 giardin gene, complete cds.
ACCESSION M34550
KEYWORDS alpha-2 giardin.
SOURCE G.lamblia (strain Portland-1) trophozoite DNA, clone pG2.
ORGANISM Giardia lamblia
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE 1 (bases 1 to 1213)
AUTHORS Alonso,R.A. and Peattie,D.A.
TITLE Genomic organization of alpha giardin genes in Giardia lamblia
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1214 to 1870)
AUTHORS Alonso,R.A. and Peattie,D.A.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.A.Alonso, 25-MAY-1990.
Author address: R.A.Alonso
Dept of Tropical Public Health
Harvard School of Public Health
665 Huntington Avenue, Boston, MA 02115
FEATURES from to/span description
pept 215 1105 alpha-2 giardin
BASE COUNT 466 a 468 c 507 g 429 t
ORIGIN
1 atactgcaga tacacctcat ggtgtacatg ctggcggcaa gtatcgtata aactctatgg
61 agacatgggg cagaccttgt ctcgctgctt tgacgagcga atggccccta ttttgcgcac
121 gctatggcgc gtcacagatg cataggataa gcgaactcat gatggaaatt caaattacct
181 taaaatatat tttctgagcg cttacattta gaaaatgccg aagctatccc agatcgtcgc
241 tgacatgaag caggccatcg atgccaagga cgaggcccag atcgcgttca tagcgtcgga
301 atactctgca gatgcgcgtc agaggattgc gcaggggtac agggaccaat acggcaagga
361 gctccccgac gacatcaaga aggcgctcaa gggcggctcc gaggagagcc tcctgatgga
421 tctcttcagc gatcgtcacg aggtccgcgc ccagcacatc cgcgacgcgc tttccggcaa
481 gaacgaccac atggccttct tcgacacagt cattctgtgc acccccgaag actggcacga
541 gaccgtcgcc gcctacacga ggatgttcaa gaagcccctc gtcgaggact tcatgaagga
601 tgtcggccgc aaggagaatt ggtgtctttt catggagaag tggatggctc acgagcgcac
661 tagccgtgaa ggcagccctg acgaggaagc agagaaactg aacaaggcgt tcagtgagag
721 cgaccacgac tacatttcca gcttcatggc tggggtgcct cccgaagaat acaagtccat
781 caacaccagc ttcaagagcc ttacaggaaa gggcatcgac caggccttcg ccaccatcta
841 tacggggact gactactact ccctgtactg cgcccacttt gctcttcttg gcatgcacaa
901 gcttgccgcc tacctggtca actgcgcctg caacgataag ggcgacgaga agcgcatgag
961 gaggatcacg ggcatgatgg tcgacaagtg cttggctgca aagtatgcct acaagaccta
1021 cgggagtatg aaagccgacg tcgaaaggtg ctttgataag cgcatggctc cgatcctctg
1081 taccctctgg cgcctaaggg agtgagcgcc tttactgcgg ggtttccttt cgctagtgaa
1141 ttgcctagcg ggtgtttact gccttctgtt tctttttgcc ctgtcctggc ggtccaagat
1201 gctatcctgc gatgagaggt ctttcgagaa cggatgcgag aagtctggga ttgctacatg
1261 ccgtccacct gactgtgagg gtgagtagga ctggagtagc ggcaaggagt gggagctcat
1321 gttgtccata gagctgtatg cctgatcaat ctgcccattt cctgcttgtc ctctgcgggt
1381 tttttgtttc agatctgagc tgcctaaccg agaggggccg tactctatgc ttagttctct
1441 attaaccaac tgaacagcct ttgctcgctt gcttgccagt gtatatttat atctatattt
1501 caagaacaga gagaggaccg tgatgaagag gagcaggatg agtgtcgcaa cggaccagaa
1561 cacgacaata agggctatag acatcttaga atgataacca acaaagctta tgaactcctc
1621 tccactgaaa gaagaggaaa gataccgagt ctcgtaaaac tgaatggtag tcctatcgat
1681 ggtgtgcccc cagatagggt ctactatcgg ctggttggta agattgcttg cagcggtatc
1741 gacagctgcc acaaccacta agtagtcaag gctcgtgctt gtacgaagga aggtcaacgg
1801 gaatgcgaag catggaccag gtgttatttg cttgctcgat aaactcagga gtatatattt
1861 cagtattgtc
//
LOCUS KAEHUTC 1625 bp ds-DNA BCT 21-JUL-1990
DEFINITION K.aerogenes histidine utilization repressor C (hutC) gene,
complete cds.
ACCESSION M34604
KEYWORDS histidine utilization repressor C.
SOURCE K.aerogenes (strain W-70) DNA.
ORGANISM Klebsiella aerogenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1625)
AUTHORS Schwacha,A. and Bender,R.A.
TITLE Nucleotide sequence of the gene encoding the repressor for
histidine utilization genes of Klebsiella aerogenes
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.A.Bender, 29-MAY-1990.
FEATURES from to/span description
pept < 1 669 histidine utilization repressor G (AA at 1)
pept 731 1456 histidine utilization repressor C (hutC)
pept 1599 > 1625 histidine utilization repressor U
signal 667 686 hutC promoter
signal 1522 1493 Pc promoter
signal 1531 1557 Puh promoter
binding 1475 1495 CRP binding site
site 34 61 helix-turn-helix
BASE COUNT 311 a 476 c 508 g 330 t
ORIGIN
1 gtcgacggcg agcagttaga agccgcgcat caggcgctgc gcgaggcggt ggccgactgt
61 cagcgggcgg gtaaacgcac gctggtcctg ggcggcggcc acgagacggc cttcggccac
121 ggcgctgggg tactggatgc cttcccaggc gaaaaagtgg gcattattaa tctcgatgcg
181 catctggatc tgcgctttgc cgactgcgcc agctccggga cgccgttccg ccagttggcg
241 ctggagtgtg acgcgcagca gcgcggtttt cactatacct gcatcggggt tagccgggcg
301 gcgaacaccc aggcgctatg ggatgaagcg gcgcgccgcc aggtggctat cgttgaagat
361 ctggaggtgc tgacagcctt cgaaactcgc gtgttgccgg agcttgagcg caatatcgcg
421 caattcgacc gcctgtatct gactatcgat ctcgacgtgc tgccggcgcg agaaatgccg
481 gcggtgtcgg ccccggcggc gctgggcgtg ccgctgggca cgctgctgcg catcgttgag
541 ccgctgtgcc gcagcggtaa gctgcaggcg gtggatctgg tggagtttaa cccgctgttt
601 gacattgacg gtcagggcgc tcgcgcggcg gcccgtgtgg catggcaaat cgcccattgg
661 tggcgctagc gatccgcgct attattttcg tttcgccgct ccgcttatta cgcataagga
721 aagccaggcc atgtttgcac aacaacctcg ttccgcgccc gcgcctttct atgaaaaggt
781 gaagcaggcg atcagcgaaa agatccacag cggcgtctgg cgaccgcatg accgtatccc
841 gtcggaagct gagctggtgg cccagttcgg cttcagccgg atgaccatca atcgcgcgct
901 gcgcgagctg acggacgaag gcctgctggt caggttgcaa ggggtgggaa cctttgtcgc
961 cgagccgaaa gggcagtcgg cgctgttcga ggtgcgcagt attgccgcag aaattgtcgc
1021 ccgtcaccat cagcaccgct gtgaggtact cctgcttgag gagacccggg cggatcatat
1081 tcaggcgacg gcgctcagcg tcccggaagg cacccgcatc tttcactcgc tgatggtgca
1141 ttacgaaaac gaggtgccgg tgcaaatcga agatcgctgc gtcaatgccg cagtggtacc
1201 ggactatctg catcaggact acaccgccac cacgccccat gattacctgt cgctgatcgc
1261 gccgttaacc gaaggtgaac atattgttga agcggtgcag gccacggcgg aagagtgcgc
1321 gctgctgcat attcacgccc acgatccgtg cctgctgatc cgtcgtcgca cctggtcgac
1381 aacccacatc gtctcccacg cgcgcctgct gttccccggt agccgatatc gtctgcaggg
1441 gcgctttggc tcctgatcca cgctggcgtc agcaaaacgt gattgctgac gcaatataac
1501 aaaattgtat catttctgtt aaatcctggc ttgcgcatgc ttgtatagac aagtatatgt
1561 atctacgtaa acaacgttta ttgtcaggag aaaccccgat gtcgcaaagc aaatatcgcc
1621 agctg
//
LOCUS MUSMHH2KBM 66 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse major histocompatibility complex antigen H-2Kbm29 mRNA.
ACCESSION M34542
KEYWORDS major histocompatibility complex antigen.
SOURCE Mouse ((C3H.CAS3(R4)XB6)F2, mutant #6414(bm29)), liver, cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 66)
AUTHORS Horton,R.M., Loveland,B.E., Parwani,A., Pease,L.R. and
Fischer Lindahl,K.
TITLE Immunological characterization and molecular analysis of the
H-2Kbm29 mutant
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Weiss,E., Golden,L., Zakut,R., Mellor,A., Fahrner,K., Kvist,R. and
Flavell,R.A.
TITLE The DNA sequence of the H-2Kb gene: Evidence for gene conversion as
a mechanism for the generation of polymorphism in
histocompatibility antigens
JOURNAL EMBO J. 2, 453-462 (1983)
STANDARD full automatic
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.M.Horton, 24-MAY-1990.
Mutation in codon 89 changes bp 43 and 44 to "gc", otherwise this
sequence is identical to reference 2.
FEATURES from to/span description
pept < 1 > 66 MHC H-2Kbm29 (AA at 3)
BASE COUNT 14 a 22 c 19 g 11 t
ORIGIN
1 cgagtggacc tgaggaccct gctcggctac tacaaccaga gcgcgggcgg ctctcacact
61 attcag
//
LOCUS YSCKGD2 2112 bp ds-DNA PLN 21-JUL-1990
DEFINITION S.cerevisiae dihydrolipoyl transsuccinylase (KGD2) gene,
complete cds.
ACCESSION M34531
KEYWORDS alphaketogluterate dehydrogenase complex;
dihydrolipoyl transsuccinylase.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2112)
AUTHORS Repetto,B. and Tzagoloff,A.
TITLE Structure and regulation of KGD2, the structural gene for yeast
dihydrolipoyl transsuccinylase
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Tzagoloff, 29-MAY-1990.
FEATURES from to/span description
pept 560 1987 dihydrolipoyl transsuccinylase
mRNA 306 > 1987 dihydrolipoyl transsuccinylase mRNA
mRNA 451 > 1987 dihydrolipoyl transsuccinylase mRNA (alt.)
signal 196 200 CAAT box
signal 228 232 CAAT box
BASE COUNT 649 a 435 c 425 g 603 t
ORIGIN
1 tcgaacgtct cgtggaaagc cttgaccgtt gcgtctttac tcgtctctat tcaaccctgt
61 ggtcctatta gaaccgatct ttgttccatc gtcattacta cttcttgtct cttctctctt
121 ggggaataat aaacaaaaca attccagcca cccgcatctc cgtcaaagac gatattaaca
181 accctttttt tgcagccaat caaaactaag gtcctgcccg gagtgagcca ataaagcggc
241 cacaaattac gcaggaatct atggttccta aagcttattc ttattctaca ttgacaccat
301 tctactacag ggctcggatc cacacttggt ctatcagcaa atgcaaatcc tttttccctc
361 aactaggcct acttttcata tataatctct ttttgagctt caggggagga aatatctaac
421 gtaaaaaaaa gaaaaatccg aaataaaaaa ttaggcagct atctttaacg gtatatatca
481 gtgaatagat atattgttat aaacttcact accacatttg ttacaaccaa agacacaact
541 tcagataatt atttaaacaa tgctttccag agcgacgcgt actgcagctg ccaaatcctt
601 agtaaaatct aaagtggcta gaaatgttat ggctgcttct ttcgtcaaga gacatgcttc
661 tacaagtttg ttcaaacaag ctaacaaggt cgaatcctta ggttcaatat atttatccgg
721 caagaaaatt tcagttgcgg cgaatccgtt ctccataact agcaatcgtt ttaaatctac
781 ctctattgaa gttcctccga tggcagagtc cctgactgaa ggctctttaa aggaatatac
841 taaaaacgtt ggtgatttta ttaaggagga cgagctgttg gccactattg agaccgataa
901 aattgatatt gaggtcaatt cgccagtatc aggtactgtt acgaagctaa atttcaaacc
961 agaggacact gtcactgttg gtgaggagtt agctcaggtc gagcctggtg aagcacctgc
1021 tgagggttct ggagaatcta agccagagcc taccgaacaa gcggagcatc gcaaggtgtc
1081 gccgcaaggg aaaactcaag tgaggaaacg gcttcaaaga aagaagctgc tccaaagaaa
1141 gaagccgctc caaagaaaga agttacagaa ccaaaaaagg actgatcaac caaagaagac
1201 cgtctctaag gcgcaggaac ccccagtagc ctctaactct ttcacaccat ttccacgtac
1261 agaaaccagg gtcaaaatga accgtatgag attgaggatt gccgaaagat taaaagagtc
1321 tcaaaacact gctgcttcct taaccacatt caacgaagtt gacatgtcag ctttgatgga
1381 aatgaggaaa ctgtataaag atgagattat taagaagacc ggtactaaat tcggattcat
1441 gggtcttttc tccaaagcat gtaccttggc cgccaaggat attccagccg tcaatggtgc
1501 cattgaaggt gaccagattg tttatcgtga ttacacagat atttctgttg ctgtggccac
1561 tccaaagggt ttggttaccc ccgtcgttcg taatgcagag tcattgagtg ttttagatat
1621 tgagaacgaa attgttcgct tgagtcataa agcgcgtgat ggcaaattaa ccctagaaga
1681 tatgacgggt ggtactttca ccatatctaa tggtggtgtt tttggttcat tatacggtac
1741 tcctatcatc aattcaccac aaacagccgt cctaggcttg catggtgtca aagagagacc
1801 tgtcactgtt aatggacaaa ttgtctcaag accaatgatg tacttggctt tgacttatga
1861 tcatagattg ctagatggtg agaagctgtt atccttcttg aagactgtta aagagttgat
1921 tgaagaccct agaaaatgtt gttatggtga tttgaaattt gcagcccata ccaatctgat
1981 ttcatgatta ttcttgtccg ctattactgt gtgaatctct tctatttaag tatgatattt
2041 aaagatatgc tatgatgata aatggaacta tgaattaatg ttttcttttt ataatgagat
2101 atggaaaagc gc
//
LOCUS PHUDLTN 781 bp ss-mRNA VRT 21-JUL-1990
DEFINITION P.bicolor [D-ala-2]deltorphin mRNA, complete cds.
ACCESSION M34560
KEYWORDS [D-ala-2]deltorphin.
SOURCE P.bicolor bicolor skin, cDNA to mRNA, clone AD2.
ORGANISM Phyllomedusa bicolor
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Neobatrachia; Bufonoidea; Phyllomedusidae.
REFERENCE 1 (bases 1 to 781)
AUTHORS Richter,K., Egger,R., Negri,L., Corsi,R., Sevrini,C. and Kreil,G.
TITLE cDNA encoding [D-ala-2]deltorphin precursors from skin of
Phyllomedusa bicolor also contain genetic information for three
dermorphin-related opoid peptides
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4836-4839 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Kreil, 18-MAY-1990.
FEATURES from to/span description
pept 30 713 pre pro[D-ala-2]deltorphin
matp 348 368 [D-ala-2]deltorphin II
matp 456 476 [D-ala-2]deltorphin I
matp 564 584 [D-ala-2]deltorphin I
matp 672 692 [D-ala-2]deltorphin I
mRNA < 1 781 [D-ala-2]deltorphin mRNA
BASE COUNT 323 a 69 c 233 g 156 t
ORIGIN
1 tcagtacttc ctgaataaca agacccaaca tgtctttctt gaagaaatcg cttctgcttg
61 tacttttcct tggattagtg tcccattccg tttgtaaaga agagaaaaga gagactgaag
121 aggagaatga aaatgaggaa gaaaatcatg aagtgggaag tgagatgaag agatatgcgt
181 tctggtatcc gaatagagac actgaagaga agaatgaaaa tgaggaagaa aatcaggaag
241 agggaagtga gatgaagaga tatgcgttcg gctatccgaa aagagagcct gaagaggaaa
301 atgagaatga ggaagaaaat catgaagagg gaagtgagat gaagagatat gcgtttgaag
361 ttgtgggagg agaagctaag aaaatgaaaa gagaacctga agaggaaaat gagaatgagg
421 aagaaaatca tgaagaggga agtgagatga agagatatgc gtttgacgtt gtgggaggag
481 aagctaagaa aatgaaaaga gagcctgaag aggaaaatga gaatgaggaa gaaaatcatg
541 aagagggaag tgagatgaag agatatgcgt ttgacgttgt gggaggagaa gctaagaaaa
601 tgaaaagaga gcctgaagag gaaaatgaga atgaggaaga aaatcatgaa gagggaagtg
661 agatgaagag atatgcgttt gacgttgtgg gaggagaagc taagaaaatg taatatttca
721 taacttaaag gagcacaatt atcagttata tgccaaacat atattaaatg atagataact
781 t
//
LOCUS HSBGPG3A 1829 bp ds-DNA VRL 21-JUL-1990
DEFINITION Bovine herpesvirus type 1 glycoprotein gIII gene, complete cds.
ACCESSION M27491
KEYWORDS glycoprotein.
SOURCE Bovine herpesvirus type 1 DNA, clones pDR[7,123] and p7-123.
ORGANISM Bovine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae.
REFERENCE 1 (bases 1 to 1829)
AUTHORS Fitzpatrick,D.R., Babiuk,L.A. and Zamb,T.J.
TITLE Nucleotide sequence of bovine herpesvirus type 1 glycoprotein gIII,
a structural model for gIII as a new member of the immunoglobulin
superfamily, and implications for the homologous glycoproteins of
other herpesviruses
JOURNAL Virology 173, 46-57 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.J/Zamb, 31-AUG-1989.
FEATURES from to/span description
pept 160 1725 glycoprotein gIII precursor
sigp 178 222 put. glycoprotein gIII signal peptide
matp 223 1723 glycoprotein gIII
signal 25 28 CAAT box
signal 61 64 TATA box
BASE COUNT 247 a 737 c 597 g 248 t
ORIGIN 97 bp upstream of BamHI site; between map units 0.122 and 0.135.
1 cgcgcctgca gccgcgcgtg tgctcaatcc cggaccacga aagcacaaaa cggacgccct
61 taaaaatgta gcccgcgccg cggtcgcggc catcttggat ccacccgcgc gcacgaccgc
121 cgagagaccg ccagcccgag acctcgccgc gcgtccgcca tgggcccgct ggggcgagcg
181 tggctgatcg cagctatttt cgcctgggcg ctcctgtctg cccggcgggg gctcgccgag
241 gaggcggaag cctcgccctc gcctccgccc tccccgtgcc caaccgagac ggaaagctcc
301 gctgggacca ccggcgcaac gccccccacg cccaacagcc ccgacgctac gccagaggac
361 agcacgcccg gtgctactac gcccgtgggg acgccggagc cgccgtccgt gtccgagcac
421 gacccgcccg ttaccaacag cacgccgccg cccgccccgc ccgaggacgg gcgacccggc
481 ggcgctggca acgcgagccg cgatgggcga cctagcggcg gggggcggcc tcgccccccg
541 cggccgagca aagccccgcc gaaggagcgc aagtggatgc tctgcgagcg cgaggccgtg
601 gccgcctcgt acgccgagcc gctgtacgtg cactgcggcg tggccgacaa cgccactggc
661 ggtgcgcgcc tggagctctg gtttcagcgc gtgggcaggt tccgctccac gcgcggcgac
721 gacgaggccg tgcgcaaccc ctttccgcgg gccccgcccg tgctgctgtt cgtagcccag
781 aacggctcga tcgcgtaccg tagcgcggag ctgggcgaca actatatttt cccttcgccc
841 gccgaccccc gcaacttgcc cctgaccgtg cgctccctga cggccgccac cgagggcgtg
901 tacacttggc gccgcgacat gggcaccaag tcacagcgca aggtcgtgac cgtcacgacg
961 caccgcgcgc ccgctgtttc cgtcgaaccc cagccagcgc tagaaggcgc cggctacgcg
1021 gccgtgtgcc gcgccgccga gtactacccg ccgcgctcca cgcgcctgca ctggttccgc
1081 aacggctacc ccgtggaggc tcggcacgcg cgcgacgtct ttacggtcga cgactccggg
1141 ctcttttcgc gcacgtccgt ccttacgctc gaggacgcga cgccaaccgc ccacccgccc
1201 aacctgcgct gcgacgtctc ctggttccag agcgctaaca tggagcgccg cttttacgcg
1261 gctggcacgc cggccgttta ccgcccgccc gagctgcgcg tgtacttcga gggcggcgag
1321 gccgtctgcg aggcgcgctg cgtccccgag gggcgcgtct ccctgcgctg gacggtgcgc
1381 gacggcatcg ccccgtcgcg cactgagcag accggcgtct gcgccgagcg gcccgggctg
1441 gtaaacctgc gcggcgtgcg cctgctttct acaaccgacg ggcccgtcga ctacacctgc
1501 accgccactg gctacccggc accgctgccc gagttctccg cgaccgccac gtacgacgcc
1561 tcgcccggcc taatcggaag ccccgtcctc gtcagcgtcg tggccgtcgc ctgcggtctc
1621 ggcgccgtgg ggctcctgct ggtggcggcc tcgtgcctgc ggcgcaaggc ccgggtaatc
1681 caacccggtc ttactcgcgc tcgcgccctc ggctccgcgc cctagacgac cggcacggcc
1741 tggaggcgct ggcggctgcc ggtgccgctc acaccgcgcg ccacaaccgc gacgtgtggc
1801 agcgcttttc ccgcgtctgc gaggccggc
//
LOCUS DOGCHAMC 933 bp ss-mRNA MAM 21-JUL-1990
DEFINITION Dog mast cell chymase mRNA, complete cds.
ACCESSION J02904
KEYWORDS chymase.
SOURCE Dog skin mastocytoma cell line BR, cDNA to mRNA.
ORGANISM Canis sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 933)
AUTHORS Caughey,G.H., Raymond,W.W. and Vanderslice,P.
TITLE Dog mast cell chymase: Molecular cloning and characterization
JOURNAL Biochemistry 29, 5166-5171 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.W.Raymond, 17-MAY-1990.
FEATURES from to/span description
pept 10 759 mast cell chymase precursor
sigp 10 72 mast cell chymase signal peptide
matp 73 756 mast cell chymase
mRNA < 1 933 mast cell chymase mRNA
signal 916 921 polyA signal
BASE COUNT 233 a 272 c 229 g 199 t
ORIGIN
1 cctgagaaga tgcattgtct tcctctcacc ctgctgctcc ttctcctatg ttccagagca
61 gaagctgagg agatcatcgg gggcacagag tccaagccac actcccgccc ttacatggcc
121 cacctggaaa tcctcactct ccggaatcac ctggcgtctt gtggtggttt cctgataagg
181 cggaacttcg tactgacagc tgctcactgt gcaggaaggt tcataatggt caccctggga
241 gcccataaca tacaaaagaa agaagacacg tggcagaagc ttgaggtcat aaaacaattc
301 cctcacccaa aatatgatga ccttactctt cgccatgaca tcatgctact gaagttgaag
361 gagaaggcca acctgaccct ggccgtgggg accctccccc tctcacccca gttcaacttc
421 gtcccacctg ggagaatgtg ccgggtggct ggctggggaa aaagacaagt caatggatca
481 ggctctgaca ctctgcaaga ggtgaagctg aggctcatgg acccccaggc ctgtagacac
541 tacatggctt ttgaccacaa cctccagctg tgtgttggca atcccaggaa gacaaaatct
601 gcatttaagg gagattcagg gggccctctt ctgtgtgctg gggtagccca gggaattgtg
661 tcctatgggc agaatgatgc aaagccccct gctgtcttca cccgaatctc ccactaccgg
721 ccctggatca ataaggttct gaagcagaat aaagcctgaa tggagcctgg gcaagcctga
781 ggggaaatct ggaaccagac ctgagcaggc tctctgccac ttactctgga gctgcctcta
841 gtctctactg aggccccact acatccctca gacccaagag ggctcctcca ggtcacagag
901 tcctcagtaa acctcaataa agatccagct tcc
//
LOCUS MUSIGHACQ 340 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION M34732
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H210-56, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 340)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 340 Ig H-chain VDJ-region (AA at 1)
recomb 291 292 V-region end/D-region start
recomb 298 299 D-region end/J-region start
BASE COUNT 80 a 79 c 93 g 88 t
ORIGIN
1 gtggagtctg gaggaggctt ggtacagcct gggggttctc tgagactctc ctgtgcaact
61 tctgggttca ccttcactga ttactacatg agctgggtcc gccagcctcc aggaaaggca
121 cttgagtggt tgggttttat tagaaacaaa gctaatggtt acacaacaga gtacagtgca
181 tctgtgaagg gtcggttcac catctccaga gataattccc aaagcatcct ctatcttcaa
241 atgaacaccc tgagagctga ggacagtgcc acttattact gtgcaagaga tagcggcggg
301 tttgcttact ggggccaagg gactctggtc actgtctctg
//
LOCUS MUSIGHACR 286 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION M34733
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-17, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 286)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 286 Ig H-chain VDJ-region (AA at 1)
recomb 234 235 V-region end/D-region start
recomb 238 239 D-region end/J-region start
BASE COUNT 74 a 70 c 68 g 74 t
ORIGIN
1 acttctgggt tcaccttcac tgattactac atgagctggg tccgccagcc tccaggaaag
61 gcacttgagt ggttgggttt tattagaaac aaagctaatg gttacacaac agagtacagt
121 gcatctgtga agggtcggtt caccatctcc agagataatt cccaaagcat cctctatctt
181 caaatgaaca ccctgagagc tgaggacagt gccacttatt actgtgcaag agatcaacgc
241 ctggtttgct tactggggcc aagggactct ggtcactgtc tctgca
//
LOCUS MUSIGHACS 354 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION M34734
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H36-12, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 354 Ig H-chain VDJ-region (AA at 1)
recomb 303 304 V-region end/D-region start
recomb 311 312 D-region end/J-region start
BASE COUNT 85 a 79 c 93 g 90 t 7 others
ORIGIN
1 gaggtgaagt tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagactc
61 tcctgtgcaa cctctggttt taccttcacg aattactaca tgnnnnnnnt ccgccagcct
121 ccaggaaagg cacttgagtg gttgggtgtt attagaaaca aagctaatgg tgacacaaca
181 gagtataatg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc
241 ctctatcttc aaatgaacac cctgagagct gaggacagtg ccacttatta ctgtgcaaga
301 gatagcggtc cgtttgctta ctggggccaa gggactctgg tcactgtctc tgca
//
LOCUS MUSIGHACT 312 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION M34735
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-3, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 Ig H-chain VDJ-region (AA at 2)
recomb 262 263 V-region end/D-region start
recomb 268 269 D-region end/J-region start
BASE COUNT 95 a 69 c 73 g 75 t
ORIGIN
1 gcctaaaggg tcattgaaac tctcatgtgc agcctctgga ttcaccttca ataccaatgc
61 catgaactgg gtccgccagg ctccaggaaa gggtttggaa tgggttgctc gcataagaag
121 taaaagtaat aattatgcaa catattatgc cgattcagtg aaagacaggt tcaccatctc
181 cagagatgat tcacaaagca tgctctatct gcaaatgaac aacttgaaaa ctgaggacac
241 agccatgtat tactgtgtga gagcggcgta ctatgctatg gactactggg gtcaaggaac
301 ctcagtcacc gt
//
LOCUS MUSIGHACU 353 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION M34736
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H230-7, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 353)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 353 Ig H-chain VDJ-region (AA at 1)
recomb 294 295 V-region end/D-region start
recomb 308 309 D-region end/J-region start
BASE COUNT 87 a 83 c 100 g 83 t
ORIGIN
1 gaagtgcagc tggtggagtc tgggggaggc ttagtgaagc ctggagggtc cctgaaactc
61 tcctgtgcag cctctggatt cactttcagt gactattaca tgtattgggt tcgccagact
121 ccggaaaaga ggctggagtg ggtcgcaacc attagtgatg gtggtagtta cacctactat
181 ccagacagtg tgaaggggcg attcaccatc tccagagaca atgccaagaa caacctgtac
241 ctgcaaatga gcagtctgaa gtctgaggac acagccatgt attactgtgc aaggtacggg
301 acccttattg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctc
//
LOCUS MUSIGHACV 351 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION M34737
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H35-D1, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 351)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 351 Ig H-chain VDJ-region (AA at 1)
recomb 294 295 V-region end/D-region start
recomb 300 301 D-region end/J-region start
BASE COUNT 89 a 82 c 99 g 81 t
ORIGIN
1 gaagtgcagc tggtggagtc tgggggaggc ttagtgaagc ctggagggtc cctgaaactc
61 tcctgtgcag cctctggatt cactttcagt gactattaca tgtattgggt tcgccagact
121 ccggaaaaga ggctggagtg ggtcgcaacc attagtgatg gtggtagtta cacctactat
181 ccagacagtg tgaaggggcg attcaccatc tccagagaca atgccaagaa caacctgtac
241 ctgcaaatga gcagtctgaa gtctgaggac acagccatgt attactgtgc aaggtattac
301 tacggtagga ttgactactg gggccaaggc accactctca cagtctccgc a
//
LOCUS MUSIGHACW 348 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION M34738
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma L11-1A1, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 348 Ig H-chain VDJ-region (AA at 1)
recomb 291 292 V-region end/D-region start
recomb 309 310 D-region end/J-region start
BASE COUNT 84 a 97 c 80 g 87 t
ORIGIN
1 gatgtgcagc ttcaggagtc gggacctggc ctggtgaaac cttctcagtc tctgtccctc
61 acctgcactg tcactggcta ctcaatcacc agtgattatg cctggaactg gatccggcag
121 tttccaggaa acaagctgga gtggatgggc tacataaact acagtggtta cactacctac
181 aacccatctc tcaaaagtcg aatctctatc actcgagaca catccaagga ccagttcttc
241 ctgcagttga attctgtgac tactgaggac acagccacat attactgtgc attactacgg
301 ctacgtgcgg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS MUSIGKACN 341 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34739
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H144-17B7,
cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 341)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 341 Ig kappa-chain VJ-region (AA at 1)
recomb 302 303 V-region end/J-region start
BASE COUNT 92 a 83 c 87 g 79 t
ORIGIN
1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
301 ccgctcacgt tcggtgctgg gaccaagctg gagctgaaac g
//
LOCUS MUSIGKACO 341 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34740
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H230-7, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 341)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 341 Ig kappa-chain VJ-region (AA at 1)
recomb 302 303 V-region end/J-region start
BASE COUNT 92 a 82 c 87 g 79 t 1 others
ORIGIN
1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
241 atnagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
301 ccgctcacgt tcggtgctgg gaccaagctg gagctgaaac g
//
LOCUS MUSIGKACP 338 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34741
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H210-56, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 338)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 338 Ig kappa-chain VJ-region (AA at 1)
recomb 302 303 V-region end/J-region start
BASE COUNT 91 a 82 c 86 g 79 t
ORIGIN
1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
301 ccgctcacgt tcggtgctgg gaccaagctg gagctgaa
//
LOCUS MUSIGKACQ 341 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34742
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-17, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 341)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 341 Ig kappa-chain VJ-region (AA at 1)
recomb 303 304 V-region end/J-region start
BASE COUNT 92 a 83 c 86 g 80 t
ORIGIN
1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
301 cctctcacgt tcggtgctgg gaccaagctg gagctgaaac g
//
LOCUS MUSIGKACR 341 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34743
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-3, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 341)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 341 Ig kappa-chain VJ-region (AA at 1)
recomb 304 305 V-region end/J-region start
BASE COUNT 96 a 81 c 84 g 80 t
ORIGIN
1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
301 cctctcacgt tcggctcggg gacaaagttg gaaataaaac g
//
LOCUS MUSIGKACS 335 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34744
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H35-D1, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 335)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 335 Ig kappa-chain VJ-region (AA at 1)
recomb 302 303 V-region end/J-region start
BASE COUNT 89 a 82 c 84 g 79 t 1 others
ORIGIN
1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
241 atcagcagtg tgcaggctga agacctngca gtttattact gtcagaatga ttatagttat
301 ccgctcacgt tcggtgctgg gaccaagctg gagct
//
LOCUS MUSIGKACT 333 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34745
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma L11-1A1, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 333)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 333 Ig kappa-chain VJ-region (AA at 1)
recomb 296 297 V-region end/J-region start
BASE COUNT 90 a 80 c 86 g 77 t
ORIGIN
1 gtgatgacac agtctccatc ctccctgact gtgacagcag gagagaaggt cactatgagc
61 tgcaagtcca gtcagagtct gttaaacagt ggaaatcaaa agaagtactt gacctggtac
121 cagcagaaac cagggcagcc tcctaaactg ttgatctact gggcatccac tagggaatct
181 ggggtccctg atcgcttcac aggcagtgga tctggaacag atttcactct caccatcagc
241 agtgtgcagg ctgaagacct ggcagtttat tactgtcaga atgattatag ttatccgctc
301 acgttcggtg ctgggaccaa gctggagctg aaa
//
LOCUS MUSIGKACU 339 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34746
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma IC5-2A6, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 339)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 339 Ig kappa-chain VJ-region (AA at 1)
recomb 303 304 V-region end/J-region start
BASE COUNT 92 a 80 c 84 g 83 t
ORIGIN
1 gacattgtga tgacacagtc tccatcttcc ctgactgtga cagcaggaga gaaggtcact
61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa cttcttgacc
121 tggtaccagc agaaaccagg acagtctcct aaactgttga tctactgggc atccactagg
181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
241 atcagcagtg tgcaggctga agacgtggca gtttattact gtcagaatga ttatacttat
301 cctctcacgt tcggtgctgg gaccaagctg gagctgaaa
//
LOCUS MUSIGKACV 318 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34747
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H36-8, cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 318 Ig kappa-chain VJ-region (AA at 2)
recomb 283 284 V-region end/J-region start
BASE COUNT 85 a 79 c 77 g 77 t
ORIGIN
1 tccatcctcc ctgactgtga cagcaggaga gaaggtcact atgagttgca agtccagtca
61 gagtctctta aacagtggaa ctcaaaagaa ctacttgacc tggtaccagc agaaaccagg
121 gcagcctcct aaactattga tctactgggc atccactagg gaatctgggg tccctgatcg
181 cttcacaggc agtggatctg gaacagattt cactctcacc atcatcagtg tgcagactga
241 agacctggca gtttatttct gtcagaatga ttatagttat ccgctcacgt tcggtgctgg
301 gaccaagctg gagctgaa
//
LOCUS MUSIGKACW 246 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34748
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H36-12, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 246)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 246 Ig kappa-chain VJ-region (AA at 2)
recomb 211 212 V-region end/J-region start
BASE COUNT 64 a 64 c 59 g 59 t
ORIGIN
1 ccgtggaaat caaaagaact acttgacctg gtaccagcag aaaccagggc agcctcctaa
61 actcttaatc tactgggcat ccactaggga atctggggtc cctgatcgct tcacaggcag
121 tggatctgga acagatttca ctctcaccat cagcagtgtg cgggctgaag acctggcagt
181 ttattactgt cagaatgatt atcattatcc gctcactttc ggtgctggga ccaagctgga
241 cctgaa
//
LOCUS MUSIGKACX 176 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION M34749
KEYWORDS immunoglobulin light chain; joining exon; kappa-chain;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H37-41, cDNA
to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 176)
AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
Weigert,M.G.
TITLE V region gene usage and somatic mutation in the primary and
secondary responses to influenza virus hemagglutinin
JOURNAL J. Immunol. 144, 2795-2801 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 176 Ig kappa-chain VJ-region (AA at 1)
recomb 138 139 V-region end/J-region start
BASE COUNT 41 a 45 c 47 g 43 t
ORIGIN
1 tgggcatcca ctagggaatc tggggtccct gatcgcttca caggcagtgg atctggaaca
61 gatttcactc tcaccatcac caatgtgcag gctgaagacc tggcagttta ttactgtcag
121 aatgattaca cttatccgct cacgttcggt gctgggacca agctggagct gagacg
//
LOCUS MUSMHH2A 876 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse MHC class I H-2K-kml mRNA (H-2K-kml haplotype), partial cds.
ACCESSION M34932
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Mouse (strain CBA.M523) lymphoma cell line RDM-4, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 876)
AUTHORS Martinko,J.M., Solheim,J.C. and Geliebter,J.
TITLE The H-2K-km1 mutation: A single nucleotide substitution is
responsible for multiple functional differences in a class I MHC
molecule
JOURNAL Mol. Immunol. 25, 267-274 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 876 MHC H-2K-kml mRNA (AA at 1)
BASE COUNT 180 a 255 c 286 g 155 t
ORIGIN
1 ggcccacatt cgctgaggta tttccacacc gccgtgtccc ggcccggcct cgggaagccc
61 cggttcatct ctgtcggcta cgtggacgac acgcagttcg tgcgcttcga cagcgacgcg
121 gagaatccga ggtatgagcc gcgggtgcgg tggatggagc aggtggagcc cgagtattgg
181 gagcggaaca cgcagatcgc caagggcaat gagcagattt tccgagtgaa cctgaggacc
241 gcgctgcgct actacaacca gagcgcgggc ggctctcaca cgttccaacg gatgtacggc
301 tgtgaggtgg ggtcggactg gcgcctcctc cgcgggtacg agcagtacgc atacgacggc
361 tgcgattaca tcgccctgaa cgaagacctg aaaacgtgga cggcggccga catggcggcg
421 ctgatcacca aacacaagtg ggagcaggct ggtgctgcag agagagaccg ggcctacctg
481 gagggcacgt gcgtggagtg gctccgcaga tacctgcagc tcgggaacgc gacgctgccg
541 cgcacagatt ccccaaaggc ccatgtgacc cgtcacagca gacctgaaga taaagtcacc
601 ctgaggtgct gggccctggg cttctaccct gctgacatca ccctgacctg gcagttgaat
661 ggggaggagc tgacccagga catggagctt gtggagacca ggcctgcagg ggatggaacc
721 ttccagaagt gggcatctgt ggtggtgcct cttgggaagg agcagtatta cacatgccat
781 gtgtaccatc aggggctgcc tgagcccctc accctgagat gggagcctcc tccatccact
841 gtctccaaca cggtaatcat tgctgttctg gttgtc
//
LOCUS PEALEGAB 137 bp ss-mRNA PLN 21-JUL-1990
DEFINITION P.sativum legumin mRNA, 3' flank.
ACCESSION M34908
KEYWORDS legumin; storage protein.
SOURCE P.sativum, cDNA to mRNA, clones pDUB[1,3].
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 137)
AUTHORS Lycett,G.W., Delauney,A.J. and Croy,R.R.D.
TITLE Are plant genes different?
JOURNAL FEBS Lett. 153, 43-46 (1983)
STANDARD simple staff_review
FEATURES from to/span description
mRNA < 1 137 legumin mRNA
BASE COUNT 53 a 20 c 24 g 40 t
ORIGIN
1 tagatttcgc accaaatcaa tgaaagtaat gaataagaaa actaaggctt agatgccttt
61 gttacttgtg taaaataact cgagtcatgt acctttttgc ggaaacagaa taaataaaag
121 gtaaaatttc agtgctc
//
LOCUS PEAVICA 81 bp ss-mRNA PLN 21-JUL-1990
DEFINITION P.sativum vicilin mRNA, 3' flank.
ACCESSION M34909
KEYWORDS storage protein; vicilin.
SOURCE P.sativum, cDNA to mRNA, clone pDUB2.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 81)
AUTHORS Lycett,G.W., Delauney,A.J. and Croy,R.R.D.
TITLE Are plant genes different?
JOURNAL FEBS Lett. 153, 43-46 (1983)
STANDARD simple staff_review
BASE COUNT 32 a 11 c 13 g 25 t
ORIGIN
1 taatgagaga tcaaatattt tgcatgtatg ctataaagaa ctatagctca taatgagcaa
61 ggaataaaac atcgttctct t
//
LOCUS ECOALKPH 52 bp ds-DNA SYN 21-JUL-1990
DEFINITION Synthetic E.coli alkaline phosphatase gene, partial cds.
ACCESSION M34498
KEYWORDS alkaline phosphatase.
SOURCE E.coli (strain AW1043) DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 52)
AUTHORS Chou,M.M. and Kendall,D.A.
TITLE Polymeric sequences reveal a functional interrelationship between
hydrophobicity and length of signal peptides
JOURNAL J. Biol. Chem. 265, 2873-2880 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 52 synthetic alkaline phosphatase precursor (AA
at 1)
sigp < 1 > 52 synthetic alkaline phosphatase precursor
signal peptide
BASE COUNT 9 a 15 c 10 g 18 t
ORIGIN
1 tcgactctgc ttctgctgct cttgctgtta ctgttaaccc ctgtgacaaa ag
//
LOCUS ECOALKPHX 52 bp ds-DNA SYN 21-JUL-1990
DEFINITION Synthetic E.coli alkaline phosphatase gene, partial cds.
ACCESSION M34499
KEYWORDS alkaline phosphatase.
SOURCE E.coli (strain AW1043) DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 52)
AUTHORS Chou,M.M. and Kendall,D.A.
TITLE Polymeric sequences reveal a functional interrelationship between
hydrophobicity and length of signal peptides
JOURNAL J. Biol. Chem. 265, 2873-2880 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 52 synthetic alkaline phosphatase precursor (AA
at 1)
sigp < 1 > 52 synthetic alkaline phosphatase precursor
signal peptide
BASE COUNT 8 a 10 c 17 g 17 t
ORIGIN
1 tcgactgtcg tggttgtcgt agttgtggtg gtcgttaccc ctgtgacaaa ag
//
LOCUS ECOALKPHXX 52 bp ds-DNA SYN 21-JUL-1990
DEFINITION Synthetic E.coli alkaline phosphatase gene, partial cds.
ACCESSION M34500
KEYWORDS alkaline phosphatase.
SOURCE E.coli (strain AW1043) DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 52)
AUTHORS Chou,M.M. and Kendall,D.A.
TITLE Polymeric sequences reveal a functional interrelationship between
hydrophobicity and length of signal peptides
JOURNAL J. Biol. Chem. 265, 2873-2880 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 52 synthetic alkaline phosphatase precursor (AA
at 1)
sigp < 1 > 52 synthetic alkaline phosphatase precursor
signal peptide
BASE COUNT 10 a 18 c 17 g 7 t
ORIGIN
1 tcgactgcag cagctgcagc tgcggctgcc gcggcgaccc ctgtgacaaa ag
//
LOCUS ECOALKPHY 52 bp ds-DNA SYN 21-JUL-1990
DEFINITION Synthetic E.coli alkaline phosphatase gene, partial cds.
ACCESSION M34501
KEYWORDS alkaline phosphatase.
SOURCE E.coli (strain AW1043) DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 52)
AUTHORS Chou,M.M. and Kendall,D.A.
TITLE Polymeric sequences reveal a functional interrelationship between
hydrophobicity and length of signal peptides
JOURNAL J. Biol. Chem. 265, 2873-2880 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 52 synthetic alkaline phosphatase precursor (AA
at 1)
sigp < 1 > 52 synthetic alkaline phosphatase precursor
signal peptide
BASE COUNT 8 a 11 c 25 g 8 t
ORIGIN
1 tcgactggcg gtgggggcgg cggaggtggt ggcggtaccc ctgtgacaaa ag
//
LOCUS CPCURA5A 2029 bp ds-DNA BCT 21-JUL-1990
DEFINITION C.neoformans orotidine monophosphate pyrophosphorylase (URA5) gene,
complete cds.
ACCESSION M34606
KEYWORDS orotidine monophosphate pyrophosphorylase.
SOURCE C.neoformans (strain B-3501) asexual phase cell line B-3501 DNA,
clone pURA5g2.
ORGANISM Cryptococcus neoformans
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina; Filobasidiales.
REFERENCE 1 (bases 1 to 2029)
AUTHORS Edman,J.C. and Kwon-Chung,K.J.
TITLE Isolation of the URA5 gene from Cryptococcus neoformans var
neoformans and its use as a selective marker for transformation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Edman, 29-MAY-1990.
Author address: J.C.Edman
University of California San Francisco
Hormone Research Institute
San Francisco, CA 94143-0534
email: edman@socrates.ucsf.edu
FEATURES from to/span description
pept 714 814 orotidine monophosphate pyrophosphorylase
(URA5), exon 1 (E.C. 2.4.2.10)
867 1095 orotidine monophosphate pyrophosphorylase,
exon 2
1145 1492 orotidine monophosphate pyrophosphorylase,
exon 3
pre-msg < 714 1527 URA5 mRNA and introns
IVS 815 866 URA5 intron A
IVS 1096 1144 URA5 intron B
BASE COUNT 512 a 515 c 479 g 523 t
ORIGIN
1 gatcttgggg atggtattga agacgatcca gaaaaagtct ctccatccct cccacgttcg
61 ctgttcggtt tctttgcaaa gagcgaagtt gctcgacctt tgacagaccc taaaatctgg
121 tcgtattttg cgttcaaaga tcgcttactg cgaccgcgat tttttagagg tggatttttt
181 gttgcgagac tagcgcatcg gataggcatg ggtgatatac gtttgcgttc cttatgaggg
241 gtatcatcag tggcaaatta ccgactagat gaagaagacg acctacaccc cgctaaccgg
301 tgtccgatcg acatgatcca cggtgtccaa agcgaagaag gacatgatga ttggagttcg
361 cttacctttg gtataaacca gtgttgtttc tgaagaggtg aggaagaaaa tgatctaggg
421 caataacaga aatcgtttga agaagaagta aaagaaaaga cacacgtcag acaagcatca
481 tcatcattca ttttgaatac tcatcacggc accttcttcc ttgcatgcgc attattcatg
541 atgagatgct cgcagccgcc ggatttcggc tgcaatcaag aacgatatcg cgtacggcga
601 tgtccgcctc cactccatct tccaacaacc accacagtgc ccgataacac tacgacgaaa
661 attgttcttt tttggatctt gtcgacaacg gcgaccaaaa caagaacttg atcatgtcct
721 cccaagccct cgactccgcc aaagttgcct tcatcgaggc tgccatcgaa catggcgtgc
781 ttcttttcgg caactttacc ttgaagtccg gccggtgagc catattgcag cgcttcacag
841 tccaatcgaa tctgacatgt gttcagccaa tccccttact tcttcaatgc cggtctcctt
901 tactcttcat cgcttctctc aactaccgct caggcttacg ccaaggtact ttcctcttct
961 aggattcctg actttgacgt cctcttcggc ccagcttaca agggtatctc cttggctgct
1021 gtctccgctg taagccttta tcagcaaacc ggcaaagata tcggctactg ctacaacagg
1081 aaggagaaga aggacgtgag tctgtcctaa ccagtgcgac agcgatgagc tcataagcca
1141 gtagcacggt gagggcggta ctatggtcgg tgcgcctctc aagggacgaa tcgtcatcat
1201 cgacgatgtt ctcacctctg gcaaggccat ccgtgaagct attgacattc tcaaggcctc
1261 ccctgaagcg aagcttgtcg gaattgtcca gcttgtcgac agacaagaga aaggccagag
1321 cggtagcggc aagagtaccg tacaggaggt tgaggaagag ttcggtgtgc ctgtcgagcc
1381 tattattggt ttggacgaca ttgtgaagta cttagaaagc tccggcaagt gggaaaagga
1441 gctgcaagag gtcaggaagt acagggcgga gtacggtgtt cagaggtctt aagggttttc
1501 ttcttaaatg cacgggttta ggtctagcta atcaagttcc gacatattac aagtttgtaa
1561 gcttgtatca aaggaactta agtacaggca ggcgtcctga ggcgacaaag gaagctgtaa
1621 tatgattgtt ggctgtcaat cttcatcgta tctactttgt caatactgac ttcaatgacc
1681 caataataca attttattag tgttgaccca gaatggttag caggaaactc cccttctctt
1741 cctctcaatc ccaatcatac ttcatatctc ctgctccccc catttccgtc ttcctcgatg
1801 actccctggt cccatccctc ccacctcctg gaggcaagct ggagcacctg gacctaatgg
1861 gtcgtcgccc caaattgcct ccacctctta agacaatcat cgtccaatca aactctgaac
1921 tatcttccaa gccaatggcc ggatctggac acaagatgat gtcgaagccg cttgatgtgc
1981 cagggttggt ccgtggcctg gagacgtaag agcgggtagt actgggatc
//
LOCUS ECOCHEA 2190 bp ds-DNA BCT 21-JUL-1990
DEFINITION E.coli cheA protein (complete cds), motB protein (3' end),
and cheW protein (5' end) genes.
ACCESSION M34669
KEYWORDS cheA gene; cheW gene; motB gene.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2190)
AUTHORS Kofoid,E.C. and Parkinson,J.S.
TITLE Tandem translational starts in the cheA locus of E.coli
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.C.Kofoid, 01-JUN-1990.
Author address: E.C.Kofoid
University of Utah
Dept of Biology
Salt Lake City, UT 84112
email: kofoid@bioscience.utah.edu
FEATURES from to/span description
pept < 1 104 MotB protein
pept 109 2073 CheA protein (alt.)
pept 400 2073 CheA protein (alt.)
pept 2094 > 2190 CheW protein
BASE COUNT 575 a 546 c 609 g 460 t
ORIGIN 42 min on K12 map.
1 atgaaaacgc cgaaagccag aatgagccag taagcgccct ggaaaaacct gaggttgcac
61 cacaggtcag tgttcccaca atgccatcag ccgaaccgag gtgacagcgt gagcatggat
121 ataagcgatt tttatcagac attttttgat gaagcggacg aactgttggc tgacatggag
181 cagcatttgc tggttttgca gccggaagcg ccagatgccg aacaattgaa tgccatcttt
241 cgggctgccc actcgatcaa aggaggggca ggaacttttg gcttcagcgt tttgcaggaa
301 accacgcatc tgatggaaaa cctgctcgat gaagccagac gaggtgagat gcaactcaac
361 accgacatta tcaatctgtt tttggaaacg aaggacatca tgcaagaaca gctcgacgct
421 tataaacagt cgcaagagcc ggatgccgcc agcttcgatt atatctgcca ggccttgcgt
481 caactggcat tagaagcgaa aggcgaaacg ccatccgcag tgacccgatt aagtgtggtt
541 gccaaaagtg aaccgcaaga tgagcagagt cgcagtcagt cgccgcgacg aattatcctt
601 tcgccgctga aggccgggga agtcgacctg ctggaagaag aactgggaca tctgacaacg
661 ttaactgacg tggtgaaagg ggcggattcg ctctcggcaa tattaccggg cgacatcgcc
721 gaagatgaca tcacagcggt actctgtttt gtgattgaag ccgatcagat tacctttgaa
781 acagtagaag tctcgccaaa aatatccacc ccaccagtgc ttaaactggc agccgaacaa
841 gcgccaaccg gccgcgtgga gcgggaaaaa acgacgcgca gcaatgaatc caccagcatc
901 cgtgtagcgg tagaaaaggt tgatcaatta attaacctcg tcggcgagct ggttatcacc
961 cagtccatgc ttgcccagcg ttccagcgaa ctggacccgg ttaatcatgg tgatttgata
1021 accagcatgg ggcagttaca acgtaacgcc cgtgatttgc aggaatcagt gatgtcgatt
1081 cgcatgatgc cgatggaata tgtttttagt cgctatcccc ggctggtgcg tgatctggcg
1141 ggaaaactcg gcaagcaggt agaactgacg ctggtgggca gttctactga actcgacaaa
1201 agcctgatag aacgcattat cgacccgctg acccacctgg tacgcaatag cctcgatcac
1261 ggtattgaac tgccagaaaa acggctcgcc gcaggtaaaa acagcgtcgg aaatttaatt
1321 ctgtctgccg aacatcaggg cggcaacatt tgcattgaag tgaccgacga tggggcgggg
1381 ctaaaccgtg agcgaattct ggcaaaagcg gcctcgcaag gtttgactgt cagcgaaaac
1441 atgagcgacg acgaagtcgc gatgctgata tttgcacctg gcttctccac ggcagagcag
1501 gtcaccgacg tctccgggcg cggcgtcggc atggacgtcg ttaaacgtaa tatccagaag
1561 atgggcggtc atgtcgaaat ccagtcgaag cagggtactg gcactacgat ccgcatttta
1621 ctgccgctga cgctggccat cctcgacggc atgtccgtac gcgttgcgga tgaagttttc
1681 attctgccgc tgaatgctgt tatggaatca ctgcaacccc gtgaagccga tctccatcca
1741 ctggccggcg gcgagcgggt gctggaagtg cggggtgaat atctgcccat cgtcgaactg
1801 tggaaagtgt tcaacgtcgc gggcgcgaaa accgaagcca cccagggaat tgtggtgatc
1861 ttacaaagtg gcggtcgccg ctacgccttg ctggtggatc aattaattgg tcaacaccag
1921 gttgtggtta aaaaccttga aagtaactat cgcaaagtcc ccggcatttc tgctgcgacc
1981 attcttggcg acggcagcgt ggcactgatt gttgatgtct ccgccttgca ggcgataaac
2041 cgcgaacaac gtatggcgaa caccgccgcc tgaatgagta aaaaggtaac aatatgaccg
2101 gtatgacgaa tgtaacaaag ctggccagcg agccgtcacg gcaggaattt ctggtattta
2161 cccttggtga tgaagagtac ggtattgatg
//
LOCUS HAMHSP60A 2726 bp ds-DNA ROD 21-JUL-1990
DEFINITION C.griseus chaperonin (HSP60) non-functional pseudogene.
ACCESSION M34562
KEYWORDS chaperonin; pseudogene.
SOURCE C.griseus cell line CHO DNA, clone GC-1.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 2726)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
IVS 1738 1824 HSP60 intron A
BASE COUNT 915 a 513 c 579 g 719 t
ORIGIN
1 catggtgatt tttctttaga tattctgggc ctcctgaaac tgacaagatt ctgtaaagca
61 aaagacataa tcaacaagat aagacagcag tgcaaacaat gggaaaagtt cttcaccaac
121 cccatatcta gccaagggca aatttctaaa acatacaaag aactgatgaa agcagtcaac
181 aaaacaccaa ataatcgccc tcactctccg ccaccagcct gctttgccag atgagaccag
241 tgtcccgggc gctggctcct catgtcactc gggcctatgc caatgatgta aaatgtgggg
301 caaatccttg atccttaaag cttcaagttg tagacctttt agccgatgct gtagctgtta
361 caatggggcc aaagggaaga acagtgatta ttgaacagag ttgaggaagt cccaaagtaa
421 caaaagatgg ggtcattttt acaaaggcaa ttgatctaaa ggatacatac aaataatatc
481 agagctaaac ttgttcaagt tgttgccaat aactcaaatg aagaggctgg gatggcacta
541 ccactgctac tgttctggca agttctattg ccaacgaagg ctttgagaag atcagcaaag
601 ggtctaatcc agtagaaatc tggagaagtg tgatgttggc tgttgatgct gtaattgctg
661 aactaacaaa acaatctaaa catgtgacaa cccatgaaga aatttctcag gttgctacaa
721 tttctgcaaa tggacacata gacattggaa acatcatatc tgatgcaatg aaacatgttg
781 gagggaaaga cgtcatcaca ctgaaggata gaaataccct gaatgatgaa ttagaaatta
841 ttggaggcat gaagtttgaa agaagatgta tttctcagac atcaataagt caaaaatgtg
901 aattccaaga tgcctatgtt ctattgagtg aaaagaaaat ttctattgtc gagcccatta
961 ctgctcatag gaaacaaatg gtgatatttg ctgaagatgt tgatggagaa gctctaagca
1021 cactggtttt gaacaggcta aaagctggtc ttcagatcgt agcagttaaa gctccaggtt
1081 tttggggaca ataggaagaa taagcttaaa gatacggcta ttgctactga tggtgagcaa
1141 tttggagaac aggggctaaa actaaatatt gatgatggtc aagttcatga tttaggaaaa
1201 tttggagagg tcattgtccc caaagatgat gccatgcttt tgaaaggaac tgatgactaa
1261 actcaaattg agaaaccaat tcaagaaatc actaagcagc tagacatcac aactagtgaa
1321 tatgaaaagg aaaagctaaa cgaaacactt gctaaacttt cagatggtgc agctgtgttt
1381 aaggttggag gaacaaggga tgttgaagtg aaggagaaga aagacagagt tacaaatgct
1441 ctcaatgcta caagagcagc tgttgaaaag ggcattgttt taggacttag ctgtgctctg
1501 cctctgtgca tcccagcctt ggattcctta aagccttcaa atgaagatca gaaaataggt
1561 atagaaatta ttaaaagagc actcaaaatt cctgcagtga caattgctga gaatgcaggt
1621 gttgaataat ctttgagagt tgagaaaatt ctgcagggtt cctagggagt tggttattag
1681 gctatgcccg caaattttgt gaacatggta gaaaaaggaa tcattaatcc aacaaaggta
1741 aatgagtcag caattcttaa acattggatg gcttgttcta tggtatgtgt tttcattaat
1801 cattttataa acattttttt gtaggttgta agaactgctt tagtagatgc ctccttgcta
1861 acttcagcag aagctgtagt gacagaaatt ccaaaagaag agaaggacct tggaatgcgc
1921 acacatgcct ttaatcctag cacttgcgag gcagaggcag gcagatatct gtgagttcga
1981 ggccagcctg gactccggag tgagtgccag gatagtctcc aaagctatcc aaagagaaac
2041 cctgtcttga aaaacaaaca aacaaaaaag acatttgtac attcctgaaa aaaacccaaa
2101 taatccaatt aaactagggg tcagcctaat aaagataccc gaataggttt ttacatctta
2161 agagaaaaat ctactaataa atgacttgga gcaggtcctt cacagacaat atttagtgag
2221 agtaggtagc ctcatgtggg ttagtagttc cagttgcttc aaaatgccct caccttttgt
2281 cataggctca tgtggaggat ctactccatt ggatggtgat attaaaagta gcacatgggt
2341 acatgcctta agatattaag atccccaagt aaaaactgca tctcagatgc tctctgtttc
2401 ttgggccatg atatgggcag tgagatttct gtcacacttg ccaggaaaca cttatgatct
2461 gtctgtcaca atctcttata ttgtcaagta cttaaccaaa cacaatcacc catgacctct
2521 catatcctcc tggacttaga caggaggcac aagtctactc aggtataaat atttttactt
2581 ctgatgcccc acagtaacct acaatggctt cagtccagta cacaggtaag taaacccacc
2641 aatataccat tccagctgat tgataacctc ttagatacat cccatatgct gcagttccca
2701 aatggagaca acccaaagtt tcttga
//
LOCUS HAMHSP70A 2243 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Chinese hamster heat shock protein (cognate form of hsp70) mRNA,
complete cds.
ACCESSION M34561
KEYWORDS heat shock protein; stress protein.
SOURCE Chinese hamster cell line CHO, cDNA to mRNA, clone RS 23.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 2243)
AUTHORS Ahmad,S., Ahuja,R., Venner,T.J. and Gupta,R.S.
TITLE Identification of a protein altered in mutants resistant to
microtubule inhibitors with the constitutive form of the major heat
shock protein (hsp70)
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN% GUPTAR@,SSCVAX.McMASTER.CA
FEATURES from to/span description
pept 50 1990 heat shock protein (hsp70)
mRNA < 1 2243 hsp70 mRNA
signal 2220 2225 poly-A signal
BASE COUNT 654 a 486 c 551 g 552 t
ORIGIN
1 gttttgtgtg gtctcgtcgc cagcacagcc gggcctacag caagcaacca tgtctaaggg
61 acctgcagtt ggtattgatc ttggcaccac ctactcctgt gtgggtgtct tccagcatgg
121 aaaggtggag ataattgcca atgaccaggg taaccgaacc acgccaagct atgttgcttt
181 tactgacaca gaacgattaa ttggggatgc cgcgaagaat caggttgcaa tgaaccccac
241 caacacagtt tttgatgcca aacgtctgat cggacgtagg tttgatgatg ctgttgttca
301 gtctgatatg aagcactggc ccttcatggt ggtgaacgat gcaggcaggc ccaaggtcca
361 agtagaatac aaaggagagg caaaaagttt ctacccagag gaggtgtcct ctatggttct
421 gaccaagatg aaggaaattg cagaagccta tctcgggaag actgttacca atgctgtggt
481 cacagtgcca gcctacttca atgattctca gcgacaggct acaaaagatg ctggaactat
541 tgctggcctg aatgtcctgc gaattatcaa tgaaccaact gctgctgcta ttgcttatgg
601 gctagataag aaggttggag ctgaaaggaa tgtgctcatt tttgacttgg gaggtggtac
661 ttttgatgtg tctatcctca ctattgagga tggaattttt gaagtcaaat caacagccgg
721 agacacccac ttgggtggag aagactttga taacagaatg gtcaaccatt tcattgctga
781 gtttaagcga aacgacaaga aggacatcag cgagaacaag agagctgtca ggcgtctgcg
841 tactgcctgt gaacgggcaa agcgcaccct ctcctccagc acccaggcca gtattgagat
901 tgattctctc tatgagggaa ttgatttcta tacctccatt acccgtgctc gattcgaaga
961 attgaatgct gacctgttcc gaggcactct ggaccctgta gagaaggccc ttcgagatgc
1021 taaactagac aagtcacaga tccatgacat tgtcttggtg ggtggttcca ccagaatccc
1081 caagattcag aagcttctgc aagacttctt caatggaaaa gaactgaata agagcattaa
1141 ccccgatgaa gctgttgctt atggtgcagc tgtccaggcg gccattctat ctggagacaa
1201 gtctgagaat gttcaggatt tgttgctctt ggatgtcact cctctttccc ttggcattga
1261 aactgctggt ggagtcatga ctgtcctcat caagcgcaat accaccatcc ctaccaagca
1321 gacgcagact ttcactacct attctgacaa tcagcctggt gtgctcattc aggtctatga
1381 aggcgaaagg gccatgacca aggataacaa cctgcttgga aagtttgaac ttacaggcat
1441 ccctccagca ccccgtggtg ttcctcagat tgaagttact tttgacattg atgccaatgg
1501 catcctcaat gtctctgctg tagataagag cacaggcaag gagaacaaga tcactatcac
1561 caacgacaag ggccgcttga gcaaggagga tattgaacgc atggtccagg aagctgagaa
1621 gtacaaagct gaagatgaga agcagagaga taaggtttcc tccaagaatt cactggagtc
1681 ctatgctttc aacatgaaag caacagttga agacgagaaa cttcaaggca agatcaatga
1741 tgaggacaaa cagaagattc tagacaagtg taatgaaatc atcagttggc tggataagaa
1801 tcagactgca gagaaggaag aatttgaaca ccagcagaaa gaactggaga aagtctgcaa
1861 ccccatcatt accaagctgt accagagtgc tggtggcatg cctggaggaa tgcccggggg
1921 cttccctggt ggtggagctc ctccatctgg tggtgcttct tcagggccta ccattgaaga
1981 ggtggattaa gtcagtccaa gtagaggtgt agcattgttc cacagggaaa catttgaagg
2041 acccaaattt gcaaagtttc atggcagttt ttaaatttaa actgttaaca ataaattact
2101 gggcattctc aatacttgaa catggaacat gtacacagga aataaaacat tgctctttat
2161 aagcactgta tcctaagtgg aaaactcaat gtctaatatg gtggaaaatg gaatgtctaa
2221 ataaaactat ttaaattggc acc
//
LOCUS HAMTCP1A 1845 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Chinese hamster T-complex protein 1 (TCP-1) mRNA, complete cds.
ACCESSION M34665
KEYWORDS T-complex protein 1.
SOURCE Chinese hamster cell line CHO, cDNA to mRNA, clone pRG 501.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 1845)
AUTHORS Ahmad,S. and Gupta,R.S.
TITLE Cloning of a Chinese hamster ovary cell protein homologous to the
mouse T-complex protein TCP-1: Structural similarity to the
ubiquitous "Chaperonin" family of heat shock proteins
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 01-JUN-1990.
Author address: R.S.Gupta
McMaster University
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR@,SSCVAX.McMASTER.CA
FEATURES from to/span description
pept 89 1759 T-complex protein 1
signal 1818 1823 poly-A signal
BASE COUNT 530 a 368 c 472 g 475 t
ORIGIN
1 gttccccgct gtggtggccg cagtccggcc tctgagcgga gatcccggcc gtccgcggtg
61 ttagcgaggc cagccaggtt ccctaacgat ggagggccct ttgtccgtgt tcggggaccg
121 cagcactgga gaggcgatcc gctcccagaa tgttatggct gcagcttcca ttgccaacat
181 tgtgaaaagt tctctggggc cagttggctt ggataaaatg ttggtggatg acattggtga
241 tgtaaccatt accaacgatg gggccaccat cctgaagtta ctggaggtag aacaccctgc
301 agctaaagtt ctttgtgagc tggctgactt gcaagacaaa gaagtgggag atggaactac
361 ctctgtggta attattgcag cggagcttct gaaaaatgca gatgaactag tcaaacagaa
421 aattcatcca acatcagtta ttagtggcta tcggcttgcc tgcaaggaag cagtgcgcta
481 tatcagtgag aacctaatta ttaacacaga cgaacttgga agagactgtc tgattaatgc
541 cgctaagaca tccatgtctt ccaaaattat tggaataaat ggtgatttct ttgctaatat
601 ggtggtagat gctgtacttg ctgtcaaata cacagatctg agaggccagc ctcgatatcc
661 agtcaattct gttaatattc tgaaagccca tggaagaagt caggtagaaa gcatgctgat
721 caatggctat gcactgaact gtgtggtggg atctcagggc atgcccaaga ggatagttaa
781 tgcaaaaatt gcatgtcttg acttcagcct gcagaaaaca aaaatgaagc ttggtgtgca
841 ggtggttatt acagatcctg agaaattgga ccaaattaga cagagagaat cagacatcac
901 caaggagaga attgaaaaga ttctggcgac tggggccaat gttattctga ccactggtgg
961 gatagatgat atgtgtctga agtactttgt ggaggctggt gccatggctg ttagacgagt
1021 tctaaaaagg gatctcaagc gtatcgctaa agcttctgga gcaagtatcc tgtccacact
1081 ggccaatttg gaaggcgaag aaacttttga agctaccatg ctggggcaag cagaagaagt
1141 ggtccaggag agaatttgtg atgatgagct gatcttaatc aaaaatacca aagctcgtac
1201 atctgcatca atcatcttac gaggagcaaa tgatttcatg tgtgatgaaa tggagcgatc
1261 tttacatgat gctctttgtg tggtgaagag agttttggag tcaaaatctg tagtcccagg
1321 aggaggtgct gtagaagccg ccctgtccat ataccttgaa aactatgcaa ccagtatggg
1381 gtctcgggaa cagcttgcta ttgcagagtt tgcaaggtct cttcttgtta ttcctaatac
1441 actggcagtg aatgctgccc aagactccac tgacctggtt gccaagttac gagcttttca
1501 caatgaggct caagttaacc cagaacgtaa aaacctgaag tggattggtc ttgatttgat
1561 caatggaaaa ccacgagaca acaagcaagc aggggtgttt gaaccaacca tagttaaagt
1621 gaagagcctg aagtttgcaa cagaagctgc aatcactatt cttcgaatcg atgatctgat
1681 aaaattacac ccggaaacta aagacgataa acatggaagt tatgaaaatg ctgttcactc
1741 tggagccctt gatgactgat tggatttccc ttttatttat agcagtgtca ggtgcaatgt
1801 agccttgggt atctcacatt aaagtacagt gagctgtcca cctga
//
LOCUS HUMHP601A 1941 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human chaperonin (HSP60) non-functional pseudogene 1.
ACCESSION M34660
KEYWORDS chaperonin; pseudogene.
SOURCE Human fetal liver DNA, clone PS2-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 111 to 1941)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1941)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
BASE COUNT 624 a 329 c 446 g 541 t 1 others
ORIGIN
1 atatgaaata aagcctttat gtgctctacc taaattaata aatttattta ttttttttng
61 tatatttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct
121 gcctcgccat gcgacatgtc ctgccagcac cccacagaaa tgcttccatt ccattaccca
181 cagtctttca ccagatgaga ccagtgtcca ggctactggc tcctcacctc acttgaagtg
241 atggtaaaga tgtaaaattt ggtgctgatg tccgagcctt aatgcttaaa agtgtagacc
301 tttttgtcga tgtagccatt acaatggggc caaagggaag aacagtaatt attgaacata
361 gctgggcaag tcccaaagta acaaaagatg gtgtgactga tgcaaagtca attgacttaa
421 aagataaata taaaagcatt ggagctaaac ttgtccaaga tgttgccaat aacacagatg
481 aagagactgg ggatggcact atcacttgct gttgtactgg tatgctctat ttccaaataa
541 ggtttccaga aggttagcaa aggtgctaat ccagtggaaa tcaggagagg tgtatgttag
601 ctgatgctgt aattgctgaa cttaaaaagc agtctaaacc tgtgaccaaa cctgaagaaa
661 ttgcacaggt tgctacaatt tctgcaaatt gagacaaaga aattggcaac atcatctctg
721 atgcaatgaa aaaagttgga agaaagggca tcatcacagt aaaggatgga aaaacactga
781 ctgtagaatt agaaattatt gaaggcatga aatttgatcg aggatatatt tctccatact
841 ttattaatac atcaaaaggt gagaaatgtg aattccagga tgcctatgtt ctgttgcatg
901 aaaagcaaaa tttctacgtg ctccacgtcc attgtaactg ctcttgcaaa ttgccaatgc
961 ttactgtaag cctttggtca taattgctgg agatattgat ggagaagctc taactacact
1021 catcctgaat aggctaaagg ttggtcttca ggttgtggca gtcaaggctc cagggtttgg
1081 tgacaataga aagaaccagc ttaaagatac ggttattgct actggtggta cagtgtttgg
1141 agaagagggc tcacactaaa tcttgaagac gttcagcctc gtgacgtagg agagttggag
1201 aggtactgtg atcaagatat gccatgctct taaaagcaaa aggtaacaag tctcaaatta
1261 aaaatgtgtt caagaaatca ttgaccagtc agatgtcaca actatgtaat tgaaaaggaa
1321 aaagtgagtg gagaaacttt cagatggagt agctgtgctg aaggttggtg gcaagtgatg
1381 ttgaagttaa tgaagagaaa gacagagtta taggtgcact taatgctaca agagctgctg
1441 ttgaagaagg cattgtttag ggagggggtt gtgccctgct tcgatgcatt ccagccttgg
1501 actcattcac tccagctaat gaagataaaa taattggtat agaaattatt aaaagaacac
1561 tcaaaattcc agcaatgaca attcgtaaga atgcaggtgt tgatgcattt ttgatagttg
1621 agaaaattat gtaaagttcc tcagaagttg gttatgatac tatgttagga gatgtcgtga
1681 atatggtgga aaaagacatt attgacccaa caaaggttgt gagaactgct tcattggatg
1741 ctgctggcat gccctctata ttaactacag cagctgttgt agtcacagaa attcctaaag
1801 aagggaacag ccctggaatg ggtgcatgtg gaatgggagg tggcctgttc taattcctag
1861 aatagtgctt tacctttatt aatgaattgt gataggaacc caaggcagtg ttcctcacca
1921 ataacttcag agaagtcagt t
//
LOCUS HUMHP602A 1920 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human chaperonin (HSP60) non-functional pseudogene 2.
ACCESSION M34662
KEYWORDS chaperonin; pseudogene.
SOURCE Human fetal liver DNA, clone PS2-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 117 to 1920)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1920)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
BASE COUNT 619 a 324 c 431 g 544 t 2 others
ORIGIN
1 tggattatgn aaatnaaagc ctttatgtgc tctgcctaaa ttaatatttt attttttttt
61 gatacttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct
121 gcctcacctc accatgcaca tgtcctgcca ccaccccaca gaaatgcttc acttacccac
181 agtctttcgc cagatgagac cagtgtccag ggtactggct cctcaccctc acttgaagga
241 tggtaaagat gtaaaatttg gtgctgatgt ctgcgcctta atgcttcaaa gtgtagactt
301 ttgtcagtaa ttattgaaca tagctgggaa gtcccaaagt aacaaaagat ggtctgactg
361 atgcaaagtc aattgactta aaggataaat ataaaagcat tggagctaaa cttctccaag
421 atgttgccta aaacacaaca aatggagaga ctggggatgg cactatcact gctgctgtac
481 tggtatgctc tatttccaaa taagccttcc agaaggttag caaaggtgct aatccagtgg
541 aaatcatcag agagtgtgat gttagctgtt gatgctgtaa ttgctgaact taaaaagcag
601 tctaaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga
661 cacaaagaaa attggtaaca tcatctctga tgcaatgaaa aagtttggaa gaaagggcat
721 catcacagta aaggatggaa aaacactgac tgatgaatta gaaattattg aaggcatgaa
781 aaatttgatc gaggatatat ttctccatac tttcttaata catcaaaagg tgagaaatgt
841 gaattccagg atgcctatgt tctactgagt gaaaagaaca tttctggtgt ccagtccatt
901 gtacctgctc ttgaaattgc cagtgcttac cttaagcctt tggtcacaat tgctgaagat
961 attgatgaag aaacgttaag tacgctcatc ttgaataggc taaaagttgg tcttcaggtt
1021 gtagcagcca aagtttcagg ttttggtgac aacggagcca gcttaaagat atggctattg
1081 ctactggtag tgcagtgttt ggagaagagg ggttaaccct aatcttgaag atgctcagcc
1141 tcatgactca ggaaaatttg gagaggtcat tgtgccaaag tagtgctatg tatttaaaag
1201 gaaaagatga caaagcttaa attgaaaaat gtattcaaga aatcattaag tagttagata
1261 tcataactag tggatatgaa aagaaaaaac tgaatgaata tatggcaaaa ctttcagatg
1321 gagtagccgt gttgatggtt ggttggtgag acaagtgatg ttcaggtgaa agacaagaaa
1381 gacagattta ccgatgtcct taatgctaca agagctgcta ttgaagaagg cattgttttg
1441 ggagggggtt gtgccctgct tcgagcattc cactcattca ctccagctaa tgaagataaa
1501 aataatggca tagaaattat taaaagaaca ctcaaaattc cagcaatgac tattgctaag
1561 aatgcaggtg ttgatttttg atagttgaga aaattatgca aagttcctca gaagttggtt
1621 atgatactat gttaggagat gtcgtgaata tggtggaaaa agacattatt gacccaacaa
1681 aggttgtgag aactgcttta ttggatgctg cttgcatggc tctctattaa ctacagcagc
1741 tgttgtagtc acaaaaattc ctaaagaagg gaacagccct ggaatgggtg caatgtgtgg
1801 aatgggaggt ggcctgttct aattcctaga atagtgcttt acctttatta atgaattgtg
1861 atagggagcc caaggcgtgt tcctcaccaa taacttcaga gaagtcagtt ggagaaaatg
//
LOCUS HUMHP603A 1912 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human chaperonin (HSP60) non-functional pseudogene 3.
ACCESSION M34661
KEYWORDS chaperonin; pseudogene.
SOURCE Human fetal liver DNA, clone PS5-4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 99 to 1912)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1912)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
BASE COUNT 614 a 319 c 449 g 527 t 3 others
ORIGIN
1 ctttatgtgc tcttanctaa attaataatt tatttanntt tttttgtata tttgctatgt
61 cacctttaga aagtaagagg cagctctcac ttgctgctga tgacctgcct cgccatgcac
121 atgtctgcca gcaccccaca gaaatgcttc cattacccac agtctttcac cagatgagac
181 cagtgtccag ggtactggct cctcacctca cttgaagtga tggtaaagat gtaaaatttc
241 ctgctgatgt ccgagcctta atgcttcaaa gtgtagactt ttttgtcgat gctgtagcca
301 ttacaatggg gccaaaggga agaacagtaa ttattgaaca tagctgggga agtcccaaag
361 taacaaaaga tggtgtaact gatgcaaagt caattgactt aaaggataaa tataaaagca
421 ttggagctaa acttgtccaa gatgttgcca ataacacaga tgaagagact gggggatggc
481 actatcactg ctgctgtact ggtatgctct atattcaaat aaggcttcca gaaggttagc
541 aaaggtgcta atccagtgga aatcaagaga ggtgtgatgt tagctgttga tgctgtaatt
601 gctgaactta aaaagcagtg taaacctgtg accaaacctg aaattgcaca ggttgctaca
661 atttctgcaa atggagacaa agaaattggt aacatctctg atgcaatgaa aaagtttgga
721 agaaagggca tcacagtaaa ggatggaaaa acactgactg atgaattaga aattattgaa
781 ggcatgaaaa tttgatcgag gatatatttc tccatacttt attaatacat caaaaggtga
841 gaaatgtgaa ttccaggatg cctatgttct gttgcatgaa aagaaaattt ctagtgtcca
901 gtccattgta actgctcttg aaattgccaa tgcttactgt aagcctttgg tcataattgc
961 tggagacatt gatggagaag ctctaactac actcatcctg aataggctaa aggttggtct
1021 tcaggttgtg gcagtcaaag ctccagggtt tgggtacaat agaaagaacc agcttaaaga
1081 tacggtcatt gctactggtg gtacagtgtt tggagaagag ggctgacact aaatcttgaa
1141 gacgttcagc ctcgtgacgt aggagaagtt ggagaggtca ctgtgaacaa agattatgcc
1201 atgctcttaa aaggaaaagg taacaagtct caaattgaaa aatgtgttca agaaatcatt
1261 gaccagtcag atgtcacaac tagtgaatac gaaaaggaaa aagtgagtgg agaaactttc
1321 agatggagta gctgtgctga aggttgttgg tgggacaagt gatgttgaag tgaatgaaga
1381 gaaagacaga gttacaggtg cacttaatgc tacaagagct gctgttgaag aaggcattgt
1441 ttagggaggg ggttgtgccc tgcttcgatg cattccagcc ttggactcat tcactccagc
1501 taatgaagat aaataattgg tatagaaatt attaaaagaa cactcaaaat tccagcaatg
1561 acaattgcta agaatgcagc tgttgatggc attttgatag ttgagaaaat tatgtaaagt
1621 gcctcagaag ttggttatga tactatgtta ggagatgtcg tgaatatggt gaaaaagaca
1681 ttattgaccc aacaaaggtt gtgagaactg cttcattgga tgctgctggc atggcctctc
1741 tattaactac agcagctgtt gtagtcacag aaattcctaa agaagagaac agccctggaa
1801 taggtgcaat gtgtggaatg ggaggtggcc tgttcaaatt cctagaatag tgctttacct
1861 ttattaatga attgtgatag gaacccaagg cagtgttcct caccaataac tt
//
LOCUS HUMHP604A 2019 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human chaperonin (HSP60) non-functional pseudogene 4.
ACCESSION M34663
KEYWORDS chaperonin; pseudogene.
SOURCE Human fetal liver DNA, clone PS5-4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 123 to 2019)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2019)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
BASE COUNT 657 a 327 c 479 g 556 t
ORIGIN
1 agagaaatgt aatatgaaat aaagctttat gtgctctacc taaattaata atttatttat
61 ttttttttgt atatttgcta tgtcaccttt agaaagtaag aggcagctct cacttgctga
121 ttgatgacct gatgatgacc tgcctcgcca tgcacatgtc ctgccagcac cccacagaaa
181 tgcttccatt acccacagtc tttcaccaga tgagaccagt gtccaggcta ctggctactg
241 gctcctcacc tcacttgaag tgatggtaaa gatgtaaaat ttggtgctga tgtccgagcc
301 ttaatgcttc aaagtgtata gcttttgtcg agctgttagc cattacaaat gggccaaagg
361 gaagaacagt agattattga acatagctgg ggaagtccca aagtaacaaa agagtggtgt
421 gactgatgca aaggcaattg acttaaagga taaatataaa agcattggag ttaaacttgt
481 ccaagatgtt gccaataaca cagatgaaga agactgggga tggcactatc actgctgctg
541 tactggtatg ctctatttcc aaataaggct tccagaaggt tagcaaaggt gctaatccgt
601 ggaaatcagg agaggtgtga tgttagctgt tgatgctgta attgctgaac ttaaaaagca
661 gtctaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga
721 gacaaagaaa ttggtaacat catctctgat gcaatgaaaa actttggaag aaagggcatc
781 atcacagtaa aggatggaaa aacactgact gatgaattag aaattattga aggcatgaaa
841 atttgatcga ggatatattt ctccatactt tattaataca tcaaaaggtg agaaatgtga
901 attccaggat gcctatgttc tgttgcatga aaagaaaatt tctagtgtcc agtccattgt
961 aactgctctt gaaattgcca atgcttagtg taagcctttg gtcatatatt gctggagaca
1021 ttgatggaga agctctaact acactcatcc tgaataagct aaaggttggt cttcaggttg
1081 tggcagtcaa agctaaagct ccagggtttg gtgacaatag aaagaaccag cttaaagata
1141 cggttattgc tactggtggt acagtgtttg gagaagaggg ctgacactaa atcttgaaga
1201 cgttcagcct cgtgacgtag gagaagttgg agaggtcact gtgatcaaga ttatgccatg
1261 ctcttaaaag gaaaaggtaa caagtctcaa attgaaaaat gtgtgcaaga aatcgttgac
1321 cagtcagatg tcacaactat gtaatacgaa aaggaaaaag tgagtgaaga aactttcaga
1381 tggagtagct gtgctgaagg ttggtgggac aagtgatgtt gaagtgaatg aagagaaaga
1441 cagagttata ggtgcactta atgctacaag agctgctgtt gaagaaggca ttgtttaggg
1501 agggggttgt gccctgcttc gatgcattcc agccttggac tcattcactc cagctaatga
1561 agataaaata attggtatag aaattattaa aagaacactc aaaattccag caatgacaat
1621 tgctaagaat gcaggtgttg atgcattttt gatagttgag aaaattatgt aaagttcctc
1681 agaagttggt tatgatacta tgttaggaga tgtcgtgaat atggtggaaa aagacattat
1741 tgacccaaca aaggttgtga gaactgcttc attggatgct gctgcatggc ctctctatta
1801 ttaactacag cagctgttgt agtcacagaa attcctaaag aagggaacag ccctggaatg
1861 ggtggaatgg gtgcaatgat atgtggaatg ggaggtggcc tgttctaatt cctagaatag
1921 tgctttacct ttattaatga attgtgatag gaaacccaag gcagtgttcc tcaccaataa
1981 cttcagagaa gtcagttgga gaaaatgaag aaaaaggct
//
LOCUS HUMHSP60A 2202 bp ss-mRNA PRI 21-JUL-1990
DEFINITION Human chaperonin (HSP60) mRNA, complete cds.
ACCESSION M34664
KEYWORDS chaperonin.
SOURCE Human placenta cDNA to mRNA, and DNA, clone PGEM-10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2202)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
pept 25 1746 chaperonin (HSP60)
BASE COUNT 699 a 371 c 538 g 594 t
ORIGIN
1 cacgcttgcc gccgccccgc agaaatgctt cggttaccca cagtctttcg ccagatgaga
61 ccggtgtcca gggtactggc tcctcatctc actcgggctt atgccaaaga tgtaaaattt
121 ggtgcagatg cccgagcctt aatgcttcaa ggtgtagacc ttttagccga tgctgtggcc
181 gttacaatgg ggccaaaggg aagaacagtg attattgagc agggttgggg aagtcccaaa
241 gtaacaaaag atggtgtgac tgttgcaaag tcaattgact taaaagataa atacaagaac
301 attggagcta aacttgttca agatgttgcc aataacacaa atgaagaagc tggggatggc
361 actaccactg ctactgtact ggcacgctct atagccaagg aaggcttcga gaagattagc
421 aaaggtgcta atccagtgga aatcaggaga ggtgtgatgt tagctgttga tgctgtaatt
481 gctgaactta aaaagcagtc taaacctgtg accacccctg aagaaattgc acaggttgct
541 acgatttctg caaacggaga caaagaaatt ggcaatatca tctctgatgc aatgaaaaaa
601 gttggaagaa agggtgtcat cacagtaaag gatggaaaaa cactgaatga tgaattagaa
661 attattgaag gcatgaagtt tgatcgaggc tatatttctc catactttat taatacatca
721 aaaggtcaga aatgtgaatt ccaggatgcc tatgttctgt tgagtgaaaa gaaaatttct
781 agtatccagt ccattgtacc tgctcttgaa attgccaatg ctcaccgtaa gcctttggtc
841 ataatcgctg aagatgttga tggagaagct ctaagtacac tcgtcttgaa taggctaaag
901 gttggtcttc aggttgtggc agtcaaggct ccagggtttg gtgacaatag aaagaaccag
961 cttaaagata tggctattgc tactggtggt gcagtgtttg gagaagaggg attgaccctg
1021 aatcttgaag acgttcagcc tcatgactta ggaaaagttg gagaggtcat tgtgaccaaa
1081 gacgatgcca tgctcttaaa aggaaaaggt gacaaggctc aaattgaaaa acgtattcaa
1141 gaaatcattg agcagttaga tgtcacaact agtgaatatg aaaaggaaaa actgaatgaa
1201 cggcttgcaa aactttcaga tggagtggct gtgctgaagg ttggtgggac aagtgatgtt
1261 gaagtgaatg aaaagaaaga cagagttaca gatgccctta atgctacaag agctgctgtt
1321 gaagaaggca ttgttttggg agggggttgt gccctccttc gatgcattcc agccttggac
1381 tcattgactc cagctaatga agatcaaaaa attggtatag aaattattaa aagaacactc
1441 aaaattccag caatgaccat tgctaagaat gcaggtgttg aaggatcttt gatagttgag
1501 aaaattatgc aaagttcctc agaagttggt tatgatgcta tggctggaga ttttgtgaat
1561 atggtggaaa aaggaatcat tgacccaaca aaggttgtga gaactgcttt attggatgct
1621 gctggtgtgg cctctctgtt aactacagca gaagttgtag tcacagaaat tcctaaagaa
1681 gagaaggacc ctggaatggg tgcaatgggt ggaatgggag gtggtatggg aggtggcatg
1741 ttctaactcc tagactagtg ctttaccttt attaatgaac tgtgacagga agcccaaggc
1801 agtgttcctc accaataact tcagagaagt cagttggaga aaatgaagaa aaaggctggc
1861 tgaaaatcac tataaccatc agttactggt ttcagttgac aaaatatata atggtttact
1921 gctgtcattg tccatgccta cagataattt attttgtatt tttgaataaa aaacatttgt
1981 acattcctga tactgggtac aagagccatg taccagtgta ctgctttcaa cttaaatcac
2041 tgaggcattt ttactactat tctgttaaaa tcaggatttt agtgcttgcc accaccagat
2101 gagaagttaa gcagcctttc tgtggagagt gagaataatt gtgtacaaag tagagaagta
2161 tccaattatg tgacaacctt tgtgtaataa aaatttgttt aa
//
LOCUS MUSGCANF 4049 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse guanylate cyclase/atrial natriuretic factor mRNA,
complete cds.
ACCESSION J05504
KEYWORDS atrial natriuretic factor; guanylate cyclase.
SOURCE Mouse (strain C57B1/6J) Leydig cell line MA-10, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4049)
AUTHORS Pandey,K.N. and Singh,S.
TITLE Molecular cloning and expression of murine guanylate cyclase/atrial
natriuretic factor receptor cDNA
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.N.Pandey, 31-MAY-1990.
FEATURES from to/span description
pept 348 3521 guanylate cyclase/atrial natriuretic factor
signal 4029 4034 poly-A signal
BASE COUNT 879 a 1164 c 1140 g 866 t
ORIGIN
1 cagaaaccct cccaaactcc tatagccaca cacacctttc ccggccaaga tccaaacaaa
61 cctctacttt cctcttccct aggagccaga ctcccttcgg gtgctgcgct cgctctcacc
121 tgctctaaag cacctccgct ctcggacgct cccaattccg ccctcctgct cgacggcggg
181 acagtcgcag cctcggcagg cagcttgctc tcgccgctgc ggcttcaacc cagccccctc
241 cctcgctacg gctgggcgct cttgactccc gaccctcgcc tctgagcccg aggacggcga
301 tcagaccatg gtgacagcgc tgctccgtcg ctgcgctcgc tgaggccatg ccgcgttccc
361 gacgcgtccg tccgcgccta agggcgctgc tgctgctacc gccgctgctg ctgctccgaa
421 gcggccacgc gagcgacctg accgtggctg tggtgctgcc cgtgaccaac acctcgtacc
481 cgtggtcctg ggcgcgtgta gggccggcgg tggaactggc tctcgggagg gtgaaggctc
541 ggccggactt gctgccgggt tggacggtcc gtatggtgct gggcagcagc gagaacgcgg
601 cgggcgtctg ctccgacacc gctgcaccgc tggccgcggt ggatctcaag tgggagcaca
661 gccccgccgt gttcctgggc cccggctgcg tatactctgc tgccccggtg gaccgcttca
721 ccgcgcactg gcggttgccg ctgctgacgg ctggcgcccc ggctctgggc atcggggtga
781 aggatgagta cgcgttaacc acccgcacag gacccagcca tgtcaagctg ggcgacttcg
841 tgacggcgct gcatcgacgg ctgggctggg agcaccaggc gcttgtgctc tatgcagatc
901 ggctgggcga cgaccggccg tgcttcttca tagtggaggg gctgtacatg cgggtgcgtg
961 agcgactcaa catcacagta aatcaccagg agttcgtcga gggcgacccg gaccactaca
1021 ccaagctact gcggaccgtg cagcgcaagg gcagagttat ctacatctgc agttctccgg
1081 atgccttcag gaatctgatg cttttggccc tggatgctgg cctgactggg gaggactatg
1141 ttttcttcca cctggatgtg tttgggcaaa gccttcaggg tgctcagggc cctgttccag
1201 agaagccctg ggaaagagac gatgggcagg ataggagagc ccgccagcgc tttcaggctg
1261 caaaaattat tacttacaaa gaacccgata atcctgagta cttggaattc ctgaagcagc
1321 taaaactctt ggctgacaag aaattcaact tcaccatgga ggatggcctg aaaaatatca
1381 tcccagcatc cttccatgac gggctcctgc tctatgtcca ggcagtgaca gagactctgg
1441 cacagggggg cactgtcact gatggagaga acatcactca gcggatgtgg aaccgaagct
1501 tccaaggtgt gacaggatac ctgaaaattg atagaaatgg agatcgggac actgattctc
1561 ctctctggga tatggacccc gagacaggtg ccttcagggt tgtcctgaac tttaatggta
1621 cttcccagga gctgatggct gtgtcagaac acagattata ctggcctctg ggatacccac
1681 ctcctgacat ccctaaatgt ggctttgaca atgaggaccc agcctgcaac caagaccact
1741 tttccacact ggaggttctg gctttggtgg gcagcctctc tctggttagc tttctgatcg
1801 tgtctttctt catatacagg aagatgcagc tggaaaagga gctggtctca gagttgtggc
1861 gggtgcgctg ggaggacttg cagcccagca gcctggagag gcaccttcgg agcgctggca
1921 gtcggctgac cctgagtggg cgaggctcca attatggctc cctgctaacc acggagggcc
1981 agttccaagt ctttgccaag acagcatact ataagggcaa cctcgtggct gtgaaacgtg
2041 tgaaccggaa acgcattgag ttgacacgaa aagtcctgtt tgaacttaaa catatgcggg
2101 atgtgcagaa tgagcaattg accagatttg tgggagcttg taccgaccct cccaacatct
2161 gtatcctcac agagtactgt ccccgtggaa gcctacagga cattctagag aatgagagta
2221 ttaccctgga ctggatgttt cggtactcac tcaccaatga cattgtcaag ggaatgctct
2281 ttctacacaa cggggccatt tgttcccatg ggaacctcaa gtcatccaac tgcgtggtag
2341 atggacgttt tgtgttaaag atcacagact atgggctcga gagcttcaga gacccggagc
2401 cagagcaagg acacaccctc tttgccaaaa aactgtggac tgcacctgag ctcctgcgaa
2461 tggcttcccc acctgcccgt ggctcccaag ctggggatgt ctacagtttt ggtatcatcc
2521 ttcaggaaat tgccctaaga agtggggtct tctatgtgga aggtttggac ctcagcccaa
2581 aagagatcat tgagcgtgtg actcggggtg agcagccccc attccgacct tccatggatc
2641 tgcagagcca cctggaggaa ctggggcagc tgatgcagag gtgctgggca gaggatcctc
2701 aggagcggcc accctttcaa cagatccgcc tggcgctgcg caagttcaac aaggagaaca
2761 gcagcaacat cctggacaac ctgctgtcac gcatggaaca gtacgccaac aacctggagg
2821 aactggtaga ggagagaaca cagccttatc tggaggagaa gcgcaaagct gaggccctgc
2881 tttaccagat tctgcctcac tctgtggctg agcagctgaa gagaggcgag acagtccagg
2941 ctgaggcatt tgatagtgtt actatctatt tcagtgatat cgtgggcttt acagctcttt
3001 cagcagagag cacacccatg caggtggtca ccctgctcaa tgatctgtac acctgttttg
3061 atgctgtcat agacaacttt gatgtgtaca aggtagagac cattggtgat gcttacatgg
3121 tggtatcagg gctcccagtg aggaatggac agctccatgc ccgagaggta gcccgaatgg
3181 cacttgcact gctcgatgct gtacgctcct tccgcatcgg ccataggccc caggaacagc
3241 tgcgcttgcg cattggaatt cacacaggtc ctgtgtgtgc tggtgtggta gggctaaaga
3301 tgccccgata ctgcctcttt ggagacacag tcaacacagc ttcaagaatg gagtctaatg
3361 gggaagccct caggatccac ttgtcttcgg agaccaaggc tgtgctggaa gagttcgatg
3421 gtttcgagct ggagctccga ggggatgtgg aaatgaaggg caaaggcaag gttcgttcct
3481 attggctcct cggggaccgg ggatgcagct ctcgagcctg acctactgcc ctgctattcc
3541 ttgtcacctc ccctccctat cccagcaatg acacgggtct ccaacttccc cctctcccac
3601 agcagctcag ccactgtgga aagattaggg acctaaccag cgcagtcatc agatgtgacc
3661 tctgagagag gatggagatg gtggggactg gagggggact cctaagttta tagggctgac
3721 tgaaataccc agtcactccc gtagcacatg ccccgccccc cccccgcccc cccactcagc
3781 tgcctagcag acagtgattc cttctgccgc cctcaactta gctccactgt gagttagagg
3841 gagggaaatt gccacctgaa ggaaagagaa aagagattct cggggtttgc aggaggcagg
3901 cagtcctgtg tcacaaatac tcccctcact cccagtccac cacctgcccc accgacttcc
3961 cttcccacac agtgcactga ggagaagaga ggcatggggt tgccttgctt ctcctatgag
4021 caaaacccat taaagtcttt attcctgtg
//
LOCUS MUSMHH2DR 1415 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse MHC class I H-2Dr protein mRNA, complete cds.
ACCESSION M34961
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Mouse (strain B10.RIII) spleen, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1415)
AUTHORS Zeling,C. and Pease,L.R.
TITLE Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Zeling, 29-MAY-1990.
Author address: C.Zeling
Mayo Clinic
323 Guggenheim
200 1st St. SW
Rochester, MN 55905
email: Pease@mayo.edu
FEATURES from to/span description
pept 1 1089 MHC H-2Dr protein precursor
sigp 1 72 MHC H-2Dr protein signal peptide
matp 73 1086 MHC H-2Dr protein
BASE COUNT 298 a 398 c 442 g 277 t
ORIGIN
1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact
61 cagacccgcg cgggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc
121 ctcggggagc cccggtacat agaagtcggc tacgtggacg acacggagtt cgtgcgcttc
181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag
241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagat tttccgagtg
301 aacctgagga ccctgctcag ctactacaac cagagcaagg gcggctctca cacgatccag
361 tggatgttcg gctgtgacgt ggagtcggac gggcgcctcc tccgcgggta ccagcagtac
421 gcatacgacg gctgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg
481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg ctggtgctgc agagagaaag
541 agggcctacc tagagggcac gtgcgtggag tggctcctca gacacctgaa gaacgggaac
601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaaa
661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc
721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca
781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctctcgggaa ggagcagaat
841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct
901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg
961 gccattattg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa
1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt
1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc
1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc
1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc
1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga
1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag
1381 aataagaatt tgaatgtggg tggctggaga gatgg
//
LOCUS MUSMHH2DS 1415 bp ss-mRNA ROD 21-JUL-1990
DEFINITION M.musculus MHC class I H-2Ds protein mRNA, complete cds.
ACCESSION M34962
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE M.musculus (strain B10.RIII) spleen, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1415)
AUTHORS Zeling,C. and Pease,L.R.
TITLE Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Zeling, 29-MAY-1990.
Author address: C.Zeling
Mayo Clinic
323 Guggenheim
200 1st St. SW
Rochester, MN 55905
email: Pease@mayo.edu
FEATURES from to/span description
pept 1 1089 MHC H-2Dr protein precursor
sigp 1 72 MHC H-2Dr protein signal peptide
matp 73 1086 MHC H-2Dr protein
BASE COUNT 292 a 401 c 444 g 278 t
ORIGIN
1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact
61 cagacccgcg ctggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc
121 ctcggggagc cccggtacat agaagtcggc tacgtggaca acacggagtt cgtgcgcttc
181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag
241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagag tttccgagtg
301 aacctgagga ccgcgctccg ctactacaac cagagcaagg gcggctctca cacggtccag
361 gtgatgtctg gctgtgacgt ggggtcggac gggcgcctcc tccgcgggta ccagcagcac
421 gcatacgacg gccgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg
481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg gtggtgatgc agagcattac
541 aaggcctacc tagagggcac gtgcgtggag tggctcctca gatacctgga gctcgggaac
601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaga
661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc
721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca
781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctcttgggaa ggagcagaat
841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct
901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg
961 gccattactg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa
1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt
1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc
1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc
1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc
1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga
1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag
1381 aataagaatt tgaatgtggg tggctggaga gatgg
//
LOCUS HUMCETP 1787 bp ss-mRNA PRI 21-JUL-1990
DEFINITION Human cholesteryl ester transfer protein mRNA, complete cds.
ACCESSION M30185
KEYWORDS cholesteryl ester transfer protein; transfer protein.
SOURCE Human adult liver, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1787)
AUTHORS Drayna,D., Jarnagin,A.S., McLean,J., Henzel,W., Kohr,W.,
Fielding,C. and Lawn,R.
TITLE Cloning and sequencing of human cholesteryl ester transfer protein
cDNA
JOURNAL Nature 327, 632-634 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 131 1612 cholesteryl ester transfer protein precursor
/nomgen="CETP" /map="16q21"
/hgml_locus_uid="LF0130L"
sigp 131 181 cholesteryl ester transfer protein signal
peptide
matp 182 1609 cholesteryl ester transfer protein
mRNA < 1 1787 CETP mRNA
BASE COUNT 397 a 531 c 456 g 403 t
ORIGIN
1 gtgaatctct ggggccagga agaccctgct gcccggaaga gcctcatgtt ccgtgggggc
61 tgggcggaca tacatatacg ggctccaggc tgaacggctc gggccactta cacaccactg
121 cctgataacc atgctggctg ccacagtcct gaccctggcc ctgctgggca atgcccatgc
181 ctgctccaaa ggcacctcgc acgaggcagg catcgtgtgc cgcatcacca agcctgccct
241 cctggtgttg aaccacgaga ctgccaaggt gatccagacc gccttccagc gagccagcta
301 cccagatatc acgggcgaga aggccatgat gctccttggc caagtcaagt atgggttgca
361 caacatccag atcagccact tgtccatcgc cagcagccag gtggagctgg tggaagccaa
421 gtccattgat gtctccattc agaacgtgtc tgtggtcttc aaggggaccc tgaagtatgg
481 ctacaccact gcctggtggc tgggtattga tcagtccatt gacttcgaga tcgactctgc
541 cattgacctc cagatcaaca cacagctgac ctgtgactct ggtagagtgc ggaccgatgc
601 ccctgactgc tacctgtctt tccataagct gctcctgcat ctccaagggg agcgagagcc
661 tgggtggatc aagcagctgt tcacaaattt catctccttc accctgaagc tggtcctgaa
721 gggacagatc tgcaaagaga tcaacgtcat ctctaacatc atggccgatt ttgtccagac
781 aagggctgcc agcatccttt cagatggaga cattggggtg gacatttccc tgacaggtga
841 tcccgtcatc acagcctcct acctggagtc ccatcacaag ggtcatttca tctacaagaa
901 tgtctcagag gacctccccc tccccacctt ctcgcccaca ctgctggggg actcccgcat
961 gctgtacttc tggttctctg agcgagtctt ccactcgctg gccaaggtag ctttccagga
1021 tggccgcctc atgctcagcc tgatgggaga cgagttcaag gcagtgctgg agacctgggg
1081 cttcaacacc aaccaggaaa tcttccaaga ggttgtcggc ggcttcccca gccaggccca
1141 agtcaccgtc cactgcctca agatgcccaa gatctcctgc caaaacaagg gagtcgtggt
1201 caattcttca gtgatggtga aattcctctt tccacgccca gaccagcaac attctgtagc
1261 ttacacattt gaagaggata tcgtgactac cgtccaggcc tcctattcta agaaaaagct
1321 cttcttaagc ctcttggatt tccagattac accaaagact gtttccaact tgactgagag
1381 cagctccgag tccatccaga gcttcctgca gtcaatgatc accgctgtgg gcatccctga
1441 ggtcatgtct cggctcgagg tagtgtttac agccctcatg aacagcaaag gcgtgagcct
1501 cttcgacatc atcaaccctg agattatcac tcgagatggc ttcctgctgc tgcagatgga
1561 ctttggcttc cctgagcacc tgctggtgga tttcctccag agcttgagct agaagtctcc
1621 aaggaggtcg ggatggggct tgtagcagaa ggcaagcacc aggctcacag ctggaaccct
1681 ggtgtctcct ccagcgtggt ggaagttggg ttaggagtac ggagatggag attggctccc
1741 aactcctccc tatcctaaag gcccactggc attaaagtgc tgtatcc
//
LOCUS HUMGHRA04 289 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human growth hormone receptor gene, exon 5.
ACCESSION M28461 M26401
KEYWORDS growth hormone binding-protein; growth hormone receptor;
transmembrane protein.
SEGMENT 4 of 10
SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 289)
AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
TITLE Characterization of the human growth hormone receptor gene and the
demonstration of a partial gene deletion in two patients with
Laron-type dwarfism
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by W.I.Wood, 02-AUG-1989.
FEATURES from to/span description
pept + 59 + 231 growth hormone receptor, exon 5 /nomgen="GHR"
/map="5p13-p12" /hgml_locus_uid="LA0219G"
pre-msg < 1 > 289 growth hormone receptor mRNA and introns
IVS < 1 > 58 GHR Intron D
IVS 232 > 289 GHR Intron E
BASE COUNT 90 a 45 c 55 g 99 t
ORIGIN About 6 kb after segment 3.
1 acttaagcta caacatgatt tttggaacaa ttaatctttt tttaaccctt cattttagga
61 acactcaaga atggactcaa gaatggaaag aatgccctga ttatgtttct gctggggaaa
121 acagctgtta ctttaattca tcgtttacct ccatctggat accttattgt atcaagctaa
181 ctagcaatgg tggtacagtg gatgaaaagt gtttctctgt tgatgaaata ggtaaatcac
241 aggtttttgt ttcatttgac atagttttag actaaataaa tggggaagc
//
LOCUS HUMGHRA06 362 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human growth hormone receptor gene, exon 7.
ACCESSION M28463 M26401
KEYWORDS growth hormone binding-protein; growth hormone receptor;
transmembrane protein.
SEGMENT 6 of 10
SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 362)
AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
TITLE Characterization of the human growth hormone receptor gene and the
demonstration of a partial gene deletion in two patients with
Laron-type dwarfism
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by W.I.Wood, 02-AUG-1989.
FEATURES from to/span description
pept + 75 + 240 growth hormone receptor, exon 7 /nomgen="GHR"
/map="5p13-p12" /hgml_locus_uid="LA0219G"
pre-msg < 1 > 362 growth hormone receptor mRNA and introns
IVS < 1 74 GHR Intron F
IVS 241 > 362 GHR Intron G
BASE COUNT 117 a 58 c 76 g 111 t
ORIGIN Undetermined number of bp after segment 5.
1 atacctgtag tgttcattgg cattgagttg ttgactcttt ggccaatatg gcgtttatat
61 ttttgtcttg aaagatggac cctatattga caacatcagt tccagtgtac tcattgaaag
121 tggataagga atatgaagtg cgtgtgagat ccaaacaacg aaactctgga aattatggcg
181 agttcagtga ggtgctctat gtaacacttc ctcagatgag ccaatttaca tgtgaagaag
241 gtaaaagaaa taaaagatta aaatagtagc taacctggct tttgtcaata taacagttga
301 ttcacccctg cactggtagt gtgttgtcca aatcaaaata tattaacatc agatatcagg
361 at
//
LOCUS HUMGHRA07 212 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human growth hormone receptor gene, exon 8.
ACCESSION M28464 M26401
KEYWORDS growth hormone binding-protein; growth hormone receptor;
transmembrane protein.
SEGMENT 7 of 10
SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
TITLE Characterization of the human growth hormone receptor gene and the
demonstration of a partial gene deletion in two patients with
Laron-type dwarfism
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by W.I.Wood, 02-AUG-1989.
FEATURES from to/span description
pept + 59 + 149 growth hormone receptor, exon 8 /nomgen="GHR"
/map="5p13-p12" /hgml_locus_uid="LA0219G"
pre-msg < 1 > 212 growth hormone receptor mRNA and introns
IVS < 1 58 GHR Intron G
IVS 150 > 212 GHR Intron H
BASE COUNT 53 a 33 c 43 g 83 t
ORIGIN Undetermined number of bp after segment 6.
1 gaaactgtgc ttcaactagt cgtaattctg aaagcgaaat attcttgtgt gtttgcagat
61 ttctactttc catggctctt aattattatc tttggaatat ttgggctaac agtgatgcta
121 tttgtattct tattttctaa acagcaaagg taggatgtag gaaggtagta ttctttggta
181 ccttctgtac cagttgtgtt agaccttgcc at
//
LOCUS HUMGHRA08 198 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human growth hormone receptor gene, exon 9.
ACCESSION M28465 M26401
KEYWORDS growth hormone binding-protein; growth hormone receptor;
transmembrane protein.
SEGMENT 8 of 10
SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 198)
AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
TITLE Characterization of the human growth hormone receptor gene and the
demonstration of a partial gene deletion in two patients with
Laron-type dwarfism
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by W.I.Wood, 02-AUG-1989.
FEATURES from to/span description
pept + 63 + 132 growth hormone receptor, exon 9 /nomgen="GHR"
/map="5p13-p12" /hgml_locus_uid="LA0219G"
pre-msg < 1 > 198 growth hormone receptor mRNA and introns
IVS < 1 62 GHR Intron H
IVS 133 > 198 GHR Intron I
BASE COUNT 68 a 35 c 30 g 65 t
ORIGIN About 4 kb after segment 7.
1 gctataattg agaatatgta gcttttaaga tgtcaaaacc aaaattttta tatgttttca
61 aggattaaaa tgctgattct gcccccagtt ccagttccaa agattaaagg aatcgatcca
121 gatctcctca aggtaactaa taattttatc taaagttgta gctagtacta attaacacct
181 gaagactcct gtcatatg
//
LOCUS HUMGHRA09 1881 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human growth hormone receptor gene, exon 10.
ACCESSION M28466 M26401
KEYWORDS growth hormone binding-protein; growth hormone receptor;
transmembrane protein.
SEGMENT 9 of 10
SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1881)
AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
TITLE Characterization of the human growth hormone receptor gene and the
demonstration of a partial gene deletion in two patients with
Laron-type dwarfism
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by W.I.Wood, 02-AUG-1989.
FEATURES from to/span description
pept + 64 1035 growth hormone receptor, exon 10 /nomgen="GHR"
/map="5p13-p12" /hgml_locus_uid="LA0219G"
pre-msg < 1 > 1881 growth hormone receptor mRNA and introns
IVS < 1 63 GHR Intron I
BASE COUNT 607 a 366 c 362 g 546 t
ORIGIN About 0.5 kb after segment 8.
1 gctaattcat ttaattatta tgagtttctt ttcatagatc ttcattttct ttctattttc
61 taggaaggaa aattagagga ggtgaacaca atcttagcca ttcatgatag ctataaaccc
121 gaattccaca gtgatgactc ttgggttgaa tttattgagc tagatattga tgagccagat
181 gaaaagactg aggaatcaga cacagacaga cttctaagca gtgaccatga gaaatcacat
241 agtaacctag gggtgaagga tggcgactct ggacgtacca gctgttgtga acctgacatt
301 ctggagactg atttcaatgc caatgacata catgagggta cctcagaggt tgctcagcca
361 cagaggttaa aaggggaagc agatctctta tgccttgacc agaagaatca aaataactca
421 ccttatcatg atgcttgccc tgctactcag cagcccagtg ttatccaagc agagaaaaac
481 aaaccacaac cacttcctac tgaaggagct gagtcaactc accaagctgc ccatattcag
541 ctaagcaatc caagttcact gtcaaacatc gacttttatg cccaggtgag cgacattaca
601 ccagcaggta gtgtggtcct ttccccgggc caaaagaata aggcagggat gtcccaatgt
661 gacatgcacc cggaaatggt ctcactctgc caagaaaact tccttatgga caatgcctac
721 ttctgtgagg cagatgccaa aaagtgcctc cctgtggctc ctcacatcaa ggttgaatca
781 cacatacagc caagcttaaa ccaagaggac atttacatca ccacagaaag ccttaccact
841 gctgctggga ggcctgggac aggagaacat gttccaggtt ctgagatgcc tgtcccagac
901 tatacctcca ttcatatagt acagtcccca cagggcctca tactcaatgc gactgccttg
961 cccttgcctg acaaagagtt tctctcatca tgtggctatg tgagcacaga ccaactgaac
1021 aaaatcatgc cttagccttt ctttggtttc ccaagagcta cgtatttaat agcaaagaat
1081 tgactggggc aataacgttt aagccaaaac aatgtttaaa ccttttttgg gggagtgaca
1141 ggatggggta tggattctaa aatgcctttt cccaaaatgt tgaaatatga tgttaaaaaa
1201 ataagaagaa tgcttaatca gatagatatt cctattgtgc aatgtaaata ttttaaagaa
1261 ttgtgtcaga ctgtttagta gcagtgattg tcttaatatt gtgggtgtta atttttgata
1321 ctaagcattg aatgactatg tttttaatgt atagtaaatc acgctttttg aaaaagcgaa
1381 aaaatcaggt ggcttttgcg gttcaggaaa attgaatgca aaccatagca caggctaatt
1441 ttttgttgtt tcttaaataa gaaacttttt tatttaaaaa actaaaaact agaggtgaga
1501 aatttaaact ataagcaaga aggcaaaaat agtttggata tgtaaaacat ttattttgac
1561 ataaagttga taaagatatt ttttaataat ttagacttca agcatggcta ttttatatta
1621 cactacacac tgtgtactgc agttggtatg acccctctaa ggagtgtagc aactacagtc
1681 taaagctggt ttaatgtttt ggccaatgca cctaaagaaa aacaaactcg ttttttacaa
1741 agccctttta tacctcccca gactccttca acaattctaa aatgattgta gtaatctgca
1801 ttattggaat ataattgttt tatctgaatt tttaaacaag tatttgttaa tttagaaaac
1861 tttaaagcgt ttgcacagat c
//
LOCUS HUMMHDQAR1 426 bp ds-DNA PRI 21-JUL-1990
DEFINITION Human MHC class II HLA-DQ-alpha chain gene, exon 1.
ACCESSION M29613
KEYWORDS cell surface protein; class II gene; integral membrane protein;
major histocompatibility complex.
SEGMENT 1 of 2
SOURCE Human (haplotype HLA-D/DR) adult male DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 426)
AUTHORS Jonsson,A.-K., Hyldig-Nielsen,J.-J., Servenius,B., Larhammar,D.,
Andersson,G., Joergensen,F., Peterson,P.A. and Rask,L.
TITLE Class II genes of the human major histocompatibility complex:
Comparisons of the DQ and DX alpha and beta genes
JOURNAL J. Biol. Chem. 262, 8767-8777 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 173 + 254 MHC HLA-DQ-alpha chain precursor, exon 1
/nomgen="HLA-DQA1" /map="6p21.3"
/hgml_locus_uid="LY0151Z"
sigp 173 241 MHC HLA-DQ-alpha chain signal peptide
matp 242 + 254 MHC HLA-DQ-alpha chain
IVS 255 > 426 MHC HLA-DQ-alpha, intron A
BASE COUNT 120 a 90 c 108 g 108 t
ORIGIN
1 atacaaactc ttcagctagt aactgagatg tcaccatggg ggatttttct aattggccaa
61 aacctgacct ggcagggttt ggtttgggtg tcttcagatt tccttgtctt gaggccctca
121 caattgctct acagctcaga acagcaactg ctgaggctgc cttgggaaga ggatgatcct
181 aaacaaagct ctgatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg
241 tgaagacatt gtgggtgagt gcatgagtga ggaatgttct ctggagctga aaaacagtaa
301 attgaaggaa aagagagaaa gcgatttgca gagaaattgt agagattcca agaccccttc
361 agtattaaga gaattaaaaa ttatagctgt tcctccttca ggaaaccaga gccccaacca
421 ctcttt
//
LOCUS SOYP34A 1287 bp ss-mRNA PLN 21-JUL-1990
DEFINITION Soybean oil bodies-associated protein P34 mRNA, complete cds.
ACCESSION J05560
KEYWORDS oil bodies-associated protein.
SOURCE Soybean (strain Century) seed, cDNA to mRNA, clone P34/1350.
ORGANISM Glycine max
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 1287)
AUTHORS Kalinski,A.J., Weiseman,J.M., Matthews,B.F. and Herman,E.M.
TITLE Molecular cloning of a protein associated with soybean oil bodies
which is similar to thiol proteases of the papain family
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.M.Herman, 18-MAY-1990.
FEATURES from to/span description
pept 3 1142 oil bodies-associated protein P34 precursor
sigp 3 368 oil bodies-associated protein P34 signal
peptide
matp 369 1139 protein P34
mRNA < 1 1287 P34 protein mRNA
BASE COUNT 407 a 240 c 297 g 343 t
ORIGIN
1 ttatgggttt ccttgtgttg cttcttttct ccctcttagg tctctcttct agttccagca
61 tatcaactca tcgttccata ttggaccttg acctaaccaa gtttaccaca cagaaacagg
121 tgtcttcact gttccaacta tggaagagtg agcatggacg tgtctaccat aaccacgaag
181 aagaggcaaa gagacttgag attttcaaga ataactcgaa ctatatcagg gacatgaatg
241 caaacagaaa atcaccccat tctcatcgtt taggattgaa caagtttgct gacatcactc
301 ctcaagagtt cagcaaaaag tacttgcaag ctcccaagga tgtgtcgcag caaatcaaaa
361 tggccaacaa gaaaatgaag aaggaacaat attcttgtga ccatccacct gcatcatggg
421 attggaggaa aaaaggtgtc atcacccaag taaagtacca agggggctgt ggaaggggtt
481 gggcgttttc tgccacggga gccatagaac cagcacatgc aatagcaaca ggagaccttg
541 ttagcctttc tgaacaagaa ctcgtagact gtgtggaaga aagcgaaggt tgttacaatg
601 gatggcacta tcaatcgttc gaatgggttt tagaacatgg tggtattgcc actgatgatg
661 attatcctta cagagctaaa gagggtagat gcaaagccaa taagatacaa gacaaggtta
721 caattgacgg atatgaaact gtaataatgt cagatgagag tacagaatca gagacagagc
781 aagcgttctt aagcgccatc cttgagcaac caattagtgt ctcaattgat gcaaaagatt
841 ttcatttata caccggggga atttatgatg gagaaaactg tacaagtccg tatgggatta
901 atcactttgt tttacttgtg ggttatggtt cagcggatgg tgtagattac tggatagcga
961 aaaattcatg gggagaagat tggggagaag atggttacat ttggatccaa agaaacacgg
1021 gtaatttatt aggagtgtgt gggatgaatt atttcgcttc atacccaacc aaagaggaat
1081 cagaaacact ggtgtctgct cgcgttaaag gtcatcgaag agttgatcac tctcctcttt
1141 gaagccgtaa aggttcaata caacgagtgc ttgttttctt agggacaagc attgtactta
1201 tgtatgattc tgtgtaacca tgagtctcca cgttgtacta atgtgaaggg caaaaataaa
1261 acacacaaca agttcgtttt tctcaat
//
LOCUS XELRPS22A 531 bp ss-mRNA VRT 21-JUL-1990
DEFINITION X.laevis ribosomal protein S22, 40S subunit mRNA, complete cds.
ACCESSION M34706
KEYWORDS nucleolar protein; ribosomal protein; ribosomal protein S22.
SOURCE X.laevis stage 17 embryo, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 531)
AUTHORS Keiper,B.D. and Wormington,M.
TITLE Nucleotide sequence and 40S subunit assembly of Xenopus laevis
ribosomal protein S22
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Wormington, 18-MAY-1990.
Author Address [1]: M.Wormington
University of Virginia
Department of Biology
Gilmer Hall
Charlottesville, VA 22901
FEATURES from to/span description
pept 131 490 ribosomal protein S22, 40S subunit
mRNA < 1 531 ribosomal protein S22 mRNA
BASE COUNT 158 a 117 c 124 g 132 t
ORIGIN
1 tgtttaataa aagaaagtaa tagtaacttc aacctctaca cgtggctgtt cctgactcac
61 cgctgttcgc tcggcaagca tcggatcgca ccgtcggacg aataagtcgg tcagggaagt
121 gataagaaaa atggctttta aagacccagg caaagcccct gtagaccagg aagtggccat
181 ccatcgtatc aggattactt taacaagtcg taatgtgaag tctctggaaa aagtgtgtgc
241 tgatctgatc cgtggtgcca aagagaagaa cctgaaggtt aagggcccag tccgtatgcc
301 taccaagact cttcgtatca caaccagaaa aacaccttgc ggtgagggtt ccaaaacctg
361 ggatcgtttc cagatgcgca tccacaagcg cctgattgac ctgcacagtc cttccgagat
421 tgttaagcag atcacttcca tcagtatcga acctggtgta gaagttgaag ttactattgc
481 tgatgcataa atgacacttc tgtttaataa aagaaagtaa tgtggattgt t
//
LOCUS HUMSSBLA1 81 bp ss-mRNA PRI 21-JUL-1990
DEFINITION Human SS-B/La protein mRNA, partial cds.
ACCESSION M35261
KEYWORDS SS-B/La antigen; SS-B/La protein; autoantigen;
small nuclear ribonucleoprotein.
SEGMENT 1 of 3
SOURCE Human fibroblast, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 81)
AUTHORS Kohsaka,H., Yamamoto,K., Fujii,H., Miura,H., Miyasaka,N.,
Nishioka,K. and Miyamoto,T.
TITLE Fine epitope mapping the human SS-B/La protein: Identification of a
distinct autoepitope homologous to a viral gag polyprotein
JOURNAL J. Clin. Invest. 85, 1566-1574 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 81 SS-B/La protein (AA at 1)
BASE COUNT 36 a 14 c 15 g 16 t
ORIGIN
1 gaaatcagtg aagataaaac taaaatcaga aggtctccaa gcaaacccct acctgaagtg
61 actgatgagt ataaaaatga t
//
LOCUS HUMSSBLA2 155 bp ss-mRNA PRI 21-JUL-1990
DEFINITION Human SS-B/La protein mRNA, partial cds.
ACCESSION M35263
KEYWORDS SS-B/La antigen; SS-B/La protein; autoantigen;
small nuclear ribonucleoprotein.
SEGMENT 2 of 3
SOURCE Human fibroblast, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 155)
AUTHORS Kohsaka,H., Yamamoto,K., Fujii,H., Miura,H., Miyasaka,N.,
Nishioka,K. and Miyamoto,T.
TITLE Fine epitope mapping the human SS-B/La protein: Identification of a
distinct autoepitope homologous to a viral gag polyprotein
JOURNAL J. Clin. Invest. 85, 1566-1574 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 155 SS-B/La protein (AA at 3)
BASE COUNT 81 a 19 c 30 g 25 t
ORIGIN About 200 base pairs after segment 1.
1 agaagtacaa agaaacagac ctgctaatac ttttcaagga cgattacttt gccaaaaaaa
61 atgaagaaag aaaacaaaat aaagtggaag ctaaattaag agctaaacag gagcaagaag
121 caaaacaaaa gttagaagaa gatgctgaaa tgaaa
//
LOCUS HUMSSBLA3 192 bp ss-mRNA PRI 21-JUL-1990
DEFINITION Human SS-B/La protein mRNA, partial cds.
ACCESSION M35262
KEYWORDS SS-B/La antigen; SS-B/La protein; autoantigen;
small nuclear ribonucleoprotein.
SEGMENT 3 of 3
SOURCE Human fibroblast, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 192)
AUTHORS Kohsaka,H., Yamamoto,K., Fujii,H., Miura,H., Miyasaka,N.,
Nishioka,K. and Miyamoto,T.
TITLE Fine epitope mapping the human SS-B/La protein: Identification of a
distinct autoepitope homologous to a viral gag polyprotein
JOURNAL J. Clin. Invest. 85, 1566-1574 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 192 SS-B/La protein (AA at 1)
BASE COUNT 90 a 23 c 50 g 29 t
ORIGIN About 170 base pairs after segment 2.
1 gccaaggaag cattgggtaa agccaaagat gcaaataatg gtaacctaca attaaggaac
61 aaagaagtga cttgggaagt actagaagga gaggtggaaa aagaagcact gaagaaaata
121 atagaagacc aacaagaatc cctaaacaaa tggaagtcaa aaggtcgtag atttaaagga
181 aaaggaaagg gt
//
LOCUS RATPPTA1 1757 bp ds-DNA ROD 21-JUL-1990
DEFINITION Rat tachykinin (PPT) gene, exons 1 and 2.
ACCESSION M34159
KEYWORDS alternative splicing; neurokinin A; neuropeptide K;
neuropeptide gamma; substance P; tachykinin.
SEGMENT 1 of 4
SOURCE Rat (strain Sprague-Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1757)
AUTHORS Carter,M.S. and Krause,J.E.
TITLE Structure, expression and some regulatory mechanisms of the rat
preprotachykinin gene encoding substance P, neurokinin A,
neuropeptide K, and neuropeptide gamma
JOURNAL J. Neurosci. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Krause, 10-MAY-1990.
FEATURES from to/span description
pept 1400 + 1522 beta-preprotachykinin (PPT), exon 2 (first
expressed exon)
sigp 1400 1456 beta-tachykinin signal peptide (1456 could be
1471)
pep$ 1400 + 1522 gamma-preprotachykinin (PPT), exon 2 (first
expressed exon)
sigp 1400 1456 gamma-tachykinin signal peptide (1456 could be
1471)
pep$ 1400 + 1522 alpha-preprotachykinin (PPT), exon 2 (first
expressed exon)
sigp 1400 1456 alpha-tachykinin signal peptide (1456 could be
1471)
pre-msg 866 > 1757 PPT mRNA and introns
IVS 966 1390 PPT intron A
IVS 1523 > 1757 PPT intron B
BASE COUNT 411 a 443 c 470 g 433 t
ORIGIN
1 ctgcagagct ccaaaggtaa gcatccagcc tttctagtcc cccaacaagg ctaaagggga
61 gagaggcaca attatcctct tcccacccct tctgccttca gggtgtgcct gggaagaagc
121 tgtaggggaa caaaagatgc cttagaatgg ctgatgggta agttctacat gagaaaggag
181 gtttaaattc ctctttcccc taaatgtaaa acaaacctgc cttcatcctc tgaagcggga
241 gaccggaaac acttttgcag tgctagagaa atgagaatat tctgactgat ttggtgggga
301 ggggggttgg gggggtgtgt tccagcccta gatataacac ctcataaacc ttaagacaca
361 taaagtagaa atgaaaggaa aaccccgctt gcttcatccc tctgaagtgc ttgctggtgt
421 cttagtatta ttcacaaggt tttgctgctc aagttatttg gctgtcctca aagcgcaata
481 ttccctgatg cctcttgaga gaaaagttcc ctaagtccga agcatgagtc acttcgctca
541 gttttgatga gtaatctcag gtgtcactga accttgttcg gaagaagagg ggaggggggc
601 gtcagatttg cagacggaag aaaacaggtc tctctggatt ggatggcgag acctcgactt
661 ccctaaaatt gcgtcatttc gaacccaatt tggtccagat gttatggact ccgacgggtt
721 accgtctcgg aaactctatc acgcaagcaa aaggcgaggg ggcggctaat taaatattga
781 gcagaaagtc gcgtggggag agtgtcacgt ggctctccag gctcatcacg cctgagataa
841 ataaggcgaa gcaggagcag ggactagagc gcactcggac cagctccact ccagcaccgc
901 ggcggaggag agcgaggagc gcccagcaag tgcgcacctg cggagcatca ccgggtccga
961 ccgcagtgag taccactccc gcctgccatc tgccgctgcc tggggcccgt cgtgccttgg
1021 tcagaggtgg cgtccgcccg gggtttcacc tgcagcagcc acagacccgg agagaaactg
1081 tgcaaatacc aacacctctc tttgtctgcc ttcaggcttc ggagtgtggg tcagtgggta
1141 gggggctggg acgttgagag gcaaagagag gaggacttga ggctctttgg cacgtcagta
1201 gccttcttaa agggtttgga gaaatgttcg acctggtggg tttttgtttt tgttttaatc
1261 ttggtttagc gagacctctt ctttcttcct tgtggcattg aaattcagct gctctaacgt
1321 cttgggtggc cccccgccta catatgtata acgattcttt ctctccttct ctgtctccta
1381 ctctctccag aaatccaaca tgaaaatcct cgtggcggtg gcggtctttt ttctcgtttc
1441 cactcaactg tttgcagagg aaatcggtgc caacgatgat ctaaattatt ggtccgactg
1501 gtccgacagt gaccaaatca aggtgagacc aaccgggcgt gggagctgtt gccgtcccct
1561 tgtggtagca ccctgactag cctagctgga gcctgagagg ctcattggat tgtaagggcg
1621 tcatacccgg acaaagcaca aggcttattg cccagaacta agtttctccc tgagggagga
1681 aaatgtcctg ggttcagtgt ctccccaaca gaaaaggagt tcttctttcc ttcctccctc
1741 cctccctccc tccctcc
//
LOCUS RATPPTA2 875 bp ds-DNA ROD 21-JUL-1990
DEFINITION Rat tachykinin (PPT) gene, exons 3 and 4.
ACCESSION M34160
KEYWORDS neurokinin A; neuropeptide K; neuropeptide gamma; substance P;
tachykinin.
SEGMENT 2 of 4
SOURCE Rat (strain Sprague-Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 875)
AUTHORS Carter,M.S. and Krause,J.E.
TITLE Structure, expression and some regulatory mechanisms of the rat
preprotachykinin gene encoding substance P, neurokinin A,
neuropeptide K, and neuropeptide gamma
JOURNAL J. Neurosci. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Krause, 10-MAY-1990.
FEATURES from to/span description
pept + 187 283 beta-preprotachykinin (PPT), exon 3
722 + 766 beta-preprotachykinin, exon 4
matp 235 267 substance P
pep$ + 187 + 283 gamma-preprotachykinin (PPT), exon 3
matp 235 267 substance P
pep$ + 187 283 alpha-preprotachykinin (PPT), exon 3
722 + 766 alpha-preprotachykinin, exon 4
matp 235 267 substance P
pre-msg < 1 > 875 PPT mRNA and introns
IVS < 1 186 PPT intron B
IVS 284 721 PPT intron C
IVS 767 > 875 PPT intron D
BASE COUNT 213 a 191 c 219 g 252 t
ORIGIN
1 ccctgataga ctttggggga gaaggaaatc tcaggcgggt ccgggttggg gaaagtgctg
61 catagattct agtatggccc ggtctcatgc tagcggctgc ttctgtggct ctgaggaaag
121 ctcaggcaca ccgtgaatgt ggaccttgtt cacttaattt gtctactcgc tggtttgtcc
181 ctccaggagg caatgccgga gccctttgag catcttcttc agagaatcgc ccgaagaccc
241 aagcctcagc agttctttgg attaatgggc aaacgggatg ctggtgagat gagcaatagt
301 cactaggtct atcaggctac ccggtctgca tgcctgctcc ttcctggagc acccaagtgt
361 ctcttgtacc gcgagtataa ttttacatgc ctaaagactt ttgattccct gaagtcttcg
421 atccactaac acttgcacta gctcaaagtg gaggggaggg gggatccatt tcccttgctt
481 cacagcacca aggtgaccca aattggaaac taaccttagc taaacactcc cctgactaaa
541 gatcccaatt acaagggtcc tcaggtatgc taatggtcca ggctcaggtg aaagatgttt
601 tcttggattg ggttagattt ggctatacct cgctacagtg ataggaattt tcaattgcct
661 tgagttgatg gcacagtcag tggagcttgt gaagagttaa tgtcaattca tctcttgtca
721 gattcctcaa ttgaaaaaca agtggccctg ttaaaggctc tttatggtaa acatccctat
781 aatcttattt cctatatgat gatcatgtaa aagtatttga tggtataatc agacttgcag
841 tgtgcctata gagagtgtgt gtgtgtgtgt gtgtg
//
LOCUS RATPPTA3 2503 bp ds-DNA ROD 21-JUL-1990
DEFINITION Rat tachykinin (PPT) gene, exons 5 and 6.
ACCESSION M34161
KEYWORDS neurokinin A; neuropeptide K; neuropeptide gamma; substance P;
tachykinin.
SEGMENT 3 of 4
SOURCE Rat (strain Sprague-Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2503)
AUTHORS Carter,M.S. and Krause,J.E.
TITLE Structure, expression and some regulatory mechanisms of the rat
preprotachykinin gene encoding substance P, neurokinin A,
neuropeptide K, and neuropeptide gamma
JOURNAL J. Neurosci. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Krause, 10-MAY-1990.
FEATURES from to/span description
pept + 140 163 beta-preprotachykinin (PPT), exon 5
1493 + 1546 beta-preprotachykinin, exon 6
matp 1495 1524 neurokinin
pep$ + 140 163 gamma-preprotachykinin (PPT), exon 5
1493 + 1546 gamma-preprotachykinin, exon 6
matp 1495 1524 neurokinin
pep$ + 140 + 163 alpha-preprotachykinin (PPT), exon 5
pre-msg < 1 > 2503 PPT mRNA and introns
IVS < 1 139 PPT intron D
IVS 164 1492 PPT intron E
IVS 1547 > 2053 PPT intron F
BASE COUNT 861 a 454 c 428 g 760 t
ORIGIN
1 catctacttt agtcctcaga acagcacata aataagcgtt aaatatttcg aaaatcaact
61 ccttaagttc tctggaaaag ataaaatagc tttaaaaaag aaacccatgt atctgctaac
121 tggctctctt ttcttctagg gcatggtcag atctctcaca aaagtaagtt cacaatcatt
181 tgacatttat caaatgctaa tgttaatgat agtgaatatt gcatcctgtg ttaccttttg
241 cagcacaaca tagagcccaa tggttttgat tgagataatg tagggaaaga aagacagaat
301 gtgatttgct gaatgatcac tgactagact ctttatgtag acagattacg taccaaatgc
361 acattacatt gagagaaaag atacactata gtttttagtt tatattctct ctagcgtaat
421 cacatgagca ctttccagcc tgtccagaag acgagagaga gagagagaga gagagagaga
481 gagagagaga gagagaacac ttatatacac cttctaattc atattcatga actacctttt
541 ataatggcag gaaggggatt aagtagactg ggtgaatatt tgagtagaaa atgcactcga
601 aatgtgatag agatacaggt ttattttccc ccttgatccc ataattgaat gggaacctgg
661 gagagaagag gtgcagattg cagaactgga gggtccttag agctatttcc tagttaattg
721 gatccagagg agacaggaag tcccttgggc ccctggaaat gttgcctgct tctatgaggg
781 tcatgaattg gactaaccct ggcagggtac accccaaacc attaacactg tggtctcaac
841 taacacaatg gaaactaaga aaaatcctcc ccgttgcagt atgataataa ccttccccaa
901 agccttcatc tttggggaaa aaaaatgaca tttactaagg ggttaaattt taaagaaatc
961 ttatacagca aaaaaagtaa taaaatgaat gtatcttatt aatgttaaaa aacaaattgc
1021 tttcaggaac agtattttta agtctaaatc tactaaatgt gaccccttag aatattccaa
1081 tactaaggat ctgacttaaa gttcccttgt tgagaaatgc aacatatttg gtaataaaga
1141 cataaacgtg gatgtctttt cccacacaaa aacatataat taaagtcccc caaaattcaa
1201 agctactcca gtggcaacca tatttaaaac ttcataatag ataaaacata atacagataa
1261 aagattctgc taatgtctag ataaagaact gagaaaagta ctcacaatat gttgttgcag
1321 tgtggtttgc aaggcttata taagtctttg tgggttcacc agtgggtgac atagaaacat
1381 tcatttattt attctacagg catatattca gaagcttgtt aactttgtac tagttatgag
1441 tttatttctt caaaaacata catactaaaa tacccctaaa tgtattttcc agggcataaa
1501 acagattcct ttgttggact aatgggcaaa agagctttaa attctggtat gtataaaatc
1561 atgtctgaaa atatgtaaat caacttctgt aatttttacg gaaagtaaca attttcaaca
1621 tactaaaaaa aaaaaaagga gccaatagac ttaacatgag ccatttctgt agacaaattt
1681 tcataggagc gtttgtagcc aagagtgggc agaaagctga ttaagctgat accaccacaa
1741 tgtaagtata tttctctcaa agtgccattc tgtccagcat ctgccccact tagtcactac
1801 atatcctgaa tctttggtct agtctgacac tctttttcac agatattctt aataaaatgt
1861 cctggaccct tagagagaga tgagagcttt tttacttaca aactgaagtg atgtggtact
1921 ttcatcagac taaccattta gtgattttct actagaaaaa taaatagata atgacaagca
1981 aaatgtttct aaaattactg tctctctctg aagttatgga agctcaaaat ccctgtctta
2041 atgggtggca aagaggctga gtgtagtggt gtccattcta acgccttcaa accattcttc
2101 ctgttccaga tttagaccta catttaaaag taccacacgc cccaaaccca ttttaaacct
2161 aagcgggatt ttccttggcc cagctgcata atctctcatt tatcctgctt gaactttcta
2221 cataaatgcc acagcatatc tctaaagccc ctcactgttc acctcattta atcaattata
2281 ttcaatgtta aataatccca ttattgtttt catgaaaatt ggctatcagt gaaaatgggt
2341 taaaatgata aacacctgat tagaaattaa ccagcatctg tttatgatat ttagttcaca
2401 gaaaatttat aagatgtgct ggcttaattt ttaaatatct gactcatccc caaaataggt
2461 cactctatag gtatcttaat caatgtagac atattttata ttt
//
LOCUS RATPPTA4 779 bp ds-DNA ROD 21-JUL-1990
DEFINITION Rat tachykinin (PPT) gene, exon 7.
ACCESSION M34162
KEYWORDS neurokinin A; neuropeptide K; neuropeptide gamma; substance P;
tachykinin.
SEGMENT 4 of 4
SOURCE Rat (strain Sprague-Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 779)
AUTHORS Carter,M.S. and Krause,J.E.
TITLE Structure, expression and some regulatory mechanisms of the rat
preprotachykinin gene encoding substance P, neurokinin A,
neuropeptide K, and neuropeptide gamma
JOURNAL J. Neurosci. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Krause, 10-MAY-1990.
FEATURES from to/span description
pept + 7 56 beta-preprotachykinin (PPT), exon 7
pep$ + 7 56 gamma-preprotachykinin (PPT), exon 7
pep$ + 7 56 alpha-preprotachykinin (PPT), exon 7
pre-msg < 1 701 PPT mRNA and introns
IVS < 1 6 PPT intron F
BASE COUNT 228 a 150 c 159 g 242 t
ORIGIN
1 ctgcagtggc ttatgaaaga agcgcaatgc agaactacga aagaaggcgt aaataaaccc
61 tgtaacgcac tatctattca tctccatctg tgtccgcgag cagtgagcgg taaaataaaa
121 atgtgcgcta tgaggaatga ttatttattt aatatcaaat gttgttatga gtgaaaaact
181 caaaaaagtg tttatttttt catattgtgc caataagcat tgtaattcta atgtggtgac
241 ctcctcagac agaagtagaa attagttgta acttcagcaa agcacagtgt tgatggagtt
301 gtacaagttt gccagcgatg caagtctcca aagacagaaa ggctgctgtg aggcagtgca
361 ggcggctgct gctggaggca gagaaactcc tgtgtgtctt gcgcttccct tggttgcttt
421 tatcctaatg atgtactgag agtttggtat ctgactctat ttgtatccta gcagcatgtt
481 tcctgtgttg tgactatata gagatgtttt taaaagtttc aatgtacttc tctggtcttc
541 agtcattgta tgatgtgttg tgatagctac cattttaaat aaaagaatgt atcttcagga
601 attttgtcac attctttcat taatatttat gctaaccaca ccatctctat taggatgagt
661 gtgatcaagc tgtttgggta aacacttcta tctctctcag gcacagtgca aattgagtgt
721 gcgcgcaaca cacacacaca cacacacaca cacacacaca tacctccaca cacattttt
//
LOCUS MXAFRZE 2493 bp ds-DNA BCT 21-JUL-1990
DEFINITION M.xanthus gliding motility regulatory protein (frzE) gene, complete
cds.
ACCESSION M35192
KEYWORDS gliding motility regulatory protein.
SOURCE M.xanthus (strain DZF1) DNA.
ORGANISM Myxococcus xanthus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria;
Myxococcaceae.
REFERENCE 1 (bases 1 to 2493)
AUTHORS McCleary,W.R. and Zusman,D.R.
TITLE FrzE of Myxococcus xanthus is homologous to both CheA and CheY of
Salmonella typhimurium
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.R.McCleary, 05-JUN-1990.
FEATURES from to/span description
pept 64 2397 gliding motility regulatory protein (frzE)
BASE COUNT 375 a 848 c 926 g 344 t
ORIGIN
1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg
61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc
121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag
181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt
241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag
301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc
361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc
421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg
481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc
541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc
601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc
661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc
721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc
781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg
841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg
901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc
961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc
1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag
1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg
1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg
1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc
1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc
1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg
1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc
1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag
1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc
1561 accatcaccc tgcgcctgcc gcagtcgctg gcgttgatga aggtgctgct ggtgcgcctg
1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg
1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg
1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc
1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac
1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc
1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc
1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc
2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag
2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag
2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg
2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg
2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc
2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga
2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg
2461 caaggggctg cgtgcgctcg cggcccgggg cct
//
LOCUS RATPCBB 463 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Rat PCB-binding protein mRNA, complete cds..
ACCESSION J05536
KEYWORDS PCB binding protein; binding protein.
SOURCE Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone
pPCB6.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 463)
AUTHORS Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J.,
Gillner,M., Gustafsson,J.-A. and Lund,J.
TITLE Cloning, structure, and expression of a rat binding protein for
polychlorinated biphenyls: Homology to the hormonally regulated
progesterone-binding protein uteroglobin
JOURNAL J. Biol. Chem. 265, 1-4 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
J.Lund, 15-JUN-1990.
FEATURES from to/span description
pept 58 348 PCB binding protein precursor
sigp 58 114 PCB binding protein signal peptide
matp 115 345 PCB binding protein
BASE COUNT 117 a 156 c 84 g 106 t
ORIGIN
1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg
61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg
121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat
181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag
241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg
301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat
361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta
421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc
//
LOCUS YSBCPR 2990 bp ds-DNA PLN 21-JUL-1990
DEFINITION C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds.
ACCESSION M35199
KEYWORDS NADPH-cytochrome P450 reductase.
SOURCE C.tropicalis (ATCC 750) DNA, clone pTS1.
ORGANISM Candida tropicalis
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina.
REFERENCE 1 (bases 1 to 2990)
AUTHORS Sutter,T.R., Sanglard,D. and Loper,J.C.
TITLE Isolation and characterization of the alkane-inducible NADPH-
cytochrome P450 oxidoreductase gene from Candida tropicalis:
Identification of invariant residues within similar amino acid
sequences of divergent flavoproteins
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
T.R.Sutter, 15-JUN-1990.
Author name and address
T.R.Sutter
CIIT, P.O. Box 12137
Research Triangle Park, NC 27709
FEATURES from to/span description
pept 554 2596 NADPH-cytochrome P450 reductase
signal 198 204 TATA box
BASE COUNT 988 a 442 c 558 g 1002 t
ORIGIN
1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa
61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg
121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag
181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag
241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag
301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa
361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata
421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta
481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc
541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt
601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt
661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa
721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa
781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc
841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt
901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac
961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac
1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg
1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt
1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa
1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga
1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt
1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga
1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa
1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat
1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt gatactgtta ttgaattgaa
1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat
1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg
1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga
1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc
1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt
1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt
1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac
1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca
2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag
2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg
2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg
2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt
2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa
2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa
2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc
2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga
2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca
2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta
2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga
2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat
2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc
2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta
2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt
2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta
//
LOCUS HUMCYPIIF 1825 bp ss-mRNA PRI 21-JUL-1990
DEFINITION Human cytochrome P450IIF1 protein (CYP2F) mRNA, complete cds.
ACCESSION J02906
KEYWORDS cytochrome P450.
SOURCE Human lung, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1825)
AUTHORS Nhamburo,P.T., Kimura,S., McBride,O.W., Kozak,C.A.,
Gelboin,H.V. and Gonzalez,F.J.
TITLE The human CYP2F gene subfamily: Identification of a cDNA encoding a
new cytochrome P450, cDNA-directed expression and chromosome
mapping
JOURNAL Biochemistry (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.T.Nhamburo, 18-MAY-1990.
FEATURES from to/span description
pept 56 1531 cytochrome P450IIF1 /nomgen="CYP2F"
/map="19q13.1-q13.2" /hgml_locus_uid="LV0170W"
mRNA < 1 1813 CYP2F mRNA
BASE COUNT 356 a 596 c 470 g 403 t
ORIGIN
1 gcaggctcag cgcatcccag ccagtgtctc ctgcagctca gcagctgcct tcaccatgga
61 cagcataagc acagccatct tactcctgct cctggctctc gtctgtctgc tcctgaccct
121 aagctcaaga gataagggaa agctgcctcc gggacccaga cccctctcaa tcctgggaaa
181 cctgctgctg ctttgctccc aagacatgct gacttctctc actaagctga gcaaggagta
241 tggctccatg tacacagtgc acctgggacc caggcgggtg gtggtcctca gcgggtacca
301 agctgtgaag gaggccctgg tggaccaggg agaggagttt agtggccgcg gtgactaccc
361 tgcctttttc aactttacca agggcaatgg catcgccttc tccagtgggg atcgatggaa
421 ggtcctgaga cagttctcta tccagattct acggaatttc gggatgggga agagaagcat
481 tgaggagcga atcctagagg agggcagctt cctgctggcg gacgtgcgga aaactgaagg
541 cgagcccttt gaccccacgt ttgtgctgag tcgctcagtg tccaacatta tctgttccgt
601 gctcttcggc agccgcttcg actatgatga tgagcgtctg ctcaccatta tccgccttat
661 caatgacaac ttccaaatca tgagcagccc ctggggcgag ttgtacgaca tcctagaccc
721 cagattcccg agcctcctgg actgggtgcc tgggccgcac caacgcatct tccagaactt
781 caagtgcctg agagacctca tcgcccacag cgtccacgac caccaggcct cgtctccccg
841 ggacttcatc cagtgcttcc tcaccaagat ggcagaggag aaggaggacc cactgagcca
901 cttccacatg gataccctgc tgatgaccac acataacctg ctctttggcg gcaccaagac
961 ggtgagcacc acgctgcacc acgccttcct ggcactcatg aagtacccaa aagttcaagc
1021 ccgcgtgcag gaggagatcg acctcgtggt gggacgcgcg cggctgccgg cgctgaagga
1081 ccgcgcggcc atgccttaca cagacgcggt gatccacgag gtgcagcgct ttgcagacat
1141 catccccatg aacttgccgc accgcgtcac tagggacacg gcctttcgcg gcttcctgat
1201 acccaagggc accgatgtca tcaccctcct taacaccgtc cactacgacc ccagccagtt
1261 cctgacgccc caggagttca accccgagca ttttttggat gccaatcagt ccttcaagaa
1321 gagtccagcc ttcatgccct tctcagctgg gcgccgtctg tgcctgggag agctgctggc
1381 gcgcatggag ctctttctgt acctcaccgc catcctgcag agcttttcgc tgcagccgct
1441 gggtgcgccc gaggacatcg acctgacccc actcagctca ggtcttggca atttgccgcg
1501 gcctttccag ctgtgcctgc gcccgcgcta acgccccggc ccttccagat tcgcctgtga
1561 gcgatgaggc ccacccatgt gggttgctac gtccccttct tggtccacag tctgccctca
1621 tccctctggc agtcacgctg tcttccctgc atgctgtgcc tgccgcgtgc ccttccccca
1681 tccctccaat ctgtgccccg tctgcagggc agaggcagat gtggcatgtc tttttgtacc
1741 cacagagctt gttctatggc acgccctttt ctaggctttt tgtatcattt cttagtacat
1801 tgtaatagat tcaaaccagt cttgg
//
LOCUS MUSADA3A 891 bp ds-DNA ROD 21-JUL-1990
DEFINITION Mouse adenosine deaminase (ada) gene, 3' end.
ACCESSION J05523
KEYWORDS adenosine deaminase.
SOURCE Mouse fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 891)
AUTHORS Maa,M.-C., Chinsky,J.M., Ramamurthy,V., Martin,B.D. and
Kellems,R.E.
TITLE Identification of transcription stop sites at the 5' and 3' ends of
the murine adenosine deaminase gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by V.Ramamurthy, 18-MAY-1990.
BASE COUNT 241 a 207 c 262 g 181 t
ORIGIN
1 atagctggcc accttgcctt gattctatca aaggcattcc agtcccagca tcaagcacat
61 tctgtcctta tgcctccctt tcagacatca gtcatggtgg ctagcaagca tcatgtactg
121 ggaggaccca aatacacaaa gactctcagt aaggcaaacg gtgttgtaaa ggcctacatg
181 aggggactag tgatggctaa ggatgaaggc aggcagcaaa caaagctgat gagagctgag
241 cagagcacag gagccttgaa gagccccttg gcactgtgcc cggtgtgcca gctgggtcca
301 ctgctcattc ggttggacag tgagcagccg ttgagtaggt acccgtcacg tgccaaacac
361 tgttagtcac agagggcatg agacagcctg tgctggaaca gctcagttca tagggccatg
421 gagatgggga gaaaggggcg cctctgtcag agacaagctg tggtctggga aggccttagc
481 actaaaagca ccacaatgag aagcaaccgc cagaagcagg gcccgcaggc ctttgttcca
541 gctgcaaaga gaaaggaaaa agtggggaat aagagttggg gctgcggagg gggtggggag
601 cattgtgcag gttccgtact tgaacagaaa gcagggacca acacaaggaa ggccaaaggg
661 ctcagcctgg gaaactttgt gaggctatgg aggtaagggg cagggtattc tcccgaggca
721 tagagactgt cataccaatg atggtcattg gctgggcgag cctaagaaga ctccagtttc
781 ccttgacctc agttaaatct attacatgag aaactgggtc cctgtcccca acattgtccc
841 tgttccgtgt tcctaagtca tttctgggca ataaggactg agaactctag a
//
LOCUS YSCG6PD 3782 bp ds-DNA PLN 21-JUL-1990
DEFINITION S.cerevisiae glucose-6-phosphate dehydrogenase (ZWF1) gene and ORF,
complete cds.
ACCESSION M34709
KEYWORDS glucose-6-phosphate dehydrogenase.
SOURCE S.cerevisiae (strain S288C) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3782)
AUTHORS Nogae,I. and Johnston,M.
TITLE Isolation and characterization of the ZWF1 gene of Saccharomyces
cerevisiae, encoding glucose-6-phosphate dehydrogenase
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Johnston, 18-MAY-1990.
FEATURES from to/span description
pept 1585 3102 glucose-6-phosphate dehydrogenase (ZWF1) (EC
1.1.1.49)
pept 246 1037 pot. ORF
mRNA 1472 > 3102 ZWF1 mRNA
BASE COUNT 1066 a 899 c 887 g 930 t
ORIGIN
1 atcgattagc aaaaccgtag agaagatcat tgcgcataag aaacaaaaag aaaataccgg
61 tgcggaccgc aagcctctat tatctgctgt gtgtccggga tttcttatat acacggaaaa
121 gaccaaaccg cagttggtgc ctatgctgct caatgtcaag tcgcctcagc aaattacggg
181 ctcgttgatc agagctacgt tcgagagtct ctatcgcgcg tgagtccttt taccatctgt
241 cgctgatgcc atgttttgac aagaaactag aagcgtcaag gccagaatcg cttgacgatg
301 ggattgactg tgtcattacg ccacgtgaaa tagtgacaat gttacaagag ttgaatttag
361 acttcaagtc gtttcttact gaggacacaa gtttatatgg cgctatcacc gccgggtggg
421 acccgcgagt ccactgggcg tcgaatctgg gaggcacttg tggtggctat gcctaccagt
481 acgtaacggc tgtccaacga ctacacccag gaagtcagat gatagttctg gagggtagaa
541 acagcgacat tgtcgagtac cgattgctgc acgatgatcg cataatagcc gccgccagcg
601 agctttccgg gtttagaaac atccaaaatc tggtgcgtaa actgacctcg ggctccgggt
661 cagagcgcaa gaggaacatc agctctgcgg aagagagacg aacaggtcca aaggcgaact
721 cacgcgagat ggcgccagcc gccacggccg atccctacca ctcggactac atcgaggtaa
781 acgcgtgtcc gggtgcatgc atgaacggtg gtggcctact caatggcgag caaaacagtc
841 tcaagcggaa gcagttggtt cagacactga ataagcgtca cggagaggag ctagcaatgg
901 tagatcctct aaccctggga cccaagctag aagaggcggc accgcccgct ttcgctagag
961 tacgtcttcg cgcccgtcaa gcaggccgtc gaaaaggatc tcgtctctgt tgggagcacc
1021 tggtaagtaa ggtgtagttt tgcacccgtg tacataagcg tgaaatcacc acaaactgtg
1081 tgtatcaagt acatagtgac atttaaataa tagcaagaac aacaataata gtagcgctac
1141 tggaagcacc acgtaatagt ggaaaagaac tggaaaaacc gctataagat gcatactccg
1201 gcggtcttac gcggagatac aagcttccaa cggtgctaaa agcccggttt cggctcggcc
1261 ggaggaggaa gagagacgaa aaaaaaaaaa atgactaaaa aaaaaatgga atattattaa
1321 tgtgggattt ttggctcaag gtgtggtggc cccttttcta agggtggcga attcttcaat
1381 gtacggaaaa ctcgccaagg ctatcccata tataagcaaa ctgtgggttc atctatatac
1441 cgacacataa cacctaaagt ggcttcctcc tgcccctctc tcccttttct ccactcaccc
1501 ctccttctcc cccttccccc tctccaattg gctgtataga cagaaagagt aaatccaata
1561 gaatagaaaa ccacataagg caagatgagt gaaggccccg tcaaattcga aaaaaatacc
1621 gtcatatctg tctttggtgc gtcaggtgat ctggcaaaga agaagacttt tcccgcctta
1681 tttgggcttt tcagagaagg ttaccttgat ccatctacca agatcttcgg ttatgcccgg
1741 tccaaattgt ccatggagga ggacctgaag tcccgtgtcc taccccactt gaaaaaacct
1801 cacggtgaag ccgatgactc taaggtcgaa cagttcttca agatggtcag ctacatttcg
1861 ggaaattacg acacagatga aggcttcgac gaattaagaa cgcagatcga gaaattcgag
1921 aaaagtgcca acgtcgatgt cccacaccgt ctcttctatc tggccttgcc gccaagcgtt
1981 tttttgacgg tggccaagca gatcaagagt cgtgtgtacg cagagaatgg catcacccgt
2041 gtaatcgtag agaaaccttt cggccacgac ctggcctctg ccagggagct gcaaaaaaac
2101 ctgggggccc tctttaaaga agaagagttg tacagaattg accattactt gggtaaagag
2161 ttggtcaaga atcttttagt cttgaggttc ggtaaccagt ttttgaatgc ctcgtggaat
2221 agagacaaca ttcaaagcgt tcagatttcg tttaaagaga ggttcggcac cgaaggccgt
2281 ggcggctatt tcgactctat aggcataatc agagacgtga tgcagaacca tctgttacaa
2341 atcatgactc tcttgactat ggaaagaccg gtgtcttttg acccggaatc tattcgtgac
2401 gaaaaggtta aggttctaaa ggccgtggcc cccatcgaca cggacgacgt cctcttgggc
2461 cagtacggta aatctgagga cgggtctaag cccgcctacg tggatgatga cactgtagac
2521 aaggactcta aatgtgtcac ttttgcagca atgactttca acatcgaaaa cgagcgttgg
2581 gagggcgtcc ccatcatgat gcgtgccggt aaggctttga atgagtccaa ggtggagatc
2641 agactgcagt acaaagcggt cgcatcgggt gtcttcaaag acattccaaa taacgaactg
2701 gtcatcagag tgcagcccga tgccgctgtg tacctaaagt ttaatgctaa gacccctggt
2761 ctgtcaaatg ctacccaagt cacagatctg aatctaactt acgcaagcag gtaccaagac
2821 ttttggattc cagaggctta cgaggtgttg ataagagacg ccctactggg tgaccattcc
2881 aactttgtca gagatgacga attggatatc agttggggca tattcacccc attactcaag
2941 cacatagagc gtccggacgg tccaacaccg gaaatttacc cctacggatc aagaggtcca
3001 aagggattga aggaatatat gcaaaaacac aagtatgtta tgcccgaaaa gcacccttac
3061 gcttggcccg tgactaagcc agaagatacg aaggataatt agaaaaatgc aagcacattc
3121 atttatcggc taagtcactg aaattttttt tttcgagtga ttttatcttg caagctcact
3181 ctctttgttc ttttctcctc tataatggca tctccccccc accaactcct ctctacttgc
3241 gtgtgtattt gtttgtatac atgtgtaaat atatatacat aaaagaatgt cgtctcatgt
3301 attttttaac ttttagccgc ggtcagtgac attttggctt tcccaccatt ccacgtctga
3361 aaaaaaaaaa aaaaaaagag ataaaatatg aattgaatat atatcaaaaa tgtctgcaaa
3421 aatttttacg aatcagtccg attggacttg tacttatcat tgatttgctc gttatgcgtc
3481 ttatccactt gattagctat tccctgtaaa gtcttagaca ctgcctcagt ggccccaatt
3541 aacggccgaa taattgccac aggtgcagct ctagcaacat aaactgctgc ggcgctcggc
3601 cctcccggtt tatcatcctt catctgtccc tttgccctcc ataccgcatc gtacgcaata
3661 tgcatatgct tttctaatga actatacgcc tccttcaaac ccgtgggcag gtcgaggggc
3721 tgatctgcgt acagactcac tattgtgggt tcgttgtgat ctcccggtga agacatatcg
3781 at
//
LOCUS HLTLYS 642 bp ss-mRNA INV 21-JUL-1990
DEFINITION H.rufescens sperm lysin mRNA, complete cds.
ACCESSION M34388
KEYWORDS sperm lysin.
SOURCE H.rufescens testis, cDNA to mRNA, clones r13 and r14.
ORGANISM Haliotis rufescens
Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda;
Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae.
REFERENCE 1 (bases 1 to 642)
AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D.
TITLE Species specific sequences of abalone sperm lysin: The sperm
protein that dissolves a hole in the egg envelope
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by V.D.Vacquier, 22-MAY-1990.
FEATURES from to/span description
pept 16 480 sperm lysin precursor
sigp 16 69 sperm lysin signal peptide
matp 70 477 sperm lysin
mRNA < 1 642 sperm lysin mRNA
signal 625 630 poly-A signal
BASE COUNT 202 a 124 c 151 g 165 t
ORIGIN
1 cgaacagatt acaagatgaa gctgttggtg ctttgcattt ttgcgatgat ggcgacattg
61 gcaatgtcac gcagctggca ttatgttgaa cccaagtttt taaacaaggc ttttgaagtt
121 gcactgaagg tacagatcat tgctggattc gatagagggc tggtcaaatg gcttagagtt
181 catggcagaa cattgtctac cgttcagaag aaggcgctgt acttcgtcaa tagacgttac
241 atgcagaccc actgggcgaa ttacatgctc tggataaata agaagatcga tgcactgggc
301 agaactccag ttgttggtga ctacacacgg ttaggtgccg aaattggaag acgtattgat
361 atggcttact tttacgactt cctgaaagat aagaacatga tcccaaaata tctgccatac
421 atggaagaaa ttaatagaat gcgcccggca gatgtcccag ttaaatacat gggcaagtga
481 aacccatgaa gaagacagct tcatgcaccg tgcatgtcaa cagatacaga aaacatcgtc
541 aaacacgggg agctatgaca ccatcctaac atgggttggc tatttgttgc acgattttta
601 aaacggaaag attgtacatt ccagaataaa tagatttact ac
//
LOCUS HLTLYS2 650 bp ss-mRNA INV 21-JUL-1990
DEFINITION H.corrugata sperm lysin mRNA, complete cds.
ACCESSION M34389
KEYWORDS sperm lysin.
SOURCE H.corrugata testis, cDNA to mRNA, clones p1 and p3.
ORGANISM Haliotis corrugata
Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda;
Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae.
REFERENCE 1 (bases 1 to 650)
AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D.
TITLE Species specific sequences of abalone sperm lysin: The sperm
protein that dissolves a hole in the egg envelope
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by V.D.Vacquier, 22-MAY-1990.
FEATURES from to/span description
pept 16 483 sperm lysin precursor
sigp 16 69 sperm lysin signal peptide
matp 70 480 sperm lysin
mRNA 1 650 sperm lysin mRNA
signal 635 640 poly-A signal
BASE COUNT 194 a 134 c 162 g 160 t
ORIGIN
1 ggaacagatt acaagatgaa gctgttggtg ctttgccttt ttgcgatgat ggcgacattg
61 gctgtgtcgc gccaccgctt ccgttttatt ccacataagt atatacgcaa ggagtttgaa
121 gttgcactga aggtagagat cattgccgga ttcgatagaa cgctggtcaa atggcttaga
181 gttcatggcg gacgtttgag tacagttcag aagaaggcgc tgtacttcgt caatagacgt
241 tacatgcaga cccactggca gaattacatg ctctggatag tgaggaagac cgatgcactg
301 ggcagacctc cagttgttgc tgactacagt cggttaggtg ccgaaattgg aagacgtatt
361 gatatggcat atttttacaa cttcctgaac gggaggaaca tgatcccaaa atatctgcca
421 tacatggagg aaattaatag aatgcgcccg gcagatgtcc cagttgcaaa cagaggcaag
481 tgaaaccaac gaagaagaaa gctgcatgca ccctgcatgt caacagatac agaaaacatc
541 gtccgtcaaa cacgaggagc tgtgacagca tccatcccaa catgggttgg ctatttgttg
601 cacgatttta acacagaaag attgtacatt ccagaataaa tagatttact
//
LOCUS HUMROSSA 1843 bp ss-mRNA PRI 21-JUL-1990
DEFINITION Human 52 kD ribonucleoprotein Ro/SSA, complete cds.
ACCESSION M34551
KEYWORDS ribonucleoprotein.
SOURCE Human thymocyte, cDNA to mRNA, clone FI18.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1843)
AUTHORS Itoh,K., Itoh,Y. and Frank,M.B.
TITLE Heterogeneity in human Ro/SSA ribonucleoproteins: The 52 and 60 kd
Ro/SSA proteins are encoded by separate genes
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by M.B.Frank, 18-MAY-1990.
Author address: M.B.Frank
Oklahoma Medical Res Foundation
Arthritis and Immunology program
825 WE 13th St, Oklahoma City, OK 73104
FEATURES from to/span description
pept 39 1466 52 kD Ro/SSA ribonucleoprotein
signal 1827 1832 poly-A signal
BASE COUNT 474 a 458 c 488 g 423 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcgggc acactgctgt ttaacggcac acttgacaat ggcttcagca gcacgcttga
61 caatgatgtg ggaggaggtc acatgcccta tctgcctgga ccccttcgtg gagcctgtga
121 gcatcgagtg tggccacagc ttctgccagg aatgcatctc tcaggttggg aaaggtgggg
181 gcagcgtctg tcctgtgtgc cggcagcgct ttctgctcaa gaatctccgg cccaatcgac
241 agctagccaa catggtgaac aaccttaaag aaatcagcca ggaggccaga gagggcacac
301 agggggaacg gtgtgcagtg catggagaga gacttcacct gttctgtgag aaagatggga
361 aggccctttg ctgggtatgt gcccagtctc ggaaacaccg tgaccacgcc atggtccctc
421 ttgaggaggc tgcacaggag taccaggaga agctccaggt ggcattaggg gaactgagaa
481 gaaagcagga gttggctgag aagttggaag tggaaattgc aataaagaga gcagactgga
541 agaaaacagt ggaaacacag aaatctagga ttcacgcaga gtttgtgcag caaaaaaact
601 tcctggttga agaagaacag aggcagctgc aggagctgga gaaggatgag agggagcagc
661 tgagaatcct gggggagaaa gaggccaagc tggcccagca gagccaggcc ctacaggagc
721 tcatctcaga gctagatcga aggtgccaca gctcagcact ggaactgctg caggaggtga
781 taattgtcct ggaaaggagt gagtcctgga acctgaagga cctggatatt acctctccag
841 aactcaggag tgtgtgccat gtgccagggc tgaagaagat gctgaggaca tgtgcagtcc
901 acatcactct ggatccagac acagccaatc cgtggctgat actttcagaa gatcggagac
961 aagtgaggct tggagacacc cagcagagca tacctggaaa tgaagagaga tttgatagtt
1021 atcctatggt cctgggtgcc cagcactttc actctggaaa acattactgg gaggtagatg
1081 tgacaggaaa ggaggcctgg gacctgggtg tctgcagaga ctctgtgcgc aggaaggggc
1141 actttttgct tagttccaag agtggcttct ggacaatttg gttgtggaac aaacaaaaat
1201 atgaggctgg cacctacccc cagactcccc tccaccttca ggtgcctcca tgccaagttg
1261 ggattttcct ggactatgag gctggcatgg tctccttcta caacatcact gaccatggct
1321 ccctcatcta ctccttctct gaatgtgcct ttacaggacc tctgcggccc ttcttcagtc
1381 ctggtttcaa tgatggagga aaaaacacag cccctctaac cctctgtcca ctgaatattg
1441 gatcacaagg atccactgac tattgatggc tttctctgga cactgccact ctccccattg
1501 gcaccgcttc tcagccacaa accctgcctc ttttccccat gaactctgaa ccacctttgt
1561 ctctgcagag gcatccggat cccagcaagc gagctttagc agggaagtca cttcaccatc
1621 aacattcctg ccccagatgg ctttgtgatt ccctccagtg aagcagcctc cttatatttg
1681 gcccaaactc atcttgatca accaaaaaca tgtttctgcc ttctttatgg gacttaagtt
1741 ttttttttct cctctccatc tctaggatgt cgtctttggt gagatctcta ttatatcttg
1801 tatggtttgc aaaagggctt cctaaaaata aaaacccgaa ttc
//
LOCUS TRBBS12 4004 bp ds-DNA INV 21-JUL-1990
DEFINITION T.brucei expression site-associated protein (BS1.1 and BS1.2)
genes, 5' end and complete cds.
ACCESSION M34520
KEYWORDS expression site-associated protein.
SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA,
clone BS1.1-2.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 4004)
AUTHORS Hobbs,M.R. and Boothroyd,J.C.
TITLE An expression-site-associated gene family of trypanosomes is
expressed in vivo and shows homology to a VSG gene
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Hobbs, 22-MAY-1990.
FEATURES from to/span description
pept < 1 129 expression site-associated protein (BS1.1)
pept 1318 2151 expression site-associated protein (BS1.2)
site 3083 3604 RIME sequence
BASE COUNT 1181 a 650 c 1052 g 1121 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcaact atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg
61 gcagcagtac atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga
121 gtgctgtgaa ggaaggatgc gacagaacgt gcgctgctta gtgtgaaaga ttatggtaat
181 ggagggttgt gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg
241 ggcaaaagta acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa
301 ggaaagatgt gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg
361 agatgtaagg ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa
421 tttagttaca acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt
481 ctactgttta atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc
541 gaatttcact tctccagaat ctttatttta ggtgcgctat agatgatagt ccaaagagaa
601 ggaattacaa ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga
661 ttcttttcta tgtcacttgt gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg
721 aaatggggcg tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt
781 atgatgcata gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta
841 ggtggaagcg gctaatgatt gctagcagca ccaacagagg aagttaattg ttgacgtagc
901 cgtttgtgaa tgagatagaa ggttattggg tggaacatta tcttggaaag agtttttctt
961 ccactaaaaa tgtgtagagt ggtggtttcg agaaaatggt ttagatattg aggaagaaac
1021 aaggatttga ggtgcaagtt gatgcgctgt atgatttgtc ttgccgttgc tgttacttat
1081 tgttgagttg gtatttgagg aaaatgtttt ttcagtgtgc gttgttatat acggctgtga
1141 gttggtgatg tttatttgaa tgatagcttg ataggataag atgaaataat actaagcatg
1201 tattcttctg gctgttcgcc gaaatggtaa tctatgttac ccatagcacg aaaaaagttt
1261 ttttactcag taactgaagc ggaaagtttt aaaaagcata atttaaaaaa gttgataatg
1321 atgaagtttt ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa
1381 aataaaagga atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgaactat
1441 cttaaaggaa tagcgcacag agtaaacagc gagagtgccg tggttacgaa aaaactatca
1501 gatttgaaaa tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct
1561 ggcgagcagg attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt
1621 acattccagc aggagttgga ggaaatgagg aagccatccg cgttagcagc agctgcagct
1681 gggatagcag ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg
1741 tcatcacagt tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa
1801 gaatgttttg atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta
1861 aaagagtcgg cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat
1921 agcttgggtg cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga
1981 acagatactg acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt
2041 ggtggagtga tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa
2101 tacgtgatgc tacagagtac ggtgcagtac ggtggaccga ggatccagta aagtgagcat
2161 atttaaggat gtcatccgcc tgtttgcgcg cttccaagaa gcaaaaaatg aagtgatgaa
2221 taaaataaaa actactgtgg atgaattggc gaaatgcata gggcagaaag aggttgagct
2281 cactgatgat cagctttacg aggaatttat ctgggagacc atacacagat tggagctgtc
2341 aaagagagtg agtgaacaac tctctttagg tgaagaagag gagacgatcc tgaaatctaa
2401 ctatactgca gaaccggtcc gtggactttc acggtagcgg ggtccaacgc ggcagcagta
2461 catttgagtg tttctaccgc tgcactttgt ttttcagttt tattgttggg agtgctgtga
2521 aggaaggatg cgacagaagc tgcgctgctt agtgtgaaag attatggtaa tggagggttg
2581 tgaaagattg ggggaacaaa aacctatttc ttttatttgg gggaacaaat gggcaaaagt
2641 aacgtaagtt tccagtggga gtggtatgtg tgtgtgtatg gggctggcta aggaaagatg
2701 tgagttcggc atgtggtatg tacaagctac gaaaacgtgt gaaacaaaac gagatgtaag
2761 gggaaaatgt aacaaccaac tatgttaaat ttcaggagac tatttttcaa atttagttac
2821 aacaaagtaa atgtcaaata atgccaacta taaaggaaaa ctagactaat tctactgttt
2881 aatgtgacga gaattgtggc gtgaaaccta gtaactgctt atcatggcgt cgaatttcac
2941 ttctccagaa tctttatttt agtgcgctat agatgatagt ccaaagagaa ggaattacaa
3001 ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga ttcttttcta
3061 tgtctcttgt gaaggtcgat ctttttgcta ccccctggcg atgccggcca cctcaacgtg
3121 gtgccagggt ccagtgaccc cgtatcattg ggggaagcca agagccagca gcgttccttt
3181 catggggaac actgctgtag ctccggctac gcatcataca gcacagggat cagcaggcgt
3241 cttgctggga caccgttttt catttgtcgg tccctgggca cgtgccagcg tcgatcagca
3301 gtatcatccg cgctaagatg ctgctgtccg gtgatgtgga cctccaaaaa aggattgcca
3361 ttttggcatc ttttggagag tccagggtgg gaggcttctc gccccatctg ctgtattccg
3421 ttcaactgcg gacgtacaac aaaaaattat agagggtgtg ttaggatgaa tgaaaaaggg
3481 agactctgcc acagtcgcca gaccgatagc actctcaggg ctctacggtg ttggctgatg
3541 gccgcgcaga ggggggaaac tctcacgaag gcacgaagaa aattcaaaaa aaaatttttg
3601 ctaccacagg ctgtaaggat aacgctggag atggagccga ctgtggtttt tcttgtgaat
3661 gcgtatcttc gtaaccggtt gttaagaagg gatcggacga aggaatttgt ctgagtatga
3721 ggtaatgtcg aggttttctt tcattggatg tagtaataat ctccaaattt gttggaacat
3781 ataatattat tgcaataaat gcttgaagag tggaattatt gatggattga cttgagtatc
3841 atgattccaa tgaacactct tataatcctt cgaatagcaa acgtatgaac aaattaacgt
3901 agcgtcatgt agacatggac atttttgtaa tgatacgact gctattgcac atctgaaaat
3961 ttctatgacg gaccagtgcc gttaatcacc gtacaattga attc
//
LOCUS TRBBS134 2396 bp ds-DNA INV 21-JUL-1990
DEFINITION T.brucei expression site-associated protein (BS1.3 and BS1.4)
genes, 3' and 5' ends.
ACCESSION M34521
KEYWORDS expression site-associated protein.
SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA,
clone BS1.3-4.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 2396)
AUTHORS Hobbs,M.R. and Boothroyd,J.C.
TITLE An expression-site-associated gene family of trypanosomes is
expressed in vivo and shows homology to a VSG gene
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Hobbs, 22-MAY-1990.
FEATURES from to/span description
pept 1836 > 2396 expression site-associated protein (BS1.4)
ORF < 1 1206 expression site-associated protein (BS1.3)
BASE COUNT 744 a 357 c 642 g 653 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttgcaa ctccatgaag tgttatcatc catttccttc ggtagcttgg gtgtgaaaaa
61 tattcgagga ggaaatggga gggatggatg caatttggtg agaacagaca ctaacggtat
121 attaaatggg ggttcaccaa cacggcacaa cttaacgtgg ggtggtggag tgatgaattt
181 tggatcctat caaaacggaa gtatgtacgt tgagggtggt gaatacggtg atgctacaga
241 gtacggtgcg gtacggtgga ctgaagatcc cagtaaagtg agcatatttg aggatgtcat
301 tcgcctgttt gcgcgcttcc aagaagcaaa aaatgcagtg atgaggagaa taaaaactac
361 tgtggatgaa ttgacgaaat gtatagggca gaaggaggct gaactcacta atgatcagat
421 ttacgaggaa tttatctggg agaccataca cagattggag ctgtcaaaga gagtgagtga
481 acaaccctct ttaggtgaag aagaggagac aatcctgaaa tctaactata ctgcagaacc
541 tgtccgtgga cctttcacgg gagcggggtc caacacagta gcagtacatt tgagtgtttc
601 taccgctgca ctctgttgtt tagctttatt gttgggagtg ctgtgaggga aggatgcgac
661 agaaactgcg ctgcttagcg tgaaagatta tggtaatgga gggttgggaa agattggggg
721 aacaaaaacc tatttctttt atttggggga acaaatgggc aaaagtaacg tgagtttcca
781 gcgggagtgg tgtgtgtgtg catggggctg actaaggaaa gatgtaagtt cggaatgtgg
841 tatgtacaag ctacgaaaac gtgtgaaaca aaacgatatg taaggggaaa atgtaacaac
901 caactatgtt aaatttcaag agactatttt tcaaatttag ttacaacaaa gtaaatgtca
961 aataatgcca actataaagg aaaactagac taattctact gtttaatgtg acgagaattg
1021 tggcgtgaaa cctagtaacg accacgtcac gatgtggaat ttcacttctc cagaatcttt
1081 gttttagtgc gctatagatg atagtccaaa gagaaggaat tgcagttatt ataacccatc
1141 caaagataga tcaaagactc ctcttctccg agtgattctt ttctatgtca cttgtggaag
1201 gtcgatcttg gaacaatatc gaacgtgtga agcctgaaat gaggcgttat tattgcttct
1261 ctttttggtg gttgttagag ttatataccg ctatggtatg atgcacagag gtaacgctaa
1321 aaataagtga caaggcgttg tagtagggcg tatggtagtt ggaagcggct aatgattgct
1381 ggcctcacca acagaggaag ttaattgttg acgtagccgt ttgtgaatga gatagaaggt
1441 tatcgggtgg aacattatct tggaaagagt ttttcatcac ctaaaaatgt gtagagtggt
1501 ggtttggaga aaatggttta gatattgagg aggaaacaag gatttgaggt gcaagttgat
1561 gcgctgtaag atttgtatta ccgttggtgt tacttattgt tgagtgagta tttgagggaa
1621 atgttatggg aggtgcttcg ttgttacata cggctctgag tcagtggtgt tcatttgaat
1681 gaaatcttga taggataaga tgaaataata ctaagcatgt attcttctgt gctgccgccg
1741 aaatggtaat ctatcttacc tacagcacga aaaatttatt ttttttactc agtaacaaag
1801 cggaaagttt taaaagcata atttaaaaaa gtgatatgat gaggttttgg tttgtgttgt
1861 tggccctttt gggaaaaaaa acacatgcat attatgaaaa tgaaaggaat gcattaaacg
1921 caaccgccgc taataaagtt tgtgggctat cgaactatct taaaggagta gcgcacagag
1981 taaacagcga gagtgctgtg gttacggaaa aactatcaga tttgaaaatg agaagcatcc
2041 agttgcagct atcagtaatg cggaacagag tgccttctgg cgagaaggat tgtaaagaca
2101 taaggacact cttgaaaaca gtattgagga atgagtttac attccagcag gagttggagg
2161 aaatgaggaa cgcatccgcg ttagcagcag ctgcagctgg gttggcagct ggaagactgg
2221 aagagtggat ttttgtattt gcacaggcag cagacaggtc atcacagttt tgcataagtg
2281 tggggaagca tatcgcggct gagcacggca acttgcaaga atgttttgat ggaacaattg
2341 gacctgaaac cctttacaaa attgaggatt cacgtgtaaa agagtcggcg cagaaa
//
LOCUS TRBBS15 714 bp ss-mRNA INV 21-JUL-1990
DEFINITION T.brucei expression site-associated protein (BS1.5) gene, partial
cds.
ACCESSION M34522
KEYWORDS expression site-associated protein.
SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.4, cDNA to
mRNA, clone BS1.5.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 714)
AUTHORS Hobbs,M.R. and Boothroyd,J.C.
TITLE An expression-site-associated gene family of trypanosomes is
expressed in vivo and shows homology to a VSG gene
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Hobbs, 22-MAY-1990.
FEATURES from to/span description
pept < 1 > 714 expression site-associated protein (BS1.5) (AA
at 1)
BASE COUNT 226 a 116 c 203 g 169 t
ORIGIN
1 agcgagagtg ccgtggttac ggaaaaacta tcagatttga aaatgagaag catccagttg
61 cagttatcag taatgcggaa cagagtgcct tctggcgaga aggattgtaa agacatcagg
121 acactcttga aaacagtatt gaggaatgag tttacattcc agcaggagtt ggaggaaatg
181 aggaacgcat ccgcgttagc agcagctgca gctgggttgg cagctggaag actggaagag
241 tggatttttg tatttgcaca ggcagcagac aggtcatcac agttttgcat aagtgtgggg
301 aaaactatcc caccggaaca aaacaacttg caagaatgtt ttgatggaac aattggacct
361 gaaacccttt acaaaattga ggattcacgt gtaaaagagt ctgcaaagaa aagcttgcaa
421 ctccatgaag cgttatcatc catttccttc aatagtttgg gtgcggaaag tatccgagga
481 ggaaatggga aggatggatg taatttggtg agaacagaca ctgacggtat attaaatggg
541 ggttcaccaa cacggcacaa tttaacgtgg ggtggtggag tgatgaattt tggatcctat
601 caaaacggaa gtatgtacgt tgagggggga gaatacggtg atgctacaga gtacggtgca
661 gtacggtgga ctgaagatcc cagtaaagta agcatattta aggatgtcat tcgc
//
LOCUS HUMCALCAT 1308 bp ss-mRNA PRI 21-JUL-1990
DEFINITION Human calcineurin catalytic subunit mRNA, complete cds.
ACCESSION J05480
KEYWORDS calcineurin.
SOURCE Human adult cerebral cortex and 7-year-old cerebellum, cDNA to
mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1308)
AUTHORS Kincaid,R.L., Giri,P.R., Higuchi,S., Tamura,J., Dixon,S.C.,
Marietta,C.A., Amorese,D.A. and Martin,B.M.
TITLE Cloning and characterization of molecular isoforms of the catalytic
subunit of calcineurin using nonisotopic methods
JOURNAL J. Biol. Chem. 265, 11312-11319 (1990)
STANDARD full staff_review
COMMENT Authorin submission for [1] kindly submitted by R.Kincaid,
08-MAY-1990, for release after publication.
Two cDNA isoforms were found that either contained (class 1) or
lacked (class 2) base pairs 535 to 564, but otherwise appeared
identical; this region presumably represents an alternatively
spliced exon.
FEATURES from to/span description
pept < 1 762 calcineurin catalytic subunit
signal 921 926 poly-A signal
signal 927 932 poly-A signal
BASE COUNT 401 a 251 c 283 g 373 t
ORIGIN
1 ttacagcaca ataacttgtt atctatactc cgagcccacg aagcccaaga tgcagggtac
61 cgcatgtaca ggaaaagcca aacaacaggc ttcccttctc taattacaat tttttcagca
121 ccaaattact tagatgtata caataacaaa gctgcagtat tgaagtatga gaacaatgtt
181 atgaatatca ggcaattcaa ctgttctcct catccatact ggctcccaaa tttcatggat
241 gtttttactt ggtcccttcc atttgttggg gaaaaagtga ctgagatgct ggtaaatgtc
301 ctcaacatct gctcagatga tgaactaggg tcagaagaag atggatttga tggtgcaaca
361 gctgcagccc ggaaagaggt gataaggaac aagatccgag caataggcaa aatggccaga
421 gtgttctcag tgctcagaga agagagtgag agtgtgctga cgctgaaagg cttgacccca
481 actggcatgc tccccagcgg agtactttct ggagggaagc aaaccctgca aagcgctact
541 gttgaggcta tcgaggctga tgaagctatc aaaggatttt caccacaaca taagatcact
601 agcttcgagg aagctaaggg cttagaccga attaatgaga ggatgccgcc tcgcagagat
661 gccatgccct ctgacgccaa ccttaactcc atcaacaagg ctctcacctc agagactaac
721 ggcacggaca gcaatggcag taatagcagc aatattcagt gaccacttac tgttcacttt
781 tttttttttt tttttttttt tttttgagct gcggggcatg atgggattgc tgcatatcag
841 cagttggatg ttcttgcctc tgacagtagc ttatttgctc tgggggccag gaattggatt
901 cagtttacac tatcattaaa aataaaaata aaaaaagggg gagagagata ataaactata
961 ttttggtggg gctggtgatt aaacacctct tttgggtatg ccttttaaaa atgcttatag
1021 agaaaaaaaa ttttaaaaag aaagctaatg ctagatatac tgcaatgtta ggggaatgaa
1081 catgttttcc tactgaattg gggacttcta gataggttaa ggttttattc tgttactgga
1141 catcaaaact ttgtctaatt tcttactcta ttgtacgttt acagtcgcag cactaaaatg
1201 gatgtcatca aacattttta acatgatgta caaactaagg actatttatt gataatgttt
1261 tgctactctt gtcagacaat ggctataaac tgaattaggc agtcttaa
//
LOCUS MUSCALCAT 2194 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Mouse calcineurin catalytic subunit mRNA, complete cds.
ACCESSION J05479
KEYWORDS calcineurin.
SOURCE Mouse (strain NIH Swiss) 18-day-old adult brain, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2194)
AUTHORS Kincaid,R.L., Giri,P.R., Higuchi,S., Tamura,J., Dixon,S.C.,
Marietta,C.A., Amorese,D.A. and Martin,B.M.
TITLE Cloning and characterization of molecular isoforms of the catalytic
subunit of calcineurin using nonisotopic methods
JOURNAL J. Biol. Chem. 265, 11312-11319 (1990)
STANDARD full staff_review
COMMENT Authorin submission for [1] kindly submitted by R.Kincaid,
08-MAY-1990, for release after publication.
Two cDNA isoforms were found that either contained (class 1) or
lacked (class 2) base pairs 1414 to 1443, but otherwise appeared
identical; this region presumably represents an alternatively
spliced exon.
FEATURES from to/span description
pept 76 1641 calcineurin catalytic subunit
mRNA < 1 2194 calcineurin catalytic subunit mRNA
signal 1786 1791 poly-A signal
signal 1792 1797 poly-A signal
BASE COUNT 624 a 488 c 529 g 553 t
ORIGIN
1 gcgccggtgc ggtcggggtg tgcagtcgga cgggacgagc agcgcgtcgc tgtccccccc
61 tcccggtggc tggagatgtc cgagcccaag gcgattgatc ccaagttgtc gacgaccgac
121 agggtggtga aagccgttcc atttccacca agtcaccggc tgacagcaaa ggaagtgttt
181 gataatgatg ggaaacctcg tgtggatatc ttaaaagcac atctcatgaa ggagggcagg
241 ctggaagaaa gtgttgcatt gagaataata acagagggtg cttcgattct ccgacaggaa
301 aaaaacttgc tggatatcga cgcaccagtc acagtttgtg gggacatcca tggacaattc
361 tttgacttga tgaagctctt tgaagtggga ggatctcctg ccaacactcg ctacctcttc
421 ttaggggact atgttgacag agggtacttc agtatcgaat gtgtgctgta tttgtgggcc
481 ttgaaaattc tttaccccaa aacactgttt ttacttcgcg gaaaccatga atgtaggcac
541 ctcacagagt atttcacgtt taaacaagaa tgtaaaataa agtattcaga acgcgtttat
601 gacgcctgta tggatgcctt cgactgcctt cccctggctg cgctaatgaa ccagcagttc
661 ctgtgtgtac acggtggttt gtctccagag attaacactc tagatgacat cagaaaatta
721 gaccgattca aagaaccacc tgcttatggg cccatgtgtg acatcctatg gtcagacccc
781 ctggaggact ttggaaatga gaagactcag gaacatttca ctcacaacac agtcagaggc
841 tgttcgtact tctacagtta cccagctgtg tgtgacttcc tgcagcacaa taatttgttg
901 tccatactcc gcgcccacga agcccaggat gcagggtacc gcatgtacag gaaaagccaa
961 acaacaggct tcccgtctct aattacaatc ttctcggcac caaattactt agatgtgtac
1021 aataacaaag ctgcagtgtt gaagtacgag aacaatgtga tgaacatcag gcagttcaac
1081 tgctccccgc atccgtactg gctcccaaat ttcatggatg ttttcacctg gtcgctgcca
1141 tttgttgggg agaaagtgac tgagatgctg gtcaatgttc tcaacatctg ctccgacgat
1201 gaactggggt cagaagaaga tggatttgac ggagccacgg ccgcagcccg gaaggaagtc
1261 atcagaaaca agatccgagc aataggcaaa atggccagag tgttctcagt tctcagagaa
1321 gagagtgaga gtgtcctgac actgaagggc ctgaccccaa ctggcatgct ccccagcgga
1381 gtgctctctg gcgggaaaca gactctgcaa agcgctactg ttgaggctat tgaggctgat
1441 gaagccatca aaggattttc accacaacat aagatcacta gcttcgagga ggccaagggc
1501 ttagaccgaa ttaacgagag gatgccacct cgcagagacg ccatgccctc tgacgccaac
1561 cttaactcca tcaacaaggc tctcgcctca gagactaacg gcacggacag caatggcagt
1621 aatagcagca atatccagtg accacttcct gttcactttt tttttttttt gagctgcagg
1681 gcatgatggg attgctgcat ctcagcagtt ggatgttctt gcctctgaag gtagcttgtt
1741 tgctctgggg gccaggaatt ggattcagtt tacactatca tgaaaaataa aaataaaaaa
1801 agagggagag agataataaa ctatattttg gtgagggtgg tgattaaaca cctcttttgg
1861 gtatgccttt aaaaaatgct tctagggcaa aaaagtttta aaaagaaagc taatgctagc
1921 tatactgcaa tgttagggga atgaacgcgt tttcctactg cactggggac ttttagatag
1981 gttaatgaaa ggcctttatt ctgttactgg acacgaaaac tttgtctaat ttcttatact
2041 ctattgtacc tttacagtcg cagcactaaa atggaagaca tcaaacattt ttaacagaaa
2101 aaaaaaaaga tgtaaaaact aactaaggac tatttattaa tgatgttttg ctactcctgt
2161 cagacaatgg ctataaactg aattaggcag tctt
//
LOCUS RATGGLUT 1060 bp ds-DNA ROD 21-JUL-1990
DEFINITION Rat gamma-glutamyltransferase gene, 5' promoter region.
ACCESSION J05515
KEYWORDS gamma-glutamyltransferase.
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1060)
AUTHORS Rajagopalan,S., Park,J.H., Patel,P., Lebovitz,R.M. and
Lieberman,M.W.
TITLE Cloning and analysis of the rat gamma glutamyltransferase gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Rajagopalan, 25-MAY-1990.
BASE COUNT 235 a 277 c 275 g 273 t
ORIGIN
1 cagctgcctt ctggaggacc aaactgttca ggggaaggac aggaagaaat gagcctgtgc
61 cttcaggtca gagtcatgcc tagatctggg cgggagagct acaagggata ctgaccagga
121 gatagggtgt tgtcccctcc cccctggggt ttggtatcct cctctgcctt aagagttgca
181 aatcgacttt cccacataac aggcaccaaa tccagttagg accaacccca ccttccaatc
241 caggggagag gaatgtcagc aatgcgtggg cgtgtccttc taatgtgttt tccttgagtg
301 ttgtatgtgg accatctgca tgctcggtac ccagaggcca tcaggtctct tggaacagga
361 attgttgatg tgaaatgcca tgtggttgct gggataggaa ctcaggactc cggaagaacc
421 ttctcttctc cagtccccct ctgttgtttt tttttttttt ttttttgaga tacgatctca
481 cactgtagca caggctaatc cagaactcac taggtaggtc agactgggct caaatcacag
541 cgattctgct tctgcttcct gagtgccagg gtttgcaggt gttagctatc atgcccagtc
601 ttaacatttc acacacgcca gtccaagtta ttaaaaaaca acccggcagt tgagggcagg
661 gccctcaagt cccacaactg gtgcgtgcgt accaagtcca atgcgggaaa ggcctggacc
721 cttgaaccct ttgggcggtt cacttgttag ctcttactac caaatcctgg gcttacacat
781 gaatgccagc ccctccctgc ccagttctgt gacccccttc cccgggcagc tcttgggaga
841 agtcatgcat acatggaggc ggtgccagcc tctttgactc cagagttcag cgggagacag
901 agggagctca tcacatcagg caccccagaa gagttctggg cctgcttcac gtttaacttt
961 gtgattttca ggagtaccag cctgctctaa cggtttcagg gaagattggc tgtgggtttc
1021 cgcagagtgt gggggagttc ctgcttatcc atacagctga
//
LOCUS YSCRGI 130 bp ds-DNA PLN 21-JUL-1990
DEFINITION Yeast 35S ribosomal RNA gene, 5' end.
ACCESSION M35269
KEYWORDS 35S ribosomal RNA.
SOURCE Yeast DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 130)
AUTHORS Riggs,D.L. and Nomura,M.
TITLE Specific transcription of Saccharomyces cerevisiae 35 S rDNA by RNA
polymerase I in vitro
JOURNAL J. Biol. Chem. 265, 7596-7603 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 101 > 130 35S ribosomal RNA
BASE COUNT 48 a 9 c 43 g 30 t
ORIGIN
1 agagggcaaa agaaaataaa agtaagattt tagtttgtaa tgggaggggg ggtttagtca
61 tggagtacaa gtgtgaggaa aagtagttgg gaggtacttc atgcgaaagc agttgaagac
121 aagttcgaaa
//
LOCUS ACNRRDA 1486 bp ss-rRNA RNA 21-JUL-1990
DEFINITION A.lignieresii 16S ribosomal RNA.
ACCESSION M35017
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE A.lignieresii (strain ATCC 19393) rRNA.
ORGANISM Actinobacillus lignieresii
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 1486)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1486 16S ribosomal RNA
BASE COUNT 384 a 303 c 448 g 321 t 30 others
ORIGIN
1 attnannant ttnatcangg ctcagattna ncgctggcgg caggcttaac acatgcaagt
61 cgaacggtaa cgggaagggn ncttncnttc tttgccgacg agtggcggac gggtgagtaa
121 tgcttgggaa tntggcttat ggagggggat aactacggga aactgtagct aataccgcgt
181 aatatcttag gattaaaggg tgggactttc gggccacctg ccataagatg agcccaagtg
241 ggattaggta gttggttagg taaaggctga ccaagccgac gatctctagc tngtctgaga
301 ggatgaccag ccacactnga actgagacac ggtccagact cctacgggag gcagcagtng
361 ggaatattgc acaatggggg gaaccctnat gcagccatgc cgcgtgaatg aagaaggcct
421 tcgggttgta aagttctttc ggtagcgagg aaggtatcaa atttnataga tttngtaatt
481 gacgttaact acagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag
541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggtt nattaagtga
601 gatgtgaaag ccccgggctt aacctgggna ttgcatttca tactggtcaa ctagagtact
661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc
721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcgaaagc gtggggagca
781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct
841 ntnagtctgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag
901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg
961 atgnaacgcg aagaacctta cctactcttg acatccatgg aatcttgtag agatacgaga
1021 gtgccttcgg gaaccatgag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat
1081 gttgggttna gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attnggtcgg
1141 gaactcnaag gagactgccg gtgatnaacc ggaggaaggt ggggatgacg tcaagtcatc
1201 atggccctta cgagtagggc tacacacgtg ctacaatggc gtatacagag ggaagcaaga
1261 tggcgacatg gagcaaatct cacaaagtac gtctaagtcc ggattggagt ctgcaactcg
1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc
1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc
1441 ttaaccgcaa ggggggcgtt taccacggta tgattcatga ctnggg
//
LOCUS BNORRDA 1532 bp ss-rRNA RNA 21-JUL-1990
DEFINITION D.nodosus 16S ribosomal RNA.
ACCESSION M35016
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE D.nodosus (strain 198A; ATCC 27521) rRNA.
ORGANISM Bacteroides nodosus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Anaerobic gram-negative straight, curved and helical rods;
Bacteroidaceae.
REFERENCE 1 (bases 1 to 1532)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1532 16S ribosomal RNA
BASE COUNT 410 a 330 c 465 g 324 t 3 others
ORIGIN
1 aactgaagag tttgattctg gctcagattg aacgctggcg gcatgcttaa cacatgcaag
61 tcgaacgggg ttatgtagct tgctatgtaa cctagtggcg gacgggtgag taatatatag
121 gaatctgcct tatggtgggg gacaacgtat ggaaacgtac gctaataccg cataagattg
181 aagaatgaaa gcgggggctc gaaagacctc gcgccgtaag atgagcctat atcggattag
241 ctagttggtg gggtaagagc ctaccaaggc gacgatccgt agctggtttg agagaatgat
301 cagccacatc gggactgaga cacggcccna actcctacgg gaggcagcag tggggaatat
361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt
421 gtaaagcact tttattagtg aagaacggtg catggttaat acccatgcaa ttgacattag
481 ctaaggaaaa agcaccggct nactccgtgc cagcagccgc ggtaatacgg agggtgcaag
541 cgttattcgg aatgactggg cgtaaagcgc acgcaggtgg ttttataagt caggtgtgaa
601 atccctgggc tcaacctagg aattgcattt gaaaccgtaa gactagagta tgtcagagga
661 aggcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga
721 aggcagcctt ctggggcaat actgacactc atgtgcgaaa gcgtgggtag caaacaggat
781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggtgttgggt agaaatactc
841 ggtaccgaag ctaacgcaat aagttgaccg cctggggagt acggccgcaa ggttgaaact
901 caaagaaatt gacggggacc cgcacaagcg gtggagcatg tggtttaatt cgatgcaacg
961 cgaagaacct taccaggctt tgacatccag agaattctac agagatgtgg gagtgccttc
1021 gggaactctg agacaggtgt tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt
1081 aagtcccgca acgagcgcaa cccttatcct tacttgccag cacgtaaagg tgggaactat
1141 aaggagactg ccggtgacaa accggaggaa ggtggggatg acgtcaagtc atcatggccc
1201 ttacagcctg ggctacacac gtgctacaat gggcggtaca gacggtagcc aacccgcgag
1261 ggggagccaa tctgagaaaa ccgttcgtag tccggattgc agtctgcaac tcgactgcat
1321 gaagtcggaa tcgctagtaa tcgcgaatca gcacgtcgcg gtgaatacgt tcccgggtct
1381 tgtacacact gcccgtcaca ccatgggagt ttgttgcacc agaagtaggt agcttaaccg
1441 taaggagggc gcttaccacg gtgtggccga tgactggggt gaagtcgtaa caaggtaacc
1501 gtaggggaac ctgcggttgg atcacctcct tt
//
LOCUS CAJSAPA 3974 bp ds-DNA BCT 21-JUL-1990
DEFINITION C.fetus surface array protein (sapA) gene, complete cds.
ACCESSION J05577
KEYWORDS surface array protein.
SOURCE C.fetus fetus DNA.
ORGANISM Campylobacter fetus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 1 to 3974)
AUTHORS Blaser,M.J. and Gotschlich,E.C.
TITLE Surface array protein of Campylobacter fetus: Cloning and gene
structure
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.J.Blaser, 01-JUN-1990.
FEATURES from to/span description
pept 24 2825 surface array protein (sapA)
binding 12 15 ribosomal binding site
signal 2846 2870 transcription termination signal
BASE COUNT 1293 a 641 c 785 g 1255 t
ORIGIN
1 ctattttatt aaggagtcct taaatgttaa acaaaacaga tgtttcaatg ctttatatca
61 ctattatggg tatggcaagt gagggtgatg gtaataagta ttggttagat tatgccaata
121 ataatagttt aggagtttca agtttagcta atattatgct tgatagtcca ggggcggcta
181 aattctttgg tgattctctt ttagcaggta atgaaaaaga ttttgttact aagatatata
241 gtatagcttt aggtaatact agtgatgttg atggcattaa ttattggact aaggcaataa
301 ctggcggtgg agaatttact gatagtaagg gtaatgttat tagtgttgct agtttaagca
361 agggtgattt aataggtgct atgattaact ctatggttaa tggcggtagt gctgagtcta
421 aggctatatt tgaggctaag gcagctgcta gtgattactt tgccgatgct actttggtaa
481 gggatattag tggattagat gagggtacta cttctaagtt aattagcgag attaatagtg
541 ctagtgatct tgataaggtt aagagtgaga ttgatgcttt gaagagtgag ctacctaatc
601 cgggtagtac ttatgatctt acagagggta atgataattt aaagggtact gatttagacg
661 atacttttaa tgggactaca tatgtaggta atggtactaa taagagtact cttagtgcat
721 ttgataagac tagatggtcg gtgcttggga gagatacgtt gaatgcgata tttactgcaa
781 taacacgcgc tgcgctacta actgatcaag ctgaattgat aatcactaaa aggcgtacaa
841 acgtagaaaa tatcaatata atttcagatc tagaaacaag tggcgatttc gttttcaacg
901 gttatgaaaa agtaggattt aacgtacttg gcgatatagt tagctttgct accgacgcat
961 ctaaaagcgt aaatgtagaa acaacaggaa cgataactgc tttcaccgca gccggaacag
1021 gcaaagtcga tgttgtcgcc ggtaaaatct ctgcccttac ggccgattcg cgaacaagcg
1081 taaatttaac tgctacaaac gacactatca cattaaccag tgcaaacgct gctactagtg
1141 tgaatttaaa acagcggcag gccaaagacg ctacaataac atccgcaatg cagcaaaaat
1201 ataacaatag acgcaacagg attgcaacta taacttcagc tacggctgta gagaatttga
1261 cagttaaaca tgcaactaac gtagcgctaa atggtggcat ggataaactt gcaacagtta
1321 ctcttgacaa tgctgcttta actgctgcaa tagatataaa atctgcaagc acactaaatt
1381 taataaattc aagtgttaac ggaccaaaac atctctacag cagcaaaaga cgttactgta
1441 aatttaagcg agctgctgct aaggttaaat taaacacaac tgctgctacg gatcaaactg
1501 tgactcttaa agctaatgcc acagacaata gtcttgaatt tgatagcgca actgcaaaaa
1561 ctacgtcagt cactgctagt ggtagtggaa aaactttggt tatcaaaggt gctgaagtag
1621 aaactctagt taatatagac actactgcgt tcaatgcgct gcagagtgtt agcttcggca
1681 aaactggtca aggcggcaaa tttagcgtaa aaacaggtac aggcgatgat aaaatagagt
1741 ttgtaggcac aactttaact gaaggttcag taatcgatgc gccgggtaat gatactatag
1801 ctatgaaatc agctgctttg actagtgcaa atttcactat gatcaaaaac atagagaatg
1861 ttgctattag cgacgcggtg gctacggctg accttagctc ttctgcgttt aaaaactcag
1921 tcatcataac tactaaagag gctgccgata cgactttaac tataaataaa gaccaagtaa
1981 ttaattttac tgcggcagat gcaggctcag ttaagcttat tacagtgaaa ctaaatgatg
2041 taacagcgct aatgattgtt gttaaaatag tattagatgc tgccgctaaa gatactaata
2101 tagctttggg cacagcagca gccgataaag ctcttgtaat tgatacgggt atagaaactc
2161 ttaatatcac ttcgttagtt aaagctacaa gccctgaaac tacagcaaat accgtaaatg
2221 ctaaattgac agatgtaact tctatcatta tagatgggat gcaaattact ttaggtcatg
2281 ccggcactgc tggaactgac tatagtaaag tttctatgat agatgcaagt gctcttaaag
2341 ctggtcttac atttgatgca agtgctataa cattaggagc taatgctact ataaaaggtg
2401 gctctggagc tgatagtata actgttaaag gcggaaatat cgttgtagat ttagttgctg
2461 gcggagacga tactataact cttaaaaaag gtgctgagaa aacagatatt acaaccgtta
2521 ataactttaa tgctggcgat aagatagata tagctgatgc caaaaatggt actttcacat
2581 ttaataaaat tacaatgaat agcgatgcta atttagatga ttatatcact aaagcagtag
2641 caggagacgg atcaacaaac tctgctgtaa gctactcaca taatggatat acttatgtag
2701 tagttgatgg aacagctgga gctactttca ctaaagctac tgatacaatt atcaaacttt
2761 caggtacgct agatcttaaa cttagcggag ataatgttgt agttgatgat ggaagcgtaa
2821 tttaagcaat agtaattgct tgataaaaaa gccttaattt ttaaggcttt taatttgtct
2881 aaatttatct tagcttaggc gtatttattt gaaatctcgt cagacggatc gtcttaaaat
2941 aaggttttat agagacaaag tatattttag accaaacctt aataaatagt agatagtttt
3001 ttgctaagag gagctaggag aactagctcc ttgaatattt ttaaatttat ttgagttgta
3061 ttttaaattc aacatatctg ttttttgatg ttttgatcga aatttgtcta ttgtatccat
3121 cctaaaaatt tcagctcttc aactcttttt gtgtgaaatt gttctaataa ggttgataaa
3181 ttttcatatt tgtcacttgg aacaagaaaa ctctcgtctc cgactctttt caccaatttt
3241 tgtttataaa actttaaatt taaaggtttt ataagatcta tatatgaact aaactcatct
3301 ttatagctat tagctatact aaaagcgatc tgcttgaact ctttgaaatc aacgtttaaa
3361 attttacaaa gcggcttaaa atcgtttagc ctaatgtttt gacgcttacc gtttataggt
3421 aaatagctct caaatccttt tgccgtacta taaaatcccg ctgctcatca cgtgcataca
3481 taggcgcatc agcactatat ctccgcttgt tataagagaa agatttttca gcatgcagat
3541 cttccgtatt ttatgataac gctataaaca tatatcttca aaagctcctt cttttcagta
3601 tctgatagta gctctttgct tatgcgctta aaaagctttt cgctggttgt gtcatattta
3661 tttatactag tagtcctaga tacgaggcaa actcgacgac gttataatga tgattttcta
3721 gtttatcaaa tcttttaatt acgtagtgat actcgtcgtc atctcctgcc actacagcac
3781 ttcacggaac tctgattttc aacgtatttt tagcaaaact caaatgaaga tgctcattta
3841 tcagcgaggt gcggaaaata acttttgaac tggttggatt tgatctgatt ttcgaatatg
3901 gttttaaaat atagtttgaa gcagcgtctt gatggattat ctttcgttca aaatcgatat
3961 ttactaattt tttg
//
LOCUS CDBRRDA 1470 bp ss-rRNA RNA 21-JUL-1990
DEFINITION C.hominis 16S ribosomal RNA.
ACCESSION M35014
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE C.hominis (strain ATCC 16826) rRNA.
ORGANISM Cardiobacterium hominis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods.
REFERENCE 1 (bases 1 to 1470)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1470 16S ribosomal RNA
BASE COUNT 373 a 331 c 466 g 296 t 4 others
ORIGIN
1 actgaagagt ttgattctgg ctcagattga acgctggcgg catgcttaac acatgcaagt
61 cgaacggaaa cgatggagct tgctccaggc gtcgagtggc gaacgggtga gtaacgcatg
121 ggaatctgcc ttttgctggg ggataacgta gggaaactta cgctaatacc gcataacacc
181 taagggtgaa agcgggggac cgaaagcctc gcggcaagag atgagcccat gttggattag
241 ctagttggtg gggtaaaggc ctaccaaggc gacgatccat agctggtctg agaggatgat
301 cagccacact gggactgaga cacggcccag actccnacgg gaggcagcag tggggaatat
361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt
421 gtaaagcact ttcagtaggg aggaaaggtg cgtagttaat acctgcgcaa ttgacgttac
481 ctacagaaga agcaccggct aactccgtgc cagcagccgc ggtaatacgg agggtgcgag
541 cgttattcgg aattactggg cgtaaagcgc acgcaggcgg ttgcccaagt cagatgtgaa
601 agccccgggc ttaacctggg aactgcattt gaaactgggc gactagagta tgaaagagga
661 aagcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga
721 aggcagcttt ctgggtcgat actgacgctc atgtgcgaaa gcgtggggag caaacaggat
781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggcgtcgggt tgttaaagac
841 tcggtgcngg agctaacgca ttaagttgac cgcctgggga gtacggccgc aaggttgaaa
901 ctcaaagaaa ttgacgggga cccgcncaag cggtggagca tgtggtttaa ttcgatgcaa
961 cgcgaagaac cttaccaggc cttgacatcc taggaacttg gcagagatgc cttggtgcct
1021 tcgggaacct agagacaggt gttgcatggc tgtcgtcagc tcgtgtcgtg agatgttggg
1081 ttaagtcccg caacgagcgc aacccttatc cttacttgcc agcactttgg gtgggaactg
1141 taaggagact gccggtgaca aaccggagga aggtggggat gacgtcaagt catcatggcc
1201 cttacggcct gggctacaca cgtgctacaa tggtcggtac agacggtagc gaagccgcga
1261 ggtggagcca atctgagaaa gccgatcgta gtccggattg cagtctgcaa ctcgactgca
1321 tgaagtcgga atcgctagta atcgcgaatc agcatgtcgc ggtgaatacg ttcccgggtc
1381 ttgtacacac tgcccgtcac accatgggag tttgttgcac cagaagcagg tagcttaact
1441 taggagggcg cttgccacgg tgtggccgat
//
LOCUS GLARGBA 763 bp ds-DNA INV 21-JUL-1990
DEFINITION G.lamblia 5.8S rRNA gene.
ACCESSION M35013
KEYWORDS 5.8S ribosomal RNA; large subunit rRNA; small subunit rRNA.
SOURCE G.lamblia DNA, clones B4 and C4.
ORGANISM Giardia lamblia
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE 1 (bases 1 to 763)
AUTHORS Edlind,T.D., Sharetzsky,C. and Cha,M.E.
TITLE Ribosomal RNA of the primitive eukaryote Giardia lamblia: Large
subunit domain I and potential processing signals
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.D.Edlind, 04-JUN-1990.
FEATURES from to/span description
rRNA 177 321 5.8S ribosomal RNA
rRNA < 1 146 small subunit ribosomal RNA
rRNA 372 > 763 large subunit ribosomal RNA
BASE COUNT 121 a 293 c 293 g 56 t
ORIGIN
1 gtacacaccg cccgtcgctc ctaccgactg ggcgcggcgg cgagcgcccc ggacgcgcga
61 agggccgcga gcccccgcgc ctggaggaag gagaagtcgt aacaaggtat ccgtaggtga
121 acctgcggat ggatccctcg cgcgccccgc gcgtcgcccc cgcggcccgg tcggcccccg
181 aacgccccgc cggcggatgc ctcggcccgg gcggcgacga agagcgcggc ggagcgcgag
241 acgcggtgcg gacccgcccg ccccgagaag caccgaccct cgaacgcagc gcgccccggc
301 gccgccgcct cggcgcccgc cgcgtgccgc gcgccgcgcc gcagagagcg ccccggggcg
361 gtcccgccgg gctgcgcggc ccgaggcggc gggggcgacg ggcggaactt aagcatatca
421 gtacgccccg gaggagaaac caaccgggat tccccgtagc ggcgagcgac gcgggaggag
481 cccgccccga aggcgcgctg tggggcgcag gcgcaggccc gccgcgaggg ggcccgaggg
541 ccccgcccga gagggtgcaa gccccgtacg gcggccggcg cctgcgcgcg agtagcgctg
601 cttgagcgtg cagcgcggaa gggaggcgcg gcccttccaa ggctaaatac gccccgggac
661 cgatagcgga ccaagtagcg cgagcgaacg gtgaaaagga cgcctgcggc cgctcaaaag
721 acctgaaccg ccgccgcgcc gccggccccg tctcgaaaca cgg
//
LOCUS HEIRRDA 1472 bp ss-rRNA RNA 21-JUL-1990
DEFINITION H.influenzae 16S ribosomal RNA.
ACCESSION M35019
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE H.influenzae (strain ATCC 33391) rRNA.
ORGANISM Haemophilus influenzae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 1472)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1472 16S ribosomal RNA
BASE COUNT 376 a 300 c 459 g 311 t 26 others
ORIGIN
1 attgaagagt ttgatcatgg ctcagattga acgctggcgg caggcttaac acatgcaagt
61 cgaacggtag caggagaaag cttgctttct tgctgacgag tggcggacgg gtgagtaatg
121 cttgggaatc tggcttatgg agggggataa cgacgggaaa ctgtcgctaa taccgcgtat
181 tatcggaaga tgaaagtgcg ggactgagag gccgcatgcc ataggangag cccaagtggg
241 attaggtagt tggtggggta aatgcctacc aagcctncga tctctagctg gtctgagang
301 atgaccagcc acactggaac tgagacacgg tccagactcc tacgggaggc agcagtgggg
361 aatattgcgc natgggggga accctgacgc agccatgccg cgtgaatgaa gaaggccttc
421 gggttgtaaa gttctttcgg tattgaggaa ggttgatgtg ttaatagcac atcaaattga
481 cgttaaatac agaagaagca ccggctnact ccgtgccagc agccgcggta atacggaggg
541 tgcgagcgtt aatcggaata actgggcgta aagggcacgc aggcggttat ttaagtgagg
601 tgtgaaagcc ccgggcttaa cctgggnatt gcatttcaga ctgggtaact agagtacttt
661 agggaggggt agaattccac gtgtagcggt gaaatgcgta gagatgtgga ggaataccga
721 aggcgaaggc agccccttgg gaatgtactg acgctcatgt gcgaaagcgt ggggagcaaa
781 caggattaga taccctggta gtccacgctg taaacgctgt cgatttnggg gttggggttn
841 aactctggca cccgtagcta acgtgataaa tcgaccgcct ggggagtacg gccgcaaggt
901 taaaactcaa atgaattgac gggggccngc ncaannggtg gagcatgtgg tttaattcga
961 tgnaacgcga agaaccttac ctactcttga catcctaaga agagctcaga gatgagcttg
1021 tgccttcggg aacttagaga caggtgctgc atggctgtcg tcagctcgtg ttgtgaaatg
1081 ttgggttnag tcccgcaacg agcgcaaccc ttatcctttg ttgccagcga cttggtcggg
1141 aactcaaagg agactgccag tgataaactg gaggaaggtn gggatgacgt caagtcatca
1201 tggcccttac gagtagggct acacacgtgc tacaatggcg tatacagagg gaagcgaagc
1261 tgcgaggtgg agcgaatctc ataaagtacg tctaagtccg gattggagtc tgcaactcga
1321 ctccatgaag tcggaatcgc tagtaatcgc gaatcagaat gtcgcggtga atacgttccc
1381 gggccttgta cacaccgccc gtcacaccnt gggagtgggt tgtaccagaa gtagatagct
1441 naacctnnnn gagggcgtnn ncnacggtan ga
//
LOCUS HUMIBSUB 3110 bp ss-mRNA PRI 21-JUL-1990
DEFINITION Human integrin beta-5 subunit mRNA, complete cds.
ACCESSION M35011
KEYWORDS integrin.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3110)
AUTHORS Suzuki,S., Huang Z,-S. and Tanihara,H.
TITLE Cloning of a new integrin beta subunit exhibiting high homology
with integrin beta-3 subunit
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Suzuki, 05-JUN-1990.
FEATURES from to/span description
pept 30 2420 integrin beta-5 subunit
BASE COUNT 726 a 809 c 887 g 688 t
ORIGIN
1 cgcgccgccg ctgagggagg cgccccacca tgccgcgggc cccggcgccg ctgtacgcct
61 gcctcctggg gctctgcgcg ctcctgcccc ggctcgcagg tctcaacata tgcactagtg
121 gaagtgccac ctcatgtgaa gaatgtctgc taatccaccc aaaatgtgcc tggtgctcca
181 aagaggactt cggaagccca cggtccatca cctctcggtg tgatctgagg gcaaaccttg
241 tcaaaaatgg ctgtggaggt gagatagaga gcccagccag cagcttccat gtcctgagga
301 gcctgcccct cagcagcaag ggttcgggct ctgcaggctg ggacgtcatt cagatgacac
361 cacaggagat tgccgtgaac ctccggcccg gtgacaagac caccttccag ctacaggttc
421 gccaggtgga ggactatcct gtggacctgt actacctgat ggacctctcc ctgtccatga
481 aggatgactt ggacaatatc cggagcctgg gcaccaaact cgcggaggag atgaggaagc
541 tcaccagcaa cttccggttg ggatttgggt cttttgttga taaggacatc tctcctttct
601 cctacgcggc accgaggtac cagaccaatc cgtgcattgg ttacaagttg tttccaaatt
661 gcgtcccctc ctttgggttc cgccatctgc tgcctctcac agacagagtg gacagcttca
721 atgaggaagt tcggaaacag agggtgtccc ggaaccgaga tgcccctgag gggggctttg
781 atgcagtact ccaggcagcc gtctgcaagg agaagattgg ctggcgaaag gatgcactgc
841 atttgctggt gttcacaaca gatgatgtgc cccacatcgc attggatgga aaattgggag
901 gcctggtgca gccacacgat ggccagtgcc acctgaacga ggccaacgag tacactgcat
961 ccaaccagat ggactatcca tcccttgcct tgcttggaga gaaattggca gagaacaaca
1021 tcaacctcat ctttgcagtg acaaaaaacc attatatgct gtacaagaat tttacagccc
1081 tgatacctgg aacaacggtg gagattttag atggagactc caaaaatatt attcaactga
1141 ttattaatgc atacaatagt atccggtcta aagtggagtt gtcagtctgg gatcagcctg
1201 aggatcttaa tctcttcttt actgctacct gccaagatgg ggtatcctat cctggtcaga
1261 ggaagtgtga gggtctgaag attggggaca cggcatcttt tgaagtatca ttggaggccc
1321 gaagctgtcc cagcagacac acggagcatg tgtttgccct gcggccggtg ggattccggg
1381 acagcctgga ggtgggggtc acctacaact gcacgtgcgg ctgcagcgtg gggctggaac
1441 ccaacagcgc caggtgcaac gggagcggga cctatgtctg cggcctgtgt gagtgcagcc
1501 ccggctacct gggcaccagg tgcgagtgcc aggatgggga gaaccagagc gtgtaccaga
1561 acctgtgccg ggaggcagag ggcaagccac tgtgcagcgg gcgtggggac tgcagctgca
1621 accagtgctc ctgcttcgag agcgagtttg gcaagatcta tgggcctttc tgtgagtgcg
1681 acaacttctc ctgtgccagg aacaagggag tcctctgctc aggccatggc gagtgtcact
1741 gcggggaatg caagtgccat gcaggttaca tcggggacaa ctgtaactgc tcgacagaca
1801 tcagcacatg ccggggcaga gatggccaga tctgcagcga gcgtgggcac tgtctctgtg
1861 ggcagtgcca atgcacggag ccgggggcct ttggggagat gtgtgagaag tgccccacct
1921 gcccggatgc atgcagcacc aagagagatt gcgtcgagtg cctgctgctc cactctggga
1981 aacctgacaa ccagacctgc cacagcctat gcagggatga ggtgatcaca tgggtggaca
2041 ccatcgtgaa agatgaccag gaggctgtgc tatgtttcta caaaaccgcc aaggactgcg
2101 tcatgatgtt cacctatgtg gagctcccca gtgggaagtc caacctgacc gtcctcaggg
2161 agccagagtg tggaaacacc cccaacgcca tgaccatcct cctggctgtg gtcggtagca
2221 tcctccttgt tgggcttgca ctcctggcta tctggaagct gcttgtcacc atccacgacc
2281 ggagggagtt tgcaaagttt cagagcgagc gatccagggc ccgctatgaa atggcttcaa
2341 atccattata cagaaagcct atctccacgc acactgtgga cttcaccttc aacaaatcct
2401 acaatggcac tgtggactga tgtttccttc tccgaggggc tggagcgggg atctgatgaa
2461 aaggatcaga ctgaaacgcc ttgcacggct gctcggcttg atcacagctc cctaggtagg
2521 caccacagag aagaccttct agtgagcctg ggccaggagc ccacagtgcc tgtacaggaa
2581 ggtgcctggc catgtcacct ggctgctagg ccagagccat gccaggctgc gtccctccga
2641 gcttgggata aagcaagggg accttggcgc tctcagcttt ccctgccaca tccagcttgt
2701 tgtcccaatg aaatactgag atgctgggct gtctctccct tccaggaatg ctgggccccc
2761 agcctggcca gacaagaaga ctgtcaggaa gggtcggagt ctgtaaaacc agcatacagt
2821 ttggcttttt tcacattgat catttttata tgaaataaaa agatcctgca tttatggtgt
2881 agttctgagt cctgagactt ttctgcgtga tggctatgcc ttgcacacag gtgttggtga
2941 tggggctgtt gagatgcctg ttgaaggtac atcgtttgca aatgtgagtt tcctctcctg
3001 tccgtgtttg tttagtactt ttataatgaa aagaaacaag attgtttggg attggaagta
3061 aagattaaaa ccaaaagaat ttgtgtttgt ctgataaaaa aaaaaaaaaa
//
LOCUS KINRRDAA 1474 bp ss-rRNA RNA 21-JUL-1990
DEFINITION K.indologenes 16S ribosomal RNA.
ACCESSION M35015
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE K.indologenes (strain ATCC 25869) rRNA.
ORGANISM Kingella indologenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1474)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1474 16S ribosomal RNA
BASE COUNT 375 a 322 c 451 g 315 t 11 others
ORIGIN
1 ttgaactgaa gagtttgatt ctggctcaga ttgaacgctg gcggcatgct taacacatgc
61 aagtcgaacg agggaagcag cttgctgcng acctngtggc ggacgggtga gtaacgtata
121 ggaatctacc ttggggtggg ggataacgta tggaaacgta cgctaatacc acataacact
181 tacgagtcaa agcgggggat cttcggacct cgcgccctta gatgagccta tattggatta
241 gctagttggt rgggtaaagg cctaccaagg cgacgatcca tagctggttt gagaggatga
301 tcagccacat cgggactgag acacggcccn nactcctacg ggaggcagca gtggggaata
361 ttggncnatg ggggcaaccc tgatccagca atgccgcgtg tgtgaagaag gccttngggt
421 tgtaaagcac tttcgttagt gaagaaaggt ttatggttaa tacctgtgaa tttgacatta
481 gctaaagaag aagcaccggc taactccgtg ccagcagccg cggtaatacg gagggtgcna
541 gcgttattcg gaatgactgg gcgtaaagcg cacgcaggcg gttatttaag tcagatgtga
601 aagccccggg cttaacctgg gnattgcatt tgaaactggg taactagagt atgtgagagg
661 aaggcggaat ttccagtgta gcagtgaaat gcgtagagat tggaaggaac accgatggcg
721 aaggcagcct tctggggcaa tactgacgct catgtgcgaa agcgtgggta gcaaacagga
781 ttagataccc tggtagtcca cgccctaaac gatgtcaact aggtgttggg cacttaagtg
841 ctcggtaccg cagctaacgc attaagttga ccgcctgggg agtacggccg caaggttgaa
901 actcaaagaa attgacgggg acccgcacaa gcggtggagc atgtggttta attcgatgca
961 acgcgaagaa ccttaccagg ccttgacatc cagagaagtt accagagatg gtttcgtgcc
1021 ttcgggaact ctgagacagg tgttgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg
1081 gttaagtccc gcaacgagcg caacccttat ccttacttgc cagcacttcg ggtgggaact
1141 ataaggagac tgccggtgac aaaccggagg aaggtgggga tgacgtcaag tcatcatggc
1201 ccttacggcc tgggctacac acgtgctaca atggtcggta cagacggttg ccaacccgcg
1261 agggggagct aatctgagaa agccgatcgt agtccggatt gcactctgca actcgagtgc
1321 atgaagtcgg aatcgctagt aatcgcgaat cagaacgtcg cggtgaatac gttcccgggt
1381 cttgtacaca ctgcccgtca caccatggga gtgggttgca ccagaagtag ctagcttatc
1441 agggcggtta ccacggtgtg attcangact aggg
//
LOCUS MUSTNFBG 3219 bp ds-DNA ROD 21-JUL-1990
DEFINITION Mouse tumor necrosis factor-beta (lymphotoxin) gene.
ACCESSION Y00137
KEYWORDS lymphotoxin; signal peptide; tumor necrosis factor.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3219)
AUTHORS Gray,P.W., Chen,E., Tang,W.-L. and Ruddle,N.
TITLE The murine tumor necrosis factor-beta (lymphotoxin) gene sequence
JOURNAL Nucleic Acids Res. 15, 3937-3937 (1987)
STANDARD simple automatic
REFERENCE 2 (sites for [1])
AUTHORS Fashena,S.J., Tang,W.-L., Sarr,T. and Ruddle,N.H.
TITLE The murine lymphotoxin gene promoter: Characterization and negative
regulation
JOURNAL J. Immunol. 145, 177-183 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1632 1727 lymphotoxin precursor, exon 2 (first expressed
exon)
1811 1910 lymphotoxin precursor, exon 3
2135 2547 lymphotoxin precursor, exon 4
sigp 1632 1727 lymphotoxin signal peptide
1811 1813 lymphotoxin signal peptide
matp 1814 1910 lymphotoxin
2135 2547 lymphotoxin
pre-msg 685 3121 lymphotoxin mRNA and introns (minor alt.)
pre-msg 857 3121 lymphotoxin mRNA and introns (minor alt.)
pre-msg 865 3121 lymphotoxin mRNA and introns (minor alt.)
pre-msg 1114 3121 lymphotoxin mRNA and introns (minor alt.)
pre-msg 1179 3121 lymphotoxin mRNA and introns (major alt.)
IVS 1286 1622 put. intron I
IVS 1728 1810 intron II
IVS 1911 2134 intron III
signal 1153 1158 put. TATA-box
BASE COUNT 724 a 971 c 744 g 780 t
ORIGIN
1 tgaaagctcc ctctgtacag agcattggaa gcctggggtg tacatttggg gttacatgat
61 cttggggttc taagagaata cccccaaatc atcttccaga cctggaacat tctaggacag
121 ggttctcaac cttcctaact ccatgaccct ttaatacagt tcctcatgtt gtggtgaccc
181 caaccataca attattttcg ttgctatttc ataactgtaa tttcgctgct attatgaaca
241 taatgtaaat atttgtttta aatagaggtt tgccaaagag accttgccac aggttgagac
301 tgccgctcca gagagtaagg gacacattaa aattgttaca caccagatcc cccaaatttg
361 gggagagggc actgtaatgg aacttcttga cattaaactg gcagataaac tggcagaaaa
421 aaaaaaaaaa aagctgggca gtggtggcac acacctttaa tcccagcact tgggaggcag
481 aggcaggcgg atttctgagt tctaggccag cctggtcgac agagtgagtt tcaggacagc
541 cagggctaca cagagaaacc ctgtctcgaa aaaagcaaaa aaaaaaaaaa aaaactggca
601 gatgaccaga aaatacagat atattggaat aactgtgact tgaaccccca aagacaagag
661 aggaaatagg cctgaagggg cggcaggcat gtcaagcatc cagagccctg ggttcgaacc
721 tgaaaaaaca aaggtgccgc taaccacatg tggcttcgga gccctccaga catgaccatg
781 atcgacagag agggaaatgt gcagagaagc ctgtgagcag tcaagggtgc agaagtgata
841 taaaccatca ctcttcaggg aaccaggctt ccagtcacag cccagctgta ccctctccac
901 gaattgctcg gccgttcact ggaactcctg ggcctgaccc agctccctgc tagtccctgc
961 ggcccacagt tccccggacc cgactccctt tcccagaacg cagtagtcta agcccttagc
1021 ctgcggttct ctcctaggcc ccagcctttc ctgccttcga ctgaaacagc agcatcttct
1081 aagcctgggg cttccccaag ccccagcccc gacctagaac ccgcccgctg cctgccacac
1141 tgccgcttcc tctataaagg gacccgacgc cagcgcccag gaccccgcac agcaggtgag
1201 cctctcctac cctgtctcct tgggcttacc ctggtatcag gcatccctca ggatccccag
1261 ccttaatggg tctggtcctc ctgtcgtggc tttgattttt ggtctgttcc tgtggcggcc
1321 ttatcagtct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct
1381 ctctctctct ctctttctct ctctctgcct ctgttagcca ttgtctgttt ctatggtgga
1441 gctttcctct tcccctctgt ctctccttat ccctgctcac ttcagggttc ccctgcctgt
1501 ccccttttct gtctgtcgcc ctgtctctca gggtggctgt ctcagctggg aggtaaggtc
1561 tgtcttcctc tgtgtgcccc gcctccgcta cacacacaca ctctctctct ctctctcagc
1621 aggttctcca catgacactg ctcggccgtc tccacctctt gagggtgctt ggcacccctc
1681 ctgtcttcct cctggggctg ctgctggccc tgcctctagg ggcccaggtg aggcagcaag
1741 agattggggg tgctggggtg gcctagctaa ctcagagtcc tagagtcctc tccactctct
1801 tctgtcccag ggactctctg gtgtccgctt ctccgctgcc aggacagccc atccactccc
1861 tcagaagcac ttgacccatg gcatcctgaa acctgctgct caccttgttg gtaaacttct
1921 gcctccagag gagaggtcca gtccctgcct tttgtcctac ttgcccaggg gcccaggcga
1981 tcttcccatc tccccacacc aacttttctt accctaaggg caggcacccc actcccaatc
2041 tccctaccaa ccatcccact tgtccagtgc ctgctcctca gggatgggga cctctgatct
2101 tgatagcccc ccaatgtctt gtgcctcttc ccagggtacc ccagcaagca gaactcactg
2161 ctctggagag caagcacgga tcgtgccttt ctccgacatg gcttctcttt gagcaacaac
2221 tccctcctga tccccaccag tggcctctac tttgtctact cccaggtggt tttctctgga
2281 gaaagctgct cccccagggc cattcccact cccatctacc tggcacacga ggtccagctc
2341 ttttcctccc aatacccctt ccatgtgcct ctcctcagtg cgcagaagtc tgtgtatccg
2401 ggacttcaag gaccgtgggt gcgctcaatg taccaggggg ctgtgttcct gctcagtaag
2461 ggagaccagc tgtccaccca caccgacggc atctcccatc tacacttcag ccccagcagt
2521 gtattctttg gagcctttgc actgtagatt ctaaagaaac ccaagaattg gattccaggc
2581 ctccatcctg accgttgttt caagggtcac atccccacag tctccagcct tccccactaa
2641 aataacctgg agctctcacg ggagtctgag acacttcagg ggactacatc ttccccaggg
2701 ccactccaga tgctcagggg acgactcaag cctacctaga agttcctgca cagagcaggg
2761 tttttgtggg tctaggtcgg acagagacct ggacatgaag gagggacaga catgggagag
2821 gtggctggga acaggggaag gttgactatt tatggagaga aaagttaagt tatttattta
2881 tagagaatag aaagagggga aaaatagaaa gccgtcagat gacaactagg tcccagacac
2941 aaaggtgtct cacctcagac aggacccatc taagagagag atggcgagag aattagatgt
3001 gggtgaccaa ggggttctag aagaaagcac gaagctctaa aagccagcca ctgcttggct
3061 agacatccac agggaccccc tgcaccatct gtgaaaccca ataaacctct tttctctgag
3121 attctgtctg cttgtgtctg tcttgcgttg ggggagaaac ttcctggtct ctttaaggag
3181 tggagcaggg gacagaggcc tcagttggcc atgggatcc
//
LOCUS NGORRDAA 1478 bp ss-rRNA RNA 21-JUL-1990
DEFINITION N.denitrificans 16S ribosomal RNA.
ACCESSION M35020
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE N.denitrificans (strain ATCC 14686) rRNA.
ORGANISM Neisseria denitrificans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1478)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1478 16S ribosomal RNA
BASE COUNT 380 a 331 c 448 g 308 t 11 others
ORIGIN
1 aacataagag tttgatcctg gctcagattg aacgctggcg gcatgcttta cacatgcaag
61 tcgaacggca gcggagaagt gcttgcactt ctgccggcga gtggcgaacg ggtgagtaac
121 atattggaac gtaccgagta atgggggata actaatcgaa agattagcta ataccgcata
181 cgtcctgagg gagaaagcgg gggatcttcg gacctcgcgt tattcgagcg gccaatatct
241 gattagctag ttggtggggt aaaggcctac caaggcgacg atcagtagcg ggtctgagag
301 gatgatccgc cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg
361 gaattttgga cgatgggcgc aagcctgatc cagccatgcc gcgtgtatga agaaggcctt
421 cgggttgtaa agtacttttg tnngggaaga aaagggattt gctaataccg aatcctcatg
481 acggtacctn aagaataagc accggctaac tacgtgccag nanccgcggn natacgtagg
541 gtgcgagcgt taatcggaat tactgggcgt aaagcgagcg cagacggtta cttaagcagg
601 atgtgaaatc cccgagctca acttgggaac tgcgttctga actgggtagc tagagtatgt
661 cngagggggg tagaattcca cgtgtagcag tgaaatgcgt agagatgtgg aggaataccg
721 atggcgaagg cagccccctg ggataatact gacgttcatg ctcgaaagcg tgggtagcaa
781 acaggattag ataccctggt agtccacgcc ctaaacgatg tcgattagct gttggggtac
841 ttgatacctt agtagcgtag ctaacgcgtg aaattgaccg cctggggagt acggtcgcaa
901 gattaaaact caaaggaatt gacggggacc cgcacaagcg gtggatgatg tggattaatt
961 cgatgcaacg cgaagaacct tacctggtct tgacatgtac ggaatctccc agagacggga
1021 gngtgccttc gggagccgta acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag
1081 atgttgggtt aagtcccgca acgagcgcaa cccttgtcat tagttgccat cattcagttg
1141 ggcactctaa tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtcaagtcct
1201 catggccctt atgaccaggg cttcacacgt catacaatgg tcggtacaga gggtagccaa
1261 gccgcgaggt ggagccaatc tcacaaaacc gatcgtagtc cggattgcac tctgcaactc
1321 gagtgcatga agtcggaatc gctagtaatc gcaggtcagc atactgcggt gaatacgttc
1381 ccgggtcttg tacacaccgc ccgtcacacc ntgggagtgg gggataccag aagtaggtag
1441 ggtaaccgca aggagcccnc ttaccacggt atacttca
//
LOCUS PASRRDA 1481 bp ss-rRNA RNA 21-JUL-1990
DEFINITION P.multocida 16S ribosomal RNA.
ACCESSION M35018
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE P.multocida (strain NCTC 10322) rRNA.
ORGANISM Pasteurella multocida
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 1481)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1481 16S ribosomal RNA
BASE COUNT 374 a 305 c 459 g 321 t 22 others
ORIGIN
1 attgnagagt ttgancangg ctcagattga ncgctggcgg caggcttaac acatgcaagt
61 cgaacggtag caggaagaaa gcttnctttc tttgctgacg agtggcggac gggtgagtaa
121 tgcttgggna tctggcttat ggagggggat aactgtggga aactgcagct aataccgcgt
181 attctcttag gaggaaaggg tgggaccttn gggccacctg ccataagatg agcccaagtg
241 ggattaggta gttggtgggg taaaggccta ccaagcctnc gatctctagc tngtctgaga
301 ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg
361 ggaatattgc gcaatggggg gaaccctgac gcagccatgc cgcgtgaatg aagaaggcct
421 tcgggttgta aagttctttc ggtnatgagg aagggatgtt nttaaataga tagcatcatt
481 gacgttaatt ncagaagaag caccggctna ctccgtgcca gcagccgcgg taatacggag
541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggac ttttaagtga
601 gatgtgaaat ccccgagctt aacttgggaa ctgcatttca gactgggagt ctagagtnct
661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc
721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcnaaagc gtggggagca
781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct
841 ntntgcttgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag
901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg
961 atgnaacgcg aagaacctta cctactcttg acatcctaag aagagctcag agatgagctt
1021 gtgccttcgg gaacttagag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat
1081 gttgggttaa gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attcggtcgg
1141 gaactcaaag gagactgcca gtgannaact ggaggaaggt ggggatgacg tcaagtcatc
1201 atggccctta cgagtagggc tacacacgtg ctacaatggt gcatacagag ggcagcgaga
1261 gtgcgagctt gagcgaatct cagaaagtgc atctaagtcc ggattggagt ctgcaactcg
1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc
1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc
1441 ttaaccntcg ggggggcgtt taccacggta tgattcatga c
//
LOCUS RATSUBB 1124 bp ss-mRNA ROD 21-JUL-1990
DEFINITION Rat F-0-ATPase subunit b mRNA, complete cds.
ACCESSION M35052
KEYWORDS F-0-ATPase subunit b.
SOURCE Rat (strain Wistar) liver, cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1124)
AUTHORS Tsurumi,C., Yoshihara,Y., Osaka,F., Yamada,F., Tani,I., Higuti,T.,
Shimizu,M., Oeda,K., Ohkawa,H., Toda,H., Kakuno,T., Sakiyama,F.,
Kumatori,A., Tanaka,K. and Ichihara,A.
TITLE cDNA cloning and sequencing for the import precursor of subunit b
in H+-ATP synthase from rat mitochondria
JOURNAL Biochem. Biophys. Res. Commun. 169, 136-142 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Higuti, 04-JUN-1990.
FEATURES from to/span description
pept 60 830 F-0-ATPase subunit b precursor
sigp 60 185 F-0-ATPase subunit b signal peptide
matp 186 827 F-0-ATPase subunit b
signal 1106 1111 poly-A signal
BASE COUNT 323 a 241 c 266 g 294 t
ORIGIN
1 gagtgtctcg ggtcggggtc acagggacat cggtaccgct gtctggatct tcgcagacaa
61 tgctgtcccg ggtggtactt tctgccgccg ccacagcggc cccgtgtctg aagaacgcgg
121 ccgtcctggg tccaggggtt ttacaggcaa caagggtctt tcacacagga cagcctcgcc
181 ttgcccctct accacctctt cctgaatatg gaggaaaagt acgtcttggg ctgattcctg
241 aggaattttt ccagttcctt taccctaaga ctggtgtaac aggaccctac gtgcttggaa
301 ctgggcttag cttgtacttt ctatccaaag aaatatatgt gattacccca gagaccttct
361 ctaccatatc agtagtaggg ttgattgtct atgtgattaa gaaatatggc gcctctattg
421 gagaatttat tgacaaactt aatgaggaaa aaattgctca actagaagaa ataaagcagt
481 caagcatgaa gcaaatccag gatgcaatca acagggagaa ggcacagcag gcactggttc
541 agaagcgcca ctacctcttc gatgttcaga ggaataacat tgccctggcc ttggaggtca
601 cttaccggga acgactacat aaagcatata aggaggtaaa gaatcgcctg gactaccata
661 tttctgtaca ggacatgatg cgtcgcaagg agggagagca catgataaac tgggtggaga
721 agcatgtgat acagagcatt tctgcacagc aggaaaagga gaccattgcc aagtgcatcg
781 gagatctaaa gatgcttgca aagaaggctc aagctcagcc aattatgtga atgtgtctat
841 ctcagttgtg atagccagag agggttaact taaatgagaa ctactctact tgaagaactc
901 tttctgtatt gctgtctaat gaaattgtgg tgtacctttc ctaagaaatg atcaaggtcc
961 atttagtggc ttaaccctat ttctgatcct tgctctgtat tcggagttgt ctgatcacat
1021 ttgagtaagc aatttgcagc aactcgctgc ccaacagaaa ttaccaagtt atagtttaaa
1081 cttgtaatta gttaaaacat cttacaataa aatgtttgaa acag
//GenBank-Updates@genbank.bio.net (07/22/90)
LOCUS BORFLAG 684 bp ds-DNA BCT 22-JUL-1990
DEFINITION B.burgdorferei flagellin gene, 5' end.
ACCESSION M34710
KEYWORDS flagellin.
SOURCE B.burgdorferei (strain B31; isolate ATCC #35210) DNA.
ORGANISM Borrelia burgdorferei
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes;
Spirochaetales; Spirochaetaceae.
REFERENCE 1 (bases 201 to 684)
AUTHORS Picken,R.N.
TITLE Nucleotide sequence of the flagellin gene of Borrelia hermsii and
identification of oligonucleotides for the specific amplification
and detection of Borrelia burgdorferi flagellin sequence by the
polymerase chain reaction
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 200)
AUTHORS Picken,R.N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by R.N.Picken, 18-MAY-1990.
Author address:R.N.Picken:
Baxter Healthcare Corp., Pandex Division
909 Orchard Street
Mundelein, IL 60060
FEATURES from to/span description
pept 493 > 684 flagellin
binding 480 486 ribosome binding site
BASE COUNT 249 a 74 c 128 g 233 t
ORIGIN
1 tctagatatt taagacttga tgagaaaaag tttgatgaat caattcgtaa taatattgat
61 aatgttagag agcttttttt atatgatctt aatggtgaca gagtgtatga taatggaatt
121 gctaaaatgc taggagattg tctgtcgcct cttgtggctt ccggaggagt tatttataat
181 aaaataaaga attacgactt gaaaattttt aatcaaaaaa ataaagttga agattataaa
241 aagaagtacg aagatagaga gagaaaagtg gaaggtgaac ttaatacctt ggattttacc
301 gttaagcgca tgaaagatca agaaaataca ttaaaggctt ttgattttaa tcaaagaaat
361 aaataataat aattattttt aatgctattg ctatttgcgt ttcttttttt ttaatttttg
421 tgctattctt tttaacaggc aaaaggattt gccaaagtca gaaatttaaa ttttatcatg
481 gaggaatgat atatgattat caatcataat acatcagcta ttaatgcttc aagaaataat
541 ggcattaacg ctgctaatct tagtaaaact caagaaaagc tttctagtgg gtacagaatt
601 aatcgagctt ctgatgatgc tgctggcatg ggagtttctg gtaagattaa tgctcaaata
661 agaggtttgt cacaagcttc taga
//
LOCUS BORGFLAGA 620 bp ds-DNA BCT 22-JUL-1990
DEFINITION B.burgdorferei DNA.
ACCESSION M34711
KEYWORDS .
SOURCE B.burgdorferei (strain B31; isolate ATCC #35210) DNA.
ORGANISM Borrelia burgdorferei
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes;
Spirochaetales; Spirochaetaceae.
REFERENCE 1 (bases 1 to 620)
AUTHORS Picken,R.N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.N.Picken, 18-MAY-1990.
Author address:R.N.Picken:
Baxter Healthcare Corp., Pandex Division
909 Orchard Street
Mundelein, IL 60060
BASE COUNT 220 a 113 c 125 g 162 t
ORIGIN
1 tctagaaata cttcaaaggc tattaatttt attcagacaa cagaagggaa tttaaatgaa
61 gtagaaaaag tcttagtaag aatgaaggaa ttggcagttc aatcaggtaa cggcacatat
121 tcagatgcag acagaggttc tatacaaatt gaaatagagc aacttacaga cgaaattaat
181 agaattgctg atcaagctca atataaccaa atgcacatgt tatcaaacaa atctgcttct
241 caaaatgtaa gaacagctga agagcttgga atgcagcctg caaaaattaa cacaccagca
301 tcactttcag ggtctcaagc gtcttggact ttaagagttc atgttggagc aaaccaagat
361 gaacctattg ctgtaaatat ttatgcagct aatgttgcaa atcttttctc tggtgaggga
421 gctcaaactg ctcaggctgc accggttcaa gagggtgttc aacaggaagg agctcaacag
481 ccagcacctg ctacagcacc ttctcaaggc ggagttaatt ctcctgttaa tgttacaact
541 acagttgatg ctaatacatc acttgctaaa attgaaaatg ctattagaat gataagtgat
601 caaagggcaa atttaggtgc
//
LOCUS SSOBETA 2346 bp ds-DNA BCT 22-JUL-1990
DEFINITION S.solfataricus beta-D-galactosidase (lacS) gene, complete cds.
ACCESSION M34696
KEYWORDS beta-D-galactosidase.
SOURCE S.solfataricus (strain MT-4) DNA, clone C1.
ORGANISM Sulfolobus solfataricus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 2346)
AUTHORS Cubellis,M.V., Rozzo,C., Montecucchi,P. and Rossi,M.
TITLE Isolation and sequencing of a new beta-galactosidase
archaebacterial gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.V.Cubellis, 18-MAY-1990.
Author address:M.V.Cubellis
Dipt. Chimica Organica e Biologica
Universita di Napoli
Via Mezzocannone, 16
80134 Napoli, Italia
FEATURES from to/span description
pept 229 1698 beta-D-galactosidase (lacS) (EC 3.2.1.23)
BASE COUNT 770 a 402 c 487 g 687 t
ORIGIN
1 aaggagaaac ttggcagttt ataacttgac agtaggttgt ggagtgatga ctggatcaat
61 actaggagga gtagcatata attacgttac acaattttat aacccaatat attcaataga
121 ccttatgctt atcctatcct ctattctaag attctcggta tctcccctat tcttgaccat
181 aaaagatact cgctcaaagc ttaaataata ttaatcataa ataaagtcat gtactcattt
241 ccaaatagct ttaggtttgg ttggtcccag gccggatttc aatcagaaat gggaacacca
301 gggtcagaag atccaaatac tgactggtat aaatgggttc atgatccaga aaacatggca
361 gcgggattag taagtggaga tctaccagaa aatgggccag gctactgggg aaactataag
421 acatttcacg ataatgcaca aaaaatggga ttaaaaatag ctagactaaa tgtggaatgg
481 tctaggatat ttcctaatcc attaccaagg ccacaaaact ttgatgaatc aaaacaagat
541 gtgacagagg ttgagataaa cgaaaacgag ttaaagagac ttgacgagta cgctaataaa
601 gacgcattaa accattacag ggaaatattc aaggatctta aaagtagagg actttacttt
661 atactaaaca tgtatcattg gccattacct ctatggttac acgacccaat aagagtaaga
721 agaggagatt ttactggacc aagtggttgg ctaagtacta gaacagttta cgaattcgct
781 agattctcag cttatatagc ttggaaattc gatgatctag tggatgagta ctcaacaatg
841 aatgaaccta acgttgttgg aggtttagga tacgttggtg ttaagtccgg ttttccccca
901 ggatacctaa gctttgaact ttcccgtagg catatgtata acatcattca agctcacgca
961 agagcgtatg atgggataaa gagtgtttct aaaaaaccag ttggaattat ttacgctaat
1021 agctcattcc agccgttaac ggataaagat atggaagcgg tagagatggc tgaaaatgat
1081 aatagatggt ggttctttga tgctataata agaggtgaga tcaccagagg aaacgagaag
1141 attgtaagag atgacctaaa gggtagattg gattggattg gagttaatta ttacactagg
1201 actgttgtga agaggactga aaagggatac gttagcttag gaggttacgg tcacggatgt
1261 gagaggaatt ctgtaagttt agcgggatta ccaaccagcg acttcggctg ggagttcttc
1321 ccagaaggtt tatatgacgt tttgacgaaa tactggaata gatatcatct ctatatgtac
1381 gttactgaaa atggtattgc ggatgatgcc gattatcaaa ggccctatta tttagtatct
1441 cacgtttatc aagttcatag agcaataaat agtggtgcag atgttagagg gtatttacat
1501 tggtctctag ctgataatta cgaatgggct tcaggattct ctatgaggtt tggtctgtta
1561 aaggtcgatt acaacactaa gagactatac tggagaccct cagcactagt atatagggaa
1621 atcgccacaa atggcgcaat aactgatgaa atagagcact taaatagcgt acctccagta
1681 aagccattaa ggcactaaac tttctcaagt ctcactatac caaatgagtt ttcttttaat
1741 cttattctaa tctcattttc attagattgc aatactttca taccttctat attatttatt
1801 ttgtaccttt tgggatctac acttaatgtt agcctaattg gaaagtcatt tagatttaat
1861 actgttacca gtccatccct tttaattatt aatgaaaata agaagggata agtagcgata
1921 gcccttattc cgatatggtc tccaacaata tcccttatta tctgccttgc aacactaggg
1981 tagaactctg aaatcagata tggtaggtaa gttgtaagtg ataggacgta aactttagag
2041 ttagagtaag tgttctgaaa gactactggg tgcaattcga caccgttata ggcgtaaagg
2101 attggcgtag ctccgtttaa tgaaaatata ggtcctacag ggaaattggc ttgcctcttg
2161 taatatgacc aatagaacgt tttcccatcc ctggttaacg cattgacact aacactatcg
2221 taaatcaagt taccgacacc aagaattttc agtgcagtat cccccaagac ttcaataagc
2281 tttttagctg cacttgctgt aaacattaag ttaactcccc tattaagtaa atccacaata
2341 tctaga
//GenBank-Updates@genbank.bio.net (07/24/90)
LOCUS HUMHPRTB 56736 bp ds-DNA PRI 24-JUL-1990
DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete
cds.
ACCESSION M26434
KEYWORDS Alu repetitive element; LINE repetitive element;
hypoxanthine phosphoribosyltransferase.
SOURCE Human cell line 4X,Y (GM1202), fetal liver, and beta thalassemia
patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 56736)
AUTHORS Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J.,
Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W.
TITLE Automated DNA sequencing of the human HPRT locus
JOURNAL Genomics 6, 593-608 (1990)
STANDARD full staff_entry
REFERENCE 2 (sites; mutations causing gout)
AUTHORS Argos,P., Hanei,M., Wilson,J.M. and Kelley,W.N.
TITLE A possible nucleotide-binding domain in the tertiary fold of
phosphoribosyltransferase
JOURNAL J. Biol. Chem. 258, 6450-6457 (1983)
STANDARD full staff_entry
REFERENCE 3 (sites; mutations causing gout)
AUTHORS Cariello,N.F., Scott,J.K., Kat,A.G., Thilly,W.G. and Keohavong,P.
TITLE Resolution of a missense mutant in human genomic DNA by denaturing
gradient gel electrophoresis and direct sequencing using in vitro
DNA amplification: HPRT-Munich
JOURNAL Am. J. Hum. Genet. 42, 726-734 (1988)
STANDARD full staff_entry
REFERENCE 4 (sites; mutations causing gout)
AUTHORS Davidson,B.L., Chin,S.J., Wilson,J.M., Kelley,W.N. and Palella,T.D.
TITLE Hypoxanthine-guanine phosphoribosyltransferase. Genetic evidence
for identical mutations in two partially deficient subjects
JOURNAL J. Clin. Invest. 82, 2164-2167 (1988)
STANDARD full staff_entry
REFERENCE 5 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Davidson,B.L., Palella,T.D. and Kelly,W.N.
TITLE Human hypoxanthine-guanine phosphoribosyltransferase: A single
nucleotide substitution in cDNA clones isolated from a patient with
Lesch-Nyhan syndrome (HPRT-Midland)
JOURNAL Gene 68, 85-91 (1988)
STANDARD full staff_entry
REFERENCE 6 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D.
TITLE Genetic basis of hypoxanthine guanine phosphoribosyltransferase
deficiency in a patient with Lesch-Nyhan syndrome (HPRT-Flint)
JOURNAL Gene 63, 331-336 (1988)
STANDARD full staff_entry
REFERENCE 7 (sites; mutations causing Lesch-Nyhan syndrome and gout)
AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D.
TITLE Human hypoxanthine-guanine phosphoribosyltransferase deficiency.
The molecular defect in a patient with gout (HPRT-Ashville)
JOURNAL J. Biol. Chem. 264, 520-525 (1989)
STANDARD full staff_entry
REFERENCE 8 (sites; mutations causing gout)
AUTHORS Davidson,B.L., Tarle,S.A., Palella,T.D. and Kelley,W.N.
TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
deficiency in ten subjects determined by direct sequencing of
amplified transcripts
JOURNAL J. Clin. Invest. 84, 342-346 (1989)
STANDARD full staff_entry
REFERENCE 9 (sites; mutations causing gout)
AUTHORS Dush,M.K., Sikela,J.M., Khan,S.A., Tischfield,J.A. and
Stambrook,P.J.
TITLE Nucleotide sequence and organization of the mouse adenine
phosphoribosyltransferase gene: Presence of a coding region common
to animal and bacterial phosphoribosyltransferases that has a
variable intron/exon arrangement
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2731-2735 (1985)
STANDARD full staff_entry
REFERENCE 10 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Fujimori,S., Davidson,B.L., Kelley,W.N. and Palella,T.D.
TITLE Identification of a single nucleotide change in the
hypoxanthine-guanine phosphoribosyltransferase gene (HPRT-Yale)
responsible for Lesch-Nyhan syndrome
JOURNAL J. Clin. Invest. 83, 11-13 (1989)
STANDARD full staff_entry
REFERENCE 11 (sites; mutations causing gout)
AUTHORS Fujimori,S., Hidaka,Y., Davidson,B.L., Palella,T.D. and Kelley,W.N.
TITLE Identification of a single nucleotide change in a mutant gene for
hypoxanthene-guanine phosphoribosyltransferase (HPRT-Ann Arbor)
JOURNAL Hum. Genet. 79, 39-43 (1988)
STANDARD full staff_entry
REFERENCE 12 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Gibbs,R.A., Nguyen,P.N., Edwards,A., Civitello,A.B. and Caskey,C.T.
TITLE Multiplex DNA deletion detection and exon sequencing of the
hypoxanthine phosphoribosyltransferase gene in Lesch-Nyhan families
JOURNAL Genomics 7, 235-244 (1990)
STANDARD full staff_entry
REFERENCE 13 (sites; mutations causing Lesch-Nyhan syndrome and gout)
AUTHORS Gibbs,R.A., Nguyen,P.N., McBride,L.J., Koepf,S.M. and Caskey,C.T.
TITLE Identification of mutations leading to the Lesch-Nyhan syndrome by
automated direct DNA sequencing of in vitro amplified cDNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1919-1923 (1989)
STANDARD full staff_entry
REFERENCE 14 (sites; mutations causing gout)
AUTHORS Hershey,H.V. and Taylor,M.W.
TITLE Nucleotide sequence and deduced amino acid sequence of Escherichia
coli adenine phosphoribosyltransferase and comparison with other
analogous enzymes
JOURNAL Gene 43, 287-293 (1986)
STANDARD full staff_entry
REFERENCE 15 (sites; conserved domain)
AUTHORS King,A. and Melton,D.W.
TITLE Characterisation of cDNA clones for hypoxanthine-guanine
phosphoribosyltransferase from the human malarial parasite,
Plasmodium falciparum: Comparisons to the mammalian gene and
protein
JOURNAL Nucleic Acids Res. 15, 10469-10481 (1987)
STANDARD full staff_entry
REFERENCE 16 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Ogasawara,N., Stout,J.T., Goto,H., Sonta,S.I., Matsumoto,A. and
Caskey,C.T.
TITLE Molecular analysis of a female Lesch-Nyhan patient
JOURNAL J. Clin. Invest. 84, 1024-1027 (1989)
STANDARD full staff_entry
REFERENCE 17 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Stout,J.T. and Caskey,C.T.
TITLE HPRT: Gene structure, expression, and mutation
JOURNAL Annu. Rev. Genet. 19, 127-148 (1985)
STANDARD full staff_entry
REFERENCE 18 (sites;
AUTHORS Wilson,J.M. and Kelley,W.N.
TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
deficiency in a patient with the Lesch-Nyhan syndrome
JOURNAL J. Clin. Invest. 71, 1331-1335 (1983)
STANDARD full staff_entry
REFERENCE 19 (sites; mutations causing Lesch-Nyhan syndrome and gout)
AUTHORS Wilson,J.M. and Kelley,W.N.
TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Structural
alteration in a dysfunctional enzyme variant (HPRT-Munich) isolated
from a patient with gout
JOURNAL J. Biol. Chem. 259, 27-30 (1984)
STANDARD full staff_entry
REFERENCE 20 (sites; mutations causing gout)
AUTHORS Wilson,J.M., Kobayashi,R., Fox,I.H. and Kelley,W.N.
TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Molecular
abnormality in a mutant form of the enzyme (HPRT-Toronto)
JOURNAL J. Biol. Chem. 258, 6458-6460 (1983)
STANDARD full staff_entry
REFERENCE 21 (sites; mutations causing gout)
AUTHORS Wilson,J.M., Tarr,G.E. and Kelley,W.N.
TITLE Human hypoxanthine (guanine) phosphoribosyltransferase: An amino
acid substitution in a mutant form of the enzyme isolated from a
patient with gout
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 870-873 (1983)
STANDARD full staff_entry
REFERENCE 22 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Yang,T.P., Patel,P.I., Chinault,A.C., Stout,J.T., Jackson,L.G.,
Hildebrand,B.M. and Caskey,C.T.
TITLE Molecular evidence for new mutation at the hprt locus in
Lesch-Nyhan patients
JOURNAL Nature 310, 412-414 (1984)
STANDARD full staff_entry
REFERENCE 23 (sites; mutant reversion)
AUTHORS Yang,T.P., Stout,J.T., Konecki,D.S., Patel,P.I., Alford,R.L. and
Caskey,C.T.
TITLE Spontaneous reversion of novel Lesch-Nyhan mutation by HPRT gene
rearrangement
JOURNAL Somat. Cell Mol. Genet. 14, 293-303 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Edwards, 26-JUL-1989.
Mutant Description
RJK 1780 missing intron 1 (partial and exon 2 [12]
RJK 849 missing intron 3 (partial) and exons 4-9 (no mRNA)
[22]
RJK 984 missing intron 5 (partial) and exons 6-9 (no mRNA)
[17]
[22]
GM 3467 missing intron 8 (partial) and exon 9 (no mRNA) [12]
[22]
RJK 853 complete gene deletion [12] [22] [16]
GM 2227 inversion of exons 6-9 (no mRNA) [23]
GM 1662 and GM 6804 duplication of exons 2 and 3 and elongated
mRNA [22] [23]
Connersville missing intron 7 (partial and exon 8 [7]
FEATURES from to/span description
pept 1677 1703 hypoxanthine phosphoribosyltransferase, exon 1
/nomgen="HPRT" /map="Xq26"
/hgml_locus_uid="LE0031K"
14780 14886 hypoxanthine phosphoribosyltransferase, exon 2
16603 16786 hypoxanthine phosphoribosyltransferase, exon 3
27891 27956 hypoxanthine phosphoribosyltransferase, exon 4
31617 31634 hypoxanthine phosphoribosyltransferase, exon 5
34937 35019 hypoxanthine phosphoribosyltransferase, exon 6
39815 39861 hypoxanthine phosphoribosyltransferase, exon 7
40033 40109 hypoxanthine phosphoribosyltransferase, exon 8
41454 41501 hypoxanthine phosphoribosyltransferase, exon 9
IVS 1704 14779 HPRT intron A
IVS 14887 16602 HPRT intron B
IVS 16787 27890 HPRT intron C
IVS 27957 31616 HPRT intron D
IVS 31635 34936 HPRT intron E
IVS 35020 39814 HPRT intron F
IVS 39862 40032 HPRT intron G
IVS 40110 41453 HPRT intron H
rpt 258 41 (c) Alu repeat copy A
rpt 1012 694 (c) Alu repeat copy B
rpt 3532 3342 (c) Alu repeat copy C
rpt 3982 3343 (c) Alu repeat copy D
rpt 6232 5937 (c) Alu repeat copy E
rpt 6524 6288 (c) Alu repeat copy F
rpt 7504 7807 Alu repeat copy G
rpt 8713 8408 (c) Alu repeat copy H
rpt 9477 9169 (c) Alu repeat copy I
rpt 9699 10142 Alu repeat copy J
rpt 10407 10293 (c) Alu repeat copy K
rpt 12154 12310 Alu repeat copy L
rpt 13302 12998 (c) Alu repeat copy M
rpt 13638 13314 (c) Alu repeat copy N
rpt 14238 13923 (c) Alu repeat copy O
rpt 14610 14239 (c) Alu repeat copy P
rpt 16090 15843 (c) Alu repeat copy Q
rpt 16518 16293 (c) Alu repeat copy R
rpt 18783 18439 (c) Alu repeat copy S
rpt 18962 19084 Alu repeat copy T
rpt 19397 19104 (c) Alu repeat copy U
rpt 20064 19778 (c) Alu repeat copy V
rpt 20486 20636 Alu repeat copy W
rpt 20990 20696 (c) Alu repeat copy X
rpt 21293 20996 (c) Alu repeat copy Y
rpt 22171 21878 (c) Alu repeat copy Z
rpt 23772 23460 (c) Alu repeat copy AA
rpt 25310 25426 Alu repeat copy AB
rpt 26388 26279 (c) Alu repeat copy AC
rpt 29863 29560 (c) Alu repeat copy AD
rpt 33618 33148 (c) Alu repeat copy AE
rpt 36575 36645 Alu repeat copy AF
rpt 38613 39039 Alu repeat copy AG
rpt 39091 39388 Alu repeat copy AH
rpt 40622 40925 Alu repeat copy AI
rpt 40952 41262 Alu repeat copy AJ
rpt 42640 42936 Alu repeat copy AK
rpt 44776 44486 (c) Alu repeat copy AL
rpt 46627 46305 (c) Alu repeat copy AM
rpt 47520 47763 Alu repeat copy AN
rpt 49257 49551 Alu repeat copy AO
rpt 49705 49993 Alu repeat copy AP
rpt 50002 50312 Alu repeat copy AQ
rpt 51066 50774 (c) Alu repeat copy AR
rpt 52276 51978 (c) Alu repeat copy AS
rpt 52407 52674 Alu repeat copy AT
rpt 54653 54551 (c) Alu repeat copy AU
rpt 55465 55164 (c) Alu repeat copy AV
rpt 56523 56689 Alu repeat copy AW
rpt 11312 11254 (c) LINE repeat copy A (partial)
rpt 18284 18242 (c) LINE repeat copy B (partial)
rpt 27091 26843 (c) LINE repeat copy C (partial)
rpt 37558 37300 (c) LINE repeat copy D (partial)
rpt 39451 39427 (c) LINE repeat copy E (partial)
rpt 51317 51100 (c) LINE repeat copy F (partial)
rpt 261 321 short interspersed repeat copy A
rpt 1013 1054 short interspersed repeat copy B
rpt 2902 2869 (c) short interspersed repeat copy C
rpt 3078 3107 short interspersed repeat copy D
rpt 4228 4254 short interspersed repeat copy E
rpt 4463 4494 short interspersed repeat copy F
rpt 21582 21635 short interspersed repeat copy G
rpt 28430 28394 (c) short interspersed repeat copy H
rpt 28923 28981 short interspersed repeat copy I
rpt 35562 35604 short interspersed repeat copy J
rpt 44042 44108 short interspersed repeat copy K
rpt 46628 46688 short interspersed repeat copy L
rpt 51715 51687 (c) short interspersed repeat copy M
unsure 17479 17479 polymorphism or cloning error
unsure 24903 24903 polymorphism or cloning error
mut 1664 1678 ggccggctccgttat in wt; gt in gout RJK 951 [13]
mut 14778 14778 a in wt; t in Lesch-Nyhan RJK 1760, no exon 2
[12]
mut 14808 14809 tt in wt; ttt in Lesch-Nyhan Chicago [7]
mut 14874 14874 t in wt; c in Lesch-Nyhan Detroit Leu->Pro [7]
mut 14877 14879 tta in wt; ta in Lesch-Nyhan RJK 1939 [12]
mut 14886 14886 g in wt; a in Lesch-Nyhan RJK 2163 Arg->Lys
[12]
mut 16619 16619 c in wt; g in gout Toronto Arg->Gly [20]
mut 16623 16664 atgtgatgaaggagatgggaggccatcacattgtagccctct in
wt; at in Lesch-Nyhan RJK 2108 [12]
mut 16675 16676 ag in wt; agg in Lesch-Nyhan RJK 866 [13]
mut 16677 16677 g in wt; a in Lesch-Nyhan New Haven Gly->Glu
[7]
mut 16679 16679 g in wt; c in Lesch-Nyhan Yale Gly->Arg [10]
mut 16690 16690 c in wt; a in Lesch-Nyhan Flint RJK 892
Phe->Leu [5] [13]
mut 16707 16707 a in wt; t in gout Arlingen Asp->Val [7]
mut 16756 16759 tgta in wt; ta in Lesch-Nyhan RJK 1332 [13]
mut 16780 16780 c in wt; a in gout Munich Ser->Arg [3] [19]
mut 16784 16787 tgtg in wt; tg in Lesch-Nyhan RJK 1747 [12]
mut 27897 27897 c in wt; t in Lesch-Nyhan RJK 1930 Gln->TAG
[12]
mut 27911 27911 c in wt; t in gout London Ser->Leu [2] [4]
[12] [14]
mut 31621 31621 t in wt; a in Lesch-Nyhan Midland RJK 896
Val->Asp [6] [13]
mut 31622 31624 ctt in wt; ct in Lesch-Nyhan RJK 2019 [12]
mut 31624 31624 t in wt; c in Lesch-Nyhan RJK 1784 Leu->Ser
[13]
mut 31628 31628 t in wt; g in gout Ann Arbor Ile->Met [11]
mut 34962 34970 tgcag in wt; agcaaagcaa in Lesch-Nyhan RJK
1210 Met->Lys [13]
mut 35015 35015 g in wt; t in gout Milwaukee RJK 949 Ala->Ser
[7] [13]
mut 39837 39837 c in wt; t in Lesch-Nyhan RJK 974 Arg->TGA
[13]
mut 39842 39846 tgttg in wt; tg in Lesch-Nyhan RJK 1894 [12]
mut 39858 39858 g in wt; t in Lesch-Nyhan RJK 2185 Asp->Tyr
[12]
mut 39866 39866 g in wt; a in Lesch-Nyhan RJK 1934. no exon 7
[12]
mut 40034 40038 tgttg in wt; tg in Lesch-Nyhan Michigan RJK
855 [7] [13]
mut 40080 40080 g in wt; a in Lesch-Nyhan Kingston RJK 2188
Asp->Asn [12] [19]
mut 40095 40095 t in wt; g in Lesch-Nyhan New Briton RJK 950
Phe->Val [7] [13]
mut 40102 40102 a in wt; g in gout Ashville Asp->Gly [8]
mut 40114 40114 g in wt; a in Lesch-Nyhan RJK 888, GM 7092, no
exon 8 [12]
mut 41486 41509 aaaatacaaagcctaagatgagag in wt; ag in
Lesch-Nyhan Evansville RJK 894 [7] [12]
mut 41450 41452 ata in wt; ttt in Lesch-Nyhan RJK 906, GM
1899, downstream cryptic splice site used
mut 41454 41454 c in wt; g in Lesch-Nyhan RJK 1874, RJK 2079,
His->Asp [12] [13]
mut 41461 41461 g in wt; a in Lesch-Nyhan RJK 1727 Cys->Tyr
[13]
BASE COUNT 15689 a 11281 c 11599 g 18167 t
ORIGIN 1 bp upstream of EcoRI site; chromosome Xq26.
1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt
61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc
121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc
181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt
241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga
301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc
361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag
421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc
481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt
541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct
601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga
661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt
721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat
781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga
841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag
901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct
961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc
1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag
1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac
1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc
1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg
1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt
1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg
1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg
1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg
1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc
1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc
1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg
1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg
1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg
1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc
1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga
1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca
1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc
2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat
2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg
2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag
2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg
2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct
2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt
2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg
2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca
2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg
2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt
2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg
2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa
2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt
2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg
2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc
2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc
3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt
3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt
3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct
3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat
3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt
3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag
3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc
3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc
3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta
3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac
3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc
3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt
3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc
3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc
3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc
3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag
3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc
4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca
4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc
4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc
4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa
4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca
4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga
4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac
4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg
4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt
4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac
4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg
4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag
4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg
4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca
4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg
4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat
4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg
5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc
5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta
5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc
5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact
5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct
5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg
5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg
5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc
5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa
5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat
5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa
5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg
5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc
5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa
5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt
5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt
6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg
6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga
6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg
6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc
6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt
6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc
6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg
6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt
6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga
6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac
6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct
6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag
6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc
6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt
6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg
6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc
6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca
7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc
7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga
7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt
7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc
7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg
7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag
7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag
7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca
7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt
7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac
7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga
7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt
7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa
7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct
7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact
7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt
7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc
8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac
8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct
8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg
8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat
8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag
8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat
8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg
8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg
8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc
8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac
8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg
8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc
8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat
8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga
8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta
8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg
9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt
9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag
9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt
9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc
9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag
9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt
9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca
9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc
9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc
9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat
9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg
9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc
9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga
9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca
9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat
9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta
9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc
10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc
10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa
10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa
10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat
10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt
10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct
10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt
10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga
10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca
10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct
10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg
10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg
10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa
10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta
10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa
10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca
10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct
11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa
11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc
11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat
11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc
11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag
11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag
11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga
11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg
11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg
11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt
11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac
11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa
11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa
11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa
11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt
11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt
12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag
12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat
12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat
12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc
12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt
12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt
12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct
12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat
12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa
12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag
12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt
12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac
12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca
12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct
12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt
12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc
12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta
13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc
13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg
13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc
13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct
13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt
13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg
13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat
13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca
13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga
13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt
13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac
13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta
13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg
13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc
13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat
13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg
13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc
14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt
14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga
14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt
14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc
14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg
14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc
14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc
14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca
14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg
14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata
14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg
14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac
14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg
14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat
14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac
14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa
15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt
15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg
15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg
15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg
15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt
15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat
15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt
15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata
15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat
15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag
15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg
15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt
15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga
15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc
15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca
15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag
15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt
16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag
16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt
16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa
16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa
16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt
16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact
16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac
16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac
16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa
16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg
16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg
16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag
16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa
16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa
16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta
16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta
16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg
17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt
17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt
17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt
17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc
17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg
17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc
17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt
17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa
17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc
17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat
17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc
17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat
17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact
17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa
17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt
17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa
18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat
18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat
18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt
18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat
18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc
18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg
18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt
18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt
18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca
18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc
18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt
18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg
18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg
18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat
18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa
18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat
18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg
19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa
19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag
19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga
19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc
19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa
19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg
19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc
19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa
19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg
19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa
19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa
19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg
19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag
19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga
19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag
19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt
19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct
20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt
20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac
20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct
20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga
20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat
20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg
20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat
20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt
20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc
20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag
20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt
20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca
20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta
20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg
20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc
20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt
21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc
21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg
21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc
21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc
21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa
21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat
21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa
21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat
21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat
21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt
21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt
21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta
21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt
21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt
21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct
21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct
21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat
22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc
22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg
22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca
22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag
22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc
22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat
22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg
22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct
22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca
22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt
22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc
22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta
22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct
22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta
22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct
22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa
22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa
23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag
23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc
23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa
23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc
23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac
23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga
23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt
23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc
23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc
23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat
23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc
23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc
23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc
23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga
23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga
23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa
24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg
24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat
24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat
24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct
24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt
24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga
24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg
24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt
24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc
24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt
24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc
24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg
24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat
24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa
24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt
24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg
24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc
25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag
25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc
25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact
25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta
25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg
25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc
25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag
25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc
25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca
25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag
25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag
25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt
25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag
25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat
25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa
25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata
25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg
26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt
26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat
26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt
26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag
26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc
26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact
26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg
26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat
26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag
26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt
26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt
26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata
26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta
26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat
26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg
26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata
27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata
27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg
27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc
27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat
27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag
27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga
27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt
27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac
27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa
27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg
27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt
27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat
27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct
27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata
27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt
27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat
27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct
28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg
28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca
28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat
28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa
28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca
28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc
28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg
28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa
28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt
28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt
28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt
28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat
28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa
28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac
28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac
28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca
28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct
29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac
29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa
29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc
29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga
29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt
29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca
29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca
29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt
29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt
29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca
29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg
29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt
29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct
29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg
29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag
29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct
30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga
30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact
30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt
30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa
30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt
30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg
30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc
30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc
30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag
30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga
30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta
30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata
30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct
30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata
30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa
30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca
30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga
31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg
31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac
31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat
31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga
31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg
31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt
31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt
31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa
31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat
31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg
31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac
31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat
31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg
31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt
31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt
31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac
31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta
32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa
32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag
32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta
32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt
32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt
32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact
32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc
32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat
32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa
32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta
32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca
32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg
32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc
32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt
32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt
32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt
33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc
33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac
33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta
33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac
33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta
33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta
33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata
33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata
33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc
33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat
33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa
33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct
33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc
33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt
33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga
33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct
33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag
34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc
34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat
34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac
34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt
34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga
34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt
34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc
34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt
34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca
34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct
34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac
34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa
34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc
34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc
34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg
34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt
34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac
35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt
35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat
35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta
35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag
35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga
35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc
35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc
35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt
35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag
35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat
35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta
35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat
35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta
35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg
35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt
35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa
36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt
36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag
36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat
36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga
36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa
36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt
36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga
36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct
36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg
36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg
36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga
36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag
36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga
36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta
36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt
36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag
36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta
37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa
37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc
37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc
37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg
37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc
37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg
37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac
37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag
37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg
37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc
37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt
37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc
37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt
37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc
37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc
37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag
37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa
38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat
38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc
38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat
38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga
38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata
38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa
38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag
38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt
38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg
38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc
38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg
38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa
38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg
38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa
38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt
38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg
39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa
39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt
39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg
39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc
39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga
39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga
39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc
39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca
39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac
39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa
39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc
39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg
39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct
39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa
39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct
39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga
39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat
40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg
40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt
40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa
40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct
40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag
40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag
40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc
40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact
40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg
40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg
40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag
40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac
40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag
40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg
40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca
40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca
40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct
41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg
41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc
41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac
41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg
41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct
41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc
41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt
41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag
41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg
41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta
41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac
41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc
41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt
41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata
41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga
41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg
42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt
42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc
42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt
42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt
42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta
42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg
42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct
42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt
42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta
42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag
42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta
42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca
42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa
42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg
42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag
42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac
42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa
43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt
43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc
43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct
43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat
43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat
43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc
43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt
43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc
43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt
43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt
43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc
43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca
43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag
43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg
43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa
43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg
43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt
44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca
44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc
44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta
44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact
44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc
44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca
44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg
44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg
44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc
44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac
44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg
44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta
44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca
44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc
44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact
44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt
45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga
45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt
45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg
45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc
45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta
45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg
45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc
45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac
45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct
45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca
45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa
45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa
45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa
45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat
45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg
45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt
45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt
46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca
46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc
46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca
46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg
46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt
46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag
46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct
46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt
46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc
46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc
46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc
46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa
46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc
46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat
46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg
46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc
46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac
47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac
47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca
47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg
47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca
47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc
47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga
47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc
47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa
47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact
47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag
47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg
47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc
47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa
47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga
47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg
47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga
48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc
48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct
48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc
48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg
48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg
48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt
48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg
48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa
48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat
48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc
48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg
48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga
48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat
48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg
48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc
48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt
48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt
49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg
49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag
49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc
49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct
49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat
49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa
49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc
49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc
49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc
49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc
49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta
49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg
49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa
49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc
49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag
49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct
49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt
50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat
50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc
50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt
50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct
50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga
50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca
50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt
50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca
50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt
50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg
50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca
50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa
50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga
50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct
50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt
50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac
51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca
51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca
51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct
51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca
51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac
51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc
51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac
51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa
51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc
51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct
51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc
51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc
51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga
51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt
51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac
51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg
51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa
52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga
52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag
52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa
52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg
52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat
52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg
52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca
52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa
52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag
52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca
52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt
52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta
52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa
52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa
52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac
52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc
52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca
53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata
53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt
53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag
53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc
53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact
53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc
53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat
53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag
53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca
53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag
53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc
53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc
53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc
53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt
53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta
53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct
54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg
54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac
54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt
54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg
54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc
54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc
54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg
54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga
54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag
54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc
54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta
54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta
54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg
54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta
54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga
54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa
54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca
55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc
55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca
55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc
55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc
55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc
55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta
55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg
55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg
55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt
55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt
55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg
55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat
55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac
55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt
55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa
55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa
55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta
56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg
56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag
56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca
56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt
56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca
56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt
56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct
56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg
56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg
56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa
56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc
56701 gccgcccctg gtgatggacc agcggggctt cgacga
//
LOCUS CHTMOMPCT 1407 bp ds-DNA BCT 24-JUL-1990
DEFINITION C.trachomatis major outer membrane protein gene, complete cds.
ACCESSION J03813
KEYWORDS major outer membrane protein.
SOURCE C.trachomatis (serotype A) DNA.
ORGANISM Chlamydia trachomatis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE 1 (sites; amino acid sequences)
AUTHORS Baehr,W., Zhang,Y.-X., Joseph,T., Su,H., Nano,F.E.,
Everett,K.D.E. and Caldwell,H.D.
TITLE Mapping antigenic domains expressed by Chlamydia trachomatis major
outer membrane protein genes
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 4000-4004 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1407)
AUTHORS Baehr,W.
JOURNAL Unpublished (1988) Rocky Mountain laboratories, Hamilton, MT 59840
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [2] kindly provided
by W.Baehr, 04-MAY-1988
FEATURES from to/span description
pept 169 1359 major outer membrane protein
BASE COUNT 403 a 283 c 311 g 410 t
ORIGIN 1 bp upstream of XhoI site.
1 ctcgagcatt gaacgacatg ttctcgatta aggctgcttt tacttgcaag acattcctca
61 ggccattaat tgctacagga catcttgtct ggctttaact aggacacagt gccgccagaa
121 aaagatagcg agcacaaaga gagctaatta tacaatttag aggtaagaat gaaaaaactc
181 ttgaaatcgg tattagtatt tgccgctttg agttctgctt cctccttgca agctctgcct
241 gtggggaatc ctgctgaacc aagccttatg atcgacggaa ttctgtggga aggtttcggc
301 ggagatcctt gcgatccttg caccacttgg tgtgacgcta tcagcatgcg tatgggttac
361 tatggtgact ttgttttcga ccgtgttttg aaaacagatg tgaataaaga atttcagatg
421 ggagcggcgc ctactaccag cgatgtagca ggcttagaaa aggatccagt agcaaatgtt
481 gctcgcccaa atcccgctta tggcaaacac atgcaagatg ctgaaatgtt tacgaacgct
541 gcttacatgg cattaaatat ctgggatcgt tttgatgtat tttgtacatt gggagcaact
601 accggttatt taaaaggaaa ctccgcttcc ttcaacttag ttggattatt cggaacaaaa
661 acacaatctt ctggctttga tacagcgaat attgttccta acactgcttt gaatcaagct
721 gtggttgagc tttatacaga cactaccttt gcttggagcg taggtgctcg tgcagctctc
781 tgggaatgtg ggtgtgcaac gttaggagct tctttccaat atgctcaatc taaacctaaa
841 gtagaagagt tgaatgttct ttgtaatgca tccgaattta ctattaataa gccgaaagga
901 tatgttgggg cggaatttcc acttgatatt accgcaggaa cagaagctgc gacaggaact
961 aaggatgcct ctattgacta ccatgagtgg caagcaagtt tagccctttc ttacagatta
1021 aatatgttca ctccttacat tggagttaaa tggtctagag taagttttga tgccgacacg
1081 atccgtatcg ctcagcctaa attggctaaa ccagtcttgg ataccactac tctaaacccg
1141 accatcgctg gtaaaggaac tgtggtctct tccgcagaaa acgaactggc tgatacaatg
1201 caaatcgttt ccttgcagtt gaacaagatg aaatctagaa aatcttgcgg tattgcagta
1261 ggaacaactg ttgtagatgc agataaatac gcagttacaa ttgagactcg cttgatcgat
1321 gagagagcag ctcacgtaaa tgcacaattc cgcttctaat attctaataa ttgtataatt
1381 atattagaat cttttcttag taagctt
//
LOCUS DROD1P 1734 bp ss-mRNA INV 24-JUL-1990
DEFINITION D.melanogaster chromosomal protein D1 mRNA, complete cds.
ACCESSION J04725
KEYWORDS chromosomal protein.
SOURCE D.melanogaster (strain Oregon R) embryo, cDNA to mRNA, (library of
T.-S.Hsieh), clones D1-[9,9.14,15].
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 1734)
AUTHORS Ashley,C.T., Pendleton,C.G., Jennings,W.W., Saxena,A. and
Glover,C.V.C.
TITLE Isolation and sequencing of cDNA clones encoding Drosophila
chromosomal protein D1: A repeating motif in proteins which
recognize AT DNA
JOURNAL J. Biol. Chem. 264, 8394-8401 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by C.V.C.Glover, 29-MAR-1989.
FEATURES from to/span description
pept 228 1295 chromosomal protein D1
mRNA < 1 1734 D1 mRNA
signal 1599 1604 polyadenylation signal
signal 1717 1722 polyadenylation signal
BASE COUNT 558 a 410 c 462 g 304 t
ORIGIN Map position 85D1-2; polytene band.
1 acatcactac tttgaactgt actagcgtgc gcgtatttta caattacaaa ttgagaaggc
61 aaacttgatt ttcggcaaac aaaaaggtga agaagaagag cggagccaag cgaaaagagt
121 gagcgtgtgt tcgtgagaga gagcatttgt gcgcgtgtgt gtgtgagggg acgcgaagaa
181 acaaggattt tttttattcg cttgctaatt gaatcccgta cagagaaatg gaggaagttg
241 cggtaaagaa gcgcggccga ccgtcaaagg catccgtcgg tggtaaatct tcaacagcgg
301 cggttgcagc aatctcgcca ggcatcaaaa agcgcggtcg tcccgccaag aacaagggct
361 ccagtggcgg tggtggacag cggggtcgcc caccgaaggc gtcaaaaatc caaaatgacg
421 aagatcccga agacgagggg gaggaggacg gcgatggcga tggctccggc gctgagcttg
481 caaacaactc atcaccctcg ccgacgaagg gcaggggacg acccaagagc agtggtggcg
541 ccggatcagg atccggggat tcggtgaaga cgcccggttc cgccaagaag cgcaaggccg
601 gcaggcccaa gaagcaccag cccagcgata gcgagaatga agacgatcag gatgaggacg
661 acgacggcaa cagcagcatc gaagaacgtc gcccagtggg ccgaccgtcg gctggatccg
721 tcaacctgaa catatcgcgc acgggacgcg gcctgggaag acccaagaag cgggctgtcg
781 agtccaatgg cgatggggag ccacaagtgc ccaagaaacg cggtcgtcca ccacagaaca
841 agtcgggtag cggtggcagt actggctacg tacccaccgg tcgtccgcgc ggtcgtccaa
901 aggccaatgc tgcacccgtc gagaagcacg aggacaatga cgacgatcag gacgatgaga
961 actccgggga ggaggaacac agctcgcctg agaagacggt agtggcgccg aagaaacgtg
1021 gacgtccctc tcttgccgct ggcaaggtgt ccaaagagga aacaacaaag ccccgtagtc
1081 gcccagcgaa aaatattgat gacgatgcag atgatgctga ttccgccgat cagggccaac
1141 acaactccaa gaaggagtcg aacgacgagg accgtgccgt ggatgggaca ccaaccaaag
1201 gagatggcct caaatggaac tccgatggcg aaaatgatgc caacgacggt tatgtttcgg
1261 ataactacaa cgattccgaa tcggtagctg cctaaaacat acaatttata aacatttcca
1321 cacacccctc cccaaaaatc aacacaaaaa aaaaaagaat gttgtaaaag caaaaaaaaa
1381 aaaaacgaaa aaaaatctgc aagtaagaga agaaaacaca aaaatggata ggacctactt
1441 ggactttaga gaaccttccc ctcctaaaac aacacacttt gtctacatat atattccgat
1501 ttattgctga gaaatcggac aaaccgttta caactgcact gcactgtaat gacgacacat
1561 agctcggaaa acgatttcaa cttggattga aattaccaaa taaataacac ataaataaca
1621 tatttgggga agatttcaag ttccgaagta gaactgaact ttattgtaac aacaaatttg
1681 atttgattgc agaaatttca atcgtataca agtttaaata aacctactat acct
//
LOCUS ECOESTA2A 549 bp ds-DNA BCT 24-JUL-1990
DEFINITION E.coli estA2 gene encoding heat-stable enterotoxin A2 (STa2)
protein, complete cds.
ACCESSION M18345
KEYWORDS STa2 protein; estA2 gene; heat-stable enterotoxin A2.
SOURCE E.coli (strain CRL 25090; isolated from human in Bangladesh) HB101
pGK22 cell line DNA, clones 2,12,21, and 27.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 549)
AUTHORS Guzman-Verduzio,L.M. and Kupersztoch,Y.M.
TITLE Rectification of two Escherichia coli heat-stable enterotoxin
allele sequences and lack of biological effect of the change of the
carboxyterminal tyrosine for histidine
JOURNAL Infect. Immun. 57, 645-648 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.M.Kupersztoch, 08-NOV-1988.
FEATURES from to/span description
pept 149 367 heat-stable enterotoxin A2 precursor
sigp 149 205 heat-stable enterotoxin A2 signal peptide
matp 308 364 heat-stable enterotoxin A2
mRNA 132 549 heat-stable enterotoxin A2 mRNA
BASE COUNT 182 a 93 c 115 g 159 t
ORIGIN Unreported.
1 gaattccccg ccctaaaaca taatattatt atgctcttcg tagcggagag tatagtatga
61 tacacatcac aaaaaaaaaa taaaaaagtt tgcgcaatcg ttctgatttt gatttaaata
121 ttcgtggacg acgtgtttcg gaggtaatat gaaaaaatca atattattta tttttctttc
181 tgtattatct ttttcacctt tcgctcagga tgctaaacca gcagggtctt caaaagaaaa
241 aattacacta gaatcgaaaa aatgtaacat tgtaaaaaaa aataatgaaa gtagtcctga
301 aagcatgaat agtagcaatt actgctgtga attgtgttgt aatcctgctt gtaccgggtg
361 ctattaataa tataaaggga actaaacagt tccctttata tttgtgtgcg ccgtggctgg
421 cgctgttctt caactgtgga ggctgaagaa cgactaagag gtgaaagtcc tccacacacc
481 cggtgagggg aagtgttagc ggaaggcaag gtgatcctac ccacgtaata tggacacagg
541 ggaattcgt
//
LOCUS ECOESTA3A 403 bp ds-DNA BCT 24-JUL-1990
DEFINITION E.coli estA3 gene encoding heat-stable enterotoxin A3 (STa3)
protein, complete cds.
ACCESSION M18346
KEYWORDS STa3 protein; estA3 gene; heat-stable enterotoxin A3.
SOURCE E.coli (strain 153837-2; isolated from human in Bangladesh) HB101
pYK117 cell line DNA, clones 5 and 7.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 403)
AUTHORS Guzman-Verduzio,L.M. and Kupersztoch,Y.M.
TITLE Rectification of two Escherichia coli heat-stable enterotoxin
allele sequences and lack of biological effect of the change of the
carboxyterminal tyrosine for histidine
JOURNAL Infect. Immun. 57, 645-658 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.M.Kupersztoch, 08-NOV-1988.
FEATURES from to/span description
pept 147 365 heat-stable enterotoxin A3 precursor
sigp 147 203 heat-stable enterotoxin A3 signal peptide
matp 204 362 heat-stable enterotoxin A3
mRNA 130 > 403 heat-stable enterotoxin A3 mRNA
BASE COUNT 139 a 61 c 72 g 131 t
ORIGIN Unreported.
1 ttttcggtcg ccgaaaaaga taatattact atgctcttcg tagcggagag tatagtatga
61 tgttcatcac aaaaaaaata aaaaagtttg cgcaaccgtt ctgattttga ttcaaatgtt
121 cgtggatgcc atgttccgga ggtaatatga agaaatcaat attatttatt tttctttctg
181 tattgtcttt ttcacctttc gctcaggatg ctaaaccagt agagtcttca aaagaaaaaa
241 tcacactaga atcaaaaaaa tgtaacattg caaaaaaaag taataaaagt ggtcctgaaa
301 gcatgaatag tagcaattac tgctgtgaat tgtgttgtaa tcctgcttgt accgggtgct
361 attaataata taaagggaac taaacagttc cctttatatt tgt
//
LOCUS HIV2ISY 9636 bp ss-RNA VRL 24-JUL-1990
DEFINITION Human immunodeficiency virus type 2, isolate SBLISY, complete
genome.
ACCESSION J04498
KEYWORDS .
SOURCE Human immunodeficiency virus type 2 (HIV-2) proviral DNA, isolate
SBLISY.
ORGANISM Human immunodeficiency virus type 2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9636)
AUTHORS Franchini,G., Fargnoli,K.A., Giombini,F., Jagodzinski,L.,
De Rossi,A., Bosch,M., Biberfeld,G., Fenyo,A.M., Albert,J.,
Gallo,R.C. and Wong-Staal,F.
TITLE Molecular and biological characterization of a replication
competent human immunodeficiency type 2 (HIV-2) proviral clone
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2433-2437 (1988)
STANDARD full staff_review
COMMENT Clean copy of sequence [1] kindly provided by G. Franchini,
30-JAN-1989.
FEATURES from to/span description
pept 547 2109 gag polyprotein
pept < 1827 4934 pol polyprotein (NH2 terminus uncertain; AA at
1827)
pept 4864 5511 vif protein
pept 5339 5677 vpx protein
pept 5677 5994 vpr protein
pept 5840 6135 tat, exon 2 (first expressed exon)
8281 8377 tat, exon 3 (AA at 8282)
pept 6066 6135 rev, exon 2 (first expressed exon)
8281 8717 rev, exon 3 (AA at 8283)
pept 6145 8685 envelope polyprotein
pept 8519 9289 nef protein
pre-msg 1 9636 genomic mRNA
pre-msg 1 9636 tat, rev, nef subgenomic mRNA
IVS 6136 8280 tat cds intron 2
IVS 6136 8280 rev cds intron 2
IVS 6136 8280 tat, rev, nef subgenomic mRNA intron 2
LTR < 1 300 5' LTR
LTR 8904 > 9636 3' LTR
rpt 1 174 R repeat 5' copy
rpt 9457 9636 R repeat 3' copy
binding 304 320 primer (Lys-tRNA) binding site
signal 9609 9614 mRNA polyadenylation signal
BASE COUNT 3291 a 1940 c 2396 g 2009 t
ORIGIN
1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
61 gtagagcctg ggtgttccct gctggactct caccagtgct tggccggcgc tgggcagacg
121 gctccacgct tgcttgctta aaagacctct taataaagct gccagttaga agcaagttaa
181 gtgtgtgttc ccatctctcc tagtcgccgc ctggtcattc ggtgttctcc tgagtaacaa
241 gaccctggtc tgttaggacc cttcttgctt tgggaaaccg aggcaggaaa atccctagca
301 ggttggcgcc cgaacaggga cttgaggaag actgagaagc cttggaacac ggctgagtga
361 aggcagtaag ggcggcagga acaaaccacg acggagtgct cctagaaagg cgcaggccaa
421 ggtaccaaag gcggcgtgtg gagcgggagt caagaggcct ccgggtgaag gtaagtacct
481 acaccaaaaa ctgtagccgg aaaaggcttg ttatcctacc tttagacagg tagaagattg
541 tgggagatgg gcgcgaaaaa ctccgtcttg agagggaaaa aggcagatga attagaaaaa
601 attaggttac ggcccggcgg gaagaaaaaa tacagattaa aacatattgt gtgggcagcg
661 aatgaattgg acagattcgg attaacagag agcctgttgg agtcaaaaga aggttgccaa
721 aaaattattt cagttttaga accattagta ccaacagggt cagaaaattt aaaaagcctt
781 tataatacta cctgcgtcat ttggtgcttg cacgcagaag agaaagtgaa agatactgaa
841 gaagcaaaaa gaatagtagg gagacatcta gtggcagaaa cagaaactgc agagaaaatg
901 ccaaatataa gtagaccaac agcaccacct agcgggaaag ggggaaactt ccccgtgcaa
961 caaataggcg gcaactatgt ccatctgccg ctgagtcccc gaaccctaaa tgcttgggta
1021 aagttagtag aggagaaaaa gttcggggca gaagtagtgc cgggatttca ggcactctca
1081 gaaggctgca cgccctatga tattaatcaa atgcttaatt gtgtgggcga ccatcaagca
1141 gcgatgcaaa taatcagaga aattattaat gaagaagcag cagactggga tgtacaacat
1201 ccaataccag gccccttacc agcggggcag ctcagagatc cacgaggatc tgacatagca
1261 gggacaacaa gcacagtaga ggaacagatc gaatggatgt ataggcaaga aaatcctgta
1321 ccagtaggaa acatctatag gagatggatc cagataggac tgcagaagtg tgtcagaatg
1381 tacaatccaa ccaacattct agacataaaa caaggaccaa aagagtcgtt ccaaagctat
1441 gtggatagat tctacaaaag cttaagggca gaacagacag atgcagcagt gaagaattgg
1501 atgacccaga cgctgctagt gcaatcgaac ccagactgta agttagtact aaagggacta
1561 gggatgaatc ctaccttaga agagatgcta accgcctgtc aagggatagg tggaccaggc
1621 cagaaggcca gactaatggc agaagcctta aaagaggcca tgcgaccagc ccctatccca
1681 tttgcagcag cccaacagaa aagggcaatt aagtgttgga attgtggaaa ggaagggcac
1741 tcggcaagac aatgccgagc gcctagaaga cagggctgct ggaaatgtgg caagtcagga
1801 cacatcatgg caaactgccc agatagacag gctggttttt tagggcttgg accatgggga
1861 aagaagcccc gcaacttccc cgtggtccca agttcgcagg ggctaacacc aacagcaccc
1921 ccaatggatc cagcagtgga cctactggag aagtacatgc agcaagggag aaaacagaga
1981 gagcagagac aaagaccata caaagaagtg acagaggact tgctgcatct cgagcaagga
2041 gagacaccac acagagagac gacagaggac ttgctgcacc tcaattctct ctttggaaac
2101 gaccagtagt cacagcatac attgaggatc agccagtaga agttttacta gacacagggg
2161 ctgacgactc aatagtagca ggaatagagt tagggagcaa ttatagtcca aaaatagtag
2221 ggggaatagg aggattcata aataccaaag aatataaaga tgtagaaata agagtgctaa
2281 ataaaaaggt aagagccacc ataatgacag gtgatacccc aatcaacatt tttggcagaa
2341 atatcctgac agccttgggc atgtcattaa atttaccagt cgccaaaata gaaccagtaa
2401 aagtaacatt aaagccagga aaagatgggc caaaacaaag acaatggccc ttaacaagag
2461 aaaaaataga agcactaaga gaaatctgtg aaaaaatgga aagagaaggt cagctagaag
2521 aagcgcctcc aactaatccc tataataccc ctacatttgc aattaagaaa aaggacaaaa
2581 acaaatggag gatgctgata gattttagag aactaaacaa ggtaactcaa gatttcacag
2641 aggttcagtt aggaattcca cacccagcag gattagccaa gaaaagaaga attactgtgt
2701 tagatgtagg agatgcctac ttttccatac ccctatatga ggattttaga cagtatactg
2761 catttactct gccatcagta aacaatgcag aaccaggaaa aagatatata tacaaagtct
2821 taccacaggg atggaagggg tcaccagcaa tttttcaata cacaatgagg caagtcttag
2881 aaccattcag aaaagcaaac ccagatgtca ttatcgttca gtacatggat gatatcttaa
2941 tagctagtga caggacagat ttggaacatg acaaagtagt cctacagcta aaggaacttc
3001 taaatggcct gggattttcc accccagacg aaaagttcca aaaggaccct ccataccaat
3061 ggatgggcta tgaactgtgg ccaaccaaat ggaaattgca aaaaatacaa ttgccccaaa
3121 aggaagtatg gacagttaat gacatccaga aactagtggg tgtcctaaac tgggcggcac
3181 aaatctaccc aggaataaag accaaacact tatgtaagct aattagagga aagatgacac
3241 ccacggaaga agtacagtgg acagaattag cagaagcaga gctggaggaa aacaaaatta
3301 tcttaagcca ggaacaggag ggacactatt accaagagga aaaagagtta gaagcaacag
3361 ttcaaaagga tcaagacaat cagtggacat ataaagtaca ccagggagaa aaaattctaa
3421 aagtaggaaa atatgcaaag ataaaaaata cccataccaa cggggtcagg ttgttggcac
3481 aggtagttca gaaaatagga aaagaagcac tagtcatttg gggacgaata ccaaaatttc
3541 acctaccagt agaaagagag acctgggaac agtggtggga taactattgg caagtgacat
3601 ggatcccaga ctgggacttc gtatccaccc caccgttggt caggttagca tttaacctgg
3661 taaaagatcc tataccaggc gcagagacct tctacacgga tggatcttgc aataggcaat
3721 caaaagaggg aaaagcagga tatataacag atagaggaaa agacaaagta aggatattag
3781 agcaaactac caaccaacaa gcagaattag aagcctttgc aatggcagta acagactcag
3841 gtccaaaagt caatattgta gtagactcac agtatgtaat gggaatagta acaggccaac
3901 cggctgaatc agagagtaga atagtaaata aaattataga agagatgata aaaaaggaag
3961 caatctatgt tgcatgggtc ccggcccaca aaggcatagg aggaaatcaa gaaattgacc
4021 acttagtaag tcagggcatc agacaagtat tattcctaga gagaatagag cccgctcagg
4081 aagaacatgg aaaatatcat agcaatgtaa aagaactagc ccataagttt ggattaccca
4141 acctggtggc aagacaaata gtaaacacat gtgcccagtg ccaacaaaaa ggggaagcta
4201 tacatgggca agtaaatgca gaactaggca cctggcaaat ggactgcaca cacttagaag
4261 gaaaaatcat tatagtagca gtacatgttg caagtggatt tatagaagca gaagtcatcc
4321 cacaggaatc aggaaggcaa acagcactct tcctattaaa actggccagt aggtggccaa
4381 taacacactt gcacacagat aatggtgcca acttcacttc acaggaggta aagatggtag
4441 catggtgggt aggcatagaa caatcctttg gagtacctta caatccacaa agccagggag
4501 tagtagaagc aatgaatcac cacctgaaaa atcagataga aagaattaga gagcaggcaa
4561 atacaatgga aacaatagta ctaatggcag ttcattgcat gaattttaaa agaaggggag
4621 gaatagggga tatgacccca gtagaaagac tagtcaatat gatcaccaca gaacaagaaa
4681 tacaattcct ccaagcaaaa aattcaaaat taaaaaattt tcgggtctat ttcagagaag
4741 gcagaaatca actgtggcaa ggacctgggg agctactgtg gaaaggggac ggagcagtca
4801 tagtcaaggt agggacagat ataaaagtaa taccaagaag aaaggccaag atcatcagag
4861 actatggacc aaggcaagag atggatagcg gttcccacct ggagggtgcc agggaggatg
4921 gagaaatggc atagccttat caagtatcta aaatacagaa caaaagatct agaacaggtg
4981 cgctatgttc cccaccataa ggtggggtgg gcatggtgga cttgcagcag ggtaatattc
5041 ccattaaaag gaaacagtca tctagagata caggcatatt ggaacctaac accagaaaaa
5101 ggatggctct cctcttattc agtaagaatg acttggtact cagaagggtt ctggacagat
5161 gttaccccag actgtgcaga caccctaata cacagcactt atttctcttg ctttacggca
5221 ggtgaagtaa gaagagccat caggggagaa aagtcattgt cctgctgcaa ttatccccaa
5281 gcccataagt ccaaggtacc gtcactccaa tttctggcct tagtagtagt acagcaaaat
5341 gacaaacccc agagagacaa taccaccagg aaacagtggc gaagaaacta tcgaagaggc
5401 cttcgattgg ctagacagga cggtagaagc cataaacaga gaggcagtga accacctgcc
5461 cagggagctt attttccagg tgtggcaaag gtcctggaga tactggcatg atgagcaagg
5521 gatgtcacga agctacacaa agtatagata tttgtgctta atgcagaaag ctgtgttcat
5581 gcatttcaag aaagggtgca cttgccgggg ggaaggacat gggccaggag ggtggagatc
5641 aggacctccc cctcctcctc ccccaggttt agtctaatga ctgaagcacc agcagagttt
5701 cccccggagg atgggacccc accgagggaa ccaggggatg agtgggtaat agaaattctg
5761 agagaaataa aggaagaagc tttaaagcat tttgaccctc gcttgctaac tgctcttggc
5821 tactatatct atactagaca tggagacacc cttgaaggcg ccagagagct cattagggtc
5881 ctacaacgag ccctcttcac gcacttcaga gcaggatgtg gccactcaag aattggccaa
5941 ccaaggggaa gaaatcctct ctcagctata ccgaccccta gaaacatgca ataacaaatg
6001 cttttgtaag gggtgctgct tccattgcca gctgtgtttt ttaaacaagg ggctcgggat
6061 atgttatgac cgaaagggca gacgaagaag gagtccgaag aaaactaagg ctcattcgtc
6121 tcctgcatca gacaagtgag tacaatgagt ggtaaaattc agctgcttgt tgcctttctg
6181 ctaactagtg cttgcttaat atattgcacc aaatatgtga ctgttttcta tggagtaccc
6241 gtgtggaaaa atgcatccat tcccctcttt tgtgcaacta aaaatagaga tacttgggga
6301 accatacagt gcttgccaga caatgatgat tatcaagaga tacctttgaa tgtaacagag
6361 gcttttgacg catgggataa tatagtaaca gaacaagcag tagaagatgt ctggaatcta
6421 tttgagacat caataaaacc atgtgtcaaa ttaacgcctt tatgtgtaac aatgaactgt
6481 aacgcaagta cagagagcgc agttgcaact acaagcccat ctggacctga tatgataaat
6541 gatactgatc catgcataca attgaacaat tgctcaggac tgagggagga agacatggtc
6601 gagtgtcagt tcaatatgac aggactagag ttagataaga aaaaacagta tagtgaaacc
6661 tggtactcaa aagatgtggt ttgtgaatca gataacagca cagaccgaaa aagatgttac
6721 atgaaccatt gcaacacatc agtcatcaca gagtcatgtg acaagcacta ttgggatgct
6781 atgagattta gatactgtgc accaccgggt tttgtcttgc taaggtgcaa tgataccaat
6841 tactcaggct ttgagcccaa ttgctctaaa gtagtagctt ctacatgtac aagaatgatg
6901 gaaacgcaac cttctacttg gcttggcttt aatggcacta gggcagaaaa tagaacatat
6961 atctattggc atggtaggga taacagaact attatcagct taaacaaata ttataatctc
7021 accatacttt gtaggagacc agaaaataaa acagttgtac caataacact catgtcaggc
7081 cgcagatttc actcccagaa gatcatcaat aaaaaaccca ggcaagcatg gtgccggttc
7141 aaaggcgagt ggagggaagc catgcaggag gtgaaacaaa cccttgtaaa acatcccagg
7201 tataaaggaa ccaatgacac aaataaaatt aactttacag caccagaaaa agactcagac
7261 ccagaagtag catatatgtg gactaactgc agaggagaat tcctctattg caacatgact
7321 tggttcctta attgggtaga aaacaagacg ggtcaacagc ataactatgt gccgtgccat
7381 atagagcaaa taattaatac ctggcataag gtagggaaaa atgtatattt gcctcctagg
7441 gaaggagagt tgtcctgcga atcaacagtg accagtatca ttgctaacat tgatgttgat
7501 ggagataacc ggacaaatat tacctttagt gcagaggtgg cagaactata ccgattggaa
7561 ttgggggatt ataaattagt agaagtaaca ccaattggct tcgcccctac agcagaaaaa
7621 agatactcct ctgctccagg gagacataag agaggtgtgc ttgtgctagg gttcctaggt
7681 tttctcacga cagcaggtgc tgcaatgggg gcggcgtctc tgacgctgtc ggctcagtct
7741 cggactttat tccgtgggat agtgcagcaa cagcaacagc tgttggacgt ggtcaagaga
7801 caacaagaaa tgttgcgact gaccgtctgg ggaactaaaa acctccaggc aagagtcact
7861 gctattgaga agtacctagc agaccaggcg cgactaaatt catggggatg tgcgtttaga
7921 caagtctgcc acactactgt accatgggta aatgacacct taacacctga gtggaacaac
7981 atgacatggc aagaatggga acacaaaatc cgcttcctag aggcaaatat cagtgagagt
8041 ttagaacagg cacaaatcca gcaagaaaag aatatgtatg agctgcaaaa gctaaatagc
8101 tgggatgttt ttggcaattg gtttgactta acctcctgga tcaagtatat tcaatatgga
8161 gtcatgatag tagtaggaat agtagctctc agaatagtaa tatatgtagt acaaatgcta
8221 agtagactta gaaagggcta taggcctgtt ttctcttccc cccccggtta tattcaacag
8281 atccatatcc acaaggactg ggaacagcca gacagagaag aaacagaaga agacgttggg
8341 aacgacgttg gaagcagatc ctggccttgg ccgatagaat atatacattt cctgatccgc
8401 ctgctgatcc gcctcttgac cagactatac aacagctgca gggacttact atccagactc
8461 tacctgatcc tccaaccact cagagactgg ctgagactca aggcagccta cctgcagtat
8521 gggtgcgagt ggatccaaga agcgttccag gccctcgcga gggttacaag agagactctt
8581 acgagcgcgg ggaggagctt gtggggggct ctgggacgaa tcggaagggg gatactcgca
8641 gttccacgaa ggatcaggca gggagcagaa attgccctcc tgtgagggac agagatatca
8701 gcaaggagac tttatgaata ccccatggag aaccccagca acagaaaagg aaaaagaatc
8761 gtacaggcaa caaaatatgg atgatgtaga ttcagatgat gatgacctag taggggtctc
8821 tgacacatca agagtaccat tgagagcaat gacatataga atggcagtag acatgtcaga
8881 tttaataaaa gataaggggg gactggaagg gatgtattac agtgagagaa gacatagaat
8941 cctagacata tacttagaaa aggaagaagg gataattcca gattggcaga actatactca
9001 tgggctagga gtaaggtacc caatgttctt tgggtggcta tggaagctag taccagtaac
9061 tgtcccacaa gaaggggagg acactgagac tctctgctta atgcactcag cacaagtaag
9121 cagatttgat gacccgcatg gggagacact agtctggaag tttgacccca tgctggctca
9181 tgagtacacg acctttattc tatacccaga ggaatttggg cacaagtcag gaatggaaga
9241 agatgactgg aaggcaaaac tgaaagcaag agggatacca tttagttaaa aacaggaaca
9301 accatacttg gtcaggacag gaagtagcta ctgaaaacag ctgagactgc agggactttc
9361 cagaaggggc tgtaaccagg ggagggacat gggaggagct ggtggggaac gccctcatac
9421 tttctgtata aatgtacccg ctgctcgcat tgtattcagt cgctctgcgg agaggctggc
9481 agattgagcc ctgggaggtt ctctccagca ctagcaggta gagcctgggt gttccctgct
9541 ggactctcac cagtgcttgg ccggcgctgg gcagacggct ccacgcttgc ttgcttaaaa
9601 gacctcttaa taaagctgcc agttagaagc aagtta
//
LOCUS MUSIGHAAL 411 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig productively rearranged H-chain V-region VDJ mRNA from
hybridoma HY2-2C8.
ACCESSION M22438
KEYWORDS immunoglobulin; immunoglobulin heavy chain; joining exon;
variable region.
SOURCE Mouse (strain SM/J) hybridoma HY2-2C8 cell, cDNA to mRNA, clone 1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 411)
AUTHORS Hardy,R.R., Carmack,C.E., Shinton,S.A., Riblet,R.J. and Hayakawa,K.
TITLE A single V-H gene is utilized predominantly in anti-BrMRBC
hybridomas derived from purified Ly-1 B cells: Definition of the
V-H11 family
JOURNAL J. Immunol. 142, 3643-3651 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
C.E.Carmack 03-FEB-1989.
FEATURES from to/span description
pept 1 / 411 Ig heavy-chain (V-J-region) precursor
sigp 1 57 Ig heavy-chain (V-J-region) signal peptide
matp 58 / 411 Ig heavy-chain (V-J-region)
recomb 147 148 framework 1 end/CDR 1 start
recomb 162 163 CDR 1 end/framework 2 start
recomb 204 205 framework 2 end/CDR 2 start
recomb 255 256 CDR 2 end/framework 3 start
recomb 351 352 framework 3 end/CDR 3 or D-region start
recomb 360 361 CDR 3 or D-region end/J-region start
BASE COUNT 100 a 86 c 118 g 107 t
ORIGIN Chromosome 12.
1 atggagtggg aactgagctt aattttcatt tttgctcttt taaaagatgt ccagtgtgaa
61 gtgcagctgt tggagactgg aggaggcttg gtgcaacctg gggggtcacg gggactctct
121 tgtgaaggct cagggtttac ttttagtggc ttctggatga gctgggttcg acagacacct
181 gggaagaccc tggagtggat tggagacatt aattctgatg gcagtgcaat aaactacgca
241 ccatccataa aggatcgatt cactatcttc agagacaatg acaagagcac cctgtacctg
301 cagatgagca atgtgcgatc tgaggacaca gccacgtatt tctgtatgag atatgggccc
361 tactggtact tcgatgtctg gggcacaggg accacggtca ccgtctcctc a
//
LOCUS MUSIGHAAM 411 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig productively rearranged H-chain V-region VDJ mRNA from
hybridoma HY1-1H2.
ACCESSION M22439
KEYWORDS immunoglobulin; immunoglobulin heavy chain; joining exon;
variable region.
SOURCE Mouse (strain SM/J) hybridoma HY1-1H2 cell, cDNA to mRNA, clone 1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 411)
AUTHORS Hardy,R.R., Carmack,C.E., Shinton,S.A., Riblet,R.J. and Hayakawa,K.
TITLE A single V-H gene is utilized predominantly in anti-BrMRBC
hybridomas derived from purified Ly-1 B cells: Definition of the
V-H11 family
JOURNAL J. Immunol. 142, 3643-3651 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
C.E.Carmack 03-FEB-1989.
FEATURES from to/span description
pept 1 / 411 Ig heavy-chain (V-J-region) precursor
sigp 1 54 Ig heavy-chain (V-J-region) signal peptide
matp 55 / 411 Ig heavy-chain (V-J-region)
recomb 141 142 framework 1 end/CDR 1 start
recomb 159 160 CDR 1 end/framework 2 start
recomb 204 205 framework 2 end/CDR 2 start
recomb 252 253 CDR 2 end/framework 3 start
recomb 348 349 framework 3 end/CDR 3 or D-region start
recomb 360 361 CDR 3 or D-region end/J-region start
BASE COUNT 94 a 115 c 96 g 106 t
ORIGIN Chromosome 12.
1 atgagagtgc tgggattttt gtgcctggtg acagtccttc ctggtagcct gtctcagatg
61 cagcttcagg agtcaggacc tggcctggtg aaaccctcac agtcactctt cctcgcctgc
121 tctattactg gtttccccat caccagtggt tactactgga tctggatccg tcagtcacct
181 gggaaacccc tagaatggat ggggtacatc actcatagtg gggaaacttt ctacaaccca
241 tccctccaga gccccatctc cattactaga gaaacatcca agaaccagtt ctttctgcaa
301 ttgaactctg tgaccacaga ggacacagcc atgtattact gtgcaggaga ctatgatggt
361 tactggtact tcgatgtctg gggcacaggg accacggtca ccgtctcctc a
//
LOCUS RATCKBR 4360 bp ds-DNA ROD 24-JUL-1990
DEFINITION Rat creatine kinase gene, complete cds.
ACCESSION M18668 M26669
KEYWORDS creatine kinase.
SOURCE Rat (Sprague Dawley) brain DNA, clone lambda ckb-2.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4360)
AUTHORS Benfield,P.A., Graf,D., Korolkoff,P.N., Hobson,G. and Pearson,M.L.
TITLE Isolation of four rat creatine kinase genes and identification of
multiple potential promoter sequences within the rat brain creatine
kinase promoter region
JOURNAL Gene 63, 227-243 (1988)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.A.Benfield, 23-MAR-1988.
FEATURES from to/span description
pept 1276 1468 creatine kinase, exon 2 (first expressed exon)
1656 1810 creatine kinase, exon 3
1888 2020 creatine kinase, exon 4
2197 2368 creatine kinase, exon 5
2941 3064 creatine kinase, exon 6
3174 3363 creatine kinase, exon 7
3443 3621 creatine kinase, exon 8
pre-msg 941 3818 creatine kinase mRNA and introns
IVS 1015 1254 creatine kinase intron A
IVS 1469 1655 creatine kinase intron B
IVS 1811 1887 creatine kinase intron C
IVS 2021 2196 creatine kinase intron D
IVS 2369 2940 creatine kinase intron E
IVS 3065 3173 creatine kinase intron F
IVS 3364 3442 creatine kinase intron G
BASE COUNT 801 a 1314 c 1278 g 967 t
ORIGIN
1 aagtactgct aaacctactt tccagagaag atgaaactgc tggcagctgg aggcagggtg
61 tgaacgcctg tgttcacaca ttccaattca cagccggcac tggctgtggt caaattccag
121 gtttccccgt acctgcgtgt gtgggacacg gggtgggggt caaagagact ctttcaatct
181 ggtggatcta ttaatagtta tttttgactg gctgcttcct ccacccccag tttgccctgc
241 cgtaggtact gtctttgtag gtggggtggt agaaaccggt ccctcaggaa agaacctggg
301 gatttgaaga ttcaaaaaca gtctctagga gctcagtgtc ttaatttttc agggtccggg
361 tccaagggtc ttggctaggt tccttagggc ccgcccaagg tcagaacacc tggtcttccg
421 gcggacgagc cagatgcgct ctgtgtggac caagaacacc caggagatgc ccgcaggcac
481 cttgaggaag gtcagctccc ctgtaccgtt ttgtaccgcc tggatgcagt ggatggaagc
541 cgtaccccat gctaaaaacc tggtatctct tgccatgacc ccctcaacac tgctgtcggc
601 ctgacaaggc aggaagcagg gaaggcagga acctcaagtg ctaactggct ttccagactc
661 agtctctcgc cccaggcatg tgtgtgaagc aggtctaggt atctacggtc ctgggatttc
721 gtcttcgaga tcctgagcga gcgccacggg gttgaccaaa ctgcgcagcg gggtcgagac
781 ttggggacca tggggtgggc cgctgggggt gtcgagggtt gttgcctcgg acaaagcggc
841 ggcaccaccc caaagcgcgg gccaatggaa tgaatgggct ataaatagcc gccaatggga
901 ggccggcgac gcgcccctta agagctcagg gagcagcgag cggccgtcgt tcttctgcgt
961 gcgcccggag ctgcaagcac aagcagtctg cgttcctgct ccgtcggaat cccggtgagc
1021 gggtcggagg gtgaggggct agtcctgagg gcgtacgcgg ttgggtgcat cacgcgcagc
1081 tcgttgcccg gagcgcgtcc ggaccgccgg ggaacttggg atgcgccgga ctcaaaagat
1141 gcagacctcg ctgaccttgg tgacgtccga ggaaagcttc gggggtccgg attggtctgc
1201 agggtctctg gagaccggtg tgtaaagctc ctctgacccc gctcttcccc gcagcctgcc
1261 gccgccgccg ccgccatgcc cttctccaac agccacaaca cgcagaagct gcgcttcccg
1321 gccgaggatg aattccctga tctgagcagc cacaacaacc atatggccaa ggtgctgacc
1381 cccgagctgt acgcggagct ccgtgccaag tgcacgccga gcggctttac gttggacgac
1441 gccatccaga ctggcgtaga caatccgggt atgcacaccc ttgtagcgtc aggctttcgt
1501 ctccccaaca agccccccgg gcaaggatcc cactgctctt ccctgaacct cggcggggtg
1561 gggtctcctg gtcctctctg cgcttagcca taagggcctc agtgccttgc tctagcttgg
1621 cgctgacgtc actgtcgccg cgccccatcc tccaggccac ccgtacatca tgacagtggg
1681 tgcagtggcg ggcgacgagg agagttacga cgtattcaag gaccttttcg accccatcat
1741 cgaggaccgg cacggcggct accagcccag tgatgagcac aagactgacc tcaacccaga
1801 caacctgcag gtgcggggcg tcagggtccg ggcgtgctgg ggagaggggg tctcggcgct
1861 cactccggcc actgcttgca tccctagggc ggcgatgacc tggaccccaa ctacgtgctg
1921 agctcgcggg tgcgcacagg ccgaagcatc cgcggcttct gcctcccccc tcactgcagc
1981 cgtggggagc gccgcgccat cgagaagctg gcagtagaag gtagggtccg gtatcagcca
2041 ccagagctgc tgcgttcttg tctgggcgcg cagcgttccc ccagggtctg agggtggaag
2101 acccaactaa gacagcgggg acactgccgc gggcgggtgg cgagctttct agctcgccca
2161 cccacccacc ggctccttga actctgtact ccgcagccct gtccagccta gatggcgacc
2221 tgtctggcag gtactatgcg ctcaagagca tgaccgaggc ggacgagcag cagctcattg
2281 acgaccactt cctcttcgac aagcctgtgt cgcctctgct gctggcctcc gcgatggccc
2341 gcgactggcc ggatgctcgc ggcatttggt acgagccctc ttccctcaca gtccccggaa
2401 ggtggggcac ggcctgaatt cgtagattgt gcgttaccac actgtgcgca ggtgaccttg
2461 gttctgcgga tgactgaggt ccctgcgccg cgggaggccg cagtctccag ggattcaagg
2521 gtggtgacca gtttctttgg cgtctgttct ccgccctcct cctgggagcc ggcgcttctt
2581 gttttctctc ctccttctca cccccttatt ccgccgggat tctgccaggt gccagtgacg
2641 caaaagcctc cgcacccgtc cggcagggct cctacccctg cagactgcgc gggcgagcgt
2701 gcttcctctg gtgggctgct ctggctgctc cagccccttg cgggccacgc agcacatgac
2761 tggtgattga ggtgccctgg tctccctgcc ttcgagtgtt gctgggtggg cgtcttagag
2821 acagcccaca ctcctgcact tggaaagcgc tttggaggtc acccacctag ggcgtcctgc
2881 tgggtgtgac ctaactaagg ctgtagactt tgactttgtc tgcccgaccc cctcctccag
2941 gcacaatgac aataagacgt tcctggtgtg gatcaacgag gaggaccacc tgcgggttat
3001 ctccatgcag aaagggggca acatgaagga agttttcacg cgattctgca ctggcctcac
3061 tcaggtctgg cctggtgcta aggagggagg gggtcggggg gatttgagtc cacatttccc
3121 agggtgcttg gggttccagc tacattttcc cttaactttc ccttcacctt cagattgaaa
3181 ctctcttcaa gtctaagaac tatgagttca tgtggaaccc tcacctgggc tacatcctca
3241 cgtgcccatc caacctgggc actgggcttc gggcaggcgt gcacatcaag ctgccccacc
3301 tgggaaagca cgagaagttc tcggaggtgc tcaagcgact gcggcttcag aagcgaggca
3361 caggtgaggg gcagacgact caggtgaagc agcaaacgcg gctttcccca gccctgctga
3421 cctgcctgtc tcccatcccc aggtggtgtg gacaccgctg ctgtgggtgg agtttttgat
3481 gtctccaacg ctgaccgcct gggcttctcg gaggtggagc tggtgcagat ggtggtggac
3541 ggagtgaagc tactcattga gatggagcag cggcttgagc agggtcagcc cattgacgac
3601 ctcatgcctg cccagaagtg aagcctggcc ctagccacca ccaggctgcc gcttcctaac
3661 ttattacccg ggcagtgccc gccatgcatc cttgatgttt gccgcctggc gtgagccctt
3721 agcctcgctg tagagacttc tgtcgccctg ggtagagttt attttttctg atggctaagc
3781 tgttgcagac actgaaataa attagggttt ggcctgccct atgtctgagt gttgcttctc
3841 ctttcttaga gacagtcgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg
3901 tgtgtgtgtc catccgtcct tcctggcctg tcacttcacc tagcagatga aacatgagtc
3961 ttaagataca gggcagaggg agggagactt gagtgcagcc tgaacatcta aggacatttg
4021 tacttgcagg gtggagccta gtatttcctt agctccagcc agctaaaggg agtcagcaac
4081 ttgacatctt ccccagtttc tttctgccgc attcctggca gcctgaatat ctaaagttct
4141 atctcttcac ttccttcccc tccaggcttc aacacttact ccagttctta aataattctc
4201 tccctacttc ctccagcttc caaccccagg ttgagtgcct cctcctgatc cctcagccca
4261 gctctccagc cccatccaga caagagcttt tcagaaccac ccgtccctct ctcttgcagc
4321 ttcctctctg gggccctcct ccttgggagg agacaagctt
//
LOCUS RATTHY 797 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Rat prothymosin-alpha mRNA, complete cds.
ACCESSION M20035
KEYWORDS prothymosin.
SOURCE Rat (strain Charles River CD, 5 week old) spleen, cDNA to mRNA,
clone pRSpro-187.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 797)
AUTHORS Frangou-Lazaridis,M., Clinton,M., Goodall,G.J. and Horecker,B.L.
TITLE Prothymosin alpha and parathymosin: Amino acid sequences deduced
from the cloned rat spleen cDNAs
JOURNAL Arch. Biochem. Biophys. 263, 305-310 (1988)
STANDARD full staff_entry
COMMENT Draft entry and printed copy of sequence for [1] kindly provided by
B.L.Horecker, 26-APR-1988.
FEATURES from to/span description
pept 174 512 prothymosin precursor
matp 177 509 prothymosin
BASE COUNT 248 a 187 c 219 g 143 t
ORIGIN Unreported.
1 gccatctttg catttgttcc cgggtcgtgc tccgcgctca catgcaagcc aacctatcgg
61 ccgacccacc ggtcatcctc caaacgcgga cttccggcta gcttatctcg gccaagagtc
121 ctcgaaactc gacatattct tacgcgtaga ccagaccacc ggcgtgcccc accatgtcag
181 acgcggcagt ggacaccagc tccgagatca ccaccaagga cttgaaggag aagaaggaag
241 ttgtggagga ggcagagaat ggaagagacg cacctgccaa tgggaacgct caaaatgagg
301 aaaatgggga gcaggaggct gacaatgagg tagatgaaga agaggaagaa ggtggggagg
361 aagaggagga ggaggaagaa ggtgatggtg aggaagaaga tggagatgaa gatgaggaag
421 ctgaggctcc tacgggcaag cgggtagctg aggatgatga ggatgatgat gttgagacca
481 agaagcagaa gaagactgat gaggatgact agacagcaaa aggaaaagct aaccttacgc
541 accgtgacct attcaccctc cacttcccgt ctcagaattt aaacgtggtc accttcgagt
601 agagaagcag gcccccgccg cccactccag gtgcacccac acatgacatg cgctcttcac
661 acccacaaaa cacaacatga ttgcacatgg aggagaaaga acagaactcc aggcctcttt
721 ttttcttaaa aatctttaaa ggaattgttg attttattaa ctaagcaaat tgaggcagca
781 ttaataaatt cgacaca
//
LOCUS RATTHYP 378 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Rat spleen parathymosin-alpha mRNA, 3' end.
ACCESSION M20616
KEYWORDS parathymosin.
SOURCE Rat (strain Charles River CD, 5 week old) spleen, cDNA to mRNA,
clone pRSpara-5187.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 378)
AUTHORS Frangou-Lazaridis,M., Clinton,M., Goodall,G.J. and Horecker,B.L.
TITLE Prothymosin-alpha and parathymosin: Amino acid sequences deduced
from the cloned rat spleen cDNAs-1
JOURNAL Arch. Biochem. Biophys. 263, 305-310 (1988)
STANDARD full staff_entry
COMMENT Draft entry and printed copy of sequence for [1] kindly provided by
B.L.Horecker, 26-APR-1988.
FEATURES from to/span description
pept < 1 245 parathymosin precursor (AA at 3)
BASE COUNT 111 a 69 c 144 g 54 t
ORIGIN Unreported.
1 acaaggtgga ggagaaggct ggccggaaag aacggaagaa agaagtagtg gaggaggagg
61 agaatggagc tgaggaggag gaagaagaaa ctgctgagga tggagaggat gatgatgaag
121 gagacgaaga agatgaggag gaagaggagg aggaggatga aggccccgtg cggaagagaa
181 ctgctgaaga ggaggatgaa gcggatccca agaggcagaa gacagaaaac ggggcgtcgg
241 cttgagcccc tgcccgtggc ttggggatgg gaggcccctc aggtctggag gtgggcagga
301 acacacaatc agccccttct ctgctcctgc tctgcctgcc agagcttgac tgcttgacag
361 cttattcatt tcagcatc
//
LOCUS STYNUSAINF 228 bp ds-DNA BCT 24-JUL-1990
DEFINITION S.typhimurium nusA-infB operon promoter region.
ACCESSION M28868
KEYWORDS .
SOURCE S.typhimurium (strain LT2) DNA, clone 40.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 228)
AUTHORS Granston,A.E., Tompson,D.L. and Friedman,D.I.
TITLE Identification of a second promoter for the metY-nusA-infB operon
of Escherichia coli
JOURNAL J. Bacteriol. 172, 2336-2342 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.E.Granston, 27-SEP-1989.
FEATURES from to/span description
signal 159 164 -35 region promoter P1
signal 183 188 -10 region promoter P1
BASE COUNT 69 a 39 c 54 g 66 t
ORIGIN
1 aaaataaatg agttttcatt taatcatctt ttagcagaga caggaagcgt ttagtgaagg
61 tttactgaaa taacgttgtg acatgcgtgt ataagccgga gaggtgggat gagtataacg
121 aagagtgata ttcattcatc tgcatattgg gaatagggtt gacataactc atgccaggtc
181 actatactgt gcgcagattt tacgtcccgt cctcggtacc aaatccca
//
LOCUS SYNAA113M 9186 bp ds-DNA SYN 24-JUL-1990
DEFINITION Sequencing vector pAA113M DNA.
ACCESSION M20189
KEYWORDS cloning vector; synthetic DNA.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 9186)
AUTHORS Ahmed,A.
TITLE A vector for sequencing long (40-kb) DNA fragments
JOURNAL Gene 75, 315-321 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer readable copy of sequence [1] kindly
provided by A.Ahmed, 28-SEP-1988. pAA113M is a cosmid vector
specially developed to sequence long (40kb) DNA fragments. It
makes use of the cos site to clone long segments, IS1 to subdivide
the cloned fragment by deletions, galKT to select such deletions,
and some unique restriction sites to further shorten the fragment
for supercoil or M13 sequencing. It contains a normal plasmid ori
of replication and also the M13 ori of replication to obtain DNA in
the double-stranded or single- stranded form.
FEATURES from to/span description
pept 3787 4977 tet protein
ORF 7433 8293 amp protein
BASE COUNT 2051 a 2446 c 2453 g 2236 t
ORIGIN Unreported.
1 caattactgc aatgccctcg taattaagtg aatttacaat atcgtcctgt tcggagggaa
61 gaacgcggga tgttcattct tcatcacttt taattgatgt atatgctctc ttttctgacg
121 ttagtctccg acggcaggct tcaatgaccc aggctgagaa attcccggac cctttttgct
181 caagagcgat gttaatttgt tcaatcattt ggttaggaaa gcggatgttg cgggttgttg
241 ttctgcgggt tctgttcttc gttgacatga ggttgccccg tattcagtgt cgctgatttg
301 tattgtctga agttgttttt acgttaagtt gatgcagatc aattaatacg atacctgcgt
361 cataattgat tatttgacgt ggtttgatgg cctccacgca cgttgtgata tgtagatgat
421 aatcattatc actttacggg tcctttccgg tgatccgaca ggttacgggg cggcgacctc
481 gcgggttttc gctatttatg aaaattttcc ggtttaaggc gtttccgttc ttcttcgtca
541 taacttaatg tttttattta aaataccctc tgaaaagaaa ggaaacgaca ggtgctgaaa
601 gcgaggcttt ttggcctctg tcgtttcctt tctctgtttt tgtccgtgga atgaacaatg
661 gaagtcaaca aaaagcagct ggctgacatt ttcggtgcga gtatccgtac cattcagaac
721 tggcaggaac agggaatgcc cgttctgcga ggcggtggca agggtaatga ggtgctttat
781 gactctgccg ccgtcataaa atggtatgcc gaaagggatg ctgaaattga gaacgaaaag
841 ctgcgccggg aggttgaaga actgcggcag gccagcgagg cagatcaaca gtcggtacgg
901 ctgaccatcg ggtgccagtg cgggagtttc gttcagcact gtcctgctcc ttgtgatggt
961 ttacaaacgt aaaaagtctc tttaatacct gtttttgctt catattgttc agcgacacgt
1021 tgctgtacgg caggcaccag ctcttccggg atcagcgcga cgatacagcc gccaaatccg
1081 ccgccggtca tgcgtacgcc acctttgtcg ccaatcacag ctttgacgat ttctaccaga
1141 gtgtcaattt gcggcacggt gatttcgaaa tcatcgcgca tagaggcatg agactccgcc
1201 atcaactcgc ccatacgttt caggtcgcct tgctccagcg cgctggcagc ttcaacggtg
1261 cgggcgtttt cagtcagtat atgacgcacg cgttttgcca cgatcgggtc cagttcatgc
1321 gcaacagcgt tgaactcttc aatggtgaca tcacgcaggg ctggctgctg gaagaaacgc
1381 gcaccggttt cgcactgttc acgacgggtg ttgtattcgc tgccaaccag ggtacgtttg
1441 aagttactgt tgatgatgac gacagccaca cctttgggca tggaaactgc tttggtcccc
1501 agtgagcggc aatcgatcag caaggcatga tctttcttgc cgagcgcgga aattagctga
1561 tccatgatcc cgcagttaca gcctacaaac tggttttctg cttcctgacc gttaagcgcg
1621 atttgtgcgc cgtccagcgg cagatgataa agctgctgca atacggttcc gaccgcgact
1681 tccagtgaag cggaagaact taacccggca ccctgcggca cattgccgtg atcaaccatg
1741 tccacgccgc cgaagctgtt gttacgcagt tgcagatgtt tcaccacgcc acgaacgtag
1801 ttagcccatt gatagttttc atgtgcgaca atgggcgcat cgagggaaaa ctcgtcgagc
1861 tgattttcat aatcggctgc catcacgcga actttacggt catcgcgtgg tgcacaactg
1921 atcacggttt gataatcaat cgcgcagggc agaacgaaac cgtcgttgta gtcggtgtgt
1981 tcaccaatca aattcacgcg gccaggcgcc tgaatggtgt gagtggcagg gtagccaaat
2041 gcgttggcaa acagagattg tgttttttct ttcagactca tttcttacac tccggattcg
2101 cgaaaatgga tatcgctgac tgcgcgcaaa cgctctgctg cctgttctgc ggtcaggtct
2161 cgctgggtct ctgccagcat ttcataacca accataaatt tacgtacggt ggcggacggc
2221 agcagaggcg gataaaagtg cgcgtgcagc tgccagtgtt gattctcttc gccattaaat
2281 ggcgcgccgt gccagcccat agagtagggg aaggagcact ggaagaggtt gtcataacga
2341 ctggtcagct ttttcaacgc cagcgccaga tcgctgcgct gggcgtcggt caaatcggtg
2401 atccgtaaaa cgtgggcttt gggcagcagt agcgtttcga acggccaggc agcccagtaa
2461 ggcacgacgg ctaaccagtg ttcggtttcg acaacggtac ggctaccgtc tgccagctcg
2521 cgctgaacat aatccaccag cattggtgat ttctgttcgg caaaatattc tttttgcagg
2581 cggtcttcgc gctcagcttc gttaggcagg aagctatttg cccaaatctg accgtgcgga
2641 tgcgggttag agcagcccat cgccgcgcct ttgttttcaa aaacctgcac ccatgggtac
2701 gttttcccca gttctgcggt ttgctcctgc caggttttga cgatttccgt caatgctgca
2761 acgctgagct ctggcagcgt tttactgtga tccggtgaaa agcagatcac ccggctggtg
2821 ccgcgcgcgc tctggcaacg catcagcgga tcgtgacttt ctggcgcatc tggcgtgtca
2881 gacatcaaag ccgcaaagtc attagtgaaa acgtaagtcc cggtgtaatc ggggttttta
2941 tcgcctgtca cccgcacatt acctgcgcag aggaagcaat ctggatcgtg cgcaggtaac
3001 acctgtttgg ctggcgtttc ctgcgccccc tgccagggcg ttagccgggt gcggtgaaac
3061 cagaatccat tgcccggtga gcggttgtag cggcgatgtg gatgatcaac gggattaaat
3121 tgcgtcatgg tcgttcctta atcgggatat ccctgtggat ggcgtgactg ccagtgccag
3181 gtgtcctgcg ccatttcatc gagtgtgcgc gttacgcagt tcagttcacg gtcggctttg
3241 ctggcgtccg ccagtaggcc ggaaggtcgc cctcgcgagc ggtgcaaaat gataattaac
3301 cggtttgccg cacgtttgct gaaggcatta accacgtcca gcacgctgtt gcctacgcca
3361 gcgccgaggt tgtagatgtg tacgcctggc ttgttcgcca gtttttccat cgccacgacg
3421 tgaccgtccg ccagatccat tacgtggatg taatcgcgta cgccagtacc atcttcggtc
3481 ggataatcgt taccaaaaat cgccagcgag tcgcgacggc ctacagcaac ctgggcgatg
3541 tatggcatca ggttattcgg aatgccttgc ggatcttcgc ccatatcgcc cgacggatgc
3601 gcgccaaccg ggttgaagta gcgcacgagg gcaatgctcc agtccggctg ggctttttgc
3661 agatcggtga ggatctgttc caccatcagc ttgcttttgc cgtaagggct ttgcggtgtg
3721 ccggtcggga agctataatg cggtagttta tcacagttaa attgctaacg cagtcaggca
3781 ccgtgtatga aatctaacaa tgcgctcatc gtcatcctcg gcaccgtcac cctggatgct
3841 gtaggcatag gcttggttat gccggtactg ccgggcctct tgcgggatat cgtccattcc
3901 gacagcatcg ccagtcacta tggcgtgctg ctagcgctat atgcgttgat gcaatttcta
3961 tgcgcacccg ttctcggagc actgtccgac cgctttggcc gccgcccagt cctgctcgct
4021 tcgctacttg gagccactat cgactacgcg atcatggcga ccacacccgt cctgtggatc
4081 ctctacgccg gacgcatcgt ggccggcatc accggcgcca caggtgcggt tgctggcgcc
4141 tatatcgccg acatcaccga tggggaagat cgggctcgcc acttcgggct catgagcgct
4201 tgtttcggcg tgggtatggt ggcaggcccc gtggccgggg gactgttggg cgccatctcc
4261 ttgcatgcac cattccttgc ggcggcggtg ctcaacggcc tcaacctact actgggctgc
4321 ttcctaatgc aggagtcgca taagggagag cgtcgaccga tgcccttgag agccttcaac
4381 ccagtcagct ccttccggtg ggcgcggggc atgactatcg tcgccgcact tatgactgtc
4441 ttctttatca tgcaactcgt aggacaggtg ccggcagcgc tctgggtcat tttcggcgag
4501 gaccgctttc gctggagcgc gacgatgatc ggcctgtcgc ttgcggtatt cggaatcttg
4561 cacgccctcg ctcaagcctt cgtcactggt cccgccacca aacgtttcgg cgagaagcag
4621 gccattatcg ccggcatggc ggccgacgcg ctgggctacg tcttgctggc gttcgcgacg
4681 cgaggctgga tggccttccc cattatgatt cttctcgctt ccggcggcat cgggatgccc
4741 gcgttgcagg ccatgctgtc caggcaggta gatgacgacc atcagggaca gcttcaagga
4801 tcgctcgcgg ctcttaccag cctaacttcg atcattggac cgctgatcgt cacggcgatt
4861 tatgccgcct cggcgagcac atggaacggg ttggcatgga ttgtaggcgc cgccctatac
4921 cttgtctgcc tccccgcgtt gcgtcgcggt gcatggagcc gggccacctc gacctgaatg
4981 gaagccggcg gcacctcgct aacggattca ccactccaag aattggagcc aatcaattct
5041 tgcggagaac tgtgaatgcg caaaccaacc cttggcagaa catatccatc gcgtccgcca
5101 tctccagcag ccgcacgcgg cgcatctcgg gcagcgttgg gtcctggcca cgggtgcgca
5161 tgatcgtgct cctgtcgttg aggacccggc taggctggcg gggttgcctt actggttagc
5221 agaatgaatc accgatacgc gagcgaacgt gaagcgactg ctgctgcaaa acgtctgcga
5281 cctgagcaac aacatgaatg gtcttcggtt tccgtgtttc gtaaagtctg gaaacgcgga
5341 agtcagcgcc ctgcaccatt atgttccgga tctgcatcgc aggatgctgc tggctaccct
5401 gtggaacacc tacatctgta ttaacgaagc gctggcattg accctgagtg atttttctct
5461 ggtcccgccg catccatacc gccagttgtt taccctcaca acgttccagt aaccgggcat
5521 gttcatcatc agtaacccgt atcgtgagca tcctctctcg tttcatcggt atcattaccc
5581 ccatgaacag aaattccccc ttacacggag gcatcaagtg accaaacagg aaaaaaccgc
5641 ccttaacatg gcccgcttta tcagaagcca gacattaacg cttctggaga aactcaacga
5701 gctggacgcg gatgaacagg cagacatctg tgaatcgctt cacgaccacg ctgatgagct
5761 ttaccgcagc tgcctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct
5821 cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg
5881 cgcgtcagcg ggtgttggcg ggtgtcgggg cgcagccatg acccagtcac gtagcgatag
5941 cggagtgtat actggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat
6001 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcgctcttcc
6061 gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct
6121 cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg
6181 tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc
6241 cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga
6301 aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct
6361 cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg
6421 gcgctttctc aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag
6481 ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat
6541 cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac
6601 aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac
6661 tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc
6721 ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt
6781 tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc
6841 ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg
6901 agattatcaa aaaggatctt cacctagatc cttttaaatt gtaaacgtta atattttgtt
6961 aaaattcgcg ttaaattttt gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg
7021 caaaatccct tataaatcaa aagaatagcc cgagataggg ttgagtgttg ttccagtttg
7081 gaacaagagt ccactattaa agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta
7141 tcagggcgat ggcccactac gtgaaccatc acccaaatca agttttttgg ggtcgaggtg
7201 ccgtaaagca ctaaatcgga accctaaagg gagcccccga tttagagctt gacggggaaa
7261 gccggcgaac gtggcgagaa aggaagggaa gaaagcgaaa ggagcgggcg ctagggcgct
7321 ggcaagtgta gcggtcacgc tgcgcgtaac caccacaccc gccgcgctta atgcgccgct
7381 acagggcgcg taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat
7441 gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct
7501 gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg
7561 caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag
7621 ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta
7681 attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg
7741 ccattgctgc aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg
7801 gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct
7861 ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta
7921 tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg
7981 gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc
8041 cggcgtcaac acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg
8101 gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga
8161 tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg
8221 ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat
8281 gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc
8341 tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca
8401 catttccccg aaaagtgcgg taatgactcc aacttattga tagtgtttta tgttcagata
8461 atgcccgatg actttgtcat gcagctccac cgattttgag aacgacagcg acttccgtcc
8521 cagccgtgcc aggtgctgcc tcagattcag gttatgccgc tcaattcgct gcgtatatcg
8581 cttgctgatt acgtgcagct ttcccttcag gcgggattca tacagcggcc agccatccgt
8641 catccatatc accacgtcaa agggtgacag caggctcata agacgcccca gcgtcgccat
8701 agtgcgttca ccgaatacgt gcgcaacaac cgtcttccgg agactgtcat acgcgtaaaa
8761 cagccagcgc tggcgcgatt tagccccgac atagccccac tgttcgtcca tttccgcgca
8821 gacgatgacg tcactgcccg gctgtatgcg cgaggttacc gactgcggcc tgagtttttt
8881 aagtgacgta aaatcgtgtt gaggccaacg cccataatgc gggctgttgc ccggcatcca
8941 acgccattca tggccatatc aatgattttc tggtgcgtac cgggttgaga agcggtgtaa
9001 gtgaactgca gttgccatgt tttacggcag tgagagcaga gatagcgctg atgtccggcg
9061 gtgcttttgc cgttacgcac caccccgtca gtagctgaac aggagggaca gctgatagaa
9121 acagaagcca ctggagcacc tcaaaaacac catcatacac taaatcagta agttggcagc
9181 atcacc
//
LOCUS SYNHUMUBA 246 bp ds-DNA SYN 24-JUL-1990
DEFINITION Synthetic human ubiquitin gene (Thr14->Cys), complete cds.
ACCESSION M21581
KEYWORDS ubiquitin.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (sites)
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
TITLE Ubiquitin function studied by disulfide engineering
JOURNAL J. Biol. Chem. 264, 1887-1893 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 246; for [1])
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
JOURNAL Unpublished (1989) See comment for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by D.Ecker, 01-DEC-1988.
Author address [1] Smith Kline and French Laboratories P.O. Box
1539 King of Prussia, PA 19046
FEATURES from to/span description
pept 10 240 synthetic ubiquitin
BASE COUNT 81 a 51 c 54 g 60 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcatta tgcagatctt cgtcaagacg ttaaccggta aaaccatatg tctagaagtt
61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga
241 ggtacc
//
LOCUS SYNHUMUBB 246 bp ds-DNA SYN 24-JUL-1990
DEFINITION Synthetic human ubiquitin gene (Phe4->Cys), complete cds.
ACCESSION M22214
KEYWORDS ubiquitin.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (sites)
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
TITLE Ubiquitin function studied by disulfide engineering
JOURNAL J. Biol. Chem. 264, 1887-1893 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 246; for [1])
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
JOURNAL Unpublished (1989) See comment for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by D.Ecker, 01-DEC-1988.
Author address [1] Smith Kline and French Laboratories P.O. Box
1539 King of Prussia, PA 19046
FEATURES from to/span description
pept 10 240 synthetic ubiquitin
BASE COUNT 82 a 51 c 54 g 59 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccataac tctagaagtt
61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga
241 ggtacc
//
LOCUS SYNHUMUBC 246 bp ds-DNA SYN 24-JUL-1990
DEFINITION Synthetic human ubiquitin gene (Phe4->Cys, Thr14->Cys), complete
cds.
ACCESSION M22215
KEYWORDS ubiquitin.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (sites)
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
TITLE Ubiquitin function studied by disulfide engineering
JOURNAL J. Biol. Chem. 264, 1887-1893 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 246; for [1])
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
JOURNAL Unpublished (1989) See comment for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by D.Ecker, 01-DEC-1988.
Author address [1] Smith Kline and French Laboratories P.O. Box
1539 King of Prussia, PA 19046
FEATURES from to/span description
pept 10 240 synthetic ubiquitin
BASE COUNT 81 a 50 c 55 g 60 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccatatg tctagaagtt
61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga
241 ggtacc
//
LOCUS SYNHUMUBD 246 bp ds-DNA SYN 24-JUL-1990
DEFINITION Synthetic human ubiquitin gene (Phe4->Cys, Thr14->Cys, Thr66->Cys),
complete cds.
ACCESSION M22216
KEYWORDS ubiquitin.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (sites)
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
TITLE Ubiquitin function studied by disulfide engineering
JOURNAL J. Biol. Chem. 264, 1887-1893 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 246; for [1])
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
JOURNAL Unpublished (1989) See comment for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by D.Ecker, 01-DEC-1988.
Author address [1] Smith Kline and French Laboratories P.O. Box
1539 King of Prussia, PA 19046
FEATURES from to/span description
pept 10 240 synthetic ubiquitin
BASE COUNT 80 a 48 c 56 g 62 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccatatg tctagaagtt
61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga
241 ggtacc
//
LOCUS SYNHUMUBE 246 bp ds-DNA SYN 24-JUL-1990
DEFINITION Synthetic human ubiquitin gene (Phe4->Cys, Thr66->Cys), complete
cds.
ACCESSION M22217
KEYWORDS ubiquitin.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (sites)
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
TITLE Ubiquitin function studied by disulfide engineering
JOURNAL J. Biol. Chem. 264, 1887-1893 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 246; for [1])
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
JOURNAL Unpublished (1989) See comment for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by D.Ecker, 01-DEC-1988.
Author address [1] Smith Kline and French Laboratories P.O. Box
1539 King of Prussia, PA 19046
FEATURES from to/span description
pept 10 240 synthetic ubiquitin
BASE COUNT 81 a 49 c 55 g 61 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccataac tctagaagtt
61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga
241 ggtacc
//
LOCUS SYNHUMUBF 246 bp ds-DNA SYN 24-JUL-1990
DEFINITION Synthetic human ubiquitin gene (Thr66->Cys), complete cds.
ACCESSION M22218
KEYWORDS ubiquitin.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (sites)
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
TITLE Ubiquitin function studied by disulfide engineering
JOURNAL J. Biol. Chem. 264, 1887-1893 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 246; for [1])
AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
Dixon,J.S., Weber,P.L. and Crooke,S.T.
JOURNAL Unpublished (1989) See comment for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by D.Ecker, 01-DEC-1988.
Author address [1] Smith Kline and French Laboratories P.O. Box
1539 King of Prussia, PA 19046
FEATURES from to/span description
pept 10 240 synthetic ubiquitin
BASE COUNT 81 a 50 c 54 g 61 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcatta tgcagatctt cgtcaagacg ttaaccggta aaaccataac tctagaagtt
61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga
241 ggtacc
//
LOCUS XELC3 1087 bp ss-mRNA VRT 24-JUL-1990
DEFINITION X.laevis complement component 3 (C3) mRNA, 3' end.
ACCESSION M21790 J04493
KEYWORDS complement component 3; serum protein.
SOURCE X.laevis (adult) liver, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1087)
AUTHORS Grossberger,D., Marcuz,A., Du Pasquier,L. and Lambris,J.D.
TITLE Conservation of structural and functional domains in complement
component C3 of Xenopus laevis and mammals
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1323-1327 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
D.Grossberger, 14-DEC-1988.
FEATURES from to/span description
pept < 1 969 complement component 3 (AA at 1)
BASE COUNT 327 a 211 c 258 g 291 t
ORIGIN Unreported.
1 gggagggtgg tgaccgtgta ccagtctctt gtcatagaga aagaaagaaa gtgccttaac
61 tttgaactgt cttgtgaatg taaaggaagt acaacttgca agacctcccg aagtgcaaag
121 gcaacagtgt ctatagaagc ctgtgcaagg catctcaaga acgttgatgc caccatgtcc
181 attattgata tttccatgat gactggcttt tctcctgata ccgattccct ggataggcta
241 atgaagggag tggacaaata catctctaaa tatgaagtca acaaaggagc aaatgacaag
301 ggaacactta ttctctactt ggacaaagtc tcccacatag acgaagaatg tgtgaagttt
361 tatgctcatc aatactttga agtgggtttc atccagccag cttctgtaac tgtgtatgac
421 tattatactc cagataatcg ttgcactaaa ttttaccatg tggaagaagg cagtgccttg
481 ctggcaggga tttgccaagg ggatatatgc cgatgtgcag aagagaactg tttcatgcag
541 cagcaaattg agggtaaaat cactgctgac atgagagtca acatggcttg gctcccggag
601 tggaattttg tgtacaaggc tactctcacg gagttgcagc ccagcgacaa ctatgacaac
661 tatgtaatga caattaagaa ggtcattaag caaggcacag atgaggatcc tgaggacaag
721 acacgcaatt ttatcagcca tatcaaatgc cgaaaagctt taaatatgca gctgaaccga
781 gattatctga tttggggggt aactggtgac ctctggcgcc acgatggata ttcctacatc
841 attgggaagg acacatggat ggagtggtgg cccaatgaga gggagtgtca acaacgtgag
901 aaccaggatc tctgcgatga ctttgagaca gtttctgaca acctggagat tgttggttgt
961 cccaactgag gaacctaaat tcagaccaaa atgtttccat cttcccaagc tcatctgtga
1021 tgtttaattt tgtaaatgtt gttcagtgtt ttatatacag ttccaataaa gcatttattt
1081 gcttaac
//
LOCUS YSCKEX2A 4106 bp ds-DNA PLN 24-JUL-1990
DEFINITION S.cerevisiae prohormone processing enzyme (KEX2) gene, complete
cds.
ACCESSION M24201
KEYWORDS Ca-dependent serine protease; furin-related protein;
processing enzyme; protease; subtilism-related protease.
SOURCE S.cerevisiae (strain DBY939) DNA, clone YEp24-pJ2B.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 4106)
AUTHORS Fuller,R.S., Brake,A. and Thorner,J.
TITLE Yeast prohormone processing enzyme (KEX2 gene product) is a Ca2+
-dependent serine protease
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1434-1438 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Brake, 09-MAY-1989
FEATURES from to/span description
pept 1329 3773 prohormone processing enzyme (KEX2)
BASE COUNT 1327 a 803 c 827 g 1149 t
ORIGIN Chromosome 14, left arm.
1 gaattctctg ttgactacta aactgagaga atttgccgag actctaagaa cagctttgaa
61 agagcgttct gccgatgatt ccataattgt cactctgaga gagcaaatgc aaagagaaat
121 cttcaggttg atgtcgttgt tcatggacat acctccagtg caaccaaacg agcaattcac
181 ttgggaatac gttgacaaag acaagaaaat ccacactatc aaatcgactc cgttagaatt
241 tgcctccaaa tacgcaaaat tggacccttc cacgccagtc tcattgatca atgatccaag
301 acaccatatg gtaaattaat taagatcgat cgtttaggaa acgtccttgg cggagatgcc
361 gtgatttact taaatgttga caatgaaaca ctatctaaat tggttgttaa gagattacaa
421 aataacaaag ctgtcttttt tggatctcac actccaaagt tcatggacaa gaaaactggt
481 gtcatggata ttgaattgtg gaactatcct gccatggcta taatttacct cagcaaaagg
541 catccggtat tagataccat gaaagtttga tgactcatgc tatgttggat cactggctgc
601 cacgtcgatg aaacgtctaa attaccactt cgctaccgtc tgaaaattcc tggggtaaag
661 actccggtaa agacggatta tacgtgatga ctcaaaagta cttcgaggag tactgctttc
721 aaattgtggt cgatatcaat gaattgccaa aagagctggc ttcaaaattc acctcaggta
781 aggaagagcc gattgtcttg cccatctgga cccaatggtg ctttggccaa ataaatagtt
841 tcagcagctc tgatgtagat acacgtatct cgacatgttt tatttttact atacatacat
901 aaaagaaata aaaaatgata acgtgtatat tattattcat ataatcaatg agggtcattt
961 tctgaaacgc aaaaaacggt aaatggaaaa aaaataaaga tagaaaaaga aaacaaacaa
1021 aggaaaggtt agcatattaa ataactgagc tgatacttca acagcatcgc tgaagagaac
1081 agtattgaaa ccgaaacatt ttctaaaggc aaacaaggta ctccatattt gctggacgtg
1141 ttctttctct cgtttcatat gcataattct gtcataagcc tgttcttttt cctggcttaa
1201 acatcccgtt ttgtaaaaga gaaatctatt ccacatattt cattcattcg gctaccatac
1261 taaggataaa ctaatcccgt tgttttttgg cctcgtcaca taattataaa ctactaaccc
1321 attatcagat gaaagtgagg aaatatatta ctttatgctt ttggtgggcc ttttcaacat
1381 ccgctcttgt atcatcacaa caaattccat tgaaggacca tacgtcacga cagtattttg
1441 ctgtagaaag caatgaaaca ttatcccgct tggaggaaat gcatccaaat tggaaatatg
1501 aacatgatgt tcgagggcta ccaaaccatt atgttttttc aaaagagttg ctaaaattgg
1561 gcaaaagatc atcattagaa gagttacagg gggataacaa cgaccacata ttatctgtcc
1621 atgatttatt cccgcgtaac gacctattta agagactacc ggtgcctgct ccaccaatgg
1681 actcaagctt gttaccggta aaagaagctg aggataaact cagcataaat gatccgcttt
1741 ttgagaggca gtggcacttg gtcaatccaa gttttcctgg cagtgatata aatgttcttg
1801 atctgtggta caataatatt acaggcgcag gggtcgtggc tgccattgtt gatgatggcc
1861 ttgactacga aaatgaagac ttgaaggata atttttgcgc tgaaggttct tgggatttca
1921 acgacaatac caatttacct aaaccaagat tatctgatga ctaccatggt acgagatgtg
1981 caggtgaaat agctgccaaa aaaggtaaca atttttgcgg tgtcggggta ggttacaacg
2041 ctaaaatctc aggcataaga atcttatccg gtgatatcac tacggaagat gaagctgcgt
2101 ccttgattta tggtctagac gtaaacgata tatattcatg ctcatggggt cccgctgatg
2161 acggaagaca tttacaaggc cctagtgacc tggtgaaaaa ggctttagta aaaggtgtta
2221 ctgagggaag agattccaaa ggagcgattt acgtttttgc cagtggaaat ggtggaactc
2281 gtggtgataa ttgcaattac gacggctata ctaattccat atattctatt actattgggg
2341 ctattgatca caaagatcta catcctcctt attccgaagg ttgttccgcc gtcatggcag
2401 tcacgtattc ttcaggttca ggcgaatata ttcattcgag tgatatcaac ggcagatgca
2461 gtaatagcca cggtggaacg tctgcggctg ctccattagc tgccggtgtt tacactttgt
2521 tactagaagc caacccaaac ctaacttgga gagacgtaca gtatttatca atcttgtctg
2581 cggtagggtt agaaaagaac gctgacggag attggagaga tagcgccatg gggaagaaat
2641 actctcatcg ctatggcttt ggtaaaatcg atgcccataa gttaattgaa atgtccaaga
2701 cctgggagaa tgttaacgca caaacctggt tttacctgcc aacattgtat gtttcccagt
2761 ccacaaactc cacggaagag acattagaat ccgtcataac catatcagaa aaaagtcttc
2821 aagatgctaa cttcaagaga attgagcacg tcacggtaac tgtagatatt gatacagaaa
2881 ttaggggaac tacgactgtc gatttaatat caccagcggg gataatttca aaccttggcg
2941 ttgtaagacc aagagatgtt tcatcagagg gattcaaaga ctggacattc atgtctgtag
3001 cacattgggg tgagaacggc gtaggtgatt ggaaaatcaa ggttaagaca acagaaaatg
3061 gacacaggat tgacttccac agttggaggc tgaagctctt tggggaatcc attgattcat
3121 ctaaaacaga aactttcgtc tttggaaacg ataaagagga ggttgaacca gctgctacag
3181 aaagtaccgt atcacaatat tctgccagtt caacttctat ttccatcagc gctacttcta
3241 catcttctat ctcaattggt gtggaaacgt cggccattcc ccaaacgact actgcgagta
3301 ccgatcctga ttctgatcca aacactccta aaaaactttc ctctcctagg caagccatgc
3361 attatttttt aacaatattt ttgattggcg ccacattttt ggtgttatac ttcatgtttt
3421 ttatgaaatc aaggagaagg atcagaaggt caagagcgga aacgtatgaa ttcgatatca
3481 ttgatacaga ctctgagtac gattctactt tggacaatgg aacttccgga attactgagc
3541 ccgaagaggt tgaggacttc gattttgatt tgtccgatga agaccatctt gcaagtttgt
3601 cttcatcaga aaacggtgat gctgaacata caattgatag tgtactaaca aacgaaaatc
3661 catttagtga ccctataaag caaaagttcc caaatgacgc caacgcagaa tctgcttcca
3721 ataaattaca agaattacag cctgatgttc ctccatcttc cggacgatcg tgattcgata
3781 tgtacagaaa gcttcaaatt acaaaatagc atttttttct tatagattat aatactctct
3841 catacgtata cgtatatgtg tatatgatat ataaacaaac attaatatcc tattccttcc
3901 gtttgaaatc cctatgatgt actttgcatt gtttgcaccc gcgaataaaa tgaaaactcc
3961 gaaccgatat atcaagcaca taaaagggga gggtccaatt aatgcatatt taagaccaca
4021 gctgaataac tttaaaacgg cagacaaaac aaaaaatagg tcgaataaac cttacctgcc
4081 tagaaggaat gacagcagct aataag
//
LOCUS YSCTASPT 2641 bp ds-DNA PLN 24-JUL-1990
DEFINITION S.cerevisiae mitochondrial aspartyl-tRNA synthetase (MSD1) gene,
complete cds.
ACCESSION M26020
KEYWORDS aspartyl-tRNA synthetase.
SOURCE S.cerevisiae (strain D273-10B) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2641)
AUTHORS Tzagoloff,A.
JOURNAL Unpublished (1989) Columbia U. Dept. Biol. Sci. NY NY 10027
STANDARD full staff_review
REFERENCE 2 (sites; for [1])
AUTHORS Gampel,A. and Tzagoloff,A.
TITLE Homology of aspartyl-and lysyl-tRNA synthetases
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6023-6027 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Tzagoloff, 06-JUL-1989.
FEATURES from to/span description
pept 601 2577 aspartyl-tRNA synthetase
BASE COUNT 878 a 466 c 551 g 746 t
ORIGIN
1 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaacctg taacaatagc aataccccaa
61 atacctaatg tagttccagc aagcaagcta aaaagtaaag caacaacata actcacccct
121 gcatctgcag cttttgcccg ggcagcctgc tctgcctgtg ttttctttaa ttgagcagta
181 gaccatttag cagttgcatg aatagctgca gcgtcacatc ggataataat gatggcagcc
241 attgtagaag tgccttttgc atttctagtc tctttctcgg tctagctagt tttactacat
301 cgcgaagata gaatcttaga tcacactgcc aatcttgtgc aagctgtggc aaaatcgacc
361 ttctccatgc gtccatggta aatgaaatgt tttgctttct ttttcgtgct tgcaaagttt
421 ggtagatgaa atttcagaaa gctactgccg taccaaagta caataataag gtgaaaatgg
481 gagtaggaag taatagatgg cattgaatgc ctcatttatt cattattcat gttttcttaa
541 cattttgatt cagttgagtg catgcttttt tgaaaagact aaacattaca agttttaatc
601 atgttggccc gttccagggt gtgcttgcag acaatcacta gacggttggc agactttcca
661 gaagccaatg ctattaagaa aaaatttctc tttaggaagg acacctcgac catcaaacag
721 ttaaaaggac tgtctagcgg ccagaaaata gtactcaatg gatggataga gcagaagcca
781 aaaagagttg ggaaaaattt gatcttcgga cttttaaggg actctaacgg tgacattatc
841 cagttggttg ataacaaatc gttgttgaaa ggctttactt tagaggatgt ggttcaagcc
901 gtaggtatac tctctttgaa gaggaagcta tcaaatgagg acgcagatga gtacgaagtg
961 caattggagg atattactgt gttaaatgca tctaataaaa aaccagcgca aatgcaggat
1021 tttaaattgt cagccatata cccacctgag ttccgctatt tgcaattgag aaatcccaaa
1081 tatcaagatt ttttaaagaa gaggtcatct atctctaagg aaataagaaa ctccttcaac
1141 aattttgatt ttacggaggt cgagacccca atgttattta aagctacccc agaaggcgca
1201 agagagtttc tggttccaac aaggacaaag agatccgatg gtaaaccatc gttttatgct
1261 ctagatcaga gccctcaaca atacaagcaa ctcttaatgg ctagtggtgt caacaaatat
1321 tatcaaatgg caaggtgctt tagagatgaa gatttaagag cagacaggca gcctgagttt
1381 acacaggttg atatggaaat ggcctttgct aattctgaag atgtcatgaa aatcatagaa
1441 aagacagttt ctggggtatg gagtaaattt tccaaaaaac gaggattatt gactttagac
1501 agtaagggta cattagtgcc tgcgaaaaag gaaaacggca cagtatctat ctttcgtatg
1561 acctacgaac aagccatgac ctcatatggt attgacaagc cagatttgag agcgccagat
1621 ttgaagatta tcaatttagg cgagttcaat gcctttagtc atttgaacaa aaaatttccc
1681 gtttttgaag taattattct aagaagtgcc ttttcaaata tggaagagta caaagaacga
1741 tggtcgtttc tgacaaataa cagtaattac aattatagag ttccaatagt gctaccaatt
1801 gaaaatgacg aacaagctaa ttcaaattgg tttgagaatt ttcatgcaat tgccacgttt
1861 gaaaacccac atctaataac caaatttctg aaactgaaaa aaggtgacat tgtatgcggt
1921 tgtacgagag agccaaacca ttccattttc gagaatccta ctcccctggg aagattgaga
1981 cagttggtgc tacaaagtga gcatgggaaa aatatctatc atgctgtcaa taaggatgtt
2041 gcctcatgga ttgtggattt cccgttattt tctcccgtta taattgaaga taagtctggt
2101 aaaaaagaaa agcttgcata tccggagtac gaaaaggata gactatgttc cacgcatcat
2161 ccttttacta tggtgaagct taaagactac gaaaaattag aaaagacgcc ggaaaagtgc
2221 ttgggtcggc attatgacct cgtagttaac ggtgtggaac ttggtggtgg ctcaacaaga
2281 attcacgatc caagattaca agactatatt ttcgaagata tcctcaaaat agataatgca
2341 tatgaactat ttggccattt actgaatgct tttgatatgg gaacaccgcc acacgctgga
2401 tttgctattg gttttgatcg tatgtgcgct atgatctgtg aaactgagag tataagggac
2461 gtaatcgcct tcccaaaaag tattaccggg gctgatttgg ttgtcaagag tccaagtgtg
2521 atacctgaaa gcattctgga accttacaat atcaagtata gtaattcaaa aaaatgagca
2581 gaagttgatg ttacctagat gtgagagaat cttagtgtgg ctgcagtatg taggtttttc
2641 t
//
LOCUS YSCTRMSD 2604 bp ds-DNA PLN 24-JUL-1990
DEFINITION S.cerevisiae MSD gene encoding mitochondrial aspartyl-tRNA
synthetase, complete cds.
ACCESSION M24418
KEYWORDS aspartyl-tRNA synthetase.
SOURCE S.cerevisiae (strain D273-10B) DNA, clone pG94/T1.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2604)
AUTHORS Gampel,A. and Tzagoloff,A.
TITLE Homology of aspartyl- and lysyl-tRNA synthetases
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6023-6027 (1989)
STANDARD full staff_review
COMMENT Draft entry of sequence [1] kindly submitted by A.Tzagoloff,
01-MAY-1989.
FEATURES from to/span description
pept 565 2541 mitochondrial aspartyl-tRNA synthetase
BASE COUNT 842 a 466 c 551 g 745 t
ORIGIN
1 cctgtaacaa tagcaatacc ccaaatacct aatgtagttc cagcaagcaa gctaaaaagt
61 aaagcaacaa cataactcac ccctgcatct gcagcttttg cccgggcagc ctgctctgcc
121 tgtgttttct ttaattgagc agtagaccat ttagcagttg catgaatagc tgcagcgtca
181 catcggataa taatgatggc agccattgta gaagtgcctt ttgcatttct agtctctttc
241 tcggtctagc tagttttact acatcgcgaa gatagaatct tagatcacac tgccaatctt
301 gtgcaagctg tggcaaaatc gaccttctcc atgcgtccat ggtaaatgaa atgttttgct
361 ttctttttcg tgcttgcaaa gtttggtaga tgaaatttca gaaagctact gccgtaccaa
421 agtacaataa taaggtgaaa atgggagtag gaagtaatag atggcattga atgcctcatt
481 tattcattat tcatgttttc ttaacatttt gattcagttg agtgcatgct tttttgaaaa
541 gactaaacat tacaagtttt aatcatgttg gcccgttcca gggtgtgctt gcagacaatc
601 actagacggt tggcagactt tccagaagcc aatgctatta agaaaaaatt tctctttagg
661 aaggacacct cgaccatcaa acagttaaaa ggactgtcta gcggccagaa aatagtactc
721 aatggatgga tagagcagaa gccaaaaaga gttgggaaaa atttgatctt cggactttta
781 agggactcta acggtgacat tatccagttg gttgataaca aatcgttgtt gaaaggcttt
841 actttagagg atgtggttca agccgtaggt atactctctt tgaagaggaa gctatcaaat
901 gaggacgcag atgagtacga agtgcaattg gaggatatta ctgtgttaaa tgcatctaat
961 aaaaaaccag cgcaaatgca ggattttaaa ttgtcagcca tatacccacc tgagttccgc
1021 tatttgcaat tgagaaatcc caaatatcaa gattttttaa agaagaggtc atctatctct
1081 aaggaaataa gaaactcctt caacaatttt gattttacgg aggtcgagac cccaatgtta
1141 tttaaagcta ccccagaagg cgcaagagag tttctggttc caacaaggac aaagagatcc
1201 gatggtaaac catcgtttta tgctctagat cagagccctc aacaatacaa gcaactctta
1261 atggctagtg gtgtcaacaa atattatcaa atggcaaggt gctttagaga tgaagattta
1321 agagcagaca ggcagcctga gtttacacag gttgatatgg aaatggcctt tgctaattct
1381 gaagatgtca tgaaaatcat agaaaagaca gtttctgggg tatggagtaa attttccaaa
1441 aaacgaggat tattgacttt agacagtaag ggtacattag tgcctgcgaa aaaggaaaac
1501 ggcacagtat ctatctttcg tatgacctac gaacaagcca tgacctcata tggtattgac
1561 aagccagatt tgagagcgcc agatttgaag attatcaatt taggcgagtt caatgccttt
1621 agtcatttga acaaaaaatt tcccgttttt gaagtaatta ttctaagaag tgccttttca
1681 aatatggaag agtacaaaga acgatggtcg tttctgacaa ataacagtaa ttacaattat
1741 agagttccaa tagtgctacc aattgaaaat gacgaacaag ctaattcaaa ttggtttgag
1801 aattttcatg caattgccac gtttgaaaac ccacatctaa taaccaaatt tctgaaactg
1861 aaaaaaggtg acattgtatg cggttgtacg agagagccaa accattccat tttcgagaat
1921 cctactcccc tgggaagatt gagacagttg gtgctacaaa gtgagcatgg gaaaaatatc
1981 tatcatgctg tcaataagga tgttgcctca tggattgtgg atttcccgtt attttctccc
2041 gttataattg aagataagtc tggtaaaaaa gaaaagcttg catatccgga gtacgaaaag
2101 gatagactat gttccacgca tcatcctttt actatggtga agcttaaaga ctacgaaaaa
2161 ttagaaaaga cgccggaaaa gtgcttgggt cggcattatg acctcgtagt taacggtgtg
2221 gaacttggtg gtggctcaac aagaattcac gatccaagat tacaagacta tattttcgaa
2281 gatatcctca aaatagataa tgcatatgaa ctatttggcc atttactgaa tgcttttgat
2341 atgggaacac cgccacacgc tggatttgct attggttttg atcgtatgtg cgctatgatc
2401 tgtgaaactg agagtataag ggacgtaatc gccttcccaa aaagtattac cggggctgat
2461 ttggttgtca agagtccaag tgtgatacct gaaagcattc tggaacctta caatatcaag
2521 tatagtaatt caaaaaaatg agcagaagtt gatgttacct agatgtgaga gaatcttagt
2581 gtggctgcag tatgtaggtt tttc
//
LOCUS MUSCD14A 1873 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse monocyte/granulocyte cell surface glycoprotein homologous
(CD14) gene, complete cds.
ACCESSION M34510
KEYWORDS CD14 receptor protein;
monocyte/granulocyte cell surface glycoprotein.
SOURCE Mouse (strain BALB/c) adult liver macrophage cell line J774A.1 DNA,
clone pMCD14/5, and cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1873)
AUTHORS Ferrero,E., Hsieh,C.-L., Francke,U. and Goyert,S.M.
TITLE CD14 is a member of the family of leucine-rich proteins and is
encoded by a gene syntenic with multiple receptor genes
JOURNAL J. Immunol. 145, 331-336 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by E.M.Ferrero, 25-MAY-1990.
FEATURES from to/span description
pept 173 175 murine CD14 protein (MCD14), exon 1
273 1370 murine CD14 protein, exon 2
sigp 173 175 murine CD14 protein signal peptide
273 314 murine CD14 protein signal peptide
matp 315 1367 murine CD14 protein
pre-msg 62 1602 murine CD14 mRNA and introns
IVS 176 272 murine CD14 protein intron A
signal 1433 1438 poly-A signal
signal 1583 1588 poly-A signal
BASE COUNT 427 a 500 c 478 g 468 t
ORIGIN Chromosome 18.
1 aggaagggac agggtgacac cccaggatta cataaattta caggggctgc cgaattggtc
61 gaacaagccc gtggaacctg gaagccagag aacaccatcg ctgtaaagga aagaaactga
121 agcttttctc ggagcctatc tgggctgctc aaactttcag aatctaccga ccatggtgag
181 tcagacagac tgtcttgggg tggaactgga gccaacctga ggaatctcag ggtcctggca
241 ggagtctccc tgtcccccta ctttctcctc aggagcgtgt gcttggcttg ttgctgttgc
301 ttctggtgca cgcctctccc gccccaccag agccctgcga gctagacgag gaaagttgct
361 cctgcaactt ctcagatccg aagccagatt ggtccagcgc tttcaattgt ttgggggcgg
421 cagatgtgga attgtacggc ggcggccgca gcctggaata ccttctaaag cgtgtggaca
481 cggaagcaga tctggggcag ttcactgata ttatcaagtc tctgtcctta aagcggctta
541 cggtgcgggc cgcgcggatt cctagtcgga ttctattcgg agccctgcgt gtgctcggga
601 tttccggcct ccaggaactg actcttgaaa atctcgaggt aaccggcacc gcgccgccac
661 cgcttctgga agccaccgga cccgatctca acatcttgaa cctccgcaac gtgtcgtggg
721 caacaaggga tgcctggctc gcagaactgc agcagtggct aaagcctgga ctcaaggtac
781 tgagtattgc ccaagcacac tcactcaact tttcctgcga acaggtccgc gtcttccctg
841 ccctctccac cttagacctg tctgacaatc ctgaattggg cgagagagga ctgatctcag
901 ccctctgtcc cctcaagttc ccgaccctcc aagttttagc gctgcgtaac gcggggatgg
961 agacgcccag cggcgtgtgc tctgcgctgg ccgcagcaag ggtacagctg caaggactag
1021 accttagtca caattcactg cgggatgctg caggcgctcc gagttgtgac tggcccagtc
1081 agctaaactc gctcaatctg tctttcactg ggctgaagca ggtacctaaa gggctgccag
1141 ccaagctcag cgtgctggat ctcagttaca acaggctgga taggaaccct agcccagatg
1201 agctgcccca agtggggaac ctgtcactta aaggaaatcc ctttttggac tctgaatccc
1261 actcggagaa gtttaactct ggcgtagtca ccgccggagc tccatcatcc caagcagtgg
1321 ccttgtcagg aactctggct ttgctcctag gagatcgcct ctttgtttaa ggaacatttg
1381 catcctcctg gtttctgagg gtcctcgtca acgaatcctc tgctttaaat ttattaaaat
1441 cttaatccac gatgtaagga aagaaaggca gtcaagatgg ttcagtgggt aaaagccagc
1501 aaacttgacc cctgatttta accctcagga tccacacgga aggggaaaac tcactcctga
1561 aagttgtcca tctgtgctca caaataaata ttttttaaaa taacaatgtg tttgttggtt
1621 ttgtttttgt ttgggttttg ttgtggtttt gtttgttttg ttttgttttt gagacagtct
1681 ggctatgtat ccttggctgg cctcaaactc ataaagatca agatcggcct gcctctacct
1741 ccaaatgctc tggttaaagg gatgtgcctc catgcccagt tgaagtcatc ctgaaccacg
1801 agtccaggcc actcactctt tactaagatc tttactaagt atctgccata ggtcaggctc
1861 agtgctaaat gag
//
LOCUS YSTAKYAMT 563 bp ss-mRNA PLN 24-JUL-1990
DEFINITION Yeast sp. mitochondrial adenylate kinase (AKY2) gene, 5' end.
ACCESSION M35511
KEYWORDS adenylate kinase.
SOURCE Yeast sp. (wild type strain D273-10B, ATCC 25657), cDNA to mRNA.
ORGANISM Yeast sp.
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE 1 (bases 1 to 563)
AUTHORS Oechsner,U., Magdolen,V., Zoglowek,C., Haecker,U. and Bandlow,W.
TITLE Yeast adenylate kinase is transcribed constitutively from a
promoter in the short intergenic region to the histone H2A-1 gene
JOURNAL FEBS Lett. 242, 187-193 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 561 > 563 adenylate kinase (AKY2)
BASE COUNT 153 a 100 c 98 g 212 t
ORIGIN
1 taagatcggt tctggtattt taaagaaggc ggaaggaact aaactgcttc tccattgtaa
61 ctatattaag tgtctattag acctttctgt ttttcctttt tatttttcta tatataattt
121 attgggtata taattaacgg taacatatgt catgcatgat ataaattttt tgttgtgtct
181 tcctgttttc tctgttgctc tgtttaggtt cattgggcac tgttgcttac ttgaacatga
241 ttgagtagca aacacattct gtgctttgtt cactttgata gtgtgacgaa attttcacaa
301 ccaaaggaaa aaagctcacg attgcgcgat cctgtcacgg agaactgtcc gcagcagccc
361 gcggcttctg tccttattcg cccatttttt tttgattttc gactttttca ctctggctag
421 ttttattacg catatttgta tatatatata cgcataaatt tctgaaatgg attacctggt
481 ttatcagaaa cgagagcttt aataaagaaa cattaacgtt tctctggtaa agtcaccaca
541 cagcatcaaa tataacagta atg
//
LOCUS MLMPOLA 1680 bp ss-RNA VRL 24-JUL-1990
DEFINITION Moloney murine leukemia virus pol gene, partial cds.
ACCESSION M32803
KEYWORDS .
SOURCE Moloney murine leukemia virus RNA, clone 3-1R.
ORGANISM Moloney murine leukemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE 1 (bases 1 to 1680)
AUTHORS Martinelli,S.C. and Goff,S.P.
TITLE Rapid reversion of a deletion mutation in Moloney murine leukemia
virus by recombination with a closely related endogenous provirus
JOURNAL Virology 174, 135-144 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 684 pol protein (AA at 1)
688 > 1680 pol protein
site 685 687 pol protein in-frame stop codon
BASE COUNT 469 a 467 c 435 g 309 t
ORIGIN
1 gcttttcccc tcgagcgccc agactgggat tacaccaccc aggcaggtag gaaccaccta
61 gtccactatc gccagttgct cctagcgggt ctccaaaacg cgggcagaag ccccaccaat
121 ttggccaagg taaaagggat aacacaggga cctaatgagt ctccctcagc ctttttagag
181 agactcaagg aggcctatcg caggtacact ccttatgacc ctgaggaccc agggcaagaa
241 accaatgtgt ctatgtcatt catctggcag tctgccccgg atatcgggcg aaagttaggg
301 cggttagaag atttaaagag caagacctta ggagacttag ttagggaggc tgaaaagatc
361 tttaataaac gtgaaacccc ggaagaaaga gaggaacgta tcaggagaga aacagaggaa
421 aaagaagaac gccgtaggac agtggatgag cagaaagaga aagaaaggga ccgcagaaga
481 catagagaga tgagcaagct cttggccact gtagttattg gtcaggaaca ggatagacag
541 gagggagagc ggaagaggcc ccaacttgat aaggatcaat gcgcctactg caaagaaaag
601 gggcactggg ctaaagactg tcccaagaag ccacgagggc cccgaggacc caggccccag
661 acctccctcc tgaccttagg tgactaggga ggtcagggtc aggacccccc ccctgaaccc
721 aggataaccc tcaaagtcgg ggggcaaccc gtcaccttcc tggtagatac tggggcccaa
781 cactccgtgc tgacccaaaa tcctggaccc ctaagtgaca agtctgcctg ggtccaaggg
841 gctactggag gaaagcggta tcgctggacc acggatcgca aagtgcatct agctaccggt
901 aaggtcaccc actctttcct ccatgtacca gactgcccct atcctctgct aggaagagat
961 ttgctgacta aactaaaagc ccaaatccac tttgagggat caggagctca ggttgtggga
1021 ccaatgggac agcccctgca agtgctgacc ctaaacatag aagatgagta tcggctacat
1081 gagacctcaa aagagccgga tgtttctcta gggttcacat ggctttctga ttttcctcag
1141 gcctgggcgg aatccggggg catgggactg gcagttcgcc aagctcctct gatcatacct
1201 ctgaaggcaa cctctacccc cgtgtccata aaacaatacc ccatgtcaca agaagccaga
1261 ctggggatca agccccacat acagagactg ttggaccagg gaatactggt accctgccag
1321 tccccctgga acacgcccct gctacccgtt aagaaaccag ggactaatga ttacaggcct
1381 gtccaagatc tgagagaagt caacaagcgg gtggaagaca tccaccccac cgtgcccaac
1441 ccttacaacc tcttgagcgg gctcccaccg tcccaccagt ggtacactgt gcttgactta
1501 aaggatgcct ttttctgcct gagactccac cccaccagtc agcctctctt cgcctttgag
1561 tggagagacc cagagatggg aatctcagga caattaacct ggaccagact cccacagggt
1621 ttcaaaaaca gtcccaccct gtttgatgag gcactgcaca gagacctagc agacttccgg
//
LOCUS RHIISRA 1350 bp ds-DNA BCT 24-JUL-1990
DEFINITION Plasmid RP4 of Rhizobium transposon Tn1 transposable element ISR1
open reading frames A, A1, A2 and A3.
ACCESSION X06616
KEYWORDS transposable element.
SOURCE Plasmid RP4 transposon Tn 1 of Rhizobium sp. DNA.
ORGANISM Rhizobium sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 1350)
AUTHORS Priefer,U.B., Kalinowski,J., Rueger,B., Heumann,W. and Puehler,A.
TITLE ISR1, a transposable DNA sequence resident in Rhizobium class IV
strains, shows structural characteristics of classical insertion
elements
JOURNAL Plasmid 21, 120-128 (1989)
STANDARD simple staff_review
FEATURES from to/span description
ORF 94 360 ORF A3
ORF 433 714 ORF A1
ORF 787 1272 ORF A2
ORF 900 64 (c) ORF A
BASE COUNT 258 a 415 c 470 g 207 t
ORIGIN
1 acggaaattg ctcatcagct cagtattgcc tgacgtgacc ccctgaaact cctccaggaa
61 tagctagagt ccgcccaacg aaggacggac agaatgaagc gatcgaggtt cacggaagag
121 cagatcatcg ggatcctgcg ggagcaggag gccggtgtag cgacggcgga ggtgtgccga
181 cgccacgggg tcagctcggc gaccttttac aaatggaagg ccaagtttgg cgggctcgac
241 gtgtcggagg ctcggcggct caaggcgctc gaggatgaga acgcccggct gaagcggatg
301 ctggcggacg cgatgctgga caacgtggcc ctgaaggacc tgctgggaaa aaagtggtga
361 cgcccgccgg ctatcgcgag gcggctggcc atctgcaggc cgcctacgag atgagcgaaa
421 ggcgggcgtg ccgtgttctg ggcgtcgatc ggacgagcgt gcgctatcag gcgacgcgcc
481 cggacgacgc gctctgcgcg accggctgaa ggccctggcc aggaacgtcg acggttcggc
541 tatcgtcgcc tgcacgtgct gctgcggcgc gagggccatg cggtcaacag gaaacgggtc
601 cagcggatct atcgcgacga gcaactgacg gtgcgccggc gcgcgccgca agcgagcgat
661 gggcacgcgg cgaccgtgac ctgccgctgg cgcccaacca gcgctggtcg ctgacttcgt
721 cgcagaccag ctgacggacg ggcggcgctt ccgcatcctg acggtgatcg ataactgtac
781 gcgcgagtgc ttggcgctgg tggccgacac ctcgctgtcg ggcgcgcggg tcgtgcggga
841 gttggacgcc gtcatccggc agcgagggcg gcccgacacc atcgtcagcg acaacggcac
901 ggagtacacc tcgaacgcgg tcctggcctg ggcgacgaca ccggcgtcgg ctggcactac
961 atcgcgccgg gcaagcccca gcagaacggc ttcaacgaaa gcttcaacgg gcggctgcgt
1021 gacgagctgc tgaacgagac gctgttccgc tcgctgccgc cgctcgcgcc gtgcggaggc
1081 gtggcgacgc gattacaacg agcagcgtcc gcactcgaag ctcgggtggc tgacgccgca
1141 ggcttatgca gaagcgctca ccgcacacaa cggccggcct gctgcgtggt tgatggctgc
1201 gcagaccggc ctcttgccaa cccagccgat cacagctcag atcaacctag gactctcgtt
1261 atggctggat gagaaacggg ggtcacgtca tgcccgctcc acggtttata aaattcttga
1321 agacgaaagg gcctcgtgat acgcctattt
//
LOCUS HAMACAPT 1837 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA,
partial cds.
ACCESSION J05590 M22755
KEYWORDS N-acetylglucosamine-1-phosphate transferase; glycosyl transferase;
tunicamycin-resistance.
SOURCE Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone
TRG-[4,22].
ORGANISM Cricetulus longicaudatus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 889 to 960)
AUTHORS Lehrman,M.A., Zhu,X. and Khounlo,S.
TITLE Amplification and molecular cloning of the hamster tunicamycin-
sensitive N-acetylglucosamine-1-phosphate transferase gene: The
hamster and yeast enzymes share a common peptide sequence
JOURNAL J. Biol. Chem. 263, 19796-19803 (1988)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1837)
AUTHORS Zhu,X. and Lehrman,M.A.
TITLE Cloning, sequence and expression of a cDNA encoding hamster
UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate
transferase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and sequence for [1] kindly submitted by M. Lehrman,
04-APR-1989. Draft entry and computer-readable sequence for [1]
kindly submitted by M.Lehrman, 12-JUN-1990.
FEATURES from to/span description
pept 148 1374 N-acetylglucosamine-1-phosphate transferase
(GPT)
binding 67 79 dolichol binding site
binding 222 234 dolichol binding site
BASE COUNT 363 a 515 c 418 g 541 t
ORIGIN
1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg
61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg
121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg
181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc
241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg
301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc
361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc
421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg
481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct
541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta
601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc
661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat
721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta
781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt
841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga
901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc
961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct
1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg
1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc
1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag
1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt
1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc
1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc
1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca
1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt
1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt
1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc
1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat
1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag
1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc
1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt
//
LOCUS HS4NA3ABC 10089 bp ds-DNA VRL 24-JUL-1990
DEFINITION Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C
genes, complete cds.
ACCESSION M34440
KEYWORDS nuclear antigen.
SOURCE Epstein-Barr virus type 2 (isolate AG876) DNA.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 10089)
AUTHORS Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E.,
Rickinson,A. and Kieff,E.
TITLE Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their
EBNA-3A, EBNA-3B and EBNA-3C genes
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.T.Sample, 23-MAY-1990.
Kieff Laboratory
Harvard Medical School
75 Francis Street
Boston, MA 02115
FEATURES from to/span description
pept 189 524 nuclear antigen EBNA-3A, exon 1
610 3051 nuclear antigen EBNA-3A, exon 2
pept 3237 3593 nuclear antigen EBNA-3B, exon 1
3672 6155 nuclear antigen EBNA-3B, exon 2
pept 6277 6636 nuclear antigen EBNA-3C, exon 1
6711 9560 nuclear antigen EBNA-3C, exon 2
IVS 525 609 EBNA-3A intron
IVS 3594 3671 EBNA-3B intron
IVS 6637 6710 EBNA-3C intron
BASE COUNT 2495 a 2837 c 2619 g 2138 t
ORIGIN
1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct
61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag
121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg
181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag
241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca
301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc
361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac
421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc
481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg
541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt
601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat
661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg
721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg
781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact
841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg
901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg
961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc
1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct
1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac
1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat
1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg
1261 tacgggcaag acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca
1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg
1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg
1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct
1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa
1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac
1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc
1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc
1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg
1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg
1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa
1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca
1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca
2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag
2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc
2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac
2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg
2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa
2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg
2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg
2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc
2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg
2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg
2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa
2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg
2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct
2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat
2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt
2941 ggccagaatg tcacagacca tgaacctaga agggtggtgg tgtcagctat tgttcacatg
3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg
3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat
3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc
3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga
3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg
3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac
3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag
3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca
3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc
3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag
3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc
3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga
3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac
3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct
3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata
3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa
3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag
4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt
4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga
4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata
4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag
4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct
4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg
4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa
4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg
4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca
4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag
4621 ccagaacaga gcaaccaaga gccaagcctg attcccctgc ccccacagtg gtccttcggc
4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg
4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca
4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca
4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga
4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt
4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac
5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc
5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac
5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct
5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc
5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc
5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc
5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc
5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc
5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc
5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag
5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta
5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg
5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac
5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac
5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc
5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct
6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt
6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt
6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg
6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta
6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg
6301 gactctatac agtcacctga caatgcgcgg ggagatgatg tacagaatac tggtgagcat
6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac
6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc
6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta
6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca
6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc
6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc
6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct
6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg
6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat
6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca
6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg
7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag
7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga
7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc
7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt
7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta
7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc
7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg
7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag
7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc
7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac
7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag
7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg
7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag
7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt
7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga
7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta
7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca agtaagccac
8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc
8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc
8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc
8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc
8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc
8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg
8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac
8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg
8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg
8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata
8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc
8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact
8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc
8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt
8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag
8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc
9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac
9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc
9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc
9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac
9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg
9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag
9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa
9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg
9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca
9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga
9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta
9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca
9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg
9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg
9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg
9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag
9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc
10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt
10081 ggaggatcc
//
LOCUS HUMPAPB 3061 bp ss-mRNA PRI 24-JUL-1990
DEFINITION Human prostatic acid phosphatase mRNA, complete cds.
ACCESSION M34840
KEYWORDS glycoprotein; prostatic acid phosphatase; secretory protein.
SOURCE Human prostate, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1199)
AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
Moore,D.L.
TITLE Covalent structure, disulfide bonding and identification of
reactive surface and active site residues of human prostatic acid
phosphatase
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3061)
AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
Moore,D.L.
TITLE Covalent structure, disulfide bonding and identification of
reactive surface and active site residues of human prostatic acid
phosphatase
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.L.Van Etten, 08-JUN-1990.
Chemistry Department
Purdue University
West Lafayette, IN 47907
FEATURES from to/span description
pept 15 1175 prostatic acid phosphatase precursor (EC
3.1.3.2) /hgml_locus_uid="LA0168F"
/nomgen="ACPP" /map="3q21-qter"
sigp 15 110 prostatic acid phosphatase signal peptide
matp 111 1172 prostatic acid phosphatase
BASE COUNT 917 a 642 c 681 g 821 t
ORIGIN Chromosome 3q21-qter.
1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc
61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga
121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg
181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg
241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt
301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg
361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc
421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc
481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg
541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag
601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg
661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga
721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat
781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc
841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc
901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga
961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc
1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt
1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa
1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag
1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc
1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca
1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt
1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa
1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt
1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt
1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga
1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc
1681 aaggagaggc aaagaaagga gatacagtgg agacatctgg aaagttttct ccactggaaa
1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt
1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat
1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata
1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga
1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag
2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag
2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt
2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta
2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca
2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa
2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc
2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac
2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa
2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat
2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac
2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc
2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc
2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat
2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc
2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat
2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta
3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa
3061 a
//
LOCUS LDV3ECPA 1064 bp ss-RNA VRL 24-JUL-1990
DEFINITION Lactate dehydrogenase-elevating virus capsid protein gene, complete
cds.
ACCESSION M34856
KEYWORDS viral capsid protein.
SOURCE Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones
A16, B24, B63, B90, B103, C44, dt4.
ORGANISM Lactate dehydrogenase-elevating virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Togaviridae.
REFERENCE 1 (bases 1 to 1064)
AUTHORS Godeny,E.K., Speicher,D.W. and Brinton,M.A.
TITLE Map location of lactate dehydrogenase-elevating virus (LDV) capsid
protein (Vp1) gene
JOURNAL Virology (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.K.Godeny, 06-JUN-1990.
FEATURES from to/span description
pept 637 984 capsid protein
pept 135 650 unidentified ORF
BASE COUNT 267 a 223 c 265 g 309 t
ORIGIN 3' terminus region.
1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc
61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc
121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt
181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg
241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt
301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg
361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc
421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg
481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta
541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga
601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc
661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat
721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc
781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga
841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt
901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat
961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg
1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt
//
LOCUS MBIFRHADGB 3903 bp ds-DNA BCT 24-JUL-1990
DEFINITION M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase
(frhADGB) gene, complete cds.
ACCESSION J02914
KEYWORDS 8-hydroxy-5-deazaflavin-reducing hydrogenase.
SOURCE M.thermoautotrophicum (strain delta-H) DNA.
ORGANISM Methanobacterium thermoautotrophicum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanobacteriaceae.
REFERENCE 1 (bases 1 to 3903)
AUTHORS Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T.
TITLE Cloning, sequence determination, and expression of the genes
encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin
reducing hydrogenase from Methanobacterium thermoautotrophicum
delta-H
JOURNAL Biochemistry (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.Alex, 25-MAY-1990.
FEATURES from to/span description
pept 298 1515 F420 hydrogenase alpha-subunit (frhA)
pept 1524 2000 F420 hydrogenase delta-subunit (frhD)
pept 2114 2824 F420 hydrogenase gamma-subuint (frhG)
pept 2837 3682 F420 hydrogenase beta-subunit (frhB)
binding 284 293 ribosomal binding site (put.)
binding 1514 1520 ribosomal binding site (put.)
binding 2100 2106 ribosomal binding site (put.)
binding 2823 2832 ribosomal binding site (put.)
BASE COUNT 1201 a 878 c 965 g 859 t
ORIGIN
1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta
61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac
121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt
181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt
241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg
301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg
361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg
421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg
481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc
541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac
601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac
661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt
721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac
781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa
841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc
901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa
961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata
1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca
1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt
1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg
1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta
1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag
1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca
1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc
1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa
1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg
1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt
1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg actggcggtc
1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg
1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg
1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa
1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa
1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg
1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc
2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga
2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag
2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact
2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga
2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga
2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac
2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt
2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc
2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc
2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt
2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct
2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg
2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct
2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac
2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata
2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg
3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa
3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga
3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta
3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat
3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat
3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga
3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg
3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg
3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga
3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga
3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat
3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt
3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa
3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt
3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat
3901 cga
//
LOCUS MUSLRCNA 1751 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse loricrin gene, complete cds.
ACCESSION M34398
KEYWORDS cell envelope protein; loricrin.
SOURCE Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1751)
AUTHORS Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D.,
Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M.,
Yuspa,S.H. and Roop,D.R.
TITLE Identification of a major keratinocyte cell envelope protein,
loricrin
JOURNAL Cell 61, 1103-1112 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Rothnagel, 22-MAY-1990.
FEATURES from to/span description
pept 71 1516 loricrin
signal 1740 1745 polyA signal
BASE COUNT 223 a 493 c 706 g 329 t
ORIGIN
1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc
61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg
121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg
181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg
241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg
301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg
361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg
421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg
481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg
541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg
601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg
661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc
721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg
781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg
841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg
901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg
961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg
1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg
1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc
1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg
1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag
1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg
1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg
1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg
1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac
1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc
1561 cgtgggcgcc gatgggctta actttctcat gaatttgcct gaggtttcca aacccttcac
1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct
1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca
1741 ataaatttgc t
//
LOCUS PFAGIPA 1906 bp ds-DNA INV 24-JUL-1990
DEFINITION P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete
cds.
ACCESSION J05544
KEYWORDS glucosephosphate isomerase.
SOURCE P.falciparum (strain 7G8) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 1906)
AUTHORS Kaslow,D.C. and Hill,S.
TITLE Cloning metabolic pathway genes by complementation in E.coli:
Isolation and expression of Plasmodium falciparum glucosephosphate
isomerase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.C.Kaslow, 04-JUN-1990.
FEATURES from to/span description
pept 28 1803 glucosephosphate isomerase (EC 5.3.1.9)
BASE COUNT 782 a 174 c 289 g 661 t
ORIGIN Chromosome 14.
1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat
61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat
121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct
181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa
241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc
301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata
361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat
421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg
481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa
541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat
601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat
661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct
721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt
781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg
841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta
901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga
961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat
1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat
1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca
1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa
1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat
1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa
1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg
1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa
1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt
1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta
1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt
1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat
1681 gacacaagaa atcaaaagaa atcagataat acctataatt ttaatgaatc tacaaaaaat
1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag
1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt
1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt
//
LOCUS RATGADA 3198 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Rat glutamic acid decarboxylase (GAD) mRNA, complete cds.
ACCESSION M34445
KEYWORDS glutamic acid decarboxylase.
SOURCE Rat brain and hypothalamus, cDNA to mRNA, (libraries of
D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and
pCR22HP.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3198)
AUTHORS Wyborski,R.J., Bond,R.W. and Gottlieb,D.I.
TITLE Characterization of a cDNA coding for rat glutamic acid
decarboxylase
JOURNAL Mol. Brain Res. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.I.Gottlieb, 23-MAY-1990.
FEATURES from to/span description
pept 184 1965 glutamic acid decarboxylase
mRNA < 1 3198 glutamic acid decarboxylase mRNA
signal 3174 3179 polyA signal
BASE COUNT 869 a 742 c 767 g 820 t
ORIGIN
1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca
61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc
121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag
181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat
241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc
301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag
361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa
421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct
481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg
541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac
601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac
661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt
721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta
781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc
841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca
901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc
961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc
1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg
1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt
1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat
1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca
1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg
1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc
1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa
1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag
1501 cagtatgacg tctcctacga tacaggggac aaggcgattc agtgtggccg ccatgtggac
1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc
1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt
1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa
1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc
1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg
1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc
1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc
1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca
2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg
2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa
2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat
2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc
2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca
2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag
2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta
2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca
2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac
2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag
2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt
2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta
2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt
2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt
2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc
2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa
3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta
3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt
3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa
3181 taaaaaatat tctccatg
//
LOCUS RATTSHRA 5270 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Rat thyroid stimulating hormone receptor mRNA, complete cds.
ACCESSION M34842
KEYWORDS TSH receptor; thyroid stimulating hormone receptor;
transmembrane protein.
SOURCE Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to
mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5270)
AUTHORS Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C.,
McBride,O.W. and Kohn,L.D.
TITLE Cloning, chromosomal assignment and regulation of the rat
thyrotropin receptor by thyrotropin, agents which increase cAMP
levels and thyroid autoantibodies
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.D.Kohn, 05-JUN-1990.
FEATURES from to/span description
pept 55 2349 thyroid stimulating hormone receptor precursor
sigp 55 117 thyroid stimulating hormone receptor signal
peptide
matp 118 2346 thyroid stimulating hormone receptor
signal 5256 5261 poly-A signal
BASE COUNT 1415 a 1343 c 1176 g 1336 t
ORIGIN
1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg
61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga
121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag
181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact
241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat
301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa
361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta
421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct
481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct
541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg
601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg
661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga
721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct
781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag
841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac
901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat
961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc
1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag
1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag
1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac
1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca
1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg
1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc
1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc
1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac
1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt
1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc
1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg
1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc
1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac
1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg
1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt
1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc
1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg
2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc
2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag
2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa
2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt
2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca
2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa
2401 catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct
2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc
2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac
2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg
2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag
2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga
2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat
2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc
2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat
2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt
3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta
3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat
3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac
3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga
3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt
3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa
3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct
3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt
3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt
3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata
3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt
3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac
3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat
3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga
3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat
3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag
3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc
4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc
4081 tgtgaagtag ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg
4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa
4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga
4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag
4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc
4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg
4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc
4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct
4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg
4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca
4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa
4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta
4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga
4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg
4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc
4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg
5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca
5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt
5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg
5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg
//
LOCUS STMBLIPA 840 bp ds-DNA BCT 24-JUL-1990
DEFINITION S.clavuligerus beta-lactamase inhibitory protein gene, complete
cds.
ACCESSION M34538
KEYWORDS beta-lactamase inhibitory protein.
SOURCE S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP.
ORGANISM Streptomyces clavuligerus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 840)
AUTHORS Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E.
TITLE Isolation and characterization of a beta-lactamase inhibitory
protein from Streptomyces clavuligerus and cloning and analysis of
corresponding gene
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.E.Jensen, 25-MAY-1990.
Dept. of Microbiology
University of Alberta
Edmonton, Alberta Canada T6G 2E9
FEATURES from to/span description
pept 67 672 beta-lactamase inhibitory protein precursor
(bliP)
sigp 67 174 beta-lactamase inhibitory protein signal
peptide
matp 175 669 beta-lactamase inhibitory protein
binding 52 63 ribosomal binding site
signal 686 724 transcription termination signal
BASE COUNT 140 a 248 c 320 g 132 t
ORIGIN
1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga
61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc
121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg
181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc
241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc
301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc
361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg
421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg
481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg
541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc
601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg
661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc
721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg
781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc
//
LOCUS TRBPARPA 763 bp ss-mRNA INV 24-JUL-1990
DEFINITION T.brucei procyclin (pPS-1) mRNA, complete cds.
ACCESSION M34790
KEYWORDS procyclic acidic repetitive protein; procyclin.
SOURCE T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone
pPS-1.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 763)
AUTHORS Dorn,P.L., Aman,R.A. and Boothroyd,J.C.
TITLE Inhibition of protein synthesis results in super-induction of
procyclin RNA
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Boothroyd, 04-JUN-1990.
Stanford University School of Medicine
Dept of Microbiology and Immunology
Sherman Fairchild Science Building
Stanford, CA 94305-5402 USA
FEATURES from to/span description
pept 29 466 procyclin
mRNA < 1 763 procyclin mRNA
rpt 215 393 12 bp degenerate repeats
BASE COUNT 192 a 215 c 156 g 200 t
ORIGIN
1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct
61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc
121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg
181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga
241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga
301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga
361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc
421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt
481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc
541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt
601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc
661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag
721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt
//
LOCUS W10PPIBCA 4761 bp ds-DNA BCT 24-JUL-1990
DEFINITION Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes,
complete cds, and ippD gene, 3' end.
ACCESSION M34849
KEYWORDS antigen.
SOURCE Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA,
clones HB101/pHS[4108,5357].
ORGANISM Plasmid pWR100
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 4761)
AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J.
TITLE Nucleotide sequence of the invasion plasmid antigen B and C genes
(ipaB and ipaC) of Shigella flexneri
JOURNAL Microb. Pathog. 4, 345-357 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 538 and 3987 to 4761)
AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J.
JOURNAL Unpublished (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Baudry, 05-JUN-1990.
Center for Vaccine Development
University of Maryland at Baltimore
10 South Pine Street
Baltimore, MD 21201
FEATURES from to/span description
pept 606 1073 polypeptide I (ippI)
pept 1079 2821 polypeptide B (ipaB)
pept 2841 3932 polypeptide C (ipaC)
pept 3983 > 4761 ipaD protein
pept < 1 549 unidentified ORF
binding 595 600 ippI ribosomal binding site
binding 1065 1070 ipaB ribosomal binding site
binding 2829 2834 ipaC ribosomal binding site
binding 3971 3977 ipaD ribosomal binding site
signal 556 561 ippI -10 region
signal 2723 2728 ipaC -35 region
signal 2745 2750 ipaC -10 region
BASE COUNT 1769 a 952 c 805 g 1235 t
ORIGIN
1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat
61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat
121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct
181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca
241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat
301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc
361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg
421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca
481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat
541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag
601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa
661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca
721 tttattcata tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct
781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta
841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat
901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag
961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat
1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat
1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga
1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac
1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa
1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct
1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa
1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga
1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat
1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat
1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga
1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca
1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat
1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa
1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg
1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc
1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga
1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt
2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt
2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca
2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc
2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc
2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac
2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac
2401 cctcacagac cttataccaa agtttctcaa gaatttttct tctcaactgg acgatttaat
2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc
2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc
2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa
2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga
2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc
2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg
2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata
2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc
2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat
3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg
3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag
3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag
3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga
3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc
3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa
3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc
3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa
3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc
3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt
3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc
3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca
3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca
3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc
3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag
3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg
3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc
4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa
4081 aacaacgacc agttctcatc ctgtaagttc ccttactatg ctcaacgaca cccttcataa
4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac
4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc
4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt
4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga
4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata
4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag
4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa
4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc
4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac
4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat
4741 agacaatatg ttaaaaagct t
//
LOCUS YEPOMPHA 824 bp ss-mRNA BCT 24-JUL-1990
DEFINITION Y.enterocolitica cationic outer membrane protein (ompH) gene,
complete cds.
ACCESSION M34854
KEYWORDS cationic outer membrane protein;
enterobacterial outer membrane protein;
lipopolysaccharide binding protein; secretory protein;
structural membrane protein.
SOURCE Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 824)
AUTHORS Hirvas,L., Koski,P. and Vaara,M.
TITLE The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and
comparison to known ompH sequences
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.H.Hirvas, 06-JUN-1990.
University of Helsinki
FEATURES from to/span description
pept 324 818 ompH protein precursor (gtg start codon)
sigp 324 392 ompH protein signal peptide
matp 393 815 ompH protein
binding 308 317 ompH ribosomal binding site
signal 120 125 ompH -35 region
signal 143 148 ompH -10 region
BASE COUNT 257 a 147 c 198 g 221 t 1 others
ORIGIN
1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg
61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg
121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt
181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt
241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa
301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag
361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa
421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag
481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc
541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga
601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc
661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta
721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta
781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac
//
LOCUS YSCEXG1A 2056 bp ds-DNA PLN 24-JUL-1990
DEFINITION S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase
(EXG1) gene, complete cds.
ACCESSION M34341
KEYWORDS 1,3-glucan glucanohydrolase; exo-1,3-B-glucanase.
SOURCE S.cerevisiae (strain AB320) DNA, clone YEp13.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2056)
AUTHORS Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A.,
Nebreda,A.R., Mendez,E. and del Rey,F.
TITLE Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1,
in the yeast Saccharomyces cerevisiae
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.del Rey, 17-MAY-1990.
Instituto de Microbiologia bioquimica
Facultad de Biologia
CSIC/Universidad de Salamanca
37008 Salamanca, SPAIN
FEATURES from to/span description
pept 363 1709 exo-1,3-B-glucanase/1,3-B-D-glucan
glucanohydrolase precursor (EXG1, EC 3.2.1.58)
sigp 363 482 exo-1,3-B-glucanase/1,3-B-D-glucan
glucanohydrolase signal peptide
matp 483 1706 exo-1,3-B-glucanase/1,3-B-D-glucan
glucanohydrolase
mRNA 265 > 1709 EXG1 mRNA
signal 153 159 TATA box
BASE COUNT 610 a 395 c 396 g 655 t
ORIGIN Chromosome XII-right arm.
1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg
61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac
121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa
181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat
241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt
301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta
361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta
421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa
481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg
541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg
601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg
661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg
721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc
781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca
841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt
901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca
961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat
1021 acttggacac tgttattggt atcgaattga ttaatgagcc attgggtcct gttctagaca
1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca
1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact
1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct
1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg
1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga
1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg
1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt
1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct
1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg
1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc
1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat
1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt
1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa
1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta
1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat
1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt
2041 gagcttttcc gccggc
//
LOCUS HUMPSG6A 1409 bp ss-mRNA PRI 24-JUL-1990
DEFINITION Human pregnancy-specific glycoprotein-6 (PSG6) mRNA,
complete cds.
ACCESSION M33666
KEYWORDS pregnancy-specific glycoprotein.
SOURCE Human fetal liver first trimester, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1409)
AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A.
TITLE cDNA cloning demonstrates the expression of pregnancy-specific
glycoprotein genes, a subgroup of the carcinoembryonic antigen gene
family, in fetal liver
JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 53 1360 pregnancy-specific glycoprotein-6
BASE COUNT 395 a 403 c 294 g 317 t
ORIGIN
1 gggcgggcct aggctcatct ccacagggga gaacacacag acagcagaga ccatgggacc
61 cctctcagcc cctccctgca ctcagcacat cacctggaag gggctcctgc tcacagcatc
121 acttttaaac ttctggaacc tgcccaccac tgcccaagta ataattgaag ccaagccacc
181 caaagtttcc gaggggaagg atgttcttct acttgtccac aatttgcccc agaatcttac
241 tggctacatc tggtacaaag ggcaaatgac ggacctctac cattacatta catcatatgt
301 agtacacggt caaattatat atgggcctgc ctacagtgga cgagaaacag tatattccaa
361 tgcatccctg ctgatccaga atgtcacaca ggaggatgca ggatcctaca ccttacacat
421 cataaagcga ggcgatggga ctggaggagt aactggatat ttcactgtca ccttatactc
481 ggagactccc aagccctcca tctccagcag caacttaaac cccagggagg tcatggaggc
541 tgtgcgctta atctgtgatc ctgagactcc ggatgcaagc tacctgtggt tgctgaatgg
601 tcagaacctc cctatgactc acaggttgca gctgtccaaa accaacagga ccctctatct
661 atttggtgtc acaaagtata ttgcaggacc ctatgaatgt gaaatacgga acccagtgag
721 tgccagccgc agtgacccag tcaccctgaa tctcctcccg aagctgccca tgccttacat
781 caccatcaac aacttaaacc ccagggagaa gaaggatgtg ttagccttca cctgtgaacc
841 taagagtcgg aactacacct acatttggtg gctaaatggt cagagcctcc cggtcagtcc
901 gagggtaaag cgacccattg aaaacaggat actcattcta cccagtgtca cgagaaatga
961 aacaggaccc tatcaatgtg aaatacggga ccgatatggt ggcatccgca gtaacccagt
1021 caccctgaat gtcctctatg gtccagacct ccccagaatt tacccttcat tcacctatta
1081 ccgttcagga gaaaacctcg acttgtcctg ctttgcggac tctaacccac cggcagagta
1141 ttcttggaca attaatggga agtttcagct atcaggacaa aagctcttta tcccccaaat
1201 tactacaaat catagcgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga
1261 aatctccaaa tccatgatag tcaaagtctc tgagacagca tctccccagg ttacctatgc
1321 tggtccaaac acctggtttc aagaaatcct tctgctgtga cctcccaaag tgctaggatt
1381 aaaacatgac ccaccatgaa acccgccca
//
LOCUS HUMPSGA 1930 bp ss-mRNA PRI 24-JUL-1990
DEFINITION Human pregnancy-specific glycoprotein-1a (PSG-1a) mRNA,
complete cds.
ACCESSION M33664
KEYWORDS pregnancy-specific glycoprotein.
SOURCE Human fetal liver first trimester, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1930)
AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A.
TITLE cDNA cloning demonstrates the expression of pregnancy-specific
glycoprotein genes, a subgroup of the carcinoembryonic antigen gene
family, in fetal liver
JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 1260 pregnancy-specific glycoprotein-1a
BASE COUNT 560 a 520 c 390 g 460 t
ORIGIN
1 atgggaaccc tctcagcccc tccctgcaca cagcgcatca aatggaaggg gctcctgctc
61 acagcatcac ttttaaactt ctggaacctg cccaccactg cccaagtcac gattgaagcc
121 cagccaccca aagtttccga ggggaaggat gttcttctac ttgtccacaa tttgccccag
181 aatcttaccg gctacatctg gtacaaaggg caaatgaggg acctctacca ttacattaca
241 tcatatgtag tagacggtga aataattata tatgggcctg catatagtgg acgagaaaca
301 gcatattcca atgcatccct gctgatccag aatgtcaccc gggaggacgc aggatcctac
361 accttacaca tcataaaggg agatgatggg actagaggag taactggacg tttcaccttc
421 accttacacc tggagactcc taagccctcc atctccagca gcaacttaaa tcccagggag
481 accatggagg ctgtgagctt aacctgtgac cctgagactc cagacgcaag ctacctgtgg
541 tggatgaatg gtcagagcct ccctatgact cacagcttga agctgtccga aaccaacagg
601 accctctttc tattgggtgt cacaaagtat actgcaggac cctatgaatg tgaaatacgg
661 aacccagtga gtgccagccg cagtgaccca gtcaccctga atctcctccc gaagctgccc
721 aagccctaca tcaccatcaa caacttaaac cccagggaga ataaggatgt cttaaacttc
781 acctgtgaac ctaagagtga gaactacacc tacatttggt ggctaaatgg tcagagcctc
841 ccggtcagtc ccagggtaaa gcgacccatt gaaaacagga tcctcattct acccagtgtc
901 acgagaaatg aaacaggacc ctatcaatgt gaaatacggg accgatatgg tggcatccgc
961 agtgacccag tcaccctgaa tgtcctctat ggtccagacc tccccagaat ttacccttca
1021 ttcacctatt accgttcagg agaagtcctc tacttgtcct gttctgcgga ctctaaccca
1081 ccggcacagt attcttggac aattaatgaa aagtttcagc taccaggaca aaagctcttt
1141 atccgccata ttactacaaa gcatagcggg ctctatgttt gctctgttcg taactcagcc
1201 actggcaagg aaagctccaa atccatgaca gtcgaagtct ctgactggac agttccctga
1261 attctactag tacctccaat tccattttct cccatggaat cactaagagc aagacccact
1321 ctgttccaga agccctataa gctggaggtg gacaactcaa tgtaaatttc atgggaaaac
1381 ccttgtacct gaagcatgag ccactcagaa ctcaccaaaa tattcgacac cataacaaca
1441 gatgctcaaa ctgtaaacca ggacaacaag tggatgactt cacactgtgg acagtttttc
1501 ccaagatgtc agaagaagac tccccatcat gatgaggctc tcccccctct taactgtcct
1561 tgctcatgcc tgcctctttc acttggcagg ataatgcagt cattagaatt tcacatgtag
1621 tagcttctga gagtaacaac agagtgtcag atatgtcatc tcaacctcaa acttttatgt
1681 aacatctcag ggggaaatgt ggctctctcc accttgcata cagggctccc aatagaaatc
1741 aacacagaga tattgcctgt gtgttttcag agaagatggt ttgtatgaag acgtaggaaa
1801 gctgaaatta taatagagtc ccctttaaat ccacattgtg tggatggctc ttgccgtttc
1861 ctaagagata cattgtaaaa cgtgacagta agacattcta gcagaataaa acatgtacca
1921 cctttgcccc
//
LOCUS HUMPSGC 1307 bp ss-mRNA PRI 24-JUL-1990
DEFINITION Human pregnancy-specific glycoprotein-4 (PSG4) mRNA, 3' end.
ACCESSION M33665
KEYWORDS pregnancy-specific glycoprotein.
SOURCE Human fetal liver first trimester, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1307)
AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A.
TITLE cDNA cloning demonstrates the expression of pregnancy-specific
glycoprotein genes, a subgroup of the carcinoembryonic antigen gene
family, in fetal liver
JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 639 pregnancy-specific glycoprotein-4 (AA at 1)
BASE COUNT 394 a 339 c 249 g 325 t
ORIGIN
1 gcggggtata ttgcaggacc ctatgaatgt gaaatacgga acccagtgag tgccagccgc
61 agtgacccag tcaccctgaa tctcctccca aagctgtcca agccctacat cacaatcaac
121 aacttaaacc ccagagagaa taaggatgtc ttaaccttca cctgtgaacc taagagtgag
181 aactacacct acatttggtg gctaaatggt cagagcctcc ctgtcagtcc cagggtaaag
241 cgacccattg aaaacaggat cctcattcta cccaatgtca cgagaaatga aacaggacct
301 tatcaatgtg aaatacggga ccgatatggt ggcatccgca gtgacccagt caccctgaat
361 gtcctctatg gtccagacct ccccagcatt tacccttcat tcacctatta ccgttcagga
421 gaaaacctct acttgtcctg cttcgccgag tctaacccac gggcacaata ttcttggaca
481 attaatggga agtttcagct atcaggacaa aagctctcta tcccccaaat aactacaaag
541 catagtgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga aagctccaaa
601 tccatcacag tcaaagtctc tgactggata ttaccctgaa ttctactagt tcctccaatt
661 ccattttctc ccatggaatc acgaagagca agacccactc tgttccagaa gccctataag
721 ctggaggtgg acaactcgat gtaaatttca tgggaaaacc cttgtacctg acatgtgagc
781 cactcagaac tcaccaaaat gttcgacacc ataacaacag ctactcaaac tgtaaaccag
841 gataagaagt tgatgacttc acactgtgga cagtttttcc aaagatgtca gaacaagact
901 ccccatcatg ataaggctcc cacccctctt aactgtcctt gctcatgcct gcctctttca
961 cttggcagga taatgcagtc attagaattt cacatgtagt agcttctgag ggtaacaaca
1021 gagtgtcaga tatgtcatct caacctcaaa cttttacgta acatctcagg cgaaatgtgg
1081 ctctctccat cttgcataca gggctcccaa tagaaatgaa cacagagata ttgcctgtgt
1141 gtttgcagag aagatggttt ctataaagag taggaaagct gaaattatag tagagtctcc
1201 tttaaatgca cattgtgtgg atggctctca ccatttccta agagatacag tgtaaaacgt
1261 gacagtaata ctgattctag cagaataaaa catgtaccac atttccc
//
LOCUS HUMPSGD 1699 bp ss-mRNA PRI 24-JUL-1990
DEFINITION Human pregnancy-specific glycoprotein-1d (PSG-1d) mRNA,
complete cds.
ACCESSION M33663
KEYWORDS pregnancy-specific glycoprotein.
SOURCE Human fetal liver first trimester, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1699)
AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A.
TITLE cDNA cloning demonstrates the expression of pregnancy-specific
glycoprotein genes, a subgroup of the carcinoembryonic antigen gene
family, in fetal liver
JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 64 1344 pregnancy-specific glycoprotein-1d
BASE COUNT 493 a 442 c 343 g 421 t
ORIGIN
1 ctcagagagt ttctggatcc taggcttatc tccacagagg agaacacaca agcagcagag
61 accatgggaa ccctctcagc ccctccctgc acacagcgca tcaaatggaa ggggctcctg
121 ctcacagcat cacttttaaa cttctggaac ctgcccacca ctgcccaagt cacgattgaa
181 gcccagccac ccaaagtttc cgaggggaag gatgttcttc tacttgtcca caatttgccc
241 cagaatctta ccggctacat ctggtacaaa gggcaaatga gggacctcta ccattacatt
301 acatcatatg tagtagacgg tgaaataatt atatatgggc ctgcatatag tggacgagaa
361 acagcatatt ccaatgcatc cctgctgatc cagaatgtca cccgggagga cgcaggatcc
421 tacaccttac acatcataaa gggagatgat gggactagag gagtaactgg acgtttcacc
481 ttcaccttac acctggagac tcctaagccc tccatctcca gcagcaactt aaatcccagg
541 gagaccatgg aggctgtgag cttaacctgt gaccctgaga ctccagacgc aagctacctg
601 tggtggatga atggtcagag cctccctatg actcacagct tgaagctgtc cgaaaccaac
661 aggaccctct ttctattggg tgtcacaaag tatactgcag gaccctatga atgtgaaata
721 cggaacccag tgagtgccag ccgcagtgac ccagtcaccc tgaatctcct cccgaagctg
781 cccaagccct acatcaccat caacaactta aaccccaggg agaataagga tgtcttaaac
841 ttcacctgtg aacctaagag tgagaactac acctacattt ggtggctaaa tggtcagagc
901 ctcccggtca gtcccagggt aaagcgaccc attgaaaaca ggatcctcat tctacccagt
961 gtcacgagaa atgaaacagg accctatcaa tgtgaaatac gggaccgata tggtggcatc
1021 cgcagtgacc cagtcaccct gaatgtcctc tatggtccag acctccccag aatttaccct
1081 tcattcacct attaccgttc aggagaagtc ctctacttgt cctgttctgc ggactctaac
1141 ccaccggcac agtattcttg gacaattaat gaaaagtttc agctaccagg acaaaagctc
1201 tttatccgcc atattactac aaagcatagc gggctctatg tttgctctgt tcgtaactca
1261 gccactggca aggaaagctc caaatccatg acagtcgaag tctctggtaa gtggatccca
1321 gcatcgttgg caatagggtt ttaggtggag tctatctggc attcagagaa gagtcaggaa
1381 aacaattgta ttcccagcct gtgtccctag ggcacaagca aatcccaaat tctcctcctg
1441 aaccctccaa atttgtctaa gaacttcgaa aactttaaca aacaggctga tatcttcata
1501 atattcccag cctagaccaa gcaggaagaa cattgatttc attgaaataa ttgataataa
1561 tgaagataat gtttttatga tttttatttg aaaatttgct aattctttaa atggtttgtt
1621 ttctacattg atggaatttt tctcttttaa tctatctaca gcttatagca gttcaataaa
1681 ctatacttct gggaaccgt
//
LOCUS RHMDCTAB 764 bp ds-DNA BCT 24-JUL-1990
DEFINITION R.meliloti dctA and dctB genes, 5' end.
ACCESSION M33555
KEYWORDS regulatory protein.
SOURCE R.meliloti DNA.
ORGANISM Rhizobium meliloti
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 764)
AUTHORS Wang,Y.-P., Birkenhead,K., Boesten,B., Manian,S. and O'Gara,F.
TITLE Genetic analysis and regulation of the Rhizobium meliloti genes
controlling C-4-dicarboxylic acid transport
JOURNAL Gene 85, 135-144 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 726 > 764 dctA protein
BASE COUNT 149 a 239 c 252 g 124 t
ORIGIN
1 cccgggccgc ttgctgaccg tgcccatggc gaaatgttcg gccatgccgt cggcggacgg
61 cgagccggaa ataatcgcgg aaggcatagt cgttgccgac gaagctcgtc ggctcctgcc
121 agttgctggc cgcgacggcg acgccgctgc ggtcgatcag ataaatgacc gcggcttcgg
181 cgcttgtcgc cagggcctcg agcttacggt tgatgcggtc gagcgacggc ctgtccggcg
241 aaagcaatgc gccacgaatg gctgcgtcgt cggcgaggac aagcggcagg gcgcgctgcc
301 gttccacgac tgctcgaaga agcgaggctt tcaggctggc gtcgatccgg ctctgaccgg
361 caaggccggc gagcgcctgc gaccggccgt agtctcgcgc aagaagaagg ccggctgcca
421 gaaggaccag cgcgactgcc gcgaaaacga gccaggaccg tcgggcccgg ctgcgaacgc
481 atgcgatcgc tcgcttctgc aggaagtttg accatgcgaa cattgtgcat gttttcgccc
541 aggacgccag cacttctgtg cggaaatccg cacatatcca cgaacggcaa gcgagcaccg
601 ctcccaaaaa tgtcatggat tcaatcgcat aggccttccg cggcaaactg gcacgcatgt
661 tgctgacaag ctccacaagg cagccactgc tgtcgatctt cggaaagcgg cccgggaggc
721 ccggcatgtt gccggactgg gcctgccacg tggaggatat catg
//
LOCUS MNKSVAA 589 bp ds-DNA PRI 24-JUL-1990
DEFINITION Monkey sp. DNA fragment found also in SV40 genome.
ACCESSION M33902
KEYWORDS .
SOURCE Monkey sp. DNA.
ORGANISM Monkey sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini;
Cercopithecidae.
REFERENCE 1 (bases 1 to 589)
AUTHORS Szymanski,P. and Woodworth,M.
TITLE A 69-base-pair monkey DNA sequence enhances simian virus 40
replication and transcription through multiple motifs
JOURNAL J. Virol. 64, 1360-1365 (1990)
STANDARD simple staff_review
FEATURES from to/span description
binding 221 230 AP-4 binding site
binding 243 249 AP-1 binding site
BASE COUNT 162 a 161 c 118 g 148 t
ORIGIN
1 aagcttcgta gaatctcaga aagtcataaa gctacaaacg agttcacaaa caagtgatgc
61 atagaccaca cccagcccac agatgagttc tgtgtggctc acacttattt tttaaaacag
121 agtttgctgc caacacttca aaacccaatg attttgcata aaaatccaga tttattagaa
181 aatctgtgga cttttctgcc tggcaacaac catctggaac tcagcagcgg ttgcccctga
241 gatgagtcac agttcccacc ccactcaccc atgtcaccta tatggcttgg aaggcatgtg
301 agtttgcaac ctgctcaaag ccacacccca gaaacggcct gctaaaatgg gattccagaa
361 tccacttgca cctttcgtct ttctcagtct cccctgactc aacacagcac actggcattg
421 ttttttttcc tctcagaaaa ctgagcatgc acttttgaga atcagagaga aatccgcctc
481 tcaaatgtca gaaaacagct gatgctgcct cttacttggg ggcctcttga cttcccacgt
541 gtcttactgc cacttgagag ggaggggtgc aggcaggcag gctccctgg
//
LOCUS MNKSVAB 69 bp ds-DNA PRI 24-JUL-1990
DEFINITION Monkey sp. DNA fragment also found in SV40 genome.
ACCESSION M33903
KEYWORDS .
SOURCE Monkey sp. DNA.
ORGANISM Monkey sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini;
Cercopithecidae.
REFERENCE 1 (bases 1 to 69)
AUTHORS Szymanski,P. and Woodworth,M.
TITLE A 69-base-pair monkey DNA sequence enhances simian virus 40
replication and transcription through multiple motifs
JOURNAL J. Virol. 64, 1360-1365 (1990)
STANDARD simple staff_review
FEATURES from to/span description
binding 1 7 AP-1 binding site
binding 41 50 AP-4 binding site
binding 63 69 AP-1 binding site
BASE COUNT 16 a 21 c 15 g 17 t
ORIGIN
1 tgactcatca cttttctgcc tggcaacaac catctggaac tcagcagcgg ttgcccctga
61 gatgagtca
//
LOCUS RSSFBPPRK 2099 bp ds-DNA BCT 24-JUL-1990
DEFINITION R.sphaeroides fructose 1,6-bisphosphatase (fbpB) and
phosphoribulokinase (prkB) genes, complete cds.
ACCESSION J02922
KEYWORDS 1,6-bisphosphatase; phosphoribulokinase.
SOURCE R.sphaeroides (strain HR) DNA.
ORGANISM Rhodobacter sphaeroides
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 2099)
AUTHORS Gibson,J.L., Chen,J.-H., Tower,P.A. and Tabita,F.R.
TITLE The form II fructose 1,6-bisphosphatase and phosphoribulokinase
genes form part of a large operon in Rhodobacter sphaeroides:
Primary structure and insertional mutagenesis analysis
JOURNAL Biochemistry (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.L.Gibson, 15-JUN-1990.
FEATURES from to/span description
pept 44 1039 fructose 1,6-bisphosphatase (fbpB)
pept 1043 1921 phosphoribulokinase (prkB)
BASE COUNT 361 a 694 c 720 g 324 t
ORIGIN
1 ggatcctgac gcaacgtccg ccgcgacaga ggcaggagga gccatggcca tcgagctgga
61 ggacctgggg ctgagccccg atgtggcgga cgtgatgcag cgtctggcgc gcgtgggggc
121 aggcatcgcc cgcatcatct cgcgcaacgg gctcgagcgc gatctgggcg cgggcgtcgg
181 caccaatgcc ggaggagacg ggcagaaggc gctcgacgtg atcgcggacg acgcgttccg
241 cgcggcgctc gaaggctctg cggtggctta ttacgcctcc gaggagcagg acgaagtggt
301 gacgctgggc gagggaagcc tcgcgctcgc catcgacccg ctggacggct cgtccaacat
361 cgatgtgaac gtgtcgatcg ggacgatctt ctcgatcttc ccggcggcgg ctggccccga
421 ggccagcttc ctgcgcccgg gcaccgagca gattgccggc ggctacatca tctacgggcc
481 gcaatgcgcg ctggtctgca gcttcgggca gggcgtgcag cactgggtgc tcgacctcga
541 tgcgggcatc ttccggcgga tgcccgacat ccgcccgctg ccggccgaga cgtccgagtt
601 tgcgatcaac gcctcgaact accgccactg gccgcagccg atccgcgcct tcgtcgacga
661 tctggtcgcc ggggccgagg ggccgcgcgg caagaacttc aacatgcgct ggatcgcctc
721 gctggtggcc gagacgcacc gcatcctgat gcggggcggg gtgtttctct atcccggcga
781 cgagcgcaag ggctacgagc ggggccggct gcgccatgtc tacgaatgcg cgcccatcgc
841 cttcctgatc gcgaatgtcg gggggggcgc caccgacggc tgcgccgaca tcctgaccgc
901 gctgcccgac cggctgcacg cccgcacccc cttcgtcttc ggctgcgcga gcaaggtcgc
961 ccgcgtcgcc gcctatcacg atctggcctg cgaagagacg tccgctctct tcggcagccg
1021 gggcctgttc cggagttaaa gagtgtcgaa gaaatatccc atcatttccg tggtcggctc
1081 gtccggcgcg ggcacctcga cggtcaagaa cacgttcgag cagatcttcc gccgcgaggg
1141 ggtcaagtcc gtctcgatcg agggcgacgc cttccaccgc ttcaaccggg ccgacatgaa
1201 ggccgaactc gagcggcgct atgcggcggg cgatgcgacc ttctcgcatt tctcctacga
1261 ggcgaacgaa ctgaaggagc tggagcgcgt cttccgcgaa tatggcgaga cggggcgcgg
1321 ccgcacccgc acctatgtcc atgacgatgc cgaagccgcc cggacgggcg tggcccccgg
1381 caatttcacc caatgggcgc cgttcgagga caacagcgac ctgcttttct acgaggggct
1441 gcacggctgc gtggtcaatg acgaggtgaa cctcgtccgc catgccgatc tgaagctcgg
1501 cgtggcgccg gtcatcaacc ttgaatggat ccagaagatc caccgcgacc gggcgcagcg
1561 cggctatacg accgaagccg tcaccgacgt gatcctgcgc cggatgtatg cctacgtcca
1621 ctgtatcgtc ccgcaattct ccgagacgga catcaacttc cagcgcgtgc cggtggtgga
1681 cacctcgaac ccgttcatcg cgcgctggat ccccacgccg gacgagagcc tgatcgtgat
1741 ccggttcaag aacccgcgcg ggatcgactg cccctatctc acctcgatga tcgcgggctc
1801 gtggatgagc cgggcgaatt ccatcgtggt gccgggcaac aagcaggatc tggcgatgca
1861 gctgatcctg acgccgctca tcgagcggat ggtgcgcgag gcgcgccgcg cgcgggcctg
1921 aggcgcgaca gacagacgga gagggggccg cggatcgagc cgcggcccgg caaccgggac
1981 gcacagcgaa ggatgagagc catgaaggac attggagccg cgcaggagac gcggatggcg
2041 aacgcatccg ggccctcgcg atggatgccg tgaagaaggc caagtcgggc catcccggg
//
LOCUS YSCACH1A 2466 bp ss-mRNA PLN 24-JUL-1990
DEFINITION S.cerevisiae acetyl-CoA hydrolase (ACH1) mRNA, complete cds.
ACCESSION M31036
KEYWORDS acetyl-CoA hydrolase.
SOURCE S.cerevisiae (strain T3A), cDNA to mRNA, clones pBA[4,5].
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2466)
AUTHORS Lee,F.-J.S., Lin,L.-W. and Smith,J.A.
TITLE A glucose-repressible gene encodes acetyl-CoA hydrolase from
Saccharomyces cerevisiae
JOURNAL J. Biol. Chem. 265, 7413-7418 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.-W.Lin, 27-DEC-1989.
FEATURES from to/span description
pept 614 2194 acetyl-CoA hydrolase (ACH1) (EC 3.1.2.1)
BASE COUNT 730 a 529 c 558 g 649 t
ORIGIN Chromosome II.
1 gttagccatc attatcatta aaatatcaac ccgaagaaca atgtatacat atacatatac
61 gtacacatat acatatgtac atatgacata cgtattagcc gctgaggacg cggacgtata
121 aaaggacaat acttatatgg agctaagggg agcagttacg caactccgtg atcgcgcgcc
181 acgggccgtc ggcggctgtt aattgaagaa aaaaaaaatg aagaaccaca aggggtgatc
241 catataggtg actagcatca tcccctgcga cgcgcggccc gccgggcaaa ggcgggcaat
301 gcgcgctgct gattggcctc gaggacaacg ccctcaacca catccgcaac agccaatccc
361 atcggagcgt caaactacca aagtagtgat tgtatggatc accactgtat tgtggacggt
421 aagcgcttgc tggagcaaat gtgtaatcaa gttgctgtgt atatatagac gttagatgtg
481 ttctacccct tcttttgtct tgtgcccacc gggcttacat tagcacacaa agcagcaaga
541 gaccgtctta ctagacaata gcggcaaaac aaacaacaca tttctttttt tctttttcac
601 atattgcact aaaatgacaa tttctaattt gttaaagcag agagttaggt atgctcccta
661 tctgaaaaaa gttaaggaag ctcacgagct tattccattg ttcaagaatg gtcagtacct
721 tgggtggtcc ggttttacag gagtgggtac tcccaaggca gtgccggagg cactgataga
781 tcacgtggag aagaacaatt tacaagggaa gttgagattc aacctttttg ttggagcttc
841 tgctggtcca gaggaaaacc gttgggctga acacgacatg atcattaaga gagcccctca
901 tcaagtaggg aaacccattg caaaggcaat taaccagggt agaattgagt tctttgataa
961 acatctgtcc atgttccctc aggatctgac atacgggttc tacaccaggg aaagaaaaga
1021 caacaaaatc cttgattata ctataatcga ggcaacggcc attaaagagg acgggtctat
1081 cgtcccaggt ccctctgtcg gtggttctcc agaattcatt acagtcagtg ataaagtgat
1141 tattgaggtt aacacggcta cgccttcgtt cgagggtatt cacgatatag acatgcccgt
1201 gaacccacct ttcaggaaac catacccata tctgaaagtg gacgacaagt gtggtgttga
1261 ctccatcccg gttgatcctg aaaaggttgt tgcgattgtg gagtccacca tgagggacca
1321 ggtcccacca aatacgccct ctgacgacat gtccagggct attgcaggtc atttggtcga
1381 gtttttcaga aacgaggtaa aacatggtag gctacctgaa aacctgctgc ctttacaaag
1441 tggtataggt aacattgcta acgctgtcat tgaagggctt gctggcgccc aattcaagca
1501 cttgactgta tggacggaag tgctgcagga ctcgttattg gatcttttcg agaacggatc
1561 tttggactac tccactgcta cttccgtgag attgactgaa aagggtttcg acagagcctt
1621 tgcaaactgg gaaaatttca aacacagatt gtgtttgaga tctcaagttg tctcgaacaa
1681 tccggaaatg atccgtagat tccctgtcat cgccatgaat accccagtag aagttgacat
1741 ttacgcgcac gccaattcta caaatgtgaa tggttcccgt atgttgaacg ggttgggtgg
1801 atctgctgat ttcttgagaa atgcaaagtt gtccatcatg catgccccct ctgcaagacc
1861 aactaaagta gaccctaccg gtatctctac cattgttcct atggcctctc atgtagatca
1921 aactgagcat gacctggaca tcttggtcac tgaccaaggt ttggcggatc taagaggtct
1981 atcgcctaag gaaagagccc gtgaaatcat caacaagtgt gctcatcccg attatcaagc
2041 tttgttgacc gattacttgg acagagcaga gcattacgct aaaaagcaca attgcttgca
2101 tgaaccacac atgctaaaga atgctttcaa gttccacacc aacttagctg aaaagggtac
2161 aatgaaggtc gacagctggg aaccagttga ctagtgtttg tgcgcaaacc gagagatgag
2221 tatttaacaa aaaaaagaaa ggaaatgata tgattatgat tttatgttta taaagctttt
2281 atccaatgcg ttgttttttc ttgcatattt ataccttttg cgctcatgga gggagttaat
2341 caatacgcat gacgtctagt taattcacag gtagtactgt atatttatat gtttacacaa
2401 taattatgta ttaagtagtg attagtaaaa aaaactaaga ggttgaaagt catcaaccct
2461 tatatt
//
LOCUS YSCYEF3B 3969 bp ds-DNA PLN 24-JUL-1990
DEFINITION S.cerevisiae elongation factor 3 (EF-3) gene, complete cds.
ACCESSION J05583
KEYWORDS elongation factor.
SOURCE S.cerevisiae DNA, clone YEpEF3.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3969)
AUTHORS Sandbaken,M.G., Lupisella,J.A., DiDomenico,B. and Chakraburtty,K.
TITLE Protein synthesis in yeast: Structural and functional analysis of
the gene encoding elongation factor three (EF-3)
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Chakraburtty, 15-JUN-1990.
FEATURES from to/span description
pept 489 3623 elongation factor 3
signal 202 212 regulatory UAS-RPG
binding 1875 1898 nucleotide binding consensus GX4GK(S/T)
binding 2589 2612 nucleotide binding consensus GX4GK(S/T)
binding 463 470 nucleotide binding consensus GX4GK(S/T)
binding 701 708 nucleotide binding consensus GX4GK(S/T)
binding 2117 2213 nucleotide binding consensus
binding 3176 3272 nucleotide binding consensus
binding 544 575 nucleotide binding consensus
binding 897 928 nucleotide binding consensus
BASE COUNT 1203 a 758 c 809 g 1199 t
ORIGIN
1 caacttacaa tcattgttcg ccccttccat acttactgcc actcgcaaaa gggcccaacc
61 agggcaatta cgtatcaaaa aatcatgaca ggctgggtaa taaatattcg tgaagaaaga
121 agaaattaaa aaaagaaacg aagaagcaaa aaaaagaaaa gactccgttt aatcactttc
181 aaccgcggtt tatccggccc cacccatgca taaccctaaa ttattagatc acttagcacg
241 tgaaaaagaa acgtttttaa tgtttttttt ttttttcttt ttcttttttt gcgttggtga
301 aaattttttc gcttcctcga gtataattat ctcatctcat ctttcatata agataagaag
361 ttttataaaa accttttgca tcaaaatttt gttgaataac tctttttctt acgctctctt
421 tctttcctta attgttttct aaagaaccgt gtatttttct agttcgaatc catcgataac
481 attaaaagat gtctgattcc cagcaatcca ttaaggttct agaagaacta ttccagaagt
541 tatctgttgc cactgctgac aacagacacg aaatcgcttc tgaagtcgct tctttcttga
601 atggtaacat cattgaacat gatgttccag aacacttctt cggtgaattg gccaagggta
661 tcaaggacaa gaagaccgct gctaacgcca tgcaagctgt tgctcacatt gctaaccaat
721 ctaacttgtc tccatctgtt gaaccataca tcgtccaatt ggttccagct atctgtacca
781 acgcaggtaa caaggacaaa gaaattcaat ctgttgcttc cgaaactttg atttccatcg
841 ttaacgctgt taacccagtt gccatcaaag ctttgttgcc acatttgact aacgctattg
901 tagaaactaa caaatggcaa gaaaagattg ctattttggc agctttctct gctatggtcg
961 atgctgctaa ggatcaagtt gccctaagaa tgccagaatt gattccagtc ttgtctgaaa
1021 ccatgtggga caccaagaag gaagtcaagg ctgctgctac tgccgccatg accaaggcta
1081 ccgaaactgt tgacaacaag gatattgaac gtttcattcc aagtttgatt caatgtattg
1141 ctgacccaac tgaagttcca gaaaccgttc atttgctagg tgctactact ttcgttgctg
1201 aagttactcc agctactttg tccatcatgg tcccattgtt gtccagaggt ttgaacgaaa
1261 gagaaaccgg tatcaagcgt aagtctgctg ttattattga caacatgtgt aagttggtcg
1321 aagacccaca agttattgct cctttcttgg gtaaattgtt gccaggtttg aagagtaact
1381 ttgctaccat tgctgaccca gaagccagag aagttacttt gagagctttg aagactttga
1441 gaagagttgg taacgttggt gaagacgatg ctattccaga actttctcac gctggtgacg
1501 tttctactac tttgcaagtc gttaacgaat tgttgaagga cgaaaccgtt gctccaagat
1561 ttaagattgt cgtcgagtac attgccgcca ttggtgctga tttgatcgat gaaagaatca
1621 ttgaccaaca agcttggttc acccacatca ccccatacat gactatcttc ttgcacgaaa
1681 agaaggccaa ggacatcttg gacgaattca gaaagagagc tgtcgacaac attccagttg
1741 gtccaaactt cgacgacgaa gaagacgaag gtgaagactt atgtaactgt gaattttctt
1801 tggcttatgg tgctaaaatc ttgttgaaca agacccaatt aagattgaag agagccagaa
1861 gatatggtat ctgtggtcca aacggttgtg gtaagtccac tttaatgaga gctattgcca
1921 acggtcaagt tgatggtttc ccaacccaag aagaatgtag aaccgtctac gtcgaacacg
1981 acattgatgg tactcactct gacacttccg tcttggattt cgttttcgaa tctggtgttg
2041 gtactaaaga agctatcaag gacaaattga ttgaattcgg tttcaccgat gaaatgattg
2101 ctatgccaat ctctgcttta tctggtggtt ggaagatgaa gttggctcta gctagagctg
2161 tgttgagaaa tgctgatatc ttgttgttag atgaaccaac taaccatttg gataccgtca
2221 acgttgcttg gttagttaac tacttgaaca cctgtggtat cacttctatc actatttctc
2281 acgactccgt tttcttagat aacgtctgtg aatatattat taactacgaa ggtttgaagt
2341 tgagaaagta caagggtaac tttaccgaat tcgttaagaa gtgtccagct gctaaggctt
2401 acgaagaatt atccaacact gatttggaat tcaagttccc agaaccaggt tacttggaag
2461 gtgttaagac taagcaaaag gctattgtca aggttaccaa catggaattc caatatccag
2521 gtacctctaa gccacaaatc actgacatta acttccaatg ttctttgtct tccagaattg
2581 ctgtcattgg tccaaatggt gctggtaagt ctactttgat taacgtcttg actggtgaac
2641 tattaccaac ctctggtgaa gtctacaccc acgaaaattg tcgtatcgct tacattaagc
2701 aacacgcttt tgctcatatc gaatctcatt tggacaagac tccatctgaa tatatccaat
2761 ggagattcca aaccggtgaa gatagagaaa ccatggacag agctaacaga caaatcaacg
2821 aaaacgatgc tgaagctatg aacaagatct tcaagattga aggtacccct agaagaattg
2881 ccggtatcca ctccagaaga aagttcaaga acacttacga atatgaatgt tctttcttat
2941 tgggtgaaaa cattggtatg aaatctgaaa gatgggttcc aatgatgtcc gtcgacaacg
3001 cttggattcc aagaggtgaa ttggttgaat ctcactctaa gatggttgct gaagttgata
3061 tgaaggaagc tttggcttct ggtcaattcc gtccattaac cagaaaagaa attgaagaac
3121 attgttccat gttgggtttg gacccagaaa ttgtttctca ctccagaatt agaggtttgt
3181 ctggtggtca aaaggttaag ttggtcttag ctgccggtac atggcaaaga cctcacttga
3241 ttgtcttaga tgaacctacc aactatctgg acagagattc tttgggtgct ttgtctaagg
3301 ctttgaagga atttgaaggt ggtgttatta tcattactca ctctgctgaa ttcacaaaga
3361 acttgactga agaagtctgg gccgtcaagg acggtagaat gactccatct ggtcacaact
3421 gggttagtgg tcaaggtgct ggtccaagaa tcgaaaagaa ggaagacgaa gaagataaat
3481 tcgatgctat gggtaacaag attgccggtg gtaagaagaa gaagaagttg tcttctgcgg
3541 aattgagaaa gaagaagaag gaaagaatga agaagaagaa ggaattgggt gatgcttacg
3601 tttcttctga cgaagaattc taatcttttt gatcactgct ttcacagttt tctttaagat
3661 ttttattgat caataattta tgtatatttt aatttctatg tttttgtaat attgtttatt
3721 ttggtaaaat atagacgcaa cttccttatt ataaagaaag gcattattta aaagaaaaag
3781 cgttccatta gtcagacatc tttttttttt tcatacattc ttaagcctca ggcaaattga
3841 gcattgcctc ataccctttt cggtaagagg gtaacgaaaa tatttttttg gaagaataaa
3901 aataggtgac ggatcataga ctaggaagct ttaaaacatg attgagcgta atattatatt
3961 ccttctaga
//
LOCUS HUMKSAA 1504 bp ss-mRNA PRI 24-JUL-1990
DEFINITION Human adenocarcinoma-associated antigen (KSA) mRNA, complete cds.
ACCESSION M32325
KEYWORDS adenocarcinoma-associated antigen.
SOURCE Human cell line UCLA-P3, cDNA to mRNA, clone AG[1,1338,933].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1504)
AUTHORS Strnad,J., Hamilton,A.E., Beavers,L.S., Gamboa,G.C., Apelgren,L.D.,
Taber,L.D., Sportsman,J.R., Bumol,T.F., Sharp,J.D. and Gadski,R.A.
TITLE Molecular cloning and characterization of a human
adenocarcinoma/epithelial cell surface antigen complementary DNA
JOURNAL Cancer Res. 49, 314-317 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.D. Sharp, 22-EB-1990.
FEATURES from to/span description
pept 155 1099 adenocarcinoma-associated antigen precursor
(KSA)
sigp 155 223 adenocarcinoma-associated antigen signal
peptide
matp 398 1096 adenocarcinoma-associated antigen
mRNA < 1 1504 adenocarcinoma-associated antigen mRNA
site 1486 1491 polyadenylation site
BASE COUNT 442 a 302 c 356 g 404 t
ORIGIN
1 gagcgagcac cttcgacgcg gtccggggac cccctcgtcg ctgtcctccc gacgcggacc
61 cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt gtcccactcc cggcgcacgc
121 cctcccgcgc ccctcttctc ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt
181 cgggcttctg cttgccgcgg cgacggcgac ttttgccgca gctcaggaag aatgtgtctg
241 tgaaaactac aagctggccg taaactgctt tgtgaataat aatcgtcaat gccagtgtac
301 ttcagttggt gcacaaaata ctgtcatttg ctcaaagctg gctgccaaat gtttggtgat
361 gaaggcagaa atgaatggct caaaacttgg gagaagagca aaacctgaag gggccctcca
421 gaacaatgat gggctttatg atcctgactg cgatgagagc gggctcttta aggccaagca
481 gtgcaacggc acctccacgt gctggtgtgt gaacactgct ggggtcagaa gaacagacaa
541 ggacactgaa ataacctgct ctgagcgagt gagaacctac tggatcatca ttgaactaaa
601 acacaaagca agagaaaaac cttatgatag taaaagtttg cggactgcac ttcagaagga
661 gatcacaacg cgttatcaac tggatccaaa atttatcacg agtattttgt atgagaataa
721 tgttatcact attgatctgg ttcaaaattc ttctcaaaaa actcagaatg atgtggacat
781 agctgatgtg gcttattatt ttgaaaaaga tgttaaaggt gaatccttgt ttcattctaa
841 gaaaatggac ctgacagtaa atggggaaca actggatctg gatcctggtc aaactttaat
901 ttattatgtt gatgaaaaag cacctgaatt ctcaatgcag ggtctaaaag ctggtgttat
961 tgctgttatt gtggttgtgg tgatggcagt tgttgctgga attgttgtgc tggttatttc
1021 cagaaagaag agaatggcaa agtatgagaa ggctgagata aaggagatgg gtgagatgca
1081 tagggaactc aatgcataac tatataattt gaagattata gaagaaggga aatagcaaat
1141 ggacacaaat tacaaatgtg tgtgcgtggg acgaagacat ctttgaaggt catgagtttg
1201 ttagtttaac atcatatatt tgtaatagtg aaacctgtac tcaaaatata agcagcttga
1261 aactggcttt accaatcttg aaatttgacc acaagtgtct tatatatgca gatctaatgt
1321 aaaatccaga acttggactc catcgttaaa attatttatg tgtaacattc aaatgtgtgc
1381 attaaatatg cttccacagt aaaatctgaa aaactgattt gtgattgaaa gctgcctttc
1441 tatttacttg agtcttgtac atacatactt ttttatgagc tatgaaataa aacattttaa
1501 actg
//
LOCUS HAMHITAO1 1581 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Hamster G-o protein alpha subunit 1 (GNAO I) mRNA, complete cds.
ACCESSION M33661 J05476
KEYWORDS G-o protein alpha subunit.
SOURCE Hamster insulinoma cell line HIT T-125 M2.2.2, cDNA to mRNA.
ORGANISM Cricetulus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 1581)
AUTHORS Hsu,W.H., Rudolph,U., Sanford,J., Bertrand,P., Olate,J., Nelson,C.,
Moss,L.G., Boyd,A.E.III., Codina,J. and Birnbaumer,L.
TITLE Molecular cloning of a novel splice variant of the alpha subunit of
the mammalian G-o protein
JOURNAL J. Biol. Chem. 265, 11220-11226 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Birnbaumer, 10-APR-1990, for release after publication.
FEATURES from to/span description
pept 458 1522 G-o protein alpha subunit 1
BASE COUNT 357 a 472 c 405 g 347 t
ORIGIN
1 gcctccgcct cctcctcctc cgcctccggc agccgcggca gaaggaccca ccctgccccc
61 caccccaccc tccgccggct ccggctgcgg atccagcctc tactctcatt ttatttattt
121 tgggtcgtgc actagcctca gtgcctgcag cccgcgcctc ccgggcccgc gggcacctcc
181 tccctcggct ccggaggccc agaccccggc caccctcacc tcgacaccct cagaccccgg
241 ccagtcgccg ctagtctccg ccgctggaat cttgatagag gctgtcattt tttgggggtt
301 ctggcctttt cgacaatttt gttcccagcc aaggagagga tatcgtgatt ttctcccctt
361 tgagcccagg ctctgctctg tgggggggtg gggggcgcgc cgacccgggg agtcgtgcca
421 gccgagtcgt gcgggctgtg gcagggaagg ggccaccatg ggatgtactc tgagcgcaga
481 ggagagagcc gccctcgagc ggagcaaggc gattgagaaa aacctcaaag aagatggcat
541 cagcgccgcc aaagatgtga aattactcct gctgggggct ggagaatcag gaaaaagcac
601 cattgtgaag cagatgaaga tcatccatga agatggcttc tctggagagg acgtaaagca
661 gtacaagcct gttgtctata gcaacaccat ccagtctctg gcagccatcg tccgggccat
721 ggatactttg ggtgtggagt atggtgacaa ggagagaaag gcggactcca agatggtgtg
781 tgatgtggtg agccgtatgg aagacactga acccttctct gcagagctgc tttctgccat
841 gatgcgactc tggggtgact cagggatcca ggagtgcttc aaccgatctc gggagtatca
901 gctcaatgat tctgccaaat actacctgga cagcctggat cggattggag ccgctgacta
961 ccagcccact gagcaggaca tcctccgaac cagggtcaaa accaccggca tcgtagaaac
1021 ccacttcaca ttcaagaacc tccacttcag gctgtttgat gttgggggcc agcgatctga
1081 acgcaagaag tggatccact gctttgagga tgtcacggcc atcatcttct gtgtcgccct
1141 cagcggctat gaccaggtgc tccacgagga cgaaaccacg aaccgcatgc acgagtctct
1201 catgctcttc gactccatct gtaacaacaa gtttttcatc gatacctcca tcattctctt
1261 cctcaacaag aaagacctct ttggcgagaa gattaagaag tcacccttga ccatctgctt
1321 ccctgaatac ccaggctcca acacctatga agatgcagct gcctacatcc aaacacagtt
1381 tgaaagcaaa aaccgctcac ctaacaaaga aatctattgt cacatgactt gtgccacaga
1441 cacgaataat atccaggtgg tattcgacgc tgtcaccgac atcatcattg ccaacaatct
1501 ccggggctgc ggcttgtact gacctcttgt cctgtatagc aacctatttg actgcttcat
1561 ggactctttg ctgttgatgt t
//
LOCUS HAMHITAO2 1678 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Hamster G-o protein alpha subunit 2 mRNA, complete cds.
ACCESSION M33662 J05476
KEYWORDS G-o protein alpha subunit.
SOURCE Hamster insulinoma cell line HIT T-125 M2.2.2, cDNA to mRNA.
ORGANISM Cricetulus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 1678)
AUTHORS Hsu,W.H., Rudolph,U., Sanford,J., Bertrand,P., Olate,J., Nelson,C.,
Moss,L.G., Boyd,A.E.III., Codina,J. and Birnbaumer,L.
TITLE Molecular cloning of a novel splice variant of the alpha subunit of
the mammalian G-o protein
JOURNAL J. Biol. Chem. 265, 11220-11226 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Birnbaumer, 10-APR-1990, for release after publication.
FEATURES from to/span description
pept 458 1522 G-o protein alpha subunit 2
BASE COUNT 374 a 514 c 448 g 342 t
ORIGIN
1 gcctccgcct cctcctcctc cgcctccggc agccgcggca gaaggaccca ccctgccccc
61 caccccaccc tccgccggct ccggctgcgg atccagcctc tactctcatt ttatttattt
121 tgggtcgtgc actagcctca gtgcctgcag cccgcgcctc ccgggcccgc gggcacctcc
181 tccctcggct ccggaggccc agaccccggc caccctcacc tcgacaccct cagaccccgg
241 ccagtcgccg ctagtctccg ccgctggaat cttgatagag gctgtcattt tttgggggtt
301 ctggcctttt cgacaatttt gttcccagcc aaggagagga tatcgtgatt ttctcccctt
361 tgagcccagg ctctgctctg tgggggggtg gggggcgcgc cgacccgggg agtcgtgcca
421 gccgagtcgt gcgggctgtg gcagggaagg ggccaccatg ggatgtactc tgagcgcaga
481 ggagagagcc gccctcgagc ggagcaaggc gattgagaaa aacctcaaag aagatggcat
541 cagcgccgcc aaagatgtga aattactcct gctgggggct ggagaatcag gaaaaagcac
601 cattgtgaag cagatgaaga tcatccatga agatggcttc tctggagagg acgtaaagca
661 gtacaagcct gttgtctata gcaacaccat ccagtctctg gcagccatcg tccgggccat
721 ggatactttg ggtgtggagt atggtgacaa ggagagaaag gcggactcca agatggtgtg
781 tgatgtggtg agccgtatgg aagacactga acccttctct gcagagctgc tttctgccat
841 gatgcgactc tggggtgact cagggatcca ggagtgcttc aaccgatctc gggagtatca
901 gctcaatgat tctgccaaat actacctgga cagcctggat cggattggag ccgctgacta
961 ccagcccact gagcaggaca tcctccgaac cagggtcaaa accaccggca tcgtagaaac
1021 ccacttcaca ttcaagaacc tccacttcag gctgtttgat gttgggggcc agcgatctga
1081 acgcaagaag tggatccact gctttgagga tgtcacggcc atcatcttct gtgtcgccct
1141 cagcggctat gaccaggtgc tccacgagga cgaaaccacg aaccgaatgc atgagtccct
1201 gaagcttttt gacagcatct gcaacaacaa gtggttcaca gacacgtcta tcatcctgtt
1261 tctcaacaag aaggacatat ttgaggagaa gatcaccagg tccccactga ccatctgctt
1321 tcctgaatac acaggcccca gtgccttcac agaagctgtg gctcacatcc aggggcagta
1381 tgagagtaag aacaaatcag ctcacaagga aatctacacc catttcacct gtgccaccga
1441 caccaacaac atccagtttg tctttgatgc tgtgacggac gtcatcatcg ccaaaaatct
1501 gcggggctgt ggactctact gagccctggc ctcctgccca gcctgctgct cacttctccc
1561 ctggacccag agctctgcta ccactcgatg ctctgtgcac tgagggaaaa cccagaggct
1621 ggccctgggg gaggaggaag caccctctga gcatccccac cctaccccac tccagagg
//
LOCUS HUMPEPD1 610 bp ds-DNA PRI 24-JUL-1990
DEFINITION Human peptidase D gene, exon 1.
ACCESSION M36549 J05513
KEYWORDS peptidase D; prolidase.
SEGMENT 1 of 2
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 610)
AUTHORS Tanoue,A., Endo,F. and Matsuda,I.
TITLE Structural organization of the gene for human prolidase (peptidase
D) and demonstration of a partial gene deletion in a patient with
prolidase deficiency
JOURNAL J. Biol. Chem. 265, 11306-11311 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
F.Endo, 23-MAY-1990.
FEATURES from to/span description
pept 372 / 388 peptidase D, exon 1 (EC 3.4.13.9)
/hgml_locus_uid="LF0121K" /nomgen="PEPD"
/map="19q12-q13.2"
pre-msg 241 > 610 PEPD mRNA and introns
IVS 389 > 610 PEPD intron A
BASE COUNT 96 a 225 c 185 g 104 t
ORIGIN
1 gtggtgtacg tgtacgtaag atcggggact gaacggagaa tcaagatgtc ctctgcttca
61 ggacctccct cccaccacga aaaggctggc agatgcagat gctgcccgct aacccgacct
121 actgtctgca gccggcagcc cgctcattta tcctggccga gctcttgggc ccccaatctc
181 caggagcccc gccccatcgg ctccgccctt ccccaggtac cgccccttcc cgcccgtgga
241 atttggctct ggaggaggtg ggtcaaggac tccatgcccc gccccctgcc ccgcccttac
301 cccgccccat gcattggcac cggaggggct catggacgcg cgcacttcag tgacgccggt
361 gccgggcgaa catggcggcg gccaccgggt gagtgctgcg cctccctcgc tctttcccgc
421 tggcgtggag aggggcatgg gtgtcccggc cggaccgcgg gtgagagacc gtcaccgggt
481 cgcgtgaatg ccgcgggtag gcgcctcttc ccgttggccc ggaagtccca gcgccggtca
541 ggcgagcctg cacgcgtccg agtcaaaggt caacccgagc aggacccgac tacctctcct
601 aacctctcca
//
LOCUS HUMPEPD2 530 bp ds-DNA PRI 24-JUL-1990
DEFINITION Human peptidase D gene, 3' end.
ACCESSION M35496 J05513
KEYWORDS peptidase D; prolidase.
SEGMENT 2 of 2
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 530)
AUTHORS Tanoue,A., Endo,F. and Matsuda,I.
TITLE Structural organization of the gene for human prolidase (peptidase
D) and demonstration of a partial gene deletion in a patient with
prolidase deficiency
JOURNAL J. Biol. Chem. 265, 11306-11311 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
F.Endo, 23-MAY-1990.
FEATURES from to/span description
pept / 1 3 peptidase D, exon 15 (EC 3.4.13.9)
/hgml_locus_uid="LF0121K" /nomgen="PEPD"
/map="19q12-q13.2"
pre-msg < 1 394 PEPD mRNA and introns
signal 373 378 poly-A signal
BASE COUNT 133 a 132 c 114 g 151 t
ORIGIN About 110 kb after segment 1.
1 tagagccagc cagaaatccc agcgcacctg ggggcctggc cttgcaacct cttttcgtga
61 tgggcagcct gctggtcagc actccagtag cgagagacgg cacccagaat cagatcccag
121 cttcggcatt tgatcagacc aaacagtgct gtttcccggg gaggaaacac ttttttaatt
181 acccttttgc aggcaccacc tttaatctgt tttatacctt gcttattaaa tgagcgactt
241 aaaatgattg aaaataatgc tgttctttag tagcaactaa aatgtgtctt gctgtcattt
301 atattccttt tcccaggaaa gaagcatttc tgatactttc tgtcaaaaat caatatgcag
361 aatggcattt gcaataaaag gtttcctaaa atggtccttt gtgcgtcggc ttcccctttc
421 attcctttgc agatgagcat ttgggacccc gtccgacccc tgaaaagatc ggctcacctg
481 gtgtcagtat cagtcattcc aggaagatgg ctcgtggcag cacaggccct
//
LOCUS SYNPMAMA 4354 bp ds-DNA SYN 24-JUL-1990
DEFINITION Artificial cloning vector pMAMPF.
ACCESSION M33637
KEYWORDS .
SOURCE Synthetic DNA from E.coli cell line WK6-lambd-pMAMPF3, from pMA58.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 4354)
AUTHORS Szardenings,M. and Collins,J.
TITLE A plasmid optimized for protein design projects: pMAMPF
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Collins, 09-APR-1990.
FEATURES from to/span description
signal 4121 4183 OmpA signal sequence
site 4181 4186 NaeI cloning site
site 4184 4308 Stuffer (pBR325:5900 to 29)
BASE COUNT 1152 a 1054 c 1012 g 1136 t
ORIGIN
1 aattcacctc gaaagcaagc tgataaaccg atacaattaa aggctccttt tggagccttt
61 ttttttggag attttcaacg tgaaaaaatt attattcgca attccaagct aattcacctc
121 gaaagcaagc tgataaaccg atacaattaa aggctccttt tggagccttt ttttttggag
181 attttcaacg tgaaaaaatt attattcgca attccaagct ctgcctcgcg cgtttcggtg
241 atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag
301 cggatgcaga tcacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc
361 gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt
421 cctttctcgc cacgttcgcc agctttcccc gtcaagctct aaatcggggg ctccctttag
481 ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt
541 cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt
601 tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt
661 cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt
721 aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt tacaatttga tctgcgctcg
781 gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca
841 gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac
901 cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac
961 aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg
1021 tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac
1081 ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat
1141 ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag
1201 cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac
1261 ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt
1321 gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt
1381 atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc
1441 aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga
1501 aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac
1561 gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc
1621 cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct
1681 gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca
1741 tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct
1801 ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca
1861 ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc
1921 atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg
1981 cgcaacgttg ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct
2041 tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa
2101 aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta
2161 tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc
2221 ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg
2281 agttgctctt gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa
2341 gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg
2401 agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc
2461 accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg
2521 gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcagacag
2581 ttttattgtt catgatgata tatttttatc ttgtgcaatg taacatcaga gattttgaga
2641 cacaacgtgg ctttgttgaa taaatcgaac ttttgctgag ttgactcccc gcgcgcgatg
2701 ggtcgaattt gctttcgaaa aaaaagcccg ctcattaggc gggctaaaaa aaagcccgct
2761 cattaggcgg gctcgaattt ctgccattca tccgcttatt atcacttatt caggcgtagc
2821 aaccaggcgt ttaagggcac caataactgc cttaaaaaaa ttacgccccg ccctgccact
2881 catcgcagta ctgttgtaat tcattaagca ttctgccgac atggaagcca tcacagacgg
2941 catgatgaac ctgaatcgcc agcggcatca gcaccttgtc gccttgcgta taatatttgc
3001 ccatagtgaa aacgggggcg aagaagttgt ccatattcgc cacgtttaaa tcaaaactgg
3061 tgaaactcac ccagggattg gctgagacga aaaacatatt ctcaataaac cctttaggga
3121 aataggccag gttttcaccg taacacgcca catcttgcga atatatgtgt agaaactgcc
3181 ggaaatcgtc gtggtattca ctccagagcg atgaaaacgt ttcagtttgc tcatggaaaa
3241 cggtgtaaca agggtgaaca ctatcccata tcaccagctc accgtctttc attgccatac
3301 gaaattccgg atgagcattc atcaggcggg caagaatgtg aataaaggcc ggataaaact
3361 tgtgcttatt tttctttacg gtctttaaaa aggccgtaat atccagctga acggtctggt
3421 tataggtaca ttgagcaact gactgaaatg cctcaaaatg ttctttacga tgccattggg
3481 atatatcaac ggtggtatat ccagtgattt ttttctccat tttagcttcc ttagctcctg
3541 aaaatctcga taactcaaaa aatacgcccg gtagtgatct tatttcatta tggtgaaagt
3601 tggaacctct tacgtgccga tcaacgtctc attttcgcca aaagttggcc cagggcttcc
3661 cggtatcaac agggacacca ggatttattt attctgcgaa gtgatcttcc gtcacaggta
3721 tttattcgaa gacgaaaggg catcgcgcgc ggggaattgg ccacgatgcg tccggcgtag
3781 aggatctctc acctaccaaa caatgccccc ctgcaaaaaa taaattcata taaaaaacat
3841 atagataacc atctgcggtg ataaattatc tctggcggtg ttgacataaa taccactggc
3901 ggtgatactg agcacatcag caggacgcac tgaccaccat gaaggtgacg ctcttaaaat
3961 taagccctga agaagggcag cattcaaagc agaaggcttt ggggtgtgtg atacgaaacg
4021 aagcattgga attctacaac ttgcttggat tcctacaaag aagcagcaat tttcagtgtc
4081 agaagtcgac caaggaggtc tagataacga ggcgcaaaaa atgaaaaaga cagctatcgc
4141 gattgcagtg gcactggctg gtttcgctac cgtagcgcag gccggcgata atggcctgct
4201 tctcgccgaa acgtttggtg gcgggaccag tgacgaaggc ttgagcgagg gcgtgcaaga
4261 ttccgaatac cgcaagcgac aggccatgtt tgacagctta tcatcgataa gcttggatcc
4321 gtcgacctgc agccaagctt ggtctagagg tcga
//
LOCUS DSBRR1610 1376 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.autotrophicum 16S ribosomal rRNA.
ACCESSION M34409
KEYWORDS ribosomal 16S rRNA.
SOURCE D.autotrophicum (strain DSM 3382) rRNA.
ORGANISM Desulfobacterium autotrophicum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 1376)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1376 16S rRNA
BASE COUNT 340 a 287 c 399 g 293 t 57 others
ORIGIN
1 aactggagag tttgattctg gctcagaatg aacgctggcg gcgtgcttaa cacatgcaag
61 tcgaacgaga aagggattgc ttgcaatcct gagtagagtg gcgcacgggt gagtaacacg
121 tggataatct acctttgaat ccgggataac tgttcgaaag aatcgctaat accggatgaa
181 gtcagtnttt ccttggaagg attgatgaaa gacagcctct tcttgaaagc tnttgtttga
241 agatgagtcc gcgcaccatt agttngttgg tggggtaacg gcctaccaag acnttnatgg
301 ttagctggtc tgagaggatg atcagccacn ctggaactga cacacggtcc ngactcctnc
361 gggaggcagc agtgaggaat tttgcgcaat gggngcnacc ctnacgcagc aacgccgcgt
421 gagtgaagaa ggcctntggg tcgtaaagct ctgtcaatgg ggaagaagtt rcttcatatt
481 aatrgtatgt ngtattgacg gtacccatnn nngaagcacc ggcnnactcc gtgccagcan
541 nnncggtaat anggagngtg cnagcgtnat tcggaatnat tnggngtnaa gggcacgcag
601 gcggtnttgc cagtcagatg tgaaagcccg gggctcaacc ctggaagtgc atttgaaact
661 acaaggcttg agtacggtng aggaaagggg nattcctngt gtagaggtga aattcgtaga
721 tatcaggagg aacaccggtg gcgaaggcgc cnntctggac csnnactgac gctnaggtnc
781 naaggcgtgg gnagcgaaca ggattagata ccctggtagt ccacgcagta aacgttgtat
841 actaggtgta gcgggntatt gacccntgct gtgccggagc taacgcatta agtataccgc
901 ctgggaagta cggtcgcaag actaaaactc aaagaaattg acgggggccc gcacaagcgg
961 tggagcatgt ggtttaattc ganncaacgc gcagaacctt acccagactt gacatcctga
1021 gaatctgtta gagatattag agtgccttcg ggaactcaga gacaggtgct gcatggctgt
1081 cgtcagctcg tgtcgtgaga tgttcggtta agtccggcaa cgagcgcaac ccctatcttc
1141 agttgccagc acatcatggt gggaactctg aagagactgc cccggttaac ggggaggaag
1201 gnggggatga cgtcaagtcc tcatggccct tatgtctggg gctacacacg tgntacaatg
1261 gtatgtacna cgggcagcga acctgcnnag gtgngcgaat ccctnaaagc atatcccagt
1321 ccggattgga gtctgcaact cgactccatg aagttggaat cgctagtaat cgtgga
//
LOCUS DSBRR1677 1375 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.niacini 16S ribosomal rRNA.
ACCESSION M34406
KEYWORDS ribosomal 16S rRNA.
SOURCE D.niacini (strain DSM 2650) rRNA.
ORGANISM Desulfobacterium niacini
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 1375)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1375 16S rRNA
BASE COUNT 292 a 252 c 340 g 256 t 235 others
ORIGIN
1 taactggaga gtttgattct ggctcagaat gaacgctggc ggcgtgctta acacatgcaa
61 gtcgaacgag aaagtgcctg cttgcaggaa cgagtagagt ggcgcacggg tgagtaacac
121 gtggataatc taccttcaaa tcgggaataa ccattcgaaa ggatggctaa tgccggataa
181 agtcagatrt tctttggagt atttgatgaa agggtgcctc tgattgaaag catctgtttg
241 aagatgagtc cgcgcaccat tagtttgttg gtggggtaac ggcctaccaa gactncgatg
301 gtnagctngt ctgagaggat gatcagccac actggaactg acacacggtc crgrctccta
361 cgggaggcag cagtgaggaa ttttgcgcaa tgggggcaac cctnacgcag caacgcngcg
421 tgagtgaaga aggccnnngg gtcgtaaagc tntgtccatg gggaagaaat tatttagtgt
481 tratrgcatn ntttattgac nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttcctgg tgtagaggtg aaattcgtag
721 atatcaggag gaacaccggt ggcgaaggcg ccngtctgga ccgatactga cgctsaggtg
781 cgaaggcgtg ggnagcgaac gggattagat rccccggtag tccacgcagt aaacgttgta
841 tgctaggtgt agcgggtatt gcccntgctn cgccgcagtt aacacattaa gcataccgcc
901 tnnnnngtac ggtcgcaaga ctaaaactca aagaaattga cgggggcccg cacaagcggt
961 ggagcatgtg gtttrattcg acnnaacgcg aagaacctta cccagacttg acatcttgag
1021 aatcctttcg aaagttagga gtgccttcgg nngctcaaag acagnngctg catggctgtc
1081 gtcagctcgt gtcgtgagat gtttggttaa gtccagcaac gagcgcaacc cttgtcttca
1141 gttgccagcg cgtaatggcg gaactctgaa gagactgccc cggtnaacgg ggaggaaggt
1201 ggggatgacg tcaagtcctc atggcccnna tgtctgggct acrcrcgtgc tacaatggta
1261 tgtacaacgg gcagcaangc agcgatgcca agcaaatccc ttaaagcata tcccrgtccg
1321 gattggagtc tgcractcga ctccatgaag ttggaatcgc tagtaatcgt ggatc
//
LOCUS DSBRR169 1383 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.vacuolatum 16S ribosomal rRNA.
ACCESSION M34408
KEYWORDS ribosomal 16S rRNA.
SOURCE D.vacuolatum (strain DSM 3385) rRNA.
ORGANISM Desulfobacterium vacuolatum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 1383)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1383 16S rRNA
BASE COUNT 345 a 295 c 411 g 294 t 38 others
ORIGIN
1 ttctggctca gratnaacgc tggcggngtg crnaacacat gcaagtcgaa cgrgraagtt
61 sctgcttgca ggaacgagga gagtggcgct cgggngagta actcgtggat aatctacctt
121 caaatcggga ataaccattc gaaaggatgg ctaatgccgg ataaagtcag atrttctttg
181 grrtntttga tgaaagggag cctctgnata taagctnttg tttgaagatg agtccgcgca
241 ccattagttt gttggtgggg taacggccta ccaagactgt gatggtnagc tggtctgaga
301 ggatgatcag ccacactgga actgacacac ggtccagact cctacgggag gcagcagtga
361 ggaattttgc gcaatggggg aaaccctgac gcagcaacgc cgcgtgagtg aagaaggccc
421 ttgggtcgta aagctctgtc acagggaaga aattattatg ctttaataga gtgtrgtatt
481 gacggtaccc tgtgaagaag caccggctaa ctccgtgcca gcagccgcgg taatacgggg
541 ggtgcaagcg ttattcggaa ttattnggcg taaagggcac gcaggcggtc ctgtcagtca
601 gatgtgaaag ccggggctca accccggacg tgcatttgaa acagcaggnc ttgagtacgg
661 nngaggaaag ggnnnttcct ggtgtagagg tgaaattcgt agatatcagg aggaacaccg
721 gtggcgaagg cgccnntctr gaccgatact gacgctnagg tacnaaggcg tgggtagcga
781 acgggattag ataccccggt agtccacgca gtaaacgttg tatgctaggt gtagcrgggt
841 attgacccct gctgtgccgc agttaacaca ttaagcatac cgcctgggga gtacggtcgc
901 aagactnaaa ctnaaagaaa ttgacggggg cccgcacaag cggtggagca tgtggtttaa
961 ttcgacgnaa cgcgaagaac cttacccaga cttgacatct tgagaatcct ntagaaattt
1021 aggagtgcct tcgggagctc aaagacaggt gctccatggc tgtcgtcagc tcgtgtcgtg
1081 agatgtttgg ttaagtccag caacgagcgc aacccttgtc ttcagttgcc agcacataat
1141 ggtgggaact ctgaagagac tgccccggtt aacggggagg aaggtggggn tgacgtcaag
1201 tcctcatggc cctnatgtct ggggctacac acgtgctaca atggtatgta caacgggcrg
1261 caaggcagcg atgccaagca aatcccttaa agcatatccc agtccggatt ggagtctgcg
1321 actcgactcc atgaagttgg aatcgctagt aatcgtggat cagcatgccn cggtgaatac
1381 gtt
//
LOCUS DSFRR166 1372 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.multivorans 16S ribosomal rRNA.
ACCESSION M34405
KEYWORDS ribosomal 16S rRNA.
SOURCE D.multivorans (ATCC 33890) rRNA.
ORGANISM Desulfococcus multivorans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1372)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1372 16S rRNA
BASE COUNT 329 a 295 c 429 g 291 t 28 others
ORIGIN
1 taattgacga gtttgatcct ggctcagaat gaacgctggc ggcgtgctta acacatgcaa
61 gtcgcacgag aacgcttctg cttgcagggg ctagtaagtg gcgcacgggt gagtaacgcg
121 tgggtaatct acccttggat tcgggataac ccttcgaaag gagtgctaat accggatgaa
181 gtcatttagg ctgtggttta gatgatgaaa ggtggcctct ccgaggaagc taccgtttgg
241 ggatgagccc gcgtaccatt agcttgttgg tggggtaacg gcctaccaag gcgacgatgg
301 ttagctngtc tgagaggatg atcagccaca ctggaactga cacacggtcn ngnctcctac
361 gggaggcagc agtgaggaat tttgcnnnnn gggggnaacc ntnacgcagc aacgccgcgt
421 gagtgaagaa ggccttcggg tcgtaaagct ctgtcagtgg gaagaacctt cgtatgttga
481 atatgcatac ggactgacgg taccacagaa ggaagcaccg gctaactccg tgccagcagc
541 cgcggtaata cggagggtgc aagcgttatt cggaattatt gggcgtaaag agcgcgtagg
601 cggctttgca agtcaggtgt gaaatccggg gctnaacccc ggaagagcat ttgatactgt
661 ggagcttgag tatgggagag gnnagtggna ttcctggtgt agcggtgaaa tgcgtagata
721 tcaggaggaa caccggtggc gaaggcggyt tcctggacca atactgacgc tgaggcgcga
781 aggcgtgggg agcaaacagg attagatacc ctngtagtcc acgcagtaaa cgttgatcac
841 taggtgtagc gggtattgac ccctgctgtg ccggagttaa cgcattaagt gatccgcctg
901 gggagtacga tcgcaagatt aaaactcaaa ggaattgacg ggggcccgca caagcggtgg
961 agcatgtggt ttaattcgan nnaacgcgaa gaaccttacc tggatttgac atccgcggrr
1021 ttcttatgaa aatagggagt gccttcgggg agccgcgaga caggtgctgc atggctgtcg
1081 tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ctrtcttcag
1141 ttaccatcat taagttgggg actctgaaga tactgccccg gttaacggga ggaaggtggg
1201 gatgacgtca agtcctcatg gcctttatgt ccagggctac acacgtgcta caatgggctg
1261 tacnnagggn agcgatctcg cgagagcaag ccaatcccaa aaagcagtcc ttagttcggr
1321 ttggagtctg caactcgact ccatgaagct ggaatcgcta gtaatcgcgg at
//
LOCUS DSLRR165 1381 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.pigra 16S ribosomal rRNA.
ACCESSION M34404
KEYWORDS ribosomal 16S rRNA.
SOURCE D.pigra (ATCC 29098) rRNA.
ORGANISM Desulfomonas pigra
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1381)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1381 16S rRNA
BASE COUNT 322 a 294 c 388 g 250 t 127 others
ORIGIN
1 gaactggaga gtttgattct ggctcagatt gaacgctggc ggcgtgctta acacatgcaa
61 gtcgtacgcg aaangggact tcggtcccga gtaaagtggc gcgacgggtg agtaacgacg
121 tggataatct gcctctatga tgggnataac rgttggaaac gactgctaat accgaatacg
181 ctcatgatga ncttgtggaa aggtggcctn tgcttgcaag ctatcgcata gagatgagtc
241 cgcgtcccat tagctagttg gtggggtaac ggcctaccaa ggcaacgatg ggtagccgat
301 ctgagaggat gatcggccac actggaactg aaacacggtc cagactccta cgggaggcag
361 cagtggggaa tattgcgcaa tgggcgaaag cctgacgcag cgacgccgcg tgagggatga
421 aggtcttcgg atcgtaaacc tctgtcagaa gggaagaaac tagggtgctc taatcatcat
481 cctaactgac ggtaccttca aaggaagcac cggctaactc cgtgccagca gccgcggtaa
541 tacggagggt gcaagcgtta atcggaatca ctgggcgtaa agcgcacgta ggctnttatg
601 taagtcaggg gtgaaagccc acggctnaac cgtggaactg cccttgatac tgcacggctn
661 gaatnnnnnn nnnnggtncg gnattccagg tgtaggagtg aaatccgtag atatctggag
721 gaacatcagt ggcgaaggcg gcnacctnga cctngaccgn nattgacgct naggtgcnna
781 agcgtgggga gcaaacagga ttagataccc tngtngtcca cgccgtaaac gatggatgct
841 agatgtcggg atgtannnct cggtgtcgta gtnaacgcgt naagcatccg gcctggggag
901 tacggnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
961 nnnnnnnnnn nnnnnnnaac gcgaagaacc ttacctaggt ttgacatctg gggaaccctc
1021 ccgaarrtga ggrrtrrcct tcggggagcc ccaagacagg tgctgcatgg ctgtcgtcag
1081 ctnnnnnnnt gagatgttgg gttaagtccc gcaacgagcg caacccctat gcatagttgc
1141 cagcaagtaa agttgggcac tctatgcaga ctgnncgggt tracagggga ggaaggtggg
1201 gacgacgtca agtcatnatg gcccttacgc ctagggctac anacgtacta cnatggcacg
1261 cacaaagggc agcgataccg tgagatggag ccaatcccaa aaaacgtgtc ccagtccgga
1321 ttgcagtctg caactcgact gcatgaagtc ggaatcgcta gtaattcgag gtcagcatnc
1381 c
//
LOCUS DSORR1618 1402 bp ss-rRNA BCT 24-JUL-1990
DEFINITION Desulfotomaculum orientis 16S ribosomal rRNA.
ACCESSION M34417
KEYWORDS ribosomal 16S rRNA.
SOURCE D.orientis (strain Singapore I, NCIMB 8382) rRNA.
ORGANISM Desulfotomaculum orientis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1402)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1402 16S rRNA
BASE COUNT 301 a 253 c 386 g 255 t 207 others
ORIGIN
1 ttttttggag agtttgrtcc tggctcagga ngaacgctgg cggcgtgcct aacacatgca
61 agtcgaacgg agaattgaat aagcttgctt agacaattst tagtggcgga cgggtgagta
121 acgcgtgggt aacctgccca taaakccggg acaacccttg gaaacgaggg ctaataccgg
181 atatcttaga tyytggcatc aggayttaag gaaagatggc ctctgaacat gctatcgatt
241 atggatggac ccgcgtctga ttagctagtt ggtggggtaa aggcctacca aggcgacgat
301 cagtagccgg cctgagaggg tgaacggcca cactgggact gagacacggc cnngactcct
361 acgggaggca gcagtgggga atcttccgca atggacgaaa gtctgacgga gcaacgcngc
421 gtgtatgatg aaggtcttcg gattgtaaag tactgtcagt gtggggacgg tgctntttga
481 aaatattgag gagacatgac ggtacccaag gaggaagccc cggctaannn nnnnnnnnnn
541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncggaatta ctgggcgtta agcgcgcgtt
601 ggcggatgct taagtcckgt gtgaaagatc agggctcaac cctnagagtg cattcggtaa
661 tgggtattgn nnnnnnnnga gaggaaagtg gaattccacg tgtagcggtg aaatgcgtag
721 atatgtggag gaacaccagc ggcgaaggcg actttctgga ctgtaactga cgctgaggcg
781 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgag
841 tgctaggtgt agagggtatc nncacnttct gtgccgcagt taacgcgata agcactccgc
901 ctggggagta cggccgcaag gttnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
961 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1021 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngctg tgagacaggt ggtgcatggt
1081 tgtcgtcagc ttgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccctgta
1141 tttagttgct acgagtaagg ttgagcnctc tagatagact gccggtgaca aaccggagga
1201 aggtggggat gacgtcaaat catnatgccc nttatgtcct gggctacaca cgtgctacaa
1261 tggccggtac agacggaagc gaagccgcga ggtgaagcca atccgagaaa gccggtctcr
1321 gttcggattg caggctgcaa ctcgcctgca tgaagtcgga atcgctagta atcgcaggtc
1381 agcatactgc ggtgaatacg tt
//
LOCUS DSORR1619 1368 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.ruminis 16S ribosomal rRNA.
ACCESSION M34418
KEYWORDS ribosomal 16S rRNA.
SOURCE D.ruminis (strain DL NCIMB 8452) rRNA.
ORGANISM Desulfotomaculum ruminis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1368)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1368 16S rRNA
BASE COUNT 341 a 284 c 420 g 267 t 56 others
ORIGIN
1 ttatggagag tttrgtcctg gctcaggacg aacgctggcg gcgtgcttaa cacatgcaag
61 tcgaacggag tttaaaatga aacctngtga tttttraact tagtggcgga cgggtgagta
121 acgcgtggat aacctggcgg atagaccggg ataacagttg gaaacgactg ctaataccgg
181 atacgctcct aggaacgcat gttctaagga ggaaagattt atcgctaacc gatggatccg
241 cgtcccatta gctagttgga ggtgtaacgg acccctaagg cgacgatggg tagccggcct
301 gagagggcga acggccacac tggaactgag acacggcctn gactcctacg ggaggcagca
361 gtggggaata ttccgcaatg ggcgaaagcc tgagggagca acgcngcgtg agggaagaag
421 gctttcgggt tgtaaacctc tgtcttaagg gaagaacaaa atgacggtac cttgagagga
481 agccccggct annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnag cgttgtccgg
541 aattactggg cgtaaagggc gcgtaggtgg tgcattaagt tagaggtgaa agtgccgggc
601 nnaaccccgt gaggcctctg atactggtgg acttgagtgc aggagagggg agcagaattc
661 ccagtgtagc ggtgaaatgc gtagatattg ggaggaatac cagtggcgaa ggcggcnntc
721 tngactgtaa ctgacactga ggcgcgaaac cgtggggagc gaacaggatt agataccctg
781 gtagtccacg ccgtaaacga tgagtgctag gtgttggggg tatcgaccac ccccagtgcc
841 gtagttaaca caataagcac tccgcctggg gagtacggtc gcaagactga aactcaaagg
901 aattgacggg ggcccgcaca agcggtggag tatgtggttt aattcgacgn aacgcgaaga
961 accttaccag ggcttgacat ccgtctnaaa agtataggga tatactcttg ccttcggggn
1021 nacagagaga caggtggtgc atngttgtcg tcagctcgtg tcgtgagatg ttgggttaag
1081 tcccgcaacg agcgcaaccc ctracattcg ttgccagcga gtaatgtcgg gaactcgaat
1141 gtgactgccg ttgacaaaac ggaggaaggt ggggatgacg tcaaatcatc atgcccctta
1201 tgtcctgggc tacrcgcgta ctacnatggc cggtacagac ggaagcgaag ctgtgaagtg
1261 aagcaaatct gagaaagccg gtcccagttc ggattgtagt ctgcaactcg actacatggg
1321 gtcggaatcg ctagtaatcg caggtcagca tactgcggtn aatacgtt
//
LOCUS DSPRR1613 1390 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.hydrogenophilus 16S ribosomal rRNA.
ACCESSION M34412
KEYWORDS ribosomal 16S rRNA.
SOURCE D.hydrogenophilus (DSM 3380) rRNA.
ORGANISM Desulfobacter hydrogenophilus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 1390)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1390 16S rRNA
BASE COUNT 354 a 297 c 416 g 292 t 31 others
ORIGIN
1 actgagagtt tggacctggc tcagaatgaa cgctggcggc gtgcttaaca catgcaagtc
61 gaacgagaaa argattgctt gcaatcctga gtagagtggc gcrcgggtga gtaacrcgta
121 gataatctgc cttcaagcct ggaataacta ttcgaaaggg tagctaatac cggataaagt
181 cgatttacac nagtrgattg atgaaagatt gcctcttctt gaaagcaatt gtttggggat
241 gagtctgcgg accattagct tgttggtggg gtaaaggcct accaaggcag cgatggttag
301 ctggtctgag aggatgatca gccacactgg aactggaaca cggtccagac tcctacggga
361 ggcagcagtg aggaattttg cgcaatgggg gcaaccctga cgcagcaacg ccgcgtgagt
421 gaagaaggcc tttgggtcgt aaagctctgt caactgggaa gaaattagct tctattaata
481 gtagttgcta ttgacggtac cagtngacga agcgccggcn aactcngtgc cagcagcngc
541 ggtaacacgg ggggcgcnag cgtnattcgg aattattggg cgtaaagggc gcgcaggcgg
601 tnttgtccgt caggtgtgaa agctcggggc tcaaccccgg aagtgcactt gaaacagcaa
661 gacttgaata cgggagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc
721 aggaggaaca ccgatggcga aggcatctct ctrgaccgat attgacgctn aggcgcnaag
781 gcgtgggtag cgaacgggat tagatacccc ggtagtccac gcagtaaacg ttgtacactc
841 ggtgtngcgg atattnaaat ctgctgtgcc taagctaacg cattaagtgt accgcctggn
901 ragtacggtc gcaagactaa aactcaaagg aattgacggg ggcccgcaca agcggtggag
961 catgtggttt aattcgacgn aacgcgaaga accttacctg ggtttgacat cctgtgaata
1021 ttgtgtaatt gcaatagtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtca
1081 gctcgtntcg tgagatgttt ggttaagtcc agcaacgagc gcaaccctta tcgtcagttg
1141 ccagcacntr atggtgggaa ctctggcgag actgccccrg nnnacgggra ggaaggtggg
1201 gatgacgtca agtcctcatg gcccttatac ccagggctac acacgtgcta caatggtagg
1261 tacaaagggc agcgactctg crragggaag cgaatcccaa aagcctatct cagtccggat
1321 tggggtctgc aactcgaccc catgaagttg gaatcgctag taatcgcgga tcagcatgcc
1381 gcggtnaata
//
LOCUS DSPRR1614 1396 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.curvatus 16S ribosomal rRNA.
ACCESSION M34413
KEYWORDS ribosomal 16S rRNA.
SOURCE D.curvatus (DSM 3379) rRNA.
ORGANISM Desulfobacter curvatus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 1396)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1396 16S rRNA
BASE COUNT 339 a 292 c 432 g 285 t 48 others
ORIGIN
1 actggagagt ttgatcctgg ctcagaatga acgctggcgg cgtgctnaac acatgcaagt
61 cgaacgagaa agggattgct tgcaatcccg agtagagtgg cgcacgggtg agnnacacgt
121 rgatratctg ccttcaagcc tgggataact rttcgaaagg gtagctaata ccggataaag
181 tcggnncacn nnagtggatt gatgaaagat tgcctcttct tgaaagcaat tgtttgggga
241 tgagtttgcg taccattagc ttgttggtgg ggtaacggcc taccaaggca acgatggtna
301 gctggtctga gaggatgaca ggccacactg gaactgagac acggtcgact cgtgaacggg
361 aggcagcagt gaggaatttt gcgcaatggg ggaaaccctg acgcagcgac gccgcgtgag
421 tgaagaaggc ctttgggtcg taaagctctg tcgactggga agaagttcca atgttttaat
481 agaaggtggt attgagggta ccaggggagg aacgaccggc nnactccgtg ccagcagcag
541 cggtaacacn nngggggcna gcgttattcg gaanattggg cntaaagggc gcgcaggcgg
601 tcttgtccgt caggtgtgaa agctcggggc tcaaccccgg aagtgcactt gaaacagcaa
661 gacttgaata ccgtagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc
721 aggaggaaca ccgatggcga aggcatctct ctggaccgat attgacgctn aggcgnnaag
781 gcgtgggtag cgaacgggat tagatacccc ggtagtccac gcagtnaacg ttgtacnctc
841 ggtgtngcgg atatnnaaat ctgctgtgcc naagctaacg cattaagtgt accgcctggg
901 gagtacggtc gcaagactaa aactcnaagg aattgacggg ggcccgcaca gtcggtggag
961 catgtggttt aattcgacgn aacgcgaaga accttacctg ggtttgacat cctgtgaata
1021 yygrgtaatn gnrrtngtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtca
1081 gctcgtgtcg tgagatgttt ggttaagtcc agcaacgagc gcaaccctta tcgttagttg
1141 ccagcacgtt atgtgggaac tctaacgaga ctgcccgggt nnaccgggag gaaggcgggg
1201 atgacgtcaa gtcctcatgg cccttatatc cagggctacg cgcgtgctac natggtaggt
1261 acaaagggca gcgactttgc rraaagaagc gaatgcccaa aagcctatcg cngtccggat
1321 tggggtctgc nactcgaccc catgaagttg gaatcgctag taatcgcgga tcngcatgcc
1381 gcggtnaata tgttcc
//
LOCUS DSPRR1615 1373 bp ss-rRNA BCT 24-JUL-1990
DEFINITION Desulfobacter latus 16S ribosomal rRNA.
ACCESSION M34414
KEYWORDS ribosomal 16S rRNA.
SOURCE D.latus (DSM 3381) rRNA.
ORGANISM Desulfobacter latus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 1373)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1373 16S rRNA
BASE COUNT 322 a 268 c 393 g 267 t 123 others
ORIGIN
1 ctggagagtt tgatcctggc tcagaatgaa cgctggcggc gtgcttaaca catgcaagtc
61 gaacgagaaa gggattgctt gcaatcctga gtagagtggc gcacgggtga gtaacacgta
121 gataatctgc cttcaagcct gggataacta ttcgaaaggg tagctaatac cggataaagt
181 cgattyacay nagtnnattg atgaaagatt gcctcttctt gaaagcaatt gtttggggat
241 gagtttgcgt accattagct agttggtggg gtnaaggcct accaaggcnn cgatggnnag
301 ctggtctgag aggatgatca gccacactgg aactggaaca cggtccngnc tcctacggga
361 ggcagcagtg aggaattttg cgcratgggg gcaacccnna cgcagcaatg ccgcgtgagt
421 gaagaaggcc tttgggtcgt aaagctctgt caacaaggaa gaaattagga attattaata
481 gtngtttcta ttgacggtac ttgtngagga agcgccggct nactccgtgc cagcagccgc
541 ggtaacacgg ggggcgcnag cgttattcgg aattattggg cgtaaagggc gcgcaggcgg
601 tnttgtccgt naggtgtgaa agcccggggc tcaaccccgg aagagcactt gaaacagcaa
661 gacttgaata cgggagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc
721 aggaggaaca ccgatggcga aggcatctct ctngaccgrt attgacgctn aggcgnnaag
781 gcgtggggag cgaacgggat tagatacccc ggtagtccac gcagtaaacg ttgtacgctc
841 ggtgtngcgg atattnaaat ctgctgtgcc naagctaacg cattaagtgt accgcctggg
901 nagtacggtc gcaagactaa aactcraagg aattgrcggg ggcccgcrca agcggtggag
961 catgtggttt aattcgatnn aacgcgaaga accttacctg ggtntgacat cctgtgaata
1021 tcccgtnatt gggatngtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtcn
1081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1141 nnnnnnnnnn nnnntgggaa ctctggcgag actgccccgg tnnacgggga ggaaggtggg
1201 gatgacgtca agtcctsrtg gccctnatat ccagggctac acacgtgcta caatggtagg
1261 tacaaagggc agcgactttg crrraagaag cgaatcccaa aangcctntc ccagtccgaa
1321 ttggggtctg caactcgacc ccatgaagtt ggaatcgcta gtaatcgcgg atc
//
LOCUS DSPRR1616 1397 bp ss-rRNA BCT 24-JUL-1990
DEFINITION Desulfobacter sp. Vulgaris 16S ribosomal rRNA.
ACCESSION M34415
KEYWORDS ribosomal 16S rRNA.
SOURCE Desulfobacter sp. (strain 3ac10 DSM 2035) rRNA.
ORGANISM Desulfobacter sp.
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 1397)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1388 16S rRNA
BASE COUNT 363 a 290 c 404 g 296 t 44 others
ORIGIN
1 actggagagt ttgrtcctgg ctcagaatga acgctggcgg cgtgcttaac acatgcaagt
61 cgaacgagaa agggattgct tgcratcctg agtagagtgg cgcacgggtg agtaacgcgt
121 agataatctg ccttcaagcc tgggataact attcgaaagg gtagctaata ccggataaag
181 tcgattcaca taagtaaatt gatgaaagat tgcctcttct tgaaagcaat tgtttgggga
241 tgagtttgcg taccattagc ttgttggtgg ggtaaaggcc taccaaggca acgatggtna
301 gctggtctga gaggatgatc agccacactg gaactggaac acggtccaga ctcctacggg
361 aggcagcagt gaggaatttt gcrgcrratg ggggcaaccc tgacgcagca acgccgcgtg
421 agtgaagaag gcctttgggt cgtaaagctc tgtcaacaag gaagaaatta caattattta
481 atagataatt gtattgacgg tacttgtaga ggaagcgccg gcnaactcnn nnnnnnnnnn
541 nnnggtaaca cggggggcgc aagcgttaat tggaattatt gggcgtaaag ggcgcgcagg
601 cggtcttgtc cgtcaggtgt gaaagctcgg ggctcaaccc cggaagtgca cttgaaacag
661 caagacttga atacgggaga ggagagagga attcctggtg tagaggtgaa attcgtagat
721 atcaggagga acaccgatgg cgaaggcatc tctctggacc gatattgacg ctnaggcgct
781 aaggcgtggg tagcgaacgg gattagatac cccggtagtc cacgcagtaa acgttgtaca
841 ctcggtgtgg cggatattta aatctgctgt gcccaagcta acgcattaag tgtaccgcct
901 gggaagtacg gtcgcaagac taaaactcaa aggaattgac gggggcccgc acaagcggtg
961 gagcatgtgg tttaattcga cgnaacgcga agaaccttac ctgggtttga catcctgtga
1021 atatcccgta attgggatag tgccttcggg agcacagaga caggtgctgc atggctgtcg
1081 tcagctcgtg tngtgagatg tttggttaag tccagcaacg agcgcaaccc ttacgtcagt
1141 tgccagcaca taatggtggg aactctggcg aactgcccgc tanannnnnn nnnnnnnnnn
1201 nntgacgtca agtcctcatg gcccttatat ccagggctac acacgtgcta caatggtagg
1261 tacaaagggc accgactctg caaagggaag cgaaatccca aaagcctatc tcagtccgga
1321 ttggggtctg caactcgacc ccatgaagtt ggaatcgcta gtaatcgcgg atcagcatgc
1381 cgcggagaat atgttcc
//
LOCUS DSPRR1617 1472 bp ss-rRNA BCT 24-JUL-1990
DEFINITION Desulfobacter sp. 16S ribosomal rRNA.
ACCESSION M34416
KEYWORDS ribosomal 16S rRNA.
SOURCE Desulfobacter sp. (strain 4ac11 DSM 2057) rRNA.
ORGANISM Desulfobacter sp.
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 1472)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1472 16S rRNA
BASE COUNT 368 a 312 c 442 g 313 t 37 others
ORIGIN
1 aactggagag tttgrtcctg gctcagaatg aacgctggcg gcgtgcttaa cacatgcaag
61 tcgaacgaga aagggattgc ttgcaatcct gagtagagtg gcgcrcgggt gagtaacrcg
121 tagataatct gccttcaagc ctgggataac tatccgaaag gatagctaat accggataaa
181 gtcgatttac acaagtagac cgatgaaaga ttgcctcttc ttgaaagcaa ttgtttgggg
241 atgagtttgc gtaccattag ctaggttggt ggggtaaagg cctaccaagg cngngatggt
301 nagctggtct gagaggatga tcagtcacac tggaactgga acgcggtcca gnctcctacg
361 ggaggcagca gtgaggaatt ttgcgcaatg ggggcaaccc tgacgcagcg acgccgcgtg
421 agtgaagaag gccnntgggt cgtaaagctc tgtcwacwag gaagaaatta ggatctatta
481 atagtgggtt ctattgacgg tacttttgga ggaagcgccg gctnactckk tgccagcagc
541 ngcggtaaca cggggggcgc nagcgttatt cggaattatt gggcgtnaag ggngcgcagg
601 cggtcttgtc tgttaggtgt gaaagcgggg gctcaacccc ggaagtgcac ttgaaacagc
661 aagacttgaa tacgggagag gagagaggaa ttcctggtgt agaggtgaaa ttcgtagata
721 tcaggaggaa caccgatggc gaaggcatct ctctggaccg atattgacgc tnaggcgnna
781 aggcgtgggt agcgaacggg attagatacc ccggtagtcc acgcagtnaa cgttgtacac
841 tcggtgtagc ggatattnaa atctgctgtg ccnaagctaa cgcattaagt gtaccgcctn
901 gnnngtacgg tcgcaagact aaaactcaaa ggaattgacg ggggcccgca caagcggtgg
961 agcatgtggt ttaattcgac nnaacgcgaa gaaccttacc tgggtttgac atcctgtgaa
1021 tatcccgtaa ttgggrtrgt gccttcggga gcacagagac aggtgctgca tggctgtcgt
1081 cagctcgtgt cgtgagatgt ttggttaagt ccagcaacga gcgcaaccct tatcgtcagt
1141 tgccagcacg taatggtggg aactctggcg agactgcccc ggtcaacggg gaggaaggtg
1201 gggatgacgt caagtcctca tggcccttat atccagggct acacacgtgc tacaatggta
1261 ggtacaaagg gcagcgactt tgcaaaaagg agcgaatccc naaaagccta tctcagtccg
1321 aattggggtc tgcaactnga ccgnatgaag ttggaatcgc tagtaatcgc ggatcagcat
1381 gccgcggtna atatgttccc gggccttgta cacaccgccc gtcacaccat ggaagttgat
1441 tatacccgac gtcrctgggc taactattta ga
//
LOCUS DSURR1611 1371 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.propionicus 16S ribosomal rRNA.
ACCESSION M34410
KEYWORDS ribosomal 16S rRNA.
SOURCE D.propionicus (ATCC 33891) rRNA.
ORGANISM Desulfobulbus propionicus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1371)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1371 16S rRNA
BASE COUNT 333 a 271 c 397 g 276 t 94 others
ORIGIN
1 naactggaga gtttnatcct ggctcagaac gaacgctggc ggcgtgctta acacatgcaa
61 gtcgaacgcg aaagggactt cggtcctgag taaagtggcg cacgggtgag taacgcgtag
121 ataacctnnc ttcatgtctg gaataatacg ccgaaagggg trctaatacc ggatattntn
181 gctnttcata agttatgcaa gcraaaggtg gcctctgcat atgctactgc rtgaagaggg
241 gtctgcgtac cattagctag taggtggggt aatggcctac ctaggcgacg atggtnagcg
301 ggtctgagag gatgatccgc nacactggca ctggaacacg ggcnngnctc ctacgggagg
361 cagcagtgag gaatattgcg caatgggggn aacccnnacg cagcgacgcn gcgtgagtga
421 ggaaggcctt cgggtcgtaa agctctgtca agaggaaaga agtgtacaat ggctnatacc
481 tnttgtnttt gacggtacct ctnaaggaag caccggctna ctccgtncca gcagcnncgg
541 taatacggag ggtgcaagcg tnnttcggaa tcactnggng tnaagggcgc gtaggcggnt
601 tggtaagtca gatgtgaaag cccacggctn aactgtggaa gtgcatttga aactgtcagg
661 cntgagtacc agaggggaaa gtggaattcc cggtgtagag gtnaaattcg tagatatcgg
721 gaggaatacc ggtggcgaag gcgactttct ggctngatac tnacgctnag gcgcnaaagc
781 gtggggagca aacaggatta gataccctng tagtccacgc tgtaaacgat gtgaactaga
841 tgcagggggt gtnnatccnn nctgtgtngc agctaacgca ttnagttnnn nnnnnnnnna
901 gtacggnngc aagattaaaa ctcaaaggaa ttgacggggn ncngcacaag cggtggagta
961 tgtggtttaa ttcgatnnaa cgcgaagaac ctnacctngt nttgacatcc cggnaatctn
1021 ttggaaacaa gagagtgctt ccgnnagnat ctggagacag nnnctgcatn gctntcgtna
1081 nnnnntgttg tgagatgttg ggttaagtcc cnnaacgagc gcaacccttg cctttagttg
1141 ccagcagttc ggctggrcac tctaaaggga ctgccggtgt taaaccggag gaaggtgggg
1201 atgacgtcaa gtcctnatgg ccnntatgac cagggctaca cacgtcctac aatggccgat
1261 acaaagggca gcgacactgc gaggtggagc taatcccata aaatcggnct cagtccggat
1321 tggagtctgc aactcgactc catgaagttg gaatcgctag taatcgngga t
//
LOCUS DSURR1612 1365 bp ss-rRNA BCT 24-JUL-1990
DEFINITION Desulfobulbus sp. 16S ribosomal rRNA.
ACCESSION M34411
KEYWORDS ribosomal 16S rRNA.
SOURCE Desulfobulbus sp. (strain 3pr10, DSM 2058) rRNA.
ORGANISM Desulfobulbus sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1365)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1365 16S rRNA
BASE COUNT 299 a 267 c 389 g 267 t 143 others
ORIGIN
1 gaagagtttn attctggctc agaacgaacg ctggcggcgt gcntnncaca tgcaagtcga
61 accgnnnnnn nnnnnnnntc ctagtagtgg cgcacgggtg agtracgcgt agatracctg
121 ncctnatgtc tgggatrccc caccgaaagg tgggctaata ccggatgtnc tnntntttca
181 tnagcgtaat aaagaaaagg tggcctctga tattaagctn ctgcrtgagr aggggtctgc
241 gtaccattag ctagttggtg gggtnagngc ctaccaaggc atcgatggtn agcgggtctg
301 agngnatgat ccgtcacact ggcactgaaa cncgggccng rctccnacgg gaggcngcng
361 tgaggaatnt tgcgcnatgg gggcnaccct nacgcagcga cgccgcgtga gtgagnaagg
421 ctttcgggtc gtaaagctct gtnnaaggga agaaatgcat aatnattnat ncttnttatg
481 tttgacggta cctttaaagg aagcaccggc taactccgtg ccagnagccg cggtaatacg
541 gagggtgcaa gcgttgtccg gaattactgg gcgtnaaggg cacgcaggcg gcctgataag
601 tcagatgtga aagcccacgg ctnnaccgtg gaagtgcatt tgaaactatt aggnnngagt
661 atcagagggg aaagtggaat tcctggtgta gaggtgaaat tcgtagatat caggaggaat
721 rccagtggcg aaggcgactt tctggctgaa tactgacgct gaggtgcgaa agcgtgggga
781 gsaaacagga ttagataccc tggtagtcca cgccgtaaac gatgtcaact aggtgtaggg
841 ggggttnatc csstctgtgc cgcagctrac gcattragtt gacngcctgg ggagtacggt
901 cgcaagatta aaactcnaag naattgrcgg gggcccgcrc nagcggtgga gtatgtggtt
961 taattcgatg nnacgcgaag aaccttacct ggtcttgaca tcccaagaat cttctagaaa
1021 tagaagagtg cttttcggag aacttggtga caggtgctgc atggctgtcg tcagcnnnnn
1081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ttgccagtag
1141 ttcggctgag cactctaaag ggactgccgg tgtnnaaccg gaggaaggtg gggatgacgt
1201 caagtcctca tggcsnntat gaccagggct acacacgtac tacnatggcc ggtacnaagg
1261 gcagcgacag ggcgrcctna agcgaatccc gtraagccgg nctcngtccg gattggagtc
1321 tgcnactcga ctccatgaag tcggaatcgc tagtaatcgt ggatc
//
LOCUS DSVRR168 1527 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.variabilis 16S ribosomal rRNA.
ACCESSION M34407
KEYWORDS ribosomal 16S rRNA.
SOURCE D.variabilis (strain DSM 2060) rRNA.
ORGANISM Desulfosarcina variabilis
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Sulfolobales.
REFERENCE 1 (bases 1 to 1527)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1527 16S rRNA
BASE COUNT 380 a 338 c 458 g 328 t 23 others
ORIGIN
1 ttaattggag agtttgatcc tggctcagaa tgaacgctgg cggcgtgctt aacacatgca
61 agtcgtacga gaacgctcta gcttgctaga gcaagtaaag tggcgcacgg gtgagtaacg
121 cgtgggtaac ctaccttcga attggggata acgttgcgaa agcgacgcta ataccgaata
181 atatccaaat atcttcggat ttttggatca agggtggcct cttcttgaaa gctactgttt
241 gaagatgggc ccgcgtacca ttagcttgtt ggtggggtaa tggcctacca aggcaacgat
301 ggttagctgg tctgagagga tgatcagcca cactggaact gacacacggt ccagactcct
361 acgggaggca gcagtgagga attttgcgca atgggcgaaa gcctgacgca gcaacgccgc
421 gtgagtgatg aaggctttcg ggtcgtaaag ctctgtcaag tgggaagaac ctatataatg
481 ttaatatcat tatatactga cggtaccact gaaggaagca ccggctaact ccgtgccagc
541 agccgcggta atacgggggg tgcaagcgtt attcggaatc attgggcgta aagagcacgt
601 aggcggcctg gtaagtcaga tgtgaaagcc cggggcttaa ccccggaagt gcatttgata
661 ctgtcaggct tgagtatggg agagggaagt ggaattcctg gtgtagaggt gaaattcgta
721 gatatcagga ggaacaccgg tggcgaaggc gacttcctgg accaatactg acgctgaggt
781 gcgaaggcgt gggtagcaaa caggattaga taccctggta gtccacgcag taaacgttga
841 tcactaggtg tagcrgggta ttgacccctg ctgtgccgca gttaacgcat taagtgatcc
901 gcctggggag tacggtcgca agattaaaac tcaaaggaat tgacgggggc ccgcacaagc
961 ggtggagcat gtggtttaat tcgannnaac gcgaagaacc ttacctggat ttgacatccc
1021 gggaatccta tcgaaagact ggagtgccct tcggggagcc cggwgacagg tgctgcatgg
1081 ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caacccttgt
1141 ctttagttac catcatttag ttggggactc taaagatact gccccggtca acggggagga
1201 aggtggggat gacgtcaagt cctcatggcc tttatgtcca gggctacaca cgtgctacaa
1261 tgggcggtac aaagggtagc gaacccgcga gggtaagcca atcccaaaaa gccgtcccaa
1321 gttcggattg gagtctgcaa ctcgactcca tgaagctgga atcgctagta atcgcggatc
1381 agcatgccgc ggtgaatacg ttcccgggcn ttgtacacac cgcccgtcac accatgagag
1441 ttggttgtac ccgaagtcgt cgggctaacc cttcggggat gcangcgcct aaggtatggc
1501 tgannnnnnn nnnnnnnnng taacaag
//
LOCUS DVURR16 1388 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.vulgaris 16S ribosomal rRNA.
ACCESSION M34399
KEYWORDS ribosomal 16S rRNA.
SOURCE D.vulgaris (strain Hildenborough DSM 644) rRNA.
ORGANISM Desulfovibrio vulgaris
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1388)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1388 16S rRNA
BASE COUNT 306 a 315 c 418 g 248 t 101 others
ORIGIN
1 taactggaga gtttgattct ggctcagatt gaacgctggc ggcgtgctta acacatgcaa
61 gtcgngcgtg aaaggacttc ggyccgagta aagcggcgcr cgggtgagta acgcgtggat
121 gatctaccca tgagttggga ataacggctg gaaacggtcg ctaataccga atacgctccg
181 atttcracnt cgggggaaag gtggcctctg cttgcaagct actgctcatg gatgagtccg
241 cgtcccatta gctagttggt ggggtaacgg cccacnaagg cgacratggg nagccggtct
301 gagaggatga ccggccacac tgggactgga acacggccna gactcctacg ggaggcagca
361 gtggggaata ttgcgcaatg ggcgaaagcc tnacgcagcg acgccgcgtr agggatgaag
421 gtcctcggat cgtnaacctc tgtcnggagg gaagaaccgc cacggtgctn atcagccgtn
481 gtctgacggt acctccagag gaagcaccgg ctaactccgt gccagcagcc gcggtaatac
541 ggagggtgca agcgttaatc ggaatcactg ggcgtaaagc gcacgtaggc tgcttggtaa
601 gtcaggggtg aaagccggnn nctcaaccgc ggaattgcct ttgatactgc caagcnagag
661 tccgggagag ggtagtggaa ttccaggtgt aggagtgaaa tcmgtagaga tctggaggaa
721 catcagtggc gaaggcgact acctggaccg gnactgacgc tnaggtgcnn aagcgtgggg
781 agcaaacagg attagatacc ctngtagtcc acgccgtaaa cgatggatgc taggtrtcgg
841 ggccttgarr cttcggtgcc gtagttaacg cgttaagcaa tccgcctngg gagtacggtc
901 gcnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
961 nnnnnnnnnn aacgcgaaga accttaccta ggtttgacat ccggaagacc ttcccgaaaa
1021 ggaaggntgc cttcggggaa ttccgagaca ggtgctgcat ggctgtcgtc agctcgtgcc
1081 gtgaggtgtt gggttaagtc ccgcaacgag cgcaacccct attgccagtt gctaccaggt
1141 aatgctgggc tctctggtga gactgccccg gttaacgggg aggaaggtgg ggacgacgtc
1201 aagtcatgat ggcccttacg cctagggcta cacacgtact acaatggcgc atacaaaggg
1261 cagcgatacc rcraggtgga gccaatccca aaaagtgcgt cccagtccgg attgcagtct
1321 gcaactcgac tgcatgaagt tggaatcgct agtaattcga gatcagcatg cncgggtgaa
1381 tgcgttcc
//
LOCUS DVURR161 1370 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.gigas 16S ribosomal rRNA.
ACCESSION M34400
KEYWORDS ribosomal 16S rRNA.
SOURCE D.gigas (ATCC 19364) rRNA.
ORGANISM Desulfovibrio gigas
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1370)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1370 16S rRNA
BASE COUNT 310 a 303 c 393 g 243 t 121 others
ORIGIN
1 taactggaga gtttgattct ggctcagatt gaacgctrgc ggcgtgctta acacatgcaa
61 gtcgagcgag anngggtctt tcgggacctg agtacagcgg cgcacgggtg agtaacgcgt
121 ggatgatcta cctcattgrc cgggataacg gtgggaaacc gctgctaata ctggatacgc
181 ttcatggaaa catgagggaa aggtggcctc tgatttaagc trtcacaatg agatgagtcc
241 gcgtctcatt agcttgttgg tagggtaatg gcctaccaag gctacgatga gtagcggccc
301 tgagagggtg gccrgccaca ctggaactga aacacggtcc agactcctac gggaggcagc
361 artggggaat attgcgcaat gggcgaaagc ctnacgcagc gacgccgcgt gagggaagaa
421 ggccttcggg tcgtaaacct ctgtctggag gaaagaaacc cataggtgtg aatagggctt
481 atggctgacg gtacctccga aggaagcacc ggctaactcc gtgccagcag ccgcggtaat
541 acggagggtg caagcgttaa tcggaatcac tgggcgtaaa gcgcacgtag gctgtcttgc
601 aagtcaggtg tgaaagccct cggctcaacc ggggaactgc acttnaaact gcatgacttg
661 agtacaggag aggatggcag aattccgggt gtaggggtna aatccgtaga tatccggagg
721 aataccagtg gcgaaggcgg ccatctggac tgtcactgac gctgaggtgc rnaagcgtgg
781 ggagcaaaca ggattagata ccctngtagt ccacgctgta aacgatggac gctnggtrtc
841 grggggtcgc ccctcggcgc cgaactaacg cgttnannnn nnnnnnnnnn nnnnnnnnnn
901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
961 nnnnnnnnnn aacgcgaaga accttaccta ggcttgacat cccgggaacc ctcccgaaaa
1021 ggaggnntgc tcttcggaga gcccggagac aggtgctgca tggctrtcgt cagctcgtgc
1081 cgtgaggtgt tgggttaagt cccgcaacga gcgcaacccc tgtccatagt tgccnnnngg
1141 taaagctngn cactctatgg agactgcctt ggcaacaagg aggaaggtgg ggatgacgtc
1201 aagtcatcat ggcccttacg cctagggcta cacacgtact acaatggcga gtacaaaggg
1261 ttgcgaaacc gcgaggttga gccaatccca aaaagctmgt cccagtccgg attgcagtct
1321 gcaactcgac tgcatgaagt cggaatcgct agtaatcccg gatcagcatg
//
LOCUS DVURR162 1400 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.salexigens 16S ribosomal rRNA.
ACCESSION M34401
KEYWORDS ribosomal 16S rRNA.
SOURCE D.salexigens (ATCC 14822) rRNA.
ORGANISM Desulfovibrio salexigens
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1400)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1400 16S rRNA
BASE COUNT 320 a 306 c 393 g 270 t 111 others
ORIGIN
1 actggagagt ttgattctgg ctcagattga acgctggtgg cgtgcttaac acatgcaagt
61 cgtgcgagaa cagttccttc gggaacctag tagagcggcg cacgggtgag taacgcgtgg
121 ataatctacc cagaagactg ggataacagt tggaaacgac tgctaatacc ggatacgttt
181 catatttaac tnnatnagag aaaggtggcc tctntttcaa gctatcactt ttggatgagt
241 ccgcgtttca ttagctagtt ggtaaggtaa cggcttacca aggcgacgat gaatagctgg
301 tctgagagga tgaccagcca cactgggact ggaacacggc cnagactcct acgggaggca
361 gcagtgggnn atattgcgca atgggggaaa ccctgacgca gcgacgccat gtgagggacg
421 aaggctttcg ggtcgtaaac ctctgtcrrg rrggaagaaa ctgtttgarg ctaataccct
481 ntttcacttg acggtaccct cagaggaagc accggctaac tccgtgccag cagccgcggt
541 aatacggagg gtgcgagcgt taatcggaat cactgggcgt aaagcgcgcg taggtggcgc
601 gataagtcag gcgtgaaagc cctcggctca accggggaat tgcrcttgat actgtcgtgc
661 ttgagtctcg gagagggtgg cggaattcca ggtgtaggag tgaaatccgt agatatctgg
721 aggaacacca gtggcgaagg cggccacctg gacgagtact gacgctgagg tgcgaaagcg
781 tggggagcaa acaggattag ataccctngt agtccacgcc gtaaacgatg gatgctagat
841 gtcgggcctt aaccaggttc ggtgtcgaag ttaacgcgat aagcatccnn nnnnnnnnnn
901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
961 nnnnnnnnnn nnnnnnaacg cgaagnacct tacctggact tgacatcctg agaatcctct
1021 agaaatagag gagtgccttc gggnnattca gtgacaggtn ctgcatggct ntcgtcagct
1081 cgtgccgtga ggtgttgggt taagtcccgc aacgagcgca acccctattg ctagttgcca
1141 tcacataatg gtgggcactc tagtgagact gcccgggtca accgggagga aggtgggnac
1201 gacgtcaagt catcatggcc cttacgtcca gggctacaca cgtactacaa tggtggatac
1261 aaagggttgc caagccgcga ggccgagcca atcccaaaaa gtccrtccca gtccggatcg
1321 cagtctgcaa ctcgactgtg tgaagttgga atcgctagta atcccggatc agcatgccgg
1381 ggtnaatacg ttcccgggcc
//
LOCUS DVURR163 1395 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.sapovorans 16S ribosomal rRNA.
ACCESSION M34402
KEYWORDS ribosomal 16S rRNA.
SOURCE D.sapovorans (strain 1pa3 ATCC 33892) rRNA.
ORGANISM Desulfovibrio sapovorans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1395)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1395 16S rRNA
BASE COUNT 334 a 291 c 427 g 286 t 57 others
ORIGIN
1 gaacggagag tttgatcctg gctcagaatg aacgctggcg gcgtgctnaa cacatgcaag
61 tcgaacgaga aagttccttc gggggataag taaagtggcg cacgggtgag taatgcgtgg
121 ataatctacc rttgaattcg gaataacatc gcgaaagcgg tgctaatacc ggatratgtt
181 gtrttttctt cggggaatgc aaccaaagaa tgcctcttca tgaaagcatt tgtttgagga
241 tgagtccgcg tcccattagc tagttggcgg ggtaacggcc caccaaggca gtgatgggna
301 gctngtctga gaggatgatc agccacactg ggactgacac acggccnnga ctcctacggg
361 aggcagcagt gaggaatttt gcgcaatggg ggcaaccctg acgcagcaac gccgcgtgag
421 tgaagaaggc ccttgggtcg taaagctctg tcnatgggga agaagttgtg tggttcaaac
481 agggccatgc attgacggta cccatnaagg aagcaccggc tnactccgtg ccagcagccg
541 cggtnatacg gagggtgcaa gcgttattcg gaattattgg gcgtnaaggg ngcgtaggcg
601 gcctgtnaag tcatctgtga aatttcgggg ctnaaccccg gagctgcatg tgatactggc
661 aggnttgagt atggcagagg aaagcggaat tcctggtgta gcggtgaaat gcgtagatat
721 caggaggaac accrgtggcg aaggcggctn tctngaccna tactgacgct gatgcgcgaa
781 ggcgtgggta gcaaacagga ttagataccc tngtagtcca cgcagtaaac gttgatcact
841 aggtgttgcg ggtattgacc cctgagtgcn gaagctaacg cattaagtga tcngcctggg
901 gagtacgatc gcaagattaa aactcaaagg aattgacggg ggnccgcana agcggtngag
961 crtgtggttt aattcgannn aacgcgcann accttacctg ggcttgaaat ctgtggaact
1021 tgtatgaaag tatggggnnn cttcggggaa ccgcaagaca ggtgctgcat ggctgtngtc
1081 agctcgtgtc gtgagatgtt gggtnaagtc ccgcaacgag cgcaaccctc gtcttcagtt
1141 gccagcacgt aaaggtggga actctgaaga tactgccccg gtnnncgggg nggaaggtgg
1201 ggntgacgtc aagtcctnat ngcnnntntg cccagggctn cacacgtgct ncaatggtat
1261 atacaaaggg nnncgatncc gcgaggtgga nccaatccca tnaagtatgc cacagttcgg
1321 attggagtct gcaactcgac tccatgaagt tggaatcgct agtaatcgcg gatcagcatg
1381 ccgcggtgaa tacgt
//
LOCUS DVURR164 1395 bp ss-rRNA BCT 24-JUL-1990
DEFINITION D.baarsi 16S ribosomal rRNA.
ACCESSION M34403
KEYWORDS ribosomal 16S rRNA.
SOURCE D.baarsi (strain 2st14 DSM 2075) rRNA.
ORGANISM Desulfovibrio baarsi
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1395)
AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
TITLE Natural relationships among sulfate-reducing eubacteria
JOURNAL J. Bacteriol. 171, 6689-6695 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES from to/span description
rRNA < 1 1395 16S rRNA
BASE COUNT 334 a 291 c 427 g 286 t 57 others
ORIGIN
1 gaacggagag tttgatcctg gctcagaatg aacgctggcg gcgtgctnaa cacatgcaag
61 tcgaacgaga aagttccttc gggggataag taaagtggcg cacgggtgag taatgcgtgg
121 ataatctacc rttgaattcg gaataacatc gcgaaagcgg tgctaatacc ggatratgtt
181 gtrttttctt cggggaatgc aaccaaagaa tgcctcttca tgaaagcatt tgtttgagga
241 tgagtccgcg tcccattagc tagttggcgg ggtaacggcc caccaaggca gtgatgggna
301 gctngtctga gaggatgatc agccacactg ggactgacac acggccnnga ctcctacggg
361 aggcagcagt gaggaatttt gcgcaatggg ggcaaccctg acgcagcaac gccgcgtgag
421 tgaagaaggc ccttgggtcg taaagctctg tcnatgggga agaagttgtg tggttcaaac
481 agggccatgc attgacggta cccatnaagg aagcaccggc tnactccgtg ccagcagccg
541 cggtnatacg gagggtgcaa gcgttattcg gaattattgg gcgtnaaggg ngcgtaggcg
601 gcctgtnaag tcatctgtga aatttcgggg ctnaaccccg gagctgcatg tgatactggc
661 aggnttgagt atggcagagg aaagcggaat tcctggtgta gcggtgaaat gcgtagatat
721 caggaggaac accrgtggcg aaggcggctn tctngaccna tactgacgct gatgcgcgaa
781 ggcgtgggta gcaaacagga ttagataccc tngtagtcca cgcagtaaac gttgatcact
841 aggtgttgcg ggtattgacc cctgagtgcn gaagctaacg cattaagtga tcngcctggg
901 gagtacgatc gcaagattaa aactcaaagg aattgacggg ggnccgcana agcggtngag
961 crtgtggttt aattcgannn aacgcgcann accttacctg ggcttgaaat ctgtggaact
1021 tgtatgaaag tatggggnnn cttcggggaa ccgcaagaca ggtgctgcat ggctgtngtc
1081 agctcgtgtc gtgagatgtt gggtnaagtc ccgcaacgag cgcaaccctc gtcttcagtt
1141 gccagcacgt aaaggtggga actctgaaga tactgccccg gtnnncgggg nggaaggtgg
1201 ggntgacgtc aagtcctnat ngcnnntntg cccagggctn cacacgtgct ncaatggtat
1261 atacaaaggg nnncgatncc gcgaggtgga nccaatccca tnaagtatgc cacagttcgg
1321 attggagtct gcaactcgac tccatgaagt tggaatcgct agtaatcgcg gatcagcatg
1381 ccgcggtgaa tacgt
//
LOCUS DRORNP70K 3569 bp ds-DNA INV 24-JUL-1990
DEFINITION D.melanogaster U1 70K small nuclear ribonucleoprotein gene,
complete cds.
ACCESSION M31162
KEYWORDS ribonucleoprotein; small nuclear ribonucleoprotein.
SOURCE D.melanogaster (strain Oregon R) DNA, clones lambda D331.[1,2]; and
adult head, cDNA to mRNA, clones pBS9.[1,3] and lambda R3-2.1.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 3569)
AUTHORS Mancebo,R., Lo,P.C.H. and Mount,S.M.
TITLE Structure and expression of the Drosophila melanogaster gene for
the U1 small nuclear ribonucleoprotein particle 70K protein
JOURNAL Mol. Cell. Biol. 10, 2492-2502 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Mancebo, 09-JAN-1990.
FEATURES from to/span description
pept 614 1960 U1 70K small nuclear ribonucleoprotein, exon 2
(first expressed exon)
pre-msg < 1 2259 snRNP mRNA and intron (alt.)
pre-msg < 1 3367 snRNP mRNA and intron (alt.)
IVS 488 566 U1 70K snRNP, intron A
site 103 182 RNA recognition motif
BASE COUNT 972 a 839 c 919 g 839 t
ORIGIN Chromosome II; map position 27D1,2.
1 gcatgaagga aaatattcta caaaaaactt caattttata aaattcattt aaaatacaaa
61 attgtacgta aacttaacgt aaccgttact cagttatgga atgtgtgagc gagatggtga
121 agcagcagca agtgatgtag caaattgcaa ttgaacgcga gtgggaaaag gggcaactat
181 aaaaccgaga aacttgcttt tagcatggat tcgaacccct tatttatagt actctggatg
241 tccgagacac acacctattt gtggtattta tattttataa cgtaagtagt atatttaatt
301 actaatcagt atttcatgcg gaattcttcc gcttaattca tagaccgcgc gggggtcaca
361 cttgctactc aagccaggcg aaaaactaaa gaaaatcggg aaaatacttg gtctgcaccg
421 aattatattg ctggtactta aacgaagtac cctagattta ttcttgccaa gcggatggct
481 gtttaaggtg agttgcgcca gcgcttacta tcccttgtgg agtaaacaaa ctccaaccta
541 acctcaaact gaccgttttt ttgcagacga ggaacttcag gaaaaggtaa aacaaaacaa
601 aaaagcccac aaaatgaccc aatatctgcc gccgaatctg ctggcgctgt tcgcggcacg
661 ggagcccatc ccgttcatgc cgccggtgga caagctgccg cacgagaaga agtctcgcgg
721 ctacctggga gtggccaagt tcatggccga tttcgaggat cccaaggaca cgccgctgcc
781 gaaaacggtg gaaacgcgtc aggagcggct ggagcgacgc cggcgcgaga aggccgagca
841 ggtggcctac aagctggagc gtgagatagc gctgtgggac cccacagaga tcaaaaatgc
901 cacggaggac ccgtttcgca cgctgttcat tgcacgcatc aactacgaca cgtccgagtc
961 gaagctgcgg cgtgagttcg agttctacgg gcccatcaag aagatcgtcc tgatccacga
1021 ccaggaatca ggtaaaccca agggctacgc cttcatcgag tacgagcacg agcgggacat
1081 gcatgccgcc tacaagcacg ccgatggtaa gaagatcgac agcaagcgcg tcctggtgga
1141 cgtggagcgg gctcgcacgg tcaagggctg gctgcctcga cgcctgggcg gcggtctggg
1201 tggaacgcgc cgcggcggca acgatgtcaa cattaagcac tccggccgcg aggacaacga
1261 gagggaacgc gagcgctacc ggctggagcg ggagcgtgag gatcgcgagg gtcctggacg
1321 cggcggcggc tccaatggcc tggatgcccg gcccggacgc ggtttcggtg cggaacgtcg
1381 acgttcccgc tccagggaac gccgcgaccg tgaacgagat cgcggacggg gcgctgtggc
1441 tagcagcggt cgctcgcgca gccgttctcg cgagcgcaga aaacgacgag cgggcagccg
1501 ggagcggtac gacgagttcg accgccgcga tcggcgggac agggagcgcg agcgtgatcg
1561 cgatcgcgag cgtgagaaga aaaagaagcg ctccaagtct cgcgaacgcg aatcctccag
1621 ggagcgtcgc gaacggaagc gagagagaag ggaccgtgaa cgcggcaccg gatccggcgg
1681 cgatgtcaag gagcgcaagc ccgatttccg tgatatggat gtcatcaaga tcaaggagga
1741 gcccgtcgac gatggctatc ccacatttga ctaccagaac gcgaccatca agcgtgagat
1801 cgacgatgag gatgaggaga agtaccggcc gccgcctgcg catcacaata tgttcagtgt
1861 gccgccgccg cccattttgg ggcgtggaaa tgccagcacg aatcccaatc ccgacaatgg
1921 ccagcagagc tccggcgacc cgagttggtg gcgtcagtag agtcttggag cgatgtgtca
1981 aggttaatat ttctagaagt cagacgtctg tgggtcgcaa ctatttattc attcgactcc
2041 gaggcacccc aatcctggag agcctaaccg tagctatcca ttttacacag aaaattttaa
2101 ggtaccactt ggaggagggc caagtagaac aagattatag aaaacccgac cgctaaacgc
2161 agaatccgct aatgtgtgcg tagcttaaat cacttaaatt tataagtaac tcttaacaaa
2221 tgaatatgaa aacagtaagt aaaataaagc tagccctcat gtgtttgttt ccccaccttt
2281 ggtaaggggg ttaaagggaa tacggagagt caggagctgg aacgctttcg gtggcgcata
2341 caccgtacta tatggttact ccatcccatg gtggttcctg ggattttcta actcacctaa
2401 cataataagc tgaacaatac aaacccttgc actaactcgt gccttttatt ttctctgttt
2461 ttttgcagtt ttcaatcaat tgaaaatctg actctgacta gtgtgaaagc aaaagcataa
2521 gtatttaatc aaacaaacag taatccaaaa acggaaatta gttccgccag tattcgtagc
2581 ccatgcccaa gtctaaattc caagcccaca tcaggtaatt tggtctacgc acaaacctca
2641 ctaatccatg cgtctaccgt tctaggacag ctctagaatc aagacagcta ccgcaatact
2701 tttccaatct cctccgctct gggttgcctg tgttgtgtgg tggtgtggcg tgtaagttga
2761 ttccgggcta agaaattttg taaaccaaaa ccttttccgt aagtttgccc cggtaagatt
2821 acgatatcct cgcctaaccg ccgtggatcg gatggatgag tgagttagtg taagggagct
2881 ttcctctgtt tggtacacat tgcgaactgc tccgatggcc tgtggcaatc tactccattc
2941 atccattatg tctgtaacca atttaccatt tcgatctttt catgtacgtt gagctgattg
3001 ttatcgtact gaagacgaat ccgcggcgga aactcgcata gaacaaacag aaaactgcgc
3061 aaaggtttgc ttttggtaac tggtaacgat tggttgtggt tggtcaggtc aggggtcttt
3121 cgagctggtt agcctccact ttggattgcg gacggtgaca aaattctggg ggtcttttat
3181 ttttgtattg tattgtttgg cagacaggat ccatgcaatt gccttttcgt accgtgcgac
3241 atacgggaaa caaactatcg tgtagcaggg catttttcct attacaccat tattaagaaa
3301 agcggacaga gatcaatgtc acatttagat ttgatcaaat aaagaaaaga attcctcctt
3361 acacgaaaaa gtacttgttt catttctaag tatttatcaa gatgacttga attttttaca
3421 attttcgctt gttaaactgg tctatatgtc ctggataaga tttgcagtgg cgtccttgga
3481 atctctaaaa atgtatacat ttgttgcaaa taaagcattg tgaaatctat ggacattaaa
3541 tgcgacctct attggaaaac attcacata
//
LOCUS FSBAFP05A 946 bp ds-DNA VRT 24-JUL-1990
DEFINITION Macrozoarces americanus antifreeze protein OP5A gene, complete cds.
ACCESSION J03924
KEYWORDS antifreeze protein.
SOURCE Ocean pout adult testis DNA, clone lambda-OP5.
ORGANISM Macrozoarces americanus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes;
Actinopterygii; Gadiformes; Zoarcoidei; Zoarcidae.
REFERENCE 1 (bases 1 to 946)
AUTHORS Hew,C.L., Wang,N.-C., Joshi,S., Fletcher,G.L., Scott,G.K.,
Hayes,P.H., Buettner,B. and Davies,P.L.
TITLE Multiple genes provide the basis for antifreeze protein diversity
and dosage in the ocean pout, Macrozoarces americanus
JOURNAL J. Biol. Chem. 263, 12049-12055 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.H.Hayes, 13-JUN-1988.
FEATURES from to/span description
pept 270 338 antifreeze protein, exon 1
521 715 antifreeze protein, exon 2
IVS 339 520 antifreeze protein, intron A
BASE COUNT 283 a 209 c 186 g 268 t
ORIGIN 211 bp upstream of Sau3AI site.
1 aagcttgtga tagtttggac aaaaacaagt tatactttac ttataagaat ataaaatttc
61 cattgcaatt ggcataagga ggtgtgacac agtgacctac tttcaggcca ataggaaacg
121 ggatatgccg gttaagtcct cccacatact gtatattaga tgcagcacat ggacctgtcc
181 tgtcagaagt ctcagctaca gctttcactt cgatctccga taattaatta attaattaat
241 tattaattaa ttaagtctca gccacagcca tgaagtccgt tattttaacc ggtttgcttt
301 tcgtcctcct ttgtgtcgac cacatgacag ccagccaggt gagatattct tgctccacta
361 aaaaatattc aaaaatgtga gctacagtaa aattcaacag tgttctgttt agaaagacag
421 agaacctttt aagtaaacat ttttagaatt ttctttttca actgtgccat gagaacaata
481 ataacgtctg accttttatc ttccattctt caacggtcag tccgtggtgg ccacccagct
541 gatccccata aatactgccc tgactccggt gatgatggag gggaaggtga ccaacccaat
601 aggcatcccg ttcgcagaga tgtcccaaat agtggggaag caagtgaaca cgccagtggc
661 taagggccaa accctcatgc caaacatggt gaaaacgtac gccgcgggaa agtagttctg
721 agggtgccaa ggagcttctt cccaaaacca aaagaagaaa tgccccctct cacaattaac
781 cctgtttttg tcacaaaccc aagtctgtta actgaacatg tcaaaacctg tggagactgt
841 tgagatttga tgttctgaaa agataaagcc tataaataaa atgttgccca aatttcctgc
901 ctgatgtttt tctttgtcgt tgctacatgg ctttgctgct cggatc
//
LOCUS HUMCRPX 78 bp ss-mRNA PRI 24-JUL-1990
DEFINITION Human C-reactive protein (CRP) mRNA, 5' end.
ACCESSION M35163
KEYWORDS C-reactive protein.
SOURCE Human adult liver, cDNA to mRNA, clone pCRP5.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 78)
AUTHORS Tucci,A., Goldberger,G., Whitehead,A.S., Kay,R.M., Woods,D.E. and
Colten,H.R.
TITLE Biosynthesis and postsynthetic processing of human C-reactive
protein
JOURNAL J. Immunol. 131, 2416-2419 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 > 78 C-reactive protein precursor
/hgml_locus_uid="LR0093W" /nomgen="CRP"
/map="1q21-q23"
sigp 1 54 C-reactive protein signal peptide
matp 55 > 78 C-reactive protein
BASE COUNT 15 a 17 c 22 g 24 t 0 others
ORIGIN
1 atggagaagc tgttgtgttt cttggtcttg accagcctct ctcatgcttt tggccagaca
61 gacatgtcga ggaaggct
//
LOCUS PT3TAUF 117 bp ds-DNA PHG 24-JUL-1990
DEFINITION Bacteriophage T3 early terminator (T3Te) region.
ACCESSION M35171
KEYWORDS early terminator region.
SOURCE Bacteriophage T3 DNA.
ORGANISM Bacteriophage T3
Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE 1 (bases 1 to 117)
AUTHORS Briat,J.-F., Bollag,G., Kearney,C.A., Molineux,I. and
Chamberlin,M.J.
TITLE Tau factor from Escherichia coli mediates accurate and efficient
termination of transcription at the bacteriophage T3 early
termination site in vitro
JOURNAL J. Mol. Biol. 198, 43-49 (1987)
STANDARD simple staff_review
FEATURES from to/span description
signal 85 106 transcription termination signal
BASE COUNT 28 a 37 c 28 g 24 t
ORIGIN
1 cggacggctc cctgcgacac cctagcttcg agaagttccg aggaactgag gacaaccctc
61 aagagaaaat gtaaccaact cactggctca ccttcacggg tgggcctttc ttcgttc
//
LOCUS VIBRHP 1489 bp ds-DNA BCT 24-JUL-1990
DEFINITION V.harveyi Arg-tRNA, two Pro-tRNA and two His-tRNA genes.
ACCESSION M29762 M31832
KEYWORDS transfer RNA-Arg; transfer RNA-His; transfer RNA-Pro.
SOURCE V.harveyi (strain ATCC 14126) DNA.
ORGANISM Vibrio harveyi
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 1489)
AUTHORS Giroux,S. and Cedergren,R.
TITLE Evolution of a tRNA operon in gamma purple bacteria
JOURNAL J. Bacteriol. 171, 6446-6454 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.J.Cedergren, 06-NOV-1989.
FEATURES from to/span description
tRNA 60 136 Arg-tRNA
tRNA 171 247 His-tRNA
tRNA 324 400 Pro-tRNA
tRNA 450 523 His-tRNA
tRNA 588 664 Pro-tRNA
anticdn 94 96 Arg-tRNA anticodon ccg
anticdn 205 207 His-tRNA anticodon gtg
anticdn 358 360 Pro-tRNA anticodon tgg
anticdn 481 483 His-tRNA anticodon gtg
anticdn 617 619 Pro-tRNA anticodon tgg
signal 11 16 -35 region
signal 34 39 -10 region
BASE COUNT 351 a 331 c 360 g 447 t
ORIGIN
1 aaaaaagtac tagacggcat agtgtgatat ccgtattatt cctctccgcc gatgggcatg
61 cgcccgtagc tcagttggat agagcgttgg cctccggagc caaaggtcga aggttcgaat
121 cctttcgggc gtgccattcc ggaatacaaa ttaaaggcac gaacaatagt ggtggctata
181 gctcagttgg tagagccctg gattgtgatt ccggtggtcg cgagttcgaa tctcgttagc
241 caccccatta tttcggtaac tcattaaatt agtgaagttt ccagttttga ttcaaatatc
301 ccaagagtcg aaacaaaatt cgtcggtgaa tagcgcagct tggtagcgca tctggtttgg
361 gaccagaggg tcgggggttc gaatccctct tcaccgacca ctatttagtt tatggctata
421 atagagcggt aaacggcaaa attgatggtg gctatagctc agttggtaga gccctggatt
481 gtgattccgg tggtcgcgag ttcgaatctc gttagccacc ccattaattt aggtgacttc
541 ttagttgaag tatcctgtct gatttgagag aatcagcaaa tacagttcgg tgaatagcgc
601 agcttggtag cgcatctggt ttgggaccag agggtcgggg gttcgaatcc ctcttcaccg
661 accactatta gaaagcctgc tttttagcag gctttcgtcg ttttagagct tatgatctcg
721 cttggactag atgagtcggc aatatttcaa tggatttgaa cccatatggg gttcgcctga
781 agttagaata tggctcttca tcgaccaata tttaaagcct cagtcgaaag actagggctt
841 ttttgtattt ctcaccctaa aaaatctgca ttccatttca atatcagtaa acatgtacag
901 ctctggaagc ttttgaccac ctatgatcta aatgcgattc gagctgataa aagaaacccc
961 agcacgttgg ctggggtttt gtgtttcttg aattaagcgc tttactgagc aagcgttttt
1021 aggatttcat tcagctcaac cgtctctcgc acttgatagt tgggtttgat atcggttgta
1081 tttccgtacc tgttgcgttt acccacacgc tcagaatacc catattgttg gcacccgcaa
1141 tgtctgctgg tagagagtcg ccaatatgaa tcacctcttc tggctttgca tccacaggtt
1201 gagtgctttg cggaaaatgt tggctgctgg cttctcttct ggctcttccc caccgacgat
1261 aatgtgatca acccattcgc tcatttgtgt tgctttgagt tttgggtgct gtgagaagat
1321 tggaccgtta gtgatcacca ctaatttgta gtgttgacgt agctcggtca gcatctcttt
1381 tacgccaggg aagaagtcaa aagcccccat gcgagccgag tcaaaatagc tctgcgcctg
1441 ttgagccttt tctgtatcaa tctctatact ttgctctgcc aagatggcc
//
LOCUS YSCPOX1 2915 bp ds-DNA PLN 24-JUL-1990
DEFINITION Yeast (S.cerevisiae) acyl-coenzyme A oxidase (POX1) gene, complete
cds.
ACCESSION M27515
KEYWORDS acyl-coenzyme A oxidase.
SOURCE Yeast (S. cerevisiae) DNA, clone PAD17.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2915)
AUTHORS Dmochowska,A., Dignard,D., Maleszka,R. and Thomas,D.Y.
TITLE Structure and transcriptional control of the Saccharomyces
cerevisiae POX1 gene encoding acyl-coenzyme A oxidase
JOURNAL Gene 88, 247-252 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by D.Dignard, 01-SEP-1989.
FEATURES from to/span description
pept 455 2701 acyl-coenzyme A oxidase
mRNA 364 2843 POX1 mRNA (minor alt.)
mRNA 365 2843 POX1 mRNA (minor alt.)
mRNA 369 2843 POX1 mRNA (minor alt.)
mRNA 386 2843 POX1 mRNA (major alt.)
mRNA 400 2843 POX1 mRNA (major alt.)
mRNA 406 2843 POX1 mRNA (minor alt.)
mRNA 413 2843 POX1 mRNA (minor alt.)
BASE COUNT 895 a 532 c 600 g 888 t
ORIGIN 1 bp upstream of BglII site; chromosome VII.
1 agatctcgac caaaaaaagg gggtagggta ataaaattaa ccctattatt ttttaacttt
61 aaaacctata atgtgctaat attttattat aaacctcctt tttttgcgtt caaaccctga
121 cacattttaa gccctatatt tacggtatta gttgattaaa ctccgaagcg aaaggaattc
181 ggtcattagc ggctaatagc cgttggggta aatcacctac aagcaagtac acaagagaac
241 gttggcgttg ttaagtcaaa gcactaatac attggggctt taagagtgtt tataaaggtc
301 taacctgtaa aaattattta aacaacttga acaggcctta aagttttcct cattccgctc
361 atcatcacta atattgctct ccgtttttga atacacactt gacactaata agtatcacag
421 aaaaaaagaa aatataataa attagtattg cgatatgacg agacgtacta ctattaatcc
481 cgattcggtg gttctgaatc ctcaaaaatt tatccagaaa gaaagggcgg attcgaaaat
541 caaagttgac caagttaaca catttttaga gtcatccccg gagaggagaa ctctgacgca
601 cgccttaata gaccaaatag tgaatgatcc tatattgaaa actgatacgg actattacga
661 tgctaaaaaa atgcaagaga gagaaattac tgccaaaaaa atagctaggc ttgctagtta
721 tatggagcac gatatcaaaa cagtgcgcaa acactttcgc gacactgacc tgatgaaaga
781 gttgcaagca aatgatccag acaaagcttc gcctttaaca aacaaagacc tttttatatt
841 cgataagaga ttgtcacttg tagcaaatat tgatcctcaa ttgggtacgc gcgtgggtgt
901 acacttgggg ctatttggta attgtatcaa gggcaatggt actgatgagc aaatccggta
961 ttggttgcag gagagaggtg ccactttgat gaaaggtata tatggctgtt ttgcaatgac
1021 tgagttagga catggttcca atgttgccca gctgcagact agggctgtgt acgataagca
1081 aaatgatact tttgtaattg atacacctga tctaactgcc accaaatggt ggattggtgg
1141 ggctgcccat tctgccacgc acgctgccgt gtacgccaga ttgatcgttg aaggtaaaga
1201 ctacggtgta aaaacattcg ttgttcctct gagagaccct tcgactttcc aactgttagc
1261 tggtgtttcc ataggggata ttggagcgaa gatgggtcgt gacggtattg ataatggctg
1321 gatccagttc agaaacgtag ttatccctag agaatttatg ctaagtagat ttaccaaagt
1381 tgtccgttct ccagatggtt cagtcaccgt caaaactgag ccacaattgg atcaaatttc
1441 tggttatagt gcattgttaa gtggtagagt taacatggtc atggattcat ttaggtttgg
1501 ctccaaattt gctactattg ctgtacgtta cgcggttggt cgtcagcaat tcgcacctag
1561 aaagggattg tctgaaacac aattaatcga ctatcccctt caccaatatc gtgttttacc
1621 acaattgtgt gttccatatt tggtgtcacc tgtagctttt aagttaatgg acaactatta
1681 ttccactttg gacgagttat acaacgcttc ctcatctgca tacaaagctg ctctggttac
1741 cgtgagtaaa aagttgaaga atttatttat tgatagcgcc acgttgaaag ccaccaatac
1801 ttggttaatt gctacactga ttgatgagtt gagacagact tgcggaggac atgggtattc
1861 acagtataac ggatttggta aaggctatga cgactgggtg gttcagtgca catgggaggg
1921 tgataataat gttttatctt taacttcagc aaaatcaata ttgaaaaaat ttatcgattc
1981 agccacaaag ggtagatttg acaacacact ggatgtggac tcattctctt acttaaaacc
2041 tcagtacata ggatctgtgg tttctggaga aataaagagt ggtttaaagg agttgggtga
2101 ttatactgaa atttggtcta tcaccttaat caaattactg gcacatattg gtactttagt
2161 tgaaaaatca agaagtattg atagcgtttc taagctttta gtcttagtat ccaaatttca
2221 tgccttgcgc tgcatgttga aaacctatta cgacaagtta aactctcgtg attcacatat
2281 ttccgatgaa attacaaagg aatctatgtg gaatgtttat aagttatttt ccttgtattt
2341 tattgacaag cattccggag aattccaaca attcaagatc ttcactcctg atcagatctc
2401 taaagttgtg cagccacaac tattggctct tttgccaatt gtgaggaaag actgtatagg
2461 tctgacagac tcctttgaat tacctgacgc gatgttaaat tctcctatag gttactttga
2521 tggcgatatc tatcacaatt acttcaatga agtttgccgc aataatccag tggaggcaga
2581 tggggcaggg aagccttctt atcatgcgct gttgagcagc atgctcggta gaggtttcga
2641 atttgaccaa aagttaggtg gtgcagctaa tgcggaaatt ttatcgaaaa taaacaagtg
2701 agtagaggtt tcctgttttc cttcgaaccc tctgttttgc gacttttgtt tcaattcaac
2761 tagtgtcgcc aagttttaac aaaaagttac aaaatcctag tgagaggcca tcttatgtgc
2821 ataacggtac tctctatcta tttacatatc taatactatt cacataacta tgacgaatca
2881 atgacatgac tacatttacc aatgtatagt agtaa
//
LOCUS YSPUGU1 507 bp ds-DNA PLN 24-JUL-1990
DEFINITION S.pombe U1 small nuclear RNA gene (snu1), complete cds.
ACCESSION M29062
KEYWORDS U1 small nuclear RNA; small nuclear RNA.
SOURCE S.pombe (sub-species lindner; strain h-972) DNA, clone Bb.
ORGANISM Schizosaccharomyces pombe
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 507)
AUTHORS Porter,G., Brennwald,P. and Wise,J.A.
TITLE U1 small nuclear RNA from Schizosaccharomyces pombe has unique and
conserved features and is encoded by an essential single-copy gene
JOURNAL Mol. Cell. Biol. 10, 2874-2881 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.L.Porter, 13-OCT-1989.
FEATURES from to/span description
uRNA 240 387 U1 small nuclear RNA
BASE COUNT 148 a 82 c 89 g 188 t
ORIGIN 3 bp upstream of HincII site.
1 gttaactatt catggcttga aagcaacgaa atcatttcct attatgaaat tttggagatt
61 tatataatta caaatcgctg aacgatttca ctaatcagca aaatttaacg atttaaatat
121 cgtttttgtt aaataattat actgtacttc tttattgcat gtaagaatgt gactactttc
181 gtgactgctt cgttccgcgt cgttactata aatatgggtg cttacactta cggttcttta
241 cttacctggc atgagtttct gcagcacaag aattgtggag actcagttat ttgtcttggc
301 attgcactga gccctgacga ataactgtgg actggctaag gtcagctccg gatgcatcat
361 ttttgagttc gtccctcatt tggggcaatt ttactaatta aggcgatatg gttgccttaa
421 atagtaaagt tatccgaaat ggattatgta aagtaacatt ttctttcatc atatataatt
481 gtaaattatt gtatcataat taatatt
//
LOCUS MUSIGHAAZ 149 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
C16-11D11, partial cds.
ACCESSION M33399
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C16-11D11, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 149)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 149 Ig H-chain V-D-JH4-region (AA at 1)
recomb 87 88 V-region end/D-region start
recomb 102 103 D-region end/JH4-region start
BASE COUNT 39 a 45 c 35 g 30 t
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
61 gacactgccg tctattactg tgctagagct ggtaagggag gctatgctat ggactactgg
121 ggtcaaggaa cctcagcacc gtctcctca
//
LOCUS MUSIGHABB 136 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
C20-7D6, partial cds.
ACCESSION M33401
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C20-7D6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 136)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 135 Ig H-chain V-D-JH4-region (AA at 1)
recomb 87 88 V-region end/D-region start
recomb 105 106 D-region end/JH4-region start
BASE COUNT 36 a 37 c 32 g 26 t 5 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gncatctgag
61 gacactncng tctattactg tgctagaggg nataggtacg acggtnctat ggactactgg
121 ggtcaaggaa cctcag
//
LOCUS MUSIGHABC 131 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
29-1G5, partial cds.
ACCESSION M33402
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 131)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 131 Ig H-chain V-D-JH4-region (AA at 1)
recomb 84 85 V-region end/D-region start
recomb 87 88 D-region end/JH4-region start
BASE COUNT 34 a 40 c 27 g 28 t 2 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
61 gacactgcng tctattactg tgctccgtac tatgctatgg actactgggg tcaaggaacc
121 tcagtnaccg t
//
LOCUS MUSIGHABD 123 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
23-9D10, partial cds.
ACCESSION M33403
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-9D10, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 123)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 123 Ig H-chain V-D-JH4-region (AA at 1)
recomb 84 85 V-region end/D-region start
recomb 87 88 D-region end/JH4-region start
BASE COUNT 28 a 37 c 29 g 27 t 2 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
61 gacactgcng tctattactg tgctggggct tactggggcc aagggactct ggtnactgtc
121 tct
//
LOCUS MUSIGHABE 129 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
41-4F5, partial cds.
ACCESSION M33404
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 41-4F5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 129)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 129 Ig H-chain V-D-JH4-region (AA at 1)
recomb 84 85 V-region end/D-region start
recomb 90 91 D-region end/JH4-region start
BASE COUNT 34 a 42 c 24 g 25 t 4 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag
61 gacactgcng tctattactg tgctaactgg gactactggg gnnaaggcac cactctcaca
121 gtctcctca
//
LOCUS MUSIGHABF 132 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
49-1G5, partial cds.
ACCESSION M33405
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1G5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 132)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 132 Ig H-chain V-D-JH4-region (AA at 1)
recomb 87 88 V-region end/D-region start
recomb 90 91 D-region end/JH4-region start
BASE COUNT 34 a 44 c 25 g 27 t 2 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
61 gacacngcng tctattactg tgctaggtac tactttgact actggggcca aggcaccact
121 ctcacagtct cc
//
LOCUS MUSIGHABG 141 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
23-10E7, partial cds.
ACCESSION M33406
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-10E7, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 141)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 141 Ig H-chain V-D-JH4-region (AA at 1)
recomb 87 88 V-region end/D-region start
recomb 99 100 D-region end/JH4-region start
BASE COUNT 39 a 43 c 30 g 27 t 2 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct aacatctgag
61 gacactgcng tctattactg tgctagaagg ggnaactacg ctatggacta ctggggtcaa
121 ggaacctcag tcaccgtctc c
//
LOCUS MUSIGHABH 151 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
44-2F5, partial cds.
ACCESSION M33407
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 44-2F5, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 151)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 151 Ig H-chain V-D-JH4-region (AA at 1)
recomb 84 85 V-region end/D-region start
recomb 105 106 D-region end/JH4-region start
BASE COUNT 39 a 48 c 27 g 34 t 3 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag
61 gacactgcng tctattactg tgcnctcatc tactatggta acacctatgc tatggactac
121 tggggtcaag gaacctcagt caccgtctcc t
//
LOCUS MUSIGHABI 160 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
25-5E10, partial cds.
ACCESSION M33408
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 25-5E10, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 160)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 160 Ig H-chain V-D-JH4-region (AA at 1)
recomb 87 88 V-region end/D-region start
recomb 117 118 D-region end/JH4-region start
BASE COUNT 44 a 44 c 36 g 33 t 3 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag
61 gacactgcng tctattactg tgctagaaaa gactattact acggtagtag cgggggagct
121 atggactact ggggtcaagg aacctcagtn accgtctcct
//
LOCUS MUSIGHABJ 125 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
49-1E7, partial cds.
ACCESSION M33409
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1E7, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 125)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 125 Ig H-chain V-D-JH4-region (AA at 1)
recomb 87 88 V-region end/D-region start
recomb 90 91 D-region end/JH4-region start
BASE COUNT 32 a 42 c 26 g 24 t 1 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
61 gacactgcng tctattactg tgctagctgg gactactggg gcaaggcacc actctcacag
121 tctcc
//
LOCUS MUSIGHABK 91 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
50-2G3, partial cds.
ACCESSION M33410
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 50-2G3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 91)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 91 Ig H-chain V-D-JH4-region (AA at 3)
recomb 29 30 V-region end/D-region start
recomb 47 48 D-region end/JH4-region start
BASE COUNT 21 a 18 c 23 g 24 t 5 others
ORIGIN Chromosome 12.
1 gagacactnc ngtctattac tgtgctagag gtactacggt agtagtctat gctatggact
61 actggggtna aggaacctna gtnaccgtct c
//
LOCUS MUSIGHABL 107 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
50-6D8, partial cds.
ACCESSION M33411
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 50-6D8, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 107)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 107 Ig H-chain V-D-JH4-region (AA at 1)
recomb 48 49 V-region end/D-region start
recomb 57 58 D-region end/JH4-region start
BASE COUNT 26 a 28 c 24 g 28 t 1 others
ORIGIN Chromosome 12.
1 ctcagcagcc taacatctga ggacactgcn gtctattact gtgctagtag gtacgattac
61 tatgctatgg actactgggg tcaaggaacc tcagtcaccg tctctca
//
LOCUS MUSIGHADE 404 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain gene V-region, Id+ hybridoma
C16-15F6, partial cds.
ACCESSION M33390
KEYWORDS immunoglobulin heavy chain; processed gene; variable region.
SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C16-15F6 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 404)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 20 Ig H-chain V-region, exon 1 (AA at 2)
100 > 404 Ig H-chain V-region, exon 2
IVS 21 99 Ig H-chain V-region intron A
BASE COUNT 105 a 100 c 106 g 91 t 2 others
ORIGIN Chromosome 12.
1 cctgatggca gtggttacag gtaaggggct ccnaagtccc aaacttgagg gtccataaac
61 tctgtgacag tggcaatcac tttgcctttc tttctacagg ggtcaattca gaggttcagc
121 tgcagcagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg tcctgcacag
181 cttctggctt caacattaaa gacacctata tgcactgggt gaagcagagg cctgaacagg
241 gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat gacccgaagt
301 tccagggnaa ggccactata acagcagaca catcctccaa cacagcctac ctgcagctca
361 gcagcctgac atctgaggac actgccgtct attactgtgc taga
//
LOCUS MUSIGHADF 141 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
J7-11H3, partial cds.
ACCESSION M33400
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) adult Id+ hybridoma J7-11H3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 141)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 141 Ig H-chain V-D-JH4-region (AA at 1)
recomb 87 88 V-region end/D-region start
recomb 96 97 D-region end/JH4-region start
BASE COUNT 36 a 43 c 28 g 29 t 5 others
ORIGIN Chromosome 12.
1 ataacagcag acacatcctc caacacagcc tacctgcagc tnagcagcct gacatctgag
61 gacactgcng tctattactg tgctaggagt agctacgact actttgacta ctggggnnna
121 ggcaccactc tcacagtctc c
//
LOCUS MUSIGHZD 234 bp ss-mRNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
C16-15F6, partial cds.
ACCESSION M17008 M33398
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C16-15F6, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 234)
AUTHORS Busto,P., Gerstein,R., Dupre,L., Giorgetti,C.A., Selsing,E. and
Press,J.L.
TITLE Molecular analysis of heavy and light chains used by primary and
secondary anti-(T,G)-A--L antibodies produced by normal and Xid
mice
JOURNAL J. Immunol. 139, 608-618 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 82 to 234)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 234 Ig H-chain V-D-JH4-region (AA at 1)
recomb 168 169 V-region end/D-region start
recomb 186 187 D-region end/JH4-region start
BASE COUNT 65 a 61 c 58 g 47 t 3 others
ORIGIN Chromosome 12.
1 cagggccnnn agtggattgg aaggattgat cctgcgaatg gtaatactaa atatgacccg
61 aagttccagg gcaaggccac tataacagca gacacatcct ccaacacagc ctacctgcag
121 ctcagcagcc tgacatctga ggacactgcc gtctattact gtgctagaag agggggcaat
181 ggtaactatg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctca
//
LOCUS MUSIGHZL 291 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma
29-1G5, partial cds.
ACCESSION M33391
KEYWORDS germline; immunoglobulin heavy chain; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 291 Ig H-chain V-region (AA at 1)
BASE COUNT 77 a 69 c 70 g 57 t 18 others
ORIGIN Chromosome 12.
1 gaggntcagc tncagnagtc tggggnagag ctngtnaagc caggngcctc agtcaagttg
61 tcctgcacag cttctnnnnn naacattaaa gacacctata tgcactgggt naagcagagg
121 cctnaacagg gcctngagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
181 gacccgaagt tccagggnaa ggccactata acagcagaca catcctccaa cacagcctac
241 ctgcagctca gcagcctgac atctgaggac actgcngtct attactgtgc t
//
LOCUS MUSIGHZM 291 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma
23-9D10, partial cds.
ACCESSION M33392
KEYWORDS germline; immunoglobulin heavy chain; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-9D10 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 291 Ig H-chain V-region (AA at 1)
BASE COUNT 77 a 71 c 75 g 60 t 8 others
ORIGIN Chromosome 12.
1 gaggttcagc tgnagnagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg
61 tcctgcacag cttctgnntt caacattaaa gacacctata tgcactgggt gaagcagagg
121 cctnaacagg gcctngagtg gattggaagg atngatcctg cgaatggtaa tactaaatat
181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac
241 ctgcagctca gcagcctgac atctgaggac actgcngtct attactgtgc t
//
LOCUS MUSIGHZN 291 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma
41-4F5, partial cds.
ACCESSION M33393
KEYWORDS germline; immunoglobulin heavy chain; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 41-4F5 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 291 Ig H-chain V-region (AA at 1)
BASE COUNT 77 a 71 c 76 g 60 t 7 others
ORIGIN Chromosome 12.
1 gaggntcagc tgcagnagtc tggggnagag cttgtgaagc caggggcctc agtcaagttg
61 tcctgcacag cttctggntt caacattaaa gacacctata tgcactgggt naagcagagg
121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac
241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc t
//
LOCUS MUSIGHZO 294 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse Ig germline H-chain gene V1A-J2-region, Id+ hybridoma
49-1G5, partial cds.
ACCESSION M33394
KEYWORDS germline; immunoglobulin heavy chain; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1G5 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 294)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 294 Ig H-chain V-region (AA at 1)
BASE COUNT 78 a 69 c 76 g 59 t 12 others
ORIGIN Chromosome 12.
1 gaggttcagc tgcagcagtc tggggcagan cttgtnaagc caggggcctc agtcaagttg
61 tcctgcacag cttctggnnt naacattaaa gacacctata tgcactgggt gaagcagagg
121 cctnaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
181 gacnngaagt tcnagngcaa ggccactata acagcagaca catcctccaa cacagcctac
241 ctgcagctca gcagcctgac atctgaggac acngcngtct attactgtgc tagg
//
LOCUS MUSIGHZP 294 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma
23-10E7, partial cds.
ACCESSION M33395
KEYWORDS germline; immunoglobulin heavy chain; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-10E7 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 294)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 294 Ig H-chain V-region (AA at 1)
BASE COUNT 80 a 72 c 78 g 60 t 4 others
ORIGIN Chromosome 12.
1 gaggttcagc tgcagnagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg
61 tcctgcacag cttctggnnt caacattaaa gacacctata tgcactgggt gaagcagagg
121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac
241 ctgcagctca gcagcctaac atctgaggac actgcngtct attactgtgc taga
//
LOCUS MUSIGHZQ 291 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma
44-2F5, partial cds.
ACCESSION M33396
KEYWORDS germline; immunoglobulin heavy chain; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 44-2F5 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 291)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 291 Ig H-chain V-region (AA at 1)
BASE COUNT 77 a 72 c 76 g 59 t 7 others
ORIGIN Chromosome 12.
1 gaggttcagc tgcagcagtc tggggnagag cttgtgaagc caggggcctc agtcaagttg
61 tcctgcacag cttctggnnt caacattaaa gacacctata tgcactgggt gaagcagagg
121 cctnaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac
241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc n
//
LOCUS MUSIGHZT 294 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma
25-5E10, partial cds.
ACCESSION M33397
KEYWORDS germline; immunoglobulin heavy chain; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 25-5E10 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 294)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 294 Ig H-chain V-region (AA at 1)
BASE COUNT 79 a 73 c 78 g 60 t 4 others
ORIGIN Chromosome 12.
1 gaggttcagc tgcagcagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg
61 tcctgcacag cttctggcnt caacattaaa gacacctata tgcactgggt gaagcagagg
121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
181 gacccgaagt tcnagggcaa ggccactata acagcagaca catcctccaa cacagcctac
241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc taga
//
LOCUS MUSIGKABO 339 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain gene V1A-J2-region, Id+ hybridoma
29-1G5, partial cds.
ACCESSION M33389
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 339)
AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
Zhukovsky,E. and Press,J.L.
TITLE Neonatal and adult primary B cells use the same germ-line V-H and
V-kappa genes in their (T,G)-A-L-specific repertoire
JOURNAL J. Immunol. 144, 583-592 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 339 Ig kappa-chain V1A-J2-region (AA at 1)
recomb 300 301 V1A-region end/J2-region start
BASE COUNT 88 a 84 c 83 g 82 t 2 others
ORIGIN Chromosome 6.
1 gatgttgtga tgacccaaac tccnctctcc ctgcctgtca gtcttggaga tcaagcctcc
61 atctcttgca gatctagtca gagccttgta cacagtaatg gaaacaccta tttacattgg
121 tacctgcaga agccaggcca gtctccaaag ctcctgatct acaaagtttc caaccgattt
181 tctggggtcc cagacaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc
241 agcagagngg aggctgagga tctgggagtt tatttctgct ctcaaagtac acatgttccg
301 tacacgttcg gaggggggac caagctggaa ataaaacgg
//
LOCUS M36836 2651 bp - UNA 24-JUL-1990
DEFINITION Figure 1. Nucleotide sequence of the characterized locust mtDNA
segment.
ACCESSION M36836
REFERENCE 1 (bases 1 to 2651)
AUTHORS Haucke,H.-R. and Gellissen,G.
TITLE different mitochondrial gene orders amino insects: exchanged trna
gene positions in the coii/coiii region between an orthopteran and
a dipteran species
JOURNAL Curr. Genet. 14, 471-476 (1988)
STANDARD unannotated staff_entry
BASE COUNT 1034 a 419 c 303 g 895 t
ORIGIN
1 cacagatact cagaactacc actaattaat ttatagattc taatatggca gattaatgca
61 ttagatttaa gctctaaaaa taaagttttg accttttatt agaaaaatat ggcaacatga
121 tcaaacttat cattacaaga tggagcttca ccattaatag aacaattatc attctttcat
181 gatcatacaa taattgatct attattaatc acaataattg taggatattc gcttagatat
241 atattattaa caaaatatac aaaccgaaat atacttcatg gtcatttaat tgaaactatc
301 tgaacagcac taccagcaat tacattaatc tttattgcat taccatcatt acgattacta
361 tatttacttg atgattcatc agatgctata attacaatta aaacaattgg acgacaatga
421 tactgaagtt atgaatattc agatttcatt aatgtagaat ttgatacata tataactcct
481 gaaaatgaat taaatacaga tgaattccga ctcttagaag ttgataatcg aacaacatta
541 cctataaata cagaagtacg agtattaact agagcatctg atgtactcca ctcatgagca
601 gtaccagctt tagtattaaa aattgacgca acacccggac gactaaacca aggtatattt
661 ataattaacc gcccaggtct atttttcggt caatgctcag aaatctgtgg agctaatcat
721 agatttatac caattgtaat tgaaagaaca tcaattaaac ttttcattaa atgattatct
781 aacataatat aaggagttag ttaaaatata acattagaat gtcaatctaa aataactaaa
841 aatagtacac cttgaaccat cagatgactg aaagtaagta atggtctctt aaaccaaaaa
901 atagtaaatt aacgtttact tctgatgagg taaattaaaa acttaatccc tcaaatatca
961 ccaataatat gattttcact attcattata ttctcaataa caataatatt atttaatcaa
1021 ttaaacttct tctcttataa accaaataaa attataagaa gtaataataa aattaaaaaa
1081 aaaaacatta attgaatatg ataacaaatc tattctcaac attcgaccca tcaactaatt
1141 tatttaattt atcattaaat tgaactagaa cattccttgg actattatta attccatcaa
1201 tattttgatt aataccttct cgaattaata ttttatgaaa taaaataaat ctaaacttac
1261 ataatgaatt taaaacatta ttaggaaaaa actcatttca aggatcaaca ctaattctaa
1321 tttcaatttt tatcataata ttatttaaca attttatagg attattccca tacattttta
1381 caagaactag tcatataaca ttaacattct caattgcatt accaatatga ataagattta
1441 tattatttgg atgaattaat aatactaatc atatatttac acatcttgtc ccacaaggaa
1501 caccaaatgc acttatatca tttatggttc taattgaaac aattagaaat gttattcggc
1561 cagggaccct agctgtacga ttagctgcaa atatgattgc aggacatctt ttattaacat
1621 tattaggaaa tacaggaccg tcactaacaa caagaattat attatttcta attattggac
1681 aaatattact tttaattctt gaatcagccg tagctataat tcaagcatac gtattctcaa
1741 ttctaagaac attatattca agagaagttt actaaactta tgttaacaaa taacaacaat
1801 cacccatttc atatagtaga ttatagacca tgaccattaa ctggagcaat cggagcaata
1861 attctaacct caggaataac taaatgattt catacattca atataaattt attaataatt
1921 ggaataactg tcattgtatt aacaataatt caatggtgac gagatgtagt tcgagaaggt
1981 acatttcaag gtctacacac taaattagta tcaaaaggac tgcgatgagg aataatttta
2041 tttattgcat cagaagtttt atttttcgcc tcattcttct gagcattctt taatagtaga
2101 ttagcaccaa caattgaact aggaataaaa tgacctccaa taggaattca accatttaat
2161 ccaatccaaa ttccactact taatactgca attttactag cgtcaggagt tacaattaca
2221 tgagcacatc acagaatcat agaatgtaat cattcacaag cattacaagg attattcttt
2281 acagtaatat taggatttta ttttacttta ttacaaatat acgaatattg agaagcacct
2341 tttaccattg cagacgcagt atatggatct acattcttcg tagcaacagg attccatggg
2401 ctacatgtaa tcattggtac aacattctta ttaacatgcc taattcgaca tataataaac
2461 caattctcat ctaatcacca ctttggattt gaagctgcag catgatactg acactttgtt
2521 gatgtagtat gattattttt atatttatca atttactgat gaggtagata attaattttt
2581 ctagtataat tagtacattt gacttccaat caaaagattg aaatatcaag aaaaataatt
2641 ataatattaa t
//
LOCUS ECOFPP 1950 bp ds-DNA BCT 24-JUL-1990
DEFINITION E.coli dipeptide transport protein (fpp) gene, complete cds.
ACCESSION M35045
KEYWORDS fpp gene; periplasmic protein.
SOURCE E.coli (strain K12; isolate E1222) DNA, clone lambda-10.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1950)
AUTHORS Olson,E.R., Dunyak,D.S., Jurss,L.M. and Poorman,R.A.
TITLE Identification and characterization of fpp, an E.coli gene involved
in dipeptide transport
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.R.Olson, 04-JUN-1990.
Author address: E.R.Olson
Molecular Biology Research, 7242-209-7
The UpJohn Company
Kalamazoo, MI 49001
FEATURES from to/span description
pept 147 1754 dipeptide transport protein (fpp) precursor
sigp 147 230 dipeptide transport protein signal peptide
matp 231 1751 dipeptide transport protein
site 1792 1888 REP sequence
BASE COUNT 548 a 499 c 491 g 412 t
ORIGIN 79.5 min on the K12 map.
1 ggatccgcac tgttacactg atgttaatta gtacggcatc cccacctcat aacgttgacc
61 cgaccgggca aaaaacaaaa aaggtcaggc agcgacaacc cactgcaaag ggttaaaaca
121 acaaacatca caattggagc agaataatgc gtatttcctt gaaaaagtca gggatgctga
181 agcttggtct cagcctggtg gctatgaccg tcgcagcaag tgttcaggct aaaactctgg
241 tttattgctc agaaggatct ccggaagggt ttaacccgca gctgtttacc tccggcacca
301 cctatgacgc ctcttccgtc ccgctttata accgtctggt tgaatttaaa atcggcacca
361 ccgaagtgat cccgggcctc gctgaaaagt gggaagtcag cgaagacggt aaaacctata
421 ccttccatct gcgtaaaggt gtgaagtggc acgacaataa agaattcaaa ccgacgcgtg
481 aactgaacgc cgatgatgtg gtgttctcgt tcgatcgtca gaaaaacgcg caaaacccgt
541 accataaagt ttctggcggc agctacgaat acttcgaagg catgggcttg ccagagctga
601 tcagtgaagt gaaaaaggtg gacgacaaca ccgttcagtt tgtgctgact cgcccggaag
661 cgccgttcct cgctgacctg gcaatggact tcgcctctat tctgtcaaaa gaatatgctg
721 atgcgatgat gaaagccggt acaccggaaa aactggacct caacccaatc ggaaccggtc
781 cgttccagtt acagcagtat caaaaagatt cccgtatccg ctacaaagcg tttgatggct
841 actggggcac caaaccgcag atcgatacgc tggttttctc tattacccct gacgcttccg
901 tgcgttacgc gaaattgcag aagaatgaat gccaggtgat gccgtacccg aacccggcag
961 atatcgctcg catgaagcag gataaatcca tcaatctgat ggaaatgccg gggctgaacg
1021 tcggttatct ctcgtataac gtgcagaaaa aaccactcga tgacgtgaaa gttcgccagg
1081 ctctgaccta cgcggtgaac aaagacgcga tcatcaaagc ggtttatcag ggcgcgggcg
1141 tatcagcgaa aaacctgatc ccgccaacca tgtggggcta taacgacgac gttcaggact
1201 acacctacga tcctgaaaaa gcgaaagcct tgctgaaaga agcgggtctg gaaaaaggtt
1261 tctccatcga cctgtgggcg atgccggtac aacgtccgta taacccgaac gctcgccgca
1321 tggcggagat gattcaggca gactgggcga aagtcggcgt gcaggccaaa attgtcacct
1381 acgaatgggg tgagtacctc aagcgtgcga aagatggcga gcaccagacg gtaatgatgg
1441 gctggactgg cgataacggg gatccggata acttcttcgc caccctgttc agctgcgccg
1501 cctctgaaca aggctccaac tactcaaaat ggtgctacaa accgtttgaa gatctgattc
1561 aaccggcgcg tgctaccgac gaccacaata aacgcgttga actgtacaaa caagcgcagg
1621 tggtgatgca cgatcaggct ccggcactga tcatcgctca ctccaccgtg tttgaaccgg
1681 tacgtaaaga agttaaaggc tatgtggttg atccattagg caaacatcac ttcgaaaacg
1741 tctctatcga ataattaaaa gccatacaag actgatggca aaggcaaaaa tgcctgatgc
1801 gctccgctta tcaggcctac gaaaattctg caatgtattg aatttgcacg attttgtagg
1861 ccggataagg cgttaacgcg catccggcat aaacaaagcg cactttgtca acaatctgta
1921 tacccggtgg cgctgtgcct ctgcgtgcag
//
LOCUS ECOSHL 1820 bp ds-DNA BCT 24-JUL-1990
DEFINITION E.coli shl gene, complete cds, and acetolactate synthase II (ilvH)
gene, 3' end.
ACCESSION M35034
KEYWORDS acetolactate synthase.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1820)
AUTHORS Leclerc,G., Noel,G. and Drapeau,G.
TITLE Molecular cloning, nucleotide sequence and expression of shl
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Drapeau, 01-JUN-1990.
FEATURES from to/span description
pept < 1 105 acetolactate synthase (ilvH) (AA at 1)
pept 117 203 leader protein
pept 285 1289 shl protein
signal 66 73 -35 region
signal 89 95 -10 region
BASE COUNT 446 a 433 c 463 g 478 t
ORIGIN 1 bp upstream HindIII site.
1 aagcttgatg catttttagc atcgattcgc gatgtggcga aaattgtgga ggttgctcgc
61 tctggtgtgg tcggactttc gcgcggcgat aaaataatgc gttgagaatg atctcaatgc
121 gcaatttaca gcccaacatg tcacgttggg ctttttttgc gaaatcagtg ggaacctgga
181 ataaaagcag ttgccgcagt taattttctg cgcttagatg ttaatgaatt taacccatac
241 cagtacaatg gctatggttt ttacatttta cgcaaggggc aattgtgaaa ctggatgaaa
301 tcgctcggct ggcgggagtg tcgcggacca ctgcaagcta tgttattaac ggcaaagcga
361 agcaataccg tgtgagcgac aaaaccgttg aaaaagtcat ggctgtggtg cgtgagcaca
421 attaccaccc gaacgccgtg gcagctgggc ttcgtgctgg acgcacacgt tctattggtc
481 ttgtgatccc cgatctggag aacaccagct atacccgcat cgctaactat cttgaacgcc
541 aggcgcggca acggggttat caactgctga ttgcctgctc agaagatcag ccagacaacg
601 aaatgcggtg cattgagcac cttttacagc gtcaggttga tgccattatt gtttcgacgt
661 cgttgcctcc tgagcatcct ttttatcaac gctgggctaa cgacccgttc ccgattgtcg
721 cgctggaccg cgccctcgat cgtgaacact tcaccagcgt ggttggtgcc gatcaggatg
781 atgccgaaat gctggcggaa gagttacgta agtttcccgc cgagacggtg ctttatcttg
841 gtgcgctacc ggagctttct gtcagcttcc tgcgtgaaca aggtttccgt actgcctgga
901 aagatgatcc gcgcgaagtg catttcctgt atgccaacag ctatgagcgg gaggcggctg
961 cccagttatt cgaaaaatgg ctggaaacgc atccgatgcc gcaggcgctg ttcacaacgt
1021 cgtttgcgtt gttgcaagga gtgatggatg tcacgctgcg tcgcgacggc aaactgcctt
1081 ctgacctggc aattgccacc tttggcgata acgaactgct cgacttctta cagtgtccgg
1141 tgctggcagt ggctcaacgt caccgcgatg tcgcagagcg tgtgctggag attgtcctgg
1201 caagcctgga cgaaccgcgt aagccaaaac ctggtttaac gcgcattaaa cgtaatctct
1261 atcgccgcgg cgtgctcagc cgtagctaag ccgcgaacaa aaatacgcgc caggtgaatt
1321 tccctctggc gcgtagagta cgggactgga catcaatatg cttaaagtaa ataagactat
1381 tcctgactat tattgataaa tgcttttaaa cccgcccgtt aattaactca ccagctgaaa
1441 ttcacaataa ttaagtgata tcgacagcgc gtttttgcat tattttgtta catgcggcga
1501 tgaattgccg atttaacaaa cacttttctt tgcttttgcg caaacccgct ggcatcaagc
1561 gccacacaga cgtaacaagg actgttaacc ggggaagata tgtcctaaaa tgccgctcgc
1621 gtcgcaaact gacactttat atttgctgtg gaaaatagtg agtcatttta aaacggtgat
1681 gacgatgagg gattttttct tacagctatt cataacgtta atttgcttcg cacgttggac
1741 gtaaaataaa caacgctgat attagccgta aacatcgggt tttttacctc ggtatgcctt
1801 gtgactggct tgacaagctt
//
LOCUS HUMATCT4A 1742 bp ds-DNA PRI 24-JUL-1990
DEFINITION Human T4 surface glycoprotein CD4 gene, complete cds.
ACCESSION M35160
KEYWORDS glycoprotein; immunoglobulin supergene family; recognition antigen;
surface antigen.
SOURCE Human (haplotype OKT4-negative) DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1742)
AUTHORS Hodge,T.W., Sasso,D.R., Steve,J. and McDougal,S.
TITLE Humans with OKT4-epitope deficiency have a single base change in
the CD4 gene resulting in substitutions of TRP-240 for ARG-240
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.W.Hodge, 12-JUN-1990.
A cytosine to thymidine transition at nucleotide position 868
results in substitution of TRP-240 for ARG-240.
Author address: T.W.Hodge;
A-25 Bldg 1 Rm 1226
Centers for Disease Control
1600 Clifton Rd.
Atlanta, GA 30333
FEATURES from to/span description
pept 76 1452 T4 surface glycoprotein precursor
sigp 76 150 T4 surface glycoprotein signal peptide
matp 151 1449 T4 surface glycoprotein mature peptide
mut 792 792 g in OKT4 epitope deficiency; c in wt
BASE COUNT 405 a 491 c 466 g 380 t
ORIGIN
1 caagcccaga gccctgccat ttctgtgggc tcaggtccct actgctcagc cccttcctcc
61 ctcggcaagg ccacaatgaa ccggggagtc ccttttaggc acttgcttct ggtgctgcaa
121 ctggcgctcc tcccagcagc cactcaggga aagaaagtgg tgctgggcaa aaaaggggat
181 acagtggaac tgacctgtac agcttcccag aagaagagca tacaattcca ctggaaaaac
241 tccaaccaga taaagattct gggaaatcag ggctccttct taactaaagg tccatccaag
301 ctgaatgatc gcgctgactc aagaagaagc ctttgggacc aaggaaactt ccccctgatc
361 atcaagaatc ttaagataga agactcagat acttacatct gtgaagtgga ggaccagaag
421 gaggaggtgc aattgctagt gttcggattg actgccaact ctgacaccca cctgcttcag
481 gggcagagcc tgaccctgac cttggagagc ccccctggta gtagcccctc agtgcaatgt
541 aggagtccaa ggggtaaaaa catacagggg gggaagaccc tctccgtgtc tcagctggag
601 ctccaggata gtggcacctg gacatgcact gtcttgcaga accagaagaa ggtggagttc
661 aaaatagaca tcgtggtgct agctttccag aaggcctcca gcatagtcta taagaaagag
721 ggggaacagg tggagttctc cttcccactc gcctttacag ttgaaaagct gacgggcagt
781 ggcgagctgt ggtggcaggc ggagagggct tcctcctcca agtcttggat cacctttgac
841 ctgaagaaca aggaagtgtc tgtaaaatgg gttacccagg accctaagct ccagatgggc
901 aagaagctcc cgctccacct caccctgccc caggccttgc ctcagtatgc tggctctgga
961 aacctcaccc tggcccttga agcgaaaaca ggaaagttgc atcaggaagt gaacctggtg
1021 gtgatgagag ccactcagct ccagaaaaat ttgacctgtg aggtgtgggg acccacctcc
1081 cctaagctga tgctgagctt gaaactggag aacaaggagg caaaggtctc gaagcgggag
1141 aaggcggtgt gggtgctgaa ccctgaggcg gggatgtggc agtgtctgct gagtgactcg
1201 ggacaggtcc tgctggaatc caacatcaag gttctgccca catggtccac cccggtgcag
1261 ccaatggccc tgattgtgct ggggggcgtc gccggcctcc tgcttttcat tgggctaggc
1321 atcttcttct gtgtcaggtg ccggcaccga aggcgccaag cagagcggat gtctcagatc
1381 aagagactcc tcagtgagaa gaagacctgc cagtgccctc accggtttca gaagacatgt
1441 agccccattt gaggcacgag gccaggcaga tcccacttgc agcctcccca ggtgtctgcc
1501 ccgcgtttcc tgcctgcgga ccagatgaat gtagcagatc ccacgctctg gcctcctgtt
1561 cgtcctccct acaatttgcc attgtttctc ctgggttagg ccccggcttc actggttgag
1621 tgttgctctc tagtttccag aggcttaatc acaccgtcct ccacgccatt tccttttcct
1681 tcaagcctag cccttctctc attatttctc tctgaccctc tccccactgc tcatttggat
1741 cc
//
LOCUS HUMFUSED 555 bp ds-DNA SYN 24-JUL-1990
DEFINITION Human interleukin-1beta/growth hormone fusion protein gene,
complete cds.
ACCESSION M35049
KEYWORDS growth hormone; interleukin-1-beta.
SOURCE Human synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 555)
AUTHORS Pecceu,R., Dousset,P., Shire,D., Cavrois,E., Marchese,E.,
Ferrara,P., Kaghad,M., Dumont,X. and Lupker,J.
TITLE Human interleukin-1-beta fused to the human growth hormone signal
peptide is N-glycosylated and secreted by Chinese hamster ovary
cells
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Pecceu, 08-JUN-1990.
Author address: F.Pecceu
SANOFI ELF BIO-RECHERCHES
LABEGE INNOPOLE B.P. 137
31328 LABEGE CEDEX
FRANCE
FEATURES from to/span description
pept 11 550 human growth hormone/interleukin--1beta fusion
protein
sigp 11 88 human growth hormone signal peptide
matp 89 547 interleukin-1-beta
BASE COUNT 154 a 142 c 135 g 124 t
ORIGIN
1 agcttccgcc atggctacag gctcccggac gtccctgctc ctggcttttg gcctgctctg
61 cctgccctgg cttcaagagg gcagtgccgc acctgtaaga tctctgaact gcacgctccg
121 ggactcacag caaaagagct tggtgatgtc tggtccatat gaactgaaag ctctccacct
181 gcagggacag gatatggagc aacaagtggt gttctccatg tcctttgtcc aaggagaaga
241 aagtaatgac aaaatacctg tggcgttggg cctcaaggaa aagaatctgt acttaagctg
301 cgtgttgaaa gatgataagc ccactctaca gctggagagt gtagacccca aaaattaccc
361 aaagaagaag atggaaaagc gatttgtctt caacaagata gaaatcaata acaagctgga
421 atttgagtct gcccagttcc ccaactggta catcagcacc tctcaagcag aaaacatgcc
481 cgtcttcctg ggtggtacca aaggcggcca ggatataact gacttcacca tgcaatttgt
541 gagctcctag gatcc
//
LOCUS HUMGLYA 1200 bp ds-DNA PRI 24-JUL-1990
DEFINITION Human Miltenberger V-like glycophorin gene, exons 3 and 4.
ACCESSION J05563
KEYWORDS glycophorin A; transmembrane protein.
SOURCE Human leukocyte DNA, clone Mi51.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1200)
AUTHORS Kudo,S., Chagnovich,D., Rearden,A., Mattei,M.-G. and Fukuda,M.
TITLE Molecular analysis of a hybrid gene encoding human glycophorin
variant Miltenberger V-like molecule
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Kudo and M.Fukuda, 04-JUN-1990.
FEATURES from to/span description
pept / 81 176 Miltenberger V-like glycophorin, exon 3 (AA at
81)
1110 / 1148 Miltenberger V-like glycophorin, exon 4
IVS < 1 80 Miltenberger V-like glycophorin intron B
IVS 177 1109 Miltenberger V-like glycophorin intron C
site 1079 1116 crossing-over site
BASE COUNT 387 a 236 c 212 g 365 t
ORIGIN
1 agatttgtct ttcatgatac gctctatgtc cacgcagtca cctcattctt gacccctttc
61 tcaacttctc ttatatgcag atacgcacaa acgggacaca tatgcagcca ctcctagagc
121 tcatgaagtt tcagaaattt ctgttagaac tgtttaccct ccagaagagg aaaccggtat
181 gttcttagtt ttaaatagtt gctctggagt cattgttgtg attgaactct gtttacacga
241 gctgtaactc atgacagttc tcaagctttc gtgacagaaa acccatctct tttactccaa
301 agcccatata gcacccacaa ctattaactg tgaccaagaa agagaaggca agccccaatt
361 aacctttgta cgtaaagcct aaagaatgaa aaaatatacc tgaatcctca atcatcaaac
421 agcatagtat atactaagta atttgtaata attaaactct agaaaattgt gtggcttcgg
481 tagtaagaga gcttcatgac ataaaatggc aagtggagac agagacaaaa gtaggatgtg
541 gactgagagg gaaggttagc acaggtggaa cagtaaggca accatactat caattgctac
601 tgacatagaa tccagagaga ctattggcaa aagctcaaat gagacacagt aacagtttag
661 attcagacag tggctgtggc ataaatcaga aaattgatag ccgcatgatc cctctttgca
721 tgggactggc atctgtgtgg agtaatggtt ccatatgcct cctttcttct ccttattttt
781 aaatttttta aaaatgcatt gcttcttgtg gaagtcaata agtgattctt ccaatacttt
841 ctcattcctt ccccctcagt tatgagacaa tttgcttatt tctcatccat gaatacttgt
901 tgggtcatta aaagtagata ctgaaattac taatggtacg actgacatat tacctcataa
961 atgttactag ctagatgttg aaagttgacc aacaactctc aaaatatgat taagaaaagg
1021 aaacccacag aacagtttga ttccaaaatg atttttttct ttgcacatgc cttacttatt
1081 tggacttaca ttgaaatttt gctttatagg agaaatggga caacttgtcc atcgtttcac
1141 tgtaccaggt atgttaatat ttgacaaaga ataaaagtca ttccatttta aactatccat
//
LOCUS HUMMHDRB 1172 bp ds-DNA PRI 24-JUL-1990
DEFINITION Human MHC class II HLA-DR-beta DR2 gene, complete cds.
ACCESSION M35159
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human (black with haplotype DR2) DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1172)
AUTHORS Demopulos,J.T., Hodge,T.W., Wooten,V. and Acton,R.T.
TITLE A novel DRB1 allele in DR2-positive American blacks
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.W.Hodge, 12-JUN-1990.
A thymidine to cytosine transition at nucleotide position 88
results in an amino acid change from tyrosine to histidine at
residue 30.
Author address: T.W.Hodge;
A-25 Bldg 1 Rm 1226
Centers for Disease Control
1600 Clifton Rd.
Atlanta, GA 30333
FEATURES from to/span description
pept 6 806 MHC HLA-DR-beta chain precursor
/hgml_locus_uid="LV0063D" /nomgen="HLA-DRB1"
/map="6p21.3"
sigp 6 92 MHC HLA-DR-beta chain signal peptide
matp 93 803 MHC-HLA-DR-beta chain
BASE COUNT 272 a 315 c 324 g 261 t
ORIGIN
1 ccagcatggt gtgtctgaag ctccctggag gttcctacat ggcaaagctg acagtgacac
61 tgatggtgct gagctcccca ctggcttcgg ctggggacac ccgaccacgt ttcttgcagc
121 aggataagta tgagtgtcat ttcttcaacg ggacggagcg ggtgcggttc ctgcacagag
181 acatctataa ccaagaggag gacttgcgct tcgacagcga cgtgggggag taccgggcgg
241 tgacggagct ggggcggcct gacgctgagt actggaacag ccagaaggac ttcctggaag
301 acaggcgcgc cgcggtggac acctactgca gacacaacta cggggttggt gagagcttca
361 cagtgcagcg gcgagttgag cctaaggtga ctgtgtatcc tgcaaggacc cagaccctgc
421 agcaccacaa cctcctggtc tgctctgtga atggtttcta tccaggcagc attgaagtca
481 ggtggttccg gaacagccag gaagagaagg ctggggtggt gtccacaggc ctgattcaga
541 atggagactg gaccttccag accctggtga tgctggaaac agttcctcga agtggagagg
601 tttacacctg ccaagtggag cacccaagcg tgacgagccc tctcacagtg gaatggagag
661 cacagtctga atctgcacag agcaagatgc tgagtggagt cgggggcttt gtgctgggcc
721 tgctcttcct tggggccggg ctattcatct acttcaagaa tcagaaaggg cactctggac
781 ttcacccaac aggactcgtg agctgaagtg cagatgacca cattcaaggg ggaaccttct
841 gccccagctt tgcatgatga aaagctttcc tgcttggctc ttattcttcc acaagagagg
901 actttctcag gccctggttg ctaccggttc agcaactctg cagaaaatgt ccatccttgt
961 ggcttcctca gctcctgccc cttggcctga agtcccagca ttgatggcag tgcctcatct
1021 tcaactttag tgctcccctt tacctaaccc tacggcctcc catgcatctg tactccccct
1081 gtgtgccaca aatgcactac gttattaaat ttttctgaag cccagagtta aaaatcatct
1141 gtccacctgg ctccaaagac aaaaaataaa aa
//
LOCUS MUM3TER 55 bp ss-RNA VRL 24-JUL-1990
DEFINITION Mumps virus genome 3' terminus mRNA.
ACCESSION M35284
KEYWORDS .
SOURCE Mumps virus (strain SBL-1), cDNA to mRNA, passed in Vero cells.
ORGANISM Mumps virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Paramyxoviridae; Paramyxovirus.
REFERENCE 1 (bases 1 to 55)
AUTHORS Elango,N., Varsanyi,T.M., Koevamees,J. and Norrby,E.
TITLE Molecular cloning and characterization of six genes, determination
of gene order and intergenic sequences and leader sequence of mumps
virus
JOURNAL J. Gen. Virol. 69, 2893-2900 (1988)
STANDARD simple staff_review
BASE COUNT 9 a 17 c 4 g 24 t 1 others
ORIGIN
1 nggttcccct ctttcttcta ccctataacc atcttgttta tcacattctt tgtca
//
LOCUS MUSIGCR 2210 bp ds-DNA ROD 24-JUL-1990
DEFINITION Murine Ig germline membrane-bound gamma 2a-chain gene C region,
exon 2.
ACCESSION M35032
KEYWORDS constant region; gamma-immunoglobulin; germline;
immunoglobulin heavy-chain.
SOURCE Murine (strain BALB/c) myeloma cell line RPC 5 DNA, clone
CH.M.IgG2a-11.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2210)
AUTHORS Hall,B. and Milcarek,C.
TITLE Sequence and polyadenylation site determination of the murine
immunoglobulin gamma 2a membrane 3' UT region
JOURNAL Mol. Immunol. 26, 819-826 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.L.Hall, 08-JUN-1990.
FEATURES from to/span description
pept / 37 120 IgG-2a chain, exon 2
pre-msg < 1 1682 IgG-2a chain mRNA (alt.)
pre-msg < 1 1759 IgG-2a chain mRNA (alt.)
IVS < 1 36 IgG-2a chain intron A (no splice consensus)
signal 1506 1511 major Poly-A signal
signal 1585 1590 minor Poly-A signal
rpt 1270 1305 (GA)17 repeats
rpt 1395 1440 (CT)23 repeats
site 1536 1543 McLauchlan consensus sequence
site 1552 1559 McLauchlan consensus sequence
site 1625 1633 McLauchlan consensus sequence
site 1536 1540 Berget consensus sequence
BASE COUNT 596 a 543 c 557 g 514 t
ORIGIN
1 aagagaagcg ccaagagcac agcgccgact cctaccgtaa agtggatctt ctcctctgtg
61 gtggagctga agcagacgat ctcccctgac tacagaaaca tgattgggca gggagcctag
121 gccacttcct ctgggatcag aagagcttcc taggccctgc agaagcccat ccatcctact
181 gtgcagccta acagggaggc cacactctag ccctatgact ctctgatcag aactcccatg
241 gtctcctctt tggaggacca cgtgcagtgc aggctttgcc cagacctaaa cacttccaca
301 gcagtcgcca gatatctaac tactccggac cagaagaacc atctccttcc aaaccagcac
361 tagggatctg agatctcaga atgtttgcct aagaagagct ggaaatccag gcttcctgtg
421 ttctgctaca aggacatcag cctggatttg acctggacca cacattttca tctaaatgag
481 ttttccacaa aggacacgtt tcagatcctt gaatgagacc tctacatgga agaccagagt
541 cactataccc aaaggtcact ctgtatcctt gcaccagcta tactggacag cttccttcct
601 ggtacttcag tgaccctggc tgaggaaagg atctgtgacc tcaactgttt ggagagcctc
661 tggaagatgt agtcttctct tcctgctacc accaacatgc tggatctcag atgcagaatc
721 caatccacag acaccactga ccacacaacc tgaagacaag gccattgcca cctccacaga
781 gatgccatcc acactctgtg gagaaataag gagtgctttg tgcagcctct gcaaagctct
841 ggcagggatt agagtataca cactgagtac tgactaggtg accaggcaga aaaacctcca
901 ggagaaggaa caatggggga gagatgtgaa cagatagtta gaaaaagcat ggtgtcacag
961 gtctgctctg tggactgatt tccagattgg accacctaca gcagaaacca tcggttgcag
1021 tggcaatcta ggaggaccaa cctggaatag gagggctgct gtggtcaatg gagagtagac
1081 ctgtatctat ttctccactg cctcttatga ccaataagaa gccagagtct ccagacagaa
1141 agaaagaaag aaagaaagaa agaaagaaag aaagaaagag agagagagag agagagagag
1201 agagaggaag gaaggaagga aggaaggaag gaaggaagga aggaggagga ggaggaggag
1261 gaggaggagg agagagagag agagagagag agagagagag agagagcacc agcttttctg
1321 tgactggaag gaaatgctta gagagcttgg atctttaaag cttctttttt ctagagacca
1381 tgaatgtctt tgttctctct ctctctctct ctctctctct ctctctctct ctctctctct
1441 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gcgtgcatgc acgctattgt tttggcattt
1501 gaaacaataa aacattcttt taatattctg tatctcatgg ttccccttct gtgtggatca
1561 gccctaacac ccaggaacag ggacaataaa cagaccacag ccatgtacag ccttctacct
1621 cccttctggt tctgacctcc cagaggtccc tcagtgggcc cctcacagct gggtttcttc
1681 cctggcagtg ccaccaagag ctcaggcacc tctgagctgg aggctgtcct gatgccatag
1741 gcaggctatg gagcagagat gatgaccacg gtgactcagg tgagccaggc aaagcctccc
1801 atgccagaag agaagcgtgt ggtactcact ggcctcgggc tgctacggat tcagcaaaga
1861 gcatggatcc agctacctcc atagtggatg cttattgtct caagctgtgt tcagagaaat
1921 ttcagatcag gtattaatag ggtgggttct ccttacctgg taggtgcctt ctcccatggc
1981 acctcacagg ctctccatct gtgtgtctgg gtcctgatct cttctcataa gtacacagac
2041 aggctggaat ggtacaacct ggcctcatta taacttacca gttatgatcc tatctgcaag
2101 gcaggctcag tctgaggtct ggcggcttaa atgacaattg gtgtgtcttc tagaacagaa
2161 tttagtccac attagcgcgc ctgtggacca cagtatatct ctctggcctc
//
LOCUS YSCARF2 1012 bp ds-DNA PLN 24-JUL-1990
DEFINITION S.cerevisiae ADP-ribosylation factor 2 (ARF2) gene, complete cds.
ACCESSION M35158
KEYWORDS ADP-ribosylation factor 2.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1012)
AUTHORS Stearns,T., Hoyt,M.A., Botstein,D. and Kahn,R.A.
TITLE ADP-ribosylation factor is an essential protein in yeast and is
encoded by two genes
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.A.Kahn, 05-JUN-1990.
Author address: R.A.Kahn
Lab. of Biol. Chem., DCT
Bldg. 37; Room 5D-02
National Cancer Institute
Bethesda, MD 20892
FEATURES from to/span description
pept 266 811 ADP-ribosylation factor 2 (ARF2)
binding 24 30 consensus GTP-binding site
binding 67 71 consensus GTP-binding site
binding 126 129 consensus GTP-binding site
BASE COUNT 301 a 180 c 222 g 309 t
ORIGIN
1 gcctgcaggt acgccccttt atttgatcag gaagccgtat tgattatcta atagggccta
61 gttatcctaa ttgtggggag tcgagcagta cggctctgat gtttttcgaa cgaagataag
121 gagttgacat acaaagtcaa cagaagttct tcttgttagc gtctctgtgc tcaatatctc
181 tctttttttc tttaagtagt aattactaac atcagccaac caatagagat aaaaaaaaaa
241 ggaattaaga tttcatagag aaaagatggg tctatacgct tctaagttat tcagcaatct
301 ttttggcaac aaagagatgc gtatacttat ggttggtcta gatggtgccg gtaagaccac
361 cgttttgtac aagttgaagt tgggcgaagt tatcactacc attccaacca ttggtttcaa
421 cgttgagact gtccaatata agaacatttc cttcactgtc tgggacgtcg gtggacaaga
481 caggattaga tctttatgga gacactacta cagaaacacc gaaggtgtta tttttgtcat
541 cgattccaac gatagatcgc gtattggtga agccagagaa gtcatgcaga gaatgctgaa
601 tgaagatgaa ttgagaaatg ctgtctggtt agtcttcgct aacaaacaag atttgccaga
661 agccatgtct gctgctgaaa tcaccgaaaa attaggttta cattctatta gaaaccgtcc
721 atggtttatc cagtctactt gtgcaacctc gggtgaaggt ctgtacgaag gtctggagtg
781 gttaagcaac aacttgaaga atcaatccta atctaaatct gtatagaacg tttagtcatg
841 cgcaccttgt gtgttttgtt tctagattgt tttattttta tgattgttga agatataaac
901 cactgtatag ttgtataaga taggataatg atggtgcact gaaaataaac ttactagctc
961 tttaatattg caacggcttg taacgggcga ctctagagga tccccgggta cc
//
LOCUS CODCPRRKA 94 bp ss-RNA RNA 24-JUL-1990
DEFINITION Codium fragile chloroplast 4.5S RNA, complete cds.
ACCESSION M35276 M15192
KEYWORDS 4.5S RNA.
SOURCE C.fragile chloroplast RNA.
ORGANISM Chloroplast Codium fragile
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Brypsidales; Codiaceae; Codium fragile.
REFERENCE 1 (bases 1 to 94)
AUTHORS Francis,M.A., Balint,R.F. and Dudock,B.S.
TITLE A novel variety of 4.5 S RNA from Codium fragile chloroplasts
JOURNAL J. Biol. Chem. 262, 1848-1854 (1987)
STANDARD simple staff_review
FEATURES from to/span description
RNA 1 94 4.5 S RNA
BASE COUNT 35 a 11 c 14 g 34 t
ORIGIN
1 aagtcctagt tgctataaat tcttaaatca aattatgtca gatttttaat aaaaagcagc
61 atttgtattt gaaaattgtt taggaactag gcac
//
LOCUS HAMAPBRBD 2339 bp ds-DNA ROD 24-JUL-1990
DEFINITION Hamster apolipoprotein (apoB) gene, partial cds
(LDL receptor-binding domain).
ACCESSION M35187
KEYWORDS apolipoprotein B.
SOURCE Hamster DNA.
ORGANISM Mesocricetus auratus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 2339)
AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N.
TITLE Sequence of the putative low-density lipoprotein receptor-binding
regions of apolipoprotein B in mouse and hamster
JOURNAL Gene 87, 309-310 (1990)
STANDARD simple staff_review
COMMENT Phone call to T.J.Smith on 26-JUN-1990 made sequence clarifications
on line 4 and line 17 of printed sequence. The hamster sequence
should be 1 bp to the left on these lines.
FEATURES from to/span description
pept < 1 > 2339 apolipoprotein (apoB) (LDL receptor-binding
domain) (AA at 1)
BASE COUNT 725 a 519 c 450 g 645 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccagc ttcctcgcct ctcacacaca attgagatac ctgcttttgg cagacttcat
61 ggaatcctga aaatccagtc tcccctcttt atattagatg caaatgccaa catacagaat
121 gtaactactt tagagaacaa agcagagatt gtggcctcca tcgctgctac aggagagtcc
181 gaaattgaag ctctcaattt tgattttcaa gcacaagctc aattcttgga gctaaaccct
241 aatcctctga tcctgaagga atccatgaac ttctccagca agcatgcgag aatggagcat
301 gagggtgaga tactattttc tggaaagttc attgagggaa aattggacac ggtcgcaagt
361 ttacagacag agaaaaatat ggtggagttt aataatggta tgattgtcaa gataaacaat
421 ccaatcatcc ttgacagtca cacaaagtat tttcacaagt tgagtatccc caggctggac
481 ttctccagta aggcttcctt taacaatgaa atcaagatgc tattagaagc tggacatgta
541 gcatggactt cttcagggac tgggtcatgg aattgggcct gtcccaactt ctcagatgag
601 ggcacacatt cgtccaaaat tagcttcact gtagaaggac ccattgcttt ttttggcttg
661 tctaataaca tcaacggcaa acacctgagg gttatccaga aattggctta tgaatctggc
721 ttcctcaact attccatgtt ggaagttgag tcaaaagttg aatctcagca tgtgggttcc
781 agcattctaa ctggcaaggg aacggtactg ctcagggagg caaaggcaga aatgactggc
841 gagcacaatg ctgacttgaa tggaaaagtt attgggactt tgaaaaactc tctttccttt
901 tcagcacaac catttatgat tactgcatcc acaaataatg atgggaattt gaaagttagt
961 tttccactaa agttgactgg gaaaatagac ttcctgaata actatgcact atttttgagt
1021 cctcatgccc agcaagcaag ctggcaagtg agtgctaggt tcaatcagta caaatataat
1081 caaaattttt ctgctataaa caatgaacat aacatagaag cccatgtagg aatgaatgga
1141 gatgccaacc tggatttctt aaccatacct ctaacaattc ctgaagtgaa actaccttac
1201 atagggctca cgactccctt gctgaaggat ttctccatat gggaagaaac aggcttgaaa
1261 gaatttttga agacaacaaa gcaatcgttt gatttaagtg taaaagctca atataaaaag
1321 aacagagaca ggcattccat tgcgattcct ctgaatgggt tttatgagtt tattctcaac
1381 aatgtcgact ccgggatagg gaagattggg aaagtcagag acagcgcatt agactatctt
1441 atttcatcct ataatgaagc aaaaaacaag tttgaaaatt cccttattca gccctccagg
1501 acctttcaaa agcgtggata cactatccca tttgtcaaca ttgaagtgac tccattcact
1561 gtagagacac tggcctccag ccatgtgatc ccaaaagcaa taaatacccc cagtgttcac
1621 attctgggcc ctaatgtcat tgtgccttca tacaggttag tgctgccctc cctggagctg
1681 ccagtccttc gtgtccccag gaatctactc aagttttccc tcccagattt caaggaattg
1741 agaacaattg acaatattta tattccagct cttggcaatt ttacctatga tttttccttt
1801 aaatcaagtg tcatcacgct gaataccaac gttggacttt ataaccggtc agacatcgtt
1861 gctcatttcc tttcttcctc ttcatttgtc acggatgccc tgcagtacaa attagagggt
1921 acttcacgtc tgactcggaa aagaggattg aagctagcca cagccgactc tctcactaac
1981 aaatttgtaa agggcaatca tgatagcacc tttagcttaa ccaagaaaaa catggaagca
2041 tcagtgaaaa caactgcaaa cctccatgct cccattttaa caatgaactt caagcaggaa
2101 cttaatggaa atgccaagtc aaagcccatt gtctcatcat ccattgaact aaactatgac
2161 ttcaattcct caaagctgta ctctactgct aaaggaggtg ttgaccacaa gtttagctta
2221 gaaagtctca cttcctactt ttccattgag tcatccacca aaggaaatat caagggatct
2281 gtcctttccc aggaatattc aggaagtgtt gccagtgagg ccaacacata cctgaattc
//
LOCUS MUSAPBRBD 2354 bp ds-DNA ROD 24-JUL-1990
DEFINITION Mouse apolipoprotein (apoB) gene, partial cds (LDL receptor-binding
domain).
ACCESSION M35186
KEYWORDS apolipoprotein B.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2354)
AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N.
TITLE Sequence of the putative low-density lipoprotein receptor-binding
regions of apolipoprotein B in mouse and hamster
JOURNAL Gene 87, 309-310 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 2354 apolipoprotein (apoB) (LDL receptor-binding
domain) (AA at 1)
BASE COUNT 757 a 525 c 423 g 649 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccaac ttcctcacct ctcacataca attgaaatac ctgcttttgg caaactgcat
61 agcatcctta agatccaatc tcctctcttt atattagatg ctaatgccaa catacagaat
121 gtaacaactt cagggaacaa agcagagatt gtggcttctg tcactgctaa aggagagtcc
181 caatttgaag ctctcaattt tgattttcaa gcacaagctc aattcctgga gttaaatcct
241 catcctccag tcctgaagga atccatgaac ttctccagta agcatgtgag aatggagcat
301 gagggtgaga tagtatttga tggaaaggcc attgagggga aatcagacac agtcgcaagt
361 ttacacacag agaaaaatga agtagagttt aataatggta tgactgtcaa agtaaacaat
421 cagctcaccc ttgacagtca cacaaagtac ttccacaagt tgagtgttcc taggctggac
481 ttctccagta aggcttctct taataatgaa atcaagacac tattagaagc tggacatgtg
541 gcattgacat cttcagggac agggtcatgg aactgggcct gtcccaactt ctcggatgaa
601 ggcatacatt cgtcccaaat tagctttact gtggatggtc ccattgcttt tgttggacta
661 tccaataaca taaatggcaa acacttacgg gtcatccaaa aactgactta tgaatctggc
721 ttcctcaact attctaagtt tgaagttgag tcaaaagttg aatctcagca cgtgggctcc
781 agcattctaa cagccaatgg tcgggcactg ctcaaggacg caaaggcaga aatgactggt
841 gagcacaatg ccaacttaaa tggaaaagtt attggaactt tgaaaaattc tctcttcttt
901 tcagcacaac catttgagat tactgcatcc acaaataatg aaggaaattt gaaagtgggt
961 tttccactaa agctgactgg gaaaatagac ttcctgaata actatgcatt gtttctgagt
1021 ccccgtgccc aacaagcaag ctggcaagcg agtaccagat tcaatcagta caaatacaat
1081 caaaactttt ctgctataaa caatgaacac aacatagaag ccagtatagg aatgaatgga
1141 gatgccaacc tggatttctt aaacatacct ttaacaattc ctgaaattaa cttgccttac
1201 acggagttca aaactccctt actgaaggat ttctccatat gggaagaaac aggcttgaaa
1261 gaatttttga agacaacaaa gcaatcattt gatttgagtg taaaggctca atataaaaag
1321 aacagtgaca agcattccat tgttgtccct ctgggtatgt tttatgaatt tattctcaac
1381 aatgtcaatt cgtgggacag aaaatttgag aaagtcagaa acaatgcttt acattttctt
1441 accacctcct ataatgaagc aaaaattaag gttgataagt acaaaactga aaattccctt
1501 aatcagccct ctgggacctt tcaaaatcat ggctacacta tcccagttgt caacattgaa
1561 gtatctccat ttgctgtaga gacactggct tccaggcatg tgatccccac agcaataagc
1621 accccaagtg tcacaatccc tggtcctaac atcatggtgc cttcatacaa gttagtgctg
1681 ccacccctgg agttgccagt tttccatggt cctgggaatc tattcaagtt tttcctccca
1741 gatttcaagg gattcaacac tattgacaat atttatattc cagccatggg caactttacc
1801 tatgactttt cttttaaatc aagtgtcatc acactgaata ccaatgctgg actttataac
1861 caatcagata tcgttgccca tttcctttct tcctcttcat ttgtcactga cgccctgcag
1921 tacaaattag agggaacatc acgtctgatg cgaaaaaggg gattgaaact agccacagct
1981 gtctctctaa ctaacaaatt tgtaaagggc agtcatgaca gcaccattag tttaaccaag
2041 aaaaacatgg aagcatcagt gagaacaact gccaacctcc atgctcccat attctcaatg
2101 aacttcaagc aggaacttaa tggaaatacc aagtcaaaac ccactgtttc atcatccatt
2161 gaactaaact atgacttcaa ttcctcaaag ctgcactcta ctgcaacagg aggcattgat
2221 cacaagttca gcttagaaag tctcacttcc tacttttcca ttgagtcatt caccaaagga
2281 aatatcaaga gttccttcct ttctcaggaa tattcaggaa gtgttgccaa tgaagccaat
2341 gtatatctga attc
//
LOCUS DDINDK 518 bp ss-mRNA INV 24-JUL-1990
DEFINITION D.discoideum nucleoside diphosphate kinase Gip17 mRNA, complete
cds.
ACCESSION J05457
KEYWORDS nucleoside diphosphate kinase.
SOURCE D.discoideum (strain AX3) 3 hour starvation, cDNA to mRNA, clones
G[821,951].
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 518)
AUTHORS Lacombe,M.L., Wallet,V., Troll,H. and Veron,M.
TITLE Functional cloning of a nucleoside diphosphate kinase from
Dictyostelium discoideum
JOURNAL J. Biol. Chem. 265, 10012-10018 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.L.Lacombe, 16-MAY-1990, for release after publication.
FEATURES from to/span description
pept 19 486 nucleoside diphosphate kinase Gip17 (EC
2.7.4.6)
mRNA < 1 518 Gip17 mRNA
signal 485 490 poly-A signal
BASE COUNT 164 a 97 c 97 g 160 t
ORIGIN
1 taacctaata agattaaaat gtccacaaat aaagtaaaca aagaaagaac tttccttgct
61 gttaaaccag acggtgttgc tcgtggttta gttggtgaaa tcatcgccag atacgaaaag
121 aaaggtttcg ttttagttgg tttaaaacaa ttagttccaa ccaaagactt agctgaatct
181 cactatgctg aacacaaaga aagaccattc ttcggtggtt tagtctcatt cattacctct
241 ggtccagtcg ttgctatggt cttcgaaggt aaaggtgttg ttgcctctgc ccgtttaatg
301 atcggtgtta ccaacccatt agcctcagcc ccaggttcaa ttcgtggtga tttcggtgtt
361 gatgttggta gaaacatcat ccacggttct gattcagttg aatctgccaa cagagaaatt
421 gctttatggt tcaaaccaga agaattatta actgaagtta aaccaaaccc aaatttatac
481 gaataaataa tttagctaaa ttctaaatca tttaatgt
//
LOCUS DDINDKA 184 bp ss-mRNA INV 24-JUL-1990
DEFINITION D.discoideum nucleoside diphosphate kinase Gip17 mRNA, 5' end.
ACCESSION M36679
KEYWORDS nucleoside diphosphate kinase.
SOURCE D.discoideum (strain AX3) 3 hour starvation, cDNA to mRNA, clone
G821.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 184)
AUTHORS Lacombe,M.L., Wallet,V., Troll,H. and Veron,M.
TITLE Functional cloning of a nucleoside diphosphate kinase from
Dictyostelium discoideum
JOURNAL J. Biol. Chem. 265, 10012-10018 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept 137 > 184 nucleoside diphosphate kinase Gip17 (EC
2.7.4.6)
BASE COUNT 77 a 21 c 14 g 72 t
ORIGIN
1 atttttgatt ttctttattc ttatttgtaa ttgtatataa tagtaaaaaa aaaaaattct
61 cagatttata atttattatt atttttcaaa tttcttcaat tctttgtatt aaatattact
121 caaacataaa ataaaaatgg cctcaaccaa aagaagatta aacagagaag aatcaatgtc
181 caca
//GenBank-Updates@genbank.bio.net (07/25/90)
LOCUS DOGRAB2A 656 bp ss-mRNA MAM 25-JUL-1990
DEFINITION C.familiaris GTP-binding protein (rab2) mRNA, complete cds.
ACCESSION M35521
KEYWORDS GTP-binding protein.
SOURCE C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to
mRNA, clone II.
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 656)
AUTHORS Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M.
TITLE Localization of low-molecular weight GTP binding proteins to
exocytic and endocytic compartments
JOURNAL Cell (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Chavrier, 22-JUN-1990.
Base-pairs 508 to 564 form a synthetic peptide used to raise
antibodies.
FEATURES from to/span description
pept 7 645 GTP-binding protein (rab2)
BASE COUNT 209 a 124 c 158 g 165 t
ORIGIN
1 gcggccatgg cgtacgctta tctcttcaag tacatcatca tcggcgacac aggtgttggt
61 aaatcatgct tattgctaca gtttacagac aagaggtttc agccagtgca tgacctgact
121 atcggtgtag agtttggtgc tcgaatgata actattgatg ggaaacagat aaaacttcag
181 atatgggata cggcagggca agagtccttt cgttccatca caaggtcata ttacagaggt
241 gcagcagggg ctttactagt gtatgatatt acaaggagag atacattcaa ccacttgaca
301 acctggttag aagatgcccg ccagcattcc aattccaaca tggtcattat gcttattgga
361 aataaaagtg atttagaatc aagaagagaa gtaaaaaaag aagaaggtga agcttttgca
421 cgagaacatg gacttatctt catggaaact tctgctaaga ctgcttccaa tgtagaagag
481 gcatttatta atacagcaaa agaaatttat gagaaaatcc aagaaggagt ctttgacatt
541 aataatgagg caaacggcat taaaattggc cctcagcacg ctgctactaa tgccacacac
601 gcgggcaatc agggaggaca gcaggccggg ggaggctgct gttgagtccg tttttt
//
LOCUS DOGRAB5A 796 bp ss-mRNA MAM 25-JUL-1990
DEFINITION C.familiaris GTP-binding protein (rab5) mRNA, complete cds.
ACCESSION M35520
KEYWORDS GTP-binding protein.
SOURCE C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to
mRNA, clone II.
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 796)
AUTHORS Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M.
TITLE Localization of low-molecular weight GTP binding proteins to
exocytic and endocytic compartments
JOURNAL Cell (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Chavrier, 22-JUN-1990.
Base-pairs 664 to 711 form a synthetic peptide used to raise
antibodies.
FEATURES from to/span description
pept 121 768 GTP-binding protein (rab5)
BASE COUNT 267 a 163 c 174 g 192 t
ORIGIN
1 ccgcggctcc tcgtgctgcg gcctcaggtt tctgtatatc cagaaagaaa aaatttgaca
61 ccttgcatcc tggaagttca tttaagagac tgaaattagg gacttctttc aaatttggac
121 atggctaatc gaggagcaac aagacccaac gggccaaata ctggaaataa aatatgccag
181 ttcaaactag tacttctggg agagtctgct gttggcaaat caagcctagt gcttcgtttt
241 gtgaagggcc aatttcatga atttcaagag agtaccatag gggctgcttt tctaacccaa
301 actgtgtgtc ttgatgatac aacagtaaag tttgaaatat gggatacagc tggtcaagaa
361 cgataccata gcttagcacc aatgtactac agaggagcac aagcagccat agttgtatat
421 gatatcacaa atgaggagtc ctttgccaga gccaaaaact gggttaaaga acttcagagg
481 caagccagtc ctaacattgt aatagcttta tcaggaaaca aggctgatct tgcaaataaa
541 agagctgtcg atttccagga agcacagtcc tatgcagatg acaacagttt attattcatg
601 gagacatcag ctaaaacatc gatgaacgta aatgaaatat tcatggcaat agctaaaaag
661 ttgccaaaga acgaaccaca gaatccagga gcaaattctg ccagaggaag aggagtagac
721 cttactgaac ccacgcagcc aaccaggagt cagtgttgta gtaactaaac ctccagtttg
781 aacttcctgg aatatc
//
LOCUS DOGRAB7A 811 bp ss-mRNA MAM 25-JUL-1990
DEFINITION C.familiaris GTP-binding protein (rab7) mRNA, complete cds.
ACCESSION M35522
KEYWORDS GTP-binding protein.
SOURCE C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to
mRNA, clone II.
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 811)
AUTHORS Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M.
TITLE Localization of low-molecular weight GTP binding proteins to
exocytic and endocytic compartments
JOURNAL Cell (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Chavrier, 22-JUN-1990.
Base-pairs 542 to 592 form a synthetic peptide used to raise
antibodies.
FEATURES from to/span description
pept 20 643 GTP-binding protein (rab7)
BASE COUNT 251 a 206 c 194 g 160 t
ORIGIN
1 gagcggctgc gtttgaagga tgacctctag gaagaaagtg ttgctgaagg ttatcatcct
61 gggagattct ggagttggta agacatcact catgaaccag tatgtgaaca agaaattcag
121 taatcagtac aaagctacaa taggagcaga ctttctgaca aaggaggtga tggtggatga
181 cagactagtt acaatgcaga tctgggacac agcaggccag gaacggttcc agtcccttgg
241 tgtggccttc tacagaggtg cagactgctg cgttctggta tttgacgtta ctgcccccaa
301 cacattcaaa accctcgata gctggagaga tgagtttctc atccaggcca gtccccggga
361 tcctgaaaac ttccctttcg ttgtgttggg aaacaagatt gacctcgaaa acagacaagt
421 ggccacaaag cgggcacagg cctggtgcta cagcaaaaac aacattccct acttcgagac
481 cagtgccaag gaggccatca atgtggagca ggcgttccag acgattgcaa ggaatgcact
541 taaacaggaa acagaggtgg agctgtacaa tgaattccct gaacccatca aactggacaa
601 gaacgaccgg gccaagacct cagcggaaag ctgcagttgc tgaaggggca gtgagagcag
661 agcacagagt ccttcacaaa caaagaacac acttaggcct tccaacacga gcccccttct
721 tctcttccaa acaaaacata aagtcatctc tcgaatccag ctgccaaaag accctaccaa
781 acacttcacc ctgacacaca catacacaca c
//
LOCUS HUMU7AA 649 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human U7 small nuclear RNA pseudogene, fragment 32sm.
ACCESSION M35537
KEYWORDS U7 small nuclear RNA; pseudogene.
SOURCE Human liver DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 649)
AUTHORS Soldati,D. and Schimperli,D.
TITLE Structures of four human pseudogenes for U7 small nuclear RNA
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.Schimperli, 22-JUN-1990.
FEATURES from to/span description
uRNA.ps 293 352 pseudo-U7 uRNA
BASE COUNT 217 a 107 c 115 g 199 t 11 others
ORIGIN
1 attatggcag agtacatgta acatatagtt tgctattcna actgattttt gacaaagata
61 caacagcana tcaatggagg aacaatagcn tttttaacaa atggtgttgg cacaactgga
121 caactgtaag nnaaagaaaa tgaanttcaa tctanatctc anaccgtatt aaaaaaaact
181 caaagtgggc cacagactta gatataaaat gtaaaactat aacactttta gaaaanatat
241 aggagaanat ctatgggatt tagggcaaaa gcatgattca aaaaaggaaa gtcagtgtta
301 cagccctttt agaatttgtc tagcaggttt tctggttttc cagaaaacct ccacataaaa
361 aggaaaaaga aaaaaaggaa aaagtaataa attagtatga attgagcatt ttaatgattc
421 tattttattg cctttgttgg cttattaaat ataactctct gttttgttat tttagtggtt
481 gctttaggtt ttatagtaat acatctttaa cctgttacag tccaccttct ttttgtttgt
541 ttgttttgga agcagggtct cactctgtca ccaaggctag agtgcagtgg cactatcacg
601 gctcactgca acctcaacct cccaggctcc agngttcctc ctgctgcag
//
LOCUS HUMU7AB 521 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human U7 small nuclear RNA pseudogene, fragment 32BG.
ACCESSION M35538
KEYWORDS U7 small nuclear RNA; pseudogene.
SOURCE Human liver DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 521)
AUTHORS Soldati,D. and Schimperli,D.
TITLE Structures of four human pseudogenes for U7 small nuclear RNA
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.Schimperli, 22-JUN-1990.
FEATURES from to/span description
uRNA.ps 295 344 pseudo-U7 uRNA
BASE COUNT 186 a 102 c 89 g 144 t
ORIGIN
1 tttcttcttt ttccacctct tgtctattca ggccctcagt gaattggatc atgctcaccc
61 acatcagggc aggcaatcta cttattgagt tcactgattc aaatgataac ctcacctgga
121 aaaatcctca cagacccaga aataatgttt aatctaagca cccatggcca gtcaagttga
181 gacataaaat tagccatcac agtacaggca tacctgggaa atgacgcagg ttcagttcca
241 gaccatcaca ataaagcaaa tattgcaata aagtgagtca caaaaagaaa aagtcagtgt
301 tacagctttt agaatttgtc tagcaggttt tctggaaaac cttcacaaaa aaaggagaaa
361 gagtgcatat aaaatgctta tgttgatacc atactgtagt ctattaagtg tgcaatagca
421 ttatgtctat aaaacaatgt acatacttta aaaatatttt attgttaaaa catgctatca
481 cagagacaca aagtgagcac atgctgttgg aaaaatggta c
//
LOCUS HUMU7AC 513 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human U7 small nuclear RNA pseudogene.
ACCESSION M35539
KEYWORDS U7 small nuclear RNA; pseudogene.
SOURCE Human liver DNA, clone 25H.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 513)
AUTHORS Soldati,D. and Schimperli,D.
TITLE Structures of four human pseudogenes for U7 small nuclear RNA
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.Schimperli, 22-JUN-1990.
FEATURES from to/span description
uRNA.ps 204 264 pseudo-U7 uRNA
BASE COUNT 127 a 83 c 81 g 222 t
ORIGIN
1 aattgtctgt ctttcatatt tttgtcattc tcgtgagtgt gaagtggtat ctcattgtgg
61 ttttgatttg catttcccta atgactaatg gtgttgaata tcttttcata tgcttataag
121 ccatttatat gtctttggag aaattctttt caaatctctt gctcatttta aaattaggtt
181 gtcattttat tacggagttg cattagtgtt acagctcttt tagaatttgt ctagcaggtt
241 ttctgatttt tacccggaac ccctccccag ccaaaagtaa aagaaaaaaa aagctgcaat
301 agttctttat atagtttaga tacaaggccc ttatcagata tttgattttc aaatattgtc
361 tcccattctg tgagttgttt tttcactctc ttgatggtgt catatgaagc acaaattttt
421 ttttttattt tgataatgtc ccatttatct atgtattttt tcttttcatt tgtgcttttg
481 gtgtcgtacc taagaaactg ctgcttaact caa
//
LOCUS HUMU7AD 418 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human U7 small nuclear RNA pseudogene, fragment 36h.
ACCESSION M35540
KEYWORDS U7 small nuclear RNA; pseudogene.
SOURCE Human liver DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 418)
AUTHORS Soldati,D. and Schimperli,D.
TITLE Structures of four human pseudogenes for U7 small nuclear RNA
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.Schimperli, 22-JUN-1990.
FEATURES from to/span description
uRNA.ps 229 286 pseudo-U7 uRNA
BASE COUNT 128 a 58 c 106 g 122 t 4 others
ORIGIN
1 agaggcacat gtcaagatga agctctggtg aagaattgat caaaaatagt ggcggagtga
61 gatggagatt taaatccaag ggctgattta tgaaggcttc aaagattttt tttttttaaa
121 gaaagaacat agattagttg tttctgaggg ctggagggga cagagataga ggcggcgacg
181 gaaggatcct tcaggtttct tcttgaggtg attaaacgtt ctgaaatcgc gtgttacagc
241 tcttttggaa tttgtctagc aggttttctg gttttcactg caaaacccca cagtnnnaaa
301 acagaaaaaa aaawttatcc taaaattggg ctgtggtaat ggttgcgcat atgctgtgaa
361 taggcttcca aatattgaaa tgtccacttc aaacgagtga actgtatggt atgtgaat
//
LOCUS SCMPMYA1 3156 bp ss-mRNA INV 25-JUL-1990
DEFINITION S.mansoni paramyosin mRNA, complete cds.
ACCESSION M35499
KEYWORDS paramyosin.
SEGMENT 1 of 2
SOURCE S.mansoni (strain Puerto Rican) adult worm, cDNA to mRNA, clones
Pmy[1,8,11,15].
ORGANISM Schistosoma mansoni
Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda;
Digenea; Strigeidida; Strigeata; Schistosomatoidea;
Schistosomatidae.
REFERENCE 1 (bases 1 to 3156)
AUTHORS Laclette,J.P., Landa,A., Arcos,L., Willms,K., Davis,A.E. and
Shoemaker,C.B.
TITLE Paramyosin is the Schistosoma mansoni (trematoda) homologue of
antigen B from Taenia solium (cestoda)
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.P.Laclette, 22-JUN-1990.
Author address: J.P.Laclette
Department of Tropical Public Health
Harvard School of Public Health
665 Huntington Avenue
Boston, MA 02115
Email: zehm%hscvax%harvunxwxw.edu
FEATURES from to/span description
pept 47 2647 paramyosin
mRNA < 1 > 3156 paramyosin mRNA
BASE COUNT 1279 a 435 c 516 g 923 t 3 others
ORIGIN
1 tctttcacta atattaaaaa gaaaaattta aaaaaaaaga ggaaaaatga tgaatcatga
61 tacagaatct catgtgaaaa tatcaagaac tatttatcga ggagtatcac caagtacaac
121 aagacttgag agtcgagtac gggaattaga agatcttttg gatttagaac gtgatgcaag
181 agttcgagct gaacgacatg ctgctgattt aggttttcaa gtggatgcat tatcagaacg
241 tttagatgaa gctggaggtt ctacaacaca aactcaagaa ttattaaaac gtcgtgaaat
301 ggaaatcaat aaactacgta aagatttaga aaatgctaat gcatcacttg aactagctga
361 aacatcaatg agacgtcgac atcaaacagc attgaatgaa ttagctttgg aagttgaaaa
421 tttacaaaaa caaaaaggaa aggctgaaaa agacaaaagt catttgatta tggaagtgga
481 taatgttcta ggacaattag atggtgcatt aaaagctaag caatcagctg aatcaaaatt
541 agaaggatta gatagtcaat taaatcgttt aaaatcatta accgacgatt tacaaagaca
601 attaactgaa ttaaataatg ctaaatcaag attaacatca gaaaattttg aattattaca
661 tataaatcaa gattatgaag cacaaatatt aaattattct aaagctaaat catcacttga
721 aagtcaagta gatgatttaa aaagatcatt agatgatgaa gctaaaaatc gttttaatct
781 tcaagctcaa cttacatcac ttcaaatgga ttatgataat ttacaagcta aatatgatga
841 agaaagtgaa gaagctagta atttacgtag tcaagtatct aaatttaacg ctgatattgc
901 tgcattaaaa tcgaaatttg aacgtgaact tatgagtaaa acagaagaat tcgaagaaat
961 gaagaggaaa ttcactatga gaattaccga acttgaagat actgctgaaa gagaacgatt
1021 aaaagcggta tcattagaaa aacttaaaac aaaattaaca ttagaaatta aagatttaca
1081 atctgaaata gaaagtcttt cattagaaaa tagtgaatta attcgtcgtg ctaaagctgc
1141 tgaatcatta gcttctgatt tacaacgtcg tgttgatgaa ttaacaattg aagtgaatac
1201 attaacatca caaaatagtc aattagaaag tgaaaatcta cgtttaaaaa gtttagttaa
1261 tgatttaacg gataaaaata atttattaga acgtgaaaat cgtcaaatga atgatcaagt
1321 caaagaatta aaaagttcac ttcgtgatgc taatcgtcgt cttactgatt tagaagcatt
1381 aagatcgcaa ttagaggctg aaagagataa tcttgcatca gctttacatg atgctgaaga
1441 agcattacat gatatggatc aaaagtatca agcatcacaa gctgcattaa atcatttgaa
1501 atctgaaatg gaacaaaggc ttagagaaag agatgaagaa ttagaaagtt taagaaaaag
1561 tactactaga acaattgaag aattaactgt tacaataact gaaatggaag ttaaatataa
1621 atcagaatta tcacgtttaa aaaaacgtta tgaatcaaat attgctgatt tagaaattca
1681 acttgataca gctaataaag ctaatgcaaa tcttatgaaa gagaataaaa atttatcaca
1741 acgtgttaaa gatttagaaa catttttaga tgaagaacgt cgtcttcgtg aagcagctga
1801 aaataattta caaattactg aacataaacg tttacaatta gcaaatgaaa ttgaagaaat
1861 acgtagtaca ttagaaaatt tagaacgttt acgtaaacat gctgaaacag aacttgaaga
1921 agctcaatca cgtgttagtg aattaactat tcaagttaat acattaacta atgataaacg
1981 tcgtcttgaa ggtgatattg gtgtaatgca ggctgatatg gatgatgcta ttaatgctaa
2041 acaagcttct gaagatcgag caattagatt aaataatgaa gtattacgtt tagctgatga
2101 attacgtcaa gaacaaggaa attataaaca tgctgaagca ttaagaaaac aattagaaat
2161 tgaaatacgt gaaattacag ttaaattaga agaagctgaa gcatctgcta cacgtgaagg
2221 tcgtcgtatg gtacaaaaat tacaggctcg tgtacgtgaa cttgaatcag aattcgatgg
2281 tgaatcaaga agatgtaaag atgcattagc tcaagcacgt aaatttgaac gtcaatataa
2341 agaattacaa acacaagctg aagatgatcg tcgtatggta ttagaacttc aagatttatt
2401 agataaaact caaatgaaaa tgaaagccta taaacgtcaa ttggaagaaa tggaagaagt
2461 atctcaaatt acaatgaata aatatcgtaa agcccaacaa caaattgaag aagctgaaca
2521 tcgtgcagat atggctgaac gtacagtcac tgtacgtcgt gttggtccag gtggacgtgc
2581 tgtttctgta gcacgtgaat tatctgtcac atcaaataga ggaatgagag caacaagtat
2641 gatgtaaagc acttaaataa taataataat agtgatacta tacacatata caaacgccta
2701 tatctttctt tctctctttg tttcgttttc ctcatcttcg ctttttttta gtcatgatat
2761 tcatctaaat gaggaaatta tcaataatga cctattatta ttcaatgtgc tttactttac
2821 ttcccaccct aaatctcctc ggtatatcgt ttcccttttt ttttcttttt ttttctaaaa
2881 acaaaaaatt ctaaaagtga aagacgaaaa aaaaaaannn cagaaatttg tttcctcctc
2941 tcatattttc tctttgttct ttttattcat ttcatttatt gtattattaa tattgctatt
3001 attattattg ttattactac ctaaccgatg gtttcaacga cagcaatctc ccatatttct
3061 acacacacac acacacaaca cacacaacac acaaaagtat ctgtgcaatc gtaatagata
3121 atctttattt attgattaaa aaaaaaaaaa aaaaaa
//
LOCUS SCMPMYA2 217 bp ss-mRNA INV 25-JUL-1990
DEFINITION S.mansoni paramyosin mRNA, 3' flank.
ACCESSION M36871
KEYWORDS paramyosin.
SEGMENT 2 of 2
SOURCE S.mansoni (strain Puerto Rican) adult worm, cDNA to mRNA, clones
Pmy[1,8,11,15].
ORGANISM Schistosoma mansoni
Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda;
Digenea; Strigeidida; Strigeata; Schistosomatoidea;
Schistosomatidae.
REFERENCE 1 (bases 1 to 217)
AUTHORS Laclette,J.P., Landa,A., Arcos,L., Willms,K., Davis,A.E. and
Shoemaker,C.B.
TITLE Paramyosin is the Schistosoma mansoni (trematoda) homologue of
antigen B from Taenia solium (cestoda)
JOURNAL Unpublished (1990) Harvard 665 Huntington Avenue, Boston, MA 02115
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.P.Laclette, 22-JUN-1990.
Author address: J.P.Laclette
Department of Tropical Public Health
Harvard School of Public Health
665 Huntington Avenue
Boston, MA 02115
Email: zehm%hscvax%harvunxwxw.edu
FEATURES from to/span description
mRNA < 1 217 paramyosin mRNA
BASE COUNT 66 a 46 c 17 g 88 t
ORIGIN About 1 kb after segment 1.
1 cagaaatttg tttcctcctc tcatattttc tctttgttct ttttattcat ttcatttatt
61 gtattattaa tattgctatt attattattg ttattactac ctaaccgatg gtttcaacga
121 cagcaatctc ccatatttct acacacacac acacacaaca cacacaacac acaaaagtat
181 ctgtgcaatc gtaatagata atctttattt attgatt
//
LOCUS ECAPNL 420 bp ds-DNA BCT 25-JUL-1990
DEFINITION E.carotovora pectin lyase (PNL) gene, 5' end.
ACCESSION M35271
KEYWORDS pectin lyase.
SOURCE E.carotovora DNA, clone pTN2159.
ORGANISM Erwinia carotovora
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 420)
AUTHORS Nishida,T., Suzuki,T., Ito,K., Kamio,Y. and Izaki,K.
TITLE Cloning and expression of pectin lyase gene from Erwinia carotovora
in Escherichia coli
JOURNAL Biochem. Biophys. Res. Commun. 168, 801-808 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 284 > 420 pectin lyase (EC 4.2.2.10)
BASE COUNT 127 a 66 c 94 g 133 t
ORIGIN
1 cctatcagtc tgatgaagtt gaacaggctg cgaaccgtat ttttaatggc ggcgggtaaa
61 aggctggtga tgataatcgt agcgctgcca ttttactaaa agatggcggc gtattaattg
121 ggtattgaat tattcgcaag gttgtttttt tattaaactc gattaataag cgtaatgaaa
181 tcctttctat acaattttta attgtcggag gcgtattatt tagtctcaat taaataatac
241 gctggaagac attattattc actcattgta aaaaggaaaa cttatggctt atccaacaac
301 aaatcttact gggcttattg gttttgcaaa agcggcaaaa gttaccggag gaacgggcgg
361 taaagtcgtt acggtaaatt ctttggccga ttttaaatca gcggtgacgg ttccgcaaaa
//
LOCUS ECOUXEX 318 bp ds-DNA BCT 25-JUL-1990
DEFINITION E.coli uxaCT-exuT intercistronic region.
ACCESSION M35280
KEYWORDS catabolite receptor protein.
SOURCE E.coli (strain K-12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Blanco,C. and Mata-Gilsinger,M.
TITLE Identification of cyclic AMP-CRP binding sites in the
intercistronic regulatory uxaCA-exuT region of Escherichia coli
JOURNAL FEMS Microbiol. Lett. 33, 205-209 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
site 46 71 catabolite receptor protein binding site 1
site 165 193 catabolite receptor protein binding site 2
BASE COUNT 98 a 62 c 70 g 88 t
ORIGIN
1 gtcgacttat gatttgcgac ggcagaaaga taacttgtca tacaacttta aaaggtgaga
61 gccatcacaa atgtgggaat atttgtaggg acattacctg acgacagcaa ggccagtact
121 ggcgcggcct gcagcgagat ttaccacttt gagagtaatt tttttaacta cgtttattga
181 tctaactcac gaaaatatct tcggactctg gaaattggtg tgataacttt gtcagcatcg
241 caccataagc aagctagctc actcgttcga agaggaagac gaaaataact ccgtttatga
301 ctgaagatta tcctgtta
//
LOCUS HUMSYNIFA 144 bp ds-DNA SYN 25-JUL-1990
DEFINITION Human synthetic interferon alpha-2 gene, 3' end.
ACCESSION M35281
KEYWORDS interferon.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 144)
AUTHORS Rossi,J.J., Kierzek,R., Huang,T., Walker,P.A. and Itakura,K.
TITLE An alternate method for synthesis of double-stranded DNA segments
JOURNAL J. Biol. Chem. 257, 9226-9229 (1982)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 13 135 interferon alpha-2
BASE COUNT 37 a 32 c 32 g 43 t
ORIGIN
1 caagaattca tgatcactct gtacctgaag gaaaagaaat actctccgtg tgcttgggaa
61 gttgtacgtg ctgaaatcat gcgttctttc tccctgtcta ctaaccttca ggagtctctg
121 cgttctaaag aatagctgca gtgg
//
LOCUS RATMAL5 1104 bp ds-DNA ROD 25-JUL-1990
DEFINITION Rat malic enzyme (ME) gene, 5' end.
ACCESSION M35258 M21619
KEYWORDS malic enzyme.
SOURCE Rat (Sprague-Dawley, female) liver, clone lambda-g-ME-29.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1104)
AUTHORS Morioka,H., Tennyson,G.E. and Nikodem,V.M.
TITLE Structural and functional analysis of the rat malic enzyme gene
promoter
JOURNAL Mol. Cell. Biol. 8, 3542-3545 (1988)
STANDARD simple staff_review
REFERENCE 2 (bases 427 to 925; revises [1])
AUTHORS Petty,K.J., Desvergne,B., Mitsuhashi,T. and Nikodem,V.M.
TITLE Identification of a thyroid hormone response element in the malic
enzyme gene
JOURNAL J. Biol. Chem. 265, 7395-7400 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 918 > 1104 malic enzyme (EC 1.1.1.40)
mRNA 883 > 1104 malic enzyme mRNA
rpt 814 823 direct repeat
rpt 827 836 direct repeat
BASE COUNT 220 a 376 c 290 g 218 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcgcat agcccagaag ctatagctgt actgatgggc tcaagtaaaa taattagaaa
61 ttatttctca ggtatctagg caatatttaa cccccaaatt gttccgcagt gtctagatga
121 acaccataga atttggccgt gcgacttaac tgaaaagaaa gggctttgtt gtctgaaggc
181 tgcttggctg tattgttttg ttttaatcag acatccttgg gagacatagg atttatttct
241 ccagtccttg gatcttcaag tataaatatc aataatacaa ccactgggtt tcagtactgg
301 aagacctgtt attctgaccc tctgtcatca gagaagaaac catacatcat cttgcaaaaa
361 ttaacatctt ggtttccaga acgctcagga aaattgttct taagctcaat aggactggcc
421 actggacctg tgccctctaa cacctttttc ttaccacgtt cgaacacaat tccctcagat
481 actattcaga aacaggcgag gagtcgcccg ccctatcgcc cagtgccatc gaggcctggg
541 cattctgggt caaagttgat cccctcctgc atcaggcccc tggggcatgg ctggcatcca
601 ggacgttggg gttaggggag gacagtggac gagcggagga agcgaggcgg cccgcccctc
661 acccgtcggt gcccaggtcg cacgctcggc gctcaccagc ttggccggcg ccccgccccc
721 gcctcctcgc acggcggctc ggccgatgcc gccgtgactc agcgcttctc gcgggccgcc
781 cgcgcggccg cggctaggcc gggctcctcc cgcctcgcca ccccctctcg ccacccacgc
841 ccgcccccgg ccgcggggcc ttccgtcgca cggccgccgc ccgccgcact cccgtccgcc
901 ccgccacggt gctggccatg gatccccgag ccccccgccg ccgacacacc caccagcgcg
961 gctacctgct gacgcgggac ccgcatctca acaaggtgag ccccgccccg agagccgccc
1021 tgggcccgcc gctgggctcg ggcacccgcg tcccaccgag gggacggtcc cacccgggag
1081 gccactgcgg agccggcgcc aacg
//
LOCUS RATSPA 1595 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Rat serine pyruvate aminotransferase mRNA, complete cds.
ACCESSION M35270 X06357
KEYWORDS serine pyruvate aminotransferase.
SOURCE Rat (strain Wistar) liver, clones pRspt910,321].
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 63 to 1595)
AUTHORS Oda,T., Miyajima,H., Suzuki,Y. and Ichiyama,A.
TITLE Nucleotide sequence of the cDNA encoding the precursor for
mitochondrial serine:pyruvate aminotransferase of rat liver
JOURNAL Eur. J. Biochem. 168, 537-542 (1987)
STANDARD simple automatic
REFERENCE 2 (bases 1 to 198)
AUTHORS Oda,T., Funai,T. and Ichiyama,A.
TITLE Generation from a single gene of two mRNAs that encode the
mitochondrial and peroxisomal serine:pyruvate aminotransferase of
rat liver
JOURNAL J. Biol. Chem. 265, 7513-7519 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 109 1353 peroxisomal serine:pyruvate aminotransferase
precursor (EC 2.6.1.51; pSPT)
sigp 109 118 serine:pyruvate aminotransferase signal
peptide
matp 119 1350 serine:pyruvate aminotransferase
pept 175 1353 mitochondrial serine:pyruvate aminotransferase
(mSPT)
mRNA 61 > 1520 pSPT mRNA (alt.)
mRNA 62 > 1520 pSPT mRNA (alt.)
mRNA 127 > 1520 mSPT mRNA (alt.)
mRNA 129 > 1520 mSPT mRNA (alt.)
mRNA 130 > 1520 mSPT mRNA (alt.)
signal 1515 1520 polyA signal
BASE COUNT 376 a 437 c 455 g 327 t
ORIGIN
1 aggacaaaca tcgatcaggg tcaaattgac aataaaaggg ctggagcaag caacagggac
61 tcaccaacca ggcctcgcct ctgagttcag cccagagcta gctgggaaat gttccggatg
121 ttggccaagg ccagtgtgac gctgggctcc agggcagcaa gttgggtacg gaacatgggc
181 tcgcaccagc tgctggtgcc acccccagag gccctgagca agcccctgtc aattcctaag
241 aggctcctgt tgggtccggg accctccaac ctggctcctc gtgtgctagc agctggaagt
301 ctgaggatga ttggccacat gcaaaaagag atgtttcaga tcatggatga gatcaagcag
361 ggcatccagt atgtgttcca gaccaggaac cccctcacac tggttgtcag cggctcagga
421 cattgtgcca tggagactgc cctgttcaac ctcctggagc ctggggactc ctttcttgtg
481 ggaaccaatg gcatctgggg gatacgggct gcagagatcg ctgagcggat tggagcccgt
541 gtgcaccaga tgatcaagaa gcctggagaa cattacacac tgcaggaggt ggaggagggc
601 ctggctcagc ataaaccagt gttgctgttc ctgacccacg gggagtcatc cactggtgtg
661 ctgcagcccc tggatggttt cggggagctc tgccacaggt atcagtgcct actcctggtg
721 gactcggtgg catcattggg cggagtccct atctacatgg accaacaagg catcgacatc
781 ttgtactctg gctctcagaa ggtcctgaat gccccaccag ggatctccct catctccttc
841 aacgacaagg ccaaatccaa agtctactcc cggaagacaa agccagtctc cttctacaca
901 gacatcactt atttgtccaa gttgtggggc tgtgagggca agaccagagt aattcatcat
961 acgttgcctg tcatcagctt atactgcctg agggagagcc tagcactcat ttcagagcag
1021 ggcctggaga attcctggcg gcgtcacagg gaggctacag cacatctgca caagtgcctg
1081 cgggagttgg gcttaaagtt ctttgtgaag gacccggaaa tccggctacc tacaatcacc
1141 accgtgaccg tgcctgccgg ctacaactgg agggacatcg tcagctacgt gctggaccac
1201 ttcaacattg aaatctctgg tggtcttggg ccctctgagg ataaggtgct gcggattggc
1261 ctcctgggct acaacgccac cacagagaat gcggaccgtg tagcggaggc cctgagggag
1321 gccctgcaac attgtcctaa gaataaattg tgagcatcgt ctcaccagac tgtgccctcc
1381 tggaggggct gggaatatag caggaacgag aagactgtgc aagccctcca gccagcaaag
1441 gctgccgatg taaccaggcg ggaagggtca gggcgaagct gcccctctcc ccacagatgg
1501 agccctgtgg tcacatgatg ctaatcacct tccgatgaag ctgcattctg caggccactg
1561 gacttcggga atattcaata aagtacttgc cagac
//
LOCUS YSCCOX9A 180 bp ds-DNA PLN 25-JUL-1990
DEFINITION S.cerevisiae cytochrome c oxidase subunit VIIa (COX9) gene,
complete cds.
ACCESSION M35260
KEYWORDS cytochrome c oxidase.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Duhl,D.M., Powell,T. and Poyton,R.O.
TITLE Mitochondrial import of cytochrome c oxidase subunit VIIa in
Saccharomyces cerevisiae: Identification of sequences required for
mitochondrial localization in vivo
JOURNAL J. Biol. Chem. 265, 7273-7277 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 180 cytochrome c oxidase subunit VIIa
BASE COUNT 53 a 35 c 51 g 41 t
ORIGIN
1 atgactattg ctccaattac tggtacgatc aagagaagag tcatcatgga catcgtcctc
61 gggttctccc tcgggggtgt catggcctct tactggtggt ggggattcca catggataag
121 attaacaaga gagagaagtt ctacgcagag ctagctgaga ggaaaaagca agagaactga
//
LOCUS DROTNCOPIA 276 bp ds-DNA INV 25-JUL-1990
DEFINITION D.melanogaster transposable element copia DNA in omega-aLTR1.
ACCESSION M35053
KEYWORDS copia transposon; transposable element.
SOURCE D.melanogaster (strain w-a-1A) DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 276)
AUTHORS Zachar,Z., Davison,D., Garza,D. and Bingham,P.M.
TITLE A detailed developmental and structural study of the
transcriptional effects of insertion of the copia transposon into
the white locus of Drosophila melanogaster
JOURNAL Genetics 111, 495-515 (1985)
STANDARD simple staff_entry
BASE COUNT 100 a 42 c 34 g 100 t
ORIGIN
1 tgttggaata tactattcaa cctacaaaaa taacgttaaa caacactact ttatatttga
61 tatgaatggc cacacctttt atgccataaa acatattgta agagaatacc actcttttta
121 ttccttcttt ccttcttgta cgttttttgc tgtgagtagg tcgtggtgct ggtgttgcag
181 ttgaaataac ttaaaatata aatcataaaa ctcaaacata aacttgacta tttatttatt
241 tattaagaaa ggaaatataa attataaatt acaaca
//
LOCUS ECOMETBJI 82 bp ds-DNA BCT 25-JUL-1990
DEFINITION E.coli metB-metJ intercistronic DNA region.
ACCESSION M34899
KEYWORDS .
SOURCE E.coli (strain K12) DNA, clone pAA110.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 82)
AUTHORS Smith,A.A., Greene,R.C., Kirby,T.W. and Hindenach,B.R.
TITLE Isolation and characterization of the product of the methionine-
regulatory gene metJ of Escherichia coli K-12
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6104-6108 (1985)
STANDARD simple staff_entry
BASE COUNT 25 a 13 c 15 g 29 t
ORIGIN
1 tataatttta acggctattt gggatttgct catctatacg caaagaagtt tagatgtcca
61 gatgtattga cgtccattaa ca
//
LOCUS MNICPRRKA 103 bp ds-DNA RNA 25-JUL-1990
DEFINITION M.rugicum 4.5S ribosomal RNA.
ACCESSION M35056
KEYWORDS 4.5S ribosomal RNA.
SOURCE M.rugicum chloroplast DNA.
ORGANISM Chloroplast Mnium rugicum
Eukaryota; Plantae; Embryobionta; Bryophyta; Bryopsida; Bryidae;
Bryales; Mniaceae; Mnium rugicum.
REFERENCE 1 (bases 1 to 103)
AUTHORS Troitsky,A.V., Bobrova,V.K., Ponomarev,A.G. and Antonov,A.S.
TITLE The nucleotide sequence of chloroplast 4.5 S rRNA from Mnium
rugicum (Bryophyta): Mosses also posses this type of RNA
JOURNAL FEBS Lett. 176, 105-109 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 103 4.5S ribosomal RNA
BASE COUNT 33 a 20 c 28 g 22 t
ORIGIN
1 taaggtgacg gcaagactag ccgtttatca tcacgatagg tgccaagtgg aagtgcagta
61 atgtatgcag ctgaggcatc ctaacagacc gagagattta aac
//
LOCUS MUSCABLA 125 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse tyrosine kinase (c-abl) mRNA, 3' terminus.
ACCESSION M34905
KEYWORDS tyrosine kinase.
SOURCE Mouse (strain NIH Swiss) testis, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 125)
AUTHORS Meijer,D., Hermans,A., von Lindern,M., van Agthoven,T.,
de Klein,A., Mackenbach,P., Grootegoed,A., Talarico,D.,
Valle,G.D. and Grosveld,G.
TITLE Molecular characterization of the testis specific c-abl mRNA in
mouse
JOURNAL EMBO J. 6, 4041-4048 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
mRNA < 1 44 tyrosine kinase (c-abl) mRNA (alt.)
mRNA < 1 125 tyrosine kinase (c-abl) mRNA (alt.)
BASE COUNT 26 a 35 c 26 g 38 t
ORIGIN
1 gcttactgta cctgcacctt tgatgcttac aaactgtccc cgagagcctg tgctcactgt
61 gttttcattg gaaggaagct gcttactgta cctgcacctt tgatgcttac aaactgtccc
121 cgaga
//
LOCUS SOPMPDNA 111 bp ds-DNA SYN 25-JUL-1990
DEFINITION Synthetic ovalbumin pre-message selfprimer DNA.
ACCESSION M35058
KEYWORDS ovalbumin.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 111)
AUTHORS Oyama,F., Kikuchi,R. and Uchida,T.
TITLE A synthetic, partial pre-mRNA for ovalbumin primes its own
complementary DNA with reverse transcriptase
JOURNAL J. Biochem. 104, 403-408 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
site 3 3 cDNA start with primer
site 36 36 cDNA start without primer
site 69 84 primer-independent cDNA
BASE COUNT 44 a 15 c 20 g 32 t
ORIGIN
1 atcctggaag tttatcaaag cgaacaacct gtaattgaaa ataatagtag ctgaaataat
61 ggttatgaca aaaagaagtt atgcaatcca gtttcaagat ttctagctag t
//
LOCUS XELRRAA 121 bp ss-RNA RNA 25-JUL-1990
DEFINITION X.laevis 5S RNA.
ACCESSION M35055
KEYWORDS 5S ribosomal RNA.
SOURCE X.laevis kidney ribosomal RNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Brownlee,G.G., Cartwright,E., McShane,T. and Williamson,R.
TITLE The nucleotide sequence of somatic 5 S RNA from Xenopus laevis
JOURNAL FEBS Lett. 25, 8-12 (1972)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 121 5S ribosomal RNA
BASE COUNT 24 a 34 c 38 g 25 t
ORIGIN
1 gcctacggcc acaccaccct gaaagtgccc gatctcgtct gatctcggaa gccaagcagg
61 gtcgggcctg gttagtactt ggatgggaga ccgcctggga ataccaggtg tcgtaggctt
121 t
//
LOCUS YSCTRR2 76 bp ss-tRNA RNA 25-JUL-1990
DEFINITION Yeast (S.cerevisiae, Brewer's) Arg-tRNA-II.
ACCESSION K00157 M34900
KEYWORDS transfer RNA; transfer RNA-Arg.
SOURCE Yeast (Saccharomyces cerevisiae, Brewer's) tRNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Weissenbach,J., Martin,R. and Dirheimer,G.
TITLE Nucleotide sequence of tRNA-Arg-II from Brewer's yeast
JOURNAL FEBS Lett. 28, 353-355 (1972)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 76)
AUTHORS Weissenbach,J., Martin,R. and Dirheimer,G.
TITLE The primary structure of Arg-tRNA-II from brewer's yeast: Partial
digestion with ribonuclease T-1 and derivation of the complete
sequence
JOURNAL Eur. J. Biochem. 56, 527-532 (1975)
STANDARD full staff_review
COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their
entry 0130 in Nucleic Acids Res. 11, r1-r54 (1983).
FEATURES from to/span description
tRNA 1 76 Arg-tRNA-II (NAR: 0130)
anticdn 34 36 Arg-tRNA-II anticodon gcg
modified 1 1 f = pseudouridine
modified 9 9 m1g = 1-methylguanosine
modified 10 10 m2g = 2-methylguanosine
modified 16 16 d = dihydrouridine
modified 19 19 d = dihydrouridine
modified 26 26 m22g = 2,2-dimethylguanosine
modified 27 27 f = pseudouridine
modified 34 34 i = inosine
modified 47 47 d = dihydrouridine
modified 49 49 m5c = 5-methylcytidine
modified 54 54 t = 5-methyluridine
modified 55 55 f = pseudouridine
modified 58 58 m1a = 1-methyladenosine
BASE COUNT 15 a 22 c 24 g 15 t
ORIGIN 5' end of mature tRNA.
1 ttcctcgtgg cccaatggtc acggcgtctg gctgcgaacc agaagattcc aggttcaagt
61 cctggcgggg aagcca
//
LOCUS YSCTRT1A 76 bp ss-tRNA RNA 25-JUL-1990
DEFINITION Yeast (S.cerevisiae, brewer's) Thr-tRNA-1a.
ACCESSION K00278 M34898
KEYWORDS transfer RNA; transfer RNA-Thr.
SOURCE Yeast (Saccharomyces cerevisiae, brewer's) tRNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Weissenbach,J., Kirarly,I. and Dirheimer,G.
TITLE The nucleotide sequences of two threonine tRNAs from Brewer's yeast
JOURNAL FEBS Lett. 71, 6-8 (1976)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 76)
AUTHORS Weissenbach,J., Kiraly,I. and Dirheimer,G.
TITLE Structure primaire des Thr-tRNA-1a-et-b de levure de biere
JOURNAL Biochimie 59, 381-391 (1977)
STANDARD full staff_review
COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their
entry 1760 in Nucleic Acids Res. 11, r1-r54 (1983). Brewer's yeast
Thr-tRNA-1 is 50% Thr-tRNA-1a and 50% Thr-tRNA-1b [1].
FEATURES from to/span description
tRNA 1 76 Thr-tRNA-1a (NAR: 1760)
anticdn 34 36 Thr-tRNA-1a anticodon ggt
modified 10 10 m2g
modified 16 16 d
modified 17 17 d
modified 20 20 d
modified 26 26 m22g
modified 32 32 m3c
modified 34 34 i
modified 37 37 t6a
modified 39 39 f
modified 47 47 d
modified 48 48 m5c
modified 54 54 t
modified 55 55 f
modified 58 58 m1a
BASE COUNT 20 a 17 c 21 g 18 t
ORIGIN 5' end of mature tRNA.
1 gcttctatgg ccaagttggt aaggcgccac actggtaatg tggagatcat cggttcaaat
61 ccgattggaa gcacca
//
LOCUS YSCTRT1B 76 bp ss-tRNA RNA 25-JUL-1990
DEFINITION Yeast (S.cerevisiae, brewer's) Thr-tRNA-1b.
ACCESSION K00279 M34898
KEYWORDS transfer RNA; transfer RNA-Thr.
SOURCE Yeast (Saccharomyces cerevisiae, brewer's) tRNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Weissenbach,J., Kirarly,I. and Dirheimer,G.
TITLE The nucleotide sequences of two threonine tRNAs from Brewer's yeast
JOURNAL FEBS Lett. 71, 6-8 (1976)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 76)
AUTHORS Weissenbach,J., Kiraly,I. and Dirheimer,G.
TITLE Structure primaire des Thr-tRNA-1a-et-b de levure de biere
JOURNAL Biochimie 59, 381-391 (1977)
STANDARD full staff_review
COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their
entry 1760 in Nucleic Acids Res. 11, r1-r54 (1983). Brewer's yeast
Thr-tRNA-1 is 50% Thr-tRNA-1b and 50% Thr-tRNA-1a [1].
FEATURES from to/span description
tRNA 1 76 Thr-tRNA-1b (NAR: 1760)
modified 10 10 m2g
modified 16 16 d
modified 17 17 d
modified 20 20 d
modified 26 26 m22g
modified 32 32 m3c
modified 34 34 i
anticdn 34 36 Thr-tRNA-1b anticodon ggt
modified 37 37 t6a
modified 39 39 f
modified 47 47 d
modified 48 48 m5c
modified 54 54 t
modified 55 55 f
modified 58 58 m1a
BASE COUNT 19 a 18 c 22 g 17 t
ORIGIN 5' end of mature tRNA.
1 gcttctatgg ccaagttggt aaggcgccac actggtaatg tggagatcgt cggttcaaat
61 ccgactggaa gcacca
//
LOCUS BSTGLGBA 2735 bp ds-DNA BCT 25-JUL-1990
DEFINITION B.stearothermophilus branching enzyme (glgB) gene, complete cds.
ACCESSION M35089
KEYWORDS branching enzyme.
SOURCE B.stearothermophilus (strain 1503-4R, variant 4) DNA, clone pKVS1.
ORGANISM Bacillus stearothermophilus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2735)
AUTHORS Kiel,J.A.K.W., Boels,J.M., Beldman,G. and Venema,G.
TITLE Molecular cloning and nucleotide sequence of the branching enzyme
gene (glgB) from Bacillus stearothermophilus, expression in E.coli
and B.subtilis
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.K.W.Kiel, 12-JUN-1990.
Dept of Genetics
Center of Biological Sciences
Kerklaan 30, NL 9751 NN
Haren, THE NETHERLANDS
FEATURES from to/span description
pept 522 2441 branching enzyme (glgB) (EC 2.4.1.18)
pept 325 < 1 (c) unidentified ORF2
binding 337 330 (c) ORF2 ribosomal binding site (put.)
binding 504 516 glgB ribosomal binding site (put.)
signal 370 365 (c) ORF2 -10 region (put.)
signal 394 389 (c) ORF2 -35 region (put.)
signal 446 458 glgB -35 region (put.)
signal 469 479 glgB -10 region (put.)
BASE COUNT 835 a 492 c 634 g 774 t
ORIGIN
1 gaattccaat ggaaataatg gctaacgtaa ggccgtttaa aaaggacgta ataatttcaa
61 agcgcaaata accgaatgta aatcgatgat ttggcggacg catggcaaga taaagagcga
121 tcatgctaag cccaagcgcc aatacgtcag atgccatatg ggcagagtcg gaaagcaaag
181 ctaaggaatt ggataatagc cccccaacaa tttccacaat cgtaaaaaac aatgttaaaa
241 cgagagtgat ccaaagcgtt tttttcgatt gattttgcgt ttttacatga ggaagatggt
301 gataatcgta ttgaattggt gacatgacac acctcttatt tagaattatt tttaatttat
361 atacattata atatagtttt ttataattgt gcaaaaaaat tttttgttta tttatcgaaa
421 aatgtaaaaa aaatacaatt tttttatcaa ggaatttatg gaatcgctgt ggaatataag
481 taacaacggt aagaaacttt aaggaaagga tgcgatacag attgatcgcc gtcggtccca
541 ctgatttaga aatctattta tttcatgaag gcagcttata taaaagttat gaattgtttg
601 gtgcacatgt gataaagaaa aatggcatgg tcggaacccg gttttgtgta tgggcacccc
661 atgcgcggga agtgcgatta gtcggcagtt ttaatgaatg gaacggaact aattttaacc
721 ttatgaaagt aagtaatcaa ggcgtatgga tgatttttat tcctgaaaac ttagaagggc
781 atttatataa atacgaaatt acgacgaacg atgggaatgt tctgttaaaa tcggatccat
841 acgcgtttta ctccgagttg cgtccccata ctgcttccat tgtctacaac ataaaaggat
901 atcaatggaa tgaccagaca tggcgacgga agaaacagcg aaagcgaatt tatgaccagc
961 ctttgttcat ttatgaactt cactttggtt cgtggaaaaa gaaagaggac ggcagttttt
1021 atacatatca agagatggca gaggagctaa tcccttatgt tctcgaacat gggtttactc
1081 atattgagct gctcccactc gtcgagcatc cgttcgatcg ttcttgggga tatcagggaa
1141 taggttatta ttcagcaaca agccgctacg gaacaccgca tgatttgatg tattttattg
1201 accgctgtca ccaagctgga ataggcgtca ttctcgattg ggttcctggc cacttttgta
1261 aagattccca tgggttatat atgtttgatg gcgcaccggc atatgaatat gccaacatgc
1321 aagaccggga aaattacgta tggggaacgg caaactttga ccttggcaag ccggaagtcc
1381 gcagcttttt gatttccaat gcgttatttt ggatggaata tttccatgtg gacgggtttc
1441 gtgtagatgc tgttgccaat atgttatatt ggccaaacag cgacgtacta tacaaaaata
1501 cgtatgccgt ggagttcttg caaaaattaa atgaaacggt attcgcctat gatccgaaca
1561 tattaatgat tgccgaagat tcgacagact ggccgcgcgt cactgctcca acatacgacg
1621 gaggattagg atttaactat aaatggaaca tgggatggat gaacgatatt ttaacttata
1681 tggaaacgcc gcctgaacat cgaaaatacg tgcacaataa agtaacattt tccctcttgt
1741 atgcgtattc ggaaaatttc attttacctt tttcccatga cgaggtcgta catggaaaaa
1801 aatcgctgtt aagtaaaatg ccggggacat atgaggaaaa gtttgcgcaa ttaaggttgc
1861 tgtatggata tttgttgacg catcctggta agaaattatt gtttatgggc ggcgaatttg
1921 gccagtttga tgaatggaaa gatttagagc agctggattg gatgcttttt gattttgata
1981 tgcatcggaa tatgaatatg tatgtgaaag aattgttgaa atgttataag cgctataaac
2041 cgctttatga gttagaccac tctccagatg gattcgagtg gattgatgtt cataacgccg
2101 aacaaagtat tttctcattc attcgcagag gaaaaaaaga ggatgatttg cttattgttg
2161 tgtgtaattt cacaaataaa gtataccacg gttataaagt tggtgttccg ttatttacaa
2221 gatatcggga agtaatcaat agcgatgcaa tccaattcgg cggctttggg aatatcaatc
2281 caaaaccgat tgcggcgatg gaagggccgt ttcacggaaa gccatatcat attcagatga
2341 cgatcccgcc gtttggcatt tctattttaa gaccagtaaa aaaaggtagc gtcaaaagtt
2401 ttatgaaaac tccacatccg ccatcccatg gagcatcgta aggcatcctt ggagccggat
2461 tcgcccttga ccaacacccg ccaaaggtgt gaaagggacg tcaagggcga cggggacaaa
2521 aaagagggca taggaaagcc gcccttgccc ttaccgaatt ttacctttga cgaggttcgg
2581 ttggtcaagg gttcgcttcg ccgaatccgg ctgttcttct gatccatggg ctccggcgga
2641 caaaaaagtt aggctgcctc ttgttggagg aaatcttgag ccatggcgat cagcttcgtc
2701 caccgggccg gcatatgggg cagatcggcg agctc
//
LOCUS HUMETMAGA 3343 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human secreted epithelial tumor mucin antigen (H23Ag) gene,
complete cds.
ACCESSION M35093
KEYWORDS cell surface antigen; tumor mucin antigen.
SOURCE Human breast tumor cell line MCF7 DNA, clone lambda-gtWES.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3343)
AUTHORS Tsarfaty,I., Hareuveni,M., Horev,J., Zaretsky,J., Weiss,M.,
Jeltsch,J.M., Garnier,J.M., Lathe,R., Keydar,I. and Wreschner,D.H.
TITLE Isolation and characterization of an expressed hypervariable gene
coding for a breast cancer associated antigen
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.Tsarfaty, 12-JUN-1990.
FEATURES from to/span description
pept 785 842 secreted epithelial tumor mucin antigen
precursor, exon 1 (H23Ag)
1342 2207 secreted epithelial tumor mucin antigen
precursor, exon 2 (H23Ag)
sigp 785 805 secreted epithelial tumor mucin antigen signal
peptide
matp 806 842 secreted epithelial tumor mucin antigen
1342 2207 secreted epithelial tumor mucin antigen
pre-msg 777 > 842 H23Ag mRNA and introns
IVS 843 1341 H23Ag intron A
signal 384 397 H23Ag ERE
signal 633 644 H23Ag CACCT motifs
signal 689 692 H23Ag TATA box
site 1063 1090 put. enhancer
rpt 1670 1729 repeat unit
BASE COUNT 679 a 986 c 981 g 697 t
ORIGIN Chromosome 1q21-q24.
1 gagctcctgg ccagtggtgg agagtggcaa ggaaggaccc tagggttcat cggagcccag
61 gtttactccc ttaagtggaa atttcttccc ccactcccct ccttggcttt ctccaaggag
121 ggaaccccag gctgctggaa agtccggctg gggcggggac tgtgggtttc agggtagaac
181 tgcgtgtgga acgggacagg gagcggttag aagggtgggg ctattccggg aagtggtggt
241 ggggggaggg agcccaaaac tagcacctag tccactcatt atccagccct cttatttctc
301 ggccgcctct gcttcagtgg acccggggag ggcggggaag tggagtggga gacctagggg
361 tgggcttccc gaccttgctg tacaggacct cgacctagct ggctttgttc cccatcccca
421 gttagttgtt gccctgaggc taaaactaga gcccaggggc cccaagttcc agactgcccc
481 tcccccctcc cccggagcca gggagtggtt ggtgaaaggg ggaggccagc tggagaagaa
541 acgggtagtc aggggttgca gcattagagc ccttgtagcc ctagcccagg aatggttgga
601 gagagaagag tagagtaggg aggggggttt gtcacctgtc acctgctcgg ctgtgcctag
661 ggcgggcggg ggggagtggg gggaccggta taaagcggta ggcgcctgtg cccgctccac
721 ctctcaagca gccagcgcct gcctgaatct gttctgcccc ctccccaccc atttcaccac
781 caccatgaca ccgggcaccc agtctccttt cttcctgctg ctgctcctca cagtgcttac
841 aggtgagggg cacgaggtgg ggagtgggct gccctgctta ggtggtcttc gtggtctttc
901 tgtgggtttt gctccctggc agatggcacc agaagttaag gtaagaattg cagacagagg
961 ctgccctgtc tgtgccagaa ggagggagag gctaaggaca ggctgagaag agttgccccc
1021 aaccctgaga gtgggtacca ggggcaagca aatgtcctgt agagaagtct agggggaaga
1081 gagtagggag agggaaggct taagagggga agaaatgcag gggccatgag ccaaggccta
1141 tgggcagaga gaaggaggct gctgcaggaa ggaggcggcc aacccagggg ttactgaggc
1201 tgcccactcc ccagtcctcc tggtattatt tctctggtgg ccaggcttat attttcttct
1261 tgctcttatt tttccttcat aaagacccaa ccctatgact ttaacttctt acagctacca
1321 cagcccctgg gcccgcaaca gttgttacag gttctggtca tgcaagctct accccaggtg
1381 gagaaaagga gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg
1441 ctgtgagtat gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca
1501 ctcagggaca ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca
1561 cctggggaca ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc
1621 cgccagccca cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc
1681 ccccagccca gggtgtcacc tcggccccgg agaccaggcc gcccccgggc tccaccgccc
1741 ccccagccca tggtgtcacc tcggcgccgg acaacaggcc cgccttggcg tccaccgccc
1801 ctccagtcca caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg
1861 tgcacaacgg cacctctgcc agggctacca caaccccagc cagcaagagc actccattct
1921 caattcccag ccaccactct gatactccta ccacccttgc cagccatagc accaagactg
1981 atgccagtag cactcaccat agcacggtac ctcctctcac ctcctccaat cacagcactt
2041 ctccccagtt gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc
2101 agtttaattc ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca
2161 tttctgaaat ggtgagtatc ggcctttcct tccccatgct cccctgaagc agccatcaga
2221 actgtccaca ccctttgcat caagcctgag tcctttccct ctcaccccag tttttgcaga
2281 tttataaaca agggggtttt ctgggcctct ccaatattaa gttcaggtac agttctgggt
2341 gtggacccag tgtggtggtt ggaggggtgg gtggtggtca tgagccgtag ggagggactg
2401 gtgcacttaa ggttggggga agagtgctga gccagagctg ggacccgtgg ctgaagtgcc
2461 catttccctg tgaccaggcc aggatctgtg gtggtacaat tgactctggc cttccgagaa
2521 ggtaccatca atgtccacga cgtggagaca cagttcaatc agtataaaac ggaagcagcc
2581 tctcgatata acctgacgat ctcaagacgt cagcggtgag gctacttccc tgctgcagcc
2641 agcaccatgc cggggcccct ctccttccag tgtctgggtc cccgctcttt ccttagtgct
2701 ggcagcggga ggggcgcctc ctctgggaga ctgccctgac cactgctttt ccttttagtg
2761 agtgatgtgc catttccttt ctctgaccag tctggggctg gggtgccagg ctggggcatc
2821 gcgctgctgg tgctggtctg tgttctggtt gcgctggcca ttgtctatct cattgccttg
2881 gtgagtgcag tccctggccc tgatcagagc cccccggtag aaggcactcc atggcctgcc
2941 ataacctcct atctccccag gctgtctgtc agtgccgccg aaagaactac gggcagctgg
3001 acatctttcc agcccgggat acctaccatc ctatgagcga gtaccccacc taccacaccc
3061 atgggcgcta tgtgccccta gcagtaccga tcgtagcccc tatgagaagg tgagattggg
3121 ccccacaggc aggggaagca gagggtttgg ctgggcaagg attctgaagg gggtacttgg
3181 aaaacccaaa gagcttggaa gaggtgagaa gtggcgtgaa gtgagcaggg gagggctggc
3241 aaggatgagg ggcagaggtc agaggagttt tgggggacag gcctgggagg agactatgga
3301 agaaaggggc ccctcaaaag ggagtgcccc actgccagaa ttc
//
LOCUS MPMVPIA 1155 bp ds-DNA VRL 25-JUL-1990
DEFINITION Mouse polyomavirus major structural protein (VP1) gene, complete
cds.
ACCESSION M34958
KEYWORDS major structural protein.
SOURCE Mouse polyomavirus (strain RA) DNA.
ORGANISM Mouse polyomavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Polyomaviruses.
REFERENCE 1 (bases 1 to 1155)
AUTHORS Freund,R., Garcea,R.L., Sahli,R. and Benjamin,T.L.
TITLE A specific amino acid substitution in polyoma virus VP1 correlates
with plaque size and hemagglutination behavior
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Freund, 08-JUN-1990.
Author address: R.Freund
Bldg C2 RM 129A
Dept of Pathology
Harvard Medical School
200 Longwood Avenue
Boston, MA 02115
FEATURES from to/span description
pept 1 1155 VPI protein (VPI)
BASE COUNT 367 a 274 c 285 g 229 t
ORIGIN
1 atggccccca aaagaaaaag cggcgtctct aaatgcgaga caaaatgtac aaaggcctgt
61 ccaagacccg cacccgttcc caaactgctt attaaagggg gtatggaggt gctggacctt
121 gtgacagggc cagacagtgt gacagaaata gaagcttttc tgaaccccag aatggggcag
181 ccacccaccc ctgaaagcct aacagaggga gggcaatact atggttggag cagagggatt
241 aatttggcta catcagatac agaggattcc ccaggaaata atacacttcc cacatggagt
301 atggcaaagc tccagcttcc catgctcaat gaggacctca cctgtgacac cctacaaatg
361 tgggaggcag tctcagtgaa aaccgaggtg gtgggctctg gctcactgtt agatgtgcat
421 gggttcaaca aacccacaga tacagtaaac acaaaaggaa tttccactcc agtggaaggc
481 agccaatatc atgtgtttgc tgtgggcggg gaaccgcttg acctccaggg acttgtgaca
541 gatgccagaa caaaatacaa ggaagaaggg gtagtaacaa tcaaaacaat cacaaagaag
601 gacatggtca acaaagacca agtcctgaat ccaattagca aggccaagct ggataaggac
661 ggaatgtatc cagttgaaat ctggcatcca gatccagcaa aaaatgagaa cacaaggtac
721 tttggcaatt acactggagg cacaacaact ccacccgtcc tgcagttcac aaacaccctg
781 acaactgtgc tcctagatga aaatggagtt gggcccctct gtaaaggaga gggcctatac
841 ctctcctgtg tagatataat gggctggaga gttacaagaa actatgatgt ccatcactgg
901 agagggcttc ccagatattt caaaatcacc ctgagaaaaa gatgggtcaa aaatccctat
961 cccatggcct ccctcataag ttcccttttc aacaacatgc tcccccaagt gcagggccaa
1021 cccatggaag gggagaacac ccaggtagag gaggttagag tgtatgatgg gactgaacct
1081 gtaccggggg accctgatat gacgcgctat gttgaccgct ttggaaaaac aaagactgta
1141 tttcctggaa attaa
//
LOCUS MYCP115A 3082 bp ss-mRNA BCT 25-JUL-1990
DEFINITION M.hyorhinis 115 kDa protein (p115) gene, complete cds.
ACCESSION M34956
KEYWORDS .
SOURCE M.hyorhinis (strain GDL) DNA, clone MhrG27.
ORGANISM Mycoplasma hyorhinis
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 3082)
AUTHORS Notarnicola,S.M., McIntoch,M.A. and Wise,K.S.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.S.Wise, 08-JUN-1990.
University of Missouri-Columbia
Dept of Mol Microbiol and Immunol
School of Medicine-M653
Columbia, MO 65212
FEATURES from to/span description
pept 70 3009 115 kDa protein
BASE COUNT 1324 a 410 c 443 g 905 t
ORIGIN
1 gaattctttt ttaataattt ttttacttta aaattctagt taaaactcta caaaaaaaca
61 aggacaaata tgttaaagct tattaaaatt gaaatcgaag gttttaaatc gttcgccgat
121 ccgatcagca taaatttcga tggttctgtt gtaggaatag ttggaccaaa tggttcagga
181 aaatctaata ttaatgacgc aattagatga gtattaggtg aacaatcagc aaaacaactt
241 cgtggactaa atatggatga tgttatcttt gcaggttcca aaactgtcaa acctcaagaa
301 aaagcaatgg taaaattaac cttcaaaaat gaagatgcaa ttgaagaaac gaaacaaatt
361 tttactattt ctcgtttact taaaagaggt caaggaacta atgaatattt ttacaatgat
421 caacctgtta gatataaaga tattaaaaat ttagctgttg aatctggaat ttctaaatct
481 tcacttgcaa ttatttccca aggtactata tctgaaattg cagaagcaac gcctgaacaa
541 agaaaagcag ttattgaaga agctgctgga acttcaaaat acaaattaga caaagaagaa
601 gcacaaaaga aacttattag aacaaatgat gcaattgata aattacaagg tgcaatcaaa
661 gagttagaac gtcaagtaaa ctcgcttgat aaacaagctt ctaaagcaaa aatttattta
721 gaaaaaagta aagctcttga atcagttgaa gtaggtttaa ttgttaatga tctaaacttt
781 ttcaatgaaa aattaaataa tttaaatact tcactattag aagtagaaca acaaagaaat
841 gatcttgaac tcaacattca aacttatgaa tccagtattt cacaaactgt tcattttaaa
901 acagaagttg aatcttcaat ccaagaaatt acttcaaaat tagacaattt aaaaaacgca
961 ctttccgaaa tcaaccttca agaagctaga attgaagaac gtagaaaatt aattatcagt
1021 ggtgaaattg tagttgatca aaaaacaaaa attgaagaaa ttaaaaaaca agttgaatca
1081 ctcaaaatac aaataaatgc ttcaaaacaa agagaaattg aactagacca acaacttaca
1141 agactaaatg caaaagctaa ttctttaaaa ttgcaagaaa atgatattaa taaagaaatt
1201 ggtgtattac ttgaaaaaaa atcagctgct gcagcaaata ttaatatatt aaaacaacaa
1261 tttgaaaata aaagttttct ttctaaagga attaaaacta ttaaagataa ctcattttta
1321 tttgatggtt acattggatt agcttctgaa ttatttaaag tagaatccga atttagttta
1381 gcaattgaaa ctgttttagg tgctgcttta aatcaaatag taatgaaaac atctgaagat
1441 gtacttcaag ctattgactt tttaaagaaa aatctttcag gtaaagcaac ttttattcct
1501 ttaacatcta ttaaagaaag agaagtaaga gaagatcatt tacttgtttt aaaaggacaa
1561 aaaggatttt taggtgttgc aaaagaacta attgaatttg atactcaatt taacaaactc
1621 tttggatttt tacttggaaa catcttagtg gttgataatg tagacaatgc aaatagaata
1681 gctaaaatat tagatcataa atacactata gtttctttag aaggtgattt attcagacca
1741 ggcggaacca ttactggagg ttcaaaacta gaaagaactt ctattttaaa ttacgatatc
1801 aaaataaaag aacacacaaa tacacttaaa tttgctgaag atcaaattca tgatttaaaa
1861 attaaacagc aaacaatata taacgaaatt gaaacagtca attcaacaat ccaacaagta
1921 aaaattgaag ctaattcaat aaattcaaaa cttaatatct taaacgaaga attaaataac
1981 ttaaaactaa acgcaagcga aattttcaaa gaacaacaag aagaccaaga gagtttaaat
2041 ttaagttttg attctgaaaa attgaacata gaaaaacaaa tttctactct aacaattgaa
2101 ttaaattcta aaaaagatcg actaacaaat ttaattagtg agcaaggaaa aggagaaacc
2161 aagaaacaag aattagatgc caaactaaga aaattaaaca ctcaacactc agatagtatc
2221 actgaacaaa acagagcaaa attcttggta gagcaaaatc aaaaaagact ttctgagcac
2281 tacaaattaa ctttagaagc tgctagtgaa caatattctt tagatttaga cattgaacaa
2341 gcaagacatt ttgttgatag ccttaaaaaa gagttaaaag aattaggaaa cgttaattta
2401 gaagcaatta ctgaatttga agaagtaaat caacgttacc aagagaaaaa acaatacatc
2461 gaagaactaa ccactgctaa atccaaaatt gaagaagcaa tttctgattt agataaaatt
2521 attatcaata aaacaacaga aattgttaac ttagtaaata atgaatttaa tatggtattt
2581 caaaaaatgt ttggtggtgg aaaagcagaa attcacttca cagacaaaaa tgatatttta
2641 aattctggtg ttgaaatatc tgcacaacca cctggtaaaa caattaaaaa cttacgactt
2701 ttttcaggtg gagaaaaagc tattattgca atttcacttc tttttgctat tttaaaagca
2761 agaccaattc cattgtgtat tttagacgaa gttgaagctg cacttgatga atctaatgtt
2821 attcgttatg tagaattttt aaaattacta aaagaaaata ctcaattctt aattattact
2881 caccgttcag gaacaatgtc aagagtagat cagttacttg gagttactat gcaaaaacgt
2941 ggagttactt ccattttctc agttgaacta agcaaagcaa aagagatgct aaaagacgaa
3001 ttaaaataat acaaataaaa ataaaaaaaa cagaagtttg aagtgaggtg ataccctttt
3061 cttgaaaaaa ttttttgagt gt
//
LOCUS PPHVLCRA 314 bp ds-DNA VRL 25-JUL-1990
DEFINITION Human papillomavirus type 6 long control region DNA.
ACCESSION M35091
KEYWORDS .
SOURCE Human papillomavirus type 6 (patient specimen X020) DNA.
ORGANISM Human papillomavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Papillomavirus.
REFERENCE 1 (bases 1 to 314)
AUTHORS Hrisomalos,T.F., Boggs,D.L. and Fife,K.H.
TITLE The human papillomavirus type 6 long control region and human
cellular DNA contain related sequences
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.H.Fife, 12-JUN-1990.
AUTHOR address: K.H.Fife
Emerson Hall 435
Indiana University School of Medicine
545 Barnhill Dr.
Indianapolis, IN 46202-5124
FEATURES from to/span description
pept < 1 21 L1 open reading frame (AA at 1)
signal 231 236 polyA signal
site 81 175 insert (as compared to prototype sequence)
site 245 259 insert (as compared to prototype sequence)
BASE COUNT 75 a 29 c 72 g 138 t
ORIGIN Mapped between nucleotides 7271 to 7476.
1 cgcgccaaaa ccaaaaggta atatatgtgt atatgtactg ttatatatat gtgtgtatgt
61 actgttatgt atatgtgttt atgtactgtt atatgtatgt gtgttgtata tatgtgtgta
121 tatatgtgta tgtgtgtata tgtatatgta tgtgttgtgt atatatatgt gtgtgtgtgt
181 tatgtgtgta atgtaattta tttgtgtaat gtgtatgtgt gtttatgtgc aataaacaat
241 taactacatt attgtatatc ttgttacacc ctgtgactca gtggctgttg cacgcgtttt
301 ggtttgcacg cgcc
//
LOCUS PPHVLCRB 300 bp ds-DNA VRL 25-JUL-1990
DEFINITION Human papillomavirus type 6 long control region DNA.
ACCESSION M35092
KEYWORDS .
SOURCE Human papillomavirus type 6 (patient specimen X019) DNA.
ORGANISM Human papillomavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Papillomavirus.
REFERENCE 1 (bases 1 to 300)
AUTHORS Hrisomalos,T.F., Boggs,D.L. and Fife,K.H.
TITLE The human papillomavirus type 6 long control region and human
cellular DNA contain related sequences
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.H.Fife, 12-JUN-1990.
Emerson Hall 435
Indiana Univ School of Medicine
545 Barnhill Dr.
Indianapolis, IN 46202-5124
FEATURES from to/span description
pept < 1 21 L1 open reading frame (AA at 1)
signal 231 236 polyA signal
site 81 175 insert (as compared to prototype sequence)
BASE COUNT 68 a 29 c 72 g 131 t
ORIGIN Mapped between nucleotides 7271 to 7476.
1 cgcgccaaaa ctaaaaggta atatatgtgt atatgtactg ttatatatat gtgtgtatgt
61 actgttatgt atatgtgtgt atgtactgtt atatgtatgt gtgttgtata tatgtgtgta
121 tatatgtgta tgtgtgtata tgtatatgta tgtgttgtgt atatatatgt gtgtgtgtgt
181 tctgtgtgta atgtaattta tttgtgtaat gtgtatgtgt gtttatgtgc aataaacaat
241 tacctcttgt tacaccctgt gactcagtgg ctgttgcacg cgttttggtt tgcacgcgcc
//
LOCUS TRHTCSA 1010 bp ss-mRNA PLN 25-JUL-1990
DEFINITION T.kirilowii trichosanthin (TCS) mRNA, complete cds.
ACCESSION M34858
KEYWORDS ribosome inactivating protein; trichosanthin.
SOURCE T.kirilowii maximowicz, cDNA to mRNA.
ORGANISM Trichosanthes kirilowii
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Violales; Cucurbitaceae.
REFERENCE 1 (bases 1 to 1010)
AUTHORS Shaw,P.-C., Yung,M.-H., Zhu,R.-H., Ho,W.K.-K., Ng,T.-B. and
Yeung,H.-W.
TITLE Molecular cloning of trichosanthin cDNA and its expression in
Escherichia coli
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.-C.Shaw, 06-JUN-1990.
Author address: P.-C.Shaw
Department of Biochemistry
Chinese University of Hong Kong
Shatin, NT, HONG KONG
FEATURES from to/span description
pept 10 879 trichosanthin precursor
sigp 10 78 trichosanthin signal peptide
matp 79 819 trichosanthin
variant 196 196 t in wild type; a in allele
variant 197 197 c in wild type; g in allele
variant 468 468 t in wild type; a in allele
BASE COUNT 290 a 218 c 205 g 297 t
ORIGIN
1 gtcaaaaaga tgatcagatt cttagtcctc tctttgctaa ttctcaccct cttcctaaca
61 actcctgctg tggagggcga tgttagcttc cgtttatcag gtgcaacaag cagttcctat
121 ggagttttca tttcaaatct gagaaaagct cttccaaatg aaaggaaact gtacgatatc
181 cctctgttac gttcctctct tccaggttct caacgctacg cattgatcca tctcacaaat
241 tacgccgatg aaaccatttc agtggccata gacgtaacga acgtctatat tatgggatat
301 cgcgctggcg atacatccta ttttttcaac gaggcttctg caacagaagc tgcaaaatat
361 gtattcaaag acgctatgcg aaaagttacg cttccatatt ctggcaatta cgaaaggctt
421 caaactgctg caggcaaaat aagggaaaat attccgcttg gactccctgc tttggacagt
481 gccattacca ctttgtttta ctacaacgcc aattctgctg cgtcggcact tatggtactc
541 attcagtcga cgtctgaggc tgcgaggtat aaatttattg agcaacaaat tgggaagcgt
601 gttgacaaaa ccttcctacc aagtttagca attataagtt tggaaaatag ttggtctgct
661 ctctccaagc aaattcagat agcgagtact aataatggac agtttgaaag tcctgttgtg
721 cttataaatg ctcaaaacca acgagtcacg ataaccaatg ttgatgctgg agttgtaacc
781 tccaacatcg cgttgctgct gaatagaaac aatatggcag ccatggatga cgatgttcct
841 atgacacaga gctttggatg tggaagttat gctatttagt gtaacttcaa gctacgtacg
901 agtacaaact cccacttgaa gaatctatta tcgtttgaga gtttaatcta cttgtagaaa
961 taataaagca tgttcgtgtg accgacctac gtggatgctc tgtatgtgtg
//
LOCUS CIBABI 1989 bp ds-DNA BCT 25-JUL-1990
DEFINITION Plasmid ColIB abortive infection protein (abi) gene, complete cds.
ACCESSION J03314
KEYWORDS abortive infection protein.
SOURCE Plasmid ColIB DNA, clone pTP64, isolated from E.coli K-12 strain
W3110.
ORGANISM Plasmid Colicin Ib
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1989)
AUTHORS Gupta,S.K. and McCorquodale,D.J.
TITLE Nucleotide sequence of a DNA fragment that contains the Abi gene of
the ColIb plasmid
JOURNAL Plasmid 20, 194-206 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.J.McCorquodale, 21-NOV-1988.
FEATURES from to/span description
pept 1306 1575 abortive infection protein
binding 1028 1050 LexA binding site
binding 1100 1121 LexA binding site
binding 1215 1234 LexA binding site
binding 1232 1252 LexA binding site
BASE COUNT 476 a 533 c 531 g 449 t
ORIGIN 5bp upstream of PstI site.
1 ctgcaggtcc gtgccgacca ggtgcttaag gggtggaaaa atatcccgcg cgggatctcc
61 ctgaccttct ccctgtttgc cgagatcgcc ggccgggaca gggaaaccat cgaccaggcc
121 tggaaaaata tcttctactc gcaactgagg gaaaaaaaac accgctttta ccaaagatat
181 cgaggccatc cgcgccctga aaaaactgcc tgccctcacc ggcgacagct ggcgcgggat
241 ggcatcacgg tgcgtatcta ccgcccggaa aattacgccc gcggcgatgg cggcttacac
301 tgagcctccc ggaaaattac gccacccaga tgtggaacat cccgttcccg gagcttgaat
361 accgcctctt taccgccgat ccgggctaca gcgccctgat cagcgccgaa cccgacaggt
421 gggacaaggc cttccgtttt gtggacgggg tgtgcgagct tcacctttac accaacggtg
481 tggaaggaag atcacaatcc caccccgctc ggggatgtcg ctcaggcgct gatcaacgtg
541 gtggaagaaa acctgctgta acggacccgg atgctgcggg cacaactgca tcatcaggag
601 gatgcaatga aaggacgaca gagccgctat gttaccggcg gagagagttt cgcggagatt
661 gcccgtctcc cttcaggggc ggtggtgagg ctctgtctga acaccggtct tgaggatgcg
721 ctgcgggagg cctccaaatc gctcaagtca gccttcaccc gttccgggcg aaaatgccgg
781 ctgtcagcgg gtacggcgca ggggccgttt accggacgcc ggcaggcgtg gccacacatc
841 tcttcgtctc ggtactctga gggggcaggg ggcaaaaaaa gtaaaaatgt attcgccagg
901 ttgcccggag gtgaaggaaa atagacatac agcagaacga cggatagcac tttttgctaa
961 atggacatca gtattactat gctatagttg ctttaatgga taagtgcgcc ttgacaaagg
1021 cggtgatttc tgttaacatt actctcatag tattgttccg tcccgctcca ccccaacaag
1081 atccgtttat ttcccgccag actggttatc accattcagg cccggatttt tttggatttt
1141 tttccgggga gcccccggac gagcttaaaa tcggtatgac aaacaggagg atgcgaatga
1201 acacatcata acagagctga aagataaaac attctgtacg gcattaacag cgttcacgtg
1261 tgtgaggcgc cgggtgcctt ttgacttaaa aacgaggtta ttgagatgac caaaatcaag
1321 acagttactt ttgtaaatac ttacccggga gggtctatga aaaacttgtt agacaccgag
1381 ggaacggttc tattcccatt ccagactgaa atccatttta tttggacgat tttctccacc
1441 gttaaacgcc tggttatcgg aaccagggac catatttgcc agaagcaata ctggagcgcc
1501 tgtctctgta ttttgcttct tatggcctat gtgggtctct gtgctgcggt ggtctggttt
1561 gtagtgccct gctgaaggcc tttatagtgt cgaaatttgc ggtttcggca ctatgggtca
1621 cgccagtaaa gcgcggacta ctctggggta tcggtaaagt ggttaccgcc acttgccgaa
1681 gatttactct gctaaagtaa gtagccgcaa cgctacacga actgatggtg aatgtcaaca
1741 gatactcacc atctccttac ggcggtggtc cctgtgacca ctggcctttc gcgtgggtgc
1801 aacacggcaa aactcctctg tacaacaggc tcccgccgtc attttccggc acaggtgagg
1861 ccggaattcg gactaaaacg taaaccgcgg gccagtccgg tagcgttcac tatcggccag
1921 cattctctca accagagaga aatccttttc accgcagaac acgtacgtct ccgcgaactc
1981 cacctgcag
//
LOCUS HUMET3 2223 bp ss-mRNA PRI 25-JUL-1990
DEFINITION Human endothelin 3 (EDN3) mRNA, complete cds.
ACCESSION J05081
KEYWORDS endothelin.
SOURCE Human adult hypothalamus, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2223)
AUTHORS Bloch,K.D., Eddy,R.L., Shows,T.B. and Quertermous,T.
TITLE cDNA cloning and chromosomal assignment of the gene encoding
endothelin 3
JOURNAL J. Biol. Chem. 264, 18156-18161 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by K.D.Bloch, 06-OCT-1989.
FEATURES from to/span description
pept 194 910 endothelin 3 precursor
/hgml_locus_uid="LU0066V" /map="unassigned"
/nomgen="EDN1"
sigp 194 268 endothelin 3 signal peptide (put.)
matp 484 544 endothelin 3
matp 670 712 endothelin-like protein
BASE COUNT 575 a 535 c 583 g 530 t
ORIGIN
1 cgggtagcgc gctctgaaag tttatgaccg ccgcagccaa ctcctggccg gagctggaga
61 cgcagcgagc gatcggccgg cctcgaaccc ccacagctgg agggcgaggc cagctgtacc
121 cggccccagt gccctttcgc ggccacaagc ggccgtcctc ctggtccggt gctccggcgc
181 ctgatctagg ttcatggagc cggggctgtg gctccttttc gggctcacag tgacctccgc
241 cgcaggattc gtgccttgct cccagtctgg ggatgctggc aggcgcggcg tgtcccaggc
301 ccccactgca gccagatctg agggggactg tgaagagact gtggctggcc ctggcgagga
361 gactgtggct ggccctggcg aggggactgt ggccccgaca gcactgcagg gtccaagccc
421 tggaagccct gggcaggagc aggcggccga gggggcccct gagcaccacc gatccaggcg
481 ctgcacgtgc ttcacctaca aggacaagga gtgtgtctac tattgccacc tggacatcat
541 ttggatcaac actcccgaac agacggtgcc ctatggactg tccaactaca gaggaagctt
601 ccggggcaag aggtctgcgg ggccacttcc agggaatctg cagctctcac atcggccaca
661 cttgcgctgc gcttgtgtgg ggagatatga caaggcctgc ctgcactttt gcacccaaac
721 tctggacgtc agcagtaatt caaggacggc agaaaaaaca gacaaagaag aggaagggaa
781 ggttgaagtc aaggaccaac aaagcaagca ggctttagac ctccaccatc caaagctcat
841 gcccggcagt ggactcgccc tcgctccatc tacctgcccc cgctgcctct ttcaggaagg
901 agccccttag gaggacaggc ctgcagctcc aatttcatgc aggaaattgg ttttggagag
961 ttttggcaag ttggaaagcc acttactggc ttttgacatg acttctcttg gagaataagt
1021 ggactccaag ctaactcttt gcaaatgtaa acacatgtcc atcttgttaa taaatgcaaa
1081 atgcccgtgc agcagaagca tgcgactttc atatccttgc ctagaatagg ctgcatggtg
1141 tatgtcagtg agggccacga ggcgtcggct ttagacacag atcatagctc tacaggagtt
1201 tatgaatttg aagcttatgg gattttggca gagaaatttt cagctgtgct tgatacccac
1261 caaaagaatg tatctcgaaa gaatgaagga agaagaaaaa aggatccttg atgtttgtga
1321 caagaaaatg agaaagttag tatctgcaat acagagcttg ttcctgttca gtgactgacc
1381 ctctgtattc tgtatagaca ccaggccgat acacagtgga gttcccaggc cttgtttgca
1441 ggaagccgac tgtaaagaca gccccagctc aaggctatta ggttgaatat ttgctttcat
1501 gagtaaatgt ggatctttgg ggaatggctt caaaataagt cacgaacaca aattctttgt
1561 aaattatgta aattcctgtt tatataaatt ggcaacaact tataccgtct gacagttcaa
1621 aatctctttc agctgcgctc ttcccaccga gccgagctta ctgtgagtgt ggagatgtta
1681 tcccaccatg taaagtcgcc tgcgcagggg agggctgccc atctccccaa cccagtcaca
1741 gagagatagg aaacggcatt tgagtgggtg tccagggccc cgtagagaga catttaagat
1801 ggtgtatgac agagcattgg ccttgaccaa atgttaaatc ctctgtgtgt atttcataag
1861 ttattacagg tataaaagtg atgacctatc atgaggaaat gaaagtggct gatttgctgg
1921 taggattttg tacagtttag agaagcgatt atttattgtg aaactgttct ccactccaac
1981 tcctttatgt ggatctgttc aaagtagtca ctgtatatac gtatagagag gtagataggt
2041 aggtagattt taaattgcat tctgaataca aactcatact ccttagagct tgaattacat
2101 ttttaaaatg catatgtgct gtttggcacc gtggcaagat ggtatcagag agaaacccat
2161 caattgctca aatactcaga aagtactgtc aaaagcctaa taaaaaacct aaagtttgct
2221 ctg
//
LOCUS HUMSATAA 293 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human alpha satellite DNA, clone pC1.8.
ACCESSION M26918 J04744
KEYWORDS alpha satellite DNA; satellite DNA.
SOURCE Human (cell line HHW423) DNA, clone pC1.8.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 293)
AUTHORS Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A.
TITLE A human alphoid DNA clone from the EcoRI dimeric family: Genomic
and internal organization and chromosomal assignment
JOURNAL Genomics 5, 822-828 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by A.Baldini, 08-AUG-1989.
FEATURES from to/span description
rpt 1 293 alpha-satellite
BASE COUNT 84 a 55 c 63 g 91 t
ORIGIN Chromosomes 1, 5, and 19; centromere.
1 gatcctttac acagagcaga cttgaaacac tctttttgtg gaatttgcag tggagatttc
61 aagcgctttg aggccaatgg cagaaaagga aatacttcga tataaaaact agacagaatc
121 attctcagaa actgctctgc gatgtgtcgg ttcaactctc agagtttaac ttttcttttc
181 attcagcagt ttggaaacac tctgtttgta aagtctgcaa cgtggatatt tgaccactta
241 gaggccttcg ttggaaacgg gtttttttcc tgtaaggcta gacagaagaa ttc
//
LOCUS HUMSATAB 344 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human alpha satellite DNA, clone pC1.8.
ACCESSION M26919 J04744
KEYWORDS alpha satellite DNA; satellite DNA.
SOURCE Human (cell line HHW423) DNA, pC1.8.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 344)
AUTHORS Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A.
TITLE A human alphoid DNA clone from the EcoRI dimeric family: Genomic
and internal organization and chromosomal assignment
JOURNAL Genomics 5, 822-828 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by A.Baldini, 08-AUG-1989.
FEATURES from to/span description
rpt 1 344 alpha-satellite
BASE COUNT 98 a 67 c 68 g 111 t
ORIGIN Chromosomes 1, 5 and 9; centromere.
1 gaattcccag tagcttcctt gtgttgtgaa cattcaactc acagagttga acgttccctt
61 agacagagca gatttgaaca ctctttttgt gcaattggca agtggagatt tcaagcgctt
121 taaggtcaat ggcagaaaag gaaatatctt cgtttcaaaa ctagacagaa tcattcccac
181 aaactgcgtt gtgatgtgtt cattcaactc acacagttta acctttcttt tcatagagca
241 gttaggaaac agtctgtttg taaattctct aagtggatat tctgacatct tgtggccttc
301 gttggaaacg ggatttcttc atattctgct agacagaaga attc
//
LOCUS HUMSATAC 1049 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human alpha satellite DNA, clone pC1.8.
ACCESSION M26920 J04744
KEYWORDS alpha satellite DNA; satellite DNA.
SOURCE Human (cell line HHW423) DNA, clone pC1.8.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1049)
AUTHORS Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A.
TITLE A human alphoid DNA clone from the EcoRI dimeric family: Genomic
and internal organization and chromosomal assignment
JOURNAL Genomics 5, 822-828 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by A.Baldini 08-AUG-1989.
FEATURES from to/span description
rpt 1 1049 alpha-satellite
BASE COUNT 295 a 198 c 221 g 335 t
ORIGIN Chromosomes 1, 5 and 19; centromere.
1 aaattttctt ttcatacagc agagtttgga aacactctgt ttgtaaagtc tgcacgtgga
61 taagttgtcc acttagaggc attcgttgga aacgggtttt tttcatgtaa ggctacacag
121 aagaattccc agtaacttcc ttgtgttgtg tgtatcaact caaagagttg aacgatcctt
181 tacacagagc agacttctaa cactcttttt gtggaatttg caagtggaga tttcagccgc
241 tttgaagtca aaggtagaaa aggaaatatc ttcctataaa aactagacag aatgattctc
301 agaaactcct ttgtgatgtg tgcgttcaac tcacagagtt taacctttct tttcatagag
361 cagttaggaa acactctgtt tgtaaagtct gcaagtggat attcagacct ctttgaggcc
421 ttcgtggaac gggttttcat ataaggctag gcagagaatt cccagtaact tccttgtgtt
481 gtgtgtgtca actcacagag ttgactttca tttacacaga gcagacttga aacactcttt
541 ttgtaattgc aagtggagat ttcaagcgct ttgagcaagg ccgaaaagga aatatcttcg
601 tataaaaact agacagaatc attctcagaa actgctctgc gatgtgtgcg ttcaactctc
661 agagtttaac ttttcttttc atcagcagtt tggaaacact ctgtttgtaa agtctgcacg
721 tggatatttt gaccacttag aggccttcgt tggaaacggg tttttttcct gtaaggctag
781 acagaagaat tccctgtagc ttccttgtgt tgtgtacatt caacgcacag agttgaacgt
841 tcccttagac agagcagatt tgaaacactc tttttgtgca attggcaagt ggagatttca
901 ggcgctttaa ggtcaatggc agaaaaggaa atatcttcgt ttcaaaacta gacagaatca
961 ttcccacaaa ctgcgtggtg atgtgttcgt tcaactcaca gagtttaacc tttcctttca
1021 tagagcagtt aggaaacagt ctgtttttt
//
LOCUS PVYCPA 1122 bp ss-RNA VRL 25-JUL-1990
DEFINITION Potato virus Y coat protein gene, 3' end.
ACCESSION M22470
KEYWORDS coat protein.
SOURCE Potato virus Y (necrotic strain; isolate New Zealand; N-PVY),
passed in Nicotiana tabacum cv. Burley 21, cDNA to viral RNA, clone
PVYN 27.
ORGANISM Potato virus Y
Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses;
Potyvirus.
REFERENCE 1 (bases 1 to 1122)
AUTHORS Hay,J.M., Fellowes,A.P. and Timmerman,G.M.
TITLE Nucleotide sequence of the coat protein gene of a necrotic strain
of potato virus Y from New Zealand
JOURNAL Arch. Virol. 107, 111-122 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by J.M.Hay, 09-FEB-1989.
FEATURES from to/span description
pept < 1 796 coat protein (AA at 2)
BASE COUNT 351 a 196 c 263 g 312 t
ORIGIN 6 bp upstream of TaqI site.
1 cacaatcgat gcaggaggaa gcactaaaaa ggatgcaaaa caagagcaag gtagcattca
61 accaaatttc aacaaggaaa aggaaaagga cgtgaatgtt ggaacatctg gaactcatac
121 tgtgccacga attaaagcta tcacgtccaa aatgagaatg cccaagagta aaggtgcaat
181 tgcattaaat ttggaacact tactcgagta tgctccacag caaattgaca tctcaaatac
241 tcgagcaact caatcacagt ttgatacgtg gtatgaagca gtacaacttg catacgacat
301 aggagaaact gaaatgccaa ctgtgatgaa tgggcttatg gtttggtgca ttgaaaatgg
361 aacctcgcca aacatcaacg gagtttgggt tatgatggat ggagatgaac aagtcgaata
421 cccactaaaa ccaatcgttg agaatgcaaa accaacactt aggcaaatca tggcacattt
481 ctcagatgtt gcagaagcgt atatagaaat gcgcaacaaa aaggaaccat atatgccacg
541 atatggttta gttcgtaatc tgcgcgatgg aagtttggct cgctatgctt ttgactttta
601 tgaagttaca tcacggacac cagtgagggc tagagaggca cacattcaaa tgaaggccgc
661 agctttaaaa tcagctcaat ctcgactttt cggattggat ggtggcatta gtacacaaga
721 ggaaaacaca gagaggcaca ccaccgagga tgtttctcca agtatgcata ctctacttgg
781 agtgaagaac atgtgattgt agtgtctttc cggacgatat atagatattt atgtttgcag
841 taagtatttt ggcttttcct gtactacttt tatcgaaatt aataatcgtt tgaatattac
901 tggcagatag gggtggtata gcgattccgt cgttgtagtg accttagctg tcgtttctgt
961 attattatgt ttgtataaaa gtgccgggtt gttgttgttg tggctgatct atcgattagt
1021 tgatgttgcg atttgtcgta gcagtgacta tgtctggatt tagttagttg ggtgatgctg
1081 tgattctgtc atagcagtga ctgtaaactt caatcaggag ac
//
LOCUS SRAAFPG 2420 bp ds-DNA VRT 25-JUL-1990
DEFINITION Sea raven (H.americanus) antifreeze protein type II gene, complete
cds.
ACCESSION J05100
KEYWORDS antifreeze protein.
SOURCE Sea raven (adult) testes DNA, clone lambda SR7.
ORGANISM Hemitripterus americanus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes;
Actinopterygii; Scorpaeniformes; Cottoidei; Cottidae.
REFERENCE 1 (bases 1 to 2420)
AUTHORS Hayes,P.H., Scott,G.K., Ng,N.F.L., Hew,C.L. and Davies,P.L.
TITLE Cystine-rich type II antifreeze protein precursor is initiated from
the third AUG codon of its mRNA
JOURNAL J. Biol. Chem. 264, 18761-18767 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.L.Davies, 19-OCT-1989.
FEATURES from to/span description
pept 434 494 antifreeze protein, exon 2 (first expressed
exon)
1246 1382 antifreeze protein, exon 3
1488 1604 antifreeze protein, exon 4
1697 1805 antifreeze protein, exon 5
2045 2112 antifreeze protein, exon 6
pre-msg 226 2112 antifreeze protein mRNA and introns
IVS 284 389 antifreeze protein intron A
IVS 495 1245 antifreeze protein intron B
IVS 1383 1487 antifreeze protein intron C
IVS 1605 1696 antifreeze protein intron D
IVS 1806 2044 antifreeze protein intron E
rpt 74 94 repeat copy A
rpt 95 114 repeat copy B
rpt 115 135 repeat copy C
signal 74 135 antifreeze protein regulatory sequence (put.)
site 161 164 antifreeze protein CAAT box
site 195 198 antifreeze protein TATA box
signal 2368 2373 antifreeze protein polyA signal
BASE COUNT 684 a 475 c 496 g 765 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttcaga aattcactcc tttttctaat attaacttta aagccacagt gtgcgatttg
61 gagccctttg atttgttgtt ttcaaagttc aaactgttgt ttcaaaattc aaactgttgt
121 tttcaaagtt caaactgatg ccagtgtcca taataaaaat caatgtatga ataatattgt
181 gaaatgtaat tgactatata agagctggtc tttctctagt tcagcacatg aatgcagagg
241 caacaggctg acactgaaac aagagaagat atttctacag caggtttgct ctcagcctct
301 tcttcgtcct gccgagcccc acaggcactg tgctgccctg ctgtctttgt aattcattgc
361 aactcttgtg tttttctctt ctgatgcagg gctatcaatc atcttcatcg tctgcaccat
421 ctctaccacg aggatgctga ctgtgtctct actggtttgt gccatgatgg ctctgactca
481 agctaatgat gacagtgagt ctcagtctta cattctgtgt gtaggatact atactgtctg
541 taaatatatt caattgtaga cctattaaga tgctgtgaat attaatatta ggtaatattt
601 agtttattta tatatgtata tatatttgac agtaataaca aaaaactagg atagattgca
661 atccgacttt ttgttatctt tattgttaac aatattaaag acataattcc atagaattat
721 ataatttaca tagaaacagc aaatacaact gtcagagaaa gacttgacag ctaaagcagg
781 agagatcaag tgtagaaggg agatttgatc tcgtctcaac tgaagctaga actgaatgta
841 ctaacttatt tttggtgaaa caaccgaata attaattcat ttttccccca caaaactaaa
901 cgagacgcag accaagctaa gtgtgtgcta acagtaatca gcattcgttt agcaaagtat
961 tagtaactgc catcacagct tttgactcta gtggaattca tgaaatttgg cagaacaaag
1021 gagacctgtg cacatctgat tccaatgaga atacaatgtg cttcacagaa aagcacttca
1081 ccaatcctgt acacattcat aaagccacag aaaaaaagag agctgattaa tcgtcgttcc
1141 ctctgctctg acaataaaag gattataaac tccagatttc tgataaacag actcggtggc
1201 ttacctgtga tcagacatgt tacccactct tctgtttgtc ctcagaaata ctcaaaggca
1261 cggctacaga ggctggaccg gtctctcaga gagccggacc aaactgtccc gctggttggc
1321 aacctcttgg tgaccgctgt atctattatg agacaacagc gatgacttgg gctctggctg
1381 aggtagtcag gatatgatta tgattcagat tgcttctaaa ctggtctggt ggtattgcct
1441 tacatgctcg gttaattgag catgagcttg actcatttcc actgcagaca aactgtatga
1501 aattgggtgg acaccttgca tccatccaca gccaggagga gcatagtttc attcagacct
1561 tgaatgctgg tgttgtatgg atcggaggct ccgcttgcct ccaggtaaaa cattgcatta
1621 caatggtggc agaaagaaag gatttttatt acatgctatc ttactatacg tatattcttt
1681 cctttctgtt ttctaggcag gtgcttggac ctggtctgat ggtacaccta tgaattttcg
1741 ttcctggtgt tctaccaaac ctgatgatgt actggccgcg tgctgtatgc agatgactgc
1801 tgcaggtaaa tcacaacaca ttagagcata gtattaaatg actgaaggca gtagtgttgt
1861 ttagtacatt tggttcatct tgagatcaat actctcagaa tttcactttt gaatcacttt
1921 tgttcttcag ttcatgtgta gctttggcct cgttatccgt gtctttgtct gtctagtgat
1981 gaagacagtt tcaggttagg ttggtatggc gctgactcac ttcttgtgtt tttgatgttt
2041 acagctgacc aatgctggga tgacttgcct tgtccggcgt cccacaaatc agtctgcgcc
2101 atgacattct aagctaacac agaggccatc catcacacaa acactttagt gggtgtttga
2161 ttgtgtgtgt tcgcatactc atctgtgttc gtgtcaacag cctcatgctg aacctgaagg
2221 ttcaaaatct catatgacat ctttaattct ttgctattgt tggagctgcc tgaaaggatg
2281 agacgacaag agctggaaag catctgaggg attttaggaa gaaagtgaat ggttatgaaa
2341 atgatggtct ttttatgtat tatgtcaaat taaaaggctg acacgttgaa acaaactctt
2401 ctgtgagttt ggcagaattc
//
LOCUS YSCTFIIDA 2439 bp ds-DNA PLN 25-JUL-1990
DEFINITION Yeast (S.cerevisiae) TATA-binding protein (TFIID) gene, complete
cds.
ACCESSION M27135
KEYWORDS DNA binding protein; TATA-binding protein; transcription factor.
SOURCE Yeast (S.cerevisiae, strain S288C) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2439)
AUTHORS Hahn,S., Buratowski,S., Sharp,P.A. and Guarente,L.
TITLE Isolation of the gene encoding the yeast TATA binding protein
TFIID: A gene identical to the SPT15 suppressor of Ty element
insertions
JOURNAL Cell 58, 1173-1181 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Hahn, 10-AUG-1989.
FEATURES from to/span description
pept 1237 1959 TATA-binding protein
BASE COUNT 737 a 481 c 472 g 749 t
ORIGIN 1 bp upstream of EcoRI site; chromosome 5 right arm after TRP1.
1 gaattcgttc aagtggtccg taatattccc gtctttacaa agctggatta ccatctctaa
61 tgccaacttc catgcatata gctcaggccc caccgtgtgc agctccgtgc ttcgcagctc
121 ctgcagagca tcctcgggga ttgggaacct ctcatttagc aagtaattca cataacacag
181 atttagaaac catttccatt gtgacttttc ccgacattgc gagagtagcc catgaaaact
241 cgtcttcacc ctgcggtgct gtttcagctt aatgcaaagc atcacgccga catactggaa
301 tacggatgcc caattttgat acaactcatc ctgcaaattt accatgtact ggactaattc
361 attgcaattt cttagtgcaa tcttatagtg gaacttactg tctctcataa gtggcaagtc
421 atgtaacagc agaaactcgc aacgcatgat ctcttctacc aaatctgtgt cgctctggtg
481 cgtttgtaac cgttctttca aactggaaat gtaaagctct gctaggtcaa aattatacgt
541 ctcctgtatc aataactcca ccatctcaaa cgtgacctta ctatcctcca gaactgaaag
601 cgtacatttc gttttcaata gctgaaacat ctggatagac atgttcatga ggccataata
661 ctgcttcaac ccttcctcag aaccgatttt attcgcaatt gatatgcatg gtctctgtat
721 tcctgtgcta agtggtatac ttgtgaaata ctaagtttgt cgccaagatt ttccatgaat
781 ttgtacttct ttcgaaatcg ttcaatttct accaatactg attcccctct gatagctgag
841 atgtcgggat tccctttgct gatagatcta actcatctct ttacgtattt taattgtgaa
901 gccgtaaata gttatcttcc aagtttctct tacgcgagct ttttgggaaa agaaaaaaat
961 ttgaagatct acatataaaa catggcttca aaggattact aatgactttt tttaccttga
1021 taggtattct tgatggtaag agtaaacaag ggacgtgaaa attacagtag ttactgtttt
1081 ttttggacta taagatcggg ggaaagataa cacataagaa ataaaacgac tactagttag
1141 actgctctgc ggaagaagca aggaagtaaa ggctgcattt tatttttctt ttctagtcca
1201 acataaacag gtgtatcaag agaaactttt ttaattatgg ccgatgagga acgtttaaag
1261 gagtttaaag aggcaaacaa gatagtgttt gatccaaata ccagacaagt atgggaaaac
1321 cagaatcgag atggtacaaa accagcaact actttccaga gtgaagagga cataaaaaga
1381 gctgccccag aatctgaaaa agacacctcc gccacatcag gtattgttcc aacactacaa
1441 aacattgtgg caactgtgac tttggggtgc aggttagatc tgaaaacagt tgcgctacat
1501 gcccgtaatg cagaatataa ccccaagcgt tttgctgctg tcatcatgcg tattagagag
1561 ccaaaaacta cagctttaat ttttgcctca gggaaaatgg ttgttaccgg tgcaaaaagt
1621 gaggatgact caaagctggc cagtagaaaa tatgcaagaa ttatccaaaa aatcgggttt
1681 gctgctaaat tcacagactt caaaatacaa aatattgtcg gttcgtgtga cgttaaattc
1741 cctatacgtc tagaagggtt agcattcagt catggtactt tctcctccta tgagccagaa
1801 ttgtttcctg gtttgatcta tagaatggtg aagccgaaaa ttgtgttgtt aatttttgtt
1861 tcaggaaaga ttgttcttac tggtgcaaag caaagggaag aaatttacca agcttttgaa
1921 gctatatacc ctgtgctaag tgaatttaga aaaatgtgat ggggaaggag tagacgaaaa
1981 gaaaaaaagg ttttctattt gttccatttt ctcaattatt aatggtcctc aaagaaataa
2041 aagaaaagga agaagaagta attgtaatat caaacggttt tttatagtat attcttctta
2101 ttctatattt atatatcaat gttttataat aagatgttta ttcatagcat atctggtgga
2161 tcgtctctat taagcgccag cgaggtgttt gcctctgcat ttttcagcaa agcaagctcc
2221 ctttccagct tgaatctatg ttcacgctca tccgacaatt ctttttcata ctttctttgt
2281 gtactcgtaa gcactttttt aaactcactt gtcattattg aaagtgaacg tgatccagaa
2341 ccgcttgtgg ggcttcctac agaggaaggt gaacttggat cccaagtcac tggcgaactc
2401 gctggtgatg acatgccgaa attatgtctg cttgaattc
//
LOCUS ECOPUTC 730 bp ds-DNA BCT 25-JUL-1990
DEFINITION E.coli putC region encoding proline uptake protein (putP) and
proline oxidase (putA) genes, 5'ends.
ACCESSION M35174
KEYWORDS putA protein; putC region; putP protein.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 730)
AUTHORS Nakao,T., Yamato,I. and Anraku,Y.
TITLE Nucleotide sequence of putC, the regulatory region for the put
regulon of Escherichia coli K 12
JOURNAL Mol. Gen. Genet. 210, 364-368 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 129 < 1 (c) proline uptake protein (putP)
pept 549 > 730 proline oxidase (putA)
pept 209 544 ORF
mRNA 266 < 1 (c) putP mRNA (alt.)
mRNA 255 < 1 (c) putP mRNA (alt.)
mRNA 249 < 1 (c) putP mRNA (alt.)
mRNA 224 < 1 (c) putP mRNA (alt.)
mRNA 142 < 1 (c) putP mRNA (alt.)
mRNA 506 > 730 putA mRNA
BASE COUNT 205 a 173 c 144 g 208 t
ORIGIN
1 cccaagacta cgaccgccca gaatatagtc gtcaaagttt ttcgttgatc gccaggcgat
61 aaacccaatc aatatcatgc caaagatata gacacaaaat gtcaccaaca tcggtgtgct
121 aatagccatc taaagtctcc aaaaaattat tatcggcaat gtcgaaactt gccgttatat
181 ctgccaccgg aacggggtaa cagagtttat gttttaccag ggcgaccgta tcctgccgga
241 agcgctggtt attcacaatc gatttaacac accatttaca ttaaatttta gtgctcagcg
301 acactatttt tcatcaggtt gcactctctc acattttttg cggttgcacc tttcaaaaat
361 gttaactgcc gcagagaaaa agtctgagtt atttttttcc ctgtcatatc gatttctttt
421 attaacattt cattcatttt taagcttgct acgcatgtca catttaacat ggttgcacaa
481 agttgcaaca tcatggatat ttcacgataa cgttaagttg cacctttctg aacaacagga
541 gtaatggcat gggaaccacc accatggggg ttaagctgga cgacgcgacg cgtgagcgta
601 ttaagttcgc cgcgacacgt atcgatcgca caccacactg gttaattaag caggcgattt
661 tttcttatgc tcgaacaact ggaaaacagc gatactctgc cggagctacc tgcgctgctt
721 tctggcgcgg
//
LOCUS FIBGLUC 1426 bp ds-DNA BCT 25-JUL-1990
DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene,
complete cds.
ACCESSION M33676 M33311
KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase.
SOURCE F.succinogenes (strain S85) DNA, clone PJI5.
ORGANISM Fibrobacter succinogenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1426)
AUTHORS Teather,R.M. and Erfle,J.D.
TITLE DNA sequence of a Fibrobacter succinogenes mixed linkage
beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene
JOURNAL J. Bacteriol. 172, 3837-3841 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.M.Teather, 11-APR-1990.
FEATURES from to/span description
pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase
precursor (EC 3.2.1.73)
sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase
signal peptide
matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase
binding 132 137 ribosome binding site
signal 62 66 -35 region
signal 85 90 -10 region
BASE COUNT 371 a 346 c 335 g 374 t
ORIGIN
1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt
61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt
121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc
181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt
241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg
301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc
361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc
421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct
481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac
541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac
601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt
661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat
721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat
781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt
841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc
901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa
961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct
1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac
1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt
1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg
1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt
1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa
1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt
1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa
//
LOCUS BFRRCRRA 89 bp ss-RNA PHG 25-JUL-1990
DEFINITION Bacteriophage fr coat protein replicase cistron (R region) RNA.
ACCESSION M35063
KEYWORDS coat protein.
SOURCE Bacteriophage fr RNA.
ORGANISM Bacteriophage fr
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Leviviridae.
REFERENCE 1 (bases 1 to 89)
AUTHORS Cielens,I.E., Jansone,I.V., Gribanov,V.A., Vishnevskii,Y.I.,
Berzin,V.M. and Gren,E.J.
TITLE Regulator region of phage fr replicase cistron: II. Isolation and
structure of specific fr RNA fragments
JOURNAL Mol. Biol. 16, 886-892 (1982)
STANDARD simple staff_entry
FEATURES from to/span description
pept 55 > 89 coat protein (R region)
pept < 1 20 undefined ORF (AA at 3)
BASE COUNT 34 a 23 c 13 g 19 t
ORIGIN
1 ccaactcggg aatctactaa gaaacccgtg ccattccaac aatgaggaat acccatgtca
61 aaatcaacaa agaagttcaa ctctttatg
//
LOCUS CHKAGLBB 71 bp ss-mRNA VRT 25-JUL-1990
DEFINITION Chicken alpha-globin gene, partial cds.
ACCESSION M35068
KEYWORDS alpha-globin.
SOURCE Chicken (strain white Leghorn) 2-3 week old, cDNA to mRNA, clone
pHb1003.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 71)
AUTHORS Cummings,I.W., Liu,A.Y. and Salser,W.A.
TITLE Identification of a new chicken alpha-globin structural gene by
complementary DNA cloning
JOURNAL Nature 276, 418-419 (1978)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 71 alpha-globin (AA at 1)
BASE COUNT 17 a 22 c 18 g 14 t
ORIGIN
1 aagaaggtag tggctgcctt gatcgaggct gccaaccaca ttgatgacat cgccggcacc
61 ctctccaagc t
//
LOCUS ECOTGLPA 141 bp ds-DNA BCT 25-JUL-1990
DEFINITION E.coli suppressor tRNA-Leu (leuX) precursor gene.
ACCESSION M35064
KEYWORDS leuX gene; suppressor transfer RNA-Leu.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 141)
AUTHORS Nomura,T. and Ishihama,A.
TITLE A novel function of RNase P from Escherichia coli: Processing of a
suppressor tRNA precursor
JOURNAL EMBO J. 7, 3539-3545 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
tRNA 23 114 Leu-tRNA
anticdn 57 59 Leu-tRNA anticodon caa
site 35 36 self-cleavage site
BASE COUNT 38 a 34 c 33 g 36 t
ORIGIN
1 gttttccgca tacctcttca gtgccgaagt ggcgaaatcg gtagacgcag ttgattcaaa
61 atcaaccgta gaaatacgtg ccggttcgag tccggccttc ggcaccaaaa gtatgtaaat
121 agacctcaac tgaggtcttt t
//
LOCUS HUMFBPC 66 bp ss-mRNA PRI 25-JUL-1990
DEFINITION Human folate binding protein mRNA, partial cds.
ACCESSION M35069
KEYWORDS folate binding protein.
SOURCE Human epidermoid carcinoma cell line KB, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 66)
AUTHORS Sadasivan,E. and Rothenberg,S.P.
TITLE Molecular cloning of the complementary DNA for a human folate
binding protein
JOURNAL Proc. Soc. Exp. Biol. Med. 189, 240-244 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 66 folate binding protein (AA at 1)
BASE COUNT 21 a 17 c 17 g 11 t
ORIGIN
1 acaaggattg catgggccag gactgagctt ctcaatgtct gcatgaacgc caagcaccac
61 aaggaa
//
LOCUS HUMMETONA 90 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human met oncogene, middle exon.
ACCESSION M35073
KEYWORDS met oncogene; tyrosine kinase.
SOURCE Human cell line MNNG-HOS DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 90)
AUTHORS Dean,M., Park,M., Le Beau,M.M., Robins,T.S., Diaz,M.O.,
Rowley,J.D., Blair,D.G. and Vande Woude,G.F.
TITLE The human met oncogene is related to the tyrosine kinase oncogenes
JOURNAL Nature 318, 385-388 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 22 / 90 met oncogene (AA at 24)
/hgml_locus_uid="LN0032R" /nomgen="MET"
/map="7q31"
IVS < 1 21 met oncogene intron
BASE COUNT 28 a 16 c 21 g 25 t
ORIGIN Chromosome 7q31.
1 ttggctttgg tcttcaagta gccaaagcga tgaaatatct tgcaagcaaa aagtttgtcc
61 acagagactt ggctgcaaga aactgtatgt
//
LOCUS HUMMETONB 375 bp ss-mRNA PRI 25-JUL-1990
DEFINITION Human met oncogene mRNA, 3' end.
ACCESSION M35074
KEYWORDS met oncogene; tyrosine kinase.
SOURCE Human cell line MNNG-HOS, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 375)
AUTHORS Dean,M., Park,M., Le Beau,M.M., Robins,T.S., Diaz,M.O.,
Rowley,J.D., Blair,D.G. and Vande Woude,G.F.
TITLE The human met oncogene is related to the tyrosine kinase oncogenes
JOURNAL Nature 318, 385-388 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 375 met oncogene /hgml_locus_uid="LN0032R"
/nomgen="MET" /map="7q31"
BASE COUNT 94 a 97 c 83 g 101 t
ORIGIN Chromosome 7q31.
1 tggtcctttg gcgtcgtcct ctgggagctg atgacaagag gagccccacc ttatcctgac
61 gtaaacacct ttgatataac tgtttacttg ttgcaaggga gaagactcct acaacccgaa
121 tactgcccag accccttata tgaagtaatg ctaaaatgct ggcaccctaa agccgaaatg
181 cgcccatcct tttctgaact ggtgtcccgg atatcagcga tcttctctac tttcattggg
241 gagcactatg tccatgtgaa cgctacttat gtgaacgtaa aatgtgtcgc tccgtatcct
301 tctctgttgt catcagaaga taacgctgat gatgaggtgg acacacgacc agcctccttc
361 tgggagacat catag
//
LOCUS MS23ENDA 105 bp ss-RNA PHG 25-JUL-1990
DEFINITION Bacteriophage MS2 3' terminal fragment.
ACCESSION M35059
KEYWORDS .
SOURCE Bacteriophage MS2 RNA.
ORGANISM Bacteriophage MS2
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Leviviridae.
REFERENCE 1 (bases 1 to 105)
AUTHORS Contreras,R., Vandenberghe,A., Jou,W.M., De Wachter,R. and Fiers,W.
TITLE Studies on the Bacteriophage MS2 nucleotide sequence of a 3'
terminal fragment (n=104)
JOURNAL FEBS Lett. 18, 141-144 (1971)
STANDARD simple staff_entry
BASE COUNT 21 a 34 c 30 g 20 t
ORIGIN
1 gctccaccga aaggtgggcg ggcttcggcc cagggacccc tccctaaaga gaggacccgg
61 gattctcccg atttggtaac tagctgcttg gctagttacc accca
//
LOCUS PEAPCATE 1004 bp ds-DNA SYN 25-JUL-1990
DEFINITION Chimaeric gene with P.sativum ribulose 1,5-bisphosphate carboxylase
5' flank/A.tumefaciens chloramphenicol acetyltransferase gene, 5'
end.
ACCESSION M35072
KEYWORDS .
SOURCE Recombined Pisum sativum and Agribacterium tumefaciens DNA inserted
in decapitated tobacco seedlings.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 1004)
AUTHORS Herrera-Estrella,L., Van den Broeck,G., Maenhaut,R.,
Van Montagu,M., Schell,J., Timko,M. and Cashmore,A.
TITLE Light-inducible and chloroplast-associated expression of a
chimaeric gene introduced into Nicotiana tabacum using a Ti plasmid
vector
JOURNAL Nature 310, 115-120 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1002 > 1004 chloramphenicol acetyltransferase (CAT)
recomb 965 966 P.sativum DNA end/CAT DNA start
signal 943 946 TATA box
BASE COUNT 309 a 176 c 146 g 373 t
ORIGIN
1 gaattcaaca ttggctatta ctggttttac aaagtcagac taaggagcat gtccaaccac
61 tataaggtct ataataggat ttaccttttt ccttagaagc actttaatca actagaaatc
121 aaagaagcaa aatgtagtgt ctagatcttc atcagaagta aagtatagag ctttagcaaa
181 cacatcatgt gagacacgat ggtttctata cttgcttcag gatctctgca tttcccatac
241 ctcgttcatg acaattgcaa accaacctcg tacatttgat gcccataatt tctgaaaacc
301 aagttgcata cctcttcacc aaaactcttc atcttggtct cttctcctct tttgttcaca
361 aactaggaat tattaacttt cattctaatt tataggggct gctacaactt aatatatttt
421 taattatttt tattctctta atttcctttt tttctatttg tttgtcaggt agttgagata
481 tttgggctaa tctattagag atagtttctc taacaaactt gtaactttgg gtctatatta
541 gctaatgatt catcttatat tttttcaaat gaatcattaa taaaactttc ctcttttatt
601 taattttttc aattcagttt catcatcaaa gcaaatgttt ccctgccatc tgtttgtcaa
661 cactaacatc taatgtactt atctcattag tttaattatt gtttgatcat gtttaatcct
721 tctagtgttg ttagtttttt cagttagctt aatgggcatc ttacacgtgg cattatccta
781 ttggtggcaa atgataaggt taggacacac aacttttcaa tcttgtgtgg ttaatatggc
841 tgcaaagttt atcatttcac aatctaacaa gattggtact aggcagtagc taattaccac
901 aatattaaga ccataatatt ggaaatagat aaataaaaac attatatata gcaagtttta
961 gcagaagctt ggcgagattt tcaggagcta aggaagctaa aatg
//
LOCUS TRFMTTGVA 149 bp ds-DNA ORG 25-JUL-1990
DEFINITION C.oncopelti mitochondrion Val-tRNA gene.
ACCESSION M35071
KEYWORDS transfer RNA-Val.
SOURCE C.oncopelti mitochondrial DNA, clone pCo150.
ORGANISM Mitochondrion Crithidia oncopelti
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Crithidia oncopelti.
REFERENCE 1 (bases 1 to 149)
AUTHORS Entelis,N.S., Maslov,D.A., Bol'shakova,E.V. and Zaitseva,G.N.
TITLE Primary structure of an unusual valine tRNA gene from mitochondria
of Crithidia oncopelti
JOURNAL Dokl. Biochem. 297, 435-438 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
tRNA 18 89 Val-tRNA
anticdn 45 47 Val-tRNA anticodon tac
BASE COUNT 44 a 39 c 16 g 50 t
ORIGIN
1 gatctaaaat ccctgttaga cacttgtttt tgcaaacgta taattacgtt ttctacacca
61 aaacccttta aatccctgtt aggaccccat ttcttcaaat gtataatcac gttttctgcg
121 tccaaacccc ttaaaaccca gatttcgat
//
LOCUS YSCTRV2A 75 bp ss-tRNA RNA 25-JUL-1990
DEFINITION Yeast (S.cerevisiae, Baker's) Val-tRNA-2a.
ACCESSION M35070 K01066
KEYWORDS transfer RNA-Val.
SOURCE Yeast (S.cerevisiae, Baker's) tRNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Aksel'rod,V.D., Kryukov,V.M., Isaenko,S.N. and Baev,A.A.
TITLE Nucleotide sequence in Val-tRNA-2a from Baker's yeast
JOURNAL FEBS Lett. 45, 333-336 (1974)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 75)
AUTHORS Aksel'rod,V.D., Kryukov,V.M., Isaenko,S.N.. and Baev,A.A.
TITLE Primary structure of Val-tRNA-2a from Baker's yeast
JOURNAL Mol. Biol. 9, 42-48 (1975)
STANDARD simple staff_entry
COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their
entry 2050 in Nucleic Acids Res. 11, r1-r54 (1983). [1] compared
given sequence with that of baker's yeast Val-tRNA-1.
FEATURES from to/span description
tRNA 1 75 Val-tRNA-2a (NAR: 2050)
anticdn 35 37 Val-tRNA-2a anticodon tac
modified 10 10 m2g
modified 16 16 d
modified 19 19 d
modified 20 20 d
modified 27 27 m22g
modified 28 28 p
modified 33 33 p
modified 35 35 unidentified uridine derivative
modified 46 46 d
modified 48 48 m5c
modified 53 53 t
modified 54 54 p
modified 57 57 m1a
BASE COUNT 16 a 22 c 20 g 17 t
ORIGIN 5' end of mature tRNA.
1 ggtccaatgg tccagtggtt caagacgtcg cctttacacg gcgaatcccg agttcgaacc
61 tcggttggat cacca
//
LOCUS YSCTRW 75 bp ss-tRNA RNA 25-JUL-1990
DEFINITION Yeast (S.cerevisiae) Trp-tRNA-cca.
ACCESSION M35060 X02698
KEYWORDS transfer RNA-Trp.
SOURCE Yeast tRNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Keith,G., Roy,A., Ebel,J.P. and Dirheimer,G.
TITLE The nucleotide sequences of two tryptophane-tRNAs from Brewer's
yeast
JOURNAL FEBS Lett. 17, 306-308 (1971)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 75)
AUTHORS Keith,G., Roy,A., Ebel,J.-P. and Dirheimer,G.
TITLE The primary structure of tryptophan transfer ribonucleic acid from
Brewer's yeast: II. Partial digestion with pancreatic ribonuclease
and derivation of complete sequence
JOURNAL Biochimie 54, 1417-1426 (1972)
STANDARD full staff_review
FEATURES from to/span description
tRNA 1 75 transfer RNA-Trp
anticdn 33 35 Trp-tRNA anticodon cca
modified 9 9 1-methylguanosine
modified 10 10 2-methylguanosine
modified 16 16 dihydrouridine
modified 17 17 2'-O-methylguanosine
modified 19 19 dihydrouridine
modified 25 25 pseudouridine
modified 26 26 pseudouridine
modified 27 27 pseudouridine
modified 31 31 2'O-methylcytidine
modified 33 33 2'O-methylcytidine
modified 38 38 pseudouridine
modified 45 45 7-methylguanosine
modified 47 47 dihydrouridine
modified 53 53 5-methyluridine (ribosylthymine)
modified 54 54 pseudouridine
modified 57 57 1-methyladenosine
modified 64 64 pot. pseudouridine
BASE COUNT 17 a 18 c 20 g 20 t
ORIGIN
1 gaagcggtgg ctcaatggta gagctttcga ctccaaatcg aagggttgca ggttcaattc
61 ctgtccgttt cacca
//
LOCUS YSUTRAI 76 bp ss-tRNA RNA 25-JUL-1990
DEFINITION Yeast (T.utilis) Ala-tRNA-I.
ACCESSION M35061 K00143
KEYWORDS transfer RNA-Ala.
SOURCE Yeast (T.utilis) tRNA.
ORGANISM Candida utilis
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina.
REFERENCE 1 (bases 1 to 76)
AUTHORS Takemura,S., Ogawa,K. and Nakazawa,K.
TITLE Nucleotide sequence of alanine tRNA I from Torulopsis utilis
JOURNAL FEBS Lett. 25, 29-32 (1972)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 76)
AUTHORS Takemura,S. and Ogawa,K.
TITLE The primary structure of alanine transfer ribonucleic acid 1 from
Torulopsis utilis: II. Partial digestion with ribonuclease T-1 and
derivation of the complete sequence
JOURNAL J. Biochem. 74, 323-333 (1973)
STANDARD full staff_review
COMMENT Contributed on tape April 1983 by M.Sprinzl and D.H.Gauss; from
their entry 0020 in Nucleic Acids Res. 11, r1-r54 (1983). [1]: The
cloverleaf model for the secondary structure was compared with that
of Saccharomyces Ala-tRNA, especially with respect to the
aminoacyl-tRNA synthetase recognition sites.
FEATURES from to/span description
tRNA 1 76 Ala-tRNA-I (NAR: 0020)
anticdn 34 36 Ala-tRNA-I anticodon ggc
modified 9 9 m1g = 1-methylguanosine
modified 16 16 d = dihydrouridine
modified 17 17 d = dihydrouridine
modified 20 20 d = dihydrouridine
modified 26 26 m22g = 2,2-dimethylguanosine
modified 27 27 f = pseudouridine
modified 34 34 i = inosine
modified 37 37 m1i = 1-methylinosine
modified 38 38 f = pseudouridine
modified 47 47 d = dihydrouridine
modified 54 54 t = 5-methyluridine
modified 55 55 f = pseudouridine
modified 58 58 m1a = 1-methyladenosine
BASE COUNT 9 a 21 c 28 g 18 t
ORIGIN 5' end of mature tRNA
1 gggcgtgtgg cgtagttggt agcgcgttcg cttggcgtgc gaaaggtctc cggttcgact
61 ccggactcgt ccacca
//
LOCUS MUSPTKA 211 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD15.
ACCESSION M33421
KEYWORDS protein-tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 211)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 211 protein-tyrosine kinase (AA at 3) (EC
2.7.1.112)
BASE COUNT 47 a 55 c 64 g 45 t
ORIGIN
1 ggatccacag ggacctggct gctcggaact gcctggtgac agagaagaat gtcctgaaga
61 tcagcgactt tgggatgtcc cgcgaagaag ctgatgggat ctatgccgcc tgcagcggcc
121 tcagacaagt ccctgttaag tggactgccc ctgaggccct taactacgga cgctactcct
181 cagagagtga tgtgtggagc tttggaattc c
//
LOCUS MUSPTKB 211 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD16.
ACCESSION M33422
KEYWORDS protein-tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD16.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 211)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 211 protein-tyrosine kinase (AA at 3) (EC
2.7.1.112)
BASE COUNT 50 a 50 c 59 g 52 t
ORIGIN
1 ggatccacag agaccttgct gctaggaact gcatggatgc cgaagatttc acagtaaaaa
61 ttggagattt cggtatgaca cgagacatct acgagacgga ctactaccgg aaaggcggga
121 aggggttgct gcctgtgcgc tggatgtctc tcgagtccct caaggatggt gtcttcacta
181 ctcattctga cgtctggtcc ttcggaattc c
//
LOCUS MUSPTKC 214 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD17.
ACCESSION M33423 M22448 J04523
KEYWORDS protein-tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD17.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 214)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
REFERENCE 2 (sites)
AUTHORS Wilks,A.F.
TITLE Two putative protein-tyrosine kinases identified by application of
the polymerase chain reaction
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1603-1607 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [2] kindly submitted by
A.Wilks, 08-FEB-1989, for release after publication.
FEATURES from to/span description
pept < 1 > 214 protein-tyrosine kinase (AA at 3) (EC
2.7.1.112)
BASE COUNT 69 a 42 c 58 g 45 t
ORIGIN
1 ggatccacag ggacctggca acaaggaaca tattggtgga aaatgagaac agggttaaaa
61 taggagactt cggattaacc aaagtcttgc cgcaggacaa agaatactac aaagtaaagg
121 agccagggga aagaccgata ttctggtacg cacctgaatc cttgacggag agcaagtttt
181 ctgtggcctc agatgtctgg tcctttggaa ttcc
//
LOCUS MUSPTKD 217 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD19.
ACCESSION M33424
KEYWORDS protein-tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD19.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 217)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 217 protein-tyrosine kinase (AA at 3) (EC
2.7.1.112)
BASE COUNT 64 a 44 c 57 g 52 t
ORIGIN
1 ggatccacag agacttagct gcaagaaact gcatgttgga tgaaaaattc actgtcaagg
61 ttgctgattt cggtcttgcc agagacatgt acgataaaga gtactatagt gtccacaaca
121 agacgggtgc caagctacca gtgaagtgga tggctttaga gagtctgcaa aggcagaagt
181 tcaccaccac gtcagatgtg tggtcctttg gaattcc
//
LOCUS MUSPTKE 214 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD22.
ACCESSION M33425 M22447 J04523
KEYWORDS protein-tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD22.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 214)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
REFERENCE 2 (sites)
AUTHORS Wilks,A.F.
TITLE Two putative protein-tyrosine kinases identified by application of
the polymerase chain reaction
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1603-1607 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [2] kindly submitted by
A.Wilks, 08-FEB-1989, for release after publication.
FEATURES from to/span description
pept < 1 > 214 protein-tyrosine kinase (AA at 3) (EC
2.7.1.112)
BASE COUNT 59 a 46 c 55 g 54 t
ORIGIN
1 ggatccaccg ggacttagca gcaagaaatg tccttgttga gagtgagcat caagtgaaga
61 tcggagactt tggtttaacc aaagcaattg aaaccgataa ggagtactac acagtcaagg
121 acgaccggga cagcccagtg ttctggtacg ctccggagtg tttaatccag tgtaaatttt
181 atatcgcctc tgacgtctgg tcctttggaa ttcc
//
LOCUS MUSPTKF 208 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD175.
ACCESSION M33426
KEYWORDS protein-tyrosine kinase.
SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD175.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 208)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 208 protein-tyrosine kinase (AA at 3) (EC
2.7.1.112)
BASE COUNT 49 a 53 c 55 g 51 t
ORIGIN
1 ggatccaccg tgatctgcga gctgctaacg tcctggtctc tgagtcactc atgtgcaaga
61 ttgcagactt tggcctcgcg agagtcatcg aagataacga gtacacagca agggaaggtg
121 cgaagttccc tatcaagtgg acagctccag aggcgttcaa cttcggctgc ttcactatca
181 aatctgacgt gtggtccttt ggaattcc
//
LOCUS MUSPTKG 208 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone W3.13.
ACCESSION M33427
KEYWORDS protein-tyrosine kinase.
SOURCE Mouse haemopoietic cell line WEH1-3B D+, cDNA to mRNA, clone W3.13.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 208)
AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
TITLE The application of the polymerase chain reaction to cloning members
of the protein tyrosine kinase family
JOURNAL Gene 85, 67-74 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 208 protein-tyrosine kinase (AA at 3) (EC
2.7.1.112)
BASE COUNT 53 a 44 c 56 g 55 t
ORIGIN
1 ggatccacag agacctggct gccagaaatt gtctagtgaa tgaagcagga gttgtcaaag
61 tatctgattt tggaatggcc aggtacgttc tggatgatca gtacacaagt tcttctggcg
121 ccaagttccc tgtgaagtgg tgtcccccag aagagtttaa ttacagccgc tttagcagca
181 agtcagacgt gtggtcctat ggaattcc
//
LOCUS RATCROS1A 7839 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Rat lung-derived c-ros-1 proto-oncogene mRNA, complete cds.
ACCESSION M35104
KEYWORDS c-ros-1 proto-oncogene; tyrosine kinase.
SOURCE Rat (strain Fischer) lung, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 7839)
AUTHORS Matsushime,H. and Shibuya,M.
TITLE Tissue-specific expression of rat c-ros-1 gene and partial
structural similarity of its predicted products with sev protein of
Drosophila melanogaster
JOURNAL J. Virol. 64, 2117-2125 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 402 7355 c-ros-1 tyrosine kinase (put.)
mRNA < 1 7839 c-ros-1 mRNA
BASE COUNT 2159 a 1760 c 1887 g 2033 t
ORIGIN
1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag
61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg
121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag
181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc
241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc
301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac
361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc
421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg
481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca
541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta
601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg
661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca
721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac
781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac
841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc
901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc
961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc
1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac
1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa
1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca
1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta
1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa
1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt
1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt
1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg
1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt
1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct
1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt
1681 cctacaatgg ggacaccaaa gctgtgcgta ttgtggagag tggcacatta aaggacttcg
1741 cagtaaagcc gcagtccaag cgaatcattt acttcaatgg caccatgcaa gtcttcatgt
1801 cgacatttct ggatggctcg gcattccaca gggttctgcc gtgggtcccc cttgcggatg
1861 tgaagagctt tgcttgtgaa aacaatgact tcctcatcac agatggcaag gccattttcc
1921 aacaggactc tctgtctttc aatgagttca tcgtgggatg tgacctgagt cacatagaag
1981 aatttgggtt tggtaacttg gtcatctttg gctcctccgt ccagtcgtac cctctgccag
2041 gccatccaca ggaggtctcg gtgctgtttg gttctcgaga ggcccttatt cagtggaagc
2101 ctccgattct cgccatagga gccagtcctt ccgcctggca gaactggact tatgaggtca
2161 aagtttcctc ccaggacatt ctggaaacca ctcaagtttt cttgaacata agcaggactg
2221 tgctgaatgt acccaagctg caaagttcta caaagtacat ggtgtctgtg cgagcaagtt
2281 ctcctaaagg cccaggccca tggtcagaac cctcagtggg tactaccttg gtaccagcca
2341 ctgagccacc gttcatcatg gctgtgaaag aagatgggct ttggagcaaa ccactcagta
2401 gttttggccc aggagagttc ctatcctctg acgtaggaaa cgtgtcagat atggattggt
2461 ataacaacag cctctactac agtgacacaa aaggcaatgt gtatgtgcgg cctctgaatg
2521 ggatggatat ctcggagaat taccacatat ccagcattgc aggagcttgt gccttggcct
2581 ttgaatggct gggtcacttt ctctactggg ctgggaagac atatgtgatt caaaggcagt
2641 ctgtgttaac gggacacaca gacattgtga ctcacgtgaa gctgttggtg aatgacatgg
2701 ccgtggatcc agttggtggc tatctgtact ggacgacgct ctactcggtt gaaagcacca
2761 gactcaatgg agaaagttct cttgtactac aggctcagcc ctggctctct ggaaaaaagg
2821 ttattgctct aacattagac ctcagcgatg ggctcctgta ctggctggtg caggacaatc
2881 agtgtattca cctgtacacg gctgttctcc ggggatggag tggtgcggat gctaccatca
2941 ccgagtttgc agcctggagt acttctgaaa tttcccagaa tgcactgatg tactacagcg
3001 gtagactctt ctggatcaat ggctttagga tcatcacagc acaggaaata ggtcagagaa
3061 ccagcgtgtc tgtttctgag ccagggaaat tcaatcagtt tacgatcata cagacatccc
3121 tcaagcctct gccagggaac ttttcctcta ctcccacggt tatcccagat tctgttcagg
3181 agtcctcatt tcgaattgaa ggacacactt caagtttccg aatcctgtgg aatgagcccc
3241 ctgcggtgga ctggggcata gttttctaca gtgtggaatt tagtgctcat tctaagttcc
3301 tggctattga acaacagtct ttacctgttt ttactgtgga aggactggag ccctatgcct
3361 tatttaatct ttctgtcact ccttatacct attggggaaa aggtcaaaaa acatctctat
3421 catttcgagc gcctgaatca gttccgtcag caccagagaa ccccagaata tttatattgt
3481 cacttggaag atacaccagg aagaatgaag tcgtggtaga gtttaggtgg aataaaccta
3541 agcatgaaaa tggagtgcta accaaatctg aaatcttcta ccacatatct aaacaaagtg
3601 gcacaaataa atcaacggaa gactgggtat ctgtcagcgt tacaccgccg gtgatgtctt
3661 ttcaacttga agccatgagt cctgggtata ttgtttcctt ccaggttcga gtcttcacct
3721 ccaaagggcc aggaccattt tctgatatag tgatgtctaa aacatcagaa atcaagccat
3781 gtccatatct catatctctt cttggcaata agattgagtt cttagacatg gaccaaaatc
3841 aagttgtgtg gacattttcc ctggagggag ccgtcagcac agtggggtac acagcggatg
3901 atgaaatggg gtatttcgct caaggagatg cactcttcct tctgaatttg cacaatcatt
3961 ccagctccaa gcttttccag gacgtgctgg cttctgacat tgcggttatt gctgttgact
4021 ggatcgcaag gcacctctac tttgctctga aagcatcgca agatggaaca cagatattcg
4081 atgttgacct tgaacacaag gtgaaatccc ccagggaggt gaagatttgc aaaagccata
4141 cagcaataat ttctttctct atgtatcccc tcttaagtcg cctgtattgg acagaagttt
4201 cagatctggg ctaccagatg ttctactgca atattagcag tcacaccttg catcacgttc
4261 tacaacccaa ggcctcaaac cagcatggaa ggagacagtg ttcttgtaat gtgacagaat
4321 ccgagttaag tggggcaatg actgtggaca cgtctgatcc agacagacct tggatatact
4381 ttaccaaaca gcaagagatc tgggccatgg atctggaagg atgtcagtgt tggaaagtca
4441 tcatggtacc tgctacccct ggaaaaagaa tcattagttt aacagtggat ggggagttta
4501 tatattggat cacaacaatg aaggacgaca cagaaattta tcaagcaaag aagggaagtg
4561 gggccatcct ctcccaggtg aaggccccca ggagtaagca tatcttggct tacagttcag
4621 ctctgcaacc ttttccagat aaagcatatc tgtctgtagc ttccaatatg gtagaagcaa
4681 gtatattgaa tgccaccaac accagcctca ttctcaagtt acctccagtc aagacaaacc
4741 tcacgtggca tggaattacc actcccacgt caacatacct ggtttactat atggaggcta
4801 atagggcaaa cagctctgac aggaaacaca acatgttgga atcacaggag aatgtagccc
4861 ggattgaagg tctgcagcca ttttcaacat acgtgattca gatagctgtg aagaactatt
4921 attctgatcc tttagaacat ctctctctgg gaaaagagat tcaaggaaaa actaaaagtg
4981 gagtgcccgg ggcagtttgt catatcaatg caactgtgct gtcggacacc agtcttcttg
5041 tattctggac agaatcgcat aaaccaaacg gacccaaaga gttagtccgc tatcagttgg
5101 ttatgtcata cctggctccg attcctgaga ctcctctaag acaggacgaa tttccaagcg
5161 ccaggctttc tctacttgtc actaaactct ctggtggaca acaatatgtg ctgaagatcc
5221 ttgcctgcca ctcagaggaa atgtggtgta ctgagagtca tcctgtcagt gtcaacatgt
5281 ttgacacacc ggagaaacct tctgccttgg ttccagagaa cactagtctg ctgttggatt
5341 ggaaggctcc gtctaacgct aacctcacca gattttggtt tgaactccag aagtggaagt
5401 atagtgagtt ttaccatgtc aaggcttcat gcagccaagg tccagtttat gtctgtaaca
5461 tcgcaaatct gcagccttac actccttata acatccgagt ggtggtggtc tatacgacag
5521 gagaaaatag ctcctcgatt cccgagagct tcaagacaaa agctggagtc ccaagcaaac
5581 cagggattcc taagttacta gaagggagta aaaattcaat ccagtgggaa aaagccgaag
5641 ataacgggaa cagattgatg tactacaccc tggaggtcag aaaaagcatt tcaaatgact
5701 cacgggacca gagtttaagg tggacggcgg tgtttaatgg gtcctgcagt agcatttgca
5761 catggaggtc aaaaaaccta aaaggaactt tccagttcag agcagtagcg tcaaatgcta
5821 ttggatttgg agaatacagt gaaatcagtg aagatattac attagtggaa gatggttttt
5881 ggataacaga aacaagtttt atacttacta tcatagttgg gatatttctg gttgccacag
5941 tcccactgac ctttgtctgg catagaagct tgaaaaacca caaagctacc aaggaaggcc
6001 tctcagttct caacgacaat gaccaagagt tggctgagct tcgaggactg gcggctggag
6061 tgggcctggc caatgcctgc tatgcagtac atactcttcc aacccaagag gagattgaaa
6121 gtcttcccgc cttccctcgg gagaagctga gcctgcgcct tctgttggga agtggagctt
6181 ttggagaagt gtacgagggc acagctgtag acatcctagg acggggaagt ggagaaatca
6241 aggtggccgt gaagaccctg aagaaaggtt cgacagacca ggagaagatc gagttcctga
6301 aggaggcaca cctgatgagc aagtttaatc accccaacat tctgaagcag ctgggagtct
6361 gtctgctgag tgaaccccag tacattatcc tggaactgat ggaaggggga gaccttctaa
6421 gctatctgcg caaagcccga gggacaacgt tgtctggccc tttactcaca ttggctgacc
6481 tggtagagct gtgtgtagat atttcaaaag gctgcgtcta cttggagcag atgcacttca
6541 ttcacaggga tctggcagct cggaattgcc ttgtgtctgt gaaagactat accagtcctc
6601 gggtagtcaa gatcggtgac tttggtttgg caagggaaat ctataagcat gattattata
6661 gaaagagagg ggaaggcctg cttcctgtcc ggtggatggc tcctgaaaac ttgatggatg
6721 gaatcttcac ttcccagtct gatgtatggt cttttggaat tttggtttgg gagattttaa
6781 ctcttggtca tcaaccttat ccagcgcatt ccaaccttga tgttttaaac tatgtgcaag
6841 caggagggag actggagcca ccgagaaact gtcctgatga tctgtggaat ttaatgttcc
6901 gatgttgggc ccaagaacct gaccaaagac ccactttcta taacattcaa gaccagcttc
6961 agttattcag aaatgtttcc ttaaacaatg tttctcactg tggacaagca gctcctgctg
7021 gtggagtcat caacaaaggc tttgaaggtg aagacaatga aatggccact ttgaattcag
7081 atgacacgat gccagttgcc ttgatggaaa ccaggaacca agaaggatta aattatatgg
7141 tacttgccac aaagtgtagc caaagtgagg atcgttatga gggtcctcta ggctctaagg
7201 aatctgggtt gcatgatctg aagaaagacg agaggcaacc agcagacaaa gatttctgcc
7261 agcaaccaca ggtggcttat ggctctcctg gccactctga aggcctgaac tatgcctgtc
7321 ttgctcacag tggacatgga gatgtgtctg aataatagta tctcatagga aacatagcac
7381 tgagatgaac actgtattaa gttaaaaaga agaaaggtgg ggtggcagtc tagactctga
7441 actgacacag ccaagttcca aagttctgat cttggttcca agagccatta tgtttcattc
7501 agcattctct ttaccagtga cgtaaccttc agtggattat cagaggaacc tgtgtgtgtg
7561 cggaaatccc aggacaaatc ctaagtctgg gaagaaaaca tcactgtctc tctcctctga
7621 agccctttac ttcagagcat tgcctgccct ggcaatctta ctaggttcat gcaaggatgt
7681 gagtggggga ggggccggag tctgctgagg accacctgaa ctacagatta ccttaagagg
7741 atgcaggaaa caattactca cacaggagga agcagcctgt ggaccatgag gaatcatctg
7801 gcacgctatt attccaataa aatattccct ttaatcatc
//
LOCUS RATCROS1B 8010 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Rat lung-derived L01 c-ros-1 proto-oncogene mRNA, complete cds.
ACCESSION M35105
KEYWORDS c-ros-1 proto-oncogene; tyrosine kinase.
SOURCE Rat (strain Fischer) lung, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 8010)
AUTHORS Matsushime,H. and Shibuya,M.
TITLE Tissue-specific expression of rat c-ros-1 gene and partial
structural similarity of its predicted products with sev protein of
Drosophila melanogaster
JOURNAL J. Virol. 64, 2117-2125 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 402 5966 c-ros-1 unknown protein
mRNA < 1 8010 c-ros-1 mRNA
BASE COUNT 2197 a 1812 c 1930 g 2071 t
ORIGIN
1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag
61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg
121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag
181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc
241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc
301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac
361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc
421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg
481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca
541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta
601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg
661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca
721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac
781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac
841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc
901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc
961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc
1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac
1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa
1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca
1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta
1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa
1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt
1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt
1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg
1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt
1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct
1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt
1681 cctacaatgg ggacaccaaa gctgtgcgta ttgtggagag tggcacatta aaggacttcg
1741 cagtaaagcc gcagtccaag cgaatcattt acttcaatgg caccatgcaa gtcttcatgt
1801 cgacatttct ggatggctcg gcattccaca gggttctgcc gtgggtcccc cttgcggatg
1861 tgaagagctt tgcttgtgaa aacaatgact tcctcatcac agatggcaag gccattttcc
1921 aacaggactc tctgtctttc aatgagttca tcgtgggatg tgacctgagt cacatagaag
1981 aatttgggtt tggtaacttg gtcatctttg gctcctccgt ccagtcgtac cctctgccag
2041 gccatccaca ggaggtctcg gtgctgtttg gttctcgaga ggcccttatt cagtggaagc
2101 ctccgattct cgccatagga gccagtcctt ccgcctggca gaactggact tatgaggtca
2161 aagtttcctc ccaggacatt ctggaaacca ctcaagtttt cttgaacata agcaggactg
2221 tgctgaatgt acccaagctg caaagttcta caaagtacat ggtgtctgtg cgagcaagtt
2281 ctcctaaagg cccaggccca tggtcagaac cctcagtggg tactaccttg gtaccagcca
2341 ctgagccacc gttcatcatg gctgtgaaag aagatgggct ttggagcaaa ccactcagta
2401 gttttggccc aggagagttc ctatcctctg acgtaggaaa cgtgtcagat atggattggt
2461 ataacaacag cctctactac agtgacacaa aaggcaatgt gtatgtgcgg cctctgaatg
2521 ggatggatat ctcggagaat taccacatat ccagcattgc aggagcttgt gccttggcct
2581 ttgaatggct gggtcacttt ctctactggg ctgggaagac atatgtgatt caaaggcagt
2641 ctgtgttaac gggacacaca gacattgtga ctcacgtgaa gctgttggtg aatgacatgg
2701 ccgtggatcc agttggtggc tatctgtact ggacgacgct ctactcggtt gaaagcacca
2761 gactcaatgg agaaagttct cttgtactac aggctcagcc ctggctctct ggaaaaaagg
2821 ttattgctct aacattagac ctcagcgatg ggctcctgta ctggctggtg caggacaatc
2881 agtgtattca cctgtacacg gctgttctcc ggggatggag tggtgcggat gctaccatca
2941 ccgagtttgc agcctggagt acttctgaaa tttcccagaa tgcactgatg tactacagcg
3001 gtagactctt ctggatcaat ggctttagga tcatcacagc acaggaaata ggtcagagaa
3061 ccagcgtgtc tgtttctgag ccagggaaat tcaatcagtt tacgatcata cagacatccc
3121 tcaagcctct gccagggaac ttttcctcta ctcccacggt tatcccagat tctgttcagg
3181 agtcctcatt tcgaattgaa ggacacactt caagtttccg aatcctgtgg aatgagcccc
3241 ctgcggtgga ctggggcata gttttctaca gtgtggaatt tagtgctcat tctaagttcc
3301 tggctattga acaacagtct ttacctgttt ttactgtgga aggactggag ccctatgcct
3361 tatttaatct ttctgtcact ccttatacct attggggaaa aggtcaaaaa acatctctat
3421 catttcgagc gcctgaatca gttccgtcag caccagagaa ccccagaata tttatattgt
3481 cacttggaag atacaccagg aagaatgaag tcgtggtaga gtttaggtgg aataaaccta
3541 agcatgaaaa tggagtgcta accaaatctg aaatcttcta ccacatatct aaacaaagtg
3601 gcacaaataa atcaacggaa gactgggtat ctgtcagcgt tacaccgccg gtgatgtctt
3661 ttcaacttga agccatgagt cctgggtata ttgtttcctt ccaggttcga gtcttcacct
3721 ccaaagggcc aggaccattt tctgatatag tgatgtctaa aacatcagaa atcaagccat
3781 gtccatatct catatctctt cttggcaata agattgagtt cttagacatg gaccaaaatc
3841 aagttgtgtg gacattttcc ctggagggag ccgtcagcac agtggggtac acagcggatg
3901 atgaaatggg gtatttcgct caaggagatg cactcttcct tctgaatttg cacaatcatt
3961 ccagctccaa gcttttccag gacgtgctgg cttctgacat tgcggttatt gctgttgact
4021 ggatcgcaag gcacctctac tttgctctga aagcatcgca agatggaaca cagatattcg
4081 atgttgacct tgaacacaag gtgaaatccc ccagggaggt gaagatttgc aaaagccata
4141 cagcaataat ttctttctct atgtatcccc tcttaagtcg cctgtattgg acagaagttt
4201 cagatctggg ctaccagatg ttctactgca atattagcag tcacaccttg catcacgttc
4261 tacaacccaa ggcctcaaac cagcatggaa ggagacagtg ttcttgtaat gtgacagaat
4321 ccgagttaag tggggcaatg actgtggaca cgtctgatcc agacagacct tggatatact
4381 ttaccaaaca gcaagagatc tgggccatgg atctggaagg atgtcagtgt tggaaagtca
4441 tcatggtacc tgctacccct ggaaaaagaa tcattagttt aacagtggat ggggagttta
4501 tatattggat cacaacaatg aaggacgaca cagaaattta tcaagcaaag aagggaagtg
4561 gggccatcct ctcccaggtg aaggccccca ggagtaagca tatcttggct tacagttcag
4621 ctctgcaacc ttttccagat aaagcatatc tgtctgtagc ttccaatatg gtagaagcaa
4681 gtatattgaa tgccaccaac accagcctca ttctcaagtt acctccagtc aagacaaacc
4741 tcacgtggca tggaattacc actcccacgt caacatacct ggtttactat atggaggcta
4801 atagggcaaa cagctctgac aggaaacaca acatgttgga atcacaggag aatgtagccc
4861 ggattgaagg tctgcagcca ttttcaacat acgtgattca gatagctgtg aagaactatt
4921 attctgatcc tttagaacat ctctctctgg gaaaagagat tcaaggaaaa actaaaagtg
4981 gagtgcccgg ggcagtttgt catatcaatg caactgtgct gtcggacacc agtcttcttg
5041 tattctggac agaatcgcat aaaccaaacg gacccaaaga gttagtccgc tatcagttgg
5101 ttatgtcata cctggctccg attcctgaga ctcctctaag acaggacgaa tttccaagcg
5161 ccaggctttc tctacttgtc actaaactct ctggtggaca acaatatgtg ctgaagatcc
5221 ttgcctgcca ctcagaggaa atgtggtgta ctgagagtca tcctgtcagt gtcaacatgt
5281 ttgacacacc ggagaaacct tctgccttgg ttccagagaa cactagtctg ctgttggatt
5341 ggaaggctcc gtctaacgct aacctcacca gattttggtt tgaactccag aagtggaagt
5401 atagtgagtt ttaccatgtc aaggcttcat gcagccaagg tccagtttat gtctgtaaca
5461 tcgcaaatct gcagccttac actccttata acatccgagt ggtggtggtc tatacgacag
5521 gagaaaatag ctcctcgatt cccgagagct tcaagacaaa agctggagtc ccaagcaaac
5581 cagggattcc taagttacta gaagggagta aaaattcaat ccagtgggaa aaagccgaag
5641 ataacgggaa cagattgatg tactacaccc tggaggtcag aaaaagcatt tcaaatgact
5701 cacgggacca gagtttaagg tggacggcgg tgtttaatgg gtcctgcagt agcatttgca
5761 catggaggtc aaaaaaccta aaaggaactt tccagttcag agcagtagcg tcaaatgcta
5821 ttggatttgg agaatacagt gaaatcagtg aagatattac attagtggaa gatggttttt
5881 ggataacaga aacaagtttt atacttacta tcatagttgg gatatttctg gttgccacag
5941 tcccactgac ctttgcctgt cactgaagct ggggctcaca gatcagctag gccggctggc
6001 caacagatcc ccgagatctg cctgcctctg acctctacct ccaacactgg ggctacagat
6061 gtgtgctaca ttctcagtat ttaactgggt gctgaggaac caagcacagg tcctcatgct
6121 cgtaagtctg gcatagaagc ttgaaaaacc acaaagctac caaggaaggc ctctcagttc
6181 tcaacgacaa tgaccaagag ttggctgagc ttcgaggact ggcggctgga gtgggcctgg
6241 ccaatgcctg ctatgcagta catactcttc caacccaaga ggagattgaa agtcttcccg
6301 ccttccctcg ggagaagctg agcctgcgcc ttctgttggg aagtggagct tttggagaag
6361 tgtacgaggg cacagctgta gacatcctag gacggggaag tggagaaatc aaggtggccg
6421 tgaagaccct gaagaaaggt tcgacagacc aggagaagat cgagttcctg aaggaggcac
6481 acctgatgag caagtttaat caccccaaca ttctgaagca gctgggagtc tgtctgctga
6541 gtgaacccca gtacattatc ctggaactga tggaaggggg agaccttcta agctatctgc
6601 gcaaagcccg agggacaacg ttgtctggcc ctttactcac attggctgac ctggtagagc
6661 tgtgtgtaga tatttcaaaa ggctgcgtct acttggagca gatgcacttc attcacaggg
6721 atctggcagc tcggaattgc cttgtgtctg tgaaagacta taccagtcct cgggtagtca
6781 agatcggtga ctttggtttg gcaagggaaa tctataagca tgattattat agaaagagag
6841 gggaaggcct gcttcctgtc cggtggatgg ctcctgaaaa cttgatggat ggaatcttca
6901 cttcccagtc tgatgtatgg tcttttggaa ttttggtttg ggagatttta actcttggtc
6961 atcaacctta tccagcgcat tccaaccttg atgttttaaa ctatgtgcaa gcaggaggga
7021 gactggagcc accgagaaac tgtcctgatg atctgtggaa tttaatgttc cgatgttggg
7081 cccaagaacc tgaccaaaga cccactttct ataacattca agaccagctt cagttattca
7141 gaaatgtttc cttaaacaat gtttctcact gtggacaagc agctcctgct ggtggagtca
7201 tcaacaaagg ctttgaaggt gaagacaatg aaatggccac tttgaattca gatgacacga
7261 tgccagttgc cttgatggaa accaggaacc aagaaggatt aaattatatg gtacttgcca
7321 caaagtgtag ccaaagtgag gatcgttatg agggtcctct aggctctaag gaatctgggt
7381 tgcatgatct gaagaaagac gagaggcaac cagcagacaa agatttctgc cagcaaccac
7441 aggtggctta tggctctcct ggccactctg aaggcctgaa ctatgcctgt cttgctcaca
7501 gtggacatgg agatgtgtct gaataatagt atctcatagg aaacatagca ctgagatgaa
7561 cactgtatta agttaaaaag aagaaaggtg gggtggcagt ctagactctg aactgacaca
7621 gccaagttcc aaagttctga tcttggttcc aagagccatt atgtttcatt cagcattctc
7681 tttaccagtg acgtaacctt cagtggatta tcagaggaac ctgtgtgtgt gcggaaatcc
7741 caggacaaat cctaagtctg ggaagaaaac atcactgtct ctctcctctg aagcccttta
7801 cttcagagca ttgcctgccc tggcaatctt actaggttca tgcaaggatg tgagtggggg
7861 aggggccgga gtctgctgag gaccacctga actacagatt accttaagag gatgcaggaa
7921 acaattactc acacaggagg aagcagcctg tggaccatga ggaatcatct ggcacgctat
7981 tattccaata aaatattccc tttaatcatc
//
LOCUS RATCROS1C 7902 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Rat heart-derived c-ros-1 proto-oncogene mRNA, complete cds.
ACCESSION M35106
KEYWORDS c-ros-1 proto-oncogene; tyrosine kinase.
SOURCE Rat (strain Fischer) heart, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 7902)
AUTHORS Matsushime,H. and Shibuya,M.
TITLE Tissue-specific expression of rat c-ros-1 gene and partial
structural similarity of its predicted products with sev protein of
Drosophila melanogaster
JOURNAL J. Virol. 64, 2117-2125 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 402 7418 c-ros-1 tyrosine kinase (put.)
mRNA < 1 7902 c-ros-1 mRNA
BASE COUNT 2171 a 1775 c 1899 g 2057 t
ORIGIN
1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag
61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg
121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag
181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc
241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc
301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac
361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc
421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg
481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca
541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta
601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg
661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca
721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac
781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac
841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc
901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc
961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc
1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac
1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa
1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca
1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta
1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa
1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt
1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt
1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg
1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt
1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct
1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt
1681 cctacaatgg gtatgtcttt tatctcctaa gagacggcat ttatagagtc catcttcctt
1741 tgccgtctgt cagggacacc aaagctgtgc gtattgtgga gagtggcaca ttaaaggact
1801 tcgcagtaaa gccgcagtcc aagcgaatca tttacttcaa tggcaccatg caagtcttca
1861 tgtcgacatt tctggatggc tcggcattcc acagggttct gccgtgggtc ccccttgcgg
1921 atgtgaagag ctttgcttgt gaaaacaatg acttcctcat cacagatggc aaggccattt
1981 tccaacagga ctctctgtct ttcaatgagt tcatcgtggg atgtgacctg agtcacatag
2041 aagaatttgg gtttggtaac ttggtcatct ttggctcctc cgtccagtcg taccctctgc
2101 caggccatcc acaggaggtc tcggtgctgt ttggttctcg agaggccctt attcagtgga
2161 agcctccgat tctcgccata ggagccagtc cttccgcctg gcagaactgg acttatgagg
2221 tcaaagtttc ctcccaggac attctggaaa ccactcaagt tttcttgaac ataagcagga
2281 ctgtgctgaa tgtacccaag ctgcaaagtt ctacaaagta catggtgtct gtgcgagcaa
2341 gttctcctaa aggcccaggc ccatggtcag aaccctcagt gggtactacc ttggtaccag
2401 ccactgagcc accgttcatc atggctgtga aagaagatgg gctttggagc aaaccactca
2461 gtagttttgg cccaggagag ttcctatcct ctgacgtagg aaacgtgtca gatatggatt
2521 ggtataacaa cagcctctac tacagtgaca caaaaggcaa tgtgtatgtg cggcctctga
2581 atgggatgga tatctcggag aattaccaca tatccagcat tgcaggagct tgtgccttgg
2641 cctttgaatg gctgggtcac tttctctact gggctgggaa gacatatgtg attcaaaggc
2701 agtctgtgtt aacgggacac acagacattg tgactcacgt gaagctgttg gtgaatgaca
2761 tggccgtgga tccagttggt ggctatctgt actggacgac gctctactcg gttgaaagca
2821 ccagactcaa tggagaaagt tctcttgtac tacaggctca gccctggctc tctggaaaaa
2881 aggttattgc tctaacatta gacctcagcg atgggctcct gtactggctg gtgcaggaca
2941 atcagtgtat tcacctgtac acggctgttc tccggggatg gagtggtgcg gatgctacca
3001 tcaccgagtt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactaca
3061 gcggtagact cttctggatc aatggcttta ggatcatcac agcacaggaa ataggtcaga
3121 gaaccagcgt gtctgtttct gagccaggga aattcaatca gtttacgatc atacagacat
3181 ccctcaagcc tctgccaggg aacttttcct ctactcccac ggttatccca gattctgttc
3241 aggagtcctc atttcgaatt gaaggacaca cttcaagttt ccgaatcctg tggaatgagc
3301 cccctgcggt ggactggggc atagttttct acagtgtgga atttagtgct cattctaagt
3361 tcctggctat tgaacaacag tctttacctg tttttactgt ggaaggactg gagccctatg
3421 ccttatttaa tctttctgtc actccttata cctattgggg aaaaggtcaa aaaacatctc
3481 tatcatttcg agcgcctgaa tcagttccgt cagcaccaga gaaccccaga atatttatat
3541 tgtcacttgg aagatacacc aggaagaatg aagtcgtggt agagtttagg tggaataaac
3601 ctaagcatga aaatggagtg ctaaccaaat ctgaaatctt ctaccacata tctaaacaaa
3661 gtggcacaaa taaatcaacg gaagactggg tatctgtcag cgttacaccg ccggtgatgt
3721 cttttcaact tgaagccatg agtcctgggt atattgtttc cttccaggtt cgagtcttca
3781 cctccaaagg gccaggacca ttttctgata tagtgatgtc taaaacatca gaaatcaagc
3841 catgtccata tctcatatct cttcttggca ataagattga gttcttagac atggaccaaa
3901 atcaagttgt gtggacattt tccctggagg gagccgtcag cacagtgggg tacacagcgg
3961 atgatgaaat ggggtatttc gctcaaggag atgcactctt ccttctgaat ttgcacaatc
4021 attccagctc caagcttttc caggacgtgc tggcttctga cattgcggtt attgctgttg
4081 actggatcgc aaggcacctc tactttgctc tgaaagcatc gcaagatgga acacagatat
4141 tcgatgttga ccttgaacac aaggtgaaat cccccaggga ggtgaagatt tgcaaaagcc
4201 atacagcaat aatttctttc tctatgtatc ccctcttaag tcgcctgtat tggacagaag
4261 tttcagatct gggctaccag atgttctact gcaatattag cagtcacacc ttgcatcacg
4321 ttctacaacc caaggcctca aaccagcatg gaaggagaca gtgttcttgt aatgtgacag
4381 aatccgagtt aagtggggca atgactgtgg acacgtctga tccagacaga ccttggatat
4441 actttaccaa acagcaagag atctgggcca tggatctgga aggatgtcag tgttggaaag
4501 tcatcatggt acctgctacc cctggaaaaa gaatcattag tttaacagtg gatggggagt
4561 ttatatattg gatcacaaca atgaaggacg acacagaaat ttatcaagca aagaagggaa
4621 gtggggccat cctctcccag gtgaaggccc ccaggagtaa gcatatcttg gcttacagtt
4681 cagctctgca accttttcca gataaagcat atctgtctgt agcttccaat atggtagaag
4741 caagtatatt gaatgccacc aacaccagcc tcattctcaa gttacctcca gtcaagacaa
4801 acctcacgtg gcatggaatt accactccca cgtcaacata cctggtttac tatatggagg
4861 ctaatagggc aaacagctct gacaggaaac acaacatgtt ggaatcacag gagaatgtag
4921 cccggattga aggtctgcag ccattttcaa catacgtgat tcagatagct gtgaagaact
4981 attattctga tcctttagaa catctctctc tgggaaaaga gattcaagga aaaactaaaa
5041 gtggagtgcc cggggcagtt tgtcatatca atgcaactgt gctgtcggac accagtcttc
5101 ttgtattctg gacagaatcg cataaaccaa acggacccaa agagttagtc cgctatcagt
5161 tggttatgtc atacctggct ccgattcctg agactcctct aagacaggac gaatttccaa
5221 gcgccaggct ttctctactt gtcactaaac tctctggtgg acaacaatat gtgctgaaga
5281 tccttgcctg ccactcagag gaaatgtggt gtactgagag tcatcctgtc agtgtcaaca
5341 tgtttgacac accggagaaa ccttctgcct tggttccaga gaacactagt ctgctgttgg
5401 attggaaggc tccgtctaac gctaacctca ccagattttg gtttgaactc cagaagtgga
5461 agtatagtga gttttaccat gtcaaggctt catgcagcca aggtccagtt tatgtctgta
5521 acatcgcaaa tctgcagcct tacactcctt ataacatccg agtggtggtg gtctatacga
5581 caggagaaaa tagctcctcg attcccgaga gcttcaagac aaaagctgga gtcccaagca
5641 aaccagggat tcctaagtta ctagaaggga gtaaaaattc aatccagtgg gaaaaagccg
5701 aagataacgg gaacagattg atgtactaca ccctggaggt cagaaaaagc atttcaaatg
5761 actcacggga ccagagttta aggtggacgg cggtgtttaa tgggtcctgc agtagcattt
5821 gcacatggag gtcaaaaaac ctaaaaggaa ctttccagtt cagagcagta gcgtcaaatg
5881 ctattggatt tggagaatac agtgaaatca gtgaagatat tacattagtg gaagatggtt
5941 tttggataac agaaacaagt tttatactta ctatcatagt tgggatattt ctggttgcca
6001 cagtcccact gacctttgtc tggcatagaa gcttgaaaaa ccacaaagct accaaggaag
6061 gcctctcagt tctcaacgac aatgaccaag agttggctga gcttcgagga ctggcggctg
6121 gagtgggcct ggccaatgcc tgctatgcag tacatactct tccaacccaa gaggagattg
6181 aaagtcttcc cgccttccct cgggagaagc tgagcctgcg ccttctgttg ggaagtggag
6241 cttttggaga agtgtacgag ggcacagctg tagacatcct aggacgggga agtggagaaa
6301 tcaaggtggc cgtgaagacc ctgaagaaag gttcgacaga ccaggagaag atcgagttcc
6361 tgaaggaggc acacctgatg agcaagttta atcaccccaa cattctgaag cagctgggag
6421 tctgtctgct gagtgaaccc cagtacatta tcctggaact gatggaaggg ggagaccttc
6481 taagctatct gcgcaaagcc cgagggacaa cgttgtctgg ccctttactc acattggctg
6541 acctggtaga gctgtgtgta gatatttcaa aaggctgcgt ctacttggag cagatgcact
6601 tcattcacag ggatctggca gctcggaatt gccttgtgtc tgtgaaagac tataccagtc
6661 ctcgggtagt caagatcggt gactttggtt tggcaaggga aatctataag catgattatt
6721 atagaaagag aggggaaggc ctgcttcctg tccggtggat ggctcctgaa aacttgatgg
6781 atggaatctt cacttcccag tctgatgtat ggtcttttgg aattttggtt tgggagattt
6841 taactcttgg tcatcaacct tatccagcgc attccaacct tgatgtttta aactatgtgc
6901 aagcaggagg gagactggag ccaccgagaa actgtcctga tgatctgtgg aatttaatgt
6961 tccgatgttg ggcccaagaa cctgaccaaa gacccacttt ctataacatt caagaccagc
7021 ttcagttatt cagaaatgtt tccttaaaca atgtttctca ctgtggacaa gcagctcctg
7081 ctggtggagt catcaacaaa ggctttgaag gtgaagacaa tgaaatggcc actttgaatt
7141 cagatgacac gatgccagtt gccttgatgg aaaccaggaa ccaagaagga ttaaattata
7201 tggtacttgc cacaaagtgt agccaaagtg aggatcgtta tgagggtcct ctaggctcta
7261 aggaatctgg gttgcatgat ctgaagaaag acgagaggca accagcagac aaagatttct
7321 gccagcaacc acaggtggct tatggctctc ctggccactc tgaaggcctg aactatgcct
7381 gtcttgctca cagtggacat ggagatgtgt ctgaataata gtatctcata ggaaacatag
7441 cactgagatg aacactgtat taagttaaaa agaagaaagg tggggtggca gtctagactc
7501 tgaactgaca cagccaagtt ccaaagttct gatcttggtt ccaagagcca ttatgtttca
7561 ttcagcattc tctttaccag tgacgtaacc ttcagtggat tatcagagga acctgtgtgt
7621 gtgcggaaat cccaggacaa atcctaagtc tgggaagaaa acatcactgt ctctctcctc
7681 tgaagccctt tacttcagag cattgcctgc cctggcaatc ttactaggtt catgcaagga
7741 tgtgagtggg ggaggggccg gagtctgctg aggaccacct gaactacaga ttaccttaag
7801 aggatgcagg aaacaattac tcacacagga ggaagcagcc tgtggaccat gaggaatcat
7861 ctggcacgct attattccaa taaaatattc cctttaatca tc
//
LOCUS HUMFVIIIM 65 bp ds-DNA PRI 25-JUL-1990
DEFINITION Human mutant coagulation factor VIII exon 13 duplication region.
ACCESSION M34731
KEYWORDS coagulation factor VIII.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 65)
AUTHORS Murru,S., Casula,L., Pecorara,M., Mori,P., Cao,A. and Pirastu,M.
TITLE Illegitimate recombination produced a duplication within the FVIII
gene in a patient with mild hemophilia A
JOURNAL Genomics 7, 115-118 (1990)
STANDARD simple staff_review
COMMENT As a result of illegitimate recombination of two misaligned
chromosomes, exon 13 of the factor VIII is duplicated in its
entirety. The exon undergoes normal splicing and its incorporation
into the mRNA generates an unstable protein.
FEATURES from to/span description
recomb 25 26 chromosome DNA end; misaligned chromosome DNA
start
BASE COUNT 26 a 3 c 11 g 25 t
ORIGIN
1 aagttttagg ggtacatgtg cacaattagt ttgaaataat ttaattagtt tgaaataatt
61 taaaa
//
LOCUS EUBBAIA3 2596 bp ds-DNA BCT 25-JUL-1990
DEFINITION Eubacterium sp. baiA3 protein gene, complete cds.
ACCESSION M34658
KEYWORDS .
SOURCE Eubacterium sp. (strain VPI 12708) DNA.
ORGANISM Eubacterium sp.
Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE 1 (bases 135 to 2242)
AUTHORS Gopal-Srivastava,R., Mallonee,D.H., White,W.B. and Hylemon,P.B.
TITLE Multiple copies of a bile acid-inducible gene in Eubacterium sp.
strain VPI 12708
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 134; 2243 to 2596)
AUTHORS Gopal-Srivastava,R., Mallonee,D.H., White,W.B. and Hylemon,P.B.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by D.H.Mallonee, 24-MAY-1990.
FEATURES from to/span description
pept 1165 1914 baiA3 protein gene
BASE COUNT 778 a 521 c 633 g 664 t
ORIGIN
1 tccctgtgct ttcttctgca gttcataaaa tccgccgcca caaatccaag aatccacaat
61 agactcagaa gcaaggcgta ttccagcgca tccattggga tattatacaa atagaatagc
121 aaggcaaata tggccatgat cccggcgtac atgcctattc ccctgatatg atccctgata
181 tatcttcctg tcaatctcat gcctgcacca tatatcctat tccttttttc gttacgatcc
241 atttgcattt atcctgtctg atgcggatta tgcatcgtat accgctggcc ttacagagga
301 taacaaagag gatatggttt tctttaacgt gaaggatgtg atggatactt atccattcgc
361 caaagaactg gaagaagaat atatcgcgca tgccacagat atctcggacc attattttct
421 ttatgatgcc cgcgaagaag aacttgcaaa aaaagcaggg gaaccctaca catattcagg
481 cagggtaggg atgacggcgg acaatccgga acttcttcag gactggaaat atgcgcctgc
541 cttcaaagtt cttacaaaag gggaggttat gcagatgatt gcggtattcg tgatgcttag
601 cgcctacatt gcgataattg ccctggcggc aatcggggtt atgacttatg taagaagcgt
661 taccattgct gtcgataaca ggcagctgtt cgaggatatg aagaagctgg gggccagccg
721 ggattatgag acgcgggtgg taaaagtaca gcttcgcaag atcttcttat atcccggtat
781 cgcaggatgc gggatatccc tggtctttac ggtcctgatg ctctttttta acaatatgcg
841 cctggaaatt gaagaaatca ggctgatcgg aatcgagagc attatgattg gggcatccgc
901 catcttcctg tacgtactgt accggatctc ttttcggaag atgagaagca tgctggatct
961 atagggaaac aaaatagtga tagtgtttgc aaactttttg tccatggact gcttatattt
1021 tgcaattaaa aaagaacttt acaagttgta agatgccgtg tgattttcca atgtcgcgtc
1081 ctgtaaaatg ttaaagttgt atcaatcgat acgatacttt ggcagatatg ataagccaaa
1141 ggaaaagaaa ggaaggaaaa gttcatgaaa cttgtacagg acaaaattac aattatcaca
1201 ggcggaaccc gtggaatcgg attcgcagca gcaaaactct ttattgagaa tggagcaaaa
1261 gtctccatat ttggcgagac ccaggaagag gtagacacag cgctggctca gttaaaggaa
1321 ctctatccgg aggaagaggt attaggattc gctccagacc ttacatcaag agatgctgtt
1381 atggcagcag ttggaacggt tgcacagaag tacggaagac tggatgtcat gatcaacaac
1441 gcaggcatta caatgaattc tgtattctcc agggtatcag aagaggattt caaaaatata
1501 atggacatca atgttaacgg cgtattcaat ggcgcatggt ctgcttatca gtgcatgaaa
1561 gatgcaaagc agggcgttat catcaatacg gcatctgtaa ccggaatcta tggttcctta
1621 tcaggaatcg gatatcctac cagcaaggcg ggcgtaatcg gcctgactca tggtcttgga
1681 agagagatta tccgtaagaa catccgtgta gttggcgttg cacctggcgt tgtagataca
1741 gatatgacga aggggcttcc accggagatc ctggaggact acttgaagac actgccaatg
1801 aagagaatgc ttaagccgga agagatcgcg aatgtatatc tgttccttgc atccgacctg
1861 gctagcggca tcacggctac gacgatcagc gtagatgggg cttacaggcc atagaaaaga
1921 catactgcta ttaattccat agttcatact ccaagaacag gcaggcaaga ggcatttgcg
1981 ttttagcgcg gatgcccggg cctgcctgat ttaattcagc tggtatatca tgaaattcag
2041 atatgcggcg aacaggcacc atataaggta ggggatctgt agataggcgg caacaggact
2101 tatcttgtga aactgatata tcatcagggc tatgaggatg ataagcacga gaagccataa
2161 aaatgcaaag aggtacatgg aaaagccgaa aaagaatatg ctccagagga agttgaagaa
2221 cagctggata aaatatagtc gaagcgcctt attcttttca ggagtttcgg attcatagat
2281 tatataagaa gatatcccca ttaatatata taatatggtc cagacgatgg gaaataggaa
2341 ggacggagga ctaagaggcg gcttattcaa tgccaaatag gccgccgaat tgccgcttaa
2401 gagagcagac aaggatcctg ccgcaagagg aataaggata aaaataatga gagcgctttt
2461 gtttttgatg ttcatatata ccggctccag gcatgacttt caatattata tgaaaaatct
2521 ccgggaaata tgaacggtat ctccggcttt acttgccgct ctttgacttg cccgccgtct
2581 ctttgagcag ttccag
//
LOCUS ECOTRAU 1080 bp ds-DNA BCT 25-JUL-1990
DEFINITION E.coli F plasmid transfer operon: traU gene, complete cds;
traW gene, 3' end; and trbC gene, 5' end.
ACCESSION M34695
KEYWORDS periplasmic protein; transfer operon.
SOURCE E.coli F Plasmid (strain K12; isolate Flac plasmid FLO) DNA, clones
pKI[182;282;175].
ORGANISM Plasmid F
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1080)
AUTHORS Moore,D., Maneewannakul,K., Maneewannakul,S., Wu,J.H.,
Ippen-Ihler,K.A. and Bradley,D.E.
TITLE Characterization of the F plasmid conjugative transfer gene traU
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.A.Ippen-Ihler, 25-MAY-1990.
FEATURES from to/span description
pept 49 1041 traU protein precursor
sigp 1 66 traU protein signal peptide
matp 67 1038 traU protein
pept < 1 52 traW protein
pept 1050 > 1080 trbC protein
BASE COUNT 242 a 281 c 311 g 246 t
ORIGIN Map position 77.9-78.9 units on the genome.
1 cgatcgcttc ctgaaggtgg aatttattcc ggcagaggag ggcagaaaat gaagcgaagg
61 ctgtggctgc tgatgttatt ccttttcgcc ggtcatgtcc ctgcggcgtc tgcggattct
121 gcctgtgagg ggcgttttgt aaacccgatc acagatatct gctggagctg tattttcccg
181 ctctcgctgg gcagtatcaa agtcagtcag ggcaaggtcc ccgacacggc gaacccgtcg
241 atgcccattc agatttgtcc ggcaccgccg ccgctgttca ggcgtatcgg gctggccatt
301 ggttactggg agccgatggc gttgacggac gtcacccggt caccgggatg catggtgaac
361 ctgggcttca gcctgccggc ttttggtaaa acggcacagg gaacggcgaa aaaggatgag
421 aagcaggtaa atggggcgtt ctatcacgtt cactggtaca aatacccgct gacgtactgg
481 ctgaacatca tcacatcgct gggctgtctg gaaggtggtg acatggatat cgcttatctt
541 tctgaaatcg accccacctg gacggacagc agcctgacca ccattctcaa tccggaagct
601 gtcatctttg ccaatccgat agcacaggga gcctgcgcag cagatgcgat tgccagcgcc
661 tttaatatgc ctctcgatgt tctgttctgg tgtgccggtt cgcagggaag tatgtacccg
721 ttcaatggct gggtgagtaa tgagtccagt ccgttgcagt cctccctgct ggtcagtgaa
781 cgcatggcgt tcaagctgca ccgtcagggc atgattatgg aaaccatcgg gaaaaataac
841 gccgtctgta atgaatatcc gtccccaatc ctgcccaaag aacgctggcg ttaccagatg
901 gtgaatatgt atccggacag cgggcagtgc cacccgttcg ggcgcagcgt gacccgctgg
961 gaaaccggga aaaatccgcc caacacaaag aaaaacttcg gctacctgat gtggcgtaaa
1021 cgtaactgtg tcttcctgtg aggtgaatga tgaagctgag tatgaaatct ctggcagcac
//
LOCUS MUSSMRNAA 74 bp ss-RNA RNA 25-JUL-1990
DEFINITION Mouse brain-specific small RNA, clone pABr-4.
ACCESSION M35067
KEYWORDS small RNA.
SOURCE Mouse 17-day fetus, cDNA to RNA, clone pABr-4.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 74)
AUTHORS Anzai,K., Kobayashi,S., Suehiro,Y. and Goto,S.
TITLE Conservation of the ID sequence and its expression as small RNA in
rodent brains: Analysis with cDNA for mouse brain-specific small
RNA
JOURNAL Mol. Brain Res. 2, 43-49 (1987)
STANDARD simple staff_review
FEATURES from to/span description
RNA < 1 > 74 brain-specific small RNA
BASE COUNT 12 a 16 c 28 g 18 t
ORIGIN
1 ggggttgggg atttagctca gtggtagagc gcttgcctag caagcaaggc cctgggttcg
61 gtcctaagct ctgg
//
LOCUS MUSSMRNAB 74 bp ss-RNA RNA 25-JUL-1990
DEFINITION Mouse brain-specific small RNA, clone pABr-9.
ACCESSION M36619
KEYWORDS small RNA.
SOURCE Mouse 17-day fetus, cDNA to RNA, clone pABr-9.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 74)
AUTHORS Anzai,K., Kobayashi,S., Suehiro,Y. and Goto,S.
TITLE Conservation of the ID sequence and its expression as small RNA in
rodent brains: Analysis with cDNA for mouse brain-specific small
RNA
JOURNAL Mol. Brain Res. 2, 43-49 (1987)
STANDARD simple staff_review
FEATURES from to/span description
RNA < 1 > 74 brain-specific small RNA
BASE COUNT 11 a 15 c 28 g 20 t
ORIGIN
1 ggggttgggg atttagctta gtggtagagc ttgcctagca agcgcaaggc cctgggttcg
61 gtccttagct ctgg
//
LOCUS BOVPRLB 1214 bp ds-DNA MAM 25-JUL-1990
DEFINITION Bovine prolactin gene, exon 5.
ACCESSION M34535
KEYWORDS prolactin.
SOURCE Bovine pituitary DNA, and cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 628)
AUTHORS Carroll,S.M., Narayan,P. and Rottman,F.M.
TITLE N-6-methyladenosine resides in an intron-specific region of bovine
prolactin pre-mRNA
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 629 to 1214)
AUTHORS Carroll,S.M., Narayan,P. and Rottman,F.M.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Narayan, 16-MAY-1990.
The cDNA sequence which corresponds to this gene is found in J.B.C.
257: 678-681 (1982), accession number M25007.
Author address: P.Narayan
Dept. of Molecular Biology and Microbiology
School of Medicine
Case Western University
Cleveland, OH 44106
FEATURES from to/span description
pept / 629 820 prolactin, exon 5 (AA at 629)
pre-msg < 1 971 prolactin mRNA and intron
IVS < 1 628 prolactin intron D
BASE COUNT 400 a 229 c 197 g 388 t
ORIGIN
1 gtgagcttca tgaaagcttc cttgctattt tcatgaatga gagaggtgat ttctgtaatg
61 aggaatgagt tttgaactat ctcactgtac aagaacacaa ttcaggcctt ctttttctag
121 accggtgtta cataaagcaa gaacctgttc attcatagtg atagattcta ttgtaagtga
181 attagaattc caccagcaat ttttcacaga ggtatagtct ttcttgaatt gtacagttac
241 accaaaatct tgcctcttcc tgggtacaga tggctgaaat attttcaagg ataagagaat
301 tagagaatac aatttgcaag ataaatgttt tcttcaaaat atcccaagat atcctctact
361 gaaattcagc ttgtattctt tctctattct cctcaaacca caggatgaga atgagaagaa
421 agaaaagaga agatcaaaac caaatacttg agttctgctt tagtttttat taataaatta
481 ctaacatata tctgatacac tggctccaaa atccaagtgt agagactttc atgtatcttc
541 cctaattttt aatttgataa atagaaagaa caaagatgag ctaatactac taaaactcat
601 aataactcat tatcttttgg atgtttaggt tattcctgga gccaaagaga ctgagcccta
661 ccctgtgtgg tcaggactcc cgtccctgca aactaaggat gaagatgcac gttattctgc
721 tttttataac ctgctccact gcctgcgcag ggattcaagc aagattgaca cttaccttaa
781 gctcctgaat tgcagaatca tctacaacaa caactgctaa gcccacattc catcctatcc
841 atttctgaga tggttcttaa tgatccattc cctggcaaac ttctctgagc tttatagctt
901 tgtaatgcat gcttggctct aatgggtttc atcttaaata aaaacagact ctgtagcgat
961 gtcaaaatct aagactgcaa ttttgtcaat gtttcttatc ttcatttaat agacaatcaa
1021 atgaaaatcc ttccttatga ttgagagaaa gaacttctga ttaaaatttg tcacaaatag
1081 cagaaactga cattacaaag accgttaata acttacttta gaatcacagc aaattattct
1141 ggggtcaagt tattagaatt aaaaattaga taaacattca ttgtgttggt catgctacca
1201 agaagactga attc
//
LOCUS RHMCYA 1560 bp ds-DNA BCT 25-JUL-1990
DEFINITION R.meliloti adenylate cyclase (cyaA) gene, complete cds.
ACCESSION M35096
KEYWORDS adenylate cyclase.
SOURCE R.meliloti DNA.
ORGANISM Rhizobium meliloti
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 1560)
AUTHORS Beuve,A., Boesten,B., Crasnier,M., Danchin,A. and O'Gara,F.
TITLE Rhizobium meliloti adenylate cyclase is related to eucaryotic
adenylate and guanylate cyclases
JOURNAL J. Bacteriol. 172, 2614-2621 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 885 1466 adenylate cyclase (cyaA)
binding 871 879 ribosome binding site
BASE COUNT 292 a 512 c 487 g 269 t
ORIGIN
1 ggatcctgtt cctggacgcg agcggcctgc agtttgccga acgtcacgct gcctccaacg
61 gcttcgatcc gaggacgcgg ccctggtacc gcgcggccgt caacggcaag gcgccggtgg
121 ccatcggtcc ctatgagatg gccaccacag gcaatctcgg gatgaccata tcgcaagcgc
181 accgcggcaa cccccaaatc gtcatcggcg ccgatgtcgt tctcgatacg atcacggatt
241 ttctgtcccg cgagcggctg accgacgact cggtttcctt cgtgctcgat gcggtgggac
301 gaccgatcat ccactccgac tccaccatga tgcggcgcat catggcatcg aagggccggg
361 accggccggt ggccacgccg caggaggatg gactgatcga gagcatccgg cgcaacccgc
421 caccggccgg aaaggcaact ctcgtcgaag tcggaaaccg cacctatctc gtcacggtgg
481 cgccgctcga atcggcattg cttctgtccg ggcaccgggt ggtcgtcgcc gcccctctcg
541 acgagctgct ggcggccgca aacgagacgc tcgttcaggg acttgccgtc tcgggcgccg
601 tggtggtggt cgccgttctc ctggccctcg tgcttgcgca tctgatcacg aagtcgctca
661 accagctcac cgacagcgcc aaccgcctgc aggacctgga tttcgccact cctatcgacg
721 tttcgtcgca tgtggcggaa atctcgacgc tcaacggcgc aatgaacagg gctcgcgacg
781 cgatcttcac cttcgcgctc tatgttccga aggagctggt gcgcaagggc atcgaatccg
841 gccatttcgg cggccgcgcc gcatggcggc aggaggtgac ggcgatgttc accgacatct
901 acgacttcac caccatcagc gagggccggt cgccggaaga agtggtcgcg atgctctcgg
961 agtatttcga cctgttcagc gaggtcgtcg ccgcccacga cggaaccatc atccaattcc
1021 atggagactc ggtctttgcc atgtggaacg cgccggtcgc cgataccagg catgccgagc
1081 atgcctgtcg atgcgcactc gcggtcgagg agaggctcga ggccttcaat tctgcgcaac
1141 gcgccagcgg attgccggag ttccgcaccc gcttcggcat ccacaccgga acggccgtcg
1201 tcggcagcgt cggcgccaag gaacggctgc aatatacggc gatgggcgac acggtgaacg
1261 tcgcctcgcg gctcgagggc atgaacaagg attacggcac gagcgttctt gcaagcggcg
1321 cggtggtcgc ccaatgcaaa gacatggtga agttccgccc gctcggcacc gccaaggcaa
1381 agggccgttc gacggcgctc gacatttacg aagtcgtggg cgtcgtccgc gcggtgaaca
1441 ctaccgaagc cggaacggcc gcctgaggaa aggcagatgc cgcggcgaac ggcggccccg
1501 ctgaattcgc ttcgaaactc tgaaagcaaa aaagcccgga aacccgggct ttttttgact
//
LOCUS ECOCYSD 492 bp ds-DNA BCT 25-JUL-1990
DEFINITION E.coli sulfate adenylate transferase (cysD) gene, 5' end.
ACCESSION M35098
KEYWORDS sulfate adenylate transferase.
SOURCE E.coli (strain K-12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 492)
AUTHORS Malo,M.S. and Loughlin,R.E.
TITLE Promoter elements and regulation of expression of the cysD gene of
Escherichia coli K-12
JOURNAL Gene 87, 127-131 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 412 > 492 sulfate adenylate transferase (cysD)
mRNA 373 > 492 cysD mRNA
BASE COUNT 141 a 110 c 118 g 123 t
ORIGIN
1 ctgcaggagt tccggtcatg cgtcccggaa agaaagtagc aatatgtcgt gcctgagtat
61 tagcaaaatc gccaggttta ggtgacgagg cgtgtacggg gagaataaag catacgccga
121 gcgccagggc agcggtacgg tggcgcaatg cggaaaacat agtgagtcct taaataccat
181 gcaaattttt ttaccgccat agtatgaaac tgccgctgcg ctaaaacaat ttcaaatctt
241 cctaaacgcc cgaaatccgg tgccttaagc actttttgat attagctttg ccaaatcgtt
301 attccgttaa ggaactactc attctaattg gtaatttcat tcgttctctt acgctcccta
361 tagtcgaaac atctgatggc aagaaaatag cggtattgca aaggaacggt tatggatcaa
421 atacgactta ctcacctgcg gcaactggag gcggaaagca tccacattat tcgcgaggtg
481 gcggcagaat tc
//
LOCUS CHPCOX41A 956 bp ds-DNA PRI 25-JUL-1990
DEFINITION Chimpanzee cytochrome c oxidase subunit IV (COX4P1) processed
pseudogene, complete cds.
ACCESSION M34599
KEYWORDS cytochrome c oxidase subunit IV; pseudogene.
SOURCE Chimpanzee DNA, clone lambda-Ch1.
ORGANISM Pan troglodytes
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE 1 (bases 1 to 956)
AUTHORS Lomax,M.I., Welch,M.D., Darras,B.T., Francke,U. and Grossman,L.I.
TITLE Novel use of a chimpanzee pseudogene for chromosomal mapping of
human cytochrome c oxidase subunit IV
JOURNAL Gene 86, 209-216 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 216 665 cytochrome c oxidase subunit IV (COXIV)
pseudogene (E.C. 1.9.3.1)
signal 881 888 poly-A signal
BASE COUNT 268 a 222 c 277 g 189 t
ORIGIN Chromosome 14q21-qter.
1 ggtacctcca atcccagcta ctcgggaggc tgaggcagga gaatcacttg aactcgggag
61 gcggaggttg cagtgagctg agatcacgcc tctgcgctac agcctgggca acaagagcaa
121 aactccgtct cggaaaagaa aaaaacaaaa aagaactact ggggtcgcgg gacaccgggc
181 atagagggcg gcggtggtgg ggcagctgcg gcagaatgtt ggctaccagg gtagttagcc
241 tagttggcaa gcgagcaatt tccaccttgg tgtctgtacg agcacacgga aatgttgtga
301 agagcgatga ctatgcgctc ccagcttatg tggatcgacg tgactatccc gtacccgatg
361 tggcccatgt caagcacctg tctgccagac agaaagcctt gaagaagaag gagaaggcct
421 cctggagcaa ccgctccacg gatgggaaag tcgagttgta tcacattcag ttcaaggaga
481 gctttgctga gatgaacagg ggcgtgaacg agtggaagat ggttgtgggc gctgccatgt
541 tcttccttgg cttcacggcg ttcattatca tctgggagaa gcgctgtgtg tacggcccca
601 tcccgcacac ctttgacaaa gagtgggtgc ccatgcagac caagaggatg ctggacatga
661 ggtgaacccc tgcagggctt cgccagccaa gtgggactat gacaagaacg agtggaagaa
721 gtgaacccct gcagggcttc gccagccaag tgggactatg acaagaacga gtggaagaag
781 tgagagatgc tgtcctgctt ttgagccttg ctctgtcacc tccatactat aactccatgc
841 ctatttactg gaaacctgtt atgccaaaca gtaccactgc taataaatga ccagtttacc
901 tgaaagaaaa aaaaaaaaag aactactgaa gtgaaagaaa aatctggaga aagtac
//
LOCUS CHTMOMPA 682 bp ss-mRNA BCT 25-JUL-1990
DEFINITION C.trachomatis outer membrane protein (ompl) gene, 5' end.
ACCESSION M35099
KEYWORDS outer membrane protein.
SOURCE C.trachomatis (serovar L2/434/Bu), cDNA to mRNA.
ORGANISM Chlamydia trachomatis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE 1 (bases 1 to 682)
AUTHORS Kaul,R., Duncan,M.J.J., Guest,J. and Wenman,W.M.
TITLE Expression of the Chlamydia trachomatis major outer membrane
protein-encoding gene in Escherichia coli: Role of the 3' end in
mRNA stability
JOURNAL Gene 87, 97-103 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 568 > 682 outer membrane protein (ompl) precursor
sigp 568 633 outer membrane protein (ompl) signal peptide
matp 634 > 682 outer membrane protein (ompl)
mRNA 1 > 682 ompl mRNA
BASE COUNT 218 a 144 c 111 g 209 t
ORIGIN
1 aaaaacactt tctttgtagt aataaaaacg atttctatca aaacaaattc ttagattttc
61 ttacaaaaat ctcctctttt cttttagcca aacccccatc ttcgagctat tccaaacaca
121 aaaatcttag gttttggaaa ttaacaactc ataaaaattg aactgttttg taattaactc
181 aaaaccctct cattctcaac aatcaacata ttgccaacat ggcttttgct ctcggtttca
241 cagcgatttt tttcgcaaaa accaagaaca taaaacataa aaagatatac aaaaatggct
301 ctctgcttta tcgctaaatc aggaggcgct taagggcttc ttcctgggac gaacgttttt
361 cttatcaact ttacgagaat aagaaaattt tgttatggtc tcgagcattg aacgacatgt
421 tctcgattaa ggctgctttt acttgcaaga cattcctcag gccattaatt gctacaggac
481 atcttgtctg gctttaacta ggacgcagtg ccgccagaaa aagatagcga gcacaaagag
541 agctaattat acaatttaga ggtaagaatg aaaaaactct tgaaatcggt attagtgttt
601 gccgctttga gttctgcttc ctccttgcaa gctctgcctg tggggaatcc tgctgaacca
661 agccttatga tcgacggaat tc
//
LOCUS ECOK99FIM 740 bp ds-DNA BCT 25-JUL-1990
DEFINITION E.coli K99 fimbrial subunit gene, complete cds.
ACCESSION M35282
KEYWORDS K99 fimbrial subunit.
SOURCE E.coli (strain K-12 C600) DNA, clones 1, 2, 3, 4 and 5.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 740)
AUTHORS Roosendaal,B., Gaastra,W. and de Graaf,F.K.
TITLE The nucleotide sequence of the gene encoding the K99 subunit of
enterotoxigenic Escherichia coli
JOURNAL FEMS Microbiol. Lett. 22, 253-258 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 70 615 K99 fimbrial subunit precursor
sigp 70 135 K99 fimbrial subunit signal peptide
matp 136 612 K99 fimbrial subunit
pept 648 > 740 ORF1
BASE COUNT 235 a 133 c 146 g 226 t
ORIGIN
1 tagggaatgg ctatgttttc tggtgattcc acggaactaa aaaataatat cgaacaatgg
61 agaatctaga tgaaaaaaac actgctagct attatcttag gtggtatggc ttttgcgact
121 accaatgctt ctgcgaatac aggtactatt aacttcaatg gcaaaataac gagtgctact
181 tgtacaattg accctgaggt caatggtaat cgtacatcaa ctatagatct tgggcaggct
241 gctattagtg gtcatggcac tgtagtggat tttaaactaa aaccagcgcc cggcagtaat
301 gactgcctag cgaaaacaaa tgctcgtatt gactggtctg gttctatgaa cagtttaggt
361 tttaataata cagcttcagg aaatactgct gctaaaggat accatatgac tttgcgcgca
421 acaaacgttg gaaatgggtc tggtggtgct aatattaata cttcattcac tacggctgaa
481 tacactcaca cttctgcaat tcagtcattt aactattcag cccagctgaa aaaagatgac
541 cgcgctccgt ctaatggtgg atataaagct ggcgtattta ctacttcagc atccttctta
601 gtcacttata tgtaatattt aaagtatttt acattgcggg catatctatg attgcccgca
661 atattactga tggatattat atgaatagaa aaaaacatca gattttaaaa attttattgt
721 tgtgtctaat aagcagtaaa
//
LOCUS ECORRDAA 72 bp ss-rRNA RNA 25-JUL-1990
DEFINITION E.coli 16S rRNA fragment.
ACCESSION M35308
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE E.coli (MRE 600) ribosomal RNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 72)
AUTHORS Ehresmann,C., Fellner,P. and Ebel,J.P.
TITLE Nucleotide sequences of sections of 16S ribosomal RNA
JOURNAL Nature 227, 1321-1323 (1970)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 72 16S rRNA
BASE COUNT 17 a 16 c 20 g 19 t
ORIGIN
1 ggcttggttt gcaagtgtca gatactgtta agcatctgaa atccccgggc taaccctggg
61 aactgatgac tg
//
LOCUS ECORRDAB 174 bp ss-rRNA RNA 25-JUL-1990
DEFINITION E.coli 16S rRNA fragment.
ACCESSION M35309
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE E.coli (MRE 600) ribosomal RNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 174)
AUTHORS Ehresmann,C., Fellner,P. and Ebel,J.P.
TITLE Nucleotide sequences of sections of 16S ribosomal RNA
JOURNAL Nature 227, 1321-1323 (1970)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 158 16S rRNA
BASE COUNT 42 a 40 c 55 g 37 t
ORIGIN
1 ggcatgaaga cacactgcta actccgaata cgcacaagcc cgtaatggag cgacggtggg
61 ccttgttccc gtgccccgat gtggggtgga ggtgactgtg ggttgtgata ttcggggagg
121 caaaagaagt agcgagtcta accttgctta ccactttgcc taatacggga aacg
//
LOCUS HPTRRA 117 bp ss-rRNA RNA 25-JUL-1990
DEFINITION H.aurantiacus 5S rRNA gene.
ACCESSION M35310
KEYWORDS 5S ribosomal RNA; ribosomal RNA.
SOURCE H.aurantiacus (strain Sengas Wie 2) ribosomal RNA.
ORGANISM Herpetosiphon aurantiacus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales;
Cytophagaceae.
REFERENCE 1 (bases 1 to 117)
AUTHORS Van den Eynde,H., Stackebrandt,E. and De Wachter,R.
TITLE The structure of the 5S ribosomal RNA of a member of the phylum of
green non-sulfur bacteria and relatives
JOURNAL FEBS Lett. 213, 301-303 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 117 5S rRNA
BASE COUNT 22 a 37 c 40 g 18 t
ORIGIN
1 tccggtggca atgtcggagg ggtcccaccc gttcccatcc cgaacacgga agttaagccc
61 tccagagccg atggtactcc gcggggaacc gcgcgggaga gtaggtcgct gccggat
//
LOCUS HUMCOX4AA 634 bp ss-mRNA PRI 25-JUL-1990
DEFINITION Human cytochrome c oxidase subunit IV (COX4) mRNA, complete cds.
ACCESSION M34600
KEYWORDS cytochrome c oxidase; cytochrome c oxidase subunit IV.
SOURCE Human liver, cDNA to mRNA, clones pCOX4.-[111 and 4.2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 634)
AUTHORS Lomax,M.I., Welch,M.D., Darras,B.T., Francke,U. and Grossman,L.I.
TITLE Novel use of a chimpanzee pseudogene for chromosomal mapping of
human cytochrome c oxidase subunit IV
JOURNAL Gene 86, 209-216 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 510 cytochrome c oxidase subunit IV (COX4)
/hgml_locus_uid="LS0022W" /nomgen="COX4L2"
/map="16q22-q24"
mRNA < 1 634 COX4 mRNA
BASE COUNT 156 a 157 c 180 g 141 t
ORIGIN
1 atgttggcta ccagggtatt tagcctagtt ggcaagcgag caatttccac ctctgtgtgt
61 gtacgagctc atgaaagtgt tgtgaagagc gaagactttt cgctcccagc ttatatggat
121 cggcgtgacc accccttgcc ggaggtggcc catgtcaagc acctgtctgc cagccagaag
181 gcactgaagg agaaggagaa ggcctcctgg agcagcctct ccatggatga gaaagtcgag
241 ttgtatcgca ttaagttcaa ggagagcttt gctgagatga acaggggctc gaacgagtgg
301 aagacggttg tgggcggtgc catgttcttc atcggtttca ccgcgctcgt tatcatgtgg
361 cagaagcact atgtgtacgg ccccctcccg caaagctttg acaaagagtg ggtggccaag
421 cagaccaaga ggatgctgga catgaaggtg aaccccatcc agggcttagc ctccaagtgg
481 gactacgaaa agaacgagtg gaagaagtga gagatgctgc ctgcgcctgc acctgcgcct
541 ggctctgtca ccgccatgca actccatgcc tatttactgg aaacctgtta tgccaaacag
601 ttgtaccact gctaataaat gaccagttta cctg
//
LOCUS MRGRBMII 2574 bp ds-DNA VRT 25-JUL-1990
DEFINITION M.serrator retropseudogene-like repetitive element I (RBMI).
ACCESSION M35143
KEYWORDS repetitive sequence.
SOURCE M.serrator blood DNA.
ORGANISM Mergus serrator
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Anseriformes; Anatidae.
REFERENCE 1 (bases 1 to 2574)
AUTHORS McHugh,K.P., Madsen,C.S. and de Kloet,S.R.
TITLE A highly repeated retropseudogene-like sequence in DNA of the
redbreasted merganser (Mergus serrator)
JOURNAL Gene 87, 193-197 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt 1 2574 retropseudogene-like repetitive element
pept 1403 909 (c) ORF1
pept 2118 1426 (c) ORF2
BASE COUNT 691 a 600 c 484 g 790 t 9 others
ORIGIN
1 gaattcctca aacacgctgc ggctgcttac ctttaataca cccgttgcat gcgatggagc
61 tgtatttctt gcttttncct gcactggaag gcttcccttc cttgtcaggt tgtttactgc
121 cctcactctt ctgcattgct cacatgaaga gccatctgga ggatgggttt cttccttctt
181 ctcccgggtt atcttctgga aacgaggacc taagtattcc aaggagcctt tcactttcct
241 ggtgtttctc cttttttttc tttttcttct cctttttctt ctttttctta tgcttgtgat
301 tggcattgtc aaagtggagc gcacagaaac acaaatcgtg aagtctgaaa gaaacatgca
361 agttaaaaag agaaaaaaag atgtggcact tgttgcctat atgaaacttt atttttttta
421 ccacaggtga tgatttgcag catgtcagct attttgtggt gctttgtgca cacgcaactt
481 acttacttta gatgcagcaa acttaagccc tcagattgaa ggaccatagg ctggtttgta
541 cacagatcat taaccatggt tagctctgga atacgtgcaa gcagaaaaaa acttttaacc
601 taatccggaa tggtgtacag atgtgattcg aactatgtgg tctaacgcta gtgctctgac
661 acaattcagc aatagctttc ctatcttcac tgaacaccta cacacagacc cagccagctg
721 atgctatcta aataacttag aaactaccag aaaaaaaaaa aaaaaaaaaa gaagaaaaaa
781 cgagaataaa aaaaaaaagt agaaaaaaaa aaaaaaagga agacatgaga agcacccaga
841 aatgaattag gataaaaaat tcggagtatg ctggaatcct tgcttacttg gaatccttct
901 ctgcatgttt aatccttaga cttctttttt cttctagaac ttgttgatat ttttgcattt
961 ttttcaccac ctaaaagctc cttttctatc tttctgtctt tcctttctat ttcactttca
1021 ctaccttctg cacgggtata ttttcttttt ctgtttcttt ctgtttcatt tttctggcga
1081 cagttctcca aatgagctga cacgggtgga agcgcatgtc tttcacgaga atgtcttctg
1141 gaatgttgct gatgtaccga gcaacgatgc aagtctgctc ggggtgtgct aaagcgacgt
1201 acatcttcct ctctcaagag ggaactgtga ggccatccgc ttttgtaatg ataactctta
1261 tgtgacctgc tgtagtaagt tgcagtcgat ttgtcaaagg ctgcatcgcc gtgagacaac
1321 tttctctctc tactgtctcc tgtcgcatga ggtgaatagt aatcattgta atagctacat
1381 ctttcccatc tccgagcntt catcctcgat agtatctntc tctgctcaac ttctttgccc
1441 tttggatcgg taatatctat tgctacctcg ttctgatctt cctccgcttg ccagatctgt
1501 actttgaata tttgacngct cttctgccat tctcagggct gtttctttca nnnnggaaag
1561 atctgcacct gcttcccccc cagtgctcct gcttgtgacg cttttgctca acaacttcca
1621 cgctctgaga acacctcctc ttgctggaag gacctgcttt ttgactctcc ttctcttcag
1681 taggagcatg ttcctcttgc tttggtaatg ctctttgtca gtgtttttag tctcgncttg
1741 tatcttggca tctctctgta atagctgagg aggaaaggtt tttagagcta cattcagtgt
1801 cagacttgag agaggaagct tgccgcaatt tctcaccagg ctcagaagac tctttgccgg
1861 acaaaacgtt ttcttttgaa atgaggtcac gttcttttca tcttcttgct ttctccttat
1921 ctccaccgtc atcacattgg tactgtgcga ggtattcatc attccagtag attttcgagg
1981 gtccgcaaca ctgcacaaaa taaaagcaca tttctcagtt ctgctgaagg acgtgaatat
2041 taagaggaaa accttccaaa agtcgaacaa acaaacaaaa acctccggac tacaggaaca
2101 ctctccaaga tatgccattt agaaacctct cctgtcatta ggacaccttc ttcagctcca
2161 cagaaagggg ttttgccctc ttgcttctga agccattgca ctaaaaagca aacgcagtgc
2221 tgtctccctc cacatgctgc tctgaataag agccagaata ttcaaaacca ctctctttgt
2281 tctcccacat agccgaaaaa acaccggttg aaacagagtt ttctacctct cgcccaacaa
2341 tttacattca catagcctat gactgaaaaa ataaaaggcg gggctgagga ggaacagcca
2401 gtgttggaaa tgaaaagaag cagcccgttc cttcatagtc ttaagcctat gctactagga
2461 aaacaaaaca aaacaaaaca aaacaagagg agaggagaac aacagcggga aattttcctg
2521 ttctccaggt gttaaattgc aaagcctcct ctggaggatc acagctgtga attc
//
LOCUS MUSLAMB 2789 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse lamin B mRNA, complete cds.
ACCESSION M35153
KEYWORDS lamin B.
SOURCE Mouse liver, cDNA to mRNA, clone FML11-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2789)
AUTHORS Hoeger,T.H., Krohne,G. and Franke,W.W.
TITLE Amino acid sequence and molecular characterization of murine lamin
B as deduced from cDNA clones
JOURNAL Eur J Cell Biol 47, 283-290 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 256 2019 lamin B
mRNA < 1 2789 lamin B mRNA
site 2768 2773 poly-A signal
BASE COUNT 696 a 657 c 812 g 624 t
ORIGIN
1 aataatctta agctcttaca aagagctgcg ggcgggagac tcgcgtccgg cgcacagccg
61 tctgcgtctc ccggctgccc tggcctcttc ccgcgcgcgc gtgcagtgtg cgtgtacact
121 cacaaagggc gtctggcggg cgatccgcgg ccctcccgct tcgctctttg tgcggtagcc
181 ccgccgccac cgccagccca ggtccgctcg atcctcaccg gcctgtggtt tgtaccttcg
241 gtcccgccgc ccgccatggc gaccgcgacc cccgtgcagc agcagcgggc gggcagccgc
301 gccagcgccc ccgccacgcc gctcagcccc acgcgcctgt cgcgcctgca ggagaaagag
361 gagctgcggg agctcaacga ccgcctggct gtgtacatcg ataaggtccg cagcctggag
421 acggagaaca gcgcgctgca gctgcaggtg accgagcggg aggaggtgcg cggccgcgag
481 ctcaccggcc tcaaggctct ctacgagacc gagctggccg acgcacgccg cgctctggac
541 gacacggccc gcgagcgcgc caagcttcag atcgagctgg gcaagttcaa ggccgagcac
601 gaccagctgc tgctcaatta tgccaagaag gaatctgatc tcagtggagc ccagatcaag
661 cttcgagagt atgaggcggc actaaactct aaggatgcgg cgctggcaac tgccctaggg
721 gacaaaaaga gtttagaggg agacttggag gatctgaaag atcagattgc ccagctagaa
781 gcatccttat ctgccgccaa aaagcagtta gcagatgaaa ctttacttaa agtggatttg
841 gagaatcgct gtcagagcct tactgaggac ttggagtttc gtaaaaatat gtatgaagag
901 gagatcaatg agacaaggag gaagcatgag acccgcttgg tggaagtgga ctctgggcgt
961 cagattgagt atgagtacaa gctggctcaa gccctgcatg agatgcggga gcagcacgac
1021 gcgcaggtga ggctgtacaa ggaagagctg gagcagacct accacgccaa gcttgagaat
1081 gccagactct cctcagagat gaacacttcc actgtcaaca gtgcccggga agagctgatg
1141 gagagccgga tgaggatcga gagcctctcc tcacagctct ctaacctgca gaaagagtct
1201 agagcgtgtt tggaaaggat ccaggaattg gaggacatgc ttgctaagga gagagacaac
1261 tcgcgccgca tgctgtctga cagagagaga gagatggcgg agatcaggga ccagatgcag
1321 cagcagctga gtgattatga gcagctgctg gacgtgaagc tggccctgga catggagatc
1381 agcgcctaca ggaagctcct ggaaggcgaa gaagagcggt taaagctctc tccaagccct
1441 tcttcccggg tgaccgtgtc cagagcgtcc tccagtcgca gtgtgcgcac caccagagga
1501 aagcggaaga gagttgatgt ggaggagtcg gaggcgagca gcagtgttag catttcccac
1561 tctgcctcag ccacggggaa cgtgtgcatt gaagagatag atgttgatgg gaagtttatt
1621 cgcttgaaga acacttctga gcaggatcaa ccaatgggag gctgggagat gatcagaaaa
1681 attggagaca catcagtcag ttacaaatat acctcaagat atgtgctgaa ggctggccag
1741 actgtcacag tgtgggctgc aaatgctggc gtcacagcca gccctccaac tgacctcatc
1801 tggaagaacc agaactcttg gggtactggt gaagatgtga aggttatgct taagaattct
1861 cagggagagg aggttgctca gagaagctct gtcttcaaga ccaccatacc cgaggaggag
1921 gaggaggagg aggagcccat cggagtggct gtggaggagg agcgtttcca ccagcaggga
1981 gccccaagag catggaataa aagctgtgcc attatgtgaa cttatcaaga catggtcgat
2041 cttcctcaag ctagaagcat ggagtcctgt atacagtgca gagccttctc agaagcacat
2101 gatatttttg tatttccttt atgtgaattt ttaagctgcg aatctgatgg ccttaatttc
2161 ctttttgaca ctgaaagttt tgtcaaaaga aatcctatcc atacacgttg taagatgtga
2221 attattgaca ctgagctaac tgtactgttt ggaaaggggc cctcaagttt ttggcatttt
2281 ttctttcctt tttgtatgtg tgtatgtaat tttttttttt taagttcttt taagagggga
2341 caaggagggt aagaaaacca ctgcgtgtcc gggcattaat tgaagcttgc tctccctaga
2401 tgggcggtct gctctcggtc cttctctgct ctctataaaa tggtgctgtc ggggagggag
2461 gggggaagtt tttcaatata tgaacttttg tatggaattt tttgtaataa gtgatcaggt
2521 tacaattttt ttaaatagaa aagagaagaa aaacgttgta agaacggaat attaatctag
2581 tcacccatgt acgcactctg gatggaggtt ctacagagct gttgattggt caactacttc
2641 tcttacattg ttgactcatg aggggagcgg gcaggcgggt gagggtgggg gaaggctttc
2701 tcttcaaatt cgctagttga gtttttaaga tagtgtacat gcttacattt cttatccgac
2761 attaacaaat aaaacgctgt tttcctatt
//
LOCUS MUSMSTA1 1651 bp ds-DNA ROD 25-JUL-1990
DEFINITION Mouse metastatic cell protein (mts1) gene, exons 1 and 2.
ACCESSION M36578 M35147
KEYWORDS mts1 protein.
SEGMENT 1 of 2
SOURCE Mouse metastatic cell line DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1651)
AUTHORS Tulchinsky,E.M., Grigorian,M.S., Ebralidze,A.K., Milshina,N.I. and
Lukanidin,E.M.
TITLE Structure of gene mts1, transcribed in metastatic mouse tumor cells
JOURNAL Gene 87, 219-223 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1498 / 1638 mts1 protein, exon 2 (first expressed exon)
pre-msg 255 > 1076 mts1 mRNA and introns
IVS 293 1482 mts1 intron A
IVS 1639 > 1651 mts1 intron B
signal 225 231 TATA box
BASE COUNT 380 a 358 c 460 g 437 t 16 others
ORIGIN
1 ttctggctga gctgtggctg cttggtggtg tccaccccat ccaagcctct gccgtgccca
61 ctggagctca ctcactactt gattgtgcct gctggggagg gagcaggaag cctagatccc
121 agactgggct ggtcgagggt gctatgacat ttactacatc aaccaacagc aagagcacag
181 tatccatgtt cccccatcct ctgcatgggc agggcctagc agggtataaa taggtcagat
241 tgttgggctc tccccaaacc tctctattca gcacttcctc tctcttggtc tggtgagttg
301 tgttggtctg atagcactgc tagcggcatt agaggctgag gctagggtag aagaaagggg
361 ggctgctgtg ggggaacaga tgtctttaat aaatccagat gagagattct gatgtggagg
421 ttcatgtatg tgtgtgtgtg tgtgtgtttt cacgagaatg aaaaccaaaa aaaaaaaaaa
481 aaaaaaaaaa agtgtataaa tggctacatc tgagctcccg aaggttttga gatactgagg
541 ctggcttgca tgttgctata gtgtatattg gtggtgcttg ggagtcactg tcatgcatag
601 gatgctgact cgtgttgctg ggtaatacaa gacagtgtgt ggacactcgg gtacaggaag
661 caaagcgaag gcatcagtag gcctttttgt tttacagtat ttaaattaca gtttttattt
721 gtgtgtatga gcgtatgggt tgggctggag caaatgccaa ggcgacattg tgggagccaa
781 aggacaattt gtgtgggagt caactcgttc cttctagcat gtgggctgtg gggatcaaac
841 tcaggccttg gagcttggtg gcaagcacct ctacccattg agctatctct ccagcaccct
901 cctgcagnnn nnnnnnnnnn nntttgtagt gtcttgtttt taattgccct atgaacatat
961 agcacctagg ccaagaaagc ctagcttccc caccctctcc tcttgcatcc ctacctctgc
1021 cacttcatct tactcctatt aggcagctgg ggtttttcca cttttttttt gtctgcctct
1081 gggcaggcag ccagcagccg cgcccaacgc tgggagggag aagaatgggc caggcctgtg
1141 cttgtggttg agctgtggga gtgagtaagc tgatggaaaa ctgctgttgt tgaggccata
1201 gctgagaggc acagaaaggt gctggcatag gtctccagag tttgaggggt agctttgcag
1261 gtttcagagc ccagagcaca tgtgaccttc ttgccaccaa tgggtcccat tcctctgatc
1321 cccnaggggg tgaggtccat ctcttagaga gttgtgggat agagcactta aaatgggaac
1381 agaatgagtg tgatttgggt catgctcagc aacacatatc cagttctcaa cacactgttg
1441 gcgtgggttg gagaatgtta cttttgtgtc tcctgccctt aggtctcaac ggttaccatg
1501 gcaagaccct tggaggaggc cttggatgta attgtgtcca ccttccacaa atactcaggc
1561 aaagagggtg acaagttcaa gctgaacaag acagagctca aggagctact gaccagggag
1621 ctgcctagct tcctgggggt aagtgggtcc t
//
LOCUS MUSMSTA2 545 bp ds-DNA ROD 25-JUL-1990
DEFINITION Mouse mts1 protein gene, exon 2.
ACCESSION M36579 M35147
KEYWORDS mts1 protein.
SEGMENT 2 of 2
SOURCE Mouse metastatic cell line NIH3T3 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 545)
AUTHORS Tulchinsky,E.M., Grigorian,M.S., Ebralidze,A.K., Milshina,N.I. and
Lukanidin,E.M.
TITLE Structure of gene mts1, transcribed in metastatic mouse tumor cells
JOURNAL Gene 87, 219-223 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 14 178 mts1 protein, exon 2
pre-msg < 1 315 mts1 mRNA and introns
IVS < 1 13 mts1 intron B
signal 297 302 poly-A signal
BASE COUNT 126 a 97 c 170 g 152 t
ORIGIN
1 cttcaacggc cagaaaagga cagatgaagc tgcattccag aaggtgatga gcaacttgga
61 cagcaacagg gacaatgaag ttgacttcca ggagtactgt gtcttcctgt cctgcattgc
121 catgatgtgc aatgaattct ttgagggctg cccagataag gagccccgga agaagtgaag
181 actcctcaga tgaagtgttg gggtgtagtt tgccagtggg ggatcttccc tgttggctgt
241 gagcatagtg ccttactctg gcttcttcgc acatgtgcac agtgctgagc aaattcaata
301 aaaggttttg aaactattag ctgttgtctg agagactgga gctatgggct gagggctgtg
361 gtagagactg ctggaagttg acctgagctt tgtggggcca aactaaaaaa aggtcgggga
421 gggggtgggt ggcttatttt gagtacattg caagtatgta tttgtgtgtg tcggcttagt
481 catgcgtgca tgtgtgcgtg cgtgtgtgtt tgtgtgtgtt tacgtgctcc tatatagcaa
541 ccgag
//
LOCUS MUSNFH 3959 bp ss-mRNA ROD 25-JUL-1990
DEFINITION Mouse neurofilament component (NF-H) mRNA, complete cds.
ACCESSION M35131
KEYWORDS neurofilament protein.
SOURCE Mouse (strain Swiss-Webster) brain, cDNA to mRNA, clones pMuH1,
pMuH5, and DNA, clone lambda-5A.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3959)
AUTHORS Shneidman,P.S., Carden,M.J., Lees,J.F. and Lazzarini,R.A.
TITLE The structure of the largest murine neurofilament protein (NF-H) as
revealed by cDNA and genomic sequences
JOURNAL Mol. Brain Res. 4, 217-231 (1988)
STANDARD simple staff_review
COMMENT Nucleotides 1-955 are derived from genomic DNA.
FEATURES from to/span description
pept 154 3372 neurofilament component (NF-H)
signal 59 65 TATA box
signal 3936 3942 NF-H mRNA
BASE COUNT 1104 a 1122 c 1197 g 536 t
ORIGIN
1 ggggccgcgg gggaggaggt ggagcccact gccgaggggc cggaccgggc caccgcgata
61 taaaagagcc ggagtcccag agctgccgca gtgctgcctg ccccgtccca gccccgcact
121 cccgctccgc tggcggccgc acctgctccg gccatgatga gcttcggcag cgccgatgcg
181 ctgctgggcg ccccgttcgc gccgctgcac ggaggcggca gcctgcacta ctcgctgagc
241 cgcaaggcag gcccgggcgg cacgcgctcc gcggccggct cctccagcgg cttccactcg
301 tgggcgcgga cgtccgtgag ctccgtgtcc gcctcaccca gccgcttccg cggcgccgcc
361 tcgagcaccg actcgctaga caccctaagc aacggcccag agggctgcgt ggtggcggcg
421 gtggcggcgc gcagcgagaa ggagcagctg caggctctga acgaccgctt cgcgggctac
481 atcgacaagg tgaggcagct cgaggcgcac aaccgcagcc tggagggcga ggcggcggcg
541 ctgcggcagc aacaagccgg ccgcgccgcc atgggcgagc tgtacgagcg cgaggtgcgc
601 gagatgcgcg gcgccgtgct gcgcctcggg gcggcgcgcg ggcagctgcg cctggagcag
661 gagcacctgc tggaggacat cgctcacgtc cgccagcggc tggacgagga ggcccggcag
721 cgtgaggagg cggaggcggc ggcgcgcgcc ctggcgcgct tcgcgcagga ggcggaagcg
781 gcgcgcgtgg agctgcagaa gaaggcgcag gcgctgcagg aggagtgcgg ctacctgcgg
841 cgccaccacc aggaggaggt gggcgagctg ctcggtcaga tccagggctg cggggccgcg
901 caggcgcagg ctcaggccga ggctcgcgac gccctcaagt gcgacgtgac gtcggcgctg
961 cgggagatcc gcgcgcagct cgaaggccac gcggtgcaga gcacgctgca gtccgaggag
1021 tggttccgag tgaggttgga ccgactctca gaggcagcca aagtgaacac agatgctatg
1081 cgctcggccc aagaggagat aactgagtac cggcggcagc tgcaagccag gaccacagag
1141 ttggaggccc tgaaaagcac caaggagtca ctggagaggc agcgctctga gctagaggac
1201 cgtcatcagg cagacattgc ctcctaccag gacgctattc agcagctgga cagtgagctg
1261 agaaacacca agtgggagat ggctgcacag ctccgagagt accaggacct gctcaacgtc
1321 aagatggccc tggacattga gattgccgct tacagaaagc tcctggaagg cgaagagtgt
1381 cggattggct ttggtccgag tcccttctct cttactgaag gactcccaaa aattccctcc
1441 atatccacgc acataaaagt caaaagcgaa gagatgataa aggtagtaga gaaatccgag
1501 aaggaaactg tgattgtaga aggacagaca gaagagatcc gggtgacgga aggagtgaca
1561 gaagaggagg acaaagaggc ccaaggtcag gaaggagaag aagcagaaga gggagaagaa
1621 aaagaagaag aggaaggagc agcagctaca tctccccctg cagaagaggc tgcatctcca
1681 gaaaaagaaa ccaagtctcg tgtgaaagaa gaggccaagt ccccaggtga ggccaagtcc
1741 ccaggtgagg ccaagtcccc aggtgaggcc aagtccccag ctgaggccaa gtccccaggt
1801 gaggccaagt ccccacgtga ggccaagtcc ccaggtgagg ccaagtctcc agctgagccc
1861 aagtctccag ctgagcccaa gtctccagct gaggccaagt caccagctga gcccaagtct
1921 ccagctacag tgaagtctcc aggtgaggcc aagtcaccat ctgaggccaa atctccagct
1981 gaagccaaat ctccagctga ggccaaatct ccagctgagg ccaaatctcc agctgaggcc
2041 aagtcaccag ctgaagccaa gtcaccagct gaagccaaat ctccagctac agtgaagtct
2101 ccaggtgagg ccaagtcacc atctgaggcc aaatctccag ctgaagccaa atctccagct
2161 gaggccaaat ctccagctga ggccaaatct ccagctgagg tcaagtcacc aggtgaggcc
2221 aagtctccag ctgagcccaa gtcaccagct gaggccaaat ctccagctgc agtgaagtca
2281 ccagctgagg ccaagtctcc agctgcagtc aagtccccag gtgaggccaa gtccccaggt
2341 gaggccaagt caccagctga ggccaaatct ccagctgagg ccaagtcacc aattgaggta
2401 aaatctccag agaaggccaa gacccccgtc aaggaaggag caaaatctcc agctgaggcc
2461 aagtctcctg agaaggccaa gtcccccgtg aaggaagata tcaagccccc agctgaggcg
2521 aaatcccctg agaaggccaa gagccccatg aaggaaggag caaagcctcc tgagaaggcc
2581 aagcctctag atgtgaagtc tccggaagcc cagactccag tacaggagga agcgaacgac
2641 cccacagaca tcagaccccc tgagcaggtg aaaagtcctg ccaaggagaa ggccaagtcc
2701 cctgagaagg aagaagccaa gacttctgaa aaggtggctc ccaagaagga agaggtgaag
2761 tcccctgtga aggaggaggt aaaagccaaa gaacccccaa agaaggtaga agaagagaag
2821 acactgccta caccaaagac agaggcgaag gagagtaaga aagacgaagc tcccaaggag
2881 gccccgaagc ccaaggtgga ggagaagaag gaaactccca cggaaaagcc caaggactct
2941 acagcagaag ccaagaagga agaggctgga gagaagaaga aagccgtggc ctcagaggag
3001 gagactcctg ccaagttggg tgtgaaggaa gaagctaaac ccaaagagaa gacagagaca
3061 accaagacag aagcagaaga caccaaggcc aaagaaccta gcaaacccac agagacggaa
3121 aagccaaaga aagaggagat gccagcggca ccagagaaga aagacaccaa ggaggagaag
3181 accacagagt ccaggaagcc tgaggagaag cccaaaatgg aggccaaggt caaggaggat
3241 gacaagagcc tttccaaaga gcctagcaaa cccaagacag aaaaggctga aaaatcctct
3301 agcacagacc agaaagaaag ccagccccca gagaagacca cagaggacaa ggccaccaag
3361 ggagagaagt aagagaacaa gagaaacacc cagaatagcc aaagaaactc aggacggtcc
3421 cagtactcag gggtcggcgt aataaatttt atttcttcct ttccctccgt aagaagaaac
3481 actgcttaga tggtgggcct gccctcacca aacaggaatt tctattaaga ttaagttagc
3541 aagagaagat aaccctgagc cttgtccccc acgccgaaaa ccctccccag gtgatggaca
3601 attatgatag cttcttgtag ccgaacgtga tgtatgctga acgctacgcg taaaacacgc
3661 gtctaaaaac tgccccctcc tttccaagta agtgcattta tttcctgtat gtccaactga
3721 cagatgaccg caataatgaa tgagcagtta gaaacgcatt atgcttgaaa tgttgtaacc
3781 tattcctgaa tgccttcttg ttttccaaag gagtggtcag gcccttgccc agtacacgct
3841 cctggaagag ctgcagcagg tgaggcaggg cgctggccac tgaaccacgc cagggtgtac
3901 tctccactga agtccacttt caattgcttc catgcaataa aaccaagtgc ttctgaaat
//
LOCUS MUSRGCA 350 bp ds-DNA ROD 25-JUL-1990
DEFINITION Mouse 18S rRNA gene.
ACCESSION M35283
KEYWORDS 18S ribosomal RNA; processing factor; ribosomal RNA.
SOURCE Mouse (strain S100) ribosomal DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 350)
AUTHORS Mishima,Y., Katayama,M. and Ogata,K.
TITLE Identification of a protein factor and the nucleotide sequence
required for processing of mouse precursor rRNA
JOURNAL J. Biochem. 104, 515-520 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 325 > 350 18S rRNA
site 220 220 processing site
BASE COUNT 29 a 128 c 117 g 76 t
ORIGIN
1 tcgacgttcc ggctctcccg atgccgaggg gttcgggatt tgtgccgggg acggagggga
61 gagcgggtaa gagaggtgtc ggagagctgt cccggggcga cgctcgggtt ggctttgccg
121 cgtgcgtgtg ctcgcggcgg gttttgtcgg accccgacgg ggtcggtccg gccgcatgca
181 ctctcccgtt ccgcgcgagc gccgcccggc tcacccccgg tttgtcctcc cgcgaggctc
241 tccgccgccg cctcctcctc ctctctcgcg ctctctgttc cgcctggtcc tgtcccaccc
301 ccgacggctt cgctcgcgct tccttacctg gttgatcctg ccagtagcat
//
LOCUS MYCRDNAA 190 bp ds-DNA BCT 25-JUL-1990
DEFINITION M.hyorhinis A-repeat sequence DNA.
ACCESSION M35303
KEYWORDS A-repeat.
SOURCE M.hyorhinis DNA, clone pG102.1.
ORGANISM Mycoplasma hyorhinis
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 190)
AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
TITLE Reiterated DNA sequences defining genomic diversity within the
species Mycoplasma hyorhinis
JOURNAL Mol. Microbiol. 2, 665-672 (1988)
STANDARD simple staff_review
BASE COUNT 94 a 15 c 31 g 50 t
ORIGIN
1 gaattcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt
61 atttttgaag aaaataaagc aagatatgaa aaaaggagaa tcaaagctga acttaataat
121 agaggctata aaattggact taaaaaagtt cacagattat tggaaaaatt caatcttaaa
181 gcaatttgtt
//
LOCUS MYCRDNAB 190 bp ds-DNA BCT 25-JUL-1990
DEFINITION M.hyorhinis A-repeat sequence DNA.
ACCESSION M35304
KEYWORDS A-repeat.
SOURCE M.hyorhinis DNA, clone pG102.3.
ORGANISM Mycoplasma hyorhinis
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 190)
AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
TITLE Reiterated DNA sequences defining genomic diversity within the
species Mycoplasma hyorhinis
JOURNAL Mol. Microbiol. 2, 665-672 (1988)
STANDARD simple staff_review
BASE COUNT 87 a 16 c 38 g 49 t
ORIGIN
1 gaattcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt
61 atttttgaag aaaataaagc aagatatgaa aaaaggagaa tcaaagctga acttaataat
121 agaggctata aaattggact tagatagggt tgagtgttgt tccagtttgg acaagaagtc
181 cactattaaa
//
LOCUS MYCRDNAC 191 bp ds-DNA BCT 25-JUL-1990
DEFINITION M.hyorhinis A-repeat sequence DNA.
ACCESSION M35305
KEYWORDS A-repeat.
SOURCE M.hyorhinis DNA, clone pG101.
ORGANISM Mycoplasma hyorhinis
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 191)
AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
TITLE Reiterated DNA sequences defining genomic diversity within the
species Mycoplasma hyorhinis
JOURNAL Mol. Microbiol. 2, 665-672 (1988)
STANDARD simple staff_review
BASE COUNT 97 a 18 c 29 g 47 t
ORIGIN
1 gaactcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt
61 atttttgaaa gaaaataaag caagatatga aaaaaagaga atcaaagctg aactcaataa
121 tagaggctat aaaattggac ttaaaaaagt tcacagatta ttgaaaaaat tcaatctcaa
181 agcaatttgt t
//
LOCUS MYCRDNAD 191 bp ds-DNA BCT 25-JUL-1990
DEFINITION M.hyopneumoniae A-repeat sequence DNA.
ACCESSION M35306
KEYWORDS A-repeat.
SOURCE M.hyopneumoniae DNA, clone pJ125.
ORGANISM Mycoplasma hyopneumoniae
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 191)
AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
TITLE Reiterated DNA sequences defining genomic diversity within the
species Mycoplasma hyorhinis
JOURNAL Mol. Microbiol. 2, 665-672 (1988)
STANDARD simple staff_review
BASE COUNT 94 a 17 c 31 g 49 t
ORIGIN
1 gaactcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt
61 atttttgaaa gaaaataaag caagatatgc aaaaaagaga ataaaagctg atcttaataa
121 tagaggctat aaaattggac ttaaaaaagt tcgcagatta ttggaaaaat tcaatctcaa
181 agcaatttgt t
//
LOCUS MYCRDNAE 210 bp ds-DNA BCT 25-JUL-1990
DEFINITION M.hyorhinis B-repeat sequence DNA.
ACCESSION M35307
KEYWORDS B-repeat.
SOURCE M.hyorhinis DNA, clones pG102.[1,3].
ORGANISM Mycoplasma hyorhinis
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 210)
AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
TITLE Reiterated DNA sequences defining genomic diversity within the
species Mycoplasma hyorhinis
JOURNAL Mol. Microbiol. 2, 665-672 (1988)
STANDARD simple staff_review
BASE COUNT 84 a 29 c 28 g 69 t
ORIGIN
1 gaattcttta aatttagtag aaatcaaaaa aactcaacaa ggcaactgag ttcgttataa
61 aaaagtttat caatatgcta aattcgatgc aagaactaaa caatttatct tagttgaaaa
121 aggcgttcct tttactaata tgattattgc taatcaaaac aatctacatt tgaatatttt
181 gactgacagg ttctaaagaa tgcagcattt
//
LOCUS STRLACZ 209 bp ds-DNA BCT 25-JUL-1990
DEFINITION S.bovis lactose catabolic protein (lacZ) gene, 5' end.
ACCESSION M35285
KEYWORDS catabolic protein.
SOURCE S.bovis (strain H/3) DNA.
ORGANISM Streptococcus bovis
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Streptococcaceae.
REFERENCE 1 (bases 1 to 209)
AUTHORS Gilbert,H.J. and Hall,J.
TITLE Molecular cloning of Streptococcus bovis lactose catabolic genes
JOURNAL J. Gen. Microbiol. 133, 2285-2293 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 184 > 209 lactose catabolic protein (lacZ)
BASE COUNT 59 a 55 c 43 g 52 t
ORIGIN
1 tcgattagcc cttggaccct gctagtcttg acctgcctag gtttcccagg tcaagttccc
61 agttaccgac tacccgtaaa tcgatactac gccattgtta gatcggatct gaacccgtaa
121 ctttatagtt gggtatcgtg agcagatcac aatatcccac aataaaagga ggataacatc
181 caaatgatca cggacacagt ggccatcga
//
LOCUS STYSSCA 1551 bp ds-DNA BCT 25-JUL-1990
DEFINITION S.typhimurium Ssc protein (ssc) gene, complete cds.
ACCESSION M35193
KEYWORDS Ssc protein.
SOURCE S.typhimurium (strain SH5014, isolate LT2) DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1551)
AUTHORS Hirvas,L., Koski,P. and Vaara,M.
TITLE Characterization of a new protein encoding region between ompH and
lipid A biosynthesis genes of Salmonella typhimurium
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.H.Hirvas, 15-JUN-1990.
Author address: L.H.Hirvas
University of Helsinki
Dept of Bacteriology and Immunology
Haartmaninkatu 3
00290 Helsinki
FINLAND
FEATURES from to/span description
pept 19 1044 Ssc protein
BASE COUNT 362 a 354 c 438 g 397 t
ORIGIN
1 aaacaggtta aataagtaat gccttcaatt cgactggctg acttagcaga acagttggat
61 gcagaattac acggtgatgg cgatatcgtc atcaccggcg ttgcgtccat gcaatctgca
121 acaacaggcc acattacgtt tatggtgaat cctaagtacc gtgaacactt aggtttatgc
181 caggcttctg cggttgtcat gacgcaggac gatcttcctt ttgctaagag tgcggcgctg
241 gtagttaaaa atccctacct gacctacgcg cgcatggcgc aaattttaga tactacgccg
301 cagcccgcgc agaatatcgc gccaagcgcc gtgattgatg cgacggcaac gctgggtagc
361 aatgtttcag tcggcgcgaa tgcggtgatt gaatctggcg tacaactggg cgataacgtg
421 gttatcggcg caggctgttt cgtcggaaaa aatagcaaaa tcggggcggg ttcacgcttg
481 tgggcgaacg taacgattta ccacgacatt cagatcggtg agaattgcct gatccagtcc
541 agtacggtga tcggcgcgga cggttttggc tacgctaacg atcgtggcaa ctgggtgaag
601 atcccacaac tgggccgggt cattattggc gatcgtgtcg agatcggcgc ttgtaccacc
661 attgaccgtg gcgcgttgga tgatactgtt attggcaatg gcgtgattat tgataatcag
721 tgccagattg cacataacgt cgtgattggc gacaatacgg cagttgccgg tggcgtcatt
781 atggcgggta gcctgaagat tggccgttac tgcatgattg gcggcgccag cgtgatcaat
841 gggcatatgg aaatatgcga caaagtcacg gtaactggca tgggtatggt gatgcgtccc
901 atcacggaac cgggcgtcta ctcctcaggc attccgctgc aacccaacaa agtatggcgt
961 aaaactgctg cactggtgat gaacattgat gatatgagca agcgtctcaa agcgattgag
1021 cgcaaggtta atcaacaaga ctaacgttcc gccttgtagt tgccattctt ttccggcctg
1081 tcacattcat acgattgcgg caggccgtgt tattattgcc tttttgtata tttggacagg
1141 aagagtattt tgactactaa cactcatact ctgcagattg aagagatttt agagcttctg
1201 ccgcaccgtt ttccgttttt actggtcgat cgcgtgctgg actttgaaga aggtcgtttt
1261 ctgcgtgcgg tgaaaaatgt ctccgtcaac gagccgtttt tccaggggca tttcccgggc
1321 aaaccgattt tgccaggcgt gctgattctg gaagcgatgg cgcaggcaac cggtattctg
1381 gcgtttaaaa gcgttggtaa actggaacct ggcgaactgt attatttcgc gggtattgat
1441 gaagcgcgct ttaagcgtcc ggtggtgcca ggcgatcaga tgatcatgga agtcactttc
1501 gagaaaacgc gccgtggcct gacccgcttt aaaggggttg cgctggtcga c
//
LOCUS TOBRUBPA 979 bp ds-DNA PLN 25-JUL-1990
DEFINITION Tobacco ribulose-1,5-bisphosphate carboxylase small subunit gene,
exons 1 and 2.
ACCESSION M32419
KEYWORDS ribulose-1,5-bisphosphate carboxylase.
SOURCE Tobacco DNA, clone TSSU3-8.
ORGANISM Nicotiana tabacum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 979)
AUTHORS O'Neal,J.K., Pokalsky,A.R., Kiehne,K.L. and Shewmaker,C.K.
TITLE Isolation of tobacco SSU genes: Characterization of a
transcription- ally active pseudogene
JOURNAL Nucleic Acids Res. 15, 8661-8676 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 584 760 ribulose-1,5-bisphosphate carboxylase small
subunit precursor, exon 1
854 > 979 ribulose-1,5-bisphosphate carboxylase small
subunit precursor, exon 2
sigp 584 757 ribulose-1,5-bisphosphate carboxylase small
subunit signal peptide
matp 758 760 ribulose-1,5-bisphosphate carboxylase small
subunit
854 > 979 ribulose-1,5-bisphosphate carboxylase small
subunit
IVS 761 853 ribulose-1,5-bisphosphate carboxylase small
subunit intron A
BASE COUNT 320 a 175 c 185 g 299 t
ORIGIN
1 ttaattatgt ctttgtttgc ttctcatgtg ataaagaatc gaagccttga tgaacataat
61 ttgcatttga gtagtgaata gctgctttca caaagagtac tctagctatt aagtttagtt
121 tgaatatttt gaaacacaaa aatatatgta tacatacaaa aacaaatacc gcaatagtcc
181 aagcaaaagg gactttaaaa aaaaaaacca acctcaatta cacattcata tcctcttcct
241 accccatcta ggatgagata agattactga ggttgtttac acgtggcacc tccattgtgg
301 tgaattaaat gatcaatggc ttagctcaaa atataatttt ccaacctttc atgtgtggat
361 attaagtttt gtgtagtgaa tcaagaacca cataatccaa tggttagctt tactccaaga
421 tgagggggtt gttgattttt gtccgttaga tatgggaaat atgtaaaacc ttatcattat
481 atatagagtg gtgggcaact atgcaatgac catcttggaa gtttaaagga aaaaaaagga
541 aagggagaaa gagaaatctt tctgtcttaa agtgtaatta acaatggctt cctcagttct
601 ttcctctgca gcagttgcca cccgcagcaa tgttgctcaa gctaacatgg ttgcaccttt
661 cactggcctt aagtcagctg cctcattccc tgtttcaagg aagcaaaacc ttgacatcac
721 ttccattgcc agcaacggcg gaagagtgca atgcatgcag gtaatttata tacaatgaca
781 gtgcaaaaaa ttttgataca attaatgcat cttaacatgt catagctaaa aattctattt
841 tggtggaata taggtgtggc caccaattaa caagaagaag tacgagactc tctcatacct
901 tcctgatttg agccaggagc aattgcttag tgaagttgag taccttttga aaaatggatg
961 ggttccttgc ttggaattc
//
LOCUS TOBRUBPB 1337 bp ds-DNA PLN 25-JUL-1990
DEFINITION Tobacco ribulose-1,5-bisphosphate carboxylase small subunit
pseudogene, complete cds.
ACCESSION M32420
KEYWORDS pseudogene; ribulose-1,5-bisphosphate carboxylase.
SOURCE Tobacco DNA, clone TSSU3-2.
ORGANISM Nicotiana tabacum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 1337)
AUTHORS O'Neal,J.K., Pokalsky,A.R., Kiehne,K.L. and Shewmaker,C.K.
TITLE Isolation of tobacco SSU genes: Characterization of a
transcription- ally active pseudogene
JOURNAL Nucleic Acids Res. 15, 8661-8676 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps 272 451 ribulose-1,5-bisphosphate carboxylase small
subunit, exon 1
620 754 ribulose-1,5-bisphosphate carboxylase small
subunit, exon 2
912 1100 ribulose-1,5-bisphosphate carboxylase small
subunit, exon 3
IVS 452 619 ribulose-1,5-bisphosphate carboxylase small
subunit intron A
IVS 755 911 ribulose-1,5-bisphosphate carboxylase small
subunit intron B
BASE COUNT 379 a 249 c 256 g 453 t
ORIGIN
1 gttttagaaa atatttccca ttcacaaatt aagtttggga actttgagat aaggacgact
61 gagtgtaatc aatgtcaggg gttcaaattt atgtgcccgt caatttttca atccacggct
121 acgattcctc taagatgagg tcattgcttg cttgtgtccg ttagatgaga aaaagacgtg
181 aaaccttatc actatatata gcactcatca cacccttgaa agcaaaggtc aagggaagca
241 atagctttaa gctaaacaat tactttcaac aatggcttcg tctgtgattt cctcagccgc
301 tgccgttgcc accggcgcta atgcggctca agccagtatg gttgcacctt tcactggcct
361 caaatccgcc tactccttcc ctgtttccag aaaacaaaac cttgacatta cttccattgc
421 tagcaatggt ggaagagttt aatgcatgca ggtttgtagc atatattatt gtagttagct
481 tatataaact gatagagtaa agaaatttta cgttatatat tgatatattt taacctggta
541 atttgattta tttttcatat tattaatccc acttttttat tgtacttatg aagtttattt
601 taattcttta tatatatagg tgtggccacc aattaacaag aagaagtacg agacactctc
661 ataccttcct gatttgagcg aggagcaatt gcttagggaa gttgaatacc ttttgaaaaa
721 tggatgggtt ccttgcttgg aattcgagac tgaggtcaaa catctattct aaatcatgct
781 actattatca agcataacta acatgaataa ctcaatccta actagtttgg gattagacat
841 atatagttga ttaagtgaaa gaggagtatt atctcatgtt aatgttttgt ttatcttgtg
901 gatatgcgca gcacggattc gtctaccgtg agaataacaa gtcaccaggt tactacgatg
961 gaagggccac tcaggtcttg gctgaggtcg aggaggcaaa gaaggcttac ccacaagcct
1021 ggatcagaat cattggattc gacaacgtcc gtcaagtgca atgcatcagt ttcatcgcct
1081 acaagcccgc aggctactaa aatctccatt tttaagacaa cttaccgtat gtattcaggg
1141 gaagtttgtt tgaattctcc ttgtgttttt ccccggagaa actgttttgg ttttcctttg
1201 ttttaattcc ttctttctat tcggtgtata tttttgaatt ccaatcaagt ttatgagaac
1261 taataatgtc atttgtttct ttcgtaattt gctttgtggt gtacatcggt tttaattatc
1321 cgagtaatat ctgcttt
//
LOCUS ZYMCPA 1374 bp ss-RNA VRL 25-JUL-1990
DEFINITION Zucchini yellow mosaic virus coat protein (cp) mRNA, 3' end.
ACCESSION M35095
KEYWORDS capsid protein; coat protein.
SOURCE Zucchini yellow mosaic virus, cDNA to viral RNA, clone ZYKS-22cp.
ORGANISM Zucchini yellow mosaic virus
Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses;
Potyvirus.
REFERENCE 1 (bases 1 to 1374)
AUTHORS Gal-On,A., Antignus,Y., Rosner,A. and Raccah,B.
TITLE Nucleotide sequence of the zucchini yellow mosaic virus capsid-
encoding gene and its expression in Escherichia coli
JOURNAL Gene 87, 273-277 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 1164 coat protein (cp) (AA at 1)
mRNA < 1 1374 cp mRNA
BASE COUNT 423 a 279 c 342 g 330 t
ORIGIN
1 tcgacgaagg agagattgtt tccaatttta gagtgggata gaagcaaaga aattatgcac
61 cgaacagagg ctatttgcgc tgcgatgatt gaggcatggg gacacaccga gcttttacaa
121 gagatcagaa agttttatct atggttcgtt gaaaaggaag aagtgcgaga attagccgcc
181 ctcggaaaag ctccatacat agctgagaca gcacttcgta agctatacac tgacaaggga
241 gcggatacaa gtgaactggc acgttatcta caagccctcc accaagacat cttctttgaa
301 caaggagaca ctgtaatgct ccaatcaggc actcagccaa ctgtggcaga cactggagcc
361 acaaagaaag acaaagaaga tgacaaaggg aaaaacaagg atgttacagg ctccggctca
421 agtgagaaaa cagtggcagc tgtcacgaag gacaaggatg taaatgctgg ttctcatggg
481 aaaattgtgc cgcgtctttc gaagataaca aagaagatgt cactgccacg cgtgaaagga
541 aatgtgatac tcgacattga tcacttgctg gagtataagc cggatcaaat tgagttatac
601 aacacacgag cgtctcatca gcaattcgcc tcttggttca accaagttaa aacagaatat
661 gatctgaatg agcaacagat gggagttgta atgaatggtt tcatggtttg gtgcatcgaa
721 aatggcacgt cacccgacat taacggagta tgggttatga tggacggtaa tgagcaggtt
781 gaatatcctt tgaaaccaat agttgaaaat gcaaagccaa cgctgcgaca aataatgcat
841 cacttttcag atgcagcgga ggcatatata gagatgagaa atgcagaggc accatacatg
901 ccgaggtatg gtttgcttcg aaacttacgg gataggagtt tggcacgata tgctttcgac
961 ttctacgaag tcaattccaa aactccggaa agagcccgcg aagctgttgc gcagatgaaa
1021 gcagcagccc ttagcaatgt ttcttcaagg ttgtttggcc ttgatggaaa tgttgccacc
1081 actagcgaag acactgaacg gcacactgca cgtgatgtta ataggaacat gcacaccttg
1141 ctaggtgtga atacaatgca gtaaagggta ggtcgcctac ctaggttatc gtttcgctcc
1201 gacgtaattc taatatttac cgctttatgt gatgtcttta catttctaga gtgggcctcc
1261 cacctttaaa gcgtaaagtt tatgttagtt gtccaggagt gccgtagtcc tgtcggaagc
1321 tttagtgtga gcctctcacg aataagctcg agattagact ccgtttgcaa gcct
//GenBank-Updates@genbank.bio.net (07/26/90)
LOCUS ATUNPSS 6425 bp ds-DNA SYN 26-JUL-1990
DEFINITION A.tumefaciens neomycin phosphotransferase and
streptomycin/spectinomycin adenyltransferase, complete cds.
ACCESSION M35007
KEYWORDS neomycin phosphotransferase;
streptomycin/spectinomycin adenyltransferase.
SOURCE N.tabacum T-DNA inserts in A.tumefaciens DNA.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6425)
AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and
Depicker,A.
TITLE Cloning and sequence analysis of truncated T-DNA inserts from
Nicotiana tabacum
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.R.Gheysen, 01-JUN-1990.
FEATURES from to/span description
pept 2782 1985 (c) neomycin phosphotransferase
pept 4983 4042 (c) streptomycin/spectinomycin adenyltransferase
recomb 24 25 T-DNA end/plant DNA start
recomb 40 41 plant DNA end/T-DNA start
recomb 1094 1095 plant DNA end/T-DNA start
recomb 2786 2787 T-DNA end/plant DNA start
recomb 3044 3045 T-DNA end/plant DNA start
recomb 3354 3355 plant DNA end/T-DNA start
recomb 5585 5586 T-DNA end/plant DNA start
recomb 6389 6390 T-DNA end/plant DNA start
site 1 24 left T-DNA border
site 372 395 24 bp border-like sequence
site 1569 1592 24 bp border-like sequence
site 1669 1692 24 bp border-like sequence
site 1779 1756 (c) 24 bp border-like sequence
site 2128 2105 (c) 24 bp border-like sequence
site 2449 2472 24 bp border-like sequence
site 2485 2462 (c) 24 bp border-like sequence
site 3660 3683 24 bp border-like sequence
site 3875 3898 24 bp border-like sequence
site 4359 4336 (c) 24 bp border-like sequence
site 5868 5891 24 bp border-like sequence
BASE COUNT 1509 a 1754 c 1710 g 1452 t
ORIGIN
1 cggcaggata tattcaattg taaatggctt catgtccggg aaatctacat ggatcagcaa
61 tgagtatgat ggtcaatatg gagaaaaaga aagagtaatt accaattttt tttcaattca
121 aaaatgtaga tgtccgcagc gttattataa aatgaaagta cattttgata aaacgacaaa
181 ttacgatccg tcgtatttat aggcgaaagc aataaacaaa ttattctaat tcggaaatct
241 ttatttcgac gtgtctacat tcacgtccaa atgggggctt agatgagaaa cttcacgatc
301 gatgccttga tttcgccatt cccagatacc catttcatct tcagattggt ctgagattat
361 gcgaaaatat acactcatat acataaatac tgacagtttg agctaccaat tcagtgtagc
421 ccattacctc acataattca ctcaaatgct aggcagtctg tcaactcggc gtcaatttgt
481 cggccactat acgatagttg cgcaaatttt caaagtcctg gcctaacatc acacctctgt
541 cggcggcggg tcccatttgt gataaatcca ccatcacaat agatagtcta atggacgaaa
601 aaggcgaata tttcgatgct gagattcgac gcaattaatt cgagaaaaat cccgtgattg
661 atgctgttga gttaccaata atatgggcag cgaaggccat ttaattataa gatcctgcaa
721 gcctcgtcgt cctggccgga ccacgctatc tgtgcaaggt ccccggcccc ggacgcgcgc
781 tccatgagca gagcgcccgc cgccgaggcg aagagtcggg cggcgccctg cccgtcccac
841 caggtcaaca ggcggtaacc ggcctcttca tcgggaatgc gcgcgacctt cagcatcgcc
901 ggcatgtccc cctggcggac gggaagtatc cagctcgacc aaagcggcca tcgtgcctcc
961 ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc
1021 gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa
1081 ccaactcgcg aggggatcga gcccctgctg agcctcgaca tgttgtcgca aaattcgccc
1141 tggacccgcc caacgatttg tcgtcactgt caaggtttga cctgcacttc atttggggcc
1201 cacatacacc aaaaaaatgc tgcataattc tcggggcagc aagtcggtta cccggccgcc
1261 gtgctggacc gggttgaatg gtgcccgtaa ctttcggtag agcggacggc caatactcaa
1321 cttcaaggaa tctcacccat gcgcgccggc ggggaaccgg agttcccttc agtgaacgtt
1381 attagttcgc cgctcggtgt gtcgtagata ctagcccctg gggccttttg aaatttgaat
1441 aagatttatg taatcagtct tttaggtttg accggttctg ccgctttttt taaaattgga
1501 tttgtaataa taaaacgcaa ttgtttgtta ttgtggcgct ctatcataga tgtcgctata
1561 aacctattca gcacaatata ttgttttcat tttaatattg tacatataag tagtagggta
1621 caatcagtaa attgaacgga gaatattatt cataaaaata cgatagtaac gggtgatata
1681 ttcattagaa tgaaccgaaa ccggcggtaa ggatctgagc tacacatgct caggtttttt
1741 acaacgtgca caacagaatt gaaagcaaat atcatgcgat cataggcgtc tcgcatatct
1801 cattaaagca gggggtgggc gaagaactcc agcatgagat ccccgcgctg gaggatcatc
1861 cagccggcgt cccggaaaac gattccgaag cccaaccttt catagaaggc ggcggtggaa
1921 tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc
1981 ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg
2041 cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat
2101 cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg ccacagtcga
2161 tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca tcgccatggg
2221 tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac agttcggctg
2281 gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg gcttccatcc
2341 gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat
2401 caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg gcaggagcaa
2461 ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag tcccttcccg
2521 cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc agccacgata
2581 gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc ttgacaaaaa
2641 gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct
2701 gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa cctgcgtgca
2761 atccatcttg ttcaatccac atgatcagat ctctaggcgc gtgggtgcgg acgtagtcag
2821 cgccattgcc gatcgcgtga agttccgccg caaggccgct ggacccagat cctttacagg
2881 aaggccaacg gtggcgccca agaaggattt ccgcgacacc gagaccaata gcggaagccc
2941 caacgccgac ttcagctttt gaaggttcga cagcacgtgc agcgatgttt ccggtgcggg
3001 gctcaagaaa aatcccatcc ccggatcgag gatgagccgg tcggcagcga ccccgctccg
3061 tcgcaaggcg gaaacccgcg cctcgaagaa ccgcacaatc tcgtcgagcg cgtcttcggg
3121 tcgaaggtga ccggtgcggg tggcgatgcc atcccctgcg ctgagtgcat aaccaccagc
3181 ctgcagtccg cctcagcaat atcgggatag agcgcagggt caggaaatcc ttggatatcg
3241 ttcaggtagc ccacgccgcg cttgagcgct agcgcgggtt tccggttgga agctgtcgat
3301 tgaaacacgg tgcatctgat cggacagggc gtctaagagc ggcgcaatac gtctgatctc
3361 atcggccggc gatacaggcc tcgcgtccgg atggctggcg gccggtccga catccacgac
3421 gtctgatccg actcgcagca tttcgaccgc cgcggtgaca gcgttggtgg ggtctagcag
3481 tacgtcaatc gaagaaggag tcctcggtga gattcagaat gccgaacacc gtcaccatgg
3541 cgtcggcctc cgcagcgact tccacgatgg ggatcgggcg agcaaaaagg cagcaattat
3601 gagccccata cctacaaagc cccacgcatc aagcttttga ccctgaagca actaggcaat
3661 ggctgtaatt atgacgacgc cgagtcccga accagactgc ataagcaaca accgacagaa
3721 tggatttcga aaccagagaa agaaaataaa tgcgatgcca taaccgatta tgaacaacgg
3781 cggaaggggc aagcttagta aatgcctcgc tagattttaa tgcggatgtt gcgattactt
3841 cgccaactat tgcgataaca agaaaaagcc agcctttcat gatatatctc ccaatttgtg
3901 tagggcttat tatgcacgct taaaaataat aaaagcagac ttgacctgat agtttggctg
3961 tgagcaatta tgtgcttagt gcatctaatc gcttgagtta acgccggcga agcggcgtcg
4021 gcttgaacga attgttagac attatttgcc gactaccttg gtgatctcgc ctttcacgta
4081 gtggacaaat tcttccaact gatctgcgcg cgaggccaag cgatcttctt cttgtccaag
4141 ataagcctgt ctagcttcaa gtatgacggg ctgatactgg gccggcaggc gctccattgc
4201 ccagtcggca gcgacatcct tcggcgcgat tttgccggtt actgcgctgt accaaatgcg
4261 ggacaacgta agcactacat ttcgctcatc gccagcccag tcgggcggcg agttccatag
4321 cgttaaggtt tcatttagcg cctcaaatag atcctgttca ggaaccggat caaagagttc
4381 ctccgccgct ggacctacca aggcaacgct atgttctctt gcttttgtca gcaagatagc
4441 cagatcaatg tcgatcgtgg ctggctcgaa gatacctgca agaatgtcat tgcgctgcca
4501 ttctccaaat tgcagttcgc gcttagctgg ataacgccac ggaatgatgt cgtcgtgcac
4561 aacaatggtg acttctacag cgcggagaat ctcgctctct ccaggggaag ccgaagtttc
4621 caaaaggtcg ttgatcaaag ctcgccgcgt tgtttcatca agccttacgg tcaccgtaac
4681 cagcaaatca atatcactgt gtggcttcag gccgccatcc actgcggagc cgtacaaatg
4741 tacggccagc aacgtcggtt cgagatggcg ctcgatgacg ccaactacct ctgatagttg
4801 agtcgatact tcggcgatca ccgcttccct catgatgttt aactttgttt tagggcgact
4861 gccctgctgc gtaacatcgt tgctgctcca taacatcaaa catcgaccca cggcgtaacg
4921 cgcttgctgc ttggatgccc gaggcataga ctgtacccca aaaaaacagt cataacaagc
4981 catgaaaacc gccactgcgc cgttaccacc gctgcgttcg gtcaaggttc tggaccagtt
5041 gcgtgaggcc atacgctact tgcattacag cttacgaacc gaacaggctt atgtccactg
5101 ggttcgtgcc ttcatccgtt tccacggtgt gcgtcacccg gcaaccttgg gcagcagcga
5161 agtcgaggca tttctgtcct ggctggcgaa cgagcgcaag gtttcggtct ccacgcatcg
5221 tcaggcattg gcggccttgc tgttcttcta cggcaagtgc tgtgcacgga tctgccctgg
5281 cttcaggaga tcggaagacc tcggccgtcc gggcgcttgc cggtggtgct gaccccggat
5341 gaagtggttc gcatcctcgg ttttctggaa ggcgagcatc gtttgttcgc ccagcttctg
5401 tatggaacgg gcatgcggat cagtgagggt ttgcaactgc gggtcaagga ctggatttcg
5461 atcacggcac gatcatcgtg cgggagggca agggctccaa ggatcgggcc ttgatgttac
5521 ccgagagctt ggcacccagc ctgcgcgagc agctgtctcg tgcacgggca tggtggctga
5581 aggactaggc cgagggccgc agcggcgttg cgcttcccga cgcccttgag cggaagtatc
5641 cgcgcgccgg gcattcctgg ccgtggttct gggtttttgc gcagcacacg cattcgaccg
5701 atccacggag cggtgtcgtg cgtcgccatc acatgtatga ccagaccttt cagcgcgcct
5761 tcaaacgtgc cgtagaacaa gcaggcatca cgaagcccgc cacaccgcac accctccgcc
5821 actcgttcgc gacggccttg ctccgcagcg gttacgacat tcgaaccgtg caggatctgc
5881 tcggccattc cgacgtctct acgacgatga tttacacgca tgtgctgaaa gttggcggtg
5941 ccggagtgcg ctcaccgctt gatgcctgcc gcccctcact gtgagaggta gggcagcgca
6001 agtcaatcct agcggattca ctacccctgc gcgaaggcca tcggtgccgc atcgaacggc
6061 cggttgcgga aagtcctccc tgcgtccgct gatggccggc agcagcccgt cgttgaagga
6121 tccctgaaag cgacgttgga tgttaacatc tacaaattgc cttttcttac gaccatgtac
6181 gtaagcgctt acgtttttgg tggacccttg aggaaactgg tagctgttgt gggcctgtgg
6241 tctcaagatg gatcattaat ttccaccttc acctacgatg gggggcatcg caccggtgag
6301 taatattgta cggctaagag cgaatttggc ctgtagacct caattgcgag ctttctaatt
6361 tcaaactatt cgggcctaac ttttggtgtg atgatgctga ctggcaggat atataccgtt
6421 gtaat
//
LOCUS TOBNPTII 200 bp ds-DNA PLN 26-JUL-1990
DEFINITION N.tabacum nptII gene, complete cds.
ACCESSION M34757
KEYWORDS nptII protein.
SOURCE N.tabacum (strain SR1) DNA.
ORGANISM Nicotiana tabacum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 200)
AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and
Depicker,A.
TITLE Cloning and sequence analysis of truncated T-DNA inserts from
Nicotiana tabacum
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.R.Gheysen, 01-JUN-1990.
FEATURES from to/span description
pept 198 > 200 nptII protein
mRNA 133 > 200 nptII protein mRNA (5' end +/- 2 bp)
recomb 193 194 T-DNA end/plant DNA start
signal 25 30 CAAT box
signal 47 55 CAAT box
signal 101 107 TATA box
BASE COUNT 64 a 53 c 32 g 51 t
ORIGIN
1 caagcctcgc tagtcaaaag tgtaccaaac aacgctttac agcaagaacg gaaatgcgcg
61 tgacgctcgc ggtgacgcca tttcgccttt tcagaaatgg ataaatagcc ttgcttccta
121 ttatatcttc ccaaattacc aatacattac actagcatct gaatttcata accaatctcg
181 atacaccaaa tcggatcatg
//
LOCUS BOVANDRE 2461 bp ss-mRNA MAM 26-JUL-1990
DEFINITION Cow alpha-1C-adrenergic receptor mRNA, complete cds.
ACCESSION J05426
KEYWORDS alpha-1C-adrenergic receptor.
SOURCE Cow adult brain cortex, cDNA to mRNA, clone B12.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 2461)
AUTHORS Schwinn,D.A., Lomasney,J.W., Lorenz,W., Szklut,P.J.,
Fremeau,R.T.Jr., Yang-Feng,T.L., Caron,M.G., Lefkowitz,R.J. and
Cotecchia,S.
TITLE Molecular cloning and expression of the cDNA for a novel
alpha-1-adrenergic receptor subtype
JOURNAL J. Biol. Chem. 265, 8183-8189 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by D.Schwinn, 29-MAY-1990.
FEATURES from to/span description
pept 97 1497 alpha-1C-adrenergic receptor
BASE COUNT 551 a 667 c 647 g 596 t
ORIGIN
1 tgactccccg ctccctcgct cccctcctcc tcacccgccg aggggtggcc ctcaagagcc
61 ggactttgcc ggccccggcc ccggggggct gggaccatgg tgtttctctc cggaaatgcc
121 tccgacagct ccaactgcac ccacccgccg ccaccggtga acatttccaa ggccattctg
181 ctcggggtga tcttgggggg cctcatcctt ttcggggtac tggggaacat cctcgtgatc
241 ctttccgtgg cctgccaccg gcacctgcac tcggtcacac actactacat cgtcaacctg
301 gcggtggccg accttctcct cacttccacg gtgctgccct tctccgctat cttcgagatc
361 ttgggctact gggccttcgg cagggtcttc tgcaatgtct gggcggcggt ggacgtcctg
421 tgctgcacgg cttccatcat gggactctgc atcatctcca tcgaccgcta catcggcgtg
481 agctatcctc tgcgctaccc caccatcgtc acccagaaga ggggcctcat ggccctgctc
541 tgcgtctggg cgctctcttt ggtcatctcc atcgggcccc tcttcggctg gaggcagccg
601 gccccggagg acgagaccat ctgccagatc aacgaggagc cgggctacgt gctcttctcg
661 gctctgggct ccttctacgt gccgctgacc atcatcctgg tcatgtactg ccgggtctac
721 gtcgtggcca agagggagag ccggggcctc aagtcgggcc ttaagaccga caagtcagac
781 tcggagcagg tgacgctccg catccatcgc aaaaacgccc aggtaggagg cagcggggtg
841 accagcgcca agaacaagac gcacttctcc gtgagactgc tcaaattttc ccgcgagaag
901 aaagcggcca aaacgctggg catcgtggtc ggctgcttcg tcctctgctg gctgcctttt
961 ttcttagtga tgcccattgg gtctttcttt cctgatttca ggccctcaga aaccgttttt
1021 aaaatagcat tttggctcgg ttacctaaac agctgcatca accccattat atacccatgc
1081 tccagtcaag agtttaaaaa ggcctttcag aatgtcttga gaatccagtg tctgcgacga
1141 aagcagtcct ccaaacacac cctgggctac acgctgcacg cacccagcca cgtcctggag
1201 ggacagcaca aggacctggt tcgcattccg gtgggatctg cagagacctt ctataagatc
1261 tccaagacgg atggggtctg tgaatggaaa attttctctt ccctaccccg cggatctgcc
1321 aggatggcgg tggccagaga cccatcagcc tgcaccactg cccgggtgag aagtaaaagc
1381 tttttgcaag tgtgctgttg cctggggccc tcgaccccca gtcatggaga gaatcatcag
1441 attccgacca ttaagatcca caccatctcc ctcagtgaaa atggggagga agtctaaagg
1501 acaggaaagg tcagaaggat gggagggtga tcttaggtac ccactctcca cttccttctg
1561 ggaaggccag ttcacgttcc gtggatgctg agacacagcc agtaaaccag ggaccatctg
1621 ggaatgggct ggggaggaga gctgactctg gggcagaggt agggcttaga gacgagagag
1681 gatgtcctac caccatccag ttcactatga tgagaaacag catttccttg aggctaatgc
1741 tctctgggtc attctctgag cctgctttct acgcctgtcc ctttcaacga caaacaccat
1801 gggaaacaga atttcataca caatccaaaa gacgataaat ataggattat gatttcatca
1861 tgaatatttt gagcatgcac tctaagtttg gagctatttc ttgatggagt gaggggattt
1921 tatttccagg ctaaacttgc tgaaagccac gttggatttt tatggagaga aggcctggag
1981 aggaagagcc ttaagatggt ggccaatatc cagacgcatt atttttagag caagttttac
2041 agtccaccct ttctcagttt gggtgaaact tgacagtgag attttattta ccttttgctg
2101 ctgcttgaca ggatactgct cccaattccc taaggatgag ggtgaggggt actcattatg
2161 ccaatggtca tctgcacttg ggtatagaga gtgttgaaag aaccagttgg gaaaaggatg
2221 gcttttcctg gtggaagaca gtaaggatga gagtcagttc ttcaaattct atggacagaa
2281 ttccattaag tggttccaag atcaggtgga ggaaggcttc ttgtgtaaca tatttaaaga
2341 tcaagagttt ggggtggggt gggtgctact ttcaagctaa gatagaggct gcaaaattac
2401 tccacagcct tttcaacatg gcatagaaag gcttttcttg gcaaatcact taccttttcc
2461 a
//
LOCUS CHKANCC2A 1229 bp ss-mRNA VRT 26-JUL-1990
DEFINITION Chicken anchorin CII mRNA, 3' end.
ACCESSION M30971 J03194
KEYWORDS anchorin; collagen-binding protein.
SOURCE Chicken cartilage (sternum) and bone, cDNA to mRNA, clones
A[1,4,6,7,14,15,22,23].
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 1229)
AUTHORS Fernandez,M.P., Selmin,O., Martin,G.R., Yamada,Y., Pfaeffle,M.,
Deutzmann,R., Mollenhauer,J. and von der Mark,K.
TITLE The structure of anchorin CII, a collagen binding protein isolated
from chondrocyte membrane
JOURNAL J. Biol. Chem. 263, 5921-5925 (1988)
STANDARD simple staff_entry
REFERENCE 2 (bases 373 to 504)
AUTHORS Fernandez,M.P., Selmin,O., Martin,G.R., Yamada,Y., Pfaeffle,M.,
Deutzmann,R., Mollenhauer,J. and von der Mark,K.
TITLE The structure of anchorin CII, a collagen binding protein isolated
from chondrocyte membrane
JOURNAL J. Biol. Chem. 265, 8344-8344 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 19 984 anchorin CII
BASE COUNT 353 a 230 c 310 g 336 t
ORIGIN 28 bp upstream of AccI site.
1 cccggcgaac cggggaagat ggcgaagtat acaagaggca ccgtgacagc attctctcct
61 tttgatgcca gagctgatgc agaagccctt cgcaaggcca tgaagggaat ggggactgat
121 gaagagacaa ttctgaagat ccttaccagc agaaataatg ctcaacgtca agaaattgca
181 tctgctttta aaacactgtt tggcagggat cttgtggatg acctgaaatc agaacttact
241 ggcaagtttg aaacactgat ggtatctttg atgagaccag cacgtatttt tgatgcgcat
301 gcactgaagc atgcaatcaa gggagcagga accaatgaga aagtgttgac tgaaattctt
361 gcctccagaa cacctgctga agtgcagaat attaaacagg tttatatgca agagtatgag
421 gccaacttgg aggataagat cacaggagag acatcaggcc attttcagag actgctggtg
481 gtcctgctgc aggcaaatag agatcctgat ggcagagttg acgaggctct tgttgagaag
541 gatgctcagg tcttgtttag agctggggag ctaaaatggg gaacagatga agaaacattc
601 atcaccatct tgggaactcg aagtgtttct catttgagga gggtgtttga caaatacatg
661 actatttctg gctttcaaat tgaagaaacc attgaccgtg aaacctctgg tgatttggag
721 aagttgcttt tggcagttgt gaagtgcatc cgaagtgtgc ctgcttattt tgctgaaact
781 ttgtattatt ctatgaaagg ggctggcact gatgatgata ccctgatcag agtcatggtt
841 tcaagaagtg aaatcgacct gttggatatt agacatgaat tcagaaagaa ttttgcgaaa
901 tcgttgtatc agatgattca gaaagataca tctggggact acaggaaggc actcctgctc
961 ctctgtggtg gagatgatga gtaatggtgg cagcgacgtg aaggatttct tgtaatccag
1021 ctttgcagcc cttcagttag catgcctagc taagattttg catcttaatg ctttatggct
1081 gttcgaattt atattcatat cacacttatt aaacacaaac atgttactac tagctgataa
1141 acagtccctc ctcctcagac gtcctgactc tgggaatttc agtgccttct gagtgtatgc
1201 aaagtctctc atggagtaga gtagtatcg
//
LOCUS ECOHLYCA 633 bp ds-DNA BCT 26-JUL-1990
DEFINITION E.coli hly plasmid hemolysin (hlyC) gene, complete cds.
ACCESSION M35668
KEYWORDS hemolysin.
SOURCE E.coli hly plasmid pHly152 DNA, clone pANN202-419.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 633)
AUTHORS Goebel,W., Hacker,J., Knapp,S., Then,I., Wagner,W., Hughes,C. and
Juarez,A.
TITLE Structure, function, and regulation of the plasmid-encoded
hemolysin determinant of Escherichia coli
JOURNAL Basic Life Sci. 30, 791-805 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept 121 633 hemolysin (hlyC)
BASE COUNT 219 a 99 c 115 g 200 t
ORIGIN
1 tagtcacgca ataaaacgtt ctttaatatt aatgcagtta tgacattaaa ggcaagaaac
61 ataaaggcat atttttgcca caatatttaa tcatataatt taagttgtag tgagtttatt
121 atgaatataa acaaaccatt agagattctt gggcatgtat cctggctatg ggccagttct
181 ccactacaca gaaactggcc agtatctttg tttgcaataa atgtattacc cgcaatacag
241 gctaaccaat atgttttatt aacccgggat gattaccctg tcgcgtattg tagttgggct
301 aatttaagtt tagaaaatga aattaaatat cttaatgatg ttacctcatt agttgcagaa
361 gactggactt caggtgatcg taaatggttc attgactgga ttgctccttt cggggataac
421 ggtgccctgt acaaatatat gcgaaaaaaa ttccctgatg aactattcag agccatcagg
481 gtggatccca aaactcatgt tggtaaagta tcagaatttc atggaggtaa aattgataaa
541 cagttagcga ataaaatttt taaacaatat caccacgagt taataactga agtaaaaaga
601 aagtcagatt ttaatttttc attaactggt taa
//
LOCUS ECOTRMX4 77 bp ss-tRNA RNA 26-JUL-1990
DEFINITION E.coli f-Met-tRNA.
ACCESSION M35184
KEYWORDS transfer RNA-f-Met.
SOURCE E.coli tRNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 77)
AUTHORS Dube,S.K., Marcker,K.A., Clark,B.F.C. and Cory,S.
TITLE Nucleotide sequence of N-formyl-methionyl-transfer RNA
JOURNAL Nature 218, 232-233 (1968)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 77 f-Met-tRNA
anticdn 35 37 f-Met-tRNA anticodon cat
modified 21 21 d
modified 33 33 2'Ome
modified 47 47 m7g
modified 56 56 p
BASE COUNT 14 a 26 c 25 g 12 t
ORIGIN
1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggtcg tcggttcaaa
61 tccggccccc gcaacca
//
LOCUS HECDA8 1435 bp ss-rRNA BCT 26-JUL-1990
DEFINITION H.mustelae 16S ribosomal RNA.
ACCESSION M35048
KEYWORDS 16S ribosomal RNA.
SOURCE H.mustelae (strain ATCC 43772) ribosomal RNA.
ORGANISM Helicobacter mustelae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 9 to 1435)
AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and
Ferrero,R.
TITLE The phylogeny of Helicobacter felis sp. nov., a spiral-shaped
bacterium isolated from the gastric mucosa of the cat, Helicobacter
mustelae, and related bacteria
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1435)
AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and
Ferrero,R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by B.J.Paster, 06-JUN-1990.
Author address:B.J.Paster
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 > 1435 16S ribosomal RNA
BASE COUNT 375 a 301 c 412 g 326 t 21 others
ORIGIN
1 attatggaga gtttnatcct ggctcagagt gaacgctggc ggcgtgccta atacatgcaa
61 gtcgaacgat gaagcttcta gcttgctaga agtggattag tggcgcacgg gtgagtaacg
121 cataggttat gtgccccata gtctgggata gccactggaa acggtgatta atactggata
181 ctcctacggg ggnaaagntn ttcgctatgg gatcagccta tgtcctatca gcttgttggt
241 gaggtaatgg ctcacnnagg ctatgacggg tatccggcct nagagggtga tcggacacac
301 tggaactgag acacggtcca gactcctacg ggaggcagca gtagggaata ttgctcaatg
361 ggcgaaagcc tgaagcagca acgccgcgtg gaggatgaag gttttaggat tgtaaactcc
421 ttttctaaga gaagataatg acggtatctt aggaataagc accggcnnac tccgtgccag
481 cagccgcggn antacggagg gtgcnagcgt tactcggaat cactgggcgt naagagcgcg
541 taggcggagt aataagtcag atgtgaaatc ctgtagctta actacagaac tgcatttgaa
601 actgttattc tagagtgtgg gagaggtagg tggaattctt ggtgtagggg tnaaatccgt
661 agagatcaag aggaatactc attgcgaagg cgacctactg gaacattact gacgctgatg
721 cgcgaaagcg tggggagcaa acaggattag ataccctggt agtccacgcc ctaaacgatg
781 aatgctagtt gttggggtgc ttgtcactcc agtaatgcag ttaacacatt aagcattccg
841 cctggggagt acggtcgcaa gattaaaact caaaggaata gacggggacc cgcacaagcg
901 gtggagcatg tggtttaatt cgannntacg cgaagaacct tacctaggct tgacattgat
961 agaatctgct agaaatagcg gagtgtctag tttactagac cttgaaaaca ggtgctgcac
1021 ggctgtcgtc agctcgtgtc gtgagatgtt gggttaagtc ccgcaacgag cgcaaccctc
1081 gttcttagtt gctagcagtt cggctgagca ctctaagaag actgccttcg tnaggaggag
1141 gaaggtgagg acgacgttaa gtcatcatgg cccttacgcc tagggctaca cacgtgctac
1201 aatggggtgc acaaagagac gcaataccgc gaggtggagc aaatctcaaa aacatctctc
1261 agttcggatt gtagtctgca actcgactac atgaagctgg aatcgctagt aatcgtgaat
1321 cagccatgtc acggtgaata cgttcccggg tcttgtactc accgnccgtc acaccatggg
1381 agttgtattc gccttaagcc gggatgctaa attggctacc gtccanggcg gatnc
//
LOCUS HECRDA 1446 bp ss-rRNA BCT 26-JUL-1990
DEFINITION H.felis 16S ribosomal RNA.
ACCESSION M35047
KEYWORDS 16S ribosomal RNA.
SOURCE H.felis (ATCC 49179) ribosomal RNA.
ORGANISM Helicobacter felis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 9 to 1446)
AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and
Ferrero,R.
TITLE The phylogeny of Helicobacter felis sp. nov., a spiral-shaped
bacterium isolated from the gastric mucosa of the cat, Helicobacter
mustelae, and related bacteria
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1446)
AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and
Ferrero,R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by B.J.Paster, 06-JUN-1990.
Author address:B.J.Paster
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 > 1446 16S ribosomal RNA
BASE COUNT 354 a 299 c 402 g 310 t 81 others
ORIGIN
1 tttatggaga gtttgatcct ggctcagagt gaacgctggc ggcgtgccta atacatgcaa
61 gtcgaacgat gaagcctagc ttgctaggcg gattagtggc gcacgggtga gtaacgcata
121 gatgacatgc cctttagttt gggatagcca ctagaaatgg tgattaatac caaatactac
181 ctacggggga aagatttatc gctaaaggat tggtctatgt cctatcagct tgttggtgag
241 gtaaaggctc acnnaggcta tgacgggtat ccggcctgag agggtgaacg gacacactgg
301 aactgagaca cggtccagac tccnncggga ggcagcagta gggaatattg ctcaatgggc
361 gcaagcctga agcagcaacg ccgcgtggag gatgaaggtt ttaggattgt aaactccttt
421 tgtcagagaa gataatgacg gtatctgacg aataagcacc ggctanctcc gtgccagcag
481 ccgcggtaat acggagggtg cnagcgttac tcggaatcnc tgggcgtaaa gagtgcgtag
541 gcggggttgt aagtcagatg tgaaatccta tggcttaacc atagaactgc atttgaaact
601 acaactctgg agtgtgggag aggtaggtgg aattcttggt gtaggggtaa aatccgtaga
661 gatcaagagg aatactcatt gcgaaggcga cctgctggaa caatactgac gctgattgcn
721 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccct aaacgatgga
781 tgctagttgt tggggggctt tgtcctccca gtaatgcagc taacgcctta agcatcccgc
841 ctggggagta cggtcgcaag annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc gaagaacctt acctaggctt gacattgaan
961 gaatctgcta gaaatatgtg agtgtctagc ttgctagacc ctgaaaacag gtgctgcacg
1021 gctgtcgtca gctcgtgtcg tgagatgttg ggttaagtcc cgcaacgagc gcaaccctct
1081 ttcttagttg ctaacaggta gtgctgagct ctctaagaat actgcctgcg taagcaggag
1141 gaaggtgagg acgacgtcaa gtcatcatgg cccttacgcc tagggctaca cacgtgctac
1201 aatggggtgc acaaagagat gcaatgccgc gaggttgagc caatcttaaa aacnnctctc
1261 agttcggatt gcaggctgca actcgcctgc atgaagctgg aatcgctagt aatcgcaaat
1321 cagccatgtt gcggtgaata cgttcccggg tcttgtactc accgnncgtc acaccatggg
1381 agttgtgttt gccttaagtc aggatgctaa ggtagctact gcccacggca cacacagcga
1441 ctgggg
//
LOCUS HUMHPBS 821 bp ss-mRNA PRI 26-JUL-1990
DEFINITION Human peripheral benzodiazepine receptor (hpbs) mRNA, complete cds.
ACCESSION M36035
KEYWORDS peripheral benzodiazepine receptor.
SOURCE Human hystiocytic lymphoma monocyte-like cell line U937, cDNA to
mRNA, clone p-hPBS11.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 821)
AUTHORS Riond,J., Mattei,M.G., Kaghad,M., Dumont,X., Guillemot,J.C.,
Le Fur,G., Caput,D. and Ferrara,P.
TITLE Molecular cloning and chromosomal localization of a human
peripheral-type benzodiazepine receptor
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Riond, 27-JUN-1990.
Author address:J.Riond
SANOFI ELF BIO RECHERCHES
BP137 31328 LABEGE CEDEX
FRANCE
FEATURES from to/span description
pept 62 571 peripheral benzodiazepine receptor
mRNA < 1 811 peripheral benzodiazepine receptor mRNA
site 800 805 polyadenylation site
BASE COUNT 118 a 271 c 260 g 171 t 1 others
ORIGIN Chromosome 22, map position q13.3.
1 agtgcccttc ccggagcgtg ccctcgccgc tgagctcccc tgaacagcag ctgcagcagc
61 catggccccg ccctgggtgc ccgccatggg cttcacgctg gcgcccagcc tggggtgctt
121 cgtgggctcc cgctttgtcc acggcgaggg tctccgctgg tacgccggcc tgcagaagcc
181 ctcgtggcac ccgccccact gggtgctggg ccctgtctgg ggcacgctct actcagccat
241 ggggtacggc tcctacctgg tctggaaaga gctgggaggc ttcacagaga aggctgtggt
301 tcccctgggc ctctacactg ggcagctggc cctgaactgg gcatggcccc ccatcttctt
361 tggtgcccga caaatgggct gggccttggt ggatctcctg ctggtcagtg gggcggcggc
421 ngccactacc gtggcctggt accaggtgag cccgctggcc gcccgcctgc tctaccccta
481 cctggcctgg ctggccttcg cgaccacact caactactgc gtatggcggg acaaccatgg
541 ctggcatggg ggacggcggc tgccagagtg agtgcccggc ccaccaggga ctgcagctgc
601 accagcaggt gccatcacgc ttgtgatgtg gtggccgtca cgctttcatg accactgggc
661 ctgctagtct gtcagggcct tggcccaggg gtcagcagag cttcagaggt tgccccacct
721 gagcccccac ccgggagcag tgtcctgtgc tttctgcatg cttagagcat gttcttggaa
781 catggaattt tataagctga ataaagtttt tgacttcctt t
//
LOCUS XELAAA 121 bp ss-rRNA VRT 26-JUL-1990
DEFINITION X.laevis 5S ribosomal RNA.
ACCESSION M35175
KEYWORDS 5S ribosomal RNA.
SOURCE X.laevis somatic cell ribosomal RNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Wegnez,M. and Denis,H.
TITLE Expression des genes ribosomiques 5 S chez le Xenope
JOURNAL Arch. Int. Physiol. Biochim. 81, 211-213 (1973)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 121 5S ribosomal RNA
BASE COUNT 24 a 34 c 38 g 25 t
ORIGIN
1 gcctacggcc acaccaccct gaaagtgccc gatctcgtct gatctcggaa gccaagcagg
61 gtcgggcctg gttagtactt ggatgggaga ccgcctggga ataccaggtg tcgtaggctt
121 t
//
LOCUS XELAAB 121 bp ss-rRNA VRT 26-JUL-1990
DEFINITION X.laevis 5S ribosomal RNA.
ACCESSION M35176
KEYWORDS 5S ribosomal RNA.
SOURCE X.laevis oocyte ribosomal RNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Wegnez,M. and Denis,H.
TITLE Expression des genes ribosomiques 5 S chez le Xenope
JOURNAL Arch. Int. Physiol. Biochim. 81, 211-213 (1973)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 121 5S ribosomal RNA
BASE COUNT 25 a 33 c 37 g 26 t
ORIGIN
1 gcctacggcc acaccaccct gaaagtgcct gatctcgtct gatctcagaa gcgatacagg
61 gtcgggcctg gttagtactc ggatgggaga ccgcctggga ataccaggtg tcgtaggctt
121 t
//
LOCUS ECORR50L1A 165 bp ds-DNA RNA 26-JUL-1990
DEFINITION E.coli 50S rRNA protein L1-associated RNA.
ACCESSION M24864
KEYWORDS 50S ribosomal RNA.
SOURCE E.coli 50S ribosomal RNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 165)
AUTHORS Branlant,C., Krol,A., Sriwidada,J. and Brimacombe,R.
TITLE RNA sequences associated with proteins L1, L9, and L5, L18, L25, in
ribonucleoprotein fragments isolated from the 50-S subunit of
Escherichia coli ribosomes
JOURNAL Eur. J. Biochem. 70, 483-492 (1976)
STANDARD simple staff_entry
FEATURES from to/span description
modified 13 13 7-methyluridine
unsure 61 61 u could be a
unsure 141 141 c could be g
BASE COUNT 36 a 35 c 47 g 47 t
ORIGIN
1 taacctttac tatggcgaca ctgaacattg agccttgatg tgtaggatag gtgggagctt
61 tgaagtggac gtgccagtct gcatggagcc gaccttgaaa taccctttac aatgtttgat
121 gttctaacgt ggacccgctt cgggttgcat cgcggacagt gtctg
//
LOCUS BMOSP1 1512 bp ds-DNA INV 26-JUL-1990
DEFINITION Silkworm (B.mori) storage protein 2 (SP2) gene, exon 1.
ACCESSION M24371 J04829
KEYWORDS arylphorin-type storage protein; storage protein;
storage protein 2.
SEGMENT 1 of 2
SOURCE Silkworm (strain Tokai x Asahi; 5th larval instar) DNA and cDNA to
mRNA.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 983)
AUTHORS Fujii Tomino,S.
TITLE Structure of the gene for the arylphorin-type storage protein, Sp 2
of Bombyx mori
JOURNAL J. Biol. Chem. 264, 11020-11025 (1989)
STANDARD simple staff_review
REFERENCE 2 (bases 984 to 1512)
AUTHORS Fujii,T., Sakurai,H., Izumi,S. and Tomino,S.
JOURNAL Unpublished (1989) 2-1-1 Fukazawa, Setagaya-ku, Tokyo 158, Japan
STANDARD simple staff_review
COMMENT Draft entry and sequence for [1],[2] kindly submitted by S.Tomino,
28-APR-1989.
FEATURES from to/span description
pept 796 + 883 storage protein SP2, exon 1
pre-msg 771 > 1512 SP2 mRNA and intron
IVS 884 > 1512 SP2 intron A
binding 436 441 glucocorticoid-receptor binding site
site 447 457 SV40 enhancer core
conflict 199 199 c in [1]; g in [2]
BASE COUNT 475 a 266 c 247 g 524 t
ORIGIN Unreported.
1 aagcttttta aaaaaagaac tttatttaat tttaataatt aaaacatttg aaattaacaa
61 ttgaaattaa ttggcgcaag tgtcaccggg agcgcggtta gaattgaact gcgtgatcta
121 tcggtaacct aactaagctg cattacgtcg tgcaccttac attgcacatt tatgtacatt
181 aaaaatatat aacagtaccc aataaaaaag cattatttcg tcttgtaaca gtcggttgaa
241 aaattgaaag taattaacga catgcttaga gtttcgatcg tagtaaaagc tacgttttgt
301 ctatcatatt agaaagatat agtaacttct tttgtctctc tttattcttt aaattttact
361 taatcaggtg aatagctttt actactttac tcaatgtttt catcatactc ctggctaagt
421 cttcgctagc ccgcctgtcc tagtaagccg tggaaaggct ccgggacacc agcaaacctt
481 caatcataaa aaaaattgct ttcatcatgt tttcgtttac agttttacaa atatttcata
541 attttccatt cctttttttt gaattatata ataataacaa gaaaaaaact ttatatctat
601 ttgtttatca tcatcgttga aatttatatt cagtaattca aattatgaga ccggtgaaaa
661 ggtcagtaga ttacgttgat aatgaaagca taacacttgt tgctaatgag tgcatgtttc
721 gggagaagat aaagtgtggg tataaatatt cgaaaacgga ttgcagaagc acagtttgct
781 tctaggctgg aaaccatgaa gtctgtcttg attctggctg ggcttgtagc cgtcgcgctc
841 agcagtgcag taccaaaacc gagcaccata aagtcaaaaa atggtaagcg ttaaatagta
901 gtgctctatt ttaatacgct tttattatta ttattattat tattaattct ttatttcagt
961 tttgtttttt aaaaccataa cattttgtta gtagtaatta cttatatcta tgttagtgac
1021 ttaaaaaatc taacacataa ctctcattat atatatacat tttataccat tacatttttt
1081 attttatttt tttctccttc caagtgccta ctgcaaaggc tattgatcag cagtccctcg
1141 atcttgctcg atatgattct caaaagactg ttgccactgt cacgaactcg acgcaataac
1201 gatgcacttc tcttccgcat tattgcaaag aagtcatcgg tgtgagatgt cgcaaacatt
1261 gtggatgcac tacaaaagcg cggcagtgac aacatcatcc taaacgcatt attatattga
1321 acgcgtaggg cattgtaagc tctccgcgtg tatgtggtcc acagactact ggcgtaaaaa
1381 ttctggcaat aagctttaaa aattgtaatt tgacatacta tcgcaaccag taaatctgcg
1441 ggccagcata ttgcatctta ctatcaatta ttattattat tttttttatt gcttagatgt
1501 gtggacgagc tc
//
LOCUS BMOSP2 3876 bp ds-DNA INV 26-JUL-1990
DEFINITION Silkworm (B.mori) storage protein 2 (SP2) gene, exons 2,3,4, and 5.
ACCESSION M24370 J04829
KEYWORDS arylphorin-type storage protein; storage protein;
storage protein 2.
SEGMENT 2 of 2
SOURCE Silkworm (strain Tokai x Asahi; 5th larval instar) DNA and cDNA to
mRNA.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1288 to 3876)
AUTHORS Fujii,T., Sakurai,H., Izumi,S. and Tomino,S.
TITLE Structure of the gene for the arylphorin-type storage protein, Sp 2
of Bombyx mori
JOURNAL J. Biol. Chem. 264, 11020-11025 (1989)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 1287)
AUTHORS Tomino,S.
JOURNAL Unpublished (1989) 2-1-1 Fukazawa, Setagaya-ku, Tokyo 158, Japan
STANDARD simple staff_review
COMMENT Draft entry and sequence for [1],[2] kindly submitted by S.Tomino,
28-APR-1989.
FEATURES from to/span description
pept + 1388 1521 storage protein SP2, exon 2
1614 2444 storage protein SP2, exon 3
2526 2692 storage protein SP2, exon 4
2850 3744 storage protein SP2, exon 5
pre-msg < 1 3846 SP2 mRNA and introns
IVS < 1 1387 SP2 intron A
IVS 1522 1613 SP2 intron B
IVS 2445 2525 SP2 intron C
IVS 2693 2849 SP2 intron D
BASE COUNT 1204 a 734 c 696 g 1242 t
ORIGIN Unknown number of bp after segment 1.
1 ttgctagccc ttcttcttta tgttttggag aaggttctca attcaaaatg tacgttttca
61 ttatagcctt attacgaaag cttatacgaa cgttatatct ttaactatgc atacagccgt
121 ctattgaatc attgttgtta taaattgttt tacaattgct ataggctcac atctctttct
181 gaggcgtgat ttagaaaagg atgcacgatg cgtgatccaa tttggaattt gatagctcgg
241 cctcatctcc tgcctcatag caaggccgat tttgtgaggc ctcctatcta aactaaaaag
301 aacaaaaccg cacttacccc gcagcggccg actaggttgc actgttgcta taccatcatt
361 tgtatgttgg tatattatta ccgctgtaat gtataggtac attaccgcca gtattgcata
421 tgttgcacga tgaacatgtt caatatatgt aaaatttaca atttaaatac gtcaccgttt
481 caacacaaaa ctatttgcaa atggattcat cattcatcat ctaaactcgt cgtggcctaa
541 aggataagac gtccggtgca ttcgtgttga gcgatgcacc ggtgctcgaa tcccaagcgg
601 gtaccaattt ttctaatgga atacgtactc aacaaatgtt catgattgac ttccacggta
661 aaggaataac atctatacta atattataaa gaggaaagat ttgtttgttt gtttgtttcg
721 aataggctcc gaaactactg gaccgatttg aaaaattctt tttccattag aagccaacat
781 tgtccctgat gaacataggc tacatttttt aatttttttt tttttttttg tttcatgtgt
841 gttttaatgt ttccgaagcg aagcgagggc gggtcgctag tcgtgtaata aaaatcaaag
901 ccgcaaaaat tataatttgc gtaattacta gtggtaggac ctcttgtgac gcaagggtag
961 gtacttgaga ccttagaatt tatatctcaa ggtgggtggt gcatatacgt tgtaaatgtc
1021 tatggggtct agtaaccgct taacaccagg tggactagtt cagccaccta agcaataaaa
1081 ataaaaatca tcaaaataga aaatcaacca ttgtaggttt ataccgtatt gactaagtaa
1141 taaagaaaag caggtttttt ttacaaacaa ccaaattatg taataaaagt aaatatagta
1201 agctatgaac gaccgattag tggtaacata tcggcgctga aagttcctaa tgtgctttga
1261 tgccaatatt tatctcagaa ttgaagttat tcaatacttt ccagataatg atgacatcta
1321 agtgatatcg cttattcgta aatacttctt tataaaatat ttacatatat ttttttactt
1381 tattcagtgg atgccgtatt tgttgaaaag caaaagaaaa ttctgtcctt cttccaagat
1441 gtgagccaac taaacactga tgatgaatat tataaaattg gcaaagacta tgatatcgaa
1501 atgaatatgg acaactacac tgtaagtact aataattaat atcaatttaa atttaacgtg
1561 aatttgtttg tttctttctt tctttattga aaaccatgtt tcatatttaa cagaacaaga
1621 aagctgttga agaatttctg aagatgtaca ggactggttt tatgcctaag aatttagagt
1681 tctccgtttt ttatgacaag atgagggatg aagctattgc tctattggat ttattctatt
1741 acgctaagga ctttgaaacg ttctacaaga gtgcctgttt tgcgcgtgtg catctcaatc
1801 aaggtcaatt cttgtatgcc ttctacatcg ctgttatcca gcgccctgat tgccacggtt
1861 tcgttgttcc tgctccgtat gaagtatacc ctaaaatgtt tatgaatatg gaagtgctgc
1921 aaaaaattta cgtaacaaag atgcaacatg gcctcattaa tcctgaagcc gcagctaagt
1981 atggcattca caaggaaaac gactacttcg tttacaaagc caattattct aacgccgttt
2041 tatacaataa tgaagaacaa aggctgacat acttcactga ggatattggc atgaacgctt
2101 actactacta cttccactct catttaccgt tctggtggac atcagaaaaa tacggagccc
2161 ttaaagagcg tcgtggagag gtttacttct acttctacca gcaattattg gctcgttact
2221 actttgagcg tcttaccaat ggacttggta agattcccga attctcatgg tactctccga
2281 taaagactgg atactatcca ttgatgctaa ctaagtttac acccttcgca caaagacctg
2341 actactacaa cttgcacacc gaagaaaact atgaaagagt aagattcctt gacacttatg
2401 agaagacatt cgttcagttc ctccaaaagg accactttga agccgtaagt tcgaacatta
2461 agtgtctaat cttattggtt tatttctaaa aatgtatgaa tttaatagat tttttcattt
2521 tatagttcgg acaaaaaatt gattttcacg acccgaaagc cattaacttc gtcggcaact
2581 actggcaaga taatgcagat ctgtatggag aagaagtcac aaaagattac caacgttctt
2641 acgaagtatt tgcgcgccgt gtgctaggtg ctgcgcctat gccattcgac aagtacgttt
2701 aaaaaatatt ttcaaaactt aatttttact aagcaatgac gacaactctt ttctacgtta
2761 tatccaagtc aaccgtaatc cggatttgtc tttgtacgtt tgcaaaaaaa ttaatagtaa
2821 tacatagttt cttcatgcta ctttttcagg tacactttca tgcctagtgc aatggacttt
2881 taccagactt ctcttcgtga tcctgctttc tatcagctct acaacagaat tgtggaatac
2941 atcgttgagt tcaagcaata cttgaagcct tacactcaag acaaacttta ctttgatggt
3001 gtcaagataa ctgatgttaa agtcgacaaa ttgacaacat tcttcgagaa ctttgaattc
3061 gacgccagca acagcgtgta ctttagtaag gaggagatta agaacaatca cgtccatgag
3121 ttaaggtgcg ccacacgatt gaaccacagc cccttcaacg ttaacattga ggttgattct
3181 aatgtcgcca gtgacgctgt tgtcaaaatg ttgctggccc ccaaatacga tgacaacgga
3241 atacctctca cattagagga caactggatg aaattcttcg agttggactg gttcacaact
3301 aaactcaccg ctggtcagaa caagattatc cgcaattcga atgaatttgt catatttaaa
3361 gaagactccg tgccaatgac tgaaattatg aagatgctcg acgaaggaaa agtacctttt
3421 gatatgtcgg aagagttctg ttacatgcct aaaagactca tgctgcctag aggtactgaa
3481 ggtggattcc cattccagct ctttgttttc gtctatccat tcgacaacaa aggcaaggac
3541 ttggctcctt tcgaatcttt tgttcttgac aataacctct tggcttccct ctggatcgcc
3601 ccgttgttga tgcattattc aaggttccta acatgtattt caaggatatt ttcatttacc
3661 acgagggtga acggttccct tacaaattca atcttccttc gtatgacaca catgataatg
3721 ttgttccaaa aaattaaatt ttaataaact gatgaatttt gcatccgtaa tatccaaaga
3781 aaatgtaaaa actttaagta gaactgttat gatttagaaa aaataaaatc aagtaggtaa
3841 aattataatt atgtattttt attgcatgca ttttta
//
LOCUS HUMG6PA 1464 bp ss-mRNA PRI 26-JUL-1990
DEFINITION Human glucose-6-phosphate dehydrogenase, complete cds.
ACCESSION M24470 M27958
KEYWORDS glucose-6-phosphate dehydrogenase.
SOURCE Human, cDNA to mRNA, clone NG6PD 1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1464)
AUTHORS Kanno,H., Huang,I.-Y., Kan,Y.W. and Yoshida,A.
TITLE Two structural genes on different chromosomes are required for
encoding the major subunit of human red cell glucose-6-phosphate
dehydrogenase
JOURNAL Cell 58, 595-606 (1989)
STANDARD simple staff_review
COMMENT Draft entry and sequence for [1] kindly submitted by A.Yoshida,
02-MAY-1989.
FEATURES from to/span description
pept 72 1109 glucose-6-phosphate dehydrogenase
/nomgen="G6PD" /map="Xq28"
/hgml_locus_uid="LH0033J"
mRNA < 1 1464 glucose-6-phosphate dehydrogenase mRNA
BASE COUNT 331 a 404 c 389 g 340 t
ORIGIN
1 ctccccgcgc cgccccgcgc aggcgccccc gccccgccgt cgccgccgcc gcagccagga
61 gccgctgcac catgccccgc atagatgcgg acctcaagct cgacttcaag gacgtcctgc
121 tccgacctaa gcggagcagc ctcaagagcc gagccgaggt ggatcttgaa cgcaccttca
181 cgtttcgaaa ttcaaagcag acctactcag ggattcccat catcgtggcc aacatggaca
241 ctgtgggcac gtttgagatg gcagccgtga tgtcacagca ctccatgttt acagcaattc
301 ataagcatta ctccctggat gactggaagc tctttgccac aaatcaccca gaatgcctgc
361 agaatgtagc cgtgagttca ggcagtgggc agaatgatct ggaaaagatg accagcatcc
421 tggaagctgt gccacaggtt aagtttattt gcctggatgt ggccaatggg tattcagaac
481 attttgtgga attcgtgaaa cttgtccgtg ccaaatttcc tgaacacacc attatggcag
541 ggaacgtggt gacaggagaa atggtagaag agcttattct ttccggagca gatatcatca
601 aagtgggagt tggaccaggt tctgtgtgca ccacccgcac caagacggga gtggggtacc
661 cccagctgag tgccgtcatt gagtgtgccg actctgccca cggcctgaag ggccacatca
721 tctctgatgg aggctgtacg tgtccagggg atgtcgccaa agcctttgga actggagcag
781 attttgtcat gctgggagga atgttttcgg gtcatacgga gtgtgctgga gaagtgattg
841 agaggaacgg acggaagctc aagctcttct acgggatgag ctctgacacc gccatgaaca
901 agcacgcagg aggagttgct gagtacagag cctctgaggg taagactgtg gaagttcctt
961 acaaaggaga tgtggaaaac actatcctgg atattctcgg gggactgagg tccacgtgca
1021 cctacgtggg ggccgccaaa ctcaaggagc tcagcaggag ggcaacattc atccgggtga
1081 cccagcagca caacaccgtg ttcagctaac cctggggaca aagcagcgtc tggctcgatg
1141 gaagcgtcca aacctgcttt tcccatctcc ccccaagtct gttccgtcag agcttctggc
1201 tgctcctgaa tggtggaatg cctgtgtcct ctcttctgtc tcctgccgcc tggaggcttc
1261 ggggctctcc cgcctgcctt ctcggggccc agacgcaagg caccgattgg gccaacatca
1321 gagccctgct gcccagaact cataacctca ttgttcaaac caacacttgc acctttctct
1381 ttttctcttt ctctctccct ttctttgttt ttctttcttt tttaaaagaa gatggtttca
1441 gctttaatat aatgctatta tctt
//
LOCUS MUSGT1A 2544 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Mouse glucose transporter 1 mRNA, complete cds.
ACCESSION M23384 J04557
KEYWORDS glucose transporter 1.
SOURCE Mouse adipocyte cell line 3T3-L1, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (sites)
AUTHORS Kaestner,K.H., Christy,R.J., McLenithan,J.C., Braiterman,L.T.,
Cornelius,P., Pekala,P.H. and Lane,M.D.
TITLE Sequence, tissue distribution, and differential expression of mRNA
for a putative insulin-responsive glucose transporter in mouse
3T3-L1 adipocytes
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3150-3154 (1989)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 2544; for [1])
AUTHORS Kaestner,K.H., Christy,R.J., McLenithan,J.C., Braiterman,L.T.,
Cornelius,P., Pekala,P.H. and Lane,M.D.
JOURNAL Unpublished (1989)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [2] kindly provided
by M.Lane, 28-MAR-1989.
FEATURES from to/span description
pept 190 1668 glucose transporter 1
BASE COUNT 514 a 719 c 679 g 632 t
ORIGIN
1 ttggtcctat aaaaaggcag ctccgcgcgc tctcccccaa gagcagaggc ttgcttgtag
61 agtgacgatc tgagctacgg ggtcttaagt gcgtcagggc gtggaggtct ggcgggagac
121 gcatagttac agcgcgtccg ttctccgtct cgcagccggc acagctagag cttcgagcgc
181 agcgcggcca tggatcccag cagcaagaag gtgacgggcc gcctcatgtt ggctgtggga
241 ggagcagtgc tcggatcact gcagttcggc tataacactg gtgtcatcaa cgccccccag
301 aaggttattg aggagttcta caatcaaaca tggaaccacc gcatcggaga gcccatccca
361 tccaccacac tcaccacgct ttggtctctc tccgtggcca tcttctctgt cgggggcatg
421 attggttcct tctctgtcgg cctctttgtt aatcgctttg gcaggcggaa ctccatgctg
481 atgatgaacc tgttggcctt tgtggctgct gtgcttatgg gcttctccaa actgggcaag
541 tcctttgaga tgctgatcct gggccgcttc atcatcggtg tgtactgcgg cctgactact
601 ggctttgtgc ccatgtatgt gggagaggtg tcacctacag ctctacgtgg agccctaggc
661 acactgcacc agctgggaat cgtcgttggc atccttattg cccaggtgtt tggcttagac
721 tccatcatgg gcaatgcaga cttgtggcct ctgctgctca gtgtcgtctt cgtcccagcc
781 ctgctacagt gtatcctgtt gcccttctgc cccgagagcc cccgcttcct gctcatcaat
841 cgtaacgagg agaaccgggc caagagtgtg ctgaagaagc ttcgagggac agccgatgtg
901 acccgagacc tgcaggagat gaaagaagag ggtcggcaga tgatgcggga gaagaaggtc
961 accatcttgg agctgttccg ctcacccgcc taccgccagc ccatcctcat cgctgtggtg
1021 ctgcagctgt cccagcagct gtcgggtatc aatgctgtgt tctactactc aacgagcatc
1081 ttcgagaagg caggtgtgca gcagcctgtg tacgccacca tcggctccgg tatcgtcaac
1141 acggccttca ctgtggtgtc gctgtttgtt gtagagcgag ctggacgacg gaccctgcac
1201 ctcattggcc tggctggcat ggcaggctgt gctgtgctca tgaccatcgc cctggccttg
1261 ctggaacggc tgccttggat gtcctatctg agcatcgtgg ccatctttgg ctttgtggcc
1321 ttctttgaag taggccctgg tcctattcca tggttcattg tggccgagct gttcagccag
1381 gggccccgtc ctgctcgtat tgctgtggct ggcttctcca actggacctc aaacttcatt
1441 gtgggcatgt gcttccagta tgtggagcaa ctgtgcggcc cctacgtctt catcatcttc
1501 acggtgctcc tcgtgctctt cttcatcttc acctacttca aagtccctga gaccaaaggc
1561 cgaaccttcg atgagatcgc ttccggcttc cggcaggggg gtgccagcca aagtgacaag
1621 acacccgagg agctcttcca ccctctgggg gcggactccc aagtgtgagg agccccacac
1681 ccagcccggc ctgctccctg cagcccaagg atctctctgg agcacaggca gctagatgag
1741 acctcttccg aaccgacaga tctcgggcaa gccgggcctg ggcgcctttc ctcagccagc
1801 agtgaagtcc aggaggatat tcaggacttt gatggctcca gaatttttaa tgaaagcaag
1861 actgctgctc agatctattc agataagcag caggttttat aattttttta ttactgattt
1921 tgttattttt tttttttatc agccactctc ctatctccac actgtagtct tcaccttgat
1981 tggcccagtg cctgagggtg gggaccacgc cctgtccaga cacttgcctt ctttgccaag
2041 ctaatctgta gggctggacc tatggccaag gacacactaa taccgaactc tgagctagga
2101 ggctttacgc tggaggcggt agctgccacc cacttccgca ggcctggacc tcggcaccat
2161 aggggtccgg actccatttt aggattcgcc cattcctgtc tcttcctacc caaccactca
2221 attaatcttt ccttgcctga gaccagttgg aagcactgga gtgcagggag gagagggaag
2281 ggccaggctg ggctgccagg ttctagtctc ctgtgcactg agggccacac aaacaccatg
2341 agaaggacct cggaggctga gaacttaact gctgaagaca cggacactcc tgccctgctg
2401 tgtatagatg gaagatattt atatactggt tgtcaatatt aaatacagac actaagttat
2461 agtatatctg gacaaaccca cttgtaaata caccaacaaa ctcctgtaac tttacctaag
2521 cagatataaa tggctggttt ttag
//
LOCUS MUSMS6HM 321 bp ds-DNA ROD 26-JUL-1990
DEFINITION M.musculus Ms6-hm locus, repeat elements.
ACCESSION J04743
KEYWORDS dispersed repetitive element; minisatellite sequence.
SOURCE M.musculus (strain C57BL/6J) DNA, clone pMm3-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Kelly,R., Bulfield,G., Collick,A., Gibbs,M. and Jeffreys,A.J.
TITLE Characterization of a highly unstable mouse minisatellite locus:
Evidence for somatic mutation during early development
JOURNAL Genomics 5, 844-856 (1989)
STANDARD full staff_entry
COMMENT Printed sequence for [1] kindly submitted by R.Kelly, 08-AUG-1989.
FEATURES from to/span description
rpt 37 168 MT dispersed repetitive element
rpt 168 263 tandem repeated element
rpt 263 > 321 MT dispersed repetitive element
BASE COUNT 84 a 66 c 109 g 62 t
ORIGIN
1 gatccccagt gatgtaaacc agactatatg gctaactgtt ttagttagag tttctagttg
61 ctgtgaccaa caccatgacc aaaaagcaag ttggggagga aaggatttat ttgacttaca
121 cttccatata actgttcatc atcaaaagaa atcaggacag aaacccgggg gcagggcagg
181 gcagggcagg gcagggcagg gcagggcagg gcagggcagg gcagggcagg gcagggcagg
241 gcagggcagg gcagggcagg gcagggctga tgtagcgtca ctgaggagtc ctgcttccta
301 ctttgcttcc atgggtggat c
//
LOCUS RABCYP4A6 1790 bp ss-mRNA MAM 26-JUL-1990
DEFINITION Rabbit cytochrome P450IVA6 (CYP4A6) mRNA, complete cds.
ACCESSION M28656
KEYWORDS cytochrome P450; lauric acid omega-hydroxylase.
SOURCE Rabbit (strain New Zealand White, adult) kidney, cDNA to mRNA,
clone KdA6.
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 1790)
AUTHORS Johnson,E.F., Walker,D.L., Griffin,K.J., Clark,J.E., Okita,R.T.,
Muerhoff,A.S. and Masters,B.S.
TITLE Cloning and expression of three rabbit kidney cDNAs encoding lauric
acid omega-hydroxylases
JOURNAL Biochemistry 29, 873-879 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.F.Johnson, 04-OCT-1989.
FEATURES from to/span description
pept 14 1546 lauric acid omega-hydroxylase
BASE COUNT 341 a 610 c 479 g 360 t
ORIGIN
1 gggccgctgc accatgagcg tgtctgcact gaaccccacc cggctcccgg gcagcctctc
61 cgggctcctc caagtggcgg gcctgctggg cctgctcctg ctgctgctca aggcagctca
121 gctctacctg caccgccagt ggctgctcag agccctccag cagttcccgt gcccaccctt
181 ccactggctc ctggggcaca gccgagagtt ccaaaatggc catgagttac aagtgatgct
241 gaaatgggtg gagaaattcc caagtgcttg tcctcgctgg ctatggggga gcagagccca
301 cctcctgatc tatgaccctg actacatgaa ggtgattctg gggagatcag acccaaaagc
361 tcaaggttcc tacagattcc tggctccctg gattgggtat ggtttgctcc tgctgaatgg
421 gcagacgtgg ttccagcacc ggcgcatgct caccccagcc ttccactacg acatcctgaa
481 gccctacgtg gggctcatgg cggactccgt ccaaatcatg ctggacaaat gggagcagct
541 ggtcagccag gactcctccc tggaggtctt ccaagacatc tccctgatga ccctggacac
601 catcatgaag tgtgccttca gccaccaggg cagcgtccag ttggacagga attcccagtc
661 ctacatccag gctgttgggg acctgaacaa cctgttcttt tcccgagtga ggaacgtctt
721 tcatcagagt gacaccatct acaggctgag ccctgaaggc cgcttgtccc accgtgcctg
781 ccagctcgcc cacgagcaca cagaccgagt gatccagcag aggaaggctc agctgcagca
841 ggagggggag ctggagaagg tcaggaggaa gaggcgcttg gacttcctgg acgtcctcct
901 ctttgccaag atggagaacg ggagcagcct gtccgaccag gacctccgcg ccgaggtgga
961 cacgttcatg ttcgagggcc acgacaccac ggccagcggc atctcctgga tcttctatgc
1021 cctggccacg caccccgagc atcagcaccg gtgccgcgag gagatccagg gcctcctggg
1081 ggacggagcc tccatcacct gggagcacct ggaccagatg ccctacacca ccatgtgcat
1141 caaggaggcg ctgagactct acccaccagt gccaggtgtc ggcagacagc tcagctcacc
1201 tgtcaccttc cctgatggac gctccctccc caagggtgtc atagtcacgc tctccatcta
1261 cgcccttcac cacaacccga aggtgtggcc aaacccagag gtgtttgacc ctttcccgtt
1321 cgcaccgggt tctgctcgcc acagccacgc tttcctgccc ttctcaggag gaccacggaa
1381 ctgcatcggg aagcaatttg ccatgaatga gctgaaggtg gccgtggccc tgaccctcgt
1441 gcgcttcgag ctgctgccag atcccaaaag agtcccggac caaaaaccac gtcttgtgct
1501 gaagtccagc aacgggatcc acctgcgtct gaggaagctc cgctaaccct ggtggggaca
1561 agagcaggct ctggggcctt ctgccaggcg tcctggcttc ctgtcacctg cccatgcccc
1621 ctgcctgtct gcccacatcc tgctttctat ccaccagcac ttcttccacc tgtctgcctt
1681 gctgcctctt ggcctccagg ctgtctgtcc tctcgcacct tcctctgggc cactgacctg
1741 tctgtctact gtccgcttcc tgccagcatc tctgaccgtg cacctaaccc
//
LOCUS RABCYP4A7 1694 bp ss-mRNA MAM 26-JUL-1990
DEFINITION Rabbit cytochrome P450IVA7 (CYP4A7) mRNA, complete cds.
ACCESSION M28657
KEYWORDS cytochrome P450; lauric acid omega-hydroxylase.
SOURCE Rabbit (strain New Zealand White, adult) kidney, cDNA to mRNA,
clone KdB18.
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 1694)
AUTHORS Johnson,E.F., Walker,D.L., Griffin,K.J., Clark,J.E., Okita,R.T.,
Muerhoff,A.S. and Masters,B.S.
TITLE Cloning and expression of three rabbit kidney cDNAs encoding lauric
acid omega-hydroxylases
JOURNAL Biochemistry 29, 873-879 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.F.Johnson, 04-OCT-1989.
FEATURES from to/span description
pept 25 1560 lauric acid omega-hydroxylase
BASE COUNT 330 a 564 c 463 g 337 t
ORIGIN
1 ggcagatcca gaagctgctg caccatgagc gtgtctgcgc tgagctccac ccggctcccg
61 ggcagcttct ccgggttcct ccaagcggcg gccctgctgg gcctactcct gctgctgctc
121 aaggcagctc agctctacct gcgccgccag tggctgctca gagccctcca gcagttcccg
181 tgcccaccct cccactggct cctggggcac agccgagagt ttccaataga ctcggagctg
241 cagcaggtgc tgaagcgagt ggagaaattc ccaagcgcct gtcctcgctg gctgtggggg
301 agtgagctgt ttctcatttg ctacgaccct gactacatga agacgattct ggggcgatca
361 gacccaaagg ctcgtgtttc ctacagcttc ctggctccct ggattgggta tggcttgctg
421 cttttggaag ggcagacgtg gttccagcac cggcgcatgc tcaccccagc cttccactac
481 gacatcctga agccctacgt ggggctcatg gtggactccg tccaagtgat gctggacaaa
541 ctggagaagc tcgcccgcaa ggacgcgcct ctggagatat acgaacacgt ctccctgatg
601 accctggaaa ccatcatgaa gtgcgccttc agccaccagg gcagcgtcca gctggaaagc
661 aggacctcca aatcctacat ccaggctgtc agggagctca gcgacttggc attgcagcgg
721 gtgaggaacg tctttcacca gagcgacttc ctctacaggc tgagccctga gggccgcttg
781 tcccaccgtg cctgccagct cgcccacgag cacacagacc gagtgatcca gcagaggaag
841 gctcagctgc agcaggaggg ggagctggag aaggtcagga ggaagaggcg cttggacttc
901 ctggacgtcc tcctctttgc caagatggag aacgggagca gcctgtccga ccaggacctc
961 cgcgccgagg tggacacgtt catgttcgag ggccacgaca ccacggccag cggcatctcc
1021 tggatcttct atgccctggc cacgcacccc gagcatcagc accggtgccg cgaggagatc
1081 cagggtctcc tgggggacgg agcctccatc acctgggagc acctggacaa gatgccctac
1141 accaccatgt gcatcaagga ggcgctgaga ctctacccac cggtgccagg tgtcggcagc
1201 aagctcagct cacctgtcac cttccctgat ggacgctccc tccccaaggg catcataatc
1261 acactctcca tctatggcct gcatcacaac ccgaaggtgt ggccaaaccc agaggtgttt
1321 gacccttccc gcttcgcacc gggttctgct cgccacagcc acgctttcct gcccttctca
1381 ggaggatcga ggaactgcat cgggaaacaa tttgccatga acgagctgaa ggtggccgtg
1441 gccctgaccc tcgtgcgctt cgagctgctg ccggatccca ccagagtccc catccccata
1501 acaagacttg tgctgaagtc taagaatggg attcacctac gtctcaggaa gctccactaa
1561 ccctgctgga aacaagaatg gtctgccagg cgtcctctct tcctgtcacc tgcccgtgtc
1621 ccgcactctg tctgtatctt gctttctctc tacctacctg cccttcttcc acctgcctcc
1681 gattcggcct tttg
//
LOCUS HUMRGIT 1095 bp ds-DNA PRI 26-JUL-1990
DEFINITION Human rRNA gene internal transcribed spacer 1 (ITS1).
ACCESSION M36624
KEYWORDS internal transcribed spacer.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1095)
AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
Schmickel,R.D.
TITLE Ribosomal RNA gene sequences and hominoid phylogeny
JOURNAL Mol. Biol. Evol. 7, 203-219 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
BASE COUNT 83 a 429 c 441 g 139 t 3 others
ORIGIN
1 acggagcccg gagggcgagg cccgcggcgg cgccgccgcc gccgcgcgct tccctccgca
61 cacccacccc cccaccgcga cgcggcgcgt gcgcgggcgg ggcccgcgtg cccgttcgtt
121 cgctcgctcg ttcgttcgcc gcccggcccc gccgccgcga gagccgagaa ctcgggaggg
181 agacgggggg gagagagaga gagagagaga gagagagaga gagagagaga gaaagaaggg
241 cgtgtcgttg gtgtgcgcgt gtcgtggggc cggcgggcgg cggggagcgg tccccggccg
301 cggccccgac grcgtgggtg tcggcgggcg cgggggcggt tctcggcggc gtcgcggcgg
361 gtctgggggg gtctcggtgc cctcctcccc gccggggccc gtcgtccggc cccgccgcgc
421 cggctccccg tcttcggggc cggccggatt cccgtcgcct ccgccgcgcc gctccgcgcc
481 gccgggcacg gccccgctcg ctctccccgg ccttcccgct agggcgtctc gagggtcggg
541 ggccggacgc cggtcccctc ccccgcctcc tcgtccgccc ccccgccgtc caggtaccta
601 gcgcgttccg gcgcggaggt ttaaagaccc cttgggggga tcgcccgtcc gcccgtgggt
661 cgggggcggt ggtgggcccg cgggggagtc ccgtcgggag gggcccggcc cctcccgcgc
721 ctccaccgcg gactccgctc cccggccggg gccgcgccgc cgccgmcgcc gcggcggccg
781 tcgggtgggg gctttacccg gcggccgtcg cgcgcctgcc gcgcgtgtgg cgtgcgcccc
841 gcgccgtggg ggcgggaacc cccgggcgcc tgtggggtgg tgtccgcgct cgcccccgcg
901 tgggcggcgc gcgcctcccc gtggtgtgaa accttccgac ccctctccgg agtccggtcc
961 cgtttgctgt ctcgtctggc cggcctgagg caaccccctc tcctcttggg cggggggggs
1021 ggggggacgt gccgcgccag gaagggcctc ctcccggtgc gtcgtcggga gcgccctcgc
1081 caaatcgacc tcgta
//
LOCUS MHV1NP 1670 bp ss-RNA VRL 26-JUL-1990
DEFINITION Mouse hepatitis virus nucleocapsid (N-MHV1) RNA, complete cds.
ACCESSION M35253
KEYWORDS N protein; RNA binding viral structural protein;
nucleocapsid protein.
SOURCE Mouse hepatitis virus (strain 1), cDNA to viral RNA.
ORGANISM Mouse hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 1670)
AUTHORS Parker,M.M. and Masters,P.S.
TITLE Sequence comparison of the N genes of five strains of the
coronavirus mouse hepatitis virus suggests a three domain structure
for the nucleocapsid protein
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.S.Masters, 19-JUN-1990.
FEATURES from to/span description
pept 1 1368 hepatitis virus nucleocapsid (N-MHV1) ORF 1
pept 65 688 hepatitis virus nucleocapsid (N-MHV1) ORF 2
BASE COUNT 497 a 364 c 426 g 383 t
ORIGIN
1 atgtcttttg ttcctgggca agaaaatgcc ggtagcagaa gctcctctgt aaaccgcgct
61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat
121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caattccggg
181 agtgtggttc cccattactc ttggttttcg ggcattaccc aatttcagaa gggaaaagag
241 tttcagtttg cacaaggaca gggagtgcct attgccaacg gaatcccagc ttcagagcaa
301 aagggatatt ggtatagaca caaccgacgg tcttttaaaa cacctgatgg ccagcagaag
361 cagctactgc ccagatggta tttttactat cttggaacag ggccccatgc tggcgcagag
421 tatggcgacg atatcgacgg agttgtctgg gtcgcaagcc aacaggccga cactaagacc
481 actgccgata ttgttgaaag ggacccaagt agccatgagg ctattcctac taggtttgcg
541 cccggtacgg tattgcctca aggtttttat gttgaaggct caggaaggtc tgcacctgct
601 agtcgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagctcc
661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt
721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacaaa gcaaagcgcc
781 aaagaagtca ggcagaaaat tttaaacaag cctcgtcaaa agaggactcc aaacaagcag
841 tgccctgtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa
901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggccccaaca
961 cctagtgcct tcttctttgg atctaaatta gaattggtca aaaagaactc tggtggtgct
1021 gatgacccca ccaaagatgt gtatgaattg cagtattcag gtgcaattag atttgatagt
1081 actctcccag gatttgagac tatcatgaaa gtgttgaatg agaatttgga tgcctaccag
1141 gatcaagctg gtggtgcaga tgtagtgagc ccaaagcccc aaagaaagag agggacaaaa
1201 caaaaggctc tgaaaggtga agtagataat gtaagcgttg caaagcccaa aagctctgtg
1261 cagcgaaatg taagtagaga attaacccct gaggatcgta gtctgttggc tcagatcctt
1321 gatgatggcg ttgtgcctga tgggttagaa gatgactcta atgtgtaaag agaatgaatc
1381 ctatgtcggc actcggtggt aacccctcgc gagaaagtcg ggataggaca ctctctatca
1441 gaatggatgt cttgctgtca taacagatag agaaggttgt ggcagaccct gtatcaatta
1501 gttgaaagag attgcaaaat agagaatgtg tgagagaagt tagcaaggtc ctacgtctaa
1561 ccataagaac ggcgataggc gcccccctgg gaagagctca catcagggta ctattcctgc
1621 aatgccctag taaatgaatg aagttgatca tggccaattg gaagaatcac
//
LOCUS MHV3NP 1666 bp ss-RNA VRL 26-JUL-1990
DEFINITION Mouse hepatitis virus nucleocapsid (N-MHV3) RNA, complete cds.
ACCESSION M35254
KEYWORDS N protein; RNA binding viral structural protein;
nucleocapsid protein.
SOURCE Mouse hepatitis virus (strain 3), cDNA to viral RNA.
ORGANISM Mouse hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 1666)
AUTHORS Parker,M.M. and Masters,P.S.
TITLE Sequence comparison of the N genes of five strains of the
coronavirus mouse hepatitis virus suggests a three domain structure
for the nucleocapsid protein
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.S.Masters, 19-JUN-1990.
FEATURES from to/span description
pept 1 1365 hepatitis virus nucleocapsid (N-MHV3) ORF 1
pept 65 688 hepatitis virus nucleocapsid (N-MHV3) ORF 2
BASE COUNT 494 a 358 c 432 g 382 t
ORIGIN
1 atgtcttttg ttcctgggca agaaaatgcc ggtggcagaa gctcctctgg aaaccgcgct
61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat
121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caactccggg
181 agtgtggttc cccattactc ctggttttct ggcattaccc agttccaaaa gggaaaggag
241 tttcagtttg cagaaggaca aggagtgcct attgccaatg gaatccccgc ttcagagcaa
301 aagggatatt ggtatagaca caaccgccgt tcttttaaaa cacctgatgg gcagcagaag
361 caattactgc ccagatggta tttttactat cttggcacag ggccccatgc tggagccagt
421 tatggagaca gcattgaagg agtcttctgg gttgcaaaca gccaagcgga caccaatacc
481 cgctctgata ttgtcgaaag ggacccaagc agtcatgagg ctattcctac taggtttgcg
541 cccggcacgg tattgcctca gggcttttat gttgaaggct ctggaaggtc tgcacctgct
601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagttcc
661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt
721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacgaa gcaaagtgcc
781 aaagaagtca ggcagaaaat tttaaacaag cctcgccaaa agaggactcc aaacaagcag
841 tgcccagtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa
901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggctccaaca
961 gttggtgcct tcttctttgg atctaaatta gaattggtca aaaagaattc tggtggtgct
1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt
1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag
1141 aaggatggtg gtgcagatgt ggtgagccca aagccccaaa gaaaagggcg tagacaggct
1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag
1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatccttgat
1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta
1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa
1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt
1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca
1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcctgcaatg
1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcgc
//
LOCUS MHVA59NP 1666 bp ss-RNA VRL 26-JUL-1990
DEFINITION Mouse hepatitis virus nucleocapsid (N-MHVA59) RNA, complete cds.
ACCESSION M35256
KEYWORDS N protein; RNA binding viral structural protein;
nucleocapsid protein.
SOURCE Mouse hepatitis virus (strain A59), cDNA to viral RNA.
ORGANISM Mouse hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 1666)
AUTHORS Parker,M.M. and Masters,P.S.
TITLE Sequence comparison of the N genes of five strains of the
coronavirus mouse hepatitis virus suggests a three domain structure
for the nucleocapsid protein
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.S.Masters, 19-JUN-1990.
FEATURES from to/span description
pept 1 1365 hepatitis virus nucleocapsid (N-MHVA59) ORF 1
pept 65 688 hepatitis virus nucleocapsid (N-MHVA59) ORF 2
BASE COUNT 497 a 355 c 433 g 381 t
ORIGIN
1 atgtcttttg ttcctgggca agaaaatgcc ggtagcagaa gctcctctgg aagccgctct
61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgcgc tggaaataat
121 ggaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caattccggg
181 agtgtggttc cccattactc ttggttttcg ggcattaccc aattccagaa gggaaaagag
241 tttcagtttg tacaaggaca gggagtgcct attgccaatg gaatcccagc ttcagagcaa
301 aagggatatt ggtatagaca caaccgacgt tcttttaaaa cacctgatgg ccagcagaag
361 cagctactgc ccagatggta tttttactat ctcggaacag ggccccatgc tggcgcagag
421 tatggcgacg atatcgaagg agttgtctgg gtcgcaagcc aacaggccga cactaagacc
481 actgccgata ttgttgaaag ggacccaagt agccatgagg ctattcctac taggtttgcg
541 cccggtacgg ttttgcctca gggtttttat gttgaaggct caggaaggtc tgcacctgct
601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagctcc
661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt
721 gttttggcta agctcggtaa agatgccggt cagcccaagc aagtaacaaa gcaaagtgcc
781 aaagaagtca ggcagaaaat tttaaacaag cctcgtcaaa agaggactcc aaacaagcag
841 tgcccagtgc agcaatgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa
901 atgcttaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggccccaaca
961 gctggtgcct tcttctttgg atctaaatta gaattggtca aaaagaactc tggtggtgct
1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt
1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag
1141 aaggatggtg gtgcagatgt agtgagccca aagccccaaa gaaaagggcg tagacaggct
1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag
1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatcctagat
1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta
1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa
1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt
1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca
1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcttgcaatg
1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcac
//
LOCUS MHVSHV 1666 bp ss-RNA VRL 26-JUL-1990
DEFINITION Mouse hepatitis virus nucleocapsid (N-MHVS) RNA, complete cds.
ACCESSION M35255
KEYWORDS N protein; RNA binding viral structural protein;
nucleocapsid protein.
SOURCE Mouse hepatitis virus (strain S), cDNA to viral RNA.
ORGANISM Mouse hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 1666)
AUTHORS Parker,M.M. and Masters,P.S.
TITLE Sequence comparison of the N genes of five strains of the
coronavirus mouse hepatitis virus suggests a three domain structure
for the nucleocapsid protein
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.S.Masters, 19-JUN-1990.
FEATURES from to/span description
pept 1 1365 hepatitis virus nucleocapsid (N-MHVS) ORF 1
pept 65 688 hepatitis virus nucleocapsid (N-MHVS) ORF 2
BASE COUNT 494 a 357 c 430 g 385 t
ORIGIN
1 atgtcttttg ttcctgggca agaaaatgcc ggtggcagaa gctcctctgt aaaccgcgct
61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat
121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caactccggg
181 agtgtggttc cccattactc ctggttttct ggcattaccc agttccaaaa gggaaaggag
241 tttcagtttg cagaaggaca aggagtgcct attgccaatg gaatccccgc ttcagagcaa
301 aagggatatt ggtatagaca caaccgccgt tcttttaaaa cacctgatgg gcagcagaag
361 caattactgc ccagatggta tttttactat cttggcacag ggccccatgc tggagccagt
421 tatggagaca gcattgaagg tgtcttctgg gttgcaaaca gccaagcgga caccaatacc
481 cgctctgata ttgtcgaaag ggacccaagc agtcatgagg ctattcctac taggtttgcg
541 cccggcacgg tattgcctca gggcttttat gttgaaggct ctggaaggtc tgcacctgct
601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagttcc
661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt
721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacgaa gcaaagtgcc
781 aaagaagtca ggcagaaaat tttaaacaag cctcgccaaa agaggactcc aaacaagcag
841 tgcccagtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa
901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggctccaaca
961 gttggtgcct tcttctttgg atctaaatta gaattggtca aaaagaattc tggtggtgct
1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt
1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag
1141 aaggatggtg gtgcagatgt ggtgagccca aagccccaaa gaaaagggcg tagacaggct
1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag
1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatccttgat
1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta
1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa
1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt
1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca
1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcttgcaatg
1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcac
//
LOCUS MXAFRZGF 2999 bp ds-DNA BCT 26-JUL-1990
DEFINITION M.xanthus frzG and frzF genes, complete cds.
ACCESSION M35200
KEYWORDS FrzF protein; FrzG protein; methylesterase; methyltransferase.
SOURCE M.xanthus (strain DZF1) DNA.
ORGANISM Myxococcus xanthus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria;
Myxococcaceae.
REFERENCE 1 (bases 1 to 2999)
AUTHORS McCleary,W.R., McBride,M.J. and Zusman,D.R.
TITLE Developmental sensory transduction in Myxococcus xanthus involves
methylation and demethylation of FrzCD
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.R.McCleary, 15-JUN-1990.
FEATURES from to/span description
pept 120 1124 FrzG protein
pept / 1117 2899 FrzF protein (AA at 2)
BASE COUNT 383 a 990 c 1134 g 492 t
ORIGIN
1 ggatgccggc gcggacgcgt acctcgtcaa gggcgagctg ggcgtggagg ttctcgcgca
61 ggccatcgac cggctgacct gaggagccag gcttgggcgg tggcgcggta gtcgcaggaa
121 tggcgtttcg ggtgctcatg gtgggcaagg ggctgcgtgc gctcgcggcc cggggcctgt
181 tcgatgggga atccctggtg cccgtggggc cggcggaggt ggacttcgcc ggcgccctgg
241 tggccgtgca gcggcacttc ccggacgtgg tgctggtgga cctgagcgcg ctggacgcgc
301 tgcccgccat cgagcacgtc atggtggagc ggcccgtgcc ggtgctggcg ttgcaccccg
361 gcgtgttgtc cggccaggag gccttccagg cgatggtggc gggcgcgctg gacgtgctgg
421 agcgtccggc gaaccccggg cccgagttct ggacgcacgt gtcgcgcaag ctggtgctgc
481 tggcgcaggt gaaggcggtg cggcaggtgc agacgcggcc gccaccgcaa gcggcgcgtg
541 aggcgaagcc gcctcctccg tatccgctgg tggccatcgc cgcgtccctg ggtggcccca
601 aggcggtggc gcaggtgctg cggatgattc cgcgcgcctt cccggcgccc atcgcctact
661 gccagcacat cagcgacggt ttcacggaag ggctggcgca ctggttgtcc aatgaaacgg
721 cgctgcgcgt gctggaggcc gagcatgacg tgctcatggc gccgggcacg gtgtacatcg
781 ctccgtcggg cagtcacctc ttggtccgac ccgagggcag gttggagctg gacgcgggcc
841 ccgcgcttcg cggtttccgg ccgtcctgtg acatgctgct gacttcagcg ggtgagtcgt
901 tcggcccgcg ctgcatcggg gtcatcctga cgggcatggg gcgcgacggg gcgcgagggt
961 tgaaggagat tcgagagcgc ggcggtcgga ccattgccca ggacgaagcg tcgagcgtcg
1021 tctggggcat gccgcgcgag gcggtgttga tgggcgcggc gcacgaggtg ctgccactga
1081 gccggattgg cgcggcgctg atgcagtggg tggatgtgtg ttgacggcga gccagaaagt
1141 cttgcaacaa ctcgcggcgc tgctgctgga gcgcgcgggg ctgaaaatca cgccggatgg
1201 cttccacagc ctccgactgg cgctgtccgc gcggatgccc gtgctggggc tggaagagcc
1261 cgagcactac atccagcgac tgacgggcgc cggtggcgaa gaggagctgc gctcgctgtt
1321 gccgctggtg acggtggggc acacggagtt cttccgcgac gcgaagcagt tccgcgcgct
1381 ggagaagagc gtgctgccgg acctggtgtc ccgttcgcgg cgcgagatgc gcaaggtgtc
1441 catctggtcc gcgggctgcg cgacggggga ggagccctac agcctggcca tggtgctggc
1501 ggagctgggc gcgctgtcgc tggaggtgga cctgtgggcc accgacctca acctggccgc
1561 ggtggaggcc gcgaagcagg ggcgcttcac ctcgcggcgg gccatcagca tcaaccaggc
1621 gcggctgacg cgcttcttca agcccgtgga agagggctat gaggcgctgc ccgcgctgcg
1681 tgagtacatc cgcttcgatg gacagaacct ggcggttccc gtcttcgaca aggtggccct
1741 gtcgtcgctg gacctcatcc tctgccgcaa cgtcatcatc tacttcgacc tgcccaccat
1801 ccgcgggttg atggaccgct tcctcgccgc gctgcggccg ggcgggctgt tgttcctggg
1861 gtactcggag agcctcttca aggtctacga ccgcttcgag atgatcgaag tcgatggggc
1921 gttcgtgtac cgccgcccgc tgaacgacaa gagcatgcgg gcgccgccgc tgcgcatcac
1981 cccgtatcct ggcgagcccg atgtcgccgc gcgcaggccc gtgcctgcgg acgcgttcac
2041 cgcggacctg cgcaagcgga tgctgcccga ggacgtcccg ttgacgacgc ggctgcccgc
2101 ggtgtcagcg tcgtcggtgg cggcgcctgg ctcgcccagc gtgacgctgc cggcgctggg
2161 ggcctcttcg agtccgcgtt ccgtggtgcc ggggcggctg cccgcggtgt cgcctcactc
2221 gccgctgccg gccatcgccg cgcgctcgcg tgtcaccgcg gagttgccca cggtgggaag
2281 cgtggactcc gcccgtccgc gcatcaccac cgagctgccg gccgtggcca ccacgccgcg
2341 cgcgcccacc gtggaggtgc ccgcctggcc cacgctgctg cctccggcgg agcggctggc
2401 catggcggtg cggaagatgg cgcaggggga tttctcggcg gccatcgctg gcgtgcagcg
2461 gctgctcgcg gacgagccca gtgacttgga tgggctgctg acgctgggca acctgttctc
2521 gctcaccggc cgcatccccg aggcgcgcga ggccttcgcg caggccattc agcgcgagcc
2581 gctgtgcgtg gaggcgcggg tgttcggcgg ggtcgccgcg ttgcaagcag gggagttgag
2641 cgaggcgcgc tccgagctga gcaaggccct gttcctggag cccacgctgg ccattggcca
2701 ctacctgctg gcgcaggtgc acgagcgcac gcaggaccat gaggcggccc gccgcagcta
2761 ccgcaacgcc attgcccagc ttcgcttccc gcagcgtccc ctcgcggggc actacccgga
2821 gatgccggac tcggcggatg ccatctctcg cgcggcgcgt tacgccctgg ccgcgctgga
2881 ggagcagccc ctgcgctgag gcaggggccg cgtcccaggc ttcacgtcag tccaggctgc
2941 tcttcacctg gtccaggctc ttgctcgggt cgagcacgga gccgaacttc ttctgcagg
//
LOCUS ECOSFIM 762 bp ds-DNA BCT 26-JUL-1990
DEFINITION E.coli S-fimbrial protein (sfaA) gene, complete cds.
ACCESSION M35273
KEYWORDS S-fimbrial protein.
SOURCE E.coli (strain 536) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 762)
AUTHORS Schmoll,T., Hacker,J. and Goebel,W.
TITLE Nucleotide sequence of the sfaA gene coding for the S-fimbrial
protein subunit of Escherichia coli
JOURNAL FEMS Microbiol. Lett. 41, 229-235 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 166 708 S-fimbrial protein precursor
sigp 166 237 S-fimbrial protein signal peptide
matp 238 705 S-fimbrial protein
BASE COUNT 218 a 140 c 172 g 232 t
ORIGIN
1 gaaaatatta tcggagataa tgtcataaat gctgcctgag tgtatttctc acattgcatt
61 tatgaagttc tcctgaaaaa agattcccgt cgttcgggat attgattgtg tctgttgtga
121 tgacagatac ggtgtgcgta gttcaattaa aaacaggaat taaatatgaa gttaaaattc
181 atctccatgg ctgtattttc agccctgacc ttgggtgttg cgacaaatgc gtctgctgtc
241 accacggtta atggtggtac agttcatttt aagggggaag ttgttgatgc tgcatgtgct
301 gtaaacacta attcagcaaa tcaaacgttt tctgggcaag ttcgttcagc taagttggcg
361 aatgatggag agaagagttc ccctgttgga tttagtattg aacttaatga ctgtagttct
421 gcaactgccg ggcatgcatc aattatcttt gcaggaaatg ttattgctac acacaatgat
481 gtgctgtctc tacagaatag tgctgcaggt agtgcaacaa atgtaggtat tcagatattg
541 gatcatacag gtactgcagt tcaatttgac ggagtgactg catctacaca atttacatta
601 acagatggca ccaataaaat tcctttccag gcagtttatt atgcaacagg taagtcaacg
661 cctggtattg ccaacgccga cgccaccttt aaagttcagt accagtaata tcagaacagt
721 gtaacgatat atacccggcc aggagggctg tttttatcat gc
//
LOCUS ECOSRNB 655 bp ds-DNA BCT 26-JUL-1990
DEFINITION F plasmid (from E.coli) stable RNA degradation promoter (srnB)
gene, complete cds.
ACCESSION M35279
KEYWORDS .
SOURCE F plasmid (from E.coli) DNA.
ORGANISM Plasmid F
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 655)
AUTHORS Akimoto,S., Ono,K., Ono,T. and Ohnishi,Y.
TITLE Nucleotide sequence of the F plasmid gene srnB that promotes
degradation of stable RNA in Escherichia coli
JOURNAL FEMS Microbiol. Lett. 33, 241-245 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 251 457 stable RNA degradation promoter (srnB)
signal 127 132 -35 region
signal 150 155 -10 region
signal 466 485 transcription termination signal (put.)
binding 242 245 ribosome binding site
BASE COUNT 163 a 156 c 173 g 163 t
ORIGIN
1 aattcccatt ctggaccagc gggagcatac gaacaataat ttacggtttc gcgctatagc
61 tggctcaagt taggttggac cctgaatctc cagacaacca atatctgatc gcgccagtgg
121 tggcagttat taagcaacag ggaatgtggt attatcgcgg cgggtgtctg agcctttctg
181 gttcaggcaa gacgcaggta ccagaaatgc gaagacccca cttgttaatc cattaactcg
241 tgaggtctgc atgaagtacc ttaacactac tgattgtagc ctcttccttg cagagaggtc
301 aaagtttatg acgaaatatg cccttatcgg gttgctcgcc gtgtgcgcta cggtgttgtg
361 tttttcactg atattcaggg aacggttatg tgagctgaat attcacaggg gaaatacagt
421 ggtgcaggta actctggcct acgaagcacg gaagtaagct gccgggcggg gacggaagtc
481 cccgctttcc ggaagtgtga ggtatttcag gggcagacac ccgacatgcc agaaacagcc
541 ggtcccgccc ggggccggca cccaggttca ggcatttcct gcttttcagt catttcatta
601 tcaaaatcac attaaacggt cgtaatcaga catgatttgt gcgccaacac agatc
//
LOCUS HUMTIMP2 1062 bp ss-mRNA PRI 26-JUL-1990
DEFINITION Human metalloproteinase-2 inhibitor (TIMP-2) mRNA, complete cds.
ACCESSION J05593
KEYWORDS metalloproteinase-2 inhibitor.
SOURCE Human melanoma cell line A2058, cDNA to mRNA, clone pT2-M01.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1062)
AUTHORS Stetler-Stevenson,W.G., Brown,P.D., Onisto,M., Levy,A.T. and
Liotta,L.A.
TITLE Tissue inhibitor of metalloproteinases-2 (TIMP-2) mRNA expression
in tumor cell lines and human tumor tissues
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by W.G.Stetler-Stevenson, 19-JUN-1990.
FEATURES from to/span description
pept 271 933 metalloproteinase-2 inhibitor precursor
sigp 271 348 metalloproteinase-2 inhibitor signal peptide
matp 349 930 metalloproteinase-2 inhibitor
BASE COUNT 222 a 370 c 316 g 154 t
ORIGIN
1 ggggccgccg agagccgcag cgccgctcgc ccgccgcccc ccaccccgcc gccccgcccg
61 gcgaattgcg ccccgcgccc tcccctcgcg cccccgagac aaagaggaga gaaagtttgc
121 gcggccgagc gggcaggtga ggagggtgag ccgcgcggag gggcccgcct cggccccggc
181 tcagcccccg cccgcgcccc cagcccgccg ccgcgagcag cgcccggacc ccccagcggc
241 ggccccgccc gcccagcccc ccggcccgcc atgggcgccg cggcccgcac cctgcggctg
301 gcgctcggcc tcctgctgct ggcgacgctg cttcgcccgg ccgacgcctg cagctgctcc
361 ccggtgcacc cgcaacaggc gttttgcaat gcagatgtag tgatcagggc caaagcggtc
421 agtgagaagg aagtggactc tggaaacgac atttatggca accctatcaa gaggatccag
481 tatgagatca agcagataaa gatgttcaaa gggcctgaga aggatataga gtttatctac
541 acggccccct cctcggcagt gtgtggggtc tcgctggacg ttggaggaaa gaaggaatat
601 ctcattgcag gaaaggccga gggggacggc aagatgcaca tcaccctctg tgacttcatc
661 gtgccctggg acaccctgag caccacccag aagaagagcc tgaaccacag gtaccagatg
721 ggctgcgagt gcaagatcac gcgctgcccc atgatcccgt gctacatctc ctccccggac
781 gagtgcctct ggatggactg ggtcacagag aagaacatca acgggcacca ggccaagttc
841 ttcgcctgca tcaagagaag tgacggctcc tgtgcgtggt accgcggcgc ggcgcccccc
901 aagcaggagt ttctcgacat cgaggaccca taagcaggcc tccaacgccc ctgtggccaa
961 ctgcaaaaaa agcctccaag ggtttcgact ggtccagctc tgacatccct tcctggaaac
1021 agcatgaata aaacactcat cccatgggtc caaattaata tg
//
LOCUS ALREV1 717 bp ss-RNA VRL 26-JUL-1990
DEFINITION Rous sarcoma defective endogenous virus ev-1 locus gag polyprotein
RNA, 5' end.
ACCESSION M30517
KEYWORDS gag polyprotein.
SOURCE Rous sarcoma defective endogenous virus (strain Prague C), cDNA to
viral RNA, clone pGD27.
ORGANISM Rous sarcoma virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Avian sarcoma viruses.
REFERENCE 1 (bases 1 to 717)
AUTHORS Vogt,V.M., Pepinsky,R.B. and Southard,L.E.
TITLE Primary structure of p19 species of avian sarcoma and leukemia
viruses
JOURNAL J. Virol. 56, 31-39 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept 1 > 717 gag polyprotein
matp 1 465 p19 protein
matp 466 531 p2 protein
matp 532 717 pp10 protein
BASE COUNT 161 a 173 c 246 g 137 t
ORIGIN
1 atggaagccg tcataaaggt gatttcgtcc gcgtgtaaaa cctattgcgg gaaaacctct
61 ccttctaaga aggaaatagg ggccatgttg tccctgttac aaaaggaagg gttgcttatg
121 tctccctcag acttatattc cccggggtcc tgggatccca ttaccgcggc gctctcccag
181 cgggcaatgg tacttgggaa atcgggagag ttaaaaacct ggggattggt tttgggggca
241 ttgaaggcgg ctcgagagga acaggttaca tctgagcaag caaagttttg gttgggatta
301 gggggaggga gggtctctcc cccaggtccg gagtgcatcg agaaaccagc aacggagcgg
361 cgaatcgaca aaggggagga agtgggagaa acaactgcgc agcgagatgc gaagatggcg
421 ccggagaaaa tggccacacc taaaaccgtt ggcacatcct gctatcagtg cggaacagct
481 actggctgta attgcgccac agcctcggcc cctcctcctc cttatgtggg gagtggtttg
541 tatccttccc tggcgggggt gggagagcag cagggccagg ggggtgacac accttggggg
601 gcggaacagc caagggcgga gccagggcac gcgggtctgg cccctgggcc ggccctgact
661 gactgggcaa ggatcaggga ggagcttgcg agtactggtc cgcccgtggt ggccatg
//
LOCUS ALREV2 564 bp ss-RNA VRL 26-JUL-1990
DEFINITION Rous sarcoma endogenous virus ev-2 locus gag polyprotein RNA,
partial cds.
ACCESSION M30518
KEYWORDS gag polyprotein.
SOURCE Rous sarcoma endogenous virus (strain Prague C), cDNA to viral RNA,
clone pAS2.
ORGANISM Rous sarcoma virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Avian sarcoma viruses.
REFERENCE 1 (bases 1 to 564)
AUTHORS Vogt,V.M., Pepinsky,R.B. and Southard,L.E.
TITLE Primary structure of p19 species of avian sarcoma and leukemia
viruses
JOURNAL J. Virol. 56, 31-39 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept < 1 > 564 gag polyprotein (AA at 1)
matp < 1 312 p19 protein
matp 313 378 p2 protein
matp 379 564 pp10 protein
BASE COUNT 123 a 138 c 207 g 96 t
ORIGIN
1 gatcccatta ccgcggcgct ctcccagcgg gcaatggtac ttgggaaatc gggagagtta
61 aaaacctggg gattggtttt gggggcattg aaggcggctc gagaggaaca ggttacatct
121 gagcaagcaa agttttggtt gggattaggg ggagggaggg tctctccccc aggtccggag
181 tgcatcgaga aaccagcaac ggagcggcga atcgacaaag gggaggaagt gggagaaaca
241 actgtgcagc gagatgcgaa gatggcgccg gaggaaacgg ccacacctaa aaccgttggc
301 acatcctgct atcattgcgg aacagctatt ggctgtaatt gcgccacagc ctcggcccct
361 cctcctcctt atgtggggag tggtttgtat ccttccctgg cgggggtggg agagcagcag
421 ggccaggggg gtgacacacc tcggggggcg gaacagccaa gggcggagcc agggcacgcg
481 ggtctggccc ctgggccggc cctgactgac tgggcaagga tcagggagga gcttgcgagt
541 acaggtccgc ccgtggtggc catg
//
LOCUS HAMCHO1 1953 bp ss-mRNA ROD 26-JUL-1990
DEFINITION C.griseus intracisternal A-particle retrovirus like sequences.
ACCESSION M34949
KEYWORDS p27 protein; pseudogene.
SOURCE C.griseus adult ovary, cDNA to mRNA, clone CHIAP.SW2.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 1953)
AUTHORS Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H.,
Nguyen,T.P. and Wurm,F.M.
TITLE Presence and transcription of intracisternal A-particle-related
sequences in CHO cells
JOURNAL J. Virol. 64, 2021-2032 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps 2 277 IAP p27 homologue
277 564 IAP p27 homologue
mRNA < 1 1953 p27 (pot.) mRNA
BASE COUNT 485 a 473 c 509 g 486 t
ORIGIN
1 ctttactctt acacaattgg atagacttgc cctaaatgcc ttgacgccat ctgactggca
61 gatggtcaca aaagctgcgc ttgtcagcat gggccaatac atggagtgga aagcactctg
121 gcatgaggcc gcccaagagc aggccagagc taacgcgacg gccttaactc ctgagcaaca
181 actatggaca ttcgacctgt taacgggcca gggtcgtttt gcagctgatc aaacaaatta
241 tcattggggc gcttatccac aaatcgacaa cgcggcatta gggcctgaaa ggtgctctcc
301 aagaaaggag gggttgacaa tcagcttact aaaatcattc aaggaaccca ggagactttc
361 tccgattttg tagcaaggat gacagaggca gcgggatgga tctttggcga tcctgagcag
421 gccgcacctc ttgttgagca acttatcttt gaacaggcct cccaagaatg tcgcgcagct
481 atagccccga gaaaaaacaa aggattacaa gattggctta gggtctgtag agaacttggg
541 ggacccctta ctaatgcagg gttagctact gccatcctac agtctcaaaa gcgccccctt
601 aaggggccag ataaaagaac ttgctttaga tgtggaacaa ttggacatat tatggcagat
661 ggcccaacta ggctgtgagc agaagctccc cggcctatat gtcacctcca tccaatatga
721 aaattttacc aaagcagcta atttgtctaa aagcctttct cagttcatgt tacagaattg
781 gacctccaaa tttgagcaaa cgcttcggga gttgagagcc gctattatcc agattaactc
841 cacgcgcctt gacctgtcct tgacggaggg attgtcatca tggatcgctt cgactgtctc
901 ctattttaag gaatgggtgg gggtgggatt gtttggtgca gccgtttgct gcggattggt
961 gttgcttcta tggctggtct gtaggctcag ggctcaaact aagagagaca aggtggttat
1021 cgcccaagcg cttgtagctt tggaacaagg ggcttccact gacatttggt taacaatact
1081 taagcaatag gcgctggcca gacagctctt gcacacccgg agcctaggct cattgcacag
1141 ggtagagtgt ctggcttgag cagcccatga gggaatgtgg agcaaggcat cgcacagaag
1201 agttgcccag tatgcaggct tctctgggag gcatgttgtc ctgcataagg gttgcctgcc
1261 ctagtctccc tttcccagaa aacggcagag gacaggtcga gagcgcttcg ggtcaagcta
1321 acagcctaat ggcgactctc gtacacagtc ttaatgtttg attgggaagg tacaacctct
1381 gcctctatcc ctcaacatat gggtgaccta tttgcttgta aaaatatgta agccttatca
1441 ttaattaata aaaaagggga gatgtaggga gccgtccctg cattctctat tacaagatgg
1501 cgcctgcatc cggcaggcac cgaatggtaa acaagttaat gcgcaggtgc tgggtaactt
1561 tccatccctt ggtctctgcc tctcccgtgg cgtcatatgg tccgatgagc tgcagccagt
1621 cagggggtga cacgtccgag gcggtggttg ccagcctata taagggatgg gtttttggga
1681 gttcggggtc tctgctctgt aagcttatgc tctccctctc aagatgcatt aaagctttac
1741 tacagaagga tcctgaatgt cctgcgtcat tcttgctggc gagacggtag cgcgggacag
1801 atggtgacag ccggtgcaga aagtgtcaac ctcagcttcc ttctccagga agacttcagc
1861 ctgggactgc tcctctacag agccccctac caagattatc taacctgcct gccttcttgt
1921 tgagctgtgt gtaataaact cattgagttt ccc
//
LOCUS HAMCHO2 1570 bp ss-mRNA ROD 26-JUL-1990
DEFINITION C.griseus intracisternal A-particle retrovirus like sequences.
ACCESSION M34950
KEYWORDS p27 protein; pseudogene.
SOURCE C.griseus adult ovary, cDNA to mRNA, clone CHIAP.LY6.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 1570)
AUTHORS Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H.,
Nguyen,T.P. and Wurm,F.M.
TITLE Presence and transcription of intracisternal A-particle-related
sequences in CHO cells
JOURNAL J. Virol. 64, 2021-2032 (1990)
STANDARD full staff_review
FEATURES from to/span description
pept.ps 26 694 IAP p27 homologue
BASE COUNT 418 a 368 c 406 g 378 t
ORIGIN
1 aaaaagaaag ctgggcctcg cttttcccat ctttgagggc attgagggag agtgtatgca
61 tgcacccatg gagtataatc agataaaaga attggcagaa tcagtcagga aatatggagt
121 cacagccaac tttactctta cacaattgga tagacttgcc ctaaatgcct tgacgccatc
181 tgactggcag atggtcacaa aagctgcgct tgtcagcatg ggccaataca tggagtggaa
241 agcactctgg catgaggccg cccaagagca ggccagagct aacgcgacgg ccttaactcc
301 tgagcaacaa ctatggacat tcgacctgtt aacgggccag ggtcgttttg cagctgatca
361 aacaaattat cattggggcg cttatccaca aatcgacaac gcggccatta gggcctgaaa
421 ggtgctctcc aagaaaggag gggttgacaa tcagcttact aaaatcattc aaggaaccca
481 ggagactttc tccgattttg tagcaaggat gacagaggca gcgggatgga tctttggcga
541 tcctgagcag gccgcacctc ttgttgagca acttatcttt gaacaggcct cccaagaatg
601 tcgcgcagct atagccccga gaaaaaacaa aggattacaa gattggctta gggtctgtag
661 agaacttggg ggacccctta ctaatgcagg gttagctact gccatcctac agtctcaaaa
721 gcgccccctt aaggggccag ataaaagaac ttgctttaga tgtggaacaa ttggacatat
781 tatggcagat ggcccaacta ggctgtgagc agaagctccc cggcctatat gtcacctcca
841 tccaatatga aaattttacc aaagcagcta atttgtctaa aagcctttct cagttcatgt
901 tacagaattg gacctccaaa tttgagcaaa cgcttcggga gttgagagcc gctattatcc
961 agattaactc cacgcgcctt gacctgtcct tgacggaggg attgtcatca tggatcgctt
1021 cagctgtctc ctattttaag gaatgggtgg gggtgggatt gtttggtgca gccgtttgct
1081 gcggattggt gttgcttcta tggctggtct gtaggctcag ggctcaaact aagagagaca
1141 aggtggttat cgcccaagcg cttgtagctt tggaacaagg ggcttccact gacatttggt
1201 taacaatact taagcaatag gccgctggcc agacagctct tgcacacccg gagcctaggc
1261 tcattgcaca gggtagagtg tctggcttga gcagcccatg agggatgtgg agcaaggcat
1321 cgcacagaag agttgcccag tatgcaggct tctctgggag gcatgttgtc ctgcataagg
1381 gttgcctgcc ctagtctccc tttcccagaa aaacggcaga ggacaggtcg agagcgcttc
1441 gggtcaagct aacagcctaa tggcgactct cgtacacagt cttaatgttt gattgggaag
1501 gtacaacctc tgcctctatc cctcaacata tgggtgacct atttgcttgt aaaaatatga
1561 agccttatca
//
LOCUS HAMCHO3 2186 bp ss-mRNA ROD 26-JUL-1990
DEFINITION C.griseus intracisternal A-particle retrovirus like sequences.
ACCESSION M34951
KEYWORDS protease; pseudogene.
SOURCE C.griseus adult ovary, cDNA to mRNA, clone CHIAP.YL[7,9].
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 2186)
AUTHORS Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H.,
Nguyen,T.P. and Wurm,F.M.
TITLE Presence and transcription of intracisternal A-particle-related
sequences in CHO cells
JOURNAL J. Virol. 64, 2021-2032 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps 553 1281 IAP protease homologue
BASE COUNT 649 a 431 c 518 g 588 t
ORIGIN
1 gcaataactc catataaggg taaaggcctt gaagtctgga tgaaagtctg tagggagtta
61 gggggtccgc tgactaatgc tggactagca gctgctgtgt tgcaattaac taagaaaggt
121 ggaggttcag gagcttgctt taaatgcggc aagcaagggc atttgaaaaa gcaatgcccc
181 gagggaggaa acactaaagt caataaactt tgctccgcgc cctaagcaac ctggcttatg
241 tcctagatgt agaaaaggaa atcattgggc taaggattgt agatcagtaa aagacatcag
301 tggacagcct cttgttcagg ggtatggagg agcccgttca aaaaacggac gacggggccc
361 acgaccccag ggcccacaaa tatatggggc catggaggat cagaaccagg agcagagtcc
421 cgaaacctgg ccctctcttc gtcatccgag ggaccgagga gagccactac aggctccgcg
481 gggctggact tacgctccac caccagactc gtattaactc ccagaatggg ggtccagctt
541 gttgacaccg attttaaggg accccttgag cctggcacag taggtttgct tataggaaga
601 tcatctgcag cattgaaagg tttacgagta catcctggag ttatagatcc tgattacatg
661 ggtgtagtaa agatcatggt agaatctcct agagggatta cggccatttc tcctggagac
721 aggatagcac agttactgct tttgccaagc ttgcatgaca agtttccagc acaagccaga
781 gagagaggag agggaaactt tggctccact ggatcaaact taactttcct agctttagac
841 cttgatcaac gtccaaccct tgagttaata gtgaatggta agaaaatctt aggcttacta
901 gattctggag ctgataagag catcatagcc actaaagatt ggccctctgg ctggcctata
961 caggtttctt ctcaaagttt acaaggttta ggctatgcta aggctcctga tatgagtgct
1021 agacaattgc cttggaaaga tcaggaaggg cattcaggga ccatgcaacc ttatgtgtta
1081 gacttaccaa tttcattatg gggaagagat ttgttaaagg atatgggttt taaactcaca
1141 aatgaatact cagaaacatc tcaaggtatc atgaaacgaa tgggatacag tcccaggcca
1201 ggcctcggga aacatctgca gggtcgtacc agtcctatta attccacaat tgagaccaaa
1261 gaatctaggt ctgggttttt cctagggcca ctgaggaggt attcctatta cctggaaaac
1321 agaggagccg gtatgggttc ctcagtggcc actttcctct gagaaactgg aagctgctaa
1381 gactctagtg cgggagcagc tggatctggg gcatataaaa tcctctgtat ctccatggaa
1441 tactcctatt tttgtcatta agaaaaaatc tggtaaatgg agactgcttc acgatcttag
1501 agctattaat caacagatgc aaattatggg ccctgtacaa cgtggtcttc cacttttaac
1561 ttctttacct gcatcatggc ctatcatctc tatagatatt aaagattgct tcttttccat
1621 acctttgtgt gccaaggatt cagggcgttt tgcgtttacg ctgccctctt gtaatcatga
1681 acaacctgat ttaaggtatg aatgggatag tgttggccac aggggatggc caatagtcct
1741 actatgtgtc agttgtttgt agcagaagca attgctcctt ttgagagtgg actttcccaa
1801 agattagatg tgttcattat atggatgata ttttattggc tgccaaagat gataaaacgc
1861 ttaataaggc atatacaaaa ttggtaaaat tgcttgagat gcataattta gtcatagcct
1921 cagaaaaggt acaaaaggac actgttgtta actatctagg ggctaagatt ctccctcata
1981 caattattcc acaaaagata gagattagaa aagataattt aaaaactctt aatgattttc
2041 aaaagttgtt gggagatata aattggataa gatgttattt aaaattacca aattatgagt
2101 tgaagccatt gtataatatt ctcaatggtg attcagcatt agattcacct aggcagttaa
2161 ctgctgaagc cagagaagct ttaaag
//
LOCUS HUMCHRM 2098 bp ds-DNA PRI 26-JUL-1990
DEFINITION Human muscarinic acetylcholine receptor gene, complete cds.
ACCESSION M35128 Y00508
KEYWORDS muscarinic acetylcholine receptor.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2098)
AUTHORS Allard,W.J., Sigal,I.S. and Dixon,R.A.F.
TITLE Sequence of the gene encoding the human M1 muscarinic acetylcholine
receptor
JOURNAL Nucleic Acids Res. 15, 10604-10604 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 449 1831 muscarinic acetylcholine receptor
BASE COUNT 458 a 662 c 570 g 408 t
ORIGIN
1 agtatagctt ataagtggat gaatgcttga gaagttgcag attatacaaa gtagttccca
61 actcctgcaa cccagtatgt aagatagaat tgtagttaat ttcccagtaa gaaaatgagc
121 ctgagtctga aaggtaaaac tgaatgaagt attcaaaccc tggatcccaa agccactcca
181 cgctgctggc aaatccactt atggctggga aagtgccact gcataaatga ccatgagtgg
241 gcaccggtaa gggagggtga tgctatctgg tctgaagctc tgaagggcaa gaattacatc
301 ccatgcatct tccaataagg tctatcagaa atgtccagtg gcccaaccaa agcccatgtc
361 ctctctttta ggtgatgact ttcccctgag gaagccctgt agcgtgcctg gaggaagggg
421 tctccaaccc cagccccacc tagccaccat gaacacttca gccccacctg ctgtcagccc
481 caacatcacc gtcctggcac caggaaaggg gccctggcaa gtggccttca ttgggatcac
541 cacgggcctc ctgtcgctag ccacagtgac aggcaacctg ctggtactca tctccttcaa
601 ggtcaacacg gagctcaaga cagtcaataa ctacttcctg ctgagcctgg cctgtgctga
661 cctcatcatc ggtaccttct ccatgaacct ctataccacg tacctgctca tgggccactg
721 ggctctgggc acgctggctt gtgacctctg gctggccctg gactatgtgg ccagcaatgc
781 ctccgtcatg aatctgctgc tcatcagctt tgaccgctac ttctccgtga ctcggcccct
841 gagctaccgt gccaagcgca caccccgccg ggcagctctg atgatcggcc tggcctggct
901 ggtttccttt gtgctctggg ccccagccat cctcttctgg cagtacctgg taggggagcg
961 gacagtgcta gctgggcagt gctacatcca gttcctctcc cagcccatca tcacctttgg
1021 cacagccatg gctgccttct acctccctgt cacagtcatg tgcacgctct actggcgcat
1081 ctaccgggag acagagaacc gagcacggga gctggcagcc cttcagggct ccgagacgcc
1141 aggcaaaggg ggtggcagca gcagcagctc agagaggtct cagccagggg ctgagggctc
1201 accagagact cctccaggcc gctgctgccg ctgctgccgg gcccccaggc tgctgcaggc
1261 ctacagctgg aaggaagaag aggaagagga cgaaggctcc atggagtccc tcacatcctc
1321 agagggagag gagcctggct ccgaagtggt gatcaagatg ccaatggtgg accccgaggc
1381 acaggccccc accaagcagc ccccacggag ctccccaaat acagtcaaga ggccgactaa
1441 gaaagggcgt gatcgagctg gcaagggcca gaagccccgt ggaaaggagc agctggccaa
1501 gcggaagacc ttctcgctgg tcaaggagaa gaaggcggct cggaccctga gtgccatcct
1561 cctggccttc atcctcacct ggacaccgta caacatcatg gtgctggtgt ccacgttctg
1621 caaggactgt gttcccgaga ccctgtggga gctgggctac tggctgtgct acgtcaacag
1681 caccatcaac cccatgtgct acgcactctg caacaaagcc ttccgggaca cctttcgcct
1741 gctgctgctt tgccgctggg acaagagacg ctggcgcaag atccccaagc gccctggctc
1801 cgtgcaccgc actccctccc gccaatgctg atagtcccct ctcctgcatc cctccacccc
1861 agtccccggg aaaaggccgg tcggaagagg gcaggggctg catcctcagc cccagggccc
1921 tgctcaggcc tcacctggct tcccaggacc ctgggtcacc ttcctgggca gcccagagag
1981 acctgccaac tttccagact tcgctattcc caggcaggga gggaaacccg gggaactggt
2041 ttttctgttc cctgctgggt gggaatgcgc tcttcacagg aagaaggccc gggaggag
//
LOCUS MVOTRPBA 2874 bp ds-DNA BCT 26-JUL-1990
DEFINITION M.voltae tryptophan synthase operon (trp) genes, complete cds.
ACCESSION M35130
KEYWORDS tryptophan synthase.
SOURCE M.voltae (PS DSM 1537) DNA.
ORGANISM Methanococcus voltae
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanococcales; Methanococcaceae.
REFERENCE 1 (bases 1 to 2874)
AUTHORS Sibold,L. and Henriquet,M.
TITLE Cloning of the trp genes from the archaebacterium Methanococcus
voltae: Nucleotide sequence of the trpBA genes
JOURNAL Mol. Gen. Genet. 214, 439-450 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 206 tryptophan synthase F (AA at 3)
pept 304 1533 tryptophan synthase B
pept 1571 2425 tryptophan synthase A
pept 2460 2600 ORF 46
pept > 2874 2666 (c) ORF 68 (AA at 2872)
BASE COUNT 1046 a 381 c 569 g 878 t
ORIGIN
1 gggttgcgga aactcatgac catagagtta gcgaaattat ctccaaaaaa tttgatgtcg
61 tacttgcagg cggtataact tttgaaaacg tgagaaaaat tgtaaattcc gtaaaacccg
121 ttggaattga tgtttctagt ggcgttgagt taaacaacag aaaaaacgaa ttattaataa
181 aaaagatttg tcataatttg atttaattag aattaattag aattaatcga attttaacta
241 attaaaaatt ataggttatt aaattatgac taaatacagt atatgtaaaa ttaaggtgaa
301 attatgaaat gtaatacaaa atgtgacaaa aatggatatt ttggggaatt tgggggtcaa
361 tatatacctg aagttttaaa accggctgtt gaagagctta aagaagccta taaagagtta
421 aaagatgacg aagactttca aaatgagctt gcatactatt taaaacatta tgcaggacgt
481 gaaactcccc tatattatgc aaaaaacttg actgaaaaac ttggtggtgc caaaatctac
541 ctaaaaagag aggacttatt gcatggtggt gcccataaaa ccaataacac tattggtcaa
601 gcacttcttg ctaaaaaaat gggtaaaaca agaataattg ctgagacggg tgcgggtcaa
661 catggtgttg gcacgtctat ggcaggagca ctttttggtc tcgaaacaga gatttttatg
721 ggtagggtag atacagaacg acaacaacct aacgtagcac gtatgaaatt attgggtgca
781 aaagttacgc cagtcgatac aggttccaaa gttttaaaag acgctgtaaa tgaagctatg
841 agaaattgga ctgctacttt tgaaaatact cactatttac ttggcactgt gatgggtcca
901 cacccattcc caactatggt gagagatttt cagtcagtaa ttgggaaaga agttaaaaaa
961 caaataatgg agcaggaaga aagacttcct gattatttag ttgcctgtat tggagggggt
1021 agcaatgcaa tgggtttatt tcatccattt ttaagtaata atatcagtac tggcaatgat
1081 gatgccaaaa atgttaaaat gataggaata gaggctgcag gtaaggggct taacactagc
1141 cttcacggtg catccataac taaaggtgaa aaaggggtac ttcacggtat gctttcgtat
1201 ttcttacaag acgaggatgg acaaatagaa gaagcttata gtatttctgc cggattggat
1261 tacccaggga taggtccaga gcatgcttat ttacataacc ttgggcgtgt gcagtatgct
1321 tcagcaactg ataaacaggc cttaaaagca tttatggaac ttacgagaac cgaaggaatt
1381 atcccggctc tagaatcgtc tcacgcgatt gcttatgcca ttgaaaatgc aggaaatatg
1441 gataaggacg atataatggt aataaacctt tcaggacgtg gggataaaga tttaaacaca
1501 gtaataaatg cagtacataa attgggttgt taaaattaat taaaattaat taaaatatcg
1561 aggaatttaa atgaaaaact tagaaaattt agaaaaagat ttgaaaaatg acttaaaaaa
1621 agatttgaaa aaagaaaaac caattttagt tagtttttta gtatcagggg acccaaatat
1681 tgaagctaca ctaaaattta tgaatgcact agacgaatat tgcggagtta tagaactagg
1741 tataccattt agtgacccga tagcagatgg ttcaactatt caagaggcaa atgtacgttc
1801 cttatcaaat ggttataaaa tacatcaatc ttttgacgta ttacgggaat ttaggaaatt
1861 ttcagatacg ccagttgtac ttatgacgta ttacaatcca atatataaaa gaggtattga
1921 aaattttgta attcaagcaa aagaagcagg ggcaaatggg cttataattg tagatttacc
1981 cctagatgaa gcagaacagt atagggcaat atgtaaaaag catgatatgg gaacagtatt
2041 ccttgtagcc ccaaatacac ctgatgagag gttgatgtat tctgatgagg ctagtacact
2101 gtttttatac gtaatttcga catttggtat tactggagct aggggttcat ttgaaaagat
2161 gacttttgaa tttatagctc gtgcaaaaaa tctttgcgat aaaaataagc tgtatgtagg
2221 ttttggaatt tcaaacggtg aacatgctga aaaaataatt gaaaatggtg ctgacggtgt
2281 tattgtaggg agtgcttttg tagatattat taaggaatac ggggattcta atgaaactat
2341 ttataaatta aaagaattag ctcgggaatt aagcgaaggg attcataaag gttatgttaa
2401 atacaatgaa aagaataaat attaaataat ataatttatt ttaaattttg ggtggagata
2461 tgaatttaaa agataatata ctttataaat caatcaaatg gttttttgcg gttaaatcgg
2521 agaaacctaa aaattacgat actgaagtaa aacctatatt gtatgagcaa gagcgacgtg
2581 gtagacgccg tatattataa taaattctaa tttaaaaaaa taaaaaaaga aattatatta
2641 ttgtagtatt taattaatta ttcatttaat tcttttttga attcaaaaag tttttggcaa
2701 tgtccattat attttcagat attatatatt ttgaattttc ttttaaaacg ctatttgcaa
2761 cgtcaagtga tttataaact tgtgcttcac ctttaaagta catttgtgcc gcttctgcaa
2821 ctgtttttat tgctttagcc tgcccctcag cttcaattct aatactttct gcag
//
LOCUS MYCSD1XX 425 bp ds-DNA BCT 26-JUL-1990
DEFINITION M.pneumoniae SDC1 repetitive sequence.
ACCESSION M35024
KEYWORDS .
SOURCE M.pneumoniae (strain M-129) DNA.
ORGANISM Mycoplasma pneumoniae
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 425)
AUTHORS Colman,S.D., Hu,P.-c. and Bott,K.F.
TITLE Prevalence of novel repeat sequence in and around the P1 operon in
the genome of Mycoplasma pneumoniae
JOURNAL Gene 87, 91-96 (1990)
STANDARD simple staff_review
BASE COUNT 100 a 118 c 120 g 87 t
ORIGIN
1 aattcgaatt tgaaggccca aggcctcacc caacccgcct acctcatcgc cggtcttgac
61 gttgtggccg accacctcgt ctttgcggcc tttaaagcgg gcgcggtggg gtatgatatg
121 acgactgatt cgagcgcttc gacctacaac caagcactcg cctggtcgac cacggccggg
181 ttggacagtg atggggggta caaggccttg gtggaaaaca cggccgggct caacggcccg
241 attaatggct tgtttaccct gctcgacacc tttgcgtatg tgacccccgt gagtgggatg
301 aaagggggga gtcagaataa tgaagaagtg caaacgactt acccggtcaa gtccgaccaa
361 aaggccaccg ccaaaattgc ctccttaatt aatgccagcc cactcaacag ttatggggat
421 gatgg
//
LOCUS MYCSDC1 425 bp ds-DNA BCT 26-JUL-1990
DEFINITION M.pneumoniae SDC1 repetitive sequence.
ACCESSION M35022
KEYWORDS .
SOURCE M.pneumoniae (strain M-129) DNA, clone MP135.
ORGANISM Mycoplasma pneumoniae
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 425)
AUTHORS Colman,S.D., Hu,P.-c. and Bott,K.F.
TITLE Prevalence of novel repeat sequences in and around the P1 operon in
the genome of Mycoplasma pneumoniae
JOURNAL Gene 87, 91-96 (1990)
STANDARD simple staff_review
BASE COUNT 102 a 110 c 123 g 90 t
ORIGIN
1 aattcgaatt tgaaggctca aggcctcacc caacccgcct acctcatcgc cggtcttgac
61 gttgtggccg accacctcgt ctttgcggcc tttaaagcgg gcgcggtggg gtatgatatg
121 agcacggaaa acagtgctgc caccaaagac caagcactcg cctggtcgac cacggccggg
181 ttggacagtg ctggggggta caaggccttg gtggaaaaca cggccgggct caacggtccg
241 attaatggct tgtttaccct gctcgacagc tttgcctatg tgaccccggt gagtggcatg
301 aaagggggta gtcagaataa cgaagaagtg cagaccaagt atcccgttaa ggatgatagt
361 aaggcttccg ccaaaattgc gtccttaatt aatgccagcc cactcaacag ttatggggat
421 gatgg
//
LOCUS MYCSDC1X 425 bp ds-DNA BCT 26-JUL-1990
DEFINITION M.pneumoniae SDC1 repetitive sequence.
ACCESSION M35023
KEYWORDS .
SOURCE M.pneumoniae (strain M-129) DNA, clone MP46..
ORGANISM Mycoplasma pneumoniae
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 425)
AUTHORS Colman,S.D., Hu,P.-c. and Bott,K.F.
TITLE Prevalence of novel repeat sequences in and around the P1 operon in
the genome of Mycoplasma pneumoniae
JOURNAL Gene 87, 91-96 (1990)
STANDARD simple staff_review
BASE COUNT 105 a 114 c 117 g 89 t
ORIGIN
1 aattcgaatt tgaagaccca aggcctcacc caacccgcct acctcatcgc cggtcttgac
61 gttgtggccg accacctcgt ctttgcggca tttaaagcgg gcgcggtggg gtatgatatg
121 acgactgatt cgaacgcttc gacctacaac caagcactcg tctggtcgac cacggccggg
181 ttggacagtg atggggggac aaggctttgg tagaaaacac aggccgggct caacggcccg
241 attaatggtt tgtttaccct gctcgacacc tttgcgtatg tgacccccgt gagtgggatg
301 aaagggggga gtcagaataa tgaagaagtg caaacgactt acccggtcaa gtccgaccaa
361 aaggccaccg ccaaaattgc ctccttaatt aatgccagcc cactcaacag ttatggggat
421 gatgg
//
LOCUS MZECAT1 2065 bp ss-mRNA PLN 26-JUL-1990
DEFINITION Z.mays catalase isozyme 1 (CAT-1) mRNA, complete cds.
ACCESSION M33104
KEYWORDS catalase isozyme 1.
SOURCE Z.mays seedling, cDNA to mRNA.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 2065)
AUTHORS Redinbaugh,M.G., Wadsworth,G.J. and Scandalios,J.G.
TITLE Characterization of catalase transcripts and their differential
expression in maize
JOURNAL Biochim. Biophys. Acta 951, 104-116 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 169 1647 catalase isozyme 1 (EC 1.11.1.6)
mRNA 1 2065 catalase isozyme 1 mRNA
BASE COUNT 502 a 558 c 487 g 518 t
ORIGIN
1 gaaaaaaaag gggaaatcgg cttcctactc cccgtcctta tcgccagccg aaccgacatg
61 ttttctcccc ccttctcgcc ttctccttct ccccctagtc tagaggcgtt tgctccccaa
121 ctccttcggc ccgtccgccc gcccactcga ctgatcccac cggcagccat ggatccatac
181 aagcaccgcc cgtctagtgg gagcaactcc agcttctgga ccaccaactc cggcgccccc
241 gtctggaaca acaactctgc cctcaccgtc ggacagcgag gtccaatcct ccttgaggat
301 tatcatctaa tcgaaaagct tgctcagttc gacagagaac gtatccctga acgtgttgtg
361 catgcacggg gagccagtgc caagggtttc tttgaggtca ctcatgatgt ctctcacctt
421 acatgtgctg attttctccg tgctcctggg gtccagacac ctgttattgt ccgtttctct
481 acagttgtgc atgagcgtgg aagccctgag accttgaggg atccacgtgg ttttgctgtc
541 aagttctaca ccagagaggg taactttgac ctcgtgggta acaacatgcc tgtgtttttc
601 atacgagatg ggatgaaatt ccctgacatg gtccacgctt tcaagccgaa tccaaagacc
661 aatttgcagg agaactggag aatagtagat ttcttctctc accacccaga gagcctacac
721 atgttcacct tcctctttga cgatgttggc atcccactca actacaggca catggagggc
781 tttggtgtca atacctactc cttgatcaac agggatggaa agcctcacct tgtgaaattc
841 cattggaagc ctacttgtgg tgtgaaatgc ttgctcgaca atgaagctgt gactgttgga
901 ggcacctgcc acagccatgc gacgaaggat ctatatgatt ccatcgcagc tgggaattac
961 cctgaatgga agctctacat ccagactatt gatcttgacc atgaggataa gtttgacttt
1021 gacccgctcg atgtcaccaa gacctggcct gaggatatca tcccgctgca gcccgttgga
1081 cggatggtcc tgaacaagaa cgtcgacaac ttctttgcag agaatgaaca gattgctttc
1141 tgcccagcga ttagtgttcc tgcaattcac tattctgatg ataagctgct ccagacgaga
1201 atcttctcct atgctgatac ccagaggcac cgccttggtc caaactatct gatgcttcct
1261 gtgaatgcac caaaatgtgc ccaccacaat aaccaccatg atgggttcat gaacttcatg
1321 cacagggacg aagaggtgaa ctacttccct tcgaggtttg atcccgcccg tcacgcggag
1381 aaggtcccca ttcctccccg tgttctaaca cgctgtcgtg agaagtgcat cattcagaag
1441 gagaacaact tcaagcaggc tggcgagaga tatcgttcct tcgaccctgc aaggcaagac
1501 cggttcatcc agcgatgggt tgacgcactg acacaccctc gcgtgaccca tgaacaccgt
1561 accatttgga tctcctactg gtcccagtgc gacgccgctc ttggccagaa gctgccttct
1621 aggctgaacc tgaagccgag catgtaagga tcgacgagga agaaagcagg caccggtggc
1681 caaggatgca acgcaacatg gagcgtgtga tgtttacacc aatataattg aataaacagg
1741 ggatgtgcgc gttgtcgtac ttatgctgat gctgatggtc ggtggtcgat tatatatact
1801 ggaacttctg gtgtatgctc ttctcttctg gggagacgta atctaacgaa gaagaatgtg
1861 tgtcattgtg gcctgtgcta caaaccctgc tgtatgggcc tgtctataag aaaacacgga
1921 tggagttgtg acgttatgtt ctgacagttt atttactaat gagcacatac tttgatctaa
1981 ctagaacgaa gagaagttca cggaactgtc ggacacatgc agcaaggatc ctcattataa
2041 tacgaatcac tcttcgtttg cattc
//
LOCUS MZECAT3 1790 bp ss-mRNA PLN 26-JUL-1990
DEFINITION Z.mays catalase isozyme 3 (CAT-3) mRNA, complete cds.
ACCESSION M33103
KEYWORDS catalase isozyme 3.
SOURCE Z.mays seedling, cDNA to mRNA.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1790)
AUTHORS Redinbaugh,M.G., Wadsworth,G.J. and Scandalios,J.G.
TITLE Characterization of catalase transcripts and their differential
expression in maize
JOURNAL Biochim. Biophys. Acta 951, 104-116 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 22 1509 catalase isozyme 3 (EC 1.11.1.6)
mRNA 1 1790 catalase isozyme 3 mRNA
BASE COUNT 386 a 550 c 550 g 304 t
ORIGIN
1 cgtgggtagc tagctaggtg aatgacaatg gatcctacca agttccgtcc gtccagcagc
61 cacgacacga cggtgacgac gacgaacgct ggcgctcctg tgtggaacga caacgaggcg
121 ctgactgtgg ggcctcgcgg tcccatcctg ctggaggact accacctgat cgagaaggtg
181 gcgcacttcg accgcgagcg catcccggag agggtggtgc acgcgcgtgg cgcgtccgcc
241 aagggcttct tcgagtcgac ccacgacgtg acgtcgctga cgtgcgccga cttcctgcgc
301 gcgcccggcg tgcggacgcc cgtgatcgtg cgcttctcgc aggtgatccc agagccgggg
361 tccggacgga cgatccgaga cgcgcgcggg ttcgccgtga agttctacac ccgcgagggc
421 aactgggacc tgctgggcaa caacttcccc gtcttcttca tccgcgacgg catcaagttc
481 cccgacgtga tccacgcgtt caagcccaac ccgcggtcgc acgtgcagga gtactggcgg
541 gtgttcgact tcctgtcgca cctccccgag agcctgcaca ccttcttctt cctcttcgac
601 cacgtgggcg tgccgtccga ctaccgccac atggaagggt tcggcgtgaa cacgtacacg
661 ttcgtgagcg cggcggggaa ggcgcagtac gtgaagttcc actggaagcc gacgtgcggc
721 gagcggtcca tcctgacgga cgaggaggcg cgcgtcgggg gacggaacca cagccacacg
781 caggacctgt acgactccat cgcggcggag gggagcttcc cggagtggac gctgtacgtg
841 caggtgatgg acccggcaca gcaggagcag tacgacttcg acccgctgga cgacaccaag
901 acgtggccgg aggacctgtt gccgctccgc cccgtgggga ggctggtgct ggacaggaac
961 gtggacaact tcttgaacga gaacgagcag ctggcgttcg ggccggggct ggtggtgcca
1021 gggatctact actcggacga caagatgctg cagtgccggg tgttcgccta cgccgacacg
1081 cagcgctaca ggctgggtcc caactacctg atgctgcccg tcaacgcgcc gcgctgcggc
1141 acccacaaca accactacga cggcgccatg aacttgatgc accgcgacga ggaggtggac
1201 tactacccgt ccaggcacgc gcgccgctgc ggcagggcgg cgcccacgcc actgccgccc
1261 aggccggtcg cggggaggag ggagaaggca accatacgca agcccaacga cttcaagcag
1321 ccaggggaga ggtaccgctc ctgggacgcc gaccgacagg accgattcgt gaaggcgatt
1381 cgccgactcg ctcggacacc caaacgtcag ccagagctca ggtccatctg gatagacctc
1441 ctcgccaagg tcgacgcgtc gctggggatg aagattgcca cccggctcaa catgaaggca
1501 aacatgtgat gcttgtgctg aatagaataa taatgaagac gcatgcatgt cgtcgccagg
1561 aacaagagaa ataataacaa gaccaccacg catgggcata ctccatatat atatgtatag
1621 cccgtgcccg tgtccgcctt tgtaccaata caagccaaga ctagtggatg tattattatt
1681 attattattg cgctatcaca tacatgtacc cctgctacct gaagatggat attgtatcca
1741 gttatcaaat taagacacct gcagcaaaaa aactatatat gttgcataag
//
LOCUS BRVRNASA 197 bp ss-mRNA VRL 26-JUL-1990
DEFINITION Berne virus ORF5 mRNA, 5'end.
ACCESSION M33503 M33501
KEYWORDS core protein.
SOURCE Berne virus (strain P138/72) viral RNA.
ORGANISM Berne virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Toroviridae.
REFERENCE 1 (bases 1 to 197)
AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M.
TITLE A 3'-coterminal nested set of independently transcribed mRNAs is
generated during Berne virus replication
JOURNAL J. Virol. 64, 331-338 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 137 > 197 ORF5
mRNA 113 > 197 RNA5
BASE COUNT 50 a 27 c 38 g 82 t
ORIGIN
1 ttatttcttc ttcctacttt gtggctactt gggttttgtt ggtggtggtt attattttag
61 tatttataat tataagtttt tgtattagta attaagtagg ttagtgagag acactatctt
121 tagagaaaga gccaagatga attctatgct taatccaaat gctgtgccat ttcaaccatc
181 acctcaggtt gttgcat
//
LOCUS BRVRNASB 179 bp ss-RNA VRL 26-JUL-1990
DEFINITION Berne virus ORF3 mRNA, 5' end.
ACCESSION M33502
KEYWORDS core protein.
SOURCE Berne virus (strain P138/72) viral RNA.
ORGANISM Berne virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Toroviridae.
REFERENCE 1 (bases 1 to 179)
AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M.
TITLE A 3'-coterminal nested set of independently transcribed mRNAs is
generated during Berne virus replication
JOURNAL J. Virol. 64, 331-338 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 153 > 179 ORF3
BASE COUNT 52 a 17 c 34 g 76 t
ORIGIN
1 ttataatctt cttcctactt ggattacatg gcttacttta ggttttagtt tgtttagtat
61 agtaataagt ggtattaata ttattttgtt ttttgaaatg aatggtaagg tgaagaaaag
121 ttagtcactt tctttagaag aaggttgcca aaatgtttga gaccaattat tggccattt
//
LOCUS CHKGLOBA 1204 bp ds-DNA VRT 26-JUL-1990
DEFINITION Chicken pie-alpha-globin gene, fragment H3/H4.
ACCESSION M30485
KEYWORDS pie-alpha-globin.
SOURCE Chicken AEV transformed erythroblast DNA, fragment H3/H4.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 1204)
AUTHORS Broders,F., Zahraoui,A. and Scherrer,K.
TITLE The chicken alpha-globin gene domain is transcribed into a
17-kilobase polycistronic RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 91 > 1204 pie-alpha-globin mRNA fragment H3/H4 (put.)
BASE COUNT 282 a 252 c 263 g 407 t
ORIGIN
1 ggatctatct agttgctgca gtcgtttgta tgaaggttgg atccatcctg ttttgtactg
61 gatgactgcc ttcaattcac tggcaatcta ggatcaaatg tgtcctagag aacattcaat
121 atcgcttttt ttctaagctg ttgcaagcca gaatggttac ttttgagctg atctcggtgg
181 agcagttgag ttgttgtaag ttatttctta atggctccag aaaattacat catttaggtg
241 ctataactct ccatttccat cttgtatgcg taattgcatt tcttgaatac ttcagacatt
301 aatttcccgt cctacctgca ggttactggt gtgtattggc tatacagatt acttttccac
361 agatgtaacc ctaggtcttt tgaatataga tcccatctat tgtctgctta gagaccccga
421 taaccctccc gataaatcag agtccatgtt ttttgacagt atatcggtgt gaacatctgg
481 attttagtgc aatatgctag tagcaatctg agtccccgtt tctaagacag agtcatttag
541 tccgagaatg gctgtttaag actccaaatg gcagtcttga gtcttttagt gactgtactc
601 gttcctctac tgagggcagt cttgagtgtt ttagtgactg taccctgtct cttaacttga
661 ccggtctgat agatcttaaa tgacagtcgt ggccgcaatt tcaaatggaa gagctaggag
721 tctcaggaac cgtcgccctt gtttactctt atgtttaccc gttaagccgt catgaaaagg
781 atttttctgt agagaacggt tatatgagtt gtattccatc tagggtcacg gcccctagac
841 caaccaacga cgagtcgatt tgttgtctgg cactttctgt gacttcaagt tttgtggctt
901 tctctattaa ctttccccac aacgtaactg tctaacttag atgttggcgc gagaactaca
961 gtctgaggga cttgtcaaga gctggcacac tcgcctttat gttaaagtgt gtcctttgtc
1021 gatactggta ctaatgctta agctcgagcg ggcccctaga ccaacgacga gtcgatttgt
1081 tgtctggctc tttctgtgac ttcaagtttt gtggctttct ctattaactt tcccacaacg
1141 taactgtcta attagatgtt ggcgcgagaa tacagtctga gggattgtca agagtggact
1201 ggtt
//
LOCUS CHKGLOBB 582 bp ds-DNA VRT 26-JUL-1990
DEFINITION Chicken pie-alpha-globin gene, fragment H10.
ACCESSION M30486
KEYWORDS pie-alpha-globin.
SOURCE Chicken AEV transformed erythroblast DNA, fragment H10.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 582)
AUTHORS Broders,F., Zahraoui,A. and Scherrer,K.
TITLE The chicken alpha-globin gene domain is transcribed into a
17-kilobase polycistronic RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 241 > 582 pie-alpha-globin fragment mRNA H10
BASE COUNT 171 a 128 c 108 g 175 t
ORIGIN
1 tccaaaaaac ttactctgct tgtaaatgtc gtctcctttt tcggagacaa aaacttgata
61 ccttcttgcc ttgtccgaag tcactttatc ggttatagga cccaagtttt gggccttgct
121 agaaggatac aattccctat gaccgccgta ttttggggta ctcgcattcg cccgacatcg
181 agtggacctc ctttttttct cttgtcgttc gtagaggtta tcgaggtccc cccatatata
241 ataaccctat cgtgagttta gacttcctac aaaaacttct gtcgtttaat gttttcgtac
301 cgtcacggtg actgtccagt aatcaaagtt gtcactgtct aaaaagattc gacaacttcg
361 tcttaccaat gcgaaaactc gactagagac actcgtcaac tcacacattc aataaagaat
421 taccgaggtc ttttaatgta gtgaaatcac gatattgaga ggtaaaggta gaaacatacg
481 cattaaccta aagaacttat gaagtctgta attaaaggac cacaagcaat acgaaagaca
541 atgtatttct tctaacgtcg gataagtatt aggatggacg tc
//
LOCUS ECOPHOAA 600 bp ds-DNA BCT 26-JUL-1990
DEFINITION E.coli alkaline phosphatase (phoA) gene, 5' end.
ACCESSION M33536
KEYWORDS alkaline phosphatase.
SOURCE E.coli (strain K-12) cell line BW7710 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 600)
AUTHORS Agrawal,D.K. and Wanner,B.L.
TITLE A phoA structural gene mutation that conditionally affects
formation of the enzyme bacterial alkaline phosphatase
JOURNAL J. Bacteriol. 172, 3180-3190 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.K.Agrawal, 03-APR-1990.
The phoA503 mutation does not interfere with export of active
enzyme but does interfere with assembly.
FEATURES from to/span description
pept 283 > 600 alkaline phosphatase precursor (phoA) (EC
3.1.3.1)
sigp 283 345 alkaline phosphatase signal peptide
matp 346 > 600 alkaline phosphatase
variant 413 413 c in wild type; t in phoA503 mutation
BASE COUNT 159 a 130 c 151 g 160 t
ORIGIN Map position 8.7 minutes; 1 bp upstream of HindIII site.
1 aagctttgga gattatcgtc actgcaatgc ttcgcaatat ggcgcaaaat gaccaacagc
61 ggttgattga tcaggtagag ggggcgctgt acgaggtaaa gcccgatgcc agcattcctg
121 acgacgatac ggagctgctg cgcgattacg taaagaagtt attgaagcat cctcgtcagt
181 aaaaagttaa tcttttcaac agctgtcata aagttgtcac ggccgagact tatagtcgct
241 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt
301 gcactggcac tcttaccgtt actgtttacc cctgtgacaa aagcccggac accagaaatg
361 cctgttctgg aaaaccgggc tgctcagggc gatattactg cacccggcgg tgctcgccgt
421 ttaacgggtg atcagactgc cgctctgcgt gattctctta gcgataaacc tgcaaaaaat
481 attattttgc tgattggcga tgggatgggg gactcggaaa ttactgccgc acgtaattat
541 gccgaaggtg cgggcggctt ttttaaaggt atagatgcct taccgcttac cgggcaatac
//
LOCUS GCOEARA 1771 bp ds-DNA PLN 26-JUL-1990
DEFINITION G.tikvahiae McLachlan 18S ribosomal RNA gene.
ACCESSION M33640
KEYWORDS 18S ribosomal RNA.
SOURCE G.tikvahiae McLachlan (isolate Pomquet Harbour-Nova Scotia) DNA.
ORGANISM Gracilaria tikvahiae McLachlan
Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae;
Florideophycideae; Gigartinales; Gracilariaceae.
REFERENCE 1 (bases 1 to 1771)
AUTHORS Liu,Q.-Y., Bird,C.J., Rice,E.L., Murphy,C.A. and Ragan,M.A.
TITLE Nucleotide sequence of the 18S ribosomal RNA gene from the red alga
Gracilaria tikvahiae mclachlan
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ragan 08-APR-1990. Atlantic Research Lab, National Research
Council of Canada, 1411 Oxford Street, Halifax, Nova Scotia CANADA
B3H 3Z1
FEATURES from to/span description
rRNA 1 1771 18S ribosomal RNA
BASE COUNT 445 a 371 c 501 g 454 t
ORIGIN
1 ccacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc
61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct
121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccata
181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt
241 tttttctgat cgcactcgtt gcgacgcacc gttcaaattt ctgacctatc aactttggat
301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg
361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc
421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta
481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc
541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc
601 tcgtagtcgg attttggcgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg
661 cctttgtgga tgggagtcag gtggtgcttc actggatcgc ttggctgccg ccaccgttta
721 ctgtgaaaaa attagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata
781 atagaatagg acccggtcct attttgttgg tttgtttgaa tcgggtaatg attaagaggg
841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg
901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga
961 tcgaagacga tcagataccg tcgtagtctt tac
tataaac gatgaggact ggagatcgga
1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg
1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg
1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg
1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt
1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag
1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc
1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc
1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat
1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa
1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct
1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat
1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt
1741 ttccgtaggt gaacctgcag aaggatcaag c
//
LOCUS HS6MCP 4440 bp ds-DNA VRL 26-JUL-1990
DEFINITION Human herpesvirus type 6 major capsid protein (MCP) gene, complete
cds.
ACCESSION M33515
KEYWORDS major capsid protein.
SOURCE Human herpesvirus type 6 DNA.
ORGANISM Human herpesvirus type 6
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 4440)
AUTHORS Littler,E., Lawrence,G., Liu,M.-Y., Barrell,B.G. and Arrand,J.R.
TITLE Identification, cloning, and expression of the major capsid protein
gene of human herpesvirus 6
JOURNAL J. Virol. 64, 714-722 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 235 4272 major capsid protein (MCP)
BASE COUNT 1422 a 1169 c 785 g 1064 t
ORIGIN
1 tatcgtgaac gatatttggc ccggacgttt gaaaaatttt ctctatgatt gactcgatct
61 tttccagaac tacaggcatg gatcgcgcta aacgagtttc ctcgtcgcga gacacttcag
121 cggtcagatc acacgaatct ataaaaactg gaatcgaccg tgcacaagtg gaaccaaaac
181 atgaattaac tattaaagtt tcacaattac cggtgtgctg cataacgccg aaacatggaa
241 aattggcagg cgaccgaaat tttacctaag atcgaagcac ctctaaatat tttcaatgac
301 attaaaacat acacagccga acaacttttt gacaatttgc gaatttattt cggtgacgat
361 ccgagccgtt acaacatcag ttttgaagcc ttactcggaa tctactgcaa caaaatagaa
421 tggattaact ttttcaccac gccgatcgcc gttgcagcga acgtaatccg cttcaatgat
481 gtgagtcgaa tgaccctcgg gaaggttctc ttctttattc aattacctag agtcgctaca
541 ggaaacgacg taactgcttc aaaagaaacc accatcatgg tagccaaaca ctcagaaaaa
601 caccccataa acatatcgtt cgatttgagc gctgcctgtc tggaacatct ggaaaacaca
661 tttaaaaaca cagtcatcga tcagatttta aacatcaatg cgttacatac agtcttaaga
721 tctttaaaga attcagccga ttcgctcgag cgaggtttga ttcacgcatt catgcaaacc
781 ttattgagaa aatctccccc gcaatttatc gtcctgacca tgaatgagaa caaagtacat
841 aataaacaag ctctgagccg agtacagcgc agcaacatgt ttcagagcct gaagaacaga
901 ttgttaacgt cattattttt tttgaacagg aataataata tttcatatat ctatagaatt
961 ctaaacgaca tgatggaatc ggtcacggaa agcattctaa atgatacgaa caactacact
1021 tccaaagaaa acgtccccct agatggtgtt ttattaggac cgatcggctc tatccaaaaa
1081 ctcaccagca tactctccca gtacatctcc acacaagtcg tctccgcccc aatctcatat
1141 ggtcacttta ttatgggcaa agaaaacgca gtgactgcga ttgcataccg tgcaatcatg
1201 gccgatttta ctcaattcac cgtgaacgcc gggacagaac aacaagacac taacaacaaa
1261 tcagaaatct tcgacaaaag ccgcgcgtac gccgacctaa agctgaacac gttgaaattg
1321 ggagataaat tagtcgcatt cgaccaccta cacaaagttt acaaaaacac agacgtcaac
1381 gatccgctag aacagagctt acaactaaca ttctttttcc ctttgggtat ctacataccg
1441 agcgagaccg gtttcagtac aatggaaaca cgtgtgaaat taaacgacac catggaaaac
1501 aacctaccca ccagcgtttt tttccacaat aaagaccaag tcgtgcagcg aattgatttt
1561 gccgacatat taccgtcggt ttgccatccc attgtccacg actcgaccat cgtcgaacga
1621 ctcatgaaaa gcgaaccatt gcctaccggc caccgctttt cccaactatg tcaactaaaa
1681 attacccgag aaaacccagc caggatctta cagaccttat acaacttata cgaaagtcga
1741 caagaagtac ccaaaaacac caacgtctta aaaaacgaat taaacattga agatttttac
1801 aaaccggaca atccaacact gccgaccgaa agacacccct tcttcgatct cacgtatatc
1861 cagaaaaacc gagccacaga agtactctgc acaccaagaa taatgatagg caacatacct
1921 ttaccgttag ctccagtctc tttccacgaa gcccgtacaa atcaaatact ggaacatgca
1981 aagacgaact gccaaaagta cgacttcacc ctcaaaattg tcaccgaaag cttgacgagt
2041 ggctcgtacc cagaattggc ttacgttatc gagaccttag tgcatggaaa caagcatgct
2101 tttatgatcc taaaacaagt aattagccag tgtatttctt attggtttaa catgaaacat
2161 atacttcttt tttgcaacag cttcgagatg atcatgctaa tctctaacca catgggcgac
2221 gaactgatcc cgggagcagc tttcgctcac tacagaaatc ttgtgtcgct aattcgccta
2281 gtgaagagaa caatctctat ctccaacctc aacgagcaac tttgcggcga acctctggtg
2341 aatttcgcca acgcgttgtt cgacggacgt ctgttctgcc cgttcgtcca taccatgccc
2401 agaaacgaca cgaatgcaaa aataacagcg gatgatacac cactgacaca gaacaccgta
2461 agagttagaa attacgaaat atccgatgtg caaagaatga atctaataga ttcaagcgtc
2521 gtctttaccg acaatgacag accatcgaac gaaaccacca tcctgagcga gatattttac
2581 ttctgcgtac tcccggcact atcaaataac aaggcctgtg gcgctggcgt caacgtaaag
2641 gaactagttc tagacttatt ctacacggaa ccgttcatca gtccagatga ttatttccag
2701 gagaatccga ttaccagcga cgttctaatg tctctgatcc gagaaggtat gggccctggc
2761 tacaccgtag ccaacacatc ctgtatcgca aaacagttgt ttaaatcgct aatctacatt
2821 aatgaaaata cgaaaatatt ggaagtggaa gtctccttag atcccgcgca gcgacacggc
2881 aactccgttc attttcaatc actacaacac attctataca acgggctttg cctgatctca
2941 ccgatcacca ccctaagacg gtactatcaa ccaatcccat ttcatcgatt cttctccgac
3001 ccgggaatct gcggcaccat gaatgctgat atccaagttt tcctaaatac atttcctcac
3061 tgtcaaagaa acgacggcgg ttttcctctc ccgcccccat tagcattaga attttataat
3121 tggcaacgaa caccgttttc cgtgtactca gccttctgcc ccaattccct gttgagcatt
3181 atgacgcttg ccgccatgca ctcaaaattg tctcccgttg ccatagcgat ccaaagcaaa
3241 aacaaaatcc atccgggctt tgcggccaca ctagtccgga cggataattt cgacgtcgag
3301 tgcctattat acagttccag agcagccaca tctataattt tagacgatcc cacggtcacc
3361 gcggaagcta aagatatcgc aaccacttac aacttcaccc agcacctaag ttttgtagat
3421 atgggcttag gttttagctc taccaccgcc actgccaatc ttaagcgaat taaatcagat
3481 atggggagca agatacaaaa ccttttctcc gccttcccga tacacgcgtt taccaacgcg
3541 gacataaata cgtggattcg acatcacgtc gggatagaaa aacctaatcc ctccgagagc
3601 gaagcactaa acatcataac gttcggcgga attaacaaaa acccaccctc catactactg
3661 catggtcaac aagctatctg cgaagttata ctgaccccgg ttacgacaaa cattaacttt
3721 ttcaaatcgc cccacaaccc aagaggcagg gaatcatgta tgatgggaac ggacccgcac
3781 aacgaagagg cggctagaaa agcattgtac gaccacaccc aaacagacag cgatacattc
3841 gccgcaacca caaacccttg ggcatctcta ccaggctcct taggcgatat tctatacaac
3901 acggcacaca gagaacaact atgttacaac cccaagacat acagtcccaa cgctcaattt
3961 tttaccgaat ctgacatctt aaaaacaaac aagatgatgt acaaagtgat aagcgaatac
4021 tgcatgaaat cgaactcgtg tttaaacagc gatagcgaaa tacaatactc gtgctctgag
4081 ggcacggata gcttcgtaag cagaccatgc cagttcttac aaaacgctct gcctcttcac
4141 tgttcatcca accaagctct attagagagt cggtctaaaa ccggcaatac gcagatcagc
4201 gaaacccatt attgtaatta cgccatagga gaaaccatac ctttccaact cattatcgaa
4261 tcatccatat aaaatggaaa ccgtctactg cactttcgat cacaaactgt cactttccga
4321 tatcagcacc ctatgcaagc tcatgaacat cgtcataccg atcccagctc accaccatct
4381 aataggtagc ggcaatttag gtctttatcc catcgtctcc tccaacaaag attacgtcca
//
LOCUS HUMSEXREPB 916 bp ds-DNA PRI 26-JUL-1990
DEFINITION Human sex chromosome repeat, clone pDP330.
ACCESSION M33524
KEYWORDS sex chromosome repeat.
SOURCE Human cell line OXENII DNA, clone pDP320.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 916)
AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and
Page,D.C.
TITLE Human sex-chromosome-specific repeats within a region of
pseudoautosomal/Yq homology
JOURNAL Genomics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.M.C.Fisher, 03-APR-1990.
FEATURES from to/span description
site 1 388 low copy flanking sequence
rpt 389 916 sex chromosome repeat
BASE COUNT 206 a 228 c 229 g 253 t
ORIGIN Chromosome Yp.
1 gaattcaggc ctcagtgtat gtctgtaaca caacagacag ggtctgcagg ggtcgaagta
61 ttttgtcatc aaagaggaag gaatgatcat tcatcataaa aggcaagaca tctttggtgc
121 aaggaaaact caagaaaaat accgcagacc atgcaatgag gcactggtcg atggagtgtt
181 gtaaacccgt cttcccagag tggcatgcac atggatccct cagcacatgg gtgacacaca
241 gactatgctt cagcaggtct gtctgggccc aagacacatt gtttctcatc agctcccagg
301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct
361 tcccagagtt gttacaaaga aatcccaaag actgggcccc ttaaacaaca accttgattc
421 tcacagtcct tgaggctaga agtctgagat caagctatgg ccagggctgg ttcctcctga
481 ggcctctctc cttgggttgt agatgctgtc ttctccctgt gtcctcacag ggttgtccct
541 ctgtgtgtgt ctgtgtcctc atctcctctt cttatgaggt gtcttagtcc atttcaggct
601 gctgtcacag catgccgtag actgggtggc ttatcagcaa cagacattga ttctcccaca
661 gtcctggaag ctggacgtct gagatcaggg tatgggcagg gctgcttcct cctgaggcct
721 ctgtcctggg cttgtagatg ctgtcttctc catgtgtccc catgtggtca tccctctgtg
781 ggtgtgtctg tttcctcatc tgctcttcta atgagatgtc ttagtccatt gcaggctgct
841 atcacagaat accataggct gggtggctta taaaccacag agttttattc ttccacagtc
901 ctggaggctg gaattc
//
LOCUS HUMSEXRPA 918 bp ds-DNA PRI 26-JUL-1990
DEFINITION Human sex chromosome repeat, clone pDP316.
ACCESSION M33523
KEYWORDS sex chromosome repeat.
SOURCE Human cell line OXENII DNA, clone pDP316.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 918)
AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and
Page,D.C.
TITLE Human sex-chromosome-specific repeats within a region of
pseudoautosomal/Yq homology
JOURNAL Genomics (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.M.C.Fisher, 03-APR-1990.
FEATURES from to/span description
site 1 388 low copy flanking sequence
rpt 389 918 sex chromosome repeat
BASE COUNT 207 a 242 c 226 g 243 t
ORIGIN Chromosome Yp.
1 gaattcaggc ctcagtgtct gtctgtaacc caacagacgg tgtctgcaga gatcgaagta
61 ttttgtcgtc gaagaggaag gaatgatcat tcatcacaaa aagcaagaca tctttggtgc
121 aaggaaaact cgaggaaaat accgcagacc atgcaatgag gcactggttg acggtgtgtt
181 ataaacccgt cttcccagag tggcatgcac acggatccct caggacatgg gtgacacaca
241 gactatgctt cagcaggtct gtctgggccc aagacacagt gtttctcatc agctcccagg
301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct
361 tcccagagtt gttacaatgc aatcccaaag actgggcagc ttaaacaaca accttgattc
421 tcccacagtc ctggaagctg gaagtctgag atcaaggtgt gggcagggcg gttcctcctg
481 agtcctctct cctgggcttg tagatgccgt cttctccctg agtccccacg tggtcatccc
541 tctgtgtgcg tctgtgtcct catctcctct tcttatgagg tgtcttagtc catttcaggc
601 tgctgtcaca gcataccata gactgggtgg cttataagca acagacattg attctcccac
661 agccctggag gctggacgtc ttgagatcag gatatgggca aggctgtttc ctcctgaggc
721 ctctgtcctg ggcttgtaga caccatcttc tccctgtgtc cccacgtggt catccctcta
781 tgtgcatgtc tgtgtcctca tctgctcttc ttatgagatg tcttagtcca ttgcaggctg
841 ctatcacaga ataccatagg ctgggtggct tacaaaccac agacttttat tctcccacag
901 tcctggaggc tggaattc
//
LOCUS IRICAP 2461 bp ds-DNA VRL 26-JUL-1990
DEFINITION Iridescent virus type 1 capsid protein gene, complete cds.
ACCESSION M33542
KEYWORDS capsid protein.
SOURCE Iridescent virus type 1 DNA.
ORGANISM Iridescent virus type 1
Viridae; ds-DNA nonenveloped viruses; Iridoviridae.
REFERENCE 1 (bases 1 to 2461)
AUTHORS Tajbakhsh,S., Lee,P.E., Watson,D.C. and Seligy,V.L.
TITLE Molecular cloning, characterization, and expression of the Tipula
iridescent virus capsid gene
JOURNAL J. Virol. 64, 125-136 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 601 1995 capsid protein
mRNA 587 > 2461 capsid protein mRNA ( 5' end +/- 5 bp)
BASE COUNT 717 a 462 c 443 g 839 t
ORIGIN
1 gaaggtgttg aaagatctac tgaaataggc ttcattagca tttttatttt gtccacaaat
61 tcattatttt taataggctg ttcttcacct ttattcgcat attcaaagta atcgattaaa
121 tttttttgaa tatggacgat atcatccatg aacataaacc aaacttcata atatatagta
181 tggagtaacg ggttaattaa accattgatt ccttttaatt gttttggatt aatgaggttt
241 aaatcatcat aaattttttc tatttttttt aaattttttc gagcaatttt taaatttgat
301 ttaaccaaac aaacttcctc tactttaatt gttacggttg gtacttttaa accattaatt
361 ttatttttag aggaagaaca acgctttatt aaagcgttgg aatccattaa tcgcttgttt
421 tatcataggt tattttttaa ctataaaaaa ataactaaat tactacagtt accaatatgt
481 cggcattagt tctccttcat attttcgtat tttataccct taaatttaac ctaatcaatt
541 tctacattta tttttgggtt caaaattttt agccgaaata ttgctactaa taaattaaac
601 atgtctatgt cctcatcgaa tataacctca gggtttatcg atatcgccac ttttgacgaa
661 atcgaaaaat atatgtatgg cggcccaaca gcaacagcat actttgttag agaaattaga
721 aagtcgactt ggttcactca agtaccagtt ccactatcta gaaatactgg taatgcggct
781 tttggacaag aatggtcggt atctatatca cgtgctggag attatttgtt gcagacctgg
841 ttacgagtca atatcccacc agttactctt agtggtctac ttggtaacac ttactcttta
901 agatggacca aaaatttaat gcataacttg attcgtgaag ccaccattac ctttaatgat
961 ttggttgcag ctcgatttga taactatcat ttggatttct ggtctgcttt caccgtacct
1021 gccagcaaac gcaatgggta tgataacatg attggtaatg tctcttcttt aattaatcca
1081 gttgctccgg gtggtacttt gggtagcgta ggtggtatta accttaatct tccacttcca
1141 tttttcttct ctcgagatac tggtgtagca ctaccaacag ctgctctacc ttacaatgag
1201 atgcaaatca actttaattt cagagattgg catgagcttt tgattttgac taacagtgct
1261 ctagtaccac cagcaagtcc atatgttcca attgttgtag gtactcatat ttcagctgct
1321 ccagttttag gaccagttca agtatgggct aactatgcca tcgtctccaa cgaagaacgt
1381 cgtagaatgg gttgtgccat tcgagacatt ttgattgaac aggttcaaac ggcaccacgt
1441 caaaattatg tacctttgac caatgctagt ccaacatttg atattcgttt ctctcatgca
1501 atcaaagcat tattctttgc tgtacgaaat aaaacatctg cagcagaatg gtcaaattat
1561 gctacttctt ctccagttgt tactggtgca acggttaact acgaaccaac aggttctttt
1621 gaccctattg ccaatacaac attgatttat gagaacacta atcgtttggg tgccatggga
1681 tcagattact tctctttgat taatccattc tatcatgctc caactattcc atcattcatt
1741 ggatatcatt tgtactcata ttctcttcac ttttatgact tggatccgat gggttctacc
1801 aattacggta aactcactaa tgtgtctgtt gtaccccaag ctagtccggc agcaattgcg
1861 gcagcaggag gtactggtgg tcaagcaggt tcagattacc ctcaaaatta tgaatttgtc
1921 atattagctg tcaataataa tattgtcaga atatcaggtg gagaaacacc acaaaattac
1981 atagcagttt gttaaggtaa tttgtaacgc tccacaacag gcggaagtgg tctcgtgaga
2041 gaccgatatt gaggttttat caaccttaat ttgaatcatg aattaacatg atactttggt
2101 accgtctagt cggcttatat gtcgggctaa tggtcttttt tgatcatcaa gtggctataa
2161 gtggtacgtc gacgacagtc gacacctagt ggtttaataa aggtttttta cccaaattaa
2221 actggaacag gcaaggttga tgaaaacggt caaaattcag atagtctcgg gggctatttt
2281 ggacaagacc gtcggtgcag ctaatgcgta agcatcagtg atatcgctat cgactgggtc
2341 atcaatcggt tgtcctatct gactttttaa agtctcagga tggctcaatg tacagtcagc
2401 ccgcagtaag gtgtattccg agctgtcttt gaggataaaa gtaaacttga aaaagaagct
2461 t
//
LOCUS MUSIGHAAR 363 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-J region, partial cds.
ACCESSION M33679
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 363)
AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J.
TITLE Clustered heavy chain somatic mutations shared by anti p
azophenylarsonate antibodies confer enhanced affinity and ablate
the cross-reactive idiotype
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital,
Jackson 1402, Blossom Street Receiving, Boston, MA 02114
FEATURES from to/span description
pept < 1 > 363 Ig heavy chain V-D-J region (AA at 1)
BASE COUNT 98 a 83 c 89 g 93 t
ORIGIN
1 gaggttcagc ttcagcagtc tggagctgag ttgatgaggc ctgggtcctc agtgacgatg
61 tcctgcaagg cttccggata tgcaatcaca agctacggtt taaactgggt gaaacagagg
121 cctggacagg gcctggaatg ggttggatat attcatcctg gaaaaggtta tattcactac
181 aatgaaaaat tcaagggcaa gaccacactg actgtagaca aatcctccaa tacagcctac
241 atgcaggtca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcgttt
301 tttgacattt acatgtatta ctttgactac tggggccagg gcaccactct cacagtctcc
361 tca
//
LOCUS MUSIGKABF 324 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Mouse Ig rearranged L-chain mRNA V-J region, partial cds.
ACCESSION M33678
KEYWORDS immunoglobulin light chain; joining exon; processed gene;
variable region.
SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J.
TITLE Clustered heavy chain somatic mutations shared by anti p
azophenylarsonate antibodies confer enhanced affinity and ablate
the cross-reactive idiotype
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital,
Jackson 1402, Blossom Street Receiving, Boston, MA 02114
FEATURES from to/span description
pept < 1 > 324 Ig light-chain V-J region (AA at 1)
BASE COUNT 96 a 77 c 73 g 77 t 1 others
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagntgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaaaat taaagtcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagtga cctggagcat
241 gaagacattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
301 ggcaccaagt tggaaatcaa acgg
//
LOCUS MUSTCVYAN 2567 bp ds-DNA ROD 26-JUL-1990
DEFINITION Mouse T cell receptor rearranged beta-chain gene, V-2 region, 5'
end.
ACCESSION M33500
KEYWORDS T cell receptor; beta-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c) DNA, hybridoma B.1.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2567)
AUTHORS Ratanavongsiri,J., Igarashi,S., Mangal,S., Kilgannon,P., Fu,A. and
Fotedar,A.
TITLE Transcription of the T cell receptor beta-chain gene is controlled
by multiple regulatory elements
JOURNAL J. Immunol. 144, 1111-1119 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 2544 > 2567 T cell receptor beta-chain V-2 region
precursor
sigp 2544 > 2567 T cell receptor beta-chain signal peptide
mRNA 2478 > 2567 T cell receptor beta-chain mRNA
BASE COUNT 708 a 560 c 583 g 716 t
ORIGIN
1 ctaaagttct tggctactgt tgtgtgcact ttgagtaatg attaagatgc attgggacag
61 ggggtggaga aatgtcccaa ggaggtagcc atgacctcca acactggtcc tgtggaggcc
121 ccgaggagct agctagccat ctgatctgga aacaagaggc ttaacctggc tcagtactga
181 aagctggtca agataagagg gggcaggcag atacctggag gcactgacct tgggaggcag
241 gaaggttagc aagggagata actggagtgt gagagacatt ctgatcccaa tcttgttaga
301 ggattaggct gaagagggtt cagtgtgaag ctcagtaaac tgagaagggc ctaggtttcc
361 ttctcctgga gtctgcttgg ctggacagag cacactgtcc ttagaaaagc aacagagctc
421 tcctggagga gctaggagcc actgacttca gacccaggga atatcttctc taccctcttc
481 cttctggctc ttaaggaggc tcacagggag cttatttagc tttttaagga gatttataga
541 ggctggagga acttgttttt tcaaaagtaa atgctctaga aaaatgaagg ttgaaggtgt
601 tatcaaactt gtgggtcaaa gctaaatgaa aaaaaaaatc aaaagaagga catgtctatt
661 cccaacataa gcagaagact tttattataa atatggtggg agaccatagt cagagacaga
721 gacagctggg aaaggccagc atgaacttga ccctgagcct ggacatctga ggacttgggg
781 gagcaggtgg gaagaaagaa gagagaaaag agagaagagg ggagaccagg agagtaaaga
841 gtagacaaaa ggacagcata gcaaaaatag ctggatttat aggggaaggt agctggggaa
901 aaggcagccc atcccctggg ctggagaagt ttagattaga gggtctgtat tctggccata
961 tcatatacta ggtaggacta aggaatgctg agtgaagctg gcatccaggt ccacaatgac
1021 atgttaaata agaacttcag ttagccattt gctttgggat tgaggcataa taaacgccag
1081 taccccaagc cagctctgtc cacttgtcct cagtaagtga acttaaacag ccaaaccagt
1141 aatctaaata actaactaac taactaacta aatcaatcaa tcaatcaatc aataaaagta
1201 gaaaagattt tttcagtgta aacacattgg taacatggaa aaagatccag agatccagta
1261 aactccctgt gtcagtcttg gggacctgca ggcaagatgg aagtttagag ggccaaggat
1321 aagcaatcta gctcaaagta tggtcctgcc ctgcattgac ccattgccta ggcttgttaa
1381 agctgtgtga aatctctttc caggagatac attcccactc tcgctggtgc ctttcctttc
1441 ttccatgttt tcctggggaa atttctcttt ctttggggtc acttttatca atagcctgct
1501 gttcagattg aaagactgtc tctttagaat gtctttattt ctgccaggtc agttatagaa
1561 agtggcatgt tttcctttat tcaggacaaa actcccattt tgattttctg cttgcattcc
1621 tggagtcaga cagatgagta ttcactgcat acagcctcgt ataaccctgc aaccacctcc
1681 acatgttcac ttaaatggag acattttact ctcttgcaag agcttgaaac tcaaactcag
1741 atctgtgaaa ctataaatcc agtttccttc catccctgct cctggagtga tgaccctgag
1801 actaattatc aataaatgcc tagagcataa gctccagcta gttctctgac ttgctctcaa
1861 cttattatgc cttttattct aacccagctt tagctacatg gctggtttcc tctccttgtc
1921 ttcttacttc agtctcctca gcattacagc tcgaatctct gttctatttc tcaagttcct
1981 ctacctgctg gattatgtcc ttttcctcag tgttccaggc aatctctact tttattctat
2041 cttgagtgac tagttacttc tgctcagctc ccatgattct gacctcctgt gttttgcagg
2101 caaatcttcc atgccctctc ctactatttc ccagaattct ctctattcct gctggatgtc
2161 ccacctactt cctgcatcag ctcattggcc ataagctttt ttattgacag gtgatactta
2221 acacatatca cttccaggaa tatctgttca ccactgagaa gatgcagggg cccagtcact
2281 gcactcagtt ctgtagtgag tgtacaatgt gcatgagtgt ggatgagaga gcattgctca
2341 gaccacagga aagggtgcaa accttcagtt tgaggttttc actttagagg aaagcttagt
2401 cagtttcctg aggaagtcac accctttgga acctcagccc caagacttaa gtttctcgtt
2461 accaccttac tggtttggat tctcttctct tgcctgatgc cctgcatgcc ccacagagat
2521 agagagaacc tgaggtctca gagatgtggc agttttgcat tctgtgc
//
LOCUS R751TRA 578 bp ds-DNA BCT 26-JUL-1990
DEFINITION Plasmid R751 traJ and traK genes, 5'end.
ACCESSION M25422
KEYWORDS inverted repeat; transfer origin region.
SOURCE Plasmid R751 (strain HB101, Inc P-beta) DNA.
ORGANISM Plasmid R751
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 578)
AUTHORS Lanka,E. and Euerste,J.P.
TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of
plasmid-encoded products with the transfer origin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
E.Lanka, 17-JUL-1989.
FEATURES from to/span description
pept 48 < 1 (c) traJ protein
pept 403 > 578 traK protein
signal 243 211 promoter PL
signal 266 294 promoter PR
rpt 49 64 inverted repeat
rpt 118 157 inverted repeat
rpt 296 331 inverted repeat
BASE COUNT 141 a 168 c 163 g 106 t
ORIGIN
1 cggccgtgtt ccttttcgtc gttctccatg cctcgcctcg tctctcatgc cggcggtagc
61 cggctgcctc gcagagcagg atgacccgtt gagcgccccc ggcgcgaata agggacagtg
121 aagatagata accggctcgc cggttagcta acttcacaca tcctgcccgc cttacggcgt
181 taataacacc aaggaaagtc tacaccagcc attacgattt atccgcaact atcgcgctat
241 caggccgcaa aagcagcaac ggatatagcg aaacccgcca caatggccca taatgccgct
301 atcgaagcgt gccaatgcac gccgatagcg gactttttgc gtttccgtag cgccgcttag
361 tagcgttaca tttgcgatga gaggattaga tggacgaaca cgatgccaaa gacctacccc
421 gaagagctgg ctgaatgggt gaagggacgg gaagccaaga agccgcgcca ggacaagcac
481 gtggtcgcgt tcctggccgt caagagcgac gttcaagcgg cgctcgatgc gggctatgcg
541 atgaaaacga tctgggagca catgaaggaa accggccg
//
LOCUS RP4TRAB 571 bp ds-DNA BCT 26-JUL-1990
DEFINITION Plasmid RP4 traJ and traK genes, 5' end.
ACCESSION M25423
KEYWORDS inerted repeat; transfer origin region.
SOURCE Plasmid RP4 (strain HB101, IncP-alpha) DNA.
ORGANISM Plasmid RP4
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 571)
AUTHORS Lanka,E. and Euerste,J.P.
TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of
plasmid-encoded products with the transfer origin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
E.Lanka, 17-JUL-1989.
FEATURES from to/span description
pept 26 < 1 (c) traJ protein
pept 394 > 571 traK protein
rpt 48 63 inverted repeat
rpt 118 157 inverted repeat
signal 219 192 promoter PL
rpt 281 318 inverted repeat
signal 246 272 inverted repeat
BASE COUNT 135 a 148 c 181 g 107 t
ORIGIN
1 ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc ggcggtagcc
61 ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata agggacagtg
121 aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg ctgacgccgt
181 tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata tcgtgcgaaa
241 aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta tgcagcggaa
301 aagcgctgct tccctgctgt tttgtggaat atctaccgac tggaaacagg caaatgcagg
361 aaattactga actgagggga caggcgagag acgatgccaa agagctacac cgacgagctg
421 gccgagtggg ttgaatcccg cgcggccaag aagcgccggc gtgatgaggc tgcggttgcg
481 ttcctggcgg tgagggcgga tgtcgaggcg gcgttagcgt ccggctatgc gctcgtcacc
541 atttgggagc acatgcggga aacggggaag g
//
LOCUS STAREPEBR 2389 bp ds-DNA BCT 26-JUL-1990
DEFINITION S.aureus ethidium resistance (ebr) and replication protein (repA)
genes, complete cds.
ACCESSION M33479
KEYWORDS ethidium resistance protein; replication protein.
SOURCE S.aureus plasmid DNA.
ORGANISM Staphylococcus aureus
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Micrococcaceae.
REFERENCE 1 (bases 1 to 2389)
AUTHORS Liao,J., C,-H., Moghazeh,S.L. and Projan,S.J.
TITLE Genetic mapping and nucleotide sequence of pWBG32, an ethidium
bromide resistance plasmid naturally occurring in Staphylococcus
aureus
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.J.Projan, 30-MAR-1990. Public Health Res Inst, 455 First
Avenue, RM 1166, New York, NY 10016
FEATURES from to/span description
pept 1153 1476 ethidium resistance protein (ebr)
BASE COUNT 796 a 403 c 290 g 900 t
ORIGIN
1 ggtcaatatc tttaagataa tctaaatcgc cattttttaa tttatttctt gcgtctttaa
61 ataatccaga ataaacaaga atttgtttcc ctttaagaga tttataaaat gcgtcgaaca
121 ctttctgatt aattaaatag tcactatcct taccagaata tttagccatt tcatataatt
181 ctttattgct attttgctta attttttgaa catgaacttg cgtaatttca gaaattcctg
241 ttacatctcg ccataaattt aaccattctt tttgactaat ataagctttt gtatctttaa
301 aatatgattt attaacggcc atcaaaacat gaaaatgcgg attataatca tcacgctttg
361 agttatacgt tatctctaat tttcttacat aacctttagt gatcgcattt acttttttgc
421 gtttaaacat cttttgaaag gcatgattat aattcttaat ttcactttct aaatgctcat
481 ctgtaacgtt tggtgtcgta agtgtcaaaa agataaattg cttatcttct tcttgcttaa
541 tatattgcat cattaacgat aatcctaatg catcttttct tgctttacgc cacgcacata
601 ccggacaaaa tcgattctta caaggattcg atttatataa tttctttttt tcaaattttt
661 tatccgtcac aaaagacaaa aatgtattac aatttttaac caaatccatt tgatctcccc
721 gatatgacgt tcaataaaat ttttaaatac ttgatttctt tgctttttct cagtatactt
781 ttccatacga taatacacaa aaacaactta gttttctcaa aaactatgca taaaaaagtt
841 gcttttttct ccttttcttt ttttttcgtt tggattagac acctaaaacg atacaatagt
901 atgctagaaa aagcaacttt ttttgtgctt caaaccagtt ataccaatga attgaaaggg
961 ttatacatcg ccgggaatag ttacccttat tatcaagaca agaagaaact cgttttcaac
1021 tcgtttcaaa aacctttcaa aaaccatcaa tccacaaaaa taccacgcga atgacactca
1081 aaatacaaga ctacaattaa aaaatactta gaataaaatt aaataaaata cgaaaattaa
1141 aaggagttaa aaatgcctta tatttattta ataatagcca taagtactga agttattgga
1201 agtgcatttc ttaaatcttc agaaggcttt tcaaaattta taccatcctt aggaacaata
1261 atttcatttg gaatttgttt ctatttttta agtaaaacaa tgcaacacct accactaaat
1321 ataacttatg caacttgggc gggactaggt ttagtcttaa caaccgtagt ctcaataatt
1381 attttcaaag aacaaataaa tctaataact atagtatcta tagttttaat catagtcggc
1441 gtagtttcgt taaacatttt cggaacatcg cattaattgc tttattccaa ttgctttatt
1501 gacgttgagc ctcggaaccc ttaacaatcc caaaacttgt cgaatggtcg gcttaatagc
1561 tcacgctatg ccgacattcg tctgcaagtt tagttaaggg ttcttctcaa catcaataaa
1621 ttttctcggc ataaatgcca tgctataata gatacacgtc ttctcttagc gtttcatagt
1681 attatcctcg tttattatac ttataattat aggggaaggc ttagagctat cattttgata
1741 gctctttatt tttgttcaaa catttattca aaatcagaat gcctttattt tttaatttta
1801 aggggtattt tgaagaatta agggttattt atatagtttt atacctaaaa acttatatcg
1861 gctcttaaaa cgcaaataag agccgaataa aaataattgc ttttcacaaa caaaaatttg
1921 agcaaaacca gtgttgaatt ttttagacac tgcccatcta catgcaaatt taaaaattgg
1981 cataaaaaat gggcaaccat gctggttgaa cgctatagtt cctgcagggg caaaaaagca
2041 taaaaaaacg ctagctttga tgagctaacg ttagttataa aattcagtaa tatgcttttg
2101 taattcaata gattctcttt cttttttagc ttgtcttttt ttaaaacctt ctgaatttct
2161 agaagcctta tatatatcca ttattttttt ataatcaatg tcgtaaccat atttttgtaa
2221 ctcttctaca aaaaacttat cgcaatttaa tatcattttt cttcctcgat ttcgtttatc
2281 atttgatgat ttattttttc tttttcttgt tcagttaaat cataaatttc acttgctaag
2341 tattcttttt gattccaaat ataaaaaatt tgataaatat attcagtcg
//
LOCUS XANAVR 2100 bp ds-DNA BCT 26-JUL-1990
DEFINITION X.campestris avirulence protein (avrBs1) gene, complete cds.
ACCESSION M32142 J03672
KEYWORDS avirulence protein.
SOURCE X.campestris (strain E3, race 2, pv. vesicatoria) DNA.
ORGANISM Xanthomonas campestris
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 2100)
AUTHORS Ronald,P.C. and Staskawicz,B.J.
TITLE The avirulence gene avrBs-1 from Xanthomonas campestris pv.
vesicatoria encodes a 50-kD protein
JOURNAL Mol. Plant Microb. Interact. 1, 191-198 (1988)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
P.Ronald, 15-FEB-1990.
FEATURES from to/span description
pept 308 622 ORF1
pept 713 2050 ORF2
BASE COUNT 656 a 423 c 505 g 516 t
ORIGIN
1 ccattgtcgg cggttatccg ggtacttggc gtacaccaaa caactggggc aatgctggca
61 aatcacgtga cgaagccttg gcagacgagc aacagaggat tcaagcgctt aaatcgcaag
121 agacggtaca tatcttccat cgcaaagatg tcaagagcga acccgcaacc cacgcggggc
181 gacgttaagt aagccactga tttttagcga agaagagctt gtgagagctg cgggcgccaa
241 atatgtacgt ttgacagtga cagatcatct ttcaccacgg gcggacgata ttgatgcgtt
301 tattgcaatg gagcgggaga tggcccatga tgagagactg catgtacatt gtggtatggg
361 cctaggccgt acgacaatat ttattgtcat gcatgacata ctaagaaatg ctgcaatgtt
421 atcgtttgat gatatcatcg aacggcaacg taaatttaat ccagggcgaa gcttggataa
481 taataaagac gtttctgaca aggggcgctc agaatttcgt aatgaacggt cagagttcct
541 tcctctattc tacgagtacg ccaagcaaaa tccaaagggc cagccattgt tatggtccga
601 atggctcgac cacaatgcat aaatcgcaag tacattttcg gctatgacgg acttgtgctc
661 gatgcgctgg cggctttctc gataaatatc aattaatata aatatcgaac taatgtccga
721 catgaaagtt aatttctctt caaaaataat agattcaaca cccagtgaag aggaggtcgc
781 cactcagcaa gatagttata cgaaatctgg actggtggcg ccatcgctcg attcacaagc
841 cttgaaaaaa gcacctagaa aaagagtaat aaaagaaaat atagctgctt tgcacacctc
901 atcgttagag cgagttcatc aaaagaaggt attagttcag aatttagcgc agttgcagag
961 agggttggct aagataaatg gtagagtcga actcgaagag ctaattgatg gattttcagt
1021 caaggaattg ctaataaaaa gaaatccaaa gattgctgaa gagtatggag aaggaaatcc
1081 tttaatgatt cgatctctaa gattttcaaa cccccaagag gtgactagta agcttggggc
1141 ggaaggaaaa acgccagcca aaagagaggt tgatacgatt tgcaataaat ccacgctgca
1201 tgacattgtc atgacgcccg cctcccttgt aaaaaaggaa gtgcggatga acctgatatc
1261 tgaagtccca agggcgaagg ataaacaaaa atacagaggt cttccttcag tcgtatatgg
1321 ccaaagcagc cgccgtagtg aatcagacta tctaacgtct cgaaatggtt tcggcgacgt
1381 gcactctttg aaatccaata acgcatttaa ttccgactac gaaaaaatat gtgggtcgct
1441 tagccatgcc gaaaagttgg ggttaattga aaggaatctt actcccttta taaggcatga
1501 tccagataga atctccaccg actttgttca ctctattgaa gaattggctg aacaccagat
1561 gctattgcaa tcaagaaaac ctgccagtgc tttgcggcat aatgaatatt gcaccaagct
1621 tgaactgtgg gatgctaaag ctatagcagt tggtgaatct cgtgccttgg cggtcgctac
1681 cctgattgaa tttaatttgg agatgttgtc gatagcacaa gagatagatg atgatgggca
1741 caagagtaaa atggtcgccg attttatcga gcgccaacta tcatggcttg gcccacaaac
1801 cgcacttgac agcaagtcaa cgcttgaaag ggtttcagcg gtgaccatac aagaaaggga
1861 atttatcgct aatgagatta gccgatcgtt gcgtcaaggt gtttcacttt gcacttacga
1921 taaagatgaa gcaggaagtc atatccgtga aatgagtttg ttggatttta gggttgaaga
1981 aatcatagag gggataagta tttttatttc ctccaagctt ttacatgtta caaatgcagg
2041 agaagcgtaa gagaagaagt atccgccaca atcgtgcgac ggaccgacgt cctaacgccc
//
LOCUS YSCSCD25 5055 bp ds-DNA PLN 26-JUL-1990
DEFINITION S.cerevisiae SCD25 gene, complete cds.
ACCESSION M26647 M31771
KEYWORDS Ras protein; SCD25 gene; cell division cycle.
SOURCE S.cerevisiae (strain OL136) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 2129 to 5055)
AUTHORS Boy-Marcotte,E., Damak,F., Camonis,J., Garreau,H. and Jacquet,M.
TITLE The C-terminal part of a gene partially homologous to CDC25 gene
suppresses the CDC25-5 mutation in Saccharomyces cerevisiae
JOURNAL Gene 77, 21-30 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 3880)
AUTHORS Damak,F., Boy-Marcotte,E., Le-Roscouet,D., Guilbaud,R. and
Jacquet,M.
TITLE SCD25, a CDC25 like gene, which contains a RAS activating domain is
a dispensable gene of Saccharomyces cerevisiae
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by E.Boy-Marcotte, 02-AUG-1989, for [2] by F.Damak,
01-FEB-1990. Laboratoire IGD, Groupe des laboratoires de biologie
cellulaire, Centre universitaire d'Orsay, 91405-Orsay Cedex
FEATURES from to/span description
pept 128 3880 SCD25 protein
pept 4319 > 5055 ORF X
BASE COUNT 1638 a 973 c 900 g 1544 t
ORIGIN
1 ctgcaggctc gcaaaattta aggttccctt ctacaatagt agtcaaaatt gcttttttgc
61 atataacaaa gtgaaaaaaa aaaatatgag agacatatct aaaagacata tataatctgc
121 caccataatg agttgcactg cgtcatatgc cggcatgaca actccggtga aagataagga
181 aggccacggg attccatgct tacaacctat cgatgtagtg gaatgtacct atcaatattt
241 tacaaaatca cggaataaac tgtctttaag ggtaggcgat ttgatttacg tactcactaa
301 aggttctaat ggctggtggg atggtgttct tatcagacac agcgctaata ataataataa
361 taattcgttg atactagaca gaggttggtt ccccccttct tttacacggt ccattctaaa
421 cgaactacac ggggtgcctg acatcggtaa tgaattggaa atatttcaag cgggtcttaa
481 tcttaaactg gaattatcaa gcaacccagt gatcttatca ttggaagact ttttagactg
541 ctgtcgcgat attgaattca aggaacaact ggcttggtca cctactcccg tccacgaaag
601 gaaaggctgc tgtgagctgc tgtactataa ccaggattta gatgtttatt gtcgcacgtt
661 accatattta ccacaaaatc aagttgaaac cgtgaacgac tattcgtctt ttcctgcaat
721 atcgaagatt gctggtaaaa agatgcctat aacgtcaagc cccgatctgt tctatctcaa
781 tgattgtgat gtcgtctatt ggtatgacct cactcgctta gtgtgtcatt atgttaattt
841 aacagagcgc gacctattgg caaatgaacg ggaaaagttt ctaacttcct tggatttatt
901 aacagctcaa ataacctatg tttatatgct tttcaggaat ctccgtttag ttgaagatag
961 tttcaaaaaa accctcaaaa aactaattta caccttgtct aggttttcaa taaatgcaaa
1021 tatttggttt cattccacat cgtttgaaga aagagaagcc atagcctccc agaaggatcc
1081 agaaagaaga tcccctcttc tacagtcaat cctaggaacc ttccaaaaat ttcattttct
1141 actgcgtcta ctacatttcc tctcaaatcc taacgaactt acaatactgc ctcaattgac
1201 tcctcgattt ttcaaggatt ctttcaatac aatttcatgg aataacccgt ttttgcgtac
1261 agtcttcaac cagcatatgt ccatgacctt accgagacag atgattaaag ccgttgctgg
1321 cgcttcagga attgttgcgg aaaatattga tgaaattcca gcttccaaac agggcacttt
1381 catctcgtca gaaacgtctc accattcacc atcagccccg tttcaaagaa ggagaagagg
1441 taccattttc tctaatgtgt caggaagttc cgatgagtct gacaccatat ggtccaaaag
1501 gaaaaaacca tacccgctaa atgaagaaac tctaagcctt gtaagggcca ggaagaagca
1561 gcttgatggt aaactaaaac aaatgatcaa aagtgctaat gaatatctca gtaacacggc
1621 taatttcaaa atgttgaatt ttgaaatgaa cttcaaaacc tacgaagaag taagcggaac
1681 aattcctata attgatattc tggaaaacct agatttaact atttttctaa acttgagaga
1741 gttgggagat gagaatagag tttttgacga agatgtcttt gacgaagatg tcgctattgg
1801 tgatgaagat aaagagtttt tgaaacactc tttatcatcc ctatcgtata tcttatccga
1861 ctattttaat atgaagcaat attttcatga attgtcgccc acgcatttga cattagagga
1921 tcctttcgtt ttctcgccaa tgcaaaacga cttgcctacc ggttattatg aaccaatgaa
1981 accttcatcc ttgaatttag ataatgccaa ggataagaag aatgggagcc aaaatactga
2041 tatccaagag gaggaagatg aatatgagcc agacccggat agtcttattc tcttccacaa
2101 cctcatcaat caagattctg atttcaatga tctaaagttt tttaatctcg cccacgtttt
2161 taaaaaatcc tgtgatgact attttgatgt gcttaaacta gccattgagt tcgtgaatca
2221 attaattcta gaaagagaga atttgttaaa ttatgctgct agaatgatga aaaacaatat
2281 cacggaattg ctattgcgcg gggaagaagg ctatgggtcc tatgacggcg gtgaaactga
2341 aaaaagtgac acgaatgctg tttatgcaga ttcagatact aaagacaatg acgaatggcg
2401 tgacagccaa gtcaaattac cgaggtattt gcagcgcgag tatgacagtg aactgatttg
2461 gggctctaac aataggatta aaggtggttc taaacacgca ctgatctctt acttgacaga
2521 taatgaaaag aaggacctat ttttcaatat tactttttta atcactttca gaagcatctt
2581 tactacaacg gagtttttaa gctacttgat ctcgcaatat aatttggatc caccagagga
2641 tttgtgcttt gaagaataca atgaatgggt gacgaaaaag cttataccgg ttaaatgtag
2701 ggtggttgag attatgacaa cctttttcaa gcaatattgg ttcccgggct atgatgagcc
2761 cgatcttgcg accctaaatc tggattattt tgcgcaagta gcaatcaagg aaaatataac
2821 aggatctgtg gaattactaa aggaggtcaa tcagaagttt aaactaggta atatacaaga
2881 agcgactgca ccaatgaaaa cgttagatca acagatctgc caggaccatt actcgggcac
2941 tttatactct accacggaat ccattttggc cgtcgatcca gttttatttg ccactcaatt
3001 aacgatacta gagcatgaaa tttattgtga gataaccact tttgattgtt tgcaaaaaat
3061 ttggaagaac aagtatacaa aatcgtatgg ggcttcaccg ggtttgaacg agtttatcag
3121 ttttgccaat aaactgacaa atttcatatc ctactctgtt gtaaaggagg ctgataaaag
3181 taagcgcgcc aagctactct ctcattttat ttttatcgca gaatattgta ggaaattcaa
3241 taacttttct tccatgactg acatcatttc agcattatat tcttcaccaa tttatcgttt
3301 agagaaaacc tggcaggcag ttattcctca aacgagagat ctattgcagt cactgaacaa
3361 gttgatggat cccaagaaaa atttcataaa ttacagaaac gagctgaagt ctttacatag
3421 cgctccctgc gtaccgtttt tcggcgttta tttatctgat ctaaccttta ctgattccgg
3481 aaatccggat tatcttgtct tggaacatgg tttaaagggt gtccatgatg agaagaaata
3541 tataaacttc aacaaaagga gcagacttgt tgatatctta caagagatca tatatttcaa
3601 gaaaacacat tatgatttca ctaaagatcg gacggtaatt gaatgtatat caaattcatt
3661 ggaaaacatc ccccatattg agaaacaata ccaattatca ttaattattg aaccaaaacc
3721 aagaaagaaa gtcgttccga attccaattc gaataataaa tcacaagaaa aatccaggga
3781 tgaccaaacc gatgaaggaa aaacatccac taagaaagac agatttccaa aatttcaatt
3841 acataagaca aagaaaaaag ctcccaaggt ttctaagtaa cggcgccgta tgttcgattt
3901 ccttctctcg gtggattaat tattttgttt gttttctcct gttatattat ttattgatca
3961 ctatagtaaa ctatgtccgt catcaagccc gacggctgct atcccacaat gttgatcgta
4021 ttgtttgcct agtttattat atatttgctt atttatagca taccataata tttaaatgcc
4081 ctcaaatttt tggccgtagc gacatcgcga taattccaat tccctttaaa aaattgcgcc
4141 tgagtataag ttaattcagc cagttctcca aattaaaatc gcatactcct gaacctatca
4201 acagattgtc ctcgcatact tttctatacc aaggtctctt ctgaacatat attagcagtg
4261 gttaatttta aagagatcat aaagaaaatt ttgtctaaaa aagattaata taaagacaat
4321 gtcttcacta gaagtggtag atgggtgccc ctatggatac cgaccatatc cagatagtgg
4381 cacaaatgca ttaaatccat gttttatatc agtaatatcc gcctggcaag ccgtcttttt
4441 cctattgatt ggtagctatc aattgtggaa actttataag aacaataaag taccacccag
4501 atttaagaac tttcctacat taccaagtaa aatcaacagt cgacatctaa cgcatttgac
4561 caatgtttgc tttcagtcca cgcttataat ttgtgaactg gccttggtat cccaatctag
4621 cgatagggtt tatccattta tactaaagaa ggctctgtac ttgaatctcc ttttcaattt
4681 gggtatttct ctccctactc aatacttagc ttattttaaa agtacatttt caatgggcaa
4741 ccagcttttc tattacatgt ttcaaattct tctacagctc ttcttgatat tgcagaggta
4801 ctatcatggt tctagtaacg aaaggcttac tgttattagc ggacaaactg ctatgatttt
4861 agaagtgctc cttcttttca attctgtggc aatttttatt tatgatctat gcatttttga
4921 gccaattaac gaattatctg aatactacaa gaaaaatggg tggtatcccc ccgttcatgt
4981 actatcctat attacattta tctggatgaa caaactgatt gtggaaactt accgtaacaa
5041 gaaaatcaaa gatct
//
LOCUS ADBMLPA 101 bp ds-DNA VRL 26-JUL-1990
DEFINITION Mastadenovirus 2 R1, R2 and R3 binding sites.
ACCESSION M33540
KEYWORDS .
SOURCE Mastadenovirus 2 viral DNA.
ORGANISM Mastadenovirus 2
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 101)
AUTHORS Leong,K., Lee,W. and Berk,A.J.
TITLE High-level transcription from the adenovirus major late promoter
requires downstream binding sites for late-phase-specific factors
JOURNAL J. Virol. 64, 51-60 (1990)
STANDARD simple staff_review
COMMENT Sequence-specific binding proteins are induced during the late
phase of infection. These proteins interact with three regions in
the first intron of the major late promoter (MLP).
BASE COUNT 24 a 25 c 26 g 26 t
ORIGIN
1 ccagctgttg gggtgagtac tccctctcaa aagcgggcat gacttctgcg ctaagattgt
61 cagtttccaa aaacgaggag gatttgatat tcacctggcc c
//
LOCUS LB3HDCBA 804 bp ds-DNA BCT 26-JUL-1990
DEFINITION Lactobacillus 30a histidine decarboxylase-B (hdcB) gene,
complete cds.
ACCESSION X13099
KEYWORDS histidine decarboxylase.
SOURCE Lactobacillus 30a DNA.
ORGANISM Lactobacillus 30a
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 804)
AUTHORS Copeland,W.C., Domena,J.D. and Robertus,J.D.
TITLE The molecular cloning, sequence and expression of the hdcB gene
from Lactobacillus 30a
JOURNAL Gene 85, 259-265 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 85 609 histidine decarboxylase-B (hdcB)
BASE COUNT 277 a 140 c 152 g 235 t
ORIGIN
1 actaatccac aggacatagt ttgaggaaga gatggtgttt actacctctt cctttaatat
61 tttgtaagtt aaggattgat tgcaatgagc aacagtaact accaagttag tttagaacga
121 attaaaaaag ttgtccctga agaactctta accaatgcat tgttagcagc tattgacaat
181 tctggtgaaa ggatgtcaca aataatagtc gataaaaaag ataacggcaa cgactattac
241 ctcaccatcc atagattctt cgtttatagc aacgaagaat tcaccgcttt tgataaagaa
301 gatgttgcag atgtcgaatt cgttaatggt acgccagatg gtgaagtaat cattacttta
361 aaggacggca aagtgttgca cccgtctcac atttgttacg gccgagcttt tgactttatc
421 caagatgtca agccaaaagt aattacaatg gcgggatatg acagcacaat tcgaggcgaa
481 tttccacaat tattagatcc agatcatgcg gaagagattg atcgattacg tcgctggatg
541 caagatggaa atattagcca ttacgaatac gatgatgcaa atccagctta tccaaaagca
601 ggaaaataaa aaaacatatt gacatatcat cagatatagg ttatgttaca atcaagcatc
661 ttaataggta atgcgcaatt tatatctttg aatatagttc cattatttat ttataaatag
721 ttactccgaa aaggactacg tacctactat acttttaaat aaatatattt cgtgatgggg
781 agcgttatta ccccggctgt cgac
//
LOCUS LBPREPA 3547 bp ds-DNA BCT 26-JUL-1990
DEFINITION L.plantarum repA, repB and repC genes, complete cds.
ACCESSION M33531
KEYWORDS rep protein.
SOURCE L.plantarum DNA.
ORGANISM Lactobacillus plantarum
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 3547)
AUTHORS Bates,E.E.M. and Gilbert,H.J.
TITLE Characterization of a cryptic plasmid from Lactobacillus plantarum
JOURNAL Gene 85, 253-258 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 2191 2349 repA protein
pept 2406 3062 repB protein
pept 570 1655 repC protein
BASE COUNT 1189 a 589 c 758 g 1011 t
ORIGIN
1 gatatctggt taactttgat cacattagtg atcaaattca tttctttagc cccatcaaac
61 gatcagtttg ctttatgaaa gtgaccgctt gatggggctt tttcgtttac cttttgtcaa
121 aggtaaggtg tgacgggctt gactttgggt ggcgttgtgc ggaagcgcaa tcgacacgat
181 tttgactttg aggggagtta agaggggaag cgtagcgccc cttcttacaa gtgtaaagtg
241 tggacaagag agcgtagcga tattgtctac actttacccc aattgtcatg cgactttaaa
301 tagaattatt gattaataaa agccccctga caaaagtcga agggggactt ttattttagt
361 ttgaggtttg catacctact taaaaaagta gggcagcaaa acgtcaaaca ggtatcagct
421 aatcatccga tagggtgcgc tgatacggtc ctcaaaagag agccgacaga gccgtctgca
481 agacccctcg gcggaggccc acctttacga agtaagatat agtgggttat actttacttg
541 gaagataact ccgaaatgag gtgcatacaa tgagttttgc agtggctaga atgacgaaat
601 taaaagctga taatttagtc ggcattggca atcatgacca acggaaaacg actaatcaca
661 gcaacgaaga tattgatgtt tcccgctctc acctgaatta tgatttagtg gctgggcgca
721 ctgataactt taaaacggat attgaagcct atatcaacga aaacaaagcg agtaagcggg
781 cagttcgcaa agacgctgtt ttagtcaatg agtggattat aaccagtgac aaagactttt
841 ttgagcaatt agacgaagcc gaaacccgta aatattttga aacagccaaa caatattttg
901 cagataacta tggtgacgaa aatattcgct atgcagttgt tcatatggac gagaagaccc
961 ctcacatgca tatgggcatt gtgccctttg atgatgataa aaagctctca gctaagcgta
1021 tattcaatcg tgaagcctta cagcacattc aagaggaatt accacagtac ctcaaagaaa
1081 atggctttga tgttcaacgt ggtaacaaaa ataaagagcg taagaattta tcagtacccg
1141 aatacaaagc tatgcgggaa gaattgaaaa aaatagagac cgaaaaacaa gagacacaag
1201 caaagcttgc agatacaaaa aaacagcttg atgagatcaa accacgggat accaagaaaa
1261 ttgctagtaa acccaccttg atgaataaaa ataaagtcac ggttgataaa tctgatctcg
1321 ctgatttgga acaaagggcg gtgactagcg acgcttataa ctttgaaaaa attcatctgg
1381 aagtaggaaa tcatagttta cgtaatgatt tgagtgaagc caagggccgc aactatgaac
1441 tgagaaaaga aaatgagcga ttgcaaaaac tagtaggaac gcttcaaggc attatacgaa
1501 atgttgatga gtttctacac aaaaaactag gtattaattt acctgaaaag tggctagagc
1561 gtgcaggact aaaagaaccg tctaaaaaag cccctgaaag ctcacaggaa ctcgacagac
1621 ataaatctga tgaattaggc ggtccacatc tttaaatcgc ttatacgagc ttaaaatggc
1681 gtttaagagc ttaatttacc atctcgctag attgaacgta gttaactttg tgtccgtcaa
1741 cggtaaatcg acgtaggcgt tttatagccg ctgggctatt agacgcccta ggaggcttta
1801 aggagttgat agactagcgg ataaaacact tttgcacatg caaagaaaag cacccctgct
1861 ttttttgcct gccccacggc gagtgcgggg tgagtttagc gggtgctccc gtcatttatg
1921 gggtcaagct gacacagctt gcgggtttgg gcagagccca tattttggtt tggtttgagt
1981 gggataaaaa aattgggcga aaaacatggg ggtactacga caccccccca tgtgtccatt
2041 gtccattaaa cagaacactt ttttcaagaa accttttagg ttaggggttt tcgggggggt
2101 ttgagatttt ataaaaaatg ttgtatttct aacgtatgta taatataatg atggaataga
2161 gataaaaata gtaagaaaga aggttttttg atggttgaag ttgaaaagaa aaaaattact
2221 ttgtctatac ctgttgaaac taatggaaag ctggaagaat tggcccagaa atatggcatg
2281 actaaatctg gattggttaa ttttttggtt aatcaggttg cagaagctgg aactatttat
2341 aggcaataaa aaaagcgccc tgtgcatagg acgcaatcta aaagtctgtg aggtaattat
2401 aacatatgaa aagtgaatct aaaatcgatt ggacggtacc tcgtccaaat aaaaatccca
2461 aaacaaaaca gccttataaa cgtggtcgta attggggtat tgttgtttat cctgaaagtc
2521 ttcctgaaaa ttggaaagat attatcaggc aagagcctat tgctgtcagt cccttacatg
2581 ataaagatgt taaccctgat ggagaaaaga aaaaatctca ctatcatctt gttttgaact
2641 ataaagggaa caaatctttt gaacaaattg atgaaattgc taggtcttta agggcgcctg
2701 ctcctcaaag aattagtagt ttaactggcg ctgttaggta cttgacacat atggataatc
2761 ctgaaaaata tcagtatgat aatgctgata ttgagacctt tggaggcttt gatttagaga
2821 gttgcttagc tctttctact ggcgataagc gccaagcctt acgtgacatg ttggctttta
2881 tttctgaaaa tgaaattatg catttaaaag actttgcaga ttattgcatg tctgaggaag
2941 caccagctgg ctggttcgaa cttctaactg aaaggaatac gctttttatt aaagaatata
3001 tcaagtcaaa ttggcagaaa caacagtatg ctagtaaaaa catcaataaa atgtcggatt
3061 aaaattttat tgatgttgtt gctatattat tagtgaaagg atggtttact ttatgccaac
3121 aagaaaaaat attttagatg atattcaaga acatattgac aatgaagaac gtgttttggt
3181 tactaattca agcaaaatta actagcacca cgcgtataga gtgatttaaa ataactaaca
3241 tcgtttttat ttgaatttag aagggaagag atttttatta aaaatatagg ttttaactca
3301 aattatttta aaacctggta tttttggcta ggcatattaa cggtagtggg attaatcggt
3361 gatcccattt tacactatca ttcttcaact agtccgtggt tacaaatact tattgctatt
3421 ttattatttg tagcagcatt taccaaaaaa ataaataata actgacttaa atcgcaattc
3481 actctaaact tttaacaaat ttgttatcat aattgggtaa ggtgtttgca agttaagtat
3541 ttttccc
//
LOCUS RATUD2A01 1088 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 1.
ACCESSION M35202 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 1 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1088)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP-glucuronosyltransferase, UDPGTr-2, gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pept 349 + 1072 UDP glucuronosyltransferase-2 (UDPGTr-2)
pre-msg 315 > 1088 UDP glucuronosyltransferase-2 mRNA and intron
IVS 1073 > 1088 UDP glucuronosyltransferase-2 intron A
signal 288 293 CAT box
BASE COUNT 329 a 194 c 216 g 349 t
ORIGIN
1 ctgcagtcaa cggatcttca ctgctatgta agaacattta agaaataaga gctttcatct
61 gtgattttta catgactcta acacgttata atcaacagat gatgtttgca catgagaagt
121 gattcaattt tggctgaata gaatcaggga caaaaaagac aaataaactc tgttaacctt
181 gagctcatgt tccatgcttg tatttacaca tggcgtaaca tcattgcact catctaatcg
241 gtgatggttt aaaagttata tattaatttc ttgggtgact gaactttcat aaaaaacatg
301 aatatctaca atgaacgaca gatatcaaaa gcattccatt tctgcaagat gtctatgaaa
361 cagacttcag tgtttctgtt gatacagctc atatgctact ttagacctgg agcctgtgga
421 aaagtgctag tgtggcccac agaatacagc cactggatta atataaagat aattctgaat
481 gaacttgccc agagaggtca tgaagtcacg gttcttgtat cttcggcttc cattctcatt
541 gagcctacca aggaatcttc tattaatttt gagatttact ctgtaccttt gagtaaaagt
601 gatcttgaat atagttttgc aaaatggata gatgaatgga cacgtgattt tgaaacactc
661 tcgatttgga catattattc aaaaatgcaa aaagtcttca atgaatattc tgatgtcgtt
721 gaaaatttat gcaaagcact catttggaac aagagtctta tgaaaaaact ccaaggatct
781 caatttgatg tcattctcgc agatgctgtg ggtccctgtg gtgagctgct agcagaactg
841 cttaagacac ctttagtgta cagtctccgc ttctgtcctg gatacagatg tgaaaagttc
901 agtgggggac ttccactgcc tccttcctat gtgcctgttg ttctttcaga attaagtgac
961 cgcatgacat ttgtggaaag agtgaagaat atgttgcaga tgctgtattt tgacttttgg
1021 tttcaaccat ttaaagagaa gtcctggagt cagttttaca gtgatgttct aggtaaactg
1081 tgcctttc
//
LOCUS RATUD2A02 373 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron A.
ACCESSION M35078 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 2 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 373)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pre-msg < 1 > 373 UDP glucuronosyltransferase-2 mRNA and intron
IVS < 1 > 373 UDP glucuronosyltransferase-2 intron A
BASE COUNT 137 a 75 c 56 g 105 t
ORIGIN About 0.1 kb after segment 1.
1 aaaatgctat agagtaactg agcagaacac tccaaaaatt actatccatg taaactgaga
61 caaagatttc tcttagtaat cactagatct actctaagtt tgtcttagta aaagaaactc
121 caagtttctc gaatgcttta atgactgtag atgcgaacac taaagagtca ttatatacca
181 ccacaactat ctgtgtagca cagaaggaaa catgttccct tatacaaatt actcacttgc
241 aaatgatgaa aaaactccaa ggagctaagt ttgatgttat cacctagaat atcacgacag
301 gttttctcac aattaaatca tatcactaga accagaaaca gtcaaggcat cttagtttct
361 tcgagttcag ctg
//
LOCUS RATUD2A03 380 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron A.
ACCESSION M35079 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 3 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 380)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pre-msg < 1 > 380 UDP glucuronosyltransferase-2 mRNA and intron
IVS < 1 > 380 UDP glucuronosyltransferase-2 intron A
BASE COUNT 154 a 49 c 55 g 122 t
ORIGIN About 0.1 kb after segment 2.
1 tcaaataaaa tagtacctaa attaatagga gaaagaattt aaaggttaac tatttgtgga
61 aatatccagg tgtaactttg acatatacaa ctaagttagt attacttgtc tcttctaata
121 ggcacagcac agtagtgata aaaagaaact tagtcataaa ctgcagatta tcacagtgca
181 tttcaagaat cagaaatcaa aagaatagct actaaaatgt ataaagtaga tgaaatattc
241 tacaaaagtt gatttttcta aggcattttc aagctttttt gcaaggaaca aatgttccaa
301 attcattggt gtaactttag aaaacatgta attgacaaca ttgatattat gttatacatt
361 atatcataat caaatgactt
//
LOCUS RATUD2A04 1435 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 2.
ACCESSION M35080 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 4 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1435)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pre-msg < 1 > 1435 UDP glucuronosyltransferase-2 mRNA and intron
pept + 51 + 199 UDP glucuronosyltransferase-2, exon 2
IVS < 1 50 UDP glucuronosyltransferase-2 intron A
IVS 200 > 1435 UDP glucuronosyltransferase-2 intron B
BASE COUNT 375 a 301 c 268 g 485 t 6 others
ORIGIN About 0.2 kb after segment 3.
1 ccacaaaacc tcttttcacc attgagtatt tttatctgtt ttggatgcag gtagacccac
61 aacattaact gagatgatgg ggaaggcaga tatatggctc attcgaacct tctgggactt
121 ggaatttcca cacccattct tacctaattt tgactttgtt ggaggactac attgcaaacc
181 agccaaacca ctgcctaggg taacattgga ttgttttcct tgataaactg ttcgttcctt
241 tatcattctt tatttgtttt tacaaagagg atagtttatt ttaattatta atatttatct
301 ttaatctttt tttacagtcc agtaattatc cccttctgga ccaccctcgt tccatcctcc
361 tcctcccttg ctccaagagt atgtatgcca ggagcctcct gcgatggaga ggatagtgtc
421 aggggtgcag gagggaacaa agtaagactc tggtgtggct ttaaagctga cggtctcctg
481 acattctaac tctctacctg ttcagaaaca ctgatgataa cttctagaaa atcatacaaa
541 ctttcttgct ctttctcatg ataaaaggct gctggcttgg gaatcagtac ctgtaactta
601 acaacagagg attgagcaat gtggccttgg tcctatatag taggaactgt gtggctctaa
661 ctttcagcct gctagtcaga anngcagaag ggatctttcc acatgatgtc tcctccttct
721 tcttcttgta gtcctcctct actctcctgg attctcaact gggatcagac gccctgccct
781 cttctcttct gcccagctga tcgattcttt attaactaat caaggatgat ctaaattatt
841 ttatacataa cattgagacc agtgatgctt gactgtgcca aattttggac tgcaaccaga
901 tatctgggca taaaaattag cacatgaata cacagtgtaa aaaaaaaacc gtcccctaac
961 actcacctat tgttttctgc atgtgggtga gtctacatgt gtctgatggg aggcctgtgc
1021 atgtttcttt ttacaactag gtcccttnnc tg
gtatataa gtttcattac taggaagtgt
1081 tagcatttaa tggtaatttt gttagatgga tgggattgtg aatttaaaac ttgccttgaa
1141 gtagattttg agtgacatag cacattttta aattttattt tgtgtttttt taaagaggac
1201 atctctctat agcttanntg tccttaacct catagcagtc cttctgcctc agtctcccat
1261 gtgctgagat tagaccagtc ttaatacctc ttctgaaaca tgatgtgtaa tatcagtgat
1321 ggagatctta ctgtgcacag ctttagatca tgatgtttag cagattgtaa cttccattca
1381 tgagaagaaa ctgcacaaac catctcattc ctgtcttact ttattgattg gaagc
//
LOCUS RATUD2A05 769 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron B.
ACCESSION M35081 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 5 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 769)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pre-msg < 1 > 769 UDP glucuronosyltransferase-2 mRNA and intron
IVS < 1 > 769 UDP glucuronosyltransferase-2 intron B
BASE COUNT 293 a 116 c 117 g 243 t
ORIGIN About 0.1 kb after segment 4.
1 aaatgctact tcatttgatc ttgaaggtgt gtgagctgtc attatttaat tggtacggta
61 tttctttcaa ataaacaatt aaaatagtgt tcttttcttt aaaaaaataa agaaaaaaga
121 gatcataaag aaaaaaagaa gttgcagaaa gaaaagggga caccttgaaa agtgattata
181 gcacttatta ctaagttgta aaaggtttcc tatgaaaact atctaagaag ataagtagaa
241 aagtcctaat gagggaaagg aaaaaaaaat tcttctcctt ctcatcattt tgtcctcagt
301 acttacacat cttttcagaa tacatgacca caagttaaaa gtcataacaa aaaattaaat
361 aataaattta agtagaagtt tacaagaaaa aaatgcttac atgcatatcc attaggagta
421 atttctggct aaacaccatt cacatggctc cacaggttca tagaaggttg aaaaccataa
481 ttaaaattat tagtgaagtt ttgtattgat gaacccagtc catattttat cttctgtctt
541 agcacctata ataaatttta gttccctttt tacgaccttt agttaagtgt tttacaacct
601 cttggattgt gctctgagaa gaagaaagtc tggttgctat ctaagaacaa ttaactggtg
661 acacatagga gactgataca gttctcattg cacttttcac tatcagaaaa ggaactaaaa
721 taattccact ataaaagagc ttaataatca ctgatatact tagatctct
//
LOCUS RATUD2A06 359 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 3.
ACCESSION M35082 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 6 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 359)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pre-msg < 1 > 359 UDP glucuronosyltransferase-2 mRNA and intron
pept + 175 + 306 UDP glucuronosyltransferase-2, exon 3
IVS < 1 174 UDP glucuronosyltransferase-2 intron B
IVS 307 > 359 UDP glucuronosyltransferase-2 intron C
BASE COUNT 114 a 69 c 75 g 101 t
ORIGIN About 4.0 kb after segment 5.
1 gtagtatagt acaaatgcac acttaatgaa cactgggtac cgaggcaatg gatacactgg
61 tctcccaaaa taattccagg aattacataa tttcctctgg taagtttgtc tcggtagttg
121 agacaatgct tcccatgcaa ccattcatct gtgatgtcat aaccatcttc ataggaaatg
181 gaagaatttg ttcagagctc tggagaacat ggtgtagtgg tgttttctct gggatcaatg
241 gttaaaaacc tgactgaaga aaaagccaat gtagttgctt ctgctcttgc ccaaattcca
301 cagaaggtaa gataaaatgt ccacagagat ggcaaatgta ttataagtca tctgaaccc
//
LOCUS RATUD2A07 609 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exons 4 and 5.
ACCESSION M35083 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 7 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 609)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pept + 69 156 UDP glucuronosyltransferase-2, exon 4
292 + 511 UDP glucuronosyltransferase-2, exon 5
pre-msg < 1 > 609 UDP glucuronosyltransferase-2 mRNA and intron
IVS < 1 68 UDP glucuronosyltransferase-2 intron C
IVS 157 291 UDP glucuronosyltransferase-2 intron D
IVS 512 > 609 UDP glucuronosyltransferase-2 intron E
BASE COUNT 170 a 119 c 115 g 205 t
ORIGIN About 0.6 kb after segment 6.
1 ccaggaacaa attttaccaa agccttggaa tttctgtaat taaataaggc attgtctgtg
61 tgtaacaggt tgtatggaga tttgatggta agaaaccaga taccttagga tctaacactc
121 ggctgtacaa gtggatcccc cagaatgacc ttcttggtaa ggcaaagttt aactacaagt
181 ttgtggctat agtaacacac tttcttgaga atagcacact tctgagtctt catattttcc
241 tctcttaaat attattcggt caataattat gtcaacttct tctcattgca ggtcatccaa
301 aaaccaaagc ttttgtagct catggtggaa caaatggcat ctatgaggca atctaccatg
361 gcattcctat tgttggtatt cccttgtttg cagatcaacc ggataacatt aatcacatgg
421 tagccaaagg agctgctgtt agagttgact tcagcatact gtcaactaca ggccttctca
481 ctgccttgaa gattgtcatg aatgaccctt cgtgagtctg tttgtttgtt gaagttgttt
541 tttccaagga aggctgtttc tttttctttt ttgaaacata atttttacta tataactaca
601 agagctgcc
//
LOCUS RATUD2A08 316 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron E.
ACCESSION M35084 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 8 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 316)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pre-msg < 1 > 316 UDP glucuronosyltransferase-2 mRNA and intron
IVS < 1 > 316 UDP glucuronosyltransferase-2 intron E
BASE COUNT 112 a 39 c 42 g 123 t
ORIGIN About 0.1 kb after segment 7.
1 ctcatagata tttgcttgct tcagcctcct gggtgctggg attagaaata tctgaattta
61 tatttgctgt gaataactat tattttaaaa atattgacag attcagatga tcatcagatt
121 gattttatcc tatttgaagg agggagaata atttcgaaaa attatgtttt tgcatatctg
181 aaatatgtgc ttttttaaca ataaagttac tctaaatttc taattgaatc aattagacat
241 gattattctc aaactattct atataaagaa ataatattac aaatatttat ctattataac
301 aaaggacaca ttttct
//
LOCUS RATUD2A09 487 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron E.
ACCESSION M35085 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 9 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 487)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pre-msg < 1 > 487 UDP glucuronosyltransferase-2 mRNA and intron
IVS < 1 > 487 UDP glucuronosyltransferase-2 intron E
BASE COUNT 196 a 89 c 80 g 122 t
ORIGIN About 0.05 kb after segment 8.
1 gaataagaga cagtattaaa ttcatacaaa tacctggaga acactattgt aatttcaagg
61 tttgctagaa gacaaatgta cctaatgaga aggtcctgag tcaaaaataa ctggagaaag
121 tgctgttcgt tcctacatac acagtcttct agtccaggaa cagaattaaa ttgttttcat
181 tgtggtgaat tcttgtggaa ctgttgtaca aagaagagtc ataaacaaca aagtgttttt
241 agaagaagaa cctagttata aacagataca taggagagga aaaaaaacta gagaggagat
301 atcgaacatg acatatgacc tggaaaaagt tctatggcta cttcccttct tggtcttata
361 tcatgagtta catgttacac aaaaacacac acacacaaac aaacacacac aaacatacac
421 acacacaaac atacacacac acaaacatac aaacacatac acacaagttt gtgtgtctta
481 ctagttt
//
LOCUS RATUD2A10 895 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 6.
ACCESSION M35086 J05482
KEYWORDS UDP glucuronosyltransferase-2.
SEGMENT 10 of 10
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pUDPGTr-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 895)
AUTHORS Mackenzie,P.I. and Rodbourn,L.
TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
and characterization of its promoter
JOURNAL J. Biol. Chem. 265, 11328-11332 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pept + 212 488 UDP glucuronosyltransferase-2, exon 6
pre-msg < 1 836 UDP glucuronosyltransferase-2 mRNA and intron
IVS < 1 211 UDP glucuronosyltransferase-2 intron E
signal 808 812 poly-A signal
BASE COUNT 248 a 206 c 163 g 278 t
ORIGIN About 0.15 kb after segment 9.
1 gttcataatt accctgtgct aaacaagact gtttcactgt ctttcctgtc actcaactct
61 cctctgccac cacctgaaac aaaacacttg agtgggaagt atacatgatt tattttaagt
121 tgcttgtgag acttttccct aaaacaacaa atgttgttaa gtcatcaaat tgcctcctct
181 ttaatcttag ttgtatacat tgtcccttca gctataagga gaatgccatg agattatcca
241 gaatccacca tgatcagcca gtgaagcccc tggaccgagc cgtcttctgg atcgagtatg
301 tcatgcgtca caaaggagcc aagcacctcc gctcaactct gcatgacctt agctggttcc
361 agtaccactc tctggatgtc attgggttcc tattgctctg tgtggtaggt gtggtattca
421 tcatcacaaa attctgcctc ttttgttgcc gtaagactgc taacatggga aagaagaaga
481 aagagtagca tcataaaggc tgaagcagag ccctgagaga tgagcctctg ccagctgctt
541 ccagcaggaa cctgttgtca tgccagtgcc ttccctctaa aagaagacag cgttgggacc
601 tcattgaaca tggctccaat gaattcacta tgttctgaag acatgcaaga tttcatgcca
661 aatatatatt cagtgctaaa aaaacaaaat cctgtgttca gtttagaatg ttttgatgta
721 gctgagaagc tttgcccaac aacaataact gaagctactg tagttcataa agttcacatg
781 gctttatagc ctttgcaaaa catatctata aatcaattac tttttgaaaa tacccagcct
841 gctttgtctt catttagtag actatttttc tctccttctt tcttttttct tcttt
//
LOCUS RATUDPA 1858 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-5 (UDPGTr-5) mRNA, complete cds.
ACCESSION M33746 J05440
KEYWORDS UDP glucuronosyltransferase-5.
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
UDPGTr-5.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1858)
AUTHORS Mackenzie,P.I.
TITLE The cDNA sequence and expression of a variant 17B-hydroxysteroid
UDP glucuronosyltransferase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pept 43 1635 UDP glucuronosyltransferase-5 precursor (EC
2.4.1.17)
sigp 43 111 UDP glucuronosyltransferase-5 signal peptide
(put.)
matp 112 1632 UDP glucuronosyltransferase-5
mRNA < 1 1858 UDP glucuronosyltransferase-5 mRNA
BASE COUNT 559 a 365 c 387 g 547 t
ORIGIN
1 agaaaggaac acagtgtgaa cagaaggatt ttgattttca aaatgccagg aaaatggatt
61 tttgctctgc tcttgctgca gataagtttc tgcctcagat ctgcgaaatg tgggaaggtg
121 ttggtgtggc cgatggaatt cagtcactgg atgaatataa aaacaatact ggatgaactt
181 gtacagaggg gccatgaagt cactgttctg aaaccttcag cttactatgt tcttgatcca
241 aaaaaatcgc cagaccttaa gtttgaaact tttcctacat ctgtcagtaa agatgaactg
301 gaaaaatatt tcataaaact tgctgatgcg tggacttatg agttgcaaag agatacatgt
361 ttgtcttttt ctcctttact acaaaatatg atggatgaat tttctgatta ttatctaagt
421 gtttgtaaag acgccgtttc aaacaagcag ctcatggcaa aactacagga atccaagttt
481 gatgttcttt tgtcagatcc tgtggctgcc tgtggggagc tgatagccga agtgctccac
541 attccttttc tgtacagtct tcgtgcctct ccaggccata aaattgaaaa gtccagtgga
601 agatttatac tacctccctc ttatgtgcct gtaattttgt caggattggg tggccaaatg
661 acattcatag acagggttaa aaatatgata tgtatgcttt attttgactt ttggttccat
721 atgtttaatg ccaagaattg ggatccattt tatactgaga ttttgggaag gcccaccacc
781 ttagctgaga caatgggcaa agcagaaatg tggctcatta gatcctactg ggatttggag
841 tttccccacc caacattacc aaatgttgac tacattggag gactccaatg caaacctgct
901 aaacccttgc ccaaggatat agaagacttt gtccagagct ctggagagca tggtgtggtg
961 gtgttttctc tggggtcaat ggtcagcagc atgacagaag aaaaggccaa cgcaattgca
1021 tgggcccttg cccagattcc acaaaaggtt ctttggaaat ttgatggcaa aatcccagca
1081 actttaggac ccaataccag agtctacaag tggcttcccc agaatgacct ccttggtcat
1141 ccaaaaacca aagcctttgt aactcatggt ggagccaatg gtgtctatga ggccatctat
1201 catggaatcc ctatgattgg cattcctatg tttggagaac aacatgataa cattgcccac
1261 atggtggcca aaggagcagc tgttacactg aatatcagga caatgtcaaa gtcagatttg
1321 ttcaatgcac ttaaggaagt aataaacaat cctttctata aaaaaaatgc tatgtggctg
1381 tcaaccattc accatgacca acctatgaaa cccctggaca aggctatctt ctggattgag
1441 tatgtcatgc gccacaaaag agccaagcac ctgagaccac ttggacataa ccttccctgg
1501 taccagtacc actctctgga tgtgattgga ttcctgctag cctgtttggc agtcattgca
1561 gcccttgctg taaaatgctt cttgttcatt taccgattct ttgcaaagaa gcaaaagaaa
1621 atgaagaatg agtagagctc gttgacaatg cactacagga atgaaattta agcctcattc
1681 taatttatga atcactttct taacacttcc tgattttttt ttgtggaggc agatcatcat
1741 tgtaagaaga catatagctc tgtgaatatt gatatgttat caaaatttta aaatcactta
1801 atgtaaaaaa gttgcattgt agaaaaattg aggaaaataa agtttacttg atagtctt
//
LOCUS RATUDPB 2216 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Rat UDP glucuronosyltransferase-21 (UDPGTr-21) mRNA, 3' end.
ACCESSION M33747 J05440
KEYWORDS UDP glucuronosyltransferase-21.
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
UDPGTr-21.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2216)
AUTHORS Mackenzie,P.I.
TITLE The cDNA sequence and expression of a variant 17B-hydroxysteroid
UDP glucuronosyltransferase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.I.Mackenzie, 13-APR-1990.
FEATURES from to/span description
pept < 1 463 UDP glucuronosyltransferase-21 (AA at 2) (EC
2.4.1.17)
mRNA < 1 2216 UDP glucuronosyltransferase-21 nRNA
BASE COUNT 738 a 386 c 386 g 706 t
ORIGIN
1 agccaatggt gtctatgagg ccatctatca tggaatccct atgattggca ttcctatgtt
61 gggagaacaa catgataaca ttgcccacat ggtggccaaa ggagcagctg ttacactgaa
121 tatcaggaca atgtcaaagt cagatttgtt caatgcactt aaggaagtaa taaacaatcc
181 tttctataaa aaaaatgcta cgtggctgtc aaccattcac catgaccaac ctatgaaacc
241 cctggacaag gctatcttct ggattgagta tgtcatgcgc cacaaaagag ccaagcacct
301 gagaccactt ggacataacc ttccctggta ccagtaccac tctctggatg tgattggatt
361 cctgctagcc tgtttggcag tcattgcagc ccttgctgta aaatgcttct tgttcattta
421 ccgattcttt gcaaagaagc aaaagaaaat gaagaatgag tagagctcgt tgacaatgca
481 ctacaggaat gaaatttaag cctcattcta atttatgaat cactttctta acatttcctg
541 attttttttt gtggaggcag atcatcattg taagaagaca tatagctctg tgaatattga
601 tatgttatca aaattttaaa atcacttaat gtaaaaaagt tgcattgtag aaaaattgag
661 gaaaataaag tttacttgat agtcttaaaa atcacagtat taaccttaca atatttgaat
721 attgtccatt gacctctttc tctgagactg aatctgtagc tttcatacaa ataagtagct
781 aacttgtata ctataaatat ggacatataa atagtttttt ctgtaatagt cttaattatt
841 tgtagtcggg gataaagtgt ggtttggttt ggatattcat ttcaaagggt aggaatctgt
901 tggctatttt gttcctgtaa caaaatgtgc tgaccaaaag catctccagg gaaaagcaga
961 gcagtttatt ttgagttgtg cttacagatc ctgagaacgc aggatagata ggaaggcagg
1021 gcagcagtca gccagatgac aaaactctct cattacatct taaccacaca tagaaagcac
1081 aaagagtgag caaaaagtgt gactatggtg tgaactttca aagcttgctc cagtgatata
1141 tttcctccaa aaagatttaa cccctttaaa taatattcct gtacccctgg agttgggagt
1201 ttagctcagt ggtagagcat ttgcctacca aacacaaggc tctgtgttca gtcctcagct
1261 ccgggggaaa aaaaaaagaa agattccata acctcaaaca gcattacaaa ttttggaaaa
1321 tgtgctaaaa ttcatcagcc tatctgaaac attttacatt gaatccataa caggaaataa
1381 acctgtttct taattcttat tttttagcat accattctaa tactccaagt tctaacacag
1441 cacttgtacc tcttcaatgt aatttaacta tgatcatgag gcataatgtt cattggaaat
1501 gaagcatatg aacaggaaac aaataaaagt cctaactaaa gtaaacttag ctttgagatt
1561 ggctattaca agtctggttg taattccact aatgctgcca tatgctgtga ggaatgttat
1621 aaaagagcta tgtaactatt atgacagttg tagcttttag cattgaaata catagatatt
1681 aatataaaag taagtgtata atatgatgct taaatgtgta acctaatatt ttagaataaa
1741 tttaattagt ggaaacattc tagacaggaa cagtaaatat atccaacatc attattcttt
1801 gatttaaaaa atgcaatttg gaggttcttc cctgcaaaag actatctctt tcccactctt
1861 aacattactt aggtgcttat tacagtttta tgttgagttg gggaaagggt aaaattgacc
1921 cctttccata ttagcatgaa tattggtatc atctttattg agatcttgtt taggaaccca
1981 ttatgagact tcaggagtat aactttcata atgtttgtaa tagatgcaac tttacagcag
2041 acaacttgat ccttctggcc tcttaaatct ttccatcctc tattatgtaa tgttttgttg
2101 atagttactt cagtatttga cacaagattc aataatttta tgcctatggg ttccatcaaa
2161 catcatgact ctatatatat gtaaatccaa aataagaaat aaaaaatagt gtatct
//
LOCUS BCEHEMOL 280 bp ds-DNA BCT 26-JUL-1990
DEFINITION B.cereus hemolysin gene, partial cds.
ACCESSION M35411
KEYWORDS hemolysin.
SOURCE B.cereus DNA.
ORGANISM Bacillus cereus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 280)
AUTHORS Gilmore,M.S., Gilmore,K.S. and Goebel,W.
TITLE A new strategy for ordered DNA sequencing based on a novel method
for the rapid purification of near-milligram quantities of a cloned
restriction fragment
JOURNAL Gene Anal. Tech. 2, 108-114 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 280 hemolysin (AA at 2)
BASE COUNT 99 a 43 c 51 g 87 t
ORIGIN
1 gaattctcat ttatggattg taaaccgtgc aattgatatt atgtctcgta atacaacact
61 tgtaaaacaa gatcgagttg cactattaaa tgaatggcgt actgagttag agaacggtat
121 ttatgctgct gactatgaaa atccttatta tgataatagc acatttgctt cacatttcta
181 tgaccctgac aatgggaaaa cttatattcc gtatgcaaag caggcaaagg aaactggagc
241 taaatatttt aaattagctg gtgagtctta caaaaataaa
//
LOCUS BPEFHAA 164 bp ds-DNA BCT 26-JUL-1990
DEFINITION B.pertussis filamentous hemagglutinin antigen gene, partial cds.
ACCESSION M35274
KEYWORDS filamentous hemagglutinin antigen.
SOURCE B.pertussis DNA, clone lambda-FHA15.
ORGANISM Bordetella pertussis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Alcaligenaceae.
REFERENCE 1 (bases 1 to 164)
AUTHORS Mattei,D., Pichot,F., Bellalou,J., Mercereau-Puijalon,O. and
Ullmann,A.
TITLE Molecular cloning of a coding sequence of Bordetella pertussis
filamentous hemagglutinin gene
JOURNAL FEMS Microbiol. Lett. 37, 73-77 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 164 filamentous hemagglutinin antigen (AA at 1)
BASE COUNT 38 a 45 c 60 g 21 t
ORIGIN
1 gaattccaca tgcacctgga tgcgccgcgc atcgagaaca ccgcgaaact gacgcggcga
61 ggtgcaacgc aaaggcgtgc aggacgtcgg gggaggcgag cacggccgct ggacgtatcg
121 gctatgtcaa ctactggttg cgcgcgcatg gaagaaggcg ggca
//
LOCUS BPEFHAB 165 bp ds-DNA BCT 26-JUL-1990
DEFINITION B.pertussis filamentous hemagglutinin antigen gene, partial cds.
ACCESSION M35275
KEYWORDS filamentous hemagglutinin antigen.
SOURCE B.pertussis DNA, clone lambda-FHA15.
ORGANISM Bordetella pertussis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Alcaligenaceae.
REFERENCE 1 (bases 1 to 165)
AUTHORS Mattei,D., Pichot,F., Bellalou,J., Mercereau-Puijalon,O. and
Ullmann,A.
TITLE Molecular cloning of a coding sequence of Bordetella pertussis
filamentous hemagglutinin gene
JOURNAL FEMS Microbiol. Lett. 37, 73-77 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 165 filamentous hemagglutinin antigen (AA at 1)
BASE COUNT 30 a 54 c 55 g 26 t
ORIGIN
1 gaattcggac cagcctggcc cgagcgctgc atgccgcgcg ggaaggccca cacagttggt
61 cccgacactg ccactttccg agtcccatcg caacgggcgg tgatccactc gtcgttggcg
121 cgtgatagac agcgcgtgca tgcgagagcg catgcagcag gctgg
//
LOCUS CRECYCA 662 bp ss-mRNA PLN 26-JUL-1990
DEFINITION C.reinhardtii mitochondrial apocytochrome c (cyc) mRNA, complete
cds.
ACCESSION M35173
KEYWORDS apocytochrome c; cytochrome c apoprotein.
SOURCE C.reinhardtii, cDNA to mRNA, clone C321.
ORGANISM Chlamydomonas reinhardtii
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Volvocales; Chlamydomonadaceae.
REFERENCE 1 (bases 1 to 662)
AUTHORS Amati,B.B., Goldschmidt-Clermont,M., Wallace,C.J.A. and
Rochaix,J.-D.
TITLE cDNA and deduced amino acid sequences of cytochrome c from
Chlamydomonas reinhardtii: Unexpected functional and phylogenetic
implications
JOURNAL J. Mol. Evol. 28, 151-160 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 42 380 apocytochrome c (cyc)
BASE COUNT 147 a 176 c 200 g 139 t
ORIGIN
1 ccgaaccaaa acctttcctg tgacccttct atctgcttaa aatgtcgacc ttcgctgagg
61 cccccgctgg cgaccttgct cgcggcgaga agattttcaa gaccaagtgc gcgcaatgcc
121 acgttgctga gaagggcggc ggccacaagc agggccccaa cctgggcggt ctgttcggcc
181 gtgtctcggg cactgctgcc ggcttcgcat actcgaaggc gaacaaggag gctgccgtga
241 cctggggcga gagcactctc tacgagtacc tgctgaaccc caagaagtac atgcctggca
301 acaagatggt gttcgctggc ctgaagaagc ccgaggagcg cgccgatctg attgcctacc
361 tgaagcaggc gactgcttaa actgcgcgcg gcttagcaag cggcttcatt cattaggcag
421 aagcgggtct caagagcggg atagggttgc atctgggcgc ggcgtgtgtt cgcttcagaa
481 cgtcccacca gatgcaacag gcggatgtgt tacgagtgtc gagtgtgtac tgatgatggt
541 gtgcatgtgt aacggcgaca tacggatgga atagacatat cgtcttgaag actgtctcat
601 aggcagagac atctgctcac aggcaactta ttatgtctgc catgggcggt cgtaaagaat
661 tc
//
LOCUS ECOABC 1993 bp ds-DNA SYN 26-JUL-1990
DEFINITION Synthetic plasmid (for E.coli) DNA.
ACCESSION M34519
KEYWORDS b-galactosidase; b-lactamase; bla gene; lacZ gene; promoter.
SOURCE Synthetic DNA.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 1993)
AUTHORS Hayden,M.A., Shallcross,M.A., Stotland,E. and Mandecki,W.
TITLE A totally synthetic plasmid for general cloning, gene expression
and mutagenesis in Escherichia coli
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Mandecki, 22-MAY-1990.
Author address:W.Mandecki
Abbott Laboratories
Corporate Molecular Biology D93D
Abbott Park, IL 60064
FEATURES from to/span description
pept 119 301 b-galactosidase
pept 438 1298 b-lactamase
site 81 327 lacZ fragment mRNA
site 136 137 SmaI site for introduction of multicloning
signal 301 336 trpA terminator
signal 337 402 bla promoter P3
mRNA 403 1329 bla mRNA
signal 1299 1342 phage fd terminator
signal 1343 1408 RNAII promoter
mRNA 1409 1962 RNAII mRNA
signal 1415 1440 RNAI terminator
signal 1518 1555 RNAI promoter
mRNA 1422 1517 RNAI mRNA
site 1961 1962 RNaseH cleavage site
BASE COUNT 524 a 475 c 500 g 494 t
ORIGIN
1 gaattgatta atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctg
61 ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat
121 gactatgatt acgcccgggc ttgccgtcgt tttacagcga cgagactggg aaaatcctgg
181 cgttacccaa cttaatcgcc ttgccgcaca cccccctttc gccagttggc gtaatagcga
241 agaagcccgc accgaccgcc cttcccaaca gttgcgtagt ctgaatggcg aatggcgtta
301 aactagtagc ccgcctaatg agcgggcttt tttttaattc ccctatttgt ttatttttct
361 aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat
421 attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg
481 cggcattttg ccttcctgtt tttgctcacc cagaaacgct cgtgaaagta aaagacgcag
541 aggaccaatt gggggcacga gtgggataca tagaactgga cttgaatagc ggtaaaatcc
601 ttgagagttt tcgccctgaa gagcgttttc caatgatgag cactttcaaa gttctgctat
661 gtggagcagt attatcccgt gtagatgcgg ggcaagagca actcggacga cgaatacact
721 attcgcagaa tgacttggtt gaatactccc cagtgacaga aaagcacctt acggacggaa
781 tgacggtaag agaattatgt agtgccgcca taacgatgag tgataacact gcggcgaact
841 tacttctgac aaccatcggt ggaccgaagg aattaaccgc ttttttgcac aatatgggag
901 accatgtaac tcgccttgac cgttgggaac cagaactgaa tgaagccata ccaaacgacg
961 agcgagacac cacaatgcct gcggcaatgg caacaacatt acgcaaacta ttaactggcg
1021 aactacttac tctggcttca cggcaacaat taatagactg gcttgaagcg gataaagttg
1081 caggaccact actgcgttcg gcacttcctg ctggctggtt tattgctgat aaatctgggg
1141 caggagagcg tggttcacgg ggtatcattg ccgcacttgg accagatggt aagccttccc
1201 gtatcgtagt tatctacacg acgggtagtc aggcaactat ggacgaacga aatagacaga
1261 ttgctgaaat aggggcttca ctgattaagc attggtaaac cgatacaatt aaaggctcct
1321 tttggagcct ttttttttgg acggaccgag tagaaaagat caaaggatct tcttgagatc
1381 ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg
1441 tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag
1501 cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact
1561 ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg
1621 gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc
1681 ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg
1741 aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggacaaagg
1801 cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag
1861 ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc
1921 gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct
1981 ttttacggtt cct
//
LOCUS HS1PROM 591 bp ds-DNA VRL 26-JUL-1990
DEFINITION Herpes simplex virus type 1 joint promoter.
ACCESSION M34532
KEYWORDS promoter.
SOURCE Herpes simplex virus type 1 (strain KOS) DNA, clone pRAB6.
ORGANISM Herpes simplex virus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 591)
AUTHORS Bohenzky,R.A., Papavassiliou,A.P., Gelman,I.H. and Silverstein,S.
TITLE Identification of novel transcripts mapping to the joint region of
Herpes simplex virus type 1
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Silverstein, 23-MAY-1990.
Author address:S.Silverstein
Dept. of Microbiology
Columbia University
701 W. 168th ST.
N.Y., N.Y. 10032
FEATURES from to/span description
binding 134 139 CTF binding site
binding 290 295 Sp1 binding site
binding 508 513 Sp1 binding site
binding 387 392 Sp1 binding site
binding 480 485 Sp1 binding site
binding 492 497 Sp1 binding site
site 298 306 Oct1/a-TIF site
site 327 330 a4 enhancer
site 366 372 E4TF1 site
signal 528 533 TATA box
BASE COUNT 103 a 193 c 204 g 91 t
ORIGIN Map position 0.794-0.798.
1 gcatgcccct cccgccgacg caacaggggc ttggcctgcg tcggtgcccc ggggcttccc
61 gccttcccga agaaactcat taccataccc ggaaccccag gggaccaatg cgggttcatt
121 gagcgacccg cgggccaatg cgcgaggggc cgtgtgttcc gccaaaaaag caattaacat
181 aacccggaac cccaggggag tggttacgcg cggcgcggga ggcggggaat accggggttg
241 cccattaagg gccgcgggaa ttgccggaag cgggtaatgt cggccggggc cgcccattaa
301 tgagtttcta attaccatac cgggaagcgg aacaaggcct ctgcaagttt ttaattacca
361 taccgggaag tgggcgcccg cccagtgggc gggagttacc gcccagtggg ccggcccgac
421 gactcggcgg acgctggttg gccgggcccc gccgcgctgg cggccgccga ttggccagtc
481 ccgccctccg agggcggccc gcctcggggg cgggccggct ccaagcgtat atatgcgcgg
541 ctcctgccat cgtctctccg gagagcggct tggtgcggac ctgcagccaa g
//
LOCUS MZEHETRO 184 bp ds-DNA PLN 26-JUL-1990
DEFINITION Corn heterochromatin repetitive DNA.
ACCESSION M35408
KEYWORDS .
SOURCE Corn knob heterochromatin DNA, clone pZm4.25.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 184)
AUTHORS Peacock,W.J., Dennis,E.S., Rhoades,M.M. and Pryor,A.J.
TITLE Highly repeated DNA sequence limited to knob heterochromatin in
maize
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 4490-4494 (1981)
STANDARD simple staff_review
BASE COUNT 58 a 41 c 42 g 43 t
ORIGIN
1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcgtctggtc gatgatcaat
181 ggcc
//
LOCUS RATRSB1 170 bp ds-DNA ROD 26-JUL-1990
DEFINITION Rat B1 repetitive sequence.
ACCESSION M35409
KEYWORDS B1 repetitive sequence.
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 170)
AUTHORS Blin,N., Weber,T. and Alonso,A.
TITLE Cross-reaction of snRNA and an Alu I-like sequence from rat with
DNAs from different eucaryotic species
JOURNAL Nucleic Acids Res. 11, 1375-1388 (1983)
STANDARD simple staff_entry
BASE COUNT 57 a 29 c 44 g 40 t
ORIGIN
1 aaaaaaaagc aaatgacagc tgtgtgtggt ttcatatgtg tttaatccag cactcaggag
61 gcagaggtaa atggatctct gtgagttcga gtccagtctg gctacaaagc aagttctaga
121 gcagccaggg ctgttacaca gagaaactct gtcttggaag ataaaaaaga
//
LOCUS SHFINV 261 bp ds-DNA BCT 26-JUL-1990
DEFINITION Plasmid pINV (from S.flexneri) RepA gene, 5' end.
ACCESSION M35403
KEYWORDS .
SOURCE Plasmid pWR110 (from S.flexneri 5) DNA.
ORGANISM Shigella flexneri
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 261)
AUTHORS Silva,R.M., Saadi,S. and Maas,W.K.
TITLE A basic replicon of virulence-associated plasmids of Shigella spp.
and enteroinvasive Escherichia coli is homologous with a basic
replicon in plasmids of IncF groups
JOURNAL Infect. Immun. 56, 836-842 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 256 > 261 repA protein
mRNA 171 82 (c) inc mRNA
BASE COUNT 70 a 54 c 68 g 69 t
ORIGIN
1 gatcgtttaa ggaattttat ggctggccac gccttaaggt ggcagggaac tggttctgat
61 gtggatgtac aggagccaga aaagcaaaaa ccccgataat cttctttaac tttggcgagt
121 cagaaagatt accggggccc acttaaaccg tatagccaac aatcaagcta tgcggggagt
181 atagttatat gcccggaaaa gttcaagact tctttctgtg ctcgctcctt ctgcgcattg
241 taagtgcagg atggtgtgac t
//
LOCUS YSYPSKLA 598 bp ds-DNA PLN 26-JUL-1990
DEFINITION S.kluyveri plasmid pSKL left-end inverted terminal repeat.
ACCESSION M35319
KEYWORDS .
SOURCE S.kluyveri plasmid pSKL DNA.
ORGANISM Saccharomyces kluyveri
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 598)
AUTHORS Kitada,K. and Hishinuma,F.
TITLE A new linear DNA plasmid isolated from the yeast Saccharomyces
kluyveri
JOURNAL Mol. Gen. Genet. 206, 377-381 (1987)
STANDARD simple staff_review
FEATURES from to/span description
BASE COUNT 247 a 18 c 206 g 127 t
ORIGIN
1 aaaaggtata gatatagata tattttttat gggtttggaa gggggaagtg gaagaatgta
61 tcgtgtaaaa aaagagcaaa aaaaaaatta gatgagagaa ggggaaaaga ggggagtgta
121 tcatgtgaaa aaacgcgtca aaatgaagag aagggaaaaa ggggagagtg tatcgtgggg
181 aaagtgaatt ttgaagaaga gaaggggaaa agaggggagt gtatcgtcta agaagggggt
241 attataagag aaggggatat tggtagagtg tattgaatgt ggcttagcaa aaatagaaaa
301 agggtaaaaa atgggggata aaaaaaagaa aaaaacggta ttaaggggag aaggggaaaa
361 gggtagagtg tatcgtgcaa aaagtgagtt caaaatgaag agaaggggaa aagggtagag
421 tgtatcgtgg gggaaagtga gtttaaatga agagaagggg aaaagggtag agtgtatcgt
481 gggggaaagt gagtttaaat gaagagaagg gaaaaagggg gagtgtatcg tataaaaagt
541 gaatatattt tatttgatgg gattaagtat tgaaaatgga aatggatgat aggttgtt
//
LOCUS YSYPSKLB 117 bp ds-DNA PLN 26-JUL-1990
DEFINITION S.kluyveri plasmid pSKL right-end DNA.
ACCESSION M35320
KEYWORDS .
SOURCE S.kluyveri plasmid pSKL DNA.
ORGANISM Saccharomyces kluyveri
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 117)
AUTHORS Kitada,K. and Hishinuma,F.
TITLE A new linear DNA plasmid isolated from the yeast Saccharomyces
kluyveri
JOURNAL Mol. Gen. Genet. 206, 377-381 (1987)
STANDARD simple staff_review
BASE COUNT 35 a 5 c 18 g 59 t
ORIGIN
1 caaaaagtga gattaggggg agaatatatt tattatgtta aatataaggt agttttttta
61 taatttattt aatttatttt gtttgtattt tagcttcttt aattagtctg tattctt
//
LOCUS XELTRH 1442 bp ss-mRNA VRT 26-JUL-1990
DEFINITION X.laevis thyrotropin releasing hormone (TRH) mRNA, complete cds.
ACCESSION M34699 K00931 J05514
KEYWORDS thyrotropin releasing hormone.
SOURCE X.laevis skin, cDNA to mRNA, clone L4 and 8/136.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 478)
AUTHORS Richter,K., Kawashima,E., Egger,R. and Kreil,G.
TITLE Biosynthesis of thyrotropin releasing hormone in the skin of
Xenopus laevis: Partial sequence of the precursor deduced from
cloned cDNA
JOURNAL EMBO J. 3, 617-621 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 15 to 1442)
AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
TITLE Two precursors of thyrotropin releasing hormone from skin of
Xenopus laevis: Each contains seven copies of end-product
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by G.Kreil, 18-MAY-1990.
FEATURES from to/span description
pept 110 793 thyrotropin releasing hormone precursor
matp 332 340 thyrotropin releasing hormone copy 1
matp 374 382 thyrotropin releasing hormone copy 2
matp 428 436 thyrotropin releasing hormone copy 3
matp 470 478 thyrotropin releasing hormone copy 4
mRNA < 1 1442 TRH mRNA
conflict 139 139 t in [2]; c in [1]
conflict 214 216 tct in [2]; ctc in [1]
conflict 319 319 g in [2]; t in [1]
BASE COUNT 460 a 286 c 334 g 362 t
ORIGIN
1 agcacagagc agcacaagga cacactctgc atattgtgct gccggacaag gaggtgacag
61 ccagtcaggc tgagacaaag gaacttccag acctctgaca gcaggaaaga tggtgtctgt
121 ctggtggttg ctgcttcttg gtacaaccgt atctcacatg gtgcacacac aagagcagcc
181 tttactggag gaggacacag caccattaga tgatctggat gttcttgaga aagccaaagg
241 tatcctgatc cgcagtatcc tggagggatt tcaagaaggg caacaaaaca atagagatct
301 accagatgca atggaaatga tatctaagcg ccagcaccca gggaaacgat tccaggagga
361 gatagaaaag agacaacacc ctggaaagag ggatctggaa gatctgaatc tagagctttc
421 caaaaggcaa caccccggaa gaagatttgt ggatgatgta gagaagaggc aacatccagg
481 aaagagagaa gagggtgact ggagtaggag gtatctgaca gatgactcac gttatttgga
541 cctcctttct gatgtttcca ggagacagca cccaggcaaa agagttccag ccccattgtt
601 tacaaaacgt caacacccag gtaagagagt gacagaagaa gagggtgata ctgaatttga
661 aaactcgaag gaagtgggga agcgccagca tccaggaaag agatatgacc cttgtgaagg
721 ccctaatgcc tacaactgta actcaggaaa cattctaccg gattctgtag aagaattgag
781 ttttgggctt taagctgccc agccccttta ttagttccat ctgaccctaa atgattccca
841 atgaacacaa ctttctataa ttgttaaata acattgtatt aagtatcata catttctgga
901 aagcaagcag ctcttagaac acttcttcgc tttaaaaggc acctggggca taagagtatt
961 aagcttcaga cagtaacctg cccaccacag ggagggattc aacaatcaca attggctgag
1021 tgttcctttc ccttgtttgg cagtgagatc agataataaa tataagatgg ccaggaaagt
1081 ggactctttc ttttctgaaa atttgcaagt aacaccaaaa tataataatt tgcacactca
1141 gtagtattaa cgtgaagatc tcaagaaggt tataaattct tggtgatctg ctcaaagcat
1201 ttaattcata gttgcttcca tggtttgatg gggaatgcac attctaaatt gcttattgct
1261 aattagcgct tgccacacag ttctggtggt agatcttgat gaggcatatt caataaaagt
1321 agagcccata gtaaaatttg tgccccgtca gctttaagga tcctctgtaa gcaatatgtg
1381 ttgtgagggc cacttgtttc taaagtaata ttttcatttt aataaatatg tctactcaaa
1441 tg
//
LOCUS XELTRHA 2955 bp ss-mRNA VRT 26-JUL-1990
DEFINITION X.laevis thyrotropin releasing hormone mRNA.
ACCESSION M34698 J05514
KEYWORDS thyrotropin releasing hormone.
SOURCE X.laevis, cDNA to mRNA, clone C6.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2955)
AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
TITLE Two precursors of thyrotropin releasing hormone from skin of
Xenopus laevis: Each contains seven copies of end-product
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Kreil, 18-MAY-1990.
FEATURES from to/span description
pept 157 831 thyrotropin releasing hormone
BASE COUNT 927 a 597 c 604 g 827 t
ORIGIN
1 catgcagttt attagatata cagtacaatg aagtcagtta tgagaaatag caattgcagc
61 acaaggacac actctgcata ttgtgctgcc ggacaaggag gtgacagcca gtcaggctga
121 gacaaaggaa cttccagacc tctgacagca ggaaagatgg tgtctgtctg gtggttgctg
181 cttcttggta caaccgtatc tcacatggtg cacacacaag agcagccttt actggaggag
241 gacacagcac cattagatga tctggatgtt cttgagaaag ccaaaggtat cctgatccgc
301 agtatcctgg agggatttca agaagggcaa caaaacaata gagatctacc agatgcaatg
361 gaaatgatat ctaagcgcca gcacccaggg aaacgattcc aggaggagat agaaaagaga
421 caacaccctg gaaagaggga tctggaagat ctgaatctag agctttccaa aaggcaacac
481 cccggaagaa gatttgtgga tgatgtagag aagaggcaac atccaggaaa gagagaagag
541 ggtgactgga gtaggaggta tctgacagat gactcacgtt atttggacct cctttctgat
601 gtttccagga gacagcaccc aggcaaaaga gttccagccc cattgtttac aaaacgtcaa
661 cacccaggta agagagtgac agaagaagag ggtgatactg aatttgaaaa ctcgaaggaa
721 gtggggaagc gccagcatcc aggaaagaga tatgaccctt gtgaaggccc taatgcctac
781 aactgtaact caggaaacat tctaccggaa gaattgagtt ttgggcttta agctgcccag
841 cccctttatt agttccatct gaccctaaat gattcccaat gaacacaact ttctataatt
901 gttaaataac attgtattaa gtatcataca tttctggaaa gcaagcagct cttagaacac
961 ttcttcgctt taaaaggcac ctggggcata agagtattaa gcttcagaca gtaacctgcc
1021 caccacaggg agggattcaa caatcacaat tggctgagtg ttcctttccc ttgtttggca
1081 gtgagatcag ataaataaat ataagatggc caggaaagtg gactctttct tttctgaaaa
1141 tttgcaagta acaccaaaat ataataattt tgcactctgc agtgtattaa cgtgaagatc
1201 tcaagaaggt tataaattag gttataaatt cttggtgatc tgctcaaagc atttaattca
1261 tagttgcttc catggtttga tggggaatgc acattctaaa ttgcttattg ctaattagcg
1321 cttgccacac agttctggtg gtagatcttg atgaggcata ttcaataaaa gtagagccca
1381 tagtaaaatt tgtgccccgt cagctttaag gatcctctgt aagcaatatg tgttgtgagg
1441 gccacttgtt tctaaagtaa tattttcatt ttaataaata tgtctactca aatgacaaaa
1501 acattcatta tttcactaca ttatactcct tcccacagca attatgtacc tatgaatcct
1561 gatagaagac tgcagttttc ctcttatatc ctccatgttg gattcaccat aagtcaccaa
1621 aatatatcta tagggaagca cactatacac aatagcagtg acccccatcc agtggcttgt
1681 gggcaacaag ctactcacca acccccttgg ctgttgctcc cagtggccct aaagtaaggt
1741 gcataaaaaa accagatgaa cttgtcaaaa agagcctccc ttagactgcc ttgttccaca
1801 tagaggctac catatagcca atcacagccc ttatttggca cccccgggaa cttttttcat
1861 gcttgagttg ctccccaaat ctttttacag ttgaatatgt ctcatggcta aaaaaacgtg
1921 aggaccccgg cgtaatatag tataatatac acacactcac tttggaaaac tctatggaga
1981 tcaataagca cttttgggtt aaactatttt tttgatacaa tttgagcact ttatatatgg
2041 attttaaaga tattccgctt tagtagtctg tggtgcgctg ccccataaat atattggtga
2101 attattcacc acctactctt aacaattctg ctcaattcat ctagatgtta acataataca
2161 tcaccagtat cacaatggca gcgggaagca aagacattct gtagtgtcct gagaccagct
2221 aaagcctaga ggtggaccat aaataatgtc tattgcaggg tcagtacaaa caaaaacacc
2281 aaggctgctt tatacaaggc atatctaatt tgcaggtatt ttgctgaact attactccac
2341 acacaaagct tgagggacac agactaataa tctgctgaag gtttgcagga tggacagttg
2401 gacactgctt tgcttcaact ttattctagg cttgtgctct gatgtatgca gcgtcaaata
2461 ccagctgttg tttgactaca actcccagaa gcctcagcat actgagggtg gtatgcttga
2521 atgcttgaat gcttgaatac cgaaggctgt ctgtcctcca acacctcccg ttgatctccc
2581 gctccagctc ttattgtcat tccattgtat attttgtttt taaatgtata aagaaataaa
2641 aaaaaagtat gatatattca cccttcttct tctgagtata aaaagattta aatgaatgtg
2701 aaaataatat ttttatagac aacaatcttt gtgcagtgtt ggtaaataca tgtttattct
2761 gtatatagct attttaatat gcatactgaa agaatatata tatataataa gaagcatgaa
2821 catctcattg cctgggtatg aaacaataaa gattgcatct gataatgaag caaattcgct
2881 ctgtggcgca gtattatgtt gacctgatga tgaagttagg tctggtgcgc ttctcaatgt
2941 tcgtggcgct ggccc
//
LOCUS MUSIGCS 302 bp ds-DNA ROD 26-JUL-1990
DEFINITION Mouse Ig heavy-chain gene enhancer region.
ACCESSION M35179
KEYWORDS constant region; germline; immunoglobulin heavy-chain.
SOURCE Mouse (strain BXXB:SB/Le) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 302)
AUTHORS Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and Dixon,F.J.
TITLE Molecular aspects of murine systemic lupus erythematosus
JOURNAL Springer Semin. Immunopathol. 9, 121-142 (1986)
STANDARD simple staff_review
BASE COUNT 90 a 59 c 68 g 85 t
ORIGIN
1 ctgcagcagc tggcaggaag caggtcatgt ggcaaggcta tttggggaag ggaaaataaa
61 accactaggt aaacttgtag ctgtggtttg aagaagtggt tttgaaacac tctgtccagc
121 cccaccaaac cgaaagtcca ggctgagcaa aacaccacct gggtaatttg catttctaaa
181 ataagttgag gattcagccg aaactggaga ggtcctcttt taacttattg agttcaacct
241 tttaatttta gcttgagtag ttctagtttc cccaaactta agtttatcga cttctaaaat
301 gt
//
LOCUS MUSIGCT 313 bp ds-DNA ROD 26-JUL-1990
DEFINITION Mouse Ig heavy-chain gene enhancer region.
ACCESSION M35180
KEYWORDS constant region; germline; immunoglobulin heavy-chain.
SOURCE Mouse (lupus erythematosus strain MRL/I) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 313)
AUTHORS Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and Dixon,F.J.
TITLE Molecular aspects of murine systemic lupus erythematosus
JOURNAL Springer Semin. Immunopathol. 9, 121-142 (1986)
STANDARD simple staff_review
BASE COUNT 94 a 59 c 69 g 91 t
ORIGIN
1 ctgcagcagc tggcaggaag caggtcatgt ggcaaggcta tttggggaag ggaaaataaa
61 accactaggt aaacttgtag ctgtggtttg aagaagtggt tttgaaacac tctgtccagc
121 cccaccaaac cgaaagtcta ggctgagcaa aacaccacct gggtaatttg catttctaaa
181 ataagttgag gattcagccg aaactggaga ggtcctcttt taacttattg agttcaacct
241 tttaatttta gcttgagtag ttctagtttc cccaaactta agtttatcga cttctaaaat
301 gtatttagaa ttc
//
LOCUS MUSTCBYBB 459 bp ds-DNA ROD 26-JUL-1990
DEFINITION Mouse T-cell receptor C beta-1/2 recombinant chain, exon 1.
ACCESSION M35181
KEYWORDS T-cell receptor beta chain; constant region; germline.
SOURCE Mouse (strain NZW) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 459)
AUTHORS Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and Dixon,F.J.
TITLE Molecular aspects of murine systemic lupus erythematosus
JOURNAL Springer Semin. Immunopathol. 9, 121-142 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pre-msg < 1 > 459 TCR C-beta-1/2 recombinant chain, exon 1
IVS 436 > 459 TCR C-beta-1/2 intron A (no splice consensus)
BASE COUNT 117 a 124 c 126 g 92 t
ORIGIN
1 ttacaagatc aaggcagatc cagatagctc tcagaccatt cgtactctct ttactttcca
61 gaggatctga gaaatgtgac tccacccaag gtctccttgt ttgagccatc aaaagcagag
121 attgcaaaca aacaaaaggc taccctcgtg tgcttggcca ggggcttctt ccctgaccac
181 gtggagctga gctggtgggt gaatggcagg gaggtccaca gtggggtcag cacggaccct
241 caggcctaca aggagagcaa ttatagctac tgcctgagca gccggctgag ggtctctgct
301 accttctggc acaatcctcg aaaccacttc cgctgccaag tgcagttcca tgggctttca
361 gaggaggaca agtggccaga gggctcaccc aaacctgtca cacagaacat cagtgcagag
421 gcctggggcc gagcaggtaa gtgcggacct catgaggaa
//
LOCUS HAMSCARPB 537 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Hamster alpha-crystallin B chain mRNA, 5' end.
ACCESSION J03849
KEYWORDS alpha-crystallin B chain.
SOURCE Hamster scrapie infected brain, cDNA to mRNA.
ORGANISM Mesocricetus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 537)
AUTHORS Duguid,J.R., Rohwer,R.G. and Seed,B.
TITLE Isolation of cDNAs of scrapie-modulated RNAs by subtractive
hybridization of a cDNA library
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
J.Duguid, 25-OCT-1990.
FEATURES from to/span description
pept 21 > 537 alpha-crystallin B chain
BASE COUNT 113 a 177 c 127 g 120 t
ORIGIN
1 catacattca cctagccacc atggacatcg ccatccacca cccctggatc cgccgtccct
61 ttttcccttt ccactccccc agccgcctct ttgaccagtt cttcggagag cacctgttgg
121 agtctgacct cttctcaact gccacttctc tgagtccctt ctacctgcgg ccaccttcct
181 tccttcgggc acccagctgg attgacactg gactctcaga gatgcggatg gagaaggaca
241 gattctccgt caacctggat gtgaagcact tctccccgga agagctgaaa gtcaaggtgc
301 tgggggacgt ggttgaagtg catggcaagc acgaagagcg ccaggacgaa cacggcttca
361 tctctaggga gttccatagg aagtaccgga tcccagctga tgtggatcct ctgaccatta
421 cttcatccct gtcatctgac ggcgtcctca ctgtgaatgg accaaggaaa caggcctctg
481 gccccgagcg taccattccc atcacccgtg aagagaagcc tgctgtcact gcagccc
//
LOCUS HAMSCRAP 282 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Hamster glial fibrillary acidic protein mRNA, partial cds.
ACCESSION J03847
KEYWORDS glial fibrillary acidic protein.
SOURCE Hamster scrapie infected brain, cDNA to mRNA.
ORGANISM Mesocricetus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 282)
AUTHORS Duguid,J.R., Rohwer,R.G. and Seed,B.
TITLE Isolation of cDNAs of scrapie-modulated RNAs by subtractive
hybridization of a cDNA library
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
J.Duguid, 25-OCT-1990.
FEATURES from to/span description
pept < 1 > 282 glial fibrillary acidic protein (AA at 1)
BASE COUNT 90 a 69 c 76 g 47 t
ORIGIN
1 gagggccaaa gcctcaagga ggagatggct cgccacctgc aggagtatca agatctactc
61 aatgtcaagc tagccctgga catcgagatt gccacctata ggaaattgct agaaggcgag
121 gaaaaccgca tcaccatccc tgtacaaact ttctccaacc tgcaaatccg agaaaccagc
181 ctggacacca agtccgtgtc agaaggacac ctcaagagga acatcgtggt aaagacagtg
241 gagatgaggg atggtgaggt cattaaggag tccaagcagg ag
//
LOCUS HAMSCRAPA 327 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Hamster metallothionein II mRNA, complete cds.
ACCESSION J03848
KEYWORDS metallothionein II.
SOURCE Hamster scrapie infected brain, cDNA to mRNA.
ORGANISM Mesocricetus sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 327)
AUTHORS Duguid,J.R., Rohwer,R.G. and Seed,B.
TITLE Isolation of cDNAs of scrapie-modulated RNAs by subtractive
hybridization of a cDNA library
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
J.Duguid, 25-OCT-1990.
FEATURES from to/span description
pept 62 247 metallothionein II
BASE COUNT 65 a 100 c 81 g 81 t
ORIGIN
1 cactcaagtt tcgacttttc ctcggtcctc agccggtctt caaccgccgc cttcactcgc
61 catggacccc aactgctcct gtgccacaga tggatcctgc tcctgctctg ggtcttgcaa
121 atgcaaagag tgcaaatgca ccacgtgcaa gaaaagctgc tgctcctgct gcccggtggg
181 ctgtgcgaag tgctcccagg gctgcgtctg caaagaggct tcggagaagt gcagctgctg
241 cgcctgaagc ggattcccct cagctgtctg taaatagagc aatgtgtaga aacgtattgg
301 tttttttaca accccgtcct attctcc
//
LOCUS ASOTAAG1 2935 bp ds-DNA PLN 26-JUL-1990
DEFINITION A.oryzae Taka-amylase A (Taa-G1) gene, complete cds.
ACCESSION M33218
KEYWORDS Taka-amylase A.
SOURCE A.oryzae (strain JCM02239) DNA.
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 2935)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 607 772 Taka-amylase A (Taa-G1) precursor, exon 1
828 868 Taka-amylase A precursor, exon 2
955 1070 Taka-amylase A precursor, exon 3
1140 1248 Taka-amylase A precursor, exon 4
1317 1545 Taka-amylase A precursor, exon 5
1603 1765 Taka-amylase A precursor, exon 6
1830 1976 Taka-amylase A precursor, exon 7
2041 2281 Taka-amylase A precursor, exon 8
2360 2647 Taka-amylase A precursor, exon 9
sigp 607 669 Taka-amylase A signal peptide
matp 670 772 Taka-amylase A
828 868 Taka-amylase A
955 1070 Taka-amylase A
1140 1248 Taka-amylase A
1317 1545 Taka-amylase A
1603 1765 Taka-amylase A
1830 1976 Taka-amylase A
2041 2281 Taka-amylase A
2360 2644 Taka-amylase A
pre-msg 543 > 2789 Taa-G1 mRNA and introns
IVS 773 827 Taa-G1 intron A (no splice consensus)
IVS 869 954 Taa-G1 intron B
IVS 1071 1139 Taa-G1 intron C
IVS 1249 1316 Taa-G1 intron D
IVS 1546 1602 Taa-G1 intron E
IVS 1766 1829 Taa-G1 intron F
IVS 1977 2040 Taa-G1 intron G
IVS 2282 2359 Taa-G1 intron H
signal 2784 2789 poly-A signal
BASE COUNT 818 a 752 c 657 g 708 t
ORIGIN
1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact
61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca
121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca
181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag
241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat
301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat
361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt
421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc
481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg
541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag
601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct
661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat
721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt
781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg
841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg
901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac
961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc
1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat
1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga
1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt
1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct
1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct
1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact
1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct
1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga
1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc
1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg
1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt
1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca
1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt
1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca
1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca
1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa
1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag
2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg
2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa
2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc
2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa
2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga
2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg
2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga
2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt
2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg
2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag
2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag
2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga
2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca
2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc
2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg
//
LOCUS ASOTAAG2A1 197 bp ds-DNA PLN 26-JUL-1990
DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 5' end.
ACCESSION M33220
KEYWORDS Taka-amylase A.
SOURCE A.oryzae (strain JCM02239) DNA.
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 197)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 195 > 197 Taka-amylase A (Taa-G2) precursor
pre-msg 131 > 197 Taa-G2 mRNA and introns
signal 2 11 CAAT box
signal 95 100 TATA box
BASE COUNT 59 a 52 c 36 g 50 t
ORIGIN
1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa
61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg
121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc
181 cacagaaggc atttatg
//
LOCUS ASOTAAG2A2 198 bp ds-DNA PLN 26-JUL-1990
DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 3' end.
ACCESSION M33222
KEYWORDS Taka-amylase A.
SOURCE A.oryzae (strain JCM02239) DNA.
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 198)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 9 Taka-amylase A (Taa-G2) (AA at 1)
pre-msg < 1 151 Taa-G2 mRNA and introns (alt.)
pre-msg < 1 156 Taa-G2 mRNA and introns (alt.)
BASE COUNT 45 a 48 c 53 g 52 t
ORIGIN About 2.1 kb after segment 1.
1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc
181 gtcctccaga ggcgagga
//
LOCUS ASOTTAM1 191 bp ss-mRNA PLN 26-JUL-1990
DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 3' end.
ACCESSION M33219
KEYWORDS Taka-amylase A.
SEGMENT 1 of 2
SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 191)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 189 > 191 Taka-amylase A (Taa)
mRNA 125 > 191 Taa mRNA
BASE COUNT 56 a 52 c 32 g 51 t
ORIGIN
1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca
61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt
121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga
181 aggcatttat g
//
LOCUS ASOTTAM2 156 bp ss-mRNA PLN 26-JUL-1990
DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 5' end.
ACCESSION M33221
KEYWORDS Taka-amylase A.
SEGMENT 2 of 2
SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
ORGANISM Aspergillus oryzae
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 156)
AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
Udaka,S.
TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
Evidence for multiple related genes
JOURNAL Gene 84, 319-327 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 9 Taka-amylase A (Taa) (AA at 1)
mRNA < 1 151 Taa mRNA (alt.)
mRNA < 1 156 Taa mRNA (alt.)
BASE COUNT 40 a 35 c 42 g 39 t
ORIGIN About 1.5 kb after segment 1.
1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
121 ggcaccgggc cccgactgag agcccgtctc gaatcc
//
LOCUS CIPNADGAPD 1354 bp ss-mRNA PLN 26-JUL-1990
DEFINITION M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH)
mRNA, complete cds.
ACCESSION J05223
KEYWORDS D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase;
glyceraldehyde-3-phosphate dehydrogenase.
SOURCE M.crystallinum, cDNA to mRNA.
ORGANISM Mesembryanthemum crystallinum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Aizoaceae.
REFERENCE 1 (bases 1 to 1354)
AUTHORS Ostrem,J.A., Vernon,D.M. and Bohnert,H.J.
TITLE Increased expression of a gene coding for NAD:glyceraldehyde-3-
phosphate dehydrogenase during the transition from C-3
photosynthesis to crassulacean acid metabolism in Mesembryanthemum
crystallinum
JOURNAL J. Biol. Chem. 265, 3497-3502 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 55 1068 glyceraldehyde-3-phosphate dehydrogenase (EC
1.2.1.12)
BASE COUNT 315 a 308 c 345 g 386 t
ORIGIN
1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct
61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc
121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac
181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag
241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg
301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga
361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt
421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac
481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg
541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc
601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg
661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt
721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg
781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct
841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat
901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga
961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac
1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga
1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc
1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt
1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta
1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc
1321 agcatttatg aaacattgct gtgagctttt gagt
//
LOCUS ECOPRIA 2658 bp ds-DNA BCT 26-JUL-1990
DEFINITION E.coli primosomal protein n' (priA) gene, complete cds, and cytR
gene, 5' end.
ACCESSION M33293
KEYWORDS cytR gene; priA gene; primosomal protein.
SOURCE E.coli (strain W3110) DNA, clone pEL042.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2658)
AUTHORS Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A.
TITLE The priA gene encoding the primosomal replicative n' protein of
Escherichia coli
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4620-4624 (1990)
STANDARD full automatic
COMMENT Authorin sequence for [1] kindly submitted by G.C.Allen Jr.,
26-MAR-1990.
FEATURES from to/span description
pept 64 2262 primosomal protein n'
pept 2418 > 2658 cytR gene product
signal 27 32 -10 region
signal 5 10 -35 region
binding 53 59 ribosome binding site
BASE COUNT 578 a 738 c 756 g 586 t
ORIGIN
1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat
61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg
121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag
181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag
241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga
301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc
361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt
421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa
481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa
541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa
601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg
661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc
721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa
781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg
841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc
901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt
961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt
1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac
1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc
1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt
1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg
1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac
1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt
1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac
1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc
1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg
1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg
1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca
1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg
1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc
1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac
1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa
1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg
1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg
2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg
2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg
2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt
2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg
2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg
2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg
2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg
2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata
2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac
2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc
2641 cggatatctg cgatcccc
//
LOCUS EWCTELRNA 657 bp ds-DNA INV 26-JUL-1990
DEFINITION E.crassus telomerase RNA component gene, complete cds.
ACCESSION M33461
KEYWORDS telomerase RNA.
SOURCE E.crassus DNA.
ORGANISM Euplotes crassus
Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora;
Spirotricha; Heterotrichida; Clevelandellina.
REFERENCE 1 (bases 1 to 657)
AUTHORS Shippen-Lentz,D. and Blackburn,E.H.
TITLE Functional evidence for an RNA template in telomerase
JOURNAL Science 247, 546-552 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
RNA 152 342 telomerase RNA component
site 186 197 functional telomeric template
BASE COUNT 202 a 122 c 111 g 222 t
ORIGIN
1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga
61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt
121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg
181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg
241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt
301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg
361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt
421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat
481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga
541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac
601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt
//
LOCUS HS4DWXJ 160 bp ds-DNA VRL 26-JUL-1990
DEFINITION Epstein-Barr virus defective WZhet junction.
ACCESSION M33474
KEYWORDS .
SOURCE Epstein-Barr virus (strain HR-1, clinical sample 9) DNA.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 160)
AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy
leukoplakia
JOURNAL J. Virol. 64, 397-400 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 87 90 WZhet junction
BASE COUNT 41 a 45 c 41 g 33 t
ORIGIN
1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc
61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact
121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac
//
LOCUS HS4WXJ 181 bp ds-DNA VRL 26-JUL-1990
DEFINITION Epstein-Barr virus WZhet junction, HR-1 clone 5.
ACCESSION M33473
KEYWORDS .
SOURCE Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 181)
AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy
leukoplakia
JOURNAL J. Virol. 64, 397-400 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 108 111 WZhet junction
BASE COUNT 50 a 42 c 46 g 43 t
ORIGIN
1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt
61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca
121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga
181 c
//
LOCUS HUMREGA 4251 bp ds-DNA PRI 26-JUL-1990
DEFINITION Human regenerating protein (reg) gene, complete cds.
ACCESSION J05412
KEYWORDS pancreatic stone protein; pancreatic thread protein;
regenerating protein.
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4251)
AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
TITLE Complete nucleotide sequence of the human reg gene and its
expression in normal and tumoral tissues: The reg protein,
pancreatic stone protein, and pancreatic thread protein are one and
the same product of the gene
JOURNAL J. Biol. Chem. 265, 7432-7439 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
H.Okamoto, 23-FEB-1990.
FEATURES from to/span description
pept 1571 1634 regenerating protein (reg), exon 2 (first
expressed exon)
2270 2388 regenerating protein, exon 3
2696 2833 regenerating protein, exon 4
3549 3660 regenerating protein, exon 5
3856 3923 regenerating protein, exon 6
pre-msg 1196 4116 reg mRNA and introns
IVS 1224 1524 reg intron A
IVS 1635 2269 reg intron B
IVS 2389 2695 reg intron C
IVS 2834 3548 reg intron D
IVS 3661 3855 reg intron E
signal 1169 1174 TATA box
BASE COUNT 1161 a 927 c 869 g 1294 t
ORIGIN
1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc
61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat
121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt
181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc
241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt
301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc
361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc
421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat
481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac
541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg
601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg
661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata
721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag
781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc
841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga
901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt
961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag
1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc
1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa
1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa
1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat
1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta
1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc
1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag
1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac
1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt
1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct
1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact
1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt
1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat
1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg
1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt
1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa
1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac
2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga
2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat
2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa
2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc
2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc
2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag
2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca
2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc
2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac
2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca
2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta
2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt
2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca
2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta
2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag
2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata
3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt
3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca
3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct
3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa
3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga
3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt
3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa
3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc
3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc
3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg
3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag
3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg
3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt
3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt
3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct
3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc
3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca
4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt
4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc
4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct
4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c
//
LOCUS HUMREGRELA 1524 bp ds-DNA PRI 26-JUL-1990
DEFINITION Human reg-related sequence, complete cds.
ACCESSION J05413
KEYWORDS pancreatic stone protein; pancreatic thread protein;
regenerating protein.
SOURCE Human esophageal mucosa DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1524)
AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
TITLE Complete nucleotide sequence of the human reg gene and its
expression in normal and tumoral tissues: The reg protein,
pancreatic stone protein, and pancreatic thread protein are one and
the same product of the gene
JOURNAL J. Biol. Chem. 265, 7432-7439 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
H.Okamoto, 23-FEB-1990.
BASE COUNT 382 a 368 c 345 g 429 t
ORIGIN
1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt
61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt
121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg
181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat
241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg
301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga
361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata
421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat
481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag
541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa
601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc
661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac
721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta
781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca
841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt
901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat
961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct
1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc
1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga
1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc
1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct
1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct
1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc
1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg
1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc
1501 tataaatcat ataatgtaaa ggat
//
LOCUS MDPCGA 4801 bp ss-RNA VRL 26-JUL-1990
DEFINITION Aleutian mink disease parvovirus complete genome.
ACCESSION M20036
KEYWORDS complete genome.
SOURCE Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4.
ORGANISM Aleutian mink disease parvovirus
Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE 1 (bases 1 to 4592)
AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
Wolfinbarger,J.B.
TITLE Nucleotide sequence and genomic organization of Aleutian mink
disease parvovirus (ADV): Sequence comparisons between a
nonpathogenic and pathogenic strain of ADV
JOURNAL J. Virol. 62, 2903-2915 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 4593 to 4801)
AUTHORS Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W.,
Perryman,S. and Wolfinbarger,J.B.
TITLE Nucleotide sequence of the 5'-terminal palindrome of Aleutian mink
disease parvovirus (ADV) and construction of an infectious
molecular clone
JOURNAL J. Virol. 64, 3551-3556 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by M.E.Bloom, 18-MAR-1990.
Sequence reported below is (+) strand.
FEATURES from to/span description
ORF 116 1978 Left ORF
ORF 1983 2207 Middle ORF2
ORF 1993 2211 Middle ORF1
ORF 2241 4399 Right ORF
BASE COUNT 1740 a 912 c 943 g 1206 t
ORIGIN
1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt
61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt
121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag
181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact
241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt
301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact
361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc
421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac
481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt
541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta
601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag
661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa
721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc
781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga
841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta
901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt
961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga
1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg
1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa
1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt
1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga
1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt
1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga
1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa
1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg
1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta
1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat
1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac
1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg
1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa
1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa
1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca
1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt
1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca
2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa
2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg
2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata
2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta
2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc
2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag
2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg
2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg
2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt
2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact
2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag
2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata
2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta
2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa
2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt
2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg
3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc
3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct
3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta
3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt
3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg
3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa
3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg
3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt
3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac
3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga
3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat
3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac
3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt
3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc
3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg
3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg
3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac
4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg
4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa
4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag
4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag
4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg
4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact
4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc
4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta
4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac
4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt
4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt
4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct
4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg
4801 a
//
LOCUS MDPUPS 3454 bp ss-RNA VRL 26-JUL-1990
DEFINITION Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial
sequence.
ACCESSION M32981
KEYWORDS .
SOURCE Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA.
ORGANISM Aleutian mink disease parvovirus
Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE 1 (sites)
AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
Wolfinbarger,J.B.
TITLE Nucleotide sequence and genomic organization of Aleutian mink
disease parvovirus (ADV): Sequence comparisons between a
nonpathogenic and pathogenic strain of ADV
JOURNAL J. Virol. 62, 2903-2915 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3454, for [1])
AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
Wolfinbarger,J.B.
JOURNAL Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by M.E.Bloom 18-MAR-1990.
Sequence reported below is (+) strand.
BASE COUNT 1268 a 673 c 698 g 815 t
ORIGIN
1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc
61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac
121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac
181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta
241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa
301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga
361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat
421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt
481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa
541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta
601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat
661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag
721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga
781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat
841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc
901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact
961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt
1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac
1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac
1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa
1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta
1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag
1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac
1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg
1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa
1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac
1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca
1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga
1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc
1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg
1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta
1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt
1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga
1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag
2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag
2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac
2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt
2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga
2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc
2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg
2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat
2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta
2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg
2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa
2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg
2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta
2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc
2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa
2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt
2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca
3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg
3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct
3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga
3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt
3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc
3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt
3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa
3421 aacctatggt tactttactt ggagaggcaa gctt
//
LOCUS MUSAA2DEL 300 bp ds-DNA ROD 26-JUL-1990
DEFINITION Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion
fragment.
ACCESSION M33468
KEYWORDS deletion mutant.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
TITLE Molecular analysis of two mouse dilute locus deletion mutations:
Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
lethal Aa2 alleles
JOURNAL Mol. Cell. Biol. 10, 501-509 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 130 133 deletion breakpoint
BASE COUNT 82 a 45 c 76 g 97 t
ORIGIN
1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact
61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt
121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt
181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata
241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa
//
LOCUS MUSSL20JA 300 bp ds-DNA ROD 26-JUL-1990
DEFINITION Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion
fragment.
ACCESSION M33467
KEYWORDS deletion mutant.
SOURCE Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
TITLE Molecular analysis of two mouse dilute locus deletion mutations:
Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
lethal Aa2 alleles
JOURNAL Mol. Cell. Biol. 10, 501-509 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept.ps / 42 122 dilute gene, exon 3
179 > 300 dilute gene, exon 5
recomb 129 130 deletion breakpoint
BASE COUNT 86 a 51 c 61 g 102 t
ORIGIN
1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac
61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac
121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta
181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta
241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc
//
LOCUS PRVVP4 2359 bp ds-RNA VRL 26-JUL-1990
DEFINITION Porcine rotavirus capsid protein VP4 gene, complete cds.
ACCESSION M33516
KEYWORDS capsid protein VP4.
SOURCE Porcine rotavirus (Gottfried strain; serotype 4) DNA.
ORGANISM Porcine rotavirus
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 2359)
AUTHORS Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K.
TITLE Similarity of the outer capsid protein VP4 of the Gottfried strain
of porcine rotavirus to that of asymptomatic human rotavirus
strains
JOURNAL J. Virol. 64, 414-418 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 10 2337 capsid protein VP4
BASE COUNT 812 a 406 c 442 g 699 t
ORIGIN
1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa
61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg
121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat
181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca
241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac
301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga
361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg
421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta
481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat
541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa
601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag
661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta
721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag
781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt
841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca
901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt
961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc
1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac
1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg
1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt
1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc
1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca
1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga
1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac
1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg
1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata
1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt
1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct
1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc
1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata
1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga
1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct
1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat
1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat
2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc
2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata
2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc
2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa
2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag
2341 ctctatagag gatgtgacc
//
LOCUS RATSTAA 1000 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds.
ACCESSION M33329
KEYWORDS hydroxysteroid sulfotransferase a.
SOURCE Rat (strain Sprague-Dawley) female liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1000)
AUTHORS Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K.,
Yamazoe,Y. and Kato,R.
TITLE cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a
strong homology in amino acid sequence with the senescence marker
protein SMP-2 in rat livers
JOURNAL Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 38 892 hydroxysteroid sulfotransferase a (STa)
mRNA < 1 1000 STa mRNA
signal 977 982 poly-A signal
BASE COUNT 299 a 187 c 211 g 303 t
ORIGIN
1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga
61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa
121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg
181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt
241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa
301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct
361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc
421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta
481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc
541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa
601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga
661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc
721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg
781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt
841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa
901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa
961 gtgaatgtgg tcattgaata aattctattc tggcattgtg
//
LOCUS SMFPOLENV 3534 bp ss-RNA VRL 26-JUL-1990
DEFINITION Simian foamy virus type 1 polymerase (pol) gene, 3' end; and
envelope (env) gene, complete cds.
ACCESSION M33561
KEYWORDS envelope protein; polymerase.
SOURCE Simian foamy virus type 1, cDNA to viral RNA.
ORGANISM Simian foamy virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Spumavirinae.
REFERENCE 1 (bases 1 to 3534)
AUTHORS Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A.
TITLE Relationship of the env genes and the endonuclease domain of the
pol genes of simian foamy virus type 1 and human foamy virus
JOURNAL J. Virol. 64, 406-410 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 582 polymerase (AA at 1)
pept 524 3481 envelope protein
BASE COUNT 1164 a 648 c 674 g 1048 t
ORIGIN
1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac
61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta
121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat
181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta
241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc
301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg
361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc
421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt
481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg
541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac
601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag
661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg
721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat
781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt
841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt
901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt
961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt
1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt
1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt
1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca
1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac
1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca
1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa
1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa
1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga
1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg
1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc
1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt
1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca
1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa
1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg
1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt
1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa
1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc
2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac
2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa
2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag
2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg
2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt
2341 atatttactc cgggatcatg tggtaaccct gatggaagct gcccttcatg atgtttcgat
2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat
2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt
2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta
2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga
2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt
2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa
2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt
2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg
2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg
2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt
3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc
3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt
3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa
3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg
3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc
3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg
3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact
3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg
3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat
//
LOCUS MUSC5DPROA 5401 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Mouse complement component C5D (pro-C5D) mRNA, complete cds.
ACCESSION M35526 J05234
KEYWORDS complement component C5D.
SOURCE Mouse (strain B10.D2/oSnJ) liver, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5401)
AUTHORS Wetsel,R.A., Fleischer,D.T. and Haviland,D.L.
TITLE Deficiency of the murine fifth complement component (C5): A 2- base
pair gene deletion in a 5'-exon
JOURNAL J. Biol. Chem. 265, 2435-2440 (1990)
STANDARD full staff_review
COMMENT This coding sequence is translated into a truncated protein of the
fifth complement component C5, for the C5S sequence see accession #
M35525.
FEATURES from to/span description
pept 14 664 complement component C5D
BASE COUNT 1636 a 1212 c 1221 g 1332 t
ORIGIN Chromosome 2.
1 gccgctacca gccatgggtc tttggggaat actttgtctt ttaattttcc tggacaaaac
61 ttggggacag gaacaaacct acgtcatttc agcacccaaa atcctccggg tcggctcgtc
121 tgaaaatgtg gtaattcaag tccatggcta cactgaagca tttgatgcaa ctctttctct
181 aaaaagctat cctgacaaaa aagtcacctt ctcttcaggc tatgttaatt tgtccccgga
241 aaacaaattc caaaacgcgg cactgttgac actacagccc aatcaagttc ctagagaaga
301 aagcccagtc tctcacgtgt atctggaagt tgtgtcaaaa cacttttcaa aatcaaagaa
361 aataccaatt acctataaca atggaattct cttcatccat acagacaaac ctgtttacac
421 gccggaccag tcagtaaaga tcagagtcta ttctctgggt gacgacttga agccagccaa
481 acgggagact gtcttaactt tcatagaccc cgaaggatca gaagttgaca ttgtagaaga
541 aaatgattac accggaatta tctcttttcc tgacttcaag attccatcta atcccaagta
601 tggtgtttgg acaattaaag ctaactataa gaaggatttt acaacaactg gaactgcact
661 ttgaaattaa agaatatgtc ttgccacgat tctctgtttc aatagaacta gaaagaacct
721 tcattggcta taaaaacttt aagaactttg aaatcactgt gaaagcaaga tatttttata
781 ataaagtggt acctgatgct gaagtgtatg ccttttttgg attgagagag gacataaaag
841 atgaggagaa gcagatgatg cacaaagcca cacaagccgc aaagttggtt gacggagttg
901 ctcagatctc ttttgattct gaaacagcag ttaaagagct gtcctacaac agtctagaag
961 acttaaacaa caagtacctt tatattgcag taacagtcac agaatcttca ggtggatttt
1021 cagaagaggc agaaatccct ggagtcaaat atgtcctctc tccctacaca ctgaatttgg
1081 tcgctactcc tcttttcgtg aagcccggga ttccattttc catcaaggca caggttaaag
1141 attcactcga gcaggcggta ggaggggtcc cagtaactct gatggcacaa acagtcgatg
1201 tgaatcaaga gacatctgac ttggaaacaa agaggagcat cactcacgac actgatggag
1261 tagctgtgtt tgtgctgaac ctcccatcaa acgtgacggt gctaaagttt gagatcagaa
1321 ctgatgaccc agaacttccc gaagaaaatc aagccagcaa agagtacgaa gcagttgcgt
1381 actcgtctct cagccaaagt tacatttaca tcgcttggac tgaaaactac aagcccatgc
1441 ttgtgggaga atacctgaat attatggtta cccccaagag cccatatatc gacaaaataa
1501 ctcactataa ttacttgatt ttatccaaag gcaaaattgt acagtacggc acaagagaga
1561 aacttttctc ctcaacttat caaaatataa atattccagt gacacagaac atggttcctt
1621 cagcacgact cctggtctat tacatagtca caggggagca aacagcagaa ttagtggctg
1681 acgcagtctg gataaatatt gaggagaagt gtggcaacca gctccaggtc catctgtctc
1741 cagatgaata tgtgtattct ccaggccaaa ctgtgtccct tgacatggtg actgaagcag
1801 actcatgggt agcactatca gcagtggaca gagctgtgta taaagtccag ggaaacgcca
1861 aaagggccat gcaaagagtc tttcgagctt tggatgaaaa gagtgacctg ggctgtgggg
1921 caggtggtgg ccatgacaat gcagatgtat tccatctagc tgggctcacc ttcctcacca
1981 acgcaaacgc agatgactcc cattatcgtg atgactcttg taaagaaatt ctcaggtcaa
2041 agagaaatct gcatctccta aggcagaaaa tagaagaaca agctgctaag tacaaacata
2101 gtgtgctaaa gaaatgctgc tatgacggag cccgagtgaa cttctatgaa acctgtgagg
2161 agcgagtggc ccgggttacc ataggccctc tctgcatcag ggccttcaac gagtgctgta
2221 ctattgcgaa caagatccga aaagaaagcc cccataaacc tgtccaactg ggaaggatcc
2281 acattaagac cctgttacca gtgatgaagg cagatatccg aagctacttt ccagagagct
2341 ggctatggga aattcatcgc gttcccaaaa gaaaacagct gcaggtcacg ctgcctgact
2401 cactaacgac ttgggaaatt caaggcattg gcatttcaga caatggtata tgtgttgctg
2461 atacactcaa ggcaaaggtg ttcaaagaag tcttcctgga gatgaacata ccatattctg
2521 ttgtgcgagg agaacagatc caattgaaag gaactgttta caactatatg acctcaggga
2581 caaagttctg tgttaaaatg tctgctgtgg agggaatctg cacttcggga agctcagctg
2641 ctagccttca cacctccagg ccctccagat gtgtgttcca gaggatagag ggctcgtcca
2701 gtcacttggt gaccttcacc ctgcttcctc tggaaattgg ccttcactcc ataaacttct
2761 cactagagac ctcatttggg aaagacatct tagtaaagac attacgggta gtgccagaag
2821 gagtcaagag ggaaagctat gccggcgtga ttctggaccc taagggaatt cgtggtattg
2881 ttaacagacg aaaggaattc ccatacagga tcccattaga tttggtcccc aagaccaaag
2941 ttgaaaggat tttgagtgtc aaaggactgc ttgtagggga gttcttgtcc acggttctga
3001 gtaaggaagg catcgacatc ctaacccacc tccccaaggg cagtgcagag gcagagctca
3061 tgagcatagc tccggtgttc tatgttttcc actacctgga agcaggaaac cattggaata
3121 ttttctatcc tgatacactg agtaaaagac agagcctgga gaaaaaaata aaacaagggg
3181 tggtgagcgt catgtcctac agaaacgctg actattccta cagcatgtgg aagggggcga
3241 gcgctagtac ctggctgaca gcttttgctc tgagagtgct tggacaggtg gccaagtatg
3301 taaaacagga tgaaaactca atttgtaact ctttgctatg gctggttgag aagtgtcagc
3361 tggaaaacgg ctctttcaag gaaaattccc aatatctacc aataaaatta cagggtactt
3421 tgcctgctga agcccaagag aaaactttgt atcttacagc cttttctgtg attggaatta
3481 gaaaggcagt tgacatatgc cccaccatga aaatccacac agcgctagat aaagccgact
3541 ccttcctgct tgaaaacacc ctgccatcca agagcacctt cacactggcc attgtagcct
3601 atgctctttc cctaggagac agaacccacc cgaggtttcg tctaattgtg tcggccctga
3661 ggaaggaagc ttttgttaaa ggtgatccgc ccatttaccg ttactggaga gataccctca
3721 aacgtccaga cagctctgtg cccagcagcg gcacagcagg tatggttgaa accacagcct
3781 atgctttgct cgccagcctg aaactgaagg atatgaatta cgccaacccc atcatcaagt
3841 ggctatctga agagcagagg tatggaggcg gcttttattc cacccaggat acgattaatg
3901 ccatcgaggg cctgacagaa tattcactcc tgttaaaaca aattcatttg gatatggaca
3961 tcaatgtcgc ctacaaacac gaaggtgact tccacaagta taaggtgaca gagaagcatt
4021 tcctggggag gccagtggag gtatctctca atgatgacct tgttgtcagc acaggctaca
4081 gcagtggctt ggccacagta tatgtaaaaa ctgtggttca caaaattagt gtctctgagg
4141 aattttgcag cttttacttg aaaattgata cccaagatat tgaagcatcc agccacttca
4201 ggctcagtga ctctggattc aagcgcataa tagcatgtgc cagctacaag cccagcaagg
4261 aggagtcaac atccgggtcc tcccatgcag taatggatat atcactgccg actggaatcg
4321 gagcaaacga ggaagattta cgggctcttg tggaaggagt ggatcaacta ctaactgatt
4381 accagatcaa agatggccat gtcattctgc aactgaattc gatcccctcc agagatttcc
4441 tctgtgtccg gttccggata tttgaacttt tccaagttgg gtttctgaat cctgctacct
4501 tcacggtgta cgagtatcac agaccagata agcagtgcac catgatttat agcatttctg
4561 acaccaggct tcagaaagtc tgtgaaggag cagcttgcac atgtgtggaa gctgactgtg
4621 cgcaactgca ggcagaagtg gacctagcca tctctgcaga ctccagaaaa gagaaagcct
4681 gtaaaccaga gactgcatat gcttataaag tcaggatcac atcagccact gaagaaaatg
4741 tttttgtcaa gtacactgcg actcttctgg tcacttacaa aacaggggaa gctgctgatg
4801 agaattcgga ggtcaccttc attaaaaaga tgagctgtac caatgccaac ctggtgaaag
4861 ggaagcagta tttaatcatg ggcaaagagg ttctgcagat caaacacaat ttcagtttca
4921 agtatatata ccctctagat tcctccacct ggattgaata ttggcccaca gacacaacgt
4981 gtccatcctg tcaagcattt gtagagaatt tgaataactt tgctgaagac ctctttttaa
5041 acagctgtga atgaaaagtt ctgctgcacg aagattcctc ctgcggcggg gggatttctc
5101 ctcctctggc ttggaaacct agcctagaat cagatacact ttctttagag taaagcacaa
5161 gctgatgagt tacgactttg tgaaatggat agccttgagg ggaggcgaaa acaggtcccc
5221 caaggctatc agacgtcagt gccaatagac tgaaacaagt ctgtaaagtt agcagtcagg
5281 ggtgttggtt ggggccggaa gaagagaccc actgaaactg tagcccctta tcaaaacata
5341 tccttgcttg aaagaaaaat accaaggaca gaaaatgcca taaaatcttg actttgcact
5401 c
//
LOCUS MUSC5PRO 5403 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Mouse complement component C5S (pro-C5) mRNA, complete cds.
ACCESSION M35525 M15079 J05234
KEYWORDS clotting factor; complement component C5; complement protein.
SOURCE Mouse (strain B10.D2/nSnJ) liver, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 141 to 5403)
AUTHORS Wetsel,R.A., Ogata,R.T. and Tack,B.F.
TITLE Primary structure of the fifth component of murine complement
JOURNAL Biochemistry 26, 737-743 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 5403)
AUTHORS Wetsel,R.A., Fleischer,D.T. and Haviland,D.L.
TITLE Deficiency of the murine fifth complement component (C5): A 2- base
pair gene deletion in a 5'-exon
JOURNAL J. Biol. Chem. 265, 2435-2440 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Noack, 02-APR-1987; for [2] by R.A.Wetsel, 01-DEC-1989.
For the C5D sequence see accession # M35526.
FEATURES from to/span description
pept 14 5056 complement component C5S precursor
sigp 14 67 complement component C5S signal peptide
matp 68 2035 complement component pro-C5S beta-chain
matp 2048 5053 complement component pro-C5S alpha-chain
mut 659 660 2 bp deletion in C5D
BASE COUNT 1640 a 1212 c 1219 g 1332 t
ORIGIN Chromosome 2.
1 gccgctacca gccatgggtc tttggggaat actttgtctt ttaattttcc tggacaaaac
61 ttggggacag gaacaaacct acgtcatttc agcacccaaa atcctccggg tcggctcgtc
121 tgaaaatgtg gtaattcaag tccatggcta cactgaagca tttgatgcaa ctctttctct
181 aaaaagctat cctgacaaaa aagtcacctt ctcttcaggc tatgttaatt tgtccccgga
241 aaacaaattc caaaacgcgg cactgttgac actacagccc aatcaagttc ctagagaaga
301 aagcccagtc tctcacgtgt atctggaagt tgtgtcaaaa cacttttcaa aatcaaagaa
361 aataccaatt acctataaca atggaattct cttcatccat acagacaaac ctgtttacac
421 gccggaccag tcagtaaaga tcagagtcta ttctctgggt gacgacttga agccagccaa
481 acgggagact gtcttaactt tcatagaccc cgaaggatca gaagttgaca ttgtagaaga
541 aaatgattac accggaatta tctcttttcc tgacttcaag attccatcta atcccaagta
601 tggtgtttgg acaattaaag ctaactataa gaaggatttt acaacaactg gaactgcata
661 ctttgaaatt aaagaatatg tcttgccacg attctctgtt tcaatagaac tagaaagaac
721 cttcattggc tataaaaact ttaagaactt tgaaatcact gtgaaagcaa gatattttta
781 taataaagtg gtacctgatg ctgaagtgta tgcctttttt ggattgagag aggacataaa
841 agatgaggag aagcagatga tgcacaaagc cacacaagcc gcaaagttgg ttgacggagt
901 tgctcagatc tcttttgatt ctgaaacagc agttaaagag ctgtcctaca acagtctaga
961 agacttaaac aacaagtacc tttatattgc agtaacagtc acagaatctt caggtggatt
1021 ttcagaagag gcagaaatcc ctggagtcaa atatgtcctc tctccctaca cactgaattt
1081 ggtcgctact cctcttttcg tgaagcccgg gattccattt tccatcaagg cacaggttaa
1141 agattcactc gagcaggcgg taggaggggt cccagtaact ctgatggcac aaacagtcga
1201 tgtgaatcaa gagacatctg acttggaaac aaagaggagc atcactcatg acactgatgg
1261 agtagctgtg tttgtgctga acctcccatc aaatgtgacg gtgctaaagt ttgagatcag
1321 aactgatgac ccagaacttc ccgaagaaaa tcaagccagc aaagagtacg aagcagttgc
1381 gtactcgtct ctcagccaaa gttacattta catcgcttgg actgaaaact acaagcccat
1441 gcttgtggga gaatacctga atattatggt tacccccaag agcccatata tcgacaaaat
1501 aactcactat aattacttga ttttatccaa aggcaaaatt gtacagtacg gcacaagaga
1561 gaaacttttc tcctcaactt atcaaaatat aaatattcca gtgacacaga acatggttcc
1621 ttcagcacga ctcctggtct attacatagt cacaggggag caaacagcag aattagtggc
1681 tgacgcagtc tggataaata ttgaggagaa gtgtggcaac cagctccagg tccatctgtc
1741 tccagatgaa tatgtgtatt ctccaggcca aactgtgtcc cttgacatgg tgactgaagc
1801 agactcatgg gtagcactat cagcagtgga cagagctgtg tataaagtcc agggaaacgc
1861 caaaagggcc atgcaaagag tctttcaagc tttggatgaa aagagtgacc tgggctgtgg
1921 ggcaggtggt ggccatgaca atgcagatgt attccatcta gctgggctca ccttcctcac
1981 caacgcaaac gcagatgact cccattatcg tgatgactct tgtaaagaaa ttctcaggtc
2041 aaagagaaac ctgcatctcc taaggcagaa aatagaagaa caagctgcta agtacaaaca
2101 tagtgtgcca aagaaatgct gctatgacgg agcccgagtg aacttctacg aaacctgtga
2161 ggagcgagtg gcccgggtta ccataggccc tctctgcatc agggccttca acgagtgctg
2221 tactattgcg aacaagatcc gaaaagaaag cccccataaa cctgtccaac tgggaaggat
2281 ccacattaag accctgttac cagtgatgaa ggcagatatc cgaagctact ttccagagag
2341 ctggctatgg gaaattcatc gcgttcccaa aagaaaacag ctgcaggtca cgctgcctga
2401 ctcactaacg acttgggaaa ttcaaggcat tggcatttca gacaatggta tatgtgttgc
2461 tgatacactc aaggcaaagg tgttcaaaga agtcttcctg gagatgaaca taccatattc
2521 tgttgtgcga ggagaacaga tccaattgaa aggaactgtt tacaactata tgacctcagg
2581 gacaaagttc tgtgttaaaa tgtctgctgt ggaggggatc tgcacttcag gaagctcagc
2641 tgctagcctt cacacctcca ggccctccag atgtgtgttc cagaggatag agggctcgtc
2701 cagtcacttg gtgaccttca ccctgcttcc tctggaaatt ggccttcact ccataaactt
2761 ctcactagag acctcatttg ggaaagacat cttagtaaag acattacggg tagtgccaga
2821 aggagtcaag agggaaagct atgccggcgt gattctggac cctaagggaa ttcgtggtat
2881 tgttaacaga cgaaaggaat tcccatacag gatcccatta gatttggtcc ccaagaccaa
2941 agttgaaagg attttgagtg tcaaaggact gcttgtaggg gagttcttgt ccacggttct
3001 gagtaaggaa ggcatcaaca tcctaaccca cctccccaag ggcagtgcag aggcagagct
3061 catgagcata gctccggtgt tctatgtttt ccactacctg gaagcaggaa accattggaa
3121 tattttctat cctgatacac tgagtaaaag acagagcctg gagaaaaaaa taaaacaagg
3181 ggtggtgagc gtcatgtcct acagaaacgc tgactattcc tacagcatgt ggaagggggc
3241 gagcgctagt acctggctga cagcttttgc tctgagagtg cttggacagg tggccaagta
3301 tgtaaaacag gatgaaaact caatttgtaa ctctttgcta tggctggttg agaagtgtca
3361 gctggaaaac ggctctttca aggaaaattc ccaatatcta ccaataaaat tacagggtac
3421 tttgcctgct gaagcccaag agaaaacttt gtatcttaca gccttttctg tgattggaat
3481 tagaaaggca gttgacatat gccccaccat gaaaatccac acagcgctag ataaagccga
3541 ctccttcctg cttgaaaaca ccctgccatc caagagcacc ttcacactgg ccattgtagc
3601 ctatgctctt tccctaggag acagaaccca cccgaggttt cgtctaattg tgtcggccct
3661 gaggaaggaa gcttttgtta aaggtgatcc gcccatttac cgttactgga gagataccct
3721 caaacgtcca gacagctctg tgcccagcag cggcacagca ggtatggttg aaaccacagc
3781 ctatgctttg ctcgccagcc tgaaactgaa ggatatgaat tacgccaacc ccatcatcaa
3841 gtggctatct gaagagcaga ggtatggagg cggcttttat tccacccagg atacgattaa
3901 tgccatcgag ggcctgacag aatattcact cctgttaaaa caaattcatt tggatatgga
3961 catcaatgtc gcctacaaac acgaaggtga cttccacaag tataaggtga cagagaagca
4021 tttcctgggg aggccagtgg aggtatctct caatgatgac cttgttgtca gcacaggcta
4081 cagcagtggc ttggccacag tatatgtaaa aactgtggtt cacaaaatta gtgtctctga
4141 ggaattttgc agcttttact tgaaaattga tacccaagat attgaagcat ccagccactt
4201 caggctcagt gactctggat tcaagcgcat aatagcatgt gccagctaca agcccagcaa
4261 ggaggagtca acatccgggt cctcccatgc agtaatggat atatcactgc cgactggaat
4321 cggagcaaac gaggaagatt tacgggctct tgtggaagga gtggatcaac tactaactga
4381 ttaccagatc aaagatggcc atgtcattct gcaactgaat tcgatcccct ccagagattt
4441 cctctgtgtc cggttccgga tatttgaact tttccaagtt gggtttctga atcctgctac
4501 cttcacggtg tacgagtatc acagaccaga taagcagtgc accatgattt atagcatttc
4561 tgacaccagg cttcagaaag tctgtgaagg agcagcttgc acatgtgtgg aagctgactg
4621 tgcgcaactg caggcagaag tagacctagc catctctgca gactccagaa aagagaaagc
4681 ctgtaaacca gagactgcat atgcttataa agtcaggatc acatcagcca ctgaagaaaa
4741 tgtttttgtc aagtacactg cgactcttct ggtcacttac aaaacagggg aagctgctga
4801 tgagaattcg gaggtcacct tcattaaaaa gatgagctgt accaatgcca acctggtgaa
4861 agggaagcag tatttaatca tgggcaaaga ggttctgcag atcaaacaca atttcagttt
4921 caagtatata taccctctag attcctccac ctggattgaa tattggccca cagacacaac
4981 gtgtccatcc tgtcaagcat ttgtagagaa tttgaataac tttgctgaag acctcttttt
5041 aaacagctgt gaatgaaaag ttctgctgca cgaagattcc tcctgcggcg gggggattgc
5101 tcctcctctg gcttggaaac ctagcctaga atcagataca ctttctttag agtaaagcac
5161 aagctgatga gttacgactt tgtgaaatgg atagccttga ggggaggcga aaacaggtcc
5221 cccaaggcta tcagatgtca gtgccaatag actgaaacaa gtctgtaaag ttagcagtca
5281 ggggtgttgg ttggggccgg aagaagagac ccactgaaac tgtagcccct tatcaaaaca
5341 tatccttgct tgaaagaaaa ataccaagga cagaaaatgc cataaaatct tgactttgca
5401 ctc
//
LOCUS HUMENN 1592 bp ss-mRNA PRI 26-JUL-1990
DEFINITION Human endonexin II mRNA, complete cds.
ACCESSION J03745
KEYWORDS Ca2+ -dependent phospholipid binding protein; endonexin.
SOURCE Human placenta, cDNA to mRNA, (library of Clonetech Laboratories
Inc.).
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1592)
AUTHORS Kaplan,R., Jaye,M., Burgess,W.H., Schlaepfer,D.D. and Haigler,H.T.
TITLE Cloning and expression of cDNA for human endonexin II, a Ca2+ and
phospholipid binding protein
JOURNAL J. Biol. Chem. 263, 8037-8043 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by H.T.Haigler, 06-APR-1988
FEATURES from to/span description
pept 160 1122 endonexin II /hgml_locus_uid="LS0217S"
/nomgen="ENX2" /map="4q28-q32"
mRNA < 1 1592 endonexin II mRNA
BASE COUNT 434 a 337 c 366 g 455 t
ORIGIN 284 bp upstream of HincII site.
1 ttggatcagt ctaggtgcag ctgccggatc cttcagcgtc tgcatctcgg cgtcgcccgc
61 gtaccgtcgc ccggctctcc gccgctctcc cggggtttcg gggcacttgg gtcccacagt
121 ctggtcctgc ttcaccttcc cctgacctga gtagtcgcca tggcacaggt tctcagaggc
181 actgtgactg acttccctgg atttgatgag cgggctgatg cagaaactct tcggaaggct
241 atgaaaggct tgggcacaga tgaggagagc atcctgactc tgttgacatc ccgaagtaat
301 gctcagcgcc aggaaatctc tgcagctttt aagactctgt ttggcaggga tcttctggat
361 gacctgaaat cagaactaac tggaaaattt gaaaaattaa ttgtggctct gatgaaaccc
421 tctcggcttt atgatgctta tgaactgaaa catgccttga agggagctgg aacaaatgaa
481 aaagtactga cagaaattat tgcttcaagg acacctgaag aactgagagc catcaaacaa
541 gtttatgaag aagaatatgg ctcaagcctg gaagatgacg tggtggggga cacttcaggg
601 tactaccagc ggatgttggt ggttctcctt caggctaaca gagaccctga tgctggaatt
661 gatgaagctc aagttgaaca agatgctcag gctttatttc aggctggaga acttaaatgg
721 gggacagatg aagaaaagtt tatcaccatc tttggaacac gaagtgtgtc tcatttgaga
781 aaggtgtttg acaagtacat gactatatca ggatttcaaa ttgaggaaac cattgaccgc
841 gagacttctg gcaatttaga gcaactactc cttgctgttg tgaaatctat tcgaagtata
901 cctgcctacc ttgcagagac cctctattat gctatgaagg gagctgggac agatgatcat
961 accctcatca gagtcatggt ttccaggagt gagattgatc tgtttaacat caggaaggag
1021 tttaggaaga attttgccac ctctctttat tccatgatta agggagatac atctggggac
1081 tataagaaag ctcttctgct gctctgtgga gaagatgact aacgtgtcac ggggaagagc
1141 tccctgctgt gtgcctgcac caccccactg ccttccttca gcacctttag ctgcatttgt
1201 atgccagtgc ttaacacatt gccttattca tactagcatg ctcatgacca acacatacac
1261 gtcatagaat gaaaatagtg gtgcttcttt ctgatctcta gtggagatct ctttgactgc
1321 tgtagtacta aagtgtactt aatgttacta agtttaatgc ctggccattt tccatttata
1381 tatatttttt aagaggctag agtgctttta gcctttttta aaaactccat ttatattaca
1441 tttgtaacca tgatacttta atcagaagct tagccttgaa attgtgaact cttggaaatg
1501 ttattagtga agttcgcaac taaactaaac ctgtaaaatt atgatgattg tattcaaaag
1561 attaatgaaa aataaacatt tctgtccccc tg
//
LOCUS CPAFPRFA 1277 bp ds-DNA ORG 26-JUL-1990
DEFINITION C.paradoxa cyanelle ferredoxin (petF) and ribosomal protein S10
(rps10; rpsJ) genes, complete cds, and elongation factor Tu (tufA)
gene, 5' end.
ACCESSION M35206
KEYWORDS elongation factor Tu; ferredoxin; ribosomal protein S10.
SOURCE C.paradoxa (isolate UTEX LB555) cyanelle DNA, clone pCpcGP1.3.
ORGANISM Cyanelle Cyanophora paradoxa
Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae;
Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa.
REFERENCE 1 (bases 1 to 1277)
AUTHORS Bryant,D.A., Schluchter,W.M. and Stirewalt,V.L.
TITLE Ferredoxin and ribosomal protein S10 are encoded on the cyanelle
genome of Cyanophora paradoxa
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.A.Bryant, 14-JUN-1990.
Author address: D.A.Bryant
s-101 Frear Bldg.
Dept. of Mol. and Cell Biol.
Pennsylvania State University
University Park, PA 16802
email: DAB14@PSUVM
FEATURES from to/span description
pept 338 637 ferredoxin (petF)
pept 1027 710 (c) ribosomal protein S10 (rps10; rpsJ)
pept > 1277 1114 (c) elongation factor Tu (tufA; AA at 1275)
binding 313 318 ribosome binding site
rpt 643 694 inverted repeat
rpt 1065 1097 inverted repeat
BASE COUNT 477 a 170 c 163 g 467 t
ORIGIN
1 agatcttatc taagatatgt aaataaataa aaatatatat ctatatttat agtatatatt
61 aatttttttt aaaaatcgat actaaattta aattttcctt ttttttcttt ataaaaattt
121 aattttaaat agaaaaaatt aagtttttcg aaaaaagcaa ttaaaacata ttaaaaaaaa
181 attaataaac atggtaaact ttaaatataa atttataatt aactgaaaaa ataataaaaa
241 taaatttata tatatatata ttttagatta aaataattta aattaaatta ttaaaagttc
301 taccttgtaa ctataattat ttaggagata gtattttatg gcagtatata aagttcgtct
361 tatttgtgaa gaacaaggtt tagataccac tattgaatgt ccagatgatg agtacattct
421 tgatgcagca gaagaacaag gtattgattt accatactcc tgtcgtgcag gtgcatgttc
481 tacttgtgca ggtaaagtgg tagaaggaac tgtagatcaa tctgatcaat ctttcttaga
541 tgacgctcaa ttagcagctg gttatgtatt aacttgtgta gcatacccat cttctgactg
601 tacagttaaa actcaccaag aagaatctct ttactaaaaa ataaaaaatc taaataataa
661 aatagaaatc tctattttat tatttagatt ttcttaattc aaaaaaaaac taaagtttaa
721 cttccacatc aacacctgct ggtaaatcta aacgagttaa agtatcaatt gttttggaag
781 atggtaaata taaatcaatt attctgcgat gaactctaat ttcgaaatgt tctcgtgaat
841 ctttatctac atgtggggaa cgtaaaacgc aataaatttt cttttttgtt ggtaaaggaa
901 taggtcctac tgcggtagca tcagttcgtt ttgcagcttc aataatttgt tcacatgagt
961 tttctaataa tgaagagtca taagaacgta gttgaatacg aatttttaat tgttgattac
1021 tggccataat ttttaatttt taatttttat tttttaaatt aaaaagagag aaataaatac
1081 attttctatt tctctctaaa atttagattt taattatttt aaaatcttag aaactacacc
1141 tgcaccaatt gtacgaccac cttcacgaat cgcgaaacgc ataccttgtt caatcgcaat
1201 tggatgtact aaacttactg tcattttaat acgatctcct ggcataacca tttctgcatt
1261 actaccatca tctgcag
//
LOCUS MUSIGHZSA 333 bp ds-DNA ROD 26-JUL-1990
DEFINITION Mouse Ig germline H-chain gene, D region.
ACCESSION M35332
KEYWORDS diversity exon; germline; immunoglobulin heavy chain.
SOURCE Mouse liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 333)
AUTHORS Landolfi,N.F., Capra,J.D. and Tucker,P.W.
TITLE Germ-line sequence of the D-H segment employed in Ars-A antibodies:
Implications for the generation of junctional diversity
JOURNAL J. Immunol. 137, 362-365 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept / 107 / 129 Ig heavy chain D region (AA at 107; 107 could
be 109)
iDNA < 1 106 V-D intervening DNA
iDNA 130 > 333 D-J intervening DNA
BASE COUNT 95 a 98 c 62 g 78 t
ORIGIN
1 tgacaactga aactcaaccg tgctgcctgg cccccaatgc tctctacacc tgcaaaacca
61 gagaccatac tggccagtgc tttttgtgaa gggatctact actgtgttta ttactatggt
121 ggtagctacc acagtgctat atccatcagc aaaaacccat tgtgcccagc agactcttga
181 gctcgaaaaa ctgagtctag aaaagctggc atcacggggt ttatatcccg agtcttgacc
241 actgacccat taatactatc caacacagag ctctccgtct gcccacaaag aaatccaacc
301 accctaaagt cagatcctct agagtcgacc tgc
//
LOCUS WHTREPTA 295 bp ds-DNA PLN 26-JUL-1990
DEFINITION T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD1.
ACCESSION M35329
KEYWORDS repetitive DNA.
SOURCE T.monococcum aegilopoides leaf DNA, clone pTbUCD1.
ORGANISM Triticum monococcum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 295)
AUTHORS Dvorak,J., McGuire,P.E. and Cassidy,B.
TITLE Apparent sources of the A genomes of wheats inferred from
polymorphism in abundance and restriction fragment length of
repeated nucleotide sequences
JOURNAL Genome 30, 680-689 (1988)
STANDARD simple staff_review
BASE COUNT 72 a 58 c 66 g 99 t
ORIGIN
1 tccagacttg ggtaacaggg tgtgccttag aatcccagtt gatagtgggc agtcctgaca
61 gaagatagtg cactgagcca aacttgaatg tgtcaagtgc ttcattcgga atctccttgt
121 acatgttgaa catagagttg tggtccatct ttttcttggc ataaatgtcc aagtcatctg
181 cttgctcctc tggggcattg atcattataa gtaatagtct tttcttcaac tttaataggt
241 gcagctactt ttacttctat gggaggatga tatttaaacc acttctcctt gggga
//
LOCUS WHTREPTB 273 bp ds-DNA PLN 26-JUL-1990
DEFINITION T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD2.
ACCESSION M35330
KEYWORDS repetitive DNA.
SOURCE T.monococcum aegilopoides leaf DNA, clone pTbUCD2.
ORGANISM Triticum monococcum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 273)
AUTHORS Dvorak,J., McGuire,P.E. and Cassidy,B.
TITLE Apparent sources of the A genomes of wheats inferred from
polymorphism in abundance and restriction fragment length of
repeated nucleotide sequences
JOURNAL Genome 30, 680-689 (1988)
STANDARD simple staff_review
BASE COUNT 58 a 44 c 74 g 97 t
ORIGIN
1 ctggccatgg agggcctatg tagatagaca ggcttcgaga agcttctttc tttctagtgt
61 ctgtactcag accggttgct tccgcatgtg cttgtatgag tgtatgactt gagtgtcggg
121 tcatgtgacc cctatctgta tgaacatgtt atgtatggct ctctagagcc tttaaataaa
181 gtacttgagt tgtagagtat tgttgtgatg ccatgttgta tgtactcata tcgggcatat
241 tgtgtgtatg attgaaatgc ttggtatgag tgg
//
LOCUS WHTREPTC 229 bp ds-DNA PLN 26-JUL-1990
DEFINITION T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD3.
ACCESSION M35331
KEYWORDS repetitive DNA.
SOURCE T.monococcum aegilopoides leaf DNA, clone pTbUCD3.
ORGANISM Triticum monococcum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 229)
AUTHORS Dvorak,J., McGuire,P.E. and Cassidy,B.
TITLE Apparent sources of the A genomes of wheats inferred from
polymorphism in abundance and restriction fragment length of
repeated nucleotide sequences
JOURNAL Genome 30, 680-689 (1988)
STANDARD simple staff_review
BASE COUNT 85 a 55 c 37 g 52 t
ORIGIN
1 caaattagct actccagtat gtaaaaacct gtttgtccaa cacttagcag atttcactct
61 tgatagatca ctagcaatag ctcccgcaaa atcgcaaaag agttcatgat ctgcccaaaa
121 caacaactat gcaaaagttg agctcgattg agtcaaccta gggtgctcca acataacaag
181 taaagacatg gatggattaa gcacaacaag catgacaaac cactcttac
//
LOCUS RATMTXXX 169 bp ds-DNA ORG 26-JUL-1990
DEFINITION Rat mitochondrial HindIII fragment.
ACCESSION M35251
KEYWORDS .
SOURCE Rat mitochondrial DNA.
ORGANISM Mitochondrion Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae;
Rattus norvegicus.
REFERENCE 1 (bases 1 to 169)
AUTHORS Brown,G.G., Castora,F.J., Frantz,S.C. and Simpson,M.V.
TITLE Mitochondrial DNA polymorphism: Evolutionary studies on the genus
Rattus
JOURNAL Ann. N.Y. Acad. Sci. 361, 135-153 (1981)
STANDARD simple staff_review
FEATURES from to/span description
ORF < 1 > 169 ORF
allele 56 56 a in type A; g in type B
allele 80 80 a in type A; g in type B
allele 122 122 t in type A; g in type B
BASE COUNT 44 a 24 c 43 g 58 t
ORIGIN
1 agcttgctaa tagtcatcat gttgctatca atggaaagat tatttgtaat cctcgagcta
61 taattatagt tcggctgtga attcgttcgt agttggtgtt tgctaggcag aataagagtg
121 atgaggttaa gccgtgggcg attattagta ttgtagctcc catgaagct
//
LOCUS MUSCRABP 868 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Mouse cellular retinoic acid-binding protein (CRABP-II) mRNA,
complete cds.
ACCESSION M35523
KEYWORDS cellular retinoic acid-binding protein.
SOURCE Mouse 12.5 day old embryo, cDNA to mRNA, clone lambda-mE2.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 868)
AUTHORS Giguere,V., Lyn,S., Yip,P., Siu,C.-H. and Amin,S.
TITLE Molecular cloning of a novel cellular retinoic acid-binding protein
expressed during mouse embryogenesis and in adult skin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by V.Giguere, 22-JUN-1990.
FEATURES from to/span description
pept 116 532 cellular retinoic acid-binding protein
BASE COUNT 226 a 218 c 220 g 204 t
ORIGIN Chromosome 2.
1 gaattccggg gaggatctgt tctgcaaagg agacagcaaa gtatctttag cctaaaggac
61 tcagcgtcca gtgttctagt tgaagatcta aagagaaagc caccttgctg ccactatgcc
121 taacttttct ggcaactgga agatcatccg atcggaaaac tttgaggaaa tgctaaaagc
181 tctgggggtg aacatgatga tgaggaagat cgctgtggct gcagcctcca agccagcagt
241 cgagatcaaa caggagaatg acactttcta catcaaaacc tccaccactg tgcgaaccac
301 ggagattaac ttcaagatcg gggaggaatt tgaggagcag accgtggatg ggagaccctg
361 taagagtttg gtgaaatggg agagtggaaa caaaatggtg tgcgagcaga ggcttctgaa
421 gggggagggc cccaagacct cctggagccg agaactgacc aatgatggag agctgatcct
481 gacaatgaca gcagatgacg ttgtgtgcac cagggtctac gtccgagagt gagtgcctac
541 gggtccaaga actgcctgag acgacttctg tgcccgctac aggacacaaa cctccctccc
601 acgtccatct tacaaactag ctctcccctt actcctgagg gttactgctt cctccaaggc
661 cttttgttct ttgccttctc tacgccagag aggggcagaa gctcagaacc ctcccaccgc
721 catttgcccc tcccaggtca gcagtcccag ctccatacca gggtccttcc tggaagagac
781 tgtctctctg gcctctactc cttatccttg tagtctgtgt gatttagaat atttattggt
841 taattttatt aaaatgtttc cggaattc
//
LOCUS BTHCRYIA 4320 bp ds-DNA BCT 26-JUL-1990
DEFINITION B.thuringiensis delta-endotoxin gene, complete cds.
ACCESSION M35524
KEYWORDS delta-endotoxin.
SOURCE B.thuringiensis kenyae (strain HD588) DNA.
ORGANISM Bacillus thuringiensis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 4320)
AUTHORS Von Tersch,M.A., Loidl,R.H., Jany,C.S. and Johnson,T.B.
TITLE Insecticidal toxin genes from Bacillus thuringiensis variety
kenyae: Cloning characterization and comparative studies
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by M.A.Von Tersch, 19-JUN-1990.
Author address: M.A.Von Tersch
Ecogen Inc.
2005 Cabot Blvd. West
Loanghorne PA 19047
FEATURES from to/span description
pept 239 3772 delta-endotoxin
binding 228 232 ribosome binding site
BASE COUNT 1392 a 724 c 909 g 1295 t
ORIGIN
1 gttaacggaa tacaaaccct taatgcattg gttaaacatt gtaaagtcta aagcatggat
61 aatgggcgag aagtaagtag attgttaaca ccctgggtca aaaattgata tttagtaaaa
121 ttagttgcac tttgtgcatt ttttcataag atgagtcata tgttttaaat tgtagtaatg
181 aaaaacagta ttatatcata atgaattggt atcttaataa aagagatgga ggtaacttat
241 ggataacaat ccgaacatca atgaatgcat tccttataat tgtttaagta accctgaagt
301 agaagtatta ggtggagaaa gaatagaaac tggttacacc ccaatcgata tttccttgtc
361 gctaacgcaa tttcttttga gtgaatttgt tcccggtgct ggatttgtgt taggactagt
421 tgatataata tggggaattt ttggtccctc tcaatgggac gcatttcttg tacaaattga
481 acagttaatt aaccaaagaa tagaagaatt cgctaggaac caagccattt ctagattaga
541 aggactaagc aatctttatc aaatttacgc agaatctttt agagagtggg aagcagatcc
601 tactaatcca gcattaagag aagagatgcg tattcaattc aatgacatga acagtgccct
661 tacaaccgct attcctcttt tggcagttca aaattatcaa gttcctcttt tatcagtata
721 tgttcaagct gcaaatttac atttatcagt tttgagagat gtttcagtgt ttggacaaag
781 gtggggattt gatgccgcga ctatcaatag tcgttataat gatttaacta ggcttattgg
841 caactataca gatcatgctg tacgctggta caatacggga ttagaacgtg tatggggacc
901 ggattctaga gattgggtaa ggtataatca atttagaaga gaattaacac taactgtatt
961 agatatcgtt gctctgttcc cgaattatga tagtagaaga tatccaattc gaacagtttc
1021 ccaattaaca agagaaattt atacaaaccc agtattagaa aattttgatg gtagttttcg
1081 aggctcggct cagggcatag aaagaagtat taggagtcca catttgatgg atatacttaa
1141 cagtataacc atctatacgg atgctcatag gggttattat tattggtcag ggcatcaaat
1201 aatggcttct cctgtcggtt tttcggggcc agaattcacg tttccgctat atggaaccat
1261 gggaaatgca gctccacaac aacgtattgt tgctcaacta ggtcagggcg tgtatagaac
1321 attatcctct actttttata gaagaccttt taatataggg ataaataatc aacaactatc
1381 tgttcttgac gggacagaat ttgcttatgg aacctcctca aatttgccat ccgctgtata
1441 cagaaaaagc ggaacggtag attcgctgga tgaaatacca ccacagaata acaacgtgcc
1501 acctaggcaa ggatttagtc atcgattaag ccatgtttca atgtttcgtt caggctctag
1561 tagtagtgta agtataataa gagctcctat gttctcttgg atacatcgta gtgctgaatt
1621 taataatata attgcatcgg atagtattac tcaaatccct gcagtgaagg gaaactttct
1681 ttttaatggt tctgtaattt caggaccagg atttactggt ggggacttag ttagattaaa
1741 tagtagtgga aataacattc agaatagagg gtatattgaa gttccaattc acttcccatc
1801 gacatctacc agatatcgag ttcgtgtacg gtatgcttct gtaaccccga ttcacctcaa
1861 cgttaattgg ggtaattcat ccattttttc caatacagta ccagctacag ctacgtcatt
1921 agataatcta caatcaagtg attttggtta ttttgaaagt gccaatgctt ttacatcttc
1981 attaggtaat atagtaggtg ttagaaattt tagtgggact gcaggagtga taatagacag
2041 atttgaattt attccagtta ctgcaacact cgaggctgaa tataatctgg aaagagcgca
2101 gaaggcggtg aatgcgctgt ttacgtctac aaaccaacta gggctaaaaa caaatgtaac
2161 ggattatcat attgatcaag tgtccaattt agttacgtgt ttatcggatg aattttgtct
2221 ggatgaaaag cgagaattgt ccgagaaagt caaacatgcg aagcgactca gtgatgaacg
2281 caatttactc caagattcaa atttcaaaga cattaatagg caaccagaac gtgggtgggg
2341 cggaagtaca gggattacca tccaaggagg ggatgacgta tttaaagaaa attacgtcac
2401 actatcaggt acctttgatg agtgctatcc aacatatttg tatcaaaaaa tcgatgaatc
2461 aaaattaaaa gcctttaccc gttatcaatt aagagggtat atcgaagata gtcaagactt
2521 agaaatctat ttaattcgct acaatgcaaa acatgaaaca gtaaatgtgc caggtacggg
2581 ttccttatgg ccgctttcag cccaaagtcc aatcggaaag tgtggagagc cgaatcgatt
2641 cgcgccacac cttgaatgga atcctgactt agattgttcg tgtagggatg gagaaaagtg
2701 tgcccatcat tcgcatcatt tctccttaga cattgatgta ggatgtacag acttaaatga
2761 ggacctaggt gtatgggtga tctttaagat taagacgcaa gatgggcacg caagactagg
2821 gaatctagag tttctcgaag agaaaccatt agtaggagaa gcgctagctc gtgtgaaaag
2881 agcggagaaa aaatggagag acaaacgtga aaaattggaa tgggaaacaa atatcgttta
2941 taaagaggca aaagaatctg tagatgcttt atttgtaaac tctcaatatg atcaattaca
3001 agcggatacg aatattgcca tgattcatgc ggcagataaa cgtgttcata gcattcgaga
3061 agcttatctg cctgagctgt ctgtgattcc gggtgtcaat gcggctattt ttgaagaatt
3121 agaagggcgt attttcactg cattctccct atatgatgcg agaaatgtca ttaaaaatgg
3181 tgattttaat aatggcttat cctgctggaa cgtgaaaggg catgtagatg tagaagaaca
3241 aaacaaccaa cgttcggtcc ttgttgttcc ggaatgggaa gcagaagtgt cacaagaagt
3301 tcgtgtctgt ccgggtcgtg gctatatcct tcgtgtcaca gcgtacaagg agggatatgg
3361 agaaggttgc gtaaccattc atgagatcga gaacaataca gacgaactga agtttagcaa
3421 ctgcgtagaa gaggaaatct atccaaataa cacggtaacg tgtaatgatt atactgtaaa
3481 tcaagaagaa tacggaggtg cgtacacttc tcgtaatcga ggatataacg aagctccttc
3541 cgtaccagct gattatgcgt cagtctatga agaaaaatcg tatacagatg gacgaagaga
3601 gaatccttgt gaatttaaca gagggtatag ggattacacg ccactaccag ttggttatgt
3661 gacaaaagaa ttagaatact tcccagaaac cgataaggta tggattgaga ttggagaaac
3721 ggaaggaaca tttatcgtgg acagcgtgga attactcctt atggaggaat agtctcatgc
3781 aaactcaggt ttaaatatcg ttttcaaatc aattgtccaa gagcagcatt acaaatagat
3841 aagtaatttg ttgtaatgaa aaacggacat cacctccatt gaaacggagt gatgtccgtt
3901 ttactatgtt attttctagt aatacatatg tatagagcaa cttaatcaag cagagatatt
3961 ttcacctatc gatgaaaata tctctgcttt ttcttttttt atttggtata tgctttactt
4021 gtaatcgaaa ataaagcact aatagggtgt ttttgcccat cccttcggga aatcaagact
4081 aaaatgaaaa ataaacagaa aatataaggc tcttactttg tggatatgac cacaaagtaa
4141 gagccttatt tcattaaatt tgttcataca tttttccttg tagtcttttg ttttcatcct
4201 ttaatcgcct attctcgtac tctacttcct tgattcgatc ccgtaataat tgaatcattg
4261 catctttatt ttcatcactc attttccgtt tttcgaattt tggagataca gctcgttgct
//
LOCUS HUMHBLOD 3373 bp ss-mRNA PRI 26-JUL-1990
DEFINITION Human GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase
mRNA, complete cds.
ACCESSION M35531
KEYWORDS GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase.
SOURCE Human epidermal carcinoma cell line A431, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3373)
AUTHORS Larsen,R.D., Ernst,L.K., Nair,R.P. and Lowe,J.B.
TITLE Molecular cloning, sequence and expression of a human GDP-L-fucose:
Beta-D-galactoside 2-alpha-l-fucosyltransferase cDNA that can be
from the H blood group antigen
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD ull staff_review staff_entry
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by J.B.Lowe, 22-JUN-1990.
FEATURES from to/span description
pept 104 1201 GDP-L-fucose:beta-D-galactoside
2-alpha-L-fucosyltransferase
site 1744 2385 Alu sequence homologue
BASE COUNT 687 a 925 c 905 g 856 t
ORIGIN
1 gcctggcgtt ccaggggcgg ccggatgtgg cctgcctttg cggagggtgc gctccggcca
61 cgaaaagcgg actgtggatc tgccacctgc aagcagctcg gccatgtggc tccggagcca
121 tcgtcagctc tgcctggcct tcctgctagt ctgtgtcctc tctgtaatct tcttcctcca
181 tatccatcaa gacagctttc cacatggcct aggcctgtcg atcctgtgtc cagaccgccg
241 cctggtgaca cccccagtgg ccatcttctg cctgccgggt actgcgatgg gccccaacgc
301 ctcctcttcc tgtccccagc accctgcttc cctctccggc acctggactg tctaccccaa
361 tggccggttt ggtaatcaga tgggacagta tgccacgctg ctggctctgg cccagctcaa
421 cggccgccgg gcctttatcc tgcctgccat gcatgccgcc ctggccccgg tattccgcat
481 caccctgccc gtgctggccc cagaagtgga cagccgcacg ccgtggcggg agctgcagct
541 tcacgactgg atgtcggagg agtacgcgga cttgagagat cctttcctga agctctctgg
601 cttcccctgc tcttggactt tcttccacca tctccgggaa cagatccgca gagagttcac
661 cctgcacgac caccttcggg aagaggcgca gagtgtgctg ggtcagctcc gcctgggccg
721 cacaggggac cgcccgcgca cctttgtcgg cgtccacgtg cgccgtgggg actatctgca
781 ggttatgcct cagcgctgga agggtgtggt gggcgacagc gcctacctcc ggcaggccat
841 ggactggttc cgggcacggc acgaagcccc cgttttcgtg gtcaccagca acggcatgga
901 gtggtgtaaa gaaaacatcg acacctccca gggcgatgtg acgtttgctg gcgatggaca
961 ggaggctaca ccgtggaaag actttgccct gctcacacag tgcaaccaca ccattatgac
1021 cattggcacc ttcggcttct gggctgccta cctggctggc ggagacactg tctacctggc
1081 caacttcacc ctgccagact ctgagttcct gaagatcttt aagccggagg cggccttcct
1141 gcccgagtgg gtgggcatta atgcagactt gtctccactc tggacattgg ctaagccttg
1201 agagccaggg agactttctg aagtagcctg atctttctag agccagcagt acgtggcttc
1261 agaggcctgg catcttctgg agaagcttgt ggtgttcctg aagcaaatgg gtgcccgtat
1321 ccagagtgat tctagttggg agagttggag agaaggggga cgtttctgga actgtctgaa
1381 tattctagaa ctagcaaaac atcttttcct gatggctggc aggcagttct agaagccaca
1441 gtgcccacct gctcttccca gcccatatct acagtacttc cagatggctg cccccaggaa
1501 tggggaactc tccctctggt ctactctaga agaggggtta cttctcccct gggtcctcca
1561 aagactgaag gagcatatga ttgctccaga gcaagcattc accaagtccc cttctgtgtt
1621 tctggagtga ttctagaggg agacttgttc tagagaggac caggtttgat gcctgtgaag
1681 aaccctgcag ggcccttatg gacaggatgg ggttctggaa atccagataa ctaaggtgaa
1741 gaatcttttt agtttttttt tttttttttt ggagacaggg tctcgctctg ttgcccaggc
1801 tggagtgcag tggcgtgatc ttggctcact gcaacttccg cctcctgtgt tcaagcgatt
1861 ctcctgtctc agcctcctga gtagatggga ctacaggcac aggccattat gcctggctaa
1921 tttttgtatt tttagtagag acagggtttc accatgttgg ccgggatggt ctcgatctcc
1981 tgaccttgtc atccacctgt cttggcctcc caaagtgctg ggattactgg catgagccac
2041 tgtgcccagc ccggatattt ttttttaatt atttatttat ttatttattt attgagacgg
2101 agtcttgctc tgtagcccag gccagagtgc agtggcgcga tctcagctca ctgcaagctc
2161 tgcctcccgg gttcatgcca ttctgcctca gcctcctgag tagctgggac tacaggcgcc
2221 cgccaccacg cccggctaat tttttttgta tttttagtag agacggggtt tcatcgtgtt
2281 aaccaggatg gtctcgatct cctgacctcg tgatctgccc acctcggcct cccacagtgc
2341 tgggattacc ggcgtgagcc accatgcctg gcccggataa ttttttttaa tttttgtaga
2401 gacgaggtct tgtgatattg cccaggctgt tcttcaactc ctgggctcaa gcagtcctcc
2461 caccttggcc tcccagaatg ctgggtttat agatgtgagc cagcacaccg ggccaagtga
2521 agaatctaat gaatgtgcaa cctaattgta gcatctaatg aatgttccac cattgctgga
2581 aaaattgaga tggaaaacaa accatctcta gttggccagc gtcttgctct gttcacagtc
2641 tctggaaaag ctggggtagt tggtgagcag agcgggactc tgtccaacaa gccccacagc
2701 ccctcaaaga cttttttttg tttgttttga gcagacaggc taaaatgtga acgtggggtg
2761 agggatcact gccaaaatgg tacagcttct ggagcagaac tttccaggga tccagggaca
2821 ctttttttta aagctcataa actgccaaga gctccatata ttgggtgtga gttcaggttg
2881 cctctcacaa tgaaggaagt tggtctttgt ctgcaggtgg gctgctgagg gtctgggatc
2941 tgttttctgg aagtgtgcag gtataaacac accctctgtg cttgtgacaa actggcaggt
3001 accgtgctca ttgctaacca ctgtctgtcc ctgaactccc agaaccacta catctggctt
3061 tgggcaggtc tgagataaaa cgatctaaag gtaggcagac cctggaccca gcctcagatc
3121 caggcaggag cacgaggtct ggccaaggtg gacggggttg tcgagatctc aggagcccct
3181 tgctgttttt tggagggtga aagaagaaac cttaaacata gtcagctctg atcacatccc
3241 ctgtctactc atccagaccc catgcctgta ggcttatcag ggagttacag ttacaattgt
3301 tacagtactg ttcccaactc agctgccacg ggtgagagag caggaggtat gaattaaaag
3361 tctacagcac taa
//
LOCUS MUSCRABPA 868 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Mouse cellular retinoic acid-binding protein (CRABP-II) mRNA,
complete cds.
ACCESSION M35523
KEYWORDS cellular retinoic acid-binding protein.
SOURCE Mouse 12.5 day old embryo, cDNA to mRNA, clone lambda-mE2.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 868)
AUTHORS Giguere,V., Lyn,S., Yip,P., Siu,C.-H. and Amin,S.
TITLE Molecular cloning of a novel cellular retinoic acid-binding protein
expressed during mouse embryogenesis and in adult skin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by V.Giguere, 22-JUN-1990.
FEATURES from to/span description
pept 116 532 cellular retinoic acid-binding protein
BASE COUNT 226 a 218 c 220 g 204 t
ORIGIN Chromosome 2.
1 gaattccggg gaggatctgt tctgcaaagg agacagcaaa gtatctttag cctaaaggac
61 tcagcgtcca gtgttctagt tgaagatcta aagagaaagc caccttgctg ccactatgcc
121 taacttttct ggcaactgga agatcatccg atcggaaaac tttgaggaaa tgctaaaagc
181 tctgggggtg aacatgatga tgaggaagat cgctgtggct gcagcctcca agccagcagt
241 cgagatcaaa caggagaatg acactttcta catcaaaacc tccaccactg tgcgaaccac
301 ggagattaac ttcaagatcg gggaggaatt tgaggagcag accgtggatg ggagaccctg
361 taagagtttg gtgaaatggg agagtggaaa caaaatggtg tgcgagcaga ggcttctgaa
421 gggggagggc cccaagacct cctggagccg agaactgacc aatgatggag agctgatcct
481 gacaatgaca gcagatgacg ttgtgtgcac cagggtctac gtccgagagt gagtgcctac
541 gggtccaaga actgcctgag acgacttctg tgcccgctac aggacacaaa cctccctccc
601 acgtccatct tacaaactag ctctcccctt actcctgagg gttactgctt cctccaaggc
661 cttttgttct ttgccttctc tacgccagag aggggcagaa gctcagaacc ctcccaccgc
721 catttgcccc tcccaggtca gcagtcccag ctccatacca gggtccttcc tggaagagac
781 tgtctctctg gcctctactc cttatccttg tagtctgtgt gatttagaat atttattggt
841 taattttatt aaaatgtttc cggaattc
//
LOCUS YSCGLN3 3021 bp ds-DNA PLN 26-JUL-1990
DEFINITION S.cerevisiae nitrogen regulatory protein (GLN3) gene, complete cds.
ACCESSION M35267
KEYWORDS nitrogen regulatory protein.
SOURCE S.cerevisiae (strain S288C) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3021)
AUTHORS Minehart,P.L. and Magasanik,B.
TITLE Sequence and expression of GLN3, a positive nitrogen regulatory
gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by P.Minehart, 19-JUN-1990.
Author address: P.Minehart
MIT, 56-428 MIT
77 Mass Ave.
Cambridge, MA 02139
FEATURES from to/span description
pept 730 2922 GLN3 protein
signal 509 514 TATA box
site 1 140 acidic activation region
BASE COUNT 1020 a 704 c 530 g 767 t
ORIGIN Chromosome VL, map position 43cm distal to GCN4.
1 gacgtcaact ccatagaagt gacttttccg ccaaagaaga ggacctcgcc ataagcaatg
61 agaatgatcg tcagattctt gaaaattgtg tagatgggca cggcaaggta ttgtaagctc
121 tttgacgacg tataaatcat caatacgagc agcaaagaaa ttggaaacca gttttttaca
181 tctgtcctgt tcaaagatca aaaattagca acgcctacaa ttcgtaggat acatagcgtc
241 acagtgcaca ccagtgattg tacaaacaac atcacaaagt tcatgttaaa gttgtccagg
301 ttaaccacga atttgttcgt tactgtcatc aaaatcgagg acgcgcagta agataagatt
361 gaagccggcc cagagttggc cactgattcc gtccattcat gcttatgctt gctcataatt
421 accacacctt cttgatctct ttacagcttt tcaaccttcc attcttgtac tctatctcta
481 cctggccctt taaacattct taatatgata tattcacatt ttttgctcta ttacccggcg
541 gacaggttcc cgaaagaaag tgacatggca atgctgagag agtggaaaga gtcatcttgc
601 aagacagaga aagatgttca agagtggtaa gctaatgtca gcgcagtagc ccatcccaca
661 ataacagagt gtgtaagaaa gagagacgag agagagcaca gggccccctt ttcccccacc
721 aacaaacaaa tgcaagacga ccccgaaaat tcgaagctgt acgacctgct gaatagtcat
781 ctggacgtgc atggtcgaag taatgaagag ccgagacaaa ctggtgacag taggagccag
841 agtagtggca acaccggtga aaacgaggag gatatagcat ttgccagtgg attaaacggc
901 ggcacattcg actcaatgct ggaggcactg cccgatgatt tatattttac ggacttcgtg
961 tctcctttta cagcagctgc cacgaccagc gtgactacta agacggtcaa ggacaccaca
1021 ccagctacca atcatatgga tgatgatatt gcgatgtttg attcacttgc cacaactcag
1081 cccatcgaca tagccgcatc caaccaacaa aatggtgaaa ttgcacaact ttgggacttt
1141 aacgtggacc aattcaacat gacgcccagc aactcgagcg gttcagctac tattagtgct
1201 cctaacagct ttacttccga cataccgcaa tacaaccacg gttccctcgg caacagcgtc
1261 tccaaatcct cactgttccc gtataattcc agcacgtcca acagcaacat caaccagcca
1321 tctatcaata acaactcaaa tactaatgcg cagtcccacc attccttcaa catctacaaa
1381 ctacaaaaca acaactcatc ttcatccgct atgaacatta ccaataataa taatagcaac
1441 aatagtaata tccagcatcc ttttctgaag aagagcgatt cgataggatt atcttcatcc
1501 aacacaacaa attctgtaag aaaaaactca cttatcaagc caatgtcgtc cacgtccctg
1561 gccaatttca aaagagctgc ctcagtatct tccagtatat ccaatatgga accatcagga
1621 caaaataaaa aacctctgat acaatgtttc aattgtaaaa ctttcaagac accgctttgg
1681 aggagaagcc cagaggggaa tactctttgc aatgcctgcg gtcttttcca gaaattacat
1741 ggtaccatga ggccattatc cttaaaatcg gacgttatca aaaagaggat ttcaaagaag
1801 agagccaaac aaacggaccc aaacattgca caaaatactc caagtgcacc tgcaactgcc
1861 tcaacttcag taaccactac aaatgctaaa cccatacgat cgaggaaaaa atcactacaa
1921 caaaactctt tatctagagt gatacctgaa gaaatcatta gagacaacat cggtaatact
1981 aataatatcc ttaatgtaaa taggggaggc tataacttca actcagtccc ctccccggtc
2041 ctcatgaaca gccaatcgta taatagtagt aacgcaaatt ttaatggagc aagcaatgca
2101 aatttgaatt ctaataactt aatgcgtcac aattcgaaca ctgttactgg taattttaga
2161 aggtcttcaa gacgaagtag tacttcatcg aacacctcaa gttccagtaa atcttcatcc
2221 agatctgttg ttccgatatt accaaaacct tcacctaata gcgctaattc acagcagttc
2281 aacatgaaca tgaacctaat gaacacaaca aataatgtaa gtgcaggaaa tagtgtcgca
2341 tcctcaccaa gaattatatc gtccgcaaac tttaactcaa atagtcctct acagcagaat
2401 ctattatcaa attctttcca acgtcaagga atgaatatac caagaagaaa gatgtcgcgc
2461 aatgcatcgt actcctcatc gtttatggct gcgtctttgc aacaactgca cgaacagcaa
2521 caagtggacg tgaattccaa cacaaacacg aattcgaata gacagaattg gaattcaagc
2581 aatagcgttt caacaaattc aagatcatca aattttgtct ctcaaaagcc aaattttgat
2641 atttttaata ctcctgtaga ttcaccgagt gtctcaagac cttcttcaag aaaatcacat
2701 acctcattgt tatcacaaca attgcagaac tcggagtcga attcgtttat ctcaaatcac
2761 aaatttaaca atagattatc aagtgactct acttcaccta taaaatatga agcagatgtg
2821 agtgcaggcg gaaagatcag tgaggataat tccacaaaag gatcttctaa agaaagttca
2881 gcaattgctg acgaattgga ttggttaaaa tttggtatat gaccgcgtat tatcattatc
2941 attattctta ttatgttaat aattactgaa cggttgcatt gatagatttt cattacctct
3001 gaccacaatc ctgagcattg g
//
LOCUS BLYHISH3PA 505 bp ss-mRNA PLN 26-JUL-1990
DEFINITION Barley histone H3 mRNA, 3' end.
ACCESSION M34928
KEYWORDS histone H3 protein.
SOURCE Barley (strain Nudinka) seed scutella 2 days after germination,
cDNA to mRNA.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 505)
AUTHORS Chojecki,J.
TITLE Identification and characterization of a cDNA clone for histone H3
in barley
JOURNAL Carlsberg Res. Commun. 51, 211-217 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 243 histone H3 protein
mRNA < 1 505 histone H3 mRNA
BASE COUNT 95 a 138 c 146 g 126 t
ORIGIN
1 aagagcaccg agctgctgat ccgcaagctc ccgttccagc gcctggtgag ggagatcgcg
61 caggacttca agaccgacct caggttccag tcccacgccg tgctggccct ccaggaggcc
121 gccgaggcgt acctcgtcgg gctgttcgag gacaccaacc tgtgcgccat ccacgccaag
181 cgcgtcacca tcatgcccaa ggacatccag ctcgcccgcc gcatccgcgg ggagcgcgcc
241 taagccaccc agagcgctgc attcgggagc gatgacaccg ttcgccagca ttagtgtagt
301 tgattggctt tccttgtcca gatatgcgtc ttgtggttcg ttgtagaaac cctggttggt
361 tggttcccgt agttacagag acttttctgc ttaagtggtt ttggtttgcg gtgttgcaaa
421 ccgatgctta ctgtgatgca aattgttggt taatgtagtg ttgattgaca attatcgatg
481 gatgaacttg tggtgttgcg tagtt
//
LOCUS BMOFIBA 324 bp ss-mRNA INV 26-JUL-1990
DEFINITION B.mori silk fibroin mRNA, partial cds.
ACCESSION M35378
KEYWORDS fibroin.
SOURCE B.mori (Kinryu x Showa) posterior silk gland, cDNA to mRNA.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Mita,K., Ichimura,S., Zama,M. and James,T.C.
TITLE Specific codon usage pattern and its implications on the secondary
structure of silk fibroin mRNA
JOURNAL J. Mol. Biol. 203, 917-925 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 324 silk fibroin (AA at 1)
BASE COUNT 35 a 60 c 144 g 85 t
ORIGIN
1 ggatacggag caggagctgg aagcggagct gcctctggtg ccggtgccgg ttcaggtgct
61 ggtgctggtt caggagctgg tgctggttca ggtgctggtg ctggttcagg tgctggtgct
121 ggttcaggtg ctggtgctgg ttcaggagct ggtgctggtt caggtgctgg tgctggttca
181 ggagctggtg ctggatacgg agcaggagct ggcgttggat acggagcagg agctgggagc
241 ggagctgcct ctggtgctgg tgctggttca ggtgctggtg ctggttcagg tgctggtgct
301 ggttcaggtg ctggtgctgg ttca
//
LOCUS DROMETA 338 bp ss-mRNA INV 26-JUL-1990
DEFINITION D.melanogaster metallothionein (MT) mRNA, complete cds.
ACCESSION M35390
KEYWORDS metallothionein.
SOURCE D.melanogaster larva, cDNA to mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 338)
AUTHORS Maroni,G., Lastowski-Perry,D., Otto,E. and Watson,D.
TITLE Effects of heavy metals on Drosophila larvae and a metallothionein
cDNA
JOURNAL Environ. Health Perspect. 65, 107-116 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 124 246 metallothionein
mRNA < 1 338 metallothionein mRNA
signal 308 313 polyA signal
BASE COUNT 101 a 88 c 77 g 72 t
ORIGIN
1 gatcagttgt ggtcagcagc aaaatcaagt gaatcatctc agtgcaacta aaggcctaaa
61 tagcccatac ctaccttttt tgtaaacaag tgaacaagtt cgaggaaata caactcaatc
121 aagatgcctt gcccatgcgg aagcggatgc aaatgcgcca gccaggccac caagggatcc
181 tgcaactgcg gatctgactg caagtgcggc ggcgacaaga aatccgcctg cggctgctcc
241 gagtgagctt tcccccaaaa aagatctgga gtagaggcgc tgcatcttgt ctctctacac
301 accctgcaat aaatgtccaa ttaaagtaat tgatgcct
//
LOCUS HUMVPREBA 503 bp ds-DNA PRI 26-JUL-1990
DEFINITION Human pre-B lymphocyte VpreB gene, 5' end.
ACCESSION M34927
KEYWORDS .
SOURCE Human myeloid cell line U937 DNA, clone pHVPB-6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 503)
AUTHORS Bauer,S.R., Kudo,A. and Melchers,F.
TITLE Structure and pre-B lymphocyte restricted expression of the VpreB
gene in humans and conservation of its structure in other mammalian
species
JOURNAL EMBO J. 7, 111-116 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 46 VpreB protein precursor, exon 1
133 > 503 VpreB protein precursor, exon 2
sigp 1 46 VpreB protein signal peptide
133 143 VpreB protein signal peptide
matp 144 > 503 VpreB protein
IVS 47 132 VpreB intron A
BASE COUNT 104 a 160 c 140 g 99 t
ORIGIN
1 atgtcctggg ctcctgtcct gctcatgcac tttgtctact gcacaggtga gggaaccccc
61 agatcccaaa gactcctgcc ccttccttca tcctgccctg cccccacggg ccacatgcat
121 ctgtgtcacc aggttgtggt cctcagccgg tgctacatca gccgccggcc atgtcctcgg
181 cccttggaac cacaatccgc ctcacctgca ccctgaggaa cgaccatgac atcggtgtgt
241 acagcgtcta ctggtaccag cagaggccgg gccaccctcc caggttcctg ctgagatatt
301 tctcacaatc agacaagagc cagggccccc aggtcccccc tcgcttctct ggatccaaag
361 atgtggccag gaacaggggg tatttgagca tctctgagct gcagcctgag gacgaggcta
421 tgtattactg tgctatgggg gcccgcagct cggagaagga ggagagggag agggagtggg
481 aggaagaaat ggaacccact gca
//
LOCUS MUSNGF 1176 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Mouse nerve growth factor (NGF) precursor mRNA, complete cds.
ACCESSION M35075 J00608
KEYWORDS nerve growth factor.
SOURCE Mouse male submaxillary gland, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1176)
AUTHORS Scott,J., Selby,M., Urdea,M., Quiroga,M., Bell,G.I. and Rutter,W.J.
TITLE Isolation and nucleotide sequence of a cDNA encoding the precursor
of mouse nerve growth factor
JOURNAL Nature 302, 538-540 (1983)
STANDARD simple staff_review
REFERENCE 2 (bases 3 to 226)
AUTHORS Edwards,R.H., Selby,M.J. and Rutter,W.J.
TITLE Differential RNA splicing predicts two distinct nerve growth factor
precursors
JOURNAL Nature 319, 784-787 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 96 1019 nerve growth factor precursor
sigp 96 656 nerve growth factor signal peptide
matp 657 1010 nerve growth factor
mRNA 1 1176 NGF mRNA
BASE COUNT 283 a 330 c 295 g 268 t
ORIGIN
1 gagcgcctgg agccggaggg gagcgcatcg agtgactttg gagctggcct tatatttgga
61 tctcccgggc agctttttgg aaactcctag tgaacatgct gtgcctcaag ccagtgaaat
121 taggctccct ggaggtggga cacgggcagc atggtggagt tttggcctgt ggtcgtgcag
181 tccagggggc tggatggcat gctggaccca agctcacctc agtgtctggg cccaataaag
241 gttttgccaa ggacgcagct ttctatactg gccgcagtga ggtgcatagc gtaatgtcca
301 tgttgttcta cactctgatc actgcgtttt tgatcggcgt acaggcagaa ccgtacacag
361 atagcaatgt cccagaagga gactctgtcc ctgaagccca ctggactaaa cttcagcatt
421 cccttgacac agccctccgc agagcccgca gtgcccctac tgcaccaata gctgcccgag
481 tgacagggca gacccgcaac atcactgtag accccagact gtttaagaaa cggagactcc
541 actcaccccg tgtgctgttc agcacccagc ctccacccac ctcttcagac actctggatc
601 tagacttcca ggcccatggt acaatccctt tcaacaggac tcaccggagc aagcgctcat
661 ccacccaccc agtcttccac atgggggagt tctcagtgtg tgacagtgtc agtgtgtggg
721 ttggagataa gaccacagcc acagacatca agggcaagga ggtgacagtg ctggccgagg
781 tgaacattaa caacagtgta ttcagacagt acttttttga gaccaagtgc cgagcctcca
841 atcctgttga gagtgggtgc cggggcatcg actccaaaca ctggaactca tactgcacca
901 cgactcacac cttcgtcaag gcgttgacaa cagatgagaa gcaggctgcc tggaggttca
961 tccggataga cacagcctgt gtgtgtgtgc tcagcaggaa ggctacaaga agaggctgac
1021 ttgcctgcag cccccttccc cacctgcccc ctccacactc tcttgggccc ctccctacct
1081 cagcctgtaa attattttaa attataagga ctgcatgata atttatcgtt tatacaattt
1141 taaagacatt atttattaaa ttttcaaagc atcctg
//
LOCUS RATXDHA 4162 bp ss-mRNA ROD 26-JUL-1990
DEFINITION Rat xanthine dehydrogenase mRNA, complete cds.
ACCESSION J05579
KEYWORDS xanthine dehydrogenase.
SOURCE Rat (strain Wistar) liver, cDNA to mRNA, clones
lambda-RXD[7,32,42,51].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4162)
AUTHORS Amaya,Y., Yamazaki K,-i., Sato,M., Noda,K., Nishino,T. and
Nishino,T.
TITLE Proteolytic conversion of xanthine dehydrogenase from the NAD-
dependent type to the oxygen-dependent type: Amino acid sequence of
rat liver xanthine dehydrogenase and identification of the cleavage
sites of the enzyme protein during irreversible conversion by
trypsin
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
Y.Amaya, 22-JUN-1990.
FEATURES from to/span description
pept 27 3986 xanthine dehydrogenase (EC 1.1.1.204)
signal 4125 4130 polyA signal (put.)
signal 4146 4151 polyA signal (put.)
BASE COUNT 1055 a 1046 c 1121 g 940 t
ORIGIN
1 agagctcagt gactccagca gccacgatga ctgcggatga gttggtcttc tttgtgaatg
61 gcaaaaaggt ggtggagaaa aatgcggacc ctgaaacaac acttctggtc tacctgagaa
121 gaaagttggg gctatgtggg accaagcttg gctgtggaga aggtggctgt ggggcatgca
181 ccgtgatgat ctccaagtat gaccgtcttc agaacaagat tgttcatttt tctgtcaatg
241 cctgcttggc tcccatctgc tccttgcacc atgttgctgt gaccaccgtg gaaggcatag
301 gaaacaccca gaagctgcat cctgtacagg agagaattgc cagaagccat ggttcccagt
361 gtgggttctg cactcctggc attgtcatga gtatgtacac actgctccgg aaccagcctg
421 agcctactgt tgaggagatc gagaatgcct tccaaggaaa cctctgtcgc tgtacaggct
481 acagacccat cctccaggga ttccggacct ttgccaagga tggtgggtgc tgtggaggga
541 gtggaaacaa cccaaactgc tgtatgaacc agacgaaaga ccaaacggtt tctctctcac
601 cttctttatt caacccagag gatttcaaac ctttagatcc cacgcaagag cccatcttcc
661 ccccagagtt gctgaggctg aaagacactc cccagaagaa gctgcgtttt gaaggggaac
721 gtgtgacctg gatccaggct tcaactatgg aggagctgct tgacctgaaa gctcagcacc
781 ctgatgccaa gctggtggtg ggaaacacag agataggcat tgaaatgaaa tttaagaata
841 tgctatttcc tctgatcgtc tgcccagcct ggatccctga actgaattca gtggtgcatg
901 ggcctgaggg aatctccttc ggagcttctt gcccccttag cttggtggaa agtgtcctgg
961 cggaggagat tgctaaactt ccagagcaaa agacagaggt gttcagaggc gtgatggagc
1021 agctgcgctg gtttgccggc aagcaggtca agtccgtggc gtccatcgga gggaacatca
1081 tcactgccag ccccatctct gacctcaacc ctgtgttcat ggccagtgga gccaagctga
1141 ctctggtgtc tagaggtacc aggagaactg ttcggatgga tcataccttc ttccctggct
1201 acagaaagac tctgctcaga ccagaggaga tattgctgtc catcgagatc ccctatagca
1261 aggagggaga gtttttctca gccttcaagc aggcctccag gagggaagat gacattgcca
1321 aggtgactag tggcatgaga gtcctgttca aaccggggac cattgaagtg caggaactgt
1381 ccctttgctt cggagggatg gccgacagaa ctatctcagc cctcaagacc actccgaagc
1441 agctatcgaa gtcctggaat gaggagctgc agctggcccc cgatgcccct ggtggtatgg
1501 tggaattccg gcgcaccctc accctcagct tcttcttcaa gttctacctg acagtgctcc
1561 agaagctggg cagagcggac cttgaggata tgtgtggtaa actggacccc acctttgcca
1621 gtgccaccct gctctttcag aaggaccctc cagctaatgt ccagcttttc caagaggtgc
1681 caaaggatca gtctgaggag gacatggtgg gccggcccct gcctcacctg gcggcaaaca
1741 tgcaggcatc gggagaggcc gtgtactgtg atgacattcc ccgctatgag aatgagctct
1801 ctctcaggct ggtcaccagc acccgggcgc atgctaaaat cacgtccatc gacacttcag
1861 aagccaagaa ggtgccaggg tttgtttgct tcctcaccgc agaggatgtc cctaatagta
1921 atgcaaccgg ccttttcaat gatgaaactg tctttgcgaa ggatgaggtt acttgtgttg
1981 ggcacatcat tggtgctgtg gtcgctgaca ccccagaaca cgcacagaga gctgcgagag
2041 gggtgaaaat cacctatgaa gatcttccag ccattatcac aatccaggat gctataaaca
2101 acaactcctt ttatggctct gagataaaaa ttgagaaagg agatctcaag aaaggctttt
2161 cagaagctga caatgttgtc tcaggagagt tgtatatcgg tggccaggag cacttctacc
2221 tggagaccaa ctgcaccatt gccgtgccaa aaggcgaggc aggcgagatg gagctgttcg
2281 tgagcacaca gaacaccatg aaaacccaga gctttgttgc aaaaatgttg ggcgttccgg
2341 acaacagaat cgtagtccga gtgaagagga tgggtggagg ctttggaggg aaggagaccc
2401 ggagcactgt ggtgtccaca gcactggcct tggctgcaca caagactggc cggcccgtac
2461 gttgcatgtt ggaccgagat gaggacatgc tgataactgg tggcagacat cccttcctgg
2521 ctaaatacaa ggttggcttc atgaagactg ggactgtagt ggctctcgag gtggctcact
2581 tcagcaatgg tggtaacact gaggatctct ctcggagtat aatggaacga gctttgttcc
2641 acatggataa cgcctataag atccccaaca ttcgaggcac tgggaggatt tgcaagacta
2701 atctgccctc caacacagcc ttcagaggtt ttgggggtcc tcaggggatg ctaatcgcag
2761 aatactggat gagcgaggtc gccataacct gtgggctgcc tgcagaggag gtacggagga
2821 aaaacatgta caaagaaggg gacctgactc acttcaacca gaagctggag gggttcacct
2881 tgcccaggtg ctgggatgaa tgcatcgcca gctctcagta tcttgctcgc aagagggaag
2941 tggagaaatt caacagggag aattgttgga aaaagagagg gctgtgtata atcccaacta
3001 agtttggaat aagctttaca cttccttttc tgaaccaggg aggcgctctg gttcacgtgt
3061 acactgatgg ttcggtgctg ttgacccatg gagggactga gatgggccaa ggccttcaca
3121 ccaagatggt tcaggtggcc agcagagctc tgaaaatccc cacctccaag attcatataa
3181 gtgagacaag cactaacacc gtccccaaca cttctcccac agctgcctct gccagtgctg
3241 acctcaatgg acagggtgtt tatgaagcat gccagaccat actgaaaagg ctggaacctt
3301 tcaagaagaa gaaacccacc ggcccctggg aggcatgggt gatggacgcc tatacgagcg
3361 cagtgagttt gtccgcaact ggattttata agacacccaa ccttggctac agctttgaga
3421 caaactccgg aaatcccttc cactatttca gttatggggt ggcttgctct gaagtagaaa
3481 ttgactgctt aacaggggat cataagaatc tccgtacgga tatcgtcatg gatgttggtt
3541 ccagcttgaa tcctgccatt gatattggac aagtagaggg ggcatttgtc cagggccttg
3601 gtctcttcac tatggaggag ctgcactact cccctgaggg gagcctgcat actcgtggcc
3661 ccagtaccta caaaatccct gcatttggta gcatccccat tgagttcaga gtatccctac
3721 tccgggactg ccccaacaag agggccatct atgcatccaa ggctgttggg gagccacctc
3781 ttttcctggc ttcctctatc ttctttgcca tcaaagatgc cattcgtgca gctcgagctc
3841 agcacggaga taacgcaaaa caacttttcc agctagacag ccctgccact ccggagaaga
3901 tccgaaacgc ctgtgtggac cagttcacca ccctgtgtgt cactggagta ccagaaaact
3961 gtaaatcctg gtctgtgagg atctgaagag aaggtctcca ccattggttt gtaccgcacc
4021 aggattcctt ggagccacaa gcacatcctg tagtatccag atttccgcat gccgcgtggg
4081 actcagcagg atgacatttt caggaagatg gacattttga cccaaataag agctgcaaac
4141 aaaccaataa gcaaatgggg ag
//
LOCUS RICHISH2AA 321 bp ds-DNA PLN 26-JUL-1990
DEFINITION Rice histone H2A gene, 5' end.
ACCESSION M35379
KEYWORDS histone.
SOURCE Rice DNA, clone pIR22.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Thomas,G. and Padayatty,J.D.
TITLE Restriction map and partial sequence of a rice DNA fragment
carrying histone genes H2A, H2B and H4
JOURNAL Indian J Biochem Biophys 21, 1-6 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
pept 260 > 321 histone H2A protein
mRNA 186 > 321 histone H2A mRNA
signal 36 40 CAAT box
signal 74 77 GATCC motif
signal 138 145 TATA box
BASE COUNT 74 a 70 c 60 g 73 t 44 others
ORIGIN
1 caaaggacnt gttcccgctg atgtgagcaa ttgtcacaat gccctcccaa acngttttca
61 gatngtngat gtggatcnnn antttnttgc gnntnnanac ctggctctcg ttttttcgca
121 angtcccgaa cnnnnngtat aaatagcgtg tggacccgta ncgtgagaac tcgtgatctn
181 atttcatctg gaacgactcn nggaatnttc cgaaaannnn nnnnnnnnng ccgaaagcct
241 tttggaactt ttcnnccaaa tgcacaccaa aggcctcngg aagnnttttc ancgcaaaaa
301 gatatcaccc gcagggatca c
//
LOCUS TEYMT14SRR 169 bp ds-DNA ORG 26-JUL-1990
DEFINITION T.pyriformis mitochondrial 14S rRNA.
ACCESSION M35376
KEYWORDS 14S ribosomal RNA.
SOURCE T.pyriformis (strain ST) linear mitochondrial DNA.
ORGANISM Mitochondrion Tetrahymena pyriformis
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae;
Tetrahymena pyriformis.
REFERENCE 1 (bases 1 to 169)
AUTHORS Suyama,Y., Fukuhara,H. and Sor,F.
TITLE A fine restriction map of the linear mitochondrial DNA of
Tetrahyemena pyriformis: Genome size, map locations of rRNA and
tRNA genes, terminal inversion repeat, and restriction site
polymorphism
JOURNAL Curr. Genet. 9, 479-493 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 169 14S rRNA
site 31 144 conserved U5 region
BASE COUNT 54 a 26 c 37 g 52 t
ORIGIN
1 gaattcagaa tagctaacgc aaagtattct gcttggggag tattatcgca agattaaaac
61 ttaactgaat tggcgggaat ttgttcgaac ggtggaacat gtggtttaat gcgataatcc
121 acgcaaaatc ttaccaacgt tttaggcttt atctgataat atggttaac
//
LOCUS YSCPET122 2862 bp ds-DNA PLN 26-JUL-1990
DEFINITION Yeast PET122 encoded protein gene, complete cds.
ACCESSION X07558
KEYWORDS PET122 encoded protein.
SOURCE Yeast (S.cerevisiae, strain AB320) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2862)
AUTHORS Ohmen,J.D., Burke,K.A. and McEwen,J.E.
TITLE Divergent overlapping transcripts at the PET122 locus in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 3027-3035 (1990)
STANDARD simple staff_entry
REFERENCE 2 (bases 953 to 2862)
AUTHORS Ohmen,J.D., Kloeckener-Gruissem,B. and McEwen,J.E.
TITLE Molecular cloning and nucleotide sequence of the nuclear PET122
gene required for expression of the mitochondrial COX3 gene in
S.cerevisiae
JOURNAL Nucleic Acids Res. 16, 10783-10862 (1988)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.D.Ohmen, 11-JUN-1990.
Draft entry and computer-readable sequence for [1] submitted to
EMBL by J.D.Ohmen, 09-JUN-1989.
EMBL features not translated to GenBank features:
key from to description
SITE 746 1096 similarity to E.coli alanyl
tRNA-synthetase (AA 116-232) [1]
Author address: McEwan J.E.
Department of Microbiology
College of Letters and Science
University of California
405 Hilgard Avenue
Los Angeles, CA 90024-1489
FEATURES from to/span description
pept 1139 < 1 (c) ORF3
pept 1354 2118 PET122 protein
pept 2171 > 2862 ORF2
mRNA 1144 < 1 (c) ORF3 mRNA
mRNA 1152 < 1 (c) ORF3 mRNA
mRNA 1157 < 1 (c) ORF3 mRNA
mRNA 1159 < 1 (c) ORF3 mRNA
mRNA 1870 < 1 (c) ORF3 mRNA
mRNA 1875 < 1 (c) ORF3 mRNA
mRNA 1882 < 1 (c) ORF3 mRNA
mRNA 1883 < 1 (c) ORF3 mRNA
mRNA 1887 < 1 (c) ORF3 mRNA
mRNA 1895 < 1 (c) ORF3 mRNA
mRNA 1907 < 1 (c) ORF3 mRNA
mRNA 1343 > 2119 PET122 mRNA
mRNA 1348 > 2119 PET122 mRNA
mRNA 1354 > 2119 PET122 mRNA
mRNA 2140 > 2862 ORF2 mRNA
mRNA 2143 > 2862 ORF2 mRNA
mRNA 2147 > 2862 ORF2 mRNA
mRNA 2150 > 2862 ORF2 mRNA
BASE COUNT 748 a 652 c 795 g 667 t
ORIGIN
1 aagctttctt gtaacttctt ctcattatct tgcatcaatt gccttctttc cgcctgatct
61 cttgcctttt gaatgttatg ttttaatgat tggaagatgc ccatgttctc tgtgggggaa
121 gcgccagcga taggagtcct tggtttagct acttctgtta tcttcagttt cgaacgaacc
181 catttgtttc tcaaaatcat tgtctgtagg acggagaagg caccattaaa ggcaaagtag
241 aggaccacag cggacgataa gttcattgtg gccggtatag aaatgatcgg tagaatagtg
301 aaaagacgct tcatgggaga actgaattgt tgagcaccag tctcaccccc cagccttgta
361 aatgagatga acacagcggc agtgattact tgcaaaccta agtaagggtc tgcttgagtc
421 aagtctgtaa accaagcgac accttgatta gcgaacccat ctactgggta gttagccatg
481 tgtctcaatg cgttgaaaaa cccaagggcg attggaattt gtagcatggg tgcggccagc
541 catctgttct taatgccgtg cgaggagagc agttttttcc tttgcatggc gactagctga
601 ccttgttgca aatctgtagt ggacattagc ttattattca aggcgtccag ctcgggcttg
661 atatgggaat ttctagcaac agtatcagag gacttgacat agaggggaaa catcaggcat
721 cgaatgagga tggtggtggc cgcgatagtt ccccaccaag gcaacccaga gtaaacatga
781 acggcctcca agacgtgttg gataatgtcc gagggccagt accaggtttg ggccaggcca
841 atgctattta agtaccctat atgggaggac aactcgccca ctgtttgggt cgtgttagcg
901 ataaggtccg aagtagaagc ggaaagagaa ggagctgaag aggttaattc atcgatggaa
961 ggcaactggg tttggatttc cgagacatcg ttggcatttg ggcccgtcga attaaatctt
1021 ttggcctgaa aagagatcca tgacggatgg ggccggggca atactatggt tcgagcggtg
1081 gccagtctgg aagaggcagc aaaccttgac gtgacgagtc gagaggtgag tttgaacatc
1141 gtcggggagg ttattctgtg gctccgcttg tacgtgaaca gatacgtata gagggcgagc
1201 cactggttaa atttttcatg gctcggatta cttccgtact gctggctaaa atcgaaatct
1261 cggcctgctg agagtgtttt gagcaatcaa gggaacatct gaacgtggaa gagcagacga
1321 ggcattagct cgaacataag aacggaacac gtcatgttga ctatcacgaa aagactggtg
1381 accaccgatg tgcggtcgcg aatactgtta agcagtttaa acgggaaaat gtccgatgca
1441 ctggcgctgc tgcgtcagca gcagcagacc agcgtggatg tggagctgct gcacacgatg
1501 ctagcgcgag ccgctgcgct tgcccatgcc gacactatag catacatgtg gtatcagcat
1561 gtgatgccac gccggttgcc agtagagggc cgcctgctat gtgaaatggc tggcgtagca
1621 ttgtaccagg acaggctctt cttacccgcg cagttcctcc agcactacca ggcgatgaat
1681 cgcgatcgtc gcaccagccc agaagatgaa ctgattgagt atgagcttag acggattaaa
1741 gtcgaagcgt ttgcgcgtgg cacaatgcac tccacggcgc tcagggaaaa gtggaaggta
1801 ttcttgcagg agatggatac gctaccaggg cagccgccat taaggctgcg cgacttcccg
1861 caaatgacca aggctatggg catagcattg atgcagcaag atgagcaagc agctgccctg
1921 gcgttgtttg gacgacagcc cctagtgata aagaacgaat ggtcactacc gctactactg
1981 gctggtgtcc tttggcatgt tcccggccca gcgcaggcgc gacgtgtgct ggcggagttc
2041 cgtcaaagtt atcgcgggct gccgctgctg gatgccgaac tagtgataaa gagaagagga
2101 tttgaaatca acacataaat ctgggtggag catcgctgta acaaggaaca acgcgtgcta
2161 gcaagcggta atgaaataca aggaaatcaa tttcttcaag ggccatccga gctcgaggtt
2221 gctgcctcga gaagcagtaa ttcaagcgac tgcggctata ttggggcccg agaccaggga
2281 gtacgataac gacccctata acaggcatcc gctgacgtac ggttcggacg aaggtgccct
2341 gtgggtgcga gagcagattt gtacgtttct gaatgatcag ctgtttaagt tcgaaaatgg
2401 ggctcggagc aggacacggg cagactattt gaatctgaat agcggcgctt cgtatggcat
2461 gctgaacatc cttctgcaaa caaccttgcc acataacggg tataccaggc aggcgttcat
2521 catcacgcca acatatttct tgatcaacaa ttgcttcaca gatgcgggat tcaaggggaa
2581 aatgaccgcc atcaacgagc agggccacga ctcgattgat ttcgagtcgt tgatttctgc
2641 ccttgagcag cacgaggcgg agccgcagcc ccatagtacc acagagatga ttcaggggcc
2701 aaagttgacc aagaaggtct acaggtacgt tatgtactgc atcccgacgt ttgcaaaccc
2761 atcgggaaac acatactcgc ttgagaccag acgcagactt atcgacatcg ctcggaagta
2821 cgacatgctg ataatcactg atgacgtgta cgatattcta ga
//
LOCUS ECO987P 954 bp ds-DNA BCT 26-JUL-1990
DEFINITION E.coli fimbriae 987P subunit gene, complete cds.
ACCESSION M35257
KEYWORDS fimbriae.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 954)
AUTHORS De Graaf,F.K. and Klaasen,P.
TITLE Nucleotide sequence of the gene encoding the 987P fimbrial subunit
of Escherichia coli
JOURNAL FEMS Microbiol. Lett. 42, 253-258 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 259 843 fimbriae 987P subunit precursor
sigp 259 328 fimbriae 987P subunit signal peptide
matp 329 840 fimbriae 987P subunit
BASE COUNT 309 a 170 c 180 g 295 t
ORIGIN
1 aaatttagaa aagtgcatta tgcttatcac tagataagaa aataaaacac gaaatatagc
61 gagccatata gcctgttgtg tttgtaatag ataaaaaaca cgcaattgat tatttatgta
121 tctttttgtt tgtatttttt tattaaaaaa agcacacaat tactgcgtgc atcgaaatga
181 gttgaagtgg atgcatatat gcatgaaatg cttttaactt gaaagtctta atgtttctat
241 taattaagat aaggtaatat gagaatgaaa aaatccgcat taacattagc agtgctttcc
301 tctctgttca gtggttactc gctcgcagcg cccgctgaaa acaacaccag ccaggcaaat
361 ttagacttta ctggtaaagt tactgccagt ctatgccaag tggatacttc taatctgtcg
421 caaaccatag atcttggaga gttgtctact tctgctctta aagctactgg caaggggcct
481 gccaagtcat ttgcagttaa tcttatcaac tgcgatacaa cattgaattc tattaaatac
541 actattgctg gtaataataa tacaggaagt gatactaaat atttagttcc agcctccaat
601 gatactagtg catcaggagt tggcgtatac attcaggaca acaacgccca ggctgtggaa
661 attggtactg aaaaaactgt acctgtggta tcaaatggcg gattagctct ttcagaccaa
721 agtattccac tgcaagcata catcggaacc accacaggga atcctgatac aaacggtgga
781 gttacggccg gtactgtcac tgctagtgca gtaatgacta ttcgttcagc aggtacaccg
841 taattagata acaattttta tacaacaaaa caggaaggat tttgaactaa tccttcctgt
901 tattggagat tgaaatgtct aagtttgtaa tatttcttgt gtttttgttt atat
//GenBank-Updates@genbank.bio.net (07/27/90)
LOCUS HUMHPRTB 56736 bp ds-DNA PRI 27-JUL-1990
DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete
cds.
ACCESSION M26434
KEYWORDS Alu repetitive element; LINE repetitive element;
hypoxanthine phosphoribosyltransferase.
SOURCE Human cell line 4X,Y (GM1202), fetal liver, and beta thalassemia
patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 56736)
AUTHORS Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J.,
Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W.
TITLE Automated DNA sequencing of the human HPRT locus
JOURNAL Genomics 6, 593-608 (1990)
STANDARD full staff_entry
REFERENCE 2 (sites; mutations causing gout)
AUTHORS Argos,P., Hanei,M., Wilson,J.M. and Kelley,W.N.
TITLE A possible nucleotide-binding domain in the tertiary fold of
phosphoribosyltransferase
JOURNAL J. Biol. Chem. 258, 6450-6457 (1983)
STANDARD full staff_entry
REFERENCE 3 (sites; mutations causing gout)
AUTHORS Cariello,N.F., Scott,J.K., Kat,A.G., Thilly,W.G. and Keohavong,P.
TITLE Resolution of a missense mutant in human genomic DNA by denaturing
gradient gel electrophoresis and direct sequencing using in vitro
DNA amplification: HPRT-Munich
JOURNAL Am. J. Hum. Genet. 42, 726-734 (1988)
STANDARD full staff_entry
REFERENCE 4 (sites; mutations causing gout)
AUTHORS Davidson,B.L., Chin,S.J., Wilson,J.M., Kelley,W.N. and Palella,T.D.
TITLE Hypoxanthine-guanine phosphoribosyltransferase. Genetic evidence
for identical mutations in two partially deficient subjects
JOURNAL J. Clin. Invest. 82, 2164-2167 (1988)
STANDARD full staff_entry
REFERENCE 5 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Davidson,B.L., Palella,T.D. and Kelly,W.N.
TITLE Human hypoxanthine-guanine phosphoribosyltransferase: A single
nucleotide substitution in cDNA clones isolated from a patient with
Lesch-Nyhan syndrome (HPRT-Midland)
JOURNAL Gene 68, 85-91 (1988)
STANDARD full staff_entry
REFERENCE 6 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D.
TITLE Genetic basis of hypoxanthine guanine phosphoribosyltransferase
deficiency in a patient with Lesch-Nyhan syndrome (HPRT-Flint)
JOURNAL Gene 63, 331-336 (1988)
STANDARD full staff_entry
REFERENCE 7 (sites; mutations causing Lesch-Nyhan syndrome and gout)
AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D.
TITLE Human hypoxanthine-guanine phosphoribosyltransferase deficiency.
The molecular defect in a patient with gout (HPRT-Ashville)
JOURNAL J. Biol. Chem. 264, 520-525 (1989)
STANDARD full staff_entry
REFERENCE 8 (sites; mutations causing gout)
AUTHORS Davidson,B.L., Tarle,S.A., Palella,T.D. and Kelley,W.N.
TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
deficiency in ten subjects determined by direct sequencing of
amplified transcripts
JOURNAL J. Clin. Invest. 84, 342-346 (1989)
STANDARD full staff_entry
REFERENCE 9 (sites; mutations causing gout)
AUTHORS Dush,M.K., Sikela,J.M., Khan,S.A., Tischfield,J.A. and
Stambrook,P.J.
TITLE Nucleotide sequence and organization of the mouse adenine
phosphoribosyltransferase gene: Presence of a coding region common
to animal and bacterial phosphoribosyltransferases that has a
variable intron/exon arrangement
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2731-2735 (1985)
STANDARD full staff_entry
REFERENCE 10 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Fujimori,S., Davidson,B.L., Kelley,W.N. and Palella,T.D.
TITLE Identification of a single nucleotide change in the
hypoxanthine-guanine phosphoribosyltransferase gene (HPRT-Yale)
responsible for Lesch-Nyhan syndrome
JOURNAL J. Clin. Invest. 83, 11-13 (1989)
STANDARD full staff_entry
REFERENCE 11 (sites; mutations causing gout)
AUTHORS Fujimori,S., Hidaka,Y., Davidson,B.L., Palella,T.D. and Kelley,W.N.
TITLE Identification of a single nucleotide change in a mutant gene for
hypoxanthene-guanine phosphoribosyltransferase (HPRT-Ann Arbor)
JOURNAL Hum. Genet. 79, 39-43 (1988)
STANDARD full staff_entry
REFERENCE 12 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Gibbs,R.A., Nguyen,P.N., Edwards,A., Civitello,A.B. and Caskey,C.T.
TITLE Multiplex DNA deletion detection and exon sequencing of the
hypoxanthine phosphoribosyltransferase gene in Lesch-Nyhan families
JOURNAL Genomics 7, 235-244 (1990)
STANDARD full staff_entry
REFERENCE 13 (sites; mutations causing Lesch-Nyhan syndrome and gout)
AUTHORS Gibbs,R.A., Nguyen,P.N., McBride,L.J., Koepf,S.M. and Caskey,C.T.
TITLE Identification of mutations leading to the Lesch-Nyhan syndrome by
automated direct DNA sequencing of in vitro amplified cDNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1919-1923 (1989)
STANDARD full staff_entry
REFERENCE 14 (sites; mutations causing gout)
AUTHORS Hershey,H.V. and Taylor,M.W.
TITLE Nucleotide sequence and deduced amino acid sequence of Escherichia
coli adenine phosphoribosyltransferase and comparison with other
analogous enzymes
JOURNAL Gene 43, 287-293 (1986)
STANDARD full staff_entry
REFERENCE 15 (sites; conserved domain)
AUTHORS King,A. and Melton,D.W.
TITLE Characterisation of cDNA clones for hypoxanthine-guanine
phosphoribosyltransferase from the human malarial parasite,
Plasmodium falciparum: Comparisons to the mammalian gene and
protein
JOURNAL Nucleic Acids Res. 15, 10469-10481 (1987)
STANDARD full staff_entry
REFERENCE 16 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Ogasawara,N., Stout,J.T., Goto,H., Sonta,S.I., Matsumoto,A. and
Caskey,C.T.
TITLE Molecular analysis of a female Lesch-Nyhan patient
JOURNAL J. Clin. Invest. 84, 1024-1027 (1989)
STANDARD full staff_entry
REFERENCE 17 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Stout,J.T. and Caskey,C.T.
TITLE HPRT: Gene structure, expression, and mutation
JOURNAL Annu. Rev. Genet. 19, 127-148 (1985)
STANDARD full staff_entry
REFERENCE 18 (sites; Lesch-Nyhan mutation)
AUTHORS Wilson,J.M. and Kelley,W.N.
TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
deficiency in a patient with the Lesch-Nyhan syndrome
JOURNAL J. Clin. Invest. 71, 1331-1335 (1983)
STANDARD full staff_entry
REFERENCE 19 (sites; mutations causing Lesch-Nyhan syndrome and gout)
AUTHORS Wilson,J.M. and Kelley,W.N.
TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Structural
alteration in a dysfunctional enzyme variant (HPRT-Munich) isolated
from a patient with gout
JOURNAL J. Biol. Chem. 259, 27-30 (1984)
STANDARD full staff_entry
REFERENCE 20 (sites; mutations causing gout)
AUTHORS Wilson,J.M., Kobayashi,R., Fox,I.H. and Kelley,W.N.
TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Molecular
abnormality in a mutant form of the enzyme (HPRT-Toronto)
JOURNAL J. Biol. Chem. 258, 6458-6460 (1983)
STANDARD full staff_entry
REFERENCE 21 (sites; mutations causing gout)
AUTHORS Wilson,J.M., Tarr,G.E. and Kelley,W.N.
TITLE Human hypoxanthine (guanine) phosphoribosyltransferase: An amino
acid substitution in a mutant form of the enzyme isolated from a
patient with gout
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 870-873 (1983)
STANDARD full staff_entry
REFERENCE 22 (sites; mutations causing Lesch-Nyhan syndrome)
AUTHORS Yang,T.P., Patel,P.I., Chinault,A.C., Stout,J.T., Jackson,L.G.,
Hildebrand,B.M. and Caskey,C.T.
TITLE Molecular evidence for new mutation at the hprt locus in
Lesch-Nyhan patients
JOURNAL Nature 310, 412-414 (1984)
STANDARD full staff_entry
REFERENCE 23 (sites; mutant reversion)
AUTHORS Yang,T.P., Stout,J.T., Konecki,D.S., Patel,P.I., Alford,R.L. and
Caskey,C.T.
TITLE Spontaneous reversion of novel Lesch-Nyhan mutation by HPRT gene
rearrangement
JOURNAL Somat. Cell Mol. Genet. 14, 293-303 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Edwards, 26-JUL-1989.
Mutant Description
RJK 1780 missing intron 1 (partial and exon 2 [12]
RJK 849 missing intron 3 (partial) and exons 4-9 (no mRNA)
[22]
RJK 984 missing intron 5 (partial) and exons 6-9 (no mRNA)
[17]
[22]
GM 3467 missing intron 8 (partial) and exon 9 (no mRNA) [12]
[22]
RJK 853 complete gene deletion [12] [22] [16]
GM 2227 inversion of exons 6-9 (no mRNA) [23]
GM 1662 and GM 6804 duplication of exons 2 and 3 and elongated
mRNA [22] [23]
Connersville missing intron 7 (partial and exon 8 [7]
FEATURES from to/span description
pept 1677 1703 hypoxanthine phosphoribosyltransferase, exon 1
/nomgen="HPRT" /map="Xq26"
/hgml_locus_uid="LE0031K"
14780 14886 hypoxanthine phosphoribosyltransferase, exon 2
16603 16786 hypoxanthine phosphoribosyltransferase, exon 3
27891 27956 hypoxanthine phosphoribosyltransferase, exon 4
31617 31634 hypoxanthine phosphoribosyltransferase, exon 5
34937 35019 hypoxanthine phosphoribosyltransferase, exon 6
39815 39861 hypoxanthine phosphoribosyltransferase, exon 7
40033 40109 hypoxanthine phosphoribosyltransferase, exon 8
41454 41501 hypoxanthine phosphoribosyltransferase, exon 9
IVS 1704 14779 HPRT intron A
IVS 14887 16602 HPRT intron B
IVS 16787 27890 HPRT intron C
IVS 27957 31616 HPRT intron D
IVS 31635 34936 HPRT intron E
IVS 35020 39814 HPRT intron F
IVS 39862 40032 HPRT intron G
IVS 40110 41453 HPRT intron H
rpt 258 41 (c) Alu repeat copy A
rpt 1012 694 (c) Alu repeat copy B
rpt 3532 3342 (c) Alu repeat copy C
rpt 3982 3343 (c) Alu repeat copy D
rpt 6232 5937 (c) Alu repeat copy E
rpt 6524 6288 (c) Alu repeat copy F
rpt 7504 7807 Alu repeat copy G
rpt 8713 8408 (c) Alu repeat copy H
rpt 9477 9169 (c) Alu repeat copy I
rpt 9699 10142 Alu repeat copy J
rpt 10407 10293 (c) Alu repeat copy K
rpt 12154 12310 Alu repeat copy L
rpt 13302 12998 (c) Alu repeat copy M
rpt 13638 13314 (c) Alu repeat copy N
rpt 14238 13923 (c) Alu repeat copy O
rpt 14610 14239 (c) Alu repeat copy P
rpt 16090 15843 (c) Alu repeat copy Q
rpt 16518 16293 (c) Alu repeat copy R
rpt 18783 18439 (c) Alu repeat copy S
rpt 18962 19084 Alu repeat copy T
rpt 19397 19104 (c) Alu repeat copy U
rpt 20064 19778 (c) Alu repeat copy V
rpt 20486 20636 Alu repeat copy W
rpt 20990 20696 (c) Alu repeat copy X
rpt 21293 20996 (c) Alu repeat copy Y
rpt 22171 21878 (c) Alu repeat copy Z
rpt 23772 23460 (c) Alu repeat copy AA
rpt 25310 25426 Alu repeat copy AB
rpt 26388 26279 (c) Alu repeat copy AC
rpt 29863 29560 (c) Alu repeat copy AD
rpt 33618 33148 (c) Alu repeat copy AE
rpt 36575 36645 Alu repeat copy AF
rpt 38613 39039 Alu repeat copy AG
rpt 39091 39388 Alu repeat copy AH
rpt 40622 40925 Alu repeat copy AI
rpt 40952 41262 Alu repeat copy AJ
rpt 42640 42936 Alu repeat copy AK
rpt 44776 44486 (c) Alu repeat copy AL
rpt 46627 46305 (c) Alu repeat copy AM
rpt 47520 47763 Alu repeat copy AN
rpt 49257 49551 Alu repeat copy AO
rpt 49705 49993 Alu repeat copy AP
rpt 50002 50312 Alu repeat copy AQ
rpt 51066 50774 (c) Alu repeat copy AR
rpt 52276 51978 (c) Alu repeat copy AS
rpt 52407 52674 Alu repeat copy AT
rpt 54653 54551 (c) Alu repeat copy AU
rpt 55465 55164 (c) Alu repeat copy AV
rpt 56523 56689 Alu repeat copy AW
rpt 11312 11254 (c) LINE repeat copy A (partial)
rpt 18284 18242 (c) LINE repeat copy B (partial)
rpt 27091 26843 (c) LINE repeat copy C (partial)
rpt 37558 37300 (c) LINE repeat copy D (partial)
rpt 39451 39427 (c) LINE repeat copy E (partial)
rpt 51317 51100 (c) LINE repeat copy F (partial)
rpt 261 321 short interspersed repeat copy A
rpt 1013 1054 short interspersed repeat copy B
rpt 2902 2869 (c) short interspersed repeat copy C
rpt 3078 3107 short interspersed repeat copy D
rpt 4228 4254 short interspersed repeat copy E
rpt 4463 4494 short interspersed repeat copy F
rpt 21582 21635 short interspersed repeat copy G
rpt 28430 28394 (c) short interspersed repeat copy H
rpt 28923 28981 short interspersed repeat copy I
rpt 35562 35604 short interspersed repeat copy J
rpt 44042 44108 short interspersed repeat copy K
rpt 46628 46688 short interspersed repeat copy L
rpt 51715 51687 (c) short interspersed repeat copy M
unsure 17479 17479 polymorphism or cloning error
unsure 24903 24903 polymorphism or cloning error
mut 1664 1678 ggccggctccgttat in wt; gt in gout RJK 951 [13]
mut 14778 14778 a in wt; t in Lesch-Nyhan RJK 1760, no exon 2
[12]
mut 14808 14809 tt in wt; ttt in Lesch-Nyhan Chicago [7]
mut 14874 14874 t in wt; c in Lesch-Nyhan Detroit Leu->Pro [7]
mut 14877 14879 tta in wt; ta in Lesch-Nyhan RJK 1939 [12]
mut 14886 14886 g in wt; a in Lesch-Nyhan RJK 2163 Arg->Lys
[12]
mut 16619 16619 c in wt; g in gout Toronto Arg->Gly [20]
mut 16623 16664 atgtgatgaaggagatgggaggccatcacattgtagccctct in
wt; at in Lesch-Nyhan RJK 2108 [12]
mut 16675 16676 ag in wt; agg in Lesch-Nyhan RJK 866 [13]
mut 16677 16677 g in wt; a in Lesch-Nyhan New Haven Gly->Glu
[7]
mut 16679 16679 g in wt; c in Lesch-Nyhan Yale Gly->Arg [10]
mut 16690 16690 c in wt; a in Lesch-Nyhan Flint RJK 892
Phe->Leu [5] [13]
mut 16707 16707 a in wt; t in gout Arlingen Asp->Val [7]
mut 16756 16759 tgta in wt; ta in Lesch-Nyhan RJK 1332 [13]
mut 16780 16780 c in wt; a in gout Munich Ser->Arg [3] [19]
mut 16784 16787 tgtg in wt; tg in Lesch-Nyhan RJK 1747 [12]
mut 27897 27897 c in wt; t in Lesch-Nyhan RJK 1930 Gln->TAG
[12]
mut 27911 27911 c in wt; t in gout London Ser->Leu [2] [4]
[12] [14]
mut 31621 31621 t in wt; a in Lesch-Nyhan Midland RJK 896
Val->Asp [6] [13]
mut 31622 31624 ctt in wt; ct in Lesch-Nyhan RJK 2019 [12]
mut 31624 31624 t in wt; c in Lesch-Nyhan RJK 1784 Leu->Ser
[13]
mut 31628 31628 t in wt; g in gout Ann Arbor Ile->Met [11]
mut 34962 34970 tgcag in wt; agcaaagcaa in Lesch-Nyhan RJK
1210 Met->Lys [13]
mut 35015 35015 g in wt; t in gout Milwaukee RJK 949 Ala->Ser
[7] [13]
mut 39837 39837 c in wt; t in Lesch-Nyhan RJK 974 Arg->TGA
[13]
mut 39842 39846 tgttg in wt; tg in Lesch-Nyhan RJK 1894 [12]
mut 39858 39858 g in wt; t in Lesch-Nyhan RJK 2185 Asp->Tyr
[12]
mut 39866 39866 g in wt; a in Lesch-Nyhan RJK 1934. no exon 7
[12]
mut 40034 40038 tgttg in wt; tg in Lesch-Nyhan Michigan RJK
855 [7] [13]
mut 40080 40080 g in wt; a in Lesch-Nyhan Kingston RJK 2188
Asp->Asn [12] [19]
mut 40095 40095 t in wt; g in Lesch-Nyhan New Briton RJK 950
Phe->Val [7] [13]
mut 40102 40102 a in wt; g in gout Ashville Asp->Gly [8]
mut 40114 40114 g in wt; a in Lesch-Nyhan RJK 888, GM 7092, no
exon 8 [12]
mut 41486 41509 aaaatacaaagcctaagatgagag in wt; ag in
Lesch-Nyhan Evansville RJK 894 [7] [12]
mut 41450 41452 ata in wt; ttt in Lesch-Nyhan RJK 906, GM
1899, downstream cryptic splice site used
mut 41454 41454 c in wt; g in Lesch-Nyhan RJK 1874, RJK 2079,
His->Asp [12] [13]
mut 41461 41461 g in wt; a in Lesch-Nyhan RJK 1727 Cys->Tyr
[13]
BASE COUNT 15689 a 11281 c 11599 g 18167 t
ORIGIN 1 bp upstream of EcoRI site; chromosome Xq26.
1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt
61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc
121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc
181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt
241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga
301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc
361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag
421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc
481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt
541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct
601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga
661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt
721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat
781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga
841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag
901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct
961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc
1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag
1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac
1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc
1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg
1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt
1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg
1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg
1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg
1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc
1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc
1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg
1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg
1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg
1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc
1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga
1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca
1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc
2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat
2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg
2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag
2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg
2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct
2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt
2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg
2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca
2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg
2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt
2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg
2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa
2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt
2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg
2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc
2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc
3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt
3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt
3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct
3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat
3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt
3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag
3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc
3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc
3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta
3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac
3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc
3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt
3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc
3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc
3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc
3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag
3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc
4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca
4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc
4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc
4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa
4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca
4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga
4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac
4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg
4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt
4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac
4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg
4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag
4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg
4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca
4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg
4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat
4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg
5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc
5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta
5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc
5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact
5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct
5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg
5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg
5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc
5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa
5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat
5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa
5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg
5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc
5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa
5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt
5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt
6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg
6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga
6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg
6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc
6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt
6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc
6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg
6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt
6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga
6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac
6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct
6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag
6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc
6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt
6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg
6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc
6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca
7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc
7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga
7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt
7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc
7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg
7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag
7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag
7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca
7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt
7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac
7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga
7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt
7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa
7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct
7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact
7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt
7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc
8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac
8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct
8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg
8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat
8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag
8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat
8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg
8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg
8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc
8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac
8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg
8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc
8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat
8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga
8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta
8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg
9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt
9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag
9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt
9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc
9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag
9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt
9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca
9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc
9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc
9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat
9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg
9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc
9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga
9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca
9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat
9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta
9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc
10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc
10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa
10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa
10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat
10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt
10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct
10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt
10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga
10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca
10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct
10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg
10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg
10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa
10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta
10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa
10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca
10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct
11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa
11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc
11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat
11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc
11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag
11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag
11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga
11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg
11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg
11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt
11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac
11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa
11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa
11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa
11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt
11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt
12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag
12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat
12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat
12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc
12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt
12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt
12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct
12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat
12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa
12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag
12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt
12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac
12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca
12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct
12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt
12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc
12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta
13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc
13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg
13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc
13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct
13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt
13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg
13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat
13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca
13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga
13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt
13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac
13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta
13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg
13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc
13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat
13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg
13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc
14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt
14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga
14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt
14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc
14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg
14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc
14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc
14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca
14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg
14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata
14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg
14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac
14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg
14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat
14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac
14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa
15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt
15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg
15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg
15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg
15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt
15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat
15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt
15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata
15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat
15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag
15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg
15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt
15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga
15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc
15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca
15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag
15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt
16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag
16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt
16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa
16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa
16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt
16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact
16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac
16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac
16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa
16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg
16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg
16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag
16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa
16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa
16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta
16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta
16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg
17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt
17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt
17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt
17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc
17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg
17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc
17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt
17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa
17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc
17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat
17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc
17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat
17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact
17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa
17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt
17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa
18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat
18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat
18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt
18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat
18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc
18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg
18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt
18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt
18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca
18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc
18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt
18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg
18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg
18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat
18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa
18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat
18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg
19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa
19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag
19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga
19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc
19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa
19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg
19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc
19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa
19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg
19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa
19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa
19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg
19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag
19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga
19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag
19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt
19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct
20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt
20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac
20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct
20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga
20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat
20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg
20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat
20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt
20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc
20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag
20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt
20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca
20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta
20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg
20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc
20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt
21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc
21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg
21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc
21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc
21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa
21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat
21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa
21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat
21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat
21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt
21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt
21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta
21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt
21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt
21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct
21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct
21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat
22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc
22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg
22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca
22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag
22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc
22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat
22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg
22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct
22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca
22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt
22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc
22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta
22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct
22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta
22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct
22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa
22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa
23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag
23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc
23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa
23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc
23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac
23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga
23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt
23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc
23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc
23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat
23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc
23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc
23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc
23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga
23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga
23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa
24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg
24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat
24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat
24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct
24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt
24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga
24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg
24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt
24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc
24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt
24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc
24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg
24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat
24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa
24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt
24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg
24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc
25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag
25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc
25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact
25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta
25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg
25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc
25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag
25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc
25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca
25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag
25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag
25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt
25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag
25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat
25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa
25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata
25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg
26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt
26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat
26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt
26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag
26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc
26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact
26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg
26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat
26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag
26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt
26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt
26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata
26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta
26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat
26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg
26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata
27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata
27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg
27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc
27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat
27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag
27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga
27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt
27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac
27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa
27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg
27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt
27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat
27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct
27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata
27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt
27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat
27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct
28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg
28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca
28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat
28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa
28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca
28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc
28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg
28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa
28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt
28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt
28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt
28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat
28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa
28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac
28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac
28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca
28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct
29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac
29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa
29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc
29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga
29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt
29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca
29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca
29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt
29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt
29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca
29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg
29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt
29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct
29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg
29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag
29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct
30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga
30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact
30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt
30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa
30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt
30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg
30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc
30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc
30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag
30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga
30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta
30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata
30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct
30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata
30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa
30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca
30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga
31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg
31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac
31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat
31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga
31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg
31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt
31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt
31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa
31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat
31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg
31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac
31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat
31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg
31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt
31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt
31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac
31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta
32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa
32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag
32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta
32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt
32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt
32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact
32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc
32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat
32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa
32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta
32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca
32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg
32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc
32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt
32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt
32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt
33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc
33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac
33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta
33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac
33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta
33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta
33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata
33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata
33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc
33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat
33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa
33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct
33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc
33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt
33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga
33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct
33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag
34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc
34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat
34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac
34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt
34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga
34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt
34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc
34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt
34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca
34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct
34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac
34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa
34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc
34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc
34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg
34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt
34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac
35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt
35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat
35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta
35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag
35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga
35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc
35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc
35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt
35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag
35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat
35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta
35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat
35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta
35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg
35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt
35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa
36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt
36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag
36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat
36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga
36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa
36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt
36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga
36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct
36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg
36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg
36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga
36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag
36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga
36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta
36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt
36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag
36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta
37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa
37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc
37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc
37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg
37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc
37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg
37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac
37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag
37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg
37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc
37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt
37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc
37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt
37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc
37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc
37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag
37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa
38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat
38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc
38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat
38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga
38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata
38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa
38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag
38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt
38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg
38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc
38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg
38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa
38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg
38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa
38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt
38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg
39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa
39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt
39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg
39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc
39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga
39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga
39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc
39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca
39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac
39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa
39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc
39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg
39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct
39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa
39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct
39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga
39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat
40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg
40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt
40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa
40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct
40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag
40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag
40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc
40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact
40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg
40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg
40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag
40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac
40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag
40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg
40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca
40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca
40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct
41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg
41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc
41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac
41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg
41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct
41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc
41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt
41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag
41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg
41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta
41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac
41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc
41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt
41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata
41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga
41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg
42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt
42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc
42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt
42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt
42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta
42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg
42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct
42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt
42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta
42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag
42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta
42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca
42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa
42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg
42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag
42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac
42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa
43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt
43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc
43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct
43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat
43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat
43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc
43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt
43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc
43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt
43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt
43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc
43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca
43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag
43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg
43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa
43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg
43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt
44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca
44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc
44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta
44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact
44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc
44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca
44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg
44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg
44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc
44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac
44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg
44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta
44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca
44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc
44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact
44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt
45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga
45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt
45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg
45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc
45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta
45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg
45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc
45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac
45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct
45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca
45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa
45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa
45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa
45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat
45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg
45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt
45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt
46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca
46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc
46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca
46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg
46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt
46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag
46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct
46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt
46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc
46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc
46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc
46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa
46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc
46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat
46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg
46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc
46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac
47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac
47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca
47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg
47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca
47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc
47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga
47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc
47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa
47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact
47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag
47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg
47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc
47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa
47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga
47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg
47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga
48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc
48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct
48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc
48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg
48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg
48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt
48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg
48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa
48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat
48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc
48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg
48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga
48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat
48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg
48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc
48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt
48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt
49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg
49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag
49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc
49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct
49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat
49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa
49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc
49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc
49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc
49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc
49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta
49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg
49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa
49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc
49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag
49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct
49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt
50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat
50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc
50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt
50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct
50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga
50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca
50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt
50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca
50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt
50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg
50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca
50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa
50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga
50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct
50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt
50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac
51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca
51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca
51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct
51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca
51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac
51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc
51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac
51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa
51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc
51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct
51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc
51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc
51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga
51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt
51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac
51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg
51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa
52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga
52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag
52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa
52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg
52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat
52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg
52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca
52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa
52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag
52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca
52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt
52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta
52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa
52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa
52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac
52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc
52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca
53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata
53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt
53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag
53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc
53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact
53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc
53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat
53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag
53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca
53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag
53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc
53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc
53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc
53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt
53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta
53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct
54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg
54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac
54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt
54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg
54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc
54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc
54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg
54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga
54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag
54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc
54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta
54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta
54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg
54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta
54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga
54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa
54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca
55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc
55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca
55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc
55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc
55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc
55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta
55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg
55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg
55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt
55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt
55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg
55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat
55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac
55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt
55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa
55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa
55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta
56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg
56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag
56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca
56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt
56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca
56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt
56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct
56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg
56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg
56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa
56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc
56701 gccgcccctg gtgatggacc agcggggctt cgacga
//
LOCUS VSVGLYPA 1657 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 84-PN-H) glycoprotein gene,
complete cds.
ACCESSION M35207
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 84-PN-H, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 482 a 364 c 364 g 447 t
ORIGIN
1 aacagagatc aatctgtttc ctcgacacca tgaagtgcct tttgtactta gctctgttat
61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggtaattgga
121 aaaatgttcc ttccagttac cattactgcc catcaagctc agatttgaat tggcataatg
181 gcctaatagg cacagcctta caggtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac
301 cgaaatatat aacacattcc atccggtcct tcacaccatc cgtggagcaa tgcaaggaaa
361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactccc catcatgtac
481 ttgtcgatga gtacacggga gaatgggttg attcccaatt catcaacggg aaatgcagca
541 atgacatctg ccctactgtg cataactcca caacctggca ttccgattac aaagttaaag
601 ggctgtgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggggagc
661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgct tacgaaactg
721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg
781 tttggttcga aatggcagac aaagatctgt ttgctgcagc aaaattccca gaatgtccag
841 aagggtcaag catatctgct ccatctcaga catcagtgga cgtgagtctc atccaggatg
901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc
961 ttcccatctc tccggtagac ctaagctatc ttgctcctaa aaatccagga accggcccag
1021 ctttcacaat catcaatggt actctgaaat attttgagac caggtacatc agagtcgata
1081 ttgctgctcc aattctccca agaatggttg gaacaatcag tggaactacc acagaaagag
1141 aactgtggga agactgggca ccatatgaag atgtggaaat cgggcctaat ggagttctga
1201 aaaccagttc aggatataaa tttcctttat acatgattgg acatggcatg ctggactctg
1261 atctccatct cagctcgaag gctcaagtgt ttgaacaccc tcacattcag gatgctgcat
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttgtcc aaaaatccaa
1381 tcgagctcgt agaagggtgg ttcagcggtt ggaagagctc cattgcttct tttttcttta
1441 tcataggttt gatcattgga ttattcttgg tcctccgagt tggtatctat ctttgcatca
1501 agttaaaaca gaccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaaccaga atcggacacg acagagtaat catatttaaa ccaaatcgac ttgtgatatc
1621 atgctccaag aggctttaat tatgtttcaa ttttatg
//
LOCUS VSVGLYPB 1657 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 85-CR-B1) glycoprotein gene,
complete cds.
ACCESSION M35208
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-CR-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 487 a 358 c 366 g 446 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagtcttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPC 1657 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 87-CR-B2) glycoprotein gene,
complete cds.
ACCESSION M35209
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-CR-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 489 a 358 c 364 g 446 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPD 1657 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 84-CR-B) glycoprotein gene,
complete cds.
ACCESSION M35210
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 84-CR-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 488 a 357 c 365 g 447 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 gattgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPE 1665 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 87-OA-B) glycoprotein gene,
complete cds.
ACCESSION M35211
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-OA-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 476 a 364 c 358 g 467 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcgtct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta
1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS VSVGLYPF 1665 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 87-VC-B) glycoprotein gene,
complete cds.
ACCESSION M35212
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-VC-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 476 a 362 c 358 g 469 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttaatagg cacagccttg caagttaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcgtct tagctcaaag gctcaggtat ttgaacatcc tcacattcaa gacgctgctt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta
1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS VSVGLYPG 1657 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 84-PN-B1) glycoprotein gene,
complete cds.
ACCESSION M35213
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 84-PN-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 488 a 356 c 366 g 447 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttctggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcggg tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccga
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctatgcatca
1501 acttacggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgtttgaa ttgtatg
//
LOCUS VSVGLYPH 1665 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain Mudd-Summers) glycoprotein gene,
complete cds.
ACCESSION M35214
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain Mudd-Summers, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 477 a 366 c 353 g 469 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacca tgaagtgcct tttgtactta gcttttttat
61 tcatcggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttagtagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtacggac
301 cgaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gctgaagcag cgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca
541 atgacatatg ccccactgtc cataactcca caacctggca ttccgactat aaggtcaaag
601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct aggaaagaag ggcacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg
1021 tctttaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg
1141 tactgtggga tgactgggct ccatatgaag acgtgggaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt ttgaacatcc tcacattcaa gacgctgctt
1321 cgcagcttcc tgatggtgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagtttgt agaaggttgg ttcagtagtt ggaagagctc tattgcctct tttttcttta
1441 ccatagggtt aatcattgga ctattcttgg ttctccgagt tggtatttat ctttgcatta
1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 cgtaactcaa atcctgcaca acagattctt catgtttgaa ccaaatcaac ttgtgatatc
1621 atgctcaaag aggccttaat tatattttaa tttttaattt ttatg
//
LOCUS VSVGLYPI 1665 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 86-DF-P) glycoprotein gene,
complete cds.
ACCESSION M35215
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 86-DF-P, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 477 a 363 c 357 g 468 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctct tttttcttta
1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS VSVGLYPJ 1788 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 82-HD-B) glycoprotein gene,
complete cds.
ACCESSION M35216
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 82-HD-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1788)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 508 a 362 c 344 g 574 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat
1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg
//
LOCUS VSVGLYPK 1788 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 83-HD-B1) glycoprotein gene,
complete cds.
ACCESSION M35217
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 83-HD-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1788)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 508 a 362 c 344 g 574 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat
1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg
//
LOCUS VSVGLYPL 1815 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 83-HD-B2) glycoprotein gene,
complete cds.
ACCESSION M35218
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 83-HD-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1815)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 512 a 358 c 346 g 599 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gattgctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatt agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcactat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggcctaatt catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taattttatt tttttatttt tattttttat tttttatttt ttatttttta
1741 ttttttattt tttattttta ttttttatct ttttattttt tattttttat ttttattttt
1801 atttttattt ttatg
//
LOCUS VSVGLYPM 1665 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain San Juan 56-NM-B) glycoprotein
gene, complete cds.
ACCESSION M35219
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain San Juan 56-NM-B, Indiana
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 484 a 364 c 355 g 462 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttctaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttaatagg cacagccata caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
301 cgaagtatat aacacagtcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gccgaagcag tgattgtcca ggtgactcct caccatgtgc
481 tggttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca
541 attacatatg ccccactgtc cataactcta caacctggca ttctgactat aaggtcaaag
601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct gggaaaggag ggcacagggt tcagaagtaa ctactttgct tatgaaactg
721 gaggcaaggc ctgcaaaatg caatactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtcta attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttccaatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg
1021 ctttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat acatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcaa gacgctgctt
1321 cgcaacttcc tgatgatgag agtttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtagtt ggaaaagctc tattgcctct tttttcttta
1441 tcatagggtt aatcattgga ctattcttgg ttctccgagt tggtatccat ctttgcatta
1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgga ccaaatcaac ttgtgatacc
1621 atgctcaaag aggcctcaat tatatttgag tttttaattt ttatg
//
LOCUS VSVGLYPN 1812 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 85-ES-B1) glycoprotein gene,
complete cds.
ACCESSION M35220
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-ES-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1812)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 512 a 362 c 347 g 591 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt
1741 tttatttttt attttttatt tttatttttt atctttttat ttcttatttt tatttttatt
1801 tttattttta tg
//
LOCUS VSVGLYPO 1861 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 86-ES-B2) glycoprotein gene,
complete cds.
ACCESSION M35221
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 86-ES-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1861)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 520 a 363 c 347 g 631 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
1741 ttttattttt tattttttta ttttttattt tttatttttt atttttattt tttatttttt
1801 tattttttta ttttttattt ttatttttat ttttattttt atttttattt ttatttttat
1861 g
//
LOCUS VSVGLYPP 1802 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain Mudd-Summers 84-GM-B)
glycoprotein gene, complete cds.
ACCESSION M35222
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain Mudd-Summers 84-GM-B, Indiana
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1802)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 506 a 362 c 348 g 586 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta
1741 ttttttattt ttatttttta tctctttatt ttttattttt atttttattt ttatttttta
1801 tg
//
LOCUS VSVGLYPQ 1868 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 87-ES-B) glycoprotein gene,
complete cds.
ACCESSION M35223
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-ES-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1868)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 520 a 362 c 347 g 639 t
ORIGIN
1 aacagagatc aatctgtttt cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat tacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct catcatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catatttcaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt
1801 atttttttat ttttttattt tttattttta tttttatttt tatttttatt tttattttta
1861 tttttatg
//
LOCUS VSVGLYPR 1860 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 86-ES-B1) glycoprotein gene,
complete cds.
ACCESSION M35224
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 86-ES-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1860)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 520 a 362 c 347 g 631 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcta caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt
1801 atttttttat tttttatttt tatttttatt tttattttta tttttatttt tatttttatg
//
LOCUS VSVGLYPS 1657 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 69-PN-L) glycoprotein gene,
complete cds.
ACCESSION M35225
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 69-PN-L, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 487 a 358 c 364 g 448 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcattggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattattgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgccacgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 ctaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca agtgactcct catcatgtat
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggagaac
661 tctcatccct agggaaggaa ggcacgggat tcagaagcaa ttactttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgac aaagatctat ttgctgcagc aaaattcccg gagtgtcccg
841 aaggatcgag tatctccgct ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt ggagcaggtc
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctaaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccacct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg ttcagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttgttcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca ataccacaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPT 1657 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 87-CR-B1) glycoprotein gene,
complete cds.
ACCESSION M35226
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-CR-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 490 a 358 c 363 g 446 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg aatcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPU 1657 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 85-CR-B2) glycoprotein gene,
complete cds.
ACCESSION M35227
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-CR-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 486 a 357 c 367 g 447 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttgga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPV 1796 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 86-GM-B) glycoprotein gene,
complete cds.
ACCESSION M35228
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 86-GM-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1796)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 503 a 362 c 349 g 582 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttcccattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat agtcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta
1741 ttttttattt tttattttta ttttttatct ttttattttt atttttattt tttatg
//
LOCUS VSVGLYPW 1800 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 85-GM-B) glycoprotein gene,
complete cds.
ACCESSION M35229
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-GM-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1800)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 506 a 362 c 348 g 584 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttta ttatttttta ttttttattt
1741 ttatttttta tttttatttt ttcatctttt tattttttat ttttattttt attttttatg
//
LOCUS VSVGLYPX 1652 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 59-PN-L) glycoprotein gene,
complete cds.
ACCESSION M35230
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 59-PN-L, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1652)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 487 a 344 c 367 g 453 t 1 others
ORIGIN
1 aacagagatc aatctgtctt cctagcatca tgaaatgnat tttgtacttc gcttttctcc
61 tggtaggagt gaactgtaaa tttacaatag ttttcccaca taatcaaaaa gggaactgga
121 aaaatgtgcc atccagttac cattattgtc catctagttc cgatcttaat tggcacaatg
181 gcctgattgg tacagcatta caagtcaaga tgcccaaaag tcacaaagca attcaggcag
241 acggctggat gtgtcatgca tccaaatggg tcaccacatg tgacttcagg tggtacggac
301 ccaaatatat tacgcattct attcggtcat tcacaccctc tgtggaacag tgcagagaaa
361 gcattgaaca gaccaaacaa ggaatctgga taaatcctgg ctttcctcct cagagttgtg
421 gatatgcaac tgtgacggac gcggaagcag taattgtcca ggtcacccca catcatgtgc
481 tcgtagatga gtacacaggc gaatgggtag attcccaatt tttgaacgga aagggcaaag
541 acgacatatg ccccacagtc cataattcga ccacatggca ttcagactac aaagtcaaag
601 ggctttgtga tgctaaccta atctcaatgg acatcacttt cttctcagaa gatggtgagc
661 tgtcctcctt gggcaaagat gggactggat tcagaagcaa ttactttgct tatgaaaatg
721 gagataaggc ttgccgcatg cagtattgca aacattgggg agtcagactc ccttctggag
781 tctggttcga gatggcagac aaagatttgt ttgcaggagc caagttcccc gagtgtccag
841 acggaaccag tatctctgct ccttctcaga catccgtaga tgtaagctta attcaggacg
901 tcgaaagaat cttggactat tccttgtgtc aagagacttg gagtaaaatc aaggcaggcc
961 ttccgatttc tccagttgat ctaagctatc tcgctccaaa aaatccaggg acaggtcccg
1021 ctttcaccat catcaatggg accttgaagt actttgagac cagatacatc agagtggaca
1081 ttgctgctcc aatcataccc agaatggtgg gaactatcag tggaacatct acagaaagag
1141 aattgtggga tgactggtac tcttatgaag atgtggagat tgggcccaat ggagtattga
1201 aaacaacctc aggatataaa tttcctcttt atatgattgg acacggaatg ctggactctg
1261 atcttcacct gacatcgaag gcccaggttt ttgaacatcc ccacattcag gacgctgcat
1321 cacagcttcc tgacggcgag actttatttt ttggtgatac cgggctctca aaaaacccca
1381 ttgaattagt agagggatgg ttcagtgggt ggaaaagctc tattgcttcg ttcttctttg
1441 tcattggatt gatcattgga ttatatttgg ttttaagggt gggaatatat ctttgtctta
1501 aactaaagca tacaaaaaag agaaaaattt acacagacat tgagatgaac agacttggaa
1561 aataaccaaa attgtgtgat ggatttgctt ttttgccaaa gaagcctgtg atatccttct
1621 cagagaagct ttaatctttt attgacaata tg
//
LOCUS VSVGLYPY 1657 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 84-PN-B2) glycoprotein gene,
complete cds.
ACCESSION M35231
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 84-PN-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 489 a 367 c 357 g 444 t
ORIGIN
1 aacagagatc catctgtttc ctcgacacca tgaagtgcct tttgtactta gctcttttat
61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggaaattgga
121 aaaatgttcc ttccagttac cattactgcc cgtcaagttc agatttgaac tggcataacg
181 gcctaatagg cacagcccta caggtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac
301 cgaaatatat aacacattcc atccggtcct tcacaccatc tgtggagcaa tgcaaggaaa
361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtaactccc catcatgtac
481 ttgtcgatga gtacacggga gagtgggtag attcccaatt catcaacggg aaatgcagca
541 atgacatatg ccctactgtc cataactcca caacctggca ttccgactac aaagtcaaag
601 ggctgtgtga ttctaacctc atctccatgg acatcacctt cttctcagaa gatggggagc
661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgca tacgaaacag
721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg
781 tctggtttga aatggccgac aaagatctgt ttgctgcagc aaaattccca gaatgtccag
841 aagggtcaag catctctgct ccatctcaga catcagtgga cgtgagtctt attcaggacg
901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc
961 ttcccatctc tccggtagat ctaagctatc ttgctccaaa aaacccagga accgggccag
1021 cctttacaat aatcaatggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattctccca agaatggtcg gaacaatcag tggaactacc accgagagag
1141 aattgtggga cgactgggca ccatatgaag atgttgaaat cggaccaaat ggagttctga
1201 aaaccagttc aggatataaa ttccctttat atatgattgg acatggcatg ctggactctg
1261 atctccatct cagctcaaag gctcaggtat ttgaacaccc tcacattcag gatgctgcat
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaatccaa
1381 tcgagctcgt agaagggtgg ttcagtggtt ggaaaagctc cattgcttct tttttcttta
1441 tcataggttt gatcattgga ttattcttgg ttctccgagt tggtatctat ctctgcatca
1501 agttaaaaca taccaaaaaa agaaaaatct atacagacat tgagatgaat cgactcggaa
1561 agtaactgga atcagacacg acagaataat catatttaaa ccaaatcgac ttgtgatatc
1621 atgctctaag aggctttaat tatgttttaa ttttatg
//
LOCUS VSVGLYPZ 1812 bp ss-RNA VRL 27-JUL-1990
DEFINITION Vesicular stomatitis virus (strain 85-ES-B2) glycoprotein gene,
complete cds.
ACCESSION M35232
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-ES-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1812)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 513 a 360 c 346 g 593 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacttggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat aaagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt
1741 tttatttttt attttttatt tttatttttt atctttttat tttttatttt tatttttatt
1801 tttattttta tg
//
LOCUS CAVXDHB 8062 bp ds-DNA INV 27-JUL-1990
DEFINITION C.vicina xanthine dehydrogenase (Xdh) gene allele 1, exons 2-4.
ACCESSION M30316
KEYWORDS xanthine dehydrogenase.
SOURCE C.vicina third instar larvae fat body DNA (library of Schenkel et
al.), clone Lcv1.
ORGANISM Calliphora vicina
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Muscoidea; Calliphoridae.
REFERENCE 1 (bases 1 to 8062)
AUTHORS Houde,M., Tiveron,M.-C. and Bregegere,F.
TITLE Divergence of the nucleotide sequences encoding xanthine
dehydrogenase in Calliphora vicina and Drosophila melanogaster
JOURNAL Gene 85, 391-402 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Bregegere, 01-DEC-1989.
FEATURES from to/span description
pept / 2538 6299 xanthine dehydrogenase, exon 2 (AA at 2538)
(EC 1.1.1.204)
6367 6534 xanthine dehydrogenase, exon 3
7233 7283 xanthine dehydrogenase, exon 4
pre-msg < 1 7400 xanthine dehydrogenase mRNA and introns
IVS < 1 2537 xanthine dehydrogenase intron A
IVS 6300 6366 xanthine dehydrogenase intron B
IVS 6535 7232 xanthine dehydrogenase intron C
signal 7370 7375 poly-A signal
signal 7391 7396 poly-A signal
BASE COUNT 2516 a 1358 c 1473 g 2715 t
ORIGIN
1 aagcttcaac ctgtaaaata aatttcaata aaatgttata aaatcttata ttttggtaaa
61 aattccattt gaataaaaca aatctagacg aaattataca tacaatcaaa tttatgcact
121 tcagatgttg aaaacagatt ggaaaattac ttattatacg tttcttagtg ttcgatttat
181 ttcgcttagc aataattgca aaatatttga ttgatttgta accataataa catcgttttt
241 aaatatgagt acttacttgt cttatcaact ttgttgttta tatatcaata tatatgtata
301 tttcattaaa aaacaataac aattttaaaa tggcaacttt ttgtgtaacc ttaatgaaaa
361 aaaaaaaata acacataaaa catcttatca atgactgctg tgagaccccc ctgtgacgta
421 cagataataa caatcttaaa actgttatac tgcacttttg taggactaat gaattagtaa
481 acaaataaat aactctgtat aaatatagaa cattttatta tccacatatt agttaaattt
541 gtcttggagg tctaatagct agtttctggg ataataaaat ctacattctt cgtttaaacc
601 tagattaagt aaactacaaa ttttatttct cactaatggc gtaaagagta tatgtaatct
661 aggtttaact gagcattatt gacgagttaa acctatgaaa aaaattattc caattgattt
721 tgtttcttca gattccattt gatttcgaca gttcccaaaa tattttataa ttctgaattt
781 ccaaatttaa tttggttttt ggtaaccccc tgtttatact tgacaaattt tgatcataat
841 aattaatgac gtttgttgcc aagacaaagt tgtcctgcaa tagcactaac aattgtgtca
901 taacgaagca ataatactaa taaatgaaga cattacctga aatttttttt atcttgacaa
961 caattttgac ttttatcatg ataaaaattt atcaggtata gataattatt taaaaatatc
1021 tgaaaaacaa ataattacaa actaataaca attcgacata taagacattc acacgatgaa
1081 ttttcagtca tgcaattgta gcaattctct tttgaatttt cttttactcc acccctcttg
1141 ccaacaaact cataaaatca cttagagtga catgacaaaa caagacatcg taaaaattcc
1201 ttattaaaaa aaaacaagtt agagagctat attcggcagt gccgaatctt atataccctt
1261 caccaaatta tacttcaaaa taaaaatttt aaatattttt aagtaaacaa attttttttt
1321 ccaaagttgt ttttttaagt ttttaggaaa tttttttttt cgaattgtta ttttaaaatt
1381 tttaaacagc tgttacgaac aaaatcaact atgtattgtg aatgaaaagt tcatgagaat
1441 attacgatag caattttccc tttgagggaa cataaatttc acgtattatt gccgataggg
1501 tctattgact agcatgtatt ttgttgttgc aagagttagg ttttggacaa cagacttagg
1561 tttttgactc gtctcgtctt ttattacaca gttttatttt atatttatta ttacagaagt
1621 ttaaaagttt ccattgtttt gccaatgtta agcattatat ccacatatac atgtattttt
1681 gttaattatt ccctatattt taatagaaaa cctggtgaaa ggtcatattt tttaaaactg
1741 cacagagaaa atgattcggt tgcagacata gacattttcg gttctatcaa cagaaaatca
1801 gttgataaag aagaatttcc gttgaagcaa ccaaactttt gttacctctt ctaaaatgtt
1861 gtagccataa ctgtaaaatt cggtcactaa gaatcattcg attggcaaca aattcggttg
1921 ctatcacgaa tctgttttct ctgtgtgaaa gtatggaata ttgttttaaa aaagtttcca
1981 ttttttcgct agtaggtctg ggttaattaa taactcgcgg cttcgtataa cgattcctgt
2041 tctacgtcgt ttacaccaac cgcatgtatt taatttgaac aaatcctggt ggagaccata
2101 aaaacccttc gatactggtg gagtatcatt tcaaactcaa atcgtaaatt ggtagagaac
2161 catataaaac tcgacggatc aactggagca gtatcattat cactcttgat tttaataaca
2221 aggtttaagc attctcactt ctatgaagac atatgtaggt catttaggct tacttaaata
2281 gacatgggat aacctggaaa atttgtgaga cgttatactc aaccaacaaa ctttaaaaca
2341 attttattgt cactctctgt ctaaagcatc gtaatctata catcacttta cttcacaaat
2401 gttacaatac tctccgtaat taatcttaaa tatcatcaat tacacatttt gtttaagtaa
2461 tttaaaaata ttaagttctg cattcttaaa atacagcttt gattcaagaa atataactat
2521 ttgtcttatt ttcctaggta attgatacca atcccgatcc agaatgtacc cttctaacat
2581 atttacgtga aaaactaaga ttatgtggca caaaattggg ctgtggcgag ggtggttgtg
2641 gtgcctgcac cgtcatgatt tctcgcattg atacgctaac gaatcggatt aaacacatag
2701 cggtgaatgc ctgcctgact ccggtatgtg ccatgcatgg tagtgccgtg actaccgtgg
2761 aaggtattgg cagtacgcga actcgattgc atccagtaca agagcgtctg gctaaagccc
2821 acggcagtca gtgtggtttt tgtacacccg gtattgtcat gtctatgtat gcactgttga
2881 gaaatttatc ccagccctcg atgaaagatt tggaaattgc tttccaggga aatttgtgca
2941 gatgcacggg ctatagaccc attttagagg gttataaaac atttactaaa gaattcggct
3001 gtgctatggg tgataaatgt tgtaaggtga atggcaataa atgtggagag ggtatggaaa
3061 atggtggtga catggtcgac gataaactat ttgagaaaag tgaatttgta cctttcgatc
3121 cctcacaaga acccatattt cctccggagc tgcagctgaa taaagattgg gactctcaaa
3181 cgttagtgta taagggcgaa cgtgctactt ggtatagacc tggcaattta gaagatttac
3241 tgaagattaa ggcccaattt ccggaagcca agttggttgt gggcaacact gaaataggcg
3301 tggaggttaa atttaaacat tttctatatc cagtcttggt aaatcccaca aaagttaagg
3361 aaatgattga tgtccaggaa ctcgaggata gtatttattt tggtgcttcc gttagtttaa
3421 tggatataga taggatttta agatccagca tagaaaagct acccgaacat caaacaagat
3481 ttttccaatg tgccgtgaat atgttgcatt attttgctgg taaacaaatt cgtaatgtgg
3541 cgtcattggg tggcaatatt atgacgggta gtccaatatc cgatatgaat ccggtgttga
3601 tggcgggtgc tgttaagttg aaagtggcca agtatgttga gggtcaaata aaatacagag
3661 aagtttgcat ggcatctggc ttctttacgg gttatcgcaa gaatgttata gaacccacag
3721 aaattttggt gggtctgtat ttccccaaga ctttggaaca tcaatatgtg gtggctttca
3781 agcaggccaa acgtagagat gatgatattg ctattgtaaa tgcagccata aatgtattca
3841 ttgatcccag gtccataact gtggataagg tgtatatggc ttttggtggt atggctccta
3901 ccactgtcct ggccacacgc actgccgaca taatggttaa acaacaatgg aataaggtgc
3961 ttatggagcg tgtggtggaa aatctgtgtg cagagttgcc tttggctccc tcagctcctg
4021 gtggcatgat tgcctatcgt cgttccctgg tggtgagctt atttttcaag gcctacttaa
4081 caataaccca acaattaatt aaatctggca ttttgcccca agattcttta ccccaagaag
4141 aactaagcgg ttctgatgtc ttccatactc ctgccttaaa aagtgctcaa ctgtttgaaa
4201 aagtttcaaa caaacaatcg gaatgtgatc ctattggcag accaaagata catgcctccg
4261 cattgaagca ggctaccgga gaagccatct actgcgatga tatgcctcgc atggaaaatg
4321 aattatattt ggcgctggtg ttaagtacaa aagctcatgc gaaaatccta agcattgatg
4381 cttcggaagc tctggcaatg cccggtgttc atgctttctt ctccagcaaa gacataaccc
4441 aacatgaaaa tgaagttggt ccagtgtttc atgatgaaga ggttttcgcc tcagatatgg
4501 tttactgtca aggtcaagtt attggagcta tagcagccga taatccaaac tttagctcaa
4561 agacggctcg taaggttaca atcgaatatg aagatataaa gccagttatt ataactattg
4621 aacaggctat cgagcataaa tcctacttcc ctgattatcc ccgttttacg gaaataggtg
4681 atgtagagaa ggctttcagt gaggccgatc atgtctatga aggttcttgt cgcatgggtg
4741 gccaggaaca tttctatttg gaaacccatg cctctttagc cgtgccccga gattcagatg
4801 aaattgaaat cttctgttca acacaacatc cctctgaggt acaaaaacta gtggcccatg
4861 tattgtcaac ttcagcacat cgagtagtgt gtcgtgccaa acgtttggga ggaggttttg
4921 gtggcaaaga gtctcgtgct atagctgttg ccttgcccgt ggctttggcc tgccatcgtc
4981 ttcgtagacc aatacgttgt atgctcgatc gtgatgaaga tatgatgatc accggtactc
5041 gtcatccctt tttatttaaa tacaaaattg cttttacctc tgaaggtcgt ttaaccggtt
5101 gctatattga gtgctacaat aatgccggtt ggtctatgga tctttccttc tccgtattgg
5161 aaagagccat gtttcatttt gaaaattgct ataaaatacc caacattaag gtgggtggtt
5221 gggtttgtaa aacaaactta ccatcgaaca cggcattccg tggctttggt ggtcctcagg
5281 gcatgtttgc gggagagcat ataatacgtg atgtggcccg catattgggc aaggactatt
5341 tggaaattat gaaacaaaat ttctataaag aaggcgatat aacgcactat caacaaaagt
5401 tggataactt tcccattgag aaatgtttct acgattgcct gcaacagtcg aactattacc
5461 agaaaaggaa agaaattgaa gagtttaatc gtaaccaccg ttggcgcaag cgtggcatat
5521 cattagttcc cacaaaatat ggcatagcct ttggtgtttc acatctcaat caagctggtg
5581 ctttaataaa tatttatgct gatggctcgg tgctgctttc gcatggtggt gtggaaattg
5641 gccaaggttt acataccaaa atgatccagt gttgtgctcg tgctttacaa atacccatag
5701 aattcattca catttccgag actgccactg ataaagtgcc aaatacatca cctacggctg
5761 ctagttcggg ttccgatcta aacggcatgg ccgttctgga tgcttgtgaa aagttaaata
5821 aacgtctggc tccgattaag gaggccaatc cgaatggctc ttggacagaa tggattaata
5881 aagcctattt cgaacgtgtc agcctgtcgg ctaccggctt ctatagaatg cctgatatcg
5941 gctatgatcc tgttcaaaat ccaaacgctc taatgtacaa ctattttacc aatggtgttg
6001 gttcgtcgat agttgaaatc gattgtctaa ctggagatca tcaggtccta agcacagata
6061 ttgttatgga tattggttcg agtttgaatc cggccattga tattggtcaa attgagggag
6121 ccttcatgca aggctacggc ctattcacat tggaagaaat gatttactcg cctcagggtg
6181 tactctattc tcgcggcccc ggcatgtaca aattacctgg ttttgctgat ataccgggag
6241 aatttaatgt gacaattttg acaggagctg ctaatccccg ggcggtttac tcctccaagg
6301 taagagataa tttgtgtttt aattgaagtt gtatgtttta ttatattgtg tttatgttta
6361 tttcaggctg ttggcgaacc tcctttgttt attggctgtt ctgtgttctt tgccattaaa
6421 gaagccatta cgtcggctcg actaatgaat ggcttgagcg aggactttaa attagaatca
6481 ccagctactt cggctcgtat acgtatggct tgtcaggatg agtttacgaa tttggtaaga
6541 aaatattaac aatttgaaat ggctcttatt aaatctttat atattgtttt aatgaagatc
6601 tagttagttt tctataggaa aatttcaatt gattcggcat tatattaaaa acgttggata
6661 caactagttt tctttaggaa atttttggat tcaacttgtt tttattagaa aattatttga
6721 ttcgattaga ttttagactt cttttatatg aaaatttgca ttcgactagt tttttacagc
6781 aagtattctt ttttataccc ttcaccttcg ggagcagggg atatataagt ttgtcattcc
6841 gtttgtaatt tccacaatat aattttccga ccctataaag tcgattaagc catgtccgtc
6901 tgtctatctg tcggcttgcc tgtctgttga aattttctga atacatttga ttaattaatt
6961 cgacatgttt tctgtggcaa acttgagatt cgactaggtt tatttcaaaa acttttttat
7021 tcaattagtt ttcattataa aatttaccat ccaattgttt ttattagaaa atttccgttt
7081 aattataagt tttcattaaa aactattcga ttcgagtagt tgtctgaaga aactgtttgt
7141 tttcctaaag aaaaccttcg attagactag ttttgattaa aaaacctttc gattcgaaca
7201 tttttaatgc caaatttttt ttatttttac agattgagca gccccctgct ggaagttatg
7261 tgccatggaa tattgtacct taaaattaat atttttttta atttattagc aaaaatttag
7321 ctagaaataa acgcaataga catttttttt gtatacaaat aaactgtgaa attaaatgaa
7381 atatttagaa aattaaatat aatactcgta tttgcattta ttacaatgaa atataaaact
7441 gattaaatgg aaaatgaaat gatgtgcaat taaatgttaa tggcagagtc tgtattaaca
7501 gaattgacct caattaaatg atcattactt tgttcattta tcacattggc attatcatct
7561 ccggacaaag tttcatacgg ttgattgtgg ctgaaaagga aatggaaaat tttaacagta
7621 tttcttcaca ccttaaagta tgctttaaaa tgtagataat aaataatttt attaaaagca
7681 ctgggagaaa acgtttaact gttaactact atctacttta ccttggtttt ttttcaattt
7741 aaacaatttt tatttgtttt ttttttaatt caacacatca actactttta tttgtttaat
7801 ttataaaatt ttttcataca aaatttgagc atgcatgcga tacttgatga cttttgtcaa
7861 cttttggcgt tgaattgtat ttaacgatat ttgagttttg aacattcgaa cagctacaaa
7921 cactttttct aaatagtttg ctaaatttag ggcaatgaca ggtaaatgaa ggacgtcttc
7981 taacgctaaa gtaagtcaaa tttcaagtaa ttaaaatatt aatacaaaaa aaaaatttga
8041 gtgtggttta aaagtaagcg ct
//
LOCUS CAVXDHC 3346 bp ds-DNA INV 27-JUL-1990
DEFINITION C.vicina xanthine dehydrogenase (Xdh) gene allele 2, exon 1.
ACCESSION M30488
KEYWORDS xanthine dehydrogenase.
SOURCE C.vicina third instar larvae fat body DNA (library of Schenkel et
al.), clone Lcv2.
ORGANISM Calliphora vicina
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Muscoidea; Calliphoridae.
REFERENCE 1 (bases 1 to 3346)
AUTHORS Houde,M., Theron,M.-C. and Bregegere,F.
TITLE Divergence of the nucleotide sequences encoding xanthine
dehydrogenase in Calliphora vicina and Drosophila melanogaster
JOURNAL Gene 85, 391-402 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Bregegere, 01-DEC-1989.
FEATURES from to/span description
pept 1529 / 1609 xanthine dehydrogenase, exon 1 (AA at 1529)
(EC 1.1.1.204)
pre-msg 1305 > 3346 xanthine dehydrogenase mRNA and introns
IVS < 1610 3346 xanthine dehydrogenase intron A
BASE COUNT 1213 a 444 c 419 g 1270 t
ORIGIN
1 aactgttcga ataatttgaa attgccgatt ttcaaataac aaataaaccg attaatttgt
61 gtcaattaac cgattaaccg aaattccttt tttttgcgct ttaaaatttt tttttgtatt
121 tatttttccg tttcaattca aatacaaatt aaaatttgat attttttgaa catccaataa
181 acatgattaa tgagtatgta taacaactga catgttttga aactttgttt gtatttacat
241 gtatagacta aacttttttg aattgagtct tttatcataa ctaaatgaat ctattaaatt
301 aatccaaatc taaaacaatc caaaaaagaa tattctttat catgtttttg atttctccaa
361 catataaaat cagtgagaga gttttttcca agaaaagttt tattaaatct gaagaaaaaa
421 atcgtttaaa ttatattctt tttataaaag attatttcag aagatctctt gctgcaacaa
481 cgtaataatt aaaaaaaagt cgtttccaaa aaatgtcttt gaatatttta tgacatttta
541 ctatttctta tataaatttt caacaaatca tgcgatttca gaaatataaa tagtagatgt
601 taatatcttt ctaatctgta tttaacccaa atttttactt atgaaatata cgagaaaaca
661 tgaattttaa ttttgatgtt tgcaataata aaaaacactc tcacacaaaa aataattgac
721 ttttttgaaa actatatgaa agattataga tcagcatatt ttgtattact cagttcataa
781 aacacggatt ttgagttatg acgttgttgc agcaaataca aaaagattac aaaactgatc
841 tcaaatacct tatttgctgt tttttatgtt tttgtacaca aaattcgttg ttgtattttc
901 aatttaaaat gaaaatagaa attattcggt taatcgaata attttaaatt aaccgattat
961 taaccgaata aatctaaacc tcgattaatt atttgctcga ttaaccgatt aaaccagaga
1021 cccgattaat tgaataccct acccccatct ttttatgcaa tttttgccct gtaaatgtca
1081 atgcgacaac actaccacta ctgtatattt cttattgtag cgccctcatg agtattaaga
1141 catttactct cgccctcgat tttatttgta ttattctctg gttatcatca cacttgttga
1201 aaatttatac acaaatatct tagcgactct ctaccgtagc cagccggttg tacaattaat
1261 tattaatacc agtataaata aaatttttat attcaaataa ataaataatt tgtattcgac
1321 tctcagttct ccagtttgca acgtgtgatc attgtgtcga gacgtatcgt ttaaaagtgc
1381 ttataatata cgtacaatat ttcatattgc gtttattatc gattgtttga ttaaaaaaat
1441 tgaattgaaa cgaaaataaa gacatcattg caattagtta aataaaaatt gataaaacaa
1501 gtggaattca gacagctaaa cgtttaatat gacgcaagaa cataatgcag cagtcctaga
1561 tttaaatcca acattttcta cattgatatt ttttgttaat ggcaaaaagg taattaatgt
1621 aatagtattt aaaaataatt tatgtatgaa aacagttagt ataatatttg gttaaaacat
1681 aactatacat tgtgattgag ggggggttta gggaacaaat tttattaaca cacatattta
1741 aatacacgaa aattttataa ataaaagata tttgttgatt atttattgag tcaattccct
1801 aatatttgtt gtaatcgtac tcgcattgtt attgtctgtc atacataaaa acttctgtct
1861 gtctgtctca acttctcatc aaacgagtaa tcatcatgaa cacttatacc tggtttgacg
1921 gcataaataa taaaatagag ttgcacagtg aggtcgatcg taaaacatgc taataaatct
1981 gaatcttctg acctatttcg attacaaaat gatgtataaa aaactaactt tgaaaaaata
2041 aattttagta taatttgtat gcaattaaaa tctgctcccc tgagcttgtt ttaggacggg
2101 tgattaatgt ttaaatgtca gattaatatt ctaattaata tgtttattat aatttgtaaa
2161 aaaaatttaa gaactagatg tgacacttag aatttttttt tttttaaatc actattttct
2221 tcctttgcct gaagtagggt acatatcgat tttaaaatct tatcgaaaat gaaaaaatat
2281 tacaattaag ctacattttt gtcaaaatgc atccattttt atatccttca ccatgagtgg
2341 cataagtctg tcattccgat tgtaaaatct acatttttca ttttcgaccc cacaaagaaa
2401 atatattcta gatcgttata catcaatata tcgggaattc ttccggctcg gaaactattt
2461 aaaaccgaaa aaatcggctg agatataagg aataaaccgg gacaacctcg actcttggcc
2521 tatttttgat ttatattttt ttgtcataaa ttaaaaaaag aattaaaaat aaaatttgga
2581 aaaaacttct ttaaacaaaa ttaaaaaaca agttagagag atacatatat tcggccgtgc
2641 atcttatata cccttcacca aattatactt caaaataaaa attttaaata gttttaggta
2701 aacaaaattt attttttttc caaagtagtt tttttttaat tttttggaaa aaaaaatttt
2761 cgaattgttt tttaaaattt taattttttt tttgtttttt taatatttag cgaaaaaaaa
2821 tttttttgtg aaaaaaaaat taacaaaaaa ctatttacat gtcattataa attttttgaa
2881 ataagatttt gacattctcg taaaatcagc ctaaggtgtt ccaacatttt gattaaataa
2941 tatctagttg atttttttgt tgttgttgct tttgtctatc attcaggttc agttgtaaac
3001 attaacgtgt tttgtttgcc ttatcacttg tttgcatttt tagaaaatgt gttttaatta
3061 aattagttta gaaaagtttt ttaaatggaa attattttaa gagcattgtc tggacattta
3121 tagatatata ttattgttaa ttgtattagt taaattttga ataaattgtg gaaaaatatc
3181 agttattttt tggttaaatt aaggaattag tacataaaaa cgaaagatag ctgagaattt
3241 ccatattagc gaatgcttct tggcttccga gtaaaaactc gcccgttagc gaattccatc
3301 atagcgttgt caatgattct tacagaaaca tcaagtatta aatgtt
//
LOCUS RATCGM1A 1104 bp ds-DNA ROD 27-JUL-1990
DEFINITION Rat carcinoembryonic antigen (rnCGM1) gene, partial cds.
ACCESSION M22226 J04626
KEYWORDS carcinoembryonic antigen.
SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1104)
AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
Zimmermann,W.
TITLE Identification of a carcinoembryonic antigen gene family in the
rat: Analysis of the N-termianl domains reveals
immunoglobulin-like, hypervariable regions
JOURNAL J. Biol. Chem. 264, 6906-6912 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
W.Zimmermann 12-JAN-1989.
FEATURES from to/span description
pept / 723 / 1082 carcinoembryonic antigen rnCGM1, exon x (AA at
725)
pre-msg < 1 > 1104 carcinoembryonic antigen rnCGM1, mRNA and
introns
IVS < 1 722 carcinoembryonic antigen rnCGM1, intron y
IVS 1083 > 1104 carcinoembryonic antigen rnCGM1, intron y+1
BASE COUNT 314 a 262 c 232 g 296 t
ORIGIN Unreported.
1 gaattcactc ctcagctctc acagcataga tggacataca gactcctgaa ggctcttctc
61 ttccctccac actggtgtgt gtcacgtacc tgtagtgtgc acactgggac atgtaccttc
121 ccaaaccctc acgaacaata cagaaatatt aaattacact tgaatataat tatttttatg
181 tgctataaac atggaaatta tgtagacaaa cccagagata tcttttcttc cttccttcct
241 tccttcttcc ttccttcctt ccttcctctt tttccatact agtttctgag attttttgag
301 gaactgaacc ttccaaaaag accataccaa tccctgtcct caaaaagcct tttttattct
361 aatggactgg aaatcattgt atccagagga gaaagtcaat gatttagtgg aaccataaat
421 agaacagaaa acattcagga agtgaggatt gtatggagga ggaaaaagag gaggaggagg
481 aggaagagga ggaggaggag gaggaggacc gagagccggt tctccactca ccagacactt
541 tatggaaaga gtgatatggg gacacctgag tagaggattc cacagagagg aaatgacacc
601 ctttgaggtt ctgagggcat ggaggtcatg ctgctcacct ccattaaggg tgcatcctac
661 ctacaggctg agggatgctc acacctgctc aggattgtca acttttctct cttcccttct
721 agcctccctc ttaacctgct ggctcctgcc caccactgcc caagtctcca ttgaatcctt
781 accaccccag gtggttgaag gagaaaatgt tcttctacgt gttgacaatt tgccagagaa
841 tctcatagcc tttgtctggt acaaagggct gacaaacatg agcctcggag ttgcactgta
901 ttcactaacc tataacgtaa ctgtgacggg acctgtgcac agtggtagag agacattgta
961 cagcaatggg tccctgtgga tccaaaatgt cacccagaag gacacaggat tctacaccct
1021 acgaaccata agtaatcatg gagaaattgt atcaaataca tccctgcacc ttcatgtgta
1081 ctgtaagtaa ttctttgtga attc
//
LOCUS RATCGM1B 654 bp ds-DNA ROD 27-JUL-1990
DEFINITION Rat carcinoembryonic antigen rnCGM2 gene, partial cds.
ACCESSION M22227 J04626
KEYWORDS carcinoembryonic antigen.
SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM2-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 654)
AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
Zimmermann,W.
TITLE Identification of a carcinoembryonic antigen gene family in the
rat: Analysis of the N-termianl domains reveals
immunoglobulin-like, hypervariable regions
JOURNAL J. Biol. Chem. 264, 6906-6912 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
W.Zimmermann 12-JAN-1989.
FEATURES from to/span description
pept / 249 / 608 carcinoembryonic antigen rnCGM2, exon x (AA at
251)
pre-msg < 1 > 654 carcinoembryonic antigen rnCGM2, mRNA and
introns
IVS < 1 248 carcinoembryonic antigen rnCGM2, intron y
IVS 609 > 654 carcinoembryonic antigen rnCGM2, intron y+1
BASE COUNT 168 a 169 c 171 g 146 t
ORIGIN Unreported.
1 aatgtaattc ttgttggaga gtgagtgggg agccatgcag acacgggagg agagagaccc
61 gtacaaaagg tcactccagc ttcgggggac tgggaacata gatgatgaag tttccctgca
121 ccaatgagag cgacgccctc accccacacc tcggcagaag atgaacacac ctacctgttc
181 cggacttggg cctcctctca gcgatcacta agcttctgac actgatggag ttttttcctt
241 ctccctagct tccttcttaa cctgctggaa tgcacccgcc gctgccgagc tcactattga
301 attagtgcca cccatggttg ctgaaggcgg aaactccgtt ttgtttgtgc atgaaatgcc
361 attgaatgtc caggcgtttt actggtacaa acagagagat ccgacgaaga gctatgaagt
421 cgcgcggtac ttaacaccca ccaacgaaag ttcgaagatg cctcagcaca gcggccggaa
481 aaccgtattc tacagtggat ccctgctgat cagaaacgtc acccaggccg acagtggagt
541 ctacacctta ctaacattta acacagaaat gcaaagcgaa ttaacacatg tgcatctgga
601 agtacgcggt aggtggttgc gggatctctg ggtgctaggg gtcggggtga gctc
//
LOCUS RATCGM1C 1185 bp ds-DNA ROD 27-JUL-1990
DEFINITION Rat carcinoembryonic antigen rnCGM3 gene, partial cds.
ACCESSION M22228 J04626
KEYWORDS carcinoembryonic antigen.
SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM3-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1185)
AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
Zimmermann,W.
TITLE Identification of a carcinoembryonic antigen gene family in the
rat: Analysis of the N-terminal domains reveals
immunoglobulin-like, hypervariable regions
JOURNAL J. Biol. Chem. 264, 6906-6912 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
W.Zimmermann 12-JAN-1989.
FEATURES from to/span description
pept / 729 / 1088 carcinoembryonic antigen rnCGM3, exon x (AA at
731)
pre-msg < 1 > 1185 carcinoembryonic antigan rnCGM3, mRNA and
introns
IVS < 1 728 carcinoembryonic antigen rnCGM3, intron y
IVS 1089 > 1185 carcinoembryonic antigen rnCGM3, intron y+1
BASE COUNT 312 a 279 c 277 g 317 t
ORIGIN Unreported.
1 ctgcagtgtg cacagcaaga cattgtgctt tcccgaaccc acacgaacac actgaattat
61 taaatcacac ttgaatatat tgatttccct ttgctctgag cctgggcact atgtagataa
121 gtccatggaa atattaatct ttccttcctt ccttccttcc ttccttcctt ccttccttcc
181 ttccttcctt tcttccttct tctagttctt ttcacgtttt cccttttctt tttctctcca
241 atttgtttct aatctatttt caggaactga accttccaaa aagatgattc cagtccctgt
301 cctcacaaag cccttttctt gtggactgga agtcagagta tccagagaaa ggcaatggtt
361 taatggaacc tcaaacagaa cagaaaacaa ttctgagagt gagcattgca tgaggaagag
421 gaggaacggg aagaggagga ggaagaggag gaggaagagg aggaggaaga ggaatgggaa
481 gaggaggagg aagaggagga agaggaggaa gaggtcagac agctgcttca cctctcacca
541 gacactctat gggaagaatg atatggggac acctgagtag aggattcctg gagaggaaat
601 gacagctttt gagtctttga gggcatggag gtcatgctgc tcacctccat taagggtgca
661 tcctacctac aggctgaggg atgctcacac ctgctcagga tcggtgactt ttttctcttc
721 ccttctagcc tcacttttaa cctgctggct cctgcccacc actgcccacg tcaccctcaa
781 gtcctcaccg ccccaggtgg ttgaaggaga aaacgttctt ctaagtgctg acaatctgcc
841 agagaacatt atagctttcg cctggtacaa aggggagacc gacatgaacc gtggaattgc
901 actgtattca ctgaggtata ctgtaagttt gacggggcct gtgcacagtg gtcgagagac
961 attgtacagc gacgggtccc tgtggatcaa aaatgtcacc caggaggaca caggatttta
1021 tacctttcga atcataaata atcatggaaa aattcaatca aatacaaccc tgttccttca
1081 cgtgaaatgt aagtaactct ttgtgaactg tgggttttgg gtggtgtcct tccactagac
1141 acatagaagt atcaggccag ggctgtgtct cccttccccc tgcag
//
LOCUS RATCGM1D 748 bp ds-DNA ROD 27-JUL-1990
DEFINITION Rat carcinoembryonic antigen rnCGM4 gene, partial cds.
ACCESSION M22229
KEYWORDS carcinoembryonic antigen.
SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM4-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 748)
AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
Zimmermann,W.
TITLE Identification of a carcinoembryonic antigen gene family in the
rat: Analysis of the N-terminal domains reveals
immunoglobulin-like, hypervariable regions
JOURNAL J. Biol. Chem. 264, 6906-6912 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
W.Zimmermann 12-JAN-1989.
FEATURES from to/span description
pept / 90 / 434 carcinoembryonic antigen rnCGM4, exon x (AA at
92)
pre-msg < 1 > 748 carcinoembryonic antigen rnCGM4, mRNA and
introns
IVS < 1 89 carcinoembryonic antigen rnCGM4, intron y
IVS 435 > 748 carcinoembryonic antigen rnCGM4, intron y+1
BASE COUNT 197 a 158 c 183 g 210 t
ORIGIN Unreported.
1 gagctctggg aaggcagaag tgtgattttt taaaaaacca acagatttca cctgctcaat
61 atcgatggtt gctctgtctt cccttttagc ctcccttcta acctgttggc tcctgactac
121 tgcccaggtc aacattgaat cggtgccatt caatgtggtt gaaggggaaa acgtccttct
181 tcttgtccac aatctgccag agaatctcat agcctttgcc tggtatagag ggctgaggaa
241 aattggagta tacatactga acactgaagt aagtgtgacg gggccaatgt acagcggtag
301 agagacagtg tacagcaatg gttccctgtg tatccgcaat gtcacccaga aggacacagg
361 attctacact ctacgaacag tcaacacacg tggagaaact gtatcaacaa catccttgta
421 cctctatgtg tacagtaagt gatactttgt gaactctggg tgttgtgtgg ggttcattcc
481 gtagacacac acagaagagg caggcctacc taccctttgc attgtgtctc cttattgagg
541 tgtgaacatt taactcaggc taaggagagt aatgccaatt gaatagaatc cttcttttga
601 ctttaccttg tagtcagctg gatgtgtggt taactcagtg aaggacatca gcccttgtct
661 agacttctgg ggttcttagc agtaatgtgt ccttgggaaa gaccttgagg gaaggagatt
721 gggtttgaat gagatagcca taggatcc
//
LOCUS RATCGM1E 750 bp ds-DNA ROD 27-JUL-1990
DEFINITION Rat carcinoembryonic antigen rnCGM5 gene, partial cds.
ACCESSION M22230
KEYWORDS carcinoembryonic antigen.
SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM5-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 750)
AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
Zimmermann,W.
TITLE Identification of a carcinoembryonic antigen gene family in the
rat: Analysis of th N-terminal domains reveals immunoglobulin-like,
hypervariable regions
JOURNAL J. Biol. Chem. 264, 6906-6912 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
W.Zimmermann 12-JAN-1989.
FEATURES from to/span description
pept / 118 / 477 carcinoembryonic antigen rnCGM5, exon x (AA at
120)
pre-msg < 1 > 750 carcnioembryonic antigen rnCGM5, mRNA and
introns
IVS < 1 117 carcinoembryonic antigen rnCGM5, intron y
IVS 478 > 750 carcinoembryonic antigen rnCGM5, intron y+1
BASE COUNT 180 a 207 c 178 g 185 t
ORIGIN Unreported.
1 agcgtaggca ggagactcca cacctcagct gaccactgga cacagctgct cggactcagg
61 caccatctta gccaaatact aaagtcctga tgttgacgga tctctcttcc cttctagcct
121 ctcttttcat ctgtgggcgt ccttttaacc ctgccaagct cactattgaa tcagtgccgc
181 ccagtgttgc tgaaggggga agcgttcttc tcctcgttca caatctccag gacgagcttc
241 gagggttttt ctggtacaaa ggggcgtcta tgtctagcaa ccatgagata gcccgataca
301 gaacagcaaa gaattcaagt gtgccaggcc ctgcccacag tggtagagag acggtgtaca
361 gcaatggatc cctcctgctc cagaatgtca cccggaatga cactgggttc tacaccctac
421 gcactctgaa aagacatcag aaaatggaat tggcacacgt gcaacttcag gtggacagta
481 agtgattttc cgtgatcgtt cagtgctggg tgggtctttg acacacagga ctgtcacccc
541 tggcatgtgg ctacctcctc tctgcctttt tatccccatg ttgtggttaa ccactatgtg
601 caggacacat gtgatggaaa gaaatgccca tgggtcagac ttatcatctg actctcccct
661 gtatcaagga cagtaactca accctaggtg ctagactctg cccagtcatc tggggcatct
721 tgccatgcaa cgtgaggaaa ccatggatcc
//
LOCUS RATGSTAA 1116 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Rat glutathione S-transferase mRNA, complete cds.
ACCESSION M28241
KEYWORDS glutathione S-transferase.
SOURCE Rat liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1116)
AUTHORS Hsieh,J.-C., Liu,L.-F., Chen,W.-L. and Tam,M.F.
TITLE Expression of Yb1 glutathione S-transferase using a Baculovirus
expression system
JOURNAL Biochem. Biophys. Res. Commun. 162, 1147-1154 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by J.-C.Hsieh, 21-SEP-1989.
FEATURES from to/span description
pept 103 753 glutathione S-transferase (EC 2.5.1.18)
BASE COUNT 268 a 334 c 246 g 268 t
ORIGIN
1 tttgtccggc ccacgtttct ctggtaatct gtataaatcg caacacacag gtcaattctg
61 ctgaagccaa attgagaaga ccacagcgcc agaaccatgc ctatgatact gggatactgg
121 aacgtccgcg ggctgacaca cccgatccgc ctgctcctgg aatacacaga ctcaagctat
181 gaggagaaga gatacgccat gggcgacgct cccgactatg acagaagcca gtggctgaat
241 gagaagttca aactgggcct ggacttcccc aatctgccct acttaattga tggatcgcgc
301 aagattaccc agagcaatgc cataatgcgc taccttgccc gcaagcacca cctgtgtgga
361 gagacagagg aggagcggat tcgtgcagac attgtggaga accaggtcat ggacaaccgc
421 atgcagctca tcatgctttg ttacaacccc gactttgaga agcagaagcc agagttcttg
481 aagaccatcc ctgagaagat gaagctctac tctgagttcc tgggcaagcg accatggttt
541 gcaggggaca aggtcaccta tgtggatttc cttgcttatg acattcttga ccagtaccac
601 atttttgagc ccaagtgcct ggacgccttc ccaaacctga aggacttcct ggcccgcttc
661 gagggcctga agaagatctc tgcctacatg aagagcagcc gctacctctc aacacctata
721 ttttcgaagt tggcccaatg gagtaacaag taggcccttg ctacactggc actcacagag
781 aggacctgtc cacattggat cctgcaggca ccctggcctt ctgcactgtg gttctctctc
841 cttcctgctc ccttctccag ctttgctagc cccatctcct caacctcacc ccagtcatgc
901 ccacatagtc ttcattctcc ccactttctt tcatagtggt ccccttcttt attgacacct
961 taacacaacc tcacagtcct tttctgtgat ttgaggtctg ccctgaactc agtctcccta
1021 gacttacccc aaatgtaaca ctgtctcagt gccagcctgt tcctggtggg ggagctgccc
1081 caggcctgtc tcatctttaa taaagcctga aacacc
//
LOCUS MUSTCGXI 354 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse T-cell receptor gamma-chain mRNA V4-J1 region, 3' end.
ACCESSION M30171
KEYWORDS T-cell receptor beta-chain; joining exon; variable region.
SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Heilig,J.S. and Tonegawa,S.
TITLE Diversity of murine gamma genes and expression in fetal and adult T
lymphocytes
JOURNAL Nature 322, 836-840 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 342 T-cell receptor beta-chain V4-J1 region (AA at
1)
recomb 293 294 V-region DNA end/J-region DNA start
BASE COUNT 118 a 72 c 69 g 95 t
ORIGIN
1 aagttggagc aacctgaaat atcaatttcc agaccaagag atgagactgc acaaatatcc
61 tgtaaagttt tcatcgaaag ctttaggagt gtaaccatac actggtaccg gcagaaacca
121 aaccaaggtt tagagtttct attatatgtc cttgcaaccc ctacccatgt tttcttagat
181 aaggagtaca agaaaatgga ggcaagtaaa aatcctagtg cttctacatc gatattgaca
241 atatattcct tggaggaaga agacgaagct atctactact gttcctacgg ccaagtagct
301 caggttttca caaggtattt gcagaaggaa ctaagctcat agtaattccc tctg
//
LOCUS MUSTCGXJ 353 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse T-cell receptor beta-chain mRNA V2-J2 region, 3' end.
ACCESSION M30172
KEYWORDS T-cell receptor beta-chain; joining exon; variable region.
SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT12.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 353)
AUTHORS Heilig,J.S. and Tonegawa,S.
TITLE Diversity of murine gamma genes and expression in fetal and adult T
lymphocytes
JOURNAL Nature 322, 836-840 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 297 T-cell receptor beta-chain V2-J2 region (AA at
1)
recomb 294 295 V-region DNA end/J-region DNA start
BASE COUNT 131 a 71 c 65 g 86 t
ORIGIN
1 cagctggagc aaactgaatt atcggtcacc agagagacag atgagaatgt gcaaatatcc
61 tgtatagttt atcttccata tttctccaac acagctatac attggtaccg gcaaaaaaca
121 aatcaacagt ttgagtatct aatatatgtc gcaacaaact acaatcaacg acccttagga
181 gggaagcaca aaaaaattga agcaagtaaa gattttaaaa gttctacctc aaccttggaa
241 ataaattact tgaagaaaga agatgaagcc acctactact gtgcagtctg gatatagctc
301 gggctttcac aaggtatttg cagaaggaac aaagctcata gtaattccct ccg
//
LOCUS MUSTCGXK 76 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse T-cell receptor beta-chain mRNA V5-J1 region, 3' end.
ACCESSION M30173
KEYWORDS T-cell receptor beta-chain; joining exon; variable region.
SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT6.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Heilig,J.S. and Tonegawa,S.
TITLE Diversity of murine gamma genes and expression in fetal and adult T
lymphocytes
JOURNAL Nature 322, 836-840 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 21 T-cell receptor beta-chain V5-J1 region (AA at
1)
recomb 18 19 V-region DNA end/J-region DNA start
BASE COUNT 19 a 18 c 16 g 23 t
ORIGIN
1 acgtactact gtgcctgcta gctcaggttt tcacaaggta tttgcagaag gaactagctc
61 atagtaattc cctctg
//
LOCUS MUSTCGXL 926 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse T-cell receptor beta-chain mRNA C1 region, 3' end.
ACCESSION M30174
KEYWORDS T-cell receptor beta-chain; constant region.
SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 926)
AUTHORS Heilig,J.S. and Tonegawa,S.
TITLE Diversity of murine gamma genes and expression in fetal and adult T
lymphocytes
JOURNAL Nature 322, 836-840 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 518 T-cell receptor beta-chain C1 region (AA at 3)
BASE COUNT 267 a 231 c 169 g 259 t
ORIGIN
1 acaaaagact tgatgcagac atttccccca agcccactat tttccttcct tctgttgctg
61 aaacaaatct ccataagact gggacatacc tttgtctcct tgaaaagttc tttccagatg
121 ttataagggt gtattggaaa gaaaaggatg gcaatactat cctggactcc caggaagggg
181 atacgctgaa gactaacgac acatacatga agtttagctg gcttacagtg cccgaaaggg
241 caatggggaa agagcacaga tgtattgtca aacatgagaa caacaaagga ggagcagatc
301 aagagatttt ctttccttca ataaagaaag ttgctgtgag taccaagcct acaacttgct
361 ggcaagataa aaatgatgtg ctgcagctcc agttcacgat cacctctgcc tactacacct
421 accttctcct gctcctcaag agtgtgatct acttggccat catcagcttc tctctgctta
481 gaagaacatc tgtctgtggc aatgagaaga aatcctaaag aaacgagtgg tggtacagca
541 agtcagctgg atttcatcct cactgccata aaggtgcctt aacgaggtcc agctgccttc
601 acttgttggc tttcacttct ataaagtccc tcactcatgt tgcataaaca ttttctgaac
661 gtttgtgtgc aatttctgca acttctctct aactcacatc acctcattcc tgattccatc
721 cactgcagaa gtcccctccc ccagaagtct gaagcatcaa aattctagta cccatagcct
781 acagctttta cccatagccc ttgttgctct tgaaccagct aactccatga agacccccac
841 tctagacaac ctgaagagcc ttctttactg tttgattctg tgccttgtat ttcacattat
901 ggacaatata ttttttaaaa attaaa
//
LOCUS MUSTCGXM 913 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse T-cell receptor beta-chain mRNA C2 region, 3' end.
ACCESSION M30175
KEYWORDS T-cell receptor beta-chain; constant region.
SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT12.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 913)
AUTHORS Heilig,J.S. and Tonegawa,S.
TITLE Diversity of murine gamma genes and expression in fetal and adult T
lymphocytes
JOURNAL Nature 322, 836-840 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 503 T-cell receptor beta-chain C2 region (AA at 3)
BASE COUNT 263 a 223 c 168 g 259 t
ORIGIN
1 acaaaaggct tgatgcagac atttccccca agcccactat ttttcttcct tctgttgctg
61 aaacaaatct ccataagact gggacatacc tttgtctcct tgaaaagttc tttcccgatg
121 tcataagggt gtattggaaa gaaaagaatg gcaatactat cctggactcc caggaagggg
181 atacgctgaa gactaagggc acatacatga agtttagctg gcttactgtg cccgaaaggg
241 caatggggaa agagcacagt tgtattgtca aacatgagaa caacaaagga ggagcagatc
301 aagagatttt cttcccttca ataaagaaag ttgctacaac ttgctggcaa gataaaaatg
361 atgtgctgca gttccagttc acgagcacct ctgcctacta cacctacctc ctcctgctcc
421 tcaagagtgt gatctacttg gccatcatca gcttctctct gcttagaaga acatctgtct
481 gtggcaatga gaagaagtcc taaagaaacg agtggtggta cagcaagtca gctggatttc
541 atcctcactg ccataaaggt gccttaacgg ggaaccagat gccttctctt gttggctttc
601 acttctataa agtccctcac tcatgttgca taaacatttt ctgaactttt gtatgcaatt
661 tcagcaactt tttttaaact gaactcacct tcttcctgat tccatccact ccagaagtcc
721 cctcccccag aagcctgaaa cattaaaatt ctagtaccca tagcctacag cttttaccca
781 tggcccttgt tgctcttgaa ccagctaact ccatgaagac ccccactcta gacaacctga
841 agagccttct ttactgtttg attctgtgcc ttgtatttca gattattgac aataaatttt
901 ttaaaaaatt aaa
//
LOCUS DVUHYDAB 3180 bp ds-DNA BCT 27-JUL-1990
DEFINITION D.fructosovorans periplasmic hydrogenase (hydA and hydB) gene,
complete cds.
ACCESSION M35333
KEYWORDS periplasmic hydrogenase.
SOURCE D.fructosovarans (strain DSM 3604) DNA, clone DHS-alpha-[pHH7].
ORGANISM Desulfovibrio fructosovorans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 3180)
AUTHORS Rousset,M., Dermoun,Z., Matchikian,C.E. and Belaich,J.P.
TITLE Cloning and sequencing of the locus encoding the large and small
subunit genes of the periplasmic [Ni,Fe] hydrogenase Desulfovibrio
fructosovorans
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Rousset, 15-JUN-1990.
FEATURES from to/span description
pept 301 1242 periplasmic hydrogenase-alpha subunit
precursor (hydA) (EC 1.18.99,1)
sigp 301 450 periplasmic hydrogenase-alpha subunit signal
peptide
matp 452 1239 periplasmic hydrogenase
pept 1314 3008 periplasmic hydrogenase-beta subunit (hydB)
mRNA 129 3060 periplasmic hydrogenase mRNA
signal 97 102 -35 region
signal 120 125 -10 region
binding 289 293 ribosome binding site for hydA
binding 1303 1307 ribosome binding site for hydB
signal 3025 3060 termination signal
BASE COUNT 660 a 1150 c 866 g 504 t
ORIGIN
1 cgtgctgcat acgatttctt tattcatatc ctttacttcc ccggcctcat caccccggta
61 acacccttga atcacgtgaa aaatttgcca aaccccattg acgttcaaac aaactcgtga
121 ttaactgcaa aagggaaatc gaacccggcg acgcaggaac gcacgcgtcc gtcacataga
181 ggcggcggcg gccggttcgc ctgcgacggc cgcgtgccga acgggtcaac ggatgctacg
241 tggcctgggg atcatgggac gccccggacc atgcctctcg taaaacaaag gaggacgttt
301 atgaactttt ccgtgggtct tggcaggatg aatgcggaaa aacggcttgt gcaaaacggc
361 gtctcccgcc gcgacttcat gaaattttgc gccaccgtgg ccgcggccat gggcatgggc
421 ccggcgttcg cgcccaaggt cgccgaagca ttgacggcca aacaccgtcc gtcggtggtc
481 tggctgcaca acgccgagtg caccggctgc accgaagcgg cgatccggac gatcaaacct
541 tatatagacg cgctcattct cgacaccatc tccctggatt accaggagac catcatggcc
601 gcggccggcg agacgtccga ggcggccctg caccaggccc tcgaaggcaa ggacggctac
661 tacctcgtgg tcgagggcgg cctgcccacc atcgacggcg gccagtgggg catggttgcc
721 ggccatccca tgatcgagac caccaagaag gccgcggcca aggccaaggg catcatctgc
781 atccggcacc tgcctcacgg cggcgtccag aaggccaaac ccaatcccag ccaggccaag
841 ggcgtgtccg aagccctcgg cgtcaagacc atcaacatcc ccggctgccc gcccaacccc
901 atcaacttcg tgggcgccgt ggtccatgtc ctgaccaagg gcatcccgga tctcgacgag
961 aacggccgtc cgaagctctt ctacggcgag ctggtccacg acaactgtcc gcgcctgccc
1021 cacttcgagg cctccgaatt cgcgccctcc ttcgattccg aagaggccaa gaaaggcttc
1081 tgcctctacg aactcggctg caagggcccc gttacctaca acaactgccc caaggtgctg
1141 ttcaaccagg tcaactggcc cgtccaggcc ggccacccct gcctcggctg cagcgagccg
1201 gacttctggg acaccatgac gccgttctac gagcagggct aaccccctcc tttgtagcgg
1261 ccagtacccg gacaccttca acagcaccga acgtctcgtg acggaggaag catatggctg
1321 agagcaaacc cacgccacaa tccaccttca ccggccccat cgtggtcgac cccattaccc
1381 ggatcgaagg tcacttgcgg atcatggtcg aggtggaaaa cggtaaggtc aaggacgcct
1441 ggagctcctc gcagctcttc cgcggcctgg aaatcatcct caaaggccgc gatccccgcg
1501 acgcccagca cttcacccag cgcgcctgcg gcgtgtgcac gtacgtccac gccctggcct
1561 ccagccgctg cgtcgatgac gccgtcaagg tcagcatccc ggccaacgcc cgcatgatgc
1621 gcaacctggt catggcctcc cagtatctcc atgaccacct cgtccacttc tatcacctgc
1681 acgccctcga ctgggtcgac gtgaccgccg ccctcaaggc cgatcccaac aaggccgcca
1741 aactggcagc ctccatcgac acggctcgga ccggcaactc ggaaaaggcc ctcaaggcag
1801 tccaggacaa gctgaaagcc ttcgtcgagt ccggacagct cggcatcttc accaacgcct
1861 acttcctcgg cggccacaaa gcctactacc tgccgcccga ggtcaacctc atcgccaccg
1921 cccactacct ggaagccctg cacatgcagg tcaaggcggc cagcgccatg gccatcctcg
1981 gcggcaagaa cccccacacc cagttcaccg tcgtgggcgg ctgctccaac taccagggcc
2041 tgaccaagga cccgctggcc aactacctgg ccctgagcaa ggaagtctgc cagttcgtca
2101 acgagtgcta catccctgac ctgctggccg tggccggctt ctacaaggac tggggcggca
2161 tcggcggcac cagcaactac ctggccttcg gcgagttcgc caccgacgac agctcccccg
2221 agaaacacct ggccacctcg cagtttccgt ccggcgtcat caccggccgc gacctcggca
2281 aggtggataa cgtggacctc ggcgccatct acgaagacgt caagtactcc tggtacgccc
2341 ccggcggcga cggcaagcac ccctacgact gcgtcaccga tcccaagtac accaagctcg
2401 acgacaagga ccactactcc tggatgaagg ccccccgcta caagggcaag gccatggaag
2461 tcggtccctt ggcccgcacc ttcatcgcct acgccaaggg gcagcccgac ttcaaaaagg
2521 tcgtggacat ggtcctcggc aaactctccg tcccggccac ggccctgcat tcgaccctcg
2581 gacgcaccgc cgcccgcggc atcgagaccg ccatcgtctg cgccaacatg gagaagtgga
2641 tcaaggaaat ggccgacagc ggcgccaagg acaacaccct gtgcgccaag tgggagatgc
2701 ccgaggagtc caagggcgtc ggcctggccg atgctccccg cggctccctg tcccactgga
2761 tccgcatcaa gggcaagaag atcgacaact tccagctggt tgtcccctcg acctggaacc
2821 tcggtccccg ggggccccag ggcgacaaga gcccggtgga agaggccctt atcggcacgc
2881 ccatcgccga tcccaaacgc ccggtcgaaa tcctgcgcac ggtccacgcc ttcgacccct
2941 gcatcgcctg cggcgtgcac gtcatcgagc ccgagaccaa cgagatcctc aagttcaagg
3001 tttgctaaga gcggcaaccc tctaatacga aggacccgct tcgggccggg ttcttcgttt
3061 ctcaaggagg tcgcatgtcc gataccccgc gcaaaatcct catcctcggc ctcggcaaca
3121 tcctctacac cgacgagggc gtcggcgtgc gggccgtgga gcgccttctc gaaacccacg
//
LOCUS MTFRECA 1349 bp ds-DNA BCT 27-JUL-1990
DEFINITION M.flagellatum recA gene, complete cds.
ACCESSION M35325
KEYWORDS co-protease; recombinase.
SOURCE M.flagellatum (strain MFK1) DNA.
ORGANISM Methylobacillus flagellatum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Methylococcaceae.
REFERENCE 1 (bases 1 to 1349)
AUTHORS Gomelsky,M., Bolstin,A. and Tsygankov,Y.D.
TITLE The nucleotide sequence and regulation of recA gene from
Methylobacillus flagellatum
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.V.Gomelsky, 19-JUN-1990.
Author address:M.V.Gomelsky
Institute of Genetics and Section of Industrial Microorganisms
1st Dorozhry pr-d,1
Moscow 113545 U.S.S.R.
FEATURES from to/span description
pept 293 1327 RecA protein
site 1 292 LexA repressor binding site
BASE COUNT 320 a 413 c 386 g 230 t
ORIGIN
1 aggagatggc cctgggcgcg ctggcctaca gcaccgccga cattagtgct tcctacacgg
61 gcatcgctgg gccgatggcg gaagcgagca aaaaccggtc ggtaccgtat gctttggctg
121 ggcccatcgc gatggccgga tcatcaccca aacccaccac ttcctgggcg agagacgccg
181 tgcgcagaca gtctgtcaaa accgcattaa ccggcttgtt gcagcttact ttacccactg
241 atttatgaaa taattgcagg ctttagcagg caactcgaaa aggtattcac tgatggatga
301 aaacagaagc aaagcgctag ccgccgccct ctcgcaaatc gagaaacaat ttggcaaagg
361 ctctatcatg cgcatgggcg acaccgatgt cgctgctgac atccaggcag tttctaccgg
421 ctccctgggc ctggacatcg ccctgggtat tggcggcttg ccgcgcggcc gcatcgtcga
481 gatttacggc ccggaatcct ccggcaagac cacgctgaca ttgtctgtga tcgcgcagat
541 gcaaaagctg ggcggcacgg cagcattcat cgacgcagaa cacgcgctcg acccggtcta
601 cgcgcagaag ctcggtgtca atgtttccga tctgctcatc tcccagcccg acaccggcga
661 acaggcgctc gagatcgccg acatgctggt gcgttccggc tcggtagatg tcgtggtggt
721 cgactccgtc gctgcgctga cgcccaaggc cgaaatcgaa ggcgaaatgg gcgactcgca
781 catgggcctg caggcccgcc tgatgtcgca ggcactgcgc aagctcaccg ccaacatcaa
841 gcgcaccaac accttggtca tcttcatcaa ccagatccgc atgaagatcg gcctgatgtt
901 cggcaacccc gagaccacca ccggcggcaa tgcactcaag ttctacgcct ccgtgcgcct
961 ggacatccgc cgcaccggcg cgatcaagaa gggggacgag gtcacgggat cagaaacccg
1021 ggtcaaggta gtgaagaaca acgtggcccc tccgttcaag ctagcggaat tcgacatcct
1081 ctatggcgaa ggtatctcgc gcgagggcga aatcatcgaa ctcggcgtca accttaaatt
1141 gatcgagaag gccggtgcct ggtacagcta caagggcgag aagatcggcc agggcaagga
1201 caacgcgcgc gagttcctca gggagcaccc ggaaatcgcc aatgagatcg acgccaagat
1261 tcgcgagcat tccaatctgg ccaacgccgc gatgacgacg gcaccggatg aagaaagcga
1321 cgaataagcc taccaccatg aatgcgaga
//
LOCUS CHKIGLAO 329 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION M34804
KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
processed gene; variable region.
SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 2.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 329)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 34 > 329 Ig lambda-chain V1-J region, exon N precursor
(AA at 36)
sigp / 34 50 Ig lambda-chain V1-J region, exon N signal
peptide (AA at 36)
matp 51 > 329 Ig lambda-chain V1-J region, exon N
IVS < 1 33 Ig lambda-chain V1-J region intron X
BASE COUNT 65 a 107 c 82 g 75 t
ORIGIN
1 ccgttttctc ccctctttcc tcctcctctc caggttccct ggtgcaggca gcactgactc
61 agccggcctc ggtgtcagca aacctgggag aaaccgtcga gatcacctgc tccgggggta
121 gtggcagcta cggctggttc cagcagaagt ctcctggcag tgcccctgtc actctgatct
181 atgctaacac caacagaccc tcagacatcc cttcacgatt ctccggttcc aaatccggct
241 ccacgggcac attaaccatc actggggtcc aagccgagga cgaggctgtc tatttctgtg
301 ggagcaggga cagcaactat gttggtata
//
LOCUS CHKIGLAP 332 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION M34805
KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
processed gene; variable region.
SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 36.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 332)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 34 > 332 Ig lambda-chain V1-J region, exon N precursor
(AA at 36)
sigp / 34 50 Ig lambda-chain V1-J region, exon N signal
peptide (AA at 36)
matp 51 > 332 Ig lambda-chain V1-J region, exon N
IVS < 1 33 Ig lambda-chain V1-J region intron X
BASE COUNT 70 a 110 c 79 g 73 t
ORIGIN
1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc
61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccgggagta
121 gtggcagcta ctatggctgg taccagcaga aggcacctgg cagtgcccct gtcactgtga
181 tctatgacaa caccaacaga ccctcggaca tcccttcacg attctccggt tccaaatccg
241 gctccacggc cacattaacc atcactgggg tccaagccga cgacgaggct gtctatttct
301 gtgggagtac agacagcagc tatattggta ta
//
LOCUS CHKIGLAQ 335 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION M34806
KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
processed gene; variable region.
SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 24.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 335)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 34 > 335 Ig lambda-chain V1-J region, exon N precursor
(AA at 36)
sigp / 34 50 Ig lambda-chain V1-J region, exon N signal
peptide (AA at 36)
matp 51 > 335 Ig lambda-chain V1-J region, exon N
IVS < 1 33 Ig lambda-chain V1-J region intron X
BASE COUNT 71 a 110 c 77 g 77 t
ORIGIN
1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc
61 agccggcctc ggtgtcagca aacctgggag aaaccgtcaa gatcacctgc tcagggagta
121 gcaatactta ttactattat ggctggttcc agcagaagtc tcctggcagt gcccctgtca
181 ctgtgatcta taacaacgac aagagaccct cggacatccc ttcacgattc tccggtgccc
241 tatccggctc cacagccaca ttaaccatca ctggggtcca agccgacgac gaggctgtct
301 atttctgtgg gagcaacgac agcagtactg ctgca
//
LOCUS CHKIGLAR 330 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION M34807
KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
processed gene; variable region.
SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 1.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 330)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 29 > 330 Ig lambda-chain V1-J region, exon N precursor
(AA at 31)
sigp / 29 45 Ig lambda-chain V1-J region, exon N signal
peptide (AA at 31)
matp 46 > 330 Ig lambda-chain V1-J region, exon N
IVS < 1 28 Ig lambda-chain V1-J region intron X
BASE COUNT 71 a 101 c 80 g 78 t
ORIGIN
1 ccccattttt tctcctctcc ttctccaggt tccctggtgc aggcagcgct gactcagccg
61 gcctcggtgt cagcaaacct gggagaaacc gtcgagatca cctgctccgg gggtagcagc
121 gactatggct ggtaccagca gaagtcacct ggcagtgccc ctctcactgt gatctataat
181 ggcaataaca gaccctcaaa catcccttca cgattctccg gttccctatc cggctccaca
241 aacacattaa ccatcactgg ggtccaagtc gaggacgagg ctgtctattt ctgtgggagt
301 gcagacagca gtgctggtta tgctgatata
//
LOCUS CHKIGLAS 332 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION M34808
KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
processed gene; variable region.
SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 126.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 332)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 34 > 332 Ig lambda-chain V1-J region, exon N precursor
(AA at 36)
sigp / 34 50 Ig lambda-chain V1-J region, exon N signal
peptide (AA at 36)
matp 51 > 332 Ig lambda-chain V1-J region, exon N
IVS < 1 33 Ig lambda-chain V1-J region intron X
BASE COUNT 70 a 106 c 79 g 77 t
ORIGIN
1 ccgttttctc ccctctctcc tctccctctc caggttccct ggtgcaggca gcgctgactc
61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccgggggtt
121 acagcgacta tggctggttc cagcagaagt ctcctggcag tgcccctgtc actctgatct
181 atgctaacac caacagaccc tcgaacatcc cttcacgatt ctccggttcc aaatctggct
241 ccacaaacac attaaccatc actggggtcc aagccgagga cgaggctgtc tattactgtg
301 ggagctggga tagcagtggt aatagtggta ta
//
LOCUS CHKIGLAT 108 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION M34809
KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
processed gene; variable region.
SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 34.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 108)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 108 Ig lambda-chain V1-J region, exon N (AA at 1)
BASE COUNT 22 a 27 c 35 g 24 t
ORIGIN
1 ggatccggct ccacgggcac attaaccatc actggggtcc gagccgagga cgaggctgtc
61 tattactgtg gtggctacga cggcagcagt gcttatgctg gtgatata
//
LOCUS CHKIGLAU 529 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds.
ACCESSION M34811
KEYWORDS immunoglobulin light-chain; joining exon; processed gene.
SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 2.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 529)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 / 37 Ig light-chain J region, exon N (AA at 3)
IVS 38 > 529 IgL intron X
BASE COUNT 94 a 123 c 155 g 157 t
ORIGIN
1 ggtatatttg gggccgggac aaccttgacc gtcctaggtg agtcgctgac ctcgtctcgg
61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt
121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS CHKIGLAV 529 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds.
ACCESSION M34812
KEYWORDS immunoglobulin light-chain; joining exon; processed gene.
SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 1.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 529)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 / 37 Ig light-chain J region, exon N (AA at 3)
IVS 38 > 529 IgL intron X
BASE COUNT 95 a 124 c 154 g 156 t
ORIGIN
1 gatatatttg gggccgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg
61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt
121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS CHKIGLAW 529 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds.
ACCESSION M34813
KEYWORDS immunoglobulin light-chain; joining exon; processed gene.
SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 126.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 529)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 / 37 Ig light-chain J region, exon N (AA at 3)
IVS 38 > 529 IgL intron X
BASE COUNT 94 a 123 c 155 g 155 t 2 others
ORIGIN
1 ggtanntttg gggacgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg
61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt
121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS CHKIGLAX 529 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds.
ACCESSION M34814
KEYWORDS immunoglobulin light-chain; joining exon; processed gene.
SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 34.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 529)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 / 37 Ig light-chain J region, exon N (AA at 3)
IVS 38 > 529 IgL intron X
BASE COUNT 97 a 126 c 152 g 154 t
ORIGIN
1 gatatattag gggccgggac aaccctgacc gtcctaggtg agtcactgac ctcgtctcgg
61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat tcttggtgat ttgggggttt
121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
361 tttgtcagga aattacttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS CHKIGLJA 529 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig germline light-chain gene J region, partial cds.
ACCESSION M34810
KEYWORDS germline; immunoglobulin light-chain; joining exon.
SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone B19-Eco.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 529)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 / 37 Ig light-chain J region, exon N (AA at 3)
IVS 38 > 529 IgL intron X
BASE COUNT 94 a 124 c 155 g 156 t
ORIGIN
1 ggtatatttg gggccgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg
61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt
121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS CHKIGLVE 335 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken Ig germline lambda-chain gene V1-J region, partial cds.
ACCESSION M34803
KEYWORDS germline; immunoglobulin light-chain; joining exon;
lambda-immunoglobulin; variable region.
SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone B19/g1.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 335)
AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
TITLE Somatic diversification of chicken immunoglobulin light chains by
point mutations
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept / 34 > 335 Ig lambda-chain V1-J region, exon N precursor
(AA at 36)
sigp / 34 50 Ig lambda-chain V1-J region, exon N signal
peptide (AA at 36)
matp 51 > 335 Ig lambda-chain V1-J region, exon N
IVS < 1 33 Ig lambda-chain V1-J region intron X
BASE COUNT 70 a 111 c 82 g 72 t
ORIGIN
1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc
61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccggggata
121 ggagctacta tggctggtac cagcagaagg cacctggcag tgcccctgtc actgtgatct
181 atgctaacac caacagaccc tcggacatcc cttcacgatt ctccggttcc aaatccggct
241 ccacagccac attaaccatc actggggtcc aagccgacga cgaggctgtc tattactgtg
301 ggagtgcaga cagcagcagt actgctggtg gtata
//
LOCUS CH1L1CG 7502 bp ds-DNA circular BCT 27-JUL-1990
DEFINITION Plasmid pCHL1 (from C.trachomatis) virulence proteins pGP1-D,
pGP2-D, pGP3-D, pGP4-D, pGP5-D, pGP6-D, pGP7-D, and pGP8-D,
complete cds.
ACCESSION J03321
KEYWORDS complete genome; virulence protein.
SOURCE Plasmid pCHL1 (from Chlamydia trachomatis trachoma biovar, serotype
D) DNA.
ORGANISM Plasmid pCHL1
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 7502)
AUTHORS Comanducci,M., Ricci,S., Cevenini,R. and Ratti,G.
TITLE Diversity of the Chlamydia trachomatis common plasmid in biovars
with different pathogenicity
JOURNAL Plasmid 23, 149-154 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Ratti, 28-MAR-1990.
FEATURES from to/span description
pept 1480 488 (c) pGP8-D
pept 1579 2934 pGP1-D
pept 2928 3992 pGP2-D
pept 4054 4848 pGP3-D
pept 4918 5226 pGP4-D
pept 5317 6048 pGP5-D (gtg start codon)
pept 6045 6788 pGP6-D (gtg start codon)
pept 7022 7502 pGP7-D
1 437 pGP7-D
rpt 6857 6945 four tandem 22bp repeats
BASE COUNT 2460 a 1285 c 1433 g 2324 t
ORIGIN
1 ggatccgtaa gttagacgaa attttgtctt tgcgcacaga cgatctattt tttgcatcca
61 atcagatttc ctttcgcatt aaaaaaagac agaataaaga aaccaaaatt ctaatcacat
121 ttcctatcag cttaatggaa gagttgcaaa aatacacttg tgggagaaat gggagagtat
181 ttgtttctaa aatagggatt cctgtaacaa caagtcaggt tgcgcataat tttaggcttg
241 cagagttcca tagtgctatg aaaataaaaa ttactcccag agtacttcgt gcaagcgctt
301 tgattcattt aaagcaaata ggattaaaag atgaggaaat catgcgtatt tcctgtcttt
361 catcgagaca aagtgtgtgt tcttattgtt ctggggaaga ggtaattcct ctagtacaaa
421 cacccacaat attgtgatat aattaaaatt atattcatat tctgttgcca gaaaaaacac
481 ctttaggcta tattagagcc atcttctttg aagcgttgtc ttctcgagaa gatttatcgt
541 acgcaaatat catctttgcg gttgcgtgtc ctgtgacctt cattatgtcg gagtctgagc
601 accctaggcg tttgtactcc gtcacagcgg ttgctcgaag cacgtgcggg gttattttaa
661 aagggattgc agcttgtagt cctgcttgag agaacgtgcg ggcgatttgc cttaacccca
721 ccatttttcc ggagcgagtt acgaagacaa aacctcttcg ttgaccgatg tactcttgta
781 gaaagtgcat aaacttctga ggataagtta taataatcct cttttctgtc tgacggttct
841 taagctggga gaaagaaatg gtagcttgtt ggaaacaaat ctgactaatc tccaagctta
901 agacttcaga ggagcgttta cctccttgga gcattgtctg ggcgatcaac caatcccggg
961 cattgatttt ttttagctct tttaggaagg atgctgtttg caaactgttc atcgcatccg
1021 tttttactat ttccctggtt ttaaaaaatg ttcgactatt ttcttgttta gaaggttgcg
1081 ctatagcgac tattccttga gtcatcctgt ttaggaatct tgttaaggaa atatagcttg
1141 ctgctcgaac ttgtttagta ccttcggtcc aagaagtctt ggcagaggaa acttttttaa
1201 tcgcatctag gattagatta tgatttaaaa gggaaaactc ttgcagattc atatccaagg
1261 acaatagacc aatcttttct aaagacaaaa aagatcctcg atatgatcta caagtatgtt
1321 tgttgagtga tgcggtccaa tgcataataa cttcgaataa ggagaagctt ttcatgcgtt
1381 tccaatagga ttcttggcga atttttaaaa cttcctgata agacttttca ctatattcta
1441 acgacatttc ttgctgcaaa gataaaatcc ctttacccat gaaatccctc gtgatataac
1501 ctatccgtaa aatgtcctga ttagtgaaat aatcaggttg ttaacaggat agcacgctcg
1561 gtattttttt atataaacat gaaaactcgt tccgaaatag aaaatcgcat gcaagatatc
1621 gagtatgcgt tgttaggtaa agctctgata tttgaagact ctactgagta tattctgagg
1681 cagcttgcta attatgagtt taagtgttct catcataaaa acatattcat agtatttaaa
1741 cacttaaaag acaatggatt acctataact gtagactcgg cttgggaaga gcttttgcgg
1801 cgtcgtatca aagatatgga caaatcgtat ctcgggttaa tgttgcatga tgctttatca
1861 aatgacaagc ttagatccgt ttctcatacg gttttcctcg atgatttgag cgtgtgtagc
1921 gctgaagaaa atttgagtaa tttcattttc cgctcgttta atgagtacaa tgaaaatcca
1981 ttgcgtagat ctccgtttct attgcttgag cgtataaagg gaaggcttga tagtgctata
2041 gcaaagactt tttctattcg cagcgctaga ggccggtcta tttatgatat attctcacag
2101 tcagaaattg gagtgctggc tcgtataaaa aaaagacgag tagcgttctc tgagaatcaa
2161 aattctttct ttgatggctt cccaacagga tacaaggata ttgatgataa aggagttatc
2221 ttagctaaag gtaatttcgt gattatagca gctagaccat ctatagggaa aacagcttta
2281 gctatagaca tggcgataaa tcttgcggtt actcaacagc gtagagttgg tttcctatct
2341 ctagaaatga gcgcaggtca aattgttgag cggattattg ctaatttaac aggaatatct
2401 ggtgaaaaat tacaaagagg ggatctctct aaagaagaat tattccgagt agaagaagct
2461 ggagaaacgg ttagagaatc acatttttat atctgcagtg atagtcagta taagcttaac
2521 ttaatcgcga atcagatccg gttgctgaga aaagaagatc gagtagacgt aatatttatc
2581 gattacttgc agttgatcaa ctcatcggtt ggagaaaatc gtcaaaatga aatagcagat
2641 atatctagaa ccttaagagg tttagcctca gagctaaaca ttcctatagt ttgtttatcc
2701 caactatcta gaaaagttga ggatagagca aataaagttc ccatgctttc agatttgcga
2761 gacagcggtc aaatagagca agacgcagat gtgattttgt ttatcaatag gaaggaatcg
2821 tcttctaatt gtgagataac tgttgggaaa aatagacatg gatcggtttt ctcttcggta
2881 ttacatttcg atccaaaaat tagtaaattc tccgctatta aaaaagtatg gtaaattata
2941 gtaactgcca cttcatcaaa agtcctatcc accttgaaaa tcagaagttt ggaagaagac
3001 ctggtcaatc tattaagata tctcccaaat tggctcaaaa tgggatggta gaagttatag
3061 gtcttgattt tctttcatct cattaccatg cattagcagc tatccaaaga ttactgaccg
3121 caacgaatta caaggggaac acaaaagggg ttgttttatc cagagaatca aatagttttc
3181 aatttgaagg atggatacca agaatccgtt ttacaaaaac tgaattctta gaggcttatg
3241 gagttaagcg gtataaaaca tccagaaata agtatgagtt tagtggaaaa gaagctgaaa
3301 ctgctttaga agccttatac catttaggac atcaaccgtt tttaatagtg gcaactagaa
3361 ctcgatggac taatggaaca caaatagtag accgttacca aactctttct ccgatcatta
3421 ggatttacga aggatgggaa ggtttaactg acgaagaaaa tatagatata gacttaacac
3481 cttttaattc accacctaca cggaaacata aagggttcgt tgtagagcca tgtcctatct
3541 tggtagatca aatagaatcc tactttgtaa tcaagcctgc aaatgtatac caagaaataa
3601 aaatgcgttt cccaaatgca tcaaagtatg cttacacatt tatcgactgg gtgattacag
3661 cagctgcgaa aaagagacga aaattaacta aggataattc ttggccagaa aacttgttat
3721 taaacgttaa cgttaaaagt cttgcatata ttttaaggat gaatcggtac atctgtacaa
3781 ggaactggaa aaaaatcgag ttagctatcg ataaatgtat agaaatcgcc attcagcttg
3841 gctggttatc tagaagaaaa cgcattgaat ttctggattc ttctaaactc tctaaaaaag
3901 aaattctata tctaaataaa gagcgctttg aagaaataac taagaaatct aaagaacaaa
3961 tggaacaatt agaacaagaa tctattaatt aatagcaagc ttgaaactaa aaacctaatt
4021 tatttaaagc tcaaaataaa aaagagtttt aaaatgggaa attctggttt ttatttgtat
4081 aacactgaaa actgcgtctt tgctgataat atcaaagttg ggcaaatgac agagccgctc
4141 aaggaccagc aaataatcct tgggacaaca tcaacacctg tcgcagccaa aatgacagct
4201 tctgatggaa tatctttaac agtctccaat aattcatcaa ccaatgcttc tattacaatt
4261 ggtttggatg cggaaaaagc ttaccagctt attctagaaa agttgggaga tcaaattctt
4321 gatggaattg ctgatactat tgttgatagt acagtccaag atattttaga caaaatcaaa
4381 acagaccctt ctctaggttt gttgaaagct tttaacaact ttccaatcac taataaaatt
4441 caatgcaacg ggttattcac tcccagtaac attgaaactt tattaggagg aactgaaata
4501 ggaaaattca cagtcacacc caaaagctct gggagcatgt tcttagtctc agcagatatt
4561 attgcatcaa gaatggaagg cggcgttgtt ctagctttgg tacgagaagg tgattctaag
4621 ccctgcgcga ttagttatgg atactcatca ggcattccta atttatgtag tctaagaacc
4681 agtattacta atacaggatt gactccgaca acgtattcat tacgtgtagg cggtttagaa
4741 agcggtgtgg tatgggttaa tgccctttct aatggcaatg atattttagg aataacaaat
4801 acttctaatg tatctttttt agaggtaata cctcaaacaa acgcttaaac aatttttatt
4861 ggatttttct tataggtttt atatttagag aaaacagttc gaattacggg gtttgttatg
4921 caaaataaaa gaaaagtgag ggacgatttt attaaaattg ttaaagatgt gaaaaaagat
4981 ttccccgaat tagacctaaa aatacgagta aacaaggaaa aagtaacttt cttaaattct
5041 cccttagaac tctaccataa aagtgtctca ctaattctag gactgcttca acaaatagaa
5101 aactctttag gattattccc agactctcct gttcttgaaa aattagagga taacagttta
5161 aagctaaaaa aggctttgat tatgcttatc ttgtctagaa aagacatgtt ttccaaggct
5221 gaatagacaa cttactctaa cgttggagtt gatttgcaca ccttagtttt ttgctctttt
5281 aagggaggaa ctggaaaaac aacactttct ctaaacgtgg gatgcaactt ggcccaattt
5341 ttagggaaaa aagtgttact tgctgaccta gacccgcaat ccaatttatc ttctggattg
5401 ggggctagtg tcagaagtga ccaaaaaggc ttgcacgaca tagtatacac atcaaacgat
5461 ttaaaatcaa tcatttgcga aacaaaaaaa gatagtgtgg acctaattcc tgcatcattt
5521 tcatccgaac agtttagaga attggatatt catagaggac ctagtaacaa cttaaagtta
5581 tttctgaatg agtactgcgc tcctttttat gacatctgca taatagacac tccacctagc
5641 ctaggagggt taacgaaaga agcttttgtt gcaggagaca aattaattgc ttgtttaact
5701 ccagaacctt tttctattct agggttacaa aagatacgtg aattcttaag ttcggtcgga
5761 aaacctgaag aagaacacat tcttggaata gctttgtctt tttgggatga tcgtaactcg
5821 actaaccaaa tgtatataga cattatcgag tctatttaca aaaacaagct tttttcaaca
5881 aaaattcgtc gagatatttc tctcagccgt tctcttctta aagaagattc tgtagctaat
5941 gtctatccaa attctagggc cgcagaagat attctgaagt taacgcatga aatagcaaat
6001 attttgcata tcgaatatga acgagattac tctcagagga caacgtgaac aaactaaaaa
6061 aagaagcgga tgtctttttt aaaaaaaatc aaactgccgc ttctctagat tttaagaaga
6121 cgcttccctc cattgaacta ttctcagcaa ctttgaattc tgaggaaagt cagagtttgg
6181 atcgattatt tttatcagag tcccaaaact attcggatga agaattttat caagaagaca
6241 tcctagcggt aaaactgctt actggtcaga taaaatccat acagaagcaa cacgtacttc
6301 ttttaggaga aaaaatctat aatgctagaa aaatcctgag taaggatcac ttctcctcaa
6361 caactttttc atcttggata gagttagttt ttagaactaa gtcttctgct tacaatgctc
6421 ttgcatatta cgagcttttt ataaacctcc ccaaccaaac tctacaaaaa gagtttcaat
6481 cgatccccta taaatccgca tatattttgg ccgctagaaa aggcgattta aaaaccaagg
6541 tcgatgtgat agggaaagta tgtggaatgt cgaactcatc ggcgataagg gtgttggatc
6601 aatttcttcc ttcatctaga aacaaagacg ttagagaaac gatagataag tctgattcag
6661 agaagaatcg ccaattatct gatttcttaa tagagatact tcgcatcatg tgttccggag
6721 tttctttgtc ctcctataac gaaaatcttc tacaacagct ttttgaactt tttaagcaaa
6781 agagctgatc ctccgtcagc tcatatatat atatctatta tatatatata tttagggatt
6841 tgatttcacg agagagattt gcaactcttg gtggtagact ttgcaactct tggtggtaga
6901 ctttgcaact cttggtggta gactttgcaa ctcttggtgg tagacttggt cataatggac
6961 ttttgttaaa aaatttatta aaatcttaga gctccgattt tgaatagctt tggttaagaa
7021 aatgggctcg atggctttcc ataaaagtag attgttttta acttttgggg acgcgtcgga
7081 aatttggtta tctactttat cttatctaac tagaaaaaat tatgcgtctg ggattaactt
7141 tcttgtttct ttagagattc tggatttatc ggaaaccttg ataaaggcta tttctcttga
7201 ccacagcgaa tctttgttta aaatcaagtc tctagatgtt tttaatggaa aagttgtttc
7261 agaggcatct aaacaggcta gagcggcatg ctacatatct ttcacaaagt ttttgtatag
7321 attgaccaag ggatatatta aacccgctat tccattgaaa gattttggaa acactacatt
7381 ttttaaaatc cgagacaaaa tcaaaacaga atcgatttct aagcaggaat ggacagtttt
7441 ttttgaagcg ctccggatag tgaattatag agactattta atcggtaaat tgattgtaca
7501 ag
//
LOCUS MUSMIP1X 638 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse macrophage inflammatory protein 1-beta (MIP-1) mRNA,
complete cds.
ACCESSION M35590
KEYWORDS macrophage inflammatory protein 1-beta.
SOURCE Mouse macrophage cell line RAW 264.7, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 638)
AUTHORS Sherry,B., Tekamp-Olson,P., Gallegos,C., Bauer,D., Davatelis,G.,
Wolpe,S.D., Masiarz,F., Coit,D. and Cerami,A.
TITLE Resolution of the two components of macrophage inflammatory protein
1, and cloning and characterization of one of those components,
macrophage inflammatory protein 1-beta
JOURNAL J. Exp. Med. 168, 2251-2259 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 63 341 macrophage inflammatory protein 1-beta (MIP-1)
precursor
sigp 63 131 macrophage inflammatory protein 1-beta signal
peptide
matp 132 338 macrophage inflammatory protein 1-beta
mRNA < 1 638 macrophage inflammatory protein 1 mRNA
BASE COUNT 125 a 175 c 139 g 199 t
ORIGIN
1 gcttctgaag cttctgggcc ctgcagtccc agctctgtgc aaacctaacc ccgagcaaca
61 ccatgaagct ctgcgtgtct gccctctctc tcctcttgct cgtggctgcc ttctgtgctc
121 cagggttctc agcaccaatg ggctctgacc ctcccacttc ctgctgtttc tcttacacct
181 cccggcagct tcacagaagc tttgtgatgg attactatga gaccagcagt ctttgctcca
241 agccagctgt ggtattcctg accaaaagag gcagacagat ctgtcctaac cccagtcagc
301 cctgggtcac tgagtacatg agtcacttgg agttgaactg agcagctcca gcggcagggc
361 aggaggagcc acttcaggag aggcctcctc agccctgatg cttctcactg agaagcgtcc
421 ttgctcctca cgttcagatt tcctgcccct cttcttaatt taaatctctg tgtagacttt
481 gttttgtttt tttgggggag tattatttct attatttatg ttttagttat aggacgcgtc
541 tctcccatgg agatggtcca ccattgctgt ttctctgcta ttgtggatat gactgtgaaa
601 ttgatttcat gcattttcat aataaatctt tctttaag
//
LOCUS DENAAA 699 bp ss-RNA VRL 27-JUL-1990
DEFINITION Dengue virus type 2 RNA.
ACCESSION M36806
KEYWORDS .
SOURCE Dengue virus type 2, cDNA to viral RNA.
ORGANISM Dengue virus type 2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Flaviviridae; Flavivirus (arbovirus group B).
REFERENCE 1 (bases 1 to 699)
AUTHORS Hahn,C.S., Hahn,Y.S., Rice,C.M., Lee,E., Dalgarno,L.,
Strauss,E.G. and Strauss,J.H.
TITLE Conserved elements in the 3' untranslated region of flavivirus RNAs
and potential cyclization sequences
JOURNAL J. Mol. Biol. 198, 33-41 (1987)
STANDARD simple staff_review
BASE COUNT 170 a 160 c 183 g 186 t
ORIGIN
1 cctccatcag agcatggctg tcttagggct actcttctgc ctagtgacat tcccaagctg
61 tgagtgtttc agggtttcag aagagggact aggacatgtc agataggaga tgtgtgactt
121 atgttgatgt tgcttgtccc caggtgtcct atcccaggtg cagctgaagc agtcaggacc
181 tggcctggtg cagccctcac agagcctgtc catcacctgc acagtctctg gtttctcatt
241 aactacctat ggtgtaaact gggttcgcca gtctccagga aagggtctgg agtggctggg
301 agtgatatgg agtggtggaa gcacagagta taatgcagct ttcatatcca gactgaacat
361 caccaaggac aattccaaga ccaagttttc tttaaaatga acagtctgca atctaatgac
421 acagccatat attactgtgc cagaagggac tacgttagta ggtacgaccc gtttgcttac
481 tggggccaag ggactctggt cactgtctct gcaggtgagt cctaacttct cccattctag
541 atgcatcttg gggggattct gagccttcag gaccaagatt ctctgcaaac gaatcaagat
601 tcaacccctt tgtaccaaaa ttgagacagg ggtctgggct agggactctc tgcctgctgg
661 tctgtggtgg cattagaaat gaagtatgat gaaggatct
//
LOCUS MVEAAA 558 bp ss-RNA VRL 27-JUL-1990
DEFINITION Murray Valley encephalitis virus RNA.
ACCESSION M35172
KEYWORDS repetitive DNA.
SOURCE Murray Valley encephalitis virus, cDNA to viral RNA.
ORGANISM Murray Valley encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Flaviviridae; Flavivirus (arbovirus group B).
REFERENCE 1 (bases 1 to 558)
AUTHORS Hahn,C.S., Hahn,Y.S., Rice,C.M., Lee,E., Dalgarno,L.,
Strauss,E.G. and Strauss,J.H.
TITLE Conserved elements in the 3' untranslated region of flavivirus RNAs
and potential cyclization sequences
JOURNAL J. Mol. Biol. 198, 33-41 (1987)
STANDARD simple staff_review
FEATURES from to/span description
ORF < 1 30 open reading frame
rpt 184 211 repeat
rpt 333 359 repeat
BASE COUNT 170 a 122 c 162 g 104 t
ORIGIN
1 actcatgtga gtgaagatag ggtcttgtaa ataacattga tagaaaattt tgtaaatatt
61 taatgtaata tagtataggt aaaatttttt gaaattaagt aaaattaagt agcaagactt
121 gatagtcagg ccagccggtt aggctgcacc cgaaggttgg tagacggtgc tgcctgcgac
181 caaccccagg aggactgggt taccaaagct gattctccac ggttggaaag cctcccagaa
241 ccgtctcgga agaggagtcc ctgccaacaa tggagatgaa gcccgtgtca gagccgaaag
301 cgccacttcg ccgaggagtg caatctgtga ggccccagga ggactgggta aacaaagccg
361 taaggccccc gcagcccggg ccgggaggag gtgatgcaaa ccccggcgaa ggactagagg
421 ttagaggaga ccctgcggaa gaaatgagtg gcccaagctc gccgaagctg taaggcgggt
481 ggacggacta gaggttagag gagaccccac tctcaaaagc atcaaacaac agcatattga
541 cacctgggaa aagactag
//
LOCUS ECOSTII 806 bp ds-DNA BCT 27-JUL-1990
DEFINITION E.coli heat-stable toxin II (STII) gene, complete cds.
ACCESSION M35586
KEYWORDS heat-stable toxin II.
SOURCE E.coli (strain K12/711) DNA, pCHL6.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 806)
AUTHORS Lee,C.H., Moseley,S.L., Moon,H.W., Whipp,S.C., Gyles,C.L. and So,M.
TITLE Characterization of the gene encoding heat-stable toxin II and
preliminary molecular epidemiological studies of enterotoxigenic
Escherichia coli heat-stable toxin II producers
JOURNAL Infect. Immun. 42, 264-268 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept 189 40 (c) ORF 1
pept 443 658 heat-stable toxin II (put.)
BASE COUNT 287 a 141 c 135 g 243 t
ORIGIN
1 aagcttagta ttgttgacat gaacagcaaa tcgacttaat caaaaaacta tcagagaacg
61 ctgcattatt gattttagga ctttcaatgc aaatatcatc gaggaacgta catcattgcc
121 cacagataca acggatagtt gtatgaaagc ataaagcaaa atcaagattg caaacacaga
181 agggggcatt cctccccctc tcttttgcac ttctttccta tatttttaca agaaaagaaa
241 atgattttat atgttgaata ataaaacaat caatatgtaa atacctacaa cgggtgattg
301 acactacact cattaactat actgcaagta gcattaaaaa tcttaataaa ggagagcttc
361 gtcacatttt tttgacttga ctcatataaa ggcccactgg tataagtttt attgcttata
421 gcaataaggt tgaggtgatt ttatgaaaaa gaatatcgca tttcttcttg catctatgtt
481 cgttttttct attgctacaa atgcctatgc atctacacaa tcaaataaaa aagatctgtg
541 tgaacattat agacaaatag ccaaggaaag ttgtaaaaaa ggttttttag gggttagaga
601 tggtactgct ggagcatgct ttggcgccca aataatggtt gcagcaaaag gatgctaata
661 tatttatcaa tagcattcag caccatatac acaaaaataa tttttcataa aaagaactct
721 ataaaataaa tattttttgt gacaatgtcc taacgcaaga cggacattgt ccatttctca
781 ctgcaggctg cggctgagcg ataccc
//
LOCUS TRNTETE 830 bp ds-DNA BCT 27-JUL-1990
DEFINITION Transposon Tn10 (from E.coli) tetracycline resistance E (tetE)
gene, complete cds.
ACCESSION M34933
KEYWORDS tetracycline resistance.
SOURCE Transposon Tn10 (from E.coli) DNA, clone pSL1503.
ORGANISM Transposon Tn10
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 830)
AUTHORS Tovar,K., Ernst,A. and Hillen,W.
TITLE Identification and nucleotide sequence of the class E tet
regulatory elements and operator and inducer binding of the encoded
purified Tet repressor
JOURNAL Mol. Gen. Genet. 215, 76-80 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 152 787 tetracycline resistance determinant E
BASE COUNT 244 a 179 c 187 g 220 t
ORIGIN
1 agttaacgtt ctctatcgat gatagggttt gaaaaataac tctatcagtg atagattgtc
61 aacagcaagt atcaattgca agagatagct actatcccaa actttttatt gagatagtca
121 ctatctaaca gttgtccaaa aggagatagt gatggcacga ctaagcttgg acgacgtaat
181 ttcaatggcg ctcaccctgc tggacagcga agggctagag ggcttgacta cgcgtaagct
241 ggcgcagtcc ctaaaaattg agcaaccgac tctgtattgg cacctgcgca acaagcagac
301 tcttatgaac atgctttcag aggcaatact ggcgaagcat cacacccgtt cagcaccgtt
361 accgactgag agttggcagc agtttctcca ggaaaatgct ctgagtttcc gtaaagcatt
421 actggtccat cgtgatggag cccgattgca tatagggacc tctcctacgc ccccccagtt
481 tgaacaagca gaggcgcaac tacgctgtct atgcgatgca gggttttcgg tcgaggaggc
541 tcttttcatt ctgcaatcta tcagccattt tacgttgggt gcagtattag aggagcaagc
601 aacaaaccag atagaaaata atcatgtgat agacgctgca ccaccattat tacaagaggc
661 atttaatatt caggcgagaa cctctgctga aatggccttc catttcgggc tgaaatcatt
721 aatatttgga ttttctgcac agttagatga aaaaaagcat acacccattg aggatggtaa
781 taaatgatgc tatctatgtg tcaactctaa tttatagtta tggatagtgt
//
LOCUS CLONIFC 1191 bp ds-DNA BCT 27-JUL-1990
DEFINITION C.pasteurianum nitrogenase C (nifC) gene, complete cds.
ACCESSION M34365
KEYWORDS nitrogen-fixation protein; nitrogenase C.
SOURCE C.pasteurianum (strain W5) DNA.
ORGANISM Clostridium pasteurianum
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1191)
AUTHORS Wang,S.-Z., Chen,J.-S. and Johnson,J.L.
TITLE A nitrogen-fixation gene (nifC) in Clostridium pasteurianum with
sequence similarity to chlJ of Escherichia coli
JOURNAL Biochem. Biophys. Res. Commun. 169, 1122-1128 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-Z.Wang, 18-MAY-1990.
FEATURES from to/span description
pept 331 1191 nitrogenase C (nifC)
BASE COUNT 403 a 132 c 178 g 478 t
ORIGIN
1 tttagtacat ctgaaatctg taattaagga gataataagt caaagtttaa tatatttgat
61 ttgttatttc cttaattttt tataaataac tgctatagta taactgtata aaatatggta
121 taatgtcgta gttgataata attatatttt tttatatatc attctgtgcc tataataatg
181 tatgaactaa tatttattga tacaataaat tataaaatta tataagaata tatagaatat
241 tatattgttt tatataagtt atagtatttt atattaaaca acttaattct aaatattttt
301 ttatgtattt aacaggaggt aatataatac atggaaaata ataaaaaaat attagagagt
361 tcaaaaaaac tttcttccta tggagatgga gagagcaggt tttcttttct tgaaaaaata
421 ttagctcctt tatttttagc tctaacggct atatattttg taatgcttat atttccaata
481 atttcaatga ttcggtattc aggtggttct cacattattc aaacccttta tgatcaagat
541 aatataaaga ctataatttt aagctttgta acttcactta tagcacttat atttactttt
601 atcataggaa ctccaacagc tttttgtatc aattttgtta gaaataaagt tttatctaaa
661 atattagata tatttgtaga aattcctgta gttttgcctc cagcagtagc tggtattgct
721 ttgctactag cttttggtaa aaatggtgta gtgggtaatt ttttatctaa tcatgggata
781 aatgttatat ttacctctac agctgtaata attgcacaat tttttgtatc ttctgctctt
841 tatgtaagag ttcttagaga ttctgttaaa tcagtaccta ttgaactttt tgaagtgagt
901 tatgtgctag gggcaggaaa aattgaaaca ataattaaaa ttatgattcc aatgcttaaa
961 aagtcaatcg tgtctggact tattttggct tggataagat ctcttggaga gtttggagcc
1021 actttgatgt ttgcaggaaa cattatagga aaaacaagaa ccataccact tcaaatctat
1081 acttatatgc aggatgatat taaaatggct actgcttttg ctacgatttt gtatataatg
1141 acttttgtat tgttattatt agttagactt agtattaggg atgatgatta g
//
LOCUS LACISTRAN 808 bp ds-DNA BCT 27-JUL-1990
DEFINITION L.lactis insertion sequence IS946 transposase gene, complete cds.
ACCESSION M33868
KEYWORDS transposase.
SOURCE L.lactis DNA.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 808)
AUTHORS Romero,D.A. and Klaenhammer,T.R.
TITLE Characterization of gram-positive insertion sequence IS946, an
Iso-ISS1 element, isolated from the conjugative Lactococcal plasmid
pTR2030
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.A.Romero, 26-APR-1990.
Author address: D.A.Romero
North Carolina State University
Dept. of Food Science
Box 7624, Raleigh, NC 27695
FEATURES from to/span description
pept 74 754 transposase (put.)
rpt 1 18 inverted repeat
rpt 791 808 inverted repeat
signal 6 35 promoter region
binding 62 65 ribosomal binding site
BASE COUNT 281 a 137 c 158 g 232 t
ORIGIN
1 ggttctgttg caaagttttc tgataagtct attttagtgt aaaatgaata aaaatgacag
61 cgaggatata tcaatgaact attttaaagg taaacaattt caaaaagatg tgattattgt
121 cgctgttggt tactacctgc gttacaatct aagctatcgt gaagttcagg aattgttata
181 tgatcgtgga ataaatgttt gtcatactac gatttatcgt tgggtgcaag aatacagtaa
241 agtcctctat catctctgga aaaagaaaaa tagacagtcc ttctattcgt ggaaaatgga
301 cgaaacctat atcaaaatca aaggtcgttg gcattatctc tatcgtgcaa ttgatgcgga
361 tggcttaacc ttagatatct ggttacgaaa gaaacgggat acgcaagcag cctatgcttt
421 cttaaaacga ctccataaac agtttggtca accaagagta attgtcacgg ataaagcgcc
481 ctctattggt tctgcattta gaaagttaca gagtaacggt ttatatacta agacagagca
541 tcgaaccgtg aagtatctca ataacctcat tgagcaagac catcgaccaa tcaaacgacg
601 caataaattt tatcgaagtc tacgaactgc ctcaaccacg attaagggca tggaaacaat
661 tcgaggaata tacaaaaaga accgaagaaa tggaacgctc ttcggatttt cggtatctac
721 tgagattaag gtcttaatgg gaatattagc ttaagaacaa gaaggattat aaaccttgta
781 tttgattttt aaactttgca acagaacc
//
LOCUS MUSMB1 5629 bp ds-DNA ROD 27-JUL-1990
DEFINITION Murine B cell 1 (mb-1) gene, complete cds.
ACCESSION M31773
KEYWORDS murine B cell 1.
SOURCE Mouse (strain Balb/c) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5629)
AUTHORS Kashiwamura,S.-I., Koyama,T., Matsuo,T., Steinmetz,M.,
Kimoto,M. and Sakaguchi,N.
TITLE Structure of the murine mb-1 gene encoding a putative
sIgM-associated molecule
JOURNAL J. Immunol. 145, 337-343 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Sakaguchi, 01-FEB-1990.
FEATURES from to/span description
pept 277 343 B cell 1, exon 1
1823 2116 B cell 1, exon 2
2308 2426 B cell 1, exon 3
4450 4518 B cell 1, exon 4
4646 4759 B cell 1, exon 5
pre-msg 268 5072 B cell 1 mRNA and introns
IVS 344 1822 mb-1 intron A
IVS 2117 2307 mb-1 intron B
IVS 2427 4449 mb-1 intron C
IVS 4519 4645 mb-1 intron D
BASE COUNT 1332 a 1515 c 1400 g 1382 t
ORIGIN
1 gatcctttct cagggatcca tggtgatgaa ccaggccttg aaccaccctc tccccgaccc
61 cacgcactag agagagactc aagggaattg tggccaccca ggtgcagggc agttcctctc
121 cactcagagc ccacacatat ggcaaataaa gggccaggag taagggccac tccagcccat
181 ctccggcacg gctgaacagg aagtgaggcg gagtttggag tggaatctgg tacctcaagc
241 tacggccacc caaactaacc gactcactgg gagacgatgc cagggggtct agaagccctc
301 agagccctgc ctctcctcct cttcttgtca tacgcctgtt tgggtaatgt ggccaaaggg
361 aaggtgcggg tggggcgggg gtgttgcggc tgacaggaga ggtgcagagt ggagtcagaa
421 atggccaggc agggtgatgc ataacgtggc atctcttaaa gtaacgccag aactctagga
481 aactgaggag ggaggaacat gcatttgagg ccagctggcg tacataatga cttctatcta
541 ggccaacctc gcaagttact cattaaaact ctctcaaaaa gaaaaggagt gtcgggaggg
601 aatcagggaa agcctctctc agctagctca gacagctgag taagtccact ttaggaccaa
661 cactaccttg cccacgacac acgaccttag ctcacagtca gaatgagaac tgagtaagtg
721 tgactgaaca cacaccagtg agtgtagcaa tggtgtcggt gagggaacgg gttttattgt
781 tgttgttgtt gttgttgttg ttttttcctc tgccttacat ctggaggcag gcttgccatg
841 tgaccacata gacccaggtt tgcagtggag gtcttacgag actggggccc tggtctctgt
901 gtcttaagcc tgtgtgacag gaactggctg gggctctttc ccctcccttt ctttcttcct
961 cacttccgac cttcttcttt tctgtcatgt ccacaaaagg gacctacaag ccagtgtgga
1021 gctggggagg ctgagaacag ggtggggggt ggggggggcg ggtcaagata aatacaagag
1081 cagacaccat agccaccagt ccaccccgtc tcttagcctc ctccgatgcc ccagacagaa
1141 ctcagtccca gtgcctaggt taatactggt ttctctgttt tgtttttcga gacagggatt
1201 ctgtgtgcaa ctggaacgta ctctgttaga ccagcaggcc tcaaactccc agagatctgc
1261 ctgacttctc tctcctgagt gctgggatta aaggcatgta ccagtactgc ctggcagtcg
1321 ccacatttaa attaaattac agagatgaga aacagcctag gacaaaaaaa caaaaaaaca
1381 aacaaacaaa caaacaaaaa aaacaaaaac aaaaaaaaaa acaaggccaa gcgcagtaaa
1441 ggagagattc aggaaggtgg gaattgggct ctactatctt ttccaatagc ctggttactg
1501 ctgagctgag ttttaccgct gcatctaagg ttctgcccca ccttcacccc ctccccgtgg
1561 ggttcctaag agttgtaagg aatggggaac aaatgggatg gttttcatga aagaatccct
1621 ggggtttttg gggaacatca aaatacaaag gggaaaggga agaaagaggc catagtgtgc
1681 ccccccccaa gtctcccatc atctttttct agtacattac cattggtacc ggctccaatt
1741 ctgatgtgaa ggggtctggg ctagagcaat catctccatt ctccaggcta tgacccatct
1801 gtctcctctc ctctctccac aggtcccgga tgccaggccc tgcgggtaga agggggtcca
1861 ccatccctga cggtgaactt gggcgaggag gcccgcctca cctgtgaaaa caatggcagg
1921 aaccctaata tcacatggtg gttcagcctt cagtctaaca tcacatggcc cccagtgcca
1981 ctgggtcctg gccagggtac cacaggccag ctgttcttcc ccgaagtaaa caagaacaca
2041 ggggcttgta ctgggtgcca agtgatagaa aacaacatat taaaacgctc ctgtggtact
2101 tacctccgcg tgcgcagtga gtagggaggg cgctggcctc ttgcgttccc tgctccctct
2161 ttcttccaaa acattaggag cagacgtagc tcctccctcc tggacctgcc acccagccac
2221 agagactggt tgcgagggct ccctgacctc gcaagggtct aggcgagaag aagggacaca
2281 gaatgctgag cagcacctgt cttcaagatc cagtccctag gcccttcctg gacatggggg
2341 aaggtaccaa gaaccgcatc atcacagcag aagggatcat cttgctgttc tgtgcagtgg
2401 tgccagggac gctgctgcta ttcagggtga gtctccccca agtgacaccc ttcaagtcat
2461 cttggcttac ctgggcctgg actgtgactc ccagtgtcca gctggtcctc tgaattcgag
2521 ctcgcccgga gatcttgtac tgatgccaac tgatagaaaa caacatatta aaacgctcct
2581 tgtacttacc atcccgtgcg cagtgagtag gaggcgcctg gcctccttgc tttccctgct
2641 ccctctttct tcaaaacatt agagcaagct agctcctcct cctgacctcg acagcacaga
2701 gatggtggtt caggcgcgtc cgaccctcgc aagggtctgg gctgccagaa gaagggacac
2761 cagaatgctg agcagcaccc tgtcttcaca gatccagtcc ctaggccctt cctggacatg
2821 ggggaaggta ccaagaaccg catcatcaca gcagaaggga tcatcttgct gttctgtgca
2881 gtggtgccag ggacgctgct gctattcagg gtgagtctcc cccaagtgac acccctcaag
2941 tcatcttggc ttccctgggc ctggactgtg actcccagtg tccagctggt ccttgggtct
3001 tccgaattcc aaggtttgtc cttccaattt gggaggctgg gaataaccac caaaaggatt
3061 ccaggggggt cagctgtccc cacggccctc atagaaaccc tctcacttct ggccgccaac
3121 gggagtccct gcaaatgcag ccaagacaag ggaagaaggt gtgtcctccc agagcccctg
3181 cgtctcctta ggccagaaga caacattttt caccacattg gggcttcatt attgtacgcc
3241 tggctggcct tgaactcaca gagatctgcc tatttctgcc tcccaagtgc tgctgggatt
3301 aaagggccac catgcctgcc cttcttcctt tctttaactt tttcttttcc ccaccaccta
3361 accccccaag actgggtctc tctgtgtagc cctggtgtcc tagatataag gctggctcag
3421 actcacagag atccacctgc ctctgcctcc tgagtgctgg gattacagtt gtgcacccct
3481 atcgccctgc actttattta tgtcttagtg tgtacgtggg catgttgagt gtggaggtcg
3541 aagaacaact ttcaggagtt gattctttct ccactgtgga ttcgcacaca aattcatatc
3601 ctcaggtttg caggcataca ttaccagttg agctatcatc tgacttcttt gcctgagcta
3661 tcatctgact tctttgcttt catttttatt ttagtttatt ttattttatt tttttaaagc
3721 tagatattag tctagctagc ctcaaacttg ctacatcaca tggctggcct gattcctctt
3781 taccttccaa gcactgaaat tacagggcag tggcaccacc agttttattt tgtttgtttt
3841 aatttttgtt ttgtgttttg ttttgttttt gatggggagg gttagacagc attttgctat
3901 gtaattcagg ctgggctaga actcactcta tagatcatat tggtctggaa ttctcctgcc
3961 tcagtttgcc aagcactgga attacagatc tccagcacca ctcattccct gctcgccgtt
4021 ttgcttgtca ttgaggagca gtagagagtc agcatggtgg tgcacagctt tgaccctagc
4081 agcactcagg aggcagaggc aggcggatct ctcgggagcc aaggccagcc tggtccacgt
4141 actgagttcc caggtagttc ttgctagata gagatcctca gagcatcagt ggagtggcag
4201 ggaggtgcag cagttccact tgagacaggg agattgctta aagtttgagg cccataaaaa
4261 ggaaagagag gagaggggca gagactagaa tcttgacaag atacctgatg gtgggaatct
4321 agaccttgat ggtggggagc caagtatccc tcactctttc tgataaagca cttgctagta
4381 ggagacagtc taaaaaccct tcacctccct ccttgctaaa cgtcgcttct acttcatccc
4441 cttctgtaga aacggtggca aaatgagaag tttggggtgg acatgccaga tgactatgaa
4501 gatgaaaatc tctatgaggt aagcactgtt tgcagcaggc gtcgggtgca tggggtgtgg
4561 tgtctgcctt gagctctcac cctggagttg actggactgg ggctccagct gtcagggtac
4621 tgatgtctcc tctctgaccc tgcagggcct gaaccttgat gactgttcta tgtatgagga
4681 catctccagg ggactccagg gcacctacca ggatgtgggc aacctccaca ttggagatgc
4741 ccagctggaa aagccatgac tgacatgtcc cacccttccc tgcctgccat atgtctgact
4801 ccagcatccc tgcctcccca ctcttcccct ggagtccccc cttcccagtg cttgctccag
4861 ccactcgagt gcgctaccct tgcctccagg ctgccccacc ctattctccc acctaatctt
4921 gtccttcccc aagtccagct ccctctctag tgggtaatga gcccttaatc gctgcctcta
4981 ggggagctga ttacagcatc ctcgttagtg tcaccccccc tcctcccaga tctgtcatgg
5041 ccacttaagt gataataaat ccttcccaaa gcagccttgg caggagtcat ggatctcatg
5101 tagactgctc ctcaacactg ccagggaacc aggagctaga aaacccaagt cccctccccc
5161 ctcagaccca ggattctagg ttcaagttct cacctgacca cttctagatc tacctgacaa
5221 ctcttcgaac ggtacagata gactctgcgt atgtcttcga cttcccctca acccctcccc
5281 ccgaaaataa cggtcaaaga gctctcttgg ctaatcccca ccccaattgt aaatgtctgg
5341 tttatctctt atgggcctca ccctcctctg tcccaaaaca aggggaacca gcccggtgtt
5401 aacaaatttt tacctatatt taaactctga gaacacaaga tattattttc tggagtgggg
5461 cctaattcct catgggttca cttgcataag gcagatttgg agagtggctg tcaaaaaccg
5521 gggggagggg gatgacagag agagctagag tggtgtgaca gtcattggga tgggaagagc
5581 agactcaggt tgttaaaaga caggaggcca agaggagact cagtgggga
//
LOCUS FRNTUL4 1233 bp ds-DNA BCT 27-JUL-1990
DEFINITION F.tularensis 17 kDa major membrane protein (TUL4) and 13-kDa
protein genes, complete cds.
ACCESSION M32059
KEYWORDS 13-kDa protein; 17 kDa major membrane protein.
SOURCE F.tularensis (strain live vaccine) DNA.
ORGANISM Francisella tularensis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria.
REFERENCE 1 (bases 1 to 1233)
AUTHORS Sjoestedt,A., Sandstroem,G., Taernvik,A. and Jaurin,B.
TITLE Nucleotide sequence and T cell epitopes of a membrane protein of
Francisella tularensis
JOURNAL J. Immunol. 145, 311-317 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A.Sjoestedt, 12-FEB-1990.
FEATURES from to/span description
pept 99 470 13-kDa protein
pept 551 1000 17 kDa major membrane protein (TUL4) precursor
sigp 551 607 17 kDa major membrane protein (TUL4) signal
peptide
matp 608 997 17 kDa major membrane protein (TUL4)
binding 85 97 ribosome binding site
binding 539 548 ribosome binding site
BASE COUNT 440 a 171 c 204 g 418 t
ORIGIN
1 gaattcatag tcgcctcaaa atatgtttag atatgttgtt aataagttat aatcttatat
61 atagttaact atgttattat aatcaaggtt tttttgctat gatgacttta caagaagaaa
121 aaatacaagc gccagtattt tttaaagaat atgtcaaagg taggtttata cttaatatag
181 gtgagtataa tcatccattg attttatcag caactcaagt gcttgagtat caagataaaa
241 ttgatgatat ccaaagtatc aaaaaaagtc atcttgatct tatcttagcg actaatcctg
301 aaataatact cataggtacc ggtgaaaaac aacttttgcc tccacttgag ataattaatc
361 aaatcgcaaa agctggtaaa agtgttgatt ttatggcgag tgatactgct tgtaagacat
421 ataatttgct tgttaatgaa aatcgtaatg ttagctgtat catcatttaa taaactgctg
481 tttattttat tttaattaat gttataatcg atttgagtat atgtgaatat ttaaaaatag
541 gagtatctat atgaaaaaaa taattaagct tagtctttta tctttatcaa tcgcaggttt
601 agcgagctgt tctactctag ggttaggtgg ctctgatgat gcaaaagctt cagctaaaga
661 tactgctgct gctcagacag ctactactga gcaagctgct gctgtatcta agccaactgc
721 aaaagtaagt ttaaataaac ttggtcagga taaaataaaa gcaactgtat atacagcata
781 caataataac ccacaaggaa gtgtaagatt acaatggcag gctccagaag gttctaagtg
841 ccatgataca agcttcccaa ttactaagta tgctgagaag aacgataaaa cttgggcaac
901 tgtaacagtt aagcaaggta ataacttctg tagcggtaag tggacagcta atgtagttta
961 tgacaaagaa gtaatcgctt ctgattcaat aaatatttaa tttcttcaaa tatttataat
1021 tttctagttt ttgcttttct caaaaaaaaa taacggtgtg attaattttg cttagctaga
1081 ttagttgtac ttttaacttt ttactaaaaa gatttgctat aattgcttaa atataacaaa
1141 taaaaaagta atagtttaat gaaaaacatt atcttgttat taatgatatt ctcgatgtta
1201 gttgtactta gccatgtgca ccaaccacag ctg
//
LOCUS TIPREGU 29802 bp ds-DNA BCT 27-JUL-1990
DEFINITION Plasmid Ti (from A.tumefaciens) virulence regulon (vir) genes,
complete cds.
ACCESSION J03320
KEYWORDS virulence regulon.
SOURCE Plasmid Ti (strain C58) (from A.tumefaciens) DNA.
ORGANISM Plasmid Ti
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 29802)
AUTHORS Rogowsky,P.M., Powell,B.S., Shirasu,K., Lin,T.-S., Morel,P.,
Zyprian,E.M., Steck,T.R. and Kado,C.I.
TITLE Molecular characterization of the vir regulon of Agrobacterium
tumefaciens: Complete nucleotide sequence and gene organization of
the 28.63-kbp regulon cloned as a single unit
JOURNAL Plasmid 23, 85-106 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Kado, 02-MAR-1990.
FEATURES from to/span description
pept 1983 4484 virA protein
pept 5029 5766 virB1 protein
pept 5766 6131 virB2 protein
pept 6131 6457 virB3 protein
pept 6457 8823 virB4 protein
pept 8840 9502 virB5 protein
pept 9603 10487 virB6 protein
pept 10523 10690 virB7 protein
pept 10677 11390 virB8 protein
pept 11387 12268 virB9 protein
pept 12265 13398 virB10 protein
pept 13439 14473 virB11 protein
pept 14728 15453 virG protein (ttg start codon)
pept 17174 17617 virD1 protein
pept 17651 18994 virD2 protein
pept 19009 21030 virD3 protein
pept 21104 23101 virD4 protein
pept 23098 25734 ORF 5
pept 26802 27053 virE1 protein
pept 27169 27360 virE2
pept 27374 29044 virE3 29044
site 1983 29044 vir regulon region
BASE COUNT 7085 a 7819 c 8213 g 6685 t
ORIGIN
1 taccgacgat atgtgcagtc catcagcagt ataatgatta caggagacca aaaattggtt
61 tcaaatgtaa ccgaataagc ctgctgtacg taatggcaat cggtcgatat cagtcggcac
121 agccactcgt cttgctgaga tgcaatgtcg acgtgtaacc aaaatgactt tgatggaagt
181 cgactttgaa gcctgaccga caccggctaa tcgaaatctg caactgctgg aactcggaag
241 cggccaaatc cctcgaggac cgagcgccgg ccgaggtgct tcgaaagaaa tcacttactc
301 aaataagatg tgcgaatagc tttggcgcgc cgcatgtcgc gcttggaggt gacgccgacc
361 gatgctgcgc cgccgatggc catccaacac gcaggtcaaa tgatgaagcg gcgaatcacc
421 gaattcgcgt cagcgtgacc gggcaaacag gctgtcctgc gccatcttct ggccactgag
481 gaaaatcgcg ctcctgcgag agcgcatgtt caagatattc attagctatg gcctctatga
541 gctctacgtg ccgccctgca tcaatatttg gcaactggct gattgcgaga tcgtgtttgc
601 gcgcaaaacg aattgcgcag cgatatccgt cgatatcttc caaaatcggt cgagcggcag
661 ggtagttcca gagttccgcc aactcctcca acaacgaggg gcgatcttcc cggatggcaa
721 acatttccgc aacgcgttgc ttggctcggg tgagaaaggc gtccgaatcc ccaagacgta
781 gccgcttgac atgccattga aaacccgatc tccaaaacgg acttttagcc atgcaattga
841 gaagcgaaat cgacccgccc tcgagaataa gaccctcttc ggacttccgc caatccactt
901 cgaatatgag tcgacgatgg gcactctcag cgtcaaggat gccctcggtg agggggcggg
961 aatccaaata tattctccgc gttgattgca attccgattc caaaggtctt ccgctacctg
1021 tcgcgatttg aggacagcat tgcacacgat caagggcaac caccggccac ccggtttctt
1081 gtgcgatttg gatcgccatg tccgttttgc cgctgcaagt cggtccgtag atgagatgga
1141 gtaacatgtg cgcctcatat aaatttaatg ggatatccca tctgttccaa accagatttt
1201 tcaattcgga ttatgatgtt ttactgcatg atttttacca gctttacgca atctgctgaa
1261 ctaatatggc gctacatttg taatcgtaca tttttgacgt tgccggctca ttcggccatg
1321 agagtttcca tcccgttgtc ctcctgaaca attgaggagt ttggccccga cacctgctta
1381 tggtgaatgc agcggacccc gttaacgcaa cgtcgaaaag cgataggcga tttgagtagg
1441 aagccacatc ggccgcgcgg tatgctccgt cagcacctga cagttcgtgt tgcagctttg
1501 cttggtcacc tgaaacgaaa aatgccgctg aagaatattt aacggatatg cacacgaaaa
1561 agccggttgg gtctcatatg ccgctgacgt cctggcttcg tttcgaagcc gacagcctct
1621 aaggtggtac gaacacagac ctaatatccg ctagtgtccg cggcagccgc cacggcagtt
1681 ctcgctgcgc atagcatagc aacttgccct gtcgacgcat gatcagccat attgttaatg
1741 tcgacgattt catttttctc ggtgcaaacg gcgacatcct gaccatatcg atcctaccag
1801 ccaccgcgcg gagtcaaaga acacctcggc aaccggaaac atccattgga atgccgaggc
1861 tgtgtttcat ttgaaacaca ctgagtcgac gtttgttgct tcaaacccat ttacaaaccc
1921 tactgtgcgg cctaagggcc aacggggtgg gactgagcgc tggtacgagg acgtaagtgc
1981 ggatgaatgg aaggtattca ccgtctcggc aagatttcaa gacaggcgcc aagccttggt
2041 ctatcctggc cttggtagtt gctgcaatga tttttgcctt gatggcgatt acgtcttggc
2101 aggacaatga aaccaatcgg gcgatcctga cccaattgcg agctattaac atcgacagtg
2161 cttcgctgca gcgggatgta ctctccgcgg aagcgggtgt ggtggcgaac taccggccca
2221 ttatctccag gttgggagct ttgcggaaga acctggaaaa tttgaagcga ctatttaaac
2281 aatctcatct tgtgatcggc aatgatttct ctcaactgct cgacaagcta aaggtgtctg
2341 tggatacgac cgacgcggcc gttgcagcct tcggagcgca aaacgtgctc ctgcaagatt
2401 cgcttgccag cttcactcgc gcgcttagta ttcttcccaa aatgtcgtcg acggatcaga
2461 cggtcgaaaa ttcgaacgaa ttgggcagcc tgatgctgcg gtttgtgcgt cagccaagcc
2521 cagcactctc gttggagatc agtcacgaac tcgacatgct ccaaaaagct agcggtgggg
2581 ctgaagttcc tatccgtata cttgcacgcg agggtcgcgt catcttgtcg attttgcccc
2641 gtgtgaacga tgccgtaaac atgattcaga cctccgacac cgctgaaatt gccgaaagat
2701 tggagcgcaa gtgtttggag gcctatagct tgcaaagcgt gagggagcag cgggcacgga
2761 tcttcttggg ttccgtttcg gtgggccttt gcatctacat catctcactg gtctataggc
2821 tgcgtcggaa aacggcttgg ttaacgcggc gtttggatta cgaagaggta atcaaagaga
2881 ttggggtttg tttcgagggg ggaggggcca cagcgtcgtc cctcaattcg tccgcgcaag
2941 ctgccgttgg aattattcaa cgcttcttta atgcggaatc gtgtgcacta gcattggtgg
3001 accatggtga caggtgggct gtcgaaagtt tcgctgcgaa gctgcctgag cccgtctggg
3061 aggacctcgc gctacgcgag atggtttctc ttgccagagc ggatgagcgt gcgtcagtat
3121 tccgcatcat gtcgacgcga aaggtcagct gccttcctcc ggagactccg ggcgtttcta
3181 tgctgctggc acacaaatct acggatcaac tgatagcgat ttgttccctc ggttaccagg
3241 gctatcgtct gaaatcttgt ccaggcgaag ttcagcttct tgaactcgcc accgcctgcc
3301 tctgccacta tatcgatgtc cggcgtaagc agaccgaatg cgattttctg gagaggcgat
3361 tagagcatgc ggaacgtctt caggcagttg gtacacttgc tggtggaata gcgcatgagt
3421 tcaacaacat tttgggagca atcctcgggt acgctgaaat ggcgcaaaac atgctgcgtc
3481 ggtcatctgt cacccgaaga cacattgacc aaattatttc gtcgggtgac agagccaggc
3541 tcattatcga tcagatcttg actctaagtc gaaaactaga acgcgtgaca aagccgttca
3601 gtgtctccga actcgtaatg gaaattgctc ccttattgcg tgttgcattg cagcgcaaca
3661 tcgagctgaa gttcaagttt gacgacaaga agagcgtagt cgaaggaagc ccgctagagg
3721 ttcagcagat gctgatgaat ctttgcaaga acgcttccca ggcgtttacc gccgatggtc
3781 aaatcgacat catcgttagc cgaatttttg tatctcgaca gaaagtactg gcgcatggtg
3841 ttatgccagc tggcgactat gttcttcttt ctgtcagcga cgacggtgaa ggcattgccg
3901 aaactgtgct cccccacatt tttgagcctt tctttacaac acgctcttgc agcggcggca
3961 cgggtctagg tcttgctgcc gtgcacggtc atgtcagcgc gcttgcagga tatattgacg
4021 ttacttcagc cgtagggcga gggacgcgct tcgacattta tctacctcct tcctcgaaga
4081 agcccgtcag cccggacgcg ttttttgggc cctgtaaaac accgcgtggc aacggagaaa
4141 ttgtggcatt gattgagcca gatcctgtct tgcgggaggt gtacgaagac aagatcgccg
4201 ctctgggcta tgagccggtg ggctttaaga catgtgcaga cctttgcaat tggatatcaa
4261 aaggcaagca agccgatctg gttctagttg accaatcgtc tcttcccgag aatcagagtg
4321 ctactgcttt gcacgcagcc ttcaagacgg cgtccatcat cattggagga agtgatctta
4381 aaatgtcact ttccagcgat gacatgacgt ccgcgctttt tctgcccaaa cctatatcct
4441 ccaggacgat ggcctacgcg attcgtacca agatcaaagc ctagagttga ggatgttttt
4501 caggagactc gaccagggta atcagcacgt ctgttcggcg cgcctatagc tatggtttct
4561 cgcaatcgct ttgggcagtc ggttgtggcg gtgcgtcttc tgagacccga aggttggcga
4621 ttatctgctg gaggtcttcg ataccgtctt cgaagaacaa gacatatgtt cgctgagctc
4681 ggacatagga tagtcggcat aggcggatac cctcgccacg caaatggcgc gatcgacacc
4741 ctgatcggag cctgcgaagt gccacagttt attggaacca tcccccgttc attccgccct
4801 ggaatttcac ttcgacggca ttctagatcc tcgccgttgt cttagagagc catctttcgt
4861 cggcgcaccc ttaagtgcat ggaaagccgt tttcgcttca aatgaaatcg aaaagaagaa
4921 aacaaaaatc ctagagtaac cgaccctccc gataatcgtg aacatcagat cacagcattt
4981 cttccgaccg aagtggctgt gttggttatg agcttgggga gattgggaat gttgaaggca
5041 acagggccgc tgtcgattat cttactggcc tccacgtgca cgtcgagtgg tgctgcccca
5101 ctttcatttg ctgagttcaa taattttgca cgcgaatgcg ctccatccgt tgctccatct
5161 acgcttgcag cgatcgctca ggtcgaaagt cgctttgatc cgcttgctgt gcatgacaat
5221 accaccggcg aaacgcttca ctggcagaac caggctcaag caacgcaagt cgtgatggac
5281 ggtctcgaag cacggcattc gctggatgtt gggctcatgc agatcaattc ccggaatttt
5341 tccgtgctcg gtctgacacc tgacggagcc cttcaacctt gcacgtcatt atctgtcgcc
5401 gcaaacttgc ttgggagccg ctacgctggc ggcaacacgg ctgacgacga gcaattgtcg
5461 cttcgtcggg caatctccgc ctataacacc ggtgatttca cgcacggctt cgcgaacggc
5521 tacgtgcgaa aagttgaaac ggccgctcaa cagctcgtcc ccccgttaac cgcgcgtcca
5581 aaagatgatc gtgagaagcc gggatccgag gaaacatggg atgtttgggg ggcatataag
5641 cgtcgttccc cggagggcgg agctggcggg tcatccggtc cgcctccgcc gccagacgag
5701 gacaaccgca aatccgaaga cgacgatcaa ctcttgttcg acttaaatca aggaggtccg
5761 caataatgcg atgctttgaa agataccgtg tacatctgaa tcgcctctcg ctctcgaacg
5821 cggtgatgcg catggtatcc ggctatgcgc cgagcgtggt cggtgcaatg gggtggagca
5881 ttttctcctc tgggccggcc gcggcccaat ctgcaggtgg cggcactgac ccagccacaa
5941 tggttaacaa catatgcacg tttatccttg gtccgttcgg ccagtcactc gctgttcttg
6001 gcatcgtggc catcggaatc tcctggatgt tcggtcgcgc ttcactcggt ctcgttgccg
6061 gcgtcgtcgg cggcattgtc atcatgtttg gagccagctt cctgggcaaa acgctgactg
6121 gaggtggcta atgaatgatc gtctggaaga agcaaccctt tacttggcgg cgacacggcc
6181 cgcattgttt cttggcgtgc cgctgacgtt ggcggggcta ctcgtgatgt ttgccggctt
6241 tgtcatcgtc atcgttcaga acccgctgta cgaagtcgtt ctcgtgccct tgtggttcgg
6301 agcgcggctt gtggtggaac gagactataa cgcggccagc gtcgttctac tttttttgca
6361 gacggcggga aggagcgttg atggtctgat ttggggcggc gcaagcgtta gcccaaatcc
6421 aatcaaggtt cccgcgcgag ggagaggaat ggcgtaatgc tcggagcaag tggcacaacc
6481 gaaaggtctg gcgaggtcta tctaccctac gtcgggcacg tcagcgacca tattgtcctt
6541 ctagaagatg gatcgatcat gacgatggcg cacgtaagtg gcatggcctt cgaactcgaa
6601 gatgccgaaa tgcgcaatcg acgttgccgt gcatttaata cgctcttgcg caatatcgct
6661 gatgatcatg tgtcaatata tgctcacctc gtacgtcatg acgatgtgcc gccgtcaccc
6721 gcgcgacatt tccgcagcgc tttttccgcc agtctgagcg aagcttttga ggagcgcgtt
6781 ctctccggca aactccttcg caatgaccac ttccttacgc tgatcgtgtc tccccgggcc
6841 gcgcttggca aagtgaggag aaggttcacc aaacgctaca gacaaaaaga aaacgatctc
6901 acagctcaaa ccaggaacct ggaagatctc tggcatcttg tcgctggcgc tctcgaagcg
6961 tacggcctgc gtcgtcttgg tattcgtgag aagcaagatg tgctttttac ggaggttgga
7021 gaagctctgc ggctgataat gactggtcga ttcacgccgg ttcccgtcgt tagcggttcg
7081 ctcggcgcct cgatctatac cgaccgagtt atttgcggca agcggggact cgagatccga
7141 acaccaaaag atagttatgt gggatctatt tactcgtttc gcgaataccc cgcaacgacg
7201 ggaccgggta tgctcaacgt gctactgtct ctcgattttc cgcttgttct gacgcagagc
7261 ttctcgtttc tgactcgctc gcaagcccac tcgaagctca gcctcaagtc cagccaaatg
7321 ttgagttctg gcgacaaagc cgtcacccaa atcagcaagt tatccgaggc ggaggacgca
7381 ctagcgagca acgaattcgt attgggggcg catcatgtga gtctttgcat atatgcaaat
7441 gatctcaata atcttgcaga tagaggtgcc cgcgcccgga cgcgattggc ggatgcggga
7501 gctgttgttg tccaagaggg catcggcatg gaggcggctt attggtcgca gctgccaggc
7561 aactataagt ggcgcacgcg tccgggagcg atcacatcgc gcaacttcgc tggtttagtc
7621 tcattcgaga attttcccga gggatccggc tcaggtcact ggggcaacgc gattgcgcgc
7681 tttcgtacca atggtggaac ccctttcgac tacatcccgc acgagcacga tgtcggcatg
7741 acggcgatat tcggtcccat cgggaggggt aaaacgacgc tcatgacctt tatcctcgct
7801 atgctcgagc agagcatggt cgaccgcgcg ggtgcggttg tcctcttcga caaggaccgc
7861 ggcagtgagc tgctggttcg cgccaccggg ggaacatatt tggcgctccg tagaggagga
7921 agcggattgg cgccattgcg tggcctggaa aatacagcgg cttcacatga ttttctgcgc
7981 gaatggatcg tggcgctcat tgagagcgat ggccgtggag gaatatcccc ccaggaaaat
8041 cgccgtctgg tgcggggtat ccatcggcag ctctcgtttg atccccacat gcgctcaatc
8101 gcggggttac gcgaattttt gttgcatgga cccgccgaag gggcgggagc gagactccaa
8161 cgctggtgcc gtggcaatgc actaggctgg gcgttcgacg gcgagctcga cgaagtaaag
8221 ttggatcctt cgattactgg tttcgacatg acgcatcttc tcgaatatga ggaagtatgc
8281 gctgccgcgc gagcatatct tctgcaccgc attggagcca tggttgacgg ccgtcggttt
8341 gtgatgagtt gcgatgagtt tcgcgcctat ttgctaaatc ctaaatttgc ggcggtcgtc
8401 gacaagttcc tgcttactgt ccgcaaaaac aatgggatgc tgatactggc aacgcagcaa
8461 cctgagcatg tcctggaatc gcagctaggc gccagtctcg tcgcgcaatg tatgacgaag
8521 attttctatc cttcacccac ggcagatcga tcggcttaca tcgatggact gaaatgtact
8581 gaaaaggaat ttcaggcgat ccgtgaagac atggcagtag gtagccggaa gtttctgctt
8641 aaacgagaaa gcggaagcgt cgtctgcgaa ttcgatctgc gggaaatgcg cgaatatgtc
8701 gccgtacttt cggggcgcgc caacacggtg cgcttcgcgg atcagcttcg caaagtacag
8761 ggggacaacc catcagcctg gctcagcgaa tttatggctc gttaccacga ggcaaaagat
8821 tgatcaaagg tggggaacta tgaagatcat gcaacttgtt gctgcggcca tggccgtcag
8881 ccttctttcg gtcgggcccg cgcgggcgca gttcgttgtc agcgatccgg cgacggaagc
8941 tgagacgctg gcgacggcgc tcgagactgc ggcaaatctc gaacagacca taacgatggt
9001 ggcgatgtta acctcggctt atggcgtcac cggcctacta acttcgctca accaaaaaaa
9061 tcagtatccc tcgaccaggg acttggacac ggaaatgttt tcgccgcgaa tgccaatgtc
9121 gaccacggca cgtgcgatca ccaccgatac agatcgcgcc gtagttggtg gcgacgctga
9181 agcggatctg ttgcgatcgc agatcaccgg ttccgcaaat agcgcgggca ttgcggctga
9241 caacctggag acgatggaca aacgcttaac agcgaatgcc gagacctcga cacagctttc
9301 tcgctctcgc aatatcatgc aggcaaccgt taccaacggt ttgcttctca agcagatcca
9361 tgacgcaatg attcaaaatg tacaggcgac cagcttgtta acgatgacca ccgcgcaggc
9421 tggccttcat gaggcggaag aggcggccgc tcaacgtaag gagcatcaaa agactgcggt
9481 catctttggg gccgtcccct gagactgggc gacttattcg tgccgcccat ctgcacattt
9541 caggatgcga gttcattgta tccgacattc tgcgacaagc cagtcaaggt caggtccaaa
9601 cgatgaattt cacgatcccg gcgccgttta cggccattca tacgatcttc gatctagcct
9661 ttacgacaag cctggacaca atgcttggga cgatccaaga ggcggtgagc gcgccattgg
9721 tcgcctgcgt cactctttgg attatcgttc agggtattct ggtcatgcgt ggcgaaatcg
9781 acacgcgcgg cggtatcact cgggtgatca cggtcaccgt cgttgttgcc cttgtcgtcg
9841 ggcaggccaa ctaccacgac tatgtggttt cagtctttga agagacgatt ccaaacttta
9901 ttcagcagtt tagtggcagc ggcctgcctc tgcagaccat tcccgctcag ctcgatacaa
9961 tgttcgccct aacccaggct gcatttcaga gaattgcatc tgaaatcggc ccgatgaatg
10021 accaggacat ccttgctttc cagggggctc agtgggtctt ttacggcacg ctctggtctg
10081 ccttcggaat ctacgacgcc gtcggaattc tcacgaaagt tcttctggcg atcggccctt
10141 tgatcctcac gggatatatc tttgatcgca cgcgcgacat cgcggcaaag tggatcgggc
10201 aacttatcac ctatggtctc ctacttctcc tgctaaacct cgtggcgaca atagtcatcc
10261 taaccgaagc gactgcactc acgcttatgc ttggtgtaat cacccttgcc ggtacgaccg
10321 cggccaagat cattggtctt tacgaactcg acatgttctt tctgactggt gacgcactca
10381 ttgtcgcttt gccggcaatc gccaacatcg gaggcagtta ttggagcggc gcgacccaat
10441 ctgccaacag cttgtaccgt cgcttcgctc aggtcgatcg tcgttaggtc ccgcaaaaaa
10501 attcacgtcg gtggagaact ctatgaaata ttgcctgctg tgcctggctc tcgctttaag
10561 cggctgccag acaaacgata aattggcgag ctgcaaaggc ccaatattcc cgctgaatgt
10621 ggggcgatgg cagcctacgc cgtcagatct tcagctcagc aacgtaggtg gtcgccatga
10681 aggggtctga atacgccttg ctagtagcgc gggaaaccct ggctgagcac tacaaggaag
10741 tggaagcttt ccaaactgcg cgtgcgaaat cagcgcggcg cctctccaag gtcattgcag
10801 ctgtcgcaac catcgcggtt ttggggaatg ttgcgcaagc cttcacaatt gccaccatgg
10861 tgccgctgat caggcttgtg ccggtatatc tttggatacg gccggatggc accgttgaca
10921 gcgaggtgtc cgtctcccga ttgcctgcaa ctcaagagga ggccgtcgtt aacgcctcac
10981 tgtgggagta tgttcggctg cgcgagagct atgatgccga taccgcccag tatgcctatg
11041 acctggtctc gaacttcacg cgcccaatgg tgcgccaaaa ttatcagcaa ttcttcaatt
11101 atcccaatcc aacttcgcct caagtcatcc tcggcaaaca cggcaggcta gaggtcgaac
11161 acatcgcttc gaatgatgtt actccgggtg tgcagcaaat tcgctataag cgaaccctca
11221 tcgttgacgg caaaatgccg atggcgagca cttggactgc tacggttcgt tatgaaaagg
11281 tgaccagctt gcccggcaga ttgagactga ccaacccggg aggcttggtt gtcacctcct
11341 accagacatc ggaagatacc gtttcgaacg caggccacag cgaaccatga cgaaaaaagc
11401 atttctcact ctggcatgtt tactttttgc ggcgattggc gcgagggctg aagacacgcc
11461 aacggcgggc agacttgatc cgcgcatgcg ttatctcgct tacaatcccg atcaagtggt
11521 gcgcctttca acggcggttg gagccacttt ggttgttact ttcggggcta acgaaacggt
11581 gacagctgtt gccgtttcca atagcaaaga tctcgcggcc cttccacgcg gaaattatct
11641 tttcttcaag gctagcaagg ttctcccacc ccagccagtg gtcgtgctaa ctgcgagtga
11701 cgccggtatg cgacgctacg ttttcagcat ctcttccaag acgctgccgc acctcgataa
11761 agagcaggcc gatctctact atagcgtaca attcgcttac cctgccgatg acgcagcggc
11821 tccgcagaag gcggcacaag agaaggctgt tgcagaccgt atacgtgcgg aagcgcaata
11881 tcaacagaga gcagagggtt tattggagca gcctgccacg accgttggtg ccgaggacaa
11941 gaattggcac tatgtcgctc agggcgatcg ttcgctgttg ccgctcgaag tcttcgatga
12001 tggatttacg acggtatttc acttcccagg taatgtacgc ataccctcca tctacacgat
12061 aaatccggat ggaaaggaag ctgtcgctaa ctattcagtc aaagggagct atgtcgagat
12121 ttcttcggtt tcccgtggtt ggcgtctgag ggatggccac acggtattat gcatttggaa
12181 taccgcctac gatcccgtcg gccgcaggcc ggagacgggc actgtgaggc ccgatgtgaa
12241 gcgcgtccta aaggaggtga gaggatgaac gacgacaatc agcaatcggc gcatgatgtc
12301 gatgcgtcgg ggtccctggt ctccgacaca catcaccggc gcctttcggg ggctcaaaag
12361 ttgatcgtag gaggtgtagt tctcgcgcta tcacttagcc tcatttggct tggcgggcgt
12421 gaaaagaagg aaaacgggga cgcacccccg tcaaccatga tcgccacgaa caccaagcca
12481 tttcatccgg ctccgattga cgttacactt gatcctccgg ctgcccagga agctgttcag
12541 ccgactgctc ctccgccagc acgaagtgag ccggaacggc atgagccgcg gccggaagaa
12601 acaccgattt ttgcgtacac cagtggtgat caagggacca gcaagcgcgt tcaacaaggc
12661 gaaacggacc gaagacgcga aggcaatggg gaagacagtc ctttgccgaa ggtcgaagtg
12721 tccgccgaga atgatctctc gatacgcatg aagcccaccg agctgcagcc caccagggct
12781 acgctcttgc ctcatcccga cttcatggtg acggagggga cgattattcc atgtatcttg
12841 caaacggcaa tcgacaccag tctggcaggc tatgtaaaat gcgtgttacc ctgggatgtt
12901 cgtggaacaa cgaacaacgt tgtgcttctt gatcgcggca ccaccgttgt tggcgagatc
12961 cagcgcggtt tgcaacaggg agatgcgcgt gtttttgtgc tctgggatcg ggcggagaca
13021 cccgaccatg ccatgatttc gcttgcgtca ccaagcgctg acgaactcgg tcgctcggga
13081 ttgccgggca ccgtcgacaa tcacttctgg cagcgcttta gcggggccat gctcttgagt
13141 gtcgtccaag gtccgttcca ggcagcgagc acctacgctg gcagctcggg tggagggacg
13201 agcttcaaca gcgtccagaa taacggtgaa caaacggcag acacagccct caaggccacg
13261 atcaacatac cgccaaccct gaagaagaat cagggcgaca cggtctccat ttttgtcgca
13321 cgggatctcg atttctcagg catataccag cttcgtatgg ctggtcgcgc ggcgcggggg
13381 cgggatcgcc gtccataacg aattcaactt atcacttaca gatggagata cgatacaaat
13441 ggaggtggat ccgcaattac gaatccttct caagccgatt ttggaatggc tcgatgaccc
13501 gcggaccgaa gaagttgcga taaatcgacc tggggaggca tttgtgcgcc aggccggcgc
13561 cttcctcaag ttccctttgc ctgtctccta tgacgatctc gaagatatcg ctattttagc
13621 aggcgcgctg agaaaacagg acgttggacc acgcaaccca ctttgcgcaa ctgaacttcc
13681 agacggcgag cggctgcaga tctgtttgcc gccgacggta ccatcgggca ccgtcagctt
13741 gacgattcga cggccaagtt cccgtgtttc tagtctcaaa gaagtctcgt cccgttacga
13801 tgctccgagg tggaatcagt ggaaggaacg aaaaaaacgg catgctcagc atgatgaagc
13861 tatccttcgg tactatgaca acggggatct ggaggcgttt ctgcacgcat gtgtcgttgg
13921 tcggttgacg atgctgcttt gcggacccac cgggagtggc aagacaacga tgagcaagac
13981 cttgatcaac gctatcccgc cgcaggaaag gctgattacc atcgaagata cgctcgaact
14041 cgtcattcca cacgagaacc acgtaaggct gctttattct aagaatgggg ctgggctggg
14101 cgcagttacc gctgagcacc tgctacaggc tagcctgcgc atgcgaccgg accgaatact
14161 gctcggcgag atacgcgacg atgccgcgtg ggcttatctg agtgaagtcg tctcagggca
14221 tccgggatcg atttccacaa tacatggtgc caatcccgtc caaggtttca aaaagctatt
14281 ttcgctcgtg aaaagcagcg ctcagggggc tagcttggaa gatcgcaccc tgattgacat
14341 gctcgcaacc gcagttgatg tcatcgtacc cttccgtgcc cacggtgaca tttacgaggt
14401 gggcgaaatc tggctcgctg ccgatgcgcg tcggcgcggt gagacaatag gcgatcttct
14461 taaccagcag tagttgtgat ccatgtttct aaatgccgca tggcgcgttg tagaattacg
14521 tttgtagcaa tgctcagcaa tctttgtcat caaacggaga catctagttt gcatttctgt
14581 cgtgcgcggt ttggtcgaaa tcttgccgaa atgcccgtgt agtgagagaa aattaaagag
14641 tggagtctag caaatacaac ctttacgtgt ataaattctg ttgagctgca aatggctggc
14701 caggatccta gattgagagg tgaaccgttg aaacacgttc ttgtcatcga tgacgatgtc
14761 gctatgcggc atcttatagt cgagtatctt acgatccatg cctttaaggt gactgcggta
14821 gccgacagca agcagttcaa tcgtgtactc tgctccgaga cggtcgatgt cgtggtcgtc
14881 gatcttaatt tgggtcgcga agatgggctt gaaattgttc gtagtctggc cacgaagtcc
14941 gatgttccaa tcataattat tagcggcgct cgcctcgaag aggcggacaa agttattgcg
15001 ctcgagttgg gagcaaccga ttttattgcc aagccttttg ggacgcggga atttctggcg
15061 cgcatccgtg ttgcgttacg cgtgcggccc agtgtcgcgc gaaccaaaga tcgacgctca
15121 tttagtttcg ctgactggac acttaatctc aggcgacgcc gcttgatttc ggaagagggc
15181 agtgaggtga aactcacggc aggtgagttt aatctcctgg ttgctttcct ggagaagccg
15241 cgcgacgtcc tatcccggga gcagcttctg atcgccagtc gggtacgcga ggaggaggtg
15301 tatgacagaa gtattgatgt cctcattttg cggctgcgcc ggaagcttga gggggatccg
15361 acgacccctc agttgatcaa gactgcaaga ggtgctggct atttctttga cgctgacgtg
15421 gatgtttcgt acgggggtgt gatggcggcc tgaggtagag gtgcatttcg cctttagcaa
15481 tctgttccca acgtgagcag attgctatgc ggcttggcaa agctgccttt cctcggtact
15541 atccgaaaaa ctcagcactg cggagtgatt ggatgggtcc tatcttttga gagatcagct
15601 gttcgttgcc ttctcccgag caaagaaaca tgcaagcgct gcggtagcca gcttgtggcc
15661 gaaagcccgg gcggtctcca atcccaatgg atcaaagtga ttgcgagcgg cttctattag
15721 cgagaccggg aacatgcggg aggtctgaac gatgattgat ttttcgaaag ctgtgtgagg
15781 gatcggataa ctcttcggag ccgcacgaaa cgatccatcc gccagcatgt tttcaaaatc
15841 gccaagcgca cggcgcaaga tcatttgtag cgacttggaa ggactgtatt gcaggatcag
15901 gttgtcatat atcttcgata cttcaggcgc gggcgggcgc gctgaaagga agacctggat
15961 cttttctggc gctgtcgtcg aactcaaagc atccacggtc agcatcgatt gctgatcaca
16021 gctgtgacaa cgcttggcgg tggctggggc aggtcgtcga tcttcctcgt cgagattttc
16081 aggcggctgc ggcagggtcg agttttgggt ggcaacaggc aaagaaggat ggacgatttc
16141 gggtcgagcg gcggcaagcc gcctggcctc cccgacagac aaagcgggtt tgcgaattcc
16201 catcttcacc cctccaaggc ttcgctaacc aatttggaga tagtgacgag ttcctccatg
16261 gcgattctga gattccgttc gaggaggcgc attgtcggat cggttctcat attcagcaat
16321 gtgagatgca acatgccacg ttccttcatc gcggcaaatg cgtctctctc gtgcatggga
16381 gactgtacaa ctggaaggct tgcgagcatg tccgacatcg cgcgctgcga tgtggtcaat
16441 cgaccaaccg gcacgcgttg gcgcaatacg gctgtcggaa ttgccaagtt ctcgctcagc
16501 agcagttcaa tgacatagcg gtaggtcgac aatgcttcat cgatatcgag cggagttaac
16561 atggtcggga tcagaagcag gtttgagctg gcaatgatcg tgttgttgag ttcgctcgaa
16621 ccaccatgcg tatcggccag cgcataatca aatccctgga gttcggcgtc ctcataggcc
16681 gcctcaagga gtgccatttc ctcggcggcg tagacttcgc agaaggagcc ccaggtattg
16741 ctgcgaaggg cgttttcttt ccatcgcgtt agtggtcggt tttcatcagc gtcgaagaga
16801 gccaatcgtt tgccgtcact tgcaaaggcg gcgcaaagac ccatgagtgc cgtggttttg
16861 ccggctcctc ccttgaagga gcaaaatgtc agaagtttca tgtccttatc ctgtcgattt
16921 tgtgaagcgg aagtgtgtct gtacttttat ttgtgtgtat gattttgcga taattcataa
16981 gtaatgtagt aattacctga ttttatattt caattttatt gtaatataat ttcaattgta
17041 ataatataaa aataaatatc ccttatgtgt tcttgatttc gttttgtata tggctagatt
17101 cccatctgcc acgacgagga aatgctacgg cggggcaagt tccaagatct ttccgtcttc
17161 tatggaggaa gctatgtcgc aaggcagtag gcccacctca agtgacattg ccgtcaacca
17221 gcgcgaatgc gtgaaggttg aaggcttcaa ggtcgtcagt acccgattaa gatcggccga
17281 atatgagagt ttttctcatc aggcacgctt gctgggcctc tccgacagca tggccatacg
17341 ggttgcggtg cgccgcattg gtggctttct tgaaatcgac gcagagactc gtcataggat
17401 ggaggccata ctacaatcca taggaacact ctcaagcaac attgccgcgc tgctatctgc
17461 ctatgccgaa aatccgacaa tggatttgga ggctttgcga gctgaacgta tcgccttcgg
17521 taaatctttc gctgacctcg acggcttgct ccgttccatt ttgtccgtat cacggcggcg
17581 gatcgacggt tgctcgctgc tgaaagacgc cttgtagcac tgacgtagca cttggcgggg
17641 aacatattcg atgcccgatc gagctcaagt tatcattcgc attatgccgg gaggtggcac
17701 caagaccctt caacaaatta tcaatcagtt ggagtatcta tcccggaagg gcaggctgga
17761 gctgcagcgt tcagcccgac atctcgatat tcccctgcca ccggatcaaa tccacgaact
17821 tgcccgaagc tgggttcaag agactggaac ttatgacgaa agtcagccag acgaggaaag
17881 gcaacaggag ttgaccaccc atattattgt aagcttcccc gccggtacaa gccaggtagc
17941 ggcttatgcg gcgagccggg agtgggcagc cgagatgttt gggtcaggcg cagggggggg
18001 ccgatacaac tatcttacgg ccttccacat cgatcgcgac cacccacatc tgcatgtcgt
18061 cgtcaatcgg cgcgaacttt taggacacgg ctggctgaag atatctcggc gccatcccca
18121 actgaattac gacgccctgc gcataaagat ggccgagatt tcacttcgtc atggcattgc
18181 cctcgatgcg agccgacgag cagaacgtgg catcaccgag cggccgatca cttatgccca
18241 atatcggcgc cttgagcggg agcaggctcg ccaaatccgt ttcgaagacg cggatttgga
18301 acagtcgtcg ccgcaaggag atcatccaga attcagccaa cctttcgata catccccatt
18361 tgaagcatcc gcgggcggac cggaggacat gcctcggccc aacaatcggc agaatgagtc
18421 gcaagttcat ctccaggagc cagctggtgt cagcaacgaa gccggtgtcc ttgtgcgggt
18481 tgcattggag acggagcgcc ttgctcaacc attcgtttcc gaaaccattc tcgcggacga
18541 catagggagc ggctcttcgc gtgttgccga gggccgtgtg gagagcgcaa accgcactcc
18601 cgatattcct cgcgcagcaa ctgaagctgc cacgcacacg acacacgacc ggcagcggcg
18661 tgcaaagcgt cctcatgatg acgacggagg gccgagtgga gcaaaacgtg tgacattgga
18721 aggcatcgcg gttgggcccc aggcgaacgc cggcgaacag gatggcagta gtggcccctt
18781 agtacggcaa gctggaacgt ctcggccatc tccaccgacg gccacgacgc gggccagcac
18841 cgcaaccgat tcattgtctg ctacagccca cctccagcaa cggagaggtg tcctttcaaa
18901 gcgtccgcgt gaagatgatg atggagaacc gagtgaacgc aaacgcgaga gagatgagcg
18961 cagcaaggac gggcgtgggg gaaataggag atagagaagg accgagtgat ggcaaatgaa
19021 gagttcacca gacactatgc gtggcccgtt cctgtggctt cgaatgatga agggcgtggg
19081 accgcgcgga tccccatcca ggcacaatca atcgttgctg gagaagacgg tcgggacact
19141 tcggtcccaa cggctttgtc gcgaccgcca attgaagata tgccgcacgg cgtccaagaa
19201 acatcggcga gtggcggacg actgggagcg gcccgtctgc gggattccgt aatcccgcca
19261 ggaatatccg aagcccgcac ggacctatcc gcaattttgc ggaaaaaaag cggttctttc
19321 cgcaccggta tgcagtatct gcgtgggctt gaacgggaaa attttgataa acaagacagg
19381 gaagccagtg cgttgccaga tttaagtgca aggggcataa agcgaccgcg cgaaattgag
19441 tatcccggca atgcaagcgg attaaccata aagaggcaag acggcttagg catagagatc
19501 aatactatct cggcatcttc gcccgtgaac cgggccgcgc attcgtcgaa ctggcaaggc
19561 gcgccggaac cgggcgtgta caatgttcag ccatcggcag atagagcaca gaactctgcg
19621 caggaaagtt cgacgtttcc cgatggcact tctgtttccg ctctctattc aggcccactc
19681 gcagaatggt tcgagaggga tactggcagc gagacgacca ggaattcagg caacaccatt
19741 tcgtcaccgc tccgagggct ggaggagttc ggcgattccg cggacagccg gtatcttggg
19801 cgcgaagctc agagtctttc agttaccgta acaacgccta attcgaatgc tgaggcaagt
19861 tctcatagcg cacacactga aactctcgac gatgtcagca gtgaccgctc gagcgaacag
19921 gggaggggtc cccttggcgc cgcaatcctt ggatcccatc atgacctttc accgcgtgcg
19981 cagaagttat cacaaacaaa ccgcgattcc cctgaactga ccgacgccga cctggcaaaa
20041 gtcgatgcgg tgttcgaatc tctctccaag gggcccccgg gggcggagag cgccgcaccg
20101 gactttcgtg agcgcggacc aggcagcgct ttccagaaag aaggcgtgtc cgacagggcg
20161 aacggggtgc ccactaactg ggaggtacct tttggtcgcg gtggcgggca ttcaccgcaa
20221 gcgctgaggt cctcaggagt ggagctcgat gacttccctg atttcaccga ggccgaactg
20281 gcaaaaatcg acgcactagt cgaatctcac tccaacagat ctttatcagt gcgaaatata
20341 gtaccagatt tgcgcggagc gggagcagac aacgtcttcc ggaaagaagg cgttgtcgaa
20401 cgcgcggaga agatgccgat cgatagtgtg agtcttacca gactgaatgg cgagcgttca
20461 cgttcaccga agacatcgca agcaagcctt gaggatttcc ccgatctaac cgatgccgat
20521 ctggcccata tcgaagagtc ggagcggatc gcaagaactg ctgttgaaaa ggggaagcaa
20581 aaaatctcga ccgaagccga tacgcggttt gatttgggca actcttctgc accccgggtt
20641 agccctcgat cggttacccc gttggttccc aatgccaatc agccgattac atcgtggttc
20701 tacgaggcac aaaagacttg tgacaaattg gtagagaaca cttacgtgaa gccggccgtt
20761 gacagttcta gggcgcgaaa tgacgtcgaa aacaccgctg ccaggcttgg tgaccctgct
20821 cctgctttgg ggcatgacaa tcttggtaga actcgcgcgt taacgcccgt tcgcgacgtg
20881 atgtctagac ccagcgcgga ccgacaactg gctagtcatg cggcagaaca ttcggcaata
20941 gacgacattt ggaaacggga tgaccgtgat cgcaggacac acccttatag aggattggat
21001 tcgcgttcac gtgaaggttg cgggcggtaa aggctgtgtt gttcgcagca acacccaaat
21061 caaaagccca atttgcgatc gatcgaagaa tctcggtgaa gtgatgaatt cgagcaagac
21121 tacgccccag cgtttagctg tcagcatcgt atgttcgctg gcagccggtt tttgcgcggc
21181 aagtctctat gtaacatttc gccatggttt caatggcgaa gcgatgatga cgtttagcgt
21241 cttcgccttt tggtacgaga ccccgctcta tatgggtcat gcgacccccg tcttctattg
21301 cggtttagcc attgtcgtct cgacgtctat tgttgtgctg ttaagccaac ttatcatatc
21361 gtttcgcaat cacgagcatc atggcacggc tcgttgggcg ggatttggcg aaatgcggca
21421 cgccggttac ctgcagcgct ataatcgtat caaggggcca atctttggca agacgtgtgg
21481 ccctcgttgg ttcggcagct atctgaccaa tggcgaacag cctcacagtc ttgttgtcgc
21541 gccaacgcgc gccggcaaag gcgttggcgt cgttattccg acgctcctaa ccttcaaggg
21601 ttcggtgata gccctcgacg tcaaaggcga gctttttgag ctgacttcca gagcacgcaa
21661 agcgggccgc gacgccgttt tcaagttctc gcctctggat ccagagcgac ggactcattg
21721 ttacaatccc gtcctcgata ttgcagcttt accgcccgag cggcagttta ccgagacgcg
21781 ccgcctcgcc gcaaacctca tcacggccaa aggcaaggga gcggaaggct ttatcgacgg
21841 cgcgcgggac cttttcgttg cgggcatcct tacctgtatt gaccgcggta cgccaacaat
21901 tggtgcggtc tacgacttat ttgctcaacc tggagagaag tacaaacttt ttgcgcacct
21961 cgcggaagaa agccgaaata aagaggctca gcgcattttc gacaatatgg cgggtaacga
22021 cacgaaaatt ttgacttctt atacatcagt gctcggcgac ggcggactta acctgtgggc
22081 tgatccattg gttaaagcag cgacaagtcg atcagatttt tccgtctatg atctgcgtcg
22141 gaagaggacc tgcgtctatc tttgcgtcag tcccaacgac ctcgaggtcg tggcgccatt
22201 gatgcgcctc ctttttcagc aggtcgtgtc aattctgcag cgatcactgc caggtaaaga
22261 tgagcggcat gaagttttgt ttctccttga cgaatttaaa cacctgggga agcttgaggc
22321 aatcgagacc gcgattacaa ccatcgccgg ttacaaaggc cgcttcatgt ttattattca
22381 gagtctctcg gccttgacgg gcatctacga tgacgcgggc aaacaaaact ttctcagtaa
22441 cactggcgtg caagtattta tggccacggc cgacgacgaa acaccgacct atatctcaaa
22501 agctatcggc gactatacgt tcaaggcgcg ttcgacctca tacagtcaag cccgaatgtt
22561 cgatcataac atccagattt ccgatcaggg ggcaccgctt ttgcgccccg aacaagtgcg
22621 cttgcttgac gataacaatg aaatcgtcct tattaaaggg catccgcctc tcaaactacg
22681 aaaggtgcga tattattccg atcgtatgct gaggcgcctt ttcgaatgcc aaattggcgc
22741 cctccctgag cccgcatctt tgatgctttc ggaaggtgtc catcgggatg ggcaagacct
22801 cagtcaacaa gccgccgtca cggaggcagc tagtactcga tcgataccta ataatatgga
22861 agccgctaca ccgcaaaaca gtgaaatgga tgacgagcag gacagcctcc caactggaat
22921 tgacgtcccc cagggcctta ttgaaagcga tgaggtgaaa gaagacgccg gtggcgtggt
22981 gccagatttc ggtgtttcag ctgaaatggc tccagctatg attgcacaac agcagctgct
23041 ggagcagatc attgcgcttc agcaacgata tggacccgcg tcctcgcact cggtgaaatg
23101 acttcaagct gcctttcaca ttggaatcat tcgatgcggg gcgtcgtcag atatccggtt
23161 gtattcgaat gcggactaga accccgaggt tgccgattaa cagatatcgc catgagacct
23221 tcaggaaacc cgaacgtcga tcttagcggt tcgactgcat cgcttgccga agttcccgcc
23281 ggagctaccc ctgtccttaa tctaatcgag cccaggaacc gtccggctga cgactcgctt
23341 gagggccaaa ccgatcgcgg cgagcatcca tctgcatcat ttgactatga tggcatgaag
23401 cttggcgccg cggagcgtga agcatacgag aactggtgtc catcgaaccg gcctacatgg
23461 aaagatctgg tactcagggc gcgccttgat gcaatcgaca gttccgcttg gctccccgat
23521 ttgggcgagg agtcgccttt gatcttcaga tatgaaggga ttccgctggg tgagggggaa
23581 cggcaagcct acaaagaatg gcaagaggag gctcagccca catgggaaga cctcgttgtc
23641 aacgcacgaa tggcggtacc tgatccttgt gctgacgttg cagacgagca caatcccctc
23701 aaagaaggcg aggagtttcg gtctgaagcg tcgaaacgca agcggaaaaa accgatcgac
23761 caggacgaga attctcctac atcgttttac tatgacggga tgaggctcgg agaacccgag
23821 cgcgaggcat atgataactg gggcaacgcg gagccgccca cgtggaaaga cctggtactt
23881 aaggcgcgcc ttgatgcaat tgacagctcc gcctggctct ttgcttcaga agggtcttcc
23941 tcgacttttg agtatgaggg aattccactg ggtgaggggg aacggcaagc ctacaaagaa
24001 tggcaagagg acgctcagcc cacgtgggag gacctcgtca ttaatgcacg catggcagaa
24061 ctcgaccatc cttcttggat tacagacgag cacaattccc ttgaagaaaa cttagagttt
24121 cggcccgatg caagacaggc cagcctgaag gactcgaccg accagcggaa gagttcttcc
24181 gcgtcattta tctatgatgg aatgaagctc ggggaacccg agactgctgc atacgagaac
24241 tggagcaaac cggaacgacc gtcatgggaa gccctcatcc tagatgggcg ccaggcttcc
24301 atagcaagct cttcggtttc gaattcgtta cttgcaaaga catcctcgcc agtctttcta
24361 tacgagggaa tgtcgctagg ggatgcggaa cgtcaatcgt atccaaggtg gaggcagcct
24421 gcccaaccgc gatggcaaaa tcttgtggtg aacgcgcgcc ttgcggatct tgatccgttg
24481 gcctggattc ccgatgaaca tgatccgttt gcggaagccg aggcgcttag ccccacttcg
24541 caatcgagtg gcgccaacaa gtccaaccgc gctttgggtc aatcagattc aggccggccc
24601 gccttcgcat atctggcagc acaagaggcg agtcacctgc aatcaccggc atgctcacaa
24661 ttggaaacga ggcgtgcatt aaatttcggg tcgcctggac cggatgcaaa tccaacggaa
24721 agcatcgcca aatgtaatcg tttggatggc gtcagtaaaa ttaagcgatt aggcaccaaa
24781 ggccgccggg cagtaaacgc gactattcac ggcggcaagt tcggtgcgca aggactgttg
24841 tccgaagatt gcggacaagc ggctgagccc tctccatcgg aacaaactac tcggccgcga
24901 accgataata ttggtaccta tgcaagccgg aaaaacgaac gagctcggct ggctacagag
24961 accgggaaat acgaatcgga gcacattttc ggatttaagg tcgtccacga tactttgcgg
25021 gcgaccaaag aaggccgtcg tctcgaaagg ccaatgccgg catacctcga atacaaggag
25081 cttcatcggc aacacgtagg gacaggaaga ggacggactg gactggtcgg gcgcggatgg
25141 ccagatgatg cgagctatcg cgcagatcaa agggcaaccc tgtcggaccc tgttgcgtcc
25201 gcggaaggtg caacagcttc aaatgggtac cagttgaacc agctgggcta cgcgcaccag
25261 ctcgccaccc atggtctcca aagtgaaacg cccgatgggg ttaccatgcc aattcaagtt
25321 gcgacaatta gctacaacta tacagtgagt cgtgatccgg tcctttcacc acccagcaaa
25381 gaacaagcgc ccccattgct acacctgggc cttcgtgggc aaacggaagc tgtgcttgcc
25441 cgcgagaccg cattgaccgg aaaatggcca actctcgagc gtgagtgtca agtgtatcag
25501 cggtttttgg ccctgtacga cgtaaagaag gatctggacg ccaaacaact tggtgtacgg
25561 cagaaaaagc aggcactcgt tgcggcgttg aaccggaccg ccggcttaat tggcgcgtca
25621 cctttggaag cccaatcgtc gaccgctgag gttgaatata ctaccgacga gcccgatgaa
25681 cgacgggttt atgatccgcg cgatcgaggc agagacaaag cgttcaatcg ctgacaaagt
25741 cgcctagaca atgattgtta cggacttcgc aatcgatttc ttgacagcag tgccattcaa
25801 cttttgatcc accatcggaa aataatcggc ggagaaacgc acgacagctt ctattgccaa
25861 gccccgtcga tcgatctcat ggcatattcc cgttatggtc gatttttcga ggggccgcgc
25921 aacctctctc agcaaagatg aagttctcgc gggtccgcca tgggtctcca ggaaaatcct
25981 gggcgtcacg gtcggtaaag aattgaagac tttaagcaaa cgcgaacttg acgttcaaag
26041 attacgcgcc gcggaacgcg ccctcgataa acaagtctgg cagaatcctc ccgttaaccc
26101 ctcccgtcga aacgaggaga tgccaatagt catcttcgaa acgaacgcga tgagcggcta
26161 gagtcggaga tgacgatcgc gcatcaagcc gcttccgtct ccgttggcga ggatcgccaa
26221 gctcttgatg ccgctcgatc gaccttgaat gaactgcaca actcgccatc gtcggatgat
26281 agggcaaact tgagtttgcc gagagccgaa atttgcgacc gtacaacgta tccgccgtct
26341 ccatcccatt gaatacggat atcctgagca ttaggtccgt gccaatacgt ctccggcaag
26401 agctgcagat ttggcatggc ggatgatgag ggatttgctg ggcgctttga gattgtcgag
26461 gcgcgtcgag gaatactatc cccctcgaca agcgaagagt ctgatccgtg gggtcgaggc
26521 ctgcagaaca ccgtaaacgc taacggcggg atcccgaatg accagtcatt gtggcattcc
26581 gaaagaatta cattcacacg gcaccggtta ctcgatcact atctcctgcg gccttattcg
26641 ataaagtcgt gctaactccg tgacttggca agaattacat atgaaacgcg atattcgttt
26701 aatgacggag gctcggatgg tagattcacc gaaggctcaa ttctattagg agtcttgatt
26761 gtgtccgaag agtgactgcc gctgcagcag cctagaaagc aatgcacggt gatgatgttg
26821 atcggacggc tttcgtcccg cgcaggctgc cggctgaagc cgcttacgtc gatagtcaag
26881 ttgacctggc ggcaaaccga caacctcact gtgtgacttg ttttgagcga cttttggttc
26941 aagagaaaga tgaaaaccgt cgacgattgt gggcttccgg caaagcatgc gccacgtctg
27001 aagagagttg tgcaggaggc cctacactcg caccgggcca ttcggggcca tgaagatgga
27061 gcaaacctta atttgcaccg ataattttcc gttcgatgga tttgggctgg gcaggatcgg
27121 agcggaaggc actaggaaaa tcgtaatttc tgtaggagaa acaatgatat ggtgatcatc
27181 aagctaaatg cgaataaaaa catgcctgtc ttggcggttg agaagccgca agaaattcac
27241 aaagaggagt tgagcgacca tcaccagtca aatggcttta cgagtttgga tctcgaaatg
27301 attgaactgg agaattttgt ccttcactgc ccgctccccg aagaaaacct agccggctaa
27361 aaggagttag acgatggatc cgaaggccga aggcaatggt gaaaatatca ctgagaccgc
27421 agcaggcaat gtcgaaactt ctgatttcgt gaatttgaag cgccagaaga gggagggcgt
27481 aaattccacc gggatgtccg aaattgatat gacgggtagc caagaaactc ccgaacacaa
27541 catgcacgga agcccgactc acacggatga tctcggcccg cggttggatg cggacatgct
27601 cgattctcag tcaagtcatg tttctagcag cgctcaaggc aatcggtctg aggttgaaaa
27661 tgagctatcc aacttattcg cgaagatggc tttaccaggc catgatcggc gtaccgacga
27721 gtatattctt gtgcggcaaa ccggacaaga caagttcgca ggtactacta aatgtaacct
27781 cgatcatctg cccaccaagg cggaattcaa tgcgagctgc cggctctata gggatggagt
27841 cggcaactac tatcccccgc ccctcgcatt cgagaggatc gatatcccgg agcaattggc
27901 tgcacaattg cataacctgg agccaagaga acagagtaaa cagtgttttc agtacaagtt
27961 ggaagtctgg aatcgcgctc acgcagagat gggcatcact ggcaccgaca tcttctatca
28021 aacagacaag aatattaagc tcgaccgaaa ttataaattg aggcctgagg atagatatat
28081 acaaacagag aaatacgggc gcagagaaat tcaaaaacgc tatgagcacc agtttcaagc
28141 tggttcactg ctgccggata tcttaatcaa gaccccgcaa aatgatatac atttctcgta
28201 caggtttgcg ggcgaccgtt acgctaacaa gcgatttgag gaattcgaac gcgcaatcaa
28261 aactaaatac ggtagcgata ccgagatcaa gctcaaatcc aaatctggga ttatgcatga
28321 ctccaaatat ttggaatcat gggagcgggg cagtgcggat atccgtttcg cagagttcgc
28381 cggcgagaat cgagctcaca acaagcagtt tccggctgcg actgtgaata tgggaaggca
28441 gccagatggc cagggaggga tgactcgcga tcgccatgta agcgttgact acctattgca
28501 aaacctaccc aactcccctt ggacgcaagc cttgaaagag ggaaagttgt gggatcgagt
28561 tcaggtcctt gctcgcgacg gaaaccgtta catgtcacct tcaagactgg aatattccga
28621 ccccgaacac tttacccaac tgatggatca agttggtctg cccgtgtcga tgggtcggca
28681 aagtcatgcg aatagtgtca agtttgagca gtttgacaga caggcagcgg ttattgttgc
28741 ggatggcccg aacttacgtg aggttccaga tttgtccccg gaaaagttgc aacaactgtc
28801 tcaaaaagat gtcctgatag cggatcgcaa tgaaaagggg caaagaaccg gcacttacac
28861 taatgttgtg gaatatgagc gcctgatgat gaaattaccg agcgacgcag cgcagcttct
28921 cgctgaaccg tccgatagat attcacgtgc ttttgtccgg ccggagccag cattgccccc
28981 catcagtgac agccggcgga cttatgaaag ccgaccgcgc ggcccaaccg taaacagtct
29041 gtagatttcc tgataccgcg tcagtgacga agccctttag ttcaaaatat gcatttgaaa
29101 ggattcaaca atggtcgaca cgacgaagaa gagtgtcgcg aagtcgctta cggctgacat
29161 gcgccgttct gctaagcggc tttctaagca aatgcgtaaa gcctcgctta ctgaagagga
29221 ggcaacaagg aatctagccc ggctcgaaac gccggaccag aagcgaaaat atgtcgccga
29281 tatgcagata atcgacaagc tggaagacgg ctttcgaggc gaaataagct ataaaatgct
29341 gggaaataaa cagcttcggg tcgacagccc aaaagaatta acgcgcgagc atggtataat
29401 aagaaaaaca agaaaggttc tgaagcgtaa cgcagagact ggcaatgttt acttgggtct
29461 ccacgaaaag aagacctgga ggagcgttag cagccatctt tatgccgagg acggtacact
29521 tcgcgcgaag catgtgaaat acaaagacgg acgctttgaa gaaaaatggg aacgagacga
29581 aaatggcctg ctgttccgca cgcagtttgt caaccgaaat cggctatttc aacctatttc
29641 cgagaaggtc agcacaccct accggagcgg accggaaaac cggctctttc gtgaactaac
29701 ccgtcgaaaa ggttccaagc aggaaacttt tgagcgggac gaaaaaggca acctcgagct
29761 catcggcagc aaacgtctcg gcttttccaa gaattcgacg aa
//
LOCUS HUMPCOLLA2 700 bp ds-DNA PRI 27-JUL-1990
DEFINITION Human procollagen alpha2(I) chain gene, exon 6.
ACCESSION M35391
KEYWORDS procollagen alpha2(I).
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 700)
AUTHORS Weil,D., D'Alessio,M., Ramirez,F. and Eyre,D.
TITLE Structural and functional characterization of a splicing mutation
in the pro-alpha2(I) collagen gene of an Ehlers-Danlos type VII
patient
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by F.Ramirez, 19-JUN-1990.
FEATURES from to/span description
pept / 6 / 59 procollagen alpha2(I) chain, exon 6 (AA at 6)
/hgml_locus_uid="LP0002V" /nomgen="COL1A2"
/map="7q21.3-q22.1"
IVS < 1 5 procollagen alpha2(I) chain intron E
IVS 60 > 700 procollagen alpha2(I) chain intron F
BASE COUNT 235 a 120 c 101 g 244 t
ORIGIN
1 tctagaactt tgctgctcag tatgatggaa aaggagttgg acttggccct ggaccaatgg
61 tatgcttatc tgtttatctt agccaaaaaa attgctaaat aaatcattca ttttatgtca
121 cattttacca cgccatttat ttagctacct aagttaacac tcaatactta gattatataa
181 aaaacaactc tttttgtttt caaatttatg aaaacataag ttaaggagtt cacttttctt
241 tacaaaagaa agattaattg atcttttatg attatatgat ctttttgatt atatgatcct
301 cattaagata gatcatatac ttatgtccga agaaataatc tttcggcaat atgccaaatc
361 aattcgggaa atcaatttaa tttcaaaagt aatccatctg attaattttt catatcttca
421 atcggtagat ttagcaacac tagaggcata gaactattta ttaagttctc tgaacttgtt
481 ggaaaggatc aacaagttct atctagtcca gctaactcat ttttaaaatg ggagagttta
541 agcccttttc tcaaagtcat ccaggtaact aatgacataa ctagaactag atgccaggca
601 agatgtctaa tatttgctta catcatggtt tatgtaccta gtccttgaat aaaccactca
661 tttagtcaac agatattaat cagatgcctt caatgggccc
//
LOCUS MUSC4P 1739 bp ds-DNA ROD 27-JUL-1990
DEFINITION Mouse complement C4 5' flank.
ACCESSION D90052
KEYWORDS C4; complement C4; promoter.
SOURCE Mouse (strain B10.BR) genomic DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1739)
AUTHORS Nakayame K Pattanakitsakul,S.-N., Yokoyama,S., Kimura,H.,
Nosaka,M. and Takahashi,M.
TITLE Post-transcriptional regulation of the complement C4 gene in the
low C4-producing strain of mouse
JOURNAL Immunogenetics 31, 361-367 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Masaru
Nonaka Cancer Research Institute, Kanazawa University 13-1
Takaramachi Kanazawa 920 Japan Phone: 0762-62-8151
FEATURES from to/span description
site 1691 1691 transcription initiation site
BASE COUNT 376 a 464 c 476 g 423 t
ORIGIN chromosome 17.
1 gtcaacagtt attgcacatg cttaatgtaa acctgtagcc tctacatgct tgtgtgcctg
61 cccagacagt atatgacacc cacacccaca cccacaccca cacccacccc accccacttc
121 acacccaggc agcagtagtg gacccaacag ggtaagtggg tccctgcttg ggttgcctgt
181 gtttctggaa gctcaaggtc catgcatcag cagtagggga agggagggtt ctcagcccag
241 gctgcatcat catggccgcc taccacctgc aggcacctgg tgaacgctgg ggtcctcact
301 gtccgagatg ctggaagctg gtggctggct gtgcctggag ctgggagatt catcaagtgc
361 tttgttaaag gtatggtgcc tgaggagcaa cacccatggg gggtctttag gtctccaaat
421 ttgactcact tcttcctttt cctggacctt ctcctccagg gcgccaggct gtactgagca
481 tggtgtggaa ggccaagtac tgggagcttg ccctgtcaga gctcctgggc cgcagggccc
541 ctttggcagt gcggctaggt cttgcctacc atgtgcacga cctcattgga gcccagctgg
601 tggactggtg agtctagtcc tgagacttgg gaaacgtgca gggacccagg gttggtgtcc
661 ccctaagtgt tgcttctgtc atcagtgaag ccaagattct ctcattcccg gactcgggag
721 aagactggac taaattctgg atttgcacac tttgggtcta ggcagctggc tccagtcttg
781 tctttctgtt acacagtgtc cccaccactt ctggaaccct ccttcgcctg ccagatacat
841 gaggagccaa tgcttggatt ctgcaactca ctgagtgagg ttcctggaag tgccacccca
901 gggtggctga gcaagtcacc gcagtgggtg ccaggctcta ctgctgcaag ctgggcttct
961 acctgagccg ggctgtgggc attgcagctc ttgcttctgt gcgtgtggag tcaggagccg
1021 tgccaagggg atgagaaggt gggattgcta gagacactgg agcagggagt agaaaactct
1081 gcccttcacg tcaggctgaa attgccaaat aaaatacttg tgcctgtact tctttctgtg
1141 tcctttgttc agtgtggtgt atgtacggct gtgtgaagcc caacttgggg ctgagcattt
1201 catcattgtt tagagagaga acaaggctta ctcaggccaa tttccctgtc tggggtgaga
1261 gtgtacctca tcttctgagc tataggcttt ctgttagagc cataccaagc agaaggctgc
1321 cttgaatatg aggccatcga ctggggaacc cattctagtt ttgttaggac ctttacctgt
1381 cttatctatg tgtcatgttg gagctgaata cacgcacaca tggacaagca catgtattcg
1441 tacacacaca cacacacacg cacacgcaca caaacacaca cacaagggga agtccctggt
1501 actgggaata aacaagcacc agatcaattt tggggccaag atttctacag agaccctggt
1561 gtttgggttg cctccccagc tctgggccta gggccagttc tgtttcttgg ccatcacgtg
1621 gtttcccggc tcatggggca aagggagggg caaggtccag atcaaactct gccccaaccc
1681 taggttgatc agaagggagc agacagtcag accagacagg tctgaccttt cctggatcc
//
LOCUS BSURPOF 3078 bp ds-DNA BCT 27-JUL-1990
DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB),
complete cds.
ACCESSION M34995
KEYWORDS RNA polymerase; minor sigma-37 factor.
SOURCE B.subtilis (strains 168 and Marburg), clones
lambda-[21,gt11,1,11,21,31].
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 368 to 2297)
AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W.
TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus
subtilis RNA polymerase: Isolation, nucleotide sequence,
chromosomal locus, and cryptic function
JOURNAL J. Bacteriol. 169, 771-778 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 3078)
AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W.
TITLE Similar organization of the sigB and spoIIA operons encoding
alternate sigma factors of Bacillus subtilis RNA polymerase
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.W.Price, 18-FEB-1987. Draft entry and computer-readable
sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990.
FEATURES from to/span description
pept 200 529 ORF V
pept 526 1008 ORF W
pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg
start codon)
pept 1762 2361 ORF X
mRNA 131 2368 sigB operon mRNA
signal 131 168 sigB operon promoter
signal 2368 2401 sigB transcription termination signal
binding 188 194 ORF V ribosomal binding site
binding 511 519 ORF W ribosomal binding site
binding 954 961 sigB ribosomal binding site
binding 1750 1753 ORF X ribosomal binding site
BASE COUNT 999 a 575 c 755 g 749 t
ORIGIN 40 degrees on B.subtilis map.
1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt
61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt
121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg
181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat
241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct
301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat
361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg
421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt
481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct
541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg
601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc
661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg
721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa
781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac
841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat
901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg
961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga
1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg
1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca
1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc
1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg
1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc
1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga
1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg
1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac
1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca
1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct
1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat
1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga
1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta
1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct
1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac
1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc
1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg
2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt
2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc
2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata
2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta
2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat
2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt
2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg
2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac
2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga
2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac
2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca
2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag
2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa
2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt
2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt
2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc
3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg
3061 caaactgctg cgaagctt
//
LOCUS HUMERPR 1624 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human erythropoietin receptor mRNA, complete cds.
ACCESSION M34986
KEYWORDS erythropoietin receptor.
SOURCE Human cell line OCI-M1, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1624)
AUTHORS Ehrenman,K. and St John,T.
TITLE Sequence of the human erythropoietin gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.St.John, 07-JUN-1990.
Author address: T.St.John
Hutchinson Cancer Res Center
1124 Columbia St.
Seattle, WA
Email: TSTJOHN@FHCRCVAX
FEATURES from to/span description
pept 1 1527 erythropoietin receptor precursor
sigp 1 72 erythropoietin receptor signal peptide
matp 73 1524 erythropoietin receptor
BASE COUNT 276 a 537 c 498 g 313 t
ORIGIN
1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct
61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc
121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg
181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc
241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt
301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta
361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat
421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc
481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac
541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag
601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc
661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg
721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg
781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag
841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac
901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc
961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg
1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc
1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg
1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc
1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct
1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca
1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct
1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta
1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg
1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata
1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc
1621 agga
//
LOCUS HUMGFP40H 4379 bp ds-DNA PRI 27-JUL-1990
DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete
cds.
ACCESSION M30135
KEYWORDS T-cell growth factor; cytokine; mast cell growth factor;
megakeryoblastic leukemia cell growth factor.
SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4379)
AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
Van Snick,J.
TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic
organization, and comparison with the mouse gene
JOURNAL J. Immunol. 144, 4235-4241 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Van Snick, 12-DEC-1989.
FEATURES from to/span description
pept 796 909 P40 T-cell and mast cell growth factor
precursor, exon 1
1022 1057 P40 T-cell and mast cell growth factor
precursor, exon 2
1140 1172 P40 T-cell and mast cell growth factor
precursor, exon 3
2461 2592 P40 T-cell and mast cell growth factor
precursor, exon 4
4096 4215 P40 T-cell and mast cell growth factor
precursor, exon 5
sigp 796 849 P40 T-cell and mast cell growth factor signal
peptide
matp 850 909 P40 T-cell and mast cell growth factor
1022 1057 P40 T-cell and mast cell growth factor
1140 1172 P40 T-cell and mast cell growth factor
2461 2592 P40 T-cell and mast cell growth factor
4096 4212 P40 T-cell and mast cell growth factor
IVS 910 1021 hP40 intron A
IVS 1058 1139 hP40 intron B
IVS 1173 2460 hP40 intron C
IVS 2593 4095 hP40 intron D
signal 742 747 TATA box
signal 4354 4359 polyA signal
rpt 2965 3265 Alu repeat
binding 370 376 AP-2 binding site
binding 624 629 IRF-1 binding site
binding 627 633 AP-1 binding site
BASE COUNT 1208 a 921 c 904 g 1346 t
ORIGIN
1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt
61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc
121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca
181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct
241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact
301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc
361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag
421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa
481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt
541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt
601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa
661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt
721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc
781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc
841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac
901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat
961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta
1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac
1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg
1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg
1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat
1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt
1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt
1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag
1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa
1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc
1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg
1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc
1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc
1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat
1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac
1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc
1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg
1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt
2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc
2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga
2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc
2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa
2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa
2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt
2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag
2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg
2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac
2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt
2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc
2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc
2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct
2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag
2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa
2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg
3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg
3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat
3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca
3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc
3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt
3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac
3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg
3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc
3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt
3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca
3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca
3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact
3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga
3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt
3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc
3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc
3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg
4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa
4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac
4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg
4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc
4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga
4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa
//
LOCUS MEFDNABP 1710 bp ds-DNA BCT 27-JUL-1990
DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds.
ACCESSION M34778
KEYWORDS DNA binding protein HMfB.
SOURCE M.fervidus DNA.
ORGANISM Methanothermus fervidus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanothermaceae.
REFERENCE 1 (sites; for [2])
AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
TITLE DNA binding protein HMf, isolated from the hypothermophilic
archaeal Methanothermus fervidus is most closely related to
histones
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1710)
AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Sandman, 01-JUN-1990.
Author address: K.Sandman
Department of Microbiology
Ohio State University
484 W. 12th Ave.
Columbus, Ohio 43210
E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU
FEATURES from to/span description
pept 578 787 DNA binding protein HMfB
pept 1662 805 (c) ORF285
signal 450 457 promoter
signal 523 530 promoter
signal 790 814 transcription termination signal
BASE COUNT 619 a 263 c 265 g 561 t 2 others
ORIGIN 1 bp upstream of HindIII site.
1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa
61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag
121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag
181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg
241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat
301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat
361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa
421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt
481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt
541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat
601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct
661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag
721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa
781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat
841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc
901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt
961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat
1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa
1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg
1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata
1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc
1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc
1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc
1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat
1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc
1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt
1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt
1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa
1681 gttttgtttt acccattgct tctaaatatt
//
LOCUS MUSINT1A 5607 bp ds-DNA ROD 27-JUL-1990
DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds.
ACCESSION K02593 M34750
KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene.
SOURCE Mouse (BALB/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1098 to 5606)
AUTHORS Van Ooyen,A. and Nusse,R.
TITLE Structure and nucleotide sequence of the putative mammary oncogene
int-1; Proviral insertions leave the protein-encoding domain intact
JOURNAL Cell 39, 233-240 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 5607)
AUTHORS Nusse,R., Theunisseen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A.,
Otte,A., Schuuring,E. and Van Ooyen,A.
TITLE The promoter of the Wnt-1 (int-1) oncogene and its mechanism of
activation by insertion of proviral DNA of the mouse mammary tumor
virus
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [2] kindly submitted
by R.Nusse, 31-MAY-1990.
FEATURES from to/span description
pept 1779 1882 Wnt-1 protein (exon 1)
2452 2705 Wnt-1 protein (exon 2)
3279 3544 Wnt-1 protein (exon 3)
4002 4490 Wnt-1 protein (exon 4)
IVS 1883 2451 Wnt-1 cds intron A
IVS 2706 3278 Wnt-1 cds intron B
IVS 3545 4001 Wnt-1 cds intron C
site 1133 1134 MMTV insertion site in tumor 17 [1]
site 1394 1395 MMTV insertion site in tumor 35 [1]
site 1631 1632 MMTV insertion site in tumor 102 [1]
site 4495 4496 MMTV insertion site in tumor 53 [1]
mRNA 1428 5398 Wnt-1 mRNA (alt.)
mRNA 1595 5398 Wnt-1 mRNA (alt.)
signal 1562 1566 TATA box
signal 1573 1577 TATA box
BASE COUNT 1188 a 1681 c 1519 g 1219 t
ORIGIN Chromosome 15.
1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc
61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac
121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac
181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa
241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg
301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg
361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac
421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct
481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact
541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac
601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag
661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg
721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga
781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca
841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact
901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc
961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca
1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg
1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga
1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg
1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg
1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc
1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc
1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg
1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt
1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa
1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag
1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact
1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt
1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc
1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg
1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg
1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag
1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct
2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc
2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct
2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt
2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg
2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg
2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg
2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt
2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga
2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat
2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg
2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa
2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc
2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag
2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg
2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg
2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc
3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag
3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag
3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact
3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac
3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat
3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc
3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg
3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc
3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg
3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg
3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag
3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag
3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt
3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca
3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac
3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt
3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc
4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc
4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc
4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg
4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat
4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt
4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc
4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg
4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc
4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg
4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc
4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga
4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc
4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac
4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac
4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc
4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa
4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc
5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc
5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac
5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga
5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac
5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct
5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat
5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca
5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga
5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat
5581 gaccctaaat gagaccaact agggatc
//
LOCUS MUSP40M 3808 bp ds-DNA ROD 27-JUL-1990
DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete
cds.
ACCESSION M30136
KEYWORDS T-cell growth factor; cytokine; mast cell growth factor.
SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210
DNA, clone lambda-S40.1a.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3808)
AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
Van Snick,J.
TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic
organization, and comparison with the mouse gene
JOURNAL J. Immunol. 144, 4235-4241 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Van Snick, 21-NOV-1989.
FEATURES from to/span description
pept 644 757 P40 T-cell and mast cell growth factor
precursor, exon 1
846 881 P40 T-cell and mast cell growth factor
precursor, exon 2
965 997 P40 T-cell and mast cell growth factor
precursor, exon 3
2184 2315 P40 T-cell and mast cell growth factor
precursor, exon 4
3393 3512 P40 T-cell and mast cell growth factor
precursor, exon 5
sigp 644 697 P40 T-cell and mast cell growth factor signal
peptide
matp 698 757 P40 T-cell and mast cell growth factor
846 881 P40 T-cell and mast cell growth factor
965 997 P40 T-cell and mast cell growth factor
2184 2315 P40 T-cell and mast cell growth factor
3393 3509 P40 T-cell and mast cell growth factor
IVS 758 845 mP40 intron A
IVS 882 964 mP40 intron B
IVS 998 2183 mP40 intron C
IVS 2316 3392 mP40 intron D
signal 589 594 TATA box
binding 227 233 AP-2 binding site
binding 475 480 IRF-1 binding site
binding 478 484 AP-1 binding site
BASE COUNT 1012 a 903 c 869 g 1024 t
ORIGIN Chromosome 13.
1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca
61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt
121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag
181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc
241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga
301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt
361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt
421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga
481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg
541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta
601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat
661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg
721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga
781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac
841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag
901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt
961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc
1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct
1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat
1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga
1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa
1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac
1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac
1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc
1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca
1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta
1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac
1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc
1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct
1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta
1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat
1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg
1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc
1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg
2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa
2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt
2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt
2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa
2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca
2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc
2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag
2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg
2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct
2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag
2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg
2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag
2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat
2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc
2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct
2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc
3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga
3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg
3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga
3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa
3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt
3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc
3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc
3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga
3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg
3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag
3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga
3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac
3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat
3781 aggtaggatg gagcaaagac ttggtaat
//
LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 27-JUL-1990
DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete
cds.
ACCESSION M29938
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone
ZQ30-1.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1105)
AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
LeGuern,C.
TITLE Class II genes of miniature swine: III. Characterization of an
expressed pig class II gene homologous to HLA-DQA
JOURNAL Immunogenetics 31, 52-56 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept 13 777 MHC SLA-DQ alpha chain precursor
sigp 13 81 MHC SLA-DQ alpha chain signal peptide
matp 82 774 MHC SLA-DQ alpha chain
mRNA < 1 1105 MHC SLA-DQ mRNA
BASE COUNT 254 a 290 c 255 g 306 t
ORIGIN
1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc
61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta
121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag
181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa
241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg
301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact
361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac
421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt
481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc
541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat
601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag
661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc
721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc
781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa
841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc
901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg
961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc
1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
1081 aaacaataaa tcacttcatg aggtt
//
LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 27-JUL-1990
DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete
cds.
ACCESSION M29939
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1174)
AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
LeGuern,C.
TITLE Class II genes of miniature swine: III. Characterization of an
expressed pig class II gene homologous to HLA-DQA
JOURNAL Immunogenetics 31, 52-56 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept 69 836 MHC SLA-DQ alpha chain precursor
sigp 69 137 MHC SLA-DQ alpha chain signal peptide
matp 138 833 MHC SLA-DQ alpha chain
BASE COUNT 273 a 313 c 266 g 322 t
ORIGIN
1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg
61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga
121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg
181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat
241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta
301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca
361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt
421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca
481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt
541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca
601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata
661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga
721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct
781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca
841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac
901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct
961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga
1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc
1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
1141 aaacaataaa tcacttcatg aggtttttgt actg
//
LOCUS PIGMHDQBA 212 bp ds-DNA MAM 27-JUL-1990
DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end.
ACCESSION M29944
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F.,
Gustafsson,K. and LeGuern,C.
TITLE Class II genes of miniature swine: II. Molecular identification and
characterization of B (beta) genes from SLA-c haplotype
JOURNAL Immunogenetics 31, 1-6 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept < 1 12 MHC SLA-DO beta chain
BASE COUNT 38 a 58 c 52 g 64 t
ORIGIN
1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg
61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc
121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct
181 cttacccttc tgcctttctt cctcctactt gt
//
LOCUS PP2LEND 266 bp ds-DNA PHG 27-JUL-1990
DEFINITION Bacteriophage P2 left end DNA.
ACCESSION M34756
KEYWORDS .
SOURCE Bacteriophage P2 DNA.
ORGANISM Bacteriophage P2
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 266)
AUTHORS Ziermann,R. and Calendar,R.
TITLE Characterization of the cos sites of Bacteriophages P2 and P4
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Ziermann, 05-JUN-1990.
Author address: R.Ziermann
University of California
Dept. Mol. Cell Biology
Berkeley, CA 94720
Email: calendar@garnet.berkeley.edu
FEATURES from to/span description
site 1 19 cohesive end of phage DNA
site 20 266 left end of phage DNA
BASE COUNT 41 a 58 c 99 g 68 t
ORIGIN Map position 0.0-0.8%.
1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct
61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt
121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg
181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc
241 gagccagccg tttatctcgc ggatcc
//
LOCUS RCABCHH 314 bp ds-DNA BCT 27-JUL-1990
DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and
F108) genes, 3' end, and 5' end respectively.
ACCESSION M34843
KEYWORDS .
SOURCE R.capsulata DNA.
ORGANISM Rhodobacter capsulatus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 314)
AUTHORS Yang,Z. and Bauer,C.E.
TITLE Rhodobacter capsulatus genes involved in early steps of the
bacteriochlorophyll biosynthetic pathway
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.E.Bauer, 04-JUN-1990.
FEATURES from to/span description
pept < 1 79 bacteriochlorophyll biosynthesis protein
(bchH) (AA at 2)
pept 108 > 314 bacteriochlorophyll biosynthesis protein
(F108)
BASE COUNT 59 a 87 c 119 g 49 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga
61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg
121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg
181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg
241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca
301 agcgcgtgct gcag
//
LOCUS TREANTAT 999 bp ds-DNA ORG 27-JUL-1990
DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle.
ACCESSION M34848
KEYWORDS .
SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA.
ORGANISM Kinetoplast Trypanosoma evansi
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Trypanosoma evansi.
REFERENCE 1 (bases 1 to 999)
AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
Steinert,M. and Hamers,R.
TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Bajyana Songa, 05-JUN-1990.
FEATURES from to/span description
site 367 516 kinetoplast minicircle common region (ORI)
BASE COUNT 391 a 78 c 203 g 327 t
ORIGIN
1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS VIBSCRRE 1524 bp ds-DNA BCT 27-JUL-1990
DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds,
and sucrose transport (scrA) protein, 5' end.
ACCESSION M35009
KEYWORDS Scr repressor protein; sucrose transport protein.
SOURCE V.alginolyticus DNA.
ORGANISM Vibrio alginolyticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 1524)
AUTHORS Blatch,G.L. and Woods,D.R.
TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr
repressor-encoding gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.L.Blatch, 10-JUN-1990.
Author address: G.L.Blatch
University of Cape Town
Dept. Microbiology
Private Bag
Rondebosch
7700 SOUTH AFRICA
Email: bltgre01.uctvax@f4.n494.z5.fidonet.org
FEATURES from to/span description
pept 36 < 1 (c) sucrose transport protein (scrA)
pept 271 555 Scr repressor protein (scrR)
pept 791 1090 ORF5
binding 264 267 scrR ribosomal binding site
binding 780 785 ORF5 ribosomal binding site
signal 178 183 scrA -10 region
signal 203 208 scrA -35 region
signal 190 195 scrR -10 region
signal 162 167 scrR -10 region
signal 762 767 ORF5 -35 region
signal 742 747 ORF5 -35 region
site 78 91 repressor operator site (put.)
site 229 242 repressor operator site (put.)
site 12 71 ScrR helix-turn-helix motif
BASE COUNT 466 a 344 c 321 g 393 t
ORIGIN
1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc
61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg
121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt
181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc
241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg
301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa
361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc
421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg
481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt
541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga
601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt
661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac
721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag
781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt
841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga
901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg
961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga
1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta
1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc
1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt
1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt
1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat
1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct
1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca
1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa
1501 gccaattgtg atattgaaga agct
//
LOCUS XELNRP1A 2431 bp ss-mRNA VRT 27-JUL-1990
DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA,
complete cds.
ACCESSION M34894
KEYWORDS RNA-binding protein; nervous system-specific protein.
SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2431)
AUTHORS Richter,K., Good,P.J. and Dawid,I.B.
TITLE A developmentally regulated, nervous system-specific gene in
Xenopus encodes a putative RNA-binding protein
JOURNAL New Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.Good, 05-JUN-1990.
FEATURES from to/span description
pept 74 1117 pot. RNA-binding protein (nrp-1B)
mRNA < 1 2431 nrp-1B mRNA
signal 2407 2418 poly-A signal
site 22 27 RNP2 consensus sequence
site 58 68 RNP1 consensus sequence
site 111 116 RNP2 consensus sequence
site 147 157 RNP1 consensus sequence
site 921 941 alternative exon (put.)
BASE COUNT 663 a 522 c 543 g 703 t
ORIGIN
1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt
61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc
121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact
181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt
241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa
301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt
361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt
421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga
481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac
541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa
601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt
661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat
721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc
781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc
841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc
901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag
961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag
1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc
1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct
1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct
1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga
1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat
1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg
1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat
1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta
1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa
1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat
1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac
1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg
1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc
1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat
1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc
1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga
1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct
2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg
2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc
2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat
2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg
2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca
2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata
2401 gttcaaaata aactatgaat tttattcatt t
//
LOCUS XELNRP1B 2324 bp ss-mRNA VRT 27-JUL-1990
DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA,
complete cds.
ACCESSION M34895
KEYWORDS RNA-binding protein; nervous system-specific protein.
SOURCE X.laevis stage 24, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2324)
AUTHORS Richter,K., Good,P.J. and Dawid,I.B.
TITLE A developmentally regulated, nervous system-specific gene in
Xenopus encodes a putative RNA-binding protein
JOURNAL New Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.Good, 05-JUN-1990.
FEATURES from to/span description
pept 29 1051 pot. RNA-binding protein (nrp-1B)
mRNA < 1 2324 nrp-1B mRNA
signal 2306 2311 poly-A signal
site 22 27 RNP2 consensus sequence
site 58 68 RNP1 consensus sequence
site 111 116 RNP2 consensus sequence
site 147 157 RNP1 consensus sequence
BASE COUNT 647 a 467 c 520 g 690 t
ORIGIN
1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc
61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac
121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt
181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca
241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga
301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat
361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca
421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg
481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt
541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc
601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat
661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag
721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac
781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg
841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc
901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat
961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat
1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg
1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct
1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat
1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta
1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa
1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa
1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct
1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa
1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc
1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact
1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg
1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt
1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct
1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag
1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag
1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt
1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta
2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac
2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt
2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat
2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa
2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc
//
LOCUS YSJSACDHY 2569 bp ds-DNA PLN 27-JUL-1990
DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds.
ACCESSION M34929
KEYWORDS saccharopine dehydrogenase.
SOURCE Y.lipolytica (strain W29) DNA.
ORGANISM Yarrowia lipolytica
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2569)
AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and
Gaillardin,C.
TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia
lipolytica
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.M.Fournier, 06-JUN-1990.
ORF1 is probably not translated, since a lacZ fusion/expression
experiment did not produce results.
FEATURES from to/span description
pept 649 1929 ORF1
pept 1764 655 (c) saccharopine dehydrogenase
mRNA 582 2060 ORF1 mRNA (alt.)
mRNA 598 2060 ORF1 mRNA (alt.)
mRNA 599 2060 ORF1 mRNA (alt.)
mRNA 602 2060 ORF1 mRNA (alt.)
mRNA 606 2060 ORF1 mRNA (alt.)
mRNA 609 2060 ORF1 mRNA (alt.)
mRNA 1807 639 (c) ORF2 mRNA (alt.)
mRNA 1809 639 (c) ORF2 mRNA (alt.)
mRNA 1811 639 (c) ORF2 mRNA (alt.)
mRNA 1812 639 (c) ORF2 mRNA (alt.)
mRNA 1814 639 (c) ORF2 mRNA (alt.)
signal 521 528 TATA box
signal 565 575 TATA box
BASE COUNT 668 a 573 c 666 g 662 t
ORIGIN
1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag
61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt
121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca
181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa
241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag
301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc
361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc
421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt
481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca
541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt
601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc
661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt
721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc
781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc
841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac
901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt
961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca
1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt
1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc
1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt
1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata
1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc
1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc
1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct
1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg
1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg
1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc
1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt
1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag
1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg
1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag
1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg
1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga
1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa
2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca
2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg
2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc
2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct
2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag
2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga
2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat
2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat
2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc
//
LOCUS HUMMHDQ1A 1515 bp ds-DNA PRI 27-JUL-1990
DEFINITION Human MHC class II HLA-DQ-alpha gene, regulatory region.
ACCESSION M33765
KEYWORDS cell surface antigen; cell surface glycoprotein; class II gene;
major histocompatibility complex.
SOURCE Human (DR4, DR4) DNA, clone H11A.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1515)
AUTHORS Ombra,M.N., Del Pozzo,G., Perfetto,C., Maffe,A. and Guardiola,J.
TITLE Effect of the AIR-I locus on the activation of an enhancerless
HLA-DQA1 promoter
JOURNAL Immunogenetics 31, 368-376 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Guardiola, 18-APR-1990.
FEATURES from to/span description
pre-msg 655 > 1515 HLA-DQA1 mRNA and introns
/hgml_locus_uid="LY0151Z" /nomgen="HLA-DQA1"
/map="6p21.3"
IVS 785 > 1515 HLA-DQA1 intron A
BASE COUNT 441 a 322 c 340 g 412 t
ORIGIN Chromosome 6p21.3.
1 atataactga tgacctcatc tatccatgag gctgctcacg aaatgccctc ccctggccag
61 gcacggtggc tcatgcctgt aatccaagca ctttgggagg ctgaggcagg cagatcatga
121 ggtcaggagt tcaagaccag cgtggccaac atagtgaaac tctgtctcta ctaaaaatac
181 agaattagcc gggcatggtg gcagcgctgc agtcaccagc tactcgggag ggtgaggcag
241 gagaatcgct tgaacctggg aggcaaaggt tgcagtgagc cgagaccatg ccattgtacc
301 tcagcctggg tgacagagtg agactacgtc tcgaaaagaa aagaaaaaga aaagaaaaag
361 aaaaaaaaag aaaaatgccc atccctcttg cgaatggcag acatgcacac accagagaag
421 attccaattt agtgtcctcc ctctgttcat agaacaattc ctcaagtcca ctctgagtag
481 aggctgcatc acaacaaggg gattgccctg tctccttcca gggctcttaa tacaaactct
541 tcagctagta actgagatgt caccatgggg gatttttcta attggccaaa acctgacctg
601 gcagggtttg gtttgggtgt cttcagattt ccttgtcttg aggccctcac aattgctcta
661 cagctcagaa cagcaactgc tgaggctgcc ttgggaagag gatgatccta aacaaagctc
721 tggatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg tgaagacatt
781 gtgggtgagt gcatgagtga ggaaatgttc tctggagctg aaaaacagta aattgaagga
841 aaagagagaa agcgatttgc agagaaattg tagagatttc ctaagaccct ttcactatta
901 agagaaatta aaaattatag ctgttcctcc ttcaggaaac cagagaccca atccactctt
961 tttgttatgt atgcttttgt gttcactaag gatgctattc tgtttatatt atattcagtg
1021 acaacagcct ggaggtctct atgtcgttcc gtcatgattg cctcaaaaat tagtgaagtt
1081 tccatcagtg gaataatttt ttattattaa aaatgtatga agtgtcattc tcaaatttcc
1141 ctgaacaact tttgaagctt ttcgtatgtc tcctgtagta gatcttgggg tcgttccatc
1201 aattatatac tctatagata ttaaaaaagt tgcccgtttc tttctctcag acttactcac
1261 atttccacat gggaactggc acaggtgggg agtgggtaaa ggagtccagc aggctgaatg
1321 ccttcaacaa tcattttacc acatggtcct cacttactct cagctgcctc atatgtgtca
1381 cctcacaaaa taatcaaaat aaaaatgggc atgtagctaa gctttgtaaa tagtgaaaac
1441 atggaatgtc aattgttttt acatatttct attacaggta tagcttcaca tttttcttta
1501 gcaaaataag ggatc
//
LOCUS BSUAMYEA 727 bp ds-DNA BCT 27-JUL-1990
DEFINITION B.subtilis alpha-amylase (amyE) gene, 5' end.
ACCESSION M35517
KEYWORDS alpha-amylase.
SOURCE B.subtilis (strain 2633) DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 727)
AUTHORS Emori,M., Tojo,T. and Maruo,B.
TITLE Molecular cloning and expression of an alpha-amylase gene from an
alpha-amylase extrahyper producing Bacullis subtilis
JOURNAL Agric. Biol. Chem. 52, 399-406 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 599 > 727 alpha-amylase structural protein (amyE)
BASE COUNT 207 a 125 c 179 g 216 t
ORIGIN
1 gatcatccgc ggcattatgt ttgaatttcc gtttaaagaa tgggctgcaa gcttgttcat
61 cattatctta tattactgca tcagggctgc ggcatccgga atgctcatgc cgagaataga
121 caccaaagaa gaactgcaaa aacgggtgaa gcagcagcga atagaatcag ttgcggtcgc
181 ctttgcggta gtggtgctta cgatgtacga cagggggatt ccccatacat tcttcgcttg
241 gctgaaaatg attcttcttt ttatcgtctg cggcggcgtt ctgtttctgc ttcggtatgt
301 gattgtgaag ctggcttaca gaagagcggt aaaagaagaa ataaaaaaga aatcatcttt
361 tttgtttgga aagcgaggga agcgtacaca gtctcgggca gtttttttat aggaacattg
421 atttgtattc actctgccaa gttgttttga tagagtgatt gtgataattt aaaatgtaag
481 tgttaacaaa attctccagt cttcacatca gtttgaaagg aggaagcgga agaatgaagt
541 aagagggatt tttgactccg aagtaagtct tcaaaaaatc aaataaggag tgtcaagaat
601 gtttgcaaaa cgattcaaaa cctctttact gccgttattc gctggatttt tattgctgtt
661 ttatttggtt ctggcaggac cggcggctgc gagtgctgaa acggcgaaca aatcgaatga
721 gcttaca
//
LOCUS CHKCKMB 274 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken b-creatine kinase gene, 3' end.
ACCESSION M35381
KEYWORDS B-creatine kinase.
SOURCE Chicken brain DNA, clone CK-B.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 274)
AUTHORS Kwiatkowski,R.W., Ehrismann,R., Schweinfest,C.W. and Dottin,R.P.
TITLE Accumulation of creatine kinase mRNA during myogenesis: Molecular
cloning of a b-creatine kinase cDNA
JOURNAL Dev. Biol. 112, 84-88 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 51 M-creatine kinase (AA at 1)
97 243 M-creatine kinase
BASE COUNT 60 a 81 c 75 g 58 t
ORIGIN
1 ctgcagaaac gaggcacagg tggtgtggac acagctgcag ccgaagcctc ttgaggagcc
61 cccccccccc cccccccccc tgcagtcgga ggggtgtttg atgtatccaa tgctgatcgt
121 cttggcttct ctgaggtaga gctggtgcag atggtggtag atggtgtgaa gctgctcatt
181 gaaatggaaa aacgccttga gaaaggccag tccattgatg acctcatgcc agctcagaaa
241 taaagcactt tattctcccc ccgcccccct gcag
//
LOCUS CHKCMYBA1 172 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken c-myb gene, exon 1.
ACCESSION M35506
KEYWORDS c-myb protein.
SEGMENT 1 of 4
SOURCE Chicken hematopoietic cell DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 172)
AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B.
TITLE Organization of 5'-proximal c-myb exons in chicken DNA:
Implications for c-myb tissue-specific transcription
JOURNAL FEBS Lett. 232, 227-234 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 140 + 162 thymic c-myb protein, exon 1
IVS 163 > 172 thymic c-myb protein intron A
BASE COUNT 16 a 73 c 62 g 21 t
ORIGIN
1 aacctgtttc ctcctcctcc tcccgcggct cctcggcact tcgctgcgcg gcggcggagc
61 gcctgggcgc agcccggcag cagggcgcgg cggcgggcgc gggctccccc gtcctcttcc
121 cccagcggcc gccgcgagga tggcccggag accccggcac aggtaacggg gc
//
LOCUS CHKCMYBA2 290 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken c-myb gene, exon 2.
ACCESSION M35507
KEYWORDS c-myb protein.
SEGMENT 2 of 4
SOURCE Chicken hematopoietic cell DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 290)
AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B.
TITLE Organization of 5'-proximal c-myb exons in chicken DNA:
Implications for c-myb tissue-specific transcription
JOURNAL FEBS Lett. 232, 227-234 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 163 + 280 thymic c-myb protein, exon 2
pep$ 200 + 280 yolk sac c-myb protein, exon 1
IVS < 1 162 thymic c-myb protein intron A
IVS 281 > 290 thymic c-myb protein intron B
IVS 281 > 290 yolk sac c-myb protein intron A
BASE COUNT 78 a 59 c 71 g 82 t
ORIGIN
1 aatactagaa caacagaatg cagaacaaaa tcttgttgtg caagttttaa aacttttatc
61 tgtgtaacct ttgtaaagat tgtgtgtttt ctgtgtgctg tgtaattaag tagctccagt
121 cctcacgttc cgctctgtgt ctcgctgtgt ttcctgctgc agcatataca gcagcgatga
181 cgatgaagaa gatgttgaga tgtacgacca cgattacgac ggcctgcttc ctaaggctgg
241 gaaacgtcac ctagggaaaa ccaggtggac ccgtgaagag gtaattgccg
//
LOCUS CHKCMYBA3 92 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken c-myb gene, exon 3.
ACCESSION M35508
KEYWORDS c-myb protein.
SEGMENT 3 of 4
SOURCE Chicken hematopoietic cell DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 92)
AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B.
TITLE Organization of 5'-proximal c-myb exons in chicken DNA:
Implications for c-myb tissue-specific transcription
JOURNAL FEBS Lett. 232, 227-234 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 82 thymic c-myb protein, exon 3
pep$ + 11 + 82 yolk sac c-myb protein, exon 2
IVS < 1 10 thymic c-myb protein intron B
IVS 83 > 92 thymic c-myb protein intron C
IVS < 1 10 yolk sac c-myb protein intron A
IVS 83 > 92 yolk sac c-myb protein intron B
BASE COUNT 32 a 16 c 24 g 20 t
ORIGIN
1 aatgatccag gatgagaaac tgaagaaact tgtggaacag aatggcacag aagactggaa
61 agtcattgcc agtttccttc ctgtaggtag ac
//
LOCUS CHKCMYBA4 34 bp ds-DNA VRT 27-JUL-1990
DEFINITION Chicken c-myb gene, exon 4.
ACCESSION M35509
KEYWORDS c-myb protein.
SEGMENT 4 of 4
SOURCE Chicken hematopoietic cell DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 34)
AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B.
TITLE Organization of 5'-proximal c-myb exons in chicken DNA:
Implications for c-myb tissue-specific transcription
JOURNAL FEBS Lett. 232, 227-234 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 > 34 thymic c-myb protein, exon 4
pep$ + 11 > 34 yolk sac c-myb protein, exon 3
IVS < 1 10 thymic c-myb protein intron C
IVS < 1 10 yolk sac c-myb protein intron B
BASE COUNT 10 a 6 c 8 g 10 t
ORIGIN
1 atttcattag aatcggacag atgttcagtg ccag
//
LOCUS MUSFOS 3673 bp ds-DNA ROD 27-JUL-1990
DEFINITION Mouse c-fos gene; cellular homolog to viral oncogene.
ACCESSION J00370
KEYWORDS fos oncogene; oncogene.
SOURCE Mouse DNA, clones pc-fos(mouse)-1 and lambda-c-fos(mouse)-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3548)
AUTHORS Van Beveren,C., van Straaten,F., Curran,T., Mueller,R. and
Verma,I.M.
TITLE Analysis of FBJ-MuSV provirus and c-fos (mouse) gene reveals that
viral and cellular fos gene products have different carboxy termini
JOURNAL Cell 32, 1241-1255 (1983)
STANDARD full staff_review
REFERENCE 2 (bases 2799 to 3673)
AUTHORS Meijlink,F., Curran,T., Miller,A.D. and Verma,I.M.
TITLE Removal of a 67-base-pair sequence in the noncoding region of
protooncogene fos converts it to a transforming gene
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 4987-4991 (1985)
STANDARD simple staff_review
COMMENT A TATA box was found at bp 101-106 and a poly-A signal at
3493-3498. [1] also reported the proviral FBJ-MuSV sequence, which
is found in the viral section of the data base.
FEATURES from to/span description
pept 284 424 c-fos protein exon 1
1179 1430 c-fos protein exon 2
1836 1943 c-fos protein exon 3
2061 2702 c-fos protein exon 4
pre-msg 133 > 2702 c-fos protein mRNA
IVS 425 1178 c-fos protein intron A
IVS 1431 1835 c-fos protein intron B
IVS 1944 2060 c-fos protein intron C
variant 2856 2856 g in [1]; c in [2]
variant 3135 3136 gt in [1]; gtt in [2]
BASE COUNT 817 a 987 c 974 g 895 t
ORIGIN 997 bp upstream of EcoRI site.
1 gagttgacga cagagcgccc gcagagggcc ttggggcgcg cttccccccc cttccagttc
61 cgcccagtga cgtaggaagt ccatccattc acagcgcttc tataaaggcg ccagctgagg
121 cgcctactac tccaaccgcg actgcagcga gcaactgaga agactggata gagccggcgg
181 ttccgcgaac gagcagtgac cgcgctccca cccagctctg ctctgcagct cccaccagtg
241 tctacccctg gaccccttgc cgggctttcc ccaaacttcg accatgatgt tctcgggttt
301 caacgccgac tacgaggcgt catcctcccg ctgcagtagc gcctccccgg ccggggacag
361 cctttcctac taccattccc cagccgactc cttctccagc atgggctctc ctgtcaacac
421 acaggtgagt ttggctttgt gtagccgcca ggtccgcgct gagggtcgcc gtggaggaga
481 cactggggtg tgactcgcag gggcgggggg gtcttccttt ttcgctctgg agggagactg
541 gcgcggtcag agcagcctta gcctgggaac ccaggacttg tctgagcgcg tgcacacttg
601 tcatagtaag acttagtgac cccttcccgc gcggcaggtt tattctgagt ggcctgcctg
661 cattcttctc tcggccgact tgtttctgag atcagccggg gccaacaagt ctcgagcaaa
721 gagtcgctaa ctagagtttg ggaggcggca aaccgcggca atcccccctc ccggggcagc
781 ctggagcagg gaggagggag gagggaggag ggtgctgcgg gcgggtgtgt aaggcagttt
841 cattgataaa aagcgagttc attctggaga ctccggagca gcgcctgcgt cagcgcagac
901 gtcagggata tttataacaa accccctttc gagcgagtga tgccgaaggg ataacgggaa
961 cgcagcagta ggatggagga gaaaggctgc gctgcggaat tcaagggagg atattgggag
1021 agcttttatc tccgatgagg tgcatacagg aagacataag cagtctctga ccggaatgct
1081 tctctctccc tgcttcatgc gacactaggg ccacttgctc cacctgtgtc tggaacctcc
1141 tcgctcacct ccgctttcct ctttttgttt tgtttcagga cttttgcgca gatctgtccg
1201 tctctagtgc caactttatc cccacggtga cagccatctc caccagccca gacctgcagt
1261 ggctggtgca gcccactctg gtctcctccg tggccccatc gcagaccaga gcgccccatc
1321 cttacggact ccccacccag tctgctgggg cttacgccag agcgggaatg gtgaagaccg
1381 tgtcaggagg cagagcgcag agcatcggca gaaggggcaa agtagagcag gtgagcagcg
1441 attctggacc tttgtgggct gggggggggg gggggggcgg agactgacgc acagaccaca
1501 caacagagaa gggacgctac tgactgcact tcctgaccag gagctgtggc tgctagccct
1561 ttccctccct tgtcagattt tgacagttgg acccaagaca aactctagac agtttccctg
1621 acagcttcct acttcattct ctagccgggg agcttctttg ttcccctgct aaagatctca
1681 ctttaaatgc aaatcacact ctgcctgcca actgcaggtt agaaaaactg cttcaccgag
1741 aggtgcgggt gctgtaggag ccagtttcac tggggtgact gaatggaggt gacactagac
1801 aaccttaact gaatgttggt ccttttcttc tatagctatc tcctgaagag gaagagaaac
1861 ggagaatccg aagggaacgg aataagatgg ctgcagccaa gtgccggaat cggaggaggg
1921 agctgacaga tacactccaa gcggtaggtt gaaccagctg ctgctcctga aactttatta
1981 aagttggagc ttgggactat gggcgcaggg tccttgagca tgcccgtgtc ttatgctttc
2041 ttatatctct ccctatgcag gagacagatc aacttgaaga tgagaagtct gcgttgcaga
2101 ctgagattgc caatctgctg aaagagaagg aaaaactgga gtttattttg gcagcccacc
2161 gacctgcctg caagatcccc gatgaccttg gcttcccaga ggagatgtct gtggcctccc
2221 tggatttgac tggaggtctg cctgaggctt ccaccccaga gtctgaggag gccttcaccc
2281 tgccccttct caacgaccct gagcccaagc catccttgga gccagtcaag agcatcagca
2341 acgtggagct gaaggcagaa ccctttgatg acttcttgtt tccggcatca tctaggccca
2401 gtggctcaga gacctcccgc tctgtgccag atgtggacct gtccggttcc ttctatgcag
2461 cagactggga gcctctgcac agcaattcct tggggatggg gcccatggtc acagagctgg
2521 agcccctgtg tactcccgtg gtcacctgta ctccgggctg cactacttac acgtcttcct
2581 ttgtcttcac ctaccctgaa gctgactcct tcccaagctg tgccgctgcc caccgaaagg
2641 gcagcagcag caacgagccc tcctccgact ccctgagctc acccacgctg ctggccctgt
2701 gagcagtcag agaaggcaag gcagccggca tccagacgtg ccactgcccg agctggtgca
2761 ttacagagag gagaaacacg tcttccctcg aaggttcccg tcgacctagg gaggacctta
2821 cctgttcgtg aaacacacca ggctgtgggc ctcaaggact tgcaagcatc cacatctggc
2881 ctccagtcct cacctcttcc agagatgtag caaaaacaaa acaaaacaaa acaaaaaacc
2941 gcatggagtg tgttgttcct agtgacacct gagagctggt agttagtaga gcatgtgagt
3001 caaggcctgg tctgtgtctc ttttctcttt ctccttagtt ttctcatagc actaactaat
3061 ctgttgggtt cattattgga attaacctgg tgctggattg tatctagtgc agctgatttt
3121 aacaatacct actgtgttcc tggcaatagc gtgttccaat tagaaacgac caatattaaa
3181 ctaagaaaag ataggacttt attttccagt agatagaaat caatagctat atccatgtac
3241 tgtagtcctt cagcgtcaat gttcattgtc atgttactga tcatgcattg tcgaggtggt
3301 ctgaatgttc tgacattaac agttttccat gaaaacgttt ttattgtgtt ttcaatttat
3361 ttattaagat ggattctcag atatttatat ttttatttta tttttttcta ccctgaggtc
3421 tttcgacatg tggaaagtga atttgaatga aaaattttaa gcattgtttg cttattgttc
3481 caggacattg tcaataaaag catttaagtt gaatgcgacc accttcttgc tctctttatt
3541 ctcagttttg tatggtttca ggaaggcctc tgaggagacc agtttgtcaa gatgggtggg
3601 tcctggaggg gaacacgccc tctgtcccct tgtcactcag aggacacgta gttcagggta
3661 ttttacagat gtg
//
LOCUS MUSIGHACX 312 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW.
ACCESSION M34880
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone C10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1)
recomb 303 304 V-region end/J-region start
BASE COUNT 83 a 77 c 72 g 80 t
ORIGIN Chromosome 6.
1 gacattgtga tgacacagtc tccatcctcc ctggctatgt cagtaggaca gaaggtcact
61 atgaggtgca agtccagtca gagcctttta aatagtagca gtcaaaagaa ctatttggcc
121 tggtaccagc agaaaccagg acagtctcct aaacttctgg tatactttgc atccactagg
181 gaatctgggg tccctgatcg cttcataggc agtggatctg ggacagattt cactcttacc
241 atcagcagtg tgcaggctga agacctggca gattacttct gtcagcaaca ttatagcact
301 ccgtacacgt tc
//
LOCUS MUSIGHACY 312 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW.
ACCESSION M34883
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1)
recomb 303 304 V-region end/J-region start
BASE COUNT 80 a 78 c 68 g 83 t 3 others
ORIGIN Chromosome 6.
1 gatgttttga tgacccaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc
61 atctcttgca gatctagtca gaacattgta catactnnna atggaaacac ctatttagaa
121 tggtacctgc agaaaccagg ccagtctcca aagctcctga tctacaaagt ttccaaccga
181 ttttctgggg tcccagacag gttcagtggc agtggatcag ggacagattt cacactcaag
241 atcagcagag tggaggctga ggatctggga gtttattact gctttcaagg ttcacatgtt
301 cctccgacgt tc
//
LOCUS MUSIGHACZ 312 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW.
ACCESSION M34884
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM4.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1)
recomb 303 304 V-region end/J-region start
BASE COUNT 78 a 76 c 72 g 83 t 3 others
ORIGIN Chromosome 6.
1 gatgttttga tgacccaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc
61 atctcttgca gattcagtca gagcattgta catactnnna atggaaacac ctatttagaa
121 tggtacctgc agaaaccagg ccagtctcca aagctcctga tctacagagt ttccaaccga
181 ttttctgggg tcccagacag gttcagtggc agtggatcag ggacagattt cacactcaag
241 atcagcagag tggaggctga ggatctggga gtttattact gctttcaagg ttcacatgtt
301 ccgtggacgt tc
//
LOCUS MUSIGHADA 312 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW.
ACCESSION M34885
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CR1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1)
recomb 303 304 V-region end/J-region start
BASE COUNT 69 a 75 c 78 g 87 t 3 others
ORIGIN Chromosome 6.
1 gatattgtga tgactcaggc tgcaccctct gtacctgtca ctcctggaga gtcagtatcc
61 atctcctgca ggtctagtaa gagtctcctg catagtnnna atggcaacac ttacttgtat
121 tggttcctgc agaggccagg ccagtctcct cagctcctga tatatcggat gtccaacctt
181 gcctcaggcg tcccagacag gttcagtggc agtgggtcag gaactgcttt cacactgaga
241 atcagtagag tggaggctga ggatgtgggt gtttattact gtatgcaaca tctagaatat
301 cctttgacgt tc
//
LOCUS MUSIGHADB 312 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma A/J.
ACCESSION M34886
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse hybridoma A/J, cDNA to mRNA, clone AF1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1)
recomb 303 304 V-region end/J-region start
BASE COUNT 86 a 73 c 62 g 73 t 18 others
ORIGIN Chromosome 6.
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaagtca gnnnnnnnnn nnnnnnnnng acattagcaa ttatttaaac
121 tggtatcagc agaaaccaga tggaactgtt aaactcctga tctactacac atcaagatta
181 cactcaggag tcccatcaag gttcagtggc agtgggtctg gaacagatta ttctctcacc
241 attagcaacc tggagcaaga agatattgcc acttactttt gccaacaggg taatacgctt
301 ccgtggacgt tc
//
LOCUS MUSIGHADC 312 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma A/J.
ACCESSION M34887
KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse hybridoma A/J, cDNA to mRNA, clone AF2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1)
recomb 303 304 V-region end/J-region start
BASE COUNT 87 a 79 c 70 g 76 t
ORIGIN Chromosome 6.
1 gacattgtga tgacacagtc tccatcctcc ctgactgtca cagcaggaga gaaggtcact
61 atgagttgca agtccagtca gagtctgtta aacagtggaa atcaaaacaa ctacttgacc
121 tggtaccagc agaaaccagg gcagcctcct aaactgttaa tctactgggc atccactaga
181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatcgttat
301 ccgctcacat tc
//
LOCUS MUSIGHADG 327 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW.
ACCESSION M34888
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone C10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 327)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 327 Ig H-chain V-D-J-region (AA at 1)
recomb 294 295 V-region end/D-region start
recomb 297 298 D-region end/J-region start
BASE COUNT 87 a 78 c 87 g 75 t 0 others
ORIGIN Chromosome 12.
1 gaggtccagc tgcaacagtc tggacctgag ctggtgaagc ctggggattc agtgaagatg
61 tcctgcaagg cttctggcta cacattcact gactcctaca tggactgggt gaagcagagc
121 cctggaaaga gccttgagtg gattggatat atttatccta acaatggtgg tactaactac
181 aatcagaagt tcaagggcaa ggccacattg actgtagaca agtcctccag cacagcctac
241 atggaactcc acagcctgac atctgaggac tctgcagtct attactgtgc aagatggact
301 tactggggcc aagggactct ggtcact
//
LOCUS MUSIGHADH 345 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW.
ACCESSION M34889
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 345)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 345 Ig H-chain V-D-J-region (AA at 1)
recomb 293 294 V-region end/D-region start
recomb 309 310 D-region end/J-region start
BASE COUNT 81 a 93 c 86 g 85 t 0 others
ORIGIN Chromosome 12.
1 cagcctgggg ctgagcttgt gaagcctggg acttctgtga agttgtcctg caggccttct
61 ggcttcacct acaccgactt ctggattcac tgggtgaaac agaggcctgg acaaggcctt
121 gagtggattg gagagattaa tcctagcaat ggtggtacta gctacaatga gaagttcaag
181 accaaggcca cactgactgc agacaaatcc tccagcacag tccttacatg caactcaacg
241 agcctgacat ctgacgactc tgcggtctat ttctgtacaa gaccccctct ctcctatggt
301 aacttcgcct ggtttactta ctggggccaa gggactctgg tcact
//
LOCUS MUSIGHADI 264 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW.
ACCESSION M34890
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM4.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 264 Ig H-chain V-D-J-region (AA at 1)
recomb 223 224 V-region end/D-region start
recomb 230 231 D-region end/J-region start
BASE COUNT 70 a 68 c 67 g 59 t 0 others
ORIGIN Chromosome 12.
1 ttcactacct actggatgca ctgggtgaag cagaggcctg gacaaggcct tgagtggatc
61 ggagagattg atccttctga tacttatact taccacaatc aaaacttcaa gggcaaggcc
121 acattgactg tagacaaatc ctccagcaca gcctacatgc aactcagcag cctgacatct
181 gaggactctg cggtctatta ctgtgcaaga aggtcgatgg gccactacgg gcttgcatac
241 tggggccaag ggactctggt cact
//
LOCUS MUSIGHADJ 329 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW.
ACCESSION M34891
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CR1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 329)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 329 Ig H-chain V-D-J-region (AA at 1)
recomb 273 274 V-region end/D-region start
recomb 294 295 D-region end/J-region start
BASE COUNT 80 a 84 c 82 g 83 t
ORIGIN Chromosome 12.
1 ttagtgaagc ttggagggtc cctgcaactc tcctgtgcag cctctggatt cactttcagt
61 agctattaca tgtcttgggt tcgccagact cctgagaaga ggctggagtg ggtcgcaacc
121 attagtaatg gtggtgttag gcacctacta tccagacagt gtgaaggacc gcttcacatc
181 tccagagaca atgccaggaa caccctctac ctgcgaatga gcagtctgaa ttctgaggac
241 acagccgtgt attactgtgc aagagatgga tcttatggta actaccacgt accttacttt
301 gactactggg gccaaggcac cactctcac
//
LOCUS MUSIGHADK 271 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma A/J.
ACCESSION M34892
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone AF1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 271)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 271 Ig H-chain V-D-J-region (AA at 2)
recomb 226 227 V-region end/D-region start
recomb 238 239 D-region end/J-region start
BASE COUNT 69 a 63 c 73 g 66 t 0 others
ORIGIN Chromosome 12.
1 attcactttc agtagctatg ccatgtcttg ggttcgccag actccagaga agaggctgga
61 gtgggtcgca tccattaata gtggtggtag tacctactat ccagacagtg tgaagggccg
121 attcaccatc tccagagata atggcaggaa caccctgtac ctgcaaatga gcagtctgag
181 gtctgaggac acggccatgt attattgtgg aagaggcagt tacccctatt actatggtct
241 ggactactgg ggtcaaggaa cctcagtcac c
//
LOCUS MUSIGHADL 348 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma A/J.
ACCESSION M34893
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone AF2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Andria,M.L., Levy,S. and Benjamini,E.
TITLE Diverse V-H and V-L genes are used to produce antibodies against a
defined protein epitope
JOURNAL J. Immunol. 144, 2614-2619 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 348 Ig H-chain V-D-J-region (AA at 1)
recomb 303 304 V-region end/D-region start
recomb 315 316 D-region end/J-region start
BASE COUNT 86 a 80 c 99 g 83 t 0 others
ORIGIN Chromosome 12.
1 gaagtgaagc tggtggagtc tgggggaggc ttagtgacgc ctggagggtc cctgaaactc
61 tcctgtgcag cctctggatt cactttcagt agctatgcca tgtcttgggt tcgccagact
121 ccagagaaga ggctggaatg ggtcgcatcc attaataatg gtggtaatac ctactatcca
181 gacagtgtga agggccgatt caccatctcc agagataatg gcaggaacac cctgtacctg
241 caaatgagca gtctgaggtc tgaggacacg gccatgtatt actgtggaag aggcagttat
301 ccctattact atggtctaga ctactggggt caaggaacct cagtcacc
//
LOCUS NEUMTNDCO 839 bp ds-DNA ORG 27-JUL-1990
DEFINITION N.crassa mitochondrial pseudo-NADH dehydrogenase (ND5) gene, 3'
end, and cytochrome oxidase subunit 2 (co2) gene, 5' end.
ACCESSION M35513
KEYWORDS NADH dehydrogenase; cytochrome oxidase subunit 2; pseudogene.
SOURCE N.crassa (strain RL3-8A) stopper mutant ER-3 mitochondrial DNA.
ORGANISM Mitochondrion Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa.
REFERENCE 1 (bases 1 to 839)
AUTHORS Almasan,A. and Mishra,N.C.
TITLE Molecular characterization of the mitochondrial DNA of a new
stopper mutant ER-3 of Neurospora crassa
JOURNAL Genetics 120, 935-945 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps > 1 270 pseudo-NADH dehydrogenase (ND5; AA at 1)
pept 645 > 839 cytochrome oxidase subunit 2 (co2)
BASE COUNT 313 a 124 c 119 g 283 t
ORIGIN
1 tgtatgaaat atttagctga aatattagga ggtaaaaaac atttattaaa aagttacgaa
61 ggttacaatg taacagttaa taccactaaa ttatctccta ttgtacaata ttttaatctt
121 tatcctttaa aaactaaaaa gtatattaca tattttaact gaataaaaat atataagtta
181 gtaatagata aaaaacataa tgatcctgaa aatttattat tgattacgaa atataaaaat
241 aacattaata aatctgatta taataaataa atcaactttc ggcggacccc gccccttcgg
301 ggcgggggcc cgtttttagt aataactaaa aattaaacac agctacataa atgaagatat
361 agtccgatca gttaagtaat taactgcgta ttccacccct cccttccccc gccccgaagg
421 cgcccccgaa aaaacaaagg acgctagcgt agctaacctt atgagcagca ggctataaaa
481 acccattagg cacagggtat acttcgttag tggataagaa ttagcctaag ccttaatata
541 caatacagtt taatgttata tttaaattct atatttatga aaagatataa tatatttatt
601 attatggttt aatattcaat atagaatatt ctatttcttt ttttatggga ttattattta
661 ataatttaat tatgaatttt gatgctccaa gcccatgagg tatctatttt caagatagtg
721 ctactcctca gatggaaggt ttagttgaat tacatgacaa tattatgtac tatctggtgg
781 taatactatt tgttgtagga tgaatcttat tatctataat aagaaattat atcagtaca
//
LOCUS PUMCHSA1 90 bp ds-DNA PLN 27-JUL-1990
DEFINITION P.crispum chalcone synthase gene, 5' flank.
ACCESSION M35515
KEYWORDS chalcone synthasse.
SEGMENT 1 of 2
SOURCE P.crispum (strain Mooskrause) DNA, clone PcCHS-1.
ORGANISM Petroselinum crispum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Apiales; Apiaceae.
REFERENCE 1 (bases 1 to 90)
AUTHORS Herrmann,A., Schulz,W. and Hahlbrock,K.
TITLE Two alleles of the single-copy chalcone synthase gene in parsley
differ by a transposon-like element
JOURNAL Mol. Gen. Genet. 212, 93-98 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rpt 36 43 5' insertion target sequence
rpt 44 58 5' inverted terminal repeat
trns 44 > 90 transposon-like sequence
BASE COUNT 29 a 14 c 23 g 24 t
ORIGIN
1 tgaaaatgca tgttacaaac gttccctaat aatatagtag aagtagggtg taaatgagtc
61 gatacgctca cgaacagttc ggtgttcggg
//
LOCUS PUMCHSA2 732 bp ds-DNA PLN 27-JUL-1990
DEFINITION P.crispum chalcone synthase gene, exon 1.
ACCESSION M35516
KEYWORDS chalcone synthasse.
SEGMENT 2 of 2
SOURCE P.crispum (strain Mooskrause) DNA, clone PcCHS-1.
ORGANISM Petroselinum crispum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Apiales; Apiaceae.
REFERENCE 1 (bases 1 to 732)
AUTHORS Herrmann,A., Schulz,W. and Hahlbrock,K.
TITLE Two alleles of the single-copy chalcone synthase gene in parsley
differ by a transposon-like element
JOURNAL Mol. Gen. Genet. 212, 93-98 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 725 > 732 chalcone synthase, exon 1
mRNA 591 > 732 chalcone synthase mRNA (alt.)
mRNA 639 > 732 chalcone synthase mRNA (alt.)
trns < 1 53 transposon-like sequence
rpt 38 53 3' inverted terminal repeat
rpt 54 61 3' insertion target sequence
BASE COUNT 245 a 139 c 126 g 222 t
ORIGIN About 810 bp after segment 1.
1 ctgaaacgtg aacacttaag aattcggctc ggttcggctc atttacagcc ctaagtagaa
61 gtgatccata gcattttata ataactaaca ttacatatat attacccttt ttctacattc
121 gtgtagagag aaaatgactc aataattcaa ttagtattat agaatttctt gttggtaact
181 cttaatttta tttattaata gaataacgga aacgaaacga gatgggattg gtagatggga
241 aacagaaata ggcacgtcac ggggtgtatg tatttattaa aaacaatggc atataggtaa
301 gttgaggata acaattaaat cgtggtttca ttacaatagt tctgaaagaa gaggtagaat
361 cgattgtcct tcacttgatg tatctactca ctttggatgt acgtggaggc ctaaatcccc
421 accatggttg caagtagcag ttacatccac ataactcctc ccttattcca cgtggccatc
481 cggtggccgt ccctccaacc taacctccct tgaatttcta tcagtaaatt ttcaaccctc
541 tctcttctct gtcgtcacta tataatttcc ctacctacat ctctcttgat atctctttct
601 ctctataaat atttaacaag gagaaaggaa aaataacaac acaaatcaga cagatcagat
661 catcccttta cataaagata aggggtaagg gggggtttat agagagattt ataaaaaaaa
721 aaaaatggca aa
//
LOCUS R11ORIA 589 bp ds-DNA BCT 27-JUL-1990
DEFINITION Plasmid R1162 origin of replication region DNA.
ACCESSION M35512
KEYWORDS origin of replication.
SOURCE Plasmid R1162 DNA.
ORGANISM Plasmid R1162
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 589)
AUTHORS Meyer,R.J., Lin,L.-S., Kim,K. and Brasch,M.A.
TITLE Broad host-range plasmid R1162: Replication, incompatibility, and
copy-number control
JOURNAL Basic Life Sci. 30, 173-188 (1985)
STANDARD simple staff_review
BASE COUNT 114 a 156 c 199 g 120 t
ORIGIN
1 ccgggctgaa tgatcgaccg agacaggccc tgcggggctg cacacgcgcc cccacccttc
61 gggtaggggg aaaggccgct aaagcggcta aaagcgctcc agcgtatttc tgcggggttt
121 ggtgtggggt ttagcgggct ttgcccgcct ttccccctgc cgcgcagcgg tggggcggtg
181 tgtagcctag cgcagcgaat agaccagcta tccggcctct gggcatattg ggcagggcag
241 cagcgcccca cagggcgtga ctaaccgcgc ctagtggatt attcttagat aatcatggat
301 ggatttttcc aacaccccgc cagcccccgc ccctgctggg tttgcaggtt tgggggcgtg
361 acagttattg caggggttcg tgacagttat tgcagggggg cgtgacagtt attgcagggg
421 ttcgtgacag ttagtacggg atgacgggca ctggctggca atgtctagca acggcaggca
481 tgtcggctga cggtaaaaca actttccgct aagcgataga ctgtatgtga aacacagtat
541 tgcaaggacg cggaacatgc ctcatgtggc ggccaggacg gccagccgg
//
LOCUS STRSPEC 936 bp ds-DNA BCT 27-JUL-1990
DEFINITION S.pyogenes streptococcal pyrogenic exotoxin type C (speC) gene,
complete cds.
ACCESSION M35514
KEYWORDS pyrogenic exotoxin.
SOURCE S.pyogenes DNA.
ORGANISM Streptococcus pyogenes
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Streptococcaceae.
REFERENCE 1 (bases 1 to 936)
AUTHORS Goshorn,S.C. and Schlievert,P.M.
TITLE Nucleotide sequence of Streptococcal pyrogenic exotoxin type C
JOURNAL Infect. Immun. 56, 2518-2520 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 154 861 streptococcal pyrogenic exotoxin type C (speC)
precursor
sigp 154 234 streptococcal pyrogenic exotoxin type C signal
peptide (put.)
matp 235 858 streptococcal pyrogenic exotoxin type C
BASE COUNT 364 a 144 c 127 g 301 t
ORIGIN
1 caaccttgac tatttaaatg gaactgccac tcctaaaaac taaaatataa atacatttat
61 aaaatttcta aataaacaga aatctgattt ttaactactt actgctattt catgtattct
121 cgtacgagta atacatttaa ttaaggagaa aaaatgaaaa agattaacat catcaaaata
181 gttttcataa ttacagtcat actgatttct acttatttca cctatcatca aagtgactct
241 aagaaagaca tttcgaatgt taaaagtgat ttactttatg catacactat aactccttat
301 gattataaag attgcagggt aaatttttca acgacacaca cattaaacat tgatactcaa
361 aaatatagag ggaaagacta ttatattagt tccgaaatgt cttatgaggc ctctcaaaaa
421 tttaaacgag atgatcatgt agatgttttt ggattatttt atattcttaa ttctcacacc
481 ggtgagtaca tctatggagg aattacgcct gctcaaaata ataaagtaaa tcataaatta
541 ttgggaaatc tatttatttc gggagaatct caacagaact taaataacaa gattattcta
601 gaaaaggata tcgtaacttt ccaggaaatt gactttaaaa tcagaaaata ccttatggat
661 aattataaaa tttatgacgc tacttctcct tatgtaagcg gcagaatcga aattggcaca
721 aaagatggga aacatgagca aatagactta tttgactcac caaatgaagg gactagatca
781 gatatttttg caaaatataa agataataga attatcaata tgaagaactt tagtcatttc
841 gatatttatc ttgaaaaata attcatcata cacaaaaaac cgcccagaat aatctgagcg
901 gttttgtctt atctcggagc tttacctcct aattta
//
LOCUS HUMTROPI 816 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human slow-twitch skeletal troponin I (TNN1) mRNA, complete cds.
ACCESSION J04760
KEYWORDS slow-twitch skeletal troponin I; troponin I.
SOURCE Human adult slow-twitch skeletal muscle, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 816)
AUTHORS Wade,R., Eddy,R., Shows,T.B. and Kedes,L.
TITLE cDNA sequence, tissue-specific expression, and chromosomal mapping
of the human slow-twitch skeletal muscle isoform of troponin I
JOURNAL Genomics 7, 346-357 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Wade, 15-MAR-1990.
FEATURES from to/span description
pept 77 640 slow-twitch skeletal troponin I (TNN1)
mRNA < 1 816 TNN1 mRNA
BASE COUNT 180 a 247 c 244 g 145 t
ORIGIN Chromosome 1q12-qter.
1 tagtctgcag tctacggcga ggcacaggcc agcccagctc cacgaggact gaacaaggtg
61 ctgtctcact gccaccatgc cggaagtcga gagaaaaccc aagatcactg cctcccgcaa
121 actcttgctg aagagcctga tgctggccaa ggccaaggaa tgctgggagc aggagcacga
181 ggagcgcgag gctgagaagg tgcgctacct ggcagagcgc atccccacgc tgcagacccg
241 tggcctgtcc ctcagtgccc tgcaggacct gtgccgggag ctgcacgcca aggtggaggt
301 ggtggatgag gagcgatacg acattgaggc caaatgcctc cacaacacca gggagattaa
361 ggacctgaag ctgaaggtga tggacctccg tgggaagttc aagcgcccgc ccctgcgtcg
421 agtccgtgtc tcggctgacg ccatgctccg ggccctgctg ggctccaagc acaaggtgtc
481 catggatctg cgggccaacc tcaagtctgt gaagaaggaa gacacagaga aggagcggcc
541 tgtggaggtg ggtgactgga ggaagaacgt ggaggccatg tctggcatgg aaggccggaa
601 gaagatgttt gatgccgcca atgctccgac ctcacaatag aggccagctt gctgtgctgc
661 gctctgagct cctgcttcat gcttcttctc caacccagct cactcacctc tctgcctgtg
721 tctggagcat cccttcccac ctctccccca cttcttccct ccagcctgca atgccctcct
781 ctggaactgg gattaaacag atacccaaga ggcagg
//
LOCUS PPB4TSP 349 bp ds-DNA VRL 27-JUL-1990
DEFINITION Bovine papillomavirus major transcript, 5' flank.
ACCESSION M35265
KEYWORDS major transcript.
SOURCE Bovine papillomavirus (type 4) DNA.
ORGANISM Bovine papillomavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Papillomavirus.
REFERENCE 1 (bases 1 to 349)
AUTHORS Stamps,A.C. and Campo,M.S.
TITLE Mapping of two novel transcripts of Bovine papillomavirus type 4
JOURNAL J. Gen. Virol. 69, 3033-3045 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pre-msg 320 > 349 BPV-4 major transcript
BASE COUNT 109 a 54 c 88 g 98 t
ORIGIN
1 ctttgcagat gaattagaac aggaggaaca gtaaatagaa tattgctgtg cttttacatt
61 tgagaaggat tgtagtgtgg gcatttaaaa gctgaccttc cagtcttaat tgcagtaggc
121 gcctaagagg gtggtggtgg tataagttca agtttttcaa gaaaatgaag gggcagaacg
181 tgacattaca ggacattgca atagaattag aggatacaat tagtccaatt aacttgcatt
241 gtgaagagga gattgaaact gaggaggtgg acacccctaa cccttttgca ataacagcaa
301 cttgttatgc ttgcgagcaa gtccttcgtt tagctgttgt aacgtcaac
//
LOCUS PPBBPV4X 69 bp ds-DNA VRL 27-JUL-1990
DEFINITION Bovine papillomavirus X protein gene, partial cds.
ACCESSION M35264
KEYWORDS .
SOURCE Bovine papillomavirus (type 4) DNA.
ORGANISM Bovine papillomavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Papillomavirus.
REFERENCE 1 (bases 1 to 69)
AUTHORS Stamps,A.C. and Campo,M.S.
TITLE Mapping of two novel transcripts of Bovine papillomavirus type 4
JOURNAL J. Gen. Virol. 69, 3033-3045 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 69 X protein (AA at 3)
BASE COUNT 18 a 27 c 19 g 5 t
ORIGIN
1 acaacgcggg acccaaaccg gggaccacgc ccgaggacgt agcagaccgt ccccccgatc
61 ttccagaga
//
LOCUS BLIPGECR 314 bp ds-DNA BCT 27-JUL-1990
DEFINITION B.licheniformis regulatory extracellular protease gene, complete
cds.
ACCESSION M35503
KEYWORDS extracellular protease.
SOURCE B.licheniformis DNA, clone pAN2.
ORGANISM Bacillus licheniformis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 314)
AUTHORS Watanabe,K., Sato,N., Asano,K., Hatanaka,Y., Okada,J.,
Murata,K. and Kimura,A.
TITLE Nucleotide sequence of the gene increasing the extracellular
proteolytic activities of Bacillus licheniformis; Comparison with
similar phenotypic genes from other Bacillus sp
JOURNAL Agric. Biol. Chem. 51, 2807-2809 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 27 149 regulatory extracellular protease
binding 12 16 ribosomal binding site (put.)
binding 20 23 ribosomal binding site (put.)
BASE COUNT 118 a 53 c 55 g 88 t
ORIGIN
1 gatcttccaa aaggagtgtg gaaccggtgg aaaagcaaca aattgaagaa ttaaaacaac
61 tgctttggcg gctagagaat gaaatcagag aaacaaagga ctccttgcgc aagattaaca
121 aaagcattga tctgttaata caagtttaaa gggatttggt ttctttttca ccgaaaagtt
181 gcatataaat tcatgcgtta caccgcaaac tttaaataga ccataacacc cattttacac
241 aacttgcgat tagttctcaa atcctaatga aacataaaat ttttgtttgt ttttcaagga
301 gtcaaataaa gatc
//
LOCUS DOGPLBA 832 bp ss-mRNA MAM 27-JUL-1990
DEFINITION Dog cardiac phospholamban mRNA, complete cds.
ACCESSION M35393
KEYWORDS phospholamban.
SOURCE Dog heart, cDNA to mRNA, clone pPLB1.
ORGANISM Canis sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 832)
AUTHORS Uyeda,A., Kitano,K., Fujii,J., Kadoma,M., Tada,M. and Tanaka,S.
TITLE Characterization of recombinant cDNA clones for canine cardiac
phospholamban
JOURNAL Nucleic Acids Symp Ser 17, 121-124 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 181 339 cardiac phospholamban (EC 2.7..1.37)
BASE COUNT 277 a 161 c 125 g 269 t
ORIGIN
1 agaaaacttt ctaactaaac accgataaga cttcatacaa ctcacaatac tttatattgt
61 aatcatcaca agagccaagg ctacctaaaa gaagagagtg gttgagctca catttggccg
121 ccagcttttt acctttctct tcaccattta aaacttgaga cttcctgctt tcctggggtc
181 atggataaag tccaatacct cactcgctct gctattagaa gagcttcaac cattgaaatg
241 cctcaacaag cacgtcaaaa tcttcagaac ctatttataa atttctgtct cattttaata
301 tgtctcttgt tgatctgcat cattgtgatg cttctctgaa gttctgctgc aatctccagt
361 gatgcaactt gtcaccatca acttaatatc tgccatccca tgaagagggg aaaataatac
421 tatataacag accacttcta agtagaagat tttacttgtg aaaaggtcaa gattcagaac
481 aaaagaaatt attaacaaat gtcttcatct gtgggatttt gtaaacatga aaagagcttt
541 attttcaaaa attaacttca aaatgactat aggtgcgcat aatgtaattg ctgaattcct
601 caacaaagct tgtaaaagtt tctatgccaa attttttctg agggtaaagt aggagtttag
661 ttttaaaact gctctgctaa ccagttcact tcacatataa agcattagct tcactatttg
721 agctaaatat ttatattgta ctgtaaatgc ctatgtaatg tttattaaga tttttcaagt
781 ctccgctaag tacgaaaata atcatccaaa tgaagtcatc atttgaaata gc
//
LOCUS DVUFDXA 707 bp ds-DNA BCT 27-JUL-1990
DEFINITION D.salexigens flavodoxin gene, complete cds.
ACCESSION M35475
KEYWORDS flavodoxin.
SOURCE D.salexigens (ATCC 14822) DNA.
ORGANISM Desulfovibrio salexigens
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 707)
AUTHORS Helms,L.R., Krey,G.D. and Swenson,R.P.
TITLE Identification, sequence determination, and expression of the
flavodoxin gene form Desulfovibrio salexigens
JOURNAL Biochem. Biophys. Res. Commun. 168, 809-817 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 238 678 flavodoxin
binding 227 232 ribosome binding site (put.)
signal 129 135 flavodoxin -35 region
signal 152 157 flavodoxin -10 region
BASE COUNT 223 a 152 c 149 g 183 t
ORIGIN
1 agactaacaa taaaaatttt gttttctcac aaacaacatt ttgaacgcgc atatatcggt
61 acgtcttaag gagtgcataa caccactcag gcgcggcgta aacactcaat taccgattgg
121 ctgacatttt cacattttct taatattttt ataaattgat attgactttg aatttcactc
181 tcactacaaa ctaacacaac gttatcaaac cgattacaga tttataagga ggacactatg
241 tccaaatcac tgatcgttta cggctctact accggaaata ctgaaacagc cgccgaatac
301 gtggctgaag cgtttgaaaa caaagaaatt gatgtggaac ttaaaaatgt tactgatgtc
361 agtgttgccg atctcggcaa cggatacgac atcgtgctat tcggctgctc tacctggggc
421 gaagaagaaa ttgaattgca ggacgacttc atccccctct acgattccct cgaaaacgca
481 gacctgaagg gcaagaaagt atctgtcttc ggatgcggtg actccgatta cacttatttt
541 tgcggtgcag tagatgccat cgaagaaaaa ctcgaaaaaa tgggggccgt cgtcataggt
601 gacagcctca agatcgatgg cgacccggaa cgcgatgaga ttgtaagctg gggttcagga
661 atagcggaca aaatttaaat taatacagct cagctatcct tacgaag
//
LOCUS HUMCKRASA 450 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human PR310 c-K-ras protein mRNA, 5' end.
ACCESSION M35504
KEYWORDS c-K-ras protein; oncogene.
SOURCE Human (patient PR310) lung carcinoma, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 450)
AUTHORS Yamamoto,F., Nakano,H., Neville,C. and Perucho,M.
TITLE Structure and mechanisms of activation of c-K-ras oncogenes in
human lung cancer
JOURNAL Prog. Med. Virol. 32, 101-114 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 450 PR310 c-K-ras oncogene
BASE COUNT 155 a 71 c 106 g 118 t
ORIGIN
1 atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg
61 atacagctaa ttgacaatca ttttgtggac gaatatgatc caacaataga ggattcctac
121 aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt
181 catgaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt
241 gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt
301 aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg
361 ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct
421 tttattcaaa catcagcaaa gacaagacag
//
LOCUS HUMCKRASB 450 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human PR371 c-K-ras oncogene mRNA, 5' end.
ACCESSION M35505
KEYWORDS c-K-ras oncogene.
SOURCE Human (patient PR371) lung carcinoma, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 450)
AUTHORS Yamamoto,F., Nakano,H., Neville,C. and Perucho,M.
TITLE Structure and mechanisms of activation of c-K-ras oncogenes in
human lung cancer
JOURNAL Prog. Med. Virol. 32, 101-114 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 450 PR371 c-K-ras oncogene
BASE COUNT 156 a 71 c 105 g 118 t
ORIGIN
1 atgactgaat ataaacttgt ggtagttgga gcttgtggcg taggcaagag tgccttgacg
61 atacagctaa ttgacaatca ttttgtggac gaatatgatc caacaataga ggattcctac
121 aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt
181 caagaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt
241 gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt
301 aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg
361 ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct
421 tttattcaaa catcagcaaa gacaagacag
//
LOCUS HUMTCRAER 237 bp ds-DNA PRI 27-JUL-1990
DEFINITION Human T cell receptor-alpha enhancer region DNA.
ACCESSION M35477
KEYWORDS T-cell receptor.
SOURCE Human T-lymphocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 237)
AUTHORS Winto,A. and Baltimore,D.
TITLE Developmental regulation of the TCR-alpha-delta locus
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 87-92 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
binding 28 60 NF-alpha-1 protein binding site
binding 128 139 NF-alpha-2 protein binding site
binding 159 172 NF-alpha-3 protein binding site
binding 175 189 NF-alpha-5 protein binding site
binding 192 203 NF-alpha-4 protein binding site
mut 180 187 gcagaagc in NF-alpha-5; tactccta in
Mut-alpha-5
mut 192 197 tcctct in NF-alpha-4; gaagag in Mut-alpha-4
BASE COUNT 73 a 58 c 57 g 49 t
ORIGIN
1 agacagctgc accctgaaat gggtaagctg gtcagatagt gaatcaatag ccagaagtag
61 aacaggaaat ggaaaaagtt tcccacttcc ctccaggtgt ttgggtctga acagcctccc
121 acttccatga cgtcacggct gctgacatgg gcaaacaggt ccccctttga agctctcccg
181 cagaagccac atcctctgga aagaggagtt aaaaatacag agttagagat aagatct
//
LOCUS HUMTGIGMU 352 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human Ig mu-chain transgenic mRNA, partial cds.
ACCESSION M35478
KEYWORDS immunoglobulin heavy-chain; mu-immunoglobulin; transgenic gene.
SOURCE Human leukemic patient mononuclear cell line, cDNA to mRNA, clones
pTMMG1-[4,5,12].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 352)
AUTHORS Shimizu,A., Kinashi,T., Nussenzweig,M.C., Mizuta,T.-R.,
Leder,P. and Honjo,T.
TITLE Molecular mechanism for immunoglobulin double-isotype expression
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 175-181 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 352 mu-immunoglobulin transgene protein (AA at 2)
/hgml_locus_uid="LW0029F" /nomgen="IGHM"
/map="14q32.33"
recomb 226 227 V region end/D region start
recomb 274 275 D region end/J6 region start
recomb 332 333 J6 region end/C1 region start
BASE COUNT 84 a 99 c 104 g 65 t
ORIGIN
1 ggcttctgga ggcaccttca gcagctatgc tatcagctgg gtgcgacagg cccctggaca
61 agggcttgag tggatgggag ggatcatccc tatctttggt acagcaaact acgcacagaa
121 gttccagggc agagtcacga ttaccgcgga cgaatccacg agcacagcct acatggagct
181 gagcagcctg agatctgagg acacggccgt gtattactgt gcgaaaaccg ggatcctggg
241 gcggtatagc agtggctggt acccgaactc ggactactac tactacggta tggacgtctg
301 gggccaaggg accacggtca ccgtctcctc agccaaaacg acacccccat ct
//
LOCUS MUSIGHADD 411 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Ig mu-chain mRNA VDJ region, 5' end.
ACCESSION M35502
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse hybridoma CP8 B3-D3, cDNA to mRNA, clone pCP12.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 411)
AUTHORS Reininger,L., Kaushik,A., Izui,S. and Jaton,J.-C.
TITLE A member of a new V-H gene family encodes anti-bromelinized mouse
red blood cell autoantibodies
JOURNAL Eur. J. Immunol. 18, 1521-1526 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 411 Ig mu-chain VDJ region precursor
sigp 1 57 Ig mu-chain signal peptide
matp 58 > 411 Ig mu-chain
recomb 351 352 V region end/D region start
recomb 360 361 D region end/J1 region start
BASE COUNT 101 a 84 c 118 g 108 t
ORIGIN
1 atggagtggg aactgagctt aattttcatt tttgctcttt taaaagatgt ccagtgtgaa
61 gtgcagctgt tggagactgg aggaggcttg gtgcaacctg gggggtcacg gggactctct
121 tgtgaaggct cagggtttac ttttagtggc ttctggatga gctgggttcg acagacacct
181 gggaagaccc tggagtggat tggagacatt aattctgatg gcagtgcaat aaactacgca
241 ccatccataa aggatcgatt cactatcttc agagacaatg acaagagcac cctgtacctg
301 cagatgagca atgtgcgatc tgaggacaca gccacgtatt tctgtatgag atatggtaac
361 tactggtact tcgatgtctg gggcgcaggg accacggtca ccgtctcctc a
//
LOCUS PRW1707PPS 66 bp ds-DNA SYN 27-JUL-1990
DEFINITION Plasmid pRW1707 DNA with polypurine-polypyrimidine sequences.
ACCESSION M35455
KEYWORDS .
SOURCE Synthetic DNA, clone pRW1707.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 66)
AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D.
TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine
sequences in plasmids
JOURNAL Biochemistry 29, 4704-4713 (1990)
STANDARD simple staff_entry
BASE COUNT 36 a 5 c 21 g 4 t
ORIGIN
1 ggatccgaag aagaagaaga agaagaagaa gaattcgaag aagaagaaga agaagaagaa
61 ggatcc
//
LOCUS PRW1718PPS 72 bp ds-DNA SYN 27-JUL-1990
DEFINITION Plasmid pRW1718 DNA with polypurine-polypyrimidine sequences.
ACCESSION M35460
KEYWORDS .
SOURCE Synthetic DNA, clone pRW1718.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 72)
AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D.
TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine
sequences in plasmids
JOURNAL Biochemistry 29, 4704-4713 (1990)
STANDARD simple staff_entry
BASE COUNT 38 a 6 c 22 g 6 t
ORIGIN
1 ggatccgaag aagaagaaga agaagaagaa gaattaattc gcgaagaaga agaagaagaa
61 gaagaaggat cc
//
LOCUS PRW1724PPS 66 bp ds-DNA SYN 27-JUL-1990
DEFINITION Plasmid pRW1724 DNA with polypurine-polypyrimidine sequences.
ACCESSION M35458
KEYWORDS .
SOURCE Synthetic DNA, clone pRW1724.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 66)
AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D.
TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine
sequences in plasmids
JOURNAL Biochemistry 29, 4704-4713 (1990)
STANDARD simple staff_entry
BASE COUNT 19 a 6 c 38 g 3 t
ORIGIN
1 ggatccggag gaggaggagg aggaggagga ggatccggag gaggaggagg aggaggagga
61 ggatcc
//
LOCUS PRW1725PPS 66 bp ds-DNA SYN 27-JUL-1990
DEFINITION Plasmid pRW1725 DNA with polypurine-polypyrimidine sequences.
ACCESSION M35459
KEYWORDS .
SOURCE Synthetic DNA, clone pRW1725.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 66)
AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D.
TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine
sequences in plasmids
JOURNAL Biochemistry 29, 4704-4713 (1990)
STANDARD simple staff_entry
BASE COUNT 27 a 6 c 30 g 3 t
ORIGIN
1 ggatccggag gaggaggagg aggaggagga ggatccgaag aagaagaaga agaagaagaa
61 ggatcc
//
LOCUS PT1INC3AA 330 bp ds-DNA BCT 27-JUL-1990
DEFINITION Plasmid pT181 (from S.aureus) inc3A region DNA with repC gene,
5' end.
ACCESSION M35476
KEYWORDS .
SOURCE Plasmid pT181 (from Staphylococcus aureus) DNA.
ORGANISM Plasmid pT181
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 330)
AUTHORS Highlander,S.K. and Novick,R.P.
TITLE Mutational and physiological analyses of plasmid pT181 functions
expressing incompatibility
JOURNAL Plasmid 23, 1-15 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 11 < 1 (c) RepC protein
mRNA 191 < 1 (c) RepC mRNA III
mRNA 291 < 1 (c) RepC mRNA IV
RNA 106 190 RNA I
RNA 106 250 RNA II
binding 27 22 (c) RepC ribosomal binding site (put.)
site 151 176 inverted repeat terminator structure
site 218 249 inverted repeat terminator structure
BASE COUNT 113 a 51 c 37 g 129 t
ORIGIN
1 tttttataca taaaaaatcg actccttaat ctcaatttcg tttaaggaat cgctcaccca
61 aatatatatc ttgatgtata tttaaatatc gtttaatatc taaatataca agattataaa
121 aacaactcag tgtttttttc tttgaatgat gtcgttcaca aactttggtc agggcgtgag
181 cgactccttt ttattttgtt attaatataa cactatcaaa agatttggtc taatcagatc
241 aagtcttttt ttatttaagc atttgtatta tctggtaaac agttaaaact actaaaacac
301 caagtacata cttacttgtt ataaaattct
//
LOCUS RICRGC1 145 bp ds-DNA PLN 27-JUL-1990
DEFINITION Rice 17S rRNA gene.
ACCESSION M35382 M35383
KEYWORDS 17S ribosomal RNA.
SEGMENT 1 of 4
SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 145)
AUTHORS Hariharan,N. and Padayatty,J.D.
TITLE Ribosomal RNA gene repeat units in rice embryos
JOURNAL Indian J Biochem Biophys 24, 157-164 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 > 145 17S rRNA
variant 105 106 cg in IR20; cgcggccg in Mangetsumochi
variant 112 113 gt in IR20; gcgt in Mangetsumochi
BASE COUNT 30 a 40 c 43 g 32 t
ORIGIN
1 gatccattgg agggcaagtc tggtgccagc agccgcggta attccagctc caatagcgta
61 tatttaagtt gttgcagtta aaaagctcgt agttggacct tgggcgggcc ggtccgcctc
121 acggcaggca ccgacctgct cgacc
//
LOCUS RICRGC2 260 bp ds-DNA PLN 27-JUL-1990
DEFINITION Rice 5.8S rRNA gene.
ACCESSION M35384
KEYWORDS 5.8S ribosomal RNA.
SEGMENT 2 of 4
SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 260)
AUTHORS Hariharan,N. and Padayatty,J.D.
TITLE Ribosomal RNA gene repeat units in rice embryos
JOURNAL Indian J Biochem Biophys 24, 157-164 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 85 247 5.8S rRNA
BASE COUNT 61 a 85 c 76 g 38 t
ORIGIN About 1.0 kbp after segment 1.
1 cagaacccac ggcgccgacg gcgtcaagga acacagcgag gccctggccg tccggcggcg
61 cgcgcgatac cacgagttaa atccacacga ctctcggcaa cggatatctc ggctctcgca
121 tcgatgaaga acgtagcgaa atgcgatacc tggtgtgaat tgcagaatcc cgtgaaccat
181 cgagtctttg aacgcaagtt gcgcccgagg ccatccggcc gagggcacgc ctgcctgggc
241 gtcacgccaa aagacgctcc
//
LOCUS RICRGC3 91 bp ds-DNA PLN 27-JUL-1990
DEFINITION Rice 25S rRNA gene.
ACCESSION M35385
KEYWORDS 25S ribosomal RNA.
SEGMENT 3 of 4
SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 91)
AUTHORS Hariharan,N. and Padayatty,J.D.
TITLE Ribosomal RNA gene repeat units in rice embryos
JOURNAL Indian J Biochem Biophys 24, 157-164 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 91 25S rRNA
BASE COUNT 30 a 12 c 29 g 20 t
ORIGIN About 1.6 kbp after segment 2.
1 gatcttggtg gtagtagcaa atattcaaat gagaacttga agggcgaaga ggagaaaggt
61 tccatgtgaa cggcacttgc acatgggtaa g
//
LOCUS RICRGC4 161 bp ds-DNA PLN 27-JUL-1990
DEFINITION Rice 25S rRNA gene.
ACCESSION M35386
KEYWORDS 25S ribosomal RNA.
SEGMENT 4 of 4
SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 161)
AUTHORS Hariharan,N. and Padayatty,J.D.
TITLE Ribosomal RNA gene repeat units in rice embryos
JOURNAL Indian J Biochem Biophys 24, 157-164 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 161 25S rRNA
variant 24 25 cc in IR20; ccgc in Mangetsumochi
variant 63 68 gccctt in IR20; tccacg in Mangetsumochi
variant 128 128 c in IR20; g in Mangetsumochi
BASE COUNT 36 a 52 c 48 g 25 t
ORIGIN About 250 bp after segment 3.
1 cgcacgtcgc gcggtgtccg gtgcccccgg cggcccttga aaatccggag gaccgagtac
61 cggcccttcc cggtcgtact cataaccgca tcaggtctcc aaggtgaacg acctctggcc
121 aatggaacaa tgtaggcaag ggaagtcggc aaaacggatc c
//
LOCUS STRRRA 5100 bp ds-DNA BCT 27-JUL-1990
DEFINITION Plasmid pAM-beta-1 (from S.faecalis) replication region DNA.
ACCESSION X17092
KEYWORDS .
SOURCE plasmid pAM-beta-1 (from Streptococcus faecalis) DNA.
ORGANISM Plasmid pAM-beta-1
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 5100)
AUTHORS Swinfield,T.-J., Oultram,J.D., Thompson,D.E., Brehm,J.K. and
Minton,N.P.
TITLE Physical characterization of the replication region of the
Streptococcus faecalis plasmid pAM-beta-1
JOURNAL Gene 87, 79-90 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 476 ORFA (AA at 3)
pept 505 873 ORFB
pept 933 1799 ORFC
pept 2121 2405 ORFF
pept 2772 3071 ORFD
pept 3114 4604 ORFE
pept 4954 5091 ORFG
binding 488 496 ORFB ribosomal binding site (put.)
binding 919 927 ORFC ribosomal binding site (put.)
binding 2759 2765 ORFD ribosomal binding site (put.)
binding 3100 3106 ORFE ribosomal binding site (put.)
binding 4942 4948 ORFG ribosomal binding site (put.)
BASE COUNT 1953 a 806 c 944 g 1397 t
ORIGIN
1 gaattcattt gttcatatct attttgactc tttttattgt attattgata ggaataagtg
61 ttattcaaca agtacaactt caatcagtca aaaaagagag cgaattattg gaagaacaaa
121 ttgaacgtgt caaagaaacg gatatttctc aatcgaaaat agatactttt ggaagatact
181 ttttgactta ctatttttca caggaaaaaa atcaagaaaa ttatcagtca agtttaagaa
241 catatgtatc cgaaaaggta gatatatccg attggaaagc gttagggaaa acgcttaaga
301 gtgtcaatta ctatggtagt gaacaaacaa aaaaagggta tagcgttgaa taccttctaa
361 atgtttcagt agacaatcgt agtaaaatgc aaaaaatcac ttttgaggtt gaaccaacta
421 aaaacggctt tcttgtgaca actcaaccaa aacttacaga cttctctttt aactgaaata
481 cgctaaaaag agagtgctta aacaatggga aaaattaatt taaatcaaat ttacacagca
541 aaagaaatga gtgaaaggat aggtaagaat agaaattacc tatcccaagc ttatcgtaac
601 aacaaacatg aaatacttaa aaattttaat tatcgaaaaa ttggtggaac aattattttc
661 tctgataatc ccaataatga tttatcgcaa ttgattacag caaaagaagc aagccagttg
721 ttaggaaaga atgatgaata ctttgcccac atttataaac ggtttcctca tagactagag
781 ggaatcgatc acatttatac aggtaaaaca ttattcttaa cgaaagaatc gctagaagtt
841 tttaaaaaaa agatgaacaa aaatgtcaga taaagcatat ttatgatata ctatttatgt
901 acacaaatat taatttttaa ggagagaaga aaatgaaaaa aattgccctt ttaacacttt
961 ttagtacagc gctattggta tcagccccac ttgttagttt tgcggacgaa gcaacaggta
1021 gttcaggtgt caatgttcta gctgatgatg tagttgatcc agtagaacca accgatccaa
1081 ctactccggt tgatccagtt gatccagttg atccagttga tccagttgat ccagttgatc
1141 cagttgatcc agtcgatcca gttgatccag tcgatccaac agagccaacg gaaccaacag
1201 agccaacgga accaacagag ccaacggaac caacagagcc aacagagcca acggaaccaa
1261 cggaaccaac ggaaccaacg gaaccaacag agccaacgga accaacggaa ccaacagagc
1321 caacggaacc aacagagcca acggaaccaa cggaaccaac agagccaaca gagccaacgg
1381 aaccaacgga accaacagag ccaacggaac caacagagcc aacagagcca acagagccaa
1441 cggaaccaac ggagccaacg gaaccaacgg aaccaagcaa gccgacagag ccaacagaac
1501 caagtaaacc agtagacaca aacccaatcg aaaacccagt taatacagat acaggcgttg
1561 tgattgttgc agttgaagat agtaaaccaa tcattcaatt agcagacgga acaacaaaaa
1621 aagttgaagc taaagaaatt ggcgcagacg ttcaaaaaga cggaacagta acggttaaag
1681 gttcagacgg aaaaatgaaa gttttaccaa aaacaggaga aacagaaaat attgcattgt
1741 ctgtcctagg ttcattaatg gtattaggat cagcattcat ttttaaaaag agaatctaat
1801 taagtatttt taaaagcatg gtcgcaagtt tcactagcag ccatgctttt tttgaatctt
1861 tgtaggacgt tttaaagaga attacccact ttaaaattaa acatgcgctt agaatcgctt
1921 taggaaacac gatccagtcc aataatcgtc gataaaaact tttgaaaaag gttggtgaaa
1981 ttacctactt ttggaataat cacaaatcac aagtgattaa tcacaaatca caagtgatta
2041 atcacttgtt tattaagata ttaaaagcta taatttaaat aaagcgtgaa ttttattaca
2101 caaaaagagg ggggagaaac ttggaactag catttagaga aagcttaaaa aagatgagag
2161 gtaccaaatc aaaagaaaaa ttctcccaag aattagaaat gagtagatca aattattcac
2221 gaatagaatc aggaaaatca gatccaacca taaaaacact agaacaaatt gcaaagttaa
2281 ctaactcaac gctagtagtg gatttaatcc caaatgagcc aacagaacca gaaccagaaa
2341 cagaatcaga acaagtaaca ttggatttag aaatggaaga agaaaaaagc aatgacttcg
2401 tgtgaataat gcacgaaatc gttgcttatt tttttttaaa agcggtatac tagatataac
2461 gaaacaacga actgaataga aacgaaaaaa gagccatgac acatttataa aatgtttgac
2521 gacattttat aaatgcatag cccgataaga ttgccaaacc aacgcttatc agttagtcag
2581 atgaactctt ccctcgtaag aagttattta attaactttg tttgaagacg gtatataacc
2641 gtactatcat tatataggga aatcagagag ttttcaagta tctaagctac tgaatttaag
2701 aattgttaag caatcaatcg gaaatcgttt gattgctttt tttgtattca tttatagaag
2761 gtggagtttg tatgaatcat gatgaatgta aaacttatat aaaaaatagt ttattggaga
2821 taagaaaatt agcaaatatc tatacactag aaacgtttaa gaaagagtta gaaaagagaa
2881 atatctactt agaaacaaaa tcagataagt atttttcttc ggagggggaa gattatatat
2941 ataagttaat agaaaataac aaaataattt attcgattag tggaaaaaaa ttgacttata
3001 aaggaaaaaa atctttttca aaacatgcaa tattgaaaca gttgaatgaa aaagcaaacc
3061 aagttaatta aacaacctat tttataggat ttataggaaa ggagaacagc tgaatgaata
3121 tcccttttgt tgtagaaact gtgcttcatg acggcttgtt aaagtacaaa tttaaaaata
3181 gtaaaattcg ctcaatcact accaagccag gtaaaagcaa aggggctatt tttgcgtatc
3241 gctcaaaatc aagcatgatt ggcggtcgtg gtgttgttct gacttccgag gaagcgattc
3301 aagaaaatca agatacattt acacattgga cacccaacgt ttatcgttat ggaacgtatg
3361 cagacgaaaa ccgttcatac acgaaaggac attctgaaaa caatttaaga caaatcaata
3421 ccttctttat tgattttgat attcacacgg caaaagaaac tatttcagca agcgatattt
3481 taacaaccgc tattgattta ggttttatgc ctactatgat tatcaaatct gataaaggtt
3541 atcaagcata ttttgtttta gaaacgccag tctatgtgac ttcaaaatca gaatttaaat
3601 ctgtcaaagc agccaaaata atttcgcaaa atatccgaga atattttgga aagtctttgc
3661 cagttgatct aacgtgtaat cattttggta ttgctcgcat accaagaacg gacaatgtag
3721 aattttttga tcctaattac cgttattctt tcaaagaatg gcaagattgg tctttcaaac
3781 aaacagataa taagggcttt actcgttcaa gtctaacggt tttaagcggt acagaaggca
3841 aaaaacaagt agatgaaccc tggtttaatc tcttattgca cgaaacgaaa ttttcaggag
3901 aaaagggttt aatagggcgt aataacgtca tgtttaccct ctctttagcc tactttagtt
3961 caggctattc aatcgaaacg tgcgaatata atatgtttga gtttaataat cgattagatc
4021 aacccttaga agaaaaagaa gtaatcaaaa ttgttagaag tgcctattca gaaaactatc
4081 aaggggctaa tagggaatac attaccattc tttgcaaagc ttgggtatca agtgatttaa
4141 ccagtaaaga tttatttgtc cgtcaagggt ggtttaaatt caagaaaaaa agaagcgaac
4201 gtcaacgtgt tcatttgtca gaatggaaag aagatttaat ggcttatatt agcgaaaaaa
4261 gcgatgtata caagccttat ttagtgacga ccaaaaaaga gattagagaa gtgctaggca
4321 ttcctgaacg gacattagat aaattgctga aggtactgaa ggcgaatcag gaaattttct
4381 ttaagattaa accaggaaga aatggtggca ttcaacttgc tagtgttaaa tcattgttgc
4441 tatcgatcat taaagtaaaa aaagaagaaa aagaaagcta tataaaggcg ctgacaaatt
4501 cttttgactt agagcataca ttcattcaag agactttaaa caagctagca gaacgcccta
4561 aaacggacac acaactcgat ttgtttagct atgatacagg ctgaaaataa aacccgcact
4621 atgccattac atttatatct atgatacgtg tttgtttttt ctttgctgtt tagcgaatga
4681 ttagcagaaa tatacagagt aagattttaa ttaattatta gggggagaag gagagagtag
4741 cccgaaaact tttagttggc ttggactgaa cgaagtgagg gaaaggctac taaaacgtcg
4801 aggggcagtg agagcgaagc gaacacttga ttttttaatt ttctatcttt tataggtcat
4861 tagagtatac ttatttgtcc tataaactat ttagcagcat aatagattta ttgaataggt
4921 catttaagtt gagcatatta gaggaggaaa atcttggaga aatatttgaa gaacccgatt
4981 acatggattg gattagttct tgtggttacg tggtttttaa ctaaaagtag tgaatttttg
5041 atttttggtg tgtgtgtctt gttgttagta tttgctagtc aaagtgatta aatagaattc
//
LOCUS SUSISDNA 241 bp ds-DNA INV 27-JUL-1990
DEFINITION Sea urchin integration site DNA.
ACCESSION M34912
KEYWORDS .
SOURCE Sea urchin DNA, clone 1.
ORGANISM Strongylocentrotus purpuratus
Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa;
Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE 1 (bases 1 to 241)
AUTHORS Flytzanis,C.N., McMahon,A.P., Hough-Evans,B.R., Katula,K.S.,
Britten,R.J. and Davidson,E.H.
TITLE Persistence and integration of cloned DNA in postembryonic sea
urchins
JOURNAL Dev. Biol. 108, 431-442 (1985)
STANDARD simple staff_entry
BASE COUNT 70 a 42 c 33 g 96 t
ORIGIN
1 gaaaagtaac gatattcgtt tttataaaag gtcattttca cgtagagatg aatttacctg
61 gtactaataa ctgatgcaga gtcgttttca aacagtgtca aatgcaaacc tttttgtctg
121 actggctctt ttttaaattt gtattaatgc cataacactc actttaattt gcgacatggc
181 tctttccatg tcactttttt attcttcaca tgataaatat tctaaaacta tctgttcttt
241 t
//
LOCUS MSQD7AA 3819 bp ds-DNA INV 27-JUL-1990
DEFINITION A.aegypti D7 gene, exons 1-5.
ACCESSION M33156
KEYWORDS .
SOURCE A.aigypti (strain Rockefeller) female salivary gland DNA, clone
AEGD7.
ORGANISM Aedes aegypti
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae.
REFERENCE 1 (bases 1 to 3819)
AUTHORS James,A.A., Blackmer,K., Marinotti,O., Ghosn,C. and Racioppi,J.
TITLE Molecular analysis of a gene expressed in the female salivary
glands of the vector mosquito, Aedes aegypti
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.A.James, 22-MAR-1990.
Author address:A.A.James:
U of C, Irvine, Dept Biochem
Irvine, CA 92717
FEATURES from to/span description
pept 2377 2421 D7 protein, exon 1
2481 2696 D7 protein, exon 2
2757 2975 D7 protein, exon 3
3038 3232 D7 protein, exon 4
3288 3578 D7 protein, exon 5
pre-msg 2350 > 3819 D7 protein mRNA and introns
IVS 2422 2480 D7 protein intron A
IVS 2697 2756 D7 protein intron B
IVS 2976 3037 D7 protein intron C
IVS 3233 3287 D7 protein intron D
signal 2321 2326 TATA box
signal 3634 3639 poly-A signal
BASE COUNT 1202 a 721 c 744 g 1152 t
ORIGIN
1 gaattcaagg aacctttaaa tgaatctctt gaaatatccc tggaaacatt actcgtgaag
61 tcctggagaa ttttcagtta taaatctggg aagaatcctt gcatgaatcc ctggggtaat
121 ttatgaagaa atcctttgag gtattcttgc aagaatctct tgacgttgaa tgcatttggt
181 ttggcttgaa atattttaag gttttggtga aattagaata cttggagaaa tctgtgataa
241 tattttgaat atattcttga aataatttta gaaagagtta ccgcattaat tggaactgaa
301 atcgctggcg gaatttttgt gaaaatcttc aagaaatttc aaaatgtctc tgaaaaaacc
361 ctgaaggaaa ttactaaagg ataatttgac agattttttt tattattgac tctttattat
421 ggagaaattc agcccttagc tagttcatct ccttgacaga attactgaaa aacgactctg
481 ttttttgtaa tctactaatt cagagtccaa cgctcgggtt ttgaatttcc catatttcct
541 acacaaatcc atgaaagaat aatagtcgag aactgtcaaa aagttatgaa aaattattga
601 aaacggaatc gagactagca ctgtaaaaga gtcatctatt gtcatttcag ttacaagggc
661 caatcaattg acacaaacac aagcttcacc aactcttcca gaaattttaa agcatctact
721 gatattggct tagataattt atacgtattc ctcctagatt tggcccagtt gttagttaca
781 tgaaaggcca aatgttaaag gttaaatatc cgcagagtcc attgtcaatc agtgaagtat
841 tttaagccag catgaaaggg tattatcaac attttttcca aaaccctccg ggttcattta
901 tgctgagaag gaaaaaccga agtgattttt ttgcgctctc gtgctctgca taataacctt
961 accgtccttc aattaaactt tccgtcaaag tcttcatagt ccaagcaata aaaaaaaaga
1021 tgaaaatttc ttcaataagg tatcaccaaa gacacatacc ataccgaggg accaaatgca
1081 gtactagaag tggtaccaaa tcagagcccg agtgggacgg acctggtatc actcgacata
1141 ttcactgttt actgatgtcg tacaaacagc tattcgtcgt aattcttcgt ctgccacttc
1201 aaatgaaaac ctgctaggtg aaacggacga acaatccaaa ctttccttca gcaaatcttg
1261 tcctacccat cagaaagtgt tatttatgag gactcctgag cggtactgca atgctctcat
1321 atgagcgctg gacggcactg acgttcacct tccggatcct cgtggagtgc ttgctgtttc
1381 gacgccatct tcgattgaac tgtcagttat tactctcaac agaagcagta agcttcacct
1441 taaggctaag taacccgtca ttcgttttgg caacaatgat gacttttcag cttgcatttc
1501 aaaatgataa aacgcagtct tgatagttta tactgacttg aaaaagtatc actgtacgca
1561 tttacatgca taaagtatgc tgatactttt tcagctgtgt cagtgcaaaa gcaactgatt
1621 ttctttgatt cgaaatcatg agatgaatta gcaacaacca tcaacgacgc gtacaaattt
1681 taatgacggc ctacttcgcc ttaacagcct tgtcgtgatg acgatccttg cagttgcgac
1741 aggaatactt cgacaaatcc tagctagatt tcccgatctg aagcagtttg ttaaaaaaaa
1801 atattttcag ttgagaaatg attttcctca cctcagagag ttaccggatc taaccgtatt
1861 aaaggttgtc aatcaggata tgcgcatcca gtaccgattc cttggataac gatggcaaat
1921 cgattaaaac ttgtgcctac gcaatttaat agttctagtt gtctctaagc aatctatcag
1981 gtcttttggt tctcgcgaaa tctgattaat tcatcgacat taccgtcgag agcctggagc
2041 ttgatttgcg acaaacatat tgatcgattg cttgaataca taagcttttg ctaagcctat
2101 taatccaata atagatctct tcagagacga atggaagcac accatagtac tttataatat
2161 tgaacatttg ttagtactgt gcacgtgtgc acacattcat gtacatatgg atgatttgat
2221 cagctttact caattctttc ttcacactct gctggttgtt tatatttgcg taaatctatt
2281 atctgttatc agctgactgc taaatcacat tcaacttgca tataaaacgc attaggcctg
2341 tctaacattc atcccgatca tacttacaat actagaatga agctacttct attactcgca
2401 atttttacaa ccttttccgt ggtaagatgt tcctagtgca tccaactctt catttcttaa
2461 caaatactcc accttttaag gttgcctcaa tgggaccatt tgatccggag gagatgctgt
2521 tcatctttac gcgctgcatg gaagacaatt tggaagatgg agcgaatcga cttccaatgt
2581 tagcgaaatg gaaagaatgg attaacgaac cggtagacag ccccgcaact cagtgtttcg
2641 gcaaatgcgt cctggtaaga acaggtctgt acgatccggt agcccaaaag ttcgatgtaa
2701 gttggtggtt gatatccggg tggaattgga taatttcatc gcttcaattg ttgtaggcgt
2761 cggtgatcca ggagcaattt aaggcttatc cgtccttggg ggaaaagagc aaagttgaag
2821 catatgctaa cgcagttaaa cagttgcctt ccacaaataa cgactgtgcc gctgttttca
2881 aagcgtacga tcctgttcat aaggcgcata aggacaccag caagaacttg ttccatggaa
2941 acaaggagtt gaccaagggc ctctatgaga agttggtaag tgtttccgtg ttattcgatt
3001 gaaacaattg tgtgtatgaa catattaatt gcttcaggga aaagacattc gccagaagaa
3061 gcaatcctac ttcgagtttt gcgagaacaa gtactaccca gctgggtcag ataagcgtca
3121 gcaactttgt cagataaggc aatacactgt cttagatgat gcgctgttca aggagcacac
3181 tgattgcgtg atgaagggta ttcgctacat tacgaaggat aatcaactgg atgtaagaaa
3241 ataataaata tcatgcaaac atcatagtaa tcgacatttc ctttcaggtg gaagaggtga
3301 agcgggactt caagctagtg aataaagata cgaaggcact cgaaaaggtt ttgaatgact
3361 gtaagtctaa ggaaccaagc aacgcaaaag agaaatcatg gcactactac aaatgtttgg
3421 tggaatcttc ggttaaggat gatttcaaag aggcttttga ctatcgtgag gtacggtcac
3481 agatttatgc cttcaatttg cccaagaaac aggcttacag caaaccagca gtgcaatctc
3541 aagtgatgga aatcgacggt aaacagtgtc cacaatagaa tagtgcattg aaaatgtctt
3601 ccaattcaat ggtttcatcg aagtagaaag ttaaataaag atataataat catcacgcag
3661 aggaactttg ttttaactta tattgtttat ttggtaattg ttgtagttat ttttaaatta
3721 ttttaatttg ggaagccgaa aaaccgttaa taggaaatta aagtagtttt tgaatcattg
3781 ttattataaa atatcgcata gttctgtact gaagaattc
//
LOCUS MSQD7AB 1040 bp ss-mRNA INV 27-JUL-1990
DEFINITION A.aegypti D7 mRNA, complete cds.
ACCESSION M33157
KEYWORDS .
SOURCE A.aigypti (strain Rockefeller) female salivary gland, cDNA to mRNA,
clone AEGD7.
ORGANISM Aedes aegypti
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae.
REFERENCE 1 (bases 1 to 1040)
AUTHORS James,A.A., Blackmer,K., Marinotti,O., Ghosn,C. and Racioppi,J.
TITLE Molecular analysis of a gene expressed in the female salivary
glands of the vector mosquito, Aedes aegypti
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.A.James, 22-MAR-1990.
Author address:A.A.James
U of C, Irvine, Dept Biochem
Irvine, CA 92717
FEATURES from to/span description
pept 13 978 D7 protein
signal 1016 1021 poly-A signal
BASE COUNT 335 a 208 c 251 g 246 t
ORIGIN
1 gaattcccta gaatgaagct gcctctatta ctcgcaatag ttacaacctt ttccgtggtt
61 gcttcaacgg gaccatttga tccggaggag atgctgttca cctttacgcg ctgcatggaa
121 gacaatttgg aagatggacc gaatcgactt ccaatgttag cgaaatggaa agaatggatt
181 aacgaaccgg tagacagccc cgcaactcag tgtttcggca aatgcgtcct ggtaagaaca
241 ggtctgtacg atccggtagc ccaaaagttc gatgcgtcgg tgatccagga gcaatttaag
301 gcttatccgt ccttggggga aaagagcaaa gttgaagcat atgctaacgc agttcaacag
361 ttgccttcca caaataacga ctgtgccgct gttttcaaag cgtacgatcc tgttcataag
421 gcgcataagg acaccagcaa gaacttgttc catggaaaca aggagttgac caagggcctc
481 tatgagaagt tgggaaaaga cattcgccag aagaagaaat cctacttcga gttttgcgag
541 aacaagtact acccagctgg atcagataag cgccagcaac tttgtaagat aaggcaatac
601 actgtcttag atgatgcgct gttcaaggag cacactgatt gcgtgatgaa gggtattcgc
661 tacataacga agaataatga actggatgct gaagaggtga aacgggactt caagctagtg
721 aataaagata cgaaggcact tgaaaaggtt ttgaatgact gtaagtctaa ggaaccaagc
781 aacgcaaaag agaaatcatg gcactactac aaatgtttgg tggaatcttc ggttaaggat
841 gatttcaaag aggcttttga ctatcgtgag gtacggtcac agatttatgc cttcaatttg
901 cccaagaagc aggcttacag caaaccagca gtgcaatctc aagtgatgga aatcgacggt
961 aaacagtgtc cacaatagaa tagtgcattg aaaatgtctt ccaattcaat ggttaaataa
1021 agaaataatg atcatcacgc
//
LOCUS ADGLS 1477 bp ds-DNA VRL 27-JUL-1990
DEFINITION Adenovirus type 7 (simian) left end of genome: 0.0-4.2 map units.
ACCESSION K03325
KEYWORDS alternate splicing; terminal repeat.
SOURCE Adenovirus type 7 (simian) DNA.
ORGANISM Mastadenovirus s7
Unclassified.
REFERENCE 1 (bases 1 to 1477)
AUTHORS Kimelman,D., Miller,J.S., Porter,D. and Roberts,B.E.
TITLE E1a regions of the human adenoviruses and of the highly oncogenic
simian adenovirus 7 are closely related
JOURNAL J. Virol. 53, 399-409 (1985)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by D.Kimelman (15-AUG-1985).
The mRNA transcripts (mRNAs I and II) were determined by mRNA-DNA
hybridization and S1 nuclease digestion. The peptide coding
regions were deduced by analogy to other adenoviruses and following
the most likely mode of translation. The intron of the mRNA III
transcribed from the E1a of other adenoviruses has not yet been
determined by S1 mapping.
Enhancer sequences are located at positions 165-173 and 265-270. A
TATA box is located at position 408-413. A polyadenylation signal
is located at posityion 1447-1452.
FEATURES from to/span description
pept 486 837 E1a 28 K protein from mRNA I, exon 1
1178 1407 E1a 28 K protein from mRNA I, exon 2
pept 486 1056 E1a 24 K protein from mRNA II, exon 1
1178 1407 E1a 24 K protein from mRNA II, exon 2
pre-msg 439 1477 E1a mRNA
IVS 838 1177 E1a 24K pept intron A
IVS 1057 1177 E1a 28K pept intron A
BASE COUNT 331 a 334 c 442 g 370 t
ORIGIN 5' end of the l strand of the genome.
1 tctatataat ataccttatt tgggaacggt gccaatatgc taatgaggtg ggcggagttt
61 ggtgacgtat gcggaaatgg gcggagttag gggcggggtt tggcggtagg cgtggctggg
121 ggagtgtccg ggcgtgggaa cggaagtgac gtagggggcg cgccggaggt gacgtcgtgt
181 ggggagtttt aaaccggaag caaggtattt taaacgcttg caagcgcaat tttgtcggtt
241 ttggcgcgaa aactgataaa aagcggaagt tcggttaatc attaattttt acgataggga
301 ggaatattta ccgagggccg gtgaactttg agcggtgacg cggtggtttc gttacgtggc
361 accaccacgc gactgctcaa agtccccgtt tattgtctag gtgagggtat ttaaaccggc
421 tcagaacgtc aagaggccac tcttgagtgc ccgcgagaag agctttctcc tctttcgctg
481 cgaaaatgag acacttggcg ttggaaatga tttctgaact gctggattta ggactggata
541 ccattgatgg ctggctgcac accgaatttc ggccggtacc ggcgggggtg agtcataaca
601 tgtcgctgca cgaaatgtac gacctggacg ttaccggcca ggaggatgag aacgaagagg
661 cggtagatgg tgttttttcc gatgcgatgc tcctggccgc ggaggaggga atagaaatgc
721 ctaatcttta ttctccggga cctctggttg ggggaggtga aatgcctgaa cttcagcctg
781 aggaggaaga tcttttctgc tacgaagatg gcttccctcc cagtgactct gaggaaggtg
841 agcattcgca ggtggagaca gaacgtaaaa tggcggaggc ggcggcagca ggtgcggcgg
901 cggccgcgcg gcgggagcaa gatgactttc gcttagactg tcctagcgta cctggccatg
961 gctgtagctc ctgtgactac catcgcaaaa ctagcggctg tcctgaaatt ctgtgctcgc
1021 tgtgctatct gagggctaac agcatgttta tttatagtaa gtaaattttt tctactaact
1081 ttctcgttgt gtgtttgctc gctcgctcgc tcgcaactgc tggggtgctt gctgttggga
1141 ctgagcttac aggtattttc tctgtaattt tccataggtc cagtttctga ctctgagcca
1201 gacgagcccg actccacaac agctgattca aatcatggca gcccgccaac ccttcgctgc
1261 accccaccca gggacttgcc gcgacctgtg ccagtgaagg cctctcctgg caagcgccca
1321 gcggtgaaca gcttgcatga cctcatagag gaggttgaac aaacagtacc tttggacctg
1381 tccctaaagc gctctaggag caattagggt tataaaaccc ctccccttcc ccttaagtta
1441 tcaggaaata aaaagattaa ctggattctt tgtgcct
//
LOCUS ADGRS 116 bp ds-DNA VRL 27-JUL-1990
DEFINITION adenovirus type 7 (simian) right end of genome.
ACCESSION V00035 J01991
KEYWORDS terminal repeat.
SOURCE adenovirus type 7 (simian) dna.
ORGANISM Mastadenovirus s7
Unclassified.
REFERENCE 1 (bases 1 to 116)
AUTHORS Broker,T.R.
TITLE terminal nucleotide sequences and restriction endonuclease cleavage
maps of simian and murine adenoviruses
JOURNAL (in) Tooze,J. (Ed.);
DNA Tumor Viruses, Second Edition Revised: 1037-1046;
Cold Spring Harbor Laboratory, Cold Spring Harbor (1981).
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 116)
AUTHORS Tolun,A., Alestroem,P. and Pettersson,U.
TITLE sequence of inverted terminal repetitions from different
adenoviruses: demonstration of conserved sequences and homology
between sa7 termini and sv40 dna
JOURNAL Cell 17, 705-713 (1979)
STANDARD simple staff_review
COMMENT L-strand is given, from 5' to 3'.
BASE COUNT 25 a 15 c 46 g 30 t
ORIGIN
1 atcaataata taccttattt gggaacggtg ccaatatgct aatgaggtgg gcggagtttg
61 gtgacgtatg cggaaatggg cggagttagg ggcggggttt ggcggtaggc gtggct
//
LOCUS ADGSA7PE1 2338 bp ds-DNA VRL 27-JUL-1990
DEFINITION Simian adenovirus SA7P, leftmost 2338 bp.
ACCESSION X01027
KEYWORDS E1A protein; E1b protein; inverted terminal repeat;
overlapping genes; transforming gene.
SOURCE Simian adenovirus SA 7P.
ORGANISM Mastadenovirus s7
Unclassified.
REFERENCE 1 (bases 1 to 2338; enum. 1 to 2338)
AUTHORS Dekker,B.M.M., Konings,D.A.M., Denisova,T.S., Gibadulin,R.A. and
Van Ormondt,H.
TITLE The nucleotide sequence of the leftmost XhoI fragment (6%) of
simian adenovirus SA7P
JOURNAL J. Gen. Virol. 65, 1699-1708 (1984)
STANDARD simple automatic
COMMENT **map: leftmost XhoI fragment (6%);
EMBL features not translated to GenBank features:
key from to description
SITE 1 183 inverted terminal repeat (ITR)
of SA7P
PRM 409 416 TATA E1A region
CAP 440 440 cap site E1A RNAs
MSG 440 838 "12S" E1A RNA
MSG 440 838 1st exon "12S" E1A RNA
IVS 839 1175 intron "12S" E1A RNA
MSG 1176 >1445 2nd exon "12S" E1A RNA
MSG 440 1057 1st exon "13S" E1A RNA
IVS 1058 1175 intron "13S" E1A RNA
MSG 1176 >1445 2nd exon "13S" E1A RNA
SITE 1445 1450 AATAAA E1A
PRM 1508 1514 TATA E1B
CAP 1538 1538 cap E1B RNA's
This virus, an independent isolate from Moscow, USSR, is highly
related to simian adenovirus 7 (SA7). The E1A sequences are 95%
homologous to those of SA7 see $<K03322>$ J. VVrol. 5:399-409(1985)
FEATURES from to/span description
pept 487 838 E1A 21K protein (part 1)
1176 / 1401 E1A 21K protein (part 2)
pept 487 1057 E1A 29K protein (part 1)
1176 / 1401 E1A 29K protein (part 2)
pept 1569 2111 pot. 21K E1B protein
pept 1874 > 2338 pot. 55K E1B protein
BASE COUNT 514 a 535 c 725 g 564 t
ORIGIN
1 catcatcaat aatatacctt atttgggaac ggtgccaata tgctaatgag gtgggcggag
61 tttggtgacg tatgcggaag tgggcggagc aaggggcggg gcgagaggcg gggcttttgg
121 taggcgtggc cgggcgtggg aacggaagtg acgtcggggg cgcgccggac gtgacgtgtt
181 ttgaggcgtt ttaaaccgga agcaaggtat tttaaacgct tgcagcgcaa ttttgccggt
241 tttggcgcga aaactgataa aaagcggaag ttcggttaat cattaatttt tacgataggg
301 aggaatattt accgagggcc ggtgaacttt gagcgatgac gcggtggttt cgttacgtgg
361 caccaccacg cgactgctca aagtccccgt ttattgtcta ggtgagggta tttaaaccgg
421 ctcagaccgt caagaggcca ctcttgagtg cccgcgagaa gagctttctc ctctttcgct
481 gcgaaaatga gacacttggc gttggagatg atgtctgaac tgctggattt aggactggat
541 accatcgata gctggctgca caccgaattc gcgccggtac cgacgggggt gagtcataac
601 atgtcgctgc acgaaatgta cgacctggac gttaccggcc aggaggatga gaacgaagag
661 gcggtagacg gtgttttttc cgatgcgatg ctcctggccg ctgaagaggg agtagaaatg
721 cctagtcttt attctccggg acctctggtt gggggaggtg agatgcctga gctacagcct
781 gaggaggtag atcttttctg ctacgaagat ggcttccctc ccagtgactc tgaggaaggt
841 gagcattcgc aggtggagac ggaacgtaaa atggcggagg tggcggcggc aggtgcggcg
901 gcggccgtcc gcggggagca agatgacttt cgcttagact gtcctagcgt acctggccat
961 ggctgtagct cctgtgacta ccatcgcaaa aatagcggct gtcctgaaat tctgtgctcg
1021 ctgtgctatc tgagggctaa cagcatgttt atttatagta agtgaatttt tctactaact
1081 ttctcgctgt gtgtttgctc gctcgttcgc taactgctgg ggtgcttggt gttgggactg
1141 agcttacagg tatttttctc tgtaattttc cacaggtcca gtttctgact ctgagccaga
1201 cgagcctgac tccacaacag ctgattcaaa tcatggcagc ccgccaaccc ttcgctgcac
1261 cccacccagg gacttgccgc gacctgtgcc agtgaaggcc tctcatggca agcgcccagc
1321 ggtgaacagc ttgcatgacc tcatagagga ggttgaacaa acagtacctt tggacctgtc
1381 cctaaagcgc tctaggagca attagggtca taaaacccct ccccttcccc ttaagttata
1441 aggaaataaa aagattaact ggattctttg tgcctgcttt gtttgttcgt gggcggtcct
1501 tggggattat aaaaggggtg agtcaggtta taagggtact taggcaacgc tcaggtaaca
1561 tcgcctccat ggatctccga acggcgcttc agacttttga gagcacccgc cgcttgctgg
1621 agctctgttc caatagaacc tcttttttgt ggaggtggtt atttggaact ccgctcagcc
1681 ggctggttag gcaggtgaaa ttagaatacg agaaggattt tgaaagaatt ttagatcagt
1741 gtcccggggt gtttgagtcc ctggagctgg gctatcataa ggtttttgag gagaagattg
1801 taaaggagtt ggatttttct tctcccggtc gggcggtcgc ggctgtggcc tttgcttcct
1861 acctgctgga tagatggaac acccggaccc acctgtcccc ggggtaccag atggattaca
1921 tcagcctgaa cctgtggaag ttttggttgc gccggcgggt ttacaattac tcgcgggggc
1981 tgcctcagct agggccggcg gcgccgctgg cgaggcaggg gtcgcagcag gaggagcagc
2041 agcagcggca ggaggaggag caggtgcagg aggagatgag gtccggcctg gaccctccaa
2101 cggagaactg atggcgcagg tggccggcgc ggcggatgcc agcgggccgc cttccaagcg
2161 ccctaagaat gaggcagagc agatgcagga gggtttaact cagttgaccc tgagtctcat
2221 taacaagtat cggcctgaga ccgtgtactg gaatgaactg gagcaggaat ttcagcatgg
2281 ggagatgcac ttgcagttcc actttagctt tgagcaatta aaaactcact ggctcgag
//
LOCUS ADGVARNA 850 bp ds-DNA VRL 27-JUL-1990
DEFINITION Simian adenovirus type 7 (SA7) virus-associated (VA) RNA gene.
ACCESSION M14082
KEYWORDS virus-associated RNA.
SOURCE Simian adenovirus type 7 DNA.
ORGANISM Mastadenovirus s7
Unclassified.
REFERENCE 1 (bases 1 to 850)
AUTHORS Larsson,S., Svensson,C. and Akusjaervi,G.
TITLE Characterization of a low-molecular-weight virus-associated (VA)
RNA encoded by simian adenovirus type 7 which functionally can
substitute for adenovirus type 5 VA RNA-1
JOURNAL J. Virol. 60, 635-644 (1986)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by S.Larsson, 15-JAN-1987.
Simian adenovirus type 7 VA RNA stimulates protein synthesis in a
transient expression assay and can functionally substitute for VA
RNA-I during lytic growth of human Ad5. The SA7 genome encodes
only one VA RNA species.
FEATURES from to/span description
RNA 181 352 virus-associated RNA (5' and 3' ends +/-2bp)
ORF 105 < 1 (c) ORF A cds
ORF 366 > 850 ORF B cds
BASE COUNT 140 a 260 c 338 g 112 t
ORIGIN 150 bp upstream of XbaI site; 30 mu on viral map.
1 gtagatgtag cgcgacatcc aggtgaggcc ggcggcggtg gtggcggcgc gggtgaactc
61 gcggacgcgg ttccagaggt tgcgcagggg ggcgaagcgt tgcatggtgg tcacgctctg
121 gccggtgaga cgggcgcagt cctgcacgct ctagacggaa acagagagag gcgtcaccga
181 ctcctctccg tagcttgggg gttaggtcgc aagggtgcgg tggcggggaa ccccggttcg
241 aaaccggccg gatccgccac tcccgacgcg gcggccccgc gtccacgacc ccaccagagg
301 ccgagaccca gccgcgacgc gcgcacccca aatacggagg ggagtctttt ggtgcttttt
361 tgtagatgca tcccgtgctg cggcagatgc gccccacgcg gggggcctcc tcgcgggaag
421 aagagctgga ggcgcaggag gaggcgcgcg gtggcgccgc gggccccgcc gagctggagc
481 tggaggaggg cgaaggcttg gcgcggctgg gcgccggagt gccggagcgc cacccgcggg
541 tgcaactggc tcgcgaccag cgccaggcct acgtgccgcc gcagaacctg tttagggacc
601 ggagcgggga ggaggcggag gaaatgcgtg actgccgttt tcgggccggg cgggagctgc
661 gggcgggctt ggagcgcgag cggctgctgc ggcccgagga ctttgagccc gaggagcgca
721 acggtatcag tccggcgcgc gcgcacgtct cggccgccaa cctggtgacg gcctacgagc
781 agacggtgaa cgaggagcgc aactttcaaa agagcttcaa caaccacgtg cgcaccctcg
841 tggccccggg
//
LOCUS ADZITRL 150 bp ds-DNA VRL 27-JUL-1990
DEFINITION adenovirus 31 inverted terminal repeat & repl ori left end.
ACCESSION J01958
KEYWORDS origin of replication; terminal repeat.
SOURCE ad31 (adenovirus 31).
ORGANISM Mastadenovirus 31
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 150)
AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A.
TITLE conserved sequences at the origin of adenovirus dna replication
JOURNAL J. Virol. 44, 530-537 (1982)
STANDARD simple staff_review
BASE COUNT 38 a 29 c 48 g 35 t
ORIGIN
1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag
61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt
121 gtgggagccg ggcgcgccgg atgtgacgcg
//
LOCUS ADZITRR 150 bp ds-DNA VRL 27-JUL-1990
DEFINITION adenovirus 31 inverted terminal repeat & repl ori right end.
ACCESSION J01959
KEYWORDS origin of replication; terminal repeat.
SOURCE ad31 (adenovirus 31).
ORGANISM Mastadenovirus 31
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 150)
AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A.
TITLE conserved sequences at the origin of adenovirus dna replication
JOURNAL J. Virol. 44, 530-537 (1982)
STANDARD simple staff_review
BASE COUNT 40 a 28 c 47 g 35 t
ORIGIN
1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag
61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt
121 gtgggagccg ggcgcgccgg atgtgacgaa
//
LOCUS HSKGAORA 3726 bp ds-DNA VRL 27-JUL-1990
DEFINITION Herpesvirus of turkey glycoprotein A (gA) gene and ORF of unknown
function.
ACCESSION M33303
KEYWORDS glycoprotein A.
SOURCE Herpesvirus of turkey DNA.
ORGANISM Gallid herpesvirus type 2
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 3726)
AUTHORS Kato,A., Sato,I., Ihara,T., Ueda,S., Ishihama,A. and Hirai,K.
TITLE Homologies between herpesvirus of turkey and Marek's disease virus
type-1 DNAs within two co-linearly arranged open reading frames,
one encoding glycoprotein A
JOURNAL Gene 84, 399-405 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 207 1676 glycoprotein A
pept 2118 2756 ORF2
BASE COUNT 1016 a 884 c 815 g 1011 t
ORIGIN
1 tcgtgcgcat cgcccatggc tggcggaacg ctctttcgaa ccgtgaataa aactttgtat
61 ctactaaaca ataactttgt gttttattga gcggtcgaaa acaatgagga gctgcaattt
121 aaagctaacc gcatacgccg ggcgggtaaa gaccatttta taccatatta cgcatctatc
181 gaaacttgtt cgagaaccgc aagtatatgg tttccaacat gcgcgttcta cgcgtactgc
241 gcctgacggg atgggtgggc atatttctag ttctgtcttt acagcaaacc tcttgtgccg
301 gattgcccca taacgtcgat acccatcata tcctaacttt caacccttct cccatttcgg
361 ccgatggcgt tcctttgtca gaggtgccca attcgcctac gaccgaatta tctacaactg
421 tcgccaccaa gacagctgta ccgacgactg aaagcactag ttcctccgaa gcgcaccgca
481 actcttctca caaaatacct gatataatct gcgaccgaga agaagtattc gtattcctta
541 acaatacagg aagaattttg tgtgacctta tagtcgaccc cccttcagac gatgaatggt
601 ccaacttcgc tcttgacgtc acgttcaatc caatcgaata ccacgccaac gaaaagaatg
661 tagaggttgc ccgagtggcc ggtctatacg gagtaccggg gtcggattat gcatacccta
721 ggaaatcgga attaatatcc tccattcgac gggatcccca gggttctttc tggactagtc
781 ctacaccccg tggaaataaa tatttcatat ggattaataa aacaatgcac accatgggcg
841 tggaagttag aaatgtcgac tacaaagaca acggctactt tcaagtgata ctgcgtgata
901 gatttaatcg cccattggta gaaaaacata tttacatgcg tgtgtgccaa cgacccgcat
961 ccgtggatgt attggcccct ccagttctca gcggagaaaa ctacaaagca tcttgcatcg
1021 ttagacattt ttatcccccg ggatctgtct acgtatcttg gagacgtaac ggaaacattg
1081 ccacaccccg caaggaccgt gacgggagtt tttggtggtt cgaatctggc cgcggggcca
1141 cactagtatc cacaataacc ctcggaaact ctggactcga atctcctcca aaggtttcct
1201 gcttggtagc gtggaggcaa ggcgatatga taagcacatc gaatgctaca gctgtaccga
1261 cggtatatta tcacccccgt atctctctgg catttaaaga tgggtatgca atatgtacta
1321 tagaatgtgt tccctctggg attactgtga ggtggttagt tcatgatgaa ccccagccta
1381 acacaactta tgatactgtg gttacaggtc tctgcaggac catcgatcgt tatagaaatc
1441 tcgccagtcg gattccagtc caggacaact gggcgaaaac gaagtatacg tgcagactaa
1501 ttggatatcc gttcgacgtg gatagatttc aaaattccga atattatgat gcaacgccgt
1561 cggcaagagg aatgccgatg attgtaacaa ttacggccgt tctaggactg gccttgtttt
1621 taggtattgg tatcattatc acagccctat gcttttacct accggggcgg aattaagatt
1681 aaccatcgta tgtgatataa aaattattaa gtgttataac cgatcgcatt cttctgtttc
1741 gattcacaat aaataaaatg gtattgtaat cagcaccatc gcattgtttc gtagatgact
1801 catgttcagt ccgcgtgatg tcaaaaatac gtatttttgg tatcacgcag cggccaaaat
1861 gcccattatg ttatttttac tccaaacgcg gtatttaaaa catcgggacg tacatcatgt
1921 ggcgcacgtt aatcgtatac ggtgccgcta cattaaaaat cgcaagtctc cgaatatcaa
1981 gctcacggcc aaaacgtcgg taataatctt acgcatcgaa tgtgatacgg ataccgtaca
2041 atcgctgagt agatttccta tatagttact cagtagtgat acacaatcac aaaatcgctg
2101 gggtatatca tataagaatg atgtcgccca cccctgaaga tgatcgcgat ctcgttgtgg
2161 ttcgtggacg tctccgaatg atggatagcg gcacggaaac agatagagag caacgacatc
2221 cacgtacgac ttggcgatcg atctgttgtg ggtgtacgat aggaatggta tttaccatat
2281 tcgttctcgt agcggcagta ttgttgggat cactattcac tgtttcatac atggccatgg
2341 aatcgggaac atgtcccgat gaatggattg gtttgggtta tagttgcatg cgcgtggccg
2401 ggaaaaatgc aactgatctt gaggcgttgg atacatgtgc tcggcataac agcaaactta
2461 ttgacttcgc aaacgccaaa gttctggttg aagctatcgc cccattcggt gtgccaaatg
2521 cagcatatgg ggaagtcttc cggttaaggg acagcaaaac cacgtgtata cgacctacca
2581 tgggaggacc cgtgtcggca gactgtcctg taacatgtac cgttatatgt cagcgaccca
2641 ggcctctaag taccatgtct tccatcatta gagatgcccg cgtgtatctt catttagaac
2701 gacgcgatta ttatgaagtc tacgcctctg tcctctctaa tgcgatgagt aaataaaaac
2761 gcacctctaa cggttactgt gttttattta tccaatcaca ccatagacat tataaacata
2821 atatggatct ttatttcata taatgtatta tacagaccgc gcatcgcgtg taaaatatta
2881 ttcgctggac actttcagcc tacgtaaaat atgttctacc ttggacactg gtatatctcc
2941 acgcattgtt ttttcatgaa agtatttggt tgtcggagta gacaccctgc cagattccga
3001 aactggaggg tcattgctcg ccaaagaagg ccccccagac ccatcggtct tacgaattgc
3061 accagtcaga ccgtttccga tgttcgcaca ttgagaactc gaccccgccc ccgcaaattc
3121 agtcaaacca ggaaccgtat cgccacagcc acattcgaca cgatcgccta attgatgtgt
3181 gggcgtactt cccgacttgt tcgagttttt atcttgcttc cgaatagatt tcataagccg
3241 tgcattaaca tcgcttattc gctttggcag taattttatc cccagtagag ccagggcgtt
3301 cttagacagt ttaacatcat cggttgcagc gcggccgaac caagtattta tgctgcactc
3361 gagagccccc catgacacgg gcaacatgct cggaaacaaa ttcccaagct gatgttcaaa
3421 ttggtcttgc tgcaacaggg cctcttttaa tatagtattt tccgtattac cgtccaacca
3481 tatcacatat cccctctgag ttaaattaat taggtactga caatgatggt tgagaagtga
3541 aaccaacgaa accactgcta gtatggcacg tgagttccga tcaccatcaa agtccaccag
3601 gcgggtttgc gaccataata cgctggtttg tagtaagggg gcgatgacat tgcctataat
3661 tcgacaaagc aatgatgtcg aactccattt gtctgatggt aaggaggaaa tttcttccat
3721 gaattc
//
LOCUS HSKTK 2039 bp ds-DNA VRL 27-JUL-1990
DEFINITION Herpesvirus of turkey thymidine kinase gene, complete cds.
ACCESSION M26659
KEYWORDS thymidine kinase.
SOURCE Herpesvirus of turkey (strain FC126) DNA, passed in chicken embryo
fibroblasts.
ORGANISM Gallid herpesvirus type 2
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 2039)
AUTHORS Martin,S.L., Aparisio,D.I. and Bandyopadhyay,P.K.
TITLE Genetic and biochemical characterization of the thymidine kinase
gene from herpesvirus of turkeys
JOURNAL J. Virol. 63, 2847-2852 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.L.Martin, 03-AUG-1989.
FEATURES from to/span description
pept 608 1540 thymidine kinase (EC 2.7.1.21)
BASE COUNT 570 a 458 c 437 g 574 t
ORIGIN 649 bp upstream of XbaI site.
1 atcgatgaaa gtctgtacat atagccttct tttttaatgg tgtcgaacgt ataactacaa
61 gcccgcgtct cgcgacaaac actaaaacgg ggcaaatgat agtatgatca gatcccctgg
121 gggccatgcg ctcgatcaac gttttagact ccagcaactg cttagtccct gttgcgcgtt
181 gctcattctt gctggccgtc ttgaggttgc ggtaaaatct gcaagttttc agttcgatta
241 tgacacagac attgttacag ccttccacca ataacccgcc agataaggtg aacatgcaaa
301 tgcaatctgg ccttctgcgc cccaggtcta cttcaaaggc caatcgaaac gccgacgccc
361 ctttaagtag ttcagaaggc aggatcttgc ttaatatctt ctgtaatgtt ctaccaactg
421 tattccccgc cgacgactgg tcaaagtttt cagcagcaag ctttttgtaa aaacgattat
481 gaccacggac acccgctttt agcaatcctg ccataaggtg gtttcgccgc gtgcttgcct
541 cgaagacaat tgccagctaa tccagcatta ccatatttcc ttggcttgca tttggatctg
601 cgcgtcgatg gcattgccga gaagaccgcc cacgttaacg cgagtttatc tagacggacc
661 gtttggtata ggcaaaacgt ctatactaaa cgctatgccc gaccacacgc ccgatggggc
721 tcctatattg aaagtgtacg aaccaatgaa atattggaga tgccagtcta ccgatttggt
781 ggtagctgcc aacgaaacgc cagaacgtag gcgtggtgga gctttatcac gattccaatc
841 tgacatgatc atggcatcta tacaagccag atttgccgat ccatatttgc tttttcacga
901 acggttatca tctaaatgta gaggaaaaat agaaatatgc gatactccag caattatatt
961 aatgctggat aggcaccctg tggcggcgat attatgtttc ccaatcactc gctatttact
1021 tggagaatat tctttggaaa tgttgattag ctctataata agacttccgt tggaatcccc
1081 cggatgcaac ctgacagtca caatccttcc cgacgaaaag gaacacgtta ataggatttg
1141 ttcaagagat agaccgggtg aaacggcaga tagaaatatg ctcagaacac tcaatgccgt
1201 atacgcatct ttggtggaca cggttaaata cgcaaatcta acatgccctt acgagaaaga
1261 aagctgggaa atggaatggt tgggacttcc ctggtttgaa gagtcattac ttgaagaatt
1321 catctcgcgt ccccgccctg ttatttgttc gagaactcga atgccgctgg accgaactct
1381 cctggccatt tttaaacgga aagagctgtg tagcgaaaat ggggagctgt taactcagta
1441 ttcttggata ttgtggggat tactgactaa actacacacc attaatgtcg aattatttga
1501 cattagcggt atgtcacgtc gagaatgcgc cacgctataa tgcatactat gccggagaga
1561 ttgtctactc tcgctagctg gaatgattta tgcgagcttg aagatgatgt aatttcctat
1621 aataagggaa tgtgtaacga ggttggagcg tctcgataat tcttcttaat ctgctggtat
1681 tggttactgc cataacttaa tattggtcca tgctagaata gtcatacgct acgatctgtt
1741 gctatatatg actatcgcca aactgttaaa ccgcgaagaa tatatttcat ataaacctaa
1801 gggcccctca gtctgatttt ttgtgaaaac gtgtatacat gaagttttac tgcataatcc
1861 gtttcatgat catagcgaat ctttattcat cttaccaaat atcgcttcca ggcacatatc
1921 catcgcaaat attgcttgac atgaagaact cgccgctcgt acgctttaat atatcgacgc
1981 gtgattataa agacgagaca ctctggatac ggaaaaattc gacatttgtt tatatcgat
//
LOCUS LACHSP 3095 bp ds-DNA BCT 27-JUL-1990
DEFINITION Plasmid pTR2030 (from L.lactis) aborts phage infection (hsp) gene,
complete cds.
ACCESSION M30192
KEYWORDS .
SOURCE Plasmid pTR2030 (from Lactococcus lactis lactis (strain ME2)) DNA.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 3095)
AUTHORS Hill,C., Miller,L.A. and Klaenhammer,T.R.
TITLE Nucleotide sequence and distribution of the pTR2030 resistance
determinant (hsp) which aborts Bacteriophage infection in
Lactococci
JOURNAL Appl. Environ. Microbiol. 56, 2255-2258 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Hill, 29-NOV-1989.
FEATURES from to/span description
pept 1105 2991 hsp gene (aborts phage infection)
BASE COUNT 1129 a 439 c 518 g 1009 t
ORIGIN
1 ataagagtca gaatgtgacc gtcataatga ttacgaggat agctgcaacg cttggtttcg
61 agttaattgt cgtgcaacga ttgacggaga attgaaaaat ttcaaagttc acgatgttga
121 gacttacgat aaaaagaaaa atagatttca tcggagattg tctgatgcct tggtcccaat
181 aatatcttca gaagatgttg aatttgaagc cgagcaattt ttgaaattgt atttcccagt
241 agctatggaa attcctcagc gtattgaccc gcttttgata gctgaaaaaa tgggacttac
301 ggttgagtac catgaaattt cagaagatgg gaacattttt ggtcaaattt attttcatga
361 tgctcttctt gacgggaaag aaatcaaggc aaaaactatc ctgattgatc ctcgtgtcat
421 cgaaagccgt gggattggtg gacttaataa cacaataatg cacgagtgtg tccattggca
481 taaacatagg cttgcatttg aattggttag gcttttccaa ccagagcttt cgaatattac
541 cacaactaaa gaagagttcg atgggctcat tgagaaaaat atgactccaa cagattggct
601 ggagattcaa gctagaaaga ttacgcctaa aatcttgatg cctaagaaga tgtttaaaca
661 agaagttgaa acgtttatgc gtcctgatgg tggaagtgga atagtggacc agctcctcat
721 aatcgaggga actatttcag agcttgctag ttttttcacc gtatcaaaat tatctgcaaa
781 aattcggatg gttgagctgg gatatgagat tgcgataggt gcttcaaact atgttgatgg
841 gcatcctgta cccccacact cttggaagca gggagctgta tctgctaatc agacttattc
901 aattggtttt gttgatgcaa ctatagagac attaaagaac ccaagattac tcgtcgcaat
961 aaaaaaagaa cttaacctca tatttcatag ggattgaagt tcttgttttt tatttttggt
1021 agctaaaatg gtatccaatt aataattagc tattttattt tggtaaaata gaataaatag
1081 atttttagaa cagggagtag gtaaatgata actttgcaac accaagattg ggaacgtgca
1141 gtcaatatga ttaaaaacat tccaccaagt gccaaaaata aatactttca aacttttcct
1201 ttctttttgt taagtgaaac cagctgggaa gagttacttt ctgaaaattt tttctactca
1261 tatataaagt caggagagtt tctgacttat caagaaaacc tttcattcta cgataggaca
1321 attcaaaaga gccatggagc ataccgacaa actcgaattg tatctccaat catctatata
1381 tttttaatag caattgcttc tcaagtggaa agaatatatg ttgaaaagcg tactaatgat
1441 atgtcagtat atttttcagg ttcttttgaa aaagaaaaaa acactgcaca ttataaacaa
1501 agttataaca cttatatgac tgaacttaat gcctgtcaag aagagtttga ttattatttt
1561 caaacagatt tttcaacatt ttttcatcta gtagatacag acaacttatt caataaaata
1621 gaccgattag atccgaaaag tgcattagtc tactcttccc taataaaaat gattggacaa
1681 ggaagaatgc ccatagttga tggaaattca ggtctttctt ttttaaatac tgtagtttat
1741 ctagatgact ttgataaaga aattattgat tccttaaaaa cgatagttga aatcgaaagt
1801 tttaaactag ttcgatatgt agatgactta catattttca taaaatgtgc aaataaagat
1861 ttagattttt taaattataa agtatataat ttgttatgcg aaaaagcaac gaaacatcat
1921 cttgagataa attcttcaaa aacaaaatca tttactccca caagtgaact aagcactaaa
1981 atgaatacag atttgtataa tttctttgtg tataacgaag atgttgattt tgagcaatat
2041 ttttcaaaaa ataccttaat agaatttcta gataaattaa ataatatgtc tgttaatgct
2101 gatttttcag agtacgaaaa agaagtcctg tatacattgg aaaaccctga aatagtatct
2161 gacggaagtt atatattaaa cgctattgtt tacaataaaa gtacgtggtc acaggattat
2221 gatataaaaa ataaaattag tttattagta aatagtaatt atagaaaact aagatattct
2281 gcaaaagcac tcataactct tgtcttgaat acaagagatg gtgatatcat aaaaggacta
2341 cttaataatt tatttactac attcaaaaat ggtacaaatg atattattga tgaaattata
2401 ttaattgaat acttggtaca aagaaagttt aatcataaag atttaatgac tatcttgaaa
2461 gctgacgacc atggtattaa ggaatatatt aaggcatatc aaacttctga tttcataaaa
2521 agtctagaaa aaaataaagt tattttttat actaatcaga aagaagttta tcctttaata
2581 agtaaggata aaatactcaa tttcatatat tttagagcta agtattttga atccttagac
2641 ctagttcttg aatcctttgc atattataaa aactattttg ataggtttgt tgctcacgct
2701 atgttctgta caggaattga ctctggaaga aaacctaatt ataagctata ttatacagaa
2761 ggaaagttaa tagatgggtt aaaacaacta aattttttat cttctgatga gattacaaaa
2821 ataattaatg aagcacataa aattagaaat agtaaccccg taagtcactc tagtgctggc
2881 ttacttcaaa acgaagattt tagtagatat agagttaagt caagtcttaa tgacttgaaa
2941 attatcatag aacaactttc aacattgcta caaaataaaa acagattata aaatattatt
3001 atctgtttta tttttttcaa aagaatcaaa gtttattaga tttttcaact gtcctgttat
3061 ctcctcatcc gcatttggat aaagatgact ataaa
//
LOCUS PH15RER 2016 bp ds-DNA PHG 27-JUL-1990
DEFINITION Bacteriophage phi-15 right early region containing genes 17,
16.5-16.9, complete cds.
ACCESSION M28830
KEYWORDS DNA replication protein.
SOURCE Bacteriophage phi-15 DNA.
ORGANISM Bacteriophage phi-15
Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE 1 (bases 1 to 2016)
AUTHORS Benes,V., Arnold,L., Smrt,J. and Paces,V.
TITLE Nucleotide sequence of the right early region of Bacillus phage
phi-15 and comparison with related phages: Reorganization of gene
17 during evolution
JOURNAL Gene 75, 341-347 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 203 664 gene 17 peptide
pept 666 992 gene 16.9 peptide (put.)
pept 989 1306 gene 16.8 peptide (put.)
pept 1308 1700 gene 16.7 peptide (put.)
pept 1697 1861 gene 16.6 peptide (put.)
pept 1858 1971 gene 16.5 peptide (put.)
BASE COUNT 744 a 374 c 382 g 516 t
ORIGIN
1 aaagtagggt acagcgacaa cataccccat ttccccattg accgactatc ttcgacaaca
61 atctaacaac taaaacacaa ctatatacct atactaaata acaacatcaa tttgtcgaaa
121 agggtagaca aactactgtt taacatggta tactataata gaagtaagga agtaagacaa
181 ccaatcatag gaggaattac acatgaacaa ctaccaatta actattaacg aggtaatcga
241 gatcatcaac cgcaatacag acattaataa gttagtcgct aaaaaggaca acttatatcc
301 aacagatttg tatgacttag acaaacaaca attaatcgct atcattctca acagcgattt
361 cgcattatct agcatcaaaa gagcattatt agaggtaact gtcgaagagc taagagaaca
421 ggacaatgat gacgatttag atgaaataga taacgagtta tacgagggcg cagaggcggc
481 tgatgttcca cgtgaaacaa tagtgaaagt attcgaggca gacaaatcaa tagtaacgtt
541 caacggtgag aaactaaaac attatgttaa tgttgatgac aatagctcat ctgttgatga
601 agtaaagaag atcgccaagg aaatatcaga acatgacttt aacgaaaaca aggagaacaa
661 gtaacatgag cgtacaactt aatgcattta cattcatctt agaacgccgt ggatggcgca
721 tggtatgcta cgaacaatta acaacgaacg ggacacgcat attacatttc tatctcaaag
781 ataacccaac gttctttgct acatactcat cacaattctt atcagacacc aaaatgataa
841 gacgctttgc atcatggagc gggcaattac ttgaaggttc aaactctgtg ttctggacta
901 acataacacc atttgagcca attgatgaag aaacagcaga agacatcaag aatctcgaca
961 aagtcgttga aggaatgaat tttacgttat gattgacatt attgtaaaag aggacaagcg
1021 cttaataact gttcaaacac cagagggaga cgaagtattt tacactttgt ctttctcaga
1081 cgagcataag ttactaaaac gatcaagtgc cagactaaga aacaacatat atgccattgg
1141 tgtagcaaat atcagatggg tgttagttga catggataac atgatactta gtgagtatat
1201 gcatcacgtt gacatcctaa aagacataga caggaaaatg agacaattag ggtatatagt
1261 catttcagaa tggcaacacg caaataaaaa agggacgagg agataatatg gaagctattt
1321 tgatgatcgg tgtaatcaca ttgtgcgtta tattcctttt atcaggacgc aataataaaa
1381 agaaacaaga aataagggag ctagaagatt accttgaaga cctcaatcaa agaatcgtgc
1441 aacgaacaca gatacttagt gagcttaacg aagtcatcac aaacagaagt gttgataagt
1501 cagtcaatat gtcagcttgt gaaatcgccg tccttgacct ctatgagcag tcaaatatcc
1561 gcattcctag tgatatcatc gaagatatgg ttaatcaacg tttacaatca gaacaagatg
1621 tgttaaacta tatagagaca cagcggacat actggaaact ggagaatcag aaaaaactat
1681 atcgggggtc attgaaatga agttgctcac tcacacgtgt cattactgta gcttctcttt
1741 cttcactcga aagtttgatg tatttggtgc aataaccaag aaagatactc ctgttgtttt
1801 ctgccccgca tgtgggaatc aatccctttc agtatcacac attgaggagg agatcatatg
1861 aatcagaaag aattccaagc cgttttagac tggatgttgt cacacactat tatacaattc
1921 cacgaataca actatatgct acaaaagagc ctaccgttcc tcagacgata ggcttttctt
1981 ttgattgttt cacgtggaac atcactgtat acgcat
//
LOCUS STLENV 4122 bp ss-RNA VRL 27-JUL-1990
DEFINITION Simian T-cell leukemia virus, pol-env-pX-3' LTR region.
ACCESSION M11373 M11374
KEYWORDS env gene; pol gene.
SOURCE Simian T-cell leukaemia virus genomic RNA.
ORGANISM Simian T-cell lymphotropic virus type I
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 4122)
AUTHORS Watanabe,T., Seiki,M., Tsujimoto,H., Miyoshi,I., Hayami,M. and
Yoshida,M.
TITLE Sequence homology of the simian retrovirus genome with human T-cell
leukemia virus type I
JOURNAL Virology 144, 59-65 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept 311 1777 env protein
BASE COUNT 905 a 1456 c 741 g 1020 t
ORIGIN
1 catttaaacg tattaaccca ttgccacaaa acccgatggc aacttcacca ctccctccgt
61 ctgcaaccta tcccagagac acactccccc atcagtaaac aaacccattg gtattatttc
121 aagcttcctg gtcttaatag ccgccagtgg aaaggaccac aggaagctct ccaagaagcc
181 gccggcgctg ctctcattcc ggtaagcgct aattctgccc agtggatccc gtggaggctc
241 ctcaagcaag ctgcatgccc aagacccgtc gggggccccg ccgatcccaa agaaaaagac
301 caccaacacc atgggtaagt ttcttaccac cttactttta ttcctccaac tccgcccccc
361 cattgtctgt gatcacagtc ccagctgctg cactctcaca gtcggagtct cctcatatca
421 ttctaaaccc tgcaatcccg cccaaccagt ctgctcatgg accctcgacc tgctagctct
481 ttcagcagac caagccctac aacccccctg ccctaatctg gtgggttact ccaactacca
541 tgctacctat tccctatatc tattccctca ttggattaaa aagccaaacc gaaatggcgg
601 aggctattat tcggcctctt attcagaccc ttgttcccta aagtgccctt acctaggatg
661 ccaatcatgg acctgtccct acacaggagc cgtctccagc ccctattgga aatatcagca
721 agatgtcaat ttcactcaag aagtctcacg cctcagcctt aatctccact tttcaaaatg
781 cggtttctcc ttctcccttc tggtcgatgc actaggatac gaccccatct ggttccttaa
841 caccgaacct agccaactgc cccccactgc ccttcctcta ctcccccatt ccaacttgga
901 ccacatcctc gagccttcca taccatggaa gtcaaaactc ctaactcttg tccaattaac
961 tctacaaagc actaactaca cttgtattgt ctgcatagat cgtgccagct tatctacttg
1021 gcacgtctta tactctccca acatctctgt tccatcctcc ccctctactc ccctccttta
1081 cccatcgtta gcgcttccag ccccccatct gacattacca ttcaactgga ctcactactt
1141 tgaccctcag attcaagcca tagtctcctc cccctgtcac aactccctca tcttgccccc
1201 cttttctctg tcacctgttc ccgccccagg gtcccgttcc cgccgggcag taccggtggc
1261 ggtctggctc gtctctgccc tggccatggg aaccggagtt gctggcggga ttaccggctc
1321 catgtccctc gcctcaggaa aaagtctcct acatgaagta gacaaagata tttcccaatt
1381 gacccaagca attgtcaaaa accacaaaaa tctactcaag attgcacaat atgctgccca
1441 aaacagacga ggccttgatc tcctgttttg ggagcaagga ggactatgta aagcactgca
1501 agaacaatgt tgtttcctaa atattactaa ttcccatgtc tcaatacttc aagaaagacc
1561 cccccttgaa aatcgagttc taactggttg gggcctcaac tgggaccttg gcctctcgca
1621 atgggcccga gaggccttac aaactggaat cacccttgtc gcattactcc ttctggtcat
1681 ccttgtaggg ccatgcatcc ttcgccaact acgacaactc ccctcgcgtg tcagataccc
1741 ccactactct cttataaacc ccgagtcctc cctataaacc tagcagttac tgcgaccaca
1801 ttgcttgcag cctcccctgc caatagttaa cctcccccat caaatcctcc ttttcctgca
1861 acaacctctt ccgttcggcc tccaaggagt ctacctcact ctccagctgt ctagcataac
1921 catcaactcc caactccttt attttttctt tcctagcatc acgctgtctc gccctcttag
1981 ccccttgttc tgccagtctc cgcttgcgct caagccgctt ctcctcctcc cgcctcctgc
2041 gccgagcctt cttctcttcc ttccttttca ggaactcggc aatctgttcc tcctcctctt
2101 tctcccgctc tctttttcgc ttcctcttct cctcagcccg tcgctgccga tcacgatgcg
2161 tccccccgcg aggtggcgtc tcctccctta gagggacccg ttgctgctgg ccattacatt
2221 cctcttcgag ggatagtaaa ccgtcaagca cagactcctc ctccgtcctt tgtttgttta
2281 agtcctcttc tagggataat agtccgtcca ccaagtcttc caccagcagg tcctccgggc
2341 agggaacagg taaacatcga aacggcccta cacatacaaa attaatcata cttattatca
2401 gcccacttcc caggttttgg acagagcctt ctttatggat acccagtcta cgtatttgga
2461 aactgtgtgc aaggcgactg gtgccccatc tctgggggac tatgctcggc ccgcctgcac
2521 cgtcacgccc tactggccac ctgtccagag catcagatca cctgggaccc caccgatgaa
2581 cgcgttatcg gctcagctct gcagttcctt atccctcgac tcccctccct ccccacccaa
2641 agaacctcta agaccctcaa ggtccttacc ccgccggcta ctcatacaac ccccaacatt
2701 ccaccctcct tcttccaggc cgtacgaaaa tactccccct tccgaaacgg atacatggag
2761 cccacccttg ggcaacaact cccaaccctg tcctttcccg accccggcct ccggccccaa
2821 aacttgtata ccctctgggg aaactccgtt gtctgcatgt acctctacca actttccccc
2881 cccatcacct ggcccctcct gccccatgta attttctgcc acccaggcca acttggggcc
2941 tttctcacca acgttcccta caaacgaatg gaagaacttc tctataaaat tttccttaat
3001 acaggggcca caatcattct acccgaggac tgcttgccca ccaccctatt ccgacccact
3061 agggcacccg cctcgctgac agccaggcaa aatggcctcc ttccgttcca gtcaaccctt
3121 actacgccag gccttatttg gacatttacc gatggcacgc ccatggtttc cgggccctgc
3181 cctagggacg gtcagccatc tttggtgttg cagtcctcct cctttatatt tcacaaattt
3241 caaaccaagg cctaccatcc ctctttccta ctctcacatg gcctcataca gtactcgtcc
3301 tttcataact tacatctcct ctttgaagag tacaccaata ttcccatttc tttacttttt
3361 aacaaagaag aggcaaatga tactgaccat gagccccaaa tgctccccgg gggcctaaaa
3421 cctcccaatg gaaaacattt ccgcgaaaca gatgtctgaa aaaggtcagg gcccagacta
3481 gggctctgac gtctcccccc ggagggcagc tcaacacaag gccctgacgt gtccccctgg
3541 ggacaaatca tgagcccaga cctccgggaa gccaccgaga accacccatt tcctccccat
3601 gtttgtcaag ccgtccccag gcgttgacga caacccctcg cctcaaaaac ttttcatggc
3661 acgcatacag ctcaataaaa caacaggagt ctataaaagc gtggagacag ttcaggaggg
3721 ggctagcttc tgctctacac gcgcccgccg ccctacctga ggccgccatc cacaccggtt
3781 gagttgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgaccg ccgtctgggt
3841 aagttcggag ctcgggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag
3901 attcggccgg ctctccacgc tttgcctgac cctgcttgct taactccaca tctttgtttc
3961 gttttctgtt ccgcgccgct gcagatcgaa agttccaccc ctttcccttt cattcacgtc
4021 tgactgccgg cttggcccac ggccaagcac cggcatcctc actggctcgg agccagtagc
4081 agcccatttt acaactctct ccagagagaa gtttagtaca ca
//
LOCUS HUMIGHDA 368 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human Ig H-chain mRNA V-D-J region, partial cds.
ACCESSION M33858
KEYWORDS diversity region; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region;
variable region V-H4.
SOURCE Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA,
clone C.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 368)
AUTHORS Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J.
TITLE Intraclonal diversity in the V-H genes expressed by CD5- chronic
lymphocytic leukemia-producing pathologic IgM rheumatoid factor
JOURNAL J. Immunol. 144, 1526-1530 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 368 Ig H-chain V4-D-J region (AA at 1)
BASE COUNT 73 a 110 c 112 g 73 t
ORIGIN
1 caggtgcatc tacagcagtg gggcgcagga ctgttgaagc gttcggagac cctgtccctc
61 acctgcgctg tctatggtgg gcccttcagt ggttactatt ggagctgggt ccgccagccc
121 ccagggaagg gaccggagtg gattggggaa atcaatcata gtggacgcac cacctacaac
181 ccgtccctca cgagtcgagt caccatgtca ctggacacgt ccaagaacca gttctccctg
241 aagctgacct ctgtgaccgc cgcggacacg gctgtttatt actgtgcgag aggcccttgc
301 gaggcatact gtactgatga tgccccgcaa ggcggtgcgc ggtacttcca gcactggggc
361 cagggcac
//
LOCUS HUMIGHDB 260 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human Ig H-chain mRNA V-D-J region, partial cds.
ACCESSION M33859
KEYWORDS diversity region; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region;
variable region V-H4.
SOURCE Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA,
clone A.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 260)
AUTHORS Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J.
TITLE Intraclonal diversity in the V-H genes expressed by CD5- chronic
lymphocytic leukemia-producing pathologic IgM rheumatoid factor
JOURNAL J. Immunol. 144, 1526-1530 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 260 Ig H-chain V4-D-J region (AA at 1)
BASE COUNT 57 a 79 c 75 g 49 t
ORIGIN
1 atccgccagc ccccagggaa gggactggag tggattgggg aaatcaatca tagtggacgc
61 accacctaca acccgtccct cacgagtcga gtcaccatat cagtggacac gtccaagaac
121 cagttctccc tgaagttgac ctctgtgacc gccgcggaca cggctgttta tttctgtgcg
181 agaggccctt gcgaggcata ctgtactgat gatgccccgc aaggcggtgc gcggtacttc
241 cagcactggg gccagggcac
//
LOCUS HUMIGHDC 260 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human Ig H-chain mRNA V-D-J region, partial cds.
ACCESSION M33860
KEYWORDS diversity region; immunoglobulin heavy chain; joining exon;
mu-immunoglobulin; processed gene; variable region;
variable region V-H4.
SOURCE Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA,
clone B.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 260)
AUTHORS Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J.
TITLE Intraclonal diversity in the V-H genes expressed by CD5- chronic
lymphocytic leukemia-producing pathologic IgM rheumatoid factor
JOURNAL J. Immunol. 144, 1526-1530 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 260 Ig H-chain V4-D-J region (AA at 1)
BASE COUNT 58 a 82 c 74 g 46 t
ORIGIN
1 atccgccagc ccccagggaa gggactggag tggattgggg aaatcaatca tagtggacgc
61 accacctaca acccgtccct cacgagtcga gtcaccatat cagtggacac gtccaagaac
121 cagttctccc tgaagctgac ctctgcgacc gccgcggaca cggctgttta ttactgtgcg
181 agaggccctt gcgaggcata ctgtactgat gatgccccgc aagccggtgc gcggtacttc
241 cagcactggg gccagggcac
//
LOCUS HUMHLADQA 1468 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human MHC class II HLA-DQA1 mRNA, complete cds.
ACCESSION M33906
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human lymphoblastoid cell cell line SPL, cDNA to mRNA, clone
pII-alpha-7.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1468)
AUTHORS Jonsson,A.-K., Andersson,L. and Rask,L.
TITLE Complete sequences of DQA1 and DQB1 cDNA clones corresponding to
the DQw4 specificity
JOURNAL Immunogenetics 30, 232-234 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 44 811 MHC HLA-DQA1-alpha protein
/hgml_locus_uid="LY0151Z" /nomgen="HLA-DQA1"
/map="6p21.3"
signal 1119 1124 poly-A signal (pot.)
BASE COUNT 356 a 381 c 309 g 422 t
ORIGIN
1 cacagctcag agcagcaact gctgaggctg ccttgggaag aggatgatcc taaacaaagc
61 tctgctgctg ggggcccttg ccctgaccac cgtgatgagc ccctgtggag gtgaagacat
121 tgtggctgac catgttgcct cttatggtgt aaacttgtac cagtcttacg gtccctctgg
181 ccagtacacc catgaatttg atggagacga gcagttctac gtggacctgg ggaggaagga
241 gactgtctgg tgtttgcctg ttctcagaca atttagattt gacccgcaat ttgcactgac
301 aaacatcgct gtgacaaaac acaacttgaa catcctgatt aaacgctcca actctactgc
361 tgctaccaat gaggttcctg aggtcacagt gttttccaag tctcccgtga cgctgggtca
421 gcccaacacc ctcatctgtc ttgtggacaa catctttcct cctgtggtca acatcacatg
481 gctgagcaat gggcactcag tcacagaagg tgtttctgag accagcttcc tctccaagag
541 tgatcattcc ttcttcaaga tcagttacct caccttcctc ccttctgctg atgagattta
601 tgactgcaag gtggagcact ggggcctgga cgagcctctt ctgaaacact gggagcctga
661 gattccagcc cctatgtcag agctcacaga gactgtggtc tgcgccctgg gattgtctgt
721 gggcctcgtg ggcattgtgg tgggcactgt cttcatcatc atccgaggcc tgcgttcagt
781 tggtgcttcc agacaccaag ggcccttgtg aatcccatcc tggaagggaa ggtgcatcgc
841 catctacagg agcagaagag tggacttgct acatgaccta gcattatttt ctggccccat
901 ttatcatatc ccttttctcc tccaaatgtt tctcctctca cctcttctgt gggacttaaa
961 ttgctatatc tgctcagagc tcacaaatgc ctttgaatta tttccctgac ttcctgattt
1021 ttttcttttc tcaagtgtta cctactaaga gatgcctgga gtaagccacc cagctaccta
1081 attcctcagt aacctccatc tataatctcc atggaagcaa caaattccct ttatgagata
1141 tatgtcaaat ttttccatct ttcatccagg gctgactgaa accgtggcta agaattggga
1201 gactctcttg tttcaagcca atttaacatc atttaccaga tcatttgtca tgtccagtaa
1261 cacagaagca accaactaca gtatagcctg ataacatgtt gatttcttag ctgacgttaa
1321 tatttctttc ttccttgtgt tcccaccctt ggcattgcca cccacccctc aattaaggca
1381 acaatgaagt taatggatac cctctgcctt tggctcagaa atgttatagc aaaaatttta
1441 aaataaaaaa gtaagtctgt attaattg
//
LOCUS HUMHLADQB 896 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human MHC class II HLA-DQB1 mRNA, complete cds.
ACCESSION M33907
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human lymphoblastoid cell cell line SPL, cDNA to mRNA, clone
pII-beta-10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 896)
AUTHORS Jonsson,A.-K., Andersson,L. and Rask,L.
TITLE Complete sequences of DQA1 and DQB1 cDNA clones corresponding to
the DQw4 specificity
JOURNAL Immunogenetics 30, 232-234 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 46 831 MHC HLA-DQB1-beta protein
/hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1"
/map="6p21.3"
BASE COUNT 185 a 252 c 263 g 196 t
ORIGIN
1 gtccaagctg tgttgactac cactactttt cccttcgtct caattatgtc ttggaagaag
61 gctttgcgga tccctggagg ccttcgggta gcaactgtga ccttgatgct ggcgatgctg
121 agcaccccgg tggctgaggg cagagactct cccgaggatt tcgtgttcca gtttaagggc
181 atgtgctact tcaccaacgg gaccgagcgc gtgcggggtg tgaccagata catctataac
241 cgagaggagt acgcgcgctt cgacagcgac gtgggggtgt atcgggcggt gacgccgctg
301 gggcggcttg acgccgagta ctggaatagc cagaaggaca tcctggagga ggaccgggcg
361 tcggtggaca ccgtatgcag acacaactac cagttggagc tccgcacgac cttgcagcgg
421 cgagtggagc ccacagtgac catctcccca tccaggacag aggccctcaa ccaccacaac
481 ctgctggtct gctcagtgac agatttctat ccagcccaga tcaaagtccg gtggtttcgg
541 aatgaccagg aggagacaac tggcgttgtg tccacccccc ttattaggaa cggtgactgg
601 accttccaga tcctggtgat gctggaaatg actccccagc gtggagacgt ctacacctgc
661 cacgtggagc accccagcct ccagaacccc atcatcgtgg agtggcgggc tcagtctgaa
721 tctgcccaga gcaagatgct gagtggcatt ggaggcttcg tgctggggct gatcttcctc
781 gggctgggcc ttattatcca tcacaggagt cagaaagggc tcctgcactg actcctgaga
841 ctattttaac tgggattggt tatcactttt ctgtaacgcc tgcttgtccc tgccca
//
LOCUS HAMRPS14 5300 bp ds-DNA ROD 27-JUL-1990
DEFINITION C.griseus ribosomal protein S14 (RPS14) gene, complete cds.
ACCESSION M35008
KEYWORDS ribosomal protein S14.
SOURCE C.griseus (haplotype EmtB wild type) ovary transformed fibroblast
cell line CHO DNA.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (base 1 to 5300)
AUTHORS Rhoads,D. and Roufa,D.
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (sites for [1])
AUTHORS Diaz,J.-J., Rhoads,D. and Roufa,D.J.
TITLE Genetic analysis of a vital mammalian housekeeping locus using CHO
cells that express a transfected mutant allele
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by D.J.Roufa, 08-JUN-1990.
Author address: D.J.Roufa
Kansas State University
Division of Biology
Ackert Hall
Manhattan, Kansas 66506
FEATURES from to/span description
pept 1977 2125 ribosomal protein S14, exon 2 (first expressed
exon)
2576 2737 ribosomal protein S14, exon 3
4074 4150 ribosomal protein S14, exon 4
4724 4791 ribosomal protein S14, exon 5
pre-msg 183 4841 ribosomal protein S14 mRNA and introns (5' end
+/- 4 bp)
IVS 220 1974 ribosomal protein S14 intron A
IVS 2126 2575 ribosomal protein S14 intron B
IVS 2738 4073 ribosomal protein S14 intron C
IVS 4151 4723 ribosomal protein S14 intron D
signal 4819 4824 poly-A signal
mut 4783 4783 c in wt; g in mutant Emr-75 [ref. 2] null
Arg-150 -> Gly with ThaI rflp.
BASE COUNT 1315 a 1121 c 1415 g 1449 t
ORIGIN
1 agatctggtc acagcgacac gcagcacccg gagtccttaa taagccatct tggtcgccgg
61 caggcttagc ctccttggaa accgcaatag aggcttcttc cgggagccgg gattccctcc
121 gtgacctcag aggtacactt ccggaatacg gaagtgaccc gtctgcccgc cccttccttc
181 tctctctttc cggtggagaa gtctggagac gacgttcagg tagggaggcc cggcggccca
241 caatcgtgcg gcatcctctg gccaggggcg tctgcgggct tggggtcgcg gtttgtgcgc
301 ggtaaccgag ccaggtgcgt gatggtgacg ggcggctgtc cgctcgccgc gatccacccg
361 gcctggggct gcgccgtgtc cgggaggcgg ccttctggcc tgagatcgga aggtctgagg
421 tggaagtgcg gggcctgcct gggccttgcc gtcacgccac agtcgtcgcc cgagcgttcg
481 gggagcgtgt ccctagtcgc gggtgggccg tcaccggcac cccattagat tcttaggggc
541 cgtcgacgac ctgcacctaa cctcatgccc ctcatgtgga aaatacgagc agcgttcgta
601 tctgcccctt ctgtaaaccg aagcgtaaaa tgggatgtca gaaaacctgg tcacgcagga
661 ttagagtgct tctaacccat ggtggttacc cctccgagtc ggacacattg cctgttgggg
721 tgccccctag tagagtagga ctgatgccat atgtttacat ttctctgaat agccctctcg
781 tctccactga ggacggcaaa gctacactga acagatgtca tgtcttatgg aatcgtgttt
841 gttagcctaa tttgagggta tgagcaggtg actacatcct ggacctgccc ttgtgctttt
901 ggcaaataag tcccaaatag tgtgtgaggg aggcactaca gagatcccag gtgtggttga
961 aacttttaag gcctaggagg aactttgtgg agtaggagca tcttggttct aggaggagta
1021 taagacaggc tttggtgcag tattgtcctg gacagactta gcctcgggtt ggtcaaaggc
1081 atgcctgcag ctttggcaaa aaaaaaaaaa gcctctttgc ttgtgtattt tccctgggtc
1141 ctgtgactgt acttacatta ttgactaggc aggaaggggt tacatgtcag gtgtatttgc
1201 agttcataac aatagcaaaa ttagagttat gaagtagtag caatgacttt ttgtgtgggg
1261 ggcaccacaa tatgagaatt gtactaaagg gtcacagcgt taggttgaga accaaagggt
1321 agactcactt gatgctgaac actgaatttt aggtatggat atctttacca ttaaacttcc
1381 tggaaggcat accctgacca tcctgggact tgctacgtag accataacta cgatatcctc
1441 ctgcctgtga ctcccaggtg ctggggctaa agttatacac aagcacacat tgctgtggca
1501 gtgcttttta atgtttatta ctagtaaagg gtgaaagatt tggttctaaa cagtgagagg
1561 ctgtcagatt aagaattgtg tgtggacagg gagtccgtta atctgcagtt ggcttcagtt
1621 gtgtcatgtt aatcaaataa acttcccagc ttaaggagag ggagagggga gttgtgaatg
1681 gggctttgcc ccaacacctg agttcagcat cctctatttt ggataactac ttgtgaaagt
1741 ttgaggtgca tattgttcat tgctggaatc ttaggctcat cctgaaaatg cttggagctt
1801 ttgtttagaa attgtctcct ctcttggcgc ctccctttta gttacaactc agtaattccc
1861 caagtacaaa tatgttggat attagagagc caaaggagct tgggttttcc tttagtaggc
1921 taaagtattt ggggatgttt tgtctgttaa catggaggtt ctccttccac tcagaaatgg
1981 cacctcgcaa gggaaaggaa aagaaggaag aacaggtcat cagcctcgga cctcaagtgg
2041 ctgaaggaga gaatgtattt ggtgtctgcc acatttttgc atctttcaac gacacctttg
2101 tccatgttac tgatctttct ggcaagtgag tacttgggga catgaagaca taaaaccaag
2161 cctcacaggg actgaggtgg gagaaaaagg cataggagtt tatgggattc cctctaaaga
2221 attactgatt ttcttatttt cccataaatt ataagagaaa acagttgaat gtagctgagg
2281 attcatccac agaaaatcac cgcaaatagg aagtggaagc agaggatcag aaactcaatt
2341 atcctcagct acacgcacat gtaaaatata tgcattatgg aagtgattat ggggataaaa
2401 tgcaaaacta attgttgtat taacagggag tttgtctaga tagggtcctt ggacccctgt
2461 aattatacac aactcctgtc ttgttcagaa ccaggatgtg tgcttgttgg ggttcagcaa
2521 gtgtttcctg tgtccttgtt ggaaaatgaa ggcatggatc tgcttttgtc tctagggaaa
2581 ctatctgccg ggtgactggt ggaatgaagg taaaggctga cagagatgag tcttctcctt
2641 atgcagccat gttggctgcc caggatgttg cccagaggtg caaggagctg ggcatcactg
2701 ccctgcatat caaactccga gctacgggag gaaacaggta tggaagtcta gggctggaca
2761 tttgtaagtt cactgggctt gggggtggat ctgagatttg agtctctgct gtgtctgaga
2821 gcaagccgtt ttaggaatgt atgtatgttc ttcgtcaggc atggtagctg atacctgtaa
2881 tcccagcact ccggaaccaa tataggagga ggatctaaag tttggtgcta acctgggcta
2941 cacagtgagc ctcaaaaata agatagtttg ttcctttatg tttgtccctg tagagtgctg
3001 cttttatttt ttttttttta aagcatacat gagactgact cttccccagc accaggtaga
3061 ctgggtgtgg tagtatacac ctgttgtctc agaaagtgga ggtgaggaga gctagagttt
3121 caagttggtt attggttgca tattgagtag gtggccagct tgagctgaac agtgatgctt
3181 aaatgatgaa ctgggggtgc atagtccaga tagagacaaa acttcaagag aagcatcttg
3241 gggtgagaaa gaagttgggc tttacaggga tgattggtgt aagatatgat ctaagaagca
3301 gggatggagc ctctggggtc ttaagtttta caaaacagag caagggcttg ggcatgaaaa
3361 ctagaatagg tggctactga gtggaaagat actgcatagg aagccctaag gaagttacat
3421 actggatgta ggagctggga agtaggcatc acagcaaggg gaggacagcc acaggccagg
3481 tgactatccc ctcccccccc cacacacaca catgtacact tttaaaggat ttgtaggcat
3541 ggtggctcac acctttaatc ccagcatctc ttggggggcg tccccccaca aaagtgggaa
3601 aaattacatg cgtatgttgc ctgtgtgcat gcatgcgtgc attatcggtt ccctgtggag
3661 gacagaagag gaccatgaat ctcttggaac tggtgttgct gagccacctc tccaggcctc
3721 agttagcagt tttgaaaggg tttggttttg ttttgttttt tcgagacagg gtttcttctg
3781 tgtagctttg gagcctatcc tggcactggc tttggagacc aggctggcct caaactcaca
3841 gagatccgac tgcctcccga gtgctgggat taaaggtgtg caccaacgca ggatttggtt
3901 ttgttgtgag attgtataat atatgaaaac cgtagcagcc tacaaagtct agaattttgt
3961 gtaatcccct cacagtgtgc agatctttaa ctatagcccc accaccattc agaagtgaat
4021 taaaaccatg tgaacattcc cactgacttg accaacaatc cttcattttt taggaccaag
4081 acccctggac ctggagccca gtcagccctc agagctcttg ctcgttcagg gatgaagatt
4141 gggcggattg gtaagtgacc cttctagcta acattgggtt tattttgaag cattgacccc
4201 ttccaaacta tgtgatgtgc atattggttc aagcttgtct ggtgactggt gcagttaact
4261 tttgttaacc tgtaaaagac attgagagcg ttgatttctg taaatgcccc ttaactgtta
4321 tattttccca tttgcaattg ggaagactta aaaaacactt gtgaatctgt cctagcagcc
4381 tttttccctg ctagctaaat gttaagggta tgcccttggg acagtccaag tgaataattg
4441 cacacacctg gcctaaagag caaacagcca caggtgctca gtgtaggggc tctgagttag
4501 aaacttgaag gagtccttcc cctctgatgt ttgtgtgcct tctcagagcc tgtgttgtcc
4561 tcgtccaatg tgggttgaga tgaggaaatg acaggtgagg tgggcccttg tttattgacc
4621 actggtgata catttgggga agaataaagt ctgctagtta gtctaatggc tagggaggct
4681 gtgtttatgt agctatgggc aatgacattt tttcttcttg cagaggatgt cacccccatc
4741 ccctctgaca gcacccgaag gaagggtggt cgtcgtggtc gccgtctgtg aacaggactt
4801 ctcaaattat tttctgttaa taaattgctt tgtataagct attttggttc tgatgtttgt
4861 ttgttttgag gcaggatctc tctccatgtc tatgtagaag agtctggcct ccaactcaga
4921 tccacatacc tgtctctgcc tcccaagtac tggggataaa ggaatgcacc accacacccc
4981 ggtgacctta ggggccaccc agggcaacac caaggcagtg ttttgagggg atgttaagat
5041 tgcatgtagg agcttctaat gtaggttggg ggttggcaaa cctgggtttg caggctggct
5101 ttctgagaat gggaacctga gagaggagat gctgaggctg ccactctggc attaatttgc
5161 cagccagtat tttcaaagtg gaactgttag tagcccatgc tgtcatggga cagctaaaca
5221 ccacacacac acacacccca agcagggtag tagagatcac aagttggaag ccagtttaat
5281 atggtctact tggggaattc
//
LOCUS AFATFDCD 7956 bp ds-DNA BCT 27-JUL-1990
DEFINITION A.eutrophus plasmid pJP4 chlorocatechol oxidative (tfdCDEF) operon,
complete cds; and 2,4-dichlorophenol hydroxylase (tfdB) gene,
complete cds.
ACCESSION M35097 X07754
KEYWORDS 2,4-dichlorophenol hydroxylase; chlorocatechol 1,2-dioxygenase;
chloromuconate cycloisomerase; tdfF gene; tfdE gene.
SOURCE A.eutrophus (strain JMP134) plasmid pJP4 DNA.
ORGANISM Plasmid pJP4
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1598; enum. 1 to 1598)
AUTHORS Perkins,E.J.
JOURNAL Unpublished (1988) Dept of Biochem,Univ of Wash.,Seattle,WA 98195
STANDARD simple automatic
REFERENCE 2 (bases 1 to 339)
AUTHORS Perkins,E.J., Bolton,G.W., Gordon,M.P. and Lurquin,P.F.
TITLE Partial nucleotide sequence of the chlorocatechol degradative
operon tfdCDEF of pJP4 and similarity to promoters of the
chlorinated aromatic degradative operons tfdA and clcABD
JOURNAL Nucleic Acids Res. 16, 7200 (1988)
STANDARD simple automatic
REFERENCE 3 (bases 1 to 7956)
AUTHORS Perkins,E.J., Gordon,M.P., Caceres,O. and Lurquin,P.F.
TITLE Organization and sequence analysis of the 2,4-dichlorophenol
hydroxylase and dichlorocatechol oxidative operons of plasmid pJP4
JOURNAL J. Bacteriol. 172, 2351-2359 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 337 1104 chlorocatechol 1,2-dioxygenase (tfdC)
pept 1101 2213 chloromuconate cycloisomerase (tfdD)
pept 2288 2992 tfdE product
pept 2989 4053 tfdF product
pept 4398 6194 2,4-dichlorophenol hydroxylase (tfdB)
conflict 681 682 gg in [3]; cc in [1]
BASE COUNT 1716 a 2209 c 2403 g 1628 t
ORIGIN
1 aagctttgcc ccaatatcgc gttccaaggc ctggatctga cgagtaagcg gtggctgcga
61 tatgtgcatc ctctgtgccg ctgttccgaa acctcccgcc tccgcgaccg cgacgaagta
121 tttcaactgt cttatttcca tgttccgtcc cgttgcgctg cctgttacaa ggcgattgtg
181 agcaagcacc tgcggtctcg caactaggga aagacatacc gaaaccgtat ggtctaggcg
241 gagaaaaggt attggacggt atgggattgc cgtctctatt atgactgtgc ggggcaattg
301 tcgcccacgt gttcatgttt tcatgacgga ggcaaagtga acaaaagagt caaggatgtt
361 gtcgatgcga tcgtcgcggc ggtacagcgg gtgctcgacc agaaggaggt tactgaggcg
421 gaataccgaa ctgcggttca ttacctcatg caagtagccg aacagcgtga aaccgctctt
481 ctttgcgatg tttttttcaa tagcaccgtg gctgcgacga aggctcgtat tagcgaaggt
541 tcaacgcctg ctatcgaggg accctactat cgcgacgacg cccctctggt cgatgaccgg
601 ctcaagactt acgacacgga cgaccacaag cctttgctta tccagggaac ggtcaaagcg
661 gtcgacggga gcgttgtcga ggacgtgacg attgatgtct ggcattcgac gcccgatggg
721 aagtacagcg gtttccatga cgacatcccg actgattttt atcgagggaa gctcagggtg
781 ggcaccgatg gcagcttccg cgtgcgcaca acgatgccgg tgccgtatca gatcccggat
841 cagggtccca cgggcgcatt gctcgaaacc atgggtggtc actcgtggcg tcccgctcat
901 gtacatttca aggtgaaggc gccgggatat gaaacgttga ccacgcagta ctacttcgaa
961 ggtggagatt ggatcacgga cgactgttgc aacggcgtcc agtctagtct gattactccc
1021 gatattgtgg aagagggtgt ccgcctgatg aacatcaact tcgttatcga gcccgcgcgt
1081 gcgcaggcgg gggcaaaccc gtgaagattg acgcgattga agcagtgatc gtggatgtgc
1141 cgaccaagcg gccgatccag atgtcgatca ctaccgtgca ccagcagagc tacgttatcg
1201 tccgggtgta ttcggagggg ctcgttggtg tcggcgaggg tggaagcgtt ggtggtcccg
1261 tctggagcgc agagtgtgcg gagacgatca agatcatcgt ggaacggtat ctcgcgcccc
1321 acctcctcgg aactgatgcg ttcaacgttt caggtgcact gcaaaccatg gcgcgtgccg
1381 tcaccggaaa cgcctctgca aaggctgcgg tcgagatggc gttactggat ctcaaagctc
1441 gagcgttagg cgtatcgatc gccgagttac ttggcgggcc gttgcgcagt gcgattccga
1501 ttgcctggac attggcgagc ggagatacga aacgcgatct cgattctgcc gtcgagatga
1561 ttgaaagacg acgacacaat cgcttcaaag tcaagcttgg cttccggtcg ccccaagacg
1621 atctcatcca tatggaggct ttgtcaaata gtctcggatc gaaggcctac cttcgcgttg
1681 acgtgaatca ggcttgggac gagcaagtgg cgtccgtcta cattcctgaa ctggaggcgc
1741 ttggcgtgga actcatcgaa cagccggtcg gccgcgaaaa tacgcaagcg ttgaggcggc
1801 tctccgacaa caaccgcgtg gccatcatgg ccgatgagag cctgagtacg ttggcctcgg
1861 cattcgatct cgcgcgcgac cgaagtgtgg atgtcttttc gctgaagctt tgcaacatgg
1921 gaggggtctc ggcgacgcaa aagatagcag cggtcgcgga agcaagcggg attgcatcgt
1981 atggcggaac aatgcttgac tcgacgatcg gcacatcggt tgcacttcag ctctattcta
2041 cggttccatc gcttccgttc ggttgcgaac tgatcggtcc cttcgtgttg gcagacacgc
2101 tgagccacga gccactcgag atccgggatt acgaactgca ggttcccact ggcgtaggtc
2161 acggcatgac gcttgacgag gacaaggtgc gccaatacgc acgcgtcagc taggcaaaac
2221 gcgcagcgca ggccgcggct gtggatcgcc tcgaatgcaa ctgcgcatgt gtaactggag
2281 actaaatatg ttatcagacg gcgttgagat cacgtcgcgc tcgggtggtc gctttggtgc
2341 ctacctcgga aagccgacga cggactccgc acccatcgtc gtgatcgcgc aggagatttt
2401 cgggatcacg cctttcataa gagaaaccgt ggaatggctc gttggtgcgg gctttggttg
2461 cgtgtgtccg gatctgtact ggagacaggc gccgaatatc gagcttgatg caaacgtacc
2521 atcggaacgg gaacaggcgc ttgcgttgtt tcgcgacttt gacatggagg caggagtcaa
2581 tgacctttca tgcgccattg aatacgctcg tgcgctccct ttctcgaacg gtcgtgtcgc
2641 tgtagtcggg tactgcctgg gaggtgcgct ggcgtttgat gtggcggcgc gatcattggc
2701 cgattgctca atcggctatt acggcgtggg acttgaaaaa aaggtctcac tggtgccagc
2761 tattacacga ccggcgatgt tccacatggg taccaaagat cactatgtca cagaagaggc
2821 ccgtagcatt ctcgaagagc atttcggtcg aaacaaaaat ctgagtctgc actggtatcc
2881 agtggggcat tcatttgcac ggtcatccag cccaaatttc gatcaggcgg cgacaactgt
2941 ggccaatgct cggacgctcg aactgcttgc aatgttgaag gacccttcat gaagaagttc
3001 acgcttgact acctgagccc gagggtcgtc ttcggggcgg gcactgcttc tgcattgcca
3061 gatgaaatag gacgccttgg cgcacgccgg cccttggtat taagcagccc ggaacaacgc
3121 gagttagcga aggatatcgt ccgtccgata ggtgacaggg tagctggata tttcgatggc
3181 gcgacgatgc atgttcccgt cgacgtcatc cagaaagccg agcgggcttt taacgatact
3241 gacgccgact caatcatcgc gatcggggga ggatcgacca ccggactcgc aaaaatcctt
3301 tcgatgaacc ttgacgtccc aagtctggtt ataccaacga cctatgccgg tagtgaaatg
3361 actaccattt ggggtgtcac ggaaggcgga atgaagagga ccggccgcga ccccaaggtg
3421 ctaccgaaga ccgtgattta tgatccattg ctcacggtcg atttgccgct tgctatctcg
3481 gtgacgagcg ccttgaatgc gatcgctcac gccgcagaag gtctgtactc ggccgacctc
3541 aatcccgttc tcgagaccat gtgtaagcag ggcatatgcg ccttgttcga tgcaatcccg
3601 cgcctggtgg caaagccgac tgacgccgaa gcgcgtacgg atgccctttt tggggcatgg
3661 atgtgtggca ctgcactgtg ccacttgggc atggggctac atcacaaact ctgccacacg
3721 cttgggggaa cccttaatct tccccacgcg gagacacatg caatcgtact accacacgca
3781 ctggcataca atctgccgta cgccgcgcca gctgagcgac tgcttcagga agtcgccggc
3841 agtagtgacg tcccgagcgc gctatatgat ctcgccagaa atgctggagc accactcagt
3901 ctcgccgaaa tcggtatgcg gcctgaagat attccgaggg tacgcgacct cgcgctaagg
3961 gaccaatatc cgaatccgcg tccgctggaa tcggacgcat tggaaacatt gttagtcaat
4021 gcgtttcgtg ggcgaagacc ggatttcaaa taatgtgacc tgcactccgc gtttagtacg
4081 gtagcggtga agagcccgtt ccaaagtcga catcgggtct tcaagctaaa gcgaagcaca
4141 tgaatttgca tccactgttc cttgtggagc atcgccagcc ttcgcgcgaa aaaggccgac
4201 tgtactggag gtggctcgga ttttctttgc gcgaagtggc tcatttttac tttgcgcgca
4261 acaaaggctc ttcagcctta ctattcatcg cgcgttaaag acggtgaagt ttaaaaaatc
4321 gacttggccg aacgcgagat gaacaatccc tgtttcttct cacaaagaga gagacataac
4381 aattgacgga ggaagacatg gcattgacga tcgaaaccga cgtgctggtc gttggcaccg
4441 ggccggcggg cgcttcggcg ggggcactgc ttgcgcgata cggtgtcagg acgatgctca
4501 ttaacaagta caactggacg gctccgacgc cgcgtgccca catcaccaac caacgcacaa
4561 tggagatcct gcgtgatctc ggtctcgaag ctgaagcccg cctctacgcc gcgccgaatg
4621 acctaatggg cgagaacacc atctgcgcct cgttggccgg cgaggagttc ggtcggattc
4681 gcacctgggg gacggatgtt cgccggcgcg ccgattacga cgaatgttcg ccgacctcca
4741 tgtgcgacct gccacagaac tatctcgagc cgattctcgt caagtcggcc gcactcgacg
4801 gctgcaaggt gcgcttcgac acagagtatc ttggacacga acaggatgct gacggcgtgt
4861 cgtcgcgctt gcgcgatcgt ctgaatggcg aagagtttac cgtccggtcg aaatatctga
4921 tcggcgccga cggcgccaac agccgcgtcg tgtcagacct ggacctgccg ctcgaaggta
4981 ccatgggcaa gtccggttcc atcaacttgc tgttcgaggc tgatctcgac cgctacgtcg
5041 cgcatcgtcc atcggtcctt tactgggtca tccagcccgg ctccgatatc ggcggtctcg
5101 gcatcggcgt ggtgcgcatg gtccggccgt ggaacaaatg gcttgcgatc tgggggtacg
5161 acgtcgagca gggcccgccg gagatctcgg aatccttcgc ccgccgcatc gtccacaatc
5221 tgatcggcga tgacagcgtg ccgctgaaaa ttgagggcat ctcgacctgg acggtcaacg
5281 acatgtacgc gacccggttg cagcaaggac gggtcttctg cgccggcgac gcggtccacc
5341 ggcatccgcc gaccaacgga ctcggctcaa acacctccat ccaggacagc ttcaatctcg
5401 cctggaagat cgccatggtg ctgaacggca cggccgacga gagcctgctg gacacctaca
5461 cgatcgagcg ggcgccgatc gcgaagcagg tggtctgcag ggcgaacaag tcgcttgagg
5521 atttcccgcc gatcgcgatg gctctggggt tgccgcaggc caagagcgcc gacgagatga
5581 agagcaacat ggctcgccgc aaggaacctg ggcccgaggc gcaagcgcag cgcacgcggt
5641 tacgcgaggc catcgccggg accaattacg tttacaacgc ccacggcgtc gaaatgaatc
5701 agcgttacga cagtcccgcg atcgtggcag acaattcgcc ggacgaagtc ttccgcgacg
5761 tcgagctcta tcaccaggcg tcgacgcggc ctggagcgcc catgccgcat gtctgggtct
5821 atgcgagtgg tgatggacac cgtatctcaa caaaggatct ctgcgggaag ggcaacttca
5881 ctctctttac cgggattggc ggcgcagcct ggcaggatgc cgcggcggcc gtgagccggc
5941 aactcggggt ggcagtgacg gtgcgcatca ttgggcctgg ccaggcctac gaggatcatt
6001 acggcgactt cgcgagaatt tccgagatca tcgacaccgg cgccatcctg gtgcggccgg
6061 atttccacgt cgcctatcgg gcgacgtcct tgccggccga tgcggcaggc gacctcgtgt
6121 ccgccatgag gcggatcctc ggacgccagt cagagaggtc gagcgcgctg cgcgttacgt
6181 ctcgagctat ataaggagtg gcgtcgcgat ctattcgagg tgagacggaa gcgctcacct
6241 cgggttgcgg gaggggaagc gtgatcttgc gcagcgtgag atggcccgtg gcgacgcggg
6301 aaggttgtcg cgcgattaag gcaaggcggt gcgcaacgac gtgacgtgtt ttctggaccg
6361 gacgtaactg cagatcatct aatttgggaa gttcgtaccg gtgtcgaccg ccttgctgat
6421 gatgctatta agcacgtgag cagacgacgg gccggtgcag ccaacgttga aaaaagagcg
6481 cttgattgca atgtcgacga cacgatctac attactgtca tttgaacgac cgcttgctcg
6541 tgcggattca accggtggat gcaacagttt gctgaaatcg ttcggcgggc gttccgtggt
6601 tcagggtttt cctcggtcgc ccatccagcc gcctcgccac ggtattgagt agccaattgg
6661 tacaccgaaa tgtaggtgcc gttggggatg tctcgatagc ccgttcgcat tttcattcga
6721 cccgcgctgc caaggatgct gggggttcag taactgcagg ctgaactgcg ccaggtgaac
6781 caaacgctga tcgtcaagca aaatgaaata acccagttga acagagaaag cgcgcggctc
6841 gtcgccgaag ccggcgtcgc gacaaggcat ctacgcgagc tgcaagcgca cagcaagtag
6901 ttacagaata cgttgaacca gtcgctcgcc gaccacgcac gcaccaaggc cgagcacgat
6961 gcgctgctaa caccgcgagg actgaggcga ctgaactgac gcggctgcgc gaagcacgtg
7021 aatcgctgac cgccgatctg acgaagctca cagcgcagtt cgacgagcag cagttgctcg
7081 gggactatcg gatgcgtctg ggcctcacgg acgcggcggg atgaaagcgc cagattccgc
7141 atcaaaagga tggggatcta cgccggaacc ccagaaaatt ccgtcgttcc acattgtgaa
7201 cattaatcgg cttattccag ccgaagaatg ttggagcgac gtccggtgag tgacatgacg
7261 gtctcggtca cacggaccat caggaccgtt acgcggggcc cagttcgatt acattgtggg
7321 acaggtcgag acaatcttcg cagtcccggc aaacccagtg cagcatcggc ccgaccggat
7381 ctggatggtc gctgaacgtt gtcagcgccg ggtcgtacta gcatcggacg tgctcgagcg
7441 cagtatccgc tcgagattag actcgatgcg gtccaggctg cattcaccga tgccggactt
7501 gaacaccgtc acaccgcctg acgggttgac cgcgaagaag aaaaacgcct tcgtgtcgac
7561 catgtgcgac aggacttgtc ctggcgatga agttgctcgg attgatccag cgcattgaag
7621 atgcggtagc cgtaaaactc taagccgaag tgtacgacct cgaagtcgtc ggtgccgcgg
7681 atcgagtggc caaagttgaa accgctgcgt gttacgtcgt tcggaatgtc atccaccggc
7741 cgcagtgcca ggcaagcata gtgcgtctgg ccctcggcca cgagtcccgt cgcaaagcag
7801 ggcgcgcgct cggccagcaa ggcctcgaca aaattccggt tcatgcggag cgttggcagc
7861 atcgggcatc cttcatcgtg aatcaaatca acatttgccg acctggctgt gcggtgcgtc
7921 gacgtcgaac atcaaaggct atacatcgtc gagctc
//
LOCUS ECOCLPAA 3380 bp ds-DNA BCT 27-JUL-1990
DEFINITION E.coli ATP-dependent C1p protease (clpA) gene, complete cds.
ACCESSION M31045
KEYWORDS C1p protease; clpA gene.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 3380)
AUTHORS Gottesman,S., Clark,W.P. and Maurizi,M.R.
TITLE The ATP-dependent Clp protease of Escherichia coli: Sequence of
ClpA and identification of a Clp-specific substrate
JOURNAL J. Biol. Chem. 265, 7886-7893 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Gottesman, 03-JAN-1990.
FEATURES from to/span description
pept 1000 3276 ClpA protein
signal 779 784 -35 region
signal 805 810 -10 region
binding 990 994 ribosome binding site
binding 214 221 ATP binding site
binding 495 502 ATP binding site
BASE COUNT 877 a 759 c 943 g 801 t
ORIGIN 1 bp upstream of BamHI site; 19 min. on K12 map.
1 ggatccagta gatgctctgg cacgagagta atttttgaga taaaaatcga gccgatcggg
61 cggcattttg cctttaggat gtacacaatg agacagaaga gctatgcgac tgccgcttct
121 acttcgacgg gcacaataac actggcgtga ttgccttttg gcccctggtg gacatcaaac
181 tgaacggatt gtccagcttt tagcgttctg taaccatcca tctgaatggt ggaataatga
241 gcgaaaatat cttcgccgcc gccttcaggg cagatgaaac caaacccttt ggcattgttg
301 aaccacttaa cagtaccctt ttccatgctt cgacatcctt cgcaaatctt atacaagtaa
361 gatggaataa accggggtca gagagggggc tgttcaaaac ctcgccaact ctagaaatac
421 aatttagaga attagggcga gccgtcaagc atttgacagg ggacaagggg caggtatgaa
481 tcaaaaattt gaagcagtta acgctattga caggaatgtg acagatgtcg ctgatgccaa
541 cgatagatga tagttatcta tcatgtggag tagattggtc aggcaaataa gctcttgtca
601 gcggcagggc gttctgccga taaccgtaac cgaagatgat aactgacaat gggtaaaacg
661 aacgactggc tggactttga tcaactggcg gaagaaaaag ttcgcgacgc gctaaaaccg
721 ccatctatgt ataaagtgat attagtcaat gatgattaca ctccgatgga gtttgttatt
781 gacgtgttac aaaaattctt ttcttatgat gtagaacgtg caacgcaatt gatgctcgct
841 gttcactacc aggggaaggc catttgcgga gtctttaccg ccgaggttgc agaaaccaaa
901 gtggcgatgg tgaacaagta cgcgagggag aatgagcatc cattgctgtg tacgctagaa
961 aaagcctgaa tgcaggcata aaaattgggg gaggtgccta tgctcaatca agaactggaa
1021 ctcagtttaa atatggcttt cgccagagcg cgcgagcacc gtcatgagtt tatgaccgtc
1081 gagcacttgt tactggcgct gctcagtaac ccatctgccc gggaggcgct ggaagcgtgt
1141 tctgtggatt tggttgcgct ccgtcaggaa ctggaagcct ttattgaaca aaccacaccc
1201 gttctgcctg ccagtgaaga ggagcgcgac acacagccga cgctgagttt tcagcgtgta
1261 ctgcaacgtg cggtcttcca tgtccagtcc tccggtcgca atgaggttac cggtgcaaac
1321 gttctggtcg ctatctttag cgaacaggag tcgcaggcgg catatctgtt gcgtaaacac
1381 gaagtcagcc gtctcgatgt ggtgaacttt atctctcatg gcacgcgtaa agacgagccg
1441 acacagtctt ctgatcctgg cagccagcca aacagcgaag aacaagctgg tggggaggaa
1501 cgtatggaga atttcacgac gaacctgaat cagcttgcgc gcgtgggcgg aatcgaccca
1561 ctgattggtc gtgagaagga gctggagcgt gctattcagg ttctctgccg tcgccgtaaa
1621 aacaacccgc tgctggtggg ggaatctggt gtcggtaaaa ccgcgattgc ggaaggtctt
1681 gcctggcgaa ttgttcaggg cgatgtgccg gaagtgatgg ctgactgtac gatttactct
1741 ctcgatatcg gttctctgtt agcgggcaca aaatatcgcg gcgactttga aaaacgtttt
1801 aaagcgttgc tcaagcagct ggagcaggac actaacagca tcctgtttat tgatgagatc
1861 cacaccatta tcggtgcggg tgcagcgtct ggtggtcagg tcgatgcggc taacctaatc
1921 aaaccgttgc tctccagcgg taaaattcgt gtaattggtt cgacaaccta tcaggagttc
1981 agcaacattt tcgagaaaga ccgtgctctg gcgcgtcgct tccagaaaat tgatattact
2041 gaaccgtcga tcgaagaaac tgttcaaatc atcaatggcc tgaaaccgaa gtatgaaggc
2101 caccacgacg tgcgttatac cgcaaaagcg gtgcgtgcgg cggtagagct ggcggtgaaa
2161 tacattaacg atcgtcatct gccggataaa gccattgatg ttatcgacga agcgggcgct
2221 cgcgcacgcg tgatgccggt aagcaaacgc aagaaaaccg ttaatgtggc ggatattgag
2281 tccgtggtgg cccgtattgc acgcattcca gagaagagtg tttctcagag tgatcgtgat
2341 accctgaaaa acctcggcga tcgcttgaaa atgctggtct tcggtcagga taaagccatt
2401 gaggcgctga ctgaagccat taagatggcg cgtgcaggtt taggtcacga acataaaccg
2461 gttggttcgt tcctgtttgc cggccctacc ggggtcggga aaacagaggt gacggtacag
2521 ctttcgaaag ctttgggcat tgagcttctg cgctttgata tgtccgagta tatggaacgc
2581 cataccgtca gccgtgttat tggtgcgcct ccgggatacg ttggttttga tcagggcggt
2641 ttgctgactg atgcggtcat caagcatcca catgcggtgc tgctgctgga cgaaatcgag
2701 aaagcgcacc cggacgtgtt caatattctg ttgcaggtga tggataacgg tacgctgacc
2761 gataacaacg gacgcaaagc agacttccgt aacgtggtgc tggtgatgac caccaacgcc
2821 ggggtacggg aaactgagcg caaatccatt ggtcttatcc accaggataa cagcaccgat
2881 gcgatggaag agatcaagaa gatctttaca ccggaattcc gtaaccgtct cgacaacatt
2941 atctggtttg atcatctgtc aaccgacgtg atccatcagg tggtggataa attcatcgtc
3001 gagttgcagg ttcagctgga tcagaaaggt gtttctctgg aagtgagcca ggaagcgcgt
3061 aactggctgg ccgagaaagg ttacgaccgg gcaatgggcg ctcgtccgat ggcgcgtgtc
3121 atccaggaca acctgaaaaa accgctcgcc aacgaactgc tgtttggttc gctggtggac
3181 ggcggtcagg tcaccgtcgc gctggataaa gagaaaaatg agctgactta cggattccag
3241 agtgcacaaa agcacaaggc ggaagcagcg cattaatctg attgttaggt aggttggtca
3301 agtccgtaat ctcgaaagag gttacggact ttttgtttat ggggtggagg agttcagacc
3361 ctttttttat gatgatggta
//
LOCUS HUMCSAE 1372 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human epithelial glycoprotein (EGP) mRNA, complete cds.
ACCESSION M32306
KEYWORDS cell surface antigen; cell surface glycoprotein;
epithelial glycoprotein; transmembrane protein;
tumor-associated antigen.
SOURCE Human colon carcinoma cell line HT-29, cDNA to mRNA, (library
HT-29/cdm8).
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1372)
AUTHORS Simon,B., Podolsky,D.K., Moldenhauer,G., Isselbacher,K.J.,
Gattoni-Celli,S. and Brand,S.J.
TITLE Epithelial glycoprotein is a member of a family of epithelial cell
surface antigens homologous to nidogen, a matrix adhesion protein
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2755-2759 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
B.Simon, 23-FEB-1990.
FEATURES from to/span description
pept 22 966 epithelial glycoprotein (EGP) precursor
sigp 22 84 epithelial glycoprotein signal peptide
matp 85 963 epithelial glycoprotein
mRNA < 1 1372 EGP mRNA
BASE COUNT 436 a 233 c 317 g 386 t
ORIGIN
1 gctctaaagg cgcgcgcagg catggcgccc ccgcaggtcc tcgcgttcgg gcttctgctt
61 gccgcggcga cggcgacttt tgccgcagct caggaagaat gtgtctgtga aaactacaag
121 ctggccgtaa actgctttgt gaataataat cgtcaatgcc agtgtacttc agttggtgca
181 caaaatactg tcatttgctc aaagctggct gccaaatgtt tggtgatgaa ggcagaaatg
241 aatggctcaa aacttgggag aagagcaaaa cctgaagggg ccctccagaa caatgatggg
301 ctttatgatc ctgactgcga tgagagcggg ctctttaagg ccaagcagtg caacggcacc
361 tccatgtgct ggtgtgtgaa cactgctggg gtcagaagaa cagacaagga cactgaaata
421 acctgctctg agcgagtgag aacctactgg atcatcattg aactaaaaca caaagcaaga
481 gaaaaacctt atgatagtaa aagtttgcgg actgcacttc agaaggagat cacaacgcgt
541 tatcaactgg atccaaaatt tatcacgagt attttgtatg agaataatgt tatcactatt
601 gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc tgatgtggct
661 tattattttg aaaaagatgt taaaggtgaa tccttgtttc attctaagaa aatggacctg
721 acagtaaatg gggaacaact ggatctggat cctggtcaaa ctttaattta ttatgttgat
781 gaaaaagcac ctgaattctc aatgcagggt ctaaaagctg gtgttattgc tgttattgtg
841 gttgtggtga tagcagttgt tgctggaatt gttgtgctgg ttatttccag aaagaagaga
901 atggcaaagt atgagaaggc tgagataaag gagatgggtg agatgcatag ggaactcaat
961 gcataactat ataatttgaa gattatagaa gaagggaaat agcaaatgga cacaaattac
1021 aaatgtgtgt gcgtgggacg aagacatctt tgaaggtcat gagtttgtta gtttaacatc
1081 atatatttgt aatagtaaag cctgtactca aaatataagc agcttgaaac tggctttacc
1141 aatcttgaaa tttgaccaca agtgtcttat atatgcagat ctaatgtaaa atccagaact
1201 tggactccat cgttaaaatt atttatgtgt aacattcaaa tgtgtgcatt aaatatgctt
1261 ccacagtaaa atctgaaaaa ctgatttgtg attgaaagct gcctttctat ttacttgagt
1321 cttgtacata catacttttt tatgagctat gaaataaaac cattttaaac tg
//
LOCUS HUMMHCAD 1093 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human MHC HLA protein, allele B7, complete cds.
ACCESSION M32317
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human blood B-cell cell line JY cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1093)
AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain
reaction: Frequency and nature of errors produced in amplification
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
STANDARD full staff_entry
COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren,
22-FEB-1990.
FEATURES from to/span description
pept 1 1089 HLA protein allele B7
sigp 1 72 HLA protein signal peptide
matp 73 1086 HLA protein mature peptide
BASE COUNT 221 a 335 c 364 g 173 t
ORIGIN
1 atgctggtca tggcgccccg aaccgtcctc ctgctgctct cggcggccct ggccctgacc
61 gagacctggg ccggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc
121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc
181 gacagcgacg ccgcgagtcc gagagaggag ccgcgggcgc cgtggataga gcaggagggg
241 ccggagtatt gggaccggaa cacacagatc tacaaggccc aggcacagac tgaccgagag
301 agcctgcgga acctgcgcgg ctactacaac cagagcgagg ccgggtctca caccctccag
361 agcatgtacg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtac
421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcctg gaccgccgcg
481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagcgg
541 agagcctacc tggagggcga gtgcgtggag tggctccgca gatacctgga gaacgggaag
601 gacaagctgg agcgcgctga ccccccaaag acacacgtga cccaccaccc catctctgac
661 catgaggcca ccctgaggtg ctgggccctg ggtttctacc ctgcggagat cacactgacc
721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca
781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagccg
901 tcttcccagt ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagttc aggtggaaaa
1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc
1081 acagcttgaa aag
//
LOCUS HUMMHCAE 1089 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human MHC HLA protein, allele B57, complete cds.
ACCESSION M32318
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human blood B-cell cell line WIN and cell line MOC cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1089)
AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain
reaction: Frequency and nature of errors produced in amplification
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
STANDARD full staff_entry
COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren,
22-FEB-1990.
FEATURES from to/span description
pept 1 1089 HLA protein allele B57
sigp 1 72 HLA protein signal peptide
matp 73 1086 HLA protein mature peptide
BASE COUNT 217 a 331 c 366 g 175 t
ORIGIN
1 atgcgggtca cggcaccccg aaccgtcctc ctgctgctct ggggggcagt ggccctgacc
61 gagacctggg ccggctccca ctccatgagg tatttctaca ccgccatgtc ccggcccggc
121 cgcggggagc cccgcttcat cgcagtgggc tacgtggacg acacccagtt cgtgaggttc
181 gacagcgacg ccgcgagtcc gaggatggcg ccccgggcgc catggataga gcaggagggg
241 ccggagtatt gggacgggga gacacggaac atgaaggcct ccgcgcagac ttaccgagag
301 aacctgcgga tcgcgctccg ctactacaac cagagcgagg ccgggtctca catcatccag
361 gtgatgtatg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtcc
421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg
481 gacacggcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgtggc ggagcagctg
541 agagcctacc tggagggcct gtgcgtggag tggctccgca gatacctgga gaacgggaag
601 gagacgctgc agcgcgcgga ccccccaaag acacatgtga cccaccaccc catctctgac
661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc
721 tggcagcggg atggcgagga ccaaactcag gacaccgagc ttgtggagac cagaccagca
781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
841 tacacatgcc atgtacagca tgaggggctg ccaaagcccc tcaccctgag atgggagcca
901 tcttcccaat ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagctc aggtggaaaa
1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc
1081 acagcttga
//
LOCUS HUMMHCAF 1093 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human MHC HLA protein, allele B51, complete cds.
ACCESSION M32319
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human blood B-cell cell line BM92 cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1093)
AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain
reaction: Frequency and nature of errors produced in amplification
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
STANDARD full staff_entry
COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren,
22-FEB-1990.
FEATURES from to/span description
pept 1 1089 HLA protein allele B51
sigp 1 72 HLA protein signal peptide
matp 73 1086 HLA protein mature peptide
BASE COUNT 228 a 334 c 358 g 173 t
ORIGIN
1 atgcgggtca cggcgccccg aaccgtcctc ctgctgctct ggggggcagt ggccctgacc
61 gagacctggg ccggctccca ctccatgagg tatttctaca ccgccatgtc ccggcccggc
121 cgcggggagc cccgcttcat tgcagtgggc tacgtggacg acacccagtt cgtgaggttc
181 gacagcgacg ccgcgagtcc gaggacggag ccccgggcgc catggataga gcaggagggg
241 ccggagtatt gggaccggaa cacacagatc ttcaagacca acacacagac ttaccgagag
301 aacctgcgga tcgcgctccg ctactacaac cagagcgagg ccgggtctca cacttggcag
361 acgatgtatg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca taaccagtac
421 gcctacgacg gcaaagatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg
481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagctg
541 agagcctacc tggagggcct gtgcgtggag tggctccgca gacacctgga gaacgggaag
601 gagacgctgc agcgcgcgga ccccccaaag acacacgtga cccaccaccc cgtctctgac
661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc
721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca
781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagcca
901 tcttcccagt ccaccatccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
961 gtggtcatcg gagctgtggt cgctactgtg atgtgtagga ggaagagctc aggtggaaaa
1021 ggagggagct actctcaggc tgcgtccagc gacagtgccc agggctctga tgtgtctctc
1081 acagcttgaa aag
//
LOCUS HUMMHCAG 1093 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human MHC HLA protein, allele B37, complete cds.
ACCESSION M32320
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human blood B-cell cell line KASO and cell line MG cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1093)
AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain
reaction: Frequency and nature of errors produced in amplification
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
STANDARD full staff_entry
COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren,
22-FEB-1990.
FEATURES from to/span description
pept 1 1089 HLA protein allele B37
sigp 1 72 HLA protein signal peptide
matp 73 1086 HLA protein mature peptide
BASE COUNT 224 a 337 c 358 g 174 t
ORIGIN
1 atgcgggtca cggcgccccg aaccctcctc ctgctgctct ggggggcagt ggccctgacc
61 gagacctggg ctggctccca ctccatgagg tatttccaca cctccgtgtc ccggcccggc
121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc
181 gacagcgacg ccgcgagtcc gaggacggag ccccgggcgc cgtggataga gcaggagggg
241 ccggagtatt gggaccggga gacacagatc tccaagacca acacacagac ttaccgagag
301 gacctgcgga ccctgctccg ctactacaac cagagcgagg ccgggtctca caccatccag
361 aggatgtctg gctgcgacgt ggggccggac gggcgcctcc tccgcgggta taaccagttc
421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg
481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgtggc ggagcaggac
541 agagcctacc tggagggcac gtgcgtggag tggctccgca gatacctgga gaacgggaag
601 gagacgctgc agcgcgcgga ccccccaaag acacatgtga cccaccaccc catctctgac
661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc
721 tggcagcggg atggcgagga ccaaactcag gacaccgagc ttgtggagac cagaccagca
781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagcca
901 tcttcccagt ccaccatccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
961 gtggtcatcg gagctgtggt cgctactgtg atgtgtagga ggaagagctc aggtggaaaa
1021 ggagggagct actctcaggc tgcgtccagc gacagtgccc agggctctga tgtgtctctc
1081 acagcttgaa aag
//
LOCUS HUMMHCAH 1098 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human MHC HLA protein, allele A25, complete cds.
ACCESSION M32321
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human blood B-cell cell line BM92 cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1098)
AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain
reaction: Frequency and nature of errors produced in amplification
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
STANDARD full staff_entry
COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren,
22-FEB-1990.
FEATURES from to/span description
pept 1 1098 HLA protein allele A25
sigp 1 72 HLA protein signal peptide
matp 73 1095 HLA protein mature peptide
BASE COUNT 216 a 329 c 372 g 181 t
ORIGIN
1 atggccgtca tggcgccccg aaccctcgtc ctgctactct cgggggccct ggccctgacc
61 cagacctggg cgggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc
121 cgcggggagc cccgcttcat cgccgtgggc tacgtggacg acacgcagtt cgtgcggttc
181 gacagcgacg ccgcgagcca gaggatggag ccgcgggcgc cgtggataga gcaggagggg
241 ccggagtatt gggaccggaa cacacggaat gtgaaggccc actcacagac tgaccgagag
301 agcctgcgga tcgcgctccg ctactacaac cagagcgagg acggttctca caccatccag
361 aggatgtatg gctgcgacgt ggggccggac gggcgcttcc tccgcgggta ccagcaggac
421 gcttacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcttg gaccgcggcg
481 gacatggcgg ctcagatcac ccagcgcaag tgggagacgg cccatgaggc ggagcagtgg
541 agagcctacc tggagggccg gtgcgtggag tggctccgca gatacctgga gaacgggaag
601 gagacgctgc agcgcacgga cgcccccaag acgcatatga ctcaccacgc tgtctctgac
661 catgaggcca ccctgaggtg ctgggccctg agcttctacc ctgcggagat cacactgacc
721 tggcagcggg atggggagga ccagacccag gacacggagc tcgtggagac caggcctgca
781 ggggatggga ccttccagaa gtgggcgtct gtggtggtgc cttctggaca ggagcagaga
841 tacacctgcc atgtgcagca tgagggtctg cccaagcccc tcaccctgag atgggagccg
901 tcttcccagc ccaccatccc catcgtgggc atcattgctg gcctggttct ctttggagct
961 gtgatcgctg gagctgtggt cgctgctgtg atgtggagga ggaagagctc agatagaaaa
1021 ggagggagct actctcaggc tgcaagcagt gacagtgccc agggctctga tatgtctctc
1081 acagcttgta aagtgtga
//
LOCUS HUMMHCAI 1098 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human MHC HLA protein, allele A2, complete cds.
ACCESSION M32322
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human blood B-cell cell line JY cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1098)
AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain
reaction: Frequency and nature of errors produced in amplification
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
STANDARD full staff_entry
COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren,
22-FEB-1990.
FEATURES from to/span description
pept 1 1098 HLA protein allele A2
sigp 1 72 HLA protein signal peptide
matp 73 1095 HLA protein mature peptide
BASE COUNT 220 a 323 c 368 g 187 t
ORIGIN
1 atggccgtca tggcgccccg aaccctcgtc ctgctactct cgggggctct ggccctgacc
61 cagacctggg cgggctctca ctccatgagg tatttcttca catccgtgtc ccggcccggc
121 cgcggggagc cccgcttcat cgcagtgggc tacgtggacg acacgcagtt cgtgcggttc
181 gacagcgacg ccgcgagcca gaggatggag ccgcgggcgc cgtggataga gcaggagggt
241 ccggagtatt gggacgggga gacacggaaa gtgaaggccc actcacagac tcaccgagtg
301 gacctgggga ccctgcgcgg ctactacaac cagagcgagg ccggttctca caccgtccag
361 aggatgtatg gctgcgacgt ggggtcggac tggcgcttcc tccgcgggta ccaccagtac
421 gcctacgacg gcaaggatta catcgccctg aaagaggacc tgcgctcttg gaccgcggcg
481 gacatggcag ctcagaccac caagcacaag tgggaggcgg cccatgtggc ggagcagttg
541 agagcctacc tggagggcac gtgcgtggag tggctccgca gatacctgga gaacgggaag
601 gagacgctgc agcgcacgga cgcccccaaa acgcatatga ctcaccacgc tgtctctgac
661 catgaagcca ccctgaggtg ctgggccctg agcttctacc ctgcggagat cacactgacc
721 tggcagcggg atggggagga ccagacccag gacacggagc tcgtggagac caggcctgca
781 ggggatggaa ccttccagaa gtgggcggct gtggtggtgc cttctggaca ggagcagaga
841 tacacctgcc atgtgcagca tgagggtttg cccaagcccc tcaccctgag atgggagccg
901 tcttcccagc ccaccatccc catcgtgggc atcattgctg gcctggttct ctttggagct
961 gtgatcactg gagctgtggt cgctgctgtg atgtggagga ggaagagctc agatagaaaa
1021 ggagggagct actctcaggc tgcaagcagt gacagtgccc agggctctga tgtgtctctc
1081 acagcttgta aagtgtga
//
LOCUS HUMNOXF 2206 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human neutrophil oxidase factor (p67-phox) mRNA, complete cds.
ACCESSION M32011
KEYWORDS neutrophil oxidase factor.
SOURCE Human promyelocytic leukemia myeloid cell line HL60, cDNA to mRNA,
clone 10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2206)
AUTHORS Leto,T.L., Lomax,K.J., Volpp,B.D., Nunoi,H., Sechler,J.M.G.,
Nauseef,W.M., Clark,R.A., Gallin,J.I. and Malech,H.L.
TITLE Cloning of a 67kD neutrophil oxidase factor with similarity to a
noncatalytic region of P60-c-src
JOURNAL Science 248, 727-730 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
H.L.Malech, 08-FEB-1990.
FEATURES from to/span description
pept 68 1648 neutrophil oxidase factor (p67-phox)
mRNA < 1 2206 p67-phox
signal 2030 2035 poly-A signal
signal 2181 2186 poly-A signal
BASE COUNT 624 a 499 c 573 g 510 t
ORIGIN
1 ctagtctttc agccttcagg ctgtttttgg cttgaagctc tcttggcctc ctagtttcta
61 cctaatcatg tccctggtgg aggccatcag cctctggaat gaaggggtgc tggcagcgga
121 caagaaggac tggaagggag ccctggatgc cttcagtgcc gtccaggacc cccactcccg
181 gatttgcttc aacattggct gcatgtacac tatcctgaag aacatgactg aagcagagaa
241 ggcctttacc agaagcatta accgagacaa gcacttggca gtggcttact tccaacgagg
301 gatgctctac taccagacag agaaatatga tttggctatc aaagacctta aagaagcctt
361 gattcagctt cgagggaacc agctgataga ctataagatc ctggggctcc agttcaagct
421 gtttgcctgt gaggtgttat ataacattgc tttcatgtat gccaagaagg aggaatggaa
481 aaaagctgaa gaacagttag cattggccac gagcatgaag tctgagccca gacattccaa
541 aatcgacaag gcgatggagt gtgtctggaa gcagaagcta tatgagccag tggtgatccc
601 tgtgggcaag ctgtttcgac caaatgagag acaagtggct cagctggcca agaaggatta
661 cctaggcaag gcgacggtcg tggcatctgt ggtggatcaa gacagtttct ctgggtttgc
721 ccctctgcaa ccacaggcag ctgagcctcc acccagaccg aaaaccccag agatcttcag
781 ggctctggaa ggggaggctc accgtgtgct atttgggttt gtgcctgaga caaaagaaga
841 gctccaggtc atgccaggga acattgtctt tgtcttgaag aagggcaatg ataactgggc
901 cacggtcatg ttcaacgggc agaaggggct tgttccctgc aactaccttg aaccagttga
961 gttgcggatc caccctcagc agcagcccca ggaggaaagc tctccgcagt ccgacatccc
1021 agctcctcct agttccaaag cccctggaaa accccagctg tcaccaggcc agaaacaaaa
1081 agaagagcct aaggaagtga agctcagtgt tcccatgccc tacacactca aggtgcacta
1141 caagtacacg gtagtcatga agactcagcc cgggctcccc tacagccagg tccgggacat
1201 ggtgtctaag aaactggagc tccggctgga acacactaag ctgagctatc ggcctcggga
1261 cagcaatgag ctggtgcccc tttcagaaga cagcatgaag gatgcctggg gccaggtgaa
1321 aaactactgc ctgactctgt ggtgtgagaa cacagtgggt gaccaaggct ttccagatga
1381 acccaaggaa agtgaaaaag ctgatgctaa taaccagaca acagaacctc agcttaagaa
1441 aggcagccaa gtggaggcac tcttcagtta tgaggctacc caaccagagg acctggagtt
1501 tcaggaaggg gatataatcc tggtgttatc aaaggtgaat gaagaatggc tggaagggga
1561 gtgcaaaggg aaggtgggca ttttccccaa agtttttgtt gaagactgcg caactacaga
1621 tttggaaagc actcggagag aagtctagga tgtttcacaa actacaaagc tgaagaaaat
1681 gaagccctat tacttgtttg taagatttag cacccttctg ctgtatactg tactgagaca
1741 ttacagtttg gaagtgttaa ctatttattc cctgttaaaa tttaacctac tagacaatga
1801 tgtgagtacc caggatgatt tcctggggca cagtgggtga ggagatgggg acaggtgaat
1861 ggaggagtta ggggagagga aaagtggatg gaagtgtctg gaaagggcac gagagagtct
1921 tccaggtact gatcctgttt cttgctctga gtgctagcta gccagctgtg ttcacactgt
1981 aaacattcat caagctgtac atttggtgca cttttctgtg tcataccaca ataaaaaaaa
2041 acctatcatc atcttacaaa aacaagacac ccaagtccag gcccaaggag taagtacaaa
2101 tattcctgtt tctgaaccat tactgtaatt ggctcttaag gcttgaagta accttatagg
2161 ttactcataa ggcatataca aataaacttg tttgttttct tttttc
//
LOCUS HUMPTPBX 3247 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human phosphotyrosyl-protein phosphatase (PTP-1B) mRNA, complete
cds.
ACCESSION M31724
KEYWORDS phosphotyrosyl-protein phosphatase.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3247)
AUTHORS Chernoff,J., Schievella,A.R., Jost,C.A., Erikson,R.L. and Neel,B.G.
TITLE Cloning of a cDNA for a major human protein-tyrosine-phosphatase
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2735-2739 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Chernoff,J., 13-MAR-1990.
FEATURES from to/span description
pept 91 1398 phosphotyrosyl-protein phosphatase (EC
3.1.3.48)
mRNA < 1 3247 phosphotyrosyl-protein phosphatase mRNA
BASE COUNT 836 a 834 c 806 g 771 t
ORIGIN
1 gggcgggcct cggggctaag agcgcgacgc ctagagcggc agacggcgca gtgggccgag
61 aaggaggcgc agcagccgcc ctggcccgtc atggagatgg aaaaggagtt cgagcagatc
121 gacaagtccg ggagctgggc ggccatttac caggatatcc gacatgaagc cagtgacttc
181 ccatgtagag tggccaagct tcctaagaac aaaaaccgaa ataggtacag agacgtcagt
241 ccctttgacc atagtcggat taaactacat caagaagata atgactatat caacgctagt
301 ttgataaaaa tggaagaagc ccaaaggagt tacattctta cccagggccc tttgcctaac
361 acatgcggtc acttttggga gatggtgtgg gagcagaaaa gcaggggtgt cgtcatgctc
421 aacagagtga tggagaaagg ttcgttaaaa tgcgcacaat actggccaca aaaagaagaa
481 aaagagatga tctttgaaga cacaaatttg aaattaacat tgatctctga agatatcaag
541 tcatattata cagtgcgaca gctagaattg gaaaacctta caacccaaga aactcgagag
601 atcttacatt tccactatac cacatggcct gactttggag tccctgaatc accagcctca
661 ttcttgaact ttcttttcaa agtccgagag tcagggtcac tcagcccgga gcacgggccc
721 gttgtggtgc actgcagtgc aggcatcggc aggtctggaa ccttctgtct ggctgatacc
781 tgcctcctgc tgatggacaa gaggaaagac ccttcttccg ttgatatcaa gaaagtgctg
841 ttagaaatga ggaagtttcg gatggggttg atccagacag ccgaccagct gcgcttctcc
901 tacctggctg tgatcgaagg tgccaaattc atcatggggg actcttccgt gcaggatcag
961 tggaaggagc tttcccacga ggacctggag cccccacccg agcatatccc cccacctccc
1021 cggccaccca aacgaatcct ggagccacac aatgggaaat gcagggagtt cttcccaaat
1081 caccagtggg tgaaggaaga gacccaggag gataaagact gccccatcaa ggaagaaaaa
1141 ggaagcccct taaatgccgc accctacggc atcgaaagca tgagtcaaga cactgaagtt
1201 agaagtcggg tcgtgggggg aagtcttcga ggtgcccagg ctgcctcccc agccaaaggg
1261 gagccgtcac tgcccgagaa ggacgaggac catgcactga gttactggaa gcccttcctg
1321 gtcaacatgt gcgtggctac ggtcctcacg gccggcgctt acctctgcta caggttcctg
1381 ttcaacagca acacatagcc tgaccctcct ccactccacc tccacccact gtccgcctct
1441 gcccgcagag cccacgcccg actagcaggc atgccgcggt aggtaagggc cgccggaccg
1501 cgtagagagc cgggccccgg acggacgttg gttctgcact aaaacccatc ttccccggat
1561 gtgtgtctca cccctcatcc ttttactttt tgccccttcc actttgagta ccaaatccac
1621 aagccatttt ttgaggagag tgaaagagag taccatgctg gcggcgcaga gggaaggggc
1681 ctacacccgt cttggggctc gccccaccca gggctccctc ctggagcatc ccaggcggcg
1741 cacgccaaca gcccccccct tgaatctgca gggagcaact ctccactcca tatttattta
1801 aacaattttt tccccaaagg catccatagt gcactagcat tttcttgaac caataatgta
1861 ttaaaatttt ttgatgtcag ccttgcatca agggctttat caaaaagtac aataataaat
1921 cctcaggtag tactgggaat ggaaggcttt gccatgggcc tgctgcgtca gaccagtact
1981 gggaaggagg acggttgtaa gcagttgtta tttagtgata ttgtgggtaa cgtgagaaga
2041 tagaacaatg ctataatata taatgaacac gtgggtattt aataagaaac atgatgtgag
2101 attactttgt cccgcttatt ctcctccctg ttatctgcta gatctagttc tcaatcactg
2161 ctcccccgtg tgtattagaa tgcatgtaag gtcttcttgt gtcctgatga aaaatatgtg
2221 cttgaaatga gaaactttga tctctgctta ctaatgtgcc ccatgtccaa gtccaacctg
2281 cctgtgcatg acctgatcat tacatggctg tggttcctaa gcctgttgct gaagtcattg
2341 tcgctcagca atagggtgca gttttccagg aataggcatt tgctaattcc tggcatgaca
2401 ctctagtgac ttcctggtga ggcccagcct gtcctggtac agcagggtct tgctgtaact
2461 cagacattcc aagggtatgg gaagccatat tcacacctca cgctctggac atgatttagg
2521 gaagcaggga caccccccgc cccccacctt tgggatcagc ctccgccatt ccaagtcaac
2581 actcttcttg agcagaccgt gatttggaag agaggcacct gctggaaacc acacttcttg
2641 aaacagcctg ggtgacggtc ctttaggcag cctgccgccg tctctgtccc ggttcacctt
2701 gccgagagag gcgcgtctgc cccaccctca aaccctgtgg ggcctgatgg tgctcacgac
2761 tcttcctgca aagggaactg aagacctcca cattaagtgg ctttttaaca tgaaaaacac
2821 ggcagctgta gctcccgagc tactctcttg ccagcatttt cacattttgc ctttctcgtg
2881 gtagaagcca gtacagagaa attctgtggt gggaacattc gaggtgtcac cctgcagagc
2941 tatggtgagg tgtggataag gcttaggtgc caggctgtaa gcattctgag ctggcttgtt
3001 gtttttaagt cctgtatatg tatgtagtag tttgggtgtg tatatatagt agcatttcaa
3061 aatggacgta ctggtttaac ctcctatcct tggagagcag ctggctctcc accttgttac
3121 acattatgtt agagaggtag cgagctgctc tgctatatgc cttaagccaa tatttactca
3181 tcaggtcatt attttttaca atggccatgg aataaaccat ttttacaaaa ataaaaacaa
3241 aaaaagc
//
LOCUS HUMUDPGTA 1855 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human 3,4-catechol estrogen UDP-glucuronosyltransferase mRNA,
complete cds.
ACCESSION J05428
KEYWORDS 3,4-catechol estrogen UDP-glucuronosyltransferase.
SOURCE Human liver, cDNA to mRNA, clone 63-11.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1855)
AUTHORS Ritter,J.K., Sheen,Y.Y. and Owens,I.S.
TITLE Cloning and expression of human liver UDP-glucuronosyltransferase
in COS-1 cells: 3,4-catechol estrogens and estriol as primary
substrates
JOURNAL J. Biol. Chem. 265, 7900-7906 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.S.Owens, 22-FEB-1990.
FEATURES from to/span description
pept 15 1604 UDP-glucuronosyltransferase (EC 2.4.1.17)
BASE COUNT 571 a 337 c 394 g 553 t
ORIGIN
1 tgcattgcac caggatgtct gtgaaatgga cttcagtaat tttgctaata caactgagct
61 tttgctttag ctctgggaat tgtggaaagg tgctggtgtg ggcagcagaa tacagccatt
121 ggatgaatat aaagacaatc ctggatgagc ttattcagag aggtcatgag gtgactgtac
181 tggcatcttc agcttccatt ctttttgatc ccaacaactc atccgctctt aaaattgaaa
241 tttatcccac atctttaact aaaactgagt tggagaattt catcatgcaa cagattaaga
301 gatggtcaga ccttccaaaa gatacatttt ggttatattt ttcacaagta caggaaatca
361 tgtcaatatt tggtgacata actagaaagt tctgtaaaga tgtagtttca aataagaaat
421 ttatgaaaaa agtacaagag tcaagatttg acgtcatttt tgcagatgct atttttccct
481 gtagtgagct gctggctgag ctatttaaca taccctttgt gtacagtctc agcttctctc
541 ctggctacac ttttgaaaag catagtggag gatttatttt ccctccttcc tacgtacctg
601 ttgttatgtc agaattaact gatcaaatga ctttcatgga gagggtaaaa aatatgatct
661 atgtgcttta ctttgacttt tggttcgaaa tatttgacat gaagaagtgg gatcagtttt
721 atagtgaagt tctaggaaga cccactacgt tatctgagac aatggggaaa gctgacgtat
781 ggcttattcg aaactcctgg aattttcagt ttcctcatcc actcttacca aatgttgatt
841 ttgttggagg actccactgc aaacctgcca aacccctgcc taaggaaatg gaagactttg
901 tacagagctc tggagaaaat ggtgttgtgg tgttttctct ggggtcaatg gtcagtaaca
961 tgacagaaga aagggccaac gtaattgcat cagccctggc ccagatccca caaaaggttc
1021 tgtggagatt tgatgggaat aaaccagata ccttaggtct caatactcgg ctgtataagt
1081 ggatacccca gaatgacctt ctaggtcatc caaagaccag agcttttata actcatggtg
1141 gagccaatgg catctacgag gcaatctacc atgggatccc tatggtgggg attccattgt
1201 ttgccgatca acctgataac attgctcaca tgaaggccag gggagcagct gttagagtgg
1261 acttcaacac aatgtcgagt acagacttgc tgaatgcatt gaagagagta attaatgatc
1321 cttcatataa agagaatgtt atgaaattat caagaattca acatgatcaa ccagtgaagc
1381 ccctggatcg agcagtcttc tggattgaat ttgtcatgcg ccacaaagga gctaaacacc
1441 ttcgggttgc agcccacgac ctcacctggt tccagtacca ctctttggat gtgattgggt
1501 tcctgctggt ctgtgtggca actgtgatat ttatcgtcac aaaatgttgt ctgttttgtt
1561 tctggaagtt tgctagaaaa gcaaagaagg gaaaaaatga ttagttatat ctgagatttg
1621 aagctggaaa acctgatagg tgagactact tcagtttatt ccagcaagaa agattgtgat
1681 gcaagatttc tttcttcctg agacaaaaaa aaaaaaaaga aaaaaaaatc ttttcaaaat
1741 ttactttgtc aaataaaaat ttgtttttca gagatttacc acccagttca tggttagaaa
1801 tattttgtgg caatgaagaa aacactacgg aaaataaaaa ataagataaa gcctt
//
LOCUS RATGFGDVA 630 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Rat glioma-derived vascular endothelial cell growth factor mRNA,
complete cds.
ACCESSION M32167
KEYWORDS endothelial cell growth factor protein.
SOURCE Rat cell line GS-9L, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 630)
AUTHORS Conn,G., Bayne,M.L., Soderman,D.D., Kwok,P.W., Sullivan,K.A.,
Palisi,T.M., Hope,D.A. and Thomas,K.A.
TITLE Amino acid and cDNA sequences of a vascular endothelial cell
mitogen that is homologous to platelet-derived growth factor
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2628-2632 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.L.Bayne, 16-FEB-1900.
FEATURES from to/span description
pept 5 577 endothelial cell growth factor protein
precursor
sigp 5 82 endothelial cell growth factor protein signal
peptide
matp 83 574 endothelial cell growth factor protein
BASE COUNT 169 a 163 c 172 g 126 t
ORIGIN
1 aaccatgaac tttctgctct cttgggtgca ctggaccctg gctttactgc tgtacctcca
61 ccatgccaag tggtcccagg ctgcacccac gacagaaggg gagcagaaag cccatgaagt
121 ggtgaagttc atggacgtct accagcgcag ctattgccgt ccgattgaga ccctggtgga
181 catcttccag gagtaccccg atgagataga gtatatcttc aagccgtcct gtgtgcccct
241 aatgcggtgt gcgggctgct gcaatgatga agccctggag tgcgtgccca cgtcggagag
301 caacgtcact atgcagatca tgcggatcaa acctcaccaa agccagcaca taggagagat
361 gagcttcctg cagcatagca gatgtgaatg cagaccaaag aaagatagaa caaagccaga
421 aaatcactgt gagccttgtt cagagcggag aaagcatttg tttgtccaag atccgcagac
481 gtgtaaatgt tcctgcaaaa acacagactc gcgttgcaag gcgaggcagc ttgagttaaa
541 cgaacgtact tgcagatgtg acaagccaag gcggtgagcc aggctgcagg aaggagcctc
601 ctcagggttt cgggaactag acctctcacc
//
LOCUS RATYWKII 1821 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Rat sperm membrane protein (YWK-II) mRNA, 3' end.
ACCESSION M31322
KEYWORDS sperm membrane protein.
SOURCE Rat testis cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1821)
AUTHORS Yan,Y.C., Bai,Y., Wang,L.F. and Miao,S.Koide.S.S.
TITLE Characterization of cDNA encoding a rat sperm membrane protein
related to A4 amyloid protein
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2405-2408 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.S.Koide 17-JAN-1990.
FEATURES from to/span description
pept < 1 576 sperm membrane protein (AA at 1)
mRNA < 1 1821 YWK-II mRNA
BASE COUNT 447 a 427 c 416 g 531 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcgtta cctcctccat ctcagagaac cctgtggatg tccgggtgag ctctgaggaa
61 agtgaggaga tcccgccgtt ccaccctttc catcccttcc catccttgtc tgagaacgaa
121 gacactcagc cggagttgta ccacccaatg aaaaaaggat ctggaatggc agagcaagat
181 gggggcctga ttggtgcaga agaaaaggtg atcaacagta agaataaaat ggatgaaaat
241 atggtcattg acgagactct ggatgttaag gaaatgattt tcaatgctga gagagttggg
301 ggtctggagg aagagccgga ctctgtgggg cctctacggg aggacttcag tttgagcagc
361 agtgccctta ttggcttgct ggtcatcgcg gtggccattg ctacagtcat cgtcatcagc
421 ctggtgatgc tgaggaagag gcaatacggc accatcagcc atggcattgt ggaggttcat
481 ccaatgctca ccccagaaga gcgtcacttg aacaagatgc agaaccacgg ctatgaaaac
541 ccaacctaca aatacctgga gcagatgcag atttaaggac agcagcgtgt gcgacaccct
601 ggctgaggct gctgcaggtg ggctggaaga gcctcagcgt ttgtgcttga ctgctgacca
661 ccagcggtgc cagaggcctc atcctacatc ctgctctcct ggattgttaa gactataaag
721 tactactgta ggattgcaat ttccattctt ttaaatgggt ttaaaagatg ttaatataac
781 aatatatgat atataaacct taagtgaaaa aaagatctat tgcagatatc tgatggatgt
841 agttttcttt ttttaaatta gaaatgccac ttctattgta ttgtctcaca catgctctat
901 ataaatggaa aatgttgatt tttcaatgat agactatata cacaggctgt tcccgttatg
961 taagtctgtt ctttaggctc gtttgctggg ctggttttgt cgtcatttgt tttaatgtat
1021 aaaggcagta ttcccctttt caggttgctg agaaatgtaa gtggaactga agtacattgt
1081 atgcagttac tgactgtttt aggcatagtc tccttggaag cctagagctt ccagtgccgg
1141 gtgtccagtg cctgtcacca aagcaagggc taagtcacct tgagctagct ggatgcaaac
1201 tagatccact gtgctttcct tcaaatccag ttcttccaca gcaaccagcc catagttgtt
1261 ctgtgttctt ccacagctgt ttacggtagc ctcctagcca ctctcctcag caagtgcatc
1321 caagagtgca ccaccccctt ctttggacgt ctccgtccca tgcactgacc ctctgcttgc
1381 cttcgtacct cacttcctcc accgctcttc agcccctttg atgtcccctc agagaatacc
1441 gatatacaca tggctaagga cccaggagac ttcacgggag gcctcattag gtgaaaggac
1501 gatgttctgg gctgtacatg aaattggatc tgtagacact gtgtttcctt cactgacttg
1561 taatgtcacg cagctggagt tgatgccaca acccttagtg ctttgttgct gttttgtttt
1621 tcagggttct ggtaacctgc tactgttttt gttttggttt tggtttggtt ttttttgtat
1681 ttttctgtga tttccctccc cttccccccc atgcctcttc ccactatgca cagatggaaa
1741 ctttacctac aaactccttc gtatgatctg tggagaatgt acagaactta ttacatcaat
1801 aaaacacttt aacttccccc g
//
LOCUS RHBNODVW 4227 bp ds-DNA BCT 27-JUL-1990
DEFINITION B.japonicum nodulation protein (nodV, nodW) genes, complete cds.
ACCESSION M31765
KEYWORDS nodulation protein; regulatory protein.
SOURCE B.japonicum (strain USDA3I1b110spc4) DNA, clone pRJ141.
ORGANISM Bradyrhizobium japonicum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 4227)
AUTHORS Goettfert,M., Grob,P. and Hennecke,H.
TITLE Proposed regulatory pathway encoded by the nodV and nodW genes,
determinants of host specificity in Bradyrhizobium japonicum
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2680-2684 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Goettfert, 01-FEB-1990.
FEATURES from to/span description
pept 648 3317 nodulation protein (nodV)
pept 3314 3997 nodulation protein (nodW; gtg start)
BASE COUNT 919 a 1147 c 1228 g 933 t
ORIGIN
1 ggatcctgcg gacgcagctt gtcgttctcg ttacgatgcg cgctgcgtta gcagagcatg
61 gtgggtgtga tcagcgtgaa tggcaccgac aaagccgtgg cgagtcatac gaggctgccc
121 tgcgttctgc gacgccgctc gcttccagcg gaaccagttt ttcgcagcta aagcgcgata
181 agattggaac gaatcatcat cgcgcgtagg actcgcgcga ctccggcttc tgtgtcggca
241 ggcgaacttg cgctttcttc gctttagcgg ctgagtcggg gagacgagtc attgggcagc
301 ttttttctga tccttgccgg tccggaggcg cgccttcgaa cctatcggct gaactgaaac
361 cactgtcctc attccgcggc ctaatggggc gcattggccg taggctcgag cgatgcgcaa
421 aagcggacgt ttcagcttcg actaaagtcg cgtcagcgac acctactgaa ggtttaggac
481 aaatatacgt atgtttataa tgcgtttttt ccccgcgcgt gctgagttcg gcagccgatg
541 atcgatcttg cacacgctat catgcggacc caagtgttca agattgcctc atttttgcct
601 aaaatgctat agctctcccg ccggataaca ggccggacgg tctcaaaatg tataggatca
661 gcagtttcaa acgttcagcc cctcagttga ccttgggcag catagcgcta gccgcggtta
721 cactgacttg cgtgtacttt caggcgcatt tcgccgccgc ggcgttcgcc tatttgttag
781 tagtcttact attttcgttg atgggcagct tcattgcttc atcagcgctt tgcatcgtcg
841 caatcgctgc tctcgcatac tactttgcgc cgccggcgtt tagtttacga atcgatgatc
901 cccgcgatgt tcctgtggtt gttgcatttc ttattgtctc tgttgtcgga acgtacctga
961 ttggaaaact ccgccaggaa agagaggctg cacgtgtggc tgcggccaag cttcagcgga
1021 gtgcctcgga tttggaggat cgtgaaaaac ggtggcgcgc aattttcgag cacaatccag
1081 ccatgtactt catggtcgat gaggccggca ttgtcctcaa cgtcaatacg ttgggcgcga
1141 cacaactggg atttgcttgt gcagaactat tgggccaatc cgtgctcgac gtatttctgg
1201 aggaggatcg cgcattcgtc cgcaaatgca ttcagacgtg tcttgaggat gttggacaat
1261 cgcgcacttg ggacgtccgg aaagtcagga aggacggttc tgtattgtgg gtgcgtgaaa
1321 acgccaaggc catgctttgg gccggcgacc gccccgtcat cctcatggcg tgcgaagata
1381 ttacggagcg caagcagacc gagcttgcgc tgcagcggag cgaagcgcat ttggctcacg
1441 cgcaggagtt gagtcataca ggcagcttca gctggaacgc ctctaccggc gaggccttct
1501 ggtctaagga gacatttcgg attttccaaa tcgatcttca gacgacaccg gcgccacaac
1561 tcgtcattga gcgcacgcac ccagatgata gggcttctgt caaagagatt atcgatgaag
1621 cgatgcgaga cctgagggat ttcgagcacg agtaccggct gctgctacct gacggctccg
1681 tgaagcacat ccatgcgcag gcacgagtca cgcgaaccgc ctctggtgaa attgagtttg
1741 ttggggcagc caccgatatt acggcagcta ggcgagcaga acagcagttg cgccgaagcg
1801 aggcctatct ggccgaggct cagcatctca ctcacacagg cagctggtcc tgggacgtcc
1861 acacacgaga tttcgtttat cgctccgctg aggtcgaccg cctgtttggc tttaacccac
1921 aagagccggt ttcgctagag actattcgat cgcgcatcca tccggaagac ttgccagggt
1981 tgcaggaggt gcagcgtcag gcgattgacc aggagcacga acggttcgaa tatgatttcc
2041 gtgttattct gccagatggc gggataaggc gcatacactc cgttgcacac gttgtcgtcg
2101 gcagcgatgg taatgtcagc gagctgatcg gaacacatat ggatgttacc gagcaacacg
2161 cagctaggga acgcttggaa aacacgcttg tagcgctgcg cgagagcgaa cagcgctttc
2221 gcgactatgc cgagactgct tccgactggc tctgggaaac cgggccagat catcgggtca
2281 ctcacttgtc cgagcacacc agcgctgcgg gaattttggc gacagggtta acgggcctgc
2341 ttcgctggga catcgcgtgc gacatggaag aagaaccgga gaagtggcgt cagcatcggg
2401 cgacgttgca ggcacacctg ccgttccggg atctcatcta ccgtaccgta aataggatgg
2461 gatctccgat ctacgtccgc actagtggca agcccttctt cgacggaaac ggaaattttt
2521 tgggctatcg cggcgtcagc actgacatca ccgctaccat tcgcgctgat caggccgaac
2581 aagagctgcg aaaggcacag gcggagcttg cacatgtgac gcgtgtaacg accttgggag
2641 aaatgacaac ttctatcgcc cacgagataa cccaaccact cgccgctatc ctcagcaacg
2701 ccgatgcgtg cctcggctgg atggctcgcg atgttcccaa tcttgcagcc gcgcgctctt
2761 cagtcgaatg gatcatagaa gatgcaatcc gggcaagcga ggtgatccgt agtattcgcg
2821 cactcgcgaa aaagggcgag atcgagatgg tgccgctcga cattaatcag gtggttaggg
2881 acgtcagcgc gctggtaaca cgagagctgg tgagccacca agtgacgttg cgaagcgagt
2941 tggcgtctgc gctgcctagg gtcctcggtg atcgaattca gctacaacaa gtgatcatca
3001 atctggtgat gaacggaatc gaggccatgg acgcagttac agaccggccg cgtgaacttc
3061 tgattcaatc atctacggac gatctggggt acgtgcagct ttccgtgacc gattgcggcg
3121 tcgggatcgc cgagaatgac gcggaccgcg tcttggaccc cttcttcacc accaaatcga
3181 gcggcctagg aatgggcctt tcgatctgcc ggtcgatcgt ggaagttcac ggaggacgaa
3241 tttcagtggt tcagaaaaat ggaccgggcg cgacgttcca gtttgccctt ccgctgcata
3301 aggaggccat ctcgtgacag gacgatttga ctggagaggc caaggcggac ataccgaggc
3361 ttcgacgaag gcaatcgtct ttgtcgtcga ggatgacatc tctatgcgtc gctcgcttac
3421 gaaccttttt cgatcggtag gcttggaggt cgtggcgttc ggatcggccc gtgaaatgct
3481 gcagagcaca atgccggacg tcacaagctg tctagttctt gatgtccggc tgccgggcct
3541 gagcggcctt gactaccaga ccgagctcgc caggttgaac atacacatcc caatcatctt
3601 cattaccggc catggcgaca ttcccatgac cgtcagggcc atgaagggag gcgcggtcga
3661 tttcctcagc aagcccttcc gcgatcagga actgcttgat gccgtcgttg cggcgaccga
3721 acgcgatcgc aaaagacgag aagctcagcg aaccgtggcg aacctgaaat ctctatttga
3781 gaccctaagc ccgcgagaac aggcagtgat gaaactggtc gcgacggggc tgatgaacaa
3841 gcaggtagcc gccgaacttg ggctcgccga gatcaccgtc aagatctacc ggggacacgt
3901 aatgaaaaag atgcgtgccc gctcgctggc tgacttgatc agaatgagcg agacgctcgg
3961 aattagcgcc aatcacactg aacaaaccca agtatgattt tacaattcca tcacttaagc
4021 ccactttcgc gaaagtggct gacggtttgg cagccgctgt actgcgtcgg gagggctcat
4081 cttgtccacg cctttgattt ccgtcgttga cgacgacccc tcggtccgtg cggcgacaga
4141 aaaccttttg aaatcgcgtg gctacgtcgt gcagatattt gcctcggccg aggcgctcct
4201 gcggtcgccg cggttgaacg agatatc
//
LOCUS ANAPETF1A 1682 bp ds-DNA BCT 27-JUL-1990
DEFINITION A.variabilis ferredoxin (petF1) gene, complete cds.
ACCESSION M35126
KEYWORDS ferredoxin.
SOURCE A.variabilis (ATCC 29413; PCC 7937) DNA, clone pVA1.
ORGANISM Anabaena variabilis
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Nostocaceae.
REFERENCE 1 (bases 1 to 1682)
AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M.,
Van Arkel,G. and Weisbeek,P.
TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and
Synechococcus sp. PCC 7942: Structure and regulation
JOURNAL Photosyn. Res. 18, 179-204 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 794 1093 ferredoxin (petF1)
binding 782 785 ribosomal binding site (put.)
BASE COUNT 480 a 353 c 322 g 527 t
ORIGIN
1 aagcttgatg taagccacct aaaagttgaa tgtgttcgac gtggagtaat ttttgctgtg
61 ctttggctaa ggcaatcagt tcttctgctt cagttaaatc tacagataaa ggatattcta
121 caattacgtg tttgccggca gttagagcag cacggcgatc gcaccatgat cacgattgat
181 ggtacaaata accactaaat ctatatcttc tcgttctact aactgttgcc agccagttat
241 cgcttcagcc tggtaatctt gagccaaagc ctgggttcgc tccagtgtac tacccgctat
301 ggcgactagg tgcgatcgct tatcctccaa aaaggcttct gcccgcagtt ttgctgcata
361 ccctgtccaa ctatgccaat acgtattgtt gcttgtgcca aagctacctc cgactcctat
421 ataatcctca ccagtcctgc taccaagcac agtaataaat ttttattttt caaaaaaagc
481 taggttaacg cagtaccagc ctaactgaag gttatccctg ttccagaact tttttatcat
541 attcttgatt caagccataa atttttctaa taactgtggg tttaacgtct ttaaagtata
601 aaaaaaattt attgttatct gctaactaaa tttcattact aatcgtggtg gatttcaagt
661 tatatacttt gatttttccc gtagtatcag aattgaacta aatttaattc tgtggctaat
721 cccctgagaa tagccgctaa gttctgcttt agcataactt atactgccga ttacaaaaga
781 gaggattacg gaaatggcaa cttttaaagt tacattgatc aacgaagcag aaggaaccag
841 caacacaatt gacgttcctg atgatgagta tattttagac gctgccgaag aacagggtta
901 tgacctaccc ttttcctgtc gtgcaggtgc ttgctccacc tgcgccggta aactagtatc
961 cggtactgtt gaccagtctg accaatcatt cttggatgac gatcaaatcg aagctggata
1021 tgtattgacc tgtgttgctt atccaacctc tgatgtaacc atccaaaccc acaaagaaga
1081 agacctctac taagagtcaa gttagcttta aaaatagagg ctagaggcta gcagctacag
1141 actagtccct agccttttta ttttttttca ttagggaaca gtctattggg actgggtgta
1201 agggtgtaag ggtatagggg tgtaatagtt tcaaacattt atactctttt caacccttga
1261 tatttcgttt tcatccgtaa gtcccacaaa aaataaatta tccaaaattg atggtttggt
1321 agggtgcgtc aatagaaatc atttctgagt gtatttaggc tctatcgcac tgacggacac
1381 tacattttgg atattttttg atctgaaagt cccttaaaat ttgatttctt cattcaggct
1441 gtgaagttgt gtaccaggat aataaaattg gagaattttg ctgttagacc aacctagttt
1501 agctaaagtt tgagcgccag tttgacttaa gcccactcca tgtcctaaac caccaccaat
1561 aaaagcgtat cccacaactc ttctttgcct ttgttgagag gtagtatgta aaaaagcgtg
1621 cttcttggcg cagcaaaggc actacgcact tcatctttgt gtagggtaaa aatgcccata
1681 tc
//
LOCUS ANIPETF1A 1074 bp ds-DNA BCT 27-JUL-1990
DEFINITION A.nidulans ferredoxin (petF1) gene, complete cds.
ACCESSION M35127
KEYWORDS ferredoxin.
SOURCE A.nidulans (strain R2, PCC7942) DNA, clone pA612.
ORGANISM Anacystis nidulans
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria.
REFERENCE 1 (bases 1 to 1074)
AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M.,
Van Arkel,G. and Weisbeek,P.
TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and
Synechococcus sp. PCC 7942: Structure and regulation
JOURNAL Photosyn. Res. 18, 179-204 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 504 803 ferredoxin (petF1)
binding 493 496 ribosomal binding site (put.)
BASE COUNT 234 a 298 c 281 g 261 t
ORIGIN
1 ctgcagccgc tgcatgcgca gggcgttcga tttgagggct ttttgtggaa catcttggcg
61 cacatcatcg aagctctaac accgaggctg tctgagttgg ctagtcctgg cagctgggca
121 atctttagcg gcttgctaac cagtcaagcc gacactgtca gcgtcacttt ggaagagtac
181 ttttgggtga tccgcgatcg cgccagtcag ggagattggt gtcgtttggt cgcggatttt
241 cggccggaac gataaatctc actaatgctt agcttagagg gcttactggg agcgggccga
301 gtttgagccg tgattacccc tacgaacttt ccggccacgc tccattgctt agacataaaa
361 ttcccttatg tctagactgg cgattgatag catttctcgc ggcgcagttc gccctttggc
421 aacccatagt atcaatggga aaggtacggg caggctgtca atcgatgagc tctgccaccc
481 caaaagcgat agaggacacg ctcatggcaa cctacaaggt tacgctcgtc aatgctgccg
541 aaggcttgaa caccacgatc gacgtggctg acgataccta catcttggac gccgctgaag
601 agcaaggcat tgacctgcct tactcctgcc gtgctggtgc ttgctcgacc tgtgctggca
661 aagtcgtctc tggtaccgtc gaccaatcgg atcaatcctt cttggatgac gaccaaattg
721 cagcaggctt tgtcctgacc tgcgtcgcct atccgacctc cgatgtgacg atcgaaaccc
781 acaaagaaga agacctctac taagtcttgc ttcgactgct tcaatcctta gaattcaaat
841 caaattgcgg cttccaaatt gggagtcgct tttttatcgc cgtagtcagc agcgatcgcc
901 tcactaccgt ccagaaacag cctgcgatcg ccctgactta acgcccctcc cctagcaatc
961 ctgactcggt agagttcagg ccaaggactc aaggttgagg cgatcgttca gcaacgtctt
1021 gctctaggcg tccggcgtga ctgttgtggg gaagaggcga tcgccttcct gcag
//
LOCUS ASAASPPA 1764 bp ds-DNA PLN 27-JUL-1990
DEFINITION A.awamori aspergillopepsin A (pepA) gene, complete cds.
ACCESSION M34454
KEYWORDS aspartic proteinase; aspergillopepsin.
SOURCE A.awamori (NRRL3112 mutant strain UVK143f) DNA.
ORGANISM Aspergillus awamori
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 1764)
AUTHORS Berka,R.M., Ward,M., Wilson,L.J., Hayenga,K.J., Kodama,K.H.,
Carlomagno,L.P. and Thompson,S.A.
TITLE Molecular cloning and deletion of the gene encoding
aspergillopepsin A from Aspergillus awamori
JOURNAL Gene 86, 153-162 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 177 496 aspergillopepsin (pepA), exon 1
548 825 aspergillopepsin, exon 2
878 1126 aspergillopepsin, exon 3
1186 1523 aspergillopepsin, exon 4
sigp 177 308 aspergillopepsin signal peptide
matp 455 496 aspergillopepsin
548 825 aspergillopepsin
878 1126 aspergillopepsin
1186 1523 aspergillopepsin
pre-msg 124 1764 pepA mRNA and introns (put.)
pre-msg 125 1764 pepA mRNA and introns (put.)
pre-msg 127 1764 pepA mRNA and introns (put.)
IVS 497 547 pepA intron A
IVS 826 877 pepA intron B
IVS 1127 1185 pepA intron C
signal 54 58 TATA box
BASE COUNT 387 a 515 c 445 g 417 t
ORIGIN
1 gtcgacttgg atgatggaga gttgatcgag gtcaatgagg agaggcttgc aagtataaga
61 agagactgct cgaccagcag aatggatctt cttgttcatc aaccaagagt ccaaggcttc
121 tttgtctggt tctatctctt ctccgaactc tcttgcttga cattctcgtg gtcaaaatgg
181 tcgtcttcag caaaaccgct gccctcgttc tgggtctgtc ctccgccgtc tctgcggcgc
241 cggctcctac tcgcaagggc ttcaccatca accagattgc ccggcctgcc aacaagaccc
301 gcaccatcaa cctgccaggc atgtacgccc gttccctggc caagtttggc ggtacggtgc
361 cccagagcgt gaaggaggct gccagcaagg gtagtgccgt gaccacgccc cagaacaatg
421 acgaggagta cctgactccc gtcactgtcg gaaagtccac cctccatctg gactttgaca
481 ccggatctgc agatctgtaa gcttccctgc tcgggtgttc gggcaaatcg tgactaacct
541 ggactagctg ggtcttctcg gacgagctcc cttcctcgga gcagaccggt cacgatctgt
601 acacgcctag ctccagcgcg accaagctga gcggctacac ttgggacatc tcctacggtg
661 acggcagctc ggccagcgga gacgtgtacc gggatactgt cactgtcggc ggtgtcacca
721 ccaacaagca ggctgttgaa gcagccagca agatcagctc cgagttcgtt cagaacacgg
781 ccaatgacgg ccttttggga ctggccttta gctccatcaa cactggtgag tcaatcctac
841 atcagccggg ttgacctacc tgctgaccga tagacagtcc agccaaaggc gcagacaacc
901 ttcttcgaca ccgtcaagtc ccagctggac tctccccttt tcgccgtgca gctgaagcac
961 gacgcccccg gtgtttacga ctttggctac atcgatgact ccaagtacac cggttctatc
1021 acctacacgg atgccgatag ctcccagggt tactggggct tcagcaccga cggctacagt
1081 atcggtgacg gcagctccag ctccagcggc ttcagcgcca ttgctggtaa gaaccgcctt
1141 catttaacac acaacttgtc cacctcttta ctaactagtg tatagacacc ggtaccaccc
1201 tcatcctcct cgatgacgaa atcgtctccg cctactacga gcaggtttct ggcgcctcag
1261 gagagacgga agccggtggc tacgttttct cttgctcgac caacccccct gacttcactg
1321 tcgtgattgg cgactacaag gccgttgttc cgggcaagta catcaactac gctcccatct
1381 cgactggcag ctccacctgc tttggcggta tccagagcaa cagcggtctg ggactgtcca
1441 tcctgggtga tgttttcttg aagagccagt acgtggtctt caactctgag ggccctaagc
1501 tgggattcgc cgctcaggct tagattatcc actgaagtgg agtctatgat ctgctgattg
1561 atccctcgac gatgaactac atgtggaaat gcatagcaga cgagggtgat ggtgatgatg
1621 ttgatttgat gatgacccgt acatacttga tgaagctcgg tacatatgca atgtgactgt
1681 atctatgtga tgatatatgt atcatctcat aacttttggc tatgagtgca ggataacact
1741 gaaccagtag tagtactttc cact
//
LOCUS BSUPSIF2A 2758 bp ds-DNA BCT 27-JUL-1990
DEFINITION B.subtilis protein synthesis initiation factor 2 (infB) gene,
complete cds.
ACCESSION M34836
KEYWORDS protein synthesis initiation factor 2.
SOURCE B.subtilis (strain RS410) DNA, clones lambda-JET[1,2],pUK, and
pJET1.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2758)
AUTHORS Shazand,K., Tucker,J., Chiang,R., Stansmore,K.,
Sperling-Petersen,H.U., Grunberg-Manago,M., Rabinowitz,J.C. and
Leighton,T.
TITLE Isolation and molecular genetic characterization of the Bacillus
subtilis gene (infB) encoding protein synthesis initiation factor 2
JOURNAL J. Bacteriol. 172, 2675-2687 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 381 2531 protein synthesis initiation factor 2 (infB)
binding 369 375 ribosomal binding site (put.)
BASE COUNT 917 a 535 c 662 g 644 t
ORIGIN
1 atcgatgacc agattttcga tgaattgctg gaactggcgg aaaaggtgaa aaaataaaat
61 gtctggaatg gaatggtttc ccttgctggg tctggccaat cgagctcgta aggtcgtgtc
121 aggcgaagac ttggtaataa aagaaatcag gaatgcgcgt gcaaagcttg tcctgcttac
181 agaggatgca tcatctaaca cagcaaaaaa agtaactgac aagtgcaatt attataaagt
241 cccttataaa aaagtcgaga gtcgcgcggt tcttggacgc tctatcggta aagaagcccg
301 tgtcgttgtc gccgtcactg accaaggttt tgcgaataag ctgatcagct tgctcgatta
361 atatttttgg gggtgaacga atggctaaaa tgagagtata cgaatatgca aaagcgttaa
421 atgtttcaag taaggaaatt ttgaccgcac tgaagaacat ggatttagaa gtgaataatc
481 acatggccat gcttgaagaa aaggccatta aaaagctaga tgccaaatat aaaaaaggcg
541 gcgcacgtgc taaatctcaa aagccagcag aaacgaacaa aaacaaacag ccacaagggg
601 ttaatcagca atcagctgga aatcaaccaa ataaaattcg agacggaaag aagaatgacg
661 tgcagaataa tcaatttaac aaaaacaaga agaataacaa caacaaaaaa aataaacgca
721 acaacaacaa taataaaaac caacatcagc aaaagcctgt aaagccgaaa aaagagcttc
781 ctgagaaaat tacattctct ggcactttaa cagttggcgc acttgctgaa gagcttggca
841 aagagccttc agaactcatt aaaaagctga tgcttcttgg cgtaatggca accattaacc
901 aagagcttga taaagacaca atcgaactca ttgcatcaga atatggtgtt gaaacagaag
961 aggtcattgt gcttgaagaa acagagctgg aaaaatacga agagcctgat aatgaagagg
1021 atcttgaaat tcgtcctcct gtcgtgacaa tcatgggcca cgttgaccat gggaaaacaa
1081 cgcttcttga cagcatccgt aaaacaaagg ttgttgaagg ggaagcaggc ggaatcactc
1141 agcatatcgg cgcttatcaa attgaagaga acggcaagaa aatcacgttc ttggatacac
1201 cgggccacgc cgcatttaca acaatgcgcg cacgcggtgc agaagtaact gatattacga
1261 ttcttgtcgt agctgccgat gacggtgtca tgccgcaaac agttgaagcg attaaccatg
1321 caaaagcagc agaggttcca atcatcgttg ctgtgaataa aatagataaa gaatccgcaa
1381 accctgaccg tgtaatgcaa gaactgacgg aatacggact tgtccctgaa gcttggggcg
1441 gagaaaccat ttttgtgccg ctttccgctt taactggaaa aggcattgat gagctcgttg
1501 aaatgatttt gcttgtcagt gaggtagagg aactgaaagc gaatccgaat cgtcaggcaa
1561 aaggaacggt tattgaagct gaactcgata aaggcagagg gtcagtcgcg acgttgctcg
1621 tacagactgg aacactgcat gtcggtgatc cgatcgtagt cggcaataca tttggccgtg
1681 tccgtgcaat ggtcaacgac attggccgcc gtgtgaaaac tgccggcccg tcaactccgg
1741 ttgaaattac cggtttgaat gatgtccctc aagcgggaga ccaattcctt gtctttaagg
1801 atgaaaaaac agctcgttct gtcggtgaag cccgtgcttc gaaacagctt gaagagcagc
1861 gcagcgataa agcgaagctc agtcttgatg atttatttga gcaaattaag caaggtgatg
1921 taaaagacat caacctcatc gtaaaagctg acgttcaagg atctgctgaa gctttaacgg
1981 ctgcacttca aaaaattgaa gtagaaggcg ttaaagtgaa aatcatccat acaggcgttg
2041 gtgcgattac tgaatcagac attatcttgg catctgcttc caatgcaatt gttatcgggt
2101 ttaatgtgag accggacgga aatgctaaga gtacggctga agctgaaaat gtagatattc
2161 gacttcaccg tatcatttac aaagtaatcg acgagattga agctgccatg aaaggtatgc
2221 ttgatcctga atatgaagaa aaagtaattg gtcaagtaga agtacgccaa acattcaaag
2281 tatctaaaat cggtacaatt gccggcggat atgttactga aggaaccatt acgcgcgaca
2341 gcggcctccg tttaattcgt gacggcgtcg tcatctttga aggcgaagta gatgttctga
2401 aacgctttaa agacgatgtg aaagaagttt cacaaggcta tgaatgtggt attacaatta
2461 agaaatacaa tgacattcgt gaaggtgaca tccttgaagc gtttgtcatg caagaaattg
2521 aaagaacgtg atcggatttg cggagtgtga atgcatcatt tatgatgcag gatcgctaaa
2581 agaaaagcgt gccgttctga agcggatttt aaccagggtt caaaacaagt tcaatgtttc
2641 gatttcggag attggctatc aggacacctg gcaaagaacc agcttcggaa tcgccgctgt
2701 ttcttcctct cgcgttcaaa cagaaaaaga actgcagcgc gtcctagcgt ttatcgat
//
LOCUS CIBRCRA 1845 bp ds-DNA BCT 27-JUL-1990
DEFINITION Plasmid Colicin Ib-P9 replication control region replicon initiator
protein (repZ) gene, complete cds.
ACCESSION M34837
KEYWORDS replicon initiator protein.
SOURCE Plasmid Colicin Ib-P9 (from E.coli) DNA, clone pCH10-delta-1.
ORGANISM Plasmid Colicin Ib-P9
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1845)
AUTHORS Hama,C., Takizawa,T., Moriwaki,H., Urasaki,Y. and Mizobuchi,K.
TITLE Organization of the replication control region of plasmid Co1Ib-P9
JOURNAL J. Bacteriol. 172, 1983-1991 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 455 1486 repZ protein
mRNA 157 1762 repZ protein mRNA (alt.)
mRNA 158 1762 repZ protein mRNA (alt.)
mRNA 363 < 1 (c) inc RNA
binding 442 448 ribosomal binding site (put.)
signal 377 372 (c) inc -10 region
signal 400 395 (c) inc -35 region
mut 353 353 c in wild type; t in inc3
mut 374 374 t in wild type; c in inc2
mut 400 400 a in wild type; c in inc1
BASE COUNT 505 a 467 c 490 g 383 t
ORIGIN
1 gatctaaagc agaaaaatct gcttttaaaa atagatttta tttttttgac gcaggtcaag
61 attaacttat tagaggtatc agtgaggagg cactggaaga gaagagatcg ttgtaatgct
121 tttcaaatta acgtaaagcg ggtatatttc ggctgttatt agctgtgcag agggtggcac
181 tctgtggagc aaagcggcga aagccggacg gcagaatgcg ccataaggca ttcaggagag
241 atggcatgta cgggcagtaa gtcagaagac tgaagatgtt ccggaagcca taaaaggaaa
301 acccccacta tctttcttac gaacttggcg gaaagacgaa agatagtggg ggcctcacag
361 aatacgggta aagtataatg aaaccgtacc agagattcaa ccctgtgcag tgtataaata
421 cacggcacaa tcgctccgcc ataagcgaca gcttgtggca ggtctgaaga atactccata
481 taacgcagta cactggagtc agttagcacc cgaagagcag atccgtttct gggaagacta
541 tgaagcggga agggcgacca ctttcctggt tgaaccggaa aggaagcgca cgaagcgtcg
601 tcgcggtgag cactccacca aacccaaatg cgaaaatccg tcctggtatc gtcctgagcg
661 ctataaagcg ctgaaagggc agctcgggca cgcctataac cgtctggtga aaaaggaccc
721 ggtgaccggc gagcagagcc tgcgcatgcg catgtcgcgg catccttttt atgtacagaa
781 acggacgttc gtcggtcgca aatacgcttt ccggccggaa aaacaacgtc tcctcgatgc
841 catctggccg gttctggtca gcttcagtga tgccggcaca cacaccgtgg ggatgagcgt
901 cactcggctg gctgaagaaa tcagcccgaa agacagcgag ggacatgtta ttccggaact
961 ggaagtgacg gtctcccgtc tttcccgtct gctggccgaa caggtgcgct ttggtgtgct
1021 gggtgtgtca gaggaaacaa tgtgggaccg tgaacatcgt cagcgtctgc cacgttacgt
1081 ctggataaca ccggcaggct ggcagatgct gggcgtcgac atggtaaaac ttcacgaaca
1141 gcagcagaaa cgactgcgtg aaagtgaaat ccgccagcag ctcattcggg aaggtgttct
1201 gcgtgaggat gaagatatct ccgttcatgc ggccagaaaa cgctggtatc tgcagcgcag
1261 ccaggatgcc ctgaagaagc gccgagaaaa agccgcagcc agcaagcgtg ccaaccgcct
1321 gaagaaatta cctgttgacc agcagattta tgagatggct gagtatctcc ggaagcgtct
1381 gccgccggat gaagcctatt tctgttccga tgaccatctg aagcgactgg ccatcaggga
1441 gttgcgtcag cttgaactga cgctggctgc cccgccaccg cactagacag catcattccc
1501 tcagcactga atcatcgcca gcccctccgg ggctttcggc gcagattccg ctcagcccaa
1561 aatccttagt agtcacctta aatcccctca gaggggcata tctgcccata aaaccacgca
1621 tcagtcatca gaacgtggcc acgttgtttc agttatccac ataaatccgc aaacaaagaa
1681 ttttaagaag ctgcaaacct gaaacagcaa acctgcaata tattcttaac cccatcattt
1741 aatcccctgc gttgcttcgc cgcagggaaa gtctttatct ctgaaaccac tgtgaacaaa
1801 tacaaaaagg ccttcgcttg cagcgggcaa ggccgcgccg gaatt
//
LOCUS ECOHEMBA 1085 bp ds-DNA BCT 27-JUL-1990
DEFINITION E.coli porphobilinogen synthase (hemB) gene, complete cds.
ACCESSION M35121
KEYWORDS 5-aminolevulinic acid dehydratase; hemB gene;
porphobilinogen synthase.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1085)
AUTHORS Echelard,Y., Dymetryszyn,J., Drolet,M. and Sasarman,A.
TITLE Nucleotide sequence of the hemB gene of Escherichia coli K12
JOURNAL Mol. Gen. Genet. 214, 503-508 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 80 1054 porphobilinogen synthase (hemB)
signal 15 20 hemB -35 region
signal 41 46 hemB -10 region
BASE COUNT 254 a 277 c 301 g 253 t
ORIGIN
1 tcgcgacaac tttcgtaaaa catccctacc ctgcttcagg tatactatgc ccctcgattc
61 cacaaacatc aggcagacca tgacagactt aatccaacgc cctcgtcgcc tgcgcaaatc
121 tcctgcgctg cgcgtatgtt tgaagagaca acacttagcc ttaacgacct ggtgttgccg
181 atctttgttg aagaagaaat tgacggacta caaagccgtt gaagccatgc caggcgtgat
241 gcgcattcca gagaaacatc tggcacgcga aattgaacgc atcgccaacg ccggtattcg
301 ttccgtgatg acttttggca tctctcacca taccgatgaa accggcagcg atgcctggcg
361 ggaagatgga ctggtggcgc gtatgtcgcg catctgcaag cagaccgtgc cagaaatgat
421 cgttatgtca gacacctgct tctgtgaata cacttctcac ggtcactgcg gtgtgctgtg
481 cgagcatggc gtcgacaacg acgcgactct ggaaaattta ggcaagcaag ccgtggttgc
541 agctgctgca ggtgcagact tcatcgcccc ttccgccgcg atggacggcc aggtacaggc
601 gattcgtcag gcgctggacg ctgcgggatt taaagatacg gcgattatgt cgtattcgac
661 caagttcgcc tcctcctttt atggcccgtt ccgtgaagct gccggaagcg cattaaaagg
721 cgaccgcaaa agctatcaga tgaacccaat gaaccctcgt gaggcgattc gtgaatcact
781 gctggatgaa gcccagggcg cagactgcct gatggttaaa cctgctggag cgtacctcga
841 catcgtgcgt gagctgcgtg aacgtactga attgccgatt ggcgcgtatc aggtgagcgg
901 tgagtatgcg atgattaagt tcgccgcgct ggcgggtgct atagatgaag agaaagtcgt
961 gctcgaaagc ttaggttcga ttaagcgtgc gggtgcggat ctgattttca gctactttgc
1021 gctggatttg gctgagaaga agattctgcg ttaattttgt ttcctctcag aaagggggag
1081 ttaac
//
LOCUS ECOSTMA 1418 bp ds-DNA BCT 27-JUL-1990
DEFINITION E.coli F factor (stm) gene, complete cds.
ACCESSION M35124
KEYWORDS F factor; stm gene.
SOURCE E.coli DNA, clones pEG[157,159,175,176,220].
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1418)
AUTHORS Golub,E.I. and Panzer,H.A.
TITLE The F factor of Escherichia coli carries a locus of stable plasmid
inheritance stm, similar to the parB locus of plasmid RI
JOURNAL Mol. Gen. Genet. 214, 353-357 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 830 1009 F factor (stm)
signal 1013 1039 pot. transcription termination signal
signal 1074 1105 pot. transcription termination signal
BASE COUNT 279 a 364 c 429 g 345 t 1 others
ORIGIN
1 atcgctgtct gacctgcgat ttttcatgcc gtccctgacc gcagaagaac tgcacggcaa
61 ccgtcttcag tggctgtatg gcatcgatgt gctgattgaa acacagggcg aggtctgcct
121 ccttcccctt cccggtgatg cagcagagcg gtcgtttccg tcggtgcggt ttcgtgtcag
181 ggagcgcagt cgccataaat ccgcgctggt catgcagaag tacagccgcc agcaggcacg
241 ggaggcagag cagaaagccc gtgcgtacca ggcgctggtc gctcaggctg aaatcgaact
301 ggcattccat tcgccggaaa cagtcggaag ctggcatgca cgctggagtg accgggtggc
361 agagcatgac ctggaaaccc tgttctggca gtggggcgag cgttttccgt cactggccgg
421 aatggaacgc tggcagtggc aggatatgcc gttcgtggca ggttattgcg gaacgcagtc
481 ttgcagcaag agaggccggt cacgctgtcc gtgagatgag cgtgatgtgc cgaacaaact
541 ccggaggcag cgtgatgcgg gcaacaatca cacagattac ccgtaaacag cctgaatgag
601 cgggttattt tcaggaaaag tatgtgtggt cagcgtgcag ggatatggct atgatgtgcc
661 cggcgcttga ggctttctgc ctcatgacgt gaaggtggtt tgttgccgtg ttgtgtggca
721 gaaggacaaa agccccatag ttaatttttc attaacccac gaggcccctg catgcttaga
781 caacatcagg atagcctctt actgcgcttt cgacgaggag aagaaggcca tgaaactacc
841 acgcagctct cttgtctggt gtgtgttgat cgtgtgtctc acactgttga tattcactta
901 tctgacacga aaatcgctgt gcgagattcg ttacagagac ggatacaggg aggtngcggc
961 cggatacagg gaggtggcgg ctttcatggc ttacgaatcc ggtaagtagc aacctggagg
1021 cgggcgcagg cttttcaggg ctgatgctgg tctgactgca ctgatgcctt tataaagggg
1081 ctgctggttc gccggtggcc cctttctcct tgctggcatt gtacgggcat gaacaaccag
1141 actgccagga aggccgtttc cctctgggaa cggggaagaa cgtgcgcctg cggccgttct
1201 tttcccccgc cttctctgtt tatgacctgt cagaaatcac atcaccttcc tttgattatt
1261 ctctttttct ctgttctgac tctgattact gctgccctga ttcctgtctg ttttgttgtt
1321 ctcctggcag tgtcttttcc gggcgtcctg gcggggcggc tggcgtgaac ggagccagcg
1381 aagctgtctc tggcgggctt ccatccctga cgggaaac
//
LOCUS HUMGPIIBA 3333 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human platelet glycoprotein IIb (GPIIb) mRNA, complete cds.
ACCESSION M34480
KEYWORDS platelet glycoprotein IIb.
SOURCE Human megakaryocytes, cDNA to mRNA, clone IIb[3,4].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3333)
AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and
Marguerie,G.
TITLE GPIIb and GPIIIa amino acid sequences deduced from human
megakaryocyte cDNAs
JOURNAL Mol. Biol. Rep. 14, 27-33 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 33 3152 platelet glycoprotein IIb (GPIIb)
mRNA 1 3333 GPIIb mRNA
BASE COUNT 626 a 998 c 1040 g 669 t
ORIGIN
1 attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag
61 ccctctggct tctggagtgg gtgctgctgc tcttgggagc ttgtgctgcc cctccagcct
121 gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt
181 ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg
241 ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga
301 gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg
361 taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca
421 gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga
481 ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc
541 gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg
601 attttagctg ggacaagcgt tactgtgaag cgggcttcag ctcggtggtc actcaggccg
661 gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc
721 cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc
781 agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg
841 tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca
901 cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc
961 ggctgcgcgc agagcagatg gcgtcgtatt ttgggcattc agtcgctgtc actgacgtca
1021 acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggac agccgggcag
1081 accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg
1141 cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg
1201 ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc
1261 cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc
1321 tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct
1381 tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag
1441 cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc
1501 agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca
1561 agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc
1621 ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc
1681 ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctggacctg gatctgggcg
1741 gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc
1801 gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg
1861 gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg
1921 tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga
1981 cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca
2041 acgagggcga gggggcctat gaagcagagc tggcggtgca cctgccccag ggcgcccact
2101 acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg
2161 agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga
2221 taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct
2281 tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg
2341 acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc
2401 tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca
2461 aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc
2521 tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac
2581 agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact
2641 gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca
2701 gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa
2761 gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc
2821 agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc
2881 tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg
2941 tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct
3001 tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc
3061 tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggcaca
3121 ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga
3181 gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt
3241 ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc
3301 ccctcctgct gcctaataaa gagactgagc cct
//
LOCUS HUMGPIIIAA 4487 bp ss-mRNA PRI 27-JUL-1990
DEFINITION Human platelet glycoprotein IIIa (GPIIIa) mRNA, complete cds.
ACCESSION M35999
KEYWORDS platelet glycoprotein IIIa.
SOURCE Human megakaryocytes, cDNA to mRNA, clone IIIa[1,2,3].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4487)
AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and
Marguerie,G.
TITLE GPIIb and GPIIIa amino acid sequences deduced from human
megakaryocyte cDNAs
JOURNAL Mol. Biol. Rep. 14, 27-33 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 19 2385 plate glycoprotein IIIa (GPIIIa)
mRNA < 1 2603 GPIIIa mRNA
BASE COUNT 1046 a 1090 c 1186 g 1165 t
ORIGIN
1 ccgcgggagg cggacgagat gcgagcgcgg ccgcggcccc ggccgctctg ggtgactgtg
61 ctggcgctgg gggcgctggc gggcgttggc gtaggagggc ccaacatctg taccacgcga
121 ggtgtgagct cctgccagca gtgcctggct gtgagcccca tgtgtgcctg gtgctctgat
181 gaggccctgc ctctgggctc acctcgctgt gacctgaagg agaatctgct gaaggataac
241 tgtgccccag aatccatcga gttcccagtg agtgaggccc gagtactaga ggacaggccc
301 ctcagcgaca agggctctgg agacagctcc caggtcactc aagtcagtcc ccagaggatt
361 gcactccggc tccggccaga tgattcgaag aatttctcca tccaagtgcg gcaggtggag
421 gattaccctg tggacatcta ctacttgatg gacctgtctt actccatgaa ggatgatctg
481 tggagcatcc agaacctggg taccaagctg gccacccaga tgcgaaagct caccagtaac
541 ctgcggattg gcttcggggc atttgtggac aagcctgtgt caccatacat gtatatctcc
601 ccaccagagg ccctcgaaaa cccctgctat gatatgaaga ccacctgctt gcccatgttt
661 ggctacaaac acgtgctgac gctaactgac caggtgaccc gcttcaatga ggaagtgaag
721 aagcagagtg tgtcacggaa ccgagatgcc ccagagggtg gctttgatgc catcatgcag
781 gctacagtct gtgatgaaaa gattggctgg aggaatgatg catcccactt gctggtgttt
841 accactgatg ccaagactca tatagcattg gacggaaggc tggcaggcat tgtccagcct
901 aatgacgggc agtgtcatgt tggtagtgac aatcattact ctgcctccac taccatggat
961 tatccctctt tggggctgat gactgagaag ctatcccaga aaaacatcaa tttgatcttt
1021 gcagtgactg aaaatgtagt caatctctat cagaactata gtgagctcat cccagggacc
1081 acagttgggg ttctgtccat ggattccagc aatgtcctcc agctcattgt tgatgcttat
1141 gggaaaatcc gttctaaagt cgagctggaa gtgcgtgacc tccctgaaga gttgtctcta
1201 tccttcaatg ccacctgcct caacaatgag gtcatccctg gcctcaagtc ttgtatggga
1261 ctcaagattg gagacacggt gagcttcagc attgaggcca aggtgcgagg ctgtccccag
1321 gagaaggaga agtcctttac cataaagccc gtgggcttca aggacagcct gatcgtccag
1381 gtcacctttg attgtgactg tgcctgccag gcccaagctg aacctaatag ccatcgctgc
1441 aacaatggca atgggacctt tgagtgtggg gtatgccgtt gtgggcctgg ctggctggga
1501 tcccagtgtg agtgctcaga ggaggactat cgcccttccc agcaggacga gtgcagcccc
1561 cgggagggtc agcccgtctg cagccagcgg ggcgagtgcc tctgtggtca atgtgtctgc
1621 cacagcagtg actttggcaa gatcacgggc aagtactgcg agtgtgacga cttctcctgt
1681 gtccgctaca agggggagat gtgctcaggc catggccagt gcagctgtgg ggactgcctg
1741 tgtgactccg actggaccgg ctactactgc aactgtacca cgcgtactga cacctgcatg
1801 tccagcaatg ggctgctgtg cagcggccgc ggcaagtgtg aatgtggcag ctgtgtctgt
1861 atccagccgg gctcctatgg ggacacctgt gagaagtgcc ccacctgccc agatgcctgc
1921 acctttaaga aagaatgtgt ggagtgtaag aagtttgacc ggggagccct acatgacgaa
1981 aatacctgca accgttactg ccgtgacgag attgagtcag tgaaagagct taaggacact
2041 ggcaaggatg cagtgaattg tacctataag aatgaggatg actgtgtcgt cagattccag
2101 tactatgaag attctagtgg aaagtccatc ctgtatgtgg tagaagagcc agagtgtccc
2161 aagggccctg acatcctggt ggtcctgctc tcagtgatgg gggccattct gctcattggc
2221 cttgccgccc tgctcatctg gaaactcctc atcaccatcc acgaccgaaa agaattcgct
2281 aaatttgagg aagaacgcgc cagagcaaaa tgggacacag ccaacaaccc actgtataaa
2341 gaggccacgt ctaccttcac caatatcacg taccggggca cttaatgata agcagtcatc
2401 ctcagatcat tatcagcctg tgccaggatt gcaggagtcc ctgccatcat gtttacagag
2461 gacagtattt gtggggaggg atttcggggc tcagagtggg gtaggttggg agaatgtcag
2521 tatgtggaag tgtgggtctg tgtgtgtgta tgtgggggtc tgtgtgttta tgtgtgtgtg
2581 ttgtgtgtgg gagtgtgtaa tttaaaattg tgatgtgtcc tgataagctg agctccttag
2641 cctttgtccc agaatgcctc ctgcagggat tcttcctgct tagcttgagg gtgactatgg
2701 agctgagcag gtgttcttca ttacctcagt gagaagccag ctttcctcat caggccattg
2761 tccctgaaga gaagggcagg gctgaggcct ctcattccag aggaagggac accaagcctt
2821 ggctctaccc tgagttcata aatttatggt tctcaggcct gactctcagc agctatggta
2881 ggaactgctg gcttggcagc ccgggtcatc tgtacctctg cctcctttcc cctccctcag
2941 gccgaaggag gagtcaggga gagctgaact attagagctg cctgtgcctt ttgccatccc
3001 ctcaacccag ctatggttct ctcgcaaggg aagtccttgc aagctaattc tttgacctgt
3061 tgggagtgag gatgtctggg ccactcaggg gtcattcatg gcctggggga tgtaccagca
3121 tctcccagtt cataatcaca acccttcaaa gatttgcctt attggcagct ctactctgga
3181 ggtttgttta gaagaagtgt gtcaccctta ggccagcacc atctctttac ctcctaattc
3241 cacaccctca ctgctgtaga catttgctat gacctgggga tgtctctcat gaccaaatgc
3301 ttttcctcaa agggagagag tgctattgta gagccagagg tctggcccta tgcttccggc
3361 ctcctgtccc tcatccatag cacctccaca tacctggccc tgagccttgg tgtgctgtat
3421 ccatccatgg ggctgattgt atttaccttc tacctcttgg ctgccttgtg aaggaattat
3481 tcccatgagt tggctgggaa taagtgccag gatggaatga tgggtcagtt gtatcagcac
3541 gtgtggcctg ttcttctatg ggttacaacc tcatttaact cagtctttaa tctgagaggc
3601 cacagtgcaa ttttatttta tttttctcat gatgaggttt tcttaactta aaagaacatg
3661 tatataaaca tgcttgcatt atatttgtaa atttatgtgt atggcaaaga aggagagcat
3721 aggaaaccac acagacttgg gcagggtaca gacactccca cttggcatca ttcacagcaa
3781 gtcactggcc agtggctgga tctgtgaggg gctctctcat gatagaaggc tatggggata
3841 gatgtgtgga cacattggac ctttcctgag gaagagggac tgttcttttg tcccagaaaa
3901 gcagtggctc cattggtgtt gacatacatc caacattaaa agccaccccc aaatgcccaa
3961 gaaaaaaaga aagacttatc aacatttgtt ccatgagcag aaaactggag ctctggcctc
4021 agtgttacag ctaaataatc tttaattaag gcaagtcact ttcttcttct taaagctgtt
4081 tctagtttga gaaatgatgg gattttagca gccagtcttg aaggtctctt tcagtatcaa
4141 cattctaaga tgctgggact tactgtgtca tcaaatgtgc ggttaagatt ctctgggata
4201 ttgatactgt ttgtgttttt agttgggaga tctgagagac ctggctttgg caagagcaga
4261 tgtcattcca tatcaccttt ctcaatgaaa gtctcattct atcctctctc caaacccgtt
4321 ttccaacatt tgttaatagt tacgtctctc ctgatgtagc acttaagctt catttagtta
4381 ttatttcttt cttcactttg cacacatttg catccacata ttagggaagg aataagtagc
4441 tgcaaactat ctattcctgt attattgtgt taacattgag ataaacc
//
LOCUS MUSCDX1A 1597 bp ss-mRNA ROD 27-JUL-1990
DEFINITION Mouse Cdx-1 homeobox protein mRNA, complete cds.
ACCESSION M37163 M35125
KEYWORDS Cdx-1 homeobox protein.
SOURCE Mouse 8.5 day embryo, cDNA to mRNA, clone BH8.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1597)
AUTHORS Duprey,P., Chowdhury,K., Dressler,G.R., Balling,R., Simon,D.,
Guenet,J.-L. and Gruss,P.
TITLE A mouse gene homologous to the Drosophila gene caudal is expressed
in epithelial cells from the embryonic intestine
JOURNAL Genes Dev. 2, 1647-1654 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 158 736 Cdx-1 homeobox protein
site 389 571 Cdx-1 homeobox
BASE COUNT 326 a 523 c 444 g 304 t
ORIGIN
1 gccgcgcccg ctgggccctc cgacctatgc gccccctggc ccggcgcccg cgcccgcagt
61 accccgactt cgcgggttac acgcacgtgg agccggcgcc cgcgccctcc gacctgggct
121 gcgcttccct gcgcccaagg acgactgggc agctgctatg gcccgggccc cacggcctca
181 gcccgcagcc cggccccgct ggccttcggg ccctccggac tttagcccgg tgcccgcgcc
241 tcccgggcct ggtcccggca tcctagcgca gtccctcggg gctccgggcg caccgtcctc
301 gccaggagcg ccaaggcgga cgccctacga atggatgcgg cgcagcgtgg cggctgcagg
361 cggctgtggc agcggtaaga cccgaaccaa ggacaagtca cgtgtggtct acacagacca
421 ccaacgccta gagctggaaa aggagtttca ctacagccgg tacatcacta tccggcgcaa
481 gtccgagctg gctgctaacc tggggctcac agagcggcag gtaaagatct ggttccagaa
541 ccgccgggcc aaggagcgca aagtaaacaa gaagaaacag cagcagcagc agcccctgcc
601 tcccacacag ctgcccctgc ccctggatgg cactcccaca ccatcagggc cacccctagg
661 aagtctatgc cctactaatg ctggccttct gggcaccccc tccccagtgc ccgtcaagga
721 ggagtttcta ccctagcccc ttgcagcctg gggtctaggg atctagggac ttgaatgctg
781 ggcagctggc cttttcgggg cccaaggacg tgttctgagt gcagccctgc cttggccctt
841 tggggtcact gtggacaaac tgcctaccta ggacaagtta gcttgccctc ttcctgcctt
901 ccgttggctg ggctgtgtgg tgagcctgtt ggaccaagga gctcttccag ctcttgtgtc
961 ctagggcaca cagggtgatg gggaagccca attagaagtc tcaatcacct ggacatttcc
1021 caaagattca gaggctcaac gagctgtcaa gacaagattg aggcttcagc gtctcctcca
1081 agggttccag agtgaggtgg gaggctggta tctggaccag actggctctg gagagtaggc
1141 ggccattgaa acaaggtggt acaccacagt ctagactccc atgtgaaggg aagatgctat
1201 ctgcccacct gtctcctctt ccagcctcac tttcacctgc cctagcaccc catgctctgg
1261 ccctccccag cctggagcta caaagccatc aggactgaca tgaggtagag gctgggcaat
1321 tggccactgc tcttgagaga gaactcgccc ccaccttagt gtccagggag gcctgcggat
1381 ggagagaagg cactatgagt ctacctacct gcacatgccc accttcccac aggctgccca
1441 tcacacccca tttagccccc acaccatggg ggactcgcag ctgtacaggg cttatgtagg
1501 agtgtaatct cttggatgca gcttcagaaa taagttttcc cccttttaaa caatttataa
1561 aaatcataca aggcattaaa gaatttttaa aaagttt
//
LOCUS MUSHSP70B 3518 bp ds-DNA ROD 27-JUL-1990
DEFINITION Mouse heat shock protein 70.1 (hsp70.1) gene, complete cds.
ACCESSION M35021
KEYWORDS heat shock protein 70.1.
SOURCE Mouse (strain AJ) kidney DNA, clone pM[1.2,2.3].
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3518)
AUTHORS Hunt,C. and Calderwood,S.
TITLE Characterization and sequence of a mouse hsp70 gene and its
expression in mouse cell lines
JOURNAL Gene 87, 199-204 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 806 2734 heat shock protein 70.1 (hsp70.1)
signal 507 512 inverted CCAAT box
signal 547 552 TATA box
BASE COUNT 827 a 993 c 1014 g 684 t
ORIGIN
1 gatctcttct atttccctat tcaaacctaa aatgaagagg gagggggaga catggacaag
61 caagcattcc acaggcgccc ctgcccaacg ctgtcactca aaccaggacc caatcacaga
121 ctttttagcc aagccttatc ccgcctctct tgagaaactt tctgcgtccg ccatcctgta
181 ggaaggattt gtacacttta aactccctcc ctggtctgag tcccacactc tcaccaccca
241 gcaccttcag gagctgaccc ttaacagctt cacccacagg gaccccgaag ttgcgtcgcc
301 tccgcaacag tgtcaatagc agcaccagca cttccccaca ccctccccct caggaatccg
361 tactctctag cgaaccccag aaacctctgg agagttctgg acaagggcgg aacccacaac
421 tccgattact caagggaggc ggggaagctc caccagacgc gaaactgctg gaagattcct
481 ggccccaagg cctcctccgg ctcgctgatt ggcccagcgg agagtgggcg gggccggtga
541 agactcctta aaggcgcagg gcggcgagca gggcaccaga cgctgacagc tactcagaat
601 caaatctggt tccatccaga gacaagcgaa gacaagagaa gcagagcgag cggcgcgttc
661 ccgatcctcg gccaggacca gccttcccca gagcatccac gccgcggagc gcaaccttcc
721 caggagcatc cctgccgcgg agcgcaactt tccccggagc atccacgccg cggagcgcag
781 ccttccagaa gcagagcgcg gcgccatggc caagaacacg gcgatcggca tcgacctggg
841 caccacctac tcgtgcgtgg gcgtgttcca gcacggcaag gtggagatca tcgccaacga
901 ccagggcaac cgcacgaccc ccagctacgt ggccttcacc gacaccgagc gcctcatcgg
961 ggacgccgcc aagaaccagg tggcgctgaa cccgcagaac accgtgttcg acgcgaagcg
1021 gctgatcggc cgcaagttcg gcgatgcggt ggtgcagtcc gacatgaagc actggccctt
1081 ccaggtggtg aacgacggcg acaagcccaa ggtgcaggtg aactacaagg gcgagagccg
1141 gtcgttcttc ccggaggaga tctcgtccat ggtgctgacg aagatgaagg agatcgctga
1201 ggcgtacctg ggccacccgg tgaccaacgc ggtgatcacg gtgcccgcct acttcaacga
1261 ctctcagcgg caggccacca aggacgcggg cgtgatcgcc ggtctaaacg tgctgcggat
1321 catcaacgag cccacggcgg ccgccatcgc ctacgggctg gaccggaccg gcaagggcga
1381 gcgcaacgtg ctcatcttcg acctgggggg cggcacgttc gacgtgtcca tcctgacgat
1441 cgacgacggc atcttcgagg tgaaggccac ggcgggcgac acgcacctgg gaggggagga
1501 cttcgacaac cggctggtga gccacttcgt ggaggagttc aagaggaagc acaagaagga
1561 catcagccag aacaagcgcg cggtgcggcg gctgcgcacg gcgtgtgaga gggccaagag
1621 gacgctgtcg tccagcaccc aggccagcct ggagatcgac tctctgttcg agggcatcga
1681 cttctacaca tccatcacgc gggcgcggtt cgaagagctg tgctcggacc tgttccgcgg
1741 cacgctggag cccgtggaga aggccctgcg cgacgccaag atggacaagg cgcagatcca
1801 cgacctggtg ctggtgggcg gctcgacgcg catccccaag gtgcagaagc tgctgcagga
1861 cttcttcaac gggcgcgacc tgaacaagag catcaacccg gacgaggcgg tggcctacgg
1921 ggcggcggtg caggcggcca tcctgatggg ggacaagtcg gagaacgtgc aggacctgct
1981 gctgctggac gtggcgccgc tgtcgctggg cctggagact gcgggcggcg tgatgacggc
2041 gctcatcaag cgcaactcca ccatccccac caagcagacg cagaccttca ccacctactc
2101 ggacaaccag cccggggtgc tgatccaggt gtacgagggc gagagggcca tgacgcgcga
2161 caacaacctg ctggggcgct tcgagctgag cggcatcccg ccggcgccca ggggcgtgcc
2221 gcagatcgag gtgaccttcg acatcgacgc caacggcatc ctgaacgtca cggccaccga
2281 caagagcacc ggcaaggcca acaagatcac catcaccaac gacaagggcc gcctgagcaa
2341 ggaggagatc gagcgcatgg tgcaggaggc cgagcgctac aaggccgagg acgaggtgca
2401 gcgcgacagg gtggccgcca agaacgcgct cgagtcctat gccttcaaca tgaagagcgc
2461 cgtggaggac gagggtctca agggcaagct cagcgaggct gacaagaaga aggtgctgga
2521 caagtgccag gaggtcatct cctggctgga ctccaacacg ctggccgaca aggaggagtt
2581 cgtgcacaag cgggaggagc tggagcgggt gtgcagcccc atcatcagtg ggctgtacca
2641 gggtgcgggt gctcctgggg ctgggggctt cggggcccag gcgccgccga aaggagcctc
2701 tggctcagga cccaccatcg aggaggtgga ttagaggcct ctgctggctc tcccggtgtg
2761 gtctagaaaa cagactcttt gcacttgata gctgcttggg caccgattac tgtcaaggtt
2821 atttaaagtc ttcttcatgg ttcagtttaa agttacagtc tttcttaagg taattgcgtt
2881 gactgttaaa ttttgtatgc atatatatat atatatatat atatatatat atattcaaat
2941 atattcaaag taatgttggg agcagcactg tgcactgtac caggggatta tgttttatag
3001 ctaatgatgt gtaaagtcta aagatttttt tgtaattttt atatcagtgt tccagtagcc
3061 tgggaagaca tatagtctag ctgcccagtt ccctggagat ggtcatctct aagacaaagt
3121 gtcttaaaca aacgtcttgg cactgtgtac tacataactt tactcttttg tacttaaaac
3181 tttatctgct tgtccatgtt aaggttttgt ggtataacca gtatgttctt tgcatttaat
3241 ctaagtaggt taaagatggt gtatccttcc tgcatacatg tctacactgc caccctgtgt
3301 acattttttt ctttgcatca ctacaaacta atgaaaaaaa cttttatgac ttaaatattc
3361 aaaataaaag gttacaagta tattttgtct gtttgtatgt tggaagggct aatggattct
3421 gggcttctgt ggatttctta agtttttttt aagatttatt attatatgtg aacacattgt
3481 agctatcttc agacacacca gaaaagggca tcagatct
//
LOCUS RDVS10 1319 bp ds-RNA VRL 27-JUL-1990
DEFINITION Rice dwarf virus segment S10 mRNA, encoding an ORF, complete cds.
ACCESSION D00241
KEYWORDS S10 RNA.
SOURCE Rice dwarf virus, cDNA to viral RNA.
ORGANISM Rice dwarf virus
Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus.
REFERENCE 1 (bases 1 to 1319)
AUTHORS Omura,T., Minobe,Y. and Tsuchizaki,T.
TITLE Nucleotide sequence of segment S10 of the rice dwarf virus genome
JOURNAL J. Gen. Virol. 69, 227-231 (1988)
STANDARD simple automatic
COMMENT Three contiguous termination codons were detected at the third
triplet downstream from the first stop codon.
Corresponds to DDBJ (rel3.0) entry <RDVS1O>(D00241) dated
20-JUN-1988
FEATURES from to/span description
pept 27 1088 ORF
BASE COUNT 354 a 279 c 284 g 402 t
ORIGIN 5' end of the plus-sense strand of segment S10.
1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc
61 atcatgagct cctttgtgct cacgaaggac caagtattat ttccaagttt gatgcaatta
121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag
181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact
241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg
301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt
361 ttatctttat ccctgagact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc
421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt
481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc
541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag
601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat
661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac
721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt
781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc
841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc
901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca
961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta
1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca
1081 gttcctgatc gcactgatga tagtgctttt catttatcaa aatattacgg tggggtcatc
1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac
1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccataggtg ggttagagtc
1261 gtgtgcgcac gcacttagtt ctactaatgc gtgcccccac actgcgcagg gattctgat
//
LOCUS RDVS10A 1320 bp ds-RNA VRL 27-JUL-1990
DEFINITION Rice dwarf virus segment 10 mRNA, encoding an ORF, complete cds.
ACCESSION M35118
KEYWORDS .
SOURCE Rice dwarf virus, cDNA to viral RNA, clones pRD[545,112,367].
ORGANISM Rice dwarf virus
Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus.
REFERENCE 1 (bases 1 to 1320)
AUTHORS Uyeda,I., Matsumura,T., Sano,T., Ohshima,K. and Shikata,E.
TITLE Nucleotide sequence of rice dwarf virus genome segment 10
JOURNAL Proc. Jpn. Acad. 63, 227-230 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 27 1088 ORF
BASE COUNT 357 a 280 c 282 g 401 t
ORIGIN
1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc
61 atcatgagct cctttctgct cacgaaggac caagtattat ttccaagttt gatgcaatta
121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag
181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact
241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg
301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt
361 ttatctttat ccctgaaact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc
421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt
481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc
541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag
601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat
661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac
721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt
781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc
841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc
901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca
961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta
1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca
1081 gttcctaatc gcactgatga tagtgcttat catttatcaa aatattacgg tggggtcatc
1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac
1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccatagggt gggttagagt
1261 cgtgtgcgca cgccacttag ttctactaat gcgtgccccc acatgcgcag ggattctgat
//
LOCUS SRVAGSM 3651 bp ss-RNA VRL 27-JUL-1990
DEFINITION Sapporo rat virus mRNA M segment, encoding an ORF, complete cds.
ACCESSION M34882
KEYWORDS .
SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero
E6 cell line C1008.
ORGANISM Sapporo rat virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Bunyaviridae; Hantavirus.
REFERENCE 1 (bases 1 to 3651)
AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and
Schmaljohn,C.S.
TITLE Coding properties of the S and the M genome segments of Sapporo rat
virus: Comparison to other causative agents of hemorrhagic fever
with renal syndrome
JOURNAL Virology 176, 114-125 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 47 3448 M segment ORF
BASE COUNT 1119 a 692 c 755 g 1085 t
ORIGIN
1 tagtagtaga ctccgcaaga aacagcagtt aaagaacaat aggatcatgt ggagtttgct
61 attactggcc gctttagttg gccaaggctt tgcattaaaa aatgtatttg acatgagaat
121 tcagttgccc cactcagtca actttgggga aacaagtgtg tcaggctata cagaatttcc
181 cccactctca ttacaggagg cagaacagct agtgccagag agctcatgca acatggacaa
241 ccaccagtca ctctcaacaa taaataaatt aaccaaggtc atatggcgga aaaaagcaaa
301 tcaggaatca gcaaaccaga attcatttga agttgtggaa agtgaagtca gctttaaagg
361 gttgtgtatg ttaaagcata gaatggttga agaatcatat agaaatagga gatcagtaat
421 ctgttatgat ctagcctgta atagtacatt ctgtaaacca actgtttata tgattgttcc
481 tatacatgct tgcaacatga tgaaaagctg tttgattggc cttggcccct acagaatcca
541 ggttgtctat gaaaggacat actgcactac gggtatattg acagaaggaa aatgctttgt
601 ccctgacaag gctgttgtca gtgcattgaa aagaggcatg tatgctatag caagcataga
661 gacaatctgc ttttttattc atcagaaagg gaatacatat aagatagtga ctgccattac
721 atcagcaatg ggctccaaat gtaataatac agatactaaa gttcaaggat attatatctg
781 tattattggt ggaaactccg cccctgtata tgcccctgct ggtgaagact tcagagcaat
841 ggaggttttt tctgggatta ttacatcacc acatggagaa gaccatgacc tacccggcga
901 agaaatcgca acgtaccaga tttcagggca gatagaggca aaaatccctc atacagtgag
961 ctccaaaaac ttaaaattga ctgcttttgc aggtattcca tcatactcat caactagtat
1021 attggctgct tcagaagatg gtcgtttcat atttagtcct ggtttatttc ctaacctaaa
1081 tcagtcagtc tgtgacaaca atgcactccc tttaatctgg aggggcctaa ttgatttaac
1141 gggatactat gaggcagtcc acccttgcaa tgtgttctgt gtcttatcag gaccaggtgc
1201 ttcatgtgag gccttttcag aaggaggtat tttcaatatt acttctccaa tgtgtctggt
1261 gtctaagcaa aatagattta gagcagctga gcagcagatt agctttgtct gccaaagagt
1321 tgatatggat attatagtgt actgtaatgg tcagaaaaaa acaatcctaa caaaaacatt
1381 agttataggc caatgtattt atactattac aagtctcttt tcactgttac caggggttgc
1441 ccattctatt gctattgagt tgtgtgttcc agggtttcat ggctgggcca cagctgcact
1501 tttgattaca ttctgcttcg gctgggtatt gattcctgca tgtacattag ctattctttt
1561 agtccttaag ttctttgcaa atatccttca tacaagcaat caagagaacc gattcaaagc
1621 cattctacgg aaaataaagg aggagtttga aaaaacaaag ggttccatgg tttgtgagat
1681 ctgtaagtat gagtgtgaaa cattaaagga attgaaggca cataacctat catgtgttca
1741 aggagagtgc ccatattgct ttacccactg tgaaccgaca gaaactgcaa ttcaggcaca
1801 ttacaaagtt tgtcaagcca cccaccgatt cagagaagat ttaaaaaaga ctgtaactcc
1861 tcaaaatatt gggccaggct gttaccgaac actaaatctt tttaggtata aaagtaggtg
1921 ttatattctg acaatgtgga ctcttcttct cattattgaa tccatcctct gggcagcaag
1981 tgcagcagaa atcccccttg tccctctctg gacagataat gctcatggcg ttgggagtgt
2041 tcctatgcat acggatcttg aattagactt ctctttgcca tccagttcta agtacacata
2101 caaaagacat ctcacaaacc cagttaatga ccaacagagt gtctcattgc atatagaaat
2161 tgaaagtcaa ggcattggtg ctgctgttca tcatcttgga cattggtatg atgcaagatt
2221 gaatctaaaa acctcatttc attgttatgg tgcctgcaca aaatatcaat acccatggca
2281 cactgcaaaa tgccattttg agaaagatta tgagtatgaa aatagctggg cttgcaaccc
2341 cccagattgc ccaggggttg gtacaggttg tactgcttgt ggattatatc tagatcaatt
2401 gaagccggta ggaacagcct ttaaaattat aagtgtaaga tacagtagaa aagtgtgcgt
2461 gcagtttggt gaagaacacc tttgtaaaac aattgatatg aatgattgct ttgtgactag
2521 gcatgccaaa atatgtataa ttgggactgt atctaagttt tctcaaggtg acactctact
2581 atttctgggg cccatggaag gaggtggtat aatctttaaa cactggtgta catctacctg
2641 tcactttgga gaccctggtg atgtcatggg tccaaaagat aaaccattta tttgccctga
2701 attcccaggg caatttagga aaaaatgtaa ctttgccaca actccagttt gtgaatatga
2761 tggaaacatt atatcaggct ataagaaagt acttgcaaca attgattctt tccaatcatt
2821 taacacaagc aatatacact tcactgatga gagaattgaa tggagagacc ctgatggcat
2881 gcttcgggat catattaata ttgttatttc taaagatatt gattttgaaa atttggctga
2941 gaatccttgt aaagtagggc tccaggcagc aaacatagaa ggtgcctggg gttcaggtgt
3001 cgggtttaca ctcacatgca aggtgtctct cacagaatgc ccaacatttc ttacatcaat
3061 aaaggcctgt gacatggcaa tttgttatgg tgcagaaagt gtgacactct cacgaggaca
3121 aaatactgtc aaaattaccg ggaaaggtgg ccatagtggt tcttcattca aatgctgtca
3181 tgggaaagaa tgttcatcaa ctggcctcca agccagtgca ccacatctgg ataaggtaaa
3241 tggtatctct gagttagaaa acgagaaagt ttatgatgac ggtgcacctg aatgtggcat
3301 tacttgttgg tttaaaaaat caggtgaatg ggttatgggt ataatcaatg ggaactgggt
3361 tgtcctaatt gtcttgtgtg tactgctgct cttttctctt atcctgttga gcatcttgtg
3421 tcctgttaga aagcataaaa aatcataaat cccacctaac aatcttcaca tcatgtatcg
3481 attttcaaac actttatcat ttagaactta acttggcact actatctgat aactgacttt
3541 catttttatt tttatatgga ttaattacta aaaaaaatac tctcttctat ctcccaatct
3601 tttattgatt caccggggtg ctgtcttgac atctggcggc gtctactact a
//
LOCUS SRVAGSS 1769 bp ss-RNA VRL 27-JUL-1990
DEFINITION Sapporo rat virus mRNA S segment, encoding an ORF, complete cds.
ACCESSION M34881
KEYWORDS .
SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero
E6 cell line C1008.
ORGANISM Sapporo rat virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Bunyaviridae; Hantavirus.
REFERENCE 1 (bases 1 to 1769)
AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and
Schmaljohn,C.S.
TITLE Coding properties of the S and the M genome segments of Sapporo rat
virus: Comparison to other causative agents of hemorrhagic fever
with renal syndrome
JOURNAL Virology 176, 114-125 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 43 1332 S segment ORF
BASE COUNT 569 a 331 c 410 g 459 t
ORIGIN
1 tagtagtaga ctccctaaag agctactaca ctaacaagaa aaatggcaac tatggaagaa
61 atccagagag aaatcagtgc tcacgagggg cagcttgtga tagcacgcca gaaggtcaag
121 gatgcagaaa agcagtatga gaaggatcct gatgacttaa acaagagggc actgcatgat
181 cgggagagtg tcgcagcttc aatacaatca aaaattgatg aattgaagcg ccaacttgcc
241 gacagattgc agcagggaag aacatccggg caggaccggg atcctacagg ggtagagcca
301 ggtgatcatc ttaaggaaag atcagcacta agctacggga atacactgga cctgaatagt
361 cttgacattg atgaacctac aggacagaca gctgattggc tgaccataat tgtctatctg
421 acatcattcg tggtcccgat catcttgaag gcactgtaca tgttaacaac acgaggtagg
481 cagacttcaa aggacaacaa ggggatgagg atcagattca aggatgacag ctcatatgag
541 gatgtcaatg gaatcagaaa gcccaaacat ctgtatgtgt caatgccaaa cgcccaatcc
601 agcatgaagg ctgaagagat aacaccagga agattccgca ctgcagtatg tggactatat
661 cctgcacaga taaaggcaag gaatatggta agccctgtca tgagtgtagt tgggttcttg
721 gcactggcaa aagactggac atcgagaatt gaagaatggc tcggtgcacc ctgcaaattc
781 atggcggagt ctcttattgc cgggagttta tctgggaatc ctgtgaatcg tgactatatc
841 agacagagac aaggtgcact tgcagggatg gagccaaagg aatttcaagc cctcaggcaa
901 cattcaaagg atgctggatg tacactagtt gaacatattg agtcaccatc atcaatatgg
961 gtgtttgctg gggcccctga taggtgtcca ccaacatgct tgtttgtcgg agggatggct
1021 gaattaggtg ccttcttttc tatacttcag gatatgagga acacaatcat ggcttcaaaa
1081 actgtgggca cagctgatga aaagcttcga aagaaatcat cattctatca atcatacctc
1141 agacgcacac aatcaatggg aatacaactg gaccagagga taattgttat gtttatggtt
1201 gcctggggaa aggaggcagt ggacaacttt catctcggtg atgacatgga tccagagctt
1261 cgtagcctgg ctcagatctt gattgaccag aaagtgaagg aaatctcaaa ccaggaacct
1321 atgaaattat aagtacataa atatataatc aatactaact ataggttaag aaatactaat
1381 cattagttaa taagaatata gatttattga ataatcatat taaataatta ggtaagttaa
1441 ctagtattta gttaagttag ctaattgatt tatatgattg tcacaattaa atgtaatcat
1501 aagcacaatc actgccatgt ataatcacgg gtatacgggt ggttttcata tggggaacag
1561 ggtgggctta gggccaggtc accttaagtg accttttttt gtatatatgg atgtagattt
1621 caattgatcg aatactaatc ctactgtcct cttttctttt cctttctcct tctttactaa
1681 caacaacaaa ctacctcaca ccttaatata tactacttta ttaagttgtt aagttgtgtc
1741 tttttgggga gtaagggagt ctactacta
//
LOCUS TRNSTIIA 1315 bp ds-DNA BCT 27-JUL-1990
DEFINITION Transposon Tn4521 (from E.coli) heat-stable toxin II gene,
complete cds.
ACCESSION M35123
KEYWORDS heat-stable toxin II; insertion sequence; transposase.
SOURCE Transposon Tn4521 (from Escherichia coli) DNA.
ORGANISM Transposon Tn4521
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1315)
AUTHORS Hu,S.T. and Lee,C.H.
TITLE Characterization of the transposon carrying the STII gene of
enterotoxigenic Escherichia coli
JOURNAL Mol. Gen. Genet. 214, 490-495 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 879 400 (c) heat-stable toxin II (STII)
trns 88 607 insertion sequence IS2 (bases 17-536)
trns 608 970 insertion sequence IS2 (bases 969-1327)
BASE COUNT 313 a 349 c 320 g 333 t
ORIGIN
1 gccgccggca gatggagctg gatattctaa aaaagacccg ggcatcagta tcagtcacct
61 gaacaacaga gaaaaaacaa agcttgcttt ccagacacct gttatcactt aacccattac
121 tggcccgctg ccgtagatat tcccgtggcg agcgataacc cagtgcacta tgcggatgcc
181 attcgttata atgctcgaac gcctctgcaa ggttctttgc tgccgttaac ccgtctggtt
241 tgggcatgat actgatgtaa tcacgcttta tcgttttcac gaagctctct gctatgccgt
301 tactctccgg actccgcacc gccgtgttct tcggttcaag tcccaacatc cgggcgaact
361 ggcgtgtttc attagcccgg tagcatgaac cattatccgt cagccattcc actggcgata
421 tcggaagctc gctgccgaag gggcgtacca ccgctcccag catgacgtcc tgtactgttt
481 cactgtcgaa gccgcccgta gtaaccgccc agtgcagtgc ctcacggtca cagcagtcca
541 gcgcgaacgt gactcgcagt ttttctccgt tatcacagcg gaactcgaac ccgtcagagc
601 accatcgttt tttttgccga gcaggcgctg gagttcttta atctgcttca tggcagcagc
661 aagttcagag gcaggaacga cctgttctcc tgcggccaca gcagtaagac ttcccttctg
721 gtattgcttg cgccagagaa ataactggct ggctgccgca ccgtgttgcc gggtaacaag
781 ggagcccgtc attcccggtt caaagctctg ctgaacaata gcgatctttt cctgtgtggt
841 acgccgtctg cgtttctccg gccctaagac atcactcatc tgctctccaa tgactagtct
901 aaaaactagt attaagacta tcacttaaat aagtgatatt ggttgtctgg agattcaggg
961 ggccagtcta cggcctatgt cggtctggtg tcaaaacagt caggctccgg agataaagtt
1021 cgactactgg gaataagcaa gtgtggtgat acttatctcc gaacattatt tattcatgga
1081 gcaagggcgg cagcattgct ggtcaaagaa ctagggccat ggatacacaa cttaagatac
1141 aacgactgtc tggtgtttca gggggccagc tcaaacattt ggcaaagata tacgtcttct
1201 ttacaatccg gaggtgttaa cggctctact tgtatcagct acgttctggt tatacgtaat
1261 agcataagaa tggtcgtgcg catacacgct ctggaaagca gaactagctg gacga
//GenBank-Updates@genbank.bio.net (07/28/90)
LOCUS MUSMDR1A 4924 bp ss-mRNA ROD 28-JUL-1990
DEFINITION Mouse P-glycoprotein (mdr1a) mRNA, complete cds.
ACCESSION M33581
KEYWORDS P-glycoprotein.
SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line
J774.2-vinblastine resistant subline J7.V1-1, cDNA to mRNA, library
pUC18-cDNA and pGEM-zf, clones pV1.PRC2, pV1.3, pV1.20, and pV1.10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4924)
AUTHORS Hsu,S., Cohen,D., Lothstein,L., Kirschner,L.S., Hartstein,M. and
Horwitz,S.B.
TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter
reveals the basis for differential transcript heterogeneity in
multidrug-resistant J774.2 cells
JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.S.Kirschner, 05-APR-1990. Albert Einstein College of Medicine,
1300 Morris Park Ave, Bronx, NY 10461
FEATURES from to/span description
pept 137 3967 P-glycoprotein (mdr1a)
signal 4315 4320 poly-A signal
signal 4898 4903 poly-A signal
BASE COUNT 1450 a 1021 c 1210 g 1243 t
ORIGIN Chromosome 5.
1 acagtggaac agcggtttcc aggagctgct ggtcccatct tccaaggctc tgctcaactc
61 agagccgctt cttccaaagt ctacatcttg gtggactttg cagaggaaac cgggagtaga
121 gacacgtgag gccgtgatgg aacttgaaga ggaccttaag ggaagagcag acaagaactt
181 ctcaaagatg ggcaaaaaga gtaaaaagga gaagaaagaa aagaaaccag cagtcagtgt
241 gcttacaatg tttcgttatg caggttggct agacaggttg tacatgctgg tgggaactct
301 ggctgctatt atccatggag tggcgctccc acttatgatg ctgatctttg gtgacatgac
361 agatagcttt gcaagtgtag gaaacgtctc taaaaacagt actaatatga gtgaggccga
421 taaaagagcc atgtttgcca aactggagga agaaatgacc acgtacgcct actattacac
481 cgggattggt gctggtgtgc tcatagttgc ctacatccag gtttcatttt ggtgcctggc
541 agctggaaga cagatacaca agatcaggca gaagtttttt catgctataa tgaatcagga
601 gataggctgg tttgatgtgc atgacgttgg ggagctcaac acccggctca cagatgatgt
661 ttccaaaatt aatgaaggaa ttggtgacaa aatcggaatg ttcttccagg caatggcaac
721 attttttggt ggttttataa taggatttac ccgtggctgg aagctaaccc ttgtgatttt
781 ggccatcagc cctgttcttg gactgtcagc tggtatttgg gcaaagatat tgtcttcatt
841 tactgataag gaactccatg cttatgcaaa agctggagca gttgctgaag aagtcttagc
901 agccatcaga actgtgattg cgtttggagg acaaaagaag gaacttgaaa ggtacaataa
961 caacttggaa gaagctaaaa ggctggggat aaagaaagct atcacggcca acatctccat
1021 gggtgcagct tttctcctta tctatgcatc atatgctctg gcattctggt atgggacttc
1081 cttggtcatc tccaaagaat actctattgg acaagtgctc actgtcttct tttccgtgtt
1141 aattggagca ttcagtgttg gacaggcatc tccaaatatt gaagccttcg ccaatgcacg
1201 aggagcagct tatgaagtct tcaaaataat tgataataag cccagtatag acagcttctc
1261 aaagagtggg cacaaaccag acaacataca aggaaatctg gaatttaaga atattcactt
1321 cagttaccca tctcgaaaag aagttcagat cttgaagggc ctcaatctga aggtgaagag
1381 cggacagacg gtggccctgg ttggcaacag tggctgtgga aaaagcacaa ctgtccagct
1441 gatgcaaagg ctctacgacc ccctagatgg catggtcagt atcgacggac aggacatcag
1501 aaccatcaat gtgaggtatc tgagggagat cattggtgtg gtgagtcagg aacctgtgct
1561 gtttgccacc acgatcgccg agaacattcg ctatggccga gaagatgtca ccatggatga
1621 gattgagaaa gctgtcaagg aagccaatgc ctatgacttc atcatgaaac tgccccacca
1681 atttgacacc ctggttggtg agagaggggc gcacgtgagt gggggacaga aacagagaat
1741 cgccattgcc cgggccctgg tccgcaatcc caagatcctt ttgttggacg aggccacctc
1801 agccctggat acagaaagtg aagctgtggt tcaggccgca ctggataagg ctagagaagg
1861 ccggaccacc attgtgatag ctcatcgctt gtctaccgtt cgtaatgctg acgtcattgc
1921 tggttttgat ggtggtgtca ttgtggagca aggaaatcat gatgagctca tgagagaaaa
1981 gggcatttac ttcaaacttg tcatgacaca gacagcagga aatgaaattg aattaggaaa
2041 tgaagcttgt aaatctaagg atgaaattga taatttagac atgtcttcaa aagattcagg
2101 atccagtcta ataagaagaa gatcaactcg caaaagcatc tgtggaccac atgaccaaga
2161 caggaagctt agtaccaaag aggccctgga tgaagatgta cctccagctt ccttttggcg
2221 gatcctgaag ttgaattcaa ctgaatggcc ttattttgtg gttggtatat tctgtgccat
2281 aataaatgga ggcttacagc cagcattctc cgtaatattt tcaaaagttg taggggtttt
2341 tacaaatggt ggcccccctg aaacccagcg gcagaacagc aacttgtttt ccttgttgtt
2401 tctgatcctt gggatcattt ctttcattac attttttctt cagggcttca catttggcaa
2461 agctggagag atcctcacca agcgactccg atacatggtt ttcaaatcca tgctgagaca
2521 ggatgtgagc tggtttgatg accctaaaaa caccaccgga gcactgacca ccaggctcgc
2581 caacgatgct gctcaagtga aaggggctac agggtctagg cttgctgtga ttttccagaa
2641 catagcaaat cttgggacag gaatcatcat atccctaatc tatggctggc aactaacact
2701 tttactctta gcaattgtac ccatcattgc gatagctgga gtggttgaaa tgaaaatgtt
2761 gtctggacaa gcactgaaag ataagaagga actagaaggt tctggaaaga ttgctacgga
2821 agcaattgaa aacttccgca ctgttgtctc tttgactcgg gagcagaagt ttgaaaccat
2881 gtatgcccag agcttgcaga taccatacag aaatgcgatg aagaaagcac acgtgtttgg
2941 gatcacgttc tccttcaccc aggccatgat gtatttttct tatgctgctt gtttccggtt
3001 cggtgcctac ttggtgacac aacaactcat gacttttgaa aatgttctgt tagtattctc
3061 agctattgtc tttggtgcca tggcagtggg gcaggtcagt tcattcgctc ctgactatgc
3121 gaaagcaaca gtgtcagcat cccacatcat caggatcatt gagaaaaccc ccgagattga
3181 cagctacagc acgcaaggcc taaagccgaa tatgttggaa ggaaatgtgc aatttagtgg
3241 agtcgtgttc aactatccca cccgacccag catcccagtg cttcaggggc tgagccttga
3301 ggtgaagaag ggccagacgc tggccctggt gggcagcagt ggctgcggga agagcacagt
3361 ggtccagctg ctcgagcgct tctacgaccc catggctgga tcagtgtttc tagatggcaa
3421 agaaataaag caactgaatg tccagtggct ccgagcacag ctgggcattg tgtcccaaga
3481 gcccattctc tttgactgca gcatcgcaga gaacattgcc tacggagaca acagccgggt
3541 cgtgtcttat gaggagattg tgagggcagc caaggaggcc aacatccacc agttcatcga
3601 ctcgctacct gataaataca acaccagagt aggagacaaa ggcactcagc tgtcgggtgg
3661 gcagaagcag cgcatcgcca tcgcacgcgc cctcgtcaga cagcctcaca ttttacttct
3721 ggacgaagca acatcagctc tggatacaga aagtgaaaag gttgtccagg aagcgctgga
3781 caaagccagg gaaggccgca cctgcattgt gatcgctcac cgcctgtcca ccatccagaa
3841 cgcggacttg atcgtggtga ttcagaacgg caaggtcaag gagcacggca cccaccagca
3901 gctgctggcg cagaagggca tctacttctc aatggtcagt gtgcaggctg gagcaaagcg
3961 ctcatgaact gtgaccatgt aagatgttaa gtatttttat tgtttgtatt catatatggt
4021 gtttaatcca agtcaaaagg aaaacactta ctaaaatagc cagttatcta ttttctgcca
4081 cagtggaaag catttagttt ggtttagagt cttcagaggc tttgtaatta aaaaaacaaa
4141 aatagataca gcatcaaatg gagattaatg ctttaaaatg cactataaaa tttataaaag
4201 ggttaaaagt gaatgtttga taatatatac ttttatttat actttctcat ttgtaactat
4261 aactgatttc tgcttaacaa attatgtatg tatcaaaaat tactgaaatg tttgtataaa
4321 gtatatatag tgaaactgag cattcatatt tttgagttat tttgctcaaa tgcatgcgaa
4381 attatatatt gtcccaactg ggatattgta cataatttta gcctttaaaa aacagtccat
4441 tactgggggg agggggcatc actctatggg caaagtgtta ctcagacatg ggcacctgag
4501 ttcagatccc taccacctaa gtaagcagac aaggtgtggt gtttttgtaa tgccagtgct
4561 agaggcagaa aagacagatc ctgcaggctc agtggctggc caaacagcct agccaacata
4621 gcgcgttcca ggttcagtga gaaaacttgt ctcaaaaatc agagggaaaa gcaaatgagg
4681 tgtcagccat gtgcactcat gcaaatgcca tacatgcaga agtatgtgca cacacacgca
4741 cacattaacc aacgactagc aaggaaaatg aaggtggata agaggggtgg gactgggaca
4801 aaggagggta cctggatgaa tatgactgaa ggacgttatg tacacatatg aaaacgtcgt
4861 actgaaactc actacaatgt atacttaata tattgctaat aaaatatttt taaaagaaaa
4921 aaat
//
LOCUS MUSMDRXX 2873 bp ds-DNA ROD 28-JUL-1990
DEFINITION Mouse P-glycoprotein (mdr1a) gene, exons 1 and 2.
ACCESSION M33580
KEYWORDS P-glycoprotein.
SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line
J774.2-vinblastine resistant subline J7.V1-1 DNA, clone pV1.1a.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2873)
AUTHORS Hsu,S., Cohen,D., Lothstein,L., Kirschner,L.S., Hartstein,M. and
Horwitz,S.B.
TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter
reveals the basis for differential transcript heterogeneity in
multidrug-resistant J774.2 cells
JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.S.Kirschner, 05-APR-1990. Albert Einstein C., 1300 Morris Pk,
Bronx, 10461. E-mail: kirschne@aecom.yu.edu.
FEATURES from to/span description
pre-msg 1992 > 2873 P-glycoprotein mRNA and introns
IVS 2120 2606 P-glycoprotein intron A
IVS 2678 > 2873 P -glycoprotein intron B
signal 1904 1912 CAAT box
signal 1956 1963 TATA box
site 1880 1887 SP-1 site
site 1921 1927 SP-1 site
site 1937 1944 SP-1 site
site 1869 1875 AP-1 site
rpt 1 1300 L1Md repetitive element
BASE COUNT 860 a 621 c 714 g 678 t
ORIGIN Chromosome 5.
1 gaattctcac ctgaggaata ccgaatccag agaaacacct gaaaaaatgt tcaacatcct
61 taatcatcag ggaaatgcaa atcaaaacaa ccctgagatt ccacctcaca ccagtcagaa
121 tggctaagat caaaaattca ggtgacagca gatgctggcg aggatgtgga gaaagaggaa
181 cactcctcca ttgttggtgg gagtgcaggc ttgtacaacc actctggaaa tcagtctggc
241 ggttcctcag aaaactggac atagtactct cggaggatcc agcaatacct ctcctgggca
301 tatatccaga agatgcccca acaggtaaga aggacacatg ctccactatg ttcatagcag
361 ccttatttat aatagccaga agctggaaag aacctagatg cccctcaaca gaggaatgga
421 tacagaaaat gtggtacatc tacacaatgg agtactactc agctattaaa aagaatgaat
481 ttatgaaatt cctagccaaa tggatggacc tggggggcat catcctgagt gaggtaacac
541 attcacaaag aaactcacac aatatgtatt cactgataag tggatattag ccccaaacct
601 aggataccca agatataaga tataatttgc taaacacatg aaactcaagg agaatgaaga
661 ctgaagtgtg gacactatgc ccctccttag atttgggaac aaaacaccca tggaaggagt
721 tacagagacg gagtttggag ctgagatgaa aggatggacc atgtagagac tgccatagcc
781 agggatccac cccataatca gcatccaaac gctgacacca ttgcatacac tagcaagatt
841 ttattgaaag gacgcagatg tagctgtctc ttgtgagact atgccggggc cagcaaacac
901 agaagtggat gctcacagtc agctaatgga tggatcatag ggctcccaat ggaggagcta
961 gagaaagtag ccaaggagct aaagggatct gcaaccctat aggtggaaca acattatgag
1021 ctaaccagta ccccggagct cttgactcta gctgcatata tatcaaaaga tggcctagtc
1081 ggccatcact ggaaagagag gcccattgga cttgcaaact ttatatgccc cagtacaggg
1141 gaataccagg gccaaaaagg gggagtgggt gggcagggga gtgggggtgg gtggatatgg
1201 gggacttttg gtatagcatt ggaaatgtaa atgagttaaa tacctaataa aaaatggaaa
1261 aaaaaataaa ataaaaataa gatgaaactg gaaaaaaaaa gttatgttta ataattccaa
1321 ttgaactgta agaatttcag atgccctgga aaaacatgga cattggttta gtacctaaaa
1381 gttcaaaata ttatatattt ttaaatacca ttttacactg aaatactcca tttatatact
1441 ggggactgtc ctctttctgg tttgctttgt tttgtttaat aaaagaaata aaccaatcta
1501 cctgaggaac tgtgaactat attgaagaaa agcctgcacg ggggttctct taccttttca
1561 agagtgcttc aaagaaggga aatttactga caggcaaggt ctgtacccat tgtttaattg
1621 tctgttagat gttatgcata gaatacgtct tttaacttag ccaaatgcag aaggccaagt
1681 gcactatcta caaacacata actctatata tagacatgtg catggccgtg tagagatgag
1741 actctgcaag tgtgtctcta atgattcggg ggatatgagt ttgtctaatt gacctttgag
1801 agggaaacca gactgcacat ttcatctaca aatccaacct gtttcgcaat ttctccagca
1861 ataatacttg agtcaagctg ggccgggagc tggttaacct ccaggtcaaa ctcactggct
1921 gggcgggact gcgcctgggc gtagattgag catgctaaat ttactctcct gtccacagaa
1981 agcccaggca cagtggaaca gcggtttcca ggagctgctg gtcccatctt ccaaggctct
2041 gctcaactca gagccgcttc ttccaaagtc tacatcttgg tggactttgc agaggaaacc
2101 gggagtagag acacgtgagg taagcatttc ctaggaaggg tcgggtgttc cggataccag
2161 agcctggtcc gggtgtcagc gtaatcgtga gtctgtgggg accaagtggc gacacaagag
2221 tcgctccagg agcacccgca gcatcagctt tcaggacggt gttttccgcg ccaccctgtg
2281 ctgtggatct cgctgcccag ctcgcagcca ggggtggtgg aggagcgcgc cagggcgagg
2341 ggacccagca ggcgggtggc ggacctagag ccgagcaccc ggtccacgca ggtgacacag
2401 cttcccggga ttccccagtg agttacctcc aggccctctc cggcagcatc agggcggggc
2461 tcctcctcac cactgggctc tgcggggcag tgagctttgc ataaactctg gtcccgtgtt
2521 tggctaatga actgtggttt ctccccaggt cgtgatggaa cttgaagagg accttaaggg
2581 aagagcagac aagaacttct ccccaggtcg tgatggaact tgaagaggac cttaagggaa
2641 gagcagacaa gaacttctca aagatgggca aaaagaggta gccagattgt ttcactttcg
2701 tactttactt gtcttgtaca ttcgggcaat tagtttgtag cctccagcac tgtacttgat
2761 tagtgggtgt tatttcagac ttcagaaatg taaaccagcc cttggaagga actcctcgct
2821 tggagcagtc cttcaaatgt gtgtgacaga tcaatcaatg attctgtgaa ttc
//
LOCUS HUMKSAA 1504 bp ss-mRNA PRI 28-JUL-1990
DEFINITION Human adenocarcinoma-associated antigen (KSA) mRNA, complete cds.
ACCESSION M32325
KEYWORDS adenocarcinoma-associated antigen.
SOURCE Human cell line UCLA-P3, cDNA to mRNA, clone AG[1,1338,933].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1504)
AUTHORS Strnad,J., Hamilton,A.E., Beavers,L.S., Gamboa,G.C., Apelgren,L.D.,
Taber,L.D., Sportsman,J.R., Bumol,T.F., Sharp,J.D. and Gadski,R.A.
TITLE Molecular cloning and characterization of a human
adenocarcinoma/epithelial cell surface antigen complementary DNA
JOURNAL Cancer Res. 49, 314-317 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.D. Sharp, 22-FEB-1990.
There are a few base differences between the sequence presented
here and that which appears in entry M26481. The difference occurs
mostly in the Poly-A signal.
FEATURES from to/span description
pept 155 1099 adenocarcinoma-associated antigen precursor
(KSA)
sigp 155 223 adenocarcinoma-associated antigen signal
peptide
matp 398 1096 adenocarcinoma-associated antigen
mRNA < 1 1504 adenocarcinoma-associated antigen mRNA
site 1486 1491 polyadenylation site
BASE COUNT 442 a 302 c 356 g 404 t
ORIGIN
1 gagcgagcac cttcgacgcg gtccggggac cccctcgtcg ctgtcctccc gacgcggacc
61 cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt gtcccactcc cggcgcacgc
121 cctcccgcgc ccctcttctc ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt
181 cgggcttctg cttgccgcgg cgacggcgac ttttgccgca gctcaggaag aatgtgtctg
241 tgaaaactac aagctggccg taaactgctt tgtgaataat aatcgtcaat gccagtgtac
301 ttcagttggt gcacaaaata ctgtcatttg ctcaaagctg gctgccaaat gtttggtgat
361 gaaggcagaa atgaatggct caaaacttgg gagaagagca aaacctgaag gggccctcca
421 gaacaatgat gggctttatg atcctgactg cgatgagagc gggctcttta aggccaagca
481 gtgcaacggc acctccacgt gctggtgtgt gaacactgct ggggtcagaa gaacagacaa
541 ggacactgaa ataacctgct ctgagcgagt gagaacctac tggatcatca ttgaactaaa
601 acacaaagca agagaaaaac cttatgatag taaaagtttg cggactgcac ttcagaagga
661 gatcacaacg cgttatcaac tggatccaaa atttatcacg agtattttgt atgagaataa
721 tgttatcact attgatctgg ttcaaaattc ttctcaaaaa actcagaatg atgtggacat
781 agctgatgtg gcttattatt ttgaaaaaga tgttaaaggt gaatccttgt ttcattctaa
841 gaaaatggac ctgacagtaa atggggaaca actggatctg gatcctggtc aaactttaat
901 ttattatgtt gatgaaaaag cacctgaatt ctcaatgcag ggtctaaaag ctggtgttat
961 tgctgttatt gtggttgtgg tgatggcagt tgttgctgga attgttgtgc tggttatttc
1021 cagaaagaag agaatggcaa agtatgagaa ggctgagata aaggagatgg gtgagatgca
1081 tagggaactc aatgcataac tatataattt gaagattata gaagaaggga aatagcaaat
1141 ggacacaaat tacaaatgtg tgtgcgtggg acgaagacat ctttgaaggt catgagtttg
1201 ttagtttaac atcatatatt tgtaatagtg aaacctgtac tcaaaatata agcagcttga
1261 aactggcttt accaatcttg aaatttgacc acaagtgtct tatatatgca gatctaatgt
1321 aaaatccaga acttggactc catcgttaaa attatttatg tgtaacattc aaatgtgtgc
1381 attaaatatg cttccacagt aaaatctgaa aaactgattt gtgattgaaa gctgcctttc
1441 tatttacttg agtcttgtac atacatactt ttttatgagc tatgaaataa aacattttaa
1501 actg
//
LOCUS HUMMHDNDRW 1066 bp ss-mRNA PRI 28-JUL-1990
DEFINITION Human MHC class II DN alpha mRNA, complete cds.
ACCESSION M26039 M27046
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human (haplotype DRw8,Dw8.2/DRw8,Dw8.2) cell line SPL, cDNA to
mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1066)
AUTHORS Jonsson,A.-K. and Rask,L.
TITLE Human class II DNA and DOB genes display low sequence variability
JOURNAL Immunogenetics 29, 411-413 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by A.-K.Jonsson, 07-JUL-1989.
FEATURES from to/span description
pept 26 778 MHC DN alpha protein precursor
/nomgen="HLA-DNA" /map="6p21.3"
/hgml_locus_uid="LV0117X"
sigp 26 106 MHC DN alpha protein signal peptide
matp 107 775 MHC DN alpha protein
mRNA < 1 1066 MHCDNA-a mRNA
BASE COUNT 199 a 337 c 283 g 247 t
ORIGIN Chromosome 6p21.3.
1 catttgatta aagcaccaga gtgtaatggc cctcagagca gggctggtcc tggggttcca
61 caccctgatg accctcctga gcccgcagga ggcaggggcc accaaggctg accacatggg
121 ctcctacgga cccgccttct accagtctta cggcgcctcg ggccagttca cccatgaatt
181 tgatgaggaa cagctgttct ctgtggacct gaagaaaagc gaggccgtgt ggcgtctgcc
241 tgagtttggt gactttgccc gctttgaccc gcagggcggg ctggccggca tcgccgcaat
301 caaagcccat ctggacatcc tggtggagcg ctccaaccgc agcagagcca tcaacgtgcc
361 tccacgggtg accgtgctcc ccaagtctcg ggtggagctg ggccagccca acatcctcat
421 ctgcatcgtg gacaacatct tcccccctgt gatcaatatc acctggctgc gcaacggcca
481 aactgtcact gagggagtgg cccagaccag cttctattcc cagcctgacc atttgttccg
541 caagttccac tacctgccct tcgtgccctc agccgaggac gtctatgact gccaggtgga
601 gcactggggc ctggatgcgc cactcctcag gcattgggag ctccaggtgc ctattccacc
661 accagatgcc atggagaccc tggtctgtgc cctgggcctg gccatcggcc tggtgggctt
721 cctcgtgggc accgtcctca tcatcatggg cacatatgtg tccagtgtcc ccaggtaatg
781 atccttctga gagaaatgac ttgtgggaga caccctgcag atcctcatgg gtttgtgaca
841 gaccctgcgt gctcagtgcc ctttaagtgc atcccgctgt gctgactttg agtgggatca
901 acatctgtcc tacgggtccc ctcttttttg gccccagtat tcatggcagg gtttgttgga
961 cacctactag cttcccttcc cattcaacac acacacacat tcttgctcta cccaaagctc
1021 tggctggcag cactaaatgc tttggtggtg tttgcactgt gtcctt
//
LOCUS HUMMHDOBDR 1293 bp ss-mRNA PRI 28-JUL-1990
DEFINITION Human MHC class II DO beta mRNA, complete cds.
ACCESSION M26040 M27047
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human (haplotype DRw8,Dw8.2/DRw8,Dw8.2) cell line SPL, cDNA to
mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1293)
AUTHORS Jonsson,A.-K. and Rask,L.
TITLE Human class II DNA and DOB genes display low sequence variability
JOURNAL Immunogenetics 29, 411-413 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by A.-K.Jonsson, 07-JUL-1989.
FEATURES from to/span description
pept 36 857 MHC DO beta protein precursor
/nomgen="HLA-DOB" /map="6p21.3"
/hgml_locus_uid="LM0050S"
sigp 36 125 MHC DO beta protein signal peptide
matp 126 854 MHC DO beta protein
BASE COUNT 293 a 315 c 330 g 355 t
ORIGIN Chromosome 6p21.3.
1 cgattttact gtctcatttt tttcctttct ccagaatggg ttctgggtgg gtcccctggg
61 tggtggctct gctagtgaat ctgacccgac tggattcctc catgactcaa ggcacagact
121 ctccagaaga ttttgtgatt caggcaaagg ctgactgtta cttcaccaac gggacagaaa
181 aggtgcagtt tgtggtcaga ttcatcttta acttggagga gtatgtacgt ttcgacagtg
241 atgtggggat gtttgtggca ttgaccaagc tggggcagcc agatgctgag cagtggaaca
301 gccggctgga tctcttggag aggagcagac aggccgtgga tggggtctgt agacacaact
361 acaggctggg cgcacccttc actgtgggga gaaaagtgca accagaggtg acagtgtacc
421 cagagaggac cccactcctg caccagcata atctgctgca ctgctctgtg acaggcttct
481 atccagggga tatcaagatc aagtggttcc tgaatgggca ggaggagaga gctggggtca
541 tgtccactgg ccctatcagg aatggagact ggacctttca gactgtggtg atgctagaaa
601 tgactcctga acttggacat gtctacacct gccttgtcga tcactccagc ctgctgagcc
661 ctgtttctgt ggagtggaga gctcagtctg aatattcttg gagaaagatg ctgagtggca
721 ttgcagcctt cctacttggg ctaatcttcc ttctggtggg aatcgtcatc cagctaaggg
781 ctcagaaagg atatgtgagg acgcagatgt ctggtaatga ggtctcaaga gctgttctgc
841 tccctcagtc atgctaaggt cctcactgaa gcttctctct ctggagcctg aagtagtgat
901 gagtagtctg ggccctgggt gaggtaaagg acattcatga ggtcaatgtt ctgggaataa
961 ctctcttccc tgatccttgg aggagcccga actgattctg gagctctgtg ttctgagatc
1021 atgcatctcc cacccatctg cccttctccc ttctacgtgt acatcattaa tccccattgc
1081 caagggcatt gtccagaaac tcccctgaga ccttactcct tccagcccca aatcatttac
1141 ttttctgtgg tccagcccta ctcctataag tcatgatctc caaagctttc tgtcttccaa
1201 ctgcagtctc cacagtcttc agaagacaaa tgctcaggta gtcactgttt ccttttcact
1261 gtttttaaaa accttttatt gtcaaataaa atg
//
LOCUS TRPFLAA 966 bp ds-DNA BCT 28-JUL-1990
DEFINITION T.pallidum endoflagellar sheath protein (flaA) gene, 3' end.
ACCESSION M26525
KEYWORDS endoflagellar sheath protein.
SOURCE T.pallidum (strain Nichols) DNA.
ORGANISM Treponema pallidum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes;
Spirochaetales; Spirochaetaceae.
REFERENCE 1 (bases 1 to 966)
AUTHORS Isaacs,R.D., Hanke,J.H., Guzman-Verduzco,L.-M., Newport,G.,
Agabian,N., Norgard,M.V., Lukehart,S.A. and Radolf,J.D.
TITLE Molecular cloning and DNA sequence analysis of the 37-kilodalton
endoflagellar sheath protein of Treponema pallidum
JOURNAL Infect. Immun. 57, 3403-3411 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.D.Radolf, 26-JUL-1989.
FEATURES from to/span description
pept < 1 966 endoflagellar sheath protein (AA at 1)
BASE COUNT 199 a 196 c 325 g 246 t
ORIGIN Unreported
1 aagctgaacg ctgatatcat ggcggataag agtggaggta tgacgcataa tcggcgtacc
61 gttctggact atgcttctct ggcggatacc tcgtacactg acgagcagaa ggcattgatg
121 agatcttctc ttgcggttgc acagtgggag gttgtgctga attcttccgc gcgtaatcct
181 gtcgcccatg ctgcctctcg cgttattgag gctccggtaa gtgagggagc gaagagtttt
241 gctggtgagc gtgtccttgg tgtgcgcgtg ttgttcccca cgtgggacag taacgcaaac
301 gcaatgataa agccggcgtt cgtaattcct gcgtacgagg tgatggctca ggtggacgat
361 cagggtaatg tacaggcccc cacagaggag gagaaggctt ctggaaaggg gcgttttgaa
421 gatgggtacg gagtggtaaa gaatgtgggt gttcttaagt ccatcgcggt gaacacttac
481 gggatgaatt atcctcatgg tttgtacgtg atgatgcggg atcaggatgg tgaggtgcat
541 cgctacttca tggggtatct cctgttcgac tcctggaagg agttggtgtg gaacaatcct
601 tcgtatatct ctgatgttcg gtcgcgggag gtgcgcttgt atcccgtgta tcccgcgtcg
661 acgccccacg tcgtgtttga aggctttatg gttactaggg acgcggctca tgccggaggg
721 gactatgttg gttatttcaa ggacgtcaag attatctatg ataaggcggt gctgagtacg
781 gtgcgcgatt ttgcggacga ggacctgtgg ggtatccagg cgcggcgtga ggctgagcgt
841 aagagagttg aggttgcgcg tttcgggcag cagcaggtgc tgcgttatat agagcaagag
901 aagcttgcta cagaggttgg ttttacaccc tctgggggtg ctcagcggca ggaagagcag
961 cagtag
//
LOCUS DROMPP1 3376 bp ds-DNA INV 28-JUL-1990
DEFINITION D.melanogaster membrane protein (patched) gene complete cds.
ACCESSION M28418
KEYWORDS transmembrane protein.
SEGMENT 1 of 2
SOURCE D.melanogaster (embryo), DNA and cDNA to mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 3376)
AUTHORS Hooper,J.E. and Scott,M.P.
TITLE The Drosophila patched gene encodes a putative membrane protein
required for segmental patterning
JOURNAL Cell 59, 751-765 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by J.E.Hooper, 27-SEP-1989.
FEATURES from to/span description
pept 3161 + 3289 membrane protein exon 1
pre-msg 2389 > 3376 MPR mRNA and intron
IVS 3290 > 3376 MPR intron A
BASE COUNT 1059 a 731 c 764 g 822 t
ORIGIN Chromosome 2, map position 44D-E.
1 gatcctgaat tgagaaatat agattgaaac agaattcatt accatttaag caatcattat
61 ttatgggggc gtaatgcgcc tccgagtagg caatgctttt cttgacattg ttactaagaa
121 ttgtgaatga tatttgggcg tggatcaacg ccgattaaaa gctgcttttg cttccaggcg
181 gccagagaag agatccaaac ttcaactcca gccataaaag caacaacatt tccgtctccc
241 ccttgtagct ccccttcctc cggctcttcc actctccacg aaacggcaaa tgaagctctc
301 aaagcgaact gtgcttcgct ggtggtccat tggcagctgc cgccacacag gcgctgcttt
361 tgtgtgtgtg tgtaatatca atcttgctct ccctctcttt ttatctctct tgggaattgg
421 agctgcatgc gaattgagcg acagcaaaac gaactgcaag tcattgagag gagagcaaaa
481 actcgagcgc aagccaaaga tagcgcaatc tggggagagc gaaataaagc taaaatatgc
541 atgttggaga aaaaatgccg cccatgtcgc caaaatgcgc cacacgcaga gtgagcgggc
601 ggaggtggga gtaatggaaa gggcgatgag ggaacgatta gcttgaagag agagaacaac
661 aaatgaatgt gctgcaacgt tagttcaggt gagcgcgtta gagagagagt tgttgttttt
721 tgattgtaat agctcgcttg gtggtgggtc cacattcaca tctccctctc ccactctttc
781 tccccgaaag agagagcggg agcgaagggg cacgagggga gcacgatgac tatgcagttg
841 cattcaattt gaatttccat ggtgctgatg attcgagcgc caattttttc gaagagttct
901 tatttgttta cttcgttgtt gttgcctcaa ttggaaaggg aaaatgtgga atgcggagaa
961 acaccagaag caaatgcatt tccattcata aatccaaaga agttttaaag ataacatgtc
1021 atttggctta agttcgtggt gcacaaaaaa gatcggtttg cggttgtcgc atgaaatgag
1081 tttattccat tggtatatta ttattcagaa attaaaaaaa aacttgttta gtctattttt
1141 tttttttaaa taaaaaaaaa aaattctttt ataagtcgat tttagagtaa atatttaaag
1201 actacgtcta ataaacatat aatttgttct gtgttttaat ttgccggcaa aaacaaacct
1261 acttgtgtgg tcctcgcaca ctcataaccc ctcgcatatt tgagattcat ggggcaagag
1321 gctgcaaaaa caatggaaag ggaaaagcag aaacatcctg ccgctcataa tttagcatcg
1381 gaacatgcaa aaacagacat catcgcatgg ggcagcagca acagccataa aaccaacacg
1441 agcaatgtaa agctaacaaa tttgccaaca gttcgcggca cggctacaca cacacacatg
1501 catgcgcagc ctgccacgca cgcgcttccc ccaaacaaat acacacacac acactgagac
1561 gaaagctcca ttgggcagcg ctgccgacgc tgaaggccga catcggcaga gctgaacgtt
1621 tgggtagggg accacccaca tcgcttggcg gtttcagttt aatgaaggca gaaacaaatt
1681 tatttttggg tggtccacac tgcagcgaaa ataaactaca gtggcaacaa caaaccagca
1741 gccaaggcac tttgggtggt ccatgcaaaa aaaaaacaaa ttacggcatg cgaataacaa
1801 tagaaattag cgctctcgtg gcggagctat ttgggtatat tagagctaca tattttattt
1861 gtttataaaa agtataaatg taaacaatga gttccaagca ttaagtccgt atgctcaaca
1921 attacattat cattattatt atcacttaaa tatttacaaa ggatatttaa acagtaatag
1981 atatatattt tatttcttaa tttctgttaa catatgtatt tacattggta gttattcttt
2041 attttgcaac aagcattcat aaattttata taacaaactt ggtattttct cggaaaaact
2101 cctgaatcac ccctcggtat tttgtgcgtt gagctatcgt taaagcagcc ctcgcagaga
2161 gcgttctcaa accaaaatgg ccgcacacga aacaagagag cgagtgagag tagggagagc
2221 gtctgtgttg tgtgttgagt gtcgcccacg cacacaggcg caaaacagtg cacacagacg
2281 cccgctgggc aagagagagt gagagagaga aacagcggcg cgcgctcgcc taatgaagtt
2341 gttggcctgg ctggcgtgcc gcatccacga gatacagata catctctcag actgcgtgcg
2401 atcctcgaac gaaacggttg taagtgcgga gcgcgacgac ttgttattcg tatttccgac
2461 tactggcact ctctgtgtgt ggtatactaa caagatagat atcacagaac tcgtggaaaa
2521 gctaagatat tgtacctcac ggatgcgagg cgaagttcat ggattaaatg ccaggcaaca
2581 acaaaagcca gccaaccagc cagtgtttgt gtgtgtgcgt cgccaagtgc aaagtaaagt
2641 aaaggtaaaa gagcgaaagg cgagagagaa aaccgaatac gtgagtcgtc cgactgccgc
2701 ttttccatgt gtaaaagatc tgtgaaaatt ctgtcaaatt cccctgagaa attgtgccca
2761 agataaaacc cgaaaaccgc gttttaatcg tcgaaaaaac ccagcaaaag cgaagccagc
2821 aatcacaaca aaacaacata acgagagctc agatacacag cgtgctcagt gagtgagcga
2881 gagagcgcgg gagagagcgt ctcttgattt aaaatacaaa ataattaaaa ataaaaatgc
2941 ggaatgcagt gcaaaatgca gccaaacaaa atacgagatt ccaataacaa ttaatcgaac
3001 cgaaagtcca cgaacaatcc gcacactgtc tcccaagtct cagttctcag gacgcagacg
3061 aacggcaggc actgtagaaa gaccgattcc gcagcacact cccatctgca catctccgcc
3121 acgcgattcc gtccggaatc tggctataaa cataaccata atggaccgcg acagcctccc
3181 acgcgttccg gacacacacg gcgatgtggt cgatgagaaa ttattctcgg atctttacat
3241 acgcaccagc tgggtggacg cccaagtggc gctcgatcag atagataagg tgagtgccca
3301 actacagtga actttcactg tgaaggatag ccatgtgttg aattcaataa tattcttgat
3361 cgtattcgga ggatcc
//
LOCUS DROMPP2 5665 bp ds-DNA INV 28-JUL-1990
DEFINITION D.melanogaster membrane protein (patched) gene, complete cds.
ACCESSION M28999
KEYWORDS transmembrane protein.
SEGMENT 2 of 2
SOURCE D.melanogaster (embryo), DNA and cDNA to mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 5665)
AUTHORS Hooper,J.E. and Scott,M.P.
TITLE The Drosophila patched gene encodes a putative membrane protein
required for segmental patterning
JOURNAL Cell 59, 751-765 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by J.E.Hooper, 27-SEP-1989.
Mak called J.E.Hooper and requested copy of cds be sent showing
introns and exons, 9-OCT-1989. Copy was received and corrections
made, 18-OCT-1989.
FEATURES from to/span description
pept + 95 648 membrane protein exon 2
986 2912 membrane protein exon 3
3051 3258 membrane protein exon 4
3322 3593 membrane protein exon 5
3789 4559 membrane protein exon 6
pre-msg < 1 > 5448 MPR mRNA and introns
IVS < 1 94 MPR intron A
IVS 649 985 MPR intron B
IVS 2913 3050 MPR intron C
IVS 3259 3321 MPR intron D
IVS 3594 3788 MPR intron E
BASE COUNT 1397 a 1537 c 1350 g 1381 t
ORIGIN About 9.2kb after segment 1, Chromosome 2, map position 44D-E.
1 aattttaatg cgtattttat ggcagtggag caaggcgggg gaatctaaaa aaaaaactaa
61 acgctaaatt ccgtattttt gttgcatttt tcagggcaaa gcgcgtggca gccgcacggc
121 gatctatctg cgatcagtat tccagtccca cctcgaaacc ctcggcagct ccgtgcaaaa
181 gcacgcgggc aaggtgctat tcgtggctat cctggtgctg agcaccttct gcgtcggcct
241 gaagagcgcc cagatccact ccaaggtgca ccagctgtgg atccaggagg gcggccggct
301 ggaggcggaa ctggcctaca cacagaagac gatcggcgag gacgagtcgg ccacgcatca
361 gctgctcatt cagacgaccc acgacccgaa cgcctccgtc ctgcatccgc aggcgctgct
421 tgcccacctg gaggtcctgg tcaaggccac cgccgtcaag gtgcacctct acgacaccga
481 atgggggctg cgcgacatgt gcaacatgcc gagcacgccc tccttcgagg gcatctacta
541 catcgagcag atcctgcgcc acctcattcc gtgctcgatc atcacgccgc tggactgttt
601 ctgggaggga agccagctgt tgggtccgga atcagcggtc gttataccgt aagtagttaa
661 tatgtagtta atagccacat cttatagatt ctaaagtgaa cgtatccctt atgaccatat
721 ccttttgcat gatctacttt aacccacagt acttctctat tcatattaag gaattaataa
781 agtacttact ttgcgcttac ctttattaaa tacgatagct tatctttata aacttgctat
841 caagtcgaaa gataaacgtg acaagagtat ctttgtactt atcccagttg cttaccatcg
901 taaataatct tcttattaat aaatattcgt aaataaatat tcttaactca acaaatccat
961 ctttattatt gttactcctc tacagaggcc tcaaccaacg actcctgtgg accaccctga
1021 atcccgcctc tgtgatgcag tatatgaaac aaaagatgtc cgaggaaaag atcagcttcg
1081 acttcgagac cgtggagcag tacatgaagc gtgcggccat tggcagtggc tacatggaga
1141 agccctgcct gaacccactg aatcccaatt gcccggacac ggcaccgaac aagaacagca
1201 cccagccgcc ggatgtggga gccatcctgt ccggaggctg ctacggttat gccgcgaagc
1261 acatgcactg gccggaggag ctgattgtgg gcggacggaa gaggaaccgc agcggacact
1321 tgaggaaggc ccaggccctg cagtcggtgg tgcagctgat gaccgagaag gaaatgtacg
1381 accagtggca ggacaactac aaggtgcacc atcttggatg gacgcaggag aaggcagcgg
1441 aggttttgaa cgcctggcag cgcaactttt cgcgggaggt ggaacagctg ctacgtaaac
1501 agtcgagaat tgccaccaac tacgatatct acgtgttcag ctcggctgca ctggatgaca
1561 tcctggccaa gttctcccat cccagcgcct tgtccattgt catcggcgtg gccgtcaccg
1621 ttttgtatgc cttttgcacg ctcctccgct ggagggaccc cgtccgtggc cagagcagtg
1681 tgggcgtggc cggagttctg ctcatgtgct tcagtaccgc cgccggattg ggattgtcag
1741 ccctgctcgg tatcgttttc aatgcgctga ccgctgccta tgcggagagc aatcggcggg
1801 agcagaccaa gctgattctc aagaacgcca gcacccaggt ggttccgttt ttggcccttg
1861 gtctgggcgt cgatcacatc ttcatagtgg gaccgagcat cctgttcagt gcctgcagca
1921 ccgcaggatc cttctttgcg gccgccttta ttccggtgcc ggctttgaag gtattctgtc
1981 tgcaggctgc catcgtaatg tgctccaatt tggcagcggc tctattggtt tttccggcca
2041 tgatttcgtt ggatctacgg agacgtaccg ccggcagggc ggacatcttc tgctgctgtt
2101 ttccggtgtg gaaggaacag ccgaaggtgg cacctccggt gctgccgctg aacaacaaca
2161 acgggcgcgg ggcccggcat ccgaagagct gcaacaacaa cagggtgccg ctgcccgccc
2221 agaatcctct gctggaacag agggcagaca tccctgggag cagtcactca ctggcgtcct
2281 tctccctggc aaccttcgcc tttcagcact acactccctt cctcatgcgc agctgggtga
2341 agttcctgac cgttatgggt ttcctggcgg ccctcatatc cagcttgtat gcctccacgc
2401 gccttcagga tggcctggac attattgatc tggtgcccaa ggacagcaac gagcacaagt
2461 tcctggatgc tcaaactcgg ctctttggct tctacagcat gtatgcggtt acccagggca
2521 actttgaata tcccacccag cagcagttgc tcagggacta ccatgattcc tttgtgcggg
2581 tgccacatgt gatcaagaat gataacggtg gactgccgga cttctggctg ctgctcttca
2641 gcgagtggct gggtaatctg caaaagatat tcgacgagga ataccgcgac ggacggctga
2701 ccaaggagtg ctggttccca aacgccagca gcgatgccat cctggcctac aagctaatcg
2761 tgcaaaccgg ccatgtggac aaccccgtgg acaaggaact ggtgctcacc aatcgcctgg
2821 tcaacagcga tggcatcatc aaccaacgcg ccttctacaa ctatctgtcg gcatgggcca
2881 ccaacgacgt cttcgcctac ggagcttctc aggtgggtct tcttattaaa ttaaattaaa
2941 ttaaattaaa ttagatcgcc ttagttctcc tcatatgtac atacatatta taacttatcg
3001 cactccaaag ttaaagatta ctaaatgtgt gtgtatcttt attcttacag ggcaaattgt
3061 atccggaacc gcgccagtat tttcaccaac ccaacgagta cgatcttaag atacccaaga
3121 gtctgccatt ggtctacgct cagatgccct tttacctcca cggactaaca gatacctcgc
3181 agatcaagac cctgataggt catattcgcg acctgagcgt caagtacgag ggcttcggcc
3241 tgcccaacta tccatcgggt gagtcggaaa tgagtacttc atacatgggg cccaactaac
3301 agtcgattta tttatcgcca ggcattccct tcatcttctg ggagcagtac atgaccctgc
3361 gctcctcact ggccatgatc ctggcctgcg tgctactcgc cgccctggtg ctggtctccc
3421 tgctcctgct ctccgtttgg gccgccgttc tcgtgatcct cagcgttctg gcctcgctgg
3481 cccagatctt tggggccatg actctgctgg gcatcaaact ctcggccatt ccggcagtca
3541 tactcatcct cagcgtgggc atgatgctgt gcttcaatgt gctgatatca ctggtgagtc
3601 ttcatttctg gctggaccat taagagcttc ggagtgagtc ttcatttctg gctggaccat
3661 taagagcttc ggagtgagtc ttcatttctg gctggaccat taagagcttc ggattttcca
3721 gagatatccc aagacttttc attggatcct cttcagcaca cattaattgc ttatctttcc
3781 gattctaggg cttcatgaca tccgttggca accgacagcg ccgcgtccag ctgagcatgc
3841 agatgtccct gggaccactt gtccacggca tgctgacctc cggagtggcc gtgttcatgc
3901 tctccacgtc gccctttgag tttgtgatcc ggcacttctg ctggcttctg ctggtggtct
3961 tatgcgttgg cgcctgcaac agccttttgg tgttccccat cctactgagc atggtgggac
4021 cggaggcgga gctggtgccg ctggagcatc cagaccgcat atccacgccc tctccgctgc
4081 ccgtgcgcag cagcaagaga tcgggcaaat cctatgtggt gcagggatcg cgatcctcgc
4141 gaggcagctg ccagaagtcg catcaccacc accacaaaga ccttaatgat ccatcgctga
4201 cgacgatcac cgaggagccg cagtcgtgga agtccagcaa ctcgtccatc cagatgccca
4261 atgattggac ctaccagccg cgggaacagc gacccgcctc ctacgcggcc ccgccccccg
4321 cctatcacaa ggccgccgcc cagcagcacc accagcatca gggcccgccc acaacgcccc
4381 cgcctccctt cccgacggcc tatccgccgg agctgcagag catcgtggtg cagccggagg
4441 tgacggtgga gacgacgcac tcggacagca acaccaccaa ggtgacggcc acggccaaca
4501 tcaaggtgga gctggccatg cccggcaggg cggtgcgcag ctataacttt acgagttagc
4561 actagcacta gttcctgtag ctattaggac gtatctttag actctagcct aagccgtaac
4621 cctatttgta tctgtaaaat cgatttgtcc agcgggtctg ctgaggattt cgttctcatg
4681 gattctcatg gattctcatg gatgcttaaa tggcatggta attggcaaaa tatcaatttt
4741 tgtgtctcaa aaagatgcat tagcttatgg tttcaagata catttttaaa gagtccgcca
4801 gatatttata taaaaaaaat ccaaaatcga cgtatccatg aaaattgaaa agctaagcag
4861 acccgtatgt atgtatatgt gtatgcatgt tagttaattt cccgaagtcc ggtatttata
4921 gcagctgcct tccgcgcccc ccttcccttg aaatgaacac ccttccagcc acgccccacc
4981 gcccctctgc gtagcagctt tgtatgtatg tagtatgcta gcacctaagg aatacttaaa
5041 cttagagata tttattgtaa cacacgcaaa acacacacaa tgtacttaca tataattcaa
5101 tgcgagattc acccacacaa aaaggaaaca caacaaacta gtaattgtag ctcgtaattt
5161 agtttaaata tgttacataa aacacaagga cttgaaccaa aatagtatcg cttaaacgga
5221 aacgagagaa acgagaaaaa ataactatta cttaatcaac tacaagagag atatccctcc
5281 tcccctaacc gtacttacaa ccaaaataaa acaagagtat aagcataaaa atggaaaacg
5341 aagcgaggaa cgattgtaaa cgcggtcatt tatcctgtac atttgttgcc cgaagactga
5401 ctgtcttttt tttaataaaa atatatatta tacagttttt taaaagcgaa attcatgact
5461 tttttttaac agtgagcaga gaacaaaaga aacggaagtt ttcgctgtat caataaaaag
5521 attccatttt tttaataaat tgtaaaaatc ctaaaaaaaa gaagactaca aaagtttaaa
5581 tttttatacg ttattgataa acttttatac acgaaaatac ttgtacttag ctatgatcaa
5641 ctccttggct taagtctcgg gtaag
//
LOCUS BLYGEH 1250 bp ss-mRNA PLN 28-JUL-1990
DEFINITION Barley (1->3)-beta-glucan endohydrolase mRNA, complete cds.
ACCESSION M23548 X15205
KEYWORDS glucan endohydrolase.
SOURCE Barley (2 days into germination) scutellum, cDNA to mRNA, clone
lambda-3.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1250)
AUTHORS Hoej,P.B., Hartman,D.J., Morrice,N.A., Doan,D.N.P. and Fincher,G.B.
TITLE Purification of (1->3)-beta-glucan endohydrolase isoenzyme II from
germinated barley and determination of its primary structure from a
cDNA clone
JOURNAL Plant Mol. Biol. 13, 31-42 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by G.W.Fincher, 02-MAY-1989.
FEATURES from to/span description
pept 48 1052 glucan endohydrolase precursor
sigp 48 131 glucan endohydrolase signal peptide
matp 132 1049 glucan endohydrolase
mRNA < 1 1250 GEH mRNA
BASE COUNT 262 a 404 c 362 g 222 t
ORIGIN
1 ccagcattgc atagcatttg agcaccagat actccatgtg tgcagcaatg gctagaaaag
61 atgttgcctc catgtttgca gctgctctct tcattggagc gttcgctgct gttcctacga
121 gtgtgcagtc catcggcgtg tgctacggcg tgatcggcaa caacctcccc tcccggagcg
181 acgtggtgca gctctacagg tccaagggca tcaacggcat gcgcatctac ttcgccgacg
241 ggcaggccct ctcggcgctc cgcaactccg gcatcggcct catcctcgac atcggcaacg
301 accagctcgc caacatcgcc gccagcacct ccaacgcggc gtcctgggtc cagaacaacg
361 tgcggcccta ctaccctgcc gtgaacatca agtacatcgc cgccggcaac gaggtgcagg
421 gcggcgccac gcagagcatc ctgccggcca tgcgcaacct caacgcggcc ctctccgcgg
481 cggggctcgg cgccatcaag gtgtccacct ccatccggtt cgacgaggtg gccaactcct
541 tcccgccctc cgccggcgtg ttcaagaacg cctacatgac ggacgtggcc cggctcctcg
601 cgagcaccgg cgcgccgctg ctcgccaacg tctaccccta cttcgcgtac cgtgacaacc
661 ccgggagcat cagcctgaac tacgcgacgt tccagccggg caccaccgtg cgtgaccaga
721 acaacgggct gacctacacg tccctgttcg acgcgatggt ggacgccgtg tacgcggcgc
781 tggagaaggc cggcgcgccg gcggtgaagg tggtggtgtc ggagagcggg tggccgtcgg
841 cgggcgggtt tgcggcgtcg gccggcaatg cgcggacgta caaccagggg ctgatcaacc
901 acgtcggcgg gggcacgccc aagaagcggg aggcgctgga gacgtacatc ttcgccatgt
961 tcaacgagaa ccagaagacc ggggacgcca cggagaggag cttcgggctc ttcaacccgg
1021 acaagtcgcc ggcatacaac atccagttct agtgtagcta cctagctcac atacctacat
1081 ccccagccta aataaataag ctgctcgtac gtacgtaatg cggcatccaa gtgtaacgta
1141 gacacgtaca ttcatccatg gaagagtgca accaagcatg cgttaacttc ctggtgatga
1201 tacatcatca tggtatgaat aaaagatatg gaagatgtta tgaatttgtg
//
LOCUS ECOPOLBDA 4666 bp ds-DNA BCT 28-JUL-1990
DEFINITION E.coli DNA polymerase (polB) gene, 5' flank.
ACCESSION M35371
KEYWORDS DNA polymerase; polB gene.
SOURCE E.coli (strain W3110) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 4666)
AUTHORS Chen,H., Sun,Y., Stark,T., Beattie,W. and Moses,R.
TITLE Nucleotide sequence and deletion analysis of the polB gene of
E.coli
JOURNAL Unpublished (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Chen, 20-JUN-1990.
Author address: H.Chen
Baylor College of Medicine
Cell Biology and IMG
One Baylor Plaza
Houston, TX 77030
email: hchen@vulture.bcm.tmc.edu
FEATURES from to/span description
signal 1209 1214 -35 region
signal 1229 1234 -10 region
binding 1250 1255 ribosome binding site
site 1193 1208 lexA box
BASE COUNT 1100 a 1278 c 1257 g 1031 t
ORIGIN
1 aagcttgcat gcctgcaggt cgactctaga ggatcctcgc tggtggcgcg caccataccg
61 tcttcagcca tgcactgaac ctcaacgata tgcgccaatt cgccgagatg cacgacattg
121 aaatcacggt gattgataac gacacacgcc tgccagcgtt taaagacgcg ctgcgctgga
181 acgaagtgta ttacgggttt cgtcgctaag tagccgcatc cggtatgtaa cgcctgatgc
241 gacgctgacg cgtcttatct ggcctacacg ctgcgatttt gtaggccgga taagcaaagc
301 gcatccggca ttcaacgcct gatgcgacgc tggcgcgtct tatcaggcct acgcgctgcg
361 attttgtagg ccggataagc aaagcgcatc cggcattcaa cgcctgatgc gacgctggcg
421 cgtcttatca ggcctacacg ctgcgatttt gtaggccgga taagcaaagc gcatccggca
481 cgaaggagtc aacatgttag aagatctcaa acgccaggta ttagaagcca acctggcgct
541 gccaaaacac aacctggtca cgctcacatg gggcaacgtc agcgccgttg atcgcgagcg
601 cggcgtcttt gtgatcaaac cttccggcgt cgattacagc gtcatgaccg ctgacgatat
661 ggtcgtggtt agcatcgaaa ccggtgaagt ggttgaaggt acgaaaaagc cctcctccga
721 cacgccaact caccggctgc tctatcaggc attcccctcc attggcggca ttgtgcatac
781 gcactcgcgc cacgccacca tctgggcgca ggcgggtcag tcgattccag caaccggcac
841 cacccacgcc gactatttct acggcaccat tccctgcacc cgcaaaatga ccgacgcaga
901 aatcaacggc gaatatgagt gggaaaccgg taacgtcatc gtagaaacct ttgaaaaaca
961 gggtatcgat gcagcgcaaa tgcccggcgt tctggtccat tcccacggcc cgtttgcatg
1021 gggcaaaaat gccgaagatg cggtgcataa cgccatcgtg ctggaagagg tcgcttatat
1081 ggggatattc tgccgtcagt tagcgccgca gttaccggat atgcagcaaa cgctgctgga
1141 taaacactat ctgcgtaagc atggcgcgaa ggcatattac gggcagtaat gactgtataa
1201 aaccacagcc aatcaaacga aaccaggcta tactcaagcc tggttttttg atggattttc
1261 agcgtggcgc aggcaggttt tatcttaacc cgacactggc gggacacccc gcaagggaca
1321 gaagtctcct tctggctggc gacggacaac gggccgttgc aggttacgct tgcaccgcaa
1381 gagtccgtgg cgtttattcc cgccgatcag gttccccgcg ctcagcatat tttgcagggt
1441 gaacaaggct ttcgcctgac accgctggcg ttaaaggatt ttcaccgcca gccggtgtat
1501 ggcctttact gtcgcgccca tcgccaattg atgaattacg aaaagcgcct gcgtgaaggt
1561 ggcgttaccg tctacgaggc cgatgtgcgt ccgccagaac gctatctgat ggagcggttt
1621 atcacctcac cggtgtgggt cgagggtgat atgcacaatg gcactatcgt taatgcccgt
1681 ctgaaaccgc atcccgacta tcgtccgccg ctcaagtggg tttctataga tattgaaacc
1741 acccgccacg gtgagctgta ctgcatcggc ctggaagcgt gcgggcagcg catcgtttat
1801 atgctggggc cggagaatgg cgacgcctcc tcgcttgatt tcgaactgga atacgtcgcc
1861 agccgcccgc agttgctgga aaaactcaac gcctggtttg ccaactacga tcctgatgtg
1921 atcatcggtt ggaacgtggt gcagttcgat ctgcgaatgc tgcaaaaaca tgccgagcgt
1981 taccgtcttc cgctgcgtct tgggcgcgat aatagcgagc tggagtggcg cgagcacggc
2041 tttaaaaacg gcgtcttttt tgcccaggct aaaggtcggc taattatcga cggtatcgag
2101 gcgctgaaat ccgcgttctg gaatttctct tcattctcgc tggaaactgt cgctcaggag
2161 ctattaggcg aaggaaaatc tatcgataac ccgtgggatc gaatggacga aattgaccgc
2221 cgtttcgccg aagataaacc tgcgctggca acttataacc tgaaagattg cgagctggtg
2281 acgcagatct tccacaaaac tgaaatcatg ccatttttac tcgaacgggc aacggtgaac
2341 ggcctgccgg tggaccgaca cggcggttcg gtggcggcat ttggtcatct ctattttccg
2401 cgaatgcatc gcgctggtta tgtcgcgcct aatctcggcg aagtgccgcc gcacgccagc
2461 cctggcggct acgtgatgga ttcacggcca gggctttatg attcagtgct ggtgctggac
2521 tataaaagcc tgtacccgtc gatcatccgc acctttctga ttgatcccgt cgggctggtg
2581 gaaggcatgg cgcagcctga tccagagcac agtaccgaag gttttctcga tgcctggttc
2641 tcgcgagaaa aacattgcct gccggagatt gtgactaaca tctggcacgg gcgcgatgaa
2701 gccaaacgcc agggtaacaa accgctgtcg caggcgctga aaatcatcat gaatgccttt
2761 tatggcgtgc tcggcaccac cgcctgccgc ttcttcgatc cgcggctggc atcgtcgatc
2821 accatgcgtg gtcatcagat catgcggcaa accaaagcgt tgattgaagc acagggctac
2881 gacgttatct acggcgatac cgactcaacg tttgtctggc tgaaaggcgc acattcggaa
2941 gaagaagcgg cgaaaatcgg tcgtgcactg gtgcagcacg ttaacgcctg gtgggcggaa
3001 acgctgcaaa aacaacggct gaccagcgca ttagaactgg agtatgaaac ccatttctgc
3061 cgttttctga tgccaaccat tcgcggagcc gataccggca gtaaaaagcg ttatgccgga
3121 ctgattcagg agggcgacaa gcagcggatg gtgtttaaag ggctggaaac cgtgcgcacc
3181 gactggacgc cgctggccca gcagtttcag caggagctat acctgcgcat cttccgcaac
3241 gagccatatc aggaatatgt acgcgaaacc atcgacaaac tgatggcggg tgaactggat
3301 gcgcgactgg tttaccgtaa acgccttcgc cgtccgctga gcgagtatca gcgtaatgtg
3361 ccgcctcatg tacgcgccgc tcgccttgcc gatgaagaaa accaaaagcg tggtcgcccc
3421 ttgcaatatc agaatcgcgg caccattaag tacgtatgga ccaccacagg cccggagccg
3481 cctggactac caacgttcac cactggatta cgaacactat ctgacccgcc agctacaacc
3541 cgtggcggag ggaatactcc cttttattga ggataatttt gctacactta tgaccgggca
3601 acttgggcta ttttgagcaa aaaaaagagt tcgccagata ccattttgat gcgtgacgaa
3661 tgctttgcca tccagtacca tagcgccctt tccattcctg gacctgaata acaccactac
3721 ctcataagca cggtagcggg tggttattgc ctgcaattaa agatatagag ccgaacacat
3781 atgcctttta cacttggtca acgctggatc agcgatacag aaagcgaatt gggacttgga
3841 accgttgtcg cggtggatgc gcgaactgtc actttacttt tcccatctac tggtgaaaac
3901 cgtctgtacg cacgcagtga ttcccccgtg acccgcgtga tgttcaaccc tggtgatacc
3961 attaccagcc atgacggctg gcagatgcaa gtcgaagaag taaaagaaga aaatggcttg
4021 ctgacctata tcggtactcg cctggatact gaagaggtcc ggcgtagccc tgcgtgaagt
4081 tttccttgat agcaaactgg tgttcagcaa accgcaggca ccgtctgttt gccgggcaga
4141 ttgaccgtat ggaccgcttt gcgctgcgtt atcgcgcgcg taaatattcc agcgaacagt
4201 tccgtatgcc gtacagcggc ctgcgcggtc agcgtaccag cctgatccgc atcagctcaa
4261 catcgctcat gatgttggtc gccgccacgc gccgcgcgtc ctgctggctg acgaagtggg
4321 tttagggaaa accattgaag ccgggatgat cctgcatcag caactgctct ctggcgctgc
4381 tgaacgtgtg ctaattatcg tcccggaaac cttacagcat cagtggctgg tagaaatgct
4441 gcgccgtttc aacctgcgct ttgcgctatt tgatgatgag cgttatgccg aagctcagca
4501 cgatgcttac aacccgtttg acaccgtgaa gcggcgcacg aaaaacgcga aagcgtttca
4561 cgataaatgc gaaaacttta gctttcgcgc ttcaaatgaa acagatgtat taattactgc
4621 tttttattca ttacatgggg atccccgggt accgagctcg aattcc
//
LOCUS HUMBIGFII 1387 bp ss-mRNA PRI 28-JUL-1990
DEFINITION Human insulin-like growth factor binding protein 2 (IGFBP2) mRNA,
complete cds.
ACCESSION M35410
KEYWORDS insulin-like growth factor binding protein 2.
SOURCE Human 67-year old retina, cDNA to mRNA, clone AS200.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1387)
AUTHORS Agarwal,N., Hsieh,C.-L., Sills,D., Swaroop,M., Desai,B.,
Francke,U. and Swaroop,A.
TITLE Sequence analysis, expression and chromosomal localization of a
gene, isolated from a subtracted human retina cDNA library, that
encodes an insulin-like growth factor binding protein (IGFBP2)
JOURNAL Exp. Eye Res. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Swaroop, 20-JUN-1990.
FEATURES from to/span description
pept 64 1050 insulin-like growth factor binding protein 2
(IGFBP2) precursor /hgml_locus_uid="LN0220S"
/nomgen="IBP1" /map="7p13-p12"
sigp 64 180 insulin-like growth factor binding protein 2
signal peptide
matp 181 1047 insulin-like growth factor binding protein 2
mRNA < 1 1387 IGFBP2 mRNA
signal 1362 1367 poly-A signal
binding 175 197 ATP binding site
BASE COUNT 232 a 455 c 477 g 223 t
ORIGIN
1 gtgccacctg cccgcccgcc cgctcgctcg ctcgcccgcc gcgccgcgct gccgaccgcc
61 agcatgctgc cgagagtggg ctgccccgcg ctgccgctgc cgccgccgcc gctgctgccg
121 ctgctgccgc tgctgctgct gctactgggc gcgagtggcg gcggcggcgg ggcgcgcgcg
181 gaggtgctgt tccgctgccc gccctgcaca cccgagcgcc tggccgcctg cgggcccccg
241 ccggttgcgc cgcccgccgc ggtggccgca gtggccggag gcgcccgcat gccatgcgcg
301 gagctcgtcc gggagccggg ctgcggctgc tgctcggtgt gcgcccggct ggagggcgag
361 gcgtgcggcg tctacacccc gcgctgcggc caggggctgc gctgctatcc ccacccgggc
421 tccgagctgc ccctgcaggc gctggtcatg ggcgagggca cttgtgagaa gcgccgggac
481 gccgagtatg gcgccagccc ggagcaggtt gcagacaatg gcgatgacca ctcagaagga
541 ggcctggtgg agaaccacgt ggacagcacc atgaacatgt tgggcggggg aggcagtgct
601 ggccggaagc ccctcaagtc gggtatgaag gagctggccg tgttccggga gaaggtcact
661 gagcagcacc ggcagatggg caagggtggc aagcatcacc ttggcctgga ggagcccaag
721 aagctgcgac caccccctgc caggactccc tgccaacagg aactggacca ggtcctggag
781 cggatctcca ccatgcgcct tccggatgag cggggccctc tggagcacct ctactccctg
841 cacatcccca actgtgacaa gcatggcctg tacaacctca aacagtgcaa gatgtctctg
901 aacgggcagc gtggggagtg ctggtgtgtg aaccccaaca ccgggaagct gatccaggga
961 gcccccacca tccgggggga ccccgagtgt catctcttct acaatgagca gcaggaggct
1021 cgcggggtgc acacccagcg gatgcagtag accgcagcca gccggtgcct ggcgcccctg
1081 ccccccgccc ctctccaaac accggcagaa aacggagagt gcttgggtgg tgggtgctgg
1141 aggattttcc agttctgaca cacgtattta tatatggaaa gagaccagca ccgagctcgg
1201 cacctccccg gcctctctct tcccagctgc agatgccaca cctgctcctt cttgctttcc
1261 ccgggggagg aagggggttg tggtcgggga gctggggtac aggtttgggg agggggaaga
1321 gaaattttta tttttgaacc cctgtgtccc ttttgcataa gattaaagga aggaaaagta
1381 aagtgtg
//
LOCUS HUMLBPA 1431 bp ss-mRNA PRI 28-JUL-1990
DEFINITION Human lipopolysaccharide binding protein (LBP) mRNA,
complete cds.
ACCESSION M35533
KEYWORDS lipopolysaccharide binding protein.
SOURCE Human liver, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites; for [2])
AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
TITLE Structure and function of lipopolysaccharide binding protein
JOURNAL Science (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1431)
AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.S.Tobias, 21-JUN-1990.
Author address: P.S.Tobias
Department of Immunology, IMM-12
10466 N. Torrey Pines Rd
La Jolla, CA 92037
FEATURES from to/span description
pept 1 > 1431 lipopolysaccharide binding protein (LBP)
precursor
sigp 1 75 lipopolysaccharide binding protein signal
peptide
matp 76 1431 lipopolysaccharide binding protein
BASE COUNT 319 a 417 c 359 g 336 t
ORIGIN
1 atgggggcct tggcaagagc cctgccgtcc atactgctgg cattgctgct tacgtccacc
61 ccagaggctc tgggtgccaa ccccggcttg gtcgccagga tcaccgacaa gggactgcag
121 tatgcggccc aggaggggct attggctctg cagagtgagc tgctcaggat cacgctgcct
181 gacttcaccg gggacttgag gatcccccac gtcggccgtg ggcgctatga gttccacagc
241 ctgaacatcc acagctgtga gctgcttcac tctgcgctga ggcctgtccc cggccagggc
301 ctgagtctca gcatctccga ctcctccatc cgggtccagg gcaggtggaa ggtgcgcaag
361 tcattcttca aactacaggg ctcctttgat gtcagtgtca agggcatcag catttcggtc
421 aacctcctgt tgggcagcga gtcctccggg aggcccacag gttactgcct cagctgcagc
481 agtgacatcg ctgacgtgga ggtggacatg tcgggagatt cggggtggct cttgaacctc
541 ttccacaacc agattgagtc caagttccag aaagtactgg agagcaggat ttgcgaaatg
601 atccagaaat cagtgtcctc cgatctacag ccttatctcc aaactctgcc agttacaaca
661 gagattgaca gtttcgccga cattgattat agcttagtgg aagcccctcg ggcaacagcc
721 cagatgctgg aggtgatgtt taagggtgaa atctttcatc gtaaccaccg ttctccagtt
781 accctccttg ctgcagctga ggaacacaac aaaatggtct actttgccat ctcggattat
841 gtcttcaaca cggccagcct ggtttatcat gaggaaggat atctgaactt ctccatcaca
901 gatgacatga taccgcctga ctctaatatc cgactgacca ccaagtcctt ccgacccttc
961 gtcccacggt tagccaggct ctaccccaac atgaacctgg aactccaggg atcagtgccc
1021 tctgctccgc tcctgaactt cagccctggg aatctgtctg tggaccccta tatggagata
1081 gatgcctttg tgctcctgcc cagctccagc aaggagcctg tcttccggct cagtgtggcc
1141 actaatgtgt ccgccacctt gaccttcaat accagcaaga tcactgggtt cctgaagcca
1201 ggaaaggtaa aagtggaact gaaagaatcc aaagttggac tattcaatgc agagctgttg
1261 gaagcgctcc tcaactatta catccttaac accctctacc ccaagttcaa tgataagttg
1321 gccgaaggct tcccccttcc tctgctgaag cgtgttcagc tctacgacct tgggctgcag
1381 atccataagg acttcctgtt cttgggtgcc aatgtccaat acatgagagt t
//
LOCUS HUMPEC12L 2344 bp ds-DNA PRI 28-JUL-1990
DEFINITION Human cell 12-lipoxygenase gene, complete cds.
ACCESSION M35418
KEYWORDS lipoxygenase.
SOURCE Human platelet/erythroleukemia cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2344)
AUTHORS Funk,C.D., Furci,L. and FitzGerald,G.A.
TITLE Molecular cloning, primary structure and expression of the human
platelet/erythroleukemia cell 12-lipoxygenase
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.D.Funk, 20-JUN-1990.
FEATURES from to/span description
pept 42 2033 12-lipoxygenase
signal 2324 2329 poly-A signal
mRNA < 1 2344 12-lipoxygenase mRNA
BASE COUNT 514 a 692 c 642 g 496 t
ORIGIN
1 ggaggacccg gctcccctcg cctaagctgc tggggggcgc catgggccgc taccgcatcc
61 gcgtggccac cggggcctgg ctcttctccg ggtcgtacaa ccgcgtgcag ctttggctgg
121 tcgggacgcg cggggaggcg gagctggagc tgcagctgcg gccggcgcgg ggcgaggagg
181 aggagtttga tcatgacgtt gcagaggact tggggctcct gcagttcgtg aggctgcgca
241 agcaccactg gctggtggac gacgcgtggt tctgcgaccg catcacggtg cagggccctg
301 gagcctgcgc ggaggtggcc ttcccgtgct accgctgggt gcagggcgag gacatcctga
361 gcctgcccga gggcaccgcc cgcctgccag gagacaatgc tttggacatg ttccagaagc
421 atcgagagaa ggaactgaaa gacagacagc agatctactg ctgggccacc tggaaggaag
481 ggttacccct gaccatcgct gcagaccgta aggatgatct acctccaaat atgagattcc
541 atgaggagaa gaggctggac tttgaatgga cactgaaggc aggggctctg gagatggccc
601 tcaaacgtgt ttacaccctc ctgagctcct ggaactgcct agaagacttt gatcagatct
661 tctggggcca gaagagtgcc ctggctgaga aggttcgcca gtgctggcag gatgatgagt
721 tgttcagcta ccagttcctc aatggtgcca accccatgct gttgagacgc tcgacctctc
781 tgccctccag gctagtgctg ccctcgggga tggaagagct tcaggctcaa ctggagaaag
841 aacttcagaa tggttccctg tttgaagctg acttcatcct tctggatgga attccagcca
901 acgtgatccg aggagagaag caatacctgg ctgcccccct cgttatgctg aagatggagc
961 ccaatgggaa gctgcagccc atggtcatcc agattcagcc tcccagcccc agctctccaa
1021 ccccaacact gttcctgccc tcagaccccc cacttgcctg gctcctggca aagtcctggg
1081 tccgaaattc agatttccaa ctgcacgaga tccagtatca cttgctgaac actcacctgg
1141 tggctgaggt catcgctgtc gccaccatgc ggtgcctccc aggactgcac cccatcttca
1201 agttcccgat cccccatatc cgctacacca tggaaatcaa cacccgggcc cggacccaac
1261 tcatctcaga tggaggaatt tttgataagg cagtgagcac aggtggaggg ggccatgtac
1321 agttgctccg tcgggcggca gctcagctga cctactgctc cctctgtcct cctgacgacc
1381 tggctgaccg gggcctgctg ggactcccag gtgctctcta tgcccatgat gctttacggc
1441 tctgggagat cattgccagg tatgtggagg ggatcgtcca cctcttctac caaagggatg
1501 acatagtgaa gggggaccct gagctgcagg cctggtgtcg ggagatcacg gaggtggggc
1561 tgtgccaggc ccaggaccga ggtttccctg tctccttcca gtcccagagt caactctgcc
1621 atttcctcac catgtgcgtc ttcacgtgca ctgcccagca tgccgccatc aaccagggcc
1681 agctggactg gtatgcctgg gtccctaatg ctccatgcac aatgcggatg cccccaccca
1741 ccaccaagga agatgtgacg atggccacag tgatggggtc actacctgat gtccggcagg
1801 cctgtcttca aatggccatc tcatggcatc tgagtcgccg ccagccagac atggtgcctc
1861 tggggcacca caaagaaaaa tatttctcag gccccaagcc caaagctgtg ctaaaccaat
1921 tccgaacaga tttggaaaag ctagaaaagg agattacagc ccggaatgag caacttgact
1981 ggccctatga atatctgaag cccagctgca tagagaacag tgtcaccatc tgagccctag
2041 agtgactcta cctgcaagat ttcacatcag ctttaggact gacatttcta tcttgaattt
2101 catgctttcc taaagtctct gctgctaagg ctctatttcc tcccccagtt aaacccctac
2161 attagtatcc cactagccca ggggagcagt aaactttctc tgcaaagact agatcctttt
2221 ttacgctttg cagaccgcat agtcactgtc tcaactactc agctctcctg ctgcagcatg
2281 aaggcagcca cagacaacat ggaaatgagt gtgactatgt tccaataaaa ctttatggac
2341 actg
//
LOCUS HUMRALBA 1327 bp ss-mRNA PRI 28-JUL-1990
DEFINITION Human GTP-binding protein (RALB) mRNA, complete cds.
ACCESSION M35416
KEYWORDS GTP-binding protein.
SOURCE Human retina, cDNA to mRNA, clone AS181.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1327)
AUTHORS Hsieh,C.-L., Swaroop,A. and Francke,U.
TITLE Chromosomal localization and cDNA sequence of human RALB, a GTP
binding protein
JOURNAL Somat. Cell Mol. Genet. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Swaroop, 20-JUN-1990.
FEATURES from to/span description
pept 171 791 GTP-binding protein (RALB)
mRNA < 1 1327 RALB mRNA
signal 1303 1308 poly-A signal
BASE COUNT 368 a 273 c 373 g 313 t
ORIGIN Chromosome cen-q13.
1 gagcccggca gctcaatgac aaatcggtgg aggacggctg gggtccggcc ccgggagggc
61 ccggggcgcg tttaagagct gcgggccggg tgcggacggc ggaggcggcg ggactggtcc
121 ctgctcttca gtgggtcatc tgtgtgtcac agcctcagaa gaccagcgag atggctgcca
181 acaagagtaa gggccagagc tccttggccc tccacaaggt gatcatggtt ggcagcggag
241 gcgttggcaa gtcagccctg acgcttcagt tcatgtatga cgagtttgta gaagactatg
301 aacctaccaa agctgacagt tatagaaaga aagtggttct tgatggggaa gaagttcaga
361 tagatattct ggacaccgct gggcaagagg actacgcagc cattcgagat aactactttc
421 ggagtgggga agggtttctt cttgtgttct caatcacaga acatgaatcc tttacagcaa
481 ctgccgaatt cagggaacag attctccgtg tgaaggctga agaagataaa attccactgc
541 tcgtcgtggg aaacaagtct gacctagagg agcggaggca ggtgcctgtg gaggaggcca
601 ggagtaaagc cgaagagtgg ggcgtgcagt acgtggagac gtcagcgaag acccgggcca
661 acgtggacaa ggtgttcttt gacctaatga gagaaatcag aacaaagaag atgtcagaaa
721 acaaagacaa gaatggcaag aaaagcagca agaacaagaa aagttttaaa gaaagatgtt
781 gcttactatg agtgtcaagg tgacggatga agccagctgc tcctaaggac acagggctgg
841 gttggtaaag agaaggctat ggttgacttc ttgcttgtgc ttcccactct ccccgacttc
901 attcactcaa acttctttaa atggggaaaa atatttgtga ctctgtggct ggcagaagaa
961 ataagcccat gcaagtggaa gggctgcttt gtcaggaggt tgtggaattt ctttcttctc
1021 cccttcttcc ctcccaaaag cttagctatg tataaagtgc cacagatagg aaacagctgt
1081 taattacaaa gagaaagaat tgtcatagca tcttattttg ttcctagttt tataacatta
1141 ccatccttcg ttttgaacta cagatgttgt agtgggtttt ggaggaggga gtggagtaag
1201 atgccctccc acttttatca gtttagtagt agtactgaga aaaatccctt cagctctaag
1261 aacactgaaa aatccaccga ttttttgggt aagcttcttg gcaataccct gtggatctga
1321 aacagct
//
LOCUS LACLACR 1332 bp ds-DNA BCT 28-JUL-1990
DEFINITION L.lactis lactose phosphotransferase system repressor (lacR) gene,
complete cds.
ACCESSION M35375
KEYWORDS lactose phosphotransferase system repressor; lactose repressor.
SOURCE L.lactis (strain MG1820) DNA.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 1332)
AUTHORS Van Rooijen,R.J. and Devos,W.M.
TITLE Molecular cloning, transcriptional analysis, and nucleotide
sequence of LACR, a gene encoding the repressor of the lactose
phosphotransferase system of Lactococcus lactis
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.J.Van Rooijen, 20-JUN-1990.
Author address: R.J.Van Rooijen
Netherlands Inst for Dairy Research
Kernhemseweg 2
P.O. Box 20
6710 BA EDE
THE NETHERLANDS
FEATURES from to/span description
pept 370 1155 lactose repressor (lacR; alt.)
pept 388 1155 lactose repressor (lacR; alt.)
mRNA 79 1245 lactose repressor mRNA
signal 1215 > 1155 transcription termination signal
binding 353 357 ribosomal binding site
signal 45 51 -35 region
signal 68 74 -10 region
BASE COUNT 469 a 207 c 198 g 458 t
ORIGIN
1 gatatcaaac attcaaacaa aacgcaacta tttttgttaa ttttttgttt ttttttattt
61 gtttttttaa aaaatagata acaccgttaa attattgttc atttttgttc atttaatcca
121 tcacaaaatg gacgtgaaat atctattcag gtattacaaa agtcttttac tttctataac
181 ttactgatta agaggtccta ctttattttc gtcttataca aaatctgacc taagctaata
241 tacgtcaatc ctctgttctt atttcatcat ctaacgtttg tttttgtttg aaattgtttg
301 ttttaccttg aaaatattat cttttatgat acaattaaaa gagaattatc tttggaaaaa
361 aattacttta tgaaagaaag tcttcatatg aacaaaaaac gacgattaga aaaaatttta
421 gatatgttaa agattgatgg gaccataacc ataaaagaaa taatagatga actagatatt
481 tccgatatga cagcccgtag agaccttgat gctctagaag ctgatggact tttaacacgt
541 actcatggtg gtgcacaatt gctttcctct aaaaagccac ttgaaaagac acatatcgag
601 aagaaaagtc taaatacaaa agaaaaaatt gacattgcta aaaaagcctg ctctttaatc
661 aaagatggcg atactatttt tattggaccc ggaactacac ttgtacaact ggcattagaa
721 ttgaaaggtc gtaaaggtta taaaattcgt gtcattacaa atagtctccc tgtgttcttg
781 attctaaatg atagcgaaac cattgattta ttgcttcttg gcggtgaata tagagaaata
841 actggagctt ttgtaggttc aatggcttcg acaaatttaa aagcaatgag atttgccaaa
901 gcttttgttc gtgcaaatgc tgttacccat aattctattg ctacatatag tgacaaggaa
961 ggtgtgattc aacaacttgc cctaaacaat gctgtagaaa aattcttatt agtagacagt
1021 actaaattcg atcgatacga tttctttaac ttctacaatc tagatcaact cgataccatc
1081 attacagata accagattag ccctcaacac ttagaggaat ttagccagta cactactatt
1141 ttaaaagcgg actagaatta tgacttataa aaatattgga ctactcttaa ataattagac
1201 ataaaaaaag caccgtatga atcaaacaat tctacggtgt ttttttgtta tttctaatgt
1261 atggtttgtc gaaaatatgt acacattatt taactttcca aaaaattgga gttttcttga
1321 taattggata tc
//
LOCUS MARCMYCA 1391 bp ds-DNA ROD 28-JUL-1990
DEFINITION Woodchuck c-myc protein gene, exon 1.
ACCESSION M35498
KEYWORDS c-myc protein.
SOURCE Woodchuck (Marmota monax) DNA.
ORGANISM Marmota monax
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae;
Marmotini.
REFERENCE 1 (bases 1 to 1391)
AUTHORS Wei,Y., Hsu,T.Y., Tiollais,P., Buendia,M.A. and Etiemble,J.
TITLE Evolutionary conservation of target sequences for cis-acting
regulation in c-myc exon 1 and its upstream sequences
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.A.Buendia, 21-JUN-1990.
FEATURES from to/span description
pre-msg 818 > 1391 c-myc mRNA and introns
IVS 1378 > 1391 c-myc intron 1
site 159 220 P0 promoter
site 790 794 TATA box
site 964 968 TATA box
BASE COUNT 317 a 385 c 383 g 306 t
ORIGIN
1 ctcagcgatt agtgcgtctt gcgggaatag ccgcttccca cacccggccg ggtggaagtc
61 tgagcctgct gggcaaaacg agcgatatct gctgttttgg cagcaaacta ggggattcat
121 tctgggtggg aagtgcccaa tctagatagc tgtgcataca taatgcataa tgaattacac
181 tcacacaacc tcaagaaatg taataggtat gtattcataa cactctccaa gtatatgtgg
241 caaggcattg ctgcgttatt ttaattattc cagaaatcat tttcctccct acctcctctg
301 tcatttatcc ctaacactcc atatactgaa tgcgcactca taaatattcc ttctgcccgc
361 ctgtcttcat aagacttatt ttcaaaatgc tgctctttcc ccagccttag ggaggcgccc
421 ggccgcccgg gacgtgcgtg cgcggccgtg ggtacatggt gtattctcag tgttgagggt
481 gagggcagct gttccacctg tgttaattgg aacacgcagg acgagaatgc agtttgtcag
541 agtactgcgc cagaggagca gcagagaaag ggaaaggatt taaacaggag caaaagaaaa
601 tggtaggcgc gcgcagttaa ttcttgctgc gcccttatac tgtttacatc cgatagctgg
661 agtgccgggc tgcggggctg agtctcctcc ccttccctca ctcggcagtg cccctcccag
721 gttcccaaag ccgagggcgg ggagaaagaa aaaaaaaaga ttccgtggaa tccccgccca
781 ccagcccttt ataatacgag ggtctgcgcg cccgaggacc cctgagctgc gcttctcgtg
841 gccgccaaca tcgccgcgcc ccggcggccg ctcttggctc ccctcctgcc tagagaaggg
901 cagggcttct tagaggcttg gcgggaaaaa gaagcgaggg ggagggatcg cgcgtaacag
961 cagtataaaa gtcgttttcg gggctttatc tcactcgctg tagtaattcc agcgagagac
1021 agagggagtg agcgggcgaa cccgtgaggt ggaagaaccg agcagagctg ctccgggcgt
1081 cctgggaagg gaaacccgga gtgaaaggag acttagtctt ctgaccagcg cccccacccc
1141 agccctcccg cggagcccct ccagggtccg caaccgcgaa actttgccct ttgctgcggg
1201 cggacacttt gcactggaac ttaaaatacc cgatcgagga cgcgactctc cggagcgggg
1261 aggctatact gcctatttgg ggacactttt ccccgccttt acccaggacc cgctcctctg
1321 aaagcgctcc tggctgccgt ttgaaggctg gatttccttc gggtagttga aaacccggta
1381 agcaccagat c
//
LOCUS ONGOSTLE 214 bp ds-DNA INV 28-JUL-1990
DEFINITION O.volvulus recombinant antigen gene, 3' end.
ACCESSION M35370
KEYWORDS T-cell epitope; recombinant antigen.
SOURCE O.volvulus DNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 214)
AUTHORS Colina,K.F., Perler,F.B., Matsumura,I., Meda,M. and Nutman,T.B.
TITLE The identification of an Onchocerca-specific recombinant antigen
containing a T cell epitope
JOURNAL Unpublished (19900
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.B.Nutman, 19-JUN-1990.
Author address: T.B.Nutman
Inst. Lab. of Parasitic Diseases
NIAID
National Institutes of Health
Bldg 4, Rm 126
9000 Rockville Pike
Bethesda, MD 20892
email: tbn@helix.nih.gov
FEATURES from to/span description
pept < 1 93 recombinant antigen (AA at 1)
site 91 93 nematode splice junction
BASE COUNT 81 a 37 c 31 g 65 t
ORIGIN
1 gaattcagtg taagaagcag cagaacattt caatcattac gaagatatat atacaacatt
61 tctttcttct tcattcttga gttgcatatg taaattcaaa aataattacg atttaatgaa
121 ttgagcaagc ataacttttc ccagcaagta taacaaagtt ttgcgaggaa cgaactcaga
181 aaactttcac ttatgtaaaa ttgcgcacaa gacc
//
LOCUS PSEIAAL 2766 bp ds-DNA BCT 28-JUL-1990
DEFINITION P.syringae IAA-lysine synthetase (iaaL) gene, complete cds.
ACCESSION M35373
KEYWORDS IAA-lysine synthetase.
SOURCE P.syringae savastanoi DNA.
ORGANISM Pseudomonas syringae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 2766)
AUTHORS Roberto,F., Klee,H., White,F., Nordeen,R. and Kosuge,T.
TITLE Expression and fine structure of the gene encoding IAA-lysine
synthetase from Pseudomonas savastanoi
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.J.Klee, 20-JUN-1990.
FEATURES from to/span description
pept 1100 2287 IAA-lysine synthetase (iaaL)
pept 95 1003 ORF1
BASE COUNT 637 a 760 c 754 g 615 t
ORIGIN
1 gaattccata gcgtgcgggg cttggaggag cgccgcggcc tgagtatctg tggctaaccc
61 ttgcggcttc ggtgctggtc gctgtcgagc agctatgcgt gcagatcctg cgcagtgcag
121 gcttcggcaa acaggcgatg tggaccctgc tgacggggac ggccgccgtt gccatcgcag
181 atcccctgct tattgtggcg ttcgatctgg gcctggtggg tgccggcatc gctacctgat
241 atcgagcctg gtatcggcct gtctggggtt ttactacgtt caccgagtcg cccatctgac
301 ctgtcgggtc agcctgaaga acctgtcagg tgacatcaga aatatcgggc gaaccgcctt
361 gccagcggtg attggcaacc tggcaactcc agtgggcatg gcctacgtaa tggctgcgat
421 ggcgccgttc ggatctcagg cgctggcgac tatcggggtg atcgacaggg tcattcaggt
481 tgctttttgt gtcgtgttcg ccttgcccgg tgcgctgatc ccgatactgg ggcaaaacct
541 gggcgcaatg aacactgctc gcgtgtctca agccataaag atgacgtacg gattgttgat
601 cggctacggc tcagtgacct cgctgttact cattctgctc gctgagccat tagccagctt
661 gtttcatctc gccgctgaac gccaagtcgt gttcttcgcg ttctgccgat ggggcggcgc
721 tctggacgct catcgggctg caattcattg ccacctcagt cttcctcagt atggggcgac
781 cggcgtacgt cacactgttc ggctggttcc gcgccacctg ggaaccatgc cgttcgtgtg
841 gtatggggca cataaatttg gcagcgtcgg ggtaatgctc gggcagttgc tgggtaacac
901 catagtggcc ttttgtgcct gcgtggctcg cgcatctgct catgaaaaag atgttggaca
961 tcgagatcca ttcaataggg aaccgatccc tccacaggag taactgataa tccacgtttt
1021 gcccaccctt ggctgtcgtc aggtgggcag gatgtccagg atgtccagga aatcaaaaaa
1081 cggactatag aggactcgca tgactgccta cgatatggaa aaggaatgga gtagaatttc
1141 cattactgcc gctaaaatcc accagaacaa cgattttgaa ggattcactt atcaggactt
1201 cagaacccac gtaccgatca tggacaaaga cggcttcgcg gcacagactg aacgctgtct
1261 agagcgcaat gagcgaaact gcctgatcgg ctttaccagt ggcaccagcg gcaacatcaa
1321 acgctgttat tactactacg actgcgaagt cgatgaagac agctccctct ccaacgtctt
1381 ccgcagcaac ggctttattc tgcccggtga tcgctgcgcc aacctgttca cgatcaacct
1441 gttttctgct ctgaacaaca cgattaccat gatggccggt aactgcggtg ctcacgtcgt
1501 gtccgtaggt gacatcaccc tggtgaccaa gagccatttc gaagcgctta actcgatcaa
1561 gctcaacgta ctgctcggcg tgccatccac tatcttgcag ttcatcaatg ccatgcaaca
1621 taacggtgtg cacatcaata tcgagaaggt tgtcttcacc ggcgagagcc tgaaaacttt
1681 ccagaagaaa atcatcaggc aagcctttgg cgaacaagtc tccatcgtcg gtgtgtatgg
1741 cagttccgag ggcggcattc tcggtttcac caacagccct tgccacactg aatacgagtt
1801 tctgtccgac aagtatttca tcgaaaaaga aggcgacagc atcctcatca cctcgctgac
1861 ccgagaaaac tttacgccgc tgctgaggta tcgcctagga gacaccgcaa ccctttcgat
1921 gaaaggcgac aagctctacc tgacagacat ccagcgggag gacatgagct tcaacttcat
1981 gggcaacctc atcgggctgg gcatcattca gcaaacgatt aaacagacac tgggccgatc
2041 gctggaaatc caggttcacc tgtcagtgac cgaagagcgc aaggaactgg tgaccgtttt
2101 cgttcaggcc tctgaagtcg atgaagacga acgcgtcaga atcgaaacag ccatcgccga
2161 tatccccgac atcaaagagg cgtatcagaa aaaccaaggc accgtgtcgg tcctgcgcaa
2221 ggatgccaga gactacgcgg tctcggagcg aggcaaaatg ctctacatca tcgaccgccg
2281 aaactgaatg gctgatgtga acgagtgagt agctgcaccg acggggcctt tggcggtgtc
2341 ggtgcagttt tttagaggat tcggaagcgc cagaggtcag agtccacgaa actggaacga
2401 actgggcagc ctgcggctgc aaattgtggg attttgaaat cggttatcat agccgaaatc
2461 gagtcgatcc ctcctcagca caggcttaca catggcgtca gagaccaaaa aacgtaaacg
2521 ggcgagccgg gcaaaagcca aggcaaagca gacccgtctc caacgcgccg ggcatactac
2581 cttcgtgccc gataccgact tttccttcga tatcgatcct ttcggtgatg tcgatctttg
2641 tagttgctgc cagacaacgt atctgaacga catgtttccc gacgcttctt gcgtaaggct
2701 ttagatgaga gaagggccag gcggattcgc atcaccgccg tcattcacca cgatgaggag
2761 ccgcct
//
LOCUS RABLPBA 1446 bp ss-mRNA MAM 28-JUL-1990
DEFINITION Rabbit lipopolysaccharide binding protein (LBP) protein mRNA,
complete cds.
ACCESSION M35534
KEYWORDS lipopolysaccharide binding protein.
SOURCE Rabbit liver, cDNA to mRNA.
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (ases 1 to 1446ites; for [2]
AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
TITLE Structure and function of lipopolysaccharide binding protein
JOURNAL Science (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1446)
AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.S.Tobias, 21-JUN-1990.
Author address: P.S.Tobias
Department of Immunology, IMM-12
10466 N. Torrey Pines Rd
La Jolla, CA 92037
FEATURES from to/span description
pept 1 > 1446 lipopolysaccharide binding protein (LBP)
precursor
sigp 1 78 lipopolysaccharide binding protein signal
peptide
matp 79 1446 lipopolysaccharide binding protein
BASE COUNT 316 a 454 c 391 g 285 t
ORIGIN
1 atggggacct gggccagggc cctgctgggg tccaccctgc tgagcctgct gctcgcagct
61 gccccgggag ctctgggcac caaccccggc ctcatcacca ggatcaccga caaaggcctg
121 gagtacgcgg ccagggaggg gctgctggct ctgcagagaa agctcctgga agtcacgctg
181 ccggattccg atggggactt caggatcaaa catttcgggc gtgcacagta caagttctac
241 agtctgaaaa tccccagatt cgagctgctc cgtggcaccc tgaggcccct ccccggccag
301 ggcctgagtc tcgacatctc cgacgcctac atccacgtgc ggggcagctg gaaggtgcgc
361 aaggcgttcc tgagactgaa gaactccttt gacctgtatg tcaagggcct caccatttcc
421 gtccacctcg tgttgggcag cgagtcctcc gggaggccca cggtcaccac ctccagctgc
481 agcagcgaca tccagaacgt ggagttggac atagaggggg acctggagga gctgctgaac
541 ctcctccaaa gccagatcga tgccaggctg cgcgaagtgc tggagagcaa gatttgcagg
601 cagattgagg aagccgtgac ggcccacctg cagccttatc tacagacact gccagtcaca
661 acgcagatcg acagctttgc cggcattgac tacagcttga tggaggcccc ccgggcaaca
721 gctgggatgt tggatgtgat gtttaagggt gaaattttcc ctctggatca ccgcagccca
781 gtggacttcc ttgctccagc catgaacctc cccgaggctc acagccgaat ggtctacttt
841 tccatctccg attacgtctt caacaccgcc agcctggcct accacaagtc agggtactgg
901 aacttctcca tcacagacgc catggttccg gccgacctca acatccggcg gaccaccaag
961 tccttccgac ccttcgttcc cctgcttgcc aatctctacc ccaacatgaa cttggagctc
1021 caagggacag tgaactcgga acaactggtg aacctcagca ccgagaatct gttagaggaa
1081 cccgagatgg atattgaggc cttggtggtc ctgcccagct ctgccaggga gcctgtcttc
1141 cggctgggtg tggccactaa tgtgtctgcc acactgacct tgaacaccag gaagatcact
1201 gggttcctga agccgggaag gctacaggtg gaactgaaag aatccaaagt cggaggattc
1261 aatgtggagc tgttggaagc tctcctcaac tactacattc tcaacaacct ctaccccaag
1321 gtcaatgaga agttggccca ccgcttcccg ctccctctgc tgaggcacat tcagctctac
1381 gacctgcttc tccagaccca cgagaacttc ctgctcgtgg gcgccaacat ccagtacagg
1441 agagtt
//
LOCUS RATUKATPA 1484 bp ss-mRNA ROD 28-JUL-1990
DEFINITION R.norvegicus gastric (H+,K+)-ATPase beta-subunit mRNA, complete
cds.
ACCESSION M35535
KEYWORDS (H+,K+)-ATPase beta-subunit.
SOURCE R.norvegicus gastric mucosa oxyntic cell, cDNA to mRNA, clone RG4.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1484)
AUTHORS Canfield,V.A., Okamoto,C.T., Chow,D., Dorfman,J., Gros,P.,
Forte,J.G. and Levenson,R.
TITLE Cloning of the H,K-ATPase beta subunit: Tissue-specific expression,
chromosomal assignment, and relationship to Na,K-ATPase beta
subunits
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by V.A.Canfield, 21-JUN-1990.
Author address: V.A.Canfield
Yale University School of Medicine
Dept. of Cell Biology
P.O. Box 3333
New Haven, CT 06510
email: levenson@YALEMED
FEATURES from to/span description
pept 176 1060 gastric (H+,K+)-ATPase beta-subunit (E.C.
3.6.1.3)
mRNA 1 1484 gastric (H+,K+)-ATPase beta-subunit
BASE COUNT 363 a 457 c 362 g 302 t
ORIGIN
1 ctgacttctg ggacagtgga ggacagatag cacgcaagcc ccagccctcc cttatgttta
61 tagaggcgat agcggagaac tgatagctgg ttctgatgcc tttggcctca cacagaggag
121 actataagcc ccagaggacg ctccctgggc ccagtccagg caagcaggag aggacatggc
181 agccctgcag gagaagaagt catgcagcca gcgcatggcc gaattccggc aatactgttg
241 gaacccggac actgggcaga tgctgggccg caccccagcc cggtgggtgt ggatcagcct
301 gtactatgca gctttctacg tggtcatgac tgggctcttt gccttgtgca tctatgtgct
361 gatgcagacc attgatccct acacccccga ctaccaggac cagttaaagt caccgggggt
421 aaccttgaga ccggatgtgt atggggaaag agggctgcag atttcctaca acatctctga
481 aaacagctcc tgggctggcc tcacacacac cctccacagc ttcttagcgg gctacacccc
541 agcatcccag caggacagca tcaactgttc gtctgaaaag tacttcttcc aggagacctt
601 ttctgctccg aaccatacca agttctcctg caagttcacg gcggacatgc tacagaattg
661 ctcaggcctg gtggacccca gtttcggctt tgaggaggga aagccctgct tcattattaa
721 aatgaacagg attgtcaagt tcctgcccag caacaacacg gctccccgag tggactgcac
781 cttccaggat gacccccaaa agccccggaa ggacattgaa cccctgcagg tccagtacta
841 tccccccaat ggtaccttca gtctccacta cttcccctac tacggcaaga aagcacagcc
901 ccactacagc aaccctctgg tggcggcaaa gttcctcaac gtccccaaaa acacgcaagt
961 cctcatcgtg tgcaagatca tggcggacca cgtgaccttc gacaaccccc acgaccccta
1021 tgaagggaag gtggagttca agctcacaat acagaagtaa ggagtaggcg tggctgtcca
1081 ccccagagcc tggtggaccc tgagggacca ctcttcctga ctgacatcat cggctggcca
1141 gcatgcacgg ccacttcatg gttcagagct gacaccactg cccatctgcc gacagcagga
1201 agtgctcctt cccagcactc cctgagcacc accagctttg aactgaaacc cgacgtgcgc
1261 acgcacgttt gcaatcccgt gcggttaaca caggaaccca gagtccggct accactaagg
1321 gacaacccat ctgtagggca tttctatcct gtgaccattt gtctgtcctg cactttgata
1381 tgaactatgg gtccacatca gtgtaacact ggtcaccccg gcctccagtt tgtgcttctg
1441 gggccacagc ccctaggtca ttaaaacaaa ctatagtaaa gtta
//
LOCUS YSCMYO2A 5675 bp ds-DNA PLN 28-JUL-1990
DEFINITION S.cerevisiae myosin-1 isoform (MYO2) gene, complete cds.
ACCESSION M35532
KEYWORDS myosin-1.
SOURCE S.cerevisiae (strain GRF88) DNA, clone 10-2B.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 5675)
AUTHORS Johnston,G.C., Prendergast,J.A. and Singer,R.A.
TITLE The S.cerevisiae MYO2 gene encodes an essential myosin for
vectorial transport of vesicles
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.C.Johnston, 21-JUN-1990.
Author address: G.C.Johnston
Dalhousie University
7E Tupper Medical Bldg.
Halifax, N.S.
CANADA B3H 4H7
email: JOHNSTON@AC.DAL.CA
FEATURES from to/span description
pept 581 5305 myosin 1 isoform (MYO2)
binding 138 171 ATP-binding site
binding 443 523 actin-binding site
rpt 926 981 heptad repeat
rpt 1010 1086 heptad repeat
BASE COUNT 1994 a 986 c 1110 g 1585 t
ORIGIN Chromosome 15 right arm.
1 gatcaataaa taaataggct cgaagacgcc tcagaactcc ggtcactggt ttgtcttgtt
61 gatatacgat gtgccaagcg ccgtttctcg atgcttatct ggtttagttt acgctgttaa
121 aaccaaaacc ccaacagatt ttcgacccta acgtatgtag ggctaaaata gatattgagt
181 aggttacaat taattattgg caattgcacc tagtgacaca tttacgaaaa cgtagggcaa
241 aaactattac ccgacccagg gctattttgt gattttttcc ttttttttgt ttatgatcgc
301 gcttctcgaa aagccaaata tcagaaatcc caaacacgcc ttcatttgat acgattcgta
361 gcctgcgttt cagagatcta tcaactttgc aaggccaatc agagaacaaa aaagtctcgc
421 aaagtcattt cacttttctc gcttgaaatt attcgttcga tttctggctg cttgcttgtt
481 ttttgttttc taaggtacta ttcgacacca ttccattgga cagcgatact tataccattg
541 tacatatagg acataaaaac agcagatatt acagcgtata atgtcttttg aagtgggtac
601 acgatgctgg tatccccata aagaattggg ctggattggg gcggaagtaa tcaaaaatga
661 gttcaacgac ggcaagtacc acctggagtt acaattggaa gacgatgaaa tcgtgtccgt
721 ggacacaaaa gacttgaata acgataagga ccaatctcta ccgcttctta gaaaccctcc
781 cattttggaa gcaacggaag atttgacctc tttatcttac ttgaatgagc cagctgtttt
841 acatgccatc aaacagcgct attctcaatt gaatatctac acatactcgg gtattgttct
901 gattgctaca aacccttttg atcgtgtcga ccagctttat acacaagaca tgatccaagc
961 atatgcggga aagcgcagag gtgaactgga acctcacttg tttgccattg ccgaagaagc
1021 gtataggttg atgaaaaatg acaaacaaaa tcaaaccatt gtggtaagtg gtgaatctgg
1081 tgctggaaaa acggtttctg ccaagtatat tatgcgttat tttgcttctg tagaagagga
1141 aaattccgct actgtacaac atcaagtgga aatgtcggaa acagaacaaa agattctagc
1201 tacaaaccct atcatggaag catttggtaa tgctaagact accagaaatg acaattcttc
1261 cagatttggt aagtatctag aaattttatt cgataaggac acatctatta ttggagcaag
1321 gatccgcaca tacttgttgg aacggtccag attagtttac cagccgccaa ttgagagaaa
1381 ctaccacata ttttatcaat taatggctgg attaccagct caaaccaagg aggaattgca
1441 tcttaccgat gcctcagatt acttctacat gaaccaaggc ggtgacacca agatcaacgg
1501 tattgatgat gccaaagaat acaaaattac agtagatgca ttgacattag tcggaatcac
1561 caaggaaact caacaccaaa tatttaagat cttggccgca cttctgcata tcggtaacat
1621 agaaattaaa aaaactagaa atgatgcatc actatcagct gatgagccaa acctgaaact
1681 ggcgtgcgaa ttgctgggaa ttgatgccta caactttgcc aaatgggtca ccaaaaagca
1741 gatcattaca aggtcagaga aaattgtttc gaatctaaat tatagtcaag ctctggttgc
1801 caaagattcc gtggctaagt ttatttattc cgcccttttc gattggcttg tggaaaatat
1861 caacaccgtg ttatgcaacc cggctgtgaa cgaccaaatt agctcattta ttggtgttct
1921 ggatatttat gggtttgaac attttgaaaa aaattcattt gaacaatttt gtattaacta
1981 tgccaacgaa aaactacaac aagagttcaa ccaacatgtt ttcaaattag agcaagaaga
2041 atacgttaaa gaagaaattg aatggtcttt tatagagttt aatgataatc aaccttgtat
2101 tgatctgatt gaaaacaagt tgggtatttt atcactgctt gacgaagaaa gtaggttacc
2161 tgctggttcc gacgaatctt ggacccaaaa actttatcaa actttggata aatctcctac
2221 gaacaaagta ttttctaaac caagattcgg gcaaactaaa tttatcgtga gccattatgc
2281 tctagatgtc gcttatgatg tggaaggatt tattgaaaaa aatagagaca ccgtatctga
2341 cggacatttg gaagtgttga aggcttctac caacgagaca ctaataaata tcttagaggg
2401 attagaaaaa gctgccaaaa aactggaaga agcgaaaaag cttgaattag agcaggctgg
2461 cagtaaaaag ccaggtccga taagaacggt taacaggaaa cccactttag gttccatgtt
2521 taagcaatct ttgattgaac taatgaatac catcaactca actaatgttc attatattcg
2581 ttgtataaag cctaatgcag ataaagaagc ttggcaattt gataatttga tggtgttgtc
2641 tcaactcaga gcctgtggtg ttttggaaac tattagaata tcttgtgctg ggtttccttc
2701 taggtggact tttgaagaat ttgtattaag atattacatc ttgataccac atgagcagtg
2761 ggacctaatc ttcaaaaaaa aggaaactac agaagaagat atcatatcag tggttaaaat
2821 gatcctagat gctactgtaa aggacaaatc caagtaccag attggtaata caaaaatttt
2881 cttcaaagca ggtatgcttg catatctgga aaaacttaga agcaataaga tgcataattc
2941 aattgttatg atccagaaga aaattagagc taaatattac cgtaagcagt atttgcaaat
3001 atctcaggcc atcaagtatt tgcagaacaa catcaaaggt ttcatcattc gtcaacgcgt
3061 taatgatgaa atgaaagtta actgtgcaac tttattacag gccgcttaca ggggtcattc
3121 catccgtgcc aatgtgttca gcgtattgag aacaattaca aatttgcaaa agaaaattag
3181 aaaggaacta aaacaaagac aactgaaaca agaacatgaa tataatgctg cggtaactat
3241 tcaaagtaaa gttaggacct ttgagccgag atcgagattt ttacgcacta aaaaagacac
3301 tgttgttgtc caatctttga tcagaagaag agctgctcaa aggaaattga aacaattgaa
3361 ggcagacgct aaatcagtta atcatctgaa agaagtgagc tataaattag agaataaagt
3421 gattgaactg acgcagaatc tagcatccaa ggtcaaagaa aataaagaaa tgacagaaag
3481 aattaaagaa ctacaggttc aagtggaaga aagtgccaag ttacaagaga cattagaaaa
3541 tatgaaaaaa gagcacttaa tagatattga taatcagaaa tctaaggata tggaattaca
3601 aaaaactatt gagaacaatt tgcaatccac tgaacaaact ctaaaggacg ctcaattaga
3661 gttggaggac atggttaaac aacatgatga attgaaagaa gaatctaaaa agcaacttga
3721 agaattagag caaacaaaga aaacattggt tgaataccag acattaaacg gagacttgca
3781 aaacgaagtt aaatctttaa aggaagaaat tgctaggtta caaactgcca tgtcgctggg
3841 caccgttact actagtgtac tacctcaaac accattaaag gatgtaatgg gaggcggtgc
3901 ttcaaatttc aacaatatga tgcttgagaa ttccgactta tctcctaatg atttgaatct
3961 aaagtctaga tctactccat cgtccggaaa caaccacatt gattcattga gtgtcgatcg
4021 cgaaaatggt gtcaatgcta cacaaatcaa tgaagagtta tacaggttat tggaggacac
4081 tgaaattttg aatcaagaaa tcacggaagg cctgttaaag ggattcgaag taccggatgc
4141 tggtgtagct attcaactaa gtaaaagaga cgttgtttat ccggctagaa tactgattat
4201 agttttaagt gaaatgtgga gatttgggct gaccaagcaa agtgaaagct ttcttgccca
4261 agtattgact acaattcaaa aagttgtcac tcaattgaag ggtaacgatt taattccaag
4321 cggtgtattc tggttagcaa acgttagaga gttatactca tttgtggtgt ttgctctaaa
4381 ctctatttta accgaagaaa cgttcaaaaa cggcatgacc gatgaggagt ataaggagta
4441 tgtttcattg gtcacagaac taaaggatga tttcgaagct ctaagttata atatatataa
4501 catttggctg aagaaattgc agaagcaatt gcaaaaaaag gccatcaatg ctgtggtcat
4561 ctccgaatca ttaccaggtt tcagcgcggg agaaaccagc gggtttttga acaaaatttt
4621 tgctaacact gaagaatata caatggacga cattttgacc tttttcaaca gcatatactg
4681 gtgcatgaaa tcttttcata ttgagaatga agtgttccat gctgtagtca caaccttatt
4741 gaattatgtg gatgcaattt gttttaacga attaatcatg aaacgtaatt tcttgtcgtg
4801 gaaaaggggt cttcaattga actacaacgt tactagatta gaggaatggt gcaagacgca
4861 tggcttgaca gatggtactg agtgcttaca acatttgatt cagaccgcta agctactgca
4921 agtccgtaag tatactatcg aagacattga tatcttaaga ggaatttgtt attcgctaac
4981 acctgcacaa ttgcaaaaat tgatttcaca ataccaggtg gcagactatg agtctccaat
5041 tccacaggaa atcttaagat acgttgctga tatagttaag aaagaagctg cgttatcttc
5101 atcaggtaat gattctaagg gtcacgagca tagcagcagt atatttatca ctccagaaac
5161 aggtccattt actgacccat tcagtttgat aaagacaaga aaatttgacc aagtagaagc
5221 ctatatacca gcgtggttat ccttgccctc aactaagaga atagttgacc ttgttgccca
5281 acaagtcgtt caagacggcc actaaaactg atggcgcgag aaacaaaatt gtacatgaat
5341 gctaaaaaaa gaaatgacaa aaaaagagaa aaaaaaaaat gaaactacat agttaattaa
5401 taatagaagt atttgtcaat agtatgataa tgaaatcgat attatggaag atattaaccg
5461 cgcgccgtat tagtgtacac tatattaaac tacattttgc ttcttactga atttataaat
5521 tatgattata ttattattac tattatgact actgtatata tttttagaat tagatcggga
5581 accgatgagc gttagctgaa atggacgacg ataaggaacg ataattacca ctagtaaaat
5641 aataacaact aagaataaac acattctcat tttta
//
LOCUS PTUB256 205 bp ds-DNA SYN 28-JUL-1990
DEFINITION Synthetic pTUB256 alpha-amylase gene promoter region.
ACCESSION M36663
KEYWORDS alpha-amylase.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 205)
AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
TITLE Two tandemly located promoters, artificially constructed, are
active in a Bacillus subtilis alpha-amylase secretion vector
JOURNAL J. Biochem. 99, 1181-1190 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 164 > 205 synthetic alpha-amylase
BASE COUNT 73 a 29 c 44 g 59 t
ORIGIN
1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaagcgtga acaaaattct
61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga
121 ctccgaagta agtcttcaaa aaatcaaata aggagtgtca agaatgtttg caaaacgatt
181 caaaacctct ttactgccgt tattc
//
LOCUS PTUB261 232 bp ds-DNA SYN 28-JUL-1990
DEFINITION Synthetic pTUB261 alpha-amylase gene promoter region.
ACCESSION M36664
KEYWORDS alpha-amylase.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 232)
AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
TITLE Two tandemly located promoters, artificially constructed, are
active in a Bacillus subtilis alpha-amylase secretion vector
JOURNAL J. Biochem. 99, 1181-1190 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 191 > 232 synthetic alpha-amylase
BASE COUNT 82 a 28 c 52 g 70 t
ORIGIN
1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaagcgtga acaaaattct
61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga
121 ctcggggttg ttattatttt atcgatatgt aaaatataat ttctagaaga aaagaaggtg
181 gagaggaaac atgatccaaa aacgattcaa aacctcttta ctgccgttat tc
//
LOCUS PTUB263 232 bp ds-DNA SYN 28-JUL-1990
DEFINITION Synthetic pTUB263 alpha-amylase gene promoter region.
ACCESSION M36665
KEYWORDS alpha-amylase.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 232)
AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
TITLE Two tandemly located promoters, artificially constructed, are
active in a Bacillus subtilis alpha-amylase secretion vector
JOURNAL J. Biochem. 99, 1181-1190 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 191 > 232 synthetic alpha-amylase
BASE COUNT 78 a 40 c 49 g 65 t
ORIGIN
1 aagcactccc gcgatcgcct atttggcttt tccccaaaat gtaagcgtga acaaaattct
61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga
121 ctcggggttg ttattatttt atcgatatgt aaaatataat ttctagaaga aaagaaggtg
181 gagaggaaac atgatccaaa aacgattcaa aacctcttta ctgccgttat tc
//
LOCUS PTUB265 214 bp ds-DNA SYN 28-JUL-1990
DEFINITION Synthetic pTUB265 alpha-amylase gene promoter region.
ACCESSION M36666
KEYWORDS alpha-amylase.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 214)
AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
TITLE Two tandemly located promoters, artificially constructed, are
active in a Bacillus subtilis alpha-amylase secretion vector
JOURNAL J. Biochem. 99, 1181-1190 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 173 > 214 synthetic alpha-amylase
BASE COUNT 73 a 28 c 50 g 63 t
ORIGIN
1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaatcgtga acaaaattct
61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga
121 ctcggggttg ttattatttt atcgctagaa gaaaagaagg tggagaggaa acatgatcca
181 aaaacgattc aaaacctctt tactgccgtt attc
//
LOCUS RABMEPHA 1653 bp ss-mRNA MAM 28-JUL-1990
DEFINITION Rabbit microsomal epoxide hydrolase.
ACCESSION M21496
KEYWORDS microsomal epoxide hydrolase.
SOURCE Rabbit (New Zealand White) adult liver cDNA to mRNA, clone pEH.
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 1653)
AUTHORS Hassett,C., Turnblom,S.M., DeAngeles,A. and Omiecinski,C.J.
TITLE Rabbit microsomal epoxide hydrolase: Isolation and characterization
of the xenobiotic metabolizing enzyme cDNA
JOURNAL Arch. Biochem. Biophys. 271, 380-389 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer readable copy for sequence [1] kindly
submitted by C.Hassett 12-JAN-1989.
FEATURES from to/span description
pept 148 1515 microsomal epoxide hydrolase (EC 3.3.2.3)
BASE COUNT 351 a 505 c 465 g 332 t
ORIGIN
1 cggcatccgc aaggacctgt acgccaacac ggtgctgtct cgcctctccc gcagctctgc
61 agtgtcgccg tgcgcagagt tccacagctc tgcttcccaa gcaggtgagc agaggctgac
121 aacacagcgc ccttgtggac aggagccatg ttgctggaac tccttctcgc ctcggtgctg
181 ggcttcgtca tctactggtt cgtctctgga gacaaggagg agagtctgcc actggaggat
241 gggtggtggg gcccggggtc gaggcccgta ggcctggagg acgagagcat ccggcccttc
301 aaggtggaga cgtcggacga agagatcaac gacttacacc agaggatcga caggatccgc
361 ttgaccccac ctttggagaa cagccgcttc cactacggct tcaactccaa ctacctgaag
421 aagatcctct cctactggag gcacgaattc gactggaaga agcaagtgga gattctgaac
481 tcataccctc acttcaagac caagatcgaa gggctggaca tccacttcat ccacgtgaag
541 cccccgcagg tgccccctgg ccgcacccca aagcccttgc tgatggtgca tggctggccc
601 ggctccttct tcgagttcta caaaatcatc ccgctgctga ctgaccccaa gagccacggc
661 ctgagcgatg agcacatctt tgaagtcatc tgcccttcca ttccaggcta tggcttctca
721 caggcatctt ccaagaaggg cttcaactcg gtgagcaccg ccaggatctt ctacaagctg
781 atgctgcggc tgggcttcca ggagttctac atccagggcg gggactgggg ggccctggtc
841 tgcacgaaca tggcccagct ggtgcccagc cacgtgaaag gtctgcactt gaacatggct
901 ttgattttaa gaaatcacta cactctgacc ctcctgctgg gacggcgcat cgggggactt
961 cttggctaca ctgagaggga catggagctg ctgtacccct tcaaggagaa ggtgttctac
1021 agtctgatga gggagagcgg ctacatgcac atccgggcca ccaagcccga cactgtgggc
1081 tgtgctctga atgactctcc tgtgggactg gctgcataca ttctagagaa attttccacc
1141 tggaccaact cagaattccg agacctggag gacggaggcc tggagaggaa gttctccctg
1201 caggacctgc tgaccaacat catgatctac tggaccactg gctccatcgt ctcctcccag
1261 cgctactaca aggagaacct gggccagggc ttcatggccc acaagcatga gcggctgaag
1321 gtccacgtgc ccacgggctt cgcagccttc ccgtgtgaga taatgcatgt gccagagaag
1381 tgggtgagga ccaagtaccc gcagctcatc tcctactcct acatgccccg cgggggccac
1441 ttcgccgcct tcgaggagcc ggagctgctg gcccgggaca tctgcaagtt cgtggggctg
1501 gtggagcggc agtgatgctc ccagccttgc ctggggtgag gggtcggctt gcctcctccc
1561 ctggcctgct ggaacccacc tcaggcctcc atactcactg tctcaccccc atggcgtggc
1621 tgataaatga tttgactccc aaaaaaaaaa aaa
//
LOCUS XELBETA 1138 bp ss-mRNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A1 mRNA, complete cds.
ACCESSION M35359
KEYWORDS thyroid hormone receptor beta A1 protein.
SOURCE X.laevis, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1138)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
FEATURES from to/span description
pept 1 1110 thyroid hormone receptor beta A1 protein
BASE COUNT 351 a 228 c 283 g 276 t
ORIGIN
1 atggaagggt atatacccag ctacttggat aaagatgagc tatgcgtggt gtgtggagac
61 aaggctacag ggtatcatta tagatgtatc acctgcgagg gctgcaaggg cttttttaga
121 agaactattc agaagaacct ccacccaagc tattcttgta aatatgaagg aaaatgtgtt
181 atagacaaag taacaagaaa ccagtgccaa gaatgtcgct tcaaaaagtg catcgctgtt
241 ggaatggcaa cagacttggt tttggatgac aacaaacgtt tggcaaaaag aaagctcata
301 gaagaaaaca gagaaaaaag acggaaagat gagattcaga aatcacttgt tcagaaacct
361 gaacccacac aagaagaatg ggagttgata caagttgtca ctgaagcaca tgtggccacc
421 aacgcacagg gaagccactg gaaacagaaa agaaaatttt tgccagagga cattggacaa
481 gctcccatag ttaatgcgcc cgagggtgga aaagtggact tagaagcctt cagccagttt
541 acaaaaataa tcaccccagc aattacaaga gttgttgatt ttgccaaaaa gctacctatg
601 ttttgtgagc tgccatgtga agaccagatc atccttctta aaggctgttg tatggagatc
661 atgtcgctcc gagcagcagt gcgttatgac cccgaaagtg aaactctaac gttaaatggt
721 gagatggcag tgacaagggg gcagctaaaa aatggaggac ttggagtggt ttcagatgcc
781 atctttgact taggggtatc gctttcttca ttcagtcttg atgataccga agtcgccttg
841 ttgcaggctg tgctgcttat gtcatcagat cggcctggtc ttgctagcgt ggagagaata
901 gaaaagtgcc aggaaggttt cctcttggct tttgaacact acattaatta caggaaacat
961 aacattgcac acttttggcc aaaactgctg atgaaagtca ccgacctccg catgattgga
1021 gcgtgccacg ccagccggtt cctgcacatg aaggtggagt gccccactga actgtttccc
1081 ccactgttct tggaagtgtt tgaggactag aacagactgt gcttctggat tctcagca
//
LOCUS XELBETA1 259 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A gene, exon A.
ACCESSION M35345
KEYWORDS thyroid hormone receptor beta A protein.
SEGMENT 1 of 8
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 259)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
A unique procedure for translation determination reveals that
alternate translation initiation occurs at exons G and H of the
beta A thyroid hormone receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 63 a 55 c 88 g 53 t
ORIGIN
1 aaattgggat ctatcctggg agagaatgga aatagacgac agcgctttat cctgactgaa
61 ctgaggcagg ggtaacgctg ggagtgactg gcatagcagg ggctgcgggg aggcacttca
121 gtccgtgcca agtccaacat tgtagctagt gacgagaatc gtactacagt gcgggctctc
181 actaagtgac gctcgaattc gggaagaacg acgcggcagc tgttgcatta tggtgcgtct
241 gtaggtcgga gagccggcg
//
LOCUS XELBETA2 97 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A gene, exon B.
ACCESSION M35346
KEYWORDS thyroid hormone receptor beta A protein.
SEGMENT 2 of 8
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 97)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
A unique procedure for translation determination reveals that
alternate translation initiation occurs at exons G and H of the
beta A thyroid hormone receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 14 a 32 c 24 g 27 t
ORIGIN
1 atttcaggac agcccagcgc cctggtgcac gatcagctgt agatctccct gtctgtgtcg
61 ctgctgccgc tgctacttca gttcctctga ctgtcag
//
LOCUS XELBETA3 44 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A gene, exon C.
ACCESSION M35347
KEYWORDS thyroid hormone receptor beta A protein.
SEGMENT 3 of 8
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 44)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
A unique procedure for translation determination reveals that
alternate translation initiation occurs at exons G and H of the
beta A thyroid hormone receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 17 a 5 c 13 g 9 t
ORIGIN
1 atgttgaaga ctgattgggg ttaagcaggc acatacaaga aaag
//
LOCUS XELBETA4 79 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A gene, exon D.
ACCESSION M35348
KEYWORDS thyroid hormone receptor beta A protein.
SEGMENT 4 of 8
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 79)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
A unique procedure for translation determination reveals that
alternate translation initiation occurs at exons G and H of the
beta A thyroid hormone receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 32 a 12 c 21 g 14 t
ORIGIN
1 acagaagccg tgaaccaatg cagaattaca ggaaaggacg aggattgaaa catctgtaca
61 tgagaaggaa tttctgaag
//
LOCUS XELBETA5 72 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A gene, exon E.
ACCESSION M35349
KEYWORDS thyroid hormone receptor beta A protein.
SEGMENT 5 of 8
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 72)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
A unique procedure for translation determination reveals that
alternate translation initiation occurs at exons G and H of the
beta A thyroid hormone receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 19 a 15 c 20 g 18 t
ORIGIN
1 ttaaagttga agtatttctg gtcaggtgat ctctgaggca gcgcacaggc cctcacaaaa
61 tggtggctca ag
//
LOCUS XELBETA6 46 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A gene, exon F.
ACCESSION M35350
KEYWORDS thyroid hormone receptor beta A protein.
SEGMENT 6 of 8
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 46)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
A unique procedure for translation determination reveals that
alternate translation initiation occurs at exons G and H of the
beta A thyroid hormone receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 17 a 11 c 8 g 10 t
ORIGIN
1 gttcctctca agcccaggaa caaaaaccgg aaatttttca aatgag
//
LOCUS XELBETA7 64 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A gene, exon G.
ACCESSION M35351
KEYWORDS thyroid hormone receptor beta A protein.
SEGMENT 7 of 8
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 64)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
A unique procedure for translation determination reveals that
alternate translation initiation occurs at exons G and H of the
beta A thyroid hormone receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
FEATURES from to/span description
pept 46 > 64 thyroid hormone receptor beta A protein, exon
G (first expressed exon) (alt.)
BASE COUNT 20 a 13 c 14 g 17 t
ORIGIN
1 gctatatgtg attcttagaa gaatgagcgg accttccaat ccataatgcc aagcagtatg
61 tcag
//
LOCUS XELBETA8 191 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A gene, exon H.
ACCESSION M35352
KEYWORDS thyroid hormone receptor beta A protein.
SEGMENT 8 of 8
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 191)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
A unique procedure for translation determination reveals that
alternate translation initiation occurs at exons G and H of the
beta A thyroid hormone receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
FEATURES from to/span description
pept 185 > 191 thyroid hormone receptor beta A protein, exon
H (first expressed exon) (alt.)
BASE COUNT 57 a 31 c 37 g 66 t
ORIGIN
1 gcagagtata tggtttagaa gaactaacac agaagttttt tgttggacac tactctccat
61 aatgacaatg agatttccat tgtaacatcc taattgtaac cagtaatcag agatgctgct
121 tggacagtgc ttacagcttt tttaaagaga ttttttattt ttgctttgca tcgaaccgtg
181 tactatggaa g
//
LOCUS XELBETAB 1150 bp ss-mRNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta A5 mRNA, complete cds.
ACCESSION M35360
KEYWORDS thyroid hormone receptor beta A5 protein.
SOURCE X.laevis, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1150)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
FEATURES from to/span description
pept 1 1122 thyroid hormone receptor beta A5 protein
BASE COUNT 354 a 232 c 285 g 279 t
ORIGIN
1 atgccaagca gtatgtcagg gtatataccc agctacttgg ataaagatga gctatgcgtg
61 gtgtgtggag acaaggctac agggtatcat tatagatgta tcacctgcga gggctgcaag
121 ggctttttta gaagaactat tcagaagaac ctccacccaa gctattcttg taaatatgaa
181 ggaaaatgtg ttatagacaa agtaacaaga aaccagtgcc aagaatgtcg cttcaaaaag
241 tgcatcgctg ttggaatggc aacagacttg gttttggatg acaacaaacg tttggcaaaa
301 agaaagctca tagaagaaaa cagagaaaaa agacggaaag atgagattca gaaatcactt
361 gttcagaaac ctgaacccac acaagaagaa tgggagttga tacaagttgt cactgaagca
421 catgtggcca ccaacgcaca gggaagccac tggaaacaga aaagaaaatt tttgccagag
481 gacattggac aagctcccat agttaatgcg cccgagggtg gaaaagtgga cttagaagcc
541 ttcagccagt ttacaaaaat aatcacccca gcaattacaa gagttgttga ttttgccaaa
601 aagctaccta tgttttgtga gctgccatgt gaagaccaga tcatccttct taaaggctgt
661 tgtatggaga tcatgtcgct ccgagcagca gtgcgttatg accccgaaag tgaaactcta
721 acgttaaatg gtgagatggc agtgacaagg gggcagctaa aaaatggagg acttggagtg
781 gtttcagatg ccatctttga cttaggggta tcgctttctt cattcagtct tgatgatacc
841 gaagtcgcct tgttgcaggc tgtgctgctt atgtcatcag atcggcctgg tcttgctagc
901 gtggagagaa tagaaaagtg ccaggaaggt ttcctcttgg cttttgaaca ctacattaat
961 tacaggaaac ataacattgc acacttttgg ccaaaactgc tgatgaaagt caccgacctc
1021 cgcatgattg gagcgtgcca cgccagccgg ttcctgcaca tgaaggtgga gtgccccact
1081 gaactgtttc ccccactgtt cttggaagtg tttgaggact agaacagact gtgcttctgg
1141 attctcagca
//
LOCUS XELBETAC 1132 bp ss-mRNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta B1 mRNA, complete cds.
ACCESSION M35361
KEYWORDS thyroid hormone receptor beta B1 protein.
SOURCE X.laevis, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1132)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
FEATURES from to/span description
pept 1 1122 thyroid hormone receptor beta B1 protein
BASE COUNT 352 a 233 c 284 g 263 t
ORIGIN
1 atgccaagca gtatgtcagg gtacataccc agctacttgg ataaagatga gctatgtgtg
61 gtatgtggag acaaagctac agggtatcac tatagatgta tcacctgcga gggctgcaag
121 ggctttttta gaagaactat tcagaagaac ctccacccaa gctattcctg taaatatgaa
181 ggaaaatgtg ttatagacaa agtaacaagg aaccagtgcc aagaatgtcg cttcaaaaag
241 tgcaaaactg ttggaatggc aacagacttg gttttggatg acagcaaacg tttggcgaaa
301 agaaagctca tagaagaaaa cagagaaaaa agacggaaag acgagataca gaaatcaatt
361 gttcagagac cggaaccaac acaagaagaa tgggagttga tacaagttgt cactgaagca
421 catgtggcca ccaacgcaca gggaagccac tggaaacaga aaagaaaatt tttgccagag
481 gacattggac aagctcccat agttaatgcg cctgaaggtg gaaaagtgga cttagaagcc
541 ttcagccagt ttacaaaaat aatcacccca gcaattacaa gagtggttga ttttgccaaa
601 aagctaccta tgttttgtga gctgccatgt gaagaccaga tcatccttct taaaggctgt
661 tgtatggaga tcatgtccct ccgagcagcc gtgcggtatg accccgaaag tgaaactcta
721 acgctgaatg gggagatggc agtgacaagg gggcagctaa aaaatggagg actcggtgtg
781 gtctcagatg ccatctttga cttgggggtg tcgctttctt cattcagtct tgatgatacc
841 gaagtcgcct tgttgcaggc tgtgctgctt atgtcatcag atcgtcctgg tctctctagt
901 gtggagagaa tagaaaagtg ccaggaaggt ttcctcttgg cttttgaaca ctacattaat
961 tacaggaaac acaacattgc acacttttgg ccaaaactgc tgatgaaagt caccgacctc
1021 cgcatgatcg gagcatgcca cgccagccgg ttcctgcaca tgaaggtgga gtgccccact
1081 gaactgtttc ccccactgtt cttggaagtg tttgaggact agaacagact gt
//
LOCUS XELBETAD 1255 bp ss-mRNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta B2 mRNA, complete cds.
ACCESSION M35362
KEYWORDS thyroid hormone receptor beta B2 protein.
SOURCE X.laevis, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1255)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
FEATURES from to/span description
pept 1 1245 thyroid hormone receptor beta B2 protein
BASE COUNT 383 a 258 c 323 g 291 t
ORIGIN
1 atgccaagca gtatgtcagt tcggcttttc actgcatctg ccgcacaaag aaagaagata
61 caggaagggg attgctgtgt ggtgctcgct ggaaaaaccc agggccggtt tatattgata
121 ggagcagtgg cccgggtatc agggtacata cccagctact tggataaaga tgagctatgt
181 gtggtatgtg gagacaaagc tacagggtat cactatagat gtatcacctg cgagggctgc
241 aagggctttt ttagaagaac tattcagaag aacctccacc caagctattc ctgtaaatat
301 gaaggaaaat gtgttataga caaagtaaca aggaaccagt gccaagaatg tcgcttcaaa
361 aagtgcaaaa ctgttggaat ggcaacagac ttggttttgg atgacagcaa acgtttggcg
421 aaaagaaagc tcatagaaga aaacagagaa aaaagacgga aagacgagat acagaaatca
481 attgttcaga gaccggaacc aacacaagaa gaatgggagt tgatacaagt tgtcactgaa
541 gcacatgtgg ccaccaacgc acagggaagc cactggaaac agaaaagaaa atttttgcca
601 gaggacattg gacaagctcc catagttaat gcgcctgaag gtggaaaagt ggacttagaa
661 gccttcagcc agtttacaaa aataatcacc ccagcaatta caagagtggt tgattttgcc
721 aaaaagctac ctatgttttg tgagctgcca tgtgaagacc agatcatcct tcttaaaggc
781 tgttgtatgg agatcatgtc cctccgagca gccgtgcggt atgaccccga aagtgaaact
841 ctaacgctga atggggagat ggcagtgaca agggggcagc taaaaaatgg aggactcggt
901 gtggtctcag atgccatctt tgacttgggg gtgtcgcttt cttcattcag tcttgatgat
961 accgaagtcg ccttgttgca ggctgtgctg cttatgtcat cagatcgtcc tggtctctct
1021 agtgtggaga gaatagaaaa gtgccaggaa ggtttcctct tggcttttga acactacatt
1081 aattacagga aacacaacat tgcacacttt tggccaaaac tgctgatgaa agtcaccgac
1141 ctccgcatga tcggagcatg ccacgccagc cggttcctgc acatgaaggt ggagtgcccc
1201 actgaactgt ttcccccact gttcttggaa gtgtttgagg actagaacag actgt
//
LOCUS XELBETB1 226 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta B gene, exon A.
ACCESSION M35353
KEYWORDS thyroid hormone receptor beta B protein.
SEGMENT 1 of 6
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 226)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Exons D, E and H do not exist for the beta B thyroid hormone
receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 57 a 50 c 73 g 46 t
ORIGIN
1 agcttcatta tcctgactga acacaagcag ggataacgct gggagtgact ggcatagcag
61 gggctgcagg gaggcacttc ataatccgtg ccaaatccaa cgttgtagcg agtgacgaga
121 atcgtagagt gcgcggaaca gtctcacgga cgctggggtt tgggaaggac gacgcggcag
181 ctgttgcact acgttacgtc taactctata ggttggagag ctgacg
//
LOCUS XELBETB2 65 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta B gene, exon B.
ACCESSION M35354
KEYWORDS thyroid hormone receptor beta B protein.
SEGMENT 2 of 6
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 65)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Exons D, E and H do not exist for the beta B thyroid hormone
receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 7 a 19 c 16 g 23 t
ORIGIN
1 agctgtagat ctcctgtctg tgttgctgcc actgctgttg ctgctccagt tcctctgact
61 gtcag
//
LOCUS XELBETB3 50 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta B gene, exon C.
ACCESSION M35355
KEYWORDS thyroid hormone receptor beta B protein.
SEGMENT 3 of 6
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Exons D, E and H do not exist for the beta B thyroid hormone
receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 19 a 5 c 15 g 11 t
ORIGIN
1 atgttgaaga gtgattgggg ttaagcaggc acatactgta caagaaaaag
//
LOCUS XELBETB4 67 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta B gene, exon F.
ACCESSION M35356
KEYWORDS thyroid hormone receptor beta B protein.
SEGMENT 4 of 6
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 67)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Exons D, E and H do not exist for the beta B thyroid hormone
receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 27 a 14 c 12 g 14 t
ORIGIN
1 ctacaggttt ccctcaagca ccaagaacga aaaccagaaa gaatttgcag agaatttttc
61 aaatgag
//
LOCUS XELBETB5 64 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta B gene, exon G.
ACCESSION M35357
KEYWORDS thyroid hormone receptor beta B protein.
SEGMENT 5 of 6
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 64)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Exons D, E and H do not exist for the beta B thyroid hormone
receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
FEATURES from to/span description
pept 46 > 64 thyroid hormone receptor beta B gene
BASE COUNT 21 a 12 c 14 g 17 t
ORIGIN
1 gttatatgtg atgcttagaa gaatgagcag accttccaat ccataatgcc aagcagtatg
61 tcag
//
LOCUS XELBETB6 123 bp ds-DNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor beta B gene, exon H.
ACCESSION M35358
KEYWORDS thyroid hormone receptor beta B protein.
SEGMENT 6 of 6
SOURCE X.laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 123)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Yaoita, 15-JUN-1990.
Exons D, E and H do not exist for the beta B thyroid hormone
receptor protein.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
BASE COUNT 31 a 25 c 39 g 28 t
ORIGIN
1 ttcggctttt cactgcatct gccgcacaaa gaaagaagat acaggaaggg gattgctgtg
61 tggtgctcgc tggaaaaacc cagggccggt ttatattgat aggagcagtg gcccgggtat
121 cag
//
LOCUS XELTHYA 1406 bp ss-mRNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor alpha A mRNA, complete cds.
ACCESSION M35343
KEYWORDS thyroid hormone receptor protein.
SOURCE X.laevis, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1406)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by Y.Yaoita, 15-JUN-1990.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
FEATURES from to/span description
pept 122 1378 thyroid hormone receptor protein
BASE COUNT 346 a 350 c 406 g 304 t
ORIGIN
1 gtcgacctgt gagaggcgtc cgcccgcctc catgtgaacg ctacgcccca tgatcctcgg
61 ggagctgggg gcggagcccg ccttggtctc ttcggattgg ttctggatgg aattacgttg
121 aatggaccag aatctcagcg ggctggactg cttgtcagag ccagatgaaa aaaggtggcc
181 ggatgggaag cgaaaaagaa agaacagcca atgtatggga aaaagcggca tgtccggtga
241 cagcttggtg tctctgccct ctgcagggta catccccagc tatctggaca aagatgagcc
301 atgcgtggtg tgcagtgata aggccacggg gtaccactac cgctgtatca cttgcgaggg
361 gtgtaagggt ttctttcgcc gcaccatcca gaagaacctg cacccctcct actcgtgcaa
421 gtacgatggc tgctgcatta tcgacaagat cacccgaaat cagtgccagc tctgccgctt
481 caagaaatgc attgccgttg gcatggcaat ggatcttgtc ctggatgatg gcaagcgggt
541 agccaagcga aaactgattg aagagaatcg acagcggcgg cggaaggagg agatgatcaa
601 gactctgcaa cagcgtcccg agccaagcag cgaggagtgg gagttgattc gcattgtaac
661 agaagctcac aggagtacca atgctcaggg cagccactgg aaacagcgta ggaagtttct
721 gccggaagat atcgggcagt ctcccatggc ttccatgccg gatggggata aagttgacct
781 ggaagctttc agtgagttca ccaagataat caccccggca attaccagag tggtggactt
841 tgccaagaag ctgcccatgt tctctgagct gacttgtgaa gaccagatca tcctgttgaa
901 aggatgttgt atggagatca tgtctctccg tgctgctgta cgctacgatc cagacagcga
961 gaccctaacg ctgagcggag agatggctgt gaaacgggag cagcttaaga acggaggtct
1021 gggtgttgtc tctgatgcca tctttgacct cgggaggtcg cttgctgcgt ttaaccttga
1081 cgatacggaa gtggcgctgc tgcaggctgt tttgctaatg tcatcagacc gaactggttt
1141 aatctgcacg gacaagatag agaaatgtca agagacctac cttctcgcct ttgaacacta
1201 catcaaccat cgcaaacaca acattcccca cttctggccc aaactcctaa tgaaggtgac
1261 ggacctgcgc atgatagggg catgccatgc cagccgcttt ctgcacatga aggtcgagtg
1321 ccccaccgag ctctttccac cgctcttcct tgaggtcttt gaggaccagg aagtttgagg
1381 gacagtgcat gtcggtagag aggaaa
//
LOCUS XELTHYB 1406 bp ss-mRNA VRT 28-JUL-1990
DEFINITION X.laevis thyroid hormone receptor alpha B mRNA, complete cds.
ACCESSION M35344
KEYWORDS thyroid hormone receptor protein.
SOURCE X.laevis, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1406)
AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D.
TITLE The Xenopus laevis alpha and beta thyroid hormone receptors
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by Y.Yaoita, 15-JUN-1990.
Author address:Yoshio Yaoita
Carnegie Inst of Washington
115 West University Parkway
Baltimore, MD 21210
FEATURES from to/span description
pept 122 1378 thyroid hormone receptor protein
BASE COUNT 350 a 347 c 402 g 307 t
ORIGIN
1 gtcgacctgt gagaggcgcc cgcccgcctc catgtgaaag ccacgcccca tgagccttgg
61 gcagctgggg gcggagccca ccttggtctc ttcggattgg ttctggatgg aattacgttg
121 aatggaccag aatctcagcg ggctggactg cttgtcagag ccagatgaaa aaaggtggcc
181 ggatgggaag cgaaaaagaa agaacagcca atgtatggga aaaagcggca tgtccggtga
241 cagcttggtg tctctgcccc ctgcagggta catccccagc tatctggaca aagatgagcc
301 atgcgtggtg tgcagtgata aggccacggg gtaccactac cgctgtatca cttgcgaggg
361 gtgcaagggt ttcttccgcc gcaccatcca gaagaacctg cacccctcct attcttgcaa
421 gtacgatggc tgctgcatta tcgacaaaat cacccgtaat cagtgccagc tctgccgctt
481 caagaaatgc attgccgttg gcatggcaat ggatcttgac ctggatgata gcaagcgggt
541 agccaagcga aaactgattg aagaggatcg agtgcggcgg cggaaggagg agatgatcaa
601 gactctgcaa cagtgtcccg agccaagcag cgaggagtgg gagttgattc gcattgtaac
661 agaagctcac aggagtacca atgcccaggg cagccattgg aaacagcgta ggaagtttct
721 gccagaagac atcggacagt ctcctatggc ttccatgcca gatggggata aagttgacct
781 ggaagctttc agtgagttca ccaaaataat caccccggca attaccagag tggtggactt
841 tgcgaagaag ctgcccatgt tctctgagct gacttgtgaa gaccagatca tcctgttgaa
901 aggatgttgt atggagatca tgtctcttcg tgctgctgtg cgctacgatc cagacagcga
961 gaccctaacg ctgagcggcg agatggcggt gaaacgggag cagcttaaga acggaggtct
1021 gggtgttgtc tctgatgcca tctttgacct tgggaggtcg cttgctgcgt tcaaccttga
1081 tgatacggaa gtggcactgt tgcaggctgt tttgctaatg tcatcagacc gtactggttt
1141 aatctgcaca gacaagatag agaaatgtca agagacctac cttctcgcct ttgaacacta
1201 catcaaccat cgcaaacaca acattcccca cttctggccc aagctcctaa tgaaggtgac
1261 ggacctgcgc atgatagggg catgccatgc cagctgcttt ctgcacatga aggtcgagtg
1321 ccccaccgag ctctttccac cgctcttcct tgaggtcttt gaggaccagg aagtttgagg
1381 gacagtgcat gtcggtagag aggaaa
//
LOCUS RATSIMPA1 205 bp ds-DNA ROD 28-JUL-1990
DEFINITION Rat simple sequence DNA, clone 5.
ACCESSION M36626
KEYWORDS simple sequence DNA.
SEGMENT 1 of 2
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 205)
AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L.
TITLE Simple sequences of the rat genome detected by hybridization with
adenovirus DNA
JOURNAL Dokl. Biochem. 276, 189-193 (1984)
STANDARD simple staff_entry
BASE COUNT 52 a 53 c 79 g 21 t
ORIGIN
1 cagctctgtc ctgttgtcgc ccttgggcag agttgtgcct cctgctcttc tttccctaag
61 gaggggcagc agcagcagca gcaggaggag caggaggagc agcagcagga gcagcaggag
121 cagcagcagc agcaggagga gcaggagcag cagcaggagc agcagcagca ggagcagcag
181 cagcagcagg agcaggagga gcagc
//
LOCUS RATSIMPA2 146 bp ds-DNA ROD 28-JUL-1990
DEFINITION Rat simple sequence DNA, clone 5.
ACCESSION M32514
KEYWORDS simple sequence DNA.
SEGMENT 2 of 2
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 146)
AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L.
TITLE Simple sequences of the rat genome detected by hybridization with
adenovirus DNA
JOURNAL Dokl. Biochem. 276, 189-193 (1984)
STANDARD simple staff_entry
BASE COUNT 46 a 36 c 61 g 3 t
ORIGIN About 500 base pairs after segment 1.
1 agcagcagca gcaggagcag caggaggagc agcaggagca ggagcagcag gagcagcagc
61 aggagcagga gcaggagcag caggagcagc aggagcagca gcaggagcag cagcagcagc
121 agcagcagca gcggtgcagc tccatg
//
LOCUS RATSIMPB 380 bp ds-DNA ROD 28-JUL-1990
DEFINITION Rat simple sequence DNA, clone 8.
ACCESSION M32515
KEYWORDS simple sequence DNA.
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 380)
AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L.
TITLE Simple sequences of the rat genome detected by hybridization with
adenovirus DNA
JOURNAL Dokl. Biochem. 276, 189-193 (1984)
STANDARD simple staff_entry
BASE COUNT 131 a 96 c 138 g 15 t
ORIGIN
1 tgatcattgc tgcaatccca cagcaggagc agcagcagga gcagcagcag cagcaggagc
61 aggagcagcc acaggaggag cagcaacaag aggcagcagc agcagcagga gcagcagcag
121 caggaggagc agcaacagga gcagcagcaa caggagcagc agcaggaaca gaacaggagc
181 agcagcagca ggaacaagga gtagcagcag cagcagcagg aacaggagaa gcagcagcag
241 cagcagcagc aggagcagga gcagcaggag cagcagcagc agcagtagga gcagcagcag
301 cagcaggagc agcagcagca gcagcaggag gagcagcagc agcagcagca cagcagcagg
361 gtacttggtg atcccttgac
//
LOCUS RATSIMPC 542 bp ds-DNA ROD 28-JUL-1990
DEFINITION Rat simple sequence DNA, clone 16.
ACCESSION M36627
KEYWORDS simple sequence DNA.
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 542)
AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L.
TITLE Simple sequences of the rat genome detected by hybridization with
adenovirus DNA
JOURNAL Dokl. Biochem. 276, 189-193 (1984)
STANDARD simple staff_entry
BASE COUNT 76 a 255 c 142 g 69 t
ORIGIN
1 ggatccaccg cctgagtagc cgccgccaca gctagagccg cctccacctc caccgccgtc
61 ggagtagccg cctccgcagc tggagccacc gccgccgccg ccgccggagt acttgccccc
121 ttcggaccgc cgccgcgacc accgggctgc cgctccagag gagcctccgc agtaggagcc
181 gccgcctcct gattcgtctc ctatagttgg agcctccgcc tccgtcggag tcgccgccgc
241 cgccgtagcc ggagccgccg ccgccgccgc ccgcctccgg agtaccttga cgccgccgcc
301 gccgccgccg gagtacttcg cccctccgga ccgccgccgc gaccagagaa ctgacgcccc
361 ctccggagcc gcctccgccg ccgcagctgg aaccacctcc ataggaacca ccgcctccgc
421 ctccgcctcc gcagccagag cctcctccag atgagccacc tccgcagctg ggagcctcca
481 ccgctaccac caccgctata gtaaccgcca ccgccgcctc ctcctccacc agaggtcttt
541 tc
//
LOCUS RATPSTIAA 2382 bp ss-mRNA ROD 28-JUL-1990
DEFINITION Rat pancreatic secretory trypsin inhibitor-like protein (PSTI)
mRNA, complete cds.
ACCESSION M35299
KEYWORDS monitor protein;
pancreatic secretory trypsin inhibitor-like protein.
SOURCE Rat (strain Wistar) adult pancreas, cDNA to mRNA, clone MP2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 427; 594 to 1693; 1954 to 2338)
AUTHORS Fukuoka,S.-I. and Scheele,G.
TITLE Rapid and selective cloning of monitor peptide, a novel
CCK-releasing peptide, using minimal amino acid sequence and the
polymerase chain reaction (PCR)
JOURNAL Pancreas 4, 1-7 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2382)
AUTHORS Fukuoka,S.-I. and Scheele,G.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.-I.Fukuoka, 19-JUN-1990.
FEATURES from to/span description
pept 10 249 pancreatic secretory trypsin inhibitor-like
protein precursor
sigp 10 63 pancreatic secretory trypsin inhibitor-like
protein signal peptide
matp 64 246 pancreatic secretory trypsin inhibitor-like
protein
signal 1665 1670 poly-A signal
signal 2064 2069 poly-A signal
signal 2303 2308 poly-A signal
BASE COUNT 770 a 423 c 517 g 672 t
ORIGIN
1 tctacaacca tgaaggtagc aattatcttt cttctcagtg ctttggccct gctcagttta
61 gcaggtaacc ctccagctga ggtgaatgga aaaacgccta attgccctaa gcaaattatg
121 ggatgtccca ggatttatga ccctgtgtgt gggactaacg gaattactta ccccagtgaa
181 tgcagtctgt gctttgaaaa caggaaattc ggaacatcta tccacattca gaggagaggg
241 acttgctgaa tgtcctgatt ttgaaatctt ttagggctac cataatgttt agcaagaagg
301 tttgctgaat aaatgcatct gaacatattt tgttcttccc aaagcttttg ctcaaaggca
361 tatatgagta tattgagaat agggatctga gaagaaaacc agagtagagc aagctttacc
421 acttagttct tcatgctcat acttcaaaaa ttgcagatga tgacaacaca tagttgagca
481 tgaacatgtg taatgaatag agtttgggtt aggatgaaga aggtagccta tctgtgcaca
541 agaaagaagt agactgactt ggatctttct taggggagtt taccaaagga aagactgcct
601 tgtatatcta cagtgtttca cttgtgagac accacaactc tgcagattta ctcttgttct
661 gtgaggaaac ttagaagagt caaattgttt gactaatagt ccaacataca tgatgccagg
721 gtgttctttt agatcaagct gacctcttcc ttcatccata tgagcactcc ttcttttaac
781 cacaatcttc tcttgtggat catgccttga ctttcttcaa tgggaatcct agataatatt
841 ccctactgta agatcttgca tgtctatatt cagtgataga atatagacgt gatataatag
901 gatataacca aatgaattag aaacaaggaa atattctcaa aagggaaagt atcaacaact
961 acttttaaaa aaggaatcat tttaagatcc tgagtttcta aagaaaatct tagtctaaga
1021 tggaaagaga gtaaagagct aacacaggtg agtctgggca aggaacccta gtacagtggg
1081 gttgggtcag cacctttgcc agaaataacc aagctattca gaaatacact aggaaaggag
1141 agttgcctag taacccactt ctggtcatat tcagtattca tgccttgaac tgaactcttg
1201 ctcctagagg atgctataac taacaaaccg agcaacttaa acagcctgac agctctcacc
1261 aaataccttg ctatctcaag ttatggatgc aagatggctc ccagtgtcta tctgtgattc
1321 tagaggacac ttgaagggca ccaacactta acaaattctg tgggggtaaa tttattttaa
1381 tcactggatg ctggaagaca cacacagaga cacaaacaca caaagagaga cagagagaga
1441 gaaagagaga gagagaggta gagagagaga gagagagaga gagggagaga gagggagaga
1501 gagagtgttt tgggttttgt tgttgttgtt gttgttgatt tggaattata tcaagatata
1561 agataatctc aaatgtatct ttagtagttc tgctccctgg acccatgaga agacaggaat
1621 gaggattctg tgcatgtggt acttacattt caaaaggagt atctaataaa ctggaaactg
1681 cttaaaagaa tgagactatc agcactgata agaatataaa gcttcaagct atgaagagtg
1741 attcaaagaa ggaaaagaat tccctcagaa ctgggaggac cttttaaaaa attctgagtc
1801 cccgtttcta aagtttcacc ttcctaactt catgtatttt ttaatagctc aaagagtcca
1861 attactgctg ctcatatact catgagtgtg acaccatgca ctgttactgc caatatatga
1921 aaggccatac ccctaaagaa aattgactta agaactcctt gtttagggtt gggtacttct
1981 gtgaccctcc cacattcatg ctggaatgtt gactggcttc atttttataa ggcaaaagat
2041 cttcccactc tcttctgaga gagaataaat cagttttgct caatggagtg attctgagta
2101 tactaatcac gatcccagga caggccccat tctcacaagc agttagctaa cacaaataga
2161 actccatatt ttatagcagt ttttatcttt tgttcttggt tttagttctt attttcaaga
2221 cagagaaaaa cacatgaagt tggaagggta gaagtggggg ggggcgtggg tctgggagga
2281 gttgggggat agagaaaaat ataataaaaa tatatgaaat tctcgagaat gaataaatgg
2341 aattcgatat caagcttatc gataccgtcg acctcgaggg gg
//
LOCUS RATPSTIBA 300 bp ss-mRNA ROD 28-JUL-1990
DEFINITION Rat pancreatic secretory trypsin inhibitor-like protein (PSTI)
mRNA, 3' end.
ACCESSION M35300
KEYWORDS monitor protein;
pancreatic secretory trypsin inhibitor-like protein.
SOURCE Rat (strain Wistar) adult pancreas, cDNA to mRNA, clone MP3.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Fukuoka,S.-I. and Scheele,G.
TITLE Rapid and selective cloning of monitor peptide, a novel
CCK-releasing peptide, using minimal amino acid sequence and the
polymerase chain reaction (PCR)
JOURNAL Pancreas 4, 1-7 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.-I.Fukuoka, 19-JUN-1990.
FEATURES from to/span description
pept < 1 116 pancreatic secretory trypsin inhibitor-like
protein precursor (AA at 3)
matp < 1 113 pancreatic secretory trypsin inhibitor-like
protein
signal 285 290 poly-A signal
BASE COUNT 98 a 57 c 65 g 80 t
ORIGIN
1 gtcccaggat ttatgaccct gtgtgtggga ctaacggaat tacttacccc agtgaatgca
61 gtctgtgctt tgaaaacagg aaattcggaa catctatcca cattcagagg agatagagcg
121 tctgcaaaaa cagatcgaac ggcataagaa gaagattaat acctaaagaa tagtgaggca
181 ttgagtgcac acagtcagtc tctcacatag tggcagtatc attcccactc ttatagagat
241 tgttttgaat gattgatgtt tgaccatgtg tgctactaac agataataaa ttatcaccag
//
LOCUS SYNTRPA 3763 bp ds-DNA circular SYN 28-JUL-1990
DEFINITION Cloning vector pATH3 propagated in E.coli.
ACCESSION M33622
KEYWORDS trpE' protein.
SOURCE Synthetic DNA, clone pATH3.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 3763)
AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A.
TITLE High-expression vectors with multiple cloning sites for
construction of trpE-fusion genes: pATH vectors
JOURNAL Meth. Enzymol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Hill, 06-APR-1990.
Nucleotides 1-147 are provided as a personal communication from
R.P.Gunsalus at the Dept. of Microbiology at UCLA.
Construction of pATH3: 1. PvuII-HindIII fragment from the 5' end
of the trp operon (through nt 1999 of ECOTGP, which is in trpD cds)
was ligated to the HindIII-PvuII fragment of pBR322 containing the
bla (= Amp-resistance) gene and origin of replication, but not the
rop gene, which encodes a negative regulator of ColE1 replication.
In addition, the EcoRI site in the pBR322 backbone was eliminated.
This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141
(1984))
2. The BglII-HindIII fragment (nt 1392 of trpE to the end of the
trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI
fragment and an EcoRI-HindIII fragment, both from the MCS of
M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more
details).
3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and
the remaining plasmid religated. This produced plasmid pATH2
(GenBank acc M33624)
4. An EcoRI linker was inserted at the remaining SmaI site of
pATH2 replacing the SmaI site and changing the reading frames of
the other sites in the MCS. This plasmid is pATH3.
FEATURES from to/span description
pept 423 > 1391 trpE' fusion protein
BASE COUNT 926 a 942 c 946 g 949 t
ORIGIN
1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg
61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt
121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg
181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc
241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt
301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag
361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa
421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca
481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat
541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc
601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc
661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact
721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct
781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag
841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag
901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa
961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc
1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga
1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc
1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag
1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg
1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg
1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc
1381 agattgagat ccccccgaat tcggggggat cctctagagt cgacctgcag cccaagctta
1441 tcgatgataa gctgtcaaac atgagaatta attcttgaag acgaaagggc ctcgtgatac
1501 gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt
1561 ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt
1621 atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta
1681 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg
1741 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac
1801 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg
1861 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc
1921 gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg
1981 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat
2041 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg
2101 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg
2161 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc
2221 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt
2281 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct
2341 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc
2401 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca
2461 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct
2521 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt
2581 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga
2641 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca
2701 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac
2761 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg
2821 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag
2881 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac
2941 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt
3001 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg
3061 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc
3121 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc
3181 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc
3241 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa
3301 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt
3361 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg
3421 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag
3481 agcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt
3541 gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc
3601 gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg
3661 acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg
3721 catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg agg
//
LOCUS SYNTRPB 3771 bp ds-DNA circular SYN 28-JUL-1990
DEFINITION Cloning vector pATH10, propagated in E.coli.
ACCESSION M33623
KEYWORDS beta-lactamase; trpE' protein.
SOURCE Synthetic DNA, clone pATH10.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 3771)
AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A.
TITLE High-expression vectors with multiple cloning sites for
construction of trpE-fusion genes path vectors
JOURNAL Meth. Enzymol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Hill 06-APR-1990.
Nucleotides 1-147 are provided as a personal communication from
R.P.Gunsalus at the Dept. of Microbiology at UCLA.
Construction of pATH10: 1. PvuII-HindIII fragment from the 5' end
of the trp operon (through nt 1999 of ECOTGP, which is in the trpD
cds) was ligated to the HindIII-PvuII fragment of pBR322 containing
the bla (= Amp-resistance) gene and origin of replication, but not
the rop gene, which encodes a negative regulator of ColE1
replication. In addition, the EcoRI site in the pBR322 backbone
was eliminated. This plasmid is pKRS101. (Spindler et al. M.
Virol. 49, 132-141 (1984))
2. The BglII-HindIII fragment (nt 423 of trpE to the end of the
trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI
fragment and an EcoRI-HindIII fragment, both from the MCS of
M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more
details).
3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and
the remaining plasmid religated producing plasmid pATH2 (GenBank
acc M33624).
4. An interim vector was constructed by inserting an EcoRI linker
at the remaining SmaI site of pATH2.
5. The EcoRI-HindIII fragment of MCS in this interim vector was
replaced with the EcoRI-HindIII fragment containing the MCS of
M13mp12.
6. Tha AvaII-AvaII fragment that spanned the PstI site in the bla
gene of this interim vector was replaced with the corresponding
AvaII fragment from pUC8, eliminating this PstI site, making the
PstI site in the MCS unique. This is plasmid pATH10.
FEATURES from to/span description
pept 423 1472 trpE' protein
pept 1688 2548 beta-lactamase
BASE COUNT 927 a 945 c 948 g 951 t
ORIGIN
1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg
61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt
121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg
181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc
241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt
301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag
361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa
421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca
481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat
541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc
601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc
661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact
721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct
781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag
841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag
901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa
961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc
1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga
1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc
1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag
1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg
1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg
1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc
1381 agattgagat cccccggaat tcgagctcgc ccggggatcc tctagagtcg acctgcagcc
1441 caagcttatc gatgataagc tgtcaaacat gagaattaat tcttgaagac gaaagggcct
1501 cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg
1561 tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc
1621 aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag
1681 gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg
1741 ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt
1801 gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt
1861 tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt
1921 attatcccgt gttgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa
1981 tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag
2041 agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac
2101 aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac
2161 tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac
2221 cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac
2281 tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact
2341 tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg
2401 tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt
2461 tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat
2521 aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta
2581 gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa
2641 tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga
2701 aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac
2761 aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt
2821 tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc
2881 gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat
2941 cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag
3001 acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc
3061 cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc attgagaaag
3121 cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac
3181 aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg
3241 gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct
3301 atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc
3361 tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga
3421 gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga
3481 agcggaagag cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg
3541 catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agtatacact
3601 ccgctatcgc tacgtgactg ggtcatggct gcgccccgac acccgccaac acccgctgac
3661 gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc
3721 gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag g
//
LOCUS SYNTRPC 3753 bp ds-DNA SYN 28-JUL-1990
DEFINITION Cloning vector pATH2, propagated in E.coli.
ACCESSION M33624
KEYWORDS beta-lactamase; trpE' protein.
SOURCE Synthetic DNA, clone pATH2.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 3753)
AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A.
TITLE High-expression vectors with multiple cloning sites for
construction of trpe-fusion genes path vectors
JOURNAL Meth. Enzymol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Hill 06-APR-1990.
Nucleotides 1-147 are provided as a personal communication from
R.P.Gunsalus at the Dept. of Microbiology at UCLA.
Construction of pATH2: 1. PvuII-HindIII fragment from the 5' end
of the trp operon (through nt 1999 of ECOTGP, which is in trpD cds)
was ligated to the HindIII-PvuII fragment of pBR322 containing the
bla (= Amp-resistance) gene and origin of replication, but not the
rop gene, which encodes a negative regulator of ColE1 replication.
In addition, the EcoRI site in the pBR322 backbone was eliminated.
This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141
(1984))
2. The BglII-HindIII fragment (nt 1392 of trpE to the end of the
trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI
fragment and an EcoRI-HindIII fragment, both from the MCS of
M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more
details).
3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and
the remaining plasmid religated. This produced plasmid pATH2
FEATURES from to/span description
pept 423 1454 trpE' protein
pept 1670 2530 beta-lactamase
BASE COUNT 924 a 939 c 943 g 947 t
ORIGIN
1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg
61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt
121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg
181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc
241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt
301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag
361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa
421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca
481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat
541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc
601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc
661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact
721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct
781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag
841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag
901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa
961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc
1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga
1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc
1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag
1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg
1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg
1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc
1381 agattgagat ccccggggat cctctagagt cgacctgcag cccaagctta tcgatgataa
1441 gctgtcaaac atgagaatta attcttgaag acgaaagggc ctcgtgatac gcctattttt
1501 ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa
1561 tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat
1621 gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca
1681 acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca
1741 cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta
1801 catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt
1861 tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtgttgacgc
1921 cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc
1981 accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc
2041 cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa
2101 ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga
2161 accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgcagcaat
2221 ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca
2281 attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc
2341 ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat
2401 tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag
2461 tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa
2521 gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca
2581 tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc
2641 ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc
2701 ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc
2761 agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt
2821 cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt
2881 caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc
2941 tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa
3001 ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac
3061 ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc ttcccgaagg
3121 gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga
3181 gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact
3241 tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa
3301 cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc
3361 gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg
3421 ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcctgat
3481 gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag
3541 tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc gctacgtgac
3601 tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg acgggcttgt
3661 ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag
3721 aggttttcac cgtcatcacc gaaacgcgcg agg
//
LOCUS SYNTRPD 3772 bp ds-DNA SYN 28-JUL-1990
DEFINITION Cloning vector pATH11, propagated in E.coli.
ACCESSION M33625
KEYWORDS beta-lactamase; trpE' protein.
SOURCE Synthetic DNA, clone pATH11.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 3772)
AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A.
TITLE High-expression vectors with multiple cloning sites for
construction of trpe-fusion genes path vectors
JOURNAL Meth. Enzymol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.E.Hill 06-APR-1990.
Nucleotides 1-144 are provided as a personal communication from
R.P.Gunsalus at the Dept. of Microbiology at UCLA.
Construction of pATH11: 1. PvuII-HindIII fragment from the 5' end
of the trp operon (through nt 1999 of ECOTGP, which is in the trpD
cds) was ligated to the HindIII-PvuII fragment of pBR322 containing
the bla (= Amp-resistance) gene and origin of replication, but not
the rop gene, which encodes a negative regulator of ColE1
replication. In addition, the EcoRI site in the pBR322 backbone
was eliminated. This plasmid is pKRS101. (Spindler et al. M.
Virol. 49, 132-141 (1984))
2. The BglII-HindIII fragment (nt 423 of trpE to the end of the
trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI
fragment and an EcoRI-HindIII fragment, both from the MCS of
M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more
details).
3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and
the remaining plasmid religated. This produced plasmid pATH2
(GenBank acc M33624).
4. An interim vector was constructed by inserting an EcoRI linker
at the remaining SmaI site of pATH2.
5. The EcoRI-HindIII fragment of MCS in this interim vector was
replaced with the EcoRI-HindIII fragment containing the MCS of
M13mp12.
6. Tha AvaII-AvaII fragment that spanned the PstI site in the bla
gene of this interim vector was replaced with the corresponding
AvaII fragment from pUC8, eliminating this PstI site, making the
PstI site in the MCS unique. This is plasmid pATH11.
FEATURES from to/span description
pept 423 1487 trpE' protein
pept 1689 2549 beta-lactamase
BASE COUNT 927 a 946 c 948 g 951 t
ORIGIN
1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg
61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt
121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg
181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc
241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt
301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag
361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa
421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca
481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat
541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc
601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc
661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact
721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct
781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag
841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag
901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa
961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc
1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga
1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc
1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag
1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg
1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg
1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc
1381 agattgagat ccccccggaa ttcgagctcg cccggggatc ctctagagtc gacctgcagc
1441 ccaagcttat cgatgataag ctgtcaaaca tgagaattaa ttcttgaaga cgaaagggcc
1501 tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag
1561 gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt
1621 caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa
1681 ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt
1741 gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt
1801 tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt
1861 ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg
1921 tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga
1981 atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa
2041 gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga
2101 caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa
2161 ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca
2221 ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta
2281 ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac
2341 ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc
2401 gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag
2461 ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga
2521 taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt
2581 agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata
2641 atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag
2701 aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa
2761 caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt
2821 ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc
2881 cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa
2941 tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa
3001 gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc
3061 ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag cattgagaaa
3121 gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa
3181 caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg
3241 ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc
3301 tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg
3361 ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg
3421 agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg
3481 aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc
3541 gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagtatacac
3601 tccgctatcg ctacgtgact gggtcatggc tgcgccccga cacccgccaa cacccgctga
3661 cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc
3721 cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gg
//
LOCUS ECOTRMF 77 bp ss-tRNA RNA 28-JUL-1990
DEFINITION E. coli initiator Met-tRNA-f.
ACCESSION K00305 M25117
KEYWORDS transfer RNA; transfer RNA-Met.
SOURCE E. coli (strain CA265) tRNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 77)
AUTHORS Dube,S.K. and Marcker,K.A.
TITLE The nucleotide sequence of N-formyl-methionyl-transfer RNA: Partial
digestion with pancreatic and T-1 ribonuclease and derivation of
the total primary structure
JOURNAL Eur. J. Biochem. 8, 256-262 (1969)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 77)
AUTHORS Uemura,H., Imai,M., Ohtsuka,E., Ikehara,M. and Soell,D.
TITLE E. coli initiator tRNA analogs with different nucleotides in the
discriminator base position
JOURNAL Nucleic Acids Res. 10, 6531-6539 (1982)
STANDARD full staff_review
REFERENCE 3 (sites)
AUTHORS Dahlberg,J.E., Kintner,C. and Lund,E.
TITLE Specific binding of tRNA-Met-f to 23S rRNA of Escherichia coli
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 1071-1075 (1978)
STANDARD simple staff_entry
COMMENT [1] Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss from
their entry 1310 in Nucleic Acids Res. 11, r1-r54 (1983).
[1] notes that there may be either another minor Met-tRNA-f or a
modification of this sequence, because there is a small amount of
an oligonucleotide which shows that base 47 is adenosine instead of
m7g.
[2] generated all possible substitutions at the fourth base up from
the 3' end (position 74); all four variants accepted methionine in
in-vitro aminoacylation reactions, implying that the "discriminator
hypothesis" is incorrect.
FEATURES from to/span description
tRNA 1 77 Met-tRNA (NAR: 1310)
modified 8 8 s4u
modified 21 21 d
modified 33 33 cm
anticdn 35 37 Met-tRNA-f anticodon cat
modified 47 47 m7g
modified 55 55 t
modified 56 56 f
BASE COUNT 14 a 26 c 25 g 12 t
ORIGIN 5' end of mature Met-tRNA-f.
1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggtcg tcggttcaaa
61 tccggccccc gcaacca
//
LOCUS MCPRNA3A 80 bp ss-RNA VRL 28-JUL-1990
DEFINITION Cowpea mosaic virus M RNA 3' terminal sequence.
ACCESSION M25438
KEYWORDS .
SOURCE Cowpea mosaic virus RNA.
ORGANISM Cowpea mosaic virus
Viridae; ss-RNA nonenveloped viruses; Comoviridae.
REFERENCE 1 (bases 1 to 80)
AUTHORS Davies,J.W., Stanley,J. and Van Kammen,A.
TITLE Sequence homology adjacent to the 3' terminal poly(A) of cowpea
mosaic virus RNAs
JOURNAL Nucleic Acids Res. 7, 493-500 (1979)
STANDARD simple staff_entry
BASE COUNT 22 a 13 c 13 g 32 t
ORIGIN
1 tatgaattta atttcttttg tgagctcctg tttagcaggt cgtcccttca gcaaggacac
61 aaaaagattt taattttatt
//
LOCUS MCPRNA3B 80 bp ss-RNA VRL 28-JUL-1990
DEFINITION Cowpea mosaic virus B RNA 3' terminal sequence.
ACCESSION M25439
KEYWORDS .
SOURCE Cowpea mosaic virus RNA.
ORGANISM Cowpea mosaic virus
Viridae; ss-RNA nonenveloped viruses; Comoviridae.
REFERENCE 1 (bases 1 to 80)
AUTHORS Davies,J.W., Stanley,J. and Van Kammen,A.
TITLE Sequence homology adjacent to the 3' terminal poly(A) of cowpea
mosaic virus RNAs
JOURNAL Nucleic Acids Res. 7, 493-500 (1979)
STANDARD simple staff_entry
BASE COUNT 22 a 12 c 13 g 33 t
ORIGIN
1 taaataatgc ttatgttttt gtttgctcct gtttagcagg tcgttccttc agcaagaaca
61 acaaaaatat gtgttttatt
//
LOCUS PPCCGAAA 5306 bp ds-DNA VRL 28-JUL-1990
DEFINITION Hamster papovavirus complete genome.
ACCESSION M26281
KEYWORDS complete genome.
SOURCE Hamster papovavirus DNA.
ORGANISM Hamster papovavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Papillomavirus.
REFERENCE 1 (bases 1 to 5306)
AUTHORS Delmas,V., Bastien,C., Scherneck,S. and Feunteun,J.
TITLE A new member of the polyomavirus family: The hamster papovavirus.
Complete nucleotide sequence and transformation properties
JOURNAL EMBO J. 4, 1279-1286 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
ORF 192 839 early proteins polyprotein (T antigens)
ORF 5083 4046 (c) VP2
ORF 4711 4046 (c) VP2
ORF 4045 2927 (c) VP1
BASE COUNT 1595 a 1124 c 1080 g 1507 t
ORIGIN
1 ccccttgcct ccttagctct caagtagaaa aggaagagag gcttttgggg ctttttggct
61 ttaagcctca ttttatgagc aggaggagct tgttgcaact tgagaggcgt tttgaggctt
121 ccaggcagag aatactcaca gaccccacac agtctagacg ctcagaagca tctctagctg
181 caacaagcaa gatggataga attcttacta aagaagaaaa gcaagcctta ataagtttac
241 tagatttgga gccacaatat tggggagact atggacgaat gcagaaatgc tacaagaaaa
301 agtgtcttca actgcatcct gataaaggtg gcaatgaaga gctcatgcaa cagcttaata
361 ccctgtggac caaactaaaa gatggtcttt acagagttag gctgttactt gggcctagtc
421 aggtaagaag acttggaaaa gatcagtgga atttatcttt acagcaaaca ttttctggta
481 cctactttag gaggctctgc agactcccca ttacctgcct aagaaacaag ggaattagta
541 cctgcaattg catactttgt ttgctcagaa aacagcattt tctgctaaag aagtcctgga
601 gagtaccttg cctggtgtta ggagaatgct actgcataga ctgctttgcc ttatggtttg
661 gcctgccagt taccaatatg ctggttccat tatatgcaca atttcttgct ccaatacctg
721 tggattggct tgatctgaat gttcatgagg tctacaatcc ggcctcaggt atgtatgaat
781 atggggggct tatagttgta actgtacaag tttaaaatgt gcttttttca ggaccctaat
841 gcttccacct ccaccagcag acccggagag ttctacaatc ctgacacagg aggatactgg
901 tcctactctt atgggtcagc aggatactct gaccagcaga agaaatactg ggaagagttt
961 ttctctaagt gggatgttaa tgaggacctc acctgccaag aagagttatc atcatcagaa
1021 gatgaattca ccccctggca tcccaatccc cccccctccc ctgtttctat ttccagtgac
1081 agctccagtt cctcctgtga cgaggaatac ccaagaaact caagcagaaa gagaaaacga
1141 gtacatgcca atggctcccc aaatacacct atacagccaa ataagagagc ccacacacca
1201 ggaggaggaa gaaccacaat acgaggagat accgatatac ctagaactcc tgccagagaa
1261 tcccaatcaa catttggctc ttacttcaac agcacggagg agcttgagga ggaaatatca
1321 caaacacaac agtcacatca taacacaacg ccaaagaaac cgcctccgac ggttagtcct
1381 gatgattttc ctactatcct tagggggttt ctttctcacg ctattttttc taataaaacg
1441 caaaatgcat ttataatcta cagtactaag gaaaaatgtg aagtacttta tgaacaaata
1501 gacaaatata atccagacta taaaggtatc ttcattatga aacaaacaga agcatttgta
1561 atgtttatga ctcctggaaa acatagagta gctgcagtta aaagttactg ttgtaaattt
1621 tgtaccgtta gcttcctgct atgcaaagct gttacaaaac cgttagagtt gtataactgt
1681 gtggctaaat gtgatgactt tcaaatttta aaagaaaata agcctggtct atatcatttt
1741 gaattctgtg atgaaaaaaa agaggtgaag caaatagact ggaatttcct aacatctttt
1801 gcagttgaaa atgagttaga tgatcctctt gtaattatgg gacattatct agaatttagt
1861 cagtgtgaaa gctcttgcaa aaagtgtgca gaagctttac caaggatgaa agtccactgg
1921 gctaaccaca gtcagcactt agagaatgct gagcttttct tacactgcaa acaacagaaa
1981 agtatctgtc agcaagcagc agataatgtt ctggcaagga gaagattaaa ggtccttgaa
2041 tcaacaagac aagaattgtt ggcagagaga ctgaacaaac tgttagacca attaaaagat
2101 ttatctcctg tagataagca tttatatctt gctggagtag cctggtacca atgtatgttt
2161 cctgattttg agatgatgtt attagatatt ttaaaattgt ttactgaaaa tgttccaaaa
2221 aaaagaaatg tactttttag aggtcctgta aattcaggga aaactagcct tgctgcagct
2281 atcatgaatc ttgtaggagg agttgccctc aatgttaatt gtcctgcaga taagctcaac
2341 tttgaacttg gtgttgctat agataaattt gcagtagtct ttgaagatgt caaaggacaa
2401 accggagata agagacacct acagtctgga cttggaatta ataaccttga taacctgaga
2461 gattaccttg atggaagtgt aaaggttaat ttagaaaaga agcatgtaaa taagaggtcc
2521 cagatatttc ctccttgtat tgttactgct aatgaatatt tttttcctca aacactctat
2581 gccagattcc ataaagttta taactttgaa gtgaaggatt ttcttgccaa gagccttgag
2641 gaaaacagtt acatggggag acatagagtc tgtcaaagtc cacttacaat gctgatagca
2701 ttgctttgga atgtacccac tgaaaatttt gataagtctc tcaaagagaa ggtggaaaca
2761 gaaaagaagg ttttgtctga tatgtgtaac tttactacat ttgcagaaat gtgtctcaat
2821 attcagaggg gtgctgatcc ccttgaggca ttgtaattga ggaggaaaca ataattgatg
2881 aataaagcat ttattagaag ctctgtgtac agtcattttt caagcattag tttgctggtt
2941 ttgcaggggg tttagtatgc tgttggccat acttgtcaat gaacctattc acatctgggt
3001 caccaggaac agcctctgta ccctcataaa tcctgacttc ttctacctga gcagcttctc
3061 cttccatggg ctggccttca attgttggaa gcatattgtt gtacaaagaa gctagcaagc
3121 ttgtaactgg gtaaggattt ttcacccatc tttttctcaa ggtcacatta aaatatctag
3181 gcagccccct ccaatgccag cctgcactgt tgtattctat gtaccagccc ataacatctg
3241 ctgcactgag ataaagccca tctcctttgc aaagaggccc aaccccattt tcatccagaa
3301 gcacagtagt caaggtatta gtaaactgca tcactggtgg agtaccagta ccacctgtga
3361 ggtacctacc atccttgtcc aattttgctt ttgcagtagg gtccagcacc tggtttgtgg
3421 aagtcattgc tttgccagta acagttttga tactaacaat agctgcctca taatttgcat
3481 tatagttctg cactaggcct tgcaaatcta atggttctcc tcccactgca aacatgtggt
3541 aagttgtacc ctcaactggt ttggaaattc caatatcctt tgtctcactt ctggagccat
3601 atccatgcac atttagaagg gatcccactc caacaacttc agtttttaca gatacagcct
3661 cccacatttg aagggtatca caggtcaaat cttcattcag tgttggaagc tgtattttag
3721 ccatactgta atatggcagt tgattagcct tcacttcatc agcagtaagg gagctattta
3781 ctttaatact ctgggagaac ccataatact ggccatcagt tcctgtgcca ggcttgttct
3841 gacccattct aggattaagg taggcctcaa tttgtgtgat actgtcttct cctgttacaa
3901 gatcaagcac acccacacca ccccgcataa taagcttggg aacattagca ggctttggac
3961 agggtttcca caggggtttg cacatctact ggaagcgccg ctttttcttt ttggggccat
4021 actcaacctc atcaatgtat gtctgccaag taggactaat gtctccgtac aatcctagaa
4081 ttaaaggaag catccaatca ggtgtcactc tttggtgggc tccaccagga gcaaaatacc
4141 tcatgatatt tgcccctgat tcaaaccaac tagaagagtc ttcctgctgt tgacttcttc
4201 tctggacatc aggtctccct agttcagctt ctaatacttg tctgctattg gcatcttcaa
4261 tagaaggtct attactgtat tctaaagctc tttctatttg tcttctttga gctggattaa
4321 ttcctggaag ttctgcatag tagttttgta ggccaccata tattctacta taggcctctc
4381 taggtaaatt agtaacaacc catctactat tttccatcat tctggcaatg gcatctaaaa
4441 attgatgggt ggtctgcaaa cttaagtccc tcacagcaga ttctacagct ccttgagttt
4501 cccttctcaa agtatcccag atatactctc ctacagattg aaataatgag tggccccagc
4561 catgaataac atccaaggca tgggtaaatg actgtacacc agggaataat atatcatagt
4621 agtcagctgg tctcctggga ataagtgcca tgtttctatt cacaatcggt acttcgtgag
4681 caagatagcc gtgtagactt cccaaagaga aggctgctga acctgcaaca gtttgaaaaa
4741 taaatgctgt ctggactgat tctctcacaa actcagtcat tactgttgat gttaactcag
4801 gggcagcttg cataaatata aacatgtctt cacttaggcc aattgaagac aaagctgtct
4861 cagctcctaa aaacccctcc atagttatta atgaagtaac ttgggcatct atcgcggcaa
4921 aggcttctcc actaagtatg gcctctactg aaattccagt aactgatgaa atttcggaga
4981 ggtagctgat catctcaata atcactgaaa tggcagatcc catgttgact tacttgaaca
5041 gtttgaaaat cttctgaact gtttcaggca ggtttttagg ccgaattcta aagaaacaga
5101 aagcaaacac tcagcgccga agagcaggaa atggctgacc actgcacttg ggcgacacga
5161 cacgcctagc gataaggaag tcaccatggc aacataaccg cagcactgct gttgtcacag
5221 ttgcctagca aatgacagac tcagcaacca caggagagga aatgataggg ctagcatttt
5281 ttcaaatgta aaccagaggc tagggg
//
LOCUS RATGST2YB 500 bp ss-mRNA ROD 28-JUL-1990
DEFINITION Rat liver glutathione S-transferase Ya subunit mRNA.
ACCESSION M26874
KEYWORDS S-transferase; glutathione S-transferase; ligandin; transferase.
SOURCE Rat liver cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 500)
AUTHORS Daniel,V., Sarid,S., Bar-Nun,S. and Litwack,G.
TITLE Rat ligandin mRNA molecular cloning and sequencing
JOURNAL Arch. Biochem. Biophys. 227, 266-271 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 402 glutathione S-transferase Ya subunit (AA at 1)
BASE COUNT 153 a 112 c 109 g 126 t
ORIGIN
1 gccctgattg acatgtattc agagggtatt ttagatctga ctgaaatgat tatccaattg
61 gtaatatgtc ccccagacca aagagaagcc aagaccgcct tggcaaaaga caggaccaaa
121 aaccggtact tgcctgcctt tgaaaaggtg ttgaagagcc atggccaaga ctaccttgta
181 ggtaacaggc tgacccgggt agacatccac ctgctggaac ttctcctcta tgttgaagag
241 tttgatgcca gccttctgac ctctttccct ctgctgaagg ccttcaagag cagaatcagc
301 agcctcccca atgtgaagaa gttcctgcag cctggcagtc agagaaagct tcccgtggat
361 gcaaaacaaa tcgaagaagc aaggaagatt ttcaagtttt agcggagctg cactatccaa
421 tttctttatg ttttgcaaaa aatgagaagc aattgttgat cctaggtatt tttgaaataa
481 taaacacgaa aaaatactct
//
LOCUS CPARBCSL 528 bp ds-DNA PLN 28-JUL-1990
DEFINITION C.paradoxa ribulose-1,5-bisphosphate carboxylase/oxygenase
large (rbcL) and small (rbcS) subunits, 3' end and complete cds.
ACCESSION M35728
KEYWORDS ribulose-1,5-bisphosphate carboxylase/oxygenase.
SOURCE C.paradoxa DNA.
ORGANISM Cyanophora paradoxa
Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae;
Cryptomonadales; Kathablepharidaceae.
REFERENCE 1 (bases 1 to 528)
AUTHORS Starnes,S.M., Lambert,D.H., Maxwell,E.S., Stevens,S.E.Jr.,
Porter,R.D. and Shively,J.M.
TITLE Cotranscription of the large and small subunit genes of ribulose-
1,5-bisphosphate carboxylase/oxygenase in Cyanophora paradoxa
JOURNAL FEMS Microbiol. Lett. 28, 165-169 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 18 ribulose-1,5-bisphosphate
carboxylase/oxygenase large subunit (rbcL)
pept 124 444 ribulose-1,5-bisphosphate (AA at 1)
carboxylase/oxygenase small subunit (rbcS)
BASE COUNT 189 a 85 c 60 g 194 t
ORIGIN
1 actattgata ctatctaata tcatttaatt tatttaatta tttagagttt aaaactctaa
61 ataattaatc aaaatgatat tacttcaatc tatttttacc ttaaaattcg gaattataaa
121 taaatgcaac ttagagtaga acgtaagttc gaaacttttt cttatttacc accattaaac
181 gaccaacaga ttgcgcgtca attacaatac gcactttcca atggttatag cccagcaatc
241 gaattcagtt ttacaggtaa agctgaagac ttagtatgga ctttatggaa attaccttta
301 tttggtgcac aatctcctga agaagtactt agcgaaattc aagcttgtaa acaacagttc
361 cctaatgctt acattcgtgt tgtagcattt gactctatca gacaagttca aactttaatg
421 ttcttagttt acaaaccatt atagtttaat tgatatctac tctaattgat agatatcaat
481 ttttaattaa tctacaaaac aaaattatct aattattatt aatacttt
//
LOCUS HUMCFIX 873 bp ss-mRNA PRI 28-JUL-1990
DEFINITION Human coagulation factor IX mRNA, partial cds.
ACCESSION M35672
KEYWORDS coagulation factor IX; serine protease.
SOURCE Human adult liver, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 873)
AUTHORS Jagadeeswaran,P., Lavelle,D.E., Kaul,R., Mohandas,T. and
Warren,S.T.
TITLE Isolation and characteriztion of human factor IX cDNA:
Identification of Taq I polymorphism and regional assignment
JOURNAL Somat. Cell Mol. Genet. 10, 465-473 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 873 coagulation factor IX (AA at 1)
BASE COUNT 279 a 146 c 205 g 243 t
ORIGIN
1 aacgccaaca aaattctgaa tcggccaaag aggtataatt caggtaaatt ggaagagttt
61 gttcaaggga accttgagag agaatgtatg gaagaaaagt gtagttttga agaagcacga
121 gaagtttttg aaaacactga aagaacaact gaattttgga agcagtatgt tgatggagat
181 cagtgtgagt ccaatccatg tttaaatggc ggcagttgca aggatgacat taattcctat
241 gaatgttggt gtccctttgg atttgaagga aagaactgtg aattagatgt aacatgtaac
301 attaagaatg gcagatgcga gcagttttgt aaaaatagtg ctgataacaa ggtggtttgc
361 tcctgtactg agggatatcg acttgcagaa aaccagaagt cctgtgaacc agcagtgcca
421 tttccatgtg gaagagtttc tgtttcacaa acttctaagc tcacccgtgc tgagactgtt
481 tttcctgatg tggactatgt aaattctact gaagctgaaa ccattttgga taacatcact
541 caaagcaccc aatcatttaa tgacttcact cgggttgttg gtggagaaga tgccaaacca
601 ggtcaattcc cttggcaggt tgttttgaat ggtaaagttg atgcattctg tggaggctct
661 atcgttaatg aaaaatggat tgtaactgct gcccactgtg ttgaaactgg tgttaaaatt
721 acagttgtcg caggtgaaca taatattgag gagacagaac atacagagca aaagcgaaat
781 gtgattcgaa ttattcctca ccacaactac aatgcagcta ttaataagta caaccatgac
841 attgcccttc tggaactgga cgaaccctta gtg
//
LOCUS HUMMHDRBPV 292 bp ds-DNA PRI 28-JUL-1990
DEFINITION Human MHC class II HLA-DR-beta-I allele gene, partial cds.
ACCESSION M35651
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human (Pemphigus vulgaris patient, haplotype DR4 Dw10) blood DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 292)
AUTHORS Scharf,S.J., Long,C.M. and Erlich,H.A.
TITLE Sequence analysis of the HLA-Dr-beta and HLA-DQ-beta loci from
three Pemphigus vulgaris patients
JOURNAL Hum. Immunol. 22, 61-69 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept / 26 > 292 HLA-DR-beta, exon 2 (AA at 26)
BASE COUNT 64 a 74 c 101 g 53 t
ORIGIN
1 ccggatcctt cgtgtcccca gaccacgttt cttggagcag gttaaacatg agtgtcattt
61 cttcaacggg acggagcggg tgcggttcct ggacagatac ttctatcacc aagaggagta
121 cgtgcgcttc gacagcgacg tgggggagta ccgggcggtg acggagctgg ggcggcctga
181 tgccgagtac tggaacagcc agaaggacat cctggaagac gagcgggccg cggtggacac
241 ctactgcaga cacaactacg gggttgtgga gagcttcaca gtgcagcggc ga
//
LOCUS MUSC3B 647 bp ss-mRNA ROD 28-JUL-1990
DEFINITION Mouse complement component 3 (C3) mRNA, partial cds.
ACCESSION M35659
KEYWORDS complement component 3.
SOURCE Mouse liver, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 647)
AUTHORS Fey,G., Domdey,H., Wiebauer,K., Whitehead,A.S. and Odink,K.
TITLE Structure and expression of the C3 gene
JOURNAL Springer Semin. Immunopathol. 6, 119-147 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 647 complement 3 (AA at 1)
BASE COUNT 170 a 171 c 168 g 138 t
ORIGIN
1 atccccatgt attccatcat tactcccaat gtcctacggc tggagagcga agagaccatc
61 gtactggagg cccacgatgc tcagggtgac atcccagtca cagtcactgt gcaagacttc
121 ctaaagaggc aagtgctgac cagtgagaag acagtgttga caggagccag tggacatctg
181 agaagcgtct ccatcaagat tccagccagt aaggaattca actcagataa ggaggggcac
241 aagtacgtga cagtggtggc aaacttcggg gaaacggtgg tggagaaagc agtgatggta
301 agcttccaga gtgggtacct cttcatccag acagaccaga ccatctacac ccccggctcc
361 actgtcttat atcggatctt cactgtggac aacaacctac tgcccgtggg caagacagtc
421 gtcatcctca ttgagacccc cgatggcatt cctgtcaaga gagacattct gtcttccaac
481 aaccaacacg gcatcttgcc tttgtcttgg aacattcctg aactggtcaa catggggcag
541 tggaagatcc gagcctttta cgaacatgcg ccgaagcaga tcttctccgc agagtttgag
601 gtgaaggaat acgtgctgcc cagttttgag gtccgggtgg agcccac
//
LOCUS P30LTA 777 bp ds-DNA BCT 28-JUL-1990
DEFINITION Plasmid P307 (from E.coli) heat-labile enterotoxin subunit A (LTA)
gene, complete cds.
ACCESSION M35581
KEYWORDS enterotoxin.
SOURCE Plasmid P307 (from Escherichia coli) DNA, clone pAT153.
ORGANISM Plasmid P307
Unclassified.
REFERENCE 1 (bases 1 to 777)
AUTHORS Dykes,C.W., Halliday,I.J., Hobden,A.N., Read,M.J. and Harford,S.
TITLE A comparison of the nucleotide sequence of the A subunit of heat-
labile enterotoxin and cholera toxin
JOURNAL FEMS Microbiol. Lett. 26, 171-174 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 777 heat-labile enterotoxin subunit A (LTA)
BASE COUNT 255 a 136 c 164 g 222 t
ORIGIN
1 atgaaaaata taactttcat tttttttatt ttattagcat cgccattata tgcaaatggc
61 gacagattat accgtgctga ctctagaccc ccagatgaaa taaaacgttc cggaggtctt
121 atgcccagag ggcataatga gtacttcgat agaggaactc aaatgaatat taatctttat
181 gatcacgcga gaggaacaca aaccggcttt gtcagatatg atgacggata tgtttccact
241 tctcttagtt tgagaagtgc tcacttagca ggacagtcta tattatcagg atattccact
301 tactatatat atgttatagc gacagcacca aatatgttta atgttaatga tgtattaggc
361 gtatacagcc ctcacccata tgaacaggag gtttctgcgt taggtggaat accatattct
421 cagatatatg gatggtatcg tgttaatttt ggtgtgattg atgaacgatt acatcgtaac
481 agggaatata gagaccggta ttacagaaat ctgaatatag ctccggcaga ggatggttac
541 agattagcag gtttcccacc ggatcaccaa gcttggagag aagaaccctg gattcatcat
601 gcaccacaag gttgtggaaa ttcatcaaga acaatcacag gtgatacttg taatgaggag
661 acccagaatc tgagcacaat atatctcagg gaatatcaat caaaagttaa gaggcagata
721 ttttcagact atcagtcaga ggttgacata tataacagaa ttcgggatga attatga
//
LOCUS PIGFSHB 929 bp ss-mRNA MAM 28-JUL-1990
DEFINITION Pig follicle stimulating hormone (FSH) beta-subunit mRNA,
3" end.
ACCESSION M35676
KEYWORDS follicle stimulating hormone.
SOURCE Pig anterior pituitary, cDNA to mRNA.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 929)
AUTHORS Kato,Y.
TITLE Cloning and DNA sequence analysis of the cDNA for the precursor of
porcine follicle stimulating hormone (FSH) beta-subunit
JOURNAL Mol. Cell Endocrinol. 55, 107-112 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 348 follicle stimulating hormone beta-subunit (AA
at 1)
BASE COUNT 256 a 230 c 207 g 236 t
ORIGIN
1 gccatctgct gcaatagctg tgagctgacc aacatcacca tcacagtgga gaaagaggag
61 tgtaacttct gcataagcat caacaccacg tggtgtgctg gctattgcta cacccgggac
121 ctggtataca aggacccagc caggcccaac atccagaaaa catgtacctt caaggagctg
181 gtgtacgaga ccgtgaaagt acctggctgt gctcaccatg cagactccct gtatacgtat
241 ccagtagcca ctgaatgtca ctgtggcaag tgtgacagtg acagtactga ctgcaccgtg
301 agaggcctgg ggcccagcta ctgctccttc agtgaaatga aagaataaag agcagtggac
361 atttcatgct tcctaccctt gtctgaagga ccaagacgtc caagaagttt gtgtgtacat
421 gtgcccaggc tgcaaaccac tatgagagac cccactgatc cctgctgtcc tgtggaggag
481 gagctccagg aatgcagagt gctagggcct cagtcccatc accactcaac cctgtatttt
541 gggtctggtt ccataagttt tattcggtct ttttttttaa attactcaat gaattttatt
601 acatttataa ttgtacaatg atcatcacaa cccaatttta taggatttcc atcccaaacc
661 cccagcatag acccccatct cccaatctgt ctcatttgga aaccataagt ttttcaaagt
721 ccgtgagtca gtatctactc agtcttatta ccttaaagac atgtgggtgt tttctgttta
781 ataatcttag aaatcctctc aagacaggga tatggaccca gaggaaggaa atgggctaag
841 aatgggtgaa aggactaaat gcagcattct cccactagac acagcagcct acaagagcag
901 ggccagtctc tttgtcatga gtgtggccg
//GenBank-Updates@genbank.bio.net (07/31/90)
LOCUS CHKCYTA 4400 bp ds-DNA VRT 31-JUL-1990
DEFINITION Chicken cytotactin gene, 5' flank.
ACCESSION M35369
KEYWORDS cytotactin.
SOURCE Chicken DNA, (library of Clontech), clone CTG[1,2].
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 4400)
AUTHORS Jones,F.S., Crossin,K.L., Cunningham,B.A. and Edelman,G.M.
TITLE Identification and characterization of the promoter for the
cytotactin gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.S.Jones, 19-JUN-1990.
Author address: F.S.Jones
Laboratory of Developmental and Molecular Biology
The Rockefeller University
1230 York Avenue
N.Y., N.Y. 10021
FEATURES from to/span description
signal 3305 3301 (c) GAGA box
signal 3996 4001 TATA box
BASE COUNT 1156 a 937 c 1031 g 1276 t
ORIGIN
1 gtcgacctgc aggtcaacgg atctgagcat cacttgccaa cttgtaccct cagatgctgt
61 gtctatatct gcttaggtca gacttccatt ggagtagggg atatatttct gcttttccta
121 atatcaaaac aatgcccttc agcatgggtt gcaattatga gaatcaatcc agcctcaaag
181 caatgtcatt gtcaatgctg tatgttgaat taaatgtagt catactgctg tttacactgg
241 aaaaggatgg tctatgaata actatataaa atagtattcc catttaaata tatgcactga
301 acaaatagtc aagagtttgc agtgtttttg aaaactcgta ttatacttga gagccattag
361 ttcaggaaag gaggatatta gtacacgggc agcatgaatt gttgagaata tcagatactt
421 aaagcaaaag tttagctatt tagatgctat ctgaaaaaga agttaagcta tcttcttgcg
481 tattagcact gagtgattcc aaacagcaaa tgtagaggat cgaccctgtg atatgatgtc
541 tctctttaaa acctactctt tcatctttct cttaacatta cataagataa actatagttg
601 gtgggttctg tttctcaggc tacctattct gaatctggtg tggctgtttc tgctgtgttc
661 ttgtgcgctg ctgttgagcc tggtactgct ctaacatccc tttttttgca agaaatgtgc
721 ttccttccag ctacaggcac tgaatacttc ccctggctcc tgaaattgtc caagaatgtt
781 tccctgcccc aaggcatgat catctgtcca tttgttttgt aagttatatt ccagtggctc
841 agtgcattcc catctgcaca ctattgaaat cagcttcgct acgtgcatgg caagatgcat
901 ctcacgctgt gctcatctct gtgccttcag ggcagctttt acactgtctg atatgccaat
961 acaaaataaa ttattcctgg gactgggtag attaagtttt caagctgaac taagactaat
1021 tacctgtgtt gcaacaattt gtcccaaggt aacgatgagt atgtagtgtg aaatccacta
1081 ttttccagaa tgatgagttg ggatctcttt caggctttca tgctgttctt gatttcatgc
1141 tatgtccaac tgttcctaga tttgtctgta atgattatcc ttacatgtgt gaattgcatc
1201 tgcagtgtga gcttgtgtgt aaaaagacaa cgataaactc agctgcatat acaaaattta
1261 atgctcgttt actttctcct tttcattaaa atagaaatca actgatattg acagtcatct
1321 agtgaaagag aatggcacag ggcaaatgat tcagttgtaa tccttcatgt ttgacatcac
1381 aacctaaagc tatcctgctg attaaaatac attttccctg ttctttcatg ttatcatttg
1441 cttctgactc aatgtatcac tcagtgtttt ccacagagtg catttaaaga tgtgtgagcc
1501 agaaaagata aaaccaggac aggcctggac tcttacctga ctcttattca ccgcatcacg
1561 gcacgttgtt tgaagtcagg taagtgggtg gaggaagata ggagaaaaaa ttggtttaaa
1621 ttttgattca atagctgctg gcagaagacc tggtgttagc aaacactaca aaaacaaatc
1681 ctgcagattg cttcccgttg tttctgtctt gcttagactt gtggatgtgg acagagagat
1741 ggagtgtttc tcaaatggat ctgaatggac ttctggagta agagaaagca catggtgcat
1801 tctatgggga ctgtctgctt ctcagtcccc atcagatctt attgggaaaa tctcacattg
1861 ggtttttgcc attgagttta attgaatttc acaggactgt atgactgggg gagaattaag
1921 ctatgctggc accatgatca ccttgagaat attctgggga gaacagcaga caaacattga
1981 gtatttattg ggggctgcaa acggtaacct tattgccttt ggaatcaatg gaaacgtcca
2041 gaacaagttc tacttttaag aggtgttcca gaaccgtgga gatgtggcac tgcaggacgt
2101 ggtcagtggg catggtggga tgggttgggg ttggacttgg gggatctggg agatcttttg
2161 caagcctaat gattctgtga ttctaagtca gcagaaagaa tccaatgcct ttactgggga
2221 gcagtcagca gttctgacac tctttccagt aggatcttgt ttctgctctt atgacgtgac
2281 tgctatggaa cacagcgagc cagaaaatta aagcaaaaat cagaaatgag tgctcataaa
2341 tggctgatag ggatgcatcc ctgattgtac tgaagcgttt gatctcggtg tttcctgaat
2401 acttctgtag ttgctcggag aaatgggatc tgtttgttgt acaaatcatt tgtaaaaacg
2461 tcaccaagtt ttgcttgctt ttttgtcagc acctgtttag aaatgattga gagttgtgct
2521 gtgcattgaa gaagggaatg agctcagtaa tattgttaca catcaggaaa taaacacaag
2581 ggtcactggt aaataggaaa tccccaccaa agaaaatctg acccgtgtta ccaaaagaac
2641 tctgttttct gtaatgatga taataataat gattctaaaa tggccatcag aattttgggg
2701 ccaggaattc cccatccctt cccgaaggtg gagaatgggc caagttttcc tgttagtgct
2761 gtgcctgtga agcaggaaaa ggaaaaaatc ctagaagtga tgcgagtcca tggaggctgg
2821 ggaaggccag ctgggaggag gcacactcag aaagggagaa tgctcagaag ggcctggaca
2881 gtccctctta attcttgtga gcacccttgt tgctgaatag gaccccactg ctttcttact
2941 gggcagtccc acagcctcct gaggtaggtg ggtaatgctt aaagggattt ggatcagtcc
3001 tgcttctgaa atggtgaagc aatttcatcc cttctcctag aacacagaca tgacagagct
3061 taactctcta agagcttaac cctgcagctc ccctccctct aactaccagc ccccccccac
3121 ccccgaaaga aaggtagatt cagatagaag ggagctgact gctccctgag tgctaactgt
3181 gatcacaatt ctttgccatg aggctgagat ttgagggtaa gcaggagacc ctccctctga
3241 gcagcactgt gcattgcccg caggtccggg actgaccaag ggactcacgg aggggttgga
3301 ggatctttcc aacacccaaa tgcacagtct ctctttaaaa agaaggaagc cctcctgtgc
3361 gcttctgtgg gcttttatct ctctcttttt aagagaatca gcttaggatg ctgctcagcc
3421 ctgctgtccc ccgcagctga gggagagcag aagcagcgtg ggatgcagag atcacaagcc
3481 tggaaagacc gatgtgtgcc agatcagcag cttgtgtaca tccgagctgc tctgctcatc
3541 cttgcagagc agtgccagcc ccactcgctg cccatccgcc ccagcacact ccgagggact
3601 ttttggaatg gaaaccctgt gtttgtgatc cccctgattc cagtttgctg ctctttcgca
3661 gtgttaaaat aaaggggaaa ggggtttaaa attcctgata atgcactctg ctgatgctgt
3721 aaggaaaata ttcctctttc tgagtcattt gccctgagct gacttttttt tttttgtttg
3781 aagggttcag gggtttcctt ctttagccat aaactgcatg caaattcttt gtgtgtgtgt
3841 gtatgacagt gtgtgtctgc gagtgcgtgt gtgtttcttc agaagctaag ccaaagagag
3901 aggctgttcc tttaaagcaa aactgaaaaa aaaaatgcct tcaggcatcc tattacagtg
3961 gaatgagcca aggatcagga tttgcagctc agctatataa acccggcaga atctggctga
4021 tctgaccagt gtgccgcact gtcaaaccct cctttcacac acgcgcgcac caaatgagac
4081 ggcacaactt ctctgagttt tgacaggacg gcgaggaatc cgggagccga cagctggctg
4141 cagtacctct gcttcgtgga ggtcgcccgt ggcaggatct gatccgtcag cccacacggt
4201 aagcctcctt ccttcctctg ttcaaatatc aggagctctt ttgccaccca agtctgcgac
4261 acggttccat tgcgaccgtg gcagctgtga ctttgttcag tcggagcggg caagggtttt
4321 taggagggag tgtttctgct ttcgttcggc tgtgctccct gatggcataa gaacgtagga
4381 aaggtttcag cggcatcgat
//
LOCUS HUMIFNB 507 bp ds-DNA SYN 31-JUL-1990
DEFINITION Human fibroblast interferon (IFN-beta) gene, complete cds.
ACCESSION M35591
KEYWORDS interferon.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 507)
AUTHORS Nagase,Y., Nakamura,N., Tohyama,J., Watanabe,S., Ogino,H.,
Horikoshi,K., Nii,A., Soma,M., Nobuhara,M., Suzuki,Y. and
Mochida,E.
TITLE Chemical synthesis of a human fibroblast interferon gene and its
expression in Escherichia coli
JOURNAL Nucleic Acids Symp Ser 12, 83-86 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept 3 503 fibroblast interferon (IFN-beta)
BASE COUNT 140 a 126 c 106 g 135 t
ORIGIN
1 cgatgtcgta caacctgctt ggattcctgc aacgctctag caacttccaa tgtcagaaac
61 tgctctggca gcttaatggt cgtctggaat actgcctgaa agaccgtatg aacttcgaca
121 tcccggaaga gatcaagcaa ctccagcaat tccagaaaga agacgcagct ctgactatct
181 acgaaatgct tcagaacatc ttcgcaatct ttcgtcagga cagctcctct actggttgga
241 acgagactat cgttgagaac ctgctcgcta acgtatatca tcagatcaac cacctgaaga
301 ctgttcttga agagaagctg gagaaagaag atttcactcg aggtaaactg atgtcttccc
361 ttcatctgaa gcgttattac ggtcgtatcc tgcattacct gaaagctaag gaatactctc
421 attgcgcttg gactatcgtt cgtgtagaaa tcctgcgtaa tttctacttt atcaaccgtc
481 tgactgggta ccttcgcaac tgatagg
//
LOCUS HUMMAC2 908 bp ss-mRNA PRI 31-JUL-1990
DEFINITION Human macrophage galactose-specific lectin (hMAC-2) mRNA, complete
cds.
ACCESSION M35368
KEYWORDS macrophage galactose-specific lectin.
SOURCE Human colon epithelial carcinoma cell line HT29, cDNA to mRNA,
clone hMac2.2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 908)
AUTHORS Cherayil,B., Chaitovitz,S., Wong,C. and Pillai,S.
TITLE Molecular cloning of a macrophage galactose-specific lectin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.J.Cherayil, 22-JUN-1990.
FEATURES from to/span description
pept 23 769 macrophage galactose-specific lectin (hMAC-2)
mRNA < 1 908 macrophage galactose-specific lectin mRNA
site 137 248 lectin domain
BASE COUNT 251 a 233 c 217 g 207 t
ORIGIN
1 ggagccagcc aacgagcgga aaatggcaga caatttttcg ctccatgatg cgttatctgg
61 gtctggaaac ccaaaccctc aaggatggcc tggcgcatgg gggaaccagc ctgctgggca
121 ggggctaccc aggggcttcc tatcctgggg cctaccccgg gcaggcaccc ccagggctta
181 tcctggacag gcacctccag gcgcctaccc tggagcacct ggagcttatc ccggagcacc
241 tgcacctgga gtctacccag ggccacccag cggccctggc tacccatctt ctggacagcc
301 aagtgccccc ggagcctacc ctgccactgg cccctatggc gcccctgctg ggccactgat
361 tgtgccttat aacctgcctt tgcctggggg agtggtgcct cgcatgctga taacaattct
421 gggcacggtg aagcccaatg caaacagaat tgctttagat ttccaaagag ggaatgatgt
481 tgccttccac tttaacccac gcttcaatga gaacaacagg agagtcattg tttgcaatac
541 aaagctggat aataactggg gaagggaaga aagacagtcg gttttcccat ttgaaagtgg
601 gaaaccattc aaaatacaag tactggttga acctgaccac ttcaaggttg cagtgaatga
661 tgctcacttg ttgcagtaca atcatcgggt taaaaaactc aatgaaatca gcaaactggg
721 aatttctggt gacatagacc tcaccagtgc ttcatatacc atgatataat ctgaaagggg
781 cagattaaaa aaaaaaaaaa agaatctaaa ccttacatgt gtaaaggttt catgttcact
841 gtgagtgaaa atttttacat tcatcaatat ccctcttgta agtcatctac ttaataaata
901 ttacagag
//
LOCUS MUSIGLCD 794 bp ds-DNA ROD 31-JUL-1990
DEFINITION Mouse Ig germline lambda-chain gene C5 region.
ACCESSION M35582
KEYWORDS constant region; germline; immunoglobulin light-chain;
lambda-immunoglobulin.
SOURCE Mouse (strain SPE) DNA.
ORGANISM Mus spretus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 794)
AUTHORS Mami,F., Cazenave,P.-A. and Kindt,T.J.
TITLE Conservation of the immunoglobulin C-lambda-5 gene in the Mus genus
JOURNAL EMBO J. 7, 117-122 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept / 294 613 Ig lambda-chain C5 region
BASE COUNT 195 a 188 c 195 g 216 t
ORIGIN
1 ggttccagct tcaaggggct agagaattca gctggtctta gtctcagcgg gggaactgag
61 attgcaaggg tctgggtctg ggtcatttta tctggaagag gaacatgttc taatggatgc
121 taggctgtct gctctccaag ggactcaagt ggtcagagga gaagaaggaa gcatccctgg
181 atggaagact gatgctgtag tgaatggcca cagagctcct gataagagaa ggacgcttcc
241 ttatcacatg ggctctccta tgctaactct tatccccttc tctatctgcg caggtcagcc
301 caagtctgat cccttggtca ctctgttcct gccttcctta aagaatcttc aggccaataa
361 ggtcacactg gtgtgtttgg tgagcgaatt ctacccaggt actttggtgg tggactggaa
421 ggtagatggg gtccctgtca ctcagggtgt agagacaacc caaccctcca aacagaccaa
481 caacaaatac atggtcagca gctacctgac attgatatct gaccagtgga tgcctcacag
541 tagatacagc tgccgggtca ctcatgaagg aaacactgtg gagaagagtg tgtcacctgc
601 tgagtgttct tagaccacaa tcctccctga agccttaggg gcctggatct gaagtgccag
661 aaaaagttgt tttttgtttt tttttttttt tccccattaa ccatctcact gtctttcctg
721 tgcctaatac tcaataaata tcttaccacc aaccagagag tcttgtcgat ctcatcattt
781 gcatgccagg gatg
//
LOCUS PSELSP 2396 bp ds-DNA BCT 31-JUL-1990
DEFINITION P.fluorescens signal peptidase II (lsp) gene, complete cds and
isoleucyl-tRNA synthetase (ileS) gene, 3' end.
ACCESSION M35366
KEYWORDS isoleucyl-tRNA synthetase; signal peptidase II.
SOURCE P.fluorescens (strain NCIB 10586) DNA.
ORGANISM Pseudomonas fluorescens
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 2396)
AUTHORS Isaki,L., Beers,R. and Wu,H.
TITLE Nucleotide sequence of the Pseudomonas fluorescens signal peptidase
II (lsp) and flanking genes
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Wu, 18-JUN-1990.
Author address:H.Wu
Uniformed Services University of the Health Sciences
4301 Jones Bridge Rd.
Bethesda, MD 20814
FEATURES from to/span description
pept < 1 729 isoleucyl-tRNA synthetase (ileS) (AA at 1)
pept 722 1234 signal peptidase II (lsp)
pept 1227 1679 ORF149
pept 1857 > 2396 ORF316
rpt 768 810 inverted repeat
rpt 1239 1268 inverted repeat
rpt 1640 1674 inverted repeat
rpt 1697 1889 inverted repeat
BASE COUNT 498 a 706 c 712 g 478 t 2 others
ORIGIN
1 ctgcagcgcg agttgcagga acactacggc gaataccgct tctggaacgt ctactccaag
61 atccacaact tctgcgtgca ggagctgggt ggtttctacc tcgacatcat caaggaccgc
121 cagtacacca ctggcgccaa cagcaaggcg cgccgttcgg cagaccgcgc tgtaccacat
181 cagcgaaggc tggtgcgctg gatcgcaccg atcctggcat tcaccgctga cgaactgtgg
241 gagtacctgc cgggcgagcg taacgagtcc gtcatgctca acacctggta cgaaggcctg
301 accgaactgc cggctgactt cgaactgggc cgcgagtact gggagggcgt gatggccgtc
361 aaggttgcgg tgaacaagga actggaagtc cagcgtgcgg ccaaggccgt gggtggcaac
421 ctgcaagccg aagtcaccct gtttgccgaa gacggcctga ccgccgacct ggccaagctg
481 agcaacgagc tgcgcttcgt gctgatcacc tctactgcga gcctggcgcc gtttacccag
541 gctccggcag atgcggtggc taccgaagtg cctggcctta agctcaaagt ggtcaagtcg
601 gccttcccca agtgcgctcg ttgctggcac tgccgtgaag acgtcggcgt gaaccctgag
661 cacccggaaa tctgcggtcg ttgtgtcgac aacatcagcg gtgaaggcga ggttcgccac
721 tatgcctaat gcagacagtc gtttcggacg tctgggctgg ctcgtactga gcctgctggt
781 cctggtcatt gaccaggtca gcaaggctca cttcgagggc tccctggaaa tgttccagca
841 aatcgtggtg atcccggatt atttcagctg gaccctggcc tacaacactg gcgccgcctt
901 cagcttcctc gctgacggcg gtggctggca gcgctggctg tttgctgtga tcgccgtggt
961 ggtaagtgcc gtactggtgg tgtggctcaa gcgcctgggc cgcgacgaca cctggctggc
1021 catcgcgctg gccctagtgt tgggtggcgc gctgggcaac ctgtatgacc gcatcgccct
1081 gggccatgtg atcgacttta tcctggtgca ttggcagaac cgccactact tcccggcgtt
1141 caactttgcc gacagtgcta tcaccgtcgg cgcaatcatg ctggcgctgg atatgttcaa
1201 gagcaagaaa accggagaaa ccgtcaatga ctgatcaggt attggctgag caacgcatcg
1261 gccagaacac ggaagtcact ttgcatttcg cactgcgcct ggagaatggc gacacggtcg
1321 acagcacgtt cgacaaggcc cccgcgacct tcaaggtcgg cgacggcaac ctgctgccgg
1381 gtttcgaagc ggcactgttc ggtttcaagg ccggtgacaa gcgcaccctg caaatcctgc
1441 cggaaaacgc ctttggccag cccaacccgc aaaacgtgca gatcatcccc cgttcgcagt
1501 tccagaacat ggacctgtcg gaaggcttgc tggtgatctt caatgatgcg gcgaacactg
1561 aattgcctgg cgtggtgaaa gcatttgatg acgcgcaagt gaccatcgac ttcaatcacc
1621 cgttggccgg caagacgttg acctttgacg tcgagattat cgacgtcaaa gcgctgtaac
1681 cgaccgtacg cggtctaaaa tgtgggaggg ggcttgctcc cgatagcggt atatcagcta
1741 aatgtgcaga ctgatatacc tatatcggga gcaagccccc tcccaccttt gatctccatt
1801 gtcggttgag tcagtgtcaa attgactcaa tatggctgca atacacgagg cacagcatgc
1861 aaatcaaact cgccaacccc cgtggcttct gcgccggcgt ggaccgggcg atcgaaatcg
1921 tcaatcgcgc cctggaagtc ttcgggccgc cgatttatgt gcgccatgaa gtcgtccata
1981 acaaatttgt ggtcgaagac ttgcgtgcgc gcggggcgat ctttgtcgaa gaactcgatc
2041 aggtgaaaga cgacgtgatc gtcatcttca gtgcccacgg tgtttcccag gctgtacgta
2101 ccgaagcggc gggccgtggc ctgaaggtgt tcgatgccac ctgcccactg gtgaccaagg
2161 tgcatatcga ggtggcgcgc tacagccgcg acggtcgtga gtgcatcctg atcggccacg
2221 ccggtcaccc ggangtagaa ggcaccatgg gccaatacga cgccagcaac ggcggtgcta
2281 tctacctggt cgaggacgag aaagacgtcg ccaacttgca ggtgcacaat ccggaacgcc
2341 tggccttcgt gacccagacc accttgtcca tggacgacac cagccgngta atcgat
//
LOCUS PSEXILES 1128 bp ds-DNA BCT 31-JUL-1990
DEFINITION P.fluorescens X and isoleucyl-tRNA synthetase (ileS) genes,
complete cds.
ACCESSION M35367
KEYWORDS isoleucyl-tRNA synthetase.
SOURCE P.fluorescens (strain NCIB 10586) DNA.
ORGANISM Pseudomonas fluorescens
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1128)
AUTHORS Isaki,L., Beers,R. and Wu,H.
TITLE Nucleotide sequence of the Pseudomonas fluorescens signal peptidase
II (lsp) and flanking genes
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Wu, 18-JUN-1990.
Author address:H.Wu
Uniformed Services University of the Health Sciences
4301 Jones Bridge Rd.
Bethesda, MD 20814
FEATURES from to/span description
pept 90 1025 protein X
pept 1040 > 1128 isoleucyl-tRNA synthetase (ileS)
signal 56 61 TATA box
BASE COUNT 198 a 349 c 349 g 231 t 1 others
ORIGIN
1 tcgattcgac ccatttggcc tgcgctgttg cctgtcgtcc ggggctgggt gtggttataa
61 tcgaccactt tatgagcaag aagcgcgtta tgcagctggt tcgaggtctc cacaacctgc
121 gccccgagca tcggggctgc gtcgccacta ttggcaactt tgacggtgtt caccgtggcc
181 accaggctat cctggcaagg ctgcgcgagc gtgcggtcga gttgggtgtg cccagctgcg
241 tggtgatttt cgagccacaa ccgcgggagt tctttacccc ggaaacagcg ccggcccgct
301 tggcgcttgc gcgacaagct gcaactgctg gcggagaggg cgtggaccgc gtcctctgcc
361 tggctttcaa ccagcgtttg cgcagcctca gcgccgccga gttcgtcgac cgcattctgg
421 tcgatggcct gggtgtacaa cacctggagg ttggtgacga cttccatttc ggttgcgatc
481 gggtcgggga tttcgatttc ctgcaacatg ccggcgtcaa ccagggcttt accgttgaag
541 ccgcccaaac cgtcgaactg gacggcctgc gtgtgagcag cacccaggtg cgtaacgcct
601 tggctgccgc cgacttcgac ctggccgagc gtttgctcgg tcgcccgttc cgcattgccg
661 ggcgggtact gcacggccag aagctggcgc gccaattggg cacgccaact gccaacgtgc
721 aactcaagcg ccgccgagtg ccgctgactg gggtttacct ggtgagcgtc gacatcgacg
781 gccaatcgtg gccgggagtc gccaatatag gcgtcaggcc cacggttgca ggtgatggca
841 aggcccacct ggaagttcac cttttggatt ttgccggtga tttatacgac cggcgtttga
901 cggtggtttt ccaccagaag ctgcgtgaag agcagcgttt cgcctccctg gaggcgttga
961 aaacggcgat caatgcggat gtcgccgccg cccgtgcact agccgcacct agcgcccatc
1021 gctaaccgaa gagccttaaa tgaccgacta taaagccacg ntaaaccttc cggacaccgc
1081 cttcccaatg aaggccggcc tgccacagcg cgaaccgcag tcctgcag
//
LOCUS BLY656 672 bp ss-mRNA PLN 31-JUL-1990
DEFINITION H.vulgare thaumatin II mRNA, complete cds.
ACCESSION M35656
KEYWORDS thaumatin II.
SOURCE H.vulgare (cv. Bomi) endosperm, cDNA to mRNA, clone pUP-23.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 672)
AUTHORS Lazaro,A., Rodriguez-Palenzuela,P., Marana,C., Carbonero,P. and
Garcia-Olmedo,F.
TITLE Signal peptide homology between the sweet protein thaumatin II and
unrelated cereal alpha-amylase/trypsin inhibitors
JOURNAL FEBS Lett. 239, 147-150 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 39 482 thaumatin II precursor
sigp 39 101 thaumatin II signal peptide
matp 102 479 thaumatin II
BASE COUNT 155 a 175 c 209 g 133 t
ORIGIN
1 aagagattga accaacgacc aataaactag tatcaacaat ggcatccgac catcgtcgct
61 tcgtcctctc cggcgccgtc ttgctctcgg tcctcgccgt cgccgccgcc accttggaga
121 gcgtcaagga cgagtgccaa ctaggggtgg acttcccgca taacccgtta gccacctgcc
181 acacctacgt gataaaacgg gtctgcggcc gcggtcccag ccggcccatg ctggtgaagg
241 agcggtgctg ccgggagctg gcggccgtcc cggatcactg ccggtgcgag gcgctgcgca
301 tcctcatgga cggggtgcgc acgccggagg gccgcgtggt tgagggacgg ctcggtgaca
361 ggcgtgactg cccgagggag gagcagaggg cgttcgccgc cacgcttgtc acggcggcgg
421 agtgcaacct atcgtccgtc caggcgccgg gagtacgctt ggtgctactg gcagatggat
481 gacgatgcaa atgcgccaag gtaatgaagc ggagtactgt atacagaata aaagtactcg
541 agtgaaaaca aactcataaa taaaccttgt gagatgtatg cgtatgatct atggtgtgga
601 cagttaaatt gtggccgatt gatgaataaa aaaggttgga acaaattaaa ttgttgtggg
661 ttcatatact at
//
LOCUS SPIFERRI 632 bp ss-mRNA PLN 31-JUL-1990
DEFINITION Spinach ferredoxin I mRNA, complete cds.
ACCESSION M35660
KEYWORDS ferredoxin I.
SOURCE Spinach (var. Monatol) seedlings, cDNA to mRNA, p6SocFd/1.
ORGANISM Spinacia oleracea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE 1 (bases 1 to 632)
AUTHORS Wedel,N., Bartling,D. and Herrmann,R.G.
TITLE Analysis of cDNA clones encoding the entire ferredoxin I precursor
polypeptide from spinach
JOURNAL Botanica Acta 101, 295-300 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 17 460 ferredoxin I precursor
sigp 17 166 ferredoxin I signal peptide
matp 167 457 ferredoxin I
mRNA < 1 632 ferredoxin I mRNA
BASE COUNT 176 a 131 c 139 g 186 t
ORIGIN
1 ttacaaaatt gtaaaaatgg cagcaaccac cacaacaatg atgggcatgg ccaccacctt
61 tgtcccaaaa ccccaagcac caccaatgat ggcggcgctt ccatccaaca ccggccgctc
121 tttgttcgga ctcaagaccg gtagccgtgg cggaaggatg acaatggctg cctacaaggt
181 aaccttggta acacccaccg gtaacgtaga gtttcaatgc ccagacgatg tttacatctt
241 ggatgctgct gaagaagaag gcattgactt gccttactca tgcagagctg ggtcgtgctc
301 ttcatgcgcc ggaaagctta agacaggtag tcttaaccaa gatgatcaga gttttttgga
361 tgacgatcag atcgatgaag gatgggttct tacctgtgct gcttaccctg ttagtgatgt
421 tactattgag acccacaagg aagaggagct tactgcctaa ttcatttttt tttatttttt
481 attattttgt tctcatttga gagggaattg aaagactaaa aaagagtaaa tgcagcgagg
541 agtttttact tcttcgatct gtggtttgta attgtgtatt atcatgttaa tcaattgctc
601 ttataagctt aattactacg taatatatat at
//
LOCUS GEM5SRRNA 108 bp ss-rRNA RNA 31-JUL-1990
DEFINITION G.obscuriglobus 5S ribosomal RNA.
ACCESSION M35170
KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE G.obscuriglobus (strain UQM 2246) ribosomal RNA.
ORGANISM Gemmata obscuriglobus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE 1 (bases 1 to 108)
AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E.
TITLE A unique type of eubacterial 5S rRNA in members of the order
Planctomycetales
JOURNAL J. Mol. Evol. 27, 121-125 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 > 108 5S rRNA
BASE COUNT 25 a 33 c 31 g 19 t
ORIGIN
1 ttccggtgac catacccaaa cggaaacacc cgttcccatt ccgaacacgg ccgtgaaagt
61 ttgaggccga tggtagtgcg tccagcgcga gagtaggtat cgccggcc
//
LOCUS ISO5SRRNA 111 bp ss-rRNA RNA 31-JUL-1990
DEFINITION I.pallida 5S ribosomal RNA.
ACCESSION M35169
KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE I.pallida (strain IS 1B) ribosomal RNA.
ORGANISM Isosphaera pallida
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE 1 (bases 1 to 111)
AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E.
TITLE A unique type of eubacterial 5S rRNA in members of the order
Planctomycetales
JOURNAL J. Mol. Evol. 27, 121-125 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 > 111 5S rRNA
BASE COUNT 19 a 39 c 32 g 21 t
ORIGIN
1 ctccggtgac cataccgtcg gggtcctacc cgttcccatt ccgaacacgg ccgtcaagcc
61 cgccgggcct atgatagtgc gtcagcgcga aagtcggtga tcgccggatc t
//
LOCUS MUSINTZ1 271 bp ds-DNA ROD 31-JUL-1990
DEFINITION Mouse integration site for foreign DNA, 5' flank.
ACCESSION M35630
KEYWORDS integration site.
SEGMENT 1 of 3
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 271)
AUTHORS Makarova,I.V., Tarantul,V.Z. and Gazaryan,K.G.
TITLE Structural features of the integration site of foreign DNA in the
genome of transgenic mice
JOURNAL Mol. Biol. 22, 1235-1242 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rpt 9 50 region of dyad symmetry
BASE COUNT 108 a 48 c 55 g 60 t
ORIGIN
1 gaaaaatcta cgttaataaa acgaactaac ggaacaacat tattacaggt agaaagattc
61 atcagttgag atttaggaat accacatcaa ctaatgcaga tacataacgc caaaaggaat
121 tacgaggcat agtaagagca acactatcat aaccctcgtt taccagacga cgataaaaac
181 caaaatagcg agaggctttt gcaaagaagt tttgccagag ggggtaatag taaatgttta
241 gactggatag cgtccaatat gcggtcgtca a
//
LOCUS MUSINTZ2 300 bp ds-DNA ROD 31-JUL-1990
DEFINITION Mouse integration site for foreign DNA, 3' flank.
ACCESSION M35631
KEYWORDS integration site.
SEGMENT 2 of 3
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Makarova,I.V., Tarantul,V.Z. and Gazaryan,K.G.
TITLE Structural features of the integration site of foreign DNA in the
genome of transgenic mice
JOURNAL Mol. Biol. 22, 1235-1242 (1988)
STANDARD simple staff_review
BASE COUNT 56 a 94 c 81 g 69 t
ORIGIN About 1.5 kb after segment 1.
1 ccagctggtg ctccatgtgc agccaggtcc tggccccact atgtccatcc cagctccagc
61 ccctgcttgc tggcagcagc ctggcagccc caggcacttc tcccagcacg ctgccatgtc
121 actggtcagg tcgggtgctg cctgcctgct gccagcaggc tgtccccggt gtacccactg
181 tttccccact gctaggtcat ctgtgcactg atgcatgcag gtgattctcg gtaatatggc
241 atacgtgttc gggaatgatc gtaatcaaaa aggcgagata agctggtcag agatatatag
//
LOCUS MUSINTZ3 99 bp ds-DNA ROD 31-JUL-1990
DEFINITION Mouse repetitive sequence DNA.
ACCESSION M35632
KEYWORDS .
SEGMENT 3 of 3
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 99)
AUTHORS Makarova,I.V., Tarantul,V.Z. and Gazaryan,K.G.
TITLE Structural features of the integration site of foreign DNA in the
genome of transgenic mice
JOURNAL Mol. Biol. 22, 1235-1242 (1988)
STANDARD simple staff_review
BASE COUNT 12 a 26 c 41 g 20 t
ORIGIN About 1.5 kb after segment 3.
1 agctgggaat cggggggggg ggggggggcg ggtggtggtt tggccaggga tttgggcacc
61 ttcctaagcc ccaaccctaa tgccggtttc cttcccccg
//
LOCUS NPOCAPA 3555 bp ds-DNA VRL 31-JUL-1990
DEFINITION O.pseudotsugata polyhedrosis virus capsid protein p87 gene,
complete cds.
ACCESSION M35445 D00514
KEYWORDS capsid protein; structural protein.
SOURCE O.pseudotsugata polyhedrosis virus DNA.
ORGANISM Orgyia pseudotsugata polyhedrosis virus
Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus;
Nuclear polyhedrosis viruses.
REFERENCE 1 (bases 1 to 3555)
AUTHORS Mueller,R., Pearson,M.N., Russell,R.L.Q. and Rohrmann,G.F.
TITLE A capsid-associated protein of the multicapsid nuclear polyhedrosis
virus of Orgyia pseudotsugata: Genetic location, sequence,
transcriptional mapping, and immunocytochemical characterization
JOURNAL Virology 176, 133-144 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 216 2090 capsid protein p87
mRNA 182 2282 capsid protein p87 mRNA (alt.)
mRNA 209 2282 capsid protein p87 mRNA (alt.)
signal 2194 2199 polyA site
BASE COUNT 946 a 1003 c 880 g 726 t
ORIGIN
1 gaattctgta aaaaaggtta ggccctttat gtcaatcaaa ctgtgctggt caaagtactt
61 ggccgtcaaa aaggttagcg agtcgatttc gcgttggtgc atttgcgctt caaagcgcac
121 gtgttcaaag cggtgctcca gaccgtgcac cgcgttgaac ctcaaattgt acaacaattt
181 gtaagtgtgc atgttgcatt cgatatagcc ttataatgga cgagcagcat tcattacgca
241 ttgccgcgct ggctggcgag atcttgacgc gcgaccgcgc acaggtgaac acaatcatac
301 acagccccga gcgcgcgctg ggccaaaaat tggacgccat tacggcgcta gtcgattcaa
361 tgcagcccgg cacgccgcgc gacgctgctg tgaacgaagc cgcaaacgtt accgcgcaaa
421 gcccaatgtc ggaaactcaa gacccgcagc gcgccaacga caacgtgtca gataccgttg
481 caaatgaaaa cgcgcaaaac ttgttgttgg agggccaaga ccgcgttttg cgacatcgcg
541 ttttgcaaat cgcggtaacg tttttgcagc gcaacaagcg agtaaaagcg aacgcgacca
601 cgttggccca gattgaggaa gcgctccgaa actatgaaac tgcaaaaaac agcggcgcaa
661 gcgacagtgt aatcgacggg tttcttgaac gtgccgaatc tttgtttaac accctaaaaa
721 acatatcgct atccgagctg ttggaccgcg aaagcgcagt gtttgcggac acggaaagcg
781 ctccgcggac gcaaaccgcg gacaactcgc cgccacctgt gagcgaacaa gactttgacc
841 ggttggacat tagccagctc accgattaca tcgagaacaa ttacagggat cagtttgatt
901 tcgacaaaca caacagtgtg gaagacgtgc gcaactttgc aaaaaatctt tggcgcaaca
961 aaacgcgtgt cacgtccacg cctctgcagg aataccagat gccaccgcaa acgcccgcgc
1021 ctctgcaaga ccaaatgcca ccgcaaacgc ccgcgtacgc gacgcccgcg cagcagccaa
1081 gccagccaac gcccgcgcaa acgcccgcgc agcagccaag ccagccaacg cccgcgtacg
1141 ttacgtctgc gcaaacgccc gcgcagcagc caagccagcc aacgcccgtt tccaattaca
1201 gttgggaaag gcgcgtcgcg tcaatgtttg cgaatacaga tttaccacaa aacgtgccgc
1261 tgcccgacag ctacgacacc tcgtccgtca ttgggcaaaa acggcgcaag cggcgcgcgc
1321 cgccgctgcc gccgtactcg tcagacgaag aagacgccgc cccgccacgt tcgccaaaac
1381 gcaaagagtc gctttcctcg tctgaagaag acgaatttga ctacgagcgc gaacaaaaac
1441 gccggcgcga ggaggataaa aactttttgc gcctaaaggc gttggaactg tccaagtacg
1501 ccggcgttaa cgagcgcatg gaaaagattg tgcgcgttac caaggcgatg caacaaacgt
1561 acgactactg caattgcaaa aacacaatca gcggcacgcc cgccgcggcg tcgtttatta
1621 acttgttgcg gcggctgaac acctacaacc tcagccacgt ggaaatgacg gtcaactttt
1681 acgagctctt gtaccccctc acgctgtaca acgacgagtc caaccgcatt gttggctaca
1741 tattcgcggc gaccaactat tttcaaaatt gcgccaaaaa ctttgggcgc atgcgcgccg
1801 agtttaacga gcacggcccg tttgcgcaaa ttgattcgct cgtcatgttt gtcatcaaat
1861 tcaacttttt gtgcgacttg caaacgtttt ttggaaaaat tgacggcttg ccaatgttgg
1921 cccagcccaa catcaagacg cacaccgtgc tggtgatgcg cgacaagatc gtcaagctgg
1981 cctttggcgc gctgcaatac gacaccagtc tgaaaacgga caatcggcgg gaccccaaac
2041 acctgcaacg actcatccag ttgatgaacg ccgattttaa cataatgtaa acaaatgtgt
2101 tagcaattta tttattacaa acagagttac gatattcggt gtacatggtt ttataatctt
2161 tgtgatcaaa gttaaaacgt tttctataca aataataaag cttgttaaac agctgttttc
2221 gttttttttt gtgtcgcaaa caaatttttt gtggcaatgg cgtagcacat ctagaaaatt
2281 aaatttttcg ccgaccgcac actcttcatt aagctcgtta cggctggccg cctatgcgat
2341 aactgcacgt gacacacagg gtgtagtggc cgcatggcgc cgccaccatg cgcttcggct
2401 cgctcatgca aacaatgcac tcgtcgttgg cttcgtcggt aaacacgcgc tcaacagtaa
2461 ttttacaaac ggtaacggcg tctggtttta tgtaataata gcatttggag cggcaaatta
2521 aggaactgtc tttgggcgcc acggcaaaag tggcgccctc gcgccgcaac cgcttgggaa
2581 aggtgcagtt ggcgtcgcgc cgataggcgg gatcgtctat acgcacgcgc agcctgccgt
2641 tcacctttac acaaatcaac ttaacgtgcg ccattgcacg gtcgaatgta ggaccagtgt
2701 caattattta agacaccgtt agcataaccg ctggcccaat ggacgccaaa tacaaggtgg
2761 tcgacgtgga cacctttgcc cgccagctaa tcacggacaa gtgcagcgag ctgattgaaa
2821 ccgaaaatct attgcctgca aacattttgc atgttgttaa acaagcaaga gacaaatact
2881 ttgaagaccc ctcggtcaaa aactacgaat atgtcaaaaa cctgtttttg cgcacaaaat
2941 acatggacga ctcaattgac tacaaaaatt ttaatcggcg cgtgttgctg attgtgttca
3001 aatttgcgct caaccgaggc tccggctact ttccctcgta cagggagctc attgaagtgg
3061 ccgtcaagcg cctcaacaaa atcaaccccg acctcaagag ctcgccgcgc gccatgttgc
3121 agcactacaa cgagtgcctg gaaaacctgg acaacccggt gaccgacgag caccacctgt
3181 taacgtttgg aaaagaagtg gccaccaaaa tgtttatcga ggcgttcgag tttagctacg
3241 ccagcaacaa cgaaatcaac ttaactacca acaagcgcgg ctcggacctg ttcgacccga
3301 taccgatgcc cgcgcccgcg cccgcgcctt cggcgtcttt gttggacaac gtcatgaacg
3361 agcgcaagcg caagttgcag gcgtcggtca cgacaacgcc gcccaaacga tgcaaattgg
3421 ccgaccggcc ggcgcagacc acacaagaca cgccgcgcgc gccgcaaccg gcgcctgtgc
3481 gcgcgcagcg gcccctgttt acgctgtaga cgcgcgcgca gaggcgtcgg tgggcggcgc
3541 gtaatttcgc tgcag
//
LOCUS PLL5SRRNA 110 bp ss-rRNA RNA 31-JUL-1990
DEFINITION P.marina 5S ribosomal RNA.
ACCESSION M35164
KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE P.marina (strain IFAM 1313) ribosomal RNA.
ORGANISM Pirella marina
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE 1 (bases 1 to 110)
AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E.
TITLE A unique type of eubacterial 5S rRNA in members of the order
Planctomycetales
JOURNAL J. Mol. Evol. 27, 121-125 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 > 110 5S rRNA
BASE COUNT 30 a 29 c 25 g 26 t
ORIGIN
1 ttccggtgac catatcgaaa aggtcatacc tgttcccatt ccgaacacag ccgtcaagct
61 tttcgagccg atgatagtac cacaagtgtg aaagtaggta tcgccggatc
//
LOCUS PLLS5RRNAA 111 bp ss-rRNA RNA 31-JUL-1990
DEFINITION Pirella sp. 5S ribosomal RNA.
ACCESSION M35165
KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE Pirella sp. (strain IFAM 1310) ribosomal RNA.
ORGANISM Pirella sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE 1 (bases 1 to 111)
AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E.
TITLE A unique type of eubacterial 5S rRNA in members of the order
Planctomycetales
JOURNAL J. Mol. Evol. 27, 121-125 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 > 111 5S rRNA
BASE COUNT 29 a 29 c 25 g 28 t
ORIGIN
1 ttccggcgat catatcttaa aggttatacc tgttcccatt ccgaacacag cagtcaagct
61 ttaagagccg atgatagtgc ccaccagcgt gaaagtaggt cttgccggat c
//
LOCUS PLLS5RRNAB 111 bp ss-rRNA RNA 31-JUL-1990
DEFINITION Pirella sp. 5S ribosomal RNA.
ACCESSION M35166
KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE Pirella sp. (strain IFAM 1441) ribosomal RNA.
ORGANISM Pirella sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE 1 (bases 1 to 111)
AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E.
TITLE A unique type of eubacterial 5S rRNA in members of the order
Planctomycetales
JOURNAL J. Mol. Evol. 27, 121-125 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 > 111 5S rRNA
BASE COUNT 29 a 31 c 27 g 24 t
ORIGIN
1 ttccggtgac catatggttg tggaaacacc tgttcccttt ccgaacacag cagttaagca
61 caaccagccg atgatagtgc ccaccagtgc gaaagtaggt atcgccggat c
//
LOCUS PLT5SRRNAA 111 bp ss-rRNA RNA 31-JUL-1990
DEFINITION P.limnophilus 5S ribosomal RNA.
ACCESSION M35167
KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE P.limnophilus (strain IFAM 1008) ribosomal RNA.
ORGANISM Planctomyces limnophilus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE 1 (bases 1 to 111)
AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E.
TITLE A unique type of eubacterial 5S rRNA in members of the order
Planctomycetales
JOURNAL J. Mol. Evol. 27, 121-125 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 > 111 5S rRNA
BASE COUNT 26 a 33 c 29 g 23 t
ORIGIN
1 ttccggtgac tttacgcgtg aggaaacact cgttcccatt ccgaacacga cagttaagct
61 cccgcggccg atgatagtgc ccaccagcgt gaaagtaggt tatcgccgga c
//
LOCUS PLT5SRRNAB 110 bp ss-rRNA RNA 31-JUL-1990
DEFINITION P.brasiliensis 5S ribosomal RNA.
ACCESSION M35168
KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE P.brasiliensis (strain IFAM 1448) ribosomal RNA.
ORGANISM Planctomyces brasiliensis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE 1 (bases 1 to 110)
AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E.
TITLE A unique type of eubacterial 5S rRNA in members of the order
Planctomycetales
JOURNAL J. Mol. Evol. 27, 121-125 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 > 110 5S rRNA
BASE COUNT 28 a 26 c 29 g 27 t
ORIGIN
1 ttctggtgac tttacgtctg gggaaacact cgttcccatt ccgaacacga cagttaagcc
61 ctgacggccg atgatagtac tgcaagtgtg aaagtaggtg attgccagac
//
LOCUS MZEPPDK1F1 1272 bp ds-DNA PLN 31-JUL-1990
DEFINITION Z.mays pyruvate,orthophosphate dikinase (PPDK1) gene, 5' end.
ACCESSION X14927
KEYWORDS pyruvate,orthophosphate dikinase.
SEGMENT 1 of 2
SOURCE Z.mays mays (strain B73) DNA, clone H2-lambda-21; and cDNA to mRNA,
clone pPPDK2.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1272)
AUTHORS Glackin,C.A. and Grula,J.W.
TITLE Organ-specific transcripts of different size and abundance derive
from the same pyruvate,orthophosphate dikinase gene in maize
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
C.A.Glackin, 31-MAR-1989, for release after publication.
FEATURES from to/span description
pept 1017 / 1241 pyruvate,orthophosphate dikinase precursor
(PPDK1) (EC 2.7.9.1)
sigp 1017 1241 pyruvate,orthophosphate dikinase signal
peptide
pre-msg 805 > 1272 PPDK1 mRNA and intron
IVS 1242 > 1272 PPDK1 intron A
BASE COUNT 306 a 324 c 327 g 315 t
ORIGIN
1 agtactgaaa gcaatctaaa gacatggagg tggaaggcct gacgtagata gagaagatgc
61 tcttagcttt cattgtcttt cttttgtagt catctgattt acctctctcg tttatacact
121 ggttttttaa acactcctta acttttcaaa ttgtctcttt ctttacccta gactagataa
181 ttttaatggt gattttgcta atgtggcgcc atgttagata gaggtaaaat gaactagtta
241 aaagctcaga gtgataaatc aggctctcaa aaattcataa actgtttttt aaatatccaa
301 atatttttac atggaaaata ataaaattta gtttagtatt aaaaaattca gttgaatata
361 gttttgtctt caaaaattat gaaactgatc ttaattattt ttccttaaaa ccgtgctcta
421 tctttgatgt ctagtttgag acgattatat aatttttttt gtgcttaact acgacgagct
481 gaagtacgta gaaatactag tggagtcgtg ccgcgtgtgc ctgtagccac tcgtacgcta
541 cagcccaagc gctagagccc aagaggccgg aggtggaagg cgtcgcggca ctatagccac
601 tcgccgcaag agcccaagag gccggagctg aaggatgagg gtctgggtgt tcacgaattg
661 cctggaggca ggaggctcgt cgtccggagc cacaggcgtg gagacgtccg ggataaggtg
721 agcagccgct gcgatagggg cgcgtgtgaa ccccgtcgcg ccccacggat ggtataagaa
781 taaaggcatt ccgcgtgcag gattcacccg ttcgcctctc accttttcgc tgtactcact
841 cgccacacac accccctctc cagctccgtt ggagctccgg acagcagcag gcgcggggcg
901 gtcacgtagt aagcagctct cggctccctc tccccttgct ccatatgatc gtgcaaccca
961 tcgagctacg cgcgtggact gccttccctg ggtcggcgca ggaggggatc ggaaggatgg
1021 cggcgtcggt ttccagggcc atctgcgttc agaagccggg ctcaaaatgc accagggaca
1081 gggaagcgac ctccttcgcc cgccgatcgg tcgcagcgcc gaggcccccg cacgccaaag
1141 ccgccggcgt catccgctcc gactccggcg cgggacgggg ccagcattgc tcgccgctga
1201 gggccgtcgt tgacgccgcg ccgatacaga cgaccaaaaa ggtatccctt gcagctctta
1261 gaaactgaat tc
//
LOCUS MZEPPDK1F2 265 bp ds-DNA PLN 31-JUL-1990
DEFINITION Z.mays pyruvate,orthophosphate dikinase (PPDK1) gene, 3' end.
ACCESSION X14929
KEYWORDS pyruvate,orthophosphate dikinase.
SEGMENT 2 of 2
SOURCE Z.mays mays (strain B73) DNA, clones H2-lambda-[21,13].
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 265)
AUTHORS Glackin,C.A. and Grula,J.W.
TITLE Organ-specific transcripts of different size and abundance derive
from the same pyruvate,orthophosphate dikinase gene in maize
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
C.A.Glackin, 31-MAR-1989, for release after publication.
FEATURES from to/span description
pept < 1 23 pyruvate,orthophosphate dikinase (PPDK1) (EC
2.7.9.1) (AA at 3)
BASE COUNT 62 a 62 c 50 g 91 t
ORIGIN Undetermined number of base pairs after segment 1.
1 ctgcagctca ggtgcttgtc tgaggctgcc tcctcgttgg caaccggatt gcctgctgct
61 ggtggatgtg gtgatcaaca gtattattac agagccatgc tatgtgaaca ttactagtag
121 cagtgctcat aaaagctaca atcccatctc cctttttttt ccagtcatgt aaaacttcca
181 aactgctcca tggttcaaaa ctctgttctt caatacatca tcaattatcg attatatatg
241 tggcagaggt tttttctttt ctttc
//
LOCUS MZEPPDK2F 322 bp ds-DNA PLN 31-JUL-1990
DEFINITION Z.mays pyruvate,orthophosphate dikinase (PPDK2) gene, 3' end.
ACCESSION M32081
KEYWORDS pyruvate,orthophosphate dikinase.
SOURCE Z.mays mays (strain B73) DNA, clone H2-lambda-23.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 258)
AUTHORS Glackin,C.A. and Grula,J.W.
TITLE Organ-specific transcripts of different size and abundance derive
from the same pyruvate,orthophosphate dikinase gene in maize
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 322)
AUTHORS Glackin,C.A. and Grula,J.W.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1],[2] kindly submitted
by C.A.Glackin, 13-FEB-1990.
Author address: C.A.Glackin
Phytogen
101 Waverly Dr.
Pasadena, CA 91105
FEATURES from to/span description
pept < 1 23 pyruvate,orthophosphate dikinase (PPDK2) (EC
2.7.9.1) (AA at 3)
mRNA < 1 322 PPDK2 mRNA
BASE COUNT 75 a 74 c 74 g 99 t
ORIGIN
1 ctgcagctca ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc tgttggtgca
61 tgctggtgat taataatact actatgacag agccatatgc tctgtgaaga gtattagtag
121 cagcgctcat aaaagctaca gttccatcta ttttctcagc tatgtaaaac ttccaaactg
181 ttcatgctta aaactgaggg ttttcgtggt gtgagatgtg catgtcgttg ttgaggccat
241 tgctgcacat tccacctatt gaggccctcc tcaaattaag cctcgaacaa gctgatcatc
301 ttttctgaga actctagact cg
//
LOCUS MZEPPDKLR 246 bp ss-mRNA PLN 31-JUL-1990
DEFINITION Z.mays pyruvate,orthophosphate dikinase (PPDK1) mRNA, 3' end.
ACCESSION X14928
KEYWORDS pyruvate,orthophosphate dikinase.
SOURCE Z.mays mays (strain B73) leaf and root, cDNA to mRNA.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 246)
AUTHORS Glackin,C.A. and Grula,J.W.
TITLE Organ-specific transcripts of different size and abundance derive
from the same pyruvate,orthophosphate dikinase gene in maize
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
C.A.Glackin, 31-MAR-1989, for release after publication.
FEATURES from to/span description
pept < 1 23 pyruvate,orthophosphate dikinase (PPDK1) (EC
2.7.9.1) (AA at 3)
BASE COUNT 61 a 59 c 48 g 78 t
ORIGIN
1 ctgcagctca ggtgcttgtc tgaggctgcc tcctcgttgg caaccggatt gcctgctgct
61 ggtggatgtg gtgatcaaca gtattattac agagccatgc tatgtgaaca ttactagtag
121 cagtgctcat aaaagctaca atcccatctc cctttttttt ccagtcatgt aaaacttcca
181 aactgctcca tggttcaaaa ctctgttctt caatacatca tcaattatcg attatatatg
241 tggcag
//
LOCUS BABAPOAI 860 bp ss-mRNA PRI 31-JUL-1990
DEFINITION Baboon apolipoprotein A-I (ApoA-I) mRNA, complete cds.
ACCESSION M35634
KEYWORDS apolipoprotein A-I.
SOURCE Baboon liver, cDNA to mRNA.
ORGANISM Papio hamadryas
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini;
Cercopithecidae; Cercopithecinae.
REFERENCE 1 (bases 1 to 860)
AUTHORS Hixson,J.E., Borenstein,S., Cox,L.A., Rainwater,D.L. and
VandeBerg,J.L.
TITLE The baboon gene for apolipoprotein A-I: Characterization of a cDNA
clone and identification of DNA polymorphisms for genetic studies
of cholesterol metabolism
JOURNAL Gene 74, 483-490 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 3 806 apolipoprotein A-I (ApoA-I)
mRNA < 1 860 ApoA-I mRNA
BASE COUNT 191 a 265 c 286 g 118 t
ORIGIN
1 ggatgaaagc tacggtgctg accttggccg tgctcttcct gacggggagc caggctcggc
61 atttctggca gcaagatgaa cccccacaga ccccctggga tcgagtgaag gacctggtca
121 ctgtgtacgt ggaagcactc aaagacagcg gcaaagacta tgtgtcccag tttgaaggct
181 ccgccttggg aaaacagcta aacctaaagc tcctggacaa ctgggacagc gtgacctcca
241 ccgtcagcaa gctgcgcgaa cagctcggcc cggtgaccca ggagttctgg gataacctgg
301 aaaaggagac agagggcctg aggcaggaga tgagcaagga tctggaggag gtgaaggcca
361 aggtgcagcc ctacctggac gacttccaga agaagtggca ggaggagatg gagctctacc
421 gccagaaggt ggagccgctg cgcgcggagc tccacgaggg cacgcgccag aagctgcacg
481 agctgcacga gaaactgagc ccgctggggg aggaggtgcg cgaccgcgcg cgcgcccacg
541 tggacgcgct gcgcacgcat ctggccccct acagcgatga gctgcgccag cgcctggccg
601 cgcgccttga ggcgctcaag gagaacggcg gcgcccgcct ggcggagtac cacgccaagg
661 ccagcgagca tctgagcacg ctcagcgaga aggccaagcc cgcgctcgag gacctccgcc
721 agggcctgct gcccgtgctg gagagcttca aggtcagctt cctgagcgct ctcgaggagt
781 acactaagaa gctcagcacc cagtgaggcg cccgccgccg ccccccttcc cggtgctcag
841 aataaacgtt tccgaagtgg
//
LOCUS DROARSDA 721 bp ds-DNA INV 31-JUL-1990
DEFINITION D.melanogaster autonomically replicating sequence (ARS).
ACCESSION M35633
KEYWORDS autonomically replicating sequence.
SOURCE D.melanogaster (strain Dm665) DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 721)
AUTHORS Gragerov,A.G., Danilevskaya,O.N., Didichenko,S.A. and Kaverina,E.N.
TITLE Structure of ARS elements from Drosophila melanogaster telomeres
JOURNAL Genetika 24, 407-414 (1988)
STANDARD simple staff_review
FEATURES from to/span description
site 574 584 core sequence
BASE COUNT 237 a 168 c 110 g 206 t
ORIGIN
1 cctcgctata tcaacgtctt cgtgccgcga tccaactgga cggagccatg ttcgggacca
61 ccttcccgca catgttcttg acatgttagc tgccgaactt tagaccccag ccgctcctct
121 ctgctatgca ggagtatatg taagccgagt atacatgggt acatactttc caatacgcca
181 cctcaaaagt tacattgctg tacttttatt acttgcatgt ccttctgggg aataattttg
241 atccagcatt cgctgcgagg tgtgggaaac caggtgggaa acccagtaaa acccaaccag
301 atcaattgaa catattattt acataaattc agtttgcaaa caattttaac taaaatcaat
361 attattcaac tttgtagaac ctcaaaatta tcaataaact taattaaatt taatttaatg
421 atactttata tcacgacccc cagttcagtc tgtttcttca tttcaatcta agcatatatt
481 gttttaattc attctgagag tccaagccca cttctcaatg taatgaatct tgcagatggt
541 aacggcttag ttagtatatc agcaagttgt aaataaatat aaataaatat aacaatcaaa
601 taacgcactt acctccttga ccacagccaa tccaacgcaa ccgaaaacag gagacgggcc
661 ccgcaaacgc aaattaaaat tgccaacatt accgattata actacaaaaa actaacaagc
721 t
//
LOCUS DROARSDB 275 bp ds-DNA INV 31-JUL-1990
DEFINITION D.melanogaster autonomously replicating sequence (ARS).
ACCESSION M36876
KEYWORDS autonomously replicating sequence.
SOURCE D.melanogaster (strain Dm665) deletion mutant pSL41 DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 275)
AUTHORS Gragerov,A.G., Danilevskaya,O.N., Didichenko,S.A. and Kaverina,E.N.
TITLE Structure of ARS elements from Drosophila melanogaster telomeres
JOURNAL Genetika 24, 407-414 (1988)
STANDARD simple staff_review
FEATURES from to/span description
site 146 156 core sequence
BASE COUNT 118 a 56 c 34 g 67 t
ORIGIN
1 tttttgcata ataatttcaa ttgtaaaatg taaacaaacc acgcaatatg ataatgttac
61 cagtccagat tactgtcaaa aacctagtac gaaaaaaata ctaactagga ccaaactatt
121 atttaagctt tataatattt ataaaataat ataaaagctg gaattgccgc ttgaccacag
181 ccaatccaac gcaaccgaaa acaggagacg ggccccgcaa acgcaaatta aaattgccaa
241 cattaccgat tataactaca aaaaactaac aagct
//
LOCUS DROARSDC 290 bp ds-DNA INV 31-JUL-1990
DEFINITION D.melanogaster autonomously replicating sequence (ARS).
ACCESSION M36877
KEYWORDS ARS element.
SOURCE D.melanogaster (strain Dm665) deletion mutant pSL52 DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 290)
AUTHORS Gragerov,A.G., Danilevskaya,O.N., Didichenko,S.A. and Kaverina,E.N.
TITLE Structure of ARS elements from Drosophila melanogaster telomeres
JOURNAL Genetika 24, 407-414 (1988)
STANDARD simple staff_review
FEATURES from to/span description
site 81 90 core sequence
BASE COUNT 103 a 58 c 52 g 77 t
ORIGIN
1 agattactgt caaaaaccta gtacgaaaaa aatactaact aggaccaaac tattatttaa
61 gctttataat atttataaaa taatataaaa gctggaattg ccgctttgat gcgtttgagg
121 tctttgtaaa gggcatcgag tcctcggacc attaagggtt gctgccctgt cgaaactctg
181 acgtttgatt tgtttttggc ccaacgcaac cgaaaacagg agacgggccc cgcaaacgca
241 aattaaaatt gccaacatta ccgattataa ctacaaaaaa ctaacaagct
//
LOCUS GIBINVOL 1569 bp ds-DNA PRI 31-JUL-1990
DEFINITION H.lar involucrin gene, complete cds.
ACCESSION M35447
KEYWORDS involucrin.
SOURCE H.lar keratinocyte DNA, clones 1 and 2.
ORGANISM Hylobates lar
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hylobatidae.
REFERENCE 1 (bases 1 to 1569)
AUTHORS Djian,P. and Green,H.
TITLE The involucrin gene of the gibbon: The middle region shared by the
hominoids
JOURNAL Mol. Biol. Evol. 7, 220-227 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 1569 involucrin
BASE COUNT 472 a 389 c 550 g 158 t
ORIGIN
1 atgtcccagc aacacacact gccagtgacc ctctcccctg ccctcagtca ggagctcctc
61 aagactgttc ctcctccagt caatacccag caggagcaaa tgaaacagcc aactccactg
121 cctcccccat gccagaaggt gcttggtgag ctcccagtgg aggtcccatc aaagcaagag
181 gaaaagcaca tgactactgt aaaggggctg cctgagcaag aatgtgagca acagcagcag
241 gagccacagg agcaggagct gcagcaacag cactgggaac agcatgagga acatcagaaa
301 gcagaaaacc cagagcagca gcttaagcag gagaaagcac aaagggatca gcagctaaac
361 gaacatctgg aagaagagaa gaagctctta gaccagcaac tgaatcaaga gctaatcaag
421 agagatgagc aactgggaat aaagaaagag caactcttgg agctcacaga gcagcaggag
481 gggcagttgg agcacctaga acagcaggag gggcagctgg agctcccaga gcagcaggag
541 gggcagctgg agcacctgga gcagcaggag gggcagctga agcacctgga tcagcagggg
601 aagcagccag agctcccaga gcagcaggtg gcacagctga agcacctgga gcagcaggag
661 gggcagctga agcatctgga gcaccagaag ggggagctgc aggtcccaga ggagcaggtg
721 gggcagctga agtacctgga acagcaggag gggcagctga agcacctgga tcagcaggag
781 aagcagccag agctcccaga gcagcaggtg gggcagctga agcacctgga gcagcaggag
841 gggcagctgg agcacatgga gcaccaggaa gggcagctgg ggctcccaga gcagcaggtg
901 gggcagctga agcagctgga ggagcaggag ggacagccaa agcacctgga ggaggaggag
961 gggcagctga agcacctggt gcagcaggag gggcagctgg agcatctggt gcagcaggag
1021 aggcagctgg agcagcagga ggggaaggtg cagcacctgg agcagcaggt ggagcagctg
1081 aagcacctag aggagcagga ggggcagctg aagcatctgg agcagcagca ggggcagctg
1141 gaggtctcag agcagcaggt ggggcagcca aagcacctgg agcaggaggg gaagcagctg
1201 gagctcccag agcagcaaga gggccagctg aagcacctgg agaagcagga ggcacagctg
1261 gagctcccag agcagcaggt aggacagcca aagcacccgg agcaacagga gaagcagcta
1321 gagcacccag agcagcagga gggacaacta aaacatctgg agcagcagga ggggcagctg
1381 aaagacctgg agcagcagaa ggggcagctg gagcagcagc aggggcagct ggagcagcct
1441 gtgtttgccc cagctccagg ccaggtccaa gacattcaac cagtcctgcc cacaaaggga
1501 gaagcattgc ttcctgtaga gcagcagcag cagaagcagg aggtgcagtg gccacccaaa
1561 cataaataa
//
LOCUS HUMHEMBP 822 bp ss-mRNA PRI 31-JUL-1990
DEFINITION Human eosinophil major basic protein mRNA, complete cds.
ACCESSION M35670
KEYWORDS major basic protein.
SOURCE Human eosinophil, cDNA to mRNA, clone MBP-lambda-9.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 822)
AUTHORS McGrogan,M., Simonsen,C., Scott,R., Giffith,J., Ellis,N.,
Kennedy,J., Campanelli,D., Nathan,C. and Gabay,J.
TITLE Isolation of a complementary DNA clone encoding a precursor to
human eosinophil major basic protein
JOURNAL J. Exp. Med. 168, 2295-2308 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 49 717 eosinophil major basic protein precursor
sigp 49 94 eosinophil major basic protein signal peptide
matp 95 714 eosinophil major basic protein
mRNA < 1 822 eosinophil major basic protein mRNA
BASE COUNT 173 a 219 c 236 g 194 t
ORIGIN
1 gggaagatct aaagacccag gaaggtctct gggtgggata aagccaagat gaaactcccc
61 ttacttctgg ctcttctatt tggggcagtt tctgctcttc atctaaggtc tgagacttcc
121 acctttgaga cccctttggg tgctaagacg ctgcctgagg atgaggagac accagagcag
181 gagatggagg agaccccttg cagggagctg gaggaagagg aggagtgggg ctctggaagt
241 gaagatgcct ccaagaaaga tggggctgtt gagtctatct cagtgccaga tatggtggac
301 aaaaacctta cgtgtcctga ggaagaggac acagtaaaag tggtgggcat ccctgggtgc
361 cagacctgcc gctacctcct ggtgagaagt cttcagacgt ttagtcaagc ttggtttact
421 tgccggaggt gctacagggg caacctggtt tccatccaca acttcaatat taattatcga
481 atccagtgtt ctgtcagcgc gctcaaccag ggtcaagtct ggattggagg caggatcaca
541 ggctcgggtc gctgcagacg ctttcagtgg gttgacggca gccgctggaa ctttgcgtac
601 tgggctgctc accagccctg gtcccgcggt ggtcactgcg tggccctgtg tacccgagga
661 ggctactggc gtcgagccca ctgcctcaga agacttcctt tcatctgttc ctactgagct
721 ggtcccagcc gacagttcag agctgccctc tcctgggcat ggcctcccct cctctgcttg
781 ccatccctcc ctccacctcc ctgcaataaa atgggtttta ct
//
LOCUS HUMIGHVX 529 bp ds-DNA PRI 31-JUL-1990
DEFINITION Human Ig germline H-chain gene V-III region subgroup, exons 1 and
2.
ACCESSION M35415
KEYWORDS germline; immunoglobulin heavy chain; variable region;
variable region subgroup VH-III.
SOURCE Human fetal liver DNA, clone lambda-VH-26.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 529)
AUTHORS Matthyssens,G. and Rabbitts,T.H.
TITLE Structure and arrangement of human heavy chain variable region
genes
JOURNAL immune system 1, 132-138 (1981)
STANDARD simple staff_review
FEATURES from to/span description
pept 29 74 Ig heavy chain V-region precursor, exon 1
/hgml_locus_uid="LK0080M" /nomgen="IGHV"
/map="14q32.33"
179 / 483 Ig H-chain V-region precursor, exon 2
sigp 29 74 Ig H-chain signal peptide
179 189 Ig H-chain signal peptide
matp 190 / 483 Ig H-chain
IVS 75 178 Ig H-chain V-region intron A
iDNA 484 > 529 Ig iDNA
BASE COUNT 131 a 106 c 165 g 127 t
ORIGIN
1 gatcaggact gaacagagag aactcaccat ggagtttggg ctgagctggc tttttcttgt
61 ggctatttta aaaggtaatt catggagaaa tagaaaaatt gagtgtggaa tggataagag
121 tgagagaaac agtggatacg tgtggcagtt tctgaccagg gtttcttttt gtttgcaggt
181 gtccagtgtg aggtgcagct gttggagtct gggggaggct tggtacagcc tggggggtcc
241 ctgagactct cctgtgcagc ctctggattc acctttagca gctatgccat gagctgggtc
301 cgccaggctc cagggaaggg gctggagtgg gtctcagcta ttagtggtag tggtggtagc
361 acatactacg gagactccgt gaagggccgg ttcaccatct caagagacaa ttccaagaac
421 acgctgtatc tgcaaatgaa cagcctgaga gccgaggaca cggccgtata ttactgtgcg
481 aaagacacag tgagggaagt cattgtgagc ccagacacaa acctccctg
//
LOCUS HUMPRAPP 256 bp ss-mRNA PRI 31-JUL-1990
DEFINITION Human amyloid beta precursor protein (ABPP) mRNA, 5' end.
ACCESSION M35675
KEYWORDS amyloid beta.
SOURCE Human fetal muscle, cDNA to mRNA, clone lambda-HAP2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 256)
AUTHORS Mita,S., Sadlock,J., Herbert,J. and Schon,E.A.
TITLE A cDNA specifying the human amyloid beta precursor protein (ABPP)
encodes a 95-kDa polypeptide
JOURNAL Nucleic Acids Res. 16, 9351-9351 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 8 > 256 amyloid beta precursor protein (ABPP)
/hgml_locus_uid="LG0136J" /nomgen="APP"
/map="21q21.2"
BASE COUNT 42 a 77 c 93 g 44 t
ORIGIN
1 ggaggatatg aagtccatct tcacttcaga gatctcctcc gtcttgatat ttgtcaaccc
61 agaacctggt cgagtggtca gtttcctcgg cagcggtagg cgagagcacg cggaggagcg
121 tgcgcggggg ccccgggaga cggcggcggt ggcggcgcgg gcagagcaag gacgcggcgg
181 atcccactcg cacagcagcg cactcggtgc cccgcgcagg gtcgcgatgc tgcccggttt
241 ggcactgctc ctgctg
//
LOCUS MUSRP32A 901 bp ds-DNA ROD 31-JUL-1990
DEFINITION Mouse ribosomal protein L32' (L32') gene, complete cds.
ACCESSION M35397
KEYWORDS ribosomal protein L32'.
SOURCE Mouse cell line L1210 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 901)
AUTHORS Jacks,C.M., Biltz,R.E. and Hackett,P.B.
TITLE Analysis of potential expression of highly related members of the
ribosomal protein L32 gene family
JOURNAL Nucleic Acids Res. 16, 10751-10764 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 255 662 ribosomal protein L32'
mRNA 200 > 695 L32' mRNA (alt.)
mRNA 203 > 695 L32' mRNA (alt.)
mRNA 204 > 695 L32' mRNA (alt.)
mRNA 207 > 695 L32' mRNA (alt.)
mRNA 246 > 695 L32' mRNA (alt.)
mRNA 247 > 695 L32' mRNA (alt.)
BASE COUNT 290 a 203 c 210 g 198 t
ORIGIN
1 gtccaacaag tatttccttg gccaggctta gcgttcttcg agtttatctt ctagggggaa
61 aaaatcaact agcaacatat aaaaaggtct tatttgtcca ggaaaaaaaa aggaagcaac
121 attatagatt gactcttgca tagtatgtat tttggaatcc gagtaatgca acagtataat
181 gtggttaaga acaaagactc tggctccctc ctcggcactg cctacgaggt ggctaccttc
241 tgttatgcag catcatggct gccctccggc ctctggtgaa gcccaagatc gtcaaaaaga
301 ggaccaaaaa gttcatcaga caccagtcag accgatatgt gaaaattaag tggaactgga
361 ggaaacccag aggcatcgac aacagggtgc ggagaaggtt caagggccag atcctgatgc
421 ccaacatcgg ttacaggagc aacaagaaaa ccaagcacac gctgtctagt ggcttccgca
481 agttcctggt ccacaatatc aaggagccgg aggtgctgtt gatgtgcaac aaatcttacc
541 gtgctgagat tgctcacaac gtgtcctcta agaaccgaaa agccattgta gaaagagcag
601 cacagctggc catcagagtc accaatccca acaccaggct acacagcgaa gaaaatgaat
661 agatggcttg tgtacatgtt ttgtgtttaa ataaaatcac aaaaactgga aaaaaaaaag
721 aataaagact ctgaggctgg aagagggtgt caaatgcccc tggaactgga gttacagaca
781 attgtgagcc accccccccc catgggagct ggtgatggaa cttgggtccc ctggaagaga
841 ggcaagcact attaattatg gagccatttc ttcagccaaa catcccatat tttaatccga
901 a
//
LOCUS SPIB559AC 597 bp ds-DNA ORG 31-JUL-1990
DEFINITION S.oleracea chloroplast apocytochrome b-559 gene, complete cds.
ACCESSION M35673
KEYWORDS apocytochrome b-559.
SOURCE S.oleracea chloroplast DNA.
ORGANISM Chloroplast Spinacia oleracea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae;
Spinacia oleracea.
REFERENCE 1 (bases 1 to 597)
AUTHORS Herrmann,R.G., Alt,J., Schiller,B., Widger,W.R. and Cramer,W.A.
TITLE Nucleotide sequence of the gene for apocytochrome b-559 on the
spinach plastid chromosome: Implications for the structure of the
membrane protein
JOURNAL FEBS Lett. 176, 239-244 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 152 403 apocytochrome b-559
pept 412 531 ORF 39
BASE COUNT 173 a 120 c 117 g 187 t
ORIGIN
1 attcggtata ctctaaagac acctttggta ctatattggc aatcctacaa gtattttacg
61 tttttacgtg gagtaaaaag taaatagaaa ttgactgatt tcatctttta cggatttgcc
121 tttgactgta caagaatatg tggagctcag tatgtctgga agcacaggag aacgttcttt
181 tgctgatatt attaccagta ttcgatactg ggttattcat agcattacta taccttccct
241 attcattgcg ggttggttat tcgtcagcac aggtttagct tacgatgtgt ttggaagccc
301 tcgtccaaac gaatatttca cagagagccg acaaggaatt ccattaataa ctggccgttt
361 tgactctttg gaacaacttg atgaatttag tagatccttt taggaggccc aatgaccata
421 gatcgaacct atccaatttt tacagtgcgc tggctggcta ttcacggact ggctgtacct
481 accgtttctt ttttggggtc aatatccgca atgcagttca tccaacgata aacaaaaccg
541 aatccgaatt atagagctac gacacaatca aacccgaacg aacaaaatgt tgaattg
//
LOCUS SYNQBRNA 672 bp ss-RNA SYN 31-JUL-1990
DEFINITION Synthetic lambda-N protein RNA.
ACCESSION M35652
KEYWORDS lambda-N protein.
SOURCE Synthetic RNA, clone pDM1001.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 672)
AUTHORS Mills,D.R.
TITLE Engineered recombinant messenger RNA can be replicated and
expressed inside bacterial cells by an RNA bacteriophage replicase
JOURNAL J. Mol. Biol. 200, 489-500 (1988)
STANDARD simple staff_review
FEATURES from to/span description
RNA 157 516 lambda N-protein complementary RNA
BASE COUNT 122 a 190 c 183 g 177 t
ORIGIN
1 gggaggagag agggcaaagc agatccccct ctcactcgta agagtaattg tgataattta
61 ttcacaatta ggcgccatat tggcgccctc ccccaggtta cgcctcgtgt agagacgcaa
121 ccttcgtgcc ctggaagacc ccgtagggcc tcgaggccaa tatctaagta actagataag
181 aggaatcgat tttcccttaa ttttctggcg tccactgcat gttatgccgc gttcgccagg
241 cttgctgtac catgtgcgct gattcttgcg ctcaatacgt tgcaggttgc tttcaatctg
301 tttgtggtat tcagccagca ctgtaaggtc tatcggattt agtgcgcttt ctactcgtga
361 tttcggtttg cgattcagcg agagaatagg gcggttaact ggttttgcgc ttaccccaac
421 caacagggga tttgctgctt tccattgagc ctgtttctct gcgcgacgtt cgcggcggcg
481 tgtttgtgca tccatctgga ttctcctgtc agttagcctc gaggcgcttt cgcgctctcc
541 caggtgacgc ctcgtgaaga ggcgcgacct tcgtgcgttt cggcgacgca cgagaaccgc
601 cacgctgctt cgcagcgtgg ccccttcgcg cagcccgctg cgcgaggtga ccccccgaag
661 ggggcttccc ca
//
LOCUS SYNT4ENDV 424 bp ds-DNA SYN 31-JUL-1990
DEFINITION Synthetic T4 endonuclease V (T4endV) gene, complete cds.
ACCESSION M35392
KEYWORDS T4 endonuclease V.
SOURCE Synthetic gene, expressed in E.coli.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 424)
AUTHORS Inaoka,T., Miura,K. and Ohtsuka,E.
TITLE Chemical synthesis of the T4 endonuclease V gene and its expression
in Escherichia coli
JOURNAL Nucleic Acids Symp Ser 17, 105-108 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 5 421 T4 endonuclease V (T4endV)
BASE COUNT 120 a 90 c 86 g 128 t
ORIGIN
1 cgatatgacg cgtatcaacc ttactttagt atccgagtta gctgaccaac acttaatggc
61 tgaataccgt gaattgccgc gtgtttttgg tgcagttcgt aagcacgtag caaacggtaa
121 acgtgttcgt gacttcaaaa tcagtcctac ttttatcctt ggcgcaggtc atgttacatt
181 cttctacgat aagctcgagt tcttacgcaa gcgtcaaatt gagcttatag ctgaatgttt
241 gaaacgtggc ttcaatatca aggatactac agtccaggac atcagtgaca ttcctcaaga
301 attccgtggt gattatattc cccatgaagc ttctattgct atatcacaag ctcgtttaga
361 tgaaaaaatt gcacaacgtc ctacttggta caaatactac ggtaaggcga tttatgcatg
421 atag
//
LOCUS TOMACS 1819 bp ss-mRNA PLN 31-JUL-1990
DEFINITION Tomato 1-aminocyclopropane-1-carboxylate synthase mRNA, complete
cds.
ACCESSION M34289
KEYWORDS 1-aminocyclopropane-1-carboxylate synthase.
SOURCE Tomato (strain Mill., cv. Orlando), fruit, cDNA to mRNA.
ORGANISM Lycopersicon esculentum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 1819)
AUTHORS Van Der Straeten,D., Van Wiemeersch,L., Goodman,H.M. and
Van Montagu,M.
TITLE Cloning and sequence of two different cDNAs encoding
1-aminocyclopropane-1-carboxylate synthase in tomato
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4859-4863 (1990)
STANDARD unannotated staff_entry
FEATURES from to/span description
pept 154 1611 1-aminocyclopropane-1-carboxylate synthase
mRNA < 1 1819 1-aminocyclopropane-1-carboxylate synthase
mRNA
BASE COUNT 610 a 285 c 365 g 559 t
ORIGIN
1 ccaaacacat aatactttta atacaattag ttatttatta gaagtattta aagtaaagca
61 cttgtgagtt gtgtacattt tattaatctt catcttctta attctcttca gtttttaatt
121 tcttcacttc taaactcatt tagtaaaaaa aaaatgggat ttgagattgc aaagaccaac
181 tcaatcttat caaaattggc tactaatgaa gagcatggcg aaaactcgcc atattttgat
241 gggtggaaag catacgatag tgatcctttc caccctctaa aaaaccccaa cggagttatc
301 caaatgggtc ttgctgaaaa tcagctttgt ttagacttga tagaagattg gattaagaga
361 aacccaaaag gttcaatttg ttctgaagga atcaaatcat tcaaggccat tgccaacttt
421 caagattatc atggcttgcc tgaattcaga aaagcgattg cgaaatttat ggagaaaaca
481 agaggaggaa gagttagatt tgatccagaa agagttgtta tggctggtgg tgccactgga
541 gctaatgaga caattatatt ttgtttggct gatcctggcg atgcattttt agtaccttca
601 ccatactacc cagcatttaa cagagattta agatggagaa ctggagtaca acttattcca
661 attcactgtg agagctccaa taatttcaaa attacttcaa aagcagtaaa agaagcatat
721 gaaaatgcac aaaaatcaaa catcaaagta aaaggtttga ttttgaccaa tccatcaaat
781 ccattgggca ccactttgga caaagacaca ctgaaaagtg tcttgagttt caccaaccaa
841 cacaacatcc accttgtttg tgacgaaatc tacgcagcca ctgtctttga cacgcctcaa
901 ttcgtcagta tagctgaaat cctcgatgaa caggaaatga cttactgcaa caaagattta
961 gttcacatcg tctacagtct ttcaaaagac atggggttac caggatttag agtcggaatc
1021 atatattctt ttaacgacga tgtcgttaat tgtgctagaa aaatgtcgag tttcggttta
1081 gtatctacac aaacgcaata ttttttagcg gcaatgccat cggacgaaaa attcgtcgat
1141 aattttctaa gagaaagcgc gatgaggtta ggtaaaaggc acaaacattt tactaatgga
1201 cttgaagtag tgggaattaa atgcttgaaa aataatgcgg ggcttttttg ttggatggat
1261 ttgcgtccac ttttaaggga atcgactttc gatagcgaaa tgtcgttatg gagagttatt
1321 ataaacgatg ttaagcttaa cgtctcgctt ggatcttcgt ttgaatgtca agagccaggg
1381 tggttccgag tttgttttgc aaatatggat gatggaacgg ttgatattgc gctcgcgagg
1441 attcggaggt tcgtaggtgt tgagaaaagt ggagataaat cgagttcgat ggaaaagaag
1501 caacaatgga agaagaataa tttgagactt agtttttcga aaagaatgta tgatgaaagt
1561 gttttgtcac cactttcgtc acctattcct ccctcaccat tagttcgtta agacttaatt
1621 aaaagggaag aatttaattt atgttttttt atattttgaa aaaaatttgt aagaataaga
1681 ttataatagg aaaagaaaat aagtatgtag gatgaggagt attttcagaa atagttgtta
1741 gcgtatgtat tgacaactgg tctatgtact tagacatcat aatttgtctt agctaattaa
1801 tgaatgcaaa agtgaagtt
//
LOCUS TTHRGG 873 bp ds-DNA BCT 31-JUL-1990
DEFINITION T.thermophilus 23S ribosomal RNA gene.
ACCESSION M35674
KEYWORDS 23S ribosomal RNA.
SOURCE T.thermophilus (strain HB8; ATCC 27634) DNA.
ORGANISM Thermus thermophilus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 873)
AUTHORS Hartmann,R.K., Vogel,D.W., Kroeger,B., Ulbrich,N. and Erdmann,V.A.
TITLE Studies on rDNA from the extreme thermophilic eubacterium Thermus
thermophilus HB8
JOURNAL FEBS Lett. 218, 215-221 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 873 23S RNA gene
BASE COUNT 167 a 248 c 321 g 137 t
ORIGIN
1 ctggccgtga agatgcggcc tacccgtggc aggacgaaaa gaccccgtgg agctttactg
61 cagcctggtg ttggctcttg gtcgcgcctg cgtaggatag gtgggagcct gtgaaccccc
121 gcctccgggt gggggggagg cgccggtgaa ataccaccct ggcgcggctg ggggcctaac
181 cctcggatgg ggggacagcg cttggcgggc agtttgactg gggcggtcgc ctcctaaaag
241 gtaacggagg cgcccaaagg tcccctcagg cgggacggaa atccgccgga gagcgcaagg
301 gtagaagggg gcctgactgc gaggcctgca agccgagcag gggcgaaagc cgggcctagt
361 gaaccggtgg tcccgtgtgg aagggccatc gatcaacgga taaaagttac cccggggata
421 acaggctgat ctcccccgag cgtccacagc ggcggggagg tttggcacct cgatgtcggc
481 tcgtcgcatc ctggggctga agaaggtccc aagggttggg ctgttcgccc attaaagcgg
541 cacgcgagct gggttcagaa cgtcgtgaga cagttcggtc tctatccgcc acgggcgcag
601 gaggcttgag gggggctctt cctagtacga gaggaccgga agggacgcac ctctggtttc
661 ccagctgtcc ctccaggggc ataagctggg tagccatgtg cggaagggat aaccgctgaa
721 agcatctaag cgggaagccc gccccaagat gaggcctccc acggcgtcaa gccggtaagg
781 acccgggaag accacccggt ggatgggccg ggggtgtaag cgccgcgagg cgttgagccg
841 accggtccca atcgtccgag gtcttgaccc ctc
//
LOCUS WHTCPRS2A 780 bp ds-DNA ORG 31-JUL-1990
DEFINITION T.aestivum chloroplast ribosomal protein S2 gene, complete cds.
ACCESSION M35396
KEYWORDS ribosomal protein S2.
SOURCE T.aestivum (cv. Mardler) chloroplast DNA.
ORGANISM Chloroplast Triticum aestivum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Triticum aestivum.
REFERENCE 1 (bases 1 to 780)
AUTHORS Hoeglund,A.-S. and Gray,J.C.
TITLE Nucleotide sequence of the gene for ribosomal protein S2 in wheat
chloroplast DNA
JOURNAL Nucleic Acids Res. 15, 10590-10590 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 46 756 ribosomal protein S2
BASE COUNT 266 a 118 c 164 g 232 t
ORIGIN
1 ccgtaatgga atggtaggat taaaaaaaaa ggtaatgtgg aaaaaatgac aagaagatat
61 tggaacatta atttgaaaga gatgatagaa gcaggagttc attttggtca tggtattaag
121 aaatggaatc ctaaaatggc cccttacatt tcggcaaagc gtaaaggtac tcatattata
181 aatctcgcta gaacggctcg ttttttatca gaagcttgtg atttagtttt tgatgcagca
241 agtcagggaa caagtttctt aattgttggt accaaaaaaa gagcaacaga tttagtagca
301 tcagctgcaa taagggctcg ttgtcattat gttaataaaa agtggttcag tggtatgtta
361 acgaattggt cgattacgaa aactagactt tctcaattta gagacttaag agcagaagaa
421 aaaatgggaa aattccacca tctcccaaaa agagatgtgg caatcttgaa gagaaaatta
481 tctaccttgc aaaggtatct cggcgggatc aaatatatga cgagattgcc agacattgtg
541 atcgtccttg atcagcaaaa agagtatata gctcttcggg aatgtgccat tttggggatt
601 cctactattt ctttagtcga tacaaattgt gacccggatc tcgcgaatat atcgattcca
661 gccaacgatg acactatgac ttcaattcga ttgattctta acaaattagt attttcaatt
721 tgtgagggcc gttctctcta tataagaaat cgttgattaa gaatatatag tgaattcttg
//
LOCUS YSCVDACA 852 bp ss-mRNA PLN 31-JUL-1990
DEFINITION Yeast voltage-dependent anion-selective channel (VDAC) protein
mRNA, complete cds.
ACCESSION M34907
KEYWORDS voltage-dependent anion-selective channel protein.
SOURCE Yeast, cDNA to mRNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 852)
AUTHORS Forte,M., Guy,H.R. and Mannella,C.A.
TITLE Molecular genetics of the VDAC ion channel: Structural model and
sequence analysis
JOURNAL J Bioenerg Biomembr 19, 341-350 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 852 voltage-dependent anion-selective channel
(VDAC) protein
BASE COUNT 224 a 213 c 172 g 243 t
ORIGIN
1 atgtctcctc cagtttacag cgatatctcc agaaatatca atgacctatt gaacaaggat
61 ttctatcatg ctaccccagc tgcctttgat gtgcaaacaa caaccgccaa tggcattaag
121 ttctcattga aggctaaaca gcctgtcaaa gacggtccac tgtctactaa cgtggaagca
181 aagttgaatg acaagcaaac cggcttgggt ctaactcaag gctggtctaa cacaaacaac
241 ttgcaaacca aattagagtt tgccaacttg acccctggtc taaagaacga attgatcact
301 tctttgactc caggcgtcgc caagtccgcc gtcttaaaca ctacgttcac agaacctttc
361 ttcaccgcaa gaggtgcctt tgacttgtgt ttgaagtcac caacatttgt tggtgactta
421 actatggccc acgaaggtat tgttggtggc gcagagtttg gttacgatat cagcgccggt
481 tccatttctc gttatgccat ggctttaagt tatttcgcca aagactactc cttgggcgct
541 acattgaaca acgagcaaat aactaccgtt gacttcttcc aaaacgtcaa cgccttttta
601 caggtcgcta aggctacaat gatgaactgc aaactaccta actccaatgt caacatcgaa
661 ttcgccacta gatatttgcc tgatgcatct tcccaagtta aggctaaggt gtccgattcc
721 ggtattgtca ctttggctta caagcaattg ttaagacctg gcgtcactct gggtgtcggt
781 tcctctttcg atgctttgaa gttgtctgaa cctgttcaca agctaggttg gtctttgtcc
841 ttcgacgctt ga
//
LOCUS ATHHISH3A 411 bp ds-DNA PLN 31-JUL-1990
DEFINITION A.thaliana histone H3 gene, complete cds.
ACCESSION M35387
KEYWORDS histone H3.
SOURCE A.thaliana (strain Strasbourg) adult leaf DNA.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 411)
AUTHORS Chaubet,N., Chaboute,M.-E., Philipps,G. and Gigot,C.
TITLE Histone genes in higher plants: Organization and expression
JOURNAL Dev. Genet. 8, 461-473 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 411 histone H3
BASE COUNT 119 a 111 c 109 g 72 t
ORIGIN
1 atggctcgta ccaagcagac ggctaggaaa tcaaccggag gaaaggctcc aaggaagcag
61 ctggcgacaa aagcggcgag gaaatctgct ccggccaccg gaggagtgaa gaagccacac
121 agattcagac ccggaactgt tgctttaaga gaaatcagga agtaccagaa gagcactgag
181 cttcttatcc gtaaactccc tttccagagg cttgtcagag agatcgctca ggacttcaaa
241 accgatcttc gtttccagag cagtgccgtc gccgctcttc aggaagctgc tgaagcttac
301 ctcgtcggac tcttcgaaga caccaatctc tgcgccattc acgccaagag ggtcacgatc
361 atgcccaagg acatccagct cgcgagaaga atcagaggag aacgagctta g
//
LOCUS BLYCHORDA 391 bp ss-mRNA PLN 31-JUL-1990
DEFINITION Barley C-hordein storage protein, 3' end.
ACCESSION M35610
KEYWORDS C-hordein; storage protein.
SOURCE Barley 20-day old endosperm, cDNA to mRNA, clone pc hor1-3.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 391)
AUTHORS Rasmussen,S.K. and Brandt,A.
TITLE Nucleotide sequences of cDNA clones for C-hordein polypeptides
JOURNAL Carlsberg Res. Commun. 51, 371-379 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 219 C-hordein storage protein
mRNA < 1 391 C-hordein storage protein mRNA
signal 310 315 polyA signal (put.)
signal 365 370 polyA signal (put.)
BASE COUNT 134 a 120 c 57 g 80 t
ORIGIN
1 tttccccagc cccaggaacc atttccccag caaccacaac aaccattccc actgcaaccg
61 caacaaccat tcccccaaca accacaacaa ccatttcccc agccccaaca accattccgc
121 caacaagcag aactaataat tccccagcaa cctcaacaac cattccctct gcagccacac
181 caaccatata cacaacaaac catctggagt atggtctagt catcaggggc ctatgaagcg
241 acaagttgta atactaaatg ggtggatcat catcctttag tcaacggagt gtttaatgta
301 atgatgataa ataaagtgat gtggaccatc atgtgtaacc ccggcctata ctagttcaaa
361 tgagaataaa agaccaaaga aagttcttgt c
//
LOCUS BLYCHORDB 255 bp ss-mRNA PLN 31-JUL-1990
DEFINITION Barley C-hordein storage protein, 3' end.
ACCESSION M35611
KEYWORDS C-hordein; storage protein.
SOURCE Barley 20-day old endosperm, cDNA to mRNA, clone pc-919.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 255)
AUTHORS Rasmussen,S.K. and Brandt,A.
TITLE Nucleotide sequences of cDNA clones for C-hordein polypeptides
JOURNAL Carlsberg Res. Commun. 51, 371-379 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 207 C-hordein storage protein
BASE COUNT 97 a 96 c 24 g 38 t
ORIGIN
1 tacccacaac aaccacaacc atttccacaa cagcccatcc cacaacaacc acaaccatac
61 ccacaacaac cacaaccatt ttcacaacag cccatcccac aacaaccaca accataccca
121 caacaaccac aaccatttcc acaacaaccc atccctctgc agccacacca accatataca
181 caacaaacca tctggagtat ggtctagtca tcaggggcct atgaagcgac aagttgtaat
241 actaaatggg tggat
//
LOCUS BOVFGFAA 668 bp ss-mRNA MAM 31-JUL-1990
DEFINITION Bovine acidic eye-derived fibroblast growth factor (EDGF II) mRNA,
complete cds.
ACCESSION M35608
KEYWORDS acidic eye-derived fibroblast growth factor.
SOURCE Bovine retina, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 668)
AUTHORS Alterio,J., Halley,C., Brou,C., Soussi,T., Courtois,Y. and
Laurent,M.
TITLE Characterization of a bovine acidic FGF cDNA clone and its
expression in brain and retina
JOURNAL FEBS Lett. 242, 41-46 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 91 558 acidic eye-derived fibroblast growth factor
BASE COUNT 173 a 184 c 170 g 141 t
ORIGIN
1 ggatcctctt tcccttctac tggagaggaa aagccctcag cctgcaagct gttcagcctt
61 gaaacagcca caaccagcag ctgctgagcc atggctgaag gagaaaccac gaccttcacg
121 gccctgactg agaagtttaa cctgcctcta ggcaattaca agaagcccaa gctcctctac
181 tgcagcaacg ggggctactt cctgagaatc ctcccagatg gcacagtgga tgggacgaag
241 gacaggagcg accagcacat tcagctgcag ctctgtgcgg aaagcatagg ggaggtgtat
301 attaagagta cggagactgg ccagttcttg gccatggaca ccgacgggct tttgtacggc
361 tcacagacac ccaatgagga atgtttgttc ctggaaaggt tggaggaaaa ccattacaac
421 acctacatat ccaagaagca tgcagagaag cattggttcg ttggtctcaa gaagaacgga
481 aggtctaaac tcggtcctcg gactcacttc ggccagaaag ccatcttgtt tctccccctg
541 ccagtctcct ctgattaaag aaatctgttg tgggtgctga gccactccag aggaatctga
601 aggggtcctc acctggctga ccccagattg taccctttac cattggccgt gctaacccct
661 ggcccaca
//
LOCUS CHKGHA 778 bp ss-mRNA VRT 31-JUL-1990
DEFINITION Chicken growth hormone mRNA, complete cds.
ACCESSION M35609
KEYWORDS growth hormone; somatotropin.
SOURCE Chicken hypophysis, cDNA to mRNA, clone pcGH1.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 778)
AUTHORS Zhvirblis,G.S., Gorbulev,V.G., Rubtsov,P.M., Karapetyan,R.V.,
Zhuravlev,I.V., Fisinin,V.I., Skryabin,K.G. and Baev,A.A.
TITLE Genetic engineering of peptide hormones: I. Cloning and primary
structure of cDNA of chicken growth hormone
JOURNAL Mol. Biol. 21, 1324-1328 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 36 686 growth hormone
mRNA < 1 778 GH mRNA
BASE COUNT 178 a 257 c 196 g 147 t
ORIGIN
1 gttcaagcaa cacctgagca actctcccgg caggaatggc tccaggctcg tggttttctc
61 ctctcctcat cgctgtggtc acgctgggac tgccgcagga agctgctgcc accttccctg
121 ccatgcccct ctccaacctg tttgccaacg ctgtgctgag ggctcagcac ctccacctcc
181 tggctgccga gacatataaa gagttcgaac gcacctatat tccggaggac cagaggtaca
241 ccaacaaaaa ctcccaggct gcgttttgtt actcagaaac catcccagct cccacgggga
301 aggatgacgc ccagcagaag tcagacatgg agctgcttcg gttttcactg gttctcatcc
361 agtcctggct gactcccgtg caatacctaa gcaaggtgtt cacgaacaac ttggtttttg
421 gcacctcaga cagagtgttt gagaaactaa aggacctgga agaagggatc caagccctga
481 tgagggagct ggaggaccgc agcccgcggg gcccgcagct cctcagaccc acctacgaca
541 agttcgacat ccacctgcgc aacgaggacg ccctgctgaa gaactacggc ctgctgtcct
601 gcttcaagaa ggatctgcac aaggtggaga cctacctgaa ggtgatgaag tgccggcgct
661 tcggagagag caactgcacc atctgaggcc ccgtgcctgc gccatggctg atggccctgt
721 cccccccccc cccttcctcc ccgtcaccaa aaacacgagg aataaacccc acagcgcc
//
LOCUS ECOTRYSU3A 85 bp ss-tRNA RNA 31-JUL-1990
DEFINITION E.coli su+III tRNA-Tyr suppressor.
ACCESSION M35560
KEYWORDS transfer tRNA-Tyr.
SOURCE E.coli (strain MB100) RNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 85)
AUTHORS Abelson,J.N., Gefter,M.L., Barnett,L., Landy,A., Russell,R.L. and
Smith,J.D.
TITLE Mutant tyrosine transfer ribonucleic acids
JOURNAL J. Mol. Biol. 47, 15-28 (1970)
STANDARD simple staff_entry
FEATURES from to/span description
tRNA 1 85 suppressor Tyr-tRNA su-III
anticdn 35 37 suppressor Tyr-tRNA su-III anticodon cta
modified 8 8 s4u
modified 17 17 gm
modified 38 38 ms2i6a
modified 40 40 p
modified 54 54 p
mut 15 15 g in wild-type; a in su-15
mut 17 17 gm in wild-type; a in su-17
mut 31 31 g in wild-type; a in su-12
BASE COUNT 19 a 28 c 22 g 16 t
ORIGIN
1 ggtggggttc ccgagcggcc aaagggagca gactctaaat ctgccgtcat cgacttcgaa
61 ggttcgaatc cttcccccac cacca
//
LOCUS HUMG6PDC 608 bp ss-mRNA PRI 31-JUL-1990
DEFINITION Human glucose-6-phosphate dehydrogenase (G6PD) mRNA, 3' end.
ACCESSION M35604
KEYWORDS glucose-6-phosphate dehydrogenase.
SOURCE Human fibroblast, cDNA to mRNA, clone pGD6405.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 608)
AUTHORS Toniolo,D., Persico,M.G., Battistuzzi,G. and Luzzatto,L.
TITLE Partial purification and characterization of the messenger RNA for
human glucose-6-phosphate dehydrogenase
JOURNAL Mol. Biol. Med. 2, 89-103 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 103 glucose-6-phosphate dehydrogenase (G6PD)
/hgml_locus_uid="LH0033J" /nomgen="G6PD"
/map="Xq28"
mRNA < 1 608 G6PD mRNA
signal 587 592 polyA signal
BASE COUNT 143 a 143 c 183 g 138 t 1 others
ORIGIN Chromosome Xq28.
1 ggagaagaac cttggggtgc cagggtgggg gcagaagcat gggctgggtt ccggttcatc
61 ctcctccacc ctgccgtgtg tgtgggcaca agaggacatc taaccacctg ctccttggac
121 gaggccccca ggggtggtag aggctggaag gaagccacat caggaggacg ccactccggc
181 ccttcaccct tgccaagtga gctgctcaca gtgtggtcag ggctgcgcgt gctggaggcc
241 ctcctgcctg ggccttgtgg ggcaaatatt gggtccccag gctggaaaga tggacagagg
301 cccaatgggt gaaggctttg aagagcacac agaagcccct ggccccccac gagagctgga
361 gagccatgta tatggcttca aagccaccta cggcagggac acactcgtga gcatgtgtgg
421 cctgcagttc aggtgataca tttaccagtg ttcttgtttg tgtggtgcca ggaaattgat
481 tttggaaaaa gtgaaataac attaaaggtg aatgtgaggc ttctactttt atccaaaagg
541 agctatatta gctaggctgt ttctgatayc caatcattgg tttaacaata aaggcaattt
601 gtttaatc
//
LOCUS MSGMPB57A 553 bp ds-DNA BCT 31-JUL-1990
DEFINITION M.bovis immunogenic protein MPB57 gene, complete cds.
ACCESSION M35389
KEYWORDS immunogenic protein MPB57.
SOURCE M.bovis (strain BCG) DNA.
ORGANISM Mycobacterium bovis
Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae.
REFERENCE 1 (bases 1 to 553)
AUTHORS Yamaguchi,R., Matsuo,K., Yamazaki,A., Nagai,S., Terasaka,K. and
Yamada,T.
TITLE Immunogenic protein MPB57 from Mycobacterium bovis BCG: Molecular
cloning, nucleotide sequence and expression
JOURNAL FEBS Lett. 240, 115-117 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 223 525 immunogenic protein MPB57
binding 209 213 MPB57 ribosomal binding site
signal 29 34 MPB57 -35 region
signal 50 55 MPB57 -10 region
BASE COUNT 113 a 170 c 184 g 86 t
ORIGIN
1 ttggcaacca ggaagcaagg ggccgccctt gagtgctagc actctcatgt atagagtgct
61 agatggcaat cggctaaccc ctgcgtcggc acccgcgacg acggcgcagg gcgcggacgt
121 cacctggtaa ttcggacggt tcgggcacgc cccggaccga ccgccaactc cggtccgggc
181 gagcgtcccg ggctctgatc caaatagtgg agggctccaa tcgtggcgaa ggtgaacatc
241 aagccactcg aggacaagat tctcgtgcag gccaacgagg ccgagaccac gaccgcgtcc
301 ggtctggtca ttcctgacac cgccaaggag aagccgcagg agggcaccgt cgttgccgtc
361 ggccctggcc ggtgggacga ggacggcgag aagcggatcc cgctggacgt tgcggagggt
421 gacaccgtca tctacagcaa gtacggcggc accgagatca agtacaacgg cgaggaatac
481 ctgatcctgt cggcacgcga cgtcgttggc cgtcgtttca agtagtagag cgtgttcccc
541 ggcgatcccg tcg
//
LOCUS MUSMTMTAAA 78 bp ds-DNA ORG 31-JUL-1990
DEFINITION Mouse mitochondrial maternally transmitted antigen-alpha
(Mta-alpha) mRNA, 5' end.
ACCESSION M35494
KEYWORDS maternally transmitted antigen.
SOURCE Mouse mitochondrial DNA.
ORGANISM Mitochondrion Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae;
Mus musculus.
REFERENCE 1 (bases 1 to 78)
AUTHORS Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and
Yonekawa,H.
TITLE Molecular definition of a mitochondrially encoded mouse minor
histocompatibility antigen
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 78 maternally transmitted antigen-alpha
(Mta-alpha) (gtg start codon)
BASE COUNT 22 a 23 c 8 g 25 t
ORIGIN
1 gtgttcttta ttaatatcct aacactcctc gtccccattc taatcgccat agccttccta
61 acattagtag aacgcaaa
//
LOCUS MUSMTMTABA 78 bp ds-DNA ORG 31-JUL-1990
DEFINITION Mouse mitochondrial maternally transmitted antigen-beta (Mta-beta)
mRNA, 5' end.
ACCESSION M36861
KEYWORDS maternally transmitted antigen.
SOURCE Mouse mitochondrial DNA.
ORGANISM Mitochondrion Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae;
Mus musculus.
REFERENCE 1 (bases 1 to 78)
AUTHORS Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and
Yonekawa,H.
TITLE Molecular definition of a mitochondrially encoded mouse minor
histocompatibility antigen
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 78 maternally transmitted antigen-beta (Mta-beta)
(gtg start codon)
BASE COUNT 21 a 24 c 9 g 24 t
ORIGIN
1 gtgttcttta ttaatgccct aacactcctc gtccccattc taatcgctat agccttccta
61 acactagtag aacgcaaa
//
LOCUS MUSMTMTACA 78 bp ds-DNA ORG 31-JUL-1990
DEFINITION Mouse mitochondrial maternally transmitted antigen-gamma
(Mta-gamma) mRNA, 5' end.
ACCESSION M36862
KEYWORDS maternally transmitted antigen.
SOURCE Mouse mitochondrial DNA.
ORGANISM Mitochondrion Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae;
Mus musculus.
REFERENCE 1 (bases 1 to 78)
AUTHORS Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and
Yonekawa,H.
TITLE Molecular definition of a mitochondrially encoded mouse minor
histocompatibility antigen
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 78 maternally transmitted antigen-gamma
(Mta-gamma) (gtg start codon)
BASE COUNT 21 a 23 c 9 g 25 t
ORIGIN
1 gtgttcttta ttaatgtcct aacactcctc gtccccattc taatcgctat agccttccta
61 acactagtag aacgcaaa
//
LOCUS MUSMTMTADA 78 bp ds-DNA ORG 31-JUL-1990
DEFINITION Mouse mitochondrial maternally transmitted antigen-delta
(Mta-delta) mRNA, 5' end.
ACCESSION M36863
KEYWORDS maternally transmitted antigen.
SOURCE Mouse mitochondrial DNA.
ORGANISM Mitochondrion Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae;
Mus musculus.
REFERENCE 1 (bases 1 to 78)
AUTHORS Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and
Yonekawa,H.
TITLE Molecular definition of a mitochondrially encoded mouse minor
histocompatibility antigen
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 78 maternally transmitted antigen-delta
(Mta-delta) (gtg start codon)
BASE COUNT 22 a 23 c 8 g 25 t
ORIGIN
1 gtgttcttta ttaataccct aacactcctc gtccctattc taatcgctat agccttccta
61 acactagtag aacgcaaa
//
LOCUS MZEHISH3A 411 bp ds-DNA PLN 31-JUL-1990
DEFINITION Z.mays histone H3 gene, complete cds.
ACCESSION M35388
KEYWORDS histone H3.
SOURCE Z.mays (L. cv. INRA 258) 5-6 day plantlet DNA.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 411)
AUTHORS Chaubet,N., Chaboute,M.-E., Philipps,G. and Gigot,C.
TITLE Histone genes in higher plants: Organization and expression
JOURNAL Dev. Genet. 8, 461-473 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 411 histone H3
BASE COUNT 77 a 154 c 128 g 52 t
ORIGIN
1 atggcccgca cgaagcagac ggcgcgcaag tcgacgggcg gcaaggcgcc ccgcaagcag
61 ctggccacca aggcggcgcg caagtcggcg ccggcaaccg gtggcgtgaa gaagcctcac
121 cgcttccgcc ccggcaccgt cgcgctccgg gagattcgca agtaccagaa gagcacggag
181 ctgctcatcc gcaagctgcc cttccagcgc ctcgtccgtg agatcgcgca ggatttcaag
241 accgacctcc gcttccagtc ctccgctgtc gccgcgctgc aggaggccgc cgaggcctac
301 ctcgtggggc tcttcgagga caccaacctc tgcgccatcc acgccaagcg cgtcaccatc
361 atgcccaagg acatccagct cgcgcgccgc atcaggggcg agagggcttg a
//
LOCUS RATFIBAA 511 bp ss-mRNA ROD 31-JUL-1990
DEFINITION Rat alpha-fibrinogen mRNA, 3' end.
ACCESSION M35601
KEYWORDS alpha-fibrinogen.
SOURCE Rat (strain Wistar) male 16-hr regenerating liver, cDNA to mRNA,
clone lambda-7A.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 511)
AUTHORS Sobczak,J., Lotti,A.-M., Taroux,P. and Duguet,M.
TITLE Molecular cloning of mRNA sequences transiently induced during rat
liver regeneration
JOURNAL Exp. Cell Res. 169, 47-56 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 282 alpha-fibrinogen
BASE COUNT 145 a 124 c 100 g 141 t 1 others
ORIGIN
1 cctgaacttg gttcctttta tgacagccgc tttggttcac tcacaaagaa cttcaaagaa
61 tttggcagta agacctctga ttctgacatc ttcacagaca tcgagaaccc tagctcccat
121 gtacctgagt tttcttncag tagtaaaacc tcaactgtca ggaaacaagt aaccaagagc
181 tataaaatgg cagatgaggc agcaagtgaa gctcaccaag aaggagacac tcgaaccacc
241 aagaggggcc gagctcgcac aatgagaggt atccacgctt aactctggga agttgccctg
301 accccctaga ctaagttaac catttctgca aagtgcttac caggcgcgct ctttcttaac
361 ctcttctagt gctttggtgg aatctcattt tttttcatgc tagactgtac gttccttggg
421 ggcagggact ttgccatgtg tctatttctc taattcccca atgcataaca gtgcagtcat
481 ttctcaataa atatatttta aataaatgaa c
//
LOCUS RATFIBBA 261 bp ss-mRNA ROD 31-JUL-1990
DEFINITION Rat beta-fibrinogen mRNA, 3' end.
ACCESSION M35602
KEYWORDS beta-fibrinogen.
SOURCE Rat (strain Wistar) male 16-hr regenerating liver, cDNA to mRNA,
clone lambda-7B.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 261)
AUTHORS Sobczak,J., Lotti,A.-M., Taroux,P. and Duguet,M.
TITLE Molecular cloning of mRNA sequences transiently induced during rat
liver regeneration
JOURNAL Exp. Cell Res. 169, 47-56 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 168 beta-fibrinogen
BASE COUNT 73 a 49 c 61 g 78 t
ORIGIN
1 tgtcatgcag ccaatccaaa cggcagatac tactggggtg gacagtacac ctgggacatg
61 gcaaagcatg gcacagatga tggtgtagta tggatgaatt ggaaggggtc atggtactca
121 atgaggaaga tgagtatgaa gatcaggccc ttcttcccac agcaatagtc cccaatagcg
181 tagatttttg ctcttctgta tgtgacaaca tttttgtaca ttatgttatt ggaattttct
241 ttcatacatt atattcctac t
//
LOCUS RATNKBA 727 bp ss-mRNA ROD 31-JUL-1990
DEFINITION Rat neurokinin B mRNA, complete cds.
ACCESSION M35607
KEYWORDS neurokinin B.
SOURCE Rat cerebral cortex, cDNA to mRNA, clone cRNKp[1,6].
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 727)
AUTHORS Bonner,T.I., Affolter,H.-U., Young,A.C. and Young,W.S.III.
TITLE A cDNA encoding the precursor of the rat neuropeptide neurokinin B
JOURNAL Mol. Brain Res. 2, 243-249 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 115 465 neurokinin B precursor
sigp 115 162 neurokinin B signal peptide
matp 358 387 neurokinin B
mRNA < 1 727 neurokinin B mRNA
BASE COUNT 155 a 228 c 179 g 165 t
ORIGIN
1 agctgggact ggttcctgcc acctccccga gcccctgcac tcttgtctct ctctgtctcc
61 acagcagcgt caccggctcc cggatcctgc cagccgtcca gctctacagg caccatgagg
121 agcgccatgc tgttcgcggc tgtcctcgcc ctcagcttgg catggacctt cggggctgcg
181 tgtgaggaac ctcaggagca gggtgggagg ctcagtaagg actctgatct ctctctgcta
241 cctccacccc tgcttcggag actctacgac agccgctcca tctctctgga aggattgctg
301 aaagtgctga gcaaggctag cgtgggaccg aaggagacat cacttccaca gaaacgtgac
361 atgcacgact tctttgtggg acttatgggc aagaggaaca gccaaccaga cactcccgct
421 gatgtggttg aagagaacac ccccagcttt ggcgtcctca aataggccag cagtgcagaa
481 aagcactcca ctctcagacc ctggactgca tcataaagac agggttcctg tggcggtccc
541 agtgcctgcg ctcctgcttc cctgcctgca aggtcctcct gttggctccc ttccctactc
601 tgcacagatg ctgcatatga acagcctcta cccccatatc aattatggtt tctgtagtgt
661 cctgcattaa aaataccatg tctcctcctc aacaataaag ggtttttaca atggagtgac
721 tgaaaag
//
LOCUS RABIGHCPS 577 bp ss-mRNA MAM 31-JUL-1990
DEFINITION Rabbit nonproductive VHa1 mRNA for Ig(mu) heavy chain constant
region.
ACCESSION X13699
KEYWORDS constant region; immunoglobulin; immunoglobulin heavy chain;
mu-immunoglobulin.
SOURCE rabbit (Oryctolagus cuniculus).
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 577)
AUTHORS Mage,R.G.
TITLE ;
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 577)
AUTHORS Mage,R.G., Newman,B.A., Harindranath,N., Bernstein,K.E.,
Becker,R.S. and Knight,K.L.
TITLE Evolutionary conservation of splice sites in sterile C-mu
transcripts and of immunoglobulin heavy chain (IgH) enhancer
sequences
JOURNAL Mol. Immunol. 26, 1007-1010 (1989)
STANDARD simple automatic
COMMENT *source: strain=allotype-defined NIH VHal Ckb9 Rabbit No CW247-4;
*source: library=pBR322; clone=pB3; The sequence overlaps with that
reported by Bernstein et. al. in J. Immunol. 132:490-495(1984)
J00666 (F-I haplotype). The cDNA pB3 sequence starts at a position
corresponding to pos. 70 in a germline DNA from JH-C-mu intron 80
bp downstream of the Ig(mu) enhancer. See x13700.
EMBL features not translated to GenBank features:
key from to description
SITE 549 >577 CH1 domain of Ig(mu)H chain
[1] Author address Mage R.G., Lab of Immunology, NIAID , NIH,
Building 10,11-N311, NIH, Bethesda, MD 20892.
Submitted (05-DEC-1988) on tape to the EMBL data library
BASE COUNT 145 a 120 c 132 g 180 t
ORIGIN
1 tagaaagctc aagcttagct tatcaatttc taaattgtat tcatttttaa agtaagatat
61 tttagaaatt aaacttacag atggtttatt aatgcgccca tgaaactttt taaaattact
121 ttgttatttc ttcatacggt aattaacctc aaacttcaga gacctcaaag cattattttt
181 gcaacgcctg cgaccgtcat ttttaagagt ctgttttctc tatgattaga agagacgttt
241 tgaggcttca tttcagtatt tttcaagact gcttttcaag tattcatttt tgcaccagtt
301 ctagctgagg tggacctaag tgtcctgagt tattgggtta aacgtgtgac ttggtgaatt
361 ggagagccat cagacagtgc agagacagga gtcgggagct ctgcagaagc cgggccaccc
421 acgccttgag gtttcccaag accgccgggt ctgggggtgc agccgtggct tctggttagt
481 caggaattcc tgtttggagt ggacttggag gacccgcagg acgtgccagt gaccagccac
541 tgcacatgtg agcctgtcat ctccaactct gtacccg
//
LOCUS RABIGHENH 1257 bp ds-DNA MAM 31-JUL-1990
DEFINITION Rabbit IgH gene JH-C(mu) intron with enhancer region yielding a
nonproductive C(mu) transcript.
ACCESSION X13700
KEYWORDS enhancer; germ line; immunoglobulin; immunoglobulin heavy chain;
mu-immunoglobulin.
SOURCE rabbit (Oryctolagus cuniculus).
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 1257)
AUTHORS Mage,R.G.
TITLE ;
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 1257)
AUTHORS Mage,R.G., Newman,B.A., Harindranath,N., Bernstein,K.E.,
Becker,R.S. and Knight,K.L.
TITLE Evolutionary conservation of splice sites in sterile C-mu
transcripts and of immunoglobulin heavy chain (IgH) enhancer
sequences
JOURNAL Mol. Immunol. 26, 1007-1010 (1989)
STANDARD simple automatic
COMMENT *source: library=cosmid; clone=COS 3.3; tissue=sperm; *source:
strain=allotype-defined a3n80f71g75 rabbit No X314-6. see x13699
for corresponding transcript seq
EMBL features not translated to GenBank features:
key from to description
IVS 1 1257 JH-C(mu) intron
SITE 591 598 conserved enhancer octamer
TRANSCR 670 1227 homolog. to pB3 cDNA (X13699)
IVS 1228 >1257 intron (to form pB3 cDNA)
[1] Author address Mage R.G., Lab of Immunology, NIAID , NIH\,
Building 10,11-N311, NIH, Bethesda, MD 20892.
Submitted (05-DEC-1988) on tape to the EMBL data library
BASE COUNT 328 a 248 c 320 g 361 t
ORIGIN
1 gttagaaaga gaacggttgt cagcagtaca aggaaggagg gccgtggagc cccaaaggcg
61 ctgcctctga cagggaccta gcaaggcctg ggcagcccaa ccgtgctggg acggcgccca
121 ccaggtgtgc agggacatgg tgtccgaagt ggcttctgag aaaagtggct gagaaaattg
181 agaatgaaga ctttttgttc atgtatattc agctttttgt gtttttttaa tgtaaaggac
241 ttttagatta ttattgttca gtatctgtct gggattggca caagacagca gagaaacagt
301 ggagtggcac cagcaggccc caatacagga ctgagttttg acaaaaatta tttaagttta
361 atttttaatt aattgagcaa tgccagatta gtgcatgagt gacagccaag atggccgcgc
421 atccggccgg acacctgcag caggtggcag gaagcatgtc gcactgagag tctattttag
481 gaagccaaaa agcaccctgt gtgaatttat agctcctgtt tttgaagtgg tggttttcaa
541 gtggttttcc cggatccgaa agtgttgcag agcaaaacaa cacctgggta gtttgcatat
601 ctaaaatagg acaggggact gccagggttc ctcttgtcac tatttgaatt tattttttca
661 atctgaactt agcaacttct agtttgctca agcttagctt atcaatttct aaattgtatt
721 catttttaaa gtaagatatt ttagaaatta aacttacaga tggtttatta atgcgcccat
781 gaaacttttt aaaattactt tgttatttct tcatacggta attaacctca aacttcagag
841 acctcaaagc attatttttg caacgcctgc gaccgtcatt tttaagagtc tattttctct
901 atgattagaa gagacgtttt gaggcttcat ttcagtattt ttcaagactg cttttcaagt
961 attcattttc acaccagttc tagctgaggt ggacctaagt gtcctgagtt attgggttaa
1021 acgtgtgact tggtgaattg gagagccatc agacagtgca gagacaggag tcgggagctc
1081 tgcagaagcc gggccaccca cgccttgagg tttcccagga ccgccgggtc tgggggtgca
1141 gccgtggctt ctggttagtc aggaattcct gtttggagtg gacttggagg acccgcagga
1201 cgtgccggtg accagccact gcacatggtg agaggggcag ctgctctgcc acagatc
//
LOCUS ECOTN5011 60 bp ds-DNA BCT 31-JUL-1990
DEFINITION E.coli mercuric ion resistance transposon (Tn501), 5' end.
ACCESSION M25453
KEYWORDS transposon.
SEGMENT 1 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 60)
AUTHORS Brown,N.L., Choi,C.-L., Richmond,M.H. and Grinsted,J.
TITLE Nucleotide sequences at the ends of the mercury resistance
transposon, Tn501
JOURNAL Nucleic Acids Res. 8, 1933-1945 (1980)
STANDARD simple staff_review
COMMENT Transposon Tn501 was originally isolated in Pseudomonas aeruginosa.
FEATURES from to/span description
recomb 12 13 E.coli DNA end/Tn501 DNA start
BASE COUNT 21 a 11 c 17 g 11 t
ORIGIN
1 ctgaaactat gagggggaac cgcagaattc ggaaaaaatc gtacgctaag ctaacggtgt
//
LOCUS ECOTN5012 60 bp ds-DNA BCT 31-JUL-1990
DEFINITION E.coli mercuric ion resistance transposon (Tn501), 3' end.
ACCESSION M25454
KEYWORDS transposon.
SEGMENT 2 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 60)
AUTHORS Brown,N.L., Choi,C.-L., Grinsted,J., Richmond,M.H. and
Whitehead,P.R.
TITLE Nucleotide sequences at the ends of the mercury resistance
transposon, Tn501
JOURNAL Nucleic Acids Res. 8, 1933-1945 (1980)
STANDARD simple staff_review
COMMENT Transposon Tn501 was originally isolated in Pseudomonas aeruginosa.
FEATURES from to/span description
recomb 48 49 Tn501 DNA end/E.coli DNA start
BASE COUNT 15 a 17 c 13 g 15 t
ORIGIN About 8.2 kb after segment 1
1 cccggaaaac cttagcgtac gattttttcc gaattctgcg agccccccta tgagatgaga
//
LOCUS COXCBBEA 1749 bp ds-DNA BCT 31-JUL-1990
DEFINITION Plasmid QpRS (from Coxiella burnetii) surface protein (cbbE') gene,
complete cds.
ACCESSION M29982
KEYWORDS surface protein.
SOURCE Plasmid QpRS (from Coxiella burnetti, strain Biotzere, isolate
priscilla Q177) DNA.
ORGANISM Plasmid QpRS
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1749)
AUTHORS Minnick,M.F., Heinzen,R.A., Frazier,M.E. and Mallavia,L.P.
TITLE Characterization and expression of the cbbE' gene of Coxiella
burnetii
JOURNAL J. Gen. Microbiol. 136, 1099-1107 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.A.Heinze, 20-NOV-1989.
FEATURES from to/span description
pept 196 1683 E' protein
signal 157 162 cbbE' -35 region (pot.)
signal 178 183 cbbE' -10 region (pot.)
signal 1691 1713 rho-independent transcription terminator
(pot.)
binding 187 193 ribosome binding site (pot.)
BASE COUNT 579 a 335 c 343 g 492 t
ORIGIN
1 gtcgacgtct gcctttgccg ctatagttca ccttcaaata aacaacaata tttatttata
61 aattttcgga gtccatcatg ccgctgagaa aggaaacttt taaagtaagt tacacattag
121 gtttttttgt gaaaataaat cgactagctt agcttattta atcctttctt aaaattttat
181 aataacggag agactatgcc taaaaaactc gtacccaaag actatgaata tatccatctg
241 gatcttacca ccggtgaaat aaactttacg tcgtttaatt cgcttgaaga attgcaagcc
301 tctttaaaag aaggtcagat ttttttccac aaaagtgtca tctttgaaga aaaaccagaa
361 agtggggaaa tttactcccc taaactgata agccagatat atcgaaaaga acaggaactc
421 tttgaaataa gggaaaaaag taagggacat ccactaccgg ttactaagaa actgcttaag
481 agagggcagg gaacgatagt gtgttgcggt atttacacaa aagaactctt gaaaaacgta
541 gcggaaaagg gacagtacga tacccaatgt gacgatctaa atttgggaat ttttcacgta
601 cgcgcccata aacctttagg catcgcgcaa aggcttgtgc atcttccgct tcccgaggat
661 gcttcttccg ctgcagtagc cactgaaaat ttattcggct taatacgatt tatactcgtc
721 aatgatcccg ctaagaaaaa aatttactta cctatctctt gttttgcaat tgagaagcgt
781 atagaacaag agcatataat tggatattct cagaaagata gcctggcgct ctctcagcga
841 gcttattatg aatataagaa ggacggaacg cttatcgggc tagtcgcatt gatcggtgtt
901 gacgtaaaga tagatggtaa gctaggtttt ttatatcatc cggtgtggcg tgagaaacaa
961 tgggcattaa aattcaatga aaaaatgttt tattgtgctg tttcccgtgc agaaaaagaa
1021 aaagtcttta agccgcctta ttatttggag ccgactgcga taattgtcga tgtaactgaa
1081 acgcccgtta agcgcttaaa gaatacaagt gaggactatt tatggttgga ggtttcccaa
1141 atttcggcaa aattctctct tttttgtgca caaaacaatt tgaaattgga gaaggctgat
1201 tcaaaaaata aatctccttt cgtggcccta tcaatggaat cgattagtga attaacaggc
1261 gaacagaaaa gagcttttgt caaaatcttg aatattcccg gaatcatttt ctcttcctca
1321 accctagcta aagcgagact agagagcaaa cttcaatata ttggaccagc attaattgaa
1381 gccgctgcag atggaaattt caccgatgtg gttgatataa taaatcgaat agaaccactc
1441 tatgattaca aagagatttt aaaagaggca ttaaaaacac aacgtttggg aacgggcaat
1501 actcctttac aggaggctat caaaggacag catacaagcc tagttaagta cttcagttcg
1561 ctatccgctt cgttgaaagt cataaatcat aaaaatcatc aaggattaac agcactcaat
1621 ttcgctacag ccattggatc atcccctgcg atcgtacaag agcttgaatg gtgctcccaa
1681 taaagttagg aatggtccgc cgcgcaaatc attgctaata cgttatctga aaaggtgtca
1741 tccccttcc
//
LOCUS DROOPA 240 bp ds-DNA INV 31-JUL-1990
DEFINITION D.melanogaster neurogenic locus mastermind opa repeat sequence.
ACCESSION M35653
KEYWORDS mastermind locus.
SOURCE D.melanogaster DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 240)
AUTHORS Yedvobnick,B., Smoller,D., Young,P. and Mills,D.
TITLE Molecular analysis of the neurogenic locus mastermind of Drosophila
melanogaster
JOURNAL Genetics 118, 483-497 (1988)
STANDARD simple staff_review
BASE COUNT 68 a 81 c 67 g 24 t
ORIGIN
1 ccgccgcacg gcatgaatcc gcagcaacag cagcagcagc aacaacagca gcagcagcaa
61 cagcaacagg cccagcagca acatggtcaa atgatgggac aaggacagcc gggtcgctat
121 aacgactacg gcggcggctt tcccaatgac tttggcctgg gacccaatgg tccgcagcag
181 cagcagcagc aggcgcagca acagcagccg cagcagcaac acctgccgcc gcagttccat
//
LOCUS DRORS1 60 bp ds-DNA INV 31-JUL-1990
DEFINITION D.melanogaster neurogenic locus mastermind RS repeat sequence.
ACCESSION M35654
KEYWORDS mastermind locus.
SOURCE D.melanogaster DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 60)
AUTHORS Yedvobnick,B., Smoller,D., Young,P. and Mills,D.
TITLE Molecular analysis of the neurogenic locus mastermind of Drosophila
melanogaster
JOURNAL Genetics 118, 483-497 (1988)
STANDARD simple staff_review
BASE COUNT 29 a 29 c 1 g 1 t
ORIGIN
1 gcacacacac acacacacac acacacacac acacacacac acacacacac acacacacat
//
LOCUS DRORS2 60 bp ds-DNA INV 31-JUL-1990
DEFINITION D.melanogaster neurogenic locus mastermind RS repeat sequence.
ACCESSION M35655
KEYWORDS mastermind locus.
SOURCE D.melanogaster DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 60)
AUTHORS Yedvobnick,B., Smoller,D., Young,P. and Mills,D.
TITLE Molecular analysis of the neurogenic locus mastermind of Drosophila
melanogaster
JOURNAL Genetics 118, 483-497 (1988)
STANDARD simple staff_review
BASE COUNT 20 a 27 c 7 g 6 t
ORIGIN
1 agttgccctg ctcacacaca cgcacacaca cacacacaca cacacacatg catgcacgcc
//
LOCUS ECOCS3 612 bp ds-DNA BCT 31-JUL-1990
DEFINITION E.coli major fimbrial subunit of CS3 gene, complete cds.
ACCESSION M35657
KEYWORDS fimbrial subunit.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 612)
AUTHORS Boylan,M., Smyth,C.J. and Scott,J.R.
TITLE Nucleotide sequence of the gene encoding the major subunit of CS3
fimbriae of enterotoxigenic Escherichia coli
JOURNAL Infect. Immun. 56, 3297-3300 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 82 588 fimbrial subunit precursor of CS3
sigp 82 126 CS3 fimbrial subunit signal peptide
matp 127 585 CS3 fimbrial subunit
BASE COUNT 197 a 114 c 110 g 191 t
ORIGIN
1 agcagtacag ttccaggtac gtatactgtt ggtcttaacg taaccagtaa tgttatttaa
61 agtgaatgta tgagggattc gatgttaaaa ataaaatact tattaatagg tctttcactg
121 tcagctatga gttcatactc actagctgca gcggggccca ctctaaccaa agaactggca
181 ttaaatgtgc tttctcctgc agctctggat gcaacttggg ctcctcagga taatttaaca
241 ttatccaata ctggcgtttc taatactttg gtgggtgttt tgactctttc aaataccagt
301 attgatacag ttagcattgc gagtacaagt gtttctgata catctaagaa tggtacagta
361 acttttgcac atgagacaaa taactctgct agctttgcca ccaccatttc aacagataat
421 gccaacatta cgttggataa aaatgctgga aatacgattg ttaaaactac aaatgggagt
481 cagttgccaa ctaatttacc acttaagttt attaccactg aaggtaacga acatttagtt
541 tcaggtaatt accgtgcaaa tataacaatt acttcgacaa ttaaataatt atataataga
601 cgtagccttc ga
//
LOCUS HUMCRYAX1 189 bp ds-DNA PRI 31-JUL-1990
DEFINITION Human alpha-A-crystallin gene, exon 1.
ACCESSION M35629
KEYWORDS alpha-A-crystallin.
SEGMENT 1 of 2
SOURCE Human spleen DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 189)
AUTHORS McDevitt,D.S., Hawkins,J.W., Jaworski,C.J. and Piatigorsky,J.
TITLE Isolation and partial characterization of the human alpha-A-
crystallin gene
JOURNAL Exp. Eye Res. 43, 285-291 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 / 189 alpha-A-crystallin, exon 1
BASE COUNT 27 a 72 c 45 g 45 t
ORIGIN
1 atggacgtga ccatccagca cccctggttc aagcggaccc tggggccctt ctaccccagc
61 cggctgttcg accagttttt cggcgagggg ctttttgagt atgacctgct gcccttcctg
121 tcgtccacca tcacccccta ctaccgccag tccctcttcc gcaccgtgct ggactccggc
181 atctctgag
//
LOCUS HUMCRYAX2 51 bp ds-DNA PRI 31-JUL-1990
DEFINITION Human alpha-A-crystallin gene, exon 3 (partial).
ACCESSION M35628
KEYWORDS alpha-A-crystallin.
SEGMENT 2 of 2
SOURCE Human spleen DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 51)
AUTHORS McDevitt,D.S., Hawkins,J.W., Jaworski,C.J. and Piatigorsky,J.
TITLE Isolation and partial characterization of the human alpha-A-
crystallin gene
JOURNAL Exp. Eye Res. 43, 285-291 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 27 alpha-A-crystallin, exon 3 (AA at 1)
BASE COUNT 7 a 23 c 11 g 10 t
ORIGIN About 1.7 kb after segment 1
1 aagcccacct cggctccctc gtcctaagca gcattgcctc ggctggctcc c
//
LOCUS HUMTBP1 1341 bp ss-mRNA PRI 31-JUL-1990
DEFINITION Human tat binding protein-1 (tbp-1) mRNA, complete cds.
ACCESSION M34079
KEYWORDS Tat binding protein-1.
SOURCE Human Jurkat T-cell line, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites for [2])
AUTHORS Nelbock,P., Dillion,P.J., Perkins,A. and Rosen,C.A.
TITLE A cDNA for a protein that interacts with the human immunodeficiency
virus tat transactivator
JOURNAL Science 248, 1650-1653 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1341)
AUTHORS Nelbock,P., Dillion,P.J., Perkins,A. and Rosen,C.A.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by C.A.Rosen, 08-MAY-1990.
FEATURES from to/span description
pept 63 1277 tat binding protein-1 (tbp-1)
BASE COUNT 350 a 337 c 412 g 242 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccggc gaccgtgtgg gatgaggccg agcaagatgg aattggggag gaggtgctca
61 agatgtccac ggaggagatc atccagcgca cacggctgct ggacagtgag atcaagatca
121 tgaagagtga agtgttgaga gtcacccatg agctccaagc catgaaggac aagataaaag
181 agaacagtga gaaaatcaaa gtgaacaaga ccctgccgta ccttgtctcc aacgtcatcg
241 agctcctgga tgttgatcct aatgaccaag aggaggatgg tgccaatatt gacctggact
301 cccagaggaa gggcaagtgt gctgtgatca aaacctctac acgacagacg tacttccttc
361 ctgtgattgg gttggtggat gctgaaaagc taaagccagg agacctggtg ggtgtgaaca
421 aagactccta tctgatcctg gagacgctgc ccacagagta tgactcgcgg gtgaaggcca
481 tggaggtaga cgagaggccc acggagcaat acagtgacat tgggggtttg gacaagcaga
541 tccaggagct ggtggaggcc attgtcttgc caatgaacca caaggagaag tttgagaact
601 tggggatcca acctccaaaa ggggtgctga tgtatgggcc cccagggacg gggaagaccc
661 tcctggcccg ggcctgtgcc gcacagacta aggccacctt cctaaagctg gctggccccc
721 agctggtgca gatgttcatt ggagatggtg ccaagctagt ccgggatgcc tttgccctgg
781 ccaaggagaa agcgccctct atcatcttca ttgatgagtt ggatgccatc ggcaccaagc
841 gctttgacag tgagaaggct ggggaccggg aggtgcagag gacaatgctg gagcttctga
901 accagctgga tggcttccag cccaacaccc aagttaaggt aattgcagcc acaaacaggg
961 tggacatcct ggaccccgcc ctcctccgct cgggccgcct tgaccgcaag atagagttcc
1021 cgatgcccaa tgaggaggcc cgggccagaa tcatgcagat ccactcccga aagatgaatg
1081 tcagtcctga cgtgaactac gaggagctgg cccgctgcac agatgacttc aatggggccc
1141 agtgcaaggc tgtgtgtgtg gaggcgggca tgatcgcact ggccaggggt gccacggagc
1201 tcacccacga ggactacatg gaaggcatcc tggaggtgca ggccaagaag aaagccaacc
1261 tacaatacta cgcctaggca cacaggccag ccccagtctc acggctgaag tgcgcaataa
1321 aagatggttt agggggaatt c
//
LOCUS MUSMETMO 280 bp ds-DNA SYN 31-JUL-1990
DEFINITION MoMLV/mouse metallothionein promoter recombinant DNA.
ACCESSION M35969
KEYWORDS .
SOURCE Mouse and mouse moloney leukemia virus DNA.
REFERENCE 1 (bases 1 to 280)
AUTHORS Mclvor,R.S.
TITLE Deletion in a recombinant retroviral vector resulting from a
cryptic splice donor signal in the moloney leukemia virus encvelope
gene
JOURNAL Virology 176, 652-655 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 127 128 MoMLV DNA end/mouse metallothionein promoter
DNA start
recomb 241 242 metallothionein promoter region deletion
BASE COUNT 64 a 78 c 77 g 61 t
ORIGIN
1 gggtccagac tccttctact gtgcctattg gggctgtgag acaaccggta gagcttactg
61 gaagccctcc tcatcatggg atttcatcac agtaaacaac aatctcacct ctgaccaggc
121 tgtccagctt cgatccagag agagacctgg gcggagctgg tcgctgctag gaactccagg
181 aaaggagaag ctgaggttac cacgctgcga atgggtttac ggagatagct ggctttccgg
241 gctccagctt caccagatcc gtcgacctgc agccaagctt
//
LOCUS PLYREGU 203 bp ds-DNA VRL 31-JUL-1990
DEFINITION Polyomavirus JC HindIII C fragment regulatory sequence.
ACCESSION M34921
KEYWORDS .
SOURCE Polyomavirus JC DNA.
ORGANISM Polyomavirus JC
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Polyomaviruses.
REFERENCE 1 (bases 1 to 203)
AUTHORS Frisque,R.J.
TITLE Regulatory sequences and virus-cell interactions of JC virus
JOURNAL Prog. Clin. Biol. Res. 105, 41-59 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept 99 < 1 (c) T- antigen
BASE COUNT 62 a 55 c 31 g 55 t
ORIGIN
1 aagctttctc atgacaggaa tgttccccca tgcagaccta tcaaggccta ataaatccat
61 aagctccatg gattcctccc tattcagcac tttgtccatt ttagcttttt gcagcaaaaa
121 attactgcaa aaaagggaaa aacaagggaa tttccctggc ctcctaaaaa gcctccacgc
181 ccttactact tctgagtaag ctt
//
LOCUS PLYREGUX 292 bp ds-DNA VRL 31-JUL-1990
DEFINITION Polyomavirus JC regulatory sequence.
ACCESSION M34922
KEYWORDS .
SOURCE Polyomavirus JC DNA, clones Mad1-[TC,Br].
ORGANISM Polyomavirus JC
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Polyomaviruses.
REFERENCE 1 (bases 1 to 292)
AUTHORS Frisque,R.J.
TITLE Regulatory sequences and virus-cell interactions of JC virus
JOURNAL Prog. Clin. Biol. Res. 105, 41-59 (1983)
STANDARD simple staff_review
FEATURES from to/span description
binding 3 19 pot. T-antigen binding site
rpt 20 117 tandem repeat copy A
rpt 118 215 tandem repeat copy B
BASE COUNT 96 a 68 c 81 g 47 t
ORIGIN 0.67 - 0.73 mu.
1 cggaggcggc ctcggcctcc tgtatatata aaaaaaaggg aagggatggc tgccagccaa
61 gcatgagctc atacctaggg agccaaccag ctaacagcca gtaaacaaag cacaaggctg
121 tatatataaa aaaaagggaa gggatggctg ccagccaagc atgagctcat acctagggag
181 ccaaccagct aacagccagt aaacaaagca caaggggaag tggaaagcag ccaagggaac
241 atgttttgcg agccagagct gttttggctt gtcaccagct ggccatggtt ct
//
LOCUS PT7RNA 203 bp ds-DNA PHG 31-JUL-1990
DEFINITION Bacteriophage T7 DNA sequence.
ACCESSION M35614
KEYWORDS .
SOURCE Bacteriophage T7 (strain delta-D111) DNA RNA.
ORGANISM Bacteriophage T7
Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE 1 (bases 1 to 203)
AUTHORS Aivazashvili,V.A., Bibilashvili,R.S., Vartikyan,R.M. and
Kutateladze,T.V.
TITLE Factors affecting the discontinuous in vitro elongation of RNA by
RNA polymerase from Escherichia coli
JOURNAL Mol. Biol. 15, 510-522 (1981)
STANDARD simple staff_review
BASE COUNT 63 a 54 c 49 g 35 t 2 others
ORIGIN
1 catcgagagg gacacggcga atagccatcc caatccacac gtccaacggg gcaaccgtat
61 gtacacctga tgggttcgca atgaacacna cgaatcgaac gccttaagcg tgaactccgc
121 attaaccgca agattacnag ataggtcggc tatgacgaac gcactgatgg catagaggta
181 atgcatgcat gcacacatat acg
//
LOCUS PT7RNAX 209 bp ds-DNA PHG 31-JUL-1990
DEFINITION Bacteriophage T7 DNA sequence.
ACCESSION M35615
KEYWORDS .
SOURCE Bacteriophage T7 DNA.
ORGANISM Bacteriophage T7
Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE 1 (bases 1 to 209)
AUTHORS Aivazashvili,V.A., Bibilashvili,R.S., Vartikyan,R.M. and
Kutateladze,T.V.
TITLE Factors affecting the discontinuous in vitro elongation of RNA by
RNA polymerase from Escherichia coli
JOURNAL Mol. Biol. 15, 510-522 (1981)
STANDARD simple staff_review
BASE COUNT 71 a 52 c 54 g 32 t 0 others
ORIGIN
1 catcgagagg gacacggcga atagccatcc caatcgacac cggggtcaac cggataagta
61 gacagcctga taagtcgcac gaaaaacagg tattgacaac atgaagtaac atgcagtaag
121 atacaaatcg ctaggtaaca ctagcagcgt caaccgggcg cacagtgcct tctaggtgac
181 ttaagcgcac cacggcacat aaggtgaag
//
LOCUS SEQPGH 879 bp ss-mRNA VRT 31-JUL-1990
DEFINITION S.quinqueradiata growth hormone (GH) mRNA, complete cds.
ACCESSION M35627
KEYWORDS growth hormone.
SOURCE S.quinqueradiata (pituitary glands), cDNA to mRNA.
ORGANISM Seriola quinqueradiata
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes;
Actinopterygii; Perciformes; Percoidei; Carangidae.
REFERENCE 1 (bases 1 to 879)
AUTHORS Watahiki,M., Tanaka,M., Masuda,N., Yamakawa,M., Yoneda,Y. and
Nakashima,K.
TITLE cDNA cloning and primary structure of yellow tail (Seriola
quinqueradiata) pregrowth hormone
JOURNAL Gen. Comp. Endocrinol. 70, 401-406 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 97 711 growth hormone (GH) precursor
sigp 97 147 growth hormone signal peptide (pot.) (other
potential cleavage site on bp 150)
matp 150 708 growth hormone (pot.) (other matp starts on bp
153)
BASE COUNT 226 a 239 c 192 g 222 t
ORIGIN
1 acaacacact gaacaactga actcagctca gattaaacag aacctgaacc tgaagcagac
61 cctgaaccag aacctgaact tgaaccctga tcagccatgg acagagttgt ccttctgctg
121 tcagtcctgt ctctgggcgt ctcctctcag ccaatcacag acagccagca tctgttctcc
181 atcgctgtca gcagaatcca aaacctccac ctgctcgctc agagactctt ctccaacttc
241 gagagtactc tgcagacgga ggaccagcgt caactcaaca aaatcttcct acaggatttc
301 tgtaactctg attacatcat cagtcccatt gacaagcatg agacacaacg cagctctgtt
361 ctgaagctgt tatcgatctc ctatcgattg gtggagtctt gggagttctc cagtcgcttt
421 ctgtctggag gttctgctct gaggaaccag atttcaccca gactgtctga actcaagaca
481 ggaatccaac tgctgatcac agccaatcag gacggagcag agatgttctc tgacgtctcg
541 gccctccagc tcgctccata tggaaacttc tatcagagtc tgggaggcga agagttgctg
601 agacgaaact acgaattgct ggcctgtttc aagaaggaca tgcacaaggt ggagacgtac
661 ctgacggtgg ctaaatgtcg gctctctcca gaagctaact gcaccctgta gccccgtctc
721 tccacagtga agcctgtcgt tgatgatgta atcctgtatg ttctcaagct ccgcccctat
781 gttagcatta gtgtttacct ctgtcaggtg ctgaagtcca aactgatgat gtcatagtga
841 tgtcatacat tcagcttgtg aaataaagtg tgttaattc
//
LOCUS VIBRR5S 118 bp ss-rRNA RNA 31-JUL-1990
DEFINITION V.marinus 5S ribosomal RNA.
ACCESSION M35658
KEYWORDS 5S ribosomal RNA.
SOURCE V.marinus (strain MP-1, ATCC 15381) RNA.
ORGANISM Vibrio marinus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS MacDonell,M.T. and Colwell,R.R.
TITLE The nucleotide sequence of 5S ribosomal RNA from Vibrio marinus
JOURNAL Microbiol. Sci. 1, 229-231 (1984)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 118 5S rRNA
BASE COUNT 27 a 25 c 34 g 32 t
ORIGIN
1 tgtttggtga ccatagtgtt gtggtaccac ctgactccat tccgaactca gtagtgaaac
61 gtaataacgc cgatggtagt gtggggtttc cccatgtgag agtaggtatc gccaggca
//GenBank-Updates@genbank.bio.net (08/01/90)
LOCUS RATGGLUT 1060 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat gamma-glutamyltransferase gene, 5' promoter region.
ACCESSION J05515
KEYWORDS gamma-glutamyltransferase.
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1060)
AUTHORS Rajagopalan,S., Park,J.-H., Patel,P.D., Lebovitz,R.M. and
Lieberman,M.W.
TITLE Cloning and analysis of the rat gamma-glutamyltransferase gene
JOURNAL J. Biol. Chem. 265, 11721-11725 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Rajagopalan, 25-MAY-1990.
FEATURES from to/span description
mRNA 834 > 1060 gamm-glutamyltransferase mRNA
BASE COUNT 235 a 277 c 275 g 273 t
ORIGIN
1 cagctgcctt ctggaggacc aaactgttca ggggaaggac aggaagaaat gagcctgtgc
61 cttcaggtca gagtcatgcc tagatctggg cgggagagct acaagggata ctgaccagga
121 gatagggtgt tgtcccctcc cccctggggt ttggtatcct cctctgcctt aagagttgca
181 aatcgacttt cccacataac aggcaccaaa tccagttagg accaacccca ccttccaatc
241 caggggagag gaatgtcagc aatgcgtggg cgtgtccttc taatgtgttt tccttgagtg
301 ttgtatgtgg accatctgca tgctcggtac ccagaggcca tcaggtctct tggaacagga
361 attgttgatg tgaaatgcca tgtggttgct gggataggaa ctcaggactc cggaagaacc
421 ttctcttctc cagtccccct ctgttgtttt tttttttttt ttttttgaga tacgatctca
481 cactgtagca caggctaatc cagaactcac taggtaggtc agactgggct caaatcacag
541 cgattctgct tctgcttcct gagtgccagg gtttgcaggt gttagctatc atgcccagtc
601 ttaacatttc acacacgcca gtccaagtta ttaaaaaaca acccggcagt tgagggcagg
661 gccctcaagt cccacaactg gtgcgtgcgt accaagtcca atgcgggaaa ggcctggacc
721 cttgaaccct ttgggcggtt cacttgttag ctcttactac caaatcctgg gcttacacat
781 gaatgccagc ccctccctgc ccagttctgt gacccccttc cccgggcagc tcttgggaga
841 agtcatgcat acatggaggc ggtgccagcc tctttgactc cagagttcag cgggagacag
901 agggagctca tcacatcagg caccccagaa gagttctggg cctgcttcac gtttaacttt
961 gtgattttca ggagtaccag cctgctctaa cggtttcagg gaagattggc tgtgggtttc
1021 cgcagagtgt gggggagttc ctgcttatcc atacagctga
//
LOCUS ACMGAG 167 bp ss-RNA VRL 01-AUG-1990
DEFINITION Avian myelocytomatosis retrovirus gag gene, partial cds.
ACCESSION M35626
KEYWORDS gag protein; oncogene.
SOURCE Avian myelocytomatosis retrovirus (mutant MC29-10H) RNA.
ORGANISM Avian myelocytomatosis retrovirus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Avian myelocytomatosis viruses.
REFERENCE 1 (bases 1 to 167)
AUTHORS Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and
Patschinsky,T.
TITLE Structure of mutant and wild-type MC29 v-myc alleles and
biochemical properties of their protein products
JOURNAL Oncogene 1, 97-109 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 167 gag protein (AA at 1)
BASE COUNT 38 a 47 c 52 g 30 t
ORIGIN
1 ggggaggagc ttgcgagtac aggtccgccc gtggtggcca tgcctgtagt gattaacaca
61 gagggacccg cctggacccc tctggagcca aaattgatca caagactggc tgatacggtc
121 aggaccaagg gcttacgatc cccgattact atagcggcgg ccactcg
//
LOCUS ACMVMYC 333 bp ss-RNA VRL 01-AUG-1990
DEFINITION Avian myelocytomatosis retrovirus v-myc gene, partial cds.
ACCESSION M35624
KEYWORDS oncogene; v-myc protein.
SOURCE Avian myelocytomatosis retrovirus (mutant MC29-10A) RNA.
ORGANISM Avian myelocytomatosis retrovirus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Avian myelocytomatosis viruses.
REFERENCE 1 (bases 1 to 333)
AUTHORS Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and
Patschinsky,T.
TITLE Structure of mutant and wild-type MC29 v-myc alleles and
biochemical properties of their protein products
JOURNAL Oncogene 1, 97-109 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 333 v-myc protein (AA at 1)
BASE COUNT 72 a 130 c 90 g 41 t
ORIGIN
1 ggcctctacc tgcacgacct gggagccgcg gccgccgact gcatcgaccc ctcggtggtc
61 ttcccctacc cgctcagcga gcgcgccccg cgggccgccc cgcccggcgc caaccccgcg
121 gctctgctgg gggtcgacac gccgcccacg atccaccaac acaactacgc tgctcctccc
181 tccaccaagg tggaataccc agccgccaag aggctaaagt tggacagtgg cagggtcctc
241 aaacagatca gcaacaaccg aaaatgctcc agtccccgca cgttagactc agaggagaac
301 gacaagaggc gaacgcacaa cgtcttggag cgc
//
LOCUS ACMVMYCA 202 bp ss-RNA VRL 01-AUG-1990
DEFINITION Avian myelocytomatosis retrovirus v-myc gene, partial cds.
ACCESSION M35625
KEYWORDS oncogene; v-myc protein.
SOURCE Avian myelocytomatosis retrovirus (mutants MC29-10C and 10H) RNA.
ORGANISM Avian myelocytomatosis retrovirus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Avian myelocytomatosis viruses.
REFERENCE 1 (bases 1 to 202)
AUTHORS Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and
Patschinsky,T.
TITLE Structure of mutant and wild-type MC29 v-myc alleles and
biochemical properties of their protein products
JOURNAL Oncogene 1, 97-109 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 202 v-myc protein (AA at 1)
BASE COUNT 34 a 77 c 62 g 29 t
ORIGIN
1 ggcctctacc tgcacgacct gggagccgcg gccgccgact gcatcgaccc ctcggtcgtc
61 ttcccctacc cgctcagcga gcgcgccccg cgggccgccc cgcccgacga caagaggcga
121 acgcacaacg tcttggagcg ccagcgaagg aatgagctga agctgcgttt ctttgccctg
181 cgtgaccaga tacccgaggt gg
//
LOCUS HUM3BHSD 1565 bp ss-mRNA PRI 01-AUG-1990
DEFINITION Human placental 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase
mRNA, complete cds.
ACCESSION M35493
KEYWORDS 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase.
SOURCE Human placenta, cDNA to mRNA, clone H3-beta-hp6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Lorence,M.C., Murry,B.A., Trant,J.M. and Mason,J.I.
TITLE Human 3-beta-hydroxysteroid dehydrogenase/delta-5->4isomerase from
placenta: Expression in nonsteroidogenic cells of a protein that
catalyzes the dehydrogenation/isomerization of C21 and C19 steroids
JOURNAL Endocrinology 126, 2493-2498 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 31 1152 3-beta-hydroxysteroid
dehydrogenase/5-4-isomerase
mRNA < 1 1565 3-beta-hydroxysteroid
dehydrogenase/5-4-isomerase
BASE COUNT 417 a 381 c 376 g 391 t
ORIGIN
1 gcggagtgat tcctgctact ttggatggcc atgacgggct ggagctgcct tgtgacagga
61 gcaggagggt ttctgggaca gaggatcatc cgcctcttgg tgaaggagaa ggagctgaag
121 gagatcaggg tcttggacaa ggccttcgga ccagaattga gagaggaatt ttctaaactc
181 cagaacaaga ccaagctgac agtgctggaa ggagacattc tggatgagcc attcctgaag
241 agagcctgcc aggacgtctc ggtcatcatc cacaccgcct gtatcattga tgtcttcggt
301 gtcactcaca gagagtctat catgaatgtc aatgtgaaag gtacccagct cctgttagag
361 gcctgtgtcc aagctagtgt gccagtcttc atctacacca gtagcataga ggtagccggg
421 cccaactcct acaaggaaat catccagaat ggccatgaag aagagcctct ggaaaacaca
481 tggcccgctc catacccaca cagcaaaaag cttgctgaga aggctgtact ggcggctaac
541 gggtggaatc tgaaaaacgg cggcaccctg tacacttgtg ccttacgacc catgtatatc
601 tatggggaag gaagccgatt cctttctgct agtataaacg aggccctgaa caacaatggg
661 atcctgtcaa gtgttggaaa gttctccact gttaacccag tctatgttgg caatgtggcc
721 tgggcccaca ttctggcctt gagggccctg caggacccca agaaggcccc aagcatccga
781 ggacagttct actatatctc agatgacacg cctcaccaaa gctatgataa ccttaattac
841 accctgagca aagagttcgg cctccgcctt gattccagat ggagctttcc tttatccctg
901 atgtattgga ttggcttcct gctggaaata gtgagcttcc tactcaggcc aatttacacc
961 tatcgaccgc ccttcaaccg ccacatagtc acattgtcaa atagcgtatt caccttctct
1021 tataagaagg ctcagcgaga tctggcgtat aagccactct acagctggga ggaagccaag
1081 cagaaaacgg tggagtgggt tggttccctt gtggaccggc acaaggagac cctgaagtcc
1141 aagactcagt gatttaagga tgacagagat gtgcatgtgg gtattgttag gagatgtcat
1201 caagctccac cctcctggcc tcatacagaa agtgacaagg gcacaagctc aggtcctgct
1261 gcctcccttt catacaatgg ccaacttatt gtattcctca tgtcatcaaa acctgcgcag
1321 tcattggccc aacaagaagg tttctgtcct aatcatatac cagaggaaag accatgtggt
1381 ttgctgttac caaatctcag tagctgattc tgaacaattt agggactctt ttaacttgag
1441 ggtcgttttg actactagag ctccatttct actcttaaat gagaaaggat ttcctttctt
1501 tttaatcttc cattccttca catagtttga taaaaagatc aataaatgtt tgaatgttta
1561 atgtg
//
LOCUS HUMMHB7B 1089 bp ss-mRNA PRI 01-AUG-1990
DEFINITION Human class I HLA-B7 mRNA, complete cds.
ACCESSION M35444
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1089)
AUTHORS Parham,P., Benjamin,R.J., Chen,B.P., Clayberger,C., Ennis,P.D.,
Krensky,A.M., Lawlor,D.A., Littman,D.R., Norment,A.M., Orr,H.T.,
Salter,R.D. and Zemmour,J.
TITLE Diversity of class I HLA molecules: Functional and evolutionary
interactions with T cells
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 529-543 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 1089 MHC HLA-B7 /hgml_locus_uid="LX0031C"
/nomgen="HLA-A" /map="6p21.3"
BASE COUNT 218 a 335 c 363 g 173 t
ORIGIN
1 atgctggtca tggcgccccg aaccgtcctc ctgctgctct cggcggccct ggccctgacc
61 gagacctggg ccggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc
121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc
181 gacagcgacg ccgcgagtcc gagagaggag ccgcgggcgc cgtggataga gcaggagggg
241 ccggagtatt gggaccggaa cacacagatc tacaaggccc aggcacagac tgaccgagag
301 agcctgcgga acctgcgcgg ctactacaac cagagcgagg ccgggtctca caccctccag
361 agcatgtacg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtac
421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcctg gaccgccgcg
481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagcgg
541 agagcctacc tggagggcga gtgcgtggag tggctccgca gatacctgga gaacgggaag
601 gacaagctgg agcgcgctga ccccccaaag acacacgtga cccaccaccc catctctgac
661 catgaggcca ccctgaggtg ctgggccctg ggtttctacc ctgcggagat cacactgacc
721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca
781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagccg
901 tcttcccagt ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagttc aggtggaaaa
1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc
1081 acagcttga
//
LOCUS MUSMUPE 872 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse major urinary protein mRNA, complete cds.
ACCESSION M28649
KEYWORDS major urinary protein.
SOURCE Mouse liver, cDNA to mRNA, clones 8-1 and 13-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 872)
AUTHORS Bennett,A.L., Paulson,K.E., Miller,R.E. and Darnell,J.E.Jr.
TITLE Aquisition of antigens characteristic of adult pericentral
hepatocytes by differentiating fetal hepatoblasts in vitro
JOURNAL J. Cell Biol. 105, 1073-1085 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 65 601 major urinary protein
mRNA 43 872 major urinary protein
BASE COUNT 266 a 188 c 170 g 248 t
ORIGIN
1 gccacgatca caagaaagat gtggtcctga cagacagaca atcctattcc ctaccaaaat
61 gaagatgctg ctgctgctgt gtttgggact gaccctagtc tgtgtccatg cagaagaagc
121 tagttctacg ggaaggaact ttaatgtaga aaagattaat ggggaatggc atactattat
181 cctggccttt gacaaaagag aaaagataga agataatggc aactttagac tttttctgga
241 gcaaatccat gtcttggaga attccttagt tcttaaattc catactgtaa gagatgaaga
301 gtgctcggaa ttatctatgg ttgctgacaa aacagaaaag gctggtgaat attctgtgac
361 gtatgatgga ttcaatacat ttactatacc taagacagac tatgataact ttcttatggc
421 tcatctcatt aacgaaaatg atggggaaac cttccagctg atggggctct atggccgaga
481 accagatttg agttcagaca tcaaggaaag gtttgcacaa ctatgtgaga agcatggaat
541 ccttagagaa aatatcattg acctatccaa tgccaatcgc tgcctccagg cccgagaatg
601 aagaatggcc tgagcctcca gtgttgagtg gagacttctc accaggactc caccatcatc
661 ccttcctatc catacagcat ccccagtata aattctgtga tctgcattcc atcctgtctc
721 actgagaagt ccaattccag tctatccaca tgttacctag gatacctcat caagaatcaa
781 agacttcttt aaatttttct ttgatatacc catgacaatt tttcatgaat ttcttcctct
841 tcctgttcaa taaatgatta cccttgcact ta
//
LOCUS RATMHREC 1552 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Rat MHC class I IgG Fc region receptor large subunit p51 (FcRn)
mRNA, complete cds.
ACCESSION M35495
KEYWORDS IgG Fc region receptor large subunit p51; cell surface antigen;
cell surface glycoprotein; class I gene; integral membrane protein;
major histocompatibility complex.
SOURCE Rat 11 day old epithelium, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1552)
AUTHORS Simister,N.E. and Mostov,K.E.
TITLE Cloning and expression of the neonatal rat intestinal FC receptor,
a major histocompatibilty complex class I antigen homolog
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 571-580 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 205 1305 IgG Fc region receptor large subunit p51
(FcRn) precursor
sigp 205 270 IgG Fc region receptor large subunit p51
signal peptide
matp 271 1302 IgG Fc region receptor large subunit p51
mRNA < 1 1552 FcRn mRNA
BASE COUNT 312 a 420 c 443 g 377 t
ORIGIN
1 tcagttctgt aattaattaa ctaacgtgga tcaaatgaga aggtgaaagt tcacacagga
61 gcactcctgt cgtcttggac tgggtctcca tcccaccatc cagtgccctg gtctacgaag
121 agtccacagg gaccttgtga agaatcaaca aggcggggtc cagaggagtc acgtgtgcct
181 tccactccgg gtcgccctgt caggatgggg atgtcccagc ccggggtcct cctcagcctc
241 ttattggtcc tcctgcctca gacctgggga gcggagcccc gtctcccact gatgtatcat
301 cttgcagctg tgtctgactt atcaacgggg cttccctctt tctgggccac gggctggctg
361 ggtgctcagc aatatctgac ctacaacaac ctgcggcagg aggctgaccc ctgtggggcc
421 tggatatggg aaaaccaggt gtcttggtat tgggagaagg agaccacgga tctgaaaagc
481 aaagaacagc tcttcttgga ggccatcagg accctggaga accaaataaa tgggaccttc
541 acactgcagg gcctgctggg ctgtgaactg gcccctgata attcttcatt gcccacggct
601 gtgtttgccc tcaatggtga ggagttcatg cggttcaacc caagaacggg caactggagt
661 ggggagtggc cggagacaga tatcgttggt aatctgtgga tgaagcaacc tgaggcggcc
721 aggaaggaga gcgagttcct gctaacttct tgtcctgagc ggctgctagg ccacctggag
781 aggggccgtc agaacctgga gtggaaggag ccgccatcta tgcgcctgaa ggcccgtcct
841 ggcaactctg gctcctcagt actgacctgt gctgctttct ccttctaccc gccggagctc
901 aagtttcgat tcctgcgcaa tgggctagcc tcaggctctg ggaattgcag cactggtccc
961 aatggtgatg gatctttcca tgcatggtca ttgctagagg tcaaacgtgg agatgaacac
1021 cattaccaat gtcaagtgga gcatgagggg ctggcccagc ctctcactgt ggacctagat
1081 tcgcccgcca gatcttctgt gcctgtggtc ggaatcattc ttggtttatt gctggtggta
1141 gtggccatcg cagggggtgt gctgctatgg aacaggatgc gaagtgggct gccagcccca
1201 tggctttctc tcagtggtga tgactctggc gacctattgc ctggtgggaa cttgcccccg
1261 gaggctgaac ctcaaggtgt aaatgccttt ccggccactt cctgatgcca acccaggccc
1321 catacccatt gcagcctgtg gggctgtgtg acctcctgaa ctgtctctga gcctcccgag
1381 ggagccctgg gctggatgtc ctcctcgtgg atcccttctt ttgtggcctg cttcagtttc
1441 ccctcttaat gtcaatggct atttccatct ccacataaat ttgggcccaa atctgtgtgt
1501 gcatcgttat tctcaggttt caggcagccg gaataaattg aacaagtttg ag
//
LOCUS YSCATP10 2343 bp ds-DNA PLN 01-AUG-1990
DEFINITION S.cerevisiae ATP10 (essential for mitochondrial ATPase complex
assembly) gene, complete cds.
ACCESSION J05463
KEYWORDS .
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2343)
AUTHORS Ackerman,S.H. and Tzagoloff,A.
TITLE ATP10, a yeast nuclear gene required for the assembly of the
mitochondrial F1-Fo complex
JOURNAL J. Biol. Chem. 265, 9952-9959 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Tzagoloff, 19-JUN-1990.
The gene sequence submitted codes for a protein that is essential
for the biosynthesis of the F1-F0 ATPase complex of the
mitochondrial inner membrane.
Bases 1 to 977 are shown on the complementary strand as shown in
Fig. 6.
FEATURES from to/span description
pept 629 < 1 (c) ORF
pept 976 680 (c) ORF
pept 1444 2238 ATP10 protein
BASE COUNT 692 a 447 c 490 g 714 t
ORIGIN
1 agatcttttg gctcaggtat aaattcgaac gtctcgattt cccttatcag tttatggaat
61 ggcttaaacc aagatgaaga tttccctagc tctaattgaa cttgcaccat atatataact
121 tttccaaaag agtaaaaata caaatccaga ttatcgattt tattaaattc ttgccaatga
181 ctattgaacg taggtgggag tcgggcatta cttcttgtca caaacgctac tgtctttgcc
241 gtatgatttt tcagacattc aggctttctg ggaaacttgt caaattgaaa gctataatta
301 tatgaacctg gtttaacttt gaacggcttg gaggagccat caagagcatt ccatacatta
361 tctggaggga aaactctctg ttcgaatttc attaatgtat gaaaggattt gttgtcttgg
421 cccggcatca tcatgccatt ctgttgaaac atgtactctt gatcaatttt tgttaaggtc
481 tctgagaatc cttttagaat gacggaaatt ttccttatag atagcgcttt tgttaactga
541 agactaacta tccctgacat ttgatcatta gagctataaa actccccgtt gtacggtggg
601 tttaaggata ttgaaatttt tggagccatg gtttgacaaa ctgtatggtt ctcaaccttc
661 tctaatcaaa agcagaatct taaatataaa cactcacaga atatccgttg gtcaatgaag
721 taattctcct ttgtactggc tgctttttct cctctagttt atgtaattct acttttggat
781 gggtgcgact gcttttaatt gattgagtgg cggtgttaga agggctgtag agtcgaaggc
841 ttgtttctct cttacgcacc tcttgtgaaa agggcgtgca ccttccccag gaccctctct
901 caccctcaac ccgcattttg ctgagaattt tcaccaaggc cctaggtgat attagattcc
961 acctgactaa ttgcattaca gccgacccaa ggcaatatca gtttaataaa atatcatgta
1021 tctcaccctc ttcttggtat tagtaaagag acgcctgatc ttgtaacagt ggtgaagatt
1081 gtactagagc agaatcaaga atttaaaagt gtaaggcagg cagaggcgat gtacataaac
1141 ttcgaagtaa gaaatattta atagttctcg ccacatcact atgcagctat ataaaaacta
1201 ctataaacgt ttgttttgtt ccttacgcac aatatccttg cctagaaatc gtttttgaaa
1261 tttaaatttt tattaccatt tatttgattc gccttcagaa aaatatggaa gagtgcatat
1321 ttaaaaagga ctatttcagc atatagtaaa agtcaggtta tttgtttatt tgcgatatca
1381 gagtaactta aactaactat gcagggcact tttaaaaggt tttaccatcc cacgcttacg
1441 cggatgtcct tcttggataa attcctcaag cctatgatgg caacggcttc cccaaaggaa
1501 taccagatca aacaactggt caagccaata ggcttaacac aagcaccaag gaaaagcacc
1561 aaatactccc aggggaactc tttgagggat atgtttgatt cggaaaagac aaaccacaga
1621 gttaaagagt tggccgttga attcagcaaa tctggacttt atgacgtgca agtcttccaa
1681 aagacaaagg ggaaattgtt tatagctcca gtttcatatt ggaaagaaga taaagctttg
1741 ttttttcctc atttgatagg aacggcaatg gatggtacga aacaacagaa tatcgaggat
1801 atgttaaggg gtaaaaccag tatagtgagg ttatttagta cagcatctgg cgataagttg
1861 agtagttcat acttccaagg aatcgtagac gataacaaaa aaactgacta cttgactgaa
1921 gctgatgcgc gtttaagttt aaatgacagt aacgtccaaa tcatcgaggt caatcttgta
1981 gaaaacgctg tgaaaagtgc tctagtgaaa acgcttgctc gttgggccaa tcgcgttcca
2041 tcctggcgcc agccatttta tttcgaatat tctagaggcc aatggccatt ttccgtcagg
2101 gaagagctct tttgcaataa tgtcttttct ggatacgtct ttcttgtgga ccagcagtta
2161 aaaattaggt gggcagcttg cggggaggct actccatctg aaaaggaagc attgtggaag
2221 tttgccaaac gtctgtgaag ttgacgcttt gtgcggcggc caacaaggga tgggcggcta
2281 tttggcgatc cacaggacgg gtgtggtcgc catgatcgcg tagtcgatag tggctccaag
2341 tag
//
LOCUS DDIDPYK1A 1090 bp ss-mRNA INV 01-AUG-1990
DEFINITION D.discoideum protein-tyrosine kinase-1 (DPYK1) mRNA, complete cds.
ACCESSION M33785
KEYWORDS protein-tyrosine kinase-1.
SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 1090)
AUTHORS Tan,J.L. and Spudich,J.A.
TITLE Developmentally regulated protein-tyrosine kinase genes in
Dictyostelium discoideum
JOURNAL Mol. Cell. Biol. 10, 3578-3583 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.L.Tan, 20-APR-1990.
FEATURES from to/span description
pept < 1 1014 protein-tyrosine kinase-1 (DPYK1; AA at 1)
BASE COUNT 371 a 198 c 186 g 335 t
ORIGIN
1 cgcccatttg gtggttggga aactcaatca tcattatcac atccaccatc acgtccacca
61 ccacctccac caccaccacc acaactacca gttagatcag aatacgagat tgatttcaat
121 gaattagaat ttggtcaaac cattggtaaa ggtttctttg gtgaagtaaa gagaggttat
181 tggagagaga ctgatgttgc cataaaaatc atctatcgtg atcaattcaa aaccaaatca
241 tcattggtta tgtttcaaaa tgaagttgga atactaagta aattaagaca tccaaatgta
301 gttcaatttt tgggtgcatg tactgcagga ggtgaagatc atcattgtat agtaacagaa
361 tggatgggtg gaggtagttt aagacagttc ttgactgatc atttcaattt actcgaacaa
421 aatccacata ttcgtttgaa gttggctttg gatattgcaa aaggaatgaa ttatctacat
481 ggttggactc cacccattct tcatcgtgac ttatcctcaa gaaacatttt attggatcac
541 aacatcgatc caaagaatcc gttagtttcc tcaagacaag atattaaatg taagatctct
601 gattttggtc taagtagatt aaagaaggaa caagcctctc aaatgactca atcggttggt
661 tgtattccct acatggcacc agaggttttc aaaggcgata gtaatagtga aaagagtgat
721 gtttactcct atggcatggt tttgtttgaa ctattaacct ctgatgaacc tcaacaagat
781 atgaaaccaa tgaaaatggc tcacttggct gcttatgaat cttatcgtcc tccaattcca
841 ttaactacct cttccaagtg gaaagaaatt ctaactcaat gttgggattc taatcctgat
901 agtcgtccaa cctttaaaca aatcattgtt catctcaaag aaatggaaga tcaaggtgta
961 tcttcttttg catctgtacc tgttcaaact attgatactg gtgtttatgc ttaatttttt
1021 ttttataatt aaaaaaaaaa aaaacaaaac aaaaaaaaaa aataataata aatataatca
1081 cttcaactcg
//
LOCUS DDIDPYK2A 1291 bp ss-mRNA INV 01-AUG-1990
DEFINITION D.discoideum protein-tyrosine kinase-2 (DPYK2) mRNA, complete cds.
ACCESSION M33784
KEYWORDS protein-tyrosine kinase-2.
SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 1291)
AUTHORS Tan,J.L. and Spudich,J.A.
TITLE Developmentally regulated protein-tyrosine kinase genes in
Dictyostelium discoideum
JOURNAL Mol. Cell. Biol. 10, 3578-3583 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.L.Tan, 20-APR-1990.
FEATURES from to/span description
pept < 1 1233 protein-tyrosine kinase-2 (DPYK2; AA at 1)
BASE COUNT 491 a 207 c 203 g 390 t
ORIGIN
1 cgattctaca atacaacaaa ctctactaaa gatatcacat ttttagtttg tgataatcct
61 gattcaacta aagaaaagag taacgtttca aatacttcat caataatttc cgcttcaaat
121 ttaaatagac atataacacc aaattctcat atgagaccta gaggtagatc aatttctgaa
181 tctttaatta tgtcaccaat taataaagaa tctttaaatg atattcaaag agcaattgaa
241 agtgaaaaaa taaagaaaac taaatttgaa gaattaaaat caatattggg cgaaagagaa
301 tatataattg atataaatga tattcaattt atacaaaaag ttggagaagg tgcattcagt
361 gaagtttggg aaggttggtg gaaaggtatt catgttgcca taaaaaagtt aaagattata
421 ggagatgaag aacaattcaa agagagattc attagagagg ttcaaaattt gaaaaaagga
481 aatcatcaaa acattgtcat gtttattggt gcatgttata aaccagcatg tatcataaca
541 gagtatatgg caggtggtag tctttacaat atacttcata atccaaatag ttccactcca
601 aaagttaaat attctttccc attggttttg aaaatggcaa ccgacatggc attgggctta
661 ttacatcttc attccatcac cattgtgcat cgtgatttaa ccagtcaaaa cattctattg
721 gatgaattgg gtaatataaa gatctctgat tttggtttat ctgctgaaaa gagtagagaa
781 ggttcaatga caatgacaaa tggtggcatt tgcaatccaa gatggagacc acccgaattg
841 acaaagaatt taggtcacta ctcggaaaag gttgatgtct attgtttctc tctagtagtt
901 tgggaaattt taactggcga aattcctttc tctgatttag atggatctca acgatccgct
961 caagtagctt atgctggttt aagaccacca ataccagagt attgcgatcc tgaattaaaa
1021 ttactcttaa ctcaatgttg ggaggctgat ccaaatgata gacctccctt tacctatata
1081 gtaaacaaat taaaagaaat ctcttggaat aatccaattg gtttcgtctc tgatcaattc
1141 tatcaatata gcgaaccttc aactccaaga ttagcattat caaatcaatc ttcaaattca
1201 agtagtattt ctttatcacc aactaaatta taaaaaaaaa aaaaaaaaaa aacaaatttc
1261 aaacaccaaa caccaccact catcaaaatc g
//
LOCUS HUMSPTB 6765 bp ss-mRNA PRI 01-AUG-1990
DEFINITION Human beta-spectrin (SPTB) mRNA, complete cds.
ACCESSION J05500
KEYWORDS beta-spectrin; spectrin.
SOURCE Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and
V252.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 6765)
AUTHORS Winkelmann,J.C., Chang,J.-G., Tse,W.T., Scarpa,A.L.,
Marchesi,V.T. and Forget,B.G.
TITLE Full length sequence of the cDNA for human erythroid beta-spectrin
JOURNAL J. Biol. Chem. 265, 11827-11832 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Winkelmann, 08-MAY-1990.
FEATURES from to/span description
pept 96 6509 beta-spectrin /nomgen="SPTB" /map="14"
/hgml_locus_uid="LS0033T"
mRNA < 1 6765 SPTB mRNA
signal 6716 6722 poly-A signal
BASE COUNT 1626 a 1822 c 2146 g 1171 t
ORIGIN Chromosome 14q23-q24.
1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag
61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa
121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg
181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg
241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg
301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca
361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc
421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac
481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc
541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag
601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag
661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta
721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca
781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc
841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct
901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc
961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg
1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca
1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca
1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca
1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag
1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg
1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc
1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata
1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga
1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg
1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc
1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg
1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg
1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga
1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg
1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc
1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg
1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg
2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc
2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc
2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt
2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga
2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc
2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg
2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc
2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat
2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgcaggcc ctgcgggagc
2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg
2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga
2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca
2761 ggttcgacat cctggaccag gagatgaaga ccttgatgac tcagattgat ggtgtgaacc
2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc
2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg
2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt
3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag
3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg
3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga
3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat
3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg
3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca
3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg
3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc
3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg
3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg
3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat
3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga
3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg
3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg
3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc
3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct
3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc
4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc
4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc
4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa
4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg
4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcggtcagac gacccgggca
4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga
4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg
4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa
4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg
4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta
4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg
4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg
4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc
4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt
4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct
4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc
4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg
5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg
5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca
5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa
5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc
5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca
5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg
5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc
5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga
5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca
5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg
5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg
5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc
5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct
5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc
5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca
5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc
6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt
6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc
6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg
6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca
6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg
6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga
6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg
6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac
6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg
6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg
6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag
6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa
6721 attttattgt ataaatatat caccatttac atgaggggaa acact
//
LOCUS STYEUTBC 2526 bp ds-DNA BCT 01-AUG-1990
DEFINITION S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes,
complete cds.
ACCESSION J05518
KEYWORDS ethanolamine ammonia-lyase.
SOURCE S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2526)
AUTHORS Faust,L.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M.
TITLE Cloning, sequencing and expression of the genes encoding the
alcohol-dependent ethanolamine ammonia-lyase of Salmonella
typhimurium
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.M.Babior, 08-MAY-1990.
FEATURES from to/span description
pept 141 1499 ethanolamine ammonia-lyase (eutB)
pept 1518 2378 ethanolamine ammonia-lyase (eutC)
binding 130 133 ribosome binding site
binding 1507 1510 ribosome binding site
BASE COUNT 563 a 687 c 779 g 497 t
ORIGIN
1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga
61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt
121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt
181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg
241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa
301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc
361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca
421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc
481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc
541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc
601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg
661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc
721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga
781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc
841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct
901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt
961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct
1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg
1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc
1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga
1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc
1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc
1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg
1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg
1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat
1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg
1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc
1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg
1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc
1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg
1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg
1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc
1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg
1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga
2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact
2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca
2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg
2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc
2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg
2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga
2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg
2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt
2521 gcctcc
//
LOCUS XELPCNA 1018 bp ss-mRNA VRT 01-AUG-1990
DEFINITION X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete
cds.
ACCESSION M34080
KEYWORDS nuclear protein; proliferating cell nuclear antigen.
SOURCE X.laevis oocyte, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1018)
AUTHORS Leibovici,M., Gusse,M., Bravo,R. and Mechali,M.
TITLE Characterization and developmental expression of Xenopus
proliferating cell nuclear antigen (PCNA)
JOURNAL Dev. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Leibovici, 08-MAY-1990.
FEATURES from to/span description
pept 28 813 proliferating cell nuclear antigen (PCNA)
mRNA < 1 1018 PCNA mRNA
BASE COUNT 284 a 223 c 237 g 274 t
ORIGIN
1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc
61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca
121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact
181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag
241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg
301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt
361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa
421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc
481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca
541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa
601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat
661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca
721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg
781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc
841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa
901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa
961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac
//
LOCUS YSCAAC2A 1333 bp ds-DNA PLN 01-AUG-1990
DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete
cds.
ACCESSION M34076 J05542
KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE S.cerevisiae (strain W303-1B) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1333)
AUTHORS Kolarov,J., Kolarova,N. and Nelson,N.
TITLE A third ADP/ATP-translocator in yeast
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Nelson, 08-MAY-1990.
FEATURES from to/span description
pept 235 1158 ADP/ATP-translocator protein (AAC2)
BASE COUNT 388 a 209 c 301 g 435 t
ORIGIN
1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc
61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg
121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa
181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt
241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt
301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat
361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt
421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat
481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg
541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct
601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc
661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg
721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc
781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct
841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg
901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga
961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa
1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg
1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc
1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc
1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag
1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc
1321 agccactgtt gtt
//
LOCUS YSCAAC3 1308 bp ds-DNA PLN 01-AUG-1990
DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete
cds.
ACCESSION M34075 J05542
KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE S.cerevisiae (strain W303-1B) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1308)
AUTHORS Kolarov,J., Kolarova,N. and Nelson,N.
TITLE A third ADP/ATP-translocator in yeast
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Nelson, 08-MAY-1990.
FEATURES from to/span description
pept 78 1034 ADP/ATP-translocator protein (AAC3)
BASE COUNT 353 a 228 c 263 g 464 t
ORIGIN
1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata
61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc
121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg
181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag
241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca
301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta
361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt
421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg
481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat
541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg
601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac
661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga
721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt
781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa
841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga
901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag
961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta
1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc
1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata
1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa
1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc
1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt
//
LOCUS HUMMTVA1 367 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 13) mitochondrial DNA sequences, 5' end.
ACCESSION M28909
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 367)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 121 a 123 c 42 g 80 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
181 agcaagtaca gcaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc
241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat cacccccctc
361 agatagg
//
LOCUS HUMMTVA2 361 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 13) mitochondrial DNA sequences, 3' end.
ACCESSION M28910
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 361)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 107 a 102 c 54 g 93 t 5 others
ORIGIN
1 tttggtattt tcgtctgggg ggtgtgcacg cgatagcatt gcgagacgct ggagccggag
61 caccctatgt cgcagtatct gtctttgatt cctgccccat cctattattt atcgcaccta
121 cgttcaatat tacaggcgaa catacnctac taaagtgtgt taattaatta atgcttgtag
181 gacataataa taacaattaa atgtctgcac agccactttc cacacagaca tcataacaaa
241 aaatttncca ccaaaccccc ccnnntcccc ccgcttctgg ccacagcact taaacacatc
301 tctgccaaac cccaaaaaca aagaacccta acaccagcct aaccagattt caaattttat
361 c
//
LOCUS HUMMTVB1 367 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 14) mitochondrial DNA sequences, 5' end.
ACCESSION M28911
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 367)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 121 a 120 c 43 g 82 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
181 agcaagtaca gcaatcaacc ttcaactgtc acatatcaac cgtaactcca aagccacccc
241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
361 agatagg
//
LOCUS HUMMTVB2 356 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 14) mitochondrial DNA sequences, 3' end.
ACCESSION M28912
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 356)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 105 a 104 c 52 g 90 t 5 others
ORIGIN
1 ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc tggagccgga gcaccctatg
61 tcgcagtatc tgtctttgat tcctgcccca tcccattatt tatcgcacct acgttcaata
121 ttacaggcga acatacncta ctaaagtgtg ttaattaatt aatgcttgta ggacataata
181 ataacaattn aatgtctgca cagccacttt ccacacagac atcataacaa aaaatttncc
241 accaaacccc ccccnntccc cccgcttctg gccacagcac ttaaacacat ctctgccaaa
301 ccccaaaaac aaagaaccct aacaccagcc taaccagatt tcaaatttta tctttt
//
LOCUS HUMMTVC1 367 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 11) mitochondrial DNA sequences, 5' end.
ACCESSION M28905
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 367)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 119 a 123 c 45 g 79 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
181 agcaagtacg gcaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc
241 tcacccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca
301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
361 agatagg
//
LOCUS HUMMTVC2 371 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 11) mitochondrial DNA sequences, 3' end.
ACCESSION M28906
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 371)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 109 a 107 c 55 g 94 t 6 others
ORIGIN
1 tctnccatgc atttggtatt ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc
61 tggagccgga gcaccctatg tcgcagcacc tgtctttgat tcctgcccca ttccattatt
121 tatcgcacct acgttcaata ttacaggcga acatacncta ctaaagtgtg ttaattaatt
181 aatgcttgta ggacataata ataacaatta aatgtctgca cagccacttt ccacacagac
241 atcataacaa aaaatttncc accaaacccc cccnnntccc cccgcttctg gccacagcac
301 ttaaacacat ctctgccaaa ccccaaaaac aaagaaccct aacaccagcc taaccagatt
361 tcaaatttta t
//
LOCUS HUMMTVD1 368 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 12) mitochondrial DNA sequences, 5' end.
ACCESSION M28907
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 368)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 119 a 122 c 46 g 80 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacggtac cataaatact
121 tgaccacctg tagtacataa aaacccanac ccacatcaaa accctccccc catgcttaca
181 agcaagcaca gcaatcaacc ttcaactgtc acacatcaac tgcaactcca aagccacccc
241 tcacccacta ggatatcaac aaacctactc acccttaaca gtacatagca cataaagcca
301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
361 agataggg
//
LOCUS HUMMTVD2 375 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 12) mitochondrial DNA sequences, 3' end.
ACCESSION M28908
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 375)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 109 a 104 c 59 g 97 t 6 others
ORIGIN
1 ggaggctctn ccatgcattt ggtattttcg tctggggggt gtgcacgcga tagcattgcg
61 agacgctgga gccggagcac cctatgtgca gtatctgtct ttgattcctg ccccattcca
121 ttatttatcg cacctacgtt caatattaca ggcgagcata cnctattaaa gtgtattaat
181 taattaatgc ttgtaggaca taataataac aattaaatgt ctgcacagcc actttccaca
241 cagatcataa caaaaaattt nccaccaaac ccccccnnnt ccccccgctt ctggccacag
301 cacttaaaca catctctgcc aaaccccaaa aacaaagaac cctaacacca gcctaaccag
361 atttcaaatt ttatc
//
LOCUS HUMMTVE1 367 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 1-4) mitochondrial DNA sequences, 5' end.
ACCESSION M28893
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 367)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 120 a 121 c 44 g 81 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
181 agcaagtaca gcaatcaacc ttcaactgtc acacattaac cgcaactcca aagccacccc
241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
361 agatagg
//
LOCUS HUMMTVE2 362 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 1-4) mitochondrial DNA sequences, 3' end.
ACCESSION M28894
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 362)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 108 a 102 c 54 g 93 t 5 others
ORIGIN
1 catttggtat tttcgtctgg ggggtgtgca cgcgatagca ttgcgagacg ctggagccgg
61 agcaccctat gtcgcagtat ctgtctttga ttcctgcccc atcctattat ttatcgcacc
121 tacgttcaat attacaggcg aacatacnct actaaagtgt gttaattaat taatgcttgt
181 aggacataat aataacaatt aaatgtctgc acagccactt tccacacaga catcataaca
241 aaaaatttnc caccaaaccc ccccnnntcc ccccgcttct ggccacagca cttaaacaca
301 tctctgccaa accccaaaaa caaagaaccc taacaccagc ctaaccagat ttcaaatttt
361 at
//
LOCUS HUMMTVF1 369 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 7) mitochondrial DNA sequences, 5' end.
ACCESSION M28899
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 369)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 120 a 122 c 47 g 79 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatc tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
181 agcaagtaca gcaatcaacc ctcaactgtc atacatcaac cgcaactcca aagccactcc
241 tcagccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca
301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
361 agatagggg
//
LOCUS HUMMTVF2 371 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 7) mitochondrial DNA sequences, 3' end.
ACCESSION M28900
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 371)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 109 a 107 c 55 g 94 t 6 others
ORIGIN
1 tctnccatgc atttggtatt ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc
61 tggagccgga gcaccctatg tcgcagtatc tgtctttgat tcctgcccca tcccattatt
121 tatcgcacct acgttcaata ttacaggcga acatacncta ccaaagtgtg ttaattaatt
181 aatgcttgta ggacataata ataacaatta aatgtctgca cagccacttt ccacacagac
241 atcataacaa aaaatttncc accaaacccc cccnnntccc cccgcttctg gccacagcac
301 ttaaacacat ctctgccaaa ccccaaaaac aaagaaccct aacaccagcc taaccagatt
361 tcaaatttta t
//
LOCUS HUMMTVG1 340 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 5) mitochondrial DNA sequences, 5' end.
ACCESSION M28895
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 340)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 114 a 111 c 39 g 75 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatc tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
181 agcaagtaca gtaatcaacc ctcaactgtc atacatcaac cgcaactcca aagccacccc
241 tcagccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca
301 tttaccgtac atagcacatt acagtcaaat cccttctcgt
//
LOCUS HUMMTVG2 349 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 5) mitochondrial DNA sequences, 3' end.
ACCESSION M28896
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 349)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 106 a 103 c 52 g 83 t 5 others
ORIGIN
1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc
61 tatgtcgcag tatctgtctt tgattcctgc cccatcccat tatttatcgc acctacgttc
121 aatattacag gcgaacatac nctaccaaag tgtgttaatt aattaatgct tgtaggacat
181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt
241 tnccaccaaa cccccccnnn tccccccgct tctggccaca gcacttaaac acatctctgc
301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaa
//
LOCUS HUMMTVH1 348 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 8) mitochondrial DNA sequences, 5' end.
ACCESSION M28901
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 348)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 115 a 115 c 38 g 79 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
181 agcaagtaca gcaatcaacc ttcaactgtc acacattaac tgcaactcca aagccacccc
241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatcc
//
LOCUS HUMMTVH2 355 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 8) mitochondrial DNA sequences, 3' end.
ACCESSION M28902
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 355)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 107 a 101 c 52 g 90 t 5 others
ORIGIN
1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc
61 tatgtcgcag tatctgtctt tgattcctgc cccatcctat tatttatcgc acctacgttc
121 aatattacag gcgaacatac nctactaaag tgtgttaatt aattaatgct tgtaggacat
181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt
241 tnccaccaaa cccccccnnn tccccccgct tctggccaca gcacttaaac acatctctgc
301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaat tttat
//
LOCUS HUMMTVI1 367 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 6) mitochondrial DNA sequences, 5' end.
ACCESSION M28897
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 367)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 121 a 121 c 43 g 81 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
181 agcaagtaca gtaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc
241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
361 agatagg
//
LOCUS HUMMTVI2 358 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 6) mitochondrial DNA sequences, 3' end.
ACCESSION M28898
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 358)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 106 a 102 c 54 g 91 t 5 others
ORIGIN
1 ttggtatttt cgtctggggg gtgtgcacgc gatagcattg cgagacgctg gagccggagc
61 accctatgtc gcagtatctg tctttgattc ctgccccatc ccattattta tcgcacctac
121 gttcaatatt acaggcgaac atacnctact aaagtgtgtt aattaattaa tgcttgtagg
181 acataataat aacaattaaa tgtctgcaca gccactttcc acacagacat cataacaaaa
241 aatttnccac caaacccccc cnnntccccc cgcttctggc cacagcactt aaacacatct
301 ctgccaaacc ccaaaaacaa agaaccctaa caccagccta accagatttc aaattttt
//
LOCUS HUMMTVJ1 365 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 9,10) mitochondrial DNA sequences, 5' end.
ACCESSION M28903
KEYWORDS mitochondrial DNA.
SEGMENT 1 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 365)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 120 a 122 c 43 g 79 t 1 others
ORIGIN
1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
181 agcaagtaca gcaatcaacc ttcaactgtc acaatcaacc gcaactccaa agccacccct
241 cacccactag gataccaaca aacctaccca cccttaacag tacatagcac ataaagccat
301 ttaccgtaca tagcacatta cagtcaaatc ccttctcgtc cccatggatg acccccctca
361 gatag
//
LOCUS HUMMTVJ2 355 bp ds-DNA ORG 01-AUG-1990
DEFINITION Human (!Kung 9,10) mitochondrial DNA sequences, 3' end.
ACCESSION M28904
KEYWORDS mitochondrial DNA.
SEGMENT 2 of 2
SOURCE Human mitochondrial hair root DNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 355)
AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
Wilson,A.C.
TITLE Mitochondrial DNA sequences in single hairs from a southern African
population
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Vigilant 06-OCT-1989.
BASE COUNT 107 a 102 c 52 g 90 t 4 others
ORIGIN
1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc
61 tatgtcgcag tatctgtctt tgattcctgc cccatcccat tatttatcgc acctacgttc
121 aatattacag gcgaacatac nctattaaag tgtgttaatt aattaatgct tgtaggacat
181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt
241 tnccaccaaa ccccccccnn tccccccgct tctggccaca gcacttaaac acatctctgc
301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaat tttat
//
LOCUS HUMLD78A 3176 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human cytokine LD78 alpha gene, complete cds.
ACCESSION D90144
KEYWORDS LD78; LD78 alpha; cytokine; inducible gene family;
secreted peptide.
SOURCE Human blood lymphocyte DNA , clone Lm LD-3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3176)
AUTHORS Nakao,M., Nomiyama,H. and Shimada,K.
TITLE Structures of human genes coding for cytokine LD78 and their
expression
JOURNAL Mol. Cell. Biol. 10, 3646-3658 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hisayuki
Nomiyama Department of Biochemistry Kumamoto University Medical
School 2-2-1 Honjo, Kumamoto 860 Japan Phone: 096-344-2111 Fax:
096-372-6140
FEATURES from to/span description
pept 1155 1227 cytokine LD78 alpha precursor, exon 1
1916 2030 cytokine LD78 alpha precursor, exon 2
2451 2541 cytokine LD78 alpha precursor, exon 3
sigp 1155 1219 cytokine LD78 alpha signal peptide
matp 1220 1227 cytokine LD78 alpha mature peptide
1916 2030 cytokine LD78 alpha mature peptide
2451 2538 cytokine LD78 alpha mature peptide
pre-msg 1069 2957 cytokine LD78 alpha mRNA and introns
IVS 1228 1915 cytokine LD78 alpha intron A
IVS 2031 2450 cytokine LD78 alpha intron B
signal 1041 1045 TATA box
BASE COUNT 833 a 741 c 752 g 850 t
ORIGIN
1 acccagggac ctatcacaca aatataagaa ctattcattc tttaaggcat gtatttccaa
61 gcctttgtat ttttttccat gcttagggtt ggcaaggaat atatatatat ttgtacaaat
121 atatatgtgt atatgtacaa atacatgtat atatagtaca aatatatata tatatttgta
181 caattcttca gactttgtag aatttgtata atgtcgtatc ttgctttttt taaccactga
241 tgttataagc atatttatgc cacttcattc attttagaga cttaataata aatgatctag
301 tggataattt atcattccct gatggagaaa aatttagctt tgtttatttt agagttataa
361 acgatgctgg gtcaggtatc tttatgtttg aagatggctc catatttggg ttgtttccac
421 agaactcttt cctagaaatg ctttttctag gttaatggct acagatattt ctaggcacct
481 gacatattga cacccacctc taaagtattt ttatgatcca caactagcgt ttaacacagc
541 gccctagtca ctacatgact aataaataga caaatgactg aaacatgacc tcatgctttc
601 tattcctcca gctttcattc agttctttgc ctctgggagg aggaagggtt gtgcagccct
661 ccacagcatc agcccatcaa ccctatccct gtggttatag cagctgagga agcagaattg
721 cagctctgtg ggaaggaatg gggctggaga gttcatgcac agaccagttc ttatgagaag
781 ggactgacta agaatagcct tgggttgaca tatacccctc ttcacactca caggagaaac
841 catttcccta tgaaactata acaagtcatg agttgagagc tgagagttag agaatagctc
901 aaagatgcta ttcttggata tcctgagccc ctgtggtcac cagggaccct gagttgtgca
961 acttagcatg acagcatcac tacgcttaaa aatttccctc ctcaccccca gattccattt
1021 ccccatccgc cagggctgcc tataaagagg agagctggtt tcagacttca gaaggacacg
1081 ggcagcagac agtggtcagt cctttcttgg ctctgctgac actcgagccc acattccgtc
1141 acctgctcag aatcatgcag gtctccactg ctgcccttgc tgtcctcctc tgcaccatgg
1201 ctctctgcaa ccagttctct gcatcacgtg agtctgagtt tcgttgtggg tatcaccact
1261 ctctggccat ggttagacca catcaatctt ttcttgtggc ctaaaagccc ccaagagaaa
1321 agagaacttc ttaaagggct gccaaacatc ttggtctttc tctttaagac ttttattttt
1381 atctctagaa ggggtcttag ccccctagtc tccaggtatg agaatctagg caggggcagg
1441 ggagttacag tcccttttac agatagaaaa acagggttcg aaacgaatca gttagcaaga
1501 ggcagaatcc agggctgctt acttcccagt ggggtatgtt gttcactctc cagctcactc
1561 taggtctccc aggagctctg tcccttggat gtcttatgag agatgtccaa ggcttctctt
1621 gggttggggt atgacttctt gaaccagaca aaattccctg aagagaactg agataagaga
1681 acagtccgtt caggtatctg gatcacacag agaaacagag aacccactat gaagagtcaa
1741 ggagaaagaa ggatacagac agaaacaaag agacatttct cagcaaaaat gcccaaatgc
1801 cttccagtca cttggtctga gcaagcctgc cttcctcaac tgctcgggga tcagaagctg
1861 cctggccttt tcttctgagc tgtgactcgg gctcattctc ttcctttctc cacagttgct
1921 gctgacacgc cgaccgcctg ctgcttcagc tacacctccc ggcagattcc acagaatttc
1981 atagctgact actttgagac gagcagccag tgctccaagc ccggtgtcat gtaagtgcca
2041 gtcttcctgc tcacctctat ggaggtaggg agggtcaggg ttggggcaga gacaggccag
2101 aaggctatcc tggaaaggcc cagccttcag gagcctatcg gggatacagg acgcagggct
2161 ccgaggtgtg acctgacttg gagctggagt gaggcatgtg ttacagagtc aggaagggct
2221 gccccagccc agaggaaagg gacaggaaga aggaggcagc gggacactct gagggccacc
2281 cctactgagt cactgagaga agctctctag acagagatag gcagggggcc cctgaaagag
2341 gagcaagccc tgagctgccc aggacagaga gcagaatggt ggggccatgg tgggcccagg
2401 attcccctgc tggattcccc agtgcttaac tcttcctccc ttctccacag cttcctaacc
2461 aagcgaagcc ggcaggtctg tgctgacccc agtgaggagt gggtccagaa atatgtcagc
2521 gacctggagc tgagtgcctg aggggtccag aagcttcgag gcccagcgac ctcggtgggc
2581 ccagtgggga ggagcaggag cctgagcctt gggaacatgc gtgtgacctc cacagctacc
2641 tcttctatgg actggttgtt gccaaacagc cacactgtgg gactcttctt aacttaaatt
2701 ttaatttatt tatactattt agtttttgta atttattttc gatttcacag tgtgtttgtg
2761 attgtttgct ctgagagttc ccctgtcccc tcccccttcc ctcacaccgc gtctggtgac
2821 aaccgagtgg ctgtcatcag cctgtgtagg cagtcatggc accaaagcca ccagactgac
2881 aaatgtgtat cggatgcttt tgttcagggc tgtgatcggc ctggggaaat aataaagatg
2941 ctcttttaaa aggtaaacca gtattgagtt tggttttgtt tttctggcaa atcaaaatca
3001 ctggttaaga ggaatcatag gcaaagatta ggaagaggtg aaatggaggg aaattgggag
3061 agatggggag ggctaccaca gagttatcca ctttacaacg gagacacagt tctggaacat
3121 tgaaactacg aatatgttat aactcaaatc ataacatgca tgctctagga gaattc
//
LOCUS HUMLD78B 3112 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human cytokine LD78 beta gene.
ACCESSION D90145
KEYWORDS LD78; LD78 beta; cytokine; inducible gene family; secreted peptide.
SOURCE Human placenta DNA, clone Lm LD-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3112)
AUTHORS Nakao,M., Nomiyama,H. and Shimada,K.
TITLE Structures of human genes coding for cytokine LD78 and their
expression
JOURNAL Mol. Cell. Biol. 10, 3646-3658 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hisayuki
Nomiyama Department of Biochemistry Kumamoto University Medical
School 2-2-1 Honjo, Kumamoto 860 Japan Phone: 096-344-2111 Fax:
096-372-6140
FEATURES from to/span description
pept 1192 1267 cytokine LD78 beta precursor, exon 1
1953 2067 cytokine LD78 beta precursor, exon 2
2488 2578 cytokine LD78 beta precursor, exon 3
sigp 1192 1259 cytokine LD78 beta signal peptide
matp 1260 1267 cytokine LD78 beta mature peptide
1953 2067 cytokine LD78 beta mature peptide
2488 2575 cytokine LD78 beta mature peptide
pre-msg 1106 2995 cytokine LD78 beta mRNA and introns
IVS 1268 1952 cytokine LD78 intron A
IVS 2068 2487 cytokine LD78 intron B
rpt 498 797 Alu repeat
signal 1078 1082 TATA box
BASE COUNT 756 a 775 c 780 g 801 t
ORIGIN
1 ttagagactt aataataaag gatcttgtgg ataatttatc attccctgat agagaaaaat
61 ttagctttgc ttattttaga gttataaatg atgctgggtc aggtatcttt atgtttgaag
121 atggctccat atttgggttg tttccacaga actctttccc agaaatgctt tttctaggtt
181 aatggctaca catatttcta ggcacctgac atactgacac ccacctctaa agtattttta
241 tgatccacaa ctagcgttta acacagcgcc ccagtcactc cgagactaat aaatagacaa
301 atgactgaaa cgtgacctca tgctttctat tcctccagct ttcattgagt tcctttcctc
361 tgggaggact gggggttgtc tagccctcca cagcatcagc ccattgaccc tatccttgtg
421 gttatagcag ctgaggaagc agaattacag ctctgtggga aggaatgggg ctggagagtt
481 catgcataga ccaattcttt tttttttttt tttttgagat ggagtttcac ttttgttgcc
541 caggctggag tgcaatggca tgatctcagc tcaccacagc ccccacctcc tgggttcaag
601 cgattctcct gccctcagcc tcccgagtag ctgggattac aggcatgtgc caccacgcct
661 gactactttt gtatttttag tagagatgga gtttctcttt cttggtcagg ttggtctcaa
721 actcctgacc tcaggtgatc cgcagcctcg gcctcccaaa gtgttgggat tacaggtgtg
781 agcgaccatg cctggctgca tagaccagtt cttatgagaa gggatcaact aagaatagcc
841 ttgggttgac acacacccct cttcacactc acaggagaaa ccccatgaag ctagaaccag
901 tcatgagttg agagctgaga gttagagagt agctcagaga tgctattctt ggatatcctg
961 agcccctgtg gtcaccaggg accctgagtt gtgcaacact cagcatgaca gcatcactac
1021 acttaaaaat ttccctcctc acccccagat tccatttccc catccgccag ggctgcctat
1081 aaagaggaga gatggcttca gacatcagaa ggacgcaggc agcaaagagt agtcagtccc
1141 ttcttggctc tgctgacact cgagcccaca ttccatcacc tgctcccaat catgcaggtc
1201 tccactgctg cccttgccgt cctcctctgc accatggctc tctgcaacca ggtcctctct
1261 gcaccacgtg agtccatgtt gttgttgtgg gtatcaccac tctctggcca tggttagacc
1321 acatcagtct ttttttgcgg cctgagagcc ccgaagagaa aagaaggaag ttcttaaagc
1381 gctgccaaac accttggtct ttttcttcac aacttttatt tttatctcta gaaggggtct
1441 tagccctcct agtctccagg tatgagaatc taggcagggg caggggagtt acagtccctt
1501 gtacagatag aaaaacaggg ttcaaaacga atcagtttgc aagaggcaga atccagggct
1561 gcttacttcc cagtggggtc tgttgttcac tctccagctc accctaggtc tcccaggagc
1621 cctgtccctt ggatgtctta tgagagatgt ccagggcttc tcttgggctg gggtatgact
1681 tcttgaaccg acaaaattcc atgaagagag ctaagagaac agtccattca ggtatctgga
1741 tcacatagag aaacagagaa cccactatga agagtcaagg ggaaagagga atatagacag
1801 aaacaaagag acatttctct gcaaaacccc ccaaatgcct tgcagtcact tggtctgagc
1861 aagcctgccc tcctcaacca ctcagggatc agaagctgcc tggccttttc ttctgagctg
1921 tgactcgggc ttattctctc ctttctccgc agttgctgct gacacgccga ccgcctgctg
1981 cttcagctac acctcccgac agattccaca gaatttcata gctgactact ttgagacgag
2041 cagccagtgc tccaagccca gtgtcatgta agtgccagtc ttcctgctca cctctaggga
2101 ggtagggagt gtcagggtgg gggcagaaac aggccagaag gccatcctgg aaaggcccag
2161 ccttcaggag cctatcgggg atacaggacg cagggcactg aggtgtgacc tgacttgggg
2221 ctggagtgag gtgggtgtta cagagtcagg aagggctgcc ccaggccaga ggaaaggaac
2281 aggaagaagg aggcagcagg acactctgag ggcccccttg cctggagtca ctgagagaag
2341 ctctctagac ggagataggc agggggcccc tgagagagga gcaggccttg agctgcccag
2401 gacagagagc aggatgtcag gccatggtgg gcccaggatt ccccggctgg attccccagt
2461 gcttaactct tcctcccttc tccacagctt cctaaccaag agaggccggc aggtctgtgc
2521 tgaccccagt gaggagtggg tccagaaata cgtcagtgac ctggagctga gtgcctgagg
2581 ggtccagaag cttcgaggcc cagcgacctc agtgggccca gtggggagga gcaggagcct
2641 gagccttggg aacatgcgtg tgacctctac agctacctct tctatggact ggttattgcc
2701 aaacagccac actgtgggac tcttcttaac ttaaatttta atttatttat actatttagt
2761 ttttataatt tatttttgat ttcacagtgt gtttgtgatt gtttgctctg agagttcccc
2821 ctgtcccctc caccttccct cacagtgtgt ctggtgacga ccgagtggct gtcatcggcc
2881 tgtgtaggca gtcatggcac caaagccacc agactgacaa atgtgtatca gatgcttttg
2941 ttcagggctg tgatcggcct ggggaaataa taaagatgtt cttttaaacg gtaaaccagt
3001 attgagtttg gttttgtttt tctggcaaat caaaatcact agttaagagg aatcataggc
3061 aaagattagg aagaggtgaa atggagggaa actgggagag atggggagcg ct
//
LOCUS XELTRH 1442 bp ss-mRNA VRT 01-AUG-1990
DEFINITION X.laevis thyrotropin releasing hormone (TRH) mRNA, complete cds.
ACCESSION M34699 K00931 J05514
KEYWORDS thyrotropin releasing hormone.
SOURCE X.laevis skin, cDNA to mRNA, clone L4 and 8/136.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 478)
AUTHORS Richter,K., Kawashima,E., Egger,R. and Kreil,G.
TITLE Biosynthesis of thyrotropin releasing hormone in the skin of
Xenopus laevis: Partial sequence of the precursor deduced from
cloned cDNA
JOURNAL EMBO J. 3, 617-621 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 15 to 1442)
AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
TITLE Two precursors of thyrotropin releasing hormone from skin of
Xenopus laevis: Each contains seven copies of the end product
JOURNAL J. Biol. Chem. 265, 11731-11733 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by G.Kreil, 18-MAY-1990.
FEATURES from to/span description
pept 110 793 thyrotropin releasing hormone precursor
matp 332 340 thyrotropin releasing hormone copy 1
matp 374 382 thyrotropin releasing hormone copy 2
matp 428 436 thyrotropin releasing hormone copy 3
matp 470 478 thyrotropin releasing hormone copy 4
matp 566 574 thyrotropin releasing hormone copy 5
matp 611 619 thyrotropin releasing hormone copy 6
matp 686 694 thyrotropin releasing hormone copy 7
mRNA < 1 1442 TRH mRNA
conflict 139 139 t in [2]; c in [1]
conflict 214 216 tct in [2]; ctc in [1]
conflict 319 319 g in [2]; t in [1]
BASE COUNT 460 a 286 c 334 g 362 t
ORIGIN
1 agcacagagc agcacaagga cacactctgc atattgtgct gccggacaag gaggtgacag
61 ccagtcaggc tgagacaaag gaacttccag acctctgaca gcaggaaaga tggtgtctgt
121 ctggtggttg ctgcttcttg gtacaaccgt atctcacatg gtgcacacac aagagcagcc
181 tttactggag gaggacacag caccattaga tgatctggat gttcttgaga aagccaaagg
241 tatcctgatc cgcagtatcc tggagggatt tcaagaaggg caacaaaaca atagagatct
301 accagatgca atggaaatga tatctaagcg ccagcaccca gggaaacgat tccaggagga
361 gatagaaaag agacaacacc ctggaaagag ggatctggaa gatctgaatc tagagctttc
421 caaaaggcaa caccccggaa gaagatttgt ggatgatgta gagaagaggc aacatccagg
481 aaagagagaa gagggtgact ggagtaggag gtatctgaca gatgactcac gttatttgga
541 cctcctttct gatgtttcca ggagacagca cccaggcaaa agagttccag ccccattgtt
601 tacaaaacgt caacacccag gtaagagagt gacagaagaa gagggtgata ctgaatttga
661 aaactcgaag gaagtgggga agcgccagca tccaggaaag agatatgacc cttgtgaagg
721 ccctaatgcc tacaactgta actcaggaaa cattctaccg gattctgtag aagaattgag
781 ttttgggctt taagctgccc agccccttta ttagttccat ctgaccctaa atgattccca
841 atgaacacaa ctttctataa ttgttaaata acattgtatt aagtatcata catttctgga
901 aagcaagcag ctcttagaac acttcttcgc tttaaaaggc acctggggca taagagtatt
961 aagcttcaga cagtaacctg cccaccacag ggagggattc aacaatcaca attggctgag
1021 tgttcctttc ccttgtttgg cagtgagatc agataataaa tataagatgg ccaggaaagt
1081 ggactctttc ttttctgaaa atttgcaagt aacaccaaaa tataataatt tgcacactca
1141 gtagtattaa cgtgaagatc tcaagaaggt tataaattct tggtgatctg ctcaaagcat
1201 ttaattcata gttgcttcca tggtttgatg gggaatgcac attctaaatt gcttattgct
1261 aattagcgct tgccacacag ttctggtggt agatcttgat gaggcatatt caataaaagt
1321 agagcccata gtaaaatttg tgccccgtca gctttaagga tcctctgtaa gcaatatgtg
1381 ttgtgagggc cacttgtttc taaagtaata ttttcatttt aataaatatg tctactcaaa
1441 tg
//
LOCUS XELTRHA 2955 bp ss-mRNA VRT 01-AUG-1990
DEFINITION X.laevis thyrotropin releasing hormone mRNA, complete cds.
ACCESSION M34698 J05514
KEYWORDS thyrotropin releasing hormone.
SOURCE X.laevis, cDNA to mRNA, clone C6.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (sites)
AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
TITLE Two precursors of thyrotropin releasing hormone from skin of
Xenopus laevis: Each contains seven copies of the end product
JOURNAL J. Biol. Chem. 265, 11731-11733 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2955; for [1])
AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by G.Kreil, 18-MAY-1990.
FEATURES from to/span description
pept 157 831 thyrotropin releasing hormone
matp 379 387 thyrotropin releasing hormone copy 1
matp 421 429 thyrotropin releasing hormone copy 2
matp 475 483 thyrotropin releasing hormone copy 3
matp 517 525 thyrotropin releasing hormone copy 4
matp 613 621 thyrotropin releasing hormone copy 5
matp 658 666 thyrotropin releasing hormone copy 6
matp 733 741 thyrotropin releasing hormone copy 7
BASE COUNT 927 a 597 c 604 g 827 t
ORIGIN
1 catgcagttt attagatata cagtacaatg aagtcagtta tgagaaatag caattgcagc
61 acaaggacac actctgcata ttgtgctgcc ggacaaggag gtgacagcca gtcaggctga
121 gacaaaggaa cttccagacc tctgacagca ggaaagatgg tgtctgtctg gtggttgctg
181 cttcttggta caaccgtatc tcacatggtg cacacacaag agcagccttt actggaggag
241 gacacagcac cattagatga tctggatgtt cttgagaaag ccaaaggtat cctgatccgc
301 agtatcctgg agggatttca agaagggcaa caaaacaata gagatctacc agatgcaatg
361 gaaatgatat ctaagcgcca gcacccaggg aaacgattcc aggaggagat agaaaagaga
421 caacaccctg gaaagaggga tctggaagat ctgaatctag agctttccaa aaggcaacac
481 cccggaagaa gatttgtgga tgatgtagag aagaggcaac atccaggaaa gagagaagag
541 ggtgactgga gtaggaggta tctgacagat gactcacgtt atttggacct cctttctgat
601 gtttccagga gacagcaccc aggcaaaaga gttccagccc cattgtttac aaaacgtcaa
661 cacccaggta agagagtgac agaagaagag ggtgatactg aatttgaaaa ctcgaaggaa
721 gtggggaagc gccagcatcc aggaaagaga tatgaccctt gtgaaggccc taatgcctac
781 aactgtaact caggaaacat tctaccggaa gaattgagtt ttgggcttta agctgcccag
841 cccctttatt agttccatct gaccctaaat gattcccaat gaacacaact ttctataatt
901 gttaaataac attgtattaa gtatcataca tttctggaaa gcaagcagct cttagaacac
961 ttcttcgctt taaaaggcac ctggggcata agagtattaa gcttcagaca gtaacctgcc
1021 caccacaggg agggattcaa caatcacaat tggctgagtg ttcctttccc ttgtttggca
1081 gtgagatcag ataaataaat ataagatggc caggaaagtg gactctttct tttctgaaaa
1141 tttgcaagta acaccaaaat ataataattt tgcactctgc agtgtattaa cgtgaagatc
1201 tcaagaaggt tataaattag gttataaatt cttggtgatc tgctcaaagc atttaattca
1261 tagttgcttc catggtttga tggggaatgc acattctaaa ttgcttattg ctaattagcg
1321 cttgccacac agttctggtg gtagatcttg atgaggcata ttcaataaaa gtagagccca
1381 tagtaaaatt tgtgccccgt cagctttaag gatcctctgt aagcaatatg tgttgtgagg
1441 gccacttgtt tctaaagtaa tattttcatt ttaataaata tgtctactca aatgacaaaa
1501 acattcatta tttcactaca ttatactcct tcccacagca attatgtacc tatgaatcct
1561 gatagaagac tgcagttttc ctcttatatc ctccatgttg gattcaccat aagtcaccaa
1621 aatatatcta tagggaagca cactatacac aatagcagtg acccccatcc agtggcttgt
1681 gggcaacaag ctactcacca acccccttgg ctgttgctcc cagtggccct aaagtaaggt
1741 gcataaaaaa accagatgaa cttgtcaaaa agagcctccc ttagactgcc ttgttccaca
1801 tagaggctac catatagcca atcacagccc ttatttggca cccccgggaa cttttttcat
1861 gcttgagttg ctccccaaat ctttttacag ttgaatatgt ctcatggcta aaaaaacgtg
1921 aggaccccgg cgtaatatag tataatatac acacactcac tttggaaaac tctatggaga
1981 tcaataagca cttttgggtt aaactatttt tttgatacaa tttgagcact ttatatatgg
2041 attttaaaga tattccgctt tagtagtctg tggtgcgctg ccccataaat atattggtga
2101 attattcacc acctactctt aacaattctg ctcaattcat ctagatgtta acataataca
2161 tcaccagtat cacaatggca gcgggaagca aagacattct gtagtgtcct gagaccagct
2221 aaagcctaga ggtggaccat aaataatgtc tattgcaggg tcagtacaaa caaaaacacc
2281 aaggctgctt tatacaaggc atatctaatt tgcaggtatt ttgctgaact attactccac
2341 acacaaagct tgagggacac agactaataa tctgctgaag gtttgcagga tggacagttg
2401 gacactgctt tgcttcaact ttattctagg cttgtgctct gatgtatgca gcgtcaaata
2461 ccagctgttg tttgactaca actcccagaa gcctcagcat actgagggtg gtatgcttga
2521 atgcttgaat gcttgaatac cgaaggctgt ctgtcctcca acacctcccg ttgatctccc
2581 gctccagctc ttattgtcat tccattgtat attttgtttt taaatgtata aagaaataaa
2641 aaaaaagtat gatatattca cccttcttct tctgagtata aaaagattta aatgaatgtg
2701 aaaataatat ttttatagac aacaatcttt gtgcagtgtt ggtaaataca tgtttattct
2761 gtatatagct attttaatat gcatactgaa agaatatata tatataataa gaagcatgaa
2821 catctcattg cctgggtatg aaacaataaa gattgcatct gataatgaag caaattcgct
2881 ctgtggcgca gtattatgtt gacctgatga tgaagttagg tctggtgcgc ttctcaatgt
2941 tcgtggcgct ggccc
//
LOCUS PVICSD 1107 bp ds-DNA INV 01-AUG-1990
DEFINITION P.vivax circumsporozoite protein gene, complete cds.
ACCESSION M34697
KEYWORDS circumsporozoite protein.
SOURCE P.vivax (strain Thai; isolate NYU Thai) sporozoite DNA.
ORGANISM Plasmodium vivax
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 286 to 798)
AUTHORS Arnot,D.E., Stewart,M.J. and Barnwell,J.W.
TITLE Antigenic diversity in Thai Plasmodium vivax circumsporozoite
proteins
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 285; 799 to 1107)
AUTHORS Arnot,D.E., Stewart,M.J. and Barnwell,J.W.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Arnot, 18-MAY-1990.
The bases in reference [2] are identical to bases 140 to 442 and
995 to 1294 of the sequence of the North Korean strain published
in Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106; accession number
M20671.
Author address: D.E.Arnot
Dept. of Genetics
University of Edinburgh
West Mains Rd., Edingburgh, EM93JM
Scotland
FEATURES from to/span description
pept 1 1107 circumsporozoite protein
BASE COUNT 382 a 212 c 338 g 175 t
ORIGIN
1 atgaagaact tcattctctt ggctgtttct tccatcctgt tggtggactt gttccccacg
61 cactgcgggc acaatgtaga tctgtccaag gccataaatt taaatggagt aaacttcaat
121 aatgtagacg ccagttcact tggcgcggca cacgtaggac aaagtgctag ccgaggcaga
181 ggacttggtg agaacccaga tgacgaggaa ggagatgcta aaaaaaaaaa ggatggaaag
241 aaagcagaac caaaaaatcc acgtgaaaat aagctgaaac aaccaggaga cagagcagat
301 ggacagccag caggagacag agcagatgga cagccagcag gtgatagagc agatggacaa
361 ccagcaggtg atagagctgg acagccagca ggagatagag cagatggaca gccagcagga
421 gacagagcag atggacagcc agcaggagac agagcagatg gacagccagc aggagacaga
481 gcagatggac agccagcagg tgacagagct ggacaaccag caggtgatag agctggacag
541 ccagcaggcg atagagcaga tggacagcca gcaggagata gagctggaca gccagcaggc
601 gatagagcag atggacagcc agcaggagat agagctggac aaccagcagg agatagagca
661 gatggacaac cagcaggaga tagagctgga cagccagcag gagatagagc tggacagcca
721 gcaggagata gagctggaca gccagcagga gatagagctg gacagccagc aggaaatggt
781 gcaggtggac aggcagcagg aggaaacgca ggaggacagg gacaaaataa tgaaggtgcg
841 aatgccccaa atgaaaagtc tgtgaaagaa tacctagata aagttagagc taccgttggc
901 accgaatgga ctccatgcag tgtaacctgt ggagtgggtg taagagtcag aagaagagtt
961 aatgcagcta acaaaaaacc agaggatctt actttgaatg accttgagac tgatgtttgt
1021 acaatggata agtgtgctgg catatttaac gttgtgagta attcattagg gctagtcata
1081 ttgttagtcc tagcattatt caattaa
//
LOCUS ATTRRA 119 bp ss-RNA RNA 01-AUG-1990
DEFINITION A.solani 5S rRNA.
ACCESSION M35573
KEYWORDS 5S ribosomal RNA.
SOURCE A.solani (strain CBS 277-32) 5S rRNA.
ORGANISM Atractiella solani
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Phragmobasidiomycetes; Heterobasidiomycetidae; Auriculariales;
Auriculariaceae.
REFERENCE 1 (bases 1 to 119)
AUTHORS Blanz,P.A. and Gottschalk,M.
TITLE Systematic position of Septobasidium, Graphiola and other
basidiomycetes as deduced on the basis of their 5S ribosomal RNA
nucleotide sequences
JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 119 5S rRNA
BASE COUNT 29 a 30 c 34 g 26 t
ORIGIN
1 aggtgcgacc ataccgtgtt gaaaattctg catcccgtcc gatctgcaaa gacaagcaac
61 acagggccca gtcagtagtg cggtgggtga ccacgtgcga atactgtggt gttgcactt
//
LOCUS CETRRA 118 bp ss-RNA RNA 01-AUG-1990
DEFINITION C.cornigerum 5S rRNA.
ACCESSION M35577
KEYWORDS 5S ribosomal RNA.
SOURCE C.cornigerum (strain FO 29225) 5S rRNA.
ORGANISM Ceratobasidium cornigerum
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Phragmobasidiomycetes; Metabasidiomycetidae; Metatremellales;
Ceratobasidiaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Blanz,P.A. and Gottschalk,M.
TITLE Systematic position of Septobasidium, Graphiola and other
basidiomycetes as deduced on the basis of their 5S ribosomal RNA
nucleotide sequences
JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 118 5S rRNA
BASE COUNT 23 a 35 c 37 g 23 t
ORIGIN
1 atccacggcc ataggacttc gaaagcaccg catcccgtcc gatctgcgca gttaaccgga
61 gtgccgccta gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt
//
LOCUS GRARRA 118 bp ss-RNA RNA 01-AUG-1990
DEFINITION G.phoenicis 5S rRNA.
ACCESSION M35575
KEYWORDS 5S ribosomal RNA.
SOURCE G.phoenicis (strain PB 4349) 5S rRNA.
ORGANISM Graphiola phoenicis
Eukaryota; Plantae; Thallobionta; Basidiomycotina; Teliomycetes;
Ustilaginales; Graphiolaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Blanz,P.A. and Gottschalk,M.
TITLE Systematic position of Septobasidium, Graphiola and other
basidiomycetes as deduced on the basis of their 5S ribosomal RNA
nucleotide sequences
JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 118 5S rRNA
BASE COUNT 26 a 33 c 36 g 23 t
ORIGIN
1 atctgcggcc atagaaccgt gaaaataccg catcccgtcc gatctgcgaa gtcaagcacg
61 gtatcgccta gtcagtactg cggtggggga ccacgcggga atcctgggtg ctgcagtt
//
LOCUS PLARRA 119 bp ss-RNA RNA 01-AUG-1990
DEFINITION P.peniophorae 5S rRNA.
ACCESSION M35571
KEYWORDS 5S ribosomal RNA.
SOURCE P.peniophorae (strain FO 22315) 5S rRNA.
ORGANISM Platygloea peniophorae
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Phragmobasidiomycetes; Heterobasidiomycetidae; Auriculariales;
Auriculariaceae.
REFERENCE 1 (bases 1 to 119)
AUTHORS Blanz,P.A. and Gottschalk,M.
TITLE Systematic position of Septobasidium, Graphiola and other
basidiomycetes as deduced on the basis of their 5S ribosomal RNA
nucleotide sequences
JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 119 5S rRNA
BASE COUNT 27 a 35 c 36 g 21 t
ORIGIN
1 atctgcggcc ataccgtgat gaacattccg cgtcccgtcc gatccgcgca gacaagcatc
61 acaggggcca gagagtattg acgtgggtga ccagtcgaga acactgtgct gccgcaggt
//
LOCUS PLERRA 119 bp ss-RNA RNA 01-AUG-1990
DEFINITION P.faginea 5S rRNA.
ACCESSION M35574
KEYWORDS 5S ribosomal RNA.
SOURCE P.faginea (strain FO 22315) 5S rRNA.
ORGANISM Phleogena faginea
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Phragmobasidiomycetes; Heterobasidiomycetidae; Eutremellales;
Phleogenaceae.
REFERENCE 1 (bases 1 to 119)
AUTHORS Blanz,P.A. and Gottschalk,M.
TITLE Systematic position of Septobasidium, Graphiola and other
basidiomycetes as deduced on the basis of their 5S ribosomal RNA
nucleotide sequences
JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 119 5S rRNA
BASE COUNT 28 a 30 c 34 g 27 t
ORIGIN
1 atgtgcgacc ataccaagct gaaaatactg catcccgtct gatctgcaca gtcaagcagc
61 ttagggccca gtcagtagtg cggtggggga ccatgcgcga acattgtggt gttgcactt
//
LOCUS SEPRRA 119 bp ss-RNA RNA 01-AUG-1990
DEFINITION S.carestianum 5S rRNA.
ACCESSION M35572
KEYWORDS 5S ribosomal RNA.
SOURCE S.carestianum (strain FO 25109) 5S rRNA.
ORGANISM Septobasidium carestianum
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Phragmobasidiomycetes; Heterobasidiomycetidae; Septobasidiales;
Septobasidiaceae.
REFERENCE 1 (bases 1 to 119)
AUTHORS Blanz,P.A. and Gottschalk,M.
TITLE Systematic position of Septobasidium, Graphiola and other
basidiomycetes as deduced on the basis of their 5S ribosomal RNA
nucleotide sequences
JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 119 5S rRNA
BASE COUNT 25 a 37 c 36 g 21 t
ORIGIN
1 atctggggcc ataccacagt gaacacaccg catcccgtcc gatctgcgca gttaaccact
61 gtagggccga gtcagtagtg cggtggggga ccacgcgcga atactctggt gccccaggt
//
LOCUS TULRRA 118 bp ss-RNA RNA 01-AUG-1990
DEFINITION T.violea 5S rRNA.
ACCESSION M35576
KEYWORDS 5S ribosomal RNA.
SOURCE T.violea (strain FO 29326) 5S rRNA.
ORGANISM Tulasnella violea
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Phragmobasidiomycetes; Heterobasidiomycetidae; Tremellales;
Tulasnellaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Blanz,P.A. and Gottschalk,M.
TITLE Systematic position of Septobasidium, Graphiola and other
basidiomycetes as deduced on the basis of their 5S ribosomal RNA
nucleotide sequences
JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 118 5S rRNA
BASE COUNT 30 a 29 c 31 g 28 t
ORIGIN
1 atcttcggcc ataggacaga gaaaataccg catcccgtcc gatctgcgca gtcaagctct
61 gtaccgctta gttagtacca tagtggggga ccatatggga atcctgagtg ctgaagtt
//
LOCUS UTHRRA 118 bp ss-RNA RNA 01-AUG-1990
DEFINITION U.fusisporum 5S rRNA.
ACCESSION M35578
KEYWORDS 5S ribosomal RNA.
SOURCE U.fusisporum (strain FO 25106) 5S rRNA.
ORGANISM Uthatobasidium fusisporum
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Phragmobasidiomycetes; Heterobasidiomycetidae; Tremellales;
Tulasnellaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Blanz,P.A. and Gottschalk,M.
TITLE Systematic position of Septobasidium, Graphiola and other
basidiomycetes as deduced on the basis of their 5S ribosomal RNA
nucleotide sequences
JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 > 118 5S rRNA
BASE COUNT 23 a 35 c 37 g 23 t
ORIGIN
1 atccacggcc ataggacttc gaaagcaccg catcccgtcc gatctgcgca gttaaccgga
61 gtgccgccta gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt
//
LOCUS C11CMIA 2149 bp ds-DNA BCT 01-AUG-1990
DEFINITION Plasmid pColBM-C1139 colicin lysis protein (cmi) gene, 5' end.
ACCESSION M35683
KEYWORDS colicin lysis protein.
SOURCE Plasmid pColBM-C1139 DNA.
ORGANISM Plasmid pColBM-C1139
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 2149)
AUTHORS Thumm,G., Oelschlaeger,T. and Braun,V.
TITLE Plasmid pColBM-C1139 does not encode a colicin lysis protein but
contains sequences highly homologous to the D protein (resolvase)
and the oriV region of the miniF plasmid
JOURNAL Plasmid 20, 75-82 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 1108 1890 ORF
pept 1991 > 2149 colicin lysis protein (cmi)
BASE COUNT 512 a 510 c 578 g 549 t
ORIGIN
1 gaattcatct tttggccgtt tacgtctgtt ccgttatcct gatgatacga tgttctgcac
61 gttctgccgg gaagatgcag atgattcgct taaaagtatt atgacccatc tctgggagct
121 ggatgcagag atgacagatc ctgtcatagc tatgtttaat cacgtctgag tgccgtgagt
181 gatttctgtc ttttatgcaa cagtgccaag atattgtaat caaaaaaaag cattaatgca
241 ttttggacag taatctattt taattgatga catagaggca ttaatctttc tttttcttca
301 ggaagatccg aaaactcctg gtcacggatc ttcctctccc ccacacaacg ccacctcctg
361 taagcacaac atgtggtgcc ggattcagct gctgatgaca ctatatgttg tgtcatctcc
421 ctgacctgtg atgcgtcgcg caggggcgga aaacagcgat atgatgattt cctcggcgtg
481 gtacacttcc ggaaagttgt gatattccgg aaagtcggat ctgacggaaa cggctctccg
541 gtaatttaac ggcgtggtta tatggatgct tgttatcatg gtgatgatga taacggcatg
601 atgttatcag acggcgtgac ggtaagggca gtgatgatgg atgacgttat cgcatgaccg
661 tccctgcccg gaaaagaaaa aaggagtcac ccatgttttt tattgagaat gaaggtcagg
721 ctgtcgccgg aacggattac tggcagtctg tacaggcgca ggccggatat gtctacctca
781 gctggaatgc cggcgcagcc aggctgcttg tcccggatgc ggcaaaacat ttactcaggg
841 agatgcgggg ggctgagtac gtcatcatca gtaagggagc actgcatggc cgcgatgcgc
901 tggaactggt atttgaagac ggcagcgatg cgccgtttgt gatccacatg ctgagtgagc
961 agtgcgatcg cctgctcccc gaaaacaacc agggaggggg ttttgttgtc accgtctgga
1021 cgcgtggcgg taaccagctc cgttatccgg gaaagtaccg ggttgtggaa aacctgcccg
1081 acgtttcccc gtggagtgaa cactgatatg cagcacctgc cggcaccgat ccaccatgcc
1141 cgggatgctg ttcagcttcc tgttgccatc gattatccgg cagcgctggc actccgccag
1201 atgtcgatgg ttcatgatga actgcccaaa tacctgctgg cccctgaagt gagcgccctg
1261 ctccattacg tcccggatct gcgccgcaag atgctgctgg ccacactgtg gaacaccggt
1321 gcgcgcatta atgaagcact ggcgctgacg cggggggatt tttcgctcac gcctccgtat
1381 ccgtttgtgc agctggccac tctgaagcag cggacagaaa aagccgccag gacggcagga
1441 agaatgcccg ccggtcagca gactcaccgg ctggttccgc tctccgactc ctggtacgtc
1501 agccagctgc agacgatggt agccacactg aaaatcccca tggaacggcg taataaacga
1561 acaggcagga cagagaaagc gcggatctgg gaagtgacgg acagaacggt caggacctgg
1621 attggggagg cggttgccgc cgctgccgct gatggtgtga cgttctctgt cccggtcacg
1681 ccacatacgt tccgccattc ctatgcgatg cacatgctgt atgccggtat accgcttaag
1741 gttctgcaga gtctgatggg gcataagtcc atcagctcaa cagaggtcta cacgaaggtg
1801 tttgcactgg atgtggctgc acggcaccgg gtgcagtttt cgatgcctga gtccgatgcg
1861 gtcacaatgc tgaaaaacag acatgcataa taagtcacaa ttatgaattg tgatttcttc
1921 tataaaaaag agaccactgc aatatgtgat ctcttgtatt atttcataat tgttaaagcc
1981 acttcacagt atgctcacat tgtacggata tattcgtaat gtttttttat atcgaatgaa
2041 cgacagaagt tgtggagatt ttatgaaagt aattagcatg aaatttattt ttattttaac
2101 gattattgct cttgctgctg tttttttctg gtctgaagat aaaggtccg
//
LOCUS DOGPPPP 427 bp ss-mRNA MAM 01-AUG-1990
DEFINITION Canine pancreatic polypeptide mRNA, complete cds.
ACCESSION M35596
KEYWORDS pancreatic polypeptide.
SOURCE Canine pancreas, cDNA to mRNA.
ORGANISM Canis lupus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 427)
AUTHORS Toothman,P. and Paquette,T.L.
TITLE Canine pancreatic polypeptide complementary deoxyribonucleic acid
sequence: Pancreatic polypeptide and insulin messenger ribonucleic
acid distribution in the lobes of the pancreas
JOURNAL Mol. Endocrinol. 1, 413-419 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 21 302 pancreatic polypeptide precursor
sigp 21 107 pancreatic polypeptide signal peptide
matp 108 215 pancreatic polypeptide
matp 225 284 icosapeptide
mRNA 1 427 pancreatic polypeptide mRNA
BASE COUNT 88 a 149 c 115 g 75 t
ORIGIN
1 tccgcccctt aggactcggg atgcctgccg cctgccgctg cctcttcctg ctgctcctgt
61 cagcctgtgt ggctctgttg ctgcagccgc cactgggtac ccggggggcc ccgctggagc
121 cagtgtatcc gggggacgat gccacaccag agcagatggc ccagtacgcg gctgagctcc
181 gcagatacat caacatgctg accaggccca ggtatgggaa aagagacaga ggagaaatgc
241 gggacatcct ggaatggggc tccccccatg cagccgcccc cagggagctg atggacgagt
301 aatgccacct ccaagtaatg ccacctctgc ctctcaggcc aatgccagcc tacctctccc
361 ctctgcaccc ctggccaaag cttgctccct gctctcacac acagactaaa taaagcaagt
421 caaagtc
//
LOCUS GVICG 296 bp ss-RNA circular VRL 01-AUG-1990
DEFINITION Grapevine viroid grapevine isolate (SHV-g(GV)) complete genome.
ACCESSION M35717
KEYWORDS complete genome.
SOURCE Grapevine viroid RNA.
ORGANISM Grapevine viroid
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 296)
AUTHORS Sano,T., Ohshima,K., Hataya,T., Uyeda,I., Shikata,E., Chou,T.-G.,
Meshi,T. and Okada,Y.
TITLE A viroid resembling hop stunt viroid in grapevines from Europe, the
United States and Japan
JOURNAL J. Gen. Virol. 67, 1673-1678 (1986)
STANDARD simple staff_review
BASE COUNT 60 a 87 c 80 g 69 t
ORIGIN
1 ctggggaatt ctcgagttgc cgcatcaggc aagcaaagaa aaaacaaggc agggaggtac
61 ttacctgaga aaggagcccc ggggcaactc ttctcagaat ccagcgagag gcgtggagag
121 agggccgcgg tgctctggag tagaggctct gcttcagaac accatcgatc gtcccttctt
181 ctttaccttc ttctggctct tccgatgaga cgcgaccggt ggcatcacct ctcggttcgt
241 cccaacctgc tttttgtcta tctgagcctc tgccgcggat cctctcttga gcccct
//
LOCUS HUMTCAJK 94 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human T-cell receptor germline J-alpha RP DNA, partial cds.
ACCESSION M35619
KEYWORDS T-cell receptor alpha-chain; antigen receptor; germline;
joining exon.
SOURCE Human T-cell line RPMI 8402 DNA, clone lambda-R15.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 94)
AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
alpha locus and aberrant recombination between J-alpha segments
JOURNAL EMBO J. 7, 1661-1668 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept / 32 / 92 T-cell receptor germline J-alpha RP region (AA
at 32) /hgml_locus_uid="LX0123X"
/nomgen="TCRA" /map="14q11.2"
IVS 93 > 94 TCR intron
signal 2 10 nonamer recombination signal
signal 23 29 heptamer recombination signal
BASE COUNT 25 a 23 c 22 g 24 t
ORIGIN
1 aggtttctgt tatgaagcat ctcacagtgt aaataccggc actgccagta aactcacctt
61 tgggactgga acaagacttc aggtcacgct cggt
//
LOCUS HUMTCAJM 80 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human T-cell receptor unproductively rearranged J-alpha AA/J-alpha
AB DNA pseudogene, partial cds.
ACCESSION M35621
KEYWORDS T-cell receptor alpha-chain; antigen receptor; joining exon;
processed gene; pseudogene.
SOURCE Human cell line AT5-B1 tumor DNA, clone lambda-A30.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 80)
AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
alpha locus and aberrant recombination between J-alpha segments
JOURNAL EMBO J. 7, 1661-1668 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps / 30 > 78 T-cell receptor unproductively rearranged
J-alpha AA/J-alpha AB region (AA at 30)
/hgml_locus_uid="LX0123X" /nomgen="TCRA"
/map="14q11.2"
recomb 26 27 J-alpha AA end/J-alpha AB start
signal 7 15 nonamer recombination signal
BASE COUNT 23 a 15 c 18 g 24 t
ORIGIN
1 tatgttggtt tatgtagaga cacatataga ccgacaagct catctttggg actgggacca
61 gattacaagt ctttccaagt
//
LOCUS HUMTCAZI 520 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human T-cell receptor productively rearranged V-alpha-J-alpha DNA,
exons 1 and 2.
ACCESSION M35617
KEYWORDS T-cell receptor alpha-chain; antigen receptor; joining exon;
processed gene; variable region.
SOURCE Human T-cell line RPMI 8402 DNA, clone lambda-R10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 520)
AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
alpha locus and aberrant recombination between J-alpha segments
JOURNAL EMBO J. 7, 1661-1668 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 11 56 T-cell receptor V-alpha-J-alpha region, exon 1
/hgml_locus_uid="LX0123X" /nomgen="TCRA"
/map="14q11.2"
166 / 513 T-cell receptor V-alpha-J-alpha region, exon 2
IVS 57 165 T-cell receptor intron A
IVS 514 > 520 T-cell receptor intron B
BASE COUNT 115 a 129 c 122 g 154 t
ORIGIN
1 ttgctcagcc atgctcctgg agcttatccc actgctgggg atacattttg tcctgagtga
61 gtaaaaattt ctttatggtc tctagttcca caggttctga ctagaaatgc ttgcttttta
121 tactgagtct gcactgcttt cactgatagt acgttgtttt tccaggaact gccagagccc
181 agtcagtgac ccagcctgac atccacatca ctgtctctga aggagcctca ctggagttga
241 gatgtaacta ttcctatggg gcaacacctt atctcttctg gtatgtccag tcccccggcc
301 aaggcctcca gctgctcctg aagtactttt caggagacac tctggttcaa ggcattaaag
361 gctttgaggc tgaatttaag aggagtcaat cttccttcaa cctgaggaaa ccctctgtgc
421 attggagtga tgctgctgag tacttctgtg ctgtggttgg cactgccagt aaactcacct
481 ttgggactgg aacaagactt caggtcacgc tcggtaggta
//
LOCUS HUMTCAZJ 130 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human T-cell receptor unproductively rearranged J-alpha RX/J-alpha
RP DNA, partial cds.
ACCESSION M35618
KEYWORDS T-cell receptor alpha-chain; antigen receptor; joining exon;
processed gene.
SOURCE Human T-cell line RPMI 8402 DNA, clone lambda-R15.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 130)
AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
alpha locus and aberrant recombination between J-alpha segments
JOURNAL EMBO J. 7, 1661-1668 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps / 71 / 127 T-cell receptor J-alpha RP region (AA at 71)
/hgml_locus_uid="LX0123X" /nomgen="TCRA"
/map="14q11.2"
pept.ps / 68 / 9 (c) T-cell receptor J-alpha RX (AA at 68)
recomb 69 70 J-alpha RX end/J-alpha RP start
BASE COUNT 29 a 42 c 22 g 37 t
ORIGIN
1 tttaaagata gcttcactct cacttgcgtc cccattccaa atgtaaattt cctgtttccc
61 cccctccgtt accggcactg ccagtaaact cacctttggg actggaacaa gacttcaggt
121 cacgctcggt
//
LOCUS HUMTCAZL 97 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human T-cell receptor germline J-alpha AA DNA, partial cds.
ACCESSION M35620
KEYWORDS T-cell receptor alpha-chain; antigen receptor; germline;
joining exon.
SOURCE Human cell line AT5-B1 tumor DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 97)
AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
alpha locus and aberrant recombination between J-alpha segments
JOURNAL EMBO J. 7, 1661-1668 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept / 35 / 95 T-cell receptor germline J-alpha RP region (AA
at 35) /hgml_locus_uid="LX0123X"
/nomgen="TCRA" /map="14q11.2"
IVS 96 > 97 TCR intron
signal 7 15 nonamer recombination signal
signal 28 34 heptamer recombination signal
BASE COUNT 31 a 19 c 21 g 26 t
ORIGIN
1 tatgttggtt tatgtagaga cacataacac tgtgactacc tcaggaacct acaaatacat
61 ctttggaaca ggcaccaggc tgaaggtttt agcaagt
//
LOCUS HUMTCAZN 89 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human T-cell receptor germline J-alpha AB DNA, partial cds.
ACCESSION M35622
KEYWORDS T-cell receptor alpha-chain; antigen receptor; germline;
joining exon.
SOURCE Human cell line AT5-B1 tumor DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 89)
AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
alpha locus and aberrant recombination between J-alpha segments
JOURNAL EMBO J. 7, 1661-1668 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept / 30 > 87 T-cell receptor germline J-alpha RP region (AA
at 30) /hgml_locus_uid="LX0123X"
/nomgen="TCRA" /map="14q11.2"
IVS 88 > 89 TCR intron
signal 2 10 nonamer recombination signal
signal 23 29 heptamer recombination signal
BASE COUNT 23 a 19 c 18 g 29 t
ORIGIN
1 aggtttttgt agatctcagt atcactgtgt cttataacac cgacaagctc atctttggga
61 ctgggaccag attacaagtc tttccaagt
//
LOCUS MUSBMTA 141 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end.
ACCESSION M35719
KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit.
SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-4.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 141)
AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C.
TITLE Expression of the beta-subunit gene of Murine thyrotropin results
in multiple messenger ribonucleic acid species which are generated
by alternative exon splicing
JOURNAL Mol. Endocrinol. 1, 875-883 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 118 > 141 thyrotropin beta-subunit
BASE COUNT 40 a 30 c 39 g 32 t
ORIGIN
1 agcagtaact cactcatgca aagtaagatc ctgcagtagt gggtggagaa gactgagcgc
61 atacgagtgg agagaaaaat attctgcttc agtcaagagc tggggttgtt caaaagcatg
121 agtgctgccg tcctcctctc c
//
LOCUS MUSBMTB 99 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end.
ACCESSION M35720
KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit.
SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 99)
AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C.
TITLE Expression of the beta-subunit gene of Murine thyrotropin results
in multiple messenger ribonucleic acid species which are generated
by alternative exon splicing
JOURNAL Mol. Endocrinol. 1, 875-883 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 76 > 99 thyrotropin beta-subunit
BASE COUNT 26 a 24 c 29 g 20 t
ORIGIN
1 agcagtaact cactcatgca aagtaagatc ctgcagtagt gggtggagaa gagtgaccgc
61 atacgagtgg agagcatgag tgctgccgtc ctcctctcc
//
LOCUS MUSBMTC 93 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end.
ACCESSION M35721
KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit.
SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 93)
AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C.
TITLE Expression of the beta-subunit gene of Murine thyrotropin results
in multiple messenger ribonucleic acid species which are generated
by alternative exon splicing
JOURNAL Mol. Endocrinol. 1, 875-883 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 70 > 93 thyrotropin beta-subunit
BASE COUNT 26 a 22 c 21 g 24 t
ORIGIN
1 agcagtaact cactcatgca aagtaagaaa aatattctgc ttcagtgaag agctggggtt
61 gttcaaagca tgagtgctgc cgtcctcctc tcc
//
LOCUS MUSBMTD 52 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end.
ACCESSION M35723
KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit.
SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 52)
AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C.
TITLE Expression of the beta-subunit gene of Murine thyrotropin results
in multiple messenger ribonucleic acid species which are generated
by alternative exon splicing
JOURNAL Mol. Endocrinol. 1, 875-883 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 29 > 52 thyrotropin beta-subunit
BASE COUNT 13 a 17 c 10 g 12 t
ORIGIN
1 agcagtaact cactcatgca aagtaagcat gagtgctgcc gtcctcctct cc
//
LOCUS MUSIGKCSU 444 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse Ig aberrantly rearranged kappa-chain mRNA V-J2-C-region,
complete cds.
ACCESSION M35669
KEYWORDS constant region; immunoglobulin light chain; joining exon;
kappa-immunoglobulin; variable region.
SOURCE Mouse myeloma MOPC-21, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 444)
AUTHORS Carroll,W.L., Mendel,E. and Levy,S.
TITLE Hybridoma fusion cell lines contain an aberrant kappa transcript
JOURNAL Mol. Immunol. 25, 991-995 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 28 414 Ig kappa-chain V-J2-C-region precursor
sigp 28 87 Ig kappa-chain V-J2-C-region signal peptide
matp 88 411 Ig kappa-chain V-J2-C-region
recomb 380 381 V-region end/J2-region start
recomb 411 412 J2-region end/C-region start
BASE COUNT 108 a 122 c 111 g 103 t
ORIGIN Chromosome 6.
1 cagcatcctc tcttccagct ctcagagatg gagacagaca cactcctgtt atgggtactg
61 ctgctctggg ttccaggttc cactggtgac attgtgctga cacagtctcc tgcttcctta
121 gctgtatctc tggggcagag ggccaccatc tcatacaggg ccagcaaaag tgtcagtaca
181 tctggctata gttatatgca ctggaaccaa cagaaaccag gacagccacc cagactcctc
241 atctatcttg tatccaacct agaatctggg gtccctgcca ggttcagtgg cagtgggtct
301 gggacagact tcaccctcaa catccatcct gtggaggagg aggatgctgc aacctattac
361 tgtcagcaca ttagggagct tacacgttcg gaggggggac caagctggaa ataaaacggg
421 ctgatgctgc accaactgta tcca
//
LOCUS MUSLACPI 844 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse placental lactogen I (mPL-I) mRNA, complete cds.
ACCESSION M35662
KEYWORDS placental lactogen I.
SOURCE Mouse (strain Swiss-Webster) day 10 placenta, cDNA to mRNA, clone
1.5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 844)
AUTHORS Colosi,P., Talamantes,F. and Linzer,D.I.H.
TITLE Molecular cloning and expression of mouse placental lactogen I
complementary deoxyribonucleic acid
JOURNAL Mol. Endocrinol. 1, 767-776 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 42 716 placental lactogen I (mPL-I) precursor
sigp 42 131 placental lactogen I (mPL-I) signal peptide
matp 132 713 placental lactogen I (mPL-I)
mRNA < 1 844 mPL-I mRNA
signal 821 831 mPL-I poly-A signal
BASE COUNT 243 a 188 c 176 g 237 t
ORIGIN
1 ttcctcactt ggagcctaca ttgtggtgga tcttctcaga aatgcagctg actttgaatc
61 tttcaggctc cgcaggaatg caattgttgc tgctggtgtc aagcctactc ctttgggaga
121 atgtgtcctc caaaccaact gccatggtgc ccactgaaga cctgtatact cgtttggctg
181 aactgctcca taatacattt atcttggccg cagatgtgta tagggaattt gatttggatt
241 ttttcgataa aacttggata acagacagaa cacttcccct gtgtcatact gcttccatcc
301 atactccaga gaatcgagag gaagtccacg aaactaaaac tgaagacctt ctgaaagcaa
361 tgatcaatgt ttcaatttcc tggaaagaac ctctgaaaca cctggtgtct gcactgacgg
421 ctctcccagg agcttctgag agtatgggga aaaaagctgc tgacattaag ggcagaaacc
481 ttgtaattct ggagggactt cagacaatat acaacaggtc tcaggctaac attgaagaaa
541 atgaaaattt tgactaccct gcttggtctg gactcgaaga actgcagtca cctaacgaag
601 acactcatct ttttgccgtt tataatctat gccgctgcat taaaagggac atccataaga
661 tagacagcta tatcaaagtc ttgaggtgcc gagttgtctt tcagaacgaa tgttgagtgc
721 ccacccagcg aagccctgcc cacatggtct ttgttgaacc agacttgtaa tgctttcccc
781 tcctcagtta tgatgagcta taatggaatt attgtcataa aataaaataa aattatttag
841 attc
//
LOCUS BLYGSA 1621 bp ss-mRNA PLN 01-AUG-1990
DEFINITION Barley glutamate 1-semialdehyde aminotransferase (GSA) mRNA,
complete cds.
ACCESSION M31545
KEYWORDS glutamate 1-semialdehyde aminotransferase.
SOURCE Barley (cv. Bonus) 5 day old dark grown seedling, cDNA to mRNA.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1621)
AUTHORS Grimm,B.
TITLE Primary structure of a key enzyme in plant tetrapyrrole synthesis:
Glutamate 1-semialdehyde aminotransferase
JOURNAL Unpublished (1990) Carlsberg Laboratory, Dept. of Physiology, Gamle
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Grimm, 22-JAN-1990.
FEATURES from to/span description
pept 20 1429 glutamate 1-semialdehyde aminotransferase
(GSA) precursor (EC 5.4.3.8)
sigp 20 121 glutamate 1-semialdehyde aminotransferase
signal peptide
matp 122 1426 glutamate 1-semialdehyde aminotransferase
signal 1598 1603 polyA signal
mRNA < 1 1621 GSA mRNA
BASE COUNT 362 a 363 c 459 g 437 t
ORIGIN
1 ggagaaggaa ggcagcatca tggccggagc agcagccgcc gtggcctccg gcatatcgat
61 caggcctgta gccgcgccta agatctcgcg cgcgccccgc tctcggtcgg tggtgagggc
121 ggccgtctcc atagacgaga aggcttacac ggttcagaaa tccgaggaga tcttcaacgc
181 cgccaaggaa ttgatgcctg gtggtgttaa ttcaccagtc cgtgccttca aatcagtcgg
241 cgggcagccc atagtttttg attctgtgaa gggctctcat atgtgggatg tcgatggaaa
301 tgaatatatt gattatgttg gttcctgggg tcctgcaatc attggtcatg cagatgacaa
361 ggtgaatgct gcacttattg aaactctgaa gaagggtact agctttggtg ctccatgtgc
421 gttggagaat gtgttggctc aaatggtcat ctccgctgtg ccgagtatcg aaatggttcg
481 ttttgtaaat tcaggaacag aagcttgcat gggagcactc cgccttgtgc gtgcattcac
541 tgggagggaa aagattctca agtttgaagg ctgttaccat ggccatgcag attccttcct
601 tgttaaagca ggcagtggtg ttgccaccct cggcctccca gactcccctg gagtgcctaa
661 gggagccacc gttgggactc taacagcacc ttataatgat gctgatgcgg ttaaaaagct
721 gtttgaggat aacaaagggg agattgctgc agtcttcctt gagccggttg ttggcaatgc
781 tggcttcatt cctccgcagc ctgctttcct aaatgctctc cgtgaggtga ccaaacaaga
841 cggcgcactt ctggtgtttg atgaagtgat gactcctttc cgtttagctt atggtggggc
901 acaagagtac tttggaatca cccctgatgt gacaaccttg ggccaaatta ttggcggtgg
961 tcttccggtt ggtgcttacg gtggacggaa ggatatcatg gagatggttg ctccagcagg
1021 gccaatgtac caggcaggaa ccctcagtgg aaaccctcta gctatgactg ctggaatcca
1081 cactctcaag cgtctgatgg agcctggcac ctatgaatac ttagacaagg tcactggtga
1141 acttgtccgg ggcatattgg atgtgggcgc taaaacaggg cacgagatgt gtggaggaca
1201 catcagaggc atgttcggat tcttcttcgc aggtggccca gtgcacaact ttgatgatgc
1261 caagaagagt gacacagcga agtttgggag gttccaccgt ggaatgctgg gcgaaggcgt
1321 gtatctggca ccatcccagt tcgaggcagg ttttacaagc ttggcacaca ccacccaaga
1381 cattgagaaa accgtggagg ctgccgagaa ggttcttcga tggatataga tgatttggat
1441 tgcaaacctt ttgaagcttt tccttctgtt gtattctgtt agtttgtacg tggctgaagt
1501 ttagttttgt attgtatttt gttgtgcagc agcagtatct tgtctctagc ccatttttct
1561 tcttctgagt tagcatttgg ggtgattttg tcttggcaat aaaactttgg ctacgacctc
1621 c
//
LOCUS MUSSVSIV 541 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse seminal vesicle secretory protein IV (SVS IV) mRNA, 3' end.
ACCESSION M35732
KEYWORDS seminal vesicle secretory protein IV.
SOURCE Mouse adult seminal vesicle, cDNA to mRNA, clone p2A2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 541)
AUTHORS Chen,Y.H., Pentecostt,B.T., McLachlan,J.A. and Teng,C.T.
TITLE The androgen-dependent mouse seminal vesicle secretory protein IV:
Characterization and complementary deoxyribonucleic acid cloning
JOURNAL Mol. Endocrinol. 1, 707-716 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 329 seminal vesicle secretory protein IV (SVS IV)
precursor (AA at 3)
sigp < 1 50 seminal vesicle secretory protein IV (SVS IV)
signal peptide
matp 51 326 seminal vesicle secretory protein IV (SVS IV)
mRNA < 1 541 SVS IV mRNA
BASE COUNT 154 a 107 c 131 g 149 t
ORIGIN
1 gtttgttcct cttttctctg cttctccttc tggtgacagg agccattggg aagaaaacta
61 aggaaaaatt cttgcagtcg gaagaaactg tcagagagag cttctcgacg ggaagcagag
121 gccatatgtc aagaagttct gagccagagg tatttgttag gccacaggac tccatcggtg
181 acgaagcttc tgaggaaatg agtagtagta gtagtagtag aagaagaagt aagattatct
241 ctagcagttc tgatggttct aatatggaag gtgagagttc atattcaaag agaaagaaga
301 gccggttttc tcaagatgca ctcgagtgat actgcattga ccagctgaac atctggacca
361 atatgctgga gccatatcgc cagaacagag cccatgatgt cttcagcata cagctcccat
421 gtggtctcag aggcagtccc tggatggcat ttacttccca tgcttgtttg tcttgaggtt
481 cttaaaccta acatttactc tggagctttc tttccaataa agagataaca attgcatcat
541 t
//
LOCUS NEMRPT 677 bp ds-DNA INV 01-AUG-1990
DEFINITION A.lumbricoides BamHI repetitive DNA.
ACCESSION M35399
KEYWORDS BamHI repetitive sequence.
SOURCE A.lumbricoides DNA, clone AL700-1.
ORGANISM Ascaris lumbricoides
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia;
Ascaridida; Ascaridina; Ascaridoidea; Ascarididae.
REFERENCE 1 (bases 1 to 677)
AUTHORS Warren,T. and Pasternak,J.J.
TITLE A related moderately repetitive DNA family in the nematodes Ascaris
lumbricoides and Panagrellus silusiae
JOURNAL Nucleic Acids Res. 16, 10833-10847 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rpt 1 677 BamHI repeat
BASE COUNT 186 a 158 c 161 g 172 t
ORIGIN
1 ggatccgagt aagtgtgcaa aaacagcatt atttatgtaa acgaagctca attacatttc
61 taagtgcaat tacggctgta tcacgggttg gcaactccat attccacgga aatccaccca
121 ttcaacgggt gcaattcccg tgagtatcgt aaaataggag agtgaaagct cagaatgcgg
181 ctagaatgtg tcatcttgtt gccaaatcgg agatatgtat cgtgtgaatt gacatgtatc
241 atgccaaggt aggtcggaaa ggccaaagaa aagcggaaac cagacggtcg gaaagtacag
301 aactcgattc ttgcgattgt gcatcttcga gttctggtaa gtgtaaatgc gagtccggtg
361 tctgatcgga tctgatcggc cagtgccgag gcttacacgt gactatcaca tagtctcact
421 ctttcactct tcccttttcg cgatttccga ttcagtgcta acaactcgac gtagacaccc
481 cactctttct cctgcgcatt cctatgccgg tcaccgattg ggtcgcaaaa tgccaaagga
541 cagggcatgt aagcccgcat cttaattgtt aagattcacc gatgaatcgt caaaaatttt
601 gcaaaagcta gtggaaaacg gggttttgag gcccgttcca ccggcaaacc gtcatcgtgc
661 gccgatcaga tggatcc
//
LOCUS PNGRPT 682 bp ds-DNA INV 01-AUG-1990
DEFINITION P.silusiae BamHI repetitive DNA.
ACCESSION M35398
KEYWORDS BamHI repetitive sequence.
SOURCE P.silusiae DNA, clone PS700-1.
ORGANISM Panagrellus silusiae
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia;
Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae.
REFERENCE 1 (bases 1 to 682)
AUTHORS Warren,T. and Pasternak,J.J.
TITLE A related moderately repetitive DNA family in the nematodes Ascaris
lumbricoides and Panagrellus silusiae
JOURNAL Nucleic Acids Res. 16, 10833-10847 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rpt 1 682 BamHI repeat
BASE COUNT 201 a 154 c 155 g 172 t
ORIGIN
1 ggatccgcag cgaattgtgt aaaacagcat taattatgta aaagaagctc aattaacctt
61 tctaagtgca attgaggctg tatcacgggt tggcaacctc gtattccacg gaaatccacc
121 cattcaacgg gtgcgatttc gtgtttttcg taaaaatcgg attctgaagg ctagaatccg
181 gccagaatgt gtcatcttgt tccaaatgag agttatttga catctgaatc acatttgaaa
241 tgcaaagaca ggtcggaaag gccaaacaag agcgaaaacc cgcgggtcgc caaaagtacc
301 agaactcgat tcttgcgatt tttcgcattt tcgagttctg gtaagtgcaa aaagtttcga
361 tttcggatct gcatcggaat ctgattgccc acgtgccaga aggcttaaaa acgtgcacaa
421 accacatggt taccctttac cttgttttcg aaatttaaca aaaagtgcaa aaaccgggta
481 aaaacccatc tttggcctgc gcattgccaa tggcggtcat cgatgggtcg cgaagtgcca
541 aagggaccaa ggtgtaagcc cgcatcatat ctgttaagat tcatcgatga atcggccaat
601 attttgaaaa gctagtggaa aaacgcgttt tgacgcccgt ttccaccggc aaaccgtcat
661 cgtgcgccga tcagacggat cc
//
LOCUS TETTRGA 75 bp ss-tRNA RNA 01-AUG-1990
DEFINITION T.thermophila Gln-tRNA-UUG.
ACCESSION M35400
KEYWORDS glutamine tRNA.
SOURCE T.thermophila tRNA.
ORGANISM Tetrahymena thermophila
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H.
TITLE Dramatic events in ciliate evolution: Alteration of UAA and UAG
termination codons to glutamine codons due to anticodon mutations
in two Tetrahymena tRNAs-Gln
JOURNAL EMBO J. 5, 1307-1311 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 75 Gln-tRNA
modified 9 9 m1g
modified 10 10 m2g
modified 13 13 p
modified 19 19 d
modified 20 20 d
modified 34 34 um
anticdn 34 36 Gln-tRNA anticodon ttg
modified 39 39 p
modified 48 48 m5c
modified 54 54 p
modified 57 57 m1a
BASE COUNT 15 a 19 c 21 g 18 t 2 others
ORIGIN
1 ggttgtatgg tgtagcggaa agcaccgagg actttgaatc ctctgacctg ggttcgaatc
61 ccagtacgac ctcca
//
LOCUS TETTRGB 75 bp ss-tRNA RNA 01-AUG-1990
DEFINITION T.thermophila Gln-tRNA-CUA.
ACCESSION M35401
KEYWORDS transfer RNA-Gln.
SOURCE T.thermophila tRNA.
ORGANISM Tetrahymena thermophila
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H.
TITLE Dramatic events in ciliate evolution: Alteration of UAA and UAG
termination codons to glutamine codons due to anticodon mutations
in two Tetrahymena tRNAs-Gln
JOURNAL EMBO J. 5, 1307-1311 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 75 Gln-tRNA
modified 10 10 m2g
modified 13 13 p
modified 19 19 d
modified 20 20 d
anticdn 34 36 Gln-tRNA anticodon cta
modified 37 37 t6a
modified 39 39 p
modified 48 48 m5c
modified 54 54 p
modified 57 57 m1a
BASE COUNT 19 a 18 c 18 g 19 t 1 others
ORIGIN
1 ggttctatag tatagcgcaa agtactgggg antctaaatc ccttgacctg ggttcgaatc
61 ccagtaggac ctcca
//
LOCUS TETTRGC 75 bp ss-tRNA RNA 01-AUG-1990
DEFINITION T.thermophila Gln-tRNA-UUA.
ACCESSION M35402
KEYWORDS transfer RNA-Gln.
SOURCE T.thermophila tRNA.
ORGANISM Tetrahymena thermophila
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H.
TITLE Dramatic events in ciliate evolution: Alteration of UAA and UAG
termination codons to glutamine codons due to anticodon mutations
in two Tetrahymena tRNAs-Gln
JOURNAL EMBO J. 5, 1307-1311 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 75 Gln-tRNA
modified 10 10 m2g
modified 13 13 p
modified 16 16 d
modified 19 19 d
modified 20 20 d
modified 32 32 cm
modified 34 34 um
anticdn 34 36 Gln-tRNA anticodon tta
modified 37 37 t6a
modified 39 39 p
modified 48 48 m5c
modified 54 54 p
modified 57 57 m1a
BASE COUNT 16 a 17 c 20 g 19 t 3 others
ORIGIN
1 ggttccatag tatagdggdd agtactgggg actttaaatc ccttgacctg ggttcgaatc
61 ccagtgggac ctcca
//
LOCUS BEGRR5S 120 bp ss-rRNA RNA 01-AUG-1990
DEFINITION B.alba 5S ribosomal RNA.
ACCESSION M35565
KEYWORDS 5S ribosomal RNA.
SOURCE B.alba (strain B18LD) rRNA.
ORGANISM Beggiatoa alba
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales;
Beggiatoaceae.
REFERENCE 1 (bases 1 to 120)
AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
Pace,N.R.
TITLE Phylogenetic analysis of certain sulfide-oxidizing and related
morphologically conspicuous bacteria by 5S ribosomal ribonucleic
acid sequences
JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 120 5S ribosomal RNA
BASE COUNT 32 a 32 c 29 g 27 t
ORIGIN
1 ttcttggcga ccatagcaaa taggaaccac ccgaccccat cccgaactcg gtagtgaaac
61 tgttctgcgc cgatgatagt gtggatactc tccatgtgaa agtaggttat cgccaagagc
//
LOCUS ECOHEMC 2092 bp ds-DNA BCT 01-AUG-1990
DEFINITION E.coli porphobilinogen deaminase (hemC) and uroporphyrinogen III
synthase (hemD) genes, complete cds.
ACCESSION X04242 M35827
KEYWORDS deaminase; hemC gene; hemD gene; porphobilinogen deaminase;
uroporphyrinogen III synthase.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1957)
AUTHORS Thomas,S.D. and Jordan,P.M.
TITLE Nucleotide sequence of the hemC locus encoding porphobilinogen
deaminase of Escherichia coli K12
JOURNAL Nucleic Acids Res. 14, 6215-6226 (1986)
STANDARD simple automatic
REFERENCE 2 (bases 1290 to 2092)
AUTHORS Jordan,P.M., Mgbeje,B.I.A., Thomas,S.D. and Alwan,A.F.
TITLE Nucleotide sequence for the hemD gene of Escherichia coli encoding
uroporphyrinogen III synthase and initial evidence for a hem operon
JOURNAL Biochem. J. 249, 613-616 (1988)
STANDARD simple staff_review
COMMENT Data kindly reviewed (11-SEP-1986) by P. Jordan
FEATURES from to/span description
pept 390 1331 porphobilinogen deaminase (hemC)
pept 1328 2068 uroporphyrinogen III synthase (hemD)
signal 330 335 put. -35 region
rpt 63 67 inverted repeat A
rpt 78 82 direct repeat 1
rpt 349 853 inverted repat A'
signal 354 359 put. -10 region
rpt 356 360 direct repeat 1
rpt 367 371 direct repeat 1
binding 377 381 put. ribosome binding site
signal 1508 1522 pot. transcription termination signal
BASE COUNT 495 a 540 c 566 g 491 t
ORIGIN
1 caagacgtat cgcctgattt gctacccgtc atgactgtga ttccgccaac atcaacggta
61 acacgcggca ttcgggatat ttcgtatgtc aaaggtaacc gttaccactt ttcgcgcctg
121 gtttttttag tttcacgacg aaaaaatggt ctaaaacgtg atcaatttaa caccttgctg
181 attgaccgta aagaaagatg cgctacatac aagtgtagca ccgtttattc tctgtaaatt
241 ccttattaca acggcgtgaa acgcctgtca ggatccactg ccagacctca ttttacggtt
301 tgcgcaggcg tctacgtttc accacaacac tgacatcact ctggcaagga tgttaggatg
361 gaccacggat gataatgacg gtaacaagca tgttagacaa tgttttaaga attgccacac
421 gccaaagccc acttgcactc tggcaggcac actatgtcaa agacaagttg atggcgagcc
481 atccgggcct ggtcgttgaa ctggtaccga tggtgacgcg cggcgatgtg attcttgata
541 cgccgctggc gaaagtaggc ggaaaaggct tatttgtaaa agagctggaa gtcgcgctcc
601 tcgaaaatcg cgccgatatc gccgtacact caatgaaaga tgtgccggtt gaattcccgc
661 aaggtctggg actggtcact atttgtgagc gtgaagatcc tcgcgatgcc tttgtgtcca
721 ataactatga cagtctggat gcgttaccgg caggcagtat cgtcgggacg tccagtttac
781 gtcgccagtg ccaactggct gaacgccgtc cggatctgat tatccgctcc ctgcgcggca
841 acgtcggcac tcgcctgagc aaactggata acggcgaata cgatgccatc attcttgccg
901 tagccggact aaaacgttta ggtctggagt cacgtattcg cgccgcgttg ccacccgaga
961 tttctcttcc ggcggtagga caaggtgcgg tgggtattga atgccgcctt gatgattcac
1021 gcactcgcga gctgcttgcc gcgctgaatc accacgaaac tgcactgcgc gttaccgcag
1081 aacgcgccat gaatacccgt ctcgaaggcg catgtcaggt gccaattggt agctacgccg
1141 agcttattga tggcgaaatc tggctgcgtg ggctggtcgg cgcgccggac ggttcgcaga
1201 ttattcgcgg tgaacgccgc ggtgcgccgc aagatgccga acaaatgggg atttcgctgg
1261 cagaagagct actgaataac ggcgcgcgcg agatcctcgc tgaagtctat aacggagacg
1321 ccccggcatg agtatccttg tcacccgccc gtctcccgct ggagaagagt tagtgagccg
1381 tctgcgcaca ctggggcagg tggcctggca ttttccgctg attgagtttt ctccgggtca
1441 acaattaccg caacttgctg atcaactggc agcgctgggg gagagcgatc tgttgtttgc
1501 cctctcgcaa cacgcggttg cttttgccca atcacagctg catcagcaag atcgtaaatg
1561 gccccgacta cctgattatt tcgccattgg acgcaccacc gcactggcac tacataccgt
1621 aagtggacag aagattctct acccgcagga tcgggaaatc agcgaagtct tgctacaatt
1681 acctgaatta caaaatattg cgggcaaacg tgcgctgata ttacgtggca atggtggtcg
1741 tgagctaatt ggggataccc tgacggcgcg cggtgctgag gtcacttttt gtgaatgtta
1801 tcaacgatgc gcaatccatt acgatggtgc agaagaagcg atgcgctggc aagcccgcga
1861 ggtgacgatg gtcgttgtta ccagcggtga aatgttgcag caactctggt cactgatccc
1921 acaatggtat cgtgagcact ggttactaca ctgtcgacta ttggtcgtca gtgagcgttt
1981 ggcgaaactc gcccgggaac tgggctggca agacattaag gtcgccgata acgctgacaa
2041 cgatgcgctt ttacgggcat tacaataact ctcataacag gaagccataa tg
//
LOCUS LTTRR5S 117 bp ss-rRNA RNA 01-AUG-1990
DEFINITION L.discophora 5S ribosomal RNA.
ACCESSION M35569
KEYWORDS 5S ribosomal RNA.
SOURCE L.discophora (strain Stokes) rRNA.
ORGANISM Leptothrix discophora
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Budding and/or appendaged bacteria; Prosthecate bacteria.
REFERENCE 1 (bases 1 to 117)
AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
Pace,N.R.
TITLE Phylogenetic analysis of certain sulfide-oxidizing and related
morphologically conspicuous bacteria by 5S ribosomal ribonucleic
acid sequences
JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 117 5S ribosomal RNA
BASE COUNT 27 a 35 c 32 g 23 t
ORIGIN
1 atgcctgacg accatagcga ggtggtccca ctccttccca tcccgaacag gacagtgaaa
61 cgcctcagcg ccgatgatag tgcgcattcg cgtgtgaaag taggtcatcg tcaggct
//
LOCUS TBSACG 4776 bp ss-RNA VRL 01-AUG-1990
DEFINITION Tomato bushy stunt virus complete genome.
ACCESSION M21958 M31019
KEYWORDS capsid protein; coat protein; complete genome; p19 protein;
p22 protein; p33 protein; p41 protein; p92 protein.
SOURCE Tomato bushy stunt virus (strain cherry), cDNA to viral RNA.
ORGANISM Tomato bushy stunt virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Tombusvirus.
REFERENCE 1 (bases 2621 to 4776)
AUTHORS Hillman,B.I., Hearne,P., Rochon,D. and Morris,T.J.
TITLE Organization of tomato bushy stunt virus genome: Characterization
of the coat protein gene and the 3' terminus
JOURNAL Virology 169, 42-50 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2620)
AUTHORS Hearne,P.Q., Knorr,D.A., Hillman,B.I. and Morris,T.J.
TITLE The complete genome structure and synthesis of infectious RNA from
clones of tomato bushy stunt virus
JOURNAL Virology 177, 141-151 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.Q.Hearne, 16-DEC-1988. Draft entry and computer-readable
sequence for [2] kindly submitted by D.Knorr, 21-DEC-1989.
The 5' terminal nucleotide was not determined. However, in the
infectious constructs, two 5' terminal "g" residues are added, one
of which is removed during subsequent replication in host plants.
FEATURES from to/span description
pept 166 1056 p33 protein
pept 166 2622 p92 protein (read-through of p33)
pept 2652 3818 p41 capsid protein
pept 3888 4406 p19 protein
pept 3856 4425 p22 protein
mRNA 2621 4776 2.2kb subgenomic mRNA
mRNA 3841 4776 0.9kb subgenomic mRNA
BASE COUNT 1257 a 983 c 1315 g 1220 t 1 others
ORIGIN
1 naaattctcc aggatttctc gacctagttc gtttatctgg tgacttgcgc taccgttgct
61 ttgcgtagag aatttctctc cataattatt atctttagtt gtggggtttg aaggttgggt
121 ctacctttcg gggggataaa ttgtaacttc caacaaacaa gcgacatgga gaccatcaag
181 agaatgattt ggcctaagaa agagattttt gtgggtgatt tcgcaaccgg agtgaatagg
241 acagttccgg tgaacatctt tcaattggtg tgtcgtgtgg ttctgagata catgaggaca
301 gggaaaatag agtgtgattc tgacagcatg actaagttta tagttgaatt actcaaaact
361 gattgtgctg ccaaatggga atggttcatg aagagacggc agaggggtga ttacattgtc
421 cctctatcta tagcctccat accaatcata ccgctgttga gttatgccac tagggtacgc
481 gcagtctcag tcaaggcttt tggcaatgaa ctatcgttca atgtcagggt gcctagacca
541 tctgtaccta agaaaggatt gctcctcaga ctggcggcag gtctagcgtt agctcctata
601 tgcgcgctgg ccgtgtacgc taccctacct agggaaaaac tgtcggtatt taagctgaga
661 actgaggcac gagcacacat ggaggatgag agagaagcga cagattgtct ggtggttgag
721 ccggcaaggg aacttaaggg taaagatggt gaggatctcc tcactggtag tagattgact
781 aaggtgatcg cgtccactgg gcgccctcgt cgaagacctt atgcggcaaa gatcgcacag
841 gtggcgagag caaaggtggg ttaccttaag aacagtccag agaatagact aatctaccag
901 agggtgatga tcgagatcat ggacaaagac tgcgtcaggt atgttgacag ggatgtcata
961 ttgcctttgg ctattggatg ctgttttgtc tatccggatg gagtggagga gtcggcggca
1021 ctatggggct cacaggagtc cctgggtgtc aaatagggag gcctagtacg tctacctggg
1081 gttgtaacac agatcaatcg agatatccca tctgatgtgt tacttcctca ggaggtgcta
1141 gaggttcgta caggacctcc caatgctaag gaccgtaata tatttatggt tgcaggttgc
1201 ccatcacagg cacggttctt agtacataat cactgcctga aaaaccttaa aaggggtctt
1261 gtggagagag tcttctgcgt agagagaaac gggaagctcg ctcgcactcc acaacctacc
1321 aaaggagcct ttggacgtct ttccccgttc aggaaagcgg tttgtgagaa ggttggggta
1381 gcccaccgac ttgggtatga tgggtttctg tcatactaca gcggtgcgaa actccgtact
1441 tacacacgag ccgtggagag tctgcatatc acacctgtct ccgagaggga tagtcacttg
1501 actaccttcg taaaagcaga gaagatatcg acgtctaagg gtgacccagc acctcgggtg
1561 attcagcctc gaaacccgag gtacaatgtg gaacttggaa gatatctacg gcatatggaa
1621 tccaagctga tgaaagctgt tgatggcgtg ttcggagaga cgacatgcat caaaggatac
1681 acagctgatg aggtaggtgc aattttccgg gctaaatggg acaggtttga taagcctgtc
1741 gccatagggc tcgatgcatc taggtttgat caacactgtt ccgttgaagc attgcaatat
1801 gagcatagct tctacagggc catgtaccct ggcaacaagc tcttgggcaa gttgttggaa
1861 tggcagctcc ataataaagg taaaggttat gttccagatg gaactataac ctatcgcaag
1921 gagggctgtc gcatgagtgg ggatataaac acctcgttgg gcaactatct actgatgtgt
1981 gcaatggtac atgggtacat gcgtcatctg gggattaatg agtttagtct ggcaaactgt
2041 ggggatgatt gcgtcctaat tgtcgaacgc aggaatctta agcagataca gagaacttta
2101 ccggagtatt tcctcaatct gggatatact atgaaggtgg agcaacctgt atttcaactg
2161 gaagaggttg aattttgcca ggcacaccca gtacagtttc aaggcggttg gaagatggtt
2221 cgaaacgtcc gtactgctat gagcaaggat gtgcactgtg tcaacaatat acgcgatttg
2281 gcgacgagga gagcttggag taatgctcaa catcatgggg gtctagcgct tagtgctggt
2341 attccagttg tggagacgtt ttactctagg tttaagcttt atgatgtacc tcgtaaacat
2401 caacgtattg acacggtcac aaatgtgcac aagtggcgtg gatccggtgg gagttatgtt
2461 gtgacccctg aatctagggc tagcttttgg gctgcctttg gactcacggg ggatgagcaa
2521 ctggctctgg aggaccgtct ggaaagatgg gagatggatc tgtttggaga ggagggtgtt
2581 gacgctcatg agcccagcat cctcgactcc gccgtagctt gaccaagaat acacacacgc
2641 aggatagaca catggcaatg gtaaagagaa acaacaacac gggaatgatc ccggtgagta
2701 caaagcaatt actggcattg ggtgcggccg ctggggccac agccttgcag ggatttgtca
2761 agaataatgg gatggccatc gttgaggggg ctgtcgatct gactaaaaga gcgtacaaag
2821 cagtgcggag aagaggaggt aagaaacagc agatgattaa tcatgtaggt ggtacaggtg
2881 gtgctataat ggcgccggta gcagtgacta gacaacttgt cggtagtaag cctaagttta
2941 ctggcaggac gtctggctct gtcacagtta cccaccgtga gtatctgtca caagtgaata
3001 attccacggg tttccaagtt aatgggggaa ttgtcggcaa tttgttacag cttaacccgt
3061 tgaatggtac attgttctct tggttgccag cgatagcatc caattttgat cagtacacat
3121 tcaacagcgt tgtgctacat tatgtgcccc tatgttcaac tactgaggta gggagagtgg
3181 ctatttactt tgataaggac tcagaagatc cagaacctgc tgatagagtt gagttggcga
3241 attacagcgt gcttaaagag acagcccctt gggctgaagc gatgcttagg gtacccaccg
3301 ataagattaa gagattttgt gatgacagtt ccacatctga tcacaaactt atcgacttgg
3361 gtcaattggg cattgctaca tatggtggcg ctgggactaa tgctgtgggg gatatcttta
3421 tctcgtacag tgttacgtta tatttccctc aacctacgaa cacactcctt agtaccagaa
3481 ggctcgacct tgctggcgct cttgtcacag catctggccc tggatacctc ctggtgtcta
3541 ggactgccac tgtattgaca atgacattcc gtgctacagg cacgtttgtc atatccggga
3601 cgtatcggtg cctcacggca acaacgttag gcttggctgg cggagtgaat gtcaatagta
3661 tcacagttgt agataacata ggtacagaca gtgcgttttt cataaattgt actgtctcta
3721 acctaccatc tgtggtgaca ttcacatcta ccggtatcac atctgccaca gtacattgcg
3781 tgcgcgcgac acgacagaat gatgtttctc taatttagtg tgtcctgcga ggggcctctt
3841 gaacaagacc agttcatgga tactgaatac gaacaagtca ataaaccatg gaacgagcta
3901 tacaaggaaa cgacgctagg gaacaagcta acagtgaacg ttgggatgga ggatcaggag
3961 gtaccacttc tcccttcaaa cttcctgacg aaagtccgag ttggactgag tggcggctac
4021 ataacgatga gacgaattcg aatcaagata atccccttgg tttcaaggaa agctggggtt
4081 tcgggaaagt tgtatttaag agatatctca gatacgacag gacggaagct tcactgcaca
4141 gagtccttgg atcttggacg ggagattcgg ttaactatgc agcatctcga tttttcggtt
4201 tcgaccagat cggatgtacc tatagtattc ggtttcgagg agttagtatc accgtttctg
4261 gagggtcgcg aactcttcag catctctgtg agatggcaat tcggtctaag caagaactgc
4321 tacagcttgc cccaatcgaa gtggaaagta atgtatcaag aggatgccct gaaggtactg
4381 agaccttcga aaaagaaagc gagtaagaca gactcttcag tctgagtttg tggagatgag
4441 tgtaaatctg gcatagcata caggttactc ttgttgggtt ctggatgtta ggatgacgag
4501 tcgactcggg ctccgcacta ggtttggtcg cctaggggat ggagatatgg aaagggtctc
4561 gtgtggtatc agtcggtcga aagacgcgct tccaacatgg gcctatggtc ggataagtct
4621 tagcaatacc agccagcatg aattggattc ctgtttacga aagttaggtg tcacttgtgg
4681 aagcggaccc agacacggtt gatctcaccc ttcggggggc tatagagatc gctggaagca
4741 ctaccggaca accggaacat tgcagaaatg cagccc
//
LOCUS THTRR5S 122 bp ss-rRNA RNA 01-AUG-1990
DEFINITION T.nivea 5S ribosomal RNA.
ACCESSION M35563
KEYWORDS 5S ribosomal RNA.
SOURCE T.nivea (strain JP2) rRNA.
ORGANISM Thiothrix nivea
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales;
Leucotrichaceae.
REFERENCE 1 (bases 1 to 122)
AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
Pace,N.R.
TITLE Phylogenetic analysis of certain sulfide-oxidizing and related
morphologically conspicuous bacteria by 5S ribosomal ribonucleic
acid sequences
JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 122 5S ribosomal RNA
BASE COUNT 27 a 35 c 36 g 24 t
ORIGIN
1 tttgcctggt gtccatagag cactggaacc acctgatccc atcccgaact cagaagtgaa
61 acggtgcatc gccgatggta gtgtggggcc tccccatgtg agagtaggtc aacgccaggc
121 gc
//
LOCUS THVRR5S 123 bp ss-rRNA RNA 01-AUG-1990
DEFINITION Thiovulum sp. 5S ribosomal RNA.
ACCESSION M35570
KEYWORDS 5S ribosomal RNA.
SOURCE Thiovulum sp. rRNA.
ORGANISM Thiovulum sp.
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Colorless sulfur bacteria.
REFERENCE 1 (bases 1 to 123)
AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
Pace,N.R.
TITLE Phylogenetic analysis of certain sulfide-oxidizing and related
morphologically conspicuous bacteria by 5S ribosomal ribonucleic
acid sequences
JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 123 5S ribosomal RNA
BASE COUNT 30 a 28 c 30 g 35 t
ORIGIN
1 tttggttggt gattacagag aaaaggtcac actcagctcc atttcgaacc tgaaagttaa
61 gcttttcttc gtcgataata ctgcccccta cgggggtggg acggtagatc gttgccaacc
121 att
//
LOCUS VITRR5S 118 bp ss-rRNA RNA 01-AUG-1990
DEFINITION V.beggiatoides 5S ribosomal RNA.
ACCESSION M35566
KEYWORDS 5S ribosomal RNA.
SOURCE V.beggiatoides (strain B23SS) rRNA.
ORGANISM Vitreoscilla beggiatoides
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales;
Beggiatoaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
Pace,N.R.
TITLE Phylogenetic analysis of certain sulfide-oxidizing and related
morphologically conspicuous bacteria by 5S ribosomal ribonucleic
acid sequences
JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 118 5S ribosomal RNA
BASE COUNT 30 a 35 c 31 g 22 t
ORIGIN
1 cgcctgacga ccacagcgac tgtgaaccac ccgaccccat ctcgaactcg gtagtgaaac
61 cagtcagcgc cgatgatagt gtggcatatg ccatgtgaaa gtaggtcatc gtcaggct
//
LOCUS VITRR5SX 118 bp ss-rRNA RNA 01-AUG-1990
DEFINITION V.stercoraria 5S ribosomal RNA.
ACCESSION M35567
KEYWORDS 5S ribosomal RNA.
SOURCE V.stercoraria (strain VT1) rRNA.
ORGANISM Vitreoscilla stercoraria
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales;
Beggiatoaceae.
REFERENCE 1 (bases 1 to 118)
AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
Pace,N.R.
TITLE Phylogenetic analysis of certain sulfide-oxidizing and related
morphologically conspicuous bacteria by 5S ribosomal ribonucleic
acid sequences
JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 118 5S ribosomal RNA
BASE COUNT 30 a 32 c 30 g 26 t
ORIGIN
1 tgtttgacga ccatagcgag ttggtcccac gccttcccat cccgaacagg accgtgaaac
61 gacttagcgc cgatgatagt gtggattacc catgtgaaag taggtcatcg tcaaacgc
//
LOCUS VITRR5SXX 116 bp ss-rRNA RNA 01-AUG-1990
DEFINITION V.filiformis 5S ribosomal RNA.
ACCESSION M35568
KEYWORDS 5S ribosomal RNA.
SOURCE V.filiformis (strain ATCC 15551) rRNA.
ORGANISM Vitreoscilla filiformis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales;
Beggiatoaceae.
REFERENCE 1 (bases 1 to 116)
AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
Pace,N.R.
TITLE Phylogenetic analysis of certain sulfide-oxidizing and related
morphologically conspicuous bacteria by 5S ribosomal ribonucleic
acid sequences
JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 116 5S ribosomal RNA
BASE COUNT 27 a 34 c 31 g 24 t
ORIGIN
1 gcctgatgac catagcaagg tggtcccact ccttcccatc ccgaacagga cagtgaaacg
61 ccttagcgcc gatgatagtg cggttctccc gtgtgaaagt aggacatcgt caggct
//
LOCUS PVICSA 1895 bp ds-DNA INV 01-AUG-1990
DEFINITION Plasmodium vivax circumsporozoite protein gene, complete cds.
ACCESSION M11926 M20671 J04090
KEYWORDS circumsporozoite protein.
SOURCE P.vivax (strain Belem) DNA.
ORGANISM Plasmodium vivax
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 1529)
AUTHORS Arnot,D.E., Barnwell,J.W., Tam,J.P., Nussenzweig,V.,
Nussenzweig,R.S. and Enea,V.
TITLE Circumsporozoite protein of Plasmodium vivax: Gene cloning and
characterization of the immunodominant epitope
JOURNAL Science 230, 815-818 (1985)
STANDARD simple staff_review
REFERENCE 2 (bases 158 to 1294; revises [1])
AUTHORS Arnot,D.E., Barnwell,J.W. and Stewart,M.J.
TITLE Does biased gene conversion influence polymorphism in the
circumsporozoite protein-encoding gene of Plasmodium vivax?
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106 (1988)
STANDARD full staff_entry
REFERENCE 3 (bases 1 to 157; 1295 to 1895; revises [1])
AUTHORS Arnot,D.E.
JOURNAL Unpublished (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [2],[3] kindly
submitted by D.E.Arnot, 14-SEP-1988.
FEATURES from to/span description
pept 158 1294 circumsporozoite protein
BASE COUNT 674 a 347 c 471 g 403 t
ORIGIN
1 ctgcataagg caaactcaca aacatccaaa aaaatataca tatatatatt tatatacacg
61 tgtatatatt attaagcggc ttaagttaag caagcaaaac agccaaaggc ctacaagtgt
121 aaacagcttc ctgcacacac gtatatacca gaacaagatg aagaacttca ttctcttggc
181 tgtttcttcc atcctgttgg tggacttgtt ccccacgcac tgcgggcaca atgtagatct
241 gtccaaggcc ataaatttaa atggagtaaa cttcaataat gtagacgcca gttcacttgg
301 cgcggcacac gtaggacaaa gtgctagccg aggcagagga cttggtgaga acccagatga
361 cgaggaagga gatgctaaaa aaaaaaagga tggaaagaaa gcagaaccaa aaaatccacg
421 tgaaaataag ctgaaacaac caggagacag agcagatgga cagccagcag gagacagagc
481 agatggacag ccagcaggtg atagagcaga tggacaacca gcaggagata gagcagctgg
541 acaaccagca ggagatagag cagatggaca gccagcagga gacagagcag atggacagcc
601 agcaggagac agagcagatg gacaaccagc aggagacaga gcagatggac aaccagcagg
661 tgatagagca gctggacaac cagcaggtga tagagcagct ggacaaccag caggagatag
721 agcagatgga cagccagcag gagatagagc agctggacag ccagcaggag atagagcaga
781 tggacagcca gcaggagata gagcagctgg acagccagca ggagatagag cagatggaca
841 gccagcagga gatagagcag ctggacagcc agcaggagat agagcagctg gacagccagc
901 aggagataga gcagctggac agccagcagg agatagagca gctggacagc cagcaggaaa
961 tggtgcaggt ggacaggcag caggaggaaa cgcaggagga ggacagggac aaaataatga
1021 aggtgcgaat gccccaaatg aaaagtctgt gaaagaatac ctagataaag ttagagctac
1081 cgttggcacc gaatggactc catgcagtgt aacctgtgga gtgggtgtaa gagtcagaag
1141 aagagttaat gcagctaaca aaaaaccaga ggatcttact ttgaatgacc ttgagactga
1201 tgtttgtaca atggataagt gtgctggcat atttaacgtt gtgagtaatt cattagggct
1261 agtcatattg ttagtcctag cattattcaa ttaagtagct gacatccatt attttcggcg
1321 tcctccacgg tgcatattaa gtgttttgtg ttttgtacat gcacataaat acttgcccgt
1381 agggacatga tttttttccc tttcttatga atgttccctg ctgtttgcac gtaactgtat
1441 gtacgtgcgc gtaaggcata gtaagtaaca cctcttacac attatgcgct tacgcacaat
1501 cagttgtgca attctagaaa acacgatatg agtattttta aacacttatc gtccaaaaaa
1561 acaaaaaaaa cagaaaaaac agaaaaaaca gaaaaaacaa aaaaaaacaa aaaaaaacaa
1621 aaaaaaacaa aaaaaacaca tttatattaa cttttccttt ttgattgacc cttttttgac
1681 gtatattttt tttttttttt cgtatgtatt atatatactg cttaacgtag agaacttaaa
1741 ttttgagaat gtattttttt ttaacaagtt aaaaaaagaa ctggtatttt tgggaattca
1801 aaaaatttgc aaattcaaaa gaggcgagtt aaaatttgcg ccgtggcaaa cggggtgcgt
1861 gcgggagtcg tgcaaatgtg gcttatatcc ggggg
//
LOCUS PVICSC 1375 bp ds-DNA INV 01-AUG-1990
DEFINITION Plasmodium vivax circumsporozoite protein gene, 3' end.
ACCESSION M20670 J04090
KEYWORDS circumsporozoite protein.
SOURCE P.vivax (strain North Korean) DNA.
ORGANISM Plasmodium vivax
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 1105)
AUTHORS Arnot,D.E., Barnwell,J.W. and Stewart,M.J.
TITLE Does biased gene conversion influence polymorphism in the
circumsporozoite protein-encoding gene of Plasmodium vivax?
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1106 to 1375)
AUTHORS Arnot,D.E.
JOURNAL Unpublished (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Arnot, 14-SEP-1988.
FEATURES from to/span description
pept < 1 1105 circumsporozoite protein (AA at 2)
BASE COUNT 464 a 260 c 407 g 244 t
ORIGIN Sau3AI site.
1 agatctgtcc aaggccataa atttaaatgg agtaaacttc aataatgtag acgccagttc
61 acttggcgcg gcacacgtag gacaaagtgc tagccgaggc agaggacttg gtgagaaccc
121 agatgacgag gaaggagatg ctaaaaaaaa aaaggatgga aagaaagcag aaccaaaaaa
181 tccacgtgaa aataagctga aacaaccagg agacagagca gatggacagc cagcaggaga
241 cagagcagat ggacagccag caggagacag agcagatgga caggcagcag gaaatggtgc
301 aggtggacag ccagcaggtg atagagcagc tggacaacca gcaggcgatg gagcagctgg
361 acagccagca ggcgatagag cagatggaca gccagcagga gatagagcag ctggacagcc
421 agcaggcgat agagcagatg gacagccagc aggagataga gcagctggac agccagcagg
481 cgatagagca gatggacagc cagcaggaga tagagcagct ggacaggcag caggaaatgg
541 tgcaggtgga caggcagcag gaaatggtgc aggtggacaa ccagcaggag atagagcagc
601 tggacagcca gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca
661 gccagcagga gatagagcag ctggacagcc agcaggagat agagcagctg gacaggcagc
721 aggaaatggt gcaggtggac aggcagcagg aggaaatgcg gcaaacaaga aggcagaaga
781 cgcaggagga aacgcaggag gaaacgcagg aggacaggga caaaataatg aaggtgcgaa
841 tgccccaaat gaaaagtctg tgaaagaata cctagataaa gttagagcta ccgttggcac
901 cgaatggact ccatgcagtg taacctgtgg agtgggtgta agagtcagaa gaagagttaa
961 tgcagctaac aaaaaaccag aggatcttac tttgaatgac cttgagactg atgtttgtac
1021 aatggataag tgtgctggca tatttaacgt tgtgagtaat tcattagggc tagtcatatt
1081 gttagtccta gcattattca attaagtagc tgacatccat tattttcggc gtcctccacg
1141 gtgcatatta agtgttttgt gttttgtaca tgcacataaa tacttgcccg tagggacatg
1201 atttttttcc ctttcttatg aatgttccct gctgtttgca cgtaactgta tgtacgtgcg
1261 cgtaaggcat agtaagtaac acctcttaca cattatgcgt tacgcacaat cagttgtgca
1321 attctagaaa acacgatatg agtattttta aacacttatc gtgaccaaaa aaaca
//
LOCUS ECOHSEST 360 bp ds-DNA BCT 01-AUG-1990
DEFINITION E.coli heat-stable enterotoxin gene, complete cds.
ACCESSION M34916
KEYWORDS heat-stable enterotoxin.
SOURCE E.coli (strain 153837-2) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 360)
AUTHORS Moseley,S.L., Hardy,J.W., Huq,M.I., Echeverria,P. and Falkow,S.
TITLE Isolation and nucleotide sequence determination of a gene encoding
a heat-stable enterotoxin of Escherichia coli
JOURNAL Infect. Immun. 39, 1167-1174 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept 48 266 heat-stable enterotoxin
signal 268 301 pot. transcription termination signal
BASE COUNT 115 a 54 c 65 g 126 t
ORIGIN
1 ttctggtttt gattcaaatg ttcgtggatg ccatgtccgg aggtaatatg aagaaatcaa
61 tattatttat ttttctttct gtattgtctt tttcaccttt ccctcaggat gctaaaccag
121 tagagtcttc aaaagaaaaa atcacactag aatcaaaaaa atgtaacatt gcaaaaaaaa
181 gtaataaaag tggtcctgaa agcatgaata gtagcaatta ctgctgtgaa ttgtgttgta
241 atcctgcttg taccgggtgc tattaataat ataaagggaa ctaaacagtt ccctttatat
301 ttgttctgat tctgatgatg tctgtaacgt atgtacctgt tgctttgttg aataaatcga
//
LOCUS HUMRENA1 826 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human renin gene, exon 1.
ACCESSION M10030 M34914
KEYWORDS aspartyl protease; renin.
SEGMENT 1 of 5
SOURCE Human fetal liver DNA (library of Lawn et al.), clone lambda-III.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 826)
AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
Morris,B.J. and Shine,J.
TITLE Primary structure of the human renin gene
JOURNAL DNA 3, 457-468 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 276 to 583)
AUTHORS Shine,J., Hardman,J.A., Hort,Y.J., Tellam,J.T., Catanzaro,D.F.,
Morris,B.J. and Baxter,J.D.
TITLE Structure of the human renin gene
JOURNAL Trans Assoc Am Physicians 97, 63-69 (1984)
STANDARD simple staff_review
COMMENT There is only a single renin gene in the human haploid genome [1].
It is comprised of 10 exons encoding 406 amino acids. The first
intron separates the 5' untranslated region and the signal peptide
coding region from the remainder of the gene. Exon 2 comprises
most of the sequence coding for the pro portion of the enzyme.
Precise boundaries were not indicated by in figure 2 of [1], but
were taken from the text and from other human renin entries.
FEATURES from to/span description
pept 626 + 723 preprorenin /hgml_locus_uid="LW0050B"
/nomgen="REN" /map="1q32"
sigp 626 685 renin signal peptide
pre-msg 584 > 826 renin mRNA [1]
IVS 724 > 826 renin intron A
site 520 521 ga in [1]; gagca in [2]
BASE COUNT 190 a 226 c 222 g 188 t
ORIGIN Chromosome 1q32; 437 bp upstream of KpnI site.
1 gatctaccca ccttggcctc ccaaagtgct gggacaggtg tgagccacca tgcctggccc
61 ctctactctt ataattaaac cagctgttgc ttttcctgcc aagaaaccag tcatgaagat
121 tcacccatgt tctagatggg aaaactgggc tgtagctggg agaggccagt cagggacaaa
181 gccaaagtta atatagagaa tggagcttcc agggtatagg ggttgggtct gggctaggga
241 gctggaaacc taggttttac gcttgtccca gttttgatgt tagccctgac agtgctgttt
301 ctcatcagcc tctgcctgct ccaggggtca cagggccaag ccagatagag ggctgctagc
361 gtcactggac acaagattgc tttcccacag ctgtccttcc tccagcccct ctgctcccca
421 tccggaaacc tgggtaccct tcacccacct agctctgtcc cgcagtgaga tttattgctg
481 actgccctgc catctacccc agggtaataa atcagggcag agcagaattg caatcacccc
541 atgcatggag tgtataaaag gggaagggct aagggagcca cagaacctca gtggatctca
601 gagagagccc cagactgagg gaagcatgga tggatggaga aggatgcctc gctggggact
661 gctgctgctg ctctggggct cctgtacctt tggtctcccg acagacacca ccacctttaa
721 acggtaattg gtaactcagg cagagaaggg gtgggcaggg gtgtaggttc ccaccttccc
781 aacaccctgg cttttccaca tgcggtgtca ttcagtcctt acgatc
//
LOCUS HUMRENA2 373 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human renin gene, exon 2.
ACCESSION M10128
KEYWORDS renin.
SEGMENT 2 of 5
SOURCE Human fetal liver DNA, clones lambda-[III,V].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 373)
AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
Morris,B.J. and Shine,J.
TITLE Primary structure of the human renin gene
JOURNAL DNA 3, 457-468 (1984)
STANDARD full staff_review
FEATURES from to/span description
pept + 105 + 255 preprorenin, exon 2 /nomgen="REN" /map="1q32"
/hgml_locus_uid="LW0050B"
matp 205 + 255 renin
pre-msg < 1 > 373 renin mRNA
IVS < 1 104 renin intron A
IVS 256 > 373 renin intron B
BASE COUNT 79 a 107 c 96 g 91 t
ORIGIN Chromosome 1q32; about 4.8 kb after segment 1.
1 aacgttaaag gtggttgtac taaagagagg ggtttggcct cagggactca catgtggtgg
61 aggtacagca cttttctatt tttgcttcct ccaccctggg ccaggatctt cctcaagaga
121 atgccctcaa tccgagaaag cctgaaggaa cgaggtgtgg acatggccag gcttggtccc
181 gagtggagcc aacccatgaa gaggctgaca cttggcaaca ccacctcctc cgtgatcctc
241 accaactaca tggacgtgag tgcttggctc agcccctcgc tccctccctg tctcctttcc
301 ctcatggacc tagggctttc tttgctgcaa gactcaccct ttccaagctg tgtttgacga
361 aggcgctgag tag
//
LOCUS HUMRENA3 2480 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human renin gene, exons 3, 4 and 5.
ACCESSION M10150
KEYWORDS renin.
SEGMENT 3 of 5
SOURCE Human fetal liver DNA, clones lambda-[III,V].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2480)
AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
Morris,B.J. and Shine,J.
TITLE Primary structure of the human renin gene
JOURNAL DNA 3, 457-468 (1984)
STANDARD full staff_review
FEATURES from to/span description
pept + 140 263 preprorenin, exon 3 /nomgen="REN" /map="1q32"
/hgml_locus_uid="LW0050B"
879 997 preprorenin, exon 4
1949 + 2145 preprorenin, exon 5
matp + 140 263 renin
879 997 renin
1949 + 2145 renin
pre-msg < 1 > 2480 renin mRNA
IVS < 1 139 renin intron B
IVS 264 878 renin intron C
IVS 998 1948 renin intron D
IVS 2146 > 2480 renin intron E
BASE COUNT 568 a 710 c 622 g 579 t 1 others
ORIGIN Chromosome 1q32; about 0.4 kb after segment 2.
1 ctgcaggaaa atggaaaccc cgacaggtat aggacctcgc ctggggcaag tctacacccg
61 agagccaaga gtgaagccag gcaagacccc aagcccaagg tcccctgagc ccctccagcc
121 ctctcttttt accccacaga cccagtacta tggcgagatt gggatcggga ccccacccca
181 aaccttcaaa gtcgtctttg acactggttc gtccaatgtt tgggtgccct cctccaagtg
241 cagccgtctc tacactgcct gtggtgagac ctaagaccca cagtgcctct cctccatccc
301 cctgccctac tgtgcatgag caatcctgcc caacacccag ctcccatccc tcttgccacc
361 aagggagtgg cttcctctct gcctctgtgc ccactgacat gtaggggaga ggggaagatg
421 tctcccgttt ttctgataca gccaccaagg ttaaaaacaa aaaaaggtcc aagaacccct
481 gagnacccag gaggaccagt tcccagtcgt cctgagattg agacaggact gaattctcaa
541 acccatccca ggcactcgga actcttccat ccctagtctt aatcaacaac ctcttactag
601 cacttactct gtgcctggca tacttctctg gtgttatcag tggttagtga ttactttaaa
661 ttccttcatt taggacaaaa ttctcgatgt atgggacact taggagagcc caagaaaccc
721 agtccttgat tgatgaagca catattccaa gccccctgac cctagggcca ctcatccctg
781 cacctaagct aaccagccat acccacaatg caccctgcct ctgagtcccc ctgtctgggc
841 cactcttgga caaacctgag cctctgtccc cctgccagtg tatcacaagc tcttcgatgc
901 ttcggattcc tccagctaca agcacaatgg aacagaactc accctccgct attcaacagg
961 gacagtcagt ggctttctca gccaggacat catcaccgta agttgggccg ccctaggtca
1021 tctgccccgg accccttctg tccccaggcc tctcctgacc caccagggcc cacacctgcg
1081 gggaggtaca ctgcagccca cttggagcct ggggagctga ggaacaccct actctgccac
1141 atctggtgtt gaaagcagca gtacctatgg gggagcaagc ctgggctacg ggctcaccgt
1201 tgggtggttt gtggatgttt ttgcatctaa cttgcatgta gggctgtcct gagccccgtg
1261 gctgcagtca agtaactcgt cccaagttca ccagctctga ctggggctac taccctagac
1321 tgaaatcctg ggtcagagtc aggctatttt agggtcaggc atagttttaa ggtcacatta
1381 gttgactctg ggactcaggt caaggctctc ttttcttttc catgtggccc atgtctgacc
1441 gtttcctcat cctggagttt ctcaggccct gctccatcag agttagggga ggggcacacg
1501 tggcacctga gaggaaatca gggtgattcc tgcctccctt cctttttctg ttgaactctg
1561 atataaagga ggaagaaggg caagcttgtc tgtgctaaag aaacccttcg cccatgataa
1621 gggtggggcc aagacccagt cctgccaggc acgaaagtct ggccactggg gaggggagga
1681 gctcttggac ttttcttttg cgcttggcag gaccaccctc tcagcctctg ctctccgatc
1741 cctggtcaac tctagctctc tctgggctcc gcagcagaga tgtgtattgg cacagagtgt
1801 gtgcgtgcag ggttgaggca atactcttac cccgatttct gtaccctgga gcatgtgtgc
1861 ccctgggatc cctagtgtgg atgcccagac cagactccaa ccaaggaggg gcagtgggct
1921 tggtctccta tggtccttcc tcccacaggt gggtggaatc acggtgacac agatgtttgg
1981 agaggtcacg gagatgcccg ccttaccctt catgctggcc gagtttgatg gggttgtggg
2041 catgggcttc attgaacagg ccattggcag ggtcacccct atcttcgaca acatcatctc
2101 ccaaggggtg ctaaaagagg acgtcttctc tttctactac aacaggtggg gactgggact
2161 ccaagggctg aggtgggggg caggagggga gaagagatgg ggagtggaag gagagtctgg
2221 gccagaattg taaagtgttt gtaacttagg tgacagccaa tcaatatcta gagctgtact
2281 agccaatatg gaaggcacta ttgcaaattt aaacttaact taaatacagc ttaagcatca
2341 attaagcatt caactggctg gcctcttagt tgtactagcc acagctcaat gcctggcagc
2401 cacggtggct agtaactaca gtctagtaca gtgcagatag agatatccag catgacagga
2461 catctataga cagcgccact
//
LOCUS HUMRENA4 3057 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human renin gene, exons 6, 7, 8, and 9.
ACCESSION M10151
KEYWORDS renin.
SEGMENT 4 of 5
SOURCE Human fetal liver DNA, clones lambda-[III,V].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3057)
AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
Morris,B.J. and Shine,J.
TITLE Primary structure of the human renin gene
JOURNAL DNA 3, 457-468 (1984)
STANDARD full staff_review
FEATURES from to/span description
pept + 998 1006 preprorenin, exon 6 /nomgen="REN" /map="1q32"
/hgml_locus_uid="LW0050B"
1572 1691 preprorenin, exon 7
2061 2202 preprorenin, exon 8
2466 + 2564 preprorenin, exon 9
matp + 998 1006 renin
1572 1691 renin
2061 2202 renin
2466 + 2564 renin
pre-msg < 1 > 3057 renin mRNA
IVS < 1 997 renin intron E
IVS 1007 1571 renin intron F
IVS 1692 2060 renin intron G
IVS 2203 2465 renin intron H
IVS 2565 > 3057 renin intron I
BASE COUNT 750 a 799 c 825 g 683 t
ORIGIN Chromosome 1q32; about 0.6 kb after segment 3.
1 aaaagaatag aggaggatca gagttcagag aaatctcaca gtaaaatgga gaggagtctc
61 cggtttggtg atagaaagtg aggccttgag aaaaggccaa ttggcggctc tgcattcagg
121 ggtggtcttt agaagaactg ttttagagga ggtgggggca aggccagatg gcaagaagtt
181 aagaggtgga cgacgtgggt gtcaggaagt ggaggtcatg agatgtacgc tgccctggga
241 cattcaacag ggaagggaat ggggggtggc gtgggggggt gagatccaga agcagaagag
301 gaagggtggg tgtttttaaa tgctagagga tgctcgagtg atcgcctgta ggtggaggaa
361 gaacccaata gaaagaaaga gattaaaaat gtggaaagaa gaggagctaa atgggggcac
421 tggagtttag aggccttgaa agagatgagg aaccagcaga taggaagaag ccaggtttta
481 cagaggagag ggctggcctc ttcttttatc ttgggatggg aaggagggaa catccagaga
541 gatactgaag tgttgagaga caggcaggag ggaatttgtg ctagcatata cacatacgag
601 ttccgaattt ataaaaacac aagtagtttg cagttgcaca aaataacata tgcacaccta
661 cacacccatg cacacatgtg catgtgaatt ctggaaaaac acatcacaca cacaggcatg
721 ccctggagac taggcctaca gtagtccctg agccaagtgc agtgaggagg aaaggaaggt
781 gaggggaatc atctccagac ggggcaccag gagcctggct ccagtccccc acttgttcac
841 tcatggactg ggtaacttca ggcaagtgac ttcgcctctt ggtgactcca ttgcctgaag
901 ggcaaagaga gtacataaca cccaccctgc caaacagcag ggtgatgagg ctggcatgaa
961 atgaagcttc ctttctgctg tctctctttc tctgcagaga ttccgagtaa ggagacaaaa
1021 cccccacatg gctgtgacct tccagtattc cccgagcacc tgacctagaa ttacacacgc
1081 caccggccca aaactcacat cagcaagtcc cagcctccgc tagatgccga agttctctgt
1141 ctctccttcc tgctctctcc atgccacctg cccaccccat acccaatagc ctccccaggg
1201 tcccctccca tgcacctgct caatcagcag caacccaaga gtgaggggtg tccatttgtg
1261 tcttgttcac atccactcac tgtccttgta cctgctcctt ttctgtgacc tctctgggga
1321 tgctttttgg gggaacagct ggactaccct ggaacaacct ctggttggtc ttggggaggg
1381 gaagaaaggc agagaagcag tatgttctgc atgcttccca acgacagctc cgagcctggc
1441 tgtctgtccc acattcctct gctctagagc cctctgtcct cccctcgacc cttgtgcaac
1501 cttccccaat tgcctgagtt gctgggtcct ggaggttatg ggtttccaag agcttctgat
1561 ctttccttta ggaattccca atcgctggga ggacagattg tgctgggagg cagcgacccc
1621 cagcattacg aagggaattt ccactatatc aacctcatca agactggtgt ctggcagatt
1681 caaatgaagg ggtcagaaat cctcagaccc tccccgggct ccaaaaaatg ctgccgtcac
1741 tggggttggg gagggcgggc gcggactgca ttaccatcct gccctctttc caaatgcagc
1801 cacttcttaa gcacagccac catttgctct ctgcctggct ctggtccagg ctggggcaga
1861 gagaagggag gggcctgggc cggagtggtg gaggccgaga gtaccttccc tcctctactc
1921 actgcctcaa cagccagcca gcgtggcgct ccacccaccc acccaccact caggaaggac
1981 atgcagcctg gcgtgcccat cagccttctg tctgtctgtc tgtctgtctg tctctctgtc
2041 tgactgtggc gctcccccag ggtgtctgtg gggtcatcca ccttgctctg tgaagacggc
2101 tgcctggcat tggtagacac cggtgcatcc tacatctcag gttctaccag ctccatagag
2161 aagctcatgg aggccttggg agccaagaag aggctgtttg atgtaagaag ccaaagaggg
2221 aaggtgctgt gggtgtgggg agcggccacc tggtatcggc tcacaaatcc cccaggcaaa
2281 tgaggccatc tcaggccttc gcttgttcac ctcacactct ccacacatgt ggctggtcac
2341 ccatggggcg gggcactgtc cccagccctc tccagcagag agacccaggg ccaccagcgc
2401 aggactcctt gtctgctgag acgtcgttcc atactcaaga aggctctctt tgccccccac
2461 cccagtatgt cgtgaagtgt aacgagggcc ctacactccc cgacatctct ttccacctgg
2521 gaggcaaaga atacacgctc accagcgcgg actatgtatt tcaggtgagg ttcgagtcgg
2581 ccccctcggt ggcagggaga aaggctggac agagaccctc aagagtgaca gattacaatg
2641 cacagatcat gttagaactg tagttctcaa acttggctgt gcatgtcacc tggagagctt
2701 tggaaaaatc caggtacctg ggccacatcc catacctatt aaatcagaac ctctagaagt
2761 gggacctggg gttcagtttc cccagatgat tccaatgtgt ggccatgttt gggcatcact
2821 atgcctgttc cctcatctcc attttctcat caaatactcc caataatcct atgctcctat
2881 attcttaccc tcttttcata atcaataggc ttagagaatt tgaataactt gtctaggatc
2941 agaagctaag gcaaactgta agctcctgaa ggaagcacgt tgcctgatgc cctgtttgcc
3001 tgggatctag cacaggggct aaacatagga atggtgcagt ccacgatggg gcaaaat
//
LOCUS HUMRENA5 763 bp ds-DNA PRI 01-AUG-1990
DEFINITION Human renin gene, exon 10.
ACCESSION M10152
KEYWORDS renin.
SEGMENT 5 of 5
SOURCE Human fetal liver DNA, clone lambda-V.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 763)
AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
Morris,B.J. and Shine,J.
TITLE Primary structure of the human renin gene
JOURNAL DNA 3, 457-468 (1984)
STANDARD full staff_review
COMMENT A poly-adenylation signal is located at positions 357-362.
FEATURES from to/span description
pept + 22 183 preprorenin, exon 10 /nomgen="REN" /map="1q32"
/hgml_locus_uid="LW0050B"
matp + 22 180 renin
pre-msg < 1 > 183 renin mRNA
IVS < 1 21 renin intron I
BASE COUNT 170 a 236 c 194 g 163 t
ORIGIN Chromosome 1q32; about 0.6 kb after segment 4.
1 aaaactctcc ccctctgcca ggaatcctac agtagtaaaa agctgtgcac actggccatc
61 cacgccatgg atatcccgcc acccactgga cccacctggg ccctgggggc caccttcatc
121 cgaaagttct acacagagtt tgatcggcgt aacaaccgca ttggcttcgc cttggcccgc
181 tgaggccctc tgccacccag gcaggccctg ccttcagccc tggcccagag ctggaacact
241 ctctgagatg cccctctgcc tgccttatgc cctcagatgg agacattgga tgtggagctc
301 ctgctggatg cgtgccctga cccctcacag cccttccctg ctttgaggac aaagagaata
361 aagacttcat gttcacagcc tgttgcatct gggttcacta gggtttagaa cagagggagg
421 ggctgcgtga tcatgtgtgg acaggaatgt gacacagaca agctacacat tagcctaggc
481 cacaggttct tgcgtgcagg gatgatgcca tccatctgcc atcaacggga ctcaggtgga
541 gctgttacac aacctcaggt gggaagtctg aaaagagccg gaaccaagct ccctgctatc
601 gactcaggga ccaaggcgta atgctgtggc gagtagactg gggtcagaaa gttgtcccag
661 ctcacagaag ccagctctga gttcagactc tgctctgctg agctagtcag ccctgtctct
721 tgtccctgca aaactcccct cacctgtcct tatccacctg cag
//
LOCUS SYNT1RNAA 324 bp ds-DNA SYN 01-AUG-1990
DEFINITION Synthetic ribonuclease T1 gene, 3' end.
ACCESSION M37098 M35733 M35736
KEYWORDS ribonuclease T1.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 324)
AUTHORS Ikehara,M., Ohtsuka,E., Uesugi,S., Kikyodani,T., Aoyama,Y.,
Tokunaga,T. and Fujimoto,K.
TITLE Synthesis and expression of RNase T1 gene
JOURNAL Nucleic Acids Symp Ser 15, 197-200 (1984)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 324)
AUTHORS Nishikawa,S., Morioka,H., Tokunaga,T., Aoyama,Y., Kikyotani,S.,
Fujimoto,K., Yanase,K., Tanaka,T., Uesugi,S., Ohtsuka,E. and
Ikehara,M.
TITLE Synthesis and expression of the native RNase T1 gene and several
mutant genes
JOURNAL Nucleic Acids Symp Ser 16, 287-290 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 321 ribonuclease T1 precursor (AA at 1)
sigp < 1 6 ribonuclease T1 signal peptide
matp 7 318 ribonuclease T1
BASE COUNT 73 a 98 c 75 g 78 t
ORIGIN
1 ttcatggctt gcgactacac ctgcggcagc aactgctact ctagctctga cgtttctacc
61 gctcaggctg ctggctacca gctgcacgag gacggcgaaa ccgttggctc taactcttac
121 ccgcacaaat acaacaacta tgagggcttc gactttagcg tttcttctcc gtactacgaa
181 tggccgatcc tgtctagcgg cgacgtttac tccggtccag gtagcggtgc tgaccgtgta
241 gtattcaacg aaaacaacca gctcgctggc gttatcaccc acaccggcgc ttctggcaac
301 aactttgtag aatgcaccta atag
//
LOCUS TIPCDREG 209 bp ds-DNA BCT 01-AUG-1990
DEFINITION Plasmid pTiC58 promoter-active fragment CD25 DNA.
ACCESSION M35735
KEYWORDS .
SOURCE Plasmid pTiC58 DNA.
ORGANISM Plasmid pTiC58
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 209)
AUTHORS Tait,R.C. and Kado,C.I.
TITLE Regulation of the virC and virD promoters of pTiC58 by the ros
chromosomal mutation of Agrobacterium tumefaciens
JOURNAL Mol. Microbiol. 2, 385-392 (1988)
STANDARD simple staff_review
BASE COUNT 60 a 37 c 41 g 71 t
ORIGIN
1 gtcgacccgg gatccgcggc gataattcat aagtaatgta gtaattacct gattttatat
61 ttcaatttta ttgtaatata atttcaattg taataatata aaaataaata tcccttatgt
121 gttcttgatt tcgttttgta tatggctaga ttcccatctg ccacgacgag gaaatgctac
181 ggcggggcaa gttcagatcc cgggtcgac
//
LOCUS FIBGLUC 1426 bp ds-DNA BCT 01-AUG-1990
DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene,
complete cds.
ACCESSION M33676 M33311
KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase;
mised-linkage beta-glucanase.
SOURCE F.succinogenes (strain S85) DNA, clone PJI5.
ORGANISM Fibrobacter succinogenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE 1 (bases 1 to 1426)
AUTHORS Teather,R.M. and Erfle,J.D.
TITLE DNA sequence of a Fibrobacter succinogenes mixed-linkage
beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene
JOURNAL J. Bacteriol. 172, 3837-3841 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.M.Teather, 11-APR-1990.
FEATURES from to/span description
pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase
precursor (EC 3.2.1.73)
sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase
signal peptide
matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase
binding 132 137 ribosome binding site
signal 62 66 -35 region
signal 85 90 -10 region
BASE COUNT 371 a 346 c 335 g 374 t
ORIGIN
1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt
61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt
121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc
181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt
241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg
301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc
361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc
421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct
481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac
541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac
601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt
661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat
721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat
781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt
841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc
901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa
961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct
1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac
1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt
1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg
1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt
1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa
1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt
1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa
//
LOCUS BOVRS157A 824 bp ss-mRNA MAM 01-AUG-1990
DEFINITION Bovine retina-specific 15.7 kDa protein mRNA, complete cds.
ACCESSION M34915
KEYWORDS .
SOURCE Bovine retina, cDNA to mRNA, clone pCR18.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 824)
AUTHORS Nakagawa,Y., Kuo,C.-H., Ishii,K., Shiosaka,S., Tohyama,M. and
Miki,N.
TITLE Cloning and characterization of a cDNA specific for bovine retina
JOURNAL Neurosci. Res. 3, 300-310 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 138 581 retina-specific 15.7 kDa protein
mRNA < 1 824 retina-specific mRNA
signal 800 805 polyA signal
BASE COUNT 178 a 219 c 224 g 203 t
ORIGIN
1 tttagcctca gccgtgaccg gccccgtccc gcggcgccgg gagttcgtgt gaacgggtag
61 gtgtaccgac ttcgcccgtc cgtgaatccc gtggtcgcaa aggcccgcgc ggcgggccgg
121 gttctgccga taccttaatg ggctgtgcgc gaggagagcc tcaattgcaa gttggtcgag
181 gagatcgcca cgctggtgca gagctggcct cactagttgc ggctagtgta ggacgttgta
241 ctccgacatt ccgcaagccc ttccacacgg acagtcctag catccagggt cagtggcacc
301 ccttcaccaa caaaccgaca gcactggggt gctcctcgag aggtccagaa tcctgccccg
361 acccagcggc cagcacaatg aagaccaact ccatacccac agtttggact tttactccag
421 cagagggtgg ttcctgctcc tggtttgctt cacgggagac agatgaagcc accaatgggg
481 tacttcttgc ttgggataaa gaagagctgc ctgtctcttt tgatgtccac cgtgaggcag
541 ggactgtgag tctcctcatt cttagccagt tgacatcctg aaaccctgag aatcttcaga
601 gatttgactt ggtcttcatt tcttaaatcc aaatcaataa tagtgatctc aaatcaagtg
661 agggctttca aggctggctt ctgaagaatt ccttttggcc tgtttctgta gccagtgacc
721 aagagagtct gctgtgagct ggcattgggc taggccttgt atctatgtga tgtttgtgtg
781 cagttagaaa actgaagtta ataaatttgc caaggtcaca cttg
//
LOCUS CHKFRA2A1 360 bp ds-DNA VRT 01-AUG-1990
DEFINITION Chicken fra-2 oncogene gene, exon 1.
ACCESSION D90104
KEYWORDS fos-related gene; fra-2 gene; oncogene.
SEGMENT 1 of 4
SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4].
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 360)
AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H.
TITLE Isolation and characterization of fra-2, and additional member of
the fos gene family
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 241 + 339 fra-2 protein, exon 1
pre-msg < 1 > 360 fra-2 mRNA and introns
IVS 340 > 360 fra-2 intron A
BASE COUNT 39 a 123 c 113 g 85 t
ORIGIN
1 tgtttttttg gttgtttttt ttttttgtcg gctttccgct ttttcttttt ttcttttttt
61 tccctttttc tatttttccc ccccttcttc ttctcccgct gcggactctc ccccggctgc
121 gggaggcgcg aggcagagcc cgagaggtcg gcacggagca gggggcgggg agacggcgag
181 ggagcggcgg ccgcggcgcg ggaaggcggg gacgcggctc ccccgggccg gcctcggacc
241 atgtaccagg actatcccgg gagcttcgac acctcctcca gaggcagcag cggctccccg
301 ggacaccccg agccctactc cgccggcgca gcccagcagg tagggccgcc tccgccccgt
//
LOCUS CHKFRA2A2 297 bp ds-DNA VRT 01-AUG-1990
DEFINITION Chicken fra-2 oncogene gene, exon 2.
ACCESSION D90105
KEYWORDS fos-related gene; fra-2 gene; oncogene.
SEGMENT 2 of 4
SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4].
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 297)
AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H.
TITLE Isolation and characterization of fra-2, and additional member of
the fos gene family
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 22 + 276 fra-2 protein, exon 2
pre-msg < 1 > 297 fra-2 mRNA and introns
IVS < 1 21 fra-2 intron A
IVS 277 > 297 fra-2 intron B
BASE COUNT 68 a 103 c 74 g 52 t
ORIGIN About 5 kbp after segment 1.
1 ctcccccacc tttcctccta gaaattccga gtagatatgc caggatcagg cagtgctttt
61 attcccacga tcaacgccat cacaaccagc caagacctgc agtggatggt gcagcccacc
121 gtcatcacct ccatgtccag cccgtactct cgctcgcacc cctacagcca cccactgccg
181 ccgctgtcct cggtggctgg acacacggcc cttcagcgac cgggcgtgat caaaaccatc
241 ggcaccacag tgggacggag acgaagggat gagcaggtaa ctgtgtgagc aggagga
//
LOCUS CHKFRA2A3 149 bp ds-DNA VRT 01-AUG-1990
DEFINITION Chicken fra-2 oncogene gene, exon 3.
ACCESSION D90106
KEYWORDS fos-related gene; fra-2 gene; oncogene.
SEGMENT 3 of 4
SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4].
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 149)
AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H.
TITLE Isolation and characterization of fra-2, and additional member of
the fos gene family
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 22 + 129 fra-2 protein, exon 3
pre-msg < 1 > 149 fra-2 mRNA and introns
IVS < 1 21 fra-2 intron B
IVS 130 > 149 fra-2 intron C
BASE COUNT 42 a 34 c 46 g 27 t
ORIGIN About 4 kbp after segment 2.
1 tttcttggca cttgcccata gctgtcgcct gaggaagaag agaagcgaag gatccggaga
61 gagaggaaca agctggcagc tgctaaatgt cgtaacaggc gccgagagct aacagagaaa
121 ctccaggcgg tacgtgctct gcatgcatt
//
LOCUS CHKFRA2A4 744 bp ds-DNA VRT 01-AUG-1990
DEFINITION Chicken fra-2 oncogene gene, exon 4.
ACCESSION D90107
KEYWORDS fos-related gene; fra-2 gene; oncogene.
SEGMENT 4 of 4
SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4].
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 744)
AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H.
TITLE Isolation and characterization of fra-2, and additional member of
the fos gene family
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 22 531 fra-2 protein, exon 4
pre-msg < 1 > 744 fra-2 mRNA and introns
IVS < 1 21 fra-2 intron C
BASE COUNT 180 a 206 c 209 g 149 t
ORIGIN About 2 kbp after segment 3.
1 ttattccctt tttgtctgca ggaaactgag gtgctggagg aggaaaagtc agtgcttcaa
61 aaagagattg ctgagctcca gaaggagaag gagaaactag agttcatgct ggttgctcac
121 agccctgtgt gtaaaatcag ccctgaggaa cgtcggagcc caccaaccag cagcctccag
181 agcgttcgga ctggagcgag cggagcagtg gtggtgaagc aggagcctgt ggaggaagag
241 atcccatctt cctctttggt ccttgacaaa gctcagaggt ctgtcattaa gcccatcagc
301 attgctggag gttattatgg ggaggaggca ctcaacactc ccatcgtggt gacctcgaca
361 ccagccatca ctcctggttc ctccaacttg gtgttcacct accccaatgt cttggatcag
421 gagtctcctc tctccccgtc cgagtcctgc tccaaagctc accggaggag cagcagcagc
481 ggcgaccagt cctcggattc cttgaactct cccaccttgc tggcattgta atcccctgag
541 gcccccccat tgccagtgtg ttacatcccc cgcccggctc catggggaga cccctccatg
601 ggattagaga caggcacagg atcgttcaag cacaagggca gcaagaacaa gaatggggaa
661 atgctgcagc tccaggaaag agagtgagga ccaatgccag ctccctggag gcaggaaatg
721 gcaagggtgg gactgatgca ccag
//
LOCUS ECOTGP 7335 bp ds-DNA BCT 01-AUG-1990
DEFINITION E.coli tryptophan operon: entire DNA sequence.
ACCESSION J01714 M12471 M12472 M25593
KEYWORDS anthranilate isomerase; anthranilate synthetase; attenuator;
glutamine amidotransferase; isomerase; leader peptide;
phosphoribosyl anthranilate synthetase; synthetase; transferase;
trp operon; trpA gene; trpB gene; trpC gene; trpD gene; trpE gene;
tryptophan synthetase.
SOURCE Escherichia coli RNA and DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 5917 to 6133)
AUTHORS Platt,T. and Yanofsky,C.
TITLE An intercistronic region and ribosome-binding site in bacterial
messenger RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 72, 2399-2403 (1975)
STANDARD full staff_review
REFERENCE 2 (bases 84 to 141)
AUTHORS Bennett,G.N., Schweingruber,M.E., Brown,K.D., Squires,C. and
Yanofsky,C.
TITLE Nucleotide sequence of region preceding trp mRNA initiation site
and its role in promoter and operator function
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 73, 2351-2355 (1976)
STANDARD full staff_review
REFERENCE 3 (bases 117 to 310)
AUTHORS Squires,C., Lee,F., Bertrand,K., Squires,C.L., Bronson,M.J. and
Yanofsky,C.
TITLE Nucleotide sequence of the 5' end of tryptophan messenger RNA of
Escherichia coli
JOURNAL J. Mol. Biol. 103, 351-381 (1976)
STANDARD full staff_review
REFERENCE 4 (bases 230 to 272)
AUTHORS Bertrand,K., Korn,L.J., Lee,F. and Yanofsky,C.
TITLE The attenuator of the tryptophan operon of Escherichia coli:
heterogeneous 3'-OH termini in vivo and deletion mapping of
functions
JOURNAL J. Mol. Biol. 117, 227-247 (1977)
STANDARD full staff_review
REFERENCE 5 (bases 230 to 272)
AUTHORS Stauffer,G.V., Zurawski,G. and Yanofsky,C.
TITLE Single base-pair alterations in the Escherichia coli trp operon
leader region that relieve transcription termination at the trp
attenuator
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 4833-4837 (1978)
STANDARD full staff_review
REFERENCE 6 (bases 6707 to 6863)
AUTHORS Wu,A.M. and Platt,T.
TITLE Transcription termination: nucleotide sequence at 3' end of
tryptophan operon in Escherichia coli
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 5442-5446 (1978)
STANDARD full staff_review
REFERENCE 7 (bases 1 to 140)
AUTHORS Bennett,G.N., Schweingruber,M.E., Brown,K.D., Squires,C. and
Yanofsky,C.
TITLE Nucleotide sequence of the promoter-operator region of the
tryptophan operon of Escherichia coli
JOURNAL J. Mol. Biol. 121, 113-137 (1978)
STANDARD full staff_review
REFERENCE 8 (bases 2351 to 2503)
AUTHORS Miozzari,G.F. and Yanofsky,C.
TITLE Gene fusion during the evolution of the tryptophan operon in
enterobacteriaceae
JOURNAL Nature 277, 486-489 (1979)
STANDARD full staff_review
REFERENCE 9 (bases 5932 to 6809)
AUTHORS Nichols,B.P. and Yanofsky,C.
TITLE Nucleotide sequences of trpA of Salmonella typhimurium
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 5244-5248 (1979)
STANDARD full staff_review
REFERENCE 10 (bases 117 to 256)
AUTHORS Oxender,D.L., Zurawski,G. and Yanofsky,C.
TITLE Attenuation in the Escherichia coli tryptophan operon: role of RNA
secondary structure involving the tryptophan codon region
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 5524-5528 (1979)
STANDARD full staff_review
REFERENCE 11 (bases 3422 to 4824)
AUTHORS Christie,G.E. and Platt,T.
TITLE Gene structure in the tryptophan operon of Escherichia coli:
nucleotide sequence of trpC and the flanking intercistronic regions
JOURNAL J. Mol. Biol. 142, 519-530 (1980)
STANDARD full staff_review
REFERENCE 12 (bases 230 to 296)
AUTHORS Farnham,P.J. and Platt,T.
TITLE A model for transcription termination suggested by studies on the
trp attenuator in vitro using base analogs
JOURNAL Cell 20, 739-748 (1980)
STANDARD full staff_review
REFERENCE 13 (bases 4810 to 6003)
AUTHORS Crawford,I.P., Nichols,B.P. and Yanofsky,C.
TITLE Nucleotide sequence of the trpB gene in Escherichia coli and
Salmonella typhimurium
JOURNAL J. Mol. Biol. 142, 489-502 (1980)
STANDARD full staff_review
REFERENCE 14 (bases 1761 to 2443)
AUTHORS Nichols,B.P., Miozzari,G.F., van Cleemput,M., Bennett,G.N. and
Yanofsky,C.
TITLE Nucleotide sequences of the trpG regions of Escherichia coli,
Shigella dysenteriae, Salmonella typhimurium and Serratia
marcescens
JOURNAL J. Mol. Biol. 142, 503-517 (1980)
STANDARD full staff_review
REFERENCE 15 (bases 6707 to 7335)
AUTHORS Wu,A.M., Chapman,A.B., Platt,T., Guarente,L.P. and Beckwith,J.
TITLE Deletions of distal sequence affect termination of transcription at
the end of the tryptophan operon in E. coli
JOURNAL Cell 19, 829-836 (1980)
STANDARD full staff_review
REFERENCE 16 (bases 279 to 1843)
AUTHORS Nichols,B.P., van Cleemput,M. and Yanofsky,C.
TITLE Nucleotide sequence of Escherichia coli trpE: anthranilate
synthetase component I contains no tryptophan residues
JOURNAL J. Mol. Biol. 146, 45-54 (1981)
STANDARD full staff_review
REFERENCE 17 (bases 5932 to 6809)
AUTHORS Schneider,W.P., Nichols,B.P. and Yanofsky,C.
TITLE Procedure for production of hybrid genes and proteins and its use
in assessing significance of amino acid differences in homologous
tryptophan synthetase alpha polypeptides
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 2169-2173 (1981)
STANDARD full staff_review
REFERENCE 18 (bases 6807 to 6856; 7057 to 7119)
AUTHORS Wu,A.M., Christie,G.E. and Platt,T.
TITLE Tandem termination sites in the tryptophan operon of Escherichia
coli
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 2913-2917 (1981)
STANDARD full staff_review
REFERENCE 19 (review; bases 77 to 6809; compiled)
AUTHORS Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E.,
Horowitz,H., van Cleemput,M. and Wu,A.M.
TITLE The complete nucleotide sequence of the tryptophan operon of
Escherichia coli
JOURNAL Nucleic Acids Res. 9, 6647-6668 (1981)
STANDARD full staff_review
REFERENCE 20 (bases 2504 to 3436)
AUTHORS Horowitz,H., Christie,G.E. and Platt,T.
TITLE Nucleotide sequence of the trpD gene, encoding anthranilate
synthetase component II of Escherichia coli
JOURNAL J. Mol. Biol. 156, 245-256 (1982)
STANDARD full staff_review
REFERENCE 21 (bases 57 to 137)
AUTHORS Windass,J.D., Newton,C.R., De Maeyer-Guignard,J., Moore,V.E.,
Markham,A.F. and Edge,M.D.
TITLE The construction of a synthetic Escherichia coli trp promoter and
its use in the expression of a synthetic interferon gene
JOURNAL Nucleic Acids Res. 10, 6639-6657 (1982)
STANDARD full staff_review
REFERENCE 22 (sites; mutational analysis of the regulatory region)
AUTHORS Kolter,R. and Yanofsky,C.
TITLE Genetic analysis of the tryptophan operon regulatory region using
site-directed mutagenesis
JOURNAL J. Mol. Biol. 175, 299-312 (1984)
STANDARD full staff_entry
REFERENCE 23 (bases 36 to 136)
AUTHORS Brown,K.D., Bennet,G.N., Lee,F., Schweingruber,M.E. and Yanofsky,C.
TITLE RNA polymerase interaction at the promoter-operator region of the
tryptophan operon of Escherichia coli and Salmonella typhimurium
JOURNAL J. Mol. Biol. 121, 153-177 (1978)
STANDARD simple staff_entry
COMMENT The tryptophan operon of E.coli consists of a repressor(trpR), a
promoter(trpP), an operator(trpO), an attenuator which is part of a
leader peptide region(trpL) and five structural genes:
trpE(anthranilate synthetase), trpD(glutamine amido transferase and
anthranilate 5-phosphoribosylpyrophosphate phosphoribosyl-
transferase), trpC(phosphoribosyl anthranilate isomerase-indole
glycerol phosphate synthetase), trpB(tryptophan synthetase beta)
and trpA(tryptophan synthetase alpha).
The promoter region covers approximately 40 bases upstream from
the mRNA initiation site(75-116); the operator approximately 20
bases upstream with two-fold axes of symmetry around 104-105 and
109-110([2],[7],[20]). The attenuator region is the first 140
nucleotides(117-256) of the mRNA leader, a G-C rich region with a
two-fold axis of symmetry around base 240 and an A-T rich region
with its axis about bases 259-260; it provides a second site for
control of transcription ([4],[5],[10],[12]). Two mRNA termination
regions are reported: trpT (bases 6807-6856) and trpT' (bases
7057-7119), the first of which bears some similarity to the
attenuator region ([18]). A chi site for recombination is localized
between bases 2492 and 2501 and the trp-P2 promoter is located
between bases 3240 and 3280 ([20]).
The trpE gene is unusual in that it codes for no tryptophan
residues([16]). The two enzymatic functions coded by trpG and trpD
genes in S.marcescens are coded by the single trpD gene in E.coli
and other enterobacteriaceae. This appears to have occurred via
base changes at sites 2420 and 2438. The intercistronic regions for
the structural genes show little superfluity: the trpE-trpD and
trpB-trpA boundaries consist of 'tgatg'; the trpD-trpC boundary is
'taaatgatg' and the trpC-trpB boundary is 'taaggaaaggaacaatg'. All
the cistrons show a high degree of homology with their correlates
among the enterobacteriaceae. Sequence discrepancies in early
work([3]) are corrected in later work from the same
laboratory([10],[19]). [17] also sequenced S.typhimurium trpA
region. [19] compiles sequences from
[7],[8],[9],[11],[13],[14],[16],[20].
FEATURES from to/span description
pept 143 187 trp operon leader peptide (putative)
pept 279 1841 anthranilate synthetase component I
/nomgen="trpE"
pept 1841 3436 anthranilate synthetase component II:
glutamine amidotransferase and phosphoribosyl
anthranilate synthetase /nomgen="trpD"
pept 3440 4798 anthranilate isomerase /nomgen="trpC"
pept 4810 6003 tryptophan synthetase beta subunit
/nomgen="trpB"
pept 6003 6809 tryptophan synthetase alpha subunit
/nomgen="trpA"
mRNA 117 257 trp mRNA (alt.) [2],[3],[7],[10],[21]
mRNA 117 6842 trp mRNA (alt.) [2],[3],[6],[7],[10],[18],[21]
used
revision 1787 1787 c in [16]; t in [14]
revision 1793 1793 t in [16]; c in [14]
conflict 3526 3530 gg in [19]; gaatg in [11]
conflict 4289 4293 gc in [19]; gttgc in [11]
conflict 5949 5949 c in [1]; a in [17]
BASE COUNT 1740 a 1926 c 1960 g 1705 t 4 others
ORIGIN 9 bp upstream from HhaI site [7].
1 ctcaaggcgc actcccgttc tggataatgt tttttgcgcc gacatcataa cggttctggc
61 aaatattctg aaatgagctg ttgacaatta atcatcgaac tagttaacta gtacgcaagt
121 tcacgtaaaa agggtatcga caatgaaagc aattttcgta ctgaaaggtt ggtggcgcac
181 ttcctgaaac gggcagtgta ttcaccatgc gtaaagcaat cagataccca gcccgcctaa
241 tgagcgggct tttttttgaa caaaattaga gaataacaat gcaaacacaa aaaccgactc
301 tcgaactgct aacctgcgaa ggcgcttatc gcgacaatcc caccgcgctt tttcaccagt
361 tgtgtgggga tcgtccggca acgctgctgc tggaatccgc agatatcgac agcaaagatg
421 atttaaaaag cctgctgctg gtagacagtg cgctgcgcat tacagcttta ggtgacactg
481 tcacaatcca ggcactttcc ggcaacggcg aagccctcct ggcactactg gataacgccc
541 tgcctgcggg tgtggaaagt gaacaatcac caaactgccg tgtgctgcgc ttcccccctg
601 tcagtccact gctggatgaa gacgcccgct tatgctccct ttcggttttt gacgctttcc
661 gtttattgca gaatctgttg aatgtaccga aggaagaacg agaagccatg ttcttcagcg
721 gcctgttctc ttatgacctt gtggcgggat ttgaagattt accgcaactg tcagcggaaa
781 ataactgccc tgatttctgt ttttatctcg ctgaaacgct gatggtgatt gaccatcaga
841 aaaaaagcac ccgtattcag gccagcctgt ttgctccgaa tgaagaagaa aaacaacgtc
901 tcactgctcg cctgaacgaa ctacgtcagc aactgaccga agccgcgccg ccgctgccag
961 tggtttccgt gccgcatatg cgttgtgaat gtaatcagag cgatgaagag ttcggtggcg
1021 tagtgcgttt gttgcaaaaa gcgattcgcg ctggagaaat tttccaggtg gtgccatctc
1081 gccgtttctc tctgccctgc ccgtcaccgc tggcggccta ttacgtgctg aaaaagagta
1141 atcccagccc gtacatgttt tttatgcagg ataatgattt caccctattt ggcgcgtcgc
1201 cggaaagctc gctcaagtat gatgccacca gccgccagat tgagatctac ccgattgccg
1261 gaacacgccc acgcggtcgt cgcgccgatg gttcactgga cagagatctc gacagccgta
1321 ttgaactgga aatgcgtacc gatcataaag agctgtctga acatctgatg ctggttgatc
1381 tcgcccgtaa tgatctggca cgcatttgca cccccggcag ccgctacgtc gccgatctca
1441 ccaaagttga ccgttattcc tatgtgatgc acctcgtctc tcgcgtagtc ggcgaactgc
1501 gtcacgatct tgacgccctg cacgcttatc gcgcctgtat gaatatgggg acgttaagcg
1561 gtgcgccgaa agtacgcgct atgcagttaa ttgccgaggc ggaaggtcgt cgccgcggca
1621 gctacggcgg cgcggtaggt tatttcaccg cgcatggcga tctcgacacc tgcattgtga
1681 tccgctcggc gctggtggaa aacggtatcg ccaccgtgca agcgggtgct ggtgtagtcc
1741 ttgattctgt tccgcagtcg gaagccgacg aaacccgtaa caaagcccgc gctgtactgc
1801 gcgctattgc caccgcgcat catgcacagg agactttctg atggctgaca ttctgctgct
1861 cgataatatc gactctttta cgtacaacct ggcagatcag ttgcgcagca atgggcataa
1921 cgtggtgatt taccgcaacc atataccggc gcaaacctta attgaacgct tggcgaccat
1981 gagtaatccg gtgctgatgc tttctcctgg ccccggtgtg ccgagcgaag ccggttgtat
2041 gccggaactc ctcacccgct tgcgtggcaa gctgcccatt attggcattt gcctcggaca
2101 tcaggcgatt gtcgaagctt acgggggcta tgtcggtcag gcgggcgaaa ttctccacgg
2161 taaagcctcc agcattgaac atgacggtca ggcgatgttt gccggattaa caaacccgct
2221 gccggtggcg cgttatcact cgctggttgg cagtaacatt ccggccggtt taaccatcaa
2281 cgcccatttt aatggcatgg tgatggcagt acgtcacgat gcggatcgcg tttgtggatt
2341 ccagttccat ccggaatcca ttctcaccac ccagggcgct cgcctgctgg aacaaacgct
2401 ggcctgggcg cagcataaac tagagccagc caacacgctg caaccgattc tggaaaaact
2461 gtatcaggcg cagacgctta gccaacaaga aagccaccag ctgttttcag cggtggtgcg
2521 tggcgagctg aagccggaac aactggcggc ggcgctggtg agcatgaaaa ttcgcggtga
2581 gcacccgaac gagatcgccg gggcagcaac cgcgctactg gaaaacgcag cgccgttccc
2641 gcgcccggat tatctgtttg ctgatatcgt cggtactggc ggtgacggca gcaacagtat
2701 caatatttct accgccagtg cgtttgtcgc cgcggcctgt gggctgaaag tggcgaaaca
2761 cggcaaccgt agcgtctcca gtaaatctgg ttcgtccgat ctgctggcgg cgttcggtat
2821 taatcttgat atgaacgccg ataaatcgcg ccaggcgctg gatgagttag gtgtatgttt
2881 cctctttgcg ccgaagtatc acaccggatt ccgccacgcg atgccggttc gccagcaact
2941 gaaaacccgc accctgttca atgtgctggg gccattgatt aacccggcgc atccgccgct
3001 ggcgttaatt ggtgtttata gtccggaact ggtgctgccg attgccgaaa ccttgcgcgt
3061 gctggggtat caacgcgcgg cggtggtgca cagcggcggg atggatgaag tttcattaca
3121 cgcgccgaca atcgttgccg aactgcatga cggcgaaatt aaaagctatc agctcaccgc
3181 agaagacttt ggcctgacac cctaccacca ggagcaactg gcaggcggaa caccggaaga
3241 aaaccgtgac attttaacac gtttgttaca aggtaaaggc gacgccgccc atgaagcagc
3301 cgtcgctgcg aacgtcgcca tgttaatgcg cctgcatggc catgaagatc tgcaagccaa
3361 tgcgcaaacc gttcttgagg tactgcgcag tggttccgct tacgacagag tcaccgcact
3421 ggcggcacga gggtaaatga tgcaaaccgt tttagcgaaa atcgtcgcag acaaggcgat
3481 ttgggtagaa gcccgcaaac agcagcaacc gctggccagt tttcagaatg aggttcagcc
3541 gagcacgcga catttttatg atgcgctaca gggtgcgcgc acggcgttta ttctggagtg
3601 caagaaagcg tcgccgtcaa aaggcgtgat ccgtgatgat ttcgatccag cacgcattgc
3661 cgccatttat aaacattacg cttcggcaat ttcggtgctg actgatgaga aatatttcag
3721 gggtagcttt aatttcctcc ccatcgtcag ccaaatcgcc ccgcagccga ttttatgtaa
3781 agacttcatt atcgaccctt accagatcta tctggcgcgc tattaccagg ccgatgcctg
3841 cttattaatg ctttcagtac tggatgacga ccaatatcgc cagcttgccg ccgtcgctca
3901 cagtctggag atgggggtgc tgaccgaagt cagtaatgaa gaggaacagg agcgcgccat
3961 tgcattggga gcaaaggtcg ttggcatcaa caaccgcgat ctgcgtgatt tgtcgattga
4021 tctcaaccgt acccgcgagc ttgcgccgaa actggggcac aacgtgacgg taatcagcga
4081 atccggcatc aatacttacg ctcaggtgcg cgagttaagc cacttcgcta acggttttct
4141 gattggttcg gcgttgatgg cccatgacga tttgcacgcc gccgtgcgcc gggtgttgct
4201 gggtgagaat aaagtatgtg gcctgacgcg tgggcaagat gctaaagcag cttatgacgc
4261 gggcgcgatt tacggtgggt tgatttttgt tgcgacatca ccgcgttgcg tcaacgttga
4321 acaggcgcag gaagtgatgg ctgcggcacc gttgcagtat gttggcgtgt tccgcaatca
4381 cgatattgcc gatgtggtgg acaaagctaa ggtgttatcg ctggtggcag tgcaactgca
4441 tggtaatgaa gaacagctgt atatcgatac gctgcgtgaa gctctgccag cacatgttgc
4501 catctggaaa gcattaagcg tcggtgaaac cctgcccgcc cgcgagtttc agcacgttga
4561 taaatatgtt ttagacaacg gccagggtgg aagcgggcaa cgttttgact ggtcactatt
4621 aaatggtcaa acgcttggca acgttctgct ggcggggggc ttaggcgcag ataactgcgt
4681 ggaagcggca caaaccggct gcgccggact tgattttaat tctgctgtag agtcgcaacc
4741 gggcatcaaa gacgcacgtc ttttggcctc ggttttccag acgctgcgcg catattaagg
4801 aaaggaacaa tgacaacatt acttaacccc tattttggtg agtttggcgg catgtacgtg
4861 ccacaaatcc tgatgcctgc tctgcgccag ctggaagaag cttttgtcag tgcgcaaaaa
4921 gatcctgaat ttcaggctca gttcaacgac ctgctgaaaa actatgccgg gcgtccaacc
4981 gcgctgacca aatgccagaa cattacagcc gggacgaaca ccacgctgta tctcaagcgt
5041 gaagatttgc tgcacggcgg cgcgcataaa actaaccagg tgctggggca ggcgttgctg
5101 gcgaagcgga tgggtaaaac cgaaatcatc gccgaaaccg gtgccggtca gcatggcgtg
5161 gcgtcggccc tggccagcgc cctgctcggc ctgaaatgcc gtatttatat gggtgccaaa
5221 gacgttgaac gccagtcgcc taacgttttt cgtatgcgct taatgggtgc ggaagtgatc
5281 ccggtgcata gcggttccgc gacgctgaaa gatgcctgta acgaggcgct gcgcgactgg
5341 tccggtagtt acgaaaccgc gcactatatg ctgggcaccg cagctggccc gcatccttat
5401 ccgaccattg tgcgtgagtt tcagcggatg attggcgaag aaaccaaagc gcagattctg
5461 gaaagagaag gtcgcctgcc ggatgccgtt atcgcctgtg ttggcggcgg ttcgaatgcc
5521 atcggcatgt ttgctgattt catcaatgaa accaacgtcg gcctgattgg tgtggagcca
5581 ggtggtcacg gtatcgaaac tggcgagcac ggcgcaccgc taaaacatgg tcgcgtgggt
5641 atctatttcg gtatgaaagc gccgatgatg caaaccgaag acgggcagat tgaagaatct
5701 tactccatct ccgccggact ggatttcccg tctgtcggcc cacaacacgc gtatcttaac
5761 agcactggac gcgctgatta cgtgtctatt accgatgatg aagcccttga agccttcaaa
5821 acgctgtgcc tgcacgaagg gatcatcccg gcgctggaat cctcccacgc cttggcccat
5881 gcgttgaaaa tgatgcgcga aaacccggat aaagagcagc tactggtggt taacctttcc
5941 ggtcgcggcg ataaagacat cttcaccgtt cacgatattt tgaaagcacg aggggaaatc
6001 tgatggaacg ctacgaatct ctgtttgccc agttgaagga gcgcaaagaa ggcgcattcg
6061 ttcctttcgt cacgctcggt gatccgggca ttgagcagtc attgaaaatt atcgatacgc
6121 taattgaagc cggtgctgac gcgctggagt taggtatccc cttctccgac ccactggcgg
6181 atggcccgac gattcaaaac gccactctgc gcgcctttgc ggcaggtgtg actccggcac
6241 aatgttttga aatgctggca ctgattcgcc agaaacaccc gaccattccc attggcctgt
6301 tgatgtatgc caatctggtg tttaacaaag gcattgatga gttttatgcc cagtgcgaaa
6361 aagtcggcgt cgattcggtg ctggttgccg atgtgccagt tgaagagtcc gcgcccttcc
6421 gccaggccgc gttgcgtcac aacgtcgcac ctatcttcat ctgcccgcca aatgccgatg
6481 acgacctgct gcgccagata gcctcttacg gtcgtggtta cacctatttg ctgtcacgag
6541 caggcgtgac cggcgcagaa aaccgcgccg cgttacccct caatcatctg gttgcgaagc
6601 tgaaagagta caacgctgca cctccattgc agggatttgg tatttccgcc ccggatcagg
6661 taaaagcagc gattgatgca ggagctgcgg gcgcgatttc tggttcggcc attgttaaaa
6721 tcatcgagca acatattaat gagccagaga aaatgctggc ggcactgaaa gtttttgtac
6781 aaccgatgaa agcggcgacg cgcagttaat cccacagccg ccagttccgc tggcggcatt
6841 ttaactttct ttaatgaagc cggaaaaatc ctaaattcat ttaatattta tctttttacc
6901 gtttcgctta ccccggtcga tcgtyractt acgtcatttt tccgcccaac agtaatataa
6961 acaaacaaat taaacccgca acataacacc agtaaaatca ataattttct ctaagtcact
7021 tattcctcag gtaattctta atatatccag aatgttcctc aaaatatatt ttccctctat
7081 cttctcgttg cgcttaattt gactaattct cattagcgac taattttaat gagtgtcgac
7141 acacaacact catattaatg aaacaatgca acgcaacggg agaaataaca tggccgaaca
7201 tcgtggtggt tcaggaaatt tcgccgaaga ccgtgagaag gcatccgacg cagccgtaaa
7261 ggcggtcagc atagcggcgg taattttaaa aatgatcgca acgcgcatct gaagcgggta
7321 aaaaaggcgg tyrac
//
LOCUS HUMGSTH 808 bp ss-mRNA PRI 01-AUG-1990
DEFINITION Human glutathione S-transferase (GST) a-subunit mRNA, complete
cds.
ACCESSION M14777
KEYWORDS GSH S-transferase; glutathione S-transferase.
SOURCE Human liver, cDNA to mRNA, clone pGTH1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 808)
AUTHORS Tu,C.-P.D. and Qian,B.
TITLE Human liver glutathione S-transferases: Complete primary sequence
of an H-a subunit cDNA
JOURNAL Biochem. Biophys. Res. Commun. 141, 229-237 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 67 735 glutathione S-transferase (GST, EC 2.5.1.18)
/hgml_locus_uid="LL0130R" /nomgen="GST2"
/map="6p12.2"
mRNA < 1 808 GST mRNA
signal 792 797 polyA signal
BASE COUNT 252 a 175 c 192 g 189 t
ORIGIN Chromosome 6p12.2.
1 agttgtcgag ccaggacggt gacagcgttt aacaaagctt agagaaacct ccaggagact
61 gctatcatgg cagagaagcc caagctccac tacttcaatg cacggggcag aatggagtcc
121 acccggtggc tcctggctgc agctggagta gagtttgaag agaaatttat aaaatctgca
181 gaagatttgg acaagttaag aaatgatgga tatttgatgt tccagcaagt gccaatggtt
241 gagattgatg ggatgaagct ggtgcagacc agagccattc tcaactacat tgccagcaaa
301 tacaacctct atgggaaaga cataaaggag agagccctga ttgatatgta tatagaaggt
361 atagcagatt tgggtgaaat gatcctcctt ctgcccgtat gtccacctga ggaaaaagat
421 gccaagcttg ccttgatcaa ggagaaaata aaaaatcgct acttccctgc ctttgaaaaa
481 gtcttaaaga gccatggaca agactacctt gttggcaaca agctgagccg ggctgacatt
541 catctggtgg aacttctcta ctacgtcgag gagcttgact ccagtcttat ctccagcttc
601 cctctgctga aggccctgaa aaccagaatc agcaacctgc ccacagtgaa gaagtttcta
661 cagcctggca gcccaaggaa gcctcccatg gatgagaaat ctttagaaga agcaaggaag
721 attttcaggt tttaataacg cagtcatgga ggccaagaac ttgcaatacc aatgttctaa
781 agttttgcaa caataaagta ctttacct
//
LOCUS MUSIGKACY 321 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse lysozyme-binding Ig kappa chain (HyHEL-10) V23-J2 region
mRNA, partial cds.
ACCESSION M35667
KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
variable region VK23.
SOURCE Mouse hybridoma, cDNA to mRNA, clone 10K-106.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Mainhart,Smith-Gill-S.J., Lavoie,C., Feldman,T.B., Drohan,R.J. and
Brooks,W.B.R.
TITLE A three-dimensional model of an anti-lysozyme antibody
JOURNAL J. Mol. Biol. 194, 713-724 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 321 lysozyme binding Ig kappa chain V23-J2 region
(AA at 1)
recomb 285 286 V23 region end/J2 region start
BASE COUNT 88 a 80 c 75 g 78 t
ORIGIN
1 gatattgtgc taactcagtc tccagccacc ctgtctgtga ctccaggaaa tagcgtcagt
61 ctttcctgca gggccagcca aagtattggc aacaacctac actggtatca acaaaaatca
121 catgagtctc caaggcttct catcaagtat gcttcccagt ccatctctgg gatcccctcc
181 aggttcagtg gcagtggatc agggacagat ttcactctca gtatcaacag tgtggagact
241 gaagattttg gaatgtattt ctgtcaacag agtaacagct ggccgtacac gttcggaggg
301 gggaccaagc tggaaataaa a
//
LOCUS MUSLTAGBSA 237 bp ds-DNA ROD 01-AUG-1990
DEFINITION Mouse SV40 transformed large T-antigen binding site DNA.
ACCESSION M35500
KEYWORDS large T antigen.
SOURCE Mouse (strain BALB/c) SV40 transformed cell line SVA31E7 DNA, clone
p27.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 237)
AUTHORS Lane,D.P., Simanis,V., Bartsch,R., Yewdell,J., Gannon,J. and
Mole,S.
TITLE Cellular targets for SV40 large T-antigen
JOURNAL Proc. R. Soc. Lond., B, Biol. Sci. 226, 25-42 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
binding 84 118 large T-antigen binding site
BASE COUNT 67 a 65 c 60 g 45 t
ORIGIN
1 ggatccatcc cataatcagc ctctaaacgc tgacaccatt gcatacacta gcaagatttt
61 gctgaaagaa ccctgatata gctgtctctt gtgaggctat gccggggcct agcaaacaca
121 gaagtggatg ctcacagtca gctagtggat cacagggccc ccaatggagg agctagagaa
181 agtacccaag gagctaaagg gatcctctac gccggacgca tcgtggccag tcaccgc
//
LOCUS PEAIVSS 350 bp ds-DNA PLN 01-AUG-1990
DEFINITION Pea legumin J gene, exons 1 and 2 (partial).
ACCESSION M26771
KEYWORDS legumin.
SOURCE Pea DNA, clone pSP65LegJi.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 350)
AUTHORS Brown,J.W.S., Feix,G. and Frendewey,D.
TITLE Accurate in vitro splicing of two pre-mRNA plant introns in a HeLa
cell nuclear extract
JOURNAL EMBO J. 5, 2749-2758 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 48 legumin J, exon 1 (AA at 3)
50 144 legumin J, exon 2
pre-msg < 1 > 350 legumin J mRNA and introns
IVS 49 186 legumin intron
BASE COUNT 117 a 74 c 71 g 88 t
ORIGIN
1 gaatacacgg aattcgagct cgcccgggga tcccattcaa ccccaagagt aagtaatagt
61 gtatccatac attacattat ctcttataaa ttgttcatac agcatgctca ttcgattata
121 actttaaaag tttctaatgt ataatttgtt atactaaatc aatcacacgt aaatatgtgt
181 atgcaggtat tttaccttgg tgggaaccca gaaacagagt tccccgaaac acaggaggaa
241 caacaaggaa ggcatcggca aaagcatagt taccctgttg gacgtaggag tggacatcac
301 caacaagaag aggaatggga tcctctagag tcgacctgca gcccaagctt
//
LOCUS RATCGM1AA 3190 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) mRNA, complete
cds.
ACCESSION M32474 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SOURCE R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation,
cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3190)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analysis imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept 122 2251 carcinoembryonic antigen-related protein
precursor (CGM1)
sigp 122 220 carcinoembryonic antigen-related protein
signal peptide
matp 221 2248 CGM1 protein
BASE COUNT 871 a 798 c 693 g 828 t
ORIGIN
1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta
61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca
121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct
181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc
241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga
301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact
361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt
421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac
481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt
541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc
601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga
661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc
721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac
781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt
901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc
961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga
1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc
1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac
1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt
1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc
1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct
1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta
1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta
1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact
1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac
1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt
1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc
1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag
1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc
1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa
1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca
1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg
2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct
2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag
2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct
2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga
2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg
2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca
2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac
2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc
2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct
2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt
2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag
2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa
2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat
2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct
2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag
2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct
3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt
3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag
3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt
3181 ttctccaaat
//
LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
1 and 2.
ACCESSION M32476 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 1 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2238)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene and analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept 704 767 carcinoembryonic antigen-related protein
(CGM1) precursor, exon 1
1770 + 2129 carcinoembryonic antigen-related protein, exon
2
sigp 704 767 carcinoembryonic antigen-related protein
(CGM1) signal peptide
1770 1804 carcinoembryonic antigen-related protein
signal peptide
matp 1805 + 2129 carcinoembryonic antigen-related protein
pre-msg 512 > 2238 CGM1 mRNA and introns
IVS 768 1769 CGM1 intron A
IVS 2130 > 2238 CGM1 intron B
BASE COUNT 615 a 532 c 539 g 552 t
ORIGIN
1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
2221 tgcattgatc tacatgtt
//
LOCUS RATCGM1AC2 539 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32477 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 2 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 539)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 539 carcinoembryonic antigen-related protein
intron B
BASE COUNT 126 a 127 c 138 g 148 t
ORIGIN
1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS RATCGM1AC3 828 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron B.
ACCESSION M32478 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 3 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 828)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 828 carcinoembryonic antigen-related protein
intron B
BASE COUNT 193 a 255 c 190 g 190 t
ORIGIN
1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS RATCGM1AC4 642 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 3.
ACCESSION M32479 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 4 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 642)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 61 + 420 carcinoembryonic antigen-related protein
(CGM1), exon 3
matp + 61 + 420 carcinoembryonic antigen-related protein
pre-msg < 1 > 642 CGM1 mRNA and introns
IVS < 1 60 CGM1 intron B
IVS 421 > 642 CGM1 intron C
BASE COUNT 151 a 160 c 154 g 177 t
ORIGIN
1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS RATCGM1AC5 616 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 4.
ACCESSION M32480 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 5 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 616)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 90 + 449 carcinoembryonic antigen-related protein
(CGM1), exon 4
matp + 90 + 449 carcinoembryonic antigen-related protein
pre-msg < 1 > 616 CGM1 mRNA and introns
IVS < 1 89 CGM1 intron C
IVS 450 616 CGM1 intron D
BASE COUNT 152 a 153 c 147 g 164 t
ORIGIN
1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
601 ttgtctgatt cccttc
//
LOCUS RATCGM1AC6 654 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
intron 4.
ACCESSION M32481 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 6 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 654)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
IVS < 1 > 654 carcinoembryonic antigen-related protein
intron D
BASE COUNT 185 a 146 c 155 g 168 t
ORIGIN
1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS RATCGM1AC7 492 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 5.
ACCESSION M32482 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 7 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 492)
AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 118 + 471 carcinoembryonic antigen-related protein
(CGM1), exon 5
matp + 118 + 471 carcinoembryonic antigen-related protein
pre-msg < 1 > 492 CGM1 mRNA and introns
IVS < 1 117 CGM1 intron D
IVS 472 492 CGM1 intron E
BASE COUNT 134 a 134 c 105 g 119 t
ORIGIN
1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
481 tctcaaggtc tc
//
LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene,
exon 6.
ACCESSION M32483 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SEGMENT 8 of 8
SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone
lambda-rnCGM1-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1341)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept + 640 / 987 carcinoembryonic antigen-related protein
(CGM1), exon 6
matp + 640 / 987 carcinoembryonic antigen-related protein
pre-msg < 1 > 1341 CGM1 mRNA and introns
IVS < 1 639 CGM1 intron E
IVS 988 1341 CGM1 intron F
BASE COUNT 357 a 329 c 321 g 334 t
ORIGIN
1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
1321 atccgttgcc tgacggtcga c
//
LOCUS RATCGM4AA 4627 bp ds-DNA ROD 01-AUG-1990
DEFINITION Rat carcinoembryonic antigen-related protein (CGM4) gene, exons 2
and 3.
ACCESSION M32475 J05417
KEYWORDS carcinoembryonic antigen-related protein.
SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM415-1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (sites for [2])
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
TITLE cDNA and gene analyses imply a novel structure for a rat
carcinoembryonic antigen-related protein
JOURNAL J. Biol. Chem. 265, 7872-7879 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 4627)
AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
W.Zimmermann, 02-MAR-1990.
FEATURES from to/span description
pept / 145 489 carcinoembryonic antigen-related protein
(CGM4), exon 2 (AA at 147)
3379 / 3738 carcinoembryonic antigen-related protein exon
3
pre-msg < 1 > 4627 CGM4 mRNA and introns
IVS < 1 144 CGM4 intron B
IVS 490 3378 CGM4 intron C
IVS 3739 > 4627 CGM4 intron D
BASE COUNT 1213 a 1165 c 1162 g 1087 t
ORIGIN
1 agatctgggt cgacctgcag gtcaacggat ctgggcctta gcaggagtgt gggcagagct
61 ctgggaaggc agaagtgtga ttttttaaaa aaccaacaga tttcacctgc tcaatatcga
121 tggttgctct gtcttccctt ttagcctccc ttctaacctg ttggctcctg actactgccc
181 aggtcaacat tgaatcggtg ccattcaatg tggttgaagg ggaaaacgtc cttcttcttg
241 tccacaatct gccagagaat ctcatagcct ttgcctggta tagagggctg aggaaaattg
301 gagtatacat actgaacact gaagtaagtg tgacggggcc aatgtacagc ggtagagaga
361 cagtgtacag caatggttcc ctgtgtatcc gcaatgtcac ccagaaggac acaggattct
421 acactctacg aacagtcaac acacgtggag aaactgtatc aacaacatcc ttgtacctct
481 atgtgtacag taagtgatac tttgtgaact ctgggtgttg tgtggggttc attccgtaga
541 cacacacaga agaggcaggc ctacctaccc tttgcattgt gtctccttat tgaggtgtga
601 acatttaact caggctaagg agagtaatgc caattgaata gaatccttct tttgacttta
661 ccttgtagtc agctggatgt gtggttaact cagtgaagga catcagccct tgtctagact
721 tctggggttc ttagcagtaa tgtgtccttg ggaaagacct tgagggaagg agattgggtt
781 tgaatgagat agccatagga tcctcatgga agtgagaacc agaaagccct ggctccagac
841 ctctgtcctg actcatctcc tgatggcccc gagaagcatt ttacaaaggc tggattctga
901 catctgttgg cagggaacag tgcttttgag gagcaaatcc ttgtgccaca tacaatcacc
961 tggtgcacgg ccatgagagc cacagttagg cgaggtctcc tggatctctc cagtgactca
1021 tcagggagag aatagaaaga cagatgtccc ggccactaag ttaactgtta tgatggcctt
1081 atgagacttc caggaaggtc atggttgcca ggaagaggga caaaggacac agatccccct
1141 gacagttgct tgtcctttgg ggtccagctc atagaagtct gtccgcaggc aaatgacacc
1201 aggctctgct gatgtggata gctccccaga tctgagctgc agttctccca gcgatcacga
1261 gggccgcctc agggaaacac aattaacacc cagaagagta tttgtctaaa ccaggaactt
1321 acctcctcct ctggctagct cccctgttcc tacagacatg ggggtcacac agccttctca
1381 gacctaccag ctgcctcctt ttctgctgcc ttgctaggga attatgtgta gtggctgctt
1441 tgtgtatttt ctttggaaaa gatagagtat cctaagggaa tcacccagac agaggttcaa
1501 ggcatctctg aaaggccagg cagcacatgg cagagccacc tcacagctca ggacccagag
1561 gaagtgtgcc caccatcttg aatccatgca tgggacgatg gagcccagag ctacgttcca
1621 ggactcaggt cacctcccac acactcaaga agtgaggctc ctgacacagc tgctcctggg
1681 ccccttttct ccctgagaat cctgactggt ggctgcagtg agaacacatc tgtcccctcc
1741 cccactcgtc acacagctgg cccttgggat cctcacacac atctctgtct ccttcctcct
1801 gagagcaaac tacctctttg acgggcactg agaacacagg gcagactggg tgcccagctg
1861 gttctgggtc acccagggag tgcagaggct cactcactgg tgctgactga gccaggaaga
1921 ggccagaaca gagggatgcc ccccgggtga gctgctgtct tcttagggca cagagatgct
1981 cagaggtttg tttgtcactg tgagctctgt ggcatgagac agaaagagcc cagaggagag
2041 gttaggtgtg taggactgag tgtgcacagg gcagagaaca gagttaccca cagcccacgg
2101 gactctggga tatgatcctg tctggcggag gctgagctca gaggatcaga gaacttggga
2161 gctgtattgg agcagatgtg ctacagactg aggacagatc tggccacaga gaccagggcg
2221 gtgctctgta ccatctgcaa acaatgcccc acctgttggt gctcctgctc acagatgagg
2281 agaccacatt ttacagtgtg tgagaggaga ggactcacct actgtctaaa gtctcttcaa
2341 ggggacaggg actggagaag agtttcaggt ttgtagggct gaaaacacta aagtataggg
2401 gctcatcatc atcatcatca ccaccgccat caccaccacc accaccacca ccaccaccac
2461 caccaccacc accaccacca ccatcatcat catcatcatg aggctcttgg taaataagaa
2521 gaagcagggg gaggaggaga ttattgtcaa cccacagttc accatcaatg agcccagtgt
2581 tctgaagact gaggttctca gctgtgatgc cccaaataag aaaccaagct ggtgttgatc
2641 agtgacatgg ctcagtggat ctgggtgttt gcttcatgtc tgacaacctg agaaccagtg
2701 aacacaagtt gtccctgacc tccacctagg gacggcgttt tgcacccaac acagacacac
2761 tgaggcatgc ccttgcacat gaactcatac accaatataa taagcaaatg cataaaaatt
2821 atagcaaatg gaagcagtca acactgtatt cccaaacata ctaatttgtt aaataaatcc
2881 atggccatgt attcattcat tcattcattc actcattcat ttactctcca agatatttga
2941 gttttctttt gcagtctttt ttttttaaaa gataatataa gacaaatccc agttctcatt
3001 attccctagc cctagactgg aagacgacca gtgaagaaag ctagaaggcg aatcagtcac
3061 taaaggacaa gaaacaaaag agtcagagtg tgacggtcgg gaggcttcac cccaacaccc
3121 atcgactgac actgagggtg agcagggatc tgaggacggt gaggcagggc catgttgaca
3181 cctgaggaga gagcagcata gagaggaaat gacaagtgag gggcgcggag tgcatggagg
3241 taatgcactg acctccacta gctagggcag ggagactccc acacctcagc tgaccactgg
3301 acacagctgc tcggactcag gcaccatctt agccaaatac taaagtcctg atgttgacgg
3361 atctctcttc ccttctagcc tctcttttca tctgtgggcg tccttttaac cctgccaagc
3421 tcactattga atcagtgccg cccagtgttg ctgaaggggg aagcgttctt ctcctcgttc
3481 acaatctcca ggacgagctt cgagggtttt tctggtacaa aggggcgtct atgtctagca
3541 accatgagat agcccgatac agaacagcaa agaattcaag tgtgccaggc cctgcccaca
3601 gtggtagaga gacggtgtac agcaatggat ccctcctgct ccagaatgtc acccggaatg
3661 acactgggtt ctacacccta cgcactctga aaagacatca gaaaatggaa ttggcacacg
3721 tgcaacttca ggtggacagt aagtgatttt ccgtgatcgt tcagtgctgg gtgggtcttt
3781 gacacacagg actgtcaccc ctggcatgtg gctacctcct ctctgccttt ttatccccat
3841 gttgtggtta accactatgt gcaggacaca tgtgatggaa agaaatgccc atgggtcaga
3901 cttatcatct gactctcccc tgtatcaagg acagtaactc aaccctaggt gctagactct
3961 gcccagtcat ctggggcatc ttgccatgca acgtgaggaa accatggatc ctcacagcgt
4021 ggtgagcacc aggaagctct gatctcagtc gtttgtccca gacttgactg caaatgtctc
4081 taggagcatt ttgtcaggag tgctgcttac tgcctctctc ctcacagcct gccatcctga
4141 tcttatagta acccaggaca ctgagcccag gggtgaaaat gctcccagtt gggctgggct
4201 ctcccacatc aatcaccaat taaaaatgta ctacaggtta gcccacaggt tattttggtg
4261 gtggcatttt aaattgaggc ccttgtttca aaaaattcta gcttgtgtta agttgacata
4321 aagccagcag cacgattcct gagccctccc caatacctat atctgccaag aagaccagac
4381 tgttcccacc catcatccgg ccttagtcct gggtgctata ggctgggacg tgagaacatg
4441 tggaatgtga agtctgagga tgaccgcagg tacaaaggag atgagaaagt cagagagtgt
4501 gtatccaggg tgtgtagaga ccaaaggtca ggggaggcat catcccaaag cacagtgtgc
4561 atgagtatgt gcaatgtctg aatgagggca gtgagggaca gccacggaga caccaaggac
4621 agagctc
//
LOCUS STMRGDA 2540 bp ds-DNA BCT 01-AUG-1990
DEFINITION S.coelicolor 16S rRNA gene and 23S rRNA, 5' end (rrnD) gene
cluster.
ACCESSION Y00411 M35377
KEYWORDS 16S ribosomal RNA; 23S ribosomal RNA.
SOURCE S.coelicolor (strain 1147 A3(2)) DNA, clone RSC33.
ORGANISM Streptomyces coelicolor
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 705 to 2230)
AUTHORS Baylis,H.A. and Bibb,M.J.
TITLE The nucleotide sequence of a 16S rRNA gene from Streptomyces
coelicolor A3(2)
JOURNAL Nucleic Acids Res. 15, 7176-7176 (1987)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 771 and 2196 to 2540)
AUTHORS Baylis,H.A. and Bibb,M.J.
TITLE Transcriptional analysis of the 16S rRNA gene of the rrnD gene set
of Streptomyces coelicolor A3(2)
JOURNAL Mol. Microbiol. 2, 569-579 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 144 ORF (AA at 1)
rRNA 536 > 2231 16S rRNA gene
rRNA 537 > 2231 16S rRNA gene
rRNA 704 2231 16S rRNA
rRNA 2507 > 2540 pot. 23S rRNA
BASE COUNT 573 a 654 c 834 g 478 t 1 others
ORIGIN
1 tgggcccgca tcaccatcgg cgtcctcgcc gagctggcct tcctggccta cgtctacgtt
61 ctgggcggcc gagccgtgcg cgacggcgag acgggtgacg tcgaggcagc cgaacgcagc
121 gccacggtgc caacagccgc ctgatgtgca tccacccctg cgagctgcta gtgtcctctt
181 cgttcccgca agagccgttg acacggagcg agcggggagg tagattcgaa cagttgcctg
241 gagacgggtt caccccagag ggcaacagtg aacatctacc agcttctccg aatcaacgaa
301 ttcgacgaag cactctcccg atgaatcgga aacgaaggcc ggtaagaccg gctcgaaagt
361 tctgataaag tcggagccgc cggaaaggga aacgcgaaag cgggaacctg gaaagcgccg
421 aggaaatcgg atcggaaaga tctgatagag tcggaaacgc aagaccgaag ggaagcgccc
481 ggaggaaagc ccgagagggt gagtacaaag gaagcgtgcc gttccttgag aactcaacag
541 cgtgccaaaa gtcaacgcca gatatgttga taccccgacc tgatcggatc tccgttcggg
601 ttgaggttcc tttgaagtaa cacaacagcg aggacgctgt gaacggtcgg attattcctc
661 cgactgttcc gctctcgtgg tgtcacccga ttacgggtat acattcacgg agagtttgat
721 cctggctcag gacgaacgct ggcggcgtgc ttaacacatg caagtcgaac gatgaaccac
781 ttcggtgggg attagtggcg aacgggtgag taacacgtgg gcaatctgcc cttcactctg
841 ggacaagccc tggaaacggg gtctaatacc ggatactgac cctcgcaggc atctgcgagg
901 ttcgaaagct ccggcggtga aggatgagcc cgcggcctat cagcttgttg gtgaggtaat
961 ggctcaccaa ggcgacgacg ggtagccggc ctgagagggc gaccggccac actgggactg
1021 agacacggcc cagactccta cgggaggcag cagtggggaa tgttgcacaa tgggcgaaag
1081 cctgatgcag cgacgccgcg tgagggatga cggccttcgg gttgtaaacc tctttcagca
1141 gggaagaagc gaaagtgacg gtacctgcag aagaagcgcc ggctaactac gtgccagcag
1201 ccgcggtaat acgtagggcg caagcgttgt ccggaattat tgggcgtaaa gagctcgtag
1261 gcggcttgtc acgtcggttg tgaaagcccg gggcttaacc ccgccactgc agtcgatacg
1321 ggcaggctag agttcggtag gggagatcgg aattcctggt gtagcggtga aatgcgcaga
1381 tatcaggagg aacaccggtg gcgaaggcgg atctctgggc cgatactgac gctgaggagc
1441 gaaagngtgg ggagcgaaca ggattagata ccctggtagt ccacgccgta aacggtgggc
1501 actaggtgtg ggcaacattc cacgttgtcc gtgccgcagc taacgcatta agtgccccgc
1561 ctggggagta cggccgcaag gctaaaactc aaaggaattg acgggggccc gcacaagcgg
1621 cggagcatgt ggcttaattc gacgcaacgc gaagaacctt accaaggctt gacatacacc
1681 ggaaagcatc agagatggtg ccccccttgt ggtcggtgta caggtggtgc atggctgtcg
1741 tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ttgtcccgtg
1801 ttgccagcaa gccttcgggg tgttggggac tcacgggaga ccgccgggtc aactcggagg
1861 aaggtgggga cgacgtcaag tcatcatgcc ccttatgtct tgggctgcac acgtgctaca
1921 atggccggta caatgagctg cgataccgca aggtggagcg aatctcaaaa agccggtctc
1981 agttcggatt ggggtctgca actcgacccc atgaagtcgg agtcgctagt aatcgcagat
2041 cagcattgct gcggtgaata cgttcccggg ccttgtacac accgcccgtc acgtcacgaa
2101 agtcggtaac acccgaagcc ggtggcccaa ccccttgtgg gagggagctg tcgaaggtgg
2161 gactggcgat tgggacgaag tcgtaacaag gtagccgtac cggaaggtgc ggctggatca
2221 cctcctttct aaggagcaca tagccgactg cagcgaaatg tcctgcacgg ttgctcatgg
2281 gtggaacgtt gactactcgg cacggtcttc ttgatggatc actagtactg cttcggcgtg
2341 gaacgtgact tcaaagaggg gttcgtgtcg ggcacgctgt tgggtatctg agggtacggc
2401 cgtgaggtcg ccttcagttg ccggccccgg taaaaatccg cgtgagtggg ttgtgacggg
2461 tggttggtcg ttgtttgaga actgcacagt ggacgcgagc atctgtggcc aagtttttaa
2521 gggcgcacgg tggatgcctt
//
LOCUS SUSCYIIAA 230 bp ds-DNA INV 01-AUG-1990
DEFINITION S.purpuratus cytoskeletal actin CyIIa gene, complete cds.
ACCESSION M35321 M35322
KEYWORDS cytoskeletal actin SpG11A.
SOURCE S.purpuratus DNA, clone pSpG11A.
ORGANISM Strongylocentrotus purpuratus
Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa;
Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE 1 (bases 1 to 230)
AUTHORS Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M.
TITLE DNA sequence analysis and structural relationships among the
cytoskeletal actin genes of the sea urchin Strongylocentrotus
purpuratus
JOURNAL J. Mol. Evol. 28, 72-86 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 219 > 230 actin CyIIa
BASE COUNT 74 a 43 c 33 g 80 t
ORIGIN
1 ttcgaattgt cactcattct tcaaataaag attgtgagat cacgcgtttt ctgtacccta
61 ccctacaaat acgtaggaca cctgggtatg tagtgaacct taaagtttat aaatgatgtt
121 cttgtttgtc catcaattta accgggaaaa aaatttatct gtctaatatc attatctatt
181 ttcacacttt tagatcaaac tagattaaac aaatcatcat gtgtgacgac
//
LOCUS SUSCYIIBA 1972 bp ds-DNA INV 01-AUG-1990
DEFINITION S.purpuratus cytoskeletal actin CyIIb gene, complete cds.
ACCESSION M35323
KEYWORDS cytoskeletal actin CyIIb.
SOURCE S.purpuratus DNA, clone pSpG11A.
ORGANISM Strongylocentrotus purpuratus
Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa;
Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE 1 (bases 1 to 1972)
AUTHORS Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M.
TITLE DNA sequence analysis and structural relationships among the
cytoskeletal actin genes of the sea urchin Strongylocentrotus
purpuratus
JOURNAL J. Mol. Evol. 28, 72-86 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 251 616 cytoskeletal actin CyIIb, exon 1
841 1087 cytoskeletal actin CyIIb, exon 2
1312 1829 cytoskeletal actin CyIIb, exon 2
pre-msg 1 1972 CyIIb mRNA and introns
IVS 617 840 CyIIb intron A
IVS 1088 1311 CyIIb intron B
BASE COUNT 529 a 492 c 408 g 543 t
ORIGIN
1 tcggcagttc aagaccacgt gtgtttcccg gattggtaaa ctccttatca cgaactcctt
61 atcagtaaaa cttacgagct ttgtacactt ttaatgactt ttcgattatt ctttcaagag
121 attttccctg ccacaaaatt acttagttct tttatttctc attcctgtgc aattccaatt
181 actagcattt tatttatgat ccatttttgt gtttttattt tagagtaaat aaaacgagaa
241 atcaatcatc atgtgtgacg acgatgttgc cgctcttgtc atcgacaacg gatccggtat
301 ggtgaaggcc ggattcgccg gagacgatgc cccaagggct gtcttcccat ccatcgttgg
361 cagaccccgt caccagggtg tcatggtcgg catgggacag aaggacagct acgtcggaga
421 cgaggcccag agcaagagag gtatcctcac cctgaagtac cccatcgagc acggtatcgt
481 caccaactgg gacgatatgg agaagatctg gcatcacacc ttctacaacg aactccgtgt
541 tgccccggag gagcaccccg tcctccttac cgaggctccc ctcaacccca aggccaacag
601 ggaaaagatg acacaggtta gaaaaagcaa tatgcctatt attgaagtaa tcaaattctc
661 aaaacaaata cattctcaca tttaaacatc ttaatttaag ctgtttatta atattaatat
721 caagtgagtt tcgttgttga aataacagcg attgactaaa atgaacttgt atcaaacttg
781 ttgtgattag tgaaatgaaa tcggtgatta acaattgttt tgttttcatg tcttctgcag
841 atcatgttcg agaccttcaa ctcacccgcc atgtacgtcg ctatccaggc cgtgctttcc
901 ctctacgcct ctggtcgtac cactggtatc gttttcgact ctggtgatgg tgtttcacac
961 acagtgccca tctacgaggg ttatgccctt ccccacgcca tcctccgtct ggacttggct
1021 ggacgtgatc tcacagacta cctgatgaag atccttaccg agcgtggcta ctctttcacc
1081 accaccggta agatatcttt tttttacaat caaagagtga gtgaagctat cacctgcatc
1141 ctgtgcttaa agaatattaa aaaaagagga gggaagatat tatatatgat taatgttcat
1201 tttctttgga ctttgacaat aacattttgg ggggatagaa agtgaatgtt gcttttcgtt
1261 atacattcgt aactaactaa tttcatcttg tttttttttt ctatcttgca gctgagcgtg
1321 aaatcgttcg tgacatcaag gagaagctct gctacgttgc tcttgacttt gagcaagaga
1381 tgcagactgc tgcctcatcc tcctccctcg agaagagcta cgagcttccc gacggacagg
1441 tcatcaccat tggcaacgag cgattccgtg ccccagaggc cctcttccag ccagccttcc
1501 ttggaatgga atccgctgga atccacgaga cctgctacaa cagcatcatg aagtgcgatg
1561 ttgacatccg taaggatctg tacgccaaca ctgttctgtc tggaggctcc accatgttcc
1621 caggaatcgc cgacaggatg cagaaggaga tcaccgccct tgccccacca accatgaaga
1681 tcaagatcat tgctcctcca gaaaggaaat actccgtatg gatcggaggc tccatccttg
1741 cctctctctc caccttccaa cagatgtgga tcagcaagca ggaatacgat gagtccggcc
1801 catccatcgt ccacaggaag tgcttctaaa caactcgctt ttggtgaaca aactcttgaa
1861 catcaatatc aaggaaacga ccatgatctc aaattgcaaa gtttaagtat gacaccattg
1921 cgggcaatgc agccgaaaaa ctcgcgcttt ctcaaaactt ggaggactgc ag
//
LOCUS SUSCYIIIBA 2918 bp ds-DNA INV 01-AUG-1990
DEFINITION S.purpuratus cytoskeletal actin CyIIIb gene, complete cds.
ACCESSION M35324
KEYWORDS cytoskeletal actin CyIIIb.
SOURCE S.purpuratus DNA, clone pSpG11A.
ORGANISM Strongylocentrotus purpuratus
Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa;
Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE 1 (bases 1 to 2918)
AUTHORS Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M.
TITLE DNA sequence analysis and structural relationships among the
cytoskeletal actin genes of the sea urchin Strongylocentrotus
purpuratus
JOURNAL J. Mol. Evol. 28, 72-86 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 371 736 cytoskeletal actin CyIIIb, exon 1
1634 1880 cytoskeletal actin CyIIIb, exon 2
2247 2764 cytoskeletal actin CyIIIb, exon 2
pre-msg 1 1972 CyIIIb mRNA and introns
IVS 737 1633 CyIIIb intron A
IVS 1881 2246 CyIIIb intron B
BASE COUNT 850 a 668 c 559 g 841 t
ORIGIN
1 acggttcggg catttaggga tagctttgat tttaagaatg ttaaaatgag aatgtcaaat
61 agcctaacgc tggtctgtgc cagtaaacat gaatcaattc caaatgttga tatattaata
121 gtcggggagt tcaaatagga caacatgttt cacggggata gaattatcag acataattat
181 aatcccactt tgtcgtgaat tttgttggtt gtatgaaagt tttttagacc gtttgaaagg
241 aaaacagacc tatgccaaat ccaccaccac gaattaacta gtctgcaaac aaagaaacta
301 aaattaatat ttctctgggt atgtttttct catattcagg acaggaaaac gaaattcaat
361 catcatgtgt atgtgtgacg atgatgttgc cgctcttgtc gtcgacaacg ggtccggaat
421 ggtgaaggcc ggattcgccg gagacgatgc cccaagggct gtctttccat ccatcgttgg
481 caggccccgt caccagggtg tcatggttgg tatgggacaa aaggacagct acgttggaga
541 cgaagcacag agcaagagag gtatcctcac cctgaagtac cctattgagc acggtatcgt
601 caccaactgg gacgatatgg agaagatctg gcatcacacc ttctacaacg agctccgtgt
661 tgccccagag gagcaccccg tccttctgac agaggccccc ctcaacccta aggccaacag
721 ggaaaagatg acacaggtaa ggatatagtg cggaattgca aaacattcct taaagatact
781 atgtctcttt tgcacccaac atcagattct gtagaacttt gcaggaacta taattatgac
841 ttgtcatgta tgtcctatct atgaaatcta aacattagca atgtcgtatt attcgaatta
901 tgcaaggaaa cccgtttatc ttctagactt cactgtcaga cttactgaca tctatttttc
961 tttattgtaa taacatacat acatttagct ttaacaggta catgagcatt tgtctacatc
1021 aataacccac tatttgtgac ggccaaaatt aaactgattg aatatttgta cagcacaaaa
1081 cgtacgacca atcggtgaaa gggtgtgaaa atgaaactat tacttaggtg atcgcaatta
1141 cttaactcga ttcgataact aatggtaaca tgtagttatt ttcccactaa aagccctttt
1201 taatcctttc gtttcgaagg aacttctaac ttagtttttt tccttcaaat gcagttggaa
1261 tttaatcttt tcattgttgg cctgcaaatg ggacatacag tagtaccttt aactgcattt
1321 tggcaggaat gaaatgaaca acggctacag atagcccacg tcaccaatag cctacataag
1381 cgaagaaaac tagtcggata cccccacacg accgacatat cgctctccct gaccaatcta
1441 aaatatcgtt tttctttttt aaagtccata aaatgctatg aaaacctttt cgtttcttta
1501 ctgcagtgaa aataaaagct gatacggact acgagtacaa aatcgcgaac attcagataa
1561 aaaagttgaa tttgcccagt ttataatccc tagagtttat tcttaattca aaaaaatatt
1621 cttcttttgt tagatcatgt ttgagacctt caactcgccc gccatgtacg tcgccatcca
1681 ggccgtgctt tccctctacg cctctggtcg taccactggt atcgttttcg actctggcga
1741 cggtgtttca cacactgtac caatctatga gggttacgcc ctcccccacg ccatcatccg
1801 tctggacttg gctggacgtg atcttaccga ttacctgatg aagatcctta ccgagcgtgg
1861 ctactctttc accaccactg gtaagacatg atatggataa tagcaatagc taatgatgat
1921 aattaaaata gggataattg ataatattag aatactaatg taaacagatg aatgtcttac
1981 caaagggcag tctgtctcgg gttttgaatt caaaaacctc acatctcgtt atctttaagc
2041 cgcagaccac aacacctgca tgttcatttt tttttttact gcttgttcaa atccttttga
2101 caaagcgaat atctgattag atcgataata attaataaca aataccctct aagtcccgga
2161 gtttcaacac atttccattg ttatcttcac attttacaat ttgtctgcaa ttgatatgtg
2221 actgcatcca ttattatctc ttacagctga gcgtgaaatc gtccgtgaca taaaggagaa
2281 gctctgctac gtagctcttg attttgagga ggagatgcaa actgctgcct catcctcctc
2341 cctcgagaag agctacgagc ttcccgacgg acaggtcatc accatcggca acgagcgatt
2401 tcgttgctca gagaccctct tacagccctc tttcattgga atggaatctg ctggaatcca
2461 tgagacctgt tataacagca tcatgaagtg cgatgttgac atccgtaagg atctatacgc
2521 caacaccgtt ctctccggag cttccaccat gttcccagga atcgctgaca ggatgcagaa
2581 agagattgtc gcccttgccc caccaaccat gaagatcaag atcatcgctc ctcctgagag
2641 gaaatactct gtatggatcg gaggctccat tcttgcctct ctctccacct tccaacagat
2701 gtggatcagc aagcaggaat acgatgagtc tggtccatcc atcgtccaca ggaagtgctt
2761 ctaaacaacc ttccaacaga tttggatcag caagcaggaa tacaatgagt ccggtccatc
2821 catcgtccaa gggaagtgct tctaaacaac ttgattttct tctacttcta atgagcaacc
2881 tgattttttt aattctgttt cactccatgt tgccacct
//
LOCUS WHTIVSS 310 bp ds-DNA PLN 01-AUG-1990
DEFINITION Wheat amylase gene, exons 2 and 3 (partial).
ACCESSION M26770
KEYWORDS .
SOURCE Wheat DNA, clone pSP64Amyi.
ORGANISM Triticum aestivum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 310)
AUTHORS Brown,J.W.S., Feix,G. and Frendewey,D.
TITLE Accurate in vitro splicing of two pre-mRNA plant introns in a HeLa
cell nuclear extract
JOURNAL EMBO J. 5, 2749-2758 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 109 amylase, exon 2 (AA at 1)
214 > 310 amylase, exon 3
pre-msg < 1 > 310 amylase mRNA and intron
IVS 110 213 amylase intron 2
BASE COUNT 79 a 91 c 76 g 64 t
ORIGIN
1 gaatacaagc ttgggctgca ggtcgacgca gaggctgtgg ccattcccct cggacaaggt
61 catgcagggc tacgcctaca tcctcacaca cccgggcata ccatgcatcg taagtagtag
121 cacactacac aacctcacca taacatttcg catcaaacgt accccacgat gtttgtgatc
181 tgaacttaca actacttggt tttgcgcgcg cagttctacg accatgtgtt cgactggaaa
241 ctgaagcagg agatcaccgc actggctacg gtcaggtcaa ggaacgggat ccccgggcga
301 gctcgaattc
//
LOCUS YSCMTARSA 384 bp ds-DNA ORG 01-AUG-1990
DEFINITION Yeast (S.cerevisiae) mitochondrial autonomously replicating
sequence DNA.
ACCESSION M35612
KEYWORDS .
SOURCE S.cerevisiae (strain 992) mitochondrial DNA, clone pYmit1021.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 384)
AUTHORS Mabuchi,T., Nishikawa,S. and Wakabayashi,K.
TITLE The nucleotide sequence of mitochondrial ARS in Saccharomyces
cerevisiae
JOURNAL J. Gen. Appl. Microbiol. 30, 469-478 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
site 46 56 consensus autonomously replicating sequence
site 126 136 consensus autonomously replicating sequence
site 245 255 consensus autonomously replicating sequence
site 290 300 consensus autonomously replicating sequence
site 148 156 ori/rep GC cluster A
site 187 194 ori/rep GC cluster A
BASE COUNT 126 a 29 c 34 g 195 t
ORIGIN
1 ccgccgcggg cggacgccgg aggagaatta tatttttata taataattta tatttctata
61 tatatatata tatattatat ataaatatta ttatatatat ttttatatat attataatta
121 tattcattaa tattttatta tagtggtggg ggtcccaatt attattttca ataataattt
181 atcatgggac ccggatatct tcttgttttt atttattatt ttttttaatt tattttaatt
241 atttatttat aatttatatt atacaattta ttatttcgtt aataccttta tttatattat
301 ataatatatt atattattat aatatattta ttgattatat taatacattt aactaatgtg
361 tgctctatat ttattgaata gttt
//
LOCUS YSCMTARSB 218 bp ds-DNA ORG 01-AUG-1990
DEFINITION Yeast (S.cerevisiae) mitochondrial Ser-tRNA, 3' end in and
autonomously replicating sequence.
ACCESSION M35613
KEYWORDS transfer RNA-Ser.
SOURCE S.cerevisiae (strain 992) mitochondrial DNA, clone
pYmit1S2SC-delta-11.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 218)
AUTHORS Mabuchi,T., Nishikawa,S. and Wakabayashi,K.
TITLE The nucleotide sequence of mitochondrial ARS in Saccharomyces
cerevisiae
JOURNAL J. Gen. Appl. Microbiol. 30, 469-478 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
tRNA < 1 49 Ser-tRNA
site 138 148 consensus autonomously replicating sequence
BASE COUNT 99 a 25 c 11 g 83 t
ORIGIN
1 ctatcattag tctttattgg ctacgtaggt tcaaatccta catcatccgt aataatacat
61 atatataata ataattttaa tattattcct ataaaaataa aataaataaa taaataataa
121 taattaatta attttaataa atataaaata tataaaataa taataataat aattattatt
181 ttaataatat tatttatata atagtccggc ccgccccc
//
LOCUS MUSMDRXX 2873 bp ds-DNA ROD 01-AUG-1990
DEFINITION Mouse P-glycoprotein (mdr1a) gene, exons 1 and 2.
ACCESSION M33580
KEYWORDS P-glycoprotein.
SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line
J774.2-vinblastine resistant subline J7.V1-1 DNA, clone pV1.1a.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2873)
AUTHORS Hsu,S.I.-H., Cohen,D., Kirschner,L.S., Lothstein,L.,
Hartstein,M. and Horwitz,S.B.
TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter
reveals the basis for differential transcript heterogeneity in
multidrug-resistant J774.2 cells
JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.S.Kirschner, 05-APR-1990.
FEATURES from to/span description
pept 2613 / 2677 P-glycoprotein, exon 2 (first expressed exon)
pre-msg 1992 > 2873 P-glycoprotein mRNA and introns (alt.)
pre-msg 1801 > 2873 P-glycoprotein mRNA and introns (alt.)
IVS 2120 2606 P-glycoprotein intron A
IVS 2678 > 2873 P -glycoprotein intron B
signal 1904 1912 CAAT box
signal 1956 1963 TATA box
site 1880 1887 SP-1 site
site 1921 1927 SP-1 site
site 1937 1944 SP-1 site
site 1869 1875 AP-1 site
rpt 1 1300 L1Md repetitive element
BASE COUNT 860 a 621 c 714 g 678 t
ORIGIN Chromosome 5.
1 gaattctcac ctgaggaata ccgaatccag agaaacacct gaaaaaatgt tcaacatcct
61 taatcatcag ggaaatgcaa atcaaaacaa ccctgagatt ccacctcaca ccagtcagaa
121 tggctaagat caaaaattca ggtgacagca gatgctggcg aggatgtgga gaaagaggaa
181 cactcctcca ttgttggtgg gagtgcaggc ttgtacaacc actctggaaa tcagtctggc
241 ggttcctcag aaaactggac atagtactct cggaggatcc agcaatacct ctcctgggca
301 tatatccaga agatgcccca acaggtaaga aggacacatg ctccactatg ttcatagcag
361 ccttatttat aatagccaga agctggaaag aacctagatg cccctcaaca gaggaatgga
421 tacagaaaat gtggtacatc tacacaatgg agtactactc agctattaaa aagaatgaat
481 ttatgaaatt cctagccaaa tggatggacc tggggggcat catcctgagt gaggtaacac
541 attcacaaag aaactcacac aatatgtatt cactgataag tggatattag ccccaaacct
601 aggataccca agatataaga tataatttgc taaacacatg aaactcaagg agaatgaaga
661 ctgaagtgtg gacactatgc ccctccttag atttgggaac aaaacaccca tggaaggagt
721 tacagagacg gagtttggag ctgagatgaa aggatggacc atgtagagac tgccatagcc
781 agggatccac cccataatca gcatccaaac gctgacacca ttgcatacac tagcaagatt
841 ttattgaaag gacgcagatg tagctgtctc ttgtgagact atgccggggc cagcaaacac
901 agaagtggat gctcacagtc agctaatgga tggatcatag ggctcccaat ggaggagcta
961 gagaaagtag ccaaggagct aaagggatct gcaaccctat aggtggaaca acattatgag
1021 ctaaccagta ccccggagct cttgactcta gctgcatata tatcaaaaga tggcctagtc
1081 ggccatcact ggaaagagag gcccattgga cttgcaaact ttatatgccc cagtacaggg
1141 gaataccagg gccaaaaagg gggagtgggt gggcagggga gtgggggtgg gtggatatgg
1201 gggacttttg gtatagcatt ggaaatgtaa atgagttaaa tacctaataa aaaatggaaa
1261 aaaaaataaa ataaaaataa gatgaaactg gaaaaaaaaa gttatgttta ataattccaa
1321 ttgaactgta agaatttcag atgccctgga aaaacatgga cattggttta gtacctaaaa
1381 gttcaaaata ttatatattt ttaaatacca ttttacactg aaatactcca tttatatact
1441 ggggactgtc ctctttctgg tttgctttgt tttgtttaat aaaagaaata aaccaatcta
1501 cctgaggaac tgtgaactat attgaagaaa agcctgcacg ggggttctct taccttttca
1561 agagtgcttc aaagaaggga aatttactga caggcaaggt ctgtacccat tgtttaattg
1621 tctgttagat gttatgcata gaatacgtct tttaacttag ccaaatgcag aaggccaagt
1681 gcactatcta caaacacata actctatata tagacatgtg catggccgtg tagagatgag
1741 actctgcaag tgtgtctcta atgattcggg ggatatgagt ttgtctaatt gacctttgag
1801 agggaaacca gactgcacat ttcatctaca aatccaacct gtttcgcaat ttctccagca
1861 ataatacttg agtcaagctg ggccgggagc tggttaacct ccaggtcaaa ctcactggct
1921 gggcgggact gcgcctgggc gtagattgag catgctaaat ttactctcct gtccacagaa
1981 agcccaggca cagtggaaca gcggtttcca ggagctgctg gtcccatctt ccaaggctct
2041 gctcaactca gagccgcttc ttccaaagtc tacatcttgg tggactttgc agaggaaacc
2101 gggagtagag acacgtgagg taagcatttc ctaggaaggg tcgggtgttc cggataccag
2161 agcctggtcc gggtgtcagc gtaatcgtga gtctgtgggg accaagtggc gacacaagag
2221 tcgctccagg agcacccgca gcatcagctt tcaggacggt gttttccgcg ccaccctgtg
2281 ctgtggatct cgctgcccag ctcgcagcca ggggtggtgg aggagcgcgc cagggcgagg
2341 ggacccagca ggcgggtggc ggacctagag ccgagcaccc ggtccacgca ggtgacacag
2401 cttcccggga ttccccagtg agttacctcc aggccctctc cggcagcatc agggcggggc
2461 tcctcctcac cactgggctc tgcggggcag tgagctttgc ataaactctg gtcccgtgtt
2521 tggctaatga actgtggttt ctccccaggt cgtgatggaa cttgaagagg accttaaggg
2581 aagagcagac aagaacttct ccccaggtcg tgatggaact tgaagaggac cttaagggaa
2641 gagcagacaa gaacttctca aagatgggca aaaagaggta gccagattgt ttcactttcg
2701 tactttactt gtcttgtaca ttcgggcaat tagtttgtag cctccagcac tgtacttgat
2761 tagtgggtgt tatttcagac ttcagaaatg taaaccagcc cttggaagga actcctcgct
2821 tggagcagtc cttcaaatgt gtgtgacaga tcaatcaatg attctgtgaa ttc
//
LOCUS MUSMDR1A 4924 bp ss-mRNA ROD 01-AUG-1990
DEFINITION Mouse P-glycoprotein (mdr1a) mRNA, complete cds.
ACCESSION M33581
KEYWORDS P-glycoprotein.
SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line
J774.2-vinblastine resistant subline J7.V1-1, cDNA to mRNA, library
pUC18-cDNA and pGEM-zf, clones pV1.PRC2, pV1.3, pV1.20, and pV1.10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4924)
AUTHORS Hsu,S.I.-H., Cohen,D., Kirschner,L.S., Lothstein,L.,
Hartstein,M. and Horwitz,S.B.
TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter
reveals the basis for differential transcript heterogeneity in
multidrug-resistant J774.2 cells
JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.S.Kirschner, 05-APR-1990. Albert Einstein College of Medicine,
1300 Morris Park Ave, Bronx, NY 10461
FEATURES from to/span description
pept 137 3967 P-glycoprotein (mdr1a)
mRNA < 1 4924 P-glycoprotein mRNA
signal 4315 4320 poly-A signal
signal 4898 4903 poly-A signal
BASE COUNT 1450 a 1021 c 1210 g 1243 t
ORIGIN Chromosome 5.
1 acagtggaac agcggtttcc aggagctgct ggtcccatct tccaaggctc tgctcaactc
61 agagccgctt cttccaaagt ctacatcttg gtggactttg cagaggaaac cgggagtaga
121 gacacgtgag gccgtgatgg aacttgaaga ggaccttaag ggaagagcag acaagaactt
181 ctcaaagatg ggcaaaaaga gtaaaaagga gaagaaagaa aagaaaccag cagtcagtgt
241 gcttacaatg tttcgttatg caggttggct agacaggttg tacatgctgg tgggaactct
301 ggctgctatt atccatggag tggcgctccc acttatgatg ctgatctttg gtgacatgac
361 agatagcttt gcaagtgtag gaaacgtctc taaaaacagt actaatatga gtgaggccga
421 taaaagagcc atgtttgcca aactggagga agaaatgacc acgtacgcct actattacac
481 cgggattggt gctggtgtgc tcatagttgc ctacatccag gtttcatttt ggtgcctggc
541 agctggaaga cagatacaca agatcaggca gaagtttttt catgctataa tgaatcagga
601 gataggctgg tttgatgtgc atgacgttgg ggagctcaac acccggctca cagatgatgt
661 ttccaaaatt aatgaaggaa ttggtgacaa aatcggaatg ttcttccagg caatggcaac
721 attttttggt ggttttataa taggatttac ccgtggctgg aagctaaccc ttgtgatttt
781 ggccatcagc cctgttcttg gactgtcagc tggtatttgg gcaaagatat tgtcttcatt
841 tactgataag gaactccatg cttatgcaaa agctggagca gttgctgaag aagtcttagc
901 agccatcaga actgtgattg cgtttggagg acaaaagaag gaacttgaaa ggtacaataa
961 caacttggaa gaagctaaaa ggctggggat aaagaaagct atcacggcca acatctccat
1021 gggtgcagct tttctcctta tctatgcatc atatgctctg gcattctggt atgggacttc
1081 cttggtcatc tccaaagaat actctattgg acaagtgctc actgtcttct tttccgtgtt
1141 aattggagca ttcagtgttg gacaggcatc tccaaatatt gaagccttcg ccaatgcacg
1201 aggagcagct tatgaagtct tcaaaataat tgataataag cccagtatag acagcttctc
1261 aaagagtggg cacaaaccag acaacataca aggaaatctg gaatttaaga atattcactt
1321 cagttaccca tctcgaaaag aagttcagat cttgaagggc ctcaatctga aggtgaagag
1381 cggacagacg gtggccctgg ttggcaacag tggctgtgga aaaagcacaa ctgtccagct
1441 gatgcaaagg ctctacgacc ccctagatgg catggtcagt atcgacggac aggacatcag
1501 aaccatcaat gtgaggtatc tgagggagat cattggtgtg gtgagtcagg aacctgtgct
1561 gtttgccacc acgatcgccg agaacattcg ctatggccga gaagatgtca ccatggatga
1621 gattgagaaa gctgtcaagg aagccaatgc ctatgacttc atcatgaaac tgccccacca
1681 atttgacacc ctggttggtg agagaggggc gcacgtgagt gggggacaga aacagagaat
1741 cgccattgcc cgggccctgg tccgcaatcc caagatcctt ttgttggacg aggccacctc
1801 agccctggat acagaaagtg aagctgtggt tcaggccgca ctggataagg ctagagaagg
1861 ccggaccacc attgtgatag ctcatcgctt gtctaccgtt cgtaatgctg acgtcattgc
1921 tggttttgat ggtggtgtca ttgtggagca aggaaatcat gatgagctca tgagagaaaa
1981 gggcatttac ttcaaacttg tcatgacaca gacagcagga aatgaaattg aattaggaaa
2041 tgaagcttgt aaatctaagg atgaaattga taatttagac atgtcttcaa aagattcagg
2101 atccagtcta ataagaagaa gatcaactcg caaaagcatc tgtggaccac atgaccaaga
2161 caggaagctt agtaccaaag aggccctgga tgaagatgta cctccagctt ccttttggcg
2221 gatcctgaag ttgaattcaa ctgaatggcc ttattttgtg gttggtatat tctgtgccat
2281 aataaatgga ggcttacagc cagcattctc cgtaatattt tcaaaagttg taggggtttt
2341 tacaaatggt ggcccccctg aaacccagcg gcagaacagc aacttgtttt ccttgttgtt
2401 tctgatcctt gggatcattt ctttcattac attttttctt cagggcttca catttggcaa
2461 agctggagag atcctcacca agcgactccg atacatggtt ttcaaatcca tgctgagaca
2521 ggatgtgagc tggtttgatg accctaaaaa caccaccgga gcactgacca ccaggctcgc
2581 caacgatgct gctcaagtga aaggggctac agggtctagg cttgctgtga ttttccagaa
2641 catagcaaat cttgggacag gaatcatcat atccctaatc tatggctggc aactaacact
2701 tttactctta gcaattgtac ccatcattgc gatagctgga gtggttgaaa tgaaaatgtt
2761 gtctggacaa gcactgaaag ataagaagga actagaaggt tctggaaaga ttgctacgga
2821 agcaattgaa aacttccgca ctgttgtctc tttgactcgg gagcagaagt ttgaaaccat
2881 gtatgcccag agcttgcaga taccatacag aaatgcgatg aagaaagcac acgtgtttgg
2941 gatcacgttc tccttcaccc aggccatgat gtatttttct tatgctgctt gtttccggtt
3001 cggtgcctac ttggtgacac aacaactcat gacttttgaa aatgttctgt tagtattctc
3061 agctattgtc tttggtgcca tggcagtggg gcaggtcagt tcattcgctc ctgactatgc
3121 gaaagcaaca gtgtcagcat cccacatcat caggatcatt gagaaaaccc ccgagattga
3181 cagctacagc acgcaaggcc taaagccgaa tatgttggaa ggaaatgtgc aatttagtgg
3241 agtcgtgttc aactatccca cccgacccag catcccagtg cttcaggggc tgagccttga
3301 ggtgaagaag ggccagacgc tggccctggt gggcagcagt ggctgcggga agagcacagt
3361 ggtccagctg ctcgagcgct tctacgaccc catggctgga tcagtgtttc tagatggcaa
3421 agaaataaag caactgaatg tccagtggct ccgagcacag ctgggcattg tgtcccaaga
3481 gcccattctc tttgactgca gcatcgcaga gaacattgcc tacggagaca acagccgggt
3541 cgtgtcttat gaggagattg tgagggcagc caaggaggcc aacatccacc agttcatcga
3601 ctcgctacct gataaataca acaccagagt aggagacaaa ggcactcagc tgtcgggtgg
3661 gcagaagcag cgcatcgcca tcgcacgcgc cctcgtcaga cagcctcaca ttttacttct
3721 ggacgaagca acatcagctc tggatacaga aagtgaaaag gttgtccagg aagcgctgga
3781 caaagccagg gaaggccgca cctgcattgt gatcgctcac cgcctgtcca ccatccagaa
3841 cgcggacttg atcgtggtga ttcagaacgg caaggtcaag gagcacggca cccaccagca
3901 gctgctggcg cagaagggca tctacttctc aatggtcagt gtgcaggctg gagcaaagcg
3961 ctcatgaact gtgaccatgt aagatgttaa gtatttttat tgtttgtatt catatatggt
4021 gtttaatcca agtcaaaagg aaaacactta ctaaaatagc cagttatcta ttttctgcca
4081 cagtggaaag catttagttt ggtttagagt cttcagaggc tttgtaatta aaaaaacaaa
4141 aatagataca gcatcaaatg gagattaatg ctttaaaatg cactataaaa tttataaaag
4201 ggttaaaagt gaatgtttga taatatatac ttttatttat actttctcat ttgtaactat
4261 aactgatttc tgcttaacaa attatgtatg tatcaaaaat tactgaaatg tttgtataaa
4321 gtatatatag tgaaactgag cattcatatt tttgagttat tttgctcaaa tgcatgcgaa
4381 attatatatt gtcccaactg ggatattgta cataatttta gcctttaaaa aacagtccat
4441 tactgggggg agggggcatc actctatggg caaagtgtta ctcagacatg ggcacctgag
4501 ttcagatccc taccacctaa gtaagcagac aaggtgtggt gtttttgtaa tgccagtgct
4561 agaggcagaa aagacagatc ctgcaggctc agtggctggc caaacagcct agccaacata
4621 gcgcgttcca ggttcagtga gaaaacttgt ctcaaaaatc agagggaaaa gcaaatgagg
4681 tgtcagccat gtgcactcat gcaaatgcca tacatgcaga agtatgtgca cacacacgca
4741 cacattaacc aacgactagc aaggaaaatg aaggtggata agaggggtgg gactgggaca
4801 aaggagggta cctggatgaa tatgactgaa ggacgttatg tacacatatg aaaacgtcgt
4861 actgaaactc actacaatgt atacttaata tattgctaat aaaatatttt taaaagaaaa
4921 aaat
//
LOCUS RICCPCTA 2526 bp ds-DNA ORG 01-AUG-1990
DEFINITION Rice chloroplast beta and epsilon subunit (atpB and atpE) genes,
complete cds.
ACCESSION M31464 Y00323
KEYWORDS atpB protein; atpE protein.
SOURCE Rice chloroplast DNA, clone Ct-3.
ORGANISM Chloroplast Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE 1 (bases 1 to 2526)
AUTHORS Moon,E., Kao,T.-h. and Wu,R.
TITLE Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters
from rice
JOURNAL Nucleic Acids Res. 15, 4358-4359 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 398 1894 atpB protein
pept 1891 2304 atpE protein
BASE COUNT 770 a 459 c 563 g 734 t
ORIGIN
1 cccccttttc ttattttgag tccaaatacc taaatactac gaaaattctc tgttgacagc
61 aatctatgct tcacagtagt atatattttg tatatcgaag tcctagataa gaaagtagag
121 taggcacaaa tcgtttacaa aaggcaaaat gtatatgaaa aaaagattga ttgaactttc
181 cgacgggctc attccatgag taaacgattg aatgggattc gtttgggcaa cgaaatcaag
241 tgctggtccc cttttctctc ttattgaatt aactaattca tttccttttg acttttggga
301 tttttggata tttttttggt gttgatttgg cattattcaa caagaaaaaa atcaaaattt
361 cgataaattc cttttttttg aaaattatgt gataattatg agaaccaatc ctactacttc
421 tcgtcccggg gtttctacaa ttgaagaaaa aagtacaggg cgtatcgatc aaattattgg
481 acccgtgctg gatgtcactt ttcccccggg caagttacct tatatttata atgctttggt
541 agtcaagagt cgagacactg agggtaagca aattaatgta acttgtgagg tacaacaatt
601 attaggaaat aatcgagtta gagctgtagc tatgagtgct acagatgggt tgatgagagg
661 aatggaagtg attgacacgg gagctcctct cagtgttcct gtcggtggag ctactcttgg
721 acgaattttc aacgttcttg gggagcctgt tgacaatttg ggtcctgtag atactagtgc
781 aacattccct attcatagat ccgcgcccgc ctttatcgag ttagatacga aattatccat
841 ctttgaaact ggtattaagg tggtcgatct tttagctcct tatcggcgtg gaggaaaaat
901 cggactattt gggggagctg gagtaggtaa aacagtactc atcatggaat taatcaacaa
961 tattgctaaa gctcacgggg gcgtatccgt atttggcgga gtaggggaac ggactcgtga
1021 aggaaatgat ctttatatgg aaatgaagga atctggagta attaatgaaa aaaatcttga
1081 ggaatcaaag gtagctctag tctatggcca aatgaatgaa ccgccaggag ctcgtatgag
1141 agttggtttg actgccctaa ctatggcaga atatttccga gatgttatta agcaagacgt
1201 gcttctattc atcgataata tctttcgttt tgttcaagca ggatcggagg tatctgcctt
1261 attagggaga atgccctctg cagtgggtta tcaacctact cttagtacag aaatgggttc
1321 tttgcaagaa agaattactt ctactaaaaa gggatctata acttcgatcc aagcggttta
1381 tgtacctgcg gacgatttga ccgaccctgc tcctgctaca acatttgcac atttggatgc
1441 tactaccgta ctttccagag gattagcttc caaagggatt tatcctgcag tagatccttt
1501 agattcaacc tcaactatgt tacaacctcg gatcgttggc aacgaacatt atgaaactgc
1561 gcaaagagtt aagcaaactt tacaacgtta caaagaactt caggacatta tcgcaattct
1621 tgggttggat gaattatcgg aggaggatcg tttaactgta gcaagagcac gaaaaattga
1681 gcgcttctta tcacaaccgt tttttgtggc agaagttttt accggttctc caggaaagta
1741 tgttggtctt gcagaaacta ttaggggatt tcaactaatc ctttccggag aattagacgg
1801 cctacccgaa caggcttttt atttggtggg taacatcgat gaagctagca cgaaagctat
1861 aaacttagaa gaggagaaca acttgaagaa atgaaattaa atctttatgt actgactcct
1921 aagcgaatta tttgggattg tgaagtgaaa gaaatcattt tatctactaa tagtggccaa
1981 attggcgtat taccaaacca cgcccccatt aacacagctg tagatatggg tcccttgaga
2041 atacgcctcc tcaacgatca atggttaacg gcggttctgt ggagcggttt tgccagaata
2101 gttaataatg agatcatcat tttaggaaat gatgcggaac tgggtagtga cattgatccg
2161 gaagaagctc aacaggcact tgaaatagcc gaagctaacg tgagtagagc tgagggtacg
2221 aaagaattgg ttgaagcgaa ggtagctctc agacgagcta ggatacgagt cgaggctgtt
2281 aattggattc ccccatctaa ttgaagacaa cccaacggtt tagttgatac aaagaaaaag
2341 ggaagagggg tagaaaaaat tattagatag cgaagcgaag tagggccaat gctatctagt
2401 aatttttcta cctacctacc tactattgga tttgaaccaa tgactcccgc cgtatgaaag
2461 caatactcta accactgagt taagtaggca atttatcacc acaaaggaag accctttact
2521 tcgatc
//
LOCUS RICCPCTB 2524 bp ds-DNA ORG 01-AUG-1990
DEFINITION Rice mitochondrial beta and epsilon subunit (atpB and atpE)
pseudogenes, complete cds.
ACCESSION M31465 Y00323
KEYWORDS pseudogene.
SOURCE Rice chloroplast DNA, clone Ct-1.
ORGANISM Chloroplast Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE 1 (bases 1 to 2524)
AUTHORS Moon,E., Kao,T.-h. and Wu,R.
TITLE Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters
from rice
JOURNAL Nucleic Acids Res. 15, 4358-4359 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps 398 1392 atpB pseudogene
pept.ps 1389 2302 atpE pseudogene
BASE COUNT 769 a 458 c 564 g 733 t
ORIGIN
1 cccccttttc ttattttgag tccaaatacc taaatactac gaaaattctc tgttggcagc
61 aatctatgct tcacagtagt atatattttg tatatcgaag tcctagataa gaaagtagag
121 taggcacaaa tcgtttacaa aaggcaaaat gtatatgaaa aaaagattga ttgaactttc
181 cgacgggctc attccatgag taaacgattg aatgggattc gtttgggcaa cgaaatcaag
241 tgctggtccc cttttctctc ttattgaatt aactaattca tttccttttg acttttggga
301 tttttggata tttttttggt gttgatttgg cattattcaa caagaaaaaa atcaaaattt
361 cgataaattc cttttttttg aaaattatgt gataattatg agaaccaatc ctactacttc
421 tcgtcccggg gtttctacaa ttgaagaaaa aagtacaggg cgtatcgatc aaattattgg
481 acccgtgctg gatgtcactt ttcccccggg caagttacct tatatttata atgctttggt
541 agtcaagagt cgagacactg agggtaagca aattaatgta acttgtgagg tacaacaatt
601 attaggaaat aatcgagtta gagctgtagc tatgagtgct acagatgggt tgatgagagg
661 aatggaagtg attgacacgg gagctcctct cagtgttcct gtcggtggag ctactcttgg
721 acgaattttc aacgttcttg gggagcctgt tgacaatttg ggtcctgtag atactagtgc
781 aacattccct attcatagat ccgcgcccgc ctttatcgag ttagatacga aattatccat
841 ctttgaaact ggtattaagg tggtcgatct tttagctcct tatcggcgtg gaggaaaaat
901 cggactattt gggggagctg gagtaggtaa aacagtactc atcatggaat taatcaacaa
961 tattgctaaa gctcacgggg gcgtatccgt atttggcgga gtaggggaac ggactcgtga
1021 aggaaatgat ctttatatgg aaatgaagga atctggagta attaatgaaa aaaatcttga
1081 ggaatcaaag gtagctctag tctatggcca aatgaatgaa ccgccaggag ctcgtatgag
1141 agttggtttg actgccctaa ctatggcaga atatttccga gatgttatta agcaagacgt
1201 gctctattca tcgataatat ctttcgtttt gttcaagcag gatcggaggt atctgcctta
1261 ttagggagaa tgccctctgc agtgggttat caacctactc ttagtacaga aatgggttct
1321 ttgcaagaaa gaattacttc tactaaaaag ggatctataa cttcgatcca agcggtttat
1381 gtacctgcgg acgatttgac cgaccctgct cctgctacaa catttgcaca tttggatgct
1441 actaccgtac tttccagagg attagcttcc aaagggattt atctgcagta gatcctttag
1501 attcaacctc aactatgtta caacctcgga tcgttggcaa cgaacattat gaaactgcgc
1561 aaagagttaa gcaaacttta caacgttaca aagaacttca ggacattatc gcaattcttg
1621 ggttggatga attatcggag gaggatcgtt taactgtagc aagagcacga aaaattgagc
1681 gcttcttatc acaaccgttt tttgtggcag aagtttttac cggttctcca ggaaagtatg
1741 ttggtcttgc agaaactatt aggggatttc aactaatcct ttccggagaa ttagacggcc
1801 tacccgaaca ggctttttat ttggtgggta acatcgatga agctagcacg aaagctataa
1861 acttagaaga ggagaacaac ttgaagaaat gaaattaaat ctttatgtac tgactcctaa
1921 gcgaattatt tgggattgtg aagtgaaaga aatcatttta tctactaata gtggccaaat
1981 tggcgtatta ccaaaccacg cccccattaa cacagctgta gatatgggtc ccttgagaat
2041 acgcctcctc aacgatcaat ggttaacggc ggttctgtgg agcggttttg ccagaatagt
2101 taataatgag atcatcattt taggaaatga tgcggaactg ggtagtgaca ttgatccgga
2161 agaagctcaa caggcacttg aaatagccga agctaacgtg agtagagctg agggtacgaa
2221 agaattggtt gaagcgaagg tagctctcag acgagctagg atacgagtcg aggctgttaa
2281 ttggattccc ccatctaatt gaagacaacc caacggttta gttgatacaa agaaaaaggg
2341 aagaggggta gaaaaaatta ttagatagcg aagcgaagta gggccaatgc tatctagtaa
2401 tttttctacc tacctaccta ctattggatt tgaaccaatg actcccgccg tatgaaagca
2461 atactctaac cactgagtta agtaggcaat ttatcaccac aaaggaagac cctttacttc
2521 gatc
//
LOCUS RICMTBEA 2281 bp ds-DNA ORG 01-AUG-1990
DEFINITION Rice mitochondrial beta and epsilon subunit (atpB and atpE)
pseudogene, complete cds.
ACCESSION M31466 Y00323
KEYWORDS pseudogene.
SOURCE Rice mitochondrion DNA, clone Mt-0.
ORGANISM Mitochondrion Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE 1 (bases 1 to 2281)
AUTHORS Moon,E., Kao,T.-h. and Wu,R.
TITLE Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters
from rice
JOURNAL Nucleic Acids Res. 15, 4358-4359 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps 384 1657 atpB pseudogene
pept.ps 1654 2067 atpE pseudogene
BASE COUNT 710 a 416 c 488 g 667 t
ORIGIN
1 cccctttctt attttgagtc caaataccta aatactatga aaattctctg ttgacagcaa
61 tctatgcttc acagtagtat atattttgta tatcgaagtc ctagataaga aatggagtag
121 gcacagatcc ttcacaaaag gcgaaatgta tatgaaaaaa agattgattg aactttccga
181 cggactcatg gaatgagtaa acgattgaat gggattcgtt tgggcaacga aatcaagtgc
241 tggtcccctt ttctctctta ttgaattaac taattcattt ccttttgact tttgttggat
301 ttttggatat ttttttggtg ttgatttggc attattcaac aagataaaaa gaaaaatttc
361 tataaattcc ttttttttta attatgagaa ccaatcctac tacttctcat cccggggttt
421 ctacaattga agaaaaaagt acagggcgta tcgatcaaat tattggaccc gtgctggatg
481 ccacttttcc cccgggcaag ttaccttata tttataacgc tttggtagtc gagacactga
541 gggtaagcaa attaatgtga cttgtgaggt acaacaatta ttaggaaata atcgagttag
601 aacgaaatta tccatctttg aaactggtat taaggtggtc gatcttttag ctccttatcg
661 gcgtggagga aaaatcggac tatttggggg aactggagta ggtaaaacag tactcatcat
721 ggaattaatc aacaatattg ctaaagctca tagaggcgta tccgtatttg gcggagtagg
781 ggaacggact cgtgaaggaa atgatcttta tatggaaata aaggagtaat taatgaaaaa
841 aatccttgag gaatcaaagg tagctctagt ctatggccaa atgaatgaac gccaggagct
901 cgtatgagag ttggtttgac tgccctaact atggcagaat atttccgaga tgttattaag
961 caagacgtgc ttctattcat cgataatatc tttcgttttg ttcaagcagg atcgggggta
1021 tttgccttat tagggagaat gccctctgca gtgggttatc aacctactct tagtacagaa
1081 atgggttctt tgcaagaaag aattacttct actaaaaagg gatctataac ttcgatccaa
1141 gcggtttatg tacctgcgga cgatttgacc gaccctgctc ctgccacaac atttgcacat
1201 ttggatgcta ctaccgtact ttccagagga ttagcttcca agggtattta tcctagatcc
1261 tttagattca acctcaacta tgttacaacc tcggatcgtt ggcaacgaac attatgaaac
1321 tgcgcaaaga gttaagcaaa ctttacaacg ttacaaagaa cttcaggaca ttatcgcaat
1381 tcttgggttg gatgaattat cggaggagga tcgtttaact gtagcaagag cacgaaaaat
1441 tgagcgcttc ctatcacaac cgttctttgt ggcagaagtt tttaccggtt ctccaggaaa
1501 gtatgttggt cttgcagaaa caattcgggg atttcaacta atcctttccg gagaattaga
1561 cggcctaccc gaacaggctt tttatttggt gggtaacatc gatgaagcta gcacgaaagc
1621 tataaactta gaagaggaaa acaacttgaa gaaatgaaat taaatcttta tgtactgact
1681 cctaagcgaa ttatttggga ttgtgaagtg aaagaaatca ttttttctac taatagtggc
1741 caaattggcg tattaccaaa ccacgccccc attaacacag ctgtagatat gggtcccttg
1801 agaatacgcc tcctcaacga tcaatggtta acggcggttc tgtggagcgg ttttgccaga
1861 atagttaata atgagatcat cattttagga aatgatgcgg aactgggtag tgacattgat
1921 ccggaagaag ctcaacaggc acttgaaata gccgaagcta acgtgagtag agctgagggt
1981 acgaaagaat tggttgaagc gaacgtagct ctcagacgag ctgggatacg agtcgaggct
2041 gttaattgga ttcccccatc taattgaaga caatccaacg gtttagttga tacaaagaaa
2101 aagggtctaa aaagttatta gatagcgaag cgaagtaagt ccaatgctat ctagtaattt
2161 ttctacctac ctacctacta ttggatttga accaatgact cccgccgtat gaaagcaata
2221 ctctaaccac tgagttaagt aggcaattta tcaccacaaa ggaagaccct ttacttcgat
2281 c
//GenBank-Updates@genbank.bio.net (08/02/90)
LOCUS ECOALPHOA 171 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33724
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 171)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y.
TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia
coli
JOURNAL Gene 44, 121-125 (1986)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
site 1 141 phoA coding sequence /note="upstream sequence
of psiA44:Mu dI" /evidence=EXPERIMENTAL
[1],[2]
site 141 142 Mu dI insertion site /evidence=EXPERIMENTAL
/note="psiA44:Mu dI insertion site" [1]
trns 142 > 171 Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT 49 a 45 c 51 g 26 t
ORIGIN
1 caaagctccg ggcctcaccc aggcgctaaa taccaaagat ggcgcagtga tggtgatgag
61 ttacgggaac tccgaagagg attcacaaga acataccggc agtcagttgc gtattgcggc
121 gtatggcccg catgccgcca atgaagcggc gcacgaaaaa cgcgaaagcg t
//
LOCUS ECOALPHOB 183 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33725
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 183)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y.
TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia
coli
JOURNAL Gene 44, 121-125 (1986)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
site 1 153 phoA coding sequence /note="upstream sequence
of psiA10:Mu dI" /evidence=EXPERIMENTAL
[1],[2]
site 153 154 Mu dI insertion site /evidence=EXPERIMENTAL
/note="psiA10::Mu dI insertion site" [1]
trns 154 > 183 Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT 55 a 39 c 39 g 50 t
ORIGIN
1 ctgtcataaa gttgtcacgg ccgagactta tagtcgcttt gtttttattt tttaatgtat
61 ttgtacatgg agaaaataaa gtgaaacaaa gcactattgc actggcactc ttaccgttac
121 tgtttacccc tgtgacaaaa gcccggacac cagtgaagcg gcgcacgaaa aacgcgaaag
181 cgt
//
LOCUS ECOALPHOD 242 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33726
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 242)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y.
TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia
coli
JOURNAL Gene 44, 121-125 (1986)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
site 1 212 phoA coding sequence /note="upstream sequence
of psiA40" /evidence=EXPERIMENTAL [1],[2]
site 212 213 Mu dI insertion site /evidence=EXPERIMENTAL
/note="psiA40:Mu dI insertion site" [1]
trns 213 > 242 Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT 59 a 61 c 64 g 58 t
ORIGIN
1 attgcactgg cactcttacc gttactgttt acccctgtga caaaagcccg gacaccagaa
61 atgcctgttc tggaaaaccg ggctgctcag ggcgatatta ctgcacccgg cggtgctcgc
121 cgtttaacgg gtgatcagac tgccgctctg cgttattctc ttagcgataa acctgcaaaa
181 aatattattt tgctgattgg cgatgggatg ggtgaagcgg cgcacgaaaa acgcgaaagc
241 gt
//
LOCUS ECOALPHOE 97 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33727
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 97)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y.
TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia
coli
JOURNAL Gene 44, 121-125 (1986)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
site 1 67 phoA coding sequence /note="upstream sequence
of psiA5::Mu dI. psiA30 upstream region is
identical" /evidence=EXPERIMENTAL [1],[2]
site 67 68 Mu dI insertion site /evidence=EXPERIMENTAL
/note="psiA5:Mu dI insertion site; psiA30
upstream region is identical" [1]
trns 68 > 97 Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT 34 a 13 c 22 g 28 t
ORIGIN
1 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt
61 gcactggtga agcggcgcac gaaaaacgcg aaagcgt
//
LOCUS ECOALPHOF 82 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33728
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 82)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T.
TITLE Molecular biology of carbon-phosphorus bond cleavage: Cloning and
sequencing of the phn (psiD) genes involved in alkylphosphonate
uptake and C-P lyase activity in Escherichia coli B
JOURNAL J. Biol. Chem. 265, 4461-4471 (1990)
STANDARD full staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
site 1 52 phn operon sequence /note="upstream sequence
of psiD33; psiD41 is identical"
/evidence=EXPERIMENTAL [1],[2]
site 52 53 Mu dI insertion site /evidence=EXPERIMENTAL
/note="psiD33:Mu dI insertion site; psiD41 is
identical" [1]
trns 53 > 82 Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT 28 a 19 c 22 g 13 t
ORIGIN
1 attgaggaaa acgaatgaac gctaagataa ttgcctcgct ggccttcacc agtgaagcgg
61 cgcacgaaaa acgcgaaagc gt
//
LOCUS ECOALPHOG 94 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33729
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 94)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y.
TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia
coli
JOURNAL Gene 44, 121-125 (1986)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
site 1 64 phoA coding sequence /note="upstream sequence
of psiA4" /evidence=EXPERIMENTAL [1],[2]
site 65 65 Mu dI insertion site /evidence=EXPERIMENTAL
/note="psiA4:Mu dI insertion site" [1]
trns 65 > 94 Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT 35 a 18 c 22 g 19 t
ORIGIN
1 aatgtatttg tacatggaga aaataaagtg aaacaaagca ctattgcact ggcactctta
61 ccgttgaagc ggcgcacgaa aaacgcgaaa gcgt
//
LOCUS ECOALPHOH 233 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33730
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 233)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T.
TITLE Molecular biology of carbon-phosphorus bond cleavage: Cloning and
sequencing of the phn (psiD) genes involved in alkylphosphonate
uptake and C-P lyase activity in Escherichia coli B
JOURNAL J. Biol. Chem. 265, 4461-4471 (1990)
STANDARD full staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
site 1 203 phn operon sequence /note="upstream sequence
of psiD31" /evidence=EXPERIMENTAL [1],[2]
site 203 204 Mu dI insertion site /evidence=EXPERIMENTAL
/note="Mu dI insertion site into phnD gene
(psiD31::Mu dI)" [1]
trns 204 > 233 Mu dI-R /evidence=EXPERIMENTAL [1]
variant 201 201 g in E.coli; a in E.coli B
variant 174 174 c in E.coli; g in E.coli B
variant 141 141 g in E.coli; a in E.coli B
variant 135 135 g in E.coli; a in E.coli B
variant 76 76 t in E.coli; c in E.coli B
variant 51 51 g in E.coli; a in E.coli B
BASE COUNT 62 a 64 c 66 g 41 t
ORIGIN
1 tcgctggcct tcaccagcat gttcagcctc agcaccctgt taagcccggc gcacgccgaa
61 gagcaggaaa aggcgttgaa tttcggcatt atttcaacgg aatcacagca aaacctgaaa
121 ccgcaatgga cgccgttctt gcaggatatg gagaagaagc tgggcgtgaa ggtcaacgcc
181 ttctttgccc cggactacgc gggtgaagcg gcgcacgaaa aacgcgaaag cgt
//
LOCUS ECOALPHOJ 88 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33732
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 88)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y.
TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia
coli
JOURNAL Gene 44, 121-125 (1986)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
site 1 58 phoA coding sequence /note="upstream sequence
of psiA12; upstream sequence of psiA13::Mu dI
is identical" /evidence=EXPERIMENTAL [1],[2]
site 58 59 Mu dI insertion site /evidence=EXPERIMENTAL
/note="psiA12:Mu dI insertion site; psiA13::Mu
dI is identical" [1]
trns 59 > 88 Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT 33 a 13 c 21 g 21 t
ORIGIN
1 ttttttaatg tatttgtaca tggagaaaat aaagtgaaac aaagcactat tgcactggtg
61 aagcggcgca cgaaaaacgc gaaagcgt
//
LOCUS ECOALPHOK 66 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33733
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 66)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y.
TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia
coli
JOURNAL Gene 44, 121-125 (1986)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 36 psiF (orf-106) coding sequence /pseudo
/evidence=EXPERIMENTAL [1],[2]
trns 37 > 66 Mu dI-R
BASE COUNT 26 a 17 c 15 g 8 t
ORIGIN
1 cgcgaatgca ataatcaagc aacacaacaa tctctgtgaa gcggcgcacg aaaaacgcga
61 aagcgt
//
LOCUS ECOALPHOL 67 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33734
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 67)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 37 psiR43 coding sequence /pseudo
trns 38 > 67 Mu dI-R
BASE COUNT 23 a 18 c 17 g 9 t
ORIGIN
1 ctctaattag gacaacagaa tcactcggta ccggacctga agcggcgcac gaaaaacgcg
61 aaagcgt
//
LOCUS ECOALPHOM 106 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33735
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 106)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Overduin,P., Boos,W. and Tommassen,J.
TITLE Nucleotide sequence of the ugp genes of e. coli k-12: homology to
the maltose system
JOURNAL Mol. Microbiol. 2, 767-775 (1988)
STANDARD simple staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 76 ugpB coding sequence /pseudo
/evidence=EXPERIMENTAL [1],[2]
trns 77 > 106 Mu dI-R
BASE COUNT 41 a 26 c 22 g 17 t
ORIGIN
1 gcccaacgtt ttaacgccga aaacccggat tacaaaattg taccgaccta taaaggcaac
61 tacgaacaga atttaatgaa gcggcgcacg aaaaacgcga aagcgt
//
LOCUS ECOALPHON 67 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33736
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 67)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 37 psiH6 coding sequence /pseudo [1]
trns 38 > 67 Mu dI-R
BASE COUNT 22 a 14 c 17 g 14 t
ORIGIN
1 ggcatcatga ctatcgtaag tttgatctat cacatagtga agcggcgcac gaaaaacgcg
61 aaagcgt
//
LOCUS ECOALPHOO 82 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33737
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 82)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Oliver,G., Gosset,G., Sanchez-Pescador,R., Lozoya,E., Ku,L.M.,
Flores,N., Becerril,B., Valle,F. and Bolivar,F.
TITLE Determination of the nucleotide sequence for the glutamate synthase
structural genes of Escherichia coli K-12
JOURNAL Gene 60, 1-11 (1987)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 52 gltD coding sequence /pseudo
/evidence=EXPERIMENTAL [1],[2]
trns 53 > 82 Mu dI-R
BASE COUNT 26 a 21 c 28 g 7 t
ORIGIN
1 gaacatgccg ggttcccgcc gcgaagtgaa aaacgcccgg gaagaaggcg tatgaagcgg
61 cgcacgaaaa acgcgaaagc gt
//
LOCUS ECOALPHOP 116 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33738
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 116)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 86 psiK20 coding sequence /pseudo [1]
trns 87 > 116 Mu dI-R
BASE COUNT 31 a 26 c 34 g 25 t
ORIGIN
1 gcgactcaga gagatcagca agtcagcact accgaccgtg ggcagttttc aaaaggtcat
61 cggtagtagt ttttgcgttc tgcgtttgaa gcggcgcacg aaaaacgcga aagcgt
//
LOCUS ECOALPHOQ 70 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33739
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 70)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Cole,S.T., Eiglmeier,K., Ahmed,S., Honore,N., Elmes,L.,
Anderson,W.F. and Weiner,J.H.
TITLE Nucleotide sequence and gene-polypeptide relationships of the
glpABC operon encoding the anaerobic sn-glycerol-3-phosphate
dehydrogenase of Escherichia coli K-12
JOURNAL J. Bacteriol. 170, 2448-2456 (1988)
STANDARD simple staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 40 glpB coding sequence /pseudo [1],[2]
trns 41 > 70 Mu dI-R
BASE COUNT 19 a 20 c 21 g 10 t
ORIGIN
1 acaacgcgtg ctcgatctcg cttgccaggc gcaggcatta tgaagcggcg cacgaaaaac
61 gcgaaagcgt
//
LOCUS ECOALPHOR 78 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33740
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 78)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Oliver,G., Gosset,G., Sanchez-Pescador,R., Lozoya,E., Ku,L.M.,
Flores,N., Becerril,B., Valle,F. and Bolivar,F.
TITLE Determination of the nucleotide sequence for the glutamate synthase
structural genes of Escherichia coli K-12
JOURNAL Gene 60, 1-11 (1987)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 48 gltB coding sequence /pseudo
/evidence=EXPERIMENTAL [1],[2]
trns 49 > 78 Mu dI-R
BASE COUNT 20 a 21 c 24 g 13 t
ORIGIN
1 cgaactgatg gttatcgaca cccgcagtgg gcgtattctg cactcggctg aagcggcgca
61 cgaaaaacgc gaaagcgt
//
LOCUS ECOALPHOS 87 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33741
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 87)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 57 paiJ22 coding sequence /pseudo [1]
trns 58 > 87 Mu dI-R
BASE COUNT 28 a 20 c 22 g 17 t
ORIGIN
1 agactatgag acggaatgaa gaactgttca tcacattctt tattccgagc cgctacgtga
61 agcggcgcac gaaaaacgcg aaagcgt
//
LOCUS ECOALPHOT 81 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33742
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 81)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Davis,E.O. and Henderson,P.J.F.
TITLE The cloning and DNA sequence of the gene xylE for xylose-proton
symport in Escherichia coli K12
JOURNAL J. Biol. Chem. 262, 13928-13932 (1987)
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 51 psiE coding sequence /pseudo [1],[2]
trns 52 > 81 Mu dI-R
BASE COUNT 25 a 19 c 24 g 13 t
ORIGIN
1 gtgcatctgg ctgatgtgct gttcgcgcca gaacaaacca gcaaatatga gtgaagcggc
61 gcacgaaaaa cgcgaaagcg t
//
LOCUS ECOALPHOU 69 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33743
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 69)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Oliver,G., Gosset,G., Sanchez-Pescador,R., Lozoya,E., Ku,L.M.,
Flores,N., Becerril,B., Valle,F. and Bolivar,F.
TITLE Determination of the nucleotide sequence for the glutamate synthase
structural genes of Escherichia coli K-12
JOURNAL Gene 60, 1-11 (1987)
STANDARD simple staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 39 gltB coding sequence /evidence=EXPERIMENTAL
[1],[2]
trns 40 > 69 Mu dI-R [1]
BASE COUNT 19 a 15 c 21 g 14 t
ORIGIN
1 ctttgacttt aactccatgc atatggagcc gtgggatggt gaagcggcgc acgaaaaacg
61 cgaaagcgt
//
LOCUS ECOALPHOV 94 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33744
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 94)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 64 psiH16 coding sequence /pseudo [1]
trns 65 > 94 Mu dI-R [1]
BASE COUNT 19 a 26 c 29 g 20 t
ORIGIN
1 ccatcttagc agcgtctggt ccgggtcggc ttaggggctt cctttatgca gtactgcctg
61 ccgatgaagc ggcgcacgaa aaacgcgaaa gcgt
//
LOCUS ECOALPHOW 59 bp ds-DNA BCT 02-AUG-1990
DEFINITION Escherichia coli Mu dI insertion site.
ACCESSION M33745
KEYWORDS alkaline phosphatase.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 59)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed
13-APR-1990.
FEATURES from to/span description
site 1 29 psiJ49 coding sequence /pseudo [1]
trns 30 > 59 Mu dI-R [1]
BASE COUNT 15 a 13 c 18 g 13 t
ORIGIN
1 ccgattgcgg ctgtttatgg gtatctatct gaagcggcgc acgaaaaacg cgaaagcgt
//
LOCUS SYNFUSAAA 240 bp ds-DNA SYN 02-AUG-1990
DEFINITION Bacteriophage Mu fusion gene of the trp operon.
ACCESSION M33723
KEYWORDS .
SOURCE Bacteriophage Mu and E.coli lacZ DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 240)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Allet,B.
TITLE Mu insertion duplicates a five base pair sequence at the host
inserted site
JOURNAL Cell 16, 123-129 (1979)
STANDARD simple staff_entry
REFERENCE 3 (sites)
AUTHORS Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E.,
Horowitz,H., van Cleemput,M. and Wu,A.M.
TITLE The complete nucleotide sequence of the tryptophan operon of
Escherichia coli
JOURNAL Nucleic Acids Res. 9, 6647-6668 (1981)
STANDARD full staff_entry
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
signal 1 104 stem_loop, Mu dI-R [1]
site 105 240 trp operon (partial) /note="trp operon
sequence present in Mu dI is derived from the
trpA-lacZ W209 protein fusion and contains
last base of trpC, all of trpB and 187 bases
of trpA" [1],[3]
site 1 48 MuR derived sequence [1],[2]
site 104 43 (c) MuR derived sequence [1],[2]
BASE COUNT 67 a 54 c 57 g 62 t
ORIGIN
1 tgaagcggcg cacgaaaaac gcgaaagcgt ttcacgataa atgcgaaacg taaagttttc
61 gcatttatcg tgaaacgctt tcgcgttttt cgtgcgccgc ttcattaagg aaaggaacaa
121 tgacaacatt acttaacccc tattttggtg agtttggcgg catgtacgtg ccacaaatcc
181 tgatgcctgc tctgcgccag ctggaagaag cttttgtcag tgcgcaaaaa gatcctgaat
//
LOCUS SYNFUSAAB 53 bp ds-DNA SYN 02-AUG-1990
DEFINITION Bacteriophage Mu fusion gene of the trp operon.
ACCESSION M33731
KEYWORDS .
SOURCE Bacteriophage Mu and E.coli lacZ DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 53)
AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L.
TITLE Identification of phosphate-starvation-inducible genes in
Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
transcriptional fusions
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E.,
Horowitz,H., van Cleemput,M. and Wu,A.M.
TITLE The complete nucleotide sequence of the tryptophan operon of
Escherichia coli
JOURNAL Nucleic Acids Res. 9, 6647-6668 (1981)
STANDARD full staff_entry
REFERENCE 3 (sites)
AUTHORS Kalnins,A., Otto,K., Ruether,U. and Mueller-Hill,B.
TITLE Sequence of the lacZ gene of Escherichia coli
JOURNAL EMBO J. 2, 593-597 (1983)
STANDARD full staff_review
COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
13-APR-1990.
FEATURES from to/span description
site 1 9 lacZ coding sequence /note="lacZ sequence
begins with third codon of LacZ protein"
[1],[3]
site 53 10 (c) trpA coding sequence
BASE COUNT 13 a 14 c 17 g 9 t
ORIGIN
1 atccgtaatc gccagtgggt cggagaaggg gatacctaac tccagcgcgt cag
//
LOCUS ATH60CPNA 735 bp ss-mRNA PLN 02-AUG-1990
DEFINITION A.thaliana plastid 60-kDa chaperonin-60 alpha polypeptide
(cpn-60 alpha) mRNA, partial cds.
ACCESSION M35597
KEYWORDS chaperonin-60.
SOURCE A.thaliana (cv. Columbia), cDNA to mRNA, clone pTZ19R.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 735)
AUTHORS Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M.
TITLE Unique composition of plastid chaperonin-60: Alpha and beta
polypeptide-encoding genes are highly divergent
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.M.Hemmingsen, 26-JUN-1990.
FEATURES from to/span description
pept < 1 > 735 60-kDa chaperonin-60 alpha-polypeptide (AA at
1)
BASE COUNT 187 a 139 c 196 g 213 t
ORIGIN
1 aagcttgctg attgtgttgg tctcactctt ggccctagag ggaggaatgt tgtgttggat
61 gaatttggaa gtcctaaggt tgtgaatgat ggagtcacca ttgctagagc tattgagtta
121 cctaatgcca tggaaaatgc tggtgcagcg cttattcgtg aggttgcgag taagactaat
181 gactctgctg gtgatgggac aaccactgcg tctatccttg ctcgggagat aatcaaacat
241 ggattgttga gcgtcacttc tggtgcgaat cccgtttcac tcaagagggg aattgataag
301 actgttcaag gtctgatcga agagcttcag aagaaagcta ggcccgtgaa aggtcgtgta
361 gactacagag ctgtggcttc tatctctgct ggaaatgatg accttatagg gtcaatgatt
421 gctgatgcca tcgacaaagt tggacctgat ggtgttttgt ccattgaatc ttcatcttct
481 tttgagacca cggtcgaagt tgaagaaggg atggagattg acagaggtta catctcgcct
541 cagtttgtta caaaccctga gaaactacta gctgagtttg agaatgctag ggtattgatc
601 actgatcaga agatcactgc gatcaaagac atcatcccta ttttggagaa gaccactcag
661 cttcgtgctc cgttgttgat tattgcagag gatgttactg gtgaagcctt agctaccctt
721 gtcgtgaaca agctt
//
LOCUS ATH60CPNB 559 bp ss-mRNA PLN 02-AUG-1990
DEFINITION A.thaliana plastid 60-kDa chaperonin-60 beta-polypeptide
(cpn-60 beta) mRNA, partial cds.
ACCESSION M35598
KEYWORDS chaperonin-60.
SOURCE A.thaliana (cv. Columbia), cDNA to mRNA, clone ptz19R.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 559)
AUTHORS Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M.
TITLE Unique composition of plastid chaperonin-60: Alpha and beta
polypeptide-encoding genes are highly divergent
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.M.Hemmingsen, 26-JUN-1990.
FEATURES from to/span description
pept < 1 > 559 60-kDa chaperonin-60 beta-polypeptide (AA at
1)
BASE COUNT 166 a 96 c 155 g 142 t
ORIGIN
1 aagcttgtga ggcaagcggc tgccaaaacc aatgacttgg ctggtgacgg tacaacaaca
61 tctgttgttc ttgcacaagg ttttatcgct gagggtgtca aggtggtagc tgctggtgca
121 aaccctgttt tgatcaccag aggtattgag aagacagcaa aggctttggt taccgagctc
181 aagaaaatgt ctaaggaggt tgaagacagt gaacttgcag atgttgcagc tgttagtgcg
241 ggtaacaacg acgaaattgg aaatatgatt gctgaagcaa tgagcaaagt gggcaggaaa
301 ggtgtggtga cccttgagga gggtaaaagt gccgagaaca accttcacgt cgtagaagga
361 atgcaatttg accgtggtta tatctctcct tactttgtga cagacagcga gaaaatgtcc
421 gtcgagttcg acaattgcaa gttacttctc gttgacaaga aaattaccaa tgcaagggat
481 cttgttggtg ttttggagga tgcaattaga ggcggatacc caatattaat tattgctgaa
541 gacattgaac aagaagctt
//
LOCUS BNA60CPNA 1783 bp ss-mRNA PLN 02-AUG-1990
DEFINITION B.napus plastid 60-kDa chaperonin-60 alpha-polypeptide
(cpn-60 alpha) mRNA, partial cds.
ACCESSION M35599
KEYWORDS chaperonin-60 alpha-polypeptide.
SOURCE B.napus (cv. Westar) plastid chaperonin-60, cDNA to mRNA.
ORGANISM Brassica napus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 1783)
AUTHORS Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M.
TITLE Unique composition of plastid chaperonin-60: Alpha and beta
polypeptide-encoding genes are highly divergent
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.M.Hemmingsen, 26-JUN-1990.
FEATURES from to/span description
pept < 1 1638 60-kDa chaperonin-60 alpha-polypeptide
precursor (AA at 1)
sigp < 1 21 60-kDa chaperonin-60 alpha-polypeptide signal
peptide
matp < 1 1635 60-kDa chaperonin-60 alpha-polypeptide
BASE COUNT 460 a 395 c 486 g 442 t
ORIGIN
1 cgccgtttca gcgttagagc taatgtaaaa gagatctctt tcgaccagag ctcaagagct
61 gctcttcaag ctggtattga caagcttgct gatgccgttg gtcttaccct tggccctaga
121 ggaagaaatg ttgtgttgga tgagtttgga agtcccaagg tcgtgaatga tggagtcacc
181 attgctaggg ccattgagtt acctgacgct atggagaatg ctggtgcagc acttatccgt
241 gaggttgcta gtaagactaa tgactcagct ggtgacggca caaccactgc ctccgtcctt
301 gctcgggaga taatcaaaca cggcttacta agcgtcactt ccggtgccaa tccagtctca
361 ctcaagagag gaatcgataa gaccgttcaa gctttgatcg aagagctcga gaagagggct
421 agacctgtca aaggcggtag tgacatcaaa gccgtggcta caatctctgc tggaaatgat
481 gagcttgtgg gaacaatgat tgctgatgcc attgataaag ttggacctga tggtgttttg
541 tccattgaat cttcttcctc ctttgagact accgtcgaag ttgaagaagg aatggagatt
601 gatagaggtt acatctcacc acagtttgtt acaaaccctg agaagctcct agtcgagttc
661 gagaacgcgc gcgtgctgat caccgatcag aagatcactg cgatcaaaga catcatcccg
721 atcctggaga aaaccactca gctccgagct ccgttgctca tcatcgcgga ggatgtcacc
781 ggcgaggccc tggcgaccct cgtcgtgaac aaactccgcg gcgtgctcaa cgtggtcgcc
841 gtcaaggctc cagggtttgg tgaaagaaga aaagctatgc ttcaggacat tgccatcttg
901 acggggccga gtacgaccct cgacatgggc cttctggtgg agaacacgac catagatcag
961 ttggggatcg ctaggaaagt cactattagc aaagactcga ccacgcttat agctgatgcg
1021 gcttccaagg agaccctgca agctcgtatc tcccagctta agaaggagct gtcagagact
1081 gactccgtgt acgactcgga gaagctcgcc gagagaatgc ccaagctgcc aggcggtgtt
1141 gccgttatta aagtcggagc ggcgactgag actgagctcg aggaccgtaa gcttcgtatc
1201 gaggacgcga agaacgctac atttgctgca atcgaagaag ggatagttcc tggcggcggt
1261 gccacgctgg tgcatctatc aactgtgatt cctgctatta aggagaagct tgaggatgct
1321 gatgagcgtt tgggagccga catagtacag aaggctttgg tggcaaggtc gcttattgca
1381 cagaacgctg gaataggagg agaagttgtt gtggagaaga ttatgttcag cgaatgggag
1441 atagggtaca acgccatgac tgatacatat gagaatctgt tggaagctgg agtgattgat
1501 ccagctaaag tgacgagatg tgcgcttcag aacgctgctt cggttgcggg gatggtgctg
1561 accactcagg ccattgttgt cgacaaaccg aaacctaagg ctcctactgc tgctcctcct
1621 cagggtctca tggtgtaatg aaatcactta tcatatggct atatttgttg tccccaatta
1681 tatctgcaat atttttctct ggggaaatga gtagtttgtt taacggtaat gagaatatca
1741 tctttttgat ttatgatttg caaaataaaa gtttatattc aat
//
LOCUS BNA60CPNB 1982 bp ss-mRNA PLN 02-AUG-1990
DEFINITION B.napus plastid 60-kDa chaperonin-60 beta-polypeptide
(cpn-60 beta) mRNA, partial cds.
ACCESSION M35600
KEYWORDS chaperonin-60 beta-polypeptide.
SOURCE B.napus (cv. Westar) plastid chaperonin-60, cDNA to mRNA.
ORGANISM Brassica napus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 1982)
AUTHORS Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M.
TITLE Unique composition of plastid chaperonin-60: Alpha and beta
polypeptide-encoding genes are highly divergent
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.M.Hemmingsen, 26-JUN-1990.
FEATURES from to/span description
pept 64 1830 60-kDa beta-polypeptide of plastid
chaperonin-60 precursor
sigp 64 228 60-kDa beta-polypeptide of plastid
chaperonin-60 signal peptide
matp 229 1827 60-kDa beta-polypeptide of plastid
chaperonin-60
BASE COUNT 564 a 372 c 547 g 499 t
ORIGIN
1 cctgctcccc agtttcttat ccgtggtgct tctccgcctc ttcgcttcac ttctcttgaa
61 aagatggcat caaccttcac tgcaacgtct tccattggct ccatggttgc tccaaatgcc
121 cacaaaacgg ataagaagct tatgaacaag ttgtcttcaa gctctttcgg gaggaggcag
181 aacgtgttcc ccaagctcag aagatccagt cctgcagttg tatgtgcagc caaggagttg
241 catttcaaca aagacgggac taccatcagg aagcttcaag ctggtgtcaa caagcttgca
301 gacctagttg gtgtgacact tggacctaaa gggcgaaacg ttgttcttga gagcaagtat
361 ggatcaccaa gaattgtcaa tgatggtgtg actgttgcaa gggaggttga attggaagac
421 cctgttgaga acattggtgc caaacttgtt aggcaagcag ctgccaagac caatgatctg
481 gccggtgatg gtaccacaac atctgtggtt cttgcacaag gttttattgc tgagggtgtc
541 aaggtggtgg ctgctggtgc aaaccctgta ttgatcacta gaggcattga gaagacagca
601 aaggctttgg tagccgagct caagaaaatg tctaaggagg ttgaagacag tgagcttgca
661 gatgtggcag ccgttagtgc cggtaacaat gcagaaattg gaagcatgat tgctgaagca
721 atgagcagag tgggcaggaa gggtgtggtg acacttgagg agggtaaaag tgcagagaac
781 gctctctacg tggtggaagg aatgcaattt gatcgaggtt atgtctcccc ttactttgtg
841 acagacagcg agaaaatgtc agttgagttc gacaattgca agttgcttct tgttgacaag
901 aaaattacca atgcaaggga tcttgttggt gttctggagg atgcaattag aggaggatac
961 ccaattttaa taattgcgga agacattgag caggaggctt tagcgaccct tgttgttaac
1021 aagcttagag gcacactgaa gattgcagct ctcaaagctc caggatttgg agagcgcaag
1081 agccaatacc ttgatgatat tgccatcctc actggagcga ctgtgattcg tgaggaagtt
1141 ggtctttcac ttgacaaagc tggaaaagag gttcttggac atgcagcaaa agttgtcctc
1201 actaaggaga cttcgaccat tgtgggtgat gggagcacac aggacgcagt gcaaaagcgt
1261 gttacacaaa ttaagaacct tattgagcaa gcagagcaag actatgagaa ggaaaaactg
1321 aatgagagaa ttgcaaagct ctctggtgga gttgctgtga ttcaggtcgg agcacaaact
1381 gagacagaac tcaaagagaa gaaactgaga gttgaagatg ctcttaatgc tacaaaggct
1441 gctgttgagg aaggaattgt cgttggtggt ggttgtactc tgcttcgcct tgcttccaag
1501 gtggatgcca ttaaagccgc ccttgacaat gatgaagaaa aggttggagc cgacatcgtg
1561 aaaagagcac tgagttaccc tctgaaactg attgccaaga atgcaggagt caatggaagc
1621 gtagttagcg agaaggtgct ttctaacgat aatgtgaagt tcggttacaa tgctgcaacc
1681 ggcaagtacg aggatctaat ggctgcagga atcatcgatc caactaaggt tgtgagatgt
1741 tgcttggaac atgcagcttc ggttgcaaag acattcttga tgtctgactg tgtggttgtt
1801 gagatcaagg agcctgagcc agttccgtag gcaaccaatg gacaactcag gttatggata
1861 ctgaggagat gggggaatga agtaactctg tcggaaattg attcctttta gcttttgtaa
1921 tagatgagca agttgtaatg atttcgaaca aaagaatgtg ctgtgtactg aaattttgaa
1981 cc
//
LOCUS HS4RAJI 11835 bp ds-DNA VRL 02-AUG-1990
DEFINITION Epstein-Barr virus B95-8 deletion region.
ACCESSION M35547
KEYWORDS .
SOURCE Epstein-Barr virus (strain Raji) DNA.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 11835)
AUTHORS Parker,B.D., Bankier,A., Satchwell,S., Barrell,B. and Farrell,P.J.
TITLE Sequence and transcription of Raji Epstein-Barr virus DNA spanning
the B95-8 deletion region
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.Farrell, 25-JUN-1990.
FEATURES from to/span description
pept 3625 851 (c) LF3 gene product
pept 10605 9316 (c) LF2 gene product
pept > 11835 10566 (c) LF1 gene product (AA at 11834)
orgrpl 3554 4609 origin of replication
BASE COUNT 1948 a 3315 c 3869 g 2703 t
ORIGIN
1 ggccgctgtt cacctaaagt gacgcaaggt ctgtcagccg ccagggtccg tttaccaggc
61 tttcaggtgt ggaatttaga tagagtgggt gtgtgctctt gtttaattac accaagatca
121 ccaccctcta tccatatccc acaattgata aacctccgca tgtccaacca ccacgttgaa
181 caggatgtgg caccctaaga ggacgcaggc atacaaggtt attacccagt ccttgtatgc
241 ctggtgtccc cttagtggga cgcaggccta ggtagcatca tttacactaa aagcagtgac
301 cttgttggta ctttaaggtt ggtccaatcc ataggctttt tttgtgaaaa cccggggatc
361 ggactagcct tagagtaact caaggccaag catttcacac ctgcaaatgc accatgtaac
421 cacagatcta aactgaaagt tgcagcttta gatggcaagg aaacttgggt ttcaggcata
481 gaaagcctgg ctcactatag cagcccatgt ttgttccagg gtgggggaaa ggcacgtgcc
541 cttagaaaac ttagctgcaa aaattctatt gtgttgggag agcctctata tctaaaggcc
601 tttcctcaca atacaaatgt tactaacgtc tgccctctgg agacctgcta tgtggctaga
661 cgtatggcct acccaagacg ttgggggtct cgggtaggcc atgattcttc caggcatagg
721 ttacaaccag tcactgctat caagcctact cagttcccaa cgcagcacat accccccgcc
781 tctcctgcca tgaggactta tggcagtgtt tactgttctg cttttactct tggaccaggc
841 tgtcattcta tcagaataac aggggaagca aggccccctg cttcagcggg acacgtgttt
901 ctagaatctc ggagccaata actacctgcc cctctaatct gtatgctgca tgaaaaacca
961 catacacgtg atgtaagttt agccagttta ttgttacacc aatgccccga aagtctcccc
1021 ctgtcccttt gggtctcagg acccagccct ggagctcggg ggcggccggg tggcccaccg
1081 ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg
1141 gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg
1201 ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc
1261 ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg
1321 tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc
1381 ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg
1441 tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt
1501 ccgctgcccc gctccggcgg ggggtggccg gctgcagccg ggtccggggt tccggccctg
1561 gagctcgggg ggcggccggg tggcccaccg ggtccgctgg gtccgctgcc ccgctccggc
1621 ggggggtggc cggctgcagc cgggtccggg gttccggccc tggagctcgg ggggcggccg
1681 ggtggcccac cgggtccgct gggtccgctg ccccgctccg gcggggggtg gccggctgca
1741 gccgggtccg gggttccggc cctggagctc ggggggcggc cgggtggccc accgggtccg
1801 ctgggtccgc tgccccgctc cggcgggggg tggccggctg cagccgggtc cggggttccg
1861 gccctggagc tcggggggcg gccgggtggc ccaccgggtc cgctgggtcc gctgccccgc
1921 tccggcgggg ggtggccggc tgcagccggg tccggggttc cggccctgga gctcgggggg
1981 cggccgggtg gcccaccggg tccgctgggt ccgctgcccc gctccggcgg ggggtggccg
2041 gctgcagccg ggtccggggt tccggccctg gagctcgggg ggcggccggg tggcccaccg
2101 ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg
2161 gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg
2221 ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc
2281 ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg
2341 tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc
2401 ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg
2461 tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt
2521 ccgctgcccc gctccggcgg ggggtggccg gctgcagccg ggtccggggt tccggccctg
2581 gagctcgggg ggcggccggg tggcccaccg ggtccgctgg gtccgctgcc ccgctccggc
2641 ggggggtggc cggctgcagc cgggtccggg gttccggccc tggagctcgg ggggcggccg
2701 ggtggcccac cgggtccgct gggtccgctg ccccgctccg gcggggggtg gccggctgca
2761 gccgggtccg gggttccggc cctggagctc ggggggcggc cgggtggccc accgggtccg
2821 ctgggtccgc tgccccgctc cggcgggggg tggccggctg cagccgggtc cggggttccg
2881 gccctggagc tcggggggcg gccgggtggc ccaccgggtc cgctgggtcc gctgccccgc
2941 tccggcgggg ggtggccggc tgcagccggg tccggggttc cggccctgga gctcgggggg
3001 cggccgggtg gcccaccggg tccgctgggt ccgctgcccc gctccggcgg ggggtggccg
3061 gctgcagccg ggtccggggt tccggccctg gagctcgggg ggcggccggg tggcccaccg
3121 ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg
3181 gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg
3241 ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc
3301 ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg
3361 tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc
3421 ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg
3481 tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt
3541 ccgctgcccc gctccggcgg ggatgggggt gcgctcccag gccggaccct ggtgccaggc
3601 agggaccccg cgccacccgc ttcatggggg gggaggccgc cgcaaggacg ccgggccggc
3661 tgggaggtgt gcaccccccg agcgtctgga cgacgctggc gagccgggcc agctcgcctt
3721 cttttatcct ctttttgggg tctctgtgca ataccttaag gtttgctcag gagtgggggg
3781 cttctcattg gttaattcag gtgtgtgatt ttagcccgtt gggttacatt aaggtgtgta
3841 accaggtggg tggtacctgg aggtcattct attgggataa cgagaggagg aggggctaga
3901 ggcccgcgag atttggggta ggcggagcct caggagggtc ccctccatag ggttgaacca
3961 ggagggggag gatcgggctc cgccccgata tacctagtgg gtggagccta gaggtaggta
4021 tccatagggt tccattatcc tggaggtatc ctaagctccg cccctatata ccaggtgggt
4081 ggagctaggt aggattcagc taggttccta ctggggtacc cccctaccct accttaaggt
4141 gcgccaccct tcctccttcc gttttaatgg tagaataacc tataggttat taacctagtg
4201 gtggaatagg gtattgcagc tgggtatata cctataggta tatagaacct agaggaaggg
4261 aaccctatag tgtaatccct ccccccccta cccccccctc ccttacggtt gcctgagccc
4321 atcccccacc ccagcacccc ggggtgacgt ggcaccccgc gtgccttact gacttgtcac
4381 ctttgcacat ttggtcagct gaccgatgct cgccacttcc tgggtcatga cctggcctgt
4441 gccttgtccc atggacaatg tccctccagc gtggtggctg cctttgggat gcatcacttt
4501 gagccactaa gcccccgttg ctcgccttgc ctgcctcacc atgacacact aagcccctgc
4561 taatccatga gccccgcctt taggaagcac cacgtcccgg ggacggaagc tggattttgg
4621 ccagtcttca attttgggga gtggttttgt gtgagccgga agttggcaat ggggtgaggg
4681 tggcgctggt taagctgacg acctcccaag gtctctcacc ctgggtacac aggtggggcg
4741 gcagcctcta actttggctg tggcctctat ttcctccctt tcctagccag ggccatgtgt
4801 tcctgcatgt ctacttgcct cctgtggtgg cagagcttgg ccctgggccc aacccccgcc
4861 ttgggagcct gtaggggcca acacccttgg tttgtttgtg ttcctgtttg ctggcaactt
4921 actggcagcc gagcagattc taatgggcgc ccgccttctt tctctcttgt tttattaata
4981 gaatctcagc caggacctat acctgagact tcaaagtctg gtcctgggtt ctgagacccc
5041 caagatttgt catgcacacc tgcacacctg ttggtattgg gtttctattc ttgagtgtga
5101 aagtttgtaa aaaaattcat aaaatgtcac taattcctct tacctgttta gggtattgtg
5161 caattcttca gcctgcctat tttcaatttg cctaaggtgg caatttaaga tgtggttaat
5221 taaccatttt cctgtctgac accactgcat gggcaaccgg gttccatggc acatttagag
5281 ataaacatag atgtcttgtc ttgctcatgt gcagaggagg gggtgttggt gtgcaatata
5341 gtttctggat tccaaattga gttgggggtg ctattttcac tatggaatta aattactgac
5401 attagacagt ggacaccggg ctatatgtgg ggatgtctgt ggcttgtcat ttcctcttag
5461 aaggtaatcc cccatcttaa cttcccttta aattgtgatg caagccctgg gttatttata
5521 gaatgattat ctaggtttga tagtctgaag gctgggcaga gaatgtttgt aatttttatt
5581 caccttcttt accccccacg agtatccagt tctagaagat ctcctgatat cccgggctgc
5641 cattattccc ttgagtgtta tagcttcctc ttaacttaag caagagctcc aggatgttag
5701 cttttttggt ggggctggtt gtcaggaaga ggttccagtg ttgtccttta tttttagatg
5761 ttagctttgt gttaggttag tatgggctgg gtattcacta gtgaaggcaa ctaacacagt
5821 tagacgtgct agttgtgccc actggtgttt atccggtccc aaatgtcacc acagaacaca
5881 gggggctgga tttggcagca gcacttgtgc ttttgttgat ttttacccgt gtatcagagt
5941 gggggatgct agccaattta gcttcccctc cccttaacag ggggtctcgc ggggtgccaa
6001 ttgtcgcctg ccttcccccg cttccccttg ttaacttata gcatgatagg taggtcacct
6061 aacgtggaag cctggtgggt gatccttcct cggtagggag cgcttagggc tgttgagctc
6121 aacagcccca cctgggtaaa atgtatgttc taaagagtta cccaattata acaaaactgt
6181 tgtagggtaa cgaagacctg atggaagtgg tattgttgcc gttgaaagac gggtgtcctg
6241 gctcaagttc gcacttccta tacagtgtta aagccttgta tcggaagttt gggcttcgtc
6301 ccagtgtact cgataatgtc gactgctgcg aaaggtttgg accgtcttcc agtaggtgtt
6361 gggggtccca aatcacgagg ttaggcaggt gcacttggct ctttaggagg gacccttaag
6421 ccagacaatg tagtgcccct tttttttgca aattggcctt attattaatt tcttgttaac
6481 actaattctg ttctatgacc ctgtgttttt cagatgccgt tgaacgtgtc actgagctga
6541 atttggacgc agctacttga cctttgcccc cgtgcctcca gcgctgataa gtgctgcgtc
6601 cactttgtgt tacaggtggg ccaaacctcc agaatatcaa ttggtggggc cttggtgggc
6661 tgcataaggc agtaggtttg aggtgaccta cttggaccat gtggatccag tgtcctgatc
6721 ctggaccttg actatgaaac aattctaaaa aaatgcatca tagtccagtg tccagggaca
6781 gtgcactcgg aagtctcatc atctccgttt gtgtgtttag tgtggccagt acggccaccc
6841 ctgtgccacg ccctggcatg ctgctgacat ctggccgcca atttcagcgg gcccttttcc
6901 cccttgttca ccccatagca agaagggtag gttacatggg tattttccca tcagcacctg
6961 actggccggt gcaattagag gagagggcaa caacgcaagg ctgttgtttt atttgggtta
7021 caagagctgc ggcggtcgat gggttcactg attacggttt cctagattgt acagatgaac
7081 tagaactgtc acaatctatg gggtcgtaga cagtgtgctt accagacttc catggaagat
7141 gtgaatttgc tgctagctat atgggtggtg ctatgggctc cctagggact catgtagtgg
7201 ggctttgtga tagctaatga atgtggcagc tgttgtttgt actggaccct gaattggaaa
7261 cagtaacttg gattctgtaa cacttcatgg gtcccgtagt gacaactatg ctgaatatct
7321 tgaatatggg aggagggggg ctttgggttc cattgtgtgc cctttcctgg ccaacgtgag
7381 ggtcctagtg ttatagggcg tggcagtttt cttgagggct aataacccgg gtgaggcggt
7441 tgtcacaggt gctagaccct ggagttgaac cagtaccact cggttacaaa gtcatggtct
7501 agtagttgtg accctgcaaa gctacgtggg gatgagcagc cagggacttt ggttggcaag
7561 cagacaggcg gcgcattgga accccagagg agtgtcccgg ggccacctct ttggttctgt
7621 acatattttg ttattgtaca taaccatgga gttggctgtg gtgcactcca tctggtaagg
7681 gggctggtgc ggacgcctgt gtttagtcta tgccaatgtt tacctgcctt gggttactat
7741 tccaaacgac cacacctttg aggacacctg gagccctgat cattctcggc ttttactgcc
7801 acctggcttc tgttgggtca gacagtttgg tgcgctagtt gtgtgcttag cagcaacgca
7861 caccaggctg actgccttag cagtgtggcc ctttattgtg gcatcctaag gagggattct
7921 ggagtgcctt tcgcgtgaag catgccctga gacgtactcg agttaggact taatcgctcc
7981 tgtgccgctg gatgagggag cgccaatttg tacatcctag ctctggccat agagttagcc
8041 cacccttgtg tctccctttg gcctttgcgg tgccaatttc cggtggtttc ccttttccgc
8101 ccgtttatcc aatagcatgt aagagaggtt gcctagattt ggcaactttg agggaacgtt
8161 ccgtgtagct ggtgacctaa cacccgccca tcaccaccgg acagattctg aacttgtcct
8221 gtggtgtttg gtgtggtttt ggggtacgca ggagtacgtt ggaatgcttt ggagccgaga
8281 gggatgggcc cgcttgtgcg cttatgtgtt acacggtgcc aataaccggc ccggtgcggc
8341 tgccccgtga cccgtgggcc ttaccttcct ggccatcggg ggaccctggt gctagggtcc
8401 cttgtgttgc tttctgccat aggggggaaa gcatcgcctt cagaattggc tgctccgttg
8461 gaacatttga ggcctactgt atccgtgtcc tgacaacatt ccccgcaaac atgacatggg
8521 ttaatttaaa catgttttgt ttgcttggga atgctcttag ggcctggaag cttgtcattg
8581 gattcatcgt ttcctgaact acaggcgtag ggcctattgt agcaggcatg tcttcattcc
8641 tgcgtaccga atggcatgaa ggcacagcct gttaccattg gcaccttttt tccatgtaaa
8701 cctccgtgat cctgggtcct ttggagactc aagtgtgaat ttgttttggt gttcggcgcc
8761 agggtatctc gacgttggaa tgtcaactca acttgggcac ctcgataacc ggctcgtggc
8821 tcgtacagac gattgtttgg ctctgtaact tgccagggac ggctgacgat gtgtttagtc
8881 tgccacttgc atccggcgct ttggttactc gggagactaa tggggggtgt ggtatggcac
8941 aggctggggg tgagtctggg gatgtccctg ggcgttgctg cagcccattc gccctctggg
9001 gatgagatgt tcaggggtgg ccggtaccct acgctgccga tttacataat ataaattgta
9061 aatgctgcag tagtagggat ctggacgcgc gacctgctac tcttcggaaa cgccaaccca
9121 ggagcgtcgc ctctggcccc atactcccgc catgcgactg ctcgccccct cccaggcctc
9181 cctggtgagc ccttgccgct ccccgcattc ctgctttcgg cgcccctgcg gatcccgatg
9241 acagcaggcc tttccttccc ccgttaatga aaagaatgac agtgaggttg tgacagaagg
9301 acagctttat tcagtttaca gagtgccctc ggaggctacg atattcccgt taaatgtctt
9361 gttgattctc tcaaaggtgg ggagggagga gctctccaca acaatgttcc ctggcagcgt
9421 gagcgcgcag ccctgccgtt ggatgtatct tctcatgatg gtgctgatag aggggtctcc
9481 ggcgtagatg aaaaaggcct gggccatgct ctggccggtc acgatcgtta tggggttgtt
9541 ggaaatgttc cggaccgtca gcttgagggt ctggcccggc ttccactcct gtgggtagac
9601 gtagaagacc gggttggagg agtgggacac gacaacggcc gtaatcttgg agctcagggg
9661 ggcctcgtag gtgttgttgt attccagctc cgtgatgaaa ttaggaggaa taatcacagg
9721 ggagccaaag tagcggatgt ctgtggattc cccgtcccag cgccagtggc tcttagggta
9781 ggggttgtaa cggaaggcaa taatcacatc atccaatagg gtcatgccca ccttgacgtt
9841 cagcgggccc tctcgtttca ggtccggcgt gtccacggag actcggacgt agcccttacc
9901 gcggcgtatg gcgtttaccg gacacacctt ccccgggaat gtgtgaatac gggcgtatga
9961 ctttagaaat gggggcgtgt gctgcgccag caggtaaggc aggcactcgt cctggctggt
10021 gacgggagag ccactgagga agatctgggg ctcgctggtg tttagcttgt ccccgctctg
10081 ggtgcaggag cgtgtcagct gaatgtcgct ctgcccgggc agaatctgca ggtagaggta
10141 ggggttcttg accaatctga tgggcacaat gtaccaggta aacttccctt tctctatgaa
10201 caggctgcgc ggattcagga cgcttagcac gatgtcctgg tcagagtgca taacgaagaa
10261 gggcttgagg aatacctcgt tgtcttccgc tccaaagaac aaaaacgcga ccgtaaagta
10321 gcggctgccg taggtggtcg tgttgaagga gaaagaaggt aacttgaagc tgagtatctg
10381 gcccaccgag gggcagggag gcagctcttg gcactgcgcg tccagctgca atacctgctt
10441 gttggtgacg cggacgtatg aggggaagat ctcgtacttc cacacgcctc tcatgaacga
10501 cgtgtctggt ttttcagtgg gccgcaggcg gcggaggctg ttcctgaacg acgagcgccg
10561 ggacgctagt gctgcatggg ctcctccggg gtaagcttcg gccatggccg gagctcgtcg
10621 acgggcaagg tgagagtcgg ggggcgggcg acggtgcggc cccaatacaa ctctccgctc
10681 gttagctggt agaatatccg cccggcgtct aggttgtcac ttcgctcggc cggccagaag
10741 agcgcaagtc caagtctggt gctggggccg atgtgcagcg gtttgtgccc gcagttgtag
10801 actgtcattt ttatgggcga gtgggcggtc cacacgcgcg ggcgcagcac ccattggtcg
10861 cacgccgcct cctggaatgt aaacccccag agagagggcg tgccgccctg gagatggccc
10921 tgtgccatca catgtatttc ctccttgggt ggaacaacgg cgtcgtgctc cgggtggagg
10981 gggaatagcg tccaggcatc tttcagggtc acgagaccgg ggtccatgct cagagaacag
11041 ccctcccggg cggtgggcgg cccgggctcc agcagaacgt cgcagaccca gccctcctcg
11101 gccctgtcca cctgtatgtc caggtgcacg gacccggagg ctgcgtctcg tgacatggcc
11161 aggcctggtg ccagccgacc acgtcccgtg tcccagccga ggccgcgcca gagcagagcc
11221 cgggactgac tcagggccac atcccctcgg cccgcggacg ccgcctcgcc agcccccggg
11281 ccttcatggg cccgctttct acctctctcc ggcaccccag cctggtcagc cgcagaggaa
11341 gcatgacctt ggggtgggac ggggcaggcg tgatcctggg cgcaatcttt gccgatcccc
11401 acaccttcac tccttgttag gttgatagaa tgtcggtacc acgccacggg gggcgggccc
11461 gcatagggaa aagccaggga gagcgatgtg ggcgaggatg ggctcaggcg gccccagaca
11521 cgcaatttgc ccccctgggc ggccgcagcc tgcccctcgg cggcccgtgc cccagctccg
11581 tcacgggggg cgcataggag gggtatatct aggatagccg cacctacaca aatgagacac
11641 agacacaggt cgtgaggatt taggcaacgc aggcttgtct ttatagttac aaacatggga
11701 gcgtgcacct ggaagatgca gctggggtag atctttacat ctttacaggg cgcagcggcc
11761 gccagacact gaagggcaga gttcacggcg ggcacctccc agagggagcc caccagcccg
11821 tacctggcca cggcc
//
LOCUS HUMG25KA 1014 bp ss-mRNA PRI 02-AUG-1990
DEFINITION Human GTP-binding protein (G25K) mRNA, complete cds.
ACCESSION M35543
KEYWORDS G25K gene; GTP-binding protein G25K.
SOURCE Human fetal brain, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1014)
AUTHORS Munemitsu,S., Innis,M.A., Clark,R., McCormick,F., Ullrich,A. and
Polakis,P.
TITLE The molecular cloning and expression of G25K, the human homolog of
the yeast cell cycle gene CDC42
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Munemitsu, 25-JUN-1990.
Author address: S.Munemitsu
Cetus Corporation
1400 53rd Street
Emeryville, CA 94608
FEATURES from to/span description
pept 4 579 GTP-binding protein G25K
BASE COUNT 286 a 204 c 210 g 314 t
ORIGIN
1 gcaatgcaga caattaagtg tgttgttgtg ggcgatggtg ctgttggtaa aacatgtctc
61 ctgatatcct acacaacaaa caaatttcca tcggaatatg taccgactgt ttttgacaac
121 tatgcagtca cagttatgat tggtggagaa ccatatactc ttggactttt tgatactgca
181 gggcaagagg attatgacag attacgaccg ctgagttatc cacaaacaga tgtatttcta
241 gtctgttttt cagtggtctc tccatcttca tttgaaaacg tgaaagaaaa gtgggtgcct
301 gagataactc accactgtcc aaagactcct ttcttgcttg ttgggactca aattgatctc
361 agagatgacc cctctactat tgagaaactt gccaagaaca aacagaagcc tatcactcca
421 gagactgctg aaaagctggc ccgtgacctg aaggctgtca agtatgtgga gtgttctgca
481 cttacacaga gaggtctgaa gaatgtgttt gatgaggcta tcctagctgc cctcgagcct
541 ccggaaactc aacccaaaag gaagtgctgt atattctaaa ctgttttctc cttcccttct
601 ttgctgctgc ttcctgtccc actactgtag aaagatcgtt taaaaacaaa ggaataaaac
661 catcctgttt gaaagcctct gcgtcttttt actcaccacc ttagagcaac ctctgtatta
721 gtttttgatc aagaattgca atatcatata aattttttgt gatcagtagt caagttggac
781 ttgttttaac gttctgctgc ttgagttgcc tgatgctcag agctttttgg tttggattac
841 tattgcaaag ggaacttggt ctggcttaga tgtcctcttg gagaaaataa caagagtttt
901 aacacttcta gatcttagtt cagatggaga aagtaacaca aacatcattt tactcttatg
961 atcaattgtt aattgtaatt gcatgacaaa ccttatggaa aaggggtgac ctgg
//
LOCUS HUMMUCAB 1804 bp ss-mRNA PRI 02-AUG-1990
DEFINITION Human polymorphic epithelial mucin (PEM) mRNA, complete cds.
ACCESSION J05581
KEYWORDS polymorphic epithelial mucin.
SOURCE Human adult mammary adenocarcinoma cell line BT20, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1804)
AUTHORS Gendler,S.J., Lancaster,C.A., Taylor-Papadimitriou,J., Duhig,T.,
Peat,N., Burchell,J., Pemberton,L., Lalani,E.-N. and Wilson,D.
TITLE Molecular cloning and expression of the human tumour-associated
polymorphic epithelial mucin, PEM
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.J.Gendler, 26-JUN-1990.
FEATURES from to/span description
pept 73 1500 polymorphic epithelial mucin (PEM) precursor
sigp 73 135 polymorphic epithelial mucin signal peptide
matp 136 1497 polymorphic epithelial mucin
signal 1783 1788 poly-A signal
mRNA < 1 1804 PEM mRNA
BASE COUNT 361 a 622 c 434 g 387 t
ORIGIN
1 cgctccacct ctcaagcagc cagcgcctgc ctgaatctgt tctgccccct ccccacccat
61 ttcaccacca ccatgacacc gggcacccag tctcctttct tcctgctgct gctcctcaca
121 gtgcttacag ttgttacagg ttctggtcat gcaagctcta ccccaggtgg agaaaaggag
181 acttcggcta cccagagaag ttcagtgccc agctctactg agaagaatgc tgtgagtatg
241 accagcagcg tactctccag ccacagcccc ggttcaggct cctccaccac tcagggacag
301 gatgtcactc tggccccggc cacggaacca gcttcaggtt cagctgccac ctggggacag
361 gatgtcacct cggtcccagt caccaggcca gccctgggct ccaccacccc gccagcccac
421 gatgtcacct cagccccgga caacaagcca gccccgggct ccaccgcccc cccagcccac
481 ggtgtcacct cggccccgga caccaggccg gccccgggct ccaccgcccc cccagcccat
541 ggtgtcacct cggccccgga caacaggccc gccttgggct ccaccgcccc tccagtccac
601 aatgtcacct cggcctcagg ctctgcatca ggctcagctt ctactctggt gcacaacggc
661 acctctgcca gggctaccac aaccccagcc agcaagagca ctccattctc aattcccagc
721 caccactctg atactcctac cacccttgcc agccatagca ccaagactga tgccagtagc
781 actcaccata gcacggtacc tcctctcacc tcctccaatc acagcacttc tccccagttg
841 tctactgggg tctctttctt tttcctgtct tttcacattt caaacctcca gtttaattcc
901 tctctggaag atcccagcac cgactactac caagagctgc agagagacat ttctgaaatg
961 tttttgcaga tttataaaca agggggtttt ctgggcctct ccaatattaa gttcaggcca
1021 ggatctgtgg tggtacaatt gactctggcc ttccgagaag gtaccatcaa tgtccacgac
1081 gtggagacac agttcaatca gtataaaacg gaagcagcct ctcgatataa cctgacgatc
1141 tcagacgtca gcgtgagtga tgtgccattt cctttctctg cccagtctgg ggctggggtg
1201 ccaggctggg gcatcgcgct gctggtgctg gtctgtgttc tggttgcgct ggccattgtc
1261 tatctcattg ccttggctgt ctgtcagtgc cgccgaaaga actacgggca gctggacatc
1321 tttccagccc gggataccta ccatcctatg agcgagtacc ccacctacca cacccatggg
1381 cgctatgtgc cccctagcag taccgatcgt agcccctatg agaaggtttc tgcaggtaat
1441 ggtggcagca gcctctctta cacaaaccca gcagtggcag ccacttctgc caacttgtag
1501 gggcacgtcg cccgctgagc tgagtggcca gccagtgcca ttccactcca ctcaggttct
1561 tcagggccag agcccctgca ccctgtttgg gctggtgagc tgggagttca ggtgggctgc
1621 tcacaccgtc cttcagaggc cccaccaatt tctcggacac ttctcagtgt gtggaagctc
1681 atgtgggccc ctgaggctca tgcctgggaa gtgttgtggt gggggctccc aggaggactg
1741 gcccagagag ccctgagata gcggggatcc tgaactggac tgaataaaac gtggtctccc
1801 actg
//
LOCUS RABGHKAB 1367 bp ss-mRNA MAM 02-AUG-1990
DEFINITION O.cuniculus gastric H+ - K+ ATPase beta subunit mRNA, complete cds.
ACCESSION M35544
KEYWORDS H+ - K+ ATPase beta subunit.
SOURCE O.cuniculus (strain New Zealand White) adult stomach, cDNA to mRNA,
clone RB-49.
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 1367)
AUTHORS Reuben,M.A., Lasater,L.S. and Sachs,G.
TITLE Characterization of a beta subunit of the gastric H,K ATPase
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.A.Reuben, 25-JUN-1990.
FEATURES from to/span description
pept 54 929 H+/K+ ATPase beta subunit
mRNA 1 1367 H+/K+ ATPase beta subunit mRNA
BASE COUNT 293 a 450 c 376 g 248 t
ORIGIN
1 tgtccatcta aggggtgcag ggtgggggag cggcggcttc cacagcagac accatggccg
61 ccttgcagga gaagaagtcg tgcagccagc gcatggagga gttccgccac tactgctgga
121 acccggacac ggggcagatg ctgggccgca ccctgtcccg ctgggtgtgg atcagcctct
181 actacgtggc cttctacgtg gtgatgaccg ggctgtttgc cctgtgcatc tacgtgctca
241 tgcaaaccat cgacccctac acgcctgact accaggacca gctgaagtcg ccaggcgtaa
301 ccttaaggcc agacgtgtat ggggagaagg gcctggaaat ccactacaac atctctgaca
361 accgcacctg gaccagcctc acgcacaccc tgagaagctt cctggcaggt tactcacccg
421 cagcccaggt ggacaacatc aactgcacct ccaagaccta cttcttccag gagagcttcg
481 gggctcccaa ccacaccaag ttctcctgca agttcacggc ggacatgctg gagaactgct
541 ccggcctcac cgaccccagc ttcggcttca aggaaggaaa gccctgcttc atcattaaga
601 tgaacaggat tgtcaggttc ctgcccagca acagcacacc accccgcgtg gactgcacct
661 tcctggacat gccccaccag gccctcacgc cgctgcaggt cgagtactac ccccccaacg
721 gcaccttcag cctccactac ttcccttact acggaaagaa agcgcagccg cactacagca
781 accctctggt ggccgccaag ctcctcaacg tgcccacgaa cacggaggtc gtggtcctct
841 gcaagatcct ggccgatcac gtgaccttcg acaacccgca cgacccctac gaggggaaag
901 tggagttcaa gctcaagatc cagaagtgag ctcgcggtcg tgggtctcac gggggtcccg
961 ctgggccctg tggacgcgtt tctgaggccc gcaccagtgc ccccctggga gcgccaggca
1021 agcagccgcg tgcacagcgc ttccattggc gtccgctgct tcccgtggtg tcccgtcccg
1081 tgcaggcgca tctgtagtag gatggaaaat gccggctaac ccagccatgg caaggcggcg
1141 accgcgagtg cccagagccc agaactgacc cgtcctttcc gtgcggggag caccttcttg
1201 ctaccattgg tttttctgac gtcagtgtga gcggcttccc agatgacgca ggggctgtta
1261 gtgtgctgag gtgcggtgac gtgagacgcc catggcaacc cccactcaca gcgcaatccc
1321 cgtgttgtaa aaataaaacg tgttagttac tctaggtaaa acaaaaa
//
LOCUS ECOTRRNA23 485 bp ss-rRNA RNA 02-AUG-1990
DEFINITION E.coli 23S small subunit ribosomal RNA, 5' end.
ACCESSION M24881
KEYWORDS 23S ribosomal RNA; small subunit ribosomal RNA.
SOURCE E.coli rRNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 485)
AUTHORS Branlant,C., Sri Widada,J., Krol,A. and Ebel,J.-P.
TITLE RNA sequences in ribonucleoprotein fragments of the complex formed
from ribosomal 23-S RNA and ribosomal protein L24 of Escherichia
coli
JOURNAL Eur. J. Biochem. 74, 155-170 (1977)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 > 485 23S rRNA
BASE COUNT 143 a 105 c 151 g 86 t
ORIGIN
1 ggttaagcga ctaagccgga gcgtacacgg atggtgcaac gccctgggca gtcagagagc
61 gtgcgatgaa ggacgctaat ctgataagcg atatggtaag gtcgaaccgt gttataaccg
121 atttccggcg aatggggaaa cagtgtgttt cgacacacta tcattaactg aatcaatagg
181 ttaatgaacc ggggggaact gaaacatcta agtaccccga aaaggggaaa taaccgagat
241 tcccccagta gagagcgtag ggcccagagc ctgaacgaat cagtgtgtgg aagcgataca
301 gggtgaaagg gcgtgtctgt tagcgccccg acagtacaca aaaatgcaca tgctgagctc
361 gggacacgag tgatgagggg gggcgcgaaa agaaccccga aagggggcgt ctgaatatgt
421 atccctgacc atcctccaag gctaaatacc cttggggggg gactgaccga tagtaccgaa
481 gagtg
//
LOCUS KPNNIFBQ 2178 bp ds-DNA BCT 02-AUG-1990
DEFINITION K.pneumoniae nitrogen fixation (nifB and niQ) genes, complete cds.
ACCESSION M15545
KEYWORDS nitrogen fixation protein.
SOURCE K.pneumoniae DNA.
ORGANISM Klebsiella pneumoniae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2178)
AUTHORS Buikema,W.J., Klingensmith,J.A., Gibbons,S.L. and Ausubel,F.M.
TITLE Conservation of structure and location of Rhizobium meliloti and
Klebsiella pneumoniae nifB genes
JOURNAL J. Bacteriol. 169, 1120-1126 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 199 1605 nitrogen fixation B (nifB)
pept 1605 2108 nitrogen fixation Q (nifQ)
BASE COUNT 402 a 670 c 692 g 414 t
ORIGIN
1 tcagattatg gatatcacca tgccgcgact gtgaagcctt atgtgagatt caggacattg
61 tcgccagcgc ggcggaattg cgacaattca gggacgcggg ttgccggtta aaaagtctac
121 ttttcatgcg gttgcgaaat taacctctgg tacagcattt gcagcaggaa ggtatcgccc
181 aaccacgaag gtacgaccat gacttcctgc tcctcttttt ctggcggcaa agcctgccgc
241 ccggcggatg acagcgcatt gacgccgctt gtggccgata aagctgccgc gcacccctgc
301 tactctcgcc atgggcatca ccgtttcgcg cggatgcatc tgcccgtcgc gcccgcctgc
361 aatttgcagt gcaactactg taatcgcaaa ttcgattgca gcaacgagtc ccgccccggg
421 gtatcgtcaa cgctgctgac gcctgaacag gcggtcgtga aagtgcgtca ggtcgcgcag
481 gcgatcccgc agctttcggt ggtgggcatc gccgggcccg gcgatccgct cgccaatatc
541 gcccgcacct ttcgcaccct ggagctgatc cgcgaacagc tgccggacct gaaattatgc
601 ctgtcgacca acggactgat gctgcctgac gcggtggacc gcctgctgga tgtcggcgtt
661 gaccacgtca cggtcaccat taacaccctc gacgcggaga ttgccgcgca aatctacgcc
721 tggctatggc tggacggcga acgctacagc gggcgcgaag gcggagagat cctgattgcc
781 cgtcagcttg agggcgtacg caggctgacc gccaaaggcg tgctggtgaa aataaattcg
841 gtgctgatcc ccggtatcaa cgatagcggc atggccgacg tgagccgcgc gctgcgggcc
901 agcggcgcgt ttatccataa tattatgccg ctgatcgcca ggccggagca cggcacggtg
961 tttcggctca acggccagcc ggagccggac gccgagacgc tcgccgccac ccgcagccgg
1021 tgcggcgaag tgatgccgca gatgacccac tgccaccagt gtcgcgccga cgccattggg
1081 atgctcggcg aagaccgcag ccagcagttt acccagcttc cggcgccaga gagtctcccg
1141 gcctggctgc cgatcctcca ccagcgcgcg cagctgcacg ccagcattgc gacccgcggc
1201 gaatctgaag ccgatgacgc ctgcctggtc gccgtggcgt caagccgcgg ggacgtcatt
1261 gattgtcact ttggtcacgc cgaccggttc tacatttaca gcctctcggc cgccggtatg
1321 gtgctggtca acgagcgctt tacgcccaaa tattgtcagg ggcgcgatga ctgcgagccg
1381 caggataacg cagcccggtt tgcggcgatc ctcgaactgc tggcggacgt taaagccgta
1441 ttctgcgtgc gtatcggcca tacgccgtgg caacagctgg aacaggaagg cattgaaccc
1501 tgcgttgacg gcgcgtggcg gccggtctcc gaagtgctgc ccgcgtggtg gcaacagcgt
1561 cgggggagct ggcctgccgc gttgccgcat aagggggtcg cctgatgccg ccgctcgact
1621 ggttgcggcg cttatggctg ctgtaccacg cggggaaagg cagctttccg ctgcgcatgg
1681 ggcttagccc gcgcgattgg caggcgctgc ggcggcgcct gggcgaggtg gaaacgccgc
1741 tcgacggcga gacgctcacc cgtcgccgcc tgatggcgga gctcaacgcc acccgcgaag
1801 aggagcgcca gcagctgggc gcctggctgg cgggctggat gcagcaggat gccgggccga
1861 tggcgcagat tatcgccgag gtttcgctgg cgtttaacca tctctggcag gaatcttggt
1921 ctggcatcgc cgccgaattg cgcctgctga tgagcgactg ctttccacag ctggtggtga
1981 tgaacgaaca caatatgcgc tggaaaaagt tcttttatcg tcagcgctgt ttgctgcaac
2041 agggggaagt tatctgccgt tcgccaagct gcgacgagtg ctgggaacgc agcgcctgtt
2101 ttgagtagcc gtttcccgaa gggggcgctg caaacaaaaa agccggaggt ttccctccgg
2161 cttttcacat catcaaat
//
LOCUS RHMNIFB 1722 bp ds-DNA BCT 02-AUG-1990
DEFINITION R.meliloti nitrogen fixation B (nifB) gene, complete cds.
ACCESSION M15544
KEYWORDS nitrogen fixation protein.
SOURCE R.meliloti DNA.
ORGANISM Rhizobium meliloti
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 1722)
AUTHORS Buikema,W.J., Klingensmith,J.A., Gibbons,S.L. and Ausubel,F.M.
TITLE Conservation of structure and location of Rhizobium meliloti and
Klebsiella pneumoniae nifB genes
JOURNAL J. Bacteriol. 169, 1120-1126 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 199 1671 nitrogen fixation B protein (nifB)
BASE COUNT 422 a 476 c 481 g 343 t
ORIGIN
1 gcgtcgcgag cgtcgcctct tttctccgtc ctttcgaaac acgaccggat gcaattcaac
61 tttgcccttg tcggaagtcc gacaagcctg tgagcaaaga gacaggaccg aatccaaaga
121 cgccaatcca ttgacgaaaa ggatttggca tagctgttgc tggttgaatt gcaaaatcac
181 cctccactgg agttgttcat gtccacaccc atgattttgc gtgagagccg gaccagcact
241 acattctctg accagttgct ggagaacgct aaatcggttg gctgctcacc cccatcgacg
301 gcgccgggcg acatagatcc tggaacttgg gacaagatta agaatcaccc ctgtttttca
361 gaggaggcgc atcactattt cgcgcgcatg cacgtggcgg tcgcgcctgc ctgcaacatc
421 caatgtaact attgcaatcg caaatacgat tgcgccaacg aaagtcggcc cggtgttgcc
481 tcggaaaagc tcacacctga ccaggcggtg cgaaaggtga ttgccgttgc caacgaagtg
541 cctcagctgt cagtgcttgg catcgctggg cctggcgatg cctgttacga ctggaagaaa
601 acaagggcga cgttcgaacg agtggctagg gaaattcccg acataagact ctgcatctcc
661 acgaacgggc tctcgctgcc ggaccatgtc gatgagcttg ccgaaatgaa cgtcgatcac
721 gtgacgatca ccatcaacat ggtcgatccg cgtgtcggcg taaagatcta cccctggatt
781 tactatggtc agcgccgcca cactggtatc gacgctgcga gaatcctgca cgaacggcag
841 atgttgggcc tggagatgct agccgaacgc ggcatcctca ccaaggtcaa ctcggtaatg
901 atccccggcg tcaatgatga gcacctgatc gaagtcaaca aagttgtgaa aggaagaggc
961 gcgttgctgc acaacgtaat gccgctaatt tcaaaccgca tacacgggac ctattacgga
1021 ctgacagggc agcgcggccc ggaggccttc gaactgcagg cccttcagga ccgtctagaa
1081 ggaaccaaac tgatgcgtca ttgtcgacat tgccgggccg atgccatagg cttgctcggc
1141 gatgatcgtg gtcacgagtt cacgctcgct gaaatccccg acgagataac ctacgatgcc
1201 agcaagcgac aggcctatcg ccagttggtc gcgcgcgaac gcggggacca cctagtggcc
1261 aagaacgagg cgaacagaac ggtaatgtcg gtggaatatg gcggatcgct tctcattgcc
1321 gtggcgacca aaggcggggg ccggatcaac gaacattttg gacacgcgaa agaatttcac
1381 gtttataccg tctcccagag agggatcaag ctggcaggcc gccgcagggt tgagcagtat
1441 tgcctcggcg gttggggcga ggtcgccacc ctcgatcaca tcgtcgttgc gcttgaagga
1501 atagacatcc tgctctgcgt caaaatcgga gattacccaa ggaaacagct gacacaggcc
1561 gggcttcgag cgacggaagc ttatggccat gactacatcg agagtgcgct cgaaagctct
1621 acgccgccga gtttggctat cgaaccaccg gtaaagacgg cgacacgttg agctgccctc
1681 cgactgaata aggagtttaa aatggccttc aagattattg ca
//
LOCUS AKOMTCYTBA 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.boliviensis mitochondrial cytochrome b gene, 5' end.
ACCESSION M35691
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.boliviensis (MVZ catalog # 171607) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon boliviensis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon boliviensis.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 122 a 111 c 52 g 116 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta ctcaaaattg ttaatcactc attcattgac
61 ctaccaactc catctaacat ctcatcctga tgaaattttg gatccctact aggcatatgc
121 ctaataatcc aaattttaac aggcctattc ctagccatac actacacatc agacacaacc
181 acagcattct cctcagtagc acatatctgc cgagatgtaa actacggatg attaatccgc
241 tacttgcacg ctaatggagc ctccatattc tttatctgcc tcttcatcca cgtaggccga
301 ggtatttact atggttccta taccctctca gaaacctgaa acattggcat catcttattc
361 ctcacaacta tagcaacagc atttgtagga tatgtactcc c
//
LOCUS AKOMTCYTBB 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.boliviensis mitochondrial cytochrome b gene, 5' end.
ACCESSION M35692
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.boliviensis (MVZ catalog # 171608) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon boliviensis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon boliviensis.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 122 a 112 c 52 g 115 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta ctcaaaattg ttaatcactc attcattgac
61 ctaccaactc catctaacat ctcatcctga tgaaattttg gatccctact aggcatatgc
121 ctaataatcc aaattttaac aggcctattc ctagccatac actacacatc agacacaacc
181 acagcattct cctcagtagc acatatctgc cgagatgtaa actacggatg attaatccgc
241 tacttgcacg ctaatggagc ctccatattc tttatctgcc tcttcatcca cgtaggccga
301 ggtatttact atggttccta caccctctca gaaacctgaa acattggcat catcttattc
361 ctcacaacta tagcaacagc atttgtagga tatgtactcc c
//
LOCUS AKOMTCYTBC 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.puer mitochondrial cytochrome b gene, 5' end.
ACCESSION M35693
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.puer (MVZ catalog # 171612) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon puer
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon puer.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 122 a 112 c 52 g 115 t
ORIGIN
1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc atttattgac
61 ctaccaactc catctaacat ctcatcctgg tgaaattttg gatccctact aggcatatgc
121 ctagtaatcc aaattctaac aggcctattc ctagccatac actatacatc agacacaacc
181 acagcattct cctcagtagc ccacatctgc cgagatgtaa attacggatg attaatccgc
241 tatctacacg ccaatggagc ttctatattc tttatctgcc tcttcatcca tgtaggccga
301 ggtatttact acggctccta tgtcctctca gaaacctgaa acatcggcat catcttattc
361 ctcacaacta tagcaacagc attcgtagga tatgtactac c
//
LOCUS AKOMTCYTBD 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.subfuscus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35694
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.subfuscus (MVZ catalog # 172969 and 172970) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon subfuscus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon subfuscus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 120 a 112 c 53 g 116 t
ORIGIN
1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac
61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcatatgc
121 ttagtgatcc aaattctaac aggcctgttc ctagccatac actatacatc cgacacaact
181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc
241 tacttacacg ctaatggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga
301 ggtatttact acggctccta tgtcctctca gaaacttgaa acatcggcat catcttattc
361 ctcacaacta tagcaacagc attcgtagga tatgtattac c
//
LOCUS AKOMTCYTBE 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.subfuscus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35695
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.subfuscus (MVZ catalog # 174109 and 174110) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon subfuscus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon subfuscus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 120 a 113 c 53 g 115 t
ORIGIN
1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac
61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcatatgc
121 ttagtgatcc aaattctaac aggcctgttc ctagccatac actatacatc cgacacaact
181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc
241 tacttacacg ctaacggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga
301 ggtatttact acggctccta tgtcctctca gaaacttgaa acatcggcat catcttattc
361 ctcacaacta tagcaacagc attcgtagga tatgtattac c
//
LOCUS AKOMTCYTBF 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.subfuscus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35696
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.subfuscus (MVZ catalog # 174229 and 174230) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon subfuscus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon subfuscus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 122 a 116 c 51 g 112 t
ORIGIN
1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac
61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcacatgc
121 ttagtaatcc aaattctaac aggcctattc ctagccatac actatacatc cgacacaact
181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc
241 tacttacacg ctaacggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga
301 ggtatttact acggctccta cgtcctctca gaaacttgaa acatcggcat catcttattc
361 ctcacaacta tagcaacagc attcgtagga tatgtactac c
//
LOCUS AKOMTCYTBG 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.kofordi mitochondrial cytochrome b gene, 5' end.
ACCESSION M35697
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.kofordi (MVZ catalog # 171665 and 171666) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon kofordi
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon kofordi.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 122 a 116 c 51 g 112 t
ORIGIN
1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac
61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcacatgc
121 ttagtaatcc aaattctaac aggcctattc ctagccatac actatacatc cgacacaact
181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc
241 tacttacacg ctaacggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga
301 ggtatttact acggctccta cgtcctctca gaaacttgaa acatcggcat catcttattc
361 ctcacaacta tagcaacagc attcgtagga tatgtactac c
//
LOCUS AKOMTCYTBH 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.juninensis mitochondrial cytochrome b gene, 5' end.
ACCESSION M35698
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.juninensis (MVZ catalog # 173038 and 173039) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon juninensis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon juninensis.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 117 a 115 c 57 g 112 t
ORIGIN
1 atgaaagtct tacgaaaaaa tcacccacta ctcaaaattg ttaatcactc atttatcgat
61 ctaccaaccc catctaacat ctcatcctga tggaatttcg ggtccctact aggcacatgc
121 ctagtaatcc aaattctaac aggcctattc ctagccatac actacacgtc agatacaacc
181 acagcattct cctcagtagc ccacatctgc cgagatgtaa actacggatg attaatccgc
241 tacttacacg ctaacggagc ctctatattc tttatctgcc tcttcatcca tgtaggccga
301 gggatttact acggctccta catcctctca gaaacctgaa acattggcat catcttgttt
361 ctcacaacta tagcaacagc atttgtaggg tatgtacttc c
//
LOCUS AKOMTCYTBI 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.mollis mitochondrial cytochrome b gene, 5' end.
ACCESSION M35699
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.mollis (MVZ catalog # 173057 and 173058) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon mollis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon mollis.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 120 a 110 c 54 g 117 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta cttaaaatta ttaatcactc attcattgac
61 ctaccaaccc catccaacat ctcatcttga tgaaactttg ggtcactgct cggtgtatgc
121 ctaataatcc aaatcctaac tggcctattc ctagccatac actacacatc agacacaacc
181 acagcattct cctcagtagc ccacatttgt cgagatgtaa actacggatg attaatccgc
241 tatctacacg ctaacggagc ttccatattc tttatttgcc ttttcatcca tgtaggccga
301 ggaatctatt acggctccta tgtcctctca gaaacctgaa acatcggtat catcctattc
361 cttacaacta tagcaacagc attcgtagga tatgtattac c
//
LOCUS AKOMTCYTBJ 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.torques mitochondrial cytochrome b gene, 5' end.
ACCESSION M35700
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.torques (MVZ catalog # 171720) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon torques
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon torques.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 117 a 112 c 58 g 114 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg taaatcactc attcattgac
61 ttaccaaccc catccaacat ctcatcttga tgaaactttg ggtcactact cggtgtatgc
121 ctaataatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc
241 tatctacacg ctaacggagc ttccatattc tttatctgcc ttttcatcca tgtaggccga
301 ggaatctatt acggctccta tgtcctctca gaaacctgaa acgtcggtat catcctattc
361 ctcacaacta tagcaacagc attcgtaggg tacgtgttac c
//
LOCUS AKOMTCYTBK 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.torques mitochondrial cytochrome b gene, 5' end.
ACCESSION M35701
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.torques (MVZ catalog # 171721) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon torques
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon torques.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 116 a 112 c 59 g 114 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg taaatcactc attcattgac
61 ttaccaaccc catccaacat ctcatcttga tgaaactttg ggtcactact cggtgtatgc
121 ctaataatcc aaattctgac cggcctgttc ctagccatac actacacatc agacacaacc
181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc
241 tatctacacg ctaacggagc ttccatattc tttatctgcc ttttcatcca tgtaggccga
301 ggaatctatt acggctccta tgtcctctca gaaacctgaa acgtcggtat catcctattc
361 ctcacaacta tagcaacagc attcgtaggg tacgtgttac c
//
LOCUS AKOMTCYTBL 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.torques mitochondrial cytochrome b gene, 5' end.
ACCESSION M35702
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.torques (MVZ catalog # 174053 and 174054) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon torques
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon torques.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 118 a 112 c 58 g 113 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg taaatcactc atttattgac
61 ttaccaaccc catccaacat ctcatcttga tggaactttg ggtcactact aggcgtatgc
121 ctaataatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
181 acagcattct cttcagtagc ccacatttgc cgagacgtaa actacggatg attaatccgc
241 tatctacacg ctaacggagc ttccatattc tttatctgcc ttttcatcca tgtaggccga
301 gggatctatt acggctccta tgccctctca gaaacctgaa acatcggtat catcttattc
361 ctcacaacca tagcaacagc attcgtaggg tatgtattac c
//
LOCUS AKOMTCYTBM 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35703
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.aerosus (MVZ catalog # 171679 and 1761680) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon aerosus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon aerosus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 115 a 107 c 59 g 120 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccacta cttaaaattg ttaatcactc atttattgac
61 ctaccaaccc cacccaacat ttcatcctga tggaattttg ggtcactact tggtgtatgc
121 ctaataatcc aaattctgac tggcctattc ctagccatac actacacatc agacacaacc
181 acagcattct cttcagtagc ccatatttgc cgagatgtaa actacgggtg gctaatccgc
241 tatctacatg ctaacggagc ctccatattc tttatctgcc tttttatcca cgtgggtcga
301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat tatcttattc
361 cttacaacta tagcaacagc attcgtaggg tatgtactac c
//
LOCUS AKOMTCYTBN 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35704
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.aerosus (MVZ catalog # 172818 and 172819) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon aerosus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon aerosus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 110 a 114 c 60 g 117 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcattc atttattgac
61 ctcccaaccc catctaacat ctcatcttga tggaatttcg ggtcactact cggtgtgtgc
121 ctagtaatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
181 acagcattct cctcagttgc ccacatttgc cgagatgtaa actacggatg attaattcgc
241 taccttcacg ctaacggagc ctccatattc tttatctgcc ttttcatcca cgtaggccga
301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat catcctgttc
361 cttacaacta tagcaacagc attcgtaggg tatgttctac c
//
LOCUS AKOMTCYTBO 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35705
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.aerosus (MVZ catalog # 172849) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon aerosus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon aerosus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 111 a 114 c 59 g 117 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcattc atttattgac
61 ctcccaaccc catctaacat ctcatcttga tggaatttcg ggtcactact cggtgtgtgc
121 ctagtaatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
181 acagcattct cctcagttgc ccacatttgc cgagatgtaa actacggatg attaattcgc
241 taccttcacg ctaacggagc ctccatattc tttatctgcc ttttcatcca cgtaggccga
301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat catcctattc
361 cttacaacta tagcaacagc attcgtaggg tatgttctac c
//
LOCUS AKOMTCYTBP 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35706
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.aerosus (MVZ catalog # 172850) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon aerosus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon aerosus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 110 a 114 c 60 g 117 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcattc atttattgac
61 ctcccaaccc catctaacat ctcatcttga tggaatttcg ggtcactact cggtgtgtgc
121 ctagtaatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
181 acagcattct cctcagttgc ccacatttgc cgagatgtaa actacggatg attaattcgc
241 taccttcacg ctaacggagc ctccatattc tttatctgcc ttttcatcca cgtaggccga
301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat catcctgttc
361 cttacaacta tagcaacagc attcgtaggg tatgttctac c
//
LOCUS AKOMTCYTBQ 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35707
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.aerosus (MVZ catalog # 172870) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon aerosus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon aerosus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 115 a 108 c 58 g 120 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcactc atttattgac
61 ctcccaaccc catccaacat ctcatcttga tggaattttg ggtcactact aggtgtatgc
121 ctaataatcc aaattctgac tggactattc ctagccatac actacacatc agacacaacc
181 acagcattct cttcagtcgc ccacatttgc cgagatgtaa attacgggtg attaatccgc
241 taccttcacg ctaacggagc ctccatattc tttatctgcc tttttatcca cgtaggtcga
301 ggaatctact acggctccta tgtcctttca gaaacctgaa acatcggtat catcctgttc
361 cttacaacta tagcaacagc atttgtagga tatgtactac c
//
LOCUS AKOMTCYTBR 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35708
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.aerosus (MVZ catalog # 172871) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon aerosus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon aerosus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 116 a 108 c 57 g 120 t
ORIGIN
1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcactc atttattgac
61 ctcccaaccc catccaacat ctcatcttga tggaattttg ggtcactact aggtgtatgc
121 ctaataatcc aaattctgac tggactattc ctagccatac actacacacc agacacaacc
181 acagcattct cttcagtcgc ccacatttgc cgagatgtaa attacgggtg attaatccgc
241 tatcttcacg ctaacggagc ctccatattc tttatctgcc tttttatcca cgtaggtcga
301 ggaatctact acggctccta tgtcctttca gaaacctgaa acatcggtat catcctattc
361 cttacaacta tagcaacagc atttgtagga tatgtactac c
//
LOCUS AKOMTCYTBW 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.jelskii mitochondrial cytochrome b gene, 5' end.
ACCESSION M35716
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.jelskii (MVZ catalog # 173084) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon jelskii
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon jelskii.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 109 a 117 c 55 g 117 t 3 others
ORIGIN
1 atggcaatta tacgaaaaac ccacccctta cttaaaatta ttaaccactc attcattgat
61 ctacctaccc cctgcaatat ttcttcttga tggaacttcg gctcactcct aggcatttgc
121 ctaattattc aaattctaac tggccttttc ctagcaatac actacacatc cgacacagcc
181 acagcattct cctccgttac ccacatctgc cgagatgtaa actacggctg actcatccgc
241 taccttcatg ctaacggagc atcyatattt tttatttgyc tattyattca cgtaggacga
301 ggaatctact atggctccta catgctctca gagacctgaa acatcggaat cattctactc
361 ctggccacaa tagccacagc attcgtagga tacgttcttc c
//
LOCUS AKOMTCYTBX 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.jelskii mitochondrial cytochrome b gene, 5' end.
ACCESSION M35715
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.jelskii (MVZ catalog # 173083) liver mitochondrial DNA.
ORGANISM Mitochondrion Akodon jelskii
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon jelskii.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 107 a 118 c 57 g 119 t
ORIGIN
1 atggcaatta tacgaaaaac ccacccctta cttaaaatta ttaaccactc attcattgat
61 ctacctaccc cctgcaatat ttcttcttga tggaacttcg gctcactcct aggcatttgc
121 ctaattattc aaatcctaac tggccttttc ctagcaatac actacacatc cgacacagcc
181 acagcattct cctccgttac ccacatctgc cgagatgtaa actacggctg actcatccgc
241 taccttcatg ctaacggggc atctatattt tttatttgtc tatttattca cgtaggacgg
301 ggaatctact atggctccta catgctctca gagacctgaa acatcggaat cattctactc
361 ctggccacaa tagccacagc attcgtagga tacgttcttc c
//
LOCUS AKOMTCYTBY 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.jelskii mitochondrial cytochrome b gene, 5' end.
ACCESSION M35714
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.jelskii (MVZ catalog # 173073 and 173074) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon jelskii
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon jelskii.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 103 a 121 c 60 g 117 t
ORIGIN
1 atggcaatta tacgaaagac ccacccctta cttaaaatca ttaaccactc gttcattgat
61 ctacctaccc cctgcaatat ttcttcttga tggaacttcg gctcactcct aggcatttgc
121 ctaattattc aaattctaac tggccttttc ctagccatac actacacatc cgacacagcc
181 acagcattct cctccgtcac ccacatctgc cgagatgtaa attacggctg actcatccgc
241 tatcttcatg ctaacggagc atccatattt tttatttgcc tattcattca cgtggggcga
301 gggatctact atggctccta catgctctca gagacctgaa acatcggaat tgttctactc
361 ctgaccacaa tagccacagc attcgtagga tacgttctcc c
//
LOCUS AKOMTCYTBZ 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION A.andinus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35713
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE A.andinus (MVZ catalog # 174062 and 174063) liver mitochondrial
DNA.
ORGANISM Mitochondrion Akodon andinus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Akodon andinus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 112 a 115 c 55 g 119 t
ORIGIN
1 atgacaatca tacgaaaaac ccacccttta cttaaaatca ttaatcactc atttattgac
61 ctacccgccc cctccaacat ctcatcctga tggaacttcg gctcactcct tggcatttgc
121 ttgataattc aaattttaac gggccttttc ctagccatac attatacgtc cgacacagct
181 acagcattct cctccgtcac ccatatctgc cgagatgtaa attacggatg actaatccgc
241 tacttacacg ccaatggagc atctatattc tttatctgca tatttattca tgtgggacga
301 ggaatttact acggctccta catactctca gaaacctgaa acatcggaat tattctactc
361 ctgaccacaa tagccacagc atttgtaggc tatgtccttc c
//
LOCUS BOMMTCYTBA 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION B.amoenus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35711
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE B.amoenus (MVZ catalog # 172878) liver mitochondrial DNA.
ORGANISM Mitochondrion Bolomys amoenus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Bolomys amoenus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 122 a 107 c 50 g 122 t
ORIGIN
1 atgaaaatta ttcgaaaaaa tcacccacta cttaaaatta ttaaccattc attcattgat
61 cttccaaccc catctaacat ttcatcctga tgaaacttcg gatccttact aggcatctgc
121 ctaataatcc aaatcctaac aggcctattt ctagcaatac actatacatc agatacaacc
181 acagcattct cctcagtcgc ccatatctgc cgagacgtga actatggatg attaatccgc
241 tacctacacg ccaacggagc ctccatattc ttcatctgtc ttttcatcca tgtaggccga
301 ggaatttact atggctccta tgtcctatca gaaacctgaa atattggtat tatcctattt
361 tttacaacta tagcaacagc attcgtagga tatgtccttc c
//
LOCUS BOMMTCYTBB 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION B.amoenus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35712
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE B.amoenus (MVZ catalog # 172879) liver mitochondrial DNA.
ORGANISM Mitochondrion Bolomys amoenus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Bolomys amoenus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 122 a 106 c 50 g 123 t
ORIGIN
1 atgaaaatta ttcgaaaaaa tcacccacta cttaaaatta ttaaccattc attcattgat
61 cttccaaccc catctaacat ttcatcctga tgaaacttcg gatccttact aggcatctgc
121 ctaataatcc aaatcctaac aggcctattt ctagcaatac actatacatc agatacaacc
181 acagcattct cctcagtcgc ccatatctgc cgagacgtga actatggatg attaatccgc
241 tacctacacg ccaacggagc ctccatattc ttcatctgtc ttttcattca tgtaggccga
301 ggaatttact atggctccta tgtcctatca gaaacctgaa atattggtat tatcctattt
361 tttacaacta tagcaacagc attcgtagga tatgtccttc c
//
LOCUS MCXMTCYTBA 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION M.mimus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35709
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE M.mimus (MVZ catalog # 171745 and 171746) liver mitochondrial DNA.
ORGANISM Mitochondrion Microxus mimus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Microxus mimus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 118 a 116 c 54 g 113 t
ORIGIN
1 atgaaaatcc tacgaaaaaa ccacccgcta cttaaaattg ttaaccactc atttattgac
61 ctaccaaccc catctaacat ctcttcctga tgaaattttg gatccttact aggtgcatgc
121 ctaataatcc aaattctaac cggcctattc ctagccatac actatacatc agacacaacc
181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc
241 tatctacacg ctaacggagc ctctatattc tttatctgcc tcttcattca cgtaggccga
301 ggaatttact acggctccta tgccctttca gagacctgaa acattggcat catcctattc
361 cttacaacta tagcaacagc attcgtaggg tacgtactcc c
//
LOCUS MCXMTCYTBB 401 bp ds-DNA ORG 02-AUG-1990
DEFINITION M.mimus mitochondrial cytochrome b gene, 5' end.
ACCESSION M35710
KEYWORDS cytochrome b; mitochondrial DNA.
SOURCE M.mimus (MVZ catalog # 171752 and 171753) liver mitochondrial DNA.
ORGANISM Mitochondrion Microxus mimus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Hesperomyini; Microxus mimus.
REFERENCE 1 (bases 1 to 401)
AUTHORS Smith,M.F. and Patton,J.L.
TITLE Variation in mitochondrial cytochrome b sequence in natural
populations of South American akodontine rodents (Muridae:
Sigmodontinae)
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Smith, 28-JUN-1990.
FEATURES from to/span description
pept 1 > 401 cytochrome b
BASE COUNT 116 a 116 c 56 g 113 t
ORIGIN
1 atgaaaatcc tacgaaaaaa ccacccgcta cttaaaattg ttaaccactc atttattgac
61 ctaccaaccc catctaacat ctcttcctga tgaaattttg gatccttact aggtgcatgc
121 ctaataatcc aaattctaac cggcctattc ctagccatac actatacgtc agacacaacc
181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc
241 tatctgcacg ctaacggagc ctctatattc tttatctgcc tcttcattca cgtaggccga
301 ggaatttact acggctccta tgccctttca gagacctgaa acattggcat catcctattc
361 cttacaacta tagcaacagc attcgtaggg tacgtactcc c
//
LOCUS CRCIGHA 482 bp ds-DNA VRT 02-AUG-1990
DEFINITION Caiman Ig germline heavy-chain gene V region, exons 1 and 2.
ACCESSION M34918
KEYWORDS germline; immunoglobulin heavy-chain; variable region.
SOURCE Caiman DNA.
ORGANISM Caiman crocodylus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Reptilia;
Archosauria; Crocodylia; Crocodylidae; Alligatorinae.
REFERENCE 1 (bases 1 to 482)
AUTHORS Litman,G.W., Berger,L., Murphy,K., Litman,R., Podlaski,F.,
Hinds,K., Jahn,C.L., Dingerkus,G. and Erickson,B.W.
TITLE Phylogenetic diversification of immunoglobulin V-H genes
JOURNAL Dev. Comp. Immunol. 8, 499-514 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 46 Ig H-chain V region precursor, exon 1
137 / 441 Ig H-chain V region precursor, exon 2
sigp 1 46 Ig H-chain V region signal peptide
137 147 Ig H-chain V region signal peptide
matp 148 / 441 Ig H-chain V region, exon 2
IVS 47 136 Ig H-chain V region intron A
signal 444 450 heptamer recombination signal
signal 474 482 nonamer recombination signal
BASE COUNT 104 a 139 c 131 g 108 t
ORIGIN
1 atgggacttg ggctccacct gctcgtcctg gccgcagctc tgcaaggtat gtcatcttcc
61 cacaggtcgg ggagggaatc attgcactga tgctgcccta atactagatc ctcttggatc
121 ctaccttgct tttcaggtgc ttggtcccag gtgcagctgg tggagtccgg aggagatgtg
181 aggaaacctg gaaactcttt gcgcctctcc tgcaaagcct cggggttcac cttcggtggc
241 tacggcatgt tctgggtccg ccaggctcct gggaaggggc tggactgggt ggctacaatt
301 aatactgatg gatccagcca gtggtactcc ccggccgttc aggggaaatt caccatctcc
361 agaggcaact cccagaacat gctgtacctg cagatgagca gcctcacacc tgaggacaca
421 gccacgtatt actgcgccag agacacagtg actcaaaccc tattcacggc aatacaaaat
481 cc
//
LOCUS MUSIGKACZ 203 bp ds-DNA ROD 02-AUG-1990
DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION M35640
KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) DNA, clone 1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 203)
AUTHORS Nishi,M., Kataoka,T. and Honjo,T.
TITLE Preferential rearrangement of the immunoglobulin kappa-chain
joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept / 9 > 203 Ig kappa-chain V-J region (AA at 9)
recomb 181 182 V-region end/J-region start
BASE COUNT 45 a 57 c 50 g 50 t 1 others
ORIGIN
1 ggtaccngca gaagtcagat gcctccccca aactatggat ttattacaca tccaacctgg
61 ctcctggagt cccagctcgc ttcagtggca gtgggtctgg gaactcttat tctctcacaa
121 tcagcagcat ggagggtgaa gatgctgcca cttattactg ccagcagttt actagttccc
181 catggacgtt cggtggaggc acc
//
LOCUS MUSIGKADA 204 bp ds-DNA ROD 02-AUG-1990
DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION M35641
KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) DNA, clone 2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 204)
AUTHORS Nishi,M., Kataoka,T. and Honjo,T.
TITLE Preferential rearrangement of the immunoglobulin kappa-chain
joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept / 9 > 204 Ig kappa-chain V-J region (AA at 9)
recomb 181 182 V-region end/J-region start
BASE COUNT 45 a 58 c 50 g 50 t 1 others
ORIGIN
1 ggtaccngca gaagtcagat gcctccccca aactatggat ttattacaca tccaacctgg
61 ctcctggagt cccagctcgc ttcagtggca gtgggtctgg gaactcttat tctctcacaa
121 tcagcagcat ggagggtgaa gatgctgcca cttattactg ccagcagttt actagttccc
181 catcggacgt tcggtggagg cacc
//
LOCUS MUSIGKADB 203 bp ds-DNA ROD 02-AUG-1990
DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION M35642
KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) DNA, clone 3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 203)
AUTHORS Nishi,M., Kataoka,T. and Honjo,T.
TITLE Preferential rearrangement of the immunoglobulin kappa-chain
joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept / 9 > 203 Ig kappa-chain V-J region (AA at 9)
recomb 181 182 V-region end/J-region start
BASE COUNT 42 a 59 c 53 g 48 t 1 others
ORIGIN
1 ggtaccngca gaagccagga tcctcgccca aaccctggat ttatgacaca tccaacctgg
61 cttctggatt ccctgctcgc ttcagtggca gtgggtctgg gacctcttgc tctctcataa
121 tcagcagcat ggaggctgaa gatgctgcca cttattactg ccatcagcgg agtagttacc
181 catggacgtt cggtggaggc acc
//
LOCUS MUSIGKADC 203 bp ds-DNA ROD 02-AUG-1990
DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION M35643
KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) DNA, clone 4.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 203)
AUTHORS Nishi,M., Kataoka,T. and Honjo,T.
TITLE Preferential rearrangement of the immunoglobulin kappa-chain
joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept / 9 > 203 Ig kappa-chain V-J region (AA at 9)
recomb 181 182 V-region end/J-region start
BASE COUNT 47 a 54 c 52 g 50 t
ORIGIN
1 ggtaccaaca gaagccaggg cagtctccta aactgctgat atactatgca tccaatcgct
61 acactggagt ccctgatcgc ttcactggca gtggatatgg gacggatttc actttcacca
121 tcagcactgt gcaggctgaa gacctggcag tttatttctg ccagcaggat tatagctctc
181 cgtggacgtt cggtggaggc acc
//
LOCUS MUSIGKADD 205 bp ds-DNA ROD 02-AUG-1990
DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION M35644
KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) DNA, clone 5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 205)
AUTHORS Nishi,M., Kataoka,T. and Honjo,T.
TITLE Preferential rearrangement of the immunoglobulin kappa-chain
joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept / 9 > 205 Ig kappa-chain V-J region (AA at 9)
recomb 181 182 V-region end/J-region start
BASE COUNT 50 a 55 c 52 g 48 t
ORIGIN
1 ggtacctgca caagcctggc cagtctccac agctcctcat ctatgggatt tccaacagat
61 tttctggggt gccagaccgc ttcagtggca gtggttcagg gacagatttc acactcaaga
121 tcagcacaat aaagcctgag gacttgggaa tgtattactg cttacaaggt acacattcag
181 cctccggacg ttcggtggag gcacc
//
LOCUS MUSIGKADE 203 bp ds-DNA ROD 02-AUG-1990
DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION M35645
KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) DNA, clone 6.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 203)
AUTHORS Nishi,M., Kataoka,T. and Honjo,T.
TITLE Preferential rearrangement of the immunoglobulin kappa-chain
joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept / 9 > 203 Ig kappa-chain V-J region (AA at 9)
BASE COUNT 46 a 51 c 53 g 50 t 3 others
ORIGIN
1 ggtaccnnnc gaaaccaggg cagtctccta aactgctgat ttactgggca tccactaggg
61 aatctggggt ccctgatcgc ttcacaggca gtggatctgg gactcatttc actctcacca
121 tcagcagtgt gaaggctgaa gacctggcag tttattactg tcagcaatat tatagctatc
181 cgtggacgtt cggtggaggc acc
//
LOCUS MUSIGKADF 205 bp ds-DNA ROD 02-AUG-1990
DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION M35646
KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
processed gene; variable region.
SOURCE Mouse (strain BALB/c) DNA, clone 1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 205)
AUTHORS Nishi,M., Kataoka,T. and Honjo,T.
TITLE Preferential rearrangement of the immunoglobulin kappa-chain
joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept / 9 > 205 Ig kappa-chain V-J region (AA at 9)
recomb 181 182 V-region end/J-region start
BASE COUNT 46 a 60 c 51 g 42 t 6 others
ORIGIN
1 ggtaccnnnn nnaaacagga cagccaccca aactcctcat ctatcttagc atccaaccta
61 gaatctgggg tccctgccag cttcagtggc agtgggtctg ggacaggctt caccctcaac
121 atccatcctg tggaggagtg aggatgctgc aacctattac ttccagcaca gtagggagct
181 tcctcggacg ttcggtggag gcacc
//
LOCUS MUSPROTCP 912 bp ss-mRNA ROD 02-AUG-1990
DEFINITION Mouse proteoglycan core protein mRNA, complete cds.
ACCESSION M34603
KEYWORDS proteoglycan core protein.
SOURCE Mouse (strain C57B1/6, haplotype H-2b) T-lymphocyte cell line
EL4.E1, cDNA to mRNA, clone 154.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 912)
AUTHORS Elliott,J.F., Pohajdak,B., Talbot,D., Miller,C.L., Helgason,C.D.,
Bleackley,R.C. and Paetkau,V.H.
TITLE Cloning and characterization of an inducible proteoglycan core
protein mRNA from a mouse T-lymphocyte cell line
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by V.H.Paetkau, 29-MAY-1990.
FEATURES from to/span description
pept 43 501 proteoglycan core protein precursor
sigp 43 117 proteoglycan core protein signal peptide
matp 265 498 proteoglycan core protein
BASE COUNT 252 a 200 c 194 g 266 t
ORIGIN
1 gatttgctaa tccagaggct gagtggagcc gagctggtca ggatgcaggt tcccgtcggc
61 agcaggcttg tcctggctct cgccttcgtc ctggtttggg gatcttcagt gcaaggttat
121 cctgctcgga gagccaggta ccagtgggtc cgctgcaaac cgaatggctt ttttgctaac
181 tgcatcgagg agaagggacc acagtttgac ctaatagatg aatccaataa catcggccct
241 cccatgaata atcctgtttt gatggaagga ccctcaaaag atttcatctc caattatgat
301 gactatgggt caggttcggg ctccggctct ggctccggct ctggctcggg ttccggctcc
361 ggaagtggct tcctaggtga catggaatgg gaataccagc caacagatga aagcaatatt
421 gtctatttca actataagcc ttttgacagg attctcactg agcaaaacca agaccaacca
481 gaagacgatt ttattatatg aatgtgacgg tctctgtctc cccacctcca tgtggaacaa
541 tgtattcagt atacttagtg taccacgttt aaatgaccag tctcaggata aagagtttta
601 cagaaaattt aaaatgcctg gaaaagactc ttgaatcctg ttaccccttt cctcattaac
661 tcgtaaggaa ttatgcttta atgctgttac ctatcttgtt gttctggaaa atgcctgcat
721 ttatgtgtat tgaatcaaca tttaagaaat taacacacac ccccattatt atacaataac
781 tttcaaagcc atactggttt tgaaaatttt aatttgatag caagttgatg aacatctttc
841 atacctaaag tgttcaggac ccaactcgca ttgtggatta caatatattc ctttatgtga
901 taaaagaaat cg
//
LOCUS PVBENH 316 bp ds-DNA VRL 02-AUG-1990
DEFINITION Papovavirus BKV enhancer/regulatory region.
ACCESSION M35737
KEYWORDS .
SOURCE Papovavirus BKV (strain BKV-P2, from prototype Gardner ATCCVR 837)
DNA, passed in primary human embryonic kidney cells.
ORGANISM Papovavirus BKV
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Papillomavirus.
REFERENCE 1 (bases 1 to 316)
AUTHORS Berg,D.T., Walls,J.D. and Grinnell,B.W.
TITLE A variant enhancer/regulatory region from a cloned human prototype
BK virus genome
JOURNAL Nucleic Acids Res. 16, 9057-9057 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rpt 92 191 tandem repeat copy A
rpt 192 290 tandem repeat copy B
BASE COUNT 101 a 79 c 85 g 51 t
ORIGIN
1 cctcagaaaa agcctccaca cccttactac ttgagagaaa gggtggaggc agaggcggcc
61 tcggcctctt atatattata aaaaaaaagg ccacagggag gagctgctta cccatggaat
121 gcagccaaac catgaccgca ggaaggaaag tgcatgactg ggcagccagc cagtggcagt
181 taataagcag ccagacagac atttgcttac ccatggaatg cagccaaacc atgacctcag
241 gaaggaaagt gcatgactgg gcagccagcc agtggcagtt aataagcagc agccagacag
301 acatgttttg cgagcc
//
LOCUS HUMCREB 1026 bp ss-mRNA PRI 02-AUG-1990
DEFINITION Human active transcription factor CREB mRNA, complete cds.
ACCESSION M34356
KEYWORDS active transcription factor CREB; alternative splicing.
SOURCE Human T-cell, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1026)
AUTHORS Berkowitz,L.A. and Gilman,M.Z.
TITLE Two distinct forms of active transcription factor CREB (cAMP
response element binding protein)
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5258-5262 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Z.Gilman, 16-MAY-1990.
FEATURES from to/span description
pept 1 1026 active transcription factor CREB-B
pept 1 261 active transcription factor CREB-A
304 1026 active transcription factor CREB-A
BASE COUNT 332 a 249 c 233 g 212 t
ORIGIN
1 atgaccatgg aatctggagc cgagaaccag cagagtggag atgcagctgt aacagaagct
61 gaaaaccaac aaatgacagt tcaagcccag ccacagattg ccacattagc ccaggtatct
121 atgccagcag ctcatgcaac atcatctgct cccaccgtaa ctctagtgca gctgcccaat
181 gggcagacag ttcaagtcca tggagtcatt caggcggccc agccatcagt tattcagtct
241 ccacaagtcc aaacagttca gtcttcctgt aaggacttaa aaagactttt ctccggaaca
301 cagatttcaa ctattgcaga aagtgaagat tcacaggagt cagtggatag tgtaactgat
361 tcccaaaagc gaagggaaat tctttcaagg aggccttcct acaggaaaat tttgaatgac
421 ttatcttctg atgcaccagg agtgccaagg attgaagaag agaagtctga agaggagact
481 tcagcacctg ccatcaccac tgtaacggtg ccaactccaa tttaccaaac tagcagtgga
541 cagtatattg ccattaccca gggaggagca atacagctgg ctaacaatgg taccgatggg
601 gtacagggcc tgcaaacatt aaccatgacc aatgcagcag ccactcagcc gggtactacc
661 attctacagt atgcacagac cactgatgga cagcagatct tagtgcccag caaccaagtt
721 gttgttcaag ctgcctctgg agacgtacaa acataccaga ttcgcacagc acccactagc
781 actattgccc ctggagttgt tatggcatcc tccccagcac ttcctacaca gcctgctgaa
841 gaagcagcac gaaagagaga ggtccgtcta atgaagaaca gggaagcagc tcgagagtgt
901 cgtagaaaga agaaagaata tgtgaaatgt ttagaaaaca gagtggcagt gcttgaaaat
961 caaaacaaga cattgattga ggagctaaaa gcacttaagg acctttactg ccacaaatca
1021 gattaa
//
LOCUS ACNRRDA 1486 bp ss-rRNA RNA 02-AUG-1990
DEFINITION A.lignieresii 16S ribosomal RNA.
ACCESSION M35017
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE A.lignieresii (strain ATCC 19393) rRNA.
ORGANISM Actinobacillus lignieresii
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 1486)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1486 16S ribosomal RNA
BASE COUNT 384 a 303 c 448 g 321 t 30 others
ORIGIN
1 attnannant ttnatcangg ctcagattna ncgctggcgg caggcttaac acatgcaagt
61 cgaacggtaa cgggaagggn ncttncnttc tttgccgacg agtggcggac gggtgagtaa
121 tgcttgggaa tntggcttat ggagggggat aactacggga aactgtagct aataccgcgt
181 aatatcttag gattaaaggg tgggactttc gggccacctg ccataagatg agcccaagtg
241 ggattaggta gttggttagg taaaggctga ccaagccgac gatctctagc tngtctgaga
301 ggatgaccag ccacactnga actgagacac ggtccagact cctacgggag gcagcagtng
361 ggaatattgc acaatggggg gaaccctnat gcagccatgc cgcgtgaatg aagaaggcct
421 tcgggttgta aagttctttc ggtagcgagg aaggtatcaa atttnataga tttngtaatt
481 gacgttaact acagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag
541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggtt nattaagtga
601 gatgtgaaag ccccgggctt aacctgggna ttgcatttca tactggtcaa ctagagtact
661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc
721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcgaaagc gtggggagca
781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct
841 ntnagtctgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag
901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg
961 atgnaacgcg aagaacctta cctactcttg acatccatgg aatcttgtag agatacgaga
1021 gtgccttcgg gaaccatgag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat
1081 gttgggttna gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attnggtcgg
1141 gaactcnaag gagactgccg gtgatnaacc ggaggaaggt ggggatgacg tcaagtcatc
1201 atggccctta cgagtagggc tacacacgtg ctacaatggc gtatacagag ggaagcaaga
1261 tggcgacatg gagcaaatct cacaaagtac gtctaagtcc ggattggagt ctgcaactcg
1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc
1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc
1441 ttaaccgcaa ggggggcgtt taccacggta tgattcatga ctnggg
//
LOCUS BNORRDA 1532 bp ss-rRNA RNA 02-AUG-1990
DEFINITION D.nodosus 16S ribosomal RNA.
ACCESSION M35016
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE D.nodosus (strain 198A; ATCC 27521) rRNA.
ORGANISM Bacteroides nodosus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Anaerobic gram-negative straight, curved and helical rods;
Bacteroidaceae.
REFERENCE 1 (bases 1 to 1532)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1532 16S ribosomal RNA
BASE COUNT 410 a 330 c 465 g 324 t 3 others
ORIGIN
1 aactgaagag tttgattctg gctcagattg aacgctggcg gcatgcttaa cacatgcaag
61 tcgaacgggg ttatgtagct tgctatgtaa cctagtggcg gacgggtgag taatatatag
121 gaatctgcct tatggtgggg gacaacgtat ggaaacgtac gctaataccg cataagattg
181 aagaatgaaa gcgggggctc gaaagacctc gcgccgtaag atgagcctat atcggattag
241 ctagttggtg gggtaagagc ctaccaaggc gacgatccgt agctggtttg agagaatgat
301 cagccacatc gggactgaga cacggcccna actcctacgg gaggcagcag tggggaatat
361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt
421 gtaaagcact tttattagtg aagaacggtg catggttaat acccatgcaa ttgacattag
481 ctaaggaaaa agcaccggct nactccgtgc cagcagccgc ggtaatacgg agggtgcaag
541 cgttattcgg aatgactggg cgtaaagcgc acgcaggtgg ttttataagt caggtgtgaa
601 atccctgggc tcaacctagg aattgcattt gaaaccgtaa gactagagta tgtcagagga
661 aggcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga
721 aggcagcctt ctggggcaat actgacactc atgtgcgaaa gcgtgggtag caaacaggat
781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggtgttgggt agaaatactc
841 ggtaccgaag ctaacgcaat aagttgaccg cctggggagt acggccgcaa ggttgaaact
901 caaagaaatt gacggggacc cgcacaagcg gtggagcatg tggtttaatt cgatgcaacg
961 cgaagaacct taccaggctt tgacatccag agaattctac agagatgtgg gagtgccttc
1021 gggaactctg agacaggtgt tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt
1081 aagtcccgca acgagcgcaa cccttatcct tacttgccag cacgtaaagg tgggaactat
1141 aaggagactg ccggtgacaa accggaggaa ggtggggatg acgtcaagtc atcatggccc
1201 ttacagcctg ggctacacac gtgctacaat gggcggtaca gacggtagcc aacccgcgag
1261 ggggagccaa tctgagaaaa ccgttcgtag tccggattgc agtctgcaac tcgactgcat
1321 gaagtcggaa tcgctagtaa tcgcgaatca gcacgtcgcg gtgaatacgt tcccgggtct
1381 tgtacacact gcccgtcaca ccatgggagt ttgttgcacc agaagtaggt agcttaaccg
1441 taaggagggc gcttaccacg gtgtggccga tgactggggt gaagtcgtaa caaggtaacc
1501 gtaggggaac ctgcggttgg atcacctcct tt
//
LOCUS CAJSAPA 3974 bp ds-DNA BCT 02-AUG-1990
DEFINITION C.fetus surface array protein (sapA) gene, complete cds.
ACCESSION J05577
KEYWORDS surface array protein.
SOURCE C.fetus fetus DNA.
ORGANISM Campylobacter fetus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 1 to 3974)
AUTHORS Blaser,M.J. and Gotschlich,E.C.
TITLE Surface array protein of Campylobacter fetus: Cloning and gene
structure
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.J.Blaser, 01-JUN-1990.
FEATURES from to/span description
pept 24 2825 surface array protein (sapA)
binding 12 15 ribosomal binding site
signal 2846 2870 transcription termination signal
BASE COUNT 1293 a 641 c 785 g 1255 t
ORIGIN
1 ctattttatt aaggagtcct taaatgttaa acaaaacaga tgtttcaatg ctttatatca
61 ctattatggg tatggcaagt gagggtgatg gtaataagta ttggttagat tatgccaata
121 ataatagttt aggagtttca agtttagcta atattatgct tgatagtcca ggggcggcta
181 aattctttgg tgattctctt ttagcaggta atgaaaaaga ttttgttact aagatatata
241 gtatagcttt aggtaatact agtgatgttg atggcattaa ttattggact aaggcaataa
301 ctggcggtgg agaatttact gatagtaagg gtaatgttat tagtgttgct agtttaagca
361 agggtgattt aataggtgct atgattaact ctatggttaa tggcggtagt gctgagtcta
421 aggctatatt tgaggctaag gcagctgcta gtgattactt tgccgatgct actttggtaa
481 gggatattag tggattagat gagggtacta cttctaagtt aattagcgag attaatagtg
541 ctagtgatct tgataaggtt aagagtgaga ttgatgcttt gaagagtgag ctacctaatc
601 cgggtagtac ttatgatctt acagagggta atgataattt aaagggtact gatttagacg
661 atacttttaa tgggactaca tatgtaggta atggtactaa taagagtact cttagtgcat
721 ttgataagac tagatggtcg gtgcttggga gagatacgtt gaatgcgata tttactgcaa
781 taacacgcgc tgcgctacta actgatcaag ctgaattgat aatcactaaa aggcgtacaa
841 acgtagaaaa tatcaatata atttcagatc tagaaacaag tggcgatttc gttttcaacg
901 gttatgaaaa agtaggattt aacgtacttg gcgatatagt tagctttgct accgacgcat
961 ctaaaagcgt aaatgtagaa acaacaggaa cgataactgc tttcaccgca gccggaacag
1021 gcaaagtcga tgttgtcgcc ggtaaaatct ctgcccttac ggccgattcg cgaacaagcg
1081 taaatttaac tgctacaaac gacactatca cattaaccag tgcaaacgct gctactagtg
1141 tgaatttaaa acagcggcag gccaaagacg ctacaataac atccgcaatg cagcaaaaat
1201 ataacaatag acgcaacagg attgcaacta taacttcagc tacggctgta gagaatttga
1261 cagttaaaca tgcaactaac gtagcgctaa atggtggcat ggataaactt gcaacagtta
1321 ctcttgacaa tgctgcttta actgctgcaa tagatataaa atctgcaagc acactaaatt
1381 taataaattc aagtgttaac ggaccaaaac atctctacag cagcaaaaga cgttactgta
1441 aatttaagcg agctgctgct aaggttaaat taaacacaac tgctgctacg gatcaaactg
1501 tgactcttaa agctaatgcc acagacaata gtcttgaatt tgatagcgca actgcaaaaa
1561 ctacgtcagt cactgctagt ggtagtggaa aaactttggt tatcaaaggt gctgaagtag
1621 aaactctagt taatatagac actactgcgt tcaatgcgct gcagagtgtt agcttcggca
1681 aaactggtca aggcggcaaa tttagcgtaa aaacaggtac aggcgatgat aaaatagagt
1741 ttgtaggcac aactttaact gaaggttcag taatcgatgc gccgggtaat gatactatag
1801 ctatgaaatc agctgctttg actagtgcaa atttcactat gatcaaaaac atagagaatg
1861 ttgctattag cgacgcggtg gctacggctg accttagctc ttctgcgttt aaaaactcag
1921 tcatcataac tactaaagag gctgccgata cgactttaac tataaataaa gaccaagtaa
1981 ttaattttac tgcggcagat gcaggctcag ttaagcttat tacagtgaaa ctaaatgatg
2041 taacagcgct aatgattgtt gttaaaatag tattagatgc tgccgctaaa gatactaata
2101 tagctttggg cacagcagca gccgataaag ctcttgtaat tgatacgggt atagaaactc
2161 ttaatatcac ttcgttagtt aaagctacaa gccctgaaac tacagcaaat accgtaaatg
2221 ctaaattgac agatgtaact tctatcatta tagatgggat gcaaattact ttaggtcatg
2281 ccggcactgc tggaactgac tatagtaaag tttctatgat agatgcaagt gctcttaaag
2341 ctggtcttac atttgatgca agtgctataa cattaggagc taatgctact ataaaaggtg
2401 gctctggagc tgatagtata actgttaaag gcggaaatat cgttgtagat ttagttgctg
2461 gcggagacga tactataact cttaaaaaag gtgctgagaa aacagatatt acaaccgtta
2521 ataactttaa tgctggcgat aagatagata tagctgatgc caaaaatggt actttcacat
2581 ttaataaaat tacaatgaat agcgatgcta atttagatga ttatatcact aaagcagtag
2641 caggagacgg atcaacaaac tctgctgtaa gctactcaca taatggatat acttatgtag
2701 tagttgatgg aacagctgga gctactttca ctaaagctac tgatacaatt atcaaacttt
2761 caggtacgct agatcttaaa cttagcggag ataatgttgt agttgatgat ggaagcgtaa
2821 tttaagcaat agtaattgct tgataaaaaa gccttaattt ttaaggcttt taatttgtct
2881 aaatttatct tagcttaggc gtatttattt gaaatctcgt cagacggatc gtcttaaaat
2941 aaggttttat agagacaaag tatattttag accaaacctt aataaatagt agatagtttt
3001 ttgctaagag gagctaggag aactagctcc ttgaatattt ttaaatttat ttgagttgta
3061 ttttaaattc aacatatctg ttttttgatg ttttgatcga aatttgtcta ttgtatccat
3121 cctaaaaatt tcagctcttc aactcttttt gtgtgaaatt gttctaataa ggttgataaa
3181 ttttcatatt tgtcacttgg aacaagaaaa ctctcgtctc cgactctttt caccaatttt
3241 tgtttataaa actttaaatt taaaggtttt ataagatcta tatatgaact aaactcatct
3301 ttatagctat tagctatact aaaagcgatc tgcttgaact ctttgaaatc aacgtttaaa
3361 attttacaaa gcggcttaaa atcgtttagc ctaatgtttt gacgcttacc gtttataggt
3421 aaatagctct caaatccttt tgccgtacta taaaatcccg ctgctcatca cgtgcataca
3481 taggcgcatc agcactatat ctccgcttgt tataagagaa agatttttca gcatgcagat
3541 cttccgtatt ttatgataac gctataaaca tatatcttca aaagctcctt cttttcagta
3601 tctgatagta gctctttgct tatgcgctta aaaagctttt cgctggttgt gtcatattta
3661 tttatactag tagtcctaga tacgaggcaa actcgacgac gttataatga tgattttcta
3721 gtttatcaaa tcttttaatt acgtagtgat actcgtcgtc atctcctgcc actacagcac
3781 ttcacggaac tctgattttc aacgtatttt tagcaaaact caaatgaaga tgctcattta
3841 tcagcgaggt gcggaaaata acttttgaac tggttggatt tgatctgatt ttcgaatatg
3901 gttttaaaat atagtttgaa gcagcgtctt gatggattat ctttcgttca aaatcgatat
3961 ttactaattt tttg
//
LOCUS CDBRRDA 1470 bp ss-rRNA RNA 02-AUG-1990
DEFINITION C.hominis 16S ribosomal RNA.
ACCESSION M35014
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE C.hominis (strain ATCC 16826) rRNA.
ORGANISM Cardiobacterium hominis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods.
REFERENCE 1 (bases 1 to 1470)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1470 16S ribosomal RNA
BASE COUNT 373 a 331 c 466 g 296 t 4 others
ORIGIN
1 actgaagagt ttgattctgg ctcagattga acgctggcgg catgcttaac acatgcaagt
61 cgaacggaaa cgatggagct tgctccaggc gtcgagtggc gaacgggtga gtaacgcatg
121 ggaatctgcc ttttgctggg ggataacgta gggaaactta cgctaatacc gcataacacc
181 taagggtgaa agcgggggac cgaaagcctc gcggcaagag atgagcccat gttggattag
241 ctagttggtg gggtaaaggc ctaccaaggc gacgatccat agctggtctg agaggatgat
301 cagccacact gggactgaga cacggcccag actccnacgg gaggcagcag tggggaatat
361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt
421 gtaaagcact ttcagtaggg aggaaaggtg cgtagttaat acctgcgcaa ttgacgttac
481 ctacagaaga agcaccggct aactccgtgc cagcagccgc ggtaatacgg agggtgcgag
541 cgttattcgg aattactggg cgtaaagcgc acgcaggcgg ttgcccaagt cagatgtgaa
601 agccccgggc ttaacctggg aactgcattt gaaactgggc gactagagta tgaaagagga
661 aagcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga
721 aggcagcttt ctgggtcgat actgacgctc atgtgcgaaa gcgtggggag caaacaggat
781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggcgtcgggt tgttaaagac
841 tcggtgcngg agctaacgca ttaagttgac cgcctgggga gtacggccgc aaggttgaaa
901 ctcaaagaaa ttgacgggga cccgcncaag cggtggagca tgtggtttaa ttcgatgcaa
961 cgcgaagaac cttaccaggc cttgacatcc taggaacttg gcagagatgc cttggtgcct
1021 tcgggaacct agagacaggt gttgcatggc tgtcgtcagc tcgtgtcgtg agatgttggg
1081 ttaagtcccg caacgagcgc aacccttatc cttacttgcc agcactttgg gtgggaactg
1141 taaggagact gccggtgaca aaccggagga aggtggggat gacgtcaagt catcatggcc
1201 cttacggcct gggctacaca cgtgctacaa tggtcggtac agacggtagc gaagccgcga
1261 ggtggagcca atctgagaaa gccgatcgta gtccggattg cagtctgcaa ctcgactgca
1321 tgaagtcgga atcgctagta atcgcgaatc agcatgtcgc ggtgaatacg ttcccgggtc
1381 ttgtacacac tgcccgtcac accatgggag tttgttgcac cagaagcagg tagcttaact
1441 taggagggcg cttgccacgg tgtggccgat
//
LOCUS GLARGBA 763 bp ds-DNA INV 02-AUG-1990
DEFINITION G.lamblia 5.8S rRNA gene.
ACCESSION M35013
KEYWORDS 5.8S ribosomal RNA; large subunit rRNA; small subunit rRNA.
SOURCE G.lamblia DNA, clones B4 and C4.
ORGANISM Giardia lamblia
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE 1 (bases 1 to 763)
AUTHORS Edlind,T.D., Sharetzsky,C. and Cha,M.E.
TITLE Ribosomal RNA of the primitive eukaryote Giardia lamblia: Large
subunit domain I and potential processing signals
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.D.Edlind, 04-JUN-1990.
FEATURES from to/span description
rRNA 177 321 5.8S ribosomal RNA
rRNA < 1 146 small subunit ribosomal RNA
rRNA 372 > 763 large subunit ribosomal RNA
BASE COUNT 121 a 293 c 293 g 56 t
ORIGIN
1 gtacacaccg cccgtcgctc ctaccgactg ggcgcggcgg cgagcgcccc ggacgcgcga
61 agggccgcga gcccccgcgc ctggaggaag gagaagtcgt aacaaggtat ccgtaggtga
121 acctgcggat ggatccctcg cgcgccccgc gcgtcgcccc cgcggcccgg tcggcccccg
181 aacgccccgc cggcggatgc ctcggcccgg gcggcgacga agagcgcggc ggagcgcgag
241 acgcggtgcg gacccgcccg ccccgagaag caccgaccct cgaacgcagc gcgccccggc
301 gccgccgcct cggcgcccgc cgcgtgccgc gcgccgcgcc gcagagagcg ccccggggcg
361 gtcccgccgg gctgcgcggc ccgaggcggc gggggcgacg ggcggaactt aagcatatca
421 gtacgccccg gaggagaaac caaccgggat tccccgtagc ggcgagcgac gcgggaggag
481 cccgccccga aggcgcgctg tggggcgcag gcgcaggccc gccgcgaggg ggcccgaggg
541 ccccgcccga gagggtgcaa gccccgtacg gcggccggcg cctgcgcgcg agtagcgctg
601 cttgagcgtg cagcgcggaa gggaggcgcg gcccttccaa ggctaaatac gccccgggac
661 cgatagcgga ccaagtagcg cgagcgaacg gtgaaaagga cgcctgcggc cgctcaaaag
721 acctgaaccg ccgccgcgcc gccggccccg tctcgaaaca cgg
//
LOCUS HEIRRDA 1472 bp ss-rRNA RNA 02-AUG-1990
DEFINITION H.influenzae 16S ribosomal RNA.
ACCESSION M35019
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE H.influenzae (strain ATCC 33391) rRNA.
ORGANISM Haemophilus influenzae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 1472)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1472 16S ribosomal RNA
BASE COUNT 376 a 300 c 459 g 311 t 26 others
ORIGIN
1 attgaagagt ttgatcatgg ctcagattga acgctggcgg caggcttaac acatgcaagt
61 cgaacggtag caggagaaag cttgctttct tgctgacgag tggcggacgg gtgagtaatg
121 cttgggaatc tggcttatgg agggggataa cgacgggaaa ctgtcgctaa taccgcgtat
181 tatcggaaga tgaaagtgcg ggactgagag gccgcatgcc ataggangag cccaagtggg
241 attaggtagt tggtggggta aatgcctacc aagcctncga tctctagctg gtctgagang
301 atgaccagcc acactggaac tgagacacgg tccagactcc tacgggaggc agcagtgggg
361 aatattgcgc natgggggga accctgacgc agccatgccg cgtgaatgaa gaaggccttc
421 gggttgtaaa gttctttcgg tattgaggaa ggttgatgtg ttaatagcac atcaaattga
481 cgttaaatac agaagaagca ccggctnact ccgtgccagc agccgcggta atacggaggg
541 tgcgagcgtt aatcggaata actgggcgta aagggcacgc aggcggttat ttaagtgagg
601 tgtgaaagcc ccgggcttaa cctgggnatt gcatttcaga ctgggtaact agagtacttt
661 agggaggggt agaattccac gtgtagcggt gaaatgcgta gagatgtgga ggaataccga
721 aggcgaaggc agccccttgg gaatgtactg acgctcatgt gcgaaagcgt ggggagcaaa
781 caggattaga taccctggta gtccacgctg taaacgctgt cgatttnggg gttggggttn
841 aactctggca cccgtagcta acgtgataaa tcgaccgcct ggggagtacg gccgcaaggt
901 taaaactcaa atgaattgac gggggccngc ncaannggtg gagcatgtgg tttaattcga
961 tgnaacgcga agaaccttac ctactcttga catcctaaga agagctcaga gatgagcttg
1021 tgccttcggg aacttagaga caggtgctgc atggctgtcg tcagctcgtg ttgtgaaatg
1081 ttgggttnag tcccgcaacg agcgcaaccc ttatcctttg ttgccagcga cttggtcggg
1141 aactcaaagg agactgccag tgataaactg gaggaaggtn gggatgacgt caagtcatca
1201 tggcccttac gagtagggct acacacgtgc tacaatggcg tatacagagg gaagcgaagc
1261 tgcgaggtgg agcgaatctc ataaagtacg tctaagtccg gattggagtc tgcaactcga
1321 ctccatgaag tcggaatcgc tagtaatcgc gaatcagaat gtcgcggtga atacgttccc
1381 gggccttgta cacaccgccc gtcacaccnt gggagtgggt tgtaccagaa gtagatagct
1441 naacctnnnn gagggcgtnn ncnacggtan ga
//
LOCUS HUMIBSUB 3110 bp ss-mRNA PRI 02-AUG-1990
DEFINITION Human integrin beta-5 subunit mRNA, complete cds.
ACCESSION M35011
KEYWORDS integrin.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3110)
AUTHORS Suzuki,S., Huang Z,-S. and Tanihara,H.
TITLE Cloning of an integrin beta subunit exhibiting high homology with
integrin beta-3 subunit
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5354-5358 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Suzuki, 05-JUN-1990.
FEATURES from to/span description
pept 30 2420 integrin beta-5 subunit
BASE COUNT 726 a 809 c 887 g 688 t
ORIGIN
1 cgcgccgccg ctgagggagg cgccccacca tgccgcgggc cccggcgccg ctgtacgcct
61 gcctcctggg gctctgcgcg ctcctgcccc ggctcgcagg tctcaacata tgcactagtg
121 gaagtgccac ctcatgtgaa gaatgtctgc taatccaccc aaaatgtgcc tggtgctcca
181 aagaggactt cggaagccca cggtccatca cctctcggtg tgatctgagg gcaaaccttg
241 tcaaaaatgg ctgtggaggt gagatagaga gcccagccag cagcttccat gtcctgagga
301 gcctgcccct cagcagcaag ggttcgggct ctgcaggctg ggacgtcatt cagatgacac
361 cacaggagat tgccgtgaac ctccggcccg gtgacaagac caccttccag ctacaggttc
421 gccaggtgga ggactatcct gtggacctgt actacctgat ggacctctcc ctgtccatga
481 aggatgactt ggacaatatc cggagcctgg gcaccaaact cgcggaggag atgaggaagc
541 tcaccagcaa cttccggttg ggatttgggt cttttgttga taaggacatc tctcctttct
601 cctacgcggc accgaggtac cagaccaatc cgtgcattgg ttacaagttg tttccaaatt
661 gcgtcccctc ctttgggttc cgccatctgc tgcctctcac agacagagtg gacagcttca
721 atgaggaagt tcggaaacag agggtgtccc ggaaccgaga tgcccctgag gggggctttg
781 atgcagtact ccaggcagcc gtctgcaagg agaagattgg ctggcgaaag gatgcactgc
841 atttgctggt gttcacaaca gatgatgtgc cccacatcgc attggatgga aaattgggag
901 gcctggtgca gccacacgat ggccagtgcc acctgaacga ggccaacgag tacactgcat
961 ccaaccagat ggactatcca tcccttgcct tgcttggaga gaaattggca gagaacaaca
1021 tcaacctcat ctttgcagtg acaaaaaacc attatatgct gtacaagaat tttacagccc
1081 tgatacctgg aacaacggtg gagattttag atggagactc caaaaatatt attcaactga
1141 ttattaatgc atacaatagt atccggtcta aagtggagtt gtcagtctgg gatcagcctg
1201 aggatcttaa tctcttcttt actgctacct gccaagatgg ggtatcctat cctggtcaga
1261 ggaagtgtga gggtctgaag attggggaca cggcatcttt tgaagtatca ttggaggccc
1321 gaagctgtcc cagcagacac acggagcatg tgtttgccct gcggccggtg ggattccggg
1381 acagcctgga ggtgggggtc acctacaact gcacgtgcgg ctgcagcgtg gggctggaac
1441 ccaacagcgc caggtgcaac gggagcggga cctatgtctg cggcctgtgt gagtgcagcc
1501 ccggctacct gggcaccagg tgcgagtgcc aggatgggga gaaccagagc gtgtaccaga
1561 acctgtgccg ggaggcagag ggcaagccac tgtgcagcgg gcgtggggac tgcagctgca
1621 accagtgctc ctgcttcgag agcgagtttg gcaagatcta tgggcctttc tgtgagtgcg
1681 acaacttctc ctgtgccagg aacaagggag tcctctgctc aggccatggc gagtgtcact
1741 gcggggaatg caagtgccat gcaggttaca tcggggacaa ctgtaactgc tcgacagaca
1801 tcagcacatg ccggggcaga gatggccaga tctgcagcga gcgtgggcac tgtctctgtg
1861 ggcagtgcca atgcacggag ccgggggcct ttggggagat gtgtgagaag tgccccacct
1921 gcccggatgc atgcagcacc aagagagatt gcgtcgagtg cctgctgctc cactctggga
1981 aacctgacaa ccagacctgc cacagcctat gcagggatga ggtgatcaca tgggtggaca
2041 ccatcgtgaa agatgaccag gaggctgtgc tatgtttcta caaaaccgcc aaggactgcg
2101 tcatgatgtt cacctatgtg gagctcccca gtgggaagtc caacctgacc gtcctcaggg
2161 agccagagtg tggaaacacc cccaacgcca tgaccatcct cctggctgtg gtcggtagca
2221 tcctccttgt tgggcttgca ctcctggcta tctggaagct gcttgtcacc atccacgacc
2281 ggagggagtt tgcaaagttt cagagcgagc gatccagggc ccgctatgaa atggcttcaa
2341 atccattata cagaaagcct atctccacgc acactgtgga cttcaccttc aacaaatcct
2401 acaatggcac tgtggactga tgtttccttc tccgaggggc tggagcgggg atctgatgaa
2461 aaggatcaga ctgaaacgcc ttgcacggct gctcggcttg atcacagctc cctaggtagg
2521 caccacagag aagaccttct agtgagcctg ggccaggagc ccacagtgcc tgtacaggaa
2581 ggtgcctggc catgtcacct ggctgctagg ccagagccat gccaggctgc gtccctccga
2641 gcttgggata aagcaagggg accttggcgc tctcagcttt ccctgccaca tccagcttgt
2701 tgtcccaatg aaatactgag atgctgggct gtctctccct tccaggaatg ctgggccccc
2761 agcctggcca gacaagaaga ctgtcaggaa gggtcggagt ctgtaaaacc agcatacagt
2821 ttggcttttt tcacattgat catttttata tgaaataaaa agatcctgca tttatggtgt
2881 agttctgagt cctgagactt ttctgcgtga tggctatgcc ttgcacacag gtgttggtga
2941 tggggctgtt gagatgcctg ttgaaggtac atcgtttgca aatgtgagtt tcctctcctg
3001 tccgtgtttg tttagtactt ttataatgaa aagaaacaag attgtttggg attggaagta
3061 aagattaaaa ccaaaagaat ttgtgtttgt ctgataaaaa aaaaaaaaaa
//
LOCUS KINRRDAA 1474 bp ss-rRNA RNA 02-AUG-1990
DEFINITION K.indologenes 16S ribosomal RNA.
ACCESSION M35015
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE K.indologenes (strain ATCC 25869) rRNA.
ORGANISM Kingella indologenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1474)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1474 16S ribosomal RNA
BASE COUNT 375 a 322 c 451 g 315 t 11 others
ORIGIN
1 ttgaactgaa gagtttgatt ctggctcaga ttgaacgctg gcggcatgct taacacatgc
61 aagtcgaacg agggaagcag cttgctgcng acctngtggc ggacgggtga gtaacgtata
121 ggaatctacc ttggggtggg ggataacgta tggaaacgta cgctaatacc acataacact
181 tacgagtcaa agcgggggat cttcggacct cgcgccctta gatgagccta tattggatta
241 gctagttggt rgggtaaagg cctaccaagg cgacgatcca tagctggttt gagaggatga
301 tcagccacat cgggactgag acacggcccn nactcctacg ggaggcagca gtggggaata
361 ttggncnatg ggggcaaccc tgatccagca atgccgcgtg tgtgaagaag gccttngggt
421 tgtaaagcac tttcgttagt gaagaaaggt ttatggttaa tacctgtgaa tttgacatta
481 gctaaagaag aagcaccggc taactccgtg ccagcagccg cggtaatacg gagggtgcna
541 gcgttattcg gaatgactgg gcgtaaagcg cacgcaggcg gttatttaag tcagatgtga
601 aagccccggg cttaacctgg gnattgcatt tgaaactggg taactagagt atgtgagagg
661 aaggcggaat ttccagtgta gcagtgaaat gcgtagagat tggaaggaac accgatggcg
721 aaggcagcct tctggggcaa tactgacgct catgtgcgaa agcgtgggta gcaaacagga
781 ttagataccc tggtagtcca cgccctaaac gatgtcaact aggtgttggg cacttaagtg
841 ctcggtaccg cagctaacgc attaagttga ccgcctgggg agtacggccg caaggttgaa
901 actcaaagaa attgacgggg acccgcacaa gcggtggagc atgtggttta attcgatgca
961 acgcgaagaa ccttaccagg ccttgacatc cagagaagtt accagagatg gtttcgtgcc
1021 ttcgggaact ctgagacagg tgttgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg
1081 gttaagtccc gcaacgagcg caacccttat ccttacttgc cagcacttcg ggtgggaact
1141 ataaggagac tgccggtgac aaaccggagg aaggtgggga tgacgtcaag tcatcatggc
1201 ccttacggcc tgggctacac acgtgctaca atggtcggta cagacggttg ccaacccgcg
1261 agggggagct aatctgagaa agccgatcgt agtccggatt gcactctgca actcgagtgc
1321 atgaagtcgg aatcgctagt aatcgcgaat cagaacgtcg cggtgaatac gttcccgggt
1381 cttgtacaca ctgcccgtca caccatggga gtgggttgca ccagaagtag ctagcttatc
1441 agggcggtta ccacggtgtg attcangact aggg
//
LOCUS MUSTNFBG 3219 bp ds-DNA ROD 02-AUG-1990
DEFINITION Mouse tumor necrosis factor-beta (lymphotoxin) gene.
ACCESSION Y00137
KEYWORDS lymphotoxin; signal peptide; tumor necrosis factor.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3219)
AUTHORS Gray,P.W., Chen,E., Tang,W.-L. and Ruddle,N.
TITLE The murine tumor necrosis factor-beta (lymphotoxin) gene sequence
JOURNAL Nucleic Acids Res. 15, 3937-3937 (1987)
STANDARD simple automatic
REFERENCE 2 (sites for [1])
AUTHORS Fashena,S.J., Tang,W.-L., Sarr,T. and Ruddle,N.H.
TITLE The murine lymphotoxin gene promoter: Characterization and negative
regulation
JOURNAL J. Immunol. 145, 177-183 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1632 1727 lymphotoxin precursor, exon 2 (first expressed
exon)
1811 1910 lymphotoxin precursor, exon 3
2135 2547 lymphotoxin precursor, exon 4
sigp 1632 1727 lymphotoxin signal peptide
1811 1813 lymphotoxin signal peptide
matp 1814 1910 lymphotoxin
2135 2547 lymphotoxin
pre-msg 685 3121 lymphotoxin mRNA and introns (minor alt.)
pre-msg 857 3121 lymphotoxin mRNA and introns (minor alt.)
pre-msg 865 3121 lymphotoxin mRNA and introns (minor alt.)
pre-msg 1114 3121 lymphotoxin mRNA and introns (minor alt.)
pre-msg 1179 3121 lymphotoxin mRNA and introns (major alt.)
IVS 1286 1622 put. intron I
IVS 1728 1810 intron II
IVS 1911 2134 intron III
signal 1153 1158 put. TATA-box
BASE COUNT 724 a 971 c 744 g 780 t
ORIGIN
1 tgaaagctcc ctctgtacag agcattggaa gcctggggtg tacatttggg gttacatgat
61 cttggggttc taagagaata cccccaaatc atcttccaga cctggaacat tctaggacag
121 ggttctcaac cttcctaact ccatgaccct ttaatacagt tcctcatgtt gtggtgaccc
181 caaccataca attattttcg ttgctatttc ataactgtaa tttcgctgct attatgaaca
241 taatgtaaat atttgtttta aatagaggtt tgccaaagag accttgccac aggttgagac
301 tgccgctcca gagagtaagg gacacattaa aattgttaca caccagatcc cccaaatttg
361 gggagagggc actgtaatgg aacttcttga cattaaactg gcagataaac tggcagaaaa
421 aaaaaaaaaa aagctgggca gtggtggcac acacctttaa tcccagcact tgggaggcag
481 aggcaggcgg atttctgagt tctaggccag cctggtcgac agagtgagtt tcaggacagc
541 cagggctaca cagagaaacc ctgtctcgaa aaaagcaaaa aaaaaaaaaa aaaactggca
601 gatgaccaga aaatacagat atattggaat aactgtgact tgaaccccca aagacaagag
661 aggaaatagg cctgaagggg cggcaggcat gtcaagcatc cagagccctg ggttcgaacc
721 tgaaaaaaca aaggtgccgc taaccacatg tggcttcgga gccctccaga catgaccatg
781 atcgacagag agggaaatgt gcagagaagc ctgtgagcag tcaagggtgc agaagtgata
841 taaaccatca ctcttcaggg aaccaggctt ccagtcacag cccagctgta ccctctccac
901 gaattgctcg gccgttcact ggaactcctg ggcctgaccc agctccctgc tagtccctgc
961 ggcccacagt tccccggacc cgactccctt tcccagaacg cagtagtcta agcccttagc
1021 ctgcggttct ctcctaggcc ccagcctttc ctgccttcga ctgaaacagc agcatcttct
1081 aagcctgggg cttccccaag ccccagcccc gacctagaac ccgcccgctg cctgccacac
1141 tgccgcttcc tctataaagg gacccgacgc cagcgcccag gaccccgcac agcaggtgag
1201 cctctcctac cctgtctcct tgggcttacc ctggtatcag gcatccctca ggatccccag
1261 ccttaatggg tctggtcctc ctgtcgtggc tttgattttt ggtctgttcc tgtggcggcc
1321 ttatcagtct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct
1381 ctctctctct ctctttctct ctctctgcct ctgttagcca ttgtctgttt ctatggtgga
1441 gctttcctct tcccctctgt ctctccttat ccctgctcac ttcagggttc ccctgcctgt
1501 ccccttttct gtctgtcgcc ctgtctctca gggtggctgt ctcagctggg aggtaaggtc
1561 tgtcttcctc tgtgtgcccc gcctccgcta cacacacaca ctctctctct ctctctcagc
1621 aggttctcca catgacactg ctcggccgtc tccacctctt gagggtgctt ggcacccctc
1681 ctgtcttcct cctggggctg ctgctggccc tgcctctagg ggcccaggtg aggcagcaag
1741 agattggggg tgctggggtg gcctagctaa ctcagagtcc tagagtcctc tccactctct
1801 tctgtcccag ggactctctg gtgtccgctt ctccgctgcc aggacagccc atccactccc
1861 tcagaagcac ttgacccatg gcatcctgaa acctgctgct caccttgttg gtaaacttct
1921 gcctccagag gagaggtcca gtccctgcct tttgtcctac ttgcccaggg gcccaggcga
1981 tcttcccatc tccccacacc aacttttctt accctaaggg caggcacccc actcccaatc
2041 tccctaccaa ccatcccact tgtccagtgc ctgctcctca gggatgggga cctctgatct
2101 tgatagcccc ccaatgtctt gtgcctcttc ccagggtacc ccagcaagca gaactcactg
2161 ctctggagag caagcacgga tcgtgccttt ctccgacatg gcttctcttt gagcaacaac
2221 tccctcctga tccccaccag tggcctctac tttgtctact cccaggtggt tttctctgga
2281 gaaagctgct cccccagggc cattcccact cccatctacc tggcacacga ggtccagctc
2341 ttttcctccc aatacccctt ccatgtgcct ctcctcagtg cgcagaagtc tgtgtatccg
2401 ggacttcaag gaccgtgggt gcgctcaatg taccaggggg ctgtgttcct gctcagtaag
2461 ggagaccagc tgtccaccca caccgacggc atctcccatc tacacttcag ccccagcagt
2521 gtattctttg gagcctttgc actgtagatt ctaaagaaac ccaagaattg gattccaggc
2581 ctccatcctg accgttgttt caagggtcac atccccacag tctccagcct tccccactaa
2641 aataacctgg agctctcacg ggagtctgag acacttcagg ggactacatc ttccccaggg
2701 ccactccaga tgctcagggg acgactcaag cctacctaga agttcctgca cagagcaggg
2761 tttttgtggg tctaggtcgg acagagacct ggacatgaag gagggacaga catgggagag
2821 gtggctggga acaggggaag gttgactatt tatggagaga aaagttaagt tatttattta
2881 tagagaatag aaagagggga aaaatagaaa gccgtcagat gacaactagg tcccagacac
2941 aaaggtgtct cacctcagac aggacccatc taagagagag atggcgagag aattagatgt
3001 gggtgaccaa ggggttctag aagaaagcac gaagctctaa aagccagcca ctgcttggct
3061 agacatccac agggaccccc tgcaccatct gtgaaaccca ataaacctct tttctctgag
3121 attctgtctg cttgtgtctg tcttgcgttg ggggagaaac ttcctggtct ctttaaggag
3181 tggagcaggg gacagaggcc tcagttggcc atgggatcc
//
LOCUS NGORRDAA 1478 bp ss-rRNA RNA 02-AUG-1990
DEFINITION N.denitrificans 16S ribosomal RNA.
ACCESSION M35020
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE N.denitrificans (strain ATCC 14686) rRNA.
ORGANISM Neisseria denitrificans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1478)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1478 16S ribosomal RNA
BASE COUNT 380 a 331 c 448 g 308 t 11 others
ORIGIN
1 aacataagag tttgatcctg gctcagattg aacgctggcg gcatgcttta cacatgcaag
61 tcgaacggca gcggagaagt gcttgcactt ctgccggcga gtggcgaacg ggtgagtaac
121 atattggaac gtaccgagta atgggggata actaatcgaa agattagcta ataccgcata
181 cgtcctgagg gagaaagcgg gggatcttcg gacctcgcgt tattcgagcg gccaatatct
241 gattagctag ttggtggggt aaaggcctac caaggcgacg atcagtagcg ggtctgagag
301 gatgatccgc cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg
361 gaattttgga cgatgggcgc aagcctgatc cagccatgcc gcgtgtatga agaaggcctt
421 cgggttgtaa agtacttttg tnngggaaga aaagggattt gctaataccg aatcctcatg
481 acggtacctn aagaataagc accggctaac tacgtgccag nanccgcggn natacgtagg
541 gtgcgagcgt taatcggaat tactgggcgt aaagcgagcg cagacggtta cttaagcagg
601 atgtgaaatc cccgagctca acttgggaac tgcgttctga actgggtagc tagagtatgt
661 cngagggggg tagaattcca cgtgtagcag tgaaatgcgt agagatgtgg aggaataccg
721 atggcgaagg cagccccctg ggataatact gacgttcatg ctcgaaagcg tgggtagcaa
781 acaggattag ataccctggt agtccacgcc ctaaacgatg tcgattagct gttggggtac
841 ttgatacctt agtagcgtag ctaacgcgtg aaattgaccg cctggggagt acggtcgcaa
901 gattaaaact caaaggaatt gacggggacc cgcacaagcg gtggatgatg tggattaatt
961 cgatgcaacg cgaagaacct tacctggtct tgacatgtac ggaatctccc agagacggga
1021 gngtgccttc gggagccgta acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag
1081 atgttgggtt aagtcccgca acgagcgcaa cccttgtcat tagttgccat cattcagttg
1141 ggcactctaa tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtcaagtcct
1201 catggccctt atgaccaggg cttcacacgt catacaatgg tcggtacaga gggtagccaa
1261 gccgcgaggt ggagccaatc tcacaaaacc gatcgtagtc cggattgcac tctgcaactc
1321 gagtgcatga agtcggaatc gctagtaatc gcaggtcagc atactgcggt gaatacgttc
1381 ccgggtcttg tacacaccgc ccgtcacacc ntgggagtgg gggataccag aagtaggtag
1441 ggtaaccgca aggagcccnc ttaccacggt atacttca
//
LOCUS PASRRDA 1481 bp ss-rRNA RNA 02-AUG-1990
DEFINITION P.multocida 16S ribosomal RNA.
ACCESSION M35018
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE P.multocida (strain NCTC 10322) rRNA.
ORGANISM Pasteurella multocida
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 1481)
AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the
genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
transfer of Bacteroides nodosus (Beveridge 1941) to the genus
Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
assignment of the genera Cardiobacterium, Dichelobacter and
Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.E.Dewhirst, 05-JUN-1990.
Author address: F.E.Dewhirst
Forsyth Dental Center
140 Fenway
Boston, MA 02115
FEATURES from to/span description
rRNA 1 1481 16S ribosomal RNA
BASE COUNT 374 a 305 c 459 g 321 t 22 others
ORIGIN
1 attgnagagt ttgancangg ctcagattga ncgctggcgg caggcttaac acatgcaagt
61 cgaacggtag caggaagaaa gcttnctttc tttgctgacg agtggcggac gggtgagtaa
121 tgcttgggna tctggcttat ggagggggat aactgtggga aactgcagct aataccgcgt
181 attctcttag gaggaaaggg tgggaccttn gggccacctg ccataagatg agcccaagtg
241 ggattaggta gttggtgggg taaaggccta ccaagcctnc gatctctagc tngtctgaga
301 ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg
361 ggaatattgc gcaatggggg gaaccctgac gcagccatgc cgcgtgaatg aagaaggcct
421 tcgggttgta aagttctttc ggtnatgagg aagggatgtt nttaaataga tagcatcatt
481 gacgttaatt ncagaagaag caccggctna ctccgtgcca gcagccgcgg taatacggag
541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggac ttttaagtga
601 gatgtgaaat ccccgagctt aacttgggaa ctgcatttca gactgggagt ctagagtnct
661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc
721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcnaaagc gtggggagca
781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct
841 ntntgcttgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag
901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg
961 atgnaacgcg aagaacctta cctactcttg acatcctaag aagagctcag agatgagctt
1021 gtgccttcgg gaacttagag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat
1081 gttgggttaa gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attcggtcgg
1141 gaactcaaag gagactgcca gtgannaact ggaggaaggt ggggatgacg tcaagtcatc
1201 atggccctta cgagtagggc tacacacgtg ctacaatggt gcatacagag ggcagcgaga
1261 gtgcgagctt gagcgaatct cagaaagtgc atctaagtcc ggattggagt ctgcaactcg
1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc
1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc
1441 ttaaccntcg ggggggcgtt taccacggta tgattcatga c
//
LOCUS RATSUBB 1124 bp ss-mRNA ROD 02-AUG-1990
DEFINITION Rat F-0-ATPase subunit b mRNA, complete cds.
ACCESSION M35052
KEYWORDS F-0-ATPase subunit b.
SOURCE Rat (strain Wistar) liver, cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1124)
AUTHORS Tsurumi,C., Yoshihara,Y., Osaka,F., Yamada,F., Tani,I., Higuti,T.,
Shimizu,M., Oeda,K., Ohkawa,H., Toda,H., Kakuno,T., Sakiyama,F.,
Kumatori,A., Tanaka,K. and Ichihara,A.
TITLE cDNA cloning and sequencing for the import precursor of subunit b
in H+-ATP synthase from rat mitochondria
JOURNAL Biochem. Biophys. Res. Commun. 169, 136-142 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Higuti, 04-JUN-1990.
FEATURES from to/span description
pept 60 830 F-0-ATPase subunit b precursor
sigp 60 185 F-0-ATPase subunit b signal peptide
matp 186 827 F-0-ATPase subunit b
signal 1106 1111 poly-A signal
BASE COUNT 323 a 241 c 266 g 294 t
ORIGIN
1 gagtgtctcg ggtcggggtc acagggacat cggtaccgct gtctggatct tcgcagacaa
61 tgctgtcccg ggtggtactt tctgccgccg ccacagcggc cccgtgtctg aagaacgcgg
121 ccgtcctggg tccaggggtt ttacaggcaa caagggtctt tcacacagga cagcctcgcc
181 ttgcccctct accacctctt cctgaatatg gaggaaaagt acgtcttggg ctgattcctg
241 aggaattttt ccagttcctt taccctaaga ctggtgtaac aggaccctac gtgcttggaa
301 ctgggcttag cttgtacttt ctatccaaag aaatatatgt gattacccca gagaccttct
361 ctaccatatc agtagtaggg ttgattgtct atgtgattaa gaaatatggc gcctctattg
421 gagaatttat tgacaaactt aatgaggaaa aaattgctca actagaagaa ataaagcagt
481 caagcatgaa gcaaatccag gatgcaatca acagggagaa ggcacagcag gcactggttc
541 agaagcgcca ctacctcttc gatgttcaga ggaataacat tgccctggcc ttggaggtca
601 cttaccggga acgactacat aaagcatata aggaggtaaa gaatcgcctg gactaccata
661 tttctgtaca ggacatgatg cgtcgcaagg agggagagca catgataaac tgggtggaga
721 agcatgtgat acagagcatt tctgcacagc aggaaaagga gaccattgcc aagtgcatcg
781 gagatctaaa gatgcttgca aagaaggctc aagctcagcc aattatgtga atgtgtctat
841 ctcagttgtg atagccagag agggttaact taaatgagaa ctactctact tgaagaactc
901 tttctgtatt gctgtctaat gaaattgtgg tgtacctttc ctaagaaatg atcaaggtcc
961 atttagtggc ttaaccctat ttctgatcct tgctctgtat tcggagttgt ctgatcacat
1021 ttgagtaagc aatttgcagc aactcgctgc ccaacagaaa ttaccaagtt atagtttaaa
1081 cttgtaatta gttaaaacat cttacaataa aatgtttgaa acag
//
LOCUS RATHDNFNT 1020 bp ss-mRNA ROD 02-AUG-1990
DEFINITION Rat neurotrophin-3 (HDNF/NT-3) mRNA, complete cds.
ACCESSION M34643
KEYWORDS neurotrophic factor; neurotrophin-3.
SOURCE Rat adult (strain Sprague-Dawley) hippocampus, cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1020)
AUTHORS Ernfors,P., Ibanez,C.F., Ebendal,T., Olson,L. and Persson,H.
TITLE Molecular cloning and neurotrophic activities of a protein with
structural similarities to nerve growth factor: Developmental and
topographical expression in the brain
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5454-5458 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.P.Ernfors, 18-MAY-1990.
FEATURES from to/span description
pept 73 849 neurotrophin-3 (HDNF/NT-3) precursor
sigp 73 120 neurotrophic factor/neurotrophin-3 (HDNF/NT-3)
signal peptide
matp 490 846 neurotrophic factor/neurotrophin-3
BASE COUNT 320 a 229 c 226 g 245 t
ORIGIN
1 gtcgacgtcc ctggaaatag tcatacggat gccatggtta cttctgccac gatcttacag
61 gtgaacaagg tgatgtccat cttgttttat gtgatatttc ttgcttatct ccgtggcatc
121 caaggcaaca acatggatca aaggagtttg ccagaagact ctctcaattc cctcattatc
181 aagttgatcc aggcggatat cttgaaaaac aagctctcca agcagatggt agatgttaag
241 gaaaattacc agagcaccct gcccaaagca gaggcaccca gagaaccaga gcagggagag
301 gccaccaggt cagaattcca gccgatgatt gcaacagaca cagaactact acggcaacag
361 agacgctaca attcaccccg ggtcctgctg agtgacagca cccctttgga gccccctccc
421 ttatatctaa tggaagatta tgtgggcaac ccggtggtaa ccaatagaac atcaccacgg
481 aggaaacgct atgcagagca taagagtcac cgaggagagt actcagtgtg tgacagtgag
541 agcctgtggg tgaccgacaa gtcctcagcc attgacattc ggggacacca ggttacagtg
601 ttgggagaga tcaaaaccgg caactctcct gtgaaacaat atttttatga aacgaggtgt
661 aaagaagcca ggccagtcaa aaacggttgc agggggattg atgacaaaca ctggaactct
721 cagtgcaaaa cgtcgcaaac ctacgtccga gcactgactt cagaaaacaa caaactcgta
781 ggctggcgct ggatacgaat agacacttcc tgtgtgtgtg ccttgtcaag aaaaatcgga
841 agaacatgaa ttggcatctg tccccacata taaattatta ctttaaatta tatgatatgc
901 atgtagcata taaatgttta tattgttttt atatattata agttgacctt tatttattaa
961 acttcagcaa cccttacagt atataagctt ttttcataat cgggctgctc aaaaaaaaaa
//
LOCUS NEMG12 3620 bp ds-DNA INV 02-AUG-1990
DEFINITION A.lumbricoides trans-spliced mRNA encoded protein gene 12, complete
cds.
ACCESSION M33757
KEYWORDS .
SOURCE A.lumbricoides suum DNA, clone gene-12.
ORGANISM Ascaris lumbricoides
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia;
Ascaridida; Ascaridina; Ascaridoidea; Ascarididae.
REFERENCE 1 (bases 1 to 3620)
AUTHORS Denker,J.A. and Nilsen,T.W.
TITLE Trans splicing of nematode pre-mRNA in in vitro
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 886 to 1149)
AUTHORS Hannon,G.J., Maroney,P.A., Denker,J.A. and Nilsen,T.W.
TITLE Trans splicing of nematode pre-messenger RNA in vitro
JOURNAL Cell 61, 1247-1255 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.W.Nilsen, 18-APR-1990.
[1] Author address: T.W.Nilsen,
Case Western Reserve University,
Department of Molecular Biology and Microbiology
School of Medicine, 2119
Abbington Rd., Cleveland OH, 44106,
Ph: (216) 368-3913, FAX: 216-368-3055.
FEATURES from to/span description
pept 995 1190 gene-12 encoded protein, exon 1
1860 1945 gene-12 encoded protein, exon 2
2280 2432 gene-12 encoded protein, exon 3
2947 2997 gene-12 encoded protein, exon 4
pre-msg < 1 3210 gene-12 mRNA (alt.)
pre-msg < 1 3114 gene-12 mRNA (alt.)
IVS 1191 1859 gene-12 intron A
IVS 1946 2279 gene-12 intron B
IVS 2433 2946 gene-12 intron C
signal 3097 3102 poly-A signal
BASE COUNT 942 a 722 c 867 g 1080 t 9 others
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcgagc acactctggc ctctttcagt tgtttcatgg caaggcgcat acgattttct
61 aaacgcgctt gagtttttgt atgtgcataa ctgtgccaac actgccgagg atattcgtgc
121 gctttttgct tgttcttttc cacaaacagc aagtaatgtt gtaaaatacg ccctgtcata
181 atcattggca tcttcctttg atgtcgacga actggtcacc gatggcttct ttactttgcc
241 tctgctcatt cttacccaca ttgtccactt acctgttgac cgtgtttcgg caaccagtgc
301 agtcagatgg tctctatacg cgacgcagca ctcgtcccag tcgtcttcaa gagcaacttt
361 agcgaagatt ttgaatcccg cgggagagcc gtttcctgtt cgacggtgac agagctattg
421 tagagggata tcatcgaaat ctaggtcctc ttgagtgcga atcctctgca aacaggtaac
481 tctagactcc tggcaaaaaa tgagttgatt ctaccaattt tttagttagc agcagttggt
541 tgggacgagc ttgcatgctt cgttcggttg aggaacttta ctgcagttcc gtcagccatg
601 ttgcctttcg gattctgttt ttccgacatc atggctacag gtggcactga agacgacgtt
661 atatttgaca caccttgcac gtcataatgt gtatccgcat tgtgctggtc ggatatcgct
721 gcagcaatta ttgaggaggt atttggtcat ctgggttcat ttgaaacggc gtgcattatg
781 ggaacaagtg gtacgcagtc gttcacgcgc tcctctgatt agttacagaa gtggatgcgt
841 gtccggtctt ttccctcatc ttcttaaccc ttttctttta gaatgtattc aaatgcatgg
901 ttttgtttgg tggttgtatc gaaaatttac gccatgagac actagtgcat tatttaatag
961 tagtaaaaga gatgtttttt tagtatattg aaggatgtca tacgacttgg caacggctga
1021 aagagcagcg tatgctcctt tcttcggcta tatgggagca gcttctgcac agatcttcac
1081 agtgcttggg gctgcgtacg gcactgcgaa atcggccgtt ggaatctcat caatgggagt
1141 gatgaggccg gagctgatca tgaaatccgt tattcctgtc atcatggccg gtcagtgttt
1201 cccgttattc ccactttgat accactgttc agctttacgt ggtgtgatat cggagcatcg
1261 cccagagcag catattactt tattttttga aagggagcat ggcaagaggt tgacgaaagg
1321 caaggaatat ttactgcttc taacttactg aattcatcac tgccaattct ttttgtttta
1381 tatctgactt tcttagagct gaagatgttg tttcaattaa ggtggttgtc gaaatgctgt
1441 aatcgttacc tgttgctgtc tgaaatttgg cttgtacatc cattattctg agattctttt
1501 gtataatttt ccactaactg ggatcaaaac ggacttttat aaatatttgt cggtaactaa
1561 gagtgatttc catttagagt atggaaaaca aagtagggca tctcgcatat ctcatctctt
1621 gcctctggac tgtcacatat tttctttcat ccccaaagaa aggtccggaa tacttggaga
1681 tgatcgtcaa tgggtgatta ttgcctcaag gcatatgaga agtgtgcata aatggaattg
1741 cttaatttgg tgctgtttga tggattctgt ttatgatttg acatattcgg acagatattc
1801 gggtaacata aaagtatttt tattgacaca agctggtcat gaatttacgt gcgttgcagg
1861 tattatcggc atttatgggc ttgttgttgc gatggtttta agagggaagg tgacttccgc
1921 tagcgcagga tacaccctcg acaaagtaag accatttgtt gctctattct cgcaagccca
1981 ccactgtttt cgctgttctt catctgcacg aattttcttt tggtgcataa tgtcttctgc
2041 tgcatcatct attgcgactc aaagaaagga cttcagcttt tctaaccttt cctgcgtaaa
2101 tcatggaaag tcggggggat tgggagggcg ataggtggag ggggagggag cggggaactg
2161 caaagagaga agtgggacta agtaaacatg atgcggagcg agtcgaccta agattgccta
2221 aaggtgattt tatattgagt agtgttttta gaggcggtca ttaacgaggt ccgtttcagg
2281 gatttgccca cctggctgct ggccttacat gtggtctttg cggtctcggt gccggatacg
2341 caatcggaat cgttggagat gcgggtgtca ggggaacagc acagcaaccg cgcttgttcg
2401 tcggcatgat cctcattctt atcttctccg aggtactttt ccgctacttg gttaanctgt
2461 tggaattggt cgatggaatc agcgcgtctt agaatgacta gactgattta aagtggccgg
2521 atatacctca gtgggtttgg tgctcgtctg gctgagttta gagggaggaa gacttctccg
2581 ctgttcctgg attcttattt gaacagtcta agagtttatt cccagaactt ccnnnnaaga
2641 ctaggtagnn aaggggggag agtgggannt aggaaggccg cgaccctaga atgtacaaag
2701 gaaggtcgat ccctcaatga cgattcgcac ttagaggagt gcacataaga cttacgagaa
2761 tttacgatgg caccggaaaa aaggtagaag tcataaggtt gttgaccaga tggttactac
2821 accagcattg tcgcagtcag tcccggaggt gctcatataa agagaaaaca ccagaacaaa
2881 aataacaaca ttgagatgta gtagtcttca catatttctg tcgctggtga tttgaaaatc
2941 gttcaggtac tcggtctata cggaatgatt gtcgcgctca ttcttggcac ctcatagacg
3001 acgggcataa tcggtggctg tacaacaact gtgaaatggc gccgggaggg ttgccaataa
3061 tgttccacta aatgttatac actactgctg catataaata aaagctgctc cgcgaaaatg
3121 ctgagtttaa gtggctgcaa cacattttgg tgctgagatg tcgtcttaac tgcaagtttt
3181 ttgacggaga caatgaagta ttttatgtgc aattattcat gtaacgtcga tgcatttaca
3241 gtcatatgca aatgatctgc gtgaatgtcg cactgtgaac cacaaccaca gaaaaacgaa
3301 gtggatctat catgactgca tttacgacta aaacagatcg gtgcacaaag caagtgaaac
3361 ttagccaagg taaaattaac acagttcttt ctctcaaaca aacatgaagt aaacacatca
3421 cagaaacatc caaactgcct cgaaagtggt tgtagcgcta taacagcaag agatacgaaa
3481 agaataaaag aacccgcatt tatacgataa gcgtatttaa atgcgctttc acgcaacgat
3541 gcaaaaggaa aggctgttgg ctattgtatt gccttcgaga aacaattgac ttagagcaat
3601 gccatattgg cgtctgaacg
//
LOCUS HUMPPPB1A 3215 bp ss-mRNA PRI 02-AUG-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) mRNA, complete
cds.
ACCESSION M33689
KEYWORDS protein phosphotyrosyl phosphatase.
SOURCE Human placenta, cDNA to mRNA, (library of Clontech), clone
lambda-16-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3215)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept 73 1380 protein phosphotyrosyl phosphatase 1B (EC
3.1.3.48)
BASE COUNT 818 a 828 c 801 g 768 t
ORIGIN Chromosome 20q13.1-q13.2.
1 gcgcgacgcg gcctagagcg gcagacggcg cagtgggccg agaaggaggc gcagcagccg
61 ccctggcccg tcatggagat ggaaaaggag ttcgagcaga tcgacaagtc cgggagctgg
121 gcggccattt accaggatat ccgacatgaa gccagtgact tcccatgtag agtggccaag
181 cttcctaaga acaaaaaccg aaataggtac agagacgtca gtccctttga ccatagtcgg
241 attaaactac atcaagaaga taatgactat atcaacgcta gtttgataaa aatggaagaa
301 gcccaaagga gttacattct tacccagggc cctttgccta acacatgcgg tcacttttgg
361 gagatggtgt gggagcagaa aagcaggggt gtcgtcatgc tcaacagagt gatggagaaa
421 ggttcgttaa aatgcgcaca atactggcca caaaaagaag aaaaagagat gatctttgaa
481 gacacaaatt tgaaattaac attgatctct gaagatatca agtcatatta tacagtgcga
541 cagctagaat tggaaaacct tacaacccaa gaaactcgag agatcttaca tttccactat
601 accacatggc ctgactttgg agtccctgaa tcaccagcct cattcttgaa ctttcttttc
661 aaagtccgag agtcagggtc actcagcccg gagcacgggc ccgttgtggt gcactgcagt
721 gcaggcatcg gcaggtctgg aaccttctgt ctggctgata cctgcctctt gctgatggac
781 aagaggaaag acccttcttc cgttgatatc aagaaagtgc tgttagaaat gaggaagttt
841 cggatggggc tgatccagac agccgaccag ctgcgcttct cctacctggc tgtgatcgaa
901 ggtgccaaat tcatcatggg ggactcttcc gtgcaggatc agtggaagga gctttcccac
961 gaggacctgg agcccccacc cgagcatatc cccccacctc cccggccacc caaacgaatc
1021 ctggagccac acaatgggaa atgcagggag ttcttcccaa atcaccagtg ggtgaaggaa
1081 gagacccagg aggataaaga ctgccccatc aaggaagaaa aaggaagccc cttaaatgcc
1141 gcaccctacg gcatcgaaag catgagtcaa gacactgaag ttagaagtcg ggtcgtgggg
1201 ggaagtcttc gaggtgccca ggctgcctcc ccagccaaag gggagccgtc actgcccgag
1261 aaggacgagg accatgcact gagttactgg aagcccttcc tggtcaacat gtgcgtggct
1321 acggtcctca cggccggcgc ttacctctgc tacaggttcc tgttcaacag caacacatag
1381 cctgaccctc ctccactcca cctccaccca ctgtccgcct ctgcccgcag agcccacgcc
1441 cgactagcag gcatgccgcg gtaggtaagg gccgccggac cgcgtagaga gccgggcccc
1501 ggacggacgt tggttctgca ctaaaaccca tcttccccgg atgtgtgtct cacccctcat
1561 ccttttactt tttgcccctt ccactttgag taccaaatcc acaagccatt ttttgaggag
1621 agtgaaagag agtaccatgc tggcggcgca gagggaaggg gcctacaccc gtcttggggc
1681 tcgccccacc cagggctccc tcctggagca tcccaggcgg gcggcacgcc agacagcccc
1741 ccccttgaat ctgcagggag caactctcca ctccatattt atttaaacaa ttttttcccc
1801 aaaggcatcc atagtgcact agcattttct tgaaccaata atgtattaaa attttttgat
1861 gtcagccttg catcaagggc tttatcaaaa agtacaataa taaatcctca ggtagtactg
1921 ggaatggaag gctttgccat gggcctgctg cgtcagacca gtactgggaa ggaggacggt
1981 tgtaagcagt tgttatttag tgatattgtg ggtaacgtga gaagatagaa caatgctata
2041 atatataatg aacacgtggg tatttaataa gaaacatgat gtgagattac tttgtcccgc
2101 ttattctgct ccctgttatc tgctagatct agttctcaat cactgctccc ccgtgtgtat
2161 tagaatgcat gtaaggtctt cttgtgtcct gatgaaaaat atgtgcttga aatgagaaac
2221 tttgatctct gcttactaat gtgccccatg tccaagtcca acctgcctgt gcatgacctg
2281 atcattacat ggctgtggtt cctaagcctg ttgctgaagt cattgtcgct cagcaatagg
2341 gtgcagtttt ccaggaatag gcatttgcct aattcctggc atgacactct agtgacttcc
2401 tggtgaggcc cagcctgtcc tggtacagca gggtcttgct gtaactcaga cattccaagg
2461 gtatgggaag ccatattcac acctcacgct ctggacatga tttagggaag cagggacacc
2521 ccccgccccc cacctttggg atcagcctcc gccattccaa gtcgacactc ttcttgagca
2581 gaccgtgatt tggaagagag gcacctgctg gaaaccacac ttcttgaaac agcctgggtg
2641 acggtccttt aggcagcctg ccgccgtctc tgtcccggtt caccttgccg agagaggcgc
2701 gtctgcccca ccctcaaacc ctgtggggcc tgatggtgct cacgactctt cctgcaaagg
2761 gaactgaaga cctccacatt aagtggcttt ttaacatgaa aaacacggca gctgtagctc
2821 ccgagctact ctcttgccag cattttcaca ttttgccttt ctcgtggtag aagccagtac
2881 agagaaattc tgtggtggga acattcgagg tgtcaccctg cagagctatg gtgaggtgtg
2941 gataaggctt aggtgccagg ctgtaagcat tctgagctgg cttgttgttt ttaagtcctg
3001 tatatgtatg tagtagtttg ggtgtgtata tatagtagca tttcaaaatg gacgtactgg
3061 tttaacctcc tatccttgga gagcagctgg ctctccacct tgttacacat tatgttagag
3121 aggtagcgag ctgctctgct atgtccttaa gccaatattt actcatcagg tcattatttt
3181 ttacaatggc catggaataa accattttta caaaa
//
LOCUS HUMPPPB1A1 276 bp ds-DNA PRI 02-AUG-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x.
ACCESSION M33688
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 1 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 276)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept / 34 + 243 protein phosphotyrosyl phosphatase 1B, exon x
(EC 3.1.3.48) (AA at 34)
pre-msg < 1 > 276 PTP1B mRNA and introns
IVS < 1 33 PTP1B intron x-1
IVS 244 > 276 PTP1B intron x
BASE COUNT 56 a 77 c 69 g 74 t
ORIGIN Chromosome 20q13.1-q13.2.
1 ctttagaatc tactagatga ttttctcttt cagacccaag aaactcgaga gatcttacat
61 ttccactata ccacatggcc tgactttgga gtccctgaat caccagcctc attcttgaac
121 tttcttttca aagtccgaga gtcagggtca ctcagcccgg agcacgggcc cgttgtggtg
181 cactgcagtg caggcatcgg caggtctgga accttctgtc tggctgatac ctgcctcttg
241 ctggtaagga ggcctcgcgg gtgccctggg gagctc
//
LOCUS HUMPPPB1A2 453 bp ds-DNA PRI 02-AUG-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+1.
ACCESSION M33687
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 2 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 453)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept + 236 + 397 protein phosphotyrosyl phosphatase 1B, exon
x+1 (EC 3.1.3.48)
pre-msg < 1 > 453 PTP1B mRNA and introns
IVS < 1 235 PTP1B intron x
IVS 398 > 453 PTP1B intron x+1
BASE COUNT 104 a 118 c 111 g 120 t
ORIGIN Chromosome 20q13.1-q13.2.
1 ggggaggtcc cagactctta accagatctc ttgtgaatgc attgcctcag ggaggcacca
61 agcctttcat gaggacctgt ccccctgacc cagacacctc ccacccagcc ccacctccaa
121 cactagggat cacatttcag catgagattg ggaggggaca gacatctaac ggtgttatta
181 acgttgccct tgagaattgg acctggctga cttatatctc ctctctggct ttcagatgga
241 caagaggaaa gacccttctt ccgttgatat caagaaagtg ctgttagaaa tgaggaagtt
301 tcggatgggg ctgatccaga cagccgacca gctgcgcttc tcctacctgg ctgtgatcga
361 aggtgccaaa ttcatcatgg gggactcttc cgtgcaggtc agcattgcct ttgtttgaat
421 ccaggtgtga ccattttaac ttttttgtct ttg
//
LOCUS HUMPPPB1A3 426 bp ds-DNA PRI 02-AUG-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+2.
ACCESSION M33686
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 3 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 426)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept + 46 + 269 protein phosphotyrosyl phosphatase 1B, exon
x+2 (EC 3.1.3.48)
pre-msg < 1 > 426 PTP1B mRNA and introns
IVS < 1 45 PTP1B intron x+1
IVS 270 > 426 PTP1B intron x+2
BASE COUNT 115 a 122 c 109 g 80 t
ORIGIN Chromosome 20q13.1-q13.2.
1 gaagtgaaca ctaatagact tccttcctct tgctgctctt tcaaggatca gtggaaggag
61 ctttcccacg aggacctgga gcccccaccc gagcatatcc ccccacctcc ccggccaccc
121 aaacgaatcc tggagccaca caatgggaaa tgcagggagt tcttcccaaa tcaccagtgg
181 gtgaaggaag agacccagga ggataaagac tgccccatca aggaagaaaa aggaagcccc
241 ttaaatgccg caccctacgg catcgaaagg taatatattg ggtccagctt gttggggtga
301 ggggaaatga cttctgttct agaaacacac gctggtactg aaaccctgtg atgcagcctc
361 tgttggcaag cagcgcttcg catccttggg aacagggcgc tggaccaaca cccactccac
421 tggtgg
//
LOCUS HUMPPPB1A4 732 bp ds-DNA PRI 02-AUG-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+3.
ACCESSION M33685
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 4 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 732)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept + 402 + 597 protein phosphotyrosyl phosphatase 1B, exon
x+3 (EC 3.1.3.48)
pre-msg < 1 > 732 PTP1B mRNA and introns
IVS < 1 401 PTP1B intron x+2
IVS 598 > 732 PTP1B intron x+3
BASE COUNT 164 a 193 c 205 g 170 t
ORIGIN Chromosome 20q13.1-q13.2.
1 tctgtagctc taaagaatga gatctggtgt actgatgtgg ccagacattg caattgcagt
61 acatgagaag gcaaatcata cagtagtgtg tacaccagtg agtcctccag ccagataaat
121 cctcacagtg accagtcgcc caggcacctt gtgaacccta ccctgggtgt gggtgctatc
181 tgaagtacct gggggagggg gtgacaagtg gacttcaggc tgatgtggcc ctggcctggc
241 cctccctcca agcagagggg gctggcacgc tggaaggtta acatcatcca actctgtcta
301 cacgtggctt gttttttcct agaattcctg ccacaatagc agcatccttg ccattcattt
361 tctccaaagt gagtacccat ctctgccctc tgattcctca gcatgagtca agacactgaa
421 gttagaagtc gggtcgtggg gggaagtctt cgaggtgccc aggctgcctc cccagccaaa
481 ggggagccgt cactgcccga gaaggacgag gaccatgcac tgagttactg gaagcccttc
541 ctggtcaaca tgtgcgtggc tacggtcctc acggccggcg cttacctctg ctacagggta
601 tgtttccact gacagacgcg ctgggcagat gctcgtgtgc agagagcact ggccgctagc
661 ccgatggtag gattcagttc tgtggtgcat ctgagccagt ctcagaagaa acagatcaag
721 gttttaagtc tg
//
LOCUS HUMPPPB1A5 365 bp ds-DNA PRI 02-AUG-1990
DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+4.
ACCESSION M33684
KEYWORDS protein phosphotyrosyl phosphatase.
SEGMENT 5 of 5
SOURCE Human DNA, (library of Clontech), clone lambda-10-2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 365)
AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
Bruskin,A., Green,N.R. and Hill,D.E.
TITLE Molecular cloning and chromosome mapping of the human gene encoding
protein phosphotyrosyl phosphatase 1B
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Hill, 13-APR-1990.
FEATURES from to/span description
pept + 266 289 protein phosphotyrosyl phosphatase 1B, exon
x+4 (EC 3.1.3.48)
pre-msg < 1 > 365 PTP1B mRNA and introns
IVS < 1 265 PTP1B intron x+3
BASE COUNT 80 a 101 c 88 g 96 t
ORIGIN Chromosome 20q13.1-q13.2.
1 tacctcctaa gacttttacg gttttaaata ttttacctct ttccaggtgg catctgagta
61 catcagatgg ttttgcaaaa tgcaaacaat tttttccttg gggatgattt ttggggagag
121 ggggctactg taaaaaataa aaccaaaacc ccctttgctc cctcggaggt tgaagttgcc
181 ggggggtgtg gccggggtca tgcatgaggc gacagcactg caggtgcggg tctgggctca
241 tctgaactgt ttggtttcat tccagttcct gttcaacagc aacacatagc ctgaccctcc
301 tccactccac ctccacccac tgtccgcctc tgcccgcaga gcccacgccc gactagcagg
361 catgc
//
LOCUS YSCIRA2A 9859 bp ds-DNA PLN 02-AUG-1990
DEFINITION S.cerevisiae IRA2 protein gene, complete cds.
ACCESSION M33779
KEYWORDS IRA2 protein.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Unclassified.
REFERENCE 1 (bases 1 to 9859)
AUTHORS Tanaka,K., Nakafuku,N.M., Tamanoi,F., Kagiro,Y., Matsumoto,K. and
Toh-i,A.
TITLE IRA2, a second gene of Saccharomyces cerevisiae that encodes a
protein with a domain homologous to mammalian ras GTPase activating
protein
JOURNAL Mol. Cell. Biol. 10, 4303-4313 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Tanaka, 19-APR-1990, for release after publication.
University of Chicago
Dept. of Biochem and Mol Biol
920 E. 58th Street
Chicago, IL 60637
FEATURES from to/span description
pept 231 9470 IRA2 protein (IRA2)
BASE COUNT 3163 a 1908 c 1735 g 3053 t
ORIGIN
1 gagctcctgg gaaacaaaag accgaaaaag cgaaaacaaa atcagaacaa ggcttaagta
61 ctttttcacc aattgtagca aacatttaac cacattttag cacactagca tatagcattg
121 tcctctgtta ttcgttttgc ttttctcctt tagtgttact tttcccccaa cgttacacca
181 ttttttgata tcaactaaac tgtatacatt atctttcttc agggagaagc atgtcccagc
241 ccactaagaa taagaagaaa gaacacggga ccgattccaa gtcatcccgc atgactcgga
301 cgttggttaa tcatattctt tttgaaagaa ttctcccgat ccttccggtg gagtctaatc
361 taagtaccta ttcggaagtg gaagagtatt cctcattcat ttcatgcaga tctgtgctca
421 ttaacgttac cgtttcccga gatgcaaatg ctatggtgga aggcaccttg gagttgatag
481 aatcgcttct tcaagggcac gaaatcattt cagataaggg tagcagtgac gttattgaat
541 caatactgat tatactaaga ttgttaagtg atgcgctaga gtataattgg caaaatcaag
601 aaagccttca ttacaacgac atttcgactc acgtagaaca tgaccaagaa cagaagtaca
661 gaccaaagct taacagtatt ctgcccgact actcgtcgac tcattccaat ggcaacaaac
721 actttttcca ccagagcaaa cctcaggcac tgataccgga actggcatcg aaattgcttg
781 agagttgcgc gaagttgaag ttcaatacaa gaactttgca aattttacaa aatatgatca
841 gtcatgttca tggaaacatt ctaacgactt tgagttcctc gattcttccc cgccacaaat
901 cctatctgac aaggcacaac catccttctc attgtaaaat gattgactct actctaggcc
961 atattctccg atttgtagcg gcttccaatc cgtccgagta ttttgaattt atcagaaaga
1021 gtgtgcaagt gcccgtaaca cagacacaca cgcattcaca ctcccattca cactctttgc
1081 catcttccgt ttataacagc atagtgcccc actttgatct tttcagcttc atctatttaa
1141 gcaagcataa ttttaagaaa tacttggaac tcatcaaaaa cttatcggtg acgttaagga
1201 aaacgattta tcattgccta cttttgcatt acagcgccaa agcaataatg ttttggataa
1261 tggctaggcc tgcggaatat tatgaactct tcaacttatt aaaagataat aacaatgaac
1321 actcgaaatc cttaaacacg ttaaaccata cacttttcga ggagatccat tcgactttta
1381 atgtgaatag catgataacc accaatcaaa atgctcatca aggctcatct tccccttcgt
1441 cctcctcgcc atcgtcacca cctagctcat catcatcgga taacaacaat caaaacataa
1501 tagcaaaatc cttaagtcgt cagctttctc accaccagtc atacattcaa cagcagtctg
1561 aaagaaaact acattcttca tggactacaa actctcaatc ctctacttca ctgtcatctt
1621 caacgtctaa ttcaacaaca actgatttct ctactcacac tcaaccagga gaatatgacc
1681 cttccttacc agatactccc acgatgtcta acatcactat tagtgcatct tcattattat
1741 ctcaaactcc aactccaaca acacaattgc aacagcggtt gaactcagca gctgcagccg
1801 ccgccgcagc tgcttcacca tcgaattcca ccccaactgg atacacagca gagcaacaaa
1861 gtcgcgcttc atacgatgca cacaaaactg gccatactgg taaggattat gacgaacatt
1921 ttttgtctgt cactcgtttg gataatgttt tggagttata cacgcacttt gatgatactg
1981 aggtactacc acacacatcc gtactgaagt ttttaactac tttgacaatg ttcgatattg
2041 acctttttaa tgaattaaac gctacatcat tcaaatatat tcctgattgt actatgcatc
2101 gtccaaaaga aagaacaagt tctttcaata atactgcaca cgagacaggt tccgaaaaga
2161 cttcgggtat aaaacatatt acacaaggct taaagaaatt aacttcttta ccttcctcaa
2221 ccaaaaaaac tgtaaaattt gtgaagatgt tgctaagaaa tttaaatggg aatcaagctg
2281 tatcagatgt tgccctctta gatacaatga gggccttact atcattcttt acaatgactt
2341 ctgcggtctt tctcgtggat agaaacttac cctcagtact ttttgccaag agactcatcc
2401 ccataatggg gacaaattta agcgtcggtc aagactggaa ttcaaaaata aataacagtt
2461 tgatggtttg tttgaaaaaa aactccacca cgtttgttca attacaatta atattcttct
2521 cttcagctat tcaattcgat catgaattat tactggcacg tctgagcatc gatacaatgg
2581 ccaacaattt aaacatgcag aagctatgcc tttatactga aggattcagg atattcttcg
2641 acataccaag taagaaggaa ttgcggaagg caattgcggt taaaatttct aaatttttca
2701 aaacattatt ctccattata gcagatattc ttttacaaga atttccgtat tttgatgagc
2761 aaatcaccga catagttgct tccattcttg acggtacaat tatcaatgag tatggtacga
2821 agaaacattt caaggggagc tcaccctctt tatgttcgac aacccggtca agatcaggat
2881 ctacatctca aagttcaatg acaccagttt ctccgctggg actggatact gatatatgtc
2941 caatgaacac cctgtcttta gttggttcaa gtacttcaag aaattctgac aacgttaatt
3001 cattaaacag ttcaccaaag aacttgtctt ctgatccata cttgtcacat cttgtggccc
3061 caagagcgcg tcatgcttta ggtgggccat ctagtattat aaggaataaa ataccgacta
3121 cattgacttc acctccagga acggaaaaat cttcaccagt acaacgtccg caaacggaaa
3181 gcatcagtgc cacaccaatg gccataacaa attctactcc attatcgtcg gcagcattcg
3241 gaattcgatc gcctttgcag aaaataagaa cgaggcgtta ttccgatgaa agtttaggta
3301 aattcatgaa atcaacaaat aattacattc aagaacattt gataccaaaa gatttgaatg
3361 aagcaactct tcaagatgct agaagaataa tgattaatat tttcagtatt tttaagagac
3421 cgaatagtta cttcatcatt cctcacaata taaactcgaa tttacaatgg gtttcgcagg
3481 attttagaaa tattatgaaa ccgattttcg tcgccatcgt aagtccggat gtagatttac
3541 agaatactgc tcaatcattc atggatacct tattatcgaa tgttattact tatggtgaat
3601 cagatgagaa tatcagtatt gaagggtatc atcttctttg cagttacact gtaacattat
3661 ttgcaatggg ccttttcgat ttgaaaatta ataatgaaaa gcgtcaaatt ctcttggata
3721 taactgtcaa gtttatgaag gttagatcac atttagcagg gatcgcggag gcctcacacc
3781 acatggaata cataagtgat tctgaaaaac tcacctttcc gctgattatg gggactgttg
3841 gtagggccct atttgtttca ttatactcta gtcaacaaaa aattgaaaag actttaaaga
3901 ttgcttacac agagtatctt tctgcaatca attttcatga gaggaatatt gatgatgctg
3961 ataaaacttg ggttcataat attgagtttg tagaagcgat gtgtcatgac aactacacaa
4021 cttctggttc aattgctttc caaaggagga caagaaataa tattttacga tttgctacta
4081 ttcctaacgc tatcttactt gattctatga ggatgatcta taagaagtgg catacttaca
4141 cacacagtaa aagtttagaa aaacaagaac ggaacgactt cagaaatttc gcgggtattt
4201 tagcctcttt gtcgggtatc ctattcatca ataaaaagat attgcaagaa atgtatccat
4261 acctactcga caccgtttca gaattgaaaa aaaatataga ctcttttatc tcaaaacaat
4321 gccaatggtt aaactatccg gatttattaa cgagagaaaa ttcaagagat attctaagtg
4381 tagaactgca tcctttgtct tttaacttac tttttaataa tttgaggctc aagttaaaag
4441 aacttgcttg ttcagactta tcaataccag aaaatgaaag ttcctatgtt ttattagaac
4501 aaataatcaa aatgctgcgg acaatcctag gtcgtgatga tgacaattat gtaatgatgc
4561 ttttttccac agagattgta gatcttattg atttattgac agatgaaata aaaaaaatac
4621 cagcctattg tccaaaatat ctcaaggcaa ttattcaaat gaccaaaatg ttcagtgcct
4681 tgcagcactc agaggttaat ttaggtgtca aaaatcattt tcacgttaaa aataaatggt
4741 tgaggcaaat cactgattgg tttcaagtga gtattgcgag agagtacgat ttcgaaaact
4801 tgtcaaaacc tctaaaagaa atggatttgg taaaaagaga catggatatt ctatacatag
4861 atacggcaat cgaagcttca accgctattg cgtacctcac gagacatact ttcttagaga
4921 ttccacctgc cgcgtcagat cccgaactat ctcgatctag gtctgtgata tttgggtttt
4981 atttcaacat cttaatgaaa ggccttgaaa aaagtagtga tcgtgacaat tacccagtat
5041 tcttgaggca caaaatgagt gtcctcaacg acaatgtaat actttcatta acaaatcttt
5101 caaacaccaa tgttgatgcg agtttgcagt tcaccttacc gatgggctat tccggaaatc
5161 gaaacattag gaatgcattt ttggaggtct tcattaatat cgttacgaac tatcggacat
5221 acacggctaa aactgacctt ggaaaattag aggcagcaga caaatttttg cgatatacga
5281 ttgaacatcc ccagctatcg tcctttggag cagcggtttg tcccgctagc gatattgatg
5341 cttatgctgc tggcttaata aatgcatttg aaacgaggaa tgccacccac attgtagtgg
5401 cacagttgat taaaaatgaa attgaaaaat cttccagacc tacggatatc cttagaagaa
5461 atagctgtgc tacgagatca ttatctatgc tagccaggtc caagggtaac gaatatttga
5521 ttcgcacttt gcaaccatta ctaaaaaaaa ttatccagaa cagagatttt tttgaaattg
5581 agaaactaaa accggaagat tcagatgctg aacgtcaaat agagctcttc gttaaataca
5641 tgaatgaatt attggaatcc atatccaact ccgtatctta ttttccccct cctttatttt
5701 atatttgcca aaacatttat aaagttgcgt gtgaaaaatt tccggatcac gcaattatcg
5761 ccgctgggtc tttcgtgttt ttacggtttt tttgtcctgc tttagtcagc cctgattctg
5821 aaaatatcat agatatttct cacttgagcg aaaagcgtac cttcatcagc ttggctaaag
5881 ttatccaaaa tattgccaat ggctcagaaa atttctccag atggccagct ttgtgttccc
5941 aaaaggattt tcttaaggaa tgtagcgata gaattttcag attcctagct gaactttgta
6001 gaacagatcg cacgatagac atccaagtga gaacagaccc aacgccaatt gcatttgact
6061 atcaattcct tcattccttt gtttaccttt acggtcttga ggtgagaagg aatgtgctaa
6121 atgaagcaaa acatgatgat ggtgacattg atggtgacga tttctataag accacatttt
6181 tacttattga tgatgttctt ggccaattag gccaacctaa aatggaattt tccaatgaaa
6241 taccaatata cataagagaa catatggacg actatccgga actgtatgag ttcatgaata
6301 ggcacgcgtt cagaaacatt gagacttcaa cagcgtacag cccaagcgtt cacgagtcca
6361 cctcaagtga aggcattcca attattacgt taacaatgtc aaatttctca gacagacatg
6421 tggacattga tacagttgct tacaagttct tgcaaattta tgctcgaatc tggaccacca
6481 aacactgttt aataatcgac tgtacagaat ttgacgaggg agggcttgat atgaggaaat
6541 ttatttcttt ggttatggga ctattaccag aagttgcacc caaaaattgt ataggctgtt
6601 actactttaa cgtaaacgag acatttatgg ataattatgg aaaatgtttg gacaaagaca
6661 acgtatatgt ttcctcgaaa attcctcatt atttcattaa tagtaactct gatgaaggac
6721 ttatgaaatc tgtgggtata actggacaag ggttgaaggt tctgcaagat attcgtgtct
6781 ctctgcatga tatcacgctt tatgacgaaa aaagaaatag atttacgccg gtatcgttga
6841 aaataggcga tatttacttt caagtcttgc atgaaactcc taggcaatat aaaataaggg
6901 acatgggtac tttattcgac gtaaaattca atgatgtcta cgaaattagc cgaatatttg
6961 aagtacatgt ttcgtcaata actggagtgg cagctgaatt tacagtaact tttcaggacg
7021 agagaaggtt gatttttagt agtccgaaat accttgaaat tgtgaagatg ttctattacg
7081 cacagatccg gttagaaagt gaatatgaaa tggataataa ttcgagtacc tcctccccaa
7141 attcaaacaa caaggtcaaa cagcagaaag agagaacaat tctattgtgc cacctactgt
7201 tagtatctct tattggtctg tttgatgaga gtaaaaaaat gaaaaacagt tcgtataacc
7261 taatagctgc cactgaggcg tcatttggtt tgaactttgg ctcccatttt catcgctctc
7321 ccgaggtgta cgtccccgaa gatactacaa catttttagg tgttattgga aagtctcttg
7381 cagagtctaa tccagaactc acagcctata tgtttatcta tgttttggag gcattgaaga
7441 acaacgtaat tcctcacgtt tacatccctc ataccatttg cggtttgtct tattggatcc
7501 ctaatttata ccaacatgtg tatttggctg atgatgaaga aggccccgaa aacatatctc
7561 acattttccg aattcttatc aggctctctg tgagagagac tgactttaaa gccgtataca
7621 tgcaatatgt ttggttgcta cttttagatg atggccgctt aactgacatt atcgttgatg
7681 aagttattaa tcatgcgtta gaaagagact ccgaaaaccg cgattggaag aaaacaatat
7741 cgttactgac tgtcctaccc actactgagg ttgctaataa tattattcaa aaaatattgg
7801 caaaaattag atcattttta ccgtcattga agttagaagc tatgacccaa agttggtctg
7861 aactaacaat attagttaag ataagcatcc acgttttttt tgaaacttct ttgctggtac
7921 agatgtactt accagagatc ctgtttatcg tatccttatt aattgatgtt ggtccaaggg
7981 aactcagatc atcactacac cagctattaa tgaatgtatg ccattccttg gctattaact
8041 cagctttacc acaagatcat agaaataatc tagatgaaat aagtgatata tttgcacatc
8101 aaaaggtgaa gtttatgttt gggttcagcg aggacaaagg acgaatttta cagattttta
8161 gcgcttcttc ttttgcaagc aagtttaata ttctggattt cttcatcaat aatatattat
8221 tgctgatgga atattcttca acgtacgaag caaacgtgtg gaagacaaga tacaagaaat
8281 atgtcttgga atctgtgttt acaagtaatt cttttctttc ggcacgttca atcatgattg
8341 ttggtataat gggtaaatct tacataactg aagggttatg caaggctatg ttaattgaaa
8401 ccatgaaagt tatcgccgaa ccaaagatta ctgacgagca tcttttctta gccatatctc
8461 atatttttac ttattccaaa attgttgaag gtttggatcc caaccttgac ttaatgaagc
8521 acttattttg gttttcaaca ctcttccttg aatcacgtca cccgataatt tttgagggtg
8581 cccttctctt tgtgtcaaac tgtataaggc gcctatacat ggcccagttt gaaaatgaaa
8641 gcgaaacatc attgataagt actttactta aggggagaaa gtttgctcat acctttttaa
8701 gcaagataga gaatcttagt ggtattgttt ggaatgaaga taattttaca cacattctga
8761 ttttcatcat taataaagga ctatccaatc ctttcattaa gagtacggct tttgatttct
8821 tgaagatgat gtttagaaac tcctactttg agcatcaaat caatcagaaa tctgatcatt
8881 atttgtgcta tatgttccta ttgtattttg ttttaaactg caatcaattt gaggaacttt
8941 taggtgacgt tgattttgaa ggagaaatgg ttaacattga aaacaagaac accattccta
9001 aaattttgtt agagtggttg agttcggata acgaaaatgc aaacattacc ctctatcaag
9061 gtgcgatact gttcaaatgt tcagttacgg atgaaccaag tagatttagg tttgcgttga
9121 ttattaggca tctattgaca aagaaaccca tttgtgcatt gcgtttttac agtgttattc
9181 gtaacgaaat aagaaaaata tcagcatttg agcaaaattc ggattgtgtt ccacttgctt
9241 tcgatatttt aaacttatta gtgacgcatt cagagtctaa ttcgttagaa aaacttcacg
9301 aagaatccat tgaacgtcta accaaaagag gtttatcgat tgtgacttct tctggtatat
9361 ttgcgaagaa ctccgacatg atgatacctt tagatgtaaa acctgaagat atctatgaac
9421 gtaagagaat aatgacaatg attttatcaa ggatgtcatg ttctgcttag aggtgttaca
9481 taaactaatg aaagaaatat caatatctat ctgtaagcat gaatgtacat atctcatgtt
9541 agggttttct tatcgctaat ttttcgcaat ttgttacgtg ggttgctttt atacagctac
9601 aatttttata tattctatcg tgtaatgaat ggctcagtaa attcaagcgc cacatagact
9661 aatgtacata ccaatgcatt ttaattgtaa gaataaaagg ggccattcat ctaccgtctt
9721 agttgaaagt gtttctgtga attttttcaa attccgtttt ttccttttta tataatagca
9781 tggtggcacg agcatcttcg actgaagaat gctcaccttc ttgaatggaa atttttaaaa
9841 cctccctggt taatttctt
//
LOCUS HUMMHDQ3L 967 bp ds-DNA PRI 02-AUG-1990
DEFINITION Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment,
long terminal repeat region.
ACCESSION M33841
KEYWORDS major histocompatibility complex.
SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 967)
AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S.
TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA-DQ
locus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4927-4931 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.S.Lee, 25-APR-1990.
FEATURES from to/span description
rpt 1 7 inverted repeat A
rpt 961 967 inverted repeat B
rpt 560 810 R region
signal 792 797 poly-A signal
site 74 79 1/2 GRE
site 80 87 enhancer core
site 530 536 TATA box
BASE COUNT 254 a 235 c 222 g 256 t
ORIGIN Chromosome 6p21.3.
1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct
121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa
181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc
301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt
361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa
421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa
481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg
541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg
601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc
661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg
721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc
781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga
841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct
901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac
961 ccctaca
//
LOCUS HUMMHDQ5L 960 bp ds-DNA PRI 02-AUG-1990
DEFINITION Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment,
long terminal repeat region.
ACCESSION M33842
KEYWORDS major histocompatibility complex.
SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 960)
AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S.
TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA-DQ
locus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4927-4931 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.S.Lee, 25-APR-1990.
FEATURES from to/span description
rpt 1 7 inverted repeat A
rpt 954 960 inverted repeat B
rpt 555 803 R region
signal 785 790 poly-A signal
site 74 79 1/2 GRE
site 80 87 enhancer core
site 525 531 TATA box
BASE COUNT 250 a 232 c 219 g 259 t
ORIGIN Chromosome 6p21.3.
1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct
121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa
181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag
301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa
361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt
421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa
481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt
541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca
601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac
661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct
721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat
781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg
841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa
901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca
//
LOCUS NEUAMTR 5928 bp ds-DNA PLN 02-AUG-1990
DEFINITION N.crassa mating type protein gene, complete cds.
ACCESSION M33876
KEYWORDS mating type protein.
SOURCE N.crassa (strain 74-ORS-A) DNA.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 5928)
AUTHORS Glass,N.L., Grotelueschen,J. and Metzenberg,R.L.
TITLE The Neurospora crassa A mating-type region
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4912-4916 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.L.Glass, 26-APR-1990.
FEATURES from to/span description
pept 4121 4283 mating type protein, exon 1
4343 5046 mating type protein, exon 2
IVS 4284 4342 mating type protein intron A
signal 3858 3906 GC signal
signal 3945 3949 CAAT box
signal 5319 5326 poly-A signal
BASE COUNT 1470 a 1459 c 1526 g 1473 t
ORIGIN
1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg
61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga
121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct
181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct
241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag
301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa
361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga
421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa
481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg
541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg
601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg
661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc
721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg
781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc
841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg
901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc
961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat
1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa
1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc
1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg
1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt
1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa
1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc
1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca
1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat
1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac
1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca
1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta
1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga
1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt
1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa
1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac
1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc
1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc
2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg
2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga
2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg
2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca
2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca
2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc
2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca
2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat
2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag
2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc
2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca
2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg
2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag
2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg
2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata
2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg
3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca
3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca
3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt
3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt
3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca
3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag
3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac
3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta
3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca
3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac
3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg
3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca
3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt
3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt
3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc
3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa
3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc
4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt
4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat
4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc
4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt
4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat
4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag
4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat
4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg
4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg
4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg
4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag
4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat
4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt
4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct
4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc
4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat
4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg
5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca
5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc
5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg
5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg
5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg
5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg
5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca
5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa
5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct
5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg
5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag
5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg
5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc
5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc
5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac
//
LOCUS PFARGEB 2126 bp ds-DNA INV 02-AUG-1990
DEFINITION P.falciparum 18S ribosomal RNA not in asexual parasites.
ACCESSION M19173
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.falciparum DNA, clone B.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 2126)
AUTHORS McCutchan,T.F., de la Cruz,V.F., Lal,A.A., Gunderson,J.H.,
Elwood,H.J. and Sogin,M.L.
TITLE Primary sequences of two small subunit ribosomal RNA genes from
Plasmodium falciparum
JOURNAL Mol. Biochem. Parasitol. 28, 63-68 (1988)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 2126; revises [1])
AUTHORS Sogin,M.L.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Computer-readable copy of sequence for [2] kindly submitted by
M.L.Sogin, 01-AUG-1990.
SOGIN@EVOL1.MBL.EDU
BASE COUNT 704 a 296 c 412 g 714 t
ORIGIN
1 aacctggttg atcttgccag tagtcatatg cttgtctcac agattaagcc atgcaagtga
61 aagtacatat gtaagtaata tattgaaact gcgaacggct cattaaaaca gttataatct
121 acttgatgtt tttaatataa ggataactac ggaaaatctg tagctaatac ttgtgaaaat
181 accttttgat atatacatat gtatgtatca ttaaggtatg tatttgttag atataagaat
241 aaaataaata taattgaatt ataacaaaga agaaacacat aaaatatgtg tattatcaat
301 cgagtatctg acctatcagc ttttgatgtt agggtattga cctaacatgg ctatgacggg
361 taacggggaa ttagagttcg attccggaga gggagcctga gaaatagcta ccacatctaa
421 ggaaggcagc aggcgcgtaa attacccaat tctaaaaaag agaggtagtg acaagaaata
481 acaatacaat atcgaaaaat gattttgtaa ttggaatgat aggaatttac aaggttccta
541 gagaaacaat tggagggcaa gtctggtgcc agcagccgcg gtaattccag ctccaatagc
601 atatattaaa attgttgcag ttaaaacgtt cgtagttgaa tattaaagaa tccgatgttt
661 catttaaact ggtttgggaa aaccaaatat attatatatt ttgctttgtt caaaataagg
721 ttttctaata aattatgttt ttatcagata tgacagaatc ttttttaaaa tctcttcaat
781 atgcttttat tgcttttgag aggttttgtt actttgagta aaattaagtg ttcataacag
841 acgggtagtc atgattgagt tcattgtgtt tgaatactac agcatggaat aacaaatatg
901 aataagctaa ttattttttt tttcattatt ttttttgata ttcttattag cttagttacg
961 attaatagga gtagctttgg gggcattcgt attcagatgt cagaggtgaa attctaagat
1021 tttctggaga cggactactg cgaaagcatt tgcctaatct atttccatta atcaagaacg
1081 aaagttaagg gagtgaagac gatcagatac cgtcgtaatc ttaaccataa actataccga
1141 ctaggtgttg gatgaatata aaaaatatat aaatatgtag catttcttag ggaatgttga
1201 ttttatatta gaattgcttc cttcagtacc ttatgagaaa tcaaagtctt tgggttctgg
1261 ggcgagtatt cgcgcaagcg agaaagttaa aagaattgac ggaagggcac caccaggcgt
1321 ggagcttgcg gcttaatttg actcaacacg ggaaaactca ctagtttaag acaagagtag
1381 gattgacaga ttaatagctc tttcttgatt tcttggatgg tgatgcatgg ccgtttttag
1441 ttcgtgaata tgatttgtct ggttaattcc gataacgaac gagatcttaa cctgctaatt
1501 agcggtaagt acactatatt tttatttgaa attgaatata ggtaattata catgtttatt
1561 cagtgttcaa attaggatat tttttttatt aaaatattct tttccctgtt ttactaataa
1621 tttgtttttt tttactctat ttctctcttc ttttaagaat gtacttgctt gattaaataa
1681 agcttcttag aggaacagtg tgtatctaac acaaggaagt ttaaggcaac aacaggtctg
1741 tgatgtcctt agataaacta ggctgcacgc gtgctacaat gatatatata acaagttgtt
1801 aaaaatgtac ttataaataa gtgtgtacaa tttttcctgt actgaaaagt ataggtaatc
1861 tttatcagta tatatcgtaa ttgggataga ttattgcaat tattaatctt gaacgaggaa
1921 gcccgtcgct cctaccgatt gaaagatatg ataaattgtt tggatatgaa ttaaaataat
1981 agaagtcgta acaaggtttc cgtaggtgaa cctgcggaag gatcattaat atatgaaacg
2041 ccgagctcga attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc
2101 acaattccac acaacatacg agccgg
//
LOCUS ALCRR45S 103 bp ss-rRNA RNA 02-AUG-1990
DEFINITION A.tuberosum chloroplast 4.5S ribosomal RNA.
ACCESSION M35406
KEYWORDS 4.5S ribosomal RNA.
SOURCE A.tuberosum chloroplast young leaf rRNA.
ORGANISM Chloroplast Allium tuberosum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Liliidae; Liliales; Liliaceae; Allium tuberosum.
REFERENCE 1 (bases 1 to 103)
AUTHORS Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L.
TITLE The nucleotide sequences of chloroplast 4.5 S rRNAs from four
species of plants, celery (Apium graveoleus), barley (Hordeum
vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina
communis)
JOURNAL FEBS Lett. 200, 193-196 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 103 4.5S ribosomal RNA
BASE COUNT 29 a 18 c 31 g 25 t
ORIGIN
1 taaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtctagtgg aagtgcagtg
61 atgtatgcag ctgaggcatc ctaatagacc gagagatttg aac
//
LOCUS APIRR45S 103 bp ss-rRNA RNA 02-AUG-1990
DEFINITION A.graveoleus chloroplast 4.5S ribosomal RNA.
ACCESSION M35404
KEYWORDS 4.5S ribosomal RNA.
SOURCE A.graveoleus chloroplast young leaf rRNA.
ORGANISM Chloroplast Apium graveoleus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Apiales; Apiaceae; Apium graveoleus.
REFERENCE 1 (bases 1 to 103)
AUTHORS Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L.
TITLE The nucleotide sequences of chloroplast 4.5 S rRNAs from four
species of plants, celery (Apium graveoleus), barley (Hordeum
vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina
communis)
JOURNAL FEBS Lett. 200, 193-196 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 103 4.5S ribosomal RNA
BASE COUNT 28 a 19 c 32 g 24 t
ORIGIN
1 gaaggtcacg gtgagacgag ccgtttatca ttacgatagg tgtctagtgg aagtgcagtg
61 atgtatgcag ctgaggcatc ctaacagacc ggcagatttg aac
//
LOCUS BLYRR45S 96 bp ss-rRNA RNA 02-AUG-1990
DEFINITION H.vulgare chloroplast 4.5S ribosomal RNA.
ACCESSION M35405
KEYWORDS 4.5S ribosomal RNA.
SOURCE H.vulgare chloroplast young leaf rRNA.
ORGANISM Chloroplast Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Hordeum vulgare.
REFERENCE 1 (bases 1 to 96)
AUTHORS Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L.
TITLE The nucleotide sequences of chloroplast 4.5 S rRNAs from four
species of plants, celery (Apium graveoleus), barley (Hordeum
vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina
communis)
JOURNAL FEBS Lett. 200, 193-196 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 96 4.5S ribosomal RNA
BASE COUNT 29 a 17 c 30 g 20 t
ORIGIN
1 taaggtagcg gcgagacgag ccgtttaaat aggtgtcaag tggaagtgca gtgatgtatg
61 cagctgaggc atcctaacga accgaacgat ttgaac
//
LOCUS CEVCG 371 bp ds-DNA circular VRL 02-AUG-1990
DEFINITION Citrus exocortis viroid RNA sequence, complete genome.
ACCESSION M34917
KEYWORDS complete genome.
SOURCE Citrus exocortis viroid (strain A) RNA.
ORGANISM Citrus exocortis viroid
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 371)
AUTHORS Visvader,J.E., Gould,A.R., Bruening,G.E. and Symons,R.H.
TITLE Citrus exocortis viroid: Nucleotide sequence and secondary
structure of an Australian isolate
JOURNAL FEBS Lett. 137, 288-292 (1982)
STANDARD simple staff_review
BASE COUNT 72 a 112 c 112 g 75 t
ORIGIN
1 cgggatcttt cttgaggttc ctgtggtgct cacctgaccc tgcaggcagg aaaagaaaaa
61 agaggcggcg ggggaagaag tccttcaggg atccccgggg aaacctggag gaagtcgagg
121 tcggggggga cagctgcttc ggtcgccgcg gatcactggc gtccagcgga gaaacaggag
181 ctcgtctcct tcctttcgct gctggctcca catccgatcg tcgctgaagc gcctcgcccc
241 ctcgcccgga gcttctctct ggagactacc cggtggaaac aactgaagct tcaaccccaa
301 accgcttttc ttgtatcttc actgctctcc gggcgagggt gaaagccctc ggaaccctag
361 attgggtccc t
//
LOCUS COMRR45S 103 bp ss-rRNA RNA 02-AUG-1990
DEFINITION C.communis chloroplast 4.5S ribosomal RNA.
ACCESSION M35407
KEYWORDS 4.5S ribosomal RNA.
SOURCE C.communis chloroplast young leaf rRNA.
ORGANISM Chloroplast Commelina communis
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Commelinales; Commelinaceae; Commelina communis.
REFERENCE 1 (bases 1 to 103)
AUTHORS Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L.
TITLE The nucleotide sequences of chloroplast 4.5 S rRNAs from four
species of plants, celery (Apium graveoleus), barley (Hordeum
vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina
communis)
JOURNAL FEBS Lett. 200, 193-196 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 103 4.5S ribosomal RNA
BASE COUNT 31 a 18 c 31 g 23 t 0 others
ORIGIN
1 taaggtagcg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagta
61 atgtatgcag ctgaggcatc ctaacagacc gagagatttg aac
//
LOCUS HUMBKM 615 bp ds-DNA PRI 02-AUG-1990
DEFINITION Human banded krait minor satellite (Bkm) repeat.
ACCESSION M35828
KEYWORDS banded krait minor satellite repeat.
SOURCE Human DNA, clone lambda-X13.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 615)
AUTHORS Erickson,R.P., Ross,C.E., Gorski,J.L., Stalvey,J.R.D. and
Drumm,M.M.
TITLE Bkm sequences from the human X chromosome contain large clusters of
GATA/GACA repeats
JOURNAL Ann. Hum. Genet. 52, 167-176 (1988)
STANDARD simple staff_review
BASE COUNT 258 a 90 c 87 g 180 t
ORIGIN X chromosome.
1 agctttggtt caatttttat ggtttttatg gaaggtacaa agttactaat gcaccacccc
61 acagtaagca ccagccccac atggtttcat agagaaatat tagaaaatct tcaaagattg
121 ggtagttgtg atgctacgaa aacttttcca aaacaaactc ctactgaagt tttatgaaat
181 actaaaatat tgatacttta acctaataga tagtacaata aaacctacat ataattgcac
241 ttatggatat agatgtacta attaaaatac tagcaaacat aacccaacac tgcattcaga
301 tataacatac catgaccaaa agaacttcat tctattaatg taaaaaaaat cacattaata
361 gaattaaggg gaaaacatat gattatctac agtgatgcta aatgaaacca acaatattta
421 aaattattct gataagaccc tcaagaactt gaattgatgg agattcaact atatatatat
481 aaatatatgt ttataaatat agatagacag atagatagat agacagatag atagatagat
541 agatagatag atagatagat agatagatat agatatagat actttaatcc taaaggcaag
601 atcttaccta ctgca
//
LOCUS HUMBKMX 206 bp ds-DNA PRI 02-AUG-1990
DEFINITION Human banded krait minor satellite (Bkm) repeat.
ACCESSION M35829
KEYWORDS banded krait minor satellite repeat.
SOURCE Human DNA, clone lambda-X81.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 206)
AUTHORS Erickson,R.P., Ross,C.E., Gorski,J.L., Stalvey,J.R.D. and
Drumm,M.M.
TITLE Bkm sequences from the human X chromosome contain large clusters of
GATA/GACA repeats
JOURNAL Ann. Hum. Genet. 52, 167-176 (1988)
STANDARD simple staff_review
BASE COUNT 93 a 25 c 39 g 49 t
ORIGIN X chromosome.
1 agtctacaga gctgcaagaa caccacatca taaatcttac gatttgaatg acatcgatag
61 atagatagat agatagatag atagatagat agatagatag atagatagat agatagatag
121 atatagatag atacatacat acatacatac acacatagaa atagagatat cctgatggaa
181 agaataaacc atgagtctgg taatag
//
LOCUS MENRR5S 128 bp ss-rRNA BCT 02-AUG-1990
DEFINITION M.tindarius 5S ribosomal RNA.
ACCESSION M34910
KEYWORDS 5S ribosomal RNA.
SOURCE M.tindarius (strain DSM 2095) rRNA.
ORGANISM Methanolobus tindarius
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanomicrobiales; Methanosarcinaceae.
REFERENCE 1 (bases 1 to 128)
AUTHORS Willekens,P., Stetter,K.O., Vandenberghe,A., Huysmans,E. and
De Wachter,R.
TITLE The structure of 5S ribosomal RNA in the methanogenic
archaebacteria Methanolobus tindarius and Methanococcus
thermolithotrophicus
JOURNAL FEBS Lett. 204, 273-278 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 128 5S ribosomal RNA
BASE COUNT 32 a 33 c 34 g 29 t
ORIGIN
1 ggagtttggc ggccatagcg gcggggcaat tcctgtaccc atcccgaaca cagaagataa
61 gcccgcctgc gttctatact gtactaaagt atgagagtct ttgggaaata tggaacgctg
121 ccatctcc
//
LOCUS MUSSPIX 980 bp ds-DNA ROD 02-AUG-1990
DEFINITION Mouse protease inhibitor gene, 5' flank.
ACCESSION M35824
KEYWORDS protease inhibitor gene.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 980)
AUTHORS Needham,M., Mills,J.S. and Parker,M.G.
TITLE Organization and upstream DNA sequence of the mouse protease
inhibitor gene
JOURNAL Nucleic Acids Res. 16, 6229-6229 (1988)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 964 > 980 protease inhibitor mRNA
BASE COUNT 324 a 200 c 208 g 248 t
ORIGIN
1 taaacagaaa ttgtcttgtg ctgttaggaa agaacatcaa actcttttta ttaagtccct
61 agttgtgtaa gagagcagta gccccatctg agtgttgctg ggaaatatgt gcctttattt
121 ttctagagag aaaaacaaaa ctaagctaaa ctatacaaaa caaaaacaac ttacactttg
181 tgataatggg agacattgct tcattggtca atactgttta ttatacaaac accctgggaa
241 agataatctg tctgaaacaa ggtgacagag ttttttctac tgggcaagac aggcagaaac
301 atgaaagaaa ctagagaccc agggtttggg tagatactca gtacacattg gttaataaag
361 aatcaatcct ttggattatt tactaccatt attttaattt tccaatgcat aagttaggca
421 ataggaaagt gaaccaattt cctagaggca tgcagctaat ctggtcagta cacacagccc
481 cgggacacag gagacagaac tgaactctcc ttatcttgta ccccaccctg gcaccagcct
541 atcaggactt aaggaataca cagcagataa agagacaacg gggggggggg gtgaaggacc
601 aagaattgga gcagaagaat ggaagaaaag gaagaatgga aatcctgttt gagttctgca
661 tgtctgccca tccattcaga actactgaag agacaaagga gacctttgac ttcccaaaca
721 gagaagaaaa cctcacccca ggaaggcttt cctaaaagag gatgtcagga aatgcagcta
781 atttgctttt cctggaagaa tttgggtcca aagttttctg cccttcttcc tacctaggtt
841 acattttccc cagcctggag gccacacaag tcaatcaata accaaagata cctattatag
901 ggcacagtgg gtggagcctg tattaccatc tgcctatatg accactcctc agtttctgaa
961 gagaagcacc ctgtatagtt
//
LOCUS MVTRR5S 120 bp ss-rRNA BCT 02-AUG-1990
DEFINITION M.thermolithotrophicus 5S ribosomal RNA.
ACCESSION M34911
KEYWORDS 5S ribosomal RNA.
SOURCE M.thermolithotrophicus (strain DSM 2095) rRNA.
ORGANISM Methanococcus thermolithotrophicus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanococcales; Methanococcaceae.
REFERENCE 1 (bases 1 to 120)
AUTHORS Willekens,P., Stetter,K.O., Vandenberghe,A., Huysmans,E. and
De Wachter,R.
TITLE The structure of 5S ribosomal RNA in the methanogenic
archaebacteria Methanolobus tindarius and Methanococcus
thermolithotrophicus
JOURNAL FEBS Lett. 204, 273-278 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 120 5S ribosomal RNA
BASE COUNT 30 a 33 c 31 g 26 t
ORIGIN
1 tgatataacg gtcatagcgg aggagccaca tccgatcccg tcccgatctc ggaaattaag
61 ccctccagcg ataccctaag tactgccatt tggtgggaac agggtgacgc tgttgatcac
//
LOCUS SF6LYZ 996 bp ds-DNA PHG 02-AUG-1990
DEFINITION Bacteriophage SF6 fragment D lysozyme gene, complete cds.
ACCESSION M35825
KEYWORDS lysozyme.
SOURCE Bacteriophage SF6 DNA.
ORGANISM Bacteriophage SF6
Viridae; ds-DNA viruses.
REFERENCE 1 (bases 1 to 996)
AUTHORS Verma,M.
TITLE Molecular cloning and sequencing of lysozyme gene of bacteriophage
SF6 of Bacillus subtilis
JOURNAL Curr. Microbiol. 13, 299-301 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 13 963 lysozyme
BASE COUNT 284 a 223 c 243 g 246 t
ORIGIN
1 ctgcagatcg taatggatcc tcgcctacgt gaagaagtag tacggctgat aatcgcatta
61 acgagtgata atggagcatc actgtcaaaa cggcttcaat caagggtctc ggcgctcgag
121 aagacgtctc aaatacactc tgatactatc ctccggatca cccagggact cgatgatgca
181 aacaaacgaa tcatcgctct tgagcaaagt cgggatgact tggttgcatc agtcagtgat
241 gctcaacttg caatctccag attggaaagc tctatcggag ccctccaaac agttgtcaat
301 ggacttgatt cgagtgttac ccagttgggt gctcgagtgg gacaacttga gacaggactt
361 gcagacgtac gcgttgatca cgacaatctc gttgcgagag tggatactgc agaacgtaac
421 attggatcat tgaccactga gctatcaact ctgacgttac gagtaacatc catacaagcg
481 gatttcgaat ctaggatatc cacgttagag cgcacggcgg tcactagcgc gggagctccc
541 ctctcaatcc gtaataaccg tataaccatg ggattaaatg atggactcac gttgtcaggg
601 aataatctcg ccatccgatt gccaggaaat acgggtctga atattcaaaa tggtggactt
661 cagtttcgat ttaatactga tcaattccag atagttaata ataacttgac tctcaagacg
721 actgtgtttg attctatcaa ctcaaggata ggcgcaactg agcaaagtta cgtggcgtcg
781 gcagtgactc ccttgagatt aaacagtagc acgaaggtgc tggatatgct aatagacatg
841 tcaacacttg aaattaattc tagtggacag ctaactgtta gatcgacatc cccgaatttg
901 aggtatccga tagctgatgt tagcggcggt atcggaatga gtccaaatta taggtttagg
961 tgaggatcag accaccccgc ggcactgggg ctgcag
//
LOCUS SYESC6 1071 bp ss-RNA VRL 02-AUG-1990
DEFINITION Sonchus yellow net virus M1 protein gene, complete cds.
ACCESSION M35689
KEYWORDS M1 protein.
SOURCE Sonchus yellow net virus RNA, clones pGL[9,32].
ORGANISM Sonchus yellow net virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Phytorhabdovirus.
REFERENCE 1 (bases 1 to 1071)
AUTHORS Hillman,B.I., Heaton,L.A., Hunter,B.G., Modrell,B. and Jackson,A.O.
TITLE Structure of the gene encoding the m1 protein of sonchus yellow net
virus
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.I.Hillman, 28-JUN-1990.
FEATURES from to/span description
pept 72 932 M1 protein
mRNA 1 1071 M1 mRNA
BASE COUNT 403 a 197 c 235 g 236 t
ORIGIN
1 aacttaaaat aataataata tcaaataatc aagcagacat aaaaacaggg ttatctctat
61 tgtatttcag aatggcaggt atatacgcag tttcaatcaa gggccatgca tcggcgatat
121 tcaaccgaca agagaaggaa atcagcacag gacgtgtttg ggaagttatg aagaagataa
181 tgtccttaaa gcctactagg gtcattatgt catattctct ccttagatca gcgttggata
241 aaagcaggca acttacccaa gaagaataca atattatgca attgatccta gacggctgtg
301 tcaaaaccct tgagcctgtt gcagccagcg ggatatgcat agatgtgaac ctgggaaaat
361 gcaccaagca cacaatccca tttggcatca caaataacga cgtaggtcac gtgtcagtag
421 tgatgacatt accatttttg gaagaaggat gttataacat aggcgcatgt tttgatggaa
481 gattgtccaa gtcaaggagc gacgcaagcc attatgctgt tgatgtaagt ctggaaattt
541 acttaaagtc attatccaga gacgaagctg aagaacaaat atccaaggga acatcagtgt
601 accctttcaa aattaatcat cctacatact ttgaagatga gactgataca tcagatggag
661 aaagtcttag tggacgagca agctcagatg atgggccaga agacggaggc cacggacatg
721 gtgataaaaa caatgagaaa aactctggga aggtggtaag gaaaaggaaa tcaaggaaag
781 agatcgatgt aggaaggttc aagatggtta aggataatat cataaacacc agatccggac
841 tgctaaaatc catgagagga acgggacatc ggaaacacag aacgcaggag atcacagaag
901 gatacaacta cggagacaag gacgcagaat gaaccaaccc accaaaagca gaaaacttgc
961 tactatataa taatatgaag agcccacaat aagatcacac aatgtcaaac agcacaactc
1021 caaatccatt aggtaatacc tgtattattc cgtgcactta tataagaaaa a
//
LOCUS BLYCPPSBEF 681 bp ds-DNA ORG 02-AUG-1990
DEFINITION Barley chloroplast cytochrome b-559 apoprotein (psbEF) genes,
complete cds.
ACCESSION M35616
KEYWORDS cytochrome b-559 apoprotein; psbE gene; psbF gene.
SOURCE Barley chloroplast DNA, clones pDS6 and pDS6-RBSII.
ORGANISM Chloroplast Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Hordeum vulgare.
REFERENCE 1 (bases 1 to 681)
AUTHORS Krupinska,K.
TITLE Characterization and in vitro expression of the cytochrome b-559
genes of barley. II. in vitro transcription and translation
JOURNAL Carlsberg Res. Commun. 53, 233-246 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 139 390 cytochrome b-559 9.4 kDa protein (psbE)
pept 401 520 cytochrome b-559 4.5 kDa protein (psbF)
binding 129 132 psbE ribosomal binding site (put.)
binding 390 394 psbF ribosomal binding site (put.)
BASE COUNT 202 a 131 c 123 g 225 t
ORIGIN
1 cgactcaaaa tacacctttg gtacaaaatt gacaatctca caaggatgaa atatcagtaa
61 ttttctattt actggttgat cccatctttt acggaatcaa ttcctttttt gaatgtacaa
121 aaattttggg agttcagcat gtctggaagc acgggagaac gttcttttgc tgatattatt
181 accagtattc gatactgggt tattcatagc attactatac cttccctatt cattgcgggt
241 tggttatttg tcagtacggg tttagcttat gacgtgtttg gaagtcctag gccaaacgag
301 tatttcacgg aaagccgaca aggaattccg ttaataaccg accgttttga ttctttagaa
361 caactcgatg aatttagtag atccttttag gaggccctca atgaccatag atcgaaccta
421 tcctattttt acagtgcgat ggctggctat tcacggacta gctgtaccta ctgttttttt
481 cttgggatca atatcagcaa tgcagttcat ccaacgataa accaaattcc aactatagaa
541 ctatgacaca atcaaacccg aatgaacaaa atgttgaatt gaatcgtacc agtctatact
601 ggggtttatt actcattttt gtacttgctg ttttattttc caattacttc ttcaattgag
661 agaaagaaag agactaacaa g
//
LOCUS BOVIL1BA 944 bp ds-DNA MAM 02-AUG-1990
DEFINITION Bovine interleukin-1 beta mRNA, complete cds.
ACCESSION M35589
KEYWORDS interleukin-1 beta.
SOURCE Bovine alveolar macrophage, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 944)
AUTHORS Leong,S.R., Flaggs,G.M., Lawman,M. and Gray,P.W.
TITLE The nucleotide sequence for the cDNA of bovine interleukin-1 beta
JOURNAL Nucleic Acids Res. 16, 9054-9054 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 45 845 interleukin-1 beta precursor
sigp 45 383 interleukin-1 beta signal peptide
matp 384 842 interleukin-1 beta
BASE COUNT 270 a 234 c 228 g 212 t
ORIGIN
1 attctctcca gccaaccttc attgcccagg tttctgaaac agccatggca accgtacctg
61 aacccatcaa cgaaatgatg gcttactaca gtgacgagaa tgagctgtta tttgaggctg
121 atgaccctaa acagatgaag agctgcatcc aacacctgga cctcggttcc atgggagatg
181 gaaacatcca gctgcagatt tctcaccagt tctacaacaa aagcttcagg caggtggtgt
241 cggtcatcgt ggccatggag aagctgagga acagtgccta cgcacatgtc ttccatgatg
301 atgacctgag gagcatcctt tcattcatct ttgaagaaga gcctgtcatc ttcgaaacgt
361 cctccgacga gtttctgtgt gacgcacccg ttcagtcaat aaagtgcaaa ctccaggaca
421 gagagcaaaa atccctggtg ctggctagcc catgtgtgct gaaggctctc cacctcctct
481 cacaggaaat gaaccgagaa gtggtgttct gcatgagctt tgtgcaagga gaggaaagag
541 acaacaagat tcctgtggcc ttgggtatca aggacaagaa tctatacctg tcttgtgtga
601 aaaaaggtga tacgcccacc ctgcagctgg aggaagtaga ccccaaagtc taccccaaga
661 ggaatatgga aaagcgcttt gtcttctaca agacagaaat caagaataca gttgaatttg
721 agtctgtcct gtaccctaac tggtacatca gcacttctca aatcgaagaa aggcccgtct
781 tcctgggaca ttttcgagct ggccaggata taactgactt cagaatggaa accctctctc
841 cctaaagaaa gccataccca gggagtccac gtgggctgaa taaccccgag gactggcaga
901 agggaaggga agaatgtagc tgcagcctga acttcactgt tgtc
//
LOCUS ECOK88ADA 744 bp ds-DNA BCT 02-AUG-1990
DEFINITION E.coli K88ad subunit protein gene, partial cds.
ACCESSION M35637
KEYWORDS K88ad subunit protein.
SOURCE E.coli (K12 strain 5K) DNA, clones pFM205, pBad, pBac.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 744)
AUTHORS Gaastra,W., Klemm,P. and de Graaf,F.K.
TITLE The nucleotide sequence of the K88ad protein subunit of porcine
enterotoxigenic Escherichia coli
JOURNAL FEMS Microbiol. Lett. 18, 177-183 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 744 K88ad subunit protein (AA at 1)
BASE COUNT 192 a 137 c 205 g 210 t
ORIGIN
1 aatggttcgg tcgatatcgg tggtagtatc actgcagatg attatcgtca gaaatgggaa
61 tggaaagttg gtacaggtct taatggattt ggtagtgtat tgaatgacct gaccaatggt
121 ggaaccgaac tgaccattac tgttactggt aataagccaa gtttgttagg ccgaaccaga
181 gaggcatttg ctacgccagt ggttggtggt gtagatggaa ttcctcatat tgcatttact
241 gactatgaag gagcttctgt agaactcaga aaccctgatg gtgaaattga aaaaggttta
301 gcatattttg ttctgccgat gaaaaatgca gagggcacta aagttggttc agtgaaagtg
361 aatgcatctt atgccggtgc gctcgggaga ggtggggtta cttctgcgga cggggagctg
421 atgtcgcttt ttgccgaggg gtcgcacgct atcttttatg gtggtttgcc gacgaatgtt
481 cagaattctg cactcccggg tgggagtgcc gcagcggcgc gcacagagtt gtttggaagt
541 ctatcaaaaa atgatattct cggacagatt caaagagtaa acgcaaatat tacttctctt
601 gttaacgtcc caggttcttt caatgaaaac atggcgtaca ctgatggatc cgttgtttcc
661 gttgcctatg cactgggtat tgcaaaccgg cagactattg aggcaacttt taatcaggct
721 gtaactacca gcactcagtg gagc
//
LOCUS HUMTNFTRP 1379 bp ss-mRNA SYN 02-AUG-1990
DEFINITION Synthetic human tumor necrosis factor mRNA, complete cds.
ACCESSION M35592
KEYWORDS tumor necrosis factor.
SOURCE Altered human leukemic B-cell line Ball-1, cDNA to mRNA, clone
pM324-346.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 1379)
AUTHORS Nobuhara,M., Kanamori,T., Nagase,Y., Nii,A., Morishita,H.,
Tohyama,J., Andoh,S. and Kurimoto,M.
TITLE The expression of human tumor necrosis factor in E.coli
JOURNAL Nucleic Acids Symp Ser 17, 131-134 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 178 879 synthetic tumor necrosis factor precursor
sigp 178 399 synthetic tumor necrosis factor signal peptide
matp 400 876 synthetic tumor necrosis factor
BASE COUNT 318 a 438 c 336 g 287 t
ORIGIN
1 agcagacgct ccctcagcaa ggacagcaga ggaccagcta agagggagag aagcaactac
61 agaccccccc tgaaaacaaa aatcagacgc cacatcccct gacaagctgc caggcaggtt
121 ctcttcctct cacatactga cccacggctc caccctctct cccctggaaa ggacaccatg
181 agcactgaaa gcatgatccg ggacgtggag ctggccgagg aggcgctccc caagaagaca
241 ggggggcccc agggctccag gcggtgcttg ttcctcagcc tcttctcctt cctgatcgtg
301 gcaggcgcca ccacgctctt ctgcctgctg cactttggag tgatgggccc ccagagggaa
361 gagttcccca gggacctctc tctaatcagc cctctggccc aggcagtcag atcatcttct
421 cgaaccccga gtgacaagcc tgtagcccat gttgtagcaa accctcaagc tgaggggcag
481 ctccagtggc tgaaccgccg ggccaatgcc ctcctggcca atggcgtgga gctgagagat
541 aaccagctgg tggtgccatc agagggcctg tacctcatct actcccaggt cctcttcaag
601 ggccaaggct gcccctccac ccatgtgctc ctcacccaca ccatcagccg catcgccgtc
661 tcctaccaga ccaaggtcaa cctcctctct gccatcaaga gcccctgcca gagggagacc
721 ccagaggggg ctgaggccaa gccctggtat gagcccatct atctgggagg ggtcttccag
781 ctggagaagg gtgaccgact cagcgctgag atcaatcggc ccgactatct cgactttgcc
841 gagtctgggc aggtctactt tgggatcatt gccctgtgag gaggacgaac atccaacctt
901 cccaaacgcc tcccctgtcc caatcccttt attaccccct ccttcagaca ccctcaacct
961 cttctggctc aaaaagagaa ttgggggctt agggtcggaa cccaagctta gaactttaag
1021 caacaagacc accacttcga aacctgggat tcaggaatgt gtggcctgca cagtgaaggt
1081 gctggcaacc actaagaatt caaactgggg cctccagaac tcactggggc ctacagcttt
1141 gatccctgac atctggaatc tggagaccag ggagcctttg gttctgacca gaatgctgca
1201 ggacttgaga agacctcacc tagaaattga cacaagtgga ccttaggcct tcctctctcc
1261 agatgtttcc agacttcctt gagacacgga gcccagccct ccccatggag ccagctccct
1321 ctatttatgt ttgcacttgt gattatttat tatttattta ttatttattt atttacaga
//
LOCUS SHREF1BA 694 bp ss-mRNA INV 02-AUG-1990
DEFINITION A.salina elongation factor 1-beta mRNA, complete cds.
ACCESSION M35636
KEYWORDS elongation factor 1-beta.
SOURCE A.salina cyst, cDNA to mRNA, clones B[4,27].
ORGANISM Artemia salina
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Branchiopoda; Sarsostraca; Anostraca; Artimiidae.
REFERENCE 1 (bases 1 to 694)
AUTHORS Maessen,G.D.F., Amons,R., Maassen,J.A. and Moeller,W.
TITLE Primary structure of elongation factor 1-beta from Artemia
JOURNAL FEBS Lett. 208, 77-83 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 26 649 elongation factor 1-beta
signal 677 682 polyA signal
BASE COUNT 240 a 117 c 150 g 187 t
ORIGIN
1 ttttctgctg taaaaaaaga tcaaaatggc caatattgac ctaaaagccg aaaagggaca
61 ggaacaacta aacgagcttt tggccaacaa gagttacctt caaggatatg aaccatccca
121 agaggatgtt gctgccttca atcagctaaa taaagccccg tctgacaaat ttccttactt
181 gcttcgatgg tacaagcaca ttagttcttt ttccgatgca gaaaagaaag gatttcccgg
241 tattccaact tctgcatcca aagaagaaga tgatgatgtt gatctttttg gatcagatga
301 ggaagatgaa gaagctgaaa aaatcaaggc agaaaggatg aaggcctata gtgataagaa
361 gtcaaagaaa cctgcaattg ttgccaaatc ctctgtcatt ttggatataa agccatggga
421 cgacgaaacg gatatggcag aaatggagaa actagttcga agtgtacaaa tggatggtct
481 tgtctggggt gcagctaaac ttattcccct tgcatatgga attaagaagc tatcaatcat
541 gtgtgttgta gaagatgata aagtatccat tgacgaactt caagaaaaaa tttctgaatt
601 tgaagacttt gtccagtctg tggatattgc cgctttcaac aaagtttaat tcttaattaa
661 ttacttgtga gaagggaata aatcagggca attg
//
LOCUS YSCRO18SA 813 bp ds-DNA PLN 02-AUG-1990
DEFINITION Yeast (S.cerevisiae) ribosomal operon DNA with 18S rRNA, 5' end.
ACCESSION M35588
KEYWORDS 18S ribosomal RNA; ribosomal operon.
SOURCE Yeast (S.cerevisiae) DNA, clone pVIrA-3.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 813)
AUTHORS Skryabin,K.G., Zakhar'ev,V.M., Rubtsov,P.M. and Baev,A.A.
TITLE Nucleotide sequence of the presumed region of initiation of
transcription of the yeast ribosomal operon
JOURNAL Dokl. Biochem. 247, 288-290 (1979)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 653 > 813 18S rRNA
BASE COUNT 221 a 140 c 194 g 258 t
ORIGIN
1 gaattcgagt aggcttgtcg ttcgttatgt ttttgtaaat ggcctcgtca aacggtggag
61 agagtcgcta ggtgatcgtc agatctgcct agtctctata cagcgtgttt aattgacatg
121 ggttgatgcg tattgagaga tacaatttgg gaagaaattc ccagagtgtg tttcttttgc
181 gtttaacctg aacagtctca tcgtgggcat cttgcgattc cattggtgag cagcgaagga
241 tttggtggat tactagctaa tagcaatcta tttcaaagaa ttcaaacttg ggggaatgcc
301 ttgttgaata gccggtcgca agacgtgatt cttgaagtgt aacctcctct caaatcagcg
361 atatcaaacg taccattccg tgaacaccgg ggtatctgtt tggtggaacc tgattagagg
421 aaactcaaag agtgctatgg tatggtgacg gagtgcgctg gtcaagagcg taagcttttg
481 aacagagagc atttccggca gcagagagac ccgaaaaagc aatttttctg gaatttcagc
541 tgtttccaaa ctcaataagt atcttctagc aagagggaat aggtgggaaa aaaaaaaaga
601 gatttcggtt tctttctttt ttactgcttg ttgcttcttc ttttaagata gttatctggt
661 tgatcctgcc agtagtcata tgcttgtctc aaagattaag ccatgcatgt ctaagtataa
721 gcaatttata cagtgaaact gcgaatggct cattaaatca gttatcgttt atttgatagt
781 tcctttacta catggtataa ctgtggtaat tct
//
LOCUS YSJLEU2A 968 bp ds-DNA PLN 02-AUG-1990
DEFINITION Y.lipolytica beta-isopropylmalate dehydrogenase (LEU2) gene, 5'
end.
ACCESSION M35579
KEYWORDS beta-isopropylmalate dehydrogenase.
SOURCE Y.lipolytica (strain 21109-9) DNA, clone pINA92.
ORGANISM Yarrowia lipolytica
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 968)
AUTHORS Gaillardin,C. and Ribet,A.-M.
TITLE LEU2 directed expression of beta-galactosidase activity and
phleomycin resistance in Yarrowia lipolytica
JOURNAL Curr. Genet. 11, 369-375 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 291 > 968 beta-isopropylmalate dehydrogenase (LEU2)
signal 163 172 CAAT box
signal 213 223 TATA box
BASE COUNT 215 a 303 c 242 g 208 t
ORIGIN
1 gaattccgtc gtcgcctgag tcatcattta tttaccagtt ggccacaaac ccttgacgat
61 ctcgtatgtc ccctccgaca tactcccggc cggctgggta cgttcgatag cgctatcggc
121 atcgacaagg tttgggtccc tagccgatac cgcactacct gagtcacaat cttcggaggt
181 ttagtcttcc acatagcacg ggcaaaagtg cgtatatata caagagcgtt tgccagccac
241 agattttcac tccacacacc acatcacaca tacaaccaca cacatccaca atggaacccg
301 aaactaagaa gaccaagact gactccaaga agattgttct tctcggcggc gacttctgtg
361 gccccgaggt gattgccgag gccgtcaagg tgctcaagtc tgttgctgag gcctccggca
421 ccgagtttgt gttcgaggac cgactcattg gaggagctgc cattgagaag gagggcgagc
481 ccatcaccga cgctactctc gacatctgcc gaaaggctga ctctattatg ctcggtgctg
541 tcggaggcgc tgccaacacc gtatggacca ctcccgacgg acgaaccgac gtgcgacccg
601 agcagggtct cctcaagctg cgaaaggacc tgaacctgta cgccaacctg cgaccctgcc
661 agctgctgtc gcccaagctc gccgatctct cccccatccg aaacgttgag ggcaccgact
721 tcatcattgt ccgagagctc gtcggaggta tctactttgg agagcgaaag gaggatgacg
781 gatctggcgt cgcttccgac accgagacct actccgttcc tgaggttgag cgaattgccc
841 gaatggccgc cttcctggcc cttcagcata acccccctct tcccgtgtgg tctcttgaca
901 aggccaacgt gctggcctcc tctcgacttt ggcgaaagac tgttactcga gtcctcaagg
961 acgaattc
//
LOCUS YSTARSA1 131 bp ds-DNA PLN 02-AUG-1990
DEFINITION Yeast ARS site linked to HML locus, segment 1.
ACCESSION M17505
KEYWORDS autonomous replication; origin of replication.
SEGMENT 1 of 2
SOURCE Yeast DNA.
ORGANISM Yeast sp.
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE 1 (bases 1 to 131)
AUTHORS Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J.,
Nasmyth,K.A. and Hicks,J.B.
TITLE Localization and sequence analysis of yeast origins of DNA
replication
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
site 92 101 ARS consensus sequence
BASE COUNT 41 a 14 c 19 g 57 t
ORIGIN Chromosome III.
1 atggaatgta atttcttaag tattctatat gtacttaaaa cctattaata tatggatcaa
61 cacagtatct tatgaatggg tttttgattt ttttatgttt ttttaaaaca ttaaagtttt
121 cggcacggac t
//
LOCUS YSTARSA2 146 bp ds-DNA PLN 02-AUG-1990
DEFINITION Yeast ARS site linked to HML locus, segment 2.
ACCESSION M17506
KEYWORDS autonomous replication; origin of replication.
SEGMENT 2 of 2
SOURCE Yeast DNA.
ORGANISM Yeast sp.
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE 1 (bases 1 to 146)
AUTHORS Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J.,
Nasmyth,K.A. and Hicks,J.B.
TITLE Localization and sequence analysis of yeast origins of DNA
replication
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
site 136 146 ARS consensus sequence
BASE COUNT 46 a 22 c 21 g 57 t
ORIGIN Chromosome III; about 2.8 kb downstream of of segment 1.
1 cttctatgtt acttcttcaa catgaaagcc cgaacgtttg caatgatatc ctctttttca
61 cccgattatt ttcatttatt aagtaagtgg ggttctgggt gaaaaaatca aaacaaaaat
121 atgaatttgt aatgattttt atattt
//
LOCUS YSTARSB1 147 bp ds-DNA PLN 02-AUG-1990
DEFINITION Yeast ARS site linked to HMR locus, segment 1.
ACCESSION M17507
KEYWORDS autonomous replication; origin of replication.
SEGMENT 1 of 2
SOURCE Yeast DNA.
ORGANISM Yeast sp.
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE 1 (bases 1 to 147)
AUTHORS Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J.,
Nasmyth,K.A. and Hicks,J.B.
TITLE Localization and sequence analysis of yeast origins of DNA
replication
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
site 112 122 ARS consensus sequence
BASE COUNT 56 a 14 c 9 g 68 t
ORIGIN Chromosome III.
1 aatagtatca atatatatat atatatatat atttatttgt ttactttttc tatcagtgtt
61 ttcaattttt tattaaacaa tgtttgattt tttaaatcgc aatttaatac ctaaatataa
121 aaaatgttat tatattgcaa aaaccca
//
LOCUS YSTARSB2 85 bp ds-DNA PLN 02-AUG-1990
DEFINITION Yeast ARS site linked to HMR locus, segment 2.
ACCESSION M17508
KEYWORDS autonomous replication; origin of replication.
SEGMENT 2 of 2
SOURCE Yeast DNA.
ORGANISM Yeast sp.
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE 1 (bases 1 to 85)
AUTHORS Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J.,
Nasmyth,K.A. and Hicks,J.B.
TITLE Localization and sequence analysis of yeast origins of DNA
replication
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
site 63 73 ARS consensus sequence
BASE COUNT 33 a 7 c 5 g 40 t
ORIGIN Chromosome III; about 1.9 kb downstream of segment 1.
1 atttatcatg ttttggtatg ataatttaat ttttaaaaaa acaaatttaa ttgacctcat
61 taattaatat ttattaatac cttta
//
LOCUS HUMROS1 7375 bp ss-mRNA PRI 02-AUG-1990
DEFINITION Human transmembrane tyrosine-specific protein kinase (ROS1) mRNA,
complete cds.
ACCESSION M34353
KEYWORDS oncogene; transmembrane tyrosine-specific protein kinase.
SOURCE Human glioblastoma cell line SW-1088, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 7375)
AUTHORS Birchmeier,C., O'Neil,K., Riggs,M. and Wigler,M.
TITLE Characterization of ROS1 cDNA from a human glioblastoma cell line
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4799-4803 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.H.Wigler, 15-MAY-1990.
FEATURES from to/span description
pept 207 7250 transmembrane tyrosine-specific protein kinase
precursor /hgml_locus_uid="LN0124W"
/nomgen="ROS1" /map="6q21-q22"
sigp 207 314 transmembrane tyrosine-specific protein kinase
signal peptide
matp 208 7247 transmembrane tyrosine-specific protein kinase
BASE COUNT 2203 a 1496 c 1605 g 2071 t
ORIGIN
1 ccgcattcaa gctttcaagc attcaaaggt ctaaatgaaa aaggctaagt attatttcaa
61 aaggcaagta tatcctaata tagcaaaaca aacaaagcaa aatccatcag ctactcctcc
121 aattgaagtg atgaagccca aataattcat atagcaaaat ggagaaaatt agaccggcca
181 tctaaaaatc tgccattggt gaagtgatga agaacattta ctgtcttatt ccgaagcttg
241 tcaattttgc aactcttggc tgcctatgga tttctgtggt gcagtgtaca gttttaaata
301 gctgcctaaa gtcgtgtgta actaatctgg gccagcagct tgaccttggc acaccacata
361 atctgagtga accgtgtatc caaggatgtc acttttggaa ctctgtagat cagaaaaact
421 gtgctttaaa gtgtcgggag tcgtgtgagg ttggctgtag cagcgcggaa ggtgcatatg
481 aagaggaagt actggaaaat gcagacctac caactgctcc ctttgcttct tccattggaa
541 gccacaatat gacattacga tggaaatctg caaacttctc tggagtaaaa tacatcattc
601 agtggaaata tgcacaactt ctgggaagct ggacttatac taagactgtg tccagaccgt
661 cctatgtggt caagcccctg caccccttca ctgagtacat tttccgagtg gtttggatct
721 tcacagcgca gctgcagctc tactcccctc caagtcccag ttacaggact catcctcatg
781 gagttcctga aactgcacct ttgattagga atattgagag ctcaagtccc gacactgtgg
841 aagtcagctg ggatccacct caattcccag gtggacctat tttgggttat aacttaaggc
901 tgatcagcaa aaatcaaaaa ttagatgcag ggacacagag aaccagtttc cagttttact
961 ccactttacc aaatactatc tacaggtttt ctattgcagc agtaaatgaa gttggtgagg
1021 gtccagaagc agaatctagt attaccactt catcttcagc agttcaacaa gaggaacagt
1081 ggctcttttt atccagaaaa acttctctaa gaaagagatc tttaaaacat ttagtagatg
1141 aagcacattg ccttcggttg gatgctatat accataatat tacaggaata tctgttgatg
1201 tccaccagca aattgtttat ttctctgaag gaactctcat atgggcgaag aaggctgcca
1261 acatgtctga tgtatctgac ctgagaattt tttacagagg ttcaggatta atttcttcta
1321 tctccataga ttggctttat caaagaatgt atttcatcat ggatgaactg gtatgtgtct
1381 gtgatttaga gaactgctca aacatcgagg aaattactcc accctctatt agtgcacctc
1441 aaaaaattgt ggctgattca tacaatgggt atgtctttta cctcctgaga gatggcattt
1501 atagagcaga ccttcctgta ccatctggcc ggtgtgcaga agctgtgcgt attgtggaga
1561 gttgcacgtt aaaggacttt gcaatcaagc cacaagccaa gcgaatcatt tacttcaatg
1621 acactgccca agtcttcatg tcaacatttc tggatggctc tgcttcccat ctcatcctac
1681 ctcgcatccc ctttgctgat gtgaaaagtt ttgcttgtga aaacaatgac tttcttgtca
1741 cagatggcaa ggtcattttc caacaggatg ctttgtcttt taatgaattc atcgtgggat
1801 gtgacctgag tcacatagaa gaatttgggt ttggtaactt ggtcatcttt ggctcatcct
1861 cccagctgca ccctctgcca ggccgcccgc aggagctttc ggtgctgttt ggctctcacc
1921 aggctcttgt tcaatggaag cctcctgccc ttgccatagg agccaatgtc atcctgatca
1981 gtgatattat tgaactcttt gaattaggcc cttctgcctg gcagaactgg acctatgagg
2041 tgaaagtatc cacccaagac cctcctgaag tcactcatat tttcttgaac ataagtggaa
2101 ccatgctgaa tgtacctgag ctgcagagtg ctatgaaata caaggtttct gtgagagcaa
2161 gttctccaaa gaggccaggc ccctggtcag agccctcagt gggtactacc ctggtgccag
2221 ctagtgaacc accatttatc atggctgtga aagaagatgg gctttggagt aaaccattaa
2281 atagctttgg cccaggagag ttcttatcct ctgatatagg aaatgtgtca gacatggatt
2341 ggtataacaa cagcctctac tacagtgaca cgaaaggcga cgtttttgtg tggctgctga
2401 atgggacgga tatctcagag aattatcacc tacccagcat tgcaggagca ggggctttag
2461 cttttgagtg gctgggtcac tttctctact gggctggaaa gacatatgtg atacaaaggc
2521 agtctgtgtt gacgggacac acagacattg ttacccacgt gaagctattg gtgaatgaca
2581 tggtggtgga ttcagttggt ggatatctct actggaccac actctattca gtggaaagca
2641 ccagactaaa tggggaaagt tcccttgtac tacagacaca gccttggttt tctgggaaaa
2701 aggtaattgc tctaacttta gacctcagtg atgggctcct gtattggttg gttcaagaca
2761 gtcaatgtat tcacctgtac acagctgttc ttcggggaca gagcactggg gataccacca
2821 tcacagaatt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactata
2881 gtggtcggct gttctggatc aatggcttta ggattatcac aactcaagaa ataggtcaga
2941 aaaccagtgt ctctgttttg gaaccagcca gatttaatca gttcacaatt attcagacat
3001 cccttaagcc cctgccaggg aacttttcct ttacccctaa ggttattcca gattctgttc
3061 aagagtcttc atttaggatt gaaggaaatg cttcaagttt tcaaatcctg tggaatggtc
3121 cccctgcggt agactggggt gtagttttct acagtgtaga atttagtgct cattctaagt
3181 tcttggctag tgaacaacac tctttacctg tatttactgt ggaaggactg gaaccttatg
3241 ccttatttaa tctttctgtc actccttata cctactgggg aaagggcccc aaaacatctc
3301 tgtcacttcg agcacctgaa acagttccat cagcaccaga gaaccccaga atatttatat
3361 taccaagtgg aaaatgctgc aacaagaatg aagttgtggt ggaatttagg tggaacaaac
3421 ctaagcatga aaatggggtg ttaacaaaat ttgaaatttt ctacaatata tccaatcaaa
3481 gtattacaaa caaaacatgt gaagactgga ttgctgtcaa tgtcactccc tcagtgatgt
3541 cttttcaact tgaaggcatg agtcccagat gctttattgc cttccaggtt agggccttta
3601 catctaaggg gccaggacca tatgctgacg ttgtaaagtc tacaacatca gaaatcaacc
3661 catttcctca cctcataact cttcttggta acaagatagt ttttttagat atggatcaaa
3721 atcaagttgt gtggacgttt tcagcagaaa gagttatcag tgccgtttgc tacacagctg
3781 ataatgagat gggatattat gctgaagggg actcactctt tcttctgcac ttgcacaatc
3841 gctctagctc tgagcttttc caagattcac tggtttttga tatcacagtt attacaattg
3901 actggatttc aaggcacctc tactttgcac tgaaagaatc acaaaatgga atgcaagtat
3961 ttgatgttga tcttgaacac aaggtgaaat atcccagaga ggtgaagatt cacaatagga
4021 attcaacaat aatttctttt tctgtatatc ctcttttaag tcgcttgtat tggacagaag
4081 tttccaattt tggctaccag atgttctact acagtattat cagtcacacc ttgcaccgaa
4141 ttctgcaacc cacagctaca aaccaacaaa acaaaaggaa tcaatgttct tgtaatgtga
4201 ctgaatttga gttaagtgga gcaatggcta ttgatacctc taacctagag aaaccattga
4261 tatactttgc caaagcacaa gagatctggg caatggatct ggaaggctgt cagtgttgga
4321 gagttatcac agtacctgct atgctcgcag gaaaaaccct tgttagctta actgtggatg
4381 gagatcttat atactggatc atcacagcaa aggacagcac acagatttat caggcaaaga
4441 aaggaaatgg ggccatcgtt tcccaggtga aggccctaag gagtaggcat atcttggctt
4501 acagttcagt tatgcagcct tttccagata aagcgtttct gtctctagct tcagacactg
4561 tggaaccaac tatacttaat gccactaaca ctagcctcac aatcagatta cctctggcca
4621 agacaaacct cacatggtat ggcatcacca gccctactcc aacatacctg gtttattatg
4681 cagaagttaa tgacaggaaa aacagctctg acttgaaata tagaattctg gaatttcagg
4741 acagtatagc tcttattgaa gatttacaac cattttcaac atacatgata cagatagctg
4801 taaaaaatta ttattcagat cctttggaac atttaccacc aggaaaagag atttggggaa
4861 aaactaaaaa tggagtacca gaggcagtgc agctcattaa tacaactgtg cggtcagaca
4921 ccagcctcat tatatcttgg agagaatctc acaagccaaa tggacctaaa gaatcagtcc
4981 gttatcagtt ggcaatctca cacctggccc taattcctga aactcctcta agacaaagtg
5041 aatttccaaa tggaaggctc actctccttg ttactagact gtctggtgga aatatttatg
5101 tgttaaaggt tcttgcctgc cactctgagg aaatgtggtg tacagagagt catcctgtca
5161 ctgtggaaat gtttaacaca ccagagaaac cttattcctt ggttccagag aacactagtt
5221 tgcaatttaa ttggaaggct ccattgaatg ttaacctcat cagattttgg gttgagctac
5281 agaagtggaa atacaatgag ttttaccatg ttaaaacttc atgcagccaa ggtcctgctt
5341 atgtctgtaa tatcacaaat ctacaacctt atacttcata taatgtcaga gtagtggtgg
5401 tttataagac gggagaaaat agcacctcac ttccagaaag ctttaagaca aaagctggag
5461 tcccaaataa accaggcatt cccaaattac tagaagggag taaaaattca atacagtggg
5521 agaaagctga agataatgga tgtagaatta catactatat ccttgagata agaaagagca
5581 cttcaaataa tttacagaac cagaatttaa ggtggaagat gacatttaat ggatcctgca
5641 gtagtgtttg cacatggaag tccaaaaacc tgaaaggaat atttcagttc agagtagtag
5701 ctgcaaataa tctagggttt ggtgaatata gtggaatcag tgagaatatt atattagttg
5761 gagatgattt ttggatacca gaaacaagtt tcatacttac tattatagtt ggaatatttc
5821 tggttgttac aatcccactg acctttgtct ggcatagaag attaaagaat caaaaaagtg
5881 ccaaggaagg ggtgacagtg cttataaacg aagacaaaga gttggctgag ctgcgaggtc
5941 tggcagccgg agtaggcctg gctaatgcct gctatgcaat acatactctt ccaacccaag
6001 aggagattga aaatcttcct gccttccctc gggaaaaact gactctgcgt ctcttgctgg
6061 gaagtggagc ctttggagaa gtgtatgaag gaacagcagt ggacatctta ggagttggaa
6121 gtggagaaat caaagtagca gtgaagactt tgaagaaggg ttccacagac caggagaaga
6181 ttgaattcct gaaggaggca catctgatga gcaaatttaa tcatcccaac attctgaagc
6241 agcttggagt ttgtctgctg aatgaacccc aatacattat cctggaactg atggagggag
6301 gagaccttct tacttatttg cgtaaagccc ggatggcaac gttttatggt cctttactca
6361 ccttggttga ccttgtagac ctgtgtgtag atatttcaaa aggctgtgtc tacttggaac
6421 ggatgcattt cattcacagg gatctggcag ctcgaaattg ccttgtttcc gtgaaagact
6481 ataccagtcc acggatagtg aagattggag actttggact cgccagagac atctataaaa
6541 atgattacta tagaaagaga ggggaaggcc tgctcccagt tcggtggatg gctccagaaa
6601 gtttgatgga tggaatcttc actactcaat ctgatgtatg gtcttttgga attctgattt
6661 gggagatttt aactcttggt catcagcctt atccagctca ttccaacctt gatgtgttaa
6721 actatgtgca aacaggaggg agactggagc caccaagaaa ttgtcctgat gatctgtgga
6781 atttaatgac ccagtgctgg gctcaagaac ccgaccaaag acctactttt catagaattc
6841 agaaccaact tcagttattc agaaattttt tcttaaatag catttatcag tgcagagatg
6901 aagcaaacaa cagtggagtc ataaatgaaa gctttgaagg tgaagatggc gatgtgattt
6961 gtttgaattc agatgacatt atgccagttg ttttaatgga aacgaagaac cgagaagggt
7021 taaactatat ggtacttgct acagaatgtg gccaaggtga agaaaagtct gagggtcctc
7081 taggctccca ggaatctgaa tcttgtggtc tgaggaaaga agagaaggaa ccacatgcag
7141 acaaagattt ctgccaagaa aaacaagtgg cttactgccc ttctggcaag cctgaaggcc
7201 tgaactatgc ctgtctcact cacagtggat atggagatgg gtctgattaa tagcgttgtt
7261 tgggaaatag agagttgaga taaacactct cattcagtag ttactgaaag aaaactctgc
7321 tagaatgata aatgtcatgg tggtctataa ctccaaataa acaatgcaac gttcc
//
LOCUS MBOPIVML 969 bp ds-DNA BCT 02-AUG-1990
DEFINITION M.lacunata pilin gene inverting protein (PivML) gene, complete cds.
ACCESSION M34367
KEYWORDS pilin gene inverting protein.
SOURCE M.lacunata (ATCC 17956) DNA, clone pMxL1.
ORGANISM Moraxella lacunata
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 969)
AUTHORS Marrs,C.F., Rozsa,F.W., Hackel,M., Stevens,S.P. and Glasgow,A.C.
TITLE Identification, cloning and sequencing of Piv, a new gene involved
in inverting the pilin gene of M.lacunata
JOURNAL J. Bacteriol. 172, 4370-4377 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.F.Marrs, 17-MAY-1990.
FEATURES from to/span description
pept 1 969 pilin gene inverting protein (PivML)
BASE COUNT 318 a 236 c 165 g 250 t
ORIGIN
1 atgtctaaaa cttacattgg gattgatatc gccaaaaaca catttgatgc ctgctttatt
61 acccataaca cttggcaaaa ctgtacattc acaaataacc aacagggttt tatagagcta
121 acattgtgga ttcaagcaca tcattataac acatctacct tacatttgat tattgaggca
181 acgggggctt attgggaaaa attagctcat tgggcaatct ctcatcatca taaggtaagt
241 attgtcaacc ccttatatat tcatgcatac gccaaaagcc taggcatacg caccaagaca
301 gataagcagg atgccattct cttggcacgc tatggtgcaa aagaaaatcc acctttatgg
361 cagcctaaat cagacaacga aataaaactg actgcccttc taaaacaacg tgaacaccac
421 aaacggcaac tcatcaaaga acgcactcga caggaagctc tgtctattta tgtgaaatcg
481 tatactgatg acaacattcg tcattggagt gacagcatca cacagctcga tcatcaaatt
541 tggcaactca tcaattgcac gcccgagcta aattaccgtg ccagcctact tgccacaata
601 cctggaattg gcaaaaagac actgccacat ctactggttg tcattggaga tggttcatca
661 tttcaaagtg ccaaacatct agcaagctat gctgggcttg cccctcgtca tcatcaatca
721 ggcattagta ttcacaagca gtcttccatt ggatttagtg gacaaaaaga actacgttca
781 gcccttttta tgcccgctgt catagtcagc tttggtcgat atcccgcttt tcaaaagttc
841 gtcaaacgca tggaacaaaa aggcaaaacc aaaaaacaaa ttattatcgc catcatgcgc
901 aaattactca ccatcagcta tgccgttatt cgccaaaatc gcccttttga caaacgcatt
961 catgaataa
//
LOCUS STYMKAD 915 bp ds-DNA BCT 02-AUG-1990
DEFINITION Plasmid pEX102 mouse killing agent D (mkaD) gene, complete cds.
ACCESSION M34355
KEYWORDS mouse killing agent D.
SOURCE Plasmid pEX102 (from Salmonella typhimurium, strain TML R66) DNA.
ORGANISM Plasmid pEX102
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 915)
AUTHORS Taira,S. and Rhen,M.
TITLE Nucleotide sequence of mkaD, a virulence associated gene of
Salmonella typhimurium containing variable and constant region
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.P.Taira, 16-MAY-1990.
FEATURES from to/span description
pept 132 857 mouse killing agent D (mkaD)
BASE COUNT 273 a 181 c 213 g 248 t
ORIGIN
1 ggaccgaaat atcatttttt cgcctgataa aaaatgaggt tttctggata actaatcgtt
61 ttattaaaaa aaaactgaga atttatatct aataatatgg cgatatatcc atatcgcaaa
121 ggagatttcc catgcccata aataggccta atctaaatct aaacatccct cctttgaata
181 ttgtagctgc ttatgatggg gcggaaatac catctacaaa taagcacctg aaaaataatt
241 tcaactcctt gcacaaccaa atgcggaaga tgccggtatc ccactttaaa gaggcgctgg
301 atgtgcctga ctattcaggg atgcgccaga gtggtttctt tgctatgagc caaggttttc
361 agctgaataa ccatggttac gatgttttca tccatgctcg tcgagaatca cctcagtctc
421 agggcaaatt tgccggtgac aagttccaca tcagtgtgct cagggatatg gtgccacaag
481 catttcaagc gctgtccgga ttgctgtttt cagaggacag tccggtagat aagtggaaag
541 tgaccgatat ggagaaggtc gttcaacaag cccgtgttag cctgggcgct cagttcacgt
601 tgtatataaa accagaccag gaaaattcgc agtacagtgc gtcgtttctc cacaagacac
661 ggcaatttat agagtgtctg gaatccagac tatccgaaaa tggggttatt tcaggacagt
721 gtcctgagtc agacgttcat cctgaaaatt ggaaatatct cagttatcgt aatgaactac
781 gaagtgggcg tgatggtggc gaaatgcaga gacaggcttt acgtgaggaa ccgttttatc
841 gtttgatgac agagtaagta tgggtttggg gagcaacgga acagtaaacg ccgttaaaca
901 gctattttaa atgct
//
LOCUS TOBEXT 3994 bp ds-DNA PLN 02-AUG-1990
DEFINITION N.plumbaginifolia extensin (ext) gene, complete cds.
ACCESSION M34371
KEYWORDS cell wall protein; extensin.
SOURCE N.plumbaginifolia leaf DNA.
ORGANISM Nicotiana plumbaginifolia
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 3994)
AUTHORS De Loose,M.R.R., Gheyson,G., Tire,C., Gielen,J., Villaroel,R.,
Genetello,C., Van Montagu,M., Depicker,A. and Inze,D.
TITLE The extensin signal peptide allows secretion of heterologous
proteins from protoplasts
JOURNAL Unpublished (1990)
STANDARD unannotated staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.R.De Loose, 18-MAY-1990.
Author address: M.R.R.De Loose
Laboratorium voor Genetika, Wetenschappen,
K.L.Ledeganckstraat, Gent,
Oost-Vlaanderen, BELGIUM
FEATURES from to/span description
pept 1590 2840 extensin precursor
sigp 1590 1652 extensin signal peptide (could end at 1667)
matp 1653 2837 extensin (could start at 1668)
mRNA 1560 3342 extensin mRNA
rpt 1740 1824 tandem 85 bp repeats
rpt 1740 1758 19 bp repeats
signal 1532 1537 TATA box
signal 3323 3329 poly-A signal
BASE COUNT 1264 a 992 c 550 g 1188 t
ORIGIN
1 cgcccgggga tcctcctgga tgccattttt cctttgcttt tggatacctt ttcgcaactt
61 tagtccattg cttgaacatg gttttttcta cctcttgtct gttcccatga tgataaacta
121 tgatagctaa cattttcagt aatattggat tgaattagat atactttgaa ttattgagct
181 acaaaattcg ttgaatattt tgccctatcc cccacccaaa agaaaataat tcataccttt
241 atttggtaat aaaagtgggt cacatgcaca gtttttctcg tcttctctag attaaactct
301 ttggaaatga cctactgaaa atactacaca taaaattccc cactcccctc caagttgggg
361 gagggtgggg gttggtgggt tttttttccc cttgaatcca ttcagccggc ttatgcctcc
421 cgaggtggtg ggaggtttaa tttgaaccct taccctatag ttaatatcga aatagataat
481 acaaaggagg ggaacataaa accaaaacct ccgacaaaac atcaaagtgt aggagttcat
541 gatatacgct tgattgaaag tgcacaatgg tctttgcata ctatcaaagt atcaattggt
601 tcttgaatta tatctcttaa tataatgtat tgtgtttaat ttctctacta tcccctattt
661 tataagacta aaagatcctg acatgcttct tgaacacatg tgaaggttag ttaactggtc
721 agaagtacac aagaattaat ttgtacacct atgtttctca ccagaatgcc cataaattac
781 atcaattctt caacatcaag agatcgaaaa acttaacaat tgttctaata gttgaactga
841 gtcgtccgat ccatatccga ttcttcacta gaagcattaa tcatatagag caatttaact
901 taatttacta tattggtaat catttacata gttaagttat aaaaccttgg agcgacacaa
961 ggatttacac taatcatgat tgaatattaa ctttactcaa tttatcaatg tctgatacaa
1021 gtgaattaat ttctctcttt gtgatttcgg tagaaaatta atgtcaagtt tcaaagtttt
1081 tctttttgaa attagtcata catgtgaata gaacattaat ttaagttaaa gaatcatata
1141 aaaaagttag ccgctttgat ttctgtaact gaaatcgtgc agagatgagg catccacact
1201 tgtttttcaa agcttcagta ctatttaaaa acaaagacag taaaaaggag actttttctt
1261 tgacattatt gcatcagaaa tagtatagct gccataatag tttattcctt tgcttagctt
1321 gcagcctcta tcaaacaaaa aaatacacca gctcaagtca atttgagccg acaacatgac
1381 aaaaccaaat caaatatgca tttttttttt tttttttttt tttttttttt tttttttttt
1441 tttacacttt ggtaggtgtt aagtaatcta gtgagacttt taccttcatt tatgaaaatc
1501 ttgaaaaggg taattgtcta attgaaagct atataaaggg gtcgaagtga agcttaagag
1561 gacaacaact tttctcattt gtttcaaaga tgggaaaaat ggcttctcta tttgccacat
1621 ttttagtggt tttagtgtca cttagcttag cttctgaaag ctcagcaaat tatcaatact
1681 catctccacc accacctaag aagccatacc acccttcacc aacaccatat tatcctgcac
1741 cagtttataa atctccacca ccaccaattc cggtttataa gtcaccacca ccacccaaga
1801 agccatacta ccctccacac accccagttt acaaatctcc accaccacca actccagttt
1861 acaagtcgcc accaccaccc aagaagccac actaccctcc acacacaccc gtttacaagt
1921 ctccaccacc acctactcca gtttacaagt caccaccatc acccaagaag ccacactatc
1981 ctccacacac tccggtttac aagtcgccac caccaccaac tcccgtttac aagtcgccac
2041 caccacccaa gaagccacac tatccaccac acaccccggt ttacaagtcg ccaccaccac
2101 caactcccgt ttacaagtcg ccaccaccac caaagaagcc acactatcct ccacacacac
2161 cagtttacaa gtcaccacca ccacccaaga agccatacta tccaccacac accccagttt
2221 acaagtcgcc accaccacca actccagttt acaagtcacc accaccacct aagaagccat
2281 accacccttc accaacaccg tatcatcctt caccagttta caagtcccct ccaccaccaa
2341 ctccagttta caagtcacca ccaccaccta agaagccata ccacccttca ccaacaccgt
2401 atcatccatc accagtttac aagtcccctc caccaccaac tccagtttac aagtcaccac
2461 caccacctaa gaagccatac cacccttcac caacaccgta tcatccatca ccagtttaca
2521 agtcccctcc accaccaact ccagtttaca agtcaccacc accacctaag aagccatacc
2581 acccttcacc aacaccgtat catcctgcac cagtttacaa gtccccacca ccaccaactc
2641 cagtttacaa gtcaccaccg ccaccagtga agccatacca tccttcacca acaccatacc
2701 atcctgcacc agtttataag tctccaccac caccaactcc tgtttacaag tcacctccac
2761 caccaactcc agtctacaag tctccaccac cacaccaccc ctatgtttac gcttctcctc
2821 ctcctcccta ccattactaa gaagtgacat cacaaagctg aaggtaatta aggcaaatta
2881 attaaaacat tttcaaaatg atatgtcagg aatgcaaaga aaattgcata actttgttta
2941 tgcaaaacat taatataata taaagaactg cgctctctgc ttattaacta tttttttagc
3001 ttcatttatt agttactagc tgttcagaat tttcagtatc ttttgatatt acaaagaacc
3061 taatcagaca atgtatattc ttatgcagga aaagcagaat gttgagctaa aagaaaggct
3121 ttttccattt tcgagaggca acgagaagaa aagaagaaga aaaaatagag taaataataa
3181 gccccacggg aggcgaagtt cttttgtagc ttcatgttgt ctaagctatt gatattgttt
3241 gtaccctata ttttatttct ctgtctttgt gtatgttttg ttcaatttcg atctccttgc
3301 aaaatgcaga gattatgagc tgaataaagt aagttaatat tattatacgt gttaattttc
3361 tactcctctc tctagccagc cttttgtttt ctctttttct tatttcattt tctttatatc
3421 aaaccatttt aggagagggc caaggagtga tccagtaaaa cacgaagatt agaagaaact
3481 tcccttcttt ttttttatct gaaaaaaatt taaagtcgag atttatctct ttttgtaatg
3541 gaatcatttc cacagttatg actattttca aattgatatc gccagaccta acaaattaaa
3601 ttactccttc cgctcatctg gccgtacact tctgctccat ctggccgtac agttctgctc
3661 catctggcag tgtacggaga atggataact taatattcag tatgaatttg gacataaaat
3721 cttctaagat tgctagtgaa ttttgtgagc gttattgtgt taaaacaaaa gacagaaaat
3781 tattagaaga attaaagaaa tatgaaaata gtatagaatc agaaatattt cgagtccaca
3841 agttttttgt gtgttcttaa ggaatgtttt aacccctcac acgttgccaa ggtaatggat
3901 taaatcctcc caagataaaa cggaataaac cttcctgcaa cagtggcaat acaaactgca
3961 ggtcgactct agggatcccc gggcgagcga attc
//GenBank-Updates@genbank.bio.net (08/03/90)
LOCUS CLLRRE 1861 bp ss-rRNA RNA 03-AUG-1990
DEFINITION C.sapidus 18S rRNA.
ACCESSION M34360
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE C.sapidus rRNA.
ORGANISM Callinectes sapidus
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura;
Brachyrhyncha; Portunoidea; Portunidae.
REFERENCE 1 (bases 1 to 1861)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustaceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1860 18S ribosomal RNA
BASE COUNT 359 a 375 c 421 g 367 t 339 others
ORIGIN
1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct
61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt
121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta
181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc
241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc
301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc
361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac
421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac
481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat
541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct
601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg
661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc
721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata
841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc
901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc
961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg
1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt
1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt
1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn
1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac
1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg
1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn
1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct
1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc
1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga
1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag
1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc
1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc
1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn
1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1861 n
//
LOCUS HUMNCSRC 138 bp ds-DNA PRI 03-AUG-1990
DEFINITION Human membrane-associated tyrosine protein kinase (C-SRC) gene,
exons 3, 4, NI, and NII, partial cds.
ACCESSION M34469
KEYWORDS membrane-associated tryosine protein kinase.
SOURCE Human adult brain DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 138)
AUTHORS Pyper,J.M. and Bolen,J.B.
TITLE Identification of a novel neuronal C-SRC exon expressed in human
brain
JOURNAL Mol. Cell. Biol. 10, 2035-2040 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 29 membrane-associated tyrosine protein kinase
(C-SRC), exon 3 (AA at 1)
30 47 membrane-associated tyrosine protein kinase,
exon NI
48 80 membrane-associated tyrosine protein kinase,
exon NII
81 > 136 membrane-associated tyrosine protein kinase,
exon 4
variant 117 117 a or g
variant 135 135 c or t
BASE COUNT 36 a 40 c 40 g 22 t
ORIGIN
1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc
61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga
121 cagacaggct acatcccc
//
LOCUS MHVAPEPA 1000 bp ss-RNA VRL 03-AUG-1990
DEFINITION Murine coronavirus peplomer (S) protein gene.
ACCESSION M34435
KEYWORDS peplomer protein.
SOURCE Murine (strain JHM-DL) RNA.
ORGANISM Murine hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 1000)
AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C.
TITLE A clustering of RNA recombination sites adjacent to a hypervariable
region of the peplomer gene of Murine coronavirus
JOURNAL Virology 175, 548-555 (1990)
STANDARD simple staff_review
BASE COUNT 258 a 192 c 227 g 317 t 6 others
ORIGIN
1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag
61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa
121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct
181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa
241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta
301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt
361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca
421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac
481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg
541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga
601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg
661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt
721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt
781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa
841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa
901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag
961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa
//
LOCUS MHVAPEPB 843 bp ss-RNA VRL 03-AUG-1990
DEFINITION Murine coronavirus peplomer (S) protein gene.
ACCESSION M34436
KEYWORDS peplomer protein.
SOURCE Murine (strain A59) RNA.
ORGANISM Murine hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 843)
AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C.
TITLE A clustering of RNA recombination sites adjacent to a hypervariable
region of the peplomer gene of Murine coronavirus
JOURNAL Virology 175, 548-555 (1990)
STANDARD simple staff_review
FEATURES from to/span description
site 400 401 deletion site
BASE COUNT 227 a 153 c 183 g 280 t
ORIGIN
1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag
61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa
121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct
181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa
241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta
301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt
361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt
421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc
481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc
541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat
601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt
661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc
721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact
781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc
841 gaa
//
LOCUS MHVSP 3780 bp ss-mRNA VRL 03-AUG-1990
DEFINITION Mouse hepatitis virus surface protein S mRNA, complete cds.
ACCESSION X04797 M34437
KEYWORDS glycoprotein; membrane glycoprotein; surface glycoprotein;
surface projection glycoprotein.
SOURCE Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones
pJMS1010, pJS112, and pJS92.
ORGANISM Murine hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 3780)
AUTHORS Schmidt,I., Skinner,M. and Siddell,S.
TITLE Nucleotide Sequence of the Gene Encoding the Surface Projection
Glycoprotein of Coronavirus MHV-JHM
JOURNAL J. Gen. Virol. 68, 47-56 (1987)
STANDARD simple automatic
COMMENT EMBL features not translated to GenBank features:
key from to description
SITE 24 30 put. region of homology preceeding
mRNA 5' initiation codons
SITE 121 129 pot. N-glycosylation site
SITE 208 216 pot. N-glycosylation site
SITE 430 438 pot. N-glycosylation site
SITE 604 612 pot. N-glycosylation site
SITE 1099 1107 pot. N-glycosylation site
SITE 1333 1341 pot. N-glycosylation site
SITE 1636 1644 pot. N-glycosylation site
SITE 1732 1740 pot. N-glycosylation site
SITE 1756 1764 pot. N-glycosylation site
SITE 1825 1833 pot. N-glycosylation site
SITE 1900 1914 put. proteolytic cleavage site
SITE 1972 1980 pot. N-glycosylation site
SITE 2023 2031 pot. N-glycosylation site
SITE 2293 2301 pot. N-glycosylation site
SITE 2440 2448 pot. N-glycosylation site
SITE 3139 3147 pot. N-glycosylation site
SITE 3301 3309 pot. N-glycosylation site
SITE 3331 3339 pot. N-glycosylation site
SITE 3388 3396 pot. N-glycosylation site
SITE 3436 3444 pot. N-glycosylation site
SITE 3499 3507 pot. N-glycosylation site
SITE 3553 3654 pot. transmembrane domain
SITE 3622 3675 cysteine-rich region
SITE 3655 3660 charge cluster
SITE 3715 3723 pot. N-glycosylation site
FEATURES from to/span description
pept 31 3738 surface protein S precursor
sigp 31 60 surface protein S signal peptide
matp 61 3736 surface protein S
BASE COUNT 1030 a 718 c 800 g 1232 t
ORIGIN
1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc
61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat
121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact
181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct
241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg
301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac
361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt
421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct
481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat
541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag
601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt
661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg
721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct
781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt
841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt
901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt
961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa
1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact
1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg
1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt
1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt
1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt
1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat
1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat
1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt
1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct
1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc
1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct
1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt
1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat
1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc
1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt
1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc
1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat
2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt
2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat
2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct
2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct
2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac
2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag
2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa
2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg
2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct
2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta
2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac
2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag
2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg
2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca
2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc
2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc
3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata
3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca
3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg
3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat
3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg
3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt
3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc
3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag
3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa
3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt
3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa
3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt
3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt
//
LOCUS MUSIGLAZ 713 bp ss-mRNA ROD 03-AUG-1990
DEFINITION Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds.
ACCESSION M34598 M29013 J03562
KEYWORDS constant region; immunoglobulin; immunoglobulin light chain;
joining exon; lambda-immunoglobulin; processed gene;
variable region.
SOURCE Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone
Y31.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 713)
AUTHORS Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A.
TITLE Structure of a third murine immunoglobulin lambda light chain
variable region that is expressed in laboratory mice
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987)
STANDARD simple staff_entry
REFERENCE 2 (bases 2 to 450)
AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and
is conserved in mammalian species
JOURNAL J. Immunol. 144, 2816-2820 (1990)
STANDARD simple staff_review
REFERENCE 3 (bases 266 to 429)
AUTHORS Sanchez,P. and Cazenave,P.-A.
TITLE A new variable region in mouse immunoglobulin lambda light chains
JOURNAL J. Exp. Med. 166, 265-270 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 12 > 713 Ig lambda chain precursor V-x,J-2,C-2 region
sigp 12 68 Ig lambda chain signal peptide
matp 69 > 713 Ig lambda chain
recomb 379 380 V-region end/J2-region start
recomb 414 415 J2-region end/C2-region start
BASE COUNT 175 a 190 c 159 g 189 t
ORIGIN
1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt
61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag
121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc
181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca
241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta
301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa
361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc
421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca
481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga
541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca
601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca
661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct
//
LOCUS MUSIGLVD 681 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds.
ACCESSION M34597
KEYWORDS constant region; germline; immunoglobulin light chain;
joining exon; lambda-immunoglobulin; variable region.
SOURCE Mouse (strain Balb/c AnPt) liver DNA, clone 30X2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 681)
AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and
is conserved in mammalian species
JOURNAL J. Immunol. 144, 2816-2820 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 59 104 Ig lambda-chain precursor Vx-J2-C2 region,
exon 1
209 531 Ig lambda-chain precursor Vx-J2-C2 region,
exon 2
sigp 59 104 Ig lambda-chain signal peptide
209 219 Ig lambda-chain signal peptide
matp 220 528 Ig lambda-chain
IVS 105 208 Ig lambda-chain Vx-J2-C2 region intron A
recomb 530 531 Vx-region end/J2-region start
recomb 565 566 J2-region end/C2-region start
BASE COUNT 179 a 156 c 136 g 210 t
ORIGIN Chromosome 16.
1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat
61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt
121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc
181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca
241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag
301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg
361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct
421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa
481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa
541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta
601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa
661 tcctcttgga cttcgtaggg c
//
LOCUS PAERRE 1877 bp ss-rRNA RNA 03-AUG-1990
DEFINITION P.kadiakensis 18S rRNA.
ACCESSION M34359
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.kadiakensis rRNA.
ORGANISM Palaemonetes kadiakensis
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Caridea;
Palaemonoidea; Palaemonidae.
REFERENCE 1 (bases 1 to 1877)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustaceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1877 18S ribosomal RNA
BASE COUNT 339 a 321 c 399 g 313 t 505 others
ORIGIN
1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt
61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt
121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac
181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg
241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca
301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct
361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna
421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga
481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac
541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn
601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac
841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna
901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta
961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta
1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa
1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt
1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn
1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc
1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt
1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn
1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn
1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg
1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna
1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg
1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca
1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact
1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag
1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn
1861 nnnnnnnnnn nnnnnnn
//
LOCUS PBESVBRA 584 bp ds-DNA INV 03-AUG-1990
DEFINITION P.berghei telomeric repeat region subfragment alpha DNA.
ACCESSION M34601
KEYWORDS .
SOURCE P.berghei DNA, clone pTel.1.
ORGANISM Plasmodium berghei
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 584)
AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
TITLE Organization of subtelomeric repeats in Plasmodium berghei
JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt 461 541 27 bp repeats
BASE COUNT 205 a 41 c 85 g 251 t 2 others
ORIGIN
1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac
61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc
121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata
181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat
241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat
301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa
361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt
421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg
481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg
541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga
//
LOCUS PBESVBRB 593 bp ds-DNA INV 03-AUG-1990
DEFINITION P.berghei telomeric repeat region subfragment a DNA.
ACCESSION M34602
KEYWORDS .
SOURCE P.berghei DNA, clone pTel.1.
ORGANISM Plasmodium berghei
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 593)
AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
TITLE Organization of subtelomeric repeats in Plasmodium berghei
JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990)
STANDARD simple staff_review
FEATURES from to/span description
rpt 484 510 27 bp repeat motif
BASE COUNT 209 a 40 c 97 g 247 t
ORIGIN
1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca
61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac
121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa
181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt
241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat
301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat
361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat
421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg
481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt
541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg
//
LOCUS PEURRE 1902 bp ss-rRNA RNA 03-AUG-1990
DEFINITION P.aztecus 18S rRNA.
ACCESSION M34362
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.aztecus rRNA.
ORGANISM Penaeus aztecus
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea;
Penaeidae.
REFERENCE 1 (bases 1 to 1902)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustaceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1902 18S ribosomal RNA
BASE COUNT 340 a 369 c 427 g 332 t 434 others
ORIGIN
1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt
61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt
121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg
181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc
241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg
301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc
361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn
421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag
481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga
541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga
601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg
661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc
721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn
781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn
841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg
901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg
961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna
1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn
1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg
1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa
1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn
1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg
1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac
1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta
1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct
1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg
1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact
1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg
1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt
1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn
//
LOCUS POCRRE 1874 bp ss-rRNA RNA 03-AUG-1990
DEFINITION P.ascensionis 18S rRNA.
ACCESSION M34358
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.ascensionis rRNA.
ORGANISM Procaris ascensionis
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Caridea;
Procaridoidea; Procarididae.
REFERENCE 1 (bases 1 to 1874)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustaceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1874 18S ribosomal RNA
BASE COUNT 298 a 291 c 331 g 266 t 688 others
ORIGIN
1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct
61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt
121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac
181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt
241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac
301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn
361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc
481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa
541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn
601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt
781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg
841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat
901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat
961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga
1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac
1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt
1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn
1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca
1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt
1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta
1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct
1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag
1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca
1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn
1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg
1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg
1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1861 nnnnnnnnnn nnnn
//
LOCUS PRARRE 1869 bp ss-rRNA RNA 03-AUG-1990
DEFINITION P.leonensis 18S rRNA.
ACCESSION M34363
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE P.leonensis rRNA.
ORGANISM Procambarus leonensis
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea;
Astacoidea; Cambaridae.
REFERENCE 1 (bases 1 to 1869)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustaceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1869 18S ribosomal RNA
BASE COUNT 409 a 412 c 459 g 419 t 170 others
ORIGIN
1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt
61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt
121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat
181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt
241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc
301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca
361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg
421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt
481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact
541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt
601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag
661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc
721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn
781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa
841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg
901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac
961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg
1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg
1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg
1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn
1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa
1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta
1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact
1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg
1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg
1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac
1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa
1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn
1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt
1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa
1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn
1861 nnnnnnnnn
//
LOCUS PVIC1RPTA 711 bp ds-DNA INV 03-AUG-1990
DEFINITION P.vivax circumsporozoite protein gene, partial cds.
ACCESSION M28745 M25758
KEYWORDS circumsporozoite protein.
SOURCE P.vivax sporozoite (isolate VK247) sporozoite DNA.
ORGANISM Plasmodium vivax
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 711)
AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
Waters,A.P. and Prasittisuk,C.
TITLE Circumsporozoite protein heterogeneity in the human malaria
parasite Plasmodium vivax
JOURNAL Science 245, 973-976 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Lanar, 25-JUN-1989.
FEATURES from to/span description
pept < 1 > 711 circumsporozoite protein (AA at 1)
site 1 15 region 1
site 697 711 region 2
site 502 597 post repeat variable region
site 598 696 post repeat constant region
BASE COUNT 261 a 151 c 226 g 73 t
ORIGIN
1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat
61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca
121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca
181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga
241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc
301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa
361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga
421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat
481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca
541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga
601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta
661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c
//
LOCUS PVIC1RPTB 657 bp ds-DNA INV 03-AUG-1990
DEFINITION P.vivax circumsporozoite protein gene, partial cds.
ACCESSION M28746 M25759
KEYWORDS circumsporozoite protein.
SOURCE P.vivax sporozoite (isolate VK210) DNA.
ORGANISM Plasmodium vivax
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 657)
AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
Waters,A.P. and Prasittisuk,C.
TITLE Circumsporozoite protein heterogeneity in the human malaria
parasite Plasmodium vivax
JOURNAL Science 245, 973-976 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Lanar, 25-JUN-1989.
FEATURES from to/span description
pept < 1 > 657 circumsporozoite protein (AA at 1)
site 1 15 region 1
site 643 657 region 2
rpt 16 477 27 bp repeats
site 478 544 3' post repeat variable region
site 545 642 3' post repeat constant region
BASE COUNT 229 a 135 c 224 g 69 t
ORIGIN
1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat
61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa
121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca
181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat
241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca
301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga
361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca
421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga
481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca
541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa
601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc
//
LOCUS SHV2A 554 bp ss-RNA VRL 03-AUG-1990
DEFINITION Simian hepatitis A virus segment 2A-encoded protein mRNA,
partial cds.
ACCESSION M34085
KEYWORDS .
SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA.
ORGANISM Simian hepatitis A virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Picornaviridae.
REFERENCE 1 (bases 1 to 554)
AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M.
TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and
genetic comparison with human HAV
JOURNAL Unpublished (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.A.Brown, 04-MAY-1990.
Author address: E.A.Brown
547 Burnett-Womack
CB# 7030
Chapel Hill, NC 27599
FEATURES from to/span description
pept < 1 > 554 segment 2A-encoded protein (AA at 1)
BASE COUNT 184 a 71 c 138 g 161 t
ORIGIN Segment 2A; map position 3108-3662.
1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg
61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag
121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa
181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg
241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg
301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg
361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat
421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt
481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg
541 tcaggaatac agga
//
LOCUS SHVVP1CP 2373 bp ss-RNA VRL 03-AUG-1990
DEFINITION Simian hepatitis A virus capsid protein VP1 mRNA, partial cds.
ACCESSION M34084
KEYWORDS capsid protein VP1.
SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed
in cwll line BS-C-1.
ORGANISM Simian hepatitis A virus
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Picornaviridae.
REFERENCE 1 (bases 1 to 2373)
AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M.
TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and
genetic comparison with human HAV
JOURNAL J. Virol. 63, 4932-4937 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.A.Brown, 04-MAY-1990.
FEATURES from to/span description
pept < 1 > 2373 capsid protein VP1 (AA at 1)
BASE COUNT 661 a 447 c 491 g 774 t
ORIGIN Segment P1; map position 735-3107.
1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg
61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact
121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca
181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga
241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa
301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg
361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca
421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct
481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc
541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat
601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct
661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca
721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt
781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat
841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca
901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag
961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa
1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg
1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt
1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc
1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg
1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag
1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat
1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct
1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct
1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag
1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta
1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca
1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg
1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt
1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg
1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg
1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca
1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg
2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat
2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc
2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg
2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg
2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat
2341 gatcctcgaa ctgaagagga tcgtaaattt gaa
//
LOCUS STNRRE 1885 bp ss-rRNA RNA 03-AUG-1990
DEFINITION S.hispidus 18S rRNA.
ACCESSION M34361
KEYWORDS 18S ribosomal RNA; ribosomal RNA.
SOURCE S.hispidus rRNA.
ORGANISM Stenopus hispidus
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea;
Stenopodidae.
REFERENCE 1 (bases 1 to 1885)
AUTHORS Kim,W. and Abele,L.G.
TITLE Molecular phylogeny of selected decapod crustaceans based on 18S
rRNA nucleotide sequences
JOURNAL J. Crust. Biol. 10, 1-13 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.G.Abele, 19-MAY-1990.
FEATURES from to/span description
rRNA 1 1885 18S ribosomal RNA
BASE COUNT 327 a 336 c 430 g 368 t 424 others
ORIGIN
1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt
61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact
121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn
181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta
241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag
301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt
361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga
421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn
481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag
541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag
601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn
661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc
721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn
781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg
841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg
901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg
961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt
1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt
1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg
1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat
1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg
1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt
1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga
1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng
1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt
1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc
1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg
1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg
1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn
1861 nnnnnnnnnn nnnnnnnnnn nnnnn
//
LOCUS SV4CG 5243 bp ds-DNA circular VRL 03-AUG-1990
DEFINITION Simian Virus 40 complete genome.
ACCESSION J02400 J02402 J02406 J02407 J02408 J02409 J02410 V01380 J02403
J04139 M24914
KEYWORDS T-antigen; agnoprotein; alternate splicing; antigen;
auxiliary region; circular; complete genome; core region;
early leader protein; enhancer; large T-antigen;
major structural protein VP1; minor structural protein VP2;
minor structural protein VP3; origin of replication; palindrome;
small T-antigen; tandem repeat; topoisomerase.
SOURCE Simian Virus 40 (SV40) DNA, cDNA, mRNA and RNA from wild type
strains 777 and 776. (When the material is not simply DNA, such is
indicated on the reference line.).
ORGANISM Rhesus macaque polyomavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Polyomaviruses.
REFERENCE 1 (bases 2463 to 2640; mRNA)
AUTHORS Dhar,R., Zain,S., Weissman,S.M., Pan,J. and Subramanian,K.
TITLE Nucleotide sequences of RNA transcribed in infected cells and by
Escherichia coli RNA polymerase from a segment of Simian Virus 40
DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 71, 371-375 (1974)
STANDARD full staff_review
REFERENCE 2 (bases 2623 to 2699; mRNA)
AUTHORS Dhar,R., Weissman,S.M., Zain,B.S., Pan,J. and Lewis,A.M.Jr.
TITLE The nucleotide sequence preceding an RNA polymerase initiation site
on SV40 DNA. part 2. the sequence of the early strand transcript
JOURNAL Nucleic Acids Res. 1, 595-613 (1974)
STANDARD full staff_review
REFERENCE 3 (bases 2623 to 2699; mRNA)
AUTHORS Dhar,R., Subramanian,K., Zain,B.S., Pan,J. and Weissman,S.M.
TITLE Nucleotide sequence about the 3' terminus of SV40 DNA transcripts
and the region where DNA synthesis is initiated
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 39, 153-160 (1975)
STANDARD full staff_review
REFERENCE 4 (sites; cds start for VP1)
AUTHORS van de Voorde,A., Contreras,R., Rogiers,R. and Fiers,W.
TITLE The initiation region of the SV40 VP1 gene
JOURNAL Cell 9, 117-120 (1976)
STANDARD full staff_review
REFERENCE 5 (bases 470 to 524)
AUTHORS Ysebaert,M., Thys,F., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the restriction fragments Hind L and Hind M
of SV40 DNA
JOURNAL Nucleic Acids Res. 3, 3409-3421 (1976)
STANDARD full staff_review
REFERENCE 6 (bases 32 to 236; direct tandem repeats)
AUTHORS Subramanian,K.N., Reddy,V.B. and Weissman,S.M.
TITLE Occurrence of reiterated sequences in an untranslated region of
Simian Virus 40 DNA determined by nucleotide sequence analysis
JOURNAL Cell 10, 497-507 (1977)
STANDARD full staff_review
REFERENCE 7 (bases 4592 to 4661; mRNA and DNA)
AUTHORS Thimmappaya,B. and Weissman,S.M.
TITLE The early region of SV40 DNA may have more than one gene
JOURNAL Cell 11, 837-843 (1977)
STANDARD full staff_review
REFERENCE 8 (bases 1046 to 1620; overlap between VP1, VP2 and VP3 genes)
AUTHORS Contreras,R., Rogiers,R., van de Voorde,A. and Fiers,W.
TITLE Overlapping of the VP2-VP3 gene and the VP1 gene in the SV40 genome
JOURNAL Cell 12, 529-538 (1977)
STANDARD full staff_review
REFERENCE 9 (bases 1 to 164; 5092 to 5243; origin of replication)
AUTHORS Subramanian,K.N., Dhar,R. and Weissman,S.M.
TITLE Nucleotide sequence of a fragment of SV40 DNA that contains the
origin of DNA replication and specifies the 5' ends of "early" and
"late" viral RNA: III. construction of the total sequence of
EcorII-G fragment of SV40 DNA
JOURNAL J. Biol. Chem. 252, 355-367 (1977)
STANDARD full staff_review
REFERENCE 10 (bases 1 to 585; 4857 to 5243)
AUTHORS Dhar,R., Subramanian,K.N., Pan,J. and Weissman,S.M.
TITLE Structure of a large segment of the genome of Simian Virus 40 that
does not encode known proteins
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 74, 827-831 (1977)
STANDARD full staff_review
REFERENCE 11 (bases 3733 to 4007)
AUTHORS Volckaert,G., Contreras,R., Soeda,E., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of Simian Virus 40 Hind H restriction fragment
JOURNAL J. Mol. Biol. 110, 467-510 (1977)
STANDARD full staff_review
REFERENCE 12 (bases 1427 to 2593; VP1 gene)
AUTHORS Pan,J., Reddy,V.B., Thimmappaya,B. and Weissman,S.M.
TITLE Nucleotide sequence of the gene for the major structural protein of
SV40 virus
JOURNAL Nucleic Acids Res. 4, 2539-2548 (1977)
STANDARD full staff_review
REFERENCE 13 (bases 323 to 565; mRNA)
AUTHORS Celma,M.L., Dhar,R., Pan,J. and Weissman,S.M.
TITLE Comparison of the nucleotide sequence of the messenger RNA for the
major structural protein of SV40 with the DNA sequence encoding the
amino acids of the protein
JOURNAL Nucleic Acids Res. 4, 2549-2559 (1977)
STANDARD full staff_review
REFERENCE 14 (bases 347 to 472)
AUTHORS van Heuverswyn,H., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the SV40 DNA restriction fragment Hind C-Hap
2
JOURNAL Nucleic Acids Res. 4, 1015-1024 (1977)
STANDARD full staff_review
REFERENCE 15 (bases 1709 to 1786; RNA and DNA)
AUTHORS Contreras,R., Volckaert,G., Thys,F., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the restriction fragment Hind F-Eco RI2 of
SV40 DNA
JOURNAL Nucleic Acids Res. 4, 1001-1014 (1977)
STANDARD full staff_review
REFERENCE 16 (bases 1 to 163; 181 to 5243)
AUTHORS Reddy,V.B., Thimmappaya,B., Dhar,R., Subramanian,K.N., Zain,S.,
Pan,J., Ghosh,P.K., Celma,M.L. and Weissman,S.M.
TITLE The genome of Simian Virus 40
JOURNAL Science 200, 494-502 (1978)
STANDARD full staff_review
REFERENCE 17 (bases 1 to 163; 181 to 5243)
AUTHORS Fiers,W., Contreras,R., Haegeman,G., Rogiers,R., van de Voorde,A.,
van Heuverswyn,H., van Herreweghe,J., Volckaert,G. and Ysebaert,M.
TITLE Complete nucleotide sequence of SV40 DNA
JOURNAL Nature 273, 113-120 (1978)
STANDARD full staff_review
REFERENCE 18 (bases 2633 to 2933; RNA and DNA)
AUTHORS Thimmappaya,B., Zain,B.S., Dhar,R. and Weissman,S.M.
TITLE Nucleotide sequence of DNA template for the 3' ends of SV40 mRNA:
II. the sequence of the DNA fragment EcorII-F and a part of
EcorII-H
JOURNAL J. Biol. Chem. 253, 1613-1618 (1978)
STANDARD full staff_review
REFERENCE 19 (bases 325 to 526; 1463 to 1540;
sites-late 16S mRNA and RNA splicing)
AUTHORS Ghosh,P.K., Reddy,V.B., Swinscoe,J., Choudary,P.V., Lebowitz,P. and
Weissman,S.M.
TITLE The 5'-terminal leader sequence of late 16 S mRNA from cells
infected with Simian Virus 40
JOURNAL J. Biol. Chem. 253, 3643-3647 (1978)
STANDARD full staff_review
REFERENCE 20 (bases 322 to 536; 1450 to 1541;
sites-late 16S mRNA and RNA splicing)
AUTHORS Haegeman,G. and Fiers,W.
TITLE Evidence for 'splicing' of SV40 16S mRNA
JOURNAL Nature 273, 70-73 (1978)
STANDARD full staff_review
REFERENCE 21 (bases 4568 to 5175; small t antigen gene)
AUTHORS Volckaert,G., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the Simian Virus 40 small-t gene
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 2160-2164 (1978)
STANDARD full staff_review
REFERENCE 22 (bases 5109 to 5243; 1 to 63; RNA)
AUTHORS Woodworth-Gutai,M. and Nathans,D.
TITLE Evolutionary variants of Simian Virus 40: nucleotide sequence of a
conserved SV40 DNA segment containing the origin of viral DNA
replication as an inverted repetition
JOURNAL J. Mol. Biol. 126, 259-274 (1978)
STANDARD full staff_review
REFERENCE 23 (bases 1 to 74; 5171 to 5243; T antigen binding sites)
AUTHORS Tjian,R.
TITLE The binding site on SV40 DNA for a T antigen-related protein
JOURNAL Cell 13, 165-179 (1978)
STANDARD full staff_review
REFERENCE 24 (sites; 5' and 3' termini of 19s late mRNAs)
AUTHORS Ghosh,P.K., Reddy,V.B., Swinscoe,J., Lebowitz,P. and Weissman,S.M.
TITLE Heterogeneity and 5'-terminal structures of the late RNAs of Simian
Virus 40
JOURNAL J. Mol. Biol. 126, 813-846 (1978)
STANDARD full staff_review
REFERENCE 25 (bases 562 to 1620; VP2 and VP3 genes)
AUTHORS Reddy,V.B., Dhar,R. and Weissman,S.M.
TITLE Nucleotide sequence of the genes for the Simian Virus 40 proteins
VP2 and VP3
JOURNAL J. Biol. Chem. 253, 621-630 (1978)
STANDARD full staff_review
REFERENCE 26 (bases 358 to 914; sites-5' termini of late mRNAs)
AUTHORS Dhar,R., Reddy,V.B. and Weissman,S.M.
TITLE Nucleotide sequence of the DNA encoding the 5'-terminal sequences
of Simian Virus 40 late mRNA
JOURNAL J. Biol. Chem. 253, 612-620 (1978)
STANDARD full staff_review
REFERENCE 27 (bases 2300 to 2668; sites-mRNA 3' termini)
AUTHORS Zain,B.S., Thimmappaya,B., Dhar,R. and Weissman,S.M.
TITLE Nucleotide sequences of DNA encoding the 3' ends of SV40 mRNA I.
the sequence of the DNA fragment HindII,III-G
JOURNAL J. Biol. Chem. 253, 1606-1612 (1978)
STANDARD full staff_review
REFERENCE 28 (bases 2669 to 2901)
AUTHORS van Heuverswyn,H., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the Simian-Virus-40 DNA region coding for
the carboxyl-terminal part of the T antigen
JOURNAL Eur. J. Biochem. 86, 335-344 (1978)
STANDARD full staff_review
REFERENCE 29 (bases 264 to 526; 1463 to 1516; 2582 to 2602; 2663 to 2676;
sites-cDNA, 16S leader segments)
AUTHORS Reddy,V.B., Ghosh,P.K., Lebowitz,P. and Weissman,S.M.
TITLE Gaps and duplicated sequences in the leaders of SV40 16S RNA
JOURNAL Nucleic Acids Res. 5, 4195-4213 (1978)
STANDARD full staff_review
REFERENCE 30 (sites; cap site for 16s and 19s late mRNAs)
AUTHORS Haegeman,G. and Fiers,W.
TITLE Localization of the 5' terminus of late SV40 mRNA
JOURNAL Nucleic Acids Res. 5, 2359-2371 (1978)
STANDARD full staff_review
REFERENCE 31 (sites; mutations in region of origin of replication)
AUTHORS Subramanian,K.N. and Shenk,T.
TITLE Definition of the boundaries of the origin of DNA replication in
Simian Virus 40
JOURNAL Nucleic Acids Res. 5, 3635-3642 (1978)
STANDARD full staff_review
REFERENCE 32 (sites; splice sites in late mRNAs)
AUTHORS Lai,C.-J., Dhar,R. and Khoury,G.
TITLE Mapping the spliced and unspliced late lytic SV40 RNAs
JOURNAL Cell 14, 971-982 (1978)
STANDARD full staff_review
REFERENCE 33 (sites; early mRNAs)
AUTHORS May,E., Kress,M. and May,P.
TITLE Characterization of two SV40 early mRNAs and evidence for a nuclear
"prespliced" RNA species
JOURNAL Nucleic Acids Res. 5, 3083-3099 (1978)
STANDARD full staff_review
REFERENCE 34 (sites; early mRNAs and RNA splicing)
AUTHORS Berk,A.J. and Sharp,P.A.
TITLE Spliced early mRNAs of Simian Virus 40
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 1274-1278 (1978)
STANDARD full staff_review
REFERENCE 35 (review; bases 1 to 163; 181 to 5243)
AUTHORS Lebowitz,P. and Weissman,S.M.
TITLE Organization and transcription of the Simian Virus 40 genome
JOURNAL Curr. Top. Microbiol. Immunol. 87, 43-172 (1979)
STANDARD full staff_review
REFERENCE 36 (bases 1 to 472; 5174 to 5243)
AUTHORS van Heuverswyn,H. and Fiers,W.
TITLE Nucleotide sequence of the Hind-C fragment of Simian Virus 40 DNA:
comparison of the 5'-untranslated region of wild-type virus and of
some deletion mutants
JOURNAL Eur. J. Biochem. 100, 51-60 (1979)
STANDARD full staff_review
REFERENCE 37 (bases 1276 to 1539; deletion mutant analysis)
AUTHORS Contreras,R., Cole,C., Berg,P. and Fiers,W.
TITLE Nucleotide sequence analysis of two Simian Virus 40 mutants with
deletions in the late region of the genome
JOURNAL J. Virol. 29, 789-793 (1979)
STANDARD full staff_review
REFERENCE 38 (bases 325 to 562; 1403 to 1582; cDNA)
AUTHORS Bina-Stein,M., Thoren,M., Salzman,N. and Thompson,J.A.
TITLE Rapid sequence determination of late Simian Virus 40 16S mRNA
leader by using inhibitors of reverse transcriptase
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 731-735 (1979)
STANDARD full staff_review
REFERENCE 39 (bases 2691 to 4571; 4638 to 5163;
sites-5' termini of early mRNAs)
AUTHORS Reddy,V.B., Ghosh,P.K., Lebowitz,P., Piatak,M. and Weissman,S.M.
TITLE Simian Virus 40 early mRNA's: I. genomic localization of 3' and 5'
termini and two major splices in mRNA from transformed and
lytically infected cells
JOURNAL J. Virol. 30, 279-296 (1979)
STANDARD full staff_review
REFERENCE 40 (sites; splice sites for large T antigen mRNA)
AUTHORS Mark,D.F. and Berg,P.
TITLE A third splice site in SV40 early mRNA
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 55-62 (1980)
STANDARD full staff_review
REFERENCE 41 (bases 4963 to 5179; deletion mutant analysis)
AUTHORS Pipas,J.M., Adler,S.P., Peden,K.W.C. and Nathans,D.
TITLE Deletion mutants of SV40 that affect the structure of viral tumor
antigens
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 285-291 (1980)
STANDARD full staff_review
REFERENCE 42 (bases 2667 to 2840; deletion mutant analysis)
AUTHORS van Heuverswyn,H., Cole,C., Berg,P. and Fiers,W.
TITLE Nucleotide sequence analysis of two Simian Virus 40 mutants with
deletions in the region coding for the carboxyl terminus of the T
antigen
JOURNAL J. Virol. 30, 936-941 (1979)
STANDARD full staff_review
REFERENCE 43 (bases 263 to 532; deletion mutant analysis)
AUTHORS Subramanian,K.N.
TITLE Segments of Simian Virus 40 DNA spanning most of the leader
sequence of the major late viral messenger RNA are dispensable
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 2556-2560 (1979)
STANDARD full staff_review
REFERENCE 44 (bases 4548 to 4923; deletion mutant analysis)
AUTHORS Volckaert,G., Feunteun,J., Crawford,L.V., Berg,P. and Fiers,W.
TITLE Nucleotide sequence deletions within the coding region for small-t
antigen of Simian Virus 40
JOURNAL J. Virol. 30, 674-682 (1979)
STANDARD full staff_review
REFERENCE 45 (bases 250 to 304; 345 to 399; deletion mutant analysis)
AUTHORS Haegeman,G., van Heuverswyn,H., Gheysen,D. and Fiers,W.
TITLE Heterogeneity of the 5' terminus of late mRNA induced by a viable
Simian Virus 40 deletion mutant
JOURNAL J. Virol. 31, 484-493 (1979)
STANDARD full staff_review
REFERENCE 46 (bases 4566 to 5225; cDNA)
AUTHORS Thompson,J.A., Radonovich,M.F. and Salzman,N.P.
TITLE Characterization of the 5'-terminal structure of Simian Virus 40
early mRNA's
JOURNAL J. Virol. 31, 437-446 (1979)
STANDARD full staff_review
REFERENCE 47 (sites; deletion mutant analysis over bases 4553 to 5172)
AUTHORS Thimmappaya,B. and Shenk,T.
TITLE Nucleotide sequence analysis of viable deletion mutants lacking
segments of the Simian Virus 40 genome coding for small t antigen
JOURNAL J. Virol. 30, 668-673 (1979)
STANDARD full staff_review
REFERENCE 48 (sites; deletion mutant analysis of major late cap site)
AUTHORS Haegeman,G., Iserentant,D., Gheysen,D. and Fiers,W.
TITLE Characterization of the major altered leader sequence of late mRNA
induced by SV40 deletion mutant dl-1811
JOURNAL Nucleic Acids Res. 7, 1799-1814 (1979)
STANDARD full staff_review
REFERENCE 49 (sites; splice sites in early mRNAs)
AUTHORS Khoury,G., Gruss,P., Dhar,R. and Lai,C.-J.
TITLE Processing and expression of early SV40 mRNA: a role for RNA
conformation in splicing
JOURNAL Cell 18, 85-92 (1979)
STANDARD full staff_review
REFERENCE 50 (review; bases 1 to 163; 181 to 5243)
AUTHORS Buchman,A.R., Burnett,L. and Berg,P.
TITLE Appendix A: The SV40 nucleotide sequence
JOURNAL (in) Tooze,J. (Eds.);
DNA Tumor Viruses - Second Edition Revised: 799-841;
Cold Spring Harbor Laboratory, Cold Spring Harbor (1981).
STANDARD full staff_review
REFERENCE 51 (bases 2663 to 2921; deletion mutant analysis)
AUTHORS Alwine,J.C. and Khoury,G.
TITLE Simian Virus 40-associated small RNA: mapping on the Simian Virus
40 genome and characterization of its synthesis
JOURNAL J. Virol. 36, 701-708 (1980)
STANDARD full staff_review
REFERENCE 52 (bases 4007 to 4574)
AUTHORS Volckaert,G., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the Simian Virus 40 HindII+III restriction
fragment A: (second part of the T antigen gene)
JOURNAL Eur. J. Biochem. 106, 169-177 (1980)
STANDARD full staff_review
REFERENCE 53 (bases 3476 to 3735; RNA and DNA)
AUTHORS van Herreweghe,J., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the Simian Virus 40 HindII+III restriction
fragment I: (fourth part of the T antigen gene)
JOURNAL Eur. J. Biochem. 106, 179-192 (1980)
STANDARD full staff_review
REFERENCE 54 (bases 3146 to 3481)
AUTHORS van Herreweghe,J., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the Hind-I-proximal part of Simian Virus 40
HindII+III restriction fragment B: (fifth part of the T antigen
gene
JOURNAL Eur. J. Biochem. 106, 193-198 (1980)
STANDARD full staff_review
REFERENCE 55 (bases 2669 to 3251)
AUTHORS van Heuverswyn,H., van de Voorde,A., van Herreweghe,J.,
Volckaert,G., de Winne,P. and Fiers,W.
TITLE Nucleotide sequence of Simian Virus 40 DNA: structure of the middle
segment of the HindII+III restriction fragment B: (sixth part of
the T antigen gene) and codon usage
JOURNAL Eur. J. Biochem. 106, 199-209 (1980)
STANDARD full staff_review
REFERENCE 56 (bases 4572 to 4917)
AUTHORS Seif,I., Khoury,G. and Dhar,R.
TITLE A rapid enzymatic DNA sequencing technique: determination of
sequence alterations in early Simian Virus 40 temperature sensitive
and deletion mutants
JOURNAL Nucleic Acids Res. 8, 2225-2240 (1980)
STANDARD full staff_review
REFERENCE 57 (bases 1 to 74; 5161 to 5243;
deletion and point mutants over the replication origin)
AUTHORS DiMaio,D. and Nathans,D.
TITLE Cold-sensitive regulatory mutants of Simian Virus 40
JOURNAL J. Mol. Biol. 140, 129-142 (1980)
STANDARD full staff_review
REFERENCE 58 (bases 13 to 65; 5155 to 5227;
deletion mutants over the early promoter)
AUTHORS Benoist,C. and Chambon,P.
TITLE Deletions covering the putative promoter region of early mRNAs of
Simian Virus 40 do not abolish T-antigen expression
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 3865-3869 (1980)
STANDARD full staff_review
REFERENCE 59 (sites; T antigen binding sites)
AUTHORS Shalloway,D., Kleinberger,T. and Livingston,D.M.
TITLE Mapping of SV40 DNA replication origin region binding sites for the
SV40 T antigen by protection against exonuclease III digestion
JOURNAL Cell 20, 411-422 (1980)
STANDARD full staff_review
REFERENCE 60 (bases 1 to 33; 212 to 236; 5161 to 5243;
origin-defective deletion mutants)
AUTHORS Gluzman,Y., Sambrook,J.F. and Frisque,R.J.
TITLE Expression of early genes of origin-defective mutants of Simian
Virus 40
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 3898-3902 (1980)
STANDARD full staff_review
REFERENCE 61 (bases 5213 to 5236; early mRNA initiation sites)
AUTHORS Haegeman,G. and Fiers,W.
TITLE Characterization of the 5'-terminal cap structures of early Simian
Virus 40 mRNA
JOURNAL J. Virol. 35, 955-961 (1980)
STANDARD full staff_review
REFERENCE 62 (sites; early mRNA initiation sites and repression by T antigen)
AUTHORS Hansen,U., Tenen,D.G., Livingston,D.M. and Sharp,P.A.
TITLE T antigen repression of SV40 early transcription from two promoters
JOURNAL Cell 27, 603-612 (1981)
STANDARD full staff_review
REFERENCE 63 (sites; poly-A at 2662; deletion mutants over 2533-2775)
AUTHORS Fitzgerald,M. and Shenk,T.
TITLE The sequence 5'-AAUAAA-3' forms part of the recognition site for
polyadenylation of late SV40 mRNAs
JOURNAL Cell 24, 251-260 (1981)
STANDARD full staff_review
REFERENCE 64 (bases 306 to 428; deletion mutant analysis)
AUTHORS Woodworth-Gutai,M.
TITLE Recombination in SV40-infected cells: viral DNA sequences at sites
of circularization of transfecting linear DNA
JOURNAL Virology 109, 353-365 (1981)
STANDARD full staff_review
REFERENCE 65 (bases 101 to 178; 251 to 280; early promoter analysis)
AUTHORS Gruss,P., Dhar,R. and Khoury,G.
TITLE Simian Virus 40 tandem repeated sequences as an element of the
early promoter
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 943-947 (1981)
STANDARD full staff_review
REFERENCE 66 (bases 110 to 178; 251 to 601; deletion mutant analysis)
AUTHORS Barkan,A. and Mertz,J.E.
TITLE DNA sequence analysis of Simian Virus 40 mutants with deletions
mapping in the leader region of the late viral mRNA's: mutants with
deletions similar in size and position exhibit varied phenotypes
JOURNAL J. Virol. 37, 730-737 (1981)
STANDARD full staff_review
REFERENCE 67 (bases 1 to 54; 5148 to 5243; cDNAs over this region)
AUTHORS Ghosh,P.K., Lebowitz,P., Frisque,R.J. and Gluzman,Y.
TITLE Identification of a promoter component involved in positioning the
5' termini of Simian Virus 40 early mRNAs
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 100-104 (1981)
STANDARD full staff_review
REFERENCE 68 (bases 1 to 173; 182 to 280; 5216 to 5243;
early promoter analysis)
AUTHORS Benoist,C. and Chambon,P.
TITLE In vivo sequence requirements of the SV40 early promoter region
JOURNAL Nature 290, 304-310 (1981)
STANDARD full staff_review
REFERENCE 69 (sites; promoter for early mRNAs)
AUTHORS Mathis,D.J. and Chambon,P.
TITLE The SV40 early region TATA box is required for accurate in vitro
initiation of transcription
JOURNAL Nature 290, 310-315 (1981)
STANDARD full staff_review
REFERENCE 70 (bases 1 to 177; 5171 to 5243; autoregulation by T antigen)
AUTHORS Myers,R.M., Rio,D.C., Robbins,A.K. and Tjian,R.
TITLE SV40 gene expression is modulated by the cooperative binding of T
antigen to DNA
JOURNAL Cell 25, 373-384 (1981)
STANDARD full staff_review
REFERENCE 71 (bases 1 to 102; 5148 to 5243;
upstream shift of early mRNA initiation site)
AUTHORS Ghosh,P.K. and Lebowitz,P.
TITLE Simian Virus 40 early mRNA's contain multiple 5'termini upstream
and downstream from a Hogness-Goldberg sequence; a shift in 5'
termini during the lytic cycle is mediated by large T antigen
JOURNAL J. Virol. 40, 224-240 (1981)
STANDARD full staff_review
REFERENCE 72 (bases 2644 to 2923; deletion mutant analysis)
AUTHORS Feunteun,J., Carmichael,G., Nicolas,J.C. and Kress,M.
TITLE Mutant carrying deletions in the two Simian Virus 40 early genes
JOURNAL J. Virol. 40, 625-634 (1981)
STANDARD full staff_review
REFERENCE 73 (sites; cds for agnoprotein)
AUTHORS Jackson,V. and Chalkley,R.
TITLE Use of whole-cell fixation to visualize replicating and maturing
Simian Virus 40: identification of new viral gene product
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 6081-6085 (1981)
STANDARD full staff_review
REFERENCE 74 (bases 1 to 32; 212 to 236; 5161 to 5243;
T antigen binding to a deletion mutant)
AUTHORS McKay,R. and DiMaio,D.
TITLE Binding of an SV40 T antigen-related protein to the DNA of SV40
regulatory mutants
JOURNAL Nature 289, 810-813 (1981)
STANDARD full staff_review
REFERENCE 75 (sites; transcriptional enhancement by the 72 bp repeats)
AUTHORS Moreau,V., Hen,R., Wasylyk,B., Everett,R., Gaub,M.P. and Chambon,P.
TITLE The SV40 72 base repair repeat has a striking effect on gene
expression both in SV40 and other chimeric recombinants
JOURNAL Nucleic Acids Res. 9, 6047-6068 (1981)
STANDARD full staff_review
REFERENCE 76 (bases 325 to 622; 1463 to 1511; cDNAs;
late mRNA synthesis by deletion mutants)
AUTHORS Piatak,M., Subramanian,K.N., Roy,P. and Weissman,S.M.
TITLE Late messenger RNA production by viable Simian Virus 40 mutants
with deletions in the leader region
JOURNAL J. Mol. Biol. 153, 589-618 (1981)
STANDARD full staff_review
REFERENCE 77 (sites; agnoprotein coding sequence)
AUTHORS Jay,G., Nomura,S., Anderson,C.W. and Khoury,G.
TITLE Identification of the SV40 agnogene product: a DNA binding protein
JOURNAL Nature 291, 346-349 (1981)
STANDARD full staff_review
REFERENCE 78 (sites; mapping of the origin of replication)
AUTHORS Bergsma,D.J., Olive,D.M., Hartzell,S.W. and Subramanian,K.N.
TITLE Territorial limits and functional anatomy of the Simian Virus 40
replication origin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 381-385 (1982)
STANDARD full staff_review
REFERENCE 79 (sites; topoisomerase cleavage sites)
AUTHORS Edwards,K.A., Halligan,B.D., Davis,J.L., Nivera,N.L. and Liu,L.F.
TITLE Recognition sites of eukaryotic DNA topoisomerase I: DNA nucleotide
sequencing analysis of topo I cleavage sites on SV40 DNA
JOURNAL Nucleic Acids Res. 10, 2565-2576 (1982)
STANDARD full staff_review
REFERENCE 80 (bases 2619 to 2920; deletion mutant analysis)
AUTHORS Polvino-Bodnar,M. and Cole,C.N.
TITLE Construction and characterization of viable deletion mutants of
Simian Virus 40 lacking sequences near the 3' end of the early
region
JOURNAL J. Virol. 43, 489-502 (1982)
STANDARD full staff_review
REFERENCE 81 (bases 255 to 2535; only joints of deletion mutants)
AUTHORS Johnson,A.D., Barkan,A. and Mertz,J.E.
TITLE Nucleotide sequence analysis of the recombinant joints in 16
naturally arising deletion mutants of Simian Virus 40
JOURNAL Virology 123, 464-469 (1982)
STANDARD full staff_review
REFERENCE 82 (bases 260 to 422; RNA)
AUTHORS Hay,N., Skolnik-David,H. and Aloni,Y.
TITLE Attenuation in the control of SV40 gene expression
JOURNAL Cell 29, 183-193 (1982)
STANDARD full staff_review
REFERENCE 83 (sites; late mRNA initiation sites)
AUTHORS Alwine,J.C.
TITLE Evidence for Simian Virus 40 late transcriptional control: mixed
infections of wild-type Simian Virus 40 and a late leader deletion
mutant exhibit trans effects on late viral RNA synthesis
JOURNAL J. Virol. 42, 798-803 (1982)
STANDARD full staff_review
REFERENCE 84 (bases 284 to 326; late promoter analysis)
AUTHORS Brady,J., Radonovich,M., Vodkin,M., Natarajan,V., Thoren,M.,
Das,G., Janik,J. and Salzman,N.P.
TITLE Site-specific base substitution and deletion mutations that enhance
or suppress transcription of the SV40 major late RNA
JOURNAL Cell 31, 625-633 (1982)
STANDARD full staff_review
REFERENCE 85 (bases 1 to 16; 5227 to 5243;
mutant analysis over origin of replication)
AUTHORS Bergsma,D.J., Olive,D.M., Hartzell,S.W., Byrne,B.J. and
Subramanian,K.N.
TITLE Cyclization of linear chimeric plasmids in vivo by a novel
end-to-end joining reaction or by intramolecular recombination: one
of the products contains a 147-bp perfect palindrome stable in
Escherichia coli
JOURNAL Gene 20, 157-167 (1982)
STANDARD full staff_review
REFERENCE 86 (bases 5164 to 5243; 1 to 48;
T antigen binding to a deletion mutant)
AUTHORS Tenen,D.G., Haines,L.L. and Livingston,D.M.
TITLE Binding of an analog of the Simian Virus 40 T antigen to wild-type
and mutant viral replication origins
JOURNAL J. Mol. Biol. 157, 473-492 (1982)
STANDARD full staff_review
REFERENCE 87 (sites; T antigen binding sites)
AUTHORS DiMaio,D. and Nathans,D.
TITLE Regulatory mutants of Simian Virus 40: effect of mutations at a T
antigen binding site on DNA replication and expression of viral
genes
JOURNAL J. Mol. Biol. 156, 531-548 (1982)
STANDARD full staff_review
REFERENCE 88 (sites; mapping of early region promoter)
AUTHORS Fromm,M. and Berg,P.
TITLE Deletion mapping of DNA regions required for SV40 early promoter
function in vivo
JOURNAL J. Mol. Appl. Genet. 1, 457-481 (1982)
STANDARD full staff_review
REFERENCE 89 (sites; SV40 enhancement of beta-globin gene expression)
AUTHORS Banerji,J., Rusconi,S. and Schaffner,W.
TITLE Expression of a beta-globin gene is enhanced by remote SV40 DNA
sequences
JOURNAL Cell 27, 299-308 (1981)
STANDARD full staff_review
REFERENCE 90 (bases 1 to 129; 5228 to 5243; mapping of the early promoter)
AUTHORS Byrne,B.J., Davis,M.S., Yamaguchi,J., Bergsma,D.J. and
Subramanian,K.N.
TITLE Definition of the Simian Virus 40 early promoter region and
demonstration of a host range bias in the enhancement effect of the
Simian Virus 40 72-base-pair repeat
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 721-725 (1983)
STANDARD full staff_review
REFERENCE 91 (sites; agnogene mutants)
AUTHORS Mertz,J.E., Murphy,A. and Barkan,A.
TITLE Mutants deleted in the agnogene of Simian Virus 40 define a new
complementation group
JOURNAL J. Virol. 45, 36-46 (1983)
STANDARD full staff_review
REFERENCE 92 (bases 335 to 394; 1706 to 1753; mutant analysis)
AUTHORS Margolskee,R.F. and Nathans,D.
TITLE Suppression of a VP1 mutant of Simian Virus 40 by missense
mutations in serine codons of the viral agnogene
JOURNAL J. Virol. 48, 405-409 (1983)
STANDARD full staff_review
REFERENCE 93 (bases 3512 to 3565; 4154 to 4311;
mutant analysis over these regions)
AUTHORS Pipas,J.M., Peden,K.W.C. and Nathans,D.
TITLE Mutational analysis of Simian Virus 40 T antigen: isolation and
characterization of mutants with deletions in the T-antigen gene
JOURNAL Mol. Cell. Biol. 3, 203-213 (1983)
STANDARD full staff_review
REFERENCE 94 (bases 1709 to 1786; topoisomerase II)
AUTHORS Liu,L.F., Rowe,T.C., Yang,L., Tewey,K.M. and Chen,G.L.
TITLE Cleavage of DNA by mammalian DNA topoisomerase II
JOURNAL J. Biol. Chem. 258, 15365-15370 (1983)
STANDARD full staff_review
REFERENCE 95 (bases 2773 to 4406; only joints of deletion mutants)
AUTHORS Tornow,J. and Cole,C.N.
TITLE Intracistronic complementation in the Simian Virus 40 A gene
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 6312-6316 (1983)
STANDARD full staff_review
REFERENCE 96 (bases 21 to 118;
transcription factor binding to early promoter)
AUTHORS Dynan,W.S. and Tjian,R.
TITLE The promoter-specific transcription factor Sp1 binds to upstream
sequences in the SV40 early promoter
JOURNAL Cell 35, 79-87 (1983)
STANDARD full staff_review
REFERENCE 97 (sites; 7S-K RNA of transformed cells)
AUTHORS Sohn,U., Szyszko,J., Coombs,D. and Krause,M.
TITLE 7S-K nuclear RNA from Simian Virus 40-transformed cells has
sequence homology to the viral early promoter
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 7090-7094 (1983)
STANDARD full staff_review
REFERENCE 98 (bases 1 to 49; 5181 to 5243; T antigen binding analysis)
AUTHORS Tenen,D.G., Livingston,D.M., Wang,S.-S. and Martin,R.G.
TITLE Effect of a stem-loop structure within the SV40 replication origin
upon SV40 T antigen binding to origin region sequences
JOURNAL Cell 34, 629-639 (1983)
STANDARD full staff_review
REFERENCE 99 (bases 253 to 482; cDNA; late mRNA initiation)
AUTHORS Piatak,M., Ghosh,P.K., Norkin,L.C. and Weissman,S.M.
TITLE Sequences locating the 5' ends of the major Simian Virus 40 late
mRNA forms
JOURNAL J. Virol. 48, 503-520 (1983)
STANDARD full staff_review
REFERENCE 100(bases 1 to 14; 5210 to 5243; autoregulation by T antigen)
AUTHORS Rio,D.C. and Tjian,R.
TITLE Sv40 T antigen binding site mutations that affect autoregulation
JOURNAL Cell 32, 1227-1240 (1983)
STANDARD full staff_review
REFERENCE 101(bases 1 to 134; 5216 to 5243; early promoter analysis)
AUTHORS Everett,R.D., Baty,D. and Chambon,P.
TITLE The repeated GC-rich motifs upstream from the TATA box are
important elements of the SV40 early promoter
JOURNAL Nucleic Acids Res. 11, 2447-2464 (1983)
STANDARD full staff_review
REFERENCE 102(bases 5165 to 5243)
AUTHORS Tenen,D.G., Taylor,T.S., Haines,L.L., Bradley,M.K., Martin,R.G. and
Livingston,D.M.
TITLE Binding of simian virus 40 large T antigen from virus-infected
monkey cells to wild-type and mutant viral replication origins
JOURNAL J. Mol. Biol. 168, 791-808 (1983)
STANDARD full staff_review
REFERENCE 103(bases 1 to 26; 5170 to 5243; T antigen binding sites)
AUTHORS Jones,K.A. and Tjian,R.
TITLE Essential contact residues within SV40 large T antigen binding
sites I and II identified by alkylation-interference
JOURNAL Cell 36, 155-162 (1984)
STANDARD full staff_review
REFERENCE 104(sites; sites-5' termini of late mRNAs)
AUTHORS Lycan,D.E. and Danna,K.J.
TITLE S1 mapping of purified nascent transcripts of Simian Virus 40
JOURNAL Mol. Cell. Biol. 4, 625-633 (1984)
STANDARD full staff_review
REFERENCE 105(bases 1 to 33; 5213 to 5243; T antigen binding analysis)
AUTHORS Lewton,B.A., Delucia,A.L. and Tegtmeyer,P.
TITLE Binding of Simian Virus 40 A protein to DNA with deletions at the
origin of replication
JOURNAL J. Virol. 49, 9-13 (1984)
STANDARD full staff_review
REFERENCE 106(sites; T antigen binding sites)
AUTHORS Pomerantz,B.J. and Hassell,J.A.
TITLE Polyomavirus and Simian Virus 40 large T antigens bind to common
DNA sequences
JOURNAL J. Virol. 49, 925-937 (1984)
STANDARD full staff_review
REFERENCE 107(bases 1 to 118; 5186 to 5243; primase initiation sites)
AUTHORS Tseng,B.Y. and Ahlem,C.N.
TITLE Mouse primase initiation sites in the origin region of Simian Virus
40
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 2342-2346 (1984)
STANDARD full staff_review
REFERENCE 108(bases 1 to 294; 297 to 379; transcriptional enhancer analysis)
AUTHORS Weber,F., de Villiers,J. and Schaffner,W.
TITLE An SV40 'enhancer trap' incorporates exogenous enhancers or
generates enhancers from its own sequences
JOURNAL Cell 36, 983-992 (1984)
STANDARD full staff_review
REFERENCE 109(bases 1 to 33; 5213 to 5243;
mutant analysis at the replication origin)
AUTHORS Cohen,G.L., Wright,P.J., Delucia,A.L., Lewton,B.A.,
Anderson,M.E. and Tegtmeyer,P.
TITLE Critical spatial requirement within the origin of Simian Virus 40
DNA replication
JOURNAL J. Virol. 51, 91-96 (1984)
STANDARD full staff_review
REFERENCE 110(sites; mRNA polyadenylation signal sequences)
AUTHORS Sadofsky,M. and Alwine,J.C.
TITLE Sequences on the 3' side of hexanucleotide AAUAAA affect efficiency
of cleavage at the polyadenyation site
JOURNAL Mol. Cell. Biol. 4, 1460-1468 (1984)
STANDARD full staff_review
REFERENCE 111(bases 1582 to 1635; 2524 to 2652;
chromatin structure at origin of replication; mutational analysis)
AUTHORS Innis,J.W. and Scott,W.A.
TITLE DNA replication and chromatin structure of Simian Virus 40
insertion mutants
JOURNAL Mol. Cell. Biol. 4, 1499-1507 (1984)
STANDARD full staff_review
REFERENCE 112(sites; bidirectional promoter element)
AUTHORS Hartzell,S.W., Byrne,B.J. and Subramanian,K.N.
TITLE Mapping of the late promoter of Simian virus 40
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 23-27 (1984)
STANDARD full staff_review
REFERENCE 113(sites;
late promoter sequence requirements in the presence of T antigen)
AUTHORS Hartzell,S.W., Byrne,B.J. and Subramanian,K.N.
TITLE The simian virus 40 minimal origin and the 72-base-pair repeat are
required simultaneously for efficient induction of late gene
expression with large tumor antigen
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 6335-6339 (1984)
STANDARD full staff_review
REFERENCE 114(bases 304 to 528)
AUTHORS Hay,N., Kessler,M. and Aloni,Y.
TITLE SV40 deletion mutant (d1861) with agnoprotein shortened by four
amino acids
JOURNAL Virology 137, 160-170 (1984)
STANDARD full staff_review
REFERENCE 115(bases 1 to 366; 5230 to 5243)
AUTHORS Swimmer,C. and Shenk,T.
TITLE A viable simian virus 40 variant that carries a newly generated
sequence reiteration in place of the normal duplicated enhancer
element
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 6652-6656 (1984)
STANDARD full staff_review
REFERENCE 116(sites; Okazaki fragment sequences)
AUTHORS Hay,R.T., Hendrickson,E.A. and DePamphilis,M.L.
TITLE Sequence specificity for the initiation of RNA-primed simian virus
40 DNA synthesis in vivo
JOURNAL J. Mol. Biol. 175, 131-157 (1984)
STANDARD full staff_review
REFERENCE 117(sites; tsA3900 point mutant sequence)
AUTHORS Hutchinson,N.I., Chang,L.-S., Pater,M.M., Bouck,N., Shenk,T.E. and
Di Mayorca,G.
TITLE Characterization of a new simian virus 40 mutant, tsA3900, isolated
from deletion mutant tsA1499
JOURNAL J. Virol. 53, 814-821 (1985)
STANDARD full staff_review
REFERENCE 118(sites; large t-antigen binding specificity)
AUTHORS Scheller,A. and Prives,C.
TITLE Simian virus 40 and polyomavirus large tumor antigens have
different requirements for high-affinity sequence-specific DNA
binding
JOURNAL J. Virol. 54, 532-545 (1985)
STANDARD full staff_review
REFERENCE 119(bases 1499 to 2590; temperature-sensitive mutants)
AUTHORS Ng,S.-C., Behm,M. and Bina,M.
TITLE DNA sequence alterations responsible for the synthesis of
thermosensitive VP1 in temperature-sensitive BC mutants of simian
virus 40
JOURNAL J. Virol. 54, 646-649 (1985)
STANDARD full staff_review
REFERENCE 120(sites; recombination in poly(GT)-sequences)
AUTHORS Stringer,J.R.
TITLE Recombination between poly[d(GT)-d(CA)] sequences in simian virus
40-infected cultured cells
JOURNAL Mol. Cell. Biol. 5, 1247-1259 (1985)
STANDARD full staff_review
REFERENCE 121(sites; large T-antigen binding sites)
AUTHORS Barrera-Saldana,H., Takahashi,K., Vigneron,M., Wildeman,A.,
Davidson,I. and Chambon,P.
TITLE All six GC-motifs of the SV40 early upstream element contribute to
promoter activity in vivo and in vitro
JOURNAL EMBO J. 4, 3839-3849 (1985)
STANDARD full staff_review
REFERENCE 122(bases 4556 to 4657; 4907 to 4937)
AUTHORS Noble,J.C., Prives,C. and Manley,J.L.
TITLE In vitro splicing of simian virus 40 early pre mRNA
JOURNAL Nucleic Acids Res. 14, 1219-1235 (1986)
STANDARD full staff_review
REFERENCE 123(sites; SV40 early leader protein (SELP) boundaries)
AUTHORS Khalili,K., Brady,J. and Khoury,G.
TITLE Translational regulation of SV40 early mRNA defines a new viral
protein
JOURNAL Cell 48, 639-645 (1987)
STANDARD full staff_review
REFERENCE 124(bases 1494 to 1708)
AUTHORS Rogiers,R., van de Voorde,A., Soeda,E. and Fiers,W.
TITLE Nucleotide sequence of the Simian virus 40 Hind-K restriction
fragment
JOURNAL Eur. J. Biochem. 85, 205-224 (1978)
STANDARD full staff_entry
REFERENCE 125(bases 662 to 1051)
AUTHORS Ysebaert,M., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the Simian virus 40 HindII+III restriction
fragment D and the total amino acid sequence of the late proteins
VP2 and VP3
JOURNAL Eur. J. Biochem. 91, 431-439 (1978)
STANDARD full staff_entry
REFERENCE 126(bases 2060 to 2299)
AUTHORS van Heuverswyn,H., van de Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the Simian virus 40 HindII+III restriction
fragment J and the total amino acid sequence of the major
structural protein VP1
JOURNAL Eur. J. Biochem. 91, 415-430 (1978)
STANDARD full staff_entry
REFERENCE 127(bases 1 to 107; 5146 to 5243)
AUTHORS Hay,R.T. and DePamphilis,M.L.
TITLE Initiation of SV40 DNA replication in vivo: Location and structure
of 5' ends of DNA synthesized in the ori region
JOURNAL Cell 28, 767-779 (1982)
STANDARD full staff_entry
REFERENCE 128(bases 80 to 350)
AUTHORS Contreras,R., Gheysen,D., Knowland,J., van de Voorde,A. and
Fiers,W.
TITLE Evidence for the direct involvement of DNA replication origin in
synthesis of late SV40 RNA
JOURNAL Nature 300, 500-505 (1982)
STANDARD full staff_entry
REFERENCE 129(sites; early SV40 transcription)
AUTHORS Lebowitz,P. and Ghosh,P.
TITLE Initiation and regulation of Simian virus 40 early transcription in
vitro
JOURNAL J. Virol. 41, 449-461 (1982)
STANDARD full staff_entry
REFERENCE 130(sites; temperature sensitive mutants in the VP1 gene)
AUTHORS Behm,M., Lowman,H., Ng,S.-C. and Bina,M.
TITLE Analysis of temperature-sensitive mutations in the simian virus 40
gene encoding virion protein 1
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9421-9425 (1988)
STANDARD full staff_review
REFERENCE 131(sites; ends of "early" and "late" mRNA)
AUTHORS Subramanian,K.N., Ghosh,P.K., Dhar,R., Thimmappaya,B., Zain,S.B.,
Pan,J. and Weissman,S.M.
TITLE The primary structure of regions of SV40 DNA encoding the ends of
mRNA
JOURNAL Prog. Nucleic Acid Res. Mol. Biol. 19, 157-164 (1976)
STANDARD simple staff_review
REFERENCE 132(bases 1 to 164; 5092 to 5243)
AUTHORS Weissman,S.M., Subramanian,K.N., Dhar,R., Zain,B. and Pan,J.
TITLE SV40 DNA sequences as an example of the structure of genes
functioning in animal cell nuclei
JOURNAL Med. Pediatr. Oncol. 2, 259-263 (1976)
STANDARD full staff_entry
REFERENCE 133(bases 1787 to 2059)
AUTHORS Contreras,R., Van De Voorde,A. and Fiers,W.
TITLE Nucleotide sequence of the restriction fragment Hind-F-EcoRI-1 of
simian-virus-40 DNA (part of the VP1 gene)
JOURNAL Eur. J. Biochem. 86, 317-324 (1978)
STANDARD simple staff_review
COMMENT Kindly reviewed by K. N. Subramanian. Draft entry and computer-
readable sequence for [130] kindly submitted by M.Bina,
16-FEB-1989.
The E or minus strand of SV40 strain 776 is shown as it is reported
by [16] and [17] using the numbering system by [50], with exception
of of the addition of 17 bases (between nucleotides 164 and 165) to
their sequence [36]. Differences between the complete sequences as
published by [16] and [17] and this sequence are limited to the 17
bp change, the revision at 328 below, and the numbering. All
references tend to agree upon an independent coordinate system,
where map units run from 0.00 to 1.00 and the origin is defined by
the sole EcoRI site, which is found at base 1782 below. In keeping
with [50], map units for this entry have been calculated by
SV + 3461
mu = -----------
5243
If the result is > 1, subtract 1 for the value. The origin of this
sequence, then, is at 0.66. Identical units are used for Polyoma
virus.
DNA replication is bidirectional from an origin which has been
narrowed by mutational analysis to bases 5193 to 34 [31]. Given
that early mRNA transcription also arises from this region, the two
processes will be, to some extent, simultaneously regulated. In
addition to the 21-bp and 72-bp repeats concerned with early
transcription (see below), the origin includes a 17-bp A + T-rich
sequence; further demarcation of a "core" region and an "auxiliary"
region makes the former comprise a 17-bp palindrome (5193-5209), a
15-bp palindrome (5213-5227), a 27-bp palindrome (5230-13) and the
A + T-rich region (15-31), and the latter the three 21-bp repeats
[85]. Thus the "core" overlaps the T-antigen binding sites I and II
and the "auxiliary region" (which is said to enhance replication
efficiency) overlaps the T-antigen binding site III [85].
Transcription of early mRNAs (18s and 19s) is leftward off the E,
or minus, strand and transcription of late mRNAs (16s and 19s) is
rightward off the L, or plus, strand. In the former case, the
annotation will show the symbols "(c)" and "comp strand" for the
transcripts and products, and for the regulatory elements which
govern them. In some references, the E strand has been labeled
"Late" to indicate that it has the polarity of late mRNA, and the L
strand "Early" to indicate that it has the polarity of early mRNA.
(Early mRNAs can be produced throughout the lytic cycle.)
The characterization of the mRNAs as 18s and 19s early and 16s and
19s late is a simplification: heterogeneity with regard to 5' and
3' termini, splice sites, and abundance as a function of time in
the lytic cycle and regulatory events, have all been observed [24]
[35],[67],[71]. For the purposes of this entry, only the simplest
major mRNA species have been entered in the Sites.
The promoter for the early mRNAs is considered to include at least
bases 5185 to 250 on the E strand: the T antigen binding sites (for
which the minimal spans are given [62]), the "TATA" box at position
21, the 21-bp repeats, and the 72-bp repeats are all known to play
a role in early transcription. One of the effects of regulation at
these sites is to shift the cap site to one of at least seven
positions between bases 95 and 32 [71]. Bases 55 to 97 are also of
interest because they permit base-pairing homology with the small
nuclear RNA 7S-K [97] and because they bind the cell factor SP-1
[96].
For late promoter induction to occur at high efficiency two domains
are required simultaneously: the minimal replication origin and the
72 bp repeat [113]. The origin-proximal 22 bp portion of the 72-bp
repeat is sufficient for induction, while the origin-distal portion
is dispensable [113]. If T antigen is present, the 21 bp repeat is
dispensable for induction of late promoter activity in vivo [113].
The regulatory elements for early transcription have been annotated
for the "comp strand". The late region induction signals are to be
found in the same region as the early regulatory elements [113].
The efficacy of the 72-bp repeat in promoting transcription has
been demonstrated for both orientations [75].
The SAS-RNA is a small viral associated RNA whose function is
unknown. It appears to arise through processing from some of the
longer early transcripts; mutants in the SAS-RNA coding region
remain viable [51].
Seven proteins -- the small t antigen, the large T antigen, the
three structural proteins, the early leader protein [123], and the
agnoprotein (which is thought to be involved with structural
development [92])-- are known to be translated from the
multiplicity of mRNAs. The existence of a middle T antigen, known
to exist in Polyoma, has been suggested [40]. For a summary of
other possible coding sequences, see [50].
Missing data project [124-129], [131].
EMBL features not translated to GenBank features:
key from to description
PRM 21 15 (C) TATA-box [3]
MSG 120 2674 major 16S RNA
FEATURES from to/span description
pept 16 1 (c) early leader protein (SELP; [123])
5243 5188 (c) early leader protein (SELP; [123])
pept 335 523 agnoprotein (lp-1, VCP)
pept 562 1620 VP2 minor structural protein
pept 916 1620 VP3 minor structural protein
pept 1499 2593 VP1 major structural protein
pept 5163 4918 (c) large T antigen exon 1
4571 2691 (c) large T antigen exon 2
pept 5163 4639 (c) small t antigen
RNA 2907 2842 (c) SV40-associated small (SAS) RNA; 0.21 [51]
pre-msg 264 2676 minor late 19s mRNA [29],[62],[71]
pre-msg 325 2676 major late 19s mRNA [10],[19],[29],[30],[38]
pre-msg 2587 31 (c) early mRNA (alt.) [3],[34],[39],[62],[71]
pre-msg 2599 31 (c) early mRNA (alt.) [28],[62],[71]
pre-msg 5225 2587 (c) early mRNA (alt.) [3],[34],[39],[46],[61]
pre-msg 5230 2587 (c) early mRNA (alt.) [3],[34],[39],[68]
pre-msg 5236 2587 (c) early mRNA (alt.) [3],[6],[34],[39],[61]
pre-msg 5225 2599 (c) early mRNA (alt.) [28],[46],[61]
pre-msg 5230 2599 (c) early mRNA (alt.) [28],[68]
pre-msg 5236 2599 (c) early mRNA (alt.) [6],[28],[39],[61]
IVS 295 434 late 19s intron [24],[29]
IVS 527 1462 late 16s intron [19],[29],[32]
IVS 4636 4572 (c) early 19s mRNA (small t antigen) intron
[24],[29]
IVS 4917 4572 (c) large T antigen (18s) intron [24],[29]
rpt 60 40 (c) 21 bp non-tandem repeat I [90],[96],[97]
rpt 82 62 (c) 21 bp tandem repeat II [90],[96],[97]
rpt 103 83 (c) 21 bp tandem repeat III [90],[96],[97]
rpt 178 107 (c) 72 bp tandem repeat enhancer sequence A
[65],[68],[75],[90]
rpt 250 179 (c) 72 bp tandem repeat enhancer sequence B
[65],[68],[75],[90]
binding 5231 13 (c) T antigen binding site II [23],[62],[74]
binding 61 36 (c) T antigen binding site III 3' [23],[62],[74]
binding 5209 5184 (c) T antigen binding site I [23],[62],[74]
signal 27 21 (c) early mRNA promoter element; 0.66
[58],[68],[69]
orgrpl 5191 31 replication origin core region; 0.67 [85]
orgrpl 32 83 replication origin auxiliary region; 0.67 [85]
variant 164 182 gggactttccacacctggt in strain 776 [36]; gt in
strain 777; 0.69 [16],[17]
revision 328 331 gtgc in [36],[50]; gc in [16],[17]
signal 2613 2608 (c) early mRNA polyadenyation signal on the comp
strand; 0.16 [28],[110]
signal 2642 2637 (c) early mRNA polyadenyation signal on the comp
strand; 0.16 [28],[110]
signal 2657 2662 late mRNA polyadenyation signal; 0.17
[28],[63],[110]
mut 1626 1626 g in wild-type; a in temperature-sensitive
mutant tsC260
mut 1667 1667 c in wild-type; a in temperature-sensitive
mutant tsB228
mut 1680 1680 c in wild-type; g in temperature-sensitive
mutant tsB218
mut 1718 1718 g in wild-type; a in temperature-sensitive
mutant tsB[204,211,265]
mut 1719 1719 c in wild-type; t in temperature-sensitive
mutant tsB8
mut 1756 1756 a in wild-type; c in temperature-sensitive
mutant tsB8
mut 1995 1995 g in wild-type; c in temperature-sensitive
mutant tsB221
mut 2003 2003 g in wild-type; a in temperature-sensitive
mutant tsB201
mut 2084 2084 c in wild-type; t in temperature-sensitive
mutant tsBC223
mut 2091 2091 c in wild-type; c in temperature-sensitive
mutant tsB4
mut 2141 2141 c in wild-type; t in temperature-sensitive
mutant tsC219
mut 2237 2237 c in wild-type; a in temperature-sensitive
mutant tsC240
mut 2262 2262 c in wild-type; t in temperature-sensitive
mutant tsC260
mut 2354 2354 c in wild-type; t in temperature-sensitive
mutant tsBC[208,214,216,217,248,274]
mut 2367 2367 a in wild-type; c in temperature-sensitive
mutant tsBC11
BASE COUNT 1518 a 1100 c 1039 g 1586 t
ORIGIN BglI site on the E (minus) strand of the genome; mu 0.66 [50]
1 gcctcggcct ctgcataaat aaaaaaaatt agtcagccat ggggcggaga atgggcggaa
61 ctgggcggag ttaggggcgg gatgggcgga gttaggggcg ggactatggt tgctgactaa
121 ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt tccacacctg
181 gttgctgact aattgagatg catgctttgc atacttctgc ctgctgggga gcctggggac
241 tttccacacc ctaactgaca cacattccac agctggttct ttccgcctca gaaggtacct
301 aaccaagttc ctctttcaga ggttatttca ggccatggtg ctgcgccggc tgtcacgcca
361 ggcctccgtt aaggttcgta ggtcatggac tgaaagtaaa aaaacagctc aacgcctttt
421 tgtgtttgtt ttagagcttt tgctgcaatt ttgtgaaggg gaagatactg ttgacgggaa
481 acgcaaaaaa ccagaaaggt taactgaaaa accagaaagt taactggtaa gtttagtctt
541 tttgtctttt atttcaggtc catgggtgct gctttaacac tgttggggga cctaattgct
601 actgtgtctg aagctgctgc tgctactgga ttttcagtag ctgaaattgc tgctggagag
661 gccgctgctg caattgaagt gcaacttgca tctgttgcta ctgttgaagg cctaacaacc
721 tctgaggcaa ttgctgctat aggcctcact ccacaggcct atgctgtgat atctggggct
781 cctgctgcta tagctggatt tgcagcttta ctgcaaactg tgactggtgt gagcgctgtt
841 gctcaagtgg ggtatagatt ttttagtgac tgggatcaca aagtttctac tgttggttta
901 tatcaacaac caggaatggc tgtagatttg tataggccag atgattacta tgatatttta
961 tttcctggag tacaaacctt tgttcacagt gttcagtatc ttgaccccag acattggggt
1021 ccaacacttt ttaatgccat ttctcaagct ttttggcgtg taatacaaaa tgacattcct
1081 aggctcacct cacaggagct tgaaagaaga acccaaagat atttaaggga cagtttggca
1141 aggtttttag aggaaactac ttggacagta attaatgctc ctgttaattg gtataactct
1201 ttacaagatt actactctac tttgtctccc attaggccta caatggtgag acaagtagcc
1261 aacagggaag ggttgcaaat atcatttggg cacacctatg ataatattga tgaagcagac
1321 agtattcagc aagtaactga gaggtgggaa gctcaaagcc aaagtcctaa tgtgcagtca
1381 ggtgaattta ttgaaaaatt tgaggctcct ggtggtgcaa atcaaagaac tgctcctcag
1441 tggatgttgc ctttacttct aggcctgtac ggaagtgtta cttctgctct aaaagcttat
1501 gaagatggcc ccaacaaaaa gaaaaggaag ttgtccaggg gcagctccca aaaaaccaaa
1561 ggaaccagtg caagtgccaa agctcgtcat aaaaggagga atagaagttc taggagttaa
1621 aactggagta gacagcttca ctgaggtgga gtgcttttta aatcctcaaa tgggcaatcc
1681 tgatgaacat caaaaaggct taagtaaaag cttagcagct gaaaaacagt ttacagatga
1741 ctctccagac aaagaacaac tgccttgcta cagtgtggct agaattcctt tgcctaattt
1801 aaatgaggac ttaacctgtg gaaatatttt gatgtgggaa gctgttactg ttaaaactga
1861 ggttattggg gtaactgcta tgttaaactt gcattcaggg acacaaaaaa ctcatgaaaa
1921 tggtgctgga aaacccattc aagggtcaaa ttttcatttt tttgctgttg gtggggaacc
1981 tttggagctg cagggtgtgt tagcaaacta caggaccaaa tatcctgctc aaactgtaac
2041 cccaaaaaat gctacagttg acagtcagca gatgaacact gaccacaagg ctgttttgga
2101 taaggataat gcttatccag tggagtgctg ggttcctgat ccaagtaaaa atgaaaacac
2161 tagatatttt ggaacctaca caggtgggga aaatgtgcct cctgttttgc acattactaa
2221 cacagcaacc acagtgcttc ttgatgagca gggtgttggg cccttgtgca aagctgacag
2281 cttgtatgtt tctgctgttg acatttgtgg gctgtttacc aacacttctg gaacacagca
2341 gtggaaggga cttcccagat attttaaaat tacccttaga aagcggtctg tgaaaaaccc
2401 ctacccaatt tcctttttgt taagtgacct aattaacagg aggacacaga gggtggatgg
2461 gcagcctatg attggaatgt cctctcaagt agaggaggtt agggtttatg aggacacaga
2521 ggagcttcct ggggatccag acatgataag atacattgat gagtttggac aaaccacaac
2581 tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt gatgctattg ctttatttgt
2641 aaccattata agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca
2701 ggttcagggg gaggtgtggg aggtttttta aagcaagtaa aacctctaca aatgtggtat
2761 ggctgattat gatcatgaac agactgtgag gactgagggg cctgaaatga gccttgggac
2821 tgtgaatcaa tgcctgtttc atgccctgag tcttccatgt tcttctcccc accatcttca
2881 tttttatcag cattttcctg gctgtcttca tcatcatcat cactgtttct tagccaatct
2941 aaaactccaa ttcccatagc cacattaaac ttcatttttt gatacactga caaactaaac
3001 tctttgtcca atctctcttt ccactccaca attctgctct gaatactttg agcaaactca
3061 gccacaggtc tgtaccaaat taacataaga agcaaagcaa tgccactttg aattattctc
3121 ttttctaaca aaaactcact gcgttccagg caatgcttta aataatcttt gggcctaaaa
3181 tctatttgtt ttacaaatct ggcctgcagt gttttaggca cactgtactc attcatggtg
3241 actattccag ggggaaatat ttgagttctt ttatttaggt gtttcttttc taagtttacc
3301 ttaacactgc catccaaata atcccttaaa ttgtccaggt tattaattcc ctgacctgaa
3361 ggcaaatctc tggactcccc tccagtgccc tttacatcct caaaaactac taaaaactgg
3421 tcaatagcta ctcctagctc aaagttcagc ctgtccaagg gcaaattaac atttaaagct
3481 ttccccccac ataattcaag caaagcagct gctaatgtag ttttaccact atcaattggt
3541 cctttaaaca gccagtatct ttttttagga atgttgtaca ccatgcattt taaaaagtca
3601 tacaccactg aatccatttt gggcaacaaa cagtgtagcc aagcaactcc agccatccat
3661 tcttctatgt cagcagagcc tgtagaacca aacattatat ccatcctatc caaaagatca
3721 ttaaatctgt ttgttaacat ttgttctcta gttaattgta ggctatcaac ccgcttttta
3781 gctaaaacag tatcaacagc ctgttggcat atggtttttt ggtttttgct gtcagcaaat
3841 atagcagcat ttgcataatg cttttcatgg tacttatagt ggctgggctg ttctttttta
3901 atacatttta aacacatttc aaaactgtac tgaaattcca agtacatccc aagcaataac
3961 aacacatcat cacattttgt ttccattgca tactctgtta caagcttcca ggacacttgt
4021 ttagtttcct ctgcttcttc tggattaaaa tcatgctcct ttaacccacc tggcaaactt
4081 tcctcaataa cagaaaatgg atctctagtc aaggcactat acatcaaata ttccttatta
4141 acccctttac aaattaaaaa gctaaaggta cacaattttt gagcatagtt attaatagca
4201 gacactctat gcctgtgtgg agtaagaaaa aacagtatgt tatgattata actgttatgc
4261 ctacttataa aggttacaga atatttttcc ataattttct tgtatagcag tgcagctttt
4321 tcctttgtgg tgtaaatagc aaagcaagca agagttctat tactaaacac agcatgactc
4381 aaaaaactta gcaattctga aggaaagtcc ttggggtctt ctacctttct cttctttttt
4441 ggaggagtag aatgttgaga gtcagcagta gcctcatcat cactagatgg catttcttct
4501 gagcaaaaca ggttttcctc attaaaggca ttccaccact gctcccattc atcagttcca
4561 taggttggaa tctaaaatac acaaacaatt agaatcagta gtttaacaca ttatacactt
4621 aaaaatttta tatttacctt agagctttaa atctctgtag gtagtttgtc caattatgtc
4681 acaccacaga agtaaggttc cttcacaaag atcaagtcca aaccacattc taaagcaatc
4741 gaagcagtag caatcaaccc acacaagtgg atctttcctg tataattttc tattttcatg
4801 cttcatcctc agtaagcaca gcaagcatat gcagttagca gacattttct ttgcacactc
4861 aggccattgt ttgcagtaca ttgcatcaac accaggattt aaggaagaag caaatacctc
4921 agttgcatcc cagaagcctc caaagtcagg ttgatgagca tattttactc catcttccat
4981 tttcttgtac agagtattca ttttcttcat tttttcttca tctcctcctt tatcaggatg
5041 aaactccttg cattttttta aatatgcctt tctcatcaga ggaatattcc cccaggcact
5101 cctttcaaga cctagaaggt ccattagctg caaagattcc tctctgttta aaactttatc
5161 catctttgca aagctttttg caaaagccta ggcctccaaa aaagcctcct cactacttct
5221 ggaatagctc agaggccgag gcg
//
LOCUS BOVPROB1 257 bp ss-mRNA MAM 03-AUG-1990
DEFINITION Cattle prolactin mRNA, 5' end.
ACCESSION M36873
KEYWORDS prolactin.
SEGMENT 1 of 2
SOURCE Cattle hypophysis, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 257)
AUTHORS Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and
Baev,A.A.
TITLE Genetic engineering of peptide hormones: II. Possible polymorphism
of preprolactin in cattle. Data of molecular cloning
JOURNAL Mol. Biol. 22, 117-121 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 51 > 257 prolactin precursor
sigp 51 143 prolactin signal peptide
matp 144 > 257 prolactin
BASE COUNT 51 a 76 c 66 g 64 t
ORIGIN
1 ataggacgag agcttcctgg tgaagtgtgt ttcttgaaat catcaccacc atggacagca
61 aaggttcgtc gcagaaagca gggtcccgcc tgctcctgct gctggtggtg tcaaatctac
121 tcttgtgcca gggtgtggtc tccacccccg tctgtcccaa tgggcctggc aactgccagg
181 tatcccttcg agacctgttt gaccgggcag tcatggtgtc ccactacatc catgacctct
241 cctcggaaat gttcaac
//
LOCUS BOVPROB2 551 bp ss-mRNA MAM 03-AUG-1990
DEFINITION Cattle prolactin mRNA, 3' end.
ACCESSION M36874
KEYWORDS prolactin.
SEGMENT 2 of 2
SOURCE Cattle hypophysis, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 551)
AUTHORS Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and
Baev,A.A.
TITLE Genetic engineering of peptide hormones: II. Possible polymorphism
of preprolactin in cattle. Data of molecular cloning
JOURNAL Mol. Biol. 22, 117-121 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 405 prolactin
BASE COUNT 153 a 135 c 119 g 144 t
ORIGIN
1 ccggaagata aagaacaagc ccaacagacc catcatgaag tccttatgag cttgattctt
61 gggttgctgc gctcctggaa tgaccctctc tatcacctag tcaccgaggt gcggggtatg
121 aaaggagccc cagatgctat cctatcgagg gccatagaga ttgaggaaga aaacaaacga
181 cttctggaag gcatggagat gatatttggc caggttattc ctggagccaa agagactgag
241 ccctaccctg tgtggtcagg actcccgtcc ctgcaaacta aggatgaaga tgcacgttat
301 tctgcttttt ataacctgct ccactgcctg cgcagggatt caagcaagat tgacacttac
361 cttaagctcc tgaattgcag aatcatctac aacaacaact gctaagccca cattccatcc
421 tatccatttc tgagatggtt cttaatgatc cattccctgg caaacttctc tgagctttat
481 agctttgtaa tgcatgcttg gctctaatgg gtttcatctt aaataaaaac agactctgta
541 gcgatgtcaa a
//
LOCUS CHKBABA1 4051 bp ds-DNA VRT 03-AUG-1990
DEFINITION G.gallus B-creatine kinase (B-CK) gene, exons 1-4.
ACCESSION M33711 J05496
KEYWORDS B-creatine kinase.
SEGMENT 1 of 4
SOURCE G.gallus (strain White Leghorn) adult blood cell DNA, clones
T[56,62].
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 4051)
AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C.
TITLE A unique chicken B-creatine kinase gene gives rise to two
B-creatine kinase isoproteins with distinct N-termini by
alternative splicing
JOURNAL J. Biol. Chem. 265, 11656-11666 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Wirz, 13-APR-1990.
FEATURES from to/span description
pept 1201 1393 B-creatine kinase, exon 2b (EC 2.7.3.2) (first
expressed exon)
2449 2603 B-creatine kinase, exon 3
3464 + 3596 B-creatine kinase, exon 4
pep$ 1800 1977 B-creatine kinase, exon 2a
2449 2603 B-creatine kinase, exon 3
3464 + 3596 B-creatine kinase, exon 4
pre-msg 411 > 4051 B-creatine kinase mRNA and introns
IVS 1394 1789 B-creatine kinase intron A
IVS 1978 2498 B-creatine kinase intron B (no splice
consensus)
IVS 2604 3463 B-creatine kinase intron C
IVS 3597 > 4051 B-creatine kinase intron D
signal 384 390 TATA box
signal 251 256 GC box
signal 310 315 GC box
signal 328 332 CAT box
signal 346 351 TATA box
signal 358 362 GC box
signal 384 389 TATA box
BASE COUNT 943 a 985 c 1128 g 995 t
ORIGIN
1 tttgcacacg accagaagag accttcagcg cgttacacaa taaacctatg ttcaaatctg
61 aggtaaaagg agccagctag caaggatgga agcaaaaagc cgcttgtgcc cgccttggtg
121 cctaacggct gccgtattcg ggctgccggt ggccgctggg acggatgtac gtgcacgtct
181 gcgttcacga ggtcacgacg gctaccgctc caggctcgag gcggacgggc cacgaaccca
241 aagccaccgg ccgcccaacc gccgcgcaaa atggcggcac gccttcactg gccaggagga
301 gccgctgctc cgcccctcac ggctcgacca atggaatgaa tgggctataa ataggagcca
361 atgaggcggc gggtcgcgct ctatataaaa ggggcgcggc ggctgactcg gggctccact
421 ccggcttggg cagcgggtgg gtgcggagcg gccgctgagg tccttacgtt ggctgtggag
481 ccctcggtaa gtcccggggc agccgagccg cctgcagctt aaagaaaaga agaaggaaac
541 gatttaactt ttataagaaa aatgtatttt tgatggtgtg ggttggagcc ggtccggccg
601 ggaaagcgtc ggggggagcg ggatgcggcg gggcggcgcg gggcaggtgc cgggtgcgta
661 ccgggtgcgg ccgcagcttg agccggagcc ccgctctgaa agcagctcac cccggtattt
721 agcagggtag ttacatacgg tcgccccagc cgcgtccccc gagggagccg tccgttgtgt
781 aacggctcgc tgagcactgc ccactcgccc cggtccccgg ctcgcgctgc cggtaccggc
841 accggccgcg gcgctcccca acggccgctc cgcggggctc cgccggcggc tgctccgccg
901 cctccccgcg ccctggccga gccccggagc gtccggaccg ggatgctgaa ggatggggcg
961 aggtttgcgg agctggagaa agtttccttt tttttttttc tcggtttttt tttctctttt
1021 tttttttttt tttgggccct ccctccttgc ctggccttga ggaaggtgtt gtggcaacga
1081 aggtgcggaa ggcacgggtg gaagcgaagc caaggtgacc ttgcttggcg ttggcgcctc
1141 ccccgggccc ctggtgatag gcttccctct cttccctcag tgactgctgt agggacagcc
1201 atgcccttct caaacagcca caacctcctg aagatgaagt actccgttga cgatgagtac
1261 cccgacctga gcgtgcacaa caatcacatg gccaaggtgc tgaccctgga cctgtacaag
1321 aagttgaggg acaggcagac ttccagtgga tttacgctgg atgatgtcat tcagaccggg
1381 gttgacaacc caggtaacgg cggcaccgtt aggactgcag ggctctctgc tctccctcct
1441 tcctggaagt gctccttgac agcaaagcaa accccgcacg ctccgaggga cggctgtgct
1501 cctccgtgca atctgacaca gcaattccca ggcgctccca ttatgtaact gctctgctaa
1561 atacttcaaa tcccaaccaa aaaaaaaaaa aaaaagcaaa cgctcaaatg tgttctcatt
1621 tttatgcaac cggggggaaa aaaattctct gagctgtgtg ggcttacccg gctgttctga
1681 gctcgtctgt tccaaagtgg ttatgtgatg ttctgtagca cttctatgaa agcaaggggg
1741 agcggatgcc gtatacgtgt taacattaca gtatttattt ttactctagc aatatcaaaa
1801 tggcccaact aaataatcag aggctgcctc ccgaggagga gtacccggac ctgagcaccc
1861 acaacaacca catggccaaa gtgctaaccc tggatttgta caagaaactg agagacagag
1921 tcacgcccag tggcttcacc ctggatgatg tcattcagac tggggttgat aatcccggta
1981 aaatgcgctg agatattctg tgggagccag catgaagtac ccggtgcttt tcagtatggt
2041 gacttaagct aagctcgtgg aggtggtgtt tatgagcaga ctatgaaaat gcggttctgt
2101 ctgtagcaga gcgaataaag cgatgcatgc ttgctcacag caaagcatgc taaagtgtag
2161 ttctatctac gggtttgcat tagctagtaa aattggatgt gctgtgcgaa attgccctaa
2221 accaaagctt gacaagagtg tggcttaacg gcaactgctg tggtttccca tagcactgca
2281 tttgactaac tggatgaaat gtaaaatgta gggtctccca actgctctta ggccagtgca
2341 tttctaagtg agtagatctg ctgtgaagaa acttgcctgg ttagtgctgt gctggagtac
2401 agctgagcat cacccaactt agctcttctg cctttgttac cactccaggc caccccttca
2461 taatgacagt aggatgcgta gctggcgacg aagaatccta tgaagtgttt aaggaactct
2521 ttgatccggt tattgaggac aggcacggtg gctacaaacc aactgatgag cacaagaccg
2581 acctgaatgc tgataacctg caggtatacg tttttgttga gctactgata cccactgatc
2641 tgaggcctgg ggaaagctgt gagcggctgc ctaaacggat tgtttgggct ctgtcctctc
2701 tctctcatct ctcgctgccc gctgcagtct tacttaagag cctgcagcag taggttgtca
2761 gtaactgggt agaaccaggc agctgacaag caggacttgc ttcatgtgct gctgcaatgt
2821 aagtcatacc aaacacgttg agcacgtatt tctttgggct gtgctagttg tgtgaaacaa
2881 cccagtccca gggctgttag aaatcttaac tcttcctact gtatgatgtt gtaggctaaa
2941 aatcagccag tggtctcgtg ttatagaata gccattgtta cggctggaag tagaggtttc
3001 tgcccatctt acagacttca aatgttattg cagtctctgg gaattccaac tgctatcctg
3061 gttggcctgc cataggacat ctcaaaaatg agaacgcttt gacaataatg agtaactctc
3121 atttgcagga gtaaacacct acctaagcaa ggatttgtct cgtgttgata cctctagatt
3181 gcagagtgaa aggagtaaca gaagcctggg atttgggtct gagatgggtg taaaactgat
3241 ggtatcccta tcttctagca catagttggt agtcgtagtc ttagcctcaa aacaaggagt
3301 ttgatactct catacttgtt ggaaagctgg aaaaaatggt cctgaagtac atcgtattac
3361 aggacaacta gtcagtctca aatatgaatg cacgtgacct gtgtgtattt agggtaagca
3421 tcttagcaac aaggcagtac tgactaattt tctgatcttg tagggtggtg acgacttgga
3481 tcctaattac gtgctgagct ctcgtgtgag aactggcagg agcatccgtg gattctgcct
3541 tccccctcat tgcagcaggg gagagagacg ggctattgaa aagctctctg ttgaaggtaa
3601 agtgttgaat caggcaatga ggggaaactg ctgacctcgt ctaggcaagg aggtgctctg
3661 tgcagttgga gaatggaggt ggttggcaga agctgccttc ctcaaggctt aacttgagtg
3721 tggaaccttc cctgaaaaag gcagcttgga ttatggttgc gtcacgttgc agtcggtact
3781 cagagcagga acacatgtca gggaagttgg actgtattaa gttgaccttt gctgatcttg
3841 gaagcttctt tctgtgtgtg gggaaatagg ttggttcatc acggctcaaa cctgagatgt
3901 aataggtatg gagcaaccct agggcttgag tcatggttgc agtattttgt ggcttctttt
3961 aattgacttt tattaatgtt cagcttgctt agaggtgtag tccagctctg agagagctct
4021 agttctcagt ggaactcctc agctgcagtt c
//
LOCUS CHKBABA2 444 bp ds-DNA VRT 03-AUG-1990
DEFINITION G.gallus B-creatine kinase (B-CK) gene, exon 5.
ACCESSION M33712 J05496
KEYWORDS B-creatine kinase.
SEGMENT 2 of 4
SOURCE G.gallus (strain White Leghorn) adult blood cell DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 444)
AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C.
TITLE A unique chicken B-creatine kinase gene gives rise to two
B-creatine kinase isoproteins with distinct N-termini by
alternative splicing
JOURNAL J. Biol. Chem. 265, 11656-11666 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Wirz, 13-APR-1990.
FEATURES from to/span description
pept + 45 + 216 B-creatine kinase (B-CK), exon 5
pep$ + 45 + 216 B-creatine kinase (B-CK), exon 5
pre-msg 1 > 444 B-creatine kinase mRNA and introns
IVS < 1 44 B-creatine kinase intron E
IVS 217 > 444 B-creatine kinase intron F
BASE COUNT 108 a 95 c 120 g 121 t
ORIGIN
1 tgtttatagt ctctgctccc atcctaaaag cacctctctt tcagctctgg gcagtctggg
61 aggtgatctc aaggggaagt actatgccct aaggaacatg actgatgcag agcagcagca
121 gctgattgat gatcacttct tgtttgacaa acctgtttct cctcttctgt tggcatctgg
181 gatggcacga gattggcctg atgccagggg tatctggtga gtatatgccc agcagggtga
241 gttgtgtgaa atccctgcaa caaaaagcct tgaaattttg gttgtggctg atataattgg
301 acaaaacagc acgttgtgaa cttacgctgc tgaatctaat gaggaacttc tgcgtggtat
361 cccctgggaa gcgatatctg aaagcagaac ttgccccaag tgctagtgtg cactgcattg
421 ggctggtagg aagaagccag cttg
//
LOCUS CHKBABA3 1586 bp ds-DNA VRT 03-AUG-1990
DEFINITION G.gallus B-creatine kinase (B-CK) gene, exons 6 and 7.
ACCESSION M33713 J05496
KEYWORDS B-creatine kinase.
SEGMENT 3 of 4
SOURCE G.gallus (strain White Leghorn) adult blood cell DNA, clone T8.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 1586)
AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C.
TITLE A unique chicken B-creatine kinase gene gives rise to two
B-creatine kinase isoproteins with distinct N-termini by
alternative splicing
JOURNAL J. Biol. Chem. 265, 11656-11666 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Wirz, 13-APR-1990.
FEATURES from to/span description
pept + 308 431 B-creatine kinase, exon 6
1170 + 1359 B-creatine kinase, exon 7
pep$ + 308 431 B-creatine kinase, exon 6
1170 + 1359 B-creatine kinase, exon 7
pre-msg 1 > 1586 B-creatine kinase mRNA and introns
IVS < 1 307 B-creatine kinase intron G
IVS 432 1171 B-creatine kinase intron H (no splice
consensus)
IVS 1360 > 1586 B-creatine kinase intron I
BASE COUNT 435 a 336 c 385 g 430 t
ORIGIN
1 tggactgaac ataaatgagg tgagcaatga ctgaactctt aagttttgat gctgttaaca
61 ctggctaatg tcctgcaatt actttctgac atggatagat acaggccact tccaaatacc
121 tcttgactag agtagggtaa aagtacaagc ttaattttgt tagtgcttag atggagaaca
181 gggagataaa tttctttgga gaagcaatta aacgagcctc ttggaagcta cagcagtgag
241 taacttggtg ctactgaatc tgtcttgaaa acgtcaagtg agacaacagg tctgtcttgt
301 ctcctaggca caatgacaac aagaccttcc ttgtttggat caatgaggag gatcacctca
361 gggttatttc catgcagaaa ggtggcaaca tgaaggaagt gtttactcgc ttctgtacag
421 ggctaacaca ggtaaatcaa atactgcttg agctctgggt ctaaatatta atctggttgg
481 agctgaccat acaggatctt ccaggctggg agcatgtaca caggaggtga cagccttctc
541 ctttgttctg cagggtgctt ctttgtagtt tgggtttctg ggaacatcct gatgagatcc
601 tctgctttga gggctctgaa atgcctgtac tggcaaagct ttatggctgt gaaatctgga
661 ataaaacctc taatgtgaca cctttaccaa actgtactta aaggatctct gtggttctat
721 aaaatgtatg tagagaccct gggtaggagg aaaaaattgc ttaaatgtga atgggtgaca
781 ttggacagaa gtttgtaggc tgcatgcttt tctcagcttc tggtcttgat atcagcttgt
841 gctttaccag ccccttccct aatcctaaca gatgcaccag tttgacatgt actcaatcaa
901 gcttcccagt agatagcaac ttcagacttt acagtgtagc ttgaacttag gctgtctcgc
961 tgctgggagc aagagtaagg ccttcaaaag cccctcattg gtgaccctct tctacaaatg
1021 gagggggccc tatgtaactg acttaaataa tctgctcaaa ctgttgtggc tgttgctatg
1081 caactgaagc tgtaaatgag cacttagtgg atgacctaca tcgttcaggc agagaacaat
1141 ctctaaacat ccattctctc tatttataga tagaaactct cttcaagtcc aaaaactacg
1201 agttcatgtg gaacccacac ttgggctaca tcctgacctg cccatccaac cttggaacgg
1261 ggctccgtgc tggtgtgcac atcaagctac caaaccttgg gaagcatgag aagtttggag
1321 aagtcctcaa gaggcttcgg ctgcagaaac gaggcacagg tgagaggtgg tgatacagtg
1381 atgccccaag caggctgcct ctgaccagac agcagtgaga gcctggctgc agcgaagcta
1441 atcttggttc atgctgacag tgcctgacag accatccaga agggaccctt ctgaagcgtg
1501 gggggggaca accaagcatt aaaagtgtag cttagaagag gtggagatgc tggtactttg
1561 cttaagcctt attatatcca gcttct
//
LOCUS CHKBABA4 650 bp ds-DNA VRT 03-AUG-1990
DEFINITION G.gallus B-creatine kinase (B-CK) gene, exon 8.
ACCESSION M33714 J05496
KEYWORDS B-creatine kinase.
SEGMENT 4 of 4
SOURCE G.gallus (strain White Leghorn) adult blood cell DNA, clone T8.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 650)
AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C.
TITLE A unique chicken B-creatine kinase gene gives rise to two
B-creatine kinase isoproteins with distinct N-termini by
alternative splicing
JOURNAL J. Biol. Chem. 265, 11656-11666 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Wirz, 13-APR-1990.
FEATURES from to/span description
pept + 49 227 B-creatine kinase, exon 8
pep$ + 49 227 B-creatine kinase, exon 8
pre-msg 1 > 396 B-creatine kinase mRNA and introns
IVS < 1 48 B-creatine kinase intron J
signal 391 396 poly-A signal
BASE COUNT 161 a 129 c 161 g 199 t
ORIGIN
1 aactggctca ggcagcggtc agcgtactaa ttaaatctct tcgtccaggt ggtgtggaca
61 cagctgcagt cggaggggtg tttgatgtat ccaatgctga tcgtcttggc ttctctgagg
121 tagagctggt gcagatggtg gtagatggtg tgaagctgct cattgaaatg gaaaaacgcc
181 ttgagaaagg ccagtccatt gatgacctca tgccagctca gaaataaagc actttattct
241 catgcttcct aacttattgg atgaataata aaatgtcact ccaatttcaa accccttggg
301 tcagagccca cttagttaca ctgtagagaa gtcttccatc catctgtgtt agagtttatt
361 ttttgatggc tgaaatgttg ttgaaaatga aataaactgt tgttttggcc tgacctgcct
421 tagatgtgtt aactgagtgt ctttgaggtc taagtgactt gctgttcctg gtaagctgaa
481 ctgattggtt ttaggtgcta gcacatcatc ctggctgtgc tgctgtagtc tgcttggtgg
541 gcttagaaat gtgcttatag cagtgtcaaa agcaagcctc taatgtgatt gcagctgccg
601 taaacctgag aactggtgct aaacatgagt cactttaaac ctgccctacc
//
LOCUS HUMSATIII1 132 bp ds-DNA PRI 03-AUG-1990
DEFINITION Human satellite III DNA fragment, 5' end.
ACCESSION M25431
KEYWORDS satellite repetitive sequence.
SEGMENT 1 of 3
SOURCE Human DNA, clone lambda-HS3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 132)
AUTHORS Cooke,H.J. and Hindley,J.
TITLE Cloning of human satellite III DNA: Different components are on
different chromosomes
JOURNAL Nucleic Acids Res. 6, 3177-3197 (1979)
STANDARD simple staff_review
BASE COUNT 39 a 27 c 14 g 52 t
ORIGIN
1 aattcatttg aagacaattc cattcaatac caattgatga tggttatttt tgattccatt
61 tgatgatgat tacattccat ttcatcataa ttccattcga ttccactcga gattccattc
121 gattccattc aa
//
LOCUS HUMSATIII2 75 bp ds-DNA PRI 03-AUG-1990
DEFINITION Human satellite III DNA fragment.
ACCESSION M25432
KEYWORDS satellite repetitive sequence.
SEGMENT 2 of 2
SOURCE Human satellite DNA, clone lambda-HS3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Cooke,H.J. and Hindley,J.
TITLE Cloning of human satellite III DNA: Different components are on
different chromosomes
JOURNAL Nucleic Acids Res. 6, 3177-3197 (1979)
STANDARD simple staff_review
BASE COUNT 19 a 15 c 11 g 30 t
ORIGIN
1 cgaatgaatg agtccatcca tttcaatttc atgataattc cattcgtttc aattcgatgg
61 tgtttccatt cgatt
//
LOCUS HUMSATIII3 93 bp ds-DNA PRI 03-AUG-1990
DEFINITION Human satellite III DNA fragment, 3' end.
ACCESSION M25433
KEYWORDS satellite repetitive sequence.
SEGMENT 3 of 3
SOURCE Human satellite DNA, clone lambda-HS3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 93)
AUTHORS Cooke,H.J. and Hindley,J.
TITLE Cloning of human satellite III DNA: Different components are on
different chromosomes
JOURNAL Nucleic Acids Res. 6, 3177-3197 (1979)
STANDARD simple staff_review
BASE COUNT 21 a 20 c 16 g 36 t
ORIGIN
1 ttcattcgat tcatttgatg atgattcatg cgcgattcat tagatgatga cccctttcat
61 ttccattcaa tggaggattc cattcggttc cat
//
LOCUS MS2AAR 218 bp ss-RNA PHG 03-AUG-1990
DEFINITION Bacteriophage MS2 A protein mRNA, 3' end, and coat protein mRNA,
complete cds.
ACCESSION M25187
KEYWORDS A protein; coat protein.
SOURCE Bacteriophage MS2 RNA.
ORGANISM Bacteriophage MS2
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Leviviridae.
REFERENCE 1 (bases 1 to 218)
AUTHORS Contreras,R., Ysebaert,M., Min Jou,W. and Fiers,W.
TITLE Bacteriophage MS2 RNA: Nucleotide sequence of the end of the A
protein gene and the intercistronic region
JOURNAL Nature New Biol. 241, 99-101 (1973)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 137 A protein (AA at 3)
pept 161 > 218 coat protein
BASE COUNT 54 a 58 c 52 g 54 t
ORIGIN
1 ctaaggccca aatctcagcc atgcatcggg gagtacaatc cgtatggcca acaactggcg
61 cgtacgtaaa gtctcctttc tcgatggtcc ataccttaga tgcgttagca ttaatcaggc
121 aacggctctc tagatagagc cctcaaccgg agtttgaagc atggcttcta actttactca
181 gttcgttctc gtcgacaatg gcggaactgg cgacgtga
//
LOCUS MUSCMDH1 567 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 1.
ACCESSION M36076
KEYWORDS cytosolic malate dehydrogenase.
SEGMENT 1 of 9
SOURCE Mouse (strain C3H/He) male liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 567)
AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
TITLE Structural organization of the mouse cytosolic malate dehydrogenase
gene: Comparison with that of the mouse mitochondrial malate
dehydrogenase gene
JOURNAL J. Mol. Biol. 202, 355-364 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 540 + 542 cytosolic malate dehydrogenase, exon 1
(cMDHase) (EC 1.1.1.37)
pre-msg 458 > 567 cytosolic malate dehydrogenase mRNA and
introns (alt.)
pre-msg 459 > 567 cytosolic malate dehydrogenase mRNA and
introns (alt.)
IVS 543 > 567 cytosolic malate dehydrogenase intron A
BASE COUNT 109 a 156 c 169 g 133 t
ORIGIN
1 gttcttccgc aagcgtcaat tcctcccgcc tctgagagag tttttaaaag gtttgtttcc
61 gggtcgagcg ggctttaagc aacggaaggt ctcttagccg cccttgttta gtcttgggga
121 ggatagattc tcgtggagcg acgtgtgtgt cgctcagggg tcggtttctc ctccctcgag
181 ttaacgcctc ctctcctgcc aattgctgag cgccatcagg caggcgcctc actcaaagca
241 ccaaccctct gctcacagac gcgctccaat caccgaggct cagcccggga ctactttgca
301 gcgaggcgcg taggaagaag gggtttgggg gaattgtagt ttagcactgt caggaatgca
361 cgaaggtggg cgccagaggt cgcggaagaa ctacacttcc cagaaagggc cgtgtctcca
421 gccgaagcct ttgcgggcca gccccggttc tctcccagag tctgttccgc tgtagaggtg
481 acctgactgc tggagactgc cttttgcagg tgcagagatc ggccttgcag tttgcaataa
541 tggtgaggtg ggctctggaa ctcacac
//
LOCUS MUSCMDH2 149 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 2.
ACCESSION M36077
KEYWORDS cytosolic malate dehydrogenase.
SEGMENT 2 of 9
SOURCE Mouse (strain C3H/He) male liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 149)
AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
TITLE Structural organization of the mouse cytosolic malate dehydrogenase
gene: Comparison with that of the mouse mitochondrial malate
dehydrogenase gene
JOURNAL J. Mol. Biol. 202, 355-364 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 27 + 125 cytosolic malate dehydrogenase, exon 2
(cMDHase)
pre-msg < 1 > 149 cytosolic malate dehydrogenase mRNA and
introns
IVS < 1 26 cytosolic malate dehydrogenase intron A
IVS 126 > 149 cytosolic malate dehydrogenase intron B
BASE COUNT 41 a 25 c 36 g 47 t
ORIGIN About 4.8 kb after segment 1.
1 taaactagtg gtctttgtca ttacagtctg aaccaatcag agtccttgtg actggagcag
61 ctggtcaaat tgcatattca ctgttgtaca gtattggaaa tggatctgtc tttgggaaag
121 accaggtagg ggcatgttct tataaatac
//
LOCUS MUSCMDH3 147 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 3.
ACCESSION M36078
KEYWORDS cytosolic malate dehydrogenase.
SEGMENT 3 of 9
SOURCE Mouse (strain C3H/He) male liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 147)
AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
TITLE Structural organization of the mouse cytosolic malate dehydrogenase
gene: Comparison with that of the mouse mitochondrial malate
dehydrogenase gene
JOURNAL J. Mol. Biol. 202, 355-364 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 26 + 122 cytosolic malate dehydrogenase exon 3
(cMDHase)
pre-msg < 1 > 147 cytosolic malate dehydrogenase mRNA and
introns
IVS < 1 25 cytosolic malate dehydrogenase intron B
IVS 123 > 147 cytosolic malate dehydrogenase intron C
BASE COUNT 27 a 39 c 40 g 41 t
ORIGIN About 0.8 kb after segment 2.
1 gcctgctgtc cttgctcttt ggcagcccat cattcttgtg ctgttggaca tcacccccat
61 gatgggtgtt ctggacggtg tcctgatgga actgcaagac tgtgcccttc cccttctgca
121 gggtgagttg gaagtcaaag aaaacag
//
LOCUS MUSCMDH4 226 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 4.
ACCESSION M36079
KEYWORDS cytosolic malate dehydrogenase.
SEGMENT 4 of 9
SOURCE Mouse (strain C3H/He) male liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 226)
AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
TITLE Structural organization of the mouse cytosolic malate dehydrogenase
gene: Comparison with that of the mouse mitochondrial malate
dehydrogenase gene
JOURNAL J. Mol. Biol. 202, 355-364 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 26 + 201 cytosolic malate dehydrogenase, exon 4
pre-msg < 1 > 226 cytosolic malate dehydrogenase mRNA and
introns
IVS < 1 25 cytosolic malate dehydrogenase intron C
IVS 202 > 226 cytosolic malate dehydrogenase intron D
BASE COUNT 69 a 45 c 62 g 50 t
ORIGIN About 1.6 kb after segment 3.
1 tgtgtgttgt ttgccatgtc catagatgtc attgcaacgg acaaagaaga gattgccttc
61 aaagacctgg atgtggctgt cctagtgggc tccatgccaa gaagggaagg catggagagg
121 aaggacctac tgaaagccaa tgtgaaaatc ttcaaatccc agggcacagc cttggagaaa
181 tacgccaaga aatcagttaa ggtgactcac acagatttca tggggt
//
LOCUS MUSCMDH5 173 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 5.
ACCESSION M36080
KEYWORDS cytosolic malate dehydrogenase.
SEGMENT 5 of 9
SOURCE Mouse (strain C3H/He) male liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 173)
AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
TITLE Structural organization of the mouse cytosolic malate dehydrogenase
gene: Comparison with that of the mouse mitochondrial malate
dehydrogenase gene
JOURNAL J. Mol. Biol. 202, 355-364 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 26 + 148 cytosolic malate dehydrogenase, exon 5
pre-msg < 1 > 173 cytosolic malate dehydrogenase mRNA and
introns
IVS < 1 25 cytosolic malate dehydrogenase intron D
IVS 149 > 173 cytosolic malate dehydrogenase intron E
BASE COUNT 48 a 49 c 35 g 41 t
ORIGIN About 0.9 kb after segment 4.
1 tctgctctgt gcctccacca tctaggtcat tgttgtggga aacccagcca atacgaactg
61 cctgacagcc tccaagtcag cgccatcgat ccccaaggag aatttcagtt gcctgactcg
121 cttggaccac aaccgagcaa aatctcaagt aagaaaaatg gcttttagat tta
//
LOCUS MUSCMDH6 227 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 6.
ACCESSION M36081
KEYWORDS cytosolic malate dehydrogenase.
SEGMENT 6 of 9
SOURCE Mouse (strain C3H/He) male liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 227)
AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
TITLE Structural organization of the mouse cytosolic malate dehydrogenase
gene: Comparison with that of the mouse mitochondrial malate
dehydrogenase gene
JOURNAL J. Mol. Biol. 202, 355-364 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 26 + 202 cytosolic malate dehydrogenase, exon 6
pre-msg < 1 > 227 cytosolic malate dehydrogenase mRNA and
introns
IVS < 1 25 cytosolic malate dehydrogenase intron E
IVS 203 > 227 cytosolic malate dehydrogenase intron F
BASE COUNT 71 a 40 c 60 g 56 t
ORIGIN About 3.4 kb after segment 5.
1 tgatatgatg ttttacatga actagattgc tcttaaactc ggtgtaaccg ctgatgatgt
61 aaagaatgtc attatctggg gaaatcattc atcgacccag tatccagatg tcaatcatgc
121 caaggtgaaa ctgcaaggaa aggaagtcgg tgtgtatgaa gccctgaaag acgacagctg
181 gctgaaggga gagttcatca cggtaagaag gatgtgaacc ctctgag
//
LOCUS MUSCMDH7 164 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 7.
ACCESSION M36082
KEYWORDS cytosolic malate dehydrogenase.
SEGMENT 7 of 9
SOURCE Mouse (strain C3H/He) male liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 164)
AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
TITLE Structural organization of the mouse cytosolic malate dehydrogenase
gene: Comparison with that of the mouse mitochondrial malate
dehydrogenase gene
JOURNAL J. Mol. Biol. 202, 355-364 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 26 + 139 cytosolic malate dehydrogenase, exon 7
pre-msg < 1 > 164 cytosolic malate dehydrogenase mRNA and
introns
IVS < 1 25 cytosolic malate dehydrogenase intron F
IVS 140 > 164 cytosolic malate dehydrogenase intron G
BASE COUNT 34 a 46 c 44 g 40 t
ORIGIN About 0.7 kb after segment 6.
1 actgtctctc tgttgtccca cccagactgt gcaacagcgt ggtgctgctg tcatcaaggc
61 tcggaagctg tccagtgcaa tgtctgctgc gaaagccatc gcagaccaca tcagagacat
121 ctggtttgga accccagagg tgagggttct catttgtact ggcc
//
LOCUS MUSCMDH8 131 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 8.
ACCESSION M36083
KEYWORDS cytosolic malate dehydrogenase.
SEGMENT 8 of 9
SOURCE Mouse (strain C3H/He) male liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 131)
AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
TITLE Structural organization of the mouse cytosolic malate dehydrogenase
gene: Comparison with that of the mouse mitochondrial malate
dehydrogenase gene
JOURNAL J. Mol. Biol. 202, 355-364 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 26 + 115 cytosolic malate dehydrogenase, exon 8
pre-msg < 1 > 131 cytosolic malate dehydrogenase mRNA and
introns
IVS < 1 25 cytosolic malate dehydrogenase intron G
IVS 116 > 131 cytosolic malate dehydrogenase intron H
BASE COUNT 27 a 27 c 36 g 41 t
ORIGIN About 0.3 kb after segment 7.
1 gctatgataa tgtaaacttt ttcagggaga gttcgtgtcg atgggtgtta tctctgatgg
61 caactcctat ggtgtccctg atgacctgct ctactcactc cctgtcgtga tcaaggtggg
121 tacatggaga g
//
LOCUS MUSCMDH9 528 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 9.
ACCESSION M36084
KEYWORDS cytosolic malate dehydrogenase.
SEGMENT 9 of 9
SOURCE Mouse (strain C3H/He) male liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 528)
AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
TITLE Structural organization of the mouse cytosolic malate dehydrogenase
gene: Comparison with that of the mouse mitochondrial malate
dehydrogenase gene
JOURNAL J. Mol. Biol. 202, 355-364 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 26 151 cytosolic malate dehydrogenase, exon 9
(cMDHase)
pre-msg < 1 > 353 cytosolic malate dehydrogenase mRNA and
introns
IVS < 1 25 cytosolic malate dehydrogenase intron H
signal 329 334 poly-A signal
signal 449 454 poly-A signal
BASE COUNT 175 a 98 c 100 g 155 t
ORIGIN About 1.4 kb after segment 8.
1 agctctcgcc cttgtcccct gacagaataa gacctggaag tttgttgaag gcctccccat
61 taatgacttc tcccgtgaaa agatggacct gacagcaaag gagctgaccg aggaaaagga
121 gaccgctttt gagtttctct cctctgcgtg actagacact cgttttgaca tcagcagaca
181 gccgaaggct gaggaatcaa aatgtcgtct ttgagcctag taccaaacag taataatgct
241 acattcaaat tgtgaacagc aaaatatttt aaatagtgtg tgctttatga tttgtgaaag
301 tctatcatgt tgttagtgct gcaatctaaa taaaagtata ttcaagtgaa aatctctcag
361 actctgtttc tactttatat ttagtatctt caggaaaaca agtttggccc aatagattat
421 aattttactt ttttaattga ctaaaagaaa taaagatgga aaatattatg aagtaaagca
481 ttagtctcta acataaacaa ggaagcccaa tcaatttcag agggatcc
//
LOCUS MUSMHHLABA 158 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse MHC class I HLA-B7 gene, 5' flank.
ACCESSION M35971
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Mouse L cell line DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 158)
AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I.,
Probst,H. and Mueller,C.A.
TITLE Differential regulation of HLA class I genes by interferon
JOURNAL Immunogenetics 31, 245-252 (1990)
STANDARD simple staff_review
BASE COUNT 21 a 53 c 43 g 41 t
ORIGIN
1 ggtctcaggg tctcaggctc cgagggccgg gtctgcaatg gggaggcgca cgttggggat
61 tccccactcc cctgagtttc acttcttctc ccaacttgtg tcgggtcctt cttccaggat
121 actcgtgacg cgtccccact tcccactccc attgggta
//
LOCUS MUSMHHLABB 160 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse MHC class I HLA-Bw64 gene, 5' flank.
ACCESSION M35972
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Mouse L cell line DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 160)
AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I.,
Probst,H. and Mueller,C.A.
TITLE Differential regulation of HLA class I genes by interferon
JOURNAL Immunogenetics 31, 245-252 (1990)
STANDARD simple staff_review
BASE COUNT 21 a 55 c 43 g 41 t
ORIGIN
1 ggctctcagg gtctcaggct ccgagggccg cgtctgcaat ggggaggcgc agcgttgggg
61 attccccact cccctgagtt tcacttcttc tcccaacttg tgtcgggtcc ttcttccagg
121 atactcgtga cgcatcccca cttcccactc ccattgggtg
//
LOCUS MUSMHHLABC 159 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse MHC class I HLA-B38 gene, 5' flank.
ACCESSION M35973
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Mouse L cell line DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 159)
AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I.,
Probst,H. and Mueller,C.A.
TITLE Differential regulation of HLA class I genes by interferon
JOURNAL Immunogenetics 31, 245-252 (1990)
STANDARD simple staff_review
BASE COUNT 21 a 52 c 44 g 42 t
ORIGIN
1 ggctctcagg gtctcaggct cgagggccgc gtctgtaatg gggaggcgga gcgttgggga
61 ttccccactc ccctgagttt cacttcttct cccaacttgt gtcgggtcct tcttccagga
121 tactcgtgac gcatccccac ttcccactcc cattgggtg
//
LOCUS MUSMHHLABD 159 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse MHC class I HLA-B39 gene, 5' flank.
ACCESSION M35974
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Mouse L cell line DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 159)
AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I.,
Probst,H. and Mueller,C.A.
TITLE Differential regulation of HLA class I genes by interferon
JOURNAL Immunogenetics 31, 245-252 (1990)
STANDARD simple staff_review
BASE COUNT 21 a 52 c 44 g 42 t
ORIGIN
1 ggctctcagg gtctcaggct cgagggccgc gtctgtaatg gggaggcgga gcgttgggga
61 ttccccactc ccctgagttt cacttcttct cccaacttgt gtcgggtcct tcttccagga
121 tactcgtgac gcatccccac ttcccactcc cattgggtg
//
LOCUS MUSTCEAEA 396 bp ss-mRNA ROD 03-AUG-1990
DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION M35486
KEYWORDS T-cell receptor; constant region; diversity exon; joining exon;
processed gene; variable region.
SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL23.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 396)
AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
TITLE Autoimmune disease and T-cell immunologic recognition
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 396 T-cell receptor alpha-chain precursor (AA at
1)
sigp < 1 33 T-cell receptor alpha-chain signal peptide
matp 34 > 396 T-cell receptor alpha-chain
recomb 327 328 V-region end/J-region start
recomb 385 386 J-region end/C-region start
BASE COUNT 113 a 101 c 95 g 87 t
ORIGIN
1 ctaggccttc acctagctgg ggtgagtggc cagcaggaga aacgtgacca gcagcaggtg
61 agacaaagtc cccaatctct gacagtctgg gaaggagaga ccgcaattct gaactgcagt
121 tatgagaaca gtgcttttga ctacttccca tggtaccagc agttccctgg ggaaggtccc
181 gctctcctga tatccatact ttcagtgtcc gataaaaagg aagatggacg attcacaatc
241 ttcttcaata aaagggagaa aaagctctcc ttgcacattg cagactctca gcctggagac
301 tcagccacct acttctgtgc agcaagtgca aattctggga cttaccagag gtttggaact
361 gggacaaaac tccaagtcgt tccaaacatc cagaac
//
LOCUS MUSTCEAEB 384 bp ss-mRNA ROD 03-AUG-1990
DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION M35487
KEYWORDS T-cell receptor; constant region; diversity exon; joining exon;
processed gene; variable region.
SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL51.1.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 384)
AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
TITLE Autoimmune disease and T-cell immunologic recognition
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 384 T-cell receptor alpha-chain precursor (AA at
1)
sigp < 1 39 T-cell receptor alpha-chain signal peptide
matp 40 > 384 T-cell receptor alpha-chain
recomb 315 316 V-region end/J-region start
recomb 373 374 J-region end/C-region start
BASE COUNT 111 a 96 c 93 g 84 t
ORIGIN
1 ctagctgggg tgagtggcca gcaggagaaa cgtgaccagc agcaggtgag acaaagtccc
61 caatctctga cagtctggga aggagagacc gcaattctga actgcagtta tgagaacagt
121 gcttttgact acttcccatg gtaccagcag ttccctgggg aaggtcccgc tctcctgata
181 tccatacttt cagtgtccga taaaaaggaa gatggacgat tcacaatctt cttcaataaa
241 agggagaaaa agctctcctt gcacattgca gactctcagc ctggagactc agccacctac
301 ttctgtgcag caagtgcaaa ttctgggact taccagaggt ttggaactgg gacaaaactc
361 caagtcgttc caaacatcca gaac
//
LOCUS MUSTCEAEC 388 bp ss-mRNA ROD 03-AUG-1990
DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION M35488
KEYWORDS T-cell receptor; constant region; diversity exon; joining exon;
processed gene; variable region.
SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL83.12.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 388)
AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
TITLE Autoimmune disease and T-cell immunologic recognition
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 388 T-cell receptor alpha-chain precursor (AA at
2)
sigp < 1 43 T-cell receptor alpha-chain signal peptide
matp 44 > 388 T-cell receptor alpha-chain
recomb 319 320 V-region end/J-region start
recomb 377 378 J-region end/C-region start
BASE COUNT 112 a 98 c 93 g 85 t
ORIGIN
1 tcacctagct ggggtgagtg gccagcagga gaaacgtgac cagcagcagg tgagacaaag
61 tccccaatct ctgacagtct gggaaggaga gaccgcaatt ctgaactgca gttatgagaa
121 cagtgctttt gactacttcc catggtacca gcagttccct ggggaaggtc ccgctctcct
181 gatatccata ctttcagtgt ccgataaaaa ggaagatgga cgattcacaa tcttcttcaa
241 taaaagggag aaaaagctct ccttgcacat tgcagactct cagcctggag actcagccac
301 ctacttctgt gcagcaagtg caaattctgg gacttaccag aggtttggaa ctgggacaaa
361 actccaagtc gttccaaaca tccagaac
//
LOCUS MUSTCEAED 359 bp ss-mRNA ROD 03-AUG-1990
DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION M35489
KEYWORDS T-cell receptor; constant region; diversity exon; joining exon;
processed gene; variable region.
SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL127.6.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 359)
AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
TITLE Autoimmune disease and T-cell immunologic recognition
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 359 T-cell receptor alpha-chain precursor (AA at
3)
sigp < 1 14 T-cell receptor alpha-chain signal peptide
matp 15 > 359 T-cell receptor alpha-chain
recomb 290 291 V-region end/J-region start
recomb 348 349 J-region end/C-region start
BASE COUNT 107 a 91 c 81 g 80 t
ORIGIN
1 agaaacgtga ccagcagcag gtgagacaaa gtccccaatc tctgacagtc tgggaaggag
61 agaccgcaat tctgaactgc agttatgaga acagtgcttt tgactacttc ccatggtacc
121 agcagttccc tggggaaggt cccgctctcc tgatatccat actttcagtg tccgataaaa
181 aggaagatgg acgattcaca atcttcttca ataaaaggga gaaaaagctc tccttgcaca
241 ttgcagactc tcagcctgga gactcagcca cctacttctg tgcagcaagt gcaaattctg
301 ggacttacca gaggtttgga actgggacaa aactccaagt cgttccaaac atccagaac
//
LOCUS MUSTCEAEE 396 bp ss-mRNA ROD 03-AUG-1990
DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION M35490
KEYWORDS T-cell receptor; constant region; diversity exon; joining exon;
processed gene; variable region.
SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL172.10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 396)
AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
TITLE Autoimmune disease and T-cell immunologic recognition
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 396 T-cell receptor alpha-chain precursor (AA at
1)
sigp < 1 51 T-cell receptor alpha-chain signal peptide
matp 52 > 396 T-cell receptor alpha-chain
recomb 327 328 V-region end/J-region start
recomb 385 386 J-region end/C-region start
BASE COUNT 113 a 101 c 95 g 87 t
ORIGIN
1 ctaggccttc acctagctgg ggtgagtggc cagcaggaga aacgtgacca gcagcaggtg
61 agacaaagtc cccaatctct gacagtctgg gaaggagaga ccgcaattct gaactgcagt
121 tatgagaaca gtgcttttga ctacttccca tggtaccagc agttccctgg ggaaggtccc
181 gctctcctga tatccatact ttcagtgtcc gataaaaagg aagatggacg attcacaatc
241 ttcttcaata aaagggagaa aaagctctcc ttgcacattg cagactctca gcctggagac
301 tcagccacct acttctgtgc agcaagtgca aattctggga cttaccagag gtttggaact
361 gggacaaaac tccaagtcgt tccaaacatc cagaac
//
LOCUS MUSTCEAEF 390 bp ss-mRNA ROD 03-AUG-1990
DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION M35491
KEYWORDS T-cell receptor; constant region; diversity exon; joining exon;
processed gene; variable region.
SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL183.2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 390)
AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
TITLE Autoimmune disease and T-cell immunologic recognition
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 390 T-cell receptor alpha-chain precursor (AA at
1)
sigp < 1 45 T-cell receptor alpha-chain signal peptide
matp 46 > 390 T-cell receptor alpha-chain
recomb 321 322 V-region end/J-region start
recomb 379 380 J-region end/C-region start
BASE COUNT 112 a 99 c 93 g 86 t
ORIGIN
1 cttcacctag ctggggtgag tggccagcag gagaaacgtg accagcagca ggtgagacaa
61 agtccccaat ctctgacagt ctgggaagga gagaccgcaa ttctgaactg cagttatgag
121 aacagtgctt ttgactactt cccatggtac cagcagttcc ctggggaagg tcccgctctc
181 ctgatatcca tactttcagt gtccgataaa aaggaagatg gacgattcac aatcttcttc
241 aataaaaggg agaaaaagct ctccttgcac attgcagact ctcagcctgg agactcagcc
301 acctacttct gtgcagcaag tgcaaattct gggacttacc agaggtttgg aactgggaca
361 aaactccaag tcgttccaaa catccagaac
//
LOCUS MUSTCEAEG 395 bp ss-mRNA ROD 03-AUG-1990
DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION M35492
KEYWORDS T-cell receptor; constant region; diversity exon; joining exon;
processed gene; variable region.
SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL214.12.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 395)
AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
TITLE Autoimmune disease and T-cell immunologic recognition
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 395 T-cell receptor alpha-chain precursor (AA at
3)
sigp < 1 50 T-cell receptor alpha-chain signal peptide
matp 51 > 395 T-cell receptor alpha-chain
recomb 326 327 V-region end/J-region start
recomb 384 385 J-region end/C-region start
BASE COUNT 108 a 101 c 98 g 88 t
ORIGIN
1 ctaggccttc acatgattgt gatgctcctc atattcacaa gggcccatgg agactcagtg
61 actcagacgg aaggtcaagt ggccctctca gaagaggact ttcttacgat acactgcaac
121 tactcagcct agggtaccca gctctgttct ggtacgagca gtatcccggg gaaggtccac
181 agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggt tttgaagcta
241 catatgataa agggaccacc tccttccact tgcggaaagc ctcagtgcaa gagtcagact
301 cggctgtgta ctactgtgct ctgggtccaa attctgggac ttaccagagg tttggaactg
361 ggacaaaact ccaagtcgtt ccaaacatcc agaac
//
LOCUS RATIDA 198 bp ds-DNA ROD 03-AUG-1990
DEFINITION Rat ID-A element in the S-100 beta-subunit gene.
ACCESSION M35983
KEYWORDS .
SOURCE Rat brain DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 198)
AUTHORS Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and
Takahashi,Y.
TITLE ID sequences in the genes of three brain-specific proteins
JOURNAL Biochem. Int. 15, 809-816 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rpt 5 19 direct repeat
rpt 184 198 direct repeat
BASE COUNT 94 a 26 c 58 g 20 t
ORIGIN
1 agctaaaaac gaactccgtg gggttgggga tttagctcag tggtagagcg cttgcctagc
61 aagagcaagg ccctgggttc ggtccccagc tccgaaaaaa agaaaaaaag aaaaaagaaa
121 aagaagaaga agaagaagaa gaagaagaag aagaagaaga agaagaagaa gaagaagaag
181 aagaaaaaag aaacgcgt
//
LOCUS RATIDB 141 bp ds-DNA ROD 03-AUG-1990
DEFINITION Rat ID-B element in the S-100 beta-subunit gene.
ACCESSION M35984
KEYWORDS .
SOURCE Rat brain DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 141)
AUTHORS Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and
Takahashi,Y.
TITLE ID sequences in the genes of three brain-specific proteins
JOURNAL Biochem. Int. 15, 809-816 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rpt 5 19 direct repeat
rpt 121 135 direct repeat
BASE COUNT 52 a 26 c 43 g 20 t
ORIGIN
1 gtccaaagaa tggggcagag aggctgggga tttagctcag tggtagagcg cttacctagg
61 aagcgacagg ccctgggttc ggtccccagc tccgaaagaa agaaccaaaa aaaaaaaaaa
121 aaagaatggg gcagaataga t
//
LOCUS RATIDC 141 bp ds-DNA ROD 03-AUG-1990
DEFINITION Rat ID-C element in the S-100 beta-subunit gene.
ACCESSION M35985
KEYWORDS .
SOURCE Rat brain DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 141)
AUTHORS Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and
Takahashi,Y.
TITLE ID sequences in the genes of three brain-specific proteins
JOURNAL Biochem. Int. 15, 809-816 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rpt 7 19 direct repeat
rpt 113 125 direct repeat
BASE COUNT 39 a 32 c 48 g 22 t
ORIGIN
1 cccttagagc ccaagggcga gggttgggga tttagctcag tggtagagcg cttgcctggc
61 aagtgcaagg ccctgggttc ggtccccagc tccggaaaaa aaaaaaaaaa aggagcccaa
121 gggtgggtca aaggcagcta g
//
LOCUS RSFGCA1 71 bp ds-DNA BCT 03-AUG-1990
DEFINITION Plasmid RSF1010 transposon Tn951 5' inverted terminal repeat.
ACCESSION M25019
KEYWORDS inverted terminal repeat.
SEGMENT 1 of 3
SOURCE Plasmid RSF1010 DNA.
ORGANISM Plasmid RSF1010
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 71)
AUTHORS Cornelis,G., Sommer,H. and Saedler,H.
TITLE Transposon Tn951 (TnLac) is defective and related to Tn3
JOURNAL Mol. Gen. Genet. 184, 241-248 (1981)
STANDARD simple staff_review
FEATURES from to/span description
rpt 16 56 5' inverted terminal repeat
rpt 11 15 5' insertion target sequence
BASE COUNT 23 a 13 c 19 g 16 t
ORIGIN
1 ttgtcaaact gactaggggt ctgacgctca gtggaacgaa aactcacgtt aagcaattgt
61 ggctgagaaa a
//
LOCUS RSFGCA2 109 bp ds-DNA BCT 03-AUG-1990
DEFINITION Plasmid RSF1010 transposon Tn951 DNA fragment.
ACCESSION M25020
KEYWORDS inverted terminal repeat.
SEGMENT 2 of 3
SOURCE Plasmid RSF1010 DNA.
ORGANISM Plasmid RSF1010
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 109)
AUTHORS Cornelis,G., Sommer,H. and Saedler,H.
TITLE Transposon Tn951 (TnLac) is defective and related to Tn3
JOURNAL Mol. Gen. Genet. 184, 241-248 (1981)
STANDARD simple staff_review
BASE COUNT 29 a 27 c 23 g 30 t
ORIGIN
1 gaatcggcat gagtgcaacg gcataagcct gcttaaaaaa caggcttaac gtgggatatt
61 ttccgttttc caagcggacc cctatcaata tgctcggcca ttattcctt
//
LOCUS RSFGCA3 105 bp ds-DNA BCT 03-AUG-1990
DEFINITION Plasmid RSF1010 transposon Tn951 3' inverted terminal repeat.
ACCESSION M25021
KEYWORDS inverted terminal repeat.
SEGMENT 3 of 3
SOURCE Plasmid RSF1010 DNA.
ORGANISM Plasmid RSF1010
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 105)
AUTHORS Cornelis,G., Sommer,H. and Saedler,H.
TITLE Transposon Tn951 (TnLac) is defective and related to Tn3
JOURNAL Mol. Gen. Genet. 184, 241-248 (1981)
STANDARD simple staff_review
FEATURES from to/span description
rpt 50 90 3' inverted terminal repeat
rpt 91 95 3' insertion target sequence
BASE COUNT 31 a 27 c 26 g 21 t
ORIGIN
1 accaaaggac atctgagacc attaaaagag gcgtcagagg tagaaaacgt tgcttaacgt
61 gagttttcgt tccactgagc gtcagacccc gactacgtcc acgcg
//
LOCUS SYNPROA 808 bp ss-mRNA SYN 03-AUG-1990
DEFINITION Manipulation-derived rearrangement of cattle prolactin mRNA.
ACCESSION M35587
KEYWORDS prolactin.
SOURCE Synthetic, cDNA to mRNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 808)
AUTHORS Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and
Baev,A.A.
TITLE Genetic engineering of peptide hormones: II. Possible polymorphism
of preprolactin in cattle. Data of molecular cloning
JOURNAL Mol. Biol. 22, 117-121 (1988)
STANDARD simple staff_review
BASE COUNT 217 a 201 c 195 g 195 t
ORIGIN
1 ttgaacattt ccgaggagag gtcatggatg tagtgggaca ccatgactgc ccggtcaaac
61 aggtctcgaa gggatacctg gcagttgcca ggcccattgg gacagacggg ggtggagacc
121 acaccctggc acaagagtag atttgacacc accagcagca ggagcaggcg ggaccctgct
181 ttctgcgacg aacctttgct gtccatggtg gtgatgattt caagaaacac acttcaccag
241 gaagctctcg tcctatcccg gaagataaag aacaagccca acagacccat catgaagtcc
301 ttatgagctt gattcttggg ttgctgcgct cctggaatga ccctctctat cacctagtca
361 ccgaggtgcg gggtatgaaa ggagccccag atgctatcct atcgagggcc atagagattg
421 aggaagaaaa caaacgactt ctggaaggca tggagatgat atttggccag gttattcctg
481 gagccaaaga gactgagccc taccctgtgt ggtcaggact cccgtccctg caaactaagg
541 atgaagatgc acgttattct gctttttata acctgctcca ctgcctgcgc agggattcaa
601 gcaagattga cacttacctt aagctcctga attgcagaat catctacaac aacaactgct
661 aagcccacat tccatcctat ccatttctga gatggttctt aatgatccat tccctggcaa
721 acttctctga ggtttatagc tttgtaatgc atgcttggct ctaatgggtt tcatcttaaa
781 taaaaacaga ctctgtagcg atgtcaaa
//
LOCUS FLANSHB 856 bp ss-RNA VRL 03-AUG-1990
DEFINITION Influenza A/Houston/24269/85 (H3N2) nonstructural protein RNA (seg
8).
ACCESSION M17699
KEYWORDS nonstructural protein.
SOURCE Influenza A/Houston/24269/85 RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 856)
AUTHORS Buonagurio,D.A., Nakada,S., Parvin,J.D., Krystal,M., Palese,P. and
Fitch,W.M.
TITLE Evolution of human influenza A viruses over 50 years: Rapid,
uniform rate of change in NS gene
JOURNAL Science 232, 980-982 (1986)
STANDARD full staff_review
COMMENT Computer-readable sequence for [1] kindly provided by P.Palese,
12-OCT-1987.
FEATURES from to/span description
BASE COUNT 280 a 173 c 198 g 205 t
ORIGIN Unreported.
1 caacactgtg tcaatcttcc aggtagattg ctttctttgg catgtccaca aacaagttgt
61 agaccaagaa ctgagtgatg ccccattcct tgatcggctt cgccgagatc agaggtccct
121 aaggggaaga ggcagcaccc tcggtctaga catcaaagca gccacccatg ttggaaagca
181 gatagtagaa aagattctga aggaagaatc tgatgaggca cttaaaatga ccatggcctc
241 cacacctgct tcgcgataca taactgacat gactattgag gaattgtcaa gaaactggtt
301 catgctaatg cccaagcaga aagtggaagg acctctttgc atcagaatgg accaggcaat
361 catggagaaa aacatcatgt tgaaagcgaa tttcagtgtg atctttgacc gactagagac
421 cctagtatta ctaagggctt tcaccgaaga gggagcaatt gttggcgaaa tctcaccatt
481 gccttctttt ccaggacata ctattgagga tgtcaaaaat gcaattgggg tcctcatcgg
541 aggacttgaa tggaatgata acacagttcg agtctctaaa actctacaga gattcgcttg
601 gggaagcagt aatgaggatg ggggacctcc acttactcca aaacagaaac ggaaaatggc
661 gagaacagct aggtcaaaag tttgaagaga taagatggct gattgaagaa gtgagacaca
721 gactaaaaac aactgagaat agttttgagc aaataacatt catgcaagcc ttacagctgc
781 tgtttgaagt ggaacaggag ataagaactt tctcatttca gcttatttaa tgataaaaaa
841 cacccttgtt tctact
//
LOCUS VACSPI1A 3043 bp ds-DNA VRL 03-AUG-1990
DEFINITION Vaccinia virus serine protease inhibitor superfamily gene SPI-1.
ACCESSION M24217
KEYWORDS serine protease inhibitor.
SOURCE Vaccinia virus DNA.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 3043)
AUTHORS Kotwal,G.J. and Moss,B.
TITLE Vaccinia Virus Encodes Two Proteins That Are Structurally Related
to Members of the Plasma Serine Protease Inhibitor Superfamily
JOURNAL J. Virol. 63, 600-606 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence for [1] kindly
submitted by B.Moss, 15-FEB-1990.
FEATURES from to/span description
pept < 1 243 unspecified ORF
pept 255 659 16 kDa ORF
pept 927 1988 SPI-1 (40 kDa ORF)
pept 2150 2722 21.5 ORF
BASE COUNT 1100 a 459 c 487 g 997 t
ORIGIN
1 tcacataatc tatttagaga tcgagtcatg cacgattata taagtaatac atatattgat
61 cttgagtgtt tagatattat tagatcgttg gatggattcg atatcaatgg ttactttgaa
121 ggacgtacac cacttcattg cgctatacaa cataacttca ctcagattgc taagtactta
181 ttagatcgag gagctgatat agtcgtaccc aacacattga ttatacatca gtacatacag
241 taaatagcat agatatggag gaggatacaa atatttcaaa taaagttata aggtacaaca
301 ctgtcaataa tatatgggaa acattaccta acttctggac tggaactata aatccaggcg
361 tggtctcgca taaagatgat atatatgttg tatgcgacat caaagatgaa aaaaatgtta
421 aaacttgtat atttagatat aacacgaata cgtataacgg atgggaattg gtcacgacga
481 cagaaagcag attatcagct ctgcatacta ttctttataa caataccata atgatgttac
541 attgttatga atcgtatatg ttacaagata catttaatgt gtacactcgc gaatggaatc
601 atatgtgtca tcaacattcg aatagttata tcatgtacaa tatactaccc atctactaaa
661 tataatagaa taaaataaat gagtatgatc attttagata acgattgatt ttatcattac
721 cgcttcattc ttatattctt tgcttacgga acctatattt agaaacatct actaacgatt
781 ttttatgctt gcattattaa tggtatgtaa tatgattgat tgtgtacgca ataccaattt
841 gttaagtatg aatacggggt acaaacataa actgaagttt aacattattt atttatgata
901 tatatcgtta ttgtttggtc tataccatgg atatctttaa agaactaatc ttaaaacaca
961 cggatgaaaa tgttttgatt tctccagttt ctattttatc tactttatct attctaaatc
1021 atggagcagc tggttctaca gctgaacaac tatcaaaata tatagagaat atgaatgaga
1081 atacacccga tgacaataat gacatggacg tagatattcc gtattgtgcg acactagcta
1141 ccgcaaataa aatatacggt agcgatagta tcgagttcca cgcctccttc ctacaaaaaa
1201 taaaagacga ttttcaaact gtaaacttta ataatgctaa ccaaacaaag gaactaatca
1261 acgaatgggt taagacaatg acaaatggta aaattaattc cttattgact agtccgctat
1321 ccattaatac tcgtatgaca gttgttagcg ccgtccattt taaagcaatg tggaaatatc
1381 cattttctaa acatcttaca tatacagaca agttttatat ttctaagaat atagttacca
1441 gtgttgatat gatggtgagc actgagaata acttgcaata tgtacatatt aatgaattat
1501 tcggaggatt ctctattatc gatattccat acgagggaaa ctctagtatg gtaattatac
1561 taccggacga catagaaggt atatataaca tagaaaaaaa tataacagat gaaaaattta
1621 aaaaatggtg tggtatgtta tctactaaaa gtatagactt gtatatgcca aagtttaaag
1681 tggaaatgac agaaccgtat aatctggtac cgattttaga aaatttagga cttactaata
1741 tattcggata ttatgcagat tttagcaaga tgtgtaatga aactatcact gtagaaaaat
1801 ttctacatac gacgtttata gatgttaatg aggagtatac agaagcatcg gccgttacag
1861 gagtatttat gactaacttt tcgatggtat atcgtacgaa ggtctacata aaccatccat
1921 tcatgtacat gattaaagac aacacaggac gtatactttt tatagggaaa tactgctatc
1981 cgcaataaat ataaacaaat agacttttat cacgtttatc tatgtctaaa tattacaaat
2041 agtaatagta taaactaaag ctgataatac ttaaaaaaat aataatatca tttacaatta
2101 atagtataaa ctaaaaatta aacaaatcgt tattataagt aatatcaaaa tgatgatata
2161 cggattaata gcgtgtctta tattcgtgac ttcatccatc gctagtccac tttatattcc
2221 cgttattcca cccatttcgg aagataaatc gttcaatagt gtagaggtat tagtttcctt
2281 gtttagagat gaccaaaaag actatacggt aacttctcag ttcaataact acactatcga
2341 taccaaagac tggactatcg gcgtactatc cacacctgat ggtttggata taccattgac
2401 taatataact tattggtcac ggtttactat aggtcgtgca ttgttcaaat cagagtctga
2461 ggatattttc caaaagaaaa tgagtattct aggtgtttct atagaatgta agaagtcgtc
2521 gacattactt acttttttga ccgtgcgtaa aatgactcga gtatttaata aatttccaga
2581 tatggcttat tatcgaggag actgtttaaa agccgtttat gtaacaatga cttataaaaa
2641 tactaaaact ggagagactg attacacgta cctctctaat ggggggttgc ctgcatacta
2701 tcgtaatggg gtcgatggtt gattattgat tagtatattc cttattcttt ttattcacac
2761 aaaaagaaca tttttataaa catgaaacca ctgtctaaat gtaattatga tcttgattta
2821 tagatgaaga tcagccttta gaggatttta accagtatgt ttaatatgaa aaaaataaac
2881 ataacatatt ttgagattaa gcgctattgt gcaagattat attagaatca aattaatctt
2941 tcatacgaga aaaataacga catacgtcgt caacaaatta aactttttat ttattagtta
3001 actagcttat agaacttgct cattgttatg tttctaaaac ggg
//
LOCUS VACSPI2A 1475 bp ds-DNA VRL 03-AUG-1990
DEFINITION Vaccinia virus serine protease inhibitor superfamily gene SPI-2.
ACCESSION M24218
KEYWORDS serine protease inhibitor.
SOURCE Vaccinia virus DNA.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 1475)
AUTHORS Kotwal,G.J. and Moss,B.
TITLE Vaccinia Virus Encodes Two Proteins That Are Structurally Related
to Members of the Plasma Serine Protease Inhibitor Superfamily
JOURNAL J. Virol. 63, 600-606 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1475)
AUTHORS Kotwal,G.J. and Moss,B.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence for [2] kindly
submitted by B.Moss, 15-FEB-1990.
FEATURES from to/span description
pept 295 1332 SPI-2
BASE COUNT 487 a 245 c 286 g 457 t
ORIGIN
1 tccatggaaa aacgaaagta gtataaaagt aataaaacaa aaaaaagaat ataaaaaatt
61 tatagctact ttctttgagg actgttttcc tgaaggaaat gaacctctgg aattagttag
121 atatatagaa ttagtataca cgttagatta ttctcaaact cctaattatg acagactacg
181 taaactgttt atacaagatt gaaattatat tctttttttt atagagtgtg gtagtgttac
241 ggatatttaa tattagacta tctctatcgc gctacacgac caatatcgat tactatggat
301 atcttcaggg aaatcgcatc ttctatgaaa ggagagaatg tattcatttc tccagcgtca
361 atctcgtcag tattgacaat actgtattat ggagctaatg gatccactgc tgaacagcta
421 tcaaaatatg tagaaaagga ggagaacatg gataaggtta gcgctcaaaa tatctcattc
481 aaatccataa ataaagtata tgggcgatat tctgccgtgt ttaaagattc ctttttgaga
541 aaaattggcg ataagtttca aactgttgac ttcactgatt gtcgcactat agatgcaatc
601 aacaagtgtg tagatatctt tactgagggg aaaatcaatc cactattgga tgaaccattg
661 tctcctgata cctgtctcct agcaattagt gccgtatact ttaaagcaaa atggttgacg
721 ccattcgaaa aggaatttac cagtgattat cccttttacg tatctccgac ggaaatggta
781 gatgtaagta tgatgtctat gtacggcaag gcatttaatc acgcatctgt aaaggaatca
841 ttcggcaact tttcaatcat agaactgcca tatgttggag atactagtat gatggtcatt
901 cttccagaca agattgatgg attagaatcc atagaacaaa atctaacaga tacaaatttt
961 aagaaatggt gtaactctct ggaagctacg tttatcgatg ttcacattcc caagtttaag
1021 gtaacaggct cgtataatct ggtggatact ctagtaaagt caggactgac agaggtgttc
1081 ggttcaactg gagattatag caatatgtgt aattcagatg tgagtgtcga cgctatgatc
1141 cacaaaacgt atatagatgt caatgaagag tatacagaag cagctgcagc aacttgtgca
1201 ctggtgtcag actgtgcatc aacaattaca aatgagttct gtgtagatca tccgttcatc
1261 tatgtgatta ggcatgttga tggaaaaatt cttttcgttg gtagatattg ctctccgaca
1321 actaattgtt aaccattttt tttaaaaaat agaaaaaaca tgtggtatta gtgcaggtcg
1381 ttattcttcc aattgcaatt ggtaagatga cggccaactt tagtacccac gtcttttcac
1441 cacagcactg tggatgtgac agactgacca gtatt
//
LOCUS YSCARSX 1539 bp ds-DNA PLN 03-AUG-1990
DEFINITION Yeast (S.cerevisiae) chromosome III L terminal region DNA.
ACCESSION M12922
KEYWORDS .
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1539)
AUTHORS Button,L.L. and Astell,C.R.
TITLE The Saccharomyces cerevisiae chromosome III left telomere has a
type X, but not a type Y', ARS region
JOURNAL Mol. Cell. Biol. 6, 1352-1356 (1986)
STANDARD simple staff_review
REFERENCE 2 (sites)
AUTHORS Longtine,M.S., Wilson,N.M., Petracek,M.E. and Berman,J.
TITLE A yeast telomere binding activity binds to two related telomere
sequence motifs and is indistinguishable from RAP1
JOURNAL Curr. Genet. 16, 225-239 (1989)
STANDARD full staff_entry
FEATURES from to/span description
site 232 236 an unusual yeast telomere repeat motif (CCCCA)
BASE COUNT 514 a 551 c 130 g 344 t
ORIGIN
1 cccacacacc acacccacac cacacccaca caccacacac accacaccca cacacccaca
61 ccacaccaca cccacaccac acccacacac ccacacccac acaccacacc cacacacacc
121 acacccacac acaccacacc cacacaccac acccacacac acaccacacc cacacacacc
181 acaccacacc cacaccacac ccacacccac acaccacacc acaccacacc cacaccacac
241 accacaccca cacacaccac accacacaca ccacaccaca cccacacacc acacccacac
301 acccacaccc acacacacca cacccacacc acacccacac ccacacaccc acaccctaac
361 actaccctaa cactacccta ttctaaccct gattttacct gtctcccaac ttactctcca
421 ttaccctacc tctccactcg ttaccctgtc tgattcaacc gtaccactcc caaccaccat
481 ccatctctct acttactacc accaacccac cgtccaccat aaccgttacc ctccaactac
541 ccgtatccaa ctccactacc gcttacccta ccatcgacca tgtcctactc actgtactgt
601 tgttcaccca ccatattgaa acgtctacaa atgatcgtaa ataatacaca tatacttatc
661 ctaccactct aatcccacta ccacatgcca tactcacctt cacttgtatt ctgatcggtc
721 atacgcacac ggatgctaca gtatatacca tctcaaactt acctactttc atattcatca
781 tcacccatct ctcaccatca gtaccaatgc actcgcatca ttatgcacgg cttgcctcag
841 cggtctatac cctgtgccat ttacgcataa cgcccatcat tatccacatt ttaatatcta
901 tatctcattc ggcgacacca aatattgtat aactgccctt aatacatacg ttataccact
961 tttacaccat atactaacca ctcaatttat acacacttat gtcaatataa ccacaaaatc
1021 accactaaaa tcacctaaac ataaaaatat tctactcttc aacaataata cataaacaca
1081 ctcaattgcg tatctatacc accatgacgt cattaacgta aaagttcctt aatattgcaa
1141 attgcttgaa cggatgccat ttcatgttga atgtggtaac ccaatagcat gatatgagta
1201 atgctttagt attgtttcag agttgtttca gtaatgtttt agacaaggag aacatatagt
1261 agcaaacctc taatccggta gtacttaaga aactacagtt tctatgtacg aaagcagtaa
1321 ctatgtaatt attacattta catgacatat aggaaggtcc aataaactta ctacattatg
1381 acctataagc tagatcgtaa ttcattacgt caacaggtta tgagccctag agcaatgctt
1441 ctgagaacat aatcaatcta tctagcccca acaattataa acagtggctg tacggtatcg
1501 agaccgctgc tgaatatgct aacgaatata tgaacgaat
//
LOCUS VAC35K 840 bp ds-DNA VRL 03-AUG-1990
DEFINITION Vaccinia virus gene for 35K complement control protein.
ACCESSION X13166
KEYWORDS complement protein; regulatory protein; secretory protein.
SOURCE Vaccinia virus DNA.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 840)
AUTHORS Kotwal,G.J. and Moss,B.
TITLE Vaccinia virus encodes a secretory polypeptide structurally related
to complement control proteins
JOURNAL Nature 335, 176-178 (1988)
STANDARD simple automatic
COMMENT *source: library=HindIII in M13; strain=wildtype.
EMBL features not translated to GenBank features:
key from to description
SITE 43 54 late transcription start site
OPR 543 549 early transcription stop signal
FEATURES from to/span description
pept 52 > 840 35K preprotein
sigp 52 108 signal peptide (AA -19 to -1)
matp 109 > 840 mature 35K protein (AA 1-244)
BASE COUNT 257 a 152 c 179 g 252 t
ORIGIN
1 tttttattat ttgtacgatg tccaggataa catttttacg gataaataaa tatgaaggtg
61 gagagcgtga cgttcctgac attgttggga ataggatgcg ttctatcatg ctgtactatt
121 ccgtcacgac ccattaatat gaaatttaag aatagtgtgg agactgatgc taatgctaat
181 tacaacatag gagacactat agaatatcta tgtctacctg gatacagaaa gcaaaaaatg
241 ggacctatat atgctaaatg tacaggtact ggatggacac tctttaatca atgtattaaa
301 cggagatgcc catcgcctcg agatatcgat aatggccaac ttgatattgg tggagtagac
361 tttggctcta gtataacgta ctcttgtaat agcggatatc atttgatcgg tgaatctaaa
421 tcgtattgtg aattaggatc tactggatct atggtatgga atcccgaggc acctatttgt
481 gaatctgtta aatgccaatc ccctccatct atatccaacg gaagacataa cggatacgag
541 gatttttata ccgatgggag cgttgtaact tatagttgca atagtggata ttcgttgatt
601 ggtaactctg gtgtcctgtg ttcaggagga gaatggtccg atccacccac gtgtcagatt
661 gttaaatgtc cacatcctac aatatcaaac ggatacttgt ctagcgggtt taaaagatca
721 tactcataca acgacaatgt agactttaag tgcaagtacg gatataaact atctggttcc
781 tcatcatcta cttgctctcc aggaaataca tggaagccgg aacttccaaa atgtgtacgc
//
LOCUS BSUCISAB 2246 bp ds-DNA BCT 03-AUG-1990
DEFINITION B.subtilis site-specific DNA recombinase (cisA) and RNA polymerase
sigma-K subunit (cisB; spoIVCB) genes, complete cds.
ACCESSION M29040
KEYWORDS DNA recombinase; RNA polymerase sigma-K subunit.
SOURCE B.subtilis (strain JH642) DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2246)
AUTHORS Sato,T., Samori,Y. and Kobayashi,Y.
TITLE The cisA cistron of Bacillus subtilis sporulation gene spoIVC
encodes a protein homologous to a site-specific recombinase
JOURNAL J. Bacteriol. 172, 1092-1098 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Kobayashi, 14-OCT-1989.
FEATURES from to/span description
pept 134 1636 DNA recombinase (cisA) (gtg start codon)
pept 2014 1544 (c) RNA polymerase sigma-K subunit (cisB)
binding 119 127 ribosome binding site
signal 1635 1660 pot. transcription termination signal
BASE COUNT 776 a 405 c 475 g 590 t
ORIGIN 3 bp upstream of PvuII site.
1 cagctgttac atagcattac ccaaggggtg atgcatttta tgaaagtgat aatcatcgag
61 ggaccgcaag ctgacaaatg cattaacgat tgctatcatt atttaataaa actttatagg
121 aaggagattc agggtgatag caatatatgt aagggtatcg accgaggaac aagcgatcaa
181 gggatcgagc atcgacagcc aaatcgaggc ctgtataaag aaagcaggga ctaaagatgt
241 gctgaagtat gcagatgaag gattttcagg agagctttta gaacgtccgg ctttgaatcg
301 cttgagggag gatgcaagca agggacttat aagtcaagtc atttgttacg atcctgaccg
361 tctttctcgg aaattaatga atcagctaat cattgatgac gaattgcgaa agcgaaacat
421 acctttgatt tttgtaaatg gtgaatacgc caattctcca gaaggtcaat tgtttttcgc
481 aatgcgcggg gcaatctcag aatttgaaaa agccaaaatc aaagaacgga catcaagcgg
541 ccgacttcaa aaaatgaaaa aaggcatgat cattaaagat tctaaactat atggctataa
601 atttgttaaa gagaaaagaa ctcttgagat attagaagag gaagcaaaaa tcattcggat
661 gatttttaac tatttcaccg atcataaaag cccttttttc ggcagagtaa atggtattgc
721 tctacattta actcagatgg gggttaaaac aaaaaaaggc gccaaagtat ggcacaggca
781 ggttgttcgg caaatattaa tgaactcttc ctataagggt gaacatagac agtataaata
841 tgatacagag ggttcctatg tttcaaagca ggcagggaac aaatctataa ttaaaataag
901 gcctgaagaa gaacaaatca ctgtgacaat tccagcaatt gttccagctg aacaatggga
961 ttatgctcaa gaactcttag gtcaaagtaa aagaaaacac ttgagtatca gccctcacaa
1021 ttacttgtta tcgggtttgg ttagatgcgg aaaatgcgga aataccatga cagggaagaa
1081 aagaaaatca catggtaaag actactatgt atatacttgc cggaaaaatt attctggcgc
1141 aaaggaccgc ggctgcggaa aagaaatgtc tgagaataaa ttgaaccggc atgtatgggg
1201 tgaaattttt aaattcatca caaatcctca aaagtatgtt tcttttaaag aggctgaaca
1261 atcaaatcac ctgtctgatg aattagaact tattgaaaaa gagatagaga aaacaaaaaa
1321 aggccgcaag cgtcttttaa cgctaatcag cctaagcgat gacgatgatt tagacataga
1381 tgaaatcaaa gcacaaatta ttgaactgca aaaaaagcaa aatcagctta ctgaaaagtg
1441 taacagaatc cagtcaaaaa tgaaagtcct agatgatacg agctcaagtg aaaatgctct
1501 aaaaagagcc atcgactatt ttcaatcaat cggtgcagat aacttaactc ttgaagataa
1561 aaaaacaatt gttaacttta tcgtgaaaga agttaccatt gtggattctg acaccatata
1621 tattgaaacg tattaaagag gggtgtatgc accccccttt tgtaattaca atctcatttt
1681 caatacacct cgctgcatac gtcgccacct ttgtcccttt tccagcggaa tagctttcaa
1741 ttcctttaat aagcccgatc gttccgatgg agattaagtc ctctgcatcc tcacctgtat
1801 tttcgaactt tttcacaata tgggcgacca agcgaagatt atgctcaatc agcatgtttc
1861 tggcatgttc atccccttta gccatgagct ctaagtattt tttttcttcg ctgcttgaga
1921 gcggttgtgg aaaggcattg tttttcacgt aagatactaa aaagacaagc tctttaacaa
1981 caaagccgag cgctgcgaaa acacctgtca ccatcgtcac ctccacaaaa gtatgtaggc
2041 aaaagcctat atgtaaatgt atgtgaccgg gaggctgtct gtgtctgtac cgggaaattt
2101 ctcgcggggg agtccggctt tggtaagcag catactaact tcaaatgtta aaaatattct
2161 tcatcaagcg cccatacatt gaaatgaaca aaggtatggg gggatgggga tgaaaaaatg
2221 gatggcaggc ctgtttcttg ctgcag
//
LOCUS BSUSIG43 5127 bp ds-DNA BCT 03-AUG-1990
DEFINITION Bacillus subtilis sigma 43 operon with P23-dnaE-rpoD genes
(dnaE for DNA primase, rpoD for RNA polymerase).
ACCESSION X03897
KEYWORDS DNA primase; RNA polymerase; polymerase; primase;
unidentified reading frame.
SOURCE Bacillus subtilis.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 5127; enum. 1 to 5127)
AUTHORS Wang,L.-F. and Doi,R.H.
TITLE Nucleotide sequence and organization of Bacillus subtilis RNA
polymerase major sigma (sigma 43) operon
JOURNAL Nucleic Acids Res. 14, 4293-4307 (1986)
STANDARD simple automatic
COMMENT EMBL features not translated to GenBank features:
key from to description
PRM 728 733 -35 region P1
PRM 750 755 -10 region P1
PRM 754 759 -35 region P2
PRM 778 783 -10 region P2
RBS 848 851 pot. rRNA binding site
PRM 942 949 -35 region P3
PRM 963 971 -10 region P3
RBS 1462 1472 pot. rRNA binding site
RBS 3462 3474 pot. rRNA binding site
OPR 4609 4631 palindrome pot. transcription
terminator
FEATURES from to/span description
pept 855 1445 P23 unidentified reading frame
pept 1479 3290 dnaE
pept 3489 4604 rpoD
BASE COUNT 1677 a 904 c 1177 g 1369 t
ORIGIN
1 gaattcttat gttgaagata taggcactat caatgaagtg atttcacttg caaaggcaga
61 cggcggcatt atctgtttta cactcgtggt gccggaaatc agagaatatt tgatagccga
121 agcggaaaaa gcaaatgttt tatattatga tattatcggc ccgttgattg ataaaatgga
181 aacagcctac ggtttaacag cgaaatacga accggggcgg gtgcgccagc ttgatgaaga
241 ttatttcaaa aaagtggagg ccatcgagtt tgcagttaaa tacgatgatg gacgtgatcc
301 aagagggatt ttaaaagctg atatcgtttt gatcggcgtg tcaagaacgt ctaaaacacc
361 gctgtctcaa tatctcgcac acaaacgcct gaaggttgcc aatgttccga ttgtaccgga
421 ggttgatccg ccggaagaac tctttaacgt tgatccgaaa aaatgcatcg gtttaaagat
481 tagccctgat aaactgaatc atatcagaaa agaacgttta aaatcactcg ggcttaatga
541 taaagcgatt tatgcaaata tcaacagaat caaagaggaa ctcgagtatt tcgaaaagat
601 tgtggatcgg atcggctgcc aggttgttga tgtttcaaat aaagcggttg aggaaacagc
661 aaatattatc catcatctca aaacaaaaaa catataactc aggacgctct atcctgggtt
721 tttggctgtg ccaaaaggga ataatgaaaa acaatagcat ctttgtgaag tttgtattat
781 aataaaaaat tgtgataaaa tgattaattt taggtttaag gatcgtgtga tacgaataaa
841 ctattatggg taagatgtca agaatttctc ccggaaattt ttcgacaaat tcatatacat
901 ccacaataat aaaggatgtg cgattttgct tgcttttatg caggagttta atggagggat
961 ggagaattac tcttcttaat gaacaagaaa agacgatttt tgtcgatgct gatgcttgtc
1021 cggtaaaaga tgaaatttta caaacagcat ccgagtatga agttcaagtt ctttttgtcg
1081 cttcatttga acattatcag ctttccagaa gcaatgaaga aaaatggaag tatgttgatc
1141 ctcataaaga agctgctgat ttatatatcg caaatcacgt gaaaccggga gatattgttg
1201 tgacgcagga catcggatta gcatctctgc tgttgaacag aaatgtctct gttatgtcgg
1261 aaagaggtcg tctttacaag gaagacacga ttgattttgc cctagagggc cgtcattttt
1321 ccggcaaaca aagaagaaaa ggcgtatatg ccaaagggcc taaaaaattg aataaagaag
1381 atcgagaacg atttattaca ctgctgcaaa aaatcctgtc gaacgatgaa gggattttgc
1441 actaaagcat cgaataatgt acgacggagt gttataagat gggaaatcgg ataccagatg
1501 aaattgtgga tcaggtgcaa aagtcggcag atatcgttga agtcataggt gattatgttc
1561 aattaaagaa gcaaggccga aactactttg gactctgtcc ttttcatgga gaaagcacac
1621 cttcgttttc cgtatcgccc gacaaacaga tttttcattg ctttggctgc ggagcgggcg
1681 gcaatgtttt ctctttttta aggcagatgg aaggctattc ttttgccgag tcggtttctc
1741 accttgctga caaataccaa attgattttc cagatgatat aacagtccat tccggagccc
1801 ggccagagtc ttctggagaa caaaaaatgg ctgaggcaca tgagctcctg aagaaatttt
1861 accatcattt gttaataaat acaaaagaag gtcaagaggc actggattat ctgctttcta
1921 ggggctttac gaaagagctg attaatgaat ttcagattgg ctatgctctt gattcttggg
1981 actttatcac gaaattcctt gtaaagaggg gatttagtga ggcgcaaatg gaaaaagcgg
2041 gtctcctgat cagacgcgaa gacggaagcg gatatttcga ccgcttcaga aaccgtgtca
2101 tgtttccgat ccatgatcat cacggggctg ttgttgcttt ctcaggcagg gctcttggca
2161 gccagcagcc taagtatatg aacagtcctg aaaccccgct ctttcataaa agcaaactgc
2221 tttacaattt ttataaggcc cgccttcata tcagaaagca ggaaagagca gtcttatttg
2281 aagggtttgc tgatgtctat acggccgtaa gctcggatgt aaaggaaagc atagccacga
2341 tgggaacgtc tcttacagat gatcatgtca agatcctgag aagaaacgtc gaagaaatca
2401 ttctttgcta tgactctgat aaagccggtt atgaagccac cttaaaagct tcggagcttc
2461 tgcaaaaaaa aggctgcaaa gtcagagttg caatgattcc tgacggattg gaccctgatg
2521 attacatcaa aaaattcggc ggggaaaaat ttaaaaacga cattattgac gcaagtgtca
2581 ccgtaatggc gttcaaaatg caatatttcc gaaaaggaaa gaacctgtcc gatgaaggcg
2641 accgcctagc ttacattaaa gacgtactga aagaaatcag cacgctttca gggtctctag
2701 agcaggaagt ctatgtaaag cagcttgctt cagagttttc gctttcacag gagtctttaa
2761 ctgagcagct gtctgttttc agcaagcaaa acaaacctgc tgacaatagc ggtgaaacta
2821 aaacgcggcg agcgcatctg acgacaaaag caaggcaaaa acgtttgcgt ccggcgtatg
2881 aaaatgcaga aaggctgtta ctcgctcaca tgcttcgaga tcggagcgtc atcaaaaaag
2941 tgattgaccg ggtagggttt caatttaata ttgatgagca ccgggcatta gccgcttatc
3001 tttatgcttt ttatgaagag ggagccgagc tgacgcctca gcatctgatg gccagggtga
3061 cggatgatca tataagccag ctcttgtccg atatattaat gcttcaggtt aatcaagagc
3121 ttagcgaagc cgagttatca gattatgtaa aaaaagtgtt gaatcaaaga aattggtcaa
3181 tgataaaaga aaaagaggcg gaaagagccg aagcagaaag gcaaaaagat tttttaagag
3241 ctgcttcttt ggctcaagaa atcgttacat tgaaccgatc tttaaaataa ctggagaact
3301 gatgaggagc atttattggc aatgattcct tgcggaggag caaatagatc gcttaacctc
3361 atcatgaatt gtcatttcat tattcgcaca ttgttaaagg cagttcacat agaaaacgcc
3421 tgaatggacc gaataagaat cataccgctt atagaattcg ttgcaagctt tggaaggagg
3481 gatccataat ggctgataaa caaacccacg agacagaatt aacattcgac caagtaaaag
3541 agcaattaac agagtctggt aaaaaacgtg gcgttttgac atatgaagaa attgctgagc
3601 gtatgtccag ctttgaaatt gaatcagacc aaatggatga gtattatgaa tttttaggtg
3661 aacaaggtgt tgaattaatt agtgagaatg aagaaacaga agatcctaat attcagcagc
3721 ttgccaaagc cgaagaagaa tttgacctta atgacctaag tgtaccgcct ggcgttaaaa
3781 tcaatgaccc agttcgtatg tatttaaagg aaatcggtcg ggttaacctt ctttctgcaa
3841 aagaagaaat cgcctacgct caaaagattg aagaaggtga cgaagaatct aaacgcagat
3901 tggctgaagc gaacctgcgg cttgttgtca gtatcgcaaa acggtatgtc ggacgcggta
3961 tgctgttcct tgatctgatc catgaaggaa acatgggcct gatgaaagcc gttgaaaaat
4021 ttgattatcg caaaggttat aaattcagta cgtatgctac gtggtggatc agacaggcga
4081 ttacacgcgc cattgccgat caggcgagaa cgatccggat tcccgttcat atggttgaaa
4141 ccattaataa attaatccgt gtgcagcgtc aattactgca agacttaggc agagaaccaa
4201 cacctgaaga aattgcggaa gatatggatt taacgcctga aaaagtacgc gaaatcttaa
4261 agattgctca agagccggta tctctggaaa caccgatcgg tgaagaggat gactcgcacc
4321 ttggtgattt cattgaagac caagaagcaa cttcaccttc tgaccacgcc gcatacgagc
4381 tattgaaaga gcagctggaa gatgtgcttg atacgttaac tgatcgtgaa gaaaatgtat
4441 tgcgtcttcg attcggtctt gatgacggcc gtacaagaac attagaagag gtcggcaaag
4501 tatttggagt aacgagagag cgtattcgac aaatcgaagc caaagcgttg cggaaactaa
4561 gacatcctag cagaagtaaa cgtttgaaag atttccttga ataagatgga acgggtcttg
4621 aagatccgtt cttctttttt taaaaagata tatggataat atgcctttat tttactgaaa
4681 aatgatgtca tttgcaaatg aacattgtgg tgaaaaattt caaaatctaa ttccatattt
4741 tctattgtaa gcgtatacaa tacattatac aatagaataa aaaggatatt agagatttag
4801 gcatgtttct atttcaattt tgtataaagt gtgaataaaa aacttttgta tagcaatcca
4861 tttacttttt gtaaaaataa gttagaatta gaagtgttta cataggggga aggatcaaaa
4921 agggggaagg gaaatgaaat ggaacccgct tattccattt ttgctgatcg ctgttttagg
4981 aatgggtcta actttctttt tatcagtaaa aggacttgat gactctcggg aggttgcgag
5041 cggaggagaa agcaaatctg ctgaaaagaa agatgcaaac gcttcaccag aagaaattta
5101 caaggcaaat cgcatcgcat cgcatgc
//
LOCUS CHKMHBFVB 4790 bp ds-DNA VRT 03-AUG-1990
DEFINITION Chicken MHC class I B-FIV-B12 alpha-chain gene, complete cds.
ACCESSION M31012
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Chicken (strain CB; haplotype B12), clone cBF23.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 4790)
AUTHORS Kroemer,G., Zoorob,R. and Auffray,C.
TITLE Structure and expression of a chicken MHC class I gene
JOURNAL Immunogenetics 31, 405-409 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.Kroemer, 21-DEC-1990.
FEATURES from to/span description
pept 1405 1471 MHC B12 alpha-chain, exon 1
1589 1852 MHC B12 alpha-chain, exon 2
2081 2353 MHC B12 alpha-chain, exon 3
2457 2729 MHC B12 alpha-chain, exon 4
2803 2910 MHC B12 alpha-chain, exon 5
3021 3053 MHC B12 alpha-chain, exon 6
3213 3245 MHC B12 alpha-chain, exon 7 (put.)
3400 3419 MHC B12 alpha-chain, exon 8
IVS 1472 1588 MHC B12 alpha-chain intron A
IVS 1853 2080 MHC B12 alpha-chain intron B
IVS 2354 2456 MHC B12 alpha-chain intron C
IVS 2730 2802 MHC B12 alpha-chain intron D
IVS 2911 3020 MHC B12 alpha-chain intron E
IVS 3054 3212 MHC B12 alpha-chain intron F
IVS 3246 3399 MHC B12 alpha-chain intron G
pre-msg 1347 3587 MHC B12 alpha-chain and introns (alt.)
pre-msg 1379 3587 MHC B12 alpha-chain and introns (alt.)
binding 1328 1337 Sp1 binding site
site 1219 1228 interferon responsive element
site 1263 1270 cAMP responsive element
site 1238 1244 S box (B-L-beta)
site 1247 1255 X box (B-L-beta)
site 1288 1294 Y box (B-L-beta)
site 1313 1317 CAAT box
BASE COUNT 812 a 1637 c 1460 g 881 t
ORIGIN Microchromosome 16.
1 acctataagg gacatgggaa cacagcagga ggtgtttggg ttctggggac atggggggaa
61 ctctggggaa ggagatgtga ggtgacatgg gtgacatggc aggaggggac ccaagggtgc
121 acacatccct gtgtgccaca ggggtgtccc caaccccctc accccagagg acatccccca
181 tgtccccata tcccccggtg tccccatacc tcaatttgta ggagctggtt gaagttgttg
241 tggggtgtga gggcaaatgg gactgggagg gtgcaggccc agccccacgg tccccctcag
301 cccatgcagt caccgtcccg gtcaccggcc ccacagctgc cagcaacagc ccatgggggt
361 ccccaaggcc acaggggcgg ggggccacca ggacccacct gagggaacac tgcttgagcc
421 accagcccca taactgctcc ataagccctc ccagatccca taagcatcca gacacccctt
481 aggtccatag gaatcccccc agctctggcc ccataacccc acccaaccac acagaaatct
541 cccttagtcc ccccagactc catcacaccc caatcctata gcagccccca gccccaccag
601 tgccccatag tccctcatag gcactgcaga acattcatag tcccatagct accccacaaa
661 caccccatag ccccatactc agggtcctgg gagctggccc tcattaaggg agtgaggaag
721 aggaggagga agagctgaag ccagtgaggc ccatggctcc ggcagtgtgg ggcagatggg
781 agcagtggcg cagggacagc ggggttgggg tcaatcatta acttgggggg tggacattgg
841 ggtccacctg cagaccctga gggcatcgac ctcagatgga ccacgtggaa tgtggggaag
901 cggtgtgggg cggcgctgtg gggaaatggc actgatccca aaggaagccc tgggggacac
961 cactgtcacc accccctccc ccccggacac gagcctctaa ccctatggat gtgacccccc
1021 ccccaaacag ttccttgact ggaatggaga atgactcctc ctcttctcac ggcccccgtc
1081 ctcccggcag agaccctcgg ccccctcccc cggggtaagg acgcggcccc gtgcgaacac
1141 agcccgggtg cccccccgtg ctcgaagggc cgggggttcc cacaccacgc ccatcccctc
1201 ccccgctccg cctttcgctt tcgcttcaca acctgaggga gcgcattctg cctggcgccc
1261 gatgacgtca cataaaactc caactaccat tggcggagag gcgacggagg agccaatggg
1321 ggcgcggggc ggggcggagg agtaggaaaa gctgaaggag ctgcgctggg tgcggcggac
1381 ttgagagtgc agcggtgcga ggcgatgggg ccgtgcgggg cgctgggcct ggggctgctg
1441 ctcgccgccg tgtgcggggc ggcggccccc ggtgagtgcg gccggaccgg gacccctccc
1501 ccgcccgtaa ccccaccccg gggctgtgtc cgtgggatcc tgagacccac acccgcggct
1561 cacggcccgc tgtgctccgt ccccgcagag ctccataccc tgcggtacat ccaaacggcg
1621 atgacggatc ccggccccgg gcagccgtgg ttcgtgactg tggggtacgt ggacggggaa
1681 ctcttcgtgc actacaacag caccgcgcgg aggtacgtgc cccgcaccga gtggatagcg
1741 gccaaggcgg accagcagta ctgggatgga cagacgcaga tcggacaggg caatgagcag
1801 attgaccgcg agaacctggg catactgcag cggcgctaca accagaccgg cggtgagcac
1861 ggccggggcc gcggctccgt gggtgtggga tgggctccat gcgcagtgcc gcccacaccc
1921 cccaggcctg gccctgcccg gcggcaccgt cccggggctg cccgtcacag ccccaccgcg
1981 ctcggggtgc cgcgtcccgg ggggacccca acccatcccc gctgcagtgg gagccccgga
2041 gccggagggg cccctcaccc cctgcccggc tgtgtttcag ggtctcacac ggtgcagtgg
2101 atgtacggct gtgacatcct cgagggcggc cccatccggg ggtattatca gatggcctac
2161 gatgggagag acttcactgc cttcgacaaa ggcacgatga cgttcactgc ggcagttcca
2221 gaggcagttc ccaccaagag gaaatgggag gaagagagtg aacctgagag gtggaagaat
2281 tacctggagg aaacctgcgt ggagtggctg cggagatacg tggaatacgg gaaggctgag
2341 ctgggcagga gaggtgagcg gggtgggggg gggagcggct gcagtgtggg gctggacgtg
2401 gggcgggggc tcagcgtggg gagctcagcc cggccctcac tgccgcccgc ccgcagagcg
2461 gcccgaggtg cgagtgtggg ggaaggaggc cgacgggatc ctgaccttgt cctgccgcgc
2521 tcacggcttc tacccgcggc ccatcgttgt cagctggctg aaggacggcg cggtgcgggg
2581 ccaggacgcc cactcggggg gcatcgtgcc caacggcgac ggcacctacc acacctgggt
2641 caccatcgat gcgcagccgg gggacgggga caagtaccag tgccgcgtgg agcacgccag
2701 cctgccccag cccggcctct actcgtgggg tgagtgaggg gatgtggggc tggggggctg
2761 cgggctgccc cttcccctgc tgatggcccc gctctccccc agagccgcca cagcccaacc
2821 tggtgcccat cgtggcgggg gtggccgtcg ccattgtggc cattgccatc atggttggtg
2881 ttggattcat catctacaga cgccatgcag gtaaaagcag aggggtgcag gcgggcagtg
2941 gggggatctg ggtcccccct tgggagccct catgcctggc tgtgatgtga acctgtgctg
3001 aagcatctct ctgtctgcag ggaagaaggg gaagggctac aacatcgcgc ccggtgagtg
3061 atgagggcag cgctgtcccc cacctctgcc cagtgccagg gcggtcctgg ggtctgcact
3121 ttctcccagg gtacccattc ctggtgcttg gggctgctcc acgccccata gggagcacag
3181 ggctggatct cacagctgtt cctcccttat agacagggaa ggtggatcca gcagctcgag
3241 cacaggtgcg tgtggggctg tgggttggga ggggtccgtg tgctctctgt ggtactgccc
3301 agggctgggc tatgctgggg ctctgcgggg agacccccgg agcagagggt tgggatgtga
3361 acctggcccc gtgggacatc atcccttctc atcccacagg gagcaacccc gccatctgag
3421 tgctgtgctt cagcctgcaa ggagccaaca gtccacacca gcatttgggg tcagtgatgg
3481 gcacagcccc atcctcttga cctctcacat ctcattctgc ttcctatgct gactgttatg
3541 ctttgcctgc actgcttcct gtgaaataaa atgatgggcc attctgtggc tcagcttgcc
3601 tgcagtctgc actgtgctgt ggttggggat ggggtggatg aggggaccgt gtcagtttgg
3661 ctgctcaggg tgcagatgtg gccctgtgct gagtacccac tgccctcccc tcttctatct
3721 gcctgctgct cactcccccc tgtgtacccc catcccttct cacctctcct ctgtgacccc
3781 atgctggtgg ttgcttgctc cctgtcctgg cagaactctc attttccaat ggcatccctg
3841 tttgttggga tgtggtctcc ttggtcctcc ccccagcagt cactgcacat atccacccca
3901 cttccccccc aggttgctgt cccacagcac tcctatttcc ttctctgccc tccccccccg
3961 cccatccagc tgcctctgca atcctcaccc acaaccttgc ccactccacc tccctcatcc
4021 cgcccttccc ccagctgtcc tgtccctgct gggccccctc cccccacatt gtaccctaca
4081 cccaaataaa tacgtttgtt ctgctgccct ccagccgtct tctggtttat ttcccccccg
4141 atttgttgtt gtggcgtccg ctcttctcct ggggggaagg ggctctgggg gtccctcatt
4201 ctccctgctc ttcttacagc accgggactc cccgcgctga gatcccaaca cacccgggta
4261 caaacatgcg gcttttattc ccagttctgt gtcccacccc cggccctggt ggcactcagt
4321 ggcactgcag tccatgcagt ggccgttgtg tgtcgtagag cagcggtacc cgagcgcgcc
4381 ccggctcggc atccatgtgc ccacggcaca gctcttgtgg tcccttgtcg ctgccccggt
4441 gtccccacct ccaccctcag tgtccccaac gtgcagtgtc cccatcccac atcccaccac
4501 cccctgccct ccccatccat aactcccgct atccctctcc ctctgtcccc ctcccccggt
4561 gctccctgtt gtccccagtc ccgcagaagg ctgccggggg cagcacctcg tgggggggtc
4621 cctgctgcca cacctctccc ccgtccaaca ccactaccca ttgtaccagg gctgcccgcc
4681 ccgtcaccat cagcactgca catccttacc ctctggctag ctggatccaa cttgtggaca
4741 ggcttctgat gccctgtgta tgacaaggtt tctgatcctc gcgcctgcag
//
LOCUS DOGMHDL9A 1499 bp ss-mRNA MAM 03-AUG-1990
DEFINITION Dog MHC class I DLA-A9/A9 alpha-chain mRNA, complete cds.
ACCESSION M32283
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Dog (#C624; haplotype DLA-A9/A9 homozygote) peripheral blood
leukocyte, cDNA to mRNA, clone pBT-I16.
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 1499)
AUTHORS Sarmiento,U.M. and Storb,R.
TITLE Nucleotide sequence of a dog class I cDNA clone
JOURNAL Immunogenetics 31, 400-404 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
U.M. Sarmiento, 22-FEB-1990.
FEATURES from to/span description
pept 1 1089 MHC DLA-A9/A9 alpha-chain precursor
sigp 1 72 MHC DLA-A9/A9 alpha-chain signal peptide
matp 73 1086 MHC DLA-A9/A9 alpha-chain
mRNA 1 1499 MHC DLA-A9/A9 alpha-chain mRNA
BASE COUNT 296 a 425 c 489 g 289 t
ORIGIN
1 atggaggtgg tgatgccgcg agccctcctc gtgctgctgt cggcggccct ggccctgacc
61 ccgacccggg cgggctccca ctccctgagg tatttctaca cctccgtgtc ccggcccggc
121 gcgggggacc cccgcttcat cgccgtcggc tacgtggacg acacgcagtt cgtgcggttc
181 gacagcgacg cggccactgg gaggatggag ccgcgggcgc cgtgggtgga gcaggagggg
241 ccggagtatt gggaccggca gacgcggacc atcaaggaga ccgcacggac tttccgagtg
301 gacctggaca ccctgcgcgg ctactacaac cagagcgagg ccgggtctca cacccgccag
361 accatgtacg gctgtgacct ggggcccgac gggcgccttc tccgcgggta cagtcaggac
421 gcctacgacg gcgccgatta catcgccctg aacgaggacc tgcgctcctg gaccgcggcg
481 gacacggcgg cgcagatcac ccagcgcaag tgggaagcgg caggtgtagc agagctacaa
541 tggaggaact acctggagac gacgtgcgtg gagtggctgc ggaggtacct ggagatgggg
601 aaggagacgc tgctgcgcgc agacccccca agcacacgtg tgacccacca ccccgtctct
661 gaccatgagg tcaccctgag gtgctgggcg ctgggcttct accctgcgga gatcaccctg
721 acctggcagc gggatgggga ggaccagacc caggacacag aggttgtgga cacaaggcct
781 gcaggagatg ggaccttcca gaagtgggcg gccgtggtgg tgccttctgg acaggagcag
841 agatacacgt gccacgtcca gcatgagggg ctgccggagc ctatcacgcg gagatgggag
901 ccttcccctc tgtccaccat tgtcatcgtc agcattgctg ctctggttct cctcgtggtc
961 gctggggtga ttggagctgt gatctggagg aagcagcgct caggaggaaa aggaccaggc
1021 tactctcatg ctgcacgtga tgacagtgcc cagggctctg atgtgtctct gacagctcct
1081 agagtgtgag accagctgcc tgtgggactg acggatgcaa gatgtgttca catctcacgt
1141 gatgacatca acaaccctgg cttgtctctg caaacagtgt caggatgtgc ctgtgtccct
1201 aggagcataa tgtgaggagg tggggagatt ggcccaccct gcccaccatg acctgtccct
1261 aatctgatgt gcgctctcct ctctgatgtg ctttcctgtc caggagaggc agggctggac
1321 catctccatc cctgtctttg tttcatgttg agtactaatc tcttactatc cgattgaaaa
1381 taagaatcca gatatgagtt tgtgtttcct gagtcttggg atgtggggct gatgaggtaa
1441 taaaaggaga tttgtgaagt tgagagagca aataaatgga agccctgaga accttccag
//
LOCUS DOGMHDRBA 1165 bp ss-mRNA MAM 03-AUG-1990
DEFINITION C.familiaris MHC class II DLA-DR-beta-1 gene, complete cds.
ACCESSION M29611
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE C.familiaris (dog #C624, haplotype DLA-Dw3/Dw3) leukocyte, cDNA to
mRNA, clone pBT-DR5.
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 1165)
AUTHORS Sarmiento,U.M. and Storb,R.
TITLE Nucleotide sequence of a dog DRB cDNA clone
JOURNAL Immunogenetics 31, 396-399 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by U.M.Sarmiento, 31-OCT-1989.
FEATURES from to/span description
pept 40 840 MHC DLA-DR-beta-1 precursor
sigp 40 126 MHC DLA-DR-beta-1 signal peptide
matp 127 837 MHC DLA-DR-beta-1
mRNA 1 1165 MHC DLAB1 mRNA
BASE COUNT 237 a 313 c 319 g 296 t
ORIGIN
1 cgactgctcc cctgcaccct gtcctttctg ttttccagca tggtgtgtct gtgttttctt
61 ggaggctcct ggatgacagc tctgatgctg atactgatgg tgctgaaccc tcccttcgct
121 tgggccaggg acaccccacc acatttcttg gaggtggcaa agtccgagtg ctatttcacc
181 aacgggacgg agcgggtgcg gttcgtggaa agatacatcc ataaccggga ggagttcgtg
241 cgcttcgaca gcgacgtggg ggagttccgg gcggtcacgg agctcgggcg gcccgtcgct
301 gagtcctgga acgggcagaa ggagatcttg gagcaggagc gggcaacggt ggacacctac
361 tgcagacaca actacggggt gattgagagc ttcacggtgc agcggcgagt cgagcctaca
421 gtgactgtgt atcctacgaa gactcagacc ttgcagcatc acaacctcct ggtctgctct
481 gtgaatggtt tctatccagg ccacattgaa gtcaggtggc tccggaatgg ccaggaagag
541 gaagctgggg ttgtgtccac aggcctaatc cgtaatggag actggacctt ccagatcctg
601 gtgatgctgg agatagttcc tcagagcgga gaggtctaca cctgccaagt ggagcaccca
661 agtttgacaa gccctgtcac cgtggaatgg agggcacagt ctgattctgc acagagcaag
721 atgctgagtg gaatcggggg ctttgtcctg ggtctgctct tccttgcagt ggggctgttc
781 atctacttca gaaatcagaa gggacactct ggacttcagc caacaggact cctgagctga
841 agtggagatg gtgaccctca agaaaaaacc ttcctttcca gcttcttctc agagtccaaa
901 ggtttccagc ttggctctta ttcttctaca aatacagcgc ctttctcagg atctggtttg
961 ccctggcttc agtgaccctg cacagcttgt gttccccaat ggctccctct gccgtgtcct
1021 tgacctggaa cccccagtgc tgactgcagt accttatctg cattctctcc ttcctcttcc
1081 tatgcagccc ttatgtttcc cttgcatggg aactcacctt ctgctcatgt ttctttataa
1141 acttttctca aataaatgtg gattg
//
LOCUS LACTHYSYN 957 bp ds-DNA BCT 03-AUG-1990
DEFINITION L.lactis thymidylate synthase (thyA) gene, complete cds.
ACCESSION M33770
KEYWORDS thymidylate synthase.
SOURCE L.lactis lactis (strain 712) DNA.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 957)
AUTHORS Ross,P., O'Gara,F. and Condon,S.
TITLE Cloning and characterization of the thymidylate synthase gene from
Lactococcus lactis subsp. lactis
JOURNAL Appl. Environ. Microbiol. 56, 2156-2163 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by P.R.Rose, 04-APR-1990.
FEATURES from to/span description
pept 87 926 thymidylate synthase (EC 2.1.1.45)
binding 77 80 ribosome binding site
site 28 33 -35 region
site 51 56 TATA box
BASE COUNT 329 a 139 c 192 g 297 t
ORIGIN
1 tctgagaggt tattttggga aatactattg aaccatatcg aggtgtgtgg tataatgaag
61 ggaattaaaa aagataggaa aatttcatga cttacgcaga tcaagttttt aaacaaaata
121 tccaaaatat cctagataat ggtgtttttt cagaaaatgc aagaccaaag tataaggatg
181 gtcaaatggc gaatagcaaa tatgtcactg gttcattcgt tacttatgat ttgcaaaagg
241 gggagtttcc aattaccact ttgcgtccaa ttccaatcaa atctgctatt aaagaattga
301 tgtggatata ccaagaccaa acaagtgaac tttctgttct cgaagagaag tatggagtca
361 aatactgggg agaatgggga attggtgatg gtacgattgg gcaacgttat ggtgcaacag
421 tcaaaaaata taatatcatt ggtaaattat tagaaggctt ggccaaaaat ccatggaatc
481 gtcgtaatat catcaacctt tggcagtatg aagattttga ggaaacagaa ggtcttttac
541 catgtgcttt ccaaacgatg tttgatgtcc gtcgagaaaa agatggtcag atttatttgg
601 atgccacact gattcaacgt tcaaacgata tgcttgtagc ccaccatatc aatgcgatgc
661 aatatgttgc tttgcaaatg atgattgcaa aacatttttc ttggaaagtt gggaaattct
721 tttattttgt aaataattta catatttatg ataatcagtt tgagcaggca aatgaattaa
781 tgaagcgaac agcttctgaa aaagaacctc gtttggtcct taatgttcct gatggtacaa
841 actttttcga tattaaacct gaagattttg aacttgtgga ctatgagcca gtaaaacctc
901 aattgaaatt tgatttagca atttaaatta atctataagt tactgacaaa actgtca
//
LOCUS MUSLY48A 121 bp ss-mRNA ROD 03-AUG-1990
DEFINITION Mouse leukosialin (Ly48) mRNA, partial cds.
ACCESSION M30693
KEYWORDS leukosialin.
SOURCE Mouse (strain C57BL/6J) adult thymus EL-4 tumor, cDNA to mRNA,
clone ML18.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Baecher,C.M., Dorfman,K.S., Mattei,M.G. and Frelinger,J.G.
TITLE cDNA cloning and localization of the mouse leukosialin gene (Ly48)
to chromosome 7
JOURNAL Immunogenetics 31, 307-314 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.G.Frelinger, 14-DEC-1989.
FEATURES from to/span description
pept < 3 > 119 leukosialin (Ly48; AA at 3)
BASE COUNT 28 a 28 c 43 g 22 t
ORIGIN Chromosome 7 put. 7F3/7F4 bands.
1 gccagggctc tttagtacta gaagagctga agcctgggtc gggtcccaac ctgaaggggg
61 aggaagagcc gcttgtgggc agtgaggatg aagctgtgga aaccccaact tctgacggtc
121 c
//
LOCUS MUSUBILP 2912 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse housekeeping DXS253E (P3) and DXS254E (GdX) genes, 3' end,
and complete cds.
ACCESSION J04761
KEYWORDS housekeeping protein.
SOURCE Mouse (strain Balb/c) liver DNA, clone lambda-M7P.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2912)
AUTHORS Filippi,M., Tribioli,C. and Toniolo,D.
TITLE Linkage and sequence conservation of the X linked genes DXS253E
(P3) and DXS254E (GdX) in mouse and man
JOURNAL Genomics 7, 453-457 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by D.Toniolo, 10-APR-1990.
FEATURES from to/span description
pept < 1 550 housekeeping protein DXS253E (P) (AA at 2)
pept 1536 1583 housekeeping protein DXS254E (GdX), exon 1
1740 1845 housekeeping protein DXS254E (GdX), exon 2
2089 2297 housekeeping protein DXS254E (GdX), exon 3
2413 2523 housekeeping protein DXS254E (GdX), exon 4
pre-msg < 1 801 DXS253E mRNA
pre-msg 1533 > 2912 DXS254E mRNA and introns
IVS 1584 1739 DXS254E intron A
IVS 1846 2088 DXS254E intron B
IVS 2298 2412 DXS254E intron C
BASE COUNT 649 a 822 c 765 g 676 t
ORIGIN X chromosome.
1 cactggtttc ctaccactct catcagccat ctacagctac cttctcagca tccatgaaac
61 actccacgtg cccatctcca agatattggg gaccctgctg tttatcgcca tccccatagc
121 agcgggtgtg gtgatcaaat ctaagctccc caagttctct gaactactgt tacaggtcat
181 caagcccttc agctttatac ttctcctggg tggcctgttc ctggcctacc acatgggggt
241 cttcatccta gtgggagtca ggttacccat tgtactggtg ggtttcacag tgcctcttgt
301 tggcctcttg gtgggctaca gcctggccat ctgcctgaag ctgccagtgg ctcagcgacg
361 aacagtcagc attgaggtag gggtgcaaaa cagcctgcta gccttagcca tgctacagct
421 gtctctgcgc cgccttcaag cagactatgc ctctcaggcc cctttcattg tggcactgag
481 tggtacctcc gagatgttgg ctttggttat tggccagttc atctatagca tcttgtttcc
541 tgttccctga gccctctgga tcaagtttta taacttccag cccccatact cctctcttcc
601 cacagttctt gtgtaccgta ccaaaggcct ttagttctca tgcactatgc actcaagaca
661 aatccaggct tattttttta ctccttttcg ccccagcttt cagtgccaaa gaggccatgc
721 tgagttagat agttgggtat tgcccagaaa tatatttcaa taaaaaaaaa ccaaaaaaac
781 aaaagcaagc tcgggtccct tccagtctgt cccttggagc ctgtggattg gggtaagaag
841 agtgtgtctg ggcggatatg ggccagaaca gtgaaccagc ttggtgaccc tgcacagaac
901 ccactttctc ctcctcactc tgttttgcac atggtagaca tgtacctttg agctgtatcc
961 tcagatctac aaaggctttg ctctaacagc cctacctaac agggcagacc tctgcactat
1021 aaggagtgaa gagagcacta cttccgagct tcagtcagga tatttttgca gccagactag
1081 aaaggaagta gaaagccatt aagctatgac aggcaggact ggaccatttc tgtatctgag
1141 agacaccacc acgtgtttgc attgacttgg tccaccagca gggtgggaat tagcacaggc
1201 cgggcctgga gtgaggctga cattctggtc cttcctttca agccacagct cccagactaa
1261 gcaagcctgg ccccacaggt acctcacttg gatcccagtg ggcggggcat ccccacgtcc
1321 ccgcgccatt aagcccaggt ggtggactct acccaaggca cggccggggc agaagcgaag
1381 gccgggaggc gggactgaag aatgcgcagg cgggagccac gagcggcgcg cccctttcac
1441 ggcttccggc ggcggcgggc ggttccgggg cggcggcgcg cggcagggga ccgttggtgt
1501 ttgcgttggc cgtagtggac tgggccgtgg acaccatgca gctgaccgtg aaggcgctcc
1561 agggccggga atgtagccta caggtaaagt caccccgcag gctgctgcac tcactcctgc
1621 cgctggggcc gagcgcgcgt gggggaggcc gggagagctg acctcaggag ggggtgcgag
1681 ggcttcaccg cccagggaat tcgggctgag cccttgcctc tccacctcgg ccggatcagg
1741 tggcggagga cgagctagtg tctacactga agcacctggt ctcggataag ctgaatgtcc
1801 ctgtgcgcca gcaacgtctg ctgttcaagg gcaaggccct agcaggtaat cagggaaagg
1861 agccatccag ggagctttac agaaagctgg gggacgggat gtacaggcca gtgtagtgga
1921 tcggggcact ggctgccact ggacctcaga aaagtagctg agccatcacc accccaccac
1981 cccatagcca gatgctgcag tgcttcactc tggctgtctg tggcaggatt gcttcccagc
2041 cacctccaca tcagtgagga aggaaccaca ccaatctctc tttggcagat gaaaaacgac
2101 tgtcagatta caacattggg cccaattcta agctcaacct agttgttaag cctttggaga
2161 aggtgctact ggaagaaggg tctgcccaca gactggtcga ctccccagcc acccccatct
2221 ggcagctgat ctccaaagtc ctggcccgtc acttcagtgt agcagatgcc agcagggtcc
2281 tggaacaact acagagggtg agaagagtaa tgctggccct ccattcttaa tgtttctacc
2341 cgtcccctac ctaatgcagc atacgccttc cccactggcc tctctgaata ccactcggtc
2401 ttctctcctc aggattatga caggtccttg agccgcctaa cactggatga catcgaacgt
2461 ttggccagcc gctttctaca ccctgaagtg actgaggcta tggaaaaagg gttctgcaaa
2521 tagcattctg ggattgtggg gagaaatccc aggtcaggcc acagctgcat gttgcattaa
2581 atgtgttctc atgtcgcagt ttggctcata gtgataataa tagctggtat gtatccagct
2641 cttgctaggt gctaggtact caaagtacat ggcctgagtt ctcttctgcc tcccacagca
2701 aagttgaggc tgcactgtag agacacctag caatggcttg ggacagacag ggtccaagtc
2761 tagccagcct atagaagtgc ctctgtagga gaaagagtgt gctgttttca gagatgggtc
2821 ttagtattcc catttagccc actcagcata caccctcaaa agactccctc aggaaaccac
2881 tagttcctgt cacctgccct ctggtgtgtc cc
//
LOCUS YEPLCREA 3316 bp ds-DNA BCT 03-AUG-1990
DEFINITION Virulence plasmid pYVO3 (from Y.enterocolitica) LcrE protein mRNA,
complete cds.
ACCESSION M32097
KEYWORDS .
SOURCE Virulence plasmid pYV03 (from Yersinia enterocolitica, strain
6471/76, serotype 0:3) DNA, clones pY[MS4146,MS4073,P6].
ORGANISM Plasmid pYV03
Prokaryota; Bacteria.
REFERENCE 1 (bases 2 to 1081)
AUTHORS Viitanen,A.-M., Toivanen,P. and Skurnik,M.
TITLE The lcrE gene is part of an operon in the lcr region of Yersinia
enterocolitica O:3
JOURNAL J. Bacteriol. 172, 3152-3162 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 3316)
AUTHORS Viitanen,A.-M., Toivanen,P. and Skurnik,M.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by M.Skurnik, 14-FEB-1990.
FEATURES from to/span description
pept 649 1530 lcrE product
ORF 1511 1789 ORF1
ORF 1776 2147 ORF2 (gtg start codon)
ORF 2509 2853 ORF4
ORF 2840 > 3315 ORF5
ORF 451 119 (c) ORF6
ORF 173 < 1 (c) ORF7
mRNA 1020 < 1 (c) lcrB locus mRNA
BASE COUNT 892 a 751 c 791 g 882 t
ORIGIN
1 aggatcccca aagaaagtgg tgtggtaata agttttcggc tcatcggagc tggggcatcc
61 tgataaactg ggtaccaagc cgccggttca gggagatgcc ccccatcgaa aggctgccct
121 aaaccatcca acacctgccc cagcagatgt tcacccaccc caacctgatg cattgtcctg
181 tcgggctaac ttcagtatta gaagatatcc cgtacatttc accaagtgga ataagtaatg
241 cttgatgttg ggcaaaacct atgacttcag cctgtaaaga caggctgttg tctgggttac
301 gtaagtaaca taactcaccg atgcgcacac caggcactac cgcttttaat agcgttcctg
361 tcacttgagt gacacgtcct ctaatttgga ttaggcggct acctacaatg ccatgacgaa
421 tatgatgagg tatctgatct agtgagagca taaatccata atggttgaaa tattaaccac
481 tattttagtg actaaaaacg ctaaaaaatt gtagcgggag ccgcgagttt ttagaaaaat
541 agccaagcag cactaaaatt tctcggctga ttttggcatc gataagcaag aactattttt
601 ataatcgcgg taattgcaat tataaaccgt tcatcttagg gagtagttat gacgacgctt
661 cataacctat cttatggcaa taccccgctg cgtaatgagc atccagagat tgccagtagt
721 cagatcgtaa atcagactct gggtcaattt cggggagaat ctgtgcagat agtcagcggc
781 actctgcagt ctatagctga tatggcagaa gaggtaacat ttgtcttctc cgagcgtaag
841 gagctctccc tcgacaaacg caaattaagt gacagccaag ctcgagttag cgacgttgag
901 gagcaggtta atcaatacct tagcaaagtt ccagagttga aacaaaaaca gaatgtgagt
961 gagctgctca gtctgttgag taacagcccc aatataagct tgtcccagtt aaaggcttat
1021 ctggagggga aatcagaaga accgagtgag caattcaaaa tgctctgcgg cttgcgtgat
1081 gccctgaaag ggcgccctga attagcacat cttttgcatt tggttgaaca agctctggtc
1141 agcatggttg aagagcaaga agaagccatt gtattgggtg ccagaataac cccggaagcg
1201 tacagagaat cccagtcggg tgttaatcca ctgcagccgc tccgtgatac ctaccgcgat
1261 gcagtgatgg gttatcaagg gattaatgcg atctggagtg atttacaaaa acgttttccc
1321 aatggggata tagactcggt gatattattc ctgcaaaagg cgcttagtgc agatctacaa
1381 agtcaacaaa gcgggtctga acgggaaaaa ttagaaatag ttattagtga cttacagaag
1441 ctaaaggagt ttcgtagcgt gagtgaccag gttaaaggat tttggcaact tttttcagag
1501 ggtataacta atggcttacg acctttctga gtttatggga gatattgtcg cactggttga
1561 caagcgctgg gcggggattc atgacattga acatcttgcc aacgcctttt cccttcctac
1621 gcctgaaatc aaagtgcgtt tctatcaaga tttaaaaaga atgtttcgtc ttttccctct
1681 gggggtattt agcgatgagg agcaacggca aaatttattg caaatgtgtc aaaatgcgat
1741 cgatatggct attgagagtg aagaggaaga attgagtgag ttggattgaa cccatcattt
1801 cccatttctg ccaggatctg ggagtgccaa catctagccc cctttcgcct cttattcaat
1861 tagagatggc tcaatctggc acgctgcaac tggaacaaca tggtgcgaca ctgacactgt
1921 ggttagcgcg ttctctcgcc tggcaccagt gcgaagatgc tatggtcaaa gcgctaacgc
1981 tcacggcggc ccaaaagagt ggcgctttac cgctgcgagc ggggtggtta ggggaaaatc
2041 aactggtgtt atttgtctcg cttgatgagc gttccttaac cttgcccctt ttacatcaag
2101 ctttcgaaca gttactgcga ttgcagcaag aggtgcttgc gccgtgagtc gcataataac
2161 tgccccccat attggcatcg aaaaactgtc ggcgattagc ctggaagagc tatcctgtgg
2221 cttgcctgat cgttatgcct tgccgcctga tgggcatcca gtcgaaccac atttagagcg
2281 cctttaccct acagcacaaa gcaagcgtag cctatgggac tttgcttctc ccggctatac
2341 atttcatgga ttacatcgag ctcaagatta tcggcgcgaa ctggatacct tgcagtcact
2401 gctaaccacc agtcagtcct cagagctaca agctgccgcg gcgctgctga aatgccaaca
2461 agatgatgat cggttactgc aaataatcct taacctgttg cacaaagtat gaatattact
2521 ttaaccaaac gacaacagga gttcttgctg ctcaacggtt ggttacaact acaatgtggc
2581 catgcagagc gcgcatgtat tctattggac gccttgctga cgttaaatcc tgagcattta
2641 gccggtcggc gttgccgatt agtcgcgcta cttaataata accagggaga acgtgccgaa
2701 aaagaagcgc aatggctaat atcacatgac cctttacagg ctggaaattg gctctgcttg
2761 agccgtgccc aacaactgaa cggcgatctt gataaggctc gccatgctta tcaacattat
2821 ttggagttga aagatcataa tgaatcccca tgatcttgag tggctaaatc gtattggcga
2881 gcgtaaagat atcatgctgg cagtgctgct gttagctgtg gtattcatga tggtcttacc
2941 actccccccc ctcgtgttgg acattctgat tgctgttaac atgaccatct cagtggtgtt
3001 gttaatgata gcgatctata tcaactctcc tttacaattt tcagctttcc cagcggtgct
3061 actcgttacc acgttatttc gtctcgcact ttcagttagc accacccgca tgatcctgct
3121 acaagctgat gcggggcaga ttgtttacac ctttggtaat ttcgtcgttg gcggtaacct
3181 catcgtcggg attgtcatct tcctgatcat cactattgtg caatttttag tgataacgaa
3241 aggctcagaa cgtgtagcag aagttagtgc cagattctct cttgatgcga tgccgggtaa
3301 acagatgagt atcgat
//
LOCUS FSOFARS1A 170 bp ds-DNA PLN 03-AUG-1990
DEFINITION F.oxysporum autonomous replicating sequence (ARS).
ACCESSION M29451
KEYWORDS autonomous replicating sequence.
SOURCE F.oxysporum lycopersici (strain 73) ARS DNA, clone pFOLT4R4.
ORGANISM Fusarium oxysporum
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE 1 (bases 1 to 170)
AUTHORS Powell,W.A. and Kistler,H.C.
TITLE In vivo rearrangement of foreign DNA by Fusarium oxysporum produces
linear self-replicating plasmids
JOURNAL J. Bacteriol. 172, 3163-3171 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.A.Powell, 27-OCT-1989.
FEATURES from to/span description
site 20 28 DNA homologous to yeast autonomously
replicating consensus sequence
site 26 31 telomere repeat unit
site 130 136 ligation point of telomeres -> ApaI site
BASE COUNT 35 a 18 c 66 g 51 t
ORIGIN 116 bp upstream of ApaI site.
1 catgccatcc gtaagatgct ttggtttagg gttagggtta gggttagggt tagggttagg
61 gttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg
121 gttagggtta gggccctaac cctaaaccaa agcatcttac ggatggcatg
//
LOCUS ARURRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION A.gigantea ribosomal RNA.
ACCESSION M37170 M35661
KEYWORDS ribosomal RNA.
SOURCE A.gigantea rRNA.
ORGANISM Arundinaria gigantea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 11 a 18 c 12 g 9 t
ORIGIN
1 acgagacctc agcctgctaa ctagctacgt ggaggcatcc ctccgcagct
//
LOCUS ASTRRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION A.sativa ribosomal RNA.
ACCESSION M37171 M35661
KEYWORDS ribosomal RNA.
SOURCE A.sativa rRNA.
ORGANISM Avena sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 11 a 19 c 11 g 9 t
ORIGIN
1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS BLYRRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION H.vulgare ribosomal RNA.
ACCESSION M37172 M35661
KEYWORDS ribosomal RNA.
SOURCE H.vulgare ribosomal RNA.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 11 a 19 c 11 g 9 t
ORIGIN
1 acgagacctc agcctgctaa ctagctacgt ggagccatcc ctccgcagct
//
LOCUS MUSELF2PS 2183 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse elongation factor 2 processed pseudogene, complete cds.
ACCESSION X15335 J04749
KEYWORDS elongation factor 2; pseudogene.
SOURCE Mouse cell line Ltk- DNA, clone cMER1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2183)
AUTHORS Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and
Uchida,T.
TITLE Amplification of a long sequence that includes a processed
pseudogene for elongation factor 2 in the mouse
JOURNAL Genomics 6, 80-88 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Koide, 31-AUG-1989.
FEATURES from to/span description
pept.ps 319 2038 pseudo-elongation factor 2
rpt 319 330 repeat copy A
rpt 2027 2038 repeat copy B
BASE COUNT 594 a 543 c 585 g 461 t
ORIGIN
1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca
61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct
121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaaatgta
181 gtttgaaaaa gggagcttgc ctgcctatca ttttagtgcc tgtgaggtgg agatgtgagg
241 atcagtaagt agttcaaggt tattcttggc tgtttagtga gtgttggcta tcctaggcta
301 cataagtccc tgtttcaaga aaacaaacca ggccactgta ggcgcgcttc cctgttcacc
361 tctgactctg agaatctgtc gccatccacc accatggtga acttcacagt agttcagatc
421 cgtgccatca cggacaggaa agccaacatt cagaacatgt cagtcatctc catgtggacc
481 acggcaagtc cacgctgaac gactcccttg tgtgcaaggc tggcatcatt gcctccgccg
541 gagccgggga gacacgtttc actgacatgc gcaaagatga gcaggagcgc tgcatcacaa
601 tcaagtccac cgccatctcc ctcttctacg agctctctaa gaacgacctg aaccgcatta
661 agcagagcaa ggatggctcg ggcttcctca tcaacctcat cgactctccc ggccatgtgg
721 acttctcttc agaggtgaca gctgccttgc gtgtcaccga tggagctctg gtggtggtgg
781 actgtgtgtc tggcgtgtgt gtgcagacag aaattgtgct gcgccaggcc atcgctgagc
841 gcatcaagcc cgtcctgatg atgaacaaga tggaccgggc ccctgctgga gctgcagctg
901 gaacccgagg agctctacca gaccttccag cgcatcgtgg aaaacgtcaa cgtcatcatc
961 tctaccgatg gcgagggtga gagtgggccc atgggcaata tcatgattga ccccgtcctg
1021 ggtgccgtag gctttggttc tggcctgcat ggctgggcct tcaccctgaa gcagtttgcg
1081 gagatgtacg tggccaagtt tgcagccaag ggcgagggcc agctgggtgc aactgagcgt
1141 gccaagaaag tagagggcat gatgaagaag ctgtggggag accagtactt tgatctgccc
1201 tcatggagcc tatctatctg gtggagatcc agtgtcctga gcaagtggtg ggtggcatct
1261 atggtgtcct gaacaggaag cgtggccata tgtcccaggt ggccggtacc cccatgtttg
1321 tggtcaaggc atacctgcct gtcaatgaat cctttggctt cactgctgat ctgcgatcca
1381 acactagcag ccaggccttc ccccagtgcg agtttgacca ctggcagatc ctgcctgggg
1441 atccttttgt caacagcagc cgccccagcc aagtggtagc tgagacctgc aagcacaagg
1501 gcctgaaaga gggctggaca acttcctgga caaactgtag gcagcctgat actgccacat
1561 gttgcacagt gcctgcccat cagaagacac cttgagactg tccccacagt gctcctctgg
1621 aggctgttgg ggccaccctg acatcactca gcactcactt ggctaccaat tctatttatt
1681 tcagaattac aaaatagcgg gaatctctct gcaggctgga ctggcaggcc gtggggtagg
1741 cggacatggc tcttaacatt ttcagaggga aacacgcaga tgtccaaaag tctaaataaa
1801 tgcattcaga ggtttatgag gtccatggcc aagtggagct cccccagagg gggaggtggg
1861 gtaagtacct ccaggaaggc aggcagcctg ccttagactt gcaacccggc tgtgcgaatc
1921 attggagtaa taaactacag tggttgatct gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
1981 aaaaaagaaa agaaaagaaa agaaagaaag aaagaaagaa agaaaggaaa acaaaccaaa
2041 agaaaaagag tttccatgat gtccaggatg atgcaggtgg gaagggctag ctcaggttag
2101 aagggctagc tcaggtctct cttaccctgc tctaactcag cctgcttctg gaggagagac
2161 aagtcaaaga gcagcaggag ctt
//
LOCUS MUSELF2PSA 2199 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse elongation factor 2 processed pseudogene, complete cds.
ACCESSION X15336 J04749
KEYWORDS elongation factor 2; pseudogene.
SOURCE Mouse cell line Ltk- DNA, clone cMER9.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2199)
AUTHORS Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and
Uchida,T.
TITLE Amplification of a long sequence that includes a processed
pseudogene for elongation factor 2 in the mouse
JOURNAL Genomics 6, 80-88 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Koide, 31-AUG-1989.
FEATURES from to/span description
pept.ps 324 2016 pseudo-elongation factor 2
rpt 324 335 repeat copy A
rpt 2005 2016 repeat copy B
BASE COUNT 602 a 546 c 590 g 461 t
ORIGIN
1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca
61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct
121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaacaaaa
181 caaaacaaaa atgtagtttg aaaaagggag cttgcctgcc tatcatttta gtgcctgtga
241 ggtggagatg tgaggatcag taagtagttc aaggttattc ttggctgttt agtgagtgtt
301 ggctatccta ggctacataa gtccctgttt caagaaaaca aaccaggcca ctgtaggcgc
361 gcttccctgt tcacctctga ctctgagaat ctgtcgccat ccaccaccat ggtgaacttc
421 acagtagttc agatccgtgc catcacggac aggaaagcca acattcagaa catgtcagtc
481 atctccatgt ggaccacggc aagtccacgc tgaacgactc ccttgtgtgc aaggctggca
541 tcattgcctc cgccggagcc ggggagacac gtttcactga catgcgcaaa gatgagcagg
601 agcgctgcat cacaatcaag tccaccgcca tctccctctt ctacgagctc tctaagaacg
661 acctgaaccg cattaagcag agcaaggatg gctcgggctt cctcatcaac ctcatcgact
721 ctcccggcca tgtggacttc tcttcagagg tgacagctgc cttgcgtgtc accgatggag
781 ctctggtggt ggtggactgt gtgtctggcg tgtgtgtgca gacagaaatt gtgctgcgcc
841 aggccatcgc tgagcgcatc aagcccgtcc tgatgatgaa caagatggac cgggcccctg
901 ctggagctgc agctggaacc cgaggagctc taccagacct tccagcgcat cgtggaaaac
961 gtcaacgtca tcatctctac cgatggcgag ggtgagagtg ggcccatggg caatatcatg
1021 attgaccccg tcctgggtgc cgtaggcttt ggttctggcc tgcatggctg ggccttcacc
1081 ctgaagcagt ttgcggagat gtacgtggcc aagtttgcag ccaagggcga gggccagctg
1141 ggtgcaactg agcgtgccaa gaaagtagag ggcatgatga agaagctgtg gggagaccag
1201 tactttgatc tgccctcatg gagcctatct atctggtgga gatccagtgt cctgagcaag
1261 tggtgggtgg catctatggt gtcctgaaca ggaagcgtgg ccatatgtcc caggtggccg
1321 gtacccccat gtttgtggtc aaggcatacc tgcctgtcaa tgaatccttt ggcttcactg
1381 ctgatctgcg atccaacact agcagccagg ccttccccca gtgcgagttt gaccactggc
1441 agatcctgcc tggggatcct tttgtcaaca gcagccgccc cagccaagtg gtagctgaga
1501 cctgcaagca caagggcctg aaagagggct ggacaacttc ctggacaaac tgtaggcagc
1561 ctgatactgc cacatgttgc acagtgcctg cccatcagaa gacaccttga gactgtcccc
1621 acagtgctcc tctggaggct gttggggcca ccctgacatc actcagcact cacttggcta
1681 ccaattctat ttatttcaga attacaaaat agcgggaatc tctctgcagg ctggactggc
1741 aggccgtggg gtaggcggac atggctctta acattttcag agggaaacac gcagatgtcc
1801 aaaagtctaa ataaatgcat tcagaggttt atgaggtcca tggccaagtg gagctccccc
1861 agagggggag gtggggtaag tacctccagg aaggcaggca gcctgcctta gacttgcaac
1921 ccggctgtgc gaatcattgg agtaataaac tacagtggtt gatctgaaaa aaaaaaaaaa
1981 gaaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa
2041 aagaaaacaa accaaaagaa aaagagtttc catgatgtcc aggatgatgc aggtgggaag
2101 ggctagctca ggttagaagg gctagctcag gtctctctta ccctgctcta actcagcctg
2161 cttctggagg agagacaagt caaagagcag caggagctt
//
LOCUS MUSELF2XX 2161 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse elongation factor 2 related sequence.
ACCESSION X15337 J04749
KEYWORDS elongation factor 2.
SOURCE Mouse cell line Ltk- DNA, clone MER7.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2161)
AUTHORS Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and
Uchida,T.
TITLE Amplification of a long sequence that includes a processed
pseudogene for elongation factor 2 in the mouse
JOURNAL Genomics 6, 80-88 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Koide, 31-AUG-1989.
FEATURES from to/span description
pept.ps 334 2054 elongation factor 2 related sequence
rpt 334 345 tandem repeat
rpt 2043 2054 tandem repeat
BASE COUNT 575 a 544 c 581 g 461 t
ORIGIN
1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca
61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct
121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaacaaaa
181 atgtagtttg aaaaagggag cttgcctgcc tatcatttta gtgcctgtga ggtggagatg
241 tgaggatcag taagtagttc aaggttattc ttggctgttt agtgagtgtt ggctatccta
301 ggctacataa gtccctgttt caagaaaaca aaccaggcca ctgtaggcgc gcttccctgt
361 tcacctctgg ctctgagaat ctgtcgccat ccaccaccat ggtgaacttc acagtagttc
421 agatccgtgc catcacggac aggaaagcca acattcagaa catgtcagtc atctccatgt
481 ggaccacggc aagtccacgc tgaacgactc ccttgtgtgc aaggctggca tcattgcctc
541 cgccggagcc ggggagacac gtttcactga catgcgcaaa gatgagcagg agcgctgcat
601 cacaatcaag tccaccgcca tctccctctt ctacgagctc tctaagaacg acctgaaccg
661 cattaagcag agcaaggatg gctcgggctt cctcatcaac ctcatcgact ctcccggcca
721 tgtggacttc tcttcagagg tgacagctgc cttgcgtgtc accgatggag ctctggtggt
781 ggtggactgt gtgtctggcg tgtgtgtgca gacagaaatt gtgctgcgcc aggccatcgc
841 tgagcacatc aagcccgtcc tgatgatgaa caagatggac cgggcccctg ctggagctgc
901 agctggaacc cgaggagctc taccagacct tccagcgcat cgtggaaaac gtcaacgtca
961 tcatctctac cgatggcgag ggtgagagtg ggcccatggg caatatcatg attgaccccg
1021 tcctgggtac cgtaggcttt ggttctggcc tgcatggctg ggccttcacc ctgaagcagt
1081 ttgcggagat gtacgtggcc aagtttgcag ccaagggcga gggccagctg ggtgcaactg
1141 agcgtgccaa gaaagtagag ggcatgatga agaagctgtg gggagaccag tactttgatc
1201 tgccctcatg gagcctatct atctggtgga gatccagtgt cctgagcaag tggtgggtgg
1261 catctatggt gtcctgaaca ggaagcgtgg ccatatgtcc caggtggccg gtacccccat
1321 gtttgtggtc aaggcatacc tgcctgtcaa tgaatccttt ggcttcactg ctgatctgcg
1381 atccaacact agcagccagg ccttccccca gtgcgagttt gaccactggc agatcctgcc
1441 tggggatcct tttgtcaaca gcagccgccc cagccaagtg gtagctgaga cctgcaagca
1501 caagggcctg aaagagggct ggacaacttc ctggacaaac tgtaggcagc ctgatactgc
1561 cacatgttgc acagtgcctg cccatcagaa gacaccttga gactgtcccc acagtgctcc
1621 tctggaggct gttggggcca ccctgacatc actcagcact cacttggcta ccaattctat
1681 ttatttcaga attacaaaat agcgggaatc tctctgcagg ctggactggc aggccgtggg
1741 gtaggcggac atggctctta acattttcag agggaaacac gcagatgtcc aaaagtctaa
1801 ataaatgcat tcagaggttt atgaggtcca tggccaagtg gagctccccc agagggggag
1861 gtggggtaag tacctccagg aaggcaggca gcctgcctta gacttgcaac ccggctgtgc
1921 gaatcattgg agtaataaac tacagtggtt gatctgaaaa aaaaaaaaaa aaaagaaaag
1981 aaagaaagaa agaaagaaag aaaagaaaac aaaccaaaag aaaaagagtt tccatgatgt
2041 ccaggatgat gcaggtggga agggctagct caggttagaa gggctagctc aggtctctct
2101 taccctgctc taactcagcc tgcttctgga ggagagacaa gtcaaagagc agcaggagct
2161 t
//
LOCUS MUSTHBS1 4915 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse thrombospondin (THBS1) gene, exons 1-7.
ACCESSION J05605
KEYWORDS thrombospondin.
SEGMENT 1 of 2
SOURCE Mouse (strain BALB/C) DNA, clone phi-42.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4915)
AUTHORS Bornstein,P., Alfi,D., Devarayalu,S., Framson,P. and Li,P.
TITLE Characterization of the mouse thrombospondin gene and evaluation of
the role of the first intron in human gene expression
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Bornstein, 28-JUN-1990.
FEATURES from to/span description
pept 1222 1288 thrombospondin, exon 2 (first expressed exon)
1580 2139 thrombospondin, exon 3
2947 3022 thrombospondin, exon 4
3267 3466 thrombospondin, exon 5
3561 3683 thrombospondin, exon 6
4439 + 4532 thrombospondin, exon 7
pre-msg 484 > 4915 thrombospondin mRNA and introns
IVS 667 1193 thrombospondin intron A
IVS 1289 1579 thrombospondin intron B
IVS 2140 2946 thrombospondin intron C
IVS 3023 3266 thrombospondin intron D
IVS 3467 3560 thrombospondin intron E
IVS 3684 4438 thrombospondin intron F
IVS 4533 > 4915 thrombospondin intron G
BASE COUNT 1188 a 1342 c 1100 g 1282 t 3 others
ORIGIN Chromosome 2, region F.
1 gggatccccc acccaccagg attgactcag gaggcagctt cctttaaggg gtccaacctt
61 ttgccctctt ttatttttaa accagactgt ctcgatcccc aagcgtctct gtcatggaat
121 tattcaagga gatgtgcttt aatgaaagcc tccctaaagg gtcttaggtg gtccccaaag
181 aagcatcgcg tctggggact ctcatgcaga acgtctccag ttcacatggc gcaagatcct
241 aagcgctaaa ggctgagtac gccaaggctg cgtgggcgga gacctatttt ttctgacaag
301 ttccaggggc tcctgtgcgg gatcggagtc tcccccttca ctttcagccc gagagctgtg
361 cgccaagcag cagggggcgg agaaatgaac ccgtccgtcc ccgtcccccg cccccgcccc
421 cagaaccctc ccccctccct gcttcctgcc tctttaaaag gtcctgggct cctcagtcaa
481 gccagccact gcctggagtc agccagcctc atcggacttc tgcaggcaat cgcgaagctg
541 ctatccagtt ctgccacggt ctctcccggc gcaccggcag tctcagcgtc ttcaccggac
601 tcagcgtcct tgtccttcac ttcacctttg ccacctctcc gggttactga gccccggtgc
661 acacaggtaa acctccgcct ctgcggagca cctggccggc cacctcaaag ccccaggctg
721 cagctgcaga tgatcagcgc ttctcctcat ctcctactct tccccttctg gttttcctaa
781 ctcctttatc ctgccccgtc tatcccctgg aaagaacttt ccggagctgg ttctcttcag
841 ccttcgggtt cacaaactat acctccaaga caccccaaga ttttctcccc agtagtctcc
901 aggttacccc tgactgtgcc atccaaaata cattctgtaa tctgtctaaa ggggcggggg
961 ggggagggac ggacccaagt cactttatac cctttatcct ggaactccaa caatccccct
1021 gcctctggcg ctctttttat actggatacc gtttggaaag taaacccttc tttttcagca
1081 catatggggc aaaagaagaa acacacacac acacacacac acacacacac acctctgccc
1141 ttccaccttg cttcctggtc tctgactctg gcctcctgtg cttcttgcta caggctccgt
1201 gttgggcaca aaggctccac catggagctc ctgcggggac taggtgtcct gttcctgttg
1261 catatgtgtg gaagcaaccg cattccaggt gagtttgtga tatatcattt tcctccccct
1321 tctcctctct ctctctctct ctttctctct ctctctctct gctgctgctc tctctctctc
1381 tctctctctc tctctctctc tctctcgaag ctcctgtgtt tccttggagt ccttatctcc
1441 tgataaggag gcagcttccc tcgctcagct ttccataagt gccattattg tcacgagttc
1501 ccaccttggc tcctcacagc cagaacttta cccccttata cctagcctag aaagccgatt
1561 gtcttctctc tccttacaga gtctggggga gataacggtg tgtttgacat ctttgaactc
1621 attggaggtg cacgaagggg ccccggtcgc cgactggtga agggccaaga tctatccagc
1681 cccgccttcc gcattgagaa tgccaacctg atccccgctg tgccggatga caagttccaa
1741 gacctactgg acgctgtgtg ggccgacaaa ggcttcatct tcttggcttc cttgaggcag
1801 atgaagaaga cccggggcac actcctggct gtggaacgga aagacaacac tggccagatc
1861 ttcagtgtgg tctccaacgg caaagctggc accctggacc tgagcctgag cctgccaggg
1921 aagcaacaag tggtgtcagt ggaggaagct ctcctggcca ctggccagtg gaagagcatc
1981 acgctgtttg ttcaagagga ccgggctcaa ctctacatag actgtgataa gatggagagc
2041 gcggagctgg atgtacccat ccagagcatc ttcaccaggg atctggccag cgttgccagg
2101 ctccgagttg caaagggaga tgtcaatgac aattttcagg taaatcttct tacctgtcac
2161 aagggctatg tggaattaat atcggtagac taaaaatgat tctcaaatgn taaagaggaa
2221 cagacacccc cagaaggttg gttaaatcat attacagatg acccaaaact ccagggnttt
2281 cagatttgat agggacccga gaaattttnt tcctaataag aaataagttg ctaatggtgc
2341 caggacctgg gcacccatgc tttcaaagct atcaggagag ataagaaatg gttttcctat
2401 tactacagcc tgtttcagaa tggggtactc cctgtgccct taggaaagta acttgtatca
2461 aatgtaccta gcagttttaa tagggcatga aaaggatact ttgtcagtat ttgagtaaat
2521 gggcagggtg ttctgggtgc tttcagtatg gatctaaggc aagacatgaa taacagtcca
2581 gtcatgtgcc tctgggactc cttacttgcc ttaactttca gatatttttt tttcttacaa
2641 gcgtggcttt gaacgcccac attaactctt tgtcactgga gagtatcatg tcctccagac
2701 ttgggatcaa attccatttc ctgtgtaccc caagcagcct gaaaactgga ttctttgtaa
2761 aatgctccag ttggcagaag acacaattaa gattttgaag ttgttcggca atttatgtct
2821 ctgtgatagt ttggcaggat gatcttaagt tcttttaaac tttctgagcg ttcaccacaa
2881 gcaaactcag actggccgcc ccacccccac gcccagcatt aaaccacgtc tgtcctctgc
2941 ccacaggggg tgctgcagaa tgtgaggttt gtctttggaa ccaccccaga agacattctc
3001 aggaacaaag gctgctccag ctgtgagtac ccctgtgttt tttaagattt ggagaattct
3061 ggggaagtcc accgaaagca ccctgattaa tgtaagcagt agggtgcata cacaatatct
3121 cagggggagc agagttggct tgcacccact ctgtcatgcc aaaggaaatg tcttttataa
3181 cccagaaccc gaagggtagg ctggtaggat cagctgttga aaccttgaat cagctatcga
3241 atacgttact tcttatttct tagcagctac caacgtcctt cttacccttg acaacaacgt
3301 ggtgaacggt tccagccctg ctatccgcac caactacatc ggccacaaaa caaaggacct
3361 ccaagctatc tgtggcctct cctgtgatga actatccagc atggtcctgg aactgaaggg
3421 cctgcgcacc atcgtgacca ctctgcagga cagcatccga aaagtggtca gtggcctccc
3481 ctcccatttg ccagctcttg gaaatcacca caggtggtcc cacgggagcc tggacactca
3541 tgagccttct cccctttcag acggaagaga acagagagct ggtcagtgag ctgaagcggc
3601 ctcccctctg ctttcacaat ggagtccagt acaagaacaa cgaggagtgg actgtagaca
3661 gttgcacaga gtgtcactgc caggtaagag acactcacag actgcaataa gacttggaga
3721 catatgtctc gggagatacg taaatgacta cacacatttt taaatgtgct ctggcctgct
3781 accaaagtga ttttttttct ttaatatgta atgacatctg caaattcctc tttaaatatc
3841 agaaggttta tcttaataat accccaggct gagcagctaa gagaacaaat ctatctaaac
3901 gttggaatga cttacctgct ttagttcaaa gaccaagtat tgagtgtcag atacataaaa
3961 cccgcttaaa gcattcggtt ttggccagcc ctaatttgta ttcagctctg gcaggcagat
4021 tcttctaaca ctgttcggat ctctggctca ccgggaaaag tttaaatgca aatcactggt
4081 tactgaattt gagacacaac ttcaattaag ttcccgtctc ccttcaatat gtgataaaaa
4141 cccttatcta cctttgagat gcttactgat acctgcaatg tccgttgctt tcccttatgc
4201 aaatgtgttg ttctgtaact tccacccctg actgagtagc agttaaaaag aactttccca
4261 aagaatggag tgagaattag ccctggaact gtatacctct ggggactaca gatccatgag
4321 acatccctcc tataaaagcc acccctcctg tgtcctggct tataatggca atgtcacaga
4381 aagctcctct ctgcaatacg cttggtaaca tcgttcctct ccttctcttg ccttgcagaa
4441 ctcggttacc atctgcaaaa aggtgtcctg tcccatcatg ccctgctcca acgccacagt
4501 tcctgatggt gaatgctgcc cacggtgctg gcgtaagttc ctaaaatggt gtagccatcc
4561 ctcagtggct agctcagtct gggggcatcc tgacagctgg gcacaacacc agagcccagc
4621 ccctcccagc tccatcagac gttccagtca gatctgggtt atggcatagg caatggatga
4681 tcttagcaga gcttatggat tctggtttta tgctataagg agagagatgg atacagcagt
4741 agacttaagc tggttgcttt tagcaattga tctttgttga aaaaaatgat ctactggaga
4801 acggacagaa actgtgtaag cattgggctg aggcttgaaa agccacctgt tttatctgca
4861 gtgagaggag gcctaaggct tagcatctgt tcattcaggc agtaaagaaa tcagc
//
LOCUS MUSTHBS12 1145 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse thrombospondin (THBS1) gene, exons 8 and 9.
ACCESSION J05606
KEYWORDS thrombospondin.
SEGMENT 2 of 2
SOURCE Mouse (strain BALB/C) DNA, clone phi-42.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1145)
AUTHORS Bornstein,P., Alfi,D., Devarayalu,S., Framson,P. and Li,P.
TITLE Characterization of the mouse thrombospondin gene and evaluation of
the role of the first intron in human gene expression
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by P.Bornstein, 28-JUN-1990.
FEATURES from to/span description
pept + 297 470 thrombospondin, exon 8
969 > 1145 thrombospondin, exon 9
pre-msg < 1 > 1145 thrombospondin mRNA and introns
IVS < 1 296 thrombospondin intron G
IVS 471 968 thrombospondin intron H
BASE COUNT 302 a 294 c 260 g 289 t
ORIGIN 820 bp after segment 1.
1 tgaccgtagt gctagattgc tgtgctgtga gagagatgtg tgatgctcca ttatcattat
61 cagtcagact cttgatagcc aagcgtctga gagtacataa atgtacacaa tctctagttg
121 ttaatcatgc ttccattaaa ctaagagtat gaaaaaagaa ggactttttt taaaaaaaga
181 aacctattgc tttaccacct ctggctataa agggccataa gcatataagt aaataaactg
241 cctaagaaca gtttatctta gacatgagat gctaacgacc tcatttctcc ccacagccag
301 cgactctgct gacgatggct ggtctccctg gtctgagtgg acctcctgct ctgccacatg
361 tggcaatgga attcagcaac gtggtcgttc ctgtgacagc ctcaacaaca gatgcgaggg
421 ctcttcggta cagacgagga cctgccacat tcaggagtgt gacaaaagat gtaagcatca
481 tggctgccaa gggtggcaat ctcccttgcc cagtgcttcc ctcacaaaga gcagctccgg
541 agcggggagc ctgagaatgc aggttctgag cctgcggctc tcagtctcaa tcacacctct
601 aacctttgca ctcctgggct tgaccttcct tccctgaggc tcctcctcat ctgtaacgag
661 agaacaataa ccattcctga taaagttact ttagccattt ctctagataa ttcatatgaa
721 acaatcagaa tggatctggg ctcctgacca cgaacctata aatattaggt tgttgacagc
781 aactcacaca cacagacaaa taccatcttc cctatagttt ctccactgaa gtgtggcagt
841 agccagtttc tacaatacac tagttgaagc tgcttttggt gggctgctgg aatgattgcc
901 tgctgagact tgctgtgtgg ccccctccaa ggcagcaccc attggtgaca atcaactttg
961 cccttcagtt aaacaggatg gtggctggag tcactggtct ccatggtcgt cctgttctgt
1021 gacctgtggt gacggtgtga tcacaaggat ccgtctctgc aactccccca gcccccagat
1081 gaacgggaag ccctgtgaag gtgaagcccg ggagaccaaa gcctgcaaga aagacgcctg
1141 cccaa
//
LOCUS MZERRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION Z.mays ribosomal RNA.
ACCESSION M37178 M35661
KEYWORDS ribosomal RNA.
SOURCE Z.mays rRNA.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 12 a 17 c 10 g 11 t
ORIGIN
1 acgagacctc agcctgctaa ctagctatac ggagccatcc ctccgtagtt
//
LOCUS PNURRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION P.nudum ribosomal RNA.
ACCESSION M37179 M35661
KEYWORDS ribosomal RNA.
SOURCE P.nudum rRNA.
ORGANISM Psilotum nudum
Eukaryota; Plantae; Thallobionta; Psilotophyta; Psilotopsida;
Psilotales; Psilotaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 11 a 16 c 12 g 11 t
ORIGIN
1 acgagacctc agcctgctaa ctagttacgc gaaggatcct cttcgtggcc
//
LOCUS RATRGK5 1883 bp ds-DNA ROD 03-AUG-1990
DEFINITION Rat voltage-gated potassium channel protein (RGK5) gene,
complete cds.
ACCESSION M30312
KEYWORDS voltage-gated potassium channel protein.
SOURCE Rat (Sprague Dawley) adult liver DNA, clone lambda RGK5.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1883)
AUTHORS Douglass,J., Osborne,P.B., Cai,Y.-C., Wilkinson,M.,
Christie,M.J. and Adelman,J.P.
TITLE Characterization and functional expression of a rat genomic DNA
clone encoding a lymphocyte potassium channel
JOURNAL J. Immunol. 144, 4841-4850 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.O.Douglass 01-DEC-1989.
FEATURES from to/span description
pept 35 1612 voltage-gated potassium channel protein
BASE COUNT 380 a 567 c 489 g 447 t
ORIGIN
1 ctgcagagcc aggctacgcg agctgccgcc agacatgacc gtggtgcccg gggaccacct
61 gctggagcca gaagcggcgg gaggcggcgg cggggacccg cctcagggag gctgtgtcag
121 tggcggcggc tgcgaccgct acgaaccgct gccgcccgcg ctgcccgccg cgggcgagca
181 ggattgctgc ggggagcgcg tggtcatcaa catctccggg ctgcgcttcg agacacagct
241 caagaccctc tgccagttcc ctgagacgct gctaggcgac cccaagcggc gcatgcgata
301 cttcgacccg ctccgcaatg agtacttctt cgaccgcaac agacccagct tcgacgccat
361 cctctactac taccagtccg ggggccgcat ccgccggccg gtcaacgtgc ccatcgacat
421 cttctccgag gagatccgct tctaccaact gggtgaggag gctatggaga agttccgtga
481 ggacgagggc ttcctgcggg aggaggagcg acccctgccc cgccgtgact tccagcgcca
541 ggtgtggctg ctcttcgaat accccgagag ctcgcggccg gcccggggca ttgccatcgt
601 gtcagtgctg gtcattctca tctccattgt catcttctgc ttggagacac tacccgagtt
661 tcgcgacgag aaggactatc ccgcctctcc gtcgcaggac gtgtttgagg ctgccaacaa
721 cagcacgtcg ggggcctcct ctggagcctc cagcttctcg gaccccttct tcgtagtgga
781 gaccctgtgc atcatctggt tctcctttga gctgctggtg cgattctttg cttgccccag
841 taaagccacc ttctccagaa atatcatgaa cctgatagac attgtagcca tcatccctta
901 ttttattact ctgggcactg agctggctga gcgacagggt aatgggcagc aggctatgtc
961 actggccatc ctgagggtca tccgcctagt aagggtcttc cgcatcttca agctctcccg
1021 ccattctaag gggctgcaga tcctgggaca gacactgaag gcttccatgc gagagctggg
1081 gctgctcatt ttcttccttt tcattggggt catccttttc tccagtgcag tctactttgc
1141 tgaggcagac gacccttctt cgggttttaa cagtatcccg gatgccttct ggtgggccgt
1201 ggtaaccatg acaactgtcg gttatggtga tatgcaccca gtgaccatag gaggcaagat
1261 tgtgggctct ctttgtgcca tcgcaggtgt cttgaccatt gcattgccgg ttcctgtgat
1321 tctttccaac ttcaattact tctaccaccg ggagacagaa ggggaagagc aagcccagta
1381 catgcacgtg ggaagctgcc agcacctctc ctcttcagca gaggagctcc gaaaagcccg
1441 gagtaactcc actctgagta agtcggagta tatggtgatc gaagaggggg gtatgaacca
1501 cagcgccttc ccccagaccc ccttcaaaac gggcaactcc actgccactt gcaccaccaa
1561 caataatccc aactcctgtg tcaacatcaa aaagatattc actgatgtct aatagatgat
1621 acgattgcca ttctgtgccc agtattgtgt ggaacatgcc cccttggtct gtgtatgccc
1681 ttgatttata catttccaga ccattcatca aggaaagtac atgaagaagt ggaaaagcac
1741 acttcattct ccctctccct attgcttcat actgaaacag gtgcctggtt ttgcaagtgg
1801 gctcattctc tcagctcttt ttctctctcc ctctctctct ccctgtttct taattttgtg
1861 aacaacaaac ttacattaag ctt
//
LOCUS RICRRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION O.sativa ribosomal RNA.
ACCESSION M37173 M35661
KEYWORDS ribosomal RNA.
SOURCE O.sativa rRNA.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 11 a 19 c 11 g 9 t
ORIGIN
1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS SAHRRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION S.officinarum ribosomal RNA.
ACCESSION M37174 M35661
KEYWORDS ribosomal RNA.
SOURCE S.officinarum rRNA.
ORGANISM Saccharum officinarum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 11 a 19 c 11 g 9 t
ORIGIN
1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS SRGRRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION S.bicolor ribosomal RNA.
ACCESSION M37175 M35661
KEYWORDS ribosomal RNA.
SOURCE S.bicolor rRNA.
ORGANISM Sorghum bicolor
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 11 a 19 c 11 g 9 t
ORIGIN
1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS TRDRRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION T.dactyloides ribosomal RNA.
ACCESSION M37176 M35661
KEYWORDS ribosomal RNA.
SOURCE T.dactyloides rRNA.
ORGANISM Tripsacum dactyloides
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 11 a 17 c 11 g 11 t
ORIGIN
1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgtagtt
//
LOCUS WHTRRUN 50 bp ss-rRNA RNA 03-AUG-1990
DEFINITION T.aestivum ribosomal RNA.
ACCESSION M37177 M35661
KEYWORDS ribosomal RNA.
SOURCE T.aestivum rRNA.
ORGANISM Triticum aestivum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 50)
AUTHORS Hamby,R.K. and Zimmer,E.A.
TITLE Ribosomal RNA sequences for inferring phylogeny within the grass
family (Poaceae)
JOURNAL Pl. Syst. Evol. 160, 29-37 (1988)
STANDARD simple staff_review
COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is
not specifically delineated in the article.
FEATURES from to/span description
rRNA < 1 > 50 ribosomal RNA
BASE COUNT 11 a 19 c 11 g 9 t
ORIGIN
1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS SUSFCK 3985 bp ss-mRNA INV 03-AUG-1990
DEFINITION S.purpuratus flagellar creatine kinase mRNA, complete cds.
ACCESSION M33763
KEYWORDS creatine kinase; flagellar creatine kinase.
SOURCE S.purpuratus sperm, cDNA to mRNA, clone WD2.2.
ORGANISM Strongylocentrotus purpuratus
Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa;
Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE 1 (bases 1 to 3985)
AUTHORS Wothe,D.D., Charbonneau,H. and Shapiro,B.M.
TITLE The phosphocreatine shuttle of sea urchin sperm: Flagellar creatine
kinase resulted from a gene triplication
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5203-5207 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.D.Wothe, 18-APR-1990.
FEATURES from to/span description
pept 95 3619 flagellar creatine kinase (E.C. 2.7.3.2)
signal 3961 3966 poly-A signal
BASE COUNT 1184 a 923 c 910 g 968 t
ORIGIN
1 ccctaaactt caatcacact tctaatcaag aaaggaacaa atttgtatta attatttcat
61 tttctacata tgcattgtaa tcagtgaccc agcaatggga tgcgcagcat cgtcacaaca
121 aacgacggcc acaggtggtc aaccagctgc tggtgaaaag gcaaacccgg cccccgctaa
181 taacaatcca aacgcagcaa acaaagcaga aacgacagga gcagctgaag aactcactaa
241 agaatctgaa ccatttgtgg aaccggatcc caattaccct gacctaagca agcacaacaa
301 ttacctggca gaatctttaa cgccttccat ctacaacaaa atctgtaacc ttcgcacact
361 ttctggttac tctgtggatg gttgcatgca gaccggcgtg gacaatccag gacatccttt
421 catcaagact gtcggattgg tggctgggga tgaagagtgt tacgatctct ttgctgacct
481 ctttgatcca actattgaca aaagacacaa cggatatccc cgcaatgcaa agcacacaac
541 tgatctcaac cctgatcatc tcaagggtgg tgatgacttt gatccaaagt atgtcctttc
601 atgcagagtt cgtacaggcc gctgcattag agggtacggc ctcccaccac attgcacgag
661 ggctgaacgt cgagacgttg agaaagtttg caaagacgca ctggcaactc tagatgggcc
721 actcaaaggc acgtattacc cactaacggg gatgacagag gaaatgcagg acaaactaat
781 tgcagatcat ttcctattcg ataagcccgt ttctcctttg cttatgtcag ctcgcatggc
841 ccgtgattgg cctgatggac gtgggatctg gcacaatgcc gataaaaact tcctggtttg
901 gataaatgag gaagatcaca ctcgggttat ttccatggaa acaagtggca acatgaagaa
961 tgtctttaaa cgattctgta atggtctcaa caaagtcgag aatgctctca aagcaaaagg
1021 gtatgaattc tcatggaatg agcatcttgg gtatgtgctc acttgtccat ccaaccttgg
1081 tacaggtgta cgtgcaggag tccacatcaa gattcctctt ttcagcaagc acgctggatt
1141 tgaatccatt ctgaagcatt atcgtctcca gaaacgtggc acaggaggcg tggatacggc
1201 atccactgac ggcacgttcg acatttcaaa cctagatcgt cttggcactt ctgaagtaca
1261 acaggttcaa tccgttgttg atggtgtcaa gaagctgatt gagttggaaa aagccttaga
1321 aaagggatca gatatcagcg gccagattcc ccgggaccca gcaatagttc gtgcagaaca
1381 agtgaaagaa ggatatccag acctatccaa acacaacaat catctggccc actgtctgac
1441 atacgacatt tggaaaagtc tcaaggacaa aaagacgcca agtggcttta ctctggatgg
1501 ctgcatccaa acgggagtca tgaaccctgg tcatcctcac atcatgaccg ttggaatggt
1561 tgcaggagat gaggaatcat atgacgtctt cgctgatatc tttgacccgg tcattgatgc
1621 acgacacgga ggttatccca aagatgccgt ccatgtcacc aacataaacc atgcagatct
1681 taaaggagga gataacctgg atcccaaata tgttttgtct tgccgtgttc gcacaggacg
1741 cagcatcatc gggtattctc tgcccccgca ttgcaccgtg gaagagagag ctgccgtaga
1801 gacgatcacg atcggagctc ttgataaatt tgatggagac ttgcaaggaa aatactaccc
1861 actggaagga atgtctgatg aaacacaaac acagctgata gatgaccatt tcctcttcga
1921 taagcctgtt tctcctcttc ttaccgccgc aagaatgcat cgtgattggc cacagggaag
1981 aggaatttgg cataacgaaa acaagaactt cttggtgtgg gtcaatgaag aagatcatat
2041 ccgtgtcatc tctatggaga aggatgggaa tatgcgagcg gtcttcaagc gattctgtga
2101 aggtttgcag aagttcgagc agatgatcaa aaaggatggc aaggagttca tgtggaacaa
2161 acatctgggt tacgtactta cttgtccgtc taacctggga accggcctac gcgctggcgt
2221 acatgtgaaa ctgcctctgc tgagcaagta tccgagattc gaccagatcc taagagccct
2281 ccgacttcag aagagaggta caggtggagt agatactgcc tcaactgatg gaaccttcga
2341 catctcgaac ctcgacaggc ttggatcatc agaagtacag caagttcagt ttgttgtcga
2401 tggcgttgaa cttctcgtcc aaatggagaa gaaacttgaa aagggcgagg acatttttga
2461 tattctacca cagcaatgcc gtcccaaacc accaataaag cctttcagtt atgactaccc
2521 cgatttctct ctgcacaaca actggatgtc taaatgtatg acagaagaaa tctacaacaa
2581 actttgcaac ttgaagacaa aaggtggagt caccctgaat gactgcattc aaactggcat
2641 tgacaatcct ggacatccat acatcatgac tgttggtttg gtagctggag atgaagagtg
2701 ctacgaggtt ttcgctcctc tcttcgaccc ggtgataagt gccaggcacg gcggctacgc
2761 tcttgatgcc aaacacccta ccaacctcaa tgcagcagaa ctcaaaggcg gagatgattt
2821 ggaccctgag ttcgtcctat cttgtcgtgt acgtactggt cgttgcatcc gcggcttggc
2881 tctcccgcca tgttgcacca gagcagaacg tgcggaagta gagaagatta caacagaagc
2941 attatctacc ctaagtggac ctctgaaggg aaagtactac ccactaaccg gtatgacaga
3001 cgaagagcaa gagaagctca tcgaagacca cttcctcttt gacaagcccg tatccccgct
3061 gttgctttgt gccaatatgg cacgtgactg gccccagggt cgtggcatat ggcacaatga
3121 tgagaagaat ttccttgtct gggtcaacga ggaagatcac actcgagtca tctccatgga
3181 gaagagcggg aacatgaagc gagtgtttga gaggttctgc gacggtctca aaaaggtcga
3241 agactccatc aagtccaagg gttaccagtt catgtggaat gagcatcttg gttatgtact
3301 gacatgtcca tccaacctcg ggacgggact gcgtgcaggc gtccatgtca aggttccttt
3361 gttaagtcaa cagaaaatct ttgactctat ccttgatcac atgcgtctgc aaaagcgagg
3421 aacaggaggt gtagacacag cctctactga cggcaccttt gatatctcca actcggatcg
3481 catcggtttc tccgaagtcc atcttgtcca acaactcgta gacggtgtca agctcctcgt
3541 caacctcgaa aaggctctca tgaagggaga agacatcaac agtctcctgc cagagaagct
3601 aagggaagat tcctcataaa gccaggatcc acagcttgca gaatagtggc cgttgtcgcg
3661 caagtgttat aaatgccacc tttagaagat gtagcctaca cattaaaaaa aaatacaatt
3721 agaattgaaa tctataagga cgatttgaat ataccataaa gatttgccca tgattttgaa
3781 tcaaaataac ttcatgctga acttgtaaat gaaaaagtaa tgttagaaaa ctctgtaaat
3841 tgggaagata taaaatttca tggccgtttt tctttctgat gaagtacact aattatatta
3901 atttctctta ttaatatgta tggaaacttt taattatgta aaaacaagga aacgaactga
3961 aatacaattc aaactacttt tcatt
//
LOCUS BSUSACP 3518 bp ds-DNA BCT 03-AUG-1990
DEFINITION B.subtilis sacT gene, complete cds, enzyme II (sacP) gene (of the
sucrose phosphotransferase system), complete cds, and sucrase gene,
5' end.
ACCESSION J03006 M33761
KEYWORDS enzyme II; sucrase; sucrose transferase system.
SOURCE B.subtilis (strain Marburg 168) DNA, clone pBSG8-10.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1987 to 3518)
AUTHORS Fouet,A., Arnaud,M., Klier,A. and Rapoport,G.
TITLE Bacillus subtilis sucrose-specific enzyme II of the
phosphotransferase system: Expression in Escherichia coli and
homology to enzymes II from enteric bacteria enzymes II from
enteric bacteria
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 8773-8777 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2070)
AUTHORS Debarbouille,M., Arnaud,M., Fouet,A., Klier,A. and Rapoport,G.
TITLE The sacT gene regulating the sacPA operon in Bacillus subtilis
shares strong homology with transcriptional antiterminators
JOURNAL J. Bacteriol. 172, 3966-3973 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed copy of sequence for [1] kindly provided by
A.Fouet, 16-SEP-1987. Draft entry and computer-readable sequence
for [2] kindly submitted by M.Debarbouille 18-APR-1990.
FEATURES from to/span description
pept 136 897 sacT protein
pept 2008 3390 enzyme II (sacP)
pept 3387 > 3518 sucrase
BASE COUNT 1013 a 712 c 867 g 926 t
ORIGIN
1 ctttacatgg cggtcaaaag gaggtaaaat cccagtcaaa agcaaattgg cgggagagat
61 aacctcttga aaatctataa agtattaaac aacaatgcgg ctttaataaa agaggatgat
121 caggaaaaaa tcgtgatggg accgggaatc gcttttcaaa aaaagaaaaa tgatctcatc
181 cctatgaata aagtggaaaa gatctttgtc gtgcgcgacg agaatgaaaa gttcaaacaa
241 atcctgcaaa cactgccgga ggagcatata gaaatagctg aggatatcat cagctatgcg
301 gaaggagagc tcgcggcgcc gctgagcgat cacattcata tcgcgctttc cgaccatttg
361 tcctttgcga tcgaaaggat tcaaaatggg ctgcttgtgc agaataaatt gctgcatgag
421 ataaaggcgc tctataaaaa agaatatgag atcggcctgt gggcgatcgg acatgtaaaa
481 gagacattgg gcgtgtctct gcctgaagat gaagcgggtt atattgccct tcacatccat
541 acggcgaaga tggatgcgga gagcatgtat tcagcgctga agcatacgac catgatcaaa
601 gaaatgatag agaaaataaa acaatacttt aatcgaaagg tggatgaaaa cagcatttcc
661 tatcaacgcc ttgtcacgca tttgcgatac gcggtcagcc ggttggaatc aaatgaagcg
721 ctccatcgta tggacgagga gatgctttat ttcatccaaa aaaagtattc attcgcctat
781 caatgtgcgc tggagctggc ggagtttttg aaaaatgaat atcaattaca tttgccggaa
841 tccgaggccg gctatatcac gctgcatgtc cagcgtcttc aagatctctc ggaataaccg
901 ctttgacttg cagggagtga tctctggaag tttttttatt gatcaggatt cagtgtgtag
961 tacatccagc ccatcataac tattcccgcg gtcagattgc cgagcgtaac ggggattaag
1021 tttctgactg ctcccatcag tgtcactgta tcagggtgct cgatcaaaag cgagatggcg
1081 aatgtgcaca tattggcaat gctgtgttca aagccggaaa taaagaagca gaaaacgaaa
1141 agcatcatgg taaaaagctt tgctccttcc cctttgagag acattggaat gaaaaaggcg
1201 aggcacacaa gccaattgca cagcattcct ctgaaaaaca attcggaagc cggcggctcc
1261 attttgtgct ctgccaaatg aatcaaaaag gaatgaacag aaggctcctc aaaaagtccc
1321 gtcgcgctga tcaggatggc aaacagaatg gcgccgatta aattgccggc atagcttgac
1381 atccacaagt atagggtgtc gcgccagctg atttttttcc ggagcgcggt ataggtgaaa
1441 taaaaggtgt tgccggtaaa taaatctccg cctccgtaag caatcatcag aatagcggcc
1501 ccgaaagtga cagcggctgc cggaaaggca aacggagaat cagccataaa gaaatagctg
1561 cctgttttgc ttgcggctgt aatgccaaaa ccaataaata tactggccaa aatggaccga
1621 agcacataac ggattttgct tgaagcgaat atgttttgtt ttttcaaagc atactgttca
1681 accttttgta atgcttgagt ttccattgtc ttttcctgct ttctttcatg attgagcata
1741 gtatggcacg atattgtgaa gtatttcaca attaattttt acgaattttt gtctacatgt
1801 tatatcaaac tatttaacta aaaaaatagt tgacgaaaac gctatcatga tttatgatga
1861 aagcgtattc ttaatctgaa taagcgggat tgtgactggt aaagcaggca agacctaaaa
1921 tttgcgtaaa tgaaaaagga tcgctgtgtc ctttattcgt tggcgaattt taggtctttt
1981 ttgtttaaat aaaaggggga gaaatagatg gattacaaag agactgcaaa acgcctcatt
2041 gagcttctcg gagggaaaga aaatattatc agcgcggctc attgtgcaac aagactgcgt
2101 ttagtgatga aagatgaatc aaagatagat caagcacaag tagaagagct tgatggcgtt
2161 aaaggggctt tcagcagctc tggccagtac cagatcattt tcggaacagg ccttgtgaat
2221 aaagtattcg atgccttttc taaagaagct gatatcgagc gtgaagaaca cgtcaatcat
2281 caggatgcgg caaaagaaaa gctgaatccc gctgcgagat ttgcgaaaac gctttctaat
2341 atttttgttc caatcatccc agctattgta gccagcggcc tattaatggg attgctgggt
2401 atgatcaatg cgtttcattg gatgagcaag gattctgcgc ttttgcagct gcttgatatg
2461 ttttcaagtg cagcattcat tttcttgccg attctaatcg gggtcagcgc ttcaaaagag
2521 tttggcagca atccatactt gggagcggtc atcgggggaa tcatgattca tccgaatctt
2581 ttaaatccat ggggattggc ggaagaacaa ctgattacat gcatcttttc ggatttgata
2641 tcgctcttct cggctacagg gaactgttat ccctgtcctg cttgcggtgt atgtgatgag
2701 caaggtagag aaatggacga gaaaagtggt tccacatgcg gtgatttgct tgttacacca
2761 tttgttactg tgattgtcac cgggtttgtt gcttttatcg cgattggccc tctaggcaga
2821 gcgcttggct ccggcattac ggttgcttta acttatgtat atgaccatgc cggttttgtc
2881 gcaggtctga ttttcggggg cacgtattca ctcatcgtgc tgacaggcgt tcatcacagc
2941 ttccatgcga ttgaggcagg gctaattgca gatataggca aaaactactt gctgccgatc
3001 tggtcaatgg cgaatgtcgc acagggcggg gcaggtcttg ccgtcttctt tatggcgaaa
3061 aaggcaaaga caaaagaaat cgcgcttccg gcagcatttt ccgcttttct cggcattact
3121 gagcccgtca tattcggagt caatctgcgc taccgaaaac cgtttatcgc cgcgatgatc
3181 ggaggcgcat tgggtggggc atatgtcgtc tttacacatg tagctgcaaa cgcttacggg
3241 ttaacgggaa ttccgatgat cgccattgcg gctccgtttg gcttcagcaa cttgattcac
3301 tatctgattg gcatggcgat tgcggcagtg tcagcgttca ttgcagcatt tgtcatgaag
3361 attaatgagg atgaggagag gaaaaaatga cagcacatga ccaggagctt cgtcgccggg
3421 cttatgaaga agtggagaaa aaagagccca tcgctaacag cgatccgcac cgccagcatt
3481 ttcatatcat gccgccggtt gggctgctga atgacccg
//
LOCUS BOVACTH 1084 bp ss-mRNA MAM 03-AUG-1990
DEFINITION Bovine corticotropin-beta-lipotropin mRNA, complete cds.
ACCESSION M25587
KEYWORDS adrenocorticotropic hormone; corticotropin-beta-lipotropin;
lipotropin; melanocortin.
SOURCE Bovine, cDNA to mRNA, clone pSNAC20.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1084)
AUTHORS Cohen,S.N., Chang,A.C.Y., Nakanishi,S., Inoue,A., Kita,T.,
Nakamura,M. and Numa,S.
TITLE Studies of cloned DNA encoding the structure for the bovine
corticotropin-beta-lipotropin precursor protein
JOURNAL Ann. N.Y. Acad. Sci. 343, 415-425 (1980)
STANDARD simple staff_review
FEATURES from to/span description
pept 114 911 corticotropin-beta-lipoprotein precursor
sigp 114 191 corticotropin-beta-lipoprotein signal peptide
matp 342 377 gamma-melanocortin (gamma-MSH)
matp 507 548 adrenocorticotropic hormone (ACTH)
matp 630 908 beta-lipotropin
matp 756 809 beta-melanocortin (beta-MSH)
matp 816 908 beta-endorphin
mRNA < 1 1084 corticotropin-beta-lipoprotein mRNA
BASE COUNT 220 a 337 c 370 g 157 t
ORIGIN
1 aagagaacga agggaagaaa agtgaccgag acgccgctga acatcctcgc cccggcgcag
61 cgggagccgc ccgagccagc ttccccgtga cagagcctca gcctgcctgg aagatgccga
121 gactgtgcag cagtcgttcg gccgccctgc tgctggcctt gctgcttcag gcctccatgg
181 aagtgcgtgg ttggtgcctg gagagcagcc agtgtcagga cctcaccacg gaaagtaacc
241 tgctggcgtg catccgggcc tgcaagcccg acctctccgc cgagacgccg gtgttccccg
301 gcaacggcga tgagcagccg ctgactgaga acccccggaa gtacgtcatg ggccatttcc
361 gctgggaccg cttcggccgt cggaatggta gcagcagcag cggagttggg ggcgcggccc
421 agaagcgcga ggaggaagtg gcggtgggcg aaggccccgg gccccgcggc gatgacgccg
481 agacgggtcc gcgcgaggac aagcgttctt actccatgga acacttcccc tggggcaagc
541 cggtgggcaa gaagcggcgc ccggtgaagg tgtaccccaa cggcgccgag gacgagtcgg
601 cccaggcctt tcccctcgaa ttcaagaggg agctgaccgg ggagaggctc gagcaggcgc
661 gcggccccga ggcccaggct gagagtgcgg ccgcccggcc tgagctggag tatggcctgg
721 tggcggaggc ggaggctgag gcggccgaga agaaggactc ggggccctat aagatggaac
781 acttccgctg gggcagcccg cccaaggaca agcgctacgg cgggttcatg acctccgaga
841 agagccaaac gccccttgtc acgctgttca aaaacgccat catcaagaac gcccacaaga
901 agggccagtg agggcgcagc gggcaggggc ctctctccgc ggaaagttga ccctgaaggc
961 ctctcttctg ccctcctacc gcctcgcagc ctgggtgagg attcgcccag gcagtgatgg
1021 cgccaggtat cccgactctt aaagctgtct gtagttaaga aataaaacct ttcaagtttc
1081 acga
//
LOCUS ECOK88ACF 861 bp ds-DNA BCT 03-AUG-1990
DEFINITION E.coli K88ac fimbriae subunit gene, 5' end.
ACCESSION M35954
KEYWORDS K88ac fimbriae subunit; antigen.
SOURCE E.coli (K12, strain MC1000) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 861)
AUTHORS Josephsen,J., Hansen,F., de Graaf,F.K. and Gaastra,W.
TITLE The nucleotide sequence of the protein subunit of the K88ac
fimbriae of porcine enterotoxigenic Escherichia coli
JOURNAL FEMS Microbiol. Lett. 25, 301-306 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 13 > 861 K88ac fimbriae subunit precursor
sigp 13 75 K88ac fimbriae subunit signal peptide
matp 76 > 861 K88ac fimbriae subunit
BASE COUNT 227 a 154 c 230 g 250 t
ORIGIN
1 gggtttaatc ttatgaaaaa gactctgatt gcactggcaa ttgctgcatc tgctgcatct
61 ggtatggcac atgcctggat gactggtgat ttcaatggtt cggtcgatat cggtggtagt
121 atcactgcag atgattatcg tcagaaatgg gaatggaaag ttggtacagg tcttaatgga
181 tttggtaatg tattgaatga cctgaccaat ggtggaacca aactgaccat tactgttact
241 ggtaataagc caattttgtt aggccgaacc aaagaagcat ttgctacgcc agtaactggt
301 ggtgtagatg gaattcctca tattgcattt actgactatg aaggagcttc tgtagtactc
361 agaaaccctg atggtgaaac taataaaaaa ggtttagcat attttgttct gccgatgaaa
421 aatgcagagg gcactaaagt tggttcagtg aaagtgaatg catcttatgc cggtgtgtta
481 gggagaggtg gggttacttc tgcggacggg gagctgcttt cgctttttgc cgacgggttg
541 agcgctatct tttatggtgg tttgccgagg ggttctgaat cgtcggctgg gagtgccgca
601 gcggcgcgca caaagttgtt tggaagtcta tcaagaaatg atattctcgg acagattcaa
661 agagtaaacg caaatattac ttctcttgtt gacgtcgcag gttcttacag ggaaaacatg
721 gagtacactg atggaactgt tgtttctgct gcctatgcac tgggtattgc aaacggtcag
781 actattgagg caacttttaa tcaggctgta actaccagca ctcagtggag cgctccgctg
841 aacgtagcaa taacttatta c
//
LOCUS LBARNII 400 bp ds-DNA BCT 03-AUG-1990
DEFINITION L.acidophilus RNA polymerase (RNAI and RNAIIa) genes, 5' promoter
region.
ACCESSION M35739
KEYWORDS RNA polymerase.
SOURCE L.acidophilus DNA, clone pRNL5.
ORGANISM Lactobacillus acidophilus
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 400)
AUTHORS Natori,Y., Kano,Y. and Imamoto,F.
TITLE Characterization and promoter selectivity of Lactobacillus
acidophilus RNA polymerase
JOURNAL Biochimie 70, 1765-1774 (1988)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 143 > 400 RNAI mRNA
mRNA 209 < 1 (c) RNAIIa mRNA
BASE COUNT 149 a 69 c 67 g 115 t
ORIGIN
1 tgcgcggggt tcgagctacg gccaaaagaa ccccgcgttg gaacatctgg cacaaacttt
61 ttctcatttt taactttttt ttaatttaat ttagacaaaa aaagaagcct taactggctt
121 cgatctgtgg tataattttt ttaaagtaaa caaaaaaagt gctaactaaa acaacaacaa
181 cagcactttt tcaatcaact tttcaacttc gaatttatta tagcataaaa ctagtaaaaa
241 taaatacaaa aaaaaattgc ctgtattatc ttcgaagctg ggaagttgtc ccactatcgg
301 aggttttttt tatgggaaaa atcgtaactt taccacaagg ataggctagt cagcctaagt
361 ctaaaaagaa aaaaagcaag aagagttcag gcagtgcaag
//
LOCUS LEIKPGRA 44 bp ss-RNA RNA 03-AUG-1990
DEFINITION L.tarentolae kinetoplast minicircle guide RNA (COIII gRNA-I).
ACCESSION M33870
KEYWORDS guide RNA.
SOURCE L.tarentolae (strain UC) kinetoplast minicircle D-12-1, guide RNA.
ORGANISM Kinetoplast Leishmania tarentolae
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Leishmania tarentolae.
REFERENCE 1 (bases 1 to 44)
AUTHORS Sturm,N. and Simpson,L.
TITLE Kinetoplast DNA minicircles encode guide RNAs for editing of
cytochrome oxidase subunit III mRNA
JOURNAL Cell 61, 879-884 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Simpson 13-APR-1990.
BASE COUNT 17 a 7 c 9 g 11 t
ORIGIN
1 ctatctttac gaggtaaaga cagagagatg aaaacactat tcgt
//
LOCUS LEIKPGRB 46 bp ss-RNA RNA 03-AUG-1990
DEFINITION L.tarentolae mitochondrial minicircle guide RNA (4E-2 gRNA).
ACCESSION M33871
KEYWORDS guide RNA.
SOURCE L.tarentolae mitochondrial minicircle Lt154, guide RNA.
ORGANISM Mitochondrion Leishmania tarentolae
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Leishmania tarentolae.
REFERENCE 1 (bases 1 to 46)
AUTHORS Sturm,N. and Simpson,L.
TITLE Kinetoplast DNA minicircles encode guide RNAs for editing of
cytochrome oxidase subunit III mRNA
JOURNAL Cell 61, 879-884 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Simpson 13-APR-1990.
BASE COUNT 27 a 5 c 5 g 9 t
ORIGIN
1 ataaaacaca acaaaaaaca tagagaaatc atagagtgtt aaatat
//
LOCUS MUSIFNRES 508 bp ds-DNA ROD 03-AUG-1990
DEFINITION Mouse MHC class II-associated invariant chain gene, exon 1.
ACCESSION M35872
KEYWORDS MHC class II-associated invariant chain.
SOURCE Mouse DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 508)
AUTHORS Eades,A.-M., Litfin,M. and Rahmsdorf,H.J.
TITLE The IFN-gamma response of the murine invariant chain gene is
mediated by a complex enhancer that includes several MHC class II
consensus elements
JOURNAL J. Immunol. 144, 4399-4409 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 432 > 508 invariant chain protein, exon 1
mRNA 348 > 508 invariant chain protein mRNA
site 87 158 IFN-gamma response element
signal 267 272 SP1 binding site
signal 255 258 CAAT box
signal 315 319 TATA box
BASE COUNT 133 a 131 c 140 g 104 t
ORIGIN
1 gggtgggggt gggtagacaa aggcactttc tgcttcaaaa ataattcctt ccctggggaa
61 agtgcctgct tctagaaccg tccaaagagc ctgatgaatc cagaagtctg cctagaaaca
121 agtgatgata gccctggcca gccaatggga tcatgcaggc ctttctacct gtttagggaa
181 ctcccccttc atcctgccca gggaggcagc tttgagtgag tggggaattt ccagatttgt
241 ggctttcagt tccacatcta ccatgtgggc ggagtgacct gctgtgggcg aatcagattc
301 cttccagtat cagctttaag aggtgatctt ggggctcaag ggtcccagac acacagcagc
361 agcagcagca gcagcagcag caacagcagc agcagcagca gcgcctgtgg gaaaaaatag
421 aggctagagc catggatgac caacgcgacc tcatctctaa ccatgaacag ttgcccatac
481 tgggcaaccg ccctagagag ccagaaag
//
LOCUS SOYBCONGL 794 bp ss-mRNA PLN 03-AUG-1990
DEFINITION Soybean beta-conglycinin-alpha subunit mRNA, 3' end.
ACCESSION M26128
KEYWORDS beta-conglycinin.
SOURCE Soybean, cDNA to mRNA, clone pB36.
ORGANISM Glycine max
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 794)
AUTHORS Coates,J.B., Medeiros,J.S., Thanh,V.H. and Nielsen,N.C.
TITLE Characterization of the subunits of beta-conglycinin
JOURNAL Arch. Biochem. Biophys. 243, 184-194 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 659 beta-conglycinin-alpha subunit (AA at 3)
mRNA < 1 794 beta-conglycinin-alpha subunit
BASE COUNT 253 a 153 c 177 g 211 t
ORIGIN
1 tgagcaaacg tgccaaatct agttcaagga aaaccatttc ttctgaagat aaacctttta
61 acttgggaag ccgcgacccc atctactcca agaagcttgg caagttcttt gagatcaccc
121 cagagaaaaa cccccagctt cgggacttgg atatcttcct cagtattgtg gatatgaacg
181 agggagctct tcttctacca cacttcaatt caaaggcgat agtgatactg gtaattaatg
241 aaggagatgc aaacattgaa cttgttggcc taaaagaaca acaacaggag cagcaacagg
301 aagagcaacc tttggaagtg cggaaatata gagccgaatt gtctgaacaa gatatatttg
361 taatcccagc aggttatcca gttgtggtca acgctacctc aaatctgaat ttctttgcta
421 ttggtattaa tgccgagaac aaccagagga acttcctcgc aggttcgcaa gacaatgtga
481 taagccagat acctagtcaa gtgcaggagc ttgcattccc tgggtctgca caagctgttg
541 agaagctatt aaagaaccaa agagaatcct actttgtgga tgctcagcct aatgaaaaag
601 aggagggtaa taagggaaga aagggtcctt tgtcttcaat tttgagggct ttttactgaa
661 taagtatgta ctaaaatgta tgctgtaata gctcatagtg agcgaggaaa gtatcgggct
721 atgtaactat gactagagct tcaactatga ataaataaat cgacagcata tgatgctttt
781 gttttgtgtt cttc
//
LOCUS SOYGLYB4 911 bp ss-mRNA PLN 03-AUG-1990
DEFINITION Soybean glycinin A3-B4 subunit mRNA, 3' end.
ACCESSION M35671
KEYWORDS glycinin; glycinin-B4 subunit.
SOURCE Soybean (var. Early-10) seed, cDNA to mRNA, clones pSPG[204,673].
ORGANISM Glycine max
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 911)
AUTHORS Epishin,S.M., Kotenko,S.V., Zakharova,E.S., Kiselev,S.L. and
Vinetskii,Y.P.
TITLE Cloning and structural analysis of cDNA encoding a portion of the
A-3-B-4 glycinin subunit
JOURNAL Genetika 22, 810-815 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 723 glycinin precursor
matp 202 720 glycinin-B4 subunit
mRNA < 1 911 glycinin mRNA
BASE COUNT 280 a 220 c 203 g 208 t
ORIGIN
1 caagaagacg aagacgaaga tgaagacgaa gaatatgaac aaactccctc ttatcctcca
61 cgacgaccaa gccatggaaa gcatgaagat gacgaggacg aggacgaaga agaacatcaa
121 cctcgtcctg atcaccctcc acagcgacca agcaggcccg aacaacaaga accacgtgga
181 agaggatgtc agactagaaa tggggttgag gaaaatattt gcaccatgaa gcttcacgag
241 aacattgctc gcccttcacg tgctgacttc tacaacccaa aagctggtcg cattagcacc
301 ctcaacagtc tcaccctccc agccctccgc caattcggac tcagtgccca atatcttgtc
361 ctctacagga atggaattta ctctccacat tggaacttga acgcgaacag tgtgatctat
421 gtgactcgag ggaaaggaag agttagagtg gtgaactgcc aagggaatcc agtgttcgac
481 ggtgacctaa ccaggggaca attgctactg gtgccgcaga actttgtggt cgctgaccaa
541 gggggaaaac aaggattgga atacgtagtg ttcaagacac agcacaacgc cgtgagcagc
601 tacattaagg atctgtttag ggcaatccct tcggaggttc tttccaattc ttacaacctt
661 ggccagagtc aagtgcgtca gctcaagtat caaggaaact ccggcccttt gctcaaccca
721 taaataacaa caagcatata tcaaggtgtg gtgaggccat cttatatgaa taatatgaaa
781 atatattttg tgtaataata aaactatggc ctatgtattt accaccctcc gacccagcct
841 atgttaatat ctgagtggcg ttgtaccttt gaatcgcctt aataaaatgt cagtcttcaa
901 gttttgtctt t
//
LOCUS TLARGN 133 bp ds-DNA PLN 03-AUG-1990
DEFINITION T.lanuginosus 26S ribosomal RNA gene, partial.
ACCESSION M35561
KEYWORDS 26S ribosomal RNA.
SOURCE T.lanuginosus DNA.
ORGANISM Thermomyces lanuginosus
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina.
REFERENCE 1 (bases 1 to 133)
AUTHORS Nazar,R.N. and Wong,W.M.
TITLE Is the 5S RNA a primitive ribosomal RNA sequence?
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 5608-5611 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 133 23S rRNA
BASE COUNT 39 a 23 c 39 g 32 t
ORIGIN
1 gtagcaaata ttcaaatgag aactttgaag actgaagtgg ggaaaggttc cacgtcaaca
61 gcagttggac gtgggttagt cgatcctaag agatggggaa gctccgtttc aaaggcctga
121 ttttatgcag gcc
//
LOCUS YSCRGNA 252 bp ds-DNA PLN 03-AUG-1990
DEFINITION S.cerevisiae 26S ribosomal RNA gene, partial.
ACCESSION M35562
KEYWORDS 26S ribosomal RNA.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 252)
AUTHORS Nazar,R.N. and Wong,W.M.
TITLE Is the 5S RNA a primitive ribosomal RNA sequence?
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 5608-5611 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 252 23S rRNA
BASE COUNT 62 a 60 c 75 g 55 t
ORIGIN
1 gtagcaaata ctcaaatgag aactttgagg actgaagtgg ggaaaggttc catgtgaaca
61 gcagttggac atgggttagt cgatcctaag acacagggaa gttccgtttg aaaggcgccc
121 tggtgcgccg ttgacatgca acagctggga ttcgctggtg gtcacccacc caactactaa
181 ccagccggcg tgtggcttaa gtacggctga gcggacggga agccctgttt tccacaccct
241 atggtcgcat gt
//
LOCUS YSCSRP 413 bp ds-DNA PLN 03-AUG-1990
DEFINITION S.cerevisiae serine rich protein (srp) gene, 5' end.
ACCESSION M35738
KEYWORDS serine rich protein.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 413)
AUTHORS Marguet,D. and Lauquin,G.J.-M.
TITLE The yeast srp gene: Positive modulation by glucose of its
transcriptional expression
JOURNAL Biochem. Biophys. Res. Commun. 138, 297-303 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 408 > 413 serine rich protein (srp)
mRNA 347 > 413 SRP mRNA (alt.)
mRNA 355 > 413 SRP mRNA (alt.)
mRNA 404 > 413 SRP mRNA (alt.)
BASE COUNT 123 a 83 c 71 g 136 t
ORIGIN
1 gatccatctt tcttcgcgcg tttattttca gcaccatgtt ttaggttttt acagcatcgt
61 ttaaggaacc caacaataca atagcgggaa gaatgcactt tctcgttcca taaagggtct
121 ctttcaccta tacggttggt acagatttcc agtgaatgcc agtcagccac ggcattacgt
181 cgtttgcttc tattttcttc gttttgaact gcgtttgtat gcaactgtcc cttgacagag
241 aaaaaagtga agcaaaatga cagacaaaga aatcttttgt ataaaaggtc ggttgaatct
301 tgttgttagc ttgaaatcag cttgcttttc tcctctaaat tacataaaaa accaagaaaa
361 tatcagactt tttcattcgc tttcaacaag tactacaata attaaaaatg gct
//
LOCUS AVIVNFDG 6557 bp ds-DNA BCT 03-AUG-1990
DEFINITION A.vinelandii nitrogenase (vnfHDGK) genes, complete cds.
ACCESSION M32371
KEYWORDS nitrogenase 2; vanadium nitrogenase.
SOURCE A.vinelandii (strain CA) DNA.
ORGANISM Azotobacter vinelandii
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Azotobacteraceae.
REFERENCE 1 (bases 1 to 6557)
AUTHORS Joerger,R.D., Loveless,T.M., Pau,R.N., Mitchenall,L.A.,
Simon,B.H. and Bishop,P.E.
TITLE Nucleotide sequence and mutational analysis of the structural genes
for nitrogenase 2 of Azotobacter vinelandii
JOURNAL J. Bacteriol. 172, 3400-3408 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.N.Pau, 28-FEB-1990.
FEATURES from to/span description
pept 238 1110 nitrogenase (vnfH)
pept 2415 3839 nitrogenase (vnfD)
pept 3839 4180 nitrogenase (vnfG)
pept 4237 5664 nitrogenase (vnfK)
pept 1227 1424 ferredoxin-like ORF
rpt 1172 1192 inverted repeat
rpt 4188 4219 inverted repeat
rpt 5737 5760 inverted repeat
binding 1214 1218 pot. ribosome binding site
binding 2400 2406 pot. ribosome binding site
binding 4222 4228 pot. ribosome binding site
site 2311 2327 pot. NTRA-dependent promoter
BASE COUNT 1359 a 2270 c 1874 g 1054 t
ORIGIN
1 gatcgcttcc cggctgtacc tgcgggtacg tcgacggagc gcacagcatc ctggcctgga
61 tttatggagt ccaataaacc tgcaaaaatt aaaataattc actaattaaa tatgtttttt
121 tgattttata ttccaaaaaa taggcaatca tcgacttatc gatccttgtg gcaccgccct
181 tgctcaactc tcgtcggcac aaatcaaaac gccaacgaat caacggaggt tcctaagatg
241 gcattgcgtc agtgtgcaat ttacggcaag ggtggcatcg gcaagtccac caccacccag
301 aacctggtcg ccgccctcgc cgaagccggc aagaaagtga tgatcgtcgg ttgtgacccg
361 aaagccgact ccacccgcct gatcctgcac tccaaggccc agggcaccgt catggaaatg
421 gccgcgtccg ccggctcggt cgaagacctg gagctggaag acgtgctgca gatcggcttc
481 ggcggcgtca agtgcgtcga atccggtggc ccggagccgg gcgtcgcttg cgccggccgt
541 ggcgtgatca ccgcgatcaa cttcctggaa gaagaaggcg cctacagcga cgacctggac
601 ttcgtgttct atgacgtgct gggcgacgtg gtatgcggcg gcttcgccat gccgatccgc
661 gagaacaagg cccaggaaat ctacatcgtc tgctccggcg agatgatggc catgtacgcc
721 gccaacaaca tcgccaaggg catcgtgaaa tacgcccact ccggcagcgt gcgtctgggc
781 ggcctgatct gcaacagccg caagaccgac cgcgaagacg agctgatcat ggccctggcc
841 gcgaagatcg gcacccagat gatccacttc gtgccgcgcg acaacgtcgt gcaacacgcc
901 gaaatccgcc gcatgaccgt gatcgaatac gatccgaagg ccggacaggc cgacgagtac
961 cgtgccctgg ctcgcaaaat cgtcgacaac aagctgctgg tcatcccgaa cccggcctcc
1021 atggaagaac tcgaagagct gctaatggaa ttcggcatca tggaagtcga agacgagtcc
1081 gtcgtcggca aggccgccgc cgaaggctga ttcacccagc acagcgtttg cggaggagcg
1141 tgcgccgcgg gctttcggaa tggcttctcg cggccggcgg cgcacgccgc cctcccttcg
1201 aacaaccgac ctcaggagct gacaccatgg ccatggccat cgacggctac gaatgcactg
1261 tctgcggcga ctgcgagccg gtctgcccga ccggttcgat cgtcttcagg gacgatcact
1321 acgcgatcga agccgacagt tgcaacgaat gcaccgacgt gggcgagccg cgctgtctcg
1381 gcgtctgccc cgtcgacttc tgcatccagc cgctcgatga ctgaacactg aacgactccg
1441 caccccgttg ccggcggcag gacattccgc gccgtcctgc cgccggaccc agaacggcga
1501 tcgcttttcc tcaggggcga tcgccgtttt acttttcccc gctccgctag cgcccgcgga
1561 caacagccgt ccgtatccgc gccgttcgcc gtcctcgacc ggccacgatc gcggcacagc
1621 tcgccaagct acagcccgct ccacaagctg accatcggca tccagacctt cgccaagatc
1681 cgcgaagact tcttacgtcg acaagaccgg cttcgcccgg cgaccgatcg aacgggacac
1741 actacttcct ctcccggcca tgccgcttcg gcaagagccc gttcctcgat accctggccg
1801 aaccgttcgc cggcgacgaa ccttcgttcc gcaggctgca gatgcatgac cgatgggact
1861 ggcgccggcg ccatccggca ttgcggatcg gcttcggcgg cgcgccgatc cgggacggcg
1921 gataagctcg ccgagccgct gctcccaaag ccacgcccac ctccacgtcc cggcgcggaa
1981 gtggcgatcc ccagccgcac gtccccatgg tttccgggtc tggaaccgtc ccccggactt
2041 cccgtacgga cgcccggaag ccaccgcccc gacggccacg cacagccacg tacagacccc
2101 ccgctctccg tacttggccg aacgaaaaac cttttacgta acccgccggc cgtccacccg
2161 gttgtcccga acaggaaaaa aaagcccgga aaaggcttgc cccggcgctt ttccaaaacc
2221 tcgaaaatgc gcacaaataa ttgattcgaa aggattaatc tgagacagcg gcgatgccgc
2281 gaaaaattcc tgcaagcgct gcaaggacat atggcacgca tcctgccctt acctcttcgc
2341 caaccggttt ttcggtcccg gtcgagtgcc gtcaggggac tcgatccacg catagagccg
2401 aggagacttc aatcatgcca atggtattgc tggaatgtga caaggacata cccgagcgcc
2461 agaaacacat ctatctgaag gcgcccaacg aggacacccg cgagttcctg ccgatcgcca
2521 acgcggcgac catccccggc accctgtccg aacgcggctg cgccttctgc ggcgccaagc
2581 tggtgatcgg cggtgtgctc aaagacacca tccagatgat tcacggcccg ctcggctgtg
2641 cctacgacac ctggcacacc aagcgctacc cgaccgacaa cggccacttc aacatgaagt
2701 acgtctggtc gaccgacatg aaggaaagcc atgtggtctt cggcggcgag aaacgcctcg
2761 agaagagcat gcacgaagcc ttcgacgaaa tgcccgacat caagcggatg atcgtctaca
2821 cgacctgccc gaccgcgctg atcggcgacg acatcaaggc cgtggccaag aaggtgatga
2881 aggaccgtcc ggacgtggac gtcttcaccg tcgaatgccc cggcttctcc ggtgtgtccc
2941 agtccaaggg ccaccacgtc ctgaacatcg gctggatcaa cgagaaagtc gagacgatgg
3001 agaaggaaat caccagcgaa tacaccatga acttcatcgg tgacttcaat attcaaggtg
3061 atacccaact gctgcaaacc tactgggacc gcctgggcat ccaggtcgtc gcccacttca
3121 ccggcaacgg cacctacgac gacctgcgct gcatgcacca ggcccagctc aacgtggtga
3181 actgcgcccg ttcctccggc tacatcgcca acgagctgaa gaagcgctac ggcatcccgc
3241 gtctggacat cgactcctgg ggcttcaact acatggccga gggcatccgc aagatctgcg
3301 ccttcttcgg catcgaggag aagggcgagg agctgatcgc cgaggaatac gccaagtgga
3361 agccgaagct cgactggtac aaggagcgtc tgcaaggcaa gaaaatggcg atctggaccg
3421 gcggcccgcg cctgtggcac tggaccaagt cggtcgagga cgacctgggc gtgcaagtgg
3481 tggccatgtc ctccaagttc ggccatgagg aagacttcga gaaggtcatc gcccgcggca
3541 aggaaggcac ctactacatc gacgacggca acgagctgga attcttcgag atcatcgacc
3601 tggtcaagcc ggacgtgatc ttcaccggcc cgcgcgtcgg cgaactggtc aagaagctgc
3661 acatccccta cgtcaacggc cacggctacc acaacggccc gtacatgggc ttcgaaggct
3721 tcgtcaacct ggcccgcgac atgtacaacg ccgtgcacaa cccgctgcgc cacctggccg
3781 ccgtggatat ccgcgacaag tcgcagacta ccccggtcat cgtgcggggg gccgcctgat
3841 gagccagtcc catctcgacg atctgttcgc ctatgtcgag gagcgctgcc tgtggcagtt
3901 cttctcgcgc acctgggacc gcgaggaaaa catcgagggc gtgctcaatc aggtcggccg
3961 cctgctgacc ggccaggagc cgctgcgcgg caccccgcag gagcgcctgt tctacgccga
4021 cgccctggcc atggccaacg atgtccgcga gcgtttcccc tgggcttcgc aggtcaacaa
4081 ggaagagatc gagttcctgc tcgacggtct caagtcccgt ctggtcgacg tgaccatcac
4141 ccgctcgacc aaccgcgaac tcaaccacca cctctactga gtcatcggcg cggagacggc
4201 cggaacgccg tctgcgcgcg cgaaggagcc attttcatga gcaattgcga actgaccgtg
4261 ctgaagccgg cagaagtcaa gctgagcccg cgtgaccgcg agggcatcat caacccgatg
4321 tacgactgcc agccggccgg cgcccagtac gccggcatcg gcatcaagga ctgcatcccg
4381 ctggtccacg gcggccaggg ctgcacgatg ttcgtccgcc tgctgttcgc ccagcacttc
4441 aaggaaaact tcgacgtcgc ctccacctcg ctgcacgagg agtcggcggt gttcggcggc
4501 gccaagcgcg tcgaggaagg cgtgctggtc ctcgcccgcc gctacccgaa cctgcgcgtc
4561 atcccgatca tcaccacctg ctccaccgaa gtcatcggcg acgacatcga gggcagcatc
4621 cgggtctgca accgggcact cgaagccgaa ttcccggatc gcaagatcta cctggcgccg
4681 gtacacaccc cgagcttcaa gggcagccac gtcaccggct acgccgagtg cgtgaagtcg
4741 gtgttcaaga ccatcaccga cgcgcacggc aagggccagc cgagcggcaa gctcaacgtg
4801 ttcccgggct gggtcaaccc cggcgacgtg gtgctgctca agcgctactt caaggaaatg
4861 gacgtcgaag ccaacatcta catggacacc gaggacttcg actcgccgat gctgccgaac
4921 aagagcatcg agacccacgg ccgcaccacc gtcgaggaca tcgccgacag cgccaacgcc
4981 ctggccaccc tgtccctggc ccgctacgag ggcaacacca ccggcgagtt gctgcagaag
5041 accttcgcgg tgccgaatgc cctggtcaac accccctacg gcatcaagaa caccgacgac
5101 atgctgcgca agatcgccga agtcaccggc aaggagatcc ccgagtcgct ggtccgcgag
5161 cgcggcatcg ccctcgacgc cctggccgac ctggcgcaca tgttcttcgc caacaagaaa
5221 gtggcgatct tcggccaccc ggacctggtg ctcggcctgg cccagttctg catggaagtc
5281 gagctggaac ccgtactgct gctgatcggc gacgaccagg gcaacaagta caagaaggac
5341 ccgcgcatcg aggagctgaa gaacaccgcg cacttcgaca tcgagatcgt ccacaacgcc
5401 gacctctggg aactggagaa gcgcatcaac gccggcctcc agctcgacct gatcatgggt
5461 cactcgaagg gccgctacgt cgccatcgag gccaacatcc cgatggtccg cgtcggcttc
5521 ccgaccttcg accgcgccgg tctctaccgc aagccctcga tcggctacca gggcgccatg
5581 gaactgggcg agatgatcgc caacgccatg ttcgcccaca tggaatacac ccgtaacaag
5641 gagtggatcc tcaatacgtg gtgagttgag gtgccggagc ggtttccacg cactccggct
5701 gtcgagccga ccgagatgac ggcatgcacg gatctcccct cgcccatgca tgggcgaggg
5761 agttgcgagc cgtcacgccc gaggtcgttc cgacggcagg ccgatgcacc ggaaaaccgc
5821 tcccgccccc ggcggctccc ggctccggcc ggggcctccg ccccgtattc cagatccggc
5881 cgccatgccg cattcccggg agtgatgcca tgaaacagcg acaggaaatg gtcgcccact
5941 accgcgcctg cttcggcgag ctgtgcgccc gaccggaaca ccgtcctatc gaaccctaca
6001 cccgcccccg gcgcctgagc ttcgcggagc cggaaagcgc gaccgcccgt caggtgccgg
6061 cgcgcctggt actggccctg accagtgcct acgccctgct cgccgactgg caggaatccc
6121 gcgacccgtc gctggccgac ctgggcagtt ggcaacgcta cctcgccctg ccccgccgca
6181 ccccggcgga aaagctgatg gccgagatct tccgcatcct gcgcgtgttc cgcgccgcgg
6241 cgatccagca gaacggcgcc atcgaaatcc gcgacgacgg cctgatccgc gccagttgca
6301 cctacaaccg ctgcgcgctg agcctgctga tcacccaggc cggcctcgaa ctgctcaccg
6361 cctgcgtcgc ctgctacctc gaatccttcg accagcccta cagcgatgcc tacgaggaac
6421 tgctgctcgg ccagtactac gccgacatcg tcgccgagat ccgctccttc gccgacgacg
6481 accgggtgct gttccagttc cgccagaaag cctggttcaa ccgccatgtc cgcctggact
6541 gcgacaaccc gcgcctg
//
LOCUS ECOCYS 5755 bp ds-DNA BCT 03-AUG-1990
DEFINITION E.coli thiosulfate binding protein (cysP), sulfate permease (cysT,
cysW, cysA) and o-acetylserine (thiol)-lyase-B (cysM) genes,
complete cds.
ACCESSION M32101
KEYWORDS cysA gene; cysM gene; cysP gene; cysT gene; cysW gene;
o-acetylserine (thiol)-lyase-B; sulfate permease;
thiosulfate binding protein.
SOURCE E.coli K12 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1292 to 5755)
AUTHORS Sirko,A., Hryniewicz,M., Hulamicka,D. and Boeck,A.
TITLE Sulfate and thiosulfate transport in E.coli K12: Nucleotide
sequence and expression of the cysTWAM gene cluster
JOURNAL J. Bacteriol. 172, 3351-3357 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1291)
AUTHORS Hryniewicz,M., Sirko,A., Palucha,A., Boeck,A. and Hulamicka,D.
TITLE Sulfate and thiosulfate transport in E.coli K12: Identification of
a gene encoding a novel protein involved in thiosulfate binding
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
D.Halanicka, 14-FEB-1990.
FEATURES from to/span description
pept 559 1575 thiosulfate binding protein (cysP)
pept 1575 2408 sulfate permease (cysT)
pept 2408 3283 sulfate permease (cysW)
pept 3273 4370 sulfate permease (cysA)
pept 4505 5416 o-acetylserine (thiol)-lyase-B (cysM; gtg
start codon; EC 4.2.99.8)
signal 516 521 -10 region
site 5737 5755 right end of mu
BASE COUNT 1259 a 1477 c 1682 g 1337 t
ORIGIN 52 min on K12 map.
1 gttaacgcca tttgcccggg atacgtgcgc acaccaatgg cggaaagcat tgcccgccag
61 tcgaacccgg aagatccaga gtcggtgctg actgaaatgg cgaaagcaat cccgatgcgt
121 cgcctcgccg atccgctgga agtcggcgaa ctggcggcct tcctcgcatc ggatgaatcc
181 agctatttaa ccggtacaca gaatgtgatt gatggcggca gcacactgcc ggagacggtt
241 agcgtcggta tctgattcac ctctgtttcc tccctgcatt tgtggggagg atttcgtctt
301 gaactaagtt caccaggcta ttttatttgt cattttggcc ccgggcagtg ctcgaaatcc
361 tcacgtacta tgtgtacgct ccggtttctc cgcgctgttc gtgtccaaac tgactgcaac
421 aattacgcct gttgaaccaa gttcttattc ccttttcaac ttccaaatca ccaaacggta
481 tataaaaccg ttactccttt cacgtccgtt ataaatatga tggctattag aaagtcatta
541 aatttataag ggtgcgcaat ggccgttaac ttactgaaaa agaactcact cgcgctggtc
601 gcttctctgc tgctggcggg ccatgtacag gcaacggaac tgctgaacag ttcttatgac
661 gtctcccgcg agctgtttgc cgccctgaat ccgccgtttg agcaacaatg ggcaaaagat
721 aacggcggcg acaaactgac gataaaacaa tctcatgccg ggtcatcaaa acaggcgctg
781 gcgattttac agggcttaaa agccgacgtt gtcacttata accaggtgac cgacgtacaa
841 atcctgcacg ataaaggcaa gctgatcccg gccgactggc agtcgcgcct gccgaataat
901 agctcgccgt tctactccac catgggcttc ctggtgcgta agggtaaccc gaagaatatc
961 cacgattgga acgacctggt gcgctccgac gtgaagctga ttttcccgaa cccgaaaacg
1021 tcgggtaacg cgcgttatac ctatctggcg gcatggggcg cagcggataa agctgacggt
1081 ggtgacaaag gcaaaaccga acagtttatg acccagttcc tgaaaaacgt tgaagtgttc
1141 gatactggcg gtcgtggcgc gaccaccact tttgccgagc gcggcctggg cgatgtgctg
1201 attagcttcg aatcggaagt gaacaacatc cgtaaacagt atgaagcgca gggctttgaa
1261 gtggtgattc cgaaaaccaa cattctggcg gaattcccgg tggcgtgggt tgataaaaac
1321 gtgcaggcca acggtacgga aaaagccgcc aaagcctatc tgaactggct ctatagcccg
1381 caggcgcaaa ccatcatcac cgactattac taccgcgtga ataacccgga ggtgatggac
1441 aaactgaaag acaaattccc gcagaccgag ctgttccgcg tggaagacaa atttggctcc
1501 tggccggaag tgatgaaaac ccacttcacc agcggcggcg agttagacaa gctgttagcg
1561 gcggggcgta actgatgttt gctgtctcct ccagacgcgt gctgccgggc tttaccttaa
1621 gcctcggcac cagtctgctg tttgtgtgcc tgattttgct gctgccgctc tccgcgctgg
1681 tgatgcaact ggcccagatg agctgggcgc agtactggga ggtgatcacc aacccgcagg
1741 tggtcgcggc ctacaaagta acgctgctgt cggcgtttgt ggcatcgatt tttaacggcg
1801 ttttcggtct gctgatggcg tggatcctaa cccgctatcg cttcccaggc cgcacgctgc
1861 ttgatgcgct gatggattta ccctttgcgc tgccaacggc tgtcgccggt ttaacgctgg
1921 cctcgctctt ttccgtaaac ggtttttacg gtgaatggct ggcgaagttt gatatcaaag
1981 tcacctatac atggctgggg attgcggtgg ctatggcctt taccagcatt ccgtttgtgg
2041 tgcgtaccgt gcagccggtg ctggaagagt taggcccgga atatgaagaa gcggcggaaa
2101 cgcttggtgc aacgcgctgg cagagtttct gcaaagtggt gctgccggag ctttctccgg
2161 cgctggtggc gggcgtggcg ctgtcgttta cccgtagtct tggtgaattt ggcgcggtga
2221 tttttatcgc cggaaatatc gcgtggaaga cggaagtgac gtcgctgatg atttttgtgc
2281 gcttacagga gtttgattac ccggcagcga gcgcgattgc ttcggtgatc ctcgcggcat
2341 ctctgctgct gctgttctca attaacactc tgcaaagtcg ctttggtcgg cgtgtggtag
2401 gtcattaatg gcggaagtta cccaattgaa gcgttatgac gcgcgcccga ttaactgggg
2461 caaatggttt ctgattggca tcgggatgct ggtttcggcg ttcatcctgc tggtgccgat
2521 gatttacatc ttcgtgcagg cattcagcaa ggggctgatg ccggttttac agaatctggc
2581 cgatccggac atgctgcacg ccatctggct gacggtgatg atcgcgctga ttgccgtacc
2641 ggtaaacctg gtgttcggca ttctgctggc ctggctggtg acgcgcttta acttccctgg
2701 acgccagtta ctgctgacgc tactggacat tccgtttgcc gtatcgccgg tggttgccgg
2761 tctggtgtat ttgctgttct acggctctaa cggcccgctc ggcggttggc tcgacgagca
2821 taacctgcaa attatgttct cctggccggg aatggtgctg gtcaccatct tcgtgacgtg
2881 tccgtttgtg gtgcgcgaac tggtgccggt gatgttaagc cagggcagcc aggaagacga
2941 agcggcgatt ttgcttggcg cgtccggctg gcagatgttc cgtcgcgtca cattaccgaa
3001 catccgctgg gcgctgcttt atggcgtggt gttgaccaac gcccgcgcaa ttggcgagtt
3061 tggcgcggtg tcggtggttt ccggctcgat tcgcggcgaa accctgtcgc tgccgttaca
3121 gattgaattg ctggagcagg actacaacac cgtcggctcc tttaccgctg cggcgctgtt
3181 aacgctgatg gcgattatca ccctgttttt aaaaagtatg ttgcagtggc gcctggagaa
3241 tcaggaaaaa cgcgcacagc aggaggaaca tcatgagcat tgagattgcc aatattaaga
3301 agtcgtttgg tcgcacccag gtgctgaacg atatctcact ggatattcct tcaggtcaga
3361 tggtcgcgtt gctggggccg tccggttccg ggaaaaccac gctgctgcgc attatcgccg
3421 ggctggagca tcaaaccagc gggcatattc gcttccacgg caccgacgtg agccgcctgc
3481 acgcacgtga tcgtaaagtc ggtttcgtgt tccagcatta cgcgctgttc cgccatatga
3541 cggtgttcga caatatcgct tttggcctga cggtgctgcc gcgtcgcgag cgcccgaatg
3601 ccgcagccat caaagcgaaa gtgacaaaat tgctggaaat ggtccagctt gcccatctgg
3661 cggatcgtta tccggcgcac gtttccggcg gccagaaaca gcgcgtggcg ctggcgcgcg
3721 cgctggctgt ggaaccgcaa attctgctgc ttgatgaacc gtttggcgcg ctggatgcgc
3781 aggtgcgtaa agagctgcgt cgctggctgc gtcaactcca tgaagaacta aaattcacca
3841 gcgtttttgt gacccacgat caggaagaag cgaccgaagt agctgatcgt gtagttgtga
3901 tgagccaggg caatattgaa caggctgacg cgccggatca ggtatggcgc gaaccggcga
3961 cccgttttgt gctcgaattt atgggcgaag tgaaccgcct gcagggaacc attcgcggcg
4021 ggcagttcca tgttggcgcg catcgctggc cgctgggcta cacacctgcg tatcaggggc
4081 cggtggatct cttcctgcgc ccttgggaag tggatatcag ccgccgtacc agcctcgatt
4141 cgccgctgcc ggtacaggta ctggaagcca gcccgaaagg tcactacacc caattagtgg
4201 tgcagccgct ggggtggtac aacgaaccgc tgacggtcgt gatgcatggc gacgatgccc
4261 cgcagcgtgg cgagcgttta ttcgttggtc tgcaacatgc gcggctgtat aacggcgacg
4321 agcgtatcga aacccgcgat gaggaacttg ctctcgcaca aagcgcctga taggttgagt
4381 gaatgttaaa cgcccggagg cgcttcccgc gagtccgggc tttttaatgg caaggtttgt
4441 aacctgtaga cctgataaga cgcgcaagcg tcgcatcagg caacaccacg tatggataga
4501 gatcgtgagt acattagaac aaacaatagg caatacgcct ctggtgaagt tgcagcgaat
4561 ggggccggat aacggcagtg aagtgtggtt aaaactggaa ggcaataacc cggcaggttc
4621 ggtgaaagat cgtgcggcac tttcgatgat cgtcgaggcg gaaaagcgcg gggaaattaa
4681 accgggtgat gtcttaatcg aagccaccag tggtaacacc ggcattgcgc tggcaatgat
4741 tgccgcgctg aaaggctatc gcatgaaatt gctgatgccc gacaacatga gccaggaacg
4801 ccgtgcggcg atgcgtgctt atggtgcgga actgattctt gtcaccaaag agcagggcat
4861 ggaaggtgcg cgcgatctgg cgctggagat ggcgaatcgt ggcgaaggaa agctgctcga
4921 tcagttcaat aatcccgata acccttatgc gcattacacc accactgggc cggaaatctg
4981 gcagcaaacc ggcgggcgca tcactcattt tgtctccagc atggggacga ccggcactat
5041 caccggcgtc tcacgcttta tgcgcgaaca atccaaaccg gtgaccattg tcggcctgca
5101 accggaagag ggcagcagca ttcccggcat tcgccgctgg cctacggaat atctgccggg
5161 gattttcaac gcttctctgg tggatgaggt gctggatatt catcagcgcg atgcggaaaa
5221 caccatgcgc gaactggcgg tgcgggaagg aatattctgt ggcgtcagct ccggcggcgc
5281 ggttgccgga gcactgcggg tggcaaaagc taaccctgac gcggtggtgg tggcgatcat
5341 ctgcgatcgt ggcgatcgct acctttctac cggggtgttt ggggaagagc attttagcca
5401 gggggcgggg atttaaggat taatagcatc ggagactgat gacaaacgca aaactgcctg
5461 atgcgctacg cttatcaggc ctacaaggtt tctgcaatat attgaattag cacgattttg
5521 taggccggat aaggcgttta cgccgcatcc ggcataaaca aagcgcactt ttttaacagt
5581 tgttgctgcc gacaaatgca gtatttaatt ttcgtgagga aacgccgtaa ggtcattgaa
5641 gcggcgcacg aaaaaccgaa agcgtttcac gataaatgcg aaaactttac gtttcgcgct
5701 tcaaatgaaa cagatgtatt aattactact ttttattcat tacatgggga tccag
//
LOCUS TTHTRPBA 2586 bp ds-DNA BCT 03-AUG-1990
DEFINITION T.thermophilus tryptophan synthetase (trpAB) genes, complete cds.
ACCESSION M32108
KEYWORDS tryptophan synthetase.
SOURCE T.thermophilus (strain HB27) DNA.
ORGANISM Thermus thermophilus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 2586)
AUTHORS Koyama,Y. and Furukawa,K.
TITLE Cloning and sequence analysis of tryptophan synthetase genes of an
extreme thermophile, Thermus thermophilus HB27: Plasmid transfer
from replica-plated E.coli recombinant colonies to competent
T.thermophilus cells
JOURNAL J. Bacteriol. 172, 3490-3495 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
Y.Koyama, 14-FEB-1990.
FEATURES from to/span description
pept 353 1567 tryptophan synthetase B (EC 4.2.1.20)
pept 1564 2379 tryptophan synthetase A (gtg start codon; EC
4.2.1.20)
BASE COUNT 396 a 879 c 905 g 406 t
ORIGIN 1 bp upstream of BglII site.
1 agatctcgtt gacgctcgcc gtgcggtggc ggaaccactg cctcgccacg aagatggggg
61 ccttgacgtg aaacttgaag accaccatct caaaggggct cgtgtggcgg tggcgcatga
121 ggtagtcaat gagggcggcg tcctcccgca ccgtcttcgt cccttccccg taggagaccc
181 gggccgcctg gacgatggcg cggtcgtccc ccatctggtc caccaggcgg acgaagccct
241 tgtccaagac cgggatcgtg agaggccctt ccatcccccc gagtttaccg ggaggcccct
301 ccggggtagg atgggagttg tcttggcgcg aggcgccttt agggagcgaa gcatgctgac
361 cctacccgac tttcccttgc ccgacgcgag ggggcggttc ggcccctacg gggggcggta
421 cgtgcccgag accctgatcc ccgccctgga ggagttggag gccgcctacc gggaggccaa
481 gaaggatccg gccttcctgg aggagctgga ccactacctc aggcagtttg ccggcaggcc
541 cacgcccctt taccacgcca agcggctttc cgagtactgg ggtggggccc aggtcttcct
601 caagcgggag gacctcctcc acaccggggc ccacaagatc aacaacaccc tgggccaagc
661 cctcctggcg aggcgcatgg gcaagaggcg ggtcatcgcc gaaacggggg cggggcagca
721 cggggtgagc gtggccacgg tggcggccct cttcggcctg gagtgcgtgg tctacatggg
781 ggaggaggac gtcaggcgcc aggccttaaa cgtcttccgc atgaagctcc tcggggccga
841 ggtccggccc gtggccgcgg ggagccgcac cctcaaggac gccaccaacg aggccatccg
901 cgactggatc acgaacgtgc gcaccacctt ctacatcctg ggctccgtgg tgggccccca
961 cccctacccc atgatggtcc gggacttcca gagcgtcatc ggggaggagg tgaagcggca
1021 aagcctggag cttttcggcc gcctgcccga cgccctcatc gccgcggtgg ggggcggttc
1081 caacgccatc ggcctcttcg cccccttcgc ctacctgcct gagggcaggc ccaagctcat
1141 cggggtggag gccgccgggg aggggctttc cacgggcagg cacgccgcga gcatcggggc
1201 ggggaagcgg ggggtgctgc acgggagcta catgtacctc ctctacgacc acgatgggca
1261 gatcaccccg gcccactccg tctccgcagg gctggactac cccggggtgg ggccggagca
1321 cagctactac gccgacgccg gggtggcgga gtacgccagc gtcacggacg aggaggccct
1381 cgagggcttc aagctcctgg cgcggcttga gggcatcatc cccgccttgg agtcggccca
1441 cgccatcgcc tacgccgcca aggtggtgcc ggagatggac aaggaccagg tggtggtcat
1501 caacctctcc ggccgggggg acaaggacgt gaccgaggtg atgcgccttc tgggagggga
1561 gctgtgacca ccctcgaggc cttcgccaag gcccggtccg agggccgggc cgccctcatc
1621 ccttacctca ccgcgggctt ccccagccgg gagggctttt tgcaggcggt ggaggaggtc
1681 ctcccctatg ccgaccttct ggagatcggc ctcccctact ccgaccccct gggggacggc
1741 cccgtgatcc agcgggcaag cgaactggcc ctgaggaagg ggatgagcgt ccaaggcgct
1801 ttggagctcg tccgcgaggt gcgcgccctc acggagaaac ccctcttcct catgacctac
1861 ctgaaccccg tcctggcctg ggggccggag cgcttcttcg gcctcttcaa gcaggcgggg
1921 gccacggggg tgatccttcc cgaccttccc cccgacgagg acccaggcct cgtgcgcctc
1981 gcccaggaga tcggcctgga gacggtcttc cttctcgccc ccacctccac ggacgcccgc
2041 atcgccaccg tggtccgcca cgccacgggc ttctcgtacg ccgtctccgt caccggggtc
2101 acggggatgc gggaaaggct tcccgaggag gtcaaggacc tggtgcggcg catcaaggcc
2161 aggaccgccc tccccgtggc cgtgggcttc ggggtttccg gcaaggccac ggccgcgcag
2221 gcggcggtgg cggacggggt ggtggtggga agcgccttgg tccgggcctt ggaggagggg
2281 aggtcccttg cgccccttct ccaggagatc cgccagggcc tccagcgcct cgaggccaac
2341 cccggcctga aggaaagctc aaagaagccg ttgtcctgag ccttggaggg taaacccatg
2401 cggcagtggc cttttttagt tctggcaggc ggcctcctcc tcggggcctg tgggctcctt
2461 ccggggtttg gggggcctaa ggtctccgga gagttccagg gggactggtc cggcgtggcc
2521 cagggcctgc gcctggccct ggtgggcctc acgacggagg ggcaggtgaa ctacgacaac
2581 cagctg
//GenBank-Updates@genbank.bio.net (08/04/90)
LOCUS M25435 120 bp ds-DNA BAD 04-AUG-1990
DEFINITION Figure 6. Possible secondary structure model of 5S RNA.
ACCESSION M25435
REFERENCE 1 (bases 1 to 120)
AUTHORS Rabin,D. and Crothers,D.M.
TITLE analysis of rna secondary structure by photochemical reversal of
psoralen crosslinks
JOURNAL Nucleic Acids Res. 7, 689-703 (1979)
STANDARD unannotated staff_entry
BASE COUNT 23 a 36 c 41 g 20 t
ORIGIN
1 tgcctggcgg ccgtagcgcg gtggtcccac ctgaccccat gccgaactca gaagtgaaac
61 gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac tgccaggcat
//
LOCUS M25592 55 bp ds-DNA BAD 04-AUG-1990
DEFINITION Figure 4. The nucleotide sequence of the fragment is given
at the top of the figure.
ACCESSION M25592
REFERENCE 1 (bases 1 to 55)
AUTHORS Ogata,R.T. and Gilbert,W.
TITLE dna-binding site of lac repressor probed by dimethylsulfate
methylation of lac operator
JOURNAL J. Mol. Biol. 132, 709-728 (1979)
STANDARD unannotated staff_entry
BASE COUNT 16 a 9 c 16 g 14 t
ORIGIN
1 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacag
//
LOCUS M25594 75 bp ds-DNA BAD 04-AUG-1990
DEFINITION Figure 4. (a) the cloverleaf model for the base pairing of
native tRNA-Trp.
ACCESSION M25594
REFERENCE 1 (bases 1 to 75)
AUTHORS Jones,C.R., Kearns,D.R. and Muench,K.H.
TITLE nuclear magnetic resonance of the base-pairing structure of the
native and denatured conformers of escherichia coli transfer
rna-trp
JOURNAL J. Mol. Biol. 103, 747-764 (1976)
STANDARD unannotated staff_entry
BASE COUNT 13 a 21 c 24 g 14 t 3 others
ORIGIN
1 aggggcgtag ttcaaddggd agagcaccgg tctccaaacc gggtgttggg agttcgagtc
61 tctccgcccc tgcca
//
LOCUS M25597 78 bp ds-DNA BAD 04-AUG-1990
DEFINITION Figure 1: (A) The cloverleaf structure of E. coli tRNA-Met-f1.
ACCESSION M25597
REFERENCE 1 (bases 1 to 78)
AUTHORS Daniel,W.E.Jr. and Cohn,M.
TITLE changes in tertiary structure accompanying a single base change in
transfer rna. protein magnetic resonance and aminoacylation studies
of escherichia coli trna-met-f1 and trna-met-f3 and their
spin-labeled (s4u8) derivatives
JOURNAL Biochemistry 15, 3917-3924 (1976)
STANDARD unannotated staff_entry
BASE COUNT 15 a 26 c 25 g 12 t
ORIGIN
1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggatc gtcggttcaa
61 atccggcccc cgcaacca
//
LOCUS M26841 459 bp ds-DNA BAD 04-AUG-1990
DEFINITION Figure 1. Nucleotide sequence coding for mature Il-1-beta and the
corresponding amino acid sequence.
ACCESSION M26841
REFERENCE 1 (bases 1 to 459)
AUTHORS Gronenborn,A.M., Clore,G.M., Schmeissner,U. and Wingfield,P.
TITLE a h-nmr study of human interleukin-1-beta
JOURNAL Eur. J. Biochem. 161, 37-43 (1986)
STANDARD unannotated staff_entry
BASE COUNT 143 a 109 c 106 g 101 t
ORIGIN
1 gcacctgtac gatcactgaa ctgcacgctc cgggactcgc agcaaaaaag cttggtgatg
61 tctggtccat atgaactgaa agctctccac ctccagggac aggatatgga gcaacaagtg
121 gtgttctcca tgtcctttgt acaaggagaa gaaagtaatg acaaaatacc tgtggccttg
181 ggcctcaagg aaaagaatct gtacctgtcc tgcgtgttga aagatgataa gcccactcta
241 cagctggaga gtgtagatcc caaaaattac caaaagaaga agatggaaaa gcgatttgtc
301 ttcaacaaga tagaaatcaa taacaagctg gaatttgagt ctgcccagtt ccccaactgg
361 tacatcagca cctctcaagc agaaaacatg cccgtcttcc tgggagggac caaaggcggc
421 caggatataa ctgacttcac catgcaattt gtgtcttcc
//
LOCUS M28421 142 bp ds-DNA BAD 04-AUG-1990
DEFINITION Figure 5. The unique Xho I site of phi X174 DNA.
ACCESSION M28421
REFERENCE 1 (bases 1 to 142)
AUTHORS Lam,L.H. and Reynolds,R.J.
TITLE dna sequence dependence of closely opposed cyclobuty1 pyrimidine
dimers induced by uv radiation
JOURNAL Mutat. Res. 178, 167-176 (1987)
STANDARD unannotated staff_entry
COMMENT
BASE COUNT 43 a 27 c 28 g 44 t
ORIGIN
1 gctttatcaa gataattttt cgactcatca gaaatatccg aaagtgttaa cttctgcgtc
61 atggaagcga taaaactctg caggttggat acgccaatca tttttatcga agcgcgcata
121 aatttgagca gatttgtcgt ca
//
LOCUS M28574 160 bp ds-DNA BAD 04-AUG-1990
DEFINITION Figure 4. Sequence of tyrT DNA.
ACCESSION M28574
REFERENCE 1 (bases 1 to 160)
AUTHORS Drew,H.R. and Travers,A.A.
TITLE dna bending and its relation to nucleosome positioning
JOURNAL J. Mol. Biol. 186, 773-790 (1985)
STANDARD unannotated staff_entry
COMMENT
BASE COUNT 40 a 42 c 38 g 40 t
ORIGIN
1 ccggttacct ttaatccgtt acggatgaaa attacgcaac cagttcattt ttctcaacgt
61 aacactttac agcggcgcgt catttgatat gatgcgcccc gcttcccgat aagggagcag
121 gccagtaaaa agcattaccc cgtggtgggg gttcccgaga
//
LOCUS M28575 169 bp ds-DNA BAD 04-AUG-1990
DEFINITION Figure 4. Sequence of tyrT DNA.
ACCESSION M28575
REFERENCE 1 (bases 1 to 169)
AUTHORS Drew,H.R. and Travers,A.A.
TITLE dna bending and its relation to nucleosome positioning
JOURNAL J. Mol. Biol. 186, 773-790 (1985)
STANDARD unannotated staff_entry
COMMENT
BASE COUNT 42 a 46 c 40 g 41 t
ORIGIN
1 aattccggtt acctttaatc cgttacggat gaaaattacg caaccagttc atttttctca
61 acgtaacact ttacagcggc gcgtcatttg atatgaagcg ccccgcttcc cgataaggga
121 gcaggccagt aaaaagcatt accccgtggt gggggttccc gagccccgg
//
LOCUS RCANIFR4 2583 bp ds-DNA BCT 04-AUG-1990
DEFINITION Rhodobacter capsulatus nitrogen fixation (nifR4 and nifA) genes,
5' end.
ACCESSION X12358
KEYWORDS nifA gene; nifR4 gene; nitrogen fixation.
SOURCE Rhodobacter capsulatus (strain SB1003) DNA, clone pRCN200.
ORGANISM Rhodobacter capsulatus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 2583)
AUTHORS Jones,R. and Haselkorn,R.
TITLE The DNA Sequence of the Rhodobacter capsulata ntrA, ntrB and ntrC
gene analogues required for nitrogen fixation
JOURNAL Mol. Gen. Genet. (1988) In press
STANDARD simple automatic
COMMENT See X07567 for overlapping sequence
[1] Author address Jones R.,
University of Southern California,
Mathematics Department DRB289
University Park
Los Angeles, CA 90089-1113 USA. Submitted (15-JUL-1988) on tape
to the EMBL data library by:
FEATURES from to/span description
pept 644 1924 nifR4 protein
pept 2302 > 2583 nifA protein
BASE COUNT 406 a 906 c 835 g 436 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttgctt gcggggcccg ttacggttgg cacgcttctt gctggatcat ttctgaagcc
61 tcaaccgatc aaaaaggggg atgcgatgaa ggatctcttc gacgaaagcc tgacgctgga
121 caccgcgctg ccgcgcccca ggcaccgccc gccgccgcgc cctggcgccg gcaggccgcg
181 gcgaaggccc ccgacaccgc cgccttcctt gcgaatttcg tcaggatcgg agagatcgcc
241 gccccgaaac ccccggccgc ggcctgtttc cgcccgagga agaagccgcg gtcgtcgccg
301 agctgatcgc cgagatgcgc ccgatgttcc agcgcgacgg cggcgacatc gagctgatcg
361 gcctcaccgg cgccaccgtt caggtgcggc tttcgggcag ctgcgcgggc tgcatgatgt
421 cggcgcgcac gctttccacg gtgcaacacc agctgatcga aaccctgggc cgccggtccg
481 ggtcgtcccc gagatccggc actgagatga tcgcccgccg ccccgcccaa aaacacggca
541 caacccaggg tggcgcggcg cggatttcgg cacctgaccc tttcttcacc tgcggcgccc
601 tgccgcccca cgcaatgacc cgagtttgag ataggttgtc cccatggaac tcgcgcagac
661 gctttcccaa agacagacga tgcagatggc ggggcagatg ctccattccc tcgccatcct
721 tggcatgtcg tcgcaggatt tgtccgagca tctgaccgaa caggccacat cgaacccctt
781 tctgacctac cgcgccccgc ccgccttcat cgcccgcggc ggcgaggatt tcgacgccgt
841 gggcgcggtg gccgcgcaca agcccagcct gatggcccat gtggtcgatc agatcgagat
901 ggccttcacc gagacccccg accggctgct ggcgctgcgc tttgccgagg cgctggagcc
961 gagcggctgg ctgggccaga gcctcgacag catcgcgctg gcggcgggcg tgtcgctttc
1021 gcgcgccgaa tcgatgctgg cggtcctgca gggcttcgag cctaccggcc ttttcgcccg
1081 cgacctgtcg gattgcctga tcctgcaggc gcgcgaggcc gacatcctga cctgggaagt
1141 cgaaacgctg atccgcaaca ttcggctgat cgcggaaaac cggctgtccg atctggccga
1201 tctttgcgat tgcgacatcg gcgacatccc cgaaatcatc aagcagatcc ggcacttgaa
1261 cccgaagccg gggcttgctt tcgaccatca gccgacgccc gttttcccgc ccgatctgat
1321 cgccgtgcgc ggcgccgagg gctggacggt ggaactcaac cgcgccacct cgcccacgat
1381 caccgtgcgc gaagatcgtt tcgccgatgg caccgccgat gcgaaggcgc gcgcggaacg
1441 gcgaaggcgt ggccgaggcc cgggcgctgg cgaggcgctg gagcgacgcc gcgatacgct
1501 gttgcgcacc gccgccgtgc tggtggcgcg gcaatcggcc ttcctcgaca aggggcccgc
1561 ccatctggtg ccgctgacgc tggaggatgt ggcctcggaa ctgggcctgc atgcctcgac
1621 gatttcgcgc gcggtctcgg ggcggatgat ccagacccag acccgggcgc tgccgctgcg
1681 ggcgttcttt tcccgcgcgg tctcgaccca gggcggtggc gaggcggtgt cgcgcgacag
1741 ccttgatttc gtgcagcgca cgtgggcggc gaagatccgc caaaacccgc tgtcggatga
1801 tgcgatcgtg acgctcgccg aacgcgccgg gctgcggatc gcgcggcgga cggtggcgaa
1861 ataccgctcg acgctgggcc ttgcctcctc ctacgagcgt cgccgcgccg ccgccgcgcg
1921 ctgagtctgc gaaacgcgac ctctgccaga ccggaagcgc catttttttc ggcgctccgg
1981 ccattcccgc ctcaaagttg atcttaacga agcctcaatc acccccaagt tgcgacatat
2041 tggcacctga gactgtttgc gccgtgcatc cgggcgcaag cttgaccaag tctcgacatg
2101 tccgccacgg cacccaaccg cgatcgaacg atccacgcag cgcagactgc ggcaccccat
2161 gggagggggt gatcgtcgtt tgatcccccg gcgccgggca agcgagggag aacagtgatg
2221 atcgacatcc gggaccggct cgtcccccaa ccgcagcaag acatcgctcg gcgcgggcca
2281 cggccgaccg ccttgccttg gatgcgcttt acgagatcgc gaagaccttt gcggcggccg
2341 gatccggtgg cgaggtgccg agatcttcaa cgtgctgtcg tctttcctcg atctgcgcca
2401 tggggtgctg gcgttctggc cgagccgggc gagggcgcgg gggtgaaccc ctatgtcatc
2461 gccgcgacgg cctttcagcg cagccccgag gcccccgccg cggatgtgct gcccgatgcc
2521 gtggcgcgga tcgtctttcg ctccggcgtg cctttcgtca gctttgatct ggccgccgaa
2581 ttc
//
LOCUS ACTCPA 634 bp ds-DNA PLN 04-AUG-1990
DEFINITION A.deliciosa actinidin gene, 5' end.
ACCESSION M35795
KEYWORDS actinidin.
SOURCE A.deliciosa (var. deliciosa, strain cv.Hayward) leaf DNA, clone
pKIWI 37.
ORGANISM Actinidia deliciosa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Theales; Actinidiaceae.
REFERENCE 1 (bases 1 to 634)
AUTHORS Keeling,J., Maxwell,P. and Gardner,R.C.
TITLE Nucleotide sequence of the promoter region from kiwifruit actinidin
genes
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Keeling, 05-JUL-1990.
Author address: J.Keeling
Centre for Gene Technology
Dept of Cellular and Molecular Biology
University of Auckland
Private Bag
Auckland
NEW ZEALAND
email: SYSTEM%CELVAX.AUKUNI.AC.NZ
FEATURES from to/span description
pept 595 > 634 actinidin
mRNA 558 > 634 actinidin mRNA
signal 507 514 TATA box
BASE COUNT 245 a 120 c 71 g 198 t
ORIGIN
1 caaaccgtag ttacaaggtg ggcctgaacc agtttgctga cctgactgat gaggagttcc
61 gataaattag ctttcttttt ttgtcttcac ctattttctt tttcaaattt gatttacatt
121 tgtacattta aaattatttt cgccaagaaa aaccgaataa tataaaaatt tgatcaaaca
181 attaaaacaa ttcattaaag gcgaaaaata taaaaaccaa atcaactaaa aaatatcaac
241 aaaaaaataa ctaatcaaaa ctgggcataa atagaaagga caatttaata attttgtata
301 ataaaaacaa aaataaagta tttaaagttg tggttggata aggatttaaa gaagaaaaaa
361 aattaaatct aaatcattga aatttaattt tatatttttt ttctcttttt tctactgaat
421 ctgcagttcc aacagaacct ttaaaaaaaa ttgtgaaaat cattttttca aatgtcgtaa
481 gacccccccc accccccacg caccctatat aaaggccact ctctccctcc acattcacac
541 acctccaatc ccaatctttt tcttctaaaa ttcaaaaaac gagagagaac aaaaatgggt
601 ttgcccaaat ccttcgtgtc aatgtctctc ctct
//
LOCUS BOVAPOAI 933 bp ss-mRNA MAM 04-AUG-1990
DEFINITION B.taurus apolipoprotein A-I mRNA, complete cds.
ACCESSION M35870
KEYWORDS apolipoprotein A-I.
SOURCE B.taurus liver, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 933)
AUTHORS O'hUigin,C., Chan,L. and Li,W.-H.
TITLE Cloning and sequencing of bovine apolipoprotein A-I cDNA and
molecular evolution of apolipoproteins A-I and B-100
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C. O'hUigin, 27-JUN-1990.
FEATURES from to/span description
pept 73 870 apolipoprotein A-I precursor
sigp 73 127 apolipoprotein A-I signal peptide
matp 145 867 apolipoprotein A-I
signal 914 919 poly-A signal
BASE COUNT 201 a 269 c 313 g 150 t
ORIGIN
1 cgccaattac aggcctcgaa agagtcctgt attgttattt ttcgtcacta cctccccggg
61 tcgggattca agatgaaagc cgtggtgctg accttggctg tgctcttcct gacggggagc
121 caggctcggc atttctggca gcaagatgac ccccagtcat cctgggatcg ggtgaaggat
181 tttgccaccg tgtatgtgga agcaatcaag gatagtggca gagactatgt ggcccaattc
241 gaagcctccg ctttgggaaa acagctcaac ctgaaactcc tggacaactg ggacaccctg
301 gccagcacgt tgtccaaagt gcgtgaacag ctgggcccag tgacccagga gttctgggac
361 aacctggaaa aggagaccgc gtcgctgagg caggagatgc acaaggacct ggaggaggtg
421 aagcagaagg tgcagcccta cctggacgag ttccagaaga agtggcacga ggaggtggag
481 atctaccgcc agaaggtggc gccgctgggc gaggagtttc gcgagggcgc gcgccagaag
541 gtgcaggagc tgcaggacaa gctgagccct ctggcccagg agctccgcga ccgcgcgcgc
601 gcccacgtgg agacgctgcg gcagcacgtg gcgccctaca gcgacgacct gcgccagcgg
661 ctgaccgcgc gcctggaggc gctcaaggag ggcggcggca gcctggccga gtaccacgcc
721 aaggccagcg agcagctgaa ggcgctgggc gagaaggcca agccggtgct ggaggacctc
781 cgccagggcc tgctgcccgt gctggagagc ctcaaggtca gcatcctggc cgccatagac
841 gaggcctcca agaagctgaa cgcccagtga gatgcctcgg gctgccccgc tggccatcac
901 ttcggtttct tagaataaac atttccggag tgg
//
LOCUS DOGINVOLA 864 bp ds-DNA MAM 04-AUG-1990
DEFINITION Dog involucrin gene, complete cds.
ACCESSION M34442
KEYWORDS involucrin.
SOURCE Dog adult skin keratinocyte DNA.
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 864)
AUTHORS Tseng,H. and Green,H.
TITLE The involucrin genes of pig and dog: Comparison of their segments
of repeats with those of prosimians and higher primates
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Tseng and H.Green, 23-MAY-1990.
FEATURES from to/span description
pept 1 858 involucrin
rpt 232 513 repeat segment
BASE COUNT 286 a 226 c 249 g 103 t
ORIGIN
1 atgtcccagc aacacactct gccagtgacc ctgccccctg ccctcagtca ggggcccctc
61 aagcctgagt ctcctcccat cgatactcaa caagagcagg tgaaacagcc gacttcacta
121 cctgtcctgt gccaaaaaat gccctcttct ctcccaggga aggttccctt ggggcatggg
181 gagaaacata caagtcttgt gaagggggaa cctgagcaac agtgtgagcc acaggagcag
241 gaacagcagc agaaacaaca agagtcacag gagcagaaac tgcacctgga acagtgtctg
301 gaacagcatc aggaacaaca agagtcacag gaccagaaac tgtacccgga gcagtgtctg
361 gaacagcagc aggagcaaca agagtcacag gaccagaaac tgtacccgga gcagtgtctg
421 gaacagcagc aggagcaaca agagtcacag gagaaggaac ttcatctgga acaggaacag
481 cagaaggagg agctacagca gcaggaacag cagcagggaa aggaacagtg tgagaaacat
541 caggaagcaa aaaatctgga gcagcagcta gagcagatag gagcacaaag gaagcagcag
601 caaaaggaac agctggaaca ggaaaagaag ctcgtggacc agcacctgga tcaagagcca
661 gcacagagaa ctgagcaacc ggaaaagaaa gaagagcagg tgctggagca gcaggggcag
721 caggaggggc agctggagca gcctgccttt gtcccagctc aggtccaggt ccgagaaccc
781 ctgaagggag aagtcctgcc cctcatagag cagcagcacc agaagcagga ggttcatgac
841 cctccagaac accagtgagc accc
//
LOCUS DRODMRA 360 bp ds-DNA INV 04-AUG-1990
DEFINITION D.melanogaster dispersed middle repetitive DNA sequence.
ACCESSION M16263
KEYWORDS dispersed middle repetitive DNA sequence.
SOURCE D.melanogaster DNA, clone p112-0.4RA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 360)
AUTHORS Waring,G.L. and Pollack,J.C.
TITLE Cloning and characterization of a dispersed, multicopy, X
chromosome sequence in Drosophila melanogaster
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987)
STANDARD simple staff_review
BASE COUNT 138 a 56 c 42 g 124 t
ORIGIN X chromosome.
1 ccaaatttaa atgcaaatcg atttggattt taataacaaa ctcaacgagg tatgacattc
61 catatttggt ctattatttt caatgttatg atcaaaatac caaatatttt tttcacaaaa
121 aaccaggaaa acgaaatttg gcaaaaaatt taatatttaa gtagagtttt ttgggtataa
181 cttgactaaa aatggtcaca caactaaaat aattaccatt ttgtgctcct aattaccaat
241 actaatcatc cccatcactt tttggcggac tttgaaaaat taatttttgc ccaaattttc
301 gcattttttg taaggggtaa ctcatcaaaa tttttaaaaa atgccaaaaa aataaagttc
//
LOCUS DRODMRB 354 bp ds-DNA INV 04-AUG-1990
DEFINITION D.melanogaster dispersed middle repetitive DNA sequence.
ACCESSION M16264
KEYWORDS dispersed middle repetitive DNA sequence.
SOURCE D.melanogaster DNA, clone pX-12R.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Waring,G.L. and Pollack,J.C.
TITLE Cloning and characterization of a dispersed, multicopy, X
chromosome sequence in Drosophila melanogaster
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987)
STANDARD simple staff_review
BASE COUNT 130 a 57 c 47 g 120 t
ORIGIN X chromosome.
1 ccaaatttga atgcaaatcg atttggattt aaaaaacaaa ctcaacgaag atgacattcc
61 atatttgggc cattattttc aatgttttga tcaaaatacc gattattttc ttctcaaaaa
121 attagggaaa cgagttccgg ccaaaaatgc atatttaaga ttggtttttt gggtatactt
181 ggctaaaatg gtcagacagc aaaaaaatta ccatttattc tcctataaca aatactaacc
241 acccctttca cttttaaacg gattttgtaa aattaatttt tggccaaatt ttcgcatttt
301 ttttaagggg taccatcata aaaatttgca aaaaattgaa aaaatctaga attc
//
LOCUS DRODMRC 352 bp ds-DNA INV 04-AUG-1990
DEFINITION D.melanogaster dispersed middle repetitive DNA sequence.
ACCESSION M16265
KEYWORDS dispersed middle repetitive DNA sequence.
SOURCE D.melanogaster DNA, clone pX-30 P-0.4R.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 352)
AUTHORS Waring,G.L. and Pollack,J.C.
TITLE Cloning and characterization of a dispersed, multicopy, X
chromosome sequence in Drosophila melanogaster
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987)
STANDARD simple staff_review
BASE COUNT 128 a 64 c 42 g 118 t
ORIGIN X chromosome.
1 ccaaacttaa atgcaaatcg attgggattc aaaaaacaaa ctcaacgagg tatgacattc
61 catatttggg ccattatttc catgttttca tcaaaatacc gaatttttat cacaaaaaat
121 ctgggaaact atttttggca aaaactcaat tttcaagttg gctcttttgg ctataactga
181 ctaaaaatgg tcacagagca aaaattagta ccattttata ctccttataa ccaatactaa
241 ccaccccttt cactttttaa ctgatttagt aaaatttaat tttggccaaa ttttcgcatt
301 ttttgtaagg gtaacatcat aaatttgcaa aaaattgaaa aatcctagat tc
//
LOCUS DROTMLPA 5546 bp ss-mRNA INV 04-AUG-1990
DEFINITION D.melanogaster serrate (Ser) mRNA, complete cds.
ACCESSION M35759
KEYWORDS Ser gene; serrate.
SOURCE D.melanogaster pupa, cDNA to mRNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 5546)
AUTHORS Fleming,R.J., Scottgale,T.N., Diederich,R.J. and
Artavanis-Tsakonas,S.
TITLE The gene serrate encodes a putative EGF-like transmembrane protein
essential for proper ectodermal development in Drosophila
melanogaster
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.J.Fleming, 28-JUN-1990.
Author address: R.J.Fleming
Yale University
Dept of Biology
KBT 1104
New Haven, CT 06511
email: WHAKRIA@YALEVM
FEATURES from to/span description
pept 443 4657 serrate protein precursor
sigp 443 679 serrate protein signal peptide
matp 680 4657 serrate protein
mRNA < 1 5546
BASE COUNT 1384 a 1446 c 1504 g 1212 t
ORIGIN Chromosome 3 97F.
1 ccgagtcgag cgccgtgctt cgagcggtga tgagcccctt ttctgtcaac gctaaagatc
61 tacaaaacat cagcgcctat caagtggaag tgtcaagtgt gaacaaaaca aaaacgagag
121 aagcacatac taaggtccat ataaataata aataataatt gtgtgtgata acaacattat
181 ccaaacaaaa ccaaacaaaa cgaaggcaaa gtggagaaaa tgatacagca tccagagtac
241 ggccagttat tcagctatcc agagcaagtg tagtgtggca aaatagaaac aaacaaaggc
301 accaaaatct gcatacatgg gctaattaag gctgcccagc gaatttacat ttgtgtggtg
361 ccaatccaga gtgaatccga aacaaactcc atctagatcg ccaaccagca tcacgctcgc
421 aaacgccccc agaatgtaca aaatgtttag gaaacatttt cggcgaaaac cagctacgtc
481 gtcgtcgttg gagtcaacaa tagaatcagc agacagcctg ggaatgtcca agaagacggc
541 gacaaaaagg cagcgtccga ggcatcgggt acccaaaatc gcgaccctgc catcgacgat
601 ccgcgattgt cgatcattaa agtctgcctg caacttaatt gctttaattt taatactgtt
661 agtccataag atatccgcag ctggtaactt cgagctggaa atattagaaa tctcaaatac
721 caacagccat ctactcaacg gctattgctg cggcatgcca gcggaactta gggccaccaa
781 gacgataggc tgctcgccat gcacgacggc attccggctg tgcctgaagg agtaccagac
841 cacggagcag ggtgccagca tatccacggg ctgttcgttt ggcaacgcca ccaccaagat
901 actgggtggc tccagctttg tgctcagcga tccgggtgtg ggagccattg tgctgccctt
961 tacgtttcgt tggacgaagt cgtttacgct gatactgcag gcgttggata tgtacaacac
1021 atcctatcca gatgcggaga ggttaattga ggaaacatca tactcgggcg tgatactgcc
1081 gtcgccggag tggaagacgc tggaccacat cgggcggaac gcgcggatca cctaccgtgt
1141 ccgggtgcaa tgcgccgtta cctactacaa cacgacctgc acgaccttct gccgtccgcg
1201 ggacgatcag ttcggtcact acgcctgcgg ctccgagggt cagaagctct gcctgaatgg
1261 ctggcagggc gtcaactgcg aggaggccat atgcaaggcg ggctgcgacc ccgtccacgg
1321 caagtgcgat cgtccggggg aatgcgaatg cagacccggc tggcgtggtc cattgtgcaa
1381 cgagtgcatg gtctatcccg gctgcaagca tggttcctgc aacggcagcg cctggaaatg
1441 cgtgtgcgac accaactggg gtggcatatt gtgcgatcaa gatttaaatt tctgcggcac
1501 ccatgaaccc tgcaagcacg gcggcacctg cgaaaatacc gctccggaca aatatcggtg
1561 cacatgcgcc gagggcctct cgggcgagca gtgcgagatc gtggagcacc catgtgccac
1621 caggccatgc cgcaacggcg gcacatgcac actcaagacg agtaaccgaa ctcaagccca
1681 agtgtatcgc acatcacatg gcaggagcaa catgggccgg ccggtaagac gcagcagttc
1741 gatgcgcagc ctggatcacc tgcggccgga ggggcaggcg ctgaatggca gcagctcctc
1801 gggattggtg tccctaggtt cgctgcagct gcagcagcaa ctggcccccg acttcacttg
1861 cgactgcgca gccggatgga cgggaccgac atgcgaaata aatatcgacg agtgcgccgg
1921 gggtccctgc gagcatggtg gcacttgcat cgatctaatc ggtggctttc gatgtgaatg
1981 tccgccggag tggcatggcg atgtctgtca ggtggatgtg aacgagtgcg aggcgccgca
2041 ttccgccgga atcgctgcga acgcattgct gaccaccaca gccaccgcga ttattggtag
2101 taatctgagc agtactgctc ttctggccgc tctgaccagt gcagtggcat ccacatcctt
2161 ggccatcgga ccctgcatca atgccaagga gtgtcgcaat cagccgggtt cctttgcctg
2221 catctgcaag gagggctggg gcggagtgac ctgtgccgag aatctagatg actgtgtggg
2281 tcagtgccgg aatggagcca cctgcattga tctggtcaac gactataggt gcgcctgtgc
2341 ctctggattc acgggtcgcg attgcgagac ggacatagac gagtgcgcca cttccccgtg
2401 ccgaaacgga ggcgaatgtg tggacatggt gggcaaattc aattgcattt gcccacttgg
2461 ctactcgggt tctctgtgcg aggaggccaa ggagaactgc acaccgtcgc catgtttgga
2521 gggtcactgc ctcaacacgc ccgaaggata ctactgccat tgtccaccgg atcgcgccgg
2581 aaagcactgc gagcaactgc gtccgctctg ctcccagccg ccctgcaacg agggctgctt
2641 cgccaatgtc agcctagcga cgtcagcgac aacgacgacg acaaccacca caacggcgac
2701 aacgacaagg aagatggcca agccaagcgg attgccctgc agcggacacg gcagctgcga
2761 gatgagcgac gtgggcacct tctgcaaatg ccatgtgggc cacaccggca ccttctgcga
2821 gcacaatctc aacgaatgct cgccgaatcc ttgtcgaaat gggggaattt gccttgacgg
2881 cgacggcgat tttacatgcg agtgcatgtc gggctggaca ggtaaacgct gctcggagcg
2941 cgctacaggt tgttatgccg gtcagtgcca gaatggtggt acctgcatgc ctggagcccc
3001 ggacaaggct ctgcagccgc attgccgctg tgcgccaggt tggactggtc tgttttgcgc
3061 cgaggctatt gaccagtgtc gcgggcagcc gtgccacaat ggcggaacgt gcgagtcggg
3121 agcgggctgg ttccgctgcg tctgcgctca gggattctct ggtccagact gccgcatcaa
3181 tgtgaacgag tgctcgccac agccttgcca gggcggtgcc acctgcatcg acggaatcgg
3241 tggatacagc tgcatctgcc caccaggaag gcatggattg cggtgtgaaa ttttgctctc
3301 cgatcccaag tccgcctgcc agaacgcaag caacactatc tctccgtata cagctctaaa
3361 ccgaagccaa aactggctgg atattgctct aaccggaaga acagaagacg atgagaactg
3421 caatgcgtgt gtctgcgaaa acggcacctc tcggtgcacg aatctctggt gtggattgcc
3481 caattgctat aaggtggatc cgctctccaa gtcctcgaat ctgtccggtg tttgcaaaca
3541 gcacgaggtg tgcgttccgg cactgagtga gacatgcctg tcatcgcctt gtaatgttcg
3601 tggagattgc cgggcactgg aaccatcgcg tcgggttgct ccaccccgac tgccagccaa
3661 atctagctgc tggcccaatc aggccgtggt caacgagaac tgcgcccgac tcaccatcct
3721 tttggccctg gagcgagtgg gcaagggagc ttcggtggag ggtctctgct ccctggtaag
3781 ggtgctgctg gctgcccagt tgatcaagaa gccggcgagt acttttggcc aggatccggg
3841 aatgcttatg gtgctctgcg atctcaaaac gggcaccaat gataccgttg aactaactgt
3901 gtcgtccagt aaattaaatg atccccagct gccagtggcg gtgggtctgc tgggtgaact
3961 cctgagctcc aggcagttga atggcatcca gcggcgcaag gaactggagc tgcagcatgc
4021 aaaattggct gccctcacct ccattgtgga ggtcaagttg gaaacggccc gcgtggccga
4081 tggatcgggt catagtctgc tgataggagt gctatgcggt gtctttatag tcctggtggg
4141 attctcggtg ttcatcagtc tttactggaa acagcgtctg gcttatcgca ccagttcggg
4201 aatgaactta actccctccc tggatgcact gcgtcacgag gaggagaagt cgaataatct
4261 gcagaacgag gagaatctgc gaaggtatac aaatccgctg aagggcagca ccagttccct
4321 aagagcggcc accggcatgg aactaagcct caatcccgct ccggaattag ccgcctcggc
4381 ggcgagtagt tccgccttgc acagatcgca gccactattc ccgccatgcg atttcgagcg
4441 tgagctggac tccagtacgg gcctgaagca ggcgcacaag cggagctcac agattctgct
4501 gcacaaaacc caaaactcgg acatgcggaa gaacactgtg ggctcgctgg acagtccgcg
4561 taaggacttt ggcaagcggt cgatcaactg caagtccatg ccaccctctt cgggcgacga
4621 gggctccgat gtccttgcca ccactgtgat ggtttagccg tgatctcacc aaccaaccaa
4681 tcaagaaacc aaccagccgc ccacagccag ctcaaagttc caattgccac agcacgggcg
4741 ctatttccaa gtgcattagt agcgtaatta aaactaggat attgttaagg ataccaaggt
4801 aggccacaac ggagtggctc tgttgaaaac gtaaagttct aaaaatccag gtctctcaga
4861 caaagatgag gtacacaaat aaattggcta gttaatcaag catgttatgg ccacgggatg
4921 ggcaaattta tttgtatacc tgatcttatc ttaatactaa accagttttc tactattttt
4981 tttttgtgga tcaagcttaa aagttcagct aggcaggcgt tttccgcagt gccatgtcga
5041 tgtggaagcc caaaatattt aggttagata gtgtaatttc gaactcttct cttcgctaag
5101 caacatccta cacagtgtga tatttagtgt aacccaggcg cgcatttaca ttcaattaaa
5161 gacaatgata tataaatata aacgaaatca actccttggc tagcacaagc tgtatgtata
5221 tagttctcat ttaggatcgt cgcgctctat attgtgtata agctgtaaat actgtaaatt
5281 agcagttacc gttattgtat tttgtctata gttagattgg tactattaaa ctaagaacca
5341 gccgcaacgc gttagacttt aaaagttgtt tgcaattgta cgcaataata tagttttatg
5401 ctcgtagtta ggtagctgtg taaccgggta agattcaaac gattttgtac tgtattatat
5461 acctatctgt gtagtaatat ttatttatta tattaaattt gatctagacg caataaagta
5521 atatcaataa agatagtaaa agacat
//
LOCUS ECOLRRPA 495 bp ds-DNA BCT 04-AUG-1990
DEFINITION E.coli leucine-responsive-regulatory protein (Lrp) gene,
complete cds.
ACCESSION M35869
KEYWORDS Lrp gene; leucine-responsive-regulatory protein.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 495)
AUTHORS Willins,D.A., Platko,J.V., Ryan,C. and Calvo,J.M.
TITLE Lrp, a small, basic global regulatory protein of E.coli
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.M.Calvo, 27-JUN-1990.
Author address: J.M.Calvo
Cornell University
Dept of Biochemistry
Room 451
Biotechnology Bldg.
Ithaca, NY 14853
email:DEKY@CRNLVAX5
FEATURES from to/span description
pept 1 495 leucine-responsive-regulatory protein (Lrp)
BASE COUNT 127 a 110 c 128 g 130 t
ORIGIN 20 min. on E.coli map.
1 atggtagata gcaagaagcg ccctggcaaa gatctcgacc gtatcgatcg taacattctt
61 aatgagttgc aaaaggatgg gcgtatttct aacgtcgagc tttctaaacg tgtgggactt
121 tccccaacgc cgtgccttga gcgtgtgcgt cggctggaaa gacaagggtt tattcagggc
181 tatacggcgc tgcttaaccc ccattatctg gatgcatcac ttctggtatt cgttgagatt
241 actctgaatc gtggcgcacc ggatgtgttt gaacaattca ataccgctgt acaaaaactt
301 gaagaaattc aggagtgtca tttagtatcc ggtgatttcg actacctgtt gaaaacacgc
361 gtgccggata tgtcagccta ccgtaagttg ctgggggaaa ccctgctgcg tctgcctggc
421 gtcaatgaca cacggacata cgtcgttatg gaagaagtca agcagagtaa tcgtctggtt
481 attaagacgc gctaa
//
LOCUS HUMA1AT 143 bp ss-mRNA PRI 04-AUG-1990
DEFINITION Human alpha-1-antitrypsin (alpha-1-AT) mRNA, 3' end.
ACCESSION M26123
KEYWORDS alpha-1-antitrypsin.
SOURCE Human adult liver, cDNA to mRNA, clone PJN1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 143)
AUTHORS Coutelle,C., Speer,A., Rogers,J., Kalsheker,N., Humphries,S. and
Williamson,R.
TITLE Construction and partial characterization of a human liver cDNA
library
JOURNAL Biomed. Biochim. Acta 44, 421-431 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 101 alpha-1-antitrypsin (alpha-1-AT; AA at 3)
/hgml_locus_uid="LX0081X" /nomgen="PI"
/map="14q32.1"
BASE COUNT 37 a 53 c 19 g 34 t
ORIGIN
1 ccgaggtcaa gttcaacaaa ccctttgtct tcttaatgat tgaccaaaat accaagtctc
61 ccctcttcat gggaaaagtg gtgaatccca cccaaaaata actgcctctc gctcctcaac
121 ccctcccctc catccctggc ccc
//
LOCUS HUMGAPDHG 5378 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene,
complete cds.
ACCESSION J04038
KEYWORDS glyceraldehyde-3-phosphate dehydrogenase.
SOURCE Human DNA.
REFERENCE 1 (bases 1 to 5378)
AUTHORS Ercolani,L., Florence,B., Denaro,M. and Alexander,M.
TITLE Isolation and complete sequence of a functional human
glyceraldehyde-3-phosphate dehydrogenase gene
JOURNAL J. Biol. Chem. 263, 15335-15341 (1988)
STANDARD simple staff_entry
REFERENCE 2 (bases 650 to 1157)
AUTHORS Nasrin,N., Ercolani,L., Denaro,M., Kong,X.F., Kang,I. and
Alexander,M.
TITLE An insulin response element in the glyceraldehyde-3-phosphate
dehydrogenase gene binds a nuclear protein induced by insulin in
cultured cells and by nutritional manipulations in vivo
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5273-5277 (1990)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
pept 1453 1481 glyceraldehyde-3-phosphate dehydrogenase, exon
2 (first expressed exon) (EC 1.2.2.12)
3116 3215 glyceraldehyde-3-phosphate dehydrogenase, exon
3
3306 3412 glyceraldehyde-3-phosphate dehydrogenase, exon
4
3542 3632 glyceraldehyde-3-phosphate dehydrogenase, exon
5
3723 3838 glyceraldehyde-3-phosphate dehydrogenase, exon
6
3931 4012 glyceraldehyde-3-phosphate dehydrogenase, exon
7
4206 4618 glyceraldehyde-3-phosphate dehydrogenase, exon
8
4723 4792 glyceraldehyde-3-phosphate dehydrogenase, exon
9
pre-msg 1138 4993 GAPDH mRNA and introns
IVS 1190 1429 GAPDH intron A
IVS 1482 3115 GAPDH intron B
IVS 3216 3305 GAPDH intron C
IVS 3413 3541 GAPDH intron D
IVS 3633 3722 GAPDH intron E
IVS 3839 3930 GAPDH intron F
IVS 4013 4205 GAPDH intron G
IVS 4619 4722 GAPDH intron H
site 650 729 insulin response element A [2]
site 729 867 insulin response element B [2]
site 1049 1071 insulin response element A [2]
signal 884 892 CAAT box
signal 1108 1113 TATA box
site 1447 1456 Kozak consensus sequence
revision 877 879 ccc in [2]; cc in [1]
BASE COUNT 994 a 1642 c 1627 g 1115 t
ORIGIN
1 ggatcccctg ctgggagggg gcaggggacc tgttcccacc gtgtgcccaa gacctctttt
61 cccacttttt ccctcttctt gactcaccct gccctcaata tcccccggcg cagcagtgaa
121 agggagtccc tggctcctgg ctcgcctgca cgtcccaggg cggggaggga cttccgccct
181 cacgtcccgc tcttcgcccc aggctggatg gaatgaaagg cacactgtct ctctccctag
241 gcagcacagc ccacaggttt caggagtgcc tttgtgggag gcctctgggc ccccaccagc
301 catcctgtcc tccgcctggg gccccagccc ggagagagcc gctggtgcac acagggccgg
361 gattgtctgc cctaattatc aggtccaggc tacagggctg caggacatcg tgaccttccg
421 tgcagaaacc tccccctccc cctcaagccg cctcccgagc ctccttcctc tccaggcccc
481 cagtgcccag tgcccagtgc ccagcccagg cctcggtccc agagatgcca ggagccagga
541 gatggggagg gggaagtggg ggctgggaag gaaccacggg cccccgcccg agcccatggg
601 cccctcctag gcctttgcct gagcagaccg gtgtcactac cgcagagcct cgaggagaag
661 ttccccaact ttcccgcctc tcagcctttg aaagaaagaa aggggagggg gcaggccgcg
721 tgcagccgcg agcggtgctg ggctccggct ccaattcccc atctcagtcg ttcccaaagt
781 cctcctgttt catccaagcg tgtaagggtc cccgtccttg actccctagt gtcctgctgc
841 ccacagtcca gtcctgggaa ccagcaccga tcacctccca tcgggccaat ctcagtccct
901 tccccctacg tcggggccca cacgctcggt gcgtgcccag ttgaaccagg cggctgcgga
961 aaaaaaaaag cggggagaaa gtagggcccg gctactagcg gttttacggg cgcacgtagc
1021 tcaggcctca agaccttggg ctgggactgg ctgagcctgg cgggaggcgg ggtccgagtc
1081 accgcctgcc gccgcgcccc cggtttctat aaattgagcc cgcagcctcc cgcttcgctc
1141 tctgctcctc ctgttcgaca gtcagccgca tcttcttttg cgtcgccagg tgaagacggg
1201 cggagagaaa cccgggaggc tagggacggc ctgaaggcgg caggggcggg cgcaggccgg
1261 atgtgttcgc gccgctgcgg ggtgggcccg ggcggcctcc gcattgcagg ggcgggcgga
1321 ggacgtgatg cggcgcgggc tgggcatgga ggcctggtgg gggaggggag gggaggcgtg
1381 tgtgtcggcc ggggccacta ggcgctcact gttctctccc tccgcgcagc cgagccacat
1441 cgctcagaca ccatggggaa ggtgaaggtc ggagtcaacg ggtgagttcg cgggtggctg
1501 gggggccctg ggctgcgacc gcccccgaac cgcgtctacg agccttgcgg gctccgggtc
1561 tttgcagtcg tatgggggca gggtagctgt tccccgcaag gagagctcaa ggtcagcgct
1621 cggacctggc ggagccccgc acccaggctg tggcgccctg tgcagctccg cccttgcggc
1681 gccatctgcc cggagcctcc ttcccctagt ccccagaaac aggaggtccc tactcccgcc
1741 cgagatcccg acccggaccc ctaggtgggg gacgctttct ttcctttcgc gctctgcggg
1801 gtcacgtgtc gcagaggagc ccctccccca cggcctccgg caccgcaggc cccgggatgc
1861 tagtgcgcag cgggtgcatc cctgtccgga tgctgcgcct gcggtagagc ggccgccatg
1921 ttgcaaccgg gaaggaaatg aatgggcagc cgttaggaaa gcctgccggt gactaaccct
1981 gcgctcctgc ctcgatgggt ggagtcgcgt gtggcgggga agtcaggtgg agcgaggcta
2041 gctggcccga tttctcctcc gggtgatgct tttcctagat tattctctgg taaatcaaag
2101 aagtgggttt atggaggtcc tcttgtgtcc cctccccgca gaggtgtggt ggctgtggca
2161 tggtgccaag ccgggagaag ctgagtcatg ggtagttgga aaaggacatt tccaccgcaa
2221 aatggcccct ctggtggtgg ccccttcctg cagcggctca cctcacggcc ccgcccttcc
2281 cctgccagcc tagcgttgac ccgaccccaa aggccaggct gtaaatgtca ccgggaggat
2341 tgggtgtctg ggcgcctcgg ggaacctgcc cttctcccca ttccgtcttc cggaaaccag
2401 atctccaccg caccctggtc tgaggtctga ggttaaatat agctgctgac ctttctgtag
2461 ctgggggcct gggctggggc tctctcccat cccttctccc cacacacatg cacttacctg
2521 tgctcccact cctgatttct ggaaaagagc taggaaggac aggcaacttg gcaaatcaaa
2581 gccctgggac tagggggtta aaatacagct tcccctcttc ccacccgccc cagtctctgt
2641 cccttttgta ggagggactt agagaagggg tgggcttgcc ctgtccagtt aatttctgac
2701 ctttactcct gccctttgag tttgatgatg ctgagtgtac aagcgttttc tccctaaagg
2761 gtgcagctga gctaggcagc agcaagcatt cctggggtgg catagtgggg tggtgaatac
2821 catgtacaaa gcttgtgccc agactgtggg tggcagtgcc cacatggccg cttctcctgg
2881 aagggcttcg tatgactggg ggtgttgggc agccctggag ccttcagttg cagccatgcc
2941 ttaagccagg ccagcctggc agggaagctc aagggagata aaattcaacc tcttgggccc
3001 tcctgggggt aaggagatgc tgcattcgcc ctcttaatgg ggaggtggcc tagggctgct
3061 cacatattct ggaggagcct cccctcctca tgccttcttg cctcttgtct cttagatttg
3121 gtcgtattgg gcgcctggtc accagggctg cttttaactc tggtaaagtg gatattgttg
3181 ccatcaatga ccccttcatt gacctcaact acatggtgag tgctacatgg tgagccccaa
3241 agctggtgtg ggaggagcca cctggctgat gggcagcccc ttcataccct cacgtattcc
3301 cccaggttta catgttccaa tatgattcca cccatggcaa attccatggc accgtcaagg
3361 ctgagaacgg gaagcttgtc atcaatggaa atcccatcac catcttccag gagtgagtgg
3421 aagacagaat ggaagaaatg tgctttgggg aggcaactag gatggtgtgg ctcccttggg
3481 tatatggtaa ccttgtgtcc ctcaatatgg tcctgtcccc atctcccccc caccccggta
3541 ggcgagatcc ctccaaaatc aagtggggcg atgctggcgc tgagtacgtc gtggagtcca
3601 ctggcgtctt caccaccatg gagaaggctg gggtgagtgc aggagggccc gcgggagggg
3661 aagctgactc agccctgcaa aggcaggacc cgggttcata actgtctgct tctctgctgt
3721 aggctcattt gcagggggga gccaaaaggg tcatcatctc tgccccctct gctgatgccc
3781 ccatgttcgt catgggtgtg aaccatgaga agtatgacaa cagcctcaag atcatcaggt
3841 gaggaaggca gggcccgtgg agaagcggcc agcctggcac cctatggaca cgctcccctg
3901 acttgcgccc cgctccctct ttctttgcag caatgcctcc tgcaccacca actgcttagc
3961 acccctggcc aaggtcatcc atgacaactt tggtatcgtg gaaggactca tggtatgaga
4021 gctggggaat gggactgagg ctcccacctt tctcatccaa gactggctcc tccctgctgg
4081 ggctgcgtgc aaccctgggg ttgggggttc tggggactgg ctttcccata atttcctttc
4141 aaggtgggga gggaggtaga ggggtgatgt ggggagtacg ctgcagggcc tcactccttt
4201 tgcagaccac agtccatgcc atcactgcca cccagaagac tgtggatggc ccctccggga
4261 aactgtggcg tgatggccgc ggggctctcc agaacatcat ccctgcctct actggcgctg
4321 ccaaggctgt gggcaaggtc atccctgagc tgaacgggaa gctcactggc atggccttcc
4381 gtgtccccac tgccaacgtg tcagtggtgg acctgacctg ccgtctagaa aaacctgcca
4441 aatatgatga catcaagaag gtggtgaagc aggcgtcgga gggccccctc aagggcatcc
4501 tgggctacac tgagcaccag gtggtctcct ctgacttcaa cagcgacacc cactcctcca
4561 cctttgacgc tggggctggc attgccctca acgaccactt tgtcaagctc atttcctggt
4621 atgtggctgg ggccagagac tggctcttaa aaagtgcagg gtctggcgcc ctctggtggc
4681 tggctcagaa aaagggccct gacaactctt ttcatcttct aggtatgaca acgaatttgg
4741 ctacagcaac agggtggtgg acctcatggc ccacatggcc tccaaggagt aagacccctg
4801 gaccaccagc cccagcaaga gcacaagagg aagagagaga ccctcactgc tggggagtcc
4861 ctgccacact cagtccccca ccacactgaa tctcccctcc tcacagttgc catgtagacc
4921 ccttgaagag gggaggggcc tagggagccg caccttgtca tgtaccatca ataaagtacc
4981 ctgtgctcaa ccagttactt gtcctgtctt attctagggt ctggggcaga ggggagggaa
5041 gctgggcttg tgtcaaggtg agacattctt gctggggagg gacctggtat gttctcctca
5101 gactgagggt agggcctcca aacagccttg cttgcttcga gaaccatttg cttcccgctc
5161 agacgtcttg agtgctacag gaagctggca ccactacttc agagaacaag gccttttcct
5221 ctcctcgctc cagtcctagg ctatctgctg ttggccaaac atggaagaag ctattctgtg
5281 ggcagcccca gggaggctga caggtggagg aagtcagggc tcgcactggg ctctgacgct
5341 gactggttag tggagctcag cctggagctg agctgcag
//
LOCUS HUMVARNA 156 bp ss-mRNA PRI 04-AUG-1990
DEFINITION Human integrated adenovirus 2 VA RNA.
ACCESSION M35961
KEYWORDS .
SOURCE Human adenovirus 2 infected KB cell line, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 156)
AUTHORS Ohe,K. and Weissman,S.M.
TITLE Nucleotide sequence of an RNA from cells infected with adenovirus 2
JOURNAL Science 167, 879-881 (1969)
STANDARD simple staff_review
BASE COUNT 26 a 46 c 53 g 31 t
ORIGIN
1 gggcactctt ccgtggtctg gtggataaat tcgcaagggt atcatggcgt ggacgaccgg
61 ggttcgaacc ccggatccgt gatccatgcg gttaccgtcc gccgcccgtg cgtcgaaccc
121 aggtgtgcga cgtcagacaa cgggggagcg ctcctt
//
LOCUS MUSGFAPA 2511 bp ss-mRNA ROD 04-AUG-1990
DEFINITION Mouse glial fibrillary acidic protein mRNA, 3' end.
ACCESSION M25937
KEYWORDS glial fibrillary acidic protein.
SOURCE Mouse brain, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2511)
AUTHORS Cowan,N.J., Lewis,S.A., Balcarek,J.M., Krek,V. and Shelanski,M.
TITLE Structural implications of a cDNA clone encoding mouse glial
fibrillary acidic protein
JOURNAL Ann. N.Y. Acad. Sci. 455, 575-582 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 1212 glial fibrillary acidic protein (AA at 1)
BASE COUNT 625 a 690 c 697 g 499 t
ORIGIN
1 ctgggtacca tgccacgttt ctccttgtct cgaatgactc ctccactccc tgccagggtg
61 gacttctccc tggccggggc gctcaatgct ggcttcaagg agacacgggc gagcgagcgt
121 gcagagatga tggagctcaa tgaccgcttt gctagctaca tcgagaaggt ccgcttcctg
181 gaacagcaaa acaaggcgct ggcagctgaa ctgaaccagc ttcgagccaa ggagcccacc
241 aaactggctg atgtctacca ggcggagctt cgggagctgc ggctgcggct ggaccagctt
301 acggccaaca gtgcccggct ggaggtggag agggacaact ttgcacagga cctcggcacc
361 ctgaggcaga agctccaaga tgaaaccaac ctgaggctgg aggcagagaa caacctggct
421 gcgtatagac aggaggcaca tgaagccacc ctggctcgtg tggatttgga gagaaaggtt
481 gaatcgctgg aggaggagat ccagttctta aggaagatct atgaggagga agttcgagat
541 ctccgggagc agctggccca acagcaggtc cacgtggaga tggatgtggc caagccagac
601 ctcacagcgg ccctgagaga gattcgcact caatacgagg cagtggccac cagtaacatg
661 caagagacag aggagtggta tcggtctaag tttgcagacc tcacagacgc tgcgtcccgc
721 aacgcagagc tcctccgcca agccaaacac gaagctaacg actatcgccg ccaactgcag
781 gccttgacct gcgatctgga gtccctccgc ggcacgaacg agtccctaga gcggcaaatg
841 cgcgaacagg aagagcgcca tgcgcgggag tcggccagtt accaggaggc acttgctcgg
901 ctggaggagg agggccaaag cctcaaggag gagatggccc gccacctgca ggagtaccag
961 gatctactca acgttaagct agccctggac atcgagatcg ccacctacag gaaattgctg
1021 gagggcgaag aaaaccgcat caccattcct gtacagactt tctccaacct ccagatccga
1081 gaaaccagcc tggacaccaa atccgtgtca gaaggccacc tcaagaggaa catcgtggta
1141 aagactgtgg agatgcggga tggtgaggtc attaaggact cgaagcagga gcacaaggac
1201 gtggtgatgt gaggtgtgcc acctggtggc ccttgccatg cagtgtgagg gcccaaagct
1261 tagcctcaaa taggcctgtt tgccaggctc agttcccacc cacaccagca cttcccttcc
1321 ttccctggtt ttctgcctgt gtgctgccca aggcgtcaat caggtcataa gcttcataga
1381 tggcatagat ggcatatacc cttcaccttc aactaacagg atactcaccc caaaggcgca
1441 gtcaggaggg gagggaaccc cagctggtta gaattggaag ggaagaggaa agatgagcag
1501 agtagagaga tttaacaaat cacttccttc atccttgttg ttatggaaac cgttgccaga
1561 gctggaagtt tccacaggct gctggagcta gacaacaatt cagacagaaa gggaaagtcc
1621 ctgaggcaaa gtctctctag ccagagacct atgcatcccg aatgccacta aggcagtcct
1681 gaagggccct ccaggtgatg actccagtgt gtcagcccca ctgagcagct atgcaggttg
1741 actgcccaca ggcatgtgaa acttggttct cagcacttgg cagggatcta tggcataagt
1801 ggagagggaa ggtgtactgg acggcggaga ggagggctcc ctggccccta agtgtggatg
1861 cagagaggtg gagcccagga agggtctctg cttaggctgc agggtaccga atggcagagg
1921 cactggtaga gatcatttgg acactcggag ttgaaagtta caggcaatct gttacacttg
1981 gctctgaatc ctatcaatca aggaaataac ccgttctctg gaagacactg aaacaggaga
2041 gcaggacttc cgtccactgg gcagggtaca gatgtgtctc agttgtgaag gtctattcct
2101 ggctgcacag tccccatccg ctcaggtcat cttacccctg tgactgctct cagccctgga
2161 agaatccaca accatccttc caaggttgtc catccccaca atgactagct gttgctctcc
2221 aagctaaggg accattccct gctctatgca tatacgtaat gtcacctatt tagtatcatc
2281 ctatttgaga gtttgaggaa ctgaaacact gttctgttca agcacctggt gctatgcctt
2341 catattagag caccttctct gaggctgatt ggtgggcagg tagggaagac attgagcaga
2401 cagtgtccgc tcagttgtcc ttccctccct tccaaggtcc ctccctcttt ccaggacatc
2461 gcccccccac cccacccctc ctttccacct ccgctaacct ccagacgcac t
//
LOCUS MUSHOX61A 318 bp ss-mRNA ROD 04-AUG-1990
DEFINITION Mouse homeobox-6.1 protein (Hox-6.1) mRNA, 3' end.
ACCESSION M35986
KEYWORDS Hox-6.1 gene; homeobox-6.1 protein.
SOURCE Mouse male kidney, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Sharpe,P.T., Miller,J.R., Evans,E.P., Burtenshaw,M.D. and
Gaunt,S.J.
TITLE Isolation and expression of a new mouse homeobox gene
JOURNAL Development 102, 397-407 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 318 homeobox-6.1 protein (AA at 1)
site 30 210 homeobox
BASE COUNT 94 a 80 c 100 g 44 t
ORIGIN Chromosome 14E2.
1 aatcacagtg gggtcggtta cggagcggac cggaggcgcg gccgccagat ctactctcgg
61 taccagaccc tggaactgga gaaggaattt cacttcaacc gctacctaac tcggcgccgg
121 cgcatcgaga tcgccaatgc tctgtgcctg accgagcgac agatcaaaat ctggttccag
181 aacgcgcgca tgaagtggaa aaaagaatct aatctcacgt ccacactctc agggggtggc
241 ggaggggcaa ccgccgacag cctgggagga aaggaggaaa agcgagaaga gacagaagaa
301 gagaagcaga aagagtga
//
LOCUS PEAVICIL 190 bp ds-DNA PLN 04-AUG-1990
DEFINITION P.sativum vicilin mRNA, partial cds.
ACCESSION M35978
KEYWORDS vicilin.
SOURCE P.sativum, cDNA to mRNA, clone pRC 2.2.1.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 190)
AUTHORS Gatehouse,J.A., Lycett,G.W., Croy,R.R.D. and Boulter,D.
TITLE The post-translational proteolysis of the subunits of vicilin from
pea (Pisum sativum L.)
JOURNAL Biochem. J. 207, 629-632 (1982)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 190 vicilin (AA at 2)
BASE COUNT 74 a 30 c 46 g 40 t
ORIGIN
1 agagatagag aagattctct tagaagagca tgagaaagag acacatcaca gaagaagcct
61 tagggataag agacaacaga gccaagaaaa gaatgtaata gtcaaagtat caaagcaagt
121 gcaaagttac aaagctaagt tgactccagg tgatgttttt gtaattccgg caggtcatcc
181 tgttgccgta
//
LOCUS PIGINVOLA 1204 bp ds-DNA MAM 04-AUG-1990
DEFINITION Pig involucrin gene, complete cds.
ACCESSION M34441
KEYWORDS involucrin.
SOURCE Pig (Yorkshire) adult skin keratinocyte DNA.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1204)
AUTHORS Tseng,H. and Green,H.
TITLE The involucrin genes of pig and dog: Comparison of their segments
of repeats with those of prosimians and higher primates
JOURNAL Mol. Biol. Evol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Tseng and H. Green, 23-MAY-1990.
FEATURES from to/span description
pept 1 1044 involucrin
rpt 247 831 repeat segment
BASE COUNT 377 a 311 c 363 g 153 t
ORIGIN
1 atgtcccagc aatacactct tccagtgacc ctgccccctg ccctcagtca ggagcccctc
61 aagcctgttt ctcctcccgc cgatacccag caggagcaag tgaaacaggc aactccgctg
121 cctgctccat gccagaagat gctctcggag ctcccagtgg aggtcccctt ggaggatgcg
181 gagaaacaca caactcttgt gaagggagtg cctgagcaag agtgtgagcc gcagccacag
241 gagccacagc agcaggaact gcatgtggaa cagcagcagc agcagcaaga atcacaggtg
301 caggaactgc atgtggatca gcagcagcag caacaagagt cacaggagca ggaactgcat
361 gtggatcagc agcagcagca gcaagagtca caggagcagg aactgcatgt ggatcagcag
421 cagcaacaag agtcacaggt gcaggaactg catgtgggtc atcatcagca gcagcaagag
481 tcacaggagc aggaactgca tgtggatcat catcagcagc agcaagagtc acaggagcag
541 gaactgcatg tggatcagca gcagcagcaa caagagtcac aggagcagga actgcatgtg
601 gatcagcagc agcagcagca agagtcacag gagcaggaac tgcatgtgga tcatcatcag
661 cagcagcaag agtcacaggt gcaggaactg catgtggatc atcagcagca gcagcaagag
721 tcacaggagc aggaactgca tgtggatcag caccagcagc agcaagagtc acaggagcag
781 gaactgcatg tggatcagca gcagcaggag ctccaggtgc aggaagtgca gcagcagcag
841 cagcagcagc aagaacagca ggaggaccat cagaaagcag aacacctgga gcaggaggaa
901 gcacaaaggg agcagcagct aaagggacag ctggaacagg agaagaaggg cgtgtaccag
961 cacctggatc aagagctaac aaagagagat gaacacctgg aaaagaaagg ggagcactgc
1021 tggagcagca ggagaagcct ctagagccag cagagcagca ggaggggcag ttgaaacagc
1081 ctgtgcttat tccagctcct ggccaggtcc aggagaccca ccccagtcca actgctgaag
1141 ggagaagttt tgccccctga gaagcaggac ataagtaacc acctctggac ataagtaacc
1201 accc
//
LOCUS FLANPB 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/Wilson-Smith/33 (H1N1) nucleoprotein (seg 5) mRNA,
complete cds.
ACCESSION M30746
KEYWORDS nucleoprotein.
SOURCE Influenza A /Wilson-Smith/33 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 507 a 318 c 409 g 331 t
ORIGIN
1 agcaaaagca gggtagataa tcactcacag agtgacatcg aaatcatggc gaccaaaggc
61 accaaacgat cttacgaaca gatggagact gatggagaac gccagaatgc cactgaaatc
121 agagcatctg tcggaaaaat gattggtgga attggacgat tctacatcca aatgtgcacc
181 gaacttaaac tcagtgatta tgagggacgg ctgattcaga acagcttaac aatagagaga
241 atggtgctct ctgcttttga cgagaggagg aataaatatc tagaagaaca tcccagtgcg
301 gggaaagatc ctaagaaaac tggaggacct atatacagga gagtagatgg aaagtggatg
361 agagaactca tcctttatga caaagaagaa ataagacgaa tctggcgcca agctaataat
421 ggtgacgatg caacggctgg tctgactcac atgatgatct ggcactccaa tttgaatgat
481 gcaacttacc agaggacaag agctcttgtt cgcacaggaa tggatcccag gatgtgctca
541 ctgatgcagg gttcaaccct ccctaggagg tctggggccg caggtgctgc agtcaaagga
601 gttggaacaa tggtgatgga attgatcaga atgatcaaac gtgggatcaa tgatcggaac
661 ttctggaggg gtgagaatgg acggagaaca aggattgctt atgaaagaat gtgcaacatt
721 ctcaaaggga aatttcaaac agctgcacaa agagcaatgg tggatcaagt gagagagagc
781 cggaatccag gaaatgctga gttcgaagat ctcatctttc tagcacggtc tgcactcata
841 ttgagagggt cagttgctca caagtcctgc ctgcctgcct gtgtgtatgg acctgccgta
901 gccagtggat acgactttga aagagaggga tactctctag tcggaataga ccctttcaga
961 ctgcttcaaa acagccaagt atacagccta atcagaccaa atgagaatcc agcacacaag
1021 agtcaactgg tgtggatggc atgccattct gctgcatttg aagatctaag agtatcaagc
1081 ttcatcagag ggacgaaagt ggtcccaaga gggaagcttt ccactagagg agttcaaatt
1141 gcttccaatg aaaacatgga gactatggaa tcaagtaccc ttgaactgag aagcagatac
1201 tgggccataa ggaccagaag tggagggaac accaatcaac agagggcttc ctcgggccaa
1261 atcagcatac aacctacgtt ctcagtacag agaaatctcc cttttgacag accaaccatt
1321 atggcagcat tcactgggaa tacagagggg agaacatctg acatgagaac cgaaatcata
1381 aggctgatgg aaagtgcaag accagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcagc gagcccgatc gtgccctcct ttgacatgag taatgaagga
1501 tcttatttct tcggagacaa tgcagaggag tacgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPC 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/swine/Iowa/15/30 (H1N1) nucleoprotein (seg 5) mRNA,
complete cds.
ACCESSION M30747
KEYWORDS nucleoprotein.
SOURCE Influenza A/swine/Iowa/15/30 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1565; revises [1])
AUTHORS Gorman,O.T.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 509 a 318 c 411 g 327 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcg aaatcatggc gtctcaaggc
61 accaaacgat catacgaaca aatggaaact ggtggagaac gccagaatgc cacggaaatc
121 agagcatctg tcggaagaat gattggtgga attggaagat tctacatcca aatgtgcacc
181 gaactcaaac tcagtgatta tgagggacgg ctgattcaga acagcataac aatagagaga
241 atggtgctct ctgcttttga cgagaggagg aacaaatatc tggaagaaca tccaagcgct
301 gggaaagacc ctaagaaaac tggaggaccc atatacagga gaatagacgg aaagtggata
361 agagaactca tcctttatga caaagaagaa ataaggagaa tctggcgcca agccaacaat
421 ggtgaggata caacagccgg tctgactcac atgatgatct ggcattccaa tttgaatgat
481 gccacttatc agaggacaag agctcttgtt cgcaccggaa tggatcccag gatgtgctct
541 ctgatgcagg gttcaactct ccccaggagg tctggagccg caggtgctgc agtcaaagga
601 gttgggacag tggtgatgga gttgatcaga atgatcaaac gtggaatcaa tgatcggaac
661 ttctggaggg gtgaaaatgg acgaaggaca agaattgcct atgaaagaat gtgcaacatt
721 ctcaaaggga aatttcaaac agctgcacag agagcaatga tggatcaagt gagagagagc
781 cgaaacccag gaaatgctga aatcgaagat ctcatctttc tggcacgatc cgcactcata
841 ttacgaggat cagttgcaca caagtcctgt ctgcctgcct gtgtgtatgg acttgctgta
901 gccagtggac atgactttga aagagagggg tactctctgg tcggaataga ccctttcaga
961 ctgcttcaaa acagccaagt attcagccta atcagaccaa atgaaaaccc agcgcacaag
1021 agtcaattag tgtggatggc atgccattct gctgcatttg aagatttaag ggtatcaagc
1081 ttcataagag ggaaaagagt ggttccacga gggcagcttt ccaccagagg ggttcagatt
1141 gcttccaatg agaacatgga gactatggac tctagtactc ttgaactgag aagcagatac
1201 tgggccataa ggaccagaag tggaggaaac actaatcaac agagggcatc tgcgggccaa
1261 atcagcgtgc aacctacatt ctcggtgcag agaaatctcc cttttgagag agcaaccgtt
1321 atggcagcat tcactgggaa cacagaggga agaacatcag acatgagaac cgaaatcata
1381 aggataatgg aaagtgcaag accagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gagcccgatc gtgccttcct ttgacatgag taacgaagga
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPD 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/swine/Tennessee/24/77 (H1N1) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30748
KEYWORDS nucleoprotein.
SOURCE Influenza A/swine/Tennessee/24/77 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1565; revises [1])
AUTHORS Gorman,O.T.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 514 a 307 c 409 g 335 t
ORIGIN
1 agcaaaagca gggtagataa tcactcaatg agtgacatcg aaatcatggc gtctcaaggc
61 accaaacgat catatgaaca aatggagact ggtggggaac gccaggatgc cacagaaatc
121 agagcatctg tcggaagaat gattggtgga atcggaagat tctacatcca aatgtgcact
181 gaactcaaac tcagtgacta tgagggacga ctgattcaaa atagcataac aatagagaga
241 atggtgctct ctgcttttga cgagagaagg aataaatacc tagaagagca tcccagtgct
301 gggaaagatc ctaagaaaac tggaggaccc atatatagaa gagtagacgg aaaatggatg
361 agggaactca tcctttatga caaagaagaa ataaggagag tttggcgcca agccaacaat
421 ggtgaagatg caacagccgg ccttacccat attatgattt ggcactccaa tctgaatgat
481 gccacctatc agagaacaag agctcttgtt cgcactggga tggatcccag aatgtgctcc
541 ctaatgcaag gttcaacact tcccagaagg tctggagccg caggtgctgc agtaaaagga
601 gttggaacaa tagcgatgga gttaatcaga atgatcaaac gtgggatcaa tgaccgaaac
661 ttctggaggg gtgaaaatgg acgaaggaca aggattgcat atgaaagaat gtgcaacatt
721 ctcaaaggga aatttcagac agctgcccag agggcaatga tggatcaagt gagagaaagt
781 cggaacccag gaaacgctga aattgaagat ctcattttcc tggcacggtc agcactcatt
841 ttaagagggt cagttgcaca taagtcttgc ctgcctgctt gtgtgtatgg gcttgcagta
901 gcgagtgggc atgactttga aagagaagga tattctctgg tcggaataga ccccttcaaa
961 ctacttcaaa acagtcaagt attcagcctg atcagaccaa atgaaaaccc agctcacaag
1021 agtcaactgg tgtggatggc atgccactct gccgcatttg aggatttaag agtatcaggc
1081 ttcataagag ggaagaaagt ggttccaaga ggaaagcttt ccacaagagg ggttcagatt
1141 gcttcaaatg agaatgtgga agctatggac tctagtaccc tggaactaag aagcaggtac
1201 tgggccataa ggaccagaag cgggggaaat accaatcaac agaaggcatc cgcaggccag
1261 atcagtgtgc aacctacatt ctcagtacaa aggaatctcc cttttgagag agcgaccgtt
1321 atggcagctt tcattgggaa caatgaggga cgaacatcag atatgcgaac tgaaatcata
1381 aggatgatgg aaagtgcaaa gccagaagat ttgtccttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgaa taatgagggg
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt gaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPE 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/swine/Netherlands/12/85 (H1N1) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30749
KEYWORDS nucleoprotein.
SOURCE Influenza A/swine/Netherlands/12/85 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 503 a 307 c 419 g 336 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacattc acatcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaacc ggaggagaac gccagaatgc tactgaaatc
121 agagcatctg ttgggggaat ggttggtgga attggaagat tctacataca gatgtgcact
181 gaactcaaac tcagtgacta tgaagggagg ctgatccaga atagcataac aatagagaga
241 atggttctct ctgcatttga tgagaggagg aacaaatacc tggaagaaca tcccagtgcg
301 gggaaggacc caaagaaaac tggaggtcca atctacaaaa agagagacgg aaaatggatg
361 agagagctga ttctgtatga caaagaggag atcaggagga tttggcgtca agcaaacaat
421 ggtgaagatg ctactgctgg tctcactcat ctgatgattt ggcattccaa cctgaatgat
481 gccacatatc agagaacaag agctctcgtg cgtactggga tggaccccag aatgtgctct
541 ctgatgcaag gatcaactct cccgaggaga tctggagctg caggtgcagc agtaaaggga
601 gttgggacga tggtaatgga actgattcgg atgataaagc gggggatcaa tgatcggaac
661 ttctggagag gcgaaaatgg gcgaagaaca agaattgcat atgagagaat gtgcaacatc
721 ctcaaaggga aatttcagac agcagcgcaa cgagcaatga tggaccaggt gcgagaaagc
781 agaaatcctg ggaatgctga gattgaagac cttatctttc tggcacgatc tgcactcatt
841 ctgagaggat cagtggctca taaatcctgc ctgcctgctt gtgtatatgg acttgttgtg
901 gcaagtggat atgactttga aagagaaggg tactctctag tcggaataga tcctttccgt
961 ctgctccaga acagccaggt gttcagcctc attagaccaa atgagaatcc agcacataag
1021 agtcagctgg tatggatggc atgccattct gcagcgtttg aagatctgag agtgtcaagt
1081 ttcatcagag ggacaaaagt ggtcccaaga ggacaactgt ccaccagagg agttcaaatt
1141 gcttcaaatg aaaacatgga aacaatggac tccagtactc ttgaactgag aagcaaatac
1201 tgggctataa gaaccaggag cggaggaaac accaaccaac agagagcatc tgcagggcaa
1261 atcagtgtac aacctacttt ctcggtacag agaaatcttc ctttcgagaa agcgaccatc
1321 atggcagctt ttacagggaa cactgaaggc agaacatctg acatgaggac tgaaattata
1381 agaatgatgg aaagtgccag accagaagat gtgtccttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPF 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/equine/London/1416/73 (H7N7) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30750
KEYWORDS nucleoprotein.
SOURCE Influenza A/equine/London/1416/73 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 497 a 322 c 418 g 328 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatca aactcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc
121 agagcatctg ttggaaggat ggtgggagga atcggccggt tttatgttca aatgtgtact
181 gagctcaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagagagg
241 atggtgcttt cggcattcga cgaaagaaga aacaagtacc tggaggagca tcccagtgct
301 ggaaaagacc ccaagaaaac gggaggcccg atatacagaa ggagagatgg gaaatggatg
361 agagaactca tcctccatga taaagaagaa atcaggagga tctggcgtca ggccaacaat
421 ggtgaagatg ctactgctgg tctgactcac atgatgatct ggcactccaa tctcaatgac
481 accacctacc aaagaacaag ggctcttgtt cggactggga tggatcccag aatgtgctct
541 ctgatgcaag gatcaactct cccacggaga tctggagctg ccggtgctgc agtgaagggt
601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaac
661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaacatc
721 ctcaagggga aattccaaac agcagcacaa cgagcaatga tggaccaagt gagagagggc
781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcgcgatc agcactcatt
841 ctgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta
901 gccagtgggt atgacttcga gaaagaggga tactctctgg ttggaattga tcctttcaaa
961 ctactccaga acagccaaat tttcagtctg atcagaccga aagaaaatcc agcacacaag
1021 agccagctgg tgtggatggc atgccattct gcagcatttg aggacctgag agtttcgaat
1081 ttcattagag gaaccaaagt aatcccaaga ggacagttgg caaccagagg agtgcaaatt
1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat
1201 tgggcaataa ggaccaggag tggggggaac accagtcaac agagagcatc tgcaggacag
1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt
1321 atggctgcat tcactggaaa cactgagggg aggacttccg acatgagaac ggaaatcata
1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg
1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPG 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/equine/Kentucky/2/86 (H3N8) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30751
KEYWORDS nucleoprotein.
SOURCE Influenza A/equine/Kentucky/2/86 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 502 a 318 c 411 g 334 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatca aagtcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc
121 agagcatctg tcggaaggat ggtgggagga atcggccggt tctatgttca gatgtgtact
181 gagcttaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagaaagg
241 atggtacttt cggcattcga cgaaagaaga aacaagtacc tcgaggagca tcccagtgct
301 gggaaagacc ccaagaaaac gggaggcccg atatacagaa ggaaagatgg gaaatggatg
361 agagaactca tcctccatga taaagaagaa atcatgagga tctggcgtca ggccaacaat
421 ggtgaagacg ctactgctgg tcttactcat atgatgatct ggcactccaa tctcaatgac
481 accacatacc aaagaacaag ggctcttgtt cgggctggga tggatcccag aatgtgctct
541 ctgatgcaag gatcaaccct cccacggaga tctggagctg ccggtgctgc agtaaaaggt
601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaat
661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaatatc
721 ctcaaaggga aattccaaac agcagcacaa cgggcaatga tggaccaagt gagggagggc
781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcacgatc agcactcatt
841 ttgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta
901 gccagtgggt atgactttga gaaggaagga tactctctgg ttggaattga tcctttcaaa
961 ctactccaga acagccaaat tttcagtcta atcagaccga aagaaaatcc agcacacaag
1021 agccagttgg tgtggatggc atgccattct gcagcatttg aggacctgag agttttgaat
1081 ttcattagag gaaccaaagt aatcccaaga ggacagttag caaccagagg agtgcaaatt
1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat
1201 tgggcaataa ggaccaggag tggagggaac accagtcaac agagagcatc tgcaggacag
1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt
1321 atggctgcat tcactgggaa cactgagcgg aggacttccg acatgagaac ggaaatcata
1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg
1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPH 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/gull/Massachussetts/26/80 (H13N6) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30752
KEYWORDS nucleoprotein.
SOURCE Influenza A/gull/Massachussetts/26/80 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 492 a 317 c 429 g 327 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc
61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt
121 agggcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact
181 gaactcaagc tcagtgacaa tgagggacga ttgatccaaa acagcatcac catagagaga
241 atggtcctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc
301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg
361 agagaattag ttctgtatga caaagaagaa ctaagaagaa tctggcggca ggcaaacaat
421 ggggaagatg cgactgctgg tctcacccac ttgatgatct ggcattccaa tttgaatgac
481 gccacgtatc agagaactag agcacatgtg cgcacaggaa tggatcccag gatgtgctcc
541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtgaaggga
601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat
661 ttctggagag gtgaaaacgg acgaagaaca agaattgcct atgaaagaat gtgcaacatc
721 ctcaaaggga aatttcaaac agcagcacag cgggctatga tggatcaggt gcgggaaagc
781 cggaatcctg gaaatgctga aatagaggac cttatatttc tggctcgatc tgcacttatc
841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggcggtg
901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt
961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag
1021 agtcaattgg tgtggatggc atgccattct gctgcatttg aggatttgag agtgtcaagt
1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt
1141 gcatccaatg agaacatgga aacaatgaat tccagcactc ttgaattgag aagcaaatac
1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa
1261 gtcagtgttc aacccacttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc
1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata
1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcagacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPI 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/gull/Astrakhan/227/84 (H13N6) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30753
KEYWORDS nucleoprotein.
SOURCE Influenza A/gull/Astrakhan/227/84 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 489 a 323 c 426 g 327 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaact ggtggtgaac gccagaatgc cactgagatt
121 cgggcatctg tcggaaggat ggttggagga atcggaagat tctacataca gatgtgcact
181 gaactcaaac taagtgacaa cgaggggagg ctgatccaga acagtatcac aatagaaaga
241 atggtccttt ctgcgtttga tgagagaagg aacaggtact tggaggagca ccccagcgct
301 gggagggacc ccaagaagac tggaggacca atttacagaa ggagagaggg aaaatgggtg
361 agagaattgg tcctgtatga caaggaagaa ataagaagaa tctggcgaca agcaaacaat
421 ggagaagatg caactgccgg ccttacccac ttgatgatct ggcattctaa cttgaatgac
481 gccacatatc agaggactag agcccttgtg cgcactggga tggaccccag gatgtgctcc
541 cttatgcaag gatcgacact tccaagaagg tctggagcgg ctggtgcagc tgtaaaggga
601 gttgggacaa tggtgatgga actcatcaga atgataaaaa gaggggtcaa tgaccgtaac
661 ttctggagag gtgaaaatgg acgaagaaca agaattgcct atgaaagaat gtgcaacatt
721 ctcaaaggaa aattccaaac agctgcacaa cgagctatga tggatcaggt cagagaaagc
781 cgtagtcctg gaaatgctga aatcgaggac ctcatctttc tggccaggtc tgcacttatt
841 ctgaggggag cagtggctca caaatcatgc ctgcctgcct gtgtgcatgg acttgccgta
901 gcaagtggat acgactttga gagggaggga tactcccttg ttgggataga ccctttccgt
961 ctgctccaga acagccaggt attcagtcta atccgaccca atgaaaatcc agcacataag
1021 agtcagttgg tgtggatggc atgccattcc gctgcatttg aggatttgag agtgtcaagc
1081 ttcatcaggg gagcaagagt gttaccaaga gggcaactat ccacaagagg tgttcaaatt
1141 gcatccaatg agaacatgga gactatgagt tccagcactc ttgaattgag aagcaaatac
1201 tgggcaataa ggactagaag cggaggaaac accaaccaac aaagagcatc tgcaggacaa
1261 atcagtgtgc aacccacttt ctctgtgcaa agaaacctcc cctttgagag agcgaccatc
1321 atggctgcat tcacaggaaa tgcagaaggc agaacatctg atatgagaac tgagatcata
1381 cggatgatgg aaaatgcaag accagaggat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taaagaggga
1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPK 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/gull/Maryland/1824/78 (H13N9) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30755
KEYWORDS nucleoprotein.
SOURCE Influenza A/gull/Maryland/1824/78 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 492 a 319 c 428 g 326 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc
61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt
121 agagcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact
181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga
241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc
301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg
361 agagaattag ttctgtatga caaagaagaa ctaaggagaa tctggcggca ggcaaacaat
421 ggggaagatg cgaccgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac
481 gccacgtatc agagaactag agcactagtg cgcacaggga tggatcccag gatgtgctcc
541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga
601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat
661 ttctggagag gtgaaaacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc
721 ctcaaaggga aatttcaaac agcagcacaa cgggctatga tggatcaagt gcgggaaagc
781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc
841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtgaatgg actggctgtg
901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt
961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag
1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt
1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt
1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattgag aagcaaatac
1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa
1261 gtcagtgttc aacccagttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc
1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata
1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc agggacgggg agtcttcgag
1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga
1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPL 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/gull/Maryland/1815/79 (H13N6) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30756
KEYWORDS nucleoprotein.
SOURCE Influenza A/gull/Maryland/1815/79 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 490 a 319 c 429 g 327 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc
61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt
121 agagcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact
181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga
241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc
301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg
361 agagaattag ttctgtatga caaagaagaa ctaaggagaa tctggcggca ggcaaacaat
421 ggggaagatg cgaccgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac
481 gccacgtatc agagaactag agcactagtg cgcacaggga tggatcccag gatgtgctcc
541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga
601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat
661 ttctggagag gtgaaaacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc
721 ctcaaaggga aatttcaaac agcagcacaa cgggctatga tggatcaggt gcgggaaagc
781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc
841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggctgtg
901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt
961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag
1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt
1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt
1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattgag aagcaaatac
1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa
1261 gtcagtgttc aacccagttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc
1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata
1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga
1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPN 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/turkey/Minnesota/833/80 (H4N2) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30769
KEYWORDS nucleoprotein.
SOURCE Influenza A/turkey/Minnesota/833/80 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 494 a 311 c 429 g 331 t
ORIGIN
1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgaaatc
121 agggcatctg ttgggagaat ggttggtgga atcgggagat tctacataca gatgtgcaca
181 gaactcaaac tcagcgacta tgaaggaagg ctgatccaaa acagcataac aatagagaga
241 atggttctct cggcatttga tgagaggaga aacaaatatc tggaagagca tcccagtgct
301 ggaaaagacc ctaagaagac tggaggtcca atatacagga ggagagatgg gaaatgggtg
361 agagaactga tcctgtatga caaagaggag atcagaagga tttggcgtca agcgaataat
421 ggagaagacg caaccgctgg tctcacccat ttgatgatct ggcattccaa tctgaatgat
481 gccacatatc agagaacaag ggcactggtg cgtactggga tggatcccag gatgtgctct
541 ctgatgcaag gatcaactct tccgaggagg tctggggctg ctggagcagc agtgaaagga
601 gttggaacaa tggtgatgga attgattcgg atgatcaagc gagggatcaa tgatcggaat
661 ttctggagag gtgaaaatgg acggagaaca agaattgcat acgaaagaat gtgcaacatt
721 ctcaagggga aattccaaac agcagcacaa cgagcaatga tggaccaggt gagggaaagc
781 cggaatccag ggaatgctga aattgaagat ctaatctttc ttgcacggtc tgctctcatt
841 ctgaggggat cagtggctca taagtcctgc ctgcctgctt gtgtgtatgg acttgctgtg
901 gccagtggat acgactttga gagggaggga tactccctag tcggaattga tcctttccgt
961 ctgctccaaa acagtcaagt cttcagtctt atcagaccaa acgaaaatcc agcacataaa
1021 agtcagctgg tatggatggc atgtcattct gcagcatttg aagatctgag agtgtcaagc
1081 ttcatcagag gaacaagagt ggtcccaaga ggacaactgt ccaccagagg agttcagatt
1141 gcttcaaatg agaacatgga gacaatggac tccagtactc ttgaattgag gagcagatac
1201 tgggctataa ggaccagaag cggagggaac actaaccagc agagagcatc tgcaggacaa
1261 atcagcgtac agcccacatt ctctgtgcag agaaacctcc cattcgagag agcaaccatt
1321 atggcggcat ttacgggaaa cactgaaggc agaacttcag acatgagaac tgagatcata
1381 aggatgatgg aaaatgccag acctgaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaagga
1501 tcttatttct tcggagacaa tgcagaggaa tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPO 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/gull/Minnesota/945/80 (H13N6) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30757
KEYWORDS nucleoprotein.
SOURCE Influenza A/gull/Minnesota/945/80 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
REFERENCE 2 (sites)
AUTHORS Gorman,O.T.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990.
The isolate mentioned in the published paper [1] is incorrect and
should be "A/Gull/Minnesota/945/80".
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 490 a 321 c 426 g 328 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc
61 accaaacgat cttatgagca aatggaaact ggtggcgagc gccagaatgc caatgacatc
121 agagcatctg tcgggaggat ggttggggga attggaagat tctacataca gatgtgcact
181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga
241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc
301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg
361 agagaattag ttctgtatga caaagaagag ctaaggagaa tctggcggca ggccaacaat
421 ggggaagatg cgactgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac
481 gccacgtatc aaagaactag agcacatgtg cgcacaggga tggatcccag gatgtgctcc
541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga
601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat
661 ttctggagag gtgaagacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc
721 ctcaaaggga aatttcagac agcagcacaa cgggctatga tggatcaagt gcgggaaagc
781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc
841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggctgtg
901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt
961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag
1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt
1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt
1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattaag aagcaaatac
1201 tgggcaataa ggactagaag tggaggaaac accaaccagc aaagagcatc tgcgggacaa
1261 gtcagtgttc aacccacttt ctctgtgcag agaaacctcc cctttgaaag ggcgaccatc
1321 atggctgcat tcacaggaaa tccggagggc agaacatctg atatgagaac ggaaatcata
1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc agggacgggg agtcttcgag
1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga
1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPP 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/equine/Tennessee/5/86 (H3N8) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30758
KEYWORDS nucleoprotein.
SOURCE Influenza A/equine/Tennessee/5/86 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 502 a 318 c 411 g 334 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatca aagtcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc
121 agagcatctg tcggaaggat ggtgggagga atcggccggt tctatgttca gatgtgtact
181 gagcttaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagaaagg
241 atggtacttt cggcattcga cgaaagaaga aacaagtacc tcgaggagca tcccagtgct
301 gggaaagacc ccaagaaaac gggaggcccg atatacagaa ggaaagatgg gaaatggatg
361 agagaactca tcctccatga taaagaagaa atcatgagga tctggcgtca ggccaacaat
421 ggtgaagacg ctactgctgg tcttactcat atgatgatct ggcactccaa tctcaatgac
481 accacatacc aaagaacaag ggctcttgtt cgggctggga tggatcccag aatgtgctct
541 ctgatgcaag gatcaaccct cccacggaga tctggagctg ccggtgctgc agtaaaaggt
601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaat
661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaatatc
721 ctcaaaggga aattccaaac agcagcacaa cgggcaatga tggaccaagt gagggagggc
781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcacgatc agcactcatt
841 ttgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta
901 gccagtgggt atgactttga gaaggaagga tactctctgg ttggaattga tcctttcaaa
961 ctactccaga acagccaaat tttcagtcta atcagaccga aagaaaatcc agcacacaag
1021 agccagttgg tgtggatggc atgccattct gcagcatttg aggacctgag agttttgaat
1081 ttcattagag gaaccaaagt aatcccaaga ggacagttag caaccagagg agtgcaaatt
1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat
1201 tgggcaataa ggaccaggag tggagggaac accagtcaac agagagcatc tgcaggacag
1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt
1321 atggctgcat tcactgggaa cactgagcgg aggacttccg acatgagaac ggaaatcata
1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg
1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPR 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/duck/New Zealand/31/76 (H4N6) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30760
KEYWORDS nucleoprotein.
SOURCE Influenza A/duck/New Zealand/31/76 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 502 a 325 c 412 g 326 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
61 accaaacgat cttatgaaca gatggagact ggtggagaac gccagaatgc tactgagatc
121 agagcatctg ttgggagaat ggttggtgga atcggaaggt tctacataca gatgtgcact
181 gaactcaaac tcagtgacta tgaaggaagg ctaatccaaa acagcataac aatagagaga
241 atggttcttt ctgcatttga tgaaaggaga aacaagtact tggaggagca ccccagtgct
301 ggaaaggatc ctaagaaaac tggaggtcca atctatagaa gaagagatgg aaaatgggtg
361 agagaactga tcctgtacga caaagaggag atcaggagga tttggcgcca agcgaataat
421 ggagaggacg caactgctgg tcttacccat ttgatgatct ggcattccaa tcttaatgat
481 gcaacatacc agagaacaag agcccttgtg cgtactggga tggaccccag gatgtgctct
541 ctgatgcaag gctcaactct cccgaggagg tctggagctg caggtgctgc agtaaaagga
601 gttggaacaa tggtgatgga gctgattcga atgataaaac gaggaatcaa tgaccgtaac
661 ttctggagag gcgagaatgg acgaaggaca agaattgctt acgagaggat gtgcaacatt
721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggatcaggt gcgggagagc
781 cgaaatcctg gaaatgctga aattgaagac ctcatcttct tggcacgatc tgcactcatt
841 ctgaggggat cagtggctca caaatcctgc ctacctgctt gtgtatatgg attagctgta
901 gccagtggat atgactttga aagagaagga tactccctgg tcggaataga ccccttccgt
961 ctgctccaga acagccaggt attcagcctc atcagaccca atgaaaatcc agcacacaag
1021 agtcaactgg tgtggatggc atgccactct gcagcatttg aagacctgag agtttcaagc
1081 ttcatcaggg gaacaagagt gatcccgaga gggcaactgt ccactagagg ggtccaaatt
1141 gcttcaaatg aaaacatgga aacaatggac tctagcactc tcgaattgag aagcagatac
1201 tgggccataa ggaccagaag tggaggaaac accaaccaac agcgggcatc tgcgggacaa
1261 atcagcgtgc aacccacttt ctctgtgcaa agaaatcttc ctttcgaaag agcaaccatc
1321 atggcagcat ttacagggaa cactgaaggc agaacatctg acatgagaac tgagatcata
1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcagacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPS 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/grey teal/Australia/2/79 (H4N4) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30761
KEYWORDS nucleoprotein.
SOURCE Influenza A/grey teal/Australia/2/79 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 513 a 327 c 401 g 324 t
ORIGIN
1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc
61 accaaacgat cttatgaaca aatggaaact ggtggagaac gccagaatgc tactgaaatc
121 agatcttctg ttggaagaat ggttggtgga attggaagat tctacataca gatgtgcact
181 gaattcaaac tcagcgacta tgaaggaagg ctgatccaaa acagcataac aatagagaga
241 atggtccttt ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagtgct
301 ggaaaagatc ctaagaagac tggaggtcca atctacagaa ggagagatgg aaagtggatt
361 agagagctga tcctatgcga caaagaggag atcaggagaa tctggcgcca agcgaataat
421 ggggaggatg caactgctgg ccttacccat ttgatgatat ggcattccaa cctcaatgat
481 gccacttacc agagaacaag agcccttgtg cgtactggga tggatcctag aatgtgctct
541 ctgatgcaag gctcaactct cccgagaaga tctggagctg ctggcgctgc agtaaaagga
601 gttgggacaa tggtgatgga actgattcgg atgataaaac ggggaatcaa tgaccgaaat
661 ttctggagag gcgaaaatgg gcgaagaaca agaattgcct atgagagaat gtgcaacatt
721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggaccaagt gcgggaaagc
781 cgaaatcctg ggaatgctga aattgaagac cttatatttc tggcacgatc tgcactcatc
841 ctaagaggat cagtagccca caaatcctgc ctacctgctt gtgtgtatgg attggccgta
901 gccagtggat atgactttga gaaagaagga tactccctag tcggaataga tcctttccgc
961 atgctccaga acagccaggt attcagtctc atcaggccca atgaaaattc agcacacaag
1021 agtcaattgg tatggatggc atgccactct gcagcatttg aagacctgag agtatcaagc
1081 ttcatcagag gaacaaaagt ggtcccaaga gggaaactct ccaccagagg agtccaaatc
1141 gcttcaaatg agaacatgga aacaatggac tctacaactc tcgaattgag aagcaggtac
1201 tgggctataa ggaccagaag cggaggaaac acaaatcagc agcgggcgtc agctggacaa
1261 atcagcgtac aacccacttt ctctgtgcag aggaatctcc cattcgagag agcaaccatt
1321 atggcagcat ttaccgggaa cactgaaggc agaacatccg acatgagaac tgagatcata
1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aagggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPT 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/duck/Czechoslovakia/56 (H4N6) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30762
KEYWORDS nucleoprotein.
SOURCE Influenza A/duck/Czechoslovakia/56 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 489 a 315 c 432 g 329 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
61 accaaacgat cttatgaaca gatggaaact ggtggagagc gccagaatgc cactgagatc
121 agagcatctg ttgggagaat ggtcggtgga attgggaggt tttacataca gatgtgtact
181 gaactcaagc tcagtgacta tgaagggaga ctgatccaga acagcataac aatagagaga
241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgct
301 gggaaggacc ctaagaaaac tggaggtcca atctatagaa ggagagacgg aaaatgggtg
361 agagaattga tcctgtacga caaagaggag atcaggagga tttggcgcca agcaaacaat
421 ggagaagacg caactgctgg tctcactcac ttgatgatct ggcattccaa tctgaatgat
481 gccacatatc agagaacaag agctcttgta cgtactggga tggacccaag aatgtgctct
541 ctgatgcaag gatcaactct cccgaggagg tctggagctg ctggtgcagc agtgaaggga
601 gttgggacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat
661 ttctggagag gcgaaaatgg acgaagaaca aggattgcat atgagaggat gtgcaatatc
721 ctcaaaggga agttccaaac agcagcacaa agagcaatga tggaccaggt gcgggagagc
781 cggaatcctg ggaatgctga aattgaagac cttatcttcc tggcacggtc tgcactcatt
841 ctgagaggat cagtggctca taagtcctgt ctgcctgctt gtgtgtatgg acttgctgtg
901 gccagtggat acgactttga gagagagggg tactctctgg tcggaataga tcctttccgt
961 ctgctccaaa acagccaagt attcagcctc attagaccaa atgagaatcc agcacataag
1021 agccaactgg tatggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc
1081 ttcatcagag ggacaagagt ggtcccaaga gggacaactg tcaccagagg atttcaaatt
1141 gcttcaaatg agaacatgga aacaatggac tccagcactc ttgaactgag gagcagatat
1201 tgggctataa ggaccaagag cggaggaaac accaaccaac agagagcatc tgcaggacaa
1261 atcagtgtac aacccacttt ctcggtacag agaaatctcc ctttcgagag agcgaccatc
1321 atggcggcat ttacagggaa cactgaaggc agaacgtctg acatgaggac tgagatcata
1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPU 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/duck/Ukraine/2/60 (H11N8) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30763
KEYWORDS nucleoprotein.
SOURCE Influenza A/duck/Ukraine/2/60 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 496 a 325 c 423 g 321 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaact ggtggagatc gccagaatgc cactgagatc
121 agagcatctg ttgggaggat ggtcagcgga attggaagat tctacataca gatgtgcact
181 gaactcaaac tcagcgacta tgaagggagg ctgatccaga acagcataac aatagaaaga
241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgca
301 gggaaggatc ctaagaaaac cggaggtcca atctatagaa ggagagacgg aaaatggatg
361 agagagctga tcctgtatga caaagaggag atcaggagaa tttggcgcca agcgaacaat
421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcattccaa tctgaatgat
481 gccacatatc aaagaacaag agctctcgtg cgcactggaa tggaccccag aatgtgctcc
541 ctgatgcaag gatcaactct cccgaggaga tctggagctg ctggcgcagc agtaaaggga
601 gttgggacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat
661 ttctggagag gcgaaaatgg acgaagaaca aggattgcat atgagagaat gtgcaacatc
721 cttaaaggga aattccaaac agcagcacaa cgagcaatga tggaccaagt gcgggaaagc
781 cggaatcctg ggaatgctga aattgaagac cttatcttcc tggcacgatc tgcactcatt
841 ctgagaggat cagtggctca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg
901 gccagtggat acgacttcga gagagaggga tactctctgg tcggaataga tcctttccgt
961 ctgctccaaa acagccaggt gttcagtctc attagaccaa atgagaatcc tgcacataag
1021 agccaactgg tgtggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc
1081 ttcatcagag ggacaagagt ggtcccaaga gggcaactgt ccaccagagg agttcaaatt
1141 gcttcaaatg aaaacatgga aacaatggac tccagcactc ttgaactgag gagcagatac
1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc cgcaggacaa
1261 ataagtgtac aacccacttt ctcggtacag agaaatcttc ctttcgagag agcgaccatc
1321 atggcggcat ttacagggaa cactgaaggc agaacgtctg acatgaggac tgagatcata
1381 agaatgatgg agagtgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPV 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/mallard/Astrakhan/244/82 (H?N6) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30764
KEYWORDS nucleoprotein.
SOURCE Influenza A/mallard/Astrakhan/244/82 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 500 a 317 c 421 g 327 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatca acatcatggc gtctcaaggc
61 accaaacgat cttatgaaca gatggaaact ggtggagaac gccagaatgc tactgagatc
121 agagcatctg ttggaagaat ggttggtgga attgggaggt tttacataca gatgtgcact
181 gaactcaaac tcagcgacta tgaagggagg ctgatccaga acagcataac aatagagaga
241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagaaca tcccagtgcg
301 gggaaggacc cgaagaaaac tggaggtcca atctaccgaa ggagagacgg gaaatggatg
361 agagagctga ttctgtatga caaagaggag atcaggagga tttggcgtca agcgaacaat
421 ggagaagacg caactgctgg tctcactcac ctgatgatct ggcattccaa cctaaatgat
481 gccacatacc agagaacaag agctctcgtg cgtactggga tggaccccag aatgtgctct
541 ctgatgcaag gatcaactct cccgaggaga tctggagctg ctggtgcagc agtaaaggga
601 gtcgggacga tggtgatgga actaattcgg atgataaagc gagggattaa cgatcggaat
661 ttctggagag gcgaaaacgg acgaagaaca aggattgcat atgagagaat gtgcaacatc
721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggatcaggt gcgagaaagc
781 agaaatcctg ggaatgctga aattgaagat ctcatctttc tggcacggtc tgcactcatc
841 ctgagaggat cagtggccca taagtcctgc ttgcctgctt gtgtgtacgg acttgctgtg
901 gccagtggat atgactttga gagagaaggg tactctctgg tcgggataga tcctttccgt
961 ctgcttcaaa acagccaggt cttcagtctc atcagaccaa atgagaatcc agcacataag
1021 agtcaactgg tatggatggc atgccattct gcagcatttg aggacctgag agtctcaagt
1081 ttcatcagag ggacaagagt ggtcccaaga ggacaactat ccaccagagg agttcaaatt
1141 gcttcaaatg agaacatgga aacaatggac tccagtactc ttgaactgag aagcagatat
1201 tgggctataa gaaccaggag tggaggaaac accaaccaac agagagcatc tgcaggacaa
1261 atcagtgtac aacccacttt ctcggtacag agaaatcttc ctttcgaaag agcgaccatc
1321 atggcggcat ttacagggaa cactgaaggc agaacatctg acatgaggac tgaaatcata
1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaagga
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPW 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/budgerigar/Hokkaido/1/77 (H4N6) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30765
KEYWORDS nucleoprotein.
SOURCE Influenza A/budgerigar/Hokkaido/1/77 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 500 a 322 c 428 g 315 t
ORIGIN
1 agcaaaagca gggtagataa tcactcacag agtgacatcc acgacatggc gtctcaaggc
61 accaaacgat cctatgagca gatggaaact ggtggagaac gccagaatgc tactgagatc
121 agagcatctg ttgggagaat ggtcggtgga attggaagat tctacataca gatgtgcact
181 gaaatcaaac tcagtgacta tgaagggagg ctgattcaga acagcataac aatagagaga
241 atggtcctct ctgcatttga ggaaaggagg aacaaatacc tggaagagca ccccagtgcg
301 ggaaaggacc cgaagaaaac tggaggtcca atctacagaa ggagagatgg aaaatgggtg
361 agagaactga tcctgtatga caaagaggag ataaggagga tctggcgcca agcgaacaat
421 ggagaagacg caactgcagg tctcactcac ctgatgatct ggcattccaa cctgaatgat
481 gccacatatc agagaacaag agccctcgtg cgtactggga tggaccccag aatgtgctct
541 ctgatgcaag gatcaactct cccgagaaga tctggggcag ctggtgcagc aataaaggga
601 gttgggacga tggtgatgga gctgattcgg atgataaaac ggggaatcaa tgatcggaat
661 ttctggagag gcgaaaatgg gcgaagaaca aggactgcat acgagagaat gtgcaacatc
721 ctaaaaggaa aattccaaac agcagcacaa cgagcaatga tggatcaggt gcgagaaagc
781 aggaatcctg gaaatgctga gattgaagac cttatatttc tggcacggtc tgcactcatt
841 ctgagaggat cagtagctca taagtcctgc ctgcctgctt gtgtatatgg gcttgcagtg
901 gccagtgggt acgactttga gagagaggga tactctctgg tcggaataga tcctttccgc
961 ctgcttcaaa acagtcaggt gttcagcctc attagaccaa atgagaatcc agcacataaa
1021 agccaactag tctggatggc ttgccattct gcagcatttg aagacctgag agtctcaagc
1081 ttcatcagag ggacaagact agtcccaaga ggacaactgt ccactagagg agttcaaatt
1141 gcttcaaatg agaacatgga agcaatggac tccaatactc ttgagctgag gagcagatac
1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc tgcaggacaa
1261 atcagcgtgc aacctacttt ctcggtacag aggaatctcc ctttcgagag agctaccatc
1321 atggcggcat ttacagggaa caatgaagga aggacttctg acatgaggac tgagatcata
1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaggga
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPX 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/ruddy turnstone/New Jersey/47/85 (H4N6) nucleoprotein
(seg 5) mRNA, complete cds.
ACCESSION M30766
KEYWORDS nucleoprotein.
SOURCE Influenza A/ruddy turnstone/New Jersey/47/85 RNA, and cDNA to viral
RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 491 a 317 c 423 g 334 t
ORIGIN
1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgaaatc
121 agagcatctg ttgggagaat ggttggtgga atcggaagat tctacataca gatgtgcact
181 gaactcaaac tcagtgacta tgaagggaga ctgatccaaa acagcataac aatagagaga
241 atggttctct cggcatttga cgagaggaga aacaaatatc tggaagagca ccccagtgct
301 gggaaagacc ctaaaaagac tggaggtcca atctacagga ggagagatgg gaaatggatg
361 agagaactca tcctatatga caaagaggag atcagaagga tttggcgtca agcgaataat
421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcattccaa tctgaatgat
481 gccacatacc agagaacaag ggcacttgtg cgtactggga tggatcccag gatgtgctct
541 ctgatgcaag gctcaactct tccgaggagg tctggggctg ctggagcagc agtgaaaggg
601 gttggaacaa tggtgttgga attgatccgg atgatcaagc gagggatcaa tgatcggaat
661 ttctggagag gcgaaaatgg aaggaggaca aggattgctt atgaaagaat gtgcaacatt
721 ctcaaaggga aattccaaac agcagcacaa cgagcaatga tggatcaggt gagggaaagc
781 cggaatcccg ggaatgctga aattgaagat ctcatctttc ttgcacggtc tgctctcatt
841 ctgagaggat cagtggccca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg
901 gccagtggat acgactttga aagggaggga tactccctag tcggaattga tcctttccgt
961 ttgcttcaaa acagtcaagt cttcagtctt atcagaccaa acgaaaatcc agcacataaa
1021 agtcagctgg tatggatggc atgccactct gcagcatttg aagatctgag agtgtcaagc
1081 ttcattagag gaacaagagt agtcccaaga ggacaactgt ccaccagagg agttcagatt
1141 gcttcaaatg agaacatgga gacaatggac tccagcactc ttgaattgag gagcagatac
1201 tgggctataa ggaccagaag cggagggaac actaaccagc agagagcatc tgcagggcaa
1261 atcagcgtac agcccacatt ctctgtgcag aggaacctcc cattcgagag agcaaccatt
1321 atggcggcat ttacaggaaa cactgaaggc agaacttcag acatgagaac tgagatcata
1381 aggatgatgg aaaatgccag tcctgaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga
1501 tcttatttct tcggagacag tgcagaggaa tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPY 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/tern/South Africa/61 (H5N3) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30767
KEYWORDS nucleoprotein.
SOURCE Influenza A/tern/South Africa/61 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 493 a 322 c 428 g 322 t
ORIGIN
1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaact ggtggagagc gccagaatgc cactgagatc
121 agagcatctg ttgggagaat ggtcggtgga atcgggagat tctacataca gatgtgcact
181 gaactcaaac tcagtgacta tgaagggagg ctgatccaga acagcataac aatagagaga
241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgcg
301 gggaaggacc cgaagaaaac tggaggtcca atctatagaa ggagagacgg aaaatggatg
361 agagaactga tcctgtatga caaagaggag atcaggagga tttggcgcca agcaaacaat
421 ggagaagacg caactgctgg tctcactcat ctgatgatct ggcattccaa cttgaatgat
481 gccacatatc agagaacaag agctcttgtg cgtactggga tggaccccag aatgtgctct
541 ctgatgcaag gatcaactct cccgaggaga tctggagccg ctggtgcagc agtgaaggga
601 gttggaacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat
661 ttttggagag gcgaaaatgg acgaagaaca aggattgcat atgagagaat gtgcaacatc
721 ctcaaaggga aattccaaac agcagcacaa cgagcaatga tggaccaggt gcgagaaagc
781 cggaatcctg ggaatgctga aattgaagac cttatctttc tggcacggtc tgcactcatt
841 ctgagagggt cagtggctca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg
901 gccagtggat acgactttga gagagaggga tactctctgg tcggaataga ccctttccgt
961 ctgctccaaa acagccaggt attcagcctc attagaccaa aagagaatcc agcacataag
1021 agccaattgg tatggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc
1081 ttcatcagag ggacaagagt ggtcccaaga ggacaactgt ccaccagagg atttcaaatt
1141 gcttcgaatg agaacatgga ggcaatggac tccagtactc ttgaactgag gagcagatac
1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc tgcaggacaa
1261 atcagcgtac aacccacttt ctcagtacag agaaatcttc ctttcgagag agcgaccatc
1321 atggcggcat ttacagggaa cactgaaggc agaacatctg acatgaggac tgagatcata
1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS FLANPZ 1565 bp ss-RNA VRL 04-AUG-1990
DEFINITION Influenza A/chicken/Pennsylvania/1/83 (H5N2) nucleoprotein (seg 5)
mRNA, complete cds.
ACCESSION M30768
KEYWORDS nucleoprotein.
SOURCE Influenza A/chicken/Pennsylvania/1/83 RNA, and cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1565)
AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
TITLE Evolution of the nucleoprotein gene of influenza A virus
JOURNAL J. Virol. 64, 1487-1497 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by O.T.Gorman, 14-DEC-1989.
FEATURES from to/span description
pept 46 1542 nucleoprotein
BASE COUNT 492 a 313 c 427 g 333 t
ORIGIN
1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc
61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgagatc
121 agagcatctg ttgggagaat ggttggtgga atcggaaggt tctacataca gatgtgcact
181 gaactcaaac tcagcgacca tgaagggaga ctgatccaga acagcataac aatagagaga
241 atggttctct cggcatttga tgagagaaga aacagatatc tggaagagca tcccagtgct
301 gggaaagatc ctaagaagac tgggggtcca atctacagaa ggagagatgg gaaatgggtg
361 agagaattga tcctctatga caaagaggag atcagaagaa tttggcgtca agcaaataat
421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcactccaa cctgaatgat
481 gccacatatc agagaacaag ggcacttgtg cgtaccggga tggaccccag gatgtgctct
541 ctgatgcaag gctcaacact tccaaggagg tctggtgctg ctggagcagc cgtgaaaggg
601 attggaacaa tggtaatgga attgatacgg atgatcaaac gagggatcaa tgataggaat
661 ttctggagag gcgaaaatgg gcggagaaca agaattgcat acgagagaat gtgtaacatt
721 ctcaagggga aatttcaaac agctatacaa cgggcaatga tggaccaagt gagggaaagc
781 cggaatcctg ggaatgctga aattgaagat ctcatctttc ttgcgcggtc tgctctcatt
841 ctgaggggat cagtggcaca taagtcctgc ctgcctgctt gtgtgtatgg acttgctgtg
901 gccagtggat acgattttga gagggaggga tactccctag tcgggattga tcctttccgt
961 ctgctccaaa acagtcaagt tttcagtctt ataagaccaa acgaaaatcc agcacataaa
1021 agtcagctgg tatggatggc atgtcactct gcagcatttg aggatctgag agtatcaagc
1081 ttcattagag gaacaagagt ggtcccaaga gggcaacttt ccaccagagg agttcagatt
1141 gcttcaaatg agaacatgga tgcaatagac tccagtactc ttgaattgag gagcagatac
1201 tgggccataa gaaccagaag cggaggaaac acaaatcagc agagagcatc tgcaggacaa
1261 atcagcgtac agcccacatt ctcagtgcag agaaacctcc cattcgagag agcaaccatc
1321 atggcggcat tcacaggaaa cacagaaggc agaacttcag acatgagaac tgagattata
1381 aggatgatgg agaatgccag acctgaagat gtgtctttcc aggggcgggg agtcttcgag
1441 ctctcggacg aaaaggcgac gaacccgatc gtgccttcct ttgacatgag taacgagggg
1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
1561 ctact
//
LOCUS HUMCEAB 2690 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human carcinoembryonic antigen (CEA) hsCGM1 gene, exons 1 and 2.
ACCESSION M22433 D51537
KEYWORDS carcinoembryonic antigen.
SOURCE Human fetal liver (lambda-hsCHM1-1 library) DNA, clone hsCHM1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2690)
AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
Zimmermann,W.
TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene
family: Isolation and sequencing of N-terminal domain exons
JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
J.A. Thompson 07-FEB-1989.
FEATURES from to/span description
pept 1250 1313 carcinoembryonic antigen hsCHM1, exon 1
/nomgen="CEA" /map="19q13.1-q13.3"
/hgml_locus_uid="LN0164S"
2158 / 2517 carcinoembryonic antigen hsCHM1, exon 2
IVS 1314 2157 carcinoembryonic antigen hsCHM1, intron A
IVS 2518 > 2690 carcinoembryonic antigen hsCHM1, intron B
BASE COUNT 823 a 637 c 653 g 577 t
ORIGIN Chromosome 19q31.1-q31.3.
1 gaattccaca gcaataacca cgatgacaac caccatgtac tcaacacccg cctgggcacg
61 gggctcccac agcagctcac ttattcccaa caactctgca aggaggattt taccatcctc
121 cttttacaaa tcagggaatc aaggatcata gaagccacgt gcacttgtcc aagtcaacat
181 agttaagtga cagaaccatt agctgtcccc aggtacatct ggacataaag ttcatgttta
241 tgccactgtg tcagcatttc caaaaactga ttttaggcga aacgtaagta agctttttaa
301 aaactttaat acttatgcgt ttattttaat atacattgag aaaacattta agcacacatc
361 aaatctgtaa tttcatggac aatattgcat aagacaagga tgttttgtct ccaactcctg
421 gcctcaagcc atcctcccac catagctctc gagtagctgg gatcacagac tgagccaccg
481 atccctgcta ggacaggatg ttttgtaaac taaatttatt tagaaaaaag gatgaaatat
541 ataataataa aggtggtaca agctagagag aaaatcataa agtcagccta gaaatgtctg
601 gtgtctggat gacataaagc tacagcactg tgaagcctca ttctcagtta ctcccaggaa
661 attagagtca cataatgctg cagaaagaac agctcagaat cttagatccg gctttagccc
721 tagatatatc catttgtagg accccagaca tctctgtgac ctccttgctg ggagtaaatc
781 caaccttccc agacatgtga gaacagtaag aagaccctgc acacacaaag gagtttctcc
841 gtcacagaga aaataacacc aggttcaggg accccaggga ctctgcatgg tgctgacaga
901 cccaaggcca aggcagagca gaggtccacg ctggggaggg agggtcatcc tgttatgaaa
961 cagggatcca agtaagcctt gcttctcaga gcctggtctg ggcaactcaa atgtagacag
1021 aaggccccaa ggaagaagag aaaatgaggc aaaactgaga ggggagggga cagagaggtg
1081 acctgggcag agcttcaccc atgaccctgg aaagtgctcc tgccctggga ggaggctcag
1141 catggaaaga ggaaggacag cagagcctaa gtcacagtag ccctgactac agcattcctg
1201 gagcccaggc tcttttccac agaggaggaa agagcaggca gcagagacca tggggccccc
1261 ctcagcctct ccccacagag aatgcatccc ctggcagggg cttctgctca caggtgagtg
1321 gaggattcct gggagtgggc aagaggaggg atcacagaga atggctgggg tctcctgggg
1381 aggatggggc tctgataggg gacagaaggc ttctgctgaa gcctcagggg agagaacatc
1441 agagagggac acgggtcaca acaagacaat cacattgaac tgggattgat aagagggagg
1501 aaaatccatt gatcatgttt tccaagttaa tcattactgg ccactacaat tagaaaatga
1561 taagaataag aattacatca gggtgatact ttaaataaaa atataaccag ggcactaaaa
1621 cctgtctttg ccccaaccac aagttgcaaa ataaccacca ctccttaact catccaccag
1681 tatttgcaat caaattttag gcactggcgt acaacaaata tcagacaagt ctctgtgttc
1741 aaagagctta cactcttgca gagatgaaga tagacaccca aagagatcta gaatgtgagt
1801 tcaggtgttg acaagagccc tggagggaac agagcagaaa aaggtcagaa agggacgccc
1861 cagggtctct agaggaggtg tcaggggagg gatctcccaa ggatgccctg atgtgagcag
1921 gatctgaggg cagtggggag ggagccatgc agacccctgg ggaaggggat tccacacagg
1981 aaaatgccaa ggttagaggt gctgaagaaa gaaaggtcac gttactgacc ttaaccaagt
2041 gggacacacc tacactctca aggctgaagg gagaagagac tctctcagga cccagggccc
2101 catctttcca tcccaataca tgggtaccaa tattgactga tgctttctcc ctcctagcct
2161 cacttctaaa cttctggaac ccgcccacca ctgccaagct cactattgaa tccacgccgt
2221 tcaatgtcgc agaggggaag gaggtgcttc tacttgtcca caatctgccc caacatcttt
2281 ttggctacag ctggtacaaa ggggaaagag tggatggcaa cagtctaatt gtaggatatg
2341 taataggaac tcaacaagct accccagggg ccgcatacag cggtcgagag acaatataca
2401 ccaatgcatc cctgctgatc cacaatgtca cccagaatga cataggattc tacaccctac
2461 aagtcataaa gtcagatctt gtgaatgaag aagcaactgg acagttccat gtataccgtg
2521 agtatttcca catgacctct ggaggttggg ggtcagttct acttcccaca tatgggattg
2581 tacggcctgg gctgtgcctc tggccctctc tgcattacat tctgtatcag ggtttggaca
2641 tttagtgcag gacacacacg ggggagacaa acttccacag actagaattc
//
LOCUS HUMCEAC 781 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human carcinoembryonic antigen (CEA) hsCGM2 gene, exon 2.
ACCESSION M22434 D51537
KEYWORDS carcinoembryonic antigen.
SOURCE Human fetal liver (lambda-hsCGM2-1 library) DNA, clone hsCGM2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 781)
AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
Zimmermann,W.
TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene
family: Isolation and sequencing of N-terminal domain exons
JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
J.A. Thompson 07-FEB-1989.
FEATURES from to/span description
pept / 221 / 583 carcinoembryonic antigen hsCGM2, exon 2 (AA at
223) /nomgen="CEA" /map="19q13.1-q13.3"
/hgml_locus_uid="LN0164S"
IVS < 1 220 carcinoembryonic antigen, intron A
IVS 584 > 781 carcinoembryonic antigen, intron B
BASE COUNT 220 a 195 c 184 g 182 t
ORIGIN Chromosome 19q31.1-q31.3.
1 ctgcagatcc ctggggaaga ggattccgaa cagggaaatg taaggtcaga ggtgctgata
61 ggggacatgc tgctgtcatt gatccagtag gacacacaca cacacacaca cttacttcaa
121 gatgggggtg ggtgaagaga cctgctcagg atccagggcc ccatctttcc accccaatac
181 ataggtccca atattgactg atgttctctc cccctcctag cctcgctttt aaccttctgg
241 aacctgccaa acagtgccca gaccaatatt gatgtcgtgc cgttcaatgt cgcagaaggg
301 aaggaggtcc ttctagtagt ccataatgag tcccagaatc tttatggcta caactggtac
361 aaaggggaaa gggtgcatgc caactatcga attataggat atgtaaaaaa tataagtcaa
421 gaaaatgccc cagggcccgc acacaacggt cgagagacaa tataccccaa tggaaccctg
481 ctgatccaga acgtcaccca caatgacgca ggattctata ccctacacgt tataaaagaa
541 aatcttgtga atgaagaagt aaccagacaa ttctacgtat tctgtgagtg atacctccat
601 gacttctggg tgctgggggc cagttctact tcatacacac ggggttgtca ggcctgggtt
661 gtgcctgtgt ccccatctac attttatcca gtgttggagt ttgggcattt agtgaaggac
721 acacatgggg gagacaaact tctacagacc agaatccctt tcctgcatcc agaccctgca
781 g
//
LOCUS HUMPSBGA 1042 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human pregnancy-specific B-1-glycoprotein gene, exon 2.
ACCESSION M22437 D51537
KEYWORDS glycoprotein; pregnancy-specific B-1 glycoprotein.
SOURCE Human fetal liver (lambda-PSBG-1 library) DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1042)
AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
Zimmermann,W.
TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene
family: Isolation and sequencing of N-terminal domain exons
JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
J.A. Thompson 07-FEB-1989.
FEATURES from to/span description
pept / 497 / 862 pregnancy-specific B-1 glycoprotein, exon 2
(AA at 499) /nomgen="B1G1" /map="6"
/hgml_locus_uid="LG0073P"
IVS < 1 496 pregnancy-specific B-1 glycoprotein, intron A
IVS 863 > 1042 pregnancy-specific B-1 glycoprotein, intron B
BASE COUNT 289 a 241 c 274 g 238 t
ORIGIN Chromosome 6.
1 gagctcacgc tgtcatgggg gggaagacag acatgcaaag agatctagaa tgtgaggtca
61 ggtgttgaca agaaccctgg agggagcaga gcagggaaag gtcagaaagg gaagacccag
121 ggtctctgaa gcaggcatca ggaaagaagt ctaacgatgc cctgatgtaa gcaggacctg
181 agggcattgt ggagggggcc atgcggaccc ctggggaaga ggattccaaa cagaaaaatg
241 ccaacgtcag aagtgttgaa ggaatgggcg tcatgctgct gaccttgacc tagtaggaca
301 gtaggacaca cacacataca cacacacaat cacacatgcc gcttttgtgt gtgtgtgtgt
361 gtgtgtgttt gtatgtgtgt gtgtgtctgt gttgtcaagg ctgaggactg aagagacctt
421 ctcaggaccc agggccccat gttttcacac caatacatag gtctcaatat tgactgatgt
481 tctctccacc tcctagcatc acttttaaac ttctggaacc tgcccaccac tgcccaagtc
541 acgattgaag ccgagccaac caaagtttcc gaggggaagg atgttcttct acttgtccac
601 aatttgcccc agaatcttac cggctacatc tggtacaaag ggcaaatgag ggacctctac
661 cattacatta catcatatgt agtagacggt gaaataatta tatatgggcc tgcatatagt
721 ggacgagaaa cagcatattc caatgcatcc ctgctgatcc agaatgtcac ccgggaggac
781 gcaggatcct acaccttaca catcataaag ggagatgatg ggactagagg agtaactgga
841 cgtttcacct tcaccttaca ccgtaagtga ttccacatga tccctgggtg ttgggggaca
901 ggggtcactt ctacttcaca cacacaggat tgtcaggcct ggactctgcc tgtgtcactc
961 actgcgttat gtcccatgct gggatttggg catttagtgc aggacacaca cagaggagac
1021 aaatttcaac agatcagaat tc
//
LOCUS ASNMTATP9 225 bp ds-DNA ORG 04-AUG-1990
DEFINITION A.nidulans mitochondrion ATPase subunit 9 gene, complete cds.
ACCESSION M35967
KEYWORDS ATPase.
SOURCE A.nidulans mitochondrion DNA.
ORGANISM Mitochondrion Aspergillus nidulans
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Plectomycetes; Eurotiales; Trichocomaceae; Aspergillus nidulans.
REFERENCE 1 (bases 1 to 225)
AUTHORS Brown,T.A., Ray,J.A., Waring,R.B., Scazzocchio,C. and Davies,R.W.
TITLE A mitochondrial reading frame which may code for a second form of
ATPase subunit 9 in Aspergillus nidulans
JOURNAL Curr. Genet. 8, 489-492 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 225 ATPase subunit 9
BASE COUNT 66 a 29 c 51 g 79 t
ORIGIN
1 atgttacaat cagctagaat aataggaaca ggattagcta caacaggttt aatcggagca
61 ggtgtaggta tcggtgtagt atttggtgcc ttaatcttag gagttgcaag aaaccctgca
121 ttaagaggtc aactgttttc atatgcaatt ttaggatttg catttgcgga agcaactgga
181 ttatttgctt taatgatggc ttttttatta ttatatgtag cttaa
//
LOCUS DOGPHA2 562 bp ss-mRNA MAM 04-AUG-1990
DEFINITION Canine phospholipase A-2 mRNA, complete cds.
ACCESSION M35301
KEYWORDS phospholipase A-2.
SOURCE Canine pancreas, cDNA to mRNA.
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 562)
AUTHORS Kerfelec,B., LaForge,K.S., Puigserver,A. and Scheele,G.
TITLE Primary structures of canine pancreatic lipase and phospholipase
A-2 messenger RNAs
JOURNAL Pancreas 1, 430-437 (1986)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by G.Scheele, 06-MAR-1990.
FEATURES from to/span description
pept 29 469 phospholipase A-2 precursor
sigp 29 73 phospholipase A-2 signal peptide
matp 95 466 phospholipase A-2
mRNA < 1 562 phospholipase A-2 mRNA
BASE COUNT 143 a 167 c 128 g 124 t
ORIGIN
1 ttctcttgct cagcaacttg accacaaaat gaaattcctc gtactggctg ctctgctcac
61 agtggccgct gccgaggggg gcatcagccc gcgggcagtt tggcagttcc gcaacatgat
121 caagtgcacg atccccgaga gtgacccctt gaaggactac aacgactacg gctgctactg
181 tggcctgggt ggatccggca cccctgtgga tgagttggac aagtgttgcc agacgcatga
241 ccactgctac tcagaagcca agaaactgga cagctgtaaa ttcctcctgg acaaccccta
301 caccaaaatc tactcatact catgctccgg ctctgagatc acctgcagca gcaaaaacaa
361 agactgccag gcctttatct gcaactgtga ccgcagtgct gccatctgct tttcaaaggc
421 tccgtataac aaggagcaca agaacctgga cactaagaag tactgctaga attgagtctc
481 ctctctggaa agcatcattc ctacatgcct cgtggccttc accctaccct gtgtcctcca
541 ataaagcact ttgttgaaag gc
//
LOCUS DOGPLIP 1493 bp ss-mRNA MAM 04-AUG-1990
DEFINITION Canine lipase mRNA, complete cds.
ACCESSION M35302
KEYWORDS lipase.
SOURCE Canine pancreas, cDNA to mRNA.
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 1493)
AUTHORS Kerfelec,B., LaForge,K.S., Puigserver,A. and Scheele,G.
TITLE Primary structures of canine pancreatic lipase and phospholipase
A-2 messenger RNAs
JOURNAL Pancreas 1, 430-437 (1986)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by G.Scheele, 06-MAR-1990.
FEATURES from to/span description
pept 35 1438 lipase precursor
sigp 35 85 lipase signal peptide
matp 86 1435 lipase
mRNA < 1 1493 lipase mRNA
BASE COUNT 410 a 377 c 378 g 328 t
ORIGIN
1 cgaaagggag ggaaggccca gaacatccca caggatggtg agcatctgga caatcgcgct
61 gtttctgctg ggagcagcca aagcaaagga agtttgctat gaacaaatcg gatgcttttc
121 tgacgcggag ccctgggccg ggacagcaat caggcctctg aaagttctcc cctggagccc
181 cgagagaatc ggcacccgct tcctgctcta caccaacaag aacccaaaca actttcaaac
241 tctccttccc tctgatccat caacgattga ggcatcaaat tttcaaacag acaagaagac
301 ccggttcacc atccatggct tcataaacaa gggagaagag aactggctgc tggatatgtg
361 caagaacatg ttcaaggttg aggaggtgaa ctgcatctgc gtggactgga agaaaggttc
421 ccaaacctca tatacacagg ctgccaacaa cgtgcgggtg gtgggcgccc aggtggccca
481 gatgctcagc atgctctcgg caaactacag ctactcacct tcccaagtcc agctcatcgg
541 ccacagccta ggagcccacg tggctgggga ggcagggagc aggactccag gtctgggcag
601 gattacaggg ttggatcctg tagaagcaag tttccagggc actcctgaag aggttcgact
661 tgatcccact gatgctgact ttgttgatgt gattcacaca gatgcagctc ccctgatccc
721 attcctgggt tttggaacaa gccaacagat gggtcacctt gacttcttcc ccaatggtgg
781 agaggaaatg ccaggatgca agaagaatgc cctgtcacag atcgtgaacc tagatggcat
841 ttgggaagga actcgggact tcgtggcttg caatcacctg agaagttaca agtattactc
901 agagagcatc ctcaaccctg atggatttgc ttcctacccc tgtgcttcct acagggcctt
961 tgaatctaac aagtgcttcc cctgcccaga tcaagggtgc ccacagatgg gtcactatgc
1021 tgataaattt gctgtcaaga caagtgatga gacacagaaa tacttcctga acaccggaga
1081 ttccagcaat tttgctcgct ggagatacgg ggtttctata acattgtctg ggaaaagagc
1141 cactggtcag gctaaagttg ctttgtttgg aagtaaggga aatactcatc aattcaatat
1201 cttcaagggg attctcaaac caggctctac tcattccaat gagtttgatg caaagcttga
1261 tgttggaaca attgagaaag tcaagtttct ttggaataac aacgtggtaa acccaacctt
1321 tcccaaagtg ggtgcagcca agatcaccgt gcaaaaggga gaggagaaaa cagtgcacag
1381 cttctgcagc gaaagcaccg tgagggagga tgttctgctc accctcacgc cctgttagcg
1441 tccagcgcaa cccgggcacc gtgttaacag caataaaaac cactgatgca ttt
//
LOCUS HUMPGDZ1 1106 bp ss-mRNA PRI 04-AUG-1990
DEFINITION Human MHC class II lymphocyte antigen (HLA-DNA) gene, complete cds.
ACCESSION M31525 M35125
KEYWORDS cell surface glycoprotein; class II gene; lymphocyte antigen;
major histocompatibility complex.
SOURCE Human B-cell cell line PGF, cDNA to mRNA, clone PGDZ1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1106)
AUTHORS Young,J.A.T. and Trowsdale,J.
TITLE The HLA-DNA (DZA) gene is correctly expressed as a 1.1kb mature
mRNA transcript
JOURNAL Immunogenetics 31, 386-388 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.T.Young, 22-JAN-1990.
FEATURES from to/span description
pept 77 829 MHC HLA-DNA precursor
/hgml_locus_uid="LV0117X" /nomgen="HLA-DNA"
/map="6p21.3"
sigp 77 151 MHC HLA-DNA signal peptide
matp 152 826 MHC HLA-DNA
mRNA 1 1106 MHC HLA-DNA mRNA
BASE COUNT 204 a 354 c 289 g 259 t
ORIGIN
1 cttcttcttt acctccgcct tgttcctgtc ctcaccacac ggactgagac tgatttgatt
61 aaagcaccag agtgtaatgg ccctcagagc agggctggtc ctggggttcc acaccctgat
121 gaccctcctg agcccgcagg aggcaggggc caccaaggct gaccacatgg gctcctacgg
181 acccgccttc taccagtctt acggcgcctc gggccagttc acccatgaat ttgatgagga
241 acagctgttc tctgtggacc tgaagaaaag cgaggccgtg tggcgtctgc ctgagtttgg
301 tgactttgcc cgctttgacc cgcagggcgg gctggccggc atcgccgcaa tcaaagccca
361 tctggacatc ctggtggagc gctccaaccg cagcagagcc atcaacgtgc ctccacgggt
421 gaccgtgctc cccaagtctc gggtggagct gggccagccc aacatcctca tctgcatcgt
481 ggacaacatc ttcccccctg tgatcaatat cacctggctg cgcaacggcc aaactgtcac
541 tgagggagtg gcccagacca gcttctattc ccagcctgac catttgttcc gcaagttcca
601 ctacctgccc ttcgtgccct cagccgagga cgtctatgac tgccaggtgg agcactgggg
661 cctggatgcg ccactcctca ggcattggga gctccaggtg cctattccac caccagatgc
721 catggagacc ctggtctgtg ccctgggcct ggccatcggc ctggtgggct tcctcgtggg
781 caccgtcctc atcatcatgg gcacatatgt gtccagtgtc cccaggtaat gatccttctg
841 agagaaatga cttgtgggag acaccctgca gatcctcatg ggtttgtgac agcccctgcg
901 tgctcagtgc cctttaagtg catcccgctg tgctgacttt gagtgggatc aacatctgtc
961 ctacgggtcc cctctttttt ggccccagta ttcatggcag ggtttgttgg acacctacta
1021 gcttcccttc ccattcaaca cacacacaca ttcttgctct acccaaagct ctggctggca
1081 gcactaaatg ctttggtggt gtttgc
//
LOCUS LACPEPXP 3963 bp ds-DNA BCT 04-AUG-1990
DEFINITION L.lactis X-prolyl dipeptidyl aminopeptidase (pepXP) gene,
complete cds.
ACCESSION M35865
KEYWORDS X-prolyl dipeptidyl aminopeptidase.
SOURCE L.lactis lactis (strain NCDO 763, ML3) DNA.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 3963)
AUTHORS Nardi,M., Chopin,M.-C., Chopin,A., Cals,M.-M. and Gripon,J.-C.
TITLE Cloning and DNA sequence analysis of an X-prolyl dipeptidyl
aminopeptidase determinant from Lactococcus lactis subsp. lactis
NCDO 763
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Chopin, 02-JUL-1990.
Author address: A.Chopin
Institut National de la Recherche Agronomique
Lab de G/032n/032tique Microbplenne
Institut de Biotechnologie CRJ
78352 Jouy-en-Josas Cedex
FRANCE
FEATURES from to/span description
pept 1105 3396 X-prolyl dipeptidyl aminopeptidase (pepXP) (EC
3.4.14.5)
pept 933 64 (c) ORF1 (put.)
pept 3398 3883 ORF3 (put.)
BASE COUNT 1261 a 748 c 733 g 1221 t
ORIGIN
1 actagtcatt gcacttatgg tcaaataaaa aaattactga caactctgtc agtaattttt
61 ttattaaaga taaatcattt taaataatgc aacggctgca agtgatgcaa gaattggtgc
121 caaaactgga acccatgcat accaccattt agatgaacct ttagcttcac ccaaaactga
181 ttttggtaag agactgtgaa ccaaacgtgg accaaagtca cgcgccggat tcaaaccagg
241 acctgtaggt ccaccaagtg caactacgag tcccataaca aggaatccta ggaacaagtg
301 agcaatcatt ttagaagctg aagctcctga tgcttgaacc caaatttgat tcataacgtc
361 tgatgaagaa acatctgctc cttgaccttt caagtagttt gtcatccaag taattgattg
421 actaccaaag aagatatttg tcgcagcaac cgcaccgaag aataatacaa atgaacctaa
481 gaactcattc aagaaaccat taatggttgc ccccaaacga gttttttcac tgttatcatc
541 aacgttgtcg atagttgaga aagttccaag aatagcattt ggattttgtg ttttaagata
601 gtatggacga taaaccataa cgattaataa ttgtccaaac atcgcaccta atacttgcgc
661 aataatgtat tgagcaacgt gtgcccaagg aaaaagtcct gaggctgcaa gtccaagtgt
721 gaaagctggg ttgatttgtg aagtaatatt accaaaagca actgctggca acattacgcc
781 aagtccgtaa ccccaaccaa taatcatcca agattgtgcg tgagcttttg tacctttaag
841 ttcaacgtta gcaaccgcac cattacccat aataatgagt agggcagttc caacaaattc
901 agtgatatat ttcactgtcc atgtaacatc cattttttat ggaaccctcc aaaatttgct
961 tttaatataa taacctcagc tattatacca gaaaagcacg tttgcttcaa atattcattg
1021 aaaatagcgc gcctttagtc tgattaatca ctgaaaattt gctataatta aaagtattat
1081 catgtttatt acggaggatt taaaatgcgc tttaaccatt tttcaattgt tgacaaaaat
1141 tttgatgaac aattagcaga acttgatcag cttggttttc gttggtctgt tttttgggat
1201 gaaaagaaaa tcttaaaaga ttttctcatt caaagtccta gtgatatgac tgctctacaa
1261 gcaactgctg aattagatgt cattgaattt ttaaaatcct ctatagaatt agattgggaa
1321 attttctgga atattgcttt gcaacttctt gattttgtac caaactttga ttttgagatt
1381 ggcaaagctt ttgagtatgc aaaaaatagc aacctccctc aaatcgaagc agaaatgacc
1441 acagaaaata tcatttctgc tttctactac ctcctttgta cgcgtcgaaa aactggaatg
1501 attttagttg aacattgggt atcagaggga cttcttcctt tagataatca ttatcatttt
1561 ttcaatgaca aatcgttggc aacctttgat tcttcacttc ttgaacgtga agtcctttgg
1621 gttgaaagtc cagtcgatag tgaacaacgg ggtgaaaatg atttgataaa aattcaaatc
1681 attcgtccaa aaagtaccga aaaacttcca gtggtaatga cggcgagtcc ttatcattta
1741 gggattaatg ataaagcaaa tgatttagcc ttacacgata tgaatgtaga acttgaagaa
1801 aaaaccagtc acgaaattca tgttgaacaa aagctcccgc aaaaattatc agcgaaagca
1861 aaggaacttc cgattgttga caaagctcct tatcgtttta ctcatggttg gacttattct
1921 ttaaatgact atttcttgac tagaggattt gcctctatct atgtggccgg tgttggaacc
1981 cgttcttctg acggctttca aacctcaggc gattaccaac aaatttatag tatgacagct
2041 gtcattgact ggttgaatgg tcgagctcgt gcttatactt ctcgtaaaaa gacacatgaa
2101 atcaaagcct cttgggcaaa tggtaaagtg gctatgactg gaaaatctta tctgggaacc
2161 atggcttatg gagccgctac tactggagtt gaaggtctag agcttatctt agccgaagct
2221 ggaatttctt cttggtataa ttattaccgt gaaaatggcc ttgtccgctc tcctggcggt
2281 tttcctggtg aagatctaga tgttcttgct gcgctcactt attcacgtaa tcttgatggt
2341 gctgatttct taaaaggtaa tgctgaatac gaaaaacgat tggcagaaat gactgctgct
2401 ttagaccgca aatctggtga ttacaatcaa ttttggcatg accgaaatta tctgataaat
2461 actgacaaag ttaaggctga tgtcttaatt gttcatggac tacaagattg gaatgtgaca
2521 ccagagcaag cctataattt ctggaaagct ctgccagaag gtcatgctaa acatgccttt
2581 ttacaccgtg gggctcatat ttatatgaat tcttggcagt ccattgattt ttctgaaacg
2641 attaatgctt actttgtcgc aaaattactg gatagagatt taaatttaaa tctcccacct
2701 gttattttac aggaaaattc taaagaccaa gtatggacga tgatgaatga ttttggtgcc
2761 aatacccaaa tcaaactccc tcttggtaaa acggccgttt cttttgctca attcgacaat
2821 aattatgacg atgagacttt caaaaagtat tctaaagatt tcaatgtctt taaaaaagac
2881 ttatttgaaa acaaagccaa tgaagctgtc attgatttag aacttccgtc aatgctgaca
2941 attaatggtc cagttgagct tgaactcaga ctaaaattaa atgacacgaa aggcttctta
3001 tctgctcaaa ttcttgattt tggtcagaaa aaacgtttgg aagataaagt cagagtgaaa
3061 gattttaaag tgcttgaccg tgggcgaaat ttcatgttag acgacttagt tgaactccca
3121 cttgttgaga gcccttatca gttagtcact aaaggcttta ctaatctcca aaatcaaagt
3181 ttactgacag tcagtgattt aaaggctgac gaatggttta caataaaatt tgaactacag
3241 ccaaccattt atcatttaga aaaagctgac aagcttcggg tcattctcta tagtactgac
3301 tttgaacata cggttcgtga taatcgtaaa gtgacttacg agattgattt atctcaatct
3361 aaactcatta ttcctattga aagtgtgaaa aattaatatg gcaattattg gaattcttgg
3421 aactccttac aacacagttg aacgttctcc tttttggtgg aacaaagtaa gctatacacg
3481 tcaatctttt attgatgttt tccaggaatt aggtcacact gtgattgttc ttcctgttga
3541 caaaaccgaa aatataaaaa attatctgcc attggtcgat aaaattgtac tgactggtgg
3601 cgttgatgtc agtccttatc tctatggcga ggaacctcat gctcaattag gaacgactga
3661 cccaattcgt gaccgttttg agcttgccgc aattaaagct gctctagaag cgaataaacc
3721 tattcttggt gtatgtcgtg gcctacaatt gctcaatgtt tattttggag gaactcttta
3781 tcaagacttg agcctaactt caagtcagat aaagcacctc caaagtccaa ctccccaaga
3841 agttccgaca catcacatta gcgtcgaaaa agaagttctt tagatttttt acctgaaaat
3901 tacatggtca attctttcca ccatcaagtc attaaggatt tgggtcaggg cttgcaggcc
3961 att
//
LOCUS PIGA1AG 668 bp ss-mRNA MAM 04-AUG-1990
DEFINITION Swine alpha-1 acid glycoprotein mRNA, 3' end.
ACCESSION M35990
KEYWORDS alpha-1 acid glycoprotein.
SOURCE Swine, cDNA to mRNA.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 668)
AUTHORS Stone,R.T. and Maurer,R.A.
TITLE Cloning and developmental regulation of alpha-1 acid glycoprotein
in swine
JOURNAL Dev. Genet. 8, 295-304 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 552 alpha-1 acid glycoprotein (AA at 1)
mRNA < 1 668 alpha-1 acid glycoprotein mRNA
BASE COUNT 181 a 186 c 170 g 131 t
ORIGIN
1 ccgctgtgcg ccaacttgac ggcagtgccc atcaccaatg ccaccttgga cttgatctcc
61 ggcaagtggt attatatcgg ctcggccttc cgaaaccccc agtacaatga gtcggccaga
121 tccatccagg cagccttctt tttctttgat cccaagcccg cggaggacaa gataaacctc
181 agagagtacc agaccatagg gaaccagtgc atctataacg acagctccct gaaggtccat
241 cgggagaatg gaagcctatc gaaacacgag atgggcagag aacacgttgc tgacctgctg
301 ttaaccaagg tccccaagac cttcatgctc atcaattcct tgcacgataa gaacaacgtg
361 gggctctcct tctacgccga caaggcggag gtgactccgg agcagatgaa agagttccat
421 gatgctattg agtgcacggg aatccacaag tcagaaatca catacaccga cgagaaaaag
481 gatctgtgtg ggccgctgga gaagcagcat gaggaggaaa ggaagaagga gaaggagaag
541 gaggggtcct agcattgggt caggtccttg ggcaccttgg aggcccatcc tcagcacctc
601 ccacccaccc tttgcacctc aattctctcc ctcagttgca tcaataaaac ttcgcatttg
661 gtacaggc
//
LOCUS RATMTNDI 946 bp ds-DNA ORG 04-AUG-1990
DEFINITION Rat NADH-dehydrogenase (NDI) gene, complete cds.
ACCESSION M35826
KEYWORDS NADH-dehydrogenase.
SOURCE Rat mitochondrion DNA.
ORGANISM Mitochondrion Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae;
Rattus norvegicus.
REFERENCE 1 (bases 1 to 946)
AUTHORS Quagliariello,C. and Spena,A.
TITLE Sequenza nucleotidica del gene NDI del DNA mitocondriale di fegato
di ratto
JOURNAL Boll. Soc. Ital. Biol. Sper. 64, 211-218 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 946 NADH-dehydrogenase (NDI) (att start codon)
BASE COUNT 294 a 299 c 103 g 250 t
ORIGIN
1 attaatatcc taacactcct aatcccaatc ttaattgcca taggccttct caccctagta
61 gaacggaaaa tcctaggcta catacaatta cgcaaaggcc ccaacaacga aggcccatat
121 ggtaaactac aaccatttgc agatgccata aaactattca taaaagaacc catacgccct
181 ctaaccacct caatatcact atttattatc gccccaaccc tctcccttac actagctcta
241 agcctatgaa ttcccttacc aataccccac ccccttatca acctcaacct aggcatacca
301 tttattctag ccacatccag cctttcggtc tactccattc tatgatcggg atgagcctca
361 aattcaaaat actccctatt cggagcccta cgagccgttg cccaaaccat ctcttacgaa
421 gtcacaatag ccttatacct cttatccgtc ctcctaataa gcggctcctt ctccctacaa
481 atacttatca ctacacaaga acatatctga ctattaatcc ccgcctgacc aatagccata
541 atatgataca tttcaaccct cgcagaaaca aatcgagctc ccttcgactt aacagaagga
601 gaatcagaat tagtctcagg ctttaacgtc gaatacgccg caggaccatt cgccctattc
661 ttcatagccg agtacaccaa cattattcta ataaacgccc taacatcaat tgtattccta
721 ggccccttat atcatatcaa ttaccctgaa ttatactcaa ccagcttcat aacagaaaca
781 ctacttctat ccacaacttt cctatgaatt cgagcatcct acccccgttt tcgatatgac
841 caactaatgc acctcctatg aaaaaatttc ctcccactaa cactagcatt ctgcatatga
901 tacatttccc tgccaatttt cctagcagga attccaccct acacat
//
LOCUS RICPETB 1682 bp ds-DNA PLN 04-AUG-1990
DEFINITION Rice chloroplast apocytochrome b6 (petB) gene, complete cds.
ACCESSION M35995
KEYWORDS apocytochrome b6.
SOURCE Rice (L. cv. Labelle) chloroplast DNA.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1682)
AUTHORS Cote,J.-C., Wu,N.-H. and Wu,R.
TITLE Nucleotide sequence of the rice chloroplast apocytochrome b6 gene
(petB)
JOURNAL Plant Mol. Biol. 11, 873-874 (1988)
STANDARD simple staff_review
COMMENT Authors indicate two potential petB products for the given
sequence. The first product contains an intron, and has its
initiation codon at position 67. The second product has no intron,
and is read straight through from position 829.
FEATURES from to/span description
pept 67 72 apocytochrome b6, exon 1 (alt.)
886 1527 apocytochrome b6, exon 2 (alt.)
pept 829 1527 apocytochrome b6 (alt.)
IVS 73 885 apocytochrome b6 intron A (put.) (no splice
consensus)
BASE COUNT 499 a 290 c 318 g 575 t
ORIGIN
1 gctctacatt tctagacatt ctggtagttc gaccgtggaa ttttttggtt tcggtatctc
61 tggaatatga gtgtgtgact tgttagaatt tgctcctatt gataatacat agaaagcacc
121 tgttatctct atcaagatga ttctaattcg tcggatatta tttattctag tatctggaac
181 acgaaataga tagagtggat caagaaaaaa aaatgaaact atgattcata ttaactattc
241 agacctcgca accagactga aaaaaattca agtagttctt aataaaaata aaaaaagaaa
301 atttcttcct tccaattttg tttgcccaaa aaacaacttt ttttctctcg attttgtcga
361 gtcattacac cgattcaata aatgatcatc aagcggttct tattcgaaga acccttgcct
421 tttgtttagc ttgagactca atcatcgtgg ctctagtatg aatctaaggt tttaattgaa
481 ctgattcata ggatcgcaac aagataattt ctaccagaaa actactccaa tttttgcttt
541 atttatttat ctagtaaaac aagagtaaat ctgcattacg cacaaaaaaa agaaatccaa
601 aatagggaag agaaaaatca agaggcctct aatgatcaac atttgggaaa gaaagataga
661 cgagccaact tgagattttt tggcattatc atcacaaaga agaaattctg gatttttctt
721 atttcatatc ttcaaggcaa atcgacccaa cccagtggct gatgaagttt tgaacctttt
781 ttctaatatc cgttgaaaat ttgtgtgttt ctgtttgagc cgtacgagat gaaattctca
841 tatacggttc tcggaggggg gttcgggtta gttacctatc tcaataaagt atatgattgg
901 tttgaggaac gtcttgagat tcaggcaatt gcagatgata taactagtaa atatgttcct
961 cctcatgtca acatatttta ttgtttaggg ggaattacac ttacttgttt tctagtacaa
1021 gttgctaccg gttttgctat gactttttac tatcgcccaa ccgttacaga ggctttttcc
1081 tcggttcaat acataatgac cgaggccaac tttggttggt taatccgatc agttcatcga
1141 tggtcagcaa gtatgatggt tctaatgatg atcctgcacg tatttcgtgt gtatctcaca
1201 ggtgggttta aaaaaccccg cgaattaact tgggtcactg gtgtggtttt agctgtatta
1261 actgcatcgt ttggtgtaac tggttattct ttaccttggg atcaaattgg ttattgggca
1321 gtcaaaattg tgacaggtgt gcctgacgcg attccggtaa taggatcacc tttagtggag
1381 ttattacgtg gaagtgctag tgtgggtcaa tccactttga ctcgttttta tagtttacat
1441 acctttgtac ttcctctgct tactgccgta tttatgttaa tgcactttct aatgatacgt
1501 aagcaaggta tttcgggccc tttataagga aggcatctca tagagagttc taattctcat
1561 atatcatatc gggtaggttg tggtatttca ttgctacaaa catgggttat tctaaaataa
1621 gacatgtcat ttagatactt ctcttcaact ccgaactatt gtgatacaaa taatacaaat
1681 ag
//
LOCUS VAZTHYK 1524 bp ds-DNA VRL 04-AUG-1990
DEFINITION Varicella-zoster virus thymidine kinase gene, complete cds.
ACCESSION M36160
KEYWORDS thymidine kinase.
SOURCE Varicella-zoster virus (strain Oka vaccine) DNA.
ORGANISM Varicella-zoster virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 1524)
AUTHORS Mori,H., Shiraki,K., Kato,T., Hayakawa,Y., Yamanishi,K. and
Takahashi,M.
TITLE Molecular analysis of the thymidine kinase gene of thymidine
kinase-deficient mutants of Varicella-zoster virus
JOURNAL Intervirology 29, 301-310 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 420 1445 thymidine kinase
BASE COUNT 434 a 354 c 327 g 408 t 1 others
ORIGIN
1 ctggcgcata ccctcgcaaa actggtgata cttagtaggg gtatgtatat tagcgctaaa
61 acggcaagat tttaattcca ctataaaaca aacggtcttt ccggcaccac tggattccgt
121 ttgtataata caaacacaat cggggcgtcg gcgtcccaaa tttacttcaa acgacattga
181 tatgcgtaca gccctttgaa catccacgtg ggataacggc gacaggagtt ttgccagcct
241 cgggttgaac gcgtccgcga aacctcgacg tacgttatca atatcctttt tgagtacatc
301 gtaaaaacga gtgtggcaac gttgtcccaa acgaaaacac ttggcccgaa ttcgactagc
361 ggacatattt gaagttccgt cccagaagat aacctaagac gcgtttgtct acaataaaca
421 tgtcaacgga taaaaccgat gtaaaaatgg gcgttttgcg tatttatttg gacggggcgt
481 atggaattgg aaaaacaacc gccgccgaag aatttttaca ccactttgca ataacaccaa
541 accggatctt actcattggg gagcccctgt cgtattggcg taaccttgca ggggaggacg
601 ctatttgcgg aatttacgga acacaaactc gccgtcttaa tggagacgtt tcgcctgaag
661 acgcacaacg cctcacggct cattttcaga gcctgttctg ttctccgcat gcaattatgc
721 atgcgaaaat ctcggcattg atggacacaa gtacatcgga tctcgtacaa gtaaataagg
781 agccgtataa aattatgtta tccgaccgac acccaatcgc ctcaactata tgttttccct
841 tgtccagata cttagtggga gatatgtccc cagcggcgct tcctgggtta ttgtttacgc
901 ttcccgctga accccccggg accaacttgg tagtttgtac cgtttcactc cccagtcatt
961 tatccagagt aagcaaacgg gccagaccgg gagaaacggt taatctgccg tttgttatgg
1021 ttctgagaaa tgtatatata atgcttatta atacaattat atttcttaaa actaacaact
1081 ggcacgcggg ctggaacaca ctgtcatttt gtaatgatgt atttaaacag aaattacaaa
1141 aatccgagtg tataaaacta cgcgaagtac ctgggattga agacacgtta ttcgccgtnc
1201 ttaaacttcc ggagctttgc ggagagtttg gaaatattct gccgttatgg gcatggggaa
1261 tggagaccct ttcaaactgc ttacgaagca tgtctccgtt cgtattatcg ttagaacaga
1321 caccccagca tgcggcacaa gaactaaaaa ctctgctacc ccagatgacc ccggcaaaca
1381 tgtcctccgg tgcatggaat atattgaaag agcttgttaa tgccgttcag gacaacactt
1441 cctaaatata cctagtattt acgtatgtac cagtaaaaag atgatacaca ttgtcatact
1501 cgcgtgtacg tgtttttctt tttt
//
LOCUS HUMCAMA 2175 bp ss-mRNA PRI 04-AUG-1990
DEFINITION Human calmodulin mRNA, complete cds.
ACCESSION J04046
KEYWORDS calmodulin.
SOURCE Human teratoma cell line, cDNA to mRNA, clone lambda-ht6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2175)
AUTHORS Fischer,R., Koller,M., Flura,M., Mathews,S., Strehler-Page,M.-A.,
Krebs,J., Penniston,J.T., Carafoli,E. and Strehler,E.E.
TITLE Multiple divergent mRNAs code for a single human calmodulin
JOURNAL J. Biol. Chem. 263, 17055-17062 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2175; revises [1])
AUTHORS Fischer,R., Koller,M., Flura,M., Mathews,S., Strehler-Page,M.-A.,
Krebs,J., Penniston,J.T., Carafoli,E. and Strehler,E.E.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by R.Fischer, 28-JUL-1988, and [2] 18-JUL-1990.
[2] Author address: M.Koller
Biochemistry Dept.
Swiss Federal Inst. of Technology
Universitaetsstr. 16
Zurich, CH
biochemie@czheth5a
FEATURES from to/span description
pept 104 553 calmodulin
mRNA 6 2175 CALM mRNA
BASE COUNT 430 a 626 c 592 g 527 t
ORIGIN
1 tgagtgtgga ggcgcggacg cgcggcggag ctggaactgc tgcagctgct gccgccgccg
61 gaggaacctt gatccccgtg ctccggacac cccgggcctc gccatggctg accagctgac
121 tgaggagcag attgcagagt tcaaggaggc cttctccctc tttgacaagg atggagatgg
181 cactatcacc accaaggagt tggggacagt gatgagatcc ctgggacaga accccactga
241 agcagagctg caggatatga tcaatgaggt ggatgcagat gggaacggga ccattgactt
301 cccggagttc ctgaccatga tggccagaaa gatgaaggac acagacagtg aggaggagat
361 ccgagaggcg ttccgtgtct ttgacaagga tgggaatggc tacatcagcg ccgcagagct
421 gcgtcacgta atgacgaacc tgggggagaa gctgaccgat gaggaggtgg atgagatgat
481 cagggaggct gacatcgatg gagatggcca ggtcaattat gaagagtttg tacagatgat
541 gactgcaaag tgaaggcccc ccgggcagct ggcgatgccc gttctcttga tctctctctt
601 ctcgcgcgcg cactctctct tcaacactcc cctgcgtacc ccggttctag caaacaccaa
661 ttgattgact gagaatctga taaagcaaca aaagatttgt cccaagctgc atgattgctc
721 tttctccttc ttccctgagt ctctctccat gcccctcatc tcttcctttt gccctcgcct
781 cttccatcca cgtcttccaa ggcctgatgc attcataagt tgaagccctc cccagatccc
841 cttggagcct ctgccctcct ccagcccgga tggctctcct tcattttggt ttgtttcctc
901 ttgtttgtca tcttattttg ggtgctgggg tggctgccag cctgtcccgg gacctgctgg
961 gagggacaag aggccctccc caggcagaag agcatgccct ttgccgttgc atgcaaccag
1021 ccctgtgatt ccacgtgcag atcccagcag cctgttgggg caggggtgcc aagagaggca
1081 ttccagaagg actgaggggg cgttgaggaa ttgtggcgtt gactggatgt ggcccaggac
1141 tgggtcgagg gggccaactc acagaagggg actgacagtg ggcaacactc acatcccact
1201 ggctgctgtt ctgaaaccat ctgattggct ttctgaggtt tggctgggtg gggactgctc
1261 atttggccac tctgcagatt ggacttgccc gcgttcctga agcgctctcg agctgttctg
1321 taaatacctg gtgctaacat cccatgccgc tccctcctca cgatgcaccc accgccctga
1381 gggcccgtcc taggaatgga tgtggggatg gtcgctttgt aatgtgctgg ttctcttttt
1441 ttttctttcc cctctatggc ccttaagact ttcattttgt tcagaaccat gctgggctag
1501 ctaaagggtg gggagaggga agatgggccc caccagctct caagagaaac gcacctgcaa
1561 taaaacagtc ttgtcggcca gctgcccagg gacggcagct acagcagcct ctgcgtcctg
1621 gtccgccagc acctcccgct tctccgtggt gacttggcgc cgcttcctca catctgtgct
1681 ccgtgccctc ttccctgcct cttccctcgc ccacctgcct gcccccatac tccccagcgg
1741 agagcatgat ccgtgccctt gcttctgact ttcgcctctg ggacaagtaa gtcaatgtgg
1801 gcagttcagt cgtctgggtt ttttcccctt ttctgttcat ttcatctggc tccccccacc
1861 acctccccac cccacccccc accccctgct tcccctcact gcccaggtcg atcaagtggc
1921 ttttcctggg acctgcccag ctttgagaat ctcttctcat ccaccctctg gcacccagcc
1981 tctgagggaa ggagggatgg ggcatagtgg gagacccagc caagagctga gggtaagggc
2041 aggtaggcgt gaggctgtgg acattttcgg aatgttttgg ttttgttttt tttaaaccgg
2101 gcaatattgt gttcagttca agctgtgaag aaaaatatat atcaatgttt tccaataaaa
2161 tacagtgact acctg
//
LOCUS BMEGDH1 2834 bp ds-DNA BCT 04-AUG-1990
DEFINITION B. megaterium glucose dehydrogenase gene and ORFs.
ACCESSION D90043
KEYWORDS glucose dehydrogenase.
SOURCE Bacillus megaterium (strain IAM1030) DNA.
ORGANISM Bacillus megaterium
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2834)
AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
Urabe,I. and Okada,H.
TITLE Active and silent isozyme genes of glucose dehydrogenase from
Bacillus megaterium IAM1030
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Toshihide
Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan.
FEATURES from to/span description
pept 1964 2749 glucose dehydrogenase (EC 1.1.1.47)
ORF 181 867 ORF1
ORF 1086 1946 ORF2
signal 89 94 put. -35 region for ORF1
signal 116 121 put. -10 region for ORF1
signal 167 172 ORF2 ribosome binding site
signal 893 928 termination signal
signal 972 977 put. -35 region for ORF2
signal 997 1003 put. -10 region for ORF2
signal 1070 1076 ORF2 ribosome binding site
signal 1949 1954 glucose dehydrogenase ribosome binding site
signal 2751 2783 termination signal for glucose dehydrogenase
BASE COUNT 896 a 442 c 644 g 852 t
ORIGIN
1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga
61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag
121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa
181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg
241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt
301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat
361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca
421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat
481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag
541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta
601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat
661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct
721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt
781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt
841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg
901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt
961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt
1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt
1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt
1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt
1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg
1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta
1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca
1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg
1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag
1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa
1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat
1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca
1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg
1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc
1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga
1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag
1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga
1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg
2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa
2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt
2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt
2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt
2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga
2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt
2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa
2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat
2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg
2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc
2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc
2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca
2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga
2821 ctggtgttga attc
//
LOCUS BMEGDH2 1202 bp ds-DNA BCT 04-AUG-1990
DEFINITION B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene.
ACCESSION D90044
KEYWORDS glucose dehydrogenase; isozyme.
SOURCE Bacillus megaterium (strain IAM1030) DNA.
ORGANISM Bacillus megaterium
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1202)
AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
Urabe,I. and Okada,H.
TITLE Active and silent isozyme genes of glucose dehydrogenase from
Bacillus megaterium IAM1030
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Toshihide
Mitamura Department of Fermentation Technology Osaka University 2-1
Yamada-oka Suita, Osaka 565 Japan Phone: 06-877-5111 x4373 Fax:
06-876-9036
FEATURES from to/span description
pept 125 910 glucose dehydrogenase
signal 27 32 put. -35 region
signal 49 55 put. -10 region
signal 111 116 SD sequence
signal 928 959 termination signal
BASE COUNT 427 a 190 c 268 g 317 t
ORIGIN
1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa
61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt
121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt
181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg
241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc
301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc
361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt
421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg
481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa
541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc
601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa
661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa
721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg
781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt
841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg
901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt
961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg
1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa
1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg
1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg
1201 ag
//
LOCUS BMOPTTHP1 1023 bp ss-mRNA INV 04-AUG-1990
DEFINITION B.mori PTTH mRNA.
ACCESSION D90082
KEYWORDS PTTH; preproPTTH.
SOURCE B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to
mRNA, clones P1, P2, C2, C9 and C19.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 1023)
AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone
JOURNAL Science 247, 1333-1335 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hironori
Ishizaki Department of Biology, School of Science, Nagoya
University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111
x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323
FEATURES from to/span description
pept 34 708 preproPTTH
matp 379 708 PTTH subunit
signal 768 773 polyadenylation signal
signal 826 831 polyadenylation signal
signal 997 1002 polyadenylation signal
signal 1004 1009 polyadenylation signal
BASE COUNT 358 a 179 c 186 g 300 t
ORIGIN
1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc
61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg
121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac
181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg
241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta
301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa
361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct
421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc
481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc
541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct
601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg
661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt
721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata
781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc
841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt
901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca
961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt
1021 aaa
//
LOCUS BMOPTTHP4 944 bp ss-mRNA INV 04-AUG-1990
DEFINITION B.mori preproPTTH mRNA.
ACCESSION D90083
KEYWORDS PTTH; preproPTTH.
SOURCE B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to
mRNA, clone P4 and C21.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 944)
AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone
JOURNAL Science 247, 1333-1335 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hironori
Ishizaki Department of Biology, School of Science, Nagoya
University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111
x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323
FEATURES from to/span description
pept < 1 631 preproPTTH
matp 302 631 PTTH subunit
signal 691 696 polyadenylation signal
signal 749 754 polyadenylation signal
signal 918 923 polyadenylation signal
signal 925 930 polyadenylation signal
BASE COUNT 337 a 163 c 177 g 267 t
ORIGIN
1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt
61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag
121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga
181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa
241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag
301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa
361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg
421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag
481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga
541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag
601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt
661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa
721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag
781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt
841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata
901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa
//
LOCUS HUMMTSDHB 958 bp ss-mRNA ORG 04-AUG-1990
DEFINITION Human mitochondrial succinate-ubiquinone oxidoreductase
(EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA.
ACCESSION D90047
KEYWORDS Ip; complex II; iron sulfur subunit; sdh B;
succinate-ubiquinone oxidoreductase.
SOURCE Human liver mitochondrion, cDNA to mRNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 958)
AUTHORS Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M.
TITLE Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning
of iron sulfur(Ip) subunit of liver mitochondria
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Kiyoshi
Kita Department of Parasitology Juntendo University 2-1-1,Hongo
Bunkyo-ku, Tokyo 113 Japan Phone: 03-813-3111 x3542 Fax:
03-814-9300
FEATURES from to/span description
pept < 1 789 succinate-ubiquinone oxidoreductase (sdh B)
(AA at 1)
site 74 95 iron-sulfur binding site I
site 167 179 iron-sulfur binding site II
site 224 236 iron-sulfur binding site III
BASE COUNT 319 a 212 c 204 g 223 t
ORIGIN
1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt
61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg
121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag
181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt
241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac
301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat
361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag
421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag
481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc
541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg
601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc
661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat
721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct
781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt
841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata
901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS HUMNCAW 2287 bp ss-mRNA PRI 04-AUG-1990
DEFINITION Human nonspecific cross-reacting antigen (NCA-W272) mRNA.
ACCESSION D90064
KEYWORDS CEA; CEA gene family; PI-anchored membrane protein.
SOURCE Human white blood cells, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2287)
AUTHORS Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and
Matsuoka,Y.
TITLE Characterization of a cDNA clone encoding a new species of the
nonspecific cross-reacting antigen (NCA), a member of the CEA gene
family
JOURNAL Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Fumiko
Arakawa First Department of Biochemistry School of Medicine Fukuoka
University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone:
092-801-1011 x2892 Fax: 092-801-3600
FEATURES from to/span description
ORF 87 1136 nonspecific cross-reacting antigen ORF
BASE COUNT 618 a 593 c 453 g 623 t
ORIGIN
1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag
61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc
121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca
181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc
241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa
301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag
361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg
421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg
481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca
541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc
601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc
661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac
721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga
781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag
841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt
901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa
961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca
1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag
1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc
1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa
1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact
1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc
1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa
1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt
1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc
1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga
1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga
1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc
1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt
1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt
1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa
1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca
1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat
1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga
2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt
2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac
2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc
2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg
2281 gctgatc
//
LOCUS PIGDESTN 1666 bp ss-mRNA MAM 04-AUG-1990
DEFINITION Porcine destrin mRNA.
ACCESSION D90053 J05290
KEYWORDS actin-binding protein; cofilin; destrin.
SOURCE Pig adult brain, cDNA to mRNA, clone PD2.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1666)
AUTHORS Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S.,
Iida,K. and Yahara,I.
TITLE Destrin, a mammalian actin-depolymerizing protein, is closely
related to cofilin: Cloning and expression of porcine brain destrin
cDNA
JOURNAL J. Biol. Chem. 265, 5768-5773 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Kenji
Moriyama Department of Biophysics and Biochemistry Faculty of
Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan
Phone: 03-821-2111 x4408
FEATURES from to/span description
pept 54 551 destrin
mRNA < 1 1666 destrin mRNA
signal 1645 1660 polyadenylation signal
BASE COUNT 492 a 322 c 372 g 480 t
ORIGIN
1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt
61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt
121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag
181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa
241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg
301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc
361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg
421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc
481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat
541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc
601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg
661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct
721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga
781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt
841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt
901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt
961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac
1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca
1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca
1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt
1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta
1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag
1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc
1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg
1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga
1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt
1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta
1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taactt
//
LOCUS RATPMP70X 3324 bp ss-mRNA ROD 04-AUG-1990
DEFINITION Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA.
ACCESSION D90038 J05256
KEYWORDS PMP70; peroxisomal membrane protein.
SOURCE Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181,
189,201].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3324)
AUTHORS Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T.
TITLE The 70-kDa Peroxisomal Membrane Protein Is a Member of the
Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Keiju
Kamijo Department of Biochemistry Shinshu University School of
Medicine Matsumoto 390 Japan Phone: 263-35-4600 x5182 Fax:
263-33-6458
FEATURES from to/span description
pept 36 2015 peroxisomal membrane protein (PMP70)
signal 3271 3276 polyadenylation signal (put.)
BASE COUNT 921 a 658 c 773 g 972 t
ORIGIN
1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga
61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc
121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg
181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac
241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta
301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac
361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact
421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg
481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag
541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc
601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta
661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg
721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa
781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta
841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa
901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct
961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg
1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca
1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc
1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg
1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta
1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg
1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa
1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg
1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat
1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc
1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa
1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg
1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg
1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac
1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca
1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc
1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa
1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt
2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta
2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca
2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat
2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg
2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga
2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct
2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt
2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat
2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt
2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt
2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg
2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac
2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg
2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg
2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca
2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc
3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg
3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa
3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt
3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat
3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag
3301 attaaaaaaa aaaaaaagga attc
//
LOCUS VACSANT 1525 bp ds-DNA VRL 04-AUG-1990
DEFINITION Vaccinia virus surface (S) antigen gene.
ACCESSION D90076
KEYWORDS S gene; surface antigen.
SOURCE Vaccinia virus DNA.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 1525)
AUTHORS Ueda,Y., Morikawa,S. and Matsuura,Y.
TITLE Identification and nucleotide sequence of the gene encoding a
surface antigen induced by vaccinia virus
JOURNAL Virology (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Yoshiaki
Ueda National Institute of Health Gakuen, Musashimurayama Tokyo
190-12 Japan Phone: 0425-61-0771 Fax: 0425-65-3315
FEATURES from to/span description
pept 382 1437 surface antigen S
BASE COUNT 568 a 229 c 258 g 470 t
ORIGIN
1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata
61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat
121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa
181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat
241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac
301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata
361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca
421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc
481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt
541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt
601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa
661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta
721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca
781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc
841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga
901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat
961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta
1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat
1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa
1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact
1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga
1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc
1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt
1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata
1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt
1501 gctatagaat gaatgaggat acgcg
//
LOCUS YSCA1 881 bp ds-DNA PLN 04-AUG-1990
DEFINITION S. cerevisiae acidic ribosomal protein A1 (YSCA1).
ACCESSION D90072 X13682
KEYWORDS acidic ribosomal protein; ribosomal protein.
SOURCE S. cerevisiae (strain IFO-40028) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 881)
AUTHORS Mitsui,K. and Tsurugi,K.
TITLE Identification of A1 protein as the fourth member of 13 kDa-type
acidic ribosomal protein family in yeast Saccharomyces cerevisiae
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 315; 631 to 881)
AUTHORS Mitsui,K. and Tsurugi,K.
TITLE Identification of A1 protein as the fourth member of 13 kDa-type
acidic ribosomal protein family in yeast Saccharomyces cerevisiae
JOURNAL Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989)
STANDARD full staff_entry
REFERENCE 3 (bases 277 to 742)
AUTHORS Tsurugi,K. and Mitsui,K.
TITLE cDNA and deduced amino acid sequence of acidic ribosomal protein A1
from Saccharomyces cerevisiae
JOURNAL Nucleic Acids Res. 16, 3574-3574 (1988)
STANDARD simple automatic
COMMENT These data kindly submitted in computer readable form by: Kazuhiro
Mitsui Department of Biochemistry Yamanashi Medical college Tamaho,
Nakakoma-gun Yamanashi 409-38 Japan Phone: 0552-73-1111 x2257
FEATURES from to/span description
pept 313 633 acidic ribosomal protein A1
signal 125 135 UASrpg box1
signal 182 192 UASrpg box2
signal 716 721 poly(A) signal
variant 303 303 a in [1]; g in [3]
variant 684 685 tt in [1]; t in [3]
variant 719 719 a in [1]; t in [3]
BASE COUNT 270 a 165 c 157 g 289 t
ORIGIN
1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt
61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca
121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa
181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca
241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa
301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac
361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca
421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac
481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct
541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc
601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg
661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa
721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg
781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc
841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c
//
LOCUS YSCCDC23X 3107 bp ds-DNA PLN 04-AUG-1990
DEFINITION S. cerevisiae CDC23 protein gene.
ACCESSION D90081
KEYWORDS CDC23 protein.
SOURCE Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line
D22) DNA, clone YX34.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3107)
AUTHORS Doi,A. and Doi,K.
TITLE Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces
cerevisiae
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Kenji Doi
The Institute of Scientific and Industrial Research Osaka
University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111
Fax: 06-877-4977
FEATURES from to/span description
ORF 765 2645 ORF for CDC23
site 153 161 calcium-binding site
BASE COUNT 1010 a 595 c 629 g 873 t
ORIGIN
1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact
61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt
121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc
181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt
241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg
301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct
361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt
421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga
481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag
541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat
601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact
661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc
721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc
781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac
841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag
901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag
961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt
1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt
1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc
1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta
1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc
1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga
1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag
1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa
1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact
1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat
1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta
1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg
1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc
1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg
1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt
1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg
1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact
1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt
2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca
2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca
2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt
2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg
2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca
2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat
2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg
2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa
2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta
2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata
2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat
2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc
2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact
2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc
2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc
2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga
3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga
3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag
//
LOCUS BIM127 8482 bp ss-RNA VRL 04-AUG-1990
DEFINITION Bovine immunodeficiency-like virus, complete proviral genome.
ACCESSION M32690
KEYWORDS complete genome.
SOURCE Bovine immunodeficiency-like virus (isolate HXB3), proviral DNA,
clone 127.
ORGANISM Bovine immunodeficiency-like virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 8482)
AUTHORS Garvey,K.J., Oberste,M.S., Elser,J.E., Braun,M.J. and Gonda,M.A.
TITLE Nucleotide sequence and genome organization of biologically active
proviruses of the bovine immunodeficiency-like virus
JOURNAL Virology (1990) In press
STANDARD full staff_review
COMMENT Kindly submitted in computer-readable form prior to publication by
Dr. M.A. Gonda, Frederick Cancer Research Facility, Frederick, MD.
Clones 127 and 106 (see <BIV106>, Accession number M32691) were
obtained from a single virus isolation; the latter, 106, has 8391
nucleotides as the result of a sizeable deletion in env and two
minor deletions in the LTR R regions. The Q and S orfs are named by
analogy with similar orfs in visna; other short reading frames can
be identified.
FEATURES from to/span description
pept 316 1746 gag polyprotein
pept < 1572 4742 pol polyprotein
pept 4601 5197 protein Q (put.)
pept 5228 5539 protein S (put.)
pept 5415 8129 env polyprotein
BASE COUNT 2699 a 1798 c 2017 g 1968 t
ORIGIN
1 ttcagctcgt gtagctcatt agctccgagc tccccaacct acagcctgag aggcactggc
61 tcggttgggt agccagcctt tcgggtaata aaggcttgtt ggcattcggc atctacccgt
121 gcctcctgtc ttgtcttact cgagcgaacc cacaactccg tcctgctgag ctcacagctc
181 gcggggcggt gaagaacacc caacagttgg cgcccaacgt ggggctcgag taagagagac
241 tcggctcgag taaaagaaga cccagctcga acgagaagac tccggacagg tgagtagttg
301 cgtgttttcc ccgggatgaa gagaagggag ttagaaaaga agcttcgtaa ggttagggtg
361 acaccccaac aggataaata ttatactata gggaatcttc aatgggccat tagaatgata
421 aatctaatgg ggatcaaatg tgtgtgtgac gaggagtgct cggcagcaga ggtagccctt
481 atcataaccc aattttcagc tttagactta gaaaattctc ctatcagagg taaggaggag
541 gtggccataa aaaatactct gaaggttttc tggtccctgc tggcggggta caaaccagag
601 agtacagaaa cggccctagg atattgggag gcctttacat atagagaaag ggaggccaga
661 gctgataagg aaggcgaaat taagagtatt tacccttccc taacacagaa cacacagaat
721 aagaagcaga catcgaatca gacaaacact caatcattac cagctatcac tactcaagat
781 ggtactccta ggtttgatcc tgacctcatg aagcagctta agatctggtc agacgccact
841 gaaagaaatg gggttgacct tcatgcagtg aatatattag gggtcattac agcaaaccta
901 gtacaggaag aaattaaact cctcttgaat agtacaccca agtggagatt agatgtacaa
961 cttatagaat caaaagtaag agagaaagaa aatgcccaca gaacgtggaa acagcatcat
1021 ccagaagccc caaaaacaga tgaaatcatc ggtaaggggc ttagttctgc tgaacaagcc
1081 accctgatct cagtagaatg cagagaaact ttcagacagt gggtgctgca ggcagctatg
1141 gaggtggcac aggcaaaaca tgctacccca ggtcccatca acattcatca gggacccaag
1201 gagccgtaca cagactttat aaatagatta gtggcagccc ttgaaggtat ggcggctcca
1261 gaaaccacaa aagaatactt actccaacat ctatctattg atcatgccaa tgaagactgc
1321 cagtctattc taagaccttt gggacccaac accccaatgg agaaaaaatt agaagcatgt
1381 agggtagtgg gatctcagaa atcaaagatg caatttttgg tagcagctat gaaagaaatg
1441 gggatccaat caccaattcc agcagtcttg cctcacacac cagaagcata tgcctcccaa
1501 acctcagggc ccgaggatgg taggagatgt tacggatgtg ggaagacagg acatttgaag
1561 aggaattgta aacagcaaaa atgctaccat tgtggcaaac ctggccacca agcaagaaac
1621 tgcaggtcaa aaaacgggaa gtgctcctct gccccttatg ggcagaggag ccaaccacag
1681 aacaattttc accagagcaa catgagttct gtgaccccat ctgcaccccc tcttatatta
1741 gattagacaa acagcctttt ataaaggtgt tcataggggg aagatgggta aaagggttag
1801 tagacactgg agcagatgag gtagtgctta agaacataca ttgggatagg ataaaagggt
1861 atccagggac accaattaaa caaattgggg taaatggagt aaatgtggcc aaaaggaaga
1921 cccacgtaga gtggagattt aaggataaga ctgggataat tgatgtcttg ttctcagata
1981 ctcctgtaaa cctttttggg agatctcttc tacgtagcat agtgacttgc ttcaccctac
2041 ttgttcacac agaaaaaatc gaacccctac ccgtcaaggt aaggggacca gggcctaagg
2101 taccccagtg gcccttgaca aaagaaaagt atcaggctct taaggaaatt gtgaaagatc
2161 ttttagcaga aggaaaaatt tccgaagctg cttgggataa cccatataat accccagttt
2221 ttgttataaa gaaaaaggga acgggaagat ggaggatgct aatggatttt agggaattaa
2281 ataagataac agttaaagga caagaattct ctacaggctt accttaccct ccaggaatta
2341 aggaatgtga acacttaact gcaatagata taaaagatgc ctactttact atccctttac
2401 atgaggactt tagacccttt acagccttct ctgtagtccc tgtaaatcga gaaggaccta
2461 tagagaggtt ccagtggaat gttctaccac aaggatgggt atgtagccct gccatttatc
2521 agactaccac ccagaagatt atagaaaaca ttaaaaagag tcacccagat gtcatgttgt
2581 atcaatatat ggatgatttg ttgattgggt ctaataggga tgatcataag caaatagtgc
2641 aggaaatcag ggataagtta ggatcatatg gtttcaagac tccagatgaa aaggtccagg
2701 aagagagagt gaaatggatc ggttttgagc tcacacccaa gaaatggcgt tttcagccca
2761 ggcaactaaa gataaaaaac ccactcacag taaatgaatt acagcaatta gtaggtaatt
2821 gtgtttgggt acagccagaa gtaaaaatcc ctctataccc cttaaccgat ctactgaggg
2881 ataagaccaa tctccaagaa aagatacaac taacaccaga agccatcaag tgtgtagaag
2941 aattcaatct aaaactaaaa gatccagaat ggaaagatag aataagagaa ggagcagaat
3001 tagtcataaa aatacagatg gttcctcggg gcatagtatt tgatctgttg caagatggaa
3061 atcccatatg gggaggagta aaaggactaa attatgatca ttcaaacaaa ataaaaaaga
3121 tacttagaac tatgaatgag ctgaacagaa cagtggtaat tatgacagga agagaagcta
3181 gtttcctgct tcctgggtct tctgaagatt gggaagcggc actccagaag gaagaaagtc
3241 taacacaaat attcccagta aagttttata ggcactcctg cagatggacc tccatatgtg
3301 ggccagtaag agaaaatcta accacctact atactgacgg agggaagaaa gggaaaacag
3361 ctgcagcagt atattggtgt gaaggaagga ctaagtcaaa ggtatttcca ggaaccaatc
3421 aacaggcgga attgaaggcc atatgcatgg ctctcttgga tggaccacca aaaatgaata
3481 tcataacaga tagtagatac gcctatgagg gaatgagaga agaaccagaa acgtgggcca
3541 gggaaggaat ctggctggag attgccaaga tattgccctt taagcagtac gtgggggtcg
3601 ggtgggtgcc tgcacataaa gggataggag gaaatacaga ggcagatgaa ggagttaaga
3661 aagccttaga acagatggcc ccgtgtagcc ctcctgaggc cattctatta aaaccaggag
3721 aaaaacaaaa tctggagaca gggatctaca tgcaggggct tagaccacaa agcttcctcc
3781 caagagcaga cttaccagta gccatcacag gaaccatggt agattcagag ctacagctac
3841 agctacttaa cataggaact gagcatataa gaatccaaaa agatgaggtc ttcatgacct
3901 gtttcctaga aaatatcccc tcagccactg aagatcatga gagatggcat acctcaccag
3961 acattttggt taggcagttc catctcccta agagaatagc taaagagata gtagccagat
4021 gccaagaatg taaaaggaca accactagcc cagtcagagg aacaaacccc agaggtcgat
4081 tcttatggca gatggacaat actcactgga ataaaacaat tatttgggta gcagtagaga
4141 caaattcagg attagtggaa gctcaggtga tccctgaaga aacagcacta caagtagctc
4201 tctgcatttt acagctaatc cagagatata cagttcttca cttacatagt gacaacgggc
4261 cgtgctttac tgcacacagg atagaaaatc tatgtaagta tctggggatc acaaaaacta
4321 cgggaatacc ctacaaccca caatcccagg gagttgtaga aagagcccac agagatctaa
4381 aagacagatt ggcagcttat cagggagatt gtgaaaccgt agaagcagcc cttagcctcg
4441 cattagtttc tttaaataaa aaaagagggg gaataggggg ccatacacca tatgaaatat
4501 acctagaatc agaacatacc aaataccaag accaactaga acaacaattt tcaaaacaaa
4561 aaattgaaaa gtggtgttac gtaaggaaca gaagaaagga atggaaagga ccctacaaag
4621 tgttgtggga cggagacggg gcagcagtaa tagaggaaga gggaaaaaca gccttatatc
4681 cacaccgtca tatgcgcttc atcccccccc cagattcaga tatccaagat gggagttcgt
4741 gaggcagaca gaatacagca tgaccgcgtg cgtaagaaaa gggaaattag tccttactta
4801 ccagtacgcg atctggaaaa gagtctggac gatagaaaca ggatttacag atccaagtct
4861 gtttatgacc ccagctggaa cacacaccac tgaagaaata ggtcacttag atctcttttg
4921 gcttaggtac tgttcatgtc cgcatgagat gcccccgtgg ctagacttcc ttagaggcac
4981 cctcaatcta cgcatttcct gtcgacgcgc tcttcaagcg tcagtgttga ctagcacccc
5041 tagacactcc ctccaacgct tagctgcact tcagctgtgc actaacgcat gtctctgttg
5101 gtacccgtta ggacgcatca acgacaccac cccgttgtgg ttgaactttt cgtctgggaa
5161 ggaaccaacg atccaacaac tgagtggcca cccctaactc gtcgtaacat tcatagattg
5221 tggcaatatg cccggacctt gggtggcgat gataatgttg ccacagccca aagaaagctt
5281 tggaggaaag ccaattggct ggcttttctg gaacacgtgc aaaggaccta ggcgggactg
5341 tccacattgt tgttgtccca tatgtagttg gcattgtcag ctttgctttt tgcagaaaaa
5401 tctaggaatc aactatggat caggacctag acggcgcgga acgcggggaa aggggaggag
5461 gatccgaaga actgcttcag gaggagatca acgaagggag gctgacagcc agagaagctt
5521 tacaaacatg gatcaataac ggtgagatcc acccttgggt cctggcagga atgctgtcca
5581 tgggagtagg aatgctacta ggagtatatt gtcagttacc agacacactg atttggatac
5641 taatgtttca attatgcctt tattggggtt tgggtgaaac atctagagaa ttagacaagg
5701 atagttggca gtgggtcaga agtgtattta taatagcaat attgggaact ctcactatgg
5761 caggaactgc tttggccgac gacgatcaaa gtactttaat ccccaatatc acaaaaattc
5821 ctacaaagga cacggaaccc ggttgcacct atccgtggat attaatcctc ttgattttgg
5881 ctttcatact gggaattctg ggtataatac ttgtcttgag acgcagcaac tcggaggata
5941 tattggcagc cagagatacc atagattggt ggctctcagc taatcaggaa atacctccaa
6001 agtttgcttt cccaataata ttaatatctt cccctctagc aggcataata ggatattatg
6061 tcatggaaag gcacttagag atcttcaaaa agggatgtca aatttgtggg agcctgagca
6121 gcatgtgggg aatgcttttg gaagaaattg gcaggtggct cgcacgtagg gaatggaatg
6181 ttagtagagt aatggttatc ctcttaatca gcttcagttg gggaatgtat gtcaataggg
6241 taaatgcctc agggtcacat gtagccatgg tcaccagccc tccagggtac cgcatagtga
6301 atgataccag ccaggcacct tggtattgct tctcctcggc accaatccca acgtgtagtt
6361 cctctcagtg gggagacaaa tattttgagg agaaaataaa cgagacactg gtcaaacagg
6421 tgtatgaaca ggccgcgaaa cattcgagag ccacatggat tgaacctgat ctattggagg
6481 aagcagtcta tgagctagct ctgttatcag ctaatgacag tcgtcaggtg gtggtagaaa
6541 atggtacaga cgtatgtagc tcacagaact cgagcacaaa caaaggccac ccaatgacgc
6601 ttctaaagtt gagagggcag gtgtcagaaa cttggatagg gaattcctcc ctccagtttt
6661 gtgtccagtg gccatatgtc ttggtaggtc ttaataatag tgatagtaat attagcttca
6721 attcgggaga ttggatagca accaattgta tgcacccaat tacactaaat aaaagtgcac
6781 aagatctagg aaaaaatttt ccgagactaa catttcttga cggacaactg tcccagttga
6841 agaacacact gtgcggacat aacacaaact gtttgaaatt tggaaacaag tccttcagta
6901 caaattccct aatactatgc caagacaacc ccatcggcaa cgacaccttt tatagcctaa
6961 gtcattcctt ctcaaaacag gcctctgccc ggtggattct tgtaaaggtc cccagctatg
7021 ggtttgtggt agtaaatgac acagatacac caccatccct ccgcatccga aagcctcgag
7081 cagtcggact agcaatattc ctgcttgtgc tggctatcat ggccatcaca tcctccttgg
7141 tggcagctac aacgctcgtg aaccagcaca cgacggctaa ggttgtggag agggttgtgc
7201 aaaatgtgtc atatattgct caaacccagg accaattcac ccacctgttc aggaatataa
7261 acaacagatt aaatgtccta caccatagag tttcatactt ggagtatgta gaggaaatca
7321 gacaaaaaca agtattcttt ggttgcaaac ctcatggaag gtattgccac tttgactttg
7381 gaccagagga agttggatgg aacaatagtt ggaatagcaa aacttggaat gatctacaag
7441 atgagtatga taagatagaa gaaaaaatat taaaaattcg agtggactgg ctcaatagct
7501 ccctgagtga cacacaggac acctttggcc tggagacctc tatttttgac catttagtgc
7561 aattgtttga ttggacttct tggaaagact ggataaaaat cattatagta atcattgtac
7621 tttggcttct gataaagatt ctcctaggta tgttaagaag ctgcgccaag gtcagccaga
7681 attaccaaca tctcccggcg gaggaggagg acggggacac agagccagaa agctccccgg
7741 cgagaggaga cccggcttct ggaagtctct acgagaattg gttgaacaaa ataggagaaa
7801 gcaagaacga cgcctatcgg gtctggacag aagaatacaa cagcttgagg atcttgttcg
7861 ccacatgtcg ctgggatctc ctgacccctc aactccttca gcttccgttc tttctgttaa
7921 ccctcctgct caaactcctt tgggacatct tccgccacgc tcctatttta aacttaaaag
7981 ggtggactgt ggggcagggt gggacctcag gacaacagca gcccccggac ttcccatatg
8041 tgaattggac tggatccagg gaacaaaata acccagaagg gggattagac tctggggctt
8101 ggtatgaagg cctgagaggt tctcagtaga ttgtaagtct tcggcgagac tgcatgtctg
8161 cacgtagaca ggaaatgttt atcttctcag ctgattgtgg ttaggccgat tactggaaac
8221 tagacaacct gattcattag tggttaagat tatgcataag tgctcgcaat gatgtagctg
8281 cttacgcttg cttactccgc cctgaaacgc ctaccttaac acgcaacacg cccacctgta
8341 agaatatata aaccatatct tcactctgta cttcagctcg tgtagctcat tagctccgag
8401 ctccccaacc tacagcctga gaggcactgg ctcggttggg tagccagcct ttcgggtaat
8461 aaaggcttgt tggcattcgg ca
//
LOCUS CEAVCG 9180 bp ss-RNA VRL 04-AUG-1990
DEFINITION Caprine arthritis encephalitis virus, complete proviral genome.
ACCESSION M33677
KEYWORDS complete genome.
SOURCE Caprine arthritis encephalitis virus (isolate Clements) proviral
DNA, infectious molecular clone.
ORGANISM Caprine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9180)
AUTHORS Saltarelli,M., Querat,G., Konings,D.A.M., Vigne,R. and
Clements,J.E.
TITLE Nucleotide sequence and transcriptional analysis of an infectious
clone of CAEV
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Kindly provided in computer-readable form prior to publication by
Dr. Janice Clements, Johns Hopkins University, Baltimore, MD., 10
April 1990. The annotation shown below is tentative.
FEATURES from to/span description
pept 512 1858 gag polyprotein
pept < 1717 5046 pol polyprotein (NH2-terminus uncertain)
pept 5006 5695 protein Q (put.)
pept 5688 5951 protein S (put.)
pept 6012 8837 env polyprotein
ORF 8405 8794 ORF 4
BASE COUNT 3505 a 1466 c 2301 g 1908 t
ORIGIN
1 gagttctagg agagtccctc ctagtctctc ctctccgagg aggtaccgag acctcaaaat
61 aaaggagtga ttgccttact gccgagtgga gagtgattac tgagcggccg gtgtatcggg
121 agtcgtccct taatctgtgc aataccagag cggctctcgc agctggcgcc caacgtgggg
181 cccgaggaga agaaaagaaa gcggccctga gaactcggct tctgaaaaag aggaagagga
241 caagttgcta tagcaacaag agagaagaag tagagcaaag gtccagtggc tcggaaaaag
301 aggaactgaa acttcgggga cgcctgaagg agtaaggtaa gtgactctgc tgtacgcggg
361 gcgaggcaga ggtttccttc taaattgaaa gagaagtgtt gctgcgagag gtcttggtgg
421 tcgagaatcc tgtacaaaaa aaaggaggga tctcggtcag gaccaggacc cctgggagta
481 atacaacagc aacaccgtaa gaaaatccgc catggtgagt ctagatagag acatggcgag
541 gcaagtctcc ggggggaaaa gagattatcc tgagctcgaa aaatgtatca agcatgcatg
601 caagataaaa gttcgactca gaggggagca cttgacagaa ggaaattgtt tatggtgcct
661 taaaacatta gattacatgt ttgaggacca taaagaggaa ccttggacaa aagtaaaatt
721 taggacaata tggcagaagg tgaagaatct aactcctgag gagagtaaca aaaaagactt
781 tatgtctttg caggccacat tagcgggtct aatgtgttgc caaatgggga tgagacctga
841 gacattgcaa gatgcaatgg ctacagtaat catgaaagat gggttactgg aacaagagga
901 aaagaaggaa gacaaaagag aaaaggaaga gagtgtcttc ccaatagtag tgcaagcagc
961 aggagggaga agctggaaag cagtagattc tgtaatgttc cagcaactgc aaacagtagc
1021 aatgcagcat ggcctcgtgt ctgaggactt tgaaaggcag ttggcatatt atgctactac
1081 ctggacaagt aaagacatac tagaagtatt ggccatgatg cctggaaata gagctcaaaa
1141 ggagttaatt caagggaaat taaatgaaga agcagaaagg tggagaagga ataatccacc
1201 acctccagca ggaggaggat taacagtgga tcaaattatg ggggtaggac aaacaaatca
1261 agcagcagca caagctaaca tggatcaggc aaggcaaata tgcctgcaat gggtaataaa
1321 tgcattaaga gcagtaagac atatggcgca caggccaggg aatccaatgc tagtaaagca
1381 aaaaacgaat gagccatatg aagattttgc agcaagactg ctagaagcaa tagatgcaga
1441 gccagttaca cagcctataa aagattatct aaagctaaca ctatcttata caaatgcatc
1501 agcagattgt cagaagcaaa tggatagaac actaggacaa agagtacaac aagctagtgt
1561 agaagaaaaa atgcaagcat gtagagatgt gggatcagaa gggttcaaaa tgcaattgtt
1621 agcacaagca ttaaggccag gaaaaggaaa agggaatgga cagccacaaa ggtgttacaa
1681 ctgtggaaaa ccgggacatc aagcaaggca atgtagacaa ggaatcatat gtcacaactg
1741 tggaaagaga ggacatatgc aaaaagaatg cagaggaaag agagacataa ggggaaaaca
1801 gcagggaaac gggaggaggg ggatacgtgt ggtgccgtcc gctcctccta tggaataact
1861 tcagcaccac ctatggttca ggtccgcata ggttcccagc agaggaactt gttatttgat
1921 accggggcgg accgaactat agttagatgg catgagggct cgggaaaccc agccggaagg
1981 ataaaactgc aaggaatagg aggaatagta gaaggagaaa aatggaataa tgtagaatta
2041 gaatataaag gagaaacaag aaagggaaca atagtagtgt taccacaaag tccagtagaa
2101 gtattaggac gagataacat ggcccgattt ggaataaaga taataatggc aaatttagag
2161 gaaaaaagaa tcccaattac aaaagtaaaa ttgaaagagg gatgtacggg tccacatgtc
2221 ccacaatggc cattaacaga agagaaatta aaaggtctaa cagaaatcat agataaatta
2281 gtggaagaag gaaaactagg aaaggcaccc ccacattgga catgtaatac tccaatcttt
2341 tgcataaaaa agaaatcagg gaagtggaga atgttaatag atttcagaga attgaacaaa
2401 cagacagaag atttaacaga agcgcagtta ggactcccgc atccgggagg actacaaaag
2461 aaaaaacatg ttacaatatt ggacatagga gatgcatatt ttactatacc cctatatgaa
2521 ccatatcgag agtacacatg ttttactcta ttaagtccta ataatctagg accatgtaaa
2581 agatactatt ggaaagtgct gccacaaggt tggaaattga gtccatctgt atatcaattt
2641 actatgcagg agatcttaga ggattggata cagcagcatc cagaaattca atttggcata
2701 tatatggatg atatttacat aggaagtgat ttagaaatta aaaagcatag agaaatagtg
2761 aaagatttag ccaattatat tgcccaatat ggattcactc tgccagaaga gaagagacaa
2821 aagggatatc cagcaaaatg gctaggattt gaactacacc cgcagacctg gaaatttcag
2881 aagcatacat tacctgaatt aacaaaggga acaataacat taaataaatt acagaaatta
2941 gtaggagaat tagtatggag acaatccata attgggaaaa gcattcctaa cattctgaaa
3001 ttaatggaag gagatagaga attacaaagt gaaagaaaaa ttgaagaagt acatgtgaaa
3061 gaatgggaag catgtaggaa aaaattagaa gaaatggaag gaaattatta taataaagac
3121 aaagatgtct atggacaatt ggcttgggga gacaaagcta tagaatatat agtgtatcag
3181 gagaaaggga aaccattatg ggtaaatgtg gttcacaata taaagaacct aagcatcccg
3241 caacaggtta ttaaagcagc gcaaaaatta acccaagaag tcatcattag gacaggaaaa
3301 ataccatgga tattgttgcc agggaaagaa gaagattgga gactagaatt gcaattaggg
3361 aacatcacat ggatgccaaa attttggtcc tgttatcgag gacatacaag atggagaaaa
3421 agaaatataa tagaagaagt agtagaaggg cctacatatt atacagatgg aggaaaaaag
3481 aataaagtag gaagtctagg gttcatagta tcaacagggg aaaaatttag aaagcatgaa
3541 gagggcacaa accagcaact agaattaaga gccatagagg aagctctaaa acaagggcct
3601 caaacaatga atttagtaac agatagtaga tatgcatttg aatttttatt aagaaattgg
3661 gatgaagaag taataaagaa tccaattcaa gcaagaatta tggaaattgc ccacaagaaa
3721 gataggatag gagtgcattg ggtgccagga cataaaggga ttccccaaaa tgaagaaata
3781 gacaaatata tttcggaaat atttcttgca aaagaaggag aaggaattct cccaaaaaga
3841 gaagaggatg cagggtatga tttaatatgc ccagaagagg ttaccataga gccaggacaa
3901 gtgaaatgca tccccataga gctaagatta aatttaaaga aatcacaatg ggctatgatt
3961 gctacaaaaa gcagcatggc tgccaaagga gtgttcacac aaggaggaat catagactca
4021 ggatatcagg gacaaataca ggtaataatg tataatagca ataaaatagc agtagtcata
4081 ccccaaggga gaaaatttgc acaattaata ttaatggata aaaagcatgg aaaattggaa
4141 ccctgggggg aaagcagaaa aacagaaagg ggagaaaaag gatttgggtc tacaggaatg
4201 tattggatag aaaatattcc tctggcagag gaagaccaca caaaatggca tcaagatgcc
4261 cgatcattgc atctagaatt tgaaattcca agaacagcag cagaagacat agtaaatcaa
4321 tgtgaaatat gcaaagaagc gaggacacct gcagtaatta gaggcggaaa caaaaggggg
4381 gtaaatcatt ggcaagtgga ttatacccat tatgaaaata tcatactatt agtatgggta
4441 gaaacaaatt caggactaat atatgcagaa aaagtaaaag gagaatcagg gcaagaattc
4501 agaataaaag tgatgcattg gtatgcatta tttggtccag agtcattgca gtcagacaat
4561 ggacctgcat ttgcagcaga gcccacacag ctgttaatgc aatacctagg agtaaaacac
4621 acaacaggca taccttggaa tccacagtct caggctatag tagaaagggc acatcaacta
4681 ttgaaaagca ctttaaagaa gttccagcca caatttgtcg ctgtagaatc agccatagca
4741 gcagccctag tcgccataaa tataaaaaga aagggtgggc tggggacaag ccctatggat
4801 atttttatat ataataaaga acagaaaaga ataaataata aatataataa aaattctcaa
4861 aaaattcaat tctgttatta cagaataagg aaaagaggac atcaggagag tggaaaggac
4921 caacccaggt actgtggaaa ggggaaggag ccaattgtgg taaaggatat agaaagtgaa
4981 aagtatttag taatacctta caaagatgca aaattcatcc cgccaccaac aaaagaaaag
5041 gaataaaaaa cctggaccag aattaccctt agcactatgg atacatatag cagaaagcat
5101 taatggggat agctcatggt acataacaat gagactgcaa cagatgatgt ggggaaaaag
5161 aggaaataag ttacaatata agaatgaaga cagggaatat gaaaattggg aaattacatc
5221 atggggatgg aaaatgcacc taaggagagt gaaacaatgg atacaagaca acaggagagg
5281 aagcccatgg cagtacaaag taggaggaac atggaaaagt ataggagtgt ggttcctgca
5341 agcaggagat tacagaaagg tagacaggca cttctggtgg gcatggagga tactgatatg
5401 ttcctgcagg aaagaaaagt ttgatataag agaatttatg agaggaagac atagatggga
5461 tttgtgcaaa tcctgtgctc aaggagaagt agtaaagcat actagaacaa aaagtctgga
5521 aagactagta ctgctacaga tggtagaaca gcatgtgttt caagtattgc cattgtggag
5581 agccaggaga agtagtacaa cagatttccc atggtgcagg gacacaacgg gatacacgca
5641 tgcgtggtct gtccaggagt gctggttgat ggaatatctc ttagaggatg agtgaagaac
5701 tgcctcaaag aagggagaca catccagaag aacttgtaag gaacgtacgg gaaagagaaa
5761 gggatacatg gcaatggaca agcatcagag tacctgcgga aatactgcaa agatggcttg
5821 ctatgcttag gtcaggcaga aatagaaaga aagtgtatag agaaatgcaa aaatggatgt
5881 ggatacatcc caaggcgcct gtgattaggg cctgtggatg cagactatgt aacccggggt
5941 ggggaacata atcaagggaa taataaatgc aaataaatgt aactaacaag tagcaaaagt
6001 gtctgtgtta gatggatgct ggggccagat acatgcgctt aactgggaag gaaaactggg
6061 ttgaagtaac catggacgga gagaaggaaa ggaaaagaga aggtttcact gcgggacagc
6121 aaggtaagta tcaaccccag gtaagtaagc aaatagggaa cagaaatact aacccatgct
6181 ttgcctataa agggatattc ctatggagga tatcactaac aatgtggata ttgctaggga
6241 taaatatgtg tgtcagtgca gaggattaca taacactaat atcagatccc tatgggttct
6301 cacccataaa aaatgtgtct ggggtaccag tgacttgtgt aacaaaagaa ttcgcaaaat
6361 ggggatgtca accactagga gcgtaccctg atccagaaat agaatacaga aatgtgagtc
6421 aggaagtagt gaaagaagta tatcaagaga attggccatg gaatacatat cattggcctc
6481 tctggcaaat ggagaatgtt aggtactggt taaaagaaaa tatgcaagaa aatcaacaga
6541 gaaaaaataa tacaaaagag ggtatagagg aattattagc aggaactata aggggaagat
6601 tctgtgtacc atacccattt gccttgttaa aatgcacaaa gtggtgctgg tatacagcgg
6661 ccataaacaa cgagtcagga aaagcaggaa aaataaaaat aaattgcaca gaagcaagag
6721 cagtctcctg tacagaggac atgccattag cctcaataca aagagcatat tgggatgaga
6781 aagacagaga gagcatggcc tttatgaata tcaaagcatg tgatagcaac ctaaggtgtc
6841 agaaaagacc tggagggtgt atggaaggat accctatccc agtaggagca gaaataatcc
6901 ctgaaagtat gaaataccta aggggagcaa agagtcagta tgggggaata aaagataaga
6961 atggagaatt aaaattacca ttaacattaa gagtgtgggt aaaattagca aatgtgtcag
7021 aatgggtaaa tgggacaccc ccggattggc aagacagaat taacggatcc aaaggaataa
7081 atgggacgct ctggggagag cttaacagta tgcatcacct aggatttgcc cttagccaga
7141 acggcaaatg gtgtaactac accggggaaa taaaattagg gcaagaaaca ttccaatatc
7201 attacaagcc aaactggaac tgtaccggga attggacgca atatccggtg tggcaagtga
7261 ttagaaacct ggatatggtg gaacatatga caggagaatg tgtgcagaga ccacaaaggc
7321 acaatataac agtaggaaat ggaaccataa cagggaattg cagtacaaca aactgggatg
7381 gatgtaattg ctcacgatca ggaaactacc tatataacag ctctgaggga ggattgttat
7441 taattctgtg cagacaaaac agcaccctaa caaggatcct gggaacaaat acaaattgga
7501 caactatgtg gggaatatac aaaaattgtt caggatgcga gaatgcaaca ttagacaaca
7561 caggagaagg aaccttagga ggtgtagcta ataagaactg tagcttgcct cataaaaatg
7621 agagcaacaa gtggacttgt gccccaagac aaagagatgg aaaaacagat tcgctataca
7681 tagcaggagg aaaaaagttt tggacacgaa ttaaggccca attcagctgt gaaagtaaca
7741 taggacaatt agatggaatg ttgcatcagc aaatactatt gcaaaaatat caagtaatta
7801 aggtaagagc ttatacatat ggggtgatag aaatgccaga aaactatgca aaaacaagaa
7861 tcataaacag gaaaaaaaga gaactcagcc acaagaggaa gaagagaggc gttggcttgg
7921 tcattatgct agttatcatg gcaatagtag ctgccgcagg ggcttctctg ggagtcgcaa
7981 acgcgattca gcagtcttac actaaggcag ctgtccagac ccttgctaat gcaactgctg
8041 cacagcagga tgtgttagag gcaacctatg ccatggtaca gcatgtggct aaaggcgtac
8101 gaatcttgga agctcgagtg gaagctatca cagatagaat aatgctatac caagaattgg
8161 attgttggca ctatcatcaa tactgtataa cctctacaaa aacagaagta gcaaaatata
8221 tcaattggac gaggtttaag gataattgca catggcagca gtgggagaga ggattacagg
8281 ggtatgatac aaacttaaca atactgttaa aggaatcagc agcaatgaca caactagcag
8341 aagagcaagc aaggaggata ccagaagtat gggaaagttt aaaagacgtc tttgattggt
8401 caggatggtt ctcatggcta aagtatattc ctattatagt agtaggatta ttaggatgca
8461 ttctgataag agctgtgata tgtgtatgtc aacctcttgt gcagatatac agaactctaa
8521 gtaccccgac ataccaacgg gtcacagtca tcatggaaac aagagcagac gtcgcaggag
8581 aaaatcagga ttttggcgat ggcttagagg aatcagacaa cagcgaaaca agcgaaagag
8641 tgacagtaca gaaagcttgg agccgtgcct gggagctttg gcagaactca ccctggaagg
8701 agccatggaa aaggggcctg ctgaggctgc tcgtccttcc gctgacgatg ggaatctgga
8761 taaatggatg gcttggagaa caccacaaaa ataaaaaaag aaagggtgac tgtgagacat
8821 gggctaaaga ggactaataa caagctaggc caaattcctg taaatcactt ggggggttat
8881 aagaaaagca agttcactat gacaaagcaa aatgtaaagg ccaaattcct gtaaatcact
8941 tggggggtta taagaaaagc aagttcacta tgacaaagca aaatgtaacc gcaagtgctg
9001 acagatgtaa cagctgacat atcagctgat gcttgctcat gctgacactg tagctctgag
9061 ctgtatataa ggagaagctt gctgcttgca cttcagagtt ctaggagagt ccctcctagt
9121 ctctcctctc cgaggaggta ccgagacctc aaaataaagg agtgattgcc ttactgccga
//
LOCUS CEAVLTRM 522 bp ss-RNA VRL 04-AUG-1990
DEFINITION Caprine arthritis encephalitis virus proviral long terminal repeat.
ACCESSION M14149
KEYWORDS long terminal repeat.
SOURCE Caprine arthritis encephalitis virus (CAEV) unintegrated proviral
DNA.
ORGANISM Caprine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 522)
AUTHORS Hess,J.L., Pyper,J.M. and Clements,J.E.
TITLE Nucleotide sequence and transcriptional activity of the caprine
arthritis-encephalitis virus long terminal repeat
JOURNAL J. Virol. 60, 385-393 (1986)
STANDARD full staff_review
COMMENT [1] presents the LTR for CAEV in a fashion that includes sequence
data downstream of the 5' LTR (bases 499-522) and sequence data
upstream of the 3' LTR (bases 1-48). For this reason, two
"pre-msg" entries are found in the FEATURES table, one for the
5'end of the unspliced transcript and the other for the 3' end of
the unspliced transcript. Similarly, both the TATA box and the
polyadenylation signal are annotated. The CAEV LTR and the Visna
LTR are approximately 50% homologous. [1] did not find a
significant level of homology of CAEV LTR with any other retroviral
LTR. [1] showed that, unlike Visna and HIV, CAEV LTR is not
transactivated. Plasmids constructed with CAEV LTR and CAT showed
no evidence of transactivated up-regulation in four cell types
(SCP, GSM, SAM and primary goat macrophages). This may explain
CAEV's lack of cytotoxicity.
FEATURES from to/span description
pre-msg < 1 420 genomic mRNA, 3' end (see COMMENT)
pre-msg 336 > 522 genomic mRNA, 5' end (see COMMENT)
site 336 336 mRNA cap site
LTR 49 498 long terminal repeat
rpt 87 157 71 bp direct repeat, 5' copy
rpt 158 228 71 bp direct repeat, 3' copy
rpt 336 420 R repeat
binding 499 515 Lys-tRNA primer binding site
signal 306 310 TATA box
signal 393 398 poly-A signal
BASE COUNT 163 a 108 c 137 g 114 t
ORIGIN 84 bp upstream of AluI site.
1 aaatggatgg cttggagaac accacaaaaa taaaaaaaga aagggtggac tgtgagacat
61 gggctaaaga ggactaataa caagctaggc caaattcctg taaatcactt ggggggttat
121 aagaaaagca agttcactat gacaaagcaa aatgtaaagg ccaaattcct gtaaatcact
181 tggggggtta taagaaaagc aagttcacta tgacaaagca aaatgtaacc gcaagtgctg
241 acagatgtaa cagctgacat atcagctgat gcttgctcat gctgacactg tagctctgag
301 ctgtatataa ggagaagctt gctgcttgca cttcagagtt ctaggagagt ccctcctagt
361 ctctcctctc cgaggaggta ccgagacctc aaaataaagg agtgattgcc ttactgccga
421 gtggagagtg attactgagc ggccggtgta tcgggagtcg tcccttaatc tgtgcaatac
481 cagagcggct ctcgcagctg gcgcccaacg tggggcccga gg
//
LOCUS CEAVPOL 651 bp ss-RNA VRL 04-AUG-1990
DEFINITION Caprine arthritis encephalitis virus proviral pol gene, partial
cds.
ACCESSION K03327
KEYWORDS .
SOURCE Caprine arthritis encephalitis virus proviral DNA.
ORGANISM Caprine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (sites; for [2])
AUTHORS Chiu,I.-M., Yaniv,A., Dahlberg,J.E., Gazit,A., Skuatz,S.F.,
Tronick,S.R. and Aaronson,A.
TITLE Nucleotide sequence evidence for relationship of AIDS retrovirus to
lentiviruses
JOURNAL Nature 317, 366-368 (1985)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 651)
AUTHORS Chiu,I.-M.
JOURNAL Unpublished (1985) Ohio State U. Medical Sch., Columbus, Ohio
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
I.-M.Chiu, 09-OCT-1985.
FEATURES from to/span description
pept < 1 > 651 pol protein (reverse transcriptase; AA at 1)
BASE COUNT 269 a 102 c 122 g 158 t
ORIGIN 434 bp upstream of BglII site
1 gtcccacaat ggccattgac agaagaaaaa ttaaaaggtc tatcagaaat agtagataaa
61 ttagtaaaaa aaaaaaaact aggaaaggca cccccacatt ggacatgtaa tactccaata
121 ttttgcataa aaaagaaatc aggaaaatgg aaaatgttaa tagatttcag aaaattaaac
181 aaacaaacag aggatttaac agaagcgcaa ttaggacttc cgcatccggg gggactacaa
241 aagaaaaaac atgttacaat attagacata ggagatgcat attttactat acccctatat
301 aaaccatatc gagagtacac ttgctttact ctattaagtc ctaataatct aggaccatgt
361 aaaagatatt attggaaggt tctaccacaa ggttggaaac tgagtccatc tgtatatcaa
421 tttaccatgc aggagatctt aggggaatgg atacaggagc atcctgaaat tcaatttaga
481 atatatatgg atgatatcta cataagaagt gatttagaaa taaaaaagca tagagagata
541 gtggaggaat tagccaatta tattgcccaa tatagattca ctctgccaga agagaaaagg
601 caagagagat atccagcaaa atggttaggg tatgaactac atcctcaggc c
//
LOCUS EAEVWU5 2580 bp ss-RNA VRL 04-AUG-1990
DEFINITION Equine arthritis encephalitis virus, envelope gene, complete cds.
ACCESSION X16988
KEYWORDS envelope protein.
SOURCE Equine arthritis encephalitis virus, clone pWU5.
ORGANISM Equine arthritis encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 2580)
AUTHORS McGuire,T.C., Lacy,P.A. and O'Rourke,K.
TITLE cDNA sequence of the env gene of a pathogenic equine infectious
anemia lentivirus variant
JOURNAL Nucleic Acids Res. 18, 196 (1990)
STANDARD full staff_review
COMMENT Kindly submitted in computer-readable form by K. O'Rourke,
Washington State University, Pullman WA.
FEATURES from to/span description
pept 1 2580 env polyprotein
BASE COUNT 891 a 428 c 565 g 696 t
ORIGIN Start of env cds.
1 atggtcagca tcgcattcta tggggggatc ccagggggaa tctcaacccc tattacccaa
61 cagtcagaaa aatccaagta tgaggaaaac acaatgtttc aaccttattg ttataataat
121 gacagtaaga acagcatggc agaatcgaag gaagcaagag accaagaaat gaacctgaaa
181 gaagaatcta aagaagaaaa aagaagaaat gactggtgga aaataggtat gtttctgtta
241 tgcttagcag gaactactgg aggaatactt tggtggtatg aaggactccc acagcaacat
301 tatatagggt tggtagcgat agggggaaga ttaaacggat ctggccaatc aaatgctata
361 gaatgctggg gttccttccc ggggtgtaga ccatttcaaa attacttcag ttatgagacc
421 aatagaagca tgcatatgga taataatact gctacattat tagaagctta tcatagagag
481 ataacattca tttataagtc ttcttgcaca gatagtgatc attgtcaaga gtatcaatgt
541 aaaaaagtta atcttaattc ctctgactcc tctaactctg tacgtgttga ggatgtaacg
601 aacacagcgg aatattgggg atttaaatgg ctagaatgta atcaaacaga aaattttaag
661 actatattag tacctgaaaa tgaaatggta aatatcaatg atactgatac ctggatacct
721 aaggggtgta atgagacgtg ggcaagagtg aaacgttgtc ctatagatat tttatatggg
781 atacatccaa tcaggctgtg tgtacagcca ccattttttc tggtacagga gaaagggatt
841 gctgatactt ctagaattgg caattgtggg ccaacaatat ttcttggggt tttagaagat
901 aataagggag tagtacgggg ggattataca gcctgcaatg tgagtcgcct aaatataaat
961 agaaaggatt atacagggat ctatcaagta cctatatttt atacatgtac tttcactaac
1021 ataacttcct gcaataatga gccaataatc agtgttatca tgtatgaaac aaaccaggta
1081 caatatttat tgtgtaataa taataatagt aataattata attgtgtagt acaaagtttt
1141 ggagttatag gacaggcaca cttagaactg cctagaccta acaaaagaat aaggaatcaa
1201 agctttaacc aatataactg ctctataaat aacaaaacag aattagaaac atggaagtta
1261 gtaaagactt ctggcataac tcctttacct atttcttctg aagctaacac tggactaatt
1321 agacataaga gagattttgg tataagtgca atagtggcag ctattgtagc cgctactgct
1381 attgctgcta gcgctactat gtcttatgtt gctctaactg aggttaacaa aataatggaa
1441 gtacaaaatc atacttttga ggtagaaaat agtactctaa atggtatgga tttaatagaa
1501 cgacaaataa agatattata tgctatgatt cttcaaacac atgcagatgt tcaactgtta
1561 aaggaaagac aacaggtaga ggagacattt aatttaattg gatgtataga aagaacacat
1621 gtattttgtc atactggtca tccctggaat atgtcatggg gacatttaaa tgagtcaaca
1681 caatgggatg actgggtaag caaaatggaa gatttaaatc aagagatact aactacactt
1741 catggagcca ggaacaattt ggcacaatcc atgataacat tcaatacacc agatagtata
1801 gctcaatttg gaaaagacct ttggagtcat attggaaatt ggattcctgg attgggagct
1861 tccattataa aatatatagt gatgtttttg cttatttatt tgttactaac ctcttcgcct
1921 aagatcctca gggccctctg gaaagtgacc agtggtgcag ggtcctccgg cagtcgttac
1981 ctgaagaaaa aattccatca caaacatgca tcacgagaag acacctggga ccaggcccaa
2041 cacaacatac acctagcagg cgtgaccggt ggatcagggg acaaatacta caagcagaag
2101 tactccagga acgactggaa tggagaatca gaggagtaca acaggcggcc aaagagctgg
2161 gtgaagtcaa tcgaggcatt tggagagagc tatatttccg agaagaccaa aggggagatt
2221 tctcagcctg gggcggctat caacgagcac aagaacggct ctggggggaa caatcctcac
2281 caagggtcct tagacctgga gattcgaagc gaaggaggaa acatttatga ctgttgcatt
2341 aaagcccaag aaggaactct cgctatccct tgctgtggat ttcccttatg gctattttgg
2401 ggactagtaa ttatagtagg acgcatagca ggctatggat tacgtggact cgctgttata
2461 ataaggattt gtactagagg cttaaatttg atatttgaaa taatcagaaa aatgcttgat
2521 tatattggaa gagctttaaa tcctggcaca tctcatgtat caatgcctca gtatgtttag
//
LOCUS EIAV 8344 bp ss-RNA VRL 04-AUG-1990
DEFINITION Equine infectious anemia virus (EIAV), complete proviral genome.
ACCESSION M16575 M11337 K03334 M14855
KEYWORDS complete genome.
SOURCE Equine infectious anemia virus proviral DNA, clones 1369 and 409-2
[1].
ORGANISM Equine infectious anemia virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 8229)
AUTHORS Kawakami,T., Sherman,L., Dahlberg,J., Gazit,A., Yaniv,A.,
Tronick,S.R. and Aaronson,S.A.
TITLE Nucleotide sequence analysis of equine infectious anemia virus
proviral DNA
JOURNAL Virology 158, 300-312 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 324 to 5122)
AUTHORS Stephens,R.M., Casey,J.W. and Rice,N.R.
TITLE Equine infectious anemia virus gag and pol genes: Relatedness to
visna and AIDS virus
JOURNAL Science 231, 589-594 (1986)
STANDARD full staff_review
REFERENCE 3 (bases 2300 to 2950)
AUTHORS Chiu,I.-M.
JOURNAL Unpublished (1985) Meloy Labs Inc, Springfield, VA 22151
STANDARD full staff_review
REFERENCE 4 (sites; pept)
AUTHORS Chiu,I.-M., Yaniv,A., Dahlberg,J.E., Gazit,A., Skuatz,S.F.,
Tronick,S.R. and Aaronson,A.
TITLE Nucleotide sequence evidence for relationship of AIDS retrovirus to
lentiviruses
JOURNAL Nature 317, 366-368 (1985)
STANDARD full staff_review
REFERENCE 5 (bases 4769 to 8048; 8069 to 8344)
AUTHORS Payne,S.L.
JOURNAL Unpublished (1987) Dept Biochem, Louisiana St Univ, Baton Rouge, LA
STANDARD full staff_review
REFERENCE 6 (bases 4769 to 8048; 8069 to 8228)
AUTHORS Rushlow,K., Olsen,K., Stiegler,G., Payne,S.L., Montelaro,R.C. and
Issel,C.J.
TITLE Lentivirus genomic organization: The complete nucleotide sequence
of the env gene region of equine infectious anemia virus
JOURNAL Virology 155, 309-321 (1986)
STANDARD full staff_review
REFERENCE 7 (sites; correction of 5348 [1] personal communication)
AUTHORS Tronick,S.R. and Fuller,F.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Correction of [1] by [7] leads to agreement with [6] regarding the
env cds. The sequence starting with coordinate 8230 is horse
cellular DNA.
FEATURES from to/span description
pept 465 1925 gag polyprotein (put.)
pept < 1682 5122 pol polyprotein (AA at 1682; NH2-terminus
uncertain)
pept 5312 7891 envelope polyprotein
pept < 7234 7641 3' ORF protein (AA at 7233; NH2-terminus
uncertain)
pre-msg 209 8193 EIAV genomic RNA
cell 8230 8344 horse cellular DNA
binding 324 342 primer (Lys-tRNA) binding site
variant 342 342 c in [1], t in [2]
variant 2341 2341 t in [1], c in [2],[3]
variant 2349 2349 t in [1], c in [2]
variant 2436 2436 g in [1], a in [3]
variant 2677 2677 t in [1], a in [2]
variant 2865 2870 tggaag in [1], agtaaa in [2]
variant 2870 2870 g in [1], a in [3]
variant 3777 3777 g in [1], a in [2]
variant 3858 3858 g in [1], a in [2]
variant 4529 4530 ac in [1], ca in [2]
variant 4573 4573 a in [1], g in [2]
revision 5348 5348 gg in [6],[7]; [g] in [1]
variant 6348 6348 a in [1], g in [5],[6]
variant 6350 6350 g in [1], a in [5],[6]
variant 7060 7060 c in [1], t in [5],[6]
variant 8048 8069 gagttccgcatttgtgacgcgt in [1], gt in [5],[6]
variant 8073 8073 g in [1], c in [5],[6]
variant 8098 8098 a in [1], g in [5],[6]
variant 8104 8104 g in [1], a in [5],[6]
variant 8118 8119 ga in [1], gca in [5],[6]
mRNA 210 8193 EIAV genomic mRNA
BASE COUNT 2984 a 1358 c 1846 g 2156 t
ORIGIN
1 tgtggggttt ttatgagggg ttttataaat gattataaga gtaaaaagaa agttgctgat
61 gctctcataa ccttgtataa cccaaaggac tagctcatgt tgctaggcaa ctaaaccgca
121 ataaccgcat ttgtgacgcg agttccgcat ttgtgacgcg ttaagttcct gtttttacag
181 tatataagta cttgtgttct gacaattggg actcagattc tgcggtctga gtcccttctc
241 tgctgggctg aaaaggcctt tgtaataaat ataattctct actcagtccc tgtctctagt
301 ttgtctgttc gagatcctac agttggcgcc cgaacaggga cctgagaggg gcgcagaccc
361 tacctgttga acctggctga tcgtaggatc cccgggacag cagaggagaa cttacagaag
421 tcttctggag gtgttcctgg ccagaacaca ggaggacagg taagatggga gaccctttga
481 catggagcaa ggcgctcaag aagttagaga aggtgacggt acaagggtct cagaaattaa
541 ctactggtaa ctgtaattgg gcgctaagtc tagtagactt atttcatgat accaactttg
601 taaaagaaaa ggactggcag ctgagggatg tcattccatt gctggaagat gtaactcaga
661 cgctgtcagg acaagaaaga gaggcctttg aaagaacatg gtgggcaatt tctgctgtaa
721 agatgggcct ccagattaat aatgtagtag atggaaaggc atcattccag ctcctaagag
781 cgaaatatga aaagaagact gctaataaaa agcagtctga gccctctgaa gaatatccaa
841 tcatgataga tggggctgga aacagaaatt ttagacctct aacacctaga ggatatacta
901 cttgggtgaa taccatacag acaaatggtc tattaaatga agctagtcaa aacttatttg
961 ggatattatc agtagactgt acttctgaag aaatgaatgc atttttggat gtggtacctg
1021 gccaggcagg acaaaagcag atattacttg atgcaattga taaaatagca gatgattggg
1081 ataatagaca tccattaccg aatgctccac tggtggcacc accacaaggg cctattccca
1141 tgacagcaag gtttattaga ggtttaggag tacctagaga aagacagatg gagcctgctt
1201 ttgatcagtt taggcagaca tatagacaat ggataataga agccatgtca gaaggcatca
1261 aagtgatgat tggaaaacct aaagctcaaa atattaggca aggagctaag gaaccttacc
1321 cagaatttgt agacagacta ttatcccaaa taaaaagtga gggacatcca caagagattt
1381 caaaattctt gactgataca ctgactattc agaacgcaaa tgaggaatgt agaaatgcta
1441 tgagacattt aagaccagag gatacattag aagagaaaat gtatgcttgc agagacattg
1501 gaactacaaa acaaaagatg atgttattgg caaaagcact tcagactggt cttgcgggcc
1561 catttaaagg tggagccttg aaaggagggc cactaaaggc agcacaaaca tgttataact
1621 gtgggaagcc aggacattta tctagtcaat gtagagcacc taaagtctgt tttaaatgta
1681 aacagcctgg acatttctca aagcaatgca gaagtgttcc aaaaaacggg aagcaagggg
1741 ctcaagggag gccccagaaa caaactttcc cgatacaaca gaagagtcag cacaacaaat
1801 ctgttgtaca agagactcct cagactcaaa atctgtaccc agatctgagc gaaataaaaa
1861 aggaatacaa tgtcaaggag aaggatcaag tagaggatct caacctggac agtttgtggg
1921 agtaacatat aatctagaga aaaggcctac tacaatagta ttaattaatg atactccctt
1981 aaatgtactg ttagacacag gagcagatac ttcagtgttg actactgcac attataatag
2041 gttaaaatat agagggagaa aatatcaagg gacgggaata ataggagtgg gaggaaatgt
2101 ggaaacattt tctacgcctg tgactataaa gaaaaagggt agacacatta agacaagaat
2161 gctagtggca gatattccag tgactatttt gggacgagat attcttcagg acttaggtgc
2221 aaaattggtt ttggcacagc tctccaagga aataaaattt agaaaaatag agttaaaaga
2281 gggcacaatg gggccaaaaa ttcctcaatg gccactcact aaggagaaac tagaaggggc
2341 taaagagata gtccaaagac tattgtcaga gggaaaaata tcagaagcta gtgacaataa
2401 tccttataat tcacccatat ttgtaataaa aaagaggtct ggcaaatgga ggttattaca
2461 agatctgaga gaattaaaca aaacagtaca agtaggaacg gaaatatcca gaggattgcc
2521 tcacccggga ggattaatta aatgtaaaca catgactgta ttagatattg gagatgcata
2581 tttcactata cccttagatc cagagtttag accatataca gctttcacta ttccctccat
2641 taatcatcaa gaaccagata aaagatatgt gtggaattgt ttaccacaag gattcgtgtt
2701 gagcccatat atatatcaga aaacattaca ggaaatttta caacctttta gggaaagata
2761 tcctgaagta caattgtatc aatatatgga tgatttgttc gtgggaagta atggttctaa
2821 aaaacaacac aaagagttaa tcatagaatt aagggcaatc ttactggaag agggttttga
2881 gacaccagat gataaattac aagaagtgcc accttatagc tggctaggtt atcaactttg
2941 tcctgaaaat tggaaagtac aaaaaatgca attagacatg gtaaagaatc caacccttaa
3001 tgatgtgcaa aaattaatgg ggaatataac atggatgagc tcaggggtcc cagggttgac
3061 agtaaaacac atagcagcta ctactaaggg atgtttagag ttgaatcaaa aagtaatttg
3121 gacggaagag gcacaaaaag agttagaaga aaataatgag aagattaaaa atgctcaagg
3181 gttacaatat tataatccag aagaagaaat gttatgtgag gttgaaatta caaaaaatta
3241 tgaggcaact tatgttataa aacaatcaca aggaatccta tgggcaggta aaaagattat
3301 gaaggctaat aagggatggt caacagtaaa aaatttaatg ttactgttgc aacatgtggc
3361 aacagaaagt attactagag taggaaaatg tccaacgttt aaggtaccat ttaccaaaga
3421 gcaagtaatg tgggaaatgc aaaaaggatg gtattattct tggctcccag aaatagtata
3481 tacacatcaa gtagttcatg atgattggag aatgaaattg gtagaagaac ctacatcagg
3541 aataacaata tacactgatg ggggaaaaca aaatggagaa ggaatagcag cttatgtgac
3601 cagtaatggg agaactaaac agaaaaggtt aggacctgtc actcatcaag ttgctgaaag
3661 aatggcaata caaatggcat tagaggatac cagagataaa caagtaaata tagtaactga
3721 tagttattat tgttggaaaa atattacaga aggattaggt ttagaaggac cacaaagtcc
3781 ttggtggcct ataatacaaa atatacgaga aaaagagata gtttattttg cttgggtacc
3841 tggtcacaaa gggatatgtg gtaatcaatt ggcagatgaa gccgcaaaaa taaaagaaga
3901 aatcatgcta gcataccaag gcacacaaat taaagagaaa agagatgaag atgcagggtt
3961 tgacttatgt gttccttatg acatcatgat acctgtatct gacacaaaaa tcatacccac
4021 agatgtaaaa attcaagttc ctcctaatag ctttggatgg gtcactggga aatcatcaat
4081 ggcaaaacag gggttattaa ttaatggagg aataattgat gaaggatata caggagaaat
4141 acaagtgata tgtactaata ttggaaaaag taatattaaa ttaatagagg gacaaaaatt
4201 tgcacaatta attatactac agcatcactc aaattccaga cagccttggg atgaaaataa
4261 aatatctcag agaggggata aaggatttgg aagtacagga gtattctggg tagaaaatat
4321 tcaggaagca caagatgaac atgagaattg gcatacatca ccaaagatat tggcaagaaa
4381 ttataagata ccattgactg tagcaaaaca gataactcaa gaatgtcctc attgcactaa
4441 gcaaggatca ggacctgcag gttgtgtcat gagatctcct aatcattggc aggcagattg
4501 cacacatttg gacaataaga taatattgac ttttgtagag tcaaattcag gatacataca
4561 tgctacatta ttatcaaaag aaaatgcatt atgtacttca ttggctattt tagaatgggc
4621 aagattgttt tcaccaaagt ccttacacac agataacggc actaattttg tggcagaacc
4681 agttgtaaat ttgttgaagt tcctaaagat agcacatacc acaggaatac catatcatcc
4741 agaaagtcag ggtattgtag aaagggcaaa taggaccttg aaagagaaga ttcaaagtca
4801 tagagacaac actcaaacac tggaggcagc tttacaactt gctctcatta cttgtaacaa
4861 agggagggaa agtatgggag gacagacacc atgggaagta tttatcacta atcaagcaca
4921 agtaatacat gagaaacttt tactacagca agcacaatcc tccaaaaaat tttgttttta
4981 caaaatccct ggtgaacatg attggaaggg acctactagg gtgctgtgga agggtgatgg
5041 tgcagtagta gttaatgatg aaggaaaggg aataattgct gtaccattaa ccaggactaa
5101 gttactaata aagccaaatt gagtattgtt gcaggaagca agacccaact accattgtca
5161 gctgtgtttc ctgaggtctc taggaattga ttacctcgat gcttcattaa ggaagaagaa
5221 taaacaaaga ctgaaggcaa tccaacaagg aagacaacct caatatttgt tataaggttt
5281 gatatatggg attatttggt aaaggggtaa catggtcagc atcgcattct atggggggat
5341 cccaggggga atctcaaccc ctattaccca acagtcagaa aaatctaagt gtgaggagaa
5401 cacaatgttt caaccttatt gttataataa tgacagtaag aacagcatgg cagaatcgaa
5461 ggaagcaaga gaccaagaaa tgaacctgaa agaagaatct aaagaagaaa aaagaagaaa
5521 tgactggtgg aaaataggta tgtttctgtt atgcttagca ggaactactg gaggaatact
5581 ttggtggtat gaaggactcc cacagcaaca ttatataggg ttggtggcga tagggggaag
5641 attaaacgga tctggccaat caaatgctat agaatgctgg ggttccttcc cggggtgtag
5701 accatttcaa aattacttca gttatgagac caatagaagc atgcatatgg ataataatac
5761 tgctacatta ttagaagctt atcatagaga gataacattc atttataagt cttcttgcac
5821 agatagtgat cattgtcaag agtatcaatg taaaaaagtt aatcttaatt cctctgactc
5881 ctctaactct gtacgtgttg aggatgtaac gaacacagcg gaatattggg gatttaaatg
5941 gctagaatgt aatcaaacag aaaattttaa gactatatta gtacctgaaa atgaaatggt
6001 aaatatcaat gatactgata cctggatacc taaggggtgt aatgagacgt gggcaagagt
6061 gaaacgttgt cctatagata ttttatatgg gatacatcca atcaggctgt gtgtacagcc
6121 accatttttt ctggtacagg agaaagggat tgctgatact tctagaattg gcaattgtgg
6181 gccaacaata tttcttgggg ttttagaaga taataaggga gtagtacggg gggattatac
6241 agcttgcaat gtgcgtcgcc taaatataaa tagaaaggat tatacaggga tctatcaagt
6301 acctatattt tatacatgta ctttcactaa cataacttcc tgcaataatg agccaataat
6361 cagtgttatc atgtatgaaa caaaccaggt acaatattta ttgtgtaata ataataatag
6421 taataattat aattgtgtag tacaaagttt tggagttata ggacaggcac acttagaact
6481 gcctagacct aacaaaagaa taaggaacca aagctttaac caatataact gctctataaa
6541 taacaaaaca gaattagaaa catggaagtt agtaaagact tctggcgtaa ctcctttacc
6601 tatttcttct gaagctaaca ctggactaat tagacataag agagattttg gtataagtgc
6661 aatagtggca gctattgtag ccgctactgc tattgctgct agcgctacta tgtcttatgt
6721 tgctctaact gaggttaaca aaataatgga agtacaaaat catacttttg aggtagaaaa
6781 tagtactcta aatggtatgg atttaataga acgacaaata aagatattat atgctatgat
6841 tcttcaaaca catgcagatg ttcaactgtt aaaggaaaga caacaggtag aggagacatt
6901 taatttaatt ggatgtatag aaagaacaca tgtattttgt catactggtc atccctggaa
6961 tatgtcatgg ggacatttaa atgagtcaac acaatgggat gactgggtaa gcaaaatgga
7021 agatttaaat caagagatac taactacact tcatggagcc aggaacaatt tggcacaatc
7081 catgataaca ttcaatacac cagatagtat agctcaattt ggaaaagacc tttggagtca
7141 tattggaaat tggattcctg gattgggagc ttccattata aaatatatag tgatgttttt
7201 gcttatttat ttgttactaa cctcttcgcc taagatcctc agggccctct ggaaagtgac
7261 cagtggtgca gggtcctccg gcagtcgtta cctgaagaaa aaattccatc acaaacatgc
7321 atcgcgagaa gacacctggg accaggccca acacaacata cacctagcag gcgtgaccgg
7381 tggatcaggg gacaaatact acaagcagaa gtactccagg aacgactgga atggagaatc
7441 agaggagtac aacaggcggc caaagagctg ggtgaagtca atcgaggcat ttggagagag
7501 ctatatttcc gagaagacca aaggggagat ttctcagcct ggggcggcta tcaacgagca
7561 caagaacggc tctgggggga acaatcctca ccaagggtcc ttagacctgg agattcgaag
7621 cgaaggagga aacatttatg actgttgcat taaagcccaa gaaggaactc tcgctatccc
7681 ttgctgtgga tttcccttat ggctattttg gggactagta attatagtag gacgcatagc
7741 aggctatgga ttacgtggac tcgctgttat aataaggatt tgtattagag gcttaaattt
7801 gatatttgaa ataatcagaa aaatgcttga ttatattgga agagctttaa atcctggcac
7861 atctcatgta tcaatgcctc agtatgttta gaaaaacaag gggggaactg tggggttttt
7921 atgaggggtt ttataaatga ttataagagt aaaaagaaag ttgctgatgc tctcataacc
7981 ttgtataacc caaaggacta gctcatgttg ctaggcaact aaaccgcaat aaccgcattt
8041 gtgacgcgag ttccgcattt gtgacgcgtt aagttcctgt ttttacagta tataagtact
8101 tgtgttctga caattgggac tcagattctg cggtctgagt cccttctctg ctgggctgaa
8161 aaggcctttg taataaatat aattctctac tcagtccctg tctctagttt gtctgttcga
8221 gatcctacac aaaggattag caatcagaat aatattagac tactcaaaag caacacataa
8281 agccagaaga cagctgagtc atgttttcaa attctgacaa aaaattattt acaacttaga
8341 attc
//
LOCUS HIV2BEN 10359 bp ss-RNA VRL 04-AUG-1990
DEFINITION Human immunodeficiency virus type 2 (HIV-2), complete proviral
genome.
ACCESSION M30502
KEYWORDS complete genome.
SOURCE Human immunodeficiency virus type 2 (isolate BEN) proviral DNA,
clones MK[2,6].
ORGANISM Human immunodeficiency virus type 2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (sites; for [2])
AUTHORS Kirchhoff,F., Jentsch,K., D Bachmann,B., Stuke,A., Laloux,C.,
Lueke,W., Stahl-Henning,C., Schneider,J., Nieselt,K., Eigen,M. and
Hunsmann,G.
TITLE A novel proviral clone of HIV-2: Biological and phylogenetic
relationship to other primate immunodeficiency viruses
JOURNAL Virology 177, 305-311 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 10359)
AUTHORS Kirchhoff,F., Jentsch,K., D Bachmann,B., Stuke,A., Laloux,C.,
Lueke,W., Stahl-Henning,C., Schneider,J., Nieselt,K., Eigen,M. and
Hunsmann,G.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Kindly submitted prior to publication by Dr. F. Kirchhoff, DPZ,
Gottingen, 0551/3851-0. HIV2BEN was isolated from a German AIDS
patient(with predominantly neurological complications) who was
probably infected in Mali (Klemm et al., J Neuro 235,304,1988).
The sequence was determined from two clones: bases 1903 to 5057 and
7293 to 9131 were from MK2 and bases 1 to 1902, 5058 to 7292 and
9132 to 10359 were from MK6.
FEATURES from to/span description
pept 1103 2668 gag polyprotein
pept < 2326 5754 pol polyprotein (NH2 terminus uncertain)
pept 5423 6070 vif protein
pept 5898 6239 vpx protein
pept 6239 6502 vpr protein
pept 6402 6697 tat protein, exon 2 (first expressed exon)
8861 8957 tat protein, exon 3 (AA at 8862)
pept 6628 6697 rev protein, exon 2 (first expressed exon)
8861 9102 rev protein, exon 3
pept 6704 9286 env polyprotein
pept 9120 9893 nef protein
LTR 1 855 5' LTR
LTR 9505 10359 3' LTR
binding 859 875 primer (Lys-tRNA) binding site
signal 10211 10216 poly-A signal
BASE COUNT 3506 a 2132 c 2598 g 2123 t
ORIGIN Left end of viral genome (5'LTR)
1 tgcaagggat gttttacagt aggaggagac atagaatcct agacatatac ctagaaaaag
61 aggaagggat aataccagat tggcagaatt atactcatgg gccaggagta aggtacccaa
121 tgtacttcgg gtggctgtgg aagctagtat cagtagaact ctcacaagag gcagaggaag
181 atgaggccaa ctgcttagta cacccagcac aaacaagcag acatgatgat gagcatgggg
241 agacattagt gtggcagttt gactccatgc tggcctataa ctacaaggcc ttcactctgt
301 acccagaaga gtttgggcac aagtcaggat tgccagagaa agaatggaag gcaaaactga
361 aagcaagagg gataccatat agtgaataac aggaacaacc atacttggtc aaggcaggaa
421 gtagctacta agaaacagct gaggctgcag ggactttcca gaaggggctg taaccaaggg
481 agggacatgg gaggagctgg tggggaacgc cctcatactt actgtataaa tgtacccgct
541 tcttgcattg tattcagtcg ctctgcggag aggctggcag atcgagccct gagaggttct
601 ctccagcact agcaggtaga gcctgggtgt tccctgctgg actctcacca gtacttggcc
661 ggtactgggc agacggctcc acgcttgctt gcttaaagac ctcttcaata aagctgccag
721 ttagaagcaa gttaagtgtg tgttcccatc tctcctagtc gccgcctggt cattcggtgt
781 tcatctgagt aacaagaccc tggtctgtta ggacccttct cgctttggga atccaaggca
841 ggaaaatccc tagcaggttg gcgcccgaac agggacttga agaggactga gaagccctgg
901 aactcggctg agtgaaggca gtaagggcgg caggaacaaa ccacgacgga gtgctcctag
961 aaaggcgcgg gccgaggtac caaaggcggc gtgtggagcg ggagtgaaag aggcctccgg
1021 gtgaaggtaa gtacctacac caaaaactgt agccagaaaa ggcttgttat cctaccttta
1081 gacaggtaga agattgtggg agatgggcgc gagaaactcc gtcttgagag ggaaaaaagc
1141 agacgaatta gaaaaagtta ggttacggcc cggcggaaag aaaaagtaca ggttaaaaca
1201 tattgtgtgg gcagcgaatg aattggataa attcggattg gcagagagcc tgttggagtc
1261 aaaagaaggt tgccaaaaga ttctcagagt tttagatcca ttagtaccaa cagggtcaga
1321 aaatttaaaa agccttttta ataccgtctg cgtcatttgg tgcttgcacg cagaagagaa
1381 agtgaaagat actgaggaag caaagaaact agcacagaga catctagtgg cagaaactgg
1441 aactgcagag aaaatgccaa atacaagtag accaacagca ccacctagtg ggaaaagagg
1501 aaactacccc gtgcaacaag cgggtggcaa ctatgtccat gtgccactga gcccccgaac
1561 tctaaatgca tgggtaaaat tagtggagga aaagaagttc ggggcagaag tagtgccagg
1621 atttcaggca ctctcagaag gctgcacgcc ctatgatatt aatcaaatgc ttaattgtgt
1681 gggcgatcac caagcagcta tgcaaataat cagagagatt attaatgaag aagcagcaga
1741 ctgggattcg cagcacccaa taccaggccc cttaccagca ggacagctca gagacccaag
1801 agggtctgac atagcaggaa caacaagcac agtagatgaa cagatccagt ggatgtatag
1861 gccacaaaat cccgtaccgg tagggaacat ctacagaaga tggatccaaa tagggctgca
1921 aaagtgtgtc agaaagtaca acccaactaa catcttagac ataaaacagg gaccaaaaga
1981 accgttccaa agctatgtag acaggttcta caaaagcttg agggcagaac aaacagaccc
2041 agcagtaaaa aattggatga cccaaacgct gctaatacag aatgccaacc cagactgcaa
2101 gttagtacta aaaggactgg ggatgaatcc caccctagaa gagatgctaa ccgcctgcca
2161 gggggtaggc ggaccaggcc agaaagccag gctaatggct gaagccctaa aagaggctat
2221 gggaccaagc cctatcccat ttgcagcagc ccaacaaaga aaggcaatta ggtattggaa
2281 ctgtggaaag gagggacact cggcaagaca gtgccgagca cctagaagac agggctgctg
2341 gaagtgtggc aagccaggac acatcatggc aaactgcccg gaaagacagg caggtttttt
2401 agggttgggc ccacggggaa agaagcctcg caacttcccc gtgacccaag cccctcaggg
2461 gctgatacca acagcacctc cggcagatcc agcagcggaa ctgttggaga gatatatgca
2521 gcaagggaga aagcagaggg agcagaggga gagaccatac aaagaggtga cggaggactt
2581 gctgcacctc gagcagagag agacacctca cagagaggag acagaggact tgctgcacct
2641 caattctctc tttggaaaag accagtagtc acagcgtaca tcgaggatca gccggtagaa
2701 gtcttactag acacaggggc tgatgactca atagtagcag gaatagaatt aggggacaat
2761 tacactccaa aaatagtagg gggaataggg ggatttataa acaccaaaga atacaaaaat
2821 gtagaaataa aagtactaaa taaaagagta agagccacca taatgacagg agatacccca
2881 atcaacatct ttggcagaaa tattctgaca gccttaggca tgtcattaaa tttaccagtt
2941 gccaagatag agccaataaa agtaacattg aagccaggga aagatggacc aaggctgaaa
3001 caatggcccc taacaaaaga gaaaatagaa gcactaaaag agatctgtga aaaaatggaa
3061 aaagagggcc agctagaaga ggcacctcca actaatcctt ataatacccc cacatttgca
3121 attaagaaaa aggacaagaa caaatggagg atgctgatag attttagaga actaaataag
3181 gtgactcaag atttcacaga aattcagcta ggaattccac acccggcagg actagccaaa
3241 aagaaaagga tctctatatt agatgtaggg gatgcctatt tttccatacc actacatgaa
3301 gattttaggc agtatactgc atttacccta ccagcagtaa acaatatgga accaggaaaa
3361 agatatatat ataaagtctt gccacaagga tggaagggat caccagcaat ttttcaatac
3421 acaatgaggc aagtcttaga acctttcaga aaagcaaacc cagatgtcat tctcatccag
3481 tacatggatg atatcttaat agctagtgac aggacaggtt tagagcatga caaagtggtc
3541 ctgcagctaa aagaacttct aaatggccta gggttttcta ctccagatga gaagttccaa
3601 aaagaccctc catttcaatg gatgggctgt gaactatggc caactaaatg gaagctgcag
3661 aaactacaac tgccccagaa agacatatgg acagtcaatg acatccaaaa gctagtggga
3721 gtcttaaatt gggcggcaca aatctattca ggaataaaaa ccaaacactt atgtagacta
3781 attagaggaa aaatgacact cacagaagaa gtgcagtgga cagaactagc agaagcagag
3841 ctagaagaaa acaaaattat cttgagccag gaacaagaag gatattatta ccaagaagaa
3901 aaagaattag aggcaacaat ccaaaaaagc caaggacatc aatggacata caaaatacac
3961 caggaagaga aaatcctaaa agtaggaaag tatgcaaaga taaaaaatac ccataccaat
4021 ggggtcagat tactagcaca ggtagttcag aaaataggaa aagaggcact agtcatttgg
4081 ggacggatac caaaatttca cctgccagtg gagagagaga cctgggagca gtggtgggat
4141 aactactggc aagtgacatg gatcccagag tgggactttg tatctacccc accactggtc
4201 aggttaacat ttaacctagt aggagatcct ataccaggcg cagagacctt ctacacagat
4261 ggatcatgca atagacagtc aaaagaggga aaagcaggat atgtaacaga tagaggaaaa
4321 gacaaagtaa aagtattaga acaaactacc aatcagcagg cagaattaga agtctttcgg
4381 atggcactgg cagactcagg cccaaaggtt aatatcatag tagattcaca gtatgtaatg
4441 gggatagtag caggccagcc aacagagtca gaaaatagaa tagtgaacca gatcatagaa
4501 gaaatgataa agaaggaagc agtctatgtt gcatgggtcc cagcccataa aggcatagga
4561 ggaaaccagg aagtagacca tttagtaagt caaggcatca gacaagtatt attcctggaa
4621 aagatagagc ccgctcaaga ggaacatgaa aaatatcata gcattataaa agaactaacc
4681 cataaatttg gaatacccct tctagtagca agacagatag taaactcatg tgcccaatgc
4741 caacagaaag gagaagccat acatgggcaa gtaaatgcag aaataggcgt ttggcaaatg
4801 gactacacac acttagaagg aaaaatcatt atagtagcag tacatgttgc aagtggattc
4861 atagaagcag aagtcatccc acaggaatca ggaaggcaga cagcactctt cctattaaaa
4921 ctggccagta ggtggccaat aacgcacttg cacacagaca atggccccaa cttcacttca
4981 caggaagtga agatggtggc atggtgggta ggtatagaac aatcctttgg agtaccttac
5041 aacccacaaa gccagggagt agtagaagca atgaatcacc acctaaagaa tcagataagt
5101 agaattagag aacaggcaaa tacaatagaa acaatagtac tgatggcagt tcattgcatg
5161 aattttaaaa gaaggggagg aataggggat atgaccccag cagaaagact aatcaacatg
5221 attaccacag aacaagaaat acaattcctc caaagaaaaa attcaaattt taaaaatttc
5281 caggtctatt acagagaagg cagagatcag ctgtggaaag gacctggtga actactgtgg
5341 aagggagaag gagcagtcat agtcaaggta gggacagaca taaaagtagt accaagaagg
5401 aaggccaaga ttatcaggga ctatggagga agacaggaac tggatagtag tccccacctg
5461 gagggtgcca gggaggatgg agaaatggca tgcccttgtc aagtacctga aatacagaac
5521 aaaagaccta gaggaggtgc gctatgttcc ccaccacaag gtgggatggg catggtggac
5581 ttgcagcagg gtaatattcc cactacaagg aaaaagtcat ctagaaatac aggcatattg
5641 gaacctaaca ccagaaaaag gatggctctc ctctcatgca gtaagattaa cctggtatac
5701 agaaaagttc tggacagatg ttaccccaga ctgtgcagac atcctaatac atagcactta
5761 tttctcttgc tttacggcag gtgaagtaag aagagccatc agaggggaaa agttattgtc
5821 ctgctgcaac tatccccaag ctcataaagc acaggtacca tcacttcaat acctagccct
5881 agtagtagta caacaaaatg acagacccca gagaaagggt accgccagga aacagtggag
5941 aagagaccat tggagaggcc ttcgagtggc tagagaggac catagaagcc ttaaacaggg
6001 aggcagtgaa ccatctgccc cgagagctca ttttccaggt gtggcaaagg tcctggagat
6061 attggcatga tgaacaaggg atgtcagcaa gctacacaaa gtatagatat ttgtgcctaa
6121 tgcaaaaagc tatatttaca catttcaaga gagggtgcac ttgctggggg gaggacatgg
6181 gccgggaagg attggaagac caaggacctc cccctcctcc ccctccaggt ctagtctaat
6241 gactgaagca ccaacagagt ttcccccaga agatgggacc ccacggaggg acttagggag
6301 tgactgggta atagaaactc tgagggaaat aaaggaagaa gccttaagac attttgatcc
6361 ccgcttgcta attgctcttg gctactatat ccataataga catggagaca cccttgaagg
6421 cgccagagag ctcattaaaa ccctacaacg agccctcttc gtgcacttca gagcgggatg
6481 taaccgctca agaattggct aaacaaggag aagaactcct tgcccagctg caccgacccc
6541 tagagccatg cactaacaaa tgctattgta agcgatgcag tttccattgc cagctgtgtt
6601 tctcgaaaaa ggggctcgga atatcatatg agcgaaaggg cagacgaaga aggactccaa
6661 ggaaaactaa gactccttcg ccttctgcac cagacaagtg agtatggagc ctggtaggaa
6721 tcagctgttt gttgtcattt tactaacaag tgcttgctta gtatattgta gccagtatgt
6781 gactgttttc tatggcatac ccgcgtggaa aaatgcatct attcccttat tttgtgcaac
6841 taaaaataga gacacttggg ggaccataca gtgcttgcca gacaatgatg attatcagga
6901 aataatttta aatgtgacag aggcttttga tgcatggaat aatacagtga cagaacaagc
6961 agtagaagat gtctggcatc tatttgagac atcaataaaa ccatgtgtca agctaacacc
7021 tctatgtgtg gcaatgaatt gtagcagggt tcaagggaat accacgaccc cgaatcccag
7081 gacctcgagt tccacaacct cgagaccacc cacatccgca gcctccataa taaatgaaac
7141 ttctaactgc atagaaaaca acacatgcgc aggattaggg tatgaggaga tgatgcaatg
7201 tgagttcaat atgaaggggt tagaacaaga taagaaaagg aggtataagg acacatggta
7261 tttagaagat gtggtttgtg acaacacaac agctggcaca tgttacatga gacattgcaa
7321 cacatcaatc atcaaagagt catgtgataa gcactattgg gatgctatga ggtttagata
7381 ctgtgcacca ccgggctttg ccctattaag atgtaatgat accaactatt caggctttga
7441 acctaagtgc actaaagtag tagctgcttc atgcacaagg atgatggaaa cgcaaacttc
7501 tacttggttt ggctttaatg gcactagagc agaaaataga acatatatct attggcatgg
7561 cagagataat aggactatca ttagcttaaa caagtattat aatctcacaa tgcgttgtaa
7621 gagaccagga aataagacag ttttaccaat aacacttatg tcaggattag tgtttcactc
7681 tcagccaatc aacacaaggc ctaggcaggc atggtgccgg tttggaggca gatggaggga
7741 agccatgcag gaggtgaagc aaacccttgt acaacatccc agatacaaag gaatcaatga
7801 tacagggaaa attaacttta cgaaaccggg agcaggctca gacccggaag tggcatttat
7861 gtggactaac tgcagaggag aatttctcta ctgtaacatg acttggttcc tcaattgggt
7921 agaagacaag aaccaaacac ggcgcaacta ttgccatata aagcagataa ttaatacctg
7981 gcataaagta gggaaaaatg tatatttgcc tcctagggaa ggggagttgg cctgtgaatc
8041 aacagtaacc agcataattg ctaacattga catagataaa aatcggactc ataccaacat
8101 tacctttagt gcagaagtgg cagaactgta ccgattagaa ctgggagact acaaattaat
8161 agaaataaca ccaattggct tcgcacctac agatcagaga aggtactcct caactccagt
8221 gaggaacaaa agaggtgtgt tcgtgctagg gttcttgggt tttctcgcga cagcaggttc
8281 tgcaatgggc gcgcggtccc tgacgctgtc agcccagtcc cggactttac tggccgggat
8341 agtgcagcaa cagcaacagc tgttggacgt agtcaagaga caacaagaaa tgttgcgact
8401 gaccgtctgg ggaacgaaaa acctccaggc aagagtcact gctatcgaga agtacctaaa
8461 gcatcaggca cagctaaatt catggggatg tgcgtttaga caggtctgcc acactactgt
8521 accgtgggta aatgactctt tatcgcctga ctggaaaaat atgacatggc aggagtggga
8581 gaaacaagtc cgctacctag aggcaaatat cagtcaaagt ttagaagaag cccaaattca
8641 acaagaaaag aatatgtatg aattacaaaa attaaatagc tgggatattc ttggcaactg
8701 gtttgactta acctcctggg tcaagtatat tcaatatgga gtgcatatag tagtgggaat
8761 aatagcttta agaatagcaa tctatgtagt gcaattgtta agtagattta gaaagggcta
8821 taggcctgtt ttctcttccc cccccggtta tctccaacag atccatatcc acaaggaccg
8881 gggacagcca gccaacgaag gaacagaaga agacgtcgga ggcgacagtg gttacgactt
8941 gtggccttgg ccaataaact atgtgcagtt cctgatccac ctactgactc gcctcttgat
9001 cgggctatac aacatctgca gagacttact atccaagaac tccccgaccc gccgactgat
9061 ctcccagagt ctaacagcaa tcagggactg gctgagactt aaggcggccc aactgcaata
9121 tgggtgcgag tggatccaag aagctttcca agcattcgcg aggactacga gagagactct
9181 tgcgggcgcg tggggatggt tatgggaagc agcgcgacgc atcgggaggg gaatactcgc
9241 agttccaaga agaatcaggc agggagcaga actcgccctc ctgtgaggga cagcagtatc
9301 agcagggaga gtacatgaac agcccatgga gaaacccagc aacagaaaga cagaaagatt
9361 tgtataggca gcaaaatatg gatgatgtag attctgatga tgatgaccta ataggagttc
9421 ctgttacacc aagagtacca cggagagaaa tgacctataa attggcaata gatatgtcac
9481 attttataaa agaaaaaggg ggactgcaag ggatgtttta cagtaggagg agacatagaa
9541 tcctagacat atacctagaa aaagaggaag ggataatacc agattggcag aattatactc
9601 atgggccagg agtaaggtac ccaatgtact tcgggtggct gtggaagcta gtatcagtag
9661 aactctcaca agaggcagag gaagatgagg ccaactgctt agtacaccca gcacaaacaa
9721 gcagacatga tgatgagcat ggggagacat tagtgtggca gtttgactcc atgctggcct
9781 ataactacaa ggccttcact ctgtacccag aagagtttgg gcacaagtca ggattgccag
9841 agaaagaatg gaaggcaaaa ctgaaagcaa gagggatacc atatagtgaa taacaggaac
9901 aaccatactt ggtcaaggca ggaagtagct actaagaaac agctgaggct gcagggactt
9961 tccagaaggg gctgtaacca agggagggac atgggaggag ctggtgggga acgccctcat
10021 acttactgta taaatgtacc cgcttcttgc attgtattca gtcgctctgc ggagaggctg
10081 gcagatcgag ccctgagagg ttctctccag cactagcagg tagagcctgg gtgttccctg
10141 ctggactctc accagtactt ggccggtact gggcagacgg ctccacgctt gcttgcttaa
10201 agacctcttc aataaagctg ccagttagaa gcaagttaag tgtgtgttcc catctctcct
10261 agtcgccgcc tggtcattcg gtgttcatct gagtaacaag accctggtct gttaggaccc
10321 ttctcgcttt gggaatccaa ggcaggaaaa tccctagca
//
LOCUS HIV2D205 7817 bp ss-RNA VRL 04-AUG-1990
DEFINITION Human immunodeficiency virus type 2 (HIV-2), gag, pol, vif,
vpX, vpR, partial env.
ACCESSION X16109
KEYWORDS .
SOURCE Human immunodeficiency virus type 2 (isolate D205) viral RNA.
ORGANISM Human immunodeficiency virus type 2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 7817)
AUTHORS Dietrich,U., Adamski,M., Kreutz,R., Seipp,A., Kuhnel,H. and
Rubsamen-Waigmann,H.
TITLE A highly divergent HIV-2-related isolate
JOURNAL Nature 342, 948-950 (1990)
STANDARD full staff_review
COMMENT This isolate, also called HIV2-ALT by the authors, was taken from a
healthy Ghanian woman. Tree analysis reveals that it likely arose
prior to the branching of HIV2s and SIVs, i.e. the sooty mangabey
and macaque isolates. [1] calls attention to an extension of the
gag-pol overlap region resulting in a dilated (by 20 residues) pol
orf. The env cds begins at 6671; tat-exon1 at 6375 and rev-exon1 at
6601, with the possible donor splice site at 6665.
FEATURES from to/span description
pept 1079 2644 gag polyprotein
pept < 2299 5475 pol polyprotein (NH2 terminus uncertain)
pept 5396 6046 vif protein
pept 5877 6212 vpX protein
pept 6212 6517 vpR protein
pept 6375 / 6664 tat protein, exon 2 (first expressed exon)
pept 6601 / 6664 rev protein, exon 2 (first expressed exon)
pept 6744 > 6875 env polyprotein
site 6876 6878 env polyprotein in-frame stop codon
BASE COUNT 2705 a 1602 c 1930 g 1580 t
ORIGIN 5'-terminus of 5'LTR
1 tggaagggat gtattatagt gagagaagac acagaatatt agacacatat tttgagaatg
61 aagaaggcat tgtgtctggc tggcaaaact atactcatgg gccagggata aggcatccca
121 aatactttgg ttggctgtgg aagctggtac cagtagaggt gccagcagcg acccgagagg
181 aggaggaaac ccattgccta atgcacccgg cacagatctc ctcatgggat gacatccatg
241 gggagactct tatctggcag tttgattccc tcctggcata tgattatgtg gctttcaata
301 ggtttccaga agagttcggg taccagtcag ggctaccaga ggaggagtgg aaggctagac
361 taaaagcaag agggatacct acagattagg caagagacag cagcataaac aggaactagc
421 tgacactgca caagaaggaa actagcagac actgcaggga ctttccaaaa gaggctgtaa
481 catgggagga gcaagggagg ggttggttgg gaacgcccat taaatctctg tataaatgta
541 cccgcttctc gcattgtatt cagtcgctct gcggagaggc tggcagattg agccctggga
601 ggttctctcc agcactagca ggtagagcct gggtgttccc tgctagactc tcaccagcac
661 taggccagtg ttgggcagac ggctccacgc ttgcttgctt aaaaccctct taataaactg
721 ccaattagaa gcaagtaagt gtgtgttccc atctctccta gtcgcgcctg gtcactcggt
781 gctccactga tagaccctgg tctgttagga cccttcttgc tttgggaaac cgaagcggga
841 aaatccctag cagattggcg cccgaacagg gacctgggaa gagtgagagt cttcaagagc
901 acagctgagt gaaggcagta agggcggcag gaatcaacca cgacggagag ctcctgtaaa
961 agcgcaggcc ggtaccaggc agcgtgagga gcgggaggag aagaggctcc gggagtaagg
1021 taagtactcc aaattttgca ttgtctactc taaagaggga gtagggcata agtgggagat
1081 gggcgcgaga ggctccgtcc tgtcagggaa gaaaacagat gaattagaaa aagttaggtt
1141 acggcccggc ggaaagaaaa agtatatgct aaaacatgta gtatgggcag tgaacgaatt
1201 agacagattt ggattagcag agagcctgtt ggaatcaaaa gaaggatgtc aaaaaattct
1261 taaagtctta gctccattag taccaacagg gtcagaaaat ttaaaaagcc tttttaatat
1321 cgtctgcgtc attttttgcc tgcacgcaga agagaaagtg aaagatacag aggaagcaaa
1381 aaagatagca cagagacatc tagcggcgga cacagaaaaa atgccagcta caaataaacc
1441 aacagcacca cctagcggcg gaaattatcc agtgcagcaa ctggctggca actacgtcca
1501 cctgccgcta agcccccgaa ccttaaatgc ttgggtaaag ttagtagaag aaaagaagtt
1561 cggggcagaa gtagtaccag gatttcaggc actatcagaa ggatgcaccc cttatgatat
1621 aaatcagatg ctaaattgtg taggagaaca tcaggcagcc atgcaaatta ttagagaaat
1681 aatcaatgag gaagcagcag actgggacca gcaacacccg tcaccaggcc caatgccggc
1741 aggacaactt agggacccaa gagggtcaga tatagcagga accaccagca cagtagagga
1801 acagatacag tggatgtaca gggcccaaaa tcctgtccca gtgggaaaca tttatagaag
1861 atggattcaa ttaggattgc agaaatgtgt ccgaatgtac aatcctacca acatattaga
1921 cataaagcag ggaccaaagg agcccttcca aagctatgta gatagattct acaaaagctt
1981 acgggcagaa caaacagacc cagcagtgaa aaattggatg acacaaacac tgctgattca
2041 gaatgctaac ccagattgca agttagtgct taagggcttg ggaatgaatc ccaccttaga
2101 ggaaatgcta acggcctgcc aagggatagg aggcccaggg cagaaggcaa ggctaatggc
2161 cgaagcctta aaagaggccc taacacctgc acccataccg tttgctgccg ttcaacaaaa
2221 agcagggaag agagggacag tgacatgctg gaactgtggc aaacagggac acacagccag
2281 gcaatgcagg gcccctagaa gacagggatg ctggaaatgt ggaaaaacag gacacatcat
2341 gtcaaaatgc ccagaaagac aggcgggttt tttagggtta ggaccctggg gaaagaagcc
2401 tcgcaacttc cccatgaccc aagtgcctca gggagtgaca ccatctgcac ccccgatgaa
2461 cccagcagag ggcatgacac ctcggggggc gacaccatct gcgccccctg cagatccagc
2521 agtggagatg ctgaaaagtt acatgcagat ggggagacaa cagagagaga gccgagagag
2581 accctacaag gaggtgacag aggatttgct gcacctcaat tctctctttg gagaagacca
2641 gtagtcaaag catgtatcga gggtcagtca gtagaagtat tactagacac aggagttgac
2701 gactcaatag tagcagggat agaattaggt agcaattaca ccccaaaaat agtaggaggg
2761 ataggagggt tcataaatac caaagaatac aaagatgtag aaatagaagt agtgggaaaa
2821 agagtaaggg caactataat gacaggagat accccaataa acatttttgg cagaaatatt
2881 ttaaatacct tgggcatgac tttaaatttc ccagtggcaa aggtagaacc agtaaaagtt
2941 gagttaaaac ctggaaaaga tgggccaaag atcagacaat ggcctctatc cagggaaaag
3001 atactagccc tcaaagaaat ctgtgaaaaa atggaaaagg agggacaatt agaagaagca
3061 ccccctacta atccatacaa cacacccacc tttgccataa agaagaaaga taagaacaaa
3121 tggagaatgc taatagactt tagagagtta aacaaggtaa cccaggactt cacagaagtc
3181 aactgggtat tccccacccg gcaggtggca gaaaaaagga gaataacagt aatagatgtg
3241 ggagatgcct acttcagtat cccactagat ccaaacttca gacagtatac agcattcacc
3301 ttgccatcag taaacaatgc agagccagga aagagataca tttataaagt tctaccacaa
3361 gggtggaagg ggtcccagtc aatttgtcaa tactccatga gaaaggtatt agatcctttt
3421 agaaaggcca acagcgatgt cattataatt cagtacatgg atgacatcct tatagcaagt
3481 gacagaagtg atctggagca cgacagggta gtgtcccaac taaaagagtt attaaatgac
3541 atgggattct ctaccccaga agaaaagttc caaaaagacc ctccgttcaa atggatgggt
3601 tatgagctct ggccaaaaaa gtggaaactg caaaaaatac aactgccaga aaaagaagtt
3661 tggacagtga atgcaattca aaaactggta ggagtattaa actgggcagc tcaactcttt
3721 cctggaatta agacaaggca catatgcaaa ctaattaggg gaaagatgac cctaacagaa
3781 gaagtacagt ggacagaact agcagaagca gagctacagg agaataaaat catcttagaa
3841 caggaacaag aaggatccta ctacaaggaa agggtaccgc tagaagcaac agtacagaaa
3901 aacctagcaa atcagtggac atacaaaatt catcagggaa ataaagtcct aaaagtagga
3961 aaatatgcaa aggttaaaaa cacgcacacc aacggggtaa gactactggc acatgtagtt
4021 cagaaaatag gcaaagaagc cctagtcatc tggggagaga taccagtgtt ccatctgcca
4081 gtagaaagag agacatggga ccagtggtgg acagattact ggcaagtaac ctggatccca
4141 gagtgggact ttgtctcgac cccaccatta ataagactag cctacaacct agtcaaagac
4201 cccctagaag ggagagaaac ctactacaca gatgggtcct gcaatagaac ctcaaaggaa
4261 ggaaaagcag gatatgtcac tgacagggga aaagataagg ttaaagtgtt agaacagaca
4321 acaaaccaac aagcagaact tgaagcattt gcattagcat taacagactc agaaccacaa
4381 gttaacatca tagtagattc acaatatgtc atgggaataa tagctgcaca gccaacagaa
4441 acagaatcac caatagtagc aaaaataatt gaagaaatga tcaaaaaaga ggcagtatat
4501 gtaggatggg taccagctca caagggactg ggtggtaatc aggaagtaga ccacctagta
4561 agtcaaggaa tcagacaggt cttgttccta gaaaaaatag aaccagccca ggaagagcat
4621 gaaaaatatc atggcaatgt aaaagaactg gtccataaat tcggaattcc acaattagtg
4681 gcaaaacaga tagtaaattc ctgtgataaa tgccaacaaa aaggggaagc tattcatgga
4741 caggtaaatg cagacctagg gacatggcag atggactgta cacatttaga aggaaaaatt
4801 ataatagtgg cagtccatgt agccagtggg tttatagaag cagaggtaat accccaagag
4861 acaggaagac agacagctct cttcctacta aagttggcca gcagatggcc tatcacacac
4921 ctacacacag acaacggtgc caacttcacc tcaccaagtg taaagatggt agcctggtgg
4981 gtaggaatag aacaaacttt tggagtaccc tataacccac aaagtcaagg agtagtggaa
5041 gcaatgaacc atcacctgaa aaatcaaata gacagactca gagaccaagc agtatcaata
5101 gagacagttg tactaatggc aactcactgc atgaatttta aaagaagggg aggaataggg
5161 gatatgaccc ctgcagaaag actagttaac atgataacca cagagcaaga aatacagttc
5221 ttccaagcaa aaaatttaaa atttcaaaat ttccaggtct attacagaga aggcagagat
5281 caactctgga agggacctgg tgaactattg tggaaagggg aaggagcagt catcataaag
5341 gtagggacag aaatcaaagt agtacccagg agaaaagcaa aaattataag gcactatgga
5401 ggaggaaaag gattggattg tagtgccgac atggaggata ccaggcaggc tagagagatg
5461 gcacagtctg attaagtatc ttaagtatag aacaggagag ttgcaacagg tctcttatgt
5521 ccctcaccac aaggtaggat gggcttggtg gacttgcagt agaataatat ttcccctaaa
5581 caaaggagca tggctagaag tccaaggata ttggaaccta accccagaaa ggggattctt
5641 gagctcctat gctgtaagac taacatggta tgagaggaac ttttatacag atgtaacacc
5701 tgatgtggca gaccagctac tgcatgggtc ttatttctct tgcttttcag ccaatgaagt
5761 aaggagagcc atcaggggag aaaagatatt gtcctactgc aactatccat cagctcacga
5821 agggcaggta ccaagcttac agtttctagc cctaagggtc gtacaggaag gaaaaaatgg
5881 atcccaggga gagagtgcca ccaggaaaca gcgacgaaga aacagtagga gaagcattcg
5941 cttggctaga aagaacaata acagagctca acagggtagc ggtcaaccat ttgccccgag
6001 aacttatttt ccaggtctgg cagaggtctt gggcatactg gcgtgaggaa cagggcatgt
6061 caattagcta taccaaatat agatacttgt tgctaatgca gaaagcaatg tttgtgcact
6121 atacaaaggg ctgtaggtgc ctgcaggagg gccatgggcc agggggatgg agatcaggac
6181 ctcctcctcc tcctccccca ggcctggcct aatggcagaa gcagccccag agatccctcc
6241 agagaacgag aacccacaaa gagaaccgtg ggaagagtgg ataggggaga tcctggagga
6301 aataaagcaa gaagccttaa agcattttga tcctcgcttg ctaactgcgc ttggtaactt
6361 tatctacagt aggcatggag atacccttgc aggagcagga gagctcatta aaatcctcca
6421 acgagccctc ttcctccact tcagagccgg ttgtcaacac tcaaggattg gacaatcagg
6481 gggaggaaat cctctctcaa ctataccgcc cccttaaggc atgcgataat acatgctact
6541 gtaagaaatg ctgctaccat tgccagcttt gttttcttaa aaagggtctt gggatatgtt
6601 atgaccgctc gagaaggaga tctgcaaaaa gagctaagac tactgcacct tctgcaccag
6661 acaagtgagt atggcatatt ttagcagccg cctgcctatt gcgctcctgc ttataggtat
6721 cagtgggttt gtatgtaaac aatatgttac tgtcttctat ggcatacccg catggaggaa
6781 cgcaacagtt cccctcattt gtgcaaccac aaacagagac acctggggaa ctgtacagtg
6841 tctcccagac aatggtgact acactgagat caggctaaac ataacagagg cttttgatgc
6901 atgggataat acagtgacac aacaggcagt agatgatgtg tggagactct ttgaaacctc
6961 cataaaacca tgtgtcaaac taaccccact gtgtgtggca atgaactgta gtaaaaccga
7021 aacaaaccca gggaatgcca gtagtactac caccactaag cctactacca cctctcgtgg
7081 gctgaaaacg attaacgaaa cagacccatg cataaaaaat gacagctgca caggactagg
7141 agaagaggaa ataatgcaat gtaattttag tatgacggga ctaagaagag atgagctaaa
7201 acaatataaa gacacctggt actcagaaga tttagagtgt aataatacca ggaagtatac
7261 cagcagatgc tatataagaa cctgcaacac aacaattatc caagagtcat gtgacaaaca
7321 ttattgggac agcttaaggt ttaggtattg tgctcccccg gggttttttc tactaagatg
7381 taatgatacc aactattcag gcttcatgcc caactgcagt aaggtagtag cgtcctcctg
7441 cacaagaatg atggaaacac agtcctctac atggtttggc ttcaatggta caagggcaga
7501 gaacaggaca tatatatatt ggcatgaaaa agacaatagg accatcataa gcttaaatac
7561 atactataat ttgtcaatac actgtaagag gccaggaaac aagacggttg taccaataag
7621 aaccgtgtca ggactacttt tccattcaca gcctatcaat aagagaccca gacaagcttg
7681 gtgctggttt aagggaaact ggacagaagc cataaaggag gtgaaaagga ccatcataaa
7741 acatcccagg tataaaggag gtgcaaaaaa tatcacaagc gtaaagttag tatcagaaca
7801 tggaaaaggt tcagatc
//
LOCUS HIV2GH1 9480 bp ds-DNA VRL 04-AUG-1990
DEFINITION Human immunodeficiency virus type 2 proviral DNA, gag, pol, vif,
vpX, vpR, tat, rev, env and nef genes.
ACCESSION M30895
KEYWORDS .
SOURCE Human immunodeficiency virus type 2 MOLT-4 cell (isolate GH1)
proviral DNA, clone 8.
ORGANISM Human immunodeficiency virus type 2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9480)
AUTHORS Hasegawa,A., Tsujimoto,H., Maki,N., Ishikawa,K.-I., Miura,T.,
Fukasawa,M., Miki,K. and Hayami,M.
TITLE Genomic divergence of HIV-2 from Ghana
JOURNAL AIDS Res. Hum. Retroviruses 5, 593-604 (1989)
STANDARD full staff_review
COMMENT GH1 was isolated from a Ghanian AIDS patient; virus was cloned in
MOLT-4 clone 8 cells.
FEATURES from to/span description
pept 544 2112 gag polyprotein
pept < 1770 4937 pol polyprotein
pept 4867 5514 vif protein
pept 5342 5680 vpX protein
pept 5680 5997 vpR protein
pept 5843 6138 tat protein, exon 2 (first expressed exon)
8281 8377 tat protein, exon 3 (AA at 8282)
pept 6069 6138 rev protein, exon 2 (first expressed exon)
8281 8522 rev protein, exon 3 (AA at 8283)
pept 6148 8364 env polyprotein (see COMMENT)
pept 8540 9307 nef protein
BASE COUNT 3257 a 1957 c 2349 g 1917 t
ORIGIN
1 cagtcgctct ggcagaggct ggcagattga gccctgggag gttctctcca gcactagcag
61 gtagagcctg ggtgttccct gctagactct caccagtgct tggccggcac tgggcagacg
121 gctccacgct tgcttgctta aagacctctt aataaagctg ccaattagaa gcaagttaag
181 tgtgtgttcc catctctcct agtcgccgcc tggtcattcg gtgttcatct gaataacaag
241 accctggtct gttaggaccc ttccgctttg ggaatccaag gcaggaaaat ccctagcagg
301 ttggcgccca acagggactt gaaagggact gagaagccct ggaacacggc tagtgaaggt
361 agtaaggcgg caggaacaaa ccacgacgga gtgctcctag aaaagcgcgg gccgaggtac
421 caaaggcggc gtgtggagcg ggagtgaaag aggcctccgg gtgaaggtaa gtgcctacac
481 cgaaaaccgt agccagaaaa ggctagttat cctatctttt agacaggtag aagattgtgg
541 gagatgggcg cgagaaactc cgtcttgaga gggaaaaaag cagatgaatt agaaaagatt
601 aggttacggc ccagcggaaa gaaaaagtac aggctaaaac atattgtgtg ggcagcgaac
661 gaattggaca aattcggatt ggcagagagc ctgttggagt caaaagaagg ttgccaaaag
721 attctcacag ttttagatcc attagtacca acagggtcag aaaatttaaa aagccttttt
781 aacacagtct gcgtcatttg gtgcttgcac gcagaagaga aagtgaaaga tactgaagaa
841 gcaaaaaaac tagtacagag acatctaggg gcagaaacag gaactgcaga aaaaatgcca
901 agtacaagta gaccaacagc accacctagc gggagaggaa gaaacttccc tgtacaacag
961 acaggcggtg gcaactatat ccacgtgcca ctgagccccc gaactctaaa tgcttgggta
1021 aaattagtag aggacaagaa gttcggggca gaagtagtgc caggatttca agcactctca
1081 gaaggctgca cgccctatga tatcaaccaa atgcttaatt gtgtgggcga tcaccaagca
1141 gctatgcaaa taatcagaga gattatcaat gacgaagcag cagattggga tgcacagcac
1201 ccaataccag gccccttacc agcagggcag cttagagacc caagggggtc tgacatagca
1261 ggaacaacaa gcacagtaga agaacagatc cagtggatgt ataggccaca aaatcccgtg
1321 ccggtaggga acatctacag aagatggatc cagatagggc tacagaagtg tgtcaggatg
1381 tacaacccaa ctaacatctt agacgtaaag cagggaccaa aggaaccgtt ccagagctat
1441 gtggacaggt tctataaaag cttgagggca gaacaaacag atccggcagt aaagaactgg
1501 atgacccaaa cgctgctaat acagaatgcc aacccagact gcaagttagt actaaaagga
1561 ctggggatga atcccaccct agaagagatg ctgactgcct gtcagggggt aggtggacca
1621 ggccaaaaag ccagactaat ggccgaagcc ctgaaggagg ctttgacacc accccctatc
1681 ccatttgcag cagcccaaca aagaaaggta attaggtgtt ggaactgcgg aaaggaggga
1741 cactcggcga gacagtgccg agcgcctaga agacagggct gctggaaatg tggcaagaca
1801 ggacacgtca tggcaaaatg cccagaaaga caggcaggtt ttttagggat gggtccatgg
1861 ggaaagaagc cccgcaactt ccccgtggcc caagctcctc cggggctgat accaacagca
1921 cccccagcag atccagcagt ggatctattg gaaagatata tgcagcaggg gagagagcag
1981 agggagcaga gggagagacc atacaaagag gtgacggaag acttactgca cctcgagcag
2041 ggaaaagcac ctcacagaga ggcgacagag gacttgctgc acctcaattc tctctttgga
2101 aaagaccagt agtcacagca tacatcgagg tacagccggt agaagtctta ttagacacag
2161 gggctgatga ctcaatagtg gcaggaatac agttagggga caattacgtt ccaaaaatag
2221 tggggggaat agggggattc ataaacacca aagagattaa aaatatagaa ataaaagtac
2281 taaataaaag agtaagagcc accataatga caggagatac cccaatcaac atctttggca
2341 gaaatatcct gacagcctta ggcatgtcat taaatttacc aatcgccaag atagaaccaa
2401 taaaagtaac gttgaagcca gggaaagatg gaccaagact gagacaatgg cccctaacaa
2461 aagagaaaat agaagcacta agagagatct gcgaaaaaat ggaaaaggag ggccaactag
2521 aagaggcacc tccaactaat ccttataata cccccacatt tgcaattaag aaaaaggaca
2581 agaacaaatg gaggatgcta atagacttta gagaactaaa cagggtgact caagatttca
2641 cagaaattca gctaggaatt ccacacccgg caggattagc caaaaagaaa aggatcactg
2701 tactagatgt aggggatgcc tacttttcca taccgctaca tgaagatttt aggcagtata
2761 ctgcatttac cctaccatca gtaaacaatg cagaaccagg aaaaagatat atatataagg
2821 tcttaccaca aggatggaag ggatcaccag caatttttca acacacaatg aggcaagtct
2881 tagaaccttt cagaaaagca aacccagatg tcattctcat ccaatacatg gatgatattt
2941 taatagctag tgacaggaca ggtttagagc atgacaaagt agtcctgcag ctaaaagaac
3001 ttctgaatgg cctagggttc tctaccccag atgagaagtt ccaaaaagac cctccgcttc
3061 aatggatggg ctatgaacta tggccaacta aatggaagct gcagaaatta caactgcccc
3121 agaaagaaat atggacagtc aacgacatcc agaagctggt gggagtctta aattgggcgg
3181 cacaaatcta tccaggaata aaaaccaaac atttgtgtag gctaatcaaa ggaaaaatga
3241 cactcacaga agaagtgcag tggacagagt tagcagaagc agagctagaa gaaaacaaaa
3301 ttatcttaag ccaggaacaa gaaggatact attaccaaga agaaaaagaa ttagaggcaa
3361 caatccaaaa aaatcaagac aatcagtgga catacaaaat acaccaggaa gagaaaatcc
3421 taaaagtagg gaaatatgca aagataaaaa atacccatac caatggggtc agattactag
3481 cacaggtagt tcagaaaata ggaaaagaag cactagtcat ctggggacgg ataccaaaat
3541 tccacttacc ggtggagaga gagacctggg agcagtggtg ggataactac tggcaagtga
3601 catggatccc agagtgggac ttcgtatcta ccccaccact agtcaggtta acatttaacc
3661 tagtaggaga tcctatacca ggcgcagaga ccttctacac agatggatca tgcaatagac
3721 agtcaaagga aggaaaagca cgatatgtga cagatagagg aagagacaag gtaagagtac
3781 tagaacgaac taccaatcag caagcagaat tagaagcctt tgcgatgaca ctgacagact
3841 cgggtccaaa agttaatatc atagtagact cacagtatgt aatgggaata gtagtaggcc
3901 aaccaacaga gtcagaaagt agaatagtaa accaaatcat tgaggacatg ataaagaaag
3961 aagcagtcta tgtcgcgtgg gtcccagccc ataaaggcat aggaggaaac caggaagtag
4021 accatttagt aagtcagggc atcagacaag tattgttcct agaaaggata gagcccgccc
4081 aagaagaaca tgaaaaatat catagcaata tgaaagaact aacccataaa tttggaatac
4141 cccaactagt agcaagacag atagtaaaca catgtgccca atgccaacag aaaggagagg
4201 ccatacatgg gcaggtaaat gcagaaatag gtgtctggca aatggactgc acacacttag
4261 aaggaaagat cattatagta gcagtacatg ttgcaagtgg attcatagaa gcagaagtta
4321 tcccacagga atcaggaagg cagacagcgc tcttcctatt aaaactggcc agtaggtggc
4381 caataacaca cttacacaca gacaatggct ccaacttcac ttcacaggaa gtgaagatgg
4441 tggcatggtg gataggtata gagcaatcct ttggagtacc ttacaaccca caaagccagg
4501 gagtagtaga agcaatgaat caccacttaa agaatcagat aagtagaatt agagaacagg
4561 caaatacaat agaaaccata gtactaatgg cagttcattg catgaatttt aaaagaaggg
4621 gaggaatagg ggatatgacc ccagcagaaa gactaatcaa catgattacc acagaacaag
4681 aaatacaatt cctccaaaga aaaaattcaa attttaaaaa cttccaggtc tattacagag
4741 aaggcagaga tcagctgtgg aaaggacctg gagaactact gtggaaggga gacggagcag
4801 tcatagtcaa ggtaggggca gacataaaag taataccaag aaggaaggcc aagattatca
4861 gagactatgg aggaaggcaa gaactggata gttcccacct ggagggtgcc agggaggagg
4921 atggagaggt ggcatagtct tgtcaagtac ctaaaataca gaacaagaga cctagaggag
4981 gtgcgctatg ttccccacca caaggtggga tgggcgtggt ggacttgcag cagggtaata
5041 ttccctctaa aaggagaaag tcatctagag atacaggcat attggaacct aacgccagaa
5101 aaaggatggc tctcctctca ttcagtgaga ataacctggt atacagaaag gttctggaca
5161 gatgttaccc cagactatgc agacatccta atacatagca cttatttctc ttgctttacg
5221 gcaggtgaag taagaagagc catcagaggg gaaaagttat tgtcctgctg caactatccc
5281 caagctcata aagtacaggt accatcactt caatacctag ccctagtggt agtgcaacaa
5341 aatgacagac cccagagaaa gggtaccgcc aggaaacagt ggagaagaga ccattggaga
5401 ggccttcgag tggctagaca ggactataga agccttaaac agagaggcag tgaaccatct
5461 gccccgagag ctcattttcc aggtgtggca aaggtcctgg agatactggc atgatgacca
5521 agggatgtca ccaagctaca caaagtatag atatttgtgc ctgatgcaga aagctgtatt
5581 tatacatttc aagagagggt gcacttgcct ggggggagga catggaccgg gaggatggag
5641 atcaggacct ccccctcctc cccctccagg tctagtctaa tgactgaagc accaacagag
5701 tttcccccag aagatgggac cccacggagg gaactagggg gtgactgggt aataagaatt
5761 ctgggagaaa taaaggaaga agccttaaag cattttgatc cccgcttgct aatcgctctt
5821 ggcaactata tccatagtag acatggagac acacctgaag gcgccagaga gctcattaga
5881 atcctacaac gagccctctt cgtgcacctc agagcagggt gtaaccgctc aagaattagc
5941 caaacaagga gaagaactcc tttcccagct gcaccgaccc ctagaggcat gtactaactc
6001 atgctattgt aagcagtgca gcttccattg ccagctgtgt ttcttgaaaa aagggctcgg
6061 gatatggtat gcacgaaaaa gcagacggag aagaactcca agaaagacta agactcattc
6121 gtcttctgca tcagacaagt gagtaagatg tgtggtaaga gtctactatg tgttgccagc
6181 ttgctagcta gtgcttactt agtatattgc acccaatatg tgactgtttt ctatggcgtg
6241 cccgtgtgga gaaatgcatc cattcccctc ttttgtgcaa ctaaaaacag agatacttgg
6301 ggaaccatac agtgcaagcc agacaatgat gattatcaag aaataacctt gaatgtgaca
6361 gaggctttcg acgcatggga taatacagta acagaacaag cagtagaaga tgtctggagt
6421 ctatttgaga catcaataaa accatgcgtc aaactaacac ctctatgtgt agcaatgagt
6481 tgtaacagca ccactaacaa cacaaccaca acagggagca caacagggat gtcagagata
6541 aatgaaactt ctccatcata ctcagacaac tgcacaggat tggggaagga agagatagtc
6601 aattgtcagt tctatatgac aggattagag agagataaga aaaaacagta taatgaaaca
6661 tggtactcaa aagatgtagt ttgtgaatca aataatacca aagatgggaa aaacagatgt
6721 tacatgaacc attgcaacac atcagtcatc acagagtcat gtgataagca ctattgggat
6781 gctataaagt ttagatactg tgcaccaccg ggttatgccc tactaagatg caatgatacc
6841 aattattcag gcttcgagcc aaagtgttct aaagtagtag cttctacatg cacaaggatg
6901 atggaaacgc aaacctccac ttggtttggc tttaatggca ctagggcaga gaatagaaca
6961 tatatctatt ggcatggcag agataatagg actatcatca gcttaaacaa gtattataac
7021 ctttctatac attgtaagag accaggaaac aagacagttg taccaataac actcatgtca
7081 gggttagtgt ttcactccca gccaatcaat acaagaccta ggcaagcatg gtgctggttc
7141 aaaggcaagt ggagggaagc catgcaggag gtaaagcaaa cccttataaa acatcccagg
7201 tataaaggaa ccaacgacac aaagaatatt aactttacaa aaccaggaag aggctcagac
7261 ccagaagtgg catacatgtg gactaactgc agaggagaat ttctctactg caacatgact
7321 tggttcctca attgggtaga aaacagaccg aatcagacac agcacaatta tgcgccgtgc
7381 catataaggc aaataattaa tacctggcac aaggtaggga aaaatgtata tttgcctcct
7441 agagaaggac agttgacctg caactcaaca gtgaccagca taattgctaa cattgacgta
7501 aatagtaacc agacaaatat tacctttagt gcagaggtgg cagaactata ccgattagaa
7561 ttgggagatt ataaattaat agaagtaaca ccaattggct tcgcacctac gagagagaaa
7621 agatattcct cggctccagt gaggaacaaa agaggtgtgt ttgtgctagg gttcttgggt
7681 tttctcgcga cagcaggttc tgcaatgggc gcggcgtccc tgacgctgtc ggctcagtcc
7741 cggactttac tggccgggat agtgcagcaa cagcaacagc tgttggacgt agtcaagaga
7801 caacaagaaa tgttgcgact gaccgtctgg ggaacgaaaa atctccaggc aagagtcact
7861 gctatcgaga agtacttaaa ggaccaggca cagttaaatt catgggggtg tgcgtttagg
7921 caggtctgcc acactactgt accatgggta aatgattctt tatcgcctga ctggaacaat
7981 atgacatggc aggaatggga aaaacaagtc cgctacctag aggcaaatat cagtcaaagt
8041 ttagaacagg cacaaattca acaagaaaag aatatgtatg aattacaaaa gctaaatagc
8101 tgggatgtct ttggcaactg gtttgattta acctcctgga tcaagtatat tcaatatgga
8161 gtttatatag tagtaggagt gatagtttta agaatagcaa tatatatagt gcaattgtta
8221 agtagactta gaaagggcta taggcctgtt ttctcttccc ctcccggtta tctccaacag
8281 atccatatcc acacggaccg gggacagcca gccaacgaag gaacagaaga agacgacaga
8341 gacgacgatg gttacgactt gtagccttgg ccgataaact atatacattt cctgatccac
8401 ctactgactc gcctcttgac cgggctatac aagatctgca gagacttact atccacgaac
8461 tccccgaccc accgactgat ctcccagaat ctaacagcaa tcagggactg gctgagactt
8521 aaagcggcct acctgcaata tgggggcgag tggatccaag aagcattcca agcattcgca
8581 aagactacga gagagactct tgcgagcgca tggggggggt tatgtgcagc agtgcaacgc
8641 gtcgggaggg gaatactcgc agtcccaaga aggatcaggc aaggggcaga aatcgccctc
8701 ctgtgaggga cagcagtatc ggcagggaga tttcatgaac accccatgga gaaccccggc
8761 aatagaagga cagaaaaagt tatacaagca gcaaaatatg gatgatatag attctagtga
8821 tgatgaccta gtgggagttc ctgttacacc aagagtacca ctaagagcaa tgacctataa
8881 attggcagta gatatgtcac attttataaa gaaaagggga ctggatggga tgttttacag
8941 tagggataga catagaatcc tagacttgta cctagaaaag gaggaaggga taataccaga
9001 ttggcagaat tatactcatg ggccaggagt aaggtaccca atgtgtttcg ggtggctatg
9061 gaagctagta ccagtagatg tctcacaaga ggcagaggac gatgagacca actacttaac
9121 acacccagca caaacaagca gacatgatga cgagcatgga gaaacactac tttggcggtt
9181 tgaccccacg ctggcctatg attacaaggc tttcattctg catccagaag agtttgggca
9241 caagtcagga ttgccagaga aagagtggaa ggcaaaactg aaagcaagag ggataccata
9301 tagttaacaa caggaacagc catacttggt caaggcagga agtagctact gagaacagct
9361 gaggctgcag ctgcagggac tttccagaag gggctgtaac caagggaggg acatgggagg
9421 agctggtggg gaacgccctc atacttactg tataaatgta ctcgcttctt acattgtatt
//
LOCUS HIV2NIHZA 9431 bp ss-RNA VRL 04-AUG-1990
DEFINITION Human immunodeficiency virus type 2 (HIV-2) proviral gag, pol, vif,
vpX, vpR, tat, rev, env and nef genes.
ACCESSION J03654
KEYWORDS .
SOURCE Human immunodeficiency virus type 2 (isolate FG) proviral DNA,
clone NIHZ.
ORGANISM Human immunodeficiency virus type 2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9431)
AUTHORS Zagury,J.F., Franchini,G., Reitz,M., Collalti,E., Starcich,B.,
Hall,L., Fargnoli,K., Jagodzinski,L., Guo,H.G., Laure,F. and et,al.
TITLE Genetic variability between isolates of human immunodeficiency
virus (HIV) type 2 is comparable to the variability among HIV type
1
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5941-5945 (1988)
STANDARD full staff_review
COMMENT Hard copy of sequence kindly provided by G. Franchini (August
1987).
The NIHZ clone has a large deletion in the nef cds and LTR relative
to other HIV2 sequences; nevertheless, it is biologically active.
FEATURES from to/span description
pept 547 2106 gag polyprotein
pept < 1824 4931 pol polyprotein (NH2 terminus uncertain; AA at
1824)
pept 4861 5508 vif protein
pept 5336 5674 vpX protein
pept 5674 5991 vpR protein
pept 5837 6132 tat protein, exon 2 (first expressed exon)
8272 8368 tat protein, exon 3 (AA at 8273)
pept 6063 6132 rev protein, exon 2 (first expressed exon)
8272 8525 rev protein, exon 3 (AA at 8274)
pept 6139 8709 env polyprotein
pept 8543 9085 nef protein
LTR < 1 301 5' LTR
LTR 8928 > 9431 3' LTR
rpt < 1 177 R repeat 5' copy
rpt 9254 9431 R repeat 3' copy
binding 305 321 primer (Lys-tRNA) binding site
signal 9404 9409 poly-A signal
BASE COUNT 3213 a 1913 c 2348 g 1957 t
ORIGIN Cap site of genomic RNA.
1 agtcgctctg gcagaggctg gcagattgag ccctgggagg ttctctccag cactagcagg
61 tagagcctgg gtgttccctg ctagactctc accagtgctt ggccggcgct gggcagacgg
121 ctccacgctt gcttgcttaa agacctctca ataaagctgc caattagaag caagttaagt
181 gtgtgctccc atctctccta gtcgccgcct ggtcatccgg tgttcacctg tataacaaga
241 ccctggtctg ttaggaccct tcttgctttg ggaaaccgag gcaggcggaa aatccctagc
301 aggttggcgc ccgaacagga cttgaagaag actgagaagc cttggaacaa gggctgggtg
361 aaggcagtaa gggcggcagg aacaaaccag cacggagtgc tcctagaaag gcgcgggccg
421 aggtaccaaa gcggcgtgtg gacgggagtc gagaggcctc cgggtgaagg gtaagtacct
481 acaccaaaaa ctgtagccaa agagggcttg ttatcctacc tttagacagg tagaagattg
541 tgggagatgg gcgcgagaaa ctccgtcttg agagggaaaa aagcagatga attagaaaaa
601 attaggttac ggcccggcgg aaagaaaaag tacaagttaa aacatattgt gtgggcagcg
661 aatgaattgg acagattcgg attagcagag agcctgttgg agtcaaaaga aggttgccaa
721 aaaattctta cagttttaga tccattagta ccgacaggtt cagaaaattt aaaaagcctt
781 tttaatactg tctgcgtcat ctggtgcata cacgcagaag agaaagtgaa agatactgaa
841 ggagcaaaac aaatagtaca gagacatcta gtggcagaaa caggaactgc agagaaaatg
901 ccaaatacaa gtagaccaac agcaccacct agcgggaaaa acttccccgt gcagcaagta
961 gctggcaact atacccatat accactgagt cccggaactc taaatgcttg ggtaaaatta
1021 gtggaggaaa agaaattcgg ggcagaagta gtgccaggat ttcaggcact ctcagaaggc
1081 tgcacgccct atgatattaa tcagatgctt aattgtgtgg gcgaccatca agcagctatg
1141 caaataatca gggaaattat taatgaggaa gcagcagact gggatgttgc acatcccata
1201 ccaggcccct taccagcggg gcagcttaga gagccaagag ggtctgacat agcagggaca
1261 acaagcacag tagaagagca gatccagtgg atgtttagac cacaaaatcc tgtaccagta
1321 ggcaacatct ataggagatg gattcagata ggactgcaaa agtgcgtcag gatgtacaat
1381 ccaaccaaca tcctagacat aaatcaagga ccaaaggaac cattccaaag ctatgtagat
1441 aggttctaca aaagcttgag ggcagaacaa acagatccag cagtaaaaaa ttggatgacc
1501 caaacactgc taatacagaa tgccaaccca gattgtaaat tagtactaaa aggattagga
1561 atgaatccta ccttagaaga aatgctgacc acctgtcagg gggtaggggg accaggccag
1621 aaagctagat taatggcaga agccctaaag gaggtcatgg caccagcccc catcccattt
1681 gcagcagccc agcagagaaa gacattcaag tgctggaatt gtggaaagga aggacactcg
1741 gcaagacaat ggagcgcgcc tagaagacag ggctgctgga agtgtggtaa gtcaggacat
1801 gtcatggcaa attgcccaga tagacaggct ggttttttag ggattggcca ctggggaaag
1861 aaggcccgca acttccccgc ggcccaagtc ccgcaggggc taacaccaac agcaccccca
1921 ttggatccag cagtggacct actggagaaa tatatgcagc aaggaaaaag gcaaagggag
1981 cagagagaga gaccgtacaa ggaagtgaca gaggacttac tgcgtttcga gcaggcagag
2041 acaccatgca gggagacgac agaggacttg ctgcacctca attctctctt tggaaaagac
2101 cagtagtcac agcacacatt gagggtcagc cagtagaagt tttgttagac acaagggcta
2161 acgactcaat agtagcagga atagagttag ggagcaatta tagtccaaaa atagtaggag
2221 gaataggggg attcataaat accaaggaat ataaaaatgt agaaatagaa gtcctaggta
2281 aaagggtaaa agccaccata atgacaggtg ataccccgat caacattttt ggcagaaatg
2341 ttctgacagc tttaggcatg tcattaaacc tgccagttgc caagatagaa ccaataaaaa
2401 taatgctaaa gccagggaaa gatggaccaa gactaaaaca atggccttta acaaaagaaa
2461 aaatagaagc actaaaagaa atctgtgaaa aaatggaaaa agaaggccag ctagaagaag
2521 cacctccaac taatccttat aataccccca catttgcaat caagaaaaag gacaaaaaca
2581 aatggagaat gctaatagac tttagggagc taaacaaggt aactcaggat ttcacagaaa
2641 ttcagttagg aattccacac ccagcaggat tggccaaaaa gaggagaatt actgtactag
2701 atgtagggga tgcttacttt tccataccac tacatgagga ctttagacag tatactgcat
2761 tcactctacc atcagtaaac aatgcagaac caggaaaaag atatatatac aaagtcctac
2821 cacaggggtg gaaggggtca ccagcaattt ttcaatacac aatgaggcag atcttagaac
2881 cattcagaaa agcaaacgag gatgtcatta tcattcagta catggatgat atcttaatag
2941 ccagcgacag gacagactta gaacatgaca aagtggtcct gcagctaaaa gaacttctaa
3001 atggactagg attttccacc ccagatgaga aattccagaa agaccctcca tatcgctgga
3061 tgggctatga attatggcca actaaatgga agttgcaaaa aatacagctg ccccaaaaag
3121 aagtatggac agtcaatgac atccaaaagc tagtgggtgt cctaaattgg gcagcacaaa
3181 tctacccagg gataaagacc aaacacctat gtaggctaat tagaggaaaa atgacactca
3241 cagaagaggt acagtggaca gaattagcag aagcagagct agaggaaaac agaattatct
3301 taagccagaa acaagaagga cactattacc aggaagaaaa aaagttagaa gcaacagtcc
3361 agaaagatca agacaatcag tggacatata aggtacacca gggggagaaa attctcaaag
3421 gtgggaaaat atgcaaagat aaaaaatacc cataccaacg ggtcagattg ttagcacagg
3481 tagttcaaaa aataggaaaa gaagcactag tcatttgggg acggatacca aaatttcacc
3541 taccagtaga gagagatacc tgggagcagt ggtgggataa ctactggcaa gtaacatgga
3601 tcccagactg ggactttgta tctaccccac cactggtcag gctagcattt aacctggtag
3661 gagagcctgt accaggcgca gaaactttct acacagatgg atcctgcaat aggcagtcaa
3721 aagaagggaa agcaggatat ataacagata gagggagaga cagggtaaaa gtattagagc
3781 aaactaccaa tcagcaagca gaattagaag cctttgcaat ggcactaaca gactcaggtc
3841 caaaagctaa tattatagta gactcacaat atgtaatggg gatagtagca ggccaaccaa
3901 cagagtcaga gaatagaata gtaaatcaga tcatagaaga aatgataaag aaagaagcaa
3961 tctatgttgc gtgggtccca gcccacaaag gcataggagg aaaccaggaa gtagatcatt
4021 tagtaagtca gggcattaga caagtattat tcctagagaa aatagaaccc gcgcaggaag
4081 aacatgaaaa atatcatagc aatataaaag aactgtccca taaatttgga atacccaagc
4141 tagtggcaag acaaatagta aacacatgtg cccatgtgca acagaaaggg gaggctatac
4201 atgggcaagt aaatgcagaa ctaggcactt ggcaaatgga ctgcacacat ttagaaggaa
4261 aagtcattat agtagcagta catgttgcaa gtggatttat agaggcagaa gttatcccac
4321 aggaatcagg aaggcaaaca gcactgttct tactgaaact ggccagtagg tggccaataa
4381 cacacctgca cacagataat ggtgccaact tcacttcaca ggaagtgaaa atggtagcat
4441 ggtgggtagg tatagaacaa acctttggag tgccttacaa tccacaaagc caaggagtag
4501 tagaagcaat gaatcatcat ctaaaaaatc agatagacag aattagagag caggcaaata
4561 cagtagaaac aatagtatta atggcagttc attgcatgaa ttttaaaaga aggggaggaa
4621 taggggatat gaccccagca gaaagaataa tcaatatgat caccacagaa caagaaatac
4681 aattcctcca agcaaaaaat tcaaaattaa aaaattttcg ggtctatttc agagaaggca
4741 gagatcagct gtggaaagga cctggggaac tgctgtggaa gggagacgga gcagtcatag
4801 tcaaggtagg gacagaaata aaagtagtac caagaaggaa agccaagatc attaaagact
4861 atggaggaag gcaagagatg gatagtggtt cccatttgga gggtgccagg gaggatggag
4921 agatggcata gcttggtcaa gtacttgaaa tacagaacaa aagacttaga gaaggtatgc
4981 tatgttcccc atcataaggt ggggtgggca tggtggactt gcagcagggt aatattccca
5041 ctgaaagaga acagtcatct ggagatacaa gcatattgga acttaacacc agaaaaagga
5101 tggctctcct ctcactcagt gagaataact tggtatacag aaaagttctg gacagatgtt
5161 accccagact gtgcagatac cctaatacat agcacttatt tctcttgctt tacggcaggt
5221 gaagtaagaa gagccatcag aggggaaaaa ttattgtcct gctgcaaata cccccgagcg
5281 catagatccc aggtaccgtc acttcaattt ctggccttag tggtagtgca acaaaatgac
5341 agatcccagg gaaacagtgc caccaggaaa cagcggcgag gagactatcg aagaggcctt
5401 cgcatggcta gacaggacag tagaggctat aaacagagag gcagtgaatc accacccacg
5461 agagctcatt ttccaggtct ggcagaggtc ctggagatac tggcatgatg aacaagggat
5521 gtcaacaagc tacacaaaat atagatattt gtgcttaata cagatggcaa tgtacatgca
5581 tgctaagaga gatggcactt gcctgggggg aggcatgggc cagaagggtg gagatcaggg
5641 acctccccct cctccacctc caggtttagt ctaatgactg aagcaccaac agaactcccc
5701 ccggaggata ggaccccacc gagggagcca ggggatgcat gggtaataga aatcctgaga
5761 gaaatagaag aagaagcttt aaggcatttt gaccctcgct tgctaactgc tcttggcaga
5821 tatatctata ctaggcatgg agacaccctt gaaggcgcca gagagctcat tagaatcctg
5881 caacgagccc tcttcgcgca cttcagagca ggatgtggcc actcaagaat tggccagaca
5941 aggggaggaa atcctctctc agctataccg acccctagag gcatgcacta actcctgcta
6001 ttgcaagaaa tgctgctacg attgccagct gtgtttttta caaaaggggc tcgggatatg
6061 gtatgaccga aagggcagac gaagaaggac tccaaagaaa actaaggctc atccgtcttc
6121 tgcatcagac aagtgagtat gaagggtagt aagaatcaac tgctgattgc tattgtacta
6181 gctagtgctt acctaataca ttgcaagcaa tttgtgactg ttttctacgg catacccgcg
6241 tggaggaatg catccattcc cctgttttgt gcaaccaaaa atagagatac ttggggaaca
6301 atacagtgct taccagacaa tgatgattat caggaaataa ctctaaatgt gacagaggct
6361 ttcgatgcat ggaataatac agtaacagaa caagcagtag aggatgtctg gaatctattt
6421 gagacatcaa taaaaccatg tgtcaaatta acacccttat gtgtagcaat gaactgtaca
6481 aggaacatga ccacatggac agggaggaca gacacccaaa atatcacaat tataaatgac
6541 acttcccatg cgcgtgcaga caactgcaca ggattaaagg aggaagaaat gatcgactgt
6601 cagtttagta tgacaggatt agagagagac aagagaaaac agtatactga agcatggtac
6661 tcaaaagatg tggtttgtga caataacacc tcaagtcaga gcaagtgtta catgaaccat
6721 tgcaacacat cagtcatcac agagtcatgt gataagcact attgggatgc tatgaggttt
6781 agatactgtg caccaccggg ttttgcctta ctaagatgca atgataccaa ttattcaggc
6841 tttgcaccta actgctctaa agtagtagct gctacatgca ccagaatgat ggaaacgcaa
6901 acttctacat ggtttggctt taatggcact agagcagaaa atagaacata tatctattgg
6961 catggtaaag ataacagaac tattatcagc ttaaataact tttataatct cactatgcat
7021 tgtaagaggc cgggaaataa gacagtgtta ccaataacat ttatgtcagg gtttaagttt
7081 cactcccagc cagtcatcaa taaaaaaccc aggcaagcat ggtgttggtt cgaaggccaa
7141 tggaaggaag ccatgcagga ggtgaaggag acccttgcga aacatcccag atataaaggg
7201 aacaggagcc gcacagagaa tattaaattt aaagcaccag ggagaggctc agacccagaa
7261 gtaacataca tgtggactaa ctgcagagga gaatctctct actgcaacat gacttggttc
7321 ctcaattggg tagagaacag gacgggtcag aaacagcgca attatgcacc gtgccgtata
7381 aggcaaataa ttaatacctg gcacagggta gggaaaaatc tatatttgcc tcccagggaa
7441 ggggagttga cctgcaactc aacggtgacc agcataattg ccaacattga tgcgggagat
7501 caaacaaata ttacctttag tgcagaggcg gcagaactat accgattgga attgggggat
7561 tacaaattag tagaaatcac accaattggc ttcgcaccta catcagtaaa gagatactcc
7621 tctgctcacc agagacatac aagaggtgtg tttgtgctag ggttcctagg ttttctcgca
7681 acagcaggtt ctgccatggg cgcggcgtcc ttgacgctgt cagctcagtc tcggacttta
7741 ttggccggga tagtgcagca acagcaacag ctgttggatg tggtcaagag acaacaagaa
7801 atgttgcgac tgaccgtctg gggaacaaaa aatctccagg caagagtcac tgctatagag
7861 aagtacctaa aggaccaggc gcagctaaat tcatggggat gtgcgtttag acaagtctgc
7921 cacacttctg taccatgggt aaatgatacc ttgacacctg attggaacaa tatgacgtgg
7981 caggaatggg aacaaaaagt ccgctacctg gaggcaaata tcagtcaaag tctagaacaa
8041 gcacaaattc agcaagaaaa gaatatgtat gagctacaaa aattaaatag ctgggatgtt
8101 tttaccaatt ggcttgactt cacctcctgg gtcaggtata ttcaatatgg agtttatgta
8161 gtagtaggaa tagtagcttt aagaatagta atatatatag tacagatgtt aagtagactt
8221 agaaagggct ataggcctgt tttctcttcc ccccccggtt acatccaaca gatccatatc
8281 cacaaggacc aggaacagcc agccagagaa gaaacagaag aagacgttgg aagcaacggt
8341 ggagacagat cttggccttg gccgatagca tatatacatt tcctgatccg cctgctgatt
8401 cgcctcttga ccgggctata caacatctgc agggacttac tatccaggat ctccccgatc
8461 ctccaaccaa tcttccagag tctccagaga gcactaacag caatcagaga ctggctgagg
8521 cttaaagcag cctacctgca gtatgggtgc gagtggatcc aagaagcgtt ccaagccctt
8581 gcaaggacta caagagagac tcttgcaggc gcggggagag acttgtggag ggcgctgcaa
8641 cgaatcggga gggggatact tgcagtccca cgaaggatca ggcagggagc agaactcgcc
8701 ctcctgtgag ggacagcgat atcagcaggg agattttgta aataccccat ggagaacccc
8761 agcagcagaa agggagaaag aattgtacaa acagcaaaat atggatgatg tagatctaga
8821 tgatgatgac caagtaggat tccctgtcac accaagagta ccattaagac caatgacatt
8881 caaattggca gtagatatgt ctcattttat aaaagaaaaa gggggactgg aagggctgtt
8941 ttatagtcag agaagacata gaatcttaga cttatactta gacaaggctt ttactctgta
9001 cccagaggaa tttgggcata attcaggact gccagagaaa gagtggaagg cgagactgaa
9061 agcaagggga ataccattta gttaaaaaca ggaacagcta tacttggtca gggcaggaag
9121 tagctactga gaacagctga gactgcaggg actttccaga aggggctgta accaggagag
9181 ggacatggga ggagctggtg gggaacgccc ttcatacttt ctgtataaat gtacccgctg
9241 cttgcattgt attcagtcgc tctggcagag gctggcagat tgagccctgg gaggttctct
9301 ccagcactag caggtagagc ctgggtgttc cctgctagac tctcaccagt gcttggccgg
9361 cgctgggcag acggctccac gcttgcttgc ttaaagacct ctcaataaag ctgccaatta
9421 gaagcaagtt a
//
LOCUS HIV2RODZ 9671 bp ss-RNA VRL 04-AUG-1990
DEFINITION Human immunodeficiency virus type 2, complete proviral genome.
ACCESSION M15390
KEYWORDS complete genome.
SOURCE Human immunodeficiency virus type 2 (isolate ROD) proviral DNA,
clones lambda-ROD[27,35].
ORGANISM Human immunodeficiency virus type 2
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 9361 to 9671)
AUTHORS Clavel,F., Guyader,M., Guetard,D., Salle,M., Montagnier,L. and
Alizon,M.
TITLE Molecular cloning and polymorphism of the human immunodeficiency
virus type 2
JOURNAL Nature 324, 691-695 (1986)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 9671)
AUTHORS Guyader,M., Emerman,M., Sonigo,P., Clavel,F., Montagnier,L. and
Alizon,M.
TITLE Genome organization and transactivation of the human immuno-
deficiency virus type 2
JOURNAL Nature 326, 662-669 (1987)
STANDARD full staff_review
COMMENT The ROD isolate was obtained from offshore Senegal in 1985. Its
sequence was determined from two clones: bases 1 to 2658 are from
ROD 27 and bases 2659 to 9671 are from ROD 35. An infectious clone
of this virus has been constructed by Keith Peden, Molecular
Biology and Genetics, Johns Hopkins University School of Medicine,
Baltimore, MD. 21205 (301)955-3652.
FEATURES from to/span description
pept 546 2114 gag polyprotein
pept < 1829 4939 pol polyprotein (NH2 terminus uncertain; AA at
pept 4869 5516 vif protein
pept 5344 5682 vpX protein
pept 5682 5999 vpR protein
pept 5845 6140 tat protein, exon 2 (first expressed exon)
8307 8403 tat protein, exon 3 (AA at 8308)
pept 6071 6140 rev protein, exon 2 (first expressed exon)
8307 8539 rev protein, exon 3 (AA at 8309)
pept 6147 8723 env polyprotein
pept 8557 9327 nef protein
LTR < 1 299 5' LTR
LTR 8942 > 9671 3' LTR
rpt < 1 173 R repeat 5' copy
rpt 9497 9671 R repeat 3' copy
binding 303 319 primer (Lys-tRNA) binding site
signal 9649 9654 poly-A signal
BASE COUNT 3314 a 1972 c 2401 g 1984 t
ORIGIN Cap site of genomic RNA.
1 ggtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
61 gtagagcctg ggtgttccct gctagactct caccagcact tggccggtgc tgggcagacg
121 gccccacgct tgcttgctta aaaacctctt aataaagctg ccagttagaa gcaagttaag
181 tgtgtgctcc catctctcct agtcgccgcc tggtcattcg gtgttcacct gagtaacaag
241 accctggtct gttaggaccc ttcttgcttt gggaaaccga ggcaggaaaa tccctagcag
301 gttggcgcct gaacagggac ttgaagaaga ctgagaagtc ttggaacacg gctgagtgaa
361 ggcagtaagg gcggcaggaa caaaccacga cggagtgctc ctagaaaggc gcgggccgag
421 gtaccaaagg cagcgtgtgg agcgggagga gaagaggcct ccgggtgaag gtaagtacct
481 acaccaaaaa ctgtagccga aagggcttgc tatcctacct ttagacaggt agaagattgt
541 gggagatggg cgcgagaaac tccgtcttga gagggaaaaa agcagatgaa ttagaaagaa
601 tcaggttacg gcccggcgga aagaaaaagt acaggctaaa acatattgtg tgggcagcga
661 ataaattgga cagattcgga ttagcagaga gcctgttgga gtcaaaagag ggttgtcaaa
721 aaattcttac agttttagat ccaatggtac cgacaggttc agaaaattta aaaagtcttt
781 ttaatactgt ctgcgtcatt tggtgcatac acgcagaaga gaaagtgaaa gatactgaag
841 gagcaaaaca aatagtgcgg agacatctag tggcagaaac aggaactgca gagaaaatgc
901 caagcacaag tagaccaaca gcaccatcta gcgagaaggg aggaaattac ccagtgcaac
961 atgtaggcgg caactacacc catataccgc tgagtccccg aaccctaaat gcctgggtaa
1021 aattagtaga ggaaaaaaag ttcggggcag aagtagtgcc aggatttcag gcactctcag
1081 aaggctgcac gccctatgat atcaaccaaa tgcttaattg tgtgggcgac catcaagcag
1141 ccatgcagat aatcagggag attatcaatg aggaagcagc agaatgggat gtgcaacatc
1201 caataccagg ccccttacca gcggggcagc ttagagagcc aaggggatct gacatagcag
1261 ggacaacaag cacagtagaa gaacagatcc agtggatgtt taggccacaa aatcctgtac
1321 cagtaggaaa catctataga agatggatcc agataggatt gcagaagtgt gtcaggatgt
1381 acaacccgac caacatccta gacataaaac agggaccaaa ggagccgttc caaagctatg
1441 tagatagatt ctacaaaagc ttgagggcag aacaaacaga tccagcagtg aagaattgga
1501 tgacccaaac actgctagta caaaatgcca acccagactg taaattagtg ctaaaaggac
1561 tagggatgaa ccctacctta gaagagatgc tgaccgcctg tcagggggta ggtgggccag
1621 gccagaaagc tagattaatg gcagaggccc tgaaagaggt cataggacct gcccctatcc
1681 cattcgcagc agcccagcag agaaaggcat ttaaatgctg gaactgtgga aaggaagggc
1741 actcggcaag acaatgccga gcacctagaa ggcagggctg ctggaagtgt ggtaagccag
1801 gacacatcat gacaaactgc ccagatagac aggcaggttt tttaggactg ggcccttggg
1861 gaaagaagcc ccgcaacttc cccgtggccc aagttccgca ggggctgaca ccaacagcac
1921 ccccagtgga tccagcagtg gatctactgg agaaatatat gcagcaaggg aaaagacaga
1981 gagagcagag agagagacca tacaaggaag tgacagagga cttactgcac ctcgagcagg
2041 gggagacacc atacagggag ccaccaacag aggacttgct gcacctcaat tctctctttg
2101 gaaaagacca gtagtcacag catacattga gggtcagcca gtagaagtct tgttagacac
2161 aggggctgac gactcaatag tagcaggaat agagttaggg aacaattata gcccaaaaat
2221 agtaggggga atagggggat tcataaatac caaggaatat aaaaatgtag aaatagaagt
2281 tctaaataaa aaggtacggg ccaccataat gacaggcgac accccaatca acatttttgg
2341 cagaaatatt ctgacagcct taggcatgtc attaaatcta ccagtcgcca aagtagagcc
2401 aataaaaata atgctaaagc cagggaaaga tggaccaaaa ctgagacaat ggcccttaac
2461 aaaagaaaaa atagaagcac taaaagaaat ctgtgaaaaa atggaaaaag aaggccagct
2521 agaggaagca cctccaacta atccttataa tacccccaca tttgcaatca agaaaaagga
2581 caaaaacaaa tggaggatgc taatagattt cagagaacta aacaaggtaa ctcaagattt
2641 cacagaaatt cagttaggaa ttccacaccc agcagggttg gccaagaaga gaagaattac
2701 tgtactagat gtaggggatg cttacttttc cataccacta catgaggact ttagaccata
2761 tactgcattt actctaccat cagtgaacaa tgcagaacca ggaaaaagat acatatataa
2821 agtcttgcca cagggatgga agggatcacc agcaattttt caacacacaa tgagacaggt
2881 attagaacca ttcagaaaag caaacaagga tgtcattatc attcagtaca tggatgatat
2941 cttaatagct agtgacagga cagatttaga acatgatagg gtagtcctgc agctcaagga
3001 acttctaaat ggcctaggat tttctacccc agatgagaag ttccaaaaag accctccata
3061 ccactggatg ggctatgaac tatggccaac taaatggaag ttgcagaaaa tacagttgcc
3121 ccaaaaagaa atatggacag tcaatgacat ccagaagcta gtgggtgtcc taaattgggc
3181 agcacaactc tacccaggga taaagaccaa acacttatgt aggttaatca gaggaaaaat
3241 gacactcaca gaagaagtac agtggacaga attagcagaa gcagagctag aagaaaacag
3301 aattatccta agccaggaac aagagggaca ctattaccaa gaagaaaaag agctagaagc
3361 aacagtccaa aaggatcaag agaatcagtg gacatataaa atacaccagg aagaaaaaat
3421 tctaaaagta ggaaaatatg caaaggtgaa aaacacccat accaatggaa tcagattgtt
3481 agcacaggta gttcagaaaa taggaaaaga agcactagtc atttggggac gaataccaaa
3541 atttcaccta ccagtagaga gagaaatctg ggagcagtgg tgggataact actggcaagt
3601 gacatggatc ccagactggg acttcgtgtc taccccacca ctggtcaggt tagcgtttaa
3661 cctggtaggg gatcctatac caggtgcaga gaccttctac acagatggat cctgcaatag
3721 gcaatcaaaa gaaggaaaag caggatatgt aacagataga gggaaagaca aggtaaagaa
3781 actagagcaa actaccaatc agcaagcaga actagaagcc tttgcgatgg cactaacaga
3841 ctcgggtcca aaagttaata ttatagtaga ctcacagtat gtaatgggga tcagtgcaag
3901 ccaaccaaca gagtcagaaa gtaaaatagt gaaccagatc atagaagaaa tgataaaaaa
3961 ggaagcaatc tatgttgcat gggtcccagc ccacaaaggc atagggggaa accaggaagt
4021 agatcattta gtgagtcagg gtatcagaca agtgttgttc ctggaaaaaa tagagcccgc
4081 tcaggaagaa catgaaaaat atcatagcaa tgtaaaagaa ctgtctcata aatttggaat
4141 acccaattta gtggcaaggc aaatagtaaa ctcatgtgcc caatgtcaac agaaagggga
4201 agctatacat gggcaagtaa atgcagaact aggcacttgg caaatggact gcacacattt
4261 agaaggaaag atcattatag tagcagtaca tgttgcaagt ggatttatag aagcagaagt
4321 catcccacag gaatcaggaa gacaaacagc actcttccta ttgaaactgg caagtaggtg
4381 gccaataaca cacttgcata cagataatgg tgccaacttc acttcacagg aggtgaagat
4441 ggtagcatgg tggataggta tagaacaatc ctttggagta ccttacaatc cacagagcca
4501 aggagtagta gaagcaatga atcaccatct aaaaaaccaa ataagtagaa tcagagaaca
4561 ggcaaataca atagaaacaa tagtactaat ggcaattcat tgcatgaatt ttaaaagaag
4621 ggggggaata ggggatatga ctccatcaga aagattaatc aatatgatca ccacagaaca
4681 agagatacaa ttcctccaag ccaaaaattc aaaattaaaa gattttcggg tctatttcag
4741 agaaggcaga gatcagttgt ggaaaggacc tggggaacta ctgtggaaag gagaaggagc
4801 agtcctagtc aaggtaggaa cagacataaa aataatacca agaaggaaag ccaagatcat
4861 cagagactat ggaggaagac aagagatgga tagtggttcc cacctggagg gtgccaggga
4921 ggatggagaa atggcatagc cttgtcaagt atctaaaata caaaacaaag gatctagaaa
4981 aggtgtgcta tgttccccac cataaggtgg gatgggcatg gtggacttgc agcagggtaa
5041 tattcccatt aaaaggaaac agtcatctag agatacaggc atattggaac ttaacaccag
5101 aaaaaggatg gctctcctct tattcagtaa gaataacttg gtacacagaa aagttctgga
5161 cagatgttac cccagactgt gcagatgtcc taatacatag cacttatttc ccttgcttta
5221 cagcaggtga agtaagaaga gccatcagag gggaaaagtt attgtcctgc tgcaattatc
5281 cccgagctca tagagcccag gtaccgtcac ttcaatttct ggccttagtg gtagtgcaac
5341 aaaatgacag accccagaga gacagtacca ccaggaaaca gcggcgaaga gactatcgga
5401 gaggccttcg cctggctaaa caggacagta gaagccataa acagagaagc agtgaatcac
5461 ctaccccgag aacttatttt ccaggtgtgg cagaggtcct ggagatactg gcatgatgaa
5521 caagggatgt cagaaagtta cacaaagtat agatatttgt gcataataca gaaagcagtg
5581 tacatgcatg ttaggaaagg gtgtacttgc ctggggaggg gacatgggcc aggagggtgg
5641 agaccagggc ctcctcctcc tccccctcca ggtctggtct aatggctgaa gcaccaacag
5701 agctcccccc ggtggatggg accccactga gggagccagg ggatgagtgg ataatagaaa
5761 tcttgagaga aataaaagaa gaagctttaa agcattttga ccctcgcttg ctaattgctc
5821 ttggcaaata tatctatact agacatggag acacccttga aggcgccaga gagctcatta
5881 aagtcctgca acgagccctt ttcacgcact tcagagcagg atgtggccac tcaagaattg
5941 gccagacaag gggaggaaat cctctctcag ctataccgac ccctagaaac atgcaataac
6001 tcatgctatt gtaagcgatg ctgctaccat tgtcagatgt gttttctaaa caaggggctc
6061 gggatatgtt atgaacgaaa gggcagacga agaaggactc caaagaaaac taagactcat
6121 ccgtctccta caccagacaa gtgagtatga tgaatcagct gcttattgcc attttattag
6181 ctagtgcttg cttagtatat tgcacccaat atgtaactgt tttctatggc gtacccacgt
6241 ggaaaaatgc aaccattccc ctcttttgtg caaccagaaa tagggatact tggggaacca
6301 tacagtgctt gcctgacaat gatgattatc aggaaataac tttgaatgta acagaggctt
6361 ttgatgcatg gaataataca gtaacagaac aagcaataga agatgtctgg catctattcg
6421 agacatcaat aaaaccatgt gtcaaactaa cacctttatg tgtagcaatg aaatgcagca
6481 gcacagagag cagcacaggg aacaacacaa cctcaaagag cacaagcaca accacaacca
6541 cacccacaga ccaggagcaa gagataagtg aggatactcc atgcgcacgc gcagacaact
6601 gctcaggatt gggagaggaa gaaacgatca attgccagtt caatatgaca ggattagaaa
6661 gagataagaa aaaacagtat aatgaaacat ggtactcaaa agatgtggtt tgtgagacaa
6721 ataatagcac aaatcagacc cagtgttaca tgaaccattg caacacatca gtcatcacag
6781 aatcatgtga caagcactat tgggatgcta taaggtttag atactgtgca ccaccgggtt
6841 atgccctatt aagatgtaat gataccaatt attcaggctt tgcacccaac tgttctaaag
6901 tagtagcttc tacatgcacc aggatgatgg aaacgcaaac ttccacatgg tttggcttta
6961 atggcactag agcagagaat agaacatata tctattggca tggcagagat aatagaacta
7021 tcatcagctt aaacaaatat tataatctca gtttgcattg taagaggcca gggaataaga
7081 tagtgaaaca aataatgctt atgtcaggac atgtgtttca ctcccactac cagccgatca
7141 ataaaagacc cagacaagca tggtgctggt tcaaaggcaa atggaaagac gccatgcagg
7201 aggtgaagga aacccttgca aaacatccca ggtatagagg aaccaatgac acaaggaata
7261 ttagctttgc agcgccagga aaaggctcag acccagaagt agcatacatg tggactaact
7321 gcagaggaga gtttctctac tgcaacatga cttggttcct caattggata gagaataaga
7381 cacaccgcaa ttatgcaccg tgccatataa agcaaataat taacacatgg cataaggtag
7441 ggagaaatgt atatttgcct cccagggaag gggagctgtc ctgcaactca acagtaacca
7501 gcataattgc taacattgac tggcaaaaca ataatcagac aaacattacc tttagtgcag
7561 aggtggcaga actatacaga ttggagttgg gagattataa attggtagaa ataacaccaa
7621 ttggcttcgc acctacaaaa gaaaaaagat actcctctgc tcacgggaga catacaagag
7681 gtgtgttcgt gctagggttc ttgggttttc tcgcaacagc aggttctgca atgggcgcgg
7741 cgtccctgac cgtgtcggct cagtcccgga ctttactggc cgggatagtg cagcaacagc
7801 aacagctgtt ggacgtggtc aagagacaac aagaactgtt gcgactgacc gtctggggaa
7861 cgaaaaacct ccaggcaaga gtcactgcta tagagaagta cctacaggac caggcgcggc
7921 taaattcatg gggatgtgcg tttagacaag tctgccacac tactgtacca tgggttaatg
7981 attccttagc acctgactgg gacaatatga cgtggcagga atgggaaaaa caagtccgct
8041 acctggaggc aaatatcagt aaaagtttag aacaggcaca aattcagcaa gagaaaaata
8101 tgtatgaact acaaaaatta aatagctggg atatttttgg caattggttt gacttaacct
8161 cctgggtcaa gtatattcaa tatggagtgc ttataatagt agcagtaata gctttaagaa
8221 tagtgatata tgtagtacaa atgttaagta ggcttagaaa gggctatagg cctgttttct
8281 cttccccccc cggttatatc caacagatcc atatccacaa ggaccgggga cagccagcca
8341 acgaagaaac agaagaagac ggtggaagca acggtggaga cagatactgg ccctggccga
8401 tagcatatat acatttcctg atccgccagc tgattcgcct cttgaccaga ctatacagca
8461 tctgcaggga cttactatcc aggagcttcc tgaccctcca actcatctac cagaatctca
8521 gagactggct gagacttaga acagccttct tgcaatatgg gtgcgagtgg atccaagaag
8581 cattccaggc cgccgcgagg gctacaagag agactcttgc gggcgcgtgc aggggcttgt
8641 ggagggtatt ggaacgaatc gggaggggaa tactcgcggt tccaagaagg atcagacagg
8701 gagcagaaat cgccctcctg tgagggacgg cagtatcagc agggagactt tatgaatact
8761 ccatggaagg acccagcagc agaaagggag aaaaatttgt acaggcaaca aaatatggat
8821 gatgtagatt cagatgatga tgaccaagta agagtttctg tcacaccaaa agtaccacta
8881 agaccaatga cacatagatt ggcaatagat atgtcacatt taataaaaac aaggggggga
8941 ctggaaggga tgttttacag tgaaagaaga cataaaatct taaatatata cttagaaaag
9001 gaagaaggga taattgcaga ttggcagaac tacactcatg ggccaggagt aagataccca
9061 atgttctttg ggtggctatg gaagctagta ccagtagatg tcccacaaga aggggaggac
9121 actgagactc actgcttagt acatccagca caaacaagca agtttgatga cccgcatggg
9181 gagacactag tctgggagtt tgatcccttg ctggcttata gttacgaggc ttttattcgg
9241 tacccagagg aatttgggca caagtcaggc ctgccagagg aagagtggaa ggcgagactg
9301 aaagcaagag gaataccatt tagttaaaga caggaacagc tatacttggt cagggcagga
9361 agtaactaac agaaacagct gagactgcag ggactttcca gaaggggctg taaccaaggg
9421 agggacatgg gaggagctgg tggggaacgc cctcatattc tctgtataaa tatacccgct
9481 agcttgcatt gtacttcggt cgctctgcgg agaggctggc agattgagcc ctgggaggtt
9541 ctctccagca gtagcaggta gagcctgggt gttccctgct agactctcac cagcacttgg
9601 ccggtgctgg gcagacggcc ccacgcttgc ttgcttaaaa acctccttaa taaagctgcc
9661 agttagaagc a
//
LOCUS HIV2ST 9672 bp ss-RNA VRL 04-AUG-1990
DEFINITION Human immunodeficiency virus type 2 (HIV-2), complete proviral
genome.
ACCESSION M31113
KEYWORDS complete genome.
SOURCE Human immunodeficiency virus type 2 (isolate HIV-2/ST) proviral
DNA, clone JSP4-27.
REFERENCE 1 (bases 1 to 9672)
AUTHORS Kumar,P., Hui,H., Kappes,J.C., Haggarty,B.S., Hoxie,J.A.,
Arya,S.K., Shaw,G.M. and Hahn,B.
TITLE Molecular characterization of an attenuated human immunodeficiency
virus type 2 isolate
JOURNAL J. Virol. 64, 890-901 (1990)
STANDARD full staff_review
REFERENCE 2 (sites; corrections bases 6952, 7003, 7926)
AUTHORS Hahn,B.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Kindly provided in computer-readable form by B. Hahn, University of
Alabama, Birmingham. The corrections of nts 6952, 7003 and 7926,
from 'c' to 't' result in amino acid changes.
This isolate, obtained from an asymptomatic Senegalese woman, has
low pathogenicity, infecting cells at a relatively slow rate and
causing little cell death or fusion.
FEATURES from to/span description
pept 548 2113 gag polyprotein
pept < 1717 1767 pol polyprotein (NH2 terminus uncertain; AA at
1717)
1771 4938 pol polyprotein
pept 4868 5515 vif protein
pept 5343 5681 vpX protein
pept 5681 5776 vpR protein
5780 5998 vpR protein
pept 5844 6139 tat protein, exon 2 (first expressed exon)
8291 8387 tat protein, exon 3 (AA at 8292)
pept 6070 6139 rev protein, exon 2 (first expressed exon)
8291 8544 rev protein, exon 3 (AA at 8293)
pept 6149 8728 env polyprotein
site 1768 1770 pol in-frame stop codon
site 5777 5779 vpR in-frame stop codon
revision 6952 1 t in [2]; c in [1]
revision 7003 1 t in [2]; c in [1]
revision 7926 1 t in [2]; c in [1]
pept 8562 9329 nef protein
pre-msg 1 9672 genomic mRNA
pre-msg 1 9672 tat, rev, nef subgenomic mRNA
rpt 9499 9672 R repeat 3' copy
binding 305 321 primer (Lys-tRNA) binding site
signal 9650 9655 poly-A signal
BASE COUNT 3304 a 1991 c 2412 g 1965 t
ORIGIN
1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
61 gtagagcctg ggtgttccct gctagactct caccagtgct tggccggcac tgggcagacg
121 gctccacgct tgcttgctta aaagacctct taataaagct gccagttaga agcaagttaa
181 gtgtgtgctc ccatctctcc tagtcgccgc ctggtcattc ggtgttcatc taaagtaaca
241 agaccctggt ctgttaggac cctttctgct ttgggaaacc aaggcaggaa aatccctagc
301 aggttggcgc ccgaacaggg acttgaagaa gactgagaag ccttggaaca cggctgagtg
361 aaggcagtaa gggcggcagg aacaaaccac gacggagtgc tcctagaaaa gcgcaggccg
421 aggtaccaag ggcggcgtgt ggagcgggag tgaaagaggc ctccgggtga aggtaagtgc
481 ctacaccaaa tacagtagcc agaagggctt gttatcctac ctttagacgg gtagaagatt
541 gtgggagatg ggcgcgagaa actccgtctt gagagggaaa aaagcagacg aattagaaaa
601 gattaggtta cggcccggcg gaaagaaaaa atataggcta aaacatattg tgtgggcagc
661 gaatgaattg gacagattcg gattggcaga gagcctgttg gagtcaaaag agggttgcca
721 aaaaattctt acagttttag atccattagt accgacaggg tcagaaaatt taaaaagcct
781 ttttaatact gtctgcgtca tttggtgtat acacgcagaa gagaaagcga aagatactga
841 agaagcaaaa caaaaggtac agagacatct agtggcagaa acaaaaacta cagaaaaaat
901 gccaagtaca agtagaccaa cagcaccacc tagcgggaac ggaggaaact tccccgtaca
961 acaagtggcc ggcaactata cccatgtgcc actaagtccc cgaaccctaa atgcttgggt
1021 aaaactagta gaggaaaaga agttcggggc agaagtagtg ccaggatttc aggcactctc
1081 agaaggctgc acgccctatg atattaatca aatgcttaat tgtgtgggcg accatcaagc
1141 agctatgcaa ataatcaggg aaattattaa tgaagaagca gcagattggg acgcacaaca
1201 cccaatacca ggccccttac cagcggggca gctcagggag ccaaggggat ctgacatagc
1261 agggacaaca agcacagtag aagagcagat ccagtggatg tttaggccac aaaatcctgt
1321 accagtagga agcatctata gaagatggat ccagataggg ctacagaagt gcgtcaggat
1381 gtacaaccca accaacatcc tagacataaa acagggacca aaggagccat tccagagtta
1441 tgtagataga ttctacaaga gcttgagggc agaacaaaca gatccagcag taaaaaattg
1501 gatgacccaa acactgctag tgcagaatgc caacccagac tgtaagttag tactaaaagg
1561 actagggata aatcctacct tagaagaaat gctaaccgcc tgtcaggggg taggtggacc
1621 aggccagaaa gccagattaa tggcagaagc cttaaaggag gccatggcac cagcccccat
1681 cccatttgca gcagcccaac agagaaggac aattaagtgc tggaattgcg gaaaggaagg
1741 gcactcggca agacaatgcc gagcacctag aagacaaggc tgctggaaat gtggcaaggc
1801 aggacacatc atggcaaaat gcccagaaag acaggcgggt tttttagggt tgggcccatg
1861 gggaaagaag ccccgcaatt tccctgtggc ccaaatcccg caggggctga caccaacagc
1921 acccccgata gacccagtag aggacctact agagaagtac atgcagcaag ggaaaaggca
1981 gagagagcag agagagaggc catacaaaga agtgacagag gacttcctgc agctcgagaa
2041 acaagagaca ccatgcagag agacgacaga ggacttgctg cacctcaatt ctctctttgg
2101 aaaagaccag tagtcacagc acatgttgag ggccagccag tagaagtttt gctagacaca
2161 ggggctgacg actcaatagt agcaggcgta gagttaggga gcaattatag tccaaagata
2221 gtagggggaa tagggggatt cataaatacc aaagaatata aaaatgtaga aataagagta
2281 ttaaataaaa gagtaagagc caccataatg acaggtgata ccccaatcaa catttttggc
2341 agaaacattc tgacagcctt aggcatgtca ttaaatctac cagtcgccaa gatagaacca
2401 ataaaaataa tgctgaagcc aggaaaggat ggaccaaaac tgagacaatg gcccttaaca
2461 aaagaaaaaa tagaggcact aaaagagatc tgtgagaaaa tggaaagaga gggccagcta
2521 gaggaggcac ctccaactaa tccttataat acccccacat ttgcaatcaa gaaaaaggac
2581 aaaaacaaat ggagaatgct aatagatttt agagaactaa acaaggtaac tcaagacttc
2641 acagaaatcc agttaggaat tccacaccca gcaggactag ccaagaagaa acgaattact
2701 gtcctagatg taggggatgc ttacttttcc ataccactac atgaggattt tagacagtat
2761 actgcattta ctctaccatc aataaacaat gctgaaccag gaaaaagata catatataaa
2821 gtctcaccac agggatggaa gggatcacca gcaatttttc agtacacaat gaggcaggtc
2881 ttagaaccat tcagaaaagc aaacccggat atcattctca ttcagtacat ggatgatatc
2941 ttgatagcca gcgacaggac agatttagaa catgacagag tggttctgca gctaaaggaa
3001 cttctaaatg gcctgggatt ttccacccca gatgagaagt tccaaaaaga ccctccatac
3061 caatggatgg gctatgaact gtggccaact aaatggaagc tgcaaagaat acaattgccc
3121 caaaaggaag tatggacagt caatgacatc caaaaactgg tgggtgtcct aaattgggca
3181 gcacaaatct acccagggat aaagaccaga aacttatgta ggttaatcag aggaaaaatg
3241 acactcacag aagaggtaca gtggacagaa ttagcagaag cggaactaga agaaaacaaa
3301 atcatcttaa gccaggaaca agaaggatgc tattaccaag aggaaaagga gctagaagca
3361 acagtccaaa aagatcaaga caatcagtgg acatataaga tacaccaggg aggaaaaatt
3421 ctaaaagtag gaaaatatgc aaaggtaaaa aatacccaca ccaacggagt cagactccta
3481 gcacaagtag ttcaaaaaat aggaaaagaa gcactagtca tttggggacg aataccaaaa
3541 tttcacctac cagtagaaag agatacctgg gaacagtggt gggataacta ctggcaagtg
3601 acatggatcc cagactggga cttcatatct accccgccac tggtcagatt agtatttaac
3661 ctggtgaaag atcccatact aggcgcagaa accttctaca cagatggatc ctgcaataag
3721 caatcaagag aaggaaaagc aggatacata acagatagag gaagagacaa ggtgaggcta
3781 ttagagcaaa ccaccaatca gcaagcagaa ttagaagcct ttgcgatggc agtaacagac
3841 tcaggtccaa aggccaacat tatagtagac tcacaatatg taatgggaat agtagcaggc
3901 caaccaacag agtcagagag taaaatagta aatcaaatca tagaagaaat gataaaaaag
3961 gaagcaatct atgttgcatg ggtcccagcc cataaaggca taggaggaaa tcaggaggta
4021 gatcacttag taagtcaggg catcagacaa gtattattcc tagagaaaat agaacccgct
4081 caggaggaac atgaaaaata tcatagcaat gtaaaagaac tatcccataa atttggactg
4141 cccaaattag tggcaagaca aatagtaaac acatgcaccc aatgtcagca gaaaggggag
4201 gctatacatg ggcaagtaaa tgcagaatta ggcacttggc aaatggactg cacacactta
4261 gaaggaaaaa tcattatagt agcagtacat gttgcaagtg gatttataga agcagaagtc
4321 atcccacagg aatcaggaag gcaaacggca ctcttcctac taaaactggc cagtaggtgg
4381 ccaataacac atttgcacac agacaatggt gccaacttca cttcacagga agtaaagatg
4441 gtggcatggt ggataggtat agaacaatcc ttcggagtac cttacaatcc acaaagccaa
4501 ggagtagtgg aagcaatgaa tcaccaccta aaaaatcaga taagcagaat tagagagcag
4561 gcaaacacag tagaaacaat agtactaatg gcagttcatt gcatgaattt taaaaggagg
4621 ggaggaatag gggatatgac cccagcagaa agactaatca atatggtcac tgcagaacag
4681 gaaatacaat tcctccaagc aaaaaattca aaattacaaa attttcgggt ctatttcaga
4741 gaaggcagag atcagctgtg gaaaggacct ggggaactac tgtggaaggg ggacggagca
4801 gtcatagtca aggtaggggc tgacataaaa ataataccaa gaaggaaagc taagatcatc
4861 aaagactatg gaggaaggca agagatggat agcggttcca acttggaggg tgccagggag
4921 gatggagagg tggcatagcc ttatcaagta tctaaaatac agaacaggag atctagagaa
4981 ggtgtgctat gttccccacc ataaggtggg atgggcgtgg tggacttgca gcagggtaat
5041 attcccatta aaaggagaaa gtcatctgga gatacaggca tactggaacc taacaccaga
5101 aaaaggatgg ctctcctcct attcagtaag actaacttgg tatacagaaa aattctggac
5161 agatgttacc ccagactgtg cggactccct aatacatagc acttatttct cttgctttac
5221 ggcaggcgaa gtaagaagag ccatcagagg ggaaaagcta ttatcctgct gcaactaccc
5281 ccaagcccat aagtaccagg taccgtcact ccagtttctg gccttagtgg tagtgcaaca
5341 aaatggcagg ccccagagag acaataccac caggaaacag tggcgaagaa actatcggag
5401 aggccttcga gtggctagac aggacggtag aagccataaa cagagaggca gtgaaccacc
5461 tgccccgaga gcttattttc caggtgtggc aaaggtcctg gagatactgg catgatgaac
5521 aaggaatgtc aataagttac acaaagtata gatatttgtg cctaatgcag aaagctatgt
5581 tcatacattc taagagaggg tgcacttgcc tggggggagg acatgggccg ggaggatgga
5641 gatcaggacc tccccctcct ccccctccag gtctagtcta atgactgaag caccaacaga
5701 gtctcccccg gaggatagga ccccaccgag ggagccaggg gatgagtggg taatagaaac
5761 cctgagagag ataaaataag aagctttaaa gcactttgac cctcgcttgc taattactct
5821 tggcaactat atctatgcta gacatggaga cacccttgaa ggcgccagag ggctcattag
5881 gatcctacaa cgagccctcc tcttgcactt cagagcagga tgcggccgct caaggattgg
5941 tcagcccagg ggacgaaatc ctttatcagc tataccaacc cctagaggca tgcgataaca
6001 aatgttactg taaaaagtgc tgctaccatt gccagatgtg ttttttaaac aaggggctcg
6061 ggatatggta tgaacgaaag ggcagaagaa gaagaactcc gaagaaaact aaggctcatt
6121 cgtcttctgc atcagacaag tgagtaagat gtgtggtagg aatcaactat ttgttgccag
6181 cttgctagct agtgcttgct taatatattg cgtccaatat gtgactgttt tctatggcgt
6241 gcccgtgtgg agaaatgcat ccattcccct cttttgtgca actaaaaata gagatacttg
6301 gggaaccata cagtgcttgc cagacaatga tgactatcag gaaatagctt taaatgtgac
6361 agaggccttc gacgcatgga ataatacagt aacagaacaa gcagtagaag atgtctggag
6421 tctatttgag acatcaataa aaccatgcgt caaactaaca cccttatgtg tagcaatgcg
6481 ttgtaacagc acaactgcaa aaaacacaac ctccacacca acaaccacca caacagcaaa
6541 cacaacaata ggagagaatt cttcatgcat acgcacagac aactgcacag ggttgggaga
6601 agaagagatg gtcgactgtc agttcaatat gacaggatta gagagggata agaaaaaact
6661 atataatgaa acatggtact caaaagatgt agtctgtgaa tcaaatgaca ccaagaaaga
6721 gaaaacatgt tacatgaacc actgcaacac atcagtcatc acagagtcat gtgacaagca
6781 ctattgggat actatgaggt ttagatattg tgcaccaccg ggttttgccc tgctaagatg
6841 caatgatacc aattattcag gctttgagcc caattgttct aaggtagtag ctgctacatg
6901 tacaaggatg atggaaacgc aaacctccac ttggtttggc tttaatggca ctagggcaga
6961 aaatagaaca tatatctatt ggcatggtag ggataataga actatcatta gcttaaacaa
7021 gttttataat ctcaccgtac attgtaagag gccaggaaac aagacagttg taccaataac
7081 actcatgtca gggttagtgt ttcactccca gccaatcaat agaagaccca ggcaagcatg
7141 gtgctggttc aaaggcgagt ggaaggaagc catgaaggag gtgaagctaa cccttgcaaa
7201 acatcccagg tataaaggaa ccaacgacac agaaaaaatt cgttttatag cgccaggaga
7261 acgctcagac ccagaagtgg catacatgtg gactaactgc agaggagaat ttctctactg
7321 caatatgact tggttcctca attgggtaga aaacagaacg aatcagacac agcacaatta
7381 tgtgccatgc catataaagc aaataattaa tacctggcac aaggtaggga aaaatgtata
7441 tttgcctcct agggaaggac agttaacctg caactctaca gtgaccagca taattgctaa
7501 cattgacgga ggagagaacc agacaaatat tacctttagt gcagaggtgg cagaactata
7561 ccgattagaa ttgggggatt ataaattgat agaagtaaca ccaattggct ttgcacctac
7621 accagtaaaa agatactcct ctgctccagt gaggaataaa agaggtgtat tcgtgctagg
7681 gttcttaggt tttctcacga cagcaggagc tgcaatgggc gcggcgtcct tgacgctgtc
7741 ggctcagtct cggactttat tggccgggat agtgcagcaa cagcaacagc tgttggacgt
7801 ggtcaagaga caacaagaaa tgttgcgact gaccgtctgg ggaacaaaaa atctccaggc
7861 aagagtcact gctatcgaga aatacttaaa ggaccaggcg caactaaatt catggggatg
7921 tgcgtttaga caagtctgcc acactactgt accatgggta aatgacacct taacgcctga
7981 ttggaacaac atgacatggc aggaatggga gcaacgaatc cgcaacctag aggcaaatat
8041 cagtgaaagt ttagaacagg cacaaatcca gcaagaaaag aacatgtatg aactacaaaa
8101 attaaatagc tgggatgttt ttggcaactg gtttgattta acctcctgga tcaaatatat
8161 tcagtatgga gtttatatag tagtaggaat aatagtttta agaatagtaa tatatgtagt
8221 acaaatgtta agtagactta gaaagggcta taggcctgtt ttctcttccc cccccgctta
8281 cttccaacag atccatatcc acaaggaccg ggaacagcca gccagagaag aaacagaaga
8341 agacgttgga aacagcgttg gagacaattg gtggccctgg ccgataagat atatacattt
8401 cctgatccgc cagctgattc gcctcttgaa cagactatac aacatctgca gggacttact
8461 atccaggagc ttccagaccc tccaactaat ctcccagagt cttcggagag cattgacagc
8521 agtcagagac tggctgagat ttaacacagc ctacctgcaa tatgggggcg agtggatcca
8581 agaagcgttc cgagccttcg cgagggctac gggagagact cttacaaacg cctggagagg
8641 cttctggggg acactgggac aaattgggag gggaatactt gcagtcccaa gaaggatcag
8701 gcagggggca gaaatcgccc tcctgtgagg gacggcggta tcaacaggga gattttatga
8761 ataccccatg gagagcccca gcagaagggg agaaaggctc gtacaagcaa caaaatatgg
8821 atgatgtaga ttcagatgat gatgacctag taggggtccc tgtcacacca agagtaccat
8881 taagagaaat gacatatagg ttggcaagag atatgtcaca tttgataaaa gaaaaggggg
8941 gactggaagg gctgtattac agtgatagga gacgtagagt cctagacata tacttagaaa
9001 aggaagaggg aataattgga gactggcaga actatactca tggaccagga gtaaggtatc
9061 caaagttctt tgggtggtta tggaagctag taccagtaga tgtcccacaa gagggagatg
9121 acagtgagac tcactgctta gtgcatccag cacaaacaag caggtttgat gacccgcatg
9181 gagaaacatt agtttggagg tttgacccca cgctagcttt tagctacgag gcctttattc
9241 gatacccaga ggagtttggg tacaagtcag gcctgccaga ggatgaatgg aaggcaagac
9301 tgaaagcaag agggataccg tttagctaaa aacaggaaca gctatacttg gtcagggcag
9361 gaagtaacta acagaaaaca gctgagactg cagggacttt ccagaagggg ctgttaccag
9421 gggagggaca tgggaggagc cggtggggaa cgccctcata ctttctgtat aaatgtaccc
9481 gctactcgca ttgtattcag tcgctctgcg gagaggctgg cagattgagc cctgggaggt
9541 tctctccagc actagcaggt agagcctggg tgttccctgc tagactctca ccagtgcttg
9601 gccggcactg ggcagacggc tccacgcttg cttgcttaaa agacctctta ataaagctgc
9661 cagttagaag ca
//
LOCUS HIVBOS1 253 bp ss-RNA VRL 04-AUG-1990
DEFINITION Human immunodeficiency virus type 1, envelope region.
ACCESSION M27910
KEYWORDS .
SOURCE Human immunodeficiency virus type 1 (isolate 1153), clone 1153-1.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 253)
AUTHORS Ou,C.-Y.et.al.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT [1] kindly submitted in computer-readable form by C.-Y. Ou, CDC,
Atlanta. The sequence below, designated 1153-1, appears to be a
major form of 11 sibling sequences taken from a Boston isolate.
BASE COUNT 98 a 46 c 54 g 55 t
ORIGIN Approximately 1170 nt from env start (BRU coordinates).
1 ttcaacacaa ctgtttaata gtacttggac ttggaatggt actgaaaggt caaataacac
61 tgaaggaaat atcacactcc catgcagaat aaaacaaatt ataaacatgt ggcaggaagt
121 aggaaaagca atgtatgccc ctcccatcag aggacaaatt agatgttcat caaatattac
181 agggctacta ttaacaagag atggtggtaa tgagaccaac gagaccgaga ccttcagacc
241 tggaggagga aat
//
LOCUS HIVJFL 2553 bp ss-RNA VRL 04-AUG-1990
DEFINITION Human immunodeficiency virus type 1 proviral env gene, 5' end.
ACCESSION M31451
KEYWORDS envelope protein.
SOURCE Human immunodeficiency virus type 1 (isolate JFL) proviral DNA,
clone I-BL2.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 2553)
AUTHORS McNearney,T., Westervelt,P., Thielan,B., Trowbridge,D.B.,
Garcia,J., Whittier,R. and Ratner,L.
TITLE Limited sequence heterogeneity among biologically distinct human
immunodeficiency virus type 1 isolates from individuals involved in
a clustered infectious outbreak
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 1917-21 (1990)
STANDARD full staff_review
COMMENT This sequence was kindly provided prior to publication by Dr. Lee
Ratner, Washington University School of Medicine, St. Louis. Clone
I-BL2 and twelve other clones sequenced from seven isolates, taken
from one donor and two TA-AIDS recipients, all have pre-mature stop
codons in env. The isolate itself is described as monocytropic.
FEATURES from to/span description
pept 1 1851 envelope polyprotein
1855 1992 envelope polyprotein
1996 > 2553 envelope polyprotein
site 1852 1854 envelope polyprotein in-frame stop codon
site 1993 1995 envelope polyprotein in-frame stop codon
BASE COUNT 890 a 435 c 603 g 625 t
ORIGIN Env start codon.
1 atgagagtga aggagatcag gaagaattgt cagcacttat ggagatgggg caccatgctc
61 cttggaatgt tgatgatctg tagtgctgca ggaaaattgt gggtcacagt ctattatggg
121 gtacctgtat ggaaagaagc aaccaccact ctattttgtg catcagacgc taaagcatat
181 gacacagagg tacataatgt ttgggccaca catgcctgtg tacccacaga ccctaaccca
241 caagaagtaa aattggaaaa tgtgacagaa aattttaaca tgtggaaaaa tgacatggta
301 gaacagatgc atgaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaaa
361 ttaaccccac tctgtgttac tttaaattgc attaatttga agaatgttac taatatcaat
421 aatagttggg gaaagatgga ggaaggagaa ataaaaaact gctctttcaa taccaccaca
481 agcataaaaa ataagatgca gagagaatat gcactttttt ataaacttga tgtagtacca
541 atagataagg ataatacaag ctatacattg ataaattgta acacctcaac cattacacag
601 gcctgtccaa aggtatcctt tgaaccaatt ccaatacatt attgtacccc ggctggtttt
661 gcgattctaa agtgtaatga taagaagttc aatggaaaag gaccatgtaa aaatgtcagc
721 acagtacaat gtacacatgg aattaggcca gtagtgtcaa ctcacttgct gttaaatggc
781 agtctagcag aaggagaggt agtaattaga tctgaaaatt tcatggacaa tgctaaaacc
841 ataatagtac agctgaataa atctatagca attaattgta caagacctaa caacaataca
901 agaaaaagta tcactctagg accagggagg gcattttata caacaggaga cataatagga
961 gacataagac aagcacattg taaccttagt agagtacaat ggaataacac tttaaaacag
1021 atagttataa agttaagaga acagtttggg aataaaacaa tagtctttaa gcaatcctca
1081 ggaggggacc cagaaattgt aatgcacact tttaattgtg gaggggaatt tttctactgt
1141 aattcaacac agctgtttaa tagtacttgg ctgtccaatg gtacttggaa tgatactgaa
1201 gagtcaaaag acactagaga acctatcata ctcccatgca gaataaaaca aattataaac
1261 aggtggcagg aagtaggaaa agcaatgtat gcccctccca tcaacggaca aattagatgt
1321 tcatcaaata ttacagggct actattaaca agagatggtg gtaaagagaa tggggccgag
1381 atcttcagac ctggaggagg aaatatgagg gacaattgga gaagtaaatt atataaatat
1441 aaagtagtaa aaattgagcc attaggagta gcacccacca aggcaaagag aagagtggtg
1501 cagagaaaaa aaagagcagt gggaacgata ggagctatgt tccttgggtt cttgggagca
1561 gcaggaagca ctatgggcgc agcgtcgatg acgctgacgg tacaggccag actattattg
1621 tctggtatag tgcaacagca aaacaatttg ctgagggcta ttgaggcgca acagcatctg
1681 ttgcaactca cagtctgggg catcaagcag ctccaggcaa gagtcctggc tgtggaaaga
1741 tacctacagg atcaacagct cctggggatt tggggttgct ctggaaaact catttgcacc
1801 actactgtgc cttggaatgc tagttggagt aataaatctc tggatgagat ttaaaataac
1861 atgacctgga tgcagtggga gagagaaatt agcaattaca caagcctaat atacacctta
1921 attgaagaat cgcagaacca acaagaaaag aatgaactag aattattgga attggataag
1981 tgggcaagtt tgtgaaattg gtttgacata acaaactggc tgtggtctat aaaaatattc
2041 ataatgatag tagcaggttt agtaggttta agaatagttt ttgctgtgct ttctatagtg
2101 aatagagtta ggcagggata ctcaccattg tcattccaga cccgcctccc aaccccgagg
2161 ggacccgaca ggcccggagg aatcgaagaa gaaggtggag agagagacag agacagatcc
2221 ggttacttag tgactggatt cttagcactt atctgggaag acctgtggag cctgtgcctc
2281 ttcagctacc accgcttgag agacttactc ttgattgtaa cgaggattgt ggaactcctg
2341 ggacgcaggg ggtgggaagc cctcaaatat tggtggaatc tcctacagta ttggagtcag
2401 gaactaaaga atagtgctgt tagtttgctt aacgctacag ctatagcagt agctgagggg
2461 acagatagga ttatagaagt agtacaaaga attggtagag ctatcctcca catacctgta
2521 agaataagac agggcttaga aagggctttg cta
//
LOCUS RATPKATA1 4644 bp ds-DNA ROD 04-AUG-1990
DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene,
exons 1-5.
ACCESSION D90055 J05269
KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
peroxisomal protein.
SEGMENT 1 of 4
SOURCE Rat (Sprague-Dawley) liver DNA, clone PT147.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4644)
AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of twO
closely related but differentially regulated genes
JOURNAL J. Biol. Chem. 265, 4600-4606 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Makoto
Hijikata Virology Division, National Cancer Center Research
Institute Tsukiji 5-1-1 Chuo-ku, Tokyo 104 Japan Phone:
(03)542-2511 Fax: (03)545-3567 The sequence 1487-1594 encodes
presumptive leader peptide.
FEATURES from to/span description
pept 1487 1687 3-ketoacyl-CoA thiolase, exon 1
1827 1920 3-ketoacyl-CoA thiolase, exon 2
2800 2857 3-ketoacyl-CoA thiolase, exon 3
4064 4143 3-ketoacyl-CoA thiolase, exon 4
4381 + 4423 3-ketoacyl-CoA thiolase, exon 5
pre-msg 1373 > 4644 3-ketoacyl-CoA thiolase mRNA and introns
IVS 1688 1826 3-ketoacyl-CoA thiolase, intron A
IVS 1921 2799 3-ketoacyl-CoA thiolase, intron B
IVS 2858 4063 3-ketoacyl-CoA thiolase, intron C
IVS 4144 4380 3-ketoacyl-CoA thiolase, intron D
signal 1223 1228 GC box
signal 1263 1268 GC box
signal 1295 1300 GC box
signal 1346 1351 GC box
signal 1202 1206 CAAT box
signal 1217 1221 CAAT box
signal 1327 1331 CAAT box
signal 1238 1244 enhancer core
signal 1464 1470 enhancer core
BASE COUNT 1013 a 1303 c 1238 g 1090 t
ORIGIN
1 ggatcccacg cggggcctcc cgcagacatg gcaagcaacc ctgggccccg gttctgtacg
61 gtgcgctcgc ttgttgagct tcctacttcc cttaaggctc ctgccaaccc ctcctccttt
121 cgctttccga gaagctccgc cccttgctag aatctggaat ccggacccac aacacgcagc
181 ctgaaattca acccacttga gttctcacct ccccttgaag gtgtgagagc ttatagaaac
241 cgcctctgac gtccgccttg gcccaccgac ctgtcctggg cgatctggta gtggtggcta
301 tgcgtggtga agtgattcag ccttgtttag gacttgacag gagttacagc tcttccgcac
361 agtaatttag attctggtaa tagggaggga agagacggac tggcttcgta agcagtaacc
421 tgaaggctct gtctacctta gccatctggg acaagtggtg accgtgcttt ttaaagggaa
481 agtccacgac cttgcccgag aatagacatt tagggcatcc tgtcctttgt tatcttagac
541 agacaacact accttccgag ggtctgcaaa aagcacacgg acctatcact ttactcttta
601 acgttgggct tagaccaagg aagagtgttt cgttatttcg ctcggttaat ttacttaggt
661 gtcaatcacc agtactagat ctcccccttc tcctcccccg ggcttagtag aattccttgg
721 ggaaagtggg cattaatggc gttatcccaa acgaacgtag aacaggctta agacatgtgg
781 ggcagtgcca cagttagaac ctgcctgcca ttcagcacct tcacaccagc cggacaccca
841 aaattaattt gttttctgag agagcttctg tgacagccca atagggttgg ggttgaacta
901 agtacctcaa cattatctag gtcctcggtt ctgggttttg agacccgggg ccgagcacag
961 caaattttga ggagaaacaa gtgtcttccc tggctcttcc agcaaaggga gaagccctga
1021 gtcgagctcc gccttccacg ccgctgcgcg gtgacctcag aaggagggac tccctcgaga
1081 ctgagttcct cttaggaaac cagaagtttg gggcgcctcc tttccgtgag tctaccaaag
1141 gttgtgcaaa cactagtttc cagtctgcct ttctagggga ggattaagag tacgccttat
1201 gattggcttc cgtaagattg gcgggcgggg ctttgcgtgg aaagagtctt aggacgggtc
1261 ttgggcgggg ctctggggca aggggtgagt gggtgggcgg ccctcaagca gacacccctt
1321 aatctaattg gttgtgtcta agagtgggcg ggccctctaa agaccccatc gcaccactct
1381 cagcattatc ctacccgagc tctgatccac tttcaggcct cgtgaggtag agggctggcc
1441 tgcgcctgcg cctgccatca ttttggtttg ttaagcaagg cagagcatga gcgagtcggt
1501 gggacgcacc tccgcgatgc atcggctgca ggtagtgctg ggccacctgg ccggccgacc
1561 cgagtcgagc tccgcgctgc aagccgcgcc ctgctccgct accttcccgc aggcttcggc
1621 ctccgacgtg gtggtggtgc acggacggcg cacccccatc ggccgcgccg gccgcggcgg
1681 cttcaaggtg aggcccctcg ggcgggcgag aggagcagaa ggcggccaga gtgggtggga
1741 tcctgccctc gggggtccgg tgagcgcaga gacggcgggt ggggatgaag gagttgggct
1801 caccgctgct gcacctcgcc ctgcaggaca ccacccccga cgagcttctg tcggccgtgt
1861 tgaccgcggt tctccaggac gtgaagctaa agcctgagtg tttgggagac atctctgtgg
1921 gtgagctccc ctgagggtcc tgcaccaccc cacccagcct ctctttctta ccagaacctg
1981 ggtttcactg agcaagacct acggattcca ccgatttcag tgggatggtc aaagggccct
2041 tttacagggt tcgcctaaga tcattgggaa acacagacat ctacattatg gttcataaca
2101 gtagcagaat tacagttctg aagtagcaat gaaaaataat tttgtggttg ggggtcacca
2161 cagcatgagg aactgtatta aaagggtcac agcactaaga tagttgagaa ccattgatct
2221 aaagcccaga agttcctgga gacttccagg aggtgagaaa acgcagttct ctacctacat
2281 gaacatcctg acccgggtac gggctgcaga tgggttcacc ccaccctgcg ctatccttga
2341 gttacaacag agtataggac atggttttgt ccccaggtgc cagcctgctc ctcgtgcctg
2401 caccatcgca gggtttggag gaggatactt ggggaggtga acagttgcag gatctgacct
2461 ctttcaccca cagcccgtct tctcacagcg ttcaggctgt gcactgtgtc tcttacccta
2521 gtgtctctcc tactaccgag caagcacccc actccccacc ccaaccgtga tttagaagag
2581 cataccttgg ccccttcagc actgggctgg tagatgactg gctgttgggt agcttggctc
2641 cccattttcc agactccaga cttctatagc ttcacacgag cctgggaaaa tcctaaagtg
2701 atccgttaga aagatctgga atgtttcctc tctggaactc acagctgaac ctcagtgtgc
2761 agaacactac ttggtcctga ataaggtttc tatctacagg taacgtactt gagccaggag
2821 ccggagcagt catggcgcgc attgcccaat ttctgaggta acccttctac ctctggtggg
2881 ggattctacc accatgaacc agtccgtagg tgacaggtta tcaggttgga cagtagatag
2941 caagtcagcc agaatccttg gacccaacag tgctcctact gggttaacca acccagaagc
3001 aggataccca ggctttggga ccagagccaa ggtgaccaga tgcttcccag ggccgaactt
3061 ttgttaccct tggaagcatg gactctgacc tcatatgtca gataaataat cacgtgacca
3121 cttgatcagg atctttactt catttgctac ttgagctgcc tgagaaccct gggctcagtc
3181 agttatcaga agcttgacgg tagtggggtg ggggaagcag ggtctggaca tcttggccct
3241 tcaggtaggg aagcagtagc tcatttgcag gattaagagc agcagtagcc agagtccaaa
3301 ccggggaaaa ccttttgtct agagacacgg aaagggtcag acttagcctt cctgtccctc
3361 ttgtaacttt cctgttcatc tctcacctgc ttcttcattt tgactttccc ttctctgctg
3421 tactggccac cctctcccta tacactcgct ctgcctattc tgtcccaata gagagctctc
3481 tccctgctca cctagctctg gccagtccct gcccagacac ctcggaagct ctcaagcatc
3541 agatgagtga ccttcttgtc caccaagtca ctatgtgcct ccctgtcgct aagacacttg
3601 caatggaggg tctggaaaca catgcctgct accttgagct tctaagctca ggcaggggga
3661 gctcgtttgg tccacccaca ggcactcttc ctctccttca gaagttttga caaaggaaag
3721 gtagacatct ttgcccacaa agagactagc caccagccta agagctaggt ctcatgggca
3781 aggcttccaa ccatctcctc agttgcaagt ggctgtagcg acagggtttg cctagtagtt
3841 gacccctctg catctatcta ccctagtggg agcaaggcca gctgttacct cctggtctgt
3901 acactgctgt gggccataat gggaccctaa agactcgtgg gtcctgcttt ccatgacaca
3961 gggtcgagtc cggggctgac cacacatcca aggatgacta accaccatct cccccaggga
4021 tggctgcaca tccaggctga ccgccatctc ccctctgttt cagtggcatc ccagagaccg
4081 tgcctctgtc agcagtcaac agacagtgtt catcgggact gcaggcagtg gccaacattg
4141 ctggtaagta gtgattcctg ggtgcccatg agtggaccct gaccttggtc cttcctcagg
4201 accctaggca ggaagagtgg aggggcaccc attgccacac cccaagcctt tttgggtgct
4261 cccgggtctt ccatgaggct ccgttgaatg cacccctgag aacagggagg gaacgggctg
4321 tgctgctccc agggcgggcc cgctccggcc aaggataatt atggagtgct ttcgttgcag
4381 gtggcatcag aaatgggtct tacgacattg gcatggcctg tgggtaagat ggccttccgc
4441 cgggccttat taaccaacag caataatcaa actaatttct ctctcagttc ggaggggccg
4501 tgattccagg cttctccccg cccacgaaga cagctgaata gggtgggctt ccagaatgtg
4561 tctaggagtg gcccaaccct ggctgctcag gaatgggcca atcttagcac aagcagtaac
4621 cggaattggg tcctctaagg tacc
//
LOCUS RATPKATA2 1205 bp ds-DNA ROD 04-AUG-1990
DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene,
exon 6.
ACCESSION D90056 J05269
KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
peroxisomal protein.
SEGMENT 2 of 4
SOURCE Rat (Sprague-Dawley) liver DNA, clone PT147.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1205)
AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
closely related but differentially regulated genes
JOURNAL J. Biol. Chem. 265, 4600-4606 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Makoto
Hijikata Virology Division, National Cancer Center Research
Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
(03)542-2511 Fax: (03)545-3567
FEATURES from to/span description
pept + 283 + 381 3-ketoacyl-CoA thiolase, exon 6
pre-msg < 1 > 1205 3-ketoacyl-CoA thiolase mRNA and introns
IVS < 1 282 3-ketoacyl-CoA thiolase intron D
IVS 382 > 1205 3-ketoacyl-CoA thiolase intron E
BASE COUNT 241 a 351 c 302 g 311 t
ORIGIN
1 ggtaccagta ggctgtgcta gggactgcct gtttcctgca tctgccctgg gagtgtgtct
61 ggtagagtgc cgggtcctaa gcttgaggga aagaaggctc ggctaccttg taacctccag
121 ctgcccttct caggcaggct tcctagagag cttacctcac tccccaaggg ctccctgggc
181 ctttattccc tagcacttgt cacacagatg gctgtcccta gggcttccta tgaaaactga
241 gagctcattc cttcttcttc tccccctctt cctcctcctc agggtggagt ccatgtccct
301 gtctaacaga gggaaccctg ggaatatttc ctcccgcctg ctggagagtg acaaagccag
361 agactgcctg attcctatgg ggtgagtgtt catgggaagc tgtgtatgta tccactcacc
421 gttctgcctg tccttcacca gacagcagca tactcttctc tgactggcag tactcagctt
481 cctcttcttc ctctgactgt ctcatgctcg acgggaactg ggtatgagat caatgctgcc
541 cttgacctcg ctttcctgag gtccccccta tacaacacac acacacacac acacacacac
601 acacacacac acactcgttc ctctccaact ctacaaatgt ccaaatgcac ctcactctac
661 attactgtgc ctgaggtgcc tgtgtcttag ggaaggcagt gggagtcctc aaagccttgg
721 gaggcctcaa tagttgaaga cacggggagc ttggcttttg ttattgcggg ggagggctcg
781 ggctctgaga taccttgtaa gaagtctgtt ccatctctcc agtgctttat aattctgcat
841 ccgtgtctct gagctctcct cagtttgtcc aagggcaagg cattctggga ggctcaccat
901 aggagaacca taggctggcc tggctttctc cacatcctgt gtgtttccaa gagcagccta
961 gaagccgcag agaaactggg gccagatagt gtccccagca tcctcctgcc tatgtcttat
1021 gggcagagac gactcacccg gggggaacag ttccttctgt aggccccacg ttcaggctgc
1081 tcttcattct ttgaggtgtg agtctttgag catccgtgga catgggcgct cagtatttaa
1141 gatgagccag gagtccctgg ggcgtggact gaagctctgt ctctgacgtc gacgccagag
1201 gtcca
//
LOCUS RATPKATA3 470 bp ds-DNA ROD 04-AUG-1990
DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene,
exon 7.
ACCESSION D90057 J05269
KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
peroxisomal protein.
SEGMENT 3 of 4
SOURCE Rat (Sprague-Dawley) liver DNA, clone PT147.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 470)
AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
closely related but differentially regulated genes
JOURNAL J. Biol. Chem. 265, 4600-4606 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Makoto
Hijikata Virology Division, National Cancer Center Research
Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
(03)542-2511 Fax: (03)545-3567
FEATURES from to/span description
pept + 80 + 160 3-ketoacyl-CoA thiolase, exon 7
pre-msg < 1 > 470 3-ketoacyl-CoA thiolase mRNA and introns
IVS < 1 79 3-ketoacyl-CoA thiolase intron E
IVS 161 > 470 3-ketoacyl-CoA thiolase intron F
BASE COUNT 120 a 91 c 154 g 105 t
ORIGIN
1 cgcaagccta gtccttcatc atgtccctgg ccctcaggca ggcagaccca cggcctctgc
61 tttcctgtgt gtgcactagg ataacctcgg agaatgtggc tgagcggttt ggcatctcac
121 ggcagaagca agatgccttc gcgctggcct ctcagcagaa gtgagtgcag agggatgggc
181 tgagggtggg gctggagagt ggatcgggta gagttttgcc aagaaacaaa ctgtaggttg
241 atgtccttat gcaggtgaag gcaggtacag gatagatcga agcctgtcgt tagatgagaa
301 ggaaggatga gcaggagaaa agttttaggg aggaggagac cggagagagg aaagagcagc
361 caggagaact tggaggcaga tgttaacatt tctctctgca catttacagg tcgttatgac
421 tgttcttaag ggatggatac agggctttgt atgtctcggt gaacaataac
//
LOCUS RATPKATA4 2264 bp ds-DNA ROD 04-AUG-1990
DEFINITION Rat peroxisomal 3-ketocayl-CoA thiolase (EC 2.3.1.16) A gene,
exons 8-12.
ACCESSION D90058 J05269
KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
peroxisomal protein.
SEGMENT 4 of 4
SOURCE Rat (Sparague-Dawley) liver DNA, clone PT147.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2264)
AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
closely related but differentially regulated genes
JOURNAL J. Biol. Chem. 265, 4600-4606 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Makoto
Hijikata Virology Division, National Cancer Center Research
Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
(03)542-2511 Fax: (03)545-3567
FEATURES from to/span description
pept + 301 491 3-ketocayl-CoA thiolase, exon 8
633 812 3-ketocayl-CoA thiolase, exon 9
1089 1144 3-ketocayl-CoA thiolase, exon 10
1230 1375 3-ketocayl-CoA thiolase, exon 11
1696 1771 3-ketocayl-CoA thiolase, exon 12
pre-msg < 1 2030 3-ketocayl-CoA thiolase mRNA and introns
IVS < 1 300 3-ketoacyl-CoA thiolase intron F
IVS 492 632 3-ketocayl-CoA thiolase intron G
IVS 813 1088 3-ketocayl-CoA thiolase intron H
IVS 1145 1229 3-ketocayl-CoA thiolase intron I
IVS 1376 1695 3-ketocayl-CoA thiolase intron J
signal 2012 2017 polyadenylation signal
BASE COUNT 446 a 615 c 703 g 500 t
ORIGIN
1 gagtctgctc agtgaaaccc aggttctggt aagaaagaga ggctgggtgg ggtggtgcag
61 gaccctcagg ggagctcacc cacagagatg tctcccaaac actcaggctt tagcttcacg
121 tcctggagaa ccgcggtcaa cacggccgac agaagctcgt cgggggtggt gtcctgcagg
181 gcgaggtgca gcagcggtga gcccaactcc ttcatcccca cccgccgtct ctgcgctcac
241 cggacccccg agggcaggat cccaaacatg cctatgtctt actgcccacg gtgctggcag
301 ggcagcaagt gcccagagca aaggctgctt ccgtgctgag atcgtacctg tgacaaccac
361 tgtcctcgat gacaagggtg acaggaaaac catcaccgtg tctcaggatg agggtgtccg
421 ccccagcacc accatggagg gcctggccaa gctgaagcct gccttcaagg atggaggctc
481 taccacggct ggtgagcgtg gccaggagta gggccaatct cagtggcttg ttctactctg
541 agacctggag atgaccaggc cgcctgggaa ggtgatggtg gggggctggc tctcctcagg
601 cctgctgacg tctcctggtc caccttctcc aggaaactcc agtcaggtga gtgatggagc
661 agccgccgtc ctgctggccc ggaggtccaa ggctgaagaa ctgggcctcc ccatccttgg
721 cgtcctgagg tcctatgcag tggtcggggt ccctcctgac atcatgggca tcggacctgc
781 ctatgccatc cctgcggcct tgcagaaagc aggtgaggtg gctccttctc accctgtgct
841 tggatccttc attacctgga tctgggactg gggagtgctg gggtttgacc ttctgggtgc
901 tgtggatgga gcgggctagc accctacacc caagaccttc ctggcactac agctggctgg
961 gtctcactct cttgtcccta gaagacaaac atcagactcc tgggatctag gcttgggata
1021 gagtcctggg ggagtccaaa gtcctgccca gtggttgttg tcaattctgg cctatgcctc
1081 acccacaggg ctgactgtga atgacataga catctttgag atcaatgagg cctttgcaag
1141 tcaggtgagc ctgggtgttg tggtaggatt gactgagcca gcagttcaga gcagatggag
1201 actttagctc ctgcgcttgc tccttccagg ccctctactg tgtggagaag ctgggaattc
1261 ctgcagagaa ggtgaacccc ctggggggtg caatagccct gggccacccc ctgggctgca
1321 ccggagcaag gcaggtggtc acgctgctca atgagctgaa gcgccgaggc agacggtaag
1381 gctgctcctc gtggggtgca gtgggggtta ttagagctgg ggtgtctgat agctggggcg
1441 tggggaggtg agcacacacg caggggttct gaacttggga atggaggggt gagagcctgt
1501 cttccagaag ccttgcttca aaaacagcat aaccatgcat gccacctggg tgtgtgtctg
1561 tccggggctc ggttcccaca gatctgctgt gctggcttgg actctgccca tgaacccttg
1621 tgactgcagc ctgcctgatg ggagcatggg ttggggtttg gctgactcga gccaacactg
1681 gttctttttc cccagggctt atggcgtggt gtccatgtgc attgggactg ggatgggagc
1741 cgctgctgtc tttgaatacc ctgggaactg aggccctgac tgcaggcact acccagagag
1801 tcctatagta gtgtctggag agggatggta caggagccat cttcgtggga cactcagcag
1861 tggagggatt tgtcacagca cttcaattca gaagatgtag tcgatgttgg aacaggaggt
1921 ggaactgccc tgtcaagtac cccaagccat gctaaagtga gcatgggaca cccaggttgc
1981 aaagccatct gtacctctga cggatgggtg tagtaaatgt gtgctgtctc attttggtgg
2041 ctgtcatatc cttcgcaggc agctttcgcc tcctggattc tgtgtataaa tgattaagaa
2101 ccagggtgtc tccggtgaga tctagctgta tgcaggctct ggggacctgt catgaaagac
2161 ctcagctgac accattgccc tagaagaggc tcccccatcc caggagtcat ctagccccta
2221 gccacccccc cccccgctct ccagctgtgg gctatgcaga attc
//
LOCUS RATPKATB1 2234 bp ds-DNA ROD 04-AUG-1990
DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
exons 1-3.
ACCESSION D90059 J05269
KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
peroxisomal protein.
SEGMENT 1 of 5
SOURCE Rat (Sprague-Dawley) liver DNA, clone PTB1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2234)
AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
closely related but differentially regulated genes
JOURNAL J. Biol. Chem. 265, 4600-4606 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Makoto
Hijikata Virology Division, National Cancer Center Research
Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
(03)542-2511 Fax: (03)545-3567 The sequence, 1251-1328 encodes
leader peptide.
FEATURES from to/span description
pept 1251 1421 peroxisomal 3-ketoacyl-CoA thiolase, exon 1
1562 1655 peroxisomal 3-ketoacyl-CoA thiolase, exon 2
1985 + 2042 peroxisomal 3-ketoacyl-CoA thiolase, exon 3
pre-msg 1223 > 2234 peroxisomal 3-ketoacyl-CoA thiolase mRNA and
introns
IVS 1422 1561 peroxisomal 3-ketoacyl-CoA thiolase intron A
IVS 1656 1984 peroxisomal 3-ketoacyl-CoA thiolase intron B
signal 848 854 enhancer core
signal 1037 1045 enhancer core
signal 1114 1118 CAAT box
signal 1163 1167 CAAT box
signal 1192 1198 TATA box homolog
site 1099 1108 NFK-B site
BASE COUNT 460 a 614 c 647 g 513 t
ORIGIN
1 agtgctctta accactgagc tatctctcca gccctccact ttattttctg aggcggtatc
61 tctaaacacg gtgatttgac tagggtaggg agccactctg cccctcggat cctctgcctt
121 ccccacctaa gcactaagga agcctgccta gcatcgatgt gtgcagtggg gatctgatct
181 ctggtctccc acttatgggg tcgagcactt taccctctga gccctctcgc actcccccct
241 cctttcgagt tttgggctct cctggggact gagaaaaccc tgggtgattg tcgtggtttg
301 gggctgacct gtcactgtga ttggggaacc cagccttact ctttaaggtt ggtaaaacat
361 tctagatcca gggagcccca attcatccac tcagcaggaa accattgacc agctaatgta
421 tcaggcacac tgcagacacc tctgctcagg ggggaggaga gtgggcgctg caggggtgtc
481 catagatgag agcatttcaa atgggcatcc agaggtgtca agacccgcag gacactctca
541 gagacctttg aaccacttca gtgctttaat aaaaccggtt cttccagcct gttagctgcc
601 aaagattagg cctgagaatc tcaattactc aagtatccag cttggggcca gggacccagg
661 agagacaaga ggcaaatgga acagaaaaca gagtcgggac ctctatctcc attcatagtg
721 taaagctgtt accatggctc cagaacctcc tgccttttgt gtcatagtgt gcgatggtca
781 tatttgaggg agggtatcct ggttcatgtt cagaaaccca gaagacttcg aggtcctacc
841 tacagtgtgg attgcaaaat gctagaaacg acaggaaagt tccagacact ggaactggga
901 ggggtttcaa aagtgcgcac aaactttttt ttaaattata tttgaaaaaa aaatcttatt
961 ttgtagccca gactggcctt gaacttgcgc agaaaccact tgcaccatca caggtgcaga
1021 cggatgagct taaggagtgg ttttgaagtt tatttgacca gcattgagaa gctgcgaggt
1081 ggtggtggga tggatggtgg gactttcctt ctgattggct gtagaatggg taggcgggac
1141 ttatgaaaaa tgcaaggctc ccattggctc tgccctgtgc cagccggtgt cttttattct
1201 ctggggtcag cacccagagc ccaggctaat cggtgggacg cacctccgcg atgcatcggc
1261 tgcaggtagt gctgggccac ctggccggcc ggtccgagtc gagctccgcg ctgcaagccg
1321 cgccctgctc tgcgggcttc ccgcaggcct cggcctccga cgtggtggtg gtgcacggac
1381 ggcgcacccc catcggccgc gccggccgcg gcggcttcaa ggtgaggccc ctcgggcggg
1441 cgaggggagc agaaggcggc catagtgggt gggatcctgg ccctcggggg tccggtgacc
1501 gcagagacgc cgggtgggaa tgaaggagct gggctcaccg ctgttgcgcc tcgccctgca
1561 ggacaccacc cccgacgagc ttctgtcggc cgtgttgacc gcggttctcc aggatgtgaa
1621 gctaaagcct gagtgtttgg gagacatctc tgtgggtgag ctcccctgag ggtcctgcac
1681 cggctcttcc ccgccctcct ccttcccggg atccgtctct cacctcacag agcagagtta
1741 ctgaagggca ggagtctgga cctctgaggt ctgaaccagg ctgcccccga tggccagtct
1801 caagcctaga accttgggcg ccccctcgtg gcacatctgc caactgagag gcgctggaaa
1861 gcgggaacct ctctcgaaag gctgtgattc ttggagagat ctggactgtc tcctgtctgg
1921 ggcatgtttt acttcaggcc tcgggacacc acctggcact gagtcaggac tcttctgtcc
1981 acaggcaatg tgcttcagcc cggggccgga gccgcaatgg cacgcatcgc ccagttcctg
2041 aggtaatctt tctggatccg tcttagttgt ccaccatgac tgccgaaggc ccttggcact
2101 gtgtgtgtca cggtggaaga tcaggttgaa tgtagagact gaggcaactg gagtccactg
2161 acctgacctc tgctcgataa ccttttccct acttagaggg agtgtgttgc tggcccatgg
2221 ccaaggcgcc cggg
//
LOCUS RATPKATB2 913 bp ds-DNA ROD 04-AUG-1990
DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
exon 4 and 5.
ACCESSION D90060 J05269
KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
peroxisomal protein.
SEGMENT 2 of 5
SOURCE Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 913)
AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
closely related but differentially regulated genes
JOURNAL J. Biol. Chem. 265, 4600-4606 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Makoto
Hijikata Virology Division, National Cancer Center Research
Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
(03)542-2511 Fax: (03)545-3567
FEATURES from to/span description
pept + 353 432 peroxisomal 3-ketoacyl-CoA thiolase, exon 4
673 + 715 peroxisomal 3-ketoacyl-CoA thiolase, exon 5
pre-msg < 1 > 913 peroxisomal 3-ketoacyl-CoA thiolase mRNA and
introns
IVS 433 672 peroxisomal 3-ketoacyl-CoA thiolase intron D
BASE COUNT 173 a 234 c 275 g 231 t
ORIGIN
1 tgtctcagat gagcacaaga aatgctgttc catctcagag tcttcaagat agaggaaaga
61 ggttgtctgg gcccacggca ggcctccctg gtccctcttg agaactatgg gacagatgag
121 aatgtgcttg ggggctcttg tgtgtagcta cggagacaga tctgcacagt agctgcttct
181 gctcttgtgg atctgctcca gtgggggttg cctgttgtcc tgagctatat gcacactgct
241 ggggctcttg tgggggccca gaagtctggt ggggcctgct ttccatgaga gagggtcatg
301 tccagggatg gctgcacgtc cagggctgac cgccatctcc cctctgtttc agtggcatcc
361 cagagaccgt gcctctgtca gcagtcaaca gacagtgttc atcgggactg caggcagtgg
421 ccaacattgc tggtaagtga tgattcctgg gtgcccatga gtggaccctg accttggtcc
481 ttgaccctca ggaccctagg caggaagagt ggagggacac ccattgccat accccaagcc
541 tttttgggtg ctcccggggt cttccgtgag gctccgttga atgcactccc gagaacaggg
601 agggaacggg ctgtgctgct ccagggcggg cccgttccgg ccaaggataa ttatggagtg
661 ctcttggtgc aggtggcatc agaaatgggt cttacgacat tggcatggcc tgtgggtaag
721 atggccttcc cgccgggctt attaaccaat gcaataatca aactaaaatt tttctctcct
781 tgactgtgat tctccttgcc cactgtgaca gctggctggg ctgggctccc agggatgcag
841 cccaggttgt ggctctgagt atcgtgacct cttttgttct cacaactttg cttgtcacat
901 tggggatctg cag
//
LOCUS RATPKATB3 145 bp ds-DNA ROD 04-AUG-1990
DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
exon 6.
ACCESSION D90061 J05269
KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
peroxisomal protein.
SEGMENT 3 of 5
SOURCE Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 145)
AUTHORS Hijikata,M., Wen,J.-k., Osumi,T. and Hashimoto,T.
TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
closely related but differentially regulated genes
JOURNAL J. Biol. Chem. 265, 4600-4606 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Makoto
Hijikata Virology Division, National Cancer Center Research
Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
(03)542-2511 Fax: (03)545-3567
FEATURES from to/span description
pept + 22 + 120 peroxisomal 3-ketoacyl-CoA thiolase, exon 6
pre-msg < 1 > 145 peroxisomal 3-ketoacyl-CoA thiolase mRNA and
introns
BASE COUNT 32 a 38 c 41 g 34 t
ORIGIN
1 gagctcactc tttctcctca gggtggagtc catgaccctg tctgagagag gcaaccctgg
61 gaatatttcc tcccgtcttc tggagaacga gaaggccaga gactgcctga ttcctatggg
121 gtgagtggcg acaggaatct ctcaa
//
LOCUS RATPKATB4 369 bp ds-DNA ROD 04-AUG-1990
DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
exon 7.
ACCESSION D90062 J05269
KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
peroxisomal protein.
SEGMENT 4 of 5
SOURCE Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 369)
AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
closely related but differentially regulated genes
JOURNAL J. Biol. Chem. 265, 4600-4606 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Makoto
Hijikata Virology Division, National Cancer Center Research
Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
(03)542-2511 Fax: (03)545-3567
FEATURES from to/span description
pept + 242 + 322 peroxisomal 3-ketoacyl-CoA thiolase, exon 7
pre-msg < 1 > 369 peroxisomal 3-ketoacyl-CoA thiolase mRNA and
introns
BASE COUNT 66 a 94 c 123 g 86 t
ORIGIN
1 ggggggaaca gttccttctg taggccccac gttcaggctg ctcttcactc tttgaggtgt
61 gagtctttga gcatccgtgg acatgggcgc tcagtattta agatgagcca ggagtccctg
121 gggcgtggac tgaagctctg tctctgacgt ggacgcccag aggtgcaagt ctagtccttc
181 atcgtgtccc tggccctcag gccagcagac ccatggcctc tgctttcctg tgtcgcacta
241 ggataacctc ggagaatgtg gctgagcggt ttggcatctc acggcagaag caggatgcct
301 tcgcgctggc ctctcagcag aagtgagtgc agagggatgg gctgaggtgg gcagagatgc
361 tgagctggg
//
LOCUS RATPKATB5 2787 bp ds-DNA ROD 04-AUG-1990
DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
exons 8-12.
ACCESSION D90063 J05269
KEYWORDS 3-ketoacyl-coA thiolase; fatty acid beta-oxidation;
peroxisomal protein.
SEGMENT 5 of 5
SOURCE Rat (Sprague-Dawley) liver DNA, clone PTB2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2787)
AUTHORS Hijikata,M., Wen,J.-k., Osumi,T. and Hashimoto,T.
TITLE Rat Peroxisomal 3-Ketoacyl-CoA Thiolase Gene: Occurrence of two
closely related but differentially regulated genes
JOURNAL J. Biol. Chem. 265, 4600-4606 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Makoto
Hijikata Virology Division, National Cancer Center Research
Institute 5-1-1 Tsukiji, Chuo-ku Tokyo 104 Japan Phone: 03-542-2511
Fax: 03-545-3567
FEATURES from to/span description
pept + 987 1177 peroxisomal 3-ketoacyl-CoA thiolase, exon 8
1326 1505 peroxisomal 3-ketoacyl-CoA thiolase, exon 9
1782 1837 peroxisomal 3-ketoacyl-CoA thiolase, exon 10
1923 2068 peroxisomal 3-ketoacyl-CoA thiolase, exon 11
2387 2462 peroxisomal 3-ketoacyl-CoA thiolase, exon 12
pre-msg < 1 2742 peroxisomal 3-ketoacyl-CoA thiolase mRNA and
introns
IVS 1178 1325 peroxisomal 3-ketoacyl-CoA thiolase intron H
IVS 1506 1781 peroxisomal 3-ketoacyl-CoA thiolase intron I
IVS 1838 1922 peroxisomal 3-ketoacyl-CoA thiolase intron J
IVS 2069 2386 peroxisomal 3-ketoacyl-CoA thiolase intron K
signal 2724 2729 polyadenylation signal
BASE COUNT 572 a 722 c 864 g 629 t
ORIGIN
1 ctcacggtgg ggcagagatg gcctgagctg gggctctcac tgtggtacta aggtgttggt
61 tagccccatg actttagctg ggaggtggaa tggttcagcc cggagtgtgg ggtgggtctt
121 cccagttctt aactattcac tggggtcggt ctctctagag acctcctcct cctctaccct
181 gcttcttagg acagaaggac ccagttctga acagacatgg tggtcactgg tgatccttag
241 gcctggagtg tggggtcagt ggtgctaaga ccggatcccc ctcctaccac aagatgaaga
301 cctgggtttg cagcggtggc ctgaggccat cattccagaa aaccagagcc ttaggcccgt
361 gggaaacaga agggcaggtt ggccactcaa atcatggatt gagaacagag ggaggatgga
421 gtgtaagcca ggtggcctca ccttttagag ttggccagca gtgtgcgttc cctggaagcc
481 gtgcgttcct gcccacaggc atgactctac ccacttgttc tgttcaccct gacaacagcg
541 tgggaatcag gaaataaatc caggaggtgc acctcgcttt taaatttaag aggggaagca
601 ggtgaagttt cgcttttagc agtcaaacat gggccttgaa gaatccggca cagacattcc
661 caggggtgca gcctcgttgt ggaagagacc tctgagcagg ctaagcagct agcggattaa
721 gaatcaggag acagaatagc agctctgaac tccctgctgt tggctgggga tgagtactgc
781 caaaccggga aaatcaggag ctgctgctga gtctgggaga gcttcgggtg cgcctttccc
841 tccctgggga atcttagctg tcactcagag ctccatcagg gccacaaagg gctgtggttg
901 cattgccttt ctgagttagc caagcccagg gactgggagg aaagactaca aaggggcccg
961 actctcgctg tccatggtgc tggcagggca gcaagcgccc agagcaaagg ctgcttccgt
1021 gctgagatcg tgcctgtgac aaccactgtc ctcgatgaca agggtgacag gaaaaccatc
1081 accgtgtctc aggatgaggg tgtccgcccc agcaccacca tggagggcct ggccaagctg
1141 aagcctgcct tcaaggatgg aggctctacc acggctggtg agcgtgggcc aggattaggg
1201 ccgatctcgg tggcttgttc tactctgaga cctggagctg accaggccgc ctgggaaggt
1261 gatagtcagg ggctggctct ctcaggccct gcctgacgtc tcctcgtccc acccttcctc
1321 cccaggaaac tccagtcagg tgagtgatgg agcagccgcc gtcctgctgg cccggaggtc
1381 caaggctgaa gaactgggcc tccccatcct tggcgtcctg aggtcctatg cagtggtcgg
1441 ggtccctcct gacatcatgg gcatcggacc tgcctatgcc atccctgcgg ccttgcagaa
1501 agcaggtgag gtggctcctt ctcaccctgt gcttggatcc ttcattacct ggatctggga
1561 ctggggagtg ctggggtttg accttctggg tgctgtggat ggagcgggct agcaccctac
1621 acccaagacc ttcctggcac tacagctggc tgggtctcac tctcttgtcc ctagaagaca
1681 aacatcagac tcctgggatc taggcttggg atagagtcct gggggagtcc aaagtcctgc
1741 ccagtgattg ttgtcaattc tggcctatgc ctcacccaca gggctgactg tgaatgacat
1801 agacatcttt gagatcaatg aggcctttgc aagtcaggtg agcctgggtg ttgtggtagg
1861 attgactggg ccagcagttc agagcagatg gagactttag ctcctgcgct tgctccttcc
1921 aggccctcta ctgtgtggag aaggtgggaa ttcctgcaga aaaggtgaac cccctggggg
1981 gtgcaatagc cctgggccac cccctgggct gcaccggagc aaggcaggtg gtcacgctgc
2041 tcaatgagct gaagcgccga ggcagacggt aaggctgctc ctcgtggggt gctgtggggg
2101 ttattagagc tggggtgtcc aatagctggg ggcgtgggga ggtgagcaca cacacagggg
2161 ttctgaactg ggaatggagg ggcgagacct gtcttccaga agccttgctt caaaaacagc
2221 atgactgtgc ctgccactgg gtgtgtgtct gtccggggct cggttcccac agatctgctg
2281 tgctggcttg gactctgccc atgaaccctt gtggctgcag ccctggctgg tgggagcatg
2341 ggttggggtt tggctactcg agccaacact ggttcttttt ccccagggct tatggcgtgg
2401 tgtccatgtg catcgggact gggatgggag ccgctgctgt ctttgaatac cctgggaact
2461 gaggccctga ctgcaggcac tacccagaga gtcctatagt agtgtctgga gagggatggt
2521 acagaagcca tcttcgtggg acactcagca gtggagggat ttgtcacagc actttaattt
2581 agaaaatgta atcgcaatac gggcacaagg gcatccaatc cactcctctc agttgaacca
2641 aagacagcgc acgtcacacg tgaccacagg acaacatgtt tctgagggac agtggcctca
2701 tgaatagtgg gcatgagggg catagtaaat gtatgttatc tcatcttgct tgcagtcatt
2761 tcccccaggc cggctcttgc tttgttg
//
LOCUS NPOCAP 2102 bp ds-DNA VRL 04-AUG-1990
DEFINITION Orgyia pseudotsugata polyhedrosis virus capsid protein (P39)
gene, complete cds.
ACCESSION M21042 M23429
KEYWORDS capsid protein; structural protein.
SOURCE Orgyia pseudotsugata polyhedrosis virus DNA, clone cosmid #58.
ORGANISM Orgyia pseudotsugata polyhedrosis virus
Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus;
Nuclear polyhedrosis viruses.
REFERENCE 1 (bases 1 to 2102)
AUTHORS Blissard,G.W., Quant-Russell,R.L., Rohrmann,G.F. and Beaudreau,G.S.
TITLE Nucleotide sequence, transcriptional mapping, and temporal
expression of the gene encoding p39, a major structural protein of
the multicapsid nuclear polyhedrosis virus of Orgyia pseudotsugata
JOURNAL Virology 168, 354-362 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by G.W.Blissard, 03-NOV-1988.
The p39 protein is found in both the PDV and BV virion phenotypes;
P39 mRNAs are approximately 2600 nt in length, are expressed late
(24-48 hpi), and have two major transcription initiation sites. At
each major transcription start site, two transcripts are initiated
within the ATAAG consensus. Transcripts initiate at positions 2
and 3 of the ATAAG consensus.
FEATURES from to/span description
pept 124 1179 capsid protein
mRNA 27 > 1179 capsid protein mRNA (alt.)
mRNA 28 > 1179 capsid protein mRNA (alt.)
mRNA 75 > 1179 capsid protein mRNA (alt.)
mRNA 76 > 1176 capsid protein mRNA (alt.)
BASE COUNT 493 a 607 c 585 g 417 t
ORIGIN Map position 57-58.
1 ccggcgcagc gcaccgtttc tttatataag aatctaaaat tagatacaag aggtcttggc
61 taaaattgat tgtataagag atttcttgct ctatcaaaac gtcggccccc atcgccgcag
121 aaaatggcgc tcgtgtcacc cggggtgtcg tcgcgacggt cgaccaacca ctgcatcttt
181 ggcgcgattg aaccgttcga ctcgtgcgtc acgtacaggt cgccatgctc gtcggacgcg
241 tcggtggacg acggctggtt catctgcgac taccacctga agttgcgctt caaaatgtcc
301 aaaatggtgc tgcccattta cgacgaagac gacaaccagt acaagcgcac catagcgcgg
361 catttggtgg ggcacaaaga aaggggcgta aagcgcattc tggtgcccac cagagccaac
421 tacatgaccg tgttcaattt gccgggcatg atgctggccg agcagctgat ttttcatttg
481 atctacgaca accgtttgga agtcaatcgc atttgcgcct cgttgaaaaa caacgaaaac
541 ttcattgaca acacgtacag cgttgtggag agcgtgtaca gcgcgacgcg caacattctg
601 tcgctcacag acccgcaagc gtactgctcg cgcgtggcca acgacgacgt gcgctttttc
661 gacgcgaacg tagtcgacaa caactaccag gcgggcaacg gcgacaccgt gtttaacaac
721 atgcccgggt ttttgcgcaa cctaatcagg cgcgccgtgg cgcccgaaac gctgcagatc
781 gacagcgaag acctgcgcct gcgaaactgc aacacctgtg tgattaacaa caccggcttg
841 gtagcaaccg tgacaaacac cgagctgtac aacccggtgc gcagctcgga catcattaaa
901 acgcggccca atcgcttgca gattagaaac gttttgaagt ttgagggcga cacacgcgcg
961 ttggagcgca ccttgggccg ctacgaggaa taccccatgt acgtgccctt gtttttgggc
1021 taccagctgg tcaacctgca aaacgacatt ttgcgcgcca acaacttttt acccgcgccg
1081 tttggcgtgc cgcaggccgt aaacaacctg gaggcccaag cgcccgcagc gcccgcaccc
1141 gcagcgcccg cgcccgcgcc cgcagcgccc gttgtttaat tccgttgcca tggacacggt
1201 gaggctgcag tgccacattt gctgctcggt gggcgaaatc aaaaattatt ttttgcagcc
1261 cgtcgacgca atcaccattt tgcccattgt ggagctgcac acgtgccggc accaattgtg
1321 cgtcatgtgc gtgcgcaaaa ttgcgcagcg cggccgagac aagcgcgtcg agtgccccat
1381 gtgccgccgc aaaaacgcgc actttaacgt gtacagcgtg aaccgtaatt cggtggatgt
1441 gttgcgctgc agcgtggccg acgtgcgcga gcacgggcgc tttggcggat tggcggacgc
1501 cgcgtcgctg gctcgtgggc tgtttgagcc cagcttgctt gaagccgagc ccgcgccgga
1561 caactcgttt ggcccaaacg agttacaact tgtgctaaaa cgtctaaagg cgcaaattga
1621 ggcgcaaaca agaactaatt atgatttgca actgcaagca actgctttgg agcgaacaat
1681 cgaagaagca aacgaccgct tgggtaaaag ccgcggcgac tacagcgacg cgtgcaagct
1741 gatggacgag cttcgcggcg acaggctgcg cgcggagcgc gcggtcaagg cgctcgccga
1801 cgcgcacgcc cagtgggcgg acaagaacgc caaaatgcgg cgcgagaacg acaggttgac
1861 aaatgaaaac attggtttga ttcgggacaa caatttattt aaacaaaaca ctgcacgcaa
1921 acgtaaaatc gctccgtaac cgcctttgtt tgcgcgtcaa agtccacctc gtagacgtgt
1981 cgctcgttta cgctgccgca gcggtcccag ttgtccgtcg tgcaccacgc cgcattgtgt
2041 cggcggcaca aatacagcgc ttcgttgaat tgcgccggcg gcagattgaa gcggccgagc
2101 tc
//
LOCUS PT2REGA 462 bp ds-DNA PHG 04-AUG-1990
DEFINITION Bacteriophage T2 translational repressor (regA) gene, complete cds.
ACCESSION M27737
KEYWORDS RNA binding protein; translational repressor.
SOURCE Bacteriophage T2, cDNA to viral mRNA.
ORGANISM Bacteriophage T2
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 462)
AUTHORS Miller,E.S. and Jozwik,C.E.
TITLE Sequence analysis of conserved regA and variable orf43.1 genes in
T4-like bacteriophages
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.S.Miller, 08-SEP-1989.
The T2 RegA protein coding sequence corresponds to positions
9839-10207 of the T4 regA sequence contained in the entry with
accession number M10160. The complete sequence reported herein
would correspond to positions 9804-10266.
An RNA secondary structure is detected by primer extension
reactions on mRNA templates at positions 407-424. An RNA nuclease
sensitive site is noted at position 431.
FEATURES from to/span description
pept 36 404 translational repressor protein (RegA)
BASE COUNT 169 a 66 c 88 g 139 t
ORIGIN
1 gaaacaactc aaaaaactag cattggaatg gtaaaatgat tgaaattact cttaaaaaac
61 ctgaagattt tctgaaagta aaagaaactt tgactcgtat gggaattgct aataataaag
121 ataaagttct gtatcagtcc tgtcatattc ttcagaaaaa aggactatac tatatcgttc
181 attttaaaga aatgcttcgt atggatggtc gccaagttga aatgacagaa gaagatgaag
241 ttcgtcgtga ttcgattgca tggctattag aagattgggg actaattgaa atcgttcctg
301 gtcaaagaac ttttatgaaa gatttaacta ataacttccg agttatttct tttaaacaaa
361 aacatgaatg gaaactcgtt cctaaatata cgattggtaa ttaagcaagg ggcttcggcc
421 ccttatttgg agtataatat atcaagagcc taataactcg gg
//
LOCUS PT4G55G43 13062 bp ds-DNA PHG 04-AUG-1990
DEFINITION Bacteriophage T4 genes 55, alpha-gt, 47, 46, 45, 44, 62, regA,
and 43.
ACCESSION M10160 J02510 X00769
KEYWORDS DNA polymerase; DNA polymerase accessory protein;
alpha-glucosyltransferase; exonuclease; polymerase;
translational repressor.
SOURCE Bacteriophage T4 DNA.
ORGANISM Bacteriophage T4
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 7513 to 8269)
AUTHORS Spicer,E.K., Noble,J.A., Nossal,N.G., Konigsberg,W.H. and
Williams,K.R.
TITLE Bacteriophage T4 gene 45: Sequences of the structural gene and its
protein product
JOURNAL J. Biol. Chem. 257, 8972-8979 (1982)
STANDARD full staff_review
REFERENCE 2 (bases 7513 to 10735)
AUTHORS Spicer,E.K. and Konigsberg,W.H.
TITLE ;
JOURNAL (in) Mathews,C.K., Kutter,E.M., Mosig,G. and Berget,P.B. (Eds);
Bacteriophage T4: 299-301;
American Society for Microbiology, Washington, DC (1983)
STANDARD full staff_review
REFERENCE 3 (bases 9747 to 10265)
AUTHORS Trojanowska,M., Miller,E.S., Karam,J., Stormo,G. and Gold,L.
TITLE The bacteriophage T4 regA gene: Primary sequence of a translational
repressor
JOURNAL Nucleic Acids Res. 12, 5979-5993 (1984)
STANDARD full staff_review
REFERENCE 4 (bases 8257 to 9272)
AUTHORS Spicer,E.K., Nossal,N.G. and Williams,K.R.
TITLE Bacteriophage T4 gene 44 DNA polymerase accessory protein:
Sequences of gene 44 and its protein product
JOURNAL J. Biol. Chem. 259, 15425-15432 (1984)
STANDARD full staff_review
REFERENCE 5 (bases 1 to 7084)
AUTHORS Gram,H. and Rueger,W.
TITLE Genes 55, alpha-gt, 47, and 46 of bacteriophage T4: The genomic
organization as deduced by sequence analysis
JOURNAL EMBO J. 4, 257-264 (1985)
STANDARD full staff_review
REFERENCE 6 (bases 6985 to 7512)
AUTHORS Hsu,T. and Karam,J.
JOURNAL Unpublished (1985) Med U of South Carolina, Charleston, SC.
STANDARD full staff_review
REFERENCE 7 (bases 9781 to 10291)
AUTHORS Adari,H.Y., Rose,K., Williams,K.R., Konigsberg,W.H., Lin,T.-C. and
Spicer,E.K.
TITLE Cloning, nucleotide sequence, and overexpression of the
bacteriophage T4 regA gene
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1901-1905 (1985)
STANDARD full staff_review
REFERENCE 8 (bases 7513 to 12985; revises [1],[2],[4],[7])
AUTHORS Spicer,E.K.
JOURNAL Unpublished (1986) Yale U School of Med, New Haven, CT.
STANDARD full staff_review
REFERENCE 9 (bases 7513 to 12985; revises [1],[2],[4],[7],[8])
AUTHORS Spicer,E.K.
JOURNAL Unpublished (1987) Yale U School of Med, New Haven, CT.
STANDARD full staff_review
REFERENCE 10 (bases 10205 to 13062)
AUTHORS Spicer,E.K., Rush,J., Fung,C., Reha-Krantz,L.J., Karam,J.D. and
Konigsberg,W.H.
TITLE Primary Structure of T4 DNA Polymerase: Evolutionary relatedness to
eucaryotic and other procaryotic DNA polymerases
JOURNAL J. Biol. Chem. 263, 7478-7486 (1988)
STANDARD full staff_entry
REFERENCE 11 (bases 9184 to 9903)
AUTHORS Rush,J., Lin,T.-C., Quinones,M., Spicer,E.K., Douglas,I.,
Williams,K.R. and Konigsberg,W.H.
TITLE The 44P subunit of the T4 DNA polymerase accessory protein complex
catalyzes ATP hydrolase
JOURNAL J. Biol. Chem. 264, 10943-10953 (1989)
STANDARD full staff_entry
REFERENCE 12 (bases 10212 to 10215)
AUTHORS Andrake,M., Guild,N., Hsu,T., Gold,L., Tuerk,C. and Karam,J.
TITLE DNA polymerase of bacteriophage T4 is an autogenous translational
repressor
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7942-7946 (1988)
STANDARD full staff_entry
COMMENT Gene 45 protein is required for both T4 DNA replication and T4 late
gene transcription [1]. Gene 44 protein is a DNA polymerase
accessory protein required for T4 DNA replication [4]. Gene 55
protein associates with the modified host RNA polymerase and is
necessary for late RNA transcription [5]. Gene 43 encodes a T4 DNA
polymerase [8]. [5] notes several open reading frames not included
in the FEATURES table.
A draft entry and computer-readable copy of the sequence in [8],[9]
were kindly provided by E.K.Spicer 13-MAR-1986 and 27-JUL-1987.
FEATURES from to/span description
pept 25 582 g55 protein
pept 2356 3558 alpha-glucosyltransferase (alpha-gt)
pept 3735 4754 exonuclease g47
pept 5198 6880 exonuclease g46 (gtg start codon)
pept 7579 8262 DNA polymerase accessory protein g45
pept 8313 9272 DNA polymerase accessory protein g44
pept 9274 9837 DNA polymerase accessory protein g62
pept 9839 10207 translation repressor protein (regA)
pept 10286 12982 DNA polymerase (g43)
revision 8707 8707 g in [8]; c in [2],[4]
revision 8835 8835 c in [8]; g in [2],[4]
revision 8950 8950 g in [8]; c in [2],[4]
revision 9135 9136 cg in [8]; gc in [2],[4]
revision 9484 9484 c in [8]; g in [2]
revision 9929 9929 g in [2],[3],[8]; aaag in [7]
revision 10212 10215 gggg in [12]; gg in
revision 10551 10552 ct in [8]; tc in [2]
revision 10807 10807 a in [9]; c in [8]
revision 12597 12599 ttg in [9]; tg in [8]
revision 12605 12607 aaa in [9]; aa in [8]
revision 12609 12610 tt in [9]; t in [8]
revision 12612 12613 tt in [9]; t in [8]
revision 12618 12619 aa in [9]; a in [8]
revision 12644 12646 aaa in [9]; aa in [8]
BASE COUNT 4471 a 1954 c 2512 g 4125 t
ORIGIN About 45.5 kb on genomic map.
1 tatgaattga gctaagaggt gaaaatgtca gaaactaagc ctaaatataa ttacgtaaac
61 aataaagagc ttttacaagc tattattgat tggaaaacag aattagcaaa taataaagac
121 ccaaataaag tagttcgtca gaatgatact atcggattag ccattatgct tattgcagaa
181 ggcttatcta aacgtttcaa cttttcagga tacacccagt cttggaaaca agaaatgatt
241 gcagatggta tagaagcttc tattaagggg cttcacaatt ttgatgaaac gaaatataaa
301 aacccacatg cgtatataac tcaagcttgt tttaatgcat tcgtccaacg tattaaaaaa
361 gaacgtaagg aagttgcaaa gaaatatagt tacttcgttc acaatgtcta tgacagccgt
421 gacgacgata tggttgcgtt agtagatgaa acttttattc aagacatcta tgataaaatg
481 acgcattacg aagaatcaac ctatagaaca ccgggggctg aaaagaaaag tgttgtagat
541 gattctccta gtttggattt tttatatgag gctaacgatt aacctctccg gattcttgga
601 agaaatacct gaagttgaag ctattcccta tttacttaaa atgtatctca gggaagtttt
661 agctcttgac attgatattg atccagaaaa tccgtatgat accgctttta aatctaatgg
721 tgtagaatta aactatcggt atcatttaac agatgatgat ttttatttta tattagagaa
781 ataatatgac tgataaaccc gaaattaatg atgaagtgga aaagcttatt tcttctattg
841 aagaaaagaa ccgtcttgaa gcagaaagaa aagcaaataa gttattgtct aaaaacaaac
901 gcgaactgaa tcgtctttat aagcacgctc agatcgcagc tgaaaataat aattttgctc
961 aatacgaata tgctatcaag aaaagtcggg atattctaaa acagccatat aacgatgaac
1021 tcatcagtat tctttggaag actactagat cgcagattga ggatatgatt gatgcttaca
1081 cacgtaaaat tcaagcgtct taaaattaat gcaggattta ctgaatcttt gaatggtcat
1141 ctttgcgtga aaatttctga aaaagaatac cgtgatagtt caatcaaaga agttaatcct
1201 cctattgtaa gagcagaccc taatatgaaa gtgtgggttg attcttatca agtcaaaaaa
1261 tggtggcaac tgtgaaagat gaacatccga cttctgaaat agattataat aaaattcgtt
1321 cctctagaga ggaaatgatg agacgcttta aagaggctca tgataaagct aaagcagaag
1381 gaactataac atataagcgc ataaaattta aaagttctaa cgagcctctg tatggcgtat
1441 tatgtggata ggagcttcgg ctcctatatt gctttataaa ttttttggta aaataaacta
1501 aaatgaatta tttaaacata tataatttac tgatcgaaaa aggactaaac agaggcaatg
1561 ataaaagtct tttgacatac tatactgaaa cacatcatat aattcctaga tgtatgggtg
1621 gtacagatga taaaactaat ttagttcttt tgacaccaga agaacatttt accgcgcatt
1681 tattattatt taaaatttat agacttccta aattggcttt agctattcgt atgatgtgtt
1741 atagttctga cggaacaaga ctaaataata aaatgtatgg atggattaaa acagctgtat
1801 cttcttcaat atcggaaagc atgaaagagt tctggaaaga tgatgataat aaaaaataca
1861 tgtctaacgc tagacgaaat gctggaaaac caatttatca atatgattta aatggaaatt
1921 ttattcgcaa atatagatgt attacagatg ctgcagaaga tatgtcatat agctgttcaa
1981 cttcaattaa acaatgtgta gatggtaaaa ggaaaaccgc tggaggattt caatggaaat
2041 attattattc tgataacata ggaaagccct ctaggatgtc taatgcaact aaacaaaaaa
2101 tgtctaaatc taaacgtggg ataactcaga aaagaaatgt gccagtattt caatatgata
2161 ctactggaaa attattacga gtatttccta gaattaaaga cgcagctgta tccgttaaag
2221 gatgcatgtc taatattaaa aagtgcatat ctggaaaaag caaaattgct tatggatatg
2281 tttgggccta ttcgtagcgt tttatagaaa ataaaatatt atttacatga tttattaaat
2341 gaaaagagga aaactatgcg tatttgcatt tttatggctc gaggtcttga aggttgtggt
2401 gtaacaaaat tctcactcga gcaacgtgat tggtttatta aaaatggtca tgaagtaact
2461 ttggtttatg ctaaagataa atcatttact cgtacaagtt ctcatgacca caaatcattt
2521 tcaattccag ttattttagc taaagaatac gataaagcac ttaagctagt aaatgattgt
2581 gatattctaa ttattaattc tgttcctgct acttccgttc aagaagctac gattaataac
2641 tataaaaaac ttttagataa tattaaacct tctattcgtg ttgtagttta tcagcatgat
2701 cattctgttc tttctttgcg tcgaaatttg ggattagaag aaactgttcg tcgagctgat
2761 gttattttta gccattctga taatggtgat tttaataaag ttctgatgaa agaatggtat
2821 ccagaaactg tttctctgtt tgatgatatt gaagaagcac cgacagtata taattttcag
2881 cctcctatgg atattgtgaa ggttcggtca acttattgga aagatgtttc tgaaattaac
2941 atgaatatca accgttggat tggtcgtacg actacatgga aaggttttta ccagatgttt
3001 gattttcatg aaaaattctt aaaacctgct ggtaaatcca ctgtaatgga aggtctggaa
3061 cgttcccctg cttttattgc aattaaggaa aaaggtattc cgtatgaata ttacggtaat
3121 cgtgagattg ataaaatgaa tctcgcgccg aatcaaccgg cacaaatcct agattgttat
3181 attaatagtg aaatgcttga acgaatgagt aaatctggct ttggatatca gttgagtaaa
3241 cttaaccaga aatacttaca acgctcactc gaatatactc atctcgagct tggtgcatgt
3301 ggaacaattc cggtattttg gaaatctact ggcgaaaatt taaaattccg tgttgataat
3361 actcctttga cctcgcatga tagcggtatc atttggtttg atgaaaatga tatggaatca
3421 acatttgaac gtattaaaga actgtcatct gaccgagctc tttatgaccg tgagcgagaa
3481 aaagcatatg aatttttgta tcagcatcaa gattcaagct tctgctttaa agaacagttt
3541 gacattatta caaaataaag ggcttcggcc ctttagcttt atacggagtt tgatataatg
3601 atatttcttg gatatgtgat actttttctt gcattttatc tattcactag agcatgttgg
3661 attgggttct ttagcactcc agatggattt atttcaataa ttttattttg catttcaatg
3721 acggttcttg atatatgaaa attttaaatt taggtgattg gcatttaggc gttaaagctg
3781 atgatgagtg gattcgcggt attcaaattg atggaattaa gcaagcgata gaatattcta
3841 agaaaaatgg aattactacc tggattcaat acggtgatat ttttgatgtg cgaaaagcga
3901 tcacacataa aactatggag tttgcccgtg aaatagttca aacgcttgat gatgctggaa
3961 ttacattaca cactattgta ggaaatcacg atctccacta taaaaatgta atgcatccaa
4021 atgcttctac tgagcttttg gctaaatatc ctaatgttaa agtgtatgat aagcctacta
4081 cagtagattt tgacgggtgt ttgattgatt tgattccttg gatgtgcgaa gaaaatactg
4141 gtgaaattct tgagcatatc aaaacttcat ctgcttcttt ttgtgttggt cactgggaac
4201 tgaatggatt ttatttttat aaaggaatga aatctcatgg tcttgaaccc gatttcctta
4261 agacttataa agaagtgtgg tctggtcact tccatactat ttctgaggct gcaaacgtca
4321 gatatattgg gacaccatgg acactaaccg caggtgacga gaatgaccct cgtgggttct
4381 ggatgtttga tacagaaaca gaacgaacgg aatttattcc aaacaatact acctggcatc
4441 gtagaattca ttatccattt aaaggaaaaa ttgactataa agattttaca aatctatcag
4501 tacgtgttat agtaactgaa gtagacaaaa atctgacgaa gttcgaatct gaactagaaa
4561 aagttgtgca ttcattacga gttgtgtcaa agattgataa ctctgtcgag tcagatgaca
4621 gtgaagaagt tgaagttcaa tcgcttcaga cattaatgga agagtatatt aatgcaattc
4681 cagacatcac tgattctgac cgtgaagcac ttattcaata tgcaaatcag ctatatgtag
4741 aggcaacaca atgacttttg atgaatttaa aaatgttatg atgagtcagc attttaaatg
4801 tgaagtaaaa gatgatattg gtcataaaga aattattgaa tactggtttg aaccgttaga
4861 ggttgaagat aattgtatta aaaaggttac tgtctgcact gactgggctg tatcttttaa
4921 cttcaacatt ttagataatg acacacctaa atcattacga gatatggctg tatcttgtat
4981 taaggatgca tactgtgaag ttttcgacat ttgacattaa tgatgaattc atagcaaata
5041 ttgattatac cgaagaagat tctagatatg ttggaataat ttatatcaca tcaaaaacag
5101 cacaaggcgt tgtttgcatg gctgaatttg atgaatactt tttagattat gatgatatga
5161 tagaatggtc taaaagatac attaaaagga atcttttgtg aagaatttta aacttaatag
5221 agtaaaatat aaaaatataa tgtcagttgg tcaaaatggg attgacattc aattagataa
5281 ggttcaaaaa actcttatta ctggacgaaa tggcggtggt aagtctacta tgctagaagc
5341 catcacattt gggctttttg gtaagccatt tcgcgatgta aagaaaggtc aattaataaa
5401 cagcacaaat aagaaagaac ttttagttga actgtggatg gaatatgatg agaaaaagta
5461 ctatatcaaa agaggacaaa aaccaaacgt tttcgaaatc accgttaacg gtacacgtct
5521 taatgaatct gccagcagta aagatttcca agcagaattt gaacagctta tcggaatgtc
5581 atatgccagt ttcaagcaga ttgttgtcct tggtacagca gggtataccc ctttcatggg
5641 tttgtcgacc cctgcgcgaa gaaagcttgt ggaagacctg cttgaggtag gaacattagc
5701 tgaaatggat aagcttaata aagcactaat acgtgaatta aattcgcaga accaagtgct
5761 tgatgttaaa aaagatagta ttatccaaca aattaaaata tataatgata atgttgaacg
5821 ccagaaaaaa ttaacgggtg acaaccttac tcgtctacaa aatatgtatg atgatttggc
5881 aaaagaagct agaacgctaa aatcggaaat agaagaagct aacgaaagat tagttaatat
5941 tgttttggac gaagacccga ctgatgcatt taataaaatc ggtcaagaag catttttaat
6001 taaatcaaaa attgactcgt ataataaagt cattaatatg tatcacgaag gtggattatg
6061 tccaacctgt ttgtcacaat tgagttccgg tgataaagtt gtttctaaaa ttaaagataa
6121 agtttctgaa tgcacacatt cgtttgaaca gctttcaaca catcgtgata atttaaaagt
6181 tcttgttgat gaataccgag ataatattaa aacccagcag tcgttggcaa atgatattcg
6241 caataaaaag caatctctga tcgcagcagt agataaagct aaaaaagtta aagcggctat
6301 agaaaaagca tcttctgagt ttattgacca tgctgatgaa atagcactgc ttcaagaaga
6361 acttgataaa attgttaaga caaaaactaa tttagtaatg gaaaaatacc accgaggaat
6421 tttgactgat atgctcaaag attctggtat taaaggtgct attattaaaa agtacattcc
6481 attatttaat aagcagatta accattatct taaaataatg gaagcggatt atgtgtttac
6541 attagatgaa gaatttaatg agacaattaa atcccgtggt cgtgaagatt ttagttatgc
6601 ttcatttagt gaaggtgaaa aggcacgaat cgatattgct cttttattta cttggcgtga
6661 tattgcttct atcgtatctg gtgttagtat tagtacatta attcttgatg aagtgtttga
6721 tgggtcattt gatgccgaag gtattaaagg tgtagctaat attataaatt caatgaaaaa
6781 cactaacgtt tttataattt cgcataaaga ccatgacccg caagaatatg gtcagcatct
6841 tcaaatgaag aaagttggtc gatttactgt aatggtttaa tttataagag attatgcttt
6901 aatttattag agtataatct ctatggagga aaaacatgga atattcaact ggacagcatc
6961 tattaactat tcctgaaata aaacgatata ttctgagaaa taatttttct aatgaagagc
7021 atatagttac tgaatctatg cttaggaatg catttaaagc agaatataca aaaataatgt
7081 ccaatagaaa tgaagcttgg actgttactg attattatga ctaaaggtgt attatgacta
7141 aaattactgt gaattatact gttgatgtaa aagatattca gccaaaacac gtgcgttctg
7201 aatcaaatcc acaaaaccaa aataaaattc gtcgagcatg tgttttgtct ctttctgata
7261 acgcaatgga agttattcag aacaaaatta aatctgcacc tgctcgtcat gcgtattatg
7321 aagctatcga tcgtgaagta agtaataaat ggattgaact aatgcgcaaa catactacag
7381 aatccctaaa cgccggtgct aaatttatta tgacttcatg tggtgaaacg ccttgaagat
7441 gattattgcg gtaatgcaga tgaacgtcta attgttgctg ctcaaattgt tgcggaaaca
7501 attgcggctg attttaacgt taattgcttt attaaattag ttataaaatt aaatctcatt
7561 tgaattgaag gaaattacat gaaactgtct aaagatacta ctgctctgct taaaaatttc
7621 gctactatta actctggtat tatgcttaaa tccggtcaat ttattatgac tcgcgcagtt
7681 aatggtacaa cttatgcgga agcaaatatt tctgacgtta ttgattttga tgtagcaatt
7741 tacgatttga acggttttct cggtattctg tctttagtta atgatgcaga aatttcccag
7801 tcagaagatg gaaatattaa aattgctgat gcccgctcaa caattttttg gcgagcagcc
7861 gatccgagta cagatgttgc tcctaataaa ccaattccat tcccggtagc atctgctgtt
7921 actgaaatta aagctgaaga ccttcaacag ctgttgcgtc tatctcgtgg tctgcaaatt
7981 gatacaattg ctatcacggt aaaagaaggt aaaatcgtaa ttaacggttt taataaagta
8041 gaagattctg ctctgacccg tgttaaatat tctttgactc ttggtgatta tgatggtgaa
8101 aatacattta atttcattat caatatggca aatatgaaaa tgcaaccagg aaattataaa
8161 cttctgcttt gggcaaaagg taaacaaggt gctgctaaat ttgaaggtga acacgcgaat
8221 tatgtggtag ctcttgaagc tgattctacc cacgattttt aatagagggc ttcggccttt
8281 ataatttaca ctaaaacttg aatgaggaaa ttatgattac tgtaaatgaa aaagaacaca
8341 ttcttgaaca gaaatatcgt ccatctacta tcgatgaatg tattcttccc gcttttgata
8401 aagaaacctt taaatctatt acaagtaaag gtaagattcc acatattatt cttcattctc
8461 cttctccagg aacaggtaaa acaactgtag caaaagcatt atgtcatgat gtaaatgctg
8521 atatgatgtt tgtgaatggg tcagattgta aaattgattt cgttcgtggt cctttgacta
8581 attttgccag cgccgcttca tttgatggtc gtcaaaaagt aatcgttatt gatgaatttg
8641 accgttcagg gttagcagag tctcagcgac atcttcgttc ctttatggaa gcttatagtt
8701 caaactgtag tattattatt actgctaata atattgatgg tattattaaa ccgcttcagt
8761 cacgctgccg agttattaca ttcggtcaac caactgatga agataaaatt gaaatgatga
8821 agcagatgat tcgtcgattg actgaaatct gcaagcatga aggaattgct atagctgata
8881 tgaaagttgt agcagctttg gttaaaaaga attttcctga ttttcgtaaa actattggcg
8941 agctcgatag ttattcgtct aaaggtgttt tggatgctgg tattttatca ctggttacta
9001 acgatcgtgg tgctattgat gatgttcttg agtctctcaa aaataaagat gttaaacaac
9061 tcagagcttt agcaccaaaa tatgcggctg attattcgtg gttcgtgggt aaacttgccg
9121 aagaaatcta ttcacgtgta actccacaaa gtattattcg tatgtacgaa attgtcggcg
9181 aaaataatca gtatcatggt attgcagcta atactgaatt gcatttagct tatcttttca
9241 ttcaattagc atgcgaaatg cagtggaagt gatatgagct tatttaaaga tgatattcaa
9301 ttaaacgagc atcaagttgc ttggtattca aaagattgga cagctgtcca atccgctgct
9361 gattctttta aagaaaaagc agaaaatgaa ttttttgaaa taattggagc tattaataat
9421 aaaactaaat gctctattgc tcaaaaagat tattcaaaat tcatggttga aaatgcatta
9481 tcacaatttc cagagtgtat gccagctgta tatgctatga atttaattgg atcaggctta
9541 agtgatgaag ctcattttaa ttatctaatg gctgcagttc ctcgtggtaa aagatatggt
9601 aaatgggcaa aactggttga agattccacc gaagtattga ttattaagtt acttgctaag
9661 cggtatcaag ttaatacaaa tgatgcaatt aactataaat caattcttac taaaaatgga
9721 aaacttcctt tagtattaaa agaactaaaa ggtttagtca cggatgattt tttgaaagaa
9781 gtgactaaga acgtaaaaga acagaaacaa ctcaaaaaac tagcattgga atggtaaaat
9841 gattgaaatt actcttaaaa aacctgaaga ttttctgaaa gtaaaagaaa ctttgactcg
9901 tatgggaatt gctaataata aagataaagt tctatatcag tcctgtcata ttcttcagaa
9961 aaaaggacta tactatatcg ttcattttaa agaaatgctt cgtatggatg gtcgtcaagt
10021 tgaaatgaca gaagaagatg aagttcgtcg tgattcgatt gcatggctat tagaagattg
10081 gggactaatt gaaatcgttc ctggtcaaag aacttttatg aaagatttaa ctaataactt
10141 ccgagttatt tcttttaaac aaaaacatga atggaagctc gttcctaaat atacgattgg
10201 taattaagca aggggcttcg gccccttatt tggagtataa tatatcaaga gcctaataac
10261 tcgggctata aactaaggaa tatctatgaa agaattttat atctctattg aaacagtcgg
10321 aaataacatt gttgaacgtt atattgatga aaatggaaag gaacgtaccc gtgaagtaga
10381 atatcttcca actatgttta ggcattgtaa ggaagagtca aaatacaaag acatctatgg
10441 taaaaactgc gctcctcaaa aatttccatc aatgaaagat gctcgagatt ggatgaagcg
10501 aatggaagac atcggtctcg aagctctcgg tatgaacgat tttaaactcg cttatataag
10561 tgatacatat ggttcagaaa ttgtttatga ccgaaaattt gttcgtgtag ctaactgtga
10621 cattgaggtt actggtgata aatttcctga cccaatgaaa gcagaatatg aaattgatgc
10681 tatcactcat tacgattcaa ttgacgatcg tttttatgtt ttcgaccttt tgaattcaat
10741 gtacggttca gtatcaaaat gggatgcaaa gttagctgct aagcttgact gtgaaggtgg
10801 tgatgaagtt cctcaagaaa ttcttgaccg agtaatttat atgccattcg ataatgagcg
10861 tgatatgctc atggaatata tcaatctttg ggaacagaaa cgacctgcta tttttactgg
10921 ttggaatatt gaggggtttg acgttccgta tatcatgaat cgtgttaaaa tgattctggg
10981 tgaacgtagt atgaaacgtt tctctccaat cggtcgggta aaatctaaac taattcaaaa
11041 tatgtacggt agcaaagaaa tttattctat tgatggcgta tctattcttg attatttaga
11101 tttgtacaag aaattcgctt ttactaattt gccgtcattc tctttggaat cagttgctca
11161 acatgaaacc aaaaaaggta aattaccata cgacggtcct attaataaac ttcgtgagac
11221 taatcatcaa cgatacatta gttataacat cattgacgta gaatcagttc aagcaatcga
11281 taaaattcgt gggtttatcg atctagtttt aagtatgtct tattacgcta aaatgccttt
11341 ttctggtgta atgagtccta ttaaaacttg ggatgctatt atttttaact cattgaaagg
11401 tgaacataag gttattcctc aacaaggttc gcacgttaaa cagagttttc cgggtgcatt
11461 tgtgtttgaa cctaaaccaa ttgcacgtcg atacattatg agttttgact tgacgtctct
11521 gtatccgagc attattcgcc aggttaacat tagtcctgaa actattcgtg gtcagtttaa
11581 agttcatcca attcatgaat atatcgcagg aacagctcct aaaccgagtg atgaatattc
11641 ttgttctccg aatggatgga tgtatgataa acatcaagaa ggtatcattc caaaggaaat
11701 cgctaaagta tttttccagc gtaaagactg gaaaaagaaa atgttcgctg aagaaatgaa
11761 tgccgaagct attaaaaaga ttattatgaa aggcgcaggg tcttgttcaa ctaaaccaga
11821 agttgaacga tatgttaagt tcagtgatga tttcttaaat gaactatcga attacaccga
11881 atctgttctc aatagtctga ttgaagaatg tgaaaaagca gctacacttg ctaatacaaa
11941 tcagctgaac cgtaaaattc tcattaacag tctttatggt gctcttggta atattcattt
12001 ccgttactat gatttgcgaa atgctactgc tatcacaatt ttcggccaag tcggtattca
12061 gtggattgct cgtaaaatta atgaatatct gaataaagta tgcggaacta atgatgaaga
12121 tttcattgca gcaggtgata ctgattcggt atatgtttgc gtagataaag ttattgaaaa
12181 agttggtctt gaccgattca aagagcagaa cgatttggtt gaattcatga atcagttcgg
12241 taagaaaaag atggaaccta tgattgatgt tgcatatcgt gagttatgtg attatatgaa
12301 taaccgcgag catctgatgc atatggaccg tgaagctatt tcttgccctc cgcttggttc
12361 aaagggcgtt ggtggatttt ggaaagcgaa aaagcgttat gctctgaacg tttatgatat
12421 ggaagataag cgatttgctg aaccgcatct aaaaatcatg ggtatggaaa ctcagcagag
12481 ttcaacacca aaagcagtgc aagaagctct cgaagaaagt attcgtcgta ttcttcagga
12541 aggtgaagag tctgtccaag aatactacaa gaacttcgag aaagaatatc gtcaacttga
12601 ctataaagtt attgctgaag taaaaactgc gaacgatata gcgaaatatg atgataaagg
12661 ttggccagga tttaaatgcc cgttccatat tcgtggtgtg ctaacttatc gtcgagctgt
12721 tagcggttta ggtgtagctc caattttgga tggaaataaa gtaatggttc ttccattacg
12781 tgaaggaaat ccatttggtg acaagtgcat tgcttggcca tcgggtacag aacttccaaa
12841 agaaattcgt tctgatgtgc tatcttggat tgaccactca actttgttcc aaaaatcgtt
12901 tgttaaaccg cttgcgggta tgtgtgaatc ggctggcatg gactatgaag aaaaagcttc
12961 gttagacttc ctgtttggct gatagaataa atctaggacc tccaggtccc tttttcatac
13021 aagtaatata aatctatact tatgaaaaag agatgattct gg
//
LOCUS PT6REGA 462 bp ss-mRNA PHG 04-AUG-1990
DEFINITION Bacteriophage T6 translational repressor protein (regA), complete
cds.
ACCESSION M27738
KEYWORDS RNA binding protein; translational repressor.
SOURCE Bacteriophage T6, cDNA to viral RNA.
ORGANISM Bacteriophage T6
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 462)
AUTHORS Miller,E.S. and Jozwik,C.E.
TITLE Sequence analysis of conserved regA and variable orf43.1 genes in
T4-like bacteriophages
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by E.S. Miller, 08-SEP-1989.
The T2 RegA protein coding sequence corresponds to positions
9839-10207 of the T4 regA sequence contained in the entry with
accession number M10160. The complete sequence reported herein
would correspond to positions 9804-10266.
An RNA secondary structure is detected by primer extension
reactions on mRNA templates at positions 407-424. An RNA nuclease
sensitive site is noted at position 431.
Author address: E.S.Miller
Dept. of Microbiology
Box 7615
North Carolina State University
Raleigh, NC 27695
E-mail: esm@cgc.ncsu.edu
FEATURES from to/span description
pept 36 404 translational repressor protein (regA)
site 431 431 RNA endonuclease site
BASE COUNT 170 a 64 c 89 g 139 t
ORIGIN
1 gaaacaactc aaaaaactag cattggaatg gtaaaatgat tgaaattact cttaaaaaac
61 ctgaagattt tctgaaagta aaagaaactt tgactcgtat gggaattgct aataataaag
121 ataaagttct gtatcagtcc tgtcatattc ttcagaaaaa aggattatac tatatcgttc
181 attttaaaga aatgcttcgt atggatggtc gccaagttga aatgacagaa gaagatgaag
241 ttcgtcgtga ttcaatagca tggttgttag aagattgggg actgattgaa atcgtacctg
301 gtcaaagaac ttttatgaaa gatttaacta ataacttccg agttatttct tttaaacaaa
361 aacatgaatg gaaactcgtt cctaaatata cgattggtaa ttaagcaagg ggcttcggcc
421 ccttatttgg agtataatat atcaagagcc taataactcg gg
//
LOCUS RB18REGA 674 bp ds-DNA PHG 04-AUG-1990
DEFINITION Bacteriophage RB18 translational repressor protein (regA) and
Orf43.1, complete cds.
ACCESSION M33533 M27739
KEYWORDS RNA binding protein; translational repressor.
SOURCE Bacteriophage RB18, cDNA to viral mRNA.
ORGANISM Bacteriophage RB18
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 674)
AUTHORS Miller,E.S. and Jozwik,C.E.
TITLE Sequence analysis of conserved regA and variable orf43.1 genes in
T4-like bacteriophages
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by E.S.Miller, 03-APR-1990.
The T2 RegA protein coding sequence corresponds to positions
9839-10207 of the T4 regA sequence contained in the entry with
accession number M10160. The complete sequence reported herein
would correspond to positions 9815-10255.
Author address: E.S.Miller
Dept. of Microbiology
Box 7615
North Carolina State University
Raleigh, NC 27695
E-mail: esm@cgc.ncsu.edu
FEATURES from to/span description
pept 25 393 translational repressor protein (regA)
pept 395 616 Orf43.1 peptide
site 643 643 RegB endonuclease site
BASE COUNT 251 a 106 c 115 g 202 t
ORIGIN
1 aaaaactagc attggaatgg taaaatgatt gaaattactc ttaaaaaacc tgaagatttt
61 ctgaaagtaa aagaaacttt gactcgtatg ggaattgcta ataataaaga taaagttctg
121 tatcagtcct gtcatattct tcagaaaaaa ggactatact atatcgttca ttttaaagaa
181 atgcttcgta tggatggtcg ccaggttgaa atgacagaag aagatgaagt tcgtcgtgat
241 tcgattgcat ggctattaga agattgggga ctgattgaaa tcgtccctgg tcaaagaact
301 tttatgaaag atttaactaa taacttccga gttatttctt ttaaacaaaa acatgaatgg
361 aaactcgttc ctaaatatac gattggtaat taatatgact gctataactc cacaagaata
421 catggcgtct cttaaagaaa aatataatct ttctgcaaca gaaacacttt ttgatttacc
481 agaaaacctc caactaaaat ttcaggtaga atttcaaaaa ttaattcatc cagaacaaaa
541 acactttact gcagtcgtta agtcaattaa tgcagatgga atgacaattt ttcaccgaca
601 aatagtacta atttaagcaa ggggcttcgg ccccttattt ggagtataat atatcaagag
661 cctaataact cggg
//
LOCUS RB51REGA 700 bp ss-mRNA PHG 04-AUG-1990
DEFINITION Bacteriophage RB51 translational repressor protein (regA) and
Orf43.1, complete cds.
ACCESSION M33532
KEYWORDS RNA binding protein; translational repressor.
SOURCE Bacteriophage RB51, cDNA to viral mRNA.
ORGANISM Bacteriophage RB51
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 700)
AUTHORS Miller,E.S. and Jozwik,C.E.
TITLE Sequence analysis of conserved regA and variable orf43.1 genes in
T4-like bacteriophages
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by E.S.Miller, 03-APR-1990.
The T2 RegA protein coding sequence corresponds to positions
9839-10207 of the T4 regA sequence contained in the entry with
accession number M10160. The complete sequence reported herein
would correspond to positions 9815-10255.
Author address: E.S.Miller
Dept. of Microbiology
Box 7615
North Carolina State University
Raleigh, NC 27695
E-mail: esm@cgc.ncsu.edu
FEATURES from to/span description
pept 25 393 translational repressor protein (regA)
pept 404 640 Orf43.1 peptide
site 671 671 RegB endonuclease site
BASE COUNT 255 a 114 c 125 g 206 t
ORIGIN
1 aaaaactagc attggaatgg taaaatgatt gaaattactc ttaaaaaacc tgaagatttt
61 ctgaaagtaa aagaaacttt gactcgtatg ggaattgcta ataataaaga taaagttctg
121 tatcagtcct gtcatattct tcagaaaaaa ggactatact atatcgttca ttttaaagaa
181 atgcttcgta tggatggccg tcaagttgaa atgacagaag aagatgaagt tcgtcgtgat
241 tcgattgcat ggctgttaga agattgggga ctgattgaaa tcgttcctgg tcaaagaact
301 tttatgaaag atttaactaa taacttccga gttatttctt ttaaacaaaa acatgaatgg
361 aaactcgttc ctaaatatac gattggtaat taaggtaaat tatatgacac aatcagagat
421 tcagtttacc gaagcttttc taatattcga aaatctaaaa caaacatatg gtctgtcttc
481 tacccaactg gtttttgacc tcccggaaga taaacaaata gaattccaaa aagaattcca
541 acgattagtt tacccacgcc aacgcgcatt tcatgcgatt atcaaaacca caaataaaga
601 tggcaagtcc gttatatcgc gttgtacaat agaaatttaa agcaaagggg cttcggcccc
661 ttatttggag tataatatat caagagccta ataactcggg
//
LOCUS HUMQBPC1 235 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein gene, 5' flank
with an LTR-like sequence.
ACCESSION M35760 J05438
KEYWORDS ubiquinone-binding protein.
SEGMENT 1 of 6
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 235)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Common protein binding sites in the 5'-flanking regions of human
genes for cytochrome c1 and for ubiquinone-binding protein
JOURNAL J. Biol. Chem. 265, 8159-8163 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by H.Suzuki, 08-AUG-1989.
FEATURES from to/span description
site 26 33 Mt3 element
site 103 110 Mt1 element
site 114 122 Mt4 element
BASE COUNT 52 a 59 c 56 g 68 t
ORIGIN
1 tagccaaccg ggttcggatt gtgcagtcca actccagcca atggagtcag gacacagtag
61 cagggacaag ctgcgttaga cataaaaacc tctgctttcc tttgtttagg gtgctctcgt
121 ggcaaccaga cttaccagga gctctattct gcaaaagtaa atttgccttg ctgagagacc
181 ctttgtcctt tggctcagtg ttggttcttc tttgcagcac cgagcatttg tttcc
//
LOCUS HUMQBPC2 1567 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human ubiquinone-binding protein (QP) gene, exon 1.
ACCESSION M35761 M26705 J05438
KEYWORDS ubiquinone-binding protein.
SEGMENT 2 of 6
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1567)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Common protein-binding sites in the 5'-flanking regions of human
genes for cytochrome c-1 and ubiquinone-binding protein
JOURNAL J. Biol. Chem. 265, 8159-8163 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 888 to 1232)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by H.Suzuki, 08-AUG-1989.
FEATURES from to/span description
pept 1151 + 1169 ubiquinone-binding protein (QP), exon 1
IVS 1170 > 1567 ubiquinone-binding protein intron A
site 425 432 Mt3 element
site 525 533 Mt1 element
site 559 567 Mt4 element
BASE COUNT 442 a 339 c 339 g 447 t
ORIGIN
1 cttcctgatt agtagtgtag cagtatagca tatgacccta gaattaagag ccccaaattc
61 gatctggatt ttcagattct ggaagaagaa cctggactct tctagtcatg tgctcttgga
121 caagttattc aacctttctg agcctcagct ttctcctctc tacaatgaga taattacagt
181 acttttagaa ttaaaagata gtttagggga gtgagagatc acctccaacc ctccctcccc
241 tttgctctta tgtgggctct ttggtgaatc taggaaccaa attgacacat ggcagattaa
301 caagagaaaa gcatactaat tttactagtt ttacatgtac atagggatcc gcacaaaaga
361 gtgaagtggc caatgtaaga tgcctttata ctttttacac gaactacaaa tcagaagaaa
421 ggggatctgg ctaggggcag taaatttctc gcggagtcat taggagaaaa gtgtgtgtgt
481 tggggtgtaa acttagtgga agataacggt tacttcggaa agtatattca ggtccaaagc
541 agcccccagt ttcgtctctg gtgataggtg ctattttctc cctctggtaa gggacgggta
601 ctccagagga atcctggagg aagctttatg cgcttgctgc atgcaggaag agataggtca
661 tcaaaccttt tctgaaacta caatttctcc aatattttca actcgaaata atcaatgtac
721 caattccata tatttttggg atggcacatc catcagtcct tcaatagaaa gtaagacgat
781 tgcccttaaa acatttagct tcctgcctgg cacaaagagc tgaattaata ctgtccacgc
841 tcccatagct attaacagca aataacggga acccagcttt aagccttggt ctcctaattc
901 ctgacctgtg ttcttcccat cacatcacaa ccaatacttc ttagaatgta gcacgcaaaa
961 tgccccgcct actccattgg cctcaccctg gccgagaggt gcttcagcag gtgctgaacg
1021 gcagtcgtca gaactgcgcc tgcgcaagcg gcctttctct gttcgcgatg tgacgtaacg
1081 cgcctgcgga ctgggcccag cttgtcctct atgacttacc cagaaggcaa cgcttctctt
1141 tctggtcaaa atggctggta agcaggccgg taagtaactg ggggtcttct tggccggcat
1201 cgtcgcgctt caccgtttat ttttgcctgc aggtcaacgg atcacaggaa ataattgtaa
1261 acttttcatt tcaagagagc tggagttgta gggaaaagac agtgagattt cagcatagac
1321 tggcaggtct aaatgatcct catcctctaa aatttttact cgatattaca gttatttcat
1381 ttggccacat tctttaaaaa tgtatgtatt ttggggagga ttttgatacc attagtcatt
1441 taaacattaa tatggattcg taaaagaagt ccatttacct cctcctaagc aactaagttc
1501 ccccaaggga atggatattg aaaaaggatg agaaataggt aattttaaaa ctgtttgggg
1561 aacaggg
//
LOCUS HUMQBPC3 236 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein gene, intron A
with an LTR-like sequence.
ACCESSION M33945 J05438
KEYWORDS ubiquinone-binding protein.
SEGMENT 3 of 6
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 236)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Common protein binding sites in the 5'-flanking regions of human
genes for cytochrome c1 and for ubiquinone-binding protein
JOURNAL J. Biol. Chem. 265, 8159-8163 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by H.Suzuki, 08-AUG-1989.
FEATURES from to/span description
site 26 33 Mt3 element
site 103 110 Mt1 element
site 114 122 Mt4 element
BASE COUNT 57 a 55 c 54 g 70 t
ORIGIN
1 aaccgggttc agcttagatt gtacggtcca actctagtca atggagtcag gatacagtag
61 cagggacaag ctgcgttaga cataaaaacc tctgctttcc tttgttcggg gtgctctcgt
121 ggcaaccgaa cctatgagga gcaaccttat gcaaaagtaa atttgccttg ctgagagacc
181 ctttgtcctt tgtctcagtg ctagttcttc tttgcagcac caagcatttg ttttca
//
LOCUS HUMQBPC4 114 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein (QP) gene, exon 2.
ACCESSION M26706
KEYWORDS ubiquinone-binding protein cytochrome bc1 complex subunit QP.
SEGMENT 4 of 6
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 114)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 23 + 94 ubiquinone-binding protein (QP), exon 2
IVS < 1 22 ubiquinone-binding protein intron A
IVS 95 > 114 ubiquinone-binding protein intron B
BASE COUNT 34 a 17 c 29 g 34 t
ORIGIN
1 ttaacagtgt gacaattatc agtttcagca tcaggcaagt ggctggatgg tattcgaaaa
61 tggtattaca atgctgcagg attcaataaa ctgggtaagc agctgtgctg ttta
//
LOCUS HUMQBPC5 208 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein (QP) gene, exon 3.
ACCESSION M26707
KEYWORDS QP gene; ubiquinone-binding protein.
SEGMENT 5 of 6
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 208)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 21 + 188 ubiquinone-binding protein (QP), exon 3
IVS < 1 20 ubiquinone-binding protein intron B
IVS 189 > 208 ubiquinone-binding protein intron C
BASE COUNT 73 a 36 c 48 g 51 t
ORIGIN
1 aactttgtca tctatcatag ggttaatgcg agatgataca atatacgagg atgaagatgt
61 aaaagaagcc ataagaagac ttcctgagaa cctttataat gacaggatgt ttcgcattaa
121 gagggcactg gacctgaact tgaagcatca gatcttgcct aaagagcagt ggaccaaata
181 tgaagagggt agcacagctt attactcc
//
LOCUS HUMQBPC6 233 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein (QP) gene, exon 4.
ACCESSION M26730
KEYWORDS QP gene; ubiquinone-binding protein.
SEGMENT 6 of 6
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 233)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 21 97 ubiquinone-binding protein (QP), exon 4
IVS < 1 20 ubiquinone-binding protein intron C
BASE COUNT 82 a 23 c 49 g 79 t
ORIGIN
1 atatttcatt cattctttag aaaatttcta ccttgaaccg tatctgaaag aggttattcg
61 ggaaagaaaa gaaagagaag aatgggcaaa gaagtaatca tgtagttgaa gtctgtggat
121 gcagctgtta tgaagatggt taaacttgaa acaaacaatt ttaagaatta tttggtctga
181 agatgtttta ctttaaataa atgtctattg taatggctgg agtttttgaa ttc
//
LOCUS HUMQBPCA 498 bp ss-mRNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein mRNA, complete cds.
ACCESSION M26700
KEYWORDS ubiquinone-binding protein.
SOURCE Human fibroblast SV40 transformed cell line GM637, cDNA to mRNA,
clones 2 and 6.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 498)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 53 388 ubiquinone-binding protein
BASE COUNT 172 a 72 c 122 g 132 t
ORIGIN
1 agcttgtcct ctatgactta cccagaaggc aacgcttctc tttctggtca aaatggctgg
61 taagcagggc gtttcagcat caggcaagtg gctggatggt attcgaaaat ggtattacaa
121 tgctgcagga ttcaataaac tggggttaat gcgagatgat acaatatacg aggatgaaga
181 tgtaaaagaa gccataagaa gacttcctga gaacctttat aatgacagga tgtttcgcat
241 taagagggca ctggacctga acttgaagca tcagatcttg cctaaagagc agtggaccaa
301 atatgaagag gaaaatttct accttgaacc gtatctgaaa gaggttattc gggaaagaaa
361 agaaagagaa gaatgggcaa agaagtaatc atgtagttga agtctgtgga tgcaggtgtt
421 atgaagatgg ttaaacttga aacaaacaat tttaagaatt atttggtctg aagatgtttt
481 actttaaata aatgtcta
//
LOCUS HUMQBPIC 75 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein (HQPI) gene,
exon 2.
ACCESSION M37387
KEYWORDS ubiquinone-binding protein.
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept / 3 / 75 ubiquinone-binding protein (HQP1), exon 2 (AA
at 5)
BASE COUNT 22 a 11 c 21 g 21 t
ORIGIN
1 agtttcagca tcggcaagtg gctggatggt attcgaaaat ggtattacaa tgctgcagga
61 ttcaataaac tgggt
//
LOCUS HUMQBPIIC 172 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein (HQPII) gene,
complete cds.
ACCESSION M26701
KEYWORDS ubiquinone-binding protein.
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 172)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
COMMENT Intron A starts after nucleotide 98.
FEATURES from to/span description
pept 80 / 170 ubiquinone-binding protein (HQPII), exon 1
IVS 171 > 172 HQPII intron A
site 98 99 site of 2.3 kb intron
BASE COUNT 42 a 40 c 47 g 43 t
ORIGIN
1 gactgaacgc gcctgcggac tgggcccagc ttgtcctcta tgacttaccc agaaggcaac
61 gcttctcttt ctggtcaaaa tggctggtaa gcaggccgtt tcagcatcag gcaagtggct
121 ggatggtatt cgaaaatggt attacaatgc tgcaggattc aataaactgg gt
//
LOCUS HUMQBPIIIC 100 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein (HQPIII) gene,
complete cds.
ACCESSION M26702
KEYWORDS ubiquinone-binding protein.
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 80 / 98 ubiquinone-binding protein (HQPIII), exon 1
IVS 99 > 99 HQPIII intron
BASE COUNT 20 a 30 c 27 g 23 t
ORIGIN
1 gactgaacgc gcctgcggac tgggcccagc ttgtcctcta tgacttaccc acaaggcaac
61 gcttctcttt ctggtcaaaa tggctggtaa gcaggccggt
//
LOCUS HUMQBPIVC 465 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein (HQPIV) pseudogene,
complete cds.
ACCESSION M26703
KEYWORDS pseudogene; ubiquinone-binding protein.
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 465)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps 80 392 pseudo-ubiquinone-binding protein (HQPIV),
exon 1
BASE COUNT 180 a 66 c 110 g 109 t
ORIGIN
1 aaaataaaat aaaataaatt agataaataa ataaataaaa agagagaggg agaaggcaac
61 atttcacgtt tgggtcaaga tggctggcag gctggctacc gcagcatcag acaggtagct
121 ggatggtatt tgaaaatggt atgacaatgc tgcaggattc gagaaactaa agtaaatgtg
181 agatgagaca atatatgaag atgaagatgt aaaggaagcc ataagacggc tttctaacgg
241 caggatgttt cgcattagaa cactggacct atgagacagc agatcttgcc ttaatagcac
301 ggaccaaaca tgaagaggat aaattctacc atgagccata tctgaaaccg cttatttggg
361 aaagagaaga atgagaatgg tcaaagaagt catcacatat ttgaagttcg tggatgtagc
421 tgcctcaagt atttttatga aaatggttaa acctgaaaca tataa
//
LOCUS HUMQBPVC 357 bp ds-DNA PRI 04-AUG-1990
DEFINITION Human mitochondrial ubiquinone-binding protein (HQPV) pseudogene,
complete cds.
ACCESSION M26704
KEYWORDS pseudogene; ubiquinone-binding protein.
SOURCE Human leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 357)
AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
TITLE Isolation of a single nuclear gene encoding human
ubiquinone-binding protein in complex III of mitochondrial
respiratory chain
JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps 71 169 pseudo-ubiquinone-binding protein (HQPV)
BASE COUNT 159 a 51 c 64 g 83 t
ORIGIN
1 catttttagt tcccaagagc cgggcataaa caagagacct gagggaatcc tcctctctta
61 aaaaacaaag cagtggacca actatgaaga ggaaaatttc tgccttgaac tgtatctgaa
121 agatgttatt tgggaaagaa aagaaagaga agaatgggca aagaagtaat cacatggttg
181 aagtctgtgg atgcaactgt tatgaagatg gttaaacttg aaataaacaa ttttaagaat
241 tatttggtct gcagatattt tactttaaat aaatgtctat tgtaatggta aaaaaaacaa
301 aaacaaaaac aaaacaaaac aaaaacaaaa acaaaacaaa accaaaaaaa cctcagg
//GenBank-Updates@genbank.bio.net (08/07/90)
LOCUS RATKV1AA 3291 bp ds-DNA ROD 07-AUG-1990
DEFINITION Rat potassium channel-Kv1 gene, complete cds.
ACCESSION M27158 M37145
KEYWORDS potassium channel-Kv1; transmembrane protein.
SOURCE Rat brain DNA and cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 477 to 3291)
AUTHORS Swanson,R., Marshall,J., Smith,J., Williams,J., Boyle,M.B.,
Folander,K., Luneau,C.J., Antanavage,J., Oliva,C., Buhrow,S.A.,
Bennett,C., Stein,R.B. and Kaczmarek,L.M.
TITLE Cloning and expression of cDNA and genomic clones encoding three
delayed rectifier potassium channels in rat brain
JOURNAL Neuron 4, 929-939 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3291)
AUTHORS Luneau,C.J.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Computer readable copy of sequence [1],[2] kindly submitted by
R.J.Swanson, 16-AUG-1989, for release after publication.
FEATURES from to/span description
pept 761 2569 potassium channel-Kv1
BASE COUNT 741 a 905 c 897 g 748 t
ORIGIN
1 acaatacgat agcaacatga cagttatgaa gtagcaacaa aaaataaatt tagggtgtgt
61 gtgtgggggg atggtcaaca caacatgagg aactgtatta aaggtcgaga agcactgctc
121 taaaagaaaa acataggaag gactctcttc ttcaatctct tacactggaa ctcagctaaa
181 agcctttccc ttccatctaa agcagttccc accctctggc gtcccctggc tttcagtgca
241 ttcccaggtg atctgagatc ctgaagctag ctattctcat cctgttgccc tgggagtttc
301 agcactcctg gtacaacctg ttcctcgctc ctcctgctct tttcctcctc cccttcctgt
361 cttgcggcag gggccccagc tgaaccctcc agagagacac tctccactcc gtccccaaag
421 gagaaacgcc cgggggagga gcagaggagg gagccacaga gggcggaggc cggggagctc
481 agccagagag gggctgctgg aggttgcatc tgctggaagg aggtttttcc cgctgcctgg
541 taaccagctg ccagaacact gagagggaga gagagaggca gggagcaggg gcagcagctt
601 cttgacgtca ggaccagcca aggatcgggc cagctacccc ggccagccta gagtcagcgg
661 ggccctcggc tggagaggag acctacgcga ggcttggagc gtgagttggg ggtgtgggaa
721 ccggttgtct ggggcgcagc atgccccctg cccccggacc atggagatct ccctggtgcc
781 cctggagaat ggcagtgcca tgaccctcag aggaggaggg gaggcagggg caagctgtgt
841 gcagaccccc aggggagagt gtgggtgccc tccgacgtct ggactcaata atcagtccaa
901 agaaacactg ttgagggggc gcacaacgct cgaggatgcg aaccagggtg gacggccttt
961 gccccctatg gctcaggagc tgccacaacc tagaaggcta tctgctgagg atgaggaggg
1021 agaaggcgac cctggcctgg gcacagtgga ggaggaccag gctcctcagg atgcagggtc
1081 actccatcac cagcgagtcc tcataaacat ctccgggttg cgtttcgaga cgcagctggg
1141 caccctggca cagtttccca acaccctcct gggggaccca gccaagcgcc tgcactactt
1201 cgaccccttg agaaatgaat acttcttcga ccgcaaccgg cccagcttcg atggcatttt
1261 gtactactac cagtctgggg gccgcctgcg caggcccgtc aatgtctccc tggatgtgtt
1321 tgcagatgag atccgctttt accagctggg ggacgaggcc atggagcgct tccgggagga
1381 tgagggcttc atcaaggaag aggagaagcc cctgccccgc aacgagttcc agcgccaggt
1441 atggcttatc ttcgaatacc cagaaagctc tgggtccgca agagccatcg ccatagtgtc
1501 ggtcctggtc attctcatct ctatcatcac cttctgcctg gagactctgc ctgagttcag
1561 ggatgagcgg gagctgctac gccatccccc agtgccgccc cagcccccag cccctgcccc
1621 tgggatcaat ggcagcgtct ctggagcact ttcctctggc cctacggtgg ctccactctt
1681 gcctaggaca ctggccgatc cattcttcat cgtggagacc acatgtgtga tctggttcac
1741 ttttgagttg ctcgtgcgct tctttgcctg ccccagcaag gcagaattct ctcggaatat
1801 catgaacatc atcgatgtcg tggccatctt cccctacttt atcaccctgg gcaccgagct
1861 ggcagagcaa caaccagggg gagggggtca gaatgggcag caggccatgt ccctagccat
1921 cctcagggtg atccgcctgg tgcgggtgtt tcggatcttc aagctctccc gccactccaa
1981 gggactgcag atcctgggta agaccttgca agcatccatg cgggaactcg ggctactcat
2041 cttcttcctc ttcattggag tcatcctctt ctccagcgct gtctacttcg cagaggcaga
2101 caatcacggg tcccatttct ctagtatccc agatgccttc tggtgggcag tagtcactat
2161 gaccactgta ggctatggag acatgagacc catcactgta gggggcaaga tcgtgggttc
2221 actgtgcgcc atagctgggg tcctcaccat tgccctgcct gtccccgtca tcgtctccaa
2281 ctttaattac ttctatcatc gggagacaga ccacgaggag caggctgccc tgaaggaaga
2341 acaaggcaac cagaggcggg agtctgggct ggacacaggg ggtcaacgga aggtcagctg
2401 cagcaaggcc tccttttgca agactggggg ttccctggag agttctgaca gtatcagaag
2461 gggtagctgt cctctagaaa agtgtcacct caaggccaag agcaacgtgg acttgcggag
2521 gtccctgtat gccctctgtc tggacactag ccgtgaaaca gatttgtaaa gagagatcca
2581 ggcagactgg caccagtgaa gctggccaca ggggtgcccc ttgagcctgg gcatctgctt
2641 tacaccaccg agtatttaag cccacctggt cacccaccga tgtctgcctt tctttcaccc
2701 ttctcacctt cctgccctct aacttcccca ttttaactcc tctttccata acccccaggg
2761 tcgcctattt ttaacaagta tgacattcca tgacgcagga gccgttgaag tgctgagccc
2821 tcactggaag atggatgcat tcatagccag ttttctacac ccagcagagg gataatcaaa
2881 caaatttgac tttaaaaagc ttagatccca agagaatcca tagcattccc tacccgtgtt
2941 ccaaatttac tttacatggg attacatttg tgtatgggca aatattattt ttatggccga
3001 tgactgcatt ttgtacagca gtgcacatga agaatatttg ggtgtgctct taagatatac
3061 tttgaattat gaaagtgttg ggggggtcac catttcctgg gacactcagg gacaaagacc
3121 ctaaaagagt tgttcaggtt tctgtgttct caagcttctt attattctgg gatgtactat
3181 tggtgcttct ggtctaggag agagaatgtt ctagaagacc agcaaatcta atcttgtgac
3241 tgtcttcaca gaagcatttt aaggatgttg gaagaaggat ttggagaatt c
//
LOCUS RATKV2AA 2098 bp ds-DNA ROD 07-AUG-1990
DEFINITION Rat potassium channel-Kv2 gene, partial cds.
ACCESSION M27159 M37146
KEYWORDS potassium channel-Kv1; transmembrane protein.
SOURCE Rat brain DNA and cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 622 to 2098)
AUTHORS Swanson,R., Marshall,J., Smith,J., Williams,J., Boyle,M.B.,
Folander,K., Luneau,C.J., Antanavage,J., Oliva,C., Buhrow,S.A.,
Bennett,C., Stein,R.B. and Kaczmarek,L.M.
TITLE Cloning and expression of cDNA and genomic clones encoding three
delayed rectifier potassium channels in rat brain
JOURNAL Neuron 4, 929-939 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2098)
AUTHORS Luneau,C.J.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Computer readable copy of sequence [1],[2] kindly submitted by
R.J.Swanson, 16-AUG-1989, for release after publication.
FEATURES from to/span description
pept 865 > 2098 potassium channel-Kv2
BASE COUNT 358 a 588 c 698 g 453 t 1 others
ORIGIN
1 agtgtgcatc tctctctctc tctctccagc atccaggctt gctgcagccc ctagctgggt
61 aaggggtgtg atgtgagagt ggggtgggag ggggcacagg gcggggcctg ccacgtcact
121 tggagagtgt gttaaggagg aagggcagag ctgagagctg agcctgctgc tgctgttgct
181 gctgctgctg ttgctgctgc ttgctgttgc tgctgctgct ggggctgctg cgggtgcggg
241 caaggattga tgcggtgggg aggtgggtct ctcgctccgg ncccgggcag gcccagggcg
301 gttccgaggc ctgtagaaca gctcctggga agaaggtggc ggctgcaatc gcgaccttgg
361 ccagacctag ttcgtggtgg acgtagggcg gaggcggagg ccgagcccgg gcaggagtct
421 ttggcgagcc agagggaggc gcatctggcg cttcggtacc agcggcagcc ttgtgagtct
481 tgtgaggctg gagaagtgca gcaggacctg gtgagagcta gcaagtctgg agggcgcctg
541 tgggtcccga gcccagtttt aggcaccacc gagagctaag ccaagcgtct tttcgggcag
601 ccagtttcac cgcgcgacag tttgcgggtt ccaggcatct cagaaatctt gagcacggag
661 gcgcggctac tgagagccag agccacatcc cagacctagc ctggcagaga gaccagctgc
721 agggttcacc gacctaaccg ccaggtcaga gcacgggccc caccctaaag gagggcgcag
781 ccggagctgg gaagccggtg ccgcgctccg gagctcgtgt cgtgggcgcc gtcctagtgg
841 cggggagcgc accgccgagg tgacatgaga tcggagaaat ccctgacgct ggcggcgccg
901 ggggaggtcc gtgggccgga gggggagcaa caggatgcgg gtgagttcca ggaggccgag
961 ggcggcggcg gctgctgtag tagtgagagg ctggtgatca acatctctgg gctgcgctac
1021 gagacgcagc tgcgcacctt gtcgctgttc cctgacacgc tgctaggaga ccctggccgc
1081 agagtccgct tctttgaccc cttgaggaat gagtacttct ttgaccgcaa ccgacccagc
1141 ttcgacgcta tcctttatta ctaccagtcg gggggtcgcc tgcgcaggcc ggttaacgtg
1201 ccccttgaca tctttatgga agagattcgc ttctatcagt tgggagatga agccctggcg
1261 gccttccggg aggatgaggg ttgcctgccc gaaggtggtg aggatgagaa gccactcccc
1321 tcccagcctt tccagcgaca ggtctggctc ctctttgagt atccggagag ttctgggccc
1381 gcccgaggca ttgccatcgt ctcagtgttg gtcatcctca tctccattgt catcttttgc
1441 ctggagacct tgcctcagtt ccgtgcagat gggcgcggtg gaagcaacga ggggagtggg
1501 acccgcatgt ccccggcctc cagggggagc cacgaggagg aagatgaaga cgaggattcc
1561 tatgcatttc ctggtagcat tccctctggg gggttgggga ccggaggaac ttcttcattt
1621 agtactctcg ggggttcctt cttcacagac cccttcttcc tggtggaaac tctgtgtatc
1681 gtctggttca cctttgagct cctggtgcgc ttctctgcct gtcccagcaa ggcggccttc
1741 tttcgcaata tcatgaacat cattgacttg gtggccatct tcccctactt tatcaccctg
1801 ggcaccgagc tagtgcaacg tcacgagcag cagcctgtga gtggtggcag tggtcagaat
1861 gggcagcagg ccatgtccct agccatcctc agggtgatcc gcctggtccg ggtgtttcgg
1921 atcttcaagc tctcccgcca ctccaagggg ttgcagatcc tgggtaagac cttgcaagca
1981 tccatgcggg agctcgggct actcatcttc ttcctcttca ttggagtcat cctcttctcc
2041 agcgctgtct acttcgcaga ggcagatgac gttgactcgc tcttccctag catcccag
//
LOCUS RATKV3AA 2264 bp ds-DNA ROD 07-AUG-1990
DEFINITION Rat potassium channel-Kv3 gene, complete cds.
ACCESSION M31744
KEYWORDS potassium channel-Kv3; transmembrane protein.
SOURCE Rat brain DNA and cDNA to mRNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (sites)
AUTHORS Swanson,R., Marshall,J., Smith,J., Williams,J., Boyle,M.B.,
Folander,K., Luneau,C.J., Antanavage,J., Oliva,C., Buhrow,S.A.,
Bennett,C., Stein,R.B. and Kaczmarek,L.M.
TITLE Cloning and expression of cDNA and genomic clones encoding three
delayed rectifier potassium channels in rat brain
JOURNAL Neuron 4, 929-939 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2264; for [1])
AUTHORS Luneau,C.J.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Computer readable copy of sequence [1],[2] kindly submitted by
R.J.Swanson, 16-AUG-1989, for release after publication.
[2] Author Address
Merch, Sharp, and Dohme Research Labs
W42-300
West Point, PA 19486
FEATURES from to/span description
pept 508 2085 potassium channel-Kv3
BASE COUNT 436 a 684 c 695 g 449 t
ORIGIN
1 gggttcctcc aaggccagtc tggcagggta ggggttccgc ggggtctggg tagtggggtg
61 caagtcgaag gcgacccagt gacaaaagaa gggcgaggaa caggggtggc aagcggaaag
121 cggagccgcg cgcgcagtgg gcgggaaacc cgggggcggc ggcagagagg cgggcgcgcg
181 cctcgcgggg aggggtgtgg aggagggagg aggaggcgtc tggacggtgc agccgggcgg
241 gccgcgccag ccctgacaga gcctcgcggc agccgcagca acaggcggcg gggccgggct
301 cgtggggcag aggcggtggt ggggcgggga gctccaggag gagcgactcg gagctggcca
361 ccgccagcag ggaaggagag catcgccgcc tcccgcgctc ggccgccgca gccctccgcc
421 cgccaccgcg cccaccctgc gctggacccc gcgggcggcg gcgcgcgcat cctgcagagc
481 ccaggctacg ccgagctgcc gccagacatg accgtggtgc ccggggacca cctgctggag
541 ccagaagcgg cgggaggcgg cggcggggac ccgcctcagg gaggctgtgt cagtggcggc
601 ggctgcgacc gctacgaacc gctgccgccc gcgctgcccg ccgcgggcga gcaggattgc
661 tgcggggagc gcgtggtcat caacatctcc gggctgcgct tcgagacaca gctcaagacc
721 ctctgccagt tccctgagac gctgctaggc gaccccaagc ggcgcatgcg gtacttcgac
781 ccgctccgca atgagtactt ctttgaccgc aacagaccca gcttcgacgc catcctctac
841 tactaccagt ccgggggccg catccgccgg ccggtcaacg tgcccatcga catcttctcc
901 gaggagatcc gcttctacca actgggtgag gaggctatgg agaagttccg tgaggacgag
961 ggcttcctgc gggaggagga gcgacccctg ccccgccgtg acttccagcg ccaggtgtgg
1021 ctgctcttcg aataccccga gagctccggg ccggcccggg gcattgccat cgtgtcagtg
1081 ctggtcattc tcatctccat tgtcatcttc tgcttggaga cactacccga gtttcgcgac
1141 gagaaggact atcccgcctc tccgtcgcag gacgtgtttg aggctgccaa caacagcacg
1201 tcgggggcct cctctggagc ctccagcttc tcggacccct tcttcgtagt ggagaccctg
1261 tgcatcatct ggttctcctt tgagctgctg gtgcgattct ttgcttgccc cagtaaagcc
1321 accttctcca gaaatatcat gaacctgata gacattgtag ccatcatccc ttattttatt
1381 actctgggca ctgagctggc tgagcgacag ggtaatgggc agcaggctat gtcactggcc
1441 atcctgaggg tcatccgcct agtaagggtc ttccgcatct tcaagctctc ccgccattct
1501 aaggggctgc agatcctggg acagacactg aaggcttcca tgcgagagct ggggctgctc
1561 attttcttcc ttttcattgg ggtcatcctt ttctccagtg cagtctactt tgctgaggca
1621 gacgaccctt cttcgggttt taacagtatc ccggatgcct tctggtgggc cgtggtaacc
1681 atgacaactg tcggttatgg tgatatgcac ccagtgacca taggaggcaa gattgtgggc
1741 tctctttgtg ccatcgcagg tgtcttgacc attgcattgc cggttcctgt gattgtttcc
1801 aacttcaatt acttctacca ccgggagaca gaaggggaag agcaagccca gtacatgcac
1861 gtgggaagct gccagcacct ctcctcttca gcagaggagc tccgaaaagc ccggagtaac
1921 tccactctga gtaagtcgga gtatatggtg atcgaagagg ggggtatgaa ccacagcgcc
1981 ttcccccaga cccccttcaa aacgggcaac tccactgcca cttgcaccac caacaataat
2041 cccaactcct gtgtcaacat caaaaagata ttcactgatg tctaatagat gatacgattg
2101 ccattctgtg cccagtattg tgtggaacat gcccccttgg tctgtgtatg cccttgattt
2161 atacatttcc agaccattca tcaaggaaag tacatgaaga agtggaaaag cacacttcat
2221 tctccctctc cctattgctt catactgaaa caggtgcctg gttt
//
LOCUS RATKV3AB 2956 bp ss-mRNA ROD 07-AUG-1990
DEFINITION Rat mRNA for putative potassium channel subunit protein (RCK3).
ACCESSION X16001
KEYWORDS membrane protein; potassium channel protein.
SOURCE Rattus rattus.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2956)
AUTHORS Pongs,O.
TITLE ;
JOURNAL Unpublished (1989) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 2956)
AUTHORS Stuehmer,W., Ruppersberg,J.P., Schoerter,K.H., Sakmann,B.,
Stocker,M., Giese,K.P., Perschke,A., Baumann,A. and Pongs,O.
TITLE Molecular Basis of Functional Diversity of Voltage Gated Potassium
Channels in Mammalian Brain
JOURNAL EMBO J. 8, 3235-3244 (1989)
STANDARD simple automatic
COMMENT *source: developmental stage=adult;tissue=brain-cortex; *source:
clone=RCK3;
[1] Author address Pongs O., Ruhr Universitaet Bochum, Fakultaet f
Chemie , Lehrstuhl f Biochemie, Universitsstr 150, D-4630 Bochum, F
R G.
Submitted (31-JUL-1989) on tape to the EMBL Data Library
FEATURES from to/span description
pept 8 1585 put. RCK3 protein (AA 1-525)
BASE COUNT 733 a 774 c 690 g 759 t
ORIGIN
1 gccagacatg accgtggtgc ccggggacca cctgctggag ccagaagcgg cgggaggcgg
61 cggcggggac ccgcctcagg gaggctgtgt cagtggcggc ggctgcgacc gctacgaacc
121 gctgccgccc gcgctgcccg cggcgggcga gcaggattgc tgcggggagc gcgtggtcat
181 caacatctcc gggctgcgct tcgagacaca gctcaagacc ctctgccagt tccctgagac
241 gctgctaggc gaccccaagc ggcgcatgcg gtacttcgac ccgctccgca atgagtactt
301 cttcgaccgc aacagaccca gcctcgacgc catcctctac tactaccagt ccgggggccg
361 catccgccgg ccggtcaacg tgcccatcga catcttctcc gaggagatcc gcttctacca
421 actgggtgag gaggctatgg agaagttccg tgaggacgag ggcttcctgc gggaggagga
481 gcgacccctg ccccgccgtg acttccagcg ccaggtgtgg ctgctcttcg aataccccga
541 gagctccggg ccggcccggg gcattgccat cgtgtcagtg ctggtcattc tcatctccat
601 tgtcatcttc tgcttggaga cactacccga gtttcgcgac gagaaggact atcccgcctc
661 tccgtcgcag gacgtgtttg aggctgccaa caacagcacg tcgggggcct cctctggagc
721 ctccagcttc tcggacccct tcttcgtagt ggagaccctg tgcatcatct ggttctcctt
781 tgagctgctg gtgcgattct ttgcttgccc cagtaaagcc accttctcca gaaatatcat
841 gaacctgata gacattgtag ccatcatccc ttattttatt actctgggca ctgagctggc
901 tgagcgacag ggtaatgggc agcaggctat gtcactggcc atcctgaggg tcatccgcct
961 agtaagggtc ttccgcatct tcaagctctc ccgccattct aaggggctgc agatcctggg
1021 acagacactg aaggcttcca tgcgagagct ggggctgctc attttcttcc ttttcattgg
1081 ggtcatcctt ttctccagtg cagtctactt tgctgaggca gacgaccctt cttcgggttt
1141 taacagtatc ccggatgcct tctggtgggc cgtggtaacc atgacaactg tcggttatgg
1201 tgatatgcac ccagtgacca taggaggcaa gattgtgggc tctctttgtg ccatcgcagg
1261 tgtcttgacc attgcattgc cggttcctgt gattgtttcc aacttcaatt acttctacca
1321 ccgggagaca gaaggggaag agcaagccca gtacatgcac gtgggaagct gccagcacct
1381 ctcctcttca gcagaggagc tccgaaaagc ccggagtaac tccactctga gtaagtcgga
1441 gtatatggtg atcgaagagg ggggtatgaa ccacagcgcc ttcccccaga cccccttcaa
1501 aacgggcaac tccactgcca cttgcaccac caacaataat cccaactcct gtgtcaacat
1561 caaaaagata ttcactgatg tctaatagat gatacgattg ccattctgtg cccagtattg
1621 tgtggaacat gcccccttgg tctgtgtatg cccttgattt atacatttcc agaccattca
1681 tcaaggaaag tacatgaaga agtggaaaag cacacttcat tctccctctc cctattgctt
1741 catactgaaa caggtgcctg gttttgcaag tgggctgcat tctctcagct ctttttctct
1801 ctccctctct ctctccctgt ttcttaattt tgtgaacaac aaacttacat taagcttggt
1861 ttcttgtgca agccctaaag ctacatcctg agaggaaatg aaactataga gtcagagtaa
1921 ctgtgtgacc tcaaaaatca aaagatagtt gtttctttac taagtaaagg aggcagaact
1981 taaatgatgc tcctgttcgg tggacttttc agcattattt cctgactaag tgtaactact
2041 taacctctgg atgtgtgggt ggaaagtcta actagagtgt tgacgtgtaa accctctata
2101 attttttttt ttttggactg gcctttcagt ttgatctccc ctctaagaat ttcaggttca
2161 cataactttg aaccaaaagg gaaatgcccc aaatgtcctg atctgactat taactcttcg
2221 gagtttgcaa agcattttga aaacattaga cacagattcc aatgaagttg tgagttcatg
2281 cccaagtcaa tatctacgaa agtctagaaa ctcattttca gtgctgccag gagaaacaac
2341 agaaattcct gatgcatctg agagataagc ttctgcagta tcacaggaag attaaagtgg
2401 cagacacccc ttccagcgga agttactaat tcggacctga ctgatgcagt tcccatagca
2461 acccgtgttt cctgggaaac ccgaaaaagg ttgtcatggc atcttttgct ctctagcccc
2521 acccccaccc caggccgtgc gtttctacag taacctttcc agatggttcc tacttacacg
2581 acttcataag aaaaaacact atttgaataa agcgcacaaa ttaagtgaaa gactatatag
2641 aggtgaaatg aatcacaaaa tgctttttta cattttatat aaatcattga tgttatccca
2701 taacgactgg atcagaaagg aaactgtcgt ctttgaaatg ttacatgtaa ggcatgatct
2761 gaattaaatt atgtcagctg tttggtaaca atttttaaag atgcttctgt acagttcttt
2821 cactaagaaa tgacaagtca acagatgaaa cgaaaacagc aatataaaca tggtctgtac
2881 ataagcaagt gagagtcagt gtacccgaag tcttacacga taccttcaga tttcctcaat
2941 taaaaaaaaa aaaaag
//
LOCUS CEUMTFVLA 2682 bp ds-DNA ORG 07-AUG-1990
DEFINITION C.unicolor mitochondrial Phe-, Val-tRNA, 12S, and 16S rRNA genes,
complete, and Leu-tRNA gene, 5' end.
ACCESSION M35875
KEYWORDS 12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu;
transfer RNA-Phe; transfer RNA-Val.
SOURCE C.unicolor liver mitochondrion cell line DH-alpha-5 DNA, clone
PCUNMR 9.0-18.
ORGANISM Mitochondrion Cervus unicolor
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae;
Cervinae; Cervus unicolor.
REFERENCE 1 (bases 1 to 2682)
AUTHORS Miyamoto,M.M., Kraus,F. and Ryder,O.A.
TITLE Phylogeny and evolution of antlered deer determined from
mitochondrial DNA sequence
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.M.Miyamoto, 03-JUL-1990.
FEATURES from to/span description
tRNA 1 69 Phe-tRNA
rRNA 70 1024 12S ribosomal RNA
tRNA 1025 1091 Val-tRNA
rRNA 1092 2663 16S ribosomal RNA
tRNA 2664 > 2682 Leu-tRNA
BASE COUNT 1010 a 555 c 467 g 650 t
ORIGIN
1 gttgatgtag cttaaacaac aaagcaaggc actgaaaatg cctagatgag tatattaact
61 ccataaacat acaggtttgg tcccagcctt cctattaacc cttaatagac ttacacatgc
121 aagcatccgc accccagtga aaatgccccc taagttaata agactaagag gagctggtat
181 caagcacaca tccgtagctc acgacacctt gcacagccac acccccacgg gagacagcag
241 tgataaaaat taagccataa acgaaagttt gactaagcca tattaatcag ggttggtaaa
301 tttcgtgcca gccaccgcgg tcatacgatt aacccaagtt aataggcaca cggcgtaaag
361 cgtgttaaag cactatttca aataaagtta aactccaatt aagctgtaaa aagccataat
421 tgcaacaaaa ataaacaacg aaagtaactt tacaaccgct gaaacacgat agctaggacc
481 caaactggga ttagataccc cactatgcct agccttaaac acaaatagtt gtgtaaacaa
541 aactattcgc cagagtacta ccggcaatag cttaaaactc aaaggacttg gcggtgcttt
601 acacccttct agaggagcct gttctataat cgataaaccc cgataaacct caccattcct
661 tgctaataca gtctatatac cgccatcttc agcaaaccct aaaaaggtac aaaagtaagc
721 acaatcataa tacataaaaa cgttaggtca aggtgtaacc tatggaacgg aaagaaatgg
781 gctacatttt ctaatctaag aaaatccaac acgaaagtta ttatgaaatt aataaccaaa
841 ggaggattta gcagtaaact aagaatagag tgcttagttg aattaggcca tgaagcacgc
901 acacaccgcc cgtcaccctc ctcaagtagg cacaatacac tcaaatttat ttacacgtat
961 taatcatatg agaggagaca agtcgtaaca aggtaagcat actggaaagt gtgcttggat
1021 aaatcaagat atagcttaaa taaagcatct agtttacacc tagaagattt cacatatcac
1081 gaatatcttg aaccaattct agcccgcaag cccattcaca ctaaattatc aatgtaacat
1141 aaaataaaac atttattcaa caacaaaaag tataggagat agaaatttta atacggcgct
1201 atagagaaag taccgtaagg gaacgatgaa agaaaaaaat caaagtacaa aaaagcaaag
1261 attacccctt gtaccttttg cataatgagt taactagtaa aaacttaaca aaatgaattt
1321 cagctaagta ccccgaaacc agacgagcta cttatgaaca atttatcgag aaccaactca
1381 tctatgtagc aaaatagtga gaagatttgt aagtagaggt gaaacgccca acgagcctgg
1441 tgatagctgg ttgtccagaa aatgaatatt agttcagctt taaaaatacc aaaaatatga
1501 acaaattata atgtattttt aaaagttagt ctaaaaaggt acagcctttt agaaatggat
1561 acaaccttaa ctagagagta agttttaaca ttaaaccata gtaggcctaa aagcagccac
1621 caattaagaa agcgttaaag ctcaacaata aaatagtatt aatttcaata ataaatagtc
1681 aactcctaat ctaatactgg actaatctat taagaataga agcaataatg ttaatatgag
1741 taacaagaag taacttctcc ccgcataagt ttaagtcagt atctgataat actctgacta
1801 ttaacagcaa aataagaata acctaactat aaataactta ttaactatac tgttaatccg
1861 acacaggaat gcacttaagg aaagattaaa agaagtaaaa ggaactcggc aaacactaaa
1921 ccccgcctgt ttaccaaaaa catcacctcc agcataacta gtattggagg cactgcctgc
1981 ccagtgacaa ccgttaaacg gccgcggtat cctgaccgtg caaaggtagc ataatcactt
2041 gttctctaaa tagggacttg tatgaatggc cacacgaggg ttttactgtc tcttacttcc
2101 aatcagtgaa attgaccttc ccgtgaagag gcgggaatat actaataaga cgagaagacc
2161 ctatggagct ttaactactt agcccaaaga aacaaatttc attaccaagg aaacaacaac
2221 actctttatg ggctaacagc tttggttggg gtgacctcgg agaacaagaa aacctccgag
2281 cgattttaaa gactagacct acaagtcgaa tcacacaatc gtttattgat ccaaaaaatt
2341 gatcaacgga acaagttacc ctagggataa cagcgcaatc ctattcaaga gtccatatcg
2401 acaatagggt ttacgacctc gatgttggat caggacatcc cgatggtgca accgctatca
2461 aaggttcgtt tgttcaacga ttaaagtcct acgtgatctg agttcagacc ggagtaatcc
2521 aggtcggttt ctatctatta tgtatttctc ccagtacgaa aggaccagag aaataaggcc
2581 aacttcaaat aagcgcctta aattagttaa tgatatcatc ttaattaacc tcacaaacaa
2641 actttgccct agagaagggc tttgttaagg tggcagagcc gg
//
LOCUS CPDBA 76 bp ss-RNA INV 07-AUG-1990
DEFINITION C.colpoda 5.8S ribosomal RNA.
ACCESSION M35558
KEYWORDS 5.8S ribosomal RNA.
SOURCE C.colpoda ribosomal RNA.
ORGANISM Colpidium colpoda
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Van Bell,C.T.
TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
(Ciliophora: Hymenostomatida)
JOURNAL J. Mol. Evol. 22, 231-236 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 76 5.8S ribosomal RNA
BASE COUNT 21 a 14 c 20 g 21 t
ORIGIN
1 atgcgaattg cagaacacgc gagtcatcag atctttgaac gcaattggtg ggggtgtaaa
61 aaccttcatg ttcgtt
//
LOCUS ECOLIVRA 729 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli high-affinity branched-chain amino acid transport repressor
(livR) gene, complete cds.
ACCESSION M36020
KEYWORDS high-affinity branched-chain amino acid transport repressor.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 729)
AUTHORS Antonucci,T.K., Wagner,L.M. and Oxender,D.L.
TITLE Cloning, expression, and nucleotide sequence of livR, the repressor
for high-affinity branched-chain amino acid transport in
Escherichia coli
JOURNAL Proteins 1, 125-133 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 229 729 high-affinity branched-chain amino acid
transport repressor (livR)
BASE COUNT 181 a 192 c 170 g 186 t
ORIGIN
1 cggcatgtca tttcctctct ctgcaattgt gtacaaatgt acacaacctt gctaatgttg
61 tgttccctct cttatttgac tgatctatga ccgtaaattc ttcacgtaat gcattgaaac
121 gccgaacctg gcattgttta tgttcttctt tttgccaggc ctgttaatgg cgtcctgggc
181 aacccgtacg cctctatccg cgatattctc tctgtctcga tcgctgaaat gggcgtgttc
241 tctttggtct ggatctccat tgccggcgcg caggtacgtc cgcgatcatt gcgccgaaca
301 acagcgcgcg gttctttcgc aatacataag cacgaacaag aaacagataa ccacgcctgc
361 cataacacca tcaagccgaa ctcttcacat gcttggtgca cctgcgaatg taaccgaagg
421 tcggtggtta gcacgcataa agccagcata attccggcga ataacaaccc ggcagcgttc
481 cccatgccga aactgaatgt gctgaactgg aaggtgatca tcccgatcat cagcccaata
541 acaaagaagg cgcagaatgc cagcaggtca gtgacctgga ctgtgaatcg agataaagcc
601 gatgcgatcc agtgttttta cgcggcgggc atcgtggatc ttgtaaaagg tcactttgtt
661 aagcacgacg ttgtcatcta tcggcatctc aatctggcta gaatgacgcg gttaaggaag
721 caaccgtga
//
LOCUS GCHBA 76 bp ss-RNA INV 07-AUG-1990
DEFINITION G.chattoni 5.8S ribosomal RNA.
ACCESSION M35559
KEYWORDS 5.8S ribosomal RNA.
SOURCE G.chattoni ribosomal RNA.
ORGANISM Glaucoma chattoni
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Van Bell,C.T.
TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
(Ciliophora: Hymenostomatida)
JOURNAL J. Mol. Evol. 22, 231-236 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 76 5.8S ribosomal RNA
BASE COUNT 21 a 13 c 20 g 22 t
ORIGIN
1 atgcgaattg cagaatccgc gagtcatcag atctttgaac gcaagtggtg gaggtgtaaa
61 aaccttcatg tttgtt
//
LOCUS HYRMTFVLA 2680 bp ds-DNA ORG 07-AUG-1990
DEFINITION H.inermis mitochondrial Phe-, Val-tRNA, 12S, and 16S rRNA genes,
complete, and Leu-tRNA gene, 5' end.
ACCESSION M35876
KEYWORDS 12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu;
transfer RNA-Phe; transfer RNA-Val.
SOURCE H.inermis liver and spleen mitochondrion cell line DH-alpha-5 DNA,
clone PHINMR 4.0-6.
ORGANISM Mitochondrion Hydropotes inermis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae;
Odocoileinae; Hydropotes inermis.
REFERENCE 1 (bases 1 to 2680)
AUTHORS Miyamoto,M.M., Kraus,F. and Ryder,O.A.
TITLE Phylogeny and evolution of antlered deer determined from
mitochondrial DNA sequence
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by M.M.Miyamoto, 03-JUL-1990.
FEATURES from to/span description
tRNA 1 70 Phe-tRNA
rRNA 71 1027 12S ribosomal RNA
tRNA 1028 1094 Val-tRNA
rRNA 1095 2661 16S ribosomal RNA
tRNA 2662 > 2680 Leu-tRNA
BASE COUNT 1010 a 526 c 468 g 676 t
ORIGIN
1 gttaatgtag cttaaaaaat aaagcaaggc actgaaaatg cctagatgag tatatttaac
61 tccataaaca caaaggtttg gtcccagcct tcctattgac ctttaataga cttacacatg
121 caagcatccg catcccagtg aaaatgccct ccaagttaat aagactaaga ggagctggta
181 tcaagcacac acccgtagct catgacacct tgcttagcca cacccccacg ggagacagca
241 gtgataaaaa ttaagccatg aacgaaagtt tgactaagcc atattaatta gggttggtaa
301 atttcgtgcc agccaccgcg gtcatacgat taacccaagt taataggcat acggcgtaaa
361 gcgtgttaaa gcaacatata aaaatagagt taaactttaa ttaagctgta aaaagccata
421 attattaacg aaaataaata acgaaagtaa ctttacaatc gctgaaacat gatagctaag
481 acccaaactg ggattagata ccccactatg cttagcccta aacacaaata gttataataa
541 caaaactgtt cgccagagta ctaccggcaa tagcttaaaa ctcaaaggac ttggcggtgc
601 tttatatcct tctagaggag cctgttctat aatcgataaa ccccgatata cctcaccacc
661 ccttgctaat gcagtctata taccgccatc ttcagcaaac cctaaaaagg agcaaaagta
721 agcataatca taatacataa aaacgttagg tcaaggtgta acctatggag tgggaagaaa
781 tgggctacat tttctaattt aagaaaactt aatacgaaag ttattatgaa attaataacc
841 aaaggaggat ttagcagtaa actaagaata gagtgcttag ttgaactagg caatgaagca
901 cgcacacacc gcccgtcacc ctcctcaaat aagcacaata cgcttaaatt tattagtatg
961 tataaactgt atgagaggag acaagtcgta acaaggtaag catactggaa agtgtgcttg
1021 gataaatcaa gatatagctt aaataaagca cctagtttac acctagaaga tttcacacac
1081 cacgaatatc ttgaactaat tctagcccat aaatttattt acactaaact attattaata
1141 ttataaacaa agcatttatc tataattaaa gtataggaga tagaaatttt taatatgacg
1201 ctatagagaa agtaccgtaa gggaatgatg aaagaagaaa attaaagtac aaaaaagcag
1261 agattacccc tcgtaccttt tgcataatga gttaactagt aaaaacttaa caaaacgaat
1321 tttagctaag taccccgaaa ccagacgagc tacttataaa cagtttatca agaaccaact
1381 catctatgtg gcaaaatagt gagaagattt ataagtagag gtgaaacgcc taacgagcct
1441 ggtgatagct ggttgtccag aaaatgaata ttagttcagc tttaaaaata ccaaaaatat
1501 aaacaaatca cactgtattt ttaaaagtta gtctaaaaag gtacagcctt ttagaaatgg
1561 atacaacctt gactagagag taaaatttaa taacaccata gtaggcctaa aagcagccat
1621 caattaagaa agcgttaaag ctcaacaata agtttacatt aatttcaata attataaatc
1681 aactcctagc ttaatactgg actaatctat aaaaatagaa gcaataatgt taatatgagt
1741 aacaagaagt atattctccc tgcataagtt taagtcagta tctgataata tcctgactat
1801 taacagcaaa ataagaataa ctcaactata aatgacttat taattctact gttaatccaa
1861 cacaggaatg cacttaagga aagattaaaa gaagtaaaag gaactcggca aacactaaac
1921 cccgcctgtt taccaaaaac atcacctcca gcataactag tattggaggc actgcctgcc
1981 cagtgacaac cgttaaacgg ccgcggtatc ctgaccgtgc aaaggtagca taatcacttg
2041 ttctctaaat aaggactagt atgaatggcc aaacgagggt tttactgtct cttacttcca
2101 atcagtgaaa ttgaccttcc cgtgaagagg cgggaatata ataataagac gagaagaccc
2161 tatggagctt taactactta acccaaagaa acaaacttaa ccactaaggt gacaacaata
2221 ttctctatgg gttaacagct ttggttgggg tgacctcgga gaacaaaaaa tcctccgagc
2281 gattttaaag actagaccta caagtcgcat cgcacaatcg tttattgatc caaaaattga
2341 tcaacggaac aagttaccct agggataaca gcgcaatcct atttaagagt ccatatcgac
2401 aatagggttt acgacctcga tgttggatca ggacatcctg gtggtgcagc cgctatcaaa
2461 ggttcgtttg ttcaacgatt aaagtcctac gtgatctgag ttcagaccgg agtaatccag
2521 gtcggtttct atctattgtg tatttctccc agtacgaaag gaccagagaa ataaggccaa
2581 ctaaaaaaaa gcgccttaac ccaattaatg atctcatctc aattaaaatc acaaacaaac
2641 tccgccctag aaaagggctt tgttaaggtg gcagagccgg
//
LOCUS M36004 59 bp ds-DNA BAD 07-AUG-1990
DEFINITION Figure 1. Insert of plasmiod p WL144 and nucleic acid sequence of
selA-selB overlap.
ACCESSION M36004
REFERENCE 1 (bases 1 to 59)
AUTHORS Forchhammer,K., Ruecknagel,K.-P. and Boeck,A.
TITLE Purification and biochemical characterization of SELB, a
translation factor involved in selenoprotein synthesis
JOURNAL J. Biol. Chem. 365, 9346-9350 (1990)
STANDARD simple staff_review
COMMENT Secondary reference to EMBL entry X16644 for Nature 342, 453-456
(1989)
FEATURES from to/span description
BASE COUNT 14 a 9 c 18 g 18 t
ORIGIN
1 gagcaacggt ttttggagat gttgttgaaa tgattattgc gactgccgga cacgttgac
//
LOCUS M36102 611 bp ds-DNA BAD 07-AUG-1990
DEFINITION Figure 2. Nucleotide sequence of AVP-NpII precursor mRNA from
bovine hypothalamus.
ACCESSION M36102
REFERENCE 1 (bases 1 to 611)
AUTHORS Brownstein,M.J.
TITLE biosynthesis of vasopressin and oxytocin
JOURNAL Ann. Rev. Physiol. 45, 129-135 (1983)
STANDARD unannotated staff_entry
COMMENT Secondary reference to Nature 295, 299-303, accession number
J00001.
FEATURES from to/span description
BASE COUNT 92 a 227 c 205 g 86 t 1 others
ORIGIN
1 ngcacagtct acagagcagc actgcgcacg tgtgcccacg cgtgccagga tgcccgacgc
61 cacactgccc gcctgcttcc tcagcctgct ggccttcacc tctgcttgct acttccagaa
121 ctgcccaagg ggcggcaaga gggccatgtc cgacctggag ctgagacagt gtctcccctg
181 cggccccggg ggcaaaggcc gctgcttcgg gcccagcatc tgctgcgggg acgagctggg
241 ctgcttcgtg ggcacggccg aggcgctgcg ctgccaagag gagaactacc tgccgtcgcc
301 ctgccagtcc ggccagaagc cctgcgggag cgggggccgc tgcgccgccg ccggcatctg
361 ctgcaacgat gagagctgcg tgaccgagcc cgagtgccgg gaaggtgtcg gcttcccccg
421 ccgcgttcgc gccaacgacc ggagcaacgc gaccctgctg gacgggccga gcggggcctt
481 gttgctgcgg ctggtgcagc tggcgggggc gccggagccc gcggagcccg cccagcccgg
541 cgtctactga ggcgcgcccc ccccctcccc acccctgccc tcgcagcacg aaaaataaac
601 gttttaaagg c
//
LOCUS MMUMTFVLA 2682 bp ds-DNA ORG 07-AUG-1990
DEFINITION M.reevesi mitochondrial Phe-, Val-tRNA, 12S, and 16S rRNA genes,
complete, and Leu-tRNA gene, 5' end.
ACCESSION M35877
KEYWORDS 12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu-tRNA;
transfer RNA-Phe; transfer RNA-Val.
SOURCE M.reevesi liver and brain mitochondrion cell line JM109 DNA, clone
PMREMR 4.8-16.
ORGANISM Mitochondrion Muntiacus reevesi
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae;
Cervulinae; Muntiacus reevesi.
REFERENCE 1 (bases 1 to 2682)
AUTHORS Miyamoto,M.M., Kraus,F. and Ryder,O.A.
TITLE Phylogeny and evolution of antlered deer determined from
mitochondrial DNA sequence
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.M.Miyamoto, 03-JUL-1990.
FEATURES from to/span description
tRNA 1 69 Phe-tRNA
rRNA 70 1028 12S ribosomal RNA
tRNA 1029 1196 Val-tRNA
rRNA 1197 2664 16S ribosomal RNA
tRNA 2659 > 2682 Leu-tRNA
BASE COUNT 1001 a 543 c 463 g 675 t
ORIGIN
1 gttaatgtag cttaagcaat aaagcaaggc actgaaaatg cctagatgag tgtactagct
61 ccataaacat ataggtttgg tcccagcctt cctattaacc cttaatagac ttacacatgc
121 aagcatccac atcccagtga aaatgccctc taagttaata agactaagag gagctggtat
181 caagcacaca tccgtagctc acgacgcctt gctcagccac acccccacgg gagacagcag
241 tgacaaaaat taagccataa acgaaagttt gactaagcca tattaattag ggttggtaaa
301 tttcgtgcca gccaccgcgg tcatacgatt aacccaagtt aataggtata cggcgtaaag
361 cgtgttaaag cactatttcc aaataaagct aaatttcaat taagctgtaa aaagccataa
421 ttgcaacaaa aataaataac gaaagtaact ttacagttgc tgaaacacga tagctaggac
481 ccaaactggg attagatacc ccactatgcc tagccctaaa cacaaatagt ttccacaaac
541 aaaactattc gccagagtac taccggcaat agcttaaaac tcaaaggact tggcggtgct
601 ttataccctt ctagaggagc ctgttctata atcgataaac cccgatagac ctcaccattc
661 ctcgctaata cagtctatat accgccatct tcagcaaacc ctaaaaagga ataaaagtaa
721 gcgcaatcat aatacgtaaa aacgttaggt caaggtgtaa cctatgggat gggaagaaat
781 gggctacatt ttctaactta agaataattc atatacgaaa gttattatga aattaataac
841 caaaggagga tttagcagta aactaagaat agagtgctta gttgaattag gccatgaagc
901 acgcacacac cgcccgtcac cctcctcaag taagtacgat atgctcaaac ttatttacat
961 atattaacca tatgagagga gataagtcgt aacaaggtaa gcatactgga aagtgtgctt
1021 ggataaatca agatatagct taaataaagt atctagttta cacctagaag atttcactat
1081 accacgaata tcttgaacca attctagccc ataagccgat tcacactaaa ttaccaattt
1141 attataaata aaacatttac ctaccattaa aagtatagga gatagaaatt ttaatatggc
1201 gctatagaga tagtaccgta agggaacgat gaaagaaaaa aatcaaagta caaagaagca
1261 aagattaccc cttgtacctt ttgcataatg agttaactag caaaaactta acaaaatgaa
1321 ttttagctaa gtaccccgaa accagacgag ctacttatga acaatttatc gagaaccaac
1381 tcatctatgt agcaaaatag tgagaagatt tgtaagtaga ggtgaaacgc ctaacgagcc
1441 tggtgatagc tggttgtcca gaaaatgaat attagttcag ctttaaaaat accaaaaata
1501 ttaacaaatt ataatgtatt tttaaaagtt agtctaaaaa ggtacagcct tttagaaacg
1561 gatacaacct taattagaga gtaaaactta atacaaatca tagtaggctt aaaagcagcc
1621 accaattaag aaagcgttaa agctcaacaa taaaacaacc ttaattccaa caacaagtaa
1681 tcaactccta acccaatact ggactaatct attaaaaata gaagcaataa tgttaatatg
1741 agtaacaaga agtatttctc cctgcatgag tttaagtcag tatctgataa tattctgact
1801 attaacagca agataagaat aacctaccca taaataattt attaattata ctgttaatcc
1861 aacacaggaa tgcacttaag gaaagattaa aagaagtaaa aggaactcgg caaacactaa
1921 accccgcctg tttaccaaaa acatcacctc tagcattact agtattagag gcactgcctg
1981 cccagtgaca accgttaaac ggccgcggta tcctgaccgt gcaaaggtag cataatcact
2041 tgttctctaa ataaggactt gtatgaatgg ccacacgagg gttttactgt ctcttacttc
2101 caatcagtga aattgacctt cccgtgaaga ggcgggaata tattaataag acgagaagac
2161 cctatggagc tttaactact tagcccaaag aaacaaaatt tatttctaag gaaacaacaa
2221 cattctctat gggctaacag ctttggttgg ggtgacctcg gagaacaaaa aatcctccga
2281 gcgattttaa agactagacc cacaagtcaa atcacataat cgctcattga tccaaataat
2341 tgatcaacgg aacaagttac cctagggata acagcgcaat cctattcaag agtccatatc
2401 gacaataggg tttacgacct cgatgttgga tcaggacatc ccgatggtgc aaccgctatc
2461 aaaggttcgt ttgttcaacg attaaagtcc tacgtgatct gagttcagac cggagtaatc
2521 caggtcggtt tctatctatt atgtatttct cccagtacga aaggaccaga gaaataaggc
2581 caacttcaaa caagcgcctt aaattaatta atgatattat cttaattaac tatacaaaca
2641 gaccctaccc tagaaaaggg ttttgttaag gtggcagagc cg
//
LOCUS ODOMTFVLA 2677 bp ds-DNA ORG 07-AUG-1990
DEFINITION O.virginianus mitochondrial Phe-, Val-tRNA, 12S and 16S rRNA,
genes, complete, and Leu-tRNA gene, 5' end.
ACCESSION M35874
KEYWORDS 12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu;
transfer RNA-Phe; transfer RNA-Val.
SOURCE O.virginianus brain mitochondrial cell line JM109 DNA, clone POVIMR
4.1-19.
ORGANISM Mitochondrion Odocoileus virginianus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae;
Odocoileinae; Odocoileus virginianus.
REFERENCE 1 (bases 1 to 2677)
AUTHORS Miyamoto,M.M., Kraus,F. and Ryder,O.A.
TITLE Phylogeny and evolution of antlered deer determined from
mitochondrial DNA sequence
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.M.Miyamoto, 03-JUL-1990.
FEATURES from to/span description
tRNA 1 69 Phe-tRNA
rRNA 70 1022 12S ribosomal RNA
tRNA 1023 1091 Val-tRNA
rRNA 1092 2658 16S ribosomal RNA
tRNA 2659 > 2677 Leu-tRNA
BASE COUNT 1000 a 550 c 467 g 660 t
ORIGIN
1 gttaatgtag cttaaacaat aaagcaaggc actgaaaatg cctagatgag tataattact
61 ccataaacac agaggtttgg tcccagcctt cctattaatc cctagcagac ttacacatgc
121 aagtatcctc atcccagtga aaatgccctc caagtcaata aaactaagag gagctggtat
181 caagctcaca cccgtagctc atgacgcctt gctcagccac acccccacgg gagacagcag
241 tgataaaaat taagctataa acgaaagttt gactaagtca tgctaattag ggttggtaaa
301 tttcgtgcca gccaccgcgg tcatacgatt aacccaagtt aataggcaca cggcgtaaag
361 cgtgtttaag cactatgcca aatagagtta aactccaatt aagctgtaaa aagccataat
421 caagacgaaa ataaataacg aaagtaactt tacaaccgct gaaacacgat agctaagatc
481 caaactggga ttagataccc cactatgctt agccctaaac ataaatagtt atataaacaa
541 aactattcgc cagagtacta ccggcaatag cttaaaactc aaaggacttg gcggtgcttt
601 atacccttct agaggagcct gttctataat cgataaaccc cgatagacct taccacccct
661 tgctaataca gtctatatac cgccatcttc agcaaaccct aaaaaggaac aaaagtaagc
721 acaatcatta tacataaaaa cgttaggtca aggtgtaacc tatggagtgg aaagaaatgg
781 gctacatttt ctaatctaag aaaactcttt acgaaagtta ctatgaaatt agtaaccaaa
841 ggaggattta gcagtaaact aagaatagag tgcttagttg aattaggcca tgaagcacgc
901 acacaccgcc cgtcaccctc ttcaaatagg cacaatacac ttaaatttaa ttatacgtat
961 taatcatatg agaagagata agtcgtaaca aggtaagcat actggaaagt gtgcttggat
1021 aatcaagata tagcttaaac taaagcatct agtttacacc tagaagattt cacatattat
1081 gaatatcttg aactaaccct agcccacaaa tccatttata ctaaattatc aagatactac
1141 aaataaaaca tttatccact gttaaaagta taggagatag aaattttaga tatggcgcta
1201 tagagaaagt accgtaaggg aacgatgaaa gaaaaaatta aagtacaaaa aagcaaagat
1261 taccccttgt accttttgca taatgagtta actagtaaaa acttaacaaa atgaatttca
1321 gctaagtacc ccgaaaccag acgagctact tatgaacaat ttatcgagaa tcaactcatc
1381 tatgtggcaa aatagtgaga agattcataa gtagaggtga aacgcctaac gagcctggtg
1441 atagctggtt gtccaggaaa tgaatattag ttcagcttta aaaaatacca ataatataaa
1501 caaatatact gtatttttaa aagttagtct aaaagggaac agccttttag aaacggatac
1561 aaccttaact agagagtaaa acctgataat accatagtag gcccaaaagc agccaccaat
1621 taagaaagcg ttaaagctca acaataaaac tatcttaatc ccaataacaa ataatcaact
1681 cctagcctta atactggact aatctataaa aatagaagca ataatgttaa tatgagtaac
1741 aagaagtatc ttctccccgc acgagtttaa gtcagtacct gataatattc tgaccattaa
1801 cagcaaaata agaataacct aattataaat aacttactta ttatactgtt gatccaacac
1861 aggagtgcac ttaaggaaag attaaaagaa gtaaaaggaa ctcggcaaac accaaacccc
1921 gcctgtttac caaaaacatc acctccagca taactagtat tgggggcatt gcctgcccag
1981 tgacaaccgt taaacgccgc ggtatcctga ccgtgcaaag gtagcataat catttgttct
2041 ctaaataagg acttgtatga atggccaaac gagggtttta ctgtctctta cttccaatca
2101 gtgaaattga ccttcccgtg aagaggcggg aatactataa taagacgaga agaccctatg
2161 gagctttaac tacttagtcc aaagaaataa attcactaac caaggaaaca acaatattct
2221 tcatggacta acagctttgg ttggggtgac ctcggagaac agaaaatcct ccgagcgatt
2281 ttaaagacta gactcacaag tcaaatcaca caatcgctta ttgatccaaa aaattgatca
2341 acggaacaag ttaccctagg gataacagcg caatcctatt caagagtcca tatcgacaat
2401 agggtttacg acctcgatgt tggatcagga caccccgatg gtgcaaccgc tatcaaaggt
2461 tcgtttgttc aacgattaaa gtcctacgtg atctgagttc agaccggagt aatccaggtc
2521 ggtttctatc tattacgtat ttctctcagt acgaaaggac cagagaaata gggccaactt
2581 agaacaagcg ccttaaatta actaatgatt tactcttaat tagatacaca aacaaatctc
2641 gccctagaaa agggctttgt taaggtggca gagcccg
//
LOCUS SHPTGA 470 bp ss-mRNA MAM 07-AUG-1990
DEFINITION Ovine thyroglobulin mRNA, 3' end.
ACCESSION M36148
KEYWORDS thyroglobulin.
SOURCE Ovine, cDNA to mRNA.
ORGANISM Ovis sp.
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 470)
AUTHORS Durand,J., Malthiery,Y., Chabaud,O. and Lissitzky,S.
TITLE L'extremite C-terminale de la thyroglobuline ovine presente de
fortes homologies interspecifiques
JOURNAL C R Soc Biol (Paris) 181, 258-266 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 385 thyroglobulin (AA at 2)
mRNA < 1 470 thyroglobulin mRNA
BASE COUNT 122 a 140 c 112 g 96 t
ORIGIN
1 ggggcagttt actctggagg agaaaagcct ttccctgaaa atcatgcagt atttttccaa
61 cttcatccga tccggaaatc ccaactaccc tcatgagttc tccaggagag cgcctgaatt
121 cgcagcccct tggcctgact tcgtcccccg tgatggcgca gagagctaca aggagctcag
181 tgtcctgctc cccaaccgac agggcctgaa aaaagccgac tgctccttct ggtccaagta
241 catccagtct ctgaaggcct cagcagatga aactaaggac gggctgttag cagagagcga
301 agaggaggac cagctggctg gctctgggct gacagaggac ctcctgggtc tcccagagtt
361 agcctccaag acctacagca agtgaccagc cccatcccaa atgccacctt ggacacctta
421 ttctccaaca tagccgctat caataaagtg tctacatgtc agaaaaagtc
//
LOCUS TELBA 75 bp ss-RNA INV 07-AUG-1990
DEFINITION T.leucophrys 5.8S ribosomal RNA.
ACCESSION M35555
KEYWORDS 5.8S ribosomal RNA.
SOURCE T.leucophrys ribosomal RNA.
ORGANISM Tetrahymena leucophrys
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Van Bell,C.T.
TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
(Ciliophora: Hymenostomatida)
JOURNAL J. Mol. Evol. 22, 231-236 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 75 5.8S ribosomal RNA
BASE COUNT 22 a 13 c 20 g 20 t
ORIGIN
1 atgcgaattg cagaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa
61 accttcatgt ttgta
//
LOCUS TEPBA 74 bp ss-RNA INV 07-AUG-1990
DEFINITION T.paravorax 5.8S ribosomal RNA.
ACCESSION M35556
KEYWORDS 5.8S ribosomal RNA.
SOURCE T.paravorax ribosomal RNA.
ORGANISM Tetrahymena paravorax
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE 1 (bases 1 to 74)
AUTHORS Van Bell,C.T.
TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
(Ciliophora: Hymenostomatida)
JOURNAL J. Mol. Evol. 22, 231-236 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 74 5.8S ribosomal RNA
BASE COUNT 21 a 14 c 18 g 21 t
ORIGIN
1 atgcgaattg cagaatccgc gagtcatcag atctttgaac gcaagtggtg aaggctaaaa
61 ccttcatgtt tgtt
//
LOCUS TEUBA 75 bp ss-RNA INV 07-AUG-1990
DEFINITION T.patula 5.8S ribosomal RNA.
ACCESSION M35553
KEYWORDS 5.8S ribosomal RNA.
SOURCE T.patula ribosomal RNA.
ORGANISM Tetrahymena patula
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Van Bell,C.T.
TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
(Ciliophora: Hymenostomatida)
JOURNAL J. Mol. Evol. 22, 231-236 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 75 5.8S ribosomal RNA
BASE COUNT 22 a 13 c 19 g 21 t
ORIGIN
1 atgcgaattg caaaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa
61 accttcatgt ttgtt
//
LOCUS TEXBA 75 bp ss-RNA INV 07-AUG-1990
DEFINITION T.vorax 5.8S ribosomal RNA.
ACCESSION M35554
KEYWORDS 5.8S ribosomal RNA.
SOURCE T.vorax ribosomal RNA.
ORGANISM Tetrahymena vorax
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Van Bell,C.T.
TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
(Ciliophora: Hymenostomatida)
JOURNAL J. Mol. Evol. 22, 231-236 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 75 5.8S ribosomal RNA
BASE COUNT 22 a 13 c 20 g 20 t
ORIGIN
1 atgcgaattg cagaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa
61 accttcatgt ttgta
//
LOCUS TEYBA 75 bp ss-RNA INV 07-AUG-1990
DEFINITION T.pyriformis 5.8S ribosomal RNA.
ACCESSION M35552
KEYWORDS 5.8S ribosomal RNA.
SOURCE T.pyriformis ribosomal RNA.
ORGANISM Tetrahymena pyriformis
Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora;
Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Van Bell,C.T.
TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
(Ciliophora: Hymenostomatida)
JOURNAL J. Mol. Evol. 22, 231-236 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 75 5.8S ribosomal RNA
BASE COUNT 22 a 13 c 20 g 20 t
ORIGIN
1 atgcgaattg cagaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa
61 accttcatgt ttgta
//
LOCUS WHTRDNA 140 bp ds-DNA PLN 07-AUG-1990
DEFINITION Wheat ribosomal RNA spacer DNA.
ACCESSION M36062
KEYWORDS .
SOURCE Wheat seedling DNA, clone pTA250.
ORGANISM Triticum sp.
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 140)
AUTHORS Appels,R. and Dvorak,J.
TITLE The wheat ribosomal DNA spacer region: Its structure and variation
in populations and among species
JOURNAL Theor. Appl. Genet. 63, 337-348 (1982)
STANDARD simple staff_review
BASE COUNT 15 a 49 c 42 g 34 t
ORIGIN
1 ggatccgcgt caacttccgt ccacgacggc cgccgggcct tttttcccgt gcccacgtac
61 agcccgttca cgggtccgtg taagggtccg tgtacgcggc gtctgccacg tacgtggttt
121 tgcccggttt tccatggcgc
//
LOCUS HUMINT01 42 bp ss-mRNA PRI 07-AUG-1990
DEFINITION Human leukocyte adhesion glycoprotein p150,95 mRNA, exon 1.
ACCESSION M29165 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 1 of 7
SOURCE Human cell line HL-60, cDNA to mRNA, clone lambda-X47.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 42)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE cDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD simple automatic
FEATURES from to/span description
mRNA < 1 > 42 P150,95 mRNA, exon 1
BASE COUNT 9 a 15 c 10 g 8 t
ORIGIN 1 bp upstream of EcoRI site; chromosome 16p11-13.1.
1 gaattcctgc cactcttcct gcaacggccc aggagctcag ag
//
LOCUS HUMINT02 3690 bp ds-DNA PRI 07-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
2 - 6.
ACCESSION M29482 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 2 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3690, exons only)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3690)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept 1028 1064 integrin alpha subunit precursor, exon 2
(first expressed exon)
1666 1771 integrin alpha subunit precursor, exon 3
2391 2494 integrin alpha subunit precursor, exon 4
2795 2865 integrin alpha subunit precursor, exon 5
3020 + 3131 integrin alpha subunit precursor, exon 6
sigp 1028 1064 integrin alpha subunit signal peptide
1666 1685 integrin alpha subunit signal peptide
matp 1686 1771 integrin alpha subunit
2391 2494 integrin alpha subunit
2795 2865 integrin alpha subunit
3020 + 3131 integrin alpha subunit
pre-msg < 1 > 3690 P150,95 mRNA and introns
IVS < 1 975 P150,95 intron A
IVS 1065 1665 P150,95 intron B
IVS 1772 2390 P150,95 intron C
IVS 2495 2794 P150,95 intron D
IVS 2866 3019 P150,95 intron E
IVS 3132 > 3690 P150,95 intron F
BASE COUNT 764 a 1050 c 1020 g 856 t
ORIGIN 1 bp upstream of BamHI site; chromosome 16p11-13.1.
1 ggatcccttg ggcccaggag ttcgaagcag cagtgaacta tgcacccact gcactccagc
61 ctgggtggca gagcaagacc ctgtttctga aattaaaaaa aaaaattgat gtacattagg
121 gggcttccac ggcctgagct gcttcccctt gctttcctcc cagtggccct gaccttgtct
181 cttacaactt cccaccctga ctgtctggtt acccattgct gatttcacac acagaccctc
241 ctgtaccctg cctcatccat gtctggctgc tctgtcatct cccaactttg gttgctttca
301 atgctcagct caagcaccac ctctttcagg aagccttctc agaaagccac accttcacaa
361 cccgggtgag gcaccctgtg gtctctgtgc ttccccctca cagcaatgaa cttgctgttt
421 atacatctgc ctctccactg accccagggc tggtgctttg tggtttatat tttcttcccc
481 acctagcaga gggcttgcat ctccaggctc aaattaggct tcttgaataa atgatgaata
541 aatgagtgaa tgaatgaatg aacaaatact cgctctgtgc tcctcctagg gacccggatc
601 ccccactcct tggcccagac tttccaggtc agagtggagg cctcccacca gggtttcctt
661 taggggtcct gaggggtggg catctgccca aaccccctcc agtctggctg aaatttcaag
721 gtcaaggggt ccttctggca gtcaagggtg agcctgggag gggcagggca gggatttgca
781 tccatctaag caaagggcat caagccaagt catctgatga gagtgactcc ggttgggggg
841 tgggggcgtg tgggagccga gcctgtcctc ggatcagttg cgtactctgc ccgccccctc
901 tgactcatgc tgacaatctt cttccttccc ctggccacct ctctgcccac ttgcttcctc
961 agtaccttgg tccagctctt cctgcaacgg cccaggagct cagagctcca catctgacct
1021 tctagtcatg accaggacca gggcagcact cctcctgttc acaggtgagc ctggacccca
1081 atgaagtagg gctggggacc caggcccaag ggagccaggg ccctgaactg ggggctcagg
1141 ctggggggtt aggatctggg taggaagaga gactcagtca agcctgaggg ggaggcaggc
1201 acatagggtt tgagatttgg agtttgtgga gggagaggat attgatgaac caattttggg
1261 agagttccag agatgctgga agagaggcca gttgtctctg tactgcagag atttttaaaa
1321 taggcagaat gcgccaactt gtgctctgtg gacaggatgc tttggtccgc aagttttcct
1381 ggacgcactc tcatagcgcc cgaggtgcac gttggggaaa gatccttttt agagcctggg
1441 tactgctctg cagaaatgga gaactgcaac tcgatagtgg atggtgggca aggggcatcc
1501 ctggaccctg ggaaggagag aaggggatga gttgggtgtc cagaagaccc aggcaccccg
1561 ggcatcaggc tcggagggga gattgggacg ctggggccgg gggtggaggg cagccaggca
1621 gaaggaagac ccttctccaa agctctcttc ccacctcttt cccagcctta gcaacttctc
1681 taggtttcaa cttggacaca gaggagctga cagccttccg tgtggacagc gctgggtttg
1741 gagacagcgt ggtccagtat gccaactcct ggtgaggccc aggtggtgct cctttggctc
1801 catccatcct ctccctgctc aggccccatc cccccggccc tgccctgtta tttgcaaact
1861 ctcctctctg tctggtgtag cgactgccct ggctaatgaa gatttgcctt gaaggcaggc
1921 acggtctcac agctaacatt tacagagcag taagtgcagt gccaggctca tcacaggtgg
1981 atgctgattt agtccacacg acagcctgtg agtaggaatc agtcgtgcaa caaacactta
2041 tttgtttttt ctttcttttt ttctatacat ttaaaaatat atagagacag ggtctcacta
2101 tgttgcctgg gttggtctca aactcctggg ctcaagcaat cctcccgcct cagcctccca
2161 aagtgctggg attccaggtg tgagccacca cacccagact caacaaatat ttcttgtctc
2221 catacgccag agaatccaac agacagaaat cccttccaca tggactttaa attattaaaa
2281 tccatcttgc agatgaggaa gctgaggctc agggagggaa cgcaaacttg ccggagtggc
2341 agctgtcggc gtccacactc ttacctaaag tgttctttgt ctcctcgcag ggtggtggtt
2401 ggagcccccc aaaagataac agctgccaac caaacgggtg gcctctacca gtgtggctac
2461 agcactggtg cctgtgagcc catcggcctg cagggtgagt caccgcccct cccgggaccc
2521 agggccgggc tcccaggctt ccctgctcca ggggcccgtg gactcccgga gtgtcacttt
2581 cagcttccct gtgtctgaga ccctcaccct cagatatgct tcctggcccc ttaaggcctc
2641 cccgcccatc gcactcccgc agctctgtca agacccgaca gcttccttca ccgtcagacc
2701 tccttgtctc ccaggtggag gtgacccctg cccagctctt ccacagcctt ctctgtaggg
2761 cccgagagtg accatgcaca tatctgtccc acagtgcccc cggaggccgt gaacatgtcc
2821 ctgggcctgt ccctggcgtc taccaccagc ccttcccagc tgctggtgag tggccctggg
2881 tcacaggagg cttctgaggg agggagggag gagccggggc cgccgggggc tgggactctc
2941 ctgtagggtg gaggttccgg catctgaggg tgggaggtac atgccaggga gtgcccccag
3001 cagcccgctg tgtccccagg cctgcggccc caccgtgcac cacgagtgcg ggaggaacat
3061 gtacctcacc ggactctgct tcctcctggg ccccacccag ctcacccaga ggctcccggt
3121 gtccaggcag ggtgagtgtc gggaccacca aggctttgag gagctcacgc acatccaatt
3181 gggggtgcgg tgggctagag acagtcttgc cagagtggat cagaaagaag ggatctggaa
3241 aaagagttac ctcgtgttgc agtggttcct gacgctgctg cccgcacatc ctgccgatcg
3301 ccgcacgctg ccggaccttt cctgtgacct taacctctcc aagcctcagt ttcttcatct
3361 gttggatggg gataataaca cacccagcac tgaaagcaac acaggatgat tcatggccag
3421 gggttagcac agcagctagc accaggcgac acccatgccg gccagctgtt gttattttta
3481 gaggagagga ctattttcat ccaatgggtc ctgggatatg accaattggt ttgtgccgta
3541 gtttaggaaa ggtcagtgaa agtgcagtgt gagcaacgtg tgtgtgtaca tgtgtgtata
3601 tgtatgcatg tgtatacatg tgcacatgca catgtacatg catgtgtgtg catgtatgtg
3661 tgtgtgtgca tgtgcatgca ggttgagacg
//
LOCUS HUMINT03 4863 bp ds-DNA PRI 07-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
7 - 15.
ACCESSION M29483 Y00093
KEYWORDS integrin; protein p150,95.
SEGMENT 3 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 294 to 3967; exons only)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 4863)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 294 424 integrin alpha subunit precursor, exon 7
511 656 integrin alpha subunit precursor, exon 8
901 1054 integrin alpha subunit precursor, exon 9
1659 1809 integrin alpha subunit precursor, exon 10
2428 2501 integrin alpha subunit precursor, exon 11
2665 2794 integrin alpha subunit precursor, exon 12
3207 3349 integrin alpha subunit precursor, exon 13
3531 3671 integrin alpha subunit precursor, exon 14
3760 + 3969 integrin alpha subunit precursor, exon 15
matp + 294 424 integrin alpha subunit
511 656 integrin alpha subunit
901 1054 integrin alpha subunit
1659 1809 integrin alpha subunit
2428 2501 integrin alpha subunit
2665 2794 integrin alpha subunit
3207 3349 integrin alpha subunit
3531 3671 integrin alpha subunit
3760 + 3969 integrin alpha subunit
pre-msg < 1 > 4861 P150,95 mRNA and introns
IVS < 1 293 P150,95 intron F
IVS 425 510 P150,95 intron G
IVS 657 900 P150,95 intron H
IVS 1055 1658 P150,95 intron I
IVS 1810 2427 P150,95 intron J
IVS 2502 2664 P150,95 intron K
IVS 2795 3206 P150,95 intron L
IVS 3350 3530 P150,95 intron M
IVS 3672 3759 P150,95 intron N
IVS 3970 > 4863 P150,95 intron O
BASE COUNT 947 a 1358 c 1460 g 1097 t 1 others
ORIGIN Chromosome 16p11-13.1.
1 acctgtgatc gccccctcgc ctcccaaagt actgggatta cacggtgagc caccacgcct
61 ggctcaatca cagcctcttt aggcaacttt aagagaatga agggccttgt tccaggcaag
121 gggttaggga acgtctgccc ctgatgagga gaggacccag ggtgtggagc ctgactccca
181 tcgccagact aggggcttag ggaggaaggg ttttggagag tgagctcttg caggagccac
241 ggtcctggac tccaggagtg tcacttggag gacggtgcca cctccttccc cagagtgccc
301 aagacaggag caggacattg tgttcctgat cgatggctca ggcagcatct cctcccgcaa
361 ctttgccacg atgatgaact tcgtgagagc tgtgataagc cagttccaga gacccagcac
421 ccaggtgtgc tttgggggag ggaggctgct gggggtgggt gcttggatcc tggtgatagg
481 cctcagccca gccctgtgtg cttctcccag ttttccctga tgcagttctc caacaaattc
541 caaacacact tgactttcga ggaattcagg cgcacgtcaa accccctcag cctgttggct
601 tctgttcacc agctgcaagg gtttacatac acggccaccg ccatccaaaa tgtcgtgtga
661 gtcctgattt cttccaggca cagtcccaaa gcacccaggt cttcccttgg cctcatctga
721 tctccacgag aaggggacag gcagggacca aaatccagcc cgtgataccc ttgccaagct
781 ggggcctctg ggtgggactg gggcctccca aaggaaaagg catcttctaa ttttcacaag
841 ggcaccaggg gctagtgtgg tttggttcac aggcctctaa gacctctcct ttcctgatag
901 gcaccgattg ttccatgcct catatggggc ccgtagggat gccaccaaaa ttctcattgt
961 catcactgat gggaagaaag aaggcgacac gctggattat aaggatgtca tccccatggc
1021 tgatgcagca ggcatcatcc gctatgcaat tggggtaggc ctgggatggc ttcccacttc
1081 tcccacggct tcctctcagg gcaactcccc tttctgtgta tgatgttctt ttctctttga
1141 gacagggtct tgctctatca cccaggaagt ggtgcaatcc tagctcactg cagccttgaa
1201 ctcctgggct ccagtgatcc tcccaccccg cctcccagta gtcgggacca caggtgtgtg
1261 ccatcaagcc tggctatttt ctttttggtt gagatggggt cttgctatgt tgcccaggct
1321 ggtctcaaat tcctggcctt aagcaattct gccaccttgg tctcccaaag gcacagggga
1381 ttacaggcgt gaaccaccgc caacaacatc cctttcaagg atagaaacac cagctctctc
1441 ggctcttact gccttaagga tgaaaactct gccccagact ggagaccatg atgatccttt
1501 ctcctaaact ccctgatgct gtccgggctt cgtgtttctc ctgtgtccac cgggtgtgat
1561 catgttgatc ttgtggggtt attggaagat gttgcaccca gtgcacacag gcacatttga
1621 tttattattt ttactgagtt gatcttttct ggggacaggt tggattagct tttcaaaaca
1681 gaaattcttg gaaagaatta aatgacattg catcgaagcc ctcccaggaa cacatattta
1741 aagtggagga ctttgatgct ctgaaagata ttcaaaccca actgagggag aagatctttc
1801 ccattgaggg tgagtctgaa gggagctctt cgcttgggga atcctcagcc gttaacacct
1861 ttccacttag aacccgaggc tccgtgaaac aggtagacag cgtctcggtt ctcctgcttt
1921 cccgggaccc cgatagccat gtctgtcagc ttgtccccac tgacgtcccc cagcactgtc
1981 agagctgccc caaagtggcc ccagggatgg ccctgctccc cacagagagt gatctcacac
2041 caccaccggc tccactgcag aacaaaagca gtccaggccc aacccaggag acccttccac
2101 ccacaccggg ccctacccag cccacatccc accagccact cactcccctg ggcaaggggc
2161 acacggacac ctggccccct cggtctgctt gtagacctgt ggggggccct gatgaggacc
2221 agatcggtgc tgccatcgct gtccacatcc atggagcaga ggggggcccc gaagtcggag
2281 ctgatctgga ggcagagcct ggtccctgtc acaggcacca gctctccctg tagcctccag
2341 tcttagcttc tcctaaagct gaagtgttct tggacctggc aaagcccgtc tccctccctg
2401 gcactcaagc gtcatgcctt accccaggta cggagaccac aagcagtagc tccttcgaat
2461 tggagatggc acaggagggc ttcagcgctg tgttcacacc tgtgcgtggg gccccttagg
2521 ccgatgatgt gccgtgaggg gagggggggc agggaaggcc agggtgggtg tcaggtgggt
2581 aagaggcgca aggcggaagg catatctctg gtcatgctgt cttcctgctc tcggctctgc
2641 tcagccctgg aatcctttct ccaggatggc cccgttctgg gggctgtggg gagcttcacc
2701 tggtctggag gtgccttcct gtacccccca aatatgagcc ctaccttcat caacatgtct
2761 caggagaatg tggacatgag ggactcttac ctgggtgaga aacagccagg ggttggggac
2821 aggtgggaga tgcactgccc agggtggggt ccagggttct ggggaagggg taggggnatg
2881 ggggctgtgc tgcccagtgt ggggcccagc ttctggggag ggaggatggg cactgtgctg
2941 cccggggtgg gttccagggt tctggggagg gggaatgggg gctgtgctgc ctggggtggg
3001 aatccagggt tctggggaga ggggatgggc gctgtgctgc ctggggtggg ttccagggtt
3061 ctggggagag aggatggggg ctgcattgcc cagggtgggg tccagggttc tggggagggg
3121 agatggtgct gtgctgcccg gggtgggaat ccagggttct ggggaggggg aatgggggcc
3181 tttgtgctga ggcctgggcc cctcaggtta ctccaccgag ctggccctct ggaaaggggt
3241 gcagagcctg gtcctggggg ccccccgcta ccagcacacc gggaaggctg tcatcttcac
3301 ccaggtgtcc aggcaatgga ggatgaaggc cgaagtcacg gggactcagg ttgggcgtga
3361 caggagccac aggccgggaa ttcagggtag gggaggtggc tgggcagaga agaggatgga
3421 ggggctttga gggccttggg ggaggtcctg gtacctgggg agaggtggga cctggcccac
3481 agggctgcct ctggcaggga caggcagcat gacccagctc tgcccttcag atcggctcct
3541 acttcgggcc ctccctctgc tccgtggacg tagacagcga cggcagcacc gacctggtcc
3601 tcatcgggcc cccccattac tacgagcaga cccgaggggc ccaggtgtct gtgtgtccct
3661 tgcccagggg ggtgagtggc tgatgggcct ggtgtgtgtg gggtctggtg tgggtgaggg
3721 gttgcccggg ttgggcctgg cactgttttt tttctgcagt ggagaaggtg gtggtgtgat
3781 gctgttctct acggggagca gggccacccc tggggtcgct ttggggcggc tctgacagtg
3841 ctgggggatg tgaatgggga caagctgaca gacgtggtca tcggggcccc aggagaggag
3901 gagaaccggg gtgctgtcta cctgtttcac ggagtcttgg gacccagcat cagcccctcc
3961 cacagccagg tgaggccgtg tcccatttct gtcactagag cagcctgctt cttgcctctc
4021 ccactctgtc atactggaaa actgtccctt tttacctttt cctacctccc ttgcccagct
4081 ctgagcacct tgtagcagtg gcgtggtctc agctcactgc aacctccgcc tcccaggttc
4141 aagcgattct ctctgcctca gcctccagag tagctgggat tacaggcatg caccaccatg
4201 tccggttatt ttttgtattt tagtagagac acgtttcgcc atgttggcta ggctggtctt
4261 gaactcctga cctcaggtga tctgcctgtc tcggcctccc aaagtgctgg gattataggc
4321 gtgagccgcc atgcccaggc ccctgccagt tttacaaggt acacaggtca ggcacagaaa
4381 acccatttta cagatggaat ctgggacact aggaagacaa gggccttggt ttgttggagg
4441 ttcagagtgg gtccgagatg gtgaaggaac tccggcctcc tgacctctaa cccggtgtgc
4501 agtctcccgg ctccctgctg ctcaccactt aggtccagtc atttcaacct ccctccacct
4561 gcccctctcc tccctggatg ctacatgatt ttattccctt cctgccatca aggtcccacc
4621 aaatgcccat ccctgcagcc tccctccacc ccaagggtag cagggttccc tgagaacgaa
4681 gggctgcctt tcttggcaaa agtcaagaaa gctctgttaa aaaataggca aagggcctgc
4741 tccctggtgg ctcacatctg taattccgac actttgggag gctgaggcag gaggatcact
4801 tgaggccagg agtttcaggc cagctgggca acataggggg accccatctc tagaaaaaat
4861 ttt
//
LOCUS HUMINT04 2746 bp ds-DNA PRI 07-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
16 - 21.
ACCESSION M29484 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 4 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 382 to 2672; exons only)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2746)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 382 512 integrin alpha subunit precursor, exon 16
628 791 integrin alpha subunit precursor, exon 17
924 1078 integrin alpha subunit precursor, exon 18
1656 1787 integrin alpha subunit precursor, exon 19
1875 1948 integrin alpha subunit precursor, exon 20
2531 + 2672 integrin alpha subunit
matp + 382 512 integrin alpha subunit
628 791 integrin alpha subunit
924 1078 integrin alpha subunit
1656 1787 integrin alpha subunit
1875 1948 integrin alpha subunit
2531 + 2672 integrin alpha subunit
pre-msg < 1 > 2746 P150,95 mRNA and introns
IVS < 1 381 P150,95 intron O
IVS 513 627 P150,95 intron P (no splice consensus)
IVS 792 923 P150,95 intron Q
IVS 1079 1655 P150,95 intron R
IVS 1788 1874 P150,95 intron S
IVS 1949 2530 P150,95 intron T
IVS 2673 > 2746 P150,95 intron U
BASE COUNT 602 a 821 c 746 g 577 t
ORIGIN Chromosome 16p11-13.1.
1 gaattcctat cctgagcatg gctaaactct gagctaatag tatcattata gaaagatgag
61 gaaacggagg cacagacaga ttgagtcctt gcccacggcc tcgtggctca tacgtggagg
121 agtcagaatt ggaactagag actgatcgaa tgaatgacac tcgggtcacc aggacacctt
181 cctatctcca ctcttacatc tgtttcttag caatcatctc ccaactccta cctcctcttt
241 tcaggttctt cttggtgaca tctgttacaa ctcacccctt ctctcccttt ccgatggtcc
301 tacctccata ttccccttgt tacttatttc caacttcttc cctagtttcc atcttgattc
361 acccttctct cctctggcca gcggatcgcg ggctcccagc tctcctccag gctgcagtat
421 tttgggcagg cactgagcgg gggtcaagac ctcacccagg atggactggt ggacctggct
481 gtgggggccc ggggccaggt gctcctgctc aggtgagagc agactttctc agaggctccc
541 catgtggtcc taggttcaga tgggggtgcc cacccacgtg gtgctcccac cagcgacggc
601 tgtcctcagc tcggtgctct gcccgcagac cagacctgtg ctctgggtgg gggtgagcat
661 gcagttcata cctgccgaga tccccaggtc tgcgtttgag tgtcgggagc aggtggtctc
721 tgagcagacc ctggtacagt ccaacatctg cctttacatt gacaaacgtt ctaagaacct
781 gcttgggagc cgtgagtccc ctcccctcca acccaggaca ccctgacctc tggagtcccc
841 catcccaggc ccctgtctcc caccctgctc attgtccacc caaggagttc ctgtctcaac
901 gccgtccctg cgaccgccta caggtgacct ccaaagctct gtgaccttgg acctggccct
961 cgaccctggc cgcctgagtc cccgtgccac cttccaggaa acaaagaacc ggagtctgag
1021 ccgagtccga gtcctcgggc tgaaggcaca ctgtgaaaac ttcaacctgc tgctcccggt
1081 gcgtctgggc atgaacgtgg gtggcggccg cgctggggct ggcagaaggc agggcaggga
1141 gagaacaggc tgtgttccgg cctccctgtg gctcagccca gcacaggacc agccatgcag
1201 gacgtgctta ctgcacgtta gccagtgagt gagtgagcga gcaaacaagt gatgagatcg
1261 tctgcaattt ccagggccac acgattggat ttcaggaaag agaattgggc aacctgagag
1321 agctctgggc ttaccttctg gcttttcagg cattcactga cagggttatc gagctgctcc
1381 tggagacagc cttgcctggg ccatgggcat aggtggccaa aacagtcatt gctgatcggg
1441 aggtctgggg gggggaggaa aaaaacaaag acaaacaagg ggagaggaca gagagggtgt
1501 cagggaggca tcctgaaggc ggtgacgctg agcaggctct ggaggaagtg aagcagagcg
1561 ggagctgggc agaggcagga taagaactgc ggatgaggcc gagcgcagct cttaccctcc
1621 ccttaccctc gctccccgcg acgcccgtcc cccagagctg cgtggaggac tctgtgaccc
1681 ccattacctt gcgtctgaac ttcacgctgg tgggcaagcc cctccttgcc ttcagaaacc
1741 tgcggcctat gctggccgcc gatgctcaga gatacttcac ggcctccgtg agtcctggca
1801 ctgggtctcc cagagagggt gcacagcgtg gggcctgggt ctcggagaaa accccccgtt
1861 gccttcccac gcagctaccc tttgagaaga actgtggagc cgaccatatc tgccaggaca
1921 atctcggcat ctccttcagc ttcccagggt gagcgcccca ccttagatgc cctactgccc
1981 cagcctcctt cctggaatct gggactcctg cctctgctct ccctaacatt gtctcatcct
2041 atagtcaaaa cccaggtgtc ttggctgggc acagtggctc actcctgtaa tccagcactt
2101 tgggaggccg aggtgggagg acttttgagg ccaggagtta gggttacgac ctgggcaaca
2161 gagcgacacc catttccaca aaaacaaaac aacaacaaca acaacaacaa caacaacaac
2221 aacaacatca cttgagtgtg gtagagcatg cctatagtcc cagctacttg ggaggctgaa
2281 gcttaaggct tgcttgagct ctggagttgg aggtctgcag tgagccataa tcacaccact
2341 gcactccagc ctgggtgaaa gagcaggact ctgtctctta aaaaaaaaga agaagaagaa
2401 gaagaagaag aagaacccag gggtccgtcc cctgtctatc tcccaaatcc ccacccaccc
2461 cattttatcc cagaccattt ctagcctcag tcacagaatc atcttatcct ttccttcacc
2521 tgatacccag cttgaagtcc ctgctggtgg ggagtaacct ggagctgaac gcagaagtga
2581 tggtgtggaa tgacggggaa gactcctacg gaaccaccat caccttctcc caccccgcag
2641 gactgtccta ccgctacgtg gcagagggcc aggtgcacct ctggggaagg aggaggaggc
2701 agggctgggc gttagcgtag attcccgtgc gggttcagaa cccggg
//
LOCUS HUMINT05 1006 bp ds-DNA PRI 07-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
22 - 24.
ACCESSION M29485 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 5 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 202 to 665)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1006)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 203 319 integrin alpha subunit precursor, exon 22
404 483 integrin alpha subunit precursor, exon 23
582 + 665 integrin alpha subunit precursor, exon 24
matp + 203 319 integrin alpha subunit
404 483 integrin alpha subunit
582 + 665 integrin alpha subunit
pre-msg < 1 > 1006 P150,95 mRNA and introns
IVS < 1 202 P150,95 intron U
IVS 320 403 P150,95 intron V
IVS 484 581 P150,95 intron W
IVS 666 > 1006 P150,95 intron X
BASE COUNT 228 a 286 c 229 g 263 t
ORIGIN Chromosome 16p11-13.1.
1 ttctatcctg gtgacagagt gagacctggt ctcaaaacaa acaaacaaac aaaatataag
61 cttaaggtgg gctccaggaa gctttatcac tacttcgtgg cgtgtctttg gaatgctgtt
121 atattaggtt ggtgcaaaag taattgggtt tttgccattg ctttcaattt caactaatac
181 tcctctactt tctcatgcct agaaacaagg gcagctgcgt tccctgcacc tgacatgtga
241 cagcgcccca gttgggagcc agggcacctg gagcaccagc tgcagaatca accacctcat
301 cttccgtggc ggcgcccagg tcagcctggc ttctgtcccc tcactgctcc cctgccccac
361 cctgtcttta ctgctctgtg acctctcagt tccttttcct cagatcacct tcttggctac
421 ctttgacgtc tcccccaagg ctgtcctggg agaccggctg cttctgacag ccaatgtgag
481 caggtgagcc gggccatggc caggggcagt gcctcatctc cagcctcaca ccccattctc
541 ctctggggcc tctggcaact gagtctctcc tctttctcca gtgagaacaa cactcccagg
601 accagcaaga ccaccttcca gctggagctc ccggtgaagt atgctgtcta cactgtggtt
661 agcaggtcac aggtacccac tgcaggaaaa agggttcttc tctctgaccc tcaaaaagaa
721 aaaaaaaaaa aaggccttga aacgctgcca cagagggtga gataaggtgt ttgaaagtaa
781 aaggtcaggt gtttcagaag acaccttcct tcagccaatg ccttcctcga atttgctgtg
841 tgccaggcag ggtgctgtgg ttattttcca tacattcatt tgacattcat tgaagattta
901 ctgagccccc attatgtgtg atcaaaccag acatgaaccc tcgccttgtg ggtgtgcctt
961 gctggatgtc tcctgtgttc cactctcact gcactgcatg ctgagt
//
LOCUS HUMINT06 1904 bp ds-DNA PRI 07-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
25 - 30.
ACCESSION M29486 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 6 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 86 to 1528; exons only)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1904)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 86 164 integrin alpha subunit precursor, exon 25
280 387 integrin alpha subunit precursor, exon 26
505 588 integrin alpha subunit precursor, exon 27
788 901 integrin alpha subunit precursor, exon 28
1044 1145 integrin alpha subunit precursor, exon 29
1417 + 1527 integrin alpha subunit precursor, exon 30
matp + 86 164 integrin alpha subunit
280 387 integrin alpha subunit
505 588 integrin alpha subunit
788 901 integrin alpha subunit
1044 1145 integrin alpha subunit
1417 + 1527 integrin alpha subunit
pre-msg < 1 > 1904 P150,95 mRNA and introns
IVS < 1 85 P150,95 intron X
IVS 165 279 P150,95 intron Y
IVS 388 504 P150,95 intron Z
IVS 589 787 P150,95 intron AA
IVS 902 1043 P150,95 intron AB
IVS 1146 1416 P150,95 intron AC
IVS 1528 > 1903 P150,95 intron AD
BASE COUNT 381 a 545 c 525 g 453 t
ORIGIN Chromosome 16p11-13.1.
1 accacctgtc ctctcatgct ctagccaatg ccttctgcag atgcccatgg tagttcacat
61 ccacttatgc gtcttctctc tccagccacg aacaattcac caaatacctc aacttctcag
121 agtctgagga gaaggaaagc catgtggcca tgcacagata ccaggtcagg tggtggtgta
181 cgcaggaaga ccttgggcat ggggtgggag gctgggtagc cggagactgg ggagggattt
241 ggctttggcg tggctctgcc ctcagtgccc tctgtgcagg tcaataacct gggacagagg
301 gacctgcctg tcagcatcaa cttctgggtg cctgtggagc tgaaccagga ggctgtgtgg
361 atggatgtgg aggtctccct cccccaggta cccaaggact gcatgtggct cctccacgaa
421 tgccctttct acctggattc cttgtgcccc atgtgggtcc ctgatgtccc agctgagaca
481 cttgttctct gcattttccc ccagaaccca tcccttcggt gctcctcaga gaaaatcgcg
541 ggcccagcat ctgacttcct ggcgcacatt cagaagaatc ccgtgctggt gaggagggct
601 ctgggtctgg ccctcactgt aggcccacat cagaggaatt taacccagga gttcatgttc
661 catatccatc ctgctgaagt accctcttgc attcggatat ggccgctgcc ctcaagtcac
721 acgcataatg ctgcctccca ccttcacact catctttctc agccccatgc tatttatctg
781 cccccaggac tgctccattg ctggctgcct gcggttccgc tgtgacgtcc cctccttcag
841 cgtccaggag gagctggatt tcaccctgaa gggcaacctc agctttggct gggtccgcca
901 ggtgtgtggg tgcaacgaca gagcccctgc cccagactca ggcgggacct ggcatgtctg
961 tgcccatctg caagccaggg cacccccaga gctctgagcc tcccccagag ccagttcaac
1021 aggtttcccc cacccctttg cagatattgc agaagaaggt gtcggtcgtg agtgtggctg
1081 aaattacgtt cgacacatcc gtgtactccc agcttccagg acaggaggca tttatgagag
1141 ctcaggtaga gaccatgtgg agggcagcga ccaggcagga aagagggtcc caagggctac
1201 atctgtggtg ctgggtgggg ggtttgcaag ccttggggga ggagggtgaa ggcctctggg
1261 caggatagct gtccctaagg gcacgggtgc tgctgtgtct cacctcttgg agcagggcct
1321 ggggaaggag gggagggagt taaaggttgg ggagcctggg aggagtctgg gatagtagga
1381 ggatgggagt ctctgacagg gtcacttcca cttcagacga caacggtgct ggagaagtac
1441 aaggtccaca accccacccc cctgatcgta ggcagctcca ttgggggtct gttgctgctg
1501 gcactcatca cagcggtact gtacaaagtg agtgttttat gccacccttg acaccaccag
1561 catctggtcc cgctcttttt gcagagtgag aaggagctca ctttgaaggc agaggcacat
1621 tcttactggg tcacttcata tgagaaactg cttcccacct gcaatgtcac cgtgccccag
1681 tggccccctg ctttgtgatt cccaggcttc ctctaatatt tctccctttc tttcctgctc
1741 ttctccatca ttctacgtgt tcctgacagc agattatcat ataaaagcac agacctgggt
1801 tgaatgcgac atcaccacgg gttcttttgt cttgaccata ggccagtgtc tgctccactc
1861 tgggccttga tttccatgtg aggtgatatc acccagctca taga
//
LOCUS HUMINT07 149 bp ds-DNA PRI 07-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exon
31.
ACCESSION M29487 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 7 of 7
SOURCE Human DNA, (library pWE15), clone 30.1, and cell line HL-60, cDNA
to mRNA, clone lambda-X47.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 44 to 149)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD simple automatic
REFERENCE 2 (bases 1 to 149; revises [1])
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 44 148 integrin alpha subunit precursor, exon 31
matp + 44 145 integrin alpha subunit
pre-msg < 1 > 149 P150,95 mRNA and introns
IVS < 1 85 P150,95 intron AC
BASE COUNT 40 a 41 c 37 g 31 t
ORIGIN Chromosome 16p11-13.1.
1 actgaatggg cttcctgagt ttcttcttcg tcctcccccc taggttggct tcttcaagcg
61 tcagtacaag gaaatgatgg aggaggcaaa tggacaaatt gccccagaaa acgggacaca
121 gacccccagc ccgcccagtg agaaatgat
//
LOCUS ECOKDPDE 3600 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli transmembrane protein (kdpD) and cytoplasmic protein (kdpE)
genes, complete cds.
ACCESSION M36066
KEYWORDS cytoplasmic protein; kdpD gene; kdpE gene; transmembrane protein.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 3600)
AUTHORS Walderhaug,M.O., Daniel,J.M., Hesse,J. and Epstein,W.
TITLE KdpD and KdpE, regulatory proteins for the kdp operon of
Escherichia coli, are members of the two component sensor-effector
class
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.O.Walderhaug, 03-JUL-1990.
Author address: M.O.Walderhaug
University of Illinois at Chicago
Department of Microbiology and Immunology
Box 6998, Chicago, IL 60680
FEATURES from to/span description
pept 141 2825 transmembrane protein (kdpD)
pept 2822 3499 cytoplasmic protein (kdpE)
mRNA 68 > 3499 kpd operon
signal 30 36 -35 region
signal 55 60 -35 signal
binding 130 133 ribosome binding site
BASE COUNT 789 a 968 c 1051 g 792 t
ORIGIN 16 min on K12 map.
1 acgcgtggcg aaagcgcgta atctcagcgt tgaacagctc acgcaactga tcgcaaaata
61 cagccaacaa ccgctggtga aatatatcgg ccagccggtt gtcaacattg ttgaactcaa
121 tctggcgctg gataaacttg atgaataacg aacccttacg tcccgacccc gatcgtctgc
181 tggaacaaac tgccgcgccg catcggggga agctgaaagt tttcttcggt gcctgtgcag
241 gcgtcgggaa gacctgggcg atgctggcag aagcccagcg actgcgggcg caagggctgg
301 atattgtggt tggcgtggta gaaacccacg ggcgaaaaga taccgccgcc atgctggaag
361 ggctggctgt tctgccgtta aaacgccagg cgtaccgtgg gcggcatatc agcgagtttg
421 atctcgatgc cgccctcgcc cgccgcccgg cgctgatctt aatggacgaa ctggcgcaca
481 gtaatgcgcc aggttcccgt catcccaaac gctggcagga tatcgaagaa ctgctggaag
541 ctggcattga tgttttcact accgtcaacg ttcagcatct ggaaagtctg aatgatgtgg
601 tcagcggcgt caccggaatt caggtacggg aaaccgtgcc cgatcctttt ttcgatgccg
661 ccgacgacgt ggtgctggtg gacttgcccc cggacgatct gcgccagcgg ctgaaagaag
721 gcaaagtcta tattgccggg caggcggagc gcgccattga acattttttc cgcaaaggta
781 atctgatcgc cctgcgcgaa ctggcactgc gccgtactgc cgatcgcgtt gatgagcaaa
841 tgcgcgcctg gcgggggcat cctggcgaag agaaagtgtg gcacacgcgc gacgcgatcc
901 ttttatgcat cggccataac accggcagcg aaaaactggt ccgcgcagcg gcgcggctgg
961 cgtcacggct gggtagcgtc tggcacgcgg tgtatgttga aacccctgcc ctgcaccgct
1021 taccggaaaa aaaacgtcgg gcaattctca gcgccttacg tctggccgag gaactgggcg
1081 cggagacggc aacactttct gatccagcgg aagagaaagc ggtagtgcgt tatgcccgtg
1141 aacataatct cggcaagatt attctcggtc gcccggcctc gcgccgctgg tggcgtcggg
1201 aaacgtttgc tgaccgactg gcgcgcatcg cccccgatct cgatcaggtg ctggtcgcgc
1261 ttgatgaacc acccgcccgc acgattaaca acgcgccgga taaccgctct tttaaagaca
1321 agtggcgtgt acaaattcag ggatgcgtgg ttgccgccgc gttatgcgcc gttatcacct
1381 taattgccat gcagtggctg atggcgtttg atgccgccaa cctggtgatg ctgtatctgc
1441 ttggcgtggt ggtggtggcg ctattttatg gacgctggcc ttcagtggtt gccaccgtca
1501 ttaatgtagt gagtttcgat ctctttttta tcgccccacg cggcacgctc gccgtctctg
1561 atgtgcaata tctgctgacc ttcgcggtga tgttaaccgt cgggctggtg atcgggaacc
1621 ttactgctgg cgtgcgttat caggcgcggg tagcccgtta ccgcgagcaa cgcacacggc
1681 acttatatga aatgtcgaaa gctctggcgg tgggccgcag tccgcaggat atcgctgcca
1741 ccagcgaaca atttattgcc tccacgtttc atgcccgcag tcaggtgttg ttgcccgatg
1801 acaacggtaa attgcagccg ttaacacatc cgcaaggaat gacgccgtgg gacgatgcca
1861 tcgcgcagtg gagttttgat aaaggcctgc ctgcgggcgc gggcaccgac acgttacccg
1921 gtgtaccgta ccagattttg ccgctaaaaa gcggcgagaa aacctacggg ctggtggtgg
1981 tggagccggg gaatctgcgc cagttgatga tcccggaaca gcagcgcctg ctggagacgt
2041 ttacgctgtt agtcgccaat gcccttgagc ggctgacgct aaccgccagc gaagaacagg
2101 cgcggatggc aagcgaacgt gaacagatcc gcaacgccct gctggcggcg ctttcgcatg
2161 atttacgcac gccgcttacg gtgctgtttg gtcaggcaga aatcttaacg ctcgatctgg
2221 caagcgaagg atcaccccac gcccgccagg ccagcgagat ccgtcagcat gtgctgaaca
2281 ctacccgact ggtgaataat ctactggata tggcgcgaat tcagtccggc ggctttaatt
2341 tgaagaaaga gtggttaacg ctggaagaag tagtcggcag cgcgctgcaa atgctggaac
2401 cgggtttatc gtcgcccatc aatctttctc tgccagaacc gctgacctta atccacgttg
2461 acgggccact ctttgaacgg gtgctgatta atctgctgga gaacgcggtg aaatatgcgg
2521 gtgcgcaggc cgaaattggt atcgatgccc acgttgaggg cgaaaatcta caactggatg
2581 tctgggataa cggccccggt cttccgccag gccaggagca gacgatattt gataagtttg
2641 ctcgcgggaa taaagagtcg gcagtaccgg gggtagggct tggactggca atttgtcggg
2701 cgatagtgga tgtacacggg ggcactatta ccgcgttcaa ccgaccggaa ggtggtgcct
2761 gttttcgtgt tacacttccc cagcaaactg cccctgaact tgaagaattt catgaggata
2821 tgtgacaaac gttctgattg ttgaagatga acaggctatt cgtcgctttc tgcgcacggc
2881 gctggagggc gacgggatgc gcgtctttga ggccgaaacg ctgcaacgcg gcttgctgga
2941 agcggcaacc cgtaagccag atttgattat tctcgatctc ggcctgcccg atggtgatgg
3001 gattgagttt atccgcgacc tgcgccagtg gagccgggtg ccggtgattg tgctttccgc
3061 acgcagcgaa gagagcgaca aaatcgccgc gctggatgcc ggagcggatg attatctgag
3121 taagccgttt ggcattggcg aattgcaggc ccgtctgcgc gtcgcattac gccgccactc
3181 tgccaccacc gcgcccgatc cgctggtaaa attttccgat gttaccgtcg atttagccgc
3241 ccgcgtgatt caccggggtg aggaagaggt gcatctcaca ccaattgagt tccgccttgc
3301 tggtcgctgc tcaacaatgc cggaaaagta ctcacccagc ggcccagtcc ttaaccaggt
3361 gtgggggcca aacgcggtcg aacacagtca ctatttgcgt atttatatgg gacatctgcg
3421 acaaaaactg gaacaggatc ccgcccgccc acgccatttc attactgaaa ccggtattgg
3481 ctatcggttt atgctttgaa tattaatttt aatacagcct gccttttatt aattaaagcc
3541 gtaataataa atacggcttt ttatcttaaa caacacacaa aaataacaat tcaatatttt
//
LOCUS KAEUREX 4761 bp ds-DNA BCT 07-AUG-1990
DEFINITION K.aerogenes urease subunits A (ureA), B (ureB), C (ureC), and
urease accessory protein (ureE, ureF and ureG) genes, complete cds.
ACCESSION M36068
KEYWORDS urease; urease accessory protein.
SOURCE K.aerogenes (strain CG253) DNA.
ORGANISM Klebsiella aerogenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 4761)
AUTHORS Mulrooney,S.B. and Hausinger,R.P.
TITLE Sequence of the Klebsiella aerogenes urease genes and evidence for
accessory proteins facilitating nickel incorporation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.B.Mulrooney, 03-JUL-1990.
Author address: S.B.Mulrooney
Michigan State University
Dept of Biochemistry
East Lansing, MI 48824
FEATURES from to/span description
pept 264 566 urease subunit A (ureA)
pept 576 896 urease subunit B (ureB)
pept 889 2592 urease subunit C (ureC)
pept 2602 3078 urease accessory protein (ureE)
pept 3080 3754 urease accessory protein (ureF)
pept 3763 4380 urease accessory protein (ureG)
site 4385 4424 transcription termination signal
BASE COUNT 898 a 1542 c 1452 g 869 t
ORIGIN
1 ctctcgccga acgtccctgg gtcggcactt tgctgtgcta tccggctacc gatgccctgc
61 tcgacggggt gcgcgacgcg ctggcgccgc tcggtctcta cgccggcgcc agcctgaccg
121 accgcctgct gacggtgcgt ttcctcagtg acgataatct gatttgccag cgggtgatgc
181 gcgacgtatg gcagtttctg cgccctcatc tcaccggtaa atctcccgta cttccccgaa
241 tctggctgac ttaagagaac gttatggaac tgaccccccg agaaaaagac aagctgttgc
301 tgtttaccgc cgcgctggtg gcggagcgtc gcctggcccg cggcctgaag ctcaactatc
361 cggagtccgt ggccctgatc agcgccttta ttatggaagg cgctcgggac ggcaaaagcg
421 tggcctcgct gatggaggaa ggccgtcacg tcctgacccg cgagcaggtg atggagggcg
481 tcccggaaat gatcccggat atccaggtcg aagccacctt cccggacggc tcgaagctgg
541 tcaccgttca caacccgatt atctgaggta gcgccatgat ccccggtgaa tatcacgtta
601 agcccggtca gatagccctg aataccggcc gggcaacctg tcgcgtggtc gttgagaacc
661 acggcgatcg gccgattcag gtcggttcgc actaccattt cgccgaggtt aacccggcgc
721 tgaagttcga ccgtcagcag gccgccggct atcgcctgaa tatcccggcg ggcacggcgg
781 tacgctttga acccggccag aaacgcgagg tcgagctggt ggccttcgcc ggtcaccgcg
841 ccgtcttcgg cttccgcggc gaggtcatgg gccctctgga ggtaaacgat gagtaatatt
901 tcacgccagg cctatgccga tatgttcggc cccaccgtcg gcgacaaggt gcgcctggca
961 gataccgagc tgtggatcga ggtggaggac gatttgacca cctacgggga agaggtcaaa
1021 ttcggcggcg gcaaagtgat ccgcgacggc atgggccagg gacagatgct ggccgccgac
1081 tgtgtcgacc tggtgctcac caacgcgttg atcgtcgatc actgggggat cgttaaggcc
1141 gatatcggcg tgaaggacgg ccggatcttc gccatcggca aagccggcaa ccccgacatc
1201 cagcccaacg tcaccatccc catcggcgct gcgacggaag tgatcgccgc cgaaggaaaa
1261 attgtcaccg ccggcgggat cgatacccat attcactgga tctgtccgca gcaggcggaa
1321 gaggcgctgg tctctggcgt gaccaccatg gtcggcggcg gcaccggccc ggccgcgggc
1381 acccatgcca ccacctgcac cccgggcccg tggtatatct cacgcatgct gcaggcggcc
1441 gacagcctgc cggtcaatat cggcctgctg ggcaagggaa acgtttctca gccggatgcc
1501 ctgcgcgagc aggtggcggc aggcgttatt ggcctgaaga tccatgagga ctggggcgcc
1561 accccggcgg cgatcgactg tgcgttaacc gtcgccgatg aaatggacat ccaggtcgcc
1621 ctgcacagcg acaccctgaa tgaatccggt tttgtggaag acaccctcgc cgccatcggc
1681 gggcgcacca tccacacctt ccataccgaa ggggccggcg gcggccatgc gccggacatc
1741 atcaccgcct gcgcccaccc gaacattttg ccgtcgtcca ccaacccaac gctgccctac
1801 accctcaaca ccatcgatga acatctcgat atgctgatgg tctgccacca tctggacccg
1861 gacatcgccg aggacgtggc ctttgccgag tcgcgcattc gccgggaaac catcgctgcg
1921 gaagacgtgc tgcacgatct cggcgccttc tcgctcacct cctccgattc gcaggccatg
1981 ggccgcgtcg gggaagtgat tctccgcacc tggcaggtgg cgcatcgcat gaaggtgcag
2041 cgcggagcgc tggcggagga gaccggggat aacgacaact tccgcgtgaa gcgctacatc
2101 gccaaataca ccatcaaccc ggcgctgacc cacggcatcg cacacgaagt cggatccatt
2161 gaggtgggta agctggctga cctcgtggtc tggtcaccag ccttcttcgg cgtgaaaccg
2221 gccaccgtga tcaaaggcgg catgatcgcc atcgcgccga tgggcgatat caatgcctct
2281 attccgaccc cgcagccggt gcactaccgc ccgatgtttg gcgcgctggg cagcgcccgc
2341 catcactgcc gcctcacctt cctgtcgcag gcggcggcag ccaatggcgt tgccgagcgg
2401 ctgaacctgc gcagcgcgat cgccgtggtg aaaggctgcc gtacggtgca gaaagccgac
2461 atggtgcaca acagtctgca gcctaacatc accgtcgacg cccagaccta tgaggtgcgg
2521 gtggatggcg aacttatcac cagcgagccg gcagacgttc tgccgatggc gcaacgatat
2581 tttctgtttt aaggagagcg gatgctttat ttaactcaac gtctggagat ccccgccgcc
2641 gcgaccgcca gcgttacgct gccgattgat gttcgcgtca aaagccgggt taaggtcacc
2701 ctcaacgatg gccgggatgc cggcctgctg ctgccccgcg gcctgctact acgcggcggc
2761 gatgtgctca gcaacgaaga aggcaccgag tttgtgcagg tgattgccgc tgatgaagag
2821 gtgtcggtag tgcgctgcga cgatccgttt atgctggcga aggcctgcta ccacctcggc
2881 aaccgtcacg tgccgctgca gatcatgccg ggcgagctgc gctaccatca cgatcacgtg
2941 ctggacgata tgctgcgcca gttcggcctg acggtgacct ttggccagct gccgttcgag
3001 ccggaagccg gcgcttacgc cagcgagagc cacggtcatc atcatgctca tcatgaccac
3061 cacgctcaca gccactagca tgtcgacagc ggaacaacgc ctgcggctga tgcagctggc
3121 cagcagcaac ctgccggtag ggggttacag ctggtcccag gggctggagt gggctgtgga
3181 agccggctgg gtgctggacg tcgcggcctt cgagcgctgg cagcgacgcc agatgacgga
3241 aggctttttt accgttgacc tgccgctgtt cgcccgcctg taccgcgcct gcgaacaagg
3301 cgatatcgct gcggcccagc gctggaccgc ctatctgctg gcctgccggg aaactcgtga
3361 actgcgggag gaagagcgca accgcggcgc ggcgtttgcc cgtctgctga gcgactggca
3421 gccggactgt ccgccgccgt ggcgctccct gtgccagcaa agccagctcg ccgggatggc
3481 ctggctcggc gtgcgctggc gtatcgccct gcccgagatg gccctcagcc tgggctatag
3541 ctggattgag agcgccgtga tggccggcgt caagctggtc cccttcggcc agcaggccgc
3601 ccagcagctg attttacgtc tttgtgacca ctacgcggcc gagatgcccc gcgcgctggc
3661 cgcgccggac ggcgatatcg gatcggccac cccgctcgcc gccatcgcct ctgcccggca
3721 tgaaacccaa tactctcgat tattccgttc ctaggagaag ccatgaactc ttataaacac
3781 ccgctgcgcg tcggcgtcgg cggcccggtc ggctccggta aaaccgctct gctggaagcg
3841 ctgtgtaaag cgatgcgcga tacctggcag ctggcggtgg tcactaacga catctatacc
3901 aaagaagatc agcgcatcct caccgaagcg ggcgcgctgg cgcctgaacg catcgtcggt
3961 gtggaaaccg gcggctgccc gcatacggcg atccgcgaag atgcctcaat gaacctcgcc
4021 gccgtggaag cgctgagtga aaagttcggt aacctcgacc ttatcttcgt ggaaagcggc
4081 ggcgataacc tgagcgccac cttcagcccg gagctggcgg atctgaccat ctacgtcatc
4141 gatgtggccg aaggggagaa gatcccgcgc aaaggcggac cggggatcac caaatccgat
4201 ttcctggtga tcaataaaac cgaccttgcc ccctatgtgg gcgcgtcgct ggaggtgatg
4261 gcgagcgata cccagcgtat gcgcggcgat cgcccatgga ccttcaccaa tctgaagcag
4321 ggcgacggcc tgagcaccat tatcgccttc ctcgaagaca aaggcatgct tggcaaatag
4381 gcctgttgca ccagccgggc gcaagcgcgt ccggatggtg cagcctctgc tctctcttta
4441 tatcatcctg cctccacctc cgcgccacgc ctgccctgca atatggcata aggtttgcta
4501 attcaagtca tgcctaacca ttaaggaatg actatgtcat cactggatct taaccctgaa
4561 ttacccgcga caacgcggac ttccggtacc cgggaaacct tagaagatta caccttacgt
4621 tacgccccgc tgagcttccg ccgctggggt ccgggcgtcg tcgcggtcac cgcgctgggc
4681 ggcatcgcct atctggccga cttttccatc ggcgccagca tcggtatggc ctggggcacc
4741 agcaacgcca tctattcgat c
//
LOCUS MUSTCP1X 1714 bp ss-mRNA ROD 07-AUG-1990
DEFINITION Mouse t-complex protein (Tcp-1x) mRNA, 3' end.
ACCESSION M35797
KEYWORDS t-complex protein.
SOURCE Mouse (strain C57BL/6J) 8.5 day-old embryo, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1714)
AUTHORS Dudley,K., Shanahan,F., Burtenshaw,M., Evans,E.P., Ruddy,S. and
Lyon,M.F.
TITLE Isolation and characterisation of a cDNA clone corresponding to the
mouse t-complex gene Tcp-1x
JOURNAL Genet. Res. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Dudley, 29-JUN-1990.
FEATURES from to/span description
pept < 1 875 t-complex protein (Tcp-1x)
BASE COUNT 454 a 387 c 445 g 428 t
ORIGIN Chromosome 17, A2/A3 junction.
1 taaaacgacg gccagtgaat tcatgtctta cgggcaagga ggatatgagc aagcctcctc
61 acttgactca cctgagaaca ggagtcagga tgggtgaggt gccactggct gacagcatac
121 tctgtgatgg tctgacagat gcctttcaca actaccacat gggcatcaca gctgaaaacg
181 tagccaaaaa atggcaagtg agcagagagg cccaggacaa ggttgcagtt ctgtcacaga
241 acagggcaga gactgctcag aaagctggcc actttgacaa ggagattgtg ccagtgctgg
301 tgtcttccag aaaaggtctt actgaagtga aaatcgatga atttccccgt catgggagta
361 accttgaagc catgggaacg ctgaaacctt actttcttac tgatgggaca ggaactgtca
421 ccccagcgaa cgcatcagga atgaacgatg gcgctgctgc tgtggtcctt atgaagaaga
481 cagaagctga gaggcggatg ctgaaacctt tagcaagaat agtctcctgg tcccaagccg
541 gtgtggagcc ttctgttatg ggagtaggac caattccagc cataaagcaa gctgttgcaa
601 aggcaggctg gtccctggag gatgttgacc tgtttgaaat caatgaagcc tttgcagcgg
661 tgtctgcggc aatagctaaa gaacttggat taaaccccgg gaaggtcaac atcgatggag
721 gagccattgc cttgggccat cctctgggag catctggctg caggattcta gtgaccttgc
781 tgcacaccct ggagagagtg ggcgggaccc gtggtgttgc agccctgtgc attgggggtg
841 ggatgggagt cgcaatgtgt gttcagagag ggtgaactgc atgacagcca ccaccttgga
901 cagttcttgt taaatcagtg caacactaag cagaaaggga aaccagagga ccaaactgag
961 gacaggaacc caggtggaca gcttgctgta ctttaatgtg agacacccaa ggctacggca
1021 ttcgacctga cactgttgta aataagaggg aaatccaatc agtcatcaag ggctccagag
1081 tgaacagcat tttcataact tccatgttta tcgtctttca tttctggtga tttctctgat
1141 cattgttttg tttttgtttc ttgtttgagt taggctacat atatgcctca aaatcgtttt
1201 gtagatcagg ttgactttga actcccggag atccaccttt ctgtgcctcc tgaatacttg
1261 aattaaaggc atgcaccatt atggcagggc ccccaaaatg atccattcaa aacatgggga
1321 ttcggcttct gttacgaact tcaggctctt cactttaact atggttggtt caaacgcccc
1381 tgcgtgcctg tctctcggtt tcccattgat caaatcaaga ccaatcctgt agtgaaagtc
1441 ggattcagtg agcctcttcc aggctgaggt ggggtaggta ttccagctgt tctagtcaga
1501 agactggcag tgccaacaag tgtgcctctg gattgttgaa aggtgggcag caagcagcat
1561 ccacgtactt atttctctgc ttgacgggat gggatgttac ctgcttcaga tatctacctt
1621 gacttcccca aaatgatgac tggaactgga actgacagtc aaataaactc tttcctccct
1681 taaaaaaaaa aaccttgacc gtaagacatg aatt
//
LOCUS YSCAER2 2551 bp ds-DNA PLN 07-AUG-1990
DEFINITION S.cerevisiae heme-regulated and catabolite repressed genes
repressor (AER2) gene, complete cds.
ACCESSION M35861
KEYWORDS catabolite repressed gene repressor; heme-regulated gene repressor.
SOURCE S.cerevisiae.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2551)
AUTHORS Zhang,M., Rosenblum-Vos,L.S., Lowry,C.V., Boakye,K. and
Zitomer,R.S.
TITLE A yeast protein with homology to the B-subunit of G-proteins is
involved in repression of heme-regulated and catabolite repressed
genes
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Zitomer, 02-JUL-1990.
Author address: R.S.Zitomer
Department of Biological Sciences
State University of New York
Albany, NY 12222
FEATURES from to/span description
pept 247 2388 repressor AER2
BASE COUNT 783 a 628 c 488 g 652 t
ORIGIN
1 aaaacaaaat aacaagtaca taagtaaaca aatagagaga aattccccca tctaccccca
61 attgataaac atcagcgaag caagaacaac tggctgaaca cgtgcccctc tatctgtcct
121 tctgcaattg tttaattttc tttcacgcca agttaccttt cgcacaccac tactcttttt
181 ctattgtttt tttgtctttt ttgataagca ggggaagaaa gaaatcagct ttccatccaa
241 accaatatga ctgccagcgt ttcgaatacg cagaataagc tgaatgagct tctcgatgcc
301 atcagacagg agtttctcca agtctcacaa gaggcaaata cctaccgtct tcaaaaccaa
361 aaggattacg atttcaaaat gaaccagcag ctggctgaga tgcagcagat aagaaacacc
421 gtctacgaac tggaactaac tcacaggaaa atgaaggacg cgtacgaaga agagatcaag
481 cacttgaaac tagggctgga gcaaagagac catcaaattg catctttgac cgtccagcaa
541 cagcggcaac agcaacagca gcaacaggtc cagcagcatt tacaacagca acagcagcag
601 ctagccgctg catctgcatc tgttccagtt gcgcaacaac caccggctac tacttcggcc
661 accgccactc cagcagcaaa cacaactact ggttcgccat cggccttccc agtacaagct
721 agccgtccta atctggttgg ctcacagttg cctaccacca ctttgcctgt ggtgtcctca
781 aacgcccaac aacaactacc acaacagcaa ctgcaacagc agcaacttca acaacagcaa
841 ccacctcccc aggtttccgt ggcaccattg agtaacacag ccatcaacgg atctcctact
901 tctaaagaga ccactacttt accctctgtc aaggcacctg aatctacgtt gaaagaaact
961 gaaccggaaa ataataatac ctcgaagata aatgacaccg gatccgccac cacggccacc
1021 actaccaccg caactgaaac tgaaatcaaa cctaaggagg aagacgccac cccggctagt
1081 ttgcaccagg atcactactt agtcccttat aatcaaagag caaaccactc taaacctatc
1141 ccacctttcc ttttggatct agattcccag tctgttcccg atgctctgaa gaagcaaaca
1201 aatgattatt atattttata caacccggca ctaccaagag aaattgacgt tgagttacac
1261 aaatctttgg atcatacttc agttgtttgt tgcgtgaagt tcagtaacga tggtgaatac
1321 ttagccacag gctgcaacaa aactactcaa gtgtatcgcg tttcagatgg ttctctggtg
1381 gcccgtctat ctgacgattc tgctgccaat aaccatcgaa attcgatcac tgaaaataac
1441 accaccacgt ccacggataa caatacaatg acaaccacta ctaccaccac aattactacc
1501 acagcgatga cttcggcagc agaattggca aaagatgtgg aaaacctgaa cacttcgtct
1561 tccccatcat ccgacttgta tatccgttca gtgtgttttt ctccagatgg gaaatttttg
1621 gcaacaggtg ctgaagacag actgattaga atttgggata ttgaaaatag aaagattgtt
1681 atgattcttc aaggccacga acaagatatt tattcattgg actactttcc ctcaggtgac
1741 aaattagtct ccggttctgg tgaccgtacc gttcgtattt gggacttacg tacaggccag
1801 tgttcattga ctttatccat tgaagatggt gttaccaccg tcgctgtatc accaggtgat
1861 ggtaaataca tcgctgctgg ttctctagat cgtgctgtga gagtttggga ttccgagacc
1921 ggattcttgg tggaaagact agattcggaa aacgaatccg gtacaggcca caaggactct
1981 gtttatagcg ttgtcttcac tagagatgga caaagcgttg tatccggctc attagataga
2041 tctgttaagc tctggaattt gcagaatgca aacaacaaga gcgattcgaa aactccaaat
2101 tccggcactt gtgaagttac gtatatcggg cataaagact ttgtattgtc cgtggccacc
2161 acacaaaatg atgagtacat cttgtccggt tccaaagatc gtggtgtcct gttttgggat
2221 aagaaatccg gcaatccgtt attgatgttg caaggtcata ggaattcagt tatatctgtg
2281 gctgtggcaa acgggtctcc gctgggtcca gaatataacg tttttgctac tggtagcggt
2341 gattgtaaag caaggatttg gaagtataaa aaaatagcgc caaattaaga acagaacaca
2401 aaaggaacac tttacaaatg taactaacta aactattctt caattcattg tcattatatc
2461 tttctctcta ttctttttta tctacttatt tgttatgtta tgatccaggt acgctttaaa
2521 acaaacagtt actttcctca ccattcatcc c
//
LOCUS ALFHISH3A 1089 bp ds-DNA PLN 07-AUG-1990
DEFINITION Alfalfa histone H3 (H3-1.1) gene, complete cds.
ACCESSION M35867
KEYWORDS histone H3.
SOURCE Alfalfa (L. cv. Nagyszenasi) DNA, clone lambda-ALH3-1.
ORGANISM Medicago sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 1089)
AUTHORS Wu,S.C., Boegre,L., Vincze,E., Kiss,G.B. and Dudits,D.
TITLE Isolation of an alfalfa histone H3 gene: Structure and expression
JOURNAL Plant Mol. Biol. 11, 641-649 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 285 695 histone H3 (H3-1.1)
signal 93 97 CAAT box
signal 168 174 TATA box
signal 817 826 polyA signal
signal 963 968 polyA signal
signal 1008 1013 polyA signal
site 2 6 GATCC motif
site 38 43 ACGTCA consensus
site 46 50 GATCC motif
site 53 46 (c) CGCGGATC octamer
BASE COUNT 305 a 271 c 178 g 335 t
ORIGIN
1 ggatcctcat cacacaaaca aaacacatcc acacgccacg tcatcgatcc gcgtgtcgca
61 aatactccaa ataaacgaca cccgtcgatt aaccaataat caacggccac aattacaccc
121 cattcaccca cttctcaaat ttcaaaaccc gcacaaaatc atatcactat ataactcacc
181 cctttcatct cttcttcctc atcactctca attcttcaaa gcacaaaaac aaccttcaag
241 tttctctgtt tgatactgtt cttttcagtt atttttcata atcaatggca cgtaccaagc
301 aaaccgctcg caaatccacc ggtggcaaag ctccaaggaa gcaactcgcc acaaaagccg
361 ctcgcaaatc tgctccggcc accggcggag tgaagaaacc tcaccgtttc aggccaggaa
421 ccgtcgctct ccgtgagatc cgcaagtacc agaagagcac tgagctcctc atccgcaaac
481 tccccttcca gcgtcttgtc cgtgagatcg ctcaggattt caagactgat ctccgtttcc
541 agagctccgt cgtgtcggct ttgcaagaag cggccgaggc ttatctcgtc ggtctctttg
601 aggatactaa cctctgcgcc attcatgcta agcgtgtcac tatcatgcct aaggatatcc
661 agctcgctag gcgtatccgt ggcgagcgtg cttgatcttg ttgattcgct ttgttagggt
721 ttgtgtagat aggttcatga tgtagttaaa tcacaaaccg ttgctataag tttctctatg
781 gattttgtta tattgtaatg tgcttaacgc ttaatcaatg aaatcgatca tcttttgtta
841 aactctttgt tcaattactt atgctttttt tttatctttt cttaacccta attttctgtc
901 attttattac actttccgaa cttttgttat ccctaattgg atttgaaatc aaaattaggg
961 ttgataaagg catattgtat aatgttgaaa ttcttgttaa atatattgat aaatacaatt
1021 atttcaaagt aaaaattaaa tgctgttgat tacatcttga aatgtaaaaa taattgcatg
1081 cataagctt
//
LOCUS APHYEPPC 400 bp ds-DNA SYN 07-AUG-1990
DEFINITION Foot-and-mouth disease virus/SP6 promoter recombinanant DNA.
ACCESSION M35873
KEYWORDS .
SOURCE Foot-and-mouth disease virus (strain O1K), cDNA to viral RNA, clone
pFMDV-YEP-polyC.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 400)
AUTHORS Zibert,A., Maass,G., Strebel,K., Falk,M.M. and Beck,E.
TITLE Infectious foot-and-mouth disease virus derived from a cloned full-
length cDNA
JOURNAL J. Virol. 64, 2467-2473 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
RNA 18 396 RNA
signal 1 17 SP6 promoter
recomb 29 30 plasmid DNA end/foot-and-mouth disease DNA
start
BASE COUNT 68 a 122 c 112 g 98 t
ORIGIN
1 atttaggtga cactatagaa tacaagctct tgaaaggggg cattagggtc tcacccctag
61 taagccaacg acagtccctg cgttgcactc cacacttacg ttgtacacac gcgggacccg
121 atgggctatc gttcacccac ctacagctgg actcacggcg ccgcgtggcc atttagctgg
181 attgtgcgga cgaacacgct tgcgcacctc gcgtgaccgg ttagtactct taccactctc
241 cgcctacttg gtcgttagcg ctgtcttggg cattcctgtg ggggccgttc gacgctccac
301 gggaactctc ctgtgtgaca tctacggtga tggggccgtt tcgcgtgggc tggtcgtttg
361 gactgcttcg gctgtcaccc ggcgcccgcc tttcagctag
//
LOCUS CYNAAAGFP 282 bp ss-mRNA SYN 07-AUG-1990
DEFINITION C.tetragonoloba alpha-amylase/alpha-galactosidase fusion protein
gene, 5' end.
ACCESSION M35837
KEYWORDS alpha-amylase; alpha-galactosidase; fusion protein.
SOURCE C.tetragonoloba and plasmid DNA, clone pUR2601.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 282)
AUTHORS Overbeeke,N., Termorshuizen,G.H.M., Giuseppin,M.L.F.,
Underwood,D.R. and Verrips,C.T.
TITLE Secretion of the alpha-galactosidase from Cyamopsis tetragonoloba
(Guar) by Bacillus subtilis
JOURNAL Appl. Environ. Microbiol. 56, 1429-1434 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 88 > 282 alpha-amylase/alpha-galactosidase fusion
protein
binding 77 80 ribosomal binding site (put.)
recomb 180 181 alpha-amylase end/alpha-galactosidase start
BASE COUNT 88 a 56 c 70 g 68 t
ORIGIN
1 gctgcaggta accggatccg aattcccggg gatccgtcct gatatgtaaa atataatttg
61 tataagaaaa tgagagggag aggaaacatg attcaaaaac gaaagcggac agtttcgttc
121 agacttgtgc ttatgtgcac gctgttattt gtcagtttgc cgattacaaa aacatcagcc
181 gcggaaaacg gtttgggtca aaccccacca atgggttgga actcctggaa ccacttcggt
241 tgtgacatta acgaaaacgt tgtcagagaa acagctgatg ca
//
LOCUS HSODR3E 111 bp ds-DNA VRL 07-AUG-1990
DEFINITION Herpesvirus papio 3' dyad region DNA.
ACCESSION M35835
KEYWORDS dyad region.
SOURCE Herpesvirus papio DNA, from baboon B-cell.
ORGANISM Herpesvirus papio
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 111)
AUTHORS Loeb,D.D., Sung,N.S., Pesano,R.L., Sexton,C.J.,
Hutchison,C.H.III. and Pagano,J.S.
TITLE Plasmid origin of replication of herpesvirus papio: DNA sequence
and enhancer function
JOURNAL J. Virol. 64, 2876-2883 (1990)
STANDARD simple staff_entry
BASE COUNT 29 a 30 c 28 g 24 t
ORIGIN
1 ggacagcaca tactgcctcg cagatagcat ataccgccgg catgatagca tatgctaccc
61 agatggatag catacgctac ctcccgggta gtatttgcta tccgggtaag g
//
LOCUS HUMMHDQB3A 552 bp ss-mRNA PRI 07-AUG-1990
DEFINITION Human MHC class II DQ3.1ER (DR4) mRNA, partial cds.
ACCESSION M35730
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human homozygous cell line ER, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 552)
AUTHORS Holbeck,S.L. and Nepom,G.T.
TITLE Molecular analysis of DQ-beta-3.1 genes
JOURNAL Hum. Immunol. 21, 183-192 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 552 MHC class II DQ3.1ER protein (AA at 3)
/hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1"
/map="6p21.3"
site 269 270 MHC class II DQ3-beta-1 exon end/MHC class II
DQ3-beta-2 exon start
BASE COUNT 128 a 165 c 162 g 97 t
ORIGIN Chromosome 6p21.3.
1 aggatttcgt gtaccagttt aaggccatgt gctacttcac caacgggacg gagcgcgtgc
61 gttatgtgac cagatacatc tataaccgag aggagtacgc acgcttcgac agcgacgtgg
121 aggtgtaccg ggcggtgacg ccgctggggc cgcctgacgc cgagtactgg aacagccaga
181 aggaagtcct ggagaggacc cgggcggagt tggacacggt gtgcagacac aactaccagt
241 tggagctccg cacgaccttg cagcggcgag tggagcccac agtgaccatc tccccatcca
301 ggacagaggc cctcaaccac cacaacctgc tggtctgctc agtgacagat ttctatccag
361 cccagatcaa agtccggtgg tttcggaatg accaggagga gacaaccggc gttgtgtcca
421 ccccccttat taggaacggt gactggacct tccagatcct ggtgatgctg gaaatgactc
481 cccagcatgg agacgtctac acctgccacg tggagcaccc cagcctccag aaccccatca
541 ccgtggagtg gc
//
LOCUS HUMMHDQB3B 530 bp ss-mRNA PRI 07-AUG-1990
DEFINITION Human MHC class II DQ3.1LUY (DR8) mRNA, partial cds.
ACCESSION M35731
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human homozygous cell line LUY, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 530)
AUTHORS Holbeck,S.L. and Nepom,G.T.
TITLE Molecular analysis of DQ-beta-3.1 genes
JOURNAL Hum. Immunol. 21, 183-192 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept.ps < 1 > 530 MHC class II DQ3.1LUY protein (AA at 3)
/hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1"
/map="6p21.3"
site 248 249 MHC class II DQ3-beta-1 exon end/MHC class II
DQ3-beta-2 exon start
BASE COUNT 124 a 157 c 154 g 95 t
ORIGIN Chromosome 6p21.3.
1 aggatttcgt gtaccagttt aaggccatgt gctacttcac caacgggacg gagcgcgtgc
61 gttatgtgac cagatacatc tataaccgag aggagtacgc acgcttcgac agcgacgtgg
121 aggtgtaccg ggcggtgacg ccgctggggc cgcctgacgc cgagtactgg aacagccaga
181 aggaagtcct ggagaggacc cgggcggagt tggacacggt gtgcagacac aactaccagt
241 tggagctctg gagcccacag tgaccatctc cccatccagg acagaggccc tcaaccacca
301 caacctgctg gtctgctcag tgacagattt ctatccagcc cagatcaaag tccggtggtt
361 tcggaatgac caggaggaga caaccggcgt tgtgtccacc ccccttatta ggaacggtga
421 ctggaccttc cagatcctgg tgatgctgga aatgactccc cagcatggag acgtctacac
481 ctgccacgtg gagcacccca gcctccagaa ccccatcacc gtggagtggc
//
LOCUS LGILG106NL 378 bp ss-mRNA PLN 07-AUG-1990
DEFINITION L.gibba negatively light-regulated protein (Lg106) mRNA, 3' end.
ACCESSION M35866
KEYWORDS negatively light-regulated protein.
SOURCE L.gibba (L. G-3), cDNA to mRNA, clone pLg106.
ORGANISM Lemna gibba
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Arecidae; Arales; Lemnaceae.
REFERENCE 1 (bases 1 to 378)
AUTHORS Okubara,P.A., Flores,S. and Tobin,E.M.
TITLE Characterization of a negatively light-regulated mRNA from Lemna
gibba
JOURNAL Plant Mol. Biol. 11, 673-681 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 261 ORF1 (AA at 1)
pept < 1 82 ORF2 (AA at 2)
pept < 1 62 ORF3 (AA at 3)
BASE COUNT 96 a 96 c 102 g 84 t
ORIGIN
1 gacggcggat tattacccaa aaagcctcct ctcatttcca aggaccacga gcgagccttt
61 gactccgctg actgggcatt aggaaagcaa ggaggccccg tggagaagcc gaagcctctg
121 gaggcgctac ggccgaaact ccagccaact cagcagcaag ttcgcttgag atcggtttat
181 gctcctgccg agggcgaaga ttggatccag cggcgcgaca gaagagatgg gcaacgagtg
241 agtcgcctcg gaggaatgta gtatccatcc tatatctctc tctccgattc ctgaataaaa
301 cgatctattt tgtggcatct cctgaagaac cctaattgat tagtagttga ggtttgacct
361 tgggtcaacg gatcagat
//
LOCUS MUSIGHADM 360 bp ss-mRNA ROD 07-AUG-1990
DEFINITION Mouse Ig heavy chain mRNA V-region, partial cds.
ACCESSION M26808
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain C57BL/6) hybridoma line B1-8.V4, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 360)
AUTHORS Brueggemann,M., Mueller,H.-J., Burger,C. and Rajewsky,K.
TITLE Idiotypic selection of an antibody mutant with changed hapten
binding specificity, resulting from a point mutation in position 50
of the heavy chain
JOURNAL EMBO J. 5, 1561-1566 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 360 Ig heavy chain V-region (AA at 1)
BASE COUNT 89 a 92 c 99 g 80 t
ORIGIN Chromosome 12.
1 caggtccaac tgcagcagcc tggggctgag cttgtgaagc ctggggcttc agtgaagctg
61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg
121 cctggacgag gccttgagtg gattggaggg attgatccta atagtggtgg tactaagtac
181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aaccctccag cacagcctac
241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagatacgat
301 tactacggta gtagctactt tgactactgg ggccaaggca ccactctcac agtctcctca
//
LOCUS PLYJCRR 270 bp ds-DNA VRL 07-AUG-1990
DEFINITION Polyomavirus JC regulatory region DNA.
ACCESSION M35834
KEYWORDS regulatory region.
SOURCE Polyomavirus JC DNA, from human progressive multifocal
leukoencephalopathy patient CY urine.
ORGANISM Polyomavirus JC
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Polyomaviruses.
REFERENCE 1 (bases 1 to 270)
AUTHORS Yogo,Y., Kitamura,T., Sugimoto,C., Ueki,T., Aso,Y., Hara,K. and
Taguchi,F.
TITLE Isolation of a possible archetypal JC virus DNA sequence from
nonimmunocompromised individuals
JOURNAL J. Virol. 64, 3139-3143 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 268 > 270 late leader protein
orgrpl 12 < 1 (c) origin of DNA replication
signal 15 29 TATA box
BASE COUNT 78 a 66 c 77 g 49 t
ORIGIN
1 gcctcggcct cctgtatata taaaaaaaag ggaaggtagg gaggagctgg ctaaaactgg
61 atggctgcca gccaagcatg agctcatacc tagggagcca accagctgac agccagaggg
121 agccctggct gcatgccact ggcagttata gtgaaacccc tcccatagtc cttaatcaca
181 agtaaacaaa gcacaagggg aagtggaaag cagccagggg aacatgtttt gcgagccaga
241 gctgttttgg cttgtcacca gctggccatg
//
LOCUS PSEIAAMH 3677 bp ds-DNA BCT 07-AUG-1990
DEFINITION P.syringae tryptophan 2-monooxygenase (iaaM) and indoleacetamide
hydrolase (iaaH) genes, complete cds.
ACCESSION M11035 M35690
KEYWORDS iaaH gene; iaaM gene; indoleacetamide hydrolase; indoleacetic acid;
indoleacetic acid operon promoter region;
tryptophan 2-monooxygenase; tryptophan-2-oxyreductase.
SOURCE P.syringae savastanoi (strain EW2009) DNA, clones pLUC2, pCP3, and
pCJP12.
ORGANISM Pseudomonas syringae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 202 to 3677)
AUTHORS Yamada,T., Palm,C.J., Brooks,B. and Kosuge,T.
TITLE Nucleotide sequences of the Pseudomonas savastanoi indoleacetic
acid genes show homology with agrobacterium tumefaciens T-DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6522-6526 (1985)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 312)
AUTHORS Gaffney,T.D., da Costa e Silva,O., Yamada,T. and Kosuge,T.
TITLE The indoleacetic acid operon of Pseudomonas syringae subsp.
savastanoi: Transcription analysis and promoter identification
JOURNAL J. Bacteriol. (1990) In press
STANDARD simple staff_entry
COMMENT The iaaH and iaaM genes are part of an operon that is borne on a
plasmid, pIAA, in oleander strains of the pathogen. There appears
to be no genetic transformation of host tissue by P.savastanoi. No
Pribnow boxes were found, though potential ribosome binding sites
are located at positions 435-439 and 2164-2168. Expression of iaaH
depends upon the promoter for iaaM.
Draft entry and computer-readable sequence for [1] kindly submitted
by T.Yamada, 14-JAN-1986.
Draft entry and computer-readable sequence for [2] kindly submitted
by T.D.Gaffney, 28-JUN-1990.
FEATURES from to/span description
pept 454 2127 tryptophan 2-monooxygenase (EC 1.13.12.3)
pept 2170 3537 indoleacetamide hydrolase
pept 82 312 promoter proximal ORF
mRNA 54 > 3537 indoleacetic acid mRNA
site 19 47 indoleacetic acid operon promoter
binding 577 651 FAD binding site
BASE COUNT 813 a 917 c 1046 g 901 t
ORIGIN 1 bp upstream of EcoRI site.
1 ccacaaaccg gcaagacgtt gaaaatcaat ttcaagcatt gtatgttttt ggtgggacgc
61 cacttggaaa atctggtgct gatgcccgga cggactcctg cgaactggct ggacaaggcg
121 atggtggcag tggcccgaat cagacagcgc aagccccagg cggcagtacc cgcggcgctc
181 ctccaagccc cgcacgctat ggaattcgtt tggagcacca ccaaaacctt taccgaatgc
241 cattgtccgg cttacacccc ctatacgcaa gccagtgcgt ggtcggccag cgcggtcgtt
301 agtgcttatt aattttagag gactagcatg gtttgaagga atggttcagg tgttttggat
361 tttggcggct gccgttaaaa ggtgctgttt cagtttcatt taaagaatcg taatccgggt
421 agcacgtaag gtcgaagagg gtagagtcgc gttatgtatg accattttaa ttcacccagt
481 attgatattt tgtacgacta cggtcccttt ctgaaaaaat gtgaaatgac gggaggcata
541 ggcagctatt cagccggaac gcccacccct cgggtagcga tagtcggtgc cggcatcagt
601 gggctggtcg ctgcaactga actattacgt gcgggagtca aggacgttgt cttatatgaa
661 tcgcgtgatc gaatcggggg acgggtatgg tctcaagttt tcgatcagac tcgtccacgt
721 tacattgcag aaatgggtgc gatgcgcttt cctcccagtg caactggcct tttccactac
781 ctgaaaaagt ttggtatttc gacgtcgacc acctttccgg atcctggtgt ggtggacacg
841 gagctgcatt accgtggcaa gcgctatcac tggccagcgg gcaaaaagcc gcccgaatta
901 ttcaggcgag tctatgaggg gtggcagtct ctattgtccg aaggttacct ccttgaaggc
961 ggttctttag ttgccccgct ggacattacc gcaatgctga agtcgggtcg tctggaagag
1021 gcagcgatcg catggcaggg atggctcaat gtattccggg attgttcatt ctataacgcg
1081 attgtctgta tttttactgg ccgccatccg ccaggcggcg acagatgggc tcgtcctgaa
1141 gactttgagc tgtttggctc gcttggcata ggctcgggcg ggtttttgcc agtctttcag
1201 gctggcttta cggaaatact gcggatggtt atcaacggat accaaagtga ccagcgactg
1261 attccggacg ggatatccag tctggccgcg agactcgctg atcagtcgtt tgacggcaaa
1321 gcgttaaggg accgcgtttg ttttagccgg gtaggtcgca tttccagaga agctgaaaaa
1381 atcatcatcc agacggaagc aggagaacag cgtgtatttg atcgagtaat tgtcactagc
1441 agtaatcggg ccatgcaaat gattcactgc ctcacggata gcgagagctt tctgagtcgt
1501 gatgtcgctc gtgctgtccg cgaaacccat ctgacaggtt catcgaagct tttcattctc
1561 acccgaacca aattctggat aaaaaacaag cttcccacca ccatccagtc ggacggtctg
1621 gtgcgcggcg tctattgtct ggattatcag cccgatgaac ctgaggggca tggcgttgtt
1681 ctgctcagtt acacgtggga agacgacgct caaaaaatgc tggcgatgcc tgacaagaaa
1741 acgcgttgcc aggtactggt tgatgacctt gctgcgatac acccgacgtt cgccagttat
1801 ctcctgcccg ttgatgggga ttatgagcgg tatgtattgc accatgactg gctcaccgat
1861 ccccattctg cgggcgcttt caaactcaat tatcccggcg aggacgttta ctcgcagcga
1921 ttgttttttc aaccaatgac agcgaacagt cccaataaag acacggggct ctatctggct
1981 ggctgcagtt gctcttttgc cggagggtgg atcgaaggtg ctgtccagac agcattgaac
2041 agtgcttgcg cggtgctgcg cagcaccgga gggcaactgt caaaaggcaa cccgctggac
2101 tgtatcaacg cctcctatcg ctattaacag tcctgagagc agcgctaagc taatacgggt
2161 gaaaagagca tgcatgaaat aatcacactt gaatcgctgt gccaggcatt ggctgatggc
2221 gagattgccg ctgcggagct gcgcgagcgt gcgctcgata ctgaggcccg cttggcccgt
2281 ttgaactgtt tcatacgtga gggtgatgcg gtcagccagt ttggcgaggc tgatcatgca
2341 atgaagggta ctccactctg gggaatgccg gtttccttca aggataatat ctgtgttcgc
2401 ggtttgccgt tgacagccgg gacgcgagga atgtccggtt tcgtatctga tcaggatgct
2461 gcgattgtca gtcaactcag agccctcggg gctgtggtcg ctggcaagaa caacatgcac
2521 gaactcagtt tcggagtgac ctccatcaat cctcattggg gaaccgtggg aaaccccgtg
2581 gcccccggtt attgtgccgg aggtagtagt ggtggaagtg ccgccgcagt ggcaagtgga
2641 attgttccgc tgtcggtggg gaccgacacg gggggctcga taagaatacc ggcggccttc
2701 tgcggcatta cgggcttcag acccactact ggacgctggt caacggcagg cattatccct
2761 gtttctcata caaaggattg cgttgggttg ctgacacgaa cggcaggcga cgccggattt
2821 ttgtacggac tgttgtcagg caagcagcag tcttttcccc tgagcaggac ggccccctgt
2881 cgaatcggcc taccggtctc catgtggtcc gatctggatg gcgaggtgga gagggcatgc
2941 gtgaatgcac tcagcctgct gcgcaagaca gggttcgagt ttattgaaat tgatgatgcc
3001 gatattgtcg aactgaacca gacactcacg ttcaccgttc cgctttacga attctttgcc
3061 gaccttgctc agtcattgct ctccttgggc tggaagcacg gtatccatca tatttttgca
3121 caggttgatg atgccaacgt gaaaggcatc atcaatcacc accttgggga gggcgctata
3181 aaaccggctc actatttgag ttcactgcaa aatggcgagc tgttgaaaag aaaaatggat
3241 gagctatttg ctcgtcataa tatcgagctc ctggggtacc ccacggtacc ttgccgggta
3301 cctcatctgg atcatgctga ccgaccagag tttttcagtc aggcaattcg caataccgac
3361 ctggccagca atgcgatgct cccctccatt actattcctg tcgggccaga agggcgattg
3421 ccggtcggct tgagctttga tgctctaaga gggcgagacg cacttttgct gagcagagtc
3481 agtgccatag agcaggtatt aggttttgta cgaaaagtat tgccgcatac aacgtaggct
3541 acaggccagc gtgaccatgg ctgcgtagct cttggccagc ttgtcgtagc gggtgccgat
3601 tcggcggttc tcttttagcc agccaaacat ccgctcaatg atgttgcgct gccgatactt
3661 tggacgatcg aagagtc
//
LOCUS SVCG3E 68 bp ss-RNA VRL 07-AUG-1990
DEFINITION Spring viremia of carp virus genome, 3' end.
ACCESSION M35836
KEYWORDS .
SOURCE Spring viremia of carp virus RNA.
ORGANISM Spring viremia of carp virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae.
REFERENCE 1 (bases 1 to 68)
AUTHORS Roy,P., Gupta,K.C. and Kiuchi,A.
TITLE Characterization of Spring viremia of carp virus mRNA species and
the 3' sequence of the viral RNA
JOURNAL Virus Res. 1, 189-202 (1984)
STANDARD simple staff_entry
BASE COUNT 20 a 13 c 10 g 25 t
ORIGIN
1 tgcttctgtt tatttggtaa ctattgtcaa tgtaccaaat aaacagacca ttgtctgtag
61 tactcaca
//
LOCUS TTHSUCDA 880 bp ds-DNA BCT 07-AUG-1990
DEFINITION T.aquaticus B succinyl-CoA synthetase alpha-subunit (sucD) gene,
complete cds.
ACCESSION M35832 X12815
KEYWORDS sucD gene; succinyl-CoA synthetase alpha-subunit.
SOURCE T.aquaticus (strain B (NCIB 11247)) DNA.
ORGANISM Thermus aquaticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 880)
AUTHORS Nicholls,D.J., Sundaram,T.K., Atkinson,T. and Minton,N.P.
TITLE Nucleotide sequence of the succinyl-CoA synthetase alpha-subunit
from Thermus aquaticus B
JOURNAL Nucleic Acids Res. 16, 9858-9858 (1988)
STANDARD full staff_entry
COMMENT Gene product mature structure is (alpha beta)2.
Submitted (09-SEP-1988) on tape to the EMBL data library.
FEATURES from to/span description
pept 14 880 succinyl-CoA synthetase alpha-subunit (sucD;
EC 6.2.1.5)
binding 2 7 sucD ribosomal binding site (put.)
BASE COUNT 153 a 304 c 310 g 113 t
ORIGIN
1 tgggaggtgc ggcgtgatcc tggtgaaccg cgagacccgc gtcctggtcc agggcatcac
61 cggccgggag gggcagttcc acaccaagca gatgctggac tacggcacca agatcgtcgc
121 cggggtcacc ccgggcaaag ggggaacgga ggtcctaggg gtccccgtct acgacacggt
181 gaaggaggcg gtggcccacc acgaggtgga cgcctccatc atcttcgtgc ccgccccggc
241 cgcggcggac gccgccctgg aagcggccca cgccgggatc cccctcatcg tcctcatcac
301 cgagggcatc cccaccctgg acatggtgcg ggcggtggag gagatcaagg ccctgggaag
361 ccgcctcatc ggggggaact gccccgggat catcagcgcc cacctaccca agatcgggat
421 catgcccggc cacgtcttca agcggggccg ggtggggatc atcagccgct ccggcaccct
481 tcggcacgag gccgcagccg ccctttccca ggcggggctc ggcaccacca ccacggtggg
541 gatcgggggc gaccccgtca ggcggctcac cttcaaggac ctcctccccc tcttcaacga
601 ggacccggag acggaggccg tggtcctcat cggggagatc ggcggctccg acgaggagga
661 ggcggcggct tgggtgaagg accacatgaa gaagccggtg gtgggcttca tcggaggccg
721 ctccgccccc aagggcaagc gcatgggcca cgccggggcc atcatcatgg gcaacgtggg
781 caccccggag tccaagctcc gggccttcgc cgaggcgggc atccccgtgg ccgacaccat
841 cgacgagatc gtggagctgg tcaagaaggc cctgggctaa
//
LOCUS MUSMHPCRE 256 bp ds-DNA ROD 07-AUG-1990
DEFINITION Mouse MHC class I Q even processed pseudogene, exons 7 and 8.
ACCESSION M30902
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex;
pseudogene.
SOURCE Mouse adult (strain SK/CamE1) liver.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 256)
AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
Dilwith,R. and Flaherty,L.
TITLE Genetic polymorphisms of Q region genes from wild derived mice:
Implications for Q region evolution
JOURNAL Immunogenetics 31, 315-325 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Flaherty 04-JAN-1990.
FEATURES from to/span description
pept.ps / 45 82 pseudo-MHC Q even, exon 7 (AA at 46)
191 219 pseudo-MHC Q even, exon 8
IVS < 1 44 pseudo-MHC Q even intron A (no splice
consensus)
IVS 83 190 pseudo-MHC Q even intron B (no splice
consensus)
BASE COUNT 52 a 52 c 70 g 82 t
ORIGIN Chromosome 17.
1 agatatatct ttgtatatat ttttccctag gcagggacag ctcccagagc tctgatatgt
61 ttctctcaag attgtaaagg tgacattcta tggcctgatt gcagaggggc actgtggaca
121 tggttgtgtt tcagggactc ccacaatccc ctgtgagtgg tgggttgttg ggatattgtc
181 ttcattgtgg tggttcctga ccctcgttct ctattatgaa gacagctgcc tggagtggac
241 tcagtgacag ccagtg
//
LOCUS MUSMHPCRM 264 bp ds-DNA ROD 07-AUG-1990
DEFINITION Mouse MHC class I Q odd processed pseudogene, exons 7 and 8.
ACCESSION M30901
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex;
pseudogene.
SOURCE Mouse adult (strain Metkovic 2) liver.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
Dilwith,R. and Flaherty,L.
TITLE Genetic polymorphisms of Q region genes from wild derived mice:
Implications for Q region evolution
JOURNAL Immunogenetics 31, 315-325 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Flaherty 04-JAN-1990.
FEATURES from to/span description
pept.ps / 45 81 pseudo-MHC Q odd, exon 7 (AA at 45)
194 225 pseudo-MHC Q odd, exon 8
IVS < 1 44 pseudo-MHC Q odd intron A (no splice
consensus)
IVS 82 193 pseudo-MHC Q odd intron B
BASE COUNT 59 a 53 c 76 g 76 t
ORIGIN Chromosome 17.
1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata
61 gatctctccc agatggtaaa ggtgacactc tggggcctga ttggagaggg gcaatgtgga
121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat
181 gttgtcttca cagtgatggg tcatgtctct cattctctag catgaagaca gctgcctgga
241 ctgcactgag tgacagacga tgtg
//
LOCUS MUSMHQ4D 261 bp ds-DNA ROD 07-AUG-1990
DEFINITION Mouse MHC class I Q4d Qb-1 processed pseudogene (d haplotype),
exons 7 and 8.
ACCESSION M30896
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex;
pseudogene.
SOURCE Mouse adult (strain Balb/c, haplotype d) sperm, clone 65.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 261)
AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
Dilwith,R. and Flaherty,L.
TITLE Genetic polymorphisms of Q region genes from wild derived mice:
Implications for Q region evolution
JOURNAL Immunogenetics 31, 315-325 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Flaherty 04-JAN-1990.
FEATURES from to/span description
pept.ps / 45 82 peudo-MHC Q4d Qb-1, exon 7 (AA at 46)
191 222 peudo-MHC Q4d Qb-1, exon 8
IVS < 1 44 peudo-MHC Q4d Qb-1 intron A (no splice
consensus)
IVS 83 190 peudo-MHC Q4d Qb-1 intron B (no splice
consensus)
BASE COUNT 54 a 52 c 70 g 85 t
ORIGIN Chromosome 17.
1 tatagatata tctttgtata tatttttccc taggcaggga cagctcccag agctctgata
61 tgtttctctc aagattgtaa aggtgacatt ctatggcctg attgcagagg ggcactgtgg
121 acatggttgt gtttcaggga ctcccacaat cccctgtgag tggtgggttg ttgggatatt
181 gtcttcattg tggtggttcc tgaccctcat tctctatcat gaagacagct gcctggagtg
241 gacttagtga cagccagtgt g
//
LOCUS MUSMHQ5B 264 bp ds-DNA ROD 07-AUG-1990
DEFINITION Mouse MHC class I Q5B processed pseudogene (b haplotype),
exons 7 and 8.
ACCESSION M30897
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex;
pseudogene.
SOURCE Mouse adult (strain B10, haplotype b) liver, clone B2.5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
Dilwith,R. and Flaherty,L.
TITLE Genetic polymorphisms of Q region genes from wild derived mice:
Implications for Q region evolution
JOURNAL Immunogenetics 31, 315-325 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Flaherty 04-JAN-1990.
FEATURES from to/span description
pept.ps / 45 81 pseudo-MHC Q5b, exon 7 (AA at 45)
194 225 pseudo-MHC Q5b, exon 8
IVS < 1 44 pseudo-MHC Q5b intron A (no splice consensus)
IVS 82 193 pseudo-MHC Q5b intron B
BASE COUNT 58 a 55 c 77 g 74 t
ORIGIN Chromosome 17.
1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata
61 gatctctccc agatggtaaa ggtgacactc tgggacctga ttggggaggg gcaatgcgga
121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat
181 gttgtcttca cagtgatggg tcgtgtccct cattctctag catgaagaca gctgcctgga
241 ctgcactgag tgacagacga tgtg
//
LOCUS MUSMHQ5D 264 bp ds-DNA ROD 07-AUG-1990
DEFINITION Mouse MHC class I Q5d processed pseudogene (d haplotype),
exons 7 and 8.
ACCESSION M30898
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex;
pseudogene.
SOURCE Mouse adult (strain Balb/c, haplotype d) sperm, clone 65.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
Dilwith,R. and Flaherty,L.
TITLE Genetic polymorphisms of Q region genes from wild derived mice:
Implications for Q region evolution
JOURNAL Immunogenetics 31, 315-325 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Flaherty 04-JAN-1990.
FEATURES from to/span description
pept.ps / 45 81 pseudo-MHC Q5d, exon 7 (AA at 45)
194 225 pseudo-MHC Q5d, exon 8
IVS < 1 44 pseudo-MHC Q5d intron A (no splice consensus)
IVS 82 193 pseudo-MHC Q5d intron B
BASE COUNT 58 a 53 c 77 g 76 t
ORIGIN Chromosome 17.
1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata
61 gatctctccc agatggtaaa ggtgacactc tggggcctga ttggggaggg gcaatgtgga
121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat
181 gttgtcttca cagtgatggg tcatgtctct cattctctag catgaagaca gctgcctgga
241 ctgcactgag tgacagacga tgtg
//
LOCUS MUSMHQ6B 261 bp ds-DNA ROD 07-AUG-1990
DEFINITION Mouse MHC class I Q6b Qa-2 processed pseudogene (b haplotype),
exons 7 and 8.
ACCESSION M30899
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex;
pseudogene.
SOURCE Mouse adult (strain B10, haplotype b) liver, clone B2.17.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 261)
AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
Dilwith,R. and Flaherty,L.
TITLE Genetic polymorphisms of Q region genes from wild derived mice:
Implications for Q region evolution
JOURNAL Immunogenetics 31, 315-325 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Flaherty 04-JAN-1990.
FEATURES from to/span description
pept.ps / 45 82 pseud-MHC Q6b Qa-2, exon 7 (AA at 46)
191 222 pseud-MHC Q6b Qa-2, exon 8
IVS < 1 44 pseud-MHC Q6b Qa-2 intron A (no splice
consensus)
IVS 83 190 pseud-MHC Q6b Qa-2 intron B (no splice
consensus)
BASE COUNT 53 a 52 c 71 g 85 t
ORIGIN Chromosome 17.
1 tatagatata tctttgtata tatttttccc taggcaggga cagctcccag agctctgata
61 tgtttctctc aagattgtaa aggtgacatt ctatggcctg attgcagagg ggcactgtgg
121 acatggttgt gtttcaggga ctcccacaat cccctgtgag tggtgggttg ttgggatatt
181 gtcttcattg tggtggttcc tgaccctcgt tctctatcat gaagacagct gcctggagtg
241 gacttagtga cagccagtgt g
//
LOCUS MUSMHQ89DA 264 bp ds-DNA ROD 07-AUG-1990
DEFINITION Mouse MHC class I Q8/Q9d Qa-2 processed pseudogene (d haplotype),
region surrounding exons 7 and 8.
ACCESSION M30900
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex;
pseudogene.
SOURCE Mouse adult (strain Balb/c, haplotype d) sperm, clone 46.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 264)
AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
Dilwith,R. and Flaherty,L.
TITLE Genetic polymorphisms of Q region genes from wild derived mice:
Implications for Q region evolution
JOURNAL Immunogenetics 31, 315-325 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Flaherty 04-JAN-1990.
FEATURES from to/span description
pept.ps / 45 81 peudo-MHC Q8/Q9d Qa-2, exon 7 (AA at 45)
194 225 peudo-MHC Q8/Q9d Qa-2, exon 8
IVS < 1 44 peudo-MHC Q8/Q9d Qa-2 intron A (no splice
consensus)
IVS 82 193 pseudo-MHC Q8/Q9d Qa-2 intron B (no splice
consensus)
BASE COUNT 57 a 54 c 78 g 75 t
ORIGIN Chromosome 17.
1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata
61 gatctctccc aggtggtaaa ggtgacactc tgggacctga ttggggaggg gcaatgtgga
121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat
181 gttgtcttca cagtgatggg tcgtgtccct cattctctag catgaagaca gctgcctgga
241 ctgcactgag tgacagacga tgtg
//
LOCUS PMUNE1 220 bp ds-DNA PHG 07-AUG-1990
DEFINITION Bacteriophage mu genomic left end.
ACCESSION M34919
KEYWORDS .
SEGMENT 1 of 2
SOURCE Bacteriophage mu DNA.
ORGANISM Bacteriophage mu
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 220)
AUTHORS Groenen,M.A.M., Timmers,E. and Van De Putte,P.
TITLE DNA sequences at the ends of the genome of Bacteriophage mu
essential for transposition
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2087-2091 (1985)
STANDARD simple staff_review
BASE COUNT 82 a 32 c 44 g 62 t
ORIGIN
1 tgtattgatt cacttgaagt acgaaaaaaa ccgggaggac attggattat tcgggatctg
61 atgggattag atttggtggg gcttgcaagc ctgtagtgca aattttagtc cttaatcaat
121 gaaacgcgaa agatagtaaa aaattgcttt tgtttcattg aaaatacgaa aaacaaaaac
181 actgcaaatc atttcaataa cagcttcaaa aaacgttcaa
//
LOCUS PMUNE2 220 bp ds-DNA PHG 07-AUG-1990
DEFINITION Bacteriophage mu genomic right end.
ACCESSION M34920
KEYWORDS .
SEGMENT 2 of 2
SOURCE Bacteriophage mu DNA.
ORGANISM Bacteriophage mu
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 220)
AUTHORS Groenen,M.A.M., Timmers,E. and Van De Putte,P.
TITLE DNA sequences at the ends of the genome of Bacteriophage mu
essential for transposition
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2087-2091 (1985)
STANDARD simple staff_review
BASE COUNT 71 a 47 c 41 g 61 t
ORIGIN
1 gtggtacaca aatttaatca gtatcgctac atcagattcc tgaacaaacg agcaaggaag
61 cggctaaata ccaaactatt caaggttcag gcatacccta agtgatcccc atgtaatgaa
121 taaaaagcag taattaatac atctgtttca tttgaagcgc gaaagctaaa gttttcgcat
181 ttatcgtgaa acgctttcgc gtttttcgtg cgccgcttca
//
LOCUS STYRGGA 103 bp ds-DNA BCT 07-AUG-1990
DEFINITION S.typhimurium 23S rRNA gene.
ACCESSION M35803
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE S.typhimurium (strain ATCC 13311) DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 103)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 > 103 23S ribosomal RNA
BASE COUNT 23 a 22 c 38 g 20 t
ORIGIN
1 ggaagatgta acggggctaa accatgcacc gaagctgcgg cagcgacgct gatgcgttgt
61 tgggtagggg agcgttctgt aagcctgcga aggtcacctg tga
//
LOCUS STYRGGB 104 bp ds-DNA BCT 07-AUG-1990
DEFINITION S.typhimurium 23S rRNA gene.
ACCESSION M35804
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE S.typhimurium (strain LB5010) DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 104)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 > 104 23S ribosomal RNA
BASE COUNT 25 a 22 c 34 g 23 t
ORIGIN
1 cctgcgcgga agatgtaacg gggctcaaac catacaccga agctacgggt tcatcctttg
61 gatgagcggt agaggagcgt tctgtaagcc tgtgaaggtg agtt
//
LOCUS STYRGGC 185 bp ds-DNA BCT 07-AUG-1990
DEFINITION S.typhimurium 23S rRNA gene.
ACCESSION M35819
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE S.typhimurium (strain ATCC 13311) DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 185)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 38 23S ribosomal RNA
145 > 185 23S ribosomal RNA
site 39 144 23S rRNA intervening sequence
BASE COUNT 49 a 36 c 60 g 40 t
ORIGIN
1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
61 aactttacgg agttgacgat tgacggagcg aagcgacgtc aaagcgttca ttaaagtcga
121 gttggcttag ggatacgttc gttgggtagg ggagcgttct gtaagcctgt gaaggtgtgt
181 cgtga
//
LOCUS YEPRGGA 85 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.pestis 23S rRNA gene.
ACCESSION M35798
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.pestis (strain EV76) DNA.
ORGANISM Yersinia pestis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 85)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 > 85 23S ribosomal RNA
BASE COUNT 15 a 16 c 34 g 20 t
ORIGIN
1 atgcaccgaa gctgcggcag cgacgcttag gcgttgttgg gtaggggagc gttctgtaag
61 ccgttgaagg tgacctgtga gggtt
//
LOCUS YEPRGGB 120 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.pseudotuberculosis 23S rRNA gene.
ACCESSION M35799
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.pseudotuberculosis (strain YPIII/pIB1; serotype III) DNA.
ORGANISM Yersinia pseudotuberculosis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 120)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 > 120 23S ribosomal RNA
BASE COUNT 23 a 24 c 47 g 26 t
ORIGIN
1 gtcggcctgc gcggaagatg taacggggct aaaccatgca ccgaagctgc ggcagcgacg
61 cttaggtgtt gttgggtagg ggagcgttct gtaagccgtt gaaggtgacc tgtgagggtt
//
LOCUS YEPRGGC 83 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35800
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 761/80; serotype 0:6,310) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 83)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 > 83 23S ribosomal RNA
BASE COUNT 14 a 16 c 34 g 19 t
ORIGIN
1 gcaccgaagc tgcggcagcg acgcttaggc gttgttgggt aggggagcgt tctgtaagcc
61 gttgaaggtg acctgtgagg gtt
//
LOCUS YEPRGGD 114 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35801
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 8081; serotype 0:8) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 114)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 > 114 23S ribosomal RNA
BASE COUNT 23 a 24 c 43 g 24 t
ORIGIN
1 gtcggcctgc gcggaagatg taacggggct aaaccatgca ccgaagctgc ggcagcgaca
61 cttaggtgtt gttgggtagg ggagcgttct gtaagccgtt gaaggtgacc tgtg
//
LOCUS YEPRGGE 104 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.bercovieri 23S rRNA gene.
ACCESSION M35802
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.bercovieri (strain 3016/84; serotype 0:58,16) DNA.
ORGANISM Yersinia bercovieri
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 104)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 > 104 23S ribosomal RNA
BASE COUNT 21 a 21 c 38 g 24 t
ORIGIN
1 cgcggaagat gtaacggggc taaaccatgc accgaagctg cggcagcttc gcttaggtgt
61 tgttgggtag gggagcgttc tgtaagccgt tgaaggtgac ctgt
//
LOCUS YEPRGGF 197 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35805
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 132; serotype 0:1) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 197)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 37 23S ribosomal RNA
159 > 197 23S ribosomal RNA
site 38 158 23S rRNA intervening sequence
BASE COUNT 52 a 43 c 60 g 42 t
ORIGIN
1 agatgtaacg gggctaaacc atgcaccgaa gctgcggcag cgaacgtatc acttaaaacg
61 cttaacgtga cgcaaagccg gcatgttcaa gccacactaa acgttgagtt ggccggtgtg
121 ctgacaaacg aacaggcgtt ttaaggatac gttcgttggg taggggagcg ttctgtaagc
181 cgttgaaggt ggcctgt
//
LOCUS YEPRGGG 179 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35806
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 1142; serotype 0:2) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 179)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 19 23S ribosomal RNA
141 > 179 23S ribosomal RNA
site 20 140 23S rRNA intervening sequence
BASE COUNT 47 a 40 c 52 g 40 t
ORIGIN
1 ccatgcaccg aagctgcggc agcgaacgta tcacttaaaa cgcttaacgt gacgcaaagc
61 cggcatgttc aagccacact aaacgttgag ttggccgatg tgctgacaaa cgaacaggcg
121 ttttaaggat acgttcgttg ggtaggggag cgttctgtaa gccgttgaag gtagtctgt
//
LOCUS YEPRGGH 206 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35807
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 6471/76; serotype 0:3) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 206)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 38 23S ribosomal RNA
160 > 206 23S ribosomal RNA
site 39 159 23S rRNA intervening sequence
BASE COUNT 54 a 43 c 65 g 44 t
ORIGIN
1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacttaaaac
61 gcttaacgtg acgcaaagcc ggcatgttca agccacacta aacgttgagt tggccggtgt
121 gctgacaaac gaacaggcgt tttaaggata cgttcgttgg gtaggggagc gttctgtaag
181 ccgttgaagg tggtctgtga gggctg
//
LOCUS YEPRGGI 175 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35808
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 7500; serotype 0:5,27) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 175)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 10 23S ribosomal RNA
132 > 175 23S ribosomal RNA
site 11 131 23S rRNA intervening sequence
BASE COUNT 46 a 36 c 55 g 38 t
ORIGIN
1 gaagctgcgg cagcgaacgt atcacttaaa acgcttaacg tgacgcaaag ccggcatgtt
61 caagccacac taaacgttga gttggccgat gtgctgacaa acgaacaggc gttttaagga
121 tacgttcgtt gggtagggga gcgttctgta agccgttgaa ggtgacctgt gaggg
//
LOCUS YEPRGGJ 190 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35809
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain Ruokola/71; serotype 0:9) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 190)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 30 23S ribosomal RNA
152 > 190 23S ribosomal RNA
site 31 151 23S rRNA intervening sequence
BASE COUNT 49 a 43 c 58 g 40 t
ORIGIN
1 acggggctaa accatgcacc gaagctgcgg cagcgaacgt atcacttaaa acgcttaacg
61 tgacgcaaag ccggcatgtt caagccacac taaacgttga gttggccggt gtgctgacaa
121 acgaacaggc gttttaagga tacgttcgtt gggtagggga gcgttctgta agccgttgaa
181 ggtggcctgt
//
LOCUS YEPRGGK 192 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35810
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain C36; serotype 0:15) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 192)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 31 23S ribosomal RNA
153 > 192 23S ribosomal RNA
site 32 152 23S rRNA intervening sequence
BASE COUNT 50 a 42 c 59 g 40 t 1 others
ORIGIN
1 aacggggcta aaccatgcac cgaagctgcg gcagcgaacg tatcacttaa aacgcttaac
61 gtgacgcaaa gccggcangt tcaagccaca ctaaacgttg agttggccgg tgtgctgaca
121 aacgaacagg cgttttaagg atacgttcgt tgggtagggg agcgttctgt aagccgttga
181 aggtggtctg tg
//
LOCUS YEPRGGL 177 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35811
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain E701; serotype 0:4,32) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 177)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 29 23S ribosomal RNA
139 > 177 23S ribosomal RNA
site 30 138 23S rRNA intervening sequence
BASE COUNT 43 a 40 c 57 g 37 t
ORIGIN
1 cggggctaaa ccatgcaccg aagctgcggc agcgaacgta tcacccaagc caattcacgg
61 cagtggataa tgattgacgg agcgcagcga cgtcaatgcg tccaataaag tctagttggc
121 ttagggatac gttcgttggg taggggagcg ttctgtaagc cgttgaaggt ggcctgt
//
LOCUS YEPRGGM 191 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35812
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 8081; serotype 0:8) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 191)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 38 23S ribosomal RNA
148 > 191 23S ribosomal RNA
site 39 147 23S rRNA intervening sequence
BASE COUNT 51 a 39 c 63 g 38 t
ORIGIN
1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt
121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
181 acctgtgagg g
//
LOCUS YEPRGGN 158 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35813
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 5081; serotype 0:13a,13b) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 158)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 17 23S ribosomal RNA
127 > 158 23S ribosomal RNA
site 18 126 23S rRNA intervening sequence
BASE COUNT 41 a 33 c 51 g 33 t
ORIGIN
1 atgcaccgaa cgtgcggcag cgaacgtatc acccaagaca attcacggca gtggataatg
61 attgacggag cgcagcgacg tcaatgcgtc caataaagtc gagttggctt agggatacgt
121 tcgttgggta ggggagcgtt ctgtaagccg ttgaaggt
//
LOCUS YEPRGGO 180 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35814
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 9312/78; serotype 0:13,18) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 38 23S ribosomal RNA
148 > 180 23S ribosomal RNA
site 39 147 23S rRNA intervening sequence
BASE COUNT 50 a 36 c 58 g 36 t
ORIGIN
1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
61 aattcacgga agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt
121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
//
LOCUS YEPRGGP 180 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35815
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 874/77; serotype 0:20) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 38 23S ribosomal RNA
148 > 180 23S ribosomal RNA
site 39 147 23S rRNA intervening sequence
BASE COUNT 49 a 37 c 58 g 36 t
ORIGIN
1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt
121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
//
LOCUS YEPRGGQ 180 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35816
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain E736; serotype 0:21) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 38 23S ribosomal RNA
148 > 180 23S ribosomal RNA
site 39 147 23S rRNA intervening sequence
BASE COUNT 48 a 38 c 58 g 36 t
ORIGIN
1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccactaaagt
121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
//
LOCUS YEPRGGR 180 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.enterocolitica 23S rRNA gene.
ACCESSION M35817
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.enterocolitica (strain 2139/72; serotype 0:34) DNA.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA < 1 38 23S ribosomal RNA
148 > 180 23S ribosomal RNA
site 39 147 23S rRNA intervening sequence
BASE COUNT 49 a 37 c 58 g 36 t
ORIGIN
1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt
121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
//
LOCUS YEPRGGS 153 bp ds-DNA BCT 07-AUG-1990
DEFINITION Y.bercovieri 23S rRNA gene.
ACCESSION M35818
KEYWORDS 23S ribosomal RNA; ribosomal RNA.
SOURCE Y.bercovieri (strain 3016/84; serotype 0:58,16) DNA.
ORGANISM Yersinia bercovieri
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 153)
AUTHORS Skurnik,M. and Toivanen,P.
TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of
pathogenic Yersinia enterocolitica strains. The IVSs in
Y.enterocolitica and Salmonella typhimurium have common origin
JOURNAL Unpublished (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Skurnik, 29-JUN-1990.
Author address: M.Skurnik
Turku University
Dept of Medical Microbiology
Kiinanmyllynkatu 13
SF-20520 Turku
FINLAND
email:mskurnik@finabo
FEATURES from to/span description
rRNA 110 > 153 23S ribosomal RNA
site < 1 109 23S rRNA intervening sequence
BASE COUNT 39 a 30 c 51 g 33 t
ORIGIN
1 cagcgaacgt atcacccaag acaattcacg gcagtggata atgattgacg gagcgcagcg
61 acgtcaatgc gtccaataaa gtcgagttgg cttagggata cgttcgttgg gtaggggagc
121 gttctgtaag ccgttgaagg tgacctgtga ggg
//
LOCUS ECOINSW 49 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli insertion site for transposon Tn1545.
ACCESSION M31752
KEYWORDS .
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 49)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 26 27 transposon Tn1545 insertion site
BASE COUNT 12 a 10 c 0 g 27 t
ORIGIN
1 cttttcccaa catttccttt tttattatta aaaatcattt ttttcttca
//
LOCUS ECOINSX 49 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli insertion site for transposon Tn1545.
ACCESSION M31754
KEYWORDS .
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 49)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 26 27 transposon Tn1545 insertion site
BASE COUNT 21 a 4 c 6 g 18 t
ORIGIN
1 atttgaaaat gctgattctt ttatagataa aaatagcaat gcttaaata
//
LOCUS ECOINSY 49 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli insertion site for transposon Tn1545.
ACCESSION M34227
KEYWORDS .
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 49)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 26 27 transposon Tn1545 insertion site
BASE COUNT 16 a 8 c 5 g 20 t
ORIGIN
1 tgttggattt catttcactt tctccataaa aaaaaacggt aatcatttt
//
LOCUS ECOINSZ 49 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli insertion site for transposon Tn1545.
ACCESSION M35413
KEYWORDS .
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 49)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 26 27 transposon Tn1545 insertion site
BASE COUNT 18 a 6 c 11 g 14 t
ORIGIN
1 cacctcatac atttgatgtt tatcagtaaa aaaatgggag gattaaggg
//
LOCUS ECOT1545A1 136 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli/transposon Tn1545 5' junction.
ACCESSION M35311
KEYWORDS .
SEGMENT 1 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 136)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 26 27 E.coli end/ transposon Tn1545 start
BASE COUNT 49 a 28 c 10 g 49 t
ORIGIN
1 cttttcccaa catttccttt tttattttaa aaaatagcat aaaaatctag ttatccgcat
61 aaaaactgga cttatcacac tttatcaagg tcaaaaccac tcaatttact actaatttac
121 tacttatgaa tgagct
//
LOCUS ECOT1545A2 212 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli/transposon Tn1545 3' junction.
ACCESSION M35315
KEYWORDS .
SEGMENT 2 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 189 190 transposon Tn1545 end/ E.coli start
BASE COUNT 84 a 29 c 28 g 71 t
ORIGIN
1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac
61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac
121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact
181 ttgtttatta ttaaaaatca tttttttctt ca
//
LOCUS ECOT1545B1 135 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli/transposon Tn1545 5' junction.
ACCESSION M35312
KEYWORDS .
SEGMENT 1 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 135)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review staff_entry
FEATURES from to/span description
recomb 26 27 E.coli end/ transposon Tn1545 start
BASE COUNT 53 a 23 c 14 g 45 t
ORIGIN
1 atttgaaaat gctgattctt ttatagataa aaatagcata aaaatctagt tatccgcata
61 aaaactggac ttatcacact ttatcaaggt caaaaccact caatttacta ctaatttact
121 acttatgaat gagct
//
LOCUS ECOT1545B2 212 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli/transposon Tn1545 3' junction.
ACCESSION M35316
KEYWORDS .
SEGMENT 2 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 189 190 transposon Tn1545 end/E.coli start
BASE COUNT 88 a 29 c 30 g 65 t
ORIGIN
1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac
61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac
121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact
181 ttgtttctta taaaaatagc aatgcttaaa ta
//
LOCUS ECOT1545C1 135 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli/transposon Tn1545 5' junction.
ACCESSION M35313
KEYWORDS .
SEGMENT 1 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 135)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 26 27 E.coli end/ transposon Tn1545 start
BASE COUNT 48 a 27 c 13 g 47 t
ORIGIN
1 tgttggattt catttcactt tctccattaa aaatagcata aaaatctagt tatccgcata
61 aaaactggac ttatcacact ttatcaaggt caaaaccact caatttacta ctaatttact
121 acttatgaat gagct
//
LOCUS ECOT1545C2 212 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli/transposon Tn1545 3' junction.
ACCESSION M35317
KEYWORDS .
SEGMENT 2 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 189 190 transposon Tn1545 end/ E.coli start
BASE COUNT 88 a 28 c 31 g 65 t
ORIGIN
1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac
61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac
121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact
181 ttgttttgat aaaaaaaaac ggtaatcatt tt
//
LOCUS ECOT1545D1 135 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli/transposon Tn1545 5' junction.
ACCESSION M35314
KEYWORDS .
SEGMENT 1 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 135)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 26 27 E.coli end/ transposon Tn1545 start
BASE COUNT 51 a 28 c 13 g 43 t
ORIGIN
1 cacctcatac atttgatgtt tatcagctaa aaatagcata aaaatctagt tatccgcata
61 aaaactggac ttatcacact ttatcaaggt caaaaccact caatttacta ctaatttact
121 acttatgaat gagct
//
LOCUS ECOT1545D2 212 bp ds-DNA BCT 07-AUG-1990
DEFINITION E.coli/transposon Tn1545 3' junction.
ACCESSION M35318
KEYWORDS .
SEGMENT 2 of 2
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Caillaud,F. and Courvalin,P.
TITLE Nucleotide sequence of the ends of the conjugative shuttle
transposon Tn1545
JOURNAL Mol. Gen. Genet. 209, 110-115 (1987)
STANDARD simple staff_review
FEATURES from to/span description
recomb 189 190 transposon Tn1545 end/ E.coli start
BASE COUNT 87 a 26 c 37 g 62 t
ORIGIN
1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac
61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac
121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact
181 ttgttttgat aaaaaaatgg gaggattaag gg
//GenBank-Updates@genbank.bio.net (08/08/90)
LOCUS MUSMHC4B1 278 bp ds-DNA ROD 08-AUG-1990
DEFINITION Mouse MHC class II H2-C4-w7 gene (C3H, w7), 5' flank, segment 1.
ACCESSION M15601
KEYWORDS class III gene; complement protein; complement system protein;
major histocompatibility complex; protease.
SEGMENT 1 of 2
SOURCE Mouse (C3H.W7) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 278)
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
TITLE Recombination of two homologous MHC class III genes of the mouse
(C4 and Slp) that accounts for the loss of testosterone dependence
of sex-limited protein expression
JOURNAL J. Immunol. 138, 620-627 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 278; revises [1])
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
JOURNAL Unpublished (1990)
STANDARD simple staff_review
FEATURES from to/span description
revision 81 82 ca in [2]; ac in [1]
BASE COUNT 67 a 94 c 62 g 55 t
ORIGIN Chromosome 17.
1 gtcaacagtt attgcacatg cttaatgtaa acctgtagcc tctacatgct tgtgtacctg
61 cccagacagt atatgacacc cacacccaca cccacaccca cacccacacc caccccaccc
121 cacttcacac ccaggcagca gtagtggacc caacagggta agtgggtccc tgcttgggtt
181 gcctgtgttt ctggaagctc aaggtccatg catcagcagt aggggaaggg agggttctca
241 gcccaggctg catcatcatg gccgcctacc acctgcag
//
LOCUS MUSMHSLQ2 314 bp ds-DNA ROD 08-AUG-1990
DEFINITION Mouse MHC class III H2-Slp-w7d gene (C3H,w7), 5' flank, segment 2.
ACCESSION M15609
KEYWORDS class III gene; complement protein; complement system protein;
major histocompatibility complex.
SEGMENT 2 of 2
SOURCE Mouse (C3H.W7) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 314)
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
TITLE Recombination of two homologous MHC class III genes of the mouse
(C4 and Slp) that accounts for the loss of testosterone dependence
of sex-limited protein expression
JOURNAL J. Immunol. 138, 620-627 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 314; revises [1])
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
JOURNAL Unpublished (1990)
STANDARD simple staff_review
FEATURES from to/span description
revision 119 121 cac in [1]; c in [1]
BASE COUNT 81 a 88 c 72 g 73 t
ORIGIN About 1064 bp downstream from segment 1; chromosome 17.
1 tcgactgggg aacccattct agttttgtta ggacctttac ctgtcttatc tatgtgtcat
61 gttggagctg aatacacgca cacatgggca agcacatgta ttcatacaca cacacacaca
121 cccacacgca catgcagagg gacacgcaca cgcacacaaa cacacacaca aggggaagtc
181 cctggtactg ggaataaaca agcaccagat caattttggg gccaagattt ctacagagac
241 cctggtgttt gggttgcctc cccagctctg gcctagggcc agttctgttt cttggccatc
301 acgtggtttc ccgg
//
LOCUS MUSMHSLR2 320 bp ds-DNA ROD 08-AUG-1990
DEFINITION Mouse MHC class III H2-Slp-w7c gene (C3H,w7), 5' flank, segment 2.
ACCESSION M15610
KEYWORDS class III gene; complement protein; complement system protein;
major histocompatibility complex.
SEGMENT 2 of 2
SOURCE Mouse (C3H.W7) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 320)
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
TITLE Recombination of two homologous MHC class III genes of the mouse
(C4 and Slp) that accounts for the loss of testosterone dependence
of sex-limited protein expression
JOURNAL J. Immunol. 138, 620-627 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 320; revises [1])
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
JOURNAL Unpublished (1990)
STANDARD simple staff_review
FEATURES from to/span description
revision 124 126 aca in [2]; a in [1]
BASE COUNT 83 a 92 c 71 g 74 t
ORIGIN About 1064 bp downstream from segment 1; chromosome 17.
1 tcgactgggg aacccattct agttttttta ggacctttac ctgtcttatc tatgtgtcat
61 gttggagctg aatacacgca cacatgggca agcacatgta ttcgtacaca cacacacaca
121 cacacaccca cacgcacatg cacagggaca cgcacacgca cacaaacaca cacacaaggg
181 gaagtccctg gtactgggaa taaacaagca ccagatcaat tttggggcca agatttctac
241 agagaccctg gtgtttgggt tgcctcccca gctctggcct agggccagtt ctgtttcttg
301 gccatcacgt ggtttcccgg
//
LOCUS MUSMHSLS2 294 bp ds-DNA ROD 08-AUG-1990
DEFINITION Mouse MHC class III H2-Slp-w7b gene (C3H,w7), 5' flank, segment 2.
ACCESSION M15611
KEYWORDS class III gene; complement protein; complement system protein;
major histocompatibility complex.
SEGMENT 2 of 2
SOURCE Mouse (C3H.W7) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 294)
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
TITLE Recombination of two homologous MHC class III genes of the mouse
(C4 and Slp) that accounts for the loss of testosterone dependence
of sex-limited protein expression
JOURNAL J. Immunol. 138, 620-627 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 294; revises [1])
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
JOURNAL Unpublished (1990)
STANDARD simple staff_review
FEATURES from to/span description
revision 232 232 a in [2]; g in [1]
BASE COUNT 76 a 80 c 66 g 72 t
ORIGIN About 1064 bp downstream from segment 1; chromosome 17.
1 tcgactgggg aacccattct agttttgtta ggacctttac ctgtcttatc tatgtgtcat
61 gttggagctg aatacacgca cacatggaca agcacatgta ttcgtacaca cacacacaca
121 cacacgcaca cgcacacaaa cacacacaca aggggaagtc cctggtactg ggaataaaca
181 aggaccagat caattttggg gccaagattt ctacagagac cctggtgttt gagttgcctc
241 cccagctctg gcctagggcc agttctgttt cttggccatc acgtggtttc ccgg
//
LOCUS MUSMHSLT1 253 bp ds-DNA ROD 08-AUG-1990
DEFINITION Mouse MHC class III H2-Slp-w7a gene (C3H,w7), 5' flank, segment 1.
ACCESSION M15605
KEYWORDS class III gene; complement protein; complement system protein;
major histocompatibility complex.
SEGMENT 1 of 2
SOURCE Mouse (C3H.W7) liver DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 253)
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
TITLE Recombination of two homologous MHC class III genes of the mouse
(C4 and Slp) that accounts for the loss of testosterone dependence
of sex-limited protein expression
JOURNAL J. Immunol. 138, 620-627 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 253; revises [1])
AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
Pattanakitsakul,S.-N. and Takahashi,M.
JOURNAL Unpublished (1990)
STANDARD simple staff_review
FEATURES from to/span description
revision 95 96 ct in [2]; ta in [1]
revision 138 144 tgggtc in [2]; gg in [1]
revision 148 150 ctt in [2]; c in [1]
revision 174 176 tca in [2]; t in [1]
revision 187 189 atc in [2]; a in [1]
BASE COUNT 58 a 76 c 62 g 57 t
ORIGIN Chromosome 17.
1 gtcaacagtt attgcacata cttaatgtaa acctgtagcc tctacatgct tgtgtacctg
61 cccagacagt atatgacacg cacacaccca cacactcccc cacacccagg cagctgtggt
121 ggacccaaca gggtaagtgg gtccctgctt gggttgcctg tgtttctgga agctcaatgt
181 ccacgcatca gcagttgggg aagggagggt tctcagccca ggctgcatca tcatggctgc
241 ctaccacctg cag
//
LOCUS BOVCMPR 7913 bp ss-mRNA MAM 08-AUG-1990
DEFINITION Bovine cation-independent mannose 6-phosphate receptor protein
mRNA, complete cds.
ACCESSION J03527 M15869
KEYWORDS cation-independent mannose 6-phosphate receptor protein;
insulin-like growth factor II receptor protein.
SOURCE Bovine (calf), cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 3267 to 7913)
AUTHORS Lobel,P., Dahms,N.M., Breitmeyer,J., Chirgwin,J.M. and Kornfeld,S.
TITLE Cloning of the bovine 215-kDa cation-independent mannose
6-phosphate receptor
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2233-2237 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 7913; revises [1])
AUTHORS Lobel,P., Dahms,N.M. and Kornfeld,S.
TITLE Cloning and sequence analysis of the cation-independent mannose
6-phosphate receptor
JOURNAL J. Biol. Chem. 263, 2563-2570 (1988)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.Lobel, 22-JUL-1987 and for [2] 23-NOV-1988.
FEATURES from to/span description
pept 153 7652 cation-independent mannose 6-phosphate
receptor protein
BASE COUNT 1665 a 2401 c 2413 g 1434 t
ORIGIN
1 cgcccctcga gctcccccgg ctcgtccgtg gctcctcgcg ctcctgtccg tctccgcctg
61 ccgccctggc tgcacacccg tgtcccggac gccgcccccg gccgccgcgc gccgctcgcc
121 cggctccgga cgcgcagccc gggcccggcg cgatggaggc ggccgccggc cggagctcgc
181 acctggggcc cgcgcccgcc gggcgcccgc cgcggtgccc gctcctgctg cagctgcagc
241 tgctgctgct gctgctgctg ctgccgccgg gctgggttcc cggggccgcg ggcacccagg
301 gcgccgagtt cccagagctg tgcagttata catgggaagc agtggatacc aaaaataaca
361 tgctttataa aatcaacatc tgtggaaata tgggtgttgc ccagtgtgga ccatcaagtg
421 ctgtctgtat gcatgacttg aagacagaca gctttcattc tgtgggtgac tctcttttga
481 aaacagcaag cagatctctt ctggaattta acacaacagt gaactgtaag cagcagaatc
541 acaaaattca gagtagcatc accttcttat gtgggaaaac cttgggaact cccgagtttg
601 taactgcaac agattgtgtg cattacttcg agtggaggac tactgcagcc tgcaaaaaga
661 atatatttaa agcgaataaa gaggtgccct gttacgcttt cgacagagag ctcaagaagc
721 acgatttaaa cccactgatc aagaccagcg gtgcttactt ggtggacgac tctgacccgg
781 acacatctct gttcatcaat gtctgcaggg acatagaggt gctccgggcc tcgagtccac
841 aagtgcgcgt gtgtcccacc ggcgcggccg cctgcctggt gcgaggggac cgcgcgttcg
901 acgtgggccg gccccaggag gggctgaagc tcgtgagcaa tgacaggctc gtcctgagtt
961 acgtgaagga aggggccggc cagcccgact tctgtgacgg ccacagcccg gcggtgacca
1021 tcacgttcgt gtgcccgtcg gagcgcagag agggcaccat tcccaagctc acagcgaaat
1081 ccaactgccg ctttgagatc gagtgggtca ccgagtacgc ctgccacagg gattacctgg
1141 aaagccggag ctgctccctg agcagcgcgc agcatgacgt ggccgtcgac ctccagccgt
1201 tgagccgggt ggaagcctca gactccttgt tctacacctc ggaggcggac gagtatacat
1261 attatttgag catctgcgga ggaagccaag cgcccatctg taataagaaa gatgctgcag
1321 tgtgccaagt gaaaaaggca gattccactc aagtcaaagt ggccgggaga ccccagaacc
1381 tgaccctccg gtactcggat ggagacctca ccttgatcta tttcgggggt gaagagtgca
1441 gctccggctt ccagcggatg agtgtcatca acttcgagtg caatcagaca gcaggtaaca
1501 atggcagagg ggctcctgtg ttcaccgggg aggtggactg cacctacttc ttcacgtggg
1561 atacgaagta cgcctgtgtc cacgagaagg aggccctgct gtgcggcgtc tccgacggga
1621 aacagcgctt cgacctgtcg gcgctggccc ggcactcaga actggaacaa aattgggaag
1681 ctgtggatgg cagtcagagg gaagcagaaa agaagcattt cttcattaac atctgccaca
1741 gggtcctgca gacgggccag gcacggggct gccccgaaga cgcggccgtg tgtgccgtgg
1801 ataagaatgg aagtaaaaat ctgggcagat ttatttcttc tcccaccaga gagaaaggaa
1861 atattcagct ctcttactca gatggtgatg agtgcggtgg tggccagaag ataataacaa
1921 atataacact catgtgcaaa ccaggtgatt tagaaagtgc cccggtgctg acaacctcca
1981 gggctgacgg ctgcttctac gagtttgagt ggcgcacggc tgcagcctgc gtgctctcca
2041 ggaccgaggg ggacaactgc actgtctttg actcccaggc agggttttct ttcgacttga
2101 cgcctctcac gaagaaggac gcctacaagg tcgagacgga caagtacgag ttccacatca
2161 acgtgtgcgg cccggtgtcc gtgggcgcct gcccgccgga ctcgggggcc tgtcaggtgt
2221 ccaggagtga taggaagtct tggaacttgg gacgcagcaa tgctaagctt tcgtattacg
2281 acgggatgat ccagctgacc tacagggacg gcacacccta caataacgag aagcgcacgc
2341 cgagagccac gctcatcacc ttcctctgtg accgagacgc cggagtgggt ttccccgaat
2401 atcaggagga agataactct acatacaact tccggtggta caccagttac gcctgcccgg
2461 aggagccgct ggagtgcatc gtgactgacc ccgtcacact ggaccagtac gacctctcca
2521 ggctagcgaa atccgagggc ggtcctgggg gaaactggta ctctctggac aacggcgggg
2581 cacgcagcac gtggcggaag tactacatca acgtgtgtcg tcccctgaac ccggtgccgg
2641 gctgtgaccg ctacgcgtcc gcctgtcaga tgaagtacca gggcgagcag ggctcgtact
2701 ctgagaccgt ctccatcagc aacctggggg tggcgaagac gggccccatg gtggaggaca
2761 gcggcagcct gctcctggag tacgtcaacg gctccgcctg caccaccagc gaccagaggc
2821 gcaccaccta caccaccagg atccaccttg tctgctctac cggcagcctg tatacccatc
2881 ccatattttc tctcaactgg gagtgtgtgg tcagcttcct gtggaacacg gcggcagcct
2941 gtcctatccg aatcaccacg gacatagacc aggtctgctc catcaaggac cccaacagcg
3001 ggtacgtgtt tgatctgaac ccactgaaca attcccgagg atacgtggtt ttgggcatcg
3061 ggaagacgtt tctgttcaac gtgtgcggtg acatgcccgc ctgtggcacc ctggatggga
3121 agccagcttc cggctgcgag gcagaagtcc agatggacga catgaagacc ctgaagccgg
3181 gcaggctggt gggcctggag aagagcctgc agctgtccac cgagggcttt ataaccctga
3241 actacacggg gcttccttcc caccccaacg ggagggctga tgccttcatc atccgcttcg
3301 tctgcaatga tgacgtttac ccagggacac ccaagttcct gcaccaggac atcgactcta
3361 gcctggggat ccgggacact ttcttcgagt ttgaaaccgc gctggcctgt gtaccttctc
3421 cggtagattg ccaagtcaca gaccccgccg ggaacgagta tgatctgagt ggcctgagca
3481 aggccaggaa gccgtggact gcggttgaca cgttcgatga ggggaagaag aggaccttct
3541 acctgagcgt gtgcacgcct ctcccgtaca ttcccggctg ccacggcacc gctgtggggt
3601 gctgcctggt gacggaagac agcaagttga acctaggcgt cgtgcagatc agtcctcagg
3661 tgggcgccaa cgggtccctg agcctcgtct acgtcaacgg ggacaagtgc aagaaccagc
3721 gtttctccac caggataaac ctcgagtgtg cccacacaac gggctccccg acctttcagc
3781 tccagaacga ctgtgagtat gtgtttctct ggagaaccgt ggaagcctgt cccgtcgtgc
3841 gtgcggaagg agactactgc gaggtgagag acccaaggca cggcaacctg tataacctga
3901 tacctcttgg tctgaacgac actgtcgtgc gggccggcga atacacctat tacttccgcg
3961 tctgcggaga gctgacatcc ggcgtctgcc caaccagtga caagtccaag gtcatctcat
4021 catgccagga aaagcgggga ccccagggat ttcaaaaagt ggcaggtctg tttaatcaga
4081 agctgaccta cgagaatggg gtgctgaaga tgaactacac cgggggcgac acctgccaca
4141 aggtgtacca gcgttccacc accatctttt tctactgcga ccgcagcacg caggcgcccg
4201 tgtttctcca ggagacgtcc gattgctcct acctgtttga gtggcgcacg cagtacgcct
4261 gcccgcccta cgacctgacc gagtgttcgt tcaaaaacga ggctggggaa acctacgacc
4321 tctcgtctct gtcgaggtac agcgacaact gggaggctgt cacgggcaca gggtccaccg
4381 agcactacct catcaacgtg tgcaagtccc tgtccccgca ggctggctca gatccgtgcc
4441 ctccggaggc ggccgtgtgt ctgctgggcg gccccaagcc cgtgaacctg ggcagggtgc
4501 gggacagtcc tcagtggagc cagggcttga ccctcctgaa gtacgttgac ggtgacttgt
4561 gtccagacca gattcggaag aagtcaacca ccatccgctt cacgtgcagc gagagccacg
4621 tgaactccag gcccatgttc atcagcgctg tggaggactg cgaatacacc ttctcctggc
4681 ccacggccgc cgcctgcgcg gtcaagagca acgtgcacga cgactgccag gtcaccaacc
4741 ccgccacggg acacctgttt gacctgagct ctctgagcgg ccgggccggc ttcaccgccg
4801 cctacagcga gaaggggtta gtctacctca gcgtgtgcgg ggacaacgag aactgcgcca
4861 acggcgtggg ggcctgcttt gggcagacca ggatcagcgt gggcaaggcg agcaagaggc
4921 tgacctacgt ggaccaggtc ttgcaactgg tgtatgaggg cggctccccc tgcccctcca
4981 agaccggcct gagctacaag agcgtcatca gcttcgtgtg caggcctgag gtcgggccca
5041 ccaacaggcc catgctgatc tccctggaca agcgcacgtg cacgcttttc ttctcctggc
5101 acacacccct ggcctgcgag cagacgaccg agtgctccgt gaggaacggc agctccctca
5161 tcgacctgtc cccactcatc caccgcaccg ggggttacga agcatacgat gagagtgagg
5221 acgacggctc cgacaccagc cctgacttct acatcaacat ctgccagccg ctcaacccca
5281 tgcacgggtt ggcctgcccc gccggcacgg ccgtgtgcaa ggttcccgtg gacggccccc
5341 cgatagatat tggccgagtg gcaggacctc cgatcctcaa tcccatagcc aacgaagttt
5401 acttgaactt tgaaagcagc actccttgct tggcggaccg gcacttcaac tacacctcac
5461 tgatcacgtt ccactgtaag cggggcgtga gcatgggaac gccaaaactg ctgaggacca
5521 gtgtgtgtga cttcgtgttt gagtgggaga ctcctctggt ctgtcccgac gaagtgaaga
5581 cggacggctg ctccctcacg gacgaacagc tgtactacag cttcaacctg tccagcctct
5641 ccaagagcac cttcaaggtg acccgaggcc cgcacaccta cagtgtgggg gtgtgcaccg
5701 cagccgcagg cctggacgaa ggaggctgca aggacggtgc tgtctgcctg ctgtccggga
5761 gcaagggggc gtctttcggg cggctggcgt ccatgaagct ggactacagg catcaggacg
5821 aagctgtcat cctgagttac gccaacggag acacttgccc tccggaaact gaggacggcg
5881 agccgtgtgt gttccccttc gtgttcaacg ggaagagcta cgaggagtgt gttgtggaga
5941 gcagggccag gctctggtgc gcgaccaccg ccaactacga cagagaccac gagtggggct
6001 tctgcaagca ctccaccagc caccggacgt ccgtcatcat cttcaagtgt gacgaggacg
6061 ccgacgtggg gcggccccag gtcttcagcg aggtgcgcgg ctgtgaggtg accttcgagt
6121 ggaagacgaa ggtggtctgc cccccgaaga agatggagtg caagttcgtc cagaagcacc
6181 ggacctacga cctgcggctg ctctcgtccc tcaccggctc ctggtccttc gtccacaacg
6241 gagcctcgta ctacatcaac ctgtgtcaga aaatatacaa gggaccccag gactgctcgg
6301 agagagccag cgtgtgcaaa aagagcacct ctggcgaggt gcaggtcctc gggctcgttc
6361 acacacagaa gctggatgtt gtagatgaca gagtcatcgt aacttactct aaaggccact
6421 actgtgggga caataagaca gcgtctgctg tcatcgagct gacctgtgcc aagacagtgg
6481 ggcggccttc gttcacgagg ttcgacgtcg acagctgcac ctaccacttc agctgggact
6541 cacgagcggc ctgcgccgtg aagcctcagg aggtgcagat ggtgaatggg accatcacca
6601 acccggccaa cggccggagc ttcagcctcg gggatattta cttcaaacga ttcagcgcct
6661 ctggggacgt gagaaccaac ggggacaggt acatctacga gatccagctg tcgtccatca
6721 cgggctccag cagccccgcc tgctctgggg ccagcatctg ccagaggaag gccaacgacc
6781 agcacttcag tcgcaaagtc ggaacctcca accaaaccag atactacgtt caagatggcg
6841 acctggatgt ggtgttcacc tcgtcctcca agtgtggaaa agacaagaca aagtctgtgt
6901 cctccaccat cttcttccac tgtgaccccc tggtgaagga cgggatcccc gagttcagcc
6961 acgagactgc cgactgccag tacctcttct cctggcacac ctctgccgtg tgcccgctgg
7021 gggcgggctt cgacgaggag atcgcagggg atgacgccca ggagcacaaa gggctctcag
7081 agcgcagcca ggcggtcggg gcagtgctca gcctgctgct ggtggcactc accgcctgcc
7141 tgctcaccct gctgctgtac aagaaggagc gccgggagat ggtaatgagc aggcttacca
7201 actgctgccg ccggagcgcg aacgtgtcct acaagtactc gaaggtgaac aaggaggagg
7261 aggccgacga gaacgagacc gagtggctga tggaggagat ccagccgccg gcgccgcggc
7321 ccgggaagga gggccaggag aacgggcacg tggccgccaa gtcggtgaga gctgccgaca
7381 cgctgagcgc cctgcacggc gacgagcagg atagcgagga cgaggtcctg acgctgcccg
7441 aggtgaaggt gcgcccgcca ggccgggctc ctggtgccga aggtggcccc ccgctgcggc
7501 cactgcctcg gaaggcgcca ccgccgctgc gggcggacga ccgggtgggg ctggtgcgtg
7561 gggagccggc gcgccggggg cggccccggg cggcggccac gcccatcagc accttccacg
7621 acgacagcga cgaggacctc ctgcacgtct aggctcgccc gcgccggctc cgaccaaatc
7681 cgatgggact ccgtgatgct tctgtccttg gcctttaacg aaaactgtcc aaaaaaggga
7741 agagtgttgg tggtggggga ggaggggggc gcccctctcc gtgggcacgg ggaggaggcg
7801 cggccgctga tggccgcgcc agccccgggt ctgtccccag ccctcacctt gagcgtggcc
7861 gccgtgcgct tcctaaaggc gccggggcca gacgcgtctt gggactgagg gct
//
LOCUS HUMHBBC 1552 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human hemoglobin-related sequence across the breakpoint for Indian
delta-beta thalassemia.
ACCESSION M37467
KEYWORDS hemoglobin.
SOURCE Human (case V.1) DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1552)
AUTHORS Mishima,N. and Gilman,J.G.
TITLE The 3' breakpoint of the Indian delta-beta thalassemia deletion
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.G.Gilman, 01-AUG-1990.
Author address: J.G.Gilman,
Montefiore Medical Sickle Cell Center,
111 East 210th Street,
The Bronx, NY 10467
This sequence of 1552 bases connects the sequence of <HUMHBB> at
position 73309, with the sequence beginning at position 15 under
accession number M33706 (position 1464 of this sequence). In
<HUMHBB> coordinates, this sequence begins at 73309 and ends at
74860.
FEATURES from to/span description
BASE COUNT 447 a 271 c 299 g 535 t
ORIGIN
1 ttttctcagt cagttaacat tccttcaact agattagttg tgacaaaatt tcaggccttg
61 ctcaactcta acctcagtct ctccatatat ttctgagagt cttccttctt atttgtttat
121 aagaatttag taaatggcaa tattgagaag tagaaaattt tggagacttg tttttcatga
181 accagaacaa tctcaaagca gcaatactaa acaggagcca agcctagaat aaagttgaat
241 ttctggaatc aacaagctag gggaaagtct cccattaagt caaatatttg acaactggga
301 ggtcaagtca gaggtgaaaa atgattaaag ataacaagag gtctgaaaac aaagtgacca
361 attaaaataa aaagaaagag ccaagctggg cacagtggct tgcatctgta gtcccagtaa
421 cccaggaggc tgaagtgata ggatcacttg agcccaggat ttcagggttg cagtgagcaa
481 taatgacacc actgctctct agccttggtg actgagatcc tgtctctgag gaaaaacaaa
541 caaacaagca aaaaacaaac aaacaaacac acagatatta agaacataga agttttgtta
601 cggtgatgat attctccaac acaggtctga accccttgca gtgcctcggg gctgtttctc
661 cagtggattc ttgggttctt cacatcttct caactttgtg aaaagatatt catccaaagg
721 tctcacctgt cttattcaga gcttggattt tggactagct ttttctattt cttgggattc
781 tctagagcca tatattcaga tataacacag atataaccaa aagaagcaaa ggaggcatca
841 cagaaggggc tttgtattct ggggaggggg acatagagtt gagagaaata caggaattta
901 taggactcta ctcaaagagc cctcaaataa agtacggcta gtcttctttg cctaagacaa
961 tgagcagtcc aggaagttat ccatttcttt tagattttct agtttatgta catagagatg
1021 ttcatagtag tttttgatga ttatttgtat ttctgtgggg tgagtgataa tattcccttt
1081 tcatttctaa ctgtgcttat ttggatcttc tctcttgtct tctttattag tctagctagt
1141 gatctacctt attaattttt tcacaaaacc agctcttgga ctcattaatc ttttgaatgg
1201 tattttgtgt tttaatatcc ttcagttagc tttgatcttg tttatttctt gtcttctgct
1261 agatttaggg ttggtttgct cttggttctc tggttctttt agttgtgaca ttaggttgtt
1321 aatttgaggg ctttaagact ttttgatgtg ggcatttagt gtataaattt ctctcttaac
1381 actgtctaag ctgtgtccca gagattccgg tatattgtat ctttgttctc attagtttca
1441 aagaactatt ttatttttgc cttagttttc attatttacc caaaattcat tcaggagcag
1501 gttgtttaat ttccatgtaa ttgcatagtt ttcagctatt cattagtctt ga
//
LOCUS HUMHBBTHMU 2337 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human hemoglobin DNA with a deletion causing Indian delta-beta
thalassemia.
ACCESSION M33706
KEYWORDS hemoglobin.
SOURCE Human (patient B.T.) leukocyte DNA.
REFERENCE 1 (bases 1 to 2337)
AUTHORS Mishima,N. and Gilman,J.G.
TITLE The 3' breakpoint of the Indian delta-beta thalassemia deletion
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.G.Gilman, 01-AUG-1990.
Author address: J.G.Gilman,
Montefiore Medical Sickle Cell Center,
111 East 210th Street,
The Bronx, NY 10467
FEATURES from to/span description
recomb 15 17 overlap of 5' end 3' breakpoint DNA
BASE COUNT 942 a 462 c 401 g 529 t 3 others
ORIGIN
1 aagcttgggt tttgagtttt cattatttac ccaaaattca ttcaggagca ggttgtttaa
61 tttccatgta attgcatagt tttcagctat tcattagtct tgaaatctat ttttattcct
121 ctgtggtctg agagtgattg atatgatttt tttgcatttg ctgaggattg ttttaggtcc
181 tcccaaggct gaaccaggaa gaaatcaatt tcctgaagag atccatgatg agctctgaaa
241 atgattcagt aataaatagc ctatcaaaca aaaaaagccc tggacaatat gattcacaac
301 caaattctat cagatgcaca aagaagagtt ggaaaaattc ctactgaagc tactccaaaa
361 aattgagggg gagggattca acctcagctc attttatgag gccagcatca tcctgatacc
421 aaaacctgat agacacacaa caacaaaaaa agaaaacttc aggccaatat ccttgaacat
481 tgatgcaaaa atactagcaa gccaaatcca gcaacacatc acaaagctaa tccacctcaa
541 tcaagtaggc tttatccctg agaagcaaga ttggttcagc atatacaaat tgataaatgt
601 gattcatcat atatacagca ctaaaaacag aaaccacatg atcatttcaa tatgtgcaaa
661 aaaagatttt gataaaattc aacttcactt catattaaaa cctctcaaca aactagctat
721 ttaaggacat gacctcaaaa taatatgagc catctatgac aaacccacag ccaacatcat
781 acagaatggg caaaacctgg aagcattctt cttgaaaact ggcacaagac aaggatgccc
841 tctctcacca ctgctattca gtatagtact ggcagtcctg gccagagcaa tcaggcaaga
901 gaagggaata aagggcatcc aaataggaag agagaaagtc aaagtatccc tgttttctgt
961 atctagaaaa caccatagtc aaggcccaaa agctccttca gctgataact tcagcaaagt
1021 ttcaggatat aaaatcaatg tacaaaaatc actagcattc ccatacatca acaatagcca
1081 agcctagagc caaatcagga agtcaatccc atttaccgtt gccacaaaaa gaataaaata
1141 cctaaggata tatctaacca aggagttgaa agatctctac aatgagattt acaaaacact
1201 gctcaaagaa atcagagctg atagaaacaa atggaaaaac tttccatgct catggataaa
1261 aagaatcaat attgttaaaa tgtccatact gcctagagca atttacagtt tcaaagctat
1321 tcctatcaaa ctaccaatga cattcttcac agaactagaa aaatacactt taaagttcat
1381 atggaacaac aacaaaaaaa gtccaaatag cctaggaaat cctaatactc acaggtggga
1441 attgaaaaat gagaacactt ggacacaggg tggggaacat cacacaccgg ggcctgtcgt
1501 ggggtggtag gagagnggag ggataacatt aggagatata tctaatgtaa atgacgagtt
1561 aacgggtgca gcacaccaat atggcacacg tatacatatg taacaaactg cacaggtacc
1621 gtagaactta gagtacaaaa aaaaaaanga gaaaaagaaa aaagctggag ataacacatt
1681 gcctaacttc aaaccatacc acagggctac agtaaccaaa acagtntggt actgttacag
1741 aaaagacaca cagacaaatg gaacagaata gagagcccag aaataagacc acatgcctac
1801 aaccatctga tctttgacaa agctgataaa aacaagcaat gaggaaaaga caccctattc
1861 aacaagtcgt gttgggataa ctgactaacc gtatgtagaa gattgaaacc agaccccgtt
1921 ctatatcata tacacaaata aactcaagat ggattagaga cttaaatgca aagcccaaac
1981 tataaaaatg cttgaaagca acctagacaa taccattccg gacataagca ggggcaaaga
2041 tttcatgaca aagatgccaa gagtcattgc agcaaaagca aaaattgaca aataggatct
2101 aattaaaata aagtttctgc acagcaaaag aaactatcaa ctgagtaaac aacctataga
2161 atgggagaaa atttttgcaa actatgcatc caacaaaggc ctaatattca gaatctataa
2221 ggaacttaaa tgaatctaca agaacaaaac aaacaatgcc acaaaaaatg ggtaaaggac
2281 atgaacagac agtattcaaa agaagacata catgtggcca ccaagcatat gaaaaaa
//
LOCUS BSUGLPKD 4259 bp ds-DNA BCT 08-AUG-1990
DEFINITION B.subtilis glycerol kinase (glpK) and glycerol-3-phosphate
dehydrogenase (glpD) genes, complete cds.
ACCESSION M34393
KEYWORDS glycerol kinase; glycerol-3- phosphate dehydrogenase.
SOURCE B.subtilis (starin W168) DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 4259)
AUTHORS Holmberg,C., Beijer,L., Rutberg,B. and Rutberg,L.
TITLE Glycerol catabolism in Bacillus subtilis: Nucleotide sequence of
the genes encoding glycerol kinase (glpK) and glycerol-3- phosphate
dehydrogenase (glpD)
JOURNAL Unpublished (1990) University of Lund, Dept. of Microbiology,
Solvegatan 21, S-223 62 Lund , SWEDEN
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.M.E.Holmberg, 22-MAY-1990.
FEATURES from to/span description
pept < 1 679 ORF 1 (AA at 2)
pept 698 2188 glycerol kinase (glpK) (EC 2.7.1.30)
pept 2329 3996 glycerol-3-phosphate dehydrogenase (glpD) (EC
1.1.99.5)
pept 4128 > 4259 ORF 4
binding 682 690 ribosome binding site
binding 2315 2321 ribosome binding site
binding 4116 4121 ribosome binding site
signal 2243 2303 transcription termination signal
signal 4000 4033 transcription termination signal
signal 4067 4072 -35 region
signal 4090 4095 -10 region
BASE COUNT 1269 a 853 c 1092 g 1045 t
ORIGIN
1 ccatggcgga tacgcggttg gcggcatcag cggtgcccat ttgaatccgg cgctaacgat
61 agcgcttgca tttgtaggag attttccttg gaaagaggtt ccggtttata ttgcggcgca
121 aatgatcgga gcgatcatcg gtgcggtgat tatttatctg cattacctcc cgcactggaa
181 gtcaacggat gatcccgctg ccaagctggg tgttttctca acagggccca gcattccgca
241 tacatttgca aacgttttaa gcgaagtgat tgggacattt gtccttgtac ttggaatctt
301 ggccataggt gcaaatcaat ttacagaagg acttaatcct ttaatcgtcg gtttcctcat
361 tgtagcaatc ggtatttctc tgggaggcac caccggctat gctatcaatc ctgcacgtga
421 cttaggtccg cggatcgccc acgcttttct tccgattccg gggaagggct catcaaactg
481 gaaatacgca tgggttccgg tagtcggccc gattttaggc ggatcattcg gcggtgtatt
541 ttacaacgct gcatttaaag ggcacatcac aagcagcttc tggattgtaa gcgttatatt
601 ggttgtggta ttgttaggac tctatgttta tacgaaatca cattctgcta aaacattatc
661 aaattctaaa tatatttaat caaaggggag acatcttatg gaaacgtaca ttttatcctt
721 agatcagggg acgacaagtt caagagcgat tctgtttaat aaagaaggca aaattgtcca
781 ctctgctcaa aaggaattta cacaatactt cccgcatcct ggctgggttg agcataatgc
841 caatgaaatt tggggctctg tcctcgcggt tatcgcctca gtcatctctg aatcaggaat
901 cagcgcttct caaattgccg gcatcggcat cacgaaccag cgcgagacga cggttgtgtg
961 ggataaagat acaggaagtc ctgtctataa tgcaatcgtt tggcagtcca gacagacgtc
1021 cggcatttgt gaggaacttc gtgaaaaagg atataatgat aaattcagag aaaaaacagg
1081 gcttttaatc gatccttact tctccggcac gaaggtgaag tggattttag acaatgtgga
1141 aggcgcaaga gaaaaagcgg aaaaaggcga gctgctgttt ggaacgattg atacgtggct
1201 catttggaaa atgtcaggcg gaaaagcgca tgtgaccgat tactccaatg cctcaagaac
1261 actgatgttt aatatttacg atttaaaatg ggacgatcaa ctgctcgaca ttctaggcgt
1321 accgaaatcc atgctccctg aagtgaagcc gtcctctcat gtgtatgcgg agactgttga
1381 ttatcacttc ttcggaaaaa atatcccgat tgctggagcg gcaggcgacc agcagtccgc
1441 attgttcggc caggcatgct ttgaagaagg catggggaaa aacacttacg gcacaggatg
1501 tttcatgctg atgaataccg gggaaaaagc aattaagtcc gaacatgggc ttttgacaac
1561 aatcgcttgg ggcattgacg gaaaagtgaa ctatgcgtta gaagggagca tttttgtcgc
1621 aggctctgcc atccagtggc ttagagacgg tttgagaatg ttccaggatt catcgctaag
1681 cgaatcttat gcagaaaaag tggattcaac tgacggcgtg tatgttgttc cagcatttgt
1741 cggactggga acgccttact gggacagcga tgtgcgcggt tcggttttcg gcctgacaag
1801 agggacaaca aaagagcact ttatccgtgc gacactggag tcattggctt atcagaccaa
1861 agatgtgctt gacgcaatgg aagcagattc aaacatttca ttaaagacgc tccgtgtaga
1921 cggaggagct gtaaaaaaca atttcctaat gcagttccaa ggagacctgt tgaatgttcc
1981 tgtggagcgc ccggaaatta atgaaacgac tgcacttggc gcggcttatt tggcgggtat
2041 cgctgtggga ttctggaagg accgttctga aatcgcgaac cagtggaatc tggataaacg
2101 gtttgagcct gaattggaag aagaaaaacg aaatgagctg tataaaggct ggcaaaaagc
2161 cgtgaaagca gctatggctt ttaaataaag taatactatg gtataatggt tacaagttaa
2221 taagaacggt cctgagatga ggagagacca cagcaccaaa gtgtaagcat gcactttggc
2281 tgttgtggtc tctttttcta tttaccgtga caacaaggag gaaacgtaat gatgaatcat
2341 caattttcaa gtcttgaaag agatcgcatg ctgacagaca tgacgaaaaa aacatatgac
2401 ctatttatta tcggaggagg aattacagga gccggaacag ctcttgacgc ggcatcaagg
2461 ggaatgaagg tcgcactaag cgaaatgcag gactttgcgg cgggaacatc aagccgatca
2521 acgaagctgg tacatggcgg cttgcgctat ttaaaacaat ttgaagtgaa aatggtcgct
2581 gaggtaggaa aagagcgggc gattgtgtat gaaaacggcc cgcatgtcac tacaccggaa
2641 tggatgctgc ttccgtttca taaaggcggc acattcggtt catttacaac atctattggg
2701 ttaagggttt atgacttcct tgcaggtgtg aaaaagtcag aacgaagaag catgctttca
2761 gcaaaagaaa cgctgcaaaa agagcctttg gtgaaaaaag acggcttaaa aggcggcggg
2821 tactatgtgg aataccgcac tgacgatgcg agactgacca tcgaagtcat gaaggaagcg
2881 gttaaattcg gggcagagcc tgtgaattac tccaaagtga aggagcttct ttacgaaaaa
2941 ggcaaagccg tcggcgtatt aattgaagat gtgctgacaa agaaagaata taaagtgtat
3001 gcgaaaaaaa ttgtcaatgc tacaggccct tgggtcgatc agctcagaga aaaagaccat
3061 tcgaaaaacg gaaagcattt gcagcataca aaaggcattc accttgtatt tgaccagtct
3121 gtctttccgc tgaaacaggc tgtatatttc gatacacctg atggccggat ggtatttgcg
3181 attcctcgtg aaggcaaaac atacgtggga acaacagaca ctgtttacaa agaggcgctg
3241 gagcatccgc ggatgacaac ggaggatcgt gattatgtca tcaaatcaat caattacatg
3301 ttcccggaac tgaatatcac tgcgaatgac atcgaatcca gctgggcggg gctgcgtccg
3361 ctgattcatg aagaaggcaa agatccttct gaaatttcac ggaaagacga gatttggaca
3421 tctgactcag gcctgatcac cattgccggc ggaaaactga ctggatacag aaaaatggcg
3481 gacgacatcg ttgatcttgt ccgtgatcgc ttaaaagaag agggcgaaaa ggattttgga
3541 ccatgtaaaa cgaaaaacat gccaatctct ggcgggcacg tcggcggttc gaaaaatctt
3601 atgtccttcg ttaccgcgaa aacaaaagaa ggaattgcag ccggtttatc agaaaaagac
3661 gcaaaacagc ttgcgatcag atacggctct aacgtagatc gcgtctttga ccgggtagaa
3721 gcgctgaaag atgaggccgc gaaacgcaac attccggttc atattcttgc tgaggcagaa
3781 tacagtatag aagaagagat gactgcaacc cctgctgact tctttgtccg cagaacggga
3841 cgtttatttt ttgatatcaa ttgggtaaga acatataaag atgccgttat tgattttatg
3901 agcgagcgat tccaatggga tgagcaggcg aaaaacaaac atacagaaaa cctcaacaag
3961 cttttacacg atgcggtcgt accgcttgag caataaatca taacgggctg tctgcagccc
4021 gttatttctt tttacgtgcc gaaaggggga gatctcaggt tatcaagtga cagaggcacc
4081 cgcttcatat aaaataatgg cataagctga tacataggag gacgaatatg acttggagaa
4141 agagctatga acgctggaaa cagacagaac atttagatct ggaattaaaa gagcgcctta
4201 ttgaattaga gggagatgaa caggcccttg aggactgttt ctataaagac cttgaattc
//
LOCUS CLCMTPLAS 379 bp ds-DNA PLN 08-AUG-1990
DEFINITION C.purpurea mitochondrial plasmid terminal 5' inverted repeat (TIR).
ACCESSION M35177
KEYWORDS .
SOURCE C.purpurea (strain K) mitochondrial plasmid pClK1 DNA.
ORGANISM Claviceps purpurea
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Clavicipitales; Clavicipitaceae.
REFERENCE 1 (bases 1 to 379)
AUTHORS Duevell,A., Hessberg-Stutzke,H., Oeser,B.,
Rogmann-Backwinkel,P. and Tudzynski,P.
TITLE Structural and functional analysis of mitochondrial plasmids in
Claviceps purpurea
JOURNAL Mol. Gen. Genet. 214, 128-134 (1988)
STANDARD simple staff_review
BASE COUNT 97 a 76 c 64 g 142 t
ORIGIN
1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt
61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact
121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa
181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac
241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta
301 aactttccgc taaaacccgc acgtcttcgt ctgcatggta tgggcgttag attctattct
361 acaatgtctt ccgccaaaa
//
LOCUS CLCMTPLASX 379 bp ds-DNA PLN 08-AUG-1990
DEFINITION C.purpurea mitochondrial plasmid 3' terminal inverted repeat (TIR).
ACCESSION M35178
KEYWORDS .
SOURCE C.purpurea (strain K) mitochondrial plasmid pClK1 DNA.
ORGANISM Claviceps purpurea
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Clavicipitales; Clavicipitaceae.
REFERENCE 1 (bases 1 to 379)
AUTHORS Duevell,A., Hessberg-Stutzke,H., Oeser,B.,
Rogmann-Backwinkel,P. and Tudzynski,P.
TITLE Structural and functional analysis of mitochondrial plasmids in
Claviceps purpurea
JOURNAL Mol. Gen. Genet. 214, 128-134 (1988)
STANDARD simple staff_review
BASE COUNT 105 a 75 c 62 g 137 t
ORIGIN
1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt
61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact
121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa
181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac
241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta
301 aactttccgc taaaacccgc acgtcttact ttagctcaac aaagaaagcg tagtgaactt
361 aaaaaagcac gtcttgctc
//
LOCUS DDISP60 328 bp ds-DNA INV 08-AUG-1990
DEFINITION D.discoideum spore coat 60 (sp60) gene, 5' flank.
ACCESSION M34546
KEYWORDS spore coat 60.
SOURCE D.discoideum (strain Ax4) DNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 328)
AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F.
TITLE Coordinate regulation of the spore coat genes in Dictyosteiium
discoideum
JOURNAL Dev. Genet. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly
submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT 168 a 15 c 11 g 134 t
ORIGIN
1 aaaacataga acaaatttaa ataattattt tttattaaat tgtattattt ttttttatta
61 tttatttatt ttttattttc aaaaaaaaaa aaaaaaagtt aacagacaaa aagtataatc
121 tatttaatta ttcaaaaaaa aaaatattaa atcattgtag tattttgttc atattcaaat
181 taaatatcaa tacaattaaa aaaagttaaa atgtaattat taatatatat atatatttat
241 caatatataa tttaatataa aaaaaaaaaa gaaatttaat aaacataaat atttataaaa
301 tcataaattt aaaataaata attttttt
//
LOCUS DDISP70 563 bp ds-DNA INV 08-AUG-1990
DEFINITION D.discoideum spore coat 70 (sp70) gene, 5' flank.
ACCESSION M34547
KEYWORDS spore coat 70.
SOURCE D.discoideum (strain Ax4) DNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 563)
AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F.
TITLE Coordinate regulation of the spore coat genes in Dictyosteiium
discoideum
JOURNAL Dev. Genet. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT 207 a 73 c 47 g 236 t
ORIGIN
1 gaagtgtgag ttgcgcagtg tgttagtgtg ggtttaatga gagtgttttt tttttttttt
61 tttttttttt tgaacgtaca tatttatttg tttttattaa tatttcataa atgaatttgt
121 aaatataaaa ataaattgtt tttttttatt ttattttttt gattatgaca ataatttcaa
181 taaaatgtaa aatatgataa aaataatttt aattattgaa acacacacac tcactcactt
241 tttcccacta gtattacact ttccaatgaa aaataaatta atattgaaaa agaaggaaat
301 ggaaaaaaat gaaaaataaa aataattaaa taaacaataa tattacaaac acactatttc
361 tttctcaccc acacaaccaa tatactaaca cttctcatac acccaccaac acacctacac
421 ccagttcttc acattcatac acaaagttta atattacact ttaaaaaaca ttcatattaa
481 tgatttttat gatttattta tttattaatt tatttaattg tgtattttga ttattgctct
541 ctacttttta tatatttttt ttc
//
LOCUS DDISP96 682 bp ds-DNA INV 08-AUG-1990
DEFINITION D.discoideum spore coat 96 (sp96) gene, 5' flank.
ACCESSION M34548
KEYWORDS spore coat 96.
SOURCE D.discoideum (strain Ax4) DNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 682)
AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F.
TITLE Coordinate regulation of the spore coat genes in Dictyosteiium
discoideum
JOURNAL Dev. Genet. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT 321 a 37 c 63 g 261 t
ORIGIN
1 tgaaaaaaaa aaaattttgt gtgtgatagt ttgtcaaaaa tcatttgaaa ttaaaatcat
61 attataatat atattaaaaa aatgttgtat ctttaatttt ttttcatact agtgtgttaa
121 atgaattaaa gtgtggtaat gaataaatga aaaaaaaaaa aaaattaaag tgagagagtg
181 tgtaaaagtg gtaatgaatg tgtgtttgag tgtgttaaaa atgtgtttac aaaatataat
241 aatttaaatt cattcatatt caactatgga gaaagcgtgt tttcaaacac aataaatgtg
301 taatgaaaaa tacaattaaa taattattta attttttgta tgattaaaaa aaaaagagta
361 aattagtagt gccaccctaa aacaataata ataataatta aataatatta atatttattt
421 aattttttta tttaattttt tttcaaaaca attcataata taaatttatt cttttttttt
481 tttttttttt tttttaaaaa tattcaactc agtaataata ataataaaaa taataacaat
541 aataaggaat tataatcaaa taataatata gctataatct ataattacta tagtaaaatt
601 acgtatacat atataaataa ttattattat cattgttttt aaaaataaat aaataaataa
661 taaaaaaaaa aaaaaaaaaa aa
//
LOCUS GLAA2GIA 1870 bp ds-DNA INV 08-AUG-1990
DEFINITION Giardia lamblia alpha-2 giardin gene, complete cds.
ACCESSION M34550
KEYWORDS alpha-2 giardin.
SOURCE G.lamblia (strain Portland-1) trophozoite DNA, clone pG2.
ORGANISM Giardia lamblia
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE 1 (bases 1 to 1213)
AUTHORS Alonso,R.A. and Peattie,D.A.
TITLE Genomic organization of alpha giardin genes in Giardia lamblia
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1214 to 1870)
AUTHORS Alonso,R.A. and Peattie,D.A.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.A.Alonso, 25-MAY-1990.
Author address: R.A.Alonso
Dept of Tropical Public Health
Harvard School of Public Health
665 Huntington Avenue, Boston, MA 02115
FEATURES from to/span description
pept 215 1105 alpha-2 giardin
BASE COUNT 466 a 468 c 507 g 429 t
ORIGIN
1 atactgcaga tacacctcat ggtgtacatg ctggcggcaa gtatcgtata aactctatgg
61 agacatgggg cagaccttgt ctcgctgctt tgacgagcga atggccccta ttttgcgcac
121 gctatggcgc gtcacagatg cataggataa gcgaactcat gatggaaatt caaattacct
181 taaaatatat tttctgagcg cttacattta gaaaatgccg aagctatccc agatcgtcgc
241 tgacatgaag caggccatcg atgccaagga cgaggcccag atcgcgttca tagcgtcgga
301 atactctgca gatgcgcgtc agaggattgc gcaggggtac agggaccaat acggcaagga
361 gctccccgac gacatcaaga aggcgctcaa gggcggctcc gaggagagcc tcctgatgga
421 tctcttcagc gatcgtcacg aggtccgcgc ccagcacatc cgcgacgcgc tttccggcaa
481 gaacgaccac atggccttct tcgacacagt cattctgtgc acccccgaag actggcacga
541 gaccgtcgcc gcctacacga ggatgttcaa gaagcccctc gtcgaggact tcatgaagga
601 tgtcggccgc aaggagaatt ggtgtctttt catggagaag tggatggctc acgagcgcac
661 tagccgtgaa ggcagccctg acgaggaagc agagaaactg aacaaggcgt tcagtgagag
721 cgaccacgac tacatttcca gcttcatggc tggggtgcct cccgaagaat acaagtccat
781 caacaccagc ttcaagagcc ttacaggaaa gggcatcgac caggccttcg ccaccatcta
841 tacggggact gactactact ccctgtactg cgcccacttt gctcttcttg gcatgcacaa
901 gcttgccgcc tacctggtca actgcgcctg caacgataag ggcgacgaga agcgcatgag
961 gaggatcacg ggcatgatgg tcgacaagtg cttggctgca aagtatgcct acaagaccta
1021 cgggagtatg aaagccgacg tcgaaaggtg ctttgataag cgcatggctc cgatcctctg
1081 taccctctgg cgcctaaggg agtgagcgcc tttactgcgg ggtttccttt cgctagtgaa
1141 ttgcctagcg ggtgtttact gccttctgtt tctttttgcc ctgtcctggc ggtccaagat
1201 gctatcctgc gatgagaggt ctttcgagaa cggatgcgag aagtctggga ttgctacatg
1261 ccgtccacct gactgtgagg gtgagtagga ctggagtagc ggcaaggagt gggagctcat
1321 gttgtccata gagctgtatg cctgatcaat ctgcccattt cctgcttgtc ctctgcgggt
1381 tttttgtttc agatctgagc tgcctaaccg agaggggccg tactctatgc ttagttctct
1441 attaaccaac tgaacagcct ttgctcgctt gcttgccagt gtatatttat atctatattt
1501 caagaacaga gagaggaccg tgatgaagag gagcaggatg agtgtcgcaa cggaccagaa
1561 cacgacaata agggctatag acatcttaga atgataacca acaaagctta tgaactcctc
1621 tccactgaaa gaagaggaaa gataccgagt ctcgtaaaac tgaatggtag tcctatcgat
1681 ggtgtgcccc cagatagggt ctactatcgg ctggttggta agattgcttg cagcggtatc
1741 gacagctgcc acaaccacta agtagtcaag gctcgtgctt gtacgaagga aggtcaacgg
1801 gaatgcgaag catggaccag gtgttatttg cttgctcgat aaactcagga gtatatattt
1861 cagtattgtc
//
LOCUS KAEHUTC 1625 bp ds-DNA BCT 08-AUG-1990
DEFINITION K.aerogenes histidine utilization repressor C (hutC) gene,
complete cds.
ACCESSION M34604
KEYWORDS histidine utilization repressor C.
SOURCE K.aerogenes (strain W-70) DNA.
ORGANISM Klebsiella aerogenes
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1625)
AUTHORS Schwacha,A. and Bender,R.A.
TITLE Nucleotide sequence of the gene encoding the repressor for
histidine utilization genes of Klebsiella aerogenes
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.A.Bender, 29-MAY-1990.
FEATURES from to/span description
pept < 1 669 histidine utilization repressor G (AA at 1)
pept 731 1456 histidine utilization repressor C (hutC)
pept 1599 > 1625 histidine utilization repressor U
signal 667 686 hutC promoter
signal 1522 1493 Pc promoter
signal 1531 1557 Puh promoter
binding 1475 1495 CRP binding site
site 34 61 helix-turn-helix
BASE COUNT 311 a 476 c 508 g 330 t
ORIGIN
1 gtcgacggcg agcagttaga agccgcgcat caggcgctgc gcgaggcggt ggccgactgt
61 cagcgggcgg gtaaacgcac gctggtcctg ggcggcggcc acgagacggc cttcggccac
121 ggcgctgggg tactggatgc cttcccaggc gaaaaagtgg gcattattaa tctcgatgcg
181 catctggatc tgcgctttgc cgactgcgcc agctccggga cgccgttccg ccagttggcg
241 ctggagtgtg acgcgcagca gcgcggtttt cactatacct gcatcggggt tagccgggcg
301 gcgaacaccc aggcgctatg ggatgaagcg gcgcgccgcc aggtggctat cgttgaagat
361 ctggaggtgc tgacagcctt cgaaactcgc gtgttgccgg agcttgagcg caatatcgcg
421 caattcgacc gcctgtatct gactatcgat ctcgacgtgc tgccggcgcg agaaatgccg
481 gcggtgtcgg ccccggcggc gctgggcgtg ccgctgggca cgctgctgcg catcgttgag
541 ccgctgtgcc gcagcggtaa gctgcaggcg gtggatctgg tggagtttaa cccgctgttt
601 gacattgacg gtcagggcgc tcgcgcggcg gcccgtgtgg catggcaaat cgcccattgg
661 tggcgctagc gatccgcgct attattttcg tttcgccgct ccgcttatta cgcataagga
721 aagccaggcc atgtttgcac aacaacctcg ttccgcgccc gcgcctttct atgaaaaggt
781 gaagcaggcg atcagcgaaa agatccacag cggcgtctgg cgaccgcatg accgtatccc
841 gtcggaagct gagctggtgg cccagttcgg cttcagccgg atgaccatca atcgcgcgct
901 gcgcgagctg acggacgaag gcctgctggt caggttgcaa ggggtgggaa cctttgtcgc
961 cgagccgaaa gggcagtcgg cgctgttcga ggtgcgcagt attgccgcag aaattgtcgc
1021 ccgtcaccat cagcaccgct gtgaggtact cctgcttgag gagacccggg cggatcatat
1081 tcaggcgacg gcgctcagcg tcccggaagg cacccgcatc tttcactcgc tgatggtgca
1141 ttacgaaaac gaggtgccgg tgcaaatcga agatcgctgc gtcaatgccg cagtggtacc
1201 ggactatctg catcaggact acaccgccac cacgccccat gattacctgt cgctgatcgc
1261 gccgttaacc gaaggtgaac atattgttga agcggtgcag gccacggcgg aagagtgcgc
1321 gctgctgcat attcacgccc acgatccgtg cctgctgatc cgtcgtcgca cctggtcgac
1381 aacccacatc gtctcccacg cgcgcctgct gttccccggt agccgatatc gtctgcaggg
1441 gcgctttggc tcctgatcca cgctggcgtc agcaaaacgt gattgctgac gcaatataac
1501 aaaattgtat catttctgtt aaatcctggc ttgcgcatgc ttgtatagac aagtatatgt
1561 atctacgtaa acaacgttta ttgtcaggag aaaccccgat gtcgcaaagc aaatatcgcc
1621 agctg
//
LOCUS MUSMHH2KBM 66 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Mouse major histocompatibility complex antigen H-2Kbm29 mRNA.
ACCESSION M34542
KEYWORDS major histocompatibility complex antigen.
SOURCE Mouse ((C3H.CAS3(R4)XB6)F2, mutant #6414(bm29)), liver, cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 66)
AUTHORS Horton,R.M., Loveland,B.E., Parwani,A., Pease,L.R. and
Fischer Lindahl,K.
TITLE Immunological characterization and molecular analysis of the
H-2Kbm29 mutant
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Weiss,E., Golden,L., Zakut,R., Mellor,A., Fahrner,K., Kvist,R. and
Flavell,R.A.
TITLE The DNA sequence of the H-2Kb gene: Evidence for gene conversion as
a mechanism for the generation of polymorphism in
histocompatibility antigens
JOURNAL EMBO J. 2, 453-462 (1983)
STANDARD full automatic
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.M.Horton, 24-MAY-1990.
Mutation in codon 89 changes bp 43 and 44 to "gc", otherwise this
sequence is identical to reference 2.
FEATURES from to/span description
pept < 1 > 66 MHC H-2Kbm29 (AA at 3)
BASE COUNT 14 a 22 c 19 g 11 t
ORIGIN
1 cgagtggacc tgaggaccct gctcggctac tacaaccaga gcgcgggcgg ctctcacact
61 attcag
//
LOCUS YSCKGD2 2112 bp ds-DNA PLN 08-AUG-1990
DEFINITION S.cerevisiae dihydrolipoyl transsuccinylase (KGD2) gene,
complete cds.
ACCESSION M34531
KEYWORDS alphaketogluterate dehydrogenase complex;
dihydrolipoyl transsuccinylase.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2112)
AUTHORS Repetto,B. and Tzagoloff,A.
TITLE Structure and regulation of KGD2, the structural gene for yeast
dihydrolipoyl transsuccinylase
JOURNAL Mol. Cell. Biol. 10, 4221-4232 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Tzagoloff, 29-MAY-1990.
FEATURES from to/span description
pept 560 1987 dihydrolipoyl transsuccinylase
mRNA 306 > 1987 dihydrolipoyl transsuccinylase mRNA
mRNA 451 > 1987 dihydrolipoyl transsuccinylase mRNA (alt.)
signal 196 200 CAAT box
signal 228 232 CAAT box
BASE COUNT 649 a 435 c 425 g 603 t
ORIGIN
1 tcgaacgtct cgtggaaagc cttgaccgtt gcgtctttac tcgtctctat tcaaccctgt
61 ggtcctatta gaaccgatct ttgttccatc gtcattacta cttcttgtct cttctctctt
121 ggggaataat aaacaaaaca attccagcca cccgcatctc cgtcaaagac gatattaaca
181 accctttttt tgcagccaat caaaactaag gtcctgcccg gagtgagcca ataaagcggc
241 cacaaattac gcaggaatct atggttccta aagcttattc ttattctaca ttgacaccat
301 tctactacag ggctcggatc cacacttggt ctatcagcaa atgcaaatcc tttttccctc
361 aactaggcct acttttcata tataatctct ttttgagctt caggggagga aatatctaac
421 gtaaaaaaaa gaaaaatccg aaataaaaaa ttaggcagct atctttaacg gtatatatca
481 gtgaatagat atattgttat aaacttcact accacatttg ttacaaccaa agacacaact
541 tcagataatt atttaaacaa tgctttccag agcgacgcgt actgcagctg ccaaatcctt
601 agtaaaatct aaagtggcta gaaatgttat ggctgcttct ttcgtcaaga gacatgcttc
661 tacaagtttg ttcaaacaag ctaacaaggt cgaatcctta ggttcaatat atttatccgg
721 caagaaaatt tcagttgcgg cgaatccgtt ctccataact agcaatcgtt ttaaatctac
781 ctctattgaa gttcctccga tggcagagtc cctgactgaa ggctctttaa aggaatatac
841 taaaaacgtt ggtgatttta ttaaggagga cgagctgttg gccactattg agaccgataa
901 aattgatatt gaggtcaatt cgccagtatc aggtactgtt acgaagctaa atttcaaacc
961 agaggacact gtcactgttg gtgaggagtt agctcaggtc gagcctggtg aagcacctgc
1021 tgagggttct ggagaatcta agccagagcc taccgaacaa gcggagcatc gcaaggtgtc
1081 gccgcaaggg aaaactcaag tgaggaaacg gcttcaaaga aagaagctgc tccaaagaaa
1141 gaagccgctc caaagaaaga agttacagaa ccaaaaaagg actgatcaac caaagaagac
1201 cgtctctaag gcgcaggaac ccccagtagc ctctaactct ttcacaccat ttccacgtac
1261 agaaaccagg gtcaaaatga accgtatgag attgaggatt gccgaaagat taaaagagtc
1321 tcaaaacact gctgcttcct taaccacatt caacgaagtt gacatgtcag ctttgatgga
1381 aatgaggaaa ctgtataaag atgagattat taagaagacc ggtactaaat tcggattcat
1441 gggtcttttc tccaaagcat gtaccttggc cgccaaggat attccagccg tcaatggtgc
1501 cattgaaggt gaccagattg tttatcgtga ttacacagat atttctgttg ctgtggccac
1561 tccaaagggt ttggttaccc ccgtcgttcg taatgcagag tcattgagtg ttttagatat
1621 tgagaacgaa attgttcgct tgagtcataa agcgcgtgat ggcaaattaa ccctagaaga
1681 tatgacgggt ggtactttca ccatatctaa tggtggtgtt tttggttcat tatacggtac
1741 tcctatcatc aattcaccac aaacagccgt cctaggcttg catggtgtca aagagagacc
1801 tgtcactgtt aatggacaaa ttgtctcaag accaatgatg tacttggctt tgacttatga
1861 tcatagattg ctagatggtg agaagctgtt atccttcttg aagactgtta aagagttgat
1921 tgaagaccct agaaaatgtt gttatggtga tttgaaattt gcagcccata ccaatctgat
1981 ttcatgatta ttcttgtccg ctattactgt gtgaatctct tctatttaag tatgatattt
2041 aaagatatgc tatgatgata aatggaacta tgaattaatg ttttcttttt ataatgagat
2101 atggaaaagc gc
//
LOCUS RATALADPS 1612 bp ds-DNA ROD 08-AUG-1990
DEFINITION Rat delta-aminolevulinate dehydratase processed pseudogene,
complete cds.
ACCESSION J04764
KEYWORDS delta-aminolevulinate dehydratase; pseudogene.
SOURCE Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1612)
AUTHORS Bishop,T.R., Frelin,L.P. and Boyer,S.H.
TITLE Isolation and characterization of a rat delta-aminolevulinate
dehydratase processed pseudogene
JOURNAL Genomics 7, 629-632 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.R.Bishop, 20-APR-1990.
FEATURES from to/span description
pept.ps 271 1264 pseudo-delta-aminolevulinate dehydratase
rpt 101 108 8 bp flanking direct repeat
rpt 1385 1392 8 bp flanking direct repeat
BASE COUNT 404 a 411 c 418 g 379 t
ORIGIN
1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt
61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt
121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca
181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta
241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc
301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa
361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct
421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga
481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca
541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg
601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat
661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg
721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat
781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac
841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat
901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga
961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca
1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct
1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa
1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc
1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga
1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa
1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga
1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta
1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt
1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg
1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at
//
LOCUS MCAAAF1 229 bp ds-DNA VRL 08-AUG-1990
DEFINITION Cauliflower mosaic virus aphid acquisition factor 5' end.
ACCESSION M32809
KEYWORDS aphid acquisition factor.
SOURCE Cauliflower mosaic virus viral DNA, clone pCMS31.
ORGANISM Cauliflower mosaic virus
Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE 1 (bases 1 to 229)
AUTHORS Vaden,V.R. and Melcher,U.
TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications
for mechanisms of recombination
JOURNAL Virology 177, 717-726 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by U.K.Melcher, 13-MAR-1990.
FEATURES from to/span description
pept / 36 > 229 aphid acquisition factor (AA at 38)
IVS < 1 35 intron X
BASE COUNT 88 a 43 c 40 g 58 t
ORIGIN
1 aattaatcat cttaacaacc tcaatgagat tgtaggaaga agcttactcg gaatatggaa
61 gatcaactca tacttcggac taagcaaaga cccttcggag tccaaatcaa aaaacccgtc
121 agtttttaat actgcaaaaa ccatttttaa gagtgggggg gttgattact cgagcctaaa
181 ggaaataaaa tcccttttag aagctcaaaa cactagaatt aaaagtcta
//
LOCUS MCAAAFCP 1125 bp ds-DNA VRL 08-AUG-1990
DEFINITION Cauliflower mosaic virus aphid acquisition factor (3' end), and
coat protein (5' end) genes.
ACCESSION M32813
KEYWORDS aphid acquisition factor; coat protein.
SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S.
ORGANISM Cauliflower mosaic virus
Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE 1 (bases 1 to 1125)
AUTHORS Vaden,V.R. and Melcher,U.
TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications
for mechanisms of recombination
JOURNAL Virology 177, 717-726 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by U.K.Melcher, 13-MAR-1990.
FEATURES from to/span description
pept < 1 246 aphid acquisition factor (AA at 1)
pept 248 637 ORF
pept 619 > 1125 coat protein
BASE COUNT 435 a 218 c 223 g 249 t
ORIGIN
1 aaaaacccgt cagtttttaa tactgcaaaa accattttta agagtggggg ggttgattac
61 tcgagccaac taaaagaaat aaaatctctt ttagaatctc aaaatactag aattaaaagt
121 ctagaaaaag caattcaatc cttagagcgt aagattgaac cagagccctt aactaaagaa
181 gaagttaaag agcttaaaga atcgattaac tcgatcaaag aaggattaaa gaatattatt
241 ggctaaaatg gctaatctta atcaaatcca aaaagaagtc tctgaaatcc tcagtgacca
301 aaaatccatg aaagcggata taaaagctat cttagaacta ttaggatccc aaaatcctat
361 taaagaaagc ttagaagctg ttgcagcgaa aatcgttaat gacttaagca agctcatcaa
421 tgattgcccc tgtaacaaag agatattaga agccttaggc aatcagccta aagagcaact
481 aatagaacaa cctaaagaga aaggcaaagg ccttaatcta ggaaaatatt cttaccccaa
541 ttacggagta ggaaatgaag aattaggatc ctctggaaac cctaaagctt taacctggcc
601 cttcaaagct ccagcaggat ggccgaatca attttagaca ggaccattaa ccggttctgg
661 tataatctgg gagaagattg tctctcagaa agtcaatttg accttatgat aaggttaatg
721 gaagagtccc ttgacgggga ccaaattatt gatctaacct ctctacctag tgataatttg
781 caggtcgaac aggttatgac aactaccgaa gactcgatct cggaagaatc agaattcctt
841 ctagcaatag gagaaacatc tgaagacgaa agcgattcag gagaagaacc tgaattcgaa
901 caagttcgaa tggatcgaac aggaggaacg gagattccca aagaagaaga tggtgaacca
961 tctagacaca atgagagaaa gagaaagacc ccggaagacc ggtactttcc aactcaacca
1021 aagaccattc caggacaaaa acaaacgtct atgggaatgc tcaacatcga ctgccaaacc
1081 aatcgaagaa ccttaatcga cgactgggca gcagaaatcg gactg
//
LOCUS MCACPX 106 bp ds-DNA VRL 08-AUG-1990
DEFINITION Cauliflower mosaic virus coat protein, partial cds.
ACCESSION M32814
KEYWORDS coat protein.
SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S.
ORGANISM Cauliflower mosaic virus
Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE 1 (bases 1 to 106)
AUTHORS Vaden,V.R. and Melcher,U.
TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications
for mechanisms of recombination
JOURNAL Virology 177, 717-726 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by U.K.Melcher, 13-MAR-1990.
FEATURES from to/span description
pept < 1 > 106 coat protein precursor (AA at 2)
BASE COUNT 52 a 16 c 18 g 20 t
ORIGIN
1 atgcaagaaa acatccaaga agaagtatca taagcgatac aagaagaaat ataaggtcta
61 taaaccttat aagaagaaga agaaattccg atccggaaaa tacttc
//
LOCUS MCACTCTP 221 bp ds-DNA VRL 08-AUG-1990
DEFINITION Cauliflower mosaic virus cell-to-cell transport protein, 5' end.
ACCESSION M32812
KEYWORDS cell-to-cell transport protein.
SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S.
ORGANISM Cauliflower mosaic virus
Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE 1 (bases 1 to 221)
AUTHORS Vaden,V.R. and Melcher,U.
TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications
for mechanisms of recombination
JOURNAL Virology 177, 717-726 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by U.K.Melcher, 13-MAR-1990.
FEATURES from to/span description
pept 43 > 221 cell-to-cell transport protein
BASE COUNT 93 a 40 c 32 g 56 t
ORIGIN
1 ctctaacgag tatccacaga aaaaataacc ttctgtgttg agatggattt gtatccagaa
61 gaaaataccc aaagcgagca atcgcagaat tctgaaaata atatgcaaat atttaaatca
121 gaaaattcgg atggattctc ctccgatcta atgatctcaa acgatcaatt aaagaatatc
181 tctaaaaccc aattaacctt ggaaaaagaa aagatattca a
//
LOCUS MCAMPX 972 bp ds-DNA VRL 08-AUG-1990
DEFINITION Cauliflower mosaic virus matrix protein (put.), partial cds.
ACCESSION M32811
KEYWORDS matrix protein.
SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S.
ORGANISM Cauliflower mosaic virus
Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE 1 (bases 1 to 972)
AUTHORS Vaden,V.R. and Melcher,U.
TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications
for mechanisms of recombination
JOURNAL Virology 177, 717-726 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by U.K.Melcher, 13-MAR-1990.
FEATURES from to/span description
pept < 1 187 matrix protein (put.) (AA at 2)
ORF 895 > 972 ORF 7
mRNA < 1 283 35S RNA
signal 253 257 35S TATA box
binding 882 895 primer binding site
BASE COUNT 321 a 209 c 200 g 242 t
ORIGIN
1 ctgtcacttc atcgaaagga cagtagaaaa ggaaggtggc tcctacaaat gccatcattg
61 cgataaagga aaggctatca ttcaagatgc ctctgccgac agtggtccca aagatggacc
121 cccacccacg aggagcatcg tggaaaaaga agacgttcca accacgtctt caaagcaagt
181 ggattgatgt gacatctcca ctgacgtaag ggatgacgca caatcccact atccttcgca
241 agacccttcc tctatataag caagctcatt tcatttggag aggacacgct gaaatcacca
301 gtctctctct acaaatctat ctctctctct attctccata ataatgtgtg agtagttccc
361 agataaggga attagggttc ttatagggtt tcgctcatgt gttgagcata taagaaaccc
421 ttagtatgta tttgtatttg taaaatactt ctatcaataa aatttctaat tcctaaaacc
481 aaaatccagt actaaaatcc agatctccta aagtccctat agatctttgt cgtgaatata
541 aaccagacac gagacgacta aacctggagc ccagacgccg tttgaagcta gaagtaccgc
601 ttaggcagga ggccgttagg gaaaagatgc taaggcaggg ttggttacgt tgactccccc
661 gtaggtttgg tttaaatatg atgaagtgga cggaaggaag gaggaagaca aggaaggata
721 aggttgcagg ccctgtgcaa ggtaagaaga tggaaatttg atagaggtac gttactatac
781 ttatactata cactaaggga tgcttgtatt ttaccctata ccccctaata accccttatc
841 gatctaaaga aataatccgc ataagccccc gcttaaaaaa ttggtatcag agccatgaat
901 aggtctatga ccaaaactca agaggataaa acctcaccaa aatacgaaag agttcttaac
961 tctaaagata aa
//
LOCUS MCAORFIX1 74 bp ds-DNA VRL 08-AUG-1990
DEFINITION Cauliflower mosaic virus unidentified protein gene, partial cds.
ACCESSION M32808
KEYWORDS .
SOURCE Cauliflower mosaic virus viral DNA, clone CMS31.
ORGANISM Cauliflower mosaic virus
Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE 1 (bases 1 to 74)
AUTHORS Vaden,V.R. and Melcher,U.
TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications
for mechanisms of recombination
JOURNAL Virology 177, 717-726 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by U.K.Melcher, 13-MAR-1990.
FEATURES from to/span description
pept < 1 / 34 unidentified protein (AA at 2)
IVS 35 > 74 intron X
BASE COUNT 29 a 12 c 13 g 20 t
ORIGIN
1 attatcagtg gacattcacg atgccacagg taaggtatat cttcctttaa tcactaaaga
61 ggagataaat aaaa
//
LOCUS MCAORFX2 178 bp ds-DNA VRL 08-AUG-1990
DEFINITION Cauliflower mosaic virus unidentified protein gene, 5' end.
ACCESSION M32810
KEYWORDS .
SOURCE Cauliflower mosaic virus viral DNA, clone pCMS31.
ORGANISM Cauliflower mosaic virus
Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE 1 (bases 1 to 178)
AUTHORS Vaden,V.R. and Melcher,U.
TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications
for mechanisms of recombination
JOURNAL Virology 177, 717-726 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by U.K.Melcher, 13-MAR-1990.
FEATURES from to/span description
pept 94 > 178 Unidentified protein
binding 82 94 primer binding site
BASE COUNT 68 a 38 c 24 g 48 t
ORIGIN
1 tgcttgtatt ttaccctata taccctaata accccttatc gatttaaaga aataatccgc
61 ataagccccc gcttaaaaat tggtatcaga gccatgaatc ggtttaaaga ccaaactcaa
121 gagggtaaaa cctcatcaaa atacgaaaga gttcttaact ctaaagataa aagatctt
//
LOCUS MCARTRA 467 bp ds-DNA VRL 08-AUG-1990
DEFINITION Cauliflower mosaic virus reverse transcriptase (V) gene, partial
cds.
ACCESSION M32815
KEYWORDS reverse transcriptase.
SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S.
ORGANISM Cauliflower mosaic virus
Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE 1 (bases 1 to 467)
AUTHORS Vaden,V.R. and Melcher,U.
TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications
for mechanisms of recombination
JOURNAL Virology 177, 717-726 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by U.K.Melcher, 13-MAR-1990.
FEATURES from to/span description
pept < 1 > 467 reverse transcriptase (AA at 3)
BASE COUNT 157 a 110 c 86 g 114 t
ORIGIN
1 agaagatctt ctcttccttc gactgcaagt caggattctg gcaagttctg ctagatcaag
61 aatcaagacc tctaacggca ttcacatgtc cccaaggtca ctacgaatgg aatgtggtcc
121 ctttcggcct aaagcaggct ccatccatat tccaaagaca catggacgaa gcatttcgtg
181 tgttcagaaa gttctgttgc gtttatgtcg acgacattct cgtattcagt aacaacgaag
241 aagatcacct acttcatgta gcgatgatct tacaaaagtg caatcaacat ggaatcatcc
301 tttccaaaaa gaaagcacaa ctcttcaaga agaagataaa cttccttggt ctagaaatag
361 atgaaggaac acataagcct caaggacaca tcttggaaca catcaacaag ttcccagata
421 cccttgaaga caagaagcaa cttcagagat tcttaggcat actaaca
//
LOCUS HUMOSTRO 1425 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human osteopontin mRNA, complete cds.
ACCESSION J04765
KEYWORDS osteopontin; phosphoprotein.
SOURCE Human adult osteoblast, cDNA to mRNA, clone HOP-10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1425)
AUTHORS Young,M.F., Kerr,J.M., Termine,J.D., Wewer,U.M., Wang,M.G.,
McBride,O.W. and Fisher,L.W.
TITLE cDNA cloning, mRNA distribution and heterogeneity, chromosomal
location, and RFLP analysis of human osteopontin (OPN)
JOURNAL Genomics 7, 491-502 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.F.Young, 04-MAY-1990.
FEATURES from to/span description
pept 68 1012 osteopontin precursor
/hgml_locus_uid="LK0223T" /nomgen="OPN"
/map="unassigned"
sigp 68 115 osteopontin signal peptide
matp 176 1012 osteopontin
mRNA < 1 1425 osteopontin mRNA
BASE COUNT 449 a 293 c 296 g 387 t
ORIGIN Chromosome 4q13.
1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact
61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc
121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga
181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca
241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc
301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga
361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca
421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac
481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg
541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga
601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt
661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga
721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca
781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc
841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt
901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct
961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa
1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa
1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg
1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa
1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg
1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact
1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag
1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag
//
LOCUS VSVGLYPA 1657 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 84-PN-H) glycoprotein gene,
complete cds.
ACCESSION M35207
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 84-PN-H, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 482 a 364 c 364 g 447 t
ORIGIN
1 aacagagatc aatctgtttc ctcgacacca tgaagtgcct tttgtactta gctctgttat
61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggtaattgga
121 aaaatgttcc ttccagttac cattactgcc catcaagctc agatttgaat tggcataatg
181 gcctaatagg cacagcctta caggtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac
301 cgaaatatat aacacattcc atccggtcct tcacaccatc cgtggagcaa tgcaaggaaa
361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactccc catcatgtac
481 ttgtcgatga gtacacggga gaatgggttg attcccaatt catcaacggg aaatgcagca
541 atgacatctg ccctactgtg cataactcca caacctggca ttccgattac aaagttaaag
601 ggctgtgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggggagc
661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgct tacgaaactg
721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg
781 tttggttcga aatggcagac aaagatctgt ttgctgcagc aaaattccca gaatgtccag
841 aagggtcaag catatctgct ccatctcaga catcagtgga cgtgagtctc atccaggatg
901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc
961 ttcccatctc tccggtagac ctaagctatc ttgctcctaa aaatccagga accggcccag
1021 ctttcacaat catcaatggt actctgaaat attttgagac caggtacatc agagtcgata
1081 ttgctgctcc aattctccca agaatggttg gaacaatcag tggaactacc acagaaagag
1141 aactgtggga agactgggca ccatatgaag atgtggaaat cgggcctaat ggagttctga
1201 aaaccagttc aggatataaa tttcctttat acatgattgg acatggcatg ctggactctg
1261 atctccatct cagctcgaag gctcaagtgt ttgaacaccc tcacattcag gatgctgcat
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttgtcc aaaaatccaa
1381 tcgagctcgt agaagggtgg ttcagcggtt ggaagagctc cattgcttct tttttcttta
1441 tcataggttt gatcattgga ttattcttgg tcctccgagt tggtatctat ctttgcatca
1501 agttaaaaca gaccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaaccaga atcggacacg acagagtaat catatttaaa ccaaatcgac ttgtgatatc
1621 atgctccaag aggctttaat tatgtttcaa ttttatg
//
LOCUS VSVGLYPB 1657 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 85-CR-B1) glycoprotein gene,
complete cds.
ACCESSION M35208
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-CR-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 487 a 358 c 366 g 446 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagtcttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPC 1657 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 87-CR-B2) glycoprotein gene,
complete cds.
ACCESSION M35209
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-CR-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 489 a 358 c 364 g 446 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPD 1657 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 84-CR-B) glycoprotein gene,
complete cds.
ACCESSION M35210
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 84-CR-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 488 a 357 c 365 g 447 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 gattgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPE 1665 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 87-OA-B) glycoprotein gene,
complete cds.
ACCESSION M35211
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-OA-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 476 a 364 c 358 g 467 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcgtct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta
1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS VSVGLYPF 1665 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 87-VC-B) glycoprotein gene,
complete cds.
ACCESSION M35212
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-VC-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 476 a 362 c 358 g 469 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttaatagg cacagccttg caagttaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcgtct tagctcaaag gctcaggtat ttgaacatcc tcacattcaa gacgctgctt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta
1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS VSVGLYPG 1657 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 84-PN-B1) glycoprotein gene,
complete cds.
ACCESSION M35213
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 84-PN-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 488 a 356 c 366 g 447 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttctggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcggg tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccga
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctatgcatca
1501 acttacggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgtttgaa ttgtatg
//
LOCUS VSVGLYPH 1665 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain Mudd-Summers) glycoprotein gene,
complete cds.
ACCESSION M35214
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain Mudd-Summers, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 477 a 366 c 353 g 469 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacca tgaagtgcct tttgtactta gcttttttat
61 tcatcggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttagtagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtacggac
301 cgaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gctgaagcag cgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca
541 atgacatatg ccccactgtc cataactcca caacctggca ttccgactat aaggtcaaag
601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct aggaaagaag ggcacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg
1021 tctttaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg
1141 tactgtggga tgactgggct ccatatgaag acgtgggaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt ttgaacatcc tcacattcaa gacgctgctt
1321 cgcagcttcc tgatggtgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagtttgt agaaggttgg ttcagtagtt ggaagagctc tattgcctct tttttcttta
1441 ccatagggtt aatcattgga ctattcttgg ttctccgagt tggtatttat ctttgcatta
1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 cgtaactcaa atcctgcaca acagattctt catgtttgaa ccaaatcaac ttgtgatatc
1621 atgctcaaag aggccttaat tatattttaa tttttaattt ttatg
//
LOCUS VSVGLYPI 1665 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 86-DF-P) glycoprotein gene,
complete cds.
ACCESSION M35215
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 86-DF-P, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 477 a 363 c 357 g 468 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctct tttttcttta
1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS VSVGLYPJ 1788 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 82-HD-B) glycoprotein gene,
complete cds.
ACCESSION M35216
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 82-HD-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1788)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 508 a 362 c 344 g 574 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat
1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg
//
LOCUS VSVGLYPK 1788 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 83-HD-B1) glycoprotein gene,
complete cds.
ACCESSION M35217
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 83-HD-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1788)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 508 a 362 c 344 g 574 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat
1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg
//
LOCUS VSVGLYPL 1815 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 83-HD-B2) glycoprotein gene,
complete cds.
ACCESSION M35218
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 83-HD-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1815)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 512 a 358 c 346 g 599 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gattgctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatt agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcactat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggcctaatt catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taattttatt tttttatttt tattttttat tttttatttt ttatttttta
1741 ttttttattt tttattttta ttttttatct ttttattttt tattttttat ttttattttt
1801 atttttattt ttatg
//
LOCUS VSVGLYPM 1665 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain San Juan 56-NM-B) glycoprotein
gene, complete cds.
ACCESSION M35219
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain San Juan 56-NM-B, Indiana
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1665)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 484 a 364 c 355 g 462 t
ORIGIN
1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
121 aaaatgttcc ttctaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
181 acttaatagg cacagccata caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
301 cgaagtatat aacacagtcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg
421 gatatgcaac tgtgacggat gccgaagcag tgattgtcca ggtgactcct caccatgtgc
481 tggttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca
541 attacatatg ccccactgtc cataactcta caacctggca ttctgactat aaggtcaaag
601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc
661 tatcatccct gggaaaggag ggcacagggt tcagaagtaa ctactttgct tatgaaactg
721 gaggcaaggc ctgcaaaatg caatactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtcta attcaggacg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttccaatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg
1021 ctttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
1201 ggaccagttc aggatataag tttcctttat acatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcaa gacgctgctt
1321 cgcaacttcc tgatgatgag agtttatttt ttggtgatac tgggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtagtt ggaaaagctc tattgcctct tttttcttta
1441 tcatagggtt aatcattgga ctattcttgg ttctccgagt tggtatccat ctttgcatta
1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgga ccaaatcaac ttgtgatacc
1621 atgctcaaag aggcctcaat tatatttgag tttttaattt ttatg
//
LOCUS VSVGLYPN 1812 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 85-ES-B1) glycoprotein gene,
complete cds.
ACCESSION M35220
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-ES-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1812)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 512 a 362 c 347 g 591 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt
1741 tttatttttt attttttatt tttatttttt atctttttat ttcttatttt tatttttatt
1801 tttattttta tg
//
LOCUS VSVGLYPO 1861 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 86-ES-B2) glycoprotein gene,
complete cds.
ACCESSION M35221
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 86-ES-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1861)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 520 a 363 c 347 g 631 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
1741 ttttattttt tattttttta ttttttattt tttatttttt atttttattt tttatttttt
1801 tattttttta ttttttattt ttatttttat ttttattttt atttttattt ttatttttat
1861 g
//
LOCUS VSVGLYPP 1802 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 84-GM-B) glycoprotein gene,
complete cds.
ACCESSION M35222
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 84-GM-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1802)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 506 a 362 c 348 g 586 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta
1741 ttttttattt ttatttttta tctctttatt ttttattttt atttttattt ttatttttta
1801 tg
//
LOCUS VSVGLYPQ 1868 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 87-ES-B) glycoprotein gene,
complete cds.
ACCESSION M35223
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-ES-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1868)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 520 a 362 c 347 g 639 t
ORIGIN
1 aacagagatc aatctgtttt cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat tacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct catcatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catatttcaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt
1801 atttttttat ttttttattt tttattttta tttttatttt tatttttatt tttattttta
1861 tttttatg
//
LOCUS VSVGLYPR 1860 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 86-ES-B1) glycoprotein gene,
complete cds.
ACCESSION M35224
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 86-ES-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1860)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 520 a 362 c 347 g 631 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcta caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt
1801 atttttttat tttttatttt tatttttatt tttattttta tttttatttt tatttttatg
//
LOCUS VSVGLYPS 1657 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 69-PN-L) glycoprotein gene,
complete cds.
ACCESSION M35225
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 69-PN-L, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 487 a 358 c 364 g 448 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcattggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattattgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgccacgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 ctaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca agtgactcct catcatgtat
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggagaac
661 tctcatccct agggaaggaa ggcacgggat tcagaagcaa ttactttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgac aaagatctat ttgctgcagc aaaattcccg gagtgtcccg
841 aaggatcgag tatctccgct ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt ggagcaggtc
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctaaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccacct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg ttcagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttgttcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca ataccacaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPT 1657 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 87-CR-B1) glycoprotein gene,
complete cds.
ACCESSION M35226
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 87-CR-B1, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 490 a 358 c 363 g 446 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg aatcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPU 1657 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 85-CR-B2) glycoprotein gene,
complete cds.
ACCESSION M35227
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-CR-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 486 a 357 c 367 g 447 t
ORIGIN
1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
1561 agtaatcaca atactgcaca acagaataat catatttgga ccaaatcgac ttgtgatatc
1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS VSVGLYPV 1796 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 86-GM-B) glycoprotein gene,
complete cds.
ACCESSION M35228
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 86-GM-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1796)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 503 a 362 c 349 g 582 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttcccattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat agtcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta
1741 ttttttattt tttattttta ttttttatct ttttattttt atttttattt tttatg
//
LOCUS VSVGLYPW 1800 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 85-GM-B) glycoprotein gene,
complete cds.
ACCESSION M35229
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-GM-B, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1800)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 506 a 362 c 348 g 584 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taatttttat tttttatttt tattttttta ttatttttta ttttttattt
1741 ttatttttta tttttatttt ttcatctttt tattttttat ttttattttt attttttatg
//
LOCUS VSVGLYPX 1652 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 59-PN-L) glycoprotein gene,
complete cds.
ACCESSION M35230
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 59-PN-L, Indiana serotype), cDNA
to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1652)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 487 a 344 c 367 g 453 t 1 others
ORIGIN
1 aacagagatc aatctgtctt cctagcatca tgaaatgnat tttgtacttc gcttttctcc
61 tggtaggagt gaactgtaaa tttacaatag ttttcccaca taatcaaaaa gggaactgga
121 aaaatgtgcc atccagttac cattattgtc catctagttc cgatcttaat tggcacaatg
181 gcctgattgg tacagcatta caagtcaaga tgcccaaaag tcacaaagca attcaggcag
241 acggctggat gtgtcatgca tccaaatggg tcaccacatg tgacttcagg tggtacggac
301 ccaaatatat tacgcattct attcggtcat tcacaccctc tgtggaacag tgcagagaaa
361 gcattgaaca gaccaaacaa ggaatctgga taaatcctgg ctttcctcct cagagttgtg
421 gatatgcaac tgtgacggac gcggaagcag taattgtcca ggtcacccca catcatgtgc
481 tcgtagatga gtacacaggc gaatgggtag attcccaatt tttgaacgga aagggcaaag
541 acgacatatg ccccacagtc cataattcga ccacatggca ttcagactac aaagtcaaag
601 ggctttgtga tgctaaccta atctcaatgg acatcacttt cttctcagaa gatggtgagc
661 tgtcctcctt gggcaaagat gggactggat tcagaagcaa ttactttgct tatgaaaatg
721 gagataaggc ttgccgcatg cagtattgca aacattgggg agtcagactc ccttctggag
781 tctggttcga gatggcagac aaagatttgt ttgcaggagc caagttcccc gagtgtccag
841 acggaaccag tatctctgct ccttctcaga catccgtaga tgtaagctta attcaggacg
901 tcgaaagaat cttggactat tccttgtgtc aagagacttg gagtaaaatc aaggcaggcc
961 ttccgatttc tccagttgat ctaagctatc tcgctccaaa aaatccaggg acaggtcccg
1021 ctttcaccat catcaatggg accttgaagt actttgagac cagatacatc agagtggaca
1081 ttgctgctcc aatcataccc agaatggtgg gaactatcag tggaacatct acagaaagag
1141 aattgtggga tgactggtac tcttatgaag atgtggagat tgggcccaat ggagtattga
1201 aaacaacctc aggatataaa tttcctcttt atatgattgg acacggaatg ctggactctg
1261 atcttcacct gacatcgaag gcccaggttt ttgaacatcc ccacattcag gacgctgcat
1321 cacagcttcc tgacggcgag actttatttt ttggtgatac cgggctctca aaaaacccca
1381 ttgaattagt agagggatgg ttcagtgggt ggaaaagctc tattgcttcg ttcttctttg
1441 tcattggatt gatcattgga ttatatttgg ttttaagggt gggaatatat ctttgtctta
1501 aactaaagca tacaaaaaag agaaaaattt acacagacat tgagatgaac agacttggaa
1561 aataaccaaa attgtgtgat ggatttgctt ttttgccaaa gaagcctgtg atatccttct
1621 cagagaagct ttaatctttt attgacaata tg
//
LOCUS VSVGLYPY 1657 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 84-PN-B2) glycoprotein gene,
complete cds.
ACCESSION M35231
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 84-PN-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1657)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 489 a 367 c 357 g 444 t
ORIGIN
1 aacagagatc catctgtttc ctcgacacca tgaagtgcct tttgtactta gctcttttat
61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggaaattgga
121 aaaatgttcc ttccagttac cattactgcc cgtcaagttc agatttgaac tggcataacg
181 gcctaatagg cacagcccta caggtcaaga tgcccaagag tcacaaggct attcaagcag
241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac
301 cgaaatatat aacacattcc atccggtcct tcacaccatc tgtggagcaa tgcaaggaaa
361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg
421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtaactccc catcatgtac
481 ttgtcgatga gtacacggga gagtgggtag attcccaatt catcaacggg aaatgcagca
541 atgacatatg ccctactgtc cataactcca caacctggca ttccgactac aaagtcaaag
601 ggctgtgtga ttctaacctc atctccatgg acatcacctt cttctcagaa gatggggagc
661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgca tacgaaacag
721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg
781 tctggtttga aatggccgac aaagatctgt ttgctgcagc aaaattccca gaatgtccag
841 aagggtcaag catctctgct ccatctcaga catcagtgga cgtgagtctt attcaggacg
901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc
961 ttcccatctc tccggtagat ctaagctatc ttgctccaaa aaacccagga accgggccag
1021 cctttacaat aatcaatggt actctgaaat actttgagac cagatacatc agagtcgata
1081 ttgctgctcc aattctccca agaatggtcg gaacaatcag tggaactacc accgagagag
1141 aattgtggga cgactgggca ccatatgaag atgttgaaat cggaccaaat ggagttctga
1201 aaaccagttc aggatataaa ttccctttat atatgattgg acatggcatg ctggactctg
1261 atctccatct cagctcaaag gctcaggtat ttgaacaccc tcacattcag gatgctgcat
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaatccaa
1381 tcgagctcgt agaagggtgg ttcagtggtt ggaaaagctc cattgcttct tttttcttta
1441 tcataggttt gatcattgga ttattcttgg ttctccgagt tggtatctat ctctgcatca
1501 agttaaaaca taccaaaaaa agaaaaatct atacagacat tgagatgaat cgactcggaa
1561 agtaactgga atcagacacg acagaataat catatttaaa ccaaatcgac ttgtgatatc
1621 atgctctaag aggctttaat tatgttttaa ttttatg
//
LOCUS VSVGLYPZ 1812 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 85-ES-B2) glycoprotein gene,
complete cds.
ACCESSION M35232
KEYWORDS glycoprotein; surface antigen.
SOURCE Vesicular stomatitis virus (strain 85-ES-B2, Indiana serotype),
cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1812)
AUTHORS Bilsel,P.A. and Nichol,S.T.
TITLE Polymerase errors accumulating during natural evolution of the
glycoprotein gene of vesicular stomatitis Indiana serotype viruses
JOURNAL J. Virol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.T.Nichol, 14-JUN-1990.
FEATURES from to/span description
pept 30 1565 glycoprotein
BASE COUNT 513 a 360 c 346 g 593 t
ORIGIN
1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
361 gcattgaaca gacaaaacaa ggaacttggc tgaatccagg cttccctccc caaagttgcg
421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc
661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta
1501 aattaaagca caccaataaa agacagattt atacagacat aaagatgaac cgacttggaa
1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt
1741 tttatttttt attttttatt tttatttttt atctttttat tttttatttt tatttttatt
1801 tttattttta tg
//
LOCUS ECODKSA 1273 bp ds-DNA BCT 08-AUG-1990
DEFINITION E.coli dnaK suppressor (dksA) gene, complete cds.
ACCESSION M34945
KEYWORDS dnaK suppressor.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1273)
AUTHORS Kang,P.J. and Craig,E.A.
TITLE Identification and characterization of a new Escherichia coli gene
that is a dosage-dependent suppressor of a dnaK deletion mutation
JOURNAL J. Bacteriol. 172, 2055-2064 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 229 441 ORF 1
pept 619 1074 dnaK suppressor (dksA)
BASE COUNT 343 a 301 c 333 g 296 t
ORIGIN
1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa
61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc
121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata
181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg
241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt
301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg
361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa
421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg
481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg
541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc
601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt
661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa
721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa
781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac
841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac
901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc
961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg
1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag
1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga
1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct
1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt
1261 acgcatagaa gat
//
LOCUS STFCYCLI 2180 bp ss-mRNA INV 08-AUG-1990
DEFINITION Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds.
ACCESSION M33880
KEYWORDS cyclin B.
SOURCE Starfish (A.pectinifera) egg, cDNA to mRNA, clone
lambda-gt10-cyc10.
ORGANISM Asterina pectinifera
Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa;
Stelleroidea; Asteroidea; Spinulosida; Asterinidae.
REFERENCE 1 (bases 1 to 2180)
AUTHORS Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T.
TITLE The starfish egg mRNA responsible for meiosis reinitiation encodes
cyclin
JOURNAL Dev. Biol. 140, 241-252 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Kishimoto, 11-MAY-1990.
FEATURES from to/span description
pept 126 1313 cyclin B (CYC)
mRNA < 126 2180 cyclin B mRNA
BASE COUNT 650 a 440 c 482 g 608 t
ORIGIN
1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg
61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga
121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt
181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc
241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag
301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca
361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa
421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct
481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac
541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt
601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc
661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc
721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc
781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg
841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt
901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca
961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca
1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag
1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc
1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat
1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca
1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg
1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt
1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt
1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg
1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa
1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt
1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga
1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc
1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca
1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac
1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat
1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag
1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat
2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg
2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt
2161 gtaataaagt taatggaaat
//
LOCUS ECOCLPPA 1236 bp ds-DNA BCT 08-AUG-1990
DEFINITION Escherichia coli ATP-dependent clp protease proteolytic component
(clpP) gene, complete cds.
ACCESSION J05534
KEYWORDS ATP-dependent protease.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1236)
AUTHORS Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J.,
Bowers,B. and Gottesman,S.
TITLE Sequence and structure of ClpP, the proteolytic component of the
ATP-dependent Clp protease of Escherichia coli
JOURNAL J. Biol. Chem. 265, 12536-12545 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Gottesman, 23-APR-1990.
FEATURES from to/span description
pept 378 1001 ATP-dependent protease (clpP)
mRNA 279 > 1236 clpP mRNA
pept < 1 133 ORF
signal 244 249 -35 region
signal 267 272 -10 region
signal 165 193 transcription termination signal
binding 367 372 ribosome binding site (put.)
BASE COUNT 329 a 291 c 318 g 298 t
ORIGIN
1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga
61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa
121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc
181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg
241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac
301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt
361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg
421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt
481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta
541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc
601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc
661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt
721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga
781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg
841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat
901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg
961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg
1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt
1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc
1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg
1201 cgcaagctga ttgccggtcc atccgtgtat atctgc
//
LOCUS HAMACAPT 1837 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA,
partial cds.
ACCESSION J05590 M22755
KEYWORDS N-acetylglucosamine-1-phosphate transferase; glycosyl transferase;
tunicamycin-resistance.
SOURCE Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone
TRG-[4,22].
ORGANISM Cricetulus longicaudatus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 889 to 960)
AUTHORS Lehrman,M.A., Zhu,X. and Khounlo,S.
TITLE Amplification and molecular cloning of the hamster tunicamycin-
sensitive N-acetylglucosamine-1-phosphate transferase gene: The
hamster and yeast enzymes share a common peptide sequence
JOURNAL J. Biol. Chem. 263, 19796-19803 (1988)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1837)
AUTHORS Zhu,X. and Lehrman,M.A.
TITLE Cloning, sequence and expression of a cDNA encoding hamster
UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate
transferase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and sequence for [1] kindly submitted by M. Lehrman,
04-APR-1989. Draft entry and computer-readable sequence for [1]
kindly submitted by M.Lehrman, 12-JUN-1990.
FEATURES from to/span description
pept 148 1374 N-acetylglucosamine-1-phosphate transferase
(GPT)
binding 67 79 dolichol binding site
binding 222 234 dolichol binding site
BASE COUNT 363 a 515 c 418 g 541 t
ORIGIN
1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg
61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg
121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg
181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc
241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg
301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc
361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc
421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg
481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct
541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta
601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc
661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat
721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta
781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt
841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga
901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc
961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct
1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg
1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc
1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag
1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt
1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc
1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc
1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca
1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt
1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt
1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc
1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat
1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag
1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc
1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt
//
LOCUS HS4NA3ABC 10089 bp ds-DNA VRL 08-AUG-1990
DEFINITION Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C
genes, complete cds.
ACCESSION M34440
KEYWORDS nuclear antigen.
SOURCE Epstein-Barr virus type 2 (isolate AG876) DNA.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 10089)
AUTHORS Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E.,
Rickinson,A. and Kieff,E.
TITLE Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their
EBNA-3A, EBNA-3B and EBNA-3C genes
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.T.Sample, 23-MAY-1990.
Kieff Laboratory
Harvard Medical School
75 Francis Street
Boston, MA 02115
FEATURES from to/span description
pept 189 524 nuclear antigen EBNA-3A, exon 1
610 3051 nuclear antigen EBNA-3A, exon 2
pept 3237 3593 nuclear antigen EBNA-3B, exon 1
3672 6155 nuclear antigen EBNA-3B, exon 2
pept 6277 6636 nuclear antigen EBNA-3C, exon 1
6711 9560 nuclear antigen EBNA-3C, exon 2
IVS 525 609 EBNA-3A intron
IVS 3594 3671 EBNA-3B intron
IVS 6637 6710 EBNA-3C intron
BASE COUNT 2495 a 2837 c 2619 g 2138 t
ORIGIN
1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct
61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag
121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg
181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag
241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca
301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc
361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac
421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc
481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg
541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt
601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat
661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg
721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg
781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact
841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg
901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg
961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc
1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct
1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac
1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat
1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg
1261 tacgggcaag acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca
1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg
1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg
1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct
1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa
1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac
1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc
1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc
1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg
1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg
1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa
1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca
1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca
2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag
2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc
2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac
2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg
2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa
2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg
2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg
2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc
2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg
2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg
2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa
2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg
2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct
2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat
2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt
2941 ggccagaatg tcacagacca tgaacctaga agggtggtgg tgtcagctat tgttcacatg
3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg
3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat
3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc
3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga
3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg
3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac
3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag
3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca
3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc
3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag
3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc
3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga
3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac
3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct
3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata
3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa
3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag
4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt
4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga
4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata
4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag
4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct
4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg
4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa
4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg
4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca
4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag
4621 ccagaacaga gcaaccaaga gccaagcctg attcccctgc ccccacagtg gtccttcggc
4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg
4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca
4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca
4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga
4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt
4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac
5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc
5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac
5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct
5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc
5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc
5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc
5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc
5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc
5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc
5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag
5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta
5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg
5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac
5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac
5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc
5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct
6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt
6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt
6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg
6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta
6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg
6301 gactctatac agtcacctga caatgcgcgg ggagatgatg tacagaatac tggtgagcat
6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac
6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc
6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta
6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca
6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc
6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc
6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct
6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg
6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat
6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca
6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg
7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag
7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga
7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc
7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt
7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta
7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc
7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg
7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag
7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc
7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac
7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag
7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg
7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag
7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt
7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga
7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta
7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca agtaagccac
8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc
8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc
8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc
8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc
8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc
8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg
8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac
8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg
8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg
8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata
8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc
8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact
8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc
8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt
8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag
8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc
9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac
9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc
9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc
9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac
9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg
9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag
9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa
9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg
9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca
9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga
9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta
9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca
9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg
9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg
9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg
9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag
9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc
10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt
10081 ggaggatcc
//
LOCUS HUMPAPB 3061 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human prostatic acid phosphatase mRNA, complete cds.
ACCESSION M34840
KEYWORDS glycoprotein; prostatic acid phosphatase; secretory protein.
SOURCE Human prostate, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1199)
AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
Moore,D.L.
TITLE Covalent structure, disulfide bonding and identification of
reactive surface and active site residues of human prostatic acid
phosphatase
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3061)
AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
Moore,D.L.
TITLE Covalent structure, disulfide bonding and identification of
reactive surface and active site residues of human prostatic acid
phosphatase
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.L.Van Etten, 08-JUN-1990.
Chemistry Department
Purdue University
West Lafayette, IN 47907
FEATURES from to/span description
pept 15 1175 prostatic acid phosphatase precursor (EC
3.1.3.2) /hgml_locus_uid="LA0168F"
/nomgen="ACPP" /map="3q21-qter"
sigp 15 110 prostatic acid phosphatase signal peptide
matp 111 1172 prostatic acid phosphatase
BASE COUNT 917 a 642 c 681 g 821 t
ORIGIN Chromosome 3q21-qter.
1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc
61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga
121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg
181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg
241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt
301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg
361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc
421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc
481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg
541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag
601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg
661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga
721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat
781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc
841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc
901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga
961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc
1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt
1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa
1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag
1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc
1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca
1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt
1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa
1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt
1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt
1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga
1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc
1681 aaggagaggc aaagaaagga gatacagtgg agacatctgg aaagttttct ccactggaaa
1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt
1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat
1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata
1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga
1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag
2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag
2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt
2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta
2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca
2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa
2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc
2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac
2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa
2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat
2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac
2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc
2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc
2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat
2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc
2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat
2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta
3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa
3061 a
//
LOCUS LDV3ECPA 1064 bp ss-RNA VRL 08-AUG-1990
DEFINITION Lactate dehydrogenase-elevating virus capsid protein gene, complete
cds.
ACCESSION M34856
KEYWORDS viral capsid protein.
SOURCE Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones
A16, B24, B63, B90, B103, C44, dt4.
ORGANISM Lactate dehydrogenase-elevating virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Togaviridae.
REFERENCE 1 (bases 1 to 1064)
AUTHORS Godeny,E.K., Speicher,D.W. and Brinton,M.A.
TITLE Map location of lactate dehydrogenase-elevating virus (LDV) capsid
protein (Vp1) gene
JOURNAL Virology (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.K.Godeny, 06-JUN-1990.
FEATURES from to/span description
pept 637 984 capsid protein
pept 135 650 unidentified ORF
BASE COUNT 267 a 223 c 265 g 309 t
ORIGIN 3' terminus region.
1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc
61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc
121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt
181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg
241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt
301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg
361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc
421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg
481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta
541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga
601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc
661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat
721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc
781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga
841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt
901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat
961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg
1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt
//
LOCUS MBIFRHADGB 3903 bp ds-DNA BCT 08-AUG-1990
DEFINITION M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase
(frhADGB) gene, complete cds.
ACCESSION J02914
KEYWORDS 8-hydroxy-5-deazaflavin-reducing hydrogenase.
SOURCE M.thermoautotrophicum (strain delta-H) DNA.
ORGANISM Methanobacterium thermoautotrophicum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanobacteriaceae.
REFERENCE 1 (bases 1 to 3903)
AUTHORS Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T.
TITLE Cloning, sequence determination, and expression of the genes
encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin
reducing hydrogenase from Methanobacterium thermoautotrophicum
delta-H
JOURNAL Biochemistry (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.Alex, 25-MAY-1990.
FEATURES from to/span description
pept 298 1515 F420 hydrogenase alpha-subunit (frhA)
pept 1524 2000 F420 hydrogenase delta-subunit (frhD)
pept 2114 2824 F420 hydrogenase gamma-subuint (frhG)
pept 2837 3682 F420 hydrogenase beta-subunit (frhB)
binding 284 293 ribosomal binding site (put.)
binding 1514 1520 ribosomal binding site (put.)
binding 2100 2106 ribosomal binding site (put.)
binding 2823 2832 ribosomal binding site (put.)
BASE COUNT 1201 a 878 c 965 g 859 t
ORIGIN
1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta
61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac
121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt
181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt
241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg
301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg
361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg
421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg
481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc
541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac
601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac
661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt
721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac
781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa
841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc
901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa
961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata
1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca
1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt
1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg
1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta
1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag
1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca
1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc
1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa
1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg
1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt
1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg actggcggtc
1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg
1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg
1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa
1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa
1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg
1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc
2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga
2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag
2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact
2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga
2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga
2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac
2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt
2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc
2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc
2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt
2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct
2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg
2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct
2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac
2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata
2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg
3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa
3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga
3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta
3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat
3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat
3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga
3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg
3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg
3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga
3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga
3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat
3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt
3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa
3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt
3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat
3901 cga
//
LOCUS MUSLRCNA 1751 bp ds-DNA ROD 08-AUG-1990
DEFINITION Mouse loricrin gene, complete cds.
ACCESSION M34398
KEYWORDS cell envelope protein; loricrin.
SOURCE Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1751)
AUTHORS Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D.,
Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M.,
Yuspa,S.H. and Roop,D.R.
TITLE Identification of a major keratinocyte cell envelope protein,
loricrin
JOURNAL Cell 61, 1103-1112 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Rothnagel, 22-MAY-1990.
FEATURES from to/span description
pept 71 1516 loricrin
signal 1740 1745 polyA signal
BASE COUNT 223 a 493 c 706 g 329 t
ORIGIN
1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc
61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg
121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg
181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg
241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg
301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg
361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg
421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg
481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg
541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg
601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg
661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc
721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg
781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg
841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg
901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg
961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg
1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg
1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc
1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg
1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag
1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg
1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg
1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg
1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac
1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc
1561 cgtgggcgcc gatgggctta actttctcat gaatttgcct gaggtttcca aacccttcac
1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct
1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca
1741 ataaatttgc t
//
LOCUS PFAGIPA 1906 bp ds-DNA INV 08-AUG-1990
DEFINITION P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete
cds.
ACCESSION J05544
KEYWORDS glucosephosphate isomerase.
SOURCE P.falciparum (strain 7G8) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 1906)
AUTHORS Kaslow,D.C. and Hill,S.
TITLE Cloning metabolic pathway genes by complementation in Escherichia
coli: Isolation and expression of Plasmodium falciparum
glucosephosphate isomerase
JOURNAL J. Biol. Chem. 265, 12337-12341 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.C.Kaslow, 04-JUN-1990.
FEATURES from to/span description
pept 28 1803 glucosephosphate isomerase (EC 5.3.1.9)
BASE COUNT 782 a 174 c 289 g 661 t
ORIGIN Chromosome 14.
1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat
61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat
121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct
181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa
241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc
301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata
361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat
421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg
481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa
541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat
601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat
661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct
721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt
781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg
841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta
901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga
961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat
1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat
1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca
1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa
1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat
1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa
1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg
1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa
1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt
1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta
1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt
1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat
1681 gacacaagaa atcaaaagaa atcagataat acctataatt ttaatgaatc tacaaaaaat
1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag
1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt
1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt
//
LOCUS RATGADA 3198 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Rat glutamic acid decarboxylase (GAD) mRNA, complete cds.
ACCESSION M34445
KEYWORDS glutamic acid decarboxylase.
SOURCE Rat brain and hypothalamus, cDNA to mRNA, (libraries of
D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and
pCR22HP.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3198)
AUTHORS Wyborski,R.J., Bond,R.W. and Gottlieb,D.I.
TITLE Characterization of a cDNA coding for rat glutamic acid
decarboxylase
JOURNAL Mol. Brain Res. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.I.Gottlieb, 23-MAY-1990.
FEATURES from to/span description
pept 184 1965 glutamic acid decarboxylase
mRNA < 1 3198 glutamic acid decarboxylase mRNA
signal 3174 3179 polyA signal
BASE COUNT 869 a 742 c 767 g 820 t
ORIGIN
1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca
61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc
121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag
181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat
241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc
301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag
361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa
421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct
481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg
541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac
601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac
661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt
721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta
781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc
841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca
901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc
961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc
1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg
1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt
1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat
1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca
1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg
1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc
1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa
1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag
1501 cagtatgacg tctcctacga tacaggggac aaggcgattc agtgtggccg ccatgtggac
1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc
1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt
1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa
1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc
1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg
1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc
1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc
1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca
2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg
2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa
2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat
2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc
2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca
2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag
2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta
2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca
2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac
2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag
2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt
2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta
2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt
2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt
2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc
2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa
3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta
3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt
3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa
3181 taaaaaatat tctccatg
//
LOCUS RATTSHRA 5270 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Rat thyroid stimulating hormone receptor mRNA, complete cds.
ACCESSION M34842
KEYWORDS TSH receptor; thyroid stimulating hormone receptor;
transmembrane protein.
SOURCE Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to
mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5270)
AUTHORS Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C.,
McBride,O.W. and Kohn,L.D.
TITLE Cloning, chromosomal assignment and regulation of the rat
thyrotropin receptor by thyrotropin, agents which increase cAMP
levels and thyroid autoantibodies
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.D.Kohn, 05-JUN-1990.
FEATURES from to/span description
pept 55 2349 thyroid stimulating hormone receptor precursor
sigp 55 117 thyroid stimulating hormone receptor signal
peptide
matp 118 2346 thyroid stimulating hormone receptor
signal 5256 5261 poly-A signal
BASE COUNT 1415 a 1343 c 1176 g 1336 t
ORIGIN
1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg
61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga
121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag
181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact
241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat
301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa
361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta
421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct
481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct
541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg
601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg
661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga
721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct
781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag
841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac
901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat
961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc
1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag
1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag
1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac
1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca
1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg
1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc
1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc
1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac
1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt
1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc
1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg
1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc
1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac
1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg
1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt
1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc
1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg
2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc
2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag
2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa
2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt
2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca
2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa
2401 catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct
2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc
2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac
2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg
2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag
2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga
2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat
2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc
2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat
2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt
3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta
3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat
3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac
3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga
3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt
3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa
3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct
3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt
3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt
3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata
3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt
3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac
3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat
3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga
3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat
3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag
3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc
4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc
4081 tgtgaagtag ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg
4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa
4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga
4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag
4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc
4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg
4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc
4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct
4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg
4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca
4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa
4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta
4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga
4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg
4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc
4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg
5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca
5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt
5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg
5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg
//
LOCUS STMBLIPA 840 bp ds-DNA BCT 08-AUG-1990
DEFINITION S.clavuligerus beta-lactamase inhibitory protein gene, complete
cds.
ACCESSION M34538
KEYWORDS beta-lactamase inhibitory protein.
SOURCE S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP.
ORGANISM Streptomyces clavuligerus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 840)
AUTHORS Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E.
TITLE Isolation and characterization of a beta-lactamase inhibitory
protein from Streptomyces clavuligerus and cloning and analysis of
corresponding gene
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.E.Jensen, 25-MAY-1990.
Dept. of Microbiology
University of Alberta
Edmonton, Alberta Canada T6G 2E9
FEATURES from to/span description
pept 67 672 beta-lactamase inhibitory protein precursor
(bliP)
sigp 67 174 beta-lactamase inhibitory protein signal
peptide
matp 175 669 beta-lactamase inhibitory protein
binding 52 63 ribosomal binding site
signal 686 724 transcription termination signal
BASE COUNT 140 a 248 c 320 g 132 t
ORIGIN
1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga
61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc
121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg
181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc
241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc
301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc
361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg
421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg
481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg
541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc
601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg
661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc
721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg
781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc
//
LOCUS TRBPARPA 763 bp ss-mRNA INV 08-AUG-1990
DEFINITION T.brucei procyclin (pPS-1) mRNA, complete cds.
ACCESSION M34790
KEYWORDS procyclic acidic repetitive protein; procyclin.
SOURCE T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone
pPS-1.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 763)
AUTHORS Dorn,P.L., Aman,R.A. and Boothroyd,J.C.
TITLE Inhibition of protein synthesis results in super-induction of
procyclin RNA
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Boothroyd, 04-JUN-1990.
Stanford University School of Medicine
Dept of Microbiology and Immunology
Sherman Fairchild Science Building
Stanford, CA 94305-5402 USA
FEATURES from to/span description
pept 29 466 procyclin
mRNA < 1 763 procyclin mRNA
rpt 215 393 12 bp degenerate repeats
BASE COUNT 192 a 215 c 156 g 200 t
ORIGIN
1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct
61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc
121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg
181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga
241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga
301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga
361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc
421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt
481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc
541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt
601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc
661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag
721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt
//
LOCUS W10PPIBCA 4761 bp ds-DNA BCT 08-AUG-1990
DEFINITION Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes,
complete cds, and ippD gene, 3' end.
ACCESSION M34849
KEYWORDS antigen.
SOURCE Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA,
clones HB101/pHS[4108,5357].
ORGANISM Plasmid pWR100
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 4761)
AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J.
TITLE Nucleotide sequence of the invasion plasmid antigen B and C genes
(ipaB and ipaC) of Shigella flexneri
JOURNAL Microb. Pathog. 4, 345-357 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 538 and 3987 to 4761)
AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J.
JOURNAL Unpublished (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Baudry, 05-JUN-1990.
Center for Vaccine Development
University of Maryland at Baltimore
10 South Pine Street
Baltimore, MD 21201
FEATURES from to/span description
pept 606 1073 polypeptide I (ippI)
pept 1079 2821 polypeptide B (ipaB)
pept 2841 3932 polypeptide C (ipaC)
pept 3983 > 4761 ipaD protein
pept < 1 549 unidentified ORF
binding 595 600 ippI ribosomal binding site
binding 1065 1070 ipaB ribosomal binding site
binding 2829 2834 ipaC ribosomal binding site
binding 3971 3977 ipaD ribosomal binding site
signal 556 561 ippI -10 region
signal 2723 2728 ipaC -35 region
signal 2745 2750 ipaC -10 region
BASE COUNT 1769 a 952 c 805 g 1235 t
ORIGIN
1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat
61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat
121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct
181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca
241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat
301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc
361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg
421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca
481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat
541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag
601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa
661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca
721 tttattcata tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct
781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta
841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat
901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag
961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat
1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat
1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga
1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac
1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa
1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct
1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa
1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga
1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat
1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat
1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga
1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca
1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat
1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa
1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg
1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc
1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga
1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt
2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt
2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca
2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc
2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc
2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac
2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac
2401 cctcacagac cttataccaa agtttctcaa gaatttttct tctcaactgg acgatttaat
2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc
2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc
2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa
2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga
2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc
2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg
2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata
2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc
2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat
3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg
3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag
3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag
3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga
3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc
3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa
3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc
3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa
3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc
3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt
3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc
3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca
3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca
3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc
3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag
3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg
3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc
4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa
4081 aacaacgacc agttctcatc ctgtaagttc ccttactatg ctcaacgaca cccttcataa
4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac
4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc
4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt
4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga
4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata
4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag
4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa
4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc
4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac
4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat
4741 agacaatatg ttaaaaagct t
//
LOCUS YEPOMPHA 824 bp ss-mRNA BCT 08-AUG-1990
DEFINITION Y.enterocolitica cationic outer membrane protein (ompH) gene,
complete cds.
ACCESSION M34854
KEYWORDS cationic outer membrane protein;
enterobacterial outer membrane protein;
lipopolysaccharide binding protein; secretory protein;
structural membrane protein.
SOURCE Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 824)
AUTHORS Hirvas,L., Koski,P. and Vaara,M.
TITLE The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and
comparison to known ompH sequences
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.H.Hirvas, 06-JUN-1990.
University of Helsinki
FEATURES from to/span description
pept 324 818 ompH protein precursor (gtg start codon)
sigp 324 392 ompH protein signal peptide
matp 393 815 ompH protein
binding 308 317 ompH ribosomal binding site
signal 120 125 ompH -35 region
signal 143 148 ompH -10 region
BASE COUNT 257 a 147 c 198 g 221 t 1 others
ORIGIN
1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg
61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg
121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt
181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt
241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa
301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag
361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa
421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag
481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc
541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga
601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc
661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta
721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta
781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac
//
LOCUS YSCEXG1A 2056 bp ds-DNA PLN 08-AUG-1990
DEFINITION S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase
(EXG1) gene, complete cds.
ACCESSION M34341
KEYWORDS 1,3-glucan glucanohydrolase; exo-1,3-B-glucanase.
SOURCE S.cerevisiae (strain AB320) DNA, clone YEp13.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2056)
AUTHORS Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A.,
Nebreda,A.R., Mendez,E. and del Rey,F.
TITLE Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1,
in the yeast Saccharomyces cerevisiae
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.del Rey, 17-MAY-1990.
Instituto de Microbiologia bioquimica
Facultad de Biologia
CSIC/Universidad de Salamanca
37008 Salamanca, SPAIN
FEATURES from to/span description
pept 363 1709 exo-1,3-B-glucanase/1,3-B-D-glucan
glucanohydrolase precursor (EXG1, EC 3.2.1.58)
sigp 363 482 exo-1,3-B-glucanase/1,3-B-D-glucan
glucanohydrolase signal peptide
matp 483 1706 exo-1,3-B-glucanase/1,3-B-D-glucan
glucanohydrolase
mRNA 265 > 1709 EXG1 mRNA
signal 153 159 TATA box
BASE COUNT 610 a 395 c 396 g 655 t
ORIGIN Chromosome XII-right arm.
1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg
61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac
121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa
181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat
241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt
301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta
361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta
421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa
481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg
541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg
601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg
661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg
721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc
781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca
841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt
901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca
961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat
1021 acttggacac tgttattggt atcgaattga ttaatgagcc attgggtcct gttctagaca
1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca
1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact
1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct
1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg
1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga
1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg
1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt
1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct
1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg
1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc
1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat
1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt
1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa
1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta
1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat
1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt
2041 gagcttttcc gccggc
//
LOCUS ACCPCACGH 2391 bp ds-DNA BCT 08-AUG-1990
DEFINITION A.calcoaceticus gamma-carboxymuconolactone decarboxylase (pcaC) and
protocatechuate 3,4-dioxygenase alpha and beta subunit (pcaGH)
genes, complete cds.
ACCESSION M33798
KEYWORDS gamma-carboxymuconolactone decarboxylase;
protocatechuate 3,4-dioxygenase.
SOURCE A.calcoaceticus DNA, clone pZR2.
ORGANISM Acinetobacter calcoaceticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 2391)
AUTHORS Hartnett,C., Neidle,E.L., Ngai,K.-L. and Ornston,N.
TITLE DNA sequences of genes encoding Acinetobacter calcoaceticus
protocatechuate 3,4-dioxygenase: Evidence indicating shuffling of
genes and of DNA sequences within genes during their evolutionary
divergence
JOURNAL J. Bacteriol. 172, 956-966 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 324 728 gamma-carboxymuconolactone decarboxylase
(pcaC) (EC 4.1.1.44)
pept 750 1463 protocatechuate 3,4-dioxygenase, alpha subunit
(pcaH) (EC 1.13.11.3)
pept 1491 2120 protocatechuate 3,4-dioxygenase, beta subunit
(pcaG) (EC 1.13.11.3)
ORF < 1 203 ORF (AA at 3)
binding 312 315 ribosomal binding site (put.)
binding 737 740 ribosomal binding site (put.)
binding 1478 1482 ribosomal binding site (put.)
BASE COUNT 670 a 463 c 533 g 725 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttatca aacccaacgt tacttgctct gtttcatttt atgtgcaggt attgccgtca
61 atggtgctca atcttcaatg ccagtattga gtgcgcgttt ctatccaacg caatgtcgtg
121 caacaggtgt ggcatggatg tctggaattg gccgttttgg tcgccgtatt tggtgcatgg
181 attggtgcag ttttattagg taataactgg tcatttacca tgatcctgag catgttgata
241 tcctgctcgt gccgctgcaa ttgcaatttt tgttaaatcc cttgttggca catacagatg
301 caacttgatg tgaggtttct cttatgaatg atgaacaacg ttataaacaa gggctggaag
361 tccgtacaga agttctgggt gaaaagcatg ttaaccgttc acttgagaac ctaaacgact
421 tcaatcagga ttttcaaaac tttattagcc gttttgcatg gggagaggtt tggtctcgtc
481 ctggacttcc tcgtcatacg cgtagtttgg tcacaattgc agtgttgttg gcgcttgggc
541 gtgaagatga gctgcgtatc gatttacgtg cctgtttcaa caatggtgtg accaaagatg
601 aattaaaaga gttgattcta cattgttcac tttatgcagg cttacccgct tcaaatgctg
661 cgatgcatat ggcagaagag gtctttaagg acttgggaat tgcacctgaa aaagtaaaca
721 aggattaatg gatttagagg gagtagaaaa tgtctcaaat tatttgggga gcttacgctc
781 agtcgaatac cgaagatcat ccgccagctt atcgaccagg ttataaaacc agtgtgttac
841 gctcgcctaa gaatcgactc atttctattg cagaaacttt aagtgaagtc actgcaccac
901 attttagtgc tgataaattt ggtccaaaag acaatgactt aatcttgaac tatgccaaag
961 atggtttgcc gattggtgaa cgtgtcattg tgcatggtta tgtacgtgac cagtttggtc
1021 gacctgtaaa aaatgcactt gttgaagtat ggcaagccaa tgcatctggt cgttatcgtc
1081 atccaaatga tcaatatatc ggtgccatgg atcctaattt cggtggttgt ggccgtatgt
1141 tgactgatga taatggttat tatgtattcc gtacgattaa accaggtcca tacccatggc
1201 gtaaccgtat caatgaatgg cgtcctgctc acattcactt ctctttaatt gcagatggtt
1261 gggcacaacg tttaatttcg cagttctatt ttgaaggcga tacattgatt gattcttgcc
1321 caatcctgaa aaccattcct tctgagcaac aacgtcgtgc gctaattgca ttagaagaca
1381 agagcaattt cattgaagct gacagccgtt gttatcgctt tgacattacg cttcgtggtc
1441 gcgcggactt acttcgaaaa tgacttaact taatggatgg agtaaaaaac atgaatggtt
1501 ggaattttca ggaactcaaa gaaaccccat ctcaaacagg tggtccatat gtccatattg
1561 gtcttttgcc aaaacaagcc aatatcgaag tatttgaaca caatttaggt aacaaccttg
1621 tacaggacaa tacgcaaggt caacgcatcc gattagaagg tcaagtgttt gatgggctta
1681 gtttacctct acgtgacgta ctgattgaaa tctggcaagc agataccaat ggcgtttacc
1741 caagtcaagc tgatactcaa ggtaaacaag tcgatccgaa ctttttgggt tggggccgta
1801 caggtgcaga ttttggtaca ggcttctgga gttttaatac cattaagcca ggtgctgttc
1861 caggccgtaa aggttcaact caagcaccac atatttcact gattattttt gcgcgtggta
1921 tcaacattgg tcttcacact cgtgtctatt ttgatgacga agcagaagca aatgcaaaag
1981 atcctgtttt aaacagtatc gagtgggcga cacgtcgtca aacactggtt gccaaacgtg
2041 aagagcgtga tggtgaagtg gtttatcgtt ttgatattcg tattcaaggt gaaaacgaaa
2101 cggtattttt cgatatctaa atttttgact atggaacggg ccgtatatct ttcggtatac
2161 agcctccata attttacaag gaaacgtaaa tgttcaatac aaacaaactg gttaaacaac
2221 tttgtctaag tggtgcgttg ttgatgtcat tgcctatatt gtcaacgacc tatgcggctg
2281 aaaatactgt accagcatct aaatcaactt atgttgtaaa aaatttaaac attggtgatt
2341 tacctgttaa aaccctcgtg ccaatcactg ctaaaacaag agagcaagct t
//
LOCUS HAMSAA3A 530 bp ss-mRNA ROD 08-AUG-1990
DEFINITION M.auratus serum amyloid A (SAA3) mRNA, complete cds.
ACCESSION M33431
KEYWORDS serum amyloid A protein.
SOURCE M.auratus (strain LVG) embryo fibroblast, cDNA to mRNA, clone
pAA825.
ORGANISM Mesocricetus auratus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 530)
AUTHORS Gervais,C. and Suh,M.
TITLE Serum amyloid A protein-related mRNA expression in herpes simplex
virus type 2-transformed hamster cells
JOURNAL Mol. Cell. Biol. 10, 4412-4414 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Suh, 30-MAR-1990.
FEATURES from to/span description
pept 42 410 serum amyloid A
signal 510 515 polyA signal
BASE COUNT 147 a 122 c 140 g 121 t
ORIGIN
1 ggggacctct agagtcactc accaggcatc tcgaaagcag gatgaagcct ttccttgcca
61 tcattttttg cttcttgatc ctgggagttg acagccaaag gtggttccag ttcatgaaag
121 aagctggcca agggagtaca gacatgtgga gagcttactc tgacatgaga gaggccaact
181 ggaaaaactc agataaatac ttccatgccc gggggaacta tgatgctgcc aaaaggggac
241 caggaggagc ctgggctgct aaagtgatca gtgatgccag agagggtatt cagaggttca
301 caggacgtgg agcagcggac tcaagagctg accagtttgc caacaaatgg ggccggagtg
361 gtaaagaccc caaccacttt cgacctgctg gcctgcctag taaatactga gtttcatctt
421 catgctattc ccagccatgc accccaaggt tgcatttact agattaagtt agttcctgag
481 ctctggacgt gtaagcaccc gaaatttaga ataaatacct gtgaaatgct
//
LOCUS HEIBEXBA 795 bp ds-DNA BCT 08-AUG-1990
DEFINITION H.influenza (strain RM153) capsulation protein (bexB) gene, 5' end.
ACCESSION M33787
KEYWORDS capsulation protein.
SOURCE H.influenza (type b, strain RM153) DNA.
ORGANISM Haemophilus influenzae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 795)
AUTHORS Kroll,J.S. and Moxon,E.R.
TITLE Capsulation in distantly related strains of Haemophilus influenzae
type b: genetic drift and gene transfer at the capsulation locus
JOURNAL J. Bacteriol. 172, 1374-1379 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 795 capsulation protein (bexB)
BASE COUNT 188 a 133 c 177 g 297 t
ORIGIN
1 atgcaatatg gtgataaaac aacatttaaa cagtcattag ctattcaagg acgggtgatt
61 aacgcgttgc ttatgcgtga aataatcact cgttatgggc gtcaaaatat tggctttttt
121 tggctatttg ttgagccttt gctcatgact ttttttattg taatgatgtg gaaatttatc
181 cgtgctgata aattttctac tttaaatatg attgcctttg tgatgacggg gtatccaatg
241 gcaatgatgt ggcgaaatgc ttcaaaccgt gcgattggtt ccatttctgc aaacttgagt
301 ttactttatc accgaaatgt tcgagtgcta gataccattt ttactcgcgt gttgcttgag
361 gttgctggtg cttctattgc tcaaattctt tttatggcaa tcttagtcat gattgattgg
421 attgatgcac cgcatgatgt gttttatatg ctcattgcat ggttcctcat ggcaatgttt
481 gcttttggct tggggttaat catttgtgcg attgctcagc aatttgatgt gtttggtaaa
541 atttggggaa cactcagttt cgttttatta ccaatttctg gcgcattctt cttcgtccat
601 aaccttccag cacaggcaca atctattgca ctttggtttc cgatgattca tggtacagaa
661 atgttccgcc atggctattt tggcgataca gtggttactt acgaaagtat aggctttttg
721 gttgtgagtg atctagcctt gttgttactg ggcttagtga tggttaaaaa ctttagtaaa
781 ggagttgagc cacaa
//
LOCUS HEIBEXBB 795 bp ds-DNA BCT 08-AUG-1990
DEFINITION H.influenza (strain RM926) capsulation protein (bexB) gene, 5' end.
ACCESSION M33788
KEYWORDS capsulation protein.
SOURCE H.influenza (type b, strain RM926 ATCC 8468) DNA, clones
lambda-926.[8,11,14].
ORGANISM Haemophilus influenzae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE 1 (bases 1 to 795)
AUTHORS Kroll,J.S. and Moxon,E.R.
TITLE Capsulation in distantly related strains of Haemophilus influenzae
type b: genetic drift and gene transfer at the capsulation locus
JOURNAL J. Bacteriol. 172, 1374-1379 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 > 795 bexB protein
BASE COUNT 188 a 128 c 178 g 301 t
ORIGIN
1 atgcaatatg gtgatcaaac aacgtttaaa cagtcattag cgattcaagg aagagtgatt
61 aatgcattac ttatgcgtga aataatcact cgttatgggc gcaaaaatat tggctttcta
121 tggctatttg ttgagccttt actcatgact ttttttattg tgatgatgtg gaaatttatt
181 cgtgctgata aattttcaac tttaaatatg attgcttttg tcatgaccgg gtatcctatg
241 gcgatgatgt ggcgaaatgc ttctaatcga gctattggtt cgatttctgc aaacttgagt
301 ttactttatc accgaaatgt tcgagtgcta gatactatct ttactcgagt attgcttgag
361 gttgcaggtg cttcgattgc tcaaatcctt tttatggcag ttttagtgct tattggttgg
421 attgatgcac cacgtgatgt gttttatatg cttatggcat ggtttctcat ggctatgttt
481 gcttttgcct taggccttat tatttgtgcg gtagctcaac aatttgatgt gttcggtaaa
541 atttggggaa ccttaagttt cgtattgtta ccaatttctg gtgcattttt ctttgttcac
601 aaccttccat cacaggctca atctatcgcg ctttggctcc caatgattca tggtacggaa
661 atgtttcgtc atggctactt tggggatacg gtcgttactt atgagagtat cggattttta
721 gtcgtaagtg atttggcctt attgctaatg ggcttggtga tggttaaaaa ctttagtaag
781 gggattgagc cacaa
//
LOCUS PSEARCDA 1502 bp ds-DNA BCT 08-AUG-1990
DEFINITION P.aeruginosa membrane protein arcD (complete cds) and arcA (5' end)
genes.
ACCESSION M33223
KEYWORDS membrane protein.
SOURCE P.aeruginosa (strain PAO1) DNA, clone pME183.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1502)
AUTHORS Luethi,E., Baur,H., Gamper,M., Brunner,F., Villeval,D.,
Mercenier,A. and Haas,D.
TITLE The arc operon for anaerobic arginine catabolism in Pseudomonas
aeruginosa contains an additional gene, arcD, encoding a membrane
protein
JOURNAL Gene 87, 37-43 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Haas, 26-MAR-1990.
FEATURES from to/span description
pept 24 1472 membrane protein (arcD)
pept 1494 > 1502 membrane protein (arcA)
binding 8 15 ribosome binding site (put.)
BASE COUNT 247 a 507 c 443 g 305 t
ORIGIN
1 attcctatag ggagattccg aaaatgtccc aagaatccag ccaaaaactc cgactaggag
61 cgctaacggc tctggttgtc ggctcgatga tcggaggggg catcttctcc ctcccgcaaa
121 acatggcagc cagcgccgat gtcggcgccg tcctgatcgg atgggcgatc accgccgtcg
181 gcatgctcac cctcgctttc gtcttccaga ccctcgccaa ccgcaagccc gaactcgacg
241 gcggcgtgta cgcctacgcc aaggccggct tcggcgacta catgggtttc tcctcggcct
301 ggggctactg gatcagcgcc tggctgggca acgtcggcta cttcgtcctg ttgttcagca
361 ccctcggcta cttcttcccg atcttcggca agggcgacac cgtcgcggcg atcgtctgcg
421 cctcggtcct gctctgggcc ctgcacttcc tggtgctgcg cgggatcaag gaggcggcgt
481 tcatcaacac cgtcaccacc gtggccaagg tcgtgccgct gttcctgttc atcctgatct
541 gcctgttcgc cttcaagctg gacatcttca ccgccgacat ctggggcaag agcaacccgg
601 acctgggcag cgtgatgaac caggtgcgca acatgatgct ggtcaccgtc tgggtgttca
661 tcggcatcga gggcgcgagc atcttctcct cccgcgcgga aaaacgttcc gacgtcggca
721 aggccaccgt gatcggcttc atcaccgtcc tgctcctgct ggtgctggtc aacgtgctgt
781 ccatgggcgt gatgacccag ccggaactgg ccaagctgca gaacccgtcg atggcgctgg
841 tactcgagca tgtggtcggc cactggggcg ccgtgctgat cagcgtcggc ctgctgatct
901 cgctgctggg cgcgctgctc tcctgggtgc tgctgtgcgc cgagatcatg ttcgccgccg
961 ccaaggacca caccatgccg gagttcctgc gccgcgagaa cgccaaccag gtgccggcca
1021 acgccctgtg gctgaccaac atctgcgtac aggtgttcct ggtggtggtg ttcttcacct
1081 cgggcgaccc ggacggcatg gacccgtaca ccaagatgct gctcctggcc acctcgatga
1141 tcctgatccc gtacttctgg tccgccgcct atggcctgct gctgaccctg aagggcgaga
1201 cctacgagaa cgatgcccgc gaacgcagca aggacctggt catcgccggc atcgccgtgg
1261 cctacgcggt ctggctgctc tacgcggggg gcctgaagta cctgctgctg tccgccctgc
1321 tgtatgcgcc cggcgcgatc cttttcgcca aggccaagca cgaggtcggc cagccgatct
1381 tcaccggaat cgaaaaactg atcttcgccg cagtcgtcat tggcgccctg gtggctgcct
1441 acggcctcta cgacggtttc ctcaccctct gattactcca ctggagacgt gaaatgagca
1501 cg
//
LOCUS HUMSPTB 6765 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human beta-spectrin (SPTB) mRNA, complete cds.
ACCESSION J05500
KEYWORDS beta-spectrin; spectrin.
SOURCE Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and
V252.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 6765)
AUTHORS Winkelmann,J.C., Chang,J.-G., Tse,W.T., Scarpa,A.L.,
Marchesi,V.T. and Forget,B.G.
TITLE Full length sequence of the cDNA for human erythroid beta-spectrin
JOURNAL J. Biol. Chem. 265, 11827-11832 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Winkelmann, 08-MAY-1990.
FEATURES from to/span description
pept 96 6509 beta-spectrin /nomgen="SPTB" /map="14"
/hgml_locus_uid="LS0033T"
mRNA < 1 6765 SPTB mRNA
signal 6716 6722 poly-A signal
BASE COUNT 1626 a 1822 c 2146 g 1171 t
ORIGIN Chromosome 14q23-q24.
1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag
61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa
121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg
181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg
241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg
301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca
361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc
421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac
481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc
541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag
601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag
661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta
721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca
781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc
841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct
901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc
961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg
1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca
1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca
1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca
1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag
1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg
1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc
1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata
1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga
1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg
1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc
1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg
1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg
1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga
1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg
1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc
1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg
1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg
2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc
2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc
2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt
2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga
2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc
2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg
2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc
2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat
2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgcaggcc ctgcgggagc
2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg
2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga
2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca
2761 ggttcgacat cctggaccag gagatgaaga ccttgatgac tcagattgat ggtgtgaacc
2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc
2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg
2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt
3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag
3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg
3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga
3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat
3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg
3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca
3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg
3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc
3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg
3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg
3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat
3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga
3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg
3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg
3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc
3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct
3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc
4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc
4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc
4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa
4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg
4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcggtcagac gacccgggca
4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga
4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg
4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa
4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg
4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta
4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg
4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg
4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc
4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt
4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct
4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc
4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg
5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg
5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca
5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa
5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc
5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca
5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg
5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc
5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga
5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca
5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg
5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg
5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc
5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct
5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc
5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca
5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc
6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt
6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc
6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg
6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca
6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg
6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga
6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg
6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac
6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg
6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg
6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag
6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa
6721 attttattgt ataaatatat caccatttac atgaggggaa acact
//
LOCUS STYEUTBC 2526 bp ds-DNA BCT 08-AUG-1990
DEFINITION S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes,
complete cds.
ACCESSION J05518
KEYWORDS ethanolamine ammonia-lyase.
SOURCE S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2526)
AUTHORS Faust,L.R.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M.
TITLE Cloning, sequencing, and expression of the genes encoding the
adenosylcobalamin-dependent ethanolamine ammonia-lyase of
Salmonella typhimurium
JOURNAL J. Biol. Chem. 265, 12462-12466 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.M.Babior, 08-MAY-1990.
FEATURES from to/span description
pept 141 1499 ethanolamine ammonia-lyase (eutB)
pept 1518 2378 ethanolamine ammonia-lyase (eutC)
binding 130 133 ribosome binding site
binding 1507 1510 ribosome binding site
BASE COUNT 563 a 687 c 779 g 497 t
ORIGIN
1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga
61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt
121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt
181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg
241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa
301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc
361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca
421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc
481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc
541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc
601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg
661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc
721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga
781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc
841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct
901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt
961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct
1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg
1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc
1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga
1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc
1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc
1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg
1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg
1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat
1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg
1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc
1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg
1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc
1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg
1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg
1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc
1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg
1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga
2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact
2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca
2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg
2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc
2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg
2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga
2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg
2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt
2521 gcctcc
//
LOCUS XELPCNA 1018 bp ss-mRNA VRT 08-AUG-1990
DEFINITION X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete
cds.
ACCESSION M34080
KEYWORDS nuclear protein; proliferating cell nuclear antigen.
SOURCE X.laevis oocyte, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1018)
AUTHORS Leibovici,M., Gusse,M., Bravo,R. and Mechali,M.
TITLE Characterization and developmental expression of Xenopus
proliferating cell nuclear antigen (PCNA)
JOURNAL Dev. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Leibovici, 08-MAY-1990.
FEATURES from to/span description
pept 28 813 proliferating cell nuclear antigen (PCNA)
mRNA < 1 1018 PCNA mRNA
BASE COUNT 284 a 223 c 237 g 274 t
ORIGIN
1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc
61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca
121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact
181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag
241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg
301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt
361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa
421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc
481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca
541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa
601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat
661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca
721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg
781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc
841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa
901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa
961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac
//
LOCUS YSCAAC2A 1333 bp ds-DNA PLN 08-AUG-1990
DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete
cds.
ACCESSION M34076 J05542
KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE S.cerevisiae (strain W303-1B) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1333)
AUTHORS Kolarov,J., Kolarova,N. and Nelson,N.
TITLE A third ADP/ATP-translocator gene in yeast
JOURNAL J. Biol. Chem. 265, 12711-12716 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Nelson, 08-MAY-1990.
FEATURES from to/span description
pept 235 1158 ADP/ATP-translocator protein (AAC2)
BASE COUNT 388 a 209 c 301 g 435 t
ORIGIN
1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc
61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg
121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa
181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt
241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt
301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat
361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt
421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat
481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg
541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct
601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc
661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg
721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc
781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct
841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg
901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga
961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa
1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg
1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc
1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc
1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag
1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc
1321 agccactgtt gtt
//
LOCUS YSCAAC3 1308 bp ds-DNA PLN 08-AUG-1990
DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete
cds.
ACCESSION M34075 J05542
KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE S.cerevisiae (strain W303-1B) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1308)
AUTHORS Kolarov,J., Kolarova,N. and Nelson,N.
TITLE A third ADP/ATP-translocator gene in yeast
JOURNAL J. Biol. Chem. 265, 12711-12716 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by N.Nelson, 08-MAY-1990.
FEATURES from to/span description
pept 78 1034 ADP/ATP-translocator protein (AAC3)
BASE COUNT 353 a 228 c 263 g 464 t
ORIGIN
1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata
61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc
121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg
181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag
241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca
301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta
361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt
421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg
481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat
541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg
601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac
661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga
721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt
781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa
841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga
901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag
961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta
1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc
1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata
1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa
1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc
1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt
//
LOCUS VSVPNA 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 06/85-NM-E) nucleoprotein gene,
complete cds.
ACCESSION M31845
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 06/85-NM-E, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 438 a 267 c 293 g 331 t
ORIGIN
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
61 aaaattaccg gccaacgagg atccggttga atacccggct gattacttca aaaataatac
121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catatgtata
181 ccagggtatc aagtccggta atccatccat cctccacata aatgcttatc tctacgctgc
241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg
301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc
361 acttcctgat gggaaatcag accctgaccg ttctgctgag gacgataaat ggcttcccat
421 ctacatccta ggtctctaca gagtgggcag atctaaagtt acggattaca gaaagaaact
481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg
661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa
721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
781 cgagctctgc cagatgatgt atccgggaca agaaattgac aaagcagatt catacatgcc
841 gtatatgatt gactttgggt tatctcagaa atccccctat tcatcagtga agaatccagc
901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaacgc
961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag acaagtacac
1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
1201 cgcaagaaga gcagtcaact ctctgcaatc acttcgtgag aagacaattg gcaaatatgc
1261 caaggcagag tttgacaagt gacaaaattt aagatatcac tatcactatt atattctatg
1321 ttatatatg
//
LOCUS VSVPNB 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 11/83-CA-B) nucleoprotein,
complete cds.
ACCESSION M31846
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 11/83-CA-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 437 a 265 c 296 g 331 t
ORIGIN
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
61 gaaattaccg gccaacgagg atccggtcga atacccggct gattacttca aaaataatac
121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata
181 ccaggggatc aagtccggta atccatccat cctccacata aatgcttatc tctatgctgc
241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg
301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc
361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaact
481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctcgtcga
541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg
661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa
721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
841 gtatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc
961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
1021 tgctgttggg tcctcagcag acattgaaca gcagttctat attggagaag acaaatacac
1081 aacagaaaaa gatgatggtc tgaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgaa aagacaattg gcaaatatgc
1261 caaggcagag tttgacaagt gacaaaattt aagataccac tatcactatt atattctatg
1321 ttatatatg
//
LOCUS VSVPNC 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 01/84-SN-P1) nucleoprotein
gene, complete cds.
ACCESSION M31847
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 01/84-SN-P1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 438 a 268 c 292 g 331 t
ORIGIN
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
61 aaaattaccg gccaacgagg atccggttga atacccggct gattacttca aaaataatac
121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata
181 ccagggtatc aagtccggta atccatccat cctccacata aatgcttatc tctacgctgc
241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg
301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc
361 acttcctgat gggaaatcag accctgaccg ttctgctgag gacgataaat ggcttcccat
421 ctacatccta ggtctctaca gagtgggcag atctaaagtt acggattaca gaaagaaact
481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg
661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa
721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
781 cgagctctgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
841 gtatatgatt gactttgggt tatctcagaa atccccctat tcatcagtaa agaatccagc
901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaacgc
961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag acaagtacac
1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
1201 cgcaagaaga gcagtcaact ctctgcaatc acttcgtgag aagacaattg gcaaatatgc
1261 caaggcagag tttgacaagt gacaaaattt aagatatcac tatcactatt atattctatg
1321 ttatatatg
//
LOCUS VSVPND 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 07/83-GA-P) Phosphoprotein and
nucleocapsid genes.
ACCESSION M31848
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 07/83-GA-P, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 435 a 272 c 297 g 325 t
ORIGIN
1 aacagatatc aaaatggccc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
61 caaattaccg gccaacgaag atccggttga atacccggcc gattatttca aaaataatac
121 caatatagtg ttatatgtga gcaccaaagt agcactaaat gatttgaggg catacgtata
181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc
241 attgaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg
301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gtcgaagaac tgaagacagc
361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaatt
481 actggacggg cttgaaaatc aatgcaaagt ggcgtcgacc agatttgaga gtctagtcga
541 agatggtctc gacttctttg acatatggga gaacgatcca aatttcacca agatagtggc
601 tgcagtggat atgttcttcc acatgtttaa aaagcatgag cgtgctccaa tcagatacgg
661 aaccatagtc tcacgattca aggactgtgc agcacttgcg acatttggac atctcagcaa
721 agtcagtggg ctctcaattg aagaactcac aacatgggtc ctgaataggg aggttgcaga
781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
841 gtatatgatc gactttggat tatcccagaa atccccatat tcatcagtga agaatccagc
901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaacgc
961 tagacagcct gacgacatcg aatacacctc actaacttgt gcaagtttgc tgctgtcatt
1021 cgctgttggg tcatcagcag acattgaaca gcaattctat attggagagg ataaatacac
1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
1141 agactggctt ggctggtatg atgacaatgg ggggaaaccc acaccagata tgctcaactt
1201 cgcgagaaga gcagttaact ccctgcagtc actccgtgag aagacaattg gcaaatatgc
1261 caaggcagag tttgacaagt gacaaaactt aatataccac tatcactatt atattctatg
1321 ttatatatg
//
LOCUS VSVPNE 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 11/82-VC-B2) nucleoprotein
gene, complete cds.
ACCESSION M31849
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 11/82-VC-B2, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
BASE COUNT 435 a 263 c 296 g 335 t
ORIGIN
1 aacagacatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
61 taaattaccg gccaacgagg atccggttga atacccggct gattacttca aaaataatac
121 caatatagtc ttatatgtga gcaccaaagt ggcgctaaat gatttgagag catacgtata
181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc
241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg
301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gttgaagaac tgaagacagc
361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
421 ctacatccta ggtctttaca gagtgggcag atccaaagtt acggattaca gaaagaaact
481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
541 ggatggtctt gacttttttg acatatggga gaatgatccg aatttcacca agatagttgc
601 tgcagtggat atgttcttcc acatgtttaa aaagcatgaa cgtgctccaa tcagatacgg
661 aaccatagtc tcaagattca aggactgtgc agcacttgcg acatttggac atctcagcaa
721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
841 atatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaagaatgc
961 tagacagcct gacgacattg aatacacttc actaacttgt gcaagtttac tgctgtcgtt
1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag acaaatacac
1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgag aagacaatcg gcaaatatgc
1261 caaggcagag tttgacaagt gacaaaattt aagatatcac tatcactatt atattctatg
1321 ttatatatg
//
LOCUS VSVPNF 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 07/84-OA-B) nucleoprotein gene,
complete cds.
ACCESSION M31850
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 07/84-OA-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 435 a 260 c 298 g 336 t
ORIGIN
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcca ttattcagcc
61 taagttaccg gccaacgagg atccgattga atacccggct gattacttca aaaataatac
121 caatatagta ttatatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata
181 tcaggggatc aagtccggta atccatccat actccacata aatgcttatc tttacgctgc
241 attaaaggga gtggaaggaa ctttggacag agactgggtt agctttggaa gaacaattgg
301 aaagagagag gagaatgtaa aaatcttcga tctagtgaaa gttgaagaac tgaagacagc
361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaact
481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
541 ggatggtcta gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
601 tgcagtggat atgttcttcc acatgtttaa aaagcatgaa cgtgctccaa tcagatacgg
661 aaccatagtc tcaagattca aggactgtgc agcacttgcg acattcggac atctcagcaa
721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
841 gtatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
901 ttttcatttc tggggacaac ttgcggcact cttgctaaga tcgactcgag caaagaatgc
961 tagacagcct gatgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag ataaatacac
1081 aacagaaaaa gatgatggtc ttaagaaatc ggatgtccca ccaaaaggaa ggaatgtcgt
1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgag aagacaattg gcaaatatgc
1261 caaggcagag tttgacaagt gacaaaattt aagataccac tgtcactatt atattctatg
1321 ttatatatg
//
LOCUS VSVPNG 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain ../52-GA-P) nucleoprotein gene,
complete cds.
ACCESSION M31851
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain ../52-GA-P, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 428 a 260 c 302 g 339 t
ORIGIN
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
61 taaattaccg gccaacgagg atccggttga atacccggcc gattacttca aaaataatac
121 caatatagtg ttgtatgtga gcacaaaagt agcactaaat gatttgaggg catacgtata
181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc
241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg
301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gttgaggaac tgaagacagc
361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
421 ctacatccta ggtctttaca gggtgggcag atctaaagtt acggattaca gaaagaaatt
481 attggacggg cttgagattc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
541 agatggtctc gacttctttg atgtatggga gaatgatcca aatttcacca agatagtggc
601 tgcagtggat atgttcttcc acatgttcaa aaagcatgag cgtgctccaa tcagatacgg
661 aaccatagtc tcacgattca aggactgtgc agcacttgcg acatttggac atctcagcaa
721 agtcagtggt ctttcaattg aagaactcac aacatgggtc ctaaataggg aagttgcaga
781 cgagctatgt cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
841 gtatatgatc gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc
961 tagacagcct gacgacatcg agtacacttc actaacttgt gcaagtttgc tgctgtcatt
1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag ataaatacac
1081 aacagaaaaa gatgatggtc ttaggaaatc agatgttcca ccaaaaggaa gaaatgtcgt
1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctccaatt
1201 cgcaaggaga gcagttaact ctctgcagtc acttcgtgag aagacaattg gcaaatatgc
1261 caaggccgag tttgacaagt gacaaaactt aagataccac tatcactatt atattctatg
1321 ttatatatg
//
LOCUS VSVPNH 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 10/84-GM-P) nucleoprotein gene,
complete cds.
ACCESSION M31852
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 10/84-GM-P, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 441 a 274 c 280 g 334 t
ORIGIN
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcaa taattcagcc
61 caaactaccg gccaatgagg atccggtaga atatccagct gattacttca aaaataatac
121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttaaggg cctatgtcta
181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc
241 actaaaggga gtagaaggga cattagacag agactgggtc agctttggaa gaacaatcgg
301 aaagagggag gagaatgtca aaatcttcga tctagttaaa gttgaggaac tgaagacagc
361 tcttcctgac gggaaatcgg atccagatcg atcagctgaa gatgataaat ggcttcctat
421 ttacatccta ggcctctacc gagtaggtag atccaaagtt acagattaca gaaagaaact
481 cctggacggt ctagaaaatc agtgtaaagt ggcatcaacc agatttgaaa gccttgttga
541 agatggtctt gacttcttcg acatctggga aaatgatccc aatttcacta aaatagttgc
601 tgcagtggat atgttcttcc acatgttcaa gaagcatgaa cgtgctccga tcagatacgg
661 aaccatagtc tcacgattca aagactgtgc agcacttgct acatttggac atctcagcaa
721 ggtcagcgga ctttcaattg aggacctaac aacatgggtt ctgaatcgag aagtcgcaga
781 cgaactatgc caaatgatgt acccaggtca agaaattgat aaggcggatt catacatgcc
841 gtatatgatt gattttgggt tatcccaaaa atctccttat tcatcagtta agaatccagc
901 ctttcacttc tggggtcaac tggctgcact cttgctaaga tcaactcgag caaaaaatgc
961 tcggcaaccc gatgacattg agtacacttc actgacttgt gcaagtttac tactgtcgtt
1021 tgcagttgga tcttcagcag acattgaaca acaattctac atcggagagg acaaatacac
1081 aacagaaaaa gatgacggtc tcaagaaatc tgatgtccca ccgaaaggaa gaaatgttgt
1141 cgactggctt ggttggtatg atgacaatgg aggaaaaccc acaccagata tgctcaactt
1201 cgcacgaaga gcagtcagtt cgttgcagtc acttcgcgag aaaacaattg gcaagtacgc
1261 caaggcagaa tttgacaaat gaccaagttt gacatatcaa tatcaatatt atattctgtg
1321 ctatatatg
//
LOCUS VSVPNI 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 11/84-HD-B1) nucleoprotein
gene, complete cds.
ACCESSION M31853
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 11/84-HD-B1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 442 a 278 c 283 g 326 t
ORIGIN
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa taattcagcc
61 taaactaccg gccaatgagg atccggtaga atatccagct gattacttca aaaataatac
121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttgaggg cctatgtcta
181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc
241 actaaaggga gtagagggaa cattagacag agactgggtt agctttggaa gaacaatcgg
301 aaaaagggag gagaatgtca aaatcttcga cctggtcaaa gttgaagaac tgaagacagc
361 tcttcctgac ggaaaatcag atccagaccg atcagccgaa gatgataaat ggcttcctat
421 ttacatccta ggcctctatc gagtaggtag atccaaagtg acagattaca gaaagaaact
481 cctggatggt ctagaaaatc agtgcaaagt ggcatcaacc agatttgaaa gcctggttga
541 agatggtctt gacttcttcg acatctggga aaatgatccc aatttcacta aaatagttgc
601 tgcagtggac atgttcttcc acatgttcaa gaagcatgaa cgtgctccga tcagatacgg
661 aaccatagtc tcacgattca aagactgtgc ggcacttgct acatttggac atctcagcaa
721 agtcagcgga ctttcaattg aggacctaac aacatgggtt ctgaatagag aagtcgcaga
781 tgaactatgc caaatgatgt acccaggcca agaaattgat aaggcggatt catacatgcc
841 atatatgatt gactttgggt tatcccaaaa atctccgtat tcatcagtga agaatccagc
901 ctttcacttc tggggtcaac tggctgcact cttgctaaga tcgactcggg caaaaaatgc
961 taggcaacct gatgacattg agtacacttc actgacctgt gcaagtttac tcctgtcatt
1021 tgcagttgga tcttctgcag acattgagca acaattctac atcggggaag acaaatacac
1081 aacagagaag gatgacggtc tcaagaaatc cgaagtccca ccgaaaggaa gaaatgttgt
1141 agactggctt ggttggtacg atgacaacgg aggaaaaccc acacctgata tgctcaactt
1201 cgcacgaaga gcagtcagct ctttgcagtc acttcgtgag aaaacaatcg gcaaatacgc
1261 taaagcagaa tttgataaat gaccaggttt gacatatcaa tatcaatatt atattctgtg
1321 ctatatatg
//
LOCUS VSVPNJ 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 09/82-HD-B) nucleoprotein gene,
complete cds.
ACCESSION M31854
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 09/82-HD-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 437 a 265 c 296 g 331 t
ORIGIN
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
61 gaaattaccg gccaacgagg atccggtcga atacccggct gattacttca aaaataatac
121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata
181 ccaggggatc aagtccggta atccatccat cctccacata aatgcttatc tctatgctgc
241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg
301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc
361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaact
481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctcgtcga
541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg
661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa
721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
841 gtatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc
961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
1021 tgctgttggg tcctcagcag acattgaaca gcagttctat attggagaag acaaatacac
1081 aacagaaaaa gatgatggtc tgaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgaa aagacaattg gcaaatatgc
1261 caaggcagag tttgacaagt gacaaaattt aagataccac tatcactatt atattctatg
1321 ttatatatg
//
LOCUS VSVPNK 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain ../49-UT-B1) nucleoprotein
gene, complete cds.
ACCESSION M31855
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain ../49-UT-B1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 428 a 261 c 302 g 338 t
ORIGIN
1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcca taattcagcc
61 taaactaccg gccaacgagg atccggttga atacccggcc gattacttca aaaataatac
121 caatatagtg ttgtatgtga gcacaaaagt agcactaaat gatttgaggg catacgtata
181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc
241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg
301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gttgaggaac tgaagacagc
361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
421 ctacatccta ggtctttaca gggtgggcag atctaaagtt acggattaca gaaagaaatt
481 attggacggg cttgagattc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
541 agatggtctc gacttctttg atgtatggga gaatgatcca aatttcacca agatagtggc
601 tgcagtggat atgttcttcc acatgttcaa aaagcatgag cgtgctccaa tcagatacgg
661 aaccatagtc tcacgattca aggactgtgc agcacttgcg acatttggac atctcagcaa
721 agtcagtggt ctttcaattg aagaactcac aacatgggtc ctaaataggg aagttgcaga
781 cgagctatgt cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
841 gtatatgatc gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc
961 tagacagcct gacgacatcg agtacacttc actaacttgt gcaagtttgc tgctgtcatt
1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag ataaatacac
1081 aacagaaaaa gatgatggtc ttaggaaatc agatgttcca ccaaaaggaa gaaatgtcgt
1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctccaatt
1201 cgcaaggaga gcagttaact ctctgcagtc acttcgtgag aagacaattg gcaaatatgc
1261 caaggccgag tttgacaagt gacaaaactt aagataccac tatcactatt atattctatg
1321 ttatatatg
//
LOCUS VSVPNL 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 10/82-CR-B) nucleoprotein gene,
complete cds.
ACCESSION M31856
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 10/82-CR-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 431 a 268 c 298 g 332 t
ORIGIN
1 aacagacatc aaaatggctc ctacagtcaa gagaattgtc aatgactcta ttattcatcc
61 taaattgcca gcccatgaag atccggtgga atacccggct gattacttca aaaacaatac
121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gatttgaggg cttatgtcta
181 ccaggggatc aagtctggca atccatctat actccacata aatgcttacc tttatgcagc
241 attgaagggg gtagaaggga ccttagacag agactgggtc agcttcggac gcacaattgg
301 gaaaagagaa gagagtgtga aaatctttga ccttgttaaa gttgaggaat taaagacagc
361 ccttcctgat ggaaaatcag atcccgaccg ttcagctgag gatgacaaat ggcttcccat
421 ctatatcctg ggtctttaca gagtaggcag atcaaaagtt actgattaca gaaagaaatt
481 gctggatggt ttagagaatc agtgcaaagt agcatcgacc agattcgaaa gcctcgttga
541 agatggattg gatttctttg acatatggga gaatgatccg aacttcacca agatagttgc
601 agcggtagat atgttctttc atatgttcaa gaaacatgag cgtgcaccaa tcagatacgg
661 aaccatagtc tcacggttca aagactgtgc agcattagcg acatttgggc atctaagcaa
721 ggtcagtgga ctctctattg aagatctcac aacatgggtc cttaatagag aagttgcaga
781 cgagctatgc cagatgatgt atccggggca agaaattgat aaggcggatt cttacatgcc
841 atatatgatt gactttggat tgtcgcagaa atcaccatac tcatcagtta aaaacccagc
901 tttccacttc tggggtcaac tagcagcact cctattgaga tctactcgcg caaaaaatgc
961 ccgacagcca gatgacattg aatatacatc attaacttgt gcaagtctac tgctgtcatt
1021 tgcagttggg tcctcagcag acattgagca gcagttctat attggagaag ataaatacac
1081 aacagacagg gatgatggtc tgaagaaatc agatgtacca cccaaaggga ggaatgtcgt
1141 agactggctc ggctggtatg acgataatgg aggaaaacct acaacggaca tgctgaactt
1201 cgcaagaaga gcagtcagct ccctacaagc acttcgcgag aaaacaatcg gcaaatatgc
1261 caaggcagag tttgataaat gaccaaactc tgcataccac tatcaatatt atattctatg
1321 ttacatatg
//
LOCUS VSVPNM 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 12/82-HD-B) nucleoprotein gene,
complete cds.
ACCESSION M31857
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 12/82-HD-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 441 a 267 c 291 g 330 t
ORIGIN
1 aacagacatc aaaatggctc ctacagtcaa gagaattgtc aatgactcta ttattcatcc
61 taaactgcca gcccatgaag atccggtgga atacccggct gattacttca aaaacaatac
121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gatttgaggg cttatgtcta
181 ccaagggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
241 attgaaggga gtagaaggga ccttggacag agactgggtc agcttcggac gcacaattgg
301 gaaaagagaa gagagtgtga aaatctttga ccttgttaaa gtggaagaat taaagacagc
361 acttcctgat ggaaaatcag atccagaccg ttcagctgag gatgataaat ggcttccaat
421 ttatatcctg ggtctctaca gagtaggtag atcaaaagtt actgattacc gaaagaaatt
481 gctggacggt ttagagaatc agtgcaaagt agcatcgacc agattcgaaa gcctagttga
541 agacggattg gatttctttg acatatggga gaatgatcca aacttcacca agatagttgc
601 agcagtagat atgttctttc atatgttcaa gaaacatgag cgtgcaccaa tcagatacgg
661 aaccatagtc tcacggttca aagactgtgc agcattggcg acatttgggc atctcagcaa
721 ggtcagtgga ctctctattg aagatctcac aacatgggtt ctaaatagag aagttgcaga
781 cgagctatgc cagatgatgt atccagggca agaaattgat aaggcggatt cttacatgcc
841 atatatgatt gactttggat tgtcgcagaa atcaccatac tcatcagtta aaaacccagc
901 tttccacttc tggggtcaac tcgcagcgct cctattaaga tctactcgcg caaaaaatgc
961 cagacaacca gatgacattg aatatacatc attaacttgt gcaagtctac tgctgtcatt
1021 tgcagttggg tcctcagcag acattgagca gcagttctat attggagaag acaaatacac
1081 aacagataag gatgatggtc taaagaaatc agatgtacca cccaaaggga ggaatgtcgt
1141 agactggctc ggctggtatg acgataatgg gggaaaacct acaccagaca tgctgaactt
1201 cgcaagaaga gcagtcagct ccctacaagc acttcgtgag aaaacaatcg gcaaatatgc
1261 caaggcagag tttgataaat gaccaaactc tacataccac tatcaatatt atattctatg
1321 ttacatatg
//
LOCUS VSVPNN 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 10/85-HD-B1) nucleoprotein
gene, complete cds.
ACCESSION M31858
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 10/85-HD-B1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 437 a 264 c 294 g 334 t
ORIGIN
1 aacagacatc aaaatggctc ctacagttaa gagaatcgtc aatgactcta ttattcatcc
61 taaactgcca gcccatgaag atccggtgga atacccagct gattacttca aaaacaatac
121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gatttgaggg cttacgtcta
181 ccaagggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
241 attgaaggga gtagaaggga ctttggacag agactgggtc agctttggac gcacgattgg
301 gaaaagagaa gagagtgtga aaatatttga ccttgttaaa gtggaagaat taaagacagc
361 acttcctgat ggaaaatcag atccagaccg ttcagctgag gatgataaat ggcttcccat
421 ttatatcctg ggtctctaca gagtaggtag atcaaaagta actgattacc gaaagaaatt
481 gctggatggt ttagagaatc aatgcaaagt agcatcgacc agattcgaaa gcctagttga
541 agatggattg gatttctttg acatatggga gaatgatccg aacttcacca agatagttgc
601 agcggtagat atgttctttc atatgttcaa aaaacatgag cgggcaccaa tcagatacgg
661 aaccatagtc tcacggttca aagactgtgc cgcattggcg acatttgggc atctcagcaa
721 ggtcagtgga ctctctattg aagatctcac aacatgggtc cttaataggg aagttgcaga
781 cgagctatgc cagatgatgt atccagggca agaaattgat aaggcggatt cttacatgcc
841 atatatgatt gactttggat tatcgcagaa atcaccatac tcatcagtta aaaacccagc
901 tttccacttc tggggtcaac ttgcagcgct cctattaaga tctactcgcg caaaaaatgc
961 ccgacagcca gatgacattg aatatacatc attgacttgt gcaagtctac tgctgtcatt
1021 tgcagttggg tcctcagcag acattgagca gcagttctat attggagaag ataaatacac
1081 aacagataag gatgatggtc taaagaaatc agatgtacca cccaaaggga ggaatgtcgt
1141 agactggctc ggctggtatg acgataatgg gggaaaacct acaccagaca tgctgaactt
1201 cgcaagaaga gcagtcagct ccctacaagc acttcgtgag aaaacaattg gtaaatatgc
1261 caaggcagag tttgataaat gaccaaactc tacataccac tatcaatatt atattctatg
1321 ctacatatg
//
LOCUS VSVPNO 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 07/83-NC-P) nucleoprotein gene,
complete cds.
ACCESSION M31859
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 07/83-NC-P, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 430 a 270 c 293 g 336 t
ORIGIN
1 aacagacatc aaaatggctc ctacagtcaa gagaatcgtt aacgactcta ttattcatcc
61 caagctgcca gcccatgaag atccggtaga atacccggct gattacttca aaaacaatac
121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gacttgaggg cttatgtcta
181 ccaggggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
241 attgaaagga gtagaaggga ccttggatcg agactgggtc agcttcggac gcaccattgg
301 aaaaagagaa gagagtgtga aaatctttga tcttgtcaaa gtggaagagt taaaaacagc
361 acttcctgac ggaaaatcag accctgatcg ttcagctgag gatgacaaat ggcttcccat
421 ttatatcctg ggtctctaca gagttggtag atcaaaagtt actgattacc ggaagaaatt
481 gctagatggt ctagagaatc agtgcaaggt agcatcaacc agattcgaaa gtttagtcga
541 agatggattg gatttctttg acatatggga gaacgatccg aacttcacca agatagttgc
601 agcagtagat atgttctttc atatgttcaa gaaacatgag cgtgcaccaa tcagatatgg
661 aaccatagta tcacggttca aagactgtgc cgcattagca acatttgggc atctcagtaa
721 agtcagtgga ctttctattg aggatctcac aacatgggtc ctaaataggg aagttgcaga
781 cgaactatgc cagatgatgt acccagggca agaaattgat aaagcggatt cctacatgcc
841 atatatgatt gactttggat tgtcccagaa atcaccatat tcatcagtga aaaacccagc
901 cttccacttc tggggtcaac ttgcagcact cctattaaga tctactcgcg caaagaatgc
961 tcgacagccg gatgacattg aatatacatc attgacttgt gcaagtctat tactgtcatt
1021 tgcagttggg tcctcagcag acattgaaca gcagttctat attggagaag ataagtacac
1081 aacagataag gatgatggtc tgaagaaatc agatgtacca cccaaaggga ggaatgttgt
1141 agactggctc ggctggtatg atgataatgg gggaaaaccc acaccggaca tgctaaattt
1201 cgcaagaaga gcagtcaact ccctacaagc actccgtgag aaaacgattg gcaagtatgc
1261 caaggccgag ttcgataaat gaccagattc tacctactac tatcaatatt atattctctg
1321 ttatatatg
//
LOCUS VSVPNP 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 01/85-PN-B1) nucleoprotein
gene, complete cds. genes.
ACCESSION M31860
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain 01/85-PN-B1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 437 a 265 c 293 g 334 t
ORIGIN
1 aacagacatc aaaatggctc ctacagtcaa gagaatcgtc aacgactcta ttatccatcc
61 taagctgcca gcccatgaag atccggtaga atacccggct gattacttca agaacaatac
121 gaatatcgtg ttgtatgtaa gcactaaagc agcattaaat gatttaaggg catatgtcta
181 ccaggggatt aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
241 attaaaggga gtagaaggga ccttggacag agattgggtc agtttcggac gcaccattgg
301 gaaaagagaa gagaatgtga aaatatttga tcttgttaaa gtggaagaat taaaaacagc
361 acttcctgat ggaaaatcag accctgatcg ttcggcagag gatgataaat ggctccccat
421 ttatatccta ggtctctaca gagtgggtag atcgaaagtt actgattacc ggaagaaatt
481 gctagatggt ctagagaacc agtgcaaagt agcatcaacc agatttgaaa gtctagtcga
541 agatggattg gatttttttg acatatggga aaatgatccg aatttcacca agatagttgc
601 ggcagtagac atgttctttc atatgttcaa gaaacatgag cgtgcaccaa taagatatgg
661 aaccatagta tcacggttca aggactgtgc agcgttagca acatttggac atctcagtaa
721 agtcagcgga ctttctattg aagatctcac aacatgggtc ctcaataggg aagttgcaga
781 cgaactctgc cagatgatgt acccaggtca agaaattgat aaatcggatt cttacatgcc
841 atatatgatt gactttggat tgtcccagaa atcgccgtac tcatcagtaa aaaacccagc
901 tttccacttc tggggtcaac ttgcagcact cctgttaaga tctactcgtg caaaaaatgc
961 ccgacagcca gatgacattg aatatacatc actgacttgt gcaagcctac tactgtcatt
1021 tgcagtcggg tcttcagcag acatcgaaca gcagttttat attggagaag acaagtacac
1081 aacagaaaag gatgatggtc taaggaactc agatgtacca cccaaaggga ggaatgttgt
1141 agactggctc agctggtatg acgataatgg aggaaaacca acaccggaca tgctaaattt
1201 tgcaaaaaga gcggtcaact ccttgcaggc actccgtgag aaaacgattg gcaagtatgc
1261 caaggcagag ttcgataaat gaccagatta tacataccgc tatcaatatt atattctctg
1321 ttatatatg
//
LOCUS VSVPNQ 1329 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain ../60-PN-B) nucleoprotein gene,
complete cds.
ACCESSION M31861
KEYWORDS nucleoprotein.
SOURCE Vesicular stomatitis virus (strain ../60-PN-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 1329)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 14 1282 nucleoprotein
BASE COUNT 434 a 273 c 290 g 332 t
ORIGIN
1 aacagacatc aaaatggctc ctacagtcaa gagaatcgtt aacgactctg ttattcatcc
61 taagctacca gcccatgagg atcccgtaga atatccagct gattacttca aaaacaatac
121 aaatatcata ttatatgtta gcactaaagt ggcactgaat gatttaaggg cttatgtcta
181 ccaggggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
241 attgaaggga gtagaaggga ctttagacag agactgggtc agcttcggac gaaccattgg
301 gaaaagagaa gagagtgtga aaatctttga tcttgtcaaa gtggaagagt taaaaacagc
361 acttcctgat ggaaaatcag accctgatcg ttcggccgag gatgacaaat ggcttcccat
421 ttatatctta ggtctctaca gagtgggtag atcaaaggtt actgattacc ggaagaaatt
481 gctagacggt ctagagaacc agtgcaaagt agcatcaacc agattcgaaa gtctagtcga
541 agatggattg gatttttttg acatatggga aaatgatccg aacttcacca agatagttgc
601 agcagtagac atgttctttc acatgttcaa gaaacatgag cgagcaccaa tcagatatgg
661 aaccatagta tcacggttca aggactgtgc agcattagca acatttgggc atctcagtaa
721 agtcagtgga ctttctattg aggatcttac aacatgggtc ctcaataggg aagttgcaga
781 tgaactatgc cagatgatgt atccggggca agaaattgat aaagctgatt cttacatgcc
841 atatatgatt gactttggat tgtcccagaa atcaccatat tcatcagtta aaaacccagc
901 cttccacttc tggggtcaac ttgcagcact cctgttaaga tctactcgtg caaaaaatgc
961 ccgacagcca gatgacattg aatatacgtc attgacctgt gcaagtctac tactgtcctt
1021 tgcagttgga tcctcagcag acattgaaca gcagttctat attggagaag ataagtacac
1081 gacagacaag gatgatggtc taaagaaatc agatgcacca cccaaaggga ggaatgttgt
1141 agactggctc ggttggtatg acgataatgg gggaaaaccc acaccggaca tgctaaattt
1201 cgcaagaaga gcggtcaact ccctacaagc actccgtgag aaaacgattg gcaaatatgc
1261 caaggcagag ttccataaat gaccagatta tacccaccgc taccaatatt atatcctatg
1321 ttacatatg
//
LOCUS VSVPPAA 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 11/84-HD-B1) phosphoprotein
gene, complete cds.
ACCESSION M31871
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 11/84-HD-B1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 290 a 163 c 182 g 221 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttac ttagctactt atgataattt
61 ggattctgcc ttgcaggatg ccaatgaatc tgaagaaaga cgagaggata aatatctcca
121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccggatga
241 aagaatagac aagtgggaag aggacttgcc tgaattagaa aatattgatg atgatatacc
301 ggtgaccttt cctaattgga cacagcctgt aatgaaggaa aatggaggag agaaatcatt
361 gtctctgttc cctccagttg gactaacaaa gattcaaaca gagcaatgga gaaaaaccat
421 tgaggcagtt tgtgagagct caaaatattg gaatttatca gaatgtcaaa ttctgaactt
481 ggaagactgc ctcaccctca aaggccgagt gatgactcct gattgtagtt cctcagtaaa
541 atctcaaaat tctattcaga gttcggaatc tctctactcc tctcattccc caggtcccac
601 actcaaggga tcagaatcca tcaatctatg ggatttgaag tccacagaag tacaattgat
661 ctctaagaga gctggagtta aggacatgac agtcaaattg acagacttct tcgaaagtga
721 ggaagagtat tactcagtat gcccagaagg ggcaccagat ttaataggag ctatcatcat
781 gggattgaaa tataagaaac tattcaatca agcaagaatg aagtatcgtc tctaagtctt
841 tctcaggatc agtatg
//
LOCUS VSVPPAB 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 09/82-HD-B) phosphoprotein
gene, complete cds.
ACCESSION M31872
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain , New Jersey serotype), cDNA to
viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 283 a 168 c 185 g 220 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttac ttagctactt atgataattt
61 ggattctgcc ttgcaggatg ccaatgaatc tgaggaaagg cgagaggata aatatctcca
121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccagatga
241 aagaatagac aagtgggaag aagatttgcc tgaattagaa aagattgatg atgatatacc
301 ggtgaccttt cctaattgga cacagcctgt aatgaaggaa aatggaggag agaaatcatt
361 gtctctgttc cctccagttg gattaacaaa aattcaaaca gaacaatgga gaaaaaccat
421 tgaggcggtt tgtgagagtt caaaatattg gaatctatca gaatgccaaa ttctgaactt
481 agaagactgc ctaaccctca aaggccgagt gatgactcct gactgtagtt cttcagtcaa
541 atctcaaaat tctgtccaga gttcggaatc tctctactcc tctcattccc caggccccac
601 actcaaggtg tcagagtcca tcaatctatg ggatttaaag tccacagaag tacaactgat
661 ctctaagaga gccggagtta aggacatgac agtcaaattg acagacttct tcggaagtga
721 ggaagagtat tattcagtat gcccagaagg ggcaccagac ttgatgggag ctatcatcat
781 gggattgaaa tataagaaac tcttcaatca ggcgcgaatg aagtatcgtc tttaattcct
841 tctcatgatc agtatg
//
LOCUS VSVPPAC 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain ../49-UT-B1) phosphoprotein
gene, complete cds.
ACCESSION M31873
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain ../49-UT-B1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 284 a 161 c 185 g 226 t
ORIGIN
1 aacagatatc atggacagtg ttgataggct caagacttac ttagccactt atgataattt
61 ggattctgcc ttgcaggatg ccaatgaatc tgaggaaaga cgagaggata aatatctcca
121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg attataatgc tgaacatctt acgatgctgt caccggatga
241 aagaatagac aagtgggaag aagatttgcc tgaattagaa aagattgatg atgatatacc
301 ggtgaccttt tctgattgga cacagcctgt aatgaaggaa aatgggggag agaaatcatt
361 gtctctgttc cctccagtcg ggttaacaaa gattcaaaca gaacaatgga aaaaaaccat
421 tgaggcggtt tgtgagagtt caaaatattg gaatttatca gaatgccaaa ttcttaactt
481 ggaagacagc ctcactatca aaggccgatt gatgactcct gattgtagtt cttcagtaaa
541 atctcaaaat tctgtccgga ggtcagaacc tctctactcc tctcattctc caggtccccc
601 actcaaggta tcagagtcca tcaatttatg ggatttaaag tccactgaag tacaattgat
661 ctccaagaga gccggagtta aggacatgac agtcaaattg acagacttct ttggaagtga
721 ggaagagtat tattcagtat gcccagaagg ggcgccagac ttgatgggag ctatcatcat
781 gggactgaag tacaagaaac tcttcaatca ggcaagaatg aaatatcgtc tttaattcct
841 tttcatgatc aatatg
//
LOCUS VSVPPAD 855 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 10/82-CR-B) phosphoprotein
gene, complete cds.
ACCESSION M31874
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 10/82-CR-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 855)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 275 a 153 c 199 g 228 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga
241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgatatacc
301 ggtcaccttc cctaattgga ctcaacctgt aatgaaagag aatggtgggg agaagtcatt
361 atctttgttt cctcccgttg gattgaccaa ggttcagacc gaacaatgga ggaagacgat
421 tgaggcagtc tgtgagagct caaaatattg gaatttatca gagtgccgga ttgtaacttc
481 agatgattgt cttattctca gaggacgagt tatgactcct gattgcagtt catcagttaa
541 atctcagaac tccgtacgga gttcagaatc tctttactcc tcccattcac caggacctgc
601 atctagggca tcagagtcaa tgaatttatg ggacttgaag tcaacagaag tccaattgac
661 ctcaaagaga gcaggagtca aagacatgac agtcaagtta acagacttct ttggaagtga
721 ggaagagtat tattcagtat gcccggaagg agcatcagat ttgatgggag caattattat
781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaatatcgac tttaactcct
841 tactacagtc aatat
//
LOCUS VSVPPAE 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 12/82-HD-B) phosphoprotein
gene, complete cds.
ACCESSION M31875
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 12/82-HD-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 277 a 153 c 197 g 229 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga
241 gagagtagac aagtgggagg aagacttgcc tgaattagag aagattgatg atgatatacc
301 ggtcactttc cctaattgga ctcaacctgt aatgaaagag aatggtgggg agaagtcatt
361 atctttgttt cctcccgttg gattgaccaa gattcagacc gaacaatgga gaaagacgat
421 tgaggcagtc tgtgagagct caaaatactg gaatttatca gagtgccgga ttgtaacttc
481 agatgattgt cttattctca gaggacgagt tatgactcct gattgcagtt catcagttaa
541 atctcagaat tctgtacaga gttcagaatc tctctactcc tcccattcac caggacctgt
601 tcctagggca tcagagtcaa tgaatttatg ggacttgaag tcaacagaag tccaactgat
661 ctcaaagaga gcaggagtca aagacatgac agtcaagtta acagacttct ttggaagtga
721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caattattat
781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaataccgac tttaactcct
841 tactacagtc aatatg
//
LOCUS VSVPPAF 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 10/85-HD-B1) phosphoprotein
gene, complete cds.
ACCESSION M31876
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 10/85-HD-B1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 279 a 160 c 195 g 222 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccagacga
241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgatatacc
301 ggtcatcttc cctaattgga ctcaacctgt aatgaaagag aatggtgggg agaagtcatt
361 atctttgttt cctcccattg gattgaccaa ggttcagacc gaacaatgga gaaagacgat
421 tgaggcagtc tgtgagagct caaaatactg gaatttatca gagtgccgga ttgtaacttc
481 agatgattgt cttattctca gaggacgagt tatgactcct gattgcagtt catcagttaa
541 atctcagaat tctgtacaaa gttcagaatc tctctactcc tccccttcac caggacctgc
601 acctagggca tcagagtcaa tgaatttatg ggacctgaag tcaacagaag tccaactgac
661 ctcaaagaga gcaggagtca aagacatgac agtcaagtta acagacttct ttggaagtga
721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caatcattat
781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaataccgac tttaactcct
841 tactacagtc aatatg
//
LOCUS VSVPPAG 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 07/83-NC-P) phosphoprotein
gene, complete cds. genes.
ACCESSION M31877
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 07/83-NC-P, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 279 a 148 c 199 g 230 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaggagaga agagaggata aatatctcca
121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga
181 atcgtcagat tcagatactg actataatgc tgagcatctt acgatgttgt cgccagatga
241 aagaatagac aagtgggaag aggacttgcc tgaattagag aagattgatg atgatatacc
301 ggtcaccttc cctaattgga ctcaacctgt aatgaaggag aatggtgggg agaagtcatt
361 atctttgttt cctccagttg gattgactaa ggttcaaaca gaacaatgga ggaagactat
421 tgaggcagtc tgtgagagct caaaatattg gaatttatca gagtgccgga ttgtaacttc
481 agatgattgt ctcattctca gaggacgagt tatgactcct gattgcagct catcagttaa
541 atctcagaat tctgtacaga gttcagaatc tctctactcc tcccattcac caggacctgc
601 acctagagca tcagagtcaa cgaatctatg ggatttgagg tctacagagg tgcaattgac
661 ttcaaagaga gcaggagtta aggatatgac agtaaagtta acagatttct ttgaaagcga
721 agaagagtat tattcagtat gcccagaggg agcatcagac ttaatgggag caattattat
781 gggactgaag cataagaaac tctttaatca agcaagaatg aaatatcgac tttaacttct
841 tactgcagac aatatg
//
LOCUS VSVPPAH 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 01/85-PN-B1) phosphoprotein
gene, complete cds.
ACCESSION M31878
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 01/85-PN-B1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 276 a 156 c 198 g 226 t
ORIGIN
1 aacagatatc atggacagtg tcgatcggct caagacgtat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aagaggaaga
181 atcgtcagat tcagatactg actataatgc tgagcatctt acgatgttgt cgcccgagga
241 aagaatagac aagtgggaag aagacttgcc tgaattggag aagatcgatg acgatatact
301 ggtcacattc cctagttgga ctcaacctgt aataaaagag aatggtgggg agagatcatt
361 atctttattt cctccagtcg gattgactca ggttcagaca gaacagtgga agaagaccat
421 tgagacagtc tgtgagagct caaaatattg gaatttgtca gagtgccaga ttgtaacttc
481 aggaaattgt cttattttga ggggacaagt tatgacctct gattgcagcc catcagctaa
541 atctaagaat tctgtacaga gttccgagtc tccttcccct tccaattcac ccgaacatgc
601 atctagagca tcagcttcac cgaatctatg ggactttaag tttacagagg tgcaattgat
661 ctcaaaaaga gcaggagtta aggacatgac agtaaaatta acagacttct ttgaaagcga
721 agaagagtat tattcagtat gcccggacgg agcatctgat ttgatggggg caattgttat
781 gggcctgaag cataagaaac tctttaatca ggcaaggatg aaatatcgac tttaacttct
841 tactgcagac aatatg
//
LOCUS VSVPPAI 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain ../60-PN-B) phosphoprotein
gene, complete cds.
ACCESSION M31879
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain ../60-PN-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 271 a 163 c 206 g 216 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc cgaagagaga agagaggata aatatctcca
121 ggacctcttc atcgaagatc aaggagataa accaactccg tcgtattatc aggaggaaga
181 atcgtcagat tcagatactg actacaatgc tgagcatctt acgatgttgt cgccagatga
241 aagaatagac aagtgggaag aggacttgcc agaattagag aagatcgatg atgatatacc
301 ggtcaccttc cctaattgga ctcaacctgt aatgaaggag aatggtgggg agaagtcatt
361 atcattgttt cctccagttg gattgactaa ggttcagaca gaacagtgga agaagaccat
421 cgaggcagtc tgtgagagct caaaatattg gaacttatca gagtgccgga ttgtgacttc
481 agatgattgt ctcattctga ggggacgagt tatgactcct gattgcagct catcagttaa
541 atctcagaat tctgtacaga gttccgagtc tctctgctcc tcccattcac caggacctgc
601 ccctagagca tcagattcaa cgaatctatg ggacttgaag tccacagagg tgcaactagc
661 ctctaagaga gcaggagtta aggacatgac agtaaagtta acagacttct ttgaaagcga
721 agaagagtat tattcagtat gcccagaagg agcatccgac ttgatggggg caattattat
781 gggactgaag cataagaaac tctttaacca ggcaagaatg aaatatcgga tttaacttct
841 tactgcggac aatatg
//
LOCUS VSVPPAJ 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain ../76-EC-M) phosphoprotein
gene, complete cds.
ACCESSION M31880
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain ../76-EC-M, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 278 a 156 c 197 g 225 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg ccaatgaatc tgaagagaga agagaggata aatatctcca
121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aagaagaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga
241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgacatacc
301 ggttaccttt cctaattgga ctcaacctgt catgaaagag aatggtgggg agaaatcatt
361 atctctgttt cctcccgttg gattgaccaa gtttcagacc gaacaatgga ggaagacgat
421 tgaagcggtc tgtgaaagct caaaatattg gaatttatca gaatgccgga tggtaacttc
481 agatgattgt ctgattctca gaggacgagt tataactcct gattgcagtt catcagttaa
541 atctcagaat tctgtacgga gttcagaatc tctctattca tcccactcac caggacctgc
601 acctggggca tcagggtcaa tgaatttatg ggacttgaag tcaacagaag tccaactgac
661 ctcaaagaga gcaggagtca aggacatgac agtcaagtta acagacttct ttggaagtga
721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caattattat
781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaatatcgac tttaacccct
841 tactacagtc aatatg
//
LOCUS VSVPPR 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain ../76-EC-M) phosphoprotein
gene, complete cds.
ACCESSION M31862
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain ../76-EC-M, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 278 a 156 c 197 g 225 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg ccaatgaatc tgaagagaga agagaggata aatatctcca
121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aagaagaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga
241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgacatacc
301 ggttaccttt cctaattgga ctcaacctgt catgaaagag aatggtgggg agaaatcatt
361 atctctgttt cctcccgttg gattgaccaa gtttcagacc gaacaatgga ggaagacgat
421 tgaagcggtc tgtgaaagct caaaatattg gaatttatca gaatgccgga tggtaacttc
481 agatgattgt ctgattctca gaggacgagt tataactcct gattgcagtt catcagttaa
541 atctcagaat tctgtacgga gttcagaatc tctctattca tcccactcac caggacctgc
601 acctggggca tcagggtcaa tgaatttatg ggacttgaag tcaacagaag tccaactgac
661 ctcaaagaga gcaggagtca aggacatgac agtcaagtta acagacttct ttggaagtga
721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caattattat
781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaatatcgac tttaacccct
841 tactacagtc aatatg
//
LOCUS VSVPPS 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 06/85-NM-E) phosphoprotein gene,
complete cds.
ACCESSION M31863
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 06/85-NM-E, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and itch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 288 a 161 c 190 g 217 t
ORIGIN
1 aacagagatc atggacaata ttgatcggct caaaacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
121 agacctcttc atcgaagatc agggagataa accaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataattc tgaacatctt acgatgttgt cgccggatga
241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagatcgatg atgatatacc
301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact
361 atctctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat
421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
481 agatgattgt cttatcctca aaggccgagt tatgactcct gactgcagtt catcaatcaa
541 atctcagaac tccacacagg gttctgaatc tctatcctcc tcacattcac ccggtccagc
601 acccaaggtc aaggagtcaa ccagtctatg ggacttaaga tcaacggaag tacaattgac
661 ttcgaagagg gctggagtga aagacatgat ggtgaaattg acagactttt ttgggagtga
721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
781 gggactaaaa cataagaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat
841 tccgatgatc gatatg
//
LOCUS VSVPPT 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 11/83-CA-B) phosphoprotein
gene, complete cds.
ACCESSION M31864
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 11/83-CA-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 292 a 158 c 186 g 220 t
ORIGIN
1 aacagagatc atggacagta ttgatcggct caaaacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
121 agacctcttc atcgaaaatc agggagataa aacgactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataattc tgaacatctt acgatgttgt cgccggatga
241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact
361 atctctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat
421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
481 agatgattgt cttatcctca aaggccgagt tatgactcct gactgcagtt catcaatcaa
541 atctcagaac tccatacaga gttccgaatc tctctcctct tcacattcac ccggtccagc
601 acccaaagtc aaggagtcaa ccagtctatg ggacttaaga tcaacagaag tacaattgat
661 ctcgaagagg gctggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga
721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
781 gggactaaaa cataagaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat
841 ttcgatgatc aatatg
//
LOCUS VSVPPU 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 01/84-SN-P1) phosphoprotein
gene, complete cds.
ACCESSION M31865
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 01/84-SN-P1, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 289 a 160 c 190 g 217 t
ORIGIN
1 aacagagatc atggacaata ttgatcggct caaaacttat ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
121 agacctcttc atcgaagatc agggagataa accaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataagtc tgaacatctt acgatgttgt cgccggatga
241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagatcgatg atgatatacc
301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact
361 atctctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat
421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
481 agatgattgt cttatcctca aaggccgagt tatgactcct gactgcagtt catcaatcaa
541 atctcagaac tccacacagg gttctgaatc tctatcctcc tcacattcac ccggtccagc
601 acccaaggtc aaggagtcaa ctagtctatg ggacttaaga tcaacggaag tacaattgac
661 ttcgaagagg gctggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga
721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
781 gggactaaaa cataagaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat
841 tccgatgatc gatatg
//
LOCUS VSVPPV 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 07/83-GA-P) phosphoprotein
gene, complete cds.
ACCESSION M31866
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 07/83-GA-P, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 296 a 161 c 188 g 211 t
ORIGIN
1 aacagagatc atggacagta ttgatcggct caaaacttac ttggctactt atgataattt
61 ggattctgcc ctgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataatgt tgaacatctt acgatgttat cgccggatga
241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
301 agtcactttt tctgactgga cacaacctgt aatgaaggaa aacggagggg agaaatcact
361 atctctgttt cctccggttg gattaacaaa agttcagaag gaccaatgga ggaagacaat
421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
481 agaagattgt cttatcctca aaggccgaat tatgactccc gactgcagct catcaatcaa
541 atctcagaac tccatacaga gttccgaatc gctctcctcc tcgcattcac ccggtccagc
601 acccaaagtc acggagtcaa ctagtctatg ggatttaaaa tcgacagaag tacaactgat
661 atcgaagagg gccggagtaa aagacatgat ggtgaagttg acagactttt ttggaagtga
721 agaggaatat tattcagtat gcccaaaagg ggcaccagac ttgatgggag caatcatcat
781 gggactaaaa cataaaaagc tttttaatca agcaagaatg aagtatcgtc tctaattaat
841 tccgatgaac gatatg
//
LOCUS VSVPPW 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 11/82-VC-B2) phosphoprotein
gene, complete cds.
ACCESSION M31867
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 11/82-VC-B2, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 291 a 164 c 187 g 214 t
ORIGIN
1 aacagagatc atggacagta ttgatcggct caaaacttac ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccggatga
241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact
361 atctctgttc cctccggttg gattaacaaa agttcagacg gaacaatgga ggaagacaat
421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
481 agatgattgt cttatcctca aaggccgagt gatgactcct gactgcagtt catcaatcaa
541 atctcagaat tccatacaga gttctgaatc tctctcctcc tcgcattcac ccggtccagc
601 acccaaagtc acggagtcaa ctagtctatg ggacttaaaa tcaacagaag tacaactgat
661 ctcgaagagg gccggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga
721 agatgaatat cattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
781 gggactaaaa cataaaaaac tctttaatca agcaagaatg aagtatcgtc tctaattcat
841 tccgatgatc gatatg
//
LOCUS VSVPPX 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 07/84-OA-B) phosphoprotein
gene, complete cds.
ACCESSION M31868
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 07/84-OA-B, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 294 a 158 c 186 g 218 t
ORIGIN
1 aacagagatc atggacagta ttgatcggct caaaacttac ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccggatga
241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
301 agtcactttt tctgattgga cacaacctgt aatgaaggaa aacggagggg agaaatcact
361 atctctgttt cctccggttg gattaacaaa aattcagacg gaccaatgga ggaagacaat
421 tgaggcagtc tgtgagagct caaaatattg gaatttgtca gaatgccaaa ttatgaactc
481 agataattgt cttatcctca aagtccgagt tatgactcct gactgcagtt catcaatcaa
541 atctcagaac tccatacaga gttccgaatc tttctcctcc tcgcattcac cgggtccagc
601 actaaaagtc acggagtcaa ctagtctatg ggacttaaga tcaacagaag tacaactgat
661 ctcgaagagg gccggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga
721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
781 gggactaaaa cataaaaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat
841 tccgatgatc gatatg
//
LOCUS VSVPPY 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain ../52-GA-P) phosphoprotein
gene, complete cds.
ACCESSION M31869
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain ../52-GA-P, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 292 a 164 c 189 g 211 t
ORIGIN
1 aacagagatc atggatagtg ttgatcggct caaaaattac ttggctactt atgataattt
61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
121 agacctcttc atcgaagatc aaggagataa atcaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccggatga
241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
301 agtcactttt tctgactgga cacaacctgt aatgaaggaa aacggagggg agaaatcact
361 ctatctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat
421 tgatgcggtc tgtgagagct ccaaatattg gaatttgtca gaatgccaga ttatgaactc
481 agaagattgt cttatcctca aaggccgaat tatgactcct gactgcagct catcaatcaa
541 atctcagaac tccatacaga gttccgaatc gccctcctcc tcgcattcac ccggtccagc
601 acccaaagtc acggaatcaa ctagtctatg ggacttaaaa tcaacagacg tacaactgat
661 ctcgaagagg gccggagtga aagacatgac ggtgaagttg acagactttt ttggaagtga
721 agaggaatat tattcagtat gcccaaaagg ggcaccagac ttgatgggag caatcatcat
781 gggactaaaa cataaaaagc tttttaatca agcaagaatg aagtatcgtc tctaattaat
841 tccgatgaac gatatg
//
LOCUS VSVPPZ 856 bp ss-RNA VRL 08-AUG-1990
DEFINITION Vesicular stomatitis virus (strain 10/84-GM-P) phosphoprotein
gene, complete cds.
ACCESSION M31870
KEYWORDS phosphoprotein.
SOURCE Vesicular stomatitis virus (strain 10/84-GM-P, New Jersey
serotype), cDNA to viral RNA.
ORGANISM Vesicular stomatitis virus
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Rhabdoviridae; Vesiculovirus.
REFERENCE 1 (bases 1 to 856)
AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular
stomatitis virus, New Jersey
JOURNAL J. Virol. 64, 2498-2504 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
S.T. Nichol, 05-FEB-1990.
FEATURES from to/span description
pept 11 835 phosphoprotein
BASE COUNT 283 a 166 c 183 g 224 t
ORIGIN
1 aacagatatc atggacagtg ttgatcggct caagacttac ttagctactt atgataattt
61 ggattctgcc ctgcaggatg ccaatgaatc tgaggaaaga cgagaggata aatatctcca
121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccggatga
241 aagaatagac aagtgggaag aagacttgcc tgaattagaa aagattgatg atgatatacc
301 ggtaaccttt cctaattgga cacagcctgt aatgaaggaa aatggaggag agaaatcatt
361 gtccctgttc cctccagttg ggttaacaaa gattcaaaca gaacaatgga gaaaaaccat
421 tgaggcggtt tgtgagagtt caaaatattg gaatttatca gaatgccaaa ttcttaactt
481 agaagactgc ctcactctca aaggccgagt tatgactcct gattgtagtt cttcagtcaa
541 atctcaaaat tctgtccgga gttcagaatc tctctactct tctcattccc caggtcccac
601 actcaaggta tcagagtcca tcaatctatg ggatttaaag tccacagaag tacaattgat
661 ctctaagaga gctggagtta aggacatgac agtcaaattg acagacttct tcggaagtga
721 ggaagagtat tattcagtat gcccagaagg ggcaccagac ttgatgggag ccatcatcat
781 gggattgaag tacaagaaac tcttcaatca ggcaagaatg aagtatcgcc tttaattcct
841 tcttatgatc agtatg
//
LOCUS HUMCYPIIF 1825 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human cytochrome P450IIF1 protein (CYP2F) mRNA, complete cds.
ACCESSION J02906
KEYWORDS cytochrome P450.
SOURCE Human lung, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1825)
AUTHORS Nhamburo,P.T., Kimura,S., McBride,O.W., Kozak,C.A.,
Gelboin,H.V. and Gonzalez,F.J.
TITLE The human CYP2F gene subfamily: Identification of a cDNA encoding a
new cytochrome P450, cDNA-directed expression and chromosome
mapping
JOURNAL Biochemistry (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.T.Nhamburo, 18-MAY-1990.
FEATURES from to/span description
pept 56 1531 cytochrome P450IIF1 /nomgen="CYP2F"
/map="19q13.1-q13.2" /hgml_locus_uid="LV0170W"
mRNA < 1 1813 CYP2F mRNA
BASE COUNT 356 a 596 c 470 g 403 t
ORIGIN
1 gcaggctcag cgcatcccag ccagtgtctc ctgcagctca gcagctgcct tcaccatgga
61 cagcataagc acagccatct tactcctgct cctggctctc gtctgtctgc tcctgaccct
121 aagctcaaga gataagggaa agctgcctcc gggacccaga cccctctcaa tcctgggaaa
181 cctgctgctg ctttgctccc aagacatgct gacttctctc actaagctga gcaaggagta
241 tggctccatg tacacagtgc acctgggacc caggcgggtg gtggtcctca gcgggtacca
301 agctgtgaag gaggccctgg tggaccaggg agaggagttt agtggccgcg gtgactaccc
361 tgcctttttc aactttacca agggcaatgg catcgccttc tccagtgggg atcgatggaa
421 ggtcctgaga cagttctcta tccagattct acggaatttc gggatgggga agagaagcat
481 tgaggagcga atcctagagg agggcagctt cctgctggcg gacgtgcgga aaactgaagg
541 cgagcccttt gaccccacgt ttgtgctgag tcgctcagtg tccaacatta tctgttccgt
601 gctcttcggc agccgcttcg actatgatga tgagcgtctg ctcaccatta tccgccttat
661 caatgacaac ttccaaatca tgagcagccc ctggggcgag ttgtacgaca tcctagaccc
721 cagattcccg agcctcctgg actgggtgcc tgggccgcac caacgcatct tccagaactt
781 caagtgcctg agagacctca tcgcccacag cgtccacgac caccaggcct cgtctccccg
841 ggacttcatc cagtgcttcc tcaccaagat ggcagaggag aaggaggacc cactgagcca
901 cttccacatg gataccctgc tgatgaccac acataacctg ctctttggcg gcaccaagac
961 ggtgagcacc acgctgcacc acgccttcct ggcactcatg aagtacccaa aagttcaagc
1021 ccgcgtgcag gaggagatcg acctcgtggt gggacgcgcg cggctgccgg cgctgaagga
1081 ccgcgcggcc atgccttaca cagacgcggt gatccacgag gtgcagcgct ttgcagacat
1141 catccccatg aacttgccgc accgcgtcac tagggacacg gcctttcgcg gcttcctgat
1201 acccaagggc accgatgtca tcaccctcct taacaccgtc cactacgacc ccagccagtt
1261 cctgacgccc caggagttca accccgagca ttttttggat gccaatcagt ccttcaagaa
1321 gagtccagcc ttcatgccct tctcagctgg gcgccgtctg tgcctgggag agctgctggc
1381 gcgcatggag ctctttctgt acctcaccgc catcctgcag agcttttcgc tgcagccgct
1441 gggtgcgccc gaggacatcg acctgacccc actcagctca ggtcttggca atttgccgcg
1501 gcctttccag ctgtgcctgc gcccgcgcta acgccccggc ccttccagat tcgcctgtga
1561 gcgatgaggc ccacccatgt gggttgctac gtccccttct tggtccacag tctgccctca
1621 tccctctggc agtcacgctg tcttccctgc atgctgtgcc tgccgcgtgc ccttccccca
1681 tccctccaat ctgtgccccg tctgcagggc agaggcagat gtggcatgtc tttttgtacc
1741 cacagagctt gttctatggc acgccctttt ctaggctttt tgtatcattt cttagtacat
1801 tgtaatagat tcaaaccagt cttgg
//
LOCUS MUSADA3A 891 bp ds-DNA ROD 08-AUG-1990
DEFINITION Mouse adenosine deaminase (ada) gene, 3' end.
ACCESSION J05523
KEYWORDS adenosine deaminase.
SOURCE Mouse fibroblast cell line B-1/200 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 891)
AUTHORS Maa,M.-C., Chinsky,J.M., Ramamurthy,V., Martin,B.D. and
Kellems,R.E.
TITLE Identification of transcription stop sites at the 5' and 3' ends of
the murine adenosine deaminase gene
JOURNAL J. Biol. Chem. 265, 12513-12519 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by V.Ramamurthy, 18-MAY-1990.
BASE COUNT 241 a 207 c 262 g 181 t
ORIGIN
1 atagctggcc accttgcctt gattctatca aaggcattcc agtcccagca tcaagcacat
61 tctgtcctta tgcctccctt tcagacatca gtcatggtgg ctagcaagca tcatgtactg
121 ggaggaccca aatacacaaa gactctcagt aaggcaaacg gtgttgtaaa ggcctacatg
181 aggggactag tgatggctaa ggatgaaggc aggcagcaaa caaagctgat gagagctgag
241 cagagcacag gagccttgaa gagccccttg gcactgtgcc cggtgtgcca gctgggtcca
301 ctgctcattc ggttggacag tgagcagccg ttgagtaggt acccgtcacg tgccaaacac
361 tgttagtcac agagggcatg agacagcctg tgctggaaca gctcagttca tagggccatg
421 gagatgggga gaaaggggcg cctctgtcag agacaagctg tggtctggga aggccttagc
481 actaaaagca ccacaatgag aagcaaccgc cagaagcagg gcccgcaggc ctttgttcca
541 gctgcaaaga gaaaggaaaa agtggggaat aagagttggg gctgcggagg gggtggggag
601 cattgtgcag gttccgtact tgaacagaaa gcagggacca acacaaggaa ggccaaaggg
661 ctcagcctgg gaaactttgt gaggctatgg aggtaagggg cagggtattc tcccgaggca
721 tagagactgt cataccaatg atggtcattg gctgggcgag cctaagaaga ctccagtttc
781 ccttgacctc agttaaatct attacatgag aaactgggtc cctgtcccca acattgtccc
841 tgttccgtgt tcctaagtca tttctgggca ataaggactg agaactctag a
//
LOCUS YSCG6PD 3782 bp ds-DNA PLN 08-AUG-1990
DEFINITION S.cerevisiae glucose-6-phosphate dehydrogenase (ZWF1) gene and ORF,
complete cds.
ACCESSION M34709
KEYWORDS glucose-6-phosphate dehydrogenase.
SOURCE S.cerevisiae (strain S288C) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3782)
AUTHORS Nogae,I. and Johnston,M.
TITLE Isolation and characterization of the ZWF1 gene of Saccharomyces
cerevisiae, encoding glucose-6-phosphate dehydrogenase
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Johnston, 18-MAY-1990.
FEATURES from to/span description
pept 1585 3102 glucose-6-phosphate dehydrogenase (ZWF1) (EC
1.1.1.49)
pept 246 1037 pot. ORF
mRNA 1472 > 3102 ZWF1 mRNA
BASE COUNT 1066 a 899 c 887 g 930 t
ORIGIN
1 atcgattagc aaaaccgtag agaagatcat tgcgcataag aaacaaaaag aaaataccgg
61 tgcggaccgc aagcctctat tatctgctgt gtgtccggga tttcttatat acacggaaaa
121 gaccaaaccg cagttggtgc ctatgctgct caatgtcaag tcgcctcagc aaattacggg
181 ctcgttgatc agagctacgt tcgagagtct ctatcgcgcg tgagtccttt taccatctgt
241 cgctgatgcc atgttttgac aagaaactag aagcgtcaag gccagaatcg cttgacgatg
301 ggattgactg tgtcattacg ccacgtgaaa tagtgacaat gttacaagag ttgaatttag
361 acttcaagtc gtttcttact gaggacacaa gtttatatgg cgctatcacc gccgggtggg
421 acccgcgagt ccactgggcg tcgaatctgg gaggcacttg tggtggctat gcctaccagt
481 acgtaacggc tgtccaacga ctacacccag gaagtcagat gatagttctg gagggtagaa
541 acagcgacat tgtcgagtac cgattgctgc acgatgatcg cataatagcc gccgccagcg
601 agctttccgg gtttagaaac atccaaaatc tggtgcgtaa actgacctcg ggctccgggt
661 cagagcgcaa gaggaacatc agctctgcgg aagagagacg aacaggtcca aaggcgaact
721 cacgcgagat ggcgccagcc gccacggccg atccctacca ctcggactac atcgaggtaa
781 acgcgtgtcc gggtgcatgc atgaacggtg gtggcctact caatggcgag caaaacagtc
841 tcaagcggaa gcagttggtt cagacactga ataagcgtca cggagaggag ctagcaatgg
901 tagatcctct aaccctggga cccaagctag aagaggcggc accgcccgct ttcgctagag
961 tacgtcttcg cgcccgtcaa gcaggccgtc gaaaaggatc tcgtctctgt tgggagcacc
1021 tggtaagtaa ggtgtagttt tgcacccgtg tacataagcg tgaaatcacc acaaactgtg
1081 tgtatcaagt acatagtgac atttaaataa tagcaagaac aacaataata gtagcgctac
1141 tggaagcacc acgtaatagt ggaaaagaac tggaaaaacc gctataagat gcatactccg
1201 gcggtcttac gcggagatac aagcttccaa cggtgctaaa agcccggttt cggctcggcc
1261 ggaggaggaa gagagacgaa aaaaaaaaaa atgactaaaa aaaaaatgga atattattaa
1321 tgtgggattt ttggctcaag gtgtggtggc cccttttcta agggtggcga attcttcaat
1381 gtacggaaaa ctcgccaagg ctatcccata tataagcaaa ctgtgggttc atctatatac
1441 cgacacataa cacctaaagt ggcttcctcc tgcccctctc tcccttttct ccactcaccc
1501 ctccttctcc cccttccccc tctccaattg gctgtataga cagaaagagt aaatccaata
1561 gaatagaaaa ccacataagg caagatgagt gaaggccccg tcaaattcga aaaaaatacc
1621 gtcatatctg tctttggtgc gtcaggtgat ctggcaaaga agaagacttt tcccgcctta
1681 tttgggcttt tcagagaagg ttaccttgat ccatctacca agatcttcgg ttatgcccgg
1741 tccaaattgt ccatggagga ggacctgaag tcccgtgtcc taccccactt gaaaaaacct
1801 cacggtgaag ccgatgactc taaggtcgaa cagttcttca agatggtcag ctacatttcg
1861 ggaaattacg acacagatga aggcttcgac gaattaagaa cgcagatcga gaaattcgag
1921 aaaagtgcca acgtcgatgt cccacaccgt ctcttctatc tggccttgcc gccaagcgtt
1981 tttttgacgg tggccaagca gatcaagagt cgtgtgtacg cagagaatgg catcacccgt
2041 gtaatcgtag agaaaccttt cggccacgac ctggcctctg ccagggagct gcaaaaaaac
2101 ctgggggccc tctttaaaga agaagagttg tacagaattg accattactt gggtaaagag
2161 ttggtcaaga atcttttagt cttgaggttc ggtaaccagt ttttgaatgc ctcgtggaat
2221 agagacaaca ttcaaagcgt tcagatttcg tttaaagaga ggttcggcac cgaaggccgt
2281 ggcggctatt tcgactctat aggcataatc agagacgtga tgcagaacca tctgttacaa
2341 atcatgactc tcttgactat ggaaagaccg gtgtcttttg acccggaatc tattcgtgac
2401 gaaaaggtta aggttctaaa ggccgtggcc cccatcgaca cggacgacgt cctcttgggc
2461 cagtacggta aatctgagga cgggtctaag cccgcctacg tggatgatga cactgtagac
2521 aaggactcta aatgtgtcac ttttgcagca atgactttca acatcgaaaa cgagcgttgg
2581 gagggcgtcc ccatcatgat gcgtgccggt aaggctttga atgagtccaa ggtggagatc
2641 agactgcagt acaaagcggt cgcatcgggt gtcttcaaag acattccaaa taacgaactg
2701 gtcatcagag tgcagcccga tgccgctgtg tacctaaagt ttaatgctaa gacccctggt
2761 ctgtcaaatg ctacccaagt cacagatctg aatctaactt acgcaagcag gtaccaagac
2821 ttttggattc cagaggctta cgaggtgttg ataagagacg ccctactggg tgaccattcc
2881 aactttgtca gagatgacga attggatatc agttggggca tattcacccc attactcaag
2941 cacatagagc gtccggacgg tccaacaccg gaaatttacc cctacggatc aagaggtcca
3001 aagggattga aggaatatat gcaaaaacac aagtatgtta tgcccgaaaa gcacccttac
3061 gcttggcccg tgactaagcc agaagatacg aaggataatt agaaaaatgc aagcacattc
3121 atttatcggc taagtcactg aaattttttt tttcgagtga ttttatcttg caagctcact
3181 ctctttgttc ttttctcctc tataatggca tctccccccc accaactcct ctctacttgc
3241 gtgtgtattt gtttgtatac atgtgtaaat atatatacat aaaagaatgt cgtctcatgt
3301 attttttaac ttttagccgc ggtcagtgac attttggctt tcccaccatt ccacgtctga
3361 aaaaaaaaaa aaaaaaagag ataaaatatg aattgaatat atatcaaaaa tgtctgcaaa
3421 aatttttacg aatcagtccg attggacttg tacttatcat tgatttgctc gttatgcgtc
3481 ttatccactt gattagctat tccctgtaaa gtcttagaca ctgcctcagt ggccccaatt
3541 aacggccgaa taattgccac aggtgcagct ctagcaacat aaactgctgc ggcgctcggc
3601 cctcccggtt tatcatcctt catctgtccc tttgccctcc ataccgcatc gtacgcaata
3661 tgcatatgct tttctaatga actatacgcc tccttcaaac ccgtgggcag gtcgaggggc
3721 tgatctgcgt acagactcac tattgtgggt tcgttgtgat ctcccggtga agacatatcg
3781 at
//
LOCUS BOVLMCB 326 bp ds-DNA SYN 08-AUG-1990
DEFINITION Synthetic bovine liver microsomal lipase-solubilized cytochrome b-5
gene, complete cds.
ACCESSION J02905
KEYWORDS cytochrome b-5.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 326)
AUTHORS Funk,W.D., Lo,T.P., Mauk,M.R., Brayer,G.D., MacGillivray,R.T.A. and
Mauk,A.G.
TITLE Mutagenic, electrochemical, and crystallographic investigation of
the cytochrome b-5 oxidation-reduction equilibrium: Involvement of
asparagine-57, serine-64, and heme propionate-7
JOURNAL Biochemistry (1990) In press
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
G.Mauk, 16-MAY-1990.
FEATURES from to/span description
pept 27 311 microsomal cytochrome b-5
binding 16 19 ribosome binding site
BASE COUNT 106 a 82 c 77 g 61 t
ORIGIN
1 gtacggtacc gtaaaggaga acaacaatga gcaaagctgt caaatactac actctggaac
61 aaatccaaaa acacaacaac tcgaaatcga cgtggctgat cctcgactac aaagtatacg
121 acctgactaa attcctggaa gagcacccgg gaggcgaaga agtcctgcgc gaacaggccg
181 gcggtgacgc gactgaaaac tttgaggatg ttggccatag taccgacgct cgtgaactgt
241 cgaaaacgtt catcatcggt gagctgcacc cggacgatcg ttctaaaatc accaaaccgt
301 ccgaatctta atgagaattc gtctga
//
LOCUS DROEST6P 5424 bp ds-DNA INV 08-AUG-1990
DEFINITION D.melanogaster carboxylesterase 6 and P (Est-6 and Est-P) genes,
complete cds.
ACCESSION M33780 M33781
KEYWORDS carboxylesterase.
SOURCE D.melanogaster (strain Dm145) adult DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 929 to 4027)
AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and
Richmond,R.C.
TITLE Molecular analysis of duplicated esterase genes in Drosophila
melanogaster
JOURNAL Mol. Biol. Evol. 7, 9-28 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 5424)
AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and
Richmond,R.C.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.C.Richmond, 19-APR-1990.
FEATURES from to/span description
pept 1173 2559 carboxylesterase 6 (Est-6) precursor (EC
3.1.1.1), exon 1
2611 2858 carboxylesterase 6 precursor, exon 2
sigp 1173 1235 carboxylesterase 6 signal peptide
matp 1236 2559 carboxylesterase 6
2611 2855 carboxylesterase 6
pept 3052 4438 carboxylesterase P (Est-P) precursor (EC
3.1.1.1), exon 1
4495 4742 carboxylesterase P precursor, exon 2
sigp 3052 3111 carboxylesterase P signal peptide
matp 3112 4438 carboxylesterase P
4495 4739 carboxylesterase P
pre-msg 1132 2867 Est-6 mRNA and intron (alt.)
pre-msg 1132 2998 Est-6 mRNA and intron (alt.)
IVS 2560 2610 Est-6 intron A
IVS 4439 4494 Est-P intron A
signal 1104 1110 TATA box
BASE COUNT 1536 a 1117 c 1235 g 1536 t
ORIGIN Chromosome 3L 69A1-5.
1 aagcttgcta tatatctatc tgtatatacc tatttgctta gaccgaactt aatgattttc
61 cctttttcca aaacttacac aaatatatac ctaatacctt acactgaaaa aactgttcaa
121 ttcactatgt acttattagt agttcttata aaccttgtta ttactaatta cgtacgaaaa
181 cagaacggga aataaatatc taaaaattta aatagtaaac tgagctgatg tttctacaca
241 aggatgaaca caagtatcaa agtgctcata ggtacagcat ttctagaaca caattacacc
301 agtcttttgc tagcaataaa ttttcactca tctcccactt ttctctagaa ctatcctgaa
361 tttatttatt tgattctatt ttgtcgcaag cacactcaat tcttgatctt tttaataata
421 atcgtagcgg aaagctgtat ctgtacaggc aagaccaccg cccctcaaaa tgccaacgaa
481 ctacagtcgg attcaaaagc gtgcacctgc aagtgcaagt tgtatgatat cttgcaccat
541 tttatttcga tacacgtttt ggtacttcga tttaatatgg ctaattgtta ttcagaaaat
601 aaacacaaaa acttaggaca catgttcaaa aatagaaata tgtacatatt tgttgcaaac
661 cttaaatctt attaattatc tttatctata atcaaaaagc ctactcattt gttaaatggt
721 ttcgttttaa agtgcattat caaattcatc tggataaaca ttcaaacaaa aaattgcatg
781 catgtaaagt aattaatcaa gttgactcga agcgattgtg caatataagt ctcacctgaa
841 gtgggattgg caactgcttg ggatgcatgt cgaaaatcta tattaagccc agtcaaatat
901 tttaagcgta aagtaacaaa cctaatttgt atacggctat cgttttaatt cgcacacgcc
961 tatcaactgg atgatgttca cactagagtt actcccattt gaaagccggg cattggaaaa
1021 ctaatctcat gcgtgcccag atctcaattg agactggttg actggatgtt caggtggccg
1081 ggtggcgata agccgatcga tggaataaaa ggggccgcaa ttgccgcatc tcaagatagt
1141 tgcggtctga attcgccgga gtgaggagca acatgaacta cgtgggactg ggacttatca
1201 ttgtgctgag ctgcctttgg ctcggttcga acgcgagtga tacagatgac cctctgttgg
1261 tgcagctgcc ccagggcaag ctacgtggtc gcgataatgg aagctactac agctacgaat
1321 cgattcccta cgccgaaccg cccactggcg atctacgatt cgaggctcca gagccgtaca
1381 aacaaaagtg gtcggatata ttcgatgcca ccaaaacccc ggtggcgtgc ctgcagtggg
1441 atcagttcac gcctggggcc aacaaattgg taggagagga ggattgccta accgtcagcg
1501 tctacaagcc gaagaatagc aagaggaata gctttccggt ggtggcccac attcacggag
1561 gtgcctttat gttcggtgca gcatggcaaa atggacacga gaacgtgatg cgtgagggca
1621 aattcattct ggtgaagata agctatcgcc tggggccatt gggtttcgtg agcaccggcg
1681 atagggatct tcccggaaac tatggactga aagatcaacg gctggctctc aaatggatta
1741 agcagaatat agccagtttt ggtggagaac cgcagaacgt actgttggtt ggtcactccg
1801 ctggaggagc ttcggtccat ctgcagatgc ttcgtgaaga tttcggccag ctggccaggg
1861 cggcattctc gtttagtgga aatgctctag atccatgggt tatacagaag ggagcaagag
1921 gacgagcctt tgaactggga cgcaacgtgg gatgtgaatc ggctgaagac tcgaccagcc
1981 tgaagaaatg cctaaagtca aagccagcca gtgaattagt caccgccgtc cgtaaattcc
2041 ttatattttc ctatgtgccc tttgctccat ttagtcctgt attggagcca tcggatgctc
2101 cagacgccat tatcacccag gatcccaggg atgtcattaa gagcggaaag ttcggacagg
2161 ttccgtgggc tgtttcctat gtcacagagg atggtggcta caatgccgcc ttgcttttga
2221 aggaacggaa atctggaata gttatcgatg atctaaacga gcgttggctt gagttggcac
2281 catatttact attctaccgg gacacgaaga ccaaaaagga tatggacgac tactcgcgga
2341 aaattaagca ggagtatata ggcaatcaga gatttgacat cgaaagctat tcagaattgc
2401 agcggctatt cacggatatt ctcttcaaga atagcacgca ggagtcattg gatcttcatc
2461 gcaaatatgg aaagagtcct gcctacgctt atgtctatga caatccagcc gaaaaaggaa
2521 tcgcacaggt cctggccaat cgaaccgatt atgattttgg taaggaaatc gtacttttaa
2581 atggacttag ttaaatcatt acttttatag gaactgtaca cggtgacgac tactttttga
2641 tattcgaaaa tttcgtacga gatgtggaaa tgcgtccgga tgagcagata atttcgagaa
2701 attttatcaa tatgctggca gattttgctt cgagtgataa tggctctcta aaatatggtg
2761 aatgcgattt caaagataat gtaggtagtg agaaattcca attattagct atttatattg
2821 atggctgcca gaataggcag catgtggaat ttccgtaagt tacatgaata aaatcaaaaa
2881 tttttcgttc tgtgtaattt ttaattattt aattattctc aactggcttt aaatatcatt
2941 tgtacaaaac gtgtttgtcg ctttatattt tggtttttgt tgtttcttta taaagaataa
3001 taaacgtttg tttacccgca gtttgattag tactttgtgg agttccagat catgagtata
3061 ttcaaacggc tgttgtgcct gactttgctg tggatagcag ctttagaatc tgaagctgat
3121 cccttgattg ttgagataac aaatggaaaa atccgtggca aagataatgg gttgtactac
3181 agctacgaat cgattcccta tgccgagcat ccaactggtg ccctccgttt tgaagcacct
3241 cagccgtata gtcatcattg gactgatgtt ttcaatgcca cgcagtctcc agttgagtgc
3301 atgcagtgga atcagtttat aaacgaaaac aataagctga tgggtgatga ggattgctta
3361 acggtaagca tctataagcc aaagaaaccc aatcggagca gctttcctgt cgtagtactc
3421 ctgcatggag gtgctttcat gttcggtagt ggatccatat atggacacga ctccattatg
3481 cgtgagggaa ctttgcttgt ggtaaaaata agctatcgtc ttggaccatt gggttttgca
3541 agtaccggcg atagacactt gccgggaaac tatggtctaa aggatcaacg tctggcccta
3601 caatggatca agaagaacat tgctcacttt ggtggaatgc cagataatat tgtgctcatt
3661 ggtcactctg caggcggtgc ttcggctcat ttgcagctgt tgcacgagga tttcaaacat
3721 ttggccaaag gagcgatttc ggtgagcggc aatgcattgg atccttgggt catacagcag
3781 ggtggacgac gacgtgcatt tgaactgggt cgtattgtcg gttgtggaca cacaaatgtc
3841 tccgcagaac tcaaggactg cttgaagtct aagccggcta gcgatatagt ctctgctgtc
3901 cgaagcttcc ttgtgttttc ctatgtaccc ttcagtgctt ttggacctgt tgtggagccg
3961 tcagatgcac cagacgcctt tctaaccgag gacccaagag cagtgattaa gagcgggaag
4021 tttgcccaag tcccttgggc tgtgacgtac accactgagg acgggggata caacgctgct
4081 cagctgttgg aaagaaacaa attaactggc gagagttgga ttgacctact caatgatcga
4141 tggtttgatt gggcaccata cttgctcttc tatcgggacg ccaagaaaac catcaaagat
4201 atggatgatc tttcatttga tctcaggcag cagtatctag cagatcggcg attcagtgtg
4261 gaaagttatt ggaacgtgca gcgaatgttt actgatgttc ttttcaagaa tagcgtgcca
4321 agtgcaatag atcttcaccg aaagtatggc aaaagtccgg tttattcttt tgtctacgat
4381 aatcctaccg attccggagt gggtcaattg ctttccaatc gaacagatgt acattttggt
4441 gggtacatac ttgcgtttca aatacttaaa gctctagctt tcttattatt aaaggtactg
4501 tccacggaga tgactttttc ttgattttca atacagctgc ataccgtatc ggcattcgtc
4561 cggatgaaga agttatttca aaaaagttta taggtatgct ggaggatttc gcactcaacg
4621 ataagggaac attaacattt ggagaatgta atttccaaaa taatgtgaac agcaaggaat
4681 atcaagtgct gcgtatttca cgaaacgctt gtaaaaacga ggaatatgct cggtttccct
4741 aaggaataaa tgcatttaaa aaatgtaaac agtataataa tgtaattaaa gaacaaattt
4801 attacacctt tatgccgcct caactgtcag aggtttttca ataagttcgt cttcttcttc
4861 tgggtcgttt tcctcgccct ccacattccc atacatctca atcttgctgt cgaggaagcg
4921 aaaatccttg aggatatgca acttgttgtg aaaccagcaa tggaaatgaa tttaatggcg
4981 atgtttgcca aacagcagag tagcaggcag tgtgctccac gggaatattg gcttgcagta
5041 ctccatttag agtgtaggat tgcacatgat tgtgctcacc accgattaag acacaatcgt
5101 ccacaaagtc acatagatgc acgcgtccgg ggaagctaaa gacacatgtg ctttccatgg
5161 aacgcaggtg gaaaattgaa gctttgggtc caccaccgca gagcagccaa tcctcgttga
5221 cagccactgc tcctatccat ttgccccaat ccggacgtaa cagatttggg ttcttgtaag
5281 gctccagcat tgaagtgtgc tgctgctgct tggtgctcca cactcgcaca gtgccatcct
5341 cggcgccaga gaagatctgt ccgttggcat tgcccaccac actgtgcacg tagtctgtgt
5401 gtccgcggta ctcgcgctga attc
//
LOCUS ECOLASNII 1530 bp ds-DNA BCT 08-AUG-1990
DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds.
ACCESSION M34277
KEYWORDS L-asparaginase II.
SOURCE E.coli (strain K-12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1530)
AUTHORS Jennings,M.P. and Beacham,I.R.
TITLE Analysis of the Escherichia coli gene encoding L-asparaginase II,
ansB, and its regulation by cyclic AMP receptor and FNR proteins
JOURNAL J. Bacteriol. 172, 1491-1498 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 385 1431 L-asparaginase II (ansB) precursor
sigp 385 450 L-asparaginase II signal peptide
matp 451 1428 L-asparaginase II
mRNA 323 > 1431 ansB mRNA
BASE COUNT 409 a 381 c 382 g 358 t
ORIGIN
1 aaatgggcgc gaaagcggtg ctgaaaagcg gcggtaaccc attacagaat gtgctgggaa
61 gcctgggaag cctggggggg ctgcaatcct caatccaaac cgagtggaaa aagcaggaaa
121 aagatttcca gcagtttggc aaagatgttt gtagccgcgt tgtgactctg gaagatagcc
181 gcaaagccct ggtcgggaat ttaaaataat cctctatttt aagacggcat aatacttttt
241 tatgccgttt aattcttcgt tttgttacct gcctctaact ttgtagatct ccaaaatata
301 ttcacgttgt aaattgttta acgtcaaatt tcccatacag agctaaggga taatgcgtag
361 cgttcacgta actggaggaa tgaaatggag tttttcaaaa agacggcact tgccgcactg
421 gttatgggtt ttagtggtgc agcattggca ttacccaata tcaccatttt agcaaccggc
481 gggaccattg ccggtggtgg tgactccgca accaaatcta actacacagt gggtaaagtt
541 ggcgtagaaa atctggttaa tgcggtgccg caactaaaag acattgcgaa cgttaaaggc
601 gagcaggtag tgaatatcgg ctcccaggac atgaacgata atgtctggct gacactggcg
661 aaaaaaatta acaccgactg cgataagacc gacggcttcg tcattaccca cggtaccgac
721 acgatggaag aaactgctta cttcctcgac ctgacggtga aatgcgacaa accggtggtg
781 atggtcggcg caatgcgtcc gtccacgtct atgagcgcag acggtccatt caacctgtat
841 aacgcggtag tgaccgcagc tgataaagcc tccgccaacc gtggcgtgct ggtagtgatg
901 aatgacaccg tgcttgatgg ccgtgacgtc accaaaacca acaccaccga cgtagcgacc
961 ttcaagtctg ttaactacgg tcctctgggt tacattcaca acggtaagat tgactaccag
1021 cgtaccccgg cacgtaagca taccagcgac acgccattcg atgtctctaa gctgaatgaa
1081 ctgccgaaag tcggcattgt ttataactac gctaacgcat ccgatcttcc ggctaaagca
1141 ctggtagatg cgggctatga tggcatcgtt agcgctggtg tgggtaacgg caacctgtat
1201 aaatctgtgt tcgacacgct ggcgaccgcc gcgaaaaccg gtactgcagt cgtgcgttct
1261 tcccgcgtac cgacgggcgc taccactcag gatgccgaag tggatgatgc gaaatacggc
1321 ttcgtcgcct ctggcacgct gaacccgcaa aaagcgcgcg ttctgctgca actggctctg
1381 acgcaaacca aagatccgca gcagatccag cagatcttca atcagtacta atcgcctcgc
1441 cccggtatcg tgccggggct ttttcacttc agactcacgt ccattgccaa ttttaattac
1501 cctaatgata atcaccggaa taaattattt
//
LOCUS HUMPADPRP 2085 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human poly(ADP-ribose) polymerase (PADPRP) gene, exon 1.
ACCESSION M34337
KEYWORDS poly(ADP-ribose) polymerase.
SOURCE Human peripheral blood leukocyte DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2085)
AUTHORS Ogura,T., Nyunoya,H., Takahashi-Masutani,M., Miwa,M.,
Sugimura,T. and Esumi,H.
TITLE Characterization of a putative promoter region of the human poly
(ADP-ribose) polymerase gene: Structural similarity to that of the
DNA polymerase beta gene
JOURNAL Biochem. Biophys. Res. Commun. 167, 701-710 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1943 / 2062 poly(ADP-ribose) polymerase
/hgml_locus_uid="LZ0154E" /nomgen="PPOL"
/map="1q41-q42"
pre-msg 1777 > 2085 PPOL mRNA and introns
IVS 2063 > 2085 PPOL intron A
BASE COUNT 495 a 535 c 550 g 505 t
ORIGIN
1 tttagggatg atatagttgt caacccagag atggcatgat catgcctttt gacttggtca
61 ttctctaagt aaaactttta tttgttccat catattttcc acttattctg tttaccttca
121 aaatatcttt tttttttttt tttgagacag ggtcacactg tcacccaggc tagagtccag
181 tggcactatc atggctcacc acagcctcaa ccttcagggc tcaggtgatc ctcccacttc
241 agcctcccga gtagatggga ctacaggcac ctgccaccac ccccagctaa tttttgtaga
301 gacaaggttt tgccatgttg tccaggctgg tcttgaactc ctgggctcaa gggatccggc
361 cacctcagcc tcccaaagtg ctaggattat aggcatgagc cactgtgccc agcctacctt
421 caacgtatct aactggttac taacttttag gattcggcct atgtctcaca accttcttgc
481 ttactcaaca tccttgtctc ttaagccact agcttcttct ctatggttaa cactttttat
541 gagttttatt catctgctta tttttcttat cctctatacc agaattgaat attttcaaat
601 aaagcacact catgttacaa tctttgaaat ggaaaaaaaa aatgcatagg attagaaaag
661 aaaccaattt taataaacta tattttgaag tatagttcta tattaaacaa caagatctag
721 gccaggtgca gtggctcatg cctgtaatcc cagcaatttg ggaagtcgag gtgggaggat
781 tgcttgaggc caggggttca agaccagcct gggcaacatg gagagattcc ccatctcttt
841 ctttacacac acacacacac acacacacaa aatatctgat agcaacaggt gcagtcatta
901 ccacaatttc gagtagtgat gagcttaata atatttcgag ttatcaccaa caactgtaaa
961 ctaacatgaa aacgtctgtg atgactattg cccacaaagt cacaggtact gctaatactc
1021 ctggtatttg tagtaaattc ataataaagg aaatgctagg tttcagttgg tattttgtcc
1081 cgacggtctg tggacggcag gttagaacgc ccgtccaagc caggagggtg gacctagcac
1141 tgcagggtcc acctcgggcc aatcaactat attcccgagg cgggggcctg cgcttcccgg
1201 acccagctgc cctcagggga gagaggacac acttaagagt ttggggccgg cgtggtagct
1261 catgcccctg atcccagcac ttcgggaggc tgaggcgtga agatcacttg tagcaggagt
1321 ttgagaccag tctagccaac ttggcgagac cctgtcccta aaaaaaattt ttttttaatt
1381 agccagttgt ggtgagcgcc tgtagtccca gctactcggg aggctgaggt gggaggatcg
1441 ctgggctcag gagttccaga ctgcagtgag ccatgatggc ggcactgcac tccagcgcgg
1501 tgagactcag tctcaaaaat aaaaggggga ggggttgggg gtaaaattag ttgtgaaatc
1561 aagtaagact tcctgggaca gaacaatcaa aggggtggcg ccgggtcctc caaagagcta
1621 ctagctcagc ccaagccccg cctcggcccc cagggcagcg gccgcagagc tccacccggc
1681 aggcgcccgg gaaactccgc cccccggccg gcagggcgcg cgcccgccgg ccccgccccg
1741 tggacgcggg ttccgtggcg ttcccgcggc caggcatcag caatctatca gggaacggcg
1801 gtggccggtg cggcgtgttc ggtgcgctct ggccgctcag ccgtggcggc tgggtgagcg
1861 cacgcgaggc ggcgaggcgg caagcgtgtg tttctaggtc gtggcgtcgg gcttccggag
1921 ctttggcggc agctagggga ggatggcgga gtcttcggat aagctctatc gagtcgagta
1981 cgccaagagc gggcgcgcct cttgcaagaa atgcagcgag agcatcccca aggactcgct
2041 ccggatggcc atcatggtgc aggtgcgggc cgctgtgcgg cgggg
//
LOCUS HUMREVERB1 295 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human Rev-ErbA-alpha mRNA, 5' end.
ACCESSION M34339
KEYWORDS .
SEGMENT 1 of 2
SOURCE Human fetal skeletal muscle, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 295)
AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W.
TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription
from the noncoding DNA strand of a thyroid hormone receptor gene
results in a related protein that does not bind thyroid hormone
JOURNAL DNA 9, 77-83 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 204 > 295 Rev-ErbA-alpha protein (put.)
BASE COUNT 58 a 107 c 74 g 56 t
ORIGIN
1 ccgttgcctc aacgtccaac ccttcctgca gggctgcagt ccggccaccc caagcccttg
61 ctgcagggtg cttcggatcc tgatcgtgag tcgcggggtc cactccccgc ccttagccag
121 tgcccagggg gcaacagcgg cgatcgcaac ctctagtttg agtcaaggtc cagtttgaat
181 gaccgctctc agctggtgaa gacatgacca ccctggactc caacaacaac acaggtggcg
241 tcatcaccta cattggctcc agtggctcct ccccaagccg caccagccct gaatc
//
LOCUS HUMREVERB2 285 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human Rev-ErbA-alpha mRNA, partial cds.
ACCESSION M34340
KEYWORDS .
SEGMENT 2 of 2
SOURCE Human fetal skeletal muscle, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 285)
AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W.
TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription
from the noncoding DNA strand of a thyroid hormone receptor gene
results in a related protein that does not bind thyroid hormone
JOURNAL DNA 9, 77-83 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 285 Rev-ErbA-alpha protein (put.) (AA at 1)
BASE COUNT 62 a 79 c 99 g 45 t
ORIGIN About 1.7 kb after segment 1.
1 ctcgtaaagg agagagaagt gcagagttcg attctgtaca agggggcagc ggcagaaggc
61 cggccgggcg ggtcactggg cgtccacccg gaaggacagc agcttctcgg aatgcatgtt
121 gttcagggtc cgcaggtccg gcagcttgag cagcagcttg gtgaagcggg aagtctccaa
181 gggccggttc ttcagcacca gagcccgaag agcccgcagc agcgtctcct ggagctgctc
241 caccgaagcg gaattctcca tgcccgagcg gtctgcagag acaag
//
LOCUS HUMTFEB 1526 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human TFEB protein mRNA, partial cds.
ACCESSION M33782
KEYWORDS .
SOURCE Human BJAB B cell line, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1526)
AUTHORS Carr,C.S. and Sharp,P.A.
TITLE A helix-loop-helix protein related to the immunoglobulin E
box-binding proteins
JOURNAL Mol. Cell. Biol. 10, 4384-4388 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.S.Carr, 19-APR-1990.
FEATURES from to/span description
pept < 1 > 1526 TFEB protein (AA at 1)
BASE COUNT 324 a 495 c 493 g 214 t
ORIGIN
1 ggtggagtga cagccggagc ccgggcgccg gcgcggggct cggtgacagc ggaggcggcg
61 gcccgggcgg gacgagggag cggccgcggc gccgacggcg ggcgggaagg gcggacgggg
121 cgggcctccc tggtggcacg gggccggagc aggccgggag caggtggcgg gcacagcttg
181 cgggccaggc acccgaactt gcgacaagtt gccggagccg gggcgcgggc ggcggacaga
241 ttgaccttca gagcgaggga gccagcgccg gcagccacca tggcgtcacg catagggttg
301 cgcatgcagc tcatgcggga gcaggcgcag caggaggagc agcgggagcg catgcagcaa
361 caggctgtca tgcattacat gcagcagcag cagcagcagc aacagcagca gctcggaggg
421 ccgcccaccc cggccatcaa tacccccgtc cacttccagt cgccaccacc tgtgcctggg
481 gaggtgttga aggtgcagtc ctacctggag aatcccacat cctaccatct gcagcagtcg
541 cagcatcaga aggtgcggga gtacctgtcc gagacctatg ggaacaagtt tgctgctgcc
601 cacatcagcc cagcccaggc tctccgaaac ccccaccagc cgcctcccca ggggtgcgag
661 ctggacacgt gctgtcctcc tccgctggca acagtgctcc caataagccc catggccatg
721 ctgcacattg gctccaaccc tgagagggag ttggatgatg tcattgacaa cattatgcgt
781 ctgacgatgt ctcttggcta catcaatcct gaaatgcaga tgcccaacac gctacccctg
841 tccagcagcc acctgaatgt gtacagcagc gacccccagg tcacagcctc cctggtgggc
901 gtcaccagca gctcctgccc tgcggacctg acccagaagc gagagctcac agatgctgag
961 agcagggccc tggccaagga gcggcagaag aaagacaatc acaacttaat tgaaaggaga
1021 cgaaggttca acatcaatga ccgcatcaag gagttgggaa tgctgatccc caaggccaat
1081 gacctggacg tgcgctggaa caagggcacc atcctcaagg cctctgtgga ttacatccgg
1141 aggatgcaga aggacctgca aaagtccagg gagctggaga accactctcg ccgcctggag
1201 atgaccaaca agcagctctg gctccgtatc caggagctgg agatgcaggc tcgagtgcac
1261 ggcctcccta ccacctcccc gtccggcatg aacatggctg agctggccca gcaggtggtg
1321 aagcaggagc tgcctagcga agagggccca ggggaggccc tgatgctggg ggctgaggtc
1381 cctgaccctg agccactgcc agctctgccc ccgcaagccc cgctgcccct gcccacccag
1441 ccaccgtccc cattccatca cctggacttc agccacagcc tgagctttgg gggcagggag
1501 gacgagggtc ccccgggcta ccccga
//
LOCUS MUSFCERII 2237 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Mouse low affinity IgE receptor (Fc-epsilon-RII) mRNA, complete
cds.
ACCESSION M34163
KEYWORDS IgE receptor.
SOURCE Mouse (strain Balb/c) splenic B cell, cDNA to mRNA, clone p23.18.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2237)
AUTHORS Gollnick,S.O., Trounstine,M.L., Yamashita,L.C., Kehry,M.R. and
Moore,K.W.
TITLE Isolation, characterization, and expression of cDNA clones encoding
the mouse Fc receptor for IgE (Fc-epsilon-RII)
JOURNAL J. Immunol. 144, 1974-1982 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 175 1170 low affinity IgE receptor (Fc-epsilon-RII)
mRNA < 1 2237 Fc-epsilon-RII mRNA
BASE COUNT 592 a 622 c 533 g 490 t
ORIGIN
1 ggagaagact actgtcttca acacactagc ctgagctacc ttatccaagt gctccacata
61 ttccagaagg agaaggacag acttcaagtt caaatcactt ccagagctgt gagtgacaag
121 tgccttggca ggtagtgcac gcctcatcac tgaaaggatc caaacaagac tgccatggaa
181 gaaaatgaat actcaggata ctgggaacct cctagaaagc gttgctgctg tgcaagacgt
241 gggacacagc tcatgttggt ggggctgctg agcacagcaa tgtgggctgg cctgctggcc
301 ctgcttcttc tgtggcactg ggaaacggag aagaatctaa aacagctggg agacactgca
361 attcagaatg tctctcatgt taccaaggac ttacaaaaat tccagagtaa tcaattggcc
421 cagaagtccc aggttgttca gatgtcacaa aacttgcaag aactccaagc tgaacagaag
481 caaatgaaag ctcaggactc tcggctctcc cagaacctga ccggactcca ggaggatcta
541 aggaacgccc aatcccagaa ctcaaaactc tcccagaacc tgaacagact ccaagacgat
601 ctagtcaaca tcaaatccct gggcttgaat gagaagcgca cagcctccga ttctctagag
661 aaactccagg aagaggtggc aaagctgtgg atagagatac tgatttcaaa gggaactgca
721 tgcaacatat gtcccaagaa ctggctccat ttccaacaga agtgctacta ttttggcaag
781 ggctccaagc agtggatcca ggccaggttc gcctgcagtg acctgcaagg gcgactagtc
841 agcatccaca gccaaaagga acaggacttc ctgatgcaac acatcaacaa gaaggattcc
901 tggattggcc tccaggatct caatatggag ggagagtttg tatggtcgga cgggagccct
961 gtgggttata gcaactggaa tccaggggag cccaataacg ggggccaggg tgaggactgt
1021 gtgatgatgc ggggatccgg ccagtggaac gacgccttct gccgcagcta cttggatgca
1081 tgggtgtgtg agcagctggc aacatgtgag atatctgccc ccttagcctc tgtgactcca
1141 acaaggccca ccccaaaaag tgaaccctga caaacttctg ctcacactct tctggatttc
1201 tcctctacct ttatcgtgga aacagctggg ccctgaggat acccctatca gggcccaggg
1261 ctctctgtga ccgaaggctt tgattatgtt cccacccata ctgaagcagc tggtggatgc
1321 cagctcctgc cagctaccca gaaaccctct ccagctctcc agctaagctg gccatcccat
1381 tccatctgcc ttcctcaaac ctgggcccca gccttgctag ctccctgact acgggcatgc
1441 atgtgggcag ctgagccaac cagggagctg ctgagaacaa agatttcgaa ggcttctttt
1501 gcagtcccca cctcctatca agttccccac tttctccccc tcggcatcag agaacagggg
1561 ttccctttcc ccaggatctg ggatgagtcc tcccatcaag tttgcatcag tggtcccagg
1621 actccgaccc tccttggagg ctaccaggtg tgctcctggt gcgggaggta ttgaaggaac
1681 tctaaacagc tccagcaagg cgagcctggc tctgtctggt aggcctggcc cttctctccc
1741 attccttcta ccttactaaa agctgttaga gaacagtcct aaagctagcc cccaaggtct
1801 attcccttat ttggccactt cctcctcctg aggctgacta caaggtccag ctatccaagt
1861 actgaagtct aacatcaaaa gccccctttg tctcacctaa gtagcaatgc ccaatcaaaa
1921 tacaccatca catcatagcc cagtctaaca gaccgccctt tttctcttca taaaattaca
1981 cctgcaacca ggcgtagtgg tgcaggcctt tagtcccagc acttgggagg cagagacaag
2041 cgaatttctg agttcgaggc cagcctggtc tacaaagtga gttccaggac agccagggct
2101 acacagagaa accctgtctc gaagaaagaa aaaaaaaaaa aattacacct gcgaggtcac
2161 ttgggctgct gtttttctgc ctgagtcaga gggcagccac ttaacttttc ttccctgctt
2221 aataaaggat ctctgtg
//
LOCUS RATATM1 1585 bp ds-DNA ROD 08-AUG-1990
DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 1b.
ACCESSION M34137
KEYWORDS alpha-tropomyosin.
SEGMENT 1 of 2
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1585)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
COMMENT [1] reports isolation of exon 1b which is the first exon coding for
the alternatively expressed low-M-r alpha-tropomyosins.
FEATURES from to/span description
pept 1275 / 1406 low-M-r alpha-tropomyosin
BASE COUNT 360 a 408 c 449 g 368 t
ORIGIN
1 agaggtgttc cagctttgga agataagtga ggcaggctgt tttcttttct acctttgttt
61 ccattctcca cctcccacat ttaagcaagt tcaggactga ctctttgaca gaggtgtagg
121 attaaaaagc tcggagactt gacaggcttt aaaaaaaaca tagcggaaga aggaatgaca
181 gaggcgggtt cttacagact gtgtccatgt ggctttccag ctggtaacca gaggttctgt
241 cacaggtgtt tgtggtcttg cagacccggg ttcctcagat tccagatatc aggctgaatt
301 gcctgaagtt gcacagttcg ttatggggta cagacaaaat cttatggtag cgtttaatta
361 catagtttga gttcttaaat aagtgaggca tatgtaaatg ttttatatca tctgataact
421 tcaacagcct tgtgaagtgt cctgtactat tatactcgtt tttaagagag gaaatccctg
481 agaagtcatt tgtccaaagg cacacagcca gcagagactt aagagccaga ttgaaacctt
541 aaaaagatgt atttagaaca ggagatttaa ccactacgct ttgccagcta gtagtttaca
601 cttggtgaga tagtttgggt gttgggattc ttgtagtgag gatcttgtgt tctgtgttct
661 tcacaaggat ctttaggata ttcctatttc aggctccgag ttccttgttc agactctggg
721 aaatcgggag ttgccaatgc ctttagtatt cggaatgatg gcgttcttta caactcaaag
781 agtttaggga acgttcaagg tctaggttcc tacccttact cagcttcagg ggaatctcga
841 tttcagagcg ctccaaatgg ccagcaggag tcgccatagg ctttagcatc cagttcctct
901 acttggagtc acccaggctc cgcggaatgg atgcaaacgc cctcctgcgg tagcctcgct
961 ctaggcaaag cagcccagag ccagccattc ccgcgcgcac gcgcagtggt tccaagggta
1021 gccagttcct cgggtccacc ggccccgccc gcctgcccgc cgcgcccatc ccctgcgttc
1081 gcgcacccgc ccgcggctct ctagtgcggc cctggggccg tgacttccgg gctgcttctt
1141 agccgcaggg ggcgccggcg ttgcaaagcg ggcctgggcg ggacgaaccg gacctgggct
1201 ccgtaactgc aggagcccag cagtagcagc tgcagagcgc ggaggcagag cacagcctca
1261 gccgccgcct cgccatggcg ggtagcagct cgctggaggc ggtgcggaga aagatccgga
1321 gcctgcagga acaggcagat gccgcggagg agcgtgcggg cagcctgcag cgagagctgg
1381 accaggagcg gaagctgcgg gaaaccgtaa gggatccacc catcacccga ggcccccacg
1441 ccgcacctgg cgcacctagg ccaactggca cccggctact cagcctgccg gtctatctcc
1501 tcggtctaaa gtaaaacctc caagggggag caagagtgct tgctgttgag aaggttctgg
1561 gggcgagcat gaaatgggtc ccggg
//
LOCUS RATATM2 462 bp ds-DNA ROD 08-AUG-1990
DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 9c.
ACCESSION M34138
KEYWORDS alpha-tropomyosin.
SEGMENT 2 of 2
SOURCE Rat DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 462)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
COMMENT [1] reports isolation of exon 9c which is the final exon coding for
the alternatively expressed brain alpha-tropomyosins TMBr-1 and
TMBr-3.
FEATURES from to/span description
pept / 375 448 alpha-tropomyosin
BASE COUNT 97 a 141 c 100 g 124 t
ORIGIN Unknown number of bp after segment 1.
1 ctcgcctaga atctctgctg agctcccaca aagcatgagt gtctggggct gccctccagc
61 ctggccttag tccccaaatc tgttgtccta aactacacat aagtgagtca ctgggtggct
121 ttaatgaggt gcagaaggag cattggttct gcacctcttc gcctctgctc tacaccaagg
181 aaagattctg gcccctcctt tattctgcca gacctccatg cgcctcaagg gcacagggca
241 gagggcctgg ggtcagggta ccttgccgct gtcctttgta ttgtgttctc agtctgttaa
301 ccgtttctca ctgactcttg ctaacctctt ccttctgcct cttcttttct gctaaccctt
361 gctgacctgc ccagatcaac tctaccatca actcgagcaa aaccgccgtc taactaatga
421 actaaagctg gccctgaatg aggattaaaa ccctgggcca ag
//
LOCUS RATPLPA1 1658 bp ds-DNA ROD 08-AUG-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exons 1 and 2.
ACCESSION M34108
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 1 of 5
SOURCE Rat liver DNA, clone lambda-rPLPg10-2.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1658)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 1477 + 1577 parathyroid hormone-like peptide (PLP)
precursor, exon 2 (first expressed exon)
sigp 1477 + 1577 parathyroid hormone-like signal peptide
pre-msg 951 > 1658 PLP mRNA and introns
IVS 1259 1452 PLP intron A
IVS 1578 > 1658 PLP intron B
BASE COUNT 325 a 507 c 466 g 360 t
ORIGIN
1 gggaaccagt tacacacaca cacacacaca cacacacaca cacacacaca cacacacact
61 ggtaagaacc agtagcccac aacattctcc ccctctcccc acccacccac ccagcagcgc
121 tgagaagctg aaacgactgg cataaggtta ctcagtgaat ctatagcagg acatagagta
181 ggttatcaga ggaagagtgg ctaagttgga ttctgtgtgc tccaggttct ctgagatcac
241 aaagtgggtc ttcgatcctt gttcacacac cctttctagg atctgctcca gatgtgatgt
301 ccccagctgt tctcaaacag ttaagaccaa aaatcgcctt tgctctcctg ggtgctaccc
361 ttagtcccct ccccagggga aggccatgcc aggcaggaag gctggcttgc aacagtgcga
421 gcttgcaaac gagcttggga ctcttgggat tcgcaggcca cccggagttc ctccttgccg
481 ccggtgggga ccagggctcc tggacctggg ctcctggact caggcggcag caggctggtt
541 caggaggagg agagcgtcag atcggcccac gcggcgactg gggtggggct ccgtggctcc
601 cagctcccct gtgctccccg cccccgccgg cccgctcacc ccagtctgct ccgccggcgc
661 gcgttcctcg ggcgccacct ctttgcgact cgctcacttc tcagcaggtt ggccacagag
721 cgtgtgaacc ttccaaggct gggctcacag ctcctctcca acctgcgccg cccagctggg
781 cgtccgcccc gccgccggaa ccgcctcgcg ccacctggcc ctcgcatcca cgacacgcgc
841 gcctgcaact tgttcaaggg cgttgtggaa tcaactttcc ggaagcaacc agcccaccgg
901 aggaggtaga ccgacagcta tgtatatata cgctctctgc cgcaagtggc tttgcaccca
961 gaggcctggt gcgcaggaca gctgactcct gcgggcacac tcgcgtttga agaggggttc
1021 gccctgcccc gcgacccaga gcgctgccgc caagactaat tagacattgc tatgggagcc
1081 acagcaacgc gtcacgcatc cccgacgcct atgtaaaacg gttcggtctt cgctctttca
1141 gaggaaactc tctgattgct ttttcccttc tcgggtccct tttttttttt tttttttttt
1201 ttttttttgc ctgtgcggtt tgagagaggc acagttggag gcgctgattc ctacaaaagt
1261 aagtgctcag gggctcggaa acactttttt gctttctttt tcttgttttc ttttttcctt
1321 ataacttgtg agatgccctc gacattggag tggcttgttg ggaagccacg gggttccaaa
1381 agagggagag gggcagtgag aggaagagat agatggagag ccaggactga ccctcgggtt
1441 ccactcttgc aggtcccaga gccagcgagc ggcacgatgc tgcggaggct ggttcagcag
1501 tggagcgtcc tggtgttcct gctcagctac tccgtgccct cccgcgggcg ctcggtggag
1561 gggcttggtc gcaggctgta agtaccggat cttcctcccg ggtgagggat ggaggagacc
1621 agcaccagag gtgccgaact gaggcgggca atgcggaa
//
LOCUS RATPLPA2 144 bp ds-DNA ROD 08-AUG-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B
(partial).
ACCESSION M34109
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 2 of 5
SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 144)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
IVS < 1 > 144 parathyroid hormone-like peptide intron B
BASE COUNT 42 a 26 c 21 g 55 t
ORIGIN 3 kb after segment 1.
1 gcagaacact atgttttgac tgaagtccaa gttgcctctg ataaactttt gcccaatttt
61 tcttttaatt tttttacagt aaccaaagac aaggataaag tataaatcca cttatttttc
121 catgctggca ctttggtttg taat
//
LOCUS RATPLPA3 145 bp ds-DNA ROD 08-AUG-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B
(partial).
ACCESSION M34110
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 3 of 5
SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 145)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
IVS < 1 > 145 parathyroid hormone-like peptide intron B
BASE COUNT 22 a 34 c 34 g 55 t
ORIGIN 640 bp after segment 2.
1 agtctccata gggtcaatcc actgtcatgt gcttcttatg ctgtgctgtt tcttgtcccc
61 ttgctgtgac actcacttct gcctttcttg gacttgagaa cacgttgggg acttttgaga
121 atcggagtcc tgttttgtgg acttt
//
LOCUS RATPLPA4 920 bp ds-DNA ROD 08-AUG-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 3.
ACCESSION M34111
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 4 of 5
SOURCE Rat liver DNA, clone lambda-rPLPg9-0.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 920)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 94 + 516 parathyroid hormone-like peptide (PLP)
precursor, exon 3
sigp + 94 100 parathyroid hormone-like signal peptide
matp 101 + 516 parathyroid hormone-like peptide
IVS < 1 93 PLP intron B
IVS 517 > 920 PLP intron C
BASE COUNT 198 a 353 c 166 g 203 t
ORIGIN 1.3 kb after segment 3.
1 tccctactct agaggtcaac tgttcaccca tctcattcct tcaagtccaa cacgggcagg
61 tcctcagctg acaactgtct tctcccctga cagaaaacgc gcggtgtctg agcaccagct
121 actgcatgac aagggcaagt ccatccaaga cttgcgccgc cgtttcttcc tccaccatct
181 gattgcggag atccacacag ctgaaatcag agctacctcg gaggtgtccc ctaactccaa
241 acctgctccc aacaccaaaa accaccctgt gcggtttggg tcagacgacg agggcagata
301 cctaactcag gaaaccaaca aggtggagac gtacaaagag cagccactca agacgcccgg
361 gaagaagaag aaaggcaagc cggggaaacg cagagaacag gagaaaaaaa agcgaaggac
421 tcggtctgcc tggccaggca caactgggag tggcctgctt gaggaccccc agccccacac
481 ctccccgacc tcgacctccc tggagcccag ctcaaggtat tgtctgtccg ctgtgctcag
541 tagggctccc ctccctccac cggcctgggt cccagagatg cagaatcctc tgtatcttga
601 tctgagacta catcagttag gccacaatct cagtccattc ctggttagcc tgctagacaa
661 tttaatttcc ctacttccct cccaccagcc caatcaacca tccttcctcc ctccctccct
721 ccttccctcc ctcactccct ccctccctcc atccctccct ccctaccatc ctccctccct
781 tccttctccc tttcctttcc atcccctccc ctcttctccc tttcctttcc atcccctccc
841 ctcttctcct cccctctctc tgattcctgt cttcctcact cctccccaca catagggtct
901 tcctcaagcc aaggctgact
//
LOCUS RATPLPA5 855 bp ds-DNA ROD 08-AUG-1990
DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 4.
ACCESSION M34112
KEYWORDS parathyroid hormone-like peptide.
SEGMENT 5 of 5
SOURCE Rat liver DNA, clone lambda-rPLPg9-0.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 855)
AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
of the rat gene and comparison with the human homologue
JOURNAL Mol. Endocrinol. 4, 441-446 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept + 21 30 parathyroid hormone-like peptide (PLP)
precursor, exon 4
matp + 21 27 parathyroid hormone-like peptide
pre-msg < 1 449 PLP mRNA and introns
IVS < 1 20 PLP intron C
BASE COUNT 280 a 139 c 157 g 279 t
ORIGIN 3.6 kb after segment 4.
1 ccccttcctt gtcttcacag gacgcattga aactttcacc gaagatcttc caagacacgt
61 tacaggattc tgtaatagta aacatatgga aagtattaga catatttatt gcctgtacat
121 actgtaaatg cattgggatc aaactgtctc cccaggaaac tgcacatggg tcatgttacc
181 ataatttatt ttgtcaactg atgtatttat ttgtaaatgt atcttggtgc tgctgactct
241 gtttttttgt aacataatgc actttaggta tacatatcaa gtatgtcgat gaatttaaca
301 cgaaaaagtg tctctatttt gtggttcatt ttgatgagtt ctgaaatata attatctaga
361 ctgatctccc tctgcgcatg taaaaatggc actattttaa atttgtaaag aatgtctaat
421 aaaatataat ctaattacac cgtgacttac aatgttggtt ttaattcttt aaggtttcta
481 atcagagaaa cacagtacaa ttttttttta aattctactt gaataaaatc ctaagcttga
541 tgtgtttgtg tggggatcta ctccaatgtg tccgcattag cttttagtta aaaaccactt
601 aaaatgtatt ttgccaatat ttcagctcaa gtcaatagaa tggctgaaca gcctgaactt
661 caagttgaaa agaaagacaa gaaagggcaa aaaataagca gagaagtcag gagaaagggt
721 caaagagggt tgtccagagt taaaaccaga gaagtgggtc cagacttgaa gactctgaat
781 aaattttgcc aatgtaaagt ttagtttgca agggcttgct ctcgattcat aacaataata
841 aaagatctgc tcggg
//
LOCUS RATTMBR1A 1050 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Rat brain alpha-tropomyosin (TMBr-1) mRNA, complete cds.
ACCESSION M34135
KEYWORDS alpha-tropomyosin.
SOURCE Rat brain, cDNA to mRNA, clone pOk4.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1050)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 41 886 brain alpha-tropomyosin (TMBr-1)
mRNA < 1 1050 TMBr-1
BASE COUNT 336 a 220 c 306 g 188 t
ORIGIN
1 ctctggctcc tctgcctaag ggcccctcgc caccgccacc atggacgcca tcaagaagaa
61 gatgcagatg ctgaagctcg acaaagagaa cgccttggat cgagcagagc aggcggaggc
121 tgacaagaag gctgcggaag accggagcaa gcagctggaa gatgagctgg tgtcactgca
181 aaagaaactc aaggccactg aagatgaact ggacaaatac tccgaggctc tcaaagatgc
241 ccaggagaaa ctggagctgg cggagaaaaa ggccacagat gctgaagctg acgtagcatc
301 tctgaacaga cgcatccagc tggttgagga ggagttggat cgcgctcagg agcgtctggc
361 cacagctcta cagaagctgg aggaggctga gaaggctgca gatgagagtg agagaggcat
421 gaaagtcatt gaaagccgag cccaaaaaga tgaagaaaag atggagattc aggagatcca
481 gctgaaagag gccaagcaca ttgctgaaga tgctgaccga aagtatgaag aggtggcccg
541 taagctggtc atcatcgaga gcgatctgga gcgtgcggag gagagggctg agctctcgga
601 aggcaaatgt gccgagcttg aagaagagtt gaaaacggtg acgaacaact tgaagtcact
661 ggaggctcag gctgagaagt actctcagaa agaagacaag tatgaagagg agatcaaggt
721 tctctctgac aagctgaagg aggctgagac ccgggctgag tttgcagaga gatcagtaac
781 caaattggag aaaagcattg atgacttaga agatcaactc taccatcaac tcgagcaaaa
841 ccgccgtcta actaatgaac taaagctggc cctgaatgag gattaaaacc ctgggccaag
901 ttctaggaac ggagtccatg ttcagaaaat ctatgactct cggggttaaa aaatgataac
961 ttctagagaa atgggaagga aatgcattgg ctggaaggcc agccttgcct atgttttcct
1021 ctgtatatct taattaaatc ttcaataacc
//
LOCUS RATTMBR2A 1004 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Rat brain alpha-tropomyosin (TMBr-2) mRNA, complete cds.
ACCESSION M34134
KEYWORDS alpha-tropomyosin.
SOURCE Rat brain, cDNA to mRNA, clone pOk15.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1004)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 137 892 brain alpha-tropomyosin (TMBr-2)
mRNA < 1 1004 TMBr-2
BASE COUNT 265 a 242 c 321 g 176 t
ORIGIN
1 cttagccgca gggggcgccg gcgttgcaaa gcgggcctgg gcgggacgaa ccggacctgg
61 ctccgtaact gcaggagccc agcagtagca gctgcagagc gcggaggcag agcacagcct
121 cagccgccgc ctcgccatgg cgggtagcag ctcgctggag gcggtgcgga gaaagatccg
181 gagcctgcag gaacaggcag atgccgcgga ggagcgtgcg ggcagcctgc agcgagagct
241 ggaccaggag cggaagctgc gggaaaccgc tgaagctgac gtagcatctc tgaacagacg
301 catccagctg gttgaggagg agttggatcg cgctcaggag cgtctggcca cagctctaca
361 gaagctggag gaggctgaga aggctgcaga tgagagtgag agaggcatga aagtcattga
421 aagccgagcc caaaaagatg aagaaaagat ggagattcag gagatccagc tgaaagaggc
481 caagcacatt gctgaagatg ctgaccgaaa gtatgaagag gtggcccgta agctggtcat
541 catcgagagc gatctggagc gtgcggagga gagggctgag ctctcggaag gcaaatgtgc
601 cgagcttgaa gaagagttga aaacggtgac gaacaacttg aagtcactgg aggctcaggc
661 tgagaagtac tctcagaaag aagacaagta tgaagaggag atcaaggttc tctctgacaa
721 gctgaaggag gctgagaccc gggctgagtt tgcagagaga tcagtaacca aattggagaa
781 aagcattgat gacttagaag ataagtttct ttgcttctct cctcccaaga ctccttcatc
841 aagccggatg tcccacctct ctgagctctg catctgtctg ctctccagct gacccaggtt
901 tctttctagt gcccacccac cctagggcca ggcacagacc gtgctttcta ttgtacagag
961 gtgatcctcc cagtgtaaaa taaacactgt acgctagtcc tgtt
//
LOCUS RATTMBR3A 2488 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Rat brain alpha-tropomyosin (TMBr-3) mRNA, 3' end.
ACCESSION M34136
KEYWORDS alpha-tropomyosin.
SOURCE Rat brain, cDNA to mRNA, clone pOk10.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2488)
AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
TITLE Three novel brain tropomyosin isoforms are expressed from the rat
alpha-tropomyosin gene through the use of alternative promoters and
alternative RNA processing
JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 604 brain alpha-tropomyosin (TMBr-3)
mRNA < 1 2488 TMBr-3
BASE COUNT 713 a 531 c 579 g 665 t
ORIGIN
1 tgaagctgac gtagcatctc tgaacagacg catccagctg gttgaggagg agttggatcg
61 cgctcaggag cgtctggcca cagctctaca gaagctggag gaggctgaga aggctgcaga
121 tgagagtgag agaggcatga aagtcattga aagccgagcc caaaaagatg aagaaaagat
181 ggagattcag gagatccagc tgaaagaggc caagcacatt gctgaagatg ctgaccgaaa
241 gtatgaagag gtggcccgta agctggtcat catcgagagc gatctggagc gtgcggagga
301 gagggctgag ctctcggaag gcaaatgtgc cgagcttgaa gaagagttga aaacggtgac
361 gaacaacttg aagtcactgg aggctcaggc tgagaagtac tctcagaaag aagacaagta
421 tgaagaggag atcaaggttc tctctgacaa gctgaaggag gctgagaccc gggctgagtt
481 tgcagagaga tcagtaacca aattggagaa aagcattgat gacttagaag atcaactcta
541 ccatcaactc gagcaaaacc gccgtctaac taatgaacta aagctggccc tgaatgagga
601 ttaaaaccct gggccaagtt ctaggaacgg agtccatgtt cagaaaatct atgactctcg
661 gggttaaaaa atgataactt ctagagaaat gggaaggaaa tgcattggct ggaaggccag
721 ccttgcctat gttttcctct gtatatctta attaaatctt caataaccaa gaagttttta
781 tgatacatta tgtatatggg aaaatattta taccaatcaa ttcagctctt cctgcatcag
841 caggccctct ttgaagtgat gataatgagg actcccccac caccaccatc cccccaaaaa
901 aaggaaaaaa ttttctattt cctctaaaca gctgtgagtc ctgctttaaa tatctcttct
961 acatttgtat tcctaacatc ctcctaacat gtatagtgtt tgaaatgtga tccaacgtgt
1021 acctatgctt gggcaaaata gcttttgaaa acaggagctc gtaccacaag tccctggttg
1081 aaaggtatgc tttgttaagt ttaacggtgt tggaatctga ttatcgtgtc atgctgttaa
1141 gcgggatact ataaaaagac taagagaatg tccctaatga agtgtgcatg agatgtgttc
1201 atataaccat ttgcgagccg tagaaataaa tctatttaaa aagtggtcag aaccttcctt
1261 acgtcatctt cgttttctgt gaggtcgtgt tctgacccac ctacccgatc tttaactgcc
1321 tgtaactcag aatcctgtgc tggtattaac gtcttcttcg gtctgtgtaa caatgctgta
1381 cctgtatttg gtttttgacc ctcccttgtt ccccgtagca cgcagtgggt gagccttcac
1441 caaacacacc ctgcacgtta cgtcagggtg ggttgcaggt gtaggtgcac ccggggcctt
1501 gtgtagatgt gagtgacttc caaagcatgt gtagtgccac tcatctgtgt gtgactggtg
1561 tttgcttact aaatgtacaa accactaccg agtgtgaagc tcagccattc atgactcaat
1621 tttctgatcc catcacagag aaagtggcac atgccaaaga agaaaacctt agtatgcacc
1681 agatgctgga ccagacttta ctggagctaa acaacatgtg aaaccctcct tagctgcggc
1741 cacattctct cgtttccccc tgttgtttta ttttgtcttt aaacacatgc ttaccatgaa
1801 accccttcaa tgcttttttt ttttatatat atatatacct ttaccactgt cactgcaaca
1861 tctgcagaga gccagctagg gcagggggtg gggaaagaca cacagaaagc aggtgtactt
1921 gttatttaaa tgtgccattt cccaggttga cattgccaca cttcacagag tttaggcatg
1981 cagtttgctt agccagtgta ggaagcctca cgaaaacaaa aggattaaca gcctaagtgc
2041 caatggtggg gtcatcagga agactgatgt tggagacaat caggtgcaga ttggtgctaa
2101 tttaaacaaa agaacccctg gtgtcttttg ttcagtctaa tacaatttag acttctgtcc
2161 aacactaatt tatttttgtc ttgagtagca ctgcgagagg atatgggttc catgtgcccg
2221 tgttcctcac agccaagggt ttgaaagcca cgctggtctt gtaagattcc cattccttcc
2281 cagctgacat gcttgtggtc ccttccaact cagtggggag agggctcagt gctgttcgcc
2341 cccccccccc ccaattctcc atgtcagggt aagagatcta ggcactacat atgtaattgg
2401 taaagaaaag gcattttctc gagagttata actatgtaaa cattgtataa tgatatgaaa
2461 taaaatgcac attgtaggac attttcta
//
LOCUS RSPNIFHD 1375 bp ds-DNA BCT 08-AUG-1990
DEFINITION R.rubrum dinitrogenase reductase (nifH) gene, complete cds; and
dinitrogenase alpha subunit (nifD') gene, 5' end.
ACCESSION M33774
KEYWORDS dinitrogenase alpha subunit; dinitrogenase reductase.
SOURCE R.rubrum DNA.
ORGANISM Rhodospirillum rubrum
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 1375)
AUTHORS Lehman,L.J., Fitzmaurice,W.P. and Roberts,G.P.
TITLE The cloning and functional characterization of the nifH gene of
Rhodospirillum rubrum
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.J.Lehman, 18-APR-1990.
FEATURES from to/span description
pept 367 1254 dinitrogenase reductase (nifH)
pept 1370 > 1375 dinitrogenase alpha subunit (nifD)
binding 355 358 ribosome binding site
binding 1357 1361 ribosome binding site
signal 145 160 upstream activating sequence
signal 175 190 upstream activating sequence
signal 280 296 ntr promoter
BASE COUNT 287 a 404 c 431 g 253 t
ORIGIN
1 ggggatccca aggagggacc ctccgtcctg accaagaact gtgcgttcgt gttttcgccc
61 attcctttat caccgatcgg gcggcgacgg gaagggcgtt ctgggcagta gcgcaccatc
121 gcgcatagcg tcgcgccgcc ccgctgttgg gtccgcgaca gggttttggg gcgttgtcgg
181 gtttgcgaca cggttgaaaa taaccctaca ttccttgtta gatagattta ttcctttgat
241 taaaaacgat aattttcaaa aaaatcgggc aaatcaggtt tggcacgctg ggtgcattaa
301 cggggtgagc gtggcggtgg agcttccccg ccttaggacc ggatcacgaa gaaaaggatt
361 gattccatga gcgcactgcg tcagatcgcg ttctacggca agggcggtat cggcaagtcg
421 acgacgtccc agaacacgct ggccgctctg gtcgaaatgg gccagcggat cctgattgtc
481 ggctgcgatc ccaaggccga ttccacccgc ctgatcctga acaccaagct gcaggacacc
541 gttcttcatc tggccgccga ggccggctcg gtcgaagacc tcgacgtcgc cgatgtggtg
601 aagatcggct acaagggcat caagtgcacc gaatccggtg gtccggagcc cggcgtcggc
661 tgtgccggcc gtggcgtgat caccgccatc aacttccttg aggaaaacgg cgcctacgac
721 gacctcgatt acgtctctta cgacgtgctg ggcgacgtgg tttgcggtgg cttcgccatg
781 ccgatccgcg agaacaaggc tcaggaaatc tacatcgtca tgtcgggcga gatgatggcg
841 ctgtatgccg ccaacaacat cgccaagggc attttgaaat acgcccacac cggtggggtt
901 cgtctgggcg gtctgatctg caacgagcgt cagactgata aggaagtcga actggccgag
961 gccctggccg gccgtttggg ctgccgcctc atccacttcg tgccgcgcga caacggcgtt
1021 cagcacgccg agctgcgccg ccaaacggtg atccaatacg cccccgacag caagcaggcc
1081 ggtgagtatc gcacgctggc gaccaagatc cacaacaact ccggccaggg cgtggtgccg
1141 accccgatca ccatggaaga cctggaagag atgctgatgg aattcggcat catgaagtcc
1201 gacgaagagg cgctggccga gctcgaagcc aaggaatccg ccgccgccaa ttaaggccgt
1261 cgccgtgcga cgttcccccc ggccggccgg tcaggcgcag gggcgacgga agcgggcgac
1321 ctccaagaag gcctcttcgc gaggcgatag tcgatcagag gagagggata tgagc
//
LOCUS YSCMTFDEL 70 bp ds-DNA ORG 08-AUG-1990
DEFINITION S.cerevisiae mitochondrial novel junction sequence.
ACCESSION M34150
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 70)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
FEATURES from to/span description
site 33 44 novel junction site
BASE COUNT 11 a 25 c 19 g 15 t
ORIGIN
1 cttcggagtt cggccccccc ataagggggg gacctcactc cttccccact gcactggatg
61 cgggacttat
//
LOCUS YSCMTGAP12 457 bp ds-DNA ORG 08-AUG-1990
DEFINITION S.cerevisiae mitochondrial gap 12 sequence.
ACCESSION M34149 X15185
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 457)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
COMMENT The gap 12 sequence is a segment of yeast mitochondrial DNA that
had not been determined previously.
BASE COUNT 161 a 57 c 57 g 182 t
ORIGIN
1 tacaatttat aatttaataa agaaggaaat aaataataat aactcctttt ggggttccgg
61 tggggttcac acctttataa ataataaata aagatgttta ctcctcttcg gggttcggtc
121 ccctttttgg gttccggaac taattaatat tttatataat aataataata tattaatata
181 atttcattat taataaatat ctcctgcggg gttcggtccc ccccgtaagg ggggggtcct
241 cactccttcg gagcgtacta ttattataaa taattatata ttataatata attaaaaagt
301 attataattg aaacgaaaat tgtaatttta aatggaataa taattattat atatttaata
361 tatttaataa agttataata tctctttcta ccggactatt ttattttatt ttattttatt
421 tttataaaga aaaatagtat aatattatct tctcctc
//
LOCUS YSCMTORI1A 70 bp ds-DNA ORG 08-AUG-1990
DEFINITION S.cerevisiae mitochondrial ori1 gene, 5' flank.
ACCESSION M34153
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 70)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
BASE COUNT 19 a 11 c 24 g 16 t
ORIGIN
1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag
61 ggggtgggtg
//
LOCUS YSCMTORI2A 70 bp ds-DNA ORG 08-AUG-1990
DEFINITION S.cerevisiae mitochondrial ori2 gene, 5' flank.
ACCESSION M34152
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 70)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 32 > 70 ori2 mRNA (+/- 4 bp)
BASE COUNT 23 a 0 c 18 g 29 t
ORIGIN
1 tttttttata agataatttt tgtaaatata taagtaataa attaagtttt atagggggag
61 ggggtgggtg
//
LOCUS YSCMTORI7A 70 bp ds-DNA ORG 08-AUG-1990
DEFINITION S.cerevisiae mitochondrial ori7 gene, 5' flank.
ACCESSION M34154
KEYWORDS .
SOURCE S.cerevisiae mitochondrial DNA.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 70)
AUTHORS Skelly,P.J. and Clark-Walker,G.D.
TITLE Conversion at large intergenic regions of mitochondrial DNA in
Saccharomyces cerevisiae
JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990)
STANDARD simple staff_review
BASE COUNT 19 a 11 c 24 g 16 t
ORIGIN
1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag
61 ggggtgggtg
//
LOCUS MXAFRZE 2493 bp ds-DNA BCT 08-AUG-1990
DEFINITION M.xanthus gliding motility regulatory protein (frzE) gene, complete
cds.
ACCESSION M35192
KEYWORDS gliding motility regulatory protein.
SOURCE M.xanthus (strain DZF1) DNA.
ORGANISM Myxococcus xanthus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria;
Myxococcaceae.
REFERENCE 1 (bases 1 to 2493)
AUTHORS McCleary,W.R. and Zusman,D.R.
TITLE FrzE of Myxococcus xanthus is homologous to both CheA and CheY of
Salmonella typhimurium
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.R.McCleary, 05-JUN-1990.
FEATURES from to/span description
pept 64 2397 gliding motility regulatory protein (frzE)
BASE COUNT 375 a 848 c 926 g 344 t
ORIGIN
1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg
61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc
121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag
181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt
241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag
301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc
361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc
421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg
481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc
541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc
601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc
661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc
721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc
781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg
841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg
901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc
961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc
1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag
1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg
1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg
1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc
1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc
1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg
1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc
1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag
1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc
1561 accatcaccc tgcgcctgcc gcagtcgctg gcgttgatga aggtgctgct ggtgcgcctg
1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg
1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg
1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc
1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac
1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc
1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc
1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc
2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag
2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag
2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg
2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg
2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc
2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga
2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg
2461 caaggggctg cgtgcgctcg cggcccgggg cct
//
LOCUS RATPCBB 463 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Rat PCB-binding protein mRNA, complete cds..
ACCESSION J05536
KEYWORDS PCB binding protein; binding protein.
SOURCE Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone
pPCB6.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 463)
AUTHORS Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J.,
Gillner,M., Gustafsson,J.-A. and Lund,J.
TITLE Cloning, structure, and expression of a rat binding protein for
polychlorinated biphenyls: Homology to the hormonally regulated
progesterone-binding protein uteroglobin
JOURNAL J. Biol. Chem. 265, 12690-12693 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
J.Lund, 15-JUN-1990.
FEATURES from to/span description
pept 58 348 PCB binding protein precursor
sigp 58 114 PCB binding protein signal peptide
matp 115 345 PCB binding protein
BASE COUNT 117 a 156 c 84 g 106 t
ORIGIN
1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg
61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg
121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat
181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag
241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg
301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat
361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta
421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc
//
LOCUS YSBCPR 2990 bp ds-DNA PLN 08-AUG-1990
DEFINITION C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds.
ACCESSION M35199
KEYWORDS NADPH-cytochrome P450 reductase.
SOURCE C.tropicalis (ATCC 750) DNA, clone pTS1.
ORGANISM Candida tropicalis
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina.
REFERENCE 1 (bases 1 to 2990)
AUTHORS Sutter,T.R., Sanglard,D. and Loper,J.C.
TITLE Isolation and characterization of the alkane-inducible NADPH-
cytochrome P450 oxidoreductase gene from Candida tropicalis:
Identification of invariant residues within similar amino acid
sequences of divergent flavoproteins
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
T.R.Sutter, 15-JUN-1990.
Author name and address
T.R.Sutter
CIIT, P.O. Box 12137
Research Triangle Park, NC 27709
FEATURES from to/span description
pept 554 2596 NADPH-cytochrome P450 reductase
signal 198 204 TATA box
BASE COUNT 988 a 442 c 558 g 1002 t
ORIGIN
1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa
61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg
121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag
181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag
241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag
301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa
361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata
421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta
481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc
541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt
601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt
661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa
721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa
781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc
841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt
901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac
961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac
1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg
1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt
1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa
1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga
1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt
1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga
1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa
1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat
1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt gatactgtta ttgaattgaa
1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat
1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg
1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga
1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc
1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt
1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt
1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac
1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca
2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag
2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg
2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg
2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt
2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa
2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa
2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc
2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga
2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca
2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta
2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga
2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat
2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc
2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta
2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt
2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta
//
LOCUS CPCURA5A 2029 bp ds-DNA BCT 08-AUG-1990
DEFINITION C.neoformans orotidine monophosphate pyrophosphorylase (URA5) gene,
complete cds.
ACCESSION M34606
KEYWORDS orotidine monophosphate pyrophosphorylase.
SOURCE C.neoformans (strain B-3501) asexual phase cell line B-3501 DNA,
clone pURA5g2.
ORGANISM Cryptococcus neoformans
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina; Filobasidiales.
REFERENCE 1 (bases 1 to 2029)
AUTHORS Edman,J.C. and Kwon-Chung,K.J.
TITLE Isolation of the URA5 gene from Cryptococcus neoformans var
neoformans and its use as a selective marker for transformation
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Edman, 29-MAY-1990.
Author address: J.C.Edman
University of California San Francisco
Hormone Research Institute
San Francisco, CA 94143-0534
email: edman@socrates.ucsf.edu
FEATURES from to/span description
pept 714 814 orotidine monophosphate pyrophosphorylase
(URA5), exon 1 (E.C. 2.4.2.10)
867 1095 orotidine monophosphate pyrophosphorylase,
exon 2
1145 1492 orotidine monophosphate pyrophosphorylase,
exon 3
pre-msg < 714 1527 URA5 mRNA and introns
IVS 815 866 URA5 intron A
IVS 1096 1144 URA5 intron B
BASE COUNT 512 a 515 c 479 g 523 t
ORIGIN
1 gatcttgggg atggtattga agacgatcca gaaaaagtct ctccatccct cccacgttcg
61 ctgttcggtt tctttgcaaa gagcgaagtt gctcgacctt tgacagaccc taaaatctgg
121 tcgtattttg cgttcaaaga tcgcttactg cgaccgcgat tttttagagg tggatttttt
181 gttgcgagac tagcgcatcg gataggcatg ggtgatatac gtttgcgttc cttatgaggg
241 gtatcatcag tggcaaatta ccgactagat gaagaagacg acctacaccc cgctaaccgg
301 tgtccgatcg acatgatcca cggtgtccaa agcgaagaag gacatgatga ttggagttcg
361 cttacctttg gtataaacca gtgttgtttc tgaagaggtg aggaagaaaa tgatctaggg
421 caataacaga aatcgtttga agaagaagta aaagaaaaga cacacgtcag acaagcatca
481 tcatcattca ttttgaatac tcatcacggc accttcttcc ttgcatgcgc attattcatg
541 atgagatgct cgcagccgcc ggatttcggc tgcaatcaag aacgatatcg cgtacggcga
601 tgtccgcctc cactccatct tccaacaacc accacagtgc ccgataacac tacgacgaaa
661 attgttcttt tttggatctt gtcgacaacg gcgaccaaaa caagaacttg atcatgtcct
721 cccaagccct cgactccgcc aaagttgcct tcatcgaggc tgccatcgaa catggcgtgc
781 ttcttttcgg caactttacc ttgaagtccg gccggtgagc catattgcag cgcttcacag
841 tccaatcgaa tctgacatgt gttcagccaa tccccttact tcttcaatgc cggtctcctt
901 tactcttcat cgcttctctc aactaccgct caggcttacg ccaaggtact ttcctcttct
961 aggattcctg actttgacgt cctcttcggc ccagcttaca agggtatctc cttggctgct
1021 gtctccgctg taagccttta tcagcaaacc ggcaaagata tcggctactg ctacaacagg
1081 aaggagaaga aggacgtgag tctgtcctaa ccagtgcgac agcgatgagc tcataagcca
1141 gtagcacggt gagggcggta ctatggtcgg tgcgcctctc aagggacgaa tcgtcatcat
1201 cgacgatgtt ctcacctctg gcaaggccat ccgtgaagct attgacattc tcaaggcctc
1261 ccctgaagcg aagcttgtcg gaattgtcca gcttgtcgac agacaagaga aaggccagag
1321 cggtagcggc aagagtaccg tacaggaggt tgaggaagag ttcggtgtgc ctgtcgagcc
1381 tattattggt ttggacgaca ttgtgaagta cttagaaagc tccggcaagt gggaaaagga
1441 gctgcaagag gtcaggaagt acagggcgga gtacggtgtt cagaggtctt aagggttttc
1501 ttcttaaatg cacgggttta ggtctagcta atcaagttcc gacatattac aagtttgtaa
1561 gcttgtatca aaggaactta agtacaggca ggcgtcctga ggcgacaaag gaagctgtaa
1621 tatgattgtt ggctgtcaat cttcatcgta tctactttgt caatactgac ttcaatgacc
1681 caataataca attttattag tgttgaccca gaatggttag caggaaactc cccttctctt
1741 cctctcaatc ccaatcatac ttcatatctc ctgctccccc catttccgtc ttcctcgatg
1801 actccctggt cccatccctc ccacctcctg gaggcaagct ggagcacctg gacctaatgg
1861 gtcgtcgccc caaattgcct ccacctctta agacaatcat cgtccaatca aactctgaac
1921 tatcttccaa gccaatggcc ggatctggac acaagatgat gtcgaagccg cttgatgtgc
1981 cagggttggt ccgtggcctg gagacgtaag agcgggtagt actgggatc
//
LOCUS ECOCHEA 2190 bp ds-DNA BCT 08-AUG-1990
DEFINITION E.coli cheA protein (complete cds), motB protein (3' end),
and cheW protein (5' end) genes.
ACCESSION M34669
KEYWORDS cheA gene; cheW gene; motB gene.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2190)
AUTHORS Kofoid,E.C. and Parkinson,J.S.
TITLE Tandem translational starts in the cheA locus of E.coli
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.C.Kofoid, 01-JUN-1990.
Author address: E.C.Kofoid
University of Utah
Dept of Biology
Salt Lake City, UT 84112
email: kofoid@bioscience.utah.edu
FEATURES from to/span description
pept < 1 104 MotB protein
pept 109 2073 CheA protein (alt.)
pept 400 2073 CheA protein (alt.)
pept 2094 > 2190 CheW protein
BASE COUNT 575 a 546 c 609 g 460 t
ORIGIN 42 min on K12 map.
1 atgaaaacgc cgaaagccag aatgagccag taagcgccct ggaaaaacct gaggttgcac
61 cacaggtcag tgttcccaca atgccatcag ccgaaccgag gtgacagcgt gagcatggat
121 ataagcgatt tttatcagac attttttgat gaagcggacg aactgttggc tgacatggag
181 cagcatttgc tggttttgca gccggaagcg ccagatgccg aacaattgaa tgccatcttt
241 cgggctgccc actcgatcaa aggaggggca ggaacttttg gcttcagcgt tttgcaggaa
301 accacgcatc tgatggaaaa cctgctcgat gaagccagac gaggtgagat gcaactcaac
361 accgacatta tcaatctgtt tttggaaacg aaggacatca tgcaagaaca gctcgacgct
421 tataaacagt cgcaagagcc ggatgccgcc agcttcgatt atatctgcca ggccttgcgt
481 caactggcat tagaagcgaa aggcgaaacg ccatccgcag tgacccgatt aagtgtggtt
541 gccaaaagtg aaccgcaaga tgagcagagt cgcagtcagt cgccgcgacg aattatcctt
601 tcgccgctga aggccgggga agtcgacctg ctggaagaag aactgggaca tctgacaacg
661 ttaactgacg tggtgaaagg ggcggattcg ctctcggcaa tattaccggg cgacatcgcc
721 gaagatgaca tcacagcggt actctgtttt gtgattgaag ccgatcagat tacctttgaa
781 acagtagaag tctcgccaaa aatatccacc ccaccagtgc ttaaactggc agccgaacaa
841 gcgccaaccg gccgcgtgga gcgggaaaaa acgacgcgca gcaatgaatc caccagcatc
901 cgtgtagcgg tagaaaaggt tgatcaatta attaacctcg tcggcgagct ggttatcacc
961 cagtccatgc ttgcccagcg ttccagcgaa ctggacccgg ttaatcatgg tgatttgata
1021 accagcatgg ggcagttaca acgtaacgcc cgtgatttgc aggaatcagt gatgtcgatt
1081 cgcatgatgc cgatggaata tgtttttagt cgctatcccc ggctggtgcg tgatctggcg
1141 ggaaaactcg gcaagcaggt agaactgacg ctggtgggca gttctactga actcgacaaa
1201 agcctgatag aacgcattat cgacccgctg acccacctgg tacgcaatag cctcgatcac
1261 ggtattgaac tgccagaaaa acggctcgcc gcaggtaaaa acagcgtcgg aaatttaatt
1321 ctgtctgccg aacatcaggg cggcaacatt tgcattgaag tgaccgacga tggggcgggg
1381 ctaaaccgtg agcgaattct ggcaaaagcg gcctcgcaag gtttgactgt cagcgaaaac
1441 atgagcgacg acgaagtcgc gatgctgata tttgcacctg gcttctccac ggcagagcag
1501 gtcaccgacg tctccgggcg cggcgtcggc atggacgtcg ttaaacgtaa tatccagaag
1561 atgggcggtc atgtcgaaat ccagtcgaag cagggtactg gcactacgat ccgcatttta
1621 ctgccgctga cgctggccat cctcgacggc atgtccgtac gcgttgcgga tgaagttttc
1681 attctgccgc tgaatgctgt tatggaatca ctgcaacccc gtgaagccga tctccatcca
1741 ctggccggcg gcgagcgggt gctggaagtg cggggtgaat atctgcccat cgtcgaactg
1801 tggaaagtgt tcaacgtcgc gggcgcgaaa accgaagcca cccagggaat tgtggtgatc
1861 ttacaaagtg gcggtcgccg ctacgccttg ctggtggatc aattaattgg tcaacaccag
1921 gttgtggtta aaaaccttga aagtaactat cgcaaagtcc ccggcatttc tgctgcgacc
1981 attcttggcg acggcagcgt ggcactgatt gttgatgtct ccgccttgca ggcgataaac
2041 cgcgaacaac gtatggcgaa caccgccgcc tgaatgagta aaaaggtaac aatatgaccg
2101 gtatgacgaa tgtaacaaag ctggccagcg agccgtcacg gcaggaattt ctggtattta
2161 cccttggtga tgaagagtac ggtattgatg
//
LOCUS HAMHSP60A 2726 bp ds-DNA ROD 08-AUG-1990
DEFINITION C.griseus chaperonin (HSP60) non-functional pseudogene.
ACCESSION M34562
KEYWORDS chaperonin; pseudogene.
SOURCE C.griseus cell line CHO DNA, clone GC-1.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 2726)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
IVS 1738 1824 HSP60 intron A
BASE COUNT 915 a 513 c 579 g 719 t
ORIGIN
1 catggtgatt tttctttaga tattctgggc ctcctgaaac tgacaagatt ctgtaaagca
61 aaagacataa tcaacaagat aagacagcag tgcaaacaat gggaaaagtt cttcaccaac
121 cccatatcta gccaagggca aatttctaaa acatacaaag aactgatgaa agcagtcaac
181 aaaacaccaa ataatcgccc tcactctccg ccaccagcct gctttgccag atgagaccag
241 tgtcccgggc gctggctcct catgtcactc gggcctatgc caatgatgta aaatgtgggg
301 caaatccttg atccttaaag cttcaagttg tagacctttt agccgatgct gtagctgtta
361 caatggggcc aaagggaaga acagtgatta ttgaacagag ttgaggaagt cccaaagtaa
421 caaaagatgg ggtcattttt acaaaggcaa ttgatctaaa ggatacatac aaataatatc
481 agagctaaac ttgttcaagt tgttgccaat aactcaaatg aagaggctgg gatggcacta
541 ccactgctac tgttctggca agttctattg ccaacgaagg ctttgagaag atcagcaaag
601 ggtctaatcc agtagaaatc tggagaagtg tgatgttggc tgttgatgct gtaattgctg
661 aactaacaaa acaatctaaa catgtgacaa cccatgaaga aatttctcag gttgctacaa
721 tttctgcaaa tggacacata gacattggaa acatcatatc tgatgcaatg aaacatgttg
781 gagggaaaga cgtcatcaca ctgaaggata gaaataccct gaatgatgaa ttagaaatta
841 ttggaggcat gaagtttgaa agaagatgta tttctcagac atcaataagt caaaaatgtg
901 aattccaaga tgcctatgtt ctattgagtg aaaagaaaat ttctattgtc gagcccatta
961 ctgctcatag gaaacaaatg gtgatatttg ctgaagatgt tgatggagaa gctctaagca
1021 cactggtttt gaacaggcta aaagctggtc ttcagatcgt agcagttaaa gctccaggtt
1081 tttggggaca ataggaagaa taagcttaaa gatacggcta ttgctactga tggtgagcaa
1141 tttggagaac aggggctaaa actaaatatt gatgatggtc aagttcatga tttaggaaaa
1201 tttggagagg tcattgtccc caaagatgat gccatgcttt tgaaaggaac tgatgactaa
1261 actcaaattg agaaaccaat tcaagaaatc actaagcagc tagacatcac aactagtgaa
1321 tatgaaaagg aaaagctaaa cgaaacactt gctaaacttt cagatggtgc agctgtgttt
1381 aaggttggag gaacaaggga tgttgaagtg aaggagaaga aagacagagt tacaaatgct
1441 ctcaatgcta caagagcagc tgttgaaaag ggcattgttt taggacttag ctgtgctctg
1501 cctctgtgca tcccagcctt ggattcctta aagccttcaa atgaagatca gaaaataggt
1561 atagaaatta ttaaaagagc actcaaaatt cctgcagtga caattgctga gaatgcaggt
1621 gttgaataat ctttgagagt tgagaaaatt ctgcagggtt cctagggagt tggttattag
1681 gctatgcccg caaattttgt gaacatggta gaaaaaggaa tcattaatcc aacaaaggta
1741 aatgagtcag caattcttaa acattggatg gcttgttcta tggtatgtgt tttcattaat
1801 cattttataa acattttttt gtaggttgta agaactgctt tagtagatgc ctccttgcta
1861 acttcagcag aagctgtagt gacagaaatt ccaaaagaag agaaggacct tggaatgcgc
1921 acacatgcct ttaatcctag cacttgcgag gcagaggcag gcagatatct gtgagttcga
1981 ggccagcctg gactccggag tgagtgccag gatagtctcc aaagctatcc aaagagaaac
2041 cctgtcttga aaaacaaaca aacaaaaaag acatttgtac attcctgaaa aaaacccaaa
2101 taatccaatt aaactagggg tcagcctaat aaagataccc gaataggttt ttacatctta
2161 agagaaaaat ctactaataa atgacttgga gcaggtcctt cacagacaat atttagtgag
2221 agtaggtagc ctcatgtggg ttagtagttc cagttgcttc aaaatgccct caccttttgt
2281 cataggctca tgtggaggat ctactccatt ggatggtgat attaaaagta gcacatgggt
2341 acatgcctta agatattaag atccccaagt aaaaactgca tctcagatgc tctctgtttc
2401 ttgggccatg atatgggcag tgagatttct gtcacacttg ccaggaaaca cttatgatct
2461 gtctgtcaca atctcttata ttgtcaagta cttaaccaaa cacaatcacc catgacctct
2521 catatcctcc tggacttaga caggaggcac aagtctactc aggtataaat atttttactt
2581 ctgatgcccc acagtaacct acaatggctt cagtccagta cacaggtaag taaacccacc
2641 aatataccat tccagctgat tgataacctc ttagatacat cccatatgct gcagttccca
2701 aatggagaca acccaaagtt tcttga
//
LOCUS HAMHSP70A 2243 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Chinese hamster heat shock protein (cognate form of hsp70) mRNA,
complete cds.
ACCESSION M34561
KEYWORDS heat shock protein; stress protein.
SOURCE Chinese hamster cell line CHO, cDNA to mRNA, clone RS 23.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 2243)
AUTHORS Ahmad,S., Ahuja,R., Venner,T.J. and Gupta,R.S.
TITLE Identification of a protein altered in mutants resistant to
microtubule inhibitors with the constitutive form of the major heat
shock protein (hsp70)
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN% GUPTAR@,SSCVAX.McMASTER.CA
FEATURES from to/span description
pept 50 1990 heat shock protein (hsp70)
mRNA < 1 2243 hsp70 mRNA
signal 2220 2225 poly-A signal
BASE COUNT 654 a 486 c 551 g 552 t
ORIGIN
1 gttttgtgtg gtctcgtcgc cagcacagcc gggcctacag caagcaacca tgtctaaggg
61 acctgcagtt ggtattgatc ttggcaccac ctactcctgt gtgggtgtct tccagcatgg
121 aaaggtggag ataattgcca atgaccaggg taaccgaacc acgccaagct atgttgcttt
181 tactgacaca gaacgattaa ttggggatgc cgcgaagaat caggttgcaa tgaaccccac
241 caacacagtt tttgatgcca aacgtctgat cggacgtagg tttgatgatg ctgttgttca
301 gtctgatatg aagcactggc ccttcatggt ggtgaacgat gcaggcaggc ccaaggtcca
361 agtagaatac aaaggagagg caaaaagttt ctacccagag gaggtgtcct ctatggttct
421 gaccaagatg aaggaaattg cagaagccta tctcgggaag actgttacca atgctgtggt
481 cacagtgcca gcctacttca atgattctca gcgacaggct acaaaagatg ctggaactat
541 tgctggcctg aatgtcctgc gaattatcaa tgaaccaact gctgctgcta ttgcttatgg
601 gctagataag aaggttggag ctgaaaggaa tgtgctcatt tttgacttgg gaggtggtac
661 ttttgatgtg tctatcctca ctattgagga tggaattttt gaagtcaaat caacagccgg
721 agacacccac ttgggtggag aagactttga taacagaatg gtcaaccatt tcattgctga
781 gtttaagcga aacgacaaga aggacatcag cgagaacaag agagctgtca ggcgtctgcg
841 tactgcctgt gaacgggcaa agcgcaccct ctcctccagc acccaggcca gtattgagat
901 tgattctctc tatgagggaa ttgatttcta tacctccatt acccgtgctc gattcgaaga
961 attgaatgct gacctgttcc gaggcactct ggaccctgta gagaaggccc ttcgagatgc
1021 taaactagac aagtcacaga tccatgacat tgtcttggtg ggtggttcca ccagaatccc
1081 caagattcag aagcttctgc aagacttctt caatggaaaa gaactgaata agagcattaa
1141 ccccgatgaa gctgttgctt atggtgcagc tgtccaggcg gccattctat ctggagacaa
1201 gtctgagaat gttcaggatt tgttgctctt ggatgtcact cctctttccc ttggcattga
1261 aactgctggt ggagtcatga ctgtcctcat caagcgcaat accaccatcc ctaccaagca
1321 gacgcagact ttcactacct attctgacaa tcagcctggt gtgctcattc aggtctatga
1381 aggcgaaagg gccatgacca aggataacaa cctgcttgga aagtttgaac ttacaggcat
1441 ccctccagca ccccgtggtg ttcctcagat tgaagttact tttgacattg atgccaatgg
1501 catcctcaat gtctctgctg tagataagag cacaggcaag gagaacaaga tcactatcac
1561 caacgacaag ggccgcttga gcaaggagga tattgaacgc atggtccagg aagctgagaa
1621 gtacaaagct gaagatgaga agcagagaga taaggtttcc tccaagaatt cactggagtc
1681 ctatgctttc aacatgaaag caacagttga agacgagaaa cttcaaggca agatcaatga
1741 tgaggacaaa cagaagattc tagacaagtg taatgaaatc atcagttggc tggataagaa
1801 tcagactgca gagaaggaag aatttgaaca ccagcagaaa gaactggaga aagtctgcaa
1861 ccccatcatt accaagctgt accagagtgc tggtggcatg cctggaggaa tgcccggggg
1921 cttccctggt ggtggagctc ctccatctgg tggtgcttct tcagggccta ccattgaaga
1981 ggtggattaa gtcagtccaa gtagaggtgt agcattgttc cacagggaaa catttgaagg
2041 acccaaattt gcaaagtttc atggcagttt ttaaatttaa actgttaaca ataaattact
2101 gggcattctc aatacttgaa catggaacat gtacacagga aataaaacat tgctctttat
2161 aagcactgta tcctaagtgg aaaactcaat gtctaatatg gtggaaaatg gaatgtctaa
2221 ataaaactat ttaaattggc acc
//
LOCUS HAMTCP1A 1845 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Chinese hamster T-complex protein 1 (TCP-1) mRNA, complete cds.
ACCESSION M34665
KEYWORDS T-complex protein 1.
SOURCE Chinese hamster cell line CHO, cDNA to mRNA, clone pRG 501.
ORGANISM Cricetulus griseus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 1845)
AUTHORS Ahmad,S. and Gupta,R.S.
TITLE Cloning of a Chinese hamster ovary cell protein homologous to the
mouse T-complex protein TCP-1: Structural similarity to the
ubiquitous "Chaperonin" family of heat shock proteins
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 01-JUN-1990.
Author address: R.S.Gupta
McMaster University
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR@,SSCVAX.McMASTER.CA
FEATURES from to/span description
pept 89 1759 T-complex protein 1
signal 1818 1823 poly-A signal
BASE COUNT 530 a 368 c 472 g 475 t
ORIGIN
1 gttccccgct gtggtggccg cagtccggcc tctgagcgga gatcccggcc gtccgcggtg
61 ttagcgaggc cagccaggtt ccctaacgat ggagggccct ttgtccgtgt tcggggaccg
121 cagcactgga gaggcgatcc gctcccagaa tgttatggct gcagcttcca ttgccaacat
181 tgtgaaaagt tctctggggc cagttggctt ggataaaatg ttggtggatg acattggtga
241 tgtaaccatt accaacgatg gggccaccat cctgaagtta ctggaggtag aacaccctgc
301 agctaaagtt ctttgtgagc tggctgactt gcaagacaaa gaagtgggag atggaactac
361 ctctgtggta attattgcag cggagcttct gaaaaatgca gatgaactag tcaaacagaa
421 aattcatcca acatcagtta ttagtggcta tcggcttgcc tgcaaggaag cagtgcgcta
481 tatcagtgag aacctaatta ttaacacaga cgaacttgga agagactgtc tgattaatgc
541 cgctaagaca tccatgtctt ccaaaattat tggaataaat ggtgatttct ttgctaatat
601 ggtggtagat gctgtacttg ctgtcaaata cacagatctg agaggccagc ctcgatatcc
661 agtcaattct gttaatattc tgaaagccca tggaagaagt caggtagaaa gcatgctgat
721 caatggctat gcactgaact gtgtggtggg atctcagggc atgcccaaga ggatagttaa
781 tgcaaaaatt gcatgtcttg acttcagcct gcagaaaaca aaaatgaagc ttggtgtgca
841 ggtggttatt acagatcctg agaaattgga ccaaattaga cagagagaat cagacatcac
901 caaggagaga attgaaaaga ttctggcgac tggggccaat gttattctga ccactggtgg
961 gatagatgat atgtgtctga agtactttgt ggaggctggt gccatggctg ttagacgagt
1021 tctaaaaagg gatctcaagc gtatcgctaa agcttctgga gcaagtatcc tgtccacact
1081 ggccaatttg gaaggcgaag aaacttttga agctaccatg ctggggcaag cagaagaagt
1141 ggtccaggag agaatttgtg atgatgagct gatcttaatc aaaaatacca aagctcgtac
1201 atctgcatca atcatcttac gaggagcaaa tgatttcatg tgtgatgaaa tggagcgatc
1261 tttacatgat gctctttgtg tggtgaagag agttttggag tcaaaatctg tagtcccagg
1321 aggaggtgct gtagaagccg ccctgtccat ataccttgaa aactatgcaa ccagtatggg
1381 gtctcgggaa cagcttgcta ttgcagagtt tgcaaggtct cttcttgtta ttcctaatac
1441 actggcagtg aatgctgccc aagactccac tgacctggtt gccaagttac gagcttttca
1501 caatgaggct caagttaacc cagaacgtaa aaacctgaag tggattggtc ttgatttgat
1561 caatggaaaa ccacgagaca acaagcaagc aggggtgttt gaaccaacca tagttaaagt
1621 gaagagcctg aagtttgcaa cagaagctgc aatcactatt cttcgaatcg atgatctgat
1681 aaaattacac ccggaaacta aagacgataa acatggaagt tatgaaaatg ctgttcactc
1741 tggagccctt gatgactgat tggatttccc ttttatttat agcagtgtca ggtgcaatgt
1801 agccttgggt atctcacatt aaagtacagt gagctgtcca cctga
//
LOCUS HUMHP601A 1941 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human chaperonin (HSP60) non-functional pseudogene 1.
ACCESSION M34660
KEYWORDS chaperonin; pseudogene.
SOURCE Human fetal liver DNA, clone PS2-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 111 to 1941)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1941)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
BASE COUNT 624 a 329 c 446 g 541 t 1 others
ORIGIN
1 atatgaaata aagcctttat gtgctctacc taaattaata aatttattta ttttttttng
61 tatatttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct
121 gcctcgccat gcgacatgtc ctgccagcac cccacagaaa tgcttccatt ccattaccca
181 cagtctttca ccagatgaga ccagtgtcca ggctactggc tcctcacctc acttgaagtg
241 atggtaaaga tgtaaaattt ggtgctgatg tccgagcctt aatgcttaaa agtgtagacc
301 tttttgtcga tgtagccatt acaatggggc caaagggaag aacagtaatt attgaacata
361 gctgggcaag tcccaaagta acaaaagatg gtgtgactga tgcaaagtca attgacttaa
421 aagataaata taaaagcatt ggagctaaac ttgtccaaga tgttgccaat aacacagatg
481 aagagactgg ggatggcact atcacttgct gttgtactgg tatgctctat ttccaaataa
541 ggtttccaga aggttagcaa aggtgctaat ccagtggaaa tcaggagagg tgtatgttag
601 ctgatgctgt aattgctgaa cttaaaaagc agtctaaacc tgtgaccaaa cctgaagaaa
661 ttgcacaggt tgctacaatt tctgcaaatt gagacaaaga aattggcaac atcatctctg
721 atgcaatgaa aaaagttgga agaaagggca tcatcacagt aaaggatgga aaaacactga
781 ctgtagaatt agaaattatt gaaggcatga aatttgatcg aggatatatt tctccatact
841 ttattaatac atcaaaaggt gagaaatgtg aattccagga tgcctatgtt ctgttgcatg
901 aaaagcaaaa tttctacgtg ctccacgtcc attgtaactg ctcttgcaaa ttgccaatgc
961 ttactgtaag cctttggtca taattgctgg agatattgat ggagaagctc taactacact
1021 catcctgaat aggctaaagg ttggtcttca ggttgtggca gtcaaggctc cagggtttgg
1081 tgacaataga aagaaccagc ttaaagatac ggttattgct actggtggta cagtgtttgg
1141 agaagagggc tcacactaaa tcttgaagac gttcagcctc gtgacgtagg agagttggag
1201 aggtactgtg atcaagatat gccatgctct taaaagcaaa aggtaacaag tctcaaatta
1261 aaaatgtgtt caagaaatca ttgaccagtc agatgtcaca actatgtaat tgaaaaggaa
1321 aaagtgagtg gagaaacttt cagatggagt agctgtgctg aaggttggtg gcaagtgatg
1381 ttgaagttaa tgaagagaaa gacagagtta taggtgcact taatgctaca agagctgctg
1441 ttgaagaagg cattgtttag ggagggggtt gtgccctgct tcgatgcatt ccagccttgg
1501 actcattcac tccagctaat gaagataaaa taattggtat agaaattatt aaaagaacac
1561 tcaaaattcc agcaatgaca attcgtaaga atgcaggtgt tgatgcattt ttgatagttg
1621 agaaaattat gtaaagttcc tcagaagttg gttatgatac tatgttagga gatgtcgtga
1681 atatggtgga aaaagacatt attgacccaa caaaggttgt gagaactgct tcattggatg
1741 ctgctggcat gccctctata ttaactacag cagctgttgt agtcacagaa attcctaaag
1801 aagggaacag ccctggaatg ggtgcatgtg gaatgggagg tggcctgttc taattcctag
1861 aatagtgctt tacctttatt aatgaattgt gataggaacc caaggcagtg ttcctcacca
1921 ataacttcag agaagtcagt t
//
LOCUS HUMHP602A 1920 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human chaperonin (HSP60) non-functional pseudogene 2.
ACCESSION M34662
KEYWORDS chaperonin; pseudogene.
SOURCE Human fetal liver DNA, clone PS2-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 117 to 1920)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1920)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
BASE COUNT 619 a 324 c 431 g 544 t 2 others
ORIGIN
1 tggattatgn aaatnaaagc ctttatgtgc tctgcctaaa ttaatatttt attttttttt
61 gatacttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct
121 gcctcacctc accatgcaca tgtcctgcca ccaccccaca gaaatgcttc acttacccac
181 agtctttcgc cagatgagac cagtgtccag ggtactggct cctcaccctc acttgaagga
241 tggtaaagat gtaaaatttg gtgctgatgt ctgcgcctta atgcttcaaa gtgtagactt
301 ttgtcagtaa ttattgaaca tagctgggaa gtcccaaagt aacaaaagat ggtctgactg
361 atgcaaagtc aattgactta aaggataaat ataaaagcat tggagctaaa cttctccaag
421 atgttgccta aaacacaaca aatggagaga ctggggatgg cactatcact gctgctgtac
481 tggtatgctc tatttccaaa taagccttcc agaaggttag caaaggtgct aatccagtgg
541 aaatcatcag agagtgtgat gttagctgtt gatgctgtaa ttgctgaact taaaaagcag
601 tctaaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga
661 cacaaagaaa attggtaaca tcatctctga tgcaatgaaa aagtttggaa gaaagggcat
721 catcacagta aaggatggaa aaacactgac tgatgaatta gaaattattg aaggcatgaa
781 aaatttgatc gaggatatat ttctccatac tttcttaata catcaaaagg tgagaaatgt
841 gaattccagg atgcctatgt tctactgagt gaaaagaaca tttctggtgt ccagtccatt
901 gtacctgctc ttgaaattgc cagtgcttac cttaagcctt tggtcacaat tgctgaagat
961 attgatgaag aaacgttaag tacgctcatc ttgaataggc taaaagttgg tcttcaggtt
1021 gtagcagcca aagtttcagg ttttggtgac aacggagcca gcttaaagat atggctattg
1081 ctactggtag tgcagtgttt ggagaagagg ggttaaccct aatcttgaag atgctcagcc
1141 tcatgactca ggaaaatttg gagaggtcat tgtgccaaag tagtgctatg tatttaaaag
1201 gaaaagatga caaagcttaa attgaaaaat gtattcaaga aatcattaag tagttagata
1261 tcataactag tggatatgaa aagaaaaaac tgaatgaata tatggcaaaa ctttcagatg
1321 gagtagccgt gttgatggtt ggttggtgag acaagtgatg ttcaggtgaa agacaagaaa
1381 gacagattta ccgatgtcct taatgctaca agagctgcta ttgaagaagg cattgttttg
1441 ggagggggtt gtgccctgct tcgagcattc cactcattca ctccagctaa tgaagataaa
1501 aataatggca tagaaattat taaaagaaca ctcaaaattc cagcaatgac tattgctaag
1561 aatgcaggtg ttgatttttg atagttgaga aaattatgca aagttcctca gaagttggtt
1621 atgatactat gttaggagat gtcgtgaata tggtggaaaa agacattatt gacccaacaa
1681 aggttgtgag aactgcttta ttggatgctg cttgcatggc tctctattaa ctacagcagc
1741 tgttgtagtc acaaaaattc ctaaagaagg gaacagccct ggaatgggtg caatgtgtgg
1801 aatgggaggt ggcctgttct aattcctaga atagtgcttt acctttatta atgaattgtg
1861 atagggagcc caaggcgtgt tcctcaccaa taacttcaga gaagtcagtt ggagaaaatg
//
LOCUS HUMHP603A 1912 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human chaperonin (HSP60) non-functional pseudogene 3.
ACCESSION M34661
KEYWORDS chaperonin; pseudogene.
SOURCE Human fetal liver DNA, clone PS5-4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 99 to 1912)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1912)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
BASE COUNT 614 a 319 c 449 g 527 t 3 others
ORIGIN
1 ctttatgtgc tcttanctaa attaataatt tatttanntt tttttgtata tttgctatgt
61 cacctttaga aagtaagagg cagctctcac ttgctgctga tgacctgcct cgccatgcac
121 atgtctgcca gcaccccaca gaaatgcttc cattacccac agtctttcac cagatgagac
181 cagtgtccag ggtactggct cctcacctca cttgaagtga tggtaaagat gtaaaatttc
241 ctgctgatgt ccgagcctta atgcttcaaa gtgtagactt ttttgtcgat gctgtagcca
301 ttacaatggg gccaaaggga agaacagtaa ttattgaaca tagctgggga agtcccaaag
361 taacaaaaga tggtgtaact gatgcaaagt caattgactt aaaggataaa tataaaagca
421 ttggagctaa acttgtccaa gatgttgcca ataacacaga tgaagagact gggggatggc
481 actatcactg ctgctgtact ggtatgctct atattcaaat aaggcttcca gaaggttagc
541 aaaggtgcta atccagtgga aatcaagaga ggtgtgatgt tagctgttga tgctgtaatt
601 gctgaactta aaaagcagtg taaacctgtg accaaacctg aaattgcaca ggttgctaca
661 atttctgcaa atggagacaa agaaattggt aacatctctg atgcaatgaa aaagtttgga
721 agaaagggca tcacagtaaa ggatggaaaa acactgactg atgaattaga aattattgaa
781 ggcatgaaaa tttgatcgag gatatatttc tccatacttt attaatacat caaaaggtga
841 gaaatgtgaa ttccaggatg cctatgttct gttgcatgaa aagaaaattt ctagtgtcca
901 gtccattgta actgctcttg aaattgccaa tgcttactgt aagcctttgg tcataattgc
961 tggagacatt gatggagaag ctctaactac actcatcctg aataggctaa aggttggtct
1021 tcaggttgtg gcagtcaaag ctccagggtt tgggtacaat agaaagaacc agcttaaaga
1081 tacggtcatt gctactggtg gtacagtgtt tggagaagag ggctgacact aaatcttgaa
1141 gacgttcagc ctcgtgacgt aggagaagtt ggagaggtca ctgtgaacaa agattatgcc
1201 atgctcttaa aaggaaaagg taacaagtct caaattgaaa aatgtgttca agaaatcatt
1261 gaccagtcag atgtcacaac tagtgaatac gaaaaggaaa aagtgagtgg agaaactttc
1321 agatggagta gctgtgctga aggttgttgg tgggacaagt gatgttgaag tgaatgaaga
1381 gaaagacaga gttacaggtg cacttaatgc tacaagagct gctgttgaag aaggcattgt
1441 ttagggaggg ggttgtgccc tgcttcgatg cattccagcc ttggactcat tcactccagc
1501 taatgaagat aaataattgg tatagaaatt attaaaagaa cactcaaaat tccagcaatg
1561 acaattgcta agaatgcagc tgttgatggc attttgatag ttgagaaaat tatgtaaagt
1621 gcctcagaag ttggttatga tactatgtta ggagatgtcg tgaatatggt gaaaaagaca
1681 ttattgaccc aacaaaggtt gtgagaactg cttcattgga tgctgctggc atggcctctc
1741 tattaactac agcagctgtt gtagtcacag aaattcctaa agaagagaac agccctggaa
1801 taggtgcaat gtgtggaatg ggaggtggcc tgttcaaatt cctagaatag tgctttacct
1861 ttattaatga attgtgatag gaacccaagg cagtgttcct caccaataac tt
//
LOCUS HUMHP604A 2019 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human chaperonin (HSP60) non-functional pseudogene 4.
ACCESSION M34663
KEYWORDS chaperonin; pseudogene.
SOURCE Human fetal liver DNA, clone PS5-4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 123 to 2019)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 2019)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
BASE COUNT 657 a 327 c 479 g 556 t
ORIGIN
1 agagaaatgt aatatgaaat aaagctttat gtgctctacc taaattaata atttatttat
61 ttttttttgt atatttgcta tgtcaccttt agaaagtaag aggcagctct cacttgctga
121 ttgatgacct gatgatgacc tgcctcgcca tgcacatgtc ctgccagcac cccacagaaa
181 tgcttccatt acccacagtc tttcaccaga tgagaccagt gtccaggcta ctggctactg
241 gctcctcacc tcacttgaag tgatggtaaa gatgtaaaat ttggtgctga tgtccgagcc
301 ttaatgcttc aaagtgtata gcttttgtcg agctgttagc cattacaaat gggccaaagg
361 gaagaacagt agattattga acatagctgg ggaagtccca aagtaacaaa agagtggtgt
421 gactgatgca aaggcaattg acttaaagga taaatataaa agcattggag ttaaacttgt
481 ccaagatgtt gccaataaca cagatgaaga agactgggga tggcactatc actgctgctg
541 tactggtatg ctctatttcc aaataaggct tccagaaggt tagcaaaggt gctaatccgt
601 ggaaatcagg agaggtgtga tgttagctgt tgatgctgta attgctgaac ttaaaaagca
661 gtctaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga
721 gacaaagaaa ttggtaacat catctctgat gcaatgaaaa actttggaag aaagggcatc
781 atcacagtaa aggatggaaa aacactgact gatgaattag aaattattga aggcatgaaa
841 atttgatcga ggatatattt ctccatactt tattaataca tcaaaaggtg agaaatgtga
901 attccaggat gcctatgttc tgttgcatga aaagaaaatt tctagtgtcc agtccattgt
961 aactgctctt gaaattgcca atgcttagtg taagcctttg gtcatatatt gctggagaca
1021 ttgatggaga agctctaact acactcatcc tgaataagct aaaggttggt cttcaggttg
1081 tggcagtcaa agctaaagct ccagggtttg gtgacaatag aaagaaccag cttaaagata
1141 cggttattgc tactggtggt acagtgtttg gagaagaggg ctgacactaa atcttgaaga
1201 cgttcagcct cgtgacgtag gagaagttgg agaggtcact gtgatcaaga ttatgccatg
1261 ctcttaaaag gaaaaggtaa caagtctcaa attgaaaaat gtgtgcaaga aatcgttgac
1321 cagtcagatg tcacaactat gtaatacgaa aaggaaaaag tgagtgaaga aactttcaga
1381 tggagtagct gtgctgaagg ttggtgggac aagtgatgtt gaagtgaatg aagagaaaga
1441 cagagttata ggtgcactta atgctacaag agctgctgtt gaagaaggca ttgtttaggg
1501 agggggttgt gccctgcttc gatgcattcc agccttggac tcattcactc cagctaatga
1561 agataaaata attggtatag aaattattaa aagaacactc aaaattccag caatgacaat
1621 tgctaagaat gcaggtgttg atgcattttt gatagttgag aaaattatgt aaagttcctc
1681 agaagttggt tatgatacta tgttaggaga tgtcgtgaat atggtggaaa aagacattat
1741 tgacccaaca aaggttgtga gaactgcttc attggatgct gctgcatggc ctctctatta
1801 ttaactacag cagctgttgt agtcacagaa attcctaaag aagggaacag ccctggaatg
1861 ggtggaatgg gtgcaatgat atgtggaatg ggaggtggcc tgttctaatt cctagaatag
1921 tgctttacct ttattaatga attgtgatag gaaacccaag gcagtgttcc tcaccaataa
1981 cttcagagaa gtcagttgga gaaaatgaag aaaaaggct
//
LOCUS HUMHSP60A 2202 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human chaperonin (HSP60) mRNA, complete cds.
ACCESSION M34664
KEYWORDS chaperonin.
SOURCE Human placenta cDNA to mRNA, and DNA, clone PGEM-10.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2202)
AUTHORS Venner,T.J., Singh,B. and Gupta,R.S.
TITLE Nucleotide sequence and novel structural features of human and
Chinese hamster HSP60 (chaperonin) gene families
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.S.Gupta, 29-MAY-1990.
Author address: R.S.Gupta
McMaster University
Dept of Biochemistry
1200 Main Street West
Hamilton
Ontario, CANADA L8N 3Z5
email: IN%GUPTAR.@SSCVAX.McMASTER.CA
FEATURES from to/span description
pept 25 1746 chaperonin (HSP60)
BASE COUNT 699 a 371 c 538 g 594 t
ORIGIN
1 cacgcttgcc gccgccccgc agaaatgctt cggttaccca cagtctttcg ccagatgaga
61 ccggtgtcca gggtactggc tcctcatctc actcgggctt atgccaaaga tgtaaaattt
121 ggtgcagatg cccgagcctt aatgcttcaa ggtgtagacc ttttagccga tgctgtggcc
181 gttacaatgg ggccaaaggg aagaacagtg attattgagc agggttgggg aagtcccaaa
241 gtaacaaaag atggtgtgac tgttgcaaag tcaattgact taaaagataa atacaagaac
301 attggagcta aacttgttca agatgttgcc aataacacaa atgaagaagc tggggatggc
361 actaccactg ctactgtact ggcacgctct atagccaagg aaggcttcga gaagattagc
421 aaaggtgcta atccagtgga aatcaggaga ggtgtgatgt tagctgttga tgctgtaatt
481 gctgaactta aaaagcagtc taaacctgtg accacccctg aagaaattgc acaggttgct
541 acgatttctg caaacggaga caaagaaatt ggcaatatca tctctgatgc aatgaaaaaa
601 gttggaagaa agggtgtcat cacagtaaag gatggaaaaa cactgaatga tgaattagaa
661 attattgaag gcatgaagtt tgatcgaggc tatatttctc catactttat taatacatca
721 aaaggtcaga aatgtgaatt ccaggatgcc tatgttctgt tgagtgaaaa gaaaatttct
781 agtatccagt ccattgtacc tgctcttgaa attgccaatg ctcaccgtaa gcctttggtc
841 ataatcgctg aagatgttga tggagaagct ctaagtacac tcgtcttgaa taggctaaag
901 gttggtcttc aggttgtggc agtcaaggct ccagggtttg gtgacaatag aaagaaccag
961 cttaaagata tggctattgc tactggtggt gcagtgtttg gagaagaggg attgaccctg
1021 aatcttgaag acgttcagcc tcatgactta ggaaaagttg gagaggtcat tgtgaccaaa
1081 gacgatgcca tgctcttaaa aggaaaaggt gacaaggctc aaattgaaaa acgtattcaa
1141 gaaatcattg agcagttaga tgtcacaact agtgaatatg aaaaggaaaa actgaatgaa
1201 cggcttgcaa aactttcaga tggagtggct gtgctgaagg ttggtgggac aagtgatgtt
1261 gaagtgaatg aaaagaaaga cagagttaca gatgccctta atgctacaag agctgctgtt
1321 gaagaaggca ttgttttggg agggggttgt gccctccttc gatgcattcc agccttggac
1381 tcattgactc cagctaatga agatcaaaaa attggtatag aaattattaa aagaacactc
1441 aaaattccag caatgaccat tgctaagaat gcaggtgttg aaggatcttt gatagttgag
1501 aaaattatgc aaagttcctc agaagttggt tatgatgcta tggctggaga ttttgtgaat
1561 atggtggaaa aaggaatcat tgacccaaca aaggttgtga gaactgcttt attggatgct
1621 gctggtgtgg cctctctgtt aactacagca gaagttgtag tcacagaaat tcctaaagaa
1681 gagaaggacc ctggaatggg tgcaatgggt ggaatgggag gtggtatggg aggtggcatg
1741 ttctaactcc tagactagtg ctttaccttt attaatgaac tgtgacagga agcccaaggc
1801 agtgttcctc accaataact tcagagaagt cagttggaga aaatgaagaa aaaggctggc
1861 tgaaaatcac tataaccatc agttactggt ttcagttgac aaaatatata atggtttact
1921 gctgtcattg tccatgccta cagataattt attttgtatt tttgaataaa aaacatttgt
1981 acattcctga tactgggtac aagagccatg taccagtgta ctgctttcaa cttaaatcac
2041 tgaggcattt ttactactat tctgttaaaa tcaggatttt agtgcttgcc accaccagat
2101 gagaagttaa gcagcctttc tgtggagagt gagaataatt gtgtacaaag tagagaagta
2161 tccaattatg tgacaacctt tgtgtaataa aaatttgttt aa
//
LOCUS MUSGCANF 4049 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Mouse guanylate cyclase/atrial natriuretic factor receptor mRNA,
complete cds.
ACCESSION J05504
KEYWORDS atrial natriuretic factor receptor; guanylate cyclase.
SOURCE Mouse (strain C57B1/6J) Leydig cell line MA-10, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 4049)
AUTHORS Pandey,K.N. and Singh,S.
TITLE Molecular cloning and expression of murine guanylate cyclase/atrial
natriuretic factor receptor cDNA
JOURNAL J. Biol. Chem. 265, 12342-12348 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.N.Pandey, 31-MAY-1990.
FEATURES from to/span description
pept 348 3521 guanylate cyclase/atrial natriuretic factor
receptor
signal 4029 4034 poly-A signal
BASE COUNT 879 a 1164 c 1140 g 866 t
ORIGIN
1 cagaaaccct cccaaactcc tatagccaca cacacctttc ccggccaaga tccaaacaaa
61 cctctacttt cctcttccct aggagccaga ctcccttcgg gtgctgcgct cgctctcacc
121 tgctctaaag cacctccgct ctcggacgct cccaattccg ccctcctgct cgacggcggg
181 acagtcgcag cctcggcagg cagcttgctc tcgccgctgc ggcttcaacc cagccccctc
241 cctcgctacg gctgggcgct cttgactccc gaccctcgcc tctgagcccg aggacggcga
301 tcagaccatg gtgacagcgc tgctccgtcg ctgcgctcgc tgaggccatg ccgcgttccc
361 gacgcgtccg tccgcgccta agggcgctgc tgctgctacc gccgctgctg ctgctccgaa
421 gcggccacgc gagcgacctg accgtggctg tggtgctgcc cgtgaccaac acctcgtacc
481 cgtggtcctg ggcgcgtgta gggccggcgg tggaactggc tctcgggagg gtgaaggctc
541 ggccggactt gctgccgggt tggacggtcc gtatggtgct gggcagcagc gagaacgcgg
601 cgggcgtctg ctccgacacc gctgcaccgc tggccgcggt ggatctcaag tgggagcaca
661 gccccgccgt gttcctgggc cccggctgcg tatactctgc tgccccggtg gaccgcttca
721 ccgcgcactg gcggttgccg ctgctgacgg ctggcgcccc ggctctgggc atcggggtga
781 aggatgagta cgcgttaacc acccgcacag gacccagcca tgtcaagctg ggcgacttcg
841 tgacggcgct gcatcgacgg ctgggctggg agcaccaggc gcttgtgctc tatgcagatc
901 ggctgggcga cgaccggccg tgcttcttca tagtggaggg gctgtacatg cgggtgcgtg
961 agcgactcaa catcacagta aatcaccagg agttcgtcga gggcgacccg gaccactaca
1021 ccaagctact gcggaccgtg cagcgcaagg gcagagttat ctacatctgc agttctccgg
1081 atgccttcag gaatctgatg cttttggccc tggatgctgg cctgactggg gaggactatg
1141 ttttcttcca cctggatgtg tttgggcaaa gccttcaggg tgctcagggc cctgttccag
1201 agaagccctg ggaaagagac gatgggcagg ataggagagc ccgccagcgc tttcaggctg
1261 caaaaattat tacttacaaa gaacccgata atcctgagta cttggaattc ctgaagcagc
1321 taaaactctt ggctgacaag aaattcaact tcaccatgga ggatggcctg aaaaatatca
1381 tcccagcatc cttccatgac gggctcctgc tctatgtcca ggcagtgaca gagactctgg
1441 cacagggggg cactgtcact gatggagaga acatcactca gcggatgtgg aaccgaagct
1501 tccaaggtgt gacaggatac ctgaaaattg atagaaatgg agatcgggac actgattctc
1561 ctctctggga tatggacccc gagacaggtg ccttcagggt tgtcctgaac tttaatggta
1621 cttcccagga gctgatggct gtgtcagaac acagattata ctggcctctg ggatacccac
1681 ctcctgacat ccctaaatgt ggctttgaca atgaggaccc agcctgcaac caagaccact
1741 tttccacact ggaggttctg gctttggtgg gcagcctctc tctggttagc tttctgatcg
1801 tgtctttctt catatacagg aagatgcagc tggaaaagga gctggtctca gagttgtggc
1861 gggtgcgctg ggaggacttg cagcccagca gcctggagag gcaccttcgg agcgctggca
1921 gtcggctgac cctgagtggg cgaggctcca attatggctc cctgctaacc acggagggcc
1981 agttccaagt ctttgccaag acagcatact ataagggcaa cctcgtggct gtgaaacgtg
2041 tgaaccggaa acgcattgag ttgacacgaa aagtcctgtt tgaacttaaa catatgcggg
2101 atgtgcagaa tgagcaattg accagatttg tgggagcttg taccgaccct cccaacatct
2161 gtatcctcac agagtactgt ccccgtggaa gcctacagga cattctagag aatgagagta
2221 ttaccctgga ctggatgttt cggtactcac tcaccaatga cattgtcaag ggaatgctct
2281 ttctacacaa cggggccatt tgttcccatg ggaacctcaa gtcatccaac tgcgtggtag
2341 atggacgttt tgtgttaaag atcacagact atgggctcga gagcttcaga gacccggagc
2401 cagagcaagg acacaccctc tttgccaaaa aactgtggac tgcacctgag ctcctgcgaa
2461 tggcttcccc acctgcccgt ggctcccaag ctggggatgt ctacagtttt ggtatcatcc
2521 ttcaggaaat tgccctaaga agtggggtct tctatgtgga aggtttggac ctcagcccaa
2581 aagagatcat tgagcgtgtg actcggggtg agcagccccc attccgacct tccatggatc
2641 tgcagagcca cctggaggaa ctggggcagc tgatgcagag gtgctgggca gaggatcctc
2701 aggagcggcc accctttcaa cagatccgcc tggcgctgcg caagttcaac aaggagaaca
2761 gcagcaacat cctggacaac ctgctgtcac gcatggaaca gtacgccaac aacctggagg
2821 aactggtaga ggagagaaca cagccttatc tggaggagaa gcgcaaagct gaggccctgc
2881 tttaccagat tctgcctcac tctgtggctg agcagctgaa gagaggcgag acagtccagg
2941 ctgaggcatt tgatagtgtt actatctatt tcagtgatat cgtgggcttt acagctcttt
3001 cagcagagag cacacccatg caggtggtca ccctgctcaa tgatctgtac acctgttttg
3061 atgctgtcat agacaacttt gatgtgtaca aggtagagac cattggtgat gcttacatgg
3121 tggtatcagg gctcccagtg aggaatggac agctccatgc ccgagaggta gcccgaatgg
3181 cacttgcact gctcgatgct gtacgctcct tccgcatcgg ccataggccc caggaacagc
3241 tgcgcttgcg cattggaatt cacacaggtc ctgtgtgtgc tggtgtggta gggctaaaga
3301 tgccccgata ctgcctcttt ggagacacag tcaacacagc ttcaagaatg gagtctaatg
3361 gggaagccct caggatccac ttgtcttcgg agaccaaggc tgtgctggaa gagttcgatg
3421 gtttcgagct ggagctccga ggggatgtgg aaatgaaggg caaaggcaag gttcgttcct
3481 attggctcct cggggaccgg ggatgcagct ctcgagcctg acctactgcc ctgctattcc
3541 ttgtcacctc ccctccctat cccagcaatg acacgggtct ccaacttccc cctctcccac
3601 agcagctcag ccactgtgga aagattaggg acctaaccag cgcagtcatc agatgtgacc
3661 tctgagagag gatggagatg gtggggactg gagggggact cctaagttta tagggctgac
3721 tgaaataccc agtcactccc gtagcacatg ccccgccccc cccccgcccc cccactcagc
3781 tgcctagcag acagtgattc cttctgccgc cctcaactta gctccactgt gagttagagg
3841 gagggaaatt gccacctgaa ggaaagagaa aagagattct cggggtttgc aggaggcagg
3901 cagtcctgtg tcacaaatac tcccctcact cccagtccac cacctgcccc accgacttcc
3961 cttcccacac agtgcactga ggagaagaga ggcatggggt tgccttgctt ctcctatgag
4021 caaaacccat taaagtcttt attcctgtg
//
LOCUS MUSMHH2DR 1415 bp ss-mRNA ROD 08-AUG-1990
DEFINITION Mouse MHC class I H-2Dr protein mRNA, complete cds.
ACCESSION M34961
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Mouse (strain B10.RIII) spleen, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1415)
AUTHORS Zeling,C. and Pease,L.R.
TITLE Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Zeling, 29-MAY-1990.
Author address: C.Zeling
Mayo Clinic
323 Guggenheim
200 1st St. SW
Rochester, MN 55905
email: Pease@mayo.edu
FEATURES from to/span description
pept 1 1089 MHC H-2Dr protein precursor
sigp 1 72 MHC H-2Dr protein signal peptide
matp 73 1086 MHC H-2Dr protein
BASE COUNT 298 a 398 c 442 g 277 t
ORIGIN
1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact
61 cagacccgcg cgggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc
121 ctcggggagc cccggtacat agaagtcggc tacgtggacg acacggagtt cgtgcgcttc
181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag
241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagat tttccgagtg
301 aacctgagga ccctgctcag ctactacaac cagagcaagg gcggctctca cacgatccag
361 tggatgttcg gctgtgacgt ggagtcggac gggcgcctcc tccgcgggta ccagcagtac
421 gcatacgacg gctgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg
481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg ctggtgctgc agagagaaag
541 agggcctacc tagagggcac gtgcgtggag tggctcctca gacacctgaa gaacgggaac
601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaaa
661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc
721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca
781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctctcgggaa ggagcagaat
841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct
901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg
961 gccattattg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa
1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt
1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc
1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc
1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc
1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga
1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag
1381 aataagaatt tgaatgtggg tggctggaga gatgg
//
LOCUS MUSMHH2DS 1415 bp ss-mRNA ROD 08-AUG-1990
DEFINITION M.musculus MHC class I H-2Ds protein mRNA, complete cds.
ACCESSION M34962
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE M.musculus (strain B10.RIII) spleen, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1415)
AUTHORS Zeling,C. and Pease,L.R.
TITLE Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Zeling, 29-MAY-1990.
Author address: C.Zeling
Mayo Clinic
323 Guggenheim
200 1st St. SW
Rochester, MN 55905
email: Pease@mayo.edu
FEATURES from to/span description
pept 1 1089 MHC H-2Dr protein precursor
sigp 1 72 MHC H-2Dr protein signal peptide
matp 73 1086 MHC H-2Dr protein
BASE COUNT 292 a 401 c 444 g 278 t
ORIGIN
1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact
61 cagacccgcg ctggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc
121 ctcggggagc cccggtacat agaagtcggc tacgtggaca acacggagtt cgtgcgcttc
181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag
241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagag tttccgagtg
301 aacctgagga ccgcgctccg ctactacaac cagagcaagg gcggctctca cacggtccag
361 gtgatgtctg gctgtgacgt ggggtcggac gggcgcctcc tccgcgggta ccagcagcac
421 gcatacgacg gccgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg
481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg gtggtgatgc agagcattac
541 aaggcctacc tagagggcac gtgcgtggag tggctcctca gatacctgga gctcgggaac
601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaga
661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc
721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca
781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctcttgggaa ggagcagaat
841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct
901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg
961 gccattactg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa
1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt
1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc
1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc
1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc
1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga
1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag
1381 aataagaatt tgaatgtggg tggctggaga gatgg
//
LOCUS CREAPCYN 577 bp ss-mRNA PLN 08-AUG-1990
DEFINITION C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds.
ACCESSION J05524
KEYWORDS apoplastocyanin.
SOURCE C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone
PC6-2.
ORGANISM Chlamydomonas reinhardtii
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Volvocales; Chlamydomonadaceae.
REFERENCE 1 (bases 1 to 577)
AUTHORS Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and
Bogorad,L.
TITLE Isolation and characterization of a complementary DNA clone for an
algal pre-apoplastocyanin
JOURNAL J. Biol. Chem. 265, 12372-12379 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Hill, 11-MAY-1990.
FEATURES from to/span description
pept 22 459 apoplastocyanin (PC6-2) precursor
sigp 22 162 apoplastocyanin signal peptide
matp 163 456 apoplastocyanin
BASE COUNT 91 a 189 c 171 g 126 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc
61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt
121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg
181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag
241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc
301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc
361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac
421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat
481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac
541 gcagagagcg tgtagctctg taactttttc ggaattc
//
LOCUS HUMTNC1 1085 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human fast skeletal muscle troponin C gene, exon 1.
ACCESSION M33771 J05502
KEYWORDS troponin C.
SEGMENT 1 of 2
SOURCE Human HeLa cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1085)
AUTHORS Gahlmann,R. and Kedes,L.
TITLE Cloning, structural analysis, and expression of the human
fast-twitch skeletal muscle troponin C gene
JOURNAL J. Biol. Chem. 265, 12520-12528 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Gahlmann, 20-APR-1990.
FEATURES from to/span description
pept 1033 + 1035 troponin C, exon 1
pre-msg 968 > 1085 troponin C mRNA and introns (alt.)
pre-msg 969 > 1085 troponin C mRNA and introns (alt.)
IVS 1036 > 1085 troponin C intron A
BASE COUNT 266 a 276 c 306 g 237 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct
61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc
121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa
181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa
241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc
301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac
361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc
421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca
481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga
541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt
601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc
661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga
721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc
781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt
841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt
901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag
961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg
1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg
1081 tggac
//
LOCUS HUMTNC2 1773 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human fast skeletal muscle troponin C gene, exons 2-6.
ACCESSION M33772 J05502
KEYWORDS troponin C.
SEGMENT 2 of 2
SOURCE Human HeLa cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1773)
AUTHORS Gahlmann,R. and Kedes,L.
TITLE Cloning, structural analysis, and expression of the human
fast-twitch skeletal muscle troponin C gene
JOURNAL J. Biol. Chem. 265, 12520-12528 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Gahlmann, 20-APR-1990.
FEATURES from to/span description
pept + 51 102 troponin C, exon 2
247 390 troponin C, exon 3
479 593 troponin C, exon 4
758 894 troponin C, exon 5
1506 1537 troponin C, exon 6
pre-msg < 1 1668 troponin C mRNA and introns
IVS < 1 50 troponin C intron A
IVS 103 246 troponin C intron B
IVS 391 478 troponin C intron C
IVS 594 757 troponin C intron D
IVS 895 1505 troponin C intron E
BASE COUNT 358 a 455 c 628 g 331 t 1 others
ORIGIN About 2.5 kb after segment 1.
1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc
61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag
121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg
181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc
241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg
301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg
361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat
421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc
481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac
541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct
601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag
661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc
721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc
781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg
841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg
901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct
961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc
1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta
1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat
1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac
1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc
1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa
1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa
1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg
1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc
1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca
1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg
1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc
1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc
1741 ggctggcacc acccggagca cccttgggga tcc
//
LOCUS ENTLACBOHI 1164 bp ds-DNA BCT 08-AUG-1990
DEFINITION Plasmid PDS075 (from E.cloacae) beta-lactamase (Ohio-1) gene,
complete cds.
ACCESSION M33655
KEYWORDS beta-lactamase.
SOURCE Plasmid PDS075 (from Enterobacter cloacae) DNA.
ORGANISM Plasmid PDS075
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1164)
AUTHORS Shlaes,D.M., Currie-McCumber,C., Hull,A., Behlau,I. and Kron,M.
TITLE The Ohio-1 beta-lactamase is part of the SHV-1 family
JOURNAL Antimicrob. Agents Chemother. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.M.Shlaes, 10-APR-1990.
FEATURES from to/span description
pept 143 1003 Ohio-1 beta-lactamase
signal 56 62 -35 region
signal 80 85 -10 region
binding 126 131 ribosome binding site
BASE COUNT 233 a 345 c 357 g 229 t
ORIGIN
1 cgagtaatac tcacatggca ctgacgcaaa atcatcgatt tattcaacaa agccattgtg
61 aatcagcaaa acgccgggtt attcttattt gtcgcttctt tactcgcctt tatcggccct
121 cactcaagga tctaccgtgg ttatgcgtta ttttcgcctg tgtattatct ccctgttagc
181 caccctgccg ctgcgggtac acgccggacc gcagccgctt gagcaaatta aactaagcga
241 aagccagctg tcgggcagcg taggcatgat agaaatggat ctggccaggc ccggcacgct
301 gaccgcctgg cgcgccgatg aacgctttcc catgatgagc acctttaaag tagtgctctg
361 cggcgcaggt ctggcgcggg tggatgccgg tgacgaacag ctggagcgaa agatccacta
421 tcgccgacag gatctggtgg actactcgcc ggtcagcgaa aaacaccttg ccgacggcat
481 gacggtcggc gaactctgtg ccgccgccat taccatgagc gataacagcg ccgccaatct
541 gctgctgcca gccgtcggcg gccccgcagg attgactgcc tttttgcgcc agatcggcga
601 caacgtcacc cgccttgacc gctgggaaac ggaactgaat gaggcgcttc ccggcgacgc
661 ccgcgacacc actaccgccc gcagcatggc cgcgaccctg cgcaagctgc tgaccagcca
721 gcgtctgagc gcccgttcgc aacggcagct gctgcagtgg atggtggacg atcgggtcgc
781 cggacgtttg atccgctccg tgctgccggc gggctggttt atcgccgata agaccggagc
841 tggcgaacgg ggtgcgcgcg ggattgtcgc cctgcttggc ccgaataaca aagcagagcg
901 gattgtggtg atttatctgc gggatacgcc ggcgagcatg gccgagcgaa atcagcaaat
961 cgccgggatc gccggggcgc tgatcgagca ctggcaacgc taacccggcg gtgggcgcgc
1021 gcgttatccg gctcgtagca cctcgcaggc gtgccggcga tatgactggc ggcggcatcg
1081 gagagatggc ttctggctaa tgatggtcgg tgcaacctgg gtgaaaggta acgccataaa
1141 cggtggccac ctgattgtgt ttct
//
LOCUS HUMTRPY1B 2609 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human tryptase-I gene, complete cds.
ACCESSION M33494
KEYWORDS serine protease; tryptase-I.
SOURCE Human adult skin DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2609)
AUTHORS Vanderslice,P., Ballinger,S., Tam,E.K., Goldstein,S.M.,
Craik,C.S. and Caughey,G.H.
TITLE Human mast cell tryptase: Multiple cDNAs and genes reveal a
multigene serine protease family
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Vanderslice 02-APR-1990.
FEATURES from to/span description
pept 458 518 tryptase-I, exon 1
673 844 tryptase-I, exon 2
954 1219 tryptase-I, exon 3
1346 1509 tryptase-I, exon 4
1595 1759 tryptase-I, exon 5
pre-msg 225 2072 tryptase-I mRNA
IVS 248 457 tryptase-I, intron A
IVS 519 672 tryptase-I, intron B
IVS 845 953 tryptase-I, intron C
IVS 1220 1345 tryptase-I, intron D
IVS 1510 1594 tryptase-I, intron E
signal 131 135 pot. CAAT box
signal 197 201 pot.TATA box
BASE COUNT 422 a 941 c 776 g 470 t
ORIGIN
1 accagctgac aggtggagct gccagtctcc agtgctcagc cctcagcggg gcctgcctgg
61 cagccccaca cacagagggc atcggggtgg cgggggcacg tgttacacgg gggccctggg
121 tctgagtcat ccacttcctc cgagtctgga tgggaggacc cagcgcccct cctccgcccc
181 ctcctgatct ggaaggataa atggggaggg gagagccact gggtagaagg aacagggagt
241 ggccagggta agtccccact ctcagagacc ctgacatcag cgtcacctgg agcagagtgg
301 cccagcctca gactcagagc accaagaccc aggcccgcag gcctggaccc accccggtcc
361 ccccgtccca gctccattct tcaccccaca atctgtagcc cccagccctg ccctgtgagg
421 cccggccagg cccacgatgc tcctccttgc tccccagatg ctgaatctgc tgctgctggc
481 gctgcccgtc ctggcgagcc gcgcctacgc ggcccctggt gagtcccagc cggggtccac
541 cctgcccctc accacattcc acaggtcagg gcctgggtgg gttctgggga ggtcgggctg
601 gccccccaca cagggaaggg ctgggcccag gcctggggct gcttcctggt cctgacctgg
661 cacctgcccc agccccaggc caggccctgc agcgagtggg catcgtcggg ggtcaggagg
721 cccccaggag caagtggccc tggcaggtga gcctgagagt ccacggccca tactggatgc
781 acttctgcgg gggctccctc atccaccccc agtgggtgct gaccgcagcg cactgcgtgg
841 gaccgtgagt ctcccggggc ctggaggggt ggggaagggc tggatgtgag ccctggctcc
901 cgggtgctcc tgggggctgc ccagggccct gagtgggatc ctccgctgcc cagggacgtc
961 aaggatctgg ccgccctcag ggtgcaactg cgggagcagc acctctacta ccaggaccag
1021 ctgctgccgg tcagcaggat catcgtgcac ccacagttct acaccgccca gatcggagcg
1081 gacatcgccc tgctggagct ggaggagccg gtgaacgtct ccagccacgt ccacacggtc
1141 accctgcccc ctgcctcaga gaccttcccc ccggggatgc cgtgctgggt cactggctgg
1201 ggcgatgtgg acaatgatgg tgggtctggg gacagtggag gtggggccag ggtcttagcc
1261 acagcccagc ccctgggtcc ctctgggctc caggtggggg ttgcccggcc ccctcctgag
1321 gctgcaccct cttccccacc tgcagagcgc ctcccaccgc catttcctct gaagcaggtg
1381 aaggtcccca taatggaaaa ccacatttgt gacgcaaaat accaccttgg cgcctacacg
1441 ggagacgacg tccgcatcgt ccgtgacgac atgctgtgtg ccgggaacac ccggagggac
1501 tcatgccagg tgggccccgc ctgtcccccg ccccccgccc cccaaccccc actcccaggc
1561 ctgttcggcg agcgctgacc tctgaccttc ccagggcgac tccggagggc ccctggtgtg
1621 caaggtgaat ggcacctggc tgcaggcggg cgtggtcagc tggggcgagg gctgtgccca
1681 gcccaaccgg cctggcatct acacccgtgt cacctactac ttggactgga tccaccacta
1741 tgtccccaaa aagccgtgag tcaggcctgg gttggccacc tgggtcactg gaggaccaac
1801 ccctgctgtc caaaacacca ctgcttccta cccaggtggc gactgccccc cacaccttcc
1861 ctgccccgtc ctgagtgccc cttcctgtcc taagccccct gctctcttct gagccccttc
1921 ccctgtcctg aggacccttc cctatcctga gcccccttcc ctgtcctaag cctgacgcct
1981 gcaccgggcc ctccagccct cccctgccca gatagctggt ggtgggcgct aatcctcctg
2041 agtgctggac ctcattaaag tgcatggaaa tcactggtgt gcatcgctgt gtttctggtt
2101 gtggatgtca ctgggagaga aggggtccag gtgtgctgag gacacctgcc acagtgtgag
2161 gtcctagccc tcaaggcaca gccagtcacc gtgggaccac tggaggacca acccctgctg
2221 tccaaaacac cactgcttcc tacccaggtg gcgactgccc cccacacctt ccctgccccg
2281 tcctgagtgc cccttcctgt cctaagcccc ctgctctctt ctgagcccct tcccctgtcc
2341 tgaggaccct tccctatcct gagccccctt ccctgtccta agcctgacgc ctgcaccggg
2401 ccctccagcc ctcccctgcc cagatagctg gtggtgggcg ctaatcctcc tgagtgctgg
2461 acctcattaa agtgcatgga aatcactggt gtgcatcgct gtgtttctgg ttgtggatgt
2521 cactgggaga gaaggggtcc aggtgtgctg aggacacctg ccacagtgtg aggtcctagc
2581 cctcaaggca cagccagtca ccgtgggac
//
LOCUS HUMTRY1A 1137 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human tryptase-I mRNA, 3' end.
ACCESSION M33491
KEYWORDS serine protease; tryptase-I.
SOURCE Human adult skin, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1137)
AUTHORS Vanderslice,P., Ballinger,S.M., Tam,E.K., Goldstein,S.M.,
Craik,C.S. and Caughey,G.H.
TITLE Human mast cell tryptase: Multiple cDNAs and genes reveal a
multigene serine protease family
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3811-3815 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Vanderslice, 02-APR-1990.
FEATURES from to/span description
pept < 1 824 tryptase-I (AA at 3)
signal 1119 1124 poly-A signal
BASE COUNT 196 a 410 c 328 g 203 t
ORIGIN
1 tgaatctgct gctgctggcg ctgcccgtcc tggcgagccg cgcctacgcg gcccctgccc
61 caggccaggc cctgcagcga gtgggcatcg tcgggggtca ggaggccccc aggagcaagt
121 ggccctggca ggtgagcctg agagtccacg gcccatactg gatgcacttc tgcgggggct
181 ccctcatcca cccccagtgg gtgctgaccg cagcgcactg cgtgggaccg gacgtcaagg
241 atctggccgc cctcagggtg caactgcggg agcagcacct ctactaccag gaccagctgc
301 tgccggtcag caggatcatc gtgcacccac agttctacac cgcccagatc ggagcggaca
361 tcgccctgct ggagctggag gagccggtga acgtctccag ccacgtccac acggtcaccc
421 tgccccctgc ctcagagacc ttccccccgg ggatgccgtg ctgggtcact ggctggggcg
481 atgtggacaa tgatgagcgc ctcccaccgc catttcctct gaagcaggtg aaggtcccca
541 taatggaaaa ccacatttgt gacgcaaaat accaccttgg cgcctacacg ggagacgacg
601 tccgcatcgt ccgtgacgac atgctgtgtg ccgggaacac ccggagggac tcatgccagg
661 gcgactccgg agggcccctg gtgtgcaagg tgaatggcac ctggctgcag gcgggcgtgg
721 tcagctgggg cgagggctgt gcccagccca accggcctgg catctacacc cgtgtcacct
781 actacttgga ctggatccac cactatgtcc ccaaaaagcc gtgagtcagg cctgggttgg
841 ccacctgggt cactggagga ccaacccctg ctgtccaaaa caccactgct tcctacccag
901 gtggcgactg ccccccacac cttccctgcc ccgtcctgag tgccccttcc tgtcctaagc
961 cccctgctct cttctgagcc ccttcccctg tcctgaggac ccttccctat cctgagcccc
1021 cttccctgtc ctaagcctga cgcctgcacc gggccctcca gccctcccct gcccagatag
1081 ctggtggtgg gcgctaatcc tcctgagtgc tggacctcat taaagtgcat ggaaatc
//
LOCUS HUMTRY2A 1128 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human tryptase-II mRNA, 3' end.
ACCESSION M33492
KEYWORDS serine protease; tryptase-II.
SOURCE Human adult skin, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1128)
AUTHORS Vanderslice,P., Ballinger,S., Tam,E.K., Goldstein,S.M.,
Craik,C.S. and Caughey,G.H.
TITLE Human mast cell tryptase: Multiple cDNAs and genes reveal a
multigene serine protease family
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Vanderslice, 02-APR-1990.
FEATURES from to/span description
pept < 1 826 tryptase-II (AA at 2)
signal 1121 1126 poly-A signal
BASE COUNT 190 a 409 c 329 g 200 t
ORIGIN
1 gctgaatctg ctgctgctgg cgctgcccgt cctggcgagc cgcgcctacg cggcccctgc
61 cccaggccag gccctgcagc gagtgggcat cgttgggggt caggaggccc ccaggagcaa
121 gtggccctgg caggtgagcc tgagagtcca cggcccatac tggatgcact tctgcggggg
181 ctccctcatc cacccccagt gggtgctgac cgcagcgcac tgcgtgggac cggacgtcaa
241 ggatctggcc gccctcaggg tgcaactgcg ggagcagcac ctctactacc aggaccagct
301 gctgccggtc agcaggatca tcgtgcaccc acagttctac accgcccaga tcggagcgga
361 catcgccctg ctggagctgg aggagccggt gaaggtctcc agccacgtcc acacggtcac
421 cctgccccct gcctcagaga ccttcccccc ggggatgccg tgctgggtca ctggctgggg
481 cgatgtggac aatgatgagc gcctcccacc gccatttcct ctgaagcagg tgaaggtccc
541 cataatggaa aaccacattt gtgacgcaaa ataccacctt ggcgcctaca cgggagacga
601 cgtccgcatc gtccgtgacg acatgctgtg tgccgggaac acccggaggg actcatgcca
661 gggcgactcc ggagggcccc tggtgtgcaa ggtgaatggc acctggctgc aggcgggcgt
721 ggtcagctgg ggcgagggct gtgcccagcc caaccggcct ggcatctaca cccgtgtcac
781 ctactacttg gactggatcc accactatgt ccccaaaaag ccgtgagtca ggcctgggtt
841 ggccacctgg gtcactggag gaccaacccc tgctgtccaa aacaccactg cttcctaccc
901 aggtggcgac tgccccccac accttccctg ccccgtcctg agtgcccctt cctgtcctaa
961 gccccctgct ctcttctgag ccccttcccc tgtcctgagg acccttcccc atcctgagcc
1021 cccttccctg tcctaagcct gacgcctgca ccgggccctc cggccctccc ctgcccaggc
1081 agctggtggt gggcgctaat cctcctgagt gctggacctc attaaagt
//
LOCUS HUMTRY3A 1081 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human tryptase-III mRNA, 3' end.
ACCESSION M33493
KEYWORDS serine protease; tryptase-III.
SOURCE Human adult skin, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1081)
AUTHORS Vanderslice,P., Ballinger,S., Tam,E.K., Goldstein,S.M.,
Craik,C.S. and Caughey,G.H.
TITLE Human mast cell tryptase: Multiple cDNAs and genes reveal a
multigene serine protease family
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Vanderslice, 02-APR-1990.
FEATURES from to/span description
pept < 1 805 tryptase-III (AA at 2)
BASE COUNT 181 a 398 c 317 g 185 t
ORIGIN
1 gctgcccgtc ctggcgagcc gcgcctacgc ggcccctgcc ccaggccagg ccctgcagcg
61 agtgggcatc gttgggggtc aggaggcccc caggagcaag tggccctggc aggtgagcct
121 gagagtccgc gaccgatact ggatgcactt ctgcgggggc tccctcatcc acccccagtg
181 ggtgctgacc gcagcgcact gcgtgggacc ggacgtcaag gatctggccg ccctcagggt
241 gcaactgcgg gagcagcacc tctactacca ggaccagctg ctgccggtca gcaggatcat
301 cgtgcaccca cagttctaca ccgcccagat cggagcggac atcgccctgc tggagctgga
361 ggagccggtg aaggtctcca gccacgtcca cacggtcacc ctgccccctg cctcagagac
421 cttccccccg gggatgccgt gctgggtcac tggctggggc gatgtggaca atgatgagcg
481 cctcccaccg ccatttcctc tgaagcaggt gaaggtcccc ataatggaaa accacatttg
541 tgacgcaaaa taccaccttg gcgcctacac gggagacgac gtccgcatcg tccgtgacga
601 catgctgtgt gccgggaaca cccggaggga ctcatgccag ggcgactccg gagggcccct
661 ggtgtgcaag gtgaatggca cctggctgca ggcgggcgtg gtcagctggg gcgagggctg
721 tgcccagccc aaccggcctg gcatctacac ccgtgtcacc tactacttgg actggatcca
781 ccactatgtc cccaaaaagc cgtgagtcag gcctggggtg tccacctggg tcactggagg
841 accagcccct cctgtccaaa acaccactgc ttcctaccca ggcggcgact gccccccaca
901 ccttccctgc cccgtcctga gtgccccttc ctgtcctaag ccccctgctc tcttctgagc
961 cccttcccct gtcctgagga cccttcccca tcctgagccc ccttccctgt cctaagcctg
1021 acgcctgcac cgggccctcc ggccctcccc tgcccaggca gctggtggtg ggcgctaatc
1081 c
//
LOCUS YSCAPN1 1815 bp ds-DNA PLN 08-AUG-1990
DEFINITION S.cerevisiae apurinic endonuclease (APN1) gene, complete cds.
ACCESSION M33667
KEYWORDS 3'-repair diesterase; apurinic endonuclease.
SOURCE S.cerevisiae (strain EJ2169) DNA, clone pSCP8.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1815)
AUTHORS Popoff,S.C., Spira,A.I., Johnson,A.W. and Demple,B.
TITLE Yeast structural gene (APN1) for the major apurinic endonuclease:
Homology to Escherichia coli endonuclease 4
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 4193, 4197 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.C.Popoff, 10-APR-1990.
FEATURES from to/span description
pept 340 1443 apurinic endonuclease (APN1)
signal 265 269 TATA box
BASE COUNT 607 a 287 c 372 g 549 t
ORIGIN Chromosome 11.
1 cccgagcaca agaaaataaa aaattgaaca aaaataagaa taaagtcaca aagggaagaa
61 gatgaggggg ttctttttcc agactttctt cttttggtcc ttcaccttgg atatatgaac
121 atagtagtat tattgaacac gaactttttt catctaatac ttctcttttt ttgccagttt
181 ttctgggaac ttggaaactg gaatttttca ttcttacctg gtttcttttt cttcatttat
241 ttacatgttg gaaatcattg taaatataag atgacaaact ccgaataaga aacacaaaac
301 gcaacattaa taagcttttg gcatatcgga accatcgtaa tgccttcgac acctagcttt
361 gttagatctg ctgtctcgaa atacaaattt ggtgcgcaca tgtcaggtgc cggtggaatt
421 tctaatagtg taactaatgc atttaacact ggctgtaatt cgtttgccat gtttttaaaa
481 tctccaagaa agtgggtttc tccgcagtat acacaggagg aaatagataa atttaagaaa
541 aactgtgcaa cttacaatta taacccattg acggacgtct taccgcatgg ccaatacttt
601 attaatttgg ctaatccgga tagggaaaag gcagaaaaga gttatgagtc attcatggac
661 gatctgaaca ggtgtgagca attagggata ggcctgtata acttacatcc aggttctact
721 ttgaaaggag accatcagtt gcagttaaaa caactagcct catatttgaa caaagcaatt
781 aaggaaacga aatttgttaa aattgtatta gaaaatatgg ctggtactgg aaatttagta
841 ggaagttctc tggtagattt gaaggaggtt attggaatga ttgaagataa atcaagaatc
901 ggcgtttgca tagatacatg ccatacattt gcagcaggct acgatatcag caccactgaa
961 acgtttaaca atttttggaa agagttcaat gatgtaatag ggttcaaata tctaagtgct
1021 gttcacttaa atgattccaa ggctccttta ggaagcaata gagatttgca tgaacgcttg
1081 ggtcaaggtt atttgggtat agatgtgttt agaatgatcg cgcactctga atacctgcag
1141 ggtatcccta ttgtcttgga aactccatac gaaaatgatg aaggctatgg taatgaaatc
1201 aaacttatgg agtggttgga atcaaagagt gagagcgaat tgttagaaga caaggagtat
1261 aaagaaaaaa atgatacgct gcagaaatta ggtgctaaat cgcgtaagga acagttagac
1321 aagtttgagg ttaaacaaaa gaagcgagct gggggcacca agaggaagaa agccactgca
1381 gagccaagcg ataatgatat cttgtcacaa atgacaaaga agaggaagac taagaaagaa
1441 taatgagaag cgagaagaat tttaaatacg taatcaattt ttgtagatta tctcaacgta
1501 cgtaactaaa tgctaccctt cgtcatccaa gatgctgttg cccatttcat agaaactact
1561 tacttggttt atagatgttt gcgttgcgtt acagaataag gcagcctttt ttttttttag
1621 attttcattt ttcacatact gatggattag cagattataa aagggtaaag ttaaaatcat
1681 tccaaacttg aactaaggaa ggcgctagac catcctctta tggaacgtta ttcctaagca
1741 tagaaagttt tttacttgta atcttgaacg cggggtgtgt tctttatctt ttttttcttt
1801 tttacacaat tagac
//
LOCUS YSCCKA2 2393 bp ds-DNA PLN 08-AUG-1990
DEFINITION S.cerevisiae casein kinase-II alpha' subunit gene, complete cds.
ACCESSION M33759
KEYWORDS casein kinase-II.
SOURCE S.cerevisiae (strain S288C) DNA, (library of M.Snyder), clone
ScA2-1.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2393)
AUTHORS Padmanabha,R., Chen-Wu,J.L.-P., Hanna,D.E. and Glover,C.V.C.
TITLE Isolation, sequencing, and disruption of the yeast CKA2 gene:
Casein kinase II is essential for viability in Saccharomyces
cerevisiae
JOURNAL Mol. Cell. Biol. 10, 4089-4099 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.V.C.Glover, 19-APR-1990.
FEATURES from to/span description
pept 520 1539 casein kinase-2
signal 337 342 TATA box
BASE COUNT 771 a 470 c 455 g 697 t
ORIGIN
1 agatctacat actgaatgaa ttgtcccctc aactctgaag ttgatttact tgctgtaggt
61 cgattgctcc aaagttggat atccctaatg accaaacttc cctgctttcc gctaagagta
121 aaatttagtg tgcataattt ccgtgacatc aaaagtcagg cgaattggtg aatgctgtcc
181 cctttcaagg cagtctatca tttcttagtt acctgccaac aggcctttat caggcctcaa
241 agaaaaaaat ttccttgcgt agcccgcttt tcgcccgaac ctgacaaagc aaacagacga
301 tcgaaaaatt tccaacgatc ttatctaggt ggaagttata aaaagaagaa taaagcatcg
361 tcaacctttt taacctgaat tgggaaacaa cgggcacaca catttgttgg agtttgtatg
421 attgctaaaa aataaaaagt aattaaaaaa acatcataga gttgtaaggt ctcagggaaa
481 tagaaggaac aataaaccta aaagaataga agaaacagaa tgccattacc tccgtcaaca
541 ttgaaccaga aatctaatag agtctactct gtagctaggg tgtacaagaa tgcctgcgag
601 gagagaccac aagaatactg ggactacgaa caaggggtga ccatcgattg gggaaagatt
661 tccaattacg aaattatcaa caaaattgga agagggaaat attccgaagt gttcagcggt
721 agatgtattg taaacaacca gaagtgtgtt attaaagttt taaaaccagt taaaatgaaa
781 aaaatttata gagagttgaa aattctgacc aatctaacag gcggccccaa tgttgttggc
841 ctttatgata tagtacaaga cgctgactcc aaaatacctg ctttgatctt tgaggaaatc
901 aaaaatgttg atttcagaac tttatatcct acattcaaac ttcctgacat ccagtattat
961 ttcacgcaat tattgattgc gttagactac tgtcactcca tgggcataat gcacagagac
1021 gtaaagcctc agaatgtcat gattgatcct acggaacgta aactaaggct gatcgattgg
1081 ggcctggcgg agttctacca tccaggtgta gattacaacg ttcgtgtcgc ttcgcgttac
1141 cacaagggac cagaactttt agtaaacttg aaccaatatg actactccct agacttatgg
1201 tcagtaggat gcatgctagc agctattgtc ttcaaaaaag aacctttttt caaagggtcg
1261 tctaatccag atcaactggt aaagattgcc acagtactag gaaccaagga actgttaggc
1321 tatttgggta agtacgggtt gcacttacca tctgaatacg acaacattat gagagacttt
1381 acaaaaaaat cgtggacaca ctttataacc tccgagacca aattagctgt tcctgaagtg
1441 gttgatttaa tcgacaattt attaaggtat gaccatcaag aaagattaac agcaaaggag
1501 gctatggatc ataagttttt caaaacgaag tttgaataat tggacaacaa tactcttagc
1561 aaggcaattc tttttccacc actgatttcc atttataaaa agcaggggtt ttcctttttt
1621 gtagcttgat attagagaat atccttaatg tgaatattaa attgcaaact ctatttaaca
1681 acgaatgtag attattatat gaccagaaaa tcactttata tagatataag agccttccgt
1741 aatgctatgc aagttttgat gaattattat attgtcgtga ggcgtgttcc gaggagtttc
1801 gtcgccattg agatagagat gctcgtaatc gtacgtgtct ggaagtgcct gccgacggta
1861 gcgatgtcta ctctcgtccc agacaatgga atagttttga aattgttcta catcatttat
1921 tagaacgcac gattcacgaa atctcccacg tttgtcaata tctcgcctac gaacagtgtc
1981 atcgaatgat attgaacgct gagagagcca tttacctttt gagtaaaata cgctgccttt
2041 tttcagcaat gaagaaccac ttccctcctt tcttacatcc aataattgct tcccttgttc
2101 ctttatattc cccataggaa agtccgttgg agtaggttgc tctctttcat gtattccaac
2161 cagatcaggt ccgtaaagaa cagtaacgtc tatttcttta ttccagttta tgcttagtgg
2221 atcaattttg tattcatcta aattgaagtg ctttatggac cacctcctcc atatacaatg
2281 atacaaccta gccatatata tgctatagtc aaactcattt tcagaagagc aactgggtaa
2341 cctcaataat ttatgcttgc gcattttcag caacaagcgc caggtagtgg aca
//
LOCUS BSURPOF 3078 bp ds-DNA BCT 08-AUG-1990
DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB),
complete cds.
ACCESSION M34995
KEYWORDS RNA polymerase; minor sigma-37 factor.
SOURCE B.subtilis (strains 168 and Marburg), clones
lambda-[21,gt11,1,11,21,31].
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 368 to 2297)
AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W.
TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus
subtilis RNA polymerase: Isolation, nucleotide sequence,
chromosomal locus, and cryptic function
JOURNAL J. Bacteriol. 169, 771-778 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 3078)
AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W.
TITLE Similar organization of the sigB and spoIIA operons encoding
alternate sigma factors of Bacillus subtilis RNA polymerase
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.W.Price, 18-FEB-1987. Draft entry and computer-readable
sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990.
FEATURES from to/span description
pept 200 529 ORF V
pept 526 1008 ORF W
pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg
start codon)
pept 1762 2361 ORF X
mRNA 131 2368 sigB operon mRNA
signal 131 168 sigB operon promoter
signal 2368 2401 sigB transcription termination signal
binding 188 194 ORF V ribosomal binding site
binding 511 519 ORF W ribosomal binding site
binding 954 961 sigB ribosomal binding site
binding 1750 1753 ORF X ribosomal binding site
BASE COUNT 999 a 575 c 755 g 749 t
ORIGIN 40 degrees on B.subtilis map.
1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt
61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt
121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg
181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat
241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct
301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat
361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg
421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt
481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct
541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg
601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc
661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg
721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa
781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac
841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat
901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg
961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga
1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg
1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca
1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc
1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg
1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc
1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga
1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg
1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac
1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca
1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct
1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat
1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga
1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta
1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct
1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac
1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc
1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg
2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt
2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc
2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata
2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta
2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat
2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt
2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg
2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac
2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga
2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac
2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca
2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag
2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa
2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt
2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt
2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc
3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg
3061 caaactgctg cgaagctt
//
LOCUS HUMERPR 1624 bp ss-mRNA PRI 08-AUG-1990
DEFINITION Human erythropoietin receptor mRNA, complete cds.
ACCESSION M34986
KEYWORDS erythropoietin receptor.
SOURCE Human cell line OCI-M1, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1624)
AUTHORS Ehrenman,K. and St John,T.
TITLE Sequence of the human erythropoietin gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.St.John, 07-JUN-1990.
Author address: T.St.John
Hutchinson Cancer Res Center
1124 Columbia St.
Seattle, WA
Email: TSTJOHN@FHCRCVAX
FEATURES from to/span description
pept 1 1527 erythropoietin receptor precursor
sigp 1 72 erythropoietin receptor signal peptide
matp 73 1524 erythropoietin receptor
BASE COUNT 276 a 537 c 498 g 313 t
ORIGIN
1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct
61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc
121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg
181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc
241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt
301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta
361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat
421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc
481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac
541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag
601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc
661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg
721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg
781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag
841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac
901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc
961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg
1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc
1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg
1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc
1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct
1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca
1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct
1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta
1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg
1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata
1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc
1621 agga
//
LOCUS HUMGFP40H 4379 bp ds-DNA PRI 08-AUG-1990
DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete
cds.
ACCESSION M30135
KEYWORDS T-cell growth factor; cytokine; mast cell growth factor;
megakeryoblastic leukemia cell growth factor.
SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4379)
AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
Van Snick,J.
TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic
organization, and comparison with the mouse gene
JOURNAL J. Immunol. 144, 4235-4241 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Van Snick, 12-DEC-1989.
FEATURES from to/span description
pept 796 909 P40 T-cell and mast cell growth factor
precursor, exon 1
1022 1057 P40 T-cell and mast cell growth factor
precursor, exon 2
1140 1172 P40 T-cell and mast cell growth factor
precursor, exon 3
2461 2592 P40 T-cell and mast cell growth factor
precursor, exon 4
4096 4215 P40 T-cell and mast cell growth factor
precursor, exon 5
sigp 796 849 P40 T-cell and mast cell growth factor signal
peptide
matp 850 909 P40 T-cell and mast cell growth factor
1022 1057 P40 T-cell and mast cell growth factor
1140 1172 P40 T-cell and mast cell growth factor
2461 2592 P40 T-cell and mast cell growth factor
4096 4212 P40 T-cell and mast cell growth factor
IVS 910 1021 hP40 intron A
IVS 1058 1139 hP40 intron B
IVS 1173 2460 hP40 intron C
IVS 2593 4095 hP40 intron D
signal 742 747 TATA box
signal 4354 4359 polyA signal
rpt 2965 3265 Alu repeat
binding 370 376 AP-2 binding site
binding 624 629 IRF-1 binding site
binding 627 633 AP-1 binding site
BASE COUNT 1208 a 921 c 904 g 1346 t
ORIGIN
1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt
61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc
121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca
181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct
241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact
301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc
361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag
421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa
481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt
541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt
601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa
661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt
721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc
781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc
841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac
901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat
961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta
1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac
1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg
1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg
1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat
1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt
1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt
1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag
1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa
1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc
1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg
1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc
1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc
1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat
1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac
1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc
1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg
1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt
2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc
2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga
2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc
2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa
2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa
2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt
2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag
2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg
2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac
2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt
2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc
2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc
2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct
2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag
2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa
2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg
3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg
3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat
3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca
3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc
3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt
3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac
3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg
3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc
3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt
3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca
3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca
3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact
3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga
3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt
3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc
3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc
3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg
4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa
4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac
4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg
4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc
4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga
4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa
//
LOCUS MEFDNABP 1710 bp ds-DNA BCT 08-AUG-1990
DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds.
ACCESSION M34778
KEYWORDS DNA binding protein HMfB.
SOURCE M.fervidus DNA.
ORGANISM Methanothermus fervidus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanothermaceae.
REFERENCE 1 (sites; for [2])
AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
TITLE DNA binding protein HMf, isolated from the hypothermophilic
archaeal Methanothermus fervidus is most closely related to
histones
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1710)
AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Sandman, 01-JUN-1990.
Author address: K.Sandman
Department of Microbiology
Ohio State University
484 W. 12th Ave.
Columbus, Ohio 43210
E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU
FEATURES from to/span description
pept 578 787 DNA binding protein HMfB
pept 1662 805 (c) ORF285
signal 450 457 promoter
signal 523 530 promoter
signal 790 814 transcription termination signal
BASE COUNT 619 a 263 c 265 g 561 t 2 others
ORIGIN 1 bp upstream of HindIII site.
1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa
61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag
121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag
181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg
241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat
301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat
361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa
421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt
481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt
541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat
601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct
661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag
721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa
781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat
841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc
901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt
961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat
1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa
1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg
1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata
1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc
1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc
1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc
1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat
1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc
1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt
1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt
1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa
1681 gttttgtttt acccattgct tctaaatatt
//
LOCUS MUSINT1A 5607 bp ds-DNA ROD 08-AUG-1990
DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds.
ACCESSION K02593 M34750
KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene.
SOURCE Mouse (BALB/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1098 to 5606)
AUTHORS Van Ooyen,A. and Nusse,R.
TITLE Structure and nucleotide sequence of the putative mammary oncogene
int-1; Proviral insertions leave the protein-encoding domain intact
JOURNAL Cell 39, 233-240 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 5607)
AUTHORS Nusse,R., Theunissen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A.,
Otte,A., Schuuring,E. and van Ooyen,A.
TITLE The Wnt-1 (int-1) oncogene promoter and its mechanism of activation
by insertion of proviral DNA of the mouse mammary tumor virus
JOURNAL Mol. Cell. Biol. 10, 4170-4179 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [2] kindly submitted
by R.Nusse, 31-MAY-1990.
FEATURES from to/span description
pept 1779 1882 Wnt-1 protein (exon 1)
2452 2705 Wnt-1 protein (exon 2)
3279 3544 Wnt-1 protein (exon 3)
4002 4490 Wnt-1 protein (exon 4)
IVS 1883 2451 Wnt-1 cds intron A
IVS 2706 3278 Wnt-1 cds intron B
IVS 3545 4001 Wnt-1 cds intron C
site 1133 1134 MMTV insertion site in tumor 17 [1]
site 1394 1395 MMTV insertion site in tumor 35 [1]
site 1631 1632 MMTV insertion site in tumor 102 [1]
site 4495 4496 MMTV insertion site in tumor 53 [1]
mRNA 1428 5398 Wnt-1 mRNA (alt.)
mRNA 1595 5398 Wnt-1 mRNA (alt.)
signal 1562 1566 TATA box
signal 1573 1577 TATA box
BASE COUNT 1188 a 1681 c 1519 g 1219 t
ORIGIN Chromosome 15.
1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc
61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac
121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac
181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa
241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg
301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg
361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac
421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct
481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact
541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac
601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag
661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg
721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga
781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca
841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact
901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc
961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca
1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg
1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga
1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg
1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg
1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc
1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc
1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg
1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt
1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa
1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag
1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact
1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt
1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc
1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg
1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg
1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag
1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct
2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc
2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct
2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt
2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg
2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg
2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg
2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt
2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga
2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat
2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg
2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa
2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc
2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag
2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg
2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg
2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc
3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag
3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag
3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact
3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac
3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat
3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc
3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg
3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc
3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg
3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg
3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag
3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag
3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt
3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca
3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac
3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt
3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc
4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc
4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc
4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg
4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat
4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt
4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc
4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg
4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc
4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg
4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc
4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga
4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc
4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac
4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac
4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc
4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa
4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc
5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc
5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac
5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga
5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac
5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct
5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat
5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca
5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga
5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat
5581 gaccctaaat gagaccaact agggatc
//
LOCUS MUSP40M 3808 bp ds-DNA ROD 08-AUG-1990
DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete
cds.
ACCESSION M30136
KEYWORDS T-cell growth factor; cytokine; mast cell growth factor.
SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210
DNA, clone lambda-S40.1a.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3808)
AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
Van Snick,J.
TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic
organization, and comparison with the mouse gene
JOURNAL J. Immunol. 144, 4235-4241 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Van Snick, 21-NOV-1989.
FEATURES from to/span description
pept 644 757 P40 T-cell and mast cell growth factor
precursor, exon 1
846 881 P40 T-cell and mast cell growth factor
precursor, exon 2
965 997 P40 T-cell and mast cell growth factor
precursor, exon 3
2184 2315 P40 T-cell and mast cell growth factor
precursor, exon 4
3393 3512 P40 T-cell and mast cell growth factor
precursor, exon 5
sigp 644 697 P40 T-cell and mast cell growth factor signal
peptide
matp 698 757 P40 T-cell and mast cell growth factor
846 881 P40 T-cell and mast cell growth factor
965 997 P40 T-cell and mast cell growth factor
2184 2315 P40 T-cell and mast cell growth factor
3393 3509 P40 T-cell and mast cell growth factor
IVS 758 845 mP40 intron A
IVS 882 964 mP40 intron B
IVS 998 2183 mP40 intron C
IVS 2316 3392 mP40 intron D
signal 589 594 TATA box
binding 227 233 AP-2 binding site
binding 475 480 IRF-1 binding site
binding 478 484 AP-1 binding site
BASE COUNT 1012 a 903 c 869 g 1024 t
ORIGIN Chromosome 13.
1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca
61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt
121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag
181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc
241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga
301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt
361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt
421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga
481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg
541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta
601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat
661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg
721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga
781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac
841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag
901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt
961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc
1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct
1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat
1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga
1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa
1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac
1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac
1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc
1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca
1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta
1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac
1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc
1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct
1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta
1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat
1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg
1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc
1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg
2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa
2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt
2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt
2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa
2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca
2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc
2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag
2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg
2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct
2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag
2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg
2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag
2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat
2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc
2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct
2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc
3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga
3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg
3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga
3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa
3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt
3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc
3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc
3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga
3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg
3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag
3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga
3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac
3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat
3781 aggtaggatg gagcaaagac ttggtaat
//
LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 08-AUG-1990
DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete
cds.
ACCESSION M29938
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone
ZQ30-1.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1105)
AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
LeGuern,C.
TITLE Class II genes of miniature swine: III. Characterization of an
expressed pig class II gene homologous to HLA-DQA
JOURNAL Immunogenetics 31, 52-56 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept 13 777 MHC SLA-DQ alpha chain precursor
sigp 13 81 MHC SLA-DQ alpha chain signal peptide
matp 82 774 MHC SLA-DQ alpha chain
mRNA < 1 1105 MHC SLA-DQ mRNA
BASE COUNT 254 a 290 c 255 g 306 t
ORIGIN
1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc
61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta
121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag
181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa
241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg
301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact
361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac
421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt
481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc
541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat
601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag
661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc
721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc
781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa
841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc
901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg
961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc
1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
1081 aaacaataaa tcacttcatg aggtt
//
LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 08-AUG-1990
DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete
cds.
ACCESSION M29939
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1174)
AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
LeGuern,C.
TITLE Class II genes of miniature swine: III. Characterization of an
expressed pig class II gene homologous to HLA-DQA
JOURNAL Immunogenetics 31, 52-56 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept 69 836 MHC SLA-DQ alpha chain precursor
sigp 69 137 MHC SLA-DQ alpha chain signal peptide
matp 138 833 MHC SLA-DQ alpha chain
BASE COUNT 273 a 313 c 266 g 322 t
ORIGIN
1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg
61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga
121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg
181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat
241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta
301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca
361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt
421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca
481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt
541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca
601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata
661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga
721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct
781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca
841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac
901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct
961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga
1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc
1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
1141 aaacaataaa tcacttcatg aggtttttgt actg
//
LOCUS PIGMHDQBA 212 bp ds-DNA MAM 08-AUG-1990
DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end.
ACCESSION M29944
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F.,
Gustafsson,K. and LeGuern,C.
TITLE Class II genes of miniature swine: II. Molecular identification and
characterization of B (beta) genes from SLA-c haplotype
JOURNAL Immunogenetics 31, 1-6 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept < 1 12 MHC SLA-DO beta chain
BASE COUNT 38 a 58 c 52 g 64 t
ORIGIN
1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg
61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc
121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct
181 cttacccttc tgcctttctt cctcctactt gt
//
LOCUS PP2LEND 266 bp ds-DNA PHG 08-AUG-1990
DEFINITION Bacteriophage P2 left end DNA.
ACCESSION M34756
KEYWORDS .
SOURCE Bacteriophage P2 DNA.
ORGANISM Bacteriophage P2
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 266)
AUTHORS Ziermann,R. and Calendar,R.
TITLE Characterization of the cos sites of Bacteriophages P2 and P4
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Ziermann, 05-JUN-1990.
Author address: R.Ziermann
University of California
Dept. Mol. Cell Biology
Berkeley, CA 94720
Email: calendar@garnet.berkeley.edu
FEATURES from to/span description
site 1 19 cohesive end of phage DNA
site 20 266 left end of phage DNA
BASE COUNT 41 a 58 c 99 g 68 t
ORIGIN Map position 0.0-0.8%.
1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct
61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt
121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg
181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc
241 gagccagccg tttatctcgc ggatcc
//
LOCUS RCABCHH 314 bp ds-DNA BCT 08-AUG-1990
DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and
F108) genes, 3' end, and 5' end respectively.
ACCESSION M34843
KEYWORDS .
SOURCE R.capsulata DNA.
ORGANISM Rhodobacter capsulatus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 314)
AUTHORS Yang,Z. and Bauer,C.E.
TITLE Rhodobacter capsulatus genes involved in early steps of the
bacteriochlorophyll biosynthetic pathway
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.E.Bauer, 04-JUN-1990.
FEATURES from to/span description
pept < 1 79 bacteriochlorophyll biosynthesis protein
(bchH) (AA at 2)
pept 108 > 314 bacteriochlorophyll biosynthesis protein
(F108)
BASE COUNT 59 a 87 c 119 g 49 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga
61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg
121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg
181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg
241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca
301 agcgcgtgct gcag
//
LOCUS TREANTAT 999 bp ds-DNA ORG 08-AUG-1990
DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle.
ACCESSION M34848
KEYWORDS .
SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA.
ORGANISM Kinetoplast Trypanosoma evansi
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Trypanosoma evansi.
REFERENCE 1 (bases 1 to 999)
AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
Steinert,M. and Hamers,R.
TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Bajyana Songa, 05-JUN-1990.
FEATURES from to/span description
site 367 516 kinetoplast minicircle common region (ORI)
BASE COUNT 391 a 78 c 203 g 327 t
ORIGIN
1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS VIBSCRRE 1524 bp ds-DNA BCT 08-AUG-1990
DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds,
and sucrose transport (scrA) protein, 5' end.
ACCESSION M35009
KEYWORDS Scr repressor protein; sucrose transport protein.
SOURCE V.alginolyticus DNA.
ORGANISM Vibrio alginolyticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 1524)
AUTHORS Blatch,G.L. and Woods,D.R.
TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr
repressor-encoding gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.L.Blatch, 10-JUN-1990.
Author address: G.L.Blatch
University of Cape Town
Dept. Microbiology
Private Bag
Rondebosch
7700 SOUTH AFRICA
Email: bltgre01.uctvax@f4.n494.z5.fidonet.org
FEATURES from to/span description
pept 36 < 1 (c) sucrose transport protein (scrA)
pept 271 555 Scr repressor protein (scrR)
pept 791 1090 ORF5
binding 264 267 scrR ribosomal binding site
binding 780 785 ORF5 ribosomal binding site
signal 178 183 scrA -10 region
signal 203 208 scrA -35 region
signal 190 195 scrR -10 region
signal 162 167 scrR -10 region
signal 762 767 ORF5 -35 region
signal 742 747 ORF5 -35 region
site 78 91 repressor operator site (put.)
site 229 242 repressor operator site (put.)
site 12 71 ScrR helix-turn-helix motif
BASE COUNT 466 a 344 c 321 g 393 t
ORIGIN
1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc
61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg
121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt
181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc
241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg
301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa
361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc
421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg
481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt
541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga
601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt
661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac
721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag
781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt
841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga
901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg
961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga
1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta
1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc
1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt
1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt
1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat
1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct
1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca
1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa
1501 gccaattgtg atattgaaga agct
//
LOCUS XELNRP1A 2431 bp ss-mRNA VRT 08-AUG-1990
DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA,
complete cds.
ACCESSION M34894
KEYWORDS RNA-binding protein; nervous system-specific protein.
SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2431)
AUTHORS Richter,K., Good,P.J. and Dawid,I.B.
TITLE A developmentally regulated, nervous system-specific gene in
Xenopus encodes a putative RNA-binding protein
JOURNAL New Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.Good, 05-JUN-1990.
FEATURES from to/span description
pept 74 1117 pot. RNA-binding protein (nrp-1B)
mRNA < 1 2431 nrp-1B mRNA
signal 2407 2418 poly-A signal
site 22 27 RNP2 consensus sequence
site 58 68 RNP1 consensus sequence
site 111 116 RNP2 consensus sequence
site 147 157 RNP1 consensus sequence
site 921 941 alternative exon (put.)
BASE COUNT 663 a 522 c 543 g 703 t
ORIGIN
1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt
61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc
121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact
181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt
241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa
301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt
361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt
421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga
481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac
541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa
601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt
661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat
721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc
781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc
841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc
901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag
961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag
1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc
1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct
1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct
1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga
1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat
1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg
1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat
1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta
1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa
1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat
1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac
1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg
1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc
1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat
1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc
1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga
1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct
2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg
2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc
2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat
2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg
2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca
2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata
2401 gttcaaaata aactatgaat tttattcatt t
//
LOCUS XELNRP1B 2324 bp ss-mRNA VRT 08-AUG-1990
DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA,
complete cds.
ACCESSION M34895
KEYWORDS RNA-binding protein; nervous system-specific protein.
SOURCE X.laevis stage 24, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2324)
AUTHORS Richter,K., Good,P.J. and Dawid,I.B.
TITLE A developmentally regulated, nervous system-specific gene in
Xenopus encodes a putative RNA-binding protein
JOURNAL New Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.Good, 05-JUN-1990.
FEATURES from to/span description
pept 29 1051 pot. RNA-binding protein (nrp-1B)
mRNA < 1 2324 nrp-1B mRNA
signal 2306 2311 poly-A signal
site 22 27 RNP2 consensus sequence
site 58 68 RNP1 consensus sequence
site 111 116 RNP2 consensus sequence
site 147 157 RNP1 consensus sequence
BASE COUNT 647 a 467 c 520 g 690 t
ORIGIN
1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc
61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac
121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt
181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca
241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga
301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat
361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca
421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg
481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt
541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc
601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat
661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag
721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac
781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg
841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc
901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat
961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat
1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg
1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct
1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat
1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta
1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa
1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa
1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct
1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa
1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc
1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact
1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg
1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt
1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct
1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag
1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag
1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt
1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta
2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac
2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt
2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat
2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa
2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc
//
LOCUS YSJSACDHY 2569 bp ds-DNA PLN 08-AUG-1990
DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds.
ACCESSION M34929
KEYWORDS saccharopine dehydrogenase.
SOURCE Y.lipolytica (strain W29) DNA.
ORGANISM Yarrowia lipolytica
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2569)
AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and
Gaillardin,C.
TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia
lipolytica
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.M.Fournier, 06-JUN-1990.
ORF1 is probably not translated, since a lacZ fusion/expression
experiment did not produce results.
FEATURES from to/span description
pept 649 1929 ORF1
pept 1764 655 (c) saccharopine dehydrogenase
mRNA 582 2060 ORF1 mRNA (alt.)
mRNA 598 2060 ORF1 mRNA (alt.)
mRNA 599 2060 ORF1 mRNA (alt.)
mRNA 602 2060 ORF1 mRNA (alt.)
mRNA 606 2060 ORF1 mRNA (alt.)
mRNA 609 2060 ORF1 mRNA (alt.)
mRNA 1807 639 (c) ORF2 mRNA (alt.)
mRNA 1809 639 (c) ORF2 mRNA (alt.)
mRNA 1811 639 (c) ORF2 mRNA (alt.)
mRNA 1812 639 (c) ORF2 mRNA (alt.)
mRNA 1814 639 (c) ORF2 mRNA (alt.)
signal 521 528 TATA box
signal 565 575 TATA box
BASE COUNT 668 a 573 c 666 g 662 t
ORIGIN
1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag
61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt
121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca
181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa
241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag
301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc
361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc
421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt
481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca
541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt
601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc
661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt
721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc
781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc
841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac
901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt
961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca
1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt
1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc
1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt
1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata
1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc
1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc
1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct
1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg
1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg
1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc
1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt
1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag
1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg
1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag
1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg
1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga
1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa
2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca
2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg
2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc
2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct
2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag
2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga
2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat
2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat
2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc
//GenBank-Updates@genbank.bio.net (08/09/90)
LOCUS INS43AAA 130 bp ds-DNA BCT 09-AUG-1990
DEFINITION Insertion sequence IS2-43.
ACCESSION M25093
KEYWORDS RNA polymerase binding site; insertion sequence;
insertion sequence IS2.
SOURCE Insertion sequence IS2 DNA.
ORGANISM Insertion sequence IS2
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 130)
AUTHORS Sommer,H., Cullum,J. and Saedler,H.
TITLE IS2-43 and IS2-44: New alleles of the insertion sequence IS2 which
have promoter activity
JOURNAL Mol. Gen. Genet. 175, 53-56 (1979)
STANDARD simple staff_entry
FEATURES from to/span description
BASE COUNT 41 a 23 c 22 g 44 t
ORIGIN
1 cctaagacat caatcatctg ttctccaatg actagtctaa aaactagtat taagactatc
61 acttatttaa gtgatatact tatttaagtg atattggttg tctggagatt cagggggcca
121 gtctaatacc
//
LOCUS PSCIS1IN 146 bp ds-DNA BCT 09-AUG-1990
DEFINITION Plasmid pDG128 insertion element IS1 target region sequence.
ACCESSION M25018
KEYWORDS insertion element; insertion element IS1.
SOURCE Plasmid pDG128, a derivative of Plasmid pSC101, DNA, clone
128/10R7.
ORGANISM Plasmid pSC101
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 146)
AUTHORS Sommer,H., Schumacher,B. and Saedler,H.
TITLE A new type of IS1-mediated deletion
JOURNAL Mol. Gen. Genet. 184, 300-307 (1981)
STANDARD simple staff_entry
FEATURES from to/span description
BASE COUNT 38 a 33 c 34 g 41 t
ORIGIN
1 gctgcgaaaa tgccttatct ggcctacaga ttcgatgcga ttcgtaggtc ggataagatg
61 cgcaagcatc gcatccgaca ataagtgccg aatgcgacct acattcacat ggcgcttttt
121 acatctgacg gtttttattg aagtta
//
LOCUS BRVRNASA 197 bp ss-mRNA VRL 09-AUG-1990
DEFINITION Berne virus ORF5 mRNA, 5'end.
ACCESSION M33503 M33501
KEYWORDS core protein.
SOURCE Berne virus (strain P138/72) viral RNA.
ORGANISM Berne virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Toroviridae.
REFERENCE 1 (bases 1 to 197)
AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M.
TITLE A 3'-coterminal nested set of independently transcribed mRNAs is
generated during Berne virus replication
JOURNAL J. Virol. 64, 331-338 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 137 > 197 ORF5
mRNA 113 > 197 RNA5
BASE COUNT 50 a 27 c 38 g 82 t
ORIGIN
1 ttatttcttc ttcctacttt gtggctactt gggttttgtt ggtggtggtt attattttag
61 tatttataat tataagtttt tgtattagta attaagtagg ttagtgagag acactatctt
121 tagagaaaga gccaagatga attctatgct taatccaaat gctgtgccat ttcaaccatc
181 acctcaggtt gttgcat
//
LOCUS BRVRNASB 179 bp ss-RNA VRL 09-AUG-1990
DEFINITION Berne virus ORF3 mRNA, 5' end.
ACCESSION M33502
KEYWORDS core protein.
SOURCE Berne virus (strain P138/72) viral RNA.
ORGANISM Berne virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Toroviridae.
REFERENCE 1 (bases 1 to 179)
AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M.
TITLE A 3'-coterminal nested set of independently transcribed mRNAs is
generated during Berne virus replication
JOURNAL J. Virol. 64, 331-338 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 153 > 179 ORF3
BASE COUNT 52 a 17 c 34 g 76 t
ORIGIN
1 ttataatctt cttcctactt ggattacatg gcttacttta ggttttagtt tgtttagtat
61 agtaataagt ggtattaata ttattttgtt ttttgaaatg aatggtaagg tgaagaaaag
121 ttagtcactt tctttagaag aaggttgcca aaatgtttga gaccaattat tggccattt
//
LOCUS CHKGLOBA 1204 bp ds-DNA VRT 09-AUG-1990
DEFINITION Chicken pie-alpha-globin gene, fragment H3/H4.
ACCESSION M30485
KEYWORDS pie-alpha-globin.
SOURCE Chicken AEV transformed erythroblast DNA, fragment H3/H4.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 1204)
AUTHORS Broders,F., Zahraoui,A. and Scherrer,K.
TITLE The chicken alpha-globin gene domain is transcribed into a
17-kilobase polycistronic RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 91 > 1204 pie-alpha-globin mRNA fragment H3/H4 (put.)
BASE COUNT 282 a 252 c 263 g 407 t
ORIGIN
1 ggatctatct agttgctgca gtcgtttgta tgaaggttgg atccatcctg ttttgtactg
61 gatgactgcc ttcaattcac tggcaatcta ggatcaaatg tgtcctagag aacattcaat
121 atcgcttttt ttctaagctg ttgcaagcca gaatggttac ttttgagctg atctcggtgg
181 agcagttgag ttgttgtaag ttatttctta atggctccag aaaattacat catttaggtg
241 ctataactct ccatttccat cttgtatgcg taattgcatt tcttgaatac ttcagacatt
301 aatttcccgt cctacctgca ggttactggt gtgtattggc tatacagatt acttttccac
361 agatgtaacc ctaggtcttt tgaatataga tcccatctat tgtctgctta gagaccccga
421 taaccctccc gataaatcag agtccatgtt ttttgacagt atatcggtgt gaacatctgg
481 attttagtgc aatatgctag tagcaatctg agtccccgtt tctaagacag agtcatttag
541 tccgagaatg gctgtttaag actccaaatg gcagtcttga gtcttttagt gactgtactc
601 gttcctctac tgagggcagt cttgagtgtt ttagtgactg taccctgtct cttaacttga
661 ccggtctgat agatcttaaa tgacagtcgt ggccgcaatt tcaaatggaa gagctaggag
721 tctcaggaac cgtcgccctt gtttactctt atgtttaccc gttaagccgt catgaaaagg
781 atttttctgt agagaacggt tatatgagtt gtattccatc tagggtcacg gcccctagac
841 caaccaacga cgagtcgatt tgttgtctgg cactttctgt gacttcaagt tttgtggctt
901 tctctattaa ctttccccac aacgtaactg tctaacttag atgttggcgc gagaactaca
961 gtctgaggga cttgtcaaga gctggcacac tcgcctttat gttaaagtgt gtcctttgtc
1021 gatactggta ctaatgctta agctcgagcg ggcccctaga ccaacgacga gtcgatttgt
1081 tgtctggctc tttctgtgac ttcaagtttt gtggctttct ctattaactt tcccacaacg
1141 taactgtcta attagatgtt ggcgcgagaa tacagtctga gggattgtca agagtggact
1201 ggtt
//
LOCUS CHKGLOBB 582 bp ds-DNA VRT 09-AUG-1990
DEFINITION Chicken pie-alpha-globin gene, fragment H10.
ACCESSION M30486
KEYWORDS pie-alpha-globin.
SOURCE Chicken AEV transformed erythroblast DNA, fragment H10.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 582)
AUTHORS Broders,F., Zahraoui,A. and Scherrer,K.
TITLE The chicken alpha-globin gene domain is transcribed into a
17-kilobase polycistronic RNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 241 > 582 pie-alpha-globin fragment mRNA H10
BASE COUNT 171 a 128 c 108 g 175 t
ORIGIN
1 tccaaaaaac ttactctgct tgtaaatgtc gtctcctttt tcggagacaa aaacttgata
61 ccttcttgcc ttgtccgaag tcactttatc ggttatagga cccaagtttt gggccttgct
121 agaaggatac aattccctat gaccgccgta ttttggggta ctcgcattcg cccgacatcg
181 agtggacctc ctttttttct cttgtcgttc gtagaggtta tcgaggtccc cccatatata
241 ataaccctat cgtgagttta gacttcctac aaaaacttct gtcgtttaat gttttcgtac
301 cgtcacggtg actgtccagt aatcaaagtt gtcactgtct aaaaagattc gacaacttcg
361 tcttaccaat gcgaaaactc gactagagac actcgtcaac tcacacattc aataaagaat
421 taccgaggtc ttttaatgta gtgaaatcac gatattgaga ggtaaaggta gaaacatacg
481 cattaaccta aagaacttat gaagtctgta attaaaggac cacaagcaat acgaaagaca
541 atgtatttct tctaacgtcg gataagtatt aggatggacg tc
//
LOCUS ECOPHOAA 600 bp ds-DNA BCT 09-AUG-1990
DEFINITION E.coli alkaline phosphatase (phoA) gene, 5' end.
ACCESSION M33536
KEYWORDS alkaline phosphatase.
SOURCE E.coli (strain K-12) cell line BW7710 DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 600)
AUTHORS Agrawal,D.K. and Wanner,B.L.
TITLE A phoA structural gene mutation that conditionally affects
formation of the enzyme bacterial alkaline phosphatase
JOURNAL J. Bacteriol. 172, 3180-3190 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.K.Agrawal, 03-APR-1990.
The phoA503 mutation does not interfere with export of active
enzyme but does interfere with assembly.
FEATURES from to/span description
pept 283 > 600 alkaline phosphatase precursor (phoA) (EC
3.1.3.1)
sigp 283 345 alkaline phosphatase signal peptide
matp 346 > 600 alkaline phosphatase
variant 413 413 c in wild type; t in phoA503 mutation
BASE COUNT 159 a 130 c 151 g 160 t
ORIGIN Map position 8.7 minutes; 1 bp upstream of HindIII site.
1 aagctttgga gattatcgtc actgcaatgc ttcgcaatat ggcgcaaaat gaccaacagc
61 ggttgattga tcaggtagag ggggcgctgt acgaggtaaa gcccgatgcc agcattcctg
121 acgacgatac ggagctgctg cgcgattacg taaagaagtt attgaagcat cctcgtcagt
181 aaaaagttaa tcttttcaac agctgtcata aagttgtcac ggccgagact tatagtcgct
241 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt
301 gcactggcac tcttaccgtt actgtttacc cctgtgacaa aagcccggac accagaaatg
361 cctgttctgg aaaaccgggc tgctcagggc gatattactg cacccggcgg tgctcgccgt
421 ttaacgggtg atcagactgc cgctctgcgt gattctctta gcgataaacc tgcaaaaaat
481 attattttgc tgattggcga tgggatgggg gactcggaaa ttactgccgc acgtaattat
541 gccgaaggtg cgggcggctt ttttaaaggt atagatgcct taccgcttac cgggcaatac
//
LOCUS GCOEARA 1771 bp ds-DNA PLN 09-AUG-1990
DEFINITION G.tikvahiae McLachlan 18S ribosomal RNA gene.
ACCESSION M33640
KEYWORDS 18S ribosomal RNA.
SOURCE G.tikvahiae McLachlan (isolate Pomquet Harbour-Nova Scotia) DNA.
ORGANISM Gracilaria tikvahiae McLachlan
Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae;
Florideophycideae; Gigartinales; Gracilariaceae.
REFERENCE 1 (bases 1 to 1771)
AUTHORS Liu,Q.-Y., Bird,C.J., Rice,E.L., Murphy,C.A. and Ragan,M.A.
TITLE Nucleotide sequence of the 18S ribosomal RNA gene from the red alga
Gracilaria tikvahiae mclachlan
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Ragan 08-APR-1990. Atlantic Research Lab, National Research
Council of Canada, 1411 Oxford Street, Halifax, Nova Scotia CANADA
B3H 3Z1
FEATURES from to/span description
rRNA 1 1771 18S ribosomal RNA
BASE COUNT 445 a 371 c 501 g 454 t
ORIGIN
1 ccacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc
61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct
121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccata
181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt
241 tttttctgat cgcactcgtt gcgacgcacc gttcaaattt ctgacctatc aactttggat
301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg
361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc
421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta
481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc
541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc
601 tcgtagtcgg attttggcgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg
661 cctttgtgga tgggagtcag gtggtgcttc actggatcgc ttggctgccg ccaccgttta
721 ctgtgaaaaa attagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata
781 atagaatagg acccggtcct attttgttgg tttgtttgaa tcgggtaatg attaagaggg
841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg
901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga
961 tcgaagacga tcagataccg tcgtagtctt tactataaac gatgaggact ggagatcgga
1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg
1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg
1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg
1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt
1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag
1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc
1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc
1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat
1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa
1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct
1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat
1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt
1741 ttccgtaggt gaacctgcag aaggatcaag c
//
LOCUS HS6MCP 4440 bp ds-DNA VRL 09-AUG-1990
DEFINITION Human herpesvirus type 6 major capsid protein (MCP) gene, complete
cds.
ACCESSION M33515
KEYWORDS major capsid protein.
SOURCE Human herpesvirus type 6 DNA.
ORGANISM Human herpesvirus type 6
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 4440)
AUTHORS Littler,E., Lawrence,G., Liu,M.-Y., Barrell,B.G. and Arrand,J.R.
TITLE Identification, cloning, and expression of the major capsid protein
gene of human herpesvirus 6
JOURNAL J. Virol. 64, 714-722 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 235 4272 major capsid protein (MCP)
BASE COUNT 1422 a 1169 c 785 g 1064 t
ORIGIN
1 tatcgtgaac gatatttggc ccggacgttt gaaaaatttt ctctatgatt gactcgatct
61 tttccagaac tacaggcatg gatcgcgcta aacgagtttc ctcgtcgcga gacacttcag
121 cggtcagatc acacgaatct ataaaaactg gaatcgaccg tgcacaagtg gaaccaaaac
181 atgaattaac tattaaagtt tcacaattac cggtgtgctg cataacgccg aaacatggaa
241 aattggcagg cgaccgaaat tttacctaag atcgaagcac ctctaaatat tttcaatgac
301 attaaaacat acacagccga acaacttttt gacaatttgc gaatttattt cggtgacgat
361 ccgagccgtt acaacatcag ttttgaagcc ttactcggaa tctactgcaa caaaatagaa
421 tggattaact ttttcaccac gccgatcgcc gttgcagcga acgtaatccg cttcaatgat
481 gtgagtcgaa tgaccctcgg gaaggttctc ttctttattc aattacctag agtcgctaca
541 ggaaacgacg taactgcttc aaaagaaacc accatcatgg tagccaaaca ctcagaaaaa
601 caccccataa acatatcgtt cgatttgagc gctgcctgtc tggaacatct ggaaaacaca
661 tttaaaaaca cagtcatcga tcagatttta aacatcaatg cgttacatac agtcttaaga
721 tctttaaaga attcagccga ttcgctcgag cgaggtttga ttcacgcatt catgcaaacc
781 ttattgagaa aatctccccc gcaatttatc gtcctgacca tgaatgagaa caaagtacat
841 aataaacaag ctctgagccg agtacagcgc agcaacatgt ttcagagcct gaagaacaga
901 ttgttaacgt cattattttt tttgaacagg aataataata tttcatatat ctatagaatt
961 ctaaacgaca tgatggaatc ggtcacggaa agcattctaa atgatacgaa caactacact
1021 tccaaagaaa acgtccccct agatggtgtt ttattaggac cgatcggctc tatccaaaaa
1081 ctcaccagca tactctccca gtacatctcc acacaagtcg tctccgcccc aatctcatat
1141 ggtcacttta ttatgggcaa agaaaacgca gtgactgcga ttgcataccg tgcaatcatg
1201 gccgatttta ctcaattcac cgtgaacgcc gggacagaac aacaagacac taacaacaaa
1261 tcagaaatct tcgacaaaag ccgcgcgtac gccgacctaa agctgaacac gttgaaattg
1321 ggagataaat tagtcgcatt cgaccaccta cacaaagttt acaaaaacac agacgtcaac
1381 gatccgctag aacagagctt acaactaaca ttctttttcc ctttgggtat ctacataccg
1441 agcgagaccg gtttcagtac aatggaaaca cgtgtgaaat taaacgacac catggaaaac
1501 aacctaccca ccagcgtttt tttccacaat aaagaccaag tcgtgcagcg aattgatttt
1561 gccgacatat taccgtcggt ttgccatccc attgtccacg actcgaccat cgtcgaacga
1621 ctcatgaaaa gcgaaccatt gcctaccggc caccgctttt cccaactatg tcaactaaaa
1681 attacccgag aaaacccagc caggatctta cagaccttat acaacttata cgaaagtcga
1741 caagaagtac ccaaaaacac caacgtctta aaaaacgaat taaacattga agatttttac
1801 aaaccggaca atccaacact gccgaccgaa agacacccct tcttcgatct cacgtatatc
1861 cagaaaaacc gagccacaga agtactctgc acaccaagaa taatgatagg caacatacct
1921 ttaccgttag ctccagtctc tttccacgaa gcccgtacaa atcaaatact ggaacatgca
1981 aagacgaact gccaaaagta cgacttcacc ctcaaaattg tcaccgaaag cttgacgagt
2041 ggctcgtacc cagaattggc ttacgttatc gagaccttag tgcatggaaa caagcatgct
2101 tttatgatcc taaaacaagt aattagccag tgtatttctt attggtttaa catgaaacat
2161 atacttcttt tttgcaacag cttcgagatg atcatgctaa tctctaacca catgggcgac
2221 gaactgatcc cgggagcagc tttcgctcac tacagaaatc ttgtgtcgct aattcgccta
2281 gtgaagagaa caatctctat ctccaacctc aacgagcaac tttgcggcga acctctggtg
2341 aatttcgcca acgcgttgtt cgacggacgt ctgttctgcc cgttcgtcca taccatgccc
2401 agaaacgaca cgaatgcaaa aataacagcg gatgatacac cactgacaca gaacaccgta
2461 agagttagaa attacgaaat atccgatgtg caaagaatga atctaataga ttcaagcgtc
2521 gtctttaccg acaatgacag accatcgaac gaaaccacca tcctgagcga gatattttac
2581 ttctgcgtac tcccggcact atcaaataac aaggcctgtg gcgctggcgt caacgtaaag
2641 gaactagttc tagacttatt ctacacggaa ccgttcatca gtccagatga ttatttccag
2701 gagaatccga ttaccagcga cgttctaatg tctctgatcc gagaaggtat gggccctggc
2761 tacaccgtag ccaacacatc ctgtatcgca aaacagttgt ttaaatcgct aatctacatt
2821 aatgaaaata cgaaaatatt ggaagtggaa gtctccttag atcccgcgca gcgacacggc
2881 aactccgttc attttcaatc actacaacac attctataca acgggctttg cctgatctca
2941 ccgatcacca ccctaagacg gtactatcaa ccaatcccat ttcatcgatt cttctccgac
3001 ccgggaatct gcggcaccat gaatgctgat atccaagttt tcctaaatac atttcctcac
3061 tgtcaaagaa acgacggcgg ttttcctctc ccgcccccat tagcattaga attttataat
3121 tggcaacgaa caccgttttc cgtgtactca gccttctgcc ccaattccct gttgagcatt
3181 atgacgcttg ccgccatgca ctcaaaattg tctcccgttg ccatagcgat ccaaagcaaa
3241 aacaaaatcc atccgggctt tgcggccaca ctagtccgga cggataattt cgacgtcgag
3301 tgcctattat acagttccag agcagccaca tctataattt tagacgatcc cacggtcacc
3361 gcggaagcta aagatatcgc aaccacttac aacttcaccc agcacctaag ttttgtagat
3421 atgggcttag gttttagctc taccaccgcc actgccaatc ttaagcgaat taaatcagat
3481 atggggagca agatacaaaa ccttttctcc gccttcccga tacacgcgtt taccaacgcg
3541 gacataaata cgtggattcg acatcacgtc gggatagaaa aacctaatcc ctccgagagc
3601 gaagcactaa acatcataac gttcggcgga attaacaaaa acccaccctc catactactg
3661 catggtcaac aagctatctg cgaagttata ctgaccccgg ttacgacaaa cattaacttt
3721 ttcaaatcgc cccacaaccc aagaggcagg gaatcatgta tgatgggaac ggacccgcac
3781 aacgaagagg cggctagaaa agcattgtac gaccacaccc aaacagacag cgatacattc
3841 gccgcaacca caaacccttg ggcatctcta ccaggctcct taggcgatat tctatacaac
3901 acggcacaca gagaacaact atgttacaac cccaagacat acagtcccaa cgctcaattt
3961 tttaccgaat ctgacatctt aaaaacaaac aagatgatgt acaaagtgat aagcgaatac
4021 tgcatgaaat cgaactcgtg tttaaacagc gatagcgaaa tacaatactc gtgctctgag
4081 ggcacggata gcttcgtaag cagaccatgc cagttcttac aaaacgctct gcctcttcac
4141 tgttcatcca accaagctct attagagagt cggtctaaaa ccggcaatac gcagatcagc
4201 gaaacccatt attgtaatta cgccatagga gaaaccatac ctttccaact cattatcgaa
4261 tcatccatat aaaatggaaa ccgtctactg cactttcgat cacaaactgt cactttccga
4321 tatcagcacc ctatgcaagc tcatgaacat cgtcataccg atcccagctc accaccatct
4381 aataggtagc ggcaatttag gtctttatcc catcgtctcc tccaacaaag attacgtcca
//
LOCUS HUMSEXREPB 916 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human sex chromosome repeat, clone pDP330.
ACCESSION M33524
KEYWORDS sex chromosome repeat.
SOURCE Human cell line OXENII DNA, clone pDP320.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 916)
AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and
Page,D.C.
TITLE Human sex-chromosome-specific repeats within a region of
pseudoautosomal/Yq homology
JOURNAL Genomics 7, 625-628 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.M.C.Fisher, 03-APR-1990.
FEATURES from to/span description
site 1 388 low copy flanking sequence
rpt 389 916 sex chromosome repeat
BASE COUNT 206 a 228 c 229 g 253 t
ORIGIN Chromosome Yp.
1 gaattcaggc ctcagtgtat gtctgtaaca caacagacag ggtctgcagg ggtcgaagta
61 ttttgtcatc aaagaggaag gaatgatcat tcatcataaa aggcaagaca tctttggtgc
121 aaggaaaact caagaaaaat accgcagacc atgcaatgag gcactggtcg atggagtgtt
181 gtaaacccgt cttcccagag tggcatgcac atggatccct cagcacatgg gtgacacaca
241 gactatgctt cagcaggtct gtctgggccc aagacacatt gtttctcatc agctcccagg
301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct
361 tcccagagtt gttacaaaga aatcccaaag actgggcccc ttaaacaaca accttgattc
421 tcacagtcct tgaggctaga agtctgagat caagctatgg ccagggctgg ttcctcctga
481 ggcctctctc cttgggttgt agatgctgtc ttctccctgt gtcctcacag ggttgtccct
541 ctgtgtgtgt ctgtgtcctc atctcctctt cttatgaggt gtcttagtcc atttcaggct
601 gctgtcacag catgccgtag actgggtggc ttatcagcaa cagacattga ttctcccaca
661 gtcctggaag ctggacgtct gagatcaggg tatgggcagg gctgcttcct cctgaggcct
721 ctgtcctggg cttgtagatg ctgtcttctc catgtgtccc catgtggtca tccctctgtg
781 ggtgtgtctg tttcctcatc tgctcttcta atgagatgtc ttagtccatt gcaggctgct
841 atcacagaat accataggct gggtggctta taaaccacag agttttattc ttccacagtc
901 ctggaggctg gaattc
//
LOCUS HUMSEXRPA 918 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human sex chromosome repeat, clone pDP316.
ACCESSION M33523
KEYWORDS sex chromosome repeat.
SOURCE Human cell line OXENII DNA, clone pDP316.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 918)
AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and
Page,D.C.
TITLE Human sex-chromosome-specific repeats within a region of
pseudoautosomal/Yq homology
JOURNAL Genomics 7, 625-628 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.M.C.Fisher, 03-APR-1990.
FEATURES from to/span description
site 1 388 low copy flanking sequence
rpt 389 918 sex chromosome repeat
BASE COUNT 207 a 242 c 226 g 243 t
ORIGIN Chromosome Yp.
1 gaattcaggc ctcagtgtct gtctgtaacc caacagacgg tgtctgcaga gatcgaagta
61 ttttgtcgtc gaagaggaag gaatgatcat tcatcacaaa aagcaagaca tctttggtgc
121 aaggaaaact cgaggaaaat accgcagacc atgcaatgag gcactggttg acggtgtgtt
181 ataaacccgt cttcccagag tggcatgcac acggatccct caggacatgg gtgacacaca
241 gactatgctt cagcaggtct gtctgggccc aagacacagt gtttctcatc agctcccagg
301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct
361 tcccagagtt gttacaatgc aatcccaaag actgggcagc ttaaacaaca accttgattc
421 tcccacagtc ctggaagctg gaagtctgag atcaaggtgt gggcagggcg gttcctcctg
481 agtcctctct cctgggcttg tagatgccgt cttctccctg agtccccacg tggtcatccc
541 tctgtgtgcg tctgtgtcct catctcctct tcttatgagg tgtcttagtc catttcaggc
601 tgctgtcaca gcataccata gactgggtgg cttataagca acagacattg attctcccac
661 agccctggag gctggacgtc ttgagatcag gatatgggca aggctgtttc ctcctgaggc
721 ctctgtcctg ggcttgtaga caccatcttc tccctgtgtc cccacgtggt catccctcta
781 tgtgcatgtc tgtgtcctca tctgctcttc ttatgagatg tcttagtcca ttgcaggctg
841 ctatcacaga ataccatagg ctgggtggct tacaaaccac agacttttat tctcccacag
901 tcctggaggc tggaattc
//
LOCUS IRICAP 2461 bp ds-DNA VRL 09-AUG-1990
DEFINITION Iridescent virus type 1 capsid protein gene, complete cds.
ACCESSION M33542
KEYWORDS capsid protein.
SOURCE Iridescent virus type 1 DNA.
ORGANISM Iridescent virus type 1
Viridae; ds-DNA nonenveloped viruses; Iridoviridae.
REFERENCE 1 (bases 1 to 2461)
AUTHORS Tajbakhsh,S., Lee,P.E., Watson,D.C. and Seligy,V.L.
TITLE Molecular cloning, characterization, and expression of the Tipula
iridescent virus capsid gene
JOURNAL J. Virol. 64, 125-136 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 601 1995 capsid protein
mRNA 587 > 2461 capsid protein mRNA ( 5' end +/- 5 bp)
BASE COUNT 717 a 462 c 443 g 839 t
ORIGIN
1 gaaggtgttg aaagatctac tgaaataggc ttcattagca tttttatttt gtccacaaat
61 tcattatttt taataggctg ttcttcacct ttattcgcat attcaaagta atcgattaaa
121 tttttttgaa tatggacgat atcatccatg aacataaacc aaacttcata atatatagta
181 tggagtaacg ggttaattaa accattgatt ccttttaatt gttttggatt aatgaggttt
241 aaatcatcat aaattttttc tatttttttt aaattttttc gagcaatttt taaatttgat
301 ttaaccaaac aaacttcctc tactttaatt gttacggttg gtacttttaa accattaatt
361 ttatttttag aggaagaaca acgctttatt aaagcgttgg aatccattaa tcgcttgttt
421 tatcataggt tattttttaa ctataaaaaa ataactaaat tactacagtt accaatatgt
481 cggcattagt tctccttcat attttcgtat tttataccct taaatttaac ctaatcaatt
541 tctacattta tttttgggtt caaaattttt agccgaaata ttgctactaa taaattaaac
601 atgtctatgt cctcatcgaa tataacctca gggtttatcg atatcgccac ttttgacgaa
661 atcgaaaaat atatgtatgg cggcccaaca gcaacagcat actttgttag agaaattaga
721 aagtcgactt ggttcactca agtaccagtt ccactatcta gaaatactgg taatgcggct
781 tttggacaag aatggtcggt atctatatca cgtgctggag attatttgtt gcagacctgg
841 ttacgagtca atatcccacc agttactctt agtggtctac ttggtaacac ttactcttta
901 agatggacca aaaatttaat gcataacttg attcgtgaag ccaccattac ctttaatgat
961 ttggttgcag ctcgatttga taactatcat ttggatttct ggtctgcttt caccgtacct
1021 gccagcaaac gcaatgggta tgataacatg attggtaatg tctcttcttt aattaatcca
1081 gttgctccgg gtggtacttt gggtagcgta ggtggtatta accttaatct tccacttcca
1141 tttttcttct ctcgagatac tggtgtagca ctaccaacag ctgctctacc ttacaatgag
1201 atgcaaatca actttaattt cagagattgg catgagcttt tgattttgac taacagtgct
1261 ctagtaccac cagcaagtcc atatgttcca attgttgtag gtactcatat ttcagctgct
1321 ccagttttag gaccagttca agtatgggct aactatgcca tcgtctccaa cgaagaacgt
1381 cgtagaatgg gttgtgccat tcgagacatt ttgattgaac aggttcaaac ggcaccacgt
1441 caaaattatg tacctttgac caatgctagt ccaacatttg atattcgttt ctctcatgca
1501 atcaaagcat tattctttgc tgtacgaaat aaaacatctg cagcagaatg gtcaaattat
1561 gctacttctt ctccagttgt tactggtgca acggttaact acgaaccaac aggttctttt
1621 gaccctattg ccaatacaac attgatttat gagaacacta atcgtttggg tgccatggga
1681 tcagattact tctctttgat taatccattc tatcatgctc caactattcc atcattcatt
1741 ggatatcatt tgtactcata ttctcttcac ttttatgact tggatccgat gggttctacc
1801 aattacggta aactcactaa tgtgtctgtt gtaccccaag ctagtccggc agcaattgcg
1861 gcagcaggag gtactggtgg tcaagcaggt tcagattacc ctcaaaatta tgaatttgtc
1921 atattagctg tcaataataa tattgtcaga atatcaggtg gagaaacacc acaaaattac
1981 atagcagttt gttaaggtaa tttgtaacgc tccacaacag gcggaagtgg tctcgtgaga
2041 gaccgatatt gaggttttat caaccttaat ttgaatcatg aattaacatg atactttggt
2101 accgtctagt cggcttatat gtcgggctaa tggtcttttt tgatcatcaa gtggctataa
2161 gtggtacgtc gacgacagtc gacacctagt ggtttaataa aggtttttta cccaaattaa
2221 actggaacag gcaaggttga tgaaaacggt caaaattcag atagtctcgg gggctatttt
2281 ggacaagacc gtcggtgcag ctaatgcgta agcatcagtg atatcgctat cgactgggtc
2341 atcaatcggt tgtcctatct gactttttaa agtctcagga tggctcaatg tacagtcagc
2401 ccgcagtaag gtgtattccg agctgtcttt gaggataaaa gtaaacttga aaaagaagct
2461 t
//
LOCUS MUSIGHAAR 363 bp ss-mRNA ROD 09-AUG-1990
DEFINITION Mouse Ig rearranged H-chain mRNA V-D-J region, partial cds.
ACCESSION M33679
KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon;
processed gene; variable region.
SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 363)
AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J.
TITLE Clustered heavy chain somatic mutations shared by anti p
azophenylarsonate antibodies confer enhanced affinity and ablate
the cross-reactive idiotype
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital,
Jackson 1402, Blossom Street Receiving, Boston, MA 02114
FEATURES from to/span description
pept < 1 > 363 Ig heavy chain V-D-J region (AA at 1)
BASE COUNT 98 a 83 c 89 g 93 t
ORIGIN
1 gaggttcagc ttcagcagtc tggagctgag ttgatgaggc ctgggtcctc agtgacgatg
61 tcctgcaagg cttccggata tgcaatcaca agctacggtt taaactgggt gaaacagagg
121 cctggacagg gcctggaatg ggttggatat attcatcctg gaaaaggtta tattcactac
181 aatgaaaaat tcaagggcaa gaccacactg actgtagaca aatcctccaa tacagcctac
241 atgcaggtca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcgttt
301 tttgacattt acatgtatta ctttgactac tggggccagg gcaccactct cacagtctcc
361 tca
//
LOCUS MUSIGKABF 324 bp ss-mRNA ROD 09-AUG-1990
DEFINITION Mouse Ig rearranged L-chain mRNA V-J region, partial cds.
ACCESSION M33678
KEYWORDS immunoglobulin light chain; joining exon; processed gene;
variable region.
SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J.
TITLE Clustered heavy chain somatic mutations shared by anti p
azophenylarsonate antibodies confer enhanced affinity and ablate
the cross-reactive idiotype
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital,
Jackson 1402, Blossom Street Receiving, Boston, MA 02114
FEATURES from to/span description
pept < 1 > 324 Ig light-chain V-J region (AA at 1)
BASE COUNT 96 a 77 c 73 g 77 t 1 others
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagntgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaaaat taaagtcagg agtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagtga cctggagcat
241 gaagacattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
301 ggcaccaagt tggaaatcaa acgg
//
LOCUS MUSTCVYAN 2567 bp ds-DNA ROD 09-AUG-1990
DEFINITION Mouse T cell receptor rearranged beta-chain gene, V-2 region, 5'
end.
ACCESSION M33500
KEYWORDS T cell receptor; beta-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c) DNA, hybridoma B.1.1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2567)
AUTHORS Ratanavongsiri,J., Igarashi,S., Mangal,S., Kilgannon,P., Fu,A. and
Fotedar,A.
TITLE Transcription of the T cell receptor beta-chain gene is controlled
by multiple regulatory elements
JOURNAL J. Immunol. 144, 1111-1119 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 2544 > 2567 T cell receptor beta-chain V-2 region
precursor
sigp 2544 > 2567 T cell receptor beta-chain signal peptide
mRNA 2478 > 2567 T cell receptor beta-chain mRNA
BASE COUNT 708 a 560 c 583 g 716 t
ORIGIN
1 ctaaagttct tggctactgt tgtgtgcact ttgagtaatg attaagatgc attgggacag
61 ggggtggaga aatgtcccaa ggaggtagcc atgacctcca acactggtcc tgtggaggcc
121 ccgaggagct agctagccat ctgatctgga aacaagaggc ttaacctggc tcagtactga
181 aagctggtca agataagagg gggcaggcag atacctggag gcactgacct tgggaggcag
241 gaaggttagc aagggagata actggagtgt gagagacatt ctgatcccaa tcttgttaga
301 ggattaggct gaagagggtt cagtgtgaag ctcagtaaac tgagaagggc ctaggtttcc
361 ttctcctgga gtctgcttgg ctggacagag cacactgtcc ttagaaaagc aacagagctc
421 tcctggagga gctaggagcc actgacttca gacccaggga atatcttctc taccctcttc
481 cttctggctc ttaaggaggc tcacagggag cttatttagc tttttaagga gatttataga
541 ggctggagga acttgttttt tcaaaagtaa atgctctaga aaaatgaagg ttgaaggtgt
601 tatcaaactt gtgggtcaaa gctaaatgaa aaaaaaaatc aaaagaagga catgtctatt
661 cccaacataa gcagaagact tttattataa atatggtggg agaccatagt cagagacaga
721 gacagctggg aaaggccagc atgaacttga ccctgagcct ggacatctga ggacttgggg
781 gagcaggtgg gaagaaagaa gagagaaaag agagaagagg ggagaccagg agagtaaaga
841 gtagacaaaa ggacagcata gcaaaaatag ctggatttat aggggaaggt agctggggaa
901 aaggcagccc atcccctggg ctggagaagt ttagattaga gggtctgtat tctggccata
961 tcatatacta ggtaggacta aggaatgctg agtgaagctg gcatccaggt ccacaatgac
1021 atgttaaata agaacttcag ttagccattt gctttgggat tgaggcataa taaacgccag
1081 taccccaagc cagctctgtc cacttgtcct cagtaagtga acttaaacag ccaaaccagt
1141 aatctaaata actaactaac taactaacta aatcaatcaa tcaatcaatc aataaaagta
1201 gaaaagattt tttcagtgta aacacattgg taacatggaa aaagatccag agatccagta
1261 aactccctgt gtcagtcttg gggacctgca ggcaagatgg aagtttagag ggccaaggat
1321 aagcaatcta gctcaaagta tggtcctgcc ctgcattgac ccattgccta ggcttgttaa
1381 agctgtgtga aatctctttc caggagatac attcccactc tcgctggtgc ctttcctttc
1441 ttccatgttt tcctggggaa atttctcttt ctttggggtc acttttatca atagcctgct
1501 gttcagattg aaagactgtc tctttagaat gtctttattt ctgccaggtc agttatagaa
1561 agtggcatgt tttcctttat tcaggacaaa actcccattt tgattttctg cttgcattcc
1621 tggagtcaga cagatgagta ttcactgcat acagcctcgt ataaccctgc aaccacctcc
1681 acatgttcac ttaaatggag acattttact ctcttgcaag agcttgaaac tcaaactcag
1741 atctgtgaaa ctataaatcc agtttccttc catccctgct cctggagtga tgaccctgag
1801 actaattatc aataaatgcc tagagcataa gctccagcta gttctctgac ttgctctcaa
1861 cttattatgc cttttattct aacccagctt tagctacatg gctggtttcc tctccttgtc
1921 ttcttacttc agtctcctca gcattacagc tcgaatctct gttctatttc tcaagttcct
1981 ctacctgctg gattatgtcc ttttcctcag tgttccaggc aatctctact tttattctat
2041 cttgagtgac tagttacttc tgctcagctc ccatgattct gacctcctgt gttttgcagg
2101 caaatcttcc atgccctctc ctactatttc ccagaattct ctctattcct gctggatgtc
2161 ccacctactt cctgcatcag ctcattggcc ataagctttt ttattgacag gtgatactta
2221 acacatatca cttccaggaa tatctgttca ccactgagaa gatgcagggg cccagtcact
2281 gcactcagtt ctgtagtgag tgtacaatgt gcatgagtgt ggatgagaga gcattgctca
2341 gaccacagga aagggtgcaa accttcagtt tgaggttttc actttagagg aaagcttagt
2401 cagtttcctg aggaagtcac accctttgga acctcagccc caagacttaa gtttctcgtt
2461 accaccttac tggtttggat tctcttctct tgcctgatgc cctgcatgcc ccacagagat
2521 agagagaacc tgaggtctca gagatgtggc agttttgcat tctgtgc
//
LOCUS R751TRA 578 bp ds-DNA BCT 09-AUG-1990
DEFINITION Plasmid R751 traJ and traK genes, 5'end.
ACCESSION M25422
KEYWORDS inverted repeat; transfer origin region.
SOURCE Plasmid R751 (strain HB101, Inc P-beta) DNA.
ORGANISM Plasmid R751
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 578)
AUTHORS Lanka,E. and Euerste,J.P.
TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of
plasmid-encoded products with the transfer origin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
E.Lanka, 17-JUL-1989.
FEATURES from to/span description
pept 48 < 1 (c) traJ protein
pept 403 > 578 traK protein
signal 243 211 promoter PL
signal 266 294 promoter PR
rpt 49 64 inverted repeat
rpt 118 157 inverted repeat
rpt 296 331 inverted repeat
BASE COUNT 141 a 168 c 163 g 106 t
ORIGIN
1 cggccgtgtt ccttttcgtc gttctccatg cctcgcctcg tctctcatgc cggcggtagc
61 cggctgcctc gcagagcagg atgacccgtt gagcgccccc ggcgcgaata agggacagtg
121 aagatagata accggctcgc cggttagcta acttcacaca tcctgcccgc cttacggcgt
181 taataacacc aaggaaagtc tacaccagcc attacgattt atccgcaact atcgcgctat
241 caggccgcaa aagcagcaac ggatatagcg aaacccgcca caatggccca taatgccgct
301 atcgaagcgt gccaatgcac gccgatagcg gactttttgc gtttccgtag cgccgcttag
361 tagcgttaca tttgcgatga gaggattaga tggacgaaca cgatgccaaa gacctacccc
421 gaagagctgg ctgaatgggt gaagggacgg gaagccaaga agccgcgcca ggacaagcac
481 gtggtcgcgt tcctggccgt caagagcgac gttcaagcgg cgctcgatgc gggctatgcg
541 atgaaaacga tctgggagca catgaaggaa accggccg
//
LOCUS RP4TRAB 571 bp ds-DNA BCT 09-AUG-1990
DEFINITION Plasmid RP4 traJ and traK genes, 5' end.
ACCESSION M25423
KEYWORDS inerted repeat; transfer origin region.
SOURCE Plasmid RP4 (strain HB101, IncP-alpha) DNA.
ORGANISM Plasmid RP4
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 571)
AUTHORS Lanka,E. and Euerste,J.P.
TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of
plasmid-encoded products with the transfer origin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
E.Lanka, 17-JUL-1989.
FEATURES from to/span description
pept 26 < 1 (c) traJ protein
pept 394 > 571 traK protein
rpt 48 63 inverted repeat
rpt 118 157 inverted repeat
signal 219 192 promoter PL
rpt 281 318 inverted repeat
signal 246 272 inverted repeat
BASE COUNT 135 a 148 c 181 g 107 t
ORIGIN
1 ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc ggcggtagcc
61 ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata agggacagtg
121 aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg ctgacgccgt
181 tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata tcgtgcgaaa
241 aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta tgcagcggaa
301 aagcgctgct tccctgctgt tttgtggaat atctaccgac tggaaacagg caaatgcagg
361 aaattactga actgagggga caggcgagag acgatgccaa agagctacac cgacgagctg
421 gccgagtggg ttgaatcccg cgcggccaag aagcgccggc gtgatgaggc tgcggttgcg
481 ttcctggcgg tgagggcgga tgtcgaggcg gcgttagcgt ccggctatgc gctcgtcacc
541 atttgggagc acatgcggga aacggggaag g
//
LOCUS STAREPEBR 2389 bp ds-DNA BCT 09-AUG-1990
DEFINITION S.aureus ethidium resistance (ebr) and replication protein (repA)
genes, complete cds.
ACCESSION M33479
KEYWORDS ethidium resistance protein; replication protein.
SOURCE S.aureus plasmid DNA.
ORGANISM Staphylococcus aureus
Prokaryota; Bacteria; Firmicutes; Gram-positive cocci;
Micrococcaceae.
REFERENCE 1 (bases 1 to 2389)
AUTHORS Liao,J., C,-H., Moghazeh,S.L. and Projan,S.J.
TITLE Genetic mapping and nucleotide sequence of pWBG32, an ethidium
bromide resistance plasmid naturally occurring in Staphylococcus
aureus
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.J.Projan, 30-MAR-1990. Public Health Res Inst, 455 First
Avenue, RM 1166, New York, NY 10016
FEATURES from to/span description
pept 1153 1476 ethidium resistance protein (ebr)
BASE COUNT 796 a 403 c 290 g 900 t
ORIGIN
1 ggtcaatatc tttaagataa tctaaatcgc cattttttaa tttatttctt gcgtctttaa
61 ataatccaga ataaacaaga atttgtttcc ctttaagaga tttataaaat gcgtcgaaca
121 ctttctgatt aattaaatag tcactatcct taccagaata tttagccatt tcatataatt
181 ctttattgct attttgctta attttttgaa catgaacttg cgtaatttca gaaattcctg
241 ttacatctcg ccataaattt aaccattctt tttgactaat ataagctttt gtatctttaa
301 aatatgattt attaacggcc atcaaaacat gaaaatgcgg attataatca tcacgctttg
361 agttatacgt tatctctaat tttcttacat aacctttagt gatcgcattt acttttttgc
421 gtttaaacat cttttgaaag gcatgattat aattcttaat ttcactttct aaatgctcat
481 ctgtaacgtt tggtgtcgta agtgtcaaaa agataaattg cttatcttct tcttgcttaa
541 tatattgcat cattaacgat aatcctaatg catcttttct tgctttacgc cacgcacata
601 ccggacaaaa tcgattctta caaggattcg atttatataa tttctttttt tcaaattttt
661 tatccgtcac aaaagacaaa aatgtattac aatttttaac caaatccatt tgatctcccc
721 gatatgacgt tcaataaaat ttttaaatac ttgatttctt tgctttttct cagtatactt
781 ttccatacga taatacacaa aaacaactta gttttctcaa aaactatgca taaaaaagtt
841 gcttttttct ccttttcttt ttttttcgtt tggattagac acctaaaacg atacaatagt
901 atgctagaaa aagcaacttt ttttgtgctt caaaccagtt ataccaatga attgaaaggg
961 ttatacatcg ccgggaatag ttacccttat tatcaagaca agaagaaact cgttttcaac
1021 tcgtttcaaa aacctttcaa aaaccatcaa tccacaaaaa taccacgcga atgacactca
1081 aaatacaaga ctacaattaa aaaatactta gaataaaatt aaataaaata cgaaaattaa
1141 aaggagttaa aaatgcctta tatttattta ataatagcca taagtactga agttattgga
1201 agtgcatttc ttaaatcttc agaaggcttt tcaaaattta taccatcctt aggaacaata
1261 atttcatttg gaatttgttt ctatttttta agtaaaacaa tgcaacacct accactaaat
1321 ataacttatg caacttgggc gggactaggt ttagtcttaa caaccgtagt ctcaataatt
1381 attttcaaag aacaaataaa tctaataact atagtatcta tagttttaat catagtcggc
1441 gtagtttcgt taaacatttt cggaacatcg cattaattgc tttattccaa ttgctttatt
1501 gacgttgagc ctcggaaccc ttaacaatcc caaaacttgt cgaatggtcg gcttaatagc
1561 tcacgctatg ccgacattcg tctgcaagtt tagttaaggg ttcttctcaa catcaataaa
1621 ttttctcggc ataaatgcca tgctataata gatacacgtc ttctcttagc gtttcatagt
1681 attatcctcg tttattatac ttataattat aggggaaggc ttagagctat cattttgata
1741 gctctttatt tttgttcaaa catttattca aaatcagaat gcctttattt tttaatttta
1801 aggggtattt tgaagaatta agggttattt atatagtttt atacctaaaa acttatatcg
1861 gctcttaaaa cgcaaataag agccgaataa aaataattgc ttttcacaaa caaaaatttg
1921 agcaaaacca gtgttgaatt ttttagacac tgcccatcta catgcaaatt taaaaattgg
1981 cataaaaaat gggcaaccat gctggttgaa cgctatagtt cctgcagggg caaaaaagca
2041 taaaaaaacg ctagctttga tgagctaacg ttagttataa aattcagtaa tatgcttttg
2101 taattcaata gattctcttt cttttttagc ttgtcttttt ttaaaacctt ctgaatttct
2161 agaagcctta tatatatcca ttattttttt ataatcaatg tcgtaaccat atttttgtaa
2221 ctcttctaca aaaaacttat cgcaatttaa tatcattttt cttcctcgat ttcgtttatc
2281 atttgatgat ttattttttc tttttcttgt tcagttaaat cataaatttc acttgctaag
2341 tattcttttt gattccaaat ataaaaaatt tgataaatat attcagtcg
//
LOCUS XANAVR 2100 bp ds-DNA BCT 09-AUG-1990
DEFINITION X.campestris avirulence protein (avrBs1) gene, complete cds.
ACCESSION M32142 J03672
KEYWORDS avirulence protein.
SOURCE X.campestris (strain E3, race 2, pv. vesicatoria) DNA.
ORGANISM Xanthomonas campestris
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 2100)
AUTHORS Ronald,P.C. and Staskawicz,B.J.
TITLE The avirulence gene avrBs-1 from Xanthomonas campestris pv.
vesicatoria encodes a 50-kD protein
JOURNAL Mol. Plant Microb. Interact. 1, 191-198 (1988)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
P.Ronald, 15-FEB-1990.
FEATURES from to/span description
pept 308 622 ORF1
pept 713 2050 ORF2
BASE COUNT 656 a 423 c 505 g 516 t
ORIGIN
1 ccattgtcgg cggttatccg ggtacttggc gtacaccaaa caactggggc aatgctggca
61 aatcacgtga cgaagccttg gcagacgagc aacagaggat tcaagcgctt aaatcgcaag
121 agacggtaca tatcttccat cgcaaagatg tcaagagcga acccgcaacc cacgcggggc
181 gacgttaagt aagccactga tttttagcga agaagagctt gtgagagctg cgggcgccaa
241 atatgtacgt ttgacagtga cagatcatct ttcaccacgg gcggacgata ttgatgcgtt
301 tattgcaatg gagcgggaga tggcccatga tgagagactg catgtacatt gtggtatggg
361 cctaggccgt acgacaatat ttattgtcat gcatgacata ctaagaaatg ctgcaatgtt
421 atcgtttgat gatatcatcg aacggcaacg taaatttaat ccagggcgaa gcttggataa
481 taataaagac gtttctgaca aggggcgctc agaatttcgt aatgaacggt cagagttcct
541 tcctctattc tacgagtacg ccaagcaaaa tccaaagggc cagccattgt tatggtccga
601 atggctcgac cacaatgcat aaatcgcaag tacattttcg gctatgacgg acttgtgctc
661 gatgcgctgg cggctttctc gataaatatc aattaatata aatatcgaac taatgtccga
721 catgaaagtt aatttctctt caaaaataat agattcaaca cccagtgaag aggaggtcgc
781 cactcagcaa gatagttata cgaaatctgg actggtggcg ccatcgctcg attcacaagc
841 cttgaaaaaa gcacctagaa aaagagtaat aaaagaaaat atagctgctt tgcacacctc
901 atcgttagag cgagttcatc aaaagaaggt attagttcag aatttagcgc agttgcagag
961 agggttggct aagataaatg gtagagtcga actcgaagag ctaattgatg gattttcagt
1021 caaggaattg ctaataaaaa gaaatccaaa gattgctgaa gagtatggag aaggaaatcc
1081 tttaatgatt cgatctctaa gattttcaaa cccccaagag gtgactagta agcttggggc
1141 ggaaggaaaa acgccagcca aaagagaggt tgatacgatt tgcaataaat ccacgctgca
1201 tgacattgtc atgacgcccg cctcccttgt aaaaaaggaa gtgcggatga acctgatatc
1261 tgaagtccca agggcgaagg ataaacaaaa atacagaggt cttccttcag tcgtatatgg
1321 ccaaagcagc cgccgtagtg aatcagacta tctaacgtct cgaaatggtt tcggcgacgt
1381 gcactctttg aaatccaata acgcatttaa ttccgactac gaaaaaatat gtgggtcgct
1441 tagccatgcc gaaaagttgg ggttaattga aaggaatctt actcccttta taaggcatga
1501 tccagataga atctccaccg actttgttca ctctattgaa gaattggctg aacaccagat
1561 gctattgcaa tcaagaaaac ctgccagtgc tttgcggcat aatgaatatt gcaccaagct
1621 tgaactgtgg gatgctaaag ctatagcagt tggtgaatct cgtgccttgg cggtcgctac
1681 cctgattgaa tttaatttgg agatgttgtc gatagcacaa gagatagatg atgatgggca
1741 caagagtaaa atggtcgccg attttatcga gcgccaacta tcatggcttg gcccacaaac
1801 cgcacttgac agcaagtcaa cgcttgaaag ggtttcagcg gtgaccatac aagaaaggga
1861 atttatcgct aatgagatta gccgatcgtt gcgtcaaggt gtttcacttt gcacttacga
1921 taaagatgaa gcaggaagtc atatccgtga aatgagtttg ttggatttta gggttgaaga
1981 aatcatagag gggataagta tttttatttc ctccaagctt ttacatgtta caaatgcagg
2041 agaagcgtaa gagaagaagt atccgccaca atcgtgcgac ggaccgacgt cctaacgccc
//
LOCUS YSCSCD25 5055 bp ds-DNA PLN 09-AUG-1990
DEFINITION S.cerevisiae SCD25 gene, complete cds.
ACCESSION M26647 M31771
KEYWORDS Ras protein; SCD25 gene; cell division cycle.
SOURCE S.cerevisiae (strain OL136) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 2129 to 5055)
AUTHORS Boy-Marcotte,E., Damak,F., Camonis,J., Garreau,H. and Jacquet,M.
TITLE The C-terminal part of a gene partially homologous to CDC25 gene
suppresses the CDC25-5 mutation in Saccharomyces cerevisiae
JOURNAL Gene 77, 21-30 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 3880)
AUTHORS Damak,F., Boy-Marcotte,E., Le-Roscouet,D., Guilbaud,R. and
Jacquet,M.
TITLE SCD25, a CDC25 like gene, which contains a RAS activating domain is
a dispensable gene of Saccharomyces cerevisiae
JOURNAL Unpublished (1990) See COMMENT for author address
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by E.Boy-Marcotte, 02-AUG-1989, for [2] by F.Damak,
01-FEB-1990. Laboratoire IGD, Groupe des laboratoires de biologie
cellulaire, Centre universitaire d'Orsay, 91405-Orsay Cedex
FEATURES from to/span description
pept 128 3880 SCD25 protein
pept 4319 > 5055 ORF X
BASE COUNT 1638 a 973 c 900 g 1544 t
ORIGIN
1 ctgcaggctc gcaaaattta aggttccctt ctacaatagt agtcaaaatt gcttttttgc
61 atataacaaa gtgaaaaaaa aaaatatgag agacatatct aaaagacata tataatctgc
121 caccataatg agttgcactg cgtcatatgc cggcatgaca actccggtga aagataagga
181 aggccacggg attccatgct tacaacctat cgatgtagtg gaatgtacct atcaatattt
241 tacaaaatca cggaataaac tgtctttaag ggtaggcgat ttgatttacg tactcactaa
301 aggttctaat ggctggtggg atggtgttct tatcagacac agcgctaata ataataataa
361 taattcgttg atactagaca gaggttggtt ccccccttct tttacacggt ccattctaaa
421 cgaactacac ggggtgcctg acatcggtaa tgaattggaa atatttcaag cgggtcttaa
481 tcttaaactg gaattatcaa gcaacccagt gatcttatca ttggaagact ttttagactg
541 ctgtcgcgat attgaattca aggaacaact ggcttggtca cctactcccg tccacgaaag
601 gaaaggctgc tgtgagctgc tgtactataa ccaggattta gatgtttatt gtcgcacgtt
661 accatattta ccacaaaatc aagttgaaac cgtgaacgac tattcgtctt ttcctgcaat
721 atcgaagatt gctggtaaaa agatgcctat aacgtcaagc cccgatctgt tctatctcaa
781 tgattgtgat gtcgtctatt ggtatgacct cactcgctta gtgtgtcatt atgttaattt
841 aacagagcgc gacctattgg caaatgaacg ggaaaagttt ctaacttcct tggatttatt
901 aacagctcaa ataacctatg tttatatgct tttcaggaat ctccgtttag ttgaagatag
961 tttcaaaaaa accctcaaaa aactaattta caccttgtct aggttttcaa taaatgcaaa
1021 tatttggttt cattccacat cgtttgaaga aagagaagcc atagcctccc agaaggatcc
1081 agaaagaaga tcccctcttc tacagtcaat cctaggaacc ttccaaaaat ttcattttct
1141 actgcgtcta ctacatttcc tctcaaatcc taacgaactt acaatactgc ctcaattgac
1201 tcctcgattt ttcaaggatt ctttcaatac aatttcatgg aataacccgt ttttgcgtac
1261 agtcttcaac cagcatatgt ccatgacctt accgagacag atgattaaag ccgttgctgg
1321 cgcttcagga attgttgcgg aaaatattga tgaaattcca gcttccaaac agggcacttt
1381 catctcgtca gaaacgtctc accattcacc atcagccccg tttcaaagaa ggagaagagg
1441 taccattttc tctaatgtgt caggaagttc cgatgagtct gacaccatat ggtccaaaag
1501 gaaaaaacca tacccgctaa atgaagaaac tctaagcctt gtaagggcca ggaagaagca
1561 gcttgatggt aaactaaaac aaatgatcaa aagtgctaat gaatatctca gtaacacggc
1621 taatttcaaa atgttgaatt ttgaaatgaa cttcaaaacc tacgaagaag taagcggaac
1681 aattcctata attgatattc tggaaaacct agatttaact atttttctaa acttgagaga
1741 gttgggagat gagaatagag tttttgacga agatgtcttt gacgaagatg tcgctattgg
1801 tgatgaagat aaagagtttt tgaaacactc tttatcatcc ctatcgtata tcttatccga
1861 ctattttaat atgaagcaat attttcatga attgtcgccc acgcatttga cattagagga
1921 tcctttcgtt ttctcgccaa tgcaaaacga cttgcctacc ggttattatg aaccaatgaa
1981 accttcatcc ttgaatttag ataatgccaa ggataagaag aatgggagcc aaaatactga
2041 tatccaagag gaggaagatg aatatgagcc agacccggat agtcttattc tcttccacaa
2101 cctcatcaat caagattctg atttcaatga tctaaagttt tttaatctcg cccacgtttt
2161 taaaaaatcc tgtgatgact attttgatgt gcttaaacta gccattgagt tcgtgaatca
2221 attaattcta gaaagagaga atttgttaaa ttatgctgct agaatgatga aaaacaatat
2281 cacggaattg ctattgcgcg gggaagaagg ctatgggtcc tatgacggcg gtgaaactga
2341 aaaaagtgac acgaatgctg tttatgcaga ttcagatact aaagacaatg acgaatggcg
2401 tgacagccaa gtcaaattac cgaggtattt gcagcgcgag tatgacagtg aactgatttg
2461 gggctctaac aataggatta aaggtggttc taaacacgca ctgatctctt acttgacaga
2521 taatgaaaag aaggacctat ttttcaatat tactttttta atcactttca gaagcatctt
2581 tactacaacg gagtttttaa gctacttgat ctcgcaatat aatttggatc caccagagga
2641 tttgtgcttt gaagaataca atgaatgggt gacgaaaaag cttataccgg ttaaatgtag
2701 ggtggttgag attatgacaa cctttttcaa gcaatattgg ttcccgggct atgatgagcc
2761 cgatcttgcg accctaaatc tggattattt tgcgcaagta gcaatcaagg aaaatataac
2821 aggatctgtg gaattactaa aggaggtcaa tcagaagttt aaactaggta atatacaaga
2881 agcgactgca ccaatgaaaa cgttagatca acagatctgc caggaccatt actcgggcac
2941 tttatactct accacggaat ccattttggc cgtcgatcca gttttatttg ccactcaatt
3001 aacgatacta gagcatgaaa tttattgtga gataaccact tttgattgtt tgcaaaaaat
3061 ttggaagaac aagtatacaa aatcgtatgg ggcttcaccg ggtttgaacg agtttatcag
3121 ttttgccaat aaactgacaa atttcatatc ctactctgtt gtaaaggagg ctgataaaag
3181 taagcgcgcc aagctactct ctcattttat ttttatcgca gaatattgta ggaaattcaa
3241 taacttttct tccatgactg acatcatttc agcattatat tcttcaccaa tttatcgttt
3301 agagaaaacc tggcaggcag ttattcctca aacgagagat ctattgcagt cactgaacaa
3361 gttgatggat cccaagaaaa atttcataaa ttacagaaac gagctgaagt ctttacatag
3421 cgctccctgc gtaccgtttt tcggcgttta tttatctgat ctaaccttta ctgattccgg
3481 aaatccggat tatcttgtct tggaacatgg tttaaagggt gtccatgatg agaagaaata
3541 tataaacttc aacaaaagga gcagacttgt tgatatctta caagagatca tatatttcaa
3601 gaaaacacat tatgatttca ctaaagatcg gacggtaatt gaatgtatat caaattcatt
3661 ggaaaacatc ccccatattg agaaacaata ccaattatca ttaattattg aaccaaaacc
3721 aagaaagaaa gtcgttccga attccaattc gaataataaa tcacaagaaa aatccaggga
3781 tgaccaaacc gatgaaggaa aaacatccac taagaaagac agatttccaa aatttcaatt
3841 acataagaca aagaaaaaag ctcccaaggt ttctaagtaa cggcgccgta tgttcgattt
3901 ccttctctcg gtggattaat tattttgttt gttttctcct gttatattat ttattgatca
3961 ctatagtaaa ctatgtccgt catcaagccc gacggctgct atcccacaat gttgatcgta
4021 ttgtttgcct agtttattat atatttgctt atttatagca taccataata tttaaatgcc
4081 ctcaaatttt tggccgtagc gacatcgcga taattccaat tccctttaaa aaattgcgcc
4141 tgagtataag ttaattcagc cagttctcca aattaaaatc gcatactcct gaacctatca
4201 acagattgtc ctcgcatact tttctatacc aaggtctctt ctgaacatat attagcagtg
4261 gttaatttta aagagatcat aaagaaaatt ttgtctaaaa aagattaata taaagacaat
4321 gtcttcacta gaagtggtag atgggtgccc ctatggatac cgaccatatc cagatagtgg
4381 cacaaatgca ttaaatccat gttttatatc agtaatatcc gcctggcaag ccgtcttttt
4441 cctattgatt ggtagctatc aattgtggaa actttataag aacaataaag taccacccag
4501 atttaagaac tttcctacat taccaagtaa aatcaacagt cgacatctaa cgcatttgac
4561 caatgtttgc tttcagtcca cgcttataat ttgtgaactg gccttggtat cccaatctag
4621 cgatagggtt tatccattta tactaaagaa ggctctgtac ttgaatctcc ttttcaattt
4681 gggtatttct ctccctactc aatacttagc ttattttaaa agtacatttt caatgggcaa
4741 ccagcttttc tattacatgt ttcaaattct tctacagctc ttcttgatat tgcagaggta
4801 ctatcatggt tctagtaacg aaaggcttac tgttattagc ggacaaactg ctatgatttt
4861 agaagtgctc cttcttttca attctgtggc aatttttatt tatgatctat gcatttttga
4921 gccaattaac gaattatctg aatactacaa gaaaaatggg tggtatcccc ccgttcatgt
4981 actatcctat attacattta tctggatgaa caaactgatt gtggaaactt accgtaacaa
5041 gaaaatcaaa gatct
//
LOCUS ADAMLPA1 630 bp ds-DNA VRL 09-AUG-1990
DEFINITION Simian adenovirus 30 major late promoter region DNA.
ACCESSION M31631
KEYWORDS promoter.
SEGMENT 1 of 3
SOURCE Mastadenovirus s30 viral DNA.
ORGANISM Mastadenovirus s30
Unclassified.
REFERENCE 1 (bases 1 to 630)
AUTHORS Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T.,
Lee,S.G. and Hung,P.P.
TITLE Conservation of essential sequences in the major late promoter and
tripartite leader of the simian adenovirus type 30
JOURNAL Gene 89, 275-277 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
C.L.Hsiao, 22-JAN-1990.
FEATURES from to/span description
pre-msg 126 > 630 leader sequence 1 mRNA and intron
IVS 167 > 630 leader sequence 1 intron A
signal 95 100 TATA box
signal 273 292 downstream promoter element
site 64 73 upstream regulatory sequence
site 210 220 downstream regulatory sequence
BASE COUNT 128 a 150 c 214 g 138 t
ORIGIN Map position 16.0-16.6.
1 acggtgtgca ggcagaggtc cccgtcctcc gcatccaaaa aggtgattgg cttgtaggtg
61 taagtcacgt gaccttcctt tgggggcggg gggcgataaa agggggcggc gccgtcgtcg
121 ccgtcactgt cctctgcgtc gctgtggacg atcgccagct gctcgggtga gtagaggcgc
181 tcgaaggcgg gcatgacgtc ggcgctgagg gtgtcagttt ctacaaacga ggaggatttg
241 atgttaacct gcccggagcg atgcctttga gaagggcggg gtcgagctgg tcggcaaaaa
301 caattttttt attgtccagc ttagtggcaa aggacccgta gagggcgtag gtcgtaagaa
361 gcttcttgct ttttttccca cagctcgcga ttcaagaggt actcttggcg gttctgccag
421 tactcgggaa gcggaaaccc ctgcgcgtcg gctcggtaag cgcccagcat gtaaaattcg
481 ttaggcgctg acgatgcatt tgattaactg ctgcgtaggc acttgacgcc aggacctgaa
541 ggcggagaaa tccaccggat cggagaactt gtcgaggaag gcgtgtagcc agtcgcagtc
601 gcaaggtaag ctgaggacgg tttccggggg
//
LOCUS ADAMLPA2 135 bp ds-DNA VRL 09-AUG-1990
DEFINITION Simian adenovirus 30 leader region 2 DNA.
ACCESSION M34220
KEYWORDS promoter.
SEGMENT 2 of 3
SOURCE Mastadenovirus s30 viral DNA.
ORGANISM Mastadenovirus s30
Unclassified.
REFERENCE 1 (bases 1 to 135)
AUTHORS Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T.,
Lee,S.G. and Hung,P.P.
TITLE Conservation of essential sequences in the major late promoter and
tripartite leader of the simian adenovirus type 30
JOURNAL Gene 89, 275-277 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
C.L.Hsiao, 22-JAN-1990.
FEATURES from to/span description
IVS < 1 35 leader sequence 2 intron N-1
IVS 108 > 135 leader sequence 2 intron N
site 36 107 leader sequence 2
BASE COUNT 28 a 36 c 36 g 35 t
ORIGIN About 0.8 kb after segment 1; map postion 26.1-26.6.
1 aggtcgtaag aagcttcttg ctttttttcc cacagctcgc gattcaagag gtactcttgg
61 cggttctgcc agtactcggg aagcggaaac ccctgcgcgt cggctcggta agcgcccagc
121 atgtaaaatt cgtta
//
LOCUS ADAMLPA3 147 bp ds-DNA VRL 09-AUG-1990
DEFINITION Simian adenovirus 30 leader sequence 3 DNA.
ACCESSION M34221
KEYWORDS promoter.
SEGMENT 3 of 3
SOURCE Mastadenovirus s30 viral DNA.
ORGANISM Mastadenovirus s30
Unclassified.
REFERENCE 1 (bases 1 to 147)
AUTHORS Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T.,
Lee,S.G. and Hung,P.P.
TITLE Conservation of essential sequences in the major late promoter and
tripartite leader of the simian adenovirus type 30
JOURNAL Gene 89, 275-277 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
C.L.Hsiao, 22-JAN-1990.
FEATURES from to/span description
IVS < 1 35 leader sequence 3 intron N-1
IVS 123 > 147 leader sequence 3 intron N
site 36 122 leader sequence 3
BASE COUNT 34 a 33 c 53 g 27 t
ORIGIN About 2.5 kb after segment 2; map position 19.3-19.9.
1 ggcgctgacg atgcatttga ttaactgctg cgtaggcact tgacgccagg acctgaaggc
61 ggagaaatcc accggatcgg agaacttgtc gaggaaggcg tgtagccagt cgcagtcgca
121 aggtaagctg aggacggttt ccggggg
//
LOCUS TFEMERA 1730 bp ds-DNA BCT 09-AUG-1990
DEFINITION T.ferrooxidans mercuric reductase (merA) gene, complete cds.
ACCESSION M32353
KEYWORDS mercuric reductase.
SOURCE T.ferrooxidans (strain E-15) DNA, clones pTM31[4,5].
ORGANISM Thiobacillus ferrooxidans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Colorless sulfur bacteria.
REFERENCE 1 (bases 1 to 1730)
AUTHORS Inoue,C., Sugawara,K., Shiratori,T., Kusano,T. and Kitagawa,Y.
TITLE Nucleotide sequence of the Thiobacillus ferrooxidans chromosomal
gene encoding mercuric reductase
JOURNAL Gene 84, 47-54 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 65 1702 mercuric reductase
BASE COUNT 323 a 542 c 574 g 291 t
ORIGIN
1 gcgaccgacg gctgcgaaac gcccgccccg cgtagctgag cacatagaca ctttggagga
61 tattatgacc gagaacgcgc ccaccgaact cgctatcact ggcatgacct gcgacggttg
121 cgccgcgcat gtgcgcaaag cactcgaagg cgtgcccggc gtacgcgagg cgcaggtgtc
181 ctacccggat gccacggccc gggtcgtgct ggagggcgag gtgccgatgc agcggctaat
241 caaggcggtg gttgcaagtg gctatggtgt gcatccacgg agcgacggtg cctcctccac
301 aaacgatgga caggagctac acatcgctgt gatcggcacc ggcggagcgg cgatggcgtg
361 cgcattgaag gctgtcgagc ggggcgcgcg cgtgacgctg atcgaacgca gcaccatcgg
421 cggcacctgc gtgaacatcg gttgcgtgcc gtccaagatc atgatccgcg ccgcccatat
481 cgcccacctc cgccgggaaa gcccattcga tggcggcatc caggcggtcg cgccgaccat
541 ccagcgcaca gcgctgctgg tccaacagca ggcccgtgtc gatgaactgc gtcacgccaa
601 gtacgaaggc atcctggacg gcaacccggc catcaccgtt ctgcgcggtg aagcgcgttt
661 caaggacagc cggagtgttg tcgtccattt gaacgatggt ggcgagcgcg tcgtaatgtt
721 cgaccgctgc ctggttgcca cgggcgccag tccggccgtg ccgccgattc ccggcttgaa
781 agacactcct tattggacct ccaccgaagg gctggtcagc gaatcgatcc ccgagcgtct
841 ggccgtgatc ggctcgtcgg tggtggcgct ggaactggcg caagccttcg cccggctcgg
901 cagccatgtg acgatcctgg cgcgcggcac cttgttcctc cgggaagacc cggccatcgg
961 tgaggccatc acggcggcgt ttcgcgccga aggcatcgag gtgctggagc acacccaggc
1021 cagccaggtc gcttatgcgg atggcgaatt tgtgctagcc accgggcacg gcgaactgcg
1081 cgccgataag ctgctggtcg ccactggtcg cgcaccgaac acacgccgcc tgaatctgga
1141 agcggcgggc gtggccatca atgcgcaagg ggccatcgtc atcgaccagg gtatgcgcac
1201 gaacagcccg aacatttacg ccgctggcga ctgcaccgac cagccgcaat tcgtctacgt
1261 ggcggcagcg gccggcaccc gtgcggccat caacatgatg ggcggtagtg cagccctgga
1321 cttgacggcg atgccagccg tggtgttcac cgatccgcaa gtggcgactg tgggttacag
1381 cgcggaagcg catcgcgacg gcatcgaaac cgacagccgc atgacgctcg acaacgtgcc
1441 gcgggcgctc gccaatttca atacacgcgg cttcatcaag ctggtagccg aagtgggcag
1501 tggctcgcta atcggcgtgc aggtggtcgc cccggaagcg ggcgagctga tccagactgc
1561 cgcgctggcg attcgtaacc ggatgacggt acaggaactg gctgaccagt tgtttcccta
1621 cctgacgatg gtcgaagggc tgaagcttgc tgcccagacc ttcaccaggg atgtgaagca
1681 gttgtcctgc tgtgcgggtt gagacggatt gataaaggag tccctgttgc
//
LOCUS MMTELPMA 830 bp ss-RNA VRL 09-AUG-1990
DEFINITION Mouse mammary tumor virus (MMTV) phorbol myristate acetate induced
mRNA, clone 14.
ACCESSION M37198 M19737 M19738 M22729
KEYWORDS .
SOURCE Mouse mammary tumor virus, cDNA to viral RNA, clone 14, passed in
EL4.E1 cells.
ORGANISM Mouse mammary tumor virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses.
REFERENCE 1 (bases 1 to 202; 692 to 720)
AUTHORS Elliott,J.F., Pohajdak,B., Talbot,D.J., Shaw,J. and Paetkau,V.
TITLE Phorbol diester-inducible, cyclosporine-suppressible transcription
from a novel promoter within the mouse mammary tumor virus env gene
JOURNAL J. Virol. 62, 1373-1380 (1988)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 830)
AUTHORS Paetkau,V.H.
JOURNAL Unpublished (1990)
STANDARD full staff_review
FEATURES from to/span description
pept 168 575 PMA-induced transcript
mRNA 1 > 830 PMA-induced transcript mRNA
revision 1 1 c in [2]; g in [1]
revision 39 39 g in [2]; a in [1]
site 94 95 intron site
revision 119 119 a in [2]; g in [1]
site 706 707 site of 491 bp deletion relative to MMTV virus
BASE COUNT 233 a 183 c 184 g 230 t
ORIGIN
1 cactgccaga tcgcctttaa gaaggacgcc ttctgggagg gagacgagtc tgctcctcca
61 cggtggttgc cttgcgcctt ccctgaccaa ggggtgcctt gcgaagagcc ttgaccaaat
121 gcagtcagat cttaacgtgc ttcttttaaa aaagaaaaaa gggggaaatg ccgcgcctgc
181 agcagaaatg gttgaactcc cgagagtgtc ctacacctag gggagaagca gccaaggggt
241 tgtttcccac caaggacgac ccgtctgcgc acaaacgggt gagcccatca gacaaagaca
301 tattcattct ctgctgcaaa cttggcatag ctctgctttg cctggggcta ttgggggaag
361 ttgcggttcg tgctcgcagg gctctcaccc ttgactcttt taatagctct tctgtgcaag
421 attacaatct aaacaattcg gagaactcga ccttcctcct gaggcaagga ccacagccaa
481 cttcctctta caagccgcat cgattttgtc cttcagaaat agaaataaga atgcttgcta
541 aaaattatat tttaccaata agaccaatcc aataggtaga ttattagtta ctatgttaag
601 aaatgaatca ttatctttta gtactatttt tactcaaatt ctgttgttag aaatgggaat
661 agaaaataga aagagacgct caacctcaat tgaagaacag gtgcaaggat gtgagacaag
721 tagtttcctg acttggtttg gtatcaaatg ttttgatcta agctctgaat gttctattct
781 cctatgttct tttgcaactt atccaaggtc ttatgtaaat ggcttagtaa
//
LOCUS MUSPBGD1 2663 bp ds-DNA ROD 09-AUG-1990
DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 1.
ACCESSION M28663 M29949 J04981
KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT 1 of 4
SOURCE Mouse (strain C3H) DNA, clone PBGD.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2663)
AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
TITLE The mouse porphobilinogen deaminase gene: Structural organization,
sequence, and transcriptional analysis
JOURNAL J. Biol. Chem. 264, 14829-14834 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Grandchamp, 06-OCT-1989.
FEATURES from to/span description
pept 505 537 porphobilinogen deaminase (housekeeping) exon
1 (EC 4.3.1.8)
2191 2307 porphobilinogen deaminase (housekeeping) exon
2
2457 + 2510 porphobilinogen deaminase (housekeeping) exon
3
pep$ 2475 + 2510 porphobilinogen deaminase (erythroid sp.) exon
1
pre-msg 341 > 2663 PGB deaminase (hk) mRNA and introns
IVS 538 2190 PGB deaminase (hk) intron A (no splice
consensus)
IVS 2308 2456 PGB deaminase (hk) intron B
IVS 2511 > 2663 PGB deaminase (hk) intron C
pre-msg 2192 > 2663 PGB deaminase (ery.sp.) mRNA and introns
IVS 2511 > 2663 PGB deaminase (ery.sp.) intron A
binding 1126 1133 NPE binding site
binding 312 317 Sp1 binding site
binding 329 334 Sp1 binding site
site 2101 2107 CACCC box
site 2118 2124 CACCC box
BASE COUNT 622 a 661 c 707 g 673 t
ORIGIN Chromosome 9
1 ccacccccac cccacacaca cacacaaagt aaatagggct ggagagctta gtggttaaga
61 gcactgactg ctctttcaga ggtcctgagt tcaattccca gaaaccacat ggtgctcaca
121 accatctgca atagggtctg atgccctttt ctggtgtgtc taaagaagag agcaatggtg
181 tactcatata cataaaataa ttttttttaa aaagtaaaag ataataaaaa ttgaaaagga
241 aaaaaatctt tttgagttgt tctgtgcagt ggacttgagc gaaaaggctg gctatgtcgc
301 aatcctaatt cccgcccaga ggaaggcacc gccccgttga gggagggcag cggacgtgac
361 gcagagctca gcaggtcctg cagccggagt gaagtgcggg ctcgggcccc atgtgccttc
421 agtcccggcc ggcccaggtc gtcggcttct gcagacacca ggggaccgca gcggcactgc
481 cgcgcctgcg ccctgggcgg agtcatgtcc ggtaacggcg gcgcggccac aaccgcggtg
541 agttctgagc cggtgaccga tgacccgcac ttctcggggc tttctgggtg caacgattgg
601 ccccgggttg ccatgttctc gtcgtctatt ggtcggaata gttagctgtc atttttcccc
661 ccccacacct caaggttttt tttaaagggc cagtaactag gttgccctaa ggcagggaag
721 gagtgatctc gagcagtggg ggcggggttg tgagtggaaa ggtggtccgc cctgggattc
781 catccctgta ggctctggct ggatctctgt tgttcccgac cagtaaagga ttatgcacag
841 acaagatcct tttcacgaag aaggggctga ggcaaatcca gctatctcgg aatacgatcc
901 acttcattca ggggagagca caccccactt cttaaaactg tatacaaaca tcttggaggt
961 tacacgcctt ctcccgttct ccgttatgaa gtcacccagc cttagccacc cacaaaagtc
1021 ctagtagaga cacacctgaa ttgctattgt gagcggggga acccacccct gggccttgtc
1081 atttctggcc tgcctggaaa gttctgaact tgtgggcagg ctgcctgaga taaggctgag
1141 ctgggaagct tgcttatctc ctgcccaggc agtaagcagt agtcttggct atgaaaacat
1201 ttttagagca ctgggttagg gtaggaaggc ctggatttca gcacccactt tctgtctgtt
1261 catagctgtg agatgtttag acagtaattt gaccactctg catctttgct tctgtgacac
1321 gggtggaagt acctaccctg tctaacctag tagggttgtt gcaaggacaa tatgcagaca
1381 ctgctcaaat gctgttctgg gtcaatcaat taaaaaacaa attgtttgaa cttagcaatt
1441 cctttctatg ggctccctgt tgtccgaaat ttctgtgtta tttcaagccc agctaaattg
1501 caaaggctat ctcagagtcg tttgttggag gaatcttcgc agtggagtag actggagtcc
1561 aagagcaagt tttcaccttc agtgaccaag aacttgagtg tctggttata gaagaacctg
1621 tgagatgagg aacctggtgc agggaagggg gacaatctgt acagtgactc ctgtcccctt
1681 tgtatcagac tgcagaaccc agttctacct gcttggccct agacaccttt atccaaggcg
1741 ccttaacaaa agaaagaggt gtgtcctttt gagctcttgg ctctggctta agacaccaga
1801 ggaaacccgt aggcaatgac tgttaggcag tttattcttg tagtcttctg ggacttcttg
1861 aggcatgagg tggcctttaa tttaacaagc ccttgatggg atgatgttcc caaagtcacc
1921 caccaagggc atgaaagggc tgtacattag cttggttgat ttcagtcctt gttaggagta
1981 catcctggtg tctcacccag ggcttagtga ggccttctca agtgcctgag ttgttgtgga
2041 cagtgagctt gttctctagc aatgggaggc ttcagctgtc ctgccccagc ttctgtaggc
2101 cccaccctcc agcagggccc accctcactg tgccgaggct gatgggcctt atcattttgc
2161 ccacctggct gtgtgcagcc ctcccactca gaacctcctt ggccaggctg ggctttgggg
2221 ctcagtgtcc tgttgctgct gccacaacag atcctattac agcttttctt ctggtcttgc
2281 ttctctggat cccgtagagg gcagaaggta ccaaggaaga ttcaaggacc agtcctggga
2341 gtctctcctt cctagcagcc tcacctgcct aggacccggg agtcctctct cctaagcctg
2401 tgatcctagt tctttgaatg aggaaaagat cgtaacctag ggactttctt ctgcaggaag
2461 aaaacggctc aaagatgagg gtgattcgag tgggcacccg taagagccag gtgagtacag
2521 acatagcgcg ttgcctcaag aattgtaatg ctcacgggtc actagtggga accaaaggct
2581 agcatcgagc aaataagagt gtgtgagagt cgatttcatg ggggatggca gctcacttcc
2641 tctgaaaaga gagtctctgg agc
//
LOCUS MUSPBGD2 2763 bp ds-DNA ROD 09-AUG-1990
DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 2.
ACCESSION M28664 M29950 J04981
KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT 2 of 4
SOURCE Mouse (strain C3H) DNA, clone PBGD.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2763)
AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
TITLE The mouse porphobilinogen deaminase gene: Structural organization,
sequence, and transcriptional analysis
JOURNAL J. Biol. Chem. 264, 14829-14834 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Grandchamp, 06-OCT-1989.
FEATURES from to/span description
pept + 52 124 porphobilinogen deaminase (housekeeping), exon
4 (EC 4.3.1.8)
487 536 porphobilinogen deaminase (housekeeping), exon
5
619 674 porphobilinogen deaminase (housekeeping), exon
6
983 1060 porphobilinogen deaminase (housekeeping), exon
7
1971 2048 porphobilinogen deaminase (housekeeping), exon
8
2143 + 2218 porphobilinogen deaminase (housekeeping), exon
9
pep$ + 52 124 porphobilinogen deaminase (erythroid sp.) exon
2
487 536 porphobilinogen deaminase (erythroid sp.) exon
3
619 674 porphobilinogen deaminase (erythroid sp.) exon
4
983 1060 porphobilinogen deaminase (erythroid sp.) exon
5
1971 2048 porphobilinogen deaminase (erythroid sp.) exon
6
2143 + 2218 porphobilinogen deaminase (erythroid sp.) exon
7
pre-msg < 1 > 2763 PGB deaminase (hk and ery.sp.) mRNA and
introns
IVS < 1 51 PGB deaminase (hk) intron C; ery.sp. intron A
IVS 125 486 PGB deaminase (hk) intron D; ery.sp. intron B
IVS 537 618 PGB deaminase (hk) intron E; ery.sp. intron C
IVS 675 982 PGB deaminase (hk) intron F; ery.sp. intron D
IVS 1061 1970 PGB deaminase (hk) intron G; ery.sp. intron E
IVS 2049 2142 PGB deaminase (hk) intron H; ery.sp. intron F
IVS 2219 > 2763 PGB deaminase (hk) intron I; ery.sp. intron G
BASE COUNT 704 a 605 c 726 g 728 t
ORIGIN Chromosome 9; 150 bp upstream of segment 1.
1 gaaaggcagt ggccagggga ggtgagaaac catctgactc tctttcccca gctggctcgc
61 atacagaccg agactgtggt ggcgatgctg aaagccttgt accctggcat acagtttgaa
121 atcagtaagt tttcttgaga ggagtgattg gtagtgaacg ggaagccagt gaaccggagg
181 acagggcatc tctcgtttgc ctgtggtcaa agcctgcctt gtaagactat tctggctgct
241 tgtgaaggga aagaaagatt gtctcctgtg cacatctcct ccagctgccc gggctagcct
301 gacatttcca tactttctgc tttgggttct tttatgagta tgtctgcttt ttctgtcggt
361 gtgtgtatct gagagagtta ggggctgggt cttctatgcc tcagactcca ctgtgaatcc
421 agtcaaggcc tgaacgaggg gtgactcagt aggtgttaat gggtatctga ttgactctct
481 cctcagttgc tatgtccacc acgggagaca agattgttga tactgcactc tctaaggtaa
541 cgccagtcct tgtcccattc ttcttgtccc tctcccacgt gtaaggggtt cactctgagg
601 ctctctcttg cctggcagat tggagagaag agcctgttta ccaaggagct agaaaacgcc
661 ctggaaaaaa acgagtgagt gaggatggag gaatgtggta ccccgagcct agaaccccaa
721 agtggctctc caatattggc aggattgtcg ggttagactg tggagctcac aggctttcac
781 agagaagaga gccttgcctt ggagtagcct aactacctgg ggaatcagac tgccggggga
841 aaggggtaga gtagttgaga agagaccagg tcttagatct taagatgcta tcttcctgaa
901 cggtcaagga tgctggggtg ggtggtggag ataaggtcac ctactcaaag cctctctctg
961 tgcctccccc tgccgtctcc agagtggacc tggttgttca ctccctgaag gatgtgccta
1021 ccatactacc tcctggcttt actattggag ccatctgcaa gtaagcgggg aggacatgca
1081 tgggacggag ggccctgggc aggattaatc ctactgtggg aatctttgag tttttttttt
1141 ttttttttcc atttggaact taaccgctta gccgtctgtt ttgaaggttc tcagacatag
1201 tgtggcagga aagccaattg gttgacttgg ttgactattt agagtttgtg gagttgggct
1261 cagtggcacg gacctgaaat cccagctact gggaggctaa gacaggatca gagattctgg
1321 gccagcctgg gctacagagg gatttgaacc agcctgagga acttagattg tgccttaggg
1381 gcacagaagg ctggcttaca gtggcttagg tggtaaaggc attttttgct gtcaagccaa
1441 tgacctgagt tcagtccgtg gggtgcactt ggtgaaagaa gagggttgaa tcccacaagt
1501 tgtcatctga ctcatgcata catgctgtag aatgtttatg ctcctcatcc ctcaatgaaa
1561 atggaaacaa tcaaggaaat gaaatataaa acctgctggg tggtggtgcg cacgcctata
1621 atcccagcac ttgggaggca gaggcaggtg aattcaacct ggtctacaaa gtgagttcca
1681 ggactataca gagaaaccca gtcttaaaaa caaaacaaaa ctaaacaaca acaacaacaa
1741 caacaaaaaa gaaaaaacaa agaaagaaat ataaaacctt tccaaagaaa ataaaatgaa
1801 tttggcctgg tggctcatgc tataatctca gcattcagag agctgaggca ggagggttat
1861 tgtgagttaa aggctagctg gggtacagag aaaattttag gtcacctggg ctagagttaa
1921 ccctatctcc aaatgctaat acctttattt catcatcatt tgctttgcag acggcaaaac
1981 ccttgtgatg ctgttgtctt tcacccaaag tttattggaa agaccctgga aaccttgcca
2041 gagaaaaggt gagtgggcct agtgtgcggg ggagagaggc ctggacagtg gagaacagtt
2101 ggcagcctgg gttaagttta attctaaact ctctctgagc agtgccgtgg gaaccagctc
2161 tctgaggaga gtggctcagc tacagagaaa gttccccaac ctggaattca agagtattgt
2221 atcctttcag aagaaggagg ggaaaaagag ggaaagaagg accttccgaa gcaagtggtc
2281 catgcggtca gggggtcgtc tttccatctg tccgtccacc cacccaccca cccatccatc
2341 catccatcca cacatccaca gtcctttaat gttttgcttt tttttttttt tcctgagaca
2401 gggtttctct gtgtagcctg gctgtcctgg aactcacttt gtagaccaag ctggcctgca
2461 aagtgagaaa tccgcctgcc tctgcctcct gagtgctggg attaaaggtg cgccaccact
2521 gcctggcacc ctaatgtttt ttaaactcag gcctggcaat gaggacaatt tgcaaaacaa
2581 acatggttcc ttgttctata cagctgacat gttagacaga caggcaggca ctgcagatac
2641 tgaccggtga ccactcctgg tgcagggaca gaggcgcttc tgcttttact ttctgtgctg
2701 ctaagtggtt ttggttttta cagtgaatat gtgatatgtt tcataaaagt aatttttttt
2761 tct
//
LOCUS MUSPBGD3 800 bp ds-DNA ROD 09-AUG-1990
DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 5.
ACCESSION M28665 M29951 J04981
KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT 3 of 4
SOURCE Mouse (strain C3H) DNA, clone PBGD.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 800)
AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
TITLE The mouse porphobilinogen deaminase gene: Structural organization,
sequence, and transcriptional analysis
JOURNAL J. Biol. Chem. 264, 14829-14834 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Grandchamp, 06-OCT-1989.
FEATURES from to/span description
pept + 402 515 porphobilinogen deaminase (housekeeping), exon
10 (EC 4.3.1.8)
739 + 777 porphobilinogen deaminase (housekeeping), exon
11
pep$ + 402 515 porphobilinogen deaminase (erythroid sp.) exon
8
739 + 777 porphobilinogen deaminase (erythroid sp.) exon
9
pre-msg < 1 > 790 PGB deaminase (hk and ery.sp.) mRNA and
introns
IVS < 1 401 PGB deaminase (hk) intron I; ery.sp. intron G
IVS 516 738 PGB deaminase (hk) intron J; ery.sp. intron H
IVS 778 > 790 PGB deaminase (hk) intron K; ery.sp. intron I
BASE COUNT 181 a 201 c 182 g 236 t
ORIGIN Chromosome 9; 500 bp upstream of segment 2.
1 ctgtacccca gctagccttt aactcacaat aaccctcctg cctcagctct ctgaatgctg
61 agattatagc catgagccac caggccaaat tcattttata tttctttctt tcttttttct
121 tttttgttgt tgttgttgtt gttgtttagt tttgttttgt ttttaagact gggaaactct
181 gtatagtcct ggaactcact ttgtagacca gatttagcct tgaattcatg gagatctgta
241 tctgcctcca gtgctgggat ttaaaggtgt atacaccacc actcaacaaa aacacaacaa
301 aaacaaaagt tttttaaaag ttagctagag gggggaaaag agactgtggg gcagagggtg
361 cactgggtag gtcttgactt ctccttagca acgctccaca gcggggaaac ctcaacaccc
421 gccttcggaa gctggatgag ctgcaggaat tcagtgccat tgtcctggct gtggctggcc
481 tacagcgcat gggctggcag aaccgggtgg gccaggtagg agctgccctg ttctgcttcc
541 cattgaatct gcctctctcc tgccttgatt tcttggtgac cattctgcca acaacactac
601 aaccagaagc ccaggctagg gatattggga ctcattgctg gatttcctac ctgtgccttc
661 cccaggcttc ctagattgca aaccctagct cactgccttt gaacatcccc tatcccacca
721 tcttgtctct ctccacagat tttgcaccca gaggaatgca tgtatgctgt gggtcaggta
781 ggtaggtttg cctggagaga
//
LOCUS MUSPBGD4 1386 bp ds-DNA ROD 09-AUG-1990
DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 4.
ACCESSION M28666 M29952 J04981
KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT 4 of 4
SOURCE Mouse (strain C3H) DNA, clone PBGD.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1386)
AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
TITLE The mouse porphobilinogen deaminase gene: Structural organization,
sequence, and transcriptional analysis
JOURNAL J. Biol. Chem. 264, 14829-14834 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Grandchamp, 06-OCT-1989.
FEATURES from to/span description
pept + 154 273 porphobilinogen deaminase (housekeeping), exon
12 (EC 4.3.1.8)
459 512 porphobilinogen deaminase (housekeeping), exon
13
618 704 porphobilinogen deaminase (housekeeping), exon
14
783 956 porphobilinogen deaminase (housekeeping), exon
15
pep$ + 154 273 porphobilinogen deaminase (erythroid sp.) exon
10
459 512 porphobilinogen deaminase (erythroid sp.) exon
11
618 704 porphobilinogen deaminase (erythroid sp.) exon
12
783 956 porphobilinogen deaminase (erythroid sp.) exon
13
pre-msg < 1 1279 PGB deaminase (hk and ery.sp.) mRNA and
introns
IVS < 1 153 PGB deaminase (hk) intron K; ery.sp. intron I
IVS 274 458 PGB deaminase (hk) intron L; ery.sp. intron J
IVS 513 617 PGB deaminase (hk) intron M; ery.sp. intron K
IVS 705 782 PGB deaminase (hk) intron N; ery.sp. intron L
BASE COUNT 372 a 305 c 343 g 366 t
ORIGIN Chromosome 9; 80 bp upstream of segment 3.
1 atcagtagtt cctgaaacct gttcatacct tgcacctcta tccatcaata atgttaaaga
61 caggtttgtt gttatgcata acccaggaag cagtagaggt gtgtttctca tcttagctct
121 attactagag aagaacagcc tgttgttctt tagggggccc tagccgtgga agtccgagcc
181 aaggaccagg atatcttgga cctagtgagt gtgttgcacg atcctgaaac tctgcttcgc
241 tgcattgctg aaagggcttt tctgaggcac ctggtaagat gggctcctcc catggtgttg
301 tggggaaacc aggaagggca gtagggaggg agatttgtca agtactcagt atgtaatgtt
361 ttgtatgtat ggagaggacc ttgatctggc ctcttgaggt ctgtggtcaa aagtggtgtt
421 aaaggccctt agagctcaaa ggaacaatat cattgcagga aggaggctgc agcgtgcccg
481 tagcagtgca tacagtgata aaggatgggc aagtaagcca gggaaatgga tgaggggagg
541 gactgtcatt tccatgtgca cccaaacatc taagtaactt tctttaaaca tcctggtaca
601 aacattttat ttcctagctg tacctgactg gtggagtatg gagtctagat ggctcagata
661 gcatgcaaga gactatgcag gccaccatcc aggtccctgt tcaggtattg actgggagat
721 gaggaggaat aaatagaact cttgtaatct tcctcttacc aaaattgtaa cctgtcatcc
781 agcaagaaga tggtccagaa gatgacccac aactggttgg aatcactgcc cggaacattc
841 caagaggagc ccagctagct gctgagaacc tgggcatcag cctggccagc ttgctgctca
901 acaaaggagc caagaacatc ctggatgttg cacggcagct taatgatgtg cgctaactgg
961 tctgtagggc acaggaaccc tggctgccac tccagtgcct acttctggct tccaagtgcc
1021 ctgtgctcca tccctagggg tgtgattatc ccaggaaatt gaaccacagg gttgttgaga
1081 cttccacttt ggaagatatg cctcaccttg gggcctccat atctgccttt ccctcagtag
1141 ttgggggctt catctcttta gagaaagtcc atgccaatct ttgaatgtaa ccaataccac
1201 taataaacca gtttagaatg tggttcttct gatagagttg gggaagatat gaataaaccc
1261 aaagcccttt taaacttgaa tgagtctgag acctttctgt tgtaaaacac gctgtgattt
1321 gcctcatgtt ctcaaaaaaa aaaaaaaaaa tcagccttta attcctacag cctgtcttca
1381 gtcgac
//
LOCUS HUMIBP3 10884 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human insulin-like growth factor-binding protein-3 gene, complete
cds.
ACCESSION M35878 M35879 M35880 M35881 M35882 M35883 M35884 M35885 M35886
M36121 M36122 J05537 J05538
KEYWORDS insulin-like growth factor-binding protein-3.
SOURCE Human leukocyte DNA and, cDNA to mRNA, clone #HL1006d.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 10884)
AUTHORS Cubbage,M.L., Suwanichkul,A. and Powell,D.R.
TITLE Insulin-like growth factor binding protein-3: Organization of the
human chromosomal gene and demonstration of promoter activity
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.R.Powell, 03-JUL-1990.
The sequence presented here appears in Figures 2 and 3 of ref. [1].
FEATURES from to/span description
pept 2038 2440 insulin-like growth factor-binding protein-3
precursor (IGFBP-3), exon 1
5726 5952 insulin-like growth factor-binding protein-3
precursor, exon 2
6497 6616 insulin-like growth factor-binding protein-3
precursor, exon 3
8212 8337 insulin-like growth factor-binding protein-3
precursor, exon 4
sigp 2038 2118 insulin-like growth factor-binding protein-3
signal peptide, exon 1
matp 2119 2440 insulin-like growth factor-binding protein-3,
exon 1
5726 5952 insulin-like growth factor-binding protein-3,
exon 2
6497 6616 insulin-like growth factor-binding protein-3,
exon 3
8212 8334 insulin-like growth factor-binding protein-3,
exon 4
pre-msg 1906 10775 IGFBP-3 mRNA and introns
IVS 2441 5725 IGFBP-3 intron A
IVS 5953 6496 IGFBP-3 intron B
IVS 6617 8211 IGFBP-3 intron C
signal 102 1905 bp 3 promoter
binding 1808 1821 Sp1 and AP-2 binding domains
signal 1876 1881 TATA box
site 5728 5736 potential ASN-linked glycosylation site
site 5917 5926 potential ASN-linked glycosylation site
site 7087 7255 region homologous to IGFBP-3 genomic sequence
signal 10751 10756 Poly-A signal
BASE COUNT 2796 a 2578 c 2737 g 2773 t
ORIGIN
1 ctgcagacct gggacctcaa gaattgcatt tgatgccgaa cccagctcta atttcagagt
61 caaggtctct gcgagtattt aaggaacgga tgtaaacctg ggggattcgt tttgtttcct
121 tcaattttcc aatgaaatca gagatcctgt tcttgggtgt caacgcagat actagaagga
181 ggtgatacaa gagaaaggaa acagcaagcg acgattatgg cacggtttcc tgtaaacaag
241 gttgagtgta gccacagcct gagcactgtg ggagaagagc tcataagaaa atgacggtgc
301 tgggccttcg tcaccccggg gccctccatt gttcttgtct ttggtctctt tttatttgta
361 gaggtccaat tatttattta tttagtacaa gagggaacga aattgatctt tccattctaa
421 aaggagagta tatatgtata aaaggaagct gtatagatat gggggaagag gtggacaggg
481 ggaaaagggg agaggacgag agagagaaag ggagggagag ggacaaggag agacactggg
541 cgagagatcg attaggagag acagaaatga tgaatgaaga ttaacttcac ccaaggcttc
601 gtcgctggag gggaatggag gagctcctga tttgctatta ctactccaaa ctgcaaaggg
661 ctccttcaag tcacctatcc acctcctaag gcaagcgtcc aatttcaaca gcgttcagga
721 aagtctcctc ccgcggaggt ctcaccgctt cccactccac ccccacaaac tctttggaaa
781 agtgccttga aaaatttaat cctcaatcca atcctggacc accagcgtcc tctgttggtc
841 accgaaggag ggggtgcgca gacaaaactg aagaaactcg agtgccagag aaggccgaca
901 ggagttacag cgacctcagc gcgcaattgc gccccgaact ttactgaaaa gtgtttagat
961 tgcagagata agctagaatc ccaacgcatc gagaatacag taatacgaag tcgccttcaa
1021 aaaatgacaa tgaaaattgc ctattaaagg actatttggt taattacgtt tcagcagtgc
1081 ccagtttatt gtctttatta ttcttttgtc gtgggtgtaa actccatttg aaaacataat
1141 cagggagaat acccaagaca agaagaacag ttgtcattta aaatatttga aaagccctgc
1201 cttaaggagc attcgcttgc cggtccactc ttaattgggg acttgcggtg tagcaacacg
1261 tgagagtctt cttgcgttga gaagtaagcc tggaaaggcg aaggccccgg ggcatcttca
1321 gatgcgtatt tgtgggcccc tggggatata aacagcccag cgggtgtaaa ttaaaccccg
1381 cagtgccttg gctccctgag acccaaatgt aagtcagaaa tgtcccaaga cttcgcctgc
1441 caacggaatt aaattttaga aagctccacg aggtacacac gaatgcggag cgctgtatgc
1501 cagtttcccc gacaccggct cgccgcaggg agacctcacc ccgagagcgg aaggggtaag
1561 ggcggcgggg tcaaggagat cgggggtgct gagttggcca ggagtgactg gggtgaccgg
1621 gggtgctgag gtggcctgga gtgccggggt ggccgggcac accttggttc ttgtagacga
1681 caaggtgacg ggctccgggc gtgagcacga ggagcaggtg cccgggcgag tctcgagctg
1741 cacgcccccg agctcggccc cggctgctca gggcgaagca cgggccccgc agccgtgcct
1801 gcgccgaccc gcccccctcc caacccccac tcctgggcgc gcgttccggg gcgtgtcctg
1861 ggccaccccg gcttctatat acgggccggc gcgcccgggc cgcccagatg cgagcactgc
1921 ggctgggcgc tgaggatcag ccgcttcctg cctggattcc acagcttcgc gccgtgtact
1981 gtcgccccat ccctgcgcgc ccagcctgcc aagcagcgtg ccccggttgc aggcgtcatg
2041 cagcgggcgc gacccacgct ctgggccgct gcgctgactc tgctggtgct gctccgcggg
2101 ccgccggtgg cgcgggctgg cgcgagctcg gggggcttgg gtcccgtggt gcgctgcgag
2161 ccgtgcgacg cgcgtgcact ggcccagtgc gcgcctccgc ccgccgtgtg cgcggagctg
2221 gtgcgcgagc cgggctgcgg ctgctgcctg acgtgcgcac tgagcgaggg ccagccgtgc
2281 ggcatctaca ccgagcgctg tggctccggc cttcgctgcc agccgtcgcc cgacgaggcg
2341 cgaccgctgc aggcgctgct ggacggccgc gggctctgcg tcaacgctag tgccgtcagc
2401 cgcctgcgcg cctacctgct gccagcgccg ccagctccag gtgagccgcc cgccaggtgc
2461 gctgcgtgca gcaccgccac tggcgccgaa gggcctgggg gttgctgggt gccgctgcgg
2521 gagactccgc ttttcttctc actggagata atatgtgggg aaactgaagg cgctccggga
2581 aaggtgaagg cggtcgccga gggaccctcc ccagccggcc ctctacttgc tcgattctct
2641 aagtgcagag tacttgtaaa ttgcaaagcg ctttcagtga aaatgggtaa aggtttccgg
2701 agctgagggg agcggtaccg atgtttagct gttggaaaga tcctggacac aggagattct
2761 cctcgccccg cacgggtgca cacggactgc aatcccaggg atgcttgggg atggggggat
2821 ataggcggat ttggaccaag gaaggtgggt aggcacgttg taggaaatag tacctctctt
2881 ttaaaatact gactttgcac agccttttgg tttgcaaagc aatgtctagt cccggtatgt
2941 ccaaaaacaa gtaaagtgga ttcgggtttt gatatcttct gcggttggaa aacctgaagc
3001 tgaaaaagaa gtaacttctt aaggttaccc agcggccaca acagagtgta ggtttgaact
3061 ccgcgtgcca ctttcagtac cataccattc ttacaactcg ggccacccct gcacctgcgc
3121 cgacctcaaa caaacttcca ggtgcgtggt gggtgcgggc aatgtggact aagtcaattt
3181 caatgacacg gcaagggaat tggaatcagt cctaggctgt ctcccttctt aatctgaaat
3241 gggggggggg aatgagatgt tgttaagggg agccccagaa gaggaaaaat gcaaacattt
3301 ggcagagtta ccctcttgct tagccactat cagtatcagg cagacagcga ctctggtaag
3361 ggcatcacat tgttccctta aaaaaaggag cgggggttgt ttaaatggat ttggcagctg
3421 ttctttcaag cattcttagc cagcctcacc tagttatatg agaaataaag ttcctgcctt
3481 gcacagctga aggctgggag aattctcccc atcctaattc ccccaactcc ccaacgatca
3541 cgttggacag atgtcactgg gcaggccccc atctagggct agcaggatga acagtccctt
3601 tataatttat gtagctgtag agttccacgc ccgggtgaag ttattttctg gctcggcaag
3661 gctggctctg ttcacccctg agaaatgctg gattcatgga aaggcaagat gcctgaaaca
3721 tacactggct ctggtcagct gttaaagctg ctggaggcat ttgtctctcg gggcaaagtt
3781 atgtcatttg ccaagtgtcg tacattattg tgcattttgg ggtattcaaa aagtgatctt
3841 agaaatactg atacacatcg tcattcttgg gctttagcaa tcatcatgat taccacctta
3901 gtagcactgt agtataggtt gatgtgagtt ataagattat aaaaagatct aagtgacttc
3961 tagaatctat ttgacaaaaa aaggtaaatt ttcgacagtc aaaagtcaca attatctgtt
4021 gcttaaatag aactgttttg tcttcatgcc ctagtctgca gcccaggcat taagaagaaa
4081 ccaaggaaat ttaagaaatt actcaaggtt cttagaaaag aagtataaat acgtttattt
4141 acatgttctt agagtattta cattcttagt atctctttta tctcagtatt tccttgaaaa
4201 agaaagcaag ctaagattaa aagaaattga aaccaaatcc tcgcaggtag ggacctcctc
4261 tgtgaggctc tgtgctggac cctgggaatg tgtgcttccc aaggtatgaa accccttggg
4321 gaactttaca gcaggacctc agtgagctgt ttggcaggtg aggaaactaa gacccagaga
4381 ggagagggac tttcctaagg ccctggtgag tgacctgcca gtagccactt ccaggggaga
4441 gcagagcatc tgcagccaaa tcattgcagc cccaggtagc tttctagata gactgtggac
4501 cagatgggcc acctgagctc cctgctaggg ttacacatta tagccctgtt tgtgtagtag
4561 agaaatttca tgactctcaa ttgtggactt aagccgatgc ctccagacct tggcatggtc
4621 cacaggccct gggagcatgg gctctgaatg tagcctttga tccccatagc ggtcttacag
4681 cccctccaag ttcattctga agaaggaatg gagtgagaat cctggctgca gatccagtct
4741 tgaatttagt catatactta aaattccaat tcaactgtta acattccagc atccatttta
4801 agcatcagac tttcttcatt tagcactttt tattataaaa gggagatctg ctggaggggg
4861 atttctccta ccccaccccc acccagggaa ggaaaagctc tttggcactt agaagtctga
4921 gccgtgagtg ggactttggc attgtctgca tccatgtgct gctgtgttca cccggggtga
4981 aaaggactca cttaggcagg caccagcaag atgcacaggg tctgtgtaga ccttgagttt
5041 tagagatgta acggggacct agaaaacaag ccaccaacat gcttgcatga ttctgagccc
5101 ctgaggcaaa acgctttgca ggtaataatt cagttttccc atctgagctg gacaccaagc
5161 tcttataagc gtgtttacct ggtagcattg aggacggtac tggtcaacct tggaattccc
5221 ataagggctt gttacaactc agactcgtgc cgccactcca gcgtttccgg agtggagaat
5281 gtgcatttct tccaagtccc cgggctgccg ctgctcccgc gggtgggagg accacacttg
5341 gagttgactg caaaatttct gagccggcgc tgcagcagcc tcccgtggct caggtctgcc
5401 ccctgccggt ggaagatgaa gcatactgcc ttcacctact gaggggcact gaagcgtttg
5461 tctgccttct ttagttgcag ctacttagga agagcacctg tcagattgac tttcaaacag
5521 ataacttctt gaggtagagc aaccaccatg tagtgagtag tatgatggaa taatacttca
5581 tcgaggtatt taaaaaaaaa acctcacttg gattgccaac taatattgtc atttacatgt
5641 gacctggttg caacgttaag atttttacaa gactgtgata gatattgatg actctcatgt
5701 gtttgtctct cttgggcgtt ttaaggaaat gctagtgagt cggaggaaga ccgcagcgcc
5761 ggcagtgtgg agagcccgtc cgtctccagc acgcaccggg tgtctgatcc caagttccac
5821 cccctccatt caaagataat catcatcaag aaagggcatg ctaaagacag ccagcgctac
5881 aaagttgact acgagtctca gagcacagat acccagaact tctcctccga gtccaagcgg
5941 gagacagaat atgtgagagc ttttcctctt gttaaaggag gagggcaaga cctgccaagc
6001 ctgggtactc agagcctctt gagggcaatt cttactcaac aaaccccagc gcctggctga
6061 tgggtgggca acccctagcc cctctgtgcc ctacctctct cctctcctta cataaagaat
6121 attgaccctt ttggagaatc ttatgaggat caagctgaaa taacactctt aaaagcatat
6181 gggatgtcat aaagacctct gcagataatg aaaatattct cataaagata gttttattta
6241 cttcatcctc tatgcttgtt gacctgctat tggttccatg ccagcttctg tgccttactc
6301 tgggaagagc aaaaaggaga cagggagtga tggttagctt attcggggga ctttcgtgct
6361 acatcagaca taaggtatct gaggagcaaa ttacaggtcc cacttttggt agttgtgcag
6421 catcgtaaga tttttaaagc acacattcta gagtaaaaac tgtgactctg ttgctctggt
6481 ccttcctgat ccccagggtc cctgccgtag agaaatggaa gacacactga atcacctgaa
6541 gttcctcaat gtgctgagtc ccaggggtgt acacattccc aactgtgaca agaagggatt
6601 ttataagaaa aagcaggtga gtgaggtcct cagtgtgttt tcttcctctt ctgttgacac
6661 agaggagaaa cccatgtcac cagcgcccag gctcttgtgg ccatagctct aactctgagc
6721 ctgtgcagca ccagtgccca ggacttggtg ccagtctcag gaggtcagac caagggctgc
6781 tttgacttgt tgctctgagt gctgctatat tggccataat cctcaaccct agtgcctttc
6841 caccacccgc ttcccactcc tgtcctttca atggttcacc cacaggcgga caagatgctg
6901 cccagtggca ccctttataa actgcaagtg gacatgttaa cacatttgtt aatgctgcgt
6961 cagggagtga catttcaaac aactattata gtcagtttcc aagaagtgtg acatgaggtc
7021 ataccacaaa aaagcttacc ctgaaatccc acaatcgtcc cctttcctac tgatgccttc
7081 ccgatagtga gcaggttgca atattaagat tttgaaaagg ctgttgctag atgttggtga
7141 ctcgtgtgtc tctgtctccc ttgggctttt caaggaaatg ctagtgagtg gggggatgac
7201 tgcagcatgg ccagcttgga gagcccagcc atccccagca cataccaggt gtctgtcttg
7261 gcgtggaggg gatggaactt gaaatcagac actcggtcca tgctggggat ggccagtctc
7321 tccaaactgg catgtggtct tcctccgagt cactggcatt tccctagaaa gtccaagtga
7381 gaagaaggca tgagagtcat caacatcaaa caacagtctt ttcaaaatct ttatattgca
7441 acatagtccc attcctggaa aaggaatgga gtgagaatcc tggctacaca tcagccccaa
7501 atgtagtcat tgcctaaaat cccaattaac ctgaaaatga tcaaacaaat ttaagatata
7561 gtaatattaa gctgtaataa atatgcttct ataggctttg tgttatgtga tggcactatt
7621 tcaattggct ttctaattgg acaattgata ctatgctatc tacagaattg gcctttggag
7681 acctaagtga gccacagtgg cctcagggtg accatatact aggattcata gcagtggcca
7741 cagtcagaag cctaagcttt cctccattgc cattgctcgt ttataccacg tttctgtcaa
7801 agtcatattc attcaacaaa gtcatactga gaaggtgtca tgtgaggctg gatgtgggct
7861 ccaaagtcat agctgtgaca ttcgcaggca gcgggatgtt ctcagttcca catttggcag
7921 agaagtcagt caagaggttc tacaagggct ggtgtccacc ttatactcct agaaacacaa
7981 aactgccccc acccccgctt tcttggagca ggaagttaca cccacacgca tgcacaggcg
8041 cacactcagc gggcctaggc agcgtggctc ttgtgttgcc ttagctgaaa tttctgttgt
8101 gctttctcag catagcagag tcacgctggc aaaccatcat gcgccctggc caccgacctg
8161 acaccagacc caggagcatt cacttctctg tcttctgttt ctctcccaca gtgtcgccct
8221 tccaaaggca ggaagcgggg cttctgctgg tgtgtggata agtatgggca gcctctccca
8281 ggctacacca ccaaggggaa ggaggacgtg cactgctaca gcatgcagag caagtagacg
8341 cctgccgcaa gggtgagtac tcaggagggg cagcctgggc tccagggcct cactgtcctt
8401 ggaccagcct caggggctgg gcgtggccac tggccttccc caggcttaca gacccaggag
8461 ctgcagctca gggccagaaa gagcaaagca aataggacag agccctcaga agggtgcagg
8521 gagagggaga ccccatcaac ccaaccaaac aagtgtgggg aaggaggccg gccagtgcac
8581 ctcagggaca ctctgcttta tctcagatac ctcacagcac ctaagctatc attcatccac
8641 acacaaagtg aagattttca aagttaggct ttacccgtga gtctggaggt catttatctt
8701 cacagagaac gtttatcgca gactgctaag atacatgttc taattaagat gtgatgtgag
8761 aacgctgaat gctcgttgga gactcagttg aagtgcagct ttttttctgt caaatatata
8821 atgaatattc tgttagtctg tggctaatat aattttaata aagttaattt aaatctgata
8881 gaaaaatgaa attttaaacg ataattttag agaatgctat tatatccagt cttctttttt
8941 cttttaataa atgagggaac tattggggga aaggaataaa tacattttct ttcattttat
9001 taagacaaat ttagtaagca gaagaaattt gcatgtttag ttataagggt ttcttttttc
9061 cttacaagtt ggaaaaaata attctaattt aagggtaact ctttgacaat gaacactgtg
9121 agcagcatct ggtactcgtt gctttgtttg aaaacatgag ttgagacccc agccgcactt
9181 gcagcctagt gccattagcc tgcaggctgt gctggatatc tcagggcaag agtcgagccc
9241 ttttgatttt ggggggatta tttcaatata tttgcttttt ctttttgttt tagttaatgt
9301 ggagctcaaa tatgccttat tttgcacaaa agactgccaa ggacatgacc agcagctggc
9361 tacagcctcg atttatattt ctgtttgtgg tgaactgatt ttttttaaac caaagtttag
9421 aaagaggttt ttgaaatgcc tatggtttct ttgaatggta aacttgagca tcttttcact
9481 ttccagtagt cagcaaagag cagtttgaat tttcttgtcg cttcctatca aaatattcag
9541 agactcgagc acagcaccca gacttcatgc gcccgtggaa tgctcaccac atgttggtcg
9601 aagcggccga ccactgactt tgtgacttag gcggctgtgt tgcctatgta gagaacacgc
9661 ttcaccccca ctccccgtac agtgcgcaca ggctttatcg agaataggaa aacctttaaa
9721 ccccggtcat ccggacatcc caacgcatgc tcctggagct cacagccttc tgtggtgtca
9781 tttctgaaac aagggcgtgg atccctcaac caagaagaat gtttatgtct tcaagtgacc
9841 tgtactgctt ggggactatt ggagaaaata aggtggagtc ctacttgttt aaaaaatatg
9901 tatctaagaa tgttctaggg cactctggga acctataaag gcaggtattt cgggccctcc
9961 tcttcaggaa tcttcctgaa gacatggccc agtcgaaggc ccaggatggc ttttgctgcg
10021 gccccgtggg gtaggaggga cagagagacg ggagagtcag cctccacatt cagaggcatc
10081 acaagtaatg gcacaattct tcggatgact gcagaaaata gtgttttgta gttcaacaac
10141 tcaagacgaa gcttatttct gaggataagc tctttaaagg caaagcttta ttttcatctc
10201 tcatcttttg tcctccttag cacaatgtaa aaaagaatag taatatcaga acaggaagga
10261 ggaatggctt gctggggagc ccatccagga cactgggagc acatagagat tcacccatgt
10321 ttgttgaact tagagtcatt ctcatgcttt tctttataat tcacacatat atgcagagaa
10381 gatatgttct tgttaacatt gtatacaaca tagccccaaa tatagtaaga tctatactag
10441 ataatcctag atgaaatgtt agagatgcta tatgatacaa ctgtggccat gactgaggaa
10501 aggagctcac gcccagagac tgggctgctc tcccggaggc caaacccaag aaggtctggc
10561 aaagtcaggc tcagggagac tctgccctgc tgcagacctc ggtgtggaca cacgctgcat
10621 agagctctcc ttgaaaacag aggggtctca agacattctg cctacctatt agcttttctt
10681 tattttttta actttttggg gggaaaagta tttttgagaa gtttgtcttg caatgtattt
10741 ataaatagta aataaagttt ttaccattaa aaaaatatct ttccctttgt tattgaccat
10801 ctctgggctt tgtatcacta attattttat tttattatat aataattatt ttattaaaat
10861 gttccctgct ttccctttta gcaa
//
LOCUS PINCABII2 583 bp ss-mRNA PLN 09-AUG-1990
DEFINITION Pinus sylvestris cab II/2 mRNA for chlorophyll a/b-binding protein.
ACCESSION M37489 X14507
KEYWORDS Cab gene; chlorophyll a/b-binding protein; thylakoid protein.
SOURCE P.sylvestris cotyledones cDNA to mRNA, clone pINE ab 11.
ORGANISM Pinus sylvestris
Eukaryota; Plantae; Embryobionta; Pinophyta; Pinicae; Pinatae;
Pinaceae.
REFERENCE 1 (bases 1 to 583)
AUTHORS Jansson,S.
TITLE ;
JOURNAL Unpublished (1989) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 583)
AUTHORS Jansson,S. and Gustafsson,P.
TITLE Type I and type II genes for the chlorophyll a/b-binding protein in
the gymnosperm Pinus sylvestris (Scots pine): cDNA cloning and
sequence analysis
JOURNAL Plant Mol. Biol. 14, 287-296 (1990)
STANDARD simple automatic
COMMENT [1] Author address Jansson,S.
Plant Physiology
Umea University
S-901 87 Umea
Sweden
FEATURES from to/span description
pept < 1 455 chlorophyll a/b-binding protein
BASE COUNT 135 a 133 c 171 g 144 t
ORIGIN
1 cggagctgtt ggttaaaaac ggggtgaaat ttggggaagc tgtgtggttc aaggccgggg
61 cgcagatatt ctcagaggga ggccttgact acctggggaa ccccaacctg atccacgcgc
121 agagcattct agccatctgg gcctgccagg ttgttctcat gggattgatt gaaggataca
181 gagtgggagg aggacccctt ggagaagggt tggaccctct gtacccaggg gatgccttcg
241 acccactggg gctggccgac gaccccgagg ccaaggcgga gctgaaggtg aaggagatta
301 agaacggtcg gctggccatg ttctccatgt tcggtttctt cgttcaggca atcgtgaccg
361 ggaagggccc cattgaaaat ctctacgacc acttggcgga ccccgttgcc aacaatgcct
421 gggcctacgc caccaatttc gttcctggca agtgaaggtg acggaaaata aaagaggcct
481 gtgatctgtg catcaatcat ttgacagcct tagtgttaat aaaatatgtt ctttcagctg
541 tatgtatttg ttggtgatct tcgttaataa aatattttct ttc
//
LOCUS RATMHCIAB 1563 bp ss-mRNA ROD 09-AUG-1990
DEFINITION Rat MHC class I cell surface antigen mRNA.
ACCESSION M25319
KEYWORDS antigen; cell surface antigen; class I gene; glycoprotein;
histocompatibility antigen; major histocompatibility complex.
SOURCE Rat cDNA to mRNA, clone pARI.5.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1563)
AUTHORS Radojcic,A., Stranick,K.S., Locker,J., Kunz,H.W. and Gill,T.J.III.
TITLE Nucleotide sequence of a rat class I cDNA clone
JOURNAL Immunogenetics 29, 134-137 (1989)
STANDARD full staff_entry
COMMENT Draft entry and sequence in computer readable form for [1] kindly
provided by J.J.Rushton 24-JAN-1990.
FEATURES from to/span description
pept < 1 1134 MHC class I antigen (AA at 1)
sigp < 1 60 MHC class I antigen signal peptide
site 61 330 alpha-1 domain (exon 2)
site 381 606 alpha-2 domain (exon 3)
site 607 882 alpha-3 domain (exon 4)
site 883 1131 transmembrane and cytoplasmic domains (exons
5, 6, 7, and 8)
signal 1535 1541 poly-A signal
BASE COUNT 324 a 412 c 471 g 356 t
ORIGIN
1 gcaccgcgca cgctgctcct gctgttggcg gccgccctgg ccccgaccca gattcacgcg
61 ggctcacact cgctgcggta tttcgacatc accgtgtccc ggcccggcct cggggagccc
121 cggttcatct ctgtcggcta cgtggacgac acggagttcg tgcgctacga cagcgacgca
181 gagaatccga gattcaagcc gcgggtccgg tggatggagc gggaggggcc ggagtattgg
241 gagcggatca cacggatcgc caaggaaagc gagcagattt accgagtggg cctgaggacc
301 ctgcgcggtt actacaacca gagcgagggc ggctctcaca ccatccagag attgtctggc
361 tgtgaggtgg ggtcggacgg gatcctcctc cgcgggtatg agcagttcgc ctacgacggc
421 cgcgattaca tcgccctgaa cgaagacctg aaaacgtggg cggcggcgga ctttgcagca
481 gggatcaccc ggaacaagtt ggagcgggat ggtgaggcag agagactcag ggcctacctt
541 gaaggcggga gcgtggagtg gctccgcaga tacttggagc tcaggaagga gacgctgctg
601 cgctcagaac ccccaaaggc acatgtgacc cttcactcca gacctgaagg tgatgtgacc
661 ctgaggtgct gggccttggg cttctaccct gctgacatat tcctgacctg gctgttgaat
721 ggggaggacc tgacccagga catggaactt gtggagacca ggcctgcagg ggatggaacc
781 ttccagaagt gggcatctgt ggtggtgcct cttgggaagg agcagaatta cacatgccat
841 gtggagcatg aggggctgcc tgagccgctc accctgagat gggagggtcc tccctccgcc
901 aactccaaca cgggaatgtc tgttattctt ggaactgtgg ccatcattgc agttatggcc
961 atcattgcag ctgtggcctt cattggacct gttgtgagga agaggtggat aaaaacagct
1021 tttcttctca caagtggaaa aggaggagac tacacccctg ctccaggcag ggacagctcc
1081 cagagctctg atgtgtctct cccagattgt aaagccatga agacagctgc ttgaggtgaa
1141 ctggatgccg gccgatgtgt tcaggtctct cttgtgacat ccggagccct cggttctctt
1201 tggacaccga tgcctgggat tccctatgat cctatgactt cggtataggg gactatggga
1261 cccggcccaa ccctacacac cgggacccta tccctgcact gtttgtgttt cctttcacag
1321 ccaaccttgc tggttcagcc tgggttgggg cctggacatc tgcatcctat cactcagtgg
1381 tgctttgaac tgcaactcct cacttctaca ctgagaataa gaatctgagt gtgaacttga
1441 ctgttcacat ccttgacaca gtgttgactg ctttttaaat tactggattg agaatactta
1501 gaggttgttt tttgtttttg ttttgttttg ttttaaataa atggcaggtg gagaagcttc
1561 cag
//
LOCUS HUMINT01 42 bp ss-mRNA PRI 09-AUG-1990
DEFINITION Human leukocyte adhesion glycoprotein p150,95 mRNA, exon 1.
ACCESSION M29165 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 1 of 7
SOURCE Human cell line HL-60, cDNA to mRNA, clone lambda-X47.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 42)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE cDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD simple automatic
FEATURES from to/span description
mRNA < 1 > 42 P150,95 mRNA, exon 1
BASE COUNT 9 a 15 c 10 g 8 t
ORIGIN 1 bp upstream of EcoRI site; chromosome 16p11-13.1.
1 gaattcctgc cactcttcct gcaacggccc aggagctcag ag
//
LOCUS HUMINT02 3690 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
2 - 6.
ACCESSION M29482 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 2 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3690, exons only)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3690; exons and intron/exon boundaries only)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 2782-2788 (1990)
STANDARD full staff_entry
REFERENCE 3 (bases 1 to 3690; exons and intron/exon boundaries)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 12750-12752 (1990)
STANDARD full staff_entry
REFERENCE 4 (bases 1 to 3690; exons and intron/exon boundaries; revises [3])
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept 1028 1064 integrin alpha subunit precursor, exon 2
(first expressed exon)
1666 1771 integrin alpha subunit precursor, exon 3
2391 2494 integrin alpha subunit precursor, exon 4
2795 2865 integrin alpha subunit precursor, exon 5
3020 + 3131 integrin alpha subunit precursor, exon 6
sigp 1028 1064 integrin alpha subunit signal peptide
1666 1685 integrin alpha subunit signal peptide
matp 1686 1771 integrin alpha subunit
2391 2494 integrin alpha subunit
2795 2865 integrin alpha subunit
3020 + 3131 integrin alpha subunit
pre-msg < 1 > 3690 P150,95 mRNA and introns
IVS < 1 975 P150,95 intron A
IVS 1065 1665 P150,95 intron B
IVS 1772 2390 P150,95 intron C
IVS 2495 2794 P150,95 intron D
IVS 2866 3019 P150,95 intron E
IVS 3132 > 3690 P150,95 intron F
BASE COUNT 764 a 1050 c 1020 g 856 t
ORIGIN 1 bp upstream of BamHI site; chromosome 16p11-13.1.
1 ggatcccttg ggcccaggag ttcgaagcag cagtgaacta tgcacccact gcactccagc
61 ctgggtggca gagcaagacc ctgtttctga aattaaaaaa aaaaattgat gtacattagg
121 gggcttccac ggcctgagct gcttcccctt gctttcctcc cagtggccct gaccttgtct
181 cttacaactt cccaccctga ctgtctggtt acccattgct gatttcacac acagaccctc
241 ctgtaccctg cctcatccat gtctggctgc tctgtcatct cccaactttg gttgctttca
301 atgctcagct caagcaccac ctctttcagg aagccttctc agaaagccac accttcacaa
361 cccgggtgag gcaccctgtg gtctctgtgc ttccccctca cagcaatgaa cttgctgttt
421 atacatctgc ctctccactg accccagggc tggtgctttg tggtttatat tttcttcccc
481 acctagcaga gggcttgcat ctccaggctc aaattaggct tcttgaataa atgatgaata
541 aatgagtgaa tgaatgaatg aacaaatact cgctctgtgc tcctcctagg gacccggatc
601 ccccactcct tggcccagac tttccaggtc agagtggagg cctcccacca gggtttcctt
661 taggggtcct gaggggtggg catctgccca aaccccctcc agtctggctg aaatttcaag
721 gtcaaggggt ccttctggca gtcaagggtg agcctgggag gggcagggca gggatttgca
781 tccatctaag caaagggcat caagccaagt catctgatga gagtgactcc ggttgggggg
841 tgggggcgtg tgggagccga gcctgtcctc ggatcagttg cgtactctgc ccgccccctc
901 tgactcatgc tgacaatctt cttccttccc ctggccacct ctctgcccac ttgcttcctc
961 agtaccttgg tccagctctt cctgcaacgg cccaggagct cagagctcca catctgacct
1021 tctagtcatg accaggacca gggcagcact cctcctgttc acaggtgagc ctggacccca
1081 atgaagtagg gctggggacc caggcccaag ggagccaggg ccctgaactg ggggctcagg
1141 ctggggggtt aggatctggg taggaagaga gactcagtca agcctgaggg ggaggcaggc
1201 acatagggtt tgagatttgg agtttgtgga gggagaggat attgatgaac caattttggg
1261 agagttccag agatgctgga agagaggcca gttgtctctg tactgcagag atttttaaaa
1321 taggcagaat gcgccaactt gtgctctgtg gacaggatgc tttggtccgc aagttttcct
1381 ggacgcactc tcatagcgcc cgaggtgcac gttggggaaa gatccttttt agagcctggg
1441 tactgctctg cagaaatgga gaactgcaac tcgatagtgg atggtgggca aggggcatcc
1501 ctggaccctg ggaaggagag aaggggatga gttgggtgtc cagaagaccc aggcaccccg
1561 ggcatcaggc tcggagggga gattgggacg ctggggccgg gggtggaggg cagccaggca
1621 gaaggaagac ccttctccaa agctctcttc ccacctcttt cccagcctta gcaacttctc
1681 taggtttcaa cttggacaca gaggagctga cagccttccg tgtggacagc gctgggtttg
1741 gagacagcgt ggtccagtat gccaactcct ggtgaggccc aggtggtgct cctttggctc
1801 catccatcct ctccctgctc aggccccatc cccccggccc tgccctgtta tttgcaaact
1861 ctcctctctg tctggtgtag cgactgccct ggctaatgaa gatttgcctt gaaggcaggc
1921 acggtctcac agctaacatt tacagagcag taagtgcagt gccaggctca tcacaggtgg
1981 atgctgattt agtccacacg acagcctgtg agtaggaatc agtcgtgcaa caaacactta
2041 tttgtttttt ctttcttttt ttctatacat ttaaaaatat atagagacag ggtctcacta
2101 tgttgcctgg gttggtctca aactcctggg ctcaagcaat cctcccgcct cagcctccca
2161 aagtgctggg attccaggtg tgagccacca cacccagact caacaaatat ttcttgtctc
2221 catacgccag agaatccaac agacagaaat cccttccaca tggactttaa attattaaaa
2281 tccatcttgc agatgaggaa gctgaggctc agggagggaa cgcaaacttg ccggagtggc
2341 agctgtcggc gtccacactc ttacctaaag tgttctttgt ctcctcgcag ggtggtggtt
2401 ggagcccccc aaaagataac agctgccaac caaacgggtg gcctctacca gtgtggctac
2461 agcactggtg cctgtgagcc catcggcctg cagggtgagt caccgcccct cccgggaccc
2521 agggccgggc tcccaggctt ccctgctcca ggggcccgtg gactcccgga gtgtcacttt
2581 cagcttccct gtgtctgaga ccctcaccct cagatatgct tcctggcccc ttaaggcctc
2641 cccgcccatc gcactcccgc agctctgtca agacccgaca gcttccttca ccgtcagacc
2701 tccttgtctc ccaggtggag gtgacccctg cccagctctt ccacagcctt ctctgtaggg
2761 cccgagagtg accatgcaca tatctgtccc acagtgcccc cggaggccgt gaacatgtcc
2821 ctgggcctgt ccctggcgtc taccaccagc ccttcccagc tgctggtgag tggccctggg
2881 tcacaggagg cttctgaggg agggagggag gagccggggc cgccgggggc tgggactctc
2941 ctgtagggtg gaggttccgg catctgaggg tgggaggtac atgccaggga gtgcccccag
3001 cagcccgctg tgtccccagg cctgcggccc caccgtgcac cacgagtgcg ggaggaacat
3061 gtacctcacc ggactctgct tcctcctggg ccccacccag ctcacccaga ggctcccggt
3121 gtccaggcag ggtgagtgtc gggaccacca aggctttgag gagctcacgc acatccaatt
3181 gggggtgcgg tgggctagag acagtcttgc cagagtggat cagaaagaag ggatctggaa
3241 aaagagttac ctcgtgttgc agtggttcct gacgctgctg cccgcacatc ctgccgatcg
3301 ccgcacgctg ccggaccttt cctgtgacct taacctctcc aagcctcagt ttcttcatct
3361 gttggatggg gataataaca cacccagcac tgaaagcaac acaggatgat tcatggccag
3421 gggttagcac agcagctagc accaggcgac acccatgccg gccagctgtt gttattttta
3481 gaggagagga ctattttcat ccaatgggtc ctgggatatg accaattggt ttgtgccgta
3541 gtttaggaaa ggtcagtgaa agtgcagtgt gagcaacgtg tgtgtgtaca tgtgtgtata
3601 tgtatgcatg tgtatacatg tgcacatgca catgtacatg catgtgtgtg catgtatgtg
3661 tgtgtgtgca tgtgcatgca ggttgagacg
//
LOCUS HUMINT03 4863 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
7 - 15.
ACCESSION M29483 Y00093
KEYWORDS integrin; protein p150,95.
SEGMENT 3 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 294 to 3967; exons only)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 4863)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 2782-2788 (1990)
STANDARD full staff_entry
REFERENCE 3 (bases 1 to 4863; exons and intron/exon boundaries)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 12750-12751 (1990)
STANDARD full staff_entry
REFERENCE 4 (bases 1 to 4863; exons and intron/exon boundaries; revises [3])
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 294 424 integrin alpha subunit precursor, exon 7
511 656 integrin alpha subunit precursor, exon 8
901 1054 integrin alpha subunit precursor, exon 9
1659 1809 integrin alpha subunit precursor, exon 10
2428 2501 integrin alpha subunit precursor, exon 11
2665 2794 integrin alpha subunit precursor, exon 12
3207 3349 integrin alpha subunit precursor, exon 13
3531 3671 integrin alpha subunit precursor, exon 14
3760 + 3969 integrin alpha subunit precursor, exon 15
matp + 294 424 integrin alpha subunit
511 656 integrin alpha subunit
901 1054 integrin alpha subunit
1659 1809 integrin alpha subunit
2428 2501 integrin alpha subunit
2665 2794 integrin alpha subunit
3207 3349 integrin alpha subunit
3531 3671 integrin alpha subunit
3760 + 3969 integrin alpha subunit
pre-msg < 1 > 4861 P150,95 mRNA and introns
IVS < 1 293 P150,95 intron F
IVS 425 510 P150,95 intron G
IVS 657 900 P150,95 intron H
IVS 1055 1658 P150,95 intron I
IVS 1810 2427 P150,95 intron J
IVS 2502 2664 P150,95 intron K
IVS 2795 3206 P150,95 intron L
IVS 3350 3530 P150,95 intron M
IVS 3672 3759 P150,95 intron N
IVS 3970 > 4863 P150,95 intron O
BASE COUNT 947 a 1358 c 1460 g 1097 t 1 others
ORIGIN Chromosome 16p11-13.1.
1 acctgtgatc gccccctcgc ctcccaaagt actgggatta cacggtgagc caccacgcct
61 ggctcaatca cagcctcttt aggcaacttt aagagaatga agggccttgt tccaggcaag
121 gggttaggga acgtctgccc ctgatgagga gaggacccag ggtgtggagc ctgactccca
181 tcgccagact aggggcttag ggaggaaggg ttttggagag tgagctcttg caggagccac
241 ggtcctggac tccaggagtg tcacttggag gacggtgcca cctccttccc cagagtgccc
301 aagacaggag caggacattg tgttcctgat cgatggctca ggcagcatct cctcccgcaa
361 ctttgccacg atgatgaact tcgtgagagc tgtgataagc cagttccaga gacccagcac
421 ccaggtgtgc tttgggggag ggaggctgct gggggtgggt gcttggatcc tggtgatagg
481 cctcagccca gccctgtgtg cttctcccag ttttccctga tgcagttctc caacaaattc
541 caaacacact tgactttcga ggaattcagg cgcacgtcaa accccctcag cctgttggct
601 tctgttcacc agctgcaagg gtttacatac acggccaccg ccatccaaaa tgtcgtgtga
661 gtcctgattt cttccaggca cagtcccaaa gcacccaggt cttcccttgg cctcatctga
721 tctccacgag aaggggacag gcagggacca aaatccagcc cgtgataccc ttgccaagct
781 ggggcctctg ggtgggactg gggcctccca aaggaaaagg catcttctaa ttttcacaag
841 ggcaccaggg gctagtgtgg tttggttcac aggcctctaa gacctctcct ttcctgatag
901 gcaccgattg ttccatgcct catatggggc ccgtagggat gccaccaaaa ttctcattgt
961 catcactgat gggaagaaag aaggcgacac gctggattat aaggatgtca tccccatggc
1021 tgatgcagca ggcatcatcc gctatgcaat tggggtaggc ctgggatggc ttcccacttc
1081 tcccacggct tcctctcagg gcaactcccc tttctgtgta tgatgttctt ttctctttga
1141 gacagggtct tgctctatca cccaggaagt ggtgcaatcc tagctcactg cagccttgaa
1201 ctcctgggct ccagtgatcc tcccaccccg cctcccagta gtcgggacca caggtgtgtg
1261 ccatcaagcc tggctatttt ctttttggtt gagatggggt cttgctatgt tgcccaggct
1321 ggtctcaaat tcctggcctt aagcaattct gccaccttgg tctcccaaag gcacagggga
1381 ttacaggcgt gaaccaccgc caacaacatc cctttcaagg atagaaacac cagctctctc
1441 ggctcttact gccttaagga tgaaaactct gccccagact ggagaccatg atgatccttt
1501 ctcctaaact ccctgatgct gtccgggctt cgtgtttctc ctgtgtccac cgggtgtgat
1561 catgttgatc ttgtggggtt attggaagat gttgcaccca gtgcacacag gcacatttga
1621 tttattattt ttactgagtt gatcttttct ggggacaggt tggattagct tttcaaaaca
1681 gaaattcttg gaaagaatta aatgacattg catcgaagcc ctcccaggaa cacatattta
1741 aagtggagga ctttgatgct ctgaaagata ttcaaaccca actgagggag aagatctttc
1801 ccattgaggg tgagtctgaa gggagctctt cgcttgggga atcctcagcc gttaacacct
1861 ttccacttag aacccgaggc tccgtgaaac aggtagacag cgtctcggtt ctcctgcttt
1921 cccgggaccc cgatagccat gtctgtcagc ttgtccccac tgacgtcccc cagcactgtc
1981 agagctgccc caaagtggcc ccagggatgg ccctgctccc cacagagagt gatctcacac
2041 caccaccggc tccactgcag aacaaaagca gtccaggccc aacccaggag acccttccac
2101 ccacaccggg ccctacccag cccacatccc accagccact cactcccctg ggcaaggggc
2161 acacggacac ctggccccct cggtctgctt gtagacctgt ggggggccct gatgaggacc
2221 agatcggtgc tgccatcgct gtccacatcc atggagcaga ggggggcccc gaagtcggag
2281 ctgatctgga ggcagagcct ggtccctgtc acaggcacca gctctccctg tagcctccag
2341 tcttagcttc tcctaaagct gaagtgttct tggacctggc aaagcccgtc tccctccctg
2401 gcactcaagc gtcatgcctt accccaggta cggagaccac aagcagtagc tccttcgaat
2461 tggagatggc acaggagggc ttcagcgctg tgttcacacc tgtgcgtggg gccccttagg
2521 ccgatgatgt gccgtgaggg gagggggggc agggaaggcc agggtgggtg tcaggtgggt
2581 aagaggcgca aggcggaagg catatctctg gtcatgctgt cttcctgctc tcggctctgc
2641 tcagccctgg aatcctttct ccaggatggc cccgttctgg gggctgtggg gagcttcacc
2701 tggtctggag gtgccttcct gtacccccca aatatgagcc ctaccttcat caacatgtct
2761 caggagaatg tggacatgag ggactcttac ctgggtgaga aacagccagg ggttggggac
2821 aggtgggaga tgcactgccc agggtggggt ccagggttct ggggaagggg taggggnatg
2881 ggggctgtgc tgcccagtgt ggggcccagc ttctggggag ggaggatggg cactgtgctg
2941 cccggggtgg gttccagggt tctggggagg gggaatgggg gctgtgctgc ctggggtggg
3001 aatccagggt tctggggaga ggggatgggc gctgtgctgc ctggggtggg ttccagggtt
3061 ctggggagag aggatggggg ctgcattgcc cagggtgggg tccagggttc tggggagggg
3121 agatggtgct gtgctgcccg gggtgggaat ccagggttct ggggaggggg aatgggggcc
3181 tttgtgctga ggcctgggcc cctcaggtta ctccaccgag ctggccctct ggaaaggggt
3241 gcagagcctg gtcctggggg ccccccgcta ccagcacacc gggaaggctg tcatcttcac
3301 ccaggtgtcc aggcaatgga ggatgaaggc cgaagtcacg gggactcagg ttgggcgtga
3361 caggagccac aggccgggaa ttcagggtag gggaggtggc tgggcagaga agaggatgga
3421 ggggctttga gggccttggg ggaggtcctg gtacctgggg agaggtggga cctggcccac
3481 agggctgcct ctggcaggga caggcagcat gacccagctc tgcccttcag atcggctcct
3541 acttcgggcc ctccctctgc tccgtggacg tagacagcga cggcagcacc gacctggtcc
3601 tcatcgggcc cccccattac tacgagcaga cccgaggggc ccaggtgtct gtgtgtccct
3661 tgcccagggg ggtgagtggc tgatgggcct ggtgtgtgtg gggtctggtg tgggtgaggg
3721 gttgcccggg ttgggcctgg cactgttttt tttctgcagt ggagaaggtg gtggtgtgat
3781 gctgttctct acggggagca gggccacccc tggggtcgct ttggggcggc tctgacagtg
3841 ctgggggatg tgaatgggga caagctgaca gacgtggtca tcggggcccc aggagaggag
3901 gagaaccggg gtgctgtcta cctgtttcac ggagtcttgg gacccagcat cagcccctcc
3961 cacagccagg tgaggccgtg tcccatttct gtcactagag cagcctgctt cttgcctctc
4021 ccactctgtc atactggaaa actgtccctt tttacctttt cctacctccc ttgcccagct
4081 ctgagcacct tgtagcagtg gcgtggtctc agctcactgc aacctccgcc tcccaggttc
4141 aagcgattct ctctgcctca gcctccagag tagctgggat tacaggcatg caccaccatg
4201 tccggttatt ttttgtattt tagtagagac acgtttcgcc atgttggcta ggctggtctt
4261 gaactcctga cctcaggtga tctgcctgtc tcggcctccc aaagtgctgg gattataggc
4321 gtgagccgcc atgcccaggc ccctgccagt tttacaaggt acacaggtca ggcacagaaa
4381 acccatttta cagatggaat ctgggacact aggaagacaa gggccttggt ttgttggagg
4441 ttcagagtgg gtccgagatg gtgaaggaac tccggcctcc tgacctctaa cccggtgtgc
4501 agtctcccgg ctccctgctg ctcaccactt aggtccagtc atttcaacct ccctccacct
4561 gcccctctcc tccctggatg ctacatgatt ttattccctt cctgccatca aggtcccacc
4621 aaatgcccat ccctgcagcc tccctccacc ccaagggtag cagggttccc tgagaacgaa
4681 gggctgcctt tcttggcaaa agtcaagaaa gctctgttaa aaaataggca aagggcctgc
4741 tccctggtgg ctcacatctg taattccgac actttgggag gctgaggcag gaggatcact
4801 tgaggccagg agtttcaggc cagctgggca acataggggg accccatctc tagaaaaaat
4861 ttt
//
LOCUS HUMINT04 2746 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
16 - 21.
ACCESSION M29484 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 4 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 382 to 2672; exons only)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2746)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 2782-2788 (1990)
STANDARD full staff_entry
REFERENCE 3 (bases 1 to 2746; exons and intron/exon boundaries)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 12750-12751 (1990)
STANDARD full staff_entry
REFERENCE 4 (bases 1 to 2746; exons and intron/exon boundaries; revises [3])
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 382 512 integrin alpha subunit precursor, exon 16
628 791 integrin alpha subunit precursor, exon 17
924 1078 integrin alpha subunit precursor, exon 18
1656 1787 integrin alpha subunit precursor, exon 19
1875 1948 integrin alpha subunit precursor, exon 20
2531 + 2672 integrin alpha subunit
matp + 382 512 integrin alpha subunit
628 791 integrin alpha subunit
924 1078 integrin alpha subunit
1656 1787 integrin alpha subunit
1875 1948 integrin alpha subunit
2531 + 2672 integrin alpha subunit
pre-msg < 1 > 2746 P150,95 mRNA and introns
IVS < 1 381 P150,95 intron O
IVS 513 627 P150,95 intron P (no splice consensus)
IVS 792 923 P150,95 intron Q
IVS 1079 1655 P150,95 intron R
IVS 1788 1874 P150,95 intron S
IVS 1949 2530 P150,95 intron T
IVS 2673 > 2746 P150,95 intron U
BASE COUNT 602 a 821 c 746 g 577 t
ORIGIN Chromosome 16p11-13.1.
1 gaattcctat cctgagcatg gctaaactct gagctaatag tatcattata gaaagatgag
61 gaaacggagg cacagacaga ttgagtcctt gcccacggcc tcgtggctca tacgtggagg
121 agtcagaatt ggaactagag actgatcgaa tgaatgacac tcgggtcacc aggacacctt
181 cctatctcca ctcttacatc tgtttcttag caatcatctc ccaactccta cctcctcttt
241 tcaggttctt cttggtgaca tctgttacaa ctcacccctt ctctcccttt ccgatggtcc
301 tacctccata ttccccttgt tacttatttc caacttcttc cctagtttcc atcttgattc
361 acccttctct cctctggcca gcggatcgcg ggctcccagc tctcctccag gctgcagtat
421 tttgggcagg cactgagcgg gggtcaagac ctcacccagg atggactggt ggacctggct
481 gtgggggccc ggggccaggt gctcctgctc aggtgagagc agactttctc agaggctccc
541 catgtggtcc taggttcaga tgggggtgcc cacccacgtg gtgctcccac cagcgacggc
601 tgtcctcagc tcggtgctct gcccgcagac cagacctgtg ctctgggtgg gggtgagcat
661 gcagttcata cctgccgaga tccccaggtc tgcgtttgag tgtcgggagc aggtggtctc
721 tgagcagacc ctggtacagt ccaacatctg cctttacatt gacaaacgtt ctaagaacct
781 gcttgggagc cgtgagtccc ctcccctcca acccaggaca ccctgacctc tggagtcccc
841 catcccaggc ccctgtctcc caccctgctc attgtccacc caaggagttc ctgtctcaac
901 gccgtccctg cgaccgccta caggtgacct ccaaagctct gtgaccttgg acctggccct
961 cgaccctggc cgcctgagtc cccgtgccac cttccaggaa acaaagaacc ggagtctgag
1021 ccgagtccga gtcctcgggc tgaaggcaca ctgtgaaaac ttcaacctgc tgctcccggt
1081 gcgtctgggc atgaacgtgg gtggcggccg cgctggggct ggcagaaggc agggcaggga
1141 gagaacaggc tgtgttccgg cctccctgtg gctcagccca gcacaggacc agccatgcag
1201 gacgtgctta ctgcacgtta gccagtgagt gagtgagcga gcaaacaagt gatgagatcg
1261 tctgcaattt ccagggccac acgattggat ttcaggaaag agaattgggc aacctgagag
1321 agctctgggc ttaccttctg gcttttcagg cattcactga cagggttatc gagctgctcc
1381 tggagacagc cttgcctggg ccatgggcat aggtggccaa aacagtcatt gctgatcggg
1441 aggtctgggg gggggaggaa aaaaacaaag acaaacaagg ggagaggaca gagagggtgt
1501 cagggaggca tcctgaaggc ggtgacgctg agcaggctct ggaggaagtg aagcagagcg
1561 ggagctgggc agaggcagga taagaactgc ggatgaggcc gagcgcagct cttaccctcc
1621 ccttaccctc gctccccgcg acgcccgtcc cccagagctg cgtggaggac tctgtgaccc
1681 ccattacctt gcgtctgaac ttcacgctgg tgggcaagcc cctccttgcc ttcagaaacc
1741 tgcggcctat gctggccgcc gatgctcaga gatacttcac ggcctccgtg agtcctggca
1801 ctgggtctcc cagagagggt gcacagcgtg gggcctgggt ctcggagaaa accccccgtt
1861 gccttcccac gcagctaccc tttgagaaga actgtggagc cgaccatatc tgccaggaca
1921 atctcggcat ctccttcagc ttcccagggt gagcgcccca ccttagatgc cctactgccc
1981 cagcctcctt cctggaatct gggactcctg cctctgctct ccctaacatt gtctcatcct
2041 atagtcaaaa cccaggtgtc ttggctgggc acagtggctc actcctgtaa tccagcactt
2101 tgggaggccg aggtgggagg acttttgagg ccaggagtta gggttacgac ctgggcaaca
2161 gagcgacacc catttccaca aaaacaaaac aacaacaaca acaacaacaa caacaacaac
2221 aacaacatca cttgagtgtg gtagagcatg cctatagtcc cagctacttg ggaggctgaa
2281 gcttaaggct tgcttgagct ctggagttgg aggtctgcag tgagccataa tcacaccact
2341 gcactccagc ctgggtgaaa gagcaggact ctgtctctta aaaaaaaaga agaagaagaa
2401 gaagaagaag aagaacccag gggtccgtcc cctgtctatc tcccaaatcc ccacccaccc
2461 cattttatcc cagaccattt ctagcctcag tcacagaatc atcttatcct ttccttcacc
2521 tgatacccag cttgaagtcc ctgctggtgg ggagtaacct ggagctgaac gcagaagtga
2581 tggtgtggaa tgacggggaa gactcctacg gaaccaccat caccttctcc caccccgcag
2641 gactgtccta ccgctacgtg gcagagggcc aggtgcacct ctggggaagg aggaggaggc
2701 agggctgggc gttagcgtag attcccgtgc gggttcagaa cccggg
//
LOCUS HUMINT05 1006 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
22 - 24.
ACCESSION M29485 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 5 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 202 to 665)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1006)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 2782-2788 (1990)
STANDARD full staff_entry
REFERENCE 3 (bases 1 to 1006; exons and intron/exon boundaries)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 12750-12751 (1990)
STANDARD full staff_entry
REFERENCE 4 (bases 1 to 1006; exons and intron/exon boundaries; revises [3])
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 203 319 integrin alpha subunit precursor, exon 22
404 483 integrin alpha subunit precursor, exon 23
582 + 665 integrin alpha subunit precursor, exon 24
matp + 203 319 integrin alpha subunit
404 483 integrin alpha subunit
582 + 665 integrin alpha subunit
pre-msg < 1 > 1006 P150,95 mRNA and introns
IVS < 1 202 P150,95 intron U
IVS 320 403 P150,95 intron V
IVS 484 581 P150,95 intron W
IVS 666 > 1006 P150,95 intron X
BASE COUNT 228 a 286 c 229 g 263 t
ORIGIN Chromosome 16p11-13.1.
1 ttctatcctg gtgacagagt gagacctggt ctcaaaacaa acaaacaaac aaaatataag
61 cttaaggtgg gctccaggaa gctttatcac tacttcgtgg cgtgtctttg gaatgctgtt
121 atattaggtt ggtgcaaaag taattgggtt tttgccattg ctttcaattt caactaatac
181 tcctctactt tctcatgcct agaaacaagg gcagctgcgt tccctgcacc tgacatgtga
241 cagcgcccca gttgggagcc agggcacctg gagcaccagc tgcagaatca accacctcat
301 cttccgtggc ggcgcccagg tcagcctggc ttctgtcccc tcactgctcc cctgccccac
361 cctgtcttta ctgctctgtg acctctcagt tccttttcct cagatcacct tcttggctac
421 ctttgacgtc tcccccaagg ctgtcctggg agaccggctg cttctgacag ccaatgtgag
481 caggtgagcc gggccatggc caggggcagt gcctcatctc cagcctcaca ccccattctc
541 ctctggggcc tctggcaact gagtctctcc tctttctcca gtgagaacaa cactcccagg
601 accagcaaga ccaccttcca gctggagctc ccggtgaagt atgctgtcta cactgtggtt
661 agcaggtcac aggtacccac tgcaggaaaa agggttcttc tctctgaccc tcaaaaagaa
721 aaaaaaaaaa aaggccttga aacgctgcca cagagggtga gataaggtgt ttgaaagtaa
781 aaggtcaggt gtttcagaag acaccttcct tcagccaatg ccttcctcga atttgctgtg
841 tgccaggcag ggtgctgtgg ttattttcca tacattcatt tgacattcat tgaagattta
901 ctgagccccc attatgtgtg atcaaaccag acatgaaccc tcgccttgtg ggtgtgcctt
961 gctggatgtc tcctgtgttc cactctcact gcactgcatg ctgagt
//
LOCUS HUMINT06 1904 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
25 - 30.
ACCESSION M29486 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 6 of 7
SOURCE Human DNA, (library pWE15), clone 30.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 86 to 1528; exons only)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1904)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 2782-2788 (1990)
STANDARD full staff_entry
REFERENCE 3 (bases 1 to 1904; exons and intron/exon boundaries)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 12750-12751 (1990)
STANDARD full staff_entry
REFERENCE 4 (bases 1 to 1904; exons and intron/exon boundaries; revises [3])
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 86 164 integrin alpha subunit precursor, exon 25
280 387 integrin alpha subunit precursor, exon 26
505 588 integrin alpha subunit precursor, exon 27
788 901 integrin alpha subunit precursor, exon 28
1044 1145 integrin alpha subunit precursor, exon 29
1417 + 1527 integrin alpha subunit precursor, exon 30
matp + 86 164 integrin alpha subunit
280 387 integrin alpha subunit
505 588 integrin alpha subunit
788 901 integrin alpha subunit
1044 1145 integrin alpha subunit
1417 + 1527 integrin alpha subunit
pre-msg < 1 > 1904 P150,95 mRNA and introns
IVS < 1 85 P150,95 intron X
IVS 165 279 P150,95 intron Y
IVS 388 504 P150,95 intron Z
IVS 589 787 P150,95 intron AA
IVS 902 1043 P150,95 intron AB
IVS 1146 1416 P150,95 intron AC
IVS 1528 > 1903 P150,95 intron AD
BASE COUNT 381 a 545 c 525 g 453 t
ORIGIN Chromosome 16p11-13.1.
1 accacctgtc ctctcatgct ctagccaatg ccttctgcag atgcccatgg tagttcacat
61 ccacttatgc gtcttctctc tccagccacg aacaattcac caaatacctc aacttctcag
121 agtctgagga gaaggaaagc catgtggcca tgcacagata ccaggtcagg tggtggtgta
181 cgcaggaaga ccttgggcat ggggtgggag gctgggtagc cggagactgg ggagggattt
241 ggctttggcg tggctctgcc ctcagtgccc tctgtgcagg tcaataacct gggacagagg
301 gacctgcctg tcagcatcaa cttctgggtg cctgtggagc tgaaccagga ggctgtgtgg
361 atggatgtgg aggtctccct cccccaggta cccaaggact gcatgtggct cctccacgaa
421 tgccctttct acctggattc cttgtgcccc atgtgggtcc ctgatgtccc agctgagaca
481 cttgttctct gcattttccc ccagaaccca tcccttcggt gctcctcaga gaaaatcgcg
541 ggcccagcat ctgacttcct ggcgcacatt cagaagaatc ccgtgctggt gaggagggct
601 ctgggtctgg ccctcactgt aggcccacat cagaggaatt taacccagga gttcatgttc
661 catatccatc ctgctgaagt accctcttgc attcggatat ggccgctgcc ctcaagtcac
721 acgcataatg ctgcctccca ccttcacact catctttctc agccccatgc tatttatctg
781 cccccaggac tgctccattg ctggctgcct gcggttccgc tgtgacgtcc cctccttcag
841 cgtccaggag gagctggatt tcaccctgaa gggcaacctc agctttggct gggtccgcca
901 ggtgtgtggg tgcaacgaca gagcccctgc cccagactca ggcgggacct ggcatgtctg
961 tgcccatctg caagccaggg cacccccaga gctctgagcc tcccccagag ccagttcaac
1021 aggtttcccc cacccctttg cagatattgc agaagaaggt gtcggtcgtg agtgtggctg
1081 aaattacgtt cgacacatcc gtgtactccc agcttccagg acaggaggca tttatgagag
1141 ctcaggtaga gaccatgtgg agggcagcga ccaggcagga aagagggtcc caagggctac
1201 atctgtggtg ctgggtgggg ggtttgcaag ccttggggga ggagggtgaa ggcctctggg
1261 caggatagct gtccctaagg gcacgggtgc tgctgtgtct cacctcttgg agcagggcct
1321 ggggaaggag gggagggagt taaaggttgg ggagcctggg aggagtctgg gatagtagga
1381 ggatgggagt ctctgacagg gtcacttcca cttcagacga caacggtgct ggagaagtac
1441 aaggtccaca accccacccc cctgatcgta ggcagctcca ttgggggtct gttgctgctg
1501 gcactcatca cagcggtact gtacaaagtg agtgttttat gccacccttg acaccaccag
1561 catctggtcc cgctcttttt gcagagtgag aaggagctca ctttgaaggc agaggcacat
1621 tcttactggg tcacttcata tgagaaactg cttcccacct gcaatgtcac cgtgccccag
1681 tggccccctg ctttgtgatt cccaggcttc ctctaatatt tctccctttc tttcctgctc
1741 ttctccatca ttctacgtgt tcctgacagc agattatcat ataaaagcac agacctgggt
1801 tgaatgcgac atcaccacgg gttcttttgt cttgaccata ggccagtgtc tgctccactc
1861 tgggccttga tttccatgtg aggtgatatc acccagctca taga
//
LOCUS HUMINT07 653 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exon
31.
ACCESSION M29487 Y00093
KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT 7 of 7
SOURCE Human DNA, (library pWE15), clone 30.1, and cell line HL-60, cDNA
to mRNA, clone lambda-X47.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 44 to 149)
AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
TITLE CDNA cloning and complete primary structure of the alpha subunit of
a leukocyte adhesion glycoprotein
JOURNAL EMBO J. 6, 4023-4028 (1987)
STANDARD simple automatic
REFERENCE 2 (bases 1 to 653; revises [1])
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 2782-2788 (1990)
STANDARD full staff_entry
REFERENCE 3 (bases 1 to 653; exons and intron/exon boundaries)
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
TITLE Genomic structure of an integrin alpha subunit, the leukocyte
p150,95 molecule
JOURNAL J. Biol. Chem. 265, 12750-12751 (1990)
STANDARD full staff_entry
REFERENCE 4 (bases 1 to 653; exons and intron/exon boundaries; revises [3])
AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES from to/span description
pept + 44 148 integrin alpha subunit precursor, exon 31
matp + 44 145 integrin alpha subunit
pre-msg < 1 > 149 P150,95 mRNA and introns
IVS < 1 85 P150,95 intron AC
BASE COUNT 126 a 195 c 150 g 182 t
ORIGIN Chromosome 16p11-13.1.
1 actgaatggg cttcctgagt ttcttcttcg tcctcccccc taggttggct tcttcaagcg
61 tcagtacaag gaaatgatgg aggaggcaaa tggacaaatt gccccagaaa acgggacaca
121 gacccccagc ccgcccagtg agaaatgatc cctctttgcc ttggacttct tctcccgcga
181 ttttccccac ttacttaccc tcacctgtca ggctgacggg gaggaaccac tgcaccaccg
241 agagaggctg ggatgggcct gcttcctgtc tttgggagaa aacgtcttgc ttgggaaggg
301 gcctttgtct tgtcaaggtt ccaactggaa acccttagga cagggtccct gctgtgttcc
361 ccaaaaggac ttgacttgca atttctacct agaaatacat ggacaatacc cccaggcctc
421 agtctccctt ctcccatgag gcacgaatga tctttctttc ctttcctttt tttttttttt
481 cttttctttt tttttttttt tgagacggag tctcgctctg tcacccaggc tggagtgcaa
541 tggcgtgatc tcggctcgct gcaacctccg cctcccgggt tcaagtaatt ctgctgtctc
601 agcctcctgc gtagctggga ctacaggcac acgccacctc gcccggcccg atc
//
LOCUS PEAHSP177A 772 bp ss-mRNA PLN 09-AUG-1990
DEFINITION Pisum sativum 17.7 kDa heat shock protein (hsp17.7) mRNA,
complete cds.
ACCESSION M33901
KEYWORDS heat shock protein.
SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 772)
AUTHORS Lauzon,L.M., Helm,K. and Vierling,E.
TITLE A cDNA clone from Pisum sativum encoding a low molecular weight
heat shock protein
JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Vierling, 01-MAY-1990.
University of Arizona
Department of Biochemistry Biological Sciences West Building
Tucson, AZ 85721
FEATURES from to/span description
pept 65 523 17.7 kDa heat shock protein (hsp17.7)
BASE COUNT 254 a 127 c 170 g 221 t
ORIGIN
1 caaaaatcaa aacgtgcgac aaacacaaaa tcatcccaca aagaaagcaa tggatttcag
61 gctaatggat ttggattctc cactcttcaa cactctccat catataatgg acctcaccga
121 cgacacaacc gagaagaact taaacgctcc aactcgaaca tatgtccgtg acgcaaaggc
181 aatggctgca actccagcgg acgtgaaaga gcatccaaat tcatacgtgt ttatggtgga
241 catgcctggg gtgaaatctg gtgacataaa ggttcaggtg gaagatgaga atgtgctatt
301 gataagtggc gagaggaaga gagaagaaga gaaagaaggt gttaaatatt tgaagatgga
361 aagaaggatt ggtaagttga tgaggaaatt tgtgttacct gagaatgcga atattgaagc
421 tatctctgct atttctcaag atggtgttct tacggttaca gttaataaat tgcctccacc
481 tgaacctaag aaaccaaaaa ctattcaagt taaggttgct tgatcggtgt acgatttcat
541 gtcaacaaat cagaaggaat gtttgtcttt ttagttggtt tgtgtagcaa tggttttgtg
601 tgttttcgcc tagttggccc tatatatgat gatcatcatg cgatgtaatt tgtaacaata
661 tgacatgaat gaattttaat tacttggttt ttctgcttgt aacattgttg cgttgccccc
721 atgataaaat tgagaaactg aagtattaaa gaaaagaaaa tgtttcattt ac
//
LOCUS PEAHSP179A 700 bp ss-mRNA PLN 09-AUG-1990
DEFINITION Pisum sativum 17.9 kDa heat shock protein (hsp17.9) mRNA,
complete cds.
ACCESSION M33900
KEYWORDS heat shock protein.
SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 700)
AUTHORS Lauzon,L.M., Helm,K. and Vierling,E.
TITLE A cDNA clone from Pisum sativum encoding a low molecular weight
heat shock protein
JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Vierling, 01-MAY-1990.
University of Arizona
Department of Biochemistry Biological Sciences West Building
Tucson, AZ 85721
FEATURES from to/span description
pept < 1 469 17.9 kDa heat shock protein (hsp17.9)
BASE COUNT 209 a 123 c 170 g 198 t
ORIGIN
1 gataattcca agagtcttcg gtactggacg aagaaccaat gcattcgatc cattctcatt
61 agatttatgg gacccattcc agaacttcca actcgcaaga tccgccaccg gaaccaccaa
121 cgagacggca gcttttgcca acgctcacat tgactggaag gaaacaccgg aggctcacgt
181 gttcaaggct gatcttcccg gagtgaagaa ggaagaagtg aaagttgaaa tagaagaaga
241 tcgtgtgctc aagataagcg gagagaggaa aactgaaaag gaagacaaga acgacacctg
301 gcaccgtgtt gagcgtagtc aggggagttt cctccgccgt ttcaggttgc cggaaaatgc
361 taaagttgat caggtgaagg ctgctatgga aaacggtgtt cttaccgtta ctgttcctaa
421 agaggaggtt aagaagcctg aagctaagcc cattcagatt acaggatgag ctcttattct
481 tcctatattt tgatgtttgt gtctcttaat aaaatgttaa aataaaacaa ataataattg
541 tgtgtagtcg agttccagct ttaagagatt gagacatgta tggacttggc tattacttaa
601 gtgtagtagt ttgtgagtat tttgttgggt tatgttagtg tgtatgcaaa taactttttt
661 gagtatgtga aagtttcttt tgattaagct gtatttatcc
//
LOCUS PEAHSP181A 862 bp ss-mRNA PLN 09-AUG-1990
DEFINITION Pisum sativum 18.1 kDa heat shock protein (hsp18.1) mRNA,
complete cds.
ACCESSION M33899
KEYWORDS heat shock protein.
SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 862)
AUTHORS Lauzon,L.M., Helm,K. and Vierling,E.
TITLE A cDNA clone from Pisum sativum encoding a low molecular weight
heat shock protein
JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Vierling, 01-MAY-1990.
University of Arizona
Department of Biochemistry Biological Sciences West Building
Tucson, AZ 85721
FEATURES from to/span description
pept 12 488 18.1 kDa heat shock protein (hsp18.1)
BASE COUNT 278 a 128 c 207 g 249 t
ORIGIN
1 ctatatcaaa catgtctctg attccaagtt tctttagtgg ccgaaggagc aatgttttcg
61 atcctttctc cctggacgtc tgggatcctt tgaaggactt tccattttca aattcttcac
121 cttccgcttc attccctcgt gagaatcctg cttttgtgag cacacgagtt gactggaagg
181 aaacaccgga agcgcatgtt ttcaaggctg atcttcctgg gctgaaaaag gaggaagtga
241 aagttgaagt tgaagatgat agggttctac agataagcgg agagagaagc gttgagaaag
301 aagataagaa tgatgaatgg catcgcgtgg aacgtagcag tggaaagttc ttaagaaggt
361 tcagattgcc tgagaatgct aaaatggata aagtgaaagc ttccatggag aacggcgttc
421 tgacagtgac cgttccaaaa gaagagataa agaaggctga ggttaagtct attgagattt
481 ctggttaaac ttagaatgag ctatgttact ctgttgcttt tcttggttat aatgttttcc
541 tttttgtggc gtgtgcaaga aataaatggt catgtaattc tgaaatgtta atgtataaat
601 aaataagtaa acagttgttg ttggttattc agaggtgtta tagtattcat attgtaatgt
661 atcagaatga atcttgagaa aagagctgct ataaatagag cttgaagttt taaataaaaa
721 aaaaggttcc agaaaggaat aaaaaactgg taacagctag cagagagaaa aagctcaaac
781 cactgtgtta aggtgaacag cggaagaaaa tgaagagatg ttcatagccc ttcttcttga
841 gtctctccaa gatggagaat tc
//
LOCUS PEAHSP227A 795 bp ss-mRNA PLN 09-AUG-1990
DEFINITION Pisum sativum 22.7 kDa heat shock protein (hsp22.7) mRNA,
complete cds.
ACCESSION M33898
KEYWORDS heat shock protein.
SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 795)
AUTHORS Lauzon,L.M., Helm,K. and Vierling,E.
TITLE A cDNA clone from Pisum sativum encoding a low molecular weight
heat shock protein
JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Vierling, 01-MAY-1990.
University of Arizona
Department of Biochemistry Biological Sciences West Building
Tucson, AZ 85721
FEATURES from to/span description
pept 44 637 22.7 kDa heat shock protein (hsp22.7)
BASE COUNT 261 a 122 c 170 g 242 t
ORIGIN
1 ccaagttcca aacctcaaga acaaaaaaca cacatttcta agtatgagtc tgaaacctct
61 aaacatgtta ctcgttccat ttcttctgct tattctcgcg gctgattttc ctttgaaagc
121 aaaagcatca ctactaccat tcatagattc tcccaacact ctcttatcgg atctctggtc
181 tgatcgtttc ccagatccgt ttcgcgtctt agaacaaatt ccctatggag ttgagaaaca
241 cgaaccatcc ataacattgt cacatgctag agtagactgg aaggaaactc cagagggaca
301 tgtgataatg gtggacgtgc ctgggttgaa aaaagatgat ataaagatag aagtggaaga
361 gaatagggtg ctaagagtga gtggtgagag gaagaaagaa gaagataaaa aaggagatca
421 ttggcacaga gttgaaagat cttatggaaa gttctggagg cagtttaaat tacctcaaaa
481 tgttgatttg gattctgtca aagctaaaat ggaaaacggt gttcttactt taactcttca
541 taagttgtcg catgataaga ttaaaggtcc tagaatggtt agtattgtgg aagaggatga
601 caaaccatct aagatcgtca atgatgagtt gaaataatta tgtgatttgt actcataaaa
661 atgaaaaatg ttttttcatt gtgttatttg tgaataaagg aatgttacct atgatattgg
721 ttgtttgttg tatgtcaact aaagagtgct gtaaaggctt gttaatttca tagtgaataa
781 cttgttggct tttgt
//
LOCUS ECOHGRF 140 bp ds-DNA SYN 09-AUG-1990
DEFINITION Synthetic human growth hormone releasing factor (hGRF) gene,
complete cds.
ACCESSION M26106
KEYWORDS growth hormone releasing factor; somatocrinin.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 140)
AUTHORS Cravador,A., Jacobs,P., Van Elsen,A., Lacroix,C., Colau,B.,
Van Alphen,P., Herzog,A. and Bollen,A.
TITLE Total DNA synthesis and cloning in Escherichia coli of a gene
coding for the human growth hormone releasing factor
JOURNAL Biochimie 67, 829-834 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept 2 139 synthetic human growth hormone releasing
factor (hGRF)
BASE COUNT 30 a 35 c 36 g 39 t
ORIGIN
1 catgtacgct gacgctatct tcactaactc ttaccgtaaa gttctgggtc agctgtctgc
61 tcgtaaactg ctgcaggaca tcatgtctcg tgagcagggt gaatctaacc aggaacgtgg
121 tgctcgtgct cgtctgtaag
//
LOCUS HUMACALX 724 bp ss-mRNA PRI 09-AUG-1990
DEFINITION Human calcitonin mRNA, complete cds.
ACCESSION M26095
KEYWORDS calcitonin.
SOURCE Human cell-line BEN, cDNA to mRNA, clone hBEN-JR2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 724)
AUTHORS Craig,R.K., Riley,J.H., Edbrooke,M.R., Broad,P.M., Foord,S.M.,
Al-Kazwini,S.J., Holman,J.J. and Marshall,I.
TITLE Expression and function of the human calcitonin/alpha-CGRP gene in
health and disease
JOURNAL Biochem. Soc. Symp. 52, 91-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 35 460 calcitonin precursor
sigp 35 109 calcitonin signal peptide
matp 287 382 calcitonin
matp 383 457 flanking peptide
BASE COUNT 163 a 195 c 200 g 166 t
ORIGIN
1 ggtgagcccc gagattctgg ctcagagagg tgtcatgggc ttccaaaagt tctccccctt
61 cctggctctc agcatcttgg tcctgttgca ggcaggcagc ctccatgcag caccattcag
121 gtctgccctg gagagcagcc cagcagaccc ggccacgctc agtgaggacg aagcgcgcct
181 cctgctggct gcactggtgc aggactatgt gcagatgaag gccagtgagc tggagcagga
241 gcaagagaga gagggctcca gcctggacag ccccagatct aagcggtgcg gtaatctgag
301 tacttgcatg ctgggcacat acacgcagga cttcaacaag tttcacacgt tcccccaaac
361 tgcaattggg gttggagcac ctggaaagaa aagggatatg tccagcgact tggagagaga
421 ccatcgccct catgttagca tgccccagaa tgccaactaa actcctccct ttccttccta
481 atttcccttc ttgcatcctt cctataactt gatgcatgtg gtttggttcc tctctggtgg
541 ctctttgggc tggtattggt ggctttcctt gtggcagagg atgtctcaaa cttcagatgg
601 gaggaaagag agcaggactc acaggttgga agagaatcac ctgggaaaat accagaaaat
661 gagggccgct ttgagtcccc cagagatgtc atcagagctc ctctgtcctg ctttctgaat
721 gtgc
//
LOCUS HUMCALARP 234 bp ds-DNA PRI 09-AUG-1990
DEFINITION Human calcitonin gene, exon 5.
ACCESSION M26094
KEYWORDS calcitonin.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 234)
AUTHORS Craig,R.K., Riley,J.H., Edbrooke,M.R., Broad,P.M., Foord,S.M.,
Al-Kazwini,S.J., Holman,J.J. and Marshall,I.
TITLE Expression and function of the human calcitonin/alpha-CGRP gene in
health and disease
JOURNAL Biochem. Soc. Symp. 52, 91-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept / 36 195 calcitonin precursor, exon 5 (AA at 37)
matp 54 165 alpha-calcitonin related peptide
matp 166 192 carboxyl-terminal-flanking peptide (PDN-21)
IVS < 1 35 alpha-calcitonin related peptide intron C
BASE COUNT 59 a 63 c 61 g 51 t
ORIGIN
1 cagatcttct cttctttctc catcctgcaa atcagaatca ttgcccagaa gagagcctgt
61 gacactgcca cctgtgtgac tcatcggctg gcaggcttgc tgagcagatc agggggtgtg
121 gtgaagaaca actttgtgcc caccaatgtg ggttccaaag cctttggcag gcgccgcagg
181 gaccttcaag cctgagcagc tgaacgactc aagaaggtca caataaagct gaac
//
LOCUS PIPVGB 1883 bp ds-DNA BCT 09-AUG-1990
DEFINITION Plasmid pIP630 (from S. aureus) virginiamycin B hydrolase (vgb)
gene, complete cds.
ACCESSION M36022
KEYWORDS virginiamycin B hydrolase; virginiamycin-resistance.
SOURCE Plasmid pIP630 (from Staphylococcus aureus) DNA.
ORGANISM Plasmid pIP630
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1883)
AUTHORS Allignet,J., Loncle,V., Mazodier,P. and El Solh,N.
TITLE Nucleotide sequence of a Staphylococcal plasmid gene, vgb, encoding
a hydrolase inactivating the B components of virginiamycin-like
antibiotics
JOURNAL Plasmid 20, 271-275 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 641 1540 virginiamycin B hydrolase
BASE COUNT 641 a 284 c 375 g 583 t
ORIGIN
1 agatctacgg attttcgcca tgccacgaaa ttagcatcat gctagcaagt taaacgaaca
61 ctgacatgat atattagtgg ttagctatat ttttttactt tgcaacagaa ccattattat
121 ggtttcttaa aaaaatacaa tgctttttcg ttccttttta ttcatcttcc aattctttgg
181 catgactgtg tgcattttaa atttgttcag caaatgtgcc gtgtaatgga atacttttta
241 aatactgtgt aatgataatg caaggcacat actaaaagga atcttcgatt ttgttggctt
301 attatttgac ttttcataac aattatctta aggttaaaca aatcaataat cgaaagggtg
361 aaaaaaagca catgatcata taatcctaat tttaaaagaa atcgatattt tggccttggg
421 ttcaatttca aagtggtttt ggaatgaact ctatttgtta tcggcttttt tctgagatag
481 gattaatgta atgtgctttt ttggctttaa aaagaccttt gttatccaaa aagtcttttt
541 aagtgtcctt atccgtgcca cattgcctcc tatctcgaaa aaagagatgg aggctatttt
601 tgttttggaa atttaattta aataaaacgg aggggataga atggaattta aattacaaga
661 attaaatctt actaaccaag atacaggacc atatggtata accgtttcag ataaggggaa
721 agtttggatt acacaacata aagcaaatat gataagttgc atcaatttag atggaaaaat
781 tacagagtac ccactaccga caccagatgc aaaagtcatg tgtttaacta tatcctcaga
841 tggggaagtt tggtttactg agaatgcagc aaacaaaata gggaggatta caaaaaaagg
901 gattattaag gaatatacat tgcctaaccc agattcagca ccctacggta ttacagaagg
961 accaaatgga gatatatggt ttacagaaat gaatggcaac cgtattggac gtattacgga
1021 cgacggtaaa attcgtgaat acgagctgcc taataaagga tcttaccctt cttttatcac
1081 tttgggttct gataatgccc tgtggttcac agaaaatcaa aataatgcta ttggtagaat
1141 tacagaaagt ggggatatta cagagtttaa aattcctaca cctgcatcag gaccagttgg
1201 tattacaaag gggaacgacg atgctttatg gtttgtggaa attatcggta ataagatagg
1261 gcgaataact cctctggggg aaattaccga attcaaaatt ccaacgccaa acgctcgacc
1321 tcatgcaatt actgctggag caggaattga tttatggttt actgaatggg gggctaataa
1381 aataggaagg ctgacaagca ataatataat tgaggaatac ccaattcaaa tcaaaagtgg
1441 tgaaccacat ggcatttgtt tcgatggtga aacaatttgg tttgcaatgg agtgtgacaa
1501 gataggcaaa ttaactctca ttaaggataa tatggagtga gtcttttgaa tttaaacaat
1561 gaccatggac ctgatcccga aaatatttta ccgataaaag ggaatcggaa tcttcaattt
1621 ataaaaccta ctataacgaa cgaaaacatt ttggtggggg aatattctta ttatgatagt
1681 aagcgaggag aatcctttga agatcaagtc ttatatcatt atgaagtgat tggagataag
1741 ttgattatag gaagattttg ttcaattggt cccggaacaa catttattat gaatggtgca
1801 aaccatcgga tggatggatc aacatatcct tttcatctat tcaggatggg ttgggagaag
1861 tatatgcctt ccttaaaaga tct
//
LOCUS ECOLIVHMGF 8703 bp ds-DNA BCT 09-AUG-1990
DEFINITION E.coli leucine-specific transport (LS-BP; LIV-BP) system (livHMGF)
genes, complete cds.
ACCESSION J05516 M13166 M10426 M10427 K02178
KEYWORDS heat shock protein;
high affinity branched-chain amino acid transport system;
htpR gene; isoleucine binding protein; leucine binding protein;
leucine binding protein; livJ gene; livK gene;
valine binding protein.
SOURCE E.coli (K12 strain AE404) isolate W3110 DNA, clone pOX[1,15].
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 90 to 1312)
AUTHORS Landick,R., Vaughn,V., Lau,E.T., VanBogelen,R.A., Erickson,J.W. and
Neidhardt,F.C.
TITLE Nucleotide sequence of the heat shock regulatory gene of E. coli
suggests its protein product may be a transcription factor
JOURNAL Cell 38, 175-182 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1312; revises [1])
AUTHORS Vaughn,V.
JOURNAL Unpublished (1985) Univ Michigan Med School, Ann Arbor MI 48109
STANDARD full staff_review
REFERENCE 3 (bases 1407 to 2507 and 3503 to 4609)
AUTHORS Landick,R. and Oxender,D.L.
TITLE The complete nucleotide sequences of the Escherichia coli LIV-BP
and LS-BP genes: Implications for the mechanism of high-affinity
branched-chain amino acid transport
JOURNAL J. Biol. Chem. 260, 8257-8261 (1985)
STANDARD full staff_review
REFERENCE 4 (bases 4610 to 5696)
AUTHORS Nazos,P.M., Antonucci,T.K., Landick,R. and Oxender,D.L.
TITLE Cloning and characterization of livH, the structural gene encoding
a component of the leucine transport system in Escherichia coli
JOURNAL J. Bacteriol. 166, 565-573 (1986)
STANDARD simple staff_review
REFERENCE 5 (bases 1136 to 8703)
AUTHORS Adams,M.D., Wagner,L.M., Graddis,T.J., Landick,R., Antonucci,T.K.,
Gibson,A.L. and Oxender,D.L.
TITLE Nucleotide sequence and genetic characterization reveal six
essential genes for the LIV-I and LS transport systems of
Escherichia coli
JOURNAL J. Biol. Chem. 265, 11436-11443 (1990)
STANDARD full staff_review
COMMENT Draft entry and sequence in computer readable form for [2] kindly
provided by V.Vaughn, 15-NOV-1985.
Draft entry and computer-readable sequence for [1] kindly submitted
by M.D.Adams 19-APR-1990, for release after publication.
The htpR (also known as "hin") gene product appears homologous to
the sigma factor of RNA polymerase, and the two proteins are
predicted to have similar secondary structures. In addition, two
regions of the predicted htpR product resemble protein-DNA contact
points conserved in known DNA-binding proteins.
The htpR gene encodes a protein, which appears to be identical in
size (33 kd by migration on two dimensional polyacrylamide gel) and
isoelectric point with the protein, F33.4, normally present in
E.coli but deficient in an htpR mutant.
A region homologous to the rpoD gene is located at positions
508-549 [2].
The E.coli LIV-I and LS AA transport systems are high-affinity,
periplasmic, binding protein-dependent systems that utilize the
leucine-, isoleucine-, valine-binding protein (LIV-BP) and
leucine-specific binding protein (LS-BP), respectively. These two
binding proteins interact with a common set of membrane proteins to
transport branched-chain AAs into the cytoplasm. The two BP genes
are encoded in a regulon that also contains the genes for the
common membrane protein components.
FEATURES from to/span description
pept 1407 2510 LIV-BP precursor (livJ)
sigp 1407 1475 LIV-BP signal peptide
matp 1476 2507 LIV-BP mature protein
pept 3503 4612 LS-BP precursor (livK)
sigp 3503 3571 LS-BP signal peptide
matp 3572 4609 LS-BP mature peptide
pept 4660 5586 leucine-specific binding protein (livH)
pept 5583 6857 livM product
pept 6854 7621 livG product
pept 7623 8336 livF product
mRNA 1303 > 4612 livJ mRNA [2] /nomgen="livJ"
ORF 3078 2566 (c) ORF19
revision 101 102 ct in [2]; cgt in [1]
revision 270 270 a in [2]; g in [1]
revision 280 280 a in [2]; g in [1]
revision 284 284 a in [2]; g in [1]
revision 304 304 a in [2]; g in [1]
revision 310 310 a in [2]; g in [1]
revision 322 324 caa in [2]; agg in [1]
revision 328 328 g in [2]; a in [1]
revision 1453 1453 c in [5]; g in [3]
revision 3832 3832 t in [5]; c in [3]
BASE COUNT 2039 a 2176 c 2425 g 2062 t 1 others
ORIGIN 76 min on the K12 map.
1 ctgcacggat caacattacg ccacttacgc ctgaataata aaagcgtgta tactctttcc
61 tgcaatgggt tccgtagcag ggaaagagac cccgttgtct cttcccggta tttcatctct
121 atgtcacatt ttgtgcgtaa tttattcaca agcttgcatt gaacttgtgg ataaaatcac
181 ggtctgataa aacagtgaat gataacctcg ttgctcttaa gctctggcac agttgttgct
241 accactgaag cgccagaaga tatcgattga gaggatttga atgactgaca aaatgcaaag
301 tttagcttta gccccagttg gcaacctgga ttcctacatc cgggcagcta acgcgtggcc
361 gatgttgtcg gctgacgagg agcgggcgct ggctgaaaag ctgcattacc atggcgatct
421 ggaagcagct aaaacgctga tcctgtctca cctgcggttt gttgttcata ttgctcgtaa
481 ttatgcgggc tatggcctgc cacaggcgga tttgattcag gaaggtaaca tcggcctgat
541 gaaagcagtg cgccgtttca acccggaagt gggtgtgcgc ctggtctcct tcgccgttca
601 ctggatcaaa gcagagatcc acgaatacgt tctgcgtaac tggcgtatcg tcaaagttgc
661 gaccaccaaa gcgcagcgca aactgttctt caacctgcgt aaaaccaagc agcgtctggg
721 ctggtttaac caggatgaag tcgaaatggt ggcccgtgaa ctgggcgtaa ccagcaaaga
781 cgtacgtgag atggaatcac gtatggcggc acaggacatg acctttgacc tgtcttccga
841 cgacgattcc gacagccagc cgatggctcc ggtgctctat ctgcaggata aatcatctaa
901 ctttgccgac ggcattgaag atgataactg ggaagagcag gcggcaaacc gtctgaccga
961 cgcgatgcag ggtctggacg aacgcagcca ggacatcatc cgtgcgcgct ggctggacga
1021 agacaacaag tccacgttgc aggaactggc tgaccgttac ggcgtttccg ctgagcgtgt
1081 acgccagctg gaaaagaacg cgatgaaaaa attgcgtgct gccattgaag cgtaatttcc
1141 gctattaagc agagaaccct agatgagagt ccggggtttt tgttttttgg gcctctgtaa
1201 taatcaattt cccctccggc aaaacgccaa tccccacgca gattgttaat aaactgtcaa
1261 aatagctatt ccaatatcat aaaaatcggg atatgtttta gcagagtatg ctgctaaagc
1321 acgggtagtc atgcataaaa cgaaataaag tgctgaaaaa caacatcaca acacacgtaa
1381 taaccagaag aatggggatt ctcaggatga acacaaaggg caaagcgtta ctggcaggat
1441 tgatcgcgct ggcattcagc aatatggctc tggcagaaga tattaaagtc gcggtcgtgg
1501 gcgcaatgtc cggtccggtt gcgcagtacg gtgaccagga gtttaccggc gcagagcagg
1561 cggttgcgga tatcaacgct aaaggcggca ttaaaggcaa caaactgcaa atcgcaaaat
1621 atgacgatgc ctgtgatccg aaacaggcgg ttgcggtggc gaacaaagtc gttaacgacg
1681 gcattaaata tgtgattggt cacctctgtt cctcatcaac gcagcctgcg tcggatatct
1741 acgaagacga aggcattttg atgatcaccc cagcggcaac cgcgccggag ctgaccgccc
1801 gtggctatca gctgatcctg cgaaccaccg gcctggattc cgaccaaggg ccgacggctg
1861 ccaaatatat tcttgagaaa gtgaaaccgc agcgtattgc tatcgttcac gacaaacagc
1921 aatacggcga aggtctggcg cgagcggtgc aggacggcct gaagaaaggc aatgcaaacg
1981 tggtgttctt tgatggcatc accgccgggg aaaaagattt ctcaacgctg gtggcgcgtc
2041 tgaaaaaaga gaatatcgac ttcgtttact acggcggtta tcacccggaa atggggcaaa
2101 tcctgcgtca ggcacgcgcg gcagggctga aaactcagtt tatggggccg gaaggtgtgg
2161 ctaacgtttc gctgtctaac attgcgggcg aatcagcgga agggctactg gtgaccaaac
2221 cgaagaacta cgatcaggtt ccggcgaaca aacccattgt tgacgcgatc aaagcgaaaa
2281 aacaggaccc aagtggcgca ttcgtttgga ccacctacgc cgcgctgcaa tctttgcagg
2341 cgggcctcaa tcagtctgac gatccggctg aaatcgccaa atacctgaaa gcgaactccg
2401 tggataccgt aatgggcccg ctgacctggg atgagaaagg cgatctgaaa ggctttgagt
2461 tcggcgtatt tgactggcac gccaacggca cggccaccga tgcgaagtaa tcattaatcg
2521 gcaactttgg gttgccgcca aattgctaat atcgagtacg ttgcttcatg ccggatgcgg
2581 cgtaaacgcc ttatccggcc tacaagatcc aaagaaatca gtaaattgca acacacattg
2641 taggcctgat aagcgtagcg catcaggcaa tacacttttg aaatcggact tgacgattaa
2701 cacttctccc agccgccctg ttgtgccgta aaccccagcg cctgcataaa cgccgtcatc
2761 acaccgcgat cttccacgcc gcagccgcca tccaccagca tgaaacgcca agattgttac
2821 gcaaaacctc ttccagcaga tattgcccca ccgcgacggc gggtgacttc ccgcacgcgc
2881 agggaatcca gtgctccctc ggtgccgctt aaggttgccc gcgcggcgcg agcaggcgct
2941 cgttaaacgc gcggcgtaga tacggtggtt atcgtcaacc tgtaacgagg aaggggaata
3001 ctcncggcca agatcttttg cgaggtcaat ccggtcttgg tcgctaaatt tttctaatcg
3061 aatgatggtc agcttcatgg gtaacccgtg taaatcacaa aagtgtaacc agtgtagcga
3121 aataatttaa tcggaggctt tctctttttt atttcttttg gcaggtgatt aattttttaa
3181 cagcaataat tacaaaatta aaacattaga gaatgaaaaa tgtccagcat aatcccctga
3241 atgatagtga attattccgc ccctttgtgc cgttatttta tgctgacaaa ggcacttttt
3301 tctgtttgtc tatcaataaa ttcggaatat tatctgttct taatcgactg aaaaatgggg
3361 attttaatcg ctattatcac aaaatactgc gctaacccct taatcagaca ggcaaaaaca
3421 gtgcagtata aaaaaagaac agtctgattt gttaacacat aaaaacaaag caacacaaca
3481 tcacgaatgg ggatttttga ctatgaaacg gaatgcgaaa actatcatcg cagggatgat
3541 tgcactggca atttcacaca ccgctatggc tgacgatatt aaagtcgccg ttgtcggcgc
3601 gatgtccggc ccgattgccc agtggggcat aatggaattt aacggcgcgg agcaggcgat
3661 taaagacatt aatgccaaag ggggaattaa gggcgataaa ctggttggcg tggaatatga
3721 cgacgcatgc gacccgaaac aagccgttgc ggtcgccaac aaaatcgtta atgacggcat
3781 taaatacgtt attggtcatc tgtgttcttc ttctacccag cctgcgtcag atatctatga
3841 agacgaaggt attctaatga tctcgccggg agcgaccgcg ccggaactaa cccaacgcgg
3901 ttatcaacac attatgcgta ctgccgggct ggactcttcc caggggccaa cggcggcaaa
3961 atacattctt gagacggtga agccccagcg catcgccatc atccacgaca aacaacagta
4021 tggcgaaggg ctggcgcgtt cggtgcagga cgggctgaaa gcggctaacg ccaacgtcgt
4081 cttcttcgat ggtattaccg ccggggagaa agatttctcc gcgctgatcg cccgcctgaa
4141 aaaagaaaac atcgacttcg tttactacgg cggttactac ccggaaatgg ggcagatgct
4201 gcgccaggcc cgttccgttg gcctgaaaac ccagtttatg gggccggaag gtgtgggtaa
4261 tgcgtcgttg tcgaacattg ccggtgatgc cgccgaaggc atgttggtca ctatgccaaa
4321 acgctatgac caggatccgg caaaccaggg catcgttgat gcgctgaaag cagacaagaa
4381 agatccgtcc gggccttatg tctggatcac ctacgcggcg gtgcaatctc tggcgactgc
4441 ccttgagcgt accggcagcg atgagccgct ggcgctggtg aaagatttaa aagctaacgg
4501 tgcaaacacc gtgattgggc cgctgaactg ggatgaaaaa ggcgatctta agggatttga
4561 ttttggtgtg ttccagtggc acgccgacgg ttcatccacg gcagccaagt gatcatccca
4621 ccgcccgtaa aatgcgggcg ggtttagaaa ggttacctta tgtctgagca gtttttgtat
4681 ttcttgcagc agatgtttaa cggcgtcacg ctgggcagta cctacgcgct gatagccatc
4741 ggctacacca tggtttacgg cattatcggc atgatcaact tcgcccacgg cgaggtttat
4801 atgattggca gctacgtctc atttatgatc atcgccgcgc tgatgatgat gggcattgat
4861 accggctggc tgctggtagc cgcgggattc gtcggcgcaa tcgtcattgc cagcgcctac
4921 ggctggagta tcgaacgggt ggcttaccgc ccggtgcgta actctaagcg cctgattgca
4981 ctcatctctg caatcggtat gtccatcttc ctgcaaaact acgtcagcct gaccgaaggt
5041 tcgcgcgacg tggcgctgcc gagcctgttt aacggtcagt gggtggtggg gcatagcgaa
5101 aacttctctg cctctattac caccatgcag gcggtgatct ggattgttac cttcctcgcc
5161 atgctggcgc tgacgatttt cattcgctat tcccgcatgg gtcgcgcgtg tcgtgcctgc
5221 gcggaagatc tgaaaatggc gagtctgctt ggcattaaca ccgaccgggt gattgcgctg
5281 acctttgtga ttggcgcggc gatggcggcg gtggcgggtg tgctgctcgg tcagttctac
5341 ggcgtcatta acccctacat cggctttatg gccgggatga aagcctttac cgcggcggtg
5401 ctcggtggga ttggcggcat tccgggggcg atgattggcg gcctgattct ggggattgcg
5461 gaggcgctct cttctgccta tctgagtacg gaatataaag atgtggtctc attcgccctg
5521 ccgattctgg tgctgctggt gatgccgacc ggtattctgg gtcgcccgga ggtagagaaa
5581 gtatgaaacc gatgcatatt gcaatggcgc tgctctctgc cgcgatgttc tttgtgctgg
5641 cgggcgtctt tatgggcgtg caactggagc tggatggcac caaactggtg gtcgacacgg
5701 cttcggatgt ccgttggcag tgggtgttta tcggcacggc ggtggtcttt ttcttccagc
5761 ttttgcgacc ggctttccag aaagggttga aaagcgtttc cggaccgaag tttattctgc
5821 ccgccattga tggctccacg gtgaagcaga aactgttcct cgtggcgctg ttggtgcttg
5881 cggtggcgtg gccgtttatg gtttcacgcg ggacggtgga tattgccacc ctgaccatga
5941 tctacattat cctcggtctc gggctgaacg tggttgttgg tctttctggt ctgctggtgc
6001 tggggtacgg cggtttttac gccatcggct tacacttttg cgctgctcaa tcactattac
6061 ggcttgggct tctggacctg cctgccgatt gctggattaa tggcagcggc ggcggcttcc
6121 tgctcggttt tccggtgctg cgtttgcgcg gtgactatct ggcgatcgtt accctcggtt
6181 tcggcgaaat tgtgcgcata ttgctgctca ataacaccga aattaccggc ggcccgaacg
6241 gaatcagtca gatcccgaaa ccgacactct tcggactcga gttcagccgt accgctcgtg
6301 aaggcggctg ggacacgttc agtaatttct ttggcctgaa atacgatccc tccgatcgtg
6361 tcatcttcct ctacctggtg gcgttgctgc tggtggtgct aagcctgttt gtcattaacc
6421 gcctgctgcg gatgccgctg gggcgtgcgt gggaagcgtt gggtgaagat gaaatcgcct
6481 gccgttcgct gggcttaagc ccgcgtcgta tcaagctgac tgcctttacc ataagtgccg
6541 cgtttgccgg ttttgccgga acgctgtttg cggcgcgtca gggctttgtc agcccggaat
6601 ccttcacctt tgccgaatcg gcgtttgtgc tggcgatagt ggtgctcggc ggtatgggct
6661 cgcaatttgc ggtgattctg gcggcaattt tgctggtggt gtcgcgcgag ttgatgcgtg
6721 atttcaacga atacagcatg ttaatgctcg gtggtttgat ggtgctgatg atgatctggc
6781 gtccgcaggg cttgctgccc atgacgcgcc ggcaactgaa gctgaaaaac ggcgcagcga
6841 aaggagagca ggcatgagtc agccattatt atctgttaac ggcctgatga tgcgcttcgg
6901 cggcctgctg gcggtgaaca acgtcaatct tgaactgtac ccgcaggaga tcgtctcgtt
6961 aatcggccct aacggtgccg gaaaaaccac ggtttttaac tgtctgaccg gattctacaa
7021 acccaccggc ggcaccattt tactgcgcga tcagcacctg gaaggtttac cggggcagca
7081 aattgcccgc atgggcgtgg tgcgcacctt ccagcatgtg cgtctgttcc gtgaaatgac
7141 ggtaattgaa aacctgctgg tggcgcagca tcagcaactg aaaaccgggc tgttctctgg
7201 cctgttgaaa acgccatcct tccgtcgcgc ccagagcgaa cggctcgacc gcgccgcgac
7261 ctggcttgag cgcattggtt tgctggaaca cgccaaccgt caggcgagta acctggccta
7321 tggtgaccag cgccgtcttg agattgcccg ctgcatggtg acgcagccgg agattttaat
7381 gctcgacgaa cctgcggcag gtcttaaccc gaaagagacg aaagagctgg atgagctgat
7441 tgccgaactg cgtaatcatc acaacaccac tatcttgttg attgaacacg atatgaagct
7501 ggtgatggga atttcggacc gaatttacgt ggtcaatcag gggacgccgc tggcaaacgg
7561 tagcccggag cagatccgta ataacccgga cgtgatccgt gcctatttag gtgaggcata
7621 agatggaaaa agtcatgttg tcctttgaca aagtcagcgc ccactacggc aaaatccagg
7681 cgctgcatga ggtgagcctg catatcaatc agggcgagat tgtcacgctg attggcgcga
7741 acggggcggg gaaaaccacc ttgctcggca cgttatgcgg cgatcccggt gccaccagcg
7801 ggcgaattgt gtttgatgat aaagacatta ccgactggca gacagcgaaa atcatgcgcg
7861 aagcggtggc gattgtcccg gaagggcgtc gcgtcttctc gcggatgacg gtggaagaga
7921 acctggcgat gggcggtttt tttgctgaac gcgaccagtt ccaggagcgc ataaagtggg
7981 cgtatgagct gtttccacgt ctgcatgagc gccgtattca gcgggcgggc accatgtccg
8041 gcggtgaaca gcagatgctg gcgattggtc gtgcgctgat gagcaacccg cgtttgctac
8101 tgcttgatga gccatcgctc ggtcttgcgc cgattatcat ccagcaaatt ttcgacacca
8161 tcgagcagct gcgcgagcag gggatgacta tctttctcgt cgagcagaac gccaaccagg
8221 ggctaaagct ggcggatcgc ggctacgtgc tggaaaacgg ccatgtagtg ctttccgata
8281 ctggtgatgc gctgctggcg aatgaagcgg tgagaagtgc gtatttaggc gggtaataac
8341 acgttgattg atagggagtc aaaagactcc tttgagacag gtgacaaatg taaaattgcc
8401 tgatgcgctg cgcttatcag gcctactggg tgagtggcaa tatgttgaat ttgcacgatc
8461 ttgtaggcct gataagcgtt taccgcgcat ccggcatgaa acgatgagca atctgtagag
8521 tttgattcag accttctata ttttcccgct tatccgtgcc ccatctccca ttttccctca
8581 cccacgccgt caccgccttg tcatctttct gacaccttac tatcttacaa atgtaacaaa
8641 aaagttattt ttctgtaatt cgagcatgtc atgttacccc gcgagcataa aacgcgtgaa
8701 ttc
//
LOCUS BOVGOA 472 bp ss-mRNA MAM 09-AUG-1990
DEFINITION B.taurus go-alpha mRNA, 3' end.
ACCESSION J02900
KEYWORDS go-alpha.
SOURCE B.taurus retina, cDNA to mRNA, clone GO3.1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (sites)
AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M.,
Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M.
TITLE Multiple forms of go-alpha mRNA: Analysis of the 3'-untranslated
regions
JOURNAL Biochemistry 29, 5069-5076 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 472)
AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M.,
Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by S.R.Price, 12-JUN-1990.
FEATURES from to/span description
pept < 1 3 go-alpha (AA at 1)
BASE COUNT 130 a 133 c 88 g 121 t
ORIGIN
1 tgacctcttg tcctgtatag caacctattt ggtaatgatt ccagcactca cagaaaagct
61 tgcacacata cacacacacc ccacccctcc ccactaacaa atgcaagttg gtaaacaaat
121 tccaaaaagg cataacaaac cttatatata tagacaaata tatattaaag ttttttagtc
181 tgtactagaa agagcttcag acagaactga ccaccattcc attgctcatc aatttcctgg
241 gacagcacct gagcgtgcgc ttacgcgcgt acacacacat agacacgcac tgcgatacaa
301 gtcctgattt gggagtccgt ccttttaaaa acagccacat gctttcacgc tctgagaccc
361 acccgtttct gtgagcaggg ggagggcaag gaaagccctg gcctcagtcc agccttttct
421 ctgcttccac ctgctcaggc tgtgtgctct tggttctgtc ctgcacttgt gt
//
LOCUS CAJCAT 1334 bp ds-DNA BCT 09-AUG-1990
DEFINITION C.coli plasmid C-589 chloramphenicol acetyltransferase (cat) gene,
complete cds.
ACCESSION M35190
KEYWORDS chloramphenicol acetyltransferase.
SOURCE C.coli plasmid C-589 DNA.
ORGANISM Campylobacter coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 1 to 1334)
AUTHORS Wang,Y. and Taylor,D.E.
TITLE Chloramphenicol resistance in Campylobacter coli, nucleotide
sequence, expression and cloning vector construction
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.E.Taylor, 15-JUN-1990.
FEATURES from to/span description
pept 309 932 chloramphenicol acetyltransferase (cat)
mRNA 277 > 932 chloramphenicol acetyltransferase mRNA
signal 242 271 promoter
binding 297 301 ribosome binding site
signal 960 1006 transcriptional termination signal
BASE COUNT 433 a 232 c 282 g 387 t
ORIGIN
1 attcccacaa cgccggaaac aagccgtgcc acgagcttat aataaaagag ggaagagaag
61 cgtatttttc ctcacttccg gtgaaggata tcgagaaaaa tctaaatgat aacggaattc
121 cgtcgtcggt atcgtatgga gcggacaacg agtaaaagag tgaccgccga gataacccat
181 tgctcggcgg tgttcctttc caagttaatt gcgtgatata gattgaaaag tggatagatt
241 tatgatatag tggatagatt tatgatataa tgagttatca acaaatcgga atttacggag
301 gataaatgat gcaattcaca aagattgata taaataattg gacacgaaaa gagtatttcg
361 accactattt tggcaatacg ccctgcacat atagtatgac ggtaaaactc gatatttcta
421 agttgaaaaa ggatggaaaa aagttatacc caactctttt atatggagtt acaacgatca
481 tcaatcgaca tgaagagttc aggaccgcat tagatgaaaa cggacaggta ggcgtttttt
541 cagaaatgct gccttgctac acagtttttc ataaggaaac tgaaaccttt tcgagtattt
601 ggactgagtt tacagcagac tatactgagt ttcttcagaa ctatcaaaag gatatagacg
661 cttttggtga acgaatggga atgtccgcaa agcctaatcc tccggaaaac actttccctg
721 tttctatgat accgtggaca agctttgaag gctttaactt aaatctaaaa aaaggatatg
781 actatctact gccgatattt acgtttggga agtattatga ggagggcgga aaatactata
841 ttcccttatc gattcaagtg catcatgccg tttgtgacgg ctttcatgtt tgccgttttt
901 tggatgaatt acaagacttg ctgaataaat aaaatcccag tttgtcgcac tgataaaaac
961 cctttaggaa ctaaagggcg cacttctata ctctctgtcg agagtagtgc gtcctgcgga
1021 gcttcattcc cggtcagcgc gcttatcaat atatctatag aatgggcaaa gcataaaaac
1081 ttgcatggac taatgcttga aacccaggac aataacctta tagcttgtaa attctatcat
1141 aattgtggtt tcaaaatcgg ctccgtcgat actatgttat acgccaactt tgaaaacaac
1201 tttgaaaaag ctgttttctg gtatttaagg ttttagaatg caaggaacag tgaattggag
1261 ttcgtcttgt tattaattag cttcttgggg tatctttaaa tactgtagaa agaggaagga
1321 aataataaat ggct
//
LOCUS CLOCBA 5120 bp ds-DNA BCT 09-AUG-1990
DEFINITION C.acetobutylicum beta-D-galactosidase (cbgA) and
beta-D-galactosidase regulatory protein (cbgR) genes, complete cds.
ACCESSION M35107
KEYWORDS beta-D-galactosidase; beta-D-galactosidase regulatory protein.
SOURCE C.acetobutylicum (strain NCIB2951) DNA.
ORGANISM Clostridium acetobutylicum
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1260 to 5120)
AUTHORS Hancock,K.R., Rockman,E., Pearce,L., Maddox,I.S. and Scott,D.B.
TITLE Clostridium acetobutylicum beta-galactosidase gene, cbgA, is
positively regulated in Escherichia coli by a novel regulatory
gene, cbgR
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 5120)
AUTHORS Scott,D.B., Hancock,K.R., Pearce,L. and Maddox,I.S.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by D.B.Scott, 11-JUN-1990.
Author address:D.B.Scott:
Molecular Genetics Unit
Department of Microbiology and Genetics
Massey University
Palmerston North, New Zealand
E-mail:D.B.Scott@massey.ac.nz
FEATURES from to/span description
pept 1560 4253 beta-D-galactosidase (cbgA)
pept 4500 4805 beta-D-galactosidase regulatory protein (cbgR)
BASE COUNT 1921 a 683 c 876 g 1640 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattccttt tcatatatat ctttaatatt tctactggaa tagaagaggt tgctcaatac
61 aaaaaatgct tctttaaaac tatttgaaac tacttctgaa atattttcta gcttactaaa
121 tagagaatta taatttttat catcaaaatt tagaattaca actatgattt cgttttcaat
181 attagcaatt tgtatattat aattgctatt taatccgtct aaagaaaatt ctttgccgat
241 ttctgaaatt gtaaaatcaa taatttcatg gcgtttgcta taattatcat atatttcttt
301 gcgtttaaac caaataagca aaatgattga aaagtaaata tgtatcaaag tagttaaagt
361 caggatcatg tcaaaacctg atataaggcg atttaaggcg ctattagtga gacttaaaga
421 gtttccttct aaagtatttc ttttcatttt tattgaaatc ttttttagag tacttaataa
481 ctcagaagga tttagagaag gttttaaaat ataatcaaca gcaccatttt gaaaagatga
541 tttaacatat tcaaaatcgc tataactact taagatgata attcttatct taggatattt
601 gtcctgcaca aatttagcta attcaacccc atttatttgg ggcattacaa catcagaaat
661 tataatgtca ggaatatcct tttttatcat ttccagagct tcttgaccat tagaagcctg
721 tcctataatt tgaaagcctt ctttttccca atcaatcata tgagttatgc cttgccgcat
781 aataaattca tcatcaacaa ctaaattttt actatattcg ttcaatagta tagcacccct
841 tattctaaaa ttaccacaac atagataaat attgcttaat actattatac cttatagatt
901 tattgtatgt atctgtatac gttacgttaa ttcatctaca aatttatatg agttttggtt
961 gcacttttag agaaaatctt tttgtctatg gtcttattgt cctataatgg tcaaatcatc
1021 tttaccaaag tctcttgatt taaagagata aaaacaccac tgatccatta ttcctcattt
1081 tggtaatgaa cctatgcggt tgaagatatt aatcagatgt ctaaatactt tagaaaaaaa
1141 gacctttact aatatcttca atatttacac ccctattcta aaattaccac aagatagata
1201 aatattgctt aatactgatt ataccttata gattaaaggt tttcaattaa acaataaatt
1261 actttagtaa agtttagtaa aatataattg attttttact aaaaagataa taaaatgaaa
1321 ctataaattt agttaatagc ataaatctaa catcagaaga taggataaat taaagaagta
1381 atgtaattga ttacgaaaca aaatctcata ttaatattag cccataattt ttttattctc
1441 atatatgttt aagtattaat taaatgtgac tttataaaaa ggttgcattt agttaatacg
1501 attaacaact ttaatttaaa aaagcaataa ctctacaaag tgaaagtgag ggggtaagta
1561 tgattaataa taaaccgtca ttagattggc tagaaaatcc ggaaatattt agagttaata
1621 gaatagatgc tcattctgat acttggtttt atgaaaaatt tgaggatgtt aaattagaag
1681 acaccatgcc tcttaagcaa aatttaaatg gaaaatggag attttcatat agtgaaaatt
1741 catcattaag aattaaagag ttttataagg atgagtttga cgtaagttgg attgattata
1801 ttgaagttcc aggtcatatt cagcttcaag gatatgataa atgtcaatat attaatacta
1861 tgtatccttg ggaaggtcac gatgaattaa gaccacctca tatttcaaaa acatataatc
1921 cggtgggaag ctatgtaaca ttttttgaag ttaaagatga actcaaaaat aagcagactt
1981 ttatttcttt tcaaggtgtt gaaacagcat tttacgtatg ggtaaatgga gaatttgtag
2041 gatatagcga agatacattt acaccatcag aatttgatat tactgattat ttaagagagg
2101 gagaaaataa acttgcagtt gaggtttata aaaggagtag cgcaagttgg atagaagatc
2161 aagatttctg gagattttca ggcatcttta gagatgtata tttatatgca gttccagaaa
2221 ctcatgtaaa tgatatattt ataaaaacag atttatatga cgatttcaaa aacgcaaagt
2281 taaatgctga acttaaaatg attggaaatt cagaaacaac agttgaaaca tatttagaag
2341 ataaagaagg aaataaaata gctatatctg aaaagattcc gttctctgat gagttgactt
2401 tatatttaga tgcgcaaaat ataaacctat ggagtgcaga agagcctaac ttatatacac
2461 tttatatttt agtgaataaa aaagatggta atttaattga ggttgtaact caaaagatag
2521 ggtttaggca ctttgaaatg aaggataaaa ttatgtgtct aaaatggaaa cgtattatct
2581 ttaaaggcgt aaaccgtcac gaatttagcg caagacgtgg acgctcaatt acgaaagagg
2641 acatgttgtg ggatattaag ttcttgaaac aacacaatat taatgctgtt agaacatcac
2701 attatccaaa tcaaagttta tggtacagac tttgcgatga atacgggatt tatttaatag
2761 atgaaacaaa tttagaaagc catggttcat ggcaaaagat ggggcagatt gaaccatcat
2821 ggaatgtgcc aggaagtctt ccacagtggc aggcagcagt tttagatcga gcatcatcaa
2881 tggttgaaag agataaaaat catccatctg tacttatttg gtcatgtggt aatgaatcct
2941 atgcgggtga agatatttat cagatgtcta aatactttag aaaaaaagat ccttcacgtt
3001 tagtgcacta tgaaggggta actagatgca gagaatttat gacacgacga catgaaagta
3061 gaatgtatgc aaaggcagca gaaatagaag aatatcttaa tgataatccg aagaaacctt
3121 atatacagct gcgatacatg cactcaatgg gtaactcaac tggtggaatg atgaaataca
3181 cagaacttga agataaatat ttgatgtatc aaggtggatt catttgggat tacggcgatc
3241 aggcgttgta tagaaaactt ccagatggaa aagaagttct agcttatgga ggagacttta
3301 cagatcgtcc aacagactat aatttctctg gaaatggttt gatttatgca gatagaacta
3361 tatcacctaa agcacaggaa gttaagtatc tatatcaaaa cgtaaaatta gaaccagatg
3421 aaaaaggggt gactattaag aatcaaaatc tttttgttaa tactgataaa tatgatttat
3481 actatatcgt tgaaagagat ggaaaactaa taaaagatgg ttatctaaat gtatctgtag
3541 ctccagatga agaaaaatat atagaacttc caataggaaa ttacaatttt cctgaagaaa
3601 ttgtacttac aacctcatta agattagcac aagctacact ttgggcagaa aaaggatatg
3661 aaatagcatt tggacaaaag gttattaaag aaaaatcaga tatgaataat cataattcag
3721 agtctaaaat gaagatcatt catggagatg taaacatagg ggttcacgga aaagatttca
3781 aggctatatt ctctaaacaa gagggaggaa tcgtatcctt gagatataat aataaggagt
3841 ttataacgag aacgccaaaa actttctatt ggagagcaac aacagataat gatagaggaa
3901 atagacatga atttagatgc agtcaatggc tggctgctac tatggggcag aagtatgtgg
3961 atttttcagt tgaggaattt gatgagaaga ttacattata ttatacttat caattgccaa
4021 cagtgccatc tactaatgtt aagataactt atgaagtatc tggagaagga ataattaaag
4081 taaatgttaa gtataaagga gttagcggat tacctgaatt gcctgtacta ggaatggatt
4141 ttaaattatt agccgaattt aattcattta gctggtatgg aatggggcca gaagaaaact
4201 atatagacag atgtgaaggt gcaaaacttg gaatatatga gagtacacaa tagaaaatct
4261 atcaaggtat ttagtaccac aagaatgtgg taacaggata ggaactagat gggtagtagt
4321 taaaaatcat aagaatgaag gtcttaaatt tacttatgtt aaagttccat ttgaatttag
4381 tgttttacca tacagcagca tggaattaga aaattcactt catatagaag aattaccatc
4441 tgttaatttt acacattgtg aatataatag gtaaacaaat gggtgttggc ggagatgcaa
4501 tgctggggag caccatgata cctaaattct gtatagattc aagtaaggat ttagaatata
4561 gttttataat ttctaaaatt atactacgca catatgggaa ctatagatat ccaaaacaaa
4621 acttagactt atgcaataat ttacgaaagg acaggtactc tgttgtttcg gttactaaga
4681 ataagttgag gctttctaac atcataagtt gcaccatttc agcatgctcc cgagacaagc
4741 tcgtgacaag caaaaatgga acaacttatg atgaagaaat gcctgcaaca tattctttaa
4801 tgtaacactg cacaaaagag tacctgtcct ttctgatata gcagattttt caagctataa
4861 gtatatctca cgaaatcata aatattttga ttccgaaaag ctatgaaaat atcgctgaag
4921 gttctaagca gctggttgtg tgcaccttag catgctccaa ctttcagttt gacaagctaa
4981 aatggaacaa tctacagctc aagaaacttt aacagctcat tttcaaatgt tttctacaca
5041 aatatattta tatttctagt gaagatatga aattaaattt ttagcgactt tgtaaatatg
5101 ttaatctaat atacgaattc
//
LOCUS ECOPNCB 1490 bp ds-DNA BCT 09-AUG-1990
DEFINITION E.coli nicotinic acid phosphoribosyl transferase (pncB) gene,
complete cds.
ACCESSION J05568
KEYWORDS nicotinic acid phosphoribosyl transferase.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1490)
AUTHORS Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J.,
Meesters,H.A.R. and Witholt,B.
TITLE Variation of cofactor levels in Escherichia coli: Sequence analysis
and expression of the pncB gene encoding nicotinic acid
phosphoribosyl transferase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by P.Terpstra, 31-MAY-1990.
Nicotinic acid phosphoribosyl transferase is the first enzyme of
the three enzyme Preiss-Handler pathway leading to the synthesis of
NAD. The protein sequence shows similarity to orotate
phosphoribosyl transferase (pyr5) from Dictyostelium discoideum
(acc P09556, SWISS-PROT)
FEATURES from to/span description
pept 216 1418 nicotinic acid phosphoribosyl transferase
(pncB) (EC 2.4.2.11)
mRNA 158 1450 nicotinic acid phosphoribosyl transferase mRNA
(3' end put.)
signal 124 129 -35 region
signal 146 151 -10 region
rpt 170 185 inverted repeat
binding 197 202 ribosome binding site
signal 1426 1450 rho-independent transcription termination
signal
BASE COUNT 348 a 374 c 364 g 404 t
ORIGIN
1 tgttgcgtaa tgcgtatgca gaatcttcat cttttcaggt acaaacgcct ttattgctac
61 atttttataa catacagcgc gtaatgccat cgaccagaaa ggtggcatat ggtgtgatcg
121 gggttcaata aattgcgaaa caaggtatac tccagcagtt cctgaagatg tttattgtac
181 taaacgctcc tgtacgagga cgctactgcg cacctatgac acaattcgct tctcctgttc
241 tgcactcgtt gctggataca gatgcttata agttgcatat gcagcaagcc gtgtttcatc
301 actattacga tgtgcatgtc gcggcggagt ttcgttgccg aggtgacgat ctgctgggta
361 tttatgccga tgctattcgt gaacaggttc aggcgatgca gcacctgcgc ctgcaggatg
421 atgaatatca gtggctttct gccctgcctt tctttaaggc cgactatctt aactggttac
481 gcgagttccg ctttaacccg gaacaagtca ccgtgtccaa cgataatggc aagctggata
541 ttcgtttaag cggcccgtgg cgtgaagtca tcctctggga agttcctttg ctggcggtta
601 tcagtgaaat ggtacatcgc tatcgctcac cgcaggccga cgttgcgcaa gccctcgaca
661 cgctggaaag caaattagtc gacttctcgg cgttaaccgc cggtcttgat atgtcgcgct
721 tccatctgat ggattttggc acccgtcgcc gtttttctcg cgaagtacaa gaaaccatcg
781 ttaagcgtct gcaacaggaa tcctggtttg tgggcaccag caactacgat ctggcgcgtc
841 ggctttccct cacgccgatg ggaacacagg cacacgaatg gttccaggca catcagcaaa
901 tcagcccgga tctagccaac agccagcgag ctgcacttgc tgcctggctg gaagagtatc
961 ccgaccaact tggcattgca ttaaccgact gcatcactat ggatgctttc ctgcgtgatt
1021 tcggtgtcga gttcgctagt cggtatcagg gcctgcgtca tgactctggc gacccggttg
1081 aatggggtga aaaagccatt gcacattatg aaaagctggg aattgatcca cagagtaaaa
1141 cgctggtttt ctctgacaat ctggatttac gcaaagcggt tgagctatac cgccacttct
1201 cttcccgcgt gcaattaagt tttggtattg ggactcgcct gacctgcgat atcccccagg
1261 taaaacccct gaatattgtc attaagttgg tagagtgtaa cggtaaaccg gtggcgaaac
1321 tttctgacag ccctggcaaa actatctgcc atgataaagc gtttgttcgg gcgctgcgca
1381 aagcgttcga ccttccgcat attaaaaaag ccagttaata tcatcaggga gctaatcggc
1441 tccctttttt tacctttaat tccgaaatct ttcgctgcat ttgcgaattc
//
LOCUS NEUCCON13 2728 bp ds-DNA PLN 09-AUG-1990
DEFINITION N.crassa conidiation-specific protein (con-13) gene, complete cds.
ACCESSION M35120
KEYWORDS conidiation-specific protein.
SOURCE N.crassa (strain 74-OR23-1A) DNA, clone pCon10a.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 2728)
AUTHORS Hager,K.M. and Yanofsky,C.
TITLE Genes expressed during conidiation in Neurospora crassa: Molecular
characterization of con-13
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.M.Hager, 12-JUN-1990.
Author address:K.M.Hager:
Dept. of Physiology
UCLA Medical School
10833 Le Conte Avenue
Los Angeles, CA 90024-1751
E-mail:COTRAN%VXBIO.SPAN@STAR.STANFORD.EDU
FEATURES from to/span description
pept 1009 1275 conidiation-specific protein (con-13), exon 1
1333 1847 conidiation-specific protein, exon 2
1910 2150 conidiation-specific protein, exon 3
pre-msg 922 2367 con-13 mRNA and introns (alt.)
pre-msg 927 2367 con-13 mRNA and introns (alt.)
pre-msg 936 2367 con-13 mRNA and intron (alt.)
pre-msg 946 2367 con-13 mRNA and intron (alt.)
IVS 1276 1332 con-13 intron A
IVS 1848 1909 con-13 intron B
site 2364 2367 polyadenylation site
BASE COUNT 653 a 695 c 720 g 660 t
ORIGIN Linkage group IV.
1 gatctcatca tctgaaacgc cgcctgagtc aatgactctt ggcaatcggg ctctgcgtcc
61 ggctagatag acagcgtccc actgatacag acttggtaag ctgccacagt tgccaagttt
121 ttatatcgat tattctttga acttccaagg acagtcttca agggcgcttt ctgtctcagc
181 atcgggagat atgacgcccg tggttcgtat accaatggtt cggcactaag gcgctgcatt
241 tgactcggag atattgacgc ctgccccctt ttgagaggag actgagtgag cgaggcccaa
301 tactatcacc acagttgcgg ttagctgccg agacttatcg gtcaacaccg aaatattggc
361 ccagaagggc aacaaaacgg gctgtcgatg gcttgcaacc attgatatcc ctgattgcca
421 ttcctacact accgcccatt cttcattcaa acctgactct cttactccct ttacagtcta
481 gcagatctgg acgtacctgc atgtaatgcg gccaacgggg ctggtaagct gaacacacca
541 ttcggagcgg ctggcaagtc tgtcatgccc gatcgacagc acatgtacta gactatctta
601 agcctagttc cgtgttcaga aacatccggt ttgattgcga atcaacagta cattgatgtt
661 catccaccgg actctaaacc gatcagctaa ttgttggcgg agcggagttc atcgcgggcg
721 taggaaacaa ggttgatgtt acccgtaaat ggaaatcgtg cttcgctcac ggcgttgctc
781 cgaagtaggg tgaagaggtc cgttggctgt gatggtttgc gctggtgtgt gtcaacgctt
841 agtgatgctg gtgatccaac tccgatccaa atgacaaagc aatgcatata agaaggactg
901 ggcatcacca acagcgcaac ggcggcagac acgaagccct agctcgacaa gcagccttca
961 taccccgacc aaaaagtcac acttgtcgta ccgtaacctc gtcgcaagat gccccaggct
1021 catttcttcg cgttgctgct tgcagccgtt gtaccggccg ttttggcgga cggtcccccg
1081 gaatcgatgg gcgagaagtt cagcggcctc aacgttctgg atgggaacgg cggacttcaa
1141 agtttgaccc cgacacccta caccataagt caatggcctt ggggtactgt acccaagctg
1201 tgctatgaca cgtctgtcaa caacaagtac tgcaacccgt acgatctcga agtatacgat
1261 gtcagataca cggatgtagg taaaagactt gcctcggatt cggaacctgt gcttacctta
1321 acttgacaat agtgccccat tcccaccacc gtctgccgat gcaagaactc acctatggcc
1381 atagacacca ttgcgcagcg tgtcggccaa ctccctgtca aggctcgcca gtataatggc
1441 tatgtgtcca gctttgcggg agacatgtgc tcagcctaca gcgatagctt caacaactac
1501 ttctttggcg actgcggcaa ttccgagtcc gtcttcttcc atgagctcag ccacaacctt
1561 gaccgtcacg ttgcaggggc gtccatcaac gattggtact ccctttcgca agactggaag
1621 gataccgttg ccaaggacac ttgcgtcgca gaccactatt ccaaggccag ctggctcgag
1681 gcatatgccc aggtgggagt catggctgga tacgatgcta cggtacagtc tatctatacc
1741 caaaatgtcg gctgtatggt caatcaggtc aagaaggtgg ttggacagtt gaacagtgtc
1801 tggcgtaaac agcctgggca gatgtgcgat cgttactgga tcaaggagta agtttctttc
1861 aacaagaccc attttcttga tgaccctgtg ctgaccggaa tgtaaacagc accacggttt
1921 gcatgggacc tgatgcggaa gccagtggcc actgtcaagc atccaaagct gatgtcgcgg
1981 cggagtctgg tggtgtaaac ccagtgttgc cggacgggca gcagaagaag cacgacgcct
2041 tggtcaagga gcttcagcgt cacgccgagg ccgcggccgg catttcttcc ggaaaaccgg
2101 cggccgatag aaagaccaag ggtaagaagg gtaccaaatt cagggtctga agcgggaact
2161 atgatcgatt ccaggtcctg ggctctagct gtgagttcag tcagggtgtt gaggaagttg
2221 cgaggcctca gttgtgagcg acgtcatcaa accgtctcct tttgggataa tgataacctt
2281 ttatttctgg ataactggga caggttaggc tgtctttgtc gatagactag gtacgtaaga
2341 attgatttga tgcttgttcg atgcttttaa gttgttgtcg cttgtggttg cgaggtagtc
2401 ggcaggtttg tttggataga cgggagacgc ccactcgcac ccagggcgat gaataacgaa
2461 ggccgatggc tctttccatg tgggaaatac acaagtctgg cattgtccac ttgtttgtct
2521 tcgagcgggg ttacgatttc tgtcaagccc tttgctcctt tcttccgaga acaaaggaag
2581 ttttcgatcc agatcgccaa catccgaaaa gggaggaata gttcgatcga tgtaccttga
2641 cggctcggcc atcgatctga tctgcatttc ccactctgga ttccagggga agggtcatat
2701 gatggaaacg agatcgaaac ccattgag
//
LOCUS VVUVVHAB 2237 bp ds-DNA BCT 09-AUG-1990
DEFINITION V.vulnificus cytolysin (vvhA) and vvhB gene (pot.), complete cds.
ACCESSION M34670
KEYWORDS cytolysin; cytotoxin; hemolysin; toxin.
SOURCE V.vulnificus (strain EDL174) DNA, clone pCVD702.
ORGANISM Vibrio vulnificus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 2237)
AUTHORS Yamamoto,K., Wright,A.C., Kaper,J.B. and Morris,J.G.
TITLE The cytolysin gene of Vibrio vulnificus: Sequence and relationship
to Vibrio cholerae El Tor hemolysin
JOURNAL Infect. Immun. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.B.Kaper, 29-MAY-1990.
FEATURES from to/span description
pept 745 2160 cytolysin (vvhA) precursor
sigp 745 804 cytolysin signal peptide
matp 805 2157 cytolysin
pept 237 743 pot. cytolysin (vvhB)
signal 55 60 -35 region
signal 80 85 -10 region
signal 87 92 -35 region
signal 110 115 -10 region
signal 184 189 -35 region
signal 206 211 -10 region
signal 2185 2219 transcription termination signal
binding 54 69 CRP binding site
binding 59 74 Fur binding site
binding 185 199 Fur binding site
binding 226 231 vvhB ribosome binding site
binding 730 735 vvhA ribosome binding site
BASE COUNT 639 a 498 c 509 g 591 t
ORIGIN
1 tatattagat cacttttaaa acaataatag atcagatatt aatctgttga ttttgtgata
61 atgagccaaa aaatactttt attttattta tatgaaatat tttcaggatt attaataaat
121 agccaacagg attttggtgc atatctattc tcaaggacga accaaacaat ctccatacaa
181 atattaatgt tatggagaaa ataacaataa taacccttac tcgtaatgag gaatctatgc
241 ttaataacaa aaatagaaat gtaggacgcc ttaccctact ctgctgtttg tttgcggcga
301 atacttttgc tgatgttcaa attttgggca gcgaaagtga gctttcacaa accattgccg
361 atcagtacca acaaaatgtc acgctgttta acggccagct aaacagtaat gatgtgttgt
421 atgtcaatgt aggaacagca accgatgacg aaatcactca agcaaaaagt catatcatct
481 ccggtagcac cgtggtgatt gatttgactc aaattgctgg tgacgacgca aggcttgatt
541 ggagccaaaa actcactggt ttaggactgt cagcgcctgt tgtggttacg ggggtttatc
601 aaggcgacgc cttagtcaat gcgattgtca gcgatgtcac cgacgagaat gacaacccaa
661 tcaacgatcc ccaagccgag ttagagagcg ttaaactttc tctcactcat gccctagacc
721 gcttccaatc tgagggaaaa taagatgaaa aaaatgactc tgtttaccct ttctctttta
781 cgtaccgcgg tacaggttgg cgcacaagaa tatgtgccga ttgttgagaa acctatttac
841 atcaccagct caaagattaa gtgtgtgttg cacacaagcg gtgatttcaa cgccacacga
901 gactggtgta atgcgggtgc ttccatcgat gttcgcgtca atgtggcaca aatgcgctcg
961 gtacaatcgg caacgtcaga tggttttact cctgacgcca aaattgtccg tttcaccgtc
1021 gatgccgaca agcctggcac gggtattcat ttggttaacg agctacagca agatcacagc
1081 tggttccaga gttgggcaaa ccgccgcact tacattggtc cattcgccag cagttacgac
1141 ctttgggtga aacccgtttc tggttacaca ccgaaaaaag cccgtgacct accgcagaat
1201 gagaacaaaa actaccaaca ccgcgatact tacggttact ccatcggtat taacggcaaa
1261 gtaggtgcgg aagtgaacaa agacggcccg aaagtgggtg gcgaagtcag tggctcattt
1321 acctacaact actcgaagac cttggtgttt gatacaaaag actatcgcat caacaaccgt
1381 tcatcattga gtgattttga tatttcattc gagcgtgaat ttggggaatg tgatgaactg
1441 cgccgccaag agcttggatg ctatttcacc gccgctcact ggggcagtgg ctgggtattt
1501 gataagacga agttcaaccc tatctcttat tccaacttca aaccgaacta tgacgttttg
1561 tacgaagcgc ccgtgtctga aactggcgta acggattttg agatgggcgt gaaactcaac
1621 tatcgtgcac gctttggtac cgttcttcct tcagcgctgt tttcggttta cggctctgcg
1681 ggctcgtcaa ccaacagcag tactgtgaaa caacgtattc gcatcgactg gaatcaccca
1741 ctgtttgaag cggaacgaca cgttacactg cagtcactga gcaacaacga tctctgcctg
1801 gatgtttatg gtgagaacgg tgacaaaacg gttgcgggtg gttcggttaa cggctggagc
1861 tgtcacggca gttggaacca agtttggggc ctagataaag aagaacgtta tcgtagccga
1921 gtggcatccg atcgttgttt gaccgtaaac gcagacaaaa cgctcacagt cgaacagtgt
1981 ggtgcgaact tagcacagaa atggtattgg gaaggcgata agctcattag ccgctatgtt
2041 gatggcagta atactcgcta ccttctaaac attgttggtg gtcgtaatgt tcaagtaacc
2101 cctgaaaatg aagcaaatca ggcgcgttgg aaacccacat tacaacaagt caaactctag
2161 gctctgttga ccttagcgat atccaaacgc tccctgtata ctagggagcg tttttcttta
2221 ttcgccatct attcgtc
//
LOCUS TOBCPCG 155844 bp ds-DNA circular ORG 09-AUG-1990
DEFINITION N.tabacum (var. Bright Yellow 4) chloroplast, complete genome.
ACCESSION Z00044
KEYWORDS 16S ribosomal RNA; 23S ribosomal RNA; 4.5S ribosomal RNA;
5S ribosomal RNA; ATP synthetase; ATPase; NADH dehydrogenase;
RNA polymerase; autonomous replication; carboxylase; chloroplast;
complete genome; cytochrome; cytochrome b559; cytochrome b6;
cytochrome f; initiation factor; phosphoprotein; ribosomal protein;
ribosomal protein L14; ribosomal protein L16; ribosomal protein L2;
ribosomal protein L20; ribosomal protein L22;
ribosomal protein L23; ribosomal protein L33;
ribosomal protein S11; ribosomal protein S12;
ribosomal protein S15; ribosomal protein S18;
ribosomal protein S19; ribosomal protein S2; ribosomal protein S3;
ribosomal protein S7; ribosomal protein S8;
ribulose bisphosphate carboxylase; transfer RNA-Ala;
transfer RNA-Arg; transfer RNA-Asn; transfer RNA-Asp;
transfer RNA-Cys; transfer RNA-Glu; transfer RNA-Gly;
transfer RNA-His; transfer RNA-Ile; transfer RNA-Leu;
transfer RNA-Lys; transfer RNA-Met; transfer RNA-Phe;
transfer RNA-Pro; transfer RNA-Ser; transfer RNA-Thr;
transfer RNA-Trp; transfer RNA-Tyr; transfer RNA-Val.
SOURCE Nicotiana tabacum (var. Bright Yellow 4) chloroplast DNA, clone
pHC79 (IR-A and IR-B).
ORGANISM Chloroplast Nicotiana tabacum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae; Nicotiana tabacum.
REFERENCE 1 (sites)
AUTHORS Shinozaki,K., Ohme,M., Tanaka,M., Wakasugi,T., Hayashida,N.,
Matsubayashi,T., Zaita,N., Chunwongse,J., Obokata,J.,
Yamaguchi-Shinozaki,K., Ohto,C., Torazawa,K., Meng,B.Y., Sugita,M.,
Deno,H., Kamogashira,T., Yamada,K., Kusuda,J., Takaiwa,F., Kato,A.,
Tohdoh,N., Shimada,H. and Sugiura,M.
TITLE The complete nucleotide sequence of tobacco chloroplast genome: Its
gene organization and expression
JOURNAL EMBO J. 5, 2043-2049 (1986)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 155844)
AUTHORS Sugiura,M.
JOURNAL Unpublished (1986) Biology Dept, Nagoya Univ., Nagoya 464, Japan
STANDARD full staff_review
COMMENT The circular tobacco chloroplast DNA sequence is presented in a
linearized form by cutting at the junction between IRA and LSC.
The DNA strand which codes for the large subunit of
ribulose-1,5-bisphosphate carboxylase is designated as A strand and
the complementary strand as B strand. The nucleotide sequence of
the B strand is presented.
Large single copy region (LSC): 1-86684 (86684 bp)
Inverted repeat B (IR-B): 86685-112023 (25339 bp)
Small single copy region (SSC): 112024-130505 (18482 bp)
Inverted repeat A (IR-A): 130506-155844 (25339 bp)
Rps12 consists of three exons. There are two sets of exons 2 and
3. One set is located on the same strand in IR-B 28 kb upstream of
exon 1. The oteher set is located on the opposite strand in IR-A
69 kb downstream of exon 1. The tobacco rps12 gene probably
consists of three transcription units and requires trans-splicing.
The chloroplast DNA segments capable of replication in yeast, ars1
and ars2, are located at positions 112768-113117 and 14570-15088
respectively.
Seven open reading frames, (RF236, RF548, RF862, stop codon to stop
codon) and (ORF151, ORF90, ORF80, ORF134, start codon to stop
codon) are present near the rpoB gene. Four or these ORFs show
some homology to portions of the beta'-subunit sequence of E.coli
RNA polymerase [1].
Most open reading frames indicated in FEATURES are from start codon
to stop codon. The intron boundaries for the ndhA and ndhB are not
known and thus the largest possible intron is indicated (from stop
codon to stop codon).
FEATURES from to/span description
tRNA 80 6 (c) His-tRNA (GUG)
tRNA 4407 4371 (c) Lys-tRNA (UUU), exon 1
1844 1810 (c) Lys-tRNA (UUU), exon 2
tRNA 7487 7416 (c) Gln-tRNA (UUG)
tRNA 8719 8632 (c) Ser-tRNA (GCU)
tRNA 9499 9521 Gly-tRNA (UCC), exon 1
10213 10260 Gly-tRNA (UCC), exon 2
tRNA 10430 10501 Arg-tRNA (UCU)
tRNA 28783 28854 Cys-tRNA (GCA)
tRNA 31999 31926 (c) Asp-tRNA (GUC)
tRNA 32191 32108 (c) Tyr-tRNA (GUA)
tRNA 32323 32251 (c) Glu-tRNA (UUC)
tRNA 33172 33243 Thr-tRNA (GGU)
tRNA 37223 37132 (c) Ser-tRNA (UGA)
tRNA 38050 38120 Gly-tRNA (GCC)
tRNA 38421 38348 (c) fMet-tRNA (CAU)
tRNA 47111 47197 Ser-tRNA (GGA)
tRNA 48577 48505 (c) Thr-tRNA (UGU)
tRNA 49288 49322 Leu-tRNA (UAA), exon 1
49826 49875 Leu-tRNA (UAA), exon 2
tRNA 50232 50304 Phe-tRNA (GAA)
tRNA 54390 54353 (c) Val-tRNA (UAC), exon 1
53781 53747 (c) Val-tRNA (UAC), exon 2
tRNA 54581 54653 Met-tRNA (CAU)
tRNA 68880 68807 (c) Trp-tRNA (CCA)
tRNA 69118 69045 (c) Pro-tRNA (UGG)
tRNA 88770 88697 (c) Ile-tRNA (CAU)
tRNA 96507 96427 (c) Leu-tRNA (CAA)
tRNA 102459 102530 Val-tRNA (GAC)
tRNA 104547 104583 Ile-tRNA (GAU), exon 1
105291 105325 Ile-tRNA (GAU), exon 2
tRNA 105390 105427 Ala-tRNA (UGC), exon 1
106137 106171 Ala-tRNA (UGC), exon 2
tRNA 109973 110046 Arg-tRNA (ACG)
tRNA 110699 110628 (c) Asn-tRNA (GUU)
tRNA 116067 116146 Leu-tRNA (UAG)
tRNA 131830 131901 Asn-tRNA (GUU)
tRNA 132556 132483 (c) Arg-tRNA (ACG)
tRNA 137139 137102 (c) Ala-tRNA (UGC), exon 1
136392 136358 (c) Ala-tRNA (UGC), exon 2
tRNA 137982 137946 (c) Ile-tRNA (GAU), exon 1
137238 137204 (c) Ile-tRNA (GAU), exon 2
tRNA 140070 139999 (c) Val-tRNA (GAC)
tRNA 146022 146102 Leu-tRNA (CAA)
tRNA 153759 153832 Ile-tRNA (CAU)
rRNA 102758 104246 16S rRNA
rRNA 106325 109134 23S rRNA
rRNA 109236 109338 4.5S rRNA
rRNA 109595 109715 5S rRNA
rRNA 132934 132814 (c) 5S rRNA
rRNA 133293 133191 (c) 4.5S rRNA
rRNA 136204 133395 (c) 23S rRNA
rRNA 139771 138283 (c) 16S rRNA
RNA 32347 31836 (c) Asp-tRNA, Tyr-tRNA, Glu-tRNA RNA precursor
pept 1595 534 (c) PSII 32kd protein (psbA)
pept 6211 6172 (c) ribosomal protein S16 (rps16), exon 1
5311 5094 (c) ribosomal protein S16 (rps16), exon 2
pept 12148 10625 (c) ATPase alpha subunit (atpA)
pept 13452 13308 (c) ATPase I subunit (atpF), exon 1
12612 12203 (c) ATPase I subunit (atpF), exon 2
pept 14099 13854 (c) ATPase III subunit (atpH)
pept 16001 15258 (c) ATPase a subunit (atpI)
pept 16938 16228 (c) ribosomal protein S2 (rps2)
pept 27501 24289 (c) RNA polymerase beta subunit (rpoB)
pept 34462 35523 PSII D2 protein (psbD)
pept 35471 36892 PSII 44kd protein (psbC)
pept 38873 38571 (c) ribosomal protein S14 (rps14)
pept 41200 38996 (c) PSI P700 apoprotein A2 (psaB)
pept 43478 41226 (c) PSI P700 apoprotein A1 (psaA)
pept 48133 47528 (c) ribosomal protein S4 (rps4)
pept 55276 54875 (c) ATPase epsilon subunit (atpE)
pept 56769 55273 (c) ATPase beta subunit (atpB)
pept 57587 59020 RuBisCO large subunit (rbcL)
pept 64327 65289 cytochrome f (petA)
pept 66860 66741 (c) PSII component (psbF)
pept 67121 66870 (c) PSII cytochrome b559 (psbE)
pept 70123 70323 ribosomal protein L33 (rpl33)
pept 70510 70815 ribosomal protein S18 (rps18)
pept 71401 71015 (c) ribosomal protein L20 (rpl20)
pept 72326 72213 (c) ribosomal protein S12 A (rps12A), exon 1
100851 100620 (c) ribosomal protein S12 A (rps12A), exon 2
100083 100058 (c) ribosomal protein S12 A (rps12A), exon 3
pept 72326 72213 (c) ribosomal protein S12 B (rps12B), exon 1
141678 141909 ribosomal protein S12 B (rps12B), exon 2
142446 142471 ribosomal protein S12 B (rps12B), exon 3
pept 74950 76476 PSII P680 apoprotein (psbB)
pept 77098 77319 PSII 10kd phosphoprotein (psbF)
pept 77449 77454 cytochrome b6 (petB), exon 1
78208 78849 cytochrome b6 (petB), exon 2
pept 79845 80264 cytochrome b/f complex subunit 4 (petD)
pept 81465 80452 (c) RNA polymerase alpha subunit (rpoA)
pept 81947 81531 (c) ribosomal protein S11 (rps11)
pept 82465 82175 (c) RF96
pept 83004 82600 (c) ribosomal protein S8 (rps8)
pept 83544 83173 (c) ribosomal protein L14 (rpl14)
pept 85093 85085 (c) ribosomal protein L16 (rpl16), exon 1
84064 83669 (c) ribosomal protein L16 (rpl16), exon 2
pept 85896 85240 (c) ribosomal protein S3 (rps3)
pept 86348 85881 (c) ribosomal protein L22 (rpl22)
pept 86680 86402 (c) ribosomal protein S19 (rps19)
pept 88231 87841 (c) ribosomal protein L2 (rpl2), exon 1
87174 86741 (c) ribosomal protein L2 (rpl2), exon 2
pept 88531 88250 (c) ribosomal protein L23 (rpl23)
pept 100004 99537 (c) ribosomal protein S7 (rps7)
pept 125398 125135 (c) ribosomal protein S15 (rps15)
pept 142525 142992 ribosomal protein S7 (rps7)
pept 153998 154279 ribosomal protein L23 (rpl23)
pept 154298 154688 ribosomal protein L2 (rpl2), exon 1
155355 155788 ribosomal protein L2 (rpl2), exon 2
mRNA 1680 441 (c) psbA mRNA (441 +/- 2 bp)
mRNA 57025 54637 (c) atpB, atpE mRNA (alt.; 54637 +/- 1 bp)
mRNA 57025 54676 (c) atpB, atpE mRNA (alt.; 54676 +/- 2 bp)
mRNA 57405 59161 rbcL mRNA
IVS 4370 1845 (c) Lys-tRNA intron
IVS 6171 5312 (c) rps16 intron
IVS 9522 10212 Gly-tRNA intron (no splice consensus)
IVS 13307 12613 (c) atpF intron
IVS 49323 49825 Leu-tRNA intron (no splice consensus)
IVS 54352 53782 (c) Val-tRNA intron
IVS 77455 78207 petB intron (no splice consensus)
IVS 85084 84065 (c) rpl16 intron
IVS 87840 87175 (c) rpl2 intron
IVS 98349 97838 (c) ndhB intron
IVS 72326 72213 (c) rps12A intron A
IVS 141678 141909 rps12A intron B (no splice consensus)
IVS 142446 142471 rps12A intron C (no splice consensus)
IVS 104584 105290 Ile-tRNA intron (no splice consensus)
IVS 105428 106136 Ala-tRNA intron (no splice consensus)
IVS 123219 122140 (c) ndhA intron
IVS 137101 136393 (c) Ala-tRNA intron
IVS 137945 137239 (c) Ile-tRNA intron
IVS 144180 144690 ndhB intron (no splice consensus)
IVS 154689 155354 rpl2 intron (no splice consensus)
rpt 86685 112023 inverted repeat B (IR-B)
rpt 130506 155844 inverted repeat A (IR-A)
site 1 86684 large single copy region (LSC)
site 112024 130505 small single copy region (SSC)
ORF 3658 2129 (c) ORF-509A cds
ORF 7724 8020 ORF98 cds
ORF 19753 17165 (c) RF862 cds
ORF 20277 19873 (c) ORF134 cds
ORF 20423 20181 (c) ORF80 cds
ORF 20646 20374 (c) ORF90 cds
ORF 21475 20765 (c) RF236 cds
ORF 23127 21481 (c) RF548 cds
ORF 24283 23828 (c) ORF151 cds
ORF 37558 37241 (c) ORF105 cds
ORF 37586 37774 ORF62 cds
ORF 44264 44497 ORF77 cds
ORF 45394 45146 (c) ORF82 cds
ORF 46464 46240 (c) ORF74A cds
ORF 48933 49145 ORF70A cds
ORF 51457 50981 (c) ORF158 cds
ORF 52417 51563 (c) bhpB cds
ORF 52659 52297 (c) bhpA cds
ORF 59785 61323 ORF512 cds
ORF 62630 > 62630 ORF184
ORF 63407 64096 ORF229
ORF 66168 66467 ORF99A
ORF 67580 67269 (c) ORF103
ORF 72686 72465 (c) ORF73
ORF 73547 73323 (c) ORF74B
ORF 82162 82049 (c) ORF37
ORF 88883 90628 ORF581
ORF 90598 95724 ORF1708
ORF 95815 96078 ORF87
ORF 146472 96057 (c) ORF115
ORF 96116 96394 ORF92
ORF 96553 96792 ORF79
ORF 98889 98350 (c) ndhB, exon 1
97837 97047 (c) ndhB, exon 2
ORF 140581 101948 (c) ORF131
ORF 102099 102311 ORF70B
ORF 110820 110593 (c) ORF75
ORF 111025 112077 ORF350
ORF 114198 112066 (c) ndhF
ORF 116250 117191 ORF313
ORF 118958 117429 (c) ndhD
ORF 119860 119555 (c) ndhE
ORF 120383 120084 (c) ORF99B
ORF 120612 120196 (c) ORF138
ORF 121512 121009 (c) ORF167
ORF 123840 123217 (c) ndhA, exon 1
122109 121597 (c) ndhA, exon 2
ORF 125023 123842 (c) ORF393
ORF 126482 125796 (c) ORF228
ORF 127561 126740 (c) ORF273
ORF 131501 127767 (c) ORF1244
ORF 131709 131936 ORF75
ORF 140186 140581 ORF131
ORF 140430 140218 (c) ORF70B
ORF 145976 145737 (c) ORF79
ORF 146125 146472 ORF115
ORF 146413 146135 (c) ORF92
ORF 146714 146451 (c) ORF87
ORF 151931 146805 (c) ORF1708
ORF 153646 151901 (c) ORF581
ORF 96404 96057 (c) ORF 115
anticdn 45 43 (c) His-tRNA anticodon gtg
anticdn 4376 4374 (c) Lys-tRNA anticodon ttt
anticdn 7455 7453 (c) Gln-tRNA anticodon ttg
anticdn 8685 8683 (c) Ser-tRNA anticodon gct
anticdn 10222 10224 Gly-tRNA anticodon tcc
anticdn 10463 10465 Arg-tRNA anticodon tct
anticdn 28815 28817 Cys-tRNA anticodon gca
anticdn 31965 31963 (c) Asp-tRNA anticodon gtc
anticdn 32157 32155 (c) Tyr-tRNA anticodon gta
anticdn 32289 32287 (c) Glu-tRNA anticodon ttc
anticdn 33204 33206 Thr-tRNA anticodon ggt
anticdn 37189 37187 (c) Ser-tRNA anticodon tga
anticdn 38082 38084 Gly-tRNA anticodon gcc
anticdn 38387 38385 (c) fMet-tRNA anticodon cat
anticdn 47145 47147 Ser-tRNA anticodon gga
anticdn 48544 48542 (c) Thr-tRNA anticodon tgt
anticdn 49841 49843 Leu-tRNA anticodon taa
anticdn 50265 50267 Phe-tRNA anticodon gaa
anticdn 54357 54355 (c) Val-tRNA anticodon tac
anticdn 54614 54616 Met-tRNA anticodon cat
anticdn 68846 68844 (c) Trp-tRNA anticodon cca
anticdn 69084 69082 (c) Pro-tRNA anticodon tgg
anticdn 88737 88735 (c) Ile-tRNA anticodon cat
anticdn 96474 96472 (c) Leu-tRNA anticodon caa
anticdn 102491 102493 Val-tRNA anticodon gac
anticdn 104579 104581 Ile-tRNA anticodon gat
anticdn 105423 105425 Ala-tRNA anticodon tgc
anticdn 110007 110009 Arg-tRNA anticodon acg
anticdn 110667 110665 (c) Asn-tRNA anticodon gtt
anticdn 116101 116103 Leu-tRNA anticodon tag
anticdn 131862 131864 Asn-tRNA anticodon gtt
anticdn 132522 132520 (c) Arg-tRNA anticodon acg
anticdn 137106 137104 (c) Ala-tRNA anticodon tgc
anticdn 137950 137948 (c) Ile-tRNA anticodon gat
anticdn 140038 140036 (c) Val-tRNA anticodon gac
anticdn 146055 146057 Leu-tRNA anticodon caa
anticdn 153792 153794 Ile-tRNA anticodon cat
BASE COUNT 47824 a 29991 c 28992 g 49037 t
ORIGIN 2692 bp upstream of BamHI site.
1 ttatgggcga acgacgggaa ttgaacccgc gcatggtgga ttcacaatcc actgccttga
61 tccacttggc tacatccgcc ccctcgccta cttacattcc gtttttacat tatttaaatt
121 agaaaacaaa agattcaagt tcgaatatag ctcttctttc ttatttcaat gatattatta
181 tttcaaagat aagagatatt caaagataag agataagaag aagtcaaaat ttgatttttt
241 ttttggaaaa aaaaaatcaa aaagatatag taacattagc aagaagagaa acaagttcta
301 tttcacaatt taaacaaata caaaatcaaa atagaatact caatcatgaa taaatgcaag
361 aaaataacct ctccttcttt ttctataatg taaacaaaaa agtctatgta agtaaaatac
421 tagtaaataa ataaaaagaa aaaaagaaag gagcaatagc accctcttga tagaacaaga
481 aaatgattat tgctcctttc ttttcaaaac ctcctataga ctaggccagg atcttatcca
541 tttgtagatg gagcttcgat agcagctagg tctagaggga agttgtgagc attacgttca
601 tgcataactt ccataccaag gttagcacgg ttaatgatat cagcccaagt attaattaca
661 cggccttgac tgtcaactac agattggttg aaattgaaac catttaggtt gaaagccata
721 gtgctgatac ctaaagcggt aaaccagata cctactacag gccaagcagc taggaagaag
781 tgtaacgaac gagagttgtt gaaactagca tattggaaga tcaatcggcc aaaataacca
841 tgagcggcta cgatgttata agtttcttcc tcttgaccga atctgtaacc ttcattagca
901 gattcatttt ctgtggtttc cctgatcaaa ctagaagtta ccaaggaacc atgcatagca
961 ctgaataggg agccgccgaa tacaccagct acgcctaaca tgtgaaatgg gtgcataagg
1021 atgttgtgct cagcctggaa tacaatcatg aaattgaaag taccagagat tcctagaggc
1081 ataccatcag aaaaacttcc ttgaccaatt gggtagatca agaaaactgc ggtagcagct
1141 gcaacaggag ctgaatatgc aacagcaatc caaggtcgca tacccagacg gaaactaagc
1201 tcccactcac gacccatgta acaagctacg ccaagtaaga agtgtagaac aattagttca
1261 taaggaccac cgttgtataa ccattcatca acggatgccg cttcccagat tgggtaaaaa
1321 tgtaaaccta tagctgcaga agtaggaata atggcaccgg aaataatatt gtttccgtaa
1381 agtagagacc ctgaaacagg ttcacgaata ccatcaatgt ctactggagg agcagcaatg
1441 aaggcaataa taaatacaga agttgccgtc aataaggtag ggatcatcaa aacaccaaac
1501 catccaatgt aaagacggtt ttcagtgcta gttatccagt tacagaagcg accccatagg
1561 ctttcgcttt cgcgtctctc taaaattgca gtcatggtaa aatcttggtt tatttaatca
1621 tcagggactc ccaagcacac tagttttcta caaatcaaaa tagaaaatgg aaggcttttt
1681 attcaacagt ataacatgac ttatatactc gtgtcaacca aggtgtatgt agatctattc
1741 aaatttttaa tgaagttgat tggaaaaata cggacttctc tacagaaaat tagaatttcg
1801 atatgctagt gggttgcccg ggattcgaac ccggaactag tcggatggag tagataagtt
1861 ccttgttaaa taaaataaat gttaatctta aattaaataa acaagtaaag acccctcccc
1921 aagccgtgct tgcatttttc attgcacacg gctttcccta tgtatacatc agttcctttc
1981 ttatagaaat tagaaagact ttaaaaagtt gaatactcag ttgatttacc ccttaattac
2041 tattacaatc aacatttcag aatagtgaaa tttttttatc tcttcatcat ttagaaacaa
2101 atttccattt agaaaatcta agaatgaatc attgataatt cgccagatca ttgatacaaa
2161 aaatatccaa ataccaaatc cgacttctat atactcccca caaactagaa gaagctcgtg
2221 ggaaggtcaa agaaagaact tgttcttccg acgttaagaa ttcttccaat aattccgagc
2281 ccgatctttt caaaaaagtg cgtacagtac ttttgtgttt ccgagctaaa gttctagcac
2341 aagaaagtcg aagtatatac tttattcgat ataaagtctt ttttttggaa gatccgctat
2401 aataatgaaa aagatttctg catatacgcc caaatcggtc aataatatca gaatctgata
2461 aatcggacca aaccggttta ctaatgggat gccctaatac ggtacaaaag tttgctttag
2521 ctaatgatcc aatcaaagga ataattggaa caagggtatc gaacttctta attgcattat
2581 tgattagaaa tgaattttct aacatttgac tacgtaccat tgaaggattt agtcgcacac
2641 ttgaaagata gcccataaag tcacgggaat gattggataa ttggtttata tggatccttc
2701 ctgtgtgaaa gcacagagaa caatgacatt gccaaaaatt gacaaggtaa aatttccatt
2761 tattcatcaa aagaaacgtc ccttttgaag ccagaatgga ttttccttga tacctaacat
2821 aatgcatgaa aggatccttg aataaccata gggtaacctg aaaatcctta gcaaagactt
2881 ctacaagacg ttctattttt ccatagaaat atattcgttc aagaagggct ccaaaagatg
2941 ttgatcgtaa atgagaagat tggttccgta gaaagacgaa agtggattcg cattcatata
3001 cataagaatt atataagaag aagaagaatc tttgattttt ttttgaaaag gagtaaccgg
3061 gcttctttga agtaataaga ctattcaaat tccaaaattc atggagaaag aatcgtaata
3121 aatgtaaaga agaggcatct tttacccaat agcgaagagt ttgaaccaag atttccagat
3181 gaacagggta gggtattagt atatctaata cataatttag atgtgaaaaa ttgtcctcta
3241 aaaaaggaaa tgttgaatga attgatcgta aattataaga tttaaaaatc tttttgcctt
3301 cgaaagaaga taaagaagat attaatcgta gagaaaacgg aatttccaca ataaacgcaa
3361 atccctctga tatcatttga gaatacaaac tcttgttgca ccccaaaaat gaatttttgt
3421 tagaatcatt agtagaaata agaaaatgat tctgttgata cattcgagta attaaacgtt
3481 tcacaattag gaaacttaat ttattgttat aacctggatt ttccaacaaa atcgacctat
3541 ttctatttaa accatgatca tgagcaagtg cataaatata ctcctgaaag ataagtggat
3601 ataggaagtt gtgttgttgc gatctatctg gctgtaaata tctttggatt tcttccattt
3661 gaaattcgat ttgaaccaaa gacggaagat tttgagggtt atcaaatgat acatagtgcg
3721 atacagttaa aacaaagtat tttagtaaga atagatacct tggatacagg taaacttctc
3781 aacggattct ctatcatctc tttttttttt cgtttcgttt aattggtcta tgttatagtg
3841 ttataggata ataagatggt tagaaatcct ttattttttc aacctaatcg ctcttttgac
3901 ttcggaaaaa actttcttta tcaatatact gtttcttcta cacacacatc tccgtaatag
3961 aaaatggtaa tagttaggat tcattaaaaa aatggagaat ccactcatgg gacaagaaac
4021 ccttcccgca tcaggcacta ataaattttt aacgtctaat tagatcggga atcattcaaa
4081 ttaagaacaa aagctcgttg ctttttcttt ctttccctat aatttaattg aagccgcagc
4141 cctatccatt tattcattcg acccaacttt attttgttcc gttccaagaa ttctaacacg
4201 gttttatacc catctaggaa caatgaaata ttctcagaac tttccgttga tacgacatgc
4261 tatttttacc attcattccc tttcaggatc agtcgtggtc ttccaaactt taccgagagt
4321 atggacgaat ccctcacttc atccatatgt gtaaaagaga ctagccgcac ttaaaagccg
4381 agtactctac cgttgagtta gcaacccgaa gaaaatatcg aagaaaaata aataaagaga
4441 ttagacaaga caaccaaaaa ccattgaagg aataaatcta aaaaaataca ttcacatttt
4501 cgaattaatt taaaacataa aataaaacta aatagatcca cttcatttat cacaatgaat
4561 tatatttgtt cgatacactc tgttgtcata aatattgaat agtgaaaaaa aaaaaagaaa
4621 tttcaattga caacaataaa aaatattaaa aaaaaggact tgtgttagat tggcactaca
4681 aatctaatcc aaataaaata gatacaaaaa agtatagatg agagaataaa ttaagtggaa
4741 aacaaaacta caatttattt agatttattt aatccataat ggattcaatc aagttaagtg
4801 agataagcaa acttgatttc ttttttagtt ttagtagagt tccaatgaaa aacggaaaac
4861 cacccaattg aaggaaatgc ccgaattttc tatttcgagg atcaataaaa taagaggttt
4921 tgtcgttata gaacacggaa ttcaatggaa gcaatgataa aaaaatacaa atagaaaagg
4981 aaagggagga aatacaaaaa aatagaagag aaaagtcata caaagttata tacaaatgac
5041 tacccccctt tttgtatttc cttaatttat ttccttaatt gaatttcggt tgattaggat
5101 tgattaggac gaagttcctt aaaaacctcc gccttcttta aaatatcctg aacagttcct
5161 gtaggttgag cccctttttc aaggaaatat aaaatagcag gaacatttaa ataagtttga
5221 ttctttatcg gatcataaaa acccactttc cgaagatctt ttccttctct tcgggatcga
5281 acatcaattg caacgattcg atagacggct cattgggatt gatgtagatg aacaacaccc
5341 cccctagaaa cgtataggaa gctttctcct cgtacggctc gagaaaaatg attgattcga
5401 ggttttatct ctgtatggaa ttctatctaa gaaatgacaa ctgggtccat aaaatgatca
5461 aatcaattaa agatgtaagt cttttttttt cttctttctt cctgaaaatg aaaaagaaac
5521 cattcgtact ctcataactc aagttggata actttcaaac agttcaaagg aaaatctttc
5581 ggcaatttca tttattgagc ggtctttcct ccttttatgt ttgtctcgtt taaaatggat
5641 ttggattctt cagttcgatc cagttattaa gacaataaaa aaggtgtttc cttgttctgg
5701 gatcctttat ctttgtttta ttttaaatca ttgggtttag acattacttc ggtgcttttt
5761 aatcctttca aaatggcagc aacatacccc ttttgcgatt tctatgaaag aatcctacag
5821 acgatggatt cccgcgtgaa acactttgga tcgaaaagtt tgaatcaatt ccaaggaatt
5881 tttgaattgg aaacttgctc gaattggatt ctttcgattt ccataccgaa aatatattta
5941 cgaagttgtt ccaatttttt tattgattgg cattaaccct agactcttgc cccgagaaat
6001 aaattaatac tttctactcg agctccatca tggactattt acattccaag acaacaaaaa
6061 agaggggttc taatgaaaca gaaccaatga tgtcgagcca agagcacctt cattcctaca
6121 taaaatggtg gatgtacaaa tccacaacgg atcctgtcct tcaagtcgca cgttgctttc
6181 taccacatcg tttcaaacga agttttacca taacattcct ctaagaaccg gtctggaatt
6241 gattcaatta tggaatcatg aatagtcatt ggttgggctg atgtataaac accataatct
6301 atactttgtt ctatatctat atactataga gataggtgga taaatatttt tctttagtaa
6361 gaccccatcg ctaatattaa tttatctaac atattaatta atatttaata tataaatata
6421 tatagaaata ataataaata agaataataa taaataagac gaataaatga gttctttttg
6481 attctgcatc ttcacgtgac tcaataggag agattgacct atttcagact tcttcaaata
6541 gcaaagattc cgcttataag gaatgattaa aactatttat atttctaaat ttagaaagtt
6601 cccttttcga catcattatt tgaagaaaat ttgatagtta aagatcactt ttgatcatct
6661 taggaaagaa aaaagataag tctttctttt ttaattgaat catcaacgat ttcaatgatc
6721 taaaatagat aaatacacca aacaacaaat ccaatttttt tttatgagat ggataaaaaa
6781 agattaatat aaggtaagat tttcattctt attctttttt ttttttttca tctgattgat
6841 aaaatccaaa gaatggggag ggtttcgtat ctatcaattc gatcaaatag actgagcaat
6901 tgtcaccgtt tatagatatt gaaatgaatg ccttcccatt actgattaac tcctatctac
6961 cccattctat gggcctgatg cagcataaat caaaagaaaa gaggggggtg tcctagtctt
7021 tttgattttt acgaaatgcg agctgtctag gcacaaagcc aaacaagtcc agattaagtc
7081 aagtttttgc tcctattttt tgatatttta gcctaactca ttgattaaga attaagagac
7141 ttagtgaatt taattagtac caaaaatccc ctcttggcga aaagtcaaga aatccacaaa
7201 aaagaaaatg gaatctaatt aggctaattt aggggataga gaatacgaga tagggaatat
7261 agattctttc gcatctcgat tccgtttttg aaaaaaaaaa atgattcatc gaagaaaaaa
7321 atcagaaaca acaatcacat tccagctaac atttcgattt taaacagaac attgttaaaa
7381 aagcaatcta tattctcata gaatatatat atgttctggg acggaaggat tcgaacctcc
7441 gaatagcggg accaaaaccc gttgccttac cacttggcca cgccccattt agatttctat
7501 tcgatactaa gaaagtatat tgcttgtttt gtttgtttgt caactctagt ccaaatatct
7561 atagaataga ttagattggt actaggattt tgcgatgttt ttggtatgtg tagatataga
7621 attcaactta atttattgat cattacatat aattcaatta agatattgta tgaaaatatg
7681 attttttcga ttctcctttg agaaaaggag gatttttgat tgggtgggtt caaagaaaaa
7741 gaagtatttt ttgtttacct tacttacttt ccctttcctt atatcaataa cgcaatcaaa
7801 atgcaattat ctctccaaga acaaaaagtc tgttatgctt aataccttta gtttgatcgg
7861 tatctgtctt aattcgaccc ttttttcgag tagttttttc ttcggcaaat tgcccgaggc
7921 ctatgctttt ttgaatccaa tcgtagatat tatgccagtc atacctctgt ttttttttct
7981 cttagccttt gtttggcaag ctgctgtaag ttttcgatga gatccttaat aatatcctag
8041 aaaattcatg atttattcga gaaaaattct aaaataaata aaatcagata agctttaccg
8101 tttgaaacct cgattcaaac attgaaattc ttggatagtc acgagaaatc cggcttaact
8161 tatttcctta ttttttgacg ctttcccttc cagtgaaaga ccttattagg ctcctcacaa
8221 tacctaattg tgtatataaa aaaattttgg ttaatgacaa actcttagta gaaaagaatt
8281 tatgaaaatt cttttagaga aagagcttca ttgcttggtg tcaaactagg atatgcggta
8341 gaaaaatgga tgatctattc tctttttttc aaaaaaaatc atcttggaga ttgtgtaatg
8401 cttactctca aactcttcgt ttacacagta gtgatatttt ttgtttctct cttcatcttt
8461 ggattcctat ctaatgatcc cggacgtaat cctggacgtg aagaataaaa taaaaaaggt
8521 ttttccttgc ttgattttcc aattttctta tgatttggtc tattccacac atttaactaa
8581 gaataagaac aaaggatttc gaaatttgaa aaaaaaaaat caagtcatca acggaaagag
8641 agggattcga accctcggta cgattaactc gtacaacgga ttagcaatcc gccgctttag
8701 tccactcagc catctctccc aattgaaaaa gataattact acatgagata gcacataaga
8761 taaaggaaag aatctttctt tctctctttt cttctttcta tattatatag atatgtacaa
8821 cttttatcat caatttcctt tatttcttta tctaaagtaa aggaagggct cagaagagcc
8881 aagaatatca agaaaaataa agaagacctc ttttctttgt cttgattttg ttcgaaagga
8941 ccctcttatt ctcatggcct ggtctggtca gtacccagcc gggcctcttt tgttccaacg
9001 aatttgaatt tgaaaactaa aaagcctgtt atagttgtaa tatttcattt taattgaata
9061 gttaatattc aagcaacaag aaaaaattcc cattttttgt aaaagtaaaa taaaatatat
9121 aaaatagaaa attcgatcaa aataaaagtc tcatttctct ttctgctttt ttattttatg
9181 tttaccacct tactggacta aaaaaaagaa gctttcgagt attccacaat gcatttttat
9241 gttatgattt tagtggtttt gacgagccgt atctctatca aaactcctcc agcaaaagaa
9301 aagataaaac taaattctgt aatttagtta tttaaatgaa ccctcgtttc caaatctcat
9361 caaattggaa tccccccagg aaaaaagatc aacactctaa tttggatgat tctgtgacga
9421 ccctatctta tcctatcttg attaccacaa ttcccctgtt cgacaaaagt tgcatttgta
9481 tacaataatc ggattgtagc gggtatagtt tagtggtaaa agtgtgattc gttctattat
9541 cccttaaata gttaaagggt ccttcggttt gattcgtatt ccgatcaaaa acttgatttc
9601 taaaaaggat ttaatccttt tcctctcaat gacagattcg agaacaaata cacattctcg
9661 tgatttgtat ccaagggtca cttagacatt gaaaaattgg attatgaaat tgcgaaacat
9721 aattttggaa ttggatcaat acttccaatt gaataagtat gaataaagga tccatggatg
9781 aagatagaaa gttgatttct aatcgtaact aaatcttcaa tttcttattt gtaaagaaga
9841 aattgaagca aaatagctat taaacgatga ctttggttta ctagagacat caacatattg
9901 ttttagctcg gtggaaacaa aacccttttc ctcaggatcc tattaaatag aaatagagaa
9961 cgaaataact agaaaggttg ttagaatccc ctcttctaga aggatcatct acaaagctat
10021 tcgttttatc tgtattcaga ccaaaagctg acatagatgt tatgggtaga attctttttt
10081 tttttcgaat tttgttcaca tcttagatct ataaattgac tcatctccat aaaggagccg
10141 aatgaaacca aagtttcatg ttcggttttg aattagagac gttaaaaata atgaatcgtc
10201 gtcgactata acccctagcc ttccaagcta acgatgcggg ttcgattccc gctacccgct
10261 ctatatctat ttattctaaa tattttaatg tattcattaa atcaaattta gtttattagt
10321 attagtacat cattgaatat acaattccaa aaattctttc acatccgatt ctttctgttt
10381 tttttttcaa acaaaaagtt aaaatacgaa aaaaaaatca gaatgaaaag cgtccattgt
10441 ctaatggata ggacagaggt cttctaaacc tttggtatag gttcaaatcc tattggacgc
10501 aatttatttc catatatatt tttttttaga tttcgatagc aagaaagact gtttgaatat
10561 ttgaatccaa gacgcttgat tccttttttt tattaagatt aagacaaaag tgatcaatat
10621 ttctttatgc ttgttcctga agtataaaac ggtccatttg ttcctgaata gcttctttca
10681 aaagggcttc tgcttcctcg gtaaatgtct tggtagaaga tatgatttct tggaactgag
10741 gtttattagt ttttaagtaa gtacgtagct caacaagaaa tttccttacc tgtccaactt
10801 ctaatgaatc aagatagccg tttgttccgg tataaatagt cattatctgc tcttctaccg
10861 tgagaggagc tgattgggat tgtttaagca attcacgtaa tcgttgacct cttgccaatt
10921 gattctgagt agctttatcg agatcagaag caaattgtgc aaaggcttct aattctgcga
10981 attgtgctag ttctaatttt aatttaccag ctacttgttt catggctttt atttgagctg
11041 cggaccccac tctggaaacg gagataccca cattaatagc aggtctgatt ccagaattga
11101 ataggtcggc ggataagaag atttgtccat cagtaatgga aattacatta gtaggaatat
11161 aagccgaaac atctcccgat tgggtttcaa ctattggtaa ggcggtcata cttccttcac
11221 ctaaactaga acttaattta gcggctcttt ccaaaaggcg tgaatgcaaa taaaaaacat
11281 ctcctagata agcttcacga ccgggcggtc ttcgtaatag aagagacatt tggcgataag
11341 cttgcgcttg tttggaggga tcatcataaa tgattaaagt gtgtcgttca cgatacataa
11401 aatattcagc cagagctgct cctgtataag gagcaaggta ttgtaatgta gcaggggaat
11461 ctgccgtttc ggctaccaca atagtgtatt ccatcgctcc cctttcctgt aaagtagtta
11521 cgacctgggc cacagaagat gctttttgcc caatagctac ataaacacat attacatttt
11581 gaccttgttg attgaggatc gtatctgtgg ctactgctgt tttaccggtc tgtctgtccc
11641 caataattaa ttctcgctga ccacgtccta tagggatcat cgaatcaata gcaataagcc
11701 cggtttgaag aggctcatat acggaacggc gcgaaataat acccggggcg gcagattcga
11761 ttaatcgaaa ttcagaagct gaaatttcac ctctaccatc aataggttta gccagggcat
11821 ttataacacg acccaaataa gcctcactca cgggtatctg agcaattctt cccgttgctt
11881 ttacagaact tccttcttgt atcaataaac catcgcccat taatacaaca ccaacattat
11941 ttgattccaa attcagagca atgcctattg taccctcttc aaattcgact aattcacccg
12001 ccattacttc atcaagaccg tgaatacgag caatgccgtc gcctacttga agtacggtac
12061 cggtatttac aatctttact tctctattat attgttcaat acgttcacgg ataatattac
12121 taatttcgtc agctcgaatg gttaccatga ttctttcttt attatttttt gaaagaaaaa
12181 aataatacct acagtagaaa gactaatcag ttatttcttt cattgttccc aacatgccaa
12241 tattggacct aatggtacgt aaatgtaact cgttgttcaa acaactattc agagttccta
12301 gagctcctcg taaggcttgt tggaaaaccc gttgtcggac ttgattaatc gccctttgct
12361 gttcaaactg aatcgtttcg tttttgtaat tttctaattg ttccaaagtc ttataagttg
12421 aattaatcaa attcaatttt tctcgttcta tttcagagta tccattcact cgaaactgct
12481 cggcttcgct ttcgactttc cgtaagcgag aacgagcttt ttcgagttgt tcaatagccc
12541 ctccacgcag ttcttctgaa tttcgaatag tattcaagat cctctgtttt cgattatcta
12601 ataaatcact taatgaaagt agattatttt tccattcctt tccaaaattc cataatccct
12661 tcccgaacca aacatgaatc tttcgattca tttggctctc acgctcaatt acttaaggta
12721 aattctcata tcttttttta tgaatgtaat gagcctatct tctcttcttt gttcatattc
12781 caaaaagata tcgaaactaa tgtaatacca aaatattcgg aggactcttc tgacaaaata
12841 aaaaatatgt aattgtcagc aaagttgttt cttttttttt ttttcaaatc caaaaagctc
12901 ttcttactta gaataggtcg tcgattcagc attagataaa gggggtaaaa tccccgtttt
12961 tacaatttac aataagcggt tcaaatcatt ttatcaatat gagtatccta tatcgataaa
13021 atatttattt tgaaaccacc tctatattaa catagtggta gaaagagtac catgctgcgt
13081 ctagacttca aacagtttgt tttaaccatg ttaatagttc cacattattg gttaatagag
13141 aatcaaaatt gatttaccaa tgaatcgcga aatgctatgg ttcttacata taatttctga
13201 atttattcag aagtaattcg cgagatcatg cacctctctt tcctagttat aacggaaaag
13261 ggtacagctg ggtggtccag cctattcttg aaataaacaa ctcgcacaca ctccctttcc
13321 aaaaaaaatc aatacaccaa gcactacact tagatttatt ggatttgttg ctaaaatatc
13381 ggtattaaac ccgaaactcc cggcagatgg ccagtggccc aaagaaacga aagaatcggt
13441 tacgtttttc atatgatctc ctcttataga tagactaaaa aatcgaacag agttcttttt
13501 gtagcacttc gcccctcttt ttatttattc ttttattttt tctgaaattg agtcaaaaaa
13561 taaaaaatat tcgagttagt tataaattat gaactaacga actagccctt ttattggtta
13621 ttggaacact aacacttact aaaaagagtt tcccttggtc tatgaacggg aaggatgaaa
13681 gcgagtcagt atgctaattc ctcatccgca aatcagccct tcccgtaggt tcttttctca
13741 aagaataaag aattggagga gggaaatctt gatagaattt gaaaaagcaa acgacaagtc
13801 gaaggcaata aaatatgaaa aatgtattta tttttcatat ttctaagcta agattaaaca
13861 aaaggattcg caaataaaag tgctaatgct acaaccagtc cataaattgt taaagcttcc
13921 ataaaagcta gactaagcaa tagcgtacct cgtatttttc cctctgcctc aggctgtctc
13981 gcgataccct ctacagcttg acccgcagca gtcccttgac caactccggg tccaatagaa
14041 gcaagcccta cggccaatcc agccgcaata acggaagcgg cagaaatcag tggattcatg
14101 ataagttcct cgtaccaaaa aaaagaaatg gttaacgata caatcaacca atgagttatg
14161 acttaattat tccctcgcta ggaatcatcc agtcgaagta actaagaact tcggattgaa
14221 gtaataagat tattgaatca tcagaactac ttcgatatat cttttttact ttttagccac
14281 agagtctttg tgaacccata cgactttcgt tcttccattt cttggttcga actgttagtt
14341 gaattatttc ttgatttcat ccgtttattc attcaattca cagtcacaag gggccggaag
14401 gacttctagt ctattagaat cccctagagt agtaaaatta tatctttagt tcatttcata
14461 tataactagc actagtcaat atctaatatc acatatacat gtctttcttc cataacgtaa
14521 accaagcatt catcttagat tcaatcctat tcgagaatca agcgtcgaaa catctagaag
14581 ggttggctta tagttattca attacagata cctccctctc ctaaccgacc ctttctaaaa
14641 tactcaaaaa aatccctttt ttgtaaattc ttttgaacct taccttttct tattattcca
14701 cctagataaa tctaaatgga caaattgatt aggccgaata attccatatg tatagaaata
14761 tcattatttg attgatctaa gttcatgcaa tttattaata aaaatgaata atttatttat
14821 taattattaa tattttggtc aatcgttgaa taaaatcaac tgaaagggaa atcgtttcgc
14881 cctttttaat ttaatttaat tacacgtcgt aaacctatac aacaagaatt ataattattg
14941 acaaaaattc ttatattcaa attgttttaa caatgaatta ataatgagat ggactaagca
15001 atctaaagtg aatattcatt gagacgaagt atgatattaa gtgaaggaaa ggggaatttt
15061 aggaaaaaga tctttttttt ttagatcttt ttccccttac tctttaatat catcgtaatt
15121 tttttgctat cactctagat cgtatataaa atagttgtat atttagattc ccctattcta
15181 ttccctaagt taagtaattc tcttgagcca cccaccatat ttatacattg ctttgggcta
15241 agctaaataa gactatttca atgatggccc tccatggatt cacctatata agccgcggct
15301 aaagttgcaa aaataagagc ttgaatacca cttgtaaata atccaaggag catgacaggt
15361 ataggaacta ctaaaggtac taaagaaaca agaacaacaa ctactaattc atcagctaag
15421 atattcccga aaagtcgaaa actaagtgat aaaggttttg tgaaatcttc taagatgtta
15481 atgggtaaaa ggattggggt tggttgaata tattttccga aataacctaa tccttttttt
15541 gtaagacccg catagaaata tgccactgat gtgagtaaag ccaaagcaac agtagtattt
15601 atatcattcg tgggtgcggc taactcccca tgaggtaatt gtatgatttt ccaaggtaaa
15661 agagctcctg accaattaga aacaaaaata aataaaaaca tagtgccaat aaaaggaacc
15721 cagggcccat attcttcgcc aatttgagtt ttactcacat ctcgaataaa ttcaagaaca
15781 tattcgaaga aattctgacc cccggtcgga atggtttgtg ggttccgaac agctatagtg
15841 gctgaaccta ataagatagc aattacaacc caagaagtaa taagtacttg gccatgtact
15901 tggaaacccc ctatttgcca atagaaatgt tggcctactt ccacaccgga tatatcgtat
15961 aaccccttta gagtattgat ggaacatgat agaacattca tattgccttg ccctctgaaa
16021 aaattgaact ttaaacaaaa ttttttgatt caaccatctc tttgtctact tgaatcggat
16081 attttgaata ccaactaaga tttagaatac taataaatca cataatatcc ccagctattt
16141 ttatctcttt tttgaaattc agaaatagta agcgattcca taagggattt ctgaagtaag
16201 ttatttatct tattatgtta ttattaatca aggatttctt atatagctag aacgaccctc
16261 acaaattgcg aatactaatt tgttaagaat taatcggatt gaggatatgg cgtcatcatt
16321 cgctggaatt gaaatatctg cgagatcggg gtcacaattt gtatcggtta aacaaattgt
16381 tggaattcct aaagtaatac actctcgcag ggccgtatat tcttcgtgct gatcaacgat
16441 gattacaata tcgggtaccc ctgtcatata tttaatcccg cccagatatg tttgcaagcg
16501 agataattgt cttttcaaca tagcagcatc tcttttcggg agacggttga gtctccctgt
16561 tttttgttcc attctcaagt ccctgaactt atgaagtctt gtttcggtag tggaccaatt
16621 cgttaacata ccgccaagcc attttttatt aacataatga caccgggccc ttattgcagc
16681 ccactctact gaatcagctg ctttattttt ggtaccaaca attaagaatt gttttcccct
16741 acttgctgcg tcaaaaacta aatcacaagc ttctgataaa aaacgagcag ttctagtaag
16801 atttgtaata tgaataccct tacgctttgc agaaatataa ggcgccattt taggattcca
16861 tttcctagta ccatgaccaa aatgaactcc tgcctccatc atctcttcca aatttatgtt
16921 ccaatatctt cttgtcattt ctctccacac cccccctttt ttttttattc tttttcaaaa
16981 aaaaaaaaag agacgaggaa ccctgaactg aaataaataa ttgttccgat ggaaccttct
17041 cttctaccgt agattggacg tagatacacg acccaaacca ttattctttt ctattcatta
17101 ttctttttat taccaaagca aataaccata ccaaatgcag atagcgaaag agatgaatcc
17161 gttgttagga atcattaaat cctataaacg attgttcggg tatatcgtgg aaattttttg
17221 aaagacaaga atcaaataat tttttgtggt ggaacaaaat atctctcatc tccccctcga
17281 atagattctt tttttttgtt tccaaaggaa tgttgttatg ttgttttgaa gggtgcacta
17341 atcccttgaa tccggtacca acgggtatca ccccccccaa aacaacgttc tctttcaggc
17401 ctttcaacca atcgatacga ccccggagag ccgcttttgc taaaactcga gcagtttctt
17461 gaaaactcgc ttcagatatg aaactttgag tattgagaga tgctcttgtt attcccaata
17521 agacgactcg gtaacagatc gcttcttcca aagcgcgccc cattcgttct gctcgcaaca
17581 atccaataag ttctccgggt gaaaaaacat tagacattcc atcttctgaa accaacactt
17641 ttgatgttat ttgacgtaca ataatttcta gatgcctatt atgaatctgc accccctggg
17701 agcgataaac ctgttggatt ttattaacca aagagattcg gctttgcgct atagttagct
17761 cagcaccaat caagaatccc caaggaattc caagaattct tgttatacat ttgttccaac
17821 cctcaatcct cttttctaga ttcatggata ttgaatcaac cgaacgcact tctaacacct
17881 gttctacttt tggaagacct tgtgttatat caccagatct cgatttttca tatataaatg
17941 taactaatgt atctccttcg taaagggttt ccccataatg gccatgaaca gttgctccgg
18001 gggtggccaa ataaggctta gctgatcgta tcactatcga atcaacttga acaagtataa
18061 cttgacccga tttgaggggc ggtccatttt tggctataca tacattttca caaataaact
18121 gtccaagact aattatttta gatgtctctt cacaataatt gtgatggaga aaataccaat
18181 tcaaattgaa tggatttaaa ataatgttac gacacggatc gggattaaaa atttttccat
18241 tttcatccat taaataatat ttaaatttaa tcacttgaaa agtctgtttt aaattgtcaa
18301 gttgcaaata gttagttact aagatctgat tatgagttat taaatggtaa gatgaataaa
18361 aattctcaat tggaagggat gttcctaaag ggcccaatga attcctaatt ggaattaggg
18421 gatctttttt aattgatttt tttatcacac tgtgatattt tacatccttg aatggcccca
18481 ttcgagaaca attggctgct gacaaaatta tcaacgactg acattcctta tttctattta
18541 acaacgtatg aatagttcct tgaggttggt taatagattg ttgaattttt gccttggaat
18601 aggaataaat ggaagaaaag gggttgatat tggtacaatc tgatccatta tcagagagca
18661 atcctgaccc cgacggatca ttcctttttc cgatatacga aataggggat ttcactaagt
18721 tgattcttag gaaatgtcga atcaaaccat ttgtccttat ttcaacaaaa gaagcacggg
18781 cttcttcgca agaagaactt tttttgtctt ggttccaatt taatactaaa caagtccgaa
18841 ctaattgaat acttgtgtca gaaattcctc gaatcggttt gccatttcca taaaggatat
18901 aattgacaat tcgaagttgc acattatccc tttcctgcaa tggatccggt ggaaaaaggg
18961 ttgctaaatt tataccgtcc gttatttcat atgtgacgac aggtcgaact aaaacaaaaa
19021 actttttctt gctaggtgta atccgttgga catagatcca atttttcact tttttggatt
19081 ccttggaatt tctttttcct gttcctggtg gtatcaaaac gccggtatgt cgggatatct
19141 tatctgtctc tccaggaaaa tggatatctc cagaaaagat tttaagttca attcgttttt
19201 tttttctctc cacccggacc aacccaccga ctcggcttct tagatttaag gtgatttgtg
19261 tatctacccc aacgatacta ttgttccgta ccattatgga agaagatccg ggcaagatat
19321 gcacctcttc aggaatgaaa aaaaatcgat ctactttcat ttggtatttt ggcctaaatt
19381 ccttgactcc tcgatactca agcaaatcct cttttttgat gactgaatgc gtttctacag
19441 tcccatattt aataatgccc gaactctttc ttctgtatcg aggatcatcg aaataagcaa
19501 gaatactatt tcgacggaaa ataccattta cggggatttc aatcgagata cctgaacagg
19561 gcattagttc attctcgagt tcttgaatcg agtgtagtgg aatgatgaat ttatttcttc
19621 gcctttttga caataaatca gaattctcgt ggagaatagg cgaatatacg agattatact
19681 gaccagcaca tataattcga ttaaggtctg aataatcagg aatcctatct tcttttttac
19741 cagaaaaatc gaactaaata atttctgcct cgcctgatcg ttggttactc gagaggttag
19801 aagtatatct tcgcttgcca gaaagaaaat gcgcattcat ttgatcctga tccttgtgga
19861 tcgaaaggta gactagactg gacctcgagg ccttcctaat aatatccata aatggcttgt
19921 ttttggtaat agatgaacat taccgtatgt aaattcgggt gcatgataga catcggtact
19981 ccagtgcatt tctccgtctg aatcagaata aatatgtttt cgaaccttct ctttaaaatt
20041 caaagtggat attcctgcgc gaatctcagc aattacttgt tctgattcta catattgatc
20101 gttttgaact aaaagcaaac ttttgggtgg aatattcaca ttatgtagaa tatcttcact
20161 ctcaatagtt acatacaagt ctatagaaca tagaaaggcg ggatgcccat gacgtgtacg
20221 tgtcggatga accaaatcct cattgaattt gatttttcca ttagatgggg ctcgcacatg
20281 ttctgcagta ccccccgtga atatctccgg tatgaaaagt tcttaatgtt aattgagtac
20341 ccggttctcc aatcgattga cctgcaataa tacctacagc ttccccaatt caaccaggtc
20401 gccatgagta ggactccggc cataacataa tcgacaaatc caagatgtac tcctacaagt
20461 aaagggagtt cgaatagaga ttggttgtgc ccgaaaggtt atgaatcgat ttacaagtcc
20521 aatgccaatg tcttgatttc tagtggcaat acatcgcgga cccatgtata tatcatctgc
20581 taatacacga ccaattaatg tttggataaa aatcctttcc ggcatcatcc cattccgagg
20641 actcacagaa ataccccggg cggtgcacaa tccgttcgac gtacaacaat gtgttgaact
20701 acttcaacaa gtctgcgagt gagatatcca gcgtctgatg ttcgtacagc agtatccaca
20761 atctttaggg ctcgtagcaa gaaatgatgt attctgttaa agagagtcct tcgcgtaaat
20821 tgctttgaat aggtaaatca atcatttgtc cttgaggatc tgacattaat cctctcatac
20881 ctactaattg atgtacctga gatgcatttc ctctagctcc cgagaaagac attatatgaa
20941 ctggattaaa agggtcagtc atcctaaaat taggattcat ttcttgtcgc aaatattcac
21001 ttgtagcata ccatatttca atggattgac gtaatttttc taccgcgtgt acattcccat
21061 aatgatggtg tttttccaaa atcaaacttt gttgttcagc atcttgaact agccatccct
21121 tagaaggtat tgttaaaaga tcatcaattc ctaatgaaat ggatgtagca gtagcttgtt
21181 ggaaccccag agtttttact tgatccagga tatgtgatgt atatgccatt ccgaagtgat
21241 ctattaatct actaataagt cgtttcatgg cagttccgtt tatcgcttta ttgtgaaaga
21301 ccagattggc ccgttctgcc ataagtacct ccatattccg ctgagtagaa ttcgacaatg
21361 ggtttgagtc ggtgattgta aaacttcctt ttatcgatct tgattcgcgt ataaattccg
21421 gaactatgga cctagctgaa ccggagagcc ccgaagtccc acgggtatca tagaattacg
21481 ttaggtacca gatgaatagg cccgagaaaa cccctgtata gcttcttcga tttctcgata
21541 aagagcaata tgaccaacag tggttcgaat gtatataaaa aggatttgtt tttttagact
21601 tcttactatt agatagtgtc cataaatctc ataaaaagta cctaaagatt catagtgaac
21661 ttcgatggga gtttctcttg aagcaataac gcgttgatct agtcgccacc ggagccacaa
21721 aggactatct aaattgattc gtttctgccg ataagcccca attgcatcat aggaattaga
21781 aaaaaacggt tctttcgtat acttatagtg actattgtca cttctttttt gattttgata
21841 gtttctgcga ttacatggat tatatctatt tacacaaata cctcgatgat ttccgctcgt
21901 taatacatag agtccaataa gcatatcttg cgttggtacg gaaatgggat ccccaatagc
21961 cggagacaaa agattcatat gagaaaacat aagtaaacgg gcctctactt gagcctccaa
22021 ggataaaggt acatgaacag ccatttgatc tccatcaaaa tcagcattga atcccttgca
22081 aactaatgga tgtaaacaaa tagcgcgccc ctccactaaa acgggctgga atgcctgtat
22141 gcctaatcta tgcaaagtgg gtgctctatt cagcaataca ggatgtccct gcataacttc
22201 ttgaaggatt tcccaaacaa tcggctcttt ttctcgaatt ttactcttag caactcctat
22261 gttcgaagca agatgttgtc taattagacc acgaattaca aatgtctgga aaagttctat
22321 tgctatttca cgaggcaatc cacatcgatg taatgaaagt gaaggaccca cgacaatgac
22381 agaacgtcct gaataatcga ctcgtttgcc aagcagagtc tcacgaaatc ttccctcttt
22441 gccttcaatt acatcagaaa atgacttgta aactttatta tgaccgtccc tcattggttg
22501 tccccggatt ccattatcaa gaagtgtatc cacggcttct tgtactaatt tctcctgaca
22561 cattactaat tctcctggcg tagatctact tgttgttaat agatcggtaa gggtattgtt
22621 ccgatagata actcttctat agagttcatt aatatctgag ctcattagtt tacccccatc
22681 tatctgaatg atcggtctca actcaggagg aagaactggt aatagacaca aaaccatcca
22741 ttctggctct atatttgttc gaataaaatg cttagccaat tccacgcgtc taaccaaaaa
22801 gtcctttctt cttccaactt ttcgatcttc ccattcattc cctgtgtgcc cttcttcccc
22861 caattcttcc cattctacca acgaattctc tataataatt cgtaaatcta gatcggctaa
22921 ttgttctcgg atagcacccg cgccagtaga gatttctcga ttgcgaaatg tatcgaaacc
22981 ctgggtagta aaaaaaagtg ggatgctgta tttccaagat tggatttcat attcgaataa
23041 acctcgtaat cgtaagaaag tgggcttttt agttatgggc ctagcaaaag aaaaattggg
23101 ataggattct ataggatctc ccccccttca aaatcggacg tgaaagtttc ctttcatccg
23161 gctcaagtag gtacaccaaa taaggaaagg agttctcgtt ttcaaactct agaaaatccc
23221 aaaataaaaa ggtctactcc ttactcaagt tcccagtgaa gacgaaacaa gatttcagtg
23281 attccgtctt ctattaattc tttattcaaa ttcaattcca acaaataaaa tagaaaattc
23341 ttgagtagtc tacttccctt tgaatgataa atcccttaac tcttaataat taaaggaata
23401 ccttggaacc cataagggat ttacttgtct atatattgtt ccattcgatc ttttaggtcc
23461 cgacttcacc tcgatggtta ggccaccacg cccttaaagt ctatacgcga tagatagact
23521 cctagaacca tgacatattt gcttacttga acataatttc tttccacgaa aagaaaggaa
23581 atgtttcatt ctacaaaata aaaagctttt tttacgatgg tacaaataga aattcctctt
23641 tatttgattt gttacgaaat cgaccataga tcaattccct ttttatttgg gagtattgac
23701 tacaccccaa ttctgagctt catgttactc tttccaagtg cacatgtcag gtccagggca
23761 tcccaattgg attgactggg atgacagttt ctccttccga gtctgtaaaa tcagaatttc
23821 gatcaaatca cacatcgcag tatactaggc cttctaattc tttaagaggt ttatctaaaa
23881 gattcgcaat ataactagga agacgtttta aataccacac atgggttact gggcatgcga
23941 gtttgatata gcccatttga taccttcgta tccgagaatc aacaaattcg accccgcatt
24001 gttcacaaaa tttcgggtct tctttttcat ctccgattac tcgataattt ccacaagcac
24061 aaattccgct ttttatagga ccaaaattct tcacaaaata atccatcttt tccggtttgt
24121 tagttttgta atgaaaagta tagggttttg ttacctctcc aactatctct ccattaggca
24181 ggattttagt ggcccaagca cttatttgtt gaggagaaac tgatccaatt cggagctgtt
24241 gatgtttata tcgatcgatc atagaagaaa aattattatt cattccgatt aagcttcctt
24301 cctattaatc tggaagttct tctcagatac aaggaaatga ttcagttcca gagctaaaga
24361 tcgtagttct cgaacgagca atcgaaaaga ttctggagca tcttcgggat taggtattgt
24421 tcccccaatg atcgtagtac caagtacttc ctggcgagct ctaatatgat ccgatttata
24481 agtaagcatc tcttgtaaaa tatgagcaac cccaaaccct tctagagccc aaacctccat
24541 ttctcctacc cgctgtcccc cctgtttggc tcttcctcta aggggttgtt gtgtaacaag
24601 cgcataatgt ccactggagc gcccatggat tttatcatca acttgatgaa ttaatttcaa
24661 gatataaggc tttcctatta taacgggttg ttcaaaagga ttccccgtcc ttccatcaaa
24721 tattctgctt tttcctggat attcgggttc aaatacccat ggattcgctg tttgcttact
24781 ggcttcatat aattcagaaa acacaagttt tctcgaagct tcttgttcat atctctcatc
24841 aaaaggtgct attcgataat gtctgtctag cagactccct gctaacccta gtgaacattc
24901 aaatatctgt cctacattca ttcgtgaagg tactcctaat gggttaaaga ccatatcaac
24961 ggatcttcca tcttgtaaat aaggcatatc ttgtctaggc aaaattttgg aaatgatacc
25021 tttatttccg tgtcttccag ctactttatc gcctactttg atttcacgtt tctgtaaaat
25081 atatacacga atcgtttcgg gattataact agaaccaccc ctcttctgga tccacctcac
25141 atcaataacc cgacccctgc cacctatagg tagttttaga caagtttctt ttgaagtaga
25201 tacctgaata ccaagtatag ctcgtaacaa tctatcttcc ggggcatacg acgattcttt
25261 cacgacctgg ggtgttaatt tacctactaa aatatcacct gtctctaccc aagatcccag
25321 catcacaatt ccatttttat ctaaattgcg gagtaaatgg gcttctaaat gcggtatttc
25381 attagttact ttttcagggc cttggcttgt cacatgagtc tgaatttcat atttccgtat
25441 gtgaaaagaa gtataaatat cttcatatac caaacgctcg ctaataagta ctgcatcttc
25501 agaattgtaa ccctcccacg gcatataagc tactaatacg tttttcccca aagcaagttc
25561 gccaccaacc gtagcagcac catccgctaa aatttgtccc tttttaatgc atttaccccg
25621 aggaacctgg agtttttgat gcatacaagt atttttattg gaacgttgat atataactaa
25681 tggaatgctt agaatatctc cattacctgc taaaagaatc ttgtcagtat tggtataaac
25741 gacccttccc tcgcgttcgg ctatagcaag agcccccgaa tctagagctg cttgtcgttc
25801 caacccagtt ccaacaatgc atttctcgga gcgagaaaga ggaactgctt gacgttgcat
25861 attagaactc attaaagctc gattcgcatc attatgttcg ataaaaggaa tgagggaagc
25921 tccaatagaa aaatattgaa aaggaaaaat acttcgaaga tgaacctgtt cccatgcaat
25981 agtcaagaat tcttgacggt atcgagctgg aacaacctgt tcttcctgaa tatcctgatt
26041 taaggctaaa gaatttcctg ccgctaccat atagtattca tctctacctg gtgataaata
26101 aagcatccgt accccggttg acctctcaga aatttcataa aaagggcttt ctagagatcc
26161 ccaatgacca atcctcgcat gaattgctaa ggatccaata agtccaacat tgattccttc
26221 agatgtgtca attgggcaaa tacgtccata gtgactagga tggatatctc gtatccgaaa
26281 actagcagtg cgccctgtca gtcctccagg gcccaaataa cttaattttc tcccatgaac
26341 tatttgtgtc aatggattag ttcgatccaa aacttgagat aatgggtgta aaccgaaaaa
26401 ggattcataa gtagttgtta atggagttga ggttaccaaa ttctgaggtg tcggtatcaa
26461 tttatgccga attgctccac atatagtccc ccgaaccaca ttttctaaac gaaccagagc
26521 caatccgaat tgatcttgta aaagatctgc tacagaacga atacgtttat ttttcaaatg
26581 attcatatcg tcaagtgcac ccattccaaa tttcagccca atcaaatgat cggcggctgc
26641 caatatatct cgtggtaaca aaaatgtatt gttctggggt atatcaaggt tcagtcttcg
26701 gttcatattt cgtcgaccaa tccttcctaa ttcacatctt tgttgaaaga atttcttttg
26761 taattcctta cataaggatt cagaaaatac cggatcgcca cctacacaag caaattgttg
26821 ataaaactcc aaaatggcat tttcttttga cccaattttt tttctctcct tatcactcag
26881 aaaagacaaa aaaatttcag gatagcaaac attctctaga atttctctta gattcaaacc
26941 catagctgat gatagaacta gaatagatat tttttgtttc ctacttacac gagcccatat
27001 ccttgctttt ctatcaattt ctaattctga tcttcctccc caatctgata ttatggtgcc
27061 ggtatagacc gaaattccgt tatggtccaa ttctgatcgg taataaatac cgggactttg
27121 caatatttga ttgatcacaa ttctatatat tccattgact atagaagttc ccagggaatt
27181 cattagagga atgtttccga taaaaattgt ttgttcttgc atatccctac tgtttttcca
27241 aattaatccc gcggatacat ataattcaga agaatatgtg agtgattcat acacagcatc
27301 tctttccttt atcaagggtt cgaccaattg atatgtttcc acaaataatt gaaattcaat
27361 ttcttgatct gtatcttcaa tttttggaaa cttataaagt tcttccgtca aaccttgatc
27421 aatgaaccta caaaatcctt caaattgtat ctgattaaat ccaggtattg tagatattcc
27481 ctcatttcca tccccgagca tttttaattt cccatttatc aaaaaatacc actattggtt
27541 cattcttcat ctaattagat agattagata aatgatctag caatgatggc atttctattt
27601 tgtttaccga atcacatgaa attttaccca actccatatc tggaatgtat gaaatacgta
27661 tgaacggagg aagaaagaga attttctact taaattgaat tggaatttat tggaattttc
27721 aacagataca aatggaaaga aattgataaa acatccctag aaacagactt ctgctactta
27781 gacttattaa ttaagttata gaattttgta tagaatatca aaacaaaaat gattccattt
27841 ctaccattat tatgataata cacattccaa cctgcttgaa taccagaaaa ataaatggat
27901 tcgacatttg atcttttcgc tgagataaag gcataaaaat aagaaagaat atatagaatt
27961 agaatcggtt ttttagcatt taaccccctt ttctgttatg gatttcgttg ctaaaaaaat
28021 gatttgtaga gaagagagag attttgttta cggatttttg aatagaatac gattgtgaag
28081 tgtataagaa aagaaggttt gtatggctta accacgtgtg gagatatcta taatatccgt
28141 ctttcttctc ttttattgtt ttattgtcgt tctctgttct attcggggca acccgggttg
28201 tgctctatga aaacagaatt tcaattttct attcaattca aaattcaaat tgaagtatga
28261 tacttttctg atatctgata attctctatc ggaacatata taaataatat ataccgtcta
28321 acaatttctc ttgggggttt acatatactc ataattgttg ttataattaa aattgagaag
28381 gattttttga ttgaaaaaat ccatactgat tagttatata tcaagttgta ttttcttatg
28441 tcattaggaa aacaaaattt ggagattcaa atccaagaat cattcatgca ttctaagtca
28501 atagttaatg gttccgattt tcagaaattt gaattttgga ttttgcgact gaaaatccac
28561 atttgatttt tcaatagaaa ggtaagagaa agctttgaac attatgaatt tggagatcga
28621 aattgaaagg atgaatcaaa cccaatcaaa agggaagaag gattaggatt tctttgactt
28681 ttaggaaaaa ttaaggaaaa cagaactcaa ggtgcaagta caataaaaaa gcagttcagt
28741 aatcctggaa agttttcatc tattttgtat ttgtagcatt ttggcgacat ggccgagtgg
28801 taaggcagag gactgcaaat ccttttttcc ccagttcaaa tccgggtgtc gcctgatcaa
28861 caaaaaactc gaaatctctt cttttcttct gttctgttga tataacccgc cgaatgattc
28921 cccagcagaa gcagagaaag cagactgttg atacttgttt gattctaaac atctggtctg
28981 ggggtttttc taaaaaattg taaatatctt tgcattgcat atttaggctt caaggaaata
29041 ttcgaatgct agaggggcta tcaagacttc gcaattacct tctactacaa atcaaaattt
29101 tctattatta atgcattgta taatgactgg accttgaatt agattggaga gcccgatagg
29161 aaatctaaat agttgtggaa gggggcggaa gatactttat tatatacgag gaactcacga
29221 aaatctctga gtgctcaagc atccaatcaa ttgaaatgag ggtcaacaaa aaaagaatag
29281 gacctattat tcctacatgt tccattagta acattccctt gagatgttac tgcagatttt
29341 gcttgtgttt aatctttccc gattagaaat cctataggaa tttcttataa aatgagcgaa
29401 tttattggat tggtttatta atagtcttcg ttctttttga ctctgcgcca ttgattccac
29461 tattattagt gaggaataac ggaacaattc ctttatattt atagagatag gggacataat
29521 tcatatggat atagtaagtc ttgcttgggc tgctttaatg gtagtcttta ctttttccct
29581 ttcactcgta gtgtggggaa gaagtggact ctaggggtcc tactaattga gttaaggaag
29641 caaactgtat caatatcaat tgctttcgag atcgttctgc aacacgtttt gaacaaaatc
29701 aaaatatctt cattttgaaa ttccattgga ctcgactgga gtaatgtatt ataggaatca
29761 tcctctttca atcaaagagc tatttcaacg attcccatgt ttgtagttcg aaaggaagag
29821 gatcccagga aatttattcg aacctaattc ttccgaaatt ttctattcca atcaacggcc
29881 tcttacaggt gatactgagg agggccggac ccttttttta tttctttctc tctttactgt
29941 tcaaagaaga ggtagttttg ttaagtgtat acgcactttg tatgagaaag aaaggatata
30001 aacatagtgg ttgtctaacg agatactatg cagaataaga tcttcagatg agtcacatat
30061 tgcgcattta ccgctttcga atttttgaaa ttggatttat gctttatcga cttatttcat
30121 atcatggttc aggcgttaaa aatcggtgag gtttactctt ccttttcgat gcccgtggaa
30181 ctactgtcaa tggtttactc aattacttct tgggaatgtt aaaaaaaaga ttactacgtg
30241 attttttgaa tctgcctata tctatcgctt ttccttcatt gatttgattc tttcaataga
30301 taccgagatt cagattggaa atcaaaaatc tagtaattca aactataaga cataagagta
30361 atttagattg atcagaacaa atagatatag caaataaatg gaattggatg ctatgtcaat
30421 cccatatatg gaattgatat tcacatatat caagataata ttgtagattg atctatagat
30481 ccatatcaaa agatccatat caaatgcagc ctctatcttt attttattcc agggggcagc
30541 tttataacta caatctaact aataaatagt atggtagaaa gaaatagatg aatctttctt
30601 tctaccatac tatctatcta ttagaatact gccgattcta gtccatacat tttcatttaa
30661 gacatgaaat tagaatcttt ttcattttat ttcgtcaatt ttggctaaga actcagaagt
30721 caagtttcat tcaaattagt taataattaa tcgttttgac tgactgtttt tacgtaaatg
30781 ataagtagaa aagcggtagg aactagaata aatagtgcag tagcaataaa tgcaagaata
30841 tttacttcca taatctcatc ggttttttac ttcgcaataa ctcgggattt aatcccatag
30901 agatgataaa tctttggcct gtaaattcaa tgaatgaata ttacctctcg atgatcttga
30961 atcggatcaa tatcatgaat aacaatatct gaactatcaa atcaattcgt cgtcgagaat
31021 tgaatagtat aacataggaa gttcttttat ccataccgcc ccaaacttgg attcctgacc
31081 caatccaaaa ttcctttatt tatttatcat tatcattttt tctcatctgt tctttttttc
31141 tctctaatct atctagttcc ttcttgtaca atcatctgat gaagtctcat caaatagctc
31201 ttccacttcc agtggtcaca catagttaca aacccaaaca aacaataaaa gctaaatgga
31261 aaaagaaagg agtttagaac taaactattt ttgacttgga agacaaagaa gtgtgataaa
31321 gatgagaccg tataaaatga atattcatca aattgactat tttccgattt gttctttcgt
31381 cgatgggggc cttaaaacaa aatgaaaaat cggaaaaatg attcattccc ctttctaaga
31441 ggagtaggat ctttcctttc ccctcctttc ttcgtagatt attagccccg ggacacctat
31501 accaaaagct cagtgtgcaa tttgcatgaa atctattttt caacttcaaa ctagtaagtg
31561 aggttccata aatccgtagc cagaaaaata aattgttttt ttttttgttt tttctgggaa
31621 agtattttct tatattaaat tttgtattgg acaagaaagg aattcccctt gtgtatgcgc
31681 gcctcaaaaa ggtatagtac tcgattccat tacatgcatc gggggcaatc gaaaaagcca
31741 gcatttcttg gaatactgac tataatgcta ccaataatcg tactaatcca accgcatatg
31801 tctttctcct accaaaagga aagaaaaaag aaataaggat ttcccctttg ctttgacaat
31861 gaaattctgc ccccggtccc cttcataaaa agggagagat ttattgatat atttattgga
31921 tccatcggga ctgacggggc tcgaacccgc agcttccgcc ttgacagggc ggtgctctga
31981 ccaattgaac tacaatccca gggaaatacg ggatctagca gaaaatttga ttctttttta
32041 tctccggatc gggtatttct gaagtacgaa gggggttata tcatctcatg gcggattggc
32101 gaatttttgg gccgagctgg atttgaacca gcgtagacat attgccaacg aatttacagt
32161 ccgtccccat taaccgctcg ggcatcgacc caagaagaat caattttaga cttattggta
32221 atccatgatc aacttccttt cgtagtaccc tacccccagg ggaattcgaa tccccgctgc
32281 ctccttgaaa gagagatgtc ctaaaccact agacgatggg ggcctgcttg accaaccgcc
32341 atcatactat gatcatagta tgatcagttt tttgaaattg tcaatataat cgaatgattc
32401 tatccgaggg atctttcccc ctttcagaat tgcatagaat ttttttattc gtcattgatg
32461 aattattcat tagaatcgcc attagaaatc tagtagtagt attttttttt ttttggaatt
32521 atttcaattg aatttctttc gattatttta gtttagatta tttagtattt agaattttct
32581 ttttttatta taaataaaaa aaaaattaat aaatacaaaa aatagaaata ataaggaaga
32641 gtaggatttt tgcagggaat gattggtccg tcagaaaagg aaaaaggtgt gaaattctat
32701 ttctttcact ttcatttgat tcattgttaa gacgagatat ccttatctcc ctcccaccaa
32761 gacaggaaat taacaaacga gaaatctagt aagcgggatc aagaagaaaa ttcttttttc
32821 tccaagaatt tagttcagga gacaagtaga atctcttcat tccatgattc gatgaaatat
32881 cttgaatttt atgttgaatt gctaggtgta tgtacatgta tcaatcaagt gaattttgtt
32941 ctggtgggat caattcaata aaagaaaaaa agcaattcga gtcggtcttg aaacaattca
33001 ttgcattttc tcctagactt cctaggtaaa tccattttat tattcaacaa tgagccacta
33061 gacactatgt atctactgca tgtacttatg catatatact tatgtttata atatatgtac
33121 ctatagatat tttatccaca tagtgaataa ttccggaatt aaatcaaaaa ggccctttta
33181 actcagtggt agagtaacgc catggtaagg cgtaagtcat cggttcaaat ccgataaggg
33241 gctttgtaaa actccaatct agtattcata tttgagggga gaattgtatt tttatttgta
33301 ataaaaaaag taactaactg gataatacat tatcattata cttaattatt atacttagtt
33361 ataaagttga acatttgttt agtcaatttt cattattatg aatttctgaa taatgaaaag
33421 tcacttcttg aactcaccga atattcctat tttccattat accaaccaaa tccattcgaa
33481 aggttagaaa tcaacaaaag aaaaagtaag tggacctgac ctattgaatc atgactatat
33541 ccgctattct gatattaaaa ttcgatagag atgaaattgg agcagttgat ttttttttaa
33601 tttcattttt ttgttttgga ttccacaaga atttgtcgat atttccgatt aaatcttctt
33661 gttactagat tttctatagg aaaaattata ggaataaatt gttattcctt tcctctacag
33721 agaaaccttt cttccaagtc acaccataag agccatttat tatctttctt tgattccaga
33781 tcaaagatta atttcatcat taatttctat ctagattata tatctatatt atattaagta
33841 gattgtagat ttcgatgtat atctatcaga tcgtggcttc atgtaccaaa tatttcaata
33901 tcgttgcatc cggtattttt gttttgttcc aacagtgtga tgaagaatag atccgagaaa
33961 gagactttca ttttcagtct cttatttatt ttatttttat tgaattttcg attttctaaa
34021 aggaaaatct aaaaggaaaa atagtagatt atctcttttt ctaacagata aaagaatcta
34081 aaaataaata ttcgatcgaa ctgtcttttt tccttcgatc cgtggaaaga tatactctgg
34141 ggttttagat ttatttatat gaagtatgaa ggaaagggat cgcttggtcc ttgaagagtt
34201 ctttcaaaac aaaggattga ttgaattgtc ttattaggac aattaatggt tcatatgctt
34261 agtcagaagg aataatccaa tggagttcat ggatttacct aggtcagttt atgggctaat
34321 caataaagca tttttatctt cgaaacccat tggaaagggc agtgcaagag aaatcataca
34381 aaaatgatcg aatcttcgga cgccccgaaa aagatatgag gtgctcggaa atggtcgaag
34441 tagttgaata ggaggatcac tatgactata gcccttggta agtttaccaa agacgaaaat
34501 gatttatttg atattatgga tgactggtta cggagggacc gtttcgtttt tgtaggctgg
34561 tccggtctat tgctctttcc ttgtgcctat ttcgctgtag ggggttggtt cacaggtaca
34621 acctttgtaa cttcatggta tacccatgga ttggccagtt cttatttgga aggctgcaat
34681 ttcttaactg ccgcggtttc tactcctgct aatagtttag cacattcgtt gttgttacta
34741 tggggtcctg aagcacaagg agattttact cgttggtgtc aattgggggg tctgtggact
34801 tttgttgctc tccatggagc ttttggccta ataggtttca tgttacgtca attcgagctt
34861 gctcgatctg ttcaattgag accttataat gcaatcgcat tctctggtcc aattgctgtt
34921 tttgtttctg tatttctgat ttatccactg ggtcagtctg gttggttctt tgcacctagt
34981 tttggtgtag cagctatatt tcgattcatc ctcttttttc aagggtttca taattggacg
35041 ttgaacccat ttcatatgat gggagttgcc ggtgtattgg gcgctgcttt gctatgcgcc
35101 attcatggtg ctaccgtaga aaatacttta tttgaagacg gtgatggtgc aaatacattc
35161 cgtgctttta acccaactca agccgaagaa acttattcaa tggtcaccgc taaccgcttt
35221 tggtcccaaa tctttggggt tgctttttcc aataaacgtt ggttacattt ctttatgtta
35281 tttgtaccag taaccggttt atggatgagt gctcttggag tagtcggtct agccctgaac
35341 ctacgtgcct atgacttcgt ttctcaggaa attcgcgcag cggaagatcc tgaatttgag
35401 actttctaca ccaaaaatat tctcttaaac gaaggtattc gcgcttggat ggcggctcaa
35461 gatcagcctc atgaaaacct tatattccct gaggaggttc taccacgtgg aaacgctctt
35521 taatggaact ttagccttag ctggtcgtga ccaagaaacc actggtttcg cttggtgggc
35581 cgggaatgcc cgacttatca atttatccgg taaactacta ggggctcatg tagcccatgc
35641 tggattaatc gtattctggg ccggagcaat gaacctattt gaagtggccc atttcgtacc
35701 agagaagcct atgtatgaac aaggattaat tttacttccc cacctagcta ctctaggttg
35761 gggggtaggc cctgggggag aagttataga cacctttcca tactttgtat ctggagtact
35821 tcatttaatt tcttctgcag tattgggctt tggcggcatt tatcatgcac ttctgggacc
35881 tgagacactt gaagaatctt ttcccttctt tggttatgtc tggaaagatc gaaataaaat
35941 gaccacaatt ttaggtattc acttaatctt gttaggtcta ggtgcttttc ttctagtatt
36001 caaggctctt tattttgggg gcgtatatga tacctgggct ccgggagggg gagatgtaag
36061 aaaaattacc aacttgaccc ttagcccgag tatcatattt ggttatttac taaaatcccc
36121 ttttggaggg gaaggatgga ttgttagtgt ggacgattta gaagatataa tcggaggaca
36181 tgtatggtta ggttccattt gtatacttgg tggaatctgg catatcttaa ccaaaccctt
36241 cgcatgggct cgacgcgcac ttgtatggtc tggagaggct tacttatctt atagtttagg
36301 ggctttatcc gtctttggtt tcattgcttg ttgttttgtc tggttcaata ataccgctta
36361 tcctagtgaa ttttacggac ctactggacc agaagcttct caagctcaag catttacttt
36421 tctagttaga gaccaacgtc ttggggctaa cgtgggatcc gctcaaggac ctactggttt
36481 aggtaaatat ctaatgcgtt ccccgactgg agaagtcatt tttggaggag aaactatgcg
36541 tttttgggat ctgcgtgctc catggttaga gcctctaagg ggtccaaatg ggttagactt
36601 gagtaggttg aaaaaagaca tacaaccttg gcaggaacgg cgttccgcag aatatatgac
36661 tcatgctcct ttaggttctt taaattccgt gggtggtgta gctaccgaga tcaatgcagt
36721 caattatgtc tctcctagaa gttggttagc tacctctcat tttgttctag gattcttctt
36781 cttcgtaggt catttgtggc acgcgggaag ggctcgtgca gctgcagcag gatttgaaaa
36841 aggaattgat cgtgactttg aacctgttct ttccatgacc cctcttaatt gagatgagac
36901 aggagatcca atgcttgaat gaagtaaaaa tcactttgat tcaatcatac atcttggaat
36961 cagcctaagt attccttttt tgtattcctt ttttcttttt ttttttcaat tcattttatc
37021 taatttattt ttctggcttg gctaggtggg atagccgagc cattcccttt tctttcggat
37081 agcaggttgg gcaaaaccac taaagaaaaa aatctattca attagcaaaa aaggagagag
37141 agggattcga accctcgata gttctttgtt aaaactatac cggttttcaa gaccggggct
37201 atcaaccgct cagccatctc tccgaaagac tatttttatt ttattcctcc gaatagaaca
37261 tggccatagg ggtggatacc cccactatct gtactatctg taaaaagatc tcaggtgcga
37321 atccaccggt cgatctatct atccgtatat agatatatga tctagcatgc ccatttgtga
37381 aataaaaaat aaaattccat ttccccccac tccatgtacg aataaagtgc gaaaggggga
37441 gtagtaataa gtcatataga atcaatggat tcatgataaa gtaaaatccc tcgatgacat
37501 attttatcac aattaatatt ttttggctga tagagggatc aaatggtata tagttcattt
37561 gttggtagct tggaggatta aaagcatgac tcttgctttc caattggctg tttttgcatt
37621 aattgctact tcattaatct tattgattag cgtacccgtt gtatttgctt ctcctgatgg
37681 ctggtcaagt aacaaaaatg ttgtattttc tggtacatcc ttatggattg gattagtctt
37741 tctggtgggt atccttaatt ctctcatctc ttgaacctat tcgtcgcaga cccaaaacca
37801 aaatgacccc cctaattttt ctcggttgtg agacacatta aattggaatc taagtcccca
37861 aagaaaacgc aaatcaaata aagaaaacaa aaaaattaga ggggggtcaa acttcttgaa
37921 taaaaagaat acaattaaaa aaataattgg aatcgttccg aagagaatat gtgtcccggc
37981 actgcacaaa aaagatccgg ttatatatca tatatgtggg tacatattgt gtatcaagaa
38041 caaaaaaatg cggatatggt cgaatggtaa aatttctctt tgccaaggag aagatgcggg
38101 ttcgattccc gctatccgcc caagatccaa gataaagtaa ttttattact atttatttat
38161 tatttaattt cataaatagc attaaatata tccttaaatt aaggatttgg tatagttggc
38221 cgtgatagtg tagtgattct atccctcccc tacgttttct ttttccttcc acccccaaaa
38281 agcgaaaggc gggaattaat tactagttaa cagagtcaac cctaaaatag tttggcaaaa
38341 caagatgttg cggagacagg atttgaaccc gtgacctcaa ggttatgagc cttgcgagct
38401 accaaactgc tctaccccgc gccgaagata agaactgaaa actaatagat aaacaaggat
38461 taaatgcgcc cctccaccct atctgtacaa atagaatagc ccatttatac agaatggtaa
38521 aggggcttct atgatcatcg accatagaaa tagaaatgaa gcgttaatcc ttaccaactt
38581 gatcttgttg ctcctggcaa caaacatgca tgaaccattt cacgaagtat gtgtccggat
38641 agtccaaagt ctcgatagtt agctctcggc cttccggtca aaaaacaacg tcgatgaagg
38701 cgtgtaggtg cactattccg tggtggggat tgtaactttc cataaatttc ccatttgtca
38761 ctcaacgacg gaaccttgct tatttctttc tttgaggatc gacgaatcga atgatatttc
38821 tgttccaatt tttgcctctt cttctccctc tgaatcaaac ttttccttgc cataatggtt
38881 gaattcctat tagtatccat gatacaagtc gaatcctaga tgtagaaata gaagaaggtg
38941 gaccccctct ccgtcgaaag aaatgagatt atcgcagata cacacattaa aaatattaac
39001 caaatttgcc cgacgtagag gcaatcaaga aagccgcata agtgaatata taacctacag
39061 aaaagtgagc taatccaacc aatcttgctt gtacaatgga aagggccact ggtttatctc
39121 tccagcgaat caaattggcc aaaggtgtgc gttcatgagc ccatgctaaa gtttcaatca
39181 attcctgcca atatccacgc caagaaatta agaacataaa tccagtagcc caaacaagat
39241 gtccaaataa gaacatccat gcccaaaccg ataaactatt cataccaaaa ggattatatc
39301 cgttgataag ttgtgaagag tttaaccata aataatccct taaccagccc atcaaataag
39361 tggaagattc attaaactgt gaaacgttac cctgccataa tgtgatgtgc ttccaatgcc
39421 aataaaaagt aacccatcca atagtattta acatccaaaa aactgccaaa taaaacgcgt
39481 cccatgccga aatatcacaa gtaccgcctc gtcctgggcc atcgcacgga aaactataac
39541 cgaaatcctt tttatctggc attaacttgg aaccacgtgc atctaaagca ccttttacta
39601 agatcaatgt agttgtatgt aaaccaagag caatagcatg atgaaccaaa aagtctccag
39661 gacctattgt taaaaataat gaattactat tttcattaac agcatttaac caacccggca
39721 accagatgct tcgacccgca ttgaatgctg gaccactcgt tgaagataaa agtacatcga
39781 acccatatga agttttacca tgagcggatt gtatccattg agcaaatata ggttcaatca
39841 agatttgctt ctccggagtg ccaaaggcaa gcatgacatc attatgaaca taaagtccca
39901 gggtatggaa tcccagaaag aggctggccc aacttaaatg agatatgata gcttctttat
39961 gctctaacat tcttgccaat acattatctt cattttgctc cggattgtaa tctctaatga
40021 aaaatatagc tccatgagca aaagctcctg tcatgatgaa tcctgcgata tattggtggt
40081 gggtatataa tgcagcttga gtagtaaagt cttgtgctat gaatgcataa gcaggtaaag
40141 agtacatgtg ttgagctacc aaagaagtaa taacccctaa agaagctaga gcaaggccta
40201 attgaaaatg aagcgaatta ttgattgtgt cataaagacc cttatgtcca cgccccaatc
40261 gtcccccggg gggaatatgt gcatctaaaa ggtctttcat actgtgccca atcccgaaat
40321 tggttctata catatgacca gcaacgagaa aaataaatgc aatagctaaa tggtgatggg
40381 caatatcagt cagccataaa ctttgcgttt gtggatggaa tcccccgaga agagttagaa
40441 tggcagttcc cgccccttgg gcggtaccaa ataaatgact acttgaatcg gggttttgag
40501 cataaagatt ccattgacct gtaaaaagtg ggcctaaccc ttggggatgc ggtaatacat
40561 ctaagaaatt attccaccga acgtactccc ctctggatgc aggaatagca acatgaacta
40621 aatgccctgt ccaagccaag gaacttacgc caaagagtcc tgacaaatga tgattcagac
40681 gagattcggc atttttgaac caggaaacgc tcggtttcca tttcggttgt aggtgtaacc
40741 aacctgctat taaggatatg gcagaaagaa ataatagaaa aagagcacca gtataaagat
40801 cttcattagt gcgtaaaccg attgtatacc accactgata aacaccagaa taagcgatat
40861 tcactgggcc aagagcaccc cctcgagtaa aagcttccac ggccggttga ccaaaatgag
40921 gatcccaaat tgcatgagca ataggtctta catgtaaagg gtcctgtacc cacgactcaa
40981 aatttccttg ccaagctaca tgaaacagat ttccggaagt ccacagaaaa attattgcta
41041 attgaccaaa gtgagaagca aaaatattct gataaagacg ttcctcagta atatcatcat
41101 gactctcgaa gtcatgtgcg gtagcaatac caaaccaaat acgacgagta gtggggtcct
41161 gagctaagcc ttggctaaac cttggaaatc gtaatgccat aatgcttttc aaatcctcct
41221 agccattatc ctactgcaat aattcttgct aagaagaatg cccatgttgt ggcaattcca
41281 cccagaaggt aatgggttac tcctacagca cgtccttgta taatgctcaa ggctctcggc
41341 tgagtagcag gagcaacttt taatttatta tgagcccaaa cgatggattc aataagttct
41401 tgccaataac cacgtccact gaatagaaac attaaactaa aagcccagac aaaatgagca
41461 cctaggaaaa aaaggccata tgcagataat gaagaaccat aagactgaat tacctgggat
41521 gcctgtgccc ataagaaatc gcggagccac ccattaatag taatagaact ttgcgcaaag
41581 tttcctcccg tgatatgagt tactacccct tgatcactta cactgcccca aacatctgac
41641 tgcattttcc aactgaaatg gaatattact accgaaattg cattgtacat ccagaatagt
41701 cctaagaaga catgatccca ggccgatact tgacatgtac cccctcttcc aggtccatca
41761 caaggaaaac gaaaaccaag gtttgcttta tccggtgtca aacgggaact gcgagcaaat
41821 agaacacctt tcaagagtat cagtgccgtc acatgaatcg taaatgcatg aatgtgatgt
41881 accaagaaat ccgcggttcc taatggaata ggcaacaaag ccaccttgcc acccactgcc
41941 actaaatcac caccccccca agttaaactg gtacttgctg ttgcaccagg agccgttgca
42001 ccaggtgcta aagcatgggt gttttgtatc cattgagcaa aaacgggttg taattgtata
42061 gcggtatctg aaaacatatc ttgaggacgc cctaaagcgc tcatggtatc attatgaata
42121 tacaaaccaa aactgtgaaa gcctagaaat atacatgccc agttgagatg ggatatgatt
42181 gcatcacgat gtctaaggac acgatctaat agatcgttgt accgagtagt tggatcataa
42241 tctcttacca taaaaatggc tgcatgcgcg gcagcaccaa ctatgagaaa tccaccaatc
42301 cacatgtgat gtgtgaacaa tgacagttgt gtaccatagt cagtagctag atacggataa
42361 gggggcatgg aatacatatg gtgagctaca acaatggtta aagagcctaa catagctaag
42421 ttaagagata attgagcatg ccatgacgtt gttaggatct catataggcc tttatggccc
42481 tgacctgtaa atggaccttt atgagcttct aaaatatctt ttagtccatg accaataccc
42541 cagttggtcc tatacatgtg acccgctatc aggaaaagaa ttgcaatagc taaatggtga
42601 tgggcaatat cagtcagcca cagaccccca gttactggat ctaatcctcc acgaaaagta
42661 agaaagtccg catattttga ccaattcaag gtgaaaaatg gggttgctcc ctcggcaaaa
42721 ctgggataaa gttgagccaa aagatctcga ttcaagataa attcatgagg aagtggtatc
42781 tctttaggat ctactccagc gtttagaaat tggttaatcg gtaaagatac atgtacttga
42841 tgccccgccc aagagagaga cccaagtcct agtagccctg ccaaatggtg attcagcata
42901 gattctacat cttgaaacca agccaatttt ggcgccgctt tatgataatg aaaccaacca
42961 gcaaaaagca ttaacgctgc aaagaccaat gccccaattg ctgtacaata gagttgtaat
43021 tcactagtta ttccagatgc tcgccaaatc tgaaaaaaac cagaggttat ttgtattcct
43081 cggaaacccc cgcctacgtc accatttaat atttcttggc ccactattgg ccaaaccacc
43141 tgggcactag gcccaatgtg agttggatca cttagccacg cttcataatt agaaaaacga
43201 gcaccgtgga aatacatgcc gctcagccaa agaaagatga tggagagttg accgaaatgt
43261 gcactaaata cttttcgaga gatctcctcc aaatcactgg tatggctatc gaaatcgtga
43321 gcatcagcat gtaggttcca gatccaagtg gtagtatcag gccctttagc tattgttctt
43381 gagaaatgac ccggtctggc ccattcctcg aacgaagttt ttacgggatc cctatctacc
43441 aaaattttaa cttctggttc cggcgaacga ataatcattg agtcctcctc tttccggaca
43501 acacatacaa agagacccgc caacagtcaa ataattagtg aaccttagag atagagagat
43561 atttctataa ttagttcgtt tctcttctat ttttctatct cccatctatc tattttcttt
43621 agttatttac tagagcaatt atgatctgga agtcgatccg gggcaagtgt tcggatctat
43681 tatgacatag ccttgaggcg ctcaacggac cttttaacct tctaaaaacc tttttgggct
43741 ttggattgat ccaaaaacga cttttttgtg caacctagtg tatattcata gaagttatta
43801 gatggagctc tttaattttt tacctagaag attttaatta ctctattcca aatcacgcga
43861 gtagccatta gacattacta agagacatcc ccgctatata tatttagtga ttcgagggtt
43921 tattttatta gttttaataa taagaatttt gtttaattta atataataaa caaagtctat
43981 tttgtactct atctgtgtat ccttttttat tcctaaaaaa tagcagatga aatagaaggc
44041 ttagaaggga gataatgaaa ttatgtgatt gggtcttcca aaagcaaagg aataatccgt
44101 tttttagtta actgatctga tgggtccaac aaacaataaa ttataacaaa tatctaaatt
44161 ctaaataaaa aaatcaaaaa taatagacta agattctaaa taaaggataa taaataaacg
44221 ggatcttctt ttattcgaaa cgtctcgtga tcttcaacca attatgcgct tcaatataat
44281 taccgggagt aagcgctata gcctgtttcc aatactcagc ggcttgatcg aaccaagcct
44341 ctgcaatttc agaatctccc tgttgaatgg cctgttctcc ccggccggaa taggtagttc
44401 aattccttcc cttagaaccg tacttgagaa tttcttacct catacggctc agcagtcaat
44461 tcttttggtg tcccattttg atctatacca tatctaataa aatctaatga gatttctcat
44521 ggatctatcc cagttttagg gttaaccaaa agccaaatag gttaattaca tgagtttcaa
44581 actgaaattt ggatgaataa tccgtttatt tagttttatc ttttttccca ccttcagaag
44641 aataaagcat aggcatttct actagtgtta gaattttatg aaaggtaact atctcggttt
44701 catagataaa tttatataga atctttgaaa aagactttct ttcataagaa agaaaatact
44761 tactatcttt gggatctgat cctacaccgc tgctcaagac tttagtggat cgactctatt
44821 acataagtta attcctaatt tttatttcac atcatgagat aagtatttct tccatcatga
44881 cataagtacg cagttattat tgtatcggcc caaaacctcg ctaattgatc tttacggtgc
44941 ttcctctatc tctatcaatt aaagccttat atccatagaa aaaagttgct aggcattttt
45001 attttttcct attttgactt ctatgaagtt tctttctttg ctacagctga taaaaatcgt
45061 tgttttagac gatgcatatg tagaaagcct atttggttct actagttact ttactagatt
45121 tttctttttt tttttttttt tctttctata gtggagatag tcgcacgtaa tgacagatca
45181 cggccatatt attaaaagct tgtggtaaga atgggtttcg ttctagtgct cgaaaataat
45241 attccaaagc tttcgtatgt tctccattac ttgtgtggat aagccctata ttatagagta
45301 tataacttcg atcataggga tcaatttcta gtcgcatagc ttcataataa ttctgcaaag
45361 cttccgcgta atttccttcg gattgagccg acatccgtta cggtcgtcat tcaattgaaa
45421 gaatctccgt tccagaaccg tacgtgagat tttcacctca tacggctcct cccttatgtg
45481 cataatgaga ataatacata gaatcaaaaa agattcaacg atgaaaatat tctcattatg
45541 aactcagcag ggctagtgtt tttacaagaa atctctagcc aaccttcctg caagagattc
45601 tttcttaaca tcaagcctat tgggactaga tagaaatgat aagataactc caacaatttc
45661 tttgttttta acgcctccta atttccagga attagtcact tcaatagcct tcgatggtta
45721 tacgggtatc caaaggacga acgagatgga tgtttgttgt cccaaccatt cttttagtcc
45781 caagcccgct aaggaaaggg ctgacttaga acaaagtttt cgtgttgttg attcctaggt
45841 gtagtgcttc ttcccctctg ctgcctatta gcgctagtag agtaggattg acccgtaata
45901 cagaacctct aggcgtaacc tttcgcttaa tactagaatc gagaatcgaa acatagcatc
45961 tgaggttgca ttaatcgagg atacacgaca gaaggaattg ttctatttcc aaacttcacc
46021 ttcaaaaagc gtagattttt tcaaaaattt tctcgaatca cgtgtttttc tcctcgtaag
46081 actgagagaa atgactaaat atgaaataaa aaaaaaaaaa gaatcaaatc gcaccatctc
46141 tgtaataggt aaatgcctct ttttctcctg aagttgtcgg aattactcgt aataagatat
46201 tggctacaat tgaaaaggtc ttatcaataa aatttccatt tatccgtgat ctaggcatag
46261 gtagcaatcc attctagaat tcttctcatt acctctcatg ggaaaaagat cccacaaaga
46321 aaagaattgt atagtacgaa ataacataaa aacttctttt ttttttaaga aaaaaacaaa
46381 agatatgaat cctctattcc aattgttcct ttttgacagg aatcgataag aaataagaaa
46441 tatttcaagg cgattcgatt tcatactaat gtagtagtat aggaactatt ccgatttcgg
46501 tgaagttaca aattcgaaga actcgagaaa ttttgattga atcatgatac aaattacaaa
46561 gaagaaaaaa gaccgaataa tcattctatg atgaaaatag aataactgcc aattttgtgt
46621 acataacggg tatacactat acaatcaaat ctaaattttt tttatgaatt tctattctaa
46681 tagaggggta ggtgtttgtt gttgagaact ccaaaaccga aaagtaattt gaaaattttt
46741 ctggtatgga atcatagtct atataattag aattatgatt taagagtatc cattaactat
46801 agtctaaaag atatagacca tcaatcagtt gattcgttct aattcattga attaatccgt
46861 tataaaatat cagaaaaaga aaaagaaggg aacgttgttt tgcaaacatg aatcgaattt
46921 tttttttcac aatttttacg caaaattgta tctttatccc ggagcctcga aggaaagaaa
46981 aatcgttctt tgctttgact ttgatgaaaa attttcagtt aaaatggatt gatcatacct
47041 atccaataat ggaatatgga ttatgactga ctcgctattc actcggtttt tgggtcataa
47101 tcgttatgta ggagagatgg ccgagtggtt gaaggcgtag cattggaact gctatgtagg
47161 cttttgttta ccgagggttc gaatccctct ctttccgtac cttcgcttaa ttcaccaatt
47221 ttactaacaa caagggctca aatagcaatg gataccatta ttccaacagc tagacccttc
47281 tttgatctaa agatatagat tctcaattcc taattgctgt gacgcgtaaa atagaatact
47341 aaaaaataat aataatcaaa atactggaaa gaaaagagta gacaaggaat gaaaatagat
47401 ccttggtcta tgatacaaaa atgggggaaa tccagatcaa actcggattt atcttactta
47461 accttaggtt aatttacttc gcctaaaggg aagaaaattt tccgaaccct cggtttcagt
47521 ctgaggttta agtctgacga gaataatatt ctacgactag caattcattt attttcaaac
47581 cgacccattt actatctatt atttgattga ctaatccttt atattggaat gggtgaaggg
47641 tcaaatggtt tggtaattcc tcatgagggg atgaatcgag agaaatttga atcagagctc
47701 tggatttttg ttcatccttt gccgtaataa tatctcgggg tttgcagcga taactcggta
47761 tatctactat acgaccatta actaaaatat gtcgatggtt aactaattga cgggctgcgg
47821 gaatagttga agccataccc aatcgaaaaa ggatgttatc caaacgcatt tcaagtaatt
47881 gtagtaaaac ttgacctgtt gaccccttgg cttttctggc gatacgaacg tatttaagta
47941 attgtcgttc tgtaagacca taatgaaaac gcaatttttg tttttcttct agacgaatac
48001 gatattgaga ttttttcccg gaacgcgatt ggtttctaag atcacttccg ttcctaggct
48061 ttttattagt tagtcctggt aaagccccca ggcggcgtat ttttttgaaa cgaggtcctc
48121 ggtaacgcga cataaagact ccttattctt atttcttatt tagtatttcg aattaattct
48181 tatttctatt tattttattt tttattgaat tttattttac agaataaacc taaactaaaa
48241 ctaaactgaa tctaaatgaa gcgaagttta ctgaaatagt gtacttgtac tattactata
48301 aagaaaagaa gaatgggatg aattggataa atatacagac ccccttctat tatatatata
48361 atcctttccc gacataattg gaagttccta taataaattg atagcttttg gaaaaggaag
48421 aaggcgctat ttcaatattc tttgatttca aaggaacatt atcaatcatc taaaaaatgg
48481 aataaaaaaa aaagaatagg gaaaagccgg ctatcggaat cgaaccgatg accatcgcat
48541 tacaaatgcg atgctctaac ctctgagcta agcgggccca cataacagaa atcttatatg
48601 catagtaatt gactaaacta ttggaattgg aatcttagtt attaactatt caatattata
48661 ttgaatattc tagaacataa ggattaatat agcgatatag aatttcgatt tatcacaatt
48721 ctaataacaa ttctaatact aatattatta aatagtgatt gtaaatattg ttaatattct
48781 ttttttttca ttttccattt gaatggtaaa tgttcttttt catttctttt tttgtcattt
48841 gaaatccttt tgatttttta ttacagttct atattttatt ctatatcata tatatctctc
48901 attctatatt tatttcaaat tctaattgtt taatggaatg gttagttata actaatgaga
48961 cattcctccg ctttcaggcg aaagtgaaga taaaaaaaaa gaatcgaccg ttcaagtatt
49021 ccaaattgaa tggcaaaatg gcaggaagag agacatatag atggggtata tatccatcta
49081 tattgaattg cggattccga aatgataaaa tcatttttga ttggacaaaa aaaggtctcc
49141 tatagaagat agttaagaaa atcaaagagg agaaaacacg ttttcgagat aggaatcggt
49201 atctaatgaa ttcaatggtt ccagtataaa tgaaagaaaa agaaaaagga atgacatcac
49261 aacgagatcc taatctcaaa aagaaagggg gatatggcga aatcggtaga cgctacggac
49321 ttaattggat tgagccttgg tatggaaact tactaagtga tcactttcaa attcagagaa
49381 accctggaat taacaaaaat gggcaatcct gagccaaatc ctgttttccg aaaacaaaca
49441 aaggttcaga aaaaaaggat aggtgcagag actcaatgga agctattcta acaaatggag
49501 ttaaatgcgt tggtagagga atctttacat cgaaacttca gaaagaaaaa gaatgaagtg
49561 aaggataaac gtatatacat acgtattgaa tactatatca aaatcaaatg attaatgatg
49621 acccgaatct gtattttttc tataaaaaat agaagaattg gtgtgaatcg attctacatt
49681 gaagaaagaa tcgaatattc attgatcaaa ccattcactc catagtctga tagatctttt
49741 gaagaactga ttaatcggac gagaataaag atagagtccc gttctacatg tcaataccgg
49801 caacaatgaa atttatcgta agaggaaaat ccgtcgactt taaaaatcgt gagggttcaa
49861 gtccctctat ccccaaaaag actatttcac tccccaacta tttatccgac cccctttcct
49921 tagcggttcc aaattcctta tctttctcat tcactctatt cttttagaaa tggatttgag
49981 cgtaaatggc tttctcttat cacaagtctt gtgatatata tgatacacat agaaatgaac
50041 gtctttgagc aaggaatccc tagttgaatg attccctatc aatatcatta ctcatactga
50101 aacttacaaa gtcatctttt tgaagatcga agaaattccc cggctttgag aaaattttta
50161 atctactttt gtccttgtaa ttgacataga ccccagttct ctaataaaat gaggatacta
50221 cattgggaat agccgggata gctcagttgg tagagcagag gactgaaaat cctcgtgtca
50281 ccagttcaaa tctggttcct ggcacatgat taatttgtat gggtctctct tccctcgaat
50341 taatttctaa ttaattgata tgaatcaaca tacatattct tttagagtct agattagaat
50401 aatagcttta tccagtttgg cgagatatac cccatctatg ttctagatgg gtagagtttc
50461 ttagataaag tatctaaaag aattggattc tatctcctct tttttttctc ctctcgttca
50521 accgaatttg aatacgtaat acatattcga aaggttcaat tggttaattg ttgaaaggct
50581 caaaagtcga atccgaatct aggggggttg aaatagacaa gattcagctc agatccaaag
50641 aaatagaatc cgatattctc tcatttcttt gtcttttctt tcatattcga tttcttcatt
50701 ccggatttct ccattccttc ctatatgcct ttctagaacc catctaagta atgtgcgcag
50761 tacaaagttc atgatgcaga actcatttgg ttcatcctat tggtgtgacc catccgaaat
50821 aagtatcttc caaataaatg tgagaattcc aatgaatccc taattgtctt tttttgttag
50881 cctatcgata attccctaaa ttagacctgc ttaatctaga acagaacgtg caatccttga
50941 atatctgaaa ttgtctaagt ggaaatagct ttcttatcat tcaatgagca tcttgtattt
51001 cataaaaatt gggggcaata taatccttac gtaagggcca tcctatccaa ctttcaggca
51061 ttaagatacg tttcaagcgt ggatgattat cataagagat tcccaacata tcatatgatt
51121 ctcgttcttg aaaatccaca cttttccaaa cccagaaaac agacggaatt ctaggattcc
51181 tcctggaggc aaatactttt atgcatacct cctctggttg atccacacca tcctctattc
51241 tcgtaagatg atacacacta gctaacagcc cgccaggcgc tacatcatag gcacattgag
51301 agcggagata gttgtaccca tatacataaa aaatgacagc aatggaatgc caatcctcgg
51361 gctttatttg taaagtctct attccttggt aatcaaagcc caaagatcta tgaattagcc
51421 catgcttgac tagccaagca gacaaacgac cctgcatctt ttttatctct cccgcatttt
51481 tatttatata agtatttcac atttacgatg aaatttctga aaattgaccc accacttttt
51541 attctggaca aaggaatcct gtctaattca ctaattcggg ggaagatact gaatttttgt
51601 atttgaaaaa gatttccgta gggatctctg aagtagatgg gggttgataa agaactcttt
51661 gatcataatt tcccgtatga atactgtgtt gaacatgaaa cttgtgattg gtagtaaaac
51721 accgattcgc tcgttgagac ctaattcgat cttcatagag ttctcgagat attttcttac
51781 gaagttttgt tatagcatct ataaccgctt ccggtttagg tgggcaacct ggcaaatata
51841 catctacagg aattagctta tcgactcccc gaacagtact ataagaatcg gtactgaaca
51901 tcccgcctgt aattgtacag gctcccatag caataacata ttttggttca ggcatttgct
51961 catataatct cactaaagag ggggccattt tcattgttac tgttccggct gttaaaatta
52021 gatccgcttg tctaggactc gatcttggta ctagtccata acgatcaaag tcgaagcgtg
52081 agcctattag tgaagcaaat tcaatgaagc aacaactggt accatagaga agcggccata
52141 aactagagag tcttgaccaa tttgaaagat catttaatgt agttgaaata actgaatttt
52201 gggttgttcg atcaagtaaa ggaaactgaa tggaattcat aactgtctca atcttatttt
52261 ttccgttttt ctttttattg tctgaatatt caggagctaa gaccattcca atgccccctt
52321 tcgccatgca taaactaaac caataattaa gataagcacg aaaatgaaag cttctataaa
52381 tacagataca cccaatacgt cgaaactcat tgcccatgga taaagaaaaa ccgtttcaac
52441 atcaaaaaca acaaaaacta gagcaaacat ataataacgg attcgaaatt gtaaccaagc
52501 atcgcccatt ggttctatac ccgactcata agtagaaagt ttctccggcc ctttgctaat
52561 cggggctaac actccggaaa ttaaaaatgc caaaatagga acaaggatag atattattag
52621 aaatgcccaa aaaaaatcat attcgtaaag cagaaacata aacgcactcc tatgaacgtg
52681 gaaaatatac cggattcgat tggtcgattc gaattggaat tgtcaagtca tccataacta
52741 tttagtcaaa acaagaattc attttgatcg aaccgtctag tttgctttgt ttattggttt
52801 attgtagggc atatctcatt gcaagattca tcgactggaa tccgatttta tttccattat
52861 acttatttcc attttattta gttagtagaa ccttctaact atatattact cttatacaaa
52921 ttctcttgtt tctcttgttt tcatccagga ttttctctaa agacggggaa ttctaaatta
52981 attacttatc ttatttcttc tttaattaga aattctttaa agatttctat ttttttctat
53041 aaatagaatc aggaggtctt ttttcttatt ttttcttagt gatttagaat agaacaagta
53101 atcaaataga agagaatgta taggaatttc catctcaaga tttagaagat cttgtgttgg
53161 tatattcctt attattatta tttaataata gtattagggt tcgaatccag gtgacggggt
53221 ttttcttggt tgaatacaga aaaagaggac tggccttttt cgtgttgtgc ttcgctaggt
53281 cgaggtaagt aaggtatacg aaggaaaagc ctatttgaca atgaaagtga ccaaaggtat
53341 tcgtttttca aaaaacttta gcttgtacac aaatacagca ggcccttcct aaatccatgt
53401 gaattcctct tcgtagtttt tcatttcacc aggcccgtga aatgatttga cttccacaac
53461 tcaataagat tggggatatc aaaagaaagg gagtctcact aattctttta ttgtggatat
53521 gaatatgtaa ttcgcctccg aagattaatg acgaaaggtt ggtttcttta tccgcaattg
53581 aaaaaatcaa tatcgattgg atccgttgat atgcattttt tctttcatct gcttaaacga
53641 ttgccgtgag taaacttata ggaataattg gatttcactt agttacaagc aagaaataat
53701 aatgaagaaa tgaaaattat agaatttttt ggattttgca tttttatagg gctatacgga
53761 ctcgaaccgt agaccttctc ggtaaaacag gtcaaactta ttattattaa aatgatctga
53821 actgtttcaa agacccaaca tgcatttttt ttgcattggg ctctttcatt aactgatata
53881 aatatcagtt agtctgccat tttttttctt gacagaaaaa aagataagga aatggctcca
53941 tgtgctctga ttcattattt gggagcatta ccaaagtgtt tcaaaggtgg gattatcttg
54001 acgtaggtct gtctctggcc tagatcaacc taagttaaat gaagtctcta tcgttctgct
54061 gaaaaaatca aatatgaaac ttcatacacc ttaaagttca tatgacgaaa agagattttt
54121 ttgaggtcct tatactcatt atgcctagca ttgaatagac tgggtattca ccttatcaag
54181 atctcaaatc aatgatgggg tctgtttggc acctcctaaa tgggcgtcca aattggaccg
54241 aactctttgt caggctatgg ttccctcaaa gttatggagt aagacatcga tttctcaaca
54301 agatcaattt ttctgattgt atgatgaact cccttgaaaa acattggcgc gcgtgtaaac
54361 gagttgctct accaactgag ctatagccct tagtgcttgt gatacatatt ttatcatgta
54421 gataaattct tgtcaagata aatattccat gatccaacat caacaatctt tgatctcttt
54481 gagcggtatt ccttagatta gtattgctta ttaagtaata tgatatttat aatccatcga
54541 caggatgggt ttcatttggt tctctttggg atgataaatg acctacttaa ctcagtggtt
54601 agagtactgc tttcatacgg cgggagtcat tggttcaaat ccaatagtag gtaaaactta
54661 ttagatacca gagtcaatgg tatctaataa ggtttacgac ccacccttag tgatattgat
54721 tttttgattt tgtatctttt ctatttcatt tttgaatttg aatttttgca tcagaattgg
54781 attctgtttg attgtatttg attgtattca cccgacagaa tctaaatagg attagaaaga
54841 gaacttcttt ttattattcg aacgtaccaa ctagttatga aatcggattg atagcctcca
54901 cccgtgttct agctcgtcgg agagctagat ttgcctcaat tttttgtctc cttccttcag
54961 cctttttcac attagcttcc gctagttcaa gagtttgctg agcttcttgt ggatcaatgt
55021 cactaccctt ctccgcatca tttactaaaa cagtgatctc attattgcct attctagcaa
55081 aaccacccat cagagccatc gttaaccatt ggtcgttaag acgtattctc aaaatcccta
55141 tatctacagc tgtggcaata ggggcgtgat ttggtaatat gccaatttga ccgctattag
55201 tagataaaac aatttcttcc acttctgaat cccaaacaat tcgattaggg gtcagtacac
55261 taagatttaa ggtcatttct tcaaattgct ctccatttct aagttcatag ccttcgcggt
55321 agcttcatcg atattaccta ccaaataaaa ggcctgttca ggaagaccat ctaattctcc
55381 ggaaaggatc aattgaaatc ctcgaattgt ttctgctaga ccaacatatt tacctggaga
55441 accggtaaat acttctgcta cgaaaaaggg ttgtgataag aaacgctcaa tttttcgcgc
55501 tcttgctacg agtaaacgat cctcttcgga taattcgtcc aatccaagga tagctataat
55561 gtcctgaagt tctttgtaac gttgtaaagt ttgcttaact ctttgggcgg tttcgtaatg
55621 ttcctcacca acgatccgag gttgaagcat ggttgacgtt gaatctaaag gatctactgc
55681 tggataaata cctttggcag ccaatcctct tgatagtacg gtagtagcat ctaaatgtgc
55741 aaatgtcgta gcaggagcag ggtcggtcaa atcgtctgcg ggtacataaa ctgcttgaat
55801 agaggttatg gacccttctt tggtagaagt aattctttct tgtaaagaac ccatttcggt
55861 actcagggtg ggttgataac ccacagcgga aggcattcta cccaataagg ccgatacttc
55921 ggatcctgct tggacgaaac ggaagatatt gtcaataaaa agaagtacgt cttgctcatt
55981 aacatctcgg aaatattccg ccatagttag ggcagtcaaa ccaactctca tacgagctcc
56041 cggcggttca ttcatctgac cataaactag ggctactttt gattctgcaa tattttcttc
56101 attaattact ccagattctt tcatttccat gtaaagatca tttccttccc gagtacgttc
56161 acccactccg ccaaatacgg atacgccccc gtgagcttta gcaatattgt taatcaattc
56221 cataataagt actgttttac ccactccagc tcccccgaat agtccgattt ttcctccacg
56281 gcgataaggg gctaaaagat ctactacttc aattcctgtt tcaaaaatag ataattttgt
56341 atccaactgt ataaaggcgg gcgcagatct atgaatagga gacgttgtac tagtatctac
56401 aggccctaaa ttatcaacag gttctccgag cacgttaaaa attcgtccca gagtcgctcc
56461 cccgaccgga acacttatag gagctcctgt gtcaatcact tccattcctc tcgttagacc
56521 ctctgtagca ctcatagcta tagccctaac tcgattattt cctaataatt gctgtacctc
56581 acaagccaca ttaattggtt gaccaacact atctcgacct tgaactacca gagcgttata
56641 aatattcggc atcttgcccg ggggaaaggc tacatctagt accggaccga tgatttggac
56701 gacacgcccc gggttttttt tttcaagcgt ggaaacccca gaaccagaag tagtaggatt
56761 gattctcata ataataaaat aaataaatat gtcgaaatgt ttttgcaaaa attatcgaat
56821 tcaaaataaa tgtccgctag cacgtcgatc ggttaattca ataaaatggg aattagcact
56881 cgatttcgtt ggcaccatgc aattgaaccg attcaattgt ttacttattc actgagactg
56941 agtgaatttg caagcccacc caacctattt taattttaaa atctcaagtg gatgaatcag
57001 aatcttgaga aagtctttca tttgtctatc attatagaca atcccatcca tattatctat
57061 tctatggaat tcgaacctga actttatttt ctatttctat tacgattcat tatttgtatc
57121 taattggctc ctcttcttat ttatttttga tttcaatttc agcatatcga tttatgccta
57181 gcctattctt ttctttgtgt ttttctttct tttttatacc tttcatagat tcatagagga
57241 attccgtata ttttcacatc taggatttac atatacaaca tataccactg tcaaggggga
57301 agttcttatt atttaggtta gtcaggtatt tccatttcaa aaaaaaaaaa agtaaaaaag
57361 aaaaattggg ttgcgctata tatatgaaag agtatacaat aatgatgtat ttggcaaatc
57421 aaataccatg gtctaataat caaacattct gattagttga taatattagt attagttgga
57481 aattttgtga aagattccta tgaaaagttt cattaacacg gaattcgtgt cgagtagacc
57541 ttgttgttgt gagaattctt aattcatgag ttgtagggag ggatttatgt caccacaaac
57601 agagactaaa gcaagtgttg gattcaaagc tggtgttaaa gagtacaaat tgacttatta
57661 tactcctgag taccaaacca aggatactga tatattggca gcattccgag taactcctca
57721 acctggagtt ccacctgaag aagcaggggc cgcggtagct gccgaatctt ctactggtac
57781 atggacaact gtatggaccg atggacttac cagccttgat cgttacaaag ggcgatgcta
57841 ccgcatcgag cgtgttgttg gagaaaaaga tcaatatatt gcttatgtag cttacccttt
57901 agaccttttt gaagaaggtt ctgttaccaa catgtttact tccattgtag gtaacgtatt
57961 tgggttcaaa gccctgcgcg ctctacgtct ggaagatctg cgaatccctc ctgcttatgt
58021 taaaactttc caaggtccgc ctcatgggat ccaagttgaa agagataaat tgaacaagta
58081 tggtcgtccc ctgttgggat gtactattaa acctaaattg gggttatctg ctaaaaacta
58141 cggtagagcc gtttatgaat gtcttcgcgg tggacttgat tttactaaag atgatgagaa
58201 cgtgaactca caaccattta tgcgttggag agatcgtttc ttattttgtg ccgaagcact
58261 ttataaagca caggctgaaa caggtgaaat caaagggcat tacttgaatg ctactgcagg
58321 tacatgcgaa gaaatgatca aaagagctgt atttgctaga gaattgggcg ttccgatcgt
58381 aatgcatgac tacttaacgg ggggattcac cgcaaatact agcttggctc attattgccg
58441 agataatggt ctacttcttc acatccaccg tgcaatgcat gcggttattg atagacagaa
58501 gaatcatggt atccacttcc gggtattagc aaaagcgtta cgtatgtctg gtggagatca
58561 tattcactct ggtaccgtag taggtaaact tgaaggtgaa agagacataa ctttgggctt
58621 tgttgattta ctgcgtgatg attttgttga acaagatcga agtcgcggta tttatttcac
58681 tcaagattgg gtctctttac caggtgttct acccgtggct tcaggaggta ttcacgtttg
58741 gcatatgcct gctctgaccg agatctttgg ggatgattcc gtactacagt tcggtggagg
58801 aactttagga catccttggg gtaatgcgcc aggtgccgta gctaatcgag tagctctaga
58861 agcatgtgta aaagctcgta atgaaggacg tgatcttgct caggaaggta atgaaattat
58921 tcgcgaggct tgcaaatgga gcccggaact agctgctgct tgtgaagtat ggaaagagat
58981 cgtatttaat tttgcagcag tggacgtttt ggataagtaa aaacagtaga cattagcaga
59041 taaattagca ggaaataaag aaggataagg agaaagaact caagtaatta tccttcgttc
59101 tcttaattga attgcaatta aactcggccc aatcttttac taaaaggatt gagccgaata
59161 caacaaagat tctattgcat atattttgac taagtatata cttacctaga tatacaagat
59221 ttgaaataca aaatctagaa aactaaatca aaatctaaga ctcaaatctt tctattgttg
59281 tcttggatcc acaattaatc ctacggatcc ttaggattgg tatattcttt tctatcctgt
59341 agtttgtagt ttccctgaat caagccaagt atcacacctc tttctaccca tcctgtatat
59401 tgtccccttt gttccgtgtt gaaatagaac cttaatttat tacttatttt tttattaaat
59461 tttagatttg ttagtgatta gatattagta ttagacgaga ttttacgaaa caattatttt
59521 tttatttctt tataggagag gacaaatctc ttttttcgat gcgaatttga cacgacatag
59581 gagaagccgc cctttattaa aaattatatt attttaaata atataaaggg ggttccaaca
59641 tattaatata tagtgaagtg ttcccccaga ttcagaactt tttttcaata ctcacaatcc
59701 ttattagtta ataatcctag tgattggatt tctatgctta gtctgatagg aaataagata
59761 ttcaaataaa taattttata gcgaatgact attcatctat tgtattttca tgcaaatagg
59821 gggcaagaaa actctatgga aagatggtgg tttaattcga tgttgtttaa gaaggagttc
59881 gaacgcaggt gtgggctaaa taaatcaatg ggcagtcttg gtcctattga aaataccaat
59941 gaagatccaa atcgaaaagt gaaaaacatt catagttgga ggaatcgtga caattctagt
60001 tgcagtaatg ttgattattt attcggcgtt aaagacattc ggaatttcat ctctgatgac
60061 acttttttag ttagtgatag gaatggagac agttattcca tctattttga tattgaaaat
60121 catatttttg agattgacaa cgatcattct tttctgagtg aactagaaag ttctttttat
60181 agttatcgaa actcgaatta tcggaataat ggatttaggg gcgaagatcc ctactataat
60241 tcttacatgt atgatactca atatagttgg aataatcaca ttaatagttg cattgatagt
60301 tatcttcagt ctcaaatctg tatagatact tccattataa gtggtagtga gaattacggt
60361 gacagttaca tttatagggc cgtttgtggt ggtgaaagtc gaaatagtag tgaaaacgag
60421 ggttccagta gacgaactcg cacgaagggc agtgatttaa ctataagaga aagttctaat
60481 gatctcgagg taactcaaaa atacaggcat ttgtgggttc aatgcgaaaa ttgttatgga
60541 ttaaattata agaaattttt gaaatcaaaa atgaatattt gtgaacaatg tggatatcat
60601 ttgaaaatga gtagttcaga tagaattgaa cttttgatcg atccgggtac ttgggatcct
60661 atggatgaag acatggtctc tctagatccc attgaatttc attcggagga ggagccttat
60721 aaagatcgta ttgattctta tcaaagaaag acaggattaa ccgaggctgt tcaaacaggc
60781 ataggccaac taaacggcat tcccgtagca attggggtta tggattttca gtttatgggg
60841 ggtagtatgg gatccgtagt cggagagaaa atcacccgtt tgattgaata cgctgccaat
60901 caaattttac cccttattat agtgtgtgct tctggggggg cgcgcatgca ggaaggaagt
60961 ttgagcttga tgcaaatggc taaaatatcg tctgctttat atgattatca attaaataaa
61021 aagttatttt atgtatcaat ccttacatct ccgacaactg gtggagtgac agctagtttt
61081 ggtatgttgg gggatatcat tattgccgaa cccaacgcct acattgcatt tgcaggtaaa
61141 agagtaattg aacaaacatt gaataaaaca gtacccgaag gttcacaagc agctgaatac
61201 ttattccaga agggtttatt cgacctaatt gtaccacgta atcttttaaa aagcgttctg
61261 agtgagttat ttaagctcca cgcctttttt cctttgaatc aaaagtcaag caaaatcaag
61321 tagagcacta agttcaatta ttttatttgt gtttgtagca aaaaagtagt tagtttgtcg
61381 gaatcaaagt aaataagata ataatggcgc tttctttggt gatagaagat ctaattgtag
61441 aaagaatcaa aactaaagtt gaggataact ctttttttga cctatattcc tgattacgaa
61501 tcaagaagcc tttatcaaca agagtgagtt cttcctttcg tgaaattagg aaaataaaac
61561 gaatttcttc ttcttgtctt aggtatataa tttgaaattc aaatatagat aatagagttt
61621 tgtatctttc tctatctccc gaaaaaccat tttagctaaa aattcatgtt gggtcggatt
61681 cgaacgaatc tttcgataat ctgtaagaaa ctctttatct atttttagaa aattagaaga
61741 caagaacaaa agacaaagaa atgaagaaaa ataataaagt ttattatgat acatatcttt
61801 ctcatgtagg ggatgaataa gtccatttat ttagttctac agttctacat tctttgcact
61861 tattatacct actcagttag atttagatat atagatactt agatctatac taagaatttc
61921 aaattcttca aattctatta ataataaata ttatctaatt tctaattagt aattagaatt
61981 caaattctta atttaattat aattattaca agatatcttt atttatataa taacataata
62041 acagatacaa atagtaaatc gaggtacccc ttctatgaca aatttgaacc ttccatctat
62101 ttttgtgccg ttagtaggcc tagtctttcc ggcaattgca atggcttctt tatttcttca
62161 tgttcaaaaa aataagattg tttagatccg ctgggaccca atctcatcca tttttttttt
62221 gaaaacgtgg acttgtatca taacacagat atctatttat tggaatatag tataacatgt
62281 gatttccacc gaacataaag gaaaaaactc ttatgcccgc agaaatatga tatatggata
62341 tatcaattct aacaattttc aaatagatca ggatcgctgg atggctgaaa tgtagtcggt
62401 gaatctctat gtatatcgat atgtatagtg ggatcgtatt aaataaagag tatgttatta
62461 ttttagattt aaccaatttg atgaattact cctaaaggtt gacatcaaac tagtgctagt
62521 tcacctcaaa ctagtgctag ttgatgagag ttacttcgga aacaaaaaag taaagtcaaa
62581 tttctctggg gtattatctc aattccaata aaatgcaatc gggtaaagta tgacttggcg
62641 atcagaacat atatggatag aacttataac ggggtctcga aaaataagta atttctgctg
62701 ggcctttatc ctttttttag gttcattagg cttcttatta gttggaactt ccagttatct
62761 tggtagaaat ttgatatctt tttttccgcc tcagcaaatc attttttttc cacaaggact
62821 cgtgatgtct ttctacggaa ttgcgggtct ctttattagc tcttatttgt ggtgcacaat
62881 ttcctggaat gtaggtagtg gttatgatcg attcgataga aaggaaggaa tagtctgtat
62941 ttttcgttgg ggatttccgg gaaaaaatcg tcgcatattc ctccgattcc ttataaaaga
63001 tattcagtcc gttagaatag aagttaaaga gggtatttct gctcgtcgtg ttctttatat
63061 ggacatccga ggccaggggt ccattccctt gactcgtact gatgagaatt tgactccacg
63121 agaaattgaa caaaaggctg ctgaattagc ctatttcttg cgtgtaccaa ttgaagtatt
63181 ttgagaaatt gagatatcag tatcaggaaa caatattctg aatttcttca ttcgaagtga
63241 attcttagct tttttctgga ttctttctag attcaaagac taaccacaaa atcacaaaga
63301 aaatagattc attagtccga taccttgtat aaaactcatg tgtgtaagaa atattcgatc
63361 gcatagagtg tacgaatggg ttgattaaca attcacagat gaaaaaatgg caaaaaagaa
63421 agcattcact cctcttttct atcttgcatc tatagtattt ttgccctggt ggatttcttt
63481 ctcagttaat aaatgtctgg aatcttgggt taccaattgg tggaatactg ggcaatccga
63541 aatttttttg aataatattc aagaaaagag tcttctagaa aaattcatag aattagagga
63601 actcctcttc ttggacgaaa tgatcaagga atactcggaa acacatctcg aagagtttgg
63661 gataggaatc cataaagaaa cgatccaatt aatcaagata caaaatgaga atcgtatcca
63721 tacgattttg cacttctcga caaatatcat ctgttttatt attctaagcg ggtattcaat
63781 tttgggtaat gaaaaacttg ttattcttaa ctcttgggct caggaattcc tatataactt
63841 aagtgacaca gtaaaagctt tttctattct tttattaact gatttatgta tcggattcca
63901 ttcaccccac ggttgggaat taatgattgg ctctatctat aaagattttg gatttgttca
63961 taatgatcaa atcatatctg gtcttgtttc cacctttcca gtcattctcg atacaatttt
64021 taaatattgg attttccgtt atttaaatcg tctgtctccg tcacttgtag ttatttatca
64081 ttcaatgaat gactgataaa ggatccattg atattaatct aatccaatta gaatgcttgg
64141 tactttgtag ttgtacataa gcaaagtatt gaaaatcata tttactcttt ctatttctaa
64201 ccatcgggga gattcatcct atattattcc tagattattc cagcaaatag cagaatcgtg
64261 gctagggaac tatactagcg acctacccaa tttattgtag aaattttcgc gatcaatgat
64321 tggaccatgc aaactagaaa tgctttttct tggctaaaga aacagattac tcgatctatt
64381 tccgtatcgc tcatgatata tatcttaact cggacatcca tttcaagtgc atatcccatt
64441 tttgcacagc agggttatga aaatccacga gaagcgactg ggcgtattgt atgtgccaat
64501 tgccatttag ctaataagcc cgtggagatt gaggttccac aagcggtact tcctgatact
64561 gtatttgaag cagttgttcg aattccttat gatatgcaac tgaaacaggt tcttgctaat
64621 ggtaaaaggg gggggttgaa cgtgggggct gttcttattt taccggaggg gtttgaatta
64681 gctcctcccg atcgtatttc tcccgagatg aaagaaaaga ttggcaattt gtcttttcag
64741 agctatcgcc ccaataaaaa aaatattctt gtgataggcc ctgtccctgg tcaaaaatat
64801 agtgaaataa ccttccctat tctttccccg gaccctgcta ctaagaagga tgttcacttc
64861 ttaaaatatc ctatatacgt aggcgggaac aggggaaggg gtcagattta tcccgacggc
64921 agcaagagta acaatactgt ttataatgct acagcagcag gtatagtaag caaaatcata
64981 cgaaaagaaa agggtgggta tgagataacc ataacggatg cgtcggatgg acgtcaagtg
65041 gttgatatta tccctcccgg accagaactt cttgtttccg agggcgaatc tatcaaattt
65101 gatcaaccat taacgagtaa tcctaatgta ggcggatttg gtcagggaga tgcagaaata
65161 gtacttcaag atccattacg tgtccaagga cttttgttct tcttggcatc tgttattttg
65221 gcacaaatct ttttggttct taaaaagaaa cagttcgaga aggttcaatt ggccgaaatg
65281 aatttctaga ttcgcagatt tgtcgacatc aagttcgtaa aaagaaccaa attcttgttg
65341 gcgattattt atgatcaaaa aaatgaaatt ctgaaaactc ctttgtctta tttatactct
65401 tcttcaaaat ctacatacta tgtggtacaa gggattccca gcatctcgta gaaaaagagt
65461 atgtaatgta gaatttgaag aagagtattt gactttcatt atttttattt cgttttttaa
65521 aattggagta gtgtgactat gttactattg acagatttca atgccataag acgtatcaat
65581 agttttctat tctaaataga aagaaagtca aatttgtcta aatactagac ataaggaagc
65641 aggggataaa tgcggggaac aaaaaattct aggagggatt atttgtcttc ctagtcttcg
65701 acacaagaaa ggggtgtaga aaaatccttt tttcttgtgt cgaaacgaaa gagtaatgat
65761 tcttgatcct gtttgttaaa aattcctagt cttggtttcg atttttccag atgtatcaga
65821 aaccctttac cttaccccca ccccctttac gtataatata ctaagtggtg gacaaacaaa
65881 acaaaaaaag agaggaaatt ttattaatta aataaaactt cttcaatcaa cttatcttat
65941 acaaaatttg atgatgaaat atgaaaacaa taaaaaataa atagagtaat gtaatagaga
66001 gagtaaggtt ctacattaga ttagtataga aaggatttgc acgatatcta atatattata
66061 gcagccaaga aattgagtga ttccttcttt cttccaactt tgaaagtacc gatagatact
66121 atcatagaaa aagaagaggt ggtccgaata gtgaattttt caaaaacatg atcagaaaaa
66181 tgagaaaaat ggagtttttg aaaagaaaaa gaaatccatt ttatcattta gacgaaaaaa
66241 atattatgat tcttaagaac tcaacgggcc cttccccttc gaatcaaaca aacaaagaag
66301 ggaattccgt tgagttctta cgctttcatg ttgacgactc aattcattcg attactagag
66361 ggatgaaccc aatccggaat atgaaccata aaagaaaata cctattaaac cgattacaag
66421 aataccagct acagtaccta ttatccaaag aggaatcctt ccagtagtat cggccattta
66481 ccccacttcc ctccagattt catcaagtgg tcatgctaga gacataaaca gtcatggata
66541 attaaattat gagatccttc cgaatgagct aagagaatct tattgattct ctttcgtttt
66601 cttaattgaa gaaataattg gaaaataaaa cagcaagtac aaaaatgagt aataaccccc
66661 agtagagact ggtacgattc aattcaacat tttgttcgtt cgggtttgat tgtgtcgtag
66721 ctctataatt cggattaagt ttatcgttgg atgaactgca ttgctgatat tgatcccaaa
66781 aaaaagacgg taggtacagc taggccgtga acagccaacc atcgtactgt aaaaattgga
66841 taggttcgat ctatagtcat tagggcctcc taaaacgatc tactaaattc atcgagttgt
66901 tccaaaggat caaaacggcc agttattaat ggaattcctt gtcggctctc tgtaaaatac
66961 tcgtttggcc gagggcttcc aaacacatcg taagctaaac cggtgctgac aaataaccaa
67021 cccgcaatga atagggaagg tatagtaatg ctatgaatga cccagtatcg aatactggta
67081 ataatatcag caaacgaacg ttctcctgtg cttccagaca tgctgagctc cacatattct
67141 tgtacagtca aagaagatcg attccgtaaa agatgagatc agtaaatgac aattcactga
67201 aatttcatct ttgtgagatc gtcaatattg taccgaaggc gtctttagag tataccgaat
67261 cagtatagct atccttcttc tgacacagca acgcaatttg aaatagtatc aaaagtaagt
67321 actaaataat ttcttttttc ctttacttgt tgatgtaaaa tcatcttcca ttcaatagaa
67381 aattctttca attcaacgaa agagattctc atattcacac aatttaagta gatgcgagat
67441 atagaaattt gcttttcgta gttgtggaag cagttttgtt gttggaatcc tttttttaaa
67501 gaagaagtta atggtcgagt aagaaataag agtagtagat catattcgag gaaagaaaaa
67561 atcgaataat tggaatccat agttgtgatg cattgttgtg gatctcgatc caaaggttct
67621 ttcttgatct agctacaagg atggggcagt agggaaagat aaaatgtgga acctaataga
67681 aattactagt tttagaatct agttggacaa aaaaaagatt ttttcaagcg attgtgtgat
67741 aactttttct tcttctccat cattcaagat attatgtgaa ttaatatatt actaaatcta
67801 atgagttaaa cttaaatgaa agtaaaaaga aaaagtttta taaggtaact gttcgcttta
67861 aaatcgaaaa tggagtcgat acaattcaac agaatctaag aaatgatcaa attcgaaaat
67921 catttctatt tttattctat aaaaattcaa gtttcatttt tgaatgcagt tagacgatac
67981 agctcttatt agtttaatag tttactcaag agttactcaa tgaatcggtt gattggaatt
68041 gcgggatgga tagatgttac agatgatgaa tcaatttctt ttatatgtct gtcactttat
68101 ctttgttagt gctgtctgcc tataatgata gataaatcaa aaacttttca ttcaacttat
68161 tctttcaatt gaaattgaga tttttgccta tcctcctatt ttattttgaa aaatttgaaa
68221 cttaggtaag tgctttttaa acatatgtat aaaaagaaca tatttcattt aatttagccc
68281 cttcatgctt actataacta gttatttcgg ttttctatta gcggctttaa ctataacctc
68341 agctctattt attggtctga gcaagatacg acttatttaa actgaatatt taaaatgaac
68401 aattcataaa aagaaatcct tctgtgggat tacgcgtatt ctatatttac ttacgttacc
68461 aattgtcaat tcttgttcat tgtcattgag attcatgtca attcggatta atatttaggt
68521 atcgatatta cctctttttt tctcctttca aacaaataaa aatgattgaa gtttttctat
68581 ttggaatcgt gttaggtcta attcctatta ctttggctgg attattcgta actgcatatt
68641 tacaatatag gcgtggtgat cagttggacc tttgattaat taacatctct ttttgattga
68701 cctcctcctt tctttaattc acaggcacag gaggtcaaat tccgattgtt gtgaaagtta
68761 ctgaatgaat ctattttatt ctaattcgat ctaagaagaa aaaaatcacg ctctgtagga
68821 tttgaaccta cgacatcggg ttttggagac ccacgttcta ccgaactgaa ctaagagcgc
68881 tttcttatca gaatagataa gactgtaaac aaaaggattc ttttcataac cccaatacat
68941 tttgtatgca tatactagaa tagcatgata aaaatcaaag attatgtcca atttgaggcg
69001 atctcaattg atccctcgtt actgctcctt tgagcagtaa taggtaggga tgacaggatt
69061 tgaacctgtg acattttgta cccaaaacaa acgcgctacc aagctgcgcc acatcccttc
69121 aattgttcca cagtgtaatt gtagagaatt cctgtcttgt tttccacatg gttatttcct
69181 ccattgatat atacaaattt tctgctcatt tcgtcttttt ggtctcattt aacatataat
69241 agtaaaataa aaggaaaaga cttctcttat agattatata gaaaatactt atatacaatt
69301 atatacaaaa tatataaata cagaacccgt cgtaaaaatc aattagtatt tttcggaaat
69361 tctcggtaag aaagaagggg atgtattttt tttttctgtt ttaagaaaag gaaaatctta
69421 tttcccgaat cattgtacat tgcaatttga attaggaatt ctgtgtccaa ctctaagcag
69481 cccttaacta catatgcatc tgattatata tgtattatct attccaacaa ataatacaaa
69541 agaaggaggt ttttcaatgc gagatctaaa aacatatctc tctgtggcac cagtactaag
69601 tacgctatgg ttcggggctt tagcaggtct attgatagag attaatcgtt ttttcccgga
69661 tgcgttgaca ttcccctttt tttcattcta gttattgtca tgggaaggaa tgaagaagat
69721 tagagatcca atcaaatatt ggtgatgaat ccctctcccc ctcttttctc ttttttccct
69781 ttttagaata agggaggaaa gagaaagaat aaaaaaagtg gattcaacat tcgggctcaa
69841 gttcgaatta actgaatatt aataatagag gaatgggggt agaatagaag atctagggca
69901 agagtattat acaagatact taaatgatta cttcaatttg aaatatactt tagaaaaatc
69961 gttgtatttt actatgactt tgctttacta ttactttatt ttcttgattt taatctttta
70021 cttttagaat tggatttcaa gttagtaact tctattttat cctttcttcg ttttgaatcg
70081 aaaatagaag agttgagtaa atcaaaaatc caaaggaggt tcatggccaa ggggaaagat
70141 gtccgagtaa cggtgatttt ggaatgtact agttgtgtcc gaaacagtgt tgataaggta
70201 tcaagaggta tttccagata tattactcaa aagaaccggc acaatacgcc taatcgatta
70261 gaattgaaaa aattctgtcc ctattgttac aaacatacga ttcatgggga gataaagaaa
70321 tagagcgaac caagtacctg tgtcttaccc tttcaaggaa ggggaaaaaa tgacattata
70381 tatataacat atttaaatag aaaataaaca aatcttattt tttaaaaatc ctattttggg
70441 tggatttaaa ctgaattaga attaagaaat aggattttag ggataaggaa taaattaaac
70501 aaacaaacca tggataaatc caagcgacct tttcttaaat tcaagcgatc ttttcgtagg
70561 cgtttgcccc cgattcaatc gggggatcga attgattata gaaacatgag tttaattagt
70621 cgatttatta gtgaacaagg aaaaatatta tcaagacgag tgaatagatt gaccttgaaa
70681 caacaacgat taattactct tgctataaaa caagctcgta ttttatcttt gttacccttt
70741 ctcaataatg agaaacaatt tgaaagaacc gagtcgaccg ctagaactac tggttttaaa
70801 gcccgaaata aataggctta ctttttcttc acttgaatca taattacaag aatctagatt
70861 tgagtatcgt gtcgtaagaa aaaaaatgaa tcggaaaaaa agatttcttt ttttattgaa
70921 ttgaacgtgt tcattcattt tgactacttt agcatatttt ctcatagaaa tttctactct
70981 accttcccgg agttcattct ccggggaact ccatttaaat tattctggtg gattctttcc
71041 aatctacttc ctttatgatt tcgttcgaaa tcatataaag acaattccta tttgatatag
71101 ctatttgtgc aagtatttta cggttaagaa gcaactgtct cttgtacaga tcgtgtatta
71161 atctactata actataggat actccccttt cgcgaattac tgcgtttatc cgagtgatcc
71221 acaaacgacg aaaatctctc tttttcctat ccctatcccg atgagccgaa actaaagctc
71281 ttattttctg ttgagtaata gttcgagtaa gccttgaatg agccccccga aagcttgatg
71341 caaataaacg aatttttgtt ctacgtctcc gagctatata tccccgttta attctggtca
71401 ttgaataaat gaaactttga cgaataacta atcgattgcc tttctttcag ttattctttt
71461 cccccttcct agtctattaa taacaaaacg gatttttcca atgtataaaa taaaaattcc
71521 aatggctttg gctactctaa ccttcccgac cacgattttt tctttttttt ttttttaggt
71581 atttcactgc gaaataagaa agaaataaaa aattgtattt tcctaggtat caaaaatcta
71641 gtaaataaaa gaaatcaaaa aataaagtag tgggttcctt cgtttctatg gttacttctt
71701 aaacggtgag gtcttctcta tacaccggag cctttacttt atactttaat ttaatattta
71761 atcaactaat tgatgttatt gggaacttgt atagttcaca ctctttggct ctacccatga
71821 attatccagt aataggtctt tcacaatcag atctacctat acagtaagcg gtatttaatt
71881 atgaaagttt gctgggtagc tgaccctctt agtccgttct tgccagagtg ggagcctgcc
71941 taatctttat gttttatgct ttttaaataa gatttcctcc gcttaatgga taaccatttg
72001 ttaccaatgg agaatttctt atcatctgtg attggattta caccaacgga aaccataaac
72061 ttcatacaca atagagggat atgagagagt tttttttaaa taatgaatgg agttccttct
72121 tccatcctat cccattcacc ggtactgatc attgatactg taaaagtcgt tttcttgctt
72181 ttgtgccagc tcatgatcta aacgagtcgc acatacaccc tagtacatgt tcctcgacgc
72241 tgaggacagc cccgaagagc gggggatttc gtgacatttc tgattggctg tcttgtattt
72301 ctaataagtt gtttaatagt tggcatgttg aatcgtatac ataatatgat gggttggttt
72361 agattgatcc taaccgaatg atgatgaatt acttctattt aatagaatat tcaattcgaa
72421 gataaaatct caaatcacag atttgcgcga aatccatgtt attttcattc aaccgctaca
72481 agatcaacaa ttccataagc ttgggcttct gttgctgaca taaaaacatc tctttccata
72541 tcttcggata caacccataa gggtttcccc gttctttgta cataaaccct tgtgagggtt
72601 tcacgcagtt tcagcagttc ttccgcttcc aggacaaatt cgcctgtttg tgcctcataa
72661 aaagaactag caggttgatg gatcattacc ctgatgatat aacaaaataa aagcttcccc
72721 tatctcgcat gataaagcaa agagaaaaga aagataaaga atagaaaaaa gatagaattg
72781 aaccaaccgt acaggccatc ttttgtgcat acggcctcta caagaaaatt gacctcccct
72841 cctttctatt gaagaaagag aaaaaataga atctatcaga ctcagatggg taaatgatca
72901 aattccgatc cttcctttcg gaggagttaa aaaatactat gatggctccg ttgctttata
72961 tgtttatttt ttcttttttt ttttttgtct gtgattcacg aatcccaaag tttcttttta
73021 atccgatcaa ataaggaaaa aagtcttttt tttttttttt cgtactcttt cataacataa
73081 atattgttaa gaactctccg gcatgaaaac aaaaaagttt gtgacgctga actgaactcc
73141 cgatagataa gagaaaatcg gaaatacccc ttatctcata ctactctctc gatacagaat
73201 ctaatgtttt gaaaaaaaaa caatacaaaa atttctcata tcgaattcga agtgccatgc
73261 tattattact tagtattcat atggcgaagg catagtcttc ttttttctct caaataaaaa
73321 cctcattggc gccaagcgtg agggaatgct agacgtttgg taatttctcc tccgaccagg
73381 ataaaagatc ccattgaagc ggctaatccc atgcatattg tatggacatc tggtcgcaca
73441 aattgcatag tatcataaat agccacccca ggtattaccc agcccccagg agagtttata
73501 aacaaataca gatctttggt ctcatcctcg atactgagat ataccataag accaataagt
73561 tgattcgaaa tctcgctatc aacctcttgg cctaaaaaaa gtaatctttc tcgataaagt
73621 cggttgatta gggtaaaatt gtatccctta ggaaccgtac atgcgccttt tgatgcatac
73681 ggttcaaaaa aaaaatggtg aatcaatgta tagattccag tcctctttct ttttttctag
73741 aaaggttctt tcttacttct aacgaaaggg cttttcttcg attttttaat aaagacgagt
73801 tttgactcct tttttatatt ttcgattttc cattataaaa tttgaagtta taagaaaggg
73861 tcattaaact tatcgaatta acttctcatt gatgtattct ttcatcgaga tttaatccaa
73921 accgcgatgg tattttcttg ttcctgaatg ggtctgtttc atctttttag gtttatgctc
73981 tactccgggt aaagatccgc ccgatttgga tttgtacata taggacaaat gctcccatta
74041 ccatttcttt ttgtatttct tttttttttt caattcattt tatacaagta tttcttagag
74101 ttgagataac tttgcttgac aattaggatc tctttacaaa gaaaaaatat gaatagcaat
74161 catagatatc ttaccaatcc aattgggttt tttctaaacg gagcctggat acttcatttt
74221 tttagtccaa ccaagccaac cataaattat tctaattgaa tttttctaat tgataatagt
74281 aatatgaatc ccctcaaaaa tggatctaat tgcacttcac gctccaaatt tttgatgatt
74341 aaatttatct ttcttgggtg aaacggggga tatctcgatc gggggagaga acggggaaat
74401 accatatgac ccaatatatc tgacaagtcg cactatacgt caacccaaga tgcatcttcc
74461 tctccaggac ttcggaaagg gacttttgga acaccaatag gcattaaatg aaagaaagaa
74521 ctaaatacta tatttcactt tgaggtggaa acgtaacaat tttttttatt gtctttataa
74581 tattcatatt ggtttttatc gtatttattt tatccataga ttataaaaat tcataaagaa
74641 agacagaatg aataaactca aattattacg aataggtctt tctaatgata aataagtatg
74701 gactcattcg ctcatagaaa atgggatcaa ctcccccatt gcgtattggt acttatcgag
74761 tatagaataa atctgcttct ctttgttcct acgaacagaa ttgttccatt attaccaaca
74821 gaatagaaca cccttgttcg gaaataatcg actgaacaag agtggtccat aggatagtca
74881 tattatagtc ttttccaatg caataaagtt acgtagtgtc tatttatctt tgatataagg
74941 ggtatttcca tgggtttgcc ttggtatcgt gttcataccg ttgtattgaa tgatcccggt
75001 cggttgcttt ctgttcatat aatgcataca gctctggttg ctggttgggc cggttcgatg
75061 gctctgtatg aattagcggt ttttgatcct tctgatcctg ttcttgatcc aatgtggaga
75121 cagggtatgt tcgttatacc cttcatgact cgtttaggaa taaccaattc atggggcggt
75181 tggagtatca caggggggac tgtaacgaat ccgggtattt ggagttacga aggtgtagct
75241 ggagcacata ttgtgttttc tggcttatgc tttttggcag ctatctggca ttgggtgtat
75301 tgggatctag aaatattttg tgatgaacgt acaggaaaac cttctttgga tttgccaaag
75361 atctttggaa ttcatttatt tctctcaggg gtggcttgct ttggttttgg tgcatttcat
75421 gtaacaggct tgtatggtcc cggaatatgg gtgtccgacc cttatggact aacgggaaaa
75481 gtacaacctg taaatccagc gtggggcgtg gaaggttttg atccttttgt tccaggagga
75541 atagcctctc atcatattgc agcaggaaca ttgggcatat tagcgggcct attccatctt
75601 agcgtccgtc cgccacaacg tctatacaaa ggattgcgta tgggaaatat tgaaaccgtc
75661 ctttccagta gtatcgctgc tgtctttttt gcagcttttg ttgttgccgg aactatgtgg
75721 tatggttcgg caacaacccc gattgaatta tttgggccca ctcgttacca atgggatcag
75781 gggtacttcc agcaagaaat atatcgaaga gttagtgctg ggctagcaga aaatcaaagt
75841 ttatcagaag cctggtctaa aattcctgaa aaattagctt tttatgatta catcggcaat
75901 aatccggcaa aagggggatt attcagagcg ggctcaatgg ataacgggga tggaatagcg
75961 gttggatggt taggacaccc tatctttaga gataaagaag gccgtgaact ttttgtacgt
76021 cgtatgccta ctttttttga aacatttccg gtcgttttgg tagatggcga tggaattgtt
76081 agagccgatg ttccttttag aagggcagaa tcgaagtata gtgttgaaca agtaggtgta
76141 actgttgagt tctacggcgg tgaactcaac ggcgtcagtt atagtgatcc tgctactgtg
76201 aaaaaatatg ctagacgtgc tcaattgggt gaaatttttg aattagatcg tgctactttg
76261 aaatccgatg gtgtttttcg tagcagtcca aggggttggt ttacttttgg gcatgcttcg
76321 tttgctttgc tcttcttctt cggacacatt tggcatggtg ctagaacctt gttcagagat
76381 gtttttgctg gtattgaccc agatttagat gctcaagtcg aatttggagc attccaaaaa
76441 cttggagatc caactacaaa aagacaggca gcctgataca acattacttt ggtatctttc
76501 tttcgccctt attttctttc ttttactttt attgacatag ggtaccagag aaatctttat
76561 ttgaatcaac ttcgttttta ctcttgttcg ttctttatcc ggaagatgac aaaaaaaaga
76621 aaataaaaag aaacaaacag gtatgaaagc tataattgta aaccacgatc gaatctatgg
76681 aagcattggt ttatacattc ctcttagtct cgactctagg gataattttt ttcgctatct
76741 tttttcgaga accgcctaaa gttccaacta aaaagaacta aaaaggtgaa ataattcttc
76801 attatctcag ttgaagtact gagcctcccg ataccgggag gctcagtact tcaactagtc
76861 tccatgttcc tcgaatggat ctcttagttg ttgagaaggt tgcccaaaag cggtatataa
76921 ggcgtaccca gtaaaactta caagtaaacc agatataaag atggcgacta gggttgctgt
76981 ttccattctt atcatattta taaaatttca agaccccaat ggatctatga taggatcgtt
77041 tatttacaac ggaatggtat acaaagtcaa cagatctcaa tgaatacaat aggatttatg
77101 gctacacaaa ctgttgaaaa cagttctaga tctggtccaa gacgaactgc ggtaggagat
77161 ttattaaaac cattgaattc ggaatatggt aaagtagctc ctgggtgggg aactactcct
77221 ttgatgggtg tcgcaatggc cttatttgcg gtatttctat ctattatttt ggagatttat
77281 aattcttccg ttttattgga tggaatttca atgaattaga tctataagaa ccgcaaagtt
77341 cttgcttttg agtccaaaat gaatcattta gagctccgat ttctagtcca ttctattttc
77401 ttttggtagt tcgatcgtgg aatttctttg tttctgtatt tccggagtat gagtgtgtga
77461 cttgttataa ttgatcctat tgatagtaca gagaatgggt ctgtcatctt gatagagatg
77521 gttctacttc gtcagatatt tattctaata tttggaacac gaaatagatt aagaaatatt
77581 tgaactatga ttcatactta atattcagac ctcgtgtccg ggctccaaaa aattttcaaa
77641 caaagaattc taatttctaa atcgaaagat tcttttcttt caacccctat ttatattttg
77701 accaaaagca aaacctttct ttgaattttt agtcattcta tttattcagg gaataagtga
77761 tgatccgagg attcttactc agggaatcct tgatttgatt taggttaggt ttttttattg
77821 aatcatcgtg gttctagtat gaatctgagg ttttaatcga ttcatagggt cttaacaaga
77881 gaattcctat caataataaa gaaaacaaat aataaaagcc atattccaca aaaacaaatt
77941 ctagaaagaa atagggaaaa agagaattca agaggcccat aagtatcaaa ataaagataa
78001 agacgactgc gccaacttga tattttggta ttatcgccac aaagaagagc tttcggattt
78061 tccagagaag atgggatcag aacttaataa atttaaaact ttctattcca tatccgttgc
78121 aactagtatt tgggtgtttt tgcttgagct gtacgagatg aaagtctcat atacggttct
78181 cagaggggga gttccgccta tctcaataaa gtatatgatt ggttcgaaga acgtctcgag
78241 attcaagcaa ttgcggatga tataactagt aaatacgttc ctccccacgt caatatattt
78301 tattgtttag ggggaattac gcttacttgt tttttagtac aagtagctac tgggtttgct
78361 atgacttttt actatcgtcc gaccgttact gaggcttttg cttctgttca atacataatg
78421 actgaagcca actttggttg gttaatccga tcagttcatc gatggtcggc aagtatgatg
78481 gtcctaatga tgatcctgca tgtatttcgt gtgtatctca ccggcggatt taaaaaacct
78541 cgcgaattga cttgggttac aggtgtggtt ctggctgtat taaccgcatc ttttggcgta
78601 actggttatt ccttaccttg ggaccaagtc ggttattggg cagtgaaaat agtaacaggt
78661 gtccctgacg ctattcctgt aataggatca cccttggtcg aattattgcg cggaagcgct
78721 agtgtgggac aatctacttt gacccgtttt tatagtttac acacttttgt attgccgctt
78781 cttactgccg tatttatgtt aatgcacttt ccaatgatac gtaaacaagg tatttctggg
78841 cctttataga gaaaagaaaa atagatccta aatatttgta atcaatcatt tatcacttgg
78901 tggaggaata tatagtattt cattgctaca agtatggatt attgaaaata ataagacatg
78961 gatttggata tttcccttta actattcatg tcaactaaac ggggggattg aagggaattt
79021 tgtgaagaga aaatggatta tgggagtgtg tgacttgaac tattgattgg tctgtgtaga
79081 tatatgcctg ccacatggga attcacaacc aaatgtgtct ttgttccaat cgccgtgtaa
79141 gccctataca gaggataggc tggttcgctt aaagagaatc ttttctatga tcaggtccga
79201 atcatgttgt acatgagcag gctccgtaag atccagtata agtgaactag ataaaacgga
79261 atcaagattc cgttttatct agttcactta taagattaaa tagtatgtaa atgtattcat
79321 ttcctctgca gtgacacgat caatactact atcggagtga aacaagggat ctaaagaaga
79381 agagaggcta gactatatta gtaacaagca aaccttgtat gtgtatctcc aaatattttg
79441 gagataaata ccaattagaa ggtctgagac gacccagaaa gcacttgatc atatcatgat
79501 ctgatttgta agcctacttg ggtcttgagt atttacttgt aagaacggaa ttctttgttt
79561 tgtaatggat agttgcaact ccgtaaaaaa gaattcagtc aaatttttct tacattgaac
79621 cattcctata tcatatatgt gtatgtgtaa atacaggtac catatatata ttttatatgg
79681 atatatggag tcatttggtt ctttttattc ttgctcgagc tggatgatta aaaattatca
79741 tgtccagttc cctcggggga tggatctata agaattcacc tatcccaata acaaaaaaac
79801 ctgacttgaa tgatcctgta ttaagagcta aattggctaa aggtatgggt cataattatt
79861 atggagagcc cgcatggccc aatgatcttt tatatatttt tccagtagta attctaggta
79921 ctattgcatg taatgtaggc ttagccgttt tagaaccatc aatgattggt gaaccggcag
79981 atccatttgc aacccctttg gaaatattac ctgaatggta tttctttcct gtatttcaaa
80041 tacttcgtac agtgcccaat aaattattgg gggttctttt aatggtttca gtacctgcgg
80101 gattattaac agtacctttt ttagagaatg ttaataaatt ccaaaatcca tttcgccgtc
80161 cagtagcgac gactgtcttt ttgattggta ccgcagtcgc cctttggttg ggcattggtg
80221 caacattacc tattgataaa tccctaactt taggtctttt ttaaattttt aaattgattc
80281 aattgtgaaa taacacgaca tgtgtatcta gggaatagtt tcttcaaagc gaattctccc
80341 tagatacatc tattcaattt aattctgaat ttattttgaa tatatgatat attaatatat
80401 taattgtgct aaagagtttc aatctatttt cactaagtaa gtccaataga tttaaaactt
80461 attttttgct aaatcaatta cgaaatattt ttctaaaatg cccaatatcc gttttacatc
80521 ttcgctacga aaatgttcaa ttttcataag atcttcttgg ctgttattca aaaggtccaa
80581 caatgtatat atattggaca ttttgaggca attatagatc ctggaaggca attctgattg
80641 gtcaataaaa atcgatttca atgctatttt ttttttgttt tttatgagtt tagccaattt
80701 atcatgaaag gtaaaagggg ataaaggaac cgtgtgttga ttgtcctgta aatataagtt
80761 gtcttcctcc atatgtaaaa agggaataaa taaatcaatt aaatttcggg atgcttcatg
80821 aagtgcttct ttcggagtta aacttccgtt tgtccatatt tcgagaaaaa gtatctcttg
80881 tttttcattc ccattcccat aagaatgaat actatgattc gcgtttcgaa caggcatgaa
80941 tacagcatct ataggataac ttccatcttg aaagttatgt ggcgttttta taagatatcc
81001 acgatttctc tctatttgta atccaataca aaaatcaatt ggttccgtta aactggctat
81061 atgttgtgta ttatcaacga tttctacata aggcggcaag atgatatctt gggcagttac
81121 agatccagga cccttgacac aaatagatgc gtcagaagtt ccatatagat tacttcttaa
81181 tataatttct ttcaaattca ttaaaatttc atgtaccgat tcttgaatgc ccgttatggt
81241 agaatattca tgtgggactt tctcagattt tacacgtgtg atacatgttc cttctatttc
81301 tccaagtaaa gctcttcgca tcgcaatgcc tattgtgtcg gcttggcctt tcataagtgg
81361 agacagaata aagcgtccat aataaaggcg tttactgtct gttcttgatt caacacactt
81421 ccactgtagt gtccgagtag atactgttac tttctctcga accatagtac tattatttga
81481 ttagatcatc gaatctttta tttctcttga gatttcttca atgttcagtt ctacacacgt
81541 ctttttttcg gaggtctaca gccattatgt ggcataggag ttacatcccg tacgaaagtt
81601 aatagtatac cacttcgacg aatagctcgt aatgctgcat ctcttccgag accgggacct
81661 tttatcatga cttctgctcg ttgcatacct tgatccacta ctgtacggat agcgtttgct
81721 gctgcggttt gagcagcaaa cggtgttcct cttctcgtac ctttgaatcc agaagtaccg
81781 gcggaggacc aagaaactac tcgaccccgt acatctgtaa cagtgacaat ggtattattg
81841 aaacttgctt gaacatgaat aactcccttt ggtattctac gtgcaccctt acgtgaacca
81901 atacgtccat tcctacgcga actaattttc ggtatagctt ttgccatatt ttatcatctc
81961 gtaaatatga gtcagagata tatggatata tccatttcat gtcaaaacag attctttatt
82021 tgtacatcgg ctcttctggc aagtctgatt atccctgtct ttgtttatgt ctcgggttgg
82081 aacaaattac tataattcgt ccccgcctac ggattagtcg acatttttca caaattttac
82141 gaacggaagc tcttattttc atatttctca ttccttacct taattctgaa tctatttctt
82201 ggaagaaaat aagtttcttg aaatttttca tctcgaattg tattcccacg aaaggaatgg
82261 tgaagttgaa aaacgaatcc ttcaaatctt tgttgtggag tcgataaatt atacgccctt
82321 tggttgaatc ataaggactt acttcaattt tgactctatc tcctggcagt atccgtataa
82381 aactatgccg gatctttcct gaaacataat ttataatcag atctaaacaa acccggaaca
82441 gaccgttggg aaggcgattc agtaattaaa gcttcatgac tcctttttgg ttcttaaagt
82501 ccctttgagg tatcaactaa taagaaagat attagacaac cccccttttt tctttttcac
82561 aaataggaag tttcgaatcc aatttggata ttaaaaggat taccagatat aacacaaaat
82621 ctctccacct attccttcta gtcgagcctc tcggtctgtc attatacctc gagaagtaga
82681 aagaattaca atccccattc cacctaaaat tcgcggaatt cgttgataat tagaatagat
82741 tcgtagacca ggtcgactga ttcgttttaa atttaaaata tttctatagg gtcttttcct
82801 attccttcta tgtcgcaggg ttaaaaccaa aaaatatttg tttttttctc gatgttttct
82861 cacgttttcg ataaaacctt ctcgtaaaag tatttgaaca atattttcgg taatattagt
82921 agatgctatt cgaaccaccc tttttcgatc catatcagca tttcgtatag aagttattat
82981 ctcagcaata gtgtccctac ccatgatgaa ctaaaattat tggggcctcc aaatttgata
83041 taatcaacgt gttttttact tatttttttt ttgaatatga tatgaattat taaagatata
83101 tgcgtgagac acaatctact aattaatcta tttctttcaa ataccccact agaaacagat
83161 cacaatttca ttttataata cctcgggagc taatgaaact attttagtaa aatttaattc
83221 tctcaattcc cgggcgattg caccaaaaat tcgagttcct tttgatttcc ttccttcttg
83281 atcaataaca actgcagcat tgtcatcata tcgtattatc atcccgttgt cacgtttgag
83341 ttctttacag gtccgcacaa ttacagctct gactacttct gatctttcta ggggcatatt
83401 tggtacggct tctttgatca cagcaacaat aacgtcacca atatgagcat atcgacgatt
83461 gctagctcct atgattcgaa tacacatcaa ttctcgagcc ccgctgttat ccgctacatt
83521 taaatgggtc tgaggttgaa tcattttttt aatccgttct ttgaatgcaa agggcgaaga
83581 aaaaaaagaa atatttttgt ccaaaaaaaa agaaacatgc ggtttcgttt catatctaag
83641 agccctttcc gcattttttt ctattacatt acgaaataat gaattgagtt cgtataggca
83701 ttttagatgc tgctagtgaa atagcccttc tggctatatt ttctgttact ccacccattt
83761 cataaagtat tcgacccggt ttaacaacag ctacccaata ttcaggggat ccttttcctg
83821 aacccatacg tgtttctgcg ggtcttagtg taactggttt gtctggaaat atacgtaccc
83881 atatttttcc accacgacgt gcatttcgtg tcattgctcg tcggcctgct tctatttgtc
83941 tagatgtaat ccaagcaggt tcaagtgcct gaagagcata tttaccgaaa gaaatatgat
84001 tacctcgatg agatattccc ttcattcttc ctctatgttg tttacggaat ctggttcttt
84061 tggggttata gttgatggtt gtttctgaat tccatctcta ctacagaacc ggacgtgaga
84121 gtttcttctc atccagctcc tcgcgaataa aaggattcaa aaaatttaat tagaattaag
84181 ctagaatagt caatcttaag ttaagatata tatgtattta ctgagtaata ccttgaacgt
84241 gggattcttt gagatttcat tcaatctatt agtaatttgt atatcttgtt tgaatagata
84301 actaaacttt tgagttttat aaatagaaat ctaaaaaaaa attgtattat tataccaaat
84361 ccttattttg tcctttattg tattgtccta aattttgcaa taaaaaaagt tttcgcgggc
84421 gaatattgac tctttcaatc cctatttcat ttgtagggtt aactcgtgac ttctcagatc
84481 tccgaataca tgaattaatc tctggttcgt tccgccatcc cgaccagtga atcattaaga
84541 ttcctttttc aatagaatct tttgcattca caagttccgt cgttcccatc acttcttact
84601 taatggttag gtccgaattc tacaatggag ctcagaatga aattggttct tgagtcaatc
84661 ttctcagtct ttattggctc gaagctcttg attttttgtt ctatttctat aagaagattc
84721 attttattat ggtatgaatg cgtattgatg ctttattaca ctgcctttta tgagattact
84781 catagacctt acatattgga attttatatc attggtattc tttttctctc tttctctcat
84841 ccttccattt atccacatct tttttgtcta ttttgcttta caacttagaa tcagatttcc
84901 ttttttgttt atgcaaaaga tttcagttgc tacaaagata tgacctatat atcatatctt
84961 gactggttct ttagatccag ataatgcgaa gtgatgggtt ggttattagt tctatagttt
85021 ttagttcata ctatgtgggc tggtcttttt taatcctaac cctaaaaaac caacggagtc
85081 acacactaag catagcaatt atatcaaatg gtcaatcgaa tttttattca accttataga
85141 attaagaatt agaaatgttt cccttgattg attagaaaaa gaatgaattt gtcttttttt
85201 gttcaatcat tggatagaag ggaaagacaa gtagtaaaat tattcctcgt ctagaaatat
85261 ccaaattttg atgcccaata ctccatagat agttcgaact gtataagagc aataatcaat
85321 tttcgctcga atcgtttgta ggggaaccct accttctctg atccattcga cacgtgcaat
85381 ttcttttccg tcgatacgcc ccgcaatttg tatttgaatt ccttttgtat ctgcttgttc
85441 tgttaattca atagcctttt tcattgcttt tcgaaaggaa actctattct ttaattgtcc
85501 agctataaat tctgcaagaa tattagggtt tccataaggt tttgcaattc ttgtgacagc
85561 aatgttcagt tttcggttta cacaatgaaa ttctttttgt aaggtcgttt gtaattcttc
85621 gattccgcgc ggtcgacttt ctattaataa ttttgggaat cccataaaga ttatgacctg
85681 gatcagatcg attctttttt gaatctctat acgtgcaatt ccctcgacgc cagaggacgt
85741 tctcatattc ttttgtacat aattcttgat acaatctctt attttttgat cttcttgtaa
85801 accttcagaa taattttttg gttgtgaaaa ccaaagggaa tgatgacctt gggttgtacc
85861 cagtctgaaa ccaagtggat ttattttttg tcccataatc ccccactatt atacatatca
85921 cgatacggca tagctgtaga tttttttttc catctcgttt tttttaacga atacatctct
85981 acatattcat catctaaaga tatatctttc attacaatag ttatatgaca ggtcgatctt
86041 tttattggaa aactacgtcc tcgagctcga ggtttcaatt tcttcacagt agtacctcca
86101 ttgacttcgg ctttactaat gactaaattg gcttcgctgg aacccatatt gtaactagca
86161 tttgctgctg cagaataaat caatttcaaa atgggataac atgctcgata gggcatgagt
86221 tctagtatca taagcgtttc ctcataggaa cggccgcgaa tttgattaat tactcttcgt
86281 gctttgtcag cagacataga tatatgttca cctaaagcat atacttctgt ttttttcttc
86341 tttagcataa ggtttgcctc ctactactga atcataagca tctagatttt ttttattaat
86401 attaacgacg agatctatta tcgctttttg catgtcctct aaaatttaat gtaggtgcaa
86461 attctcccaa tttgtggcct accatactat ccgttatata aataggcaaa tgctcttttc
86521 cattatggat agcaatcgta tgaccgatca ttgtgggtat aatggtagat gcccgggacc
86581 aagttactat tatttctttt tctgcttttg tgttaagctt atcaattttt tttaataaat
86641 gattggctac aaagggattt ttttttagtg aacgtgtcac aagcttactc ctattttttt
86701 tttttttgta aaaacgaaga atttaattcg attttctctc ctatttacta cggcgacgaa
86761 gaatcaaatt atcactatat ttattccttt ttctacttct tcttccaagt gcaggataac
86821 cccaaggggt tgtgggtttt tttctaccaa ttggggctct cccttcacca cccccatggg
86881 gatggtctac agggttcata actactcctc ttactacagg acgcttacct agccaacgct
86941 tagatccggc tctacccaaa cttttctggt tcaccccaac attccccact tgtccgactg
87001 ttgctgagca gtttttggat atcaaacgga cctccccaga aggtaatttt aatgtggccg
87061 atttcccctc ttttgcaatc agtttcgcta cagcacccgc tgctctagct aattgtccac
87121 cctttccaag tgtgatttct atgttatgta tggccgtgcc taagggcata tcggttgaag
87181 tagattcttc ttttgatcaa tcaaaacccc ttcccaaact gtacaagctt cttccaaagc
87241 atacttcttt ctggatgtag atgatgatat ctatacagat ggatcttata tatatcgtag
87301 aatgaagtac cacatgggtg gatatatata tgaatccaaa tctgccgaat cactcatgtt
87361 atgatcttct acatcctggg tcttcccgtt ccgtcatctg gcttatgttc ttcatgtagc
87421 attcagaccg aatgactcta tgaaattacg tcgatacttc cacatattat gggtaacgta
87481 ggagacatct ctatttttcc cccggggaat ctttagaatt cccactgctt aactttcaat
87541 tcgcctctga ccatcaaatg aaatgtgaat aacccgtcct cctctctttg aaagaagggg
87601 cgcttccggt tctgtcggtg cttgaaacaa ttttgtcttc tccatattac tatatctcta
87661 gagtcaataa ttttatatga ggaactactg aactcaatca cttgctgccg ttactcttca
87721 gttttctgtt gaggtctatc ctgcagaggt actcaaattg gatcagtgat cgatttctag
87781 gtttcgtcgt aaacctaatt ggttatttcc aattacgtaa atcaatagtt caaaccgcac
87841 tcaaaggtag ggcatttccc atttttatag gaacttctgt accagaaaca atggtatctc
87901 caattatagc ccctctggga tgtaaaatat atctcttctc accatcccca tagtgtatga
87961 gacaaatgta tgcatttcga ttagggtcgt attctatggt tacgattcta ccatatatgt
88021 ctttttcatt ccgtcgaaaa tcgattttac ggtatagacg cttatgacct ccccctctat
88081 gccttgcggt aatgattcct ctggcattac gacctttacc acaatgatgc tgtccataga
88141 tcaaattatt tcgtggattg gatttcactt gactgtctac ggttccattg cgtgtgctcg
88201 gggtagaagt tttgtataaa tgtatcgcca tgctattaag tatttttttt taagttcttt
88261 tctttctaag aggtggaata gaataacccg gttgaagcgt aatgatcata cgtctgtaat
88321 gcattgtatg tcccataata ggtcccattc ttctactctt tcccggaagt cgatgactat
88381 tcatagctat taccttgaca ccaaagaaga gttcgaccca atgctttatt tctgtcctag
88441 ttgatcctga ttcgacatta gaagtatatt gatttttccc caataaccga atacttttgt
88501 ctgtaaatac tgcatatttg attccatcca taaatcgatt ttcttcccta tgagttatag
88561 tctcaataag aatgctagtt cttactgttc atatattatg atatgaatat accacaccaa
88621 ttcgttatgt atggatgatg agattccatt gatacagagc caattccaat agacttattg
88681 gagggtccca ttggcgtgca tccagtagga attgaaccta cgaattcgcc aattatgagt
88741 tgggcgcttt aaccattcag ccatggatgc ttagcgggga tcctcgtaca tggtgaataa
88801 ccaaattcca attgaaatga aatctttagg ataaatcaat gcaatttagt taggataaat
88861 caatgcaatt taggaggaat caatgagagg acatcaattc aaatcctgga ttttcgaatt
88921 gagagagata ttgagagaga tcaagaattc tcaccatttc ttagattcat ggacccaatt
88981 caattcagcg ggatccttca ttcacatttt tttccaccaa gaacgttttc taaaactctt
89041 tgacccccga atttggagta tcctactttc acgcaattca cagggttcaa caagcaatcg
89101 atatttcacg atcaagggtg taatactctt tgtagtagcg gtccttatat atcgtattaa
89161 caatcgaaat atggtcgaaa gaaaaaatct ctatttgata gggcttcttc ctatacctat
89221 gaattccatt ggacccagaa atgatacatt ggaagaatcc gttgggtctt ccaatatcaa
89281 taggttgatt gtttcgctcc tgtatcttcc caaaggaaaa aagatctctg agagttgttt
89341 cctgaatccg aaagagagta cttgggttct cccaataact aaaaagtgta gcatgcctga
89401 atctaactgg ggttcgcgtt ggtggaggaa ctggatcgga aaaaagaggg attctagttg
89461 taagatatct aatgaaaccg tcgctggaat tgagatctta ttcaaagaga aagatctcaa
89521 atatctggag tttctttttg tatattatat ggatgatccg atccgcaagg accatgattg
89581 ggaattgttt gatcgtcttt ctctgaggaa gagtcgaaat agaatcaact tgaattcggg
89641 accgctattc gaaatcttag tgaaacactg gatttcttat ctcatgtctg cttttcgtga
89701 aaaaatacca attgaagtgg agggtttctt caaacaacaa ggggctgggt caactattca
89761 atcaaatgat attgagcatg tttcccatct cttctcgaga aacaagtggg ctatttcttt
89821 gcaaaactgt gctcaatttc atatgtggca attccgccaa gatctcttcg ttagttgggg
89881 gaagaatccg cccgaatcgg attttttgag gaacgtatcg agagagaatt ggatttggtt
89941 agacaatgtg tggttggtaa acaaggatcg gttttttagc aaggtacaga atgtatcgtc
90001 aaatattcaa tatgattcca caagatctag tttcgttcaa gtaacggatt ctagccaact
90061 gaaaggatct tctgatcaat ccagagatca tttggattcc attagtaatg aggattcgga
90121 atatcacaca ttgattaatc aaagagagat tcaacaacga aaagaaagat cgattctttg
90181 ggatccttcc tttcttcaaa cggaacgaaa agagatagaa tcaggccgat tcccgaaatg
90241 cctttctgga tattcctcaa tgtcccggct attcacggaa cgtgagaagc agatgattaa
90301 tcatctgttt ccggaagaaa tcgaagaatt tcttgggaat cctacaagat ccgttcgttc
90361 ttttttctct gatagatggt cagaacttca tctgggttcg aatcctactg agaggtccac
90421 tagggatcag aaattgttga agaaacaaca agatctttct tttgtccctt ccaagcgatc
90481 ggaaaataaa gaaatggtta atatattcaa gataattacg tatttacaaa ataccgtctc
90541 aattcatcct atttcatcag atccgggatg tgatatggtt ccgaagatga accggatatg
90601 gacagttcca ataagatttc attcttgaac aaaaatccat tttttgattt atttcatcta
90661 ttccatgacc ggaacagggg aggatacacg ttacactacg attttgaatc agaagagaga
90721 tttcaagaaa tggcagatct attcactcta tcaataaccg agccggatct ggtgtatcat
90781 aagggatttg ccttttctat tgattcctgc ggattggatc aaaaacaatt cttgaatgag
90841 gccagggatg aatcgaaaaa gaaatcttta ttggttctac ctcctatttt ttatgaagag
90901 aatgaatctt tttctcgaag gatcagaaaa aaatgggtcc ggatctcctg cgggaatgat
90961 ttggaagatc caaaaccaaa aatagtggta tttgctagca acaacataat ggaggcagtc
91021 actcaatata gattgatccg aaatctgatt caaatccaat atagtaccta tgggtacata
91081 agaaatgtat tgaatcgatt ctttttaatg aatagatccg atcgcaactt cgaatatgga
91141 attcaaaggg atcaaatagg aaaggatact ctgaatcata gaactataat gaaatatacg
91201 atcaaccaat atttatcgaa tttgaaaaag agtcagaaga aatggttcga gcctcttatt
91261 ttgatttctc gaaccgagag atccatgaat cgggatcctg atgcatatag atacaaatgg
91321 tccaatggga gcaagaattt ccaggaacat ttggaacagt ccgtttcgga gcagaagagc
91381 cgttttcaag tagtgttcga tcgattacgt attaatcaat attcgattga ttggtctgag
91441 gttatcgaca aaaaagattt gtctaagcca cttcgtttct ttttgtccaa gtcacttctt
91501 tttttgtcca agttgctttt ctttttgtct aactcacttc cttttttctg tgtgagtttc
91561 ggaaatatcc ccattcatag gtccgagatc tacatctatg aattgaaagg tccgaatgat
91621 caactctgca atcagttgtt agaatcaata ggtcttcaaa ttgttcattt gaaaaaatgg
91681 aaacccttct tattggacga tcatgatact tcccaaaaat cgaaattctt gatcaatgga
91741 ggaacaatat caccattttt gttcaataag ataccaaagt ggatgattga ctcattccat
91801 actagaaata atcgcaggaa atcctttgat aacccggatt cctatttctc aatgatattc
91861 cacgatcaag acaattggct gaatcccgtg aaaccatttc atagaagttc attgatatct
91921 tctttttata aagcaaatcg acttcgattc ttgaataatc cacatcactt ctgcttctat
91981 tggaacacaa gattcccctt ttctgtggaa aaggcccgta tcaataattc tgattttacg
92041 tatggacaat tcctcaatat cttgttcatt cgcaacaaaa tattttcttt gtgcgtcggt
92101 aaaaaaaaac atgctttttg ggggagagat actatttcac caatcgagtc acaggtatct
92161 aacatattca tacctaacga ttttccacaa agtggtgacg aaacgtataa cttgtacaaa
92221 tctttccatt ttccaagtcg atccgatcca ttcgttcgta gagctattta ctcgatcgca
92281 gacatttctg gaacacctct aacagagggg caaatagtca attttgaaag aacttattgt
92341 caacctcttt cagatatgaa tctatctgat tcagaaggga agaacttgca tcagtatctc
92401 aatttcaatt caaacatggg tttgattcac actccatgtt ctgagaaaga tttatcatcc
92461 gaaaagagga aaaaacggag tctttgtcta aagaaatgcg ttgagaaagg gcagatgtat
92521 agaacctttc aacgagatag tgctttttca actctctcaa aatggaatct attccaaaca
92581 tatatgccat ggttccttac ttcgacaggg tacaaatatc taaatttgat atttttagat
92641 actttttcag acctattgcc aatactaagt agcagtcaaa aatttgtacc catttttcat
92701 gatattatgc atggatcagg tatatcatgg cgaattcttc agaaaaaatt gtgtcttcca
92761 caatggaatc tgataagtga gatctcgagt aagtgtttac ataatcttct tctgtccgaa
92821 gaaatgattc atcgaaataa tgagtcacca ttgatatcga cacatctgag atcgccaaat
92881 gctcgggagt tcctctattc aatccttttc cttcttcttg ttgctggata tctcgttcgt
92941 acacatcttc tctttgtttc ccgggcctct agtgagttac agacagagtt cgaaaaggtc
93001 aaatctttga tgattccatc atctatgatt gagttgcgaa aacttctgga taggtatcct
93061 acatctgaac cgaattcttt ctggttaaag aatctctttc tagttgctct ggaacaatta
93121 ggagattctc tagaagaaat acggggttct gcttctggcg gcaacatgct tggtcccgct
93181 tatggggtca aatcaatacg ttctaagaag aaagattgga atatcaatct catcgagatc
93241 atcgatctca taccaaatcc catcaatcga atcacttttt cgagaaatac gagacatcta
93301 agtcatacaa gtaaagagat ctattcattg ataagaaaaa gaaaaaacgt gaacggggat
93361 tggattgatg ataaaataga atcctgggtc gcgaacagtg attcgattga tgatgaagaa
93421 agagaattct tggttcagtt ctccacctta acgacagaaa ataggattga tcaaattcta
93481 ttgagtctga ctcatagtga tcgtttatca aagaatgact ctggttatca aatgattgaa
93541 caaccgggag caatttactt acgatactta gttgacattc ataaaaagca tctaatgaat
93601 tatgagttca atccatcctg tttagcagaa agacggatat tccttgctca ttatcagaca
93661 atcacttatt cacaaacttc gtgtggggaa aatagttttc atttcccatc tcatggaaaa
93721 cccttttcgc tccgcttagc cttatccccc tctaggggta ttttagtgat aggttctata
93781 ggaactggac gatcctattt ggtcaaatac ctagcgacaa actcctatgt tcctttcatt
93841 acggtatttc tgaacaagtt cctggataac aagcctaaag gttttcttct tgatgagatc
93901 gatattgatg atagtgacga tattgatgat agtgacaatc ttgatgctag tgacgatatc
93961 gatcgtgacc ttgatacgga gctgaaactg ctaactagga tgaatgggct aactatggat
94021 atgatgccgg aaatagaccg attttatatc acccttcaat tcgaattagc aaaagcaatg
94081 tctccttgca taatatggat tccaaacatt catgatctgg atgtgaatga gtcgaatgac
94141 ttagccctcg gtctattagt gaaccatctc tccagggatt gtgaaagatg ttctactaga
94201 aatattcttg ttattgcttc gactcatatt ccccaaaaag tggatcccgc tctaatagct
94261 ccgaataaat taaatacgtg cattaagata cgaaggcttc ttcttccaca acaacgaaag
94321 cactttttca ctctttcata tactagggga tttcacttgg aaaagaaaat gttccatact
94381 aacggattcg ggtccataac catgggttcc aatgcacgag atcttgtagc acttaccaat
94441 gaggtcctat cgattagtat tacacagaag aaatcaatta tagacactaa tacaattaga
94501 tccgctcttc atagacaaac ttgggatttg cgatcccagg taagatcggt tcaggatcat
94561 gggatccttt tctatcagat aggaagggct gtagcacaaa atgtacttct aagtaattgc
94621 cccatagatc ctatatctat ctatatgaag aagaaatcat gtaacgaagg ggattcttat
94681 ttgtacaaat ggtacttcga gcttggaacg agcatgaaga gattaacgat acttctttat
94741 cttttgagtt gttctgccgg atcggtcgct caagatcttt ggtctttatc cggacccgat
94801 gaaaaaaatg ggatcacttc ttatggactc gttgagaatg attctgatct agttcatggc
94861 ctattagaag tagaaggcgc tctggtggga tcttcacgga cagaaaaaga ttgcagtcag
94921 tttgataatg atcgagtgac attgcttctt cggcccgaac cgaggaatcc cttagatatg
94981 atgcaaaacg gctcttgttc tatccttgat cagagatttc tctatgaaaa atatgaatcg
95041 gagtttgaag aaggggaggg agaaggagcc cttgacccgc aggaggattt attcaatcac
95101 atagtttggg ctcctagaat atggcgccct tggggctttc tatttgattg tatcgaaagg
95161 cccaatgaat tgggatttcc ctattggtcc aggtcatttc ggggcaagcg gatcatttat
95221 gatgaagagg atgagcttca agagaatgat tcggagttct tgcagagtgg aaccatgcag
95281 taccagacac gagatagatc ttccaaagaa caaggccttt ttcgaataag ccaattcatt
95341 tgggaccctg cagatccact ctttttccta ttcaaagatc agccccctgg ctctgtgttt
95401 tcacatcgag aattatttgc agatgaagag atgtcaaagg ggcttcttac ttcccaaaca
95461 gaccctccta catctatata taaacgctgg tttatcaaga atacgcaaga aaagcacttc
95521 gaattgttga ttaatcgtca gagatggctt agaaccaaca gttcattatc taatggatct
95581 ttccgttcta atactctatc cgagagttat cagtatttat caaatctgtt cctatctaac
95641 ggaacgctat tggatcaaat gacaaagaca ttgttgagaa aaagatggct tttcccggat
95701 gaaatgaaaa ttggattcat gtaacaggag aaagatttcc cattccttag ccggaaagat
95761 atgtggccat gaaagaggga ttaagtggaa cagaattgac tgggtggtag agtcgtggaa
95821 acgcttgttt cttccatatt ttggacctta gctccatgga agaatatgtt actgctgaaa
95881 cacggaagaa ttgaaatctt agatcaaaac actatgtatg gatggtatga actgcctaaa
95941 caagaattct tgaacagcaa acaaccagtt cagatattca cgaccaagaa gtactggatt
96001 ctctttcgga taggccctga aaggagaagg aaggctggaa tgccaacagg cgtctattat
96061 attgaattta cccgatagtc cccattttgg gaacgtccag tgccaaagtc actgaatggg
96121 taagtcgcca atccctggac tatgtaatgt actttatctg ctgggttacg ggcgggcatt
96181 ttaccagagg tttctaatct acccttgtgt gattcctgtt gaagcatata ctcggggggt
96241 gggtgcaggg cggacgattt taaagcggac tccccattca ttagatagag aagatcacca
96301 agatttcgcg atccgctgcc gaatttattc caattccaag agctcggatc gaatcggtat
96361 atcaataccg attcgatccg agctctctta ttgagaatgc tcattcaatg agcattctca
96421 atattatgcc ttgaagagga ctcgaacctc cacgctattt agcacgagat tttgagtctc
96481 gcgtgtctac catttcacca ccaaggcatc ttgaaagtga atcgtattcc atgaatatga
96541 tatctatcta gtgtgatgta tggaatatat gacaaaggtg gatctattga tcggtcatgt
96601 catataggcc cgagttggac atccaattgc ttcgatttga attatccgga gaatgcaatg
96661 cctgatatat atcaaaaaga tggacaatca aacctatttc tcgattcact caaagaggtg
96721 aatagggtcc caatagagat atgtaaaaag caggtccgat tacgcgtatt cctaatccta
96781 aatggaatgt aatgatgtag gaatccatat gtaaacatag tatctattta gataggcccg
96841 aatgacccct tctcataatg agaatgtata taaccctatt ccggcctggt ccggtatgga
96901 atgaacttat aatcatggaa tcgactcgat catcagatta taagttcata accctagccc
96961 attcccattt tgggcggaac agatctacta attctttgat tccagttagt aagagggatc
97021 ttgaactaag aaatagaccc tagaagctaa aaaaggctat cctgagcaat tgcaataatt
97081 gggttcattg atattcctgg tatagtagat gctatcacac atacaatcat actcaattcg
97141 atggaattgt ttgatcttaa aggggatctt ctataatttc gcacgtgagg ggttatttct
97201 tggtttcgtc cagtcattaa taactttatt atttttagat aatagtagat agaaacaacg
97261 cttgtaagga gtcctattaa aaccaagaaa tataggcctg cctgccatcc acaccagaat
97321 aaatagagtt ttccgaaaaa acctgctagt ggaggaagac ctcctaggga taagagacat
97381 agggctaaag agagagccaa aaaaggatct tttgtgtata atcctgcata atctcgaatg
97441 ttatcagttc cggtacgtag accaaataat acaatgcaag caaaagttcc tagattcatg
97501 gagatataga acagcatata agttatcatg cttgcatatc catcatttga gtctccaaca
97561 attattccaa taattacata tccgatttgg cctatggacg aatatgcaag catacgtttc
97621 atgcttgttt gagtaatagc aatgagattt cccaatatca tgctaagaat agctaggatt
97681 tccagaagaa gatgccattc gtttgatgag aaataaaaag gaatatcgaa aattcgagtg
97741 gctgaagctg aagcagctac tttcgaagta acagaaagaa aagcaacgac tggagtggga
97801 gagtcagagt cgaaaagagg attcctcact tctttctctc attcaaaacc gtgcatgaga
97861 ctttcatctc acacggctcc taagtgataa aagaaagaag aacccatttt ctttcttttt
97921 tgattacctt cctcgcgtat gtataagacc gaatccattc gatttctaaa aaggattact
97981 aatccttaac ttttcgagga atccttcatc agtggttgtg aatgactgat tttttcaatc
98041 ttttcgacct tggtttcgta ggagcaagtc agaaagattg agaaatagaa ccatctgatt
98101 taattcgttc tcaatagcca cgagatgatc atcttagggt gatccttttg tcgacggatg
98161 ctcttattac actcgtagtc tctgaaggat gagaaccaac tatgtagcat ctacatcgag
98221 aattcaagta ttgtatacgt cattagtccg atcctttgta ggaactaccc gtaataacga
98281 acttgcaaaa tggatctgtt tatcataaag agattcgtcg ttcctgaccc tgcttcacct
98341 taattgttat ttgaacaagt aaaagttctg tcttggtccg agtggggata gcatttctct
98401 tctgcatgtc catggagttt tgaaaaatcc aaacatctca gagatagata gagaggtagg
98461 aatttctcga acgaaccgca ctccttcgta tacgtcagga gtccattgat gagaaggggc
98521 tggggaaagc ttgaacccaa ttcctacggt aatgaatatg agcgcaattg aaattcctgg
98581 ggagttatac atttgtgtat tgataagacc gtttactatt tcttgaagct caatctctcc
98641 cccggatgaa ccatatagcc aagagaaacc atgaaccaga atagaagagc ttgccccacc
98701 catgagtaaa tatttcatag tagcctcatt agaccgtaca tctttcttgg tatatccaga
98761 taataggtag gagcataaac tgaaacattc tggggctaca aagatagtta ttaaatcgtt
98821 agcaccgcat aaaaacattc cccctagagt agctgttaat acgaataaga gaaactctgt
98881 tatagccatt tctgtacatt caatgtactc tacggataga ggaatacata gagttgaaca
98941 tagtaaaata agaaattgaa agatttcgtt gaaattgttc gtttggaaat ttcccgaaaa
99001 gctaatcata ggttcttctc tccatcggaa caatagggcc gttatgctca ttactaaact
99061 tgttgaagag atgaaatata accaaggtat atctttttga tcagaggttg aatcgatcat
99121 cagaagaaga attaggccaa aaattaggat acattctggg aaaatcaaac ttccatcgaa
99181 gagaagcaaa tgaaaggctt tcataaaaat tctcgtagaa tcgagaatga agttttcatt
99241 ctgtacatgc cagatcatga attagtaact gcttccaatt tccaaaaaaa atcccaattg
99301 tgtcgaactt tccatttttg gaatagttac ggaatctcca tgaataggat caaaccttat
99361 tccatggtat ttacatgagg ttcctcttta agaaagtccc cgagaggctt agttgatcca
99421 tgatttatgt ttcatctttc cttttcgttt gtttcgagaa atctatcgat caattccgat
99481 tctttctttt tctcttgatt cttttccgat cgagatgtat agatcctgtt catggattaa
99541 cgaaaatgtg caaaagctct atttgcctct gccattctat gagtctcttc ctttttgcgt
99601 atggcatcgc cactcccttt ggcagcatcc actaattcgg aacttaattt gaaagccata
99661 tttcgacccg gacgttttcg ggatgccgct aataaccaac gaatggcaag tgcttttcct
99721 tgtgtggatc ctatttcaat gggaacttga tgagtcgatc cacctacacg tcttgctttt
99781 actgttatat cgggagttac tccacgtatt gcttgacgta aaacggatag tggatttgtt
99841 tctgtctttt gttgaatctt tttcacggct cgatagataa tttgataagc caatgatttt
99901 tttccgtgtt tcagaatacg gttaaccaac atgttaacta atcgattacg ataaattgga
99961 tcggattttg ctgttttttt ttctgcagta cctcgacgtg acatgagcgt gaaaggggtt
100021 caagaatcag ttttcttttt ataagggcta aaatcactta ttttggcttt tttaccccat
100081 attgtagggt ggatctcgaa agatatgaaa gatctccctc caagccgtac atacgacttt
100141 catcgaatac ggctttccgc agaattctat atgtatctat gagatcgagt atggaattct
100201 gtttactcac tttaaattga gtatccgttt ccctcccttt cctgctagga ttggaaatcc
100261 tgtattttac atatccatac gattgagtcc ttgggtttcc gaaatagtgt aaaaagaagt
100321 gcttcgaatc attgctattt gactcggacc tgttctaaaa aagtcgaggt atttcgaatt
100381 gtttgttgac acggacaaag tcagggaaaa cctctgaaat tatttcaata ttgaaccttg
100441 gacatataag agttccgaat cgaatctctt tagaaagaag atcttttgtc tcatggtagc
100501 ctgctccagt ccccttacga aactttcgtt attgggttag ccatacactt cacatgtttc
100561 tagcgattca catggcatca tcaaatgata caagtcttgg ataagaatct acaacgcact
100621 agaacgccct tgttgacgat cctttactcc gacagcatct agggttcctc gaacaatgtg
100681 atatctcaca ccgggtaaat ccttaaccct tccccctctt actaagacta cagaatgttc
100741 ttgtaaatta tggccaatac cgggtatata agcagtgatt tcaaatccag aggttaatcg
100801 tactctggca actttacgta aggcagagtt tggttttttt ggggtgatag tggaaaagtt
100861 gacagataag tcacccttac tgccactcta cagaaccgta catgagattt tcacctcata
100921 cggctcctcg ttcaattctt tcgaattcat tggatccttt ccgcgttcga gaatcccccc
100981 cttcttccac tccgccccga agagtaacta ggaccaattt agtcacgttt tcatgttcca
101041 attgaacact gtccattttt gattattctc aaaggataag attattctct ttaccaaaca
101101 tatgcggatc caatcacgat cttatatata agaagaacaa aagatctttc ttgatcaatc
101161 cctttgcccc tcattcttca agaataagga agatcctttt caagtttgaa tttgttcatt
101221 tggaatctgg gttcttctac ttcatattta tttaatatga atattttccc tctctttttt
101281 ttatatcatt ccttaagtcc cataggtttg atcctgtaga atttgaccca ttttctcatt
101341 gaacgaaagg tacgaaataa atcagattga taaaagtacc atgtgaaatc ttcggttttt
101401 ccccttcctc gatccctatc ccataggtta ggtacagtgt ttgaatcaat agagaacctt
101461 ttcttctgta tgaatcgata ttattccatt ccaaatcctt cccgatacct cccaaggaaa
101521 atctcgaatt tggatcccaa attgacgggt tagtgtgagc ttatccatgc ggttatgcac
101581 tctttgaata ggaatccgtt ttctgaaaga tcctggcttt cgtactttgg tgggtctccg
101641 agatcctttc gatgacctat gttgaaggga tatctatcta atccgatcga ttgcgtaaag
101701 cccgcggtag caacggaacc ggggaaagta tacagaaaag acagttcttt tctattatat
101761 tagtattttc tattatatta gatatattag actattatat tagattagta ttagttagtg
101821 atcccgactt agtgagtctg atgaattgtt ggcaccagtc ctacattttg tctctgtgga
101881 ccgaggagaa aaggggctcg gcgggaagag gagtgtacca tgagagaagc aaggaggtca
101941 acctctttca aatatacaac atggattctg gcaatgtagt tggactctca tgtcgatccg
102001 aatgaatcat cctttccacg gaggtaaatc tttgcctgct aggcaagagg atagcaagtt
102061 ccaaattctg tctcggtagg acatgtattt ctattactat gaaattcata aatgaagtag
102121 ttaatggtag ggttaccatt atcctttttg tagtgacgaa tcttgtatgt gttcctaaga
102181 aaaggaattt gtccattttt cggggtctca aaggggcgtg gaaacgcata agaactcttg
102241 aatggaaaag agatgtaact ccagttcctt cggaatcggt agtcaatcct atttccgata
102301 ggggcagttg acaattgaat ccgattttga ccattatttt catatccgta atagtgcgaa
102361 aagaaggccc ggctccaagt tgttcaagaa tagtggcgtt gagtttctcg accctttgac
102421 ttaggattag tcagttctat ttctcgatgg ggcggggaag ggatataact cagcggtaga
102481 gtgtcacctt gacgtggtgg aagtcatcag ttcgagcctg attatcccta agcccaatgt
102541 gagtttttct agttggattt gctcccccgc cgtcgttcaa tgagaatgga taagaggctc
102601 gtgggattga cgtgaggggg cagggatggc tatatttctg ggagcgaact ccgggcgaat
102661 atgaagcgca tggatacaag ttatgccttg gaatgaaaga caattccgaa tccgctttgt
102721 ctacgaacaa ggaagctata agtaatgcaa ctatgaatct catggagagt tcgatcctgg
102781 ctcaggatga acgctggcgg catgcttaac acatgcaagt cggacgggaa gtggtgtttc
102841 cagtggcgga cgggtgagta acgcgtaaga acctgccctt gggaggggaa caacagctgg
102901 aaacggctgc taataccccg taggctgagg agcaaaagga ggaatccgcc cgaggagggg
102961 ctcgcgtctg attagctagt tggtgaggca atagcttacc aaggcgatga tcagtagctg
103021 gtccgagagg atgatcagcc acactgggac tgagacacgg cccagactcc tacgggaggc
103081 agcagtgggg aattttccgc aatgggcgaa agctgacgga gcaatgccgc gtggaggtag
103141 aaggcccacg ggtcgtgaac ttcttttccc ggagaagaag caatgacggt atctggggaa
103201 taagcatcgg ctaactctgt gccagcagcc gcggtaatac agaggatgca agcgttatcc
103261 ggaatgattg ggcgtaaagc gtctgtaggt ggctttttaa gtccgccgtc aaatcccagg
103321 gctcaaccct ggacaggcgg tggaaactac caagctggag tacggtaggg gcagagggaa
103381 tttccggtgg agcggtgaaa tgcgtagaga tcggaaagaa caccaacggc gaaagcactc
103441 tgctgggccg acactgacac tgagagacga aagctagggg agcgaatggg attagatacc
103501 ccagtagtcc tagccgtaaa cgatggatac taggcgctgt gcgtatcgac ccgtgcagtg
103561 ctgtagctaa cgcgttaagt atcccgcctg gggagtacgt tcgcaagaat gaaactcaaa
103621 ggaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgat gcaaagcgaa
103681 gaaccttacc agggcttgac atgccgcgaa tcctcttgaa agagaggggt gccttcggga
103741 acgcggacac aggtggtgca tggctgtcgt cagctcgtgc cgtaaggtgt tgggttaagt
103801 cccgcaacga gcgcaaccct cgtgtttagt tgccatcgtt gagtttggaa ccctgaacag
103861 actgccggtg ataagccgga ggaaggtgag gatgacgtca agtcatcatg ccccttatgc
103921 cctgggcgac acacgtgcta caatggccgg gacaaagggt cgcgatcccg cgaggtgagc
103981 taaccccaaa aacccgtcct cagttcggat tgcaggctgc aactcgcctg catgaagccg
104041 gaatcgctag taatcgccgg tcagccatac ggcggtgaat tcgttcccgg gccttgtaca
104101 caccgcccgt cacactatgg gagctggcca tgcccgaagt cgttacctta accgcaagga
104161 gggggatgcc gaaggcaggg ctagtgactg gagtgaagtc gtaacaaggt agccgtactg
104221 gaaggtgcgg ctggatcacc tccttttcag ggagagctaa tgcttgttgg gtattttggt
104281 ttgacactgc ttcacacccc caaaaaaaag aagggagcta cgtctgagtt aaacttggag
104341 atggaagtct tctttccttt ctcgacggtg aagtaagacc aagctcatga gcttattatc
104401 ctaggtcgga acaagttgat aggaccccct tttttacgtc cccatgttcc ccccgtgtgg
104461 cgacatgggg gcgaaaaaag gaaagagagg gatggggttt ctctcgcttt tggcatagcg
104521 ggcccccagt gggaggctcg cacgacgggc tattagctca gtggtagagc gcgcccctga
104581 taattgcgtc gttgtgcctg ggctgtgagg gctctcagcc acatggatag ttcaatgtgc
104641 tcatcggcgc ctgaccctga gatgtggatc atccaaggca cattagcatg gcgtactcct
104701 cctgttcgaa ccggggtttg aaaccaaact cctcctcagg aggatagatg gggcgattcg
104761 ggtgagatcc aatgtagatc caactttcga ttcactcgtg ggatccgggc ggtccggggg
104821 ggaccaccac ggctcctctc ttctcgagaa tccatacatc ccttatcagt gtatggacag
104881 ctatctctcg agcacaggtt tagcaatggg aaaataaaat ggagcaccta acaacgcatc
104941 ttcacagacc aagaactacg agatcgcccc tttcattctg gggtgacgga gggatcgtac
105001 cattcgagcc gtttttttct tgactcgaaa tgggagcagg tttgaaaaag gatcttagag
105061 tgtctagggt tgggccagga gggtctctta acgccttctt ttttcttctc atcggagtta
105121 tttcacaaag acttgccagg gtaaggaaga aggggggaac aagcacactt ggagagcgca
105181 gtacaacgga gagttgtatg ctgcgttcgg gaaggatgaa tcgctcccga aaaggaatct
105241 attgattctc tcccaattgg ttggaccgta ggtgcgatga tttacttcac gggcgaggtc
105301 tctggttcaa gtccaggatg gcccagctgc gccagggaaa agaatagaag aagcatctga
105361 ctacttcatg catgctccac ttggctcggg gggatatagc tcagttggta gagctccgct
105421 cttgcaattg ggtcgttgcg attacgggtt ggatgtctaa ttgtccaggc ggtaatgata
105481 gtatcttgta cctgaaccgg tggctcactt tttctaagta atggggaaga ggaccgaaac
105541 gtgccactga aagactctac tgagacaaag atgggctgtc aagaacgtag aggaggtagg
105601 atgggcagtt ggtcagatct agtatggatc gtacatggac ggtagttgga gtcggcggct
105661 ctcccagggt tccctcatct gagatctctg gggaagagga tcaagttggc ccttgcgaac
105721 agcttgatgc actatctccc ttcaaccctt tgagcgaaat gcggcaaaag aaaaggaagg
105781 aaaatccatg gaccgacccc atcatctcca ccccgtagga actacgagat caccccaagg
105841 acgccttcgg catccagggg tcacggaccg accatagaac cctgttcaat aagtggaacg
105901 cattagctgt ccgctctcag gttgggcagt cagggtcgga gaagggcaat gactcattct
105961 tagttagaat gggattccaa ctcagcacct tttgagtgag attttgagaa gagttgctct
106021 ttggagagca cagtacgatg aaagttgtaa gctgtgttcg ggggggagtt attgtctatc
106081 gttggcctct atggtagaat cagtcggggg acctgagagg cggtggttta ccctgcggcg
106141 gatgtcagcg gttcgagtcc gcttatctcc aactcgtgaa cttagccgat acaaagcttt
106201 atgatagcac ccaatttttc cgattcggcg gttcgatcta tgatttatca ttcatggacg
106261 ttgataagat ccatccattt agcagcacct taggatggca tagccttaaa agtgaagggc
106321 gaggttcaaa cgaggaaagg cttacggtgg atacctaggc acccagagac gaggaagggc
106381 gtagtaatcg acgaaatgct tcggggagtt gaaaataagc atagatccgg agattcccga
106441 atagggcaac ctttcgaact gctgctgaat ccatgggcag gcaagagaca acctggcgaa
106501 ctgaaacatc ttagtagcca gaggaaaaga aagcaaaagc gattcccgta gtagcggcga
106561 gcgaaatggg agcagcctaa accgtgaaaa cggggttgtg ggagagcaat acaagcgtcg
106621 tgctgctagg cgaagcagcc cgaatgctgc accctagatg gcgaaagtcc agtagccgaa
106681 agcatcacta gcttatgctc tgacccgagt agcatggggc acgtggaatc ccgtgtgaat
106741 cagcaaggac caccttgcaa ggctaaatac tcctgggtga ccgatagcga agtagtaccg
106801 tgagggaagg gtgaaaagaa cccccatcgg ggagtgaaat agaacatgaa accgtaagct
106861 cccaagcagt gggaggagcc agggctctga ccgcgtgcct gttgaagaat gagccggcga
106921 ctcataggca gtggcttggt taagggaacc caccggagcc gtagcgaaag cgagtcttca
106981 tagggcaatt gtcactgctt atggacccga acctgggtga tctatccatg accaggatga
107041 agcttgggtg aaactaagtg gaggtccgaa ccgactgatg ttgaagaatc agcggatgag
107101 ttgtggttag gggtgaaatg ccactcgaac ccagagctag ctggttctcc ccgaaatgcg
107161 ttgaggcgca gcagttgact ggacatctag gggtaaagca ctgtttcggt gcgggccgcg
107221 agagcggtac caaatcgagg caaactctga atactagata tgacctcaaa ataacagggg
107281 tcaaggtcgg ctagtgagac gatgggggat aagcttcatc gtcgagaggg aaacagcccg
107341 gatcaccagc taaggcccct aaatgatcgc tcagtgataa aggaggtagg ggtgcagaga
107401 cagccaggag gtttgcctag aagcagccac ccttgaaaga gtgcgtaata gctcactgat
107461 cgagcgctct tgcgccgaag atgaacgggg ctaagcgatc tgccgaagct gtgggatgta
107521 aaaatacatc ggtaggggag cgttccgcct tagagagaag cctccgcgcg agcggtggtg
107581 gacgaagcgg aagcgagaat gtcggcttga gtaacgcaaa cattggtgag aatccaatgc
107641 cccgaaaacc taagggttcc tccgcaaggt tcgtccacgg agggtgagtc agggcctaag
107701 atcaggccga aaggcgtagt cgatggacaa caggtgaata ttcctgtact gccccttgtt
107761 ggtcccgagg gacggaggag gctaggttag ccgaaagatg gttatcggtt caagaacgta
107821 aggtgtccct gctttgtcag ggtaagaagg ggtagagaaa atgcctcgag ccaatgttcg
107881 aataccaggc gctacggcgc tgaagtaacc catgccatac tcccaggaaa agctcgaacg
107941 actttgagca agagggtacc tgtacccgaa accgacacag gtgggtaggt agagaatacc
108001 taggggcgcg agacaactct ctctaaggaa ctcggcaaaa tagccccgta acttcgggag
108061 aaggggtgcc tcctcacaaa gggggtcgca gtgaccaggc ccgggcgact gtttaccaaa
108121 aacacaggtc tccgcaaagt cgtaagacca tgtatggggg ctgacgcctg cccagtgccg
108181 gaaggtcaag gaagttggtg acctgatgac aggggagccg gcgaccgaag ccccggtgaa
108241 cggcggccgt aactataacg gtcctaaggt agcgaaattc cttgtcgggt aagttccgac
108301 ccgcacgaaa ggcgtaacga tctgggcact gtctcggaga gaggctcggt gaaatagaca
108361 tgtctgtgaa gatgcggact acctgcacct ggacagaaag accctatgaa gcttcactgt
108421 tccctgggat tggctttggg cctttcctgc gcagcttagg tggaaggcga agaaggcctc
108481 cttccggggg ggcccgagcc atcagtgaga taccactctg gaagggctag aattctaacc
108541 ttgtgtcagg acctacgggc caagggacag tctcaggtag acagtttcta tggggcgtag
108601 gcctcccaaa aggtaacgga ggcgtgcaaa ggtttcctcg ggccggacgg agattggccc
108661 tcgagtgcaa aggcagaagg gagcttgact gcaagaccca cccgtcgagc agggacgaaa
108721 gtcggcctta gtgatccgac ggtgccgagt ggaagggccg tcgctcaacg gataaaagtt
108781 actctaggga taacaggctg atcttcccca agagctcaca tcgacgggaa ggtttggcac
108841 ctcgatgtcg gctcttcgcc acctggggct gtagtatgtt ccaagggttg ggctgttcgc
108901 ccattaaagc ggtacgtgag ctgggttcag aacgtcgtga gacagttcgg tccatatccg
108961 gtgtgggcgt tagagcattg agaggacctt tccctagtac gagaggaccg ggaaggacgc
109021 acctctggtg taccagttat cgtgcccacg gtaaacgctg ggtagccaag tgcggagcgg
109081 ataactgctg aaagcatcta agtagtaagc ccaccccaag atgagtgctc tcctattccg
109141 acttccccag agcctccggt agcacagccg agacagcgac gggttctctg cccctgcggg
109201 gatggagcga cagaagtttt tttgagaatt caagagaagg tcacggcgag acgagccgtt
109261 tatcattacg ataggtgtca agtggaagtg cagtgatgta tgcagctgag gcatcctaac
109321 agaccggtag acttgaacct tgttcctaca tgacctgatc aattcgatca ggcactcgcc
109381 atctattttc attgttcaaa tctttgacaa cacgaaaaaa ccattgttca actctttgac
109441 aacatgaaaa aaccaaaagc tctgccctcc ctctctatct atccaaggga tggaagggca
109501 gaggcctttg gtgtcccctc cagtcaagaa ttggggcctc acaatcacta gccaatatgc
109561 ttttctctca tgcctttctt cgttcatggt tcgatattct ggtgtcctag gcgtagagga
109621 accacaccaa tccatcccga acttggtggt taaactctac tgcggtgacg atactgtagg
109681 ggaggtcctg cggaaaaata gctcgacgcc aggatgataa aaagcttaac acctctcatt
109741 cttattactt tttcaatatg aaaacgaaaa aaaaaaaaat gaaaaatcaa aaggtcgttt
109801 tattcaaaac cccaattgtg acatcccttc tctcccactt cacacctcgg aacgcaccct
109861 tcttatagag ataaacgcgc cttcacatct tcttaacccg aaatggctgg ggagaggaaa
109921 ggttcctttt tttgagggta ctcccgggaa cagatccagt ggagacgggg tggggcctgt
109981 agctcagagg attagagcac gtggctacga accacggtgt cgggggttcg aatccctcct
110041 cgcccacaac cggcccaaaa gggaagtacc tttccctctg ggggtaggaa aatcatgatc
110101 gggatagcga accaaaagct atggaacttg ggtgtgggtc ttttgtcgaa atggaatggc
110161 ttttcttttt ctctttttat ttatcgtgaa tgggggaatc attacacata gtatgcccgg
110221 tcagcatatt tttttgtttt acgccccgta actcttcctc agccaggctt gggcagaata
110281 gcagagcaag tattagtagc ataacaaaaa agccttcctc gtcattaata tctttgctcg
110341 cggcaattgt gacctctcgg gagaatcgat gactgcatct ttgatgcagt gctagtatat
110401 ctgagacttc ttaattggct agttgtaaat agccccaggg ctatggaaca aaggattatc
110461 tcggacctag accgaggtat tgatggtgat tttctaatct cgcagaacag aatgtgatac
110521 gatgagatag aatgcaatag aaacaaagac agggaacggg ttacctactc ttaacgggca
110581 aagcgagccc ctttattctg aattctttaa ttcagaatca atcaaatctc cccaagtagg
110641 attcgaacct acgaccaatc ggttaacagc cgaccgctct accactgagc tactgaggaa
110701 caacaggaga ttcgatctca tagagttcaa ttcccgttcc caacccatga ccaatatgag
110761 ctcgaagctt ccttcgtaac tcccggaact tcttcgtagt ggctccctta catgcctcat
110821 ttcagaggga acctcaaagt ggctctattt cattatattc catccatatc ccaattccat
110881 tcatttaata tccctttggt gtcattgaca taacagatgt cgtttctagt ctatctcttt
110941 ctatttcttt tctatatatg gaaagttcaa aaatcatcat ataataatcc agaaattgca
111001 atagaaaaga aataagggag gtttgtgatg atttttcaat cttttctact aggtaatcta
111061 gtatccttat gcatgaagat aatcaattcg gtcgttgtgg tcggactcta ttatggattt
111121 ctgaccacat tctccatagg gccctcttat ctcttccttc tccgagctct ggttatggaa
111181 gaaggaaccg agaagaaggt atcagcaaca actggtttta ttacggggca gctcatgatg
111241 ttcatatcga tctattatgc gcctctgcat ctagcattgg gtagacctca tacaataact
111301 gtcctagctc taccatatct tttgtttcat ttcttctgga acaatcacaa acactttttt
111361 gattatggat ctactaccag aaattcaatg cgtaatctca gcattcaatg tgtattcctg
111421 aataatctca tttttcaatt attcaaccat ttcattttac caagttcaat gttagccaga
111481 ttagtcaaca tttatctctt tcgatgcaac agcaagatct tatttgtaac aagtggtttt
111541 gttggttggt taattggtca cattttattc atgaaatggc ttggattggt attagtctgg
111601 atacggcaaa atcattctat tagatcgaat aagtacattc gatctaataa gtaccttgtg
111661 ttagaattga gaaattctat ggctcggatc tttagtattc tcttatttat tacctgtgtc
111721 tactatttag gcagaatacc ctcacccatt cttactaaga aactaaaaga agcctcaaaa
111781 acagaagaaa gggtggaaag tgaggaagaa agagatgtag aaatagaaac agcttccgaa
111841 atgaagggga ctaaacagga acaagaggga tccactgaag aagatcctta tccttctcct
111901 tccctttttt cggaagaaag gtgggatccg gacaaaatcg atgaaacgga agaaatccga
111961 gtgaatggaa aggacaaaat aaaggataaa ttccactctc accttacaga gacaggctat
112021 aattgtaatt gtgaattaaa aaaaacagaa aataaggaat ttgattcaca aagttgaaaa
112081 gagtaagtaa taaactaata aaaagattga aacataagct aaatacaaga aaagataaga
112141 agagatgcgt ccgcccccta tatatttgat accttctcct acaatgaaac taataacccc
112201 aaccccgtta tcagtcccat caattactcg tcgatcaaaa aaatgagtaa attcagctaa
112261 tcctcttatc ccaccaacta agaatcttgt ataaaaagca tctatgtaag cacgattata
112321 tgaccaatca tatatgccat ttataatttt gtcccacaga attctcttag gacccttttt
112381 aacaaaagaa ttaattaact caaaattttt taaagaagaa taaatgggtt tatataaaaa
112441 ggatgctata aatattccga aataagctaa ccatttcgat aatatatcca aattccctcc
112501 ctcttggttg aaaggaattc ctatagatcc aacaaacaaa gtaaagagtc ctaatacaaa
112561 tattgggaat agcatagtat tgtccgattc ataaggatag gaataaaccg ctttatgctc
112621 aaaatgagca atagtcataa aaggtcgtgt catctttctt ccatttttat caattggata
112681 tttagttttt gcaaaaaaat aagtactttc attattattc atagttaata aacaagagtt
112741 tttcttaact ccgtttttac cccatagaga tattgaatag aagggggttt tttgtttccc
112801 accataattt ggaaaatgag cgtttaaatg cccttcaaaa gtaagtaaat agatccgaaa
112861 catataaaat gcggttaatc ccgccgtggc ccaagctatt attgcgaaaa ttggcgaata
112921 caaccaacta tcattaagaa tttcatcttt ggaccaaaaa caagcaagag gtggaatacc
112981 acaaagagaa agtgtaccta ataaaaatgt gattttgcta attggtacat gttttcttaa
113041 acctcccata agacccatat tctgactttt agctggagaa tatccaacaa tagtttccat
113101 tgaatgaata atggatccgg atcctaaaaa taataatgct ttggaataag catgagtaat
113161 caaatgaaat aaagcgcttc gataagaccc cataccaaga gctaacatca tataacccaa
113221 ttgagacatt gtggaatagg ctaaacctct cttaatgtct ttttgagcaa gagctaaagt
113281 agctcctaat aatactgtta ttattcctat aaccgagatc aaatacatta tgtaaggtat
113341 aactctgaaa agaggaagaa gccgagctac aagaaaaatt cccgccgcta ccatagtagc
113401 agcatgtata agagccgaaa tgggagtagg cccctccatg gcatcaggta accatacatg
113461 aagggggaat tgggcggatt tagcaactgc accggcaaat aagagaacag cacataaagt
113521 aacaaataaa aaatcgactt cattattata aatcaagtta ttgaatattt cgaataaatc
113581 cctaaattcg aaactccctg ttatccaata aaaacctaaa attcctaata ataaaccaaa
113641 atcccctaca cgattagtta caaacgcttt ttgacaagca tttgccgcaa caggtcgtgt
113701 aaaccaaaat cctattaata gataggaaca cagcccaacc aattcccaaa aaatataaat
113761 ttgtatcaaa ttcgaactag taactaatcc caacatggaa gtactgaaaa aactcatata
113821 agcaaaaaat ctcaaatagc cttgatcatg agccatataa ttatcactat aaataagaac
113881 cataattcca accgtagtga ttaatattga cataatagaa gtaagtgggt cgatcaagta
113941 tccgaagtct aaagaaaaat cattattgat gatccaagac catacatatt gataaaaaga
114001 actgctattt atttgctgaa tagacaggta gattgaaaaa accatgacta tgcttaacaa
114061 taaaacactc tgaaaagccc acatacggcg aaaacttttt gttgccgttg gaaaaagaaa
114121 aagtcccgct cctattaaca tagggactgg aagtggaatg aaaggtatga tccacgcata
114181 ttcatatgtc tgttccataa aaaagttttg aattcttaat taattgtttc cgattcaccg
114241 gatcttacct cttttgaaag gagtcaataa aaagtcaaaa tatggactaa ctgaaactaa
114301 tttaaaactt aaatcgaatt ttctattctt acttattctg agtctttgct aaatacttca
114361 actattgaaa tcaagaagtt acaattggtc aaatgatatg aaagggatta attactagtc
114421 tcttttgaaa taggcctatt tttctccaag tttgaccagt gaatcgaacg gggattcaag
114481 tttttcattt catgaagtaa aaatgcggtt cttatcttta aacctttcga ggtattttat
114541 tgcatgtaaa tgaaatgtgg aaccataaat agaaatcgag tattttttgg attctttatt
114601 ttattttttc tttttattaa gttcaactaa tttcctttct acagaacagc cgattagcaa
114661 attctatagg tatagatttt atgaatcaaa aataatgtga aataaagata ccagtcaata
114721 gagaaccttt tttttacaat tatgaatgtt ttatggaata gaaaaacttg aaaaaaacac
114781 atattgacct tcttttttta tttccagtat tatgcaattt tcacacatct tttgcctatc
114841 tcgataatgt tttattttag gacgacacta ttagctcgaa aataaatagt agtaaaaaga
114901 attcgttttg aacaatagat gtctttcaca tccagctata acaatgagta attttttaat
114961 ttctaaatgg cagttccaaa aaaacgcact tcgacatcaa aaaagcgtat tcgtaaaaat
115021 atttggaaaa ggaagggata ttcgatcgca ttaaaggctt tttcattagc gaaatctctt
115081 tctaccggga attcaaaaag tttttttgta cgccaaacaa aaataaataa gtaataaaac
115141 gttcgaataa tttgaatcaa cttgaaaaaa gaattcaatt attcttaaat tattcaatta
115201 gataataatt gaataattta acgatttccc tttcatattt gatattgatt agctcaccaa
115261 tcaatacgta atggaactcg cttcgctttt ctgattgata gataaaataa tagaattagg
115321 aaatcctcta tttactgaat aataactttt ttgttgacaa aagagtaaac atcatttcta
115381 ttccaaggtg gggagtttca ttttccccat cgacctattt gcagaattcc attaaaaaaa
115441 aattctatat ttccattcta tttccatatc tatagaagaa cgtatataaa aatctttagt
115501 gaaattagtg aaagttaaga actcattgaa actaattgat tctattttga aacctttttg
115561 ttttgtctaa ctttctaact ctttattttc tctgaattat tatatagata cccatgtata
115621 tcttgccctt aacccaatag agaaaattgc ttaatgaaat tctgtatgac tggttgtcaa
115681 ttttgagcga tgcaaaatag gttcttttct ttctattttg tcttcaaaat ccattttttg
115741 ttttagattt ctgaaataaa ataaatagga aatagctgat taaacaatga aaacaaaaaa
115801 tttgggaact ctattcctta attgagtata gaacggttta gttacaagag ttcaattcga
115861 ggaaagcata aaatatggga aagtcccagg ttaaataaaa aaaactaaga ctctaaactc
115921 aaatctaaaa taatgaacct tcaacttcaa attcctattt gaacaacttt ttattgttat
115981 tgatccattt gaatcattac taaactaaaa tagcttcctc aatctcgacg attgcttatt
116041 cataggctat tatgagttca agacaggccg ctatggtgaa attggtagac acgctgctct
116101 taggaagcag tgctaatgca tctcggttcg agtccgagtg gcggcatacc gtcttctaaa
116161 aaggataaat agatcttata atgaattcaa ttcccgattt cctttttaga attatgtaat
116221 taagggactc ttctttttta agatttttta tgatattttc aaccttagag catatattaa
116281 ctcacatttc cttttcgatc gtttcaattg taattacaat tcatttgata acctttttag
116341 tcgatgaaat cgtaaaacta tacgattcat cagaaaaggg cataatagtt acttttttct
116401 gtataacagg attattagtt actcgttgga tttcttctgg acatttccca ctaagcgatt
116461 tatatgaatc attaattttc ctttcatgga gtttctccct tattcatata attccgtatt
116521 tcaaaaaaaa tgttttaatt ttaagtaaaa taactggccc tagtgctatt tttacccaag
116581 gctttgctac gtcaggtatt ttaactgaaa tacaccaatc tgtaatatta gtacctgctc
116641 ttcaatccga gtggttaata atgcacgtaa gtatgatgat attgggctat gcagctcttt
116701 tatgtggatc attattatca gtagcacttc tagtgattac atttcgaaaa aacagacagc
116761 ttttttataa gagcaatggt tttttaaacg agtcattttt cttgggtgaa aatgttttac
116821 aaaatacttc ttttttttct gctaaaaatt attacaggtc ccaattgatt caacaattgg
116881 attattggag ttatcgggtt attagtttag gatttacttt tttaaccata ggaatccttt
116941 cgggagcggt atgggctaat gaagcgtggg ggtcgtattg gaattgggac ccaaaagaaa
117001 cttgggcatt tattacttgg atcgtatttg caatttattt acatactcga acaaatagaa
117061 atttgcgggg tgcaaattct gcaattgtag cgtctatagg ctttcttata atttggatat
117121 gctattttgg ggtcaatctt ttaggaatag ggttacatag ttatggttct tttccatcaa
117181 catttaattg aattcaagac aagttattac aaatacaaga gcgggcggcg cattgtatga
117241 accagcgtgc ggaccgtgtg aatcatcaat acaatatttg attcacacgg ttttctacca
117301 tatgtagttc aatttcattg tttttactta acttaagagt taagagaaga aaaaaagtct
117361 tctttttttc attgtccaag aatgtttttc aaaacaaaca taggtttttt ttatttcagt
117421 catccaaatt atctataaaa aaaattagat agaataactt cgaccttgtc aactgctaat
117481 gaaagaacga aatccgggta tataccaata cctattacgg gtaaaaagat ggagatcgaa
117541 agaaataact ctcgcggtcc agaatcaaaa aaagaatcct tcggggcatt aaatagcttg
117601 tatccataga acatctggcg tgacatagat aatgaataaa taggagttaa tatcattcca
117661 attgccatta caaaagtaat tagtattttt ggaattaaaa gatatttttg gccggtaatt
117721 attccaaaaa atactatcaa ttcggcaaca aaaccactca tacctggtaa tgcaagggaa
117781 gccatcgaaa agctactgaa catcgtgaac atttttggca ttggaatagc tattccgccc
117841 atttcgtcaa gataaacaag gcggattcta tcataagtcg ttcccgccaa gaaaaaaagt
117901 gcagcaccaa taaatccatg agatattatt tgtaaaaggg ctccattaag tcccgtgtcg
117961 gttagagaac taattcctat aattatgaaa cccatatgag agacagagga ataggctatt
118021 ctttttttta aattccgttg gccaagagat gttaaagctg catagattat ttgtattgta
118081 cctattatca tcaaccaagg agaaaatata gaatgggcat gaggtaataa ttccatattg
118141 attcgaatta atccatacgc tcccattttt aataaaattc cggctagaag catacaagta
118201 ctgtaatgtg cttctccatg ggtatctggt aaccatgtgt gtagggggat aatgggcgat
118261 ttgacagcaa aagcaataaa aaatccaata tagaatatta tttctaaaac cacaggatat
118321 gactgattaa ctgatgtttc aaaatttaat gttggttcat tagaaccata taaagcaaga
118381 cccaaaactc ccattaagag aaaaacagaa ccccccgccg tgtacaaaat aaattttgta
118441 gctgagtaca gacgtttctt tcctccccac atgcatagaa gtagataaac aggaattaat
118501 tctaactccc acatgatgaa aaaaagtaaa aggtcccgag acgaaaatga tccaatttga
118561 ccactgtaca ttgctaacat gagaaaatgg aataatcgag aatctcgagt aactggccaa
118621 gccgctaaag tagctaaagt agtgataaat cctgttaata aaatgggtcc tatagaaagt
118681 ccatctattc ctaatctcca atggaaatca aaaaaattga tccatttata atcctccact
118741 agttggatta atggatcatc cgattggaaa tgataacaaa atgcataagt cgttagaagg
118801 agttctaaaa tacatataca tatcgtatac cacctaatta ccctatttcc tttatgggga
118861 agaaagaaaa ttaaggaacc cgcaaatatt ggaaaaacta caattattgt taaccaagga
118921 aaataattcg tagtaaagac aagatacact tggaccataa aaacccgtgc tcaaaatatt
118981 gtgattttcg agcacaggtt tgtcggtaaa aaaaattaaa tggattcaag tagagttttc
119041 tcgaacgtat caataagcta gacccatact gcgagttgtt tcatgccata aataaactcg
119101 gacactcaag aaatctgttg gacaggcgga ttcacatctc ttacaaccaa cacagtcctc
119161 tgttcgtgga gcagaagcaa tttgtttagc cttacaaccg tcccaaggta tcatttctaa
119221 tacatcggtg ggcaggctcg gacacattga gtacatccta tacacgtatc ataaatcttt
119281 actgaatgtg acattgggtc tatacgtttt tgaatgttag aaattttcga tctagtaaac
119341 ttagaaacga atcatataat catatattta tataccagat gaatcaatga gttatcataa
119401 ttttctaatc aacccccttc tggattggtt tatgagatat gagagagggc caaaatactt
119461 tgatttctta tgttttgcaa acaagatcac accttacgta gcaaacatgc taattaaaat
119521 cgatttatca atattagaat ctagatgatt aatactaatt attcaacaaa tttgattggt
119581 tgatacgagt tgattttctg ttacggtaaa ttgatgaaac aatagccagt ccaatggctg
119641 cttcagcggc tgcaatagct ataacaaaaa ttgagaaaat gtctcctttt aattgacgat
119701 tatcaaaaaa atcagaaaat gttacaaaat ttatattaac cgcattcaat ataagttcaa
119761 gacacataag ggctctaacc atatttcgac ttgtgatcaa tccatagatc ccgatagaaa
119821 ataaataggc actcaaaaca agtacatgtt cgagaatcat taaacaactc cttatcaatc
119881 tcgactcctt tcaatatgaa caacaattca accgatttaa ttgactagta tataacaagt
119941 atggaacaaa gaaatatatt ggtactagat tgacctaaag tctttctatt tatacaacag
120001 gaattcaaat agaattgaag gaaaatgaat gtgataagac agaacaaaat tttatttgaa
120061 ttccaagttt taatagaaat tttttattga cgagctacag caattgcacc tattaaagca
120121 actaaaagga ttattgaaat cagttcaaat ggaagaaaaa aatctgttga taaatgaatt
120181 ccaatttgtt gactattact tataaaatct tgctctataa tctggtttga tcttgtagtc
120241 caaataatcc cgtaccatga cgtatctgaa atagtagtaa ttagtgaaat aaaaagactt
120301 atacaaacca tcgaagtaat tccatctcct acggtccaaa gatgaaaatc tttgtaatat
120361 tctgaaccat tcatgaacat cacagcaaaa atgattaaaa catttatagc tcctacgtaa
120421 ataagtactc gcagcagcta caaaatagga gttagataga atatagaata acgatgtaca
120481 aacaagaacc aatcccaagg aaaaggcaga ataaattgga ttgggaagta ataccactcc
120541 tagaccccct aatataagac ccgaccctag aaagactaaa agaaaatcat gtattggttc
120601 agataaatcc attttttatc aaaaatcaaa aacgaagaat ttcatgactt tattgacctg
120661 accaggaaaa aagaagtttt tcaatttttt atgatacttc ttaattgtta attgaatgaa
120721 attgtaatgg gtatgaattg acgtagatgc ttttatttta ttggaccact atcaattctt
120781 tattcgtcga acgagtagtt taaacctatc gattttggat atcatttatc tactttgaaa
120841 ccattactat tattataact ataatataga aatccgtttt gttttcaatc taaattaagc
120901 taggagtctc attaaccaac cactagtttg aattgaacaa gcaaaaatat cattctttta
120961 gatccgaact aagccttcgt aattcggaat ttttttcgaa tttagggttt attcattttt
121021 tatttgaggt aaattcgaaa ttgttcgaat tgtgtaatca tcaattactg acattggtaa
121081 gcgacccaaa gcgatttgat tataattcaa ttcgtgacga tcataagtag aaagttcata
121141 ttcttcggtc attgataaac aatttgttgg acaatactca acgcaattac cacaaaatat
121201 acagattcca aaatcaatac tgtaattaag caatcgtttc tttcgaatat cagtttccaa
121261 cttccaatca acaacgggta aatctatagg acatacacgc acacatactt cacaagcaat
121321 gcatttatca aattcaaagt ggattcggcc tcggaaacgt tccgatgtga tcaatttttc
121381 gtaggggtat tgaatagtta caggtaaacg atttgcgtgg gacagggtaa tcatgaaacc
121441 ttggccgatg tatctggcgg ctcgtattgt ttgttgacca taatttatga attcagttat
121501 catagggagc atatttagaa tatctataaa aaagatttta tgcttgtttc tttctcttgt
121561 ttgagacaag tcgtgaatct agaatattgt agtcttttac agtgaaagaa gttgggacga
121621 ggttgtcaat aatagattac ctagagaaat aggtaaaaga aatttccacc caagatttaa
121681 tagttggtcc attctcagcc tcggtaaagt ccatcttgtt gcaataggaa tgaacaaaaa
121741 caaataagtt ttggctaatg tgataaagat accaattagt gttccaaaga ctttacccct
121801 tttatttatg ccaaatagct caggaacaaa tatgtacgga atagaaagat tccaacctcc
121861 caaataaaga actgttacaa ataatgaaga aactagtaga ttcagatatg aagcaactgt
121921 aaaatcaaac caaatttgat acctgaatat tcggtttgat accctgctac taattcttct
121981 tctgcttctt ggtaaatcaa aaggtaatct ttcacactcg gctagagaag aaattagaaa
122041 aacgataaac ccgatgggtt gacgccacaa attccacccc caaaagccat attttgactg
122101 cgcttccact atatcaactg tacttaaact gttagataat catagtcgat gataacatca
122161 ctgtgcccat cgctattaca gaaccgtacg tgagattttc atctcatacg gctcctcaga
122221 ggtcacaaat aaatctaagg accctttcct attctttatc ttgatatgtt tgtcagatag
122281 agtaaaaatc tatcctaagg tcccaaatta gaccaatgga attctgtctg ctatatttaa
122341 aactaataaa tacgggcttc tgaattgatc tcatctttta agaattttca tttttctttg
122401 ttgattaata accttatcat taaataaaat gcgctttata gcaatatcac atatacattt
122461 caacctcgaa ttctcaatta cgaaaaaaat tagagagtcc attagttcat gaatcatgac
122521 aaaaaatttc tctctcgaac tagaaatcaa aatggaatta taggaaagaa agaataaaaa
122581 caaaaaaaga aaaaagtaag aaaaaaaaag acatcccccc tttttgcttt tgcaattaga
122641 ttcttttctt tctatttcta ttttatttca ttcctattct cctttctcag aaaaagggcc
122701 tttaaccaaa gtaaaagatt acttcgttct tgatagttat ttacttactc agtggatagg
122761 aacatactct ggatcagaat catggggagt acttcttgat catttctacg aacgtaaagc
122821 cccaattcga attcctttta tgtacagaaa tatcctcttg gataacttac ataatctcaa
122881 ttactaatcc tttgtgtatc ttggtcttcc taaccatcca ctcatttttg ctttcaacct
122941 cccgttgtgg aaatccatct atggtaatag acagtaaaaa ctccatacag ttgatctttt
123001 gaacccgctt caagctatca tgacaattca ccaatcttgg ggtaaacaat ctctattgct
123061 tatgtttact tttttcacca tttgattctt gtacatagga aatgagactc aaccttttta
123121 ctgcaaattt agaagccgtt ttctttcact catataacta tctggtttag ttcatcaacc
123181 caaatgctga ataaaaatga aaatatatat attcaatcaa atctttttac ctttgtttct
123241 agaaagaaaa gaatttggag aaattttagg tctcaccgaa tcacacgtag agatattgat
123301 aacacacata gagctaatgg tattttcata actaattgat tgagcagctg cccgtagacc
123361 acctaaaaaa gaatatttat tatttgatcc atatcccgac ataagaagtc caacgggagc
123421 aatacttgaa atggcaatcc agaaaaaaac accaatacta agatcggcta gaacaaggtg
123481 atcaccaaaa ggaattactg aataacttag aaagatggat attactgcta tggatggtcc
123541 gatactgaat aaacgagtat ctcctgtaga tggaataagg ttctctttca aaagtagttt
123601 tgtcccatct gctagagctt gaagaattcc taaagggcca gcatattcag gtccgatacg
123661 ttgttgtatt cctgcagata tttctctttc taaccaaaca attactagta cacctattgt
123721 gattcctaat acaagagtca aaatagggaa aagcatccat atgatcccat agacttcttt
123781 taaggattcc aatttggaaa aagaattgat agtttctatt tctgttgtat caattatcat
123841 ttcaacgatc aacttctccc ataatgatat ctatgctacc tagtattgtc ataatatcag
123901 ccaatttcat tcttttaact aactgaggaa gaatttgcaa attgataaaa cctggtgggc
123961 gaattttcca tctccaagga aaaacgctct gatctcctat gagaaaaatt cccaattctc
124021 cttttggggc ttcaactctc acataaagtt cttgtttcga caattcaaaa gttggagaag
124081 gttttttact aataaaccga tattcaaaat cattccattc aggatctttt aatctgtcaa
124141 aacgtcggat ttctaaattt tcgtaaggcc ctcctggaat tccttccaga gcctgttgaa
124201 taatctttat ggattctgtc atttcaccga ttcgtactaa ataacgagct aatgaatccc
124261 cttctcgttg ccattgaacc tgccaatcaa attcgtcgta agactcataa tgatcaactt
124321 tacgaagatc ccattctatt ccggaagctc gtagcattgg tcccgataac ccccaattta
124381 atgcttcgtc tcccccaata atgcctacgc cttcaactcg ttctaaaaaa ataggattcc
124441 gggtaataag tttttgatac tcagcaaccc ctgttaaaaa ataatcgcaa aaatccaaac
124501 atttatctat ccagccatag ggtagatcgg cagccactcc cccgatacga aaataattat
124561 gcatcattcg cataccggtg gcagcttcga agaggtcata tatcaattct ctttctcgaa
124621 aaatatagaa gaaaggggtc tgcgcaccaa tatccgccat aaaagggcct agccataaca
124681 aatgagaagc tatccgactc aactccaaca taatgactct gatatagcta gcccttttag
124741 gtacttgaat attgcctaat tgttcgggtc catttatggt tattgcttct gtgaacatag
124801 tagctaaata atcccaacgt gttacataag gcaaatattg tataattgtt cggttttccg
124861 caattttctc catccctcta tgtaaataac ccaatattgg ttcgcagtcg acaacatctt
124921 caccatctag agtaacgatg agtcgaagaa caccgtgcat tgatgggtgc tgaggcccca
124981 tattgactat catgaggtct tttcttgtag ttggtgcagt cataagtttt ttaccgattc
125041 attcttccat gaattgctga aagtgaaaag aagttcatca aaatttaatc gaaacatata
125101 agtgaaaatg aaatgactct tcaaataaat caaattaacg agtttttgtc tctcgaatgt
125161 ccaactgatt aattaattct ttataacgta ctctattttt ttttgacaaa taagctagga
125221 gtcgttgacg ttttcccaaa attttcttca aacctctctg agataaatag tcttttttgt
125281 gcaattctaa atgtgaagta agtctccgta tcttattggt gaaattgaat acttgaaatt
125341 caacagatcc tctcttttct tcttgagaaa taactgaaat gacagaattt tttaccataa
125401 aagaatttcc cctttcttta ttttacagat atggatttta tcgaatttta tcgatcagta
125461 ataataatgc cagtaatttg aacgtggtat atagacttaa tttctttatg aactcctaat
125521 tttatcaatt ccaataaatt aatcaaattc aaaatttgat tcagatagga atccaaaaag
125581 atggtaggta cttttttttt cattcacaaa agcgactaat ttaaacctaa aatcctaaaa
125641 tgaagaagat tttgttgatt cctttctaga tctaatcgat actttattga tttagtatcg
125701 tctactcgaa ttagattcga atgagatgta agaaaaagca tgtgtacatt tgtttacttt
125761 cagatactct atacgaaaca ggatatatag tactatcaat ttattttcaa ttgtggatac
125821 atatgtatcc ttaagatact gaaacgacta ccattattgg tatcaaacca ataacgattc
125881 atacaagcta aatcttctaa tcgataatta ggccaaagaa agaacttcaa tttaattaat
125941 tcatttttct ctttataaag aggtttcctt tcatccaaaa attgactcca gttttttaca
126001 ttgttttcgt tgcaaaatac tgaatttcta tcgatgccat tccaattcaa agaattaaac
126061 aaacttcgaa ttctcaattc tctacgacgt ctagaccata aaatattttc aggaacaagc
126121 aaatcaaaat gatttttgtc tgtatttatt ctttgagttt gaggttgcag aatgaattca
126181 tcaaaattct ttttatcaac atatctttgt tcggggtatc tttgattagt ttggtgttta
126241 cttttatgaa ccaatgaaat acctatggtt tgatacataa taaattgtcc attatttttt
126301 acagacaacc gaataggttc gataattaat atccccttct tcatcaattc tgtaagagtt
126361 aaattcttct gaatcagcat tatatccaaa ctcatttctc tcctttgaat tgacgatata
126421 gcaattttgc ttggatttat cagtcgaagc aggagacaat ataccttgat attctcgatc
126481 attctttgat tcaaagcatc gttccatctc aattgaaaaa gcaaataacg tttcaagaac
126541 aaatctagtt ctgcttccgt gttgcttttg tattgttttt tctttttacc cttctttgtg
126601 tctgattccg cgtaatcttt tttaagagcg ttttgatgtt ttgagagaac agggcccaga
126661 tttcctttgt tttctatatc tgatccacgc tctttttctc cttgacttgc gggttctttt
126721 gcttcttgaa ttcgattctt tattttttta tttgatcgta gaaaaaagtt ttgtttttgg
126781 tttttattga tgtttttatt tgactaacat tttcatttgt attcaaattt aaaagaagta
126841 atttgcttgg tataatccac ggttttattt tatatacatt ataaagtggt acaaattctg
126901 ggaagaacca aaattccaga ttcaatatgg gacgatttaa tattttttca ttcattccca
126961 tccaatcaaa aaaggctttt ttcgaatttt tttgattgtt ttctggattt tgatgaatcg
127021 taagataaaa aaagcctttt ttatcaattt tatcaattat ttgataatta ttaataccaa
127081 ttttagtatt tggattactg ttggtatcga tcttaaccca ggcctcaata tcttcttttt
127141 gtctaagaga aaaatggata attttccaat caaaatattt tctatcgaga tttctttcta
127201 tatatagaat attgcctttt cttagataat tattgatatg aagattgccg agcatatcaa
127261 aaaggttgtg tttggacgtg ttggaattag aagaaatttc gaggttctta tttacttgaa
127321 agggtaatct agaaataaaa gagtcatttt ttttttcata attaatcgat ttatatgcta
127381 aaagatcata tctataacat ttttgaaaat tatctttttg gtttgctaat gaatagagct
127441 cagaatcatt ttcttttttg taatgaatta attggtcttt ttcatatgaa ttccatttgt
127501 ttaaatttcg attttgagcc atacaacctt gattaaccct atttcgccat ttttgtggca
127561 ttaatctaga ccatctaatc tgagataaat cgtattgata atgccgtctt aaccagtttt
127621 tccattgatt gattctataa ctctgaagtt tcttatgttt taattcagaa tgaaatattc
127681 ctagtgttcg aaaatagtcc tttattttag tcttaaggaa aaaagacgtt ctgttatatt
127741 gaagaacaga tcttaattta gacaaattaa taacttgggg ttgtgataat ttgtaaaata
127801 cgatatgctt gtgataagta ggataaatca aaaaaaatat gtgaattttt cttactaata
127861 ttataaagtg acttttttat agtcgaaata aagtgaattt ttttttgatt attaattttt
127921 tcttgattta tttcattatt ggaaatgtat ttatcaatca atttgtttgt tgattcaaga
127981 aagagttgtg tattaattct gggaatatta atgatagata aaaatagatc gatgtataat
128041 ctttgaatga ataattttag aaaataatgg aatttccata ttaatcgagt atttcttctt
128101 tttaatattt ggaaaatctt ttttggcgat tcgaattttt taatattatt tgttttatta
128161 ggactaatgt ctatttctgg agttactttc tttttctctt ttgtaattct ttctatttga
128221 tttttgattg tacttgttct atcagtcaaa tccttcattt tgctttctat cagtgaagaa
128281 tttggccaat ttccagattc aatttgacta aatgattcgt taattatctg attactcatt
128341 agagaatctt tttctttttt cgtttcattc gattcatcta tttctttgag tctaaataat
128401 acaattggat ttacttttga aagttctttt ttcatttttt ttataaatag actacttttg
128461 ataagccatt ttttggtttc ttttgaaatt cttcgaaata attttatttt tcctttgaaa
128521 acttttagag ttataaaata tttctttttg aattttccaa tttttttttc gagttcctta
128581 aaaatgggct caaaaaaaga agggcgtttt cggggagaac caaagggaag ttcagcttcc
128641 attccccaaa ctgttaaaaa acaaaaatca tctttttgtt ttttcttttt cattagctct
128701 ccacgggagg agtacagttt agatatatgc caaggtttca gacaaaaagg aaataatatt
128761 ttgatctgaa tgccatcttt caaccaattt tttggaaatt ctgtttctga taattgaaca
128821 ccattataag tacatttaat atgcatttct ctattccatt cctgcaaatc ttcagaccat
128881 tcaggaagtt gcaagactaa catacgcccg agatttttgg ctattatcaa tgaaggtaat
128941 acaatatatt ttcgaagaat tgattgagtt attaacatgt aacctcttat tatttgcgca
129001 aaaggaatgg tatcccaggc ttctgctatc tctatccgtg ctttttcctt tcttttgttc
129061 tccccttttt tgtccttttc ctttttctct tctctttttg tttgttcttc tctagactct
129121 agaatcttga attctccttc tttacctgac caatttcgaa aaattggttt aatcagtcca
129181 gagatatcaa aagaaaaaag aaaggggggg gttattctgt caagaaaaag gggggaatgc
129241 acatttgctt gaaagagttt ccaaataact gttttgcgcc tttgagcccg catagagcct
129301 ttgattatac ctcgccgaaa atctggttgt tgcgaatagc gtattaaagc cacttccttt
129361 gtttgatctt gatctgcggt atcagtatct ttggtatcag gatcgttatt ctggttgttg
129421 gcagtaaaaa tcactacacg tttggctttt cttgaacgaa tttgatgatc cagtggtacg
129481 ccctcttgat agtcacccga ttgttgttcc aattcggtga ttaatttatg tgaccagcga
129541 ggtatttttt tactgatttc ttttattcca atcgattttt tttcagatgt tgtcccatta
129601 ggagcaattg cattgaatac aaattttaca aatttagttc ttttttctga attcactctt
129661 ccctgttctt ggtctgaaaa taaagaaagg tctttcaaat ttaaactcga ttttggttcg
129721 ttaccaaatt cattgattaa agttaagaac tcgtcaattt ctgttgataa tggtttttta
129781 gcaaccgtat ccactttttg ttccaattct tggtaatcag tattcggaag aaagatagta
129841 tgaatcctat ttattctaac cctctctttc aaattttcta gcgaagtatt gtttatgatt
129901 gaaggtgaaa actttttttt gattgttcct cgatatggtc catttaacaa aggatcatac
129961 attttaggca cgtattcttt tttagtatca tcattacaca atctagtcct tgtttcaagt
130021 atatcgagag aaaaagattc cttgtctaga acttcaagtc gatttaaaaa ttccttattc
130081 agattattac ttttttcttt gttggtagaa atccactgat tgtccagttc attagggagt
130141 gttttttgga gtgacaatag gggtatcctt ctttttatca ttttccaaaa agttgataaa
130201 cttggcgggt atgtaaaaga tattctttgt tttccatcac ttttacatgt gttaaaaaaa
130261 tattgtgaca tttccgttct tatggcctgt tcaaatcgat tattctttat gtagcgaaat
130321 ggtcgattcc atcgattata atcgaaaaga agactcacaa gaggctgttg aaaccagaag
130381 aggtctttat tttcattttt tttatcaagc agttgcaatt taaaaatttc tgtattcccc
130441 gtgttattat tattcagata agaatcctca taatcataaa ttggactatt actagtatta
130501 atattattat agcctgtctc tgtaaggtga gagtggaatt tatcctttat tttgtccttt
130561 ccattcactc ggatttcttc cgtttcatcg attttgtccg gatcccacct ttcttccgaa
130621 aaaagggaag gagaaggata aggatcttct tcagtggatc cctcttgttc ctgtttagtc
130681 cccttcattt cggaagctgt ttctatttct acatctcttt cttcctcact ttccaccctt
130741 tcttctgttt ttgaggcttc ttttagtttc ttagtaagaa tgggtgaggg tattctgcct
130801 aaatagtaga cacaggtaat aaataagaga atactaaaga tccgagccat agaatttctc
130861 aattctaaca caaggtactt attagatcga atgtacttat tcgatctaat agaatgattt
130921 tgccgtatcc agactaatac caatccaagc catttcatga ataaaatgtg accaattaac
130981 caaccaacaa aaccacttgt tacaaataag atcttgctgt tgcatcgaaa gagataaatg
131041 ttgactaatc tggctaacat tgaacttggt aaaatgaaat ggttgaataa ttgaaaaatg
131101 agattattca ggaatacaca ttgaatgctg agattacgca ttgaatttct ggtagtagat
131161 ccataatcaa aaaagtgttt gtgattgttc cagaagaaat gaaacaaaag atatggtaga
131221 gctaggacag ttattgtatg aggtctaccc aatgctagat gcagaggcgc ataatagatc
131281 gatatgaaca tcatgagctg ccccgtaata aaaccagttg ttgctgatac cttcttctcg
131341 gttccttctt ccataaccag agctcggaga aggaagagat aagagggccc tatggagaat
131401 gtggtcagaa atccataata gagtccgacc acaacgaccg aattgattat cttcatgcat
131461 aaggatacta gattacctag tagaaaagat tgaaaaatca tcacaaacct cccttatttc
131521 ttttctattg caatttctgg attattatat gatgattttt gaactttcca tatatagaaa
131581 agaaatagaa agagatagac tagaaacgac atctgttatg tcaatgacac caaagggata
131641 ttaaatgaat ggaattggga tatggatgga atataatgaa atagagccac tttgaggttc
131701 cctctgaaat gaggcatgta agggagccac tacgaagaag ttccgggagt tacgaaggaa
131761 gcttcgagct catattggtc atgggttggg aacgggaatt gaactctatg agatcgaatc
131821 tcctgttgtt cctcagtagc tcagtggtag agcggtcggc tgttaaccga ttggtcgtag
131881 gttcgaatcc tacttgggga gatttgattg attctgaatt aaagaattca gaataaaggg
131941 gctcgctttg cccgttaaga gtaggtaacc cgttccctgt ctttgtttct attgcattct
132001 atctcatcgt atcacattct gttctgcgag attagaaaat caccatcaat acctcggtct
132061 aggtccgaga taatcctttg ttccatagcc ctggggctat ttacaactag ccaattaaga
132121 agtctcagat atactagcac tgcatcaaag atgcagtcat cgattctccc gagaggtcac
132181 aattgccgcg agcaaagata ttaatgacga ggaaggcttt tttgttatgc tactaatact
132241 tgctctgcta ttctgcccaa gcctggctga ggaagagtta cggggcgtaa aacaaaaaaa
132301 tatgctgacc gggcatacta tgtgtaatga ttcccccatt cacgataaat aaaaagagaa
132361 aaagaaaagc cattccattt cgacaaaaga cccacaccca agttccatag cttttggttc
132421 gctatcccga tcatgatttt cctaccccca gagggaaagg tacttccctt ttgggccggt
132481 tgtgggcgag gagggattcg aacccccgac accgtggttc gtagccacgt gctctaatcc
132541 tctgagctac aggccccacc ccgtctccac tggatctgtt cccgggagta ccctcaaaaa
132601 aaggaacctt tcctctcccc agccatttcg ggttaagaag atgtgaaggc gcgtttatct
132661 ctataagaag ggtgcgttcc gaggtgtgaa gtgggagaga agggatgtca caattggggt
132721 tttgaataaa acgacctttt gatttttcat tttttttttt ttcgttttca tattgaaaaa
132781 gtaataagaa tgagaggtgt taagcttttt atcatcctgg cgtcgagcta tttttccgca
132841 ggacctcccc tacagtatcg tcaccgcagt agagtttaac caccaagttc gggatggatt
132901 ggtgtggttc ctctacgcct aggacaccag aatatcgaac catgaacgaa gaaaggcatg
132961 agagaaaagc atattggcta gtgattgtga ggccccaatt cttgactgga ggggacacca
133021 aaggcctctg cccttccatc ccttggatag atagagaggg agggcagagc ttttggtttt
133081 ttcatgttgt caaagagttg aacaatggtt ttttcgtgtt gtcaaagatt tgaacaatga
133141 aaatagatgg cgagtgcctg atcgaattga tcaggtcatg taggaacaag gttcaagtct
133201 accggtctgt taggatgcct cagctgcata catcactgca cttccacttg acacctatcg
133261 taatgataaa cggctcgtct cgccgtgacc ttctcttgaa ttctcaaaaa aacttctgtc
133321 gctccatccc cgcaggggca gagaacccgt cgctgtctcg gctgtgctac cggaggctct
133381 ggggaagtcg gaataggaga gcactcatct tggggtgggc ttactactta gatgctttca
133441 gcagttatcc gctccgcact tggctaccca gcgtttaccg tgggcacgat aactggtaca
133501 ccagaggtgc gtccttcccg gtcctctcgt actagggaaa ggtcctctca atgctctaac
133561 gcccacaccg gatatggacc gaactgtctc acgacgttct gaacccagct cacgtaccgc
133621 tttaatgggc gaacagccca acccttggaa catactacag ccccaggtgg cgaagagccg
133681 acatcgaggt gccaaacctt cccgtcgatg tgagctcttg gggaagatca gcctgttatc
133741 cctagagtaa cttttatccg ttgagcgacg gcccttccac tcggcaccgt cggatcacta
133801 aggccgactt tcgtccctgc tcgacgggtg ggtcttgcag tcaagctccc ttctgccttt
133861 gcactcgagg gccaatctcc gtccggcccg aggaaacctt tgcacgcctc cgttaccttt
133921 tgggaggcct acgccccata gaaactgtct acctgagact gtcccttggc ccgtaggtcc
133981 tgacacaagg ttagaattct agcccttcca gagtggtatc tcactgatgg ctcgggcccc
134041 cccggaagga ggccttcttc gccttccacc taagctgcgc aggaaaggcc caaagccaat
134101 cccagggaac agtgaagctt catagggtct ttctgtccag gtgcaggtag tccgcatctt
134161 cacagacatg tctatttcac cgagcctctc tccgagacag tgcccagatc gttacgcctt
134221 tcgtgcgggt cggaacttac ccgacaagga atttcgctac cttaggaccg ttatagttac
134281 ggccgccgtt caccggggct tcggtcgccg gctcccctgt catcaggtca ccaacttcct
134341 tgaccttccg gcactgggca ggcgtcagcc cccatacatg gtcttacgac tttgcggaga
134401 cctgtgtttt tggtaaacag tcgcccgggc ctggtcactg cgaccccctt tgtgaggagg
134461 caccccttct cccgaagtta cggggctatt ttgccgagtt ccttagagag agttgtctcg
134521 cgcccctagg tattctctac ctacccacct gtgtcggttt cgggtacagg taccctcttg
134581 ctcaaagtcg ttcgagcttt tcctgggagt atggcatggg ttacttcagc gccgtagcgc
134641 ctggtattcg aacattggct cgaggcattt tctctacccc ttcttaccct gacaaagcag
134701 ggacacctta cgttcttgaa ccgataacca tctttcggct aacctagcct cctccgtccc
134761 tcgggaccaa caaggggcag tacaggaata ttcacctgtt gtccatcgac tacgcctttc
134821 ggcctgatct taggccctga ctcaccctcc gtggacgaac cttgcggagg aacccttagg
134881 ttttcggggc attggattct caccaatgtt tgcgttactc aagccgacat tctcgcttcc
134941 gcttcgtcca ccaccgctcg cgcggaggct tctctctaag gcggaacgct cccctaccga
135001 tgtattttta catcccacag cttcggcaga tcgcttagcc ccgttcatct tcggcgcaag
135061 agcgctcgat cagtgagcta ttacgcactc tttcaagggt ggctgcttct aggcaaacct
135121 cctggctgtc tctgcacccc tacctccttt atcactgagc gatcatttag gggccttagc
135181 tggtgatccg ggctgtttcc ctctcgacga tgaagcttat cccccatcgt ctcactagcc
135241 gaccttgacc cctgttattt tgaggtcata tctagtattc agagtttgcc tcgatttggt
135301 accgctctcg cggcccgcac cgaaacagtg ctttacccct agatgtccag tcaactgctg
135361 cgcctcaacg catttcgggg agaaccagct agctctgggt tcgagtggca tttcacccct
135421 aaccacaact catccgctga ttcttcaaca tcagtcggtt cggacctcca cttagtttca
135481 cccaagcttc atcctggtca tggatagatc acccaggttc gggtccataa gcagtgacaa
135541 ttgccctatg aagactcgct ttcgctacgg ctccggtggg ttcccttaac caagccactg
135601 cctatgagtc gccggctcat tcttcaacag gcacgcggtc agagccctgg ctcctcccac
135661 tgcttgggag cttacggttt catgttctat ttcactcccc gatgggggtt cttttcaccc
135721 ttccctcacg gtactacttc gctatcggtc acccaggagt atttagcctt gcaaggtggt
135781 ccttgctgat tcacacggga ttccacgtgc cccatgctac tcgggtcaga gcataagcta
135841 gtgatgcttt cggctactgg actttcgcca tctagggtgc agcattcggg ctgcttcgcc
135901 tagcagcacg acgcttgtat tgctctccca caaccccgtt ttcacggttt aggctgctcc
135961 catttcgctc gccgctacta cgggaatcgc ttttgctttc ttttcctctg gctactaaga
136021 tgtttcagtt cgccaggttg tctcttgcct gcccatggat tcagcagcag ttcgaaaggt
136081 tgccctattc gggaatctcc ggatctatgc ttattttcaa ctccccgaag catttcgtcg
136141 attactacgc ccttcctcgt ctctgggtgc ctaggtatcc accgtaagcc tttcctcgtt
136201 tgaacctcgc ccttcacttt taaggctatg ccatcctaag gtgctgctaa atggatggat
136261 cttatcaacg tccatgaatg ataaatcata gatcgaaccg ccgaatcgga aaaattgggt
136321 gctatcataa agctttgtat cggctaagtt cacgagttgg agataagcgg actcgaaccg
136381 ctgacatccg ccgcagggta aaccaccgcc tctcaggtcc cccgactgat tctaccatag
136441 aggccaacga tagacaataa ctcccccccg aacacagctt acaactttca tcgtactgtg
136501 ctctccaaag agcaactctt ctcaaaatct cactcaaaag gtgctgagtt ggaatcccat
136561 tctaactaag aatgagtcat tgcccttctc cgaccctgac tgcccaacct gagagcggac
136621 agctaatgcg ttccacttat tgaacagggt tctatggtcg gtccgtgacc cctggatgcc
136681 gaaggcgtcc ttggggtgat ctcgtagttc ctacggggtg gagatgatgg ggtcggtcca
136741 tggattttcc ttccttttct tttgccgcat ttcgctcaaa gggttgaagg gagatagtgc
136801 atcaagctgt tcgcaagggc caacttgatc ctcttcccca gagatctcag atgagggaac
136861 cctgggagag ccgccgactc caactaccgt ccatgtacga tccatactag atctgaccaa
136921 ctgcccatcc tacctcctct acgttcttga cagcccatct ttgtctcagt agagtctttc
136981 agtggcacgt ttcggtcctc ttccccatta cttagaaaaa gtgagccacc ggttcaggta
137041 caagatacta tcattaccgc ctggacaatt agacatccaa cccgtaatcg caacgaccca
137101 attgcaagag cggagctcta ccaactgagc tatatccccc cgagccaagt ggagcatgca
137161 tgaagtagtc agatgcttct tctattcttt tccctggcgc agctgggcca tcctggactt
137221 gaaccagaga cctcgcccgt gaagtaaatc atcgcaccta cggtccaacc aattgggaga
137281 gaatcaatag attccttttc gggagcgatt catccttccc gaacgcagca tacaactctc
137341 cgttgtactg cgctctccaa gtgtgcttgt tccccccttc ttccttaccc tggcaagtct
137401 ttgtgaaata actccgatga gaagaaaaaa gaaggcgtta agagaccctc ctggcccaac
137461 cctagacact ctaagatcct ttttcaaacc tgctcccatt tcgagtcaag aaaaaaacgg
137521 ctcgaatggt acgatccctc cgtcacccca gaatgaaagg ggcgatctcg tagttcttgg
137581 tctgtgaaga tgcgttgtta ggtgctccat tttattttcc cattgctaaa cctgtgctcg
137641 agagatagct gtccatacac tgataaggga tgtatggatt ctcgagaaga gaggagccgt
137701 ggtggtcccc cccggaccgc ccggatccca cgagtgaatc gaaagttgga tctacattgg
137761 atctcacccg aatcgcccca tctatcctcc tgaggaggag tttggtttca aaccccggtt
137821 cgaacaggag gagtacgcca tgctaatgtg ccttggatga tccacatctc agggtcaggc
137881 gccgatgagc acattgaact atccatgtgg ctgagagccc tcacagccca ggcacaacga
137941 cgcaattatc aggggcgcgc tctaccactg agctaatagc ccgtcgtgcg agcctcccac
138001 tgggggcccg ctatgccaaa agcgagagaa accccatccc tctctttcct tttttcgccc
138061 ccatgtcgcc acacgggggg aacatgggga cgtaaaaaag ggggtcctat caacttgttc
138121 cgacctagga taataagctc atgagcttgg tcttacttca ccgtcgagaa aggaaagaag
138181 acttccatct ccaagtttaa ctcagacgta gctcccttct tttttttggg ggtgtgaagc
138241 agtgtcaaac caaaataccc aacaagcatt agctctccct gaaaaggagg tgatccagcc
138301 gcaccttcca gtacggctac cttgttacga cttcactcca gtcactagcc ctgccttcgg
138361 catccccctc cttgcggtta aggtaacgac ttcgggcatg gccagctccc atagtgtgac
138421 gggcggtgtg tacaaggccc gggaacgaat tcaccgccgt atggctgacc ggcgattact
138481 agcgattccg gcttcatgca ggcgagttgc agcctgcaat ccgaactgag gacgggtttt
138541 tggggttagc tcacctcgcg ggatcgcgac cctttgtccc ggccattgta gcacgtgtgt
138601 cgcccagggc ataaggggca tgatgacttg acgtcatcct caccttcctc cggcttatca
138661 ccggcagtct gttcagggtt ccaaactcaa cgatggcaac taaacacgag ggttgcgctc
138721 gttgcgggac ttaacccaac accttacggc acgagctgac gacagccatg caccacctgt
138781 gtccgcgttc ccgaaggcac ccctctcttt caagaggatt cgcggcatgt caagccctgg
138841 taaggttctt cgctttgcat cgaattaaac cacatgctcc accgcttgtg cgggcccccg
138901 tcaattcctt tgagtttcat tcttgcgaac gtactcccca ggcgggatac ttaacgcgtt
138961 agctacagca ctgcacgggt cgatacgcac agcgcctagt atccatcgtt tacggctagg
139021 actactgggg tatctaatcc cattcgctcc cctagctttc gtctctcagt gtcagtgtcg
139081 gcccagcaga gtgctttcgc cgttggtgtt ctttccgatc tctacgcatt tcaccgctcc
139141 accggaaatt ccctctgccc ctaccgtact ccagcttggt agtttccacc gcctgtccag
139201 ggttgagccc tgggatttga cggcggactt aaaaagccac ctacagacgc tttacgccca
139261 atcattccgg ataacgcttg catcctctgt attaccgcgg ctgctggcac agagttagcc
139321 gatgcttatt ccccagatac cgtcattgct tcttctccgg gaaaagaagt tcacgacccg
139381 tgggccttct acctccacgc ggcattgctc cgtcagcttt cgcccattgc ggaaaattcc
139441 ccactgctgc ctcccgtagg agtctgggcc gtgtctcagt cccagtgtgg ctgatcatcc
139501 tctcggacca gctactgatc atcgccttgg taagctattg cctcaccaac tagctaatca
139561 gacgcgagcc cctcctcggg cggattcctc cttttgctcc tcagcctacg gggtattagc
139621 agccgtttcc agctgttgtt cccctcccaa gggcaggttc ttacgcgtta ctcacccgtc
139681 cgccactgga aacaccactt cccgtccgac ttgcatgtgt taagcatgcc gccagcgttc
139741 atcctgagcc aggatcgaac tctccatgag attcatagtt gcattactta tagcttcctt
139801 gttcgtagac aaagcggatt cggaattgtc tttcattcca aggcataact tgtatccatg
139861 cgcttcatat tcgcccggag ttcgctccca gaaatatagc catccctgcc ccctcacgtc
139921 aatcccacga gcctcttatc cattctcatt gaacgacggc gggggagcaa atccaactag
139981 aaaaactcac attgggctta gggataatca ggctcgaact gatgacttcc accacgtcaa
140041 ggtgacactc taccgctgag ttatatccct tccccgcccc atcgagaaat agaactgact
140101 aatcctaagt caaagggtcg agaaactcaa cgccactatt cttgaacaac ttggagccgg
140161 gccttctttt cgcactatta cggatatgaa aataatggtc aaaatcggat tcaattgtca
140221 actgccccta tcggaaatag gattgactac cgattccgaa ggaactggag ttacatctct
140281 tttccattca agagttctta tgcgtttcca cgcccctttg agaccccgaa aaatggacaa
140341 attccttttc ttaggaacac atacaagatt cgtcactaca aaaaggataa tggtaaccct
140401 accattaact acttcattta tgaatttcat agtaatagaa atacatgtcc taccgagaca
140461 gaatttggaa cttgctatcc tcttgcctag caggcaaaga tttacctccg tggaaaggat
140521 gattcattcg gatcgacatg agagtccaac tacattgcca gaatccatgt tgtatatttg
140581 aaagaggttg acctccttgc ttctctcatg gtacactcct cttcccgccg agcccctttt
140641 ctcctcggtc cacagagaca aaatgtagga ctggtgccaa caattcatca gactcactaa
140701 gtcgggatca ctaactaata ctaatctaat ataatagtct aatatatcta atataataga
140761 aaatactaat ataatagaaa agaactgtct tttctgtata ctttccccgg ttccgttgct
140821 accgcgggct ttacgcaatc gatcggatta gatagatatc ccttcaacat aggtcatcga
140881 aaggatctcg gagacccacc aaagtacgaa agccaggatc tttcagaaaa cggattccta
140941 ttcaaagagt gcataaccgc atggataagc tcacactaac ccgtcaattt gggatccaaa
141001 ttcgagattt tccttgggag gtatcgggaa ggatttggaa tggaataata tcgattcata
141061 cagaagaaaa ggttctctat tgattcaaac actgtaccta acctatggga tagggatcga
141121 ggaaggggaa aaaccgaaga tttcacatgg tacttttatc aatctgattt atttcgtacc
141181 tttcgttcaa tgagaaaatg ggtcaaattc tacaggatca aacctatggg acttaaggaa
141241 tgatataaaa aaaagagagg gaaaatattc atattaaata aatatgaagt agaagaaccc
141301 agattccaaa tgaacaaatt caaacttgaa aaggatcttc cttattcttg aagaatgagg
141361 ggcaaaggga ttgatcaaga aagatctttt gttcttctta tatataagat cgtgattgga
141421 tccgcatatg tttggtaaag agaataatct tatcctttga gaataatcaa aaatggacag
141481 tgttcaattg gaacatgaaa acgtgactaa attggtccta gttactcttc ggggcggagt
141541 ggaagaaggg ggggattctc gaacgcggaa aggatccaat gaattcgaaa gaattgaacg
141601 aggagccgta tgaggtgaaa atctcatgta cggttctgta gagtggcagt aagggtgact
141661 tatctgtcaa cttttccact atcaccccaa aaaaaccaaa ctctgcctta cgtaaagttg
141721 ccagagtacg attaacctct ggatttgaaa tcactgctta tatacccggt attggccata
141781 atttacaaga acattctgta gtcttagtaa gagggggaag ggttaaggat ttacccggtg
141841 tgagatatca cattgttcga ggaaccctag atgctgtcgg agtaaaggat cgtcaacaag
141901 ggcgttctag tgcgttgtag attcttatcc aagacttgta tcatttgatg atgccatgtg
141961 aatcgctaga aacatgtgaa gtgtatggct aacccaataa cgaaagtttc gtaaggggac
142021 tggagcaggc taccatgaga caaaagatct tctttctaaa gagattcgat tcggaactct
142081 tatatgtcca aggttcaata ttgaaataat ttcagaggtt ttccctgact ttgtccgtgt
142141 caacaaacaa ttcgaaatac ctcgactttt ttagaacagg tccgagtcaa atagcaatga
142201 ttcgaagcac ttctttttac actatttcgg aaacccaagg actcaatcgt atggatatgt
142261 aaaatacagg atttccaatc ctagcaggaa agggagggaa acggatactc aatttaaagt
142321 gagtaaacag aattccatac tcgatctcat agatacatat agaattctgc ggaaagccgt
142381 attcgatgaa agtcgtatgt acggcttgga gggagatctt tcatatcttt cgagatccac
142441 cctacaatat ggggtaaaaa agccaaaata agtgatttta gcccttataa aaagaaaact
142501 gattcttgaa cccctttcac gctcatgtca cgtcgaggta ctgcagaaaa aaaaacagca
142561 aaatccgatc caatttatcg taatcgatta gttaacatgt tggttaaccg tattctgaaa
142621 cacggaaaaa aatcattggc ttatcaaatt atctatcgag ccgtgaaaaa gattcaacaa
142681 aagacagaaa caaatccact atccgtttta cgtcaagcaa tacgtggagt aactcccgat
142741 ataacagtaa aagcaagacg tgtaggtgga tcgactcatc aagttcccat tgaaatagga
142801 tccacacaag gaaaagcact tgccattcgt tggttattag cggcatcccg aaaacgtccg
142861 ggtcgaaata tggctttcaa attaagttcc gaattagtgg atgctgccaa agggagtggc
142921 gatgccatac gcaaaaagga agagactcat agaatggcag aggcaaatag agcttttgca
142981 cattttcgtt aatccatgaa caggatctat acatctcgat cggaaaagaa tcaagagaaa
143041 aagaaagaat cggaattgat cgatagattt ctcgaaacaa acgaaaagga aagatgaaac
143101 ataaatcatg gatcaactaa gcctctcggg gactttctta aagaggaacc tcatgtaaat
143161 accatggaat aaggtttgat cctattcatg gagattccgt aactattcca aaaatggaaa
143221 gttcgacaca attgggattt tttttggaaa ttggaagcag ttactaattc atgatctggc
143281 atgtacagaa tgaaaacttc attctcgatt ctacgagaat ttttatgaaa gcctttcatt
143341 tgcttctctt cgatggaagt ttgattttcc cagaatgtat cctaattttt ggcctaattc
143401 ttcttctgat gatcgattca acctctgatc aaaaagatat accttggtta tatttcatct
143461 cttcaacaag tttagtaatg agcataacgg ccctattgtt ccgatggaga gaagaaccta
143521 tgattagctt ttcgggaaat ttccaaacga acaatttcaa cgaaatcttt caatttctta
143581 ttttactatg ttcaactcta tgtattcctc tatccgtaga gtacattgaa tgtacagaaa
143641 tggctataac agagtttctc ttattcgtat taacagctac tctaggggga atgtttttat
143701 gcggtgctaa cgatttaata actatctttg tagccccaga atgtttcagt ttatgctcct
143761 acctattatc tggatatacc aagaaagatg tacggtctaa tgaggctact atgaaatatt
143821 tactcatggg tggggcaagc tcttctattc tggttcatgg tttctcttgg ctatatggtt
143881 catccggggg agagattgag cttcaagaaa tagtaaacgg tcttatcaat acacaaatgt
143941 ataactcccc aggaatttca attgcgctca tattcattac cgtaggaatt gggttcaagc
144001 tttccccagc cccttctcat caatggactc ctgacgtata cgaaggagtg cggttcgttc
144061 gagaaattcc tacctctcta tctatctctg agatgtttgg atttttcaaa actccatgga
144121 catgcagaag agaaatgcta tccccactcg gaccaagaca gaacttttac ttgttcaaat
144181 aacaattaag gtgaagcagg gtcaggaacg acgaatctct ttatgataaa cagatccatt
144241 ttgcaagttc gttattacgg gtagttccta caaaggatcg gactaatgac gtatacaata
144301 cttgaattct cgatgtagat gctacatagt tggttctcat ccttcagaga ctacgagtgt
144361 aataagagca tccgtcgaca aaaggatcac cctaagatga tcatctcgtg gctattgaga
144421 acgaattaaa tcagatggtt ctatttctca atctttctga cttgctccta cgaaaccaag
144481 gtcgaaaaga ttgaaaaaat cagtcattca caaccactga tgaaggattc ctcgaaaagt
144541 taaggattag taatcctttt tagaaatcga atggattcgg tcttatacat acgcgaggaa
144601 ggtaatcaaa aaagaaagaa aatgggttct tctttctttt atcacttagg agccgtgtga
144661 gatgaaagtc tcatgcacgg ttttgaatga gagaaagaag tgaggaatcc tcttttcgac
144721 tctgactctc ccactccagt cgttgctttt ctttctgtta cttcgaaagt agctgcttca
144781 gcttcagcca ctcgaatttt cgatattcct ttttatttct catcaaacga atggcatctt
144841 cttctggaaa tcctagctat tcttagcatg atattgggaa atctcattgc tattactcaa
144901 acaagcatga aacgtatgct tgcatattcg tccataggcc aaatcggata tgtaattatt
144961 ggaataattg ttggagactc aaatgatgga tatgcaagca tgataactta tatgctgttc
145021 tatatctcca tgaatctagg aacttttgct tgcattgtat tatttggtct acgtaccgga
145081 actgataaca ttcgagatta tgcaggatta tacacaaaag atcctttttt ggctctctct
145141 ttagccctat gtctcttatc cctaggaggt cttcctccac tagcaggttt tttcggaaaa
145201 ctctatttat tctggtgtgg atggcaggca ggcctatatt tcttggtttt aataggactc
145261 cttacaagcg ttgtttctat ctactattat ctaaaaataa taaagttatt aatgactgga
145321 cgaaaccaag aaataacccc tcacgtgcga aattatagaa gatccccttt aagatcaaac
145381 aattccatcg aattgagtat gattgtatgt gtgatagcat ctactatacc aggaatatca
145441 atgaacccaa ttattgcaat tgctcaggat agcctttttt agcttctagg gtctatttct
145501 tagttcaaga tccctcttac taactggaat caaagaatta gtagatctgt tccgcccaaa
145561 atgggaatgg gctagggtta tgaacttata atctgatgat cgagtcgatt ccatgattat
145621 aagttcattc cataccggac caggccggaa tagggttata tacattctca ttatgagaag
145681 gggtcattcg ggcctatcta aatagatact atgtttacat atggattcct acatcattac
145741 attccattta ggattaggaa tacgcgtaat cggacctgct ttttacatat ctctattggg
145801 accctattca cctctttgag tgaatcgaga aataggtttg attgtccatc tttttgatat
145861 atatcaggca ttgcattctc cggataattc aaatcgaagc aattggatgt ccaactcggg
145921 cctatatgac atgaccgatc aatagatcca cctttgtcat atattccata catcacacta
145981 gatagatatc atattcatgg aatacgattc actttcaaga tgccttggtg gtgaaatggt
146041 agacacgcga gactcaaaat ctcgtgctaa atagcgtgga ggttcgagtc ctcttcaagg
146101 cataatattg agaatgctca ttgaatgagc attctcaata agagagctcg gatcgaatcg
146161 gtattgatat accgattcga tccgagctct tggaattgga ataaattcgg cagcggatcg
146221 cgaaatcttg gtgatcttct ctatctaatg aatggggagt ccgctttaaa atcgtccgcc
146281 ctgcacccac cccccgagta tatgcttcaa caggaatcac acaagggtag attagaaacc
146341 tctggtaaaa tgcccgcccg taacccagca gataaagtac attacatagt ccagggattg
146401 gcgacttacc cattcagtga ctttggcact ggacgttccc aaaatgggga ctatcgggta
146461 aattcaatat aatagacgcc tgttggcatt ccagccttcc ttctcctttc agggcctatc
146521 cgaaagagaa tccagtactt cttggtcgtg aatatctgaa ctggttgttt gctgttcaag
146581 aattcttgtt taggcagttc ataccatcca tacatagtgt tttgatctaa gatttcaatt
146641 cttccgtgtt tcagcagtaa catattcttc catggagcta aggtccaaaa tatggaagaa
146701 acaagcgttt ccacgactct accacccagt caattctgtt ccacttaatc cctctttcat
146761 ggccacatat ctttccggct aaggaatggg aaatctttct cctgttacat gaatccaatt
146821 ttcatttcat ccgggaaaag ccatcttttt ctcaacaatg tctttgtcat ttgatccaat
146881 agcgttccgt tagataggaa cagatttgat aaatactgat aactctcgga tagagtatta
146941 gaacggaaag atccattaga taatgaactg ttggttctaa gccatctctg acgattaatc
147001 aacaattcga agtgcttttc ttgcgtattc ttgataaacc agcgtttata tatagatgta
147061 ggagggtctg tttgggaagt aagaagcccc tttgacatct cttcatctgc aaataattct
147121 cgatgtgaaa acacagagcc agggggctga tctttgaata ggaaaaagag tggatctgca
147181 gggtcccaaa tgaattggct tattcgaaaa aggccttgtt ctttggaaga tctatctcgt
147241 gtctggtact gcatggttcc actctgcaag aactccgaat cattctcttg aagctcatcc
147301 tcttcatcat aaatgatccg cttgccccga aatgacctgg accaataggg aaatcccaat
147361 tcattgggcc tttcgataca atcaaataga aagccccaag ggcgccatat tctaggagcc
147421 caaactatgt gattgaataa atcctcctgc gggtcaaggg ctccttctcc ctccccttct
147481 tcaaactccg attcatattt ttcatagaga aatctctgat caaggataga acaagagccg
147541 ttttgcatca tatctaaggg attcctcggt tcgggccgaa gaagcaatgt cactcgatca
147601 ttatcaaact gactgcaatc tttttctgtc cgtgaagatc ccaccagagc gccttctact
147661 tctaataggc catgaactag atcagaatca ttctcaacga gtccataaga agtgatccca
147721 tttttttcat cgggtccgga taaagaccaa agatcttgag cgaccgatcc ggcagaacaa
147781 ctcaaaagat aaagaagtat cgttaatctc ttcatgctcg ttccaagctc gaagtaccat
147841 ttgtacaaat aagaatcccc ttcgttacat gatttcttct tcatatagat agatatagga
147901 tctatggggc aattacttag aagtacattt tgtgctacag cccttcctat ctgatagaaa
147961 aggatcccat gatcctgaac cgatcttacc tgggatcgca aatcccaagt ttgtctatga
148021 agagcggatc taattgtatt agtgtctata attgatttct tctgtgtaat actaatcgat
148081 aggacctcat tggtaagtgc tacaagatct cgtgcattgg aacccatggt tatggacccg
148141 aatccgttag tatggaacat tttcttttcc aagtgaaatc ccctagtata tgaaagagtg
148201 aaaaagtgct ttcgttgttg tggaagaaga agccttcgta tcttaatgca cgtatttaat
148261 ttattcggag ctattagagc gggatccact ttttggggaa tatgagtcga agcaataaca
148321 agaatatttc tagtagaaca tctttcacaa tccctggaga gatggttcac taatagaccg
148381 agggctaagt cattcgactc attcacatcc agatcatgaa tgtttggaat ccatattatg
148441 caaggagaca ttgcttttgc taattcgaat tgaagggtga tataaaatcg gtctatttcc
148501 ggcatcatat ccatagttag cccattcatc ctagttagca gtttcagctc cgtatcaagg
148561 tcacgatcga tatcgtcact agcatcaaga ttgtcactat catcaatatc gtcactatca
148621 tcaatatcga tctcatcaag aagaaaacct ttaggcttgt tatccaggaa cttgttcaga
148681 aataccgtaa tgaaaggaac ataggagttt gtcgctaggt atttgaccaa ataggatcgt
148741 ccagttccta tagaacctat cactaaaata cccctagagg gggataaggc taagcggagc
148801 gaaaagggtt ttccatgaga tgggaaatga aaactatttt ccccacacga agtttgtgaa
148861 taagtgattg tctgataatg agcaaggaat atccgtcttt ctgctaaaca ggatggattg
148921 aactcataat tcattagatg ctttttatga atgtcaacta agtatcgtaa gtaaattgct
148981 cccggttgtt caatcatttg ataaccagag tcattctttg ataaacgatc actatgagtc
149041 agactcaata gaatttgatc aatcctattt tctgtcgtta aggtggagaa ctgaaccaag
149101 aattctcttt cttcatcatc aatcgaatca ctgttcgcga cccaggattc tattttatca
149161 tcaatccaat ccccgttcac gttttttctt tttcttatca atgaatagat ctctttactt
149221 gtatgactta gatgtctcgt atttctcgaa aaagtgattc gattgatggg atttggtatg
149281 agatcgatga tctcgatgag attgatattc caatctttct tcttagaacg tattgatttg
149341 accccataag cgggaccaag catgttgccg ccagaagcag aaccccgtat ttcttctaga
149401 gaatctccta attgttccag agcaactaga aagagattct ttaaccagaa agaattcggt
149461 tcagatgtag gatacctatc cagaagtttt cgcaactcaa tcatagatga tggaatcatc
149521 aaagatttga ccttttcgaa ctctgtctgt aactcactag aggcccggga aacaaagaga
149581 agatgtgtac gaacgagata tccagcaaca agaagaagga aaaggattga atagaggaac
149641 tcccgagcat ttggcgatct cagatgtgtc gatatcaatg gtgactcatt atttcgatga
149701 atcatttctt cggacagaag aagattatgt aaacacttac tcgagatctc acttatcaga
149761 ttccattgtg gaagacacaa ttttttctga agaattcgcc atgatatacc tgatccatgc
149821 ataatatcat gaaaaatggg tacaaatttt tgactgctac ttagtattgg caataggtct
149881 gaaaaagtat ctaaaaatat caaatttaga tatttgtacc ctgtcgaagt aaggaaccat
149941 ggcatatatg tttggaatag attccatttt gagagagttg aaaaagcact atctcgttga
150001 aaggttctat acatctgccc tttctcaacg catttcttta gacaaagact ccgttttttc
150061 ctcttttcgg atgataaatc tttctcagaa catggagtgt gaatcaaacc catgtttgaa
150121 ttgaaattga gatactgatg caagttcttc ccttctgaat cagatagatt catatctgaa
150181 agaggttgac aataagttct ttcaaaattg actatttgcc cctctgttag aggtgttcca
150241 gaaatgtctg cgatcgagta aatagctcta cgaacgaatg gatcggatcg acttggaaaa
150301 tggaaagatt tgtacaagtt atacgtttcg tcaccacttt gtggaaaatc gttaggtatg
150361 aatatgttag atacctgtga ctcgattggt gaaatagtat ctctccccca aaaagcatgt
150421 ttttttttac cgacgcacaa agaaaatatt ttgttgcgaa tgaacaagat attgaggaat
150481 tgtccatacg taaaatcaga attattgata cgggcctttt ccacagaaaa ggggaatctt
150541 gtgttccaat agaagcagaa gtgatgtgga ttattcaaga atcgaagtcg atttgcttta
150601 taaaaagaag atatcaatga acttctatga aatggtttca cgggattcag ccaattgtct
150661 tgatcgtgga atatcattga gaaataggaa tccgggttat caaaggattt cctgcgatta
150721 tttctagtat ggaatgagtc aatcatccac tttggtatct tattgaacaa aaatggtgat
150781 attgttcctc cattgatcaa gaatttcgat ttttgggaag tatcatgatc gtccaataag
150841 aagggtttcc attttttcaa atgaacaatt tgaagaccta ttgattctaa caactgattg
150901 cagagttgat cattcggacc tttcaattca tagatgtaga tctcggacct atgaatgggg
150961 atatttccga aactcacaca gaaaaaagga agtgagttag acaaaaagaa aagcaacttg
151021 gacaaaaaaa gaagtgactt ggacaaaaag aaacgaagtg gcttagacaa atcttttttg
151081 tcgataacct cagaccaatc aatcgaatat tgattaatac gtaatcgatc gaacactact
151141 tgaaaacggc tcttctgctc cgaaacggac tgttccaaat gttcctggaa attcttgctc
151201 ccattggacc atttgtatct atatgcatca ggatcccgat tcatggatct ctcggttcga
151261 gaaatcaaaa taagaggctc gaaccatttc ttctgactct ttttcaaatt cgataaatat
151321 tggttgatcg tatatttcat tatagttcta tgattcagag tatcctttcc tatttgatcc
151381 ctttgaattc catattcgaa gttgcgatcg gatctattca ttaaaaagaa tcgattcaat
151441 acatttctta tgtacccata ggtactatat tggatttgaa tcagatttcg gatcaatcta
151501 tattgagtga ctgcctccat tatgttgttg ctagcaaata ccactatttt tggttttgga
151561 tcttccaaat cattcccgca ggagatccgg acccattttt ttctgatcct tcgagaaaaa
151621 gattcattct cttcataaaa aataggaggt agaaccaata aagatttctt tttcgattca
151681 tccctggcct cattcaagaa ttgtttttga tccaatccgc aggaatcaat agaaaaggca
151741 aatcccttat gatacaccag atccggctcg gttattgata gagtgaatag atctgccatt
151801 tcttgaaatc tctcttctga ttcaaaatcg tagtgtaacg tgtatcctcc cctgttccgg
151861 tcatggaata gatgaaataa atcaaaaaat ggatttttgt tcaagaatga aatcttattg
151921 gaactgtcca tatccggttc atcttcggaa ccatatcaca tcccggatct gatgaaatag
151981 gatgaattga gacggtattt tgtaaatacg taattatctt gaatatatta accatttctt
152041 tattttccga tcgcttggaa gggacaaaag aaagatcttg ttgtttcttc aacaatttct
152101 gatccctagt ggacctctca gtaggattcg aacccagatg aagttctgac catctatcag
152161 agaaaaaaga acgaacggat cttgtaggat tcccaagaaa ttcttcgatt tcttccggaa
152221 acagatgatt aatcatctgc ttctcacgtt ccgtgaatag ccgggacatt gaggaatatc
152281 cagaaaggca tttcgggaat cggcctgatt ctatctcttt tcgttccgtt tgaagaaagg
152341 aaggatccca aagaatcgat ctttcttttc gttgttgaat ctctctttga ttaatcaatg
152401 tgtgatattc cgaatcctca ttactaatgg aatccaaatg atctctggat tgatcagaag
152461 atcctttcag ttggctagaa tccgttactt gaacgaaact agatcttgtg gaatcatatt
152521 gaatatttga cgatacattc tgtaccttgc taaaaaaccg atccttgttt accaaccaca
152581 cattgtctaa ccaaatccaa ttctctctcg atacgttcct caaaaaatcc gattcgggcg
152641 gattcttccc ccaactaacg aagagatctt ggcggaattg ccacatatga aattgagcac
152701 agttttgcaa agaaatagcc cacttgtttc tcgagaagag atgggaaaca tgctcaatat
152761 catttgattg aatagttgac ccagcccctt gttgtttgaa gaaaccctcc acttcaattg
152821 gtattttttc acgaaaagca gacatgagat aagaaatcca gtgtttcact aagatttcga
152881 atagcggtcc cgaattcaag ttgattctat ttcgactctt cctcagagaa agacgatcaa
152941 acaattccca atcatggtcc ttgcggatcg gatcatccat ataatataca aaaagaaact
153001 ccagatattt gagatctttc tctttgaata agatctcaat tccagcgacg gtttcattag
153061 atatcttaca actagaatcc ctcttttttc cgatccagtt cctccaccaa cgcgaacccc
153121 agttagattc aggcatgcta cactttttag ttattgggag aacccaagta ctctctttcg
153181 gattcaggaa acaactctca gagatctttt ttcctttggg aagatacagg agcgaaacaa
153241 tcaacctatt gatattggaa gacccaacgg attcttccaa tgtatcattt ctgggtccaa
153301 tggaattcat aggtatagga agaagcccta tcaaatagag attttttctt tcgaccatat
153361 ttcgattgtt aatacgatat ataaggaccg ctactacaaa gagtattaca cccttgatcg
153421 tgaaatatcg attgcttgtt gaaccctgtg aattgcgtga aagtaggata ctccaaattc
153481 gggggtcaaa gagttttaga aaacgttctt ggtggaaaaa aatgtgaatg aaggatcccg
153541 ctgaattgaa ttgggtccat gaatctaaga aatggtgaga attcttgatc tctctcaata
153601 tctctctcaa ttcgaaaatc caggatttga attgatgtcc tctcattgat tcctcctaaa
153661 ttgcattgat ttatcctaac taaattgcat tgatttatcc taaagatttc atttcaattg
153721 gaatttggtt attcaccatg tacgaggatc cccgctaagc atccatggct gaatggttaa
153781 agcgcccaac tcataattgg cgaattcgta ggttcaattc ctactggatg cacgccaatg
153841 ggaccctcca ataagtctat tggaattggc tctgtatcaa tggaatctca tcatccatac
153901 ataacgaatt ggtgtggtat attcatatca taatatatga acagtaagaa ctagcattct
153961 tattgagact ataactcata gggaagaaaa tcgatttatg gatggaatca aatatgcagt
154021 atttacagac aaaagtattc ggttattggg gaaaaatcaa tatacttcta atgtcgaatc
154081 aggatcaact aggacagaaa taaagcattg ggtcgaactc ttctttggtg tcaaggtaat
154141 agctatgaat agtcatcgac ttccgggaaa gagtagaaga atgggaccta ttatgggaca
154201 tacaatgcat tacagacgta tgatcattac gcttcaaccg ggttattcta ttccacctct
154261 tagaaagaaa agaacttaaa aaaaaatact taatagcatg gcgatacatt tatacaaaac
154321 ttctaccccg agcacacgca atggaaccgt agacagtcaa gtgaaatcca atccacgaaa
154381 taatttgatc tatggacagc atcattgtgg taaaggtcgt aatgccagag gaatcattac
154441 cgcaaggcat agagggggag gtcataagcg tctataccgt aaaatcgatt ttcgacggaa
154501 tgaaaaagac atatatggta gaatcgtaac catagaatac gaccctaatc gaaatgcata
154561 catttgtctc atacactatg gggatggtga gaagagatat attttacatc ccagaggggc
154621 tataattgga gataccattg tttctggtac agaagttcct ataaaaatgg gaaatgccct
154681 acctttgagt gcggtttgaa ctattgattt acgtaattgg aaataaccaa ttaggtttac
154741 gacgaaacct agaaatcgat cactgatcca atttgagtac ctctgcagga tagacctcaa
154801 cagaaaactg aagagtaacg gcagcaagtg attgagttca gtagttcctc atataaaatt
154861 attgactcta gagatatagt aatatggaga agacaaaatt gtttcaagca ccgacagaac
154921 cggaagcgcc ccttctttca aagagaggag gacgggttat tcacatttca tttgatggtc
154981 agaggcgaat tgaaagttaa gcagtgggaa ttctaaagat tccccggggg aaaaatagag
155041 atgtctccta cgttacccat aatatgtgga agtatcgacg taatttcata gagtcattcg
155101 gtctgaatgc tacatgaaga acataagcca gatgacggaa cgggaagacc caggatgtag
155161 aagatcataa catgagtgat tcggcagatt tggattcata tatatatcca cccatgtggt
155221 acttcattct acgatatata taagatccat ctgtatagat atcatcatct acatccagaa
155281 agaagtatgc tttggaagaa gcttgtacag tttgggaagg ggttttgatt gatcaaaaga
155341 agaatctact tcaaccgata tgcccttagg cacggccata cataacatag aaatcacact
155401 tggaaagggt ggacaattag ctagagcagc gggtgctgta gcgaaactga ttgcaaaaga
155461 ggggaaatcg gccacattaa aattaccttc tggggaggtc cgtttgatat ccaaaaactg
155521 ctcagcaaca gtcggacaag tggggaatgt tggggtgaac cagaaaagtt tgggtagagc
155581 cggatctaag cgttggctag gtaagcgtcc tgtagtaaga ggagtagtta tgaaccctgt
155641 agaccatccc catgggggtg gtgaagggag agccccaatt ggtagaaaaa aacccacaac
155701 cccttggggt tatcctgcac ttggaagaag aagtagaaaa aggaataaat atagtgataa
155761 tttgattctt cgtcgccgta gtaaatagga gagaaaatcg aattaaattc ttcgttttta
155821 caaaaaaaaa aaaaatagga gtaa
//
LOCUS TIPNPSS 6425 bp ds-DNA SYN 09-AUG-1990
DEFINITION A.tumefaciens T-DNA vector containing octopine T-DNA borders and
markers: neomycin-phosphotransferase - octopine synthase (3' end)
and Sp/Sm adenyltransferase. complete cds.
ACCESSION M35007
KEYWORDS neomycin phosphotransferase;
streptomycin/spectinomycin adenyltransferase.
SOURCE N.tabacum T-DNA inserts in A.tumefaciens DNA.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 6425)
AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and
Depicker,A.
TITLE Cloning and sequence analysis of truncated T-DNA inserts from
Nicotiana tabacum
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.R.Gheysen, 01-JUN-1990.
FEATURES from to/span description
pept 2782 1985 (c) neomycin phosphotransferase (gtg start codon)
pept.ps 5009 4042 (c) streptomycin/spectinomycin adenyltransferase
(gtg start codon)
mRNA / 1811 1105 (c) octopine synthase (3' end)
recomb 24 25 T-DNA end/plant DNA start
recomb 40 41 plant DNA end/T-DNA start
recomb 1094 1095 plant DNA end/T-DNA start
recomb 2786 2787 T-DNA end/plant DNA start
recomb 3044 3045 T-DNA end/plant DNA start
recomb 3354 3355 plant DNA end/T-DNA start
recomb 5585 5586 T-DNA end/plant DNA start
recomb 6389 6390 T-DNA end/plant DNA start
signal 1650 1645 (c) poly-A signal
signal 1686 1681 (c) poly-A signal
site 1634 1633 (c) major poly-A site
site 1 24 left T-DNA border
site 372 395 24 bp border-like sequence
site 1569 1592 24 bp border-like sequence
site 1669 1692 24 bp border-like sequence
site 1779 1756 (c) 24 bp border-like sequence
site 2128 2105 (c) 24 bp border-like sequence
site 2449 2472 24 bp border-like sequence
site 2485 2462 (c) 24 bp border-like sequence
site 3660 3683 24 bp border-like sequence
site 3875 3898 24 bp border-like sequence
site 4359 4336 (c) 24 bp border-like sequence
site 5868 5891 24 bp border-like sequence
BASE COUNT 1509 a 1754 c 1710 g 1452 t
ORIGIN
1 cggcaggata tattcaattg taaatggctt catgtccggg aaatctacat ggatcagcaa
61 tgagtatgat ggtcaatatg gagaaaaaga aagagtaatt accaattttt tttcaattca
121 aaaatgtaga tgtccgcagc gttattataa aatgaaagta cattttgata aaacgacaaa
181 ttacgatccg tcgtatttat aggcgaaagc aataaacaaa ttattctaat tcggaaatct
241 ttatttcgac gtgtctacat tcacgtccaa atgggggctt agatgagaaa cttcacgatc
301 gatgccttga tttcgccatt cccagatacc catttcatct tcagattggt ctgagattat
361 gcgaaaatat acactcatat acataaatac tgacagtttg agctaccaat tcagtgtagc
421 ccattacctc acataattca ctcaaatgct aggcagtctg tcaactcggc gtcaatttgt
481 cggccactat acgatagttg cgcaaatttt caaagtcctg gcctaacatc acacctctgt
541 cggcggcggg tcccatttgt gataaatcca ccatcacaat agatagtcta atggacgaaa
601 aaggcgaata tttcgatgct gagattcgac gcaattaatt cgagaaaaat cccgtgattg
661 atgctgttga gttaccaata atatgggcag cgaaggccat ttaattataa gatcctgcaa
721 gcctcgtcgt cctggccgga ccacgctatc tgtgcaaggt ccccggcccc ggacgcgcgc
781 tccatgagca gagcgcccgc cgccgaggcg aagagtcggg cggcgccctg cccgtcccac
841 caggtcaaca ggcggtaacc ggcctcttca tcgggaatgc gcgcgacctt cagcatcgcc
901 ggcatgtccc cctggcggac gggaagtatc cagctcgacc aaagcggcca tcgtgcctcc
961 ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc
1021 gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa
1081 ccaactcgcg aggggatcga gcccctgctg agcctcgaca tgttgtcgca aaattcgccc
1141 tggacccgcc caacgatttg tcgtcactgt caaggtttga cctgcacttc atttggggcc
1201 cacatacacc aaaaaaatgc tgcataattc tcggggcagc aagtcggtta cccggccgcc
1261 gtgctggacc gggttgaatg gtgcccgtaa ctttcggtag agcggacggc caatactcaa
1321 cttcaaggaa tctcacccat gcgcgccggc ggggaaccgg agttcccttc agtgaacgtt
1381 attagttcgc cgctcggtgt gtcgtagata ctagcccctg gggccttttg aaatttgaat
1441 aagatttatg taatcagtct tttaggtttg accggttctg ccgctttttt taaaattgga
1501 tttgtaataa taaaacgcaa ttgtttgtta ttgtggcgct ctatcataga tgtcgctata
1561 aacctattca gcacaatata ttgttttcat tttaatattg tacatataag tagtagggta
1621 caatcagtaa attgaacgga gaatattatt cataaaaata cgatagtaac gggtgatata
1681 ttcattagaa tgaaccgaaa ccggcggtaa ggatctgagc tacacatgct caggtttttt
1741 acaacgtgca caacagaatt gaaagcaaat atcatgcgat cataggcgtc tcgcatatct
1801 cattaaagca gggggtgggc gaagaactcc agcatgagat ccccgcgctg gaggatcatc
1861 cagccggcgt cccggaaaac gattccgaag cccaaccttt catagaaggc ggcggtggaa
1921 tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc
1981 ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg
2041 cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat
2101 cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg ccacagtcga
2161 tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca tcgccatggg
2221 tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac agttcggctg
2281 gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg gcttccatcc
2341 gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat
2401 caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg gcaggagcaa
2461 ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag tcccttcccg
2521 cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc agccacgata
2581 gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc ttgacaaaaa
2641 gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct
2701 gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa cctgcgtgca
2761 atccatcttg ttcaatccac atgatcagat ctctaggcgc gtgggtgcgg acgtagtcag
2821 cgccattgcc gatcgcgtga agttccgccg caaggccgct ggacccagat cctttacagg
2881 aaggccaacg gtggcgccca agaaggattt ccgcgacacc gagaccaata gcggaagccc
2941 caacgccgac ttcagctttt gaaggttcga cagcacgtgc agcgatgttt ccggtgcggg
3001 gctcaagaaa aatcccatcc ccggatcgag gatgagccgg tcggcagcga ccccgctccg
3061 tcgcaaggcg gaaacccgcg cctcgaagaa ccgcacaatc tcgtcgagcg cgtcttcggg
3121 tcgaaggtga ccggtgcggg tggcgatgcc atcccctgcg ctgagtgcat aaccaccagc
3181 ctgcagtccg cctcagcaat atcgggatag agcgcagggt caggaaatcc ttggatatcg
3241 ttcaggtagc ccacgccgcg cttgagcgct agcgcgggtt tccggttgga agctgtcgat
3301 tgaaacacgg tgcatctgat cggacagggc gtctaagagc ggcgcaatac gtctgatctc
3361 atcggccggc gatacaggcc tcgcgtccgg atggctggcg gccggtccga catccacgac
3421 gtctgatccg actcgcagca tttcgaccgc cgcggtgaca gcgttggtgg ggtctagcag
3481 tacgtcaatc gaagaaggag tcctcggtga gattcagaat gccgaacacc gtcaccatgg
3541 cgtcggcctc cgcagcgact tccacgatgg ggatcgggcg agcaaaaagg cagcaattat
3601 gagccccata cctacaaagc cccacgcatc aagcttttga ccctgaagca actaggcaat
3661 ggctgtaatt atgacgacgc cgagtcccga accagactgc ataagcaaca accgacagaa
3721 tggatttcga aaccagagaa agaaaataaa tgcgatgcca taaccgatta tgaacaacgg
3781 cggaaggggc aagcttagta aatgcctcgc tagattttaa tgcggatgtt gcgattactt
3841 cgccaactat tgcgataaca agaaaaagcc agcctttcat gatatatctc ccaatttgtg
3901 tagggcttat tatgcacgct taaaaataat aaaagcagac ttgacctgat agtttggctg
3961 tgagcaatta tgtgcttagt gcatctaatc gcttgagtta acgccggcga agcggcgtcg
4021 gcttgaacga attgttagac attatttgcc gactaccttg gtgatctcgc ctttcacgta
4081 gtggacaaat tcttccaact gatctgcgcg cgaggccaag cgatcttctt cttgtccaag
4141 ataagcctgt ctagcttcaa gtatgacggg ctgatactgg gccggcaggc gctccattgc
4201 ccagtcggca gcgacatcct tcggcgcgat tttgccggtt actgcgctgt accaaatgcg
4261 ggacaacgta agcactacat ttcgctcatc gccagcccag tcgggcggcg agttccatag
4321 ocgttaaggt ttcatttagc gcctcaaata gatcctgttc aggaaccgga tcaaagagtt
4381 cctccgccgc tggacctacc aaggcaacgc tatgttctct tgcttttgtc agcaagatag
4441 ccagatcaat gtcgatcgtg gctggctcga agatacctgc aagaatgtca ttgcgctgcc
4501 attctccaaa ttgcagttcg cgcttagctg gataacgcca cggaatgatg tcgtcgtgca
4561 caacaatggt gacttctaca gcgcggagaa tctcgctctc tccaggggaa gccgaagttt
4621 ccaaaaggtc gttgatcaaa gctcgccgcg ttgtttcatc aagccttacg gtcaccgtaa
4681 ccagcaaatc aatatcactg tgtggcttca ggccgccatc cactgcggag ccgtacaaat
4741 gtacggccag caacgtcggt tcgagatggc gctcgatgac gccaactacc tctgatagtt
4801 gagtcgatac ttcggcgatc accgcttccc tcatgatgtt taactttgtt ttagggcgac
4861 tgccctgctg cgtaacatcg ttgctgctcc ataacatcaa acatcgaccc acggcgtaac
4921 gcgcttgctg cttggatgcc cgaggcatag actgtacccc aaaaaaacag tcataacaag
4981 ccatgaaaac cgccactgcg ccgttaccac cgctgcgttc ggtcaaggtt ctggaccagt
5041 tgcgtgaggc catacgctac ttgcattaca gcttacgaac cgaacaggct tatgtccact
5101 gggttcgtgc cttcatccgt ttccacggtg tgcgtcaccc ggcaaccttg ggcagcagcg
5161 aagtcgaggc atttctgtcc tggctggcga acgagcgcaa ggtttcggtc tccacgcatc
5221 gtcaggcatt ggcggccttg ctgttcttct acggcaagtg ctgtgcacgg atctgccctg
5281 gcttcaggag atcggaagac ctcggccgtc cgggcgcttg ccggtggtgc tgaccccgga
5341 tgaagtggtt cgcatcctcg gttttctgga aggcgagcat cgtttgttcg cccagcttct
5401 gtatggaacg ggcatgcgga tcagtgaggg tttgcaactg cgggtcaagg actggatttc
5461 gatcacggca cgatcatcgt gcgggagggc aagggctcca aggatcgggc cttgatgtta
5521 cccgagagct tggcacccag cctgcgcgag cagctgtctc gtgcacgggc atggtggctg
5581 aaggactagg ccgagggccg cagcggcgtt gcgcttcccg acgcccttga gcggaagtat
5641 ccgcgcgccg ggcattcctg gccgtggttc tgggtttttg cgcagcacac gcattcgacc
5701 gatccacgga gcggtgtcgt gcgtcgccat cacatgtatg accagacctt tcagcgcgcc
5761 ttcaaacgtg ccgtagaaca agcaggcatc acgaagcccg ccacaccgca caccctccgc
5821 cactcgttcg cgacggcctt gctccgcagc ggttacgaca ttcgaaccgt gcaggatctg
5881 ctcggccatt ccgacgtctc tacgacgatg atttacacgc atgtgctgaa agttggcggt
5941 gccggagtgc gctcaccgct tgatgcctgc cgcccctcac tgtgagaggt agggcagcgc
6001 aagtcaatcc tagcggattc actacccctg cgcgaaggcc atcggtgccg catcgaacgg
6061 ccggttgcgg aaagtcctcc ctgcgtccgc tgatggccgg cagcagcccg tcgttgaagg
6121 atccctgaaa gcgacgttgg atgttaacat ctacaaattg ccttttctta cgaccatgta
6181 cgtaagcgct tacgtttttg gtggaccctt gaggaaactg gtagctgttg tgggcctgtg
6241 gtctcaagat ggatcattaa tttccacctt cacctacgat ggggggcatc gcaccggtga
6301 gtaatattgt acggctaaga gcgaatttgg cctgtagacc tcaattgcga gctttctaat
6361 ttcaaactat tcgggcctaa cttttggtgt gatgatgctg actggcagga tatataccgt
6421 tgtaat
//
LOCUS TOBPRMMG 200 bp ds-DNA PLN 09-AUG-1990
DEFINITION N.tabacum promoter activating a promoterless nptII marker gene.
ACCESSION M34757
KEYWORDS .
SOURCE N.tabacum (strain SR1) DNA.
ORGANISM Nicotiana tabacum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 200)
AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and
Depicker,A.
TITLE Cloning and sequence analysis of truncated T-DNA inserts from
Nicotiana tabacum
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.R.Gheysen, 01-JUN-1990.
FEATURES from to/span description
pept 198 > 200 ORF
mRNA 130 > 200 mRNA (5' end +/- 2 bp)
recomb 193 194 T-DNA end/plant DNA start
signal 25 30 CAAT box
signal 47 55 CAAT box
signal 101 107 TATA box
BASE COUNT 64 a 53 c 32 g 51 t
ORIGIN
1 caagcctcgc tagtcaaaag tgtaccaaac aacgctttac agcaagaacg gaaatgcgcg
61 tgacgctcgc ggtgacgcca tttcgccttt tcagaaatgg ataaatagcc ttgcttccta
121 ttatatcttc ccaaattacc aatacattac actagcatct gaatttcata accaatctcg
181 atacaccaaa tcggatcatg
//GenBank-Updates@genbank.bio.net (08/10/90)
LOCUS HUMMPRCI 9081 bp ss-mRNA PRI 10-AUG-1990
DEFINITION Human cation-independent mannose 6-phosphate receptor mRNA,
complete cds.
ACCESSION J03528
KEYWORDS mannose 6-phosphate receptor.
SOURCE Human placenta, cDNA to mRNA (library of J.E.Sadler), clones
PP[4,112,188), RB3 and RC5.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 9081)
AUTHORS Oshima,A., Nolan,C.M., Kyle,J.W., Grubb,J.H. and Sly,W.S.
TITLE The human cation-independent mannose 6-phophate receptor: Cloning
and sequence of the full-length cDNA and expression of functional
receptor in COS cells
JOURNAL J. Biol. Chem. 263, 2553-2562 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by R.Miller, 04-JAN-1988.
FEATURES from to/span description
pept 148 7623 mannose 6-phosphate receptor precursor
/hgml_locus_uid="LY0166H" /nomgen="M6PR"
/map="12"
sigp 148 267 mannose 6-phosphate receptor signal peptide
matp 268 7620 mannose 6-phosphate receptor
mRNA < 1 9081 MPR-CI mRNA
BASE COUNT 2243 a 2233 c 2476 g 2129 t
ORIGIN Chromosome 12; 475 bp upstream of EcoRI site.
1 gagcccagtc gagccgcgct cacctcgggc tcccgctccg tctccacctc cgcctttgcc
61 ctggcggcgc gaccccgtcc cgggcgcggc cccagcagtc gcgcgccgtt agcctcgcgc
121 ccgccgcgca gctccgggcc cggcgcgatg ggggccgccg ccggccggag cccccacctg
181 gggcccgcgc ccgcccgccg cccgcagcgc tctctgctcc tgctgcagct gctgctgctc
241 gtcgctgccc cggggtccac gcaggcccag gccgccccgt tccccgagct gtgcagttat
301 acatgggaag ctgttgatac caaaaataat gtactttata aaatcaacat ctgtggaagt
361 gtggatattg tccagtgcgg gccatcaagt gctgtttgta tgcacgactt gaagacacgc
421 acttatcatt cagtgggtga ctctgttttg agaagtgcaa ccagatctct cctggaattc
481 aacacaacag tgagctgtga ccagcaaggc acaaatcaca gagtccagag cagcattgcc
541 ttcctgtgtg ggaaaaccct gggaactcct gaatttgtaa ctgcaacaga atgtgtgcac
601 tactttgagt ggaggaccac tgcagcctgc aagaaagaca tatttaaagc aaataaggag
661 gtgccatgct atgtgtttga tgaagagttg aggaagcatg atctcaatcc tctgatcaag
721 cttagtggtg cctacttggt ggatgactcc gatccggaca cttctctatt catcaatgtt
781 tgtagagaca tagacacact acgagaccca ggttcacagc tgcgggcctg tccccccggc
841 actgccgcct gcctggtaag aggacaccag gcgtttgatg ttggccagcc ccgggacgga
901 ctgaaggtgg tgcgcaagga caggcttgtc ctgagttacg tgagggaaga ggcaggaaag
961 ctagactttt gtgatggtca cagccctgcg gtgactatta catttgtttg cccgtcggag
1021 cggagagagg gcaccattcc caaactcaca gctaaatcca actgccgcta tgaaattgag
1081 tggattactg agtatgcctg ccacagagat tacctggaaa gtaaaacttg ttctctgagc
1141 ggcgagcagc aggatgtctc catagacctc acaccacttg cccagagcgg aggttcatcc
1201 tatatttcag atggaaaaga atatttgttt tatttgaatg tctgtggaga aactgaaata
1261 cagttctgta ataaaaaaca agctgcagtt tgccaagtga aaaagagcga tacctctcaa
1321 gtcaaagcag caggaagata ccacaatcag accctccgat attcggatgg agacctcacc
1381 ttgatatatt ttggaggtga tgaatgcagc tcagggtttc agcggatgag cgtcataaac
1441 tttgagtgca ataaaaccgc aggtaacgat gggaaaggaa ctcctgtatt cacaggggag
1501 gttgactgca cctacttctt cacatgggac acggaatacg cctgtgttaa ggagaaggaa
1561 gacctcctct gcggtgccac cgacgggaag aagcgctatg acctgtccgc gctggtccgc
1621 catgcagaac cagagcagaa ttgggaagct gtggatggca gtcagacgga aacaaagaag
1681 aagcattttt tcattaatat ttgtcacaga gtgctgcagg aaggcaaggc acgagggtgt
1741 cccgaggacg cggcagtgtg tgcagtggat aaaaatggaa gtaaaaatct gggaaaattt
1801 atttcctctc ccatgaaaga gaaaggaaac attcaactct cttattcaga tggtgatgat
1861 tgtggtcatg gcaagaaaat taaaactaat atcacacttg tatgcaagcc aggtgatctg
1921 gaaagtgcac cagtgttgag aacttctggg gaaggcggtt gcttttatga gtttgagtgg
1981 cacacagctg cggcctgtgt gctgtctaag acagaagggg agaactgcac ggtctttgac
2041 tcccaggcag ggttttcttt tgacttatca cctctcacaa agaaaaatgg tgcctataaa
2101 gttgagacaa agaagtatga cttttatata aatgtgtgtg gcccggtgtc tgtgagcccc
2161 tgtcagccag actcaggagc ctgccaggtg gcaaaaagtg atgagaagac ttggaacttg
2221 ggtctgagta atgcgaagct ttcatattat gatgggatga tccaactgaa ctacagaggc
2281 ggcacgccct ataacaatga aagacacaca ccgagagcta cgctcatcac ctttctctgt
2341 gatcgagacg cgggagtggg cttccctgaa tatcaggaag aggataactc cacctacaac
2401 ttccggtggt acaccagcta tgcctgcccg gaggagcccc tggaatgcgt agtgaccgac
2461 ccctccacgc tggagcagta cgacctctcc agtctggcaa aatctgaagg tggccttgga
2521 ggaaactggt atgccatgga caactcaggg gaacatgtca cgtggaggaa atactacatt
2581 aacgtgtgtc ggcctctgaa tccagtgccg ggctgcaacc gatatgcatc ggcttgccag
2641 atgaagtatg aaaaagatca gggctccttc actgaagtgg cttccatcag taacttggga
2701 atggcaaaga ccggcccggt ggttgaggac agcggcagcc tccttctgga atacgtgaat
2761 gggtcggcct gcaccaccag cgatggcaga cagaccacat ataccacgag gatccatctc
2821 gtctgctcca ggggcaggct gaacagccac cccatctttt ctctcaactg ggagtgtgtg
2881 gtcagtttcc tgtggaacac agaggctgcc tgtcccattc agacaacgac ggatacagac
2941 caggcttgct ctataaggga tcccaacagt ggatttgtgt ttaatcttaa tccgctaaac
3001 agttcgcaag gatataacgt ctctggcatt ggtaagattt ttatgtttaa tgtctgcggc
3061 acaatgcctg tctgtgggac catcctggga aaacctgctt ctggctgtga ggcagaaacc
3121 caaactgaag agctcaagaa ttggaagcca gcaaggccag tcggaattga gaaaagcctc
3181 cagctgtcca cagagggctt catcactctg acctacaaag ggcctctctc tgccaaaggt
3241 accgctgatg cttttatcgt ccgctttgtt tgcaatgatg atgtttactc agggcccctc
3301 aaattcctgc atcaagatat cgactctggg caagggatcc gaaacactta ctttgagttt
3361 gaaaccgcgt tggcctgtgt tccttctcca gtggactgcc aagtcaccga cctggctgga
3421 aatgagtacg acctgactgg cctaagcaca gtcaggaaac cttggacggc tgttgacacc
3481 tctgtcgatg ggagaaagag gactttctat ttgagcgttt gcaatcctct cccttacatt
3541 cctggatgcc agggcagcgc agtggggtct tgcttagtgt cagaaggcaa tagctggaat
3601 ctgggtgtgg tgcagatgag tccccaagcc gcggcgaatg gatctttgag catcatgtat
3661 gtcaacggtg acaagtgtgg gaaccagcgc ttctccacca ggatcacgtt tgagtgtgct
3721 cagatatcgg gctcaccagc atttcagctt caggatggtt gtgagtacgt gtttatctgg
3781 agaactgtgg aagcctgtcc cgttgtcaga gtggaagggg acaactgtga ggtgaaagac
3841 ccaaggcatg gcaacttgta tgacctgaag cccctgggcc tcaacgacac catcgtgagc
3901 gctggcgaat acacttatta cttccgggtc tgtgggaagc tttcctcaga cgtctgcccc
3961 acaagtgaca agtccaaggt ggtctcctca tgtcaggaaa agcgggaacc gcagggattt
4021 cacaaagtgg caggtctcct gactcagaag ctaacttatg aaaatggctt gttaaaaatg
4081 aacttcacgg ggggggacac ttgccataag gtttatcagc gctccacagc catcttcttc
4141 tactgtgacc gcggcaccca gcggccagta tttctaaagg agacttcaga ttgttcctac
4201 ttgtttgagt ggcgaacgca gtatgcctgc ccacctttcg atctgactga atgttcattc
4261 aaagatgggg ctggcaactc cttcgacctc tcgtccctgt caaggtacag tgacaactgg
4321 gaagccatca ctgggacggg ggacccggag cactacctca tcaatgtctg caagtctctg
4381 gccccgcagg ctggcactga gccgtgccct ccagaagcag ccgcgtgtct gctgggtggc
4441 tccaagcccg tgaacctcgg cagggtaagg gacggacctc agtggagaga tggcataatt
4501 gtcctgaaat acgttgatgg cgacttatgt ccagatggga ttcggaaaaa gtcaaccacc
4561 atccgattca cctgcagcga gagccaagtg aactccaggc ccatgttcat caacgccgtg
4621 gaggactgtg agtacacctt tgcctggccc acagccacag cctgtcccat gaagagcaac
4681 gagcatgatg actgccaggt caccaaccca agcacaggac acctgtttga tctgagctcc
4741 ttaagtggca gggcgggatt cacagctgct tacagcgaga aggggttggt ttacatgagc
4801 atctgtgggg agaatgaaaa ctgccctcct ggcgtggggg cctgctttgg acagaccagg
4861 attagcgtgg gcaaggccaa caagaggctg agatacgtgg accaggtcct gcagctggtg
4921 tacaaggatg ggtccccttg tccctccaaa tccggcctga gctataagag tgtgatcagt
4981 ttcgtgtgca ggcctgaggc cgggccaacc aataggccca tgctcatctc cctggacaag
5041 cagacatgca ctctcttctt ctcctggcac acgccgctgg cctgcgagca agcgaccgaa
5101 tgttccgtga ggaatggaag ctctattgtt gacttgtctc cccttattca tcgcactggt
5161 ggttatgagg cttatgatga gagtgaggat gatgcctccg ataccaaccc tgatttctac
5221 atcaatattt gtcagccact aaatcccatg cacggagtgc cctgtcctgc cggagccgct
5281 gtgtgcaaag ttcctattga tggtcccccc atagatatcg gccgggtagc aggaccacca
5341 atactcaatc caatagcaaa tgagatttac ttgaattttg aaagcagtac tccttgctta
5401 gcggacaagc atttcaacta cacctcgctc atcgcgtttc actgtaagag aggtgtgagc
5461 atgggaacgc ctaagctgtt aaggaccagc gagtgcgact ttgtgttcga atgggagact
5521 cctgtcgtct gtcctgatga agtgaggatg gatggctgta ccctgacaga tgagcagctc
5581 ctctacagct tcaacttgtc cagcctttcc acgagcacct ttaaggtgac tcgcgactcg
5641 cgcacctaca gcgttggggt gtgcaccttt gcagtcgggc cagaacaagg aggctgtaag
5701 gacggaggag tctgtctgct ctcaggcacc aagggggcat cctttggacg gctgcaatca
5761 atgaaactgg attacaggca ccaggatgaa gcggtcgttt taagttacgt gaatggtgat
5821 cgttgccctc cagaaaccga tgacggcgtc ccctgtgtct tccccttcat attcaatggg
5881 aagagctacg aggagtgcat catagagagc agggcgaagc tgtggtgtag cacaactgcg
5941 gactacgaca gagaccacga gtggggcttc tgcagacact caaacagcta ccggacatcc
6001 agcatcatat ttaagtgtga tgaagatgag gacattggga ggccacaagt cttcagtgaa
6061 gtgcgtgggt gtgatgtgac atttgagtgg aaaacaaaag ttgtctgccc tccaaagaag
6121 ttggagtgca aattcgtcca gaaacacaaa acctacgacc tgcggctgct ctcctctctc
6181 accgggtcct ggtccctggt ccacaacgga gtctcgtact atatgaatct gtgccagaaa
6241 atatataaag ggcccctggg ctgctctgaa agggccagca tttgcagaag gaccacaact
6301 ggagacgtcc aggtcctggg actcgttcac acgcagaagc tgggtgtcat aggtgacaaa
6361 gttgttgtca tgtactccaa aggttatccg tgtggtggaa ataagaccgc atcctccgtg
6421 atagaattga cctgtacaaa gacggtgggc agacctgcat tcaagaggtt tgatatcgac
6481 agctgcactt actacttcag ctgggactcc cgggctgcct gcgccgtgaa gcctcaggag
6541 gtgcagatgg tgaatgggac catcaccaac cctataaatg gcaagagctt cagcctcgga
6601 gatatttatt ttaacctgtt cagagaatct ggggacatga ggaccaatgg ggacaactac
6661 ctgtatgaga tcctactttc ctccatcaca agctccagaa acccggcgtg ctctggagcc
6721 aacatatgcc aggtgaagcc caacgatcag cacttcagtc ggaaagttgg aacctctgac
6781 aagaccaagt actaccttca agacggcgat ctcgatgtcg tgtttgcctc ttcctctaag
6841 tgcggaaagg ataagaccaa gtctgtttct tccaccatct tcttccactg tgaccctctg
6901 gtggaggacg ggatccccga gttcagtcac gagactgccg actgccagta cctcttctct
6961 tggtacacct cagccgtgtg tcctctgggg gtgggctttg acagcgagaa tcccggggac
7021 gacgggcaga tgcacaaggg gctgtcagaa cggagccagg cagtcggcgc ggtgctcagc
7081 ctgctgctgg tggcgctcac ctgctgcctg ctggccctgt tgctctacaa gaagaagagg
7141 agggaaacaa tgataagtaa gctgacctct tgctgtagga gaagttccaa cgtgtcctac
7201 aaatactcaa aggtgaataa ggaagaagag acagatgaga atgaaacaga gtggctgatg
7261 gaagagatcc agctgcctcc tccacggcag ggaaaggaag ggcaggagaa cggccatatt
7321 accaccaagt cagtgaaagc cctcagctcc ctgcatgggg atgaccagga cagtacggat
7381 gaggttctga ccatcccaga ggtgaaagtt cactcgggca ggggagctgg ggcagagagc
7441 tcccacccag tgagaaacgc acagagcaat gcccttcagg agcgtgagga cgatagggtg
7501 gggctggtca ggggtgagaa ggcgaggaaa gggaagtcca gctctgcaca gcagaagaca
7561 gtgagctcca ccaagctggt gtccttccat gacgacagcg acgaggacct cttacacatc
7621 tgactccgca gtgcctgcag gggagcacgg agccgcggga cagccaagca cctccaacca
7681 aataagactt ccactcgatg atgcttctat aattttgcct ttaacagaaa ctttcaaaag
7741 ggaagagttt ttgtgatggg ggagagggtg aaggaggtca ggccccactc cttcctgatt
7801 gtttacagtc attggaataa ggcatggctc agatcggcca cagggcggta ccttgtgccc
7861 agggttttgc cccaagtcct catttaaaag cataaggccg gacgcatctc aaaacagagg
7921 gctgcattcg aagaaaccct tgctgcttta gtcccgatag gtatttgacc ccgatatatt
7981 ttagcatttt aattctctcc ccctatttat tgactttgac aattactcag gtttgagaaa
8041 aaggaaaaaa aaacagccac cgtttcttcc tgccagcagg ggtgtgatgt accagtttgt
8101 ccatcttgag atggtgaggc tgtcagtgta tggggcagct tccggcggga tgttgaactg
8161 gtcattaatg tgtcccctga gttggagctc attctgtctc ttttctcttt tgctttctgt
8221 ttcttaaggg cacacacacg tgcgtgcgag cacacacaca tacgtgcaca gggtccccga
8281 gtgcctaggt tttggagagt ttgcctgttc tatgccttta gtcaggaatg gctgcacctt
8341 tttgcatgat atcttcaagc ctgggcgtac agagcacatt tgtcagtatt tttgccggct
8401 ggtgaattca acctgcccaa agattgattt gtgtgtttgt gtgtgtgtgt gtgtgtgtgt
8461 gtgtgtgtgt gagtggagtt gaggtgtcag agaaaatgaa ttttttccag atttggggta
8521 taggtctcat ctcttcaggt tctcatgata ccacctttac tgtgcttatt tttttaagaa
8581 aaaagtgttg atcaaccatt cgacctataa gaagccttaa tttgcacagt gtgtgactta
8641 cagaaactgc atgaaaaatc atgggccaga gcctcggccc tagcattgca cttggcctca
8701 tgctggaggg aggctgggcg ggtacagcgc ggaggaggag ggaggccagg cgggcatggc
8761 gtggaggagg tgggaggccg ggcggtcaca gcatggagga ggagggaggc gctgctggtg
8821 ttcttattct ggcggcagcg cctttcctgc catgtttagt gaatgacttt tctcgcattg
8881 tagaattgta tatagactct ggtgttctat tgctgagaag caaaccgccc tgcagcatcc
8941 ctcagcctgt accggtttgg ctggcttgtt tgatttcaac atgagtgtat tttttaaaat
9001 tgatttttct cttcattttt ttttcaatca actttactgt aatataaagt attcaacaat
9061 ttcaataaaa gataaattat t
//
LOCUS HUMMRA 5185 bp ss-mRNA PRI 10-AUG-1990
DEFINITION Human mannose receptor mRNA, complete cds.
ACCESSION J05550
KEYWORDS mannose receptor.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 5185)
AUTHORS Taylor,M.E., Conary,J.T., Lennartz,M.R., Stahl,P.D. and
Drickamer,K.
TITLE Primary structure of the mannose receptor contains multiple motifs
resembling carbohydrate-recognition domains
JOURNAL J. Biol. Chem. 265, 12156-12162 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Drickamer, 18-MAY-1990.
FEATURES from to/span description
pept 104 4474 mannose receptor precursor
sigp 104 157 mannose receptor signal peptide
matp 158 4471 mannose receptor
BASE COUNT 1602 a 999 c 1198 g 1386 t
ORIGIN
1 gggaacttgg attaggtgga gaggcagttg gggggcctcg ttgttttgcg tcttagttcc
61 gccctcctgt ccatcaggag aaggaaagga taaaccctgg gccatgaggc tacccctgct
121 cctggttttt gcctctgtca ttccgggtgc tgttctccta ctggacacca ggcaattttt
181 aatctataat gaagatcaca agcgctgcgt ggatgcagtg agtcccagtg ccgtccaaac
241 cgcagcttgc aaccaggatg ccgaatcaca gaaattccga tgggtgtccg aatctcagat
301 tatgagtgtt gcatttaaat tatgcctggg agtgccatca aaaacagact gggttgctat
361 cactctctat gcctgtgact caaaaagtga atttcagaaa tgggagtgca aaaatgacac
421 acttttgggg atcaaaggag aagatttatt ttttaactac ggcaacagac aagaaaagaa
481 tattatgctc tacaagggat cgggtttatg gagcaggtgg aagatctatg gaaccacaga
541 caatctgtgc tccagaggtt atgaagccat gtatacgcta ctaggcaatg ccaatggagc
601 aacctgtgca ttcccgttca agtttgaaaa caagtggtac gcagattgca cgagtgctgg
661 gcggtcggat ggatggctct ggtgcggaac cactactgac tatgacacag acaagctatt
721 tggatattgt ccattgaaat ttgagggcag tgaaagctta tggaataaag acccgctgac
781 cagcgtttcc taccagataa actccaaatc cgctttaacg tggcaccaag cgaggaaaag
841 ctgccaacaa cagaacgctg agctcctgag catcacagag atacatgagc aaacatacct
901 gacaggatta accagttcct tgacctcagg actctggatt ggacttaaca gtctgagctt
961 caacagcggt tggcagtgga gtgaccgcag tcctttccga tatttgaact ggttaccagg
1021 aagtccatca gctgaacctg gaaaaagctg tgtgtcacta aatcctggaa aaaatgctaa
1081 atgggaaaat ctggaatgtg ttcagaaact gggctatatt tgcaaaaagg gcaacaccac
1141 tttaaattct tttgttattc cctcagaaag tgatgtgcct actcactgtc ctagtcagtg
1201 gtggccgtat gccggtcact gttacaagat tcacagagat gagaaaaaaa tccagaggga
1261 tgctctgacc acctgcagga aggaaggcgg tgacctcaca agtatccaca ccatcgagga
1321 attggacttt attatctccc agctaggata tgagccaaat gacgaattgt ggatcggctt
1381 aaatgacatt aagattcaaa tgtactttga gtggagtgat gggacccctg taacgtttac
1441 caaatggctt cgtggagaac caagccatga aaacaacaga caggaggatt gtgtggtgat
1501 gaaaggcaag gatgggtact gggcagatcg gggctgtgag tggcctcttg gctacatctg
1561 caagatgaaa tcacgaagcc aaggtccaga aatagtggaa gtcgaaaaag gctgcaggaa
1621 aggctggaaa aaacatcact tttactgcta tatgattgga catacgcttt caacatttgc
1681 agaagcaaac caaacctgta ataatgagaa tgcttattta acaactattg aagacagata
1741 tgaacaagcc ttcctgacta gtttcgttgg cttaaggcct gaaaaatatt tctggacagg
1801 actttcagat atacaaacca aagggacttt tcagtggacc atcgaggaag aggttcggtt
1861 cacccactgg aattcagata tgccagggcg aaagccaggg tgtgttgcca tgagaaccgg
1921 gattgcaggg ggcttatggg atgttttgaa atgtgatgaa aaggcaaaat ttgtgtgcaa
1981 gcactgggca gaaggagtaa cccacccacc gaagcccacg acgactcccg aacccaaatg
2041 tccggaggat tggggcgcca gcagtagaac aagcttgtgt ttcaagctgt atgcaaaagg
2101 aaaacatgag aagaaaacgt ggtttgaatc tcgagatttt tgtcgagctc tgggtggaga
2161 cttagctagc atcaataaca aagaggaaca gcaaacaata tggcgattaa taacagctag
2221 tggaagctac cacaaactgt tttggttggg attgacatat ggaagccctt cagaaggttt
2281 tacttggagt gatggttctc ctgtttcata tgaaaactgg gcttatggag aacctaataa
2341 ttatcaaaat gttgaatact gtggtgagct gaaaggtgac cctactatgt cttggaatga
2401 tattaattgt gaacacctta acaactggat ttgccagata caaaaaggac aaacaccaaa
2461 acctgagcca acaccagctc ctcaagacaa tccaccagtt actgaagatg ggtgggttat
2521 ttacaaagac taccagtatt atttcagcaa agagaaggaa accatggaca atgcgcgagc
2581 gttttgcaag aggaattttg gtgatcttgt ttctattcaa agtgaaagtg aaaagaagtt
2641 tctatggaaa tatgtaaaca gaaatgatgc acagtctgca tattttattg gtttattgat
2701 cagcttggat aaaaagtttg cttggatgga tggaagcaaa gtggattacg tgtcttgggc
2761 cacaggtgaa cccaattttg caaatgaaga tgaaaactgt gtgaccatgt attcaaattc
2821 agggttttgg aatgacatta actgtggcta tccaaacgcc ttcatttgcc agcgacataa
2881 cagtagtatc aatgctacca cagttatgcc taccatgccc tcggtcccat cagggtgcaa
2941 ggaaggttgg aatttctaca gcaacaagtg tttcaaaatc tttggattta tggaagaaga
3001 aagaaaaaat tggcaagagg cacgaaaagc ttgtataggc tttggaggga atctggtctc
3061 catacaaaat gaaaaagagc aagcatttct tacctatcac atgaaggact ccactttcag
3121 tgcctggact gggctgaatg atgtcaattc agaacacacg ttcctttgga cggatggacg
3181 aggagtccat tacacaaact gggggaaagg ttaccctggt ggaagaagaa gcagtctttc
3241 ttatgaagat gctgactgtg ttgttattat tggaggtgca tcaaatgaag caggaaaatg
3301 gatggatgat acctgcgaca gtaaacgagg ctacatatgc cagacacgat ccgacccttc
3361 cttgactaat cctccagcaa cgattcaaac agatggcttt gttaaatatg gcaaaagcag
3421 ctattcactc atgagacaaa aatttcaatg gcatgaagcg gagacatact gcaagcttca
3481 caattccctt atagccagca ttctggatcc ctacagtaat gcatttgcgt ggctgcagat
3541 ggaaacatct aatgaacgtg tgtggatcgc cctgaacagt aacttgactg ataatcaata
3601 cacttggact gataagtgga gggtgaggta cactaactgg gctgctgatg agcccaaatt
3661 gaaatcagca tgtgtttatc tggatcttga tggctactgg aagacagcac attgcaatga
3721 aagtttttac tttctctgta aaagatcaga tgaaatccct gctactgaac ccccacaact
3781 gcctggcaga tgcccggagt cagatcacac agcatggatt cctttccatg gtcactgtta
3841 ctatattgag tcctcatata caagaaactg gggccaagct tctctggaat gtcttcgaat
3901 gggttcctct ctggtttcca ttgaaagtgc tgcagaatcc agttttctgt catatcgggt
3961 tgagccactt aaaagtaaaa ccaatttttg gataggattg ttcagaaatg ttgaagggac
4021 gtggctgtgg ataaataaca gtccggtctc ctttgtcaac tggaacacag gagatccctc
4081 tggtgaacgg aatgattgtg tagctttaca tgcgtcttct gggttttgga gtaatattca
4141 ctgttcttcc tacaaaggat atatttgtaa aagaccaaaa attattgatg ctaaacctac
4201 tcatgaatta cttacaacaa aagctgacac aaggaagatg gacccttcta aaccgtcttc
4261 caacgtggcc ggagtagtca tcattgtgat cctcctgatt ttaacgggtg ctggccttgc
4321 cgcctatttc ttttataaga aaagacgtgt gcacctacct caagagggcg cctttgaaaa
4381 cactctgtat tttaacagtc agtcaagccc aggaactagt gatatgaaag atctcgtggg
4441 caatattgaa cagaatgaac actcggtcat ctagtacctc aatgcgattc tgagatattt
4501 gaatttcata aaattgtaac tgaaatttaa aatttttagt tcaatgtgat tgttttcttt
4561 aaaatgagta ctgaattgta ctggtctgtc cttttttcct ttgcctaatt gaagaaataa
4621 ttgcttgttt tctagcctgg caagatattt tcataaaaga gggataacaa tgctgattac
4681 taccttttaa aatattttag ataaatgcac agcaccacag caccacatct aagcattagt
4741 gatgggtagc tgatgtcagc ttcatgtgga ttttaagcac tctagaaaca atgaagcttc
4801 ttggcatatt ttaaggagct cccaaaatgt gttacctatt aaattgtaac tcagcaagta
4861 gaagaccatt tgaaaagtca ggtacaaatt tcctcaagtg gcataaaaat gtagtcagtt
4921 ttctctttta ccagttttta tttccactcc aattatttag aactttattt gtacatgtgc
4981 agaagaataa ggcagctgag aatcttgttt cccccaagag agttttacag gctgagtgtt
5041 gcaaatgtgt tctttgtcct gttatatgta tatcaggaat acaaggatgt gaaataaaac
5101 tgtaaatttg cataactgga tgtacttaga taatgtgaaa taaacattaa agacaaggtc
5161 tatttttaat aaaaaaaaaa aaaaa
//
LOCUS BFRACP 1788 bp ss-RNA PHG 10-AUG-1990
DEFINITION Bacteriophage fr maturation and coat protein genes, complete cds,
and replicase gene, 5' end.
ACCESSION M31635
KEYWORDS coat protein; maturation protein; replicase.
SOURCE Bacteriophage fr RNA.
ORGANISM Bacteriophage fr
Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses;
Leviviridae.
REFERENCE 1 (bases 1 to 1788)
AUTHORS Berzin,V., Avots,A., Jansone,I., Gintnere,L. and Tsimanis,A.
TITLE Sequence of the genes coding for the A-protein and coat protein of
bacteriophage fr
JOURNAL Nucleic Acids Res. 15, 6741-6741 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 121 1302 maturation protein (gtg start codon)
pept 1328 1720 coat protein
pept 1754 > 1788 replicase protein
BASE COUNT 443 a 454 c 473 g 418 t
ORIGIN
1 cccctttcgg ggtcctgctc gacttcctgt caagctaaat gccattttta atgtctttag
61 cgagacgcta ccatggctat cgctgtaggt agccgcaatt ccattgctag ggagcctcgt
121 gtgcgaaagt tcataccaac cgagagaatg agtaagagcc acgtggtctc cgtccgcgaa
181 tatgctgacg gggaactcga ggataactct ttaccgttga tttaccgatc caattggtcg
241 ccgggccagt acacaagtac gggtccgcgc acaaaggaat ggcactaccc atcgtcctac
301 tcacgtgggg cgataggaat caaggctctt gatcaaggta agtacgcacg attgggcaca
361 tcgtggggcc gcgagttcga agaacgtgcc ggttatggca tgtctatcga cgcacgtagt
421 tgttatagcc tattccccgt tagtcaaaac ttaacgtgga tagacgtacc aacgaacgta
481 gctaatcgcg ccacgactga ggtcttaggt aaggtcactc aaggtaattt taaccttggc
541 gtggcccttg ctgaagcccg gtctacggcc tcacaactgt cgacgcaaac catcgcgttg
601 attaaggcgt acaccgcggc tcgccgcggg aactggcgcc aggcactccg ctacttagcg
661 ctgaacgaga atcgaaaatt caattcgaag tccgtcgcaa gcaggtggct ggagttgcag
721 ttcgggtgga tgccgcttct cagcgatatc caaggtgcgt atgagatgct caccaaagtg
781 catcttaagg catttatgcc tatgcgtgcc gtgagtcaag tcggacaaaa cgtcagtttg
841 tctggccggc tcacttctcc ggctgcaagc tataagtcta cgtgcaacat atcacgacgc
901 attgtgatat ggttttacat aaacgatgca cgtctggctt ggctgtcctc cttagggatt
961 ttgaacccgc taggaatagt gtgggaaaag gtcccgttct ctttcctggt cgattggttg
1021 ctgccggttg ggaacatgct tgaggggctt accgccccga taggctgttc gtatcaatcg
1081 ggaacagtaa ccgacgtaat atcaggagag tcgacaataa tcgccgatga catctatggt
1141 tgggatacag tgcgacctgc aaccgctaag gtgcaaatca gtgctgtcca ccggggggta
1201 caaagcgtgt ggcccacaac gggcgtatac gttaagtcac ctttctcgat ggtccatacc
1261 ttagatgcct tggcactttt caggcaacgc ctctggaaat agaggcccta accgaaggga
1321 gagccacatg gcttcgaact ttgaagagtt cgttctcgtc gacaatggcg gaacgggaga
1381 tgtaaaagtc gctccgagca acttcgctaa cggggttgca gaatggatca gctcgaactc
1441 acgttctcag gcttacaaag tgacctgtag cgtgcgtcag agctctgcga acaatcggaa
1501 atacaccgtc aaggtcgagg tcccgaaagt ggcaactcag gtccaaggcg gcgttgagct
1561 tcctgttgcg gcgtggcgct cgtacatgaa tatggaatta actattccgg tattcgcgac
1621 gaacgacgac tgtgccttaa tcgttaaggc attgcaaggc acctttaaaa ctggtaaccc
1681 aattgcaaca gccatcgcag ccaactcggg aatctactaa gaaacccgtg ccattccaac
1741 atgaggaata cccatgtcaa aatcaacaaa gaagttcaac tctttatg
//
LOCUS HUMERCC1A 1098 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human excision repair protein (ERCC-1) mRNA, complete cds.
cDNA clone pcDE.
ACCESSION M28650
KEYWORDS alternative splicing; excision repair protein.
SOURCE Human, cDNA to mRNA, clone pcDE.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1098)
AUTHORS Hoeijmakers,J.H.J.
TITLE Characterization of genes and proteins involved in excision repair
of human cells
JOURNAL J. Cell Sci. Suppl. 6, 111-125 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 143 1036 excision repair protein
site 863 916 alternatively spliced out exon
BASE COUNT 243 a 352 c 312 g 191 t
ORIGIN
1 aagtgctgcg agccctgggc cacgctggcc gtgctggcag tgggccgcct cgatccctct
61 gcagtctttc ccttgaggct ccaagaccag caggtgaggc ctcgcggcgc tgaaaccgtg
121 aggcccggac cacaggctcc agatggaccc tgggaaggac aaagaggggg tgccccagcc
181 ctcagggccg ccagcaagga agaaatttgt gatacccctc gacgaggatg aggtccctcc
241 tggagtggcc aagcccttat tccgatctac acagagcctt cccactgtgg acacctcggc
301 ccaggcggcc cctcagacct acgccgaata tgccatctca cagcctctgg aaggggctgg
361 ggccacgtgc cccacagggt cagagcccct ggcaggagag acgcccaacc aggccctgaa
421 acccggggca aaatccaaca gcatcattgt gagccctcgg cagaggggca atcccgtact
481 gaagttcgtg cgcaacgtgc cctgggaatt tggcgacgta attcccgact atgtgctggg
541 ccagagcacc tgtgccctgt tcctcagcct ccgctaccac aacctgcacc cagactacat
601 ccatgggcgg ctgcagagcc tggggaagaa cttcgccttg cgggtcctgc ttgtccaggt
661 ggatgtgaaa gatccccagc aggccctcaa ggagctggct aagatgtgta tcctggccga
721 ctgcacattg atcctcgcct ggagccccga ggaagctggg cggtacctgg agacctacaa
781 ggcctatgag cagaaaccag cggacctcct gatggagaag ctagagcagg acttcgtctc
841 ccgggtgact gaatgtctga ccaccgtgaa gtcagtcaac aaaacggaca gtcagaccct
901 cctgaccaca tttggatctc tggaacagct catcgccgca tcaagagaag atctggcctt
961 atgcccaggc ctgggccctc agaaagcccg gaggctgttt gatgtcctgc acgagccctt
1021 cttgaaagta ccctgatgac cccagctgcc aaggaaaccc ccagtgtaat aataaatcgt
1081 cctcccaggc caggctca
//
LOCUS MZECPPETE 3916 bp ds-DNA ORG 10-AUG-1990
DEFINITION Maize chloroplast cytochrome b559 alpha- and beta-subunit (psbE and
F), cytochrome b6-f complex (petE), and photosystem II polypeptide,
complete cds, Pro-tRNA and Trp-tRNA.
ACCESSION J04502
KEYWORDS cytochrome; photosystem II protein; transfer RNA-Pro;
transfer RNA-Trp.
SOURCE Maize (strain FR9 cms + FR37) DNA, clone pZmc503.
ORGANISM Chloroplast Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE 1 (bases 2565 to 2725)
AUTHORS Haley,J. and Bogorad,L.
TITLE A 4-kDa maize chloroplast polypeptide associated with the
cytochrome b-6-f complex: Subunit 5, encoded by the chloroplast
petE gene
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1534-1538 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3916)
AUTHORS Haley,J. and Bogorad,L.
TITLE Expression of the maize chloroplast petE gene
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by L.Bogorad, 07-MAY-1989.
Author address: J.Haley
Department of Cellular and Developmental Biology
Harvard University Biological Laboratories
16 Divinity Ave
Cambridge, MA 02138
Ph: 617-495-4260
FEATURES from to/span description
pept 454 332 (c) ORF40
pept 696 580 (c) photosystem II protein (psbL)
pept 838 719 (c) cytochrome b559 beta subunit (psbF)
pept 1100 849 (c) cytochrome b559 alpha subunit (psbE)
pept 2337 2432 ORF31
pept 2596 2709 cytochrome b6-f complex subunit 5 (petE)
pept 3499 3627 ORF42
tRNA 2903 2830 (c) Trp-tRNA
tRNA 3115 3042 (c) Pro-tRNA
BASE COUNT 1242 a 684 c 658 g 1332 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatcctctc ccctctttct tgttgcttca taagagtgaa tccattttat gggcgaaagg
61 cgggggcttt aaatcgaccg atggtttgct tcactcacta acatcattaa caaacaaaag
121 aataaatgga gggattccaa ccatcagagc aaaggttttc tctttgttat ttttacaaat
181 agaaataggt aaccaatttc tagattatgg aacaaaaccg cgttataaca ataagaattc
241 cgcgggccct ttccgctcta atcagataaa ggggagtaag gacccgctaa gttcctactt
301 tttcatgttt acaatctgga tcctccgatt actatagaga tgaacccaat ccagaatacg
361 aaccgtaaaa gaaaaaacct attaaaccaa tcacaagaat accagttaca gtacctatca
421 gccaaagagg aattcttcca gtagtatcgg ccatttcccc tactttcctc cacattttct
481 caagtggtca tgctagagac aaaaacagtc atggatagtt ataaagatgg tatccttcca
541 aatgggataa gagaattctt actactctct ttctttctct caattgaaga agtaattgga
601 aaataaaaca gcaagtacaa aaatgagtaa taaaccccag tatagactgg tacgattcaa
661 ttcaacattt tgttcattcg ggtttgattg tgtcatagtt ctatagttgg aatttggttt
721 atcgttggat gaactgcatt gctgatattg atcccaagaa aaaaaccgtg ggtacagcta
781 atccgtgaac agccagccat cgcactgtaa aaataggata ggttcgatct atggtcattg
841 ggggcctcct aaaaggatct actaaattca tctagttgtt ctaaagaatc aaaacggtcg
901 gttattaatg gaattccttg tcggctttcc gtgaaatact cgtttggtcg aggacttcca
961 aacacgtcat aagctaaacc cgtactgaca aataaccaac ccgcaatgaa tagggaaggt
1021 atagtaatgc tatgaataac ccagtatcga atactggtaa taatatcagc aaaagaacgt
1081 tctcccgtgc ttccagacat gctgagctcc caaaattttt attcaaaaaa ggaattgatt
1141 ccgtaaaaga tgggatccac cagtaaatag aaaattactg atatttcatc cttgtgagat
1201 tgtcaatttt gtaccaaagg tgtattttga gtataccgaa ttagtatagc tatccttcct
1261 atggcacagc aatccagttt tgcttggtcc cgaaacagaa ttcctttttt ctcttttttg
1321 ttccttgtct ataggaaaat tacatgttat tcaaggcatc aatagaaccc cacaattttt
1381 tgggtcctac ttattttcat tgtcttcgga atagtagaat aatttaattt ggaatagcgg
1441 ccaagatctt gggaaaatct aagttaatga ttaataagat aaagaattta ggaaggatat
1501 tctcatattg acgcaataca aagataagta tatgcaaaat cgatcccttt ttagttaaag
1561 gaatttaatt ggttagcata aaataatatc taataaatag aaaatcgaat agcggataat
1621 ccgttatgaa agaaacggaa tacattcttt gaagaatcaa gattcgtaat caatccttgt
1681 cttgtttgtt ggattaggtc taattttctt gaccaaacag caagcatgga actttacaag
1741 aagaacttaa aagaaaagga taattgaagt aactttgaat ccacttttgt tggggttcaa
1801 aaaacgaata aaaataaata aaaaaaagta aatttaagga atagttccct ttttgagggg
1861 gccctcgggg gtcgtggaat gcttttcttc tcctcttatt ccatatggaa tacaatgagt
1921 taaaatttgc tccaaaaaga ggatgaaatc catcctatta aaaaagaaat aatccgaatc
1981 cgaaatagaa agaatttttt ttttttcaaa ttcaattctt tatttatctt ttattccaaa
2041 attctcccga aaatccaatt tcatttttca atggggttag atgatctagt tcttaatatt
2101 attagtttac ttaactgaca gattccacaa caaatctctt gattcggaat tagggactca
2161 tgttccgtct gatgaatcga ttttctcttt tttttacact tctgtatctc actctatctt
2221 gttttttagt attatctaaa ataaccgatg aattatgaat tttccataac ttaaacttag
2281 gtaagtgctt taccaacata tgtagtgtag taaaaaaaat ggaattgaac cccttcatgc
2341 ttactataac tagttatttc ggttttctac tggctgcttt aactataacc ccagctctat
2401 ttattagctt gaacaagata cgtcttattt gaaatgaatt gaatagaaga atctttcttt
2461 tggattcttg gtattctaga ctcttttcca cactaattac caattctttt cttggtcatt
2521 gagattcgtg ggtagtttag actattattt agagatagat cgtacctctt tttttatccc
2581 ctcgaacaaa tcgaaatgat tgaagttttt ctatttggaa tcgtcttagg cctaattcct
2641 attactttag cgggattatt cgtgactgcg tatttgcaat acaggcgtgg ggatcagttg
2701 gatctttgat tgagtaatat ttcttttttg attgacctcc tctctggtct ggaggaggtc
2761 aaattggagt tgcaattcta ctttgttttt ttttaagtta ttttagtttc gacataagat
2821 atatggaatc acgctctgta ggatttgaac ctacgacatc gggttttgga gacccgcgtt
2881 ctaccaaact gaactaagag cgctttcaaa acaaaaaaaa aaaagaaaat ccttttctat
2941 tcctaacgtg tctcacgtcc gtatagtatc cacaaattca agttataccc actttaatcg
3001 atctccccac tactgcccat aacgaagaga gaattaatag gtagggatga caggatttga
3061 acctgtgaca ttttgtaccc aaaacaaacg cgctaccaag ctgcgctaca tcccttttcc
3121 aaattgttgt acaatgccat tgtacacaat tcctttcttg ttttccacat cgtaattttc
3181 ttctatttct ttatccatat agaactttct tgtcatttct tgtttttggt ctcatataag
3241 tcaaggaagg ggtataaact aaaatccagt caaatttcac ctataaaaga aagactacta
3301 ttccttagta atgtatagga agaaggggtc atctttttta gggataggaa aatctcgtct
3361 atacggttca ttctatatat atataatatt tattttgttt ttttagttag gaatttcgcc
3421 taaacaaaag aaatacaaag gatcttgggc aagagtatct gatcatatat gtattccaat
3481 acggaaggag gattttcaat gcgggatata aaaacatatc tctctgtagc acccgtgcta
3541 agtactctat ggtttggggc tttagcaggt ttattgatag aaatcaatcg tttattccca
3601 gatgctttgt cattcccttt tttttaatta tagttattcc tatgcgagag atagaattct
3661 tcgtgacatg acgaaaattc cccctttttg aattcttttt tagtatatga agcaaaaaga
3721 aagaaaagat ggataaggat tgtattcttt aattatttct ctatgtttta ttacttaatt
3781 tacgaatttc aaaaattttg tattctattg gattggattt gttcgagaat tcgaagaatt
3841 acaacaaaat ctttagaaat cacattttta gttaggaact tctatggatt ttattcttct
3901 tcttttcttt ggatcc
//
LOCUS RATLOX 2672 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Rat aorta lysyl oxidase mRNA, complete cds.
ACCESSION J02903
KEYWORDS lysyl oxidase.
SOURCE Rat neonatal aorta, cDNA to mRNA, (library of Clontech), clones 7,
13, 8-1 and IIB.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2672)
AUTHORS Trackman,P.C., Pratt,A.M., Wolanski,A., Tang,S.-S., Offner,G.D.,
Troxler,R.F. and Kagan,H.M.
TITLE Cloning of rat aorta lysyl oxidase cDNA: Complete codons and
predicted amino acid sequence
JOURNAL Biochemistry 29, 4863-4870 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
P.C.Trackman, 07-FEB-1990.
FEATURES from to/span description
pept 287 1516 lysyl oxidase precursor
sigp 287 349 pot. lysyl oxidase signal peptide
matp 350 1513 lysyl oxidase
site 2252 2274 region of dyad symmetry
BASE COUNT 650 a 763 c 622 g 637 t
ORIGIN 192 bp upstream of TaqI site.
1 cttctacttc agacactgtg cgctctcccg gaccgtcgtg cgccgctccc cgtcgccttc
61 caggactggg aaaggggaga ggaggacggt gccacgtccg acggcctcct gggctggggg
121 cagggtctgc cgctcgccct tggcaccagt ccctgcgacc catccccgag cctcgccctc
181 ctcctccctg ctcgaagagg tctccctcct tcgcgggatc tgagtccctg tcttcatttt
241 tctcctagcc acgtccctcc ccgagaaggg acgagccggg agcatcatgc gtttcgcctg
301 gaccgtgctc tttctgggac agctgcagtt ctgtcccctt ctccgctgcg ccccgcaggc
361 cccgcgcgag cctcccgccg cccccggtgc ctggcgccag acaatccaat gggagaacaa
421 cgggcaggtg ttcagtctgt tgagcctggg ggcgcagtac cagcctcagc gacgccgcga
481 ctccagcgcc actgccccga gagccgacgc aacgctgcag cacagccacg cacgcccatt
541 ctgctgctgc gtgacaaccg cactgcctct gcccgtgcga ggactccaag cccatctggg
601 gtcgccgcgg gtcgtccccg gcccgcagcc cgccactggt tccaagttgg tttctcgccg
661 tcgggggccg gcgatggagc ctcaaggcgc gcagaaccgg actgcgtcgc cacagcctcc
721 gcagctcagt aatctgaggc cacccagcca cgtagatcgc atggtggcga cgacccctac
781 aatccctaca agtactccga cgacaacccc tattataact actatgacac ttatgagaga
841 ccggtccggg agcaggcacc gacctggata tggcaccggt tacttccagt acggtctccc
901 ggacctggta cccgatccct actacatcca ggcatccacg tacgtacaaa agatgtctat
961 gtacaacctg agatgcgctg cggaagaaaa ctgcctggcc agttcagcat atagggcgga
1021 tgtcagagac tatgaccaca gggtactgct acgatttcct cagagagtga aaaaccaagg
1081 gacgtctgac ttcttaccaa gccgcccccg ctactcctgg gagtggcaca gctgccacca
1141 acattaccac agcatggatg aattcagcca ctacgacctg ctggatgcca gcacacagag
1201 gagagtggcc gagggccaca aagcaagctt ctgtctggag gacacttcct gtgattatgg
1261 gtaccacaga cgatttgcct gtactgcaca cacacagggg ttgagtcccg gatgttatga
1321 tacttatgca gcagacatag actgccagtg gattgatatt acagatgtac aacccggaaa
1381 ttacattcta aaggtcagtg taaaccccag ctacctggtg cctgaatcag actacagtaa
1441 caatgtcgta cgctgtgaaa ttcgctacac aggacatcac gcctatgcct caggctgcac
1501 catttcaccg tattagaaag aagctcacct tcccaaagga tgaagcagta cctggtgttt
1561 ggacctatga aaaccgtaga ttagcttaag taggaagact tagatatttt aaaaggcaaa
1621 cggaaaaaca acaaagaagg ttttgtttgg actctttcac aacaaatcac ataactggat
1681 tttgagtgtt taaatcagca ttagattggc acattttaaa tacttattca tgttgcttta
1741 tgaagtaatg gtgtttcaat tctgtgggtg catagtgggc tctttcaaag aattctgaat
1801 ttcttacctt cttttgaaat tatagtgcaa aaagaagagg atattttaat gaatgagcca
1861 caatttgaac tgattacttt ctaaattgcc agacccatga gacaatgatg atgggtttgt
1921 atttgcctca acatagattc gctttttaaa aagggtgttc ctattgtata ggcaaaaatg
1981 gatacacttg gtgctgagga agggtcaaat actaactatt gttgtcacga aatataggtc
2041 tacagcagag agatggtgag tatatattca gatagttaca tccctatata aactatgttt
2101 acattttaga tgcttttctt tctgttaatt gcttaatctc actctgactt gaggtacaac
2161 ttctgttttg gaatgaatta gataattcca gattctggtt tgataattgt tgacattccc
2221 ccatgctact ttttctgagg gcagaaacgt ctaatgtgac gactcttcac attaccatta
2281 cgaggataca cagcacagcg aaatcattcc gatgacaggt gtgatagatg gagagctaac
2341 atgcaactgc cgagtgtttc actgttagcc agaactaagt cacttgcccc acacagcaat
2401 tacaccatga atctctaaca tcacaacctt ctttcaaata cccacggact catccatcct
2461 tccatccgtc atccatccat ccgtccgtcc gtccgtcctg actgcctagt gccactgtct
2521 ggctaggcac acccactatc aacctggttc acctgtcatg gcagcctgta cccacccccg
2581 ccacacaccc cgacgctggc ctatagtgca aaggttgtgc gggctggtcc ttcccacaat
2641 gcagtactgt aatccccgtc cctcctggag cc
//
LOCUS RHMDCTABA 764 bp ds-DNA BCT 10-AUG-1990
DEFINITION R.meliloti dctA and dctB genes, 5' end.
ACCESSION M33555
KEYWORDS regulatory protein.
SOURCE R.meliloti DNA.
ORGANISM Rhizobium meliloti
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 764)
AUTHORS Wang,Y.-P., Birkenhead,K., Boesten,B., Manian,S. and O'Gara,F.
TITLE Genetic analysis and regulation of the Rhizobium meliloti genes
controlling C-4-dicarboxylic acid transport
JOURNAL Gene 85, 135-144 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 726 > 764 dctA protein
BASE COUNT 149 a 239 c 252 g 124 t
ORIGIN
1 cccgggccgc ttgctgaccg tgcccatggc gaaatgttcg gccatgccgt cggcggacgg
61 cgagccggaa ataatcgcgg aaggcatagt cgttgccgac gaagctcgtc ggctcctgcc
121 agttgctggc cgcgacggcg acgccgctgc ggtcgatcag ataaatgacc gcggcttcgg
181 cgcttgtcgc cagggcctcg agcttacggt tgatgcggtc gagcgacggc ctgtccggcg
241 aaagcaatgc gccacgaatg gctgcgtcgt cggcgaggac aagcggcagg gcgcgctgcc
301 gttccacgac tgctcgaaga agcgaggctt tcaggctggc gtcgatccgg ctctgaccgg
361 caaggccggc gagcgcctgc gaccggccgt agtctcgcgc aagaagaagg ccggctgcca
421 gaaggaccag cgcgactgcc gcgaaaacga gccaggaccg tcgggcccgg ctgcgaacgc
481 atgcgatcgc tcgcttctgc aggaagtttg accatgcgaa cattgtgcat gttttcgccc
541 aggacgccag cacttctgtg cggaaatccg cacatatcca cgaacggcaa gcgagcaccg
601 ctcccaaaaa tgtcatggat tcaatcgcat aggccttccg cggcaaactg gcacgcatgt
661 tgctgacaag ctccacaagg cagccactgc tgtcgatctt cggaaagcgg cccgggaggc
721 ccggcatgtt gccggactgg gcctgccacg tggaggatat catg
//
LOCUS BSUCOMGA 6278 bp ds-DNA BCT 10-AUG-1990
DEFINITION B.subtilis comG-[1,2,3,4,5,6,and 7] proteins in comG operon,
complete cds.
ACCESSION M29691 M22854
KEYWORDS exogenous DNA-binding.
SOURCE B.subtilis (strain 168) DNA, clone pED4.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 6278)
AUTHORS Albano,M., Breitling,R. and Dubnau,D.A.
TITLE Nucleotide sequence and genetic organization of the Bacillus
subtilis comG operon
JOURNAL J. Bacteriol. 171, 5386-5404 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by D.A.Dubnau, 02-MAR-1989.
ORF1 is required for the ability of competent cultures to resolve
into two populations with different cell densities on Renografin
gradients, as well as for full expression of comE, another late
competent locus. ORF1 shows significant similarity to the virB
ORF11 protein from Agrobacterium tumefaciens, which is probably
involved in T-DNA transfer. The N-terminal sequences of comG ORF3
and, to a lesser extent, the comG ORF4 and 5 proteins are similar
to a class of pilin proteins from members of the genera
Bacteroides, Pseudomonas, Neisseria and Moraxella. All of the comG
proteins, except ORF1, possess hydrophobic domains that are
potentially capable of spanning the bacterial membrane and may be
part of the DNA transport machinery.
FEATURES from to/span description
pept 1298 2368 comG-1 protein (ttg start codon)
pept 2420 3391 comG-2 protein
pept 3405 3701 comG-3 protein
pept 3691 4122 comG-4 protein (ttg start codon)
pept 4106 4453 comG-5 protein
pept 4863 5237 comG-7 protein
ORF 4365 4862 comG-6 protein (put.)
ORF < 1 886 ORF1 (AA at 2)
ORF 6126 6276 ORF2
mRNA 1278 > 6278 comG mRNA
signal 1100 1124 pot. transcription termination signal
signal 5501 5533 transcription termination signal
signal 5696 5715 transcription termination signal
signal 1243 1248 comG -35 region
signal 1266 1271 comG -10 region
BASE COUNT 1960 a 1122 c 1383 g 1813 t
ORIGIN 1 bp upstream of BglII site.
1 agatctaatc catttttctc actggcctca gtgtgaaaag tggtttgaaa ataaccatca
61 cgttaatttt ttgcgagtag atacaactga aacggaaaat gaagcagtat ttgggtcgat
121 tgtttatgat caggggcttg gtgaagaaaa agaccatact gtttttcact tttatatcac
181 cagacaatat ttttttacaa tcaactttga cttttcaatt ttgagagaga ttaaaggcaa
241 agaagttgtt cggcaaatgg aaagagcgga caatgcgata gaggggtttt taattcttct
301 cggcgaacta atgaatgcgt atttaatcgg tgttgatgaa tttgaagtca agctgagaaa
361 gctcagatgg caaattaaag acgacaatag caaaagcatt ttaaaccgcg tccatctcct
421 gcgccatgaa ctgatgattt ggaaaaattt gatattaagc gctaaaaaaa ttgaaatggc
481 gttgaaagaa acctttttac ctcaaaatga agggaaaaag gattatcagc ggacacaact
541 gaagattgac aggggattta catacatcag cgaatttgaa ggggagctta acaatctgct
601 gcattcagag gaagtcatta cctcacatag ggggaatgaa attgtaaaag cgctgaccat
661 tttcacgacg ctttttactc cgattacagc tctgggtgcc ttatggggga tgaacttttc
721 agtgatgccg gaactgaatt ggaaatacgg atatctcttt tccctcttat tgattgtcac
781 atctacagtt ctgatctatc tctatttgag aaaaaaaggc tggacgggag atatgctgca
841 ggagcggaag aagaaaaaga aacctcgaaa aaggcggact ctataggatg tttcatattt
901 tgtgcagcgt gccccgcttt ttcaccagac atatcagggt gaccggatac gatgtcaagg
961 ggcttatgac agagcattaa atccgcagtt tatcgattct tgaaaatgac caaatgaccg
1021 gtattgttgc attaggcgat ctttccgttg agaaagatac tggtcaataa gcgaaaacag
1081 cataatgaaa atggaatcta gcaggcatgg tgaccatgtc tgctttttta tttataggga
1141 aaattataat gacaggggta cattcagttg aaagtctttt ttcttgccag aaagaattgg
1201 tttttcagca tataacatct cacaaaatca cgttttccct gtttgattac cttttcttct
1261 ttttctacaa tatgcgttga aaggagaggg aatcaaattg gattcaatag aaaaggtaag
1321 caaaaacttg attgaagagg catatctaac aaaggcttct gatattcaca ttgtgccgag
1381 ggagcgggac gctatcattc attttcgggt cgatcatgcc ttgctgaaaa aaagggacat
1441 gaaaaaagaa gagtgcgtaa gactgatttc acattttaaa tttctttcag caatggatat
1501 aggtgaaagg cgaaagccgc aaaacggttc gcttacgtta aagttgaaag agggaaatgt
1561 tcatttaaga atgtcaacgc tgcccacaat taatgaagaa agcctcgtga tcagagtgat
1621 gccccaatac aatatccctt cgattgataa attgtcgcta tttccgaaga caggagccac
1681 attactctcg tttttaaaac attcccatgg catgctcatt tttaccgggc cgactggttc
1741 agggaagact accacattat actctctcgt tcaatatgca aaaaaacact ttaatcgaaa
1801 tattgtcaca ttagaggacc ctgttgaaac aagggacgaa gatgttcttc aggttcaggt
1861 gaatgaaaaa gccggtgtaa cttattccgc aggtctgaaa gcaattttgc gccatgaccc
1921 cgatatgatt attttaggtg agatcagaga cgcggaaaca gctgaaattg cggtgcgggc
1981 agcgatgacg ggacatctgg tactaacgag ccttcatacg agagacgcaa agggcgcaat
2041 ttacagactg cttgaattcg gtatcaatat gaatgaaatc gaacagactg tcattgcaat
2101 agcggctcag cgcttggttg atttggcttg cccgttttgt gaaaacggat gttcatcagt
2161 gtattgccga cagtcacgaa atactaggag agctagcgtt tatgagcttc tatacgggaa
2221 aaatcttcag caatgtatcc aggaggcaaa aggaaatcat gcaaattacc aatatcaaac
2281 gcttcgtcaa attatcagaa aaggaattgc gctcggctat ttaacgacaa acaactatga
2341 ccggtgggtt tatcatgaaa aagattagaa agtctggttg ttaaaggatc aagccaggtt
2401 attaaagagg ctcggtgaaa tgactgcggg cggatataca cttctggatg gattacgcct
2461 gatggaactt cagatgaata agaggcaggc ggctgacttg actgattcgg tcacttgttt
2521 gagggaaggg gctccgtttt atcaagtact aaagagtttg tcatttcata aggaagccgt
2581 aggtatttgt tattttgctg aaacacatgg tgaactgcct gcttcaatga tccagagcgg
2641 agagctgctg gaacgaaaaa ttgcacaggc agaccagctg aaaagagtgc tgcgctatcc
2701 gcttttcctc atctttacgg tcgctgtcat gttttatatg ttacagtcca tcatcattcc
2761 tcagttttcc ggtatctatc aatcgatgaa tatggaaacc tcacgttcaa ccgatatgct
2821 ttttgctttt tttcagcata ttgatcttgt gatcattttg cttgttcttt ttacagcagg
2881 tatcgggatt tattattggc ttgtgtttaa gaaaaaatca cctgcccggc aaatgctgat
2941 ttgtatcagg attcctttgg ttggaaagct tgtaaagctg tttaacagct actttttttc
3001 tttgcagcta agcagccttt taaaatcagg cctctcaatt tatgacagcc ttaatgcatt
3061 taaacatcaa acgtttctcc ctttctaccg ctgcgaggct gaacaattga ttgaacggct
3121 aaaagccggt gagtcaattg aatccgctat ttgtggaagc cttttttatg aaactgattt
3181 atcaaaagtc atatctcacg gccagctgag cggccgattg gatcgggagc ttttcacata
3241 cagccaattc atattacagc ggctggaaca caaagcgcaa aaatggacag gcatccttca
3301 gccaatgatt tatggatttg ttgcagcgat gatcttactt gtgtatttat ctatgcttgt
3361 gcctatgtat cagatgatga atcaaatgtg aaaggaagag gctgatgaat gagaaaggat
3421 ttacacttgt tgaaatgtta atcgtgctct ttattatttc gattttgctt ttaattacga
3481 taccgaacgt cacgaaacat aatcaaacca ttcaaaaaaa gggctgtgaa ggcttacaaa
3541 acatggttaa ggcacaaatg actgcatttg agcttgatca tgaaggacaa actccgagcc
3601 ttgccgattt acagtcagag ggctatgtga aaaaggatgc tgtctgtcca aatggtaagc
3661 gcattatcat caccggcgga gaagttaagg ttgaacatta aattaaacga ggagaagggg
3721 tttacccttt tagaaagttt gcttgtgtta agccttgcct ctatcctcct ggtggccgtc
3781 ttcactacac ttcctcctgc ttatgacaat acagctgtcc gacaggcagc aagtcagctg
3841 aaaaatgata ttatgctcac acagcagact gctatttccc gtcaacaaag aacaaaaatt
3901 ctctttcata aaaaagaata tcaattagtc attggtgata cggttattga acgtccgtat
3961 gcaacgggac tttctataga actgctgaca ttaaaagacc gtttggaatt taatgagaaa
4021 gggcacccga atgcaggcgg aaaaatacga gtaaaaggcc atgccgttta tgacataaca
4081 gtttatctag ggagcgggag agtcaatgtg gagagaaaat aaaggttttt ctacaataga
4141 aacaatgtct gcgctaagcc tgtggctgtt tgtgctgctg acagtcgtcc ccttgtggga
4201 caagctgatg gctgatgaaa aaatggcgga atcacgagaa attggctatc agatgatgaa
4261 tgagagcatt agcaaatatg tcatgagtgg tgaaggagcc gcgtcaaaaa cgattacaaa
4321 gaacaatcat atctatgcaa tgaagtggga ggaggagggc gaatatcaaa acgtatgtat
4381 caaagccgca gcttataaag aaaaatcatt ttgcctcagc attttgcaga cagaatggct
4441 acacgcttct taacgtatta ttttcgctct cagtcttttt gctcatatca ggatcgttag
4501 ctgcgattat ccatctgttt ttgtctcgac agcaggaaca tgacggtttc acacagcagg
4561 aatggatgat ttcgatagaa cagatgatga atgaatgcaa ggaatcacag gcagttaaga
4621 cagccgagca tgggagcgtg ttaatctgca ccaatctttc cggacaagac atccgttttg
4681 acatttatca ttcaatgata agaaaaagag tggatggcaa agggcatgtt ccgattttag
4741 atcatattac tgccatgaaa gctgatattg aaaatggtgt tgttttgctg aaaattgaga
4801 gtgaagacca aaaagtgtat caaactgctt ttccagtcta ttcgtattta ggaggggggt
4861 gaatgtatcg tacaagaggg tttatttatc cagctgttct ttttgtgtca gcgcttgtgc
4921 tgttaatcgt gaactttgtt gctgctcaat atatttcacg ctgcatgttt gagaaggaaa
4981 caaaagagtt atacatagga gagaatttgc ttcaaaatgg ggtgcttctt tcgattcggc
5041 atgttctaga ggaacggaaa ggccaggagg gtacgcagca atttctatat ggacgggttt
5101 cttattacat tcatgataca tcgataaaag aacaaaaaga aatcaactta agagtgtcaa
5161 cggattcggg aacagaaaga actgcacaga tcgtgtttga ccaaaaacag aaaaaactgc
5221 tgagatggac agaataaaac agtgtaaagg gtataaaaaa agtcatgtga gacaacactc
5281 ataataattg aatgatgagg tgatcacgtg aaaacgaatg attatgttaa atatatgacg
5341 cagcaatttg tcaaatatat agatactccg agagatgagc gaaaagaacg aaaagaggtg
5401 cggaaagaaa caaaaacgcc tgtttcccag cagtggttcg gtattttacc ctatggcttc
5461 cgactttggc tgaaacggaa aaaataaccg caaataaacg aataaggtcc ttcaaaaaat
5521 ggaggacctt attgatattc ttctaatatg gcaattttat tgaccttttg gctataagga
5581 tcaaatgaaa tcgtcacaaa aacgccgaat tcttttgacc cttccctcag agttaaatgg
5641 tattgcttca ctgcttcatc ttttctttta cggtcccata ctttttgttt gaacagtacc
5701 tgtgcgagcg ggtacctttt ttttgcttct tttacagcaa tctcttccca tttggacatg
5761 tggcgggcgg ttacaagcgg tgtttcttct gcgtgagcgg ctgtggtgcc aaagacgaga
5821 agagatagac aaatcacaca ttgtttgatc atcatgctgt cacctttctt tgtttattat
5881 taccaaataa taatgggata tgcatttaac ttctcacata acaatcccaa aaatttctaa
5941 aaaattgaaa aaatgagcaa tactgagcaa gactttgtaa tatgatgaaa acattctttt
6001 aaacgaacaa aatgagcgat ttcggtgttt ttaaatctat aaatcgttga ttatactcta
6061 tttgtgaagt tctttaaaga gaacgattgt catatcaagt tacagtgttt tacaggaggt
6121 aagatatgtt tcgattgttt cacaatcagc aaaaggcgaa gacgaaactg aaagttctgc
6181 ttatctttca gctttcagtc attttcagtc tgactgccgc aatatgctta caattttcga
6241 tgatacaagc gctgcttttc atgatattga aacatttg
//
LOCUS CELTC1A 234 bp ds-DNA INV 10-AUG-1990
DEFINITION C.elegans Tc1(1664) element with a DNA insert.
ACCESSION M22301 M29977
KEYWORDS Tc1 element; Tc1 repetitive element.
SOURCE C.elegans (var. Bristol, strain N2) DNA, clone pCeh45.
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 234)
AUTHORS Harris,L.J. and Rose,A.M.
TITLE Structural analysis of Tc1 elements in Caenorhabditis elegans var.
Bristol (strain N2)
JOURNAL Plasmid 22, 10-21 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
L.J.Harris, 18-JAN-1989.
FEATURES from to/span description
rpt < 1 > 234 Tc1(1664) insertion element
rpt 85 94 direct repeat copy A
rpt 107 116 direct repeat copy B
rpt 128 137 direct repeat copy C
recomb 87 88 Tc1(1664) DNA end/inserted DNA start
recomb 141 142 inserted DNA end/Tc1(1664) DNA start
BASE COUNT 67 a 36 c 32 g 99 t
ORIGIN 3 bp upstream of EcoRV site.
1 gatatccact tttggttttt tgtgtgtaac ttttttctca agcatccatt tgacttgaat
61 ttttccgtgt gcataaagcg aaatgtttcg caaattaagc aaataaagtt tcgcaaagaa
121 ttaaattgtt tcgcaaacga aacgcaaatt tgcggaccaa acattacatg attatcgatt
181 ttttctgaat tttatttcaa ttttttgatt ttttcgtttt tccaattttc atta
//
LOCUS HUMDEF1A 448 bp ss-mRNA PRI 10-AUG-1990
DEFINITION Human neutrophil peptide (defensin) 1 mRNA, complete cds.
ACCESSION M21130 M22160
KEYWORDS cytotoxic protein; defensin 1; microcidal protein;
neutrophil peptide.
SOURCE Human leukocyte (myeloid cell line HL-60), cDNA to mRNA, clones
HNP1a and HNP1b.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 448)
AUTHORS Daher,K.A., Lehrer,R.I., Ganz,T. and Kronenberg,M.
TITLE Isolation and characterization of human defensin cDNA clones
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7327-7331 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
K.Daher, 25-NOV-1988.
Submission form for [1] kindly provided by M.Kronenberg,
01-JAN-1989.
FEATURES from to/span description
pept 51 335 neutrophil peptide 1 precursor /nomgen="DEF1"
/map="8p23" /hgml_locus_uid="LR0109A"
sigp 51 107 neutrophil peptide 1 signal peptide
matp 243 332 neutrophil peptide 1
mRNA < 1 448 neutrophil peptide 1 mRNA
BASE COUNT 106 a 127 c 110 g 105 t
ORIGIN Unreported.
1 tctgccctct ctggtcaccc tgcctagcta gaggatctgt gaccccagcc atgaggaccc
61 tcgccatcct tgctgccatt ctcctggtgg ccctgcaggc ccaggctgag ccactccagg
121 caagagctga tgaggttgct gcagccccgg agcagattgc agcggacatc ccagaagtgg
181 ttgtttccct tgcatgggac gaaagcttgg ctccaaagca tccaggctca aggaaaaaca
241 tggcctgcta ttgcagaata ccagcgtgca ttgcaggaga acgtcgctat ggaacctgca
301 tctaccaggg aagactctgg gcattctgct gctgagcttg cagaaaaaga aaaatgagct
361 caaaatttgc tttgagagct acagggaatt gctattactc ctgtaccttc tgctcaattt
421 cctttcctca tcccaaataa atgccttg
//
LOCUS MUSMIPAA 753 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Mouse macrophage inflammatory protein (MIP)-1a mRNA, complete cds.
ACCESSION X12531 M19382
KEYWORDS lymphokine; macrophage inflammatory protein.
SOURCE Mouse cell line RAW264.7., cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 753)
AUTHORS Davatelis,G., Tekamp-Olson,P., Wolpe,S.D., Hermsen,K., Luedke,C.,
Gallegos,C., Coit,D., Merryweather,J. and Cerami,A.
TITLE Cloning and characterization of a cDNA for murine macrophage
inflammatory protein (MIP), a novel monokine with inflammatory and
chemokinetic properties
JOURNAL J. Exp. Med. 167, 1939-1944 (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by G.Davatelis, 27-MAY-1988.
FEATURES from to/span description
pept 71 349 macrophage inflammatory protein precursor
sigp 71 139 macrophage inflammatory protein signal peptide
matp 140 346 macrophage inflammatory protein
mRNA < 1 753 MIP mRNA
signal 753 753 polyA signal
BASE COUNT 190 a 203 c 143 g 217 t
ORIGIN
1 aagcagcagc gagtaccagt cccttttctg ttctgctgac aagctcaccc tctgtcacct
61 gctcaacatc atgaaggtct ccaccactgc ccttgctgtt cttctctgta ccatgacact
121 ctgcaaccaa gtcttctcag cgccatatgg agctgacacc ccgactgcct gctgcttctc
181 ctacagccgg aagattccac gccaattcat cgttgagtat tttgaaacca gcagcctttg
241 ctcccagcca ggtgtcattt tcctgactaa gagaaaccgg cagatctgcg ctgactccaa
301 agagacctgg gtccaagaat acatcactga cctggaactg attgcctgag agtcttggag
361 gcagcgagga accccccaaa cctccatggg tcccgtgtag agcaggggct tgagccgaac
421 attcctgcca cctgcatagc tccatctcct ataagctgtt tgctgccaag tagccacatc
481 gagggactct tcacttgaaa ttttatttaa tttaatccta ttggtttaat actatttaat
541 tttgtaattt attttattgt catacttgta tttgtgacta tttattctga aagacttcag
601 gacacgttcc tcaaccccca tctccctccc agttgttcac actgtttggt gacagctatt
661 ctaggtagac atgatgacaa agtcatgaac tgacaatgta caatagatgc tttgtttata
721 ccagagaagt aataaatatg ccctttaaca agt
//
LOCUS ONGSSDNA 154 bp ds-DNA INV 10-AUG-1990
DEFINITION O.volvulus species-specific DNA.
ACCESSION J03990 M22037
KEYWORDS .
SOURCE O.volvulus (strain Liberian) DNA, clone C1A1-2.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 154)
AUTHORS Harnett,W., Chambers,A.E., Renz,A. and Parkhouse,R.M.E.
TITLE An oligonucleotide probe specific for Onchocerca volvulus
JOURNAL Mol. Biochem. Parasitol. 35, 119-126 (1989)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
W.Harnett, 18-JAN-1989.
BASE COUNT 50 a 23 c 27 g 54 t
ORIGIN
1 gaattcaaaa aaatatagtc gaatattttt cttaggaccc agtttgaagg tacgtacccg
61 ttttttgaaa ttatatgtca taggtcatct gttaataatc aattttgcaa aatgcgtttt
121 tcgccgaaaa aatcgccgtg taaatgtgga attc
//
LOCUS RHMDCTAB 2300 bp ds-DNA BCT 10-AUG-1990
DEFINITION R.meliloti C-4-dicarboxylate carrier protein (dctA) gene, complete
cds, and dctB protein gene, partial cds.
ACCESSION M26399 M29729
KEYWORDS C4-dicarboxylate carrier protein; membrane protein;
transport protein.
SOURCE R.meliloti (strain 2011) DNA, clone pRmSC121.
ORGANISM Rhizobium meliloti
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 2300)
AUTHORS Engelke,T., Jording,D., Kapp,D. and Puehler,A.
TITLE Identification and sequence analysis of the Rhizobium meliloti dctA
gene encoding the C4-dicarboxylate carrier
JOURNAL J. Bacteriol. 171, 5551-5560 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Jording, 13-OCT-1989.
FEATURES from to/span description
pept 750 2111 C4-dicarboxylate carrier A (put.)
pept 786 2111 C4-dicarboxylate carrier A' (put.)
pept 555 < 1 (c) dctB protein
signal 582 597 activator sequence
signal 673 689 5' promoter region
BASE COUNT 388 a 758 c 707 g 447 t
ORIGIN
1 gtcgacgcgc cgggaaatat aaagcccggg ccgattgctg accgtgccca tggcgaaatg
61 ttcggccatg ccgtcgcgga cggcgagccg gaaataatcg cggaaggcat agtcgttgcc
121 gacaaagctc gtcggctcct gccagttgct ggccgcgacg gcgacgccgc tccggtcgat
181 cagataaatg accgcggctt cggcgcttgt cgccagggcc tcgagcttac ggttgatgcg
241 gtcgagcgac ggcctgtccg gcgaaagcaa tgcgccacga atggctgcgt cgtcggcgag
301 gacaagcggc agggcgcgct gccgttccac gactgctcga agaagcgagg ctttcaggct
361 ggcgtcgatc cggctctgac cggcaaggcc ggcgagcgcc tgcgaccggc cgtagtctcg
421 cgcaagaaga aggccggctg ccagaaggac cagcgcgact gccgcgaaaa cgagccagga
481 ccgtcgggcc cggctgcgaa gcgcatgcgg atcgctcgct tctgcaggaa gtttgaccat
541 gcgaacatgg tgcatgtttt cgcccaggac gccagcactt ctgtgcggaa atccgcacat
601 atccacgaac ggcaagcgag caccgctccc aaaaatgtca tggattcaat cgcataggcc
661 ttccgcggca aactggcacg catgttgctg acaagctcca caaggcagcc actgctgtcg
721 atcttcggaa agcggcccgg gaggccggca tgttgccgga ctgggcctgc cacgtggagg
781 atatcatgat catcgaacat tccgcggagg tccgcggcaa gacacccctt taccgccatc
841 tctatgttca ggtgctggcg gcgatcgccg cgggcatcct gctcgggcat ttctatcctg
901 atatcggcac ggagctcaaa ccgcttggcg acgccttcat caggctcgtg aagatgatca
961 tcgcgccggt gatcttcctg acggtcgcga ccgggattgc cggcatgacc gatctcgcca
1021 aggtcggccg cgtcgccggc aaggcgatga tctactttct cgccttctcc accctcgcgc
1081 ttgtcgtcgg cctcgtcgtc gcaaacgtgg tgcagccggg cgcgggcatg catatcgatc
1141 cggcctcgct ggacgccaag gcggtcgcga cctatgccga gaaagcgcat gagcagtcga
1201 tcaccggctt cctgatgaac atcatcccga cgacgctcgt cggcgccttc gccgaaggcg
1261 acatcctgca ggtcctgttc atctcggtgc tcttcggtat ctcgctggcg atcgtcggca
1321 agaaagccga gcccgtggtc gatttcctgc aggcgctgac gctgccgatc ttccggctcg
1381 ttgcgatcct gatgaaggcc gccccgatcg gcgccttcgg cgccatggcc ttcaccatcg
1441 gcaagtacgg tatcgcctcg atcgccaatc tcgccatgct gatcggcacc ttctatctga
1501 cgtcgtttct cttcgtcttc atcgttctcg gcgcggtcgc acgctataac ggcttctcga
1561 tcctctcgct catccgctac atcaaggagg agctgctgct ggtgctcggg acgtcctctt
1621 cggaggcggc tcttccgggc ctcatgaaca agatggagaa ggccggctgc aagcgctcgg
1681 tcgtcggtct cgtcattccg accggttatt ccttcaacct ggacggcacc aatatctaca
1741 tgacccttgc ggccctgttc atcgcccagg cgaccgatac gccgctctca tacggcgacc
1801 agattctgct gctcctcgtc gcaatgctga gctcgaaggg tgcggccggc attaccggcg
1861 ccggcttcat cacgcttgcc gcaacgctct cggtcgttcc ctccgtgccg gtcgccggca
1921 tggcgctgat cctcggcatc gaccgcttca tgtcggaatg ccgcgccctg accaatttcg
1981 tcggcaacgc ggttgcgacg atcgtggtgg cgaagtggga gggcgagctc gatcaggcgc
2041 agctttccgc agctctcggc ggcgaggcgt ccgtcgaggc catcccggcg gtcgtccagc
2101 ccgccgaata agcttgcctc ccagggcgag accgcacgct ggtgcgcttg gcgcggtccg
2161 gttctccggg ccgcgctttt ttatcgccgg ccaggcctga gggggtgggc actcggcaca
2221 agatcagatg ccgtgcagca cctgcgtcgc cttcactgcc gccgaggccc ggttttcgac
2281 gccgagcttc acatagatct
//
LOCUS HUMMYOHP 6016 bp ss-mRNA PRI 10-AUG-1990
DEFINITION Human perinatal myosin heavy chain mRNA, complete cds.
ACCESSION M36769 M35250 Y00821
KEYWORDS myosin; myosin heavy chain.
SOURCE Human fetal muscle, cDNA to mRNA, clone pSMHCP.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 2651 to 6016)
AUTHORS Leinwand,L.A.
TITLE ;
JOURNAL Unpublished (1989) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 2651 to 6016)
AUTHORS Feghali,R. and Leinwand,L.A.
TITLE Molecular genetic characterization of a developmentally regulated
human perinatal myosin heavy chain
JOURNAL J. Cell Biol. 108, 1791-1797 (1989)
STANDARD simple automatic
REFERENCE 3 (bases 1 to 2650)
AUTHORS Karsch-Mizrachi,I., Feghali,R., Shows,T.B. and Leinwand,L.A.
TITLE Generation of a full-length human perinatal myosin
heavy-chain-encoding cDNA
JOURNAL Gene 89, 289-294 (1990)
STANDARD full staff_review
COMMENT Author address: L.A.Leinwand Albert Einstein College of Medicine
1300 Morris Park Avenue Bronx, N.Y. 10461, USA.
Submitted (12-APR-1989) on tape to the EMBL data library. Draft
entry and computer-readable sequence for [3] kindly submitted by
I.Karsch-Mizrachi, 18-JUN-1990.
FEATURES from to/span description
pept 74 5887 perinatal myosin heavy chain
mRNA < 1 6016 perinatal myosin heavy chain mRNA
BASE COUNT 1953 a 1300 c 1586 g 1177 t
ORIGIN
1 gtggaacact tctgaacctg catttttatc tggaactcca gaagcagaat cctttgctaa
61 ataaatcgca gccatgagtg cgagctcaga cgctgagatg gctgtttttg gcgaagctgc
121 tccctacctt cgaaaatcag aaaaggagcg gattgaggcc caaaacaagc cgtttgatgc
181 taaaacatct gtctttgtgg cggagcccaa ggaatcctat gtgaagagca ctatacaaag
241 caaagaagga gggaaagtaa ccgtaaagac tgaaggtgga gcaactctaa ctgtcaggga
301 agaccaagtc ttccctatga accctccgaa atatgacaaa attgaggaca tggccatgat
361 gactcatcta cacgagcctg gagtgctgta caacctcaaa gagcgctatg cagcctggat
421 gatctacacc tactcaggcc tcttctgtgt caccgtcaac ccctacaagt ggctgccggt
481 gtacaagccc gaggtggtgg ctgcctacag aggcaaaaag cgccaggagg ccccgcccca
541 catcttctcc atctctgaca atgcctatca gttcatgttg actgatcgag agaatcagtc
601 catcctgatc accggagaat ctggtgccgg aaagactgtg aacaccaagc gtgtcatcca
661 atactttgca acaattgcag ttactggaga gaagaagaag gatgaatctg gcaaaatgca
721 ggggactctg gaagatcaaa tcatcagcgc caatccccta ctggaggcct ttggcaatgc
781 caaaaccgtg aggaatgaca actcctctcg ctttggtaaa ttcattagaa tccactttgg
841 tactacaggg aagctggcat ctgctgatat agaaacatat cttttagaaa agtccagagt
901 tactttccag ctaaaggcgg aaagaagcta ccatattttt tatcagatca cttccaataa
961 gaagccagat ctaattgaaa tgctcctgat caccaccaac ccatatgact atgccttcgt
1021 cagtcagggg gagatcacag ttcccagtat tgatgaccaa gaagagttga tggccactga
1081 tagtgccatt gacatcctgg gcttcactcc tgaagagaaa gtgtccatct ataaactcac
1141 aggggctgtg atgcattatg ggaacatgaa attcaagcaa aagcagcgtg aggagcaagc
1201 tgagccagat ggcacagaag tcgctgacaa ggcagcctat ctccagagtc tgaactctgc
1261 agacctactc aaagccctct gctaccctag ggtcaaggtt ggcaatgagt atgtcaccaa
1321 aggccagact gtgcagcagg tgtacaatgc cgtgggtgct ctggccaaag ccgtctacga
1381 gaagatgttc ctgtggatgg tcacccgcat caaccagcag ctggacacca agcagcccag
1441 gcagtacttc atcggggtct tggacattgc tggctttgaa atctttgatt ttaacagcct
1501 ggagcagctg tgcatcaact tcaccaacga gaaactgcaa cagtttttca accaccacat
1561 gtttgtgcta gagcaggagg agtacaagaa ggaaggcatc gagtggacgt tcattgactt
1621 tgggatggac ctggctgcct gcattgagct cattgagaag ccactgggca tcttctccat
1681 cctggaagag gagtgcatgt tccctaaggc cacggacacc tccttcaaga acaagctgta
1741 tgaccagcac ctgggcaagt ctgccaactt ccagaagccc aaggtggtca aaggcaaggc
1801 tgaggcccac ttctctctga ttcactatgc tggcactgtg gactacaaca ttactggctg
1861 gctggacaaa aataaggacc ccctgaatga tactgtggtt gggctgtacc agaagtctgc
1921 aatgaagact ctagccagtc tcttttccac gtatgctagt gctgaagcag atagcagcgc
1981 gaagaaaggt gctaagaaaa agggctcttc tttccagact gtgtctgccc ttttcaggga
2041 aaatttaaat aaattgatga cgaatctgag gagcacacac cctcacttcg tacggtgtat
2101 cattcccaat gaaaccaaaa ctcctggggc aatggaacat gaacttgtgt tgcaccagct
2161 gaggtgtaat ggtgtgctgg aaggcatccg catctgtagg aaaggattcc caagcagaat
2221 cttatatggt gatttcaaac aaagatacaa ggttttaaat gcaagtgcta ttccagaggg
2281 acagttcatt gacagcaaga aggcttctga gaaacttctt gcatctattg atattgatca
2341 tactcaatat aaatttggac ataccaaggt tttcttcaaa gctggacttc tgggtcttct
2401 ggaagaaatg agagatgaaa aattagccca aattataaca agaacacaag ctgtctgtag
2461 gggattccta atgagggtag aatatcagaa gatgttgcaa aggagagaag cacttttctg
2521 catccagtat aatgtccgtg ccttcatgaa cgtcaagcac tggccctgga tgaaactctt
2581 tttcaagatt aagcccctcc tcaagagtgc agagaccgag aaagagatgg ccaccatgaa
2641 ggaagaattc cagaaaacca aagatgaact cgccaagtca gaggcaaaac ggaaggagct
2701 agaggaaaaa atggtcactc tcttaaaaga gaaaaatgac ctgcaactcc aggttcaatc
2761 tgaagcagat agcttggctg atgcagagga aaggtgtgag caactgatta aaaacaaaat
2821 ccaacttgag gccaaaatca aagaggtgac tgaaagagct gaggaggagg aagagatcaa
2881 tgctgagctg acagccaaga agagaaaact ggaggatgaa tgttcagaac tcaagaaaga
2941 cattgatgac cttgagctga cactggccaa ggttgagaag cagaaacatg ccacggagaa
3001 caaggtgaaa aatcttacag aagagatggc aggcctggat gaaaccattg caaaactgtc
3061 caaggagaag aaggctctcc aagagaccca ccagcagacc ctggatgacc tgcaggcaga
3121 ggaggacaaa gtcaacatcc tgaccaaagc taaaaccaag ctagaacagc aagtggatga
3181 tcttgaaggg tctctggaac aagaaaagaa gcttcgaatg gatctagaaa gagcaaagcg
3241 gaaactggag ggtgacctca aattggccca agaatccaca atggatatgg aaaatgacaa
3301 acagcaactt gatgaaaagc ttgaaaagaa agaatttgaa atcagcaatt tgataagcaa
3361 aattgaagat gagcaagctg tagaaattca actacagaag aagatcaaag agttgcaggc
3421 ccgcattgag gagctggggg aagaaatcga ggcagagagg gcgtcccgag ccaaagcgga
3481 gaagcagcgc tctgacctct cccgggaact ggaggagatc agcgagaggc tggaagaagc
3541 cggtggggca acttctgctc aggtggaatt gaacaagaag cgggaggctg agtttcagaa
3601 actgcgcagg gacctggagg aggccaccct gcagcatgaa gctatggtgg ctgctcttcg
3661 gaagaagcac gcagacagta tggctgagct tggggagcag attgacaact tgcagcgggt
3721 caaacagaag ctggagaagg agaagagtga gctgaagatg gagactgatg acctcagcag
3781 taacgcagag gccatttcca aagccaaggg ccaccttgaa aagatgtgcc gctctctaga
3841 agatcaagtg agtgggctta agaccaagga agaggagcag cagcggctga tcaatgacct
3901 cacagcacag agagcgcgcc tgcagacaga agcgggtgaa tattctcgac aattagatga
3961 gcaagatgct ttagtctctc agctttcaag gagcaagcaa gcatctactc agcagattga
4021 agagctgaaa catcaactag aggaagaaac taaagccaag aacgccctgg cacacgccct
4081 gcagtcctcc cgccatgact gcgacctgct gcgggaacag tatgaggaag agcaggaagg
4141 caaagctgag ctgcagaggg cgctgtccaa ggccaacagt gaggttgccc agtggagaac
4201 caaatacgag acggatgcca tccagcgcac agaggagctg gaggaggcca agaaaaagtt
4261 ggcccagcgc ctgcaagaag ctgaggaaca tgtagaagct gtgaacgcca aatgtgcttc
4321 ccttgagaag acgaagcagc ggctccagaa tgaagttgaa gacctcatgc ttgatgtgga
4381 aaggtctaat gcagcctgtg cagcccttga taagaagcaa aggaactttg acaaggtcct
4441 atcagaatgg aagcagaagt atgaggaaac tcaggctgaa cttgaggcct cccagaagga
4501 gtcacgttct cttagcactg agctgttcaa ggtgaagaat gtctatgagg aatccctgga
4561 tcaactcgaa acgctaagaa gagcacataa gaacttgcaa caggagattt ctgacctcac
4621 tgagcagatt gcagagggag gaaagcaaat tcatgaattg gagaaaataa agaagcaagt
4681 agaacaagag aaatgtgaaa ttcaggctgc tttagaggaa gcagaggcat ctcttgaaca
4741 tgaagaagga aagattctgc gtatccagct tgagttaaac caagtcaagt ctgaagttga
4801 tagaaaaatc gcagaaaagg atgaggaaat tgaccagctg aagagaaacc acactagagt
4861 cgtggagaca atgcagagca cgctggatgc agagattaga agcagaaatg atgctctgag
4921 agtcaagaag aaaatggaag gagatctgaa tgaaatggaa atccagctga accatgccaa
4981 tcgcttagct gcagagagtt taaggaacta caggaacacc caaggaatcc tgaaggaaac
5041 ccagctccac ctggatgatg ctctccgggg ccaggaggac ctcaaggaac agctggcaat
5101 tgtggagcgc agagccaacc tgctgcaggc tgagatcgag gagctgtggg ccactctgga
5161 acagacagag agaagcagga aaatcgccga acaggagctc ctggatgcca gtgagcgtgt
5221 ccagctcctc cacacccaga ataccagtct cattaacacc aagaagaaat tagaaaatga
5281 cgtttcccaa ctccaaagtg aagtggaaga agtaatccaa gaatcacgca atgcagaaga
5341 gaaagccaag aaggccatca ctgatgctgc catgatggct gaggagctga agaaggaaca
5401 ggacaccagc gcccacctgg agcggatgaa gaagaacctg gagcagacgg tgaaggacct
5461 gcagcatcgt ctagatgagg ccgagcagct ggcgctgaag ggtgggaaga agcagatcca
5521 gaaactggag gccagggtac gtgagcttga aggagaggtt gaaaatgaac agaaacgtaa
5581 tgcagaggct gttaaaggtt tacggaaaca tgaccgacga gtaaaagaac tcacctacca
5641 gactgaagaa gatcgcaaga atgttctcag gctgcaggac ttggtagata aattacaggc
5701 gaaggtgaaa tcatacaaga gacaagctga ggaggctgag gaacaatcca atgctaatct
5761 atctaaattc cgcaaactcc agcatgagct ggaggaggcc gaggaacggg ctgacattgc
5821 tgagtcccag gtcaacaaat tgcgagtgaa gagccgagag gttcacacaa aaatcagtgc
5881 agagtaaaca cacctgcctg atgctatcaa gaggctgaag aaaggcacaa aatgtgctat
5941 ttttggtcac ttgctttatg acgtttattt tcctgttaaa gctgaataaa taaaaactac
6001 agtaaatgta tacatt
//
LOCUS CDCMANABD 2155 bp ds-DNA BCT 10-AUG-1990
DEFINITION C.saccharolyticum beta-D-mannanase (manA) gene, complete cds.
ACCESSION M36063
KEYWORDS beta-D-mannanase.
SOURCE C.saccharolyticum DNA, clone lambda-NZP2.
ORGANISM Caldocellum saccharolyticum
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 2155)
AUTHORS Leuthi,E., Bhana Jasmat,N., Grayling,R.A., Love,D.R. and
Bergquist,P.L.
TITLE Cloning, sequence analysis and expression in Escherichia coli of a
gene coding for a beta-mannanase from the extremely thermophilic
bacteria Caldocellum saccharolyticum
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Luethi, 03-JUL-1990.
Author address: E.Luethi
University of Auckland
Department of Cellular and Molecular Biology
Private Bag
Auckland
NEW ZEALAND
email:SYSTEM@CELVAX.AUKUNI.AC.NZ
FEATURES from to/span description
pept 841 1881 beta-D-mannanase precursor (E.C. 3.2.11.78)
sigp 841 963 beta-D-mannanase signal peptide
matp 964 1878 beta-D-mannanase
pept < 1 685 ORF1 (AA at 2)
binding 818 830 manA ribosome binding site
BASE COUNT 721 a 334 c 590 g 510 t
ORIGIN
1 gggtagcaat acatggtttg gattccaggc atggtcgatg cagagggtag cggagtatta
61 ctatgtgaca ggagataaag atgcaggggc actgcttgag aagtgggtaa gctggataaa
121 gagtgtagtg aagttgaata gtgatggtac atttgcgata ccatcgacgc ttgattggag
181 tgggcagcca gacacatgga atgggacata tacaggtaat ccgaacttgc atgtgaaggt
241 agtagattat gggacggatt taggaataac ggcatcactt gcgaatgcac tactttatta
301 cagtgcaggg acgaagaagt atggggtatt tgatgaggaa gcgaagaatt tagcgaagga
361 attgctggac aggatgtgga agttatacag ggatgagaaa ggtttatcgg cgccagagaa
421 gagagcggac tacaagaggt tctttgagca agaggtatac attccggcag gatggacagg
481 gaagatgccg aatggagatg taataaagag cggagttaag tttatagaca taaggagcaa
541 gtacaaacaa gatcctgatt ggccgaagtt agaggcggca tacaagtcag ggcaggtacc
601 ggagttcaga tatcacaggt tctgggcaca gtgtgacata gcaattgtta atgcaacata
661 tgaaattctg ttcggtaatc aataatgagt aggtaaatgg aaatttagcg gggtggcaca
721 tctataagtt tggtgtgctg cctcgctaaa atcctgtatg gaagtgttcg aaaaatagta
781 caaaaaaatg gcgaggtaaa atggcaaaaa tgcataaaaa agagggggtt aggttttaag
841 atgagactaa aaacaaaaat aagaaagaaa tggttgagtg tactatgtac agttgttttt
901 ttattgaaca ttttgtttat agcaaatgta acgattttac ccaaagttgg tgcggctaca
961 tctaatgatg gagtagtgaa gatagatacg agtacgttaa taggaaccaa tcacgcacat
1021 tgctggtaca gagatagact tgatacggca ttgcgtggaa ttaggtcatg gggtatgaac
1081 tctgtgaggg tagtgctgag taatggttac cgatggacga agataccagc aagtgaagtg
1141 gcaaatatta tatcattgtc aagaagtctt ggtttcaaag ctattatatt agaagtgcac
1201 gacacaacag gatatggaga agatggggca gcatgttcat tggcacaagc agtggaatat
1261 tggaaggaga taaagagcgt attagacggt aacgaagatt ttgtaattat aaacattggt
1321 aatgagccgt atgggaacaa taactatcaa aactgggtta atgacacgaa gaacgctata
1381 aaagcactta gagatgcagg attcaagcac acgataatgg tggatgcgcc gaactggggt
1441 caggattggt ctaatactat gagagataat gcccagagca taatggaagc agatccgctg
1501 cgcaatttgg tattttcgat tcatatgtat ggcgtataca atacagcaag caaggtcgaa
1561 gagtacatca aatcatttgt tgataagggg ttaccattgg ttattggaga atttggacat
1621 cagcacacag atggtgaccc tgatgaagaa gctattgtca ggtatgcaaa acagtacaag
1681 ataggattat ttagttggtc gtggtgtgga aattcgagct atgttgggta tttggacatg
1741 gtaaacaact gggaccccaa taatccaact ccatggggac aatggtataa aactaatgca
1801 attggtacat cttctacacc aacacctacg tcgacagtaa cgccaacgcc acccccacgc
1861 caacaccaac accgacagta acagcgacac caacaccaac acctacacct gttagcacac
1921 ctgcgacaag tgggcagata aaggtactgt atgctaacaa ggagacaaac agcactacaa
1981 acacgataag gccgtggttg aaggtagtga atagtggtag cagtagcata gatttgagca
2041 gggtaacgat aaggtactgg tacacggtag atggtgagag ggcacagact ggatatcaga
2101 ctgggcacag ataggagcaa gcaatgtcaa cattcaagtt tgtgaagctg agagt
//
LOCUS MUSMLHRCA 1431 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Mouse lymph node homing receptor mRNA, complete cds.
ACCESSION M36005
KEYWORDS lymph node homing receptor.
SOURCE Mouse cell line EL-4/MEL-14hi, cDNA to mRNA, clone Ly-22+(C3H).
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1431)
AUTHORS Siegelman,M.H., Cheng,I.C., Weissman,I.L. and Wakeland,E.K.
TITLE The mouse lymph node homing receptor is identical with the
lymphocyte cell surface marker Ly-22: Role of the EGF domain in
endothelial binding
JOURNAL Cell 61, 611-622 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 52 1170 lymph node homing receptor precursor
sigp 52 165 lymph node homing receptor signal peptide
matp 166 1167 lymph node homing receptor
BASE COUNT 412 a 323 c 362 g 334 t 0 others
ORIGIN Chromosome 1.
1 ggtggaggag gctgaggctg cagagagact tgcagagaga cccagcaagc catggtgttt
61 ccatggagat gtgagggtac ttactggggc tcgaggaaca tcctgaagct gtgggtctgg
121 acactgctct gttgtgactt cctgatacac catggaactc actgttggac ttaccattat
181 tctgaaaagc ccatgaactg ggaaaatgct agaaagttct gcaagcaaaa ttacacagat
241 ttagtcgcca tacaaaacaa gagagaaatt gagtatttag agaatacatt gcccaaaagc
301 ccttattact actggatagg aatcaggaaa attgggaaaa tgtggacatg ggtgggaacc
361 aacaaaactc tcactaaaga agcagagaac tggggtgctg gggagcccaa caacaagaag
421 tccaaggagg actgtgtgga gatctatatc aagagggaac gagactctgg gaaatggaac
481 gatgacgcct gtcacaaacg aaaggcagct ctctgctaca cagcctcttg ccagccaggg
541 tcttgcaatg gccgtggaga atgtgtggaa actatcaaca atcacacgtg catctgtgat
601 gcagggtatt acgggcccca gtgtcagtat gtggtccagt gtgagccttt ggaggcccct
661 gagttgggta ccatggactg catccacccc ttgggaaact tcagcttcca gtccaagtgt
721 gctttcaact gttctgaggg aagagagcta cttgggactg cagaaacaca gtgtggagca
781 tctggaaact ggtcatctcc agagccaatc tgccaagtgg tccagtgtga gcctttggag
841 gcccctgagt tgggtaccat ggactgcatc caccccttgg gaaacttcag cttccagtcc
901 aagtgtgctt tcaactgttc tgagggaaga gagctacttg ggactgcaga aacacagtgt
961 ggagcatctg gaaactggtc atctccagag ccaatctgcc aagagacaaa cagaagtttc
1021 tcaaagatca aagaaggtga ctacaacccc ctcttcattc ctgtagccgt catggtcacc
1081 gcattctcgg ggctggcatt tctcatttgg ctggcaaggc ggttaaaaaa aggcaagaaa
1141 tctcaagaaa ggatggatga tccatactga ttcatccttt gtgaaaggaa agccatgaag
1201 tgctaaagac aaaacattgg aaaataacgt caagtcctcc cgtgaagatt ttacacgcag
1261 gcatctccca cattagagat gcagtgtttg ctcaacgaat ctggaaggat ttcttcatga
1321 ccaacagctc ctcctaattt cccctcgctc attcatccca ttaaccctat cccataatgt
1381 gtgtctatac agagtagtat tttatcatct tttctgtgga ggaacaagca a
//
LOCUS MUSMLHRCB 1431 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Mouse lymph node homing receptor mRNA, complete cds.
ACCESSION M36058
KEYWORDS lymph node homing receptor.
SOURCE Mouse cell line EL-4/MEL-14hi, cDNA to mRNA, clone Ly-22-(SJL).
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1431)
AUTHORS Siegelman,M.H., Cheng,I.C., Weissman,I.L. and Wakeland,E.K.
TITLE The mouse lymph node homing receptor is identical with the
lymphocyte cell surface marker Ly-22: Role of the EGF domain in
endothelial binding
JOURNAL Cell 61, 611-622 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 52 1170 lymph node homing receptor
sigp 52 165 lymph node homing receptor signal peptide
matp 166 1167 lymph node homing receptor
BASE COUNT 412 a 322 c 362 g 335 t 0 others
ORIGIN Chromosome 1.
1 ggtggaggag gctgaggctg cagagagact tgcagagaga cccagcaagc catggtgttt
61 ccatggagat gtgagggtac ttactggggc tcgaggaaca tcctgaagct gtgggtctgg
121 acactgctct gttgtgactt cctgatacac catggaactc actgttggac ttaccattat
181 tctgaaaagc ccatgaactg ggaaaatgct agaaagttct gcaagcaaaa ttacacagat
241 ttagtcgcca tacaaaacaa gagagaaatt gagtatttag agaatacatt gcccaaaagc
301 ccttattact actggatagg aatcaggaaa attgggaaaa tgtggacatg ggtgggaacc
361 aacaaaactc tcactaaaga agcagagaac tggggtgctg gggagcccaa caacaagaag
421 tccaaggagg actgtgtgga gatctatatc aagagggaac gagactctgg gaaatggaac
481 gatgacgcct gtcacaaacg aaaggcagct ctctgctaca cagcctcttg ccagccaggg
541 tcttgcaatg gccgtggaga atgtgtggaa actatcaaca atcacacgtg catctgtgat
601 gcagggtatt acgggcccca gtgtcagtat gtggtccagt gtgagccttt ggaggcccct
661 gagttgggta ccatggactg catccacccc ttgggaaact tcagcttcca gtccaagtgt
721 gctttcaact gttctgaggg aagagagcta cttgggactg cagaaacaca gtgtggagca
781 tctggaaact ggtcatctcc agagccaatc tgccaagtgg tccagtgtga gcctttggag
841 gcccctgagt tgggtaccat ggactgcatc caccccttgg gaaacttcag cttccagtcc
901 aagtgtgctt tcaactgttc tgagggaaga gagctacttg ggactgcaga aacacagtgt
961 ggagcatctg gaaactggtc atctccagag ccaatctgcc aagagacaaa cagaagtttc
1021 tcaaagatca aagaaggtga ctataacccc ctcttcattc ctgtagccgt catggtcacc
1081 gcattctcgg ggctggcatt tctcatttgg ctggcaaggc ggttaaaaaa aggcaagaaa
1141 tctcaagaaa ggatggatga tccatactga ttcatccttt gtgaaaggaa agccatgaag
1201 tgctaaagac aaaacattgg aaaataacgt caagtcctcc cgtgaagatt ttacacgcag
1261 gcatctccca cattagagat gcagtgtttg ctcaacgaat ctggaaggat ttcttcatga
1321 ccaacagctc ctcctaattt cccctcgctc attcatccca ttaaccctat cccataatgt
1381 gtgtctatac agagtagtat tttatcatct tttctgtgga ggaacaagca a
//
LOCUS RATMHRT1B 562 bp ds-DNA ROD 10-AUG-1990
DEFINITION Rat MHC class II A-beta RT1.B-b-beta gene, partial cds.
ACCESSION M36151
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Rat (strain BUF-(RT-b)) DNA.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 562)
AUTHORS Figueroa,F., Guenther,E. and Klein,J.
TITLE MHC polymorphism pre-dating speciation
JOURNAL Nature 335, 265-267 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 562 MHC A-beta RT1.B-b-beta cell surface
glycoprotein (AA at 1)
BASE COUNT 122 a 171 c 173 g 96 t
ORIGIN
1 agagactccc caagggattt cgtgtaccag ttcgagggcc agtgctacta caccaccggg
61 acgcagcgca tgcggctcgt gaccagacac atctacaacc gggaggagta cgtgcgcttc
121 gacagcgacc tgggcgagta ccgcgcgctg accgagctgg ggcggccctc agccgagtac
181 tggaataagc agtacctcga gcagacgcgg gccgagctgg acagggtctg cagatacaac
241 tacgaggggc cgggggctct cacctccctg agacggcttg agcagcccaa tgtggccatc
301 tccctgtcca ggacagaggc ccttaaccac cacaacctgc tggtctgctc agtgacagat
361 ttctacccag cccagatcaa agtgcgctgg ttccggaatg gccaggagga gacgacgggg
421 gtcgtgtcca cacagcttat taggaatggg gactggacct tccagatcct ggtcatgctg
481 gagatcacgc ctcagcgggg agatgtgtac acctgccatg ttgaccaccc cagccttcag
541 agccctgtca cagtggagtg gc
//
LOCUS WHTMTTGPAA 1743 bp ds-DNA ORG 10-AUG-1990
DEFINITION Wheat mitochondrial Pro-tRNA gene.
ACCESSION M36018
KEYWORDS transfer RNA-Pro.
SOURCE Wheat mitochondrial DNA, clone H-P2.
ORGANISM Mitochondrion Triticum aestivum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Triticum aestivum.
REFERENCE 1 (bases 1 to 1743)
AUTHORS Joyce,P.B.M., Spencer,D.F. and Gray,M.W.
TITLE Multiple sequence rearrangements accompanying the duplication of a
Pro-tRNA gene in wheat mitochondrial DNA
JOURNAL Plant Mol. Biol. 11, 833-843 (1988)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 831 905 Pro-tRNA
BASE COUNT 483 a 396 c 414 g 450 t
ORIGIN
1 aagcttcatg aaggaagggt ccaaggacct aagcgacaac ctctacgaaa aagaaaaacc
61 tatatataga tagtggtatg tgttcgctga cactaaatag acttggctcg gggcagaact
121 gagtggctag gctaagttcc ctagtcttct attggcctac ccacccctac gacatggagg
181 cgcttaccct tctccaacaa gcgttccacc acgcggcgat tgattaccgc tttccgtctc
241 aatcagttca gaccaagccc cttctctgat cgggggaagg tcattctaaa acatcagatc
301 tcccattgat gctgatcaaa aataagttta tccacctaga atatatgata ggtaccgaaa
361 gattggatgt catttctaaa caccaggcgg gacgagcagc cctataccac gtgtagccac
421 actctagtgt ccttttctac ttagttggac agatcacttc agaaaatcgt ataaaaatca
481 agcaagaaaa cggatgcgct aacgcgcaac ggctttcgcg ctagttgctc aaaaaatcgt
541 ataaaaatca agcaagaaaa aggttctggc aggctgcgtg ggactgtaaa tcctctttcg
601 ctggctgggc cctttggact cgaaatccaa acggagtgag tggttcgatt ccactctcag
661 aacgagactg aacgaaataa aatgcaagtg aaacgagacg agaatcaaat tgtaggcttc
721 tttcctaaaa gcggtggttc tcgcctcccc gtgcccaaag cggggtgggc gacacagcgc
781 tgcggttctt ttctttatcg atcgggtaga tccatatgtt ctgaggggga cgaggtgtag
841 cgcagtctgg tcagcgcatc tgttttgggt acagagggcc ataggttcga atcctgtcac
901 cttgatgtgg tattcacaca atggggccga agtgcaaagc cccgcagcct atccgtggtc
961 gggaaggcag gggtattaaa agtgaaactt ggcttgcgtg tgcgcgcaca caaaggagac
1021 tgagttggat tgcaaagcct ctcagacgga aggaacctgt tttaaaaacc cctttctaca
1081 atcttgagaa atctcacgca agcaactaca cgccgcaagt cttcggtatt gtcgcaagaa
1141 ttgaaaagat gactccgctt tttctttcga gcatgcaagg cctcagtaga gcgagtctgt
1201 gcatgagtag cgcagtttac aaactagtga atggtcaaat gaataggctc gtttcatcag
1261 gggtaaccaa ggagtaagaa tgatttgata aagtaaacat tccctttgag tgcggttcct
1321 tcagaacaga ctcttccctt ttcagattgc ttctttgggc cacttgagca tcgactgggc
1381 aaggttggat tccctctact tcggggtggc atctatctca catgttttac attgatccat
1441 agccctccgt tagataggaa cagctttgct aaatactgag aactctcgaa tagaatatga
1501 gaagggaaag atcctttaca taaggaacta ttgatcatag gcgtacactc tgaaaagaat
1561 tgaatgaaac tctctgtttc gtcactttga tatcattagc tgtctctgta gctggctctc
1621 atgtctggtg ttccaagtaa aaacttttca actagctggg tcaccgctgt tttaaaccct
1681 gacttggaat cagagccata aacgagtgat tcggtcggac ggacgattga gttcttaaag
1741 ctt
//
LOCUS YSCRSDSBA 76 bp ss-tRNA PLN 10-AUG-1990
DEFINITION S.cerevisiae Lys-tRNA.
ACCESSION M36150
KEYWORDS transfer RNA-Lys.
SOURCE S.cerevisiae tRNA, clone alpha-S288c.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 76)
AUTHORS Madison,J.T., Boguslawski,S.J. and Teetor,G.H.
TITLE Nucleotide sequence of a lysine transfer ribonucleic acid from
bakers' yeast
JOURNAL Science 176, 687-689 (1972)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 76 Lys-tRNA
anticdn 34 36 anticodon stt
modified 1 1 p
modified 9 9 m2a
modified 16 16 d
modified 17 17 d
modified 20 20 d
modified 26 26 m22g
modified 27 27 p
modified 28 28 p
modified 37 37 t6a
modified 46 46 m7g
modified 47 47 d
modified 48 48 m5c
modified 55 55 p
modified 58 58 m1a
modified 67 67 p
BASE COUNT 15 a 18 c 22 g 16 t 5 others
ORIGIN
1 tccttgttag ctcagddggd agagcgttcg gctsttaacc gaaatgdcag gggttcgagc
61 cccctatgag gagcca
//
LOCUS TNCPOLY 3660 bp ss-RNA VRL 10-AUG-1990
DEFINITION Tobacco necrosis virus RNA dependent RNA polymerase and coat
protein genes, complete cds.
ACCESSION M33002
KEYWORDS RNA dependent RNA polymerase; coat protein; polymerase.
SOURCE Tobacco necrosis virus (strain A, isolate TNV-A-FM1B) Wk6 cell
line, cDNA to viral RNA, clones pFM2[0,1,2,3].
ORGANISM Tobacco necrosis virus
Viridae; ss-RNA nonenveloped viruses.
REFERENCE 1 (bases 1 to 3660)
AUTHORS Meulewaeter,F., Seurinck,J. and van Emmelo,J.
TITLE Genome structure of tobacco necrosis virus strain A
JOURNAL Virology 177, 699-709 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Meulewaeter, 20-MAR-1990, for release after publication.
FEATURES from to/span description
pept 60 665 RNA dependent RNA polymerase (put.)
/transl_except=(666..668)
669 2234 RNA dependent RNA polymerase
pept 2218 2436 ORF 3
pept 2440 2610 ORF 4
pept 2613 3443 coat protein
pept 3467 3646 ORF 6
site 666 668 readthrough codon
BASE COUNT 1034 a 890 c 895 g 841 t
ORIGIN
1 agtattcata ccaagaatac caaataggtg caaggcctta ctcagctaaa gagtctaaaa
61 tggagctacc aaaccaacac aagcaaacgg ccgccgaggg tttcgtatct ttcctaaact
121 ggctatgcaa cccatggaga cgacagcgaa cagtcaacgc tgcagttgcg ttccaaaaag
181 atcttctcgc cattgaggat tccgagcatt tggatgacat caatgagtgt ttcgaggagt
241 ctgctggggc acaatctcag cgaactaagg ttgtcgccga cggagcatat gcccccgcaa
301 aatccaacag gacccgccga gttcgtaagc agaagaagca caagtttgta aaatatcttg
361 tcaacgaagc tcgtgccgag tttggattgc ccaaaccaac tgaggcaaac agacttatgg
421 tccaacattt cttgctcaga gtgtgcaagg attggggcgt tgttactgcc cacgtacacg
481 gcaatgttgc actagctttg ccactggtgt tcatcccaac ggaagatgat ctgctatcac
541 gagcattgat gaacacacat gctactagag ccgctgtacg aggcatggac aatgtccaag
601 gggaggggtg gtggaacaat aggttgggga ttgggggcca ggtcggactg gccttccggt
661 ccaaataggg gtgccttgaa aggaggccag gattctccac gtccgtttcg cgtggggaac
721 atcctgatct ggtggtcata ccatcagggc gccctgagaa acagcgtcag ttgttacgct
781 atagtggtat aggcggccat ttattaatcg gcatccacaa caactctctt tccaacctgc
841 gtaggggctt gatggaaaga gtattctatg tcgaggggcc caatgggctt caagacgccc
901 ctaagcccgt caagggagct tttcgaaccc ttgataagtt tcgtgatctc tatactaaaa
961 atagttggcg tcatacccct gtaactagtg aacaattcct aatgaattac acgggcagga
1021 aactgactat ttacagagag gcggttgata gtttgtcgca tcaacccctt agctcacgag
1081 atgcgaaact aaagacattc gtgaaggccg aaaaattaaa tctttctaag aagcctgacc
1141 ctgctcccag ggtcatccaa cctagatcgc ctcggtataa cgtttgtttg ggcaggtacc
1201 tccgacatta tgagcatcac gcgtttaaaa ccattgccaa gtgctttggg gaaatcacgg
1261 tcttcaaagg gtttactctg gagcaacaag gggaaatcat gcgctcgaag tggaataaat
1321 atgttaatcc cgtcgcagtc ggactcgacg ccagtcgttt cgaccaacac gtgtctgttg
1381 aagcactcga gtatgagcat gaattttacc tcagagacta cccaaatgat aaacagctaa
1441 aatggctgct aaagcagcaa ttgtgcaacg taggaacggc attcgccagt gacggcatta
1501 taaaatacaa gaagaagggt tgtagaatga gcggagacat gaacacgagc ttgggcaact
1561 gcattctaat gtgcgccatg gtctacgggt tgaaagaaca cttaaacatc aatttgtccc
1621 ttgcaaataa tggggatgac tgcgtcattg tctgtgagaa agcggattta aagaaattga
1681 caagcagcat cgagccatat ttcaagcagt ttggattcaa gatggaagtg gaaaaacccg
1741 tggatatatt tgagcgcata gaattttgcc aaacccaacc tgtgttcgat ggatcccagt
1801 acatcatggt acgcaaacct tctgtggtaa catctaaaga cgtcactagc cttatcccat
1861 gtcaaacgaa agcacaatac gcagaatggc tgcaagctgt aggtgagtgt ggcatgagca
1921 ttaacggtgg gattcctgtc atgcagaatt tctaccaaaa gctccaaact ggcatccgcc
1981 gcacaaaatt caccaagacc ggcgagttcc agacgaacgg attggggtat cactctagat
2041 atatgcatag agtggcccgg gttccttcgc ctgaaacccg tttatccttc tatctagctt
2101 tcggtatcac accagacctc caagaagcat tggagatctt ctatgatacc cacaagcttg
2161 agttggatga tgttatccca actgatacct accaagtgtc aggagagcat ttgatcaatg
2221 gattaccaaa ctgatgtaac ggaggacaat gtgcaaatac gcggtcgggc taggagcgtt
2281 gagggtaaga aacacaatgg ttcgggatta actggcgtta agcgtcacgc ggtgagcgaa
2341 acatctcaga aatcacagca aggtactggc aatggaacta tgaccaatat agccgaagaa
2401 cagaccatta ccgtgacata caactttaac ttttaagtta tggctgcgtg tcgctgttgt
2461 gatacttcac caggtattac actattccct tactttgcaa ttctcatcct tatattggca
2521 atacttgttg tagggactcc caatcaacaa tatcaccatt ctccaagcac ttacgagtac
2581 aagactcaac acatttcgat cgcaaaatag acatggcagg aaagaagaac aacaacaacg
2641 gtcagtatat aatactgcgt actccagagc aacaggtgga gatagaccag cgcaacgccc
2701 gtcgtgctca aatgggtcgc atgaagaagg ctagacagcc cgttcagcga tacttacagc
2761 aacacgggtt gcgaaacgga ttgtccggta gagggggcta catagtggct cccacctccg
2821 ggggggttgt cactcgaccc atagtgccga aattctccaa caggggagat tccactatag
2881 tccgtaacac tgagattttg aacaaccaaa tcttagcggc gctaggcgca ttcaatacaa
2941 caaactccgc actgattgca gcagcaccat catggctggc tagcatcgct gatctttaca
3001 gtaaatacag atggctctca tgtgagatca tctacattcc aaaatgcccc accaccacca
3061 gtggatcaat tgccatggct ttcacatacg acagaaatga cgctgcaccc accgcaaggg
3121 ctcagctgtc acaatcttac aaggccatca attttccacc gtatgcggga tacgacggag
3181 cagcatattt gaattcgaac cagggagctg ggtcagccat cgccgttcaa cttgatgtta
3241 ccaagttgga caagccatgg taccccacta tctcctctgc cggcttcggg gcgctcagcg
3301 tcctcgatca gaaccaattc tgccccgcgt cccttgtggt cgctagcgat gggggacccg
3361 ctactgctac tccagcaggg gaccttttca tcaagtacgt gattgagttc attgaaccaa
3421 tcaacccaac aatgaacgtc tagttctttg tactgtaact tggctaatgc ctaaggtgga
3481 gtcacaccat tggagacgga gacggatcct gggaaacagg cttgacgggc ggggggtggt
3541 gcccccgacg acgcatcact ccggatacca atggtacacc actatggcag ggtctgccaa
3601 ggtcttgtgc accaagaacc cctggaaacg ggggggaggg gggtagcaca tatcatccag
//
LOCUS BMORGA1 120 bp ds-DNA INV 10-AUG-1990
DEFINITION B.mori 5.8S ribosomal RNA upstream region.
ACCESSION M36169
KEYWORDS 5.8S ribosomal RNA; ribosomal RNA.
SEGMENT 1 of 2
SOURCE B.mori DNA.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 120)
AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and
Ishikawa,H.
TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and
its presumed ancestor
JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987)
STANDARD simple staff_review
BASE COUNT 34 a 15 c 20 g 51 t
ORIGIN
1 cgaaacaaaa actcacagac gttttagttt tcttaacgtg tgtgtgcgtg cgttgtacca
61 ccgtgtttta tatgtttatt attatatatt ttgagattcg aatattataa tatttatgta
//
LOCUS BMORGA2 354 bp ds-DNA INV 10-AUG-1990
DEFINITION B.mori 5.8S ribosomal RNA.
ACCESSION M36302
KEYWORDS 5.8S ribosomal RNA; ribosomal RNA.
SEGMENT 2 of 2
SOURCE B.mori DNA.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and
Ishikawa,H.
TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and
its presumed ancestor
JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 139 305 5.8S rRNA
BASE COUNT 103 a 74 c 74 g 103 t
ORIGIN About 50 bp after segment 1.
1 gctcgtgtct ctctcgtcgt cgttgacggt ttcgtaaatt aacacaaaat acattttatt
61 ttatttattg tttttttttc gacaagaaaa aaaaaaaaga aattttttat ttttatttaa
121 acattacaga aaaaacgaaa atcattaccc tggacggtgg atcacttggc tcgcgggtcg
181 atgaagaacg cagttaactg cgcgtcatag tgtgaactgc aggacacatt tgaacatcga
241 catttcgaac gcacattgcg gtccgtggag acacatccag gaccactcct gtctgagggc
301 cggctgcata aaaaacatta atcacattgc gtggctcgtg gcgcgctcgt gatc
//
LOCUS BMORGB1 120 bp ds-DNA INV 10-AUG-1990
DEFINITION B.mandarina 5.8S ribosomal RNA upstream region.
ACCESSION M36170
KEYWORDS 5.8S ribosomal RNA; ribosomal RNA.
SEGMENT 1 of 2
SOURCE B.mandarina DNA.
ORGANISM Bombyx mandarina
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 120)
AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and
Ishikawa,H.
TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and
its presumed ancestor
JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987)
STANDARD simple staff_review
BASE COUNT 34 a 15 c 20 g 51 t
ORIGIN
1 cgaaacaaaa actcacagac gttttagttt tcttaacgtg tgtgtgcgtg cgttgtacca
61 ccgtgtttta tatgtttatt attatatatt ttgagattcg aatattataa tatttatgta
//
LOCUS BMORGB2 138 bp ds-DNA INV 10-AUG-1990
DEFINITION B.mandarina 5.8S ribosomal RNA.
ACCESSION M36303
KEYWORDS 5.8S ribosomal RNA; ribosomal RNA.
SEGMENT 2 of 2
SOURCE B.mandarina DNA.
ORGANISM Bombyx mandarina
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 138)
AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and
Ishikawa,H.
TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and
its presumed ancestor
JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 89 5.8S rRNA
BASE COUNT 35 a 39 c 34 g 30 t
ORIGIN About 50 bp after segment 1.
1 ctgcaggaca catttgaaca tcgacatttc gaacgcacat tgcggtccgt ggagacacat
61 ccaagaccac tcctgtctga gggccggctg cataaaaaac attaatcaca ttgcgtggct
121 cgtggcgcgc tcgtgatc
//
LOCUS BOVINTERA 1003 bp ss-mRNA MAM 10-AUG-1990
DEFINITION Bovine interleukin-1 alpha mRNA, complete cds.
ACCESSION M36182
KEYWORDS interleukin-1 alpha.
SOURCE Bovine macrophage, cDNA to mRNA.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1003)
AUTHORS Leong,S.R., Flaggs,G.M., Lawman,M. and Gray,P.W.
TITLE The nucleotide sequence for the cDNA of bovine interleukin-1 alpha
JOURNAL Nucleic Acids Res. 16, 9053-9053 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 33 839 interleukin-1 alpha precursor
sigp 33 368 interleukin-1 alpha signal peptide
matp 369 836 interleukin-1 alpha
mRNA < 1 1003 interleukin-1 alpha mRNA
BASE COUNT 313 a 218 c 215 g 257 t
ORIGIN
1 tgttgctagc tcggttcagc aaagaagtga agatggccaa agtccctgac ctctttgaag
61 acctgaagaa ctgttacagt gaaaatgaag actacagttc tgaaattgac cacctctctc
121 tcaatcagaa gtccttctat gatgcaagct atgagccact tcgtgaggac cagatgaata
181 agtttatgtc cctggatacc tcggaaacct ctaagacatc caagcttagc ttcaaggaga
241 atgtggtgat ggtggcagcc agtgggaaga ttctgaagaa gagacggttg agtttaaatc
301 agttcatcac cgatgatgac ctggaagcca ttgccaataa tacagaagaa gaaatcatca
361 agcccagatc agcacattac agcttccaga gtaacgtgaa atacaacttt atgagagtca
421 tccaccagga atgcatcctg aacgacgccc tcaatcaaag tataattcga gatatgtcag
481 gtccatacct gacggctact acattaaata atctggagga ggcagtgaaa tttgacatgg
541 ttgcttatgt atcagaagag gattctcagc ttcctgtgac tctaagaatc tcaaaaactc
601 aactgtttgt gagtgctcaa aatgaagacg aacccgtctt gctaaaggag atgcctgaga
661 cacccaaaat catcaaagat gagaccaacc tcctcttctt ctgggaaaag catggctcta
721 tggactactt caaatcagtt gcccatccaa agttgtttat tgccacaaag caagaaaaat
781 tggtgcacat ggcaagtggg ccgccctcga tcactgactt tcagatattg gaaaaatagc
841 cttgactgtg cactctactt acttgtaaag tggtgaccat ccgtatgtac tatgtacatg
901 aaggagtcga gcccttcact gttagtcact cgctgagcat gtgctgagct tttgtaattc
961 taaatgaatg tttactctct ttgtaagaga gaacacaaag tcc
//
LOCUS CHKCALMA 450 bp ss-mRNA VRT 10-AUG-1990
DEFINITION Chicken calmodulin mRNA, complete cds.
ACCESSION M36167
KEYWORDS calmodulin.
SOURCE Chicken, cDNA to mRNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 450)
AUTHORS Iida,Y.
TITLE cDNA sequences and molecular evolution of calmodulin genes of
chicken and eel
JOURNAL Bull. Chem. Soc. Jpn. 57, 2667-2668 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 450 calmodulin
BASE COUNT 160 a 62 c 122 g 106 t
ORIGIN
1 atggctgatc aactgacaga agagcagatt gcagaattca aagaagcttt ttcactattt
61 gacaaggatg gtgatggtac tataactaca aaggagttgg ggactgtgat gagatcactt
121 ggtcagaacc ccacagaagc agaattacag gacatgatca atgaagtaga cgctgatggc
181 aatggcacaa ttgacttccc agagtttctg acaatgatgg caagaaaaat gaaagataca
241 gatagcgaag aagaaattag agaagcgttc cgtgtgtttg acaaggatgg taatggttac
301 attagtgctg cagaacttcg tcatgtgatg acaaatcttg gggagaagct aacagatgaa
361 gaagttgatg aaatgattag ggaagcagac attgatggtg atggtcaagt aaactatgaa
421 gagtttgtac agatgatgac agcgaagtga
//
LOCUS FSBCALMA 450 bp ss-mRNA VRT 10-AUG-1990
DEFINITION E.electricus calmodulin mRNA, complete cds.
ACCESSION M36168
KEYWORDS calmodulin.
SOURCE E.electricus, cDNA to mRNA.
ORGANISM Electrophorus electricus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes;
Actinopterygii; Cypriniformes; Characoidei; Electrophoridae.
REFERENCE 1 (bases 1 to 450)
AUTHORS Iida,Y.
TITLE cDNA sequences and molecular evolution of calmodulin genes of
chicken and eel
JOURNAL Bull. Chem. Soc. Jpn. 57, 2667-2668 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 450 calmodulin
BASE COUNT 133 a 96 c 137 g 84 t
ORIGIN
1 atggcagatc agctgactga ggaacagatt gctgagttca aggaggcgtt ttccctcttt
61 gacaaagatg gtgacggcac catcaccacc aaagagctgg gtactgtgat gcgctctctg
121 ggccaaaacc ccaccgaggc agagctgcag gacatgatca atgaagtgga tgctgacggc
181 aatggaacaa tagacttccc ggagttcctg accatgatgg ccaagaaaat gaaggacaca
241 gacagtgaag aagagatccg agaagccttc cgagtttttg acaaggacgg taatggctac
301 atcagtgcag ccgagttgcg acatgtcatg actaacttgg gcgagaagct gacggacgag
361 gaggtggatg agatgatccg agaggccgac atcgatggcg acggccaggt gaactatgaa
421 gagttcgtgc aaatgatgac tgcaaagtga
//
LOCUS HALTRL1L 88 bp ss-tRNA RNA 10-AUG-1990
DEFINITION H.volcanii Leu-tRNA-1.
ACCESSION M35741
KEYWORDS transfer RNA; transfer RNA-Leu.
SOURCE H.volcanii tRNA.
ORGANISM Halobacterium volcanii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 88)
AUTHORS Gupta,R.
TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
and three serine tRNAs
JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 88 Leu-tRNA
anticdn 37 39 Leu-tRNA anticodon gag
modified 29 29 m22g
modified 40 40 m1g
modified 42 42 m5c
modified 59 59 m5c
modified 65 65 m1g
modified 66 66 p
modified 67 67 p
modified 68 68 cm
modified 69 69 m1i
BASE COUNT 16 a 29 c 29 g 13 t 1 others
ORIGIN 5' end of mature tRNA.
1 gcgtgggtag ccaanccagg ccaacggcgc agcgttgagg gcgctgtcct gtagaggtcc
61 gccggttcaa atccggtccc acgcacca
//
LOCUS HALTRL2L 88 bp ss-tRNA RNA 10-AUG-1990
DEFINITION H.volcanii Leu-tRNA-2.
ACCESSION M35742
KEYWORDS transfer RNA; transfer RNA-Leu.
SOURCE H.volcanii tRNA.
ORGANISM Halobacterium volcanii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 88)
AUTHORS Gupta,R.
TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
and three serine tRNAs
JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 88 Leu-tRNA
anticdn 37 39 Leu-tRNA anticodon cag
modified 29 29 m22g
modified 40 40 m1g
modified 60 60 m5c
modified 65 65 m1g
modified 66 66 p
modified 67 67 p
modified 68 68 cm
modified 69 69 m1i
BASE COUNT 18 a 28 c 25 g 16 t 1 others
ORIGIN 5' end of mature tRNA.
1 gcagggatag ccaantctgg ccaacggcgc agcgttcagg gcgctgtctc ataggagtcc
61 gcaggttcaa atcctgctcc ctgcacca
//
LOCUS HALTRL3L 87 bp ss-tRNA RNA 10-AUG-1990
DEFINITION H.volcanii Leu-tRNA-3.
ACCESSION M35743
KEYWORDS transfer RNA; transfer RNA-Leu.
SOURCE H.volcanii tRNA.
ORGANISM Halobacterium volcanii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 87)
AUTHORS Gupta,R.
TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
and three serine tRNAs
JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 87 Leu-tRNA
anticdn 36 38 Leu-tRNA anticodon caa
modified 28 28 m1i
modified 39 39 m1g
modified 41 41 p
modified 59 59 m5c
modified 64 64 m1g
modified 65 65 p
modified 66 66 p
modified 67 67 cm
modified 68 68 m1i
BASE COUNT 20 a 27 c 25 g 14 t 1 others
ORIGIN 5' end of mature tRNA.
1 gcgagggtag ctaantcagg aaaaagcggc ggactcaaga tccgctcccg taggggtccg
61 tgggttcaaa tccctcccct cgcacca
//
LOCUS HALTRL4L 86 bp ss-tRNA RNA 10-AUG-1990
DEFINITION H.volcanii Leu-tRNA-4.
ACCESSION M35744
KEYWORDS transfer RNA; transfer RNA-Leu.
SOURCE H.volcanii tRNA.
ORGANISM Halobacterium volcanii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 86)
AUTHORS Gupta,R.
TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
and three serine tRNAs
JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 86 Leu-tRNA
anticdn 35 37 Leu-tRNA anticodon tag
modified 27 27 m22g
modified 38 38 m1g
modified 48 48 m5g
modified 53 53 m1g
modified 54 54 p
modified 55 55 p
modified 56 56 cm
BASE COUNT 16 a 27 c 28 g 14 t 1 others
ORIGIN 5' end of mature tRNA.
1 gcgcgggtag ccaantggcc aaaggcgcag cgcttaggac gctgtggtgt agaccttcgc
61 aggttcgaac cctgtcccgc gcacca
//
LOCUS HALTRL5L 88 bp ss-tRNA RNA 10-AUG-1990
DEFINITION H.volcanii Leu-tRNA-5.
ACCESSION M35745
KEYWORDS transfer RNA; transfer RNA-Leu.
SOURCE H.volcanii tRNA.
ORGANISM Halobacterium volcanii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 88)
AUTHORS Gupta,R.
TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
and three serine tRNAs
JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 88 Leu-tRNA
anticdn 37 39 Leu-tRNA anticodon taa
modified 29 29 m2g
modified 40 40 m1g
modified 42 42 p
modified 65 65 m1g
modified 66 66 p
modified 67 67 p
modified 68 68 cm
BASE COUNT 16 a 28 c 29 g 14 t 1 others
ORIGIN 5' end of mature tRNA.
1 gcgggggtgg ctganccagg ccaaaagcgg cggacttaag atccgctccc gtaggggttc
61 gcgagttcga atctcgtccc ccgcacca
//
LOCUS HALTRS1S 88 bp ss-tRNA RNA 10-AUG-1990
DEFINITION H.volcanii Ser-tRNA-1.
ACCESSION M35746
KEYWORDS transfer RNA; transfer RNA-Ser.
SOURCE H.volcanii tRNA.
ORGANISM Halobacterium volcanii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 88)
AUTHORS Gupta,R.
TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
and three serine tRNAs
JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 88 Ser-tRNA
anticdn 37 39 Ser-tRNA anticodon gct
modified 29 29 m22g
modified 40 40 t6a
modified 60 60 m5c
modified 61 61 m5c
modified 65 65 m1g
modified 66 66 p
modified 67 67 p
modified 68 68 cm
BASE COUNT 15 a 31 c 27 g 14 t 1 others
ORIGIN 5' end of mature tRNA.
1 gttgcggtag ccaancctgg cccaaggcgc tgggttgcta actcagtggc gtcaagcccc
61 cggggttcga atccccgccg caacgcca
//
LOCUS HALTRS2S 88 bp ss-tRNA RNA 10-AUG-1990
DEFINITION H.volcanii Ser-tRNA-2.
ACCESSION M35747
KEYWORDS transfer RNA; transfer RNA-Ser.
SOURCE H.volcanii tRNA.
ORGANISM Halobacterium volcanii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 88)
AUTHORS Gupta,R.
TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
and three serine tRNAs
JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 88 Ser-tRNA
anticdn 36 38 Ser-tRNA anticodon cga
modified 13 13 p
modified 28 28 m22g
modified 36 36 ac4c
modified 41 41 p
modified 60 60 m5c
modified 65 65 m1g
modified 66 66 p
modified 67 67 p
modified 68 68 cm
modified 69 69 m1i
BASE COUNT 20 a 28 c 22 g 17 t 1 others
ORIGIN 5' end of mature tRNA.
1 gccgaggtag cctancccgg ccaaggcggt agattcgaaa tctactgtcc attcggacac
61 gtgagttcaa atctcaccct cggcgcca
//
LOCUS HALTRS3S 84 bp ss-tRNA RNA 10-AUG-1990
DEFINITION H.volcanii Ser-tRNA-3.
ACCESSION M35748
KEYWORDS transfer RNA; transfer RNA-Ser.
SOURCE H.volcanii tRNA.
ORGANISM Halobacterium volcanii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Halobacteriales; Halobacteriaceae.
REFERENCE 1 (bases 1 to 84)
AUTHORS Gupta,R.
TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
and three serine tRNAs
JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 84 Ser-tRNA
anticdn 33 35 Ser-tRNA anticodon gga
modified 25 25 m22g
modified 56 56 m5c
modified 61 61 m1g
modified 62 62 p
modified 63 63 p
modified 64 64 cm
modified 65 65 m1i
BASE COUNT 14 a 26 c 28 g 15 t 1 others
ORIGIN 5' end of mature tRNA.
1 gccaggatgg ccgancggta aggcgcacgc ctggaaagcg tgttccctct gggatcgggg
61 gttcaaatcc ctctcctggc gcca
//
LOCUS HUMIGHVY 920 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human Ig germline H-chain gene D5-region.
ACCESSION M35647
KEYWORDS diversity exon; germline; immunoglobulin heavy chain.
SOURCE Human placental DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 920)
AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T.
TITLE Human immunoglobulin D segments: Isolation of a new D segment and
polymorphic deletion of the D-1 segment
JOURNAL Immunol. Lett. 17, 329-334 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept / 604 / 633 Ig H-chain D-region (AA at 604)
signal 456 462 heptamer recombination signal
signal 488 497 nonamer recombination signal
signal 575 583 nonamer recombination signal
signal 596 602 heptamer recombination signal
signal 634 640 heptamer recombination signal
signal 654 662 nonamer recombination signal
BASE COUNT 256 a 269 c 255 g 140 t
ORIGIN
1 ggatccaggc agcaacatca cgggagaatg cagggctccc agacatccca accctcttgc
61 aggcctctcc tgggaagaga ctgcatccac caccaaacag catggagccc gctgcatagt
121 aactgagtca gtgactgacc tggagggcag gggagcagtg agccggagcc cagaccatag
181 ggaccgagac cagccgctga catcccgagc ccctcactgg tggccccaga aaatatggaa
241 acaggacgga cccacattcc cagctggacc agggcggaga ctgctgagcc ccgagcacca
301 gccccaagaa acaccaggca aaatcatcag aggaggctcc tgagaaagag aggaggggag
361 gtctccctca ccagcaaatg cttcccttaa ccaaacacag ggtccacgca actcccccca
421 ggacaaagga ggagccccct gtacagcact gggctcagag tcctctccaa gacaccctga
481 gtttcagaca aaaaccccct ggaaatcaca gtatcagcag gagaactggc cagagacagc
541 aaggggggac tccgtgacac ccgtggggac aggaggattt tgtgggggct tgtgtcactg
601 tgagaatatt gtaatagtac tactttctat gcccacagtg acacagcccc agtcccaaag
661 ccctgctgta aatgcttcca cttctggagc tgagggggag cgtctgggaa gtagggccta
721 gggtgtccac caatgccaaa acacaccaga cttcccccag acatcacccc caccagccag
781 cgagcagagt aaacagaaaa tgagaagcaa gctgggaact gctggcacag gccccaagga
841 aagagctggg gttgtgcaag agaagatgca ggcagagcta gcagggcctt ttactgtttc
901 tgctttctgg cagagaattc
//
LOCUS HUMIGHVZ 688 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human Ig germline H-chain gene D4-region.
ACCESSION M35648
KEYWORDS diversity exon; germline; immunoglobulin heavy chain.
SOURCE Human peripheral blood cell DNA, clone D31.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 688)
AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T.
TITLE Human immunoglobulin D segments: Isolation of a new D segment and
polymorphic deletion of the D-1 segment
JOURNAL Immunol. Lett. 17, 329-334 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept / 70 / 100 Ig H-chain D5-region (AA at 71)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
signal 42 50 nonamer recombination signal
signal 63 69 heptamer recombination signal
signal 101 107 heptamer recombination signal
signal 121 129 nonamer recombination signal
BASE COUNT 185 a 160 c 209 g 134 t
ORIGIN
1 tcagcaggag aactagccag agacagcaag aggggacagg aggattttgt gggggctcgt
61 gtcactgtgt ggatattgta gtagtaccag ctgctatgcc cacagtgaca cagccccatt
121 cccaaagccc tgctgtaaac gcttccactt gctggagctg aggggctggg gggagcgtct
181 gggaagtagg cctaggtggc catcaatgcc caaaacgcac cagactcccc cccagacatc
241 accccactgg ccagtgagca gagtaaacag aaaatgagaa gcagctggga agcttccaca
301 ggcccaagga aagagctttg gcgggtgtgt aagaggggat gcgggcagag cgagcagggc
361 cttttgctgt ttctgctttc ctgtgcagag agttcagcag ctggtgttca agatcgatgg
421 ctgggagtga gccgagacgg acagtgtggg aagggcacag ggaaggagaa gcagccgcta
481 tcctacactg tcatctttca agagtttgcc ctgtgcccac aatgctgcat catgggatcg
541 ttaacactga tgtagacaca gctaaagaga gaatcagtga atggatttgc agcacagatc
601 tgaataaatt ctcagaatgt ggagccacac agaagcaagc acacggaaag tggcctgatc
661 gaatttcaag tacagtggca cttcagct
//
LOCUS HUMIGHWA 690 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human Ig germline H-chain gene D4-region.
ACCESSION M35649
KEYWORDS diversity exon; germline; immunoglobulin heavy chain.
SOURCE Human peripheral blood cell DNA, deletion clone D30.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 690)
AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Hongo,T.
TITLE Human immunoglobulin D segments: Isolation of a new D segment and
polymorphic deletion of the D-1 segment
JOURNAL Immunol. Lett. 17, 329-334 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept / 70 / 100 Ig H-chain D5-region (AA at 71)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
signal 42 50 nonamer recombination signal
signal 62 68 heptamer recombination signal
signal 101 107 heptamer recombination signal
signal 121 129 nonamer recombination signal
BASE COUNT 186 a 160 c 212 g 132 t
ORIGIN
1 tcagcaggag aactagccag agacagcaag aggggacagg aggattttgt gggggctcgt
61 gtcactgtgt ggatattgta gtagtaccag ctgctatgcc cacagtgaca cagccccatt
121 cccaaagccc tgctgtaaac gcttccactt gctggagctg aggggctggg gggagcgtct
181 gggaagtagg cctaggtggc catcaatgcc caaaacgcac cagactcccc cccagacatc
241 accccactgg ccagtgagca gagtaaacag aaaatgagaa gcagctggga agcttccaca
301 ggcccaagga aagagctttg gcgggtgtgt aagaggggat gcgggcagag cgagcagggc
361 cttttgctgt ttctgctttc ctgtgcagag agttcagcag ctggtgttcg agatcaatgg
421 ctgggagtga gccgagcagg acagcgtggg aagagcacag ggaaggagga gcagccgcta
481 tcctacactg tcatctttcg aaagtttgcc ttgtgcccac actgctgcat catgggatcg
541 ttaacactga tgtagacaca gctaaagaga gaatcatgga atggatttgc agcacagatc
601 tgaataaatt ctcagaatgt ggagcagcac agaagcaagc acacagaaag tggcctgatg
661 caaggacaag ttcagtggca ctagcatgct
//
LOCUS HUMIGHWB 681 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human Ig germline H-chain gene D1-region.
ACCESSION M35650
KEYWORDS diversity exon; germline; immunoglobulin heavy chain.
SOURCE Human peripheral blood cell DNA, clone D31.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 681)
AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T.
TITLE Human immunoglobulin D segments: Isolation of a new D segment and
polymorphic deletion of the D-1 segment
JOURNAL Immunol. Lett. 17, 329-334 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept / 69 / 99 Ig H-chain D5-region (AA at 70)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
signal 41 49 nonamer recombination signal
signal 61 67 heptamer recombination signal
signal 100 106 heptamer recombination signal
signal 120 128 nonamer recombination signal
BASE COUNT 187 a 156 c 203 g 135 t
ORIGIN
1 ccagcaagat cgggctccag tgacaccgag acaggacgga ggattttgtg ggggctcgtg
61 tcactgtgag gatattgtac taatggtgta tgctataccc acagtgacac agccccattc
121 ccaaagccct actgcaaaga ttccacttgc tgggctgagg ggctggggag cgtctgggaa
181 atagggctag ggaccagcaa tgcccaaaac gcagattccg tcccatacat caccccattc
241 accagtcagc agagtaaaca gaaaatgaga agcagctggg aagcttccac aggcccaagg
301 aaagagcttt ggcgggtgtg taagagggga tgcgggcaga gcgagcaggg ccttttgctg
361 tttctgcttt cctgtgcaga gagttcagca gctggtgttc gagatcaatg gctgggagtg
421 agccgagcag gacagcgtgg gaagagcaca gggaaggagg agcagccgct atcctacact
481 gtcatctttc gaaagtttgc cttgtgccca cactgctgca tcatgggatc gttaacactg
541 atgtagacac agctaaagag agaatcatgg aatggatttg cagcacagat ctgaataaat
601 tctcagaatg tggagcagca cagaagcaag cacacagaaa gtggcctgat gcaaggacaa
661 gttcagtggc actagcatgc t
//
LOCUS HUMTCAGVR 508 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human T-cell receptor germline gamma-chain gene V4-region
(V4; subgroup I), exons 1 and 2.
ACCESSION M36285
KEYWORDS T-cell antigen receptor; T-cell receptor gamma-chain; germline;
variable region.
SOURCE Human EBV-transformed lymphoblastoid cell line DNA, clone 601.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 508)
AUTHORS Font,M.P., Chen,Z., Bories,J.C., Duparc,N., Loiseau,P., Degos,L.,
Cann,H., Cohen,D., Dausset,J. and Sigaux,F.
TITLE The V-gamma locus of the human T cell receptor gamma gene
JOURNAL J. Exp. Med. 168, 1383-1394 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 43 T-cell receptor gamma-chain (V4), exon 1
/hgml_locus_uid="LE0226F" /nomgen="TCRGV4"
/map="7p15"
159 > 508 T-cell receptor gamma-chain (V4), exon 2
IVS 44 158 T-cell receptor gamma-chain (V4) intron A
BASE COUNT 128 a 119 c 122 g 139 t
ORIGIN
1 atgcagtggg ccctagcggt gcttctagct ttcctgtctc ctggtgagtg cgctgcctac
61 agagaggatc atgggttttg ttttctttat tttcttcttt tgcaaggatt gccatactaa
121 ggaattcctc attatatttt gtgttgttcc cattgcagcc agtcagaaat cttccaactt
181 ggaagggaga acgaagtcag tcatcaggca gactgggtca tctgctgaaa tcacttgtga
241 tcttgctgaa ggaagtaccg gctacatcca ctggtaccta caccaggagg ggaaggcccc
301 acagcgtctt ctgtactatg actcctacac ctccagcgtt gtgttggaat caggaatcag
361 cccagggaag tatgatactt acggaagcac aaggaagaac ttgagaatga tactgcgaaa
421 tcttattgaa aatgactctg gagtctatta ctgtgccacc tgggatgggc acagtgattc
481 agatccgccc tacaccacac tgaaaacc
//
LOCUS HUMTCAGVS 509 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human T-cell receptor germline gamma-chain gene V5-region
(V5; subgroup I), exons 1 and 2.
ACCESSION M36286
KEYWORDS T-cell antigen receptor; T-cell receptor gamma-chain; germline;
variable region.
SOURCE Human EBV-transformed lymphoblastoid cell line DNA, clone 601.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 509)
AUTHORS Font,M.P., Chen,Z., Bories,J.C., Duparc,N., Loiseau,P., Degos,L.,
Cann,H., Cohen,D., Dausset,J. and Sigaux,F.
TITLE The V-gamma locus of the human T cell receptor gamma gene
JOURNAL J. Exp. Med. 168, 1383-1394 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 43 T-cell receptor gamma-chain (V5), exon 1
/hgml_locus_uid="LF0226G" /nomgen="TCRGV5"
/map="7p15"
160 > 508 T-cell receptor gamma-chain (V5), exon 2
IVS 44 159 T-cell receptor gamma-chain (V5) intron A
BASE COUNT 131 a 117 c 122 g 139 t
ORIGIN
1 atgcggtggg ccctactggt gcttctagct ttcctgtctc ctggtgagta cgctgcctac
61 agagaggctc acaggttggg ttttgttttg tttacttctt ttgaaagggg tgcgatacaa
121 aggaatacct cattaaattt tgtgttgttc ccattgcagc cagtcagaaa tcttccaact
181 tggaaggggg aacgaagtca gtcacgaggc cgactaggtc atctgctgaa atcacttgtg
241 accttactgt aataaatgcc ttctacatcc actggtacct acaccaggag gggaaggccc
301 cacagcgtct tctgtactat gacgtctcca actcaaagga tgtgttggaa tcaggactca
361 gtccaggaaa gtattatact catacaccca ggaggtggag ctggatattg atactacgaa
421 atctaattga aaatgattct ggggtctatt actgtgccac ctgggacagg cacagtgatt
481 cagacctgtc ctacaccaca ctgaaaatc
//
LOCUS MUSNM23A 664 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Mouse tumor metastatic process-associated protein (NM23) gene mRNA,
3' end.
ACCESSION M35970
KEYWORDS tumor metastatic process-associated protein.
SOURCE Mouse cell line K-1735, cDNA to mRNA, clone pNM23.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 664)
AUTHORS Steeg,P.S., Bevilacqua,G., Kopper,L., Thorgeirsson,U.P.,
Talmadge,J.E., Liotta,L.A. and Sobel,M.E.
TITLE Evidence for a novel gene associated with low tumor metastatic
potential
JOURNAL J Natl Cancer Inst 80, 200-204 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 501 tumor metastatic process-associated protein
NM23 (AA at 1)
mRNA < 1 664 NM23 mRNA
BASE COUNT 150 a 154 c 201 g 159 t
ORIGIN
1 cagtcgcagc cggcggtaaa gccttgtcat ctgaagggga ccatggccaa cagtgagcgt
61 accttcattg ccatcaagcc tgatggggtc cagcgggggc tggtgggcga gatcatcaag
121 cggttcgagc agaaggggtt ccgccttgtt ggtctgaagt ttctgcaggc ttcagaggac
181 cttctcaagg agcactacac tgacctgaag gaccgcccct tctttactgg cctggtgaaa
241 tacatgcact caggaccagt ggttgctatg gtctgggagg gtctgaatgt ggtgaagaca
301 ggccgcgtga tgcttggaga gaccaacccc gcagactcta agcctgggac catacgagga
361 gacttctgca ttcaagttgg caggaacatc attcatggca gcgattctgt aaagagcgca
421 gagaaggaga tcagcttgtg gtttcagcct gaggagctgg tggagtacaa gagctgtgcg
481 cagaactgga tctatgagtg ataggacggt gccggttttc tacctgctta ctcttgttct
541 cacaggcagg ggaccagcaa ccctagatat ttctggaact tctttgacct ggaaggaacc
601 tttgggagcc gtgactccct gtgcagtgtt acgtgccact gttagattaa agtgtttaat
661 ctgt
//
LOCUS SYNHGHOA 582 bp ds-DNA SYN 10-AUG-1990
DEFINITION Synthetic human growth hormone gene, complete cds.
ACCESSION M36282
KEYWORDS growth hormone.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 582)
AUTHORS Ohtsuka,E., Taniyama,Y., Iwai,S., Kitano,K., Miyamoto,S., Ohgi,T.,
Sakuragawa,Y., Fujiyama,K., Ikari,T., Kobayashi,M., Miyake,T.,
Shibahara,S., Tokunaga,T., Ikehara,M., Ono,A., Ueda,T., Tanaka,T.,
Baba,H., Miki,T., Sakurai,A. and Oishi,T.
TITLE The synthesis of human growth hormone gene
JOURNAL Nucleic Acids Symp Ser 12, 79-82 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 579 growth hormone
BASE COUNT 146 a 153 c 129 g 154 t
ORIGIN
1 atgttcccaa ctattccact gagtcgcctg ttcgataacg cgatgctgcg tgcgcatcgt
61 ctgcaccaac tggctttcga cacttaccag gagttcgaag aagcatacat cccgaaagaa
121 cagaaataca gcttccttca gaacccacag acctcgttgt gtttctctga aagtatcccg
181 accccttcta accgcgaaga gacccagcag aaatcgaacc ttgaactgct tcgtatctcg
241 ctgcttctca ttcagtcgtg gctggagcca gtacagttcc tgcgttcggt tttcgcaaac
301 tcactggtat acggtgcgtc tgacagtaac gtttacgacc tgctgaaaga ccttgaagaa
361 gggatccaga ccctgatggg tcgcctggaa gatggttcac cacgcactgg tcagatcttc
421 aaacagactt actccaaatt cgatactaac tctcataacg atgatgctct gctgaaaaac
481 tacggcctgc tgtactgttt ccgtaaagat atggataaag ttgaaacttt cctgcgtatc
541 gttcagtgtc gttctgttga agggtcgtgt ggcttctaat ag
//
LOCUS AC2TFDC 803 bp ds-DNA BCT 10-AUG-1990
DEFINITION Plasmid pAC27 (from Pseudomonas putida) pyrocatechase (tfdC)
gene, complete cds.
ACCESSION M36279
KEYWORDS pyrocatechase.
SOURCE Plasmid pAC27(from Pseudomonas putida, strain AC867) DNA.
ORGANISM Plasmid pAC27
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 803)
AUTHORS Ghosal,D. and You,I.-S.
TITLE Nucleotide homology and organization of chlorocatechol oxidation
genes of plasmids pJP4 and pAC27
JOURNAL Mol. Gen. Genet. 211, 113-120 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 21 803 pyrocatechase (tfdC)
BASE COUNT 174 a 240 c 250 g 139 t
ORIGIN
1 atcttgaaag gagacgagtc atggataaac gagttgccga ggtcgcaggc gcgatcgtcg
61 aggcagtacg caaaattttg ctggacaagc gcgtcacgga agccgaatac cgcgcgggtg
121 tcgactatct caccgaggtc gcacagacgc gggaaaccgc gctgcttctg gacgttttcc
181 tgaacagcac catcatcgaa ggcaaggcgc agcgctcgcg gacctctgcg cctgcgatcc
241 aggggccgta cttcctggaa ggtgctcctg tagttgaagg cgtcctcaag acctacgata
301 ccgacgacca caaaccgctg atcattcgcg gtacggtgcg ctcggacacg ggcgagttgc
361 tcgctggcgc tgtcatcgac gtgtggcact cgacgcctga tggcttgtac agcgggatcc
421 acgacaacat ccccgtggac tactaccgcg gaaaactcgt gacggattcc cagggcaact
481 atcgcgtgcg caccacgatg ccagtgccat accagatccc ctacgagggg ccgactgggc
541 gtctgctggg ccacctgggc agccatacct ggcgtccgcc gcacgtgcac ttcaaggtgc
601 gcaaggacgg tttcgaaccg ttgaccacgc aatactactt cgaagggggc aaatgggtgg
661 acgatgactg ctgtcacggc gtcacccccg acctgattac gcccgagacg atcgaggacg
721 gggtgcgggt catgaccctg gacttcgtaa tcgagcgtga gcaggccgag caacgcaagt
781 cggctacgga gacagtggca tga
//
LOCUS AFATFDC 788 bp ds-DNA BCT 10-AUG-1990
DEFINITION Plasmid pJP4 (from Alcaligenes eutrophus) pyrocatechase (tfdC)
gene, complete cds.
ACCESSION M36280
KEYWORDS pyrocatechase.
SOURCE Plasmid pJP4 (from Alcaligenes eutrophus, strain JMP134) DNA.
ORGANISM Plasmid pJP4
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 788)
AUTHORS Ghosal,D. and You,I.-S.
TITLE Nucleotide homology and organization of chlorocatechol oxidation
genes of plasmids pJP4 and pAC27
JOURNAL Mol. Gen. Genet. 211, 113-120 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 21 788 pyrocatechase (tfdC)
BASE COUNT 178 a 203 c 240 g 167 t
ORIGIN
1 gttttcatga cggaggcaaa gtgaacaaaa gagtcaagga tgttgtcgat gcgatcgtcg
61 cggcggtaca gcgggtgctc gaccagaagg aggttactga ggcggaatac cgaactgcgg
121 ttcattacct catgcaagta gccgaacagc gtgaaaccgc tcttctttgc gatgtttttt
181 tcaatagcac cgtggctgcg acgaaggctc gtattagcga aggttcaacg cctgctatcg
241 agggacccta ctatcgcgac gacgcccctc tggtcgatga ccggctcaag acttacgaca
301 cggacgacca caagcctttg cttatccagg gaacggtcaa agcggtcgac gggagcgttg
361 tcgaggacgt gacgattgat gtctggcatt cgacgcccga tgggaagtac agcggtttcc
421 atgacgacat cccgactgat ttttatcgag ggaagctcag ggtgggcacc gatggcagct
481 tccgcgtgcg cacaacgatg ccggtgccgt atcagatccc ggatcagggt cccacgggcg
541 cattgctcga aaccatgggt ggtcactcgt ggcgtcccgc tcatgtacat ttcaaggtga
601 aggcgccggg atatgaaacg ttgaccacgc agtactactt cgaaggtgga gattggatca
661 cggacgactg ttgcaacggc gtccagtcta gtctgattac tcccgatatt gtggaagagg
721 gtgtccgcct gatgaacatc aacttcgtta tcgagcccgc gcgtgcgcag gcgggggcaa
781 acccgtga
//
LOCUS ECOHSEII 530 bp ds-DNA BCT 10-AUG-1990
DEFINITION E.coli heat-stable enterotoxin II (STII) gene, complete cds.
ACCESSION M35729
KEYWORDS enterotoxin II.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 530)
AUTHORS Picken,R.N., Mazaitis,A.J., Maas,W.K., Rey,M. and Heyneker,H.
TITLE Nucleotide sequence of the gene for heat-stable enterotoxin II of
Escherichia coli
JOURNAL Infect. Immun. 42, 269-275 (1983)
STANDARD simple staff_review
FEATURES from to/span description
pept 166 381 enterotoxin II (STII)
BASE COUNT 188 a 86 c 88 g 168 t
ORIGIN
1 taaataccta caacgggtga ttgacactac actcattaac tatactgcaa gtagcattaa
61 aaatcttaat aaaggagagc ttcgtcacat ttttttgact tgactcatat aaaagcccac
121 tggtataagt tttattgctt atagcaataa ggttgaggtg attttatgaa aaagaatatc
181 gcatttcttc ttgcatctat gttcgttttt tctattgcta caaatgccta tgcatctaca
241 caatcaaata aaaaagatct gtgtgaacat tatagacaaa tagccaagga aagttgtaaa
301 aaaggttttt taggggttag agatggtact gctggagcat gctttggcgc ccaaataatg
361 gttgcagcaa aaggatgcta atatatttat caatagcatt cagcaccata tacacaaaaa
421 taatttttca taaaaagaac tctataaaat aaatattttt tgtgacaatg tcctaacgca
481 agacggacat tgtccatttc tcactgcagg taaatgatct gtaaatagtc
//
LOCUS RAT6P26B 1086 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Rat 6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase mRNA,
3' end.
ACCESSION M15685
KEYWORDS 6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase.
SOURCE Rat liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1086)
AUTHORS Colosia,A.D., Lively,M., El-Maghrabi,M.R. and Pilkis,S.J.
TITLE Isolation of a cDNA clone for rat liver 6-phosphofructo 2-
kinase/fructose 2,6-bisphosphatase
JOURNAL Biochem. Biophys. Res. Commun. 143, 1092-1098 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 915 6-phosphofructo 2-kinase/fructose
2,6-bisphosphatase (AA at 1)
BASE COUNT 290 a 279 c 264 g 253 t
ORIGIN
1 gcagaaaaca tcaagcaagt gaaacttggt agtcctgatt acatagactg tgaccaagaa
61 aaggttttgg aagactttct aaagagaata gagtgctatg agatcaacta ccaacctttg
121 gatgaggaat tggacagcca cctgtcctac atcaagatct tcgacgtggg cacacgctac
181 atggtaaatc gagtgcagga ccacgttcag agccgtacag cctactacct catgaacatc
241 catgtcacac ctcgatctat ctacctatgc cgccatggtg agagtgaact caaccttaga
301 ggccgcattg gaggtgactc tggcctctca gctcggggca agcagtatgc ctatgcacta
361 gccaacttca tccggtctca aggcatcagc tccctgaaag tatggactag ccacatgaag
421 aggaccattc agaccgctga agccctaggt gtcccctatg aacagtggaa ggccctgaat
481 gagattgatg cgggtgtctg tgaagagatg acctatgaag aaattcagga acactaccct
541 gaggaatttg cactacggga ccaggataaa tatcgttacc gctatccgaa gggagagtcc
601 tatgaggatc tggttcagcg tcttgaacca gttataatgg agctagaacg gcaagaaaat
661 gtactggtga tctgtcacca ggctgtcatg cggtgcctcc tggcatactt cctggataaa
721 agttcagatg agctgcccta tctcaagtgt cctctgcata ctgtgctcaa actcacacct
781 gtggcttatg gctgcagagt ggagtccatc tacctgaatg tggaggctgt gaacacacac
841 cgggacaagc ctgagaatgt ggacatcacc cgtgaagctg aggaagcctt ggacactgta
901 cctgcccatt actgagccct tttcaagtga tcagattgtc tctgttctca cccccttcct
961 cctgtaggag ctgctgccct tgttctcctt aagcagactc tggctatggc ctgagagtgc
1021 cctacctcca gtgaagaagt ccgtagcagc tcccaaacag gtctcaattc ctagccacaa
1081 ctaagg
//
LOCUS RATFABP 491 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Rat fatty liver acid binding protein (FABP) mRNA, complete cds.
ACCESSION M35991
KEYWORDS fatty acid binding protein.
SOURCE Rat adult liver, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 491)
AUTHORS Gordon,J.I. and Lowe,J.B.
TITLE Analyzing the structures, functions and evolution of two abundant
gastrointestinal fatty acid binding proteins with recombinant DNA
and computational techniques
JOURNAL Chem. Phys. Lipids 38, 137-158 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept 40 423 fatty acid binding protein (FABP)
mRNA < 1 491 fatty acid binding protein mRNA
BASE COUNT 161 a 95 c 128 g 107 t
ORIGIN
1 ctgttggtgg cagctgggaa aggaaacctc attgccacca tgaacttctc cggcaagtac
61 caagtgcaga gccaagagaa ctttgagccc ttcatgaagg cgatgggtct gcctgaggac
121 ctcatccaga aagggaagga catcaagggg gtgtcagaaa tcgtgcatga agggaagaaa
181 gtcaaactca ccatcaccta tgggtccaag gtgatccaca atgagttcac cttgggggag
241 gagtgcgaac tggagaccat gactggggaa aaggtcaagg cagtggttaa gatggagggt
301 gacaataaaa tggtgacaac tttcaaaggc ataaagtccg tgactgaatt caatggagac
361 acaatcacca ataccatgac actgggtgac atcgtctaca agagagtcag caagagaatt
421 tagacaaggc tgtatttcat attcttttac agtataaaat taatacaata aagttacctc
481 tgttttgaaa t
//
LOCUS RATFABPX 564 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Rat intestinal fatty acid binding protein (FABP) mRNA, complete
cds.
ACCESSION M35992
KEYWORDS fatty acid binding protein.
SOURCE Rat adult small intestinal epithelium, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 564)
AUTHORS Gordon,J.I. and Lowe,J.B.
TITLE Analyzing the structures, functions and evolution of two abundant
gastrointestinal fatty acid binding proteins with recombinant DNA
and computational techniques
JOURNAL Chem. Phys. Lipids 38, 137-158 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept 13 411 fatty acid binding protein (FABP)
BASE COUNT 179 a 108 c 146 g 131 t
ORIGIN
1 acagctgaca tcatggcatt tgatggcact tggaaagtag accggaatga gaactatgaa
61 aagttcatgg agaaaatggg cattaacgtg gtgaagagga agcttggagc tcatgacaac
121 ttgaaactga cgatcacaca ggaaggaaat aaattcacag tcaaagaatc aagcaacttc
181 cgaaacattg atgttgtgtt tgaactcggc gtcgactttg cctatagtct agcagatgga
241 acagaactca ctgggacctg gaccatggag ggaaataaac ttgttggaaa attcaaacgt
301 gtagacaatg gaaaggagct gattgctgtc cgagagattt ctggtaacga actaatccaa
361 acctacacat atgaaggagt ggaggccaag cggatcttta agaaggaata ggccagcttc
421 ttggagcctg gcgcaatgca gaagggctag cttggtgtcg gattcctttc tccatctcgc
481 tgataatgca ggctcactca ctgtgtcatg agcactgctt ccaagccttg tcagagccaa
541 agaagtaaaa gttgattagg actt
//
LOCUS BSTRR5SA 117 bp ds-DNA RNA 10-AUG-1990
DEFINITION B.stearothermophilus 5S RNA.
ACCESSION M25591
KEYWORDS 5S ribosomal RNA; ribosomal RNA.
SOURCE B.stearothermophilus RNA.
ORGANISM Bacillus stearothermophilus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 117)
AUTHORS Zimmermann,J. and Erdmann,V.A.
TITLE Binding sites of E.coli and B.stearothermophilus ribosomal proteins
on B.stearothermophilus 5S RNA
JOURNAL Nucleic Acids Res. 5, 2267-2288 (1978)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rRNA 1 117 5S ribosomal RNA
BASE COUNT 25 a 36 c 38 g 18 t
ORIGIN
1 cctagtggtg atagcggagg ggaaacaccc gttcccatcc cgaacacgga agttaagccc
61 tccagcgccg atggtagttg gggccagcgc ccctgcaaga gtaggccgct gctaggc
//
LOCUS CHKVIM1 2842 bp ds-DNA VRT 10-AUG-1990
DEFINITION Chicken vimentin gene, exons 1 and 2.
ACCESSION M15850 J02759 M25923
KEYWORDS intermediate filament; vimentin.
SEGMENT 1 of 3
SOURCE Chicken DNA, library of Dodgson.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 2842)
AUTHORS Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M.
TITLE The chicken vimentin gene: Nucleotide sequence, regulatory
elements, and comparison to the hamster gene
JOURNAL J. Biol. Chem. 262, 8112-8120 (1987)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Zehner,Z.E. and Paterson,B.M.
TITLE The chicken vimentin gene: Aspects of organization and
transcription during myogenesis
JOURNAL Ann. N.Y. Acad. Sci. 455, 79-94 (1985)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by B.A.Roe, 11-MAY-1987.
FEATURES from to/span description
pept 824 1368 vimentin, exon 1
1458 + 1518 vimentin, exon 2
pre-msg 737 > 2842 VIM mRNA
IVS 1369 1457 VIM cds intron A
IVS 1519 > 2842 VIM cds intron B
BASE COUNT 569 a 827 c 882 g 564 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttgagg atttctgtcc tgcctttgag gctctttggg gatcacttct aatgaccttt
61 ttgtaagagc tctttcccga gacagtgacc gtgtgcctac agaaactaaa agcaaacaga
121 ccctcggtta aaaccgctcc tgccgctccc ccgctgacca ggagcgctgt gcccgaagca
181 aagcgatgcc cctcctgcag ccgtgcgagt gcgcagcgcc gtccgcaatg agcgcaactc
241 cgtcggcacc gagcaggctg ggcgggcccc gggagctgcg ggagcgcgct gagcccatga
301 gcacaggacg ggaggctgag gggggccgtc ggaagcagct aaccccaaac ccaccgacgg
361 ttcacgtttc tatctgtttg ctttggacgt tcggtagccc ttcgtgaata gcatcactcc
421 cccgatctcg gtgcctttca gagagctaag aaagaaggga cggggctcgc acaaagttca
481 acgcatgcat ttttaagcac tcggcccgtt ctaggagcgc tggcgagcac cgcagcccgt
541 cccgccctcg gaatcactgc gctgtcgaag gggcgtaggg cgcaaggcac ctcggcagcc
601 gcccgggtcc ctccgccttt gtcccagcgc cgccaccgcc gccctccctc cgcccctccc
661 cgggcccgac gcggcattgg cgacggacgg ggcggggcgg cgcggcggcg ggcggggcgg
721 cggccccata agaggggccc cggggtgggg acgccgctct tcttcgccgc cgcgctccga
781 gcccgtcgct cccggattac aaagccgctc cgttcctcgc gccatgagct tcaccagcag
841 caagaactcc tcgtaccgcc gcatgttcgg cgggggcagc cggcccagca gcggcacccg
901 ctacatcacg tccagcaccc gctattccct gggcagcgcc ctgcggccca gcagcgcccg
961 ctacgtgtcc gcctcgcccg gcggcgtgta ccgcaccaag gcgacgtcgg tgcggctgcg
1021 gagcagcatg ccgcccatgc ggatgcacga cgccgtggac ttcaccctgg cggacgccat
1081 caacacggag ttcaaggcga accgcaccaa cgagaaggta gagctgcagg agctcaacga
1141 ccgcttcgcc aactacatcg acaaggtgcg cttcctggag cagcagaaca agatcctgct
1201 ggccgagctg gagcagctca agggcaaagg cacgtcccgc ttgggcgacc tgtacgagga
1261 ggagatgcgg gacgtgcggc gccaggtgga ccagctgacc aacgacaagg cccgcgtcga
1321 ggtggagcgc gacaacctgg ccgacgacat catgcgcctg cgggagaagt gagtggggcc
1381 gcgggccggg gccgggaagc gggcgggcac ccacccctcc ctcacctcac ctcacctcac
1441 cctgtcgctg cttgcaggtt gcaggaggag atgctgcagc gggaggaggc cgagagcacc
1501 ctgcagtcct tccgacaggt gggcgcggcg ccggggcggt gggggcggaa ggtcggggcg
1561 gggctgggct gggggctgct ctgcccgcac aaaggcgctc gcgcagcctc ttgttctccc
1621 cgttgttttc tttagggggg tgccgcagtc cgggggggag ggtgaggcta ttgtcccgct
1681 ttatccggtc tggccggctc cacaaaagcg ctccgacgtg gaatcgcttt ctgatccaat
1741 agttgtgaaa acaggccatc tgcgcgccgg gctgaagggt attaatggtt tctatgggat
1801 tcacagagga aatgcagatc caggcattat ggccggtgtg gtggcagcag attgaaataa
1861 tggatccctt tgtgtcggag gggagggtgc cgtgggctgc attcctacca aatgcgtaat
1921 ggtgggggca tattcctggg ggagactgtg tgaaatacag aactccttct ggatgagcca
1981 cgtcggtctg cgctctcaga agactgtccc acatttgaag tatttcctat cagtggtgtc
2041 tgccgatcgg aagcagacaa tgggctcctt gggatgagct cgcagcagct gtctgcgtcc
2101 ccttgacgtg cattaaaaat gtagaattta atcctatata agcaaagatg tgaatgtaag
2161 aaacagtgca ctggagatga gtgtgtagga agctgcggtg tcacacagat gaaccaaagc
2221 cgatgtgaca gccaaaacca actgtgaagc ctgtgaaaag ttgtgagctc aagttctgat
2281 ttgtgcaaaa acagctctcg gcttggattg agggctgttt ggactagcag atgtttagga
2341 atgtgccttc tcaaaacaag ataacttcct gaaaactgct ggtgtgaaac aacactgctg
2401 tcactcctct gagctggaac gagctgccct gtatctccag acagctgggc gcaggctgac
2461 cgtcccatag agctctccat aggcaatagg tgtcaccatg caggaagttg tcatttgaat
2521 tttagcattt tccctagcgg tcttagtgag acagtgtggg agcagacaca tggcagggtc
2581 tgggaatgag gtgtccactg acgttctggg ggctgaggct tgaaatagca gctccaggta
2641 ctgcctttaa cctgtgctgc tgtgttacca ggctgagccg atccaggttg ttgccaagtc
2701 agtgcagatg catttccctt tttttgtttt aaattactag agttcaagca caagctcctt
2761 tgacttaaag ctagtttaat catgcgtgct tctctggtgc gcttcaaggc atgtcttcct
2821 gtccctccaa ccgatgaagc tt
//
LOCUS CHKVIM2 685 bp ds-DNA VRT 10-AUG-1990
DEFINITION Chicken vimentin gene, exons 3 and 4.
ACCESSION M15851 J02759 M25923
KEYWORDS intermediate filament; vimentin.
SEGMENT 2 of 3
SOURCE Chicken DNA, library of J.B.Dodgson.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 685)
AUTHORS Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M.
TITLE The chicken vimentin gene: Nucleotide sequence, regulatory
elements, and comparison to the hamster gene
JOURNAL J. Biol. Chem. 262, 8112-8120 (1987)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Zehner,Z.E. and Paterson,B.M.
TITLE The chicken vimentin gene: Aspects of organization and
transcription during myogenesis
JOURNAL Ann. N.Y. Acad. Sci. 455, 79-94 (1985)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by by B.A.Roe, 11-MAY-1987.
FEATURES from to/span description
pept + 7 102 vimentin, exon 3
246 + 407 vimentin, exon 4
pre-msg < 1 > 685 VIM mRNA
IVS < 1 6 VIM cds intron B
IVS 103 245 VIM cds intron C
IVS 408 > 685 VIM cds intron D
BASE COUNT 189 a 142 c 158 g 196 t
ORIGIN About 3 kb after segment 1.
1 gatcaggatg ttgacaatgc ctctctggca ggccttgatc ttgagcgccc cgttgagtcc
61 ctgcaagaag aaattgtctt cttgaagaag cttcatgatg aggtaagctg aataatgagg
121 tctgtgtctg ggcaccgtca gtcaagtcaa caagtaaatt gtctctggtt ctggccccga
181 gtcctaacct gtgcattttc tgcatagtgt tgtttattaa ctccaagtgt ctacttctct
241 ctaaggaaat ccgggaactg caggctcaac tccaggaaca gcacatccaa atcgatatgg
301 atgtttctaa gcctgatctt actgctgccc tgcgcgatgt tcgtcaacaa tatgaaagcg
361 ttgctgctaa gaatcttcag gaagctgaag agtggtacaa gtccaaagta agtacaatgc
421 gttttaggat tatgcagatg cagtaggcat tcacatgaat tttaggtaac agctttgctg
481 tgttccttca gtactgaatg ttagcacagc cacagagtag ggtagtcttc aaactatcag
541 tcagggaaga atacagttgg aaaagtcatt tgtgccctaa gtgcagcatc agactaatgg
601 gctcatttcc tgttcttccc tattcttgta gctggaggca gttcttaagt atatacctaa
661 gaaaggaatg aacaaatgca ttaaa
//
LOCUS CHKVIM3 2857 bp ds-DNA VRT 10-AUG-1990
DEFINITION Chicken vimentin gene, exons 5-9.
ACCESSION M15852 J02759 M25922
KEYWORDS intermediate filament; vimentin.
SEGMENT 3 of 3
SOURCE Chicken DNA, library of J.B.Dodgson.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 2857)
AUTHORS Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M.
TITLE The chicken vimentin gene: Nucleotide sequence, regulatory
elements, and comparison to the hamster gene
JOURNAL J. Biol. Chem. 262, 8112-8120 (1987)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Zehner,Z.E. and Paterson,B.M.
TITLE The chicken vimentin gene: Aspects of organization and
transcription during myogenesis
JOURNAL Ann. N.Y. Acad. Sci. 455, 79-94 (1985)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by by B.A.Roe, 11-MAY-1987. Polyadenylation signals are located at
positions 2546-2551, 2594-2599, 2829-2834, and 2850-2855. Multiple
mRNAs are found for vimentin, each using a different
polyadenylation site.
FEATURES from to/span description
pept + 159 284 vimentin, exon 5
901 1121 vimentin, exon 6
1216 1259 vimentin, exon 7
1685 1770 vimentin, exon 8
2256 2297 vimentin, exon 9
pre-msg < 1 > 2297 VIM mRNA
IVS < 1 158 VIM cds intron D
IVS 285 900 VIM cds intron E
IVS 1122 1215 VIM cds intron F
IVS 1260 1684 VIM cds intron G
IVS 1771 2255 VIM cds intron H
BASE COUNT 816 a 536 c 611 g 892 t 2 others
ORIGIN About 0.5 kb after segment 2.
1 tcagatagga aaggtcagtg taacgttcat gtttaaatat ttcaagcgct agaacttccc
61 tgtaatggct tcagttgcct ccaaatcctg ggtctagaag tggtccatta caccattttt
121 gtgaagtgtt atgctaacca atctgggtgt ttccacagtt tgcagatctc tccgaagctg
181 ctaataggaa caatgatgcc ctgcgccagg ccaaacaaga agctaatgaa tatcgcagac
241 agattcagtc tctcacctgt gaagttgatg cccttaaagg aagtgtaagt agacgttctt
301 cctgattcag ctttttaagc actatgctgt agtttagggg atgtgtgtgg ctgcagctca
361 tctcccacag aagacagcag tgcacccatc cttcctgaaa aagactctct gggctccatt
421 tgtctgcatg tactttgtag tgcactgctc agattgctac actggactgg tctttctcaa
481 ggtgtaaaca atacagaatg agctatacga agtggaaaag tgcagatctt gcacttccta
541 gaatacagtt gcctgaaagg ctggcttgaa atggaagtaa ctgtatagta ctcctgtaga
601 ttccctcacc aagtactgtc tgcatgctga cttgtactgt atgtgtttgg cttactgtgg
661 gggtttgctt tgctgtctgg tagaataaat gtgaaacatc tctcatcaga caccagaata
721 acacacctgg actttgcaag gggaggataa ggacatctcc ttatggggac atatttgagc
781 atgttcagaa aggacactgt cattcaactt ccctttgcgt ttttttctta agttatgaag
841 ggtcttgaca ttagaaagag tggctgaaat gtagttaaat ttgactttat tctctcctag
901 aatgaatccc tggagcgcca gatgcgtgaa atggaggaga attttgctgt tgaagctgct
961 aactaccagg acactattgg ccgcctgcag gatgagattc agaacatgaa ggaagaaatg
1021 gctcgccatc ttcgtgagta ccaggacctg ctgaatgtaa agatggctct tgatattgag
1081 attgctacct acagaaaact gctggaggga gaagagagca ggtaaaaatc acatgcaggc
1141 acctaccttc tagcagttac agtttagaga ttccttacca ttgtctgact tgatctattt
1201 cttcttttcc ttcaggatta acatgcctat tccaaccttt gcttctttga acctgagagg
1261 taagctgttt catttgggcc tgtgtaagca gctgttttta atctttagca aaaaggtcat
1321 ggatagttcc ttttgctgta gtatttgaac tggtgtaaat gcagttgtat atgaaagtca
1381 ttcttacaca caactgccag tatttgcagt tacttaattt tggatgagag gtggtacttt
1441 gttcctgggt gtacacgtaa gggagtatca atacataaca agtgacttgg atttctgttg
1501 aagctgcttt aagaggaata agaagaaaat aagcaaagct agtttataat tcattatcat
1561 gcaaaagaga agaggacttc tttgtataca aaactgggat tctgaaagta gatactgaca
1621 ttcctcagtt ctttgaggag gcacttcatc tagactcttg gtaacaatac attttcttct
1681 gcagaaacca acattgagtc tcagccaatt gttgacactc actcgaagag gacacttcta
1741 attaagaccg tggaaactag agatggacag gttggtggtt actttgattt tttttcattc
1801 tatctcagct ggttactttg cccagtgctg tagtctgggc cactcattgg tgcagcctgt
1861 atatggcaga cagtgtctgg cagtccaagc tgcatggctg aatgtgctga tgtagcaaat
1921 gtacttctgt ggaatagacc tcaaaacaga tctgagaaca gcttgagtga ctgacagccc
1981 gtgtttagag gagactttag gttcagttga gttgcaagtc ctcaagattt ctcccattgc
2041 ataagaaaaa tgttgcagga gaagcgttta ttaggggcag gtagtgtatt tgtactaaat
2101 tgaagtttag atggttaagc gtgtgcagct accatttaaa cttctcattc ataggcaaat
2161 gctgatacct ttgttcacat ctgtgctact atttgtttca gttctaactc tttataacaa
2221 ctgtcttaag cttnnatgcc tattttccaa cacaggttat taatgaaact tcccagcatc
2281 acgatgactt ggagtaaagt gaagatgcaa acttaatgca ggagaaattc ttaccagcaa
2341 ggttttaaaa agttcatgtc ttaaaggaag aaacagcttt caagtgcctt tctccagttt
2401 tccatgagcg caagattatt atgctaggaa ataggtctta gatcttgcaa actgactctc
2461 cctgaaggat tagagtttac aatggagtct agtttacaaa tagcaatatc ttgtgctgca
2521 atactgtttt taagtatctg aatttaataa aactgctttt tccagcacag tatgagcaac
2581 ctgtcgctac ttcaataaat ctttggaaaa tggctcttga tgtgttctaa tttaacttca
2641 tgactttctg caaagccata acttaatgct ggaattacta tacggttgac aactccagta
2701 ctgattgtgt gaaatattgt tttcagatta actagacaaa ctgtcttccc atttactgct
2761 taggttttgg aaccaattaa aatggactat aactggcaga tgcataatgt attgatactt
2821 atcagttgaa taaaatgata cttcaagcta ataaaaa
//
LOCUS CHKVIMN 591 bp ds-DNA VRT 10-AUG-1990
DEFINITION chicken intermediate filament protein vimentin gene, 3' end.
ACCESSION X00185 J00915
KEYWORDS vimentin.
SOURCE Chicken DNA and cDNA to embryonic muscle mRNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 591)
AUTHORS Zehner,Z.E. and Paterson,B.M.
TITLE Characterization of the chicken vimentin gene: Single copy gene
producing multiple mRNAs
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 911-915 (1983)
STANDARD simple staff_review
REFERENCE 2 (bases 30 to 591)
AUTHORS Zehner,Z.E. and Paterson,B.M.
TITLE Vimentin gene expression during myogenesis: Two functional
transcripts from a single copy gene
JOURNAL Nucleic Acids Res. 11, 8317-8332 (1983)
STANDARD simple staff_review
COMMENT
FEATURES from to/span description
mRNA < 1 308 vimentin mRNA (alt.)
mRNA < 1 367 vimentin mRNA (alt.)
mRNA < 1 591 vimentin mRNA (alt.)
BASE COUNT 193 a 101 c 105 g 192 t
ORIGIN
1 cagcatcacg atgacttgga gtaaagctga agtgaagatg caaacttaat gcaggagaaa
61 ttcttaccag caacctttta aaaagttcat gtcttaaagg aagaaacagc tttcaagtgc
121 ctttctccag tttttccatg agcgcaagat tattatgcta ggaaataggt cttagatctt
181 gcaaactgac tctccctgaa ggattagagt ttacaatgga gtctagttta caaatagcaa
241 tatcttgtgc tgcaatactg tttttaagta tctgaattta ataaaactgc tttttccagc
301 acagtatgag caacctgtcg ctacttcaat aaatctttgg aaaatggctc ttgatgtgtt
361 ctaatttaac ttcatgactt tctggaaagc cataacttaa tgctggaatt actatacggt
421 tgacaactcc agtactgatt gtgtggaata ttgttttcag attaactaga caaactgtct
481 tcccatttac tgcttaggtt ttggaaccaa ttaaaatgga ctataactgg cagatgcata
541 atgtattgat acttatcagt tgaataaaat gatacttcaa gctaataaaa a
//
LOCUS ECORRD 1541 bp ss-rRNA RNA 10-AUG-1990
DEFINITION E.coli 16S ribosomal RNA.
ACCESSION J01859 M24828 M24911 K02555 M24996
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE Escherichia coli rRNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1541)
AUTHORS Ehresmann,C., Stiegler,P., Carbon,P. and Ebel,J.-P.
TITLE Recent progress in the determination of the primary sequence of the
16 S RNA of Escherichia coli
JOURNAL FEBS Lett. 84, 337-341 (1977)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1541)
AUTHORS Carbon,P., Ehresmann,C., Ehresmann,B. and Ebel,J.-P.
TITLE The sequence of Escherichia coli ribosomal 16 S RNA determined by
new rapid gel methods
JOURNAL FEBS Lett. 94, 152-156 (1978)
STANDARD full staff_review
REFERENCE 3 (bases 1 to 1541)
AUTHORS Carbon,P., Ehresmann,C., Ehresmann,B. and Ebel,J.-P.
TITLE The complete nucleotide sequence of the ribosomal 16-S RNA from
Escherichia coli: Experimental details and cistron heterogeneities
JOURNAL Eur. J. Biochem. 100, 399-410 (1979)
STANDARD full staff_review
REFERENCE 4 (bases 1346 to 1541)
AUTHORS Taylor,B.H., Prince,J.B., Ofengand,J. and Zimmermann,R.A.
TITLE Nonanucleotide sequence from 16S ribonucleic acid at the peptidyl
transfer ribonucleic acid binding site of the Escherichia coli
ribosome
JOURNAL Biochemistry 20, 7581-7588 (1981)
STANDARD full staff_review
REFERENCE 5 (bases 1442 to 1541)
AUTHORS Schnare,M.N. and Gray,M.W.
TITLE 3'-terminal sequence of wheat mitochondrial 18S ribosomal RNA:
Further evidence of a eubacterial evolutionary origin
JOURNAL Nucleic Acids Res. 10, 3921-3932 (1982)
STANDARD full staff_review
REFERENCE 6 (bases 1 to 1541)
AUTHORS Ehresmann,C., Stiegler,P., Fellner,P. and Ebel,J.-P.
TITLE The determination of the primary structure of the 16s ribosomal rna
of escherichia coli: III. Further studies
JOURNAL Biochimie 57, 711-748 (1975)
STANDARD simple staff_entry
REFERENCE 7 (bases 1 to 1541)
AUTHORS Ehresmann,C., Stiegler,P., Fellner,P. and Ebel,J.-P.
TITLE the determination of the primary structure of the 16S ribosomal RNA
of Escherichia coli: (2) nucleotide sequences of products from
partial enzymatic hydrolysis
JOURNAL Biochimie 54, 901-967 (1972)
STANDARD simple staff_entry
REFERENCE 8 (sites)
AUTHORS Zimmermann,R.A. and Singh-Bergmann,K.
TITLE Binding sites for ribosomal proteins S8 and S15 in the 16S RNA of
Escherichia coli
JOURNAL Biochim. Biophys. Acta 563, 422-431 (1979)
STANDARD simple staff_entry
REFERENCE 9 (bases 558 to 890)
AUTHORS Gregory,R.J., Zeller,M.L., Thurlow,D.L., Gourse,R.L., Stark,M.J.R.,
Dahlberg,A.E. and Zimmermann,R.A.
TITLE Interaction of ribosomal proteins S6, S8, S15 and S18 with the
central domain of 16S ribosomal RNA from Escherichia coli
JOURNAL J. Mol. Biol. 178, 287-302 (1984)
STANDARD simple staff_review
COMMENT [2], [3] contain updated sequence data for the original work by the
same laboratory [1]. There were too many discrepancies between [1]
and [2], [3] to list each revision in our sites table. The sequence
shown is from [3]. [1], [2], [3] point to a number of cistron
heterogeneities. There is uncertainty, however, with regard to
assigning these various heterogeneities to specific cistrons. The
RNA method used by [1], [2], [3] gives the average of all the
cistrons present in the cell [3]. The heterogeneities are
classified by their relative proportions into major, minor and
undetermined species. The sequence shown corresponds to the major
species. The heterogeneities were annotated as variations in the
sites table. It is not known which of the residues "c" (base 633)
or "a" (base 641) undergoes a deletion, giving rise to the minor
component "atctg". [3] suggests the existence of one or two mutated
cistrons among the known seven cistrons of ribosomal RNA.
With the exception of a single base deletion, this sequence is
identical to the current 16S rDNA sequence for the E.coli rRNB
gene.
FEATURES from to/span description
rRNA 1 1541 16S rRNA
variant 76 76 g in major species; c in undetermined species
variant 80 80 a in major species; c in undetermined species
variant 88 89 tc in major species; tgc in undetermined
species
variant 90 92 ttt in major species; substitution of one of
the t by c in undetermined species
variant 179 179 t in major species; c in minor species
variant 182 182 c in major species 1; t in major species 2
variant 193 194 ca in major species; cga in undetermined
species
variant 267 272 tcacct in major species; ccacct or tcaccc in
minor species
variant 283 285 ccc in major species; cc in minor species
conflict 348 349 ag in [3]; ga in [2]
modified 526 526 m7g = 7-methylguanosine
variant 630 631 ct in major species; cct in minor species
variant 632 634 gca in major species; ga in minor species (see
comment)
variant 640 642 tac in major species; tc in minor species (see
comment)
conflict 853 855 ttc in [3]; ctt in [2]
revision 915 916 tg in [3]; tcg in [1],[2] (typing error)
variant 965 965 g in undetermined species 1; 2-methylguanosine
in species 2
conflict 1027 1029 ctt in [3]; ttc in [2]
variant 1071 1072 gt in major species; gtt in undetermined
species
variant 1074 1074 t in major species; a in undetermined species
modified 1099 1099 m5c = 5-methylcytidine
modified 1206 1206 m2g = 2-methylguanosine
variant 1280 1280 c in major species; t in minor species
variant 1320 1322 tcg in major species; tg in minor species
modified 1401 1401 m4c = n4-methylcytidine
modified 1402 1402 cm = 2'-o-methylcytidine
variant 1405 1405 t in major species; c in minor species
variant 1406 1406 5-methylcytidine in major species; c in minor
species
conflict 1490 1493 gaag in [3]; gag in [4]
modified 1497 1497 um = 2'-o-methyluridine
modified 1517 1517 m66a = n6,n6-dimethyladenosine
modified 1518 1518 m66a = n6,n6-dimethyladenosine
BASE COUNT 389 a 352 c 487 g 313 t
ORIGIN 5' end of mature rRNA.
1 aaattgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa
61 gtcgaacggt aacaggaaga agcttgctct ttgctgacga gtggcggacg ggtgagtaat
121 gtctgggaaa ctgcctgatg gagggggata actactggaa acggtagcta ataccgcata
181 acgtcgcaag accaaagagg gggaccttcg ggcctcttgc catcggatgt gcccagatgg
241 gattagctag taggtggggt aacggctcac ctaggcgacg atccctagct ggtctgagag
301 gatgaccagc cacactggaa ctgagacacg gtccagactc ctacgggagg cagcagtggg
361 gaatattgca caatgggcgc aagcctgatg cagccatgcc gcgtgtatga agaaggcctt
421 cgggttgtaa agtactttca gcggggagga agggagtaaa gttaatacct ttgctcattg
481 acgttacccg cagaagaagc accggctaac tccgtgccag cagccgcggt aatacggagg
541 gtgcaagcgt taatcggaat tactgggcgt aaagcgcacg caggcggttt gttaagtcag
601 atgtgaaatc cccgggctca acctgggaac tgcatctgat actggcaagc ttgagtctcg
661 tagagggggg tagaattcca ggtgtagcgg tgaaatgcgt agagatctgg aggaataccg
721 gtggcgaagg cggccccctg gacgaagact gacgctcagg tgcgaaagcg tggggagcaa
781 acaggattag ataccctggt agtccacgcc gtaaacgatg tcgacttgga ggttgtgccc
841 ttgaggcgtg gcttccggag ctaacgcgtt aagtcgaccg cctggggagt acggccgcaa
901 ggttaaaact caaatgaatt gacgggggcc cgcacaagcg gtggagcatg tggtttaatt
961 cgatgcaacg cgaagaacct tacctggtct tgacatccac ggaagttttc agagatgaga
1021 atgtgccttc gggaaccgtg agacaggtgc tgcatggctg tcgtcagctc gtgttgtgaa
1081 atgttgggtt aagtcccgca acgagcgcaa cccttatcct ttgttgccag cggtccggcc
1141 gggaactcaa aggagactgc cagtgataaa ctggaggaag gtggggatga cgtcaagtca
1201 tcatggccct tacgaccagg gctacacacg tgctacaatg gcgcatacaa agagaagcga
1261 cctcgcgaga gcaagcggac ctcataaagt gcgtcgtagt ccggattgga gtctgcaact
1321 cgactccatg aagtcggaat cgctagtaat cgtggatcag aatgccacgg tgaatacgtt
1381 cccgggcctt gtacacaccg cccgtcacac catgggagtg ggttgcaaaa gaagtaggta
1441 gcttaacctt cgggagggcg cttaccactt tgtgattcat gactggggtg aagtcgtaac
1501 aaggtaaccg taggggaacc tgcggttgga tcacctcctt a
//
LOCUS HUMRSALPA 320 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alphoid repetitive DNA, subclone pRI-680-05, dimer 1.
ACCESSION M28029
KEYWORDS alphoid repetitive sequence; repetitive DNA.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 320)
AUTHORS Zaitsev,I.Z. and Rogaev,E.I.
TITLE Structural analysis of alphoid DNA of primates
JOURNAL Mol. Biol. 20, 530-538 (1986)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt 1 320 alphoid repeat, dimer 1
BASE COUNT 100 a 54 c 59 g 107 t
ORIGIN
1 tgatgtgtgt attcaacgca cagagttgaa cgttccttta gacagagtag attggaaaca
61 ctctttttgt agaattttca agtggaggta tcaagcgctt tgaggccaat gatagaaaag
121 gaaatacctt cgtataataa ttagacagaa tcattctcag aaaccgcttt gcaatgtgtg
181 cgttcaactc acagtgttta acctttcttt tcatacagtt gtttcgaaac actcttttta
241 cagaatctac aagtaaatat ttggacctct ttgaagtctt cgttagaaat gagatttctt
301 catataatac tagctagaag
//
LOCUS HUMRSALPB 314 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alphoid repetitive DNA, subclone pRI-680-05, dimer 2.
ACCESSION M28030
KEYWORDS alphoid repetitive sequence; repetitive DNA.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 314)
AUTHORS Zaitsev,I.Z. and Rogaev,E.I.
TITLE Structural analysis of alphoid DNA of primates
JOURNAL Mol. Biol. 20, 530-538 (1986)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt 1 314 alphoid repeat, dimer 2
BASE COUNT 89 a 51 c 65 g 109 t
ORIGIN
1 ttctctcagt aacttgtatt tgctcggtgc tatcaactct cagagttgaa ctttccttta
61 gaaacagtag atttgaaact ctctttttgc ggaatttgta agtggagata tcaagagttt
121 tgaggccaat ggtagaaaag gaaatatctt cgtatgtaaa ctagatagaa tcattctcag
181 aaactacttt ggcacgtgtg cgttcaactc acagtgttta acctttcttt tcatagagca
241 gtttggaaac actcagtttg taaagtcagc aactggatat ttggatgtat ttgattcctt
301 cgttggaaat ggga
//
LOCUS HUMRSALPC 2550 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alphoid repetitive DNA, subclone pHS53.
ACCESSION M28031 M28033 M28032
KEYWORDS alphoid repetitive sequence; repetitive DNA.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2550)
AUTHORS Zaitsev,I.Z. and Rogaev,E.I.
TITLE Structural analysis of alphoid DNA of primates
JOURNAL Mol. Biol. 20, 530-538 (1986)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt 1 850 alphoid pentamer
rpt 851 1700 alphoid pentamer
rpt 1701 2550 alphoid pentamer
BASE COUNT 743 a 496 c 524 g 787 t
ORIGIN
1 gaatctgcaa gtggatattt ggacttctct gaggatttcg ttggaaacgg gataaacttc
61 ccagaactac acggaagcat tgtgagaaac ttctttgtga tgtttgcatt caactcacag
121 agttgaacct tgctttcata gttcagcttt caaacactct ttttgtagaa tctgcaagtg
181 gatatttgga ccactttgtg gccttccttc gaaacgggta tatcttcaca tcaaacctag
241 accgaagcat tctcagaatg tttcctttga tgactgcatt caactcacag aggtgaacaa
301 tcctgctgat ggagcagttt tgaaactctc tttctttgga ttctgcaagt ggatatgtgg
361 acctctgtga agatttcttt ggaaacgggt tcatcttcac agaaaaacta aacagaagca
421 ttctcagaaa ctgctttgtg atgtttgtgt tccacttcag gaattaaact ttcctcttga
481 cagagcagct ctgaaaccct cttattctag aatctgcaag tggacatttg gaggactttg
541 aggcctgtgg tggaaaagga aaatcttcac ataaaaacta gatggaagca ttctcagaaa
601 ctactttgtg atgattgcat tcgactcaca gagttgaaca ttcctataga tagagcaggt
661 tgtaaacaat ctttttgtag aatctgcgat tggagatttg gactgctttg aggcctactg
721 tagtaaagga aataacttca tctaaaaacc aaacggaagc attcacagac aattcttagt
781 gataattgca ttgatctaac agagctgaac attcctttag atggcgtagt ttccaaacac
841 actttctgta gaatctgcaa gtggatattt ggacttctct gaggatttcg ttggaaacgg
901 gataaacttc ccagaactac acggaagcat tgtgagaaac ttctttgtga tgtttgcatt
961 caactcacag agttgaacct tgctttcata gttcagcttt caaacactct ttttgtggaa
1021 tctgcaagtg gatatttgga ccactttgtg gccttccttc gaaacgggta tatcttcaca
1081 tcaaacctag accgaagcat tctcagaatg tttcctgtga cgactgcatt caactcacag
1141 agatgaacaa tcctgctgat ggagcagttt tgaaactctc tttctttgga ttctgcaagt
1201 tgatatgtgg acctctgtga agatttcgtt ggaaacgggt tcatcttcac agaaaaacta
1261 aacagaagca ttctcagaaa ctgctttgtg atgtttgtgt tccacttcaa gaattgaact
1321 ttcctcttga cagagcagct ctgaaaccct ctttttctag aatctgcaag tggacatttg
1381 gagggctttg aggcctgtgg tgcaaaagga aaatcttcac ataaaaacta gatggaagca
1441 ttctcagaaa ctactttgtg atgattgcat tcgactcaca gagttgaaca ttcctataga
1501 tagagcaggt tgtaaacaat ctttttgtag aatctgcgat tggagatttg gactgctttg
1561 aggcctactg tagtaaagga aataacttca tctaaaatcc aaacggaagc attcacagac
1621 aattcttagt gataattgca ttgatctaac agagctgaac attcctttag atggcgtagt
1681 ttccaaacac actttctcta gaatctgcaa gtggatattt ggacttctct gaggatttcg
1741 ttggaaacgg gataaacttc ccagaactac acggaagcat tgtgagaaac ttctttgtga
1801 tgtttgcatt caactcacag agttgaacct tgctttcata gttcagcttt caaacactct
1861 ttttgtggaa tctgcaagtg gatatttgga ccactttgtg gccttccttc gaaacgggta
1921 tatcttcaca tcaaacctag acagaagcat tctcagaatg tttcctgtga cgactgcatt
1981 caactcacag aggtgaacaa ccctgctgat ggagcagttt tgaaactctc tttctttgga
2041 ttctgcaagt ggatatgtgg acctctgtga agatttcgtt ggaaacggtt tcatcttcac
2101 agaaaaacta aacaggagca ttctcagaaa ctgctttgtg atgtttgtgt tccacttcaa
2161 gaattgaact ttcctcttga cagagcagct ctgaaaccct ctttttctag aatctgcaag
2221 tggacatttg gagggctttg aggcctgtgg tggaaaagga aaatcttcac ataaaaacta
2281 gatggaagca ttctcagaaa ctactttgtg atgattgcat tcgactcaca gagttgaaca
2341 ttcctataga tagagcaggt tgtaaacaat gtttttgtag aatctgcgat tggagatttg
2401 gactgctttg aggcctactg tagtaaagga aataacttca tctaaaaacc aaacgcaagc
2461 attcacagac aattcttagt gatcattgga ttgaactaac agagctgaac attcctttag
2521 atggagaagt ttccaaacac actttctgca
//
LOCUS HUMRSALPD 134 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alphoid repetitive DNA, subclone p340-11A.
ACCESSION M28034
KEYWORDS alphoid repetitive sequence; repetitive DNA.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 134)
AUTHORS Zaitsev,I.Z. and Rogaev,E.I.
TITLE Structural analysis of alphoid DNA of primates
JOURNAL Mol. Biol. 20, 530-538 (1986)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt 1 134 alphoid repeat
BASE COUNT 47 a 23 c 27 g 37 t
ORIGIN
1 tcacagagtt gaacgatcct ttacacagag cagacttgta acactctttt tgtggaattt
61 gcaagtggag atttcagccg ctttgaagtc aaaggtagaa aaggaaatat cttcctataa
121 aaactagaca gaat
//
LOCUS HUMRSALPE 146 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alphoid repetitive DNA, subclone p340-23A.
ACCESSION M28035
KEYWORDS alphoid repetitive sequence; repetitive DNA.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 146)
AUTHORS Zaitsev,I.Z. and Rogaev,E.I.
TITLE Structural analysis of alphoid DNA of primates
JOURNAL Mol. Biol. 20, 530-538 (1986)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt 1 146 alphoid repeat
BASE COUNT 46 a 28 c 26 g 43 t 3 others
ORIGIN
1 ttttttccca actcacacga gttaacgatc ctttacacag agcagacttg tannnctctc
61 tttatggaat ttgcaagtgg agatttcagc cgctttgagg tcaatggtag aaaaggaaat
121 atcttcctat aaaaactaga cagaat
//
LOCUS HUMRSALPF 171 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alphoid repetitive DNA, subclone p340-10A.
ACCESSION M28036
KEYWORDS alphoid repetitive sequence; repetitive DNA.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 171)
AUTHORS Zaitsev,I.Z. and Rogaev,E.I.
TITLE Structural analysis of alphoid DNA of primates
JOURNAL Mol. Biol. 20, 530-538 (1986)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt 1 171 alphoid repeat
BASE COUNT 53 a 27 c 38 g 53 t
ORIGIN
1 aattctcaga gtcttctttg tgatgtgtgc tttcaactca ccgagataaa gatttctctt
61 gatagagcaa attggaaaca ctctttttgt agaatttgca agggtacatt gagagcgctt
121 tcaggcctat ggtagaaaag ggaatatctt tccataaaag gtagacagaa g
//
LOCUS HUMRSALPG 137 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alphoid repetitive DNA, subclone p340-11B.
ACCESSION M28037
KEYWORDS alphoid repetitive sequence; repetitive DNA.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 137)
AUTHORS Zaitsev,I.Z. and Rogaev,E.I.
TITLE Structural analysis of alphoid DNA of primates
JOURNAL Mol. Biol. 20, 530-538 (1986)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt 1 137 alphoid repeat
BASE COUNT 33 a 27 c 30 g 47 t
ORIGIN
1 gattctcaga aactcctttg tgatgtgtgc gttcaactca cagagtttaa cctttctttt
61 catagagcag ttaggaaaca ctctgtttgt gaagtctgca agtggatatt cagacctctt
121 tgaggccttc gttggaa
//
LOCUS HUMRSALPJ 158 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alphoid repetitive DNA, subclone p340-03B.
ACCESSION M28040
KEYWORDS alphoid repetitive sequence; repetitive DNA.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 158)
AUTHORS Zaitsev,I.Z. and Rogaev,E.I.
TITLE Structural analysis of alphoid DNA of primates
JOURNAL Mol. Biol. 20, 530-538 (1986)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt 1 158 alphoid repeat
BASE COUNT 44 a 31 c 40 g 43 t
ORIGIN
1 gcaactcaga aactactttg tgatgtgtgc attcaactca ccgagtgcaa ccgagcgcaa
61 gatagagcag tttggaaaca ttgtgtctgt agaatctgca agtggatata tggaccgctt
121 tgaggccttc gttggaaacg ggatttcttc ctataaag
//
LOCUS HUMRSALPK 138 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alphoid repetitive DNA, subclone p340-10B.
ACCESSION M28039
KEYWORDS alphoid repetitive sequence; repetitive DNA.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 138)
AUTHORS Zaitsev,I.Z. and Rogaev,E.I.
TITLE Structural analysis of alphoid DNA of primates
JOURNAL Mol. Biol. 20, 530-538 (1986)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt 1 138 alphoid repeat
BASE COUNT 37 a 27 c 32 g 42 t
ORIGIN
1 caatctcaga aactactttg tgatgtgtgc attcaactca ccgagtgcaa cattcctctt
61 gatagagcag tttggaaaca ttgtgtctgt agaatctgca agtggatata tggaccgctt
121 tgaggccttc gttggaaa
//
LOCUS HUMHBGF1 1082 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human heparin-binding growth factor 1 (hbgf-1), exon 1.
ACCESSION M23017
KEYWORDS acidic fibroblast growth factor; heparin-binding growth factor 1.
SEGMENT 1 of 3
SOURCE Human fetal liver DNA, clone ChIK117-1.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1082)
AUTHORS Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M.
TITLE Cloning of the gene coding for human class 1 heparin-binding growth
factor and its expression in fetal tissues
JOURNAL Mol. Cell. Biol. 9, 2387-2395 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.-M.Chiu, 13-MAR-1989.
FEATURES from to/span description
pept 602 + 770 heparin-binding growth factor 1, exon 1
/nomgen="FGFA" /map="5q31.3-q33.2"
/hgml_locus_uid="LD0150G"
pre-msg 568 > 1082 heparin-binding growth factor 1 mRNA and
intron
IVS 771 > 1082 heparin-binding growth factor 1 intron A
BASE COUNT 258 a 285 c 262 g 277 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttccct taacatacta accctttact ttccctgttg tgtccctgaa aggcctcctg
61 tgcctttggc tgcaggtccc gaacgtccag gccatctgtg ctatctgctt cgcggtacct
121 caccaacgca acgtgagggt ggagggcaga accttggtcc tggcctctca gcttttgtgg
181 gtttcagcca gaccctaggt gttattttag tgcaactttg gtgtttaatt tgaggatgtg
241 tgtggaccag aaggagggac caaaacatga ttcttttccc catggtcaga tgattaaatt
301 tgaagttcta aaaaatgcag tttggtccaa agctgtgtcc aattgggaag agagaaaaat
361 gccctggaaa cccctcccag gcctgggacc atccttcctt aaccaccagc cacctcacag
421 gcccgcggac tgcgggcatc acctgggcag gctgtgctta ctcactaccc gggaaccctg
481 tgccctggag ctgtccttcc tctcttcaaa gtgcattttg tgcctttgct ggaagaaccg
541 actacaggtt tgttcaattt cttacagtct tgaaagcgcc acaagcagca gctgctgagc
601 catggctgaa ggggaaatca ccaccttcac agccctgacc gagaagttta atctgcctcc
661 agggaattac aagaagccca aactcctcta ctgtagcaac gggggccact tcctgaggat
721 ccttccggat ggcacagtgg atgggacaag ggacaggagc gaccagcaca gtaagcccat
781 ctctatggca ccccccttcc ctttctgaca tcttctgtag tcaaggtggg aggaaggtgc
841 acatttaagt acaggtactt gcttctccaa ggttctattc aggcatgaca cattcagagg
901 tggagtcaca taaatgcgta aaatgtctgg gaaatgaaaa tagggacttg tgggggccac
961 cacttaccca aacgtgtcct atttcaagtt ttttaaagca ctctctgctg acccaacaga
1021 acgggctgcc ggtgctcaat tgctgtatgt tttcccaggt ttctgtaact agtgaaagat
1081 ct
//
LOCUS HUMHBGF2 427 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human heparin-binding growth factor 1 (hbgf-1), exon 2.
ACCESSION M23086
KEYWORDS acidic fibroblast growth factor; heparin-binding growth factor 1.
SEGMENT 2 of 3
SOURCE Human adult liver DNA, clone ChIL208-1.4.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 427)
AUTHORS Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M.
TITLE Cloning of the gene coding for human class 1 heparin-binding growth
factor and its expression in fetal tissues
JOURNAL Mol. Cell. Biol. 9, 2387-2395 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.-M.Chiu, 13-MAR-1989.
FEATURES from to/span description
pept + 186 + 289 heparin-binding growth factor 1, exon 2
/nomgen="FGFA" /map="5q31.3-q33.2"
pre-msg < 1 > 427 heparin-binding growth factor 1 mRNA and
introns
IVS < 1 185 heparin-binding growth factor 1 intron A
IVS 290 > 427 heparin-binding growth factor 1 intron B
BASE COUNT 106 a 85 c 108 g 128 t
ORIGIN Unknown number of bp after segment 1.
1 cagctttctt tggaaggcaa agaaaaaggg actgtatttc tatgttttga ttaatctgag
61 gctcatcctg agggctccgt gaaatgaatg agcagaattt tccatggcca actgtcctgg
121 ctgccgggtc ctatcggcaa aagcgtagtg tttatttact tttgctcgtg ttatttttat
181 tccagttcag ctgcagctca gtgcggaaag cgtgggggag gtgtatataa agagtaccga
241 gactggccag tacttggcca tggacaccga cgggctttta tacggctcag taagtatgaa
301 gctgacatgc ttccagacgt tggccaaggt ttgaggtttc cagaaatctt gttacatgga
361 gtgaggcaaa ctataaagca acaattagtc tctgtttgtt attttttcca gaaggattcc
421 caccctc
//
LOCUS HUMHBGF3 664 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human heparin-binding growth factor 1 (hbgf-1), exon 3.
ACCESSION M23087
KEYWORDS acidic fibroblast growth factor; heparin-binding growth factor 1.
SEGMENT 3 of 3
SOURCE Human fetal liver DNA, clone ChIK105a-2.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 664)
AUTHORS Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M.
TITLE Cloning of the gene coding for human class 1 heparin-binding growth
factor and its expression in fetal tissues
JOURNAL Mol. Cell. Biol. 9, 2387-2395 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.-M.Chiu, 13-MAR-1989.
FEATURES from to/span description
pept + 304 498 heparin-binding growth factor 1, exon 3
/nomgen="FGFA" /map="5q31.3-q33.2"
pre-msg < 1 > 498 heparin-binding growth factor 1 mRNA and
intron
IVS < 1 303 heparin-binding growth factor 1 intron A
BASE COUNT 181 a 153 c 142 g 188 t
ORIGIN Unknown number of bp after segment 2.
1 tgaggactct tagaagtgct cttatcagta gcatcttaat tactttacaa tggattttaa
61 atggaaagga agtttacaat aatagcaaat gcatattgac agctctttag tgcccggtgc
121 tgttctaagt ccttatgact accctgtgaa ataagttcca ccatgacccc aattttcctg
181 aaaaggagac tgaggcatgg agagctttag tattttgccc aatgtcacac agctagtaaa
241 tggggacccc catgtgaaac tactcactga ttgtcctact ctcttgtggt tttatctttt
301 tagcagacac caaatgagga atgtttgttc ctggaaaggc tggaggagaa ccattacaac
361 acctatatat ccaagaagca tgcagagaag aattggtttg ttggcctcaa gaagaatggg
421 agctgcaaac gcggtcctcg gactcactat ggccagaaag caatcttgtt tctccccctg
481 ccagtctctt ctgattaaag agatctgttc tgggtgttga ccactccaga gaagtttcga
541 ggggtcctca cctggttgac ccaaaaatgt tcccttgacc attggctgcg ctaaccccca
601 gcccacagag cctgaatttg taagcaactt gcttctaaat gcccagttca cttctttgca
661 gagc
//
LOCUS HUMASAT3 1741 bp ds-DNA PRI 10-AUG-1990
DEFINITION Human alpha satellite and satellite 3 DNA.
ACCESSION M21305
KEYWORDS alpha satellite; satellite 3; satellite DNA.
SOURCE Human DNA, clone pTR9.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1741)
AUTHORS Vissel,B., Earle,E. and Choo,K.H.
TITLE A clone containing contiguous satellite 3 and alpha satellite
sequences, separated by a direct repeat of 9 bp
JOURNAL Unpublished (1989) Royal Children's Hospital, Victoria, Australia
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by B.Vissel 29-MAR-1989.
Indirect evidence suggests that this sequence may represent a
natural junction sequence between alpha satellite and satellite 3
sequences on chromosome 9.
FEATURES from to/span description
site 1 1482 human satellite 3 DNA
site 1501 1741 human alpha satellite DNA
rpt 1483 1491 direct repeat 1A
rpt 1492 1500 direct repeat 1B
signal 5 9 caat box (put.)
signal 71 77 tata box (put.)
ORF 38 207 ORF 1 (alt.)
ORF 43 172 ORF 1 (alt.)
signal 630 634 caat box (put.)
signal 718 722 tata box (put.)
ORF 752 1109 ORF 2 (alt.)
ORF 757 925 ORF 2 (alt.)
BASE COUNT 661 a 152 c 534 g 394 t
ORIGIN
1 gaattcaata gaatggaatg gtatcagctc gagtggaatg gaatggaatg gaatggaatg
61 gcatggaatc gtataaagtg gaatggaatc aactcgagtg gaatggaatg gaatggaatg
121 gaatggaatg cagtacaatg caatagaatg gaatggaatg aactcgagtt gactggaatg
181 gaatggaatg gaatgcattt gaattgaatg gacctgaaga gaatggaatg gaatggaatg
241 gaatgaagtg gcctcgaatg caatagatta gattgcaacg gaatcatctt gagtggaatg
301 gagtgaaatg gaacggaatg gaatggaatg gaatggaata gaatggaatg gaatggaatg
361 gaatagaatg gaatggaatg gaatggaatg aaatgcaata gaatggaatg gtatcaactc
421 cagtggaatg gaatggaatg caatggaatg gaatggaatg gaatggaatg gaatcaattc
481 acctggaacg gaatggaatg caatggaatg gaatggaatg gaatgcagtg gaatggaatg
541 gaatagaatg gaatggaatg gaatggaatg gaatccactt gagtggaatg gaatggaatg
601 gaatggattc gaacggaatg gaatggaatg caatggaatg gaatgcatgt gattgaatgg
661 acacgaaaag aattgaatgg aaatgaatgg aatcgaatga agtggcctcg aatgcaatat
721 aatggaatgg aatcaagtcg agtggaatgg aatggaatgg gttggaatgg aatggaatgg
781 aatggaatgc attagaatgg aatggaatca actcgagtgg aatggaatgg aatggaatgg
841 aatggaatgg aatgcaatgg aatggaatgg aatggaatgg aatggaatgg aatggaatgc
901 aatggaatgg aatggaatgc aatagaatgg aatggaatca cctcgactgg aatggaatgg
961 aatggaatgc atttgaattg aatggacccg aagagaatgg aatggaatgg aatggaagga
1021 gtggcctcaa atgcaataga atgcaatgga atcatctcga gtggaatgga gtgaaatgga
1081 atggaatgga atggaatgga atgctataga atgaaatgga atgaagtcaa gtggaatgga
1141 atggaatgga atgggttgga agggaatgaa atggaatgca atagaatgga aaggaatgga
1201 atggaatgaa ctggccaatg caatagaatt gaatggaatc aactggagtg gaatggaatg
1261 gaatggaatg gattggaatg gaatgcaata gaatggaatg gaatcaactc gagtaaaatg
1321 gaatggaatg gaatggaatg gaatggaatg gaatggaatt gaatggaatg gaatggaatg
1381 catttgaatt gaatggacac gaaaagaatg gattggaatg gaatggaatg aagtagcctc
1441 gagtgcaatg atatggaatg gcatcaactc gagtggaatg gagtgaaaaa ggtgaaaaag
1501 gaaatacctt cacacaaaaa ctagacagaa gcattctcag aatcttcctt ttgatgtgtg
1561 cattcaactc acagagttga acctttcttt tgatagagca gttttgaaac actctctttg
1621 taggatctgc aactggacat ttggagctct ttgataccta cagtgaaaaa ggaaatatct
1681 tcacataaaa acttcacaga agcattctca aaaacatttt tcagatgtgt gcattgaatt
1741 c
//
LOCUS DROLINEJA 5154 bp ds-DNA INV 10-AUG-1990
DEFINITION D.melanogaster LINE element J-1.
ACCESSION M22874
KEYWORDS LINE element; dispersed repetitive element;
long interspersed element.
SOURCE D.melanogaster DNA, clone J-1.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 5154)
AUTHORS Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V.
TITLE The Drosophila mobile element jockey belongs to LINEs and contains
coding sequences homologous to some retroviral proteins
JOURNAL Gene 70, 253-262 (1988)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
pept / 414 2165 ORF1, nucleic-acid binding protein (AA at 1)
(put.)
pept 2162 4912 ORF2, reverse transcriptase (put.)
rpt 115 5134 jockey dispersed repetitive element
BASE COUNT 1580 a 1226 c 1072 g 1276 t
ORIGIN
1 catacccaac ctgaacacgt ttttctcaat atttgtgcca caaaatactt cgccagtaaa
61 ttaacatgaa gtcgctaaat gcgattacaa ttgacgacgc aatcatatac aaaaaaaaat
121 cattcacatg ggagatgagc aatcgagtgg acgtgttcac agaagtcgcg agataaaaca
181 aaaacgtaat tgtgatccat cacaaacatc tgcgcagatc gtgtgcttat ctcacaaaca
241 aaatctattt ttagtcactg cataacggtg acggcttcgg ttcgcgaaac ttatcagcaa
301 ctagcaattt ctaagctgtg ttgtttttgc ccctcgccct gcgcgctgcg caagcgggag
361 gttgttacaa tttaccttac aagtaaaccg gtaaatctta tcgtgtttag taaatatcaa
421 ttgcattata cggcataagt ataaagacaa ttgatataat ggagaattca tttgctcaat
481 cgcgacctag caatgggtgc gataaatttg agaaaatgag gaaagtagca ggtgttgagc
541 caggagaatt acgctcccaa ctccgcgcca gctgtgcagt tgtttcccct aacctggaag
601 gtatgccaac tcaatctgcg gtctccagct taatggtgac aatcagcagc aacaccaatg
661 caagtgttac ctgcactatt tctaacgtac aggccaacat gatctgtact cctacataca
721 ctgattgcac aaccgtgacc actagcattt gcccaactac gccttatgac aatggactgc
781 cgacacctct gtcatcactg cccaataagc catctaaagc gaattgcccc tttcaagcac
841 atgatcgtac tgtcaacagg aaacgaaaag gcgtgtctca gcccccatta cctatcctca
901 ccccttctcc aagccgtaaa actaaaaggc aggccactat gccactcaat gaggaggcct
961 ctacctccac tgcagcagca ttaaataaca atcgcttcgc gcttttgtcc gctgaagcgg
1021 agaatatgga gcaagacgtg tcggatgctg attctgacat tgaagactct gctgcccgag
1081 atggtggtgg acaatccgct aaatatagca aacccccagc catatgcgta ccaagtgtaa
1141 gcgatccggt caccttggaa cgggctctca atctgagcac cggctcctca aactactaca
1201 tccgcatttc tagatttggt gtatccagaa tctatacagc caaccctgat gctttccgca
1261 ccgctgtaaa agaactaaat aagttaaatt gtcaattctg gcatcaccaa cttaaagaag
1321 aaaaacccta cagagtagtg cttaaaggaa tccatgctaa tgttcctagt tcgcagatag
1381 aacaagcatt tagtgatcac ggctatgagg tccttaatat ctattgcccc agaaagtctg
1441 actggaagaa cattcaggta aacgaagatg ataatgaagc tacaaaaaac ttcaaaacta
1501 gacaaaattt gttttatatt aatcttaaac aaggcccgaa tgttaaagag tctcttaaga
1561 taactcgact tggcagatac agagtcactg ttgagcgcgc tacacgtaga aaagaactgc
1621 tacaatgtca aagatgccaa atttttggac actctaagaa ctattgcgcc caggatccta
1681 tttgtggtaa atgtagtggt ccccatatga ccgggttcgc tttgtgcata agtgacgtat
1741 gtctgtgtat aaattgtggt ggtgatcatg tctcgacaga caaaagctgc cctgtcagag
1801 cagagaaagc caagaagcta aaaccaaggt ccaggctacc gatgactaat aatattgcca
1861 cactcaaacc tccacaacgt tcttcaagcg gttacatacc agctgaggca ttaagaacca
1921 acatctctta tgctgatatt gctcgacgca acacgactca atctagggct cgtgctactg
1981 tgcaggctga agttatacca acgtcggaca atagccttaa caataaattt atgacgttag
2041 acaactccat tcgggccatc aatacgagaa tggacgaact atttaagctt atacacgaaa
2101 ctgtagaggc taataaagct ttcagagaac tggttcaggt tctaattaca cgtattccta
2161 aatgactcaa ccaaccttaa aaatcggatt gtggaacgct cgcggattaa caaggggctc
2221 tgaggagctt cggatattcc tcagcgatca cgatatagac gtaatgctta ccacggaaac
2281 acacatgcga gttggtcagc gcatctatct cccagggtat cttatgtatc acgcccacca
2341 ccccagtggt aacagtagag gtggctctgc agtcatcata aaatctagac tttgtcacag
2401 ccctctgaca cctatctcta ctaatgacag gcagatagcg agagtgcacc tgcaaacatc
2461 ggttgggacc gtcactgtag ctgctgttta tctacctcca gcagaaagat ggatagtaga
2521 tgacttcaaa tccatgtttg ctgcgttagg caacaaattt attgctggtg gtgattacaa
2581 tgccaaacat gcatggtggg ggaacccaag atcctgtcct agaggtaaaa tgttgcaaga
2641 agtcattgca catgggcaat accaagttct ggctacgggc gaacccactt tctactctta
2701 caaccctttg ttaacaccat cagcccttga tttttttata acctgtgggt acggcatggg
2761 caggctagat gtacaaactc tccaggaact ctcgtcggac catcttccta ttctggctgt
2821 attgcacgct acgccgttaa agaaaccaca acgcgtacga ctacttgccc ataatgctga
2881 cataaacata ttcaaaaccc atcttgaaca gctgagtgag gtaaatatgc aaattctgga
2941 ggcggtggac attgataatg ccacaagcct tttcatgagc aaactaagtg aggctgctca
3001 gcttgctgca ccgagaaatc ggcatgaagt agaggccttc agaccacttc aacttccttc
3061 cagtatattg gcactgctca ggctaaaacg aagagttcga aaagaatatg ctagaacagg
3121 tgatccccgc atgcaacaga tccacagtag actggccaac tgcctgcata aggcccttgc
3181 tcgaagaaag caggcccaaa tagatacctt cttggataac ttgggtgctg acgcgagcac
3241 aaattactca ctgtggcgta tcacgaaacg gttcaaagct cagcccaccc caaaatcagc
3301 aatcaaaaat ccgtctggtg gctggtgtcg cactagcttg gaaaaaactg aagtgttcgc
3361 taacaacctt gagcaacgtt ttacacccta taactatgca ccggaaagtc tctgtcgtca
3421 ggttgaagaa tacttggaat cgccctttca aatgagcctg cctctgagtg ctgtcacact
3481 ggaagaagtg aagaatttaa tagccaagct gccacttaag aaagctcctg gagaagatct
3541 tcttgataat agaaccatta gacttctccc agatcaagca ttgcagttcc ttgccttaat
3601 attcaacagc gttcttgatg ttggctactt tccgaaagct tggaaatcgg cgagcataat
3661 tatgatccat aagactggaa aaacaccgac agacgttgac tcgtacaggc ccaccagctt
3721 actcccatct ctgggtaaaa ttatggagag gctgatccta aacaggctgc tcacatgcaa
3781 ggatgttacc aaagcgattc ccaaatttca gtttggcttc cggttgcagc acggtactcc
3841 tgagcaacta catagagtag tgaactttgc tctggaagct atggaaaaca aggagtatgc
3901 agtaggtgcc tttcttgata ttcaacaggc atttgacaga gtctggcacc ctgggctcct
3961 gtacaaagcg aagaggctgt tcccgccgca gctatatttg gttgttaaaa gtttcctgga
4021 agaacgcaca ttccacgtct ctgttgatgg gtacaaatca tcaatcaagc caattgcagc
4081 tggagttcct caaggaagcg ttcttggccc aaccctatac tcagtttttg cttcggacat
4141 gcctactcac acaccagtca cagaggtaga cgaagaagat gtgctcatag ccacctacgc
4201 tgacgatact gctgtgctca cgaaaagtaa aagtatcctg gctgccactt ctggtctaca
4261 ggaatacctg gatgcattcc agcaatgggc tgagaactgg aatgtgcgca tcaacgctga
4321 gaagtgtgcc aatgtgacgt tcgccaaccg aacaggtagc tgtccgggtg tcagtctgaa
4381 tggaagactg atcagacacc atcaggctta taaatacctt ggtattaccc tcgataggaa
4441 gctcaccttc agcaggcaca tcacaaatat tcagcaagcg ttcaggacca aggttgctcg
4501 gatgtcttgg ctcattgcac cacgcaacaa actgtcgctt ggctgcaagg tcaatattta
4561 caagtccata ttggccccct gcctgttcta cggcctgcag gtatacggca ttgctgcgaa
4621 gagtcacctt aataagatcc ggattttaca ggcgaagacc ttaagaagaa tttcgggggc
4681 tccttggtat atgagaacaa gagacatcga acgcgacctc aaggtgccca aattaggaga
4741 caagctccag aacatcgccc aaaaatatat ggaaaggctt aatgtacacc ccaacagcct
4801 agcaaggaag ctaggaactg cagctgtggt caatgctgac cctcggacta gagtcaaaag
4861 aagactcaag cgacaccacc ctcatgacct ccctaacctg gttttgacct agaaagtctt
4921 agttttaaaa ttcattagaa taatcaaata aataataatt actatgttat atcaactatt
4981 ataattctcc ctatcatttt tagattaaaa atctgttagt cttaagtaac caagacacat
5041 tgtaaaataa aataatttaa gcagatcaaa ttaagttgcc gcatgggtaa cagtgcgttg
5101 atcaaataat aaaaacatca taaaaaaaaa aaaaatatac aaaattaatg atgg
//
LOCUS DROLINEJB 490 bp ds-DNA INV 10-AUG-1990
DEFINITION D.melanogaster LINE element J-1.
ACCESSION M22875
KEYWORDS LINE element; dispersed repetitive element;
long interspersed element.
SOURCE D.melanogaster DNA, clone J-0.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 490)
AUTHORS Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V.
TITLE The Drosophila mobile element jockey belongs to LINEs and contains
coding sequences homologous to some retroviral proteins
JOURNAL Gene 70, 253-262 (1988)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt < 1 490 jockey dispersed repetitive element
BASE COUNT 218 a 64 c 51 g 157 t
ORIGIN
1 tttgttatta aaattcatta gaataatcaa ataaataata attactatgt tatatcaact
61 attataattc tccctatcat ttttaggatt aaaaatctgt tagtcttaag taaccaagac
121 acattgtaaa ataaaataat ttaagcagat caaattaagt tgccgcatgg gtaacagtgc
181 gttgatcaaa taataaaaac atcatcatat aaagacctcc ctaacctggt tttgacctag
241 aaagtcttag ttttaaaatt cattagaata atcaaataaa taataattac tatgttatat
301 caactattat aattctccct atcattttta ggattaaaaa tctgttagtc ttaagtaacc
361 aagacacatt gtaaaataaa ataatttaag cagatcaaat taagttgccg catgggtaac
421 agtgcgttga tcaaataata aaaacatcat cgtatataat taaaaaaaaa aaaaaaaaaa
481 aaaaaaaaaa
//
LOCUS DROLINEJC 367 bp ds-DNA INV 10-AUG-1990
DEFINITION D.melanogaster LINE element J-1.
ACCESSION M22876
KEYWORDS LINE element; dispersed repetitive element;
long interspersed element.
SOURCE D.melanogaster DNA, clone J-4.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 367)
AUTHORS Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V.
TITLE The Drosophila mobile element jockey belongs to LINEs and contains
coding sequences homologous to some retroviral proteins
JOURNAL Gene 70, 253-262 (1988)
STANDARD simple staff_entry
COMMENT
FEATURES from to/span description
rpt < 1 367 jockey dispersed repetitive element
BASE COUNT 135 a 76 c 53 g 103 t
ORIGIN
1 ctatctcccg atctctattg tacaccccaa cagcctagca aggaagctag gaactgcagc
61 tgtggtcaat gctagccctc agcgctagag tcaaaagaag actcaagcga caccaccctc
121 atgacctccc taacctggtt ttgacctaga aagtcttagt tttaaaattc attagaataa
181 tcaaataaat aataattact atgttatatc aactattata attctcccta tcatttttag
241 gattaaaaat ctgttagtct taagtaacca agacacattg taaaataaaa taatttaagc
301 agatcaaatt aagttgccgc atgggtaaca gtgcgttgat caaataataa aaacatcatc
361 gtaaaaa
//
LOCUS HUMPOVRA 1254 bp ss-mRNA PRI 10-AUG-1990
DEFINITION Human poliovirus receptor mRNA, clone H20A.
ACCESSION M24407
KEYWORDS oncogene; poliovirus receptor; transmembrane protein;
tyrosine kinase; viral receptor.
SOURCE Human Hela cell cDNA to mRNA, clone H20A.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1254)
AUTHORS Mendelsohn,C.L., Wimmer,E. and Racaniello,V.R.
TITLE Cellular receptor for poliovirus: Molecular cloning, nucleotide
sequence, and expression of a new member of the immunoglobulin
superfamily
JOURNAL Cell 56, 855-865 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1254; revises [1])
AUTHORS Racaniello,V.R.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence kindly provided
by V.Racaniello, 27-APR-1989.
FEATURES from to/span description
pept 1 1254 poliovirus receptor
revision 1020 1020 a in [1]; g in [1]
BASE COUNT 255 a 389 c 362 g 248 t
ORIGIN
1 atggcccgag ccatggccgc cgcgtggccg ctgctgctgg tggcgctact ggtgctgtcc
61 tggccacccc caggaaccgg ggacgtcgtc gtgcaggcgc ccacccaggt gcccggcttc
121 ttgggcgact ccgtgacgct gccctgctac ctacaggtgc ccaacatgga ggtgacgcat
181 gtgtcacagc tgacttgggc gcggcatggt gaatctggca gcatggccgt cttccaccaa
241 acgcagggcc ccagctattc ggagtccaaa cggctggaat tcgtggcagc cagactgggc
301 gcggagctgc ggaatgcctc gctgaggatg ttcgggttgc gcgtagagga tgaaggcaac
361 tacacctgcc tgttcgtcac gttcccgcag ggcagcagga gcgtggatat ctggctccga
421 gtgcttgcca agccccagaa cacagctgag gttcagaagg tccagctcac tggagagcca
481 gtgcccatgg cccgctgcgt ctccacaggg ggtcgcccgc cagcccaaat cacctggcac
541 tcagacctgg gcgggatgcc caatacgagc caggtgccag ggttcctgtc tggcacagtc
601 actgtcacca gcctctggat attggtgccc tcaagccagg tggacggcaa gaatgtgacc
661 tgcaaggtgg agcacgagag ctttgagaag cctcagctgc tgactgtgaa cctcaccgtg
721 tactaccccc cagaggtatc catctctggc tatgataaca actggtacct tggccagaat
781 gaggccaccc tgacctgcga tgctcgcagc aacccagagc ccacaggcta taattggagc
841 acgaccatgg gtcccctgcc accctttgct gtggcccagg gcgcccagct cctgatccgt
901 cctgtggaca aaccaatcaa cacaacttta atctgcaacg tcaccaatgc cctaggagct
961 cgccaggcag aactgaccgt ccaggtcaaa gagggacctc ccagtgagca ctcaggcata
1021 tcccgtaacg ccatcatctt cctggttctg ggaatcctgg tttttctgat cctgctgggg
1081 atcgggattt atttctattg gtccaaatgt tcccgtgagg tcctttggca ctgtcatctg
1141 tgtccctcga gtacagagca tgccagcgcc tcagctaatg ggcatgtctc ctattcagct
1201 gtgagcagag agaacagctc ttcccaggat ccacagacag agggcacaag gtga
//
LOCUS HUMPOVRB 1179 bp ss-mRNA PRI 10-AUG-1990
DEFINITION Human poliovirus receptor mRNA, clone H20B.
ACCESSION M24406
KEYWORDS oncogene; poliovirus receptor; transmembrane protein;
tyrosine kinase; viral receptor.
SOURCE Human Hela cell cDNA to mRNA, clone H20B.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1179)
AUTHORS Mendelsohn,C.L., Wimmer,E. and Racaniello,V.R.
TITLE Cellular receptor for poliovirus: Molecular cloning, nucleotide
sequence, and expression of a new member of the immunoglobulin
superfamily
JOURNAL Cell 56, 855-865 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1179; revises [1])
AUTHORS Racaniello,V.R.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence kindly provided
by V.Racaniello, 27-APR-1989.
FEATURES from to/span description
pept 1 1179 poliovirus receptor
revision 1020 1020 a in [1]; g in [1]
BASE COUNT 234 a 368 c 341 g 236 t
ORIGIN
1 atggcccgag ccatggccgc cgcgtggccg ctgctgctgg tggcgctact ggtgctgtcc
61 tggccacccc caggaaccgg ggacgtcgtc gtgcaggcgc ccacccaggt gcccggcttc
121 ttgggcgact ccgtgacgct gccctgctac ctacaggtgc ccaacatgga ggtgacgcat
181 gtgtcacagc tgacttgggc gcggcatggt gaatctggca gcatggccgt cttccaccaa
241 acgcagggcc ccagctattc ggagtccaaa cggctggaat tcgtggcagc cagactgggc
301 gcggagctgc ggaatgcctc gctgaggatg ttcgggttgc gcgtagagga tgaaggcaac
361 tacacctgcc tgttcgtcac gttcccgcag ggcagcagga gcgtggatat ctggctccga
421 gtgcttgcca agccccagaa cacagctgag gttcagaagg tccagctcac tggagagcca
481 gtgcccatgg cccgctgcgt ctccacaggg ggtcgcccgc cagcccaaat cacctggcac
541 tcagacctgg gcgggatgcc caatacgagc caggtgccag ggttcctgtc tggcacagtc
601 actgtcacca gcctctggat attggtgccc tcaagccagg tggacggcaa gaatgtgacc
661 tgcaaggtgg agcacgagag ctttgagaag cctcagctgc tgactgtgaa cctcaccgtg
721 tactaccccc cagaggtatc catctctggc tatgataaca actggtacct tggccagaat
781 gaggccaccc tgacctgcga tgctcgcagc aacccagagc ccacaggcta taattggagc
841 acgaccatgg gtcccctgcc accctttgct gtggcccagg gcgcccagct cctgatccgt
901 cctgtggaca aaccaatcaa cacaacttta atctgcaacg tcaccaatgc cctaggagct
961 cgccaggcag aactgaccgt ccaggtcaaa gagggacctc ccagtgagca ctcaggcata
1021 tcccgtaacg ccatcatctt cctggttctg ggaatcctgg tttttctgat cctgctgggg
1081 atcgggattt atttctattg gtccaaatgt tcccgtgagg tcctttggca ctgtcatctg
1141 tgtccctcga gtgagcatca ccagagctgc cgtaattga
//
LOCUS MUSSNAP25 2040 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Mouse SNAP-25 mRNA encoding synaptosomal associated protein 25,
complete cds.
ACCESSION M22012
KEYWORDS synaptosomal associated protein 25.
SOURCE Mouse (strain BALB/c) adult neural brain cell, cDNA to mRNA, clones
pMuBr8, p8.1, p8.51, p8.52, p8.53, and p8.71.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2040)
AUTHORS Oyler,G.A., Higgins,G.A., Hart,R.A., Battenberg,E., Billingsley,M.,
Bloom,F.E. and Wilson,M.C.
TITLE The identification of a novel synaptosomal associated protein,
SNAP25, differentially expressed by neuronal subpopulations
JOURNAL J. Cell Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
M.Wilson 18-JAN-1989
FEATURES from to/span description
pept 164 784 synaptosomal associated protein 25
mRNA < 1 2037 synaptosomal associated protein 25 mRNA
BASE COUNT 613 a 462 c 430 g 535 t
ORIGIN Chromosome 2.
1 cccgaggttt ggagctgtct ttccttccct ccctacccgg cggctcctcc actcttgcta
61 cctgcaggga tcagcggaca gcatcctctg aagaagacaa ggttccttaa ctaagcacca
121 ctgacttgct ggccccggcg cccagcaacc ccccaccact accatggccg aggacgcaga
181 catgcgtaat gaactggagg agatgcagag gagggctgac cagctggctg atgagtccct
241 ggaaagcacc cgtcgcatgc tgcagctggt cgaagagagt aaagatgctg gcatcaggac
301 tttggttatg ttggatgagc aaggcgaaca actggaacgc attgaggaag ggatggacca
361 aatcaataag gatatgaaag aagcagaaaa gaatttgacg gacctaggaa aattctgcgg
421 gctttgtgtg tgtccctgta acaagcttaa atccagtgat gcttacaaaa aagcctgggg
481 caataatcag gatggagtag tggccagcca gcctgcccgt gtggtggatg aacgggagca
541 gatggccatc agtggtggct tcatccgcag ggtaacaaac gatgcccggg aaaatgaaat
601 ggatgaaaac ctagagcagg tgagcggcat catcggaaac ctccgtcata tggccctaga
661 catgggcaat gagattgaca cccagaatcg ccagattgac aggatcatgg agaaggctga
721 ctccaacaaa accagaattg atgaagccaa ccaacgtgca acaaagatgc tgggaagtgg
781 ttaaatctgc cgttctgctg tgctgtcctc caatgttgtt ggacaagaga gaagagagct
841 ccttcatgct tctctcatgg tattacctag taagacttac acacacacac acacacacac
901 acacacacac acacacacac acacacacac agagtagtca cccccattgt aaatgtctgt
961 gtggtttgtc agcttcccaa tgataccatg tgtcttttgt tttctccggc tctctttctt
1021 tgccaaaggt tgtacatagt ggtcatctgg tgactctatt tcctgactta agagttcttg
1081 ggtctctctc tttcttttct cagtggcgtt tgctgaatga caacaattta ggaatgctca
1141 atgtactgtt gatttttctc aatacacagt attgttcttg taaaactgtg acttaccaca
1201 gagctactac cacagtcctt tcttagggtg tcaggctctg aatctctcca aatgtgctct
1261 ctttggttcc tcagtgctat tctttgtctt tatgatttca taattagaca atgtgaaatt
1321 acataacagg cattgcacta aaagtgatgt gatttatgca tttatgcatg agaactaaat
1381 agacttttag atcctactta aacaaaaact tccatgacag tagcatactg acaagaaaac
1441 acacacaaca gcaacaataa caaagcaaca actacgcatg ctcagcattg ggacactgtc
1501 aagattaagt cataccagca aaacctgcag ctgtgtcacc ttcttctgtc aacatacaga
1561 ctgatcataa tgatcccttc tttacacaca cacacacaca cacacacaca cacacacaca
1621 caaatggaat ttaaccaact tcccagaatt gatgaagcaa atatatgttt ggctgaaact
1681 attgtaaatg ggtgtaatat agggtttgtc gaatgctttt gaaagctctg ttttccagac
1741 aatactcttg tgtggaaaac gtgaagatct tctaagtctg gctcttgtga tcaccaaacc
1801 ctggtgcatc agtacaacac tttgcgctaa tctagagcta tgcacaacca aattgctgag
1861 atgtttagta gctgataaag aaacctttaa aaaattatat aaatgaatga aatatagata
1921 aactgtgaga taaatatcat tacagcatgt atattaaatc cctcctgtct cctctgttgg
1981 tttgtgaagt gatttgacat tttgtagcta gtttaaaatt attaaaaatt atagatgtta
//
LOCUS CHKNCAMC1 26 bp ss-mRNA VRT 10-AUG-1990
DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12.
ACCESSION M24014 J04140
KEYWORDS neural cell adhesion molecule.
SEGMENT 1 of 6
SOURCE Chicken embryonic day 10 cardiac muscle cDNA to mRNA, clone
pEC101B.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 7 to 18)
AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
TITLE Four exons encode a 93-base-pair insert in three neural cell
adhesion molecule mRNAs specific for chicken heart and skeletal
muscle
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 26)
AUTHORS Prediger,E.A.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
FEATURES from to/span description
pept < 1 + 18 neural cell adhesion molecule, exon 12 (AA at
1)
IVS 19 > 26 intron
BASE COUNT 10 a 6 c 5 g 5 t
ORIGIN
1 tcaagacaca gccagttcgt aagtaa
//
LOCUS CHKNCAMC2 29 bp ds-DNA VRT 10-AUG-1990
DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12A.
ACCESSION M23991 J04140
KEYWORDS neural cell adhesion molecule.
SEGMENT 2 of 6
SOURCE Chicken embryonic day 10 cardiac muscle DNA, clone 6B1, and cDNA to
mRNA, clone pEC101B.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 7 to 21)
AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
TITLE Four exons encode a 93-base-pair insert in three neural cell
adhesion molecule mRNAs specific for chicken heart and skeletal
muscle
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 6; 22 to 29)
AUTHORS Prediger,E.A.
JOURNAL Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021
STANDARD full staff_entry
FEATURES from to/span description
pept + 7 + 21 neural cell adhesion molecule exon 12A
IVS < 1 6 N-CAM intron
IVS 22 > 29 N-CAM intron
BASE COUNT 6 a 10 c 6 g 7 t
ORIGIN
1 ctgcaggtat ccctcactca cgtaagtgc
//
LOCUS CHKNCAMC3 54 bp ds-DNA VRT 10-AUG-1990
DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12B.
ACCESSION M23992 J04140
KEYWORDS neural cell adhesion molecule.
SEGMENT 3 of 6
SOURCE Chicken embryonic day 10 cardiac muscle DNA, clone 12AC3, and cDNA
to mRNA, clone pEC101B.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 13 to 46)
AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
TITLE Four exons encode a 93-base-pair insert in three neural cell
adhesion molecule mRNAs specific for chicken heart and skeletal
muscle
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 12; 47 to 54)
AUTHORS Prediger,E.A.
JOURNAL Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021
STANDARD full staff_entry
FEATURES from to/span description
pept + 13 + 46 neural cell adhesion molecule exon 12B
IVS < 1 12 N-CAM intron
IVS 47 > 54 N-CAM intron
BASE COUNT 11 a 13 c 12 g 18 t
ORIGIN
1 ttctttcttt agcaagtgct gctgccccca ccatgtttag cagagagtga gtat
//
LOCUS CHKNCAMC4 62 bp ds-DNA VRT 10-AUG-1990
DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12C.
ACCESSION M23993 J04140
KEYWORDS neural cell adhesion molecule.
SEGMENT 4 of 6
SOURCE Chicken embryonic day 10 cardiac muscle DNA, clone 7C1, and cDNA to
mRNA, clone pEC101B.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 13 to 54)
AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
TITLE Four exons encode a 93-base-pair insert in three neural cell
adhesion molecule mRNAs specific for chicken heart and skeletal
muscle
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 12; 55 to 62)
AUTHORS Prediger,E.A.
JOURNAL Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021
STANDARD full staff_entry
FEATURES from to/span description
pept + 13 + 54 neural cell adhesion molecule exon 12C
IVS < 1 12 N-CAM intron
IVS 55 > 62 N-CAM intron
BASE COUNT 18 a 20 c 12 g 12 t
ORIGIN
1 atgtttccat agcaactcag cctctccctg catccgaaag cacagagcca ccaagtgagt
61 ag
//
LOCUS CHKNCAMC5 3 bp ss-mRNA VRT 10-AUG-1990
DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12D.
ACCESSION M23994 J04140
KEYWORDS neural cell adhesion molecule.
SEGMENT 5 of 6
SOURCE Chicken embryonic day 10 cardiac muscle cDNA to mRNA clone pEC101B.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 3)
AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
TITLE Four exons encode a 93-base-pair insert in three neural cell
adhesion molecule mRNAs specific for chicken heart and skeletal
muscle
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
STANDARD full staff_entry
COMMENT Exon 12D represents a very small exon.
FEATURES from to/span description
pept + 1 + 3 neural cell adhesion molecule, exon 12D
BASE COUNT 2 a 0 c 1 g 0 t
ORIGIN
1 aag
//
LOCUS CHKNCAMC6 33 bp ss-mRNA VRT 10-AUG-1990
DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 13.
ACCESSION M37649 J04140
KEYWORDS neural cell adhesion molecule.
SEGMENT 6 of 6
SOURCE Chicken embryonic day 10 cardiac muscle cDNA to mRNA clone pEC101B.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 33)
AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
TITLE Four exons encode a 93-base-pair insert in three neural cell
adhesion molecule mRNAs specific for chicken heart and skeletal
muscle
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 33)
AUTHORS Prediger,E.A.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT
FEATURES from to/span description
pept + 13 > 33 neural cell adhesion molecule, exon 13
IVS < 1 12 intron
BASE COUNT 9 a 11 c 7 g 6 t
ORIGIN
1 ccattgtttc aggggaaccc agtgcaccca aac
//
LOCUS BADLDH 1147 bp ds-DNA BCT 10-AUG-1990
DEFINITION B.caldolyticus lactate dehydrogenase (LDH) gene, complete cds.
ACCESSION M19394 M19386
KEYWORDS lactate dehydrogenase.
SOURCE B.caldolyticus (strain DSM 405) DNA, clone pLDHY1.
ORGANISM Bacillus caldolyticus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1147)
AUTHORS Zuelli,F., Weber,H. and Zuber,H.
TITLE Nucleotide sequences of lactate dehydrogenase genes from the
thermophilic bacteria Bacillus stearothermophilus, B. caldolyticus
and B.caldotenax
JOURNAL Biol. Chem. Hoppe-Seyler 368, 1167-1177 (1987)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by F.Zuelli, 27-MAY-1988.
FEATURES from to/span description
pept 146 1099 lactate dehydrogenase (AC 1.1.1.27)
BASE COUNT 282 a 275 c 321 g 269 t
ORIGIN 301 bp upstream of NcoI site.
1 tgctcccgtt gtttgccgcg gcgtctgtca tgaaatggac aaacaatagt caaacaatcg
61 ccacaatgcc gcatgcatcg cggtgcgcct ttcgcgtaaa atagtttgtg aaagtgttca
121 cattgtattg agggggatga atgcaatgaa aaacaacggt ggaacccgag tagtggtcat
181 cggcaccggg tttgtcggcg ccagttatgc gtttgcctta atgaatcaag ggattgccga
241 tgagatcgtg ctcatcgatg cgaatgaaag caaggccata ggcgatgcga tggacttcaa
301 ccatgggaaa gtatttgcgc cgaagccggc tgacatttgg cacggcgatt acgatgattg
361 ccgcgatgcc gatttggttg tcatttgcgc cggcgccaac caaaaaccgg gcgagacgcg
421 gcttgatctt gtggacaaaa acattgccat tttccgctcg atcgttgagt cggtcatggc
481 atccggattt caaggactgt ttctcgtcgc caccaatccg gtcgacattt taacgtacgc
541 gacgtggaaa ttcagcggcc tgccgcatga gcgggtgatc ggttcgggga cgattttaga
601 tacggcgcgg ttccgctttt tgttgggcga gtatttctct gtcgctccgc aaaatgttca
661 tgcctatatt attggggaac acggcgacac tgaactcccg gtctggagcc aggctgatat
721 cggcggcgtg ccgatccgca agctggtcga gtccaaaggg gaagaagcgc aaaaagagct
781 cgagcgcatt tttgtcaatg tgcgcgatgc cgcctaccaa attattgaga aaaaaggagc
841 gacgtactac ggaattgcga tgggacttgc ccgcgtgacg cgcgccattt tgcataacga
901 aaacgctatt ttgaccgtat cggcctacct cgatggccca tatggggagc gcgacgtcta
961 catcggcgtg ccggctgtca ttaaccgcaa tggcatccgc gaggtgatcg aaattgaatt
1021 gaatgatgac gaaaaaaatc gattccatca tagcgcagcc acattaaaaa gcgtgctagc
1081 ccgttttttt acgcgatgag gcaacaaacg aaaagttgtg ccaacagtct ttctcgtcca
1141 aacaggg
//
LOCUS BMELDHA 1423 bp ds-DNA BCT 10-AUG-1990
DEFINITION B.megaterium L-lactate dehydrogenase gene.
ACCESSION M22305
KEYWORDS L-lactate dehydrogenase.
SOURCE B.megaterium (strain DSM 090) DNA, clone pMH/pML.
ORGANISM Bacillus megaterium
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1423)
AUTHORS Waldvogel,S., Weber,H. and Zuber,H.
TITLE Nucleotide sequence of the L-lactate dehydrogenase gene from the
mesophilic bacterium B.megaterium. Preparation and properties of a
hybrid lactate dehydrogenase comprising moieties of the B.
megaterium and B.stearothermophilus enzymes
JOURNAL Biol. Chem. Hoppe-Seyler 368, 1391-1399 (1987)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1423; revises [1])
AUTHORS Suter,F.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer readable copy of sequence [1] kindly
submitted by S.Waldvogel 02-OCT-1987, and [2] by F.Suter
16-MAY-1989.
Author address [2]
F.Suter
ETH-Hoenggerberg
Inst. f. Molekularbiologie und Biophysik
CH 8093 ZUERICH
Switzerland
FEATURES from to/span description
pept 250 1206 L-lactate dehydrogenase (AA at 1)
BASE COUNT 449 a 263 c 277 g 434 t
ORIGIN
1 taacatacaa atagagaatg ttaaactctt tagctttctg caaaacaagt tgcctttact
61 caacacttta aacgtttaag atctttctaa atgtggcact tcatgtagaa ggacattttt
121 ttaaatcgcc ctgtaaaatt gtttactatt ttttaaaggg tgtgattttt atcacagctg
181 aagctattta ttcttgttac actaacattg tgaaaaacat cacaaaataa aattcaaagg
241 atgatacaaa tgaaaacaca atttacacca aaaacacgaa aagttgccgt tatcggaact
301 ggttttgttg gctcaagcta cgctttttca atggtgaatc aaggtattgc caatgaatta
361 gtgttaatcg atatgaacaa agaaaaagca gaaggtgaag cacgtgatat caatcatgga
421 atgccatttg ccacaccgat gaaaatctgg gctggagatt ataaagactg tgctgacgct
481 gatttagcag ttattacagc gggcgctaat caagctccag gggaaacacg cttagatcta
541 gttgaaaaaa acgttaaaat tttcgaatgc attgtaaaag atattatgaa cagcggattt
601 gacggcatca ttttagtggc aacaaatcca gttgatattc tcgcacacgt tacacaaaaa
661 gtatcaggat taccaaacgg acgggtaatt ggttcaggaa cgattcttga cacagctcgc
721 ttccgctact tgttaagcga ctatttcgaa gtagattctc gcaacgtcca cgcttatatt
781 atgggggaac atggagatac ggaatttcct gtttggagcc acgcgcaaat tggcggtgtg
841 aagctcgaac attttatcaa tactgccgct attgaaaaag aaccggatat gcagcatcta
901 ttcgaacaaa cccgcgatgc ggcttaccat attattaatc gaaaaggagc gacttattac
961 ggaattgcaa tggggcttgt acgcattacc aaggctattt tagatgatga aaattctatt
1021 ttaacagtat ctgctttatt agaaggacaa tacggtattt ctgatgtgta tatcggcgta
1081 ccagctatca ttaataaaaa cggcgtgcgt caaattattg aattgaattt aactcctcac
1141 gaacagcagc agctcgagca ctctgctagc attcttaagc aaactcgcga cagagctttt
1201 gtgtaacatc taaagatttt tgcgggggac tcccccgcac cttgttttaa atatgtagta
1261 gaggtgtatg aatatgacgt ggactcaggt atataatcct ttagataata tttggctttc
1321 tgcactaatt gcactcattc ccattatctt tttctttatt gctttaactc ttttgaaatt
1381 aaaaggacac attgcttgcc ggtattacgg tgcttctttc tat
//
LOCUS BSTLDH 1150 bp ds-DNA BCT 10-AUG-1990
DEFINITION B.stearothermophilus lactate dehydrogenase (LDH) gene, complete
cds.
ACCESSION M19396 M19386
KEYWORDS lactate dehydrogenase.
SOURCE B.stearothermophilus (strain NCIB 8924) DNA, clone pLDHY1.
ORGANISM Bacillus stearothermophilus
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1150)
AUTHORS Zuelli,F., Weber,H. and Zuber,H.
TITLE Nucleotide sequences of lactate dehydrogenase genes from the
thermophilic bacteria Bacillus stearothermophilus, B. caldolyticus
and B. caldotenax
JOURNAL Biol. Chem. Hoppe-Seyler 368, 1167-1177 (1987)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by F.Zuelli, 27-MAY-1988.
FEATURES from to/span description
pept 149 1102 lactate dehydrogenase (EC 1.1.1.27)
BASE COUNT 283 a 270 c 321 g 276 t
ORIGIN 304 bp upstream of NcoI site.
1 gtgctcccgt tgtttgccgc ggcgtctgtc atgaaatgga caaacaatag tcaaacaatc
61 gccacaatcg cgcatgcatt gcggtgcgcc tttcgcgtaa aatatttata tgaaagtgtt
121 cgcattatat tgagggagga tgaatgcaat gaaaaacaac ggtggagccc gagtagtggt
181 catcggcgcc gggtttgtcg gcgccagtta tgtgtttgcc ttaatgaatc aagggattgc
241 cgatgagatc gtgctcatcg atgcgaatga aagcaaggcc ataggcgatg cgatggactt
301 caaccatggg aaagtatttg cgccgaagcc ggttgacatt tggcacggcg attacgatga
361 ttgccgcgat gccgatttgg ttgtcatttg cgccggcgcc aaccaaaaac cgggcgagac
421 gcggcttgat cttgtggaca aaaacattgc cattttccgc tcgatcgttg agtcggtcat
481 ggcatccgga tttcaaggac tgtttctcgt cgccaccaat ccggtcgaca ttttaacgta
541 cgcgacgtgg aaattcagcg gcctgccgca tgagcgggtg atcggttcgg ggacgatttt
601 agatacggcg cggttccgct ttttgttggg cgagtatttc tctgtcgctc cgcaaaatgt
661 tcatgcctat attattgggg aacacggcga cactgaactc ccggtctgga gccaggctta
721 tatcggcgtc atgccgatcc gcaagctggt cgagtccaaa ggggaagaag cgcaaaaaga
781 tctcgagcgc atttttgtca atgtgcgcga tgccgcctac caaattattg agaaaaaagg
841 agcgacgtac tacgggattg cgatggggct tgcccgcgtg acgcgcgcca ttttgcataa
901 cgaaaacgct attttgaccg tatcagccta cctcgatggc ctatatgggg agcgcgacgt
961 ctacatcgga gtgccggctg tcattaaccg caatggcatc cgcgaggtga tcgaaattga
1021 attgaatgat gacgaaaaaa atcgattcca tcatagcgca gctacattaa aaagcgtgct
1081 agcccgtgct tttacgcgat gaggcaacaa acgaaaagtt gtgccgacag tctttctcgt
1141 ccaaacaggg
//
LOCUS MUSIGVDJA 351 bp ss-mRNA ROD 10-AUG-1990
DEFINITION Mouse Ig active H-chain mRNA variable region (V186-2 DFL16.1 J2)
(AA at 1) anti-NP hybridoma, clone H1-7.
ACCESSION M18548
KEYWORDS immunoglobulin; immunoglobulin heavy chain; joining exon;
rearranged gene; variable region.
SOURCE Mouse (strain C57BL/6J) anti-NP hybridoma mRNA, clone H1-7.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 351)
AUTHORS Blier,P.R. and Bothwell,A.
TITLE A limited number of B cell lineages generates the heterogeneity of
a secondary immune response
JOURNAL J. Immunol. 139, 3996-4006 (1987)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
A.Bothwell 03-MAY-1988.
FEATURES from to/span description
ORF < 1 > 351 Ig heavy-chain V-region (V-D-J2) (AA at 1)
recomb 293 294 V(186-2) region end/ D(FL16.1) start
recomb 308 309 D(FL16.1) region end/J2 start
BASE COUNT 84 a 92 c 89 g 79 t 7 others
ORIGIN Unreported.
1 caggtcaaac tgcagcagcc tggggctgag cttgtgaagc ctggggcttc agnnaagctg
61 tcctgcaagg cttctggcta caccttcacc acctncttga tncactgggt gaaacagagg
121 cctggacgag gccttgagtg gattggaagg attgatccta atggtggtgg tactaagtac
181 gatgnaagtt caagagcaag gccacactga ctgtagacga accctccagc acagcctaca
241 tgcagctcag cagcctgaca tctgaggact ctgcggncta ttattgtgca acctattact
301 acggtctcta ctttaactac tgggnaaggc accactctca cagtctcctc a
//GenBank-Updates@genbank.bio.net (08/13/90)
LOCUS MUSCZSOD 535 bp ss-mRNA ROD 13-AUG-1990
DEFINITION Mouse Cu-Zn superoxide dismutase mRNA, complete cds.
ACCESSION M35725
KEYWORDS Cu-Zn superoxide dismutase.
SOURCE Mouse SWR/J liver, cDNA to mRNA, clone pS61-10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 535)
AUTHORS Bewley,G.C.
TITLE cDNA and deduced amino acid sequence of murine Cu-Zn superoxide
dismutase
JOURNAL Nucleic Acids Res. 16, 2728-2728 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 5 469 Cu-Zn superoxide dismutase (EC 1.15.11)
mRNA < 1 469 Cu-Zn superoxide dismutase mRNA
BASE COUNT 148 a 112 c 159 g 116 t
ORIGIN
1 aagcatggcg atgaaagcgg tgtgcgtgct gaagggcgac ggtccggtgc agggaaccat
61 ccacttcgag cagaaggcaa gcggtgaacc agttgtgttg tcaggacaaa ttacaggatt
121 aactgaaggc cagcatgggt tccacgtcca tcagtatggg gacaatacac aaggctgtac
181 cagtgcagga cctcatttta atcctcactc taagaaacat ggtggcccgg cggatgaaga
241 gaggcatgtt ggagacctgg gcaatgtgac tgctggaaag gacggtgtgg ccaatgtgtc
301 cattgaagat cgtgtgatct cactctcagg agagcattcc atcattggcc gtacaatggt
361 ggtccatgag aaacaagatg acttgggcaa aggtggaaat gaagaaagta caaagactgg
421 aaatgctggg agccgcttgg cctgtggagt gattgggatt gcgcagtaaa cattccctgt
481 gtggtctgaa gtctcagact caactgctac cctcaaacca ttaaactgta atctg
//
LOCUS MZECPSIIA 721 bp ds-DNA ORG 13-AUG-1990
DEFINITION Maize chloroplast ribosomal protein S11 (rpS11) gene, complete cds,
and rpoA gene, 5' end.
ACCESSION M35831
KEYWORDS ribosomal protein S11; rpS11 gene; rpoA gene.
SOURCE Maize chloroplast DNA.
ORGANISM Chloroplast Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE 1 (bases 1 to 721)
AUTHORS Markmann-Mulisch,U. and Subramanian,A.R.
TITLE Nucleotide sequence of maize chloroplast rpS11 with conserved amino
acid sequence between eukaryotes, bacteria and plastids
JOURNAL Biochem. Int. 17, 655-664 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 200 631 ribosomal protein S11 (rpS11)
pept 695 > 721 rpoA gene product
BASE COUNT 252 a 120 c 147 g 202 t
ORIGIN
1 agacaggggt aatctttcga aaaagaagct tttctttcta atagcttttc tttctaatta
61 aagtaaaaag tacccacgga aatgcccaaa ttgcaaataa aaaaaatgaa agtaaaggat
121 atattttaac ctgaaacgga tatctttgta tcttttttct ttttgttatt tctaactcat
181 atttatgaga taataaaata tgacaaaagc tataccaaaa ataggttcac gtaagaaagt
241 gcgtattggt ttgcgtagga atgcccgttt tagtttacgg aagagtgcac gtagaataac
301 aaaagggatt attcatgttc aagctagttt caacaatacc attataaccg ttacagaccc
361 acaaggtcgg gtcgttttct ggtcctccgc aggtacttgt ggattcaaaa gctcaagaaa
421 agcatcaccc tatgctggtc aaagaacagc agtagatgct attcgtacag tgggtttgca
481 acgagcagaa gttatggtaa aaggtgctgg tagcggaaga gatgccgcat tacgagccat
541 tgctaaaagt ggtgtacggt taagttgtat acgcgatgta acacctatgc cgcataatgg
601 atgtcgacct cctaaaaaaa gacgtctgta aaaaaaaatg aaaccgcttt caagagaaat
661 aaacgattca atgatcaaat aatactagtc tgttatggtt cgagaagaga taacaggatc
721 c
//
LOCUS ACUCPCG 103 bp ds-DNA ORG 13-AUG-1990
DEFINITION A.calamus chloroplast 4.5S rRNA gene.
ACCESSION M36166
KEYWORDS 4.5S ribosomal RNA.
SOURCE A.calamus chloroplast DNA.
ORGANISM Chloroplast Acorus calamus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Arecidae; Arales; Araceae; Acorus calamus.
REFERENCE 1 (bases 1 to 103)
AUTHORS Bobrova,V.K., Troitsky,A.V., Ponomarev,A.G. and Antonov,A.S.
TITLE Low-molecular-weight rRNAs sequences and plant phylogeny
reconstruction: Nucleotide sequences of chloroplast 4.5S rRNAs from
Acorus calamus (Araceae) and Ligularia calthifolia (Asteraceae)
JOURNAL Pl. Syst. Evol. 156, 13-27 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 102 4.5S rRNA
BASE COUNT 30 a 20 c 31 g 22 t
ORIGIN
1 taaggtcacg gcgagacgag ccgtttatca ttacgatagg tgccaagtgg aagtgcagtg
61 atgtatgcag ctgaggcatc ctaacagacc gagagatttg aac
//
LOCUS LIGCPCG 103 bp ds-DNA ORG 13-AUG-1990
DEFINITION L.calthifolia chloroplast 4.5S rRNA gene.
ACCESSION M36165
KEYWORDS 4.5S ribosomal RNA.
SOURCE L.calthifolia chloroplast DNA.
ORGANISM Chloroplast Ligularia calthifolia
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Asterales; Asteraceae; Ligularia calthifolia.
REFERENCE 1 (bases 1 to 103)
AUTHORS Bobrova,V.K., Troitsky,A.V., Ponomarev,A.G. and Antonov,A.S.
TITLE Low-molecular-weight rRNAs sequences and plant phylogeny
reconstruction: Nucleotide sequences of chloroplast 4.5S rRNAs from
Acorus calamus (Araceae) and Ligularia calthifolia (Asteraceae)
JOURNAL Pl. Syst. Evol. 156, 13-27 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 103 4.5S rRNA
BASE COUNT 29 a 20 c 32 g 22 t
ORIGIN
1 gaaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagtg
61 atgtatgcag ctgaggcatc ctaacagacc ggtagacttg aac
//
LOCUS MBICPR1A 128 bp ss-rRNA RNA 13-AUG-1990
DEFINITION M.thermoautotrophicum 5S ribosomal RNA.
ACCESSION M36186
KEYWORDS 5S ribosomal RNA; ribosomal RNA.
SOURCE M.thermoautotrophicum (strain delta-H; variant 1) RNA.
ORGANISM Methanobacterium thermoautotrophicum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanobacteriaceae.
REFERENCE 1 (bases 1 to 128)
AUTHORS Willekens,P., Huysmans,E., Vandenberghe,A. and De Wachter,R.
TITLE Archaebacterial 5S ribosomal RNA: Nucleotide sequence in two
methanogen species, secondary structure models, and molecular
evolution
JOURNAL Syst. Appl. Microbiol. 7, 151-159 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 128 5S rRNA
BASE COUNT 18 a 30 c 41 g 39 t
ORIGIN
1 ggggcatggc ggtcatggcg caggggttat acctgatctc gtttcgatct cagtagttaa
61 gtcctgctgc gttgtgggtg tgtactgcgg ttttttgctg tgggaagccc acttcactgc
121 cagccact
//
LOCUS MBICPR2A 131 bp ss-rRNA RNA 13-AUG-1990
DEFINITION M.thermoautotrophicum 5S ribosomal RNA.
ACCESSION M36187
KEYWORDS 5S ribosomal RNA; ribosomal RNA.
SOURCE M.thermoautotrophicum (strain delta-H; variant 2) RNA.
ORGANISM Methanobacterium thermoautotrophicum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanobacteriaceae.
REFERENCE 1 (bases 1 to 131)
AUTHORS Willekens,P., Huysmans,E., Vandenberghe,A. and De Wachter,R.
TITLE Archaebacterial 5S ribosomal RNA: Nucleotide sequence in two
methanogen species, secondary structure models, and molecular
evolution
JOURNAL Syst. Appl. Microbiol. 7, 151-159 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 131 5S rRNA
BASE COUNT 17 a 30 c 41 g 43 t
ORIGIN
1 gggttttggc ggtcatggcg tgggggttta tacctgatct cgtttcgatc tcagtagtta
61 agtcctgctg cgttgtgggt gtgtactgcg gttttttgct gtgggaagcc cacttcactg
121 ccagaccacc c
//
LOCUS MESCPRA 130 bp ss-rRNA RNA 13-AUG-1990
DEFINITION M.barkeri 5S ribosomal RNA.
ACCESSION M36188
KEYWORDS 5S ribosomal RNA; ribosomal RNA.
SOURCE M.barkeri (strain DSM 800) RNA.
ORGANISM Methanosarcina barkeri
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanomicrobiales; Methanosarcinaceae.
REFERENCE 1 (bases 1 to 130)
AUTHORS Willekens,P., Huysmans,E., Vandenberghe,A. and De Wachter,R.
TITLE Archaebacterial 5S ribosomal RNA: Nucleotide sequence in two
methanogen species, secondary structure models, and molecular
evolution
JOURNAL Syst. Appl. Microbiol. 7, 151-159 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 130 5S rRNA
BASE COUNT 27 a 40 c 33 g 30 t
ORIGIN
1 ggagtttggc gccatagcgg cagtgtaact cctgtaccca tcccgaacac agtagataag
61 cctgcccgcg ttccttactg tactgaagtg tgcgagcctt cgggaactct ggatcgctgc
121 catactcacc
//
LOCUS RICCPQBP 1442 bp ds-DNA ORG 13-AUG-1990
DEFINITION O.sativa L. chloroplast Q-B protein (psbA) gene, complete cds.
ACCESSION M36191
KEYWORDS Q-B protein; thylakoid membrane protein.
SOURCE O.sativa L. (var. Labelle) chloroplast DNA.
ORGANISM Chloroplast Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE 1 (bases 1 to 1442)
AUTHORS Wu,N.-H., Cote,J.-C. and Wu,R.
TITLE Structure of the chloroplast psbA gene encoding the Q-B protein
from Oryza sativa L
JOURNAL Dev. Genet. 8, 339-350 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 257 1318 Q-B protein (psbA)
mRNA 177 > 1442 Q-B protein mRNA
BASE COUNT 350 a 275 c 302 g 515 t
ORIGIN
1 ggttcgagtc ccgggcaacc catatggaaa ctagaaagga aaaggagcaa tctgagtttt
61 gatttttcac tcacttcatt tacaaaattt tttggtttgg taaattttgt gtatggatat
121 acaactgtcg gggctggctt ggttgacatt ggtatatagt ctatattata ctgttaaata
181 acaagccttc tattatcttt ctagttaata cgtgtgcttg ggagtccttg caatttgaat
241 aaaccaagat cttaccatga ctgcaatttt agagagacgc gaaagtacaa gcctgtgggg
301 tcgcttctgc aactggataa ctagcaccga aaaccgtctt tacatcggat ggttcggtgt
361 tttgatgatc cctaccttat tgaccgcaac ttctgtattt attatcgcct tcatcgctgc
421 ccctccagta gatattgatg gtattcgtga gcctgtttct ggttctttac tttacggaaa
481 caatattatc tctggtgcca ttattcctac ttctgcggcg atcggattgc acttttaccc
541 aatttgggaa gctgcatctg ttgatgagtg gttatacaat ggtggtcctt atgagctaat
601 tgttctacac ttcttacttg gtgtagcttg ttatatgggt cgtgagtggg aactgagttt
661 ccgtctgggt atgcgtcctt ggattgctgt tgcatattca gctcctgttg cagctgctac
721 tgctgttttc ttgatttacc ctattggtca aggaagtttc tctgatggta tgcctttagg
781 aatatctggt actttcaact ttatgattgt attccaggca gagcataaca tccttatgca
841 tccatttcac atgttaggtg tagctggtgt attcggcggt tccctattca gtgctatgca
901 tggttccttg gtaacctcta gtttgatcag ggaaaccact gaaaacgaat ctgctaatga
961 gggttacaga tttggtcaag aggaagaaac ttataatatt gtggccgctc atggttattt
1021 tggccgatta atcttccaat atgctagttt taacaactct cgttctttac acttcttctt
1081 ggctgcttgg cctgtagtag gaatttggtt taccgcttta ggtattagta ctatggcttt
1141 caatctaaac ggattcaatt tcaaccaatc tgtagttgat agccaaggtc gcgttattaa
1201 tacttgggct gatatcatca accgtgctaa tcttggtatg gaagtaatgc acgaacgtaa
1261 tgctcacaac ttccctctag acctagctgc tcttgaagtt ccatctctta atggataagg
1321 cttttctgct aacatatagc aatttttgaa gaaaggaaag ctagaaatac ccaatatctt
1381 gctgaagcaa gatattgggt atttcttttt ttttttattt tgaatctttc tattctgaat
1441 tc
//
LOCUS RATC7AH7A 3561 bp ss-mRNA ROD 13-AUG-1990
DEFINITION Rat cytochrome P450 cholesterol 7-alpha-hydroxylase (P450 VII)
mRNA, complete cds.
ACCESSION J05509
KEYWORDS cholesterol 7-alpha-hydroxylase; cytochrome P450.
SOURCE Rat (strain Sprague-Dawley) cholestyramine-treated female liver,
cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3561)
AUTHORS Li,Y.C., Wang,D.P. and Chiang,J.Y.L.
TITLE Regulation of cholesterol 7-alpha-hydroxylase in the liver:
Cloning, sequencing, and regulation of cholesterol
7-alpha-hydroxylase mRNA
JOURNAL J. Biol. Chem. 265, 12012-12019 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Chiang,J.Y., 08-MAY-1990, for release after publication.
FEATURES from to/span description
pept 47 1558 cholesterol 7-alpha-hydroxylase (EC
1.14.13.17)
mRNA < 1 1983 P450 VII mRNA
BASE COUNT 1048 a 719 c 695 g 1099 t
ORIGIN
1 gctaggcaaa gagtctcccc tttggaaatt ttcctgcttt tgcaaaatga tgactatttc
61 tttgatttgg ggaattgccg tgttggtgag ctgttgcata tggtttattg ttggaataag
121 gagaaggaaa gctggtgaac ctcctttgga gaacgggttg attccgtacc tgggctgtgc
181 tctgaaattt ggatctaatc ctcttgagtt cctaagagct aatcaaagga agcatggtca
241 cgtttttacc tgcaaactga tggggaaata tgtccatttc atcacaaact ccctgtcata
301 ccacaaagtc ttatgtcatg gaaaatattt tgactggaaa aaatttcatt acactacttc
361 tgcgaaggca tttggacaca gaagcattga cccaaatgat ggaaatacca cggaaaatat
421 aaacaacact tttaccaaaa ccctccaggg agatgctctg tgttcacttt ctgaagccat
481 gatgcaaaac ctccaatctg tcatgagacc tcctggcctt cctaaatcaa agagcaatgc
541 ctgggtcacg gaagggatgt atgccttctg ttaccgagtg atgtttgaag ccggctatct
601 aacactgttt ggcagagata tttcaaagac agacacacaa aaagcactta ttctaaacaa
661 ccttgacaac ttcaaacaat ttgaccaagt ctttccggca ctggtggcag gccttcctat
721 tcacttgttc aagaccgcac ataaagctcg ggaaaagctg gctgagggat tgaagcacaa
781 gaacctgtgt gtgagggacc aggtctctga actgatccgt ctacgtatgt ttctcaatga
841 cacgctctcc acctttgacg acatggagaa ggccaagacg cacctcgcta tcctctgggc
901 atctcaagca aacaccattc ctgcaacctt ttggagctta tttcaaatga tcaggagtcc
961 tgaagcaatg aaagcagcct ctgaagaagt gagtggagct ttacagagtg ctggccaaga
1021 gctcagctct ggagggagtg ccatttactt ggatcaagtg caactgaatg acctgccggt
1081 actagacagc atcatcaagg aggctctgag gctttccagt gcatccttga atatccgcac
1141 agctaaggag gacttcactc tccatcttga ggacggttcc tataacatcc gaaaagatga
1201 catgatagct ctttatccac agttaatgca cttggatcct gaaatctacc cagacccttt
1261 gactttcaaa tatgaccggt accttgatga aagcgggaaa gcaaagacca ccttctacag
1321 taatggaaac aagctgaagt gtttctacat gcccttcgga tcaggcgcga caatatgtcc
1381 tggaagactc tttgccgtcc aagaaatcaa gcagtttttg atcctgatgc tctcctgctt
1441 tgaactggag tttgtggaga gccaagtcaa gtgtccccct ctagaccagt cccgggcagg
1501 cttgggaatt ttgccaccac tacatgatat tgagtttaaa tataaactga aacactgata
1561 cgtggttgga agaagcgaac actggatgat gtcacttggc ggctgagagt catcacttaa
1621 acaggccttc gggaccaatg ctcactgatg cgccctagcg actggattag tgggaagaac
1681 tttgttctcg ctgcccacat tcctgggtgt tcacatagct ggggccagag cttcatcact
1741 ttcagaaagc aatgtctttt gtatttattt tcaaaatgaa gatattccaa ttggcaggat
1801 atttttccta aggaaattgc tttatatttt tatgaaaact accaattaat tatgaaaggc
1861 ttgaaattca cgttttagtg aaattactga tttttcacta gtaaggttct tcaggtgtga
1921 aactgtatta taaaaatgtt gtaatgggtc acactgtgct ttgcataaag gtaaaggaaa
1981 ctatgtttca gccttttctg tgtctatgag attcgaaaat aatcttactg ttctagaaac
2041 actggggaga atccgacatc gtctagctat attttatttt actgttgcga gaaattttaa
2101 ttccagtttt caactacctt atctttcccc cattttgaca tgcatgccaa tgagaagagt
2161 attttttagg aattaacaag gcacctccca gaaccctacc ctgagacttt taagccttta
2221 atcccagcac tcgagaagta agagccaggc aagatctctg agtctgaggt tattctggtc
2281 tacatcagct ccagataagc caggactaca gaatgggatc ttgtctaaaa aatacagcta
2341 atctttatgt cataactgat tatgaatcaa cctaaaagat aaattttcaa tcaggactca
2401 gagaaaatga gcaattaaaa aacttagctc tgaggtatgt ggaattcatt aagtacaagt
2461 tgacattaca tgttctttaa aaatagttta tgttttatct ctaaatgccc tgcagatgaa
2521 gaataataat gaaaagttga ataatactgt ttaaacacta agtgcaataa tgctttggta
2581 atgtacttta agagaatcat tagccgtgcc agttttacta aaatatattt atatgtaaat
2641 tatatttatc tttttcttat accataaata taaaaatatt gcaacattta ggtaattttt
2701 aaaattacat acctttcaga aaattcgaat ggtatgaatg tttgtatgtt ttttaacttt
2761 gaacagaaca tttaaattat tcatctacgg tgatttttat cttatttatt tctttttgtc
2821 tcattcatat cttgaagaaa tccaaaaata tctgaaggaa tcgctcactc aaatctctcc
2881 ctatcgttac agaaaaattc aataccatct ctttttcctc ggggactgaa gcagggtctc
2941 gtgggtcccg agcagaggct cctgctgcag cgagctttat ccacgggact ccttaaacct
3001 tttaaaatct tatcactatt atcatgcagt tattacctaa gtaggatatt tccctttcct
3061 ttttcatttc agcagagtcc cttagcaacc caggctgact gggaccctcc atgtagctta
3121 agctgtgaac tcactgtact tctgttttca cttattttag gaagtaattt tccctatcag
3181 aaattttaat tgtttagatg atgtataaga gtaacacaat tctgttatat actaatctgt
3241 agtaactaat tgttcttaga cagttgatga ctctcaattg atgatcatac atctttccat
3301 ggcttcttga atgcccattt ctcatacaca gaataatggg tttcacggtg atatcttcct
3361 ttcatgtctt tattcttgtg cggtgatggt tggcaaatga tacccatgga gcaaggttac
3421 tcttcctatt tctgtgcagc ctaagtgtta agaaataaat tttttaaata cttagaggga
3481 aggcacattt tgtgtcatat gtgagtgaca tgtgacacac agactagcaa tccatgagta
3541 aaattttatt gggattaccg g
//
LOCUS MUSTEAA 2397 bp ss-mRNA ROD 13-AUG-1990
DEFINITION Mouse tea (20.5) mRNA, complete cds.
ACCESSION M32484
KEYWORDS cell surface protein.
SOURCE Mouse (strain AKR/J) T-lymphoma cell line SL12.4.10, cDNA to mRNA,
clone 20.5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2397)
AUTHORS MacLeod,C.L., Finley,K., Kakuda,D., Kozak,C.A. and Wilkinson,M.F.
TITLE Activated T cells express a novel gene on chromosome 8 that is
closely related to the murine exotropic retroviral receptor
JOURNAL Mol. Cell. Biol. 10, 3663-3674 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
C.L.MacLeod, 02-MAR-1990, for release after publication.
FEATURES from to/span description
pept 410 1771 Tea gene product (20.5)
BASE COUNT 548 a 571 c 587 g 691 t
ORIGIN Chromosome 8.
1 gggtgtcttt cctcatcgct gccctggcct cggttatggc cggcctttgc tatgctgaat
61 ttggggcccg agtacccaag actggatctg cgtatctata cacttacgtc acggtcggag
121 agctgtgggc cttcatcact ggctggaatc tcatcctgtc atatgtcata ggtacgtcca
181 gtgtcgcaag agcatggagt ggcacctttg acgaacttct taataaacag attggccagt
241 ttttcaaaac gtacttcaaa atgaattaca ctggtctggc agagtatcca gacttctttg
301 ccgtgtgcct tgtattactc ctggcaggtc ttttatcttt tggagtaaaa gagtctgctt
361 gggtgaataa atttttacag ctattaatat cctggtcctt ctctttgtca tggtggctgg
421 gtttgtgaaa ggaaatgtgg ctaactggaa gatcagtgaa gagtttctca aaaatatatc
481 agcaagtgct agagaaccac cttctgagaa cggaacaagc atctacgggg ctggcggctt
541 tatgccctat ggctttacag ggacgttggc tggtgctgca acgtgctttt atgcctttgt
601 gggctttgac tgcattgcaa caaccggtga agaggttcgg aatccacaaa aggcgatccc
661 catcggaata gtgacgtcct tacttgtctg ctttatggct tactttgggg tttctgcagc
721 tttaacgctt atgatgcctt actacctcct ggatgagaaa agtccactcc cagtcgcgtt
781 tgagtatgtc agatggggcc ccgccaaata cgttgtcgca gcaggctccc tctgcgcctt
841 atcaacaagt cttcttggat ccattttccc aatgcctcgt gtaatctatg ctatggcgga
901 ggatgggttg cttttcaaat gtctagctca aatcaattcc aaaacgaaga caccagtaat
961 tgctactttg tcatcgggtg cagtggcagc tgtgatggcc tttctttttg acctgaaggc
1021 cctcgtggac atgatgtcta ttggcaccct catggcctac tctctggtgg cagcctgtgt
1081 gcttattctc aggtaccaac ctggcttgtg ttacgagcag cccaaataca cccctgagaa
1141 agaaactctg gaatcatgta ccaatgcgac tttgaagagc gagtcccagg tcaccatgct
1201 gcaaggacag ggtttcagcc tacgaaccct cttcagcccc tctgccctgc ccacacgaca
1261 gtcggcttcc cttgtgagct ttctggtggg attcctggct ttcctcatcc tgggcttgag
1321 tattctaacc acgtatggcg tccaggccat tgccagactg gaagcctgga gcctggctct
1381 tctcgccctg ttccttgtcc tctgcgctgc cgtcattctg accatttgga ggcagccaca
1441 gaatcagcaa aaagtagcct tcatggtccc gttcttaccg tttctgccgg ccttcagcat
1501 cctggtcaac atttacttga tggtccagtt aagtgcggac acttggatca gattcagcat
1561 ctggatggcg cttggctttc tgatctattt cgcctatggc attagacaca gcttggaggg
1621 taaccccagg gacgaagaag acgatgagga tgccttttca gaaaacatca atgtagcaac
1681 agaagaaaag tccgtcatgc aagcaaatga ccatcaccaa agaaacctca gcttaccttt
1741 catacttcat gaaaagacaa gtgaatgttg atgctggccc tcggtcttac cacgcatacc
1801 ttaacaatga gtacactgtg gccggatgcc accatcgtgc tgggctgtcg tgggtctgct
1861 gtggacatgg cttgcctaac ttgtacttcc tcctccagac agcttctctt cagatggtgg
1921 attctgtgtc tgaggagact gcctgagagc actcctcagc tatatgtatc cccaaaacag
1981 tatgtccgtg tgcgtacatg tatgtctgcg atgtgagtgt tcaatgttgt ccgttattag
2041 tctgtgacat aattccagca tggtaattgg tggcatatac tgcacacact agtaaacagt
2101 atattgctga atagagatgt attctgtata tgtcctaggt ggctggggaa atagtggtgg
2161 tttctttatt aggtatatga ccatcagttt ggacatactg aaatgccatc ccctgtcagg
2221 atgtttaaca gtggtcatgg gtggggaagg gataaggaat gggcattgtc tataaattgt
2281 aatgcatata tccttctcct acttgctaag acagctttct taaacggcca gggagagtgt
2341 ttctttcctc tgtatgacaa gatgaagagg tagtctgtgg ctggagatgg ccaatcc
//
LOCUS MUS195A 1183 bp ss-mRNA ROD 13-AUG-1990
DEFINITION Mouse 19.5 mRNA, complete cds.
ACCESSION M32486
KEYWORDS cell surface protein.
SOURCE Mouse (strain AKR/J) T lymphoma cell line AKR/J SL12.4, cDNA to
mRNA, clone 19.5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1183)
AUTHORS MacLeod,C., Fong,A., Seal,B., Walls,L. and Wilkinson,M.
TITLE Isolation of novel cDNA clones from T-lymphoma
JOURNAL Cell Growth and Differ. 1, 271-279 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
C.L. MacLeod, 02-MAR-1990, for release after publication.
FEATURES from to/span description
pept 55 876 19.5 protein
BASE COUNT 297 a 250 c 255 g 381 t
ORIGIN Chromosome 16.
1 cctaactgac aaagtgggga gagtaaggtg tgcgcaaaca ggacaagttg ggtcatgggg
61 agtttcaaag gacatgctct ccctgggagt ttcttcttcg ccatgggctt ttggtggact
121 atgaagaaca tcctgaaatc tgtctacaaa aggcaaactc gaacctgcta ccttaactct
181 aaaacattat tacgtcggac agagatttgg gaaggagttg ttgtgctttt aatgtctctc
241 actggtatag ctggtgaaca gtttatctca ggaggacctg ccttgatctt gcataaagat
301 ggccagtgga accagatcct gggctggcat cacacaacca tgtacttatt ctttgggcta
361 cagggtataa cccaaatcat atgtttcact actaatgtac ttccactttc ctcaagcaag
421 ttaatgttat caattgccat ctttgtggag acatttatgt tctacaacca cacacacggt
481 cgggaaatga ttgacatttt tgtacaccaa cttctggtct tcgttggcac attttcgggt
541 ctggttgcct tcttggagtt cctcgtaaag aacaacgcac ttctggagct cctgcggtgc
601 agtctcctca tgtttcaagg aacctggttc tggcagatgg cgtttgtgct gtacccccca
661 tgtggaagtg ctacatggaa cctgtcagat attcaaaata aaatgtttct ctcaatgtgc
721 ttttgctggc attatgcatc aatccttatc ctcattggag taaaatatgc tttggccaac
781 tggttagtca agtctaggct gaggaagggc tgcacctcag aagttggact cctgaagcat
841 gctgaccgtg agcaagaatc agaagaagaa gtatgatctt gaagtctttc ttgataagcc
901 ttctcccttt gcgttgcctt tgttcatggc tttgtttcct gacctctggt ctcaagaaca
961 cttgtctgag gctgactcca tgctgtttgt acttccagtt ttgttaaagt gttggacttt
1021 aagtatctta ctttcagctc tgaaagaacc atgagtgata aattcacttt ttacactgtg
1081 catgccatgt aattcaagac caatcataat tgttttccaa agtttagttt cgtgtccatt
1141 tattaaaaat attttttttt attttccggg tagatacctt caa
//
LOCUS MUSCYR61A 2018 bp ss-mRNA ROD 13-AUG-1990
DEFINITION Mouse Cyr61 mRNA, complete cds.
ACCESSION M32490
KEYWORDS cyr61 product.
SOURCE Mouse (strain BALB/c) fibroblast cell line 3T3, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2018)
AUTHORS O'Brien,T.P., Yang,G.P., Sanders,L. and Lau,L.F.
TITLE Expression of cyr61, a growth factor-inducible immediate-early gene
JOURNAL Mol. Cell. Biol. 10, 3569-3577 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
L.F.Lau, 02-MAR-1990, for release after publication.
FEATURES from to/span description
pept 190 1329 Cyr61 product
mRNA < 1 2018 Cyr61 mRNA
site 1453 1551 49 bases repeat
BASE COUNT 500 a 536 c 527 g 455 t
ORIGIN
1 agaccgtgag cgagagcgcc ccagagaagc gcctgcaatc tctgcgcctc ctccgccagc
61 acctcgagag aaggacaccc gccgcctcgg ccctcgcctc accgcactcc gggcgcattt
121 gatcccgctg ctcgccggct tgttggttct gtgtcgccgc gctcgccccg gttcctcctg
181 cgcgccacaa tgagctccag caccttcagg acgctcgctg tcgccgtcac ccttctccac
241 ttgaccagac tggcgctctc cacctgcccc gccgcctgcc actgccctct ggaggcaccc
301 aagtgcgccc cgggagtcgg gttggtccgg gacggctgcg gctgctgtaa ggtctgcgct
361 aaacaactca acgaggactg cagcaaaact cagccctgcg accacaccaa ggggttggaa
421 tgcaatttcg gcgccagctc caccgctctg aaagggatct gcagagctca gtcagaaggc
481 agaccctgtg aatataactc cagaatctac caaaacgggg aaagcttcca gcccaactgt
541 aaacaccagt gcacatgtat tgatggcgcc gtgggctgca ttcctctgtg tccccaagaa
601 ctgtctctcc ccaatctggg ctgtcccaac ccccggctgg tgaaagtcag cgggcagtgc
661 tgtgaagagt gggtttgtga tgaagacagc attaaggact ccctggacga ccaggatgac
721 ctcctcggac tcgatgcctc ggaggtggag ttaacgagaa acaatgagtt aatcgcaatt
781 ggaaaaggca gctcactgaa gaggcttcct gtctttggca ccgaaccgcg agttcttttc
841 aaccctctgc acgcccatgg ccagaaatgc atcgttcaga ccacgtcttg gtcccagtgc
901 tccaagagct gcggaactgg catctccaca cgagttacca atgacaaccc agagtgccgc
961 ctggtgaaag agacccggat ctgtgaagtg cgtccttgtg gacaaccagt gtacagcagc
1021 ctaaaaaagg gcaagaaatg cagcaagacc aagaaatccc cagaaccagt cagatttact
1081 tatgcaggat gctccagtgt caagaaatac cggcccaaat actgcggctc ctgcgtagat
1141 ggccggtgct gcacacctct gcagaccaga actgtgaaga tgcggttccg atgcgaagat
1201 ggagagatgt tttccaagaa tgtcatgatg atccagtcct gcaaatgtaa ctacaactgc
1261 ccgcatccca acgaggcatc gttccgactg tacagcctat tcaatgacat ccacaagttc
1321 agggactaag tgcctccagg gttcctagtg tgggctggac agaggagaag cgcaagcatc
1381 atggagacgt gggtgggcgg aggatgaatg gtgccttgct cattcttgag tagcattagg
1441 gtatttcaaa actgccaagg ggctgatgtg gacggacagc agcgcagccg cagttggaga
1501 atgccaaggg gctgatgtgg acggacagca gcgcagccgc agttggagaa gacttcgctt
1561 catagtactg gagcgggcat tattgctcca tattggagca tgtttacgga tgacgttctg
1621 ttttctgttt gtaaattatt tgctaagtgt atttttttgc tccagacccc cccccccttt
1681 cttggttcta caattgtaat agagacaaaa taagattagt tgggccaagt gaaagccctg
1741 cttgtccttt gacagaagta aatgaaagcg cctctcattc cttcccgagc ggagggggac
1801 actctgtgag tgtccttggg gcagctacct gcactctaaa actgcaaaca gaaaccaggt
1861 gttttaagat tgaatgtttt tttatttatc aaagtgtagc ttttggggag ggaggggaaa
1921 tgtaatactg gaataatttg taaatgattt taattttata tcagtgaaga gaatttattt
1981 ataaaattaa tcatttaata aagaaatatt tacctaaa
//GenBank-Updates@genbank.bio.net (08/14/90)
LOCUS RATNCAMC 1195 bp ds-DNA ROD 14-AUG-1990
DEFINITION Rat neural cell adhesion molecule (NCAM-C) gene, exon 1.
ACCESSION M32612
KEYWORDS neural cell adhesion molecule.
SOURCE Rat (strain Sprague-Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1195)
AUTHORS Chen,A., Reyes,A. and Akeson,R.
TITLE Transcription initiation sites and structural organization of the
extreme 5' region of the rat neural cell adhesion molecule gene
JOURNAL Mol. Cell. Biol. 10, 3314-3324 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Akeson 06-MAR-1990, for release after publication.
FEATURES from to/span description
pept 1142 / 1193 neural cell adhesion molecule, exon 1 (NCAM-C)
pre-msg 947 > 1193 NCAM-C mRNA and introns (alt.)
pre-msg 948 > 1193 NCAM-C mRNA and introns (alt.)
pre-msg 950 > 1193 NCAM-C mRNA and introns (alt.)
pre-msg 951 > 1193 NCAM-C mRNA and introns (alt.)
pre-msg 952 > 1193 NCAM-C mRNA and introns (alt.)
pre-msg 955 > 1193 NCAM-C mRNA and introns (alt.)
pre-msg 956 > 1193 NCAM-C mRNA and introns (alt.)
IVS 1194 > 1195 NCAM-C intron 1.
site 307 484 homopurine-homopyrimidine region
BASE COUNT 324 a 275 c 333 g 263 t
ORIGIN
1 gaattcttgc tcttgctcct gtgccgtaat acacaccgaa aagccagggc agcagtggag
61 ggagaggaag agaggattcc aatacttgca agcagtttct ggtaatttct gaatactact
121 tctaaatgtc tctgcatttt cttctaactg cccccccgca cccccccccc acttctgaga
181 tctgagttgt gctgctgtga gttgctccct ctgtgggaag gtaaacttca taggtaagag
241 aaaacaaaca acaaagcaaa cctccacaat cctcatttct ctccaaattt ccaatttcct
301 tcacttagga ggaagaggag gaagaggagg aagagggtga ggaggaggaa gaggaagaag
361 aggaagaaga ggaagaagag gaagaagagg aagaagagga agaagaggat gagggaaagg
421 aggagaagag gaggaggagg aggaggaaga aaaggagaag aaggagaagg agaaaaggag
481 gaagccattt gcttctattg caaggacttt agtaactaca tggggtgggt gtgggggctc
541 tacaactttg aaaagcgagc tgaatctaaa aattctttct ttttttttcc taattattta
601 aacgttcaaa ttcccgatta aggaacgctg gagggcagaa ggatctccaa aggcttgagt
661 gaaggaaaag ggtgaaaaga agaaatccca gcttctgcct ggaaggttct gtgtgaaaga
721 gcccggctcc cttggtgact ccaggccgtg ttttgcaggc ggccgcatct gcctcccctg
781 tctcttacct cctggatgtt aggaactatt tgtggtcggc gtggtggaag gacacagtga
841 ggttctcacc tccgcccccc gcccgtcgct cgcatcccca gttccatcaa agccaacccg
901 ggccagcgca gggatctccg agttgcgagt gtgctgaggc tggggctgtc actcattctc
961 cgctcagcgg tgaacgcagc tcggcagtgg ctggcaagaa acaattctgc aaaaataatc
1021 atacccagcc tggcaattgt ctgctcctcg gtccattgct ccgccgccgt ccacagtcgc
1081 ttgcaaggga aggcactgaa tttaccgcgg ccagaacatc cctcccagcc ggcagtttac
1141 aatgctgcga actaaggatc tcatctggac tttgtttttc ctgggaactg caggt
//
LOCUS YSCTEC1A 2308 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae TEC1 protein gene, complete cds.
ACCESSION M32797
KEYWORDS TEC1 protein.
SOURCE S.cerevisiae (strain GRF18) haploid cell DNA, clone pBC43-2.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2308)
AUTHORS Laloux,I., Dubois,E., Dewerchin,M. and Jacobs,E.
TITLE TEC1, a gene involved in the activation of Ty1 and Ty1-mediated
gene expression in Saccharomyces cerevisiae: Cloning and molecular
analysis
JOURNAL Mol. Cell. Biol. 10, 3541-3550 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by I.M.Laloux 13-MAR-1990.
FEATURES from to/span description
pept 395 1855 TEC1 protein (TEC1)
signal 289 294 TATA box
BASE COUNT 781 a 513 c 410 g 604 t
ORIGIN Chromosome 2, right arm.
1 tgcgtgtttc agattgcgtt tgcttattat tctgggaaac tctccggaaa caagctgagc
61 tggactccac gaatggtccc tcacaaggct cttggctcat gcacagttaa aaagcacctg
121 tagctaccat cagcaatcca tggtgctgga gtttctctcg atggtacggc ccatagctgg
181 agaaaatagc ttattaccaa aaaagtaatt cccccgcaga ggtatatatt aagtttgaaa
241 caaaaggctt gcttaatttt ttacatatga caaccctttt cttactttta taaatgttta
301 tgaaacagga gattcttcga ctggtgccag attcctccca tatcgacaag aagaataatc
361 cacctatttc aacaattctg atacctgttt aaccatgagt cttaaagaag acgactttgg
421 caaggataat tctagaaata tagaatcata tactggtaga atttttgacg tatatataca
481 aaaagattcg tattcacagt cggccttgga tgatatgttt ccagaagccg tagtttcaac
541 cgccgcttgt gtgaaaaatg aagcggagga taacatcaat ctcatagaca cgcatcctca
601 attcgaactg gtaaatactg gactgggtgc taaatcggac gatttgaaat ctccatcagc
661 aaaggctacg ttcactgaca agcagaggaa gaatgaagta ccaaatatat ctgtgagcaa
721 ctactttccc ggacaaagta gcgaaacgtc gtcaacaacg gaatcttgga ctatcggttg
781 tgataagtgg tcagaaaagg tagaagaggc attccttgag gcacttagac tgataatgaa
841 aaatgggacc acaaaaataa aaataagaaa tgccaatttt ggaagaaacg agctgatttc
901 attatatatc aagcacaaaa ccaacgagtt cagaaccaaa aagcaaattt cttcccatat
961 tcaagtctgg aagaagacca tacaaaacaa aatcaaggac tcgctgaccc tatcatcaaa
1021 ggagaaggag cttctacacc ttatcgaaca tggcgctgaa caaactactg aaaactcaaa
1081 cctgttttat gacatatttg aagaaattat cgactctcta ccttcagtca gtgattctgg
1141 aagtttaacc cctaaaaacc tctatgtaag taataatagc agtggattgt cagtacattc
1201 aaaactgctt acgccaatca ctgcttccaa cgagaaaaaa attgaaaatt tcataaaaac
1261 taatgctgca tctcaagcca aaacccccct catttacgct aagcacattt atgaaaacat
1321 agacggctac aagtgcgttc cgtcaaagag gcctcttgaa caactttccc ccacggaact
1381 ccaccaggga gatcgcccca ataaggctag cttttccaac aagaaggcaa tcctggagag
1441 tgcaaaaaaa atcgaaatag agcagagaaa gataatcaac aaataccaaa gaatttcccg
1501 catacaagaa catgaaagta atcctgagtt cagttccaat tccaattccg gttcagagta
1561 cgaatcggag gaagaagtag tcccaagatc agccacagtc acacaactcc aaagcagacc
1621 agtgccatac tacaagaata atggaatgcc ctactcactc tccaaagtac gaggaaggcc
1681 catgtatcca agacctgctg aagatgctta caatgccaat tatattcaag gtctgcccca
1741 gtaccaaaca tcttattttt cgcagctgtt attatcatca ccccagcatt acgaacattc
1801 tccacatcaa aggaacttta cgccatccaa ccaatcgcat gggaactttt attaaatgtc
1861 tacatacata catacatctc gtacataaat acgcatacgt atcttcgtag taagaaccgt
1921 cacagatatg attgagcacg gtacaattat gtattagtca aacattacca gtctcgaaca
1981 aaagctactc ctgcaacact cttctatcgc acatgtatgg ttcttattgt ttcccgagtt
2041 cttttttact gacgcgccag aacgagtaag aaagttctct agcgccatgc tgaaattttt
2101 ttcacttcaa cggacagcga ttttttttct ttttcctccg aaataatgtt gcagcggttc
2161 tcgatgcctc aagaattgca gaagtaaacc agccaataca catcaaaaaa caactttcat
2221 tactgtgatt ctctcagtct gttcatttgt cagatattta aggctaaaag gaaatgttgt
2281 cgagactatc tttattgagt aactcgag
//
LOCUS YSCACO1A 3130 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae aconitase (ACO1) gene, complete cds.
ACCESSION M33131
KEYWORDS aconitase; isocitrate hydro-lyase.
SOURCE S.cerevisiae (strain S288C) DNA, clone SG7-1.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3130)
AUTHORS Gangloff,S.P., Marguet,D. and Lauquin,G.J.-M.
TITLE Molecular cloning of the yeast mitochondrial aconitase gene (ACO1)
and evidence of a synergistic regulation of expression by glucose
plus glutamate
JOURNAL Mol. Cell. Biol. 10, 3551-3561 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.J.-M.Lauquin 21-MAR-1990, for release after publication.
FEATURES from to/span description
pept 648 2987 aconitase (ACO1) (EC 4.2.1.3)
binding 357 364 HAP2-HAP3 binding site
signal 503 513 TATA box
BASE COUNT 907 a 604 c 668 g 951 t
ORIGIN
1 gaattccaaa ggctcacaaa tggcattccg tactgatata cttcgcactt tacatatgct
61 tatataaaaa tgccagtttc ccatcgtcaa ttacgcagag gtagccttcg tttattttct
121 ctctttttgt atatcattgt attaatcata atccattgta ttttactttg tcttatctgg
181 ctttaatgaa tgacgccggt cacacgcggg tgcctgattc tcgattgtgc caagccattt
241 gggcacggtg tcaaattacc taaaaaatgg ccgagagccg caaaagggag gtccgcgggg
301 ccgggcaata ccctttgttt ttcgagcatt tcggcgccga aatcggaaag gtcctgacca
361 atcaatagag aaattagtgc atacgagaaa aatttgaacc cttcagttgt tctccgcagg
421 cgactttaac catcaaacct ccaatcgctg ccggttttcg gaaaggcaag cacaaaaagg
481 gaggacaagg aaaatctttt gttatatata taatcttccg gttttagagg ttaattagga
541 ggagtatgtt gcttaattcc gttgtccctt gttctgttca ctctttctta gttattacat
601 agtagaacga aggtaataaa tactatcatt attactatag atatacaatg ctgtctgcac
661 gttctgccat caagagaccc attgttcgtg gtcttgcgac agtctccaac ttgactagag
721 attcaaaagt caaccaaaac ttattagaag atcattcttt tattaactac aagcagaatg
781 tggaaacgct ggatatcgta agaaaaagat taaacaggcc atttacctac gcggaaaaga
841 ttttgtacgg tcacttggat gaccctcatg gtcaagatat tcagagaggt gtttcatacc
901 taaaattaag accagatcgt gttgcctgtc aagatgctac tgctcaaatg gctattttac
961 aatttatgtc cgctggttta ccacaggttg ctaagccagt cactgtccac tgtgaccatt
1021 tgattcaagc acaagttggt ggtgaaaaag atttgaagag agctatagat ctaaacaagg
1081 aagtttatga tttcttggcc tctgccactg cgaaatataa catgggtttc tggaagccag
1141 gttccggtat cattcaccaa attgttctgg aaaactacgc tttcccaggt gctttgatca
1201 ttggtactga ctcccataca ccaaatgctg gtggtttagg tcaattggct attggtgttg
1261 gtggtgctga tgccgttgat gttatggcag gtcgtccatg ggaattgaag gctccaaaga
1321 tcttaggtgt taagttgact ggtaagatga acggttggac ttctccaaag gatattattt
1381 tgaaattggc tggtatcaca actgtcaaag gtggtactgg taaaattgtt gaatattttg
1441 gtgatggtgt tgacactttc tccgctactg gtatgggtac catttgtaat atgggtgctg
1501 aaatcggtgc taccacatct gttttcccat tcaacaaatc tatgattgaa tatttggaag
1561 caactggtcg tggtaagatc gctgactttg ctaaattata ccacaaggat ctattatctg
1621 ctgataagga tgctgaatac gatgaggtcg tcgaaattga cttgaacact ctggaaccat
1681 acatcaatgg gccatttacc cccgatttgg ctactccagt ttctaagatg aaggaagttg
1741 ctgttgctaa taactggcca ttggatgtca gagtcggttt gatcggttct tgtaccaatt
1801 cctcttatga agatatgtct cgttcagcat ccattgtcaa ggacgctgct gctcatggtt
1861 tgaaatccaa gaccattttc actgttactc caggttctga acaaatcaga gccactattg
1921 aacgtgatgg ccaattagaa accttcaaag aatttggtgg tatcgttttg gcaaacgcct
1981 gtggcccatg tattggtcaa tgggatcgta gagatatcaa gaaaggtgac aagaatacta
2041 ttgtttcctc ttacaacaga aatttcactt ctagaaatga tggtaaccca caaactcatg
2101 cttttgttgc atctccagaa ttagtaactg cgttcgccat tgcgggtgat ttgagattca
2161 accctctaac agacaaatta aaggacaagg atggtaatga gttcatgttg aaaccaccac
2221 atggtcgatg gtttgcctcg aaagaggtta tgatgctggt gagaacactt accaagctcc
2281 acctgcagac cgtagccacc gttgaagtta aagtttctcc aacttcagac cgtctacaac
2341 tgttgaaacc attcaaacct tgggatggta aggatgctaa agacatgcca atcttgatta
2401 aggccgtcgg taagacaact actgatcata tttctatggc tggtccatgg ttgaaataca
2461 gaggtcattt agaaaacatt tctaataact atatgattgg tgctattaat gctgaaaaca
2521 agaaggctaa ctgtgttaaa aatgtatata ctggtgaata caaaggtgtt ccagacactg
2581 ctagagatta cagagaccaa ggtatcaagt gggttgttat tggtgatgaa aactttggtg
2641 aaggttcctc tcgtgaacac gctgctttgg aaccaagatt cttgggcggt ttcgctatca
2701 tcacaaagtc tttcgctcgt atccatgaaa ctaacttgaa aaaacaaggt ctattgccat
2761 tgaacttcaa gaacccagct gactatgaca agatcaaccc tgatgacaga atcgatattc
2821 tgggtctagc tgaattggct ccaggtaagc ctgtaacaat gagagttcat ccaaagaatg
2881 gtaagccatg ggatgctgtg ttgacccata ctttcaacga tgagcaaatt gaatggttca
2941 aatatggttc tgccttaaat aaaattaagg ccgatgagaa gaaataatga aaacattgtt
3001 ataatctttt aaaggttatt atttattttg tcttctgtac acgtaccctt gtttatcttt
3061 tctgccttaa atttaatgac gttcggctgg agaagtcaag actatgaaat atatctcgta
3121 atttatgatc
//
LOCUS HIVTRA1 59 bp ss-RNA VRL 14-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1), gag region.
ACCESSION M27956
KEYWORDS .
SEGMENT 1 of 2
SOURCE Human immunodeficiency virus type 1 (isolate TRA), clone 01.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 59)
AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
Wain-Hobson,S.
TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for
viral mixtures and preferred nucleotide substitutions
JOURNAL j acquired immuno defic syndr 2, 344-52 (1989)
STANDARD full staff_review
COMMENT [1] kindly submitted in computer readable form by S. Wain-Hobson.
Isolated in Paris. Sibling sequences are available (see 1989:
I-A-169).
FEATURES from to/span description
pept < 1 > 59 gag protein (AA at 2)
BASE COUNT 26 a 8 c 10 g 15 t
ORIGIN
1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacca
//
LOCUS HIVTRA2 268 bp ss-RNA VRL 14-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1), envelope region.
ACCESSION M27948
KEYWORDS .
SEGMENT 2 of 2
SOURCE Human immunodeficiency virus type 1 (isolate TRA), clone 01.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 268)
AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
Wain-Hobson,S.
TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for
viral mixtures and preferred nucleotide substitutions
JOURNAL j acquired immuno defic syndr 2, 344-52 (1989)
STANDARD full staff_review
COMMENT Sibling sequences are available (see 1989: I-A-170).
FEATURES from to/span description
pept < 1 > 268 env protein (AA at 1)
BASE COUNT 95 a 46 c 54 g 73 t
ORIGIN
1 caattaaccc cactctgtgt tactttaaat tgcactgatt atttggggaa tgctactaat
61 accactagta gtagcggggg agggatggag agaggagaaa taaaaaactg ctctttcaat
121 atcacgacaa gcataagaga taagatgcag agagaatatg cactttttta taaacttgat
181 gtagtaccaa cagataatga taatactagc tataggttga taagttgtaa cacctcagtc
241 attacacagg cctgtccaaa agtatcct
//
LOCUS FLAH1424 1749 bp ss-RNA VRL 14-AUG-1990
DEFINITION Influenza A/Mallard/Gurjev/263/82 hemagglutinin subtype H14 gene.
ACCESSION M35997
KEYWORDS hemagglutinin.
SOURCE Influenza A/H14/263/82 RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (sites)
AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R.
TITLE Molecular characterization of a new hemagglutinin, subtype H14, of
influenza A virus
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1749)
AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by Y.Kawaoka, 27-JUN-1990.
Author address: Y.Kawaoka
St. Jude Childrens Research Hospital
332 N. Lauderdale
P.O. Box 318, Memphis, TN
38101
BASE COUNT 575 a 360 c 412 g 402 t
ORIGIN
1 agcaaaagca ggggaaaatg attgcactca tattggttgc actggctctg agccacactg
61 cttattctca gatcacaaat gggacaacag gaaaccccat tatatgcttg gggcatcatg
121 cagtggaaaa cggcacatct gttaaaacac taacagacaa tcacgtagaa gttgtgtcag
181 ctaaagaatt agttgagacg aaccacactg atgaactgtg cccaagcccc ttgaagcttg
241 tcgacgggca agactgccac ctcatcaatg gtgcattggg gagtccaggc tgtgaccgtt
301 tgcaggacac cacttgggat gtcttcattg aaaggcccac tgcagtagac acatgttatc
361 cattcgacgt cccagattac cagagtctca gaagcatcct agcaagcagt gggagtttgg
421 agttcatcgc cgaacaattc acctggaatg gtgtcaaagt tgacggatca agcagtgctt
481 gtttgagggg cggtcgcaac agcttcttct cccgactaaa ctggctaacc aaagcaacaa
541 atggaaacta tggacctatt aacgtcacta aagaaaatac gggctcttat gtcaggctct
601 atctctgggg agtgcatcac ccatcaagcg ataatgagca aacggatctc tacaaggtgg
661 caacagggag agtaacagta tctacccgct cggaccaaat cagtattgtt cccaatatag
721 gaagtagacc gagggtaagg aatcagagcg gcaggataag catctactgg accctagtaa
781 acccagggga ctccatcatt ttcaacagta ttgggaattt gattgcacca agaggccact
841 acaaaataag caaatctact aagagcacag tgcttaaaag tgacaaaagg attgggtcat
901 gcacaagccc ttgcttaact gataaaggtt cgatccaaag tgacaaacct tttcagaatg
961 tatcaaggat tgctatagga aactgcccga aatatgtaaa gcaagggtcc ctgatgttag
1021 caactggaat gcgcaacatc cctggcaaac aggcaaaggg cttatttggg gcaattgctg
1081 gattcattga aaatggttgg caaggcctga ttgatgggtg gtatggattc aggcaccaaa
1141 atgctgaagg aacaggaact gctgcagacc tgaagtcaac tcaggcagcc attgatcaga
1201 taaatggcaa gctgaacaga ttgatagaga agacaaatga aaaatatcac caaatagaaa
1261 aggaattcga acaggtggaa ggaagaatac aagaccttga gaagtacgtt gaggacacta
1321 agattgattt gtggtcatac aatgctgaat tgctagtagc actagagaat cagcacacaa
1381 tagatgtcac agactccgaa atgaacaagc tttttgaaag agtaagaagg caattaagag
1441 agaatgcaga agatcaaggc aacggttgtt tcgagatatt ccatcagtgt gacaacaatt
1501 gtatagaaag cattagaaac ggaacttatg accacaacat ctacagggat gaagccatca
1561 acaatcgaat caaaataaat cctgtcactt tgacgatggg gtacaaggac ataatcctgt
1621 ggatttcttt ctccatgtca tgctttgtct tcgtggcact gattctggga tttgttctat
1681 gggcttgtca aaacgggaat atccgatgcc aaatctgtat ataaagaaaa aacacccttg
1741 tttctactc
//
LOCUS FLAH14244 1716 bp ss-RNA VRL 14-AUG-1990
DEFINITION Influenza A/Mallard/Gurjev/244/82 hemagglutinin subtype H14 gene.
ACCESSION M35996
KEYWORDS hemagglutinin.
SOURCE Influenza A/H14/244/82 RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (sites)
AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R.
TITLE Molecular characterization of a new hemagglutinin, subtype H14, of
influenza A virus
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1716)
AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by Y.Kawaoka, 27-JUN-1990.
Author address: Y.Kawaoka
St. Jude Childrens Research Hospital
332 N. Lauderdale
P.O. Box 318, Memphis, TN
38101
BASE COUNT 562 a 346 c 405 g 403 t
ORIGIN
1 ttggttgcac tggctctgag ccaaactgct tattctcaga tcacaaatgg gacaacagga
61 aaccccatta tatgcttggg gcaccatgca gtggaaaacg gcacatctgt taaaacacta
121 acagacaatc acgtggaagt tgtgtcagct aaagaattag ttgagacgaa acacactgat
181 gaactgtgcc caagcccctt gaagcttgtc gacgggcaag actgcgacct catcaatggt
241 gcattgggga gtccaggctg tgatcgtttg caggacacca cttgggatgt cttcattgaa
301 aggcccactg cagtagacac atgttatcca ttcgacgtcc cagattacca gagtctcaga
361 agcattctag caagcagtgg gagtctggag ttcatcgccg aacaattcac ctggaatggt
421 gtcaaagttg acggatcaag cagtgcttgt ttgaggggcg gtcgcaacag cttcttctcc
481 cggctaaact ggctaaccaa agcaacaaat ggaaactatg gacctattaa cgtcactaaa
541 gaaaatacgg gctcttatgt caggctctat ctctggggag tgcatcaccc atcaagcgat
601 aatgagcaaa cggatctcta caaggtggca acagggagag tgacagtatc tacccgctcg
661 gatcaaatca gtattgttcc caatatagga agtagaccga gagtaaggaa tcagagcggc
721 aggataagca tctactggac cctagtaaac ccaggggact ccatcatttt caacagtatt
781 gggaatttga ttgcaccaag aggccattac aaaataagta aatctactaa gagcacagtg
841 cttaaaagtg acaaaaggat tgggtcatgc acaagccctt gcttaactga taaaggttcg
901 atccaaagtg acaaaccttt tcagaatgta tcaaggattg ctataggaaa ctgcccgaag
961 tatgtaaagc aagggtcctt gatgttagca actggaatgc gcaacatccc tggcaaacag
1021 gcaaagggct tatttggggc aattgctgga ttcattgaaa atggttggca aggcctgatt
1081 gatgggtggt atggattcag gcaccaaaat gctgaaggaa caggaactgc tgcagacctg
1141 aagtcaactc aggcagccat tgatcagata aatggcaagc tgaacagatt aatagaaaaa
1201 acaaatgaaa aatatcacca aatagaaaag gaattcgaac aggtggaagg aagaatacaa
1261 gaccttgaga agtacgttga ggacactaag attgatttgt ggtcatacaa tgctgaattg
1321 ctagtcgcac tagagaatca gcacacaata gatgtcacag actccgaaat gaacaagctt
1381 tttgaaagag taagaaggca attaagagag aatgcagaag atcaaggcaa cggttgtttc
1441 gagatattcc atcagtgtga caacaattgt atagaaagta ttagaaacgg aacttatgac
1501 cacaacatct acagggatga agccatcaac aatcgaatca aaataaatcc tgtcactttg
1561 acgatggggt acaaggacat aatcctgtgg atttctttct ccatgtcatg ctttgtcttc
1621 gtggcactga ttttgggatt tgttctatgg gcttgtcaaa acgggaatat ccgatgccaa
1681 atctgtatat aaagaaaaaa cacccttgtt tctact
//
LOCUS DOGCPLA01 458 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 1 (non-coding).
ACCESSION M28140 J04961
KEYWORDS pancreatic lipase.
SEGMENT 1 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 458)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pre-msg 386 > 458 pancreatic lipase mRNA and intron
IVS 419 > 458 pancreatic lipase intron A
signal 274 277 CAAT box
signal 351 354 TATA box
BASE COUNT 133 a 94 c 99 g 132 t
ORIGIN
1 ttggagggca tttccaccta ttctaagtga gaaatactac cctccattat tggattttgc
61 tgctgtgttc ttactggaac ttgtaaacta tttgatgata acatagaaaa tgggaaagag
121 gggccgcatt ggctcagaat tacatgcaca ggtacatcgg tacaaaggta gggatctttt
181 acctcctttt aatcaagcat tctttcatct tcccagactg tgggaagatc tgactactat
241 aattagtctt gttttgtgta tttgagcaga ttacaatatt aagtgcacac agcgtgttct
301 atcatcacct gctgtttcct tgaagcggcc tccaaatgat aaggctccag tataaaaaaa
361 aaaaaaaagg cctgggtact tgattgaaag ggagggaagg cccagaacat cccacagggt
421 aagtcacctc ttctgtcctc ccaccagggt tgaagcag
//
LOCUS DOGCPLA02 129 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 2.
ACCESSION M28141 J04961
KEYWORDS pancreatic lipase.
SEGMENT 2 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 129)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept 41 + 89 pancreatic lipase precursor, exon 2 (EC
3.1.1.3) (first expressed exon)
sigp 41 + 89 pancreatic lipase signal peptide
IVS < 1 40 pancreatic lipase intron A
IVS 90 > 129 pancreatic lipase intron B
BASE COUNT 36 a 26 c 37 g 30 t
ORIGIN About 38 bp after segment 1.
1 aataatcggt gagactgaga atgttccgaa tttattgtag atggtgagca tctggacaat
61 cgcgctgttt ctgctgggag cagccaaagg taggaaacgt cacaccggcc cctcgggaag
121 gacttaaat
//
LOCUS DOGCPLA03 235 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 3.
ACCESSION M28142 J04961
KEYWORDS pancreatic lipase.
SEGMENT 3 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 235)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 41 + 195 pancreatic lipase precursor, exon 3
sigp + 41 42 pancreatic lipase signal peptide
matp 43 + 195 pancreatic lipase
IVS < 1 40 pancreatic lipase intron B
IVS 196 > 235 pancreatic lipase intron C
BASE COUNT 59 a 66 c 59 g 51 t
ORIGIN About 506 bp after segment 2.
1 ggggagctaa gccgtgagca gagtcatcct tcctctacag caaaggaagt ttgctatgaa
61 caaatcggat gcttttctga cgcggagccc tgggccggga cagcaatcag gcctctgaaa
121 gttctcccct ggagccccga gagaatcggc acccgcttcc tgctctacac caacaagaac
181 ccaaacaact ttcaagtgag acctctgtca ttttagtgcc attgtgactg gagtg
//
LOCUS DOGCPLA04 206 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 4.
ACCESSION M28143 J04961
KEYWORDS pancreatic lipase.
SEGMENT 4 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 206)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 41 + 166 pancreatic lipase precursor, exon 4
matp + 41 + 166 pancreatic lipase
IVS < 1 40 pancreatic lipase intron C
IVS 167 > 206 pancreatic lipase intron D
BASE COUNT 55 a 60 c 44 g 47 t
ORIGIN About 445 bp after segment 3.
1 ggctacaaaa taatgcgacc ctttcccact tttatcacag actctccttc cctctgatcc
61 atcaacgatt gaggcatcaa attttcaaac agacaagaag acccggttca tcatccatgg
121 cttcatagac aagggagaag agaactggct gctggatatg tgcaaggtag gggccggctc
181 ccaggctcac agtctgccct gctccc
//
LOCUS DOGCPLA05 215 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 5.
ACCESSION M28144 J04961
KEYWORDS pancreatic lipase.
SEGMENT 5 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 215)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 41 + 175 pancreatic lipase precursor, exon 5
matp + 41 + 175 pancreatic lipase
IVS < 1 40 pancreatic lipase intron D
IVS 176 > 215 pancreatic lipase intron E
BASE COUNT 44 a 58 c 67 g 46 t
ORIGIN About 2090 bp after segment 4.
1 gtgcccctgt catgacatcc tggtgtctct ccaacatcag aacatgttca aggttgagga
61 ggtgaactgc atctgcgtgg actggaagaa aggttcccaa acctcatata cacaggctgc
121 caacaacgtg cgggtggtgg gcgcccaggt ggcccagatg ctcagcatgc tctcggtgag
181 tctgccggct gggctcctgc tgaggtaggc agtgc
//
LOCUS DOGCPLA06 189 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 6.
ACCESSION M28145 J04961
KEYWORDS pancreatic lipase.
SEGMENT 6 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 189)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 41 + 149 pancreatic lipase precursor, exon 6
matp + 41 + 149 pancreatic lipase
IVS < 1 40 pancreatic lipase intron E
IVS 150 > 189 pancreatic lipase intron F
BASE COUNT 49 a 55 c 49 g 36 t
ORIGIN About 957 after segment 5.
1 accacaaaaa tgttctgaaa atatgatctt ccttctccag gcaaactaca gctactcacc
61 ttcccaagtc cagctcatcg gccacagcct gggagcccac gtggctgggg aggcagggag
121 caggactcca ggtctgggca ggattacagg taagacccaa ggggcaggct ccagattttg
181 tccccagta
//
LOCUS DOGCPLA07 380 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 7.
ACCESSION M28146 J04961
KEYWORDS pancreatic lipase.
SEGMENT 7 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 380)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 1 + 120 pancreatic lipase precursor, exon 7
matp + 1 + 120 pancreatic lipase
IVS 121 > 380 pancreatic lipase intron G
BASE COUNT 99 a 89 c 101 g 91 t
ORIGIN About 1340 bp after segment 6.
1 ggttggatcc tgtagaagca agtttccagg gcactcctga agaggttcga cttgatccca
61 ctgatgctga ctttgttgat gtgattcaca cagatgcagc tcccctgatc ccattcctgg
121 gtgagcccaa tgacgcttta gttgcaaaca cgtgcaataa tgtttgaacc aaaaagagcc
181 ctgtgtggtg aaaagctcat tgctgctcca aacatttcag gttttggaac aagccaacag
241 atgggtcacc ttgacttctt ccccaatggt ggagaggaaa tgccaggatg caagaagaat
301 gccctgtcac agatcgtgga cctagatggc atttgggaag gtaaagccca agcagaacgg
361 ggcgtcactt ccctggggtg
//
LOCUS DOGCPLA08 198 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 8.
ACCESSION M28147 J04961
KEYWORDS pancreatic lipase.
SEGMENT 8 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 198)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 41 + 159 pancreatic lipase precursor, exon 8
matp + 41 + 159 pancreatic lipase
IVS < 1 40 pancreatic lipase intron G
IVS 160 > 198 pancreatic lipase intron H
BASE COUNT 38 a 61 c 40 g 59 t
ORIGIN About 1366 bp after segment 7.
1 ccccaaccac ccgtgcagac ttccttgttt ctttctccag gaactcggga cttcgtggct
61 tgcaatcacc tgagaagtta caagtattac tcagagagca tcctcaaccc tgatggattt
121 gcttcctacc cctgtgcttc ctacagggcc tttgaatctg taagctatcg tccttccttg
181 agtccctcgt ggacggtt
//
LOCUS DOGCPLA09 211 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 9.
ACCESSION M28148 J04961
KEYWORDS pancreatic lipase.
SEGMENT 9 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 211)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 42 + 171 pancreatic lipase precursor, exon 9
matp + 42 + 171 pancreatic lipase
IVS < 1 41 pancreatic lipase intron H
IVS 172 > 211 pancreatic lipase intron I
BASE COUNT 55 a 56 c 41 g 59 t
ORIGIN About 675 bp after segment 8.
1 gatttacatc ttacagctgg ctctctctct tttctcctta gaacaagtgc ttcccctgcc
61 cagatcaagg gtgcccacag atgggtcact atgctgataa atttgctgtc aagacaagtg
121 atgagacaca gaaatacttc ctgaacaccg gagattccag caattttgct cgtaagtttt
181 accttgaccc tgctcaccca tgtcaaggag a
//
LOCUS DOGCPLA10 189 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 10.
ACCESSION M28149 J04961
KEYWORDS pancreatic lipase.
SEGMENT 10 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 189)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 41 + 149 pancreatic lipase precursor, exon 10
matp + 41 + 149 pancreatic lipase
IVS < 1 40 pancreatic lipase intron I
IVS 150 > 189 pancreatic lipase intron J
BASE COUNT 47 a 38 c 43 g 61 t
ORIGIN About 1871 bp after segment 9.
1 gtgcatgagt caattgtact cttttctatc ccctgtgcag gctggagata cggggtttct
61 ataacattgt ctgggaaaag agccactggt caggctaaag ttgctttgtt tggaagtaag
121 ggaaatactc atcaattcaa tatcttcaag taagtttcct ctttcagcgt cacgtctcac
181 tggatgcta
//
LOCUS DOGCPLA11 248 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 11.
ACCESSION M28150 J04961
KEYWORDS pancreatic lipase.
SEGMENT 11 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 248)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 41 + 208 pancreatic lipase precursor, exon 11
matp + 41 + 208 pancreatic lipase
IVS < 1 40 pancreatic lipase intron J
IVS 209 > 248 pancreatic lipase intron K
BASE COUNT 72 a 42 c 59 g 75 t
ORIGIN About 756 after segment 10.
1 gtttctatgc atgagaatgt ttctatctct ttatttcaag ggggattctc aaaccaggct
61 ctactcattc caatgagttt gatgcaaagc ttgatgttgg aacaattgag aaagtcaagt
121 ttctttggaa taacaacgtg gtaaacccaa cctttcccaa agtgggtgca gccaagatca
181 ccgtgcaaaa gggagaggag aaaacagtgt atgtatatct ttgctggatt gtgtctaagg
241 atgttggt
//
LOCUS DOGCPLA12 184 bp ds-DNA MAM 14-AUG-1990
DEFINITION Canine pancreatic lipase gene, exon 12.
ACCESSION M28151 J04961
KEYWORDS pancreatic lipase.
SEGMENT 12 of 12
SOURCE Canine spleen DNA, clones L[13,20,4].
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 184)
AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
Scheele,G.A.
TITLE Structure of the canine pancreatic lipase gene
JOURNAL J. Biol. Chem. 264, 12895-12901 (1989)
STANDARD simple staff_review
FEATURES from to/span description
pept + 37 100 pancreatic lipase precursor, exon 12
matp + 37 97 pancreatic lipase
pre-msg < 1 156 pancreatic lipase mRNA and intron
IVS < 1 36 pancreatic lipase intron K
signal 135 140 poly-A signal
BASE COUNT 44 a 60 c 39 g 41 t
ORIGIN About 3005 bp after segment 11.
1 tcacaacact tacaaagctc tcattttggt ctgcaggcac agcttctgca gcgaaagcac
61 cgtgagggag gatgttctgc tcaccctcac gccctgttaa cgtccaggcg caacccgggc
121 accgtgttaa cagcaataaa aaccactgat gcatttaccc actcccacgt ctgtcgctgc
181 tgtc
//
LOCUS MZECPSECX 395 bp ds-DNA ORG 14-AUG-1990
DEFINITION Maize chloroplast 50S ribosomal protein (secX) gene, complete cds.
ACCESSION M35956
KEYWORDS 50S ribosomal protein.
SOURCE Maize chloroplast DNA.
ORGANISM Chloroplast Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE 1 (bases 1 to 395)
AUTHORS Markmann-Mulisch,U., von Knoblauch,K., Lehmann,A. and
Subramanian,A.R.
TITLE Nucleotide sequence and linkage map position of the secX gene in
maize chloroplast and evidence that it encodes a protein belonging
to the 50S ribosomal subunit
JOURNAL Biochem. Int. 15, 1057-1067 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 165 278 50S ribosomal protein (secX)
binding 151 154 secX ribosomal binding site (put.)
BASE COUNT 153 a 55 c 74 g 113 t
ORIGIN
1 gaagcgtacc gaagactcga aggatactga agatttgaag gataccaaag attcaaagga
61 ttaggttttt ctggggtaat aacttccaag tttcaaaatt taagtgaaga gacttatttt
121 ttccaaaaga atagattcat agtttaagaa aggaataccc atatatgaaa ataagagctt
181 ccgttcgtaa aatttgtaca aaatgtcgac tgattcgcag gcgtgggcga attagagtta
241 tttgttccaa tccgaagcat aaacaaagac aggggtaatc tttcgaaaaa gaagcttttc
301 tttctaatag cttttctttc taattaaagt aaaaagtacc cacggaaatg cccaaattgc
361 aaataaaaaa aatgaaagta aaggatatat tttaa
//
LOCUS ATHTGYA 210 bp ds-DNA PLN 14-AUG-1990
DEFINITION A.thaliana tRNA-Tyr gene.
ACCESSION M35957
KEYWORDS transfer RNA-Tyr.
SOURCE A.thaliana leaf DNA, clone pAtT1.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 210)
AUTHORS Stange,N., Gross,H.J. and Beier,H.
TITLE Wheat germ splicing endonuclease is highly specific for plant
pre-tRNAs
JOURNAL EMBO J. 7, 3823-3828 (1988)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 71 107 Tyr-tRNA, exon 1
120 155 Tyr-tRNA, exon 2
RNA 71 181 precursor Tyr-tRNA (+/- 5 bp)
IVS 108 119 Tyr-tRNA intron (no splice consensus)
BASE COUNT 59 a 37 c 44 g 70 t
ORIGIN
1 ctaacttaat acaaatcatt gtgttcatag aatatttaga tcagtacaca tgcatgaaat
61 agaatacaat ccgaccttag ctcagttggt agagcggagg actgtagtag acgcagatta
121 tccttaggtc actggttcga atccggtagg tcggaatttg ctcccacatg agagcttttt
181 atttttcttt cgttgtgaca ttaaggtttt
//
LOCUS ATHTGYB 210 bp ds-DNA PLN 14-AUG-1990
DEFINITION A.thaliana tRNA-Tyr gene.
ACCESSION M35958
KEYWORDS transfer RNA-Tyr.
SOURCE A.thaliana leaf DNA, clone pAtT3.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 210)
AUTHORS Stange,N., Gross,H.J. and Beier,H.
TITLE Wheat germ splicing endonuclease is highly specific for plant
pre-tRNAs
JOURNAL EMBO J. 7, 3823-3828 (1988)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 71 107 Tyr-tRNA, exon 1
120 155 Tyr-tRNA, exon 2
RNA 71 178 precursor Tyr-tRNA (+/- 5 bp)
IVS 108 119 Tyr-tRNA intron (no splice consensus)
BASE COUNT 66 a 34 c 41 g 69 t
ORIGIN
1 tcataagttt aaatttgatt aaaaagcttc tcttgaatca tcaacatgca taaagtgtta
61 taatacaaaa ccgaccttag ctcagttggt agagcggagg actgtagttg acgcagataa
121 tccttaggtc actggttcga atccggtagg tcggatcatt aaaattgaaa gttttttttt
181 tccatttcgt ttcaaagtgc ttagacaaga
//
LOCUS STMCHOPA 3104 bp ds-DNA BCT 14-AUG-1990
DEFINITION Streptomyces sp. cytochrome-P450-like protein (choP) gene and
cholesterol oxidase (choA) gene, complete cds.
ACCESSION M31939 J03356
KEYWORDS cholesterol oxidase; cytochrome-P450-like protein.
SOURCE Streptomyces sp. (strain SA-COO) DNA, clone pCO-[1,3].
ORGANISM Streptomyces sp.
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 963 to 3104)
AUTHORS Ishizaki,T., Hirayama,N., Shinkawa,H., Nimi,O. and Murooka,Y.
TITLE Nucleotide sequence of the gene for cholesterol oxidase from a
Streptomyces sp
JOURNAL J. Bacteriol. 171, 596-601 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1450)
AUTHORS Horii,M., Ishizaki,T., Paik,S.-Y., Manome,T. and Murooka,Y.
TITLE An operon containing the gene for cholesterol oxidase and a
cytochrome P-450-like protein from Streptomyces sp
JOURNAL J. Bacteriol. 172, 3644-3653 (1990)
STANDARD full staff_review
COMMENT Draft entry and clean copy of sequence for [1] kindly provided by
Y.Murooka, 01-DEC-1989. Draft entry and printed sequence for [2]
kindly submitted by Y.Murooka, 02-FEB-1990, for release after
publication.
FEATURES from to/span description
mRNA 100 > 1283 choP mRNA
pept 141 1286 cytochrome-P450-like protein (choP)
pept 1361 3001 cholesterol oxidase precursor (choA, EC
1.1.3.6)
sigp 1361 1486 cholesterol oxidase signal peptide
matp 1487 2998 cholesterol oxidase
mRNA 1384 > 3104 choA mRNA
signal 65 70 -35 region
signal 89 94 -10 region
binding 129 134 choP ribosomal binding site (put.)
binding 1340 1345 choA ribosomal binding site (put.)
BASE COUNT 518 a 1234 c 932 g 420 t
ORIGIN
1 ggtacccccg gtaaccgaac cccgtgcccg tacccgcagg tagcccgccg ggggaacgat
61 ctcgttgaca gccttcacat cgcctccata cggtcatttc gacggcgccg aacatgccga
121 tcactcgtga aagggcatac atgacccagg ctgcgccggt gaccttctcg acagtccgtg
181 agaattactt cggtccgccg gcggagatgc aggcactgcg gcacaaggca cccgtgacgc
241 gcacggcctt cgccgacggc cgccccggct ggctcgtcac cggatattcc gccgcgcgtg
301 cggtgttatc cgattcccgt ttcaccgccc gcggagaacg cgagcacccc gccgtgccgc
361 gcgccgcgac cctggaggac gagcgctgcc gcaggctgat cgccggacag ttcaccgccc
421 gccggatgcg gcagctcacc gggcggaccg agcggatcgt ccgcgaacac ctcgacgcca
481 tggagcacat gggctccccg gccgacctcg tcgagcactt cgccctcccc gtcccctcgc
541 tcgtcatcgc cgaactgctc ggcgtgcccc cccccgaccg ggagcacttc cagcacgaca
601 ccctccggtg gggcggcttc ggccggtcca ccgaggaggt caccgaggcc ttcgtctccc
661 tcggcggcca actgcagcgc ctcgtgcggc tcaagcgcac cgagccgggc gacgacctcc
721 tcagcggcct gatcgccgcc gaccccgcgc tcaccgacga ggaactcgcc tcgatcgcct
781 tcctgttgct ggtcgccggc cacgggacca ccgcccacca gatcgccctc ggcgccttcc
841 tgctgctcga acaccccgac cagctggccg cgctgcgcgc cgaccccgcc ctgacggaga
901 gcgcggtcga ggagctgctg cgccacctga gcgtcgtcca ccacggcccc acccgtgccg
961 cgctgcagga cgccgacatc gagggcaccc ccgtcaaggc gggcgaggtc gtcgtggtct
1021 cgctcggcgc cgccaaccgc gaccccgcac gcttcgagcg gcccgacgcc gtggatgtca
1081 cccgcgagga caccggccac ctcgccttcg ggcacggcat gcaccagtgc ctcggccggc
1141 aactggcccg catcgaactg cgcgtcgccc tcaccgccct cctcgaacgc ttcccgcacc
1201 tgcgcctggc ctgccccgcg gcggagatcc cgctccgcca cgacatgcag gtctacggcg
1261 ccgaccggct gccggtcgcc tggtgaccgt ccccgtcacc cccccgaacc accttcccca
1321 cccctgaaga aacgtctcgg aaaggtcagg tagaccccgc gtgactgcac aacagcacct
1381 gtcccgccgc cgcatgctcg gcatggccgc cttcggcgcc gccgccctcg ccgggggcac
1441 caccatcgcc gccccccgtg cggccgccgc cgccaagtcc gcggcggaca acggcggtta
1501 cgtccccgcc gtcgtcatcg gcaccggcta cggcgcggcc gtctccgcgc tgcgcctcgg
1561 cgaggcgggt gtgcagaccc tgatgctgga gatgggccag ctgtggaacc agcccggccc
1621 cgacggcaac atcttctgcg gcatgctcaa cccggacaag cggtccagct ggttcaagaa
1681 ccgcaccgag gccccgctcg gcagcttcct ctggctcgac gtcgtcaacc ggaacatcga
1741 cccctacgcg ggtgtcctgg accgtgtgaa ctacgaccag atgtcggtct acgtgggccg
1801 cggcgtcggc ggcggctcgc tcgtcaacgg cggcatggcc gtggagccca agcgctcgta
1861 cttcgaggag atcctcccgc gggtcgactc ctccgagatg tacgaccgct acttcccccg
1921 cgccaactcc atgctccgcg tcaaccacat cgacaccaag tggttcgagg acaccgagtg
1981 gtacaagttc gcccgcgtct cgcgcgagca ggcgggcaag gccggtctcg gcaccgtctt
2041 cgtccccaac gtctacgact tcggctacat gcagcgcgag gccgcgggcg aggtgcccaa
2101 gtccgccctg gcgaccgagg tcatctacgg caacaaccac ggcaagcaga gcctggacaa
2161 gacctacctg gccgccgcac tcggcaccgg caaggtcacc atccagaccc tgcaccaggt
2221 caagacgatc cgtcagacga aggacggcgg ctacgcgctg accgtcgagc agaaggacac
2281 cgacggcaag ctcctggcca ccaaggagat ctcctgccgc tacctgttcc tcggcgcggg
2341 cagcctcggc tccaccgaac tgctggtgcg cgcccgcgac accggcaccc tgccgaacct
2401 caactccgag gtgggcgcgg gctggggccc caacggcaac atcatgaccg cccgggccaa
2461 ccacatgtgg aaccccaccg gcgcccacca gtcctccatc cccgccctcg gcatcgacgc
2521 gtgggacaac agcgactcct cggtcttcgc ggagatcgcc cccatgccgg ccggcctgga
2581 gacgtgggtc agcctctacc tcgcgatcac caagaacccc cagcgcggca ccttcgtgta
2641 cgacgccgcg acggaccgcg cgaagctcaa ctggacccgt gaccagaacg cccccgcggt
2701 caacgcagcc aaggcgctgt tcgaccggat caacaaggcg aacggcacga tctaccggta
2761 cgacctcttc ggcacccagc tgaaggcctt cgccgacgac ttctgctacc acccgctcgg
2821 cggctgcgtc ctgggcaagg cgacggacga ctacggccgc gtcgccggtt acaagaacct
2881 ctacgtgacc gacggttcgc tgatcccggg ttccgtcggc gtcaacccgt tcgtgaccat
2941 cacggcgctg gccgagcgga acgtcgagcg catcatcaag caggacgtca cggcgtcgta
3001 atcacgcacc gaccgggccg gaagggaaga acggcccggt caccgcacca tcccggaagg
3061 aacgcaccat tcccgtgaac gccgacgaca acagcctgtg gatc
//
LOCUS STMCEFDA 2494 bp ds-DNA BCT 14-AUG-1990
DEFINITION S.clavuligerus isopenicillin N epimerase (cefD) gene and
deacetoxycephalosporin C synthetase (DAOCS) gene, complete cds.
ACCESSION M32324 M24140
KEYWORDS deacetoxycephalosporin C synthetase; isopenicillin N epimerase.
SOURCE S.clavuligerus DNA, clone pOW380.
ORGANISM Streptomyces clavuligerus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1520 to 2494)
AUTHORS Kovacevic,S., Weigel,B.J., Tobin,M.B., Ingolia,T.D. and Miller,J.R.
TITLE Cloning, characterization, and expression in Escherichia coli of
the Streptomyces clavuligerus gene encoding deacetoxycephalosporin
C synthetase
JOURNAL J. Bacteriol. 171, 754-760 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1706)
AUTHORS Kovacevic,S., Tobin,M.B. and Miller,J.R.
TITLE The beta-lactam biosynthetis genes for isopenicillin N epimerase
and deacetoxycephalosporin C synthetase are expressed from a single
transcript in Streptomyces clavuligerus
JOURNAL J. Bacteriol. 172, 3952-3958 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.R.Miller, 26-FEB-1990, for release after publication.
FEATURES from to/span description
pept 284 1480 isopenicillin N epimerase (cefD)
pept 1559 2494 deacetoxycephalosporin C synthetase (DAOCS)
mRNA 152 > 1706 cefD mRNA
BASE COUNT 387 a 931 c 791 g 385 t
ORIGIN
1 ccatgggtga taactcctgt ggagcgggac aaaaggtcga aagattcaaa tgccgagcag
61 cccggacggt ttactgcata atgtgggcta cctatgctgg acgacaggac tcttgaagtg
121 ctcttcggct ggtcttcaga atctcttcgc tatttttccg aagcggacag aaaatcttcc
181 tgttcgagat cgagtcatga ctgctacgct gcatcgggcc gatcgaattt cgatggctgc
241 actgaattga ctgttcttct ggatttgcgg gagatgcgtt gacatggcgg tagccgactg
301 ggaagaagcc cgcggccgta tgctgctcga ccccaccgtc gtcaacctca acaccggctc
361 cgggggaccg ctgccgcgca gcgccttcga gcgggtcacc ggcttccgcg cccatctcgc
421 ggccgagccg atggacttcc tgctccgcga ggtccccgca ctgctgtggc aggcgcggga
481 gagcctcgcc cgcctcatcg gcggggaccc gctgcgcctc gccctggcca ccaacgtcac
541 cgccgccgtc aacctcgtcg cgtcgtcact gcgcctcgaa gcgcccggcg agatcctgct
601 cagcgacgac gagtacacgc ccatgcgctg gtgctgggag cgggtcgccc ggcggcacgg
661 cctggagctg aggacgttcc ggctgcccga gctgccctcg gacccggccg agatcaccgc
721 ggcggcggtc gccgcgatgg gaccgcgcac ccggctgttc ttcttcagcc atgtcgtctc
781 cacgaccggg ctgatcctgc ccgccgccga actgtgcgag gaggcccgcg cacggggcat
841 caccaccgtg gtcgacggcg cccacgcacc cggcttcctc gacctcgacc tctcccggat
901 cccctgcgac ttctacgcgg gcagcggcca caaatggctg ctcgccccca ccggggtcgg
961 ctttctccac ctcgcccccg gccgcctgga agaactggag cccacccagg tgagctgggc
1021 gtacgagccc ccggagggca gcggcccgcc ggccgcgcgc gaccgcttcg gcagcacacc
1081 cgggctgcgc agactcgaat gcgaggggac ccgggacatc tgcccctggc tcgccacacc
1141 ggagtcgatc gacttccagg ccgagctggg ccccggggcg atccgcgccc gccgccgcga
1201 gctgacggac cacgcgcgcc gcctgctcgc cgaccgcccg ggccgcacgc tcctcacccc
1261 cgactccccg gagctgtccg gcggcatggt ggcctaccgg ctgcccccgg gaaccgacgc
1321 ggccgagctg cgccgggggc tctgggagcg cttccggatc gaggccgcgg tggccgagca
1381 gccgcccggg ccggtgctcc ggatctcggc gaacttctac accaccgaag aggagatcga
1441 ccgcctggcg gacgcgctgg acgcgctgac gggcgagtga tccccccggc tcgcggaccg
1501 cctcccccgc gctgttgacc acccggttca cggattacga gaggatcagt gagagttgat
1561 ggacacgacg gtgcccacct tcagcctggc cgaactccag cagggcctgc accaggacga
1621 gttccgcagg tgtctgaggg acaagggcct cttctatctg acggactgcg gtctgaccga
1681 caccgagctg aagtcggcca aggacctcgt catcgacttc ttcgagcacg gcagcgaggc
1741 ggagaagcgc gccgtcacct cgcccgtccc caccatgcgc cgcggcttca ccgggctgga
1801 gtcggagagc accgcccaga tcaccaatac cggcagctac tccgactact cgatgtgcta
1861 ctcgatgggc accgcggaca acctcttccc gtccggtgac ttcgagcgga tctggaccca
1921 gtacttcgac cgccagtaca ccgcctcccg cgcggtcgcc cgggaggtcc tgcgggcgac
1981 cgggaccgag cccgacggcg gggtcgaggc cttcctcgac tgcgagccgc tgctgcggtt
2041 ccgctacttc ccgcaggtcc ccgagcaccg cagcgccgag gagcagcccc tgcggatggc
2101 gccgcactac gacctgtcga tggtcaccct catccagcag acaccctgcg ccaacggctt
2161 cgtcagcctc caggccgagg tcggcggcgc gttcacggac ctgccctacc gtccggacgc
2221 cgtcctcgtc ttctgcggcg ccatcgcgac cctggtgacc ggcggccagg tcaaggcccc
2281 ccggcaccat gtcgcggccc cccgcaggga ccagatagcg ggcagcagcc gcacctccag
2341 tgtgttcttc ctccgtccca acgcggactt caccttctcc gtcccgctgg cgcgcgagtg
2401 cggcttcgat gtcagcctgg acggcgagac cgccacgttc caggattgga tcgggggcaa
2461 ctacgtgaac atccgccgca catccaaggc atag
//
LOCUS AMTMTCYB 225 bp ds-DNA ORG 14-AUG-1990
DEFINITION Salamander (A.tigrinum) mitochondrial cytochrome b, partial cds.
ACCESSION M26230 M28046
KEYWORDS cytochrome; cytochrome b.
SOURCE A.tigrinum liver mitochondrial DNA.
ORGANISM Mitochondrion Ambystoma tigrinum
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Caudata; Ambystomatidae; Ambystoma tigrinum.
REFERENCE 1 (bases 1 to 225)
AUTHORS Kocher,T.D., Thomas,W.K., Meyer,A., Edwards,S.V., Paeaebo,S.,
Villablanca,F.X. and Wilson,A.C.
TITLE Dynamics of mitochondrial DNA evolution in animals: Amplification
and sequencing with conserved primers
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6196-6200 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 225)
AUTHORS Kocher,T.D. and White,T.J.
TITLE Evolutionary analysis via PCR
JOURNAL (in) Erlich,H. (Ed.);
PCR Technology: Princip. and Appl. for DNA amplification: 137-147;
Stockton Press, New York (1989).
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1],[2] kindly
submitted by T.D.Kocher, 18-JUL-1989.
Bernie 12-SEP-1989 initial full entry [1]
FEATURES from to/span description
pept < 1 > 225 cytochrome b (AA at 1)
BASE COUNT 68 a 39 c 35 g 83 t
ORIGIN
1 acagggttat ttttagctat acattataca gcagatacat catcagcatt ctcatccgta
61 gcccacattt gccgagatgt aaattatggt tgacttatac gaaatattca cgcaaacggc
121 gcttcattct tttttatttg tatctttctt catattggtc gaggaatata ttacggctca
181 tatatgttca aagaaacatg aaacattgga gtaattttat tattt
//
LOCUS BLYLEC 972 bp ss-mRNA PLN 14-AUG-1990
DEFINITION Barley root-specific lectin mRNA, complete cds.
ACCESSION M29280 M24846
KEYWORDS lectin.
SOURCE Barley (var. Betzes) embryo, cDNA to mRNA, clone BLc3.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 972)
AUTHORS Lerner,D.R. and Raikhel,N.V.
TITLE Cloning and characterization of root-specific barley lectin
JOURNAL Plant Physiol. 91, 124-129 (1989)
STANDARD simple staff_review
COMMENT Draft entry and sequence for [1] kindly submitted by N.Raikhel,
16-MAY-1989.
FEATURES from to/span description
pept 16 654 lectin precursor
sigp 16 93 lectin signal peptide (put.)
matp 94 606 lectin
BASE COUNT 197 a 285 c 291 g 199 t
ORIGIN
1 cagaaaacaa gaaggatgaa gatgatgagc accagggccc tcgctctcgg cgcggccgcc
61 gtcctcgcct tcgcggcggc gaccgcgcac gcccagaggt gcggcgagca gggcagcaac
121 atggagtgcc ccaacaacct ctgctgcagc cagtacgggt actgcggcat gggcggcgac
181 tactgcggca agggctgcca gaacggcgcc tgctacacca gcaagcgctg cggcactcag
241 gccggcggca agacatgccc taacaaccac tgctgcagcc agtggggtta ctgcggcttc
301 ggcgccgagt actgcggcgc cggctgccag ggcggcccct gccgcgccga catcaagtgc
361 ggcagccagg ccggcggcaa gctttgcccc aacaacctct gctgcagcca gtggggttac
421 tgcggcctcg gctccgagtt ctgcggcgag ggctgccagg gcggtgcttg cagcaccgac
481 aagccgtgcg gcaaggccgc cggcggcaaa gtttgcacca acaactactg ctgcagcaag
541 tggggatcct gtggcatcgg cccgggctac tgcggcgcag gttgccagag cggcggctgc
601 gacggtgtct tcgccgaggc catcgccgcc aactccactc ttgtcgcaga atgatgatct
661 tgctaatggc agtattattg caacgacgaa taatccgtgg cagttttgtt gccacgtacg
721 gtctcccttc acttactttt agcactagtc cttaataatt ctccagcctt gcaatatgac
781 gtgcaggttg ctacatgcat ggacatattg cagtgagaag tactgtgtgg caatataggg
841 tgtactattg ttgccacaaa tttagttctt tcttgttacg tacgtacagt tgtcaggatg
901 catgcatccc cgttgtaatg ttggagtact ccatgatttc gttgcaatat atatattgcc
961 atgagtctaa ag
//
LOCUS CHKMTCYB 225 bp ds-DNA ORG 14-AUG-1990
DEFINITION Chicken mitochondrial cytochrome b, partial cds.
ACCESSION M26231 M28048
KEYWORDS cytochrome; cytochrome b.
SOURCE Chicken mitochondrial DNA.
ORGANISM Mitochondrion Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae; Gallus gallus.
REFERENCE 1 (bases 1 to 225)
AUTHORS Kocher,T.D., Thomas,W.K., Meyer,A., Edwards,S.V., Paeaebo,S.,
Villablanca,F.X. and Wilson,A.C.
TITLE Dynamics of mitochondrial DNA evolution in animals: Amplification
and sequencing with conserved primers
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6196-6200 (1989)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 225)
AUTHORS Kocher,T.D. and White,T.J.
TITLE Evolutionary analysis via PCR
JOURNAL (in) Erlich,H. (Ed.);
PCR Technology: Princip. and Appl. for DNA amplification: 137-147;
Stockton Press, New York (1989).
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1],[2] kindly
submitted by T.D.Kocher, 18-JUL-1989.
Bernie 12-SEP-1989 initial full entry [1]
Mia 13-AUG-1990 citation update
FEATURES from to/span description
pept < 1 > 225 cytochrome b (AA at 1)
BASE COUNT 56 a 84 c 35 g 50 t
ORIGIN
1 accgccctac tactagccat gcactacaca gcagacacat ccctagcctt ctcctccgta
61 gcccacactt gccggaacgt acaatacggc tgactcatcc ggaatctcca cgcaaacggc
121 gcctcattct tcttcatctg tatcttcctt cacatcggac gaggcctata ctacggctcc
181 tacctctaca aggaaacctg aaacacagga gtaatcctcc tcctc
//
LOCUS HUMMUC18A 2940 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human MUC18 gene mRNA (3' variant), complete cds.
ACCESSION M29277
KEYWORDS MUC18 glycoprotein; cell adhesion molecule;
immunoglobulin-like protein; integral membrane glycoprotein.
SOURCE Human (isolate JuSo) melanoma (cell line Mel JuSo), cDNA to mRNA,
clone zapy1, drop 1, drop 4, drop 4.2, drop 4.7, drop 4.11.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2940)
AUTHORS Lehmann,J.M., Riethmueller,G. and Johnson,J.P.
TITLE MUC18, a marker of tumor progression in human melanoma, shows
sequence similarity to the neural cell adhesion molecules of the
immunoglobulin superfamily
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9891-9895 (1989)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
J.P.Johnson, 06-OCT-89.
FEATURES from to/span description
pept 8 1819 MUC18 glycoprotein precursor
sigp 8 91 MUC18 glycoprotein signal peptide
matp 92 1816 MUC18 glycoprotein
site 122 373 immunoglobin like (V set) domain I
site 464 691 immunoglobin like (V set) domain II
site 793 985 immunoglobin like (C2 set) domain
site 1073 1246 immunoglobin like (C2 set) domain
site 1334 1522 immunoglobin like (C2 set) domain
BASE COUNT 668 a 861 c 856 g 555 t
ORIGIN
1 gggaagcatg gggcttccca ggctggtctg cgccttcttg ctcgccgcct gctgctgctg
61 tcctcgcgtc gcgggtgtgc ccggagaggc tgagcagcct gcgcctgagc tggtggaggt
121 ggaagtgggc agcacagccc ttctgaagtg cggcctctcc cagtcccaag gcaacctcag
181 ccatgtcgac tggttttctg tccacaagga gaagcggacg tcatcttccg tgtgcgccag
241 ggccagggcc agagcgaacc tggggagtac gagcaagcgg ctcagcctcc aggacagagg
301 ggctactctg gccctgactc aagtcacccc ccaagacgag cgcatcttct tgtgccaggg
361 caagcgcctc ggtcccagga gtaccgcatc cagctccgcg tctacaaagc tccggatgcc
421 aaacatccag gtcaaccccc tgggcatccc tgtgaacagt aaggagcctg aggaggtcgc
481 tacctgtgta gggaggaacg ggtaccccat tcctcaagtc atctggtaca agaatggccg
541 gcctctgaag gaggagaaga accgggtcca cattcagtcg tcccagactg tggagtcgag
601 tggtttgtac accttgcaga gtattctgaa ggcacagctg gttaaagaag acaaagatgc
661 ccagttttac tgtgagctca actaccggct gcccagtggg aaccacatga aggagtccag
721 ggaagtcacc gtccctgttt tctacccgac agaaaaagtg tggctggaag tggagcccgt
781 gggaatgctg aaggaagggg accgcgtgga aatcaggtgt ttggctgatg gcaaccctcc
841 accacacttc agcatcagca agcagaaccc cagcaccagg gaggcagagg aagagacaac
901 caacgacaac ggggtcctgg tgctggagcc tgcccggaag gaacacagtg ggcgctatga
961 atgtcaggcc tggaacttgg acaccatgat atcgctgctg agtgaaccac aggaactact
1021 ggtgaactat gtgtctgacg tccgagtgag tcccgcagcc cctgagagac aggaaggcag
1081 cagcctcacc ctgacctgtg aggcagagag tagccaggac ctcgagttcc agtggctgag
1141 agaagagaca gaccaggtgc tggaaagggg gcctgtgctt cagttgcatg acctgaaacg
1201 ggaggcagga ggcggctatc gctgcgtggc gtctgtgccc agcatacccg gcctgaaccg
1261 cacacagctg gtcaagctgg ccatttttgg ccccccttgg atggcattca aggagaggaa
1321 ggtgtgggtg aaagagaata tggtgttgaa tctgtcttgt gaagcgtcag ggcacccccg
1381 gcccaccatc tcctggaacg tcaacggcac ggcaagtgaa caagaccaag atccacagcg
1441 agtcctgagc accctgaatg tcctcgtgac cccggagctg ttggagacag gtgttgaatg
1501 cacggcctcc aacgacctgg gcaaaaacac cagcatcctc ttcctggagc tggtcaattt
1561 aaccaccctc acaccagact ccaacacaac cactggcctc agcacttcca ctgccagtcc
1621 tcataccaga gccaacagca cctccacaga gagaaagctg ccggagccgg agagccgggg
1681 cgtggtcatc gtggctgtga ttgtgtgcat cctggtcctg gcggtgctgg gcgctgtcct
1741 ctatttcctc tataagaagg gcaaggctgc cgtgcaggcg ctcagggaag caggagatca
1801 cgctgccccc gtctcgtaag accgaacttg tagttgaagt taagtcagat aagctcccag
1861 aagagatggg cctcctgcag gcagcagcgg tgacaagagg gctccgggag accagggaga
1921 gaaatacatc gatctgaggc attagccccg aatcacttca gctcccttcc ctgcctggac
1981 cattcccagc tccctgctca ctcttctctc agccaaagct caaagggact agagagaagc
2041 ctcctgctcc cctcgcctgc acaccccctt tcagagggcc actgggttag gacctgagga
2101 cctcacttgg ccctgcaagg cccgcttttc agggaccagt ccaccaccat ctcctccacg
2161 ttgagtgaag ctcatcccaa gcaaggagcc ccagtctccc gagcgggtag gagagtttct
2221 tgcagaacgt gttttttctt tacacacatt atgctgtaaa tacgctcgtc ctgccagcag
2281 ctgagctggg tagcctctct gagctggttt cctgccccaa aggctggcat tccaccatcc
2341 aggtgcacca ctgaagtgag gacacaccgg agccaggcgc ctgctcatgt tgaagtgcgc
2401 tgttcacacc cgctccggag agcaccccag cagcatccag aagcagctgc agtgcaagct
2461 tgcatgcctg cgtgttgctg caccaccctc ctgtctgcct cttcaaagtc tcctgtgaca
2521 ttttttcttt ggtcagaggc caggaactgt gtcattcctt aaagatacgt gccggggcca
2581 ggtgtggctc acgcctgtaa tcccagcact ttgggaggcc gaggcggcgg atcacaaagt
2641 cagacgagac catcctggct aacacggtga aaccctgtct ctactaaaaa tacaaaaaaa
2701 aattagctag gcgtagtggt tggcacctat agtcccagct actcggaagg ctgaagcagg
2761 agaatggtat gaatccagga ggtggagctt gcagtgagcc gagaccgtgc cactgcactc
2821 cagcctgggc aacacagcga gactccgtct cgagccggcc ggttgcgcgg gccctcggac
2881 cctcagagag gcgagggttc gagggcacga gttcgaggcc aacctggtcc acatgggttg
//
LOCUS HUMTELB 262 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human DNA from the end of the chromosomal telomere.
ACCESSION M19947 J04078
KEYWORDS repetitive sequence.
SOURCE Human placental DNA, clone pHuR 143.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 262)
AUTHORS Moyzis,R.K., Buckingham,J.M., Cram,L.S., Dani,M., Deaven,L.L.,
Jones,M.D., Meyne,J., Ratliff,R.L. and Wu,J.R.
TITLE A highly conserved repetitive DNA sequence, (TTAGGG)n, present at
the telomeres of human chromosomes
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6622-6626 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence [1] kindly submitted by
S.Jett 27-JUL-88.
The (ttaggg)n telomeric sequence is found at the ends of all human
chromosomes as well as in some other higher eukaryotic genomes.
FEATURES from to/span description
site 80 81 rare insertion/substitution
site 243 244 rare insertion/substitution
BASE COUNT 44 a 0 c 131 g 87 t
ORIGIN Unreported.
1 tttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg
61 gttagggtta gggttaggga gtagggttag ggttagggtt agggttaggg ttagggttag
121 ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag
181 ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag
241 ggtgttaggg ttagggttag gg
//
LOCUS LCOLHICDH 1320 bp ds-DNA BCT 14-AUG-1990
DEFINITION L.confusus L-2-hydroxyisocaproate dehydrogenase (L-HicDH) gene,
complete cds.
ACCESSION M31425 M28050
KEYWORDS L-2-hydroxyisocaproate dehydrogenase.
SOURCE L.confusus (strain DSM20196) DNA.
ORGANISM Lactobacillus confusus
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 1320)
AUTHORS Lerch,H.-P., Frank,R. and Collins,J.
TITLE Cloning, sequencing and expression of the L-2-hydroxyisocaproate
dehydrogenase-encoding gene of Lactobacillus confusus in
Escherichia coli
JOURNAL Gene 83, 263-270 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 328 1260 L-2-hydroxyisocaproate dehydrogenase
ORF < 1 243 ORF (AA at 1)
mRNA 301 1304 LDH mRNA (3' end put.)
signal 1267 1303 transcription termination signal
binding 318 323 ribosome binding site
signal 267 272 -35 region
signal 290 295 -10 region
BASE COUNT 333 a 263 c 353 g 371 t
ORIGIN
1 ttctttttgt acaccatgac aatgtttgct gctaagacat cttcgccagt tgaaacggcc
61 cgtttgtcag gtatggcgca agctggtggt tactttatgt ctgcattcgg tccaatgctt
121 tatggtatgg cttttaccgc aaatccgaat ggggtcatcc aaaacgtcgt ttatttggta
181 ttggtgattg tgatgattgt cgctgccgtg atgatggcga tgactaagca tttgtttgac
241 taaaataatg atgagatgtt gttcagtggt gaacaacgtc tcttttttgt ataatgaaca
301 tacgacaaaa acaatttggg ggatattatg gcacgtaaga ttggaattat cggccttgga
361 aacgttgggg ctgcagtagc gcacggattg attgcacaag gtgtagccga cgactacgtc
421 tttattgatg caaacgaagc aaaggtgaag gctgatcaaa ttgatttcca agacgcaatg
481 gcgaacttgg aagcgcacgg taacattgtg attaacgatt gggcagcctt ggctgatgct
541 gatgttgtga tttcaacact ggggaacatc aagttgcaac aagacaaccc aaccggtgac
601 cgttttgctg agttgaagtt taccagcagc atggtgcaat cagtcggcac aaacttgaag
661 gaatctggtt tccacggcgt attggtcgtg atttcaaacc cggtcgacgt gattacggcc
721 ttgttccaac acgtgactgg tttcccagct cacaaggtta tcggaaccgg tactttgctt
781 gacacggcgc gtatgcaacg tgcagttggt gaggcgtttg atttggatcc acgttctgtt
841 tcaggttaca acttgggtga gcacggtaac tcacaattcg tagcttggtc aacggtgcgc
901 gtgatgggtc aaccaatcgt gacgttggct gatgccggcg atattgactt ggcggccatc
961 gaagaggaag cacgtaaggg tggcttcacg gtcttgaatg gtaagggcta cacgagttat
1021 ggtgttgcaa cgtcagcaat ccgcattgcc aaggctgtta tggctgacgc gcatgctgaa
1081 ttggttgtct caaatcgtcg cgatgacatg ggaatgtact tgtcataccc agcgattatt
1141 ggtcgcgatg gtgtcttggc agaaacgacg cttgatttga cgacggatga gcaagaaaag
1201 cttttgcaat cacgtgacta catccaacaa cgtttcgacg aaattgtgga tacactctaa
1261 aaacacaaaa agtggcttca atctggtgat tggaccactt ttttctaatt gtgcatgaaa
//
LOCUS SIVAGM266 300 bp ss-RNA VRL 14-AUG-1990
DEFINITION Simian (African green monkey) immunodeficiency virus, isolate 266,
partial pol cds.
ACCESSION M22974
KEYWORDS .
SOURCE Simian (African green monkey) immunodeficiency virus proviral DNA,
isolate 266.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Li,Y., Naidu,Y.M., Daniel,M.D. and Desrosiers,R.C.
TITLE Extensive genetic variability of simian immunodeficiency virus from
African green monkeys
JOURNAL J. Virol. 63, 1800-1802 (1989)
STANDARD full staff_review
COMMENT Hard copy of sequence kindly provided by R.C. Desrosiers,
25-JAN-1989.
FEATURES from to/span description
pept < 1 > 300 pol polyprotein (AA at 1)
pre-msg < 1 > 300 genomic mRNA
pre-msg < 1 > 300 tat, rev, nef subgenomic mRNA
BASE COUNT 108 a 59 c 74 g 59 t
ORIGIN
1 gcaggactac tggcaggcag ttggatacct gactggacct ttgtaagcgt cccacctctg
61 gtaacattgt ggtacacctt aacaaaggaa cccatcccag gagaagatgt gtattatgta
121 gatggagctt gcaacaggaa ctcaagagaa ggaaaagcag ggtatataac tcagcaaggg
181 aaacagagag tggagaagct agaaaatacc acaaatcagc aagcagaatt aacagctata
241 aaaatggcct tagaagacag tggtcccagg gtaaacattg ttacagactc tcaatatgca
//
LOCUS STYPOL3E 454 bp ds-DNA BCT 14-AUG-1990
DEFINITION S.typhimurium DNA polymerase III epsilon subunit gene, 5' end.
ACCESSION M26045 M29726
KEYWORDS polymerase III.
SOURCE S.typhimurium (strain LT2) DNA, clone pFF16.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 454)
AUTHORS Lancy,E.D., Lifsics,M.R., Kehres,D.G. and Maurer,R.
TITLE Isolation and characterization of mutants with deletions in dnaQ,
the gene for the editing subunit of DNA polymerase III in
Salmonella typhimurium
JOURNAL J. Bacteriol. 171, 5572-5580 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by R.A.Maurer, 10-JUL-1989.
FEATURES from to/span description
pept 59 > 454 polymerase III epsilon (editing) subunit
BASE COUNT 107 a 90 c 96 g 104 t 57 others
ORIGIN
1 gtagacttcc tgtaattgaa atcgataaca aaacgcaagt ctgacataaa tgaccgatat
61 gagcactgca attacacgac agatcgtcct cgataccgaa accaccggta tgaatcagat
121 aggcgcgcac tatgaaggtc acaagattat tgagatcggt gcggttgagg tgataaaccg
181 tcgtctgacc ggcaacaatt ttcatgttta cctgaagccc gatcgccttg tcgatccaga
241 ggcttttggc gtacacggta ttgccgatga gtttctgctg gataagccgg tttttgctga
301 tgtggtcgat gagtttcttg attatatnnn nggcgcggag ctggtcatcc ataacgcatn
361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncctaaaac
421 caatactttc tgcaaagtta ccgacagcct ggcg
//
LOCUS RHMSYRMA 2636 bp ds-DNA BCT 14-AUG-1990
DEFINITION R.meliloti syrM gene, complete cds.
ACCESSION M33495
KEYWORDS symR gene.
SOURCE R.meliloti (strain 1021) DNA, clones pMB[1,2].
ORGANISM Rhizobium meliloti
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 2636)
AUTHORS Long,S.R.
JOURNAL Unpublished (1990)
STANDARD full staff_review
REFERENCE 2 (bases 850 to 1990)
AUTHORS Barnett,M.J. and Long,S.R.
TITLE DNA sequence and translational product of a new
nodulation-regulatory locus: SyrM has sequence similarity to NodD
protein
JOURNAL J. Bacteriol. 172, 3695-3700 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.R.Long, 02-APR-1990, for release after publication.
FEATURES from to/span description
pept 991 1971 SyrM protein
BASE COUNT 534 a 769 c 792 g 541 t
ORIGIN
1 gcatgcgcga cgagctgctg aacgagagtc tgttcttcgg tctcagccat gccagcaggg
61 ccatttccaa atggtcgacg actgcaatac gttccggccg cactcgtcgc tcggataccg
121 caccccggca gcctatcctg ggatcatcgc cgcaaccgag ccagttggtg tatcgaaaac
181 ggtcgaggct ctaatcgccg ctggatgaaa agttcagtgg caggtccgaa aggccacgta
241 ttcggagcca acctcctcta gcgatcgctc tctcctgaaa acttgacagg gcgtcccccg
301 accgccttgc cgaggcggcg cgtgcccgac ggcggcctcg acagcaagcc ggtctcgacg
361 gtcttctatc ggcgctgcag gcaatgggca tttgaaaccg tttcaagagt ctccttaaca
421 gagctcgcgc cgggtgaaga ccctgtatga tggatagtga agtgcagggg cggctgctta
481 ccggccgttg agtgaggcta tgtatccaaa acgtggatga cagtcatgcg aaaacgcgat
541 ttctcctgtc tcgccgaatg ccattacaag aacgttagat ataaccggag ggcagaagtg
601 ggcagcgaga tcaattccga cagactcctt tgcaatgaaa gcatgatttg cgaccagggc
661 ctcagatgtg ttgaggcagt tttcttcagt aagtttcagg cggggccact ttggcgagat
721 cattttgtgg cttgctcgtt ggcactgaaa tatgatgtga gccggcaatt cctcgaaagc
781 attctggacc agcgggtctt cgacatccat caggctaggc ctgcgaggcc cggggggcgt
841 cacagccaag tacttctgat taacggaaaa tcacgacgct tcccatttgt cggcgggcca
901 ggaaagtctt tggcacgcac acccatattg tgacgacctg gatcttcggg atgtggcgcg
961 ggaacattac ctgccgagga gaatagatcc atggatcagc ccacttggaa gcggccgcat
1021 agggccaaat ttgccggtgt cagtgacgcc gcacaacagc ggcaaatgcc gaaccttgcg
1081 tccatcgacc taaacctact ggtggatctt gaggctctgc tgcagtatcg gcacattact
1141 caagcggccc agcacgtcgg ccgtagccaa ccggcgatga gcagagcctt gtcaaggctg
1201 cgcggcatgt taaaggacga tcttctggtt gccggctcta gaggcttggt cctgacgcct
1261 ctggctgaat gcttgaccca gatgctgcct tcagtactag acgcgattcg ccagatgatg
1321 aacctcagct tggctccggc gcaacggcga tggaaggtga caatggctat gcccgatcac
1381 caagcggtgg ttctgctgcc acatctgttg ccgcggttac acgagcgcgc ccctcatctc
1441 gacattgtga ccgatccgct tttgggcggc gcgctggggt tacttgaaca aggtgagatc
1501 gacgtggtcg tcgggcagat gggcgccgct ccgctcggct acttgcggcg cagactctac
1561 gccgacagct tcacctgcgt gctacgccac aatcacccgg ccttggcgca ggagtggacg
1621 atcgaggctt tcgcggcctt gcgccacgtc gccattgcct cggaacctga cgagcttttc
1681 ggtcagatct atgacaggtt aaccaaactc ggactgcagc gtggtgatcc gatggtggtt
1741 tccaccgtac tgaccgcggc ggtcttgatc gcggcgactg actcagtgct ggttgtacca
1801 agccgcgtgg cgacccgtgt cgccgccatg ctgtcgctcg cggtgattcc cccgcctgtg
1861 gaactgagac catacgaagt cgcgctgatc tggcacgagc ggtgccatcg agacccagag
1921 catcgctggc tgcgtggcga aatcgccgcc gcagcgtcca cggccggtta gacgaagact
1981 agcgaatagg cgaggatggg tgagcataat gaatgacgac ggtttgatga ttgactatcc
2041 cggggcggtg gaggacccgg tcgatgcgcg atttgcccga tgactcacac ccgggagggg
2101 gtgagtccga gcgcgcctac cacatgcggc tgggtccaga caccgccgtt ttcgtcaacc
2161 ggccgaagtc gcaagggggc acgagcgagg atcgagttgc aacgatgagc cgtcatagcg
2221 tcatagcggc tgtgcacgcc ccaatcccgg cgggttcgcg gcgttgtgtc aaatgctcct
2281 ctcatcgcta ccgatgacaa tacgcgaccc tgctccgaac gtaggtgttc cgtcagtcgc
2341 cccacgtcgc cagcgcggac tgcgatccgc tgagacggaa ctatgcttag ccgccgcatt
2401 ccgtcgcggt gtgggaaatt gcacgcgcag cagagtcttg acgatgctaa aaccatgagg
2461 agcccttgga ggcaccggtc ttttcagagg tgctctctct gctgcgcgcc atccatactt
2521 tggacgatct gcaccgtcga gaacgcaata acgctcctgg cttctgccta acgatacggt
2581 ccgacccggc gtgaatcgac atgagcttag cgtacccatc gtcgcaaggt gcatgc
//
LOCUS ECCCPL 615 bp ss-mRNA INV 14-AUG-1990
DEFINITION E.granulosus cyclophilin mRNA, 3'end.
ACCESSION J04664 M24616
KEYWORDS cyclophilin.
SOURCE E.granulosus, cDNA to mRNA, clone EA21.
ORGANISM Echinococcus granulosus
Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Cestoda;
Eucestoda; Cyclophyllidea; Taeniidae.
REFERENCE 1 (bases 1 to 615)
AUTHORS Lightowlers,M.W., Haralambous,A. and Rickard,M.D.
TITLE Amino acid sequence homology between cyclophilin and a cDNA-cloned
antigen of Echinococcus granulosus
JOURNAL Mol. Biochem. Parasitol. 36, 287-290 (1989)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Lightowlers, 02-MAY-1989.
This E.granulosus antigen exhibits 71% amino acid homology with
human cyclophilin.
FEATURES from to/span description
pept < 1 488 antigen ORF (cyclophilin) (AA at 3)
mRNA < 1 615 CPL mRNA
BASE COUNT 145 a 135 c 166 g 169 t
ORIGIN
1 tgggcgtgaa gtgcttcttc gacatctcca ttggcggtaa gccagcgggt cgcatcgtct
61 ttgccctgtt tgatgacgtc ccaaagacag tagagaactt ccgtgcactg tgcaccggtg
121 agaagggttt cggatacaag ggctcgaagt ttcaccgaat tattcccggt tttatgtgcc
181 aaggtggtga ttttactgcc gggaatggta ccggtggcaa gagcatatac gggagcaaat
241 ttgaggatga aaatttcaat cacaagcaca gcaagccgat gatgctctcg atggcgaatg
301 cgggtaagaa caccaatggc tcgcaattct tcatcactac cgccgtcacc agctggcttg
361 atgggaagca tgttgtcttt ggtgaggtag aaagtggcga agatgttgtt aaggatatgg
421 aggccgtcgg tagcagtagt ggcaagactt cccaagaagt actgattact gactgcggtc
481 agctgtaaat caatcggccg aacgttcact ctattccctt tattaattgt atcgcgctaa
541 tcctgctcct cattgcttct ttacccgttt gtggccgtaa cgtgtttcaa caatgaagtc
601 gttgtgcgta ttttg
//
LOCUS ECOINTDLP 2269 bp ds-DNA BCT 14-AUG-1990
DEFINITION E.coli ArgU-tRNA gene overlap with prophage DLP12 integrase gene,
complete cds.
ACCESSION M27155
KEYWORDS insertion element IS3; integrase; transfer RNA-ArgU.
SOURCE E.coli (strain K12) DNA, clones pDC1 and pDM1614.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2269)
AUTHORS Lindsey,D.F., Mullin,D.A. and Walker,J.R.
TITLE Characterization of the cryptic lambdoid prophage DLP12 of
Escherichia coli and overlap of the DLP12 integrase gene with the
tRNA gene ArgU
JOURNAL J. Bacteriol. 171, 6197-6205 (1989)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by D.F.Lindsey, 16-AUG-1989.
A homologous region to bacteriophage P22 excisionase is located at
positions 1423-1205. Positions 1727-1422 are 97.4 % identical to
phage lambda positions 31627-31322 encoding the C-terminal end of
exonuclease, and bases 1726-2126 are 92 % identical to phage lambda
bases 39972-40372 encoding the C-terminal 102 residues of the acid
P gene product and the N-terminal 31 residues of the ren gene
product.
FEATURES from to/span description
tRNA 73 149 Arg-tRNA
tRNA 48 227 Arg-tRNA precursor (alt.)
tRNA 48 237 Arg-tRNA precursor (alt.)
pept 1328 165 (c) DLP12 integrase
trns 2127 > 2269 insertion element IS3 (alpha-4/beta-4)
signal 14 19 -35 region
signal 36 41 -10 region
BASE COUNT 539 a 596 c 512 g 622 t
ORIGIN
1 agatcaaaag ccattgactc agcaagggtt gaccgtataa ttcacgcgat tacaccgcat
61 tgcggtatca acgcgccctt agctcagttg gatagagcaa cgaccttcta agtcgtgggc
121 cgcaggttcg aatcctgcag ggcgcgccat tacaattcaa tcagttacgc cttctttata
181 tcctccataa tttcagagtg ggacatattt gggacattat caccaaaaat gtcgtctatt
241 ttcctcgcat gctctgtcaa atgattaggc gcaaggtgag catacctacg aaccatttct
301 atggactccc atccgcccat ttcctgaagc actgataatg ggacgcctga ctgaatcagc
361 cagcttgccc aggtgtgtct gaggtcatgg aaacggaaat cttcaattcc tgcacgacga
421 caagctgata gccatgatgt cttgctgtcg atgcgcatct tcctgaccgc aggcgttgat
481 gttccatctg ctcgcttagc cgccttggta tgtacaaaca cccatttgtg atgcttgcct
541 atttgatcac gcaacacttt acaggcggta tcgttcagcg ccacaccaat ggcgcggttt
601 gatttgctct cttctggatt cacccaggca actcgtcgct gcatgtcgat ttgttgccat
661 tccagattta tgatgttcga ctttctcaga ccagttgcca gcgcaaactt gacgacagat
721 ttcagtggtt cggggcactc atcaataagg cgttttgctt cctccttttc cagccatctg
781 actcgcttgt ttctgaccgc tggtatcttg atgacaggcg ctttttccag ccacttccag
841 tcgcgttctg cagcacggag aatggccttt atcatggcaa gatgctttgc ctttgtctga
901 gttgatactg gctttggttc ataaacaggc agttctttac ctttcctgat ggcggcctga
961 actttctgtt tccatatttc tttcgtcttt ctgttatgca ttctgcttac agcagagtaa
1021 atctttgcct ccgagatatc tttaagcctt ataccctcaa aatgttcaag ccagaactca
1081 atccggcttt tatctgaatc gagagatttt ttatcagctt tttcctcaag ccatcttagg
1141 caggcctctt caaaagtgac atcaggtaaa tcccctagct tttctactcg ccagagttct
1201 gcttttcgct tgtcgtgcaa ctcctgagct tgccgtttgt cctttgtgcc aagagattcc
1261 ttaattcgtt tcccgcccgg gagcgaatac gaggcatacc atatttcatt tctgcggaag
1321 agtgacattt tctttcctct gttatgccat cacccgcgct cacctggaca gtatgcagcg
1381 gagactgaag cgccgcaatg caggcttgcc gtgttgtgag gtacccggat attatcgtga
1441 ggatgcgtca tcgccattgc tccccaaata caaaaccaat ttcagccagt gcctcgtcca
1501 ttttttcgat gaactccggc accatctcgt caaaactcgc catgtacttt tcattccgct
1561 caatcacgac ataatgcagg ccttcacgct tcatgcgcgg gtcatagttg gcaaagtacc
1621 aggcatcttt tcgcgtcacc cacatgctgt actgcacctg ggccatgtaa gccgatttta
1681 ttgcctcgaa accaccgagc cggaatttca tgaaatcccg ggaggtacga gtattgccgg
1741 aagcgtggcc tgtatccgga tgcagagtct tatccgtgga aatcgaacgc gcattactgg
1801 ttggttacca acttgtacca gaacatgcgg gccaatgcgc tggctgacgc ggaattacgg
1861 cgcaaggctg ccgatgaact gacctgtatg acagcgcgaa ttaaccgtgg tgagacgata
1921 cctgaaccag taaaacaact tcctgttatg ggcggtagac ctctaaatcg tgcacaggct
1981 ctggcgaaga tcgcagaaat taaagctaag ttcggactga aaggagcaag tgtatgacgg
2041 gcaaagaggc aattattcat tacctgggga cgcataagag cttctgtgca caggacgttg
2101 ccgcggtaac aggcgcaacc gtaatctgat cttacccagc aatagtggac acgcggctaa
2161 gtgagtaaac tctcagtcag aggtgactca catgacaaaa acagtatcaa ccagtaaaaa
2221 accccgtaaa cagcattcgc ctgaatttcg cagtgaagcc ctgaagctt
//
LOCUS MHVJHM2CA 49 bp ss-RNA VRL 14-AUG-1990
DEFINITION Mouse hepatitis virus leader RNA, clone JHM-a.
ACCESSION M22059
KEYWORDS .
SOURCE MHV mouse hepatitis virus (strain JHM) RNA, clone JHM-a.
ORGANISM Murine hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 49)
AUTHORS Makino,S. and Lai,M.M.C.
TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses
during passages in vitro
JOURNAL Virology 169, 227-232 (1989)
STANDARD simple staff_entry
COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT 16 a 10 c 4 g 19 t
ORIGIN 47 bp upstream of 5' end of genomic RNA.
1 cttgtagttt aaatctaatc taatctaaac tttataaacg gcacttcct
//
LOCUS MHVJHM2CB 44 bp ss-RNA VRL 14-AUG-1990
DEFINITION Mouse hepatitis virus leader RNA, clone JHM-b.
ACCESSION M23059
KEYWORDS .
SOURCE MHV mouse hepatitis virus (strain JHM) RNA, clone JHM-b.
ORGANISM Murine hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 44)
AUTHORS Makino,S. and Lai,M.M.C.
TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses
during passages in vitro
JOURNAL Virology 169, 227-232 (1989)
STANDARD simple staff_entry
COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT 14 a 9 c 4 g 17 t
ORIGIN 47 bp upstream of 5' end of genomic RNA.
1 cttgtagttt aaatctaatc taaactttat aaacggcact tcct
//
LOCUS MHVJHM2CC 49 bp ds-DNA VRL 14-AUG-1990
DEFINITION Mouse hepatitis virus leader RNA, clone B1-c.
ACCESSION M23060
KEYWORDS .
SOURCE MHV mouse hepatitis virus (strain B1) RNA, clone B1-c.
ORGANISM Murine hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 49)
AUTHORS Makino,S. and Lai,M.M.C.
TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses
during passages in vitro
JOURNAL Virology 169, 227-232 (1989)
STANDARD simple staff_entry
COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT 16 a 10 c 4 g 19 t
ORIGIN 47 bp upstream of 5' end of genomic RNA.
1 cttgtagttt aaatctaatc taatctaaac tttataaacg gcacttcct
//
LOCUS MHVJHM2CD 44 bp ds-DNA VRL 14-AUG-1990
DEFINITION Mouse hepatitis virus leader RNA, clone B1-d.
ACCESSION M23061
KEYWORDS .
SOURCE MHV mouse hepatitis virus (strain B1) RNA, clone B1-d.
ORGANISM Murine hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 44)
AUTHORS Makino,S. and Lai,M.M.C.
TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses
during passages in vitro
JOURNAL Virology 169, 227-232 (1989)
STANDARD simple staff_entry
COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT 14 a 9 c 4 g 17 t
ORIGIN 47 bp upstream of 5' end of genomic RNA.
1 cttgtagttt aaatctaatc taaactttat aaacggcact tcct
//
LOCUS MHVJHM2CE 45 bp ds-DNA VRL 14-AUG-1990
DEFINITION Mouse hepatitis virus leader RNA, clone JHM-2c-e.
ACCESSION M23062
KEYWORDS .
SOURCE MHV mouse hepatitis virus (strain JHM-2c) RNA, clone JHM-2c-e.
ORGANISM Murine hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 45)
AUTHORS Makino,S. and Lai,M.M.C.
TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses
during passages in vitro
JOURNAL Virology 169, 227-232 (1989)
STANDARD simple staff_entry
COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT 14 a 10 c 4 g 17 t
ORIGIN 47 bp upstream of 5' end of genomic RNA.
1 cttgtagttt aaatctaatc taatctaatc taaacggcac ttcct
//
LOCUS MHVJHM2CF 40 bp ds-DNA VRL 14-AUG-1990
DEFINITION Mouse hepatitis virus leader RNA, clone JHM-2c-f.
ACCESSION M23063
KEYWORDS .
SOURCE MHV mouse hepatitis virus (strain JHM-2c) RNA, clone JHM-2c-f.
ORGANISM Murine hepatitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Coronaviridae.
REFERENCE 1 (bases 1 to 40)
AUTHORS Makino,S. and Lai,M.M.C.
TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses
during passages in vitro
JOURNAL Virology 169, 227-232 (1989)
STANDARD simple staff_entry
COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT 12 a 9 c 4 g 15 t
ORIGIN 47 bp upstream of 5' end of genomic RNA.
1 cttgtagttt aaatctaatc taatctaaac ggcacttcct
//
LOCUS MUSPDGFA 906 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse platelet-derived growth factor A chain (PDGFA) mRNA,
complete cds.
ACCESSION M29464
KEYWORDS platelet-derived growth factor;
platelet-derived growth factor A chain.
SOURCE Mouse F9 embryonal carcinoma cell line, cDNA to mRNA, clone F9A5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 906)
AUTHORS Mercola,M.
JOURNAL Unpublished (1989)
STANDARD full staff_review
REFERENCE 2 (sites for [1])
AUTHORS Mercola,M., Wang,C., Kelly,J., Brownlee,C., Jackson-Grusby,L.,
Stiles,C. and Bowen-Pope,D.
TITLE Selective expression of PDGF A and its receptor during early mouse
embryogenesis
JOURNAL Dev. Biol. 138, 114-122 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Mercola, 26-OCT-1989.
FEATURES from to/span description
pept 63 653 platelet-derived growth factor A chain
BASE COUNT 217 a 250 c 270 g 169 t
ORIGIN
1 cgggctggag cgcccgcccc ggccctcgcc tccctgccga gcttccagcg cctcgggacg
61 cgatgaggac ctgggcttgc ctgctgctcc tcggctgcgg atacctcgcc catgccctgg
121 ccgaggaagc cgagataccc cgggagttga tcgagcggct ggctcgaagt cagatccaca
181 gcatccggga cctccagcga ctcttggaga tagactccgt aggggctgag gatgccttgg
241 agacaagtct gagagcccat gggtcccatg ccattaacca tgtgcccgag aagcggcctg
301 tgcccattcg caggaagaga agtattgagg aagccattcc tgcagtttgc aagaccagga
361 cggtcattta cgagatacct cggagccagg tggaccccac atcggccaac ttcctgatct
421 ggcccccatg tgtggaggtg aagcgctgca ctggctgttg taacaccagc agcgtcaagt
481 gccagccttc acgggtccac caccgcagtg tcaaggtggc caaagtggag tatgtcagga
541 agaagccaaa attgaaagag gtccaggtga ggttagagga acacctggag tgtgcatgtg
601 cgacctccaa cctgaaccca gaccatcggg aggaggagac agatgtgagg tgagatagag
661 ccggccgccc ctctcccggg gacacgatgt gcgtggcgtg tgacattcct gaacatacta
721 tgtatggtgc ttcattgcca atgtgcgtgc ggtctttgtt ctcctccgtg aaaaaccgtg
781 tccgaggaca aagagacagt gtccgtttgt tcagtgtgac atcaaagcaa gtactgtagc
841 actcagagag acagtgagaa agcttccttg tcacagagag cgaaaacaaa accacaaaaa
901 agaaaa
//
LOCUS ORFPRTPS 660 bp ds-DNA VRL 14-AUG-1990
DEFINITION Orf virus homologue of retroviral pseudoprotease gene, complete
cds.
ACCESSION M30023 J04371
KEYWORDS pseudoprotease.
SOURCE Orf virus (strain NZ2) DNA.
ORGANISM Orf virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Parapoxvirus.
REFERENCE 1 (bases 1 to 660)
AUTHORS Mercer,A.A., Fraser,K.M., Stockwell,P.A. and Robinson,A.J.
TITLE A homologue of retroviral pseudoproteases in the parapoxvirus, orf
virus
JOURNAL Virology 172, 665-668 (1989)
STANDARD full staff_entry
COMMENT Draft entry and clean copy of sequence [1] kindly submitted by
A.A.Mercer 12-JUL-1989.
FEATURES from to/span description
pept 100 579 pseudoprotease homologue
BASE COUNT 157 a 170 c 166 g 167 t
ORIGIN
1 gctggacttg cgccgcgcca tgctgacccg cccctgattt ttctggagaa agtgtaaatt
61 gtacaccccg tagtcgatcg gccgctcgcc accctagcca tggagttctg ccacacggaa
121 actcttcagg ttgttcgcct cagccaaaac gccactatac cagccagagg atctcctggt
181 gccgcaggat tagacctatg tagtgcatac gactgtgtca tcccttcgca ttgcagtcgc
241 gtagtattta ctgaccttct gataaagccc ccttctggtt gctacggacg catagcacca
301 cgttctggcc ttgcagtcaa acactttatc gacgtgggcg ctggcgtgat cgacgaagac
361 taccgaggca acgtcggagt ggtgctgttt aactttggaa acagtgactt tgaggttaag
421 aagggagaca ggatagctca gctaatatgc gaacgcatat cgtgcccagc agttcaagag
481 gttaactgct tggacaacac tgaccgcggc gactctggat ttggttctac tggatcgggt
541 gcttgtggtg gccgcgatac agcgtggtac atcagctaat caaaattgta aaatgtagct
601 tctttttatt cgagagtctc gcacagttgc gtagataaca ctaattacaa cacactttaa
//
LOCUS SP8ALU156 192 bp ds-DNA PHG 14-AUG-1990
DEFINITION Bacteriophage SP82 Alu156 promoter region DNA.
ACCESSION M22133 M20661 J03904
KEYWORDS promoter.
SOURCE Bacteriophage SP82 (in B.subtilis) DNA, clone Alu156.
ORGANISM Bacteriophage SP82
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 192)
AUTHORS McAllister,C.F. and Achberger,E.C.
TITLE Effect of polyadenine-containing curved DNA on promoter utilization
in Bacillus subtilis
JOURNAL J. Biol. Chem. 263, 11743-11749 (1988)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by E.C.Achberger, 14-SEP-1988.
The nucleotide sequence contains a promoter from bacteriophage SP82
that is recognized by the major B.subtilis RNA polymerase, the
sigma-43-containing enzyme. DNA with this sequence displays
sequence-dependent DNA curvature.
FEATURES from to/span description
site 131 136 -35 region
site 147 152 -10 region
mRNA 159 > 174 Bacteriophage SP82 mRNA start
mRNA 156 > 192 Alu156 mRNA
BASE COUNT 62 a 45 c 38 g 47 t
ORIGIN
1 gaattccccg gatccgtcct tccaagaaag atatcctaac agcacaagag cggaaacacg
61 ttttgttcta catccagaac aacctctgct aaaattcctg aaaaattttg caaaaagttg
121 ttgactttct ctacgaggtg tggcataata atcttaacaa cagcaggacg ctaggacgga
181 tccggggaat tc
//
LOCUS SP8BAL129 156 bp ds-DNA PHG 14-AUG-1990
DEFINITION Bacteriophage SP82 Bal129 promoter region DNA.
ACCESSION M22134 M20660 J03904
KEYWORDS promoter.
SOURCE Bacteriophage SP82 (in B.subtilis) DNA, clone Bal129.
ORGANISM Bacteriophage SP82
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 156)
AUTHORS McAllister,C.F. and Achberger,E.C.
TITLE Effect of polyadenine-containing curved DNA on promoter utilization
in Bacillus subtilis
JOURNAL J. Biol. Chem. 263, 11743-11749 (1988)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by E.C.Achberger, 14-SEP-1988.
The nucleotide sequence contains a promoter from bacteriophage SP82
that is recognized by the major B.subtilis RNA polymerase, the
sigma-43-containing enzyme. This is the site of transcription
initiation for a major early mRNA for Bacteriophage SP82.
FEATURES from to/span description
site 60 65 -35 region
site 83 88 -10 region
mRNA 95 > 126 Bacteriophage SP82 mRNA start
mRNA 95 > 156 Bal129 mRNA
BASE COUNT 60 a 29 c 29 g 38 t
ORIGIN
1 gaattccccg gatccgtcac ccctaagaaa aaatatctac agaaaatatg aaaaagttgt
61 tgacatttct tcccatccat gctataataa agtcatagag aacaacacta tcaaatgaat
121 ggagagattg atatgatgaa tggggatgac ggatcc
//
LOCUS XELBGAI 2027 bp ds-DNA VRT 14-AUG-1990
DEFINITION X.laevis adult beta-globin gene (beta-AI), 5' flank.
ACCESSION M27933
KEYWORDS beta-globin; globin.
SOURCE X.laevis adult DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2027)
AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
Knoechel,W.
TITLE Sequence analysis of the upstream regions of Xenopus laevis
beta-globin genes and arrangement of repetitive elements within the
globin gene clusters
JOURNAL Mol. Biol. Rep. 14, 17-26 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by W.Knoechel, 13-SEP-1989.
BASE COUNT 792 a 266 c 284 g 685 t
ORIGIN
1 agatcttgat acgttaactt tactagaaaa taatttaaac cccaatagcc tggttttgct
61 tccaatatgg tttaattata ccttagtttt caggataatg gatctttctg taatttggat
121 cttcatgcct taactgtacc agaaaatcat ttaaacttta aataaaccca atttgcttcc
181 agtacagttt aattatatct tagtttggat aagtacaagg tactgtttta ttattacagt
241 gaaaaaggta atcattttaa aaaaaaaata tatattattt ggataaaatg gagtctatgt
301 gtgatggcct ttccgtaatt ctcggtttct ggcaaacgga tctcatacct gtaataggta
361 tataaaaaac acacattaaa aaatactaca tatatattta tattcttttt tttttttaaa
421 gtgtgtaaat tcatgtcttt aaaataataa aatgtattta tatatatata tatatatata
481 tatatatata tatatatata tatatatata tatatatata tacttcaaca aaaaatttgc
541 caaattcata catacaaaaa aaataaaata ataattttaa ataattgaat ctgtctagct
601 gtttatattc tctgctctgc tggatctgac tcctgaaaaa atgtgcagaa gccatttgat
661 ttacagagct ggaggagaat ggctacatta gtttaaaagc cagaaccagg agaggatgca
721 ggcaacaaaa atggatacac acaaattaac gtctattaca attatattta caaataacct
781 taaagccaac ttttttaaaa attattatat attgtaaagt tgcttagaaa ccaatttttt
841 acttataggg agcaaaaaat agggagatcc tgtaaaacag aagctgcacc aaacatagat
901 caagctatcg agctttccat acgtatacat ttatttgaaa ggcactgtta aggagccacg
961 gtgctgtaca gtgcataaaa gtacaatata tatatataaa agtatacaca gggaagacaa
1021 atcacacaat gaatatacac agagctcata tcagaacaaa cagcttaagt gctttgtggt
1081 aagagacaca gtgggaagga ggtccctgtc ccgtagagct tacagtctta cagagctcag
1141 ttcaatccat atgactcaac catttaatta ataaaataat ttgcctttta atcattaatt
1201 aattccacac ttccatgtat aaatggaata tatatgtaag atttatatgt aatagctata
1261 tatgtaagat ttgatatttt ttttgtagga ataaaatgaa aatcaggcaa ataaaaaaca
1321 acatatatat gtttaaaaac ggtgttaatt tctatgcaac atgacatgaa aaagactttt
1381 caatattttt acatatgtat acataagata tatgattgaa ccatttgaat aataaaatag
1441 cttttgcctt accatcatta attattctac catactgtat gtaaaaagca ttgctatatg
1501 taagatttga ttatattttg ttgtaggaat aaaatgaatt ccaggcatat aaaaacacat
1561 ttataaaaaa catttataaa aaacactaca tatacatata catatatata tatatatata
1621 tatatatata tatatatatg tatatatata agtttaaaaa gtgtgttaat ttataatgtc
1681 tttctggaaa tagaatttca cacttcattg tatacaaaat tattaatatt tgtaatattt
1741 gattatatta tgttgtaggg ataaaatgaa taccaggcat ataaaaacac actttaaaaa
1801 aaaaaaaata catagataat aatttaataa tttgtattta ttttttctta atattctagc
1861 tctgctgtaa taaaaaaaac atgcatctaa aagtggtgcc aaatgggagg gtacaaatgg
1921 gctgggcaaa tgtaacgtgt gcttatccta gccaatcaac aggcagagtg gaaaggggca
1981 gtgcatcctt acagctacat aaagtctgat ggatggagaa ttagagc
//
LOCUS XELBGLI 6777 bp ds-DNA VRT 14-AUG-1990
DEFINITION X.laevis larval beta-globin gene (beta-LI), 5' flank.
ACCESSION M27934
KEYWORDS beta-globin; globin.
SOURCE X.laevis larval DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 6777)
AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
Knoechel,W.
TITLE Sequence analysis of the upstream regions of Xenopus laevis beta-
globin genes and arrangement of repetitive elements within the
globin gene clusters
JOURNAL Mol. Biol. Rep. 14, 17-26 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by W.Knoechel, 13-SEP-1989.
BASE COUNT 2121 a 1247 c 1228 g 2181 t
ORIGIN
1 gaattcaaag cttttttttt tattaaacag ttttattgta ttttcaaacg aaaaacaagc
61 agaggtaaga cagtcaacag gttaacatta atgctgcgtg aagggtacta tacattgtgt
121 ttacattaca acttgttgga cattgatatg tcacttctgt gaatttgaag ctttacattt
181 aactaaaatt tgtgatggaa tgtctaacct gcatcccagt ccaaagaaat ttcaaagtag
241 aagatgacat aatgttggta gagatagtga tgagcggatt ttttgccagg tatggatttt
301 ggcaaaattc cgcgcttctt cgtctgcaat tttttttccc aaaactgcag caaaaatcca
361 ccataaccaa aaagtcacaa agacaaaatt gtcgcagaga caagaaagtc acagtaagac
421 ttgatgctcc tgattcactt gcactgacca caccactgta ttaaagggca gagaggggac
481 tataatgcag agacccatgg tccaggctcc tatgaccatg gggtctgctg tatagctgag
541 ctagttacac cagtaaggcc aaaacaaaat ggacttgcat gctggaataa tatgcaaaac
601 tgctgcagtg cctgttttta atctgctggg taaagagtgc aattacaaat gttaggataa
661 ttgcttatca tactctgctg catacactta ggggcccatt tacttagctc gagtgaagga
721 atagaggaaa aaaacttaga atttcgattg ttttttttgg ctacttcgac ttcgacctta
781 gacttcgaat cgaacgattc aaactaaaaa tcgtttgact attcgaccat tcgatagtca
841 aagtactgtc tctttaagaa aaaactttga ccacctagtt cgccacatta aagctaccga
901 agtcaatgtt agcctatggg gaaggtcccc atatgctttg ctagcttttt ttggtcaaaa
961 ataaaccatt cgatcgatgg attaaaatcc ttcgaatcga tcgaacgaat aatgctaaat
1021 cctttgactt cgatattcga actcgaagga tttaacttcg acagtcgaaa atcgagggtt
1081 aattaaccct cgatattcga ccttaagtaa atttgcactt attattgcaa atatttgggt
1141 ccatgacaga gtcatctgta tataatgtga aattacaaat actggtgcct cccctgtttt
1201 actttgctct atgtgagaaa aataatggag tcagtgccat acatatcctt gtgtgtatgg
1261 tggaaattgt agatgtcttg ggggcaaatt tactaaaggg cgaagtggct aacgctaggg
1321 aaaattcgcc agcgttacgt caatttgcca cttcgacaat ttagtttacg gttaccatgg
1381 cgaaaattcg ctagcaatgt aaatagacca gcgcaacttc acaccctaac gctggcgaag
1441 tcaggatgcc cacattcccc ctacatttcc taacatatgg cacctgaatt atactagggg
1501 cacatgtgta gggctttttt taagtttccc tgggcctctg tagtgttatg tatttgctgc
1561 agcaatatac atgtatacaa atttccaatc ggtagcgtaa cctcgaaccg ctgatcgtaa
1621 catcactagc gcaacttcgc aaatgattgg taacttgtgt gcaacttcgg atcttcgtga
1681 atttgcgcag ccactgcgaa gctatgcctg gcgaagtgcg gcgaatgcaa gtctcgggat
1741 ctccgcaggt aagtaaattt gccccatggt cagaggcaag gccagattat gtactaggtg
1801 acctaagaat caatactgtc cattctaaaa gtgcaagttc ataagtgccc gcaactacag
1861 aaacaatagg ggagaactaa caatctgttg taaacaacat tacaaggttg gctccctcat
1921 tgtttatatt atagctgtat aactgtaatg atgagtacga tctaagatat aatgaatctt
1981 attgcaggca aaacaatcct gttgattaat taatgcttaa attatcagaa attacaaaaa
2041 cctcaggtcc tgtgcattct ggataacagg tcccatacct gtactaaaac atgggaccag
2101 ggtgtctgca ttgatcaatc acctctttta tgattgtttt gggccatcac tctacttcaa
2161 gatgctgatg atatattacc aataaatgtt atattatata cttaaaaatc ttaattgaat
2221 taatatagtc aaatccttga tggagacaga cctagtagta tcatggataa taaaactagc
2281 aacagcaagc attggcccga cttgccatct tggagtcttg aaggaatctt ccacctttga
2341 ggaaaattgg agacagcttg tctatttttc aacctcttct aatatctaat tgaagaagat
2401 ccttacatac tgtatgtggt ggaaaatgca tgtttcttta aagatatgct gattgttgca
2461 ccaatctttg ctcaaagatc ttataagaaa tctttaagca tgactgtctg caactatgac
2521 tattataaaa tcctttccat gtagagtttt catccttttt gtgggtcaaa ggctgcccct
2581 cagcaatatc aggggaatga aattaaagtc acaaagagca aaacaattcg caccaatagg
2641 actaaaaatc cacatctcgc aatgcaatat tgttccttaa actgttattg taattgcgaa
2701 ttttaattgg ccattgcgga ttttaattgc gcactcttaa gaagtgcttg aagttgtcgt
2761 aatcttttgg agcaaacata acgacttttt cattaagagg tttaattaca ttgacgcatt
2821 ggcgcaaact ataaaatttg caaatggtct tccactgtcg gaagtggtcg caaaacagtt
2881 tctgggctcg caaaagctat attaaatttg cgaaagcaaa atgtgttcgc gcaaaggtat
2941 aacttttgca ttgcgaatag ttttccgtta gcaactttta ttgcattccc ctgtaaatat
3001 ctaataagca tggcctcgag cccaaaagac acctttttag gtaaagaaat aaatggggat
3061 ttcattctat aagtaattga atttgcacta aatattagta agtcggtttc ttgccctact
3121 ccaaccaaac tcaagaactt tcatttatta aagcacaaga aactctaact cacatattaa
3181 caaatagtta tagttggtca aattgtagct cagttaaggg tatattatat atttctgttt
3241 gttcgggtgt ggtgggccag tttttgaaac agtcaactgt tttacttaca gcagatgtcc
3301 aggtggcttg ccatatcttt gtcaaaaaca aatatattgt cagtattgtt ttttcaacat
3361 ctgccttagt tagataagaa ttgacaatat agaaccaagg gtatctaaaa atgctgctct
3421 gccttgcaat ctaatggtgg gtgggtccaa tgatttgtgt atttgcctga aaaaaaggga
3481 atattgttct ctcccttacc ttttttccca aagaaattgt ttcttttaat gtgtccaaaa
3541 tacagcaact tcagtcttgt gatttgagct tcaagtgaga tcagagacat gatttgctca
3601 gggatccatt tgtttgtgtt cctttcttcc acagtattct caaaagtctt tactaaaacc
3661 aaaatttatt agtacatttc cttgtactgc cacatttaca tctattaaga gtgacatcaa
3721 atactataac tggacaattc ccaaagtaac ctcagtacat gttaaaatat cgttgacgtc
3781 ttccatgtct cattctaagt gtcaatctgc tacttgacta taagattttt gttgtttata
3841 agtgacccag taaggcaaaa gctatacata actagctgcc cacaaactgg ccaatataaa
3901 gggagaagga aaattgttgt tcccactgga gttgttcccc tggttgggga aaaaatacta
3961 ttttgtatac aaaatgctgt tctggggtca ccaggagaga gcttctgatg ttcagggcca
4021 ggtagtgaca taagcctgag aataagactt aggggcacat tcaagctcgg gtgaatgaat
4081 agagggaaaa aaactcctcg actatcgaat tggcgtaaat tcgcctgagt agaatgattc
4141 aaatagattg agcgaaaaaa cgctgcgact attcgcccat cgatagtcga agtattgtct
4201 cttttaaaaa tcatttgact gcctacttcg ccagataaaa cctaccgaat tgctttaaaa
4261 gcctatggga aagtcccata ggcttctttt ctacgttttt gatcgaataa aaaggcattc
4321 gatcgaatat tcgatcgaat gaaaatcctt cgattgaata ttcgatcgtg cccattcgat
4381 tattcgccag cgcgtaaatt tgcccgaatt ccctattcga ttccattctc cagtcgaatt
4441 tcgagggatt taacccctcg aaattcgacc cttgatacat ctgcccctta gtgtgccaac
4501 ttgctcattg tgtgcatgtg tgtgacatgc cataaggctc tcttattaag cgcatgtatg
4561 tgatgaaaca taaccatccc cactgggagc tccttcatgg tttagcagaa tagcgctcac
4621 taccagcttt ttattcaaaa actgatattg tttccctcaa ccagagtata agctctatta
4681 gcttgcacca tcagtggggg aatttttttt cccctattag gtttccttta agctgcaaac
4741 ttgacctctc cttcccatct gcagtatatt gaccaatata agggaccaac cccacagtaa
4801 gatatctatt gtgtatgttt caaaatccca ttaggtaagg acagtacatt tatgtggtcc
4861 ctataggccc tcattatgat ctaattattg ggtcaatccg tcgtttttgg tacagtggtg
4921 ccagccttga actagagtgg taaaagaggg ctttgttggc tctttgagca tatcatagag
4981 ccttcagcaa aagttcactt tttaaatgta caccaatgaa tggagatttt tgaggccccc
5041 aaaattgtat tgctgtagat cctgcaacag ccaatgatcc ctttatctgc tctgaaatct
5101 tttttgtcgc tgctgctgct actggttaaa tacagtatag ttgaaaaaat ataggctttg
5161 agaataaaac ctgatgttca tttgcttttt aattattact ttacatcccc tttaaaaata
5221 tatacacatc actattccat gcattacact catttttaat tagacaaatc tataagaaat
5281 tctgcgagat gacacttttc atgataagca ttttgtaaaa ttgtaatatg ttcagttttt
5341 ttttttaaaa gttcaatgcc acactttatt tcaaaatgta ttaaggtgca gtaattatat
5401 taaataaatg tattgtaggg tacatgaata tatgtaacat ttaaaatgtg tgtttatgca
5461 cttctttcaa gtacagtaca tttgcactgt gatcaaatat taatttgaac tttaacagtc
5521 ctatctctac acctttatct tgtcctgggg atcagtctgt tttttagtga tatcttgtaa
5581 cacagaactt taaacaaaag ggctccgttt tgcacgtaga cctgtttgtg aatccatggc
5641 aattctgcca cctaaagcat acataacatt tagcatcttt ttttggtgtt ttttagacag
5701 atgatggtat agccatttgt gcaaataaaa tcagatattt tatcccaaat tatttgtgct
5761 gttagttgta tagggtttca acaaaatatc ttatttatca tttagagcaa atacttatgt
5821 gttacagtat ctgcaagtag tcaagtttga gcttaaaatt cccataattc ataattaagg
5881 ggatggctta gtataaaaaa acgtggaaaa aaaaacgtgt acagttatgc ttttatattg
5941 ccttgtaagt tcttttttat actattatta ttttaatgac cacgttttga attattgcat
6001 ggatttatga aaaccagttt aattgcaaag aggctcctaa aaattattta ttataagtta
6061 aaatttagta tatgcgtgca tgtatatgta acaatgcact ctcatatcta gtaaaaatca
6121 aagttgaagt aaagtgtata actaagtttg acctttctca ggcattaatg atcccagagg
6181 aaggccacac tatgtgacca aaacattgga ctacatttat taaatacatt taccttgatt
6241 tcttcaacac aatttgaaag ttcctccatg agctaatata aatttataaa gagagagagt
6301 gagtaaaaca tttttatcag aaaacagtgg cagagtaaat tctttcatac ttacaaaaga
6361 gtgctactat gcgcaacatt aacttgacat ttttgaattg tacctaatgc aattcatgat
6421 atttaaattg aatacattaa ttttaattat ttaattgtcc tgaaatctct acaggttcaa
6481 aaaaataatt ccatttatta catttatttt gtacacttaa ttatctactg ttaagtgtca
6541 caattgccct catttgatgt gggtttaagt ttcatgttgt tataaagaat caactttaca
6601 atttaagaac tatatggcat tccacatata caaaagatat attagcttaa ggttaaaaat
6661 ttattttgaa ggcaataggg tggggtggag gaaaaaaaat atgacacagc agaaatgcac
6721 aatgggtgtg actcagcatg gccatataaa gcaaggccaa caactcaaag gaacagc
//
LOCUS XELBGLII 910 bp ds-DNA VRT 14-AUG-1990
DEFINITION X.laevis larval beta-globin gene (beta-LII), 5' flank.
ACCESSION M27932
KEYWORDS beta-globin; globin.
SOURCE X.laevis larval DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 910)
AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
Knoechel,W.
TITLE Sequence analysis of the upstream regions of Xenopus laevis
beta-globin genes and arrangement of repetitive elements within the
globin gene clusters
JOURNAL Mol. Biol. Rep. 14, 17-26 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable copy of sequence [1] kindly
submitted by W.Knoechel, 13-SEP-1989.
BASE COUNT 322 a 152 c 128 g 308 t
ORIGIN
1 ggatcccttg tctggaaacc agttatccag agggctccaa attatggaaa ggccatctcc
61 catagactca attttaatca aattattatt attttttttt tacaaattaa tgcctttttc
121 aatgtattaa taaaacagta ccttgatccc aaaattggag gcaaaacaat ccagtttgtt
181 ttatttaatg tttaaatatt ttttttaata ttgttttgat ccaaattaca gaaaggcccc
241 ttatccagaa aaacctccat ttaggataag gataacaggt ccaatacatt cataccctgt
301 acaaatctat gctatgttta attacttata aatagatcca catttcaatg gatatttcta
361 gaatatcgta ataacggtat atacttgttc aaagacaaac acatttaatg acctatgcct
421 aactggaata acagtcaagg aaatttaatg gaataatagg tatttcggag ctttccattt
481 attaacccta caaacaacta gttgttgttt caggaaacag cagtagttct atttggctta
541 catcttgaac aaaagcaaag ttgctatagt tttctttttc gtgtaaggaa agaaatgact
601 tgtgtcttta tctctacatt aaaaatgtat ctgccacaca gaatactttc tttttttaac
661 ttatctatag ataacgtatg tgcacccaaa ttgtagctgt gttacatcag cataattaag
721 tgcacacatg aagaaaaaaa atgacagatt gacaaaatgt tatattatat ggtaaggtct
781 cttggataat agcccttatc agtcataact ggttacaaat acagaaaaaa tgaggtgaca
841 cagcataaat gatatgaata cgtcactaac ttacacccct ataaatcaca aggttaaaat
901 attttttttt
//
LOCUS YSCADK1 2367 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae adenylate kinase (ADK1) gene, complete cds, and
histone (H2A-1) gene, 3' end.
ACCESSION M18455
KEYWORDS adenylate kinase; histone.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2367)
AUTHORS Konrad,M.
JOURNAL Unpublished (1988)
STANDARD full staff_entry
REFERENCE 2 (sites for [1])
AUTHORS Konrad,M.
TITLE Analysis and in vivo disruption of the gene coding for adenylate
kinase (ADK1) in the yeast Saccharomyces cerevisiae
JOURNAL J. Biol. Chem. 263, 19468-19474 (1988)
STANDARD simple staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly provided
by M.Konrad 21-JUL-1988.
FEATURES from to/span description
pept 917 1585 adenylate kinase (EC 2.7.4.3)
pept < 1 359 histone H2A-1 protein (AA at 3)
ORF 2009 > 2367 ORF
BASE COUNT 777 a 473 c 435 g 682 t
ORIGIN 1 bp upstream of HindIII site; chromosome 4.
1 aagcttctca atctagatct gctaaggctg gtttgacatt cccagtcggt agagtgcaca
61 gattgctaag aagaggtaac tacgcccaaa gaattggttc tggtgctcca gtctacttga
121 ctgctgtctt ggaatatttg gccgctgaaa ttttagaatt agctggtaat gctgctaggg
181 ataacaagaa gaccagaatt attccaagac atttgcaatt ggctatcaga aatgatgacg
241 aattgaacaa gctattgggt aacgttacca ttgcccaagg tggtgttttg ccaaacatcc
301 atcaaaactt gttgccaaag aagtctgcca aggctaccaa ggcttctcaa gaattataag
361 atcggttctg gtattttaaa gaaggcggaa ggaactaaac tgcttctcca ttgtaactat
421 attaagtgtc tattagacct ttctgttttt cctttttatt tttctatata taatttattg
481 ggtatataat taacggtaac atatatcatg catgatataa attttttgtt gtgtcttcct
541 gttttctctg ttgctctgtt taggttcatt gggcactgtt gcttacttga acatgattga
601 gtagcaaaca cattctgtgc tttgttcact ttgatagtgt gacgaaattt tcacaaccaa
661 aggaaaaaag ctcacgattg cgcgatcctg tcacggagaa ctgtccgcag cagcccgcgg
721 cttctgtcct tattcgccca tttttttttg attttcgact ttttcactct ggctagtttt
781 attacgcata tttgtatata tatatacgca taaatttctg aaatggatta cctggtttat
841 cagaaacgag agctttaata aagaaacatt aacgtttctc tggtaaagtc accacacagc
901 atcaaatata acagtaatgt ctagctcaga atccattaga atggtcctaa ttggcccacc
961 tggtgccggt aaaggtactc aagctccaaa tttgcaagag cgtttccatg ccgctcactt
1021 ggccactggt gacatgttga gatctcaaat cgcaaagggc actcaattag gtttggaagc
1081 aaagaaaatt atggaccaag gtggtttagt ctctgatgac attatggtta acatgatcaa
1141 ggatgaattg accaacaatc cagcttgtaa gaatgggttc atcttggacg gtttcccaag
1201 aaccattcct caggctgaaa aattggacca aatgttgaaa gaacaaggaa ctcctttgga
1261 aaaagccatc gaattgaagg ttgatgatga attgttggtt gccagaatta ccggtagatt
1321 aattcaccca gcctctggca gatcctacca caagatcttt aacccaccaa aggaagacat
1381 gaaggatgac gtcaccggtg aagctttagt tcaaagatct gatgacaatg cagacgcctt
1441 gaagaagaga ttagctgctt accatgctca aaccgaacca attgttgact tttacaaaaa
1501 gaccggtatc tgggctggtg ttgatgcttc ccaacctcct gctactgttt gggctgacat
1561 cttgaacaag ctaggtaagg attaatgacc ttgcgcaatg tcttctaaat atcttttctt
1621 ttttttaaat taatgaaaat ataataaatt tttgtatgtg agtacatata tccgcagtat
1681 actataaatg aaataataac ataagtgttt gtaactggct ataacgtcct taaacatgtg
1741 cactgccatt aagtacttag tcaatatttt tatcgttgag aacgaacgca aatcatatta
1801 ctaaatagat gaacaaaagc gcatcacata aacaaaaacg gtataacagt aatagtgaag
1861 ctttttataa acctataacg aaataagcgc ttttcaggtt ttataattca gtcagagttc
1921 taactggaca tcgttttgca ggggataaaa aaaaaaagga agcttcaacc cacaatacca
1981 aaaaagcgaa gaaaacagcc aatgccaaat gacaataaga cacccaaatg ggtccagcac
2041 tcccaagttt actaaaaaac cagtaacccc gaatgataaa attcctgaaa gagaagaaaa
2101 atccaatgaa gtgaagacac ctaaaattcc attattcacg tttgccaaaa gcaaaaacta
2161 ttcaaggccg tcaaccgcaa ttcatacctc acctcatcaa ccaagtgatg taaaaccgac
2221 ttcccataaa cagttgcaac agccaaaatc ctcaccactg aaaaaaaata actataattc
2281 ttttcctcac tcaaatctgg aaaaaataag caacagcaaa ctactctccc ttcttcggtc
2341 caaaacgtca gcaggaagaa ttgaatc
//
LOCUS YSCSSB1A 1842 bp ss-mRNA PLN 14-AUG-1990
DEFINITION Yeast (S.cerevisiae) heat shock protein 70 (YG101, SSB1) mRNA,
complete cds.
ACCESSION M25395
KEYWORDS heat shock protein.
SOURCE Yeast (S.cerevisiae, strain GRF-18) DNA, and cDNA to mRNA, clone
F430.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (sites for [2])
AUTHORS Normington,K., Kohno,K., Kozutsumi,Y., Gething,M.-J. and
Sambrook,J.
TITLE S.cerevisiae encodes an essential protein homologous in sequence
and function to mammalian BiP
JOURNAL Cell 57, 1223-1236 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1842)
AUTHORS Normington,K.D.
JOURNAL Unpublished (1989) Univ, of Texas SW Medical Center, Dallas, Texas
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by K.D.Normington, 09-JUN-1989.
FEATURES from to/span description
pept 1 1842 heat shock protein 70
BASE COUNT 510 a 385 c 418 g 529 t
ORIGIN
1 atggctgaag gtgttttcca aggtgctatc ggtatcgatt taggtacaac ctactcttgt
61 gttgctactt acgaatcctc cgttgaaatt attgccaacg aacaaggtaa cagagtcacc
121 ccatctttcg ttgctttcac tccagaagaa agattgattg gtgatgctgc caagaaccaa
181 gctgctttga acccaagaaa cactgtcttc gatgctaagc gtttgattgg tagaagattc
241 gacgacgaat ctgttcaaaa ggacatgaag acctggcctt tcaaggttat cgacgtcgat
301 ggtaacccag tcatcgaagt ccaatacttg gaagaaacca agactttctc cccacaagaa
361 atttccgcta tggttttgac caagatgaag gaaattgctg aagctaagat tggtaagaag
421 gttgaaaagg ccgtcattac tgtcccagct tactttaacg acgctcaaag acaagctacc
481 aaggatgccg gtgccatttc tggtttgaac gttttgcgta tcatcaacga acctactgcc
541 gctgctattg cttacggtct aggtgctggt aagtccgaaa aggaaagaca tgttttgatt
601 ttcgatttgg gtggtggtac tttcgatgtt tccttgttgc acattgctgg tggtgtttac
661 actgttaaat ctacttccgg taacactcac ttgggtggtc aagatttcga caccaacttg
721 ttggaacact tcaaggccga attcaagaag aagactggtt tggacatctc cgacgatgcc
781 agagctttga gaagattgag aactgctgct gaaagagcta agagaacctt atcttctgtc
841 actcaaacta ccgttgaagt tgactctttg tttgacggtg aagatttcga atcctctttg
901 actagagcta gatttgaaga cttgaacgcc gcattgttca agtctacttt ggaacctgtt
961 gaacaagttt tgaaggatgc taagatctct aagtctcaaa tcgacgaagt tgtcttggtt
1021 ggtggttcca ccagaattcc aaaggtccaa aagttgttgt ctgacttctt tgacggtaag
1081 caattggaaa aatctattaa cccagatgaa gctgttgctt acggtgctgc tgttcaaggt
1141 gctatcttga ccggccaatc cacatctgac gaaaccaagg acttgttgtt gttagatgtt
1201 gctccattat ctctaggtgt tggtatgcaa ggtgacatgt tcggtatcgt tgttccaaga
1261 aacactactg ttccaaccat caagagaaga acctttacta catgtgctga caaccaaacc
1321 accgttcaat tcccagtcta ccaaggtgaa cgtgttaact gtaaagaaaa cactttgttg
1381 ggtgaattcg acttgaagaa catcccaatg atgccagctg gtgaaccagt cttggaagct
1441 atcttcgaag ttgatgctaa cggtatcttg aaggttactg ccgtcgaaaa gtctaccggt
1501 aagtcttcta acatcactat ctctaacgct gttggtagat tgtcttctga agaaattgaa
1561 aagatggtta accaagctga agagttcaag gctgccgatg aagcttttgc caagaagcac
1621 gaagctagac aaagattgga atcctacgtt gcctccatcg aacaaactgt cactgaccca
1681 gtcttgtctt ctaaattgaa gagaggttcc aagtccaaga ttgaagctgc tttgtccgat
1741 gctttggctg ctttgcaaat cgaagaccca tctgctgatg aattgagaaa ggctgaagtt
1801 ggtttgaaga gagttgtcac caaggccatg tcttctcgtt aa
//
LOCUS ATHGPA1 2030 bp ss-mRNA PLN 14-AUG-1990
DEFINITION A.thaliana G protein alpha subunit 1 (GP-alpha-1) mRNA, complete
cds.
ACCESSION M32887
KEYWORDS G protein.
SOURCE A.thaliana (strain Landsberger) flow DNA and, cDNA to mRNA.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 2030)
AUTHORS Ma,H., Yanofsky,M.F. and Meyerowitz,E.M.
TITLE Molecular cloning and characterization of GPA1, a G protein alpha
subunit gene from Arabidopsis thaliana
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3821-3825 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.M.Meyerowitz, 16-MAR-1990, for release after publication.
FEATURES from to/span description
pept 600 1751 G protein alpha subunit 1 (GP-alpha-1)
pept 490 552 ORF1
pept 568 591 ORF2
mRNA 401 1885 GP-alpha-1 mRNA
mRNA 401 1921 GP-alpha-1 mRNA (alt.)
signal 312 318 pot. TATA box
rpt 17 36 inverted repeat 1A
rpt 188 207 inverted repeat 1B
rpt 323 339 inverted repeat 2A
rpt 341 357 inverted repeat 2B
BASE COUNT 631 a 346 c 413 g 640 t
ORIGIN Chromosome 2 near erecta.
1 gttaacttaa tagtatataa aataaaaatg catataggtt ccgtaattaa tctcttatcg
61 tcacgagagg cacatctttt tccaacattt gaccactctc tctctctctc tcaggacctt
121 tcggcgtaat ttcgtcttcc cctttgctta acattttctt tctttctttt tgaccaaata
181 ttaaaaatat atccattttt attttatttt taattaaatt cataatttgc atttgtaata
241 gaaaaaaaaa gaagagaata aatccaaaag agtgaagcaa aaacattaaa gcggaaagaa
301 agtggtaaaa caataataga aacaggagaa gcagaagtac tacttcttct tcttctgctc
361 tcttctcaga ccttgttttg tactttcttc ttcttcttct ttcttcttct tgtttgcgaa
421 ctccgatatc ttcttcacta cctttgactc catttctttt tcttcttcag gtgtaggcat
481 tgtcttgtta tgagaagcaa ctgtagctgg aagctcaagt atttgttttt agctgtggag
541 cttgaatctt gatagttttc gacttctatg ttattacctg tggggatata gaaacaatca
601 tgggcttact ctgcagtaga agtcgacatc atactgaaga tactgatgag aatacacagg
661 ctgctgaaat cgaaagacgg atagagcaag aagcaaaggc tgaaaagcat attcggaagc
721 ttttgctact tggtgctggg gaatctggaa aatctacaat ttttaagcag ataaaacttc
781 tattccaaac gggatttgat gaaggagaac taaagagcta tgttccagtc attcatgcca
841 atgtctatca gactataaaa ttattgcatg atggaacaaa ggagtttgct caaaatgaaa
901 cagattctgc taaatatatg ttatcttctg aaagtattgc aattggggag aaactatctg
961 agattggtgg taggttagac tatccacgtc ttaccaagga catcgctgag ggaatagaaa
1021 cactatggaa ggatcctgca atccaggaaa cttgtgctcg tggtaatgag cttcaggttc
1081 ctgattgtac gaaatatctg atggagaact tgaagagact atcagatata aattatattc
1141 caactaagga ggatgtactt tatgcaagag ttcgcacaac tggtgtcgtg gaaatacagt
1201 tcagccctgt gggagagaat aaaaaaagtg gtgaagtgta ccgattgttt gacgtgggtg
1261 gacagagaaa tgagaggagg aaatggattc atctgtttga aggtgtaaca gctgtgatat
1321 tttgtgctgc catcagcgag tacgaccaaa cgctctttga ggacgagcag aaaaacagga
1381 tgatggagac caaggaatta ttcgactggg tcctgaaaca accctgtttt gagaaaacat
1441 ccttcatgct gttcttgaac aagttcgaca tatttgagaa gaaagttctt gacgttccgt
1501 tgaacgtttg cgagtggttc agagattacc aaccagtttc aagtgggaaa caagagattg
1561 agcatgcata cgagtttgtg aagaagaagt ttgaggagtt atattaccag aacacggcgc
1621 cggatagagt ggacagggta ttcaaaatct acaggacgac ggctttggac cagaagcttg
1681 taaagaaaac gttcaagctc gtagatgaga cactaagaag gagaaattta ctggaggctg
1741 gccttttatg accttattat tacatatctc tagtaaatta cctctcctta ttattataag
1801 aaaaactcga aaactgaatg accgtgtaat ttatctttcg ggacaaaaga cttagcgatt
1861 caaaatctaa tgtgtctcga tggctacgac tagtttctat tttatcattg tttttgttaa
1921 cattcctctg tctttgactt cttatttttt ttctcatcaa aaacatctca ttttgatctt
1981 gtttttgggg ttatattatt attaaaatga ggcatccaca tcccgaaatc
//
LOCUS BLYABA 998 bp ss-mRNA PLN 14-AUG-1990
DEFINITION Barley abscisic acid (ABA) mRNA, complete cds.
ACCESSION M36000
KEYWORDS abscisic acid.
SOURCE Barley (H.vulgare L. cv. Himalaya) seed, cDNA to mRNA, clone pHV
A1-1.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 998)
AUTHORS Hong,L., Uknes,S.J. and Ho,T.-h.D.
TITLE Cloning and characterization of a cDNA encoding a mRNA rapidly-
induced by ABA in barley aleurone layers
JOURNAL Plant Mol. Biol. 11, 495-506 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 120 761 abscisic acid (ABA)
mRNA < 1 998 ABA mRNA
BASE COUNT 249 a 284 c 323 g 142 t
ORIGIN
1 gtgccggtag taaatcatga gcatctcttg cgactcgaaa cgtagtacag caacagccta
61 aagcgagtcc gagtggtgat tccagttcgt gtttgtttga gctagatcgt gagacgaaga
121 tggcctccaa ccagaaccag gggagctacc acgccggcga gaccaaggcc cgcaccgagg
181 agaagaccgg gcagatgatg ggcgccacca agcagaaggc ggggcagacc accgaggcca
241 ccaagcagaa ggccggcgag acggccgagg ccaccaagca gaagaccggc gagacggccg
301 aggccgccaa gcagaaggcc gccgaggcca aggacaagac ggcgcagacg gcgcaggcgg
361 ccaaggacaa gacgtacgag acggcgcagg cggccaagga gcgcgccgcc cagggcaagg
421 accagaccgg cagcgccctc ggcgagaaga cggaggcggc caagcagaag gccgccgaga
481 cgacggaggc ggccaagcag aaggccgccg aggcaaccga ggcggccaag cagaaggcgt
541 ccgacacggc gcagtacacc aaggagtccg cggtggccgg caaggacaag accggcagcg
601 tcctccagca ggccggcgag acggtggtga acgccgtggt gggcgccaag gacgccgtgg
661 caaacacgct gggcatggga ggggacaaca ccagcgccac caaggacgcc accaccggcg
721 ccaccgtcaa ggacaccacc accaccacca ggaatcacta gacgcatgcg ttcgcgctta
781 atttccgttc ctttagtcgt gtttggtcgt tcgagggcct tctacatatt tcatatttgt
841 atgtttccac tctttcatga tttccgctca tttagtgtaa agtttgcctc cgatttgatg
901 tactcgtctc tggttctgta atgagttata atccatgggc tttggtgtaa atggataacg
961 aggacactcg aaggcggcaa taaagttgta tgtgatcg
//
LOCUS MVARRNAG 1466 bp ds-DNA BCT 14-AUG-1990
DEFINITION M.vannielii 16S ribosomal RNA gene.
ACCESSION M36507
KEYWORDS 16S rRNA; small subunit ribosomal RNA.
SOURCE M.vannielii DNA.
ORGANISM Methanococcus vannielii
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanococcales; Methanococcaceae.
REFERENCE 1 (bases 1 to 1466)
AUTHORS Jarsch,M. and Boeck,A.
TITLE Sequence of the 16S ribosomal RNA gene from Methanococcus
vannielii: Evolutionary implications
JOURNAL Syst. Appl. Microbiol. 6, 54-59 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 1466 16S ribosomal RNA
BASE COUNT 335 a 372 c 473 g 286 t
ORIGIN
1 attccggttg atcccgccgg aggctactgc tattggggtt cgactaagcc atgcgagtct
61 atggtttcgg ccatggcgga cggctcatta acacgtggtt aacttaacct caggtggagc
121 ataaccttgg gaaactgagg ataattctcc ataagaaaag cagtctggaa cgattctttt
181 ctgaaagcat atgcgcccga ggataggact gcgctcgatt aggtagttgg tggggtaatg
241 gcccaccaag cctacgatcg atacgggcct tgagagaggg agcccggaga tggggactga
301 gacacggccc caggccctac ggggcgcagc aggcgcgaaa cctccgcaat gcacgaaagt
361 gcgacggggg gaccccaagt gctcatgcac agcatgggct tttatcaagt gtaaacagct
421 tgaggaataa gggctgggca agttcggtgc cagcagccgc ggtaataccg acggcccgag
481 tggtagccac tcttattggg cctaaagcgt ccgtagccgg tccagtaagt ccctgtttaa
541 attctctggc ttaaccagag gactggcagg gatactgctg gacttgggac cgggagagga
601 caagggtact ccaggggtag cggtgaaatg tgttgatcct tggaggacca cctatggcga
661 aggcacttgt ctggaacggg tccgacggtg agggacgaaa gccaggggcg cgaaccggat
721 tagatacccg ggtagtcctg gccgtaaact ctgcgaacta ggtgtcacct gggcctcgag
781 cccaggtggt gccgaaggga agccgttaag ttcgccgcct ggggagtacg gtcgcaagac
841 tgaaacttaa aggaattggc gggggagcac cacaacgggt ggagcctgcg gtttaattgg
901 attcaacgcc gggcatctca ccacgagcga cagcatgatg acggccaggt tgacgacctt
961 gcctgaagcg ctgagaggtg gtgcatggcc atcgtcagct cgtaccgcga ggcgtcctgt
1021 taagtcaggt aacgagcgag acccgtgccc tatgttgcga ctactttctc cggaaggtaa
1081 gcactcatag gggaccgcta gcgctaagct agaggaagga gcgggcaacg ataggtccgc
1141 atgccccgaa tctcctgggc tacacgcggg ctacaatggc taggacaatg ggctgctacc
1201 ctgaaaaggg acgcgaatct ccgaaaccta gtcgtagttc ggatcgtggg ctgtaactcg
1261 cccacgtgaa gctggaatcc gtagtaatcg cagttcataa tactgcggtg aatgtgtccc
1321 tgctccttgc acacaccgcc cgtcacacca cccgagttgg gttcaggtga ggccttggcc
1381 tttggctagg gtcgaacctg ggctcagcga ggggggtgaa gtcgtaacaa ggtagccgta
1441 ggggaacctg cggctggatc acctcc
//
LOCUS TOM2A11 526 bp ss-mRNA PLN 14-AUG-1990
DEFINITION Tomato fruit-specific protein mRNA, complete cds.
ACCESSION M21775 X13741
KEYWORDS fruit-specific protein.
SOURCE Tomato (strain UC82B) red fruit stage, cDNA to mRNA, clone 2A11.
ORGANISM Lycopersicon esculentum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 526)
AUTHORS Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M.
TITLE Isolation and characterization of a fruit-specific cDNA and the
corresponding genomic clone from tomato
JOURNAL Plant Mol. Biol. 13, 639-651 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.E.Rose, 09-DEC-1989, for release after publication.
FEATURES from to/span description
pept 34 324 fruit-specific protein
BASE COUNT 163 a 97 c 92 g 174 t
ORIGIN
1 tgctcatcaa ttagcaatta atccaaaacc attatggctg ccaaaaattc agagatgaag
61 tttgctatct tcttcgttgt tcttttgacg accactttag ttgatatgtc tggaatttcg
121 aaaatgcaag tgatggctct tcgagacata cccccacaag aaacattgct gaaaatgaag
181 ctacttccca caaatatttt gggactttgt aacgaacctt gcagctcaaa ctctgattgc
241 atcggaatta ccctttgcca attttgtaag gagaagacgg accagtatgg tttaacatac
301 cgtacatgca acctgttgcc ttgaacaata tcaatgatct atcgatcgat ctatctatct
361 atttatctgt ctctgcgcgt atagtgttgt ctgtaccttt ggtgtgaaga atatgaataa
421 agggatacat atatctagat atattctagg taatgtccta ttgtatttaa aatttgtagc
481 aatgattgtt tgaataaaaa cataccatga gtgaaataat tattcc
//
LOCUS TOM3H11 774 bp ss-mRNA PLN 14-AUG-1990
DEFINITION Tomato fruit-specific protein mRNA, complete cds.
ACCESSION M21776 X13742
KEYWORDS fruit-specific protein.
SOURCE Tomato (strain UC82B) red fruit stage, cDNA to mRNA, clone 3H11.
ORGANISM Lycopersicon esculentum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 774)
AUTHORS Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.E.Rose, 09-DEC-1989, for release after publication.
Author address: R.E.Rosen,
Calgene Inc.
1920 Fifth St.
Davis, CA 95616
FEATURES from to/span description
pept 214 504 fruit-specific protein
BASE COUNT 247 a 143 c 119 g 265 t
ORIGIN
1 tttttttgag caaagggcaa ctcagatatc caaagatgaa tccaacatat agcttacagc
61 tgggagaaca ttgtctaact cttctgaaat ttaaatgtta tccagaatcc ttcatcataa
121 aataatatca aaatgcaaat ctattttttc tactcttgtc tagcttcaac tttcttcttc
181 tgctcatcaa ttagcaatta atccaaaacc attatggctg ccaaaaattc agagatgaag
241 tttgctatct tcttcgttgt tcttttgacg accactttag ttgatatgtc tggaatttcg
301 aaaatgcaag tgatggctct tcgagacata cccccacaag aaacattgct gaaaatgaag
361 ctacttccca caaatatttt gggactttgt aacgaacctt gcagctcaaa ctctgattgc
421 atcggaatta ccctttgcca attttgtaag gagaagacgg accagtatgg tttaacatac
481 cgtacatgca acctgttgcc ttgaacaata tcaatgatct atcgatcgat ctatctatct
541 atttatctgt ctctgcgcgt atagtgttgt ctgtaccttt ggtgtgaaga atatgaataa
601 agggatacat atatctagat atattctagg taatgtccta ttgtatttaa aatttgtagc
661 aatgattgtt tgaataaaaa cataccatga gtgaaataat tattccacat taattcacgt
721 atttatttca cttatgatac gtatttttgt tcctttcgcg taaaaaaaaa aaaa
//
LOCUS TOMFSPG 4654 bp ds-DNA PLN 14-AUG-1990
DEFINITION Tomato fruit-specific protein gene, complete cds.
ACCESSION M21777 X13743
KEYWORDS fruit-specific protein.
SOURCE Tomato (strain VFNT Cherry) DNA.
ORGANISM Lycopersicon esculentum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae.
REFERENCE 1 (bases 1 to 4654)
AUTHORS Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M.
TITLE Isolation and characterization of a fruit-specific cDNA and the
corresponding genomic clone from tomato
JOURNAL Plant Mol. Biol. 13, 639-651 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.E.Rose, 09-DEC-1989, for release after publication.
FEATURES from to/span description
pept 1379 1445 fruit-specific protein, exon 1
2160 2383 fruit-specific protein, exon 2
pre-msg 1326 2640 mRNA and introns
IVS 1446 2159 fruit-specific protein intron
signal 1064 1067 TATA-box (put.)
signal 1135 1138 TATA-box (put.)
signal 1190 1193 TATA-box (put.)
signal 1291 1297 TATA box (put.)
signal 2640 2640 polyA signal
BASE COUNT 1709 a 723 c 642 g 1580 t
ORIGIN
1 ctcgagccct ttaaaaagta tagtcaatat ttacggtgac cgtgaatttc ttaattatga
61 tatataattt aaaagaaatc atgatcacat tctactgatg agaacatgtg ctaatcaagg
121 gaaaacatgg atgtgaaaaa tactttttgt taaaagtaaa aaaaaatgtg aaattttgtt
181 agttatttac tacctataca ttatttgagc atgtgcaaac tttacaaata cctaatagaa
241 gattttcacc tgcctgtata tatgtaaatt aattataatg aacactctca cataaaataa
301 ttatcagtat atacattaat acttgccctc cacaatgaat taaataaaat gtagaacatg
361 atctacactt caataaaact aagaccataa agaataattt caaaatatac acatgtcaac
421 aataaattat ttgcatatta tattaactta ctaaacaatc tttacttttg aaatataaaa
481 ataatcaagt tataagtctg ctcaaagtaa agcacttgtt agactcatct gattttgaga
541 aggtaagcaa attgatggtg cataatagtc acaagtaaaa tataaaatag atttcattag
601 taaaattgtt ttttactttc tttatatata attatcaata tccttcaatg gtaggttaat
661 tatattgtta acttcttgtt gaattaaagc aataagacaa gaatattaaa gataaaagaa
721 caataaaaat agaaagacta agagataaga gttttcttat tcttctttca ataagtatca
781 tcaagtgtat acaatataaa tttttgtatt tttgatctat ctatttataa tgttatatat
841 aagcatacaa aagatcagtc ataaatatga ctttaatcat gaaaataatg aaagagatta
901 tgaaggcgta aggttactag aataatagtc attaaaaaaa ggggttatct ttataattga
961 ataattgatg aagtaatgga gataattagt gagcataaat ttttttaaaa aaatggacat
1021 ttacactata atattttata acactttccc ttaaacatct aggtataaat aatgagtctt
1081 gtcaaaatct tagtaggaaa aattctgtga aattttttta gtgaaaacaa atgatataaa
1141 tatcttgaat actcattatt tgttgtctca ttaaaaatct tatctgacct ataaaataaa
1201 ttatttgctc aactcaaaat agtttttcat tctaaaatta gtataattat tagtgaatat
1261 ttaattaaca taattgtata ctaaggggcc tataaattgg attcttctca aagaaaaata
1321 aaatcaccac acaactttct tcttctgctc atcaattagc aattaatcca aaaccattat
1381 ggctgccaaa aattcagaga tgaagtttgc tatcttcttc gttgttcttt tgacgaccac
1441 tttaggttca caacacttct cccttatttt gttttcttaa tttcttggaa gtcatatgca
1501 tgtgtttggt atcatggtat atatataaag gaaaatattt ttcttaatta ctggttttct
1561 aatgtttggt aggtaatcgg aaattattat gagataatga acttgcaaag tcattattat
1621 ataacttttt ttttatactt tgatttaaga attcattttt ctcattttat ataaacttat
1681 ttttcaacag aaaatatttt tcgaactatt caaacacacc ctaagacatt acatatatat
1741 atatatacac cctccgtttt atattactta atgcctattg agttggccca ccctttaaga
1801 atgattcaat tagagatatg ttttactaaa ttaacctatg ctttaagact ctaaatttgg
1861 ctattactat tttacgttgt aatttaatga caaacatttc ataatgacta tagtctgaac
1921 ttaattagac agacgtatct atagtttgct tactaatgat tcatagctat atatttggag
1981 aggagagaga caaacgatat taagaaaggg aggagagagg cgaggtaaat ctgaaataga
2041 gaagagaaag gcaaccaatt ttgatcatct atcatacttt tgattattat ttttattata
2101 tgtacgttta cattacagtt ttcgaattct tacattaatc ttaatcataa tatatacagt
2161 tgatatgtct ggaatttcga aaatgcaagt gatggctctt cgagacatac ccccacaaga
2221 aacattgctg aaaatgaagc tacttcccac aaatattttg ggactttgta acgaaccttg
2281 cagctcaaac tctgattgca tcggaattac cctttgccaa ttttgtaagg agaagacgga
2341 ccagtatggt ttaacatacc gtacatgcaa cctgttgcct tgaacaatat caatgatcta
2401 tcgatcgatc tatctatcta tttatctgtc tctgcgcgta tagtgttgtc tgtacctttg
2461 gtgtgaagaa tatgaataaa gggatacata tatctagata tattctaggt aatgtcctat
2521 tgtatttaaa atttgtagca atgattgttt gaataaaaac ataccatgag tgaaataatt
2581 attccacatt aattcacgta tttatttcac ttatgatacg tatttttgtt cctttcgcgt
2641 agatttttga tccttttccc ttttgaatat taaacattaa acacaaataa tgtttattaa
2701 attaagttaa tatttttatt tagctattta tatttttatt tgaaatcaaa cttgataaat
2761 atttataaag ataattaaca agtaatgtga cactaacacc atgtaatatt atcttgtcgt
2821 tatttatgat aatattttaa aattataatt tcagttaaaa aattattaaa aaaacatact
2881 tttaaaaagt gagttagcct ccgctaccca catacttatg aattggacta gttgtttttt
2941 gacccacaaa aagaatgggc taattaaacc tgacctatca aatttcagaa tctgcataga
3001 ttagtccgaa cgaaatgagt cagcccgtat tgaacaaaat atcaacaagg acgttatgta
3061 aagatgttta agaaggaaaa aagatttcta atacatatgg actttcaata tcccaacttt
3121 gtctggcgat ctgaaccctg cttagtttgt tgatcattaa cttgtcttgc tatgtattta
3181 agatttaaac tttatatgtt taaacttaca gaaaatacat ataaatctct caagacttgg
3241 caacataatt tactttagta cttaaactac atgaaaattt aaatatcctt ttaacatctt
3301 tgaagtgaat taaattatca caatccgagc ctacaccttg gacgtggccg gcactcaaga
3361 accagtgctg gtccccaagc taaccctcat cctgactgac tacaagcgga aggctaactt
3421 aagtatacaa aagcttaaaa ctgaataaaa taaactttac aaggttttaa cacaaatgaa
3481 caactttgaa gaaaataata tattcaacta gccataaaat agacaacttt agtctttaaa
3541 acatttaata aaataaatgc aaaatataga ctccttaact aaactgacta tctatggagc
3601 ctctaattga taaagatgga agtcgggaca agaccacgac atcctgacta aactgagaag
3661 taaataaaat cccccggaaa aaaaggagcc tcaccatggc taactcgaac tcggggatat
3721 atcaatgaag ctcctgttga tgatcttgaa gacatgtctc tgcatcatca aaagatgcag
3781 gccaaatggc tcagtacgta aaatgtacga gtatgtaagg gaaattctaa agtataacat
3841 aagcttgata cttgaataaa aggaaacata cttacctctt ttcaactcaa ctcaaattaa
3901 gaataagata ctcaactcaa agattaggta ttcaacgcaa atatggcact ctactcaatg
3961 aagtacaaat taactcagga tactcgactt aagatactca actcccgaca ctcaactgaa
4021 ctcatttcaa tataaagcag cttaaaacaa gttcagtata aagtaaagtt gtttaaaaac
4081 atgatgtcaa ctctgtgtgt ataataagga tacaacataa ctttgaaatg tatataaaaa
4141 tacaattaac tgatgtatat aaaaatacat taatctatgg gagattctct aaccgacaac
4201 catcacttaa gggctaagat gatgatatag cgatctaccg cacgctgcca tcgcatctta
4261 tacccggcca aaggtataag acctgaactg cctaatgaat ccactaataa actgttaaaa
4321 ggaatcatct aaaaagtatg acccttttct acccatagtg gctaacatgg tttatggggg
4381 ctgtgagtta tctgaactct cccccatatc ggtgctcaat actactccaa aaaatatact
4441 gctcttatgt ttaaaaacat actgattctg tggtttgaaa ttattgctta aagcttagat
4501 ttttgaaaag ctctcttttg aaaatcgtag tttccttttt cttctattaa agctagacat
4561 aggctatgta gaactctagc ttaccttcct tctcaaaagt ttgaaaacat ttgcttagat
4621 tcttagggac tacttagttc ccttgttgga attc
//
LOCUS ACLP322P 1337 bp ds-DNA SYN 14-AUG-1990
DEFINITION Synthetic plasmid pWH1266 origin of replication (ori) region.
ACCESSION M36473
KEYWORDS .
SOURCE A.calcoaceticus lwoffi plasmid and pBR322 DNA, clone pWH1266.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 1337)
AUTHORS Hunger,M., Schmucker,R., Kishan,V. and Hillen,W.
TITLE Analysis and nucleotide sequence of an origin of DNA replication in
Acinetobacter calcoaceticus and its use for Escherichia coli
shuttle plasmids
JOURNAL Gene 87, 45-51 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
orgrpl 310 337 origin of replication
BASE COUNT 447 a 229 c 251 g 410 t
ORIGIN
1 gatcgtagaa atatctatga ttatcttgaa gaacgcaacc ctatagcagc tattgaaatt
61 gatgatttaa ttgaagaaaa gacagattta gttgttgata atcgactgat ggggcgcaca
121 ggcagacaga aagatactag ggagttagtg atacatccgc attatgtggt tgtatatgac
181 atcactgata taatacggat actcagagtg ctacacacat cgcaggagtg gtcatgactt
241 actcatgtac tttggattat ttagtgttat aaaatcctga tttataaatt ttttttgtta
301 aaaaagataa aagccccttg caattgcttg gggctttacc gtaatttatg gggtacagat
361 cttcgatact gacatatcgg caatcgaaag cattaaggtt tgacgaccgc taatgatttc
421 accacagggg cttaatgtac ctgtcttaaa ttctaaggtt ttaactcgct ttgtcaagca
481 tagaccccaa aaatttagcc aatgtctgta actcaatctg tccatgtgtg ggtgatgagg
541 tacagtgacg ctagcacaca tcggaaaaac gctattacta ggggaactga acagagtagc
601 ggacgcaatg agtagtcatt taattggcgg ttatgagcgt gttcaggcgg tgctatcaat
661 cgtaatcata acagtggcag cttgatacag tgatgtcatc cctgatgcga aagcgaccga
721 ccgacggtac atcgaatggg aatactttag ggtgattttt aagaatcgct ctagggtgag
781 tatttcccat tcagctctgc tccctccctc tggtacttta atcaaaagca ctactaaaca
841 tatgttttta aataaaaaat attgatatag agataatatt agtaagaata attaaacaat
901 tgaatataga taaatcattg ttaaataaag attaattatt aaaatgaatg tatacttata
961 tataaatcaa tgatttaaaa tatttgataa agaaaacttt tcaaaaaaaa tataattgag
1021 attgtgtcat ttcggtcaat tcttaatatg ttccacgcaa gttttagcta tggtgctaaa
1081 cagaaatttg ctgaaaaaga acttttcact gaactggtta aaatgtaagc agcctgagag
1141 ccgccaaaaa ttttaaaaac aaaccgcctt aatcatcttc aaaaaatacc tctaaaacct
1201 caccatttgc gttttaagac ccatatttca tcctgccctt atgttcccat gctgatagct
1261 ataaagtgtc tgtaatcgct tcctatgacg ttctaggctg ttgataactt ttggaacaac
1321 gcaaaatgtt aaaatcc
//
LOCUS ATHUXP1A 1078 bp ds-DNA PLN 14-AUG-1990
DEFINITION A.thaliana ubiquitin extension protein (UBQ1) gene, complete cds.
ACCESSION J05507
KEYWORDS gene UBQ1; ubiquitin extension protein.
SOURCE A.thaliana (variety Columbia) DNA.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 1078)
AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D.
TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure,
localization, and expression of their promoters in transgenic
tobacco
JOURNAL J. Biol. Chem. 265, 12486-12493 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Callis, 08-MAY-1990, for release after publication.
FEATURES from to/span description
pept 44 146 ubiquitin extension protein (UBQ1)
234 320 ubiquitin extension protein
406 508 ubiquitin extension protein
888 954 ubiquitin extension protein
1031 1057 ubiquitin extension protein
pre-msg < 1 > 1078 UBQ1 mRNA and introns
IVS 147 233 UBQ1 intron A
IVS 321 405 UBQ1 intron B
IVS 509 887 UBQ1 intron C
IVS 955 1030 UBQ1 intron D
BASE COUNT 295 a 195 c 214 g 374 t
ORIGIN
1 ccttgtttag ggtttctacg tgagagaaga cgaaacacaa aagatgcaga tcttcgtgaa
61 aaccttgacc ggcaagacca tcactctcga ggtcgagagc agcgacacca tcgacaatgt
121 caaggccaag atccaagaca aagaaggtat cattcttcct cactcaatct ggattcttct
181 ctttagcttt ttgaaattca gatctcttat catttacttg tttctccttt aaggaatccc
241 tccggatcag cagagattga tcttcgccgg aaagcagctc gaagatggcc gtactttggc
301 tgactacaac atccagaaag gtacgaaatc atccgaatcc ttctgttgat catttcgatg
361 atctgattgt ataaactcta atggattgtt atcatttgta aacagaatct acacttcatc
421 ttgtgttgag gcttagagga ggtattattg agccttcctt gatgatgctt gctcgtaagt
481 acaatcagga taagatgatt tgccgcaagt aagtttcaat ttctccctaa gattgtagtt
541 ttgttgatta atagttcaaa gtaccaaatg tttattagtg tgttagggtt ttagttgttt
601 taatgggaat taacttagct tctcaagcct tatacaatgc gtgttgaggt tataaatgtt
661 cccaatatca aacatttcca ttctctaaac tttgtcagct ggtcaaattt ctcatctttc
721 cagatgattt tcttgaatgg tacatttttg caaatggttt tgaaaatgtg ttgaagcttg
781 tatttttgat tgctgaaaca tcttgtttgt tggatagttt gtcatctatg gtccgatctg
841 atgtctcaat gagattattc taatgtaaat ttcgcatctt ttggtaggtg ctatgctcgt
901 cttcacccaa gagctgtcaa ctgcaggaag aagaagtgtg gtcacagcaa ccaggtttgt
961 ttttgttact tgtttctttt tggtgtttat cagctgaaat gatatctgaa caaacttttt
1021 ttatttgcag ttgaggccta agaagaagat caagtagaga gactcttatc aagaatcc
//
LOCUS ATHUXP2A 976 bp ds-DNA PLN 14-AUG-1990
DEFINITION A.thaliana ubiquitin extension protein (UBQ2) gene, complete cds.
ACCESSION J05508
KEYWORDS gene UBQ2; ubiquitin extension protein.
SOURCE A.thaliana (variety Columbia) DNA.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 976)
AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D.
TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure,
localization, and expression of their promoters in transgenic
tobacco
JOURNAL J. Biol. Chem. 265, 12486-12493 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Callis, 08-MAY-1990, for release after publication.
FEATURES from to/span description
pept 62 164 ubiquitin extension protein (UBQ2)
243 329 ubiquitin extension protein
433 535 ubiquitin extension protein
771 837 ubiquitin extension protein
929 955 ubiquitin extension protein
pre-msg < 1 > 976 UBQ2 mRNA and introns
IVS 165 242 UBQ2 intron A
IVS 330 432 UBQ2 intron B
IVS 536 770 UBQ2 intron C
IVS 838 928 UBQ2 intron D
BASE COUNT 262 a 164 c 209 g 341 t
ORIGIN
1 cttcattttt tacactgcaa acgccgtttc gagacttccg gcgaagaaga cgaacgcaaa
61 gatgcagatt ttcgtgaaaa cgctaaccgg caagaccatc accctcgagg tcgagagcag
121 tgacaccatc gacaatgtca aggccaagat ccaggacaaa gaaggtatat tttctaatgc
181 tttttaaaaa ttgtaacttt gaaattgtga tctcttagtg attcgttttt gctctgtttc
241 aggaattcct ccggaccagc agaggttgat cttcgccgga aaacagcttg aagatggtcg
301 taccttggct gactacaaca tccagaaagg tctctttctc gaatttgaat gttttggatt
361 ctttagaaaa tgaaactagt atagctatga ttgttttgtt gatccaattc tgttttgttt
421 actataaaac agagtcgact ttgcatcttg ttctgaggct taggggaggt atcattgagc
481 cttcattgat gatgttggct cgtaagtata accaagacaa aatgatttgt cgcaagtaag
541 atctctgatc ttttctcttt gttgttctaa gtgtaccatg tccatgtgtg atgattatag
601 taaaaatggg gaattaactt aggggtgaac tattattcat tttctacgtg gattatgatt
661 ctcttagttt gctgataaat gtattgaaat ctatctagtt ctttcactgt aacactatgg
721 ttgtgtgaaa actgtgtctc atgtttgagt tgttaacctt gttggtttag gtgttatgct
781 cgtctccacc caagagctgt gaactgcagg aagaagaagt gtggccatag caaccaggtt
841 tggttttttg tctctttaga ttttaaaatt ccacagaatt tgtttgattt gttagttgtt
901 aagtgtgatt ttctgtttat ttttgcagtt gaggccaaag aagaagatca agtgatcttt
961 tgttcaccaa aagctt
//
LOCUS ATHUXP5A 720 bp ds-DNA PLN 14-AUG-1990
DEFINITION A.thaliana ubiquitin extension protein (UBQ5) gene, complete cds.
ACCESSION J05539
KEYWORDS gene UBQ5; ubiquitin extension protein.
SOURCE A.thaliana (variety Columbia) DNA.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 720)
AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D.
TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure,
localization, and expression of their promoters in transgenic
tobacco
JOURNAL J. Biol. Chem. 265, 12486-12493 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Callis, 08-MAY-1990, for release after publication.
FEATURES from to/span description
pept 112 585 ubiquitin extension protein (UBQ5)
site 339 340 ubiquitin-extension protein junction
BASE COUNT 209 a 159 c 164 g 188 t
ORIGIN
1 aagcccaaat aaaataaggg tagtaacgta attgagctaa gaaaccctag aggtctcctt
61 cgcctacaaa atccccattt tgataatctt cagcagccgt tgcctcaaaa gatgcagatc
121 ttcgtgaaaa ccctaacggg gaagaccata acccttgagg ttgaatcatc cgacaccatc
181 gacaatgtga aggcgaagat ccaagacaag gaaggaatcc caccggacca gcagcgattg
241 attttcgccg ggaaacagct tgaagacggc cgtaccctcg ccgactacaa catccagaag
301 gaatcgacgc ttcatctcgt cctccgtctc cgtggtggtg ctaagaagag gaagaagaag
361 acttacacca agccgaagaa gatcaagcac aagcataaga aggttaagct cgctgttctt
421 cagttttaca aaatcgatgg atctggaaag gttcagcgtt tgaggaagga gtgccctaac
481 gcaacctgtg gagctgggac tttcatggcg agtcattttg atcgccatta ctgtggtaag
541 tgtggactca cctacgttta ccagaaagaa ggagttgaag cttgatctgt tgtagcggta
601 gatcgatcct tctctttctc tcttatcttt taaaaactgc atttctattt gggaattttg
661 taagctcttt aatttgagtt atcatggatt ctatgttgaa catctttcgt tctggattat
//
LOCUS ATHUXP6A 681 bp ds-DNA PLN 14-AUG-1990
DEFINITION A.thaliana ubiquitin extension protein (UBQ6) gene, complete cds.
ACCESSION J05540
KEYWORDS gene UBQ6; ubiquitin extension protein.
SOURCE A.thaliana (variety Columbia) DNA.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 681)
AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D.
TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure,
localization, and expression of their promoters in transgenic
tobacco
JOURNAL J. Biol. Chem. 265, 12486-12493 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Callis, 08-MAY-1990, for release after publication.
FEATURES from to/span description
pept 102 575 ubiquitin extension protein (UBQ6)
site 329 330 ubiquitin-extension protein junction
BASE COUNT 191 a 154 c 153 g 183 t
ORIGIN
1 gagctccaaa accctagttt ctcaaccact actcttttat ttcttctcac cacttaaaga
61 gtttccccag aaattttctt ccgccgtaaa agcaaaaaaa gatgcagatc ttcgtgaaaa
121 ccctaacggg gaagacgatc actctcgagg tcgagtcctc tgacaccatc gacaatgtca
181 aggccaagat ccaagacaag gaaggaatcc caccggacca gcagcgattg attttcgccg
241 gaaagcagct cgaagacgga cgtaccttag ccgattacaa catccagaag gaatcaacgc
301 ttcaccttgt ccttcgtctc cgtggaggtg ctaagaagag gaagaagaag acctacacca
361 agcctaagaa gatcaagcac aagcacaaga aggtcaagct cgctgttctt cagttctaca
421 aggttgatgg ttcaggtaag gttcagcgtt tgaggaagga gtgccctaac gcaacctgtg
481 gtgctgggac tttcatggcg agtcatttcg atcgtcacta ctgtggtaag tgtggtctca
541 cctacgttta ccagaaagaa ggagctcagg aatgattttc atctcgatct ctatcatttt
601 gaattgaata ctgctttttt tttggaattt ggaagttgtt tttggatgtt gtggatctta
661 tgttgaactt gtttgaattt c
//
LOCUS BLYG1HORDA 1614 bp ds-DNA PLN 14-AUG-1990
DEFINITION Barley gamma-1 hordein storage protein gene, complete cds.
ACCESSION M36378
KEYWORDS gamma-hordein; storage protein.
SOURCE Barley (cv. Carina) leaf DNA, clone lambda-hor-gamma-1.
ORGANISM Hordeum vulgare
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 1614)
AUTHORS Cameron-Mill,N. and Brandt,A.
TITLE A gamma-hordein gene
JOURNAL Plant Mol. Biol. 11, 449-461 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 379 1296 gamma-1 hordein precursor
sigp 379 435 gamma-1 hordein signal peptide
matp 436 1293 gamma-1 hordein storage protein
signal 1372 1377 polyA signal
signal 1428 1433 polyA signal
BASE COUNT 516 a 433 c 271 g 394 t
ORIGIN
1 agatccagtt tatccctcac aacttttaaa acatgacaaa gttagtttta gaaggattgg
61 aagatagata cttatgtgag atgtaaagtg aataagatga gtcagcacct attatcaacc
121 tcatcgtgca cttagaattt atgtacaacc acaaagcaac tttgatgatg tgtccaaaat
181 tacctttgta tgtagtgcca ccaaacacaa acattccaaa tgatcagttc gataagcacg
241 aacgcacatt aaaagaggga agaaaccatg aaaaggaacc atgatatgac atctataaag
301 aggcctacat tttgtagatc acataccatc atccatcctt tacattacta agaaaggttt
361 agctaacaca aatccaccat gaagatctta atcatcctta caatccttgc catggcgact
421 accttcgcca cctctgagat gcaagtcaac cctagtgtcc aagtacaacc gacacaacaa
481 caaccatatc ccgagtctca gcaaccattc atcagtcaat cacaacaaca atttccccag
541 ccacaacaac cattccctca gcaaccacaa cagccatttc ctcaatccca gcagcaatgc
601 ctccagcaac cacaacacca atttccccaa cccacacaac aattccccca acggccacta
661 ctaccattta ctcacccgtt cctaacattc cccgaccaac tactccctca gccaccacac
721 caatcattcc cccagccccc acaatcatac cctcagccac cgttacaacc ttttcctcag
781 cccccgcaac aaaaatatcc cgagcaacca caacaaccat ttccttggca acaaccaact
841 attcagctat atctccaaca acaactgaat ccgtgcaagg agtttctctt gcaacagtgc
901 aggcctgtgt cattgttatc gtacatctgg tcaaagatcg tgcaacagag cagttgtcgg
961 gtgatgcagc aacaatgttg cctgcaattg gcacagatcc ctgaacaata caaatgcaca
1021 gcaatcgata gcattgtgca tgccatcttc atgcagcaag gccaaagaca aggcgtgcag
1081 atcgtgcaac aacaacctca gccgcaacag gtgggtcaat gtgtgctcgt ccagggtcag
1141 ggcgtcgtcc aaccgcaaca actagctcaa atggaagcaa tcagaacgtt ggtgttgcag
1201 tccgttccat ccatgtgcaa ctttaatgtc ccacctaact gctccaccat caaagcacca
1261 tttgttggtg tagtcaccgg tgttggagga caatgaaaag atgagagcta taataattag
1321 tacatggatc atcgttgcat aatcgatgga ccaatcgatg taataacgat aaataaagtg
1381 tcgtgcacca tgatgtgtga ttcgaccagg acttgtccaa tcgtgggaat aaaagtcata
1441 aaaagttctt ctaagtgctt catttcttat actttttccc attcattgtg tatgtaaaag
1501 tcgatacaaa actggaatcg taagtctatc cgatctatga tttagtttag acaactagat
1561 taatgttcct tcggtgttag tcgtgcattg tagtaggttc aatctggtgg tacc
//
LOCUS BOLSLSGRA 1286 bp ss-mRNA PLN 14-AUG-1990
DEFINITION B.campestris S-locus-specific glycoprotein mRNA, partial cds.
ACCESSION M36301
KEYWORDS S-glycoprotein; S-locus-specific glycoprotein; glycoprotein.
SOURCE B.campestris, cDNA to mRNA.
ORGANISM Brassica campestris
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 1286)
AUTHORS Takayama,S., Isogai,A., Tsukamoto,C., Ueda,Y., Hinata,K.,
Okazaki,K. and Suzuki,A.
TITLE Sequences of S-glycoproteins, products of the Brassica campestris
self-incompatibility locus
JOURNAL Nature 326, 102-105 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 1257 S-6-glycoprotein
BASE COUNT 345 a 278 c 316 g 347 t
ORIGIN
1 ggatccgtcg tcttgattct attttgtcct gccttttcga tcaacacttt gtcgtctaca
61 gaatctctta gaatctcaag caacagaaca cttgtatctc caggtaataa cttcgaactc
121 ggcttcttcc gaaccaactc aagttctcgt tggtatctcg ggatatggta caagaaattg
181 ctcgacagaa cctatgtatg ggttgccaac agagataacc cactctccaa tgccattgga
241 accctcaaaa tctcaggcaa taatcttgtc ctccttggtc acaccaataa atctgtttgg
301 tcgacgaatc ttactagagg aaatgagaga cttccggtgg tggcagacgt tctctctaat
361 ggaaacttcg tgatgcgaga ctccagtaac aacgacgcaa gtgaatactt gtggcaaagt
421 ttcgattacc ctacggatac tttgcttcca gagatgaaac tgggttacga cctcaaaaca
481 gggttgaaca ggttccttac atcatggaga agttcagatg atccatcaag cggggatttc
541 tcgtacaagc tcgaaacccg aagccttcct gagttttatc tatggcatgg gatctttcca
601 atgcatcgga gtggtccatg gaatggagtc cgatttagtg gcataccaga ggaccaaaag
661 ctgagttaca tggtgtacaa cttcacagag aatagtgaag aggtcgctta tacattccga
721 atgaccaaca acagcatcta ctcgagattg acactaagtt ccgaaggcta ttttcagcga
781 cttacgtgga atccgtcaat agggatatgg acagcgttct ggtcttctcc agtggacccc
841 cagtgcgata catacataat gtgcgggcct tacgcttact gtggcgtgaa cacatcacct
901 gtttgtaact gtatccaagg gttcaatccc cggaatatac agcagtggga tcagagagtc
961 tgggcaggtg ggtgtataag gaggacgcgg cttagctgca gtggagatgg ttttacaagg
1021 atgaagaaca tgaagctgcc agaaactacg atggcgattg tcgaccgcag tattggtgtg
1081 aaagaatgtg agaagaggtg ccttagcgat tgtaattgta ctgcttttgc aaatgcggat
1141 atccggaatg gtgggacggg ttgtgtgatt tggaccggac ggcttgacga tatgcggaat
1201 tacgttgctc acggtcaaga tctttatgtc agattggctg ttgctgacct tgtttagctc
1261 tttctcttaa aataaagcac ggatcc
//
LOCUS BOVFPKPA 1835 bp ss-mRNA MAM 14-AUG-1990
DEFINITION Bovine fructose 6-phosphate, 2-kinase: fructose 2,6-bisphosphatase
mRNA, complete cds.
ACCESSION M34241
KEYWORDS bifunctional enzyme; fructose 2,6-bisphosphatase;
fructose 6-phosphate,2-kinase.
SOURCE Bovine heart, cDNA to mRNA, clones BH[9G,7H].
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1835)
AUTHORS Sakata,J. and Uyeda,K.
TITLE Bovine heart fructose 6-phosphate, 2-kinase/fructose
2,6-bisphosphatase: Complete amino acid sequence and localization
of phosphorylation sites
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4951-4955 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Uyeda, 11-MAY-1990, for release after publication.
FEATURES from to/span description
pept / 30 1622 fructose 6-phosphate,2-kinase:fructose
2,6-bisphosphatase (EC 2.7.1.105 and EC
3.1.3.46)
BASE COUNT 466 a 493 c 493 g 383 t
ORIGIN
1 caaccgacat ctcctgaaga gttgccatgt ccgggaatcc tgcctcttcc tcagaacaga
61 acaacaatag ctatgaaacc aaagcaagtc tccgaatatc agagaagaaa tgttcatggg
121 catcttacat gacaaactct ccaaccctca tcgttatgat tggcttgcca gcccgtggca
181 aaacctacgt gtccaagaaa cttacacgct acctcaactg gattggggtg cccaccaaag
241 tgtttaatct tggggtgtat cggcggcaag cagtcaagtc ctataagtcc tatgacttct
301 tccggcatga caacgaggag gccatgaaga ttcgcaaaca gtgtgccctg gtggcactga
361 aagacgtgaa ggcgtatctc acggaggaga gcgggcagat tgcggtgttt gatgctacca
421 ataccactcg agagaggagg gacttgattt tgaactttgc cgaggagaat tctttcaagg
481 tgttctttgt ggagtccgtg tgcgatgatc ctgatgtcat tgctgccaac atcctggagg
541 taaaggtgtc gagccccgac taccctgaaa ggaacaggga gaatgtgatg gatgacttcc
601 taaagaggat tgagtgctac aaagtcacct atcaacccct tgacccagac agccatgaca
661 aggatctttc tttcatcaag gtgataaacg tgggccagag atttctcgtg aacaaagtcc
721 aggactacat ccagagcaag atcgtctact acctcatgaa tatccacgtc caccctcgca
781 ccatctacct ttgccggcat ggagagagcg agttcaacct cttggggaag attgggggtg
841 actcaggcct ctcagtgcga ggaaaacagt ttgcccaggc tctaaggaag tttctggagg
901 aacaggagat agcagacctc aaagtgtgga cgagccagtt gaaaaggact atccagaccg
961 cagaatctct gggggtgacc tacgagcagt ggaagattct gaacgagatt gatgctggcg
1021 tgtgcgagga gatgacttac gcggagattc aggagcagta tccggatgag tttgcgcttc
1081 gagatgaaga gaaatatctg taccgatatc ctggagggga gtcataccag gacctcgtgc
1141 agcggttgga gccagtcatc atggagctgg agcgccaggg caatgtcctc gttatctccc
1201 accaggctgt catgcgctgc ctcctggcct acttcttgga caagggtgca gatgagctac
1261 catacctgag gtgccctctc cataccatct tcaaacttac tcctgtggcc tacgggtgca
1321 aagtggaaac aattaaactc aatgtggagg ctgtgaacac gcaccgtgac aagccaacta
1381 acaactttcc caagagccaa acccctgtaa ggatgagaag gaacagcttt acgcctctgt
1441 ccagttcgaa tacaatcagg cgtccaagaa attacagtgt tgggagccgg cccctccagc
1501 ccctcagccc gctccgtgcc ctggacacgc aagaaggggc cgaccagccg aagacccaag
1561 cagagacctc gcgggctgca cacaggctcc cgtctccagc gccccccacg tcgccctcct
1621 gatggcgtgg aggctgaggc cagaacgctc ccccggagac tggggtctgc tgaagacctg
1681 ggcgtgtcag ctccactggg gctggagcag gaagtcccgt ggggctcgga cattcggggc
1741 cacccgagtg agcctggaaa agctgccctc catttcttcc cctcatgcct aatgaaaacc
1801 gtgaaaagct cggcttccct gtacctggcc cctcg
//
LOCUS BOVTHYRA 2045 bp ds-DNA MAM 14-AUG-1990
DEFINITION Bovine thyroglobulin gene, 5' end.
ACCESSION M35823
KEYWORDS thyroglobulin.
SOURCE Bovine DNA, clone CBT1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 2045)
AUTHORS Ledent,C., Parnentier,M. and Vassart,G.
TITLE Tissue-specific expression and methylation of thyroglobulin-
chloromyhenicol acetyltransferase fusion gene in transgenic mice
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by C.Ledent, 27-JUN-1990.
FEATURES from to/span description
mRNA 2037 > 2045 thyroglobulin mRNA, exon 1
signal 2006 2011 TATA box
rpt 826 1025 bovine monomeric repeat
rpt 1712 1816 bovine monomeric repeat
BASE COUNT 467 a 508 c 579 g 491 t
ORIGIN Chromosome 14q11-q15.
1 gatccaggag ggagaggtga gagggccaga ggtgaggagc ctgcagagag aatggagacc
61 caagccagga gcgggctcac ggaggccaga ccaaggagtt ggtgctgagt ttccgtagag
121 catcctcctg gctcacgagg catcagatga tggtaggagc tgagtcatcg tgagaaagag
181 gttttcacgg tgtggaggca gcgcctggca tggctctctc caagctcatc gcttcattgc
241 tggagtggag agaagcacct ctccttccag gtacactgtt ttgactcaga aggaagcccg
301 ggcggctctc accctcactg actctgctgt gcctcttccc aggggccagg ccagggagaa
361 tgactcagca cacaagctgg cctgtgcaag accctgcctg catgcatggg tgggggaagc
421 atgaggagtg ggctttctgg tagacctcag tcatggtctc agcattattt tttaaaagga
481 ttttatttat ttgttttgat gtggaccagt tttaaaatct ttattgaatt tgttaccatg
541 ttgcttctgt tttatgtttt ggttttttgg ccacgaggca tgggggatgt tagctccccg
601 accagggact gaacccttgc cccctgcatg gaaaggtcaa gtcttaactt cttgactgcc
661 taggaagtcc cggtttcaac attttagaaa attcatctgc tttagaacac ctggacctgg
721 agagctggca agggctgcac atggccaggc tcacggtaat attacagcgt ctccatagcc
781 tgggcagaac ttggcttgtg tgattggggc tgtgggctcc ccaggtggtg ctagtggtaa
841 agaacccttc tgccagtgca ggagacataa gagatgcggg ttctatccct ggatcaggaa
901 gatcccctgg agaaggaaat ggcaacctgc tccaggattc ttgcctggag aatcccaagg
961 acagaagagc ctggcgggcc gtagtccatg gcgttgcaaa gagtcagaca caaccgaagc
1021 gacttagcac ccaatggggg ctggagaggg ccaagctggg ggattctaat tctccagtaa
1081 atgccagcat gtcttctcaa atcatcttta cctaaaacct aaggctcatt gtgatctccc
1141 tcactcccaa gtgctctccc acacctgccc acctaccttc tgaacaccca tggagcctgt
1201 ctccatcccc caccccttta atgggagaag ccactggaga aagctcactg atgtacaggc
1261 tttatgtgga agacaagctt cctgctgcct tttggttgtc tgacgtcctg ggacagaggg
1321 gaaaggggga tgactacgag tatgactgtg cgtgtgtttg gcttatctca tcaaaatctc
1381 tacattctgt gttaatggat ctgcctgttt tgttccctgc catatcctca tggcctagaa
1441 tagtgtctgc ttctctatca gactctaaag aaacattgct aggagggaag gaaggagcat
1501 ggatgaggag ggagggagca ttgtgtttct ctcacggtgg gcctgaacgt gtggcccacc
1561 aagttgttaa ctttggcctt tacccctgaa gatgaattat gaagccacac ccccagttct
1621 tccttggtgg ctcagatggt caagaatcca cctgcaatgc gggagacctg ggtttgatcc
1681 ctgggttggg aagatcccct ggagaaggga atggctaccc actccagtat tctggcctgg
1741 agaatcccat ggacagagga gcctggcggg atgcagtcca tggggtctca gagagtcaga
1801 tgtgactgag cgactttcac acacattcgt ccctggttct gctcccctac agcctccaca
1861 agattttcac cccacactgg ccacatgagt gtcctccagg ggaacagacg caggtggagg
1921 acctccttgt gaccagcaga gaaaacaggg tgggcactgc ttccctgagt gcctgtgggt
1981 gggggctaag tacccacagc agtgctataa aggctccttg gccagagccc taaggtgggc
2041 agcag
//
LOCUS BRPTUBBA 4571 bp ds-DNA INV 14-AUG-1990
DEFINITION B.pahangi beta-tubulin gene, complete cds.
ACCESSION M36380
KEYWORDS beta-tubulin.
SOURCE B.pahangi DNA, clone pBTY.
ORGANISM Brugia pahangi
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Gnathostomatidae.
REFERENCE 1 (bases 1 to 4571)
AUTHORS Guenette,S., Prichard,R.K., Klein,R.D. and Matlashewski,G.
TITLE Characterization of a beta-tubulin gene and beta-tubulin gene
products of Brugia pahangi
JOURNAL Biochem. Pharmacol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Guenette, 11-JUL-1990.
FEATURES from to/span description
pept 396 452 beta-tubulin, exon 1
891 999 beta-tubulin, exon 2
2115 2341 beta-tubulin, exon 3
2424 2552 beta-tubulin, exon 4
2718 2880 beta-tubulin, exon 5
3002 3192 beta-tubulin, exon 6
3299 3394 beta-tubulin, exon 7
3630 3816 beta-tubulin, exon 8
4047 4234 beta-tubulin, exon 9
pre-msg < 395 > 4400 beta-tubulin mRNA and introns
IVS 453 890 beta-tubulin intron A
IVS 1000 2114 beta-tubulin intron B
IVS 2342 2423 beta-tubulin intron C
IVS 2553 2717 beta-tubulin intron D
IVS 2881 3001 beta-tubulin intron E
IVS 3193 3298 beta-tubulin intron F
IVS 3395 3629 beta-tubulin intron G
IVS 3817 4046 beta-tubulin intron H
signal 66 69 reverse CAAT box
signal 90 95 weak TATA box
signal 4395 4400 polyA signal
BASE COUNT 1278 a 783 c 910 g 1600 t
ORIGIN
1 ggtgtgcaga tagcacgagg aaatgagagg agcagagaag tagggcgcag tcagacagtt
61 ggatgattgc gcctcccgtg tcacttccct atataccttt gcaaggagga catgacaaag
121 ttcgaggtta tttctcgacg cgtctctctg tatactcaca tgtctgtctg tctgtctgtt
181 tgtccgtctg cctgtctgcc ttggttgcta ggaagcccgt cgactttctc cacagtttag
241 tgagtgtttg tgtgatggat aggatggttg gcgcgcggat gggtcggcca aacagttctg
301 tgactgattg tagttgccgg atgacttgat cagtaattac cggattgatt gcaggtctca
361 tttcggtcga caagatttca ttaagtgttt aagctatgag agaaattgtc cacgttcaag
421 ctggtcaatg tggcaaccag attggtgcca aggtattgat tttctgctct ttctctttat
481 cctttaggaa gatgactcat cagagggtat ctttccgcag aaattttccc gagatataat
541 tcccacttcc cctcattgaa attgttttgg tatttaggta tcatcgtcaa aatattccat
601 ttcatttctt aatgtgctat cgtttatcat ctgctcttcg attgtatctt ctcccttctt
661 tgcttcattg attcctaatg caatacataa tacctaccat ctatcacggt taataaaatt
721 cgatgttaga tcacaaagat caagttgtga taatctttcg tataataagt tgttgttttc
781 aacaatctgc agttaggtaa gatgtctgag gagcaagcaa aatgtggatg taggggatta
841 ctgaagttaa aaattgaaaa ttttaaacat aacttgagtt tttttttcag ttctgggaag
901 taatatcgga tgaacatggt gttcaacctg atggtacata caaaggtgat tcagacctgc
961 aaattgaacg aatcaacgtc tactataatg aagcgaatgg ttggtccatt attggagttt
1021 tttttcttct ttgctttgga atgttctttt ttgttttctc tactagttgt tgccactcgt
1081 tttcacacac attatacata gttctacata cacatgcata ccacacatag tcccatatat
1141 aaacaattgt tgtctgacat cgaacattag ttcggcggat gaggttcatc agggcgtcct
1201 caaaataatg tgttcgagaa ttaaatgaag aatgtctgcg tctcttttca aggtccatcg
1261 tggtgaaatc atttgagaca taaggagatg gatggactgt ttaaattaga taaatttaaa
1321 tacaagtttc agaataaccg acataatcaa agatgttttt gttacagaaa tatttgggaa
1381 tgattatagc aagattgtaa aacagatatt ttgacatcta cctgtctttg atcacaacta
1441 ttgtttatga tatttttggg atttttgcaa tgtaatgaca cttattttta gaatatcgta
1501 aacactttct gacctaactt gtgatttgtt cgaagcctgg tagtgaaatt tatttggcgg
1561 ttgtgaaaat acagtaaata accgcactat ataaggaaag aataacattg tcactttaac
1621 caaacaaatc tcaaattgtc gtcacttcaa ccaaaacctt atcaaacagt gtttttgacc
1681 tattctaact tctaacccca gtgtttcttg attgtcttga ttgttattct ataacaccta
1741 caaagaataa gaattccttc aattcaaact tttcacgagt tattcgtctt tgtcagtcta
1801 cactatgagg acttcgtcgt caacttattc caagataaaa aactttcagt gcaacagtct
1861 tctttttttt ctcatctctt cgtaatggaa ttaaaaccaa acaagacgtt cgtcatatgt
1921 tagcttgtca gtaatgtttg taatacatct ttattttcgt ctttttgttt ttcctctctt
1981 cattatgcag tgaaaaggag tatttttctt taggttttct gtaaagatat acttgataat
2041 agattgccat aatgaaccaa gcatcgttgt aaatcttcac ttcgtctcaa gcgatattaa
2101 ttttttgtgt tcagggggca aatatgtgcc acgagcagtc cttgttgatt tggaaccagg
2161 taccatggat tctattcgag gaggtgagtt cgggcaacta ttccgacctg acaattttgt
2221 ttttgggcaa agtggagctg gcaacaactg ggctaaggga cattatacgg aaggtgcgga
2281 actagttgat aatgtgttgg acgtgatacg aaaagaagct gagggatgcg attgtcttca
2341 ggtacggatt gccatagttt tataagacat tttaattgtg gatgtgtttt tattttaaag
2401 aaaaaattaa aaaatatttt cagggatttc aactaacgca ttcacttggt ggtggtaccg
2461 gttccggcat gggaacattg ctgatctcga aaattcgtga ggagtatccg gatcgaatta
2521 tgagctcttt ttcggttgtg ccatcgccca aagtatgtat ttgaatttct gattcttatt
2581 tttaacatct gttatatcta ttttaaacat agctccataa cttttaagtg gcggttgtct
2641 cacaaagtgg tttttcggct atcatttcat tttcatagtt gaagaaaaag tatgctagac
2701 gtattaatga tttccaggta tcagatgttg tgttggaacc ctacaatgca acattatcag
2761 tccaccaact agttgaaaac actgacgaaa ctttctgcat tgataacgag gctttgtatg
2821 acatctgctt ccgaacgttg aagttggcaa atccaactta cggtgacctc aaccatttgg
2881 gttcgtcctt taatttattc ttcgaactgt ccggtttaaa tgcattctat ttaactattt
2941 ataaaattcg gatttaattg taatttattc aactcttcta aaggttaaaa tttctattta
3001 gtgtctgtga caatgtcggg agtaacaact tgcttacgtt tccctggaca gttgaacgcc
3061 gatctccgta aacttgccgt caatatggtg ccattcccac ggttgcattt ctttatgcca
3121 ggatttgctc ctctctctgc tcgtgatgct gctgcttatc gagccctcaa tgttgctgaa
3181 cttactcaac aggtctctgt ttatttttat ggtcagggtc tctttttttg aaagtaatcc
3241 atttataatc gctttaaatg tctcctgttt tatgtttttt aatatcttat ttcttcagat
3301 gtttgatgcc aaaaatatga tggcagcatg tgatccgcgt catggtcgtt acctaaccgt
3361 agctgccatg ttccgaggta gaatgtctat gcgggtaagt gaattttgat aatctccatg
3421 ttaaactcag ccgctgaagc atgaagattt taatccatgt gatgtcggtt tttcttgacc
3481 tgaacttagt ggtgtatcac ttggtcttag ttatgttctt cctatttcca gagtgcttgc
3541 ttaactgcaa ttcttttgtg gaagcttgtt tttgatgtag ggtttttttc ttcatcgctg
3601 tttcaaagtt tttaataagt aggtcttagg aagtagacga gcaaatgatg caagtacaga
3661 ataagaattc atcgtatttc gttgaatgga ttccaaataa cgtaaagaca gctgtttgcg
3721 acattccacc acgtggatta aagatgagcg caacatttat tggaaataca acagctatac
3781 aagaactttt caagcgaatt tccgaacagt ttactggtga acttattaat tccatctcat
3841 ttgagcttag tttgatagct aaaaactaaa tttctggcaa gattgaaaag agatacaatt
3901 gggtatgtgt aggttacatg atttttttgc catattatga gttttatgca aaaggtgaac
3961 atttttaata tgtttgttcc gaatcgtctg aacttgcgtt ctatggaggt tatacttaag
4021 tcgatccgat tttttaatga tttcagccat gttccgacgt aaagcattct tgcattggta
4081 tactggcgaa ggtatggatg aaatggaatt cacggaagcg gagagtaata tgaatgactt
4141 ggtgtccgaa tatcaacaat atcaggatgc gacggctgat gaagaaggtg atcttcagga
4201 aggtgaatcg gaatacattg aacaggaaga gtgagcacaa agtgtcagct ttgtgaaaaa
4261 ggaactttta cagtatttta tttatcgtct tttccatgtt tttatgttgt tacattttat
4321 tttgttactc gctttcgaca tatattagca agttttaaag aattatagcg gcaaatctca
4381 ttgttgtact tctcaataaa ttacaatcaa aagttaatga ttcctaatgc aaagtcgtaa
4441 ggcagaaatc acgtgaaaaa cagtaatcat gattctctga aagttccaat ttgaaataaa
4501 gtactaattt tatgtgggca gtggacggta ttccgtgaca gaagatgcta cagaagattg
4561 cagcagtttt t
//
LOCUS BSPCELALKA 3498 bp ds-DNA BCT 14-AUG-1990
DEFINITION Bacillus sp. alkaline cellulase gene, complete cds.
ACCESSION M27420
KEYWORDS alkaline cellulase; endo-beta-1,4-glucanase.
SOURCE Bacillus sp. (strain KSM-635) DNA, clone pBC100.
ORGANISM Bacillus sp.
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 3498)
AUTHORS Ozaki,K., Shikata,S., Kawai,S., Ito,S. and Okamoto,K.
TITLE Molecular cloning and nucleotide sequence of a gene for alkaline
cellulase from Bacillus sp. KSM-635
JOURNAL J. Gen. Microbiol. 136, 1327-1334 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by S.Ito, 30-AUG-1989, for release after publication.
FEATURES from to/span description
pept 605 3430 alkaline cellulase precursor (EC 3.2.1.4)
sigp 605 691 alkaline cellulase signal peptide
matp 692 3427 alkaline cellulase
binding 530 536 SD1 ribosomal binding site
binding 595 600 SD2 ribosomal binding site
signal 471 476 -35 region
signal 494 499 -10 region
BASE COUNT 1133 a 605 c 704 g 1056 t
ORIGIN
1 ggcacctgag gaagtagcga atcctgttct tgatgcacat ccatacttaa cttctggatt
61 tgccttcatg agtcgtgatg aaaacggaag tgcaccatta catggactgt ttgcatttaa
121 ttattcggca ctgattagct gtggcatttc cgcttctgct ctttctggaa tgaagtacgg
181 ggtcccaaga cttgtcactg ccattgccga tcagttattc caagatgatc gagacgagat
241 tctaaaggac ttctttgagt atgatgagaa ggagtttgtc ggaaactggc ctttaaacgt
301 ctaaatgaac ataatagcga aagggcttaa ccaaaaatat gaattgaacc cacataaatt
361 tgtgggtttt tattaatcaa aaaaatggta aataaaccta ttttaacaat gcttataacc
421 atttttctat ttattgcata aaaaaatcag taaaaaaatt catttatatg tagacgtaaa
481 ttaacaaata ttatattata tatacgaaag cggtttcgaa aaatagagga aggaggagag
541 tttttagttt ttgttgtttg tttattgtaa gcgtttacta ttaatacatt tctgggaggt
601 tattatgaaa ataaagcaaa ttaaacaatc tttatctttg cttttaatca tcacactcat
661 tatgtcacta tttgttccta tggcttcagc aaacacaaat gagtctaagt ctaatgcatt
721 tcctttttct gatgttaaaa aaacttcttg gtcttttcca tatataaagg atttatatga
781 gcaagaagtt attacaggaa catctgcaac aacgttctct ccaacagatt ccgttactcg
841 tgcacaattt acagtgatgc ttacccgtgg tcttggacta gaagcatctt ctaaagatta
901 cccttttaaa gatcgtaaaa actgggctta caaagaaatt caagctgcat atgaagctgg
961 aattgtaact gggaaaacaa acggtgaatt tgcaccaaat gaaaacatta ctcgtgaaca
1021 aatggctgct atggccgtac gtgcttatga atacttagaa aatgagctat ctttaccaga
1081 agagcaaaga gaatataatg actcttcttc tatttcaacc tttgctcaag atgctgttca
1141 aaaagcatac gtattagagc taatggaagg aaatacagat ggatattttc aaccaaaaag
1201 aaactctact agagaacagt ctgctaaagt tatctctact ttactttgga aagtagctag
1261 tcatgattat ttataccata cagaagctgt taaaagccct tcagaagctg gtgcgcttca
1321 gttagtagaa ctaaacggac aattaacact agctggtgaa gatggtactc ccgttcaatt
1381 acgtggaatg agtacacatg gcctacaatg gttcggtgaa atcgtaaacg aaaacgcttt
1441 cgtagcacta tcgaatgatt ggggatctaa catgattcgt ctcgctatgt acattggcga
1501 aaatggatat gcaacaaacc ctgaagtaaa agatttagtt tatgaaggaa ttgaattagc
1561 gtttgagcac gatatgtatg taattgttga ctggcatgta catgctcctg gtgatcctag
1621 agcggatgta tactcaggtg cttatgattt cttcgaagaa attgctgatc attacaaaga
1681 tcatccgaaa aaccattata tcatttggga actagcaaac gaaccaagtc caaataataa
1741 cggtggacct ggattaacaa atgatgaaaa aggttgggaa gctgtaaaag aatatgcaga
1801 gccaatcgtt gaaatgttgc gtgaaaaagg tgacaacatg attttagttg gaaatcctaa
1861 ctggagccaa cgtcctgact tatcagctga caacccaatt gatgcagaaa atatcatgta
1921 ttctgttcac ttctacacag gctcacatgg cgcttctcac attggttacc ctgaaggaac
1981 accaagctct gaacgttcta atgttatggc taacgttcgt tatgctctag acaatggcgt
2041 tgctgtgttt gcgacagagt ggggtacgag tcaagcgaat ggagatggag gaccttattt
2101 tgatgaagct gatgtttggc ttaatttctt aaacaaacat aacattagct gggcaaactg
2161 gtcgttaacg aacaaaaatg agatttctgg agcatttaca ccttttgagc ttggtagaac
2221 agatgctaca gatcttgatc caggtgctaa tcaagtatgg gcacccgagg aactaagttt
2281 atctggtgaa tatgttcgtg ctcgtattaa aggaattgag tatacaccta tcgaccgcac
2341 aaaattcaca aagcttgttt gggattttaa cgatggaaca acacaaggat tccaagttaa
2401 tggagacagc cctaacaaag aaagcattac tttaagtaat aataatgatg cattacaaat
2461 tgaaggatta aatgtaagta atgatatttc tgaaggaaac tactgggata atgtacgcct
2521 gtcagctgat ggctggagtg aaaatgtaga tattttaggt gctacagagc ttacaattga
2581 tgttatcgtt gaagaaccga caacagtttc aattgctgct attccacaag gacctgctgc
2641 tggctgggct aacccgacta gagcaattaa agtaactgaa gacgatttcg aatctttcgg
2701 agatggatac aaagctctcg taactattac ttctgaagat tcaccttcac ttgaaaccat
2761 tgcaactagt cctgaagaca atacaatgag caatatcatt ctatttgtag gtactgaaga
2821 tgcagatgtt atttctttag ataatatcac ggtttctggt actgagattg aaattgaagt
2881 tattcacgat gaaaaaggaa cagcaacact tccttctact tttgaagatg gaactcgcca
2941 aggctgggat tggcatacag aatcaggagt taagacagct cttacaattg aagaagctaa
3001 tggatctaac gctctttcat gggaatatgc gtatcctgaa gtaaaaccaa gtgatggttg
3061 ggctactgct cctcgtctag acttctggaa agacgaacta gttcgtggca caagcgacta
3121 tattagtttt gacttttaca tcgatgcagt tcgtgcttct gaaggtgcta tatcaattaa
3181 cgccgttttc caaccacctg caaacgggta ttggcaagaa gttccaacta catttgaaat
3241 tgatttaaca gagcttgatt ctgcaactgt aacttctgat gagttgtatc attatgaagt
3301 aaaaattaac attagagaca ttgaggctat tacagacgat acagagcttc gtaacttatt
3361 actaatcttt gctgatgaag acagtgattt tgctggtaga gtttttgttg ataatgtaag
3421 atttgaataa tttaaaaaca gtagatagag agactctcta tctacctgtt tattgcttac
3481 tattcgtctt ccactttt
//
LOCUS BSUPHOA3A 743 bp ds-DNA BCT 14-AUG-1990
DEFINITION B.subtilis alkaline phosphatase (phoAIII) gene, complete cds.
ACCESSION M33634
KEYWORDS alkaline phosphatase.
SOURCE B.subtilis (strain 168) DNA.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 743)
AUTHORS Bookstein,C., Edwards,C.W., Kapp,N.V. and Hulett,F.M.
TITLE The Bacillus subtilis 168 alkaline phosphatase III gene: Impact of
a phoAIII mutation on total alkaline phosphatase synthesis
JOURNAL J. Bacteriol. 172, 3730-3737 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.M.Hulett, 06-APR-1990, for release after publication.
FEATURES from to/span description
pept 503 > 743 alkaline phosphatase precursor (phoAIII, EC
3.1.3.1)
sigp 503 598 alkaline phosphatase signal peptide
matp 599 > 743 alkaline phosphatase
BASE COUNT 221 a 149 c 131 g 242 t
ORIGIN
1 gatccgtaaa catggctgct tccattttat ttctgtatgt aaatttggat atttattttt
61 tatccaagtt aacacctctt ccgttctttc tcggtggaac gggtcagcaa tccccgctaa
121 gtactcagaa aaaacgtcca tgttgttccc tcctaaaata aaatgatgcc actccatatg
181 aactgctcag acatgtgctt acacaatacc tcttgccaac gtaatacttt aaattgttca
241 agatgtaagc gggttgcttc tatgcgacta tagcagattt cagtccgcct atccacggcc
301 agtatctatt cctccaattt atttctttag tactatcgct ctttccttct aaaacttctc
361 ataaaagaat aaccattatt taagggtgcc agttcattat tcttgtaaat ccaatcttta
421 aaatcgatta atactagctt aacagtttaa aaatataatt gggttgtcat tgagattcat
481 ctatatttag gaggttatcc agttgaaaaa attcccaaag aaattactgc ctatcgcggt
541 tttatcatca attgcgttca gcagcttagc cagcggcagt gtgcctgaag ccagcgccca
601 ggaaaagaaa aaggggaacc aagacgaaat taaaaatgtt attgttctga ttggtgatgg
661 tatgggtgtg tcttatacgt ctgcttatcg atacttaaag gataataaaa agacaaaagt
721 tgtggagcct acggcttttg atc
//
LOCUS CHTSIGAA 2180 bp ds-DNA BCT 14-AUG-1990
DEFINITION C.trachomatis sigma-A protein (sigA) gene, complete cds.
ACCESSION M36475
KEYWORDS sigma factor; sigma-70 homolog.
SOURCE C.trachomatis (strain MoPn) DNA.
ORGANISM Chlamydia trachomatis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE 1 (bases 1 to 2180)
AUTHORS Engel,J.N. and Ganem,D.
TITLE A polymerase chain reaction-based approach to cloning sigma factors
from eubacteria and its application to the isolation of a sigma -70
homolog from Chlamydia trachomatis
JOURNAL J. Bacteriol. 172, 2447-2455 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 360 2075 sigma-A protein (sigA)
BASE COUNT 679 a 380 c 517 g 603 t 1 others
ORIGIN
1 tctagaggga tttgcccaat gggagcgaga tttattaaca gaatctcgaa cagaacgctc
61 tggcaaaatc gtaattaaaa cgaaagcttt tggagccact ccgttctact attgtgtagt
121 aacgttagat aaaggacctt tggcagaaca tgtattaggg gttttatacc ctgcaaaagc
181 aagttttttt acaaatcttt cctatattta atgataaata gggatgtcgg tgtaggaagt
241 ttttctaggg ccgaatctga tcagctggtt ttttatctaa aaatccctag atttcttgtt
301 ttccttaaga taaactgtca ctttctaggc cgatttttcc ttagttttaa tttgttagca
361 tgcgcatcca tacgctagat agtcaacgtg ctgatgcagc ccaagaagaa gaaatccaaa
421 gaaagttaga agagctagtc actcttgcta aggatcaagg gttcatcacg tatgaagaaa
481 ttaatgaaat tcttccccct tctttcgatt cgccagaaca gatagatcaa gttttaattt
541 ttctggcggg gatggacgtt caagtcttaa accaagcaga cgtagagcgg cagaaagaaa
601 gaaaaaaaga agctaaagag ctagaagggt tggctaagcg ttctgaggga acgcctgatg
661 atccagtgcg tatgtatctg aaggaaatgg gtacagttcc tcttcttaca agagaagagg
721 aggtggagat ttctaaaagg atagaaaaag ctcaggtaca aatagaaaga attattttac
781 gctttcgtta ttcgactaaa gaagcggttt ctattgcgca atacttaatt aatggtaagg
841 aacgatttga taagatcgtt tccgnaaaag aggtggaaga taagacacat tttcttaatc
901 ttttgccgaa gttaatctct ttgctgaaag aagaggactc ttacttagaa gagcggcttt
961 tggctttgaa agatccggca ctgtccaagc aagatcaagc aaagttgaac gatgagcttg
1021 aaaaatgccg tattcgaaca caagcttatc tgagatgttt tcattgtcgt cacaatgtca
1081 cggaagattt tggagaagtt gtttttaaag cttatgactc gttcttacag ttggaacaac
1141 aaatcaatga tttaaaggtt cgtgctgaaa gaaataagtt tgctgcagca aaactagctg
1201 cagcccgacg caagctttac aaacgagaag ttgcagcggg acgtactctt gaagagttta
1261 aaaaagatgt gcggatgttg cagcgttgga tggataaaag ccaggaagca aagcaagaga
1321 tggtggaatc caacttacgt ttggtgatct ctatcgcgaa aaaatatacc aaccgagggc
1381 tgtctttctt ggatttgatt caagaaggaa atatgggctt aatgaaagct gttgaaaaat
1441 ttgagtatcg ccggggttat aaattttcaa cttatgccac ttggtggatt cgtcaggctg
1501 tgacgcgagc tattgccgat caggcaagaa ccattcgaat tcctgttcat atgatcgaga
1561 ccattaataa ggtgcttcgt ggagccaaga aattaatgat ggaaactgga aaagagccta
1621 cgcccgaaga actcggagag gaactaggtt tcactccaga ccgtgttcga gaaatttata
1681 agatcgctca gcatccgatt tctttacagg ctgaggttgg agatagtgga gaaagctctt
1741 ttggagattt cttggaagat acagctgttg aatctccggc agaggcaaca ggctactcca
1801 tgttgaaaga caaaatgaag aaagtgctaa aaacgcttac tgatcgcgaa cgttttgttt
1861 tgatccatcg gtttggcctt ttagatggcc gtcccaaaac tttggaagag gtaggctccg
1921 cgttcaacgt gacacgagag cggattcgac aaatcgaagc caaagctttg cgaaaaatgc
1981 gtcatcctat ccgttccaaa cagctacgag cattcttgga tttattggaa gaagagaaga
2041 ctggttcggg caagattaag agttataaga attaaggatt ccttttcata aggagtatgc
2101 ttgtatcggt tagacgtaac gaactttcgt gtttgggtat ctataggggt ctcagagcag
2161 gaacgctatc acaagcaacc
//
LOCUS CUSRGSBHA 1137 bp ds-DNA PLN 14-AUG-1990
DEFINITION Melon 17S rRNA, 5.8S rRNA, and 25S rRNA gene region.
ACCESSION M36377
KEYWORDS 17S ribosomal RNA; 25S ribosomal RNA; intraribosomal spacer region.
SOURCE C.melo 4-wk old cotyledon DNA, clone pML17.
ORGANISM Cucumis melo
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Violales; Cucurbitaceae.
REFERENCE 1 (bases 1 to 1137)
AUTHORS Kavanagh,T.A. and Timmis,J.N.
TITLE Structure of melon rDNA and nucleotide sequence of the 17-25S
spacer region
JOURNAL Theor. Appl. Genet. 76, 673-680 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA < 1 259 17S rRNA
rRNA 476 638 5.8S rRNA
rRNA 876 > 1137 25S rRNA
BASE COUNT 253 a 333 c 324 g 227 t
ORIGIN
1 atagatcatt gcaattgttg gtcttcaacg aggaattcct agtaagcgcg agtcatcagc
61 tcgcgttgac tacgtccctg ccctttgtac acaccgcccg tcgctcctac cgattgaatg
121 gtccggtgaa gtgttcggat cgcggcgacg tgggcggttc gctgcccgcg acgtcgcgag
181 aagtccactg aaccttatca tttagaggaa ggagaagtcg taacaaggtt tccgtaggtg
241 aactgcggaa ggatcattgt cgatgcctaa acatcaaacg acccgcgaac gcgtttaaaa
301 acaaactgtt cgcgttaggg gcggggggaa gcatgctctt tggctgcctc ctccccttcc
361 aacgcgttta aacaaaaccc cgcgcaggtc gcgccaagga acttgaaatg aattcgcctg
421 tcccctgccc cggcctcggc gtgcggggga tggagcattc tagtcgtatt actaacaacg
481 actctcggca acggatatct cggctctcgc atcgatgaag aacgtagcga aatgcgatac
541 ttggtgtgaa ttgcaggatc ccgcgaacca ccgagtcttt gaacgcaagt tgcgcccgga
601 gccttctggc cgagggcacg tctgcctggg cgtcacgcat cgctgccccc accacacaac
661 tctccccatg cggggtcgtt gtgaaggcag ggacacacac tggcctcccg tacgcaccgt
721 cgtgcggatg gcttaaattt gagtcctcga tgctcgtcgt cgcgacacta cggtggttga
781 ttcaactcgg tgacgcgtct cgacctcgac gtcgacttca cggactcctt cacgaccctt
841 cgaacgccgc cccttaaaag gacgacgctc tcgacgcgac cccaggtcag gcgggactac
901 ccgtgagagt ttaagcatat caataagcgg aggaaaagaa acttacaagg attcccctag
961 taacggcgag cgaaccggga agagcccagc ttgagaatcg ggcgtcctcg acgtccgaat
1021 tgtagtctgg agaagcgtcc tcagcggcgg accgggcaca agtccctgga aggggcgcca
1081 gagagggtga gagagcccgt tgcgctcgga ccctgtcgca ccacgaggcg ctgtcaa
//
LOCUS DOGFIXA 3080 bp ss-mRNA MAM 14-AUG-1990
DEFINITION Canine blood clotting factor IX mRNA, complete cds.
ACCESSION M33826
KEYWORDS blood clotting cascade protein; blood clotting factor IX; protease.
SOURCE Dog hemophilic liver, cDNA to mRNA, clone dF9.28.
ORGANISM Canis familiaris
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE 1 (bases 1 to 3080)
AUTHORS Axelrod,J.H., Read,M.S., Brinkhous,K.M. and Verma,I.M.
TITLE Phenotypic correction of factor IX deficiency in skin fibroblasts
of hemophilic dogs
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5173-5177 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.H.Axelrod, 23-APR-1990, for release after publication.
FEATURES from to/span description
pept 22 1380 factor IX protein precursor
sigp 22 138 factor IX signal peptide
matp 139 1377 factor IX blood circulatory protein
matp 577 1377 factor IX activated protein
matp 673 1377 factor IX activated protein
mRNA < 1 3080 factor IX mRNA
BASE COUNT 878 a 640 c 677 g 885 t
ORIGIN Chromosome X.
1 aagcagcgcc tgaataggat catggcagaa gcatcgggcc tcgtcaccgt ctgcctttta
61 ggatatctac tcagtgccga atgtgcagtt tttcttgatc gtgaaaatgc caccaaaatt
121 ctgagtcggc caaagaggta taattcaggt aaactggaag agtttgttcg agggaacctt
181 gagagagaat gtatagaaga aaagtgcagt tttgaagaag cacgggaagt ttttgaaaac
241 actgaaaaaa ccactgaatt ttggaagcaa tatgttgatg gagatcaatg tgaatccaat
301 ccatgtttaa atgacggtgt atgcaaggat gacattaatt cctatgaatg ttggtgtcga
361 gctggatttg aaggaaagaa ctgtgaatta gatgtaacat gcaacattaa gaatggcaga
421 tgcaagcagt tttgtaaatt gggccccgat aacaaggtgg tttgttcctg tactacggga
481 taccaacttg cggaagacca aaggtcctgt gaaccagcag tgccatttcc atgtggaaga
541 gtttctgtcc ctcacatttc tatgacacgc acccgtgctg aaactctttt ttccaatatg
601 gactatgaaa attcaactga agtggaaaaa attttggata acgtcaccca accgcttaac
661 gacttcactc gagttgttgg tggaaaagat gccaaaccag gtcaattccc ttggcaggtc
721 cttttgaatg ggaaagttga tgcattctgc ggaggttcca tcatcaatga aaaatgggtg
781 gtaactgcag cccactgtat tgagcctgat gttaaaatta ccatagttgc aggtgagcat
841 aacaccgaga agagggaaca tacagagcag aagcgaaacg tgattcgcac tattcttcac
901 cacagctata atgcaactat taataagtac aaccatgaca tcgcccttct ggaactggat
961 gagcccttaa cgctgaacag ctatgtaaca cctatttgca ttgctgacag ggaatactcg
1021 aacatcttcc tcaaatttgg gtctggctat gtgagtggct gggggagagt cttcaacaaa
1081 gggcgatcgg cttcaattct tcaatacctt aaagttccac ttgttgaccg agccacgtgc
1141 cttcggtcca cgaagttcac catttataac aacatgttct gtgctggctt ccatgaggga
1201 ggtaaagatt catgccaggg cgatagtggg ggaccccatg tcaccgaagt agaaggcata
1261 agtttcttaa ctgggattat tagctggggt gaagagtgtg cgatgaaagg gaagtatgga
1321 atatatacca aggtgtcccg gtatgtcaac tggattaaag aaaagacgaa gctcacctaa
1381 agaataatgt atttccaagg ttgacacgtt tagggtagaa aatggacaag gtcctttact
1441 aactaatcac tttttttatc tctttagatt tgcactatat acattctctg catactgctt
1501 tttctctttc tggggagaaa tctatctaga attcctattt tactagacta agtgaattag
1561 aaaatgtaat cactacggga atgtactgtg atgggacttg tgaccactcc cacaggtcta
1621 gcccttggca ccattgtgag gttaggttat tcctccgtcc cgtcagtgtt aagtttctcc
1681 actggggcaa ctccctgatt ctccctcctt ggcagcattc catgttccag acctttctta
1741 cctttcccat ggaaagcatc atcaaaatgt gttagatcta catccaggat atttgatcta
1801 gttcagaaca agtctaacac ctcacttgtt aaggaagaac acaggagcaa ctgacaggtt
1861 gcaactcacc agaaaacact atttcctttt ctgtatgctt attcctgctt ccttatctct
1921 tctgtttcct aatcctgaaa tcagtgtctc tctttctctt tctctctctc tttttcctac
1981 agaggtttaa aggagggaag ggacacatca tgctgtttta ctactgtcca cagttataca
2041 tgtctatcaa acccagactt gctttcagtt tggtctttga cttgcttttc ggagcatagg
2101 gatgaagcca ggtgcctgaa gaccttcagg aaaatttgtc tgaaagagtc atgttactga
2161 aatacatggc atcgtggaag gaatgaccga cccatcagaa tacgtccaat aagcatttgt
2221 aattgtgttg tgatagaggt aactaagaag agtgacatca actcctgcgt ctcatccccc
2281 atgtgaaaaa actgtgaact aaaagagaac agccagtgtg caacgtagaa ctagtagagt
2341 cttaaggaag aattcattgg tgggtctcca gtactggccg gagctaagga agaagttgcc
2401 ctcgaccaaa gaacatgagc accctgtctc ttaaactagc atgtccccac agtggagaag
2461 gatgtactgg tggcttaaag gcatgagtca ggggtgtctg ggtggctcag tggttgagcc
2521 tttggctcag ggcatgatcc cagcgtcctg ggattgagtc ccgcatcagg cttcctgcag
2581 ggggcctgct tctccctctg cttatgtctc tgcttctctg tgtgtgtgtg tgtgtctctc
2641 atgaaaaaaa taaataaaat ctttaaaaaa aaaaaaggta tgagtcattc cagtgagcca
2701 tctcactccc ccttttctgg tttcaggttc tctaggcaac cttttgatta taattaggcc
2761 ttccatattg aattttctaa agagttgctg accaacctct gcatgtactt tagtgtgtta
2821 ggctggctcc ctttatgaat taataaactg ttgttctggt tgatactgta gctttttgtg
2881 aatgtggctg atgtgaatca gtcatcccac atgttgttat atatgagact actgacaaaa
2941 tcatgtttgg gaccactttg acctgaccaa gctgctgcct tcacctcacc cccagccagg
3001 catttacttt agtccttcaa atattaagta ccaataaatg tcttttcaaa tttcaaaaaa
3061 aaaaaaaaaa aaaaaaaaaa
//
LOCUS ENHSREHPA 722 bp ss-mRNA INV 14-AUG-1990
DEFINITION E.histolytica serine rich protein (SHEHP) mRNA, complete cds.
ACCESSION M34438
KEYWORDS serine rich protein.
SOURCE E.histolytica (strain HM1:IMSS), cDNA to mRNA.
ORGANISM Entamoeba histolytica
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Lobosa; Gymnamoeba; Schizopyrenida; Tubulina;
Entamoebidae.
REFERENCE 1 (bases 1 to 722)
AUTHORS Stanley,S.L.Jr., Becker,A., Kunz-Jenkins,C., Foster,L. and Li,E.
TITLE Molecular cloning and expression of a membrane antigen of Entamoeba
histolytica possessing multiple tandem repeats
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4976-4980 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.L.Stanley,Jr., 22-MAY-1990, for release after publication.
FEATURES from to/span description
pept 4 705 serine rich protein precursor
sigp 4 42 serine rich protein signal peptide
matp 43 702 serine rich protein
ORF 188 688 ORF2
site 247 582 multiple tandem repeats
BASE COUNT 315 a 121 c 118 g 168 t
ORIGIN
1 tcaatgttcg catttttatt gtttattgca ttcactagtg caactaatat cattcttgat
61 ttggatcaag aagtaaaaga tactaatatc tacggtgtat ttttaaagaa cgaagctagt
121 cctgaaaagc ttgaagaagc tgaagaaaaa gaaaaaagta gctcagcaaa accagaatca
181 agttcaaatg aagataatga agatgatgaa gatgaaaaag caagttcaag tgataactca
241 gaatcaagct caagtgataa accagataat aaaccagaag caagttcaag tgataaacca
301 gaagcaagtt caagtgataa accagataat aaaccagaag caagttcaag tgataaacca
361 gataataaac cagaagcaag ttcaagtgat aaaccagata ataaaccaga agcaagttca
421 agtgataaac cagataataa accagaagca agctcaagtg ataaaccaga taataaacca
481 gaagcaagct caactaataa accagaagca agctcaacta ataaaccaga agcaagctca
541 actaataaac cagaagcaag ctcaactaat aaaccagaag caagctcaac tagtaattca
601 aatgataaat caggaagtag ttcagataac gataataata accttgatgc tgcatcaagt
661 ccattcattg ttttctgtgc tatcattata gctatcatct tctaaatgag ttatttgact
721 tt
//
LOCUS EUBBAIOA 6212 bp ds-DNA BCT 14-AUG-1990
DEFINITION Eubacterium sp. bile acid-inducible operon (baiBCDEA2F), complete
cds.
ACCESSION M36292
KEYWORDS 7-alpha-dehydroxylation; bile acid-inducible operon.
SOURCE Eubacterium sp. DNA.
ORGANISM Eubacterium sp.
Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE 1 (bases 1 to 6212)
AUTHORS Mallonee,D.H., White,W.B. and Hylemon,P.B.
TITLE Cloning and sequencing of a bile acid-inducible operon from
Eubacterium sp. VPI-12708
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.Mallonee, 10-JUL-1990.
Medical College of Virginia-VCU
Dept. of Microbiology
Box 678 MCV Station
Richmond, VA 23298-0678
FEATURES from to/span description
pept 24 1586 baiB protein
pept 1634 3256 baiC protein
pept 3235 3552 baiD protein
pept 3597 4097 baiE protein
pept 4130 4879 baiA2 protein
pept 4932 6212 baiF protein
BASE COUNT 1864 a 1324 c 1698 g 1326 t
ORIGIN
1 aaaagatatt aagcattaag aaaatgcaca aaaaatcagc gtgtgagagg gagggcaagg
61 agttgaagcg tgactttttt aacaagttta atttggggac atcgaacttt gtcacgccgg
121 gaaaacagtt ggaatacgtt tcggaatgca agccagattc tactgcggtc atttgcttag
181 ataaagaaca gaactgttcc gttattactt ggcatcagct gcacgtctat tccagccagc
241 tggcatggta ccttatagaa aatgagattg gcccggggtc gatcgtactt acaatgtttc
301 cgaacagcat cgagcacatt attgcggtat ttgcaatctg gaaggcgggc gcctgctata
361 tgcccatgtc ctataaggcg gcggaatccg agatcaggga ggcctgcgat accatccacc
421 cgaatgcggc ttttgcggaa tgcaagattc caggattaaa attctgcctt agcgcagacg
481 agatatatga ggcgatggaa ggaagatcca aggagatgcc ttcggaccgt ctggccaatc
541 cgaacatgat atccttatca ggcggaacca gcggaaagat gaagttcatc cgtcagaacc
601 ttccatgcgg gctggacgat gagacgatca gaagctggtc tttgatgtct ggaatgggat
661 ttgagcagcg ccagctgctg gtaggcccgc tgtttcatgg cgcgcctcac tccgcggcgt
721 ttaatggact gttcatgggc aacaccctgg tactgaccag gaacctttgc ccgggaaata
781 tcctgaacat gattaagaaa tataagattg aatttataca gatggtgccg accctgatga
841 accggcttgc caaactggag ggagtcggaa aagaagactt tgcatccctg aaggcgctgt
901 gccatacagg gggcgtctgt tctccctggc ttaagcagat ctggatcgac ctgctggggc
961 ctgaaaagat ctatgagatg tattccatga cggaatgcat cggccttacc tgcatccggg
1021 gagacgagtg ggtgaagcat ccgggaagca tcggacggcc agtgggcgat agcaaggtgt
1081 ctatccggga tgagaatggc aaggaagttg cgccttttga gattggcgag atctatatga
1141 cagcgccggc ctcctatctg gttaccgagt acatcaattg ggaaccgctg gaagtgaaag
1201 agggaggctt ccgaagcgta ggggatatcg gctacgtgga tgagcagggc tatctgtact
1261 tttctgaccg gcgcagcgac atgctggtat caggcggaga aaacgtgttc gccaccgaag
1321 tcgagacggc gcttttgaga tataaggata tcctggacgc tgtagtggta gggataccgg
1381 atgaagatct ggggcgaagg ctccatgcgg tcattgagac agggaaagag ataccggcag
1441 aggaactgaa aacattcctg agaaagtatc tgactccata taagatacca aagacgttcg
1501 agttcgtaag gagcatacga aggggagaca atggaaaggc cgacaggaag cggatcctgg
1561 aagattgtat tgcccgcggg ggatgattct ataaatgcaa agaaaacaaa ttatataaag
1621 gaggagtaac aaaatgagtt acgaagcact tttttcacca ttcaaggtca gaggactgga
1681 acttaaaaac cgtatcgtcc tgcctggaat gaacaccaag atggcaaaga acaagcacga
1741 cataggcgag gatatgatag cctaccatgt tgccagggca aaagcgggat gcgcgttaaa
1801 tatatttgaa tgcgtagcat tatgtccggc gcctcacgct tatatgtata tggggcttta
1861 tacggaccat catgtagaac agcttaagaa attgacggat gcagtccatg aagcaggcgg
1921 caagatgggc atccagctgt ggcatggagg attcagcccg cagatgttct ttgacgagac
1981 caacaccctg gaaactccgg acactcttac ggtagagagg attcatgaga tcgtagaaga
2041 attcggacgc ggcgcaagga tggctgttca ggctggattt gacgcagtag aattccatgc
2101 ggctcacagt tatctgcctc acgagttctt aagccctgga atgaacaaac gtacggatga
2161 gtacggcgga agttttgaga accgctgcag attctgttat gaagtcgttc aggcaatccg
2221 ttccaatatc ccggatgaca tgccattctt tatgcgtgca gactgcatcg acgaattaat
2281 ggaacagacc atgacagagg aagagatcgt tacatttatc aataagtgcg cagaacttgg
2341 cgtggatgtg gcagaccttt cccgtggaaa cgcgacttca ttcgcaaccg tatatgaagt
2401 tccgccattc aacctggctc atggcttcaa catagagaat atttacaaca tcaaaaagca
2461 gatcaatatc ccggttatgg gagttggccg tatcaataca ggagagatgg caaacaaggt
2521 cattgaagaa ggcaagtttg acctggtagg catcggacgc gcccagcttg cagatccaaa
2581 ctggatcacc aaagtaagag aaggcaaaga agacctgatc cgccactgta tcggatgtga
2641 ccagggatgc tatgacgcag tcatcaatcc aaagatgaag catatcacct gcacccacaa
2701 tccaggattg tgcttagagt atcagggaat gccaaagaca gacgctccta agaaagtcat
2761 gatcgtagga ggcggaatgg caggcatgat cgctgcggaa gtattaaaga ccagaggcca
2821 taacccggta atcttcgagg catccgacaa gcttgcagga cagttcaggc tggcaggcgt
2881 agcgccgatg aagcaggatt gggcagatgt tgcagaatgg gaagcaaaag aagtagagcg
2941 ccttggaatc gaagtacgtc tgaataccga agtgactgca gagaccatca aggaattcaa
3001 tccggataat gtcatcatcg cagtaggctc tacctatgcg ctgcctgaga ttccgggaat
3061 cgacagccca agcgtatact cccagtatca ggtactgaaa ggggaagtaa atccgacagg
3121 ccgtgtagcc gttatcggat gcggactggt tggtacggaa gtcgcagaac ttctggcatc
3181 cagaggcgca caggtaatcg cgatcgagag gaagggcgta ggtaccggcc ttagatgctt
3241 cgcagaatgt tcatgaaccc ggaattcaaa tattacaaga tcgccaagat gtccggaaca
3301 aatgtcaccg ctttagagca gggcaaggtt cactacatca tgacagacaa gaagaccaaa
3361 gaagtgacgc agggagtcct ggaatgcgac gctaccgtta tctgtacagg aattaccgca
3421 cgtccaagcg atgggcttaa ggcaagatgc gaagaacttg gaatcccggt tgaggtgatc
3481 ggagacgctg ctggcgcaag agactgcacg atcgcgacac gcgaaggcta tgacgcagga
3541 atggcaatct agaaaatcag aacttatcaa tcttacatat agaaaggatg atacatatga
3601 cattagaaga gagagttgaa gcattagaaa aagaattgca ggagatgaag gatattgagg
3661 caatcaagga actgaaagga aagtatttcc gctgcctgga cggaaagatg tgggatgagc
3721 tggagaccac cctgtcacca aatatcgtaa cctcttattc caacgggaaa ctggtattcc
3781 atagcccgaa ggaagttacc gattacttaa agagctcgat gccaaaagaa gagatcagca
3841 tgcatatggg ccacacgccg gagatcacca ttgacagcga gactacggct acgggcagat
3901 ggtatctgga agatagactg atctttacgg acggtaagta caaagacgta ggaatcaatg
3961 gcggcgcgtt ctatacagac aaatatgaga agatagacgg ccagtggtac atccttgaaa
4021 ccggctatgt acgaatctat gaagaacatt tcatgcgtga tccaaagatc catatcacga
4081 tgaacatgca caaataagaa tattgtaaaa gaaaggcagg agtaagagta tgaatctcgt
4141 acaagacaaa gttacgatca tcacaggcgg cacaagaggt attggattcg ccgctgccaa
4201 aatatttatc gacaatggcg caaaagtatc catcttcgga gagacgcagg aagaagtaga
4261 tacagcgctt gcacagttaa aagaacttta tccggaagaa gaggttctgg gattcgcgcc
4321 ggatcttaca tccagagacg cagttatggc agcggtaggc caggtagcac agaaatatgg
4381 cagactggat gtcatgatca acaatgcagg aattaccagc aacaacgtat tctccagagt
4441 gtctgaagaa gagttcaagc atattatgga catcaacgta acaggcgtat tcaacggcgc
4501 atggtgcgca taccagtgca tgaaggatgc caaaaagggc gttatcatca acacggcatc
4561 cgttacaggc atcttcggat cactctcagg cgtaggatat ccggccagca aggcaagcgt
4621 gatcggactc acccatggac ttggaagaga gatcatccgc aagaatatcc gtgtagtagg
4681 agtggctcct ggagttgtga acacggatat gaccaatggc aatcctccgg agatcatgga
4741 aggatatctg aaggcgcttc cgatgaagag aatgcttgag ccggaagaga tcgctaatgt
4801 atacctgttc ctggcatctg acttggcaag cggcattacg gctactacgg tcagcgtaga
4861 cggggcttac agaccataat tttaattttt actaagtaga atatgtgata tagaaaagga
4921 gatataaaaa catggctgga ataaaagatt ttccaaaatt cggagctctt gcagggctta
4981 agatacttga cagcggatct aacatcgccg gacctttagg cggaggcctt ctggcagaat
5041 gcggagcaac ggtcatccat tttgaaggac caaagaaacc tgataaccag agaggatggt
5101 acggctatcc acagaatcac cgtaatcagc tgtctatggt agcagacatc aaatctgaag
5161 aaggaagaaa gatcttcctt gatctgatca aatgggcaga tatctgggta gagtcatcca
5221 aaggcggaca gtatgacagg ctgggacttt ccgatgaagt catctgggaa gtaaatccta
5281 agattgccat cgtgcacgta tccggatatg gacagacagg agacccgtct tacgttacac
5341 gtgcatccta tgacgcagta ggccaggcat tcagcggcta tatgtcactg aacggaacaa
5401 cggaagcgct gaagatcaat ccttatctga gcgatttcgt atgcggactt accacatgct
5461 gggctatgct tgcctgctat gtaagcacca ttcttaccgg aaaaggcgaa tctgttgacg
5521 ttgcacagta cgaagcgctg gcacgtatca tggacggacg tatgatccag tacgctacag
5581 acggcgtgaa gatgccaaga accggcaata aggatgcgca ggctgccctg ttcagcttct
5641 acacctgtaa agacggacgt acgatcttta tcggaatgac tggcgcggaa gtatgtaaga
5701 gaggcttccc gatcatcgga cttccggtac ctggaaccgg agacccggac ttcccggaag
5761 gcttcacagg ctggatgatc tatactcctg taggacagag aatggaaaag gctatggaga
5821 agtatgtatc tgagcatacg atggaagaag tagaggctga gatgcaggca caccagattc
5881 catgccagag agtatacgag ctggaagact gcctgaacga tcctcactgg aaagcacgtg
5941 gaactattac ggagtgggat gacccgatga tgggacatat cacaggcctt ggactgatca
6001 acaagttcaa gagaaatcct tccgaaatct ggagaggcgc tccgctgttc ggtatggata
6061 accgcgatat cctgaaagac ctgggatatg acgatgcaaa gatcgatgaa ctctatgagc
6121 agggcatcgt caatgaattc gaccttgaca ctactatcaa acgctataga ctggatgaag
6181 taattccaca tatgagaaag aaagaggagt aa
//
LOCUS FDICPECDA 2371 bp ds-DNA BCT 14-AUG-1990
DEFINITION F.diplosiphon phycoerythrin-associated linker protein (cpeCD) gene,
complete cds.
ACCESSION M33832
KEYWORDS cpeC gene; cpeD gene; phycoerythrin-associated linker protein.
SOURCE F.diplosiphon DNA.
ORGANISM Fremyella diplosiphon
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria.
REFERENCE 1 (bases 1 to 2371)
AUTHORS Federspiel,N.A. and Grossman,A.R.
TITLE Characterization of the light-regulated operon encoding the
phycoerythrin-associated linker proteins from the cyanobacterium
Fremyella diplosiphon
JOURNAL J. Bacteriol. 172, 4072-4081 (1990)
STANDARD full staff_entry
COMMENT Authorin copy for [1] kindly submitted by N.A.Federspiel,
24-APR-1990, for release after publication.
FEATURES from to/span description
pept 454 1314 phycoerythrin-associated linker protein (cpeC)
pept 1355 2104 phycoerythrin-associated linker protein (cpeD)
mRNA 267 > 2371 cpeCD mRNA
binding 440 444 cpeC ribosomal binding site
binding 1339 1343 cpeD ribosomal binding site
BASE COUNT 679 a 529 c 514 g 649 t
ORIGIN
1 gatctgctca agcagtagat ttattaatca aagcaggggc gacggagtga agtggggatt
61 ggggactggg gatgaggagg atgaggggga tgaggaggat gagggggatg aggggaaata
121 accattaccc attccccatt ccccattacc caatttccca tgcccattac aaatagtttg
181 tgcaaattga gtgcaaaatt ctctcatttc atcaagattt acaaatcttg atgtacatct
241 ttccgcgctc ataagaaaat gatcataaca aagcgtataa aacgcctaaa accttgagca
301 tctaggaaat cacagctttc atgcttatgg aagctggtaa tttgggttgc aacaaatcaa
361 tccaaagaat tgatgagcct ggatgagcaa tctgaaattt atttacaaat cgaaacaaat
421 ttcttaaact ttcgttaaca ggagaaacaa ttaatgccat ttggaccagc ttcacgcttg
481 ggagtcagcc tatttgatga aactcctccc gttgagtggg taccaggtcg ctcacaagaa
541 gaagcagaaa caatcattcg ggcaatctat cggcaagtat taggtaatgc ctatgtgatg
601 gaaagtgagc ggcttgctgt gcctgaatcc cagtttaagc ggggtgagtt gagcgtccgc
661 gagtttgtca gagcagtggc taaatctgaa ctatatcgtt ctcgcttttt caccagttgt
721 gcgcgctacc gagccattga actcaacttc cgccatctat tgggtcgtcc accactagat
781 ttagaagaaa tgcgctccca cagcacaatc cttgatactc aagggtttga agctgagatt
841 gattcttata tcgatggtga tgagtatcag tctacttttg gcgagaacat tgtaccttac
901 atccgaggct ataaaaccga agcgcttcag agcatggtgc aatttactca taccttccaa
961 ctggtacgag gtgcttctag cagcagcctg aagggtgact tatctggcaa ggctcctaag
1021 ctgaatgcat tagtgattca aagcacacca acagcagtaa tttcacctgc tagcgctgga
1081 gcaaccttct cgacaccacc tactggtgcc cgtacccgtc ttggagtcga tgctagtgct
1141 ggtggcaaag tttaccgcat tgaagttaca ggttatcgtg ccaaaacctt caataatatt
1201 tccaagtttc gccgttccaa tcaagtcttt ctggtgccct acgaaaagct ctctcaagag
1261 tatcaacgga ttcaccagca aggcggcgtg atcgcaagta tcactcctgt ataaattagg
1321 tgcaaactta aaaaattgag gagcagaaat tttaatggca tcccagacaa ttcttgaact
1381 ttggccctct agtagcttag aagaagttca aactattatc cgtgcagttt acaaacaggt
1441 tttaggcaac cctcatgtta tggagagtga gcggttggtg acagcagaat cacaattatg
1501 cgatcgctcc atcaccgtgc gggaatttgt ccgcagcgtt gccaagtctg atttttatcg
1561 caaccgctac ttccaatcct gcgctcccta ccgatttgta gaacttaact tcttacattt
1621 gcttggtcgc gcaccccagg atcaaagaga agtttccgaa cacatcgttc gtactgtagc
1681 tgaaggctac gatgctgaaa ttgactccta tatcgatagt agtgaatatg aagcagcctt
1741 tggtgaaaac gtagtgcctt actatcgtgg tagaagtagc gaagccaact ccaagcaagt
1801 aggcttcaac cgcatatttg cccttgatcg cggccctgcc caaattgata gtgcagttaa
1861 atcggctcaa ttggtctatg ctgttgctac taacagcgcc aacgcgatca aagcctcttc
1921 atccacagtc attggctctg gaactgaaaa acgattcaaa atcttggtgc aaggttccaa
1981 attcgacagt ccccgacgca tcagtaccac tgagtacatt gttccagcta gtaagatgac
2041 tccccaaatt cagcggatta atcgtacttc tggcaaaatc gtcagcatta ctgaaattgt
2101 ctaaccttta acagggtggg cattaatact tgtctgttct aaatgttgaa ccgattagta
2161 ttaacaactc gtaattcgta gttactaatt cgtaattacg ttacggatag ggatttagct
2221 cccacctgta acggcatgat tttatagaag tcggagactc aaaccctcag cttaattacg
2281 aattacgaat tacgaattac gaattagtaa tttaggttga gacattgccc gccccagatc
2341 attcactttt ttaatttgta attttcaaaa a
//
LOCUS HIVPRBAC 582 bp ds-DNA SYN 14-AUG-1990
DEFINITION Synthetic HIV type-I protease (syn BAC) gene, complete cds.
ACCESSION M36450
KEYWORDS HIV type-1 protease; HIV-PR gene.
SOURCE Synthetic oligonucleotide DNA, clone pET3AM.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 582)
AUTHORS Cheng,Y.-S.E., McGowan,M.H., Kettner,C.A., Schloss,J.V.,
Erickson-Viitanen,S. and Yin,F.H.
TITLE High-level synthesis of recombinant HIV-1 protease and the recovery
of active enzyme from inclusion bodies
JOURNAL Gene 87, 243-248 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 7 573 HIV type-I protease (BAC)
BASE COUNT 120 a 125 c 170 g 167 t
ORIGIN
1 gaattcatgg aagatctggc tttcctgcaa ggtaaggctc gtgagttctc ttctgaacag
61 actcgtgcta actctccgac tcgtcgtgag ctgcaggttt ggggtcgtga taacaactct
121 ccgtctgagg ctggcgccga tcgtcagggt accgtttctt ttaactttcc gcaggttact
181 ctgtggcagc gtccactggt tactattaag attggtggtc agctgaagga agctctgctg
241 gataccggtg ctgatgatac tgtactggag gagatgtctc tcccgggccg ttggaagccg
301 aagatgatcg gtggtattgg tggtttcatt aaggtacgtc agtatgatca gattctgatt
361 gagatctgtg gccataaggc gatcggtacc gttctggtgg gcccgactcc ggttaacatt
421 atcggccgta acctgctgac tcagatcggt tgtactctga actttccgat ttctccgatt
481 gagactgttc cggttaagct gaagccgggt atggatggtc cgaaggttaa gcagtggcca
541 ctgactgagg agaagattaa ggctctggtt taatagaagc tt
//
LOCUS HNNMTPIT 1336 bp ds-DNA ORG 14-AUG-1990
DEFINITION H.annuus mitochondrial plasmid pIT DNA.
ACCESSION M36422
KEYWORDS mitochondrial plasmid.
SOURCE H.annuus (strain CANP3) mitochondrial plasmid pIT DNA, clone
pITC20.
ORGANISM Mitochondrion Helianthus annuus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Asterales; Asteraceae; Compositae; Helianthus annuus.
REFERENCE 1 (bases 1 to 1336)
AUTHORS Perez,C., Dujon,B., Heizmann,P. and Berville,A.
TITLE Sequence of a mitochondrial plasmid of sunflower (Helianthus
annuus) and its relationship to other mitochondrial plasmids
JOURNAL Plant Sci. 58, 59-69 (1988)
STANDARD simple staff_entry
BASE COUNT 382 a 314 c 290 g 350 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattctttt gttccctcaa caagcaacta atcacaagga aagaaaacaa aagacaaaga
61 aaggcggttt tgagacataa aaaataaatg ctaaacttgc tttttaatct caaaaggtcg
121 atcttgttgt tttgagaact caaatgcagc tatcattatc ggggtgctcc gccacccctc
181 gtttccatca tccctcgact gcagtcgagg gtacgtctcc cttccctaaa aaaagatcga
241 gctttttact tattacagaa gagagagagc accttacaga agagagacac atcgaccgtt
301 ttttacttat tacagaagag agatagcacc ttacagagag atagcacatc gaccgttttt
361 tacttattac agaagagaga gacctaaaaa ggttcaaatg agaaatatct taagtgatcg
421 tggaaactga gctttgtaaa ggagtagaga gtgtaagacg gggttgcgag tctttgcgag
481 ttgggtagtc acctctccaa cccattccgg acgttgcccc aagcccccta gcagcgctct
541 cgaaactcta caatgcgcag acaagaaggc gattgagtta acggaagcgt ctcaaaaaca
601 ttctttttga agaacaagaa gaagactcag cgctgggtaa gtggtttaag acttttcccg
661 accctcggct ccttggggag tagtgccccc aagacacagc cacgccaatc cgacgcggct
721 ccatacccct agtttagtat atagagctgt ttttcttttt ttctctgagg agcaaagaaa
781 aaagtaaaaa atttttttat ttttggacgg ccagcccggt aggaaggtga gataaatctc
841 cccctccccg gcctccccct ctgttaaggt cgagtcctta ctcaattcat cggatgaacg
901 gactcggctg gacttgttag ctgcataaaa gcatagaaca tagcccggag aatgaagatc
961 acccgaactt cattcccctg gctacaaccg gcagcaggag aagctcttct ctttgctggc
1021 caagttatcg ccatggggcc cccccccctc ccatggcttg ctttgttgct gctcttttac
1081 tgcctatagc tgacaagtcc gctctccccg acttaacacc gagaaaaaca aaaaacagct
1141 ctatatacta ggggaggaga gcggagtgag tgcaggccct tcaacgagag cattgttttg
1201 tggtccgggt tcggtattga attatgaaac tcctttctta ttgatgattt acggcctttt
1261 aaaacagttg ttcagtctca aatggtaccc tatattgatc ttgaaagaaa tcattttgtc
1321 tgtcaagatg gtcata
//
LOCUS HSE1GBA 3958 bp ds-DNA VRL 14-AUG-1990
DEFINITION Equine herpesvirus type 1 glycoprotein B (gB) gene, complete cds.
ACCESSION M36298
KEYWORDS antigen; glycoprotein B; structural protein.
SOURCE Equine herpesvirus type 1 (strain Ab1) DNA, from COS cells.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 3958)
AUTHORS Bonass,W.A., Elton,D.M., Stocks J,-M., Killington,R.A.,
Meredith,D.M. and Halliburton,I.W.
TITLE Molecular analysis of the Equine herpesvirus type-1 strain Ab1,
glycoprotein B gene and its expression in COS cells
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.A.Bonass, 10-JUL-1990.
Leeds University
Dept. of Microbiology
Leeds LS2 9JT, UNITED KINGDOM
FEATURES from to/span description
pept 603 3545 glycoprotein B precursor (gB)
sigp 603 860 glycoprotein B signal protein
matp 603 3542 glycoprotein B
mRNA 475 > 3958 glycoprotein B mRNA
signal 370 373 CAT box
signal 453 458 TATA box
signal 3554 3560 polyA signal
site 2165 2166 internal cleavage site
BASE COUNT 1019 a 1075 c 1010 g 854 t
ORIGIN
1 gttaacgtgt atggcgaagt gctgctacat acgtacgtat cgtcctacaa cgggttttgc
61 gccaggcgcg ggttttgcgc ggcggtgagt cgagcgggta ccatcataga taaccgctct
121 agcacgtccg cgttcgactc gcatcagttc atgaaggcgg cgctgcttcg ccaccccatt
181 gaccagtcgc tcatgccgtc cataacacac aagtttttcg agctgatcaa cgggcccgtg
241 tttgacaacg ctggccacaa ctttgcgcag ccgccaaaca cggcattata ttacagcgtt
301 gaaaacgttg ggttgttacc gcatctcaag gaggaactag ctcggtttat gattactgcg
361 gctaaaggtg attggtcaat tagcgagttt caaaggtttt attgctttga gggagtgaca
421 ggtgtgacgg ccacgcagcg gctggcgtgg aaatatatcg gggagctcat cctagccgcc
481 gcagtattct cctcggtttt ccactgtgga gaggtgcgcc tcctgcgcgc agatcgtacc
541 tacccggact ccagcggcgc acagcgctgc gtgagcggca tttacataac ctacgaggcg
601 tcatgtcctc tggttgccgt tctgtcggcg gctccacatg gggcaattgg cgcggagacg
661 gtggtgattt acgacagcga cgtgttctct ctcctgtatg cagtgctcca gcagctggct
721 cctggatcgg gagccaacta ggcaatgttg gaaacttact cgccaccccc cacccgctgg
781 gaaagccggc atcatcgagg gtgggcacaa tagttctagc ctgtttgttg ctttttggaa
841 gctgtgttgt tagagccgta cccaccacgc caagcccccc aactagtact cccacttcca
901 tgtcaacgca ctcccatggg acagtagacc ctacgctgct ccccacagaa acgcccgacc
961 cactcagact ggctgtgcgc gagtccggta tactcgctga ggatggagac ttttacacct
1021 gcccaccgcc taccggatcc accgtcgtac gcatcgaacc acctagaact tgccccaagt
1081 ttgaccttgg gagaaacttc acggagggga ttgctgttat ttttaaggaa aacatcgctc
1141 cctacaaatt cagggcaaac gtatactaca aggacatcgt tgtaacacgt gtgtggaaag
1201 gatacagcca tacgtccctg tccgacagat acaatgacag ggttccggtt tcggtggagg
1261 agatcttcgg tctcatcgac agtaagggaa aatgttcgtc aaaggccgag tacctcagag
1321 ataacatcat gcaccacgcg taccacgacg acgaggacga ggtggagctt gatttggtgc
1381 cgtccaagtt tgcaactccg ggggccagag cctggcagac caccaacgat actacgtctt
1441 acgtggggtg gatgccatgg aggcactaca cgtcaacgtc tgtcaactgc atcgtcgagg
1501 aggtggaggc gcggtccgtc tacccctacg actccttcgc cctgtccacc ggtgatattg
1561 tgtacgcgtc tccgttttac ggcctgaggg ctgccgctcg catagagcac aatagctacg
1621 cgcaggacag tttcaggcaa gttgaagggt acaggccccg cgacttagac agtaaactac
1681 aagccgaaga gccggttacc aaaaatttta tcactacccc gcatgtcacc gtcagctgga
1741 actggaccga gaagaaagtc gaggcgtgta cgctgaccaa atggaaagag gtcgacgaac
1801 tcgtcaggga cgagttccgc gggtcctaca gatttactat tcgatccatc tcgtctacgt
1861 ttatcagtaa cactactcaa tttaagttgg aaagtgcccc ccttactgaa tgtgtatcca
1921 aagaagcaaa ggaagccata gactcgatat acaaaaagca gtacgagtct acgcacgtct
1981 ttagcggtga tgtggaatat tacctggcac gcggggggtt cttaattgca ttcagaccta
2041 tgctctccaa cgaactcgcc aggctgtacc tgaacgagct tgtgagatct aaccgcacct
2101 acgacctaaa aaatctattg aaccccaatg caaacaataa caataacacc acgcgaagac
2161 gcaggtctct cctgtcagta ccagaacctc agccaaccca agatggtgtg catagagaac
2221 aaattctaca tcgcttgcac aaacgagcag tggaggcaac ggcaggtacc gattcttcca
2281 acgtcaccgc caaacagctg gagctcatca aaaccacgtc gtctatcgag tttgccatgc
2341 tacagtttgc atacgatcac atccaatccc acgtcaatga aatgctaagt agaatagcaa
2401 ctgcgtggtg taccctccaa aacaaagagc ggaccctatg gaacgaaatg gtgaagatta
2461 acccgagcgc catagtctcc gcaacccttg acgagcgagt tgcagcgagg gtcctggggg
2521 acgtgatagc tataacgcac tgcgccaaaa tagagggcaa cgtgtacttg caaaactcca
2581 tgcgctcgat ggacagtaac acgtgctact cccgcccccc cgtaacattt acaattacta
2641 agaatgcaaa caacagaggg tcgatagaag gccagctggg agaggagaac gagattttca
2701 cggagcgcaa gctgatcgag ccgtgcgccc tcaatcagaa gcgctacttt aagtttggca
2761 aagagtacgt ttactacgag aactacacgt tcgtccgcaa agtgcccccc acggaaatcg
2821 aggttatcag cacgtacgtt gaactaaact tgaccctttt ggaagaccgc gagtttctgc
2881 ccctggaggt gtacacgcgg gctgagctgg aggacaccgg cctgctagac tacagcgaaa
2941 tacagcgccg caaccagctc cacgctctca ggttttacga catcgacagc gtggtcaacg
3001 tggacaatac cgcagtgatt atgcagggga tcgccagctt tttcaagggc ctgggtaaag
3061 tgggggaggc cgtgggaacg ctcgttctcg cggccgccgg cgctgttgtt tcaaccgtat
3121 ctggaatagc ttcgttttta aacaacccat ttggggggct agccatcggc ctgctggtaa
3181 tcgccggcct ggtagctgcg ttttttgctt acagatatgt aatgcagatc cgcagtaacc
3241 ccatgaaagc tctatacccc ataacaacaa aggccttgaa aaacaaagcc aaaacttcct
3301 acggccagaa cgaggaggac gatgggagcg actttgatga ggccaagctt gaagaggctc
3361 gcgaaatgat caaatacatg tctatggttt cggccctgga aaagcaggaa aagaaagcta
3421 taaagaaaaa cagtggggtt ggcctgatcg ccagtaacgt ctcaaagctg gccctgcgaa
3481 ggcgcggtcc caaatatacc cgactccaac agaacgatac catggaaaat gaaaaaatgg
3541 tttaaacatg tttaataaat attatgacac gtactcaaag tgtgacctca tatttgcata
3601 accactttct agttccggcc ccaaggatat ttaagcctag tatctccgcc gaggtttcat
3661 cctcattcac caactcacac ttagagttga cgcttcctct tgcgcctttg ctctcgccgc
3721 tcctgtgtta gcgtatactg cccaagaaat ggattctcca cgcggtatct ccacagctac
3781 cggtgatgcc cacgccgagg ccgcggtttc cccagccgcg gaaatccaga taaaaacgga
3841 agcccccgat gtagacggac cagaagccac tactgagtgt ttagaccaca cctacaccca
3901 acagacaagc gggggtgatg gcctagatgc tatcgatacg gacgatctct ggagatgg
//
LOCUS HSE1GPEIA 4707 bp ds-DNA VRL 14-AUG-1990
DEFINITION Equine herpes virus type 1 glycoproteins gE, gI and 10 kDa protein
genes, complete cds.
ACCESSION M36299
KEYWORDS glycoprotein gE; glycoprotein gI.
SOURCE Equine herpesvirus type 1 (strain Ab1) DNA, clone pUC19.
ORGANISM Equine herpesvirus type 1
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Alphaherpesvirinae.
REFERENCE 1 (bases 1 to 4707)
AUTHORS Elton,D.M., Bonass,W.A., Killington,R.A., Meredith,D.M. and
Halliburton,I.W.
TITLE DNA sequence analysis of the 4.7Kb BamHI to EcoRI fragment of the
Equine herpesvirus type-1 short unique region
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.M.Elton, 10-JUL-1990.
University of Leeds
Dept of Microbiology
Leeds LS2 9JT, UNITED KINGDOM
FEATURES from to/span description
pept 594 1868 glycoprotein I precursor (gI)
sigp 594 701 glycoprotein I precursor (gI)
matp 702 1865 glycoprotein I precursor (gI)
pept 2101 3753 glycoprotein E precursor (gE)
sigp 2101 2208 glycoprotein E precursor (gE)
matp 2209 3750 glycoprotein E precursor (gE)
pept 4478 > 4707 10 kDa protein
ORF < 1 484 ORF1
ORF 3750 4140 ORF4
signal 544 550 gI TATA box
signal 1924 1929 gI polyA signal
signal 1955 1958 gE CAT box
signal 4139 4144 gE polyA signal
BASE COUNT 1204 a 1291 c 1107 g 1105 t
ORIGIN
1 gaattcaact tcccacaagg agagcatatg acatggttga agttctggtt cgtctacgat
61 ggtggaaacc taccagtgca gttttatgaa gcccaggcat tcgcaagacc cgtgcctccg
121 gataaccacc ctggatttga ttctgttgag tcggagatta cacaaaataa aacagacccg
181 aaaccaggcc aggcggaccc caaacccaat cagcctttta agtggcccag catcaaacac
241 ttggccccaa gactcgatga ggtggatgag gtcatagagc ccgtaacaaa gcccccaaaa
301 acgtctaaga gcaactctac gtttgtgggc atcagcgtcg gtttgggtat cgccggccta
361 gtattggtgg gcgtcattct atacgtctgc ttgcgtcgga agaaggaact gaaaaagtct
421 gcacagaacg gcttgactcg cctacgctcg acctttaagg atgttaaata tacccagctt
481 ccgtaaacag tgttgcgtaa cctgctggga ggtgtccacg gccttaaagc ttcgcggttt
541 ggagatataa cgcacaacct acaacaaacg cgacacagca agtagtagtc gctatggcca
601 aactcactgg gatgttcagc gctgcgatat tactgtctat ggctatatgc tcaaccgcaa
661 tcatatatcg cggagaacat atgagcatgt acctaaacgc cagctctgag tttgccgtgt
721 accccactga tcagtccctt gttttggttg gccacttgct ctttctcgac ggacaacgct
781 tacccaccac caactatagt gggctgatcg aattgattca ttacaactac tccagcgttt
841 gctacactgt tatccaaacg atatcgtatg aatcatgccc gcgtgtagcc aacaatgctt
901 tcagatcgtg cctccacaaa acttctaagc actaccacga ctatttccga gtcaatgcct
961 ctgttgaaac caacgttctc ttaaacatca caaagccaca gcctacagat tccggggcgt
1021 atatccttcg cgtaaaactt gaccacgcgc caaccgcaga tgtttttgga gtttccgcct
1081 ttgtttacga tctaaaatct aaaacggtcc ccgatccaat gcccaccaca caaacggtag
1141 aacctacaac gagctatgtg tcgactccca catacgacta taccgatgac gtaaccaccg
1201 aaactgaatc cacatcaaca tctacccaac aggcgatgac ctccactcaa acccctagcg
1261 ctacatgggg aacccagcta accacagagc tgccgacaaa cgaaactgtg gttattggtc
1321 aggaggccct gttatgccat tggttccagc catcgacaag ggtgccgacc ctgtatctgc
1381 atctgttggg acgcactggc aatctcccgg aagatgttct actggtcgaa gactctgagt
1441 ttcttcgtac cacatcgcct gcacataggc cttctgcatc acccgctgac ggtgatgatt
1501 ttaaacagac aaactcaact tcccttaagg cgcgcaacaa gatcgtcgca atggtggtta
1561 tcccgaccgc gtgtgtacta atgctcctgt tggtggttgt cggtgccatc ataaacggtg
1621 ccgtgcgcaa acatttattg agttgcgcaa gccgcaggat ctaccgctcc ggacaggggg
1681 gcgcatcggc ggccgaacgg agacggctga cttgcggtcc tactttagcc gcgtcatcgg
1741 agtcgctggc cgacgataca acgtcatcac ctccaacccc caaaccttcg aagaaaacca
1801 agttggagac cgatccgctt atggaacagc tgaaccggaa actggaggcc atcaaagaag
1861 aatcatagtt gtgggggtag atggggttgg tattaaagtt tgtgtattat cgattttata
1921 tttattaaaa tttgtgaaac ataaacatct tgtgcaatgt ttacattatt tgtgattggg
1981 acggtccact gggaggtggt acaactcggg tttaaagctc tggatgtttg gtaggaaact
2041 cacagttctc cactttggcg tcaaagcaat cagacgtcta attcgaagta gaacgtcaca
2101 atggagctgt tggccgcaag tcgcgcttgt atattttttg ggctagtaac agtactcgat
2161 gcgtggggag tccaacaagt tgaactttcc gagggggctt gggctatgat cgacggaagg
2221 gacgttttaa cccctactaa cacaactact cgggtcacaa aggcctggac gtttttggaa
2281 acccctcccg gttgcgctgg cgacatatca gttaagaagg tgtgcgtgag ccatagtctg
2341 tgcgaagata acattataat aggaaagcac tgtaacctct taactgggga acatggcatt
2401 gcgttggccg agtttaacgt agtaaacgga tcgctgcgca gaacagacga tgtgtacttt
2461 gtgaatggta cagtctttcc aatccttgcc gaaacccgca gcgtcctaca aatccatagg
2521 gcaaccccct ctatcgcagg ggtttacacc ctccacgttt ccatcgacgg aatgatgaaa
2581 cactccgtcg tgctgctcac cgtcaagaag ccgcccaaac aaccgcaacc acgcttgcgc
2641 gttaagaccc cgccacccgt aaccgttcct caggttcccg taaagaccca cacggatttt
2701 gtggtgcacg gataccactc gcgcgtgtac gctgatggcg aatctttcga gctgtcggtg
2761 aacctggagt cacatatcgt agagcccagc ttcagcgcgg agattcagtg gtactatatg
2821 aatacatcat cgtcatcatg cgatctattt cgagttttcg aaacctgcat ctttcacccg
2881 acagccatgg cctgcctgca cccggaacaa cacacctgca gcttcacatc ccccatcaga
2941 gcgaccaaga tcctacaccg ggtgtatgga aactgcagcg atcatggaaa ttcgtggcct
3001 tctaggtgcc atagcactct gctgggcaat cgtctatact ttattcaacc agcacagaac
3061 agagtggacc tgttgttcaa agacactccc gcgtcggcta ccgggctgta tgtgtttgta
3121 ttattgtaca acggacatcc ggaggcgtgg acgtatacgc tgctgtcaac cgcaaatcac
3181 tttatgaatg tgcttactga cgtgacccgc ccacggctag gagagcactt ttatacggac
3241 ctcgggcaca aaatcatcac tcctcatcca tctgtagcta ccactgaaga gttgggagct
3301 tggactcgac actacctcgc ctttttgctg gttattatct gcacgtgcgc ggcgctgcta
3361 gttgcattgg tggtgtgggg ctgtattctc tacatccgaa gcaaccgtaa gccgtatgaa
3421 gtgctgaacc cctttgaaac ggtttacacg agcgttccaa gcaacgaccc ctcggacgag
3481 gtcttggtgt ttgagcgcct agcttcggac tctgacgact ccttcgactc tgattcagac
3541 gaagagttgg aatacccacc acctcccaaa ccagctccac agctcccacc ataccagttt
3601 gtagacgggg gagacgcccc tagcggcagg tccggattca aggtttggtt ccgcgataca
3661 cccgaggcgt ccccggttcc tcttcataaa ccaacgctac agggtccaga ctacagccgg
3721 gtagcgtcga agctaaagtc gatactaaaa tgagcagcaa cagcgataac acagagtgct
3781 tcgggggagt caactatgcc gagggaatgc gcaacggtaa acgcaaccct gtcagaaaca
3841 gcacctttca agagtatctc gacgcgcgta acgcgcgtta tcccagatcc ggctcaacct
3901 ccgattccga cgaggactac acaaccagat caaagtacga gtcagatgtc agcgagttta
3961 aaaaaatgat ggatctggaa actctacctc ccccaaaggc tgagccgcaa gctcagaagg
4021 ccgagcctga tgctgcgaag gaggagccag tcagcaccac tagctacatc ttaaacgaat
4081 gggtggctcc tatgattggg cattttctgg caatgtgtat gtatgagttg cttttcaaat
4141 aaaaacaaac attaacccct gtaaacatcc gtttgtctac tgtgtatgat agagttaaac
4201 ccaaccctag agagttatgt atttaatgcc ctgggacccc gcggaagtca tatatccctc
4261 ggccccctca tttgggcgca cattgcctgc ccggcggcag tcttactccc ttagctcgcc
4321 ctcttgcata agataaacta ttcccctccc agctagtttc acccaccaga ttaagcgagg
4381 ttttccctct cagcgatcac ttttcaccac cgaagaacag gccctcatcg gtttccctcc
4441 gtgttttccc atccatctat ccaaccacta cattttcatg gagaaggcgg aggctgccgc
4501 agttgttata cccctgtcag tttccaaccc cagctaccgt ggaagcggta tgtccgacca
4561 agaagtaagc gaagaacaat ctgctggaga tgcctgggtg tctgcagcaa tggcagccgc
4621 agaggcggtg gctgctgccg ctacctccac cggaattgat aacactaacg actacacgta
4681 caccgctgct tctgagaatg gggatcc
//
LOCUS HUMADRA2RA 2072 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human alpha-2-adrenergic receptor (alpha-2 c2) gene, complete cds.
ACCESSION M34041
KEYWORDS G-protein coupled receptor protein; alpha-2-adrenergic receptor;
plasma membrane protein.
SOURCE Human placenta DNA, clone alpha-2 C2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2072)
AUTHORS Lomasney,J.W., Lorenz,W., Allen,L.F., King,K., Regan,J.W.,
Yang-Feng,T.L., Caron,M.G. and Lefkowitz,R.J.
TITLE Expansion of the alpha-2-adrenergic receptor family:
Characterization of a human alpha-2-adrenergic receptor subtype,
the gene for which is located on chromosome 2
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5094-5098 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.W.Lomasney, 03-MAY-1990, for release after publication.
FEATURES from to/span description
pept 413 1765 alpha-2-adrenergic receptor (alpha-2 C2)
/hgml_locus_uid="LY0100G" /nomgen="ADRA2RL1"
/map="2"
BASE COUNT 316 a 705 c 660 g 391 t
ORIGIN Chromosome 2.
1 ctgcaggcgc ggctggggcg aaacgtgcga gctgagcggg cgcaaggtcc tccgcgcctc
61 ctttaagaac cggcccagcc cggcccgcgc ccccagagcg tacggcatcc gcgtggcgga
121 gggcgcgact ttctccggtc ccgggcggga cggggacggc ggcgggacaa cttgggaaac
181 ttctctgggg cggacggcag ggaccccggg caccgtggag gaggatgtag gagggcggct
241 gctggtcctg ggtgttcccg acctcctagg ccccgctcgt ccaggccatg gggctccagc
301 gccctcgcgg cgccggaggg gcgacgctct tgtctagccg agccgggcag cgctgtcgtc
361 cacggtgcgc actgggcggg cagcgctccc tctgcccacc tcccgccccg tcatggacca
421 ccaggacccc tactccgtgc aggccacagc ggccatagcg gcggccatca ccttcctcat
481 tctctttacc atcttcggca acgctctggt catcctggct gtgttgacca gccgctcgct
541 gcgcgcccct cagaacctgt tcctggtgtc gctggccgcc gccgacatcc tggtggccac
601 gctcatcatc cctttctcgc tggccaacga gctgctgggc tactggtact tccggcgcac
661 gtggtgcgag gtgtacctgg cgctcgacgt gctcttctgc acctcgtcca tcgtgcacct
721 gtgcgccatc agcctggacc gctactgggc cgtgagccgc gcgctggagt acaactccaa
781 gcgcaccccg cgccgcatca agtgcatcat cctcactgtg tggctcatcg ccgccgtcat
841 ctcgctgccg cccctcatct acaagggcga ccagggcccc cagccgcgcg ggcgccccca
901 gtgcaagctc aaccaggagg cctggtacat cctggcctcc agcatcggat ctttctttgc
961 tccttgcctc atcatgatcc ttgtctacct gcgcatctac ctgatcgcca aacgcagcaa
1021 ccgcagaggt cccagggcca agggggggcc tgggcagggt gagtccaagc agccccgacc
1081 cgaccatggt ggggctttgg cctcagccaa actgccagcc ctggcctctg tggcttctgc
1141 cagagaggtc aacggacact cgaagtccac tggggagaag gaggaggggg agacccctga
1201 agatactggg acccgggcct tgccacccag ttgggctgcc cttcccaact caggccaggg
1261 ccagaaggag ggtgtttgtg gggcatctcc agaggatgaa gctgaagagg aggaagagga
1321 ggaggaggag gaggaagagt gtgaacccca ggcagtgcca gtgtctccgg cctcagcttg
1381 cagccccccg ctgcagcagc cacagggctc ccgggtgctg gccaccctac gtggccaggt
1441 gctcctgggc aggggcgtgg gtgctatagg tgggcagtgg tggcgtcgaa gggcgcacgt
1501 gacccgggag aagcgcttca ccttcgtgct ggctgtggtc attggcgttt ttgtgctctg
1561 ctggttcccc ttcttcttca gctacagcct gggcgccatc tgcccgaagc actgcaaggt
1621 gccccatggc ctcttccagt tcttcttctg gatcggctac tgcaacagct cactgaaccc
1681 tgttatctac accatcttca accaggactt ccgccgtgcc ttccggagga tcctgtgccg
1741 cccgtggacc cagacggcct ggtgagcccg cctgcgctgc ccctgtgggt tggtgcggtg
1801 gcgccggggt caccctgctt cttgccctgc tgtgtgtggc tgcctcccct gggctttctg
1861 ctccctgccc agatcctgta ggcctcatct taggaacccc ttgggagggg tgggcagggg
1921 ggctgctagc aagggtccca gtgaagcttc cccttgccgg cttagctgtg ggggacccct
1981 tctccaccct ctccctgagc acaggccgat ggaggtggtt caaatcctct ggaacatagc
2041 caagaccagg agaagagaga gcactttctt cc
//
LOCUS HUMARF1BA 1816 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human ADP-ribosylation factor (ARF1) mRNA, complete cds.
ACCESSION M36340 M25203
KEYWORDS ADP-ribosylation factor.
SOURCE Human foreskin fibroblast, cDNA to mRNA, clone pCD2.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 29 to 866)
AUTHORS Bobak,D.A., Nightingale,M.S., Murtagh,J.J., Price,S.R., Moss,J. and
Vaughan,M.
TITLE Molecular cloning, characterization, and expression of human
ADP-ribosylation factors: Two guanine nucleotide-dependent
activators of cholera toxin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6101-6105 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1816)
AUTHORS Kahn,R.A., Kern,F.G., Clark,J., Gelmann,E.P. and Rulka,C.
TITLE Human ADP-ribosylation factors (ARF's): A functionally conserved
family of GTP-binding proteins
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer copy of sequence [1] kindly submitted by
D.Bobak, 02-JUN-1989.
Draft entry and computer-readable sequence for [2] kindly submitted
by R.A.Kahn, 10-JUL-1990.
National Cancer Institute
Lab of Biol. Chem.
DCT Bldg. 37, RM 5D-02
Bethesda, MD 20892
FEATURES from to/span description
pept 78 623 ADP-ribosylation factor (ARF1)
BASE COUNT 405 a 512 c 469 g 430 t
ORIGIN
1 ggggaaaacc aacgcctggc tcggagcagc agcctctgag gtgtccctgg ccagtgtcct
61 tccacctgtc cacaagcatg gggaacatct tcgccaacct cttcaagggc ctttttggca
121 aaaaagaaat gcgcatcctc atggtgggcc tggatgctgc agggaagacc acgatcctct
181 acaagcttaa gctgggtgag atcgtgacca ccattcccac cataggcttc aacgtggaaa
241 ccgtggagta caagaacatc agcttcactg tgtgggacgt gggtggccag gacaagatcc
301 ggcccctgtg gcgccactac ttccagaaca cacaaggcct gatcttcgtg gtggacagca
361 atgacagaga gcgtgtgaac gaggcccgtg aggagctcat gaggatgctg gccgaggacg
421 agctccggga tgctgtcctc ctggtgttcg ccaacaagca ggacctcccc aacgccatga
481 atgcggccga gatcacagac aagctggggc tgcactcact acgccacagg aactggtaca
541 ttcaggccac ctgcgccacc agcggcgacg ggctctatga aggactggac tggctgtcca
601 atcagctccg gaaccagaag tgaacgcgac ccccctccct ctcactcctc ttgccctctg
661 ctttactctc atgtggcaaa cgtgcggctc gtggtgtgag tgccagaagc tgcctccgtg
721 gtttggtcac cgtgtgcatc gcaccgtgct gtaaatgtgg cagacgcagc ctgcggccag
781 gctttttatt taatgtaaat agtttttgtt tccaatgagg cagtttctgg tactcctatg
841 caatattact cagctttttt tattgtaaaa agaaaaatca actcactgtt cagtgctgag
901 aggggatgta ggcccatggg cacctggcct ccaggagtcg ctgtgttggg agagccggcc
961 acgcccttgg cttagagctg tgttgaaatc cattttggtg gttggtttta acccaaactc
1021 agtgcatttt ttaaaatagt taagaatcca agtcgagaac acttgaacac acagaaggga
1081 gaccccgcct agcatagatt tgcagttacg gcctggatgc cagtcgccag cccagctgtt
1141 cccctcggga acatgaggtg gtggtggcgc agcagactgc gatcaattct gcatggtcac
1201 agtagagatc cccgcaactc gcttgtcctt gggtcaccct gcattccata gccatgtgct
1261 tgtccctgtg ctcccacggt tcccaggggc caggctggga gcccacagcc accccactat
1321 gccgcaggcc gccctaccca ccttcaggca gcctatggga cgcaggcccc atctgtccct
1381 cggtccgcgt gtggccagag tggtccgtcg tccccaacac tcgtgctcgc tcagacactt
1441 tggcaggatg tctggggcct caccagcagg agcgcgtgca agccgggcag gcggtccacc
1501 tagacccaca gcccctcggg agcaccccac ctctgtgtgt gatgtagctt tctctccctc
1561 agcctgcaag ggtccgattt gccatcgaaa aagacaacct ctactttttt cttttgtatt
1621 ttgataaaca ctgaagctgg agctgttaaa tttatcttgg ggaaacctca gaactggtct
1681 atttggtgtc gtaggaacct cttactgctt tcaatacacg attagtaatc aactgttttg
1741 tatacttgtt ttcagttttc atttcgacaa acaagcactg taattatagc tattagaata
1801 aaatctctta actatt
//
LOCUS HUMARF2A 1529 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human ADP-ribosylation factor 2 (ARF2) mRNA, complete cds.
ACCESSION M36341 M31890
KEYWORDS ADP-ribosylation factor 2.
SOURCE Human breast cancer cell line MDA-MB 231, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 766)
AUTHORS Monaco,L., Murtagh,J.J., Newman,K.B., Tsai,S.-C., Moss,J. and
Vaughan,V.
TITLE Selective amplification of an mRNA and related pseudogene for a
human ADP-ribosylation factor, a guanine nucleotide-dependent
protein activator of cholera toxin
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 66 to 1529)
AUTHORS Kahn,R.A., Kern,F.G., Clark,J., Gelmann,E.P. and Rulka,C.
TITLE Human ADP-ribosylation factors (ARF's): A functionally conserved
family of GTP-binding proteins
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
L.Monaco, 02-FEB-1990.
Draft entry and computer-readable sequence for [2] kindly submitted
by R.A.Kahn, 10-JUL-1990.
National Cancer Institute
Lab of Biol. Chem.
DCT Bldg. 37, RM 5D-02
Bethesda, MD 20892
FEATURES from to/span description
pept 130 672 ADP-ribosylation factor 2 (ARF2)
BASE COUNT 427 a 294 c 295 g 513 t
ORIGIN
1 ctgcctccct ctttcttcct ccgctctttc tcttccctct cgtttagttt gcctggagct
61 tgaaaggaga aagcacgggg tcgccccaaa ccccttctgc ttctgcccat cacaagtgcc
121 actaccgcca tgggcctcac tatctcctcc ctcttctccc gactatttgg caagaagcag
181 atgcgcattt tgatggttgg attggatgct gctggcaaga caaccattct gtataaactg
241 aagttagggg agatagtcac caccattcct accattggtt ttaatgtgga aacagtagaa
301 tataagaaca tttgtttcac agtatgggat gttggtggtc aagatagaat taggcctctc
361 tggaagcatt acttccagaa tacccagggt cttatttttg tggtagatag caacgatcgt
421 gaaagaattc aggaagtagc agatgagctg cagaaaatgc ttctggtaga tgaattgaga
481 gatgcagtgc tgctactttt tgcaaacaaa caggatttgc caaatgctat ggccatcagt
541 gaaatgacag ataaactagg gcttcagtct cttcgtaaca gaacatggta tgttcaagcc
601 acttgtgcaa cacaaggaac tggtctgtat gaaggacttg actggctgtc aaatgagctt
661 tcaaaacgtt aaatgaaatt ggatatctaa ccaaggacat gtttgataaa attggtctag
721 gcttgttaca acaaaattag tttgtatctt ggttattaaa cagtatctgg gactggtttg
781 ggcagaatat taaacttatt ttgttgccaa ttattgttta ccgagtataa tgttgctatt
841 tagcaatgtg cttggtttta aagaaattct ccttgggaaa aaagtatcct cttttaattt
901 tacttcccat aagcgtaaat gcctggacat agctcttgtg aacctttaaa taaattgttt
961 gagtgttttt gagccccaga caaataatgt tttaaagtta tcccttgcta ctttactgat
1021 acctttatca ttcctgagac agtttgctaa tttaaaaatg tagcattcca tttgtattta
1081 tttctctccc ttgccaaaaa gattttctaa tactgcttgt accagccaga gaaagatcca
1141 aaacactact cagctctctt gcactgagga aatttttccc cctacattga ctcctggcct
1201 acatcagcca aacttaacct tggtggggtt tggatttgat agccaattag ttctgtgctg
1261 gttgcaaaga attgatattt agatggtttt taatactcag cagattgtct tcccatattg
1321 tgtctttttt atgttgcatg ttgcttttgt tatcagcctg attttttgct cagtatatga
1381 tagttctgct gatgttttgt ttattgggca gacatatctt cattaagagt ttttggaaaa
1441 ctcatcaaat tcgatgaata cattttcttc ataacccatt tggaattatt cctaataaaa
1501 tgataaaata cgtaaaaaaa aaggaattc
//
LOCUS HUMCDC25HS 2055 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human cdc25Hs mRNA, complete cds.
ACCESSION M34065
KEYWORDS mitotic inducer.
SOURCE Human HeLa cell line D98/AH-2, cDNA to mRNA, clone BSK1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2055)
AUTHORS Sadhu,K., Reed,B.I., Richardson,H. and Russell,P.
TITLE Human homolog of fission yeast cdc25 mitotic inducer is
predominantly expressed in G-2
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5139-5143 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Russell, 08-MAY-1990, for release after publication.
FEATURES from to/span description
pept 211 1632 CDC25Hs ORF
mRNA < 1 2055 cdc25Hs mRNA
BASE COUNT 576 a 472 c 477 g 530 t
ORIGIN
1 caggaagact ctgagtccga cgttggccta cccagtcgga aggcagagct gcaatctagt
61 taactacctc ctttccccta gatttccttt cattctgctc aagtcttcgc ctgtgtccga
121 tccctatcta ctttctctcc tcttgtagca agcctcagac tccaggcttg agctaggttt
181 tgtttttctc ctggtgagaa ttcgaagacc atgtctacgg aactcttctc atccacaaga
241 gaggaaggaa gctctggctc aggacccagt tttaggtcta atcaaaggaa aatgttaaac
301 ctgctcctgg agagagacac ttcctttacc gtctgtccag atgtccctag aactccagtg
361 ggcaaatttc ttggtgattc tgcaaaccta agcattttgt ctggaggaac cccaaaatgt
421 tgcctcgatc tttcgaatct tagcagtggg gagataactg ccactcagct taccacttct
481 gcagaccttg atgaaactgg tcacctggat tcttcaggac ttcaggaagt gcatttagct
541 gggatgaatc atgaccagca cctaatgaaa tgtagcccag cacagcttct ttgtagcact
601 ccgaatggtt tggaccgtgg ccatagaaag agagatgcaa tgtgtagttc atctgcaaat
661 aaagaaaatg acaatggaaa cttggtggac agtgaaatga aatatttggg cagtcccatt
721 actactgttc caaaattgga taaaaatcca aacctaggag aagaccaggc agaagagatt
781 tcagatgaat taatggagtt ttccctgaaa gatcaagaag caaaggtgag cagaagtggc
841 ctatatcgct ccccgtcgat gccagagaac ttgaacaggc caagactgaa gcaggtggaa
901 aaattcaagg acaacacaat accagataaa gttaaaaaaa agtatttttc tggccaagga
961 aagctcagga agggcttatg tttaaagaag acagtctctc tgtgtgacat tactatcact
1021 cagatgctgg aggaagattc taaccagggg cacctgattg gtgatttttc caaggtatgt
1081 gcgctgccaa ccgtgtcagg gaaacaccaa gatctgaagt atgtcaaccc agaaacagtg
1141 gctgccttac tgtcggggaa gttccagggt ctgattgaga agttttatgt cattgattgt
1201 cgctatccat atgagtatct gggaggacac atccagggag ccttaaactt atatagtcag
1261 gaagaactgt ttaacttctt tctgaagaag cccatcgtcc ctttggacac ccagaagaga
1321 ataatcatcg tgttccactg tgaattctcc tcagagaggg gcccccgaat gtgccgctgt
1381 ctgcgtgaag aggacaggtc tctgaaccag tatcctgcat tgtactaccc agagctatat
1441 atccttaaag gcggctacag agacttcttt ccagaatata tggaactgtg tgaaccacag
1501 agctactgcc ctatgcatca tcaggaccac aagactgagt tgctgaggtg tcgaagccag
1561 agcaaagtgc aggaagggga gcggcagctg cgggagcaga ttgcccttct ggtgaaggac
1621 atgagcccat gataacattc cagccactgg ctgctaacaa gtcaccaaaa agacactgca
1681 gaaaccctga gcagaaagag gccttctgga tggccaaacc caagattatt aaaagatgtc
1741 tctgcaaacc aacaggctac caacttgtat ccaggcctgg gaatggatta ggtttcagca
1801 gagctgaaag ctggtggcag agtcctggag ctggctctat aaggcagcct tgagttgcat
1861 agagatttgt attggttcag ggaactctgg cattcctttt cccaactcct catgtcttct
1921 cacaagccag ccaactcttt ctctctgggc ttcgggctat gcaagagcgt tgtctacctt
1981 ctttctttgt attttccttc tttgtttccc cctctttctt ttttaaaaat ggaaaaataa
2041 acactacaga atgag
//
LOCUS HUMDBTG 1401 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human delta-beta-thalassemia gene region DNA.
ACCESSION M36375
KEYWORDS delta-beta-thalassemia.
SOURCE Human 30 yr old japanese female blood cell DNA, clone Jp-thal.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1401)
AUTHORS Shiolawa,S., Yamada,H., Takihara,Y., Matsunaga,E., Ohba,Y.,
Yamamoto,K. and Fukumaki,Y.
TITLE Molecular analysis of japanese delta-beta-thalassemia
JOURNAL Blood 72, 1771-1776 (1988)
STANDARD simple staff_entry
BASE COUNT 390 a 341 c 237 g 433 t
ORIGIN
1 gagctcttta gtttaattag attccattgg tcaatttttg cttttgctgc aattgctttt
61 cacgctttca tcatgaaatc tgtgcccgtg tttatatcat gaatagtatt gccttgattt
121 ttttctaggc tttttatagt ttggggtttt tcatttaagt ctctaatagc tcatagaatc
181 tttcttcaac tcctgcaatc aaagggaaac gtaataggat tcttgtaaca gagggaactc
241 tattagcaaa catgggtcca agaacagctc cccccaccac caccaccacc acccgccccc
301 accatccctg ctcctggagc tggagtgcaa tggcacaatc acctactcct cccaggttca
361 agcaattccc ttgcctcagc ctcccgagta gctaggacta caggtacatg ccaccacacc
421 cagctaattt tttgtatttt agtagagatg gggtttcacc atgttggcca ggatggtctc
481 gatctcctga cctcgtgatc tgcccacttc agcctcccaa agtgctggga ttacaggcgt
541 gagccactgc tcccggccaa taatagcttt ttaaaaaaaa ttctgtgatt cttaaaccag
601 tcttgagttg tcctgcgcta aaactaatga gcaaatatta aactcttctt aaacagttct
661 ctttaagtgt ctctttctcc cccactgaga attcttccat gacagagcct ataacttaaa
721 catccctatt accagactac caggaacatc taaatagaca gccctcacaa caatgcttca
781 cattcattac tatcacaaca ccattattgg gtattaataa ctccccttta tgggtgagaa
841 agcagggtaa gattagataa tcgagtctca gaggaaacat acgacgttct caaattcacc
901 caacaagcca atggcagatt caggaattga gctctgttct tactccaaac gccatgctct
961 cgcaaactcc ttatttaaat ttttttttct ttttttactg tggcatgaac aaacacatgc
1021 tgatgactgt agtctatatt tggaataact attttgaacg ctacaacgat atttttgagt
1081 tgacataacg tatgaaaata cactactgag ttcatgctca catttactct gaatacttcc
1141 gcgtttgttt catatactat ctgaaggtta tacagtacct aatttattta taactgtctt
1201 caatgcacta caacactgac cttgtgcaca gaattcttgg gtctggaagg tgaggttatt
1261 tgatcctgcc ccctgctatt tgaaggatca tcccacacag cagagaatct attaatgcaa
1321 agcaagtgag agagagttat ctcaacctca ctaacttact tgcagaaagg ttctgtacta
1381 ctatagctgc ggaatcaccc c
//
LOCUS HUMHER3A 4975 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human epidermal growth factor receptor (HER3) mRNA, complete cds.
ACCESSION M34309
KEYWORDS HER3 gene; epidermal growth factor receptor.
SOURCE Human cell line MDA-MB-361, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4975)
AUTHORS Plowman,G.D., Whitney,G.S., Neubauer,M.G., Green,J.M.,
McDonald,V.L., Todaro,G.J. and Shoyab,M.
TITLE Molecular cloning and expression of another epidermal growth factor
receptor-related gene
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4905-4909 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.D.Plowman, 16-MAY-1990, for release after publication.
FEATURES from to/span description
pept 199 4227 HER3 protein precursor
sigp 199 255 HER3 signal peptide
matp 256 4224 HER3 protein
BASE COUNT 1195 a 1309 c 1327 g 1144 t
ORIGIN Chromosome 12q13.
1 ctctcacaca cacacacccc tcccctgcca tccctccccg gactccggct ccggctccga
61 ttgcaatttg caacctccgc tgccgtcgcc gcagcagcca ccaattcgcc agcggttcag
121 gtggctcttg cctcgatgtc ctagcctagg ggcccccggg ccggacttgg ctgggctccc
181 ttcaccctct gcggagtcat gagggcgaac gacgctctgc aggtgctggg cttgcttttc
241 agcctggccc ggggctccga ggtgggcaac tctcaggcag tgtgtcctgg gactctgaat
301 ggcctgagtg tgaccggcga tgctgagaac caataccaga cactgtacaa gctctacgag
361 aggtgtgagg tggtgatggg gaaccttgag attgtgctca cgggacacaa tgccgacctc
421 tccttcctgc agtggattcg agaagtgaca ggctatgtcc tcgtggccat gaatgaattc
481 tctactctac cattgcccaa cctccgcgtg gtgcgaggga cccaggtcta cgatgggaag
541 tttgccatct tcgtcatgtt gaactataac accaactcca gccacgctct gcgccagctc
601 cgcttgactc agctcaccga gattctgtca gggggtgttt atattgagaa gaacgataag
661 ctttgtcaca tggacacaat tgactggagg gacatcgtga gggaccgaga tgctgagata
721 gtggtgaagg acaatggcag aagctgtccc ccctgtcatg aggtttgcaa ggggcgatgc
781 tggggtcctg gatcagaaga ctgccagaca ttgaccaaga ccatctgtgc tcctcagtgt
841 aatggtcact gctttgggcc caaccccaac cagtgctgcc atgatgagtg tgccgggggc
901 tgctcaggcc ctcaggacac agactgcttt gcctgccggc acttcaatga cagtggagcc
961 tgtgtacctc gctgtccaca gcctcttgtc tacaacaagc taactttcca gctggaaccc
1021 aatccccaca ccaagtatca gtatggagga gtttgtgtag ccagctgtcc ccataacttt
1081 gtggtggatc aaacatcctg tgtcagggcc tgtcctcctg acaagatgga agtagataaa
1141 aatgggctca agatgtgtga gccttgtggg ggactatgtc ccaaagcctg tgagggaaca
1201 ggctctggga gccgcttcca gactgtggac tcgagcaaca ttgatggatt tgtgaactgc
1261 accaagatcc tgggcaacct ggactttctg atcaccggcc tcaatggaga cccctggcac
1321 aagatccctg ccctggaccc agagaagctc aatgtcttcc ggacagtacg ggagatcaca
1381 ggttacctga acatccagtc ctggccgccc cacatgcaca acttcagtgt tttttccaat
1441 ttgacaacca ttggaggcag aagcctctac aaccggggct tctcattgtt gatcatgaag
1501 aacttgaatg tcacatctct gggcttccga tccctgaagg aaattagtgc tgggcgtatc
1561 tatataagtg ccaataggca gctctgctac caccactctt tgaactggac caaggtgctt
1621 cgggggccta cggaagagcg actagacatc aagcataatc ggccgcgcag agactgcgtg
1681 gcagagggca aagtgtgtga cccactgtgc tcctctgggg gatgctgggg cccaggccct
1741 ggtcagtgct tgtcctgtcg aaattatagc cgaggaggtg tctgtgtgac ccactgcaac
1801 tttctgaatg gggagcctcg agaatttgcc catgaggccg aatgcttctc ctgccacccg
1861 gaatgccaac ccatgggggg cactgccaca tgcaatggct cgggctctga tacttgtgct
1921 caatgtgccc attttcgaga tgggccccac tgtgtgagca gctgccccca tggagtccta
1981 ggtgccaagg gcccaatcta caagtaccca gatgttcaga atgaatgtcg gccctgccat
2041 gagaactgca cccaggggtg taaaggacca gagcttcaag actgtttagg acaaacactg
2101 gtgctgatcg gcaaaaccca tctgacaatg gctttgacag tgatagcagg attggtagtg
2161 attttcatga tgctgggcgg cacttttctc tactggcgtg ggcgccggat tcagaataaa
2221 agggctatga ggcgatactt ggaacggggt gagagcatag agcctctgga ccccagtgag
2281 aaggctaaca aagtcttggc cagaatcttc aaagagacag agctaaggaa gcttaaagtg
2341 cttggctcgg gtgtctttgg aactgtgcac aaaggagtgt ggatccctga gggtgaatca
2401 atcaagattc cagtctgcat taaagtcatt gaggacaaga gtggacggca gagttttcaa
2461 gctgtgacag atcatatgct ggccattggc agcctggacc atgcccacat tgtaaggctg
2521 ctgggactat gcccagggtc atctctgcag cttgtcactc aatatttgcc tctgggttct
2581 ctgctggatc atgtgagaca acaccggggg gcactggggc cacagctgct gctcaactgg
2641 ggagtacaaa ttgccaaggg aatgtactac cttgaggaac atggtatggt gcatagaaac
2701 ctggctgccc gaaacgtgct actcaagtca cccagtcagg ttcaggtggc agattttggt
2761 gtggctgacc tgctgcctcc tgatgataag cagctgctat acagtgaggc caagactcca
2821 attaagtgga tggcccttga gagtatccac tttgggaaat acacacacca gagtgatgtc
2881 tggagctatg gtgtgacagt ttgggagttg atgaccttcg gggcagagcc ctatgcaggg
2941 ctacgattgg ctgaagtacc agacctgcta gagaaggggg agcggttggc acagccccag
3001 atctgcacaa ttgatgtcta catggtgatg gtcaagtgtt ggatgattga tgagaacatt
3061 cgcccaacct ttaaagaact agccaatgag ttcaccagga tggcccgaga cccaccacgg
3121 tatctggtca taaagagaga gagtgggcct ggaatagccc ctgggccaga gccccatggt
3181 ctgacaaaca agaagctaga ggaagtagag ctggagccag aactagacct agacctagac
3241 ttggaagcag aggaggacaa cctggcaacc accacactgg gctccgccct cagcctacca
3301 gttggaacac ttaatcggcc acgtgggagc cagagccttt taagtccatc atctggatac
3361 atgcccatga accagggtaa tcttgggggg tcttgccagg agtctgcagt ttctgggagc
3421 agtgaacggt gcccccgtcc agtctctcta cacccaatgc cacggggatg cctggcatca
3481 gagtcatcag aggggcatgt aacaggctct gaggctgagc tccaggagaa agtgtcaatg
3541 tgtagaagcc ggagcaggag ccggagccca cggccacgcg gagatagcgc ctaccattcc
3601 cagcgccaca gtctgctgac tcctgttacc ccactctccc cacccgggtt agaggaagag
3661 gatgtcaacg gttatgtcat gccagataca cacctcaaag gtactccctc ctcccgggaa
3721 ggcacccttt cttcagtggg tctcagttct gtcctgggta ctgaagaaga agatgaagat
3781 gaggagtatg aatacatgaa ccggaggaga aggcacagtc cacctcatcc ccctaggcca
3841 agttcccttg aggagctggg ttatgagtac atggatgtgg ggtcagacct cagtgcctct
3901 ctgggcagca cacagagttg cccactccac cctgtaccca tcatgcccac tgcaggcaca
3961 actccagatg aagactatga atatatgaat cggcaacgag atggaggtgg tcctgggggt
4021 gattatgcag ccatgggggc ctgcccagca tctgagcaag ggtatgaaga gatgagagct
4081 tttcaggggc ctggacatca ggccccccat gtccattatg cccgcctaaa aactctacgt
4141 agcttagagg ctacagactc tgcctttgat aaccctgatt actggcatag caggcttttc
4201 cccaaggcta atgcccagag aacgtaactc ctgctccctg tggcactcag ggagcattta
4261 atggcagcta gtgcctttag agggtaccgt cttctcccta ttccctctct ctcccaggtc
4321 ccagcccctt ttccccagtc ccagacaatt ccattcaatc tttggaggct tttaaacatt
4381 ttgacacaaa attcttatgg tatgtagcca gctgtgcact ttcttctctt tcccaacccc
4441 aggaaaggtt ttccttattt tgtgtgcttt cccagtccca ttcctcagct tcttcacagg
4501 cactcctgga gatatgaagg attactctcc atatcccttc ctctcaggct cttgactact
4561 tggaactagg ctcttatgtg tgcctttgtt tcccatcaga ctgtcaagaa gaggaaaggg
4621 aggaaaccta gcagaggaaa gtgtaatttt ggtttatgac tcttaacccc ctagaaagac
4681 agaagcttaa aatctgtgaa gaaagaggtt aggagtagat attgattact atcataattc
4741 agcacttaac tatgagccag gcatcatact aaacttcacc tacattatct cacttagtcc
4801 tttatcatcc ttaaaacaat tctgtgacat acatattatc tcattttaca caaagggaag
4861 tcgggcatgg tggctcatgc ctgtaatctc agcactttgg gaggctgagg cagaaggatt
4921 acctgaggca aggagtttga gaccagctta gccaacatag taagaccccc atctc
//
LOCUS HUMMEHA 959 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human microsomal epoxide hydrolase (HmEH) mRNA, partial cds.
ACCESSION M36374
KEYWORDS microsomal epoxide hydrolase.
SOURCE Human liver, cDNA to mRNA, clone HmEH7.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 959)
AUTHORS Craft,J.A., Jackson,M.R. and Burchell,B.
TITLE Partial nucleotide sequence of a cloned cDNA for human liver
microsomal epoxide hydrolase
JOURNAL Biochem. Soc. Trans. 15, 708-709 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 959 microsomal epoxide hydrolase (AA at 3) (EC
3.3.2.3)
BASE COUNT 217 a 272 c 251 g 219 t
ORIGIN
1 cttcagtgct gggctttgcc atctactggt tcatctcccg ggacaaagag gaaactttgc
61 cacttgaaga tgggtggtgg gggccaggca cgaggtccgc agccagggag gacgacagca
121 tccgcccttt caaggtggaa acgtcagatg aggagatcca cgacttacac cagaggatcg
181 ataagttccg tttcacccca cctttggagg acagctgctt ccactatggc ttcaactcca
241 actacctgaa gaaagtcatc tcctactggc ggaatgaatt tgactggaag aagcaggtgg
301 agattctcaa caagtaccct cacttcaaga ctaaaattga agggctggac atccacttca
361 tccacgtgaa gcccccccag ctgcccgcag gccatacccc gaagcccttg ctgatggtga
421 acggctggcc cggctctttc tacgagtttt ataagatcat cccactcctg actgacccca
481 agaaccatgg cctgagcgac gagcacgttt ttgaagtcat ctgcccttcc atccctggct
541 atggcttctc agaggcatcc tccaagaagg ggttcaactc ggtggccacc gccaggatct
601 tttacaagct gatgctgcgg ctgggcttcc aggaattcta cattcaagga ggggactggg
661 ggtccctgat ctgcactaat atggcccagc tggtgcccag ccacctgaaa ggcctgcact
721 tgaacatggc tttggtttta agcaacttct ctaccctgac cctcctcctg ggacagcgtt
781 tcgggaggtt tcttggcctc actgagaggg atgtggagct gctgtacccc gtcaaggaga
841 aggtattcta cagcctgatg agggagagcg gctacatgca catccagtgc accaagcctg
901 acaccgtagg ctctgctctg aatgactctc ctgtgggtct ggctgcctat attctagag
//
LOCUS HUMRNPSMBA 847 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human small nuclear ribonucleoprotein particle SmB mRNA, 3' end.
ACCESSION M34081
KEYWORDS lupus autoantigen; small nuclear ribonucleoprotein particle.
SOURCE Human HeLa cell, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 847)
AUTHORS Elkon,K.B., Hines,J.J., Chu J,-L. and Parnassa,A.P.
TITLE Epitope mapping of recombinant HeLa SmB and B' peptides obtained by
the polymerase chain reaction
JOURNAL J. Immunol. 145, 636-643 (1990)
STANDARD full staff_entry
COMMENT Authorin copy for [1] kindly submitted by K.B.Elkon 23-APR-1990,
for release after publication.
FEATURES from to/span description
pept < 1 675 small nuclear ribonucleoprotein particle (SmB)
mRNA < 1 847 snRNP SmB mRNA
BASE COUNT 170 a 251 c 252 g 174 t
ORIGIN
1 aagatgctgc agcatattga ttacaggatg aggtgcatcc tgcaggacgg ccggatcttc
61 attggcacct tcaaggcttt tgacaagcac atgaatttga tcctctgtga ctgtgatgag
121 ttcagaaaga tcaagccaaa gaactccaaa caagcagaaa gggaagagaa gcgagtcctc
181 ggtctggtgc tgctgcgagg ggagaatctg gtctcaatga cagtagaggg acctcctccc
241 aaagatactg gtattgctcg agttccactt gctggagctg ccgggggccc agggatcggc
301 agggctgctg gcagaggaat cccagctggg gttcccatgc cccaggctcc tgcaggactt
361 gctgggccag tccgtggggt tggcgggcca tcccaacagg tgatgacccc acaaggaaga
421 ggtactgttg cagccgctgc agctgctgcc acagccagta ttgccggggc tccaacccag
481 tacccacctg gccgtggggg tcctccccca cctatgggcc gaggagcacc ccctccaggc
541 atgatgggcc cacctcctgg tatgagacct cctatgggtc ccccaatggg gatcccccct
601 ggaagaggga ctccaatggg catgccccct ccgggaatgc ggcctcctcc ccctgggatg
661 cgaggccttc tttgaccctt ggccacagag tatggaagta gctccgcaga ggcgtgggct
721 cgattcctca gggccacgtt accacagacc tgtttgtttc ttatgctgtt gttcgtggag
781 tctcatggga ttgtctggtt tccttacagg gccccctccc ccgggaatgc gcccaccaag
841 gccctag
//
LOCUS HUMRNPSMBB 702 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human small nuclear ribonucleoprotein particle SmB' mRNA, 3' end.
ACCESSION M34082
KEYWORDS lupus autoantigen; small nuclear ribonucleoprotein particle.
SOURCE Human HeLa cell, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 702)
AUTHORS Elkon,K.B., Hines,J.J., Chu J,-L. and Parnassa,A.P.
TITLE Epitope mapping of recombinant HeLa SmB and B' peptides obtained by
the polymerase chain reaction
JOURNAL J. Immunol. 145, 636-643 (1990)
STANDARD full staff_entry
COMMENT Authorin copy for [1] kindly submitted by K.B.Elkon 23-APR-1990,
for release after publication.
FEATURES from to/span description
pept < 1 702 small nuclear ribonucleoprotein particle
(SmB')
BASE COUNT 147 a 215 c 212 g 128 t
ORIGIN
1 aagatgctgc agcatattga ttacaggatg aggtgcatcc tgcaggacgg ccggatcttc
61 attggcacct tcaaggcttt tgacaagcac atgaatttga tcctctgtga ctgtgatgag
121 ttcagaaaga tcaagccaaa gaactccaaa caagcagaaa gggaagagaa gcgagtcctc
181 ggtctggtgc tgctgcgagg ggagaatctg gtctcaatga cagtagaggg acctcctccc
241 aaagatactg gtattgctcg agttccactt gctggagctg ccgggggccc agggatcggc
301 agggctgctg gcagaggaat cccagctggg gttcccatgc cccaggctcc tgcaggactt
361 gctgggccag tccgtggggt tggcgggcca tcccaacagg tgatgacccc acaaggaaga
421 ggtactgttg cagccgctgc agctgctgcc acagccagta ttgccggggc tccaacccag
481 tacccacctg gccgtggggg tcctccccca cctatgggcc gaggagcacc ccctccaggc
541 atgatgggcc cacctcctgg tatgagacct cctatgggtc ccccaatggg gatcccccct
601 ggaagaggga ctccaatggg catgccccct ccgggaatgc ggcctcctcc ccctgggatg
661 cgagggcccc ctcccccggg aatgcgccca ccaaggccct ag
//
LOCUS HUMTPMYOC 1191 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human skeletal muscle beta-tropomyosin mRNA, complete cds.
ACCESSION X06825 M36268
KEYWORDS actin-binding protein; beta-tropomyosin.
SOURCE Human skeletal muscle, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1191)
AUTHORS Liautard,J.P.
JOURNAL Unpublished (1988) CRBM du CNRS, 34033 Montpelier Cedex, France
STANDARD simple automatic
REFERENCE 2 (bases 1 to 1191)
AUTHORS Widada,J.S., Ferraz,C., Capony,J.-P. and Liautard,J.-P.
TITLE Complete nucleotide sequence of the adult skeletal isoform of human
skeletal muscle beta-tropomyosin
JOURNAL Nucleic Acids Res. 16, 3109-3109 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 107 961 beta-tropomyosin
mRNA < 1 1191 beta-tropomyosin mRNA
BASE COUNT 321 a 320 c 368 g 182 t
ORIGIN
1 cccgctccgt cctcctcgcc tgccaccggt gcacccagtc cgctcaccca gcccagtccg
61 tccggtcctc accgcctgcc ggccggccca ccccccaccg caggccatgg acgccatcaa
121 gaagaagatg cagatgctga agctggacaa ggagaacgcc atcgaccgcg ccgagcaggc
181 cgaagccgac aagaagcaag ctgaggaccg ctgcaagcag ctggaggagg agcagcaggc
241 cctccagaag aagctgaagg ggacagagga tgaggtggaa aagtattctg aatccgtgaa
301 ggaggcccag gagaaactgg agcaggccga gaagaaggcc actgatgctg aggcagatgt
361 ggcctccctg aaccgccgca ttcagctggt tgaggaggag ctggaccggg cccaggagcg
421 cctggctaca gccctgcaga agctggagga ggccgagaag gcggctgatg agagcgagag
481 aggaatgaag gtcatcgaaa accgggccat gaaggatgag gagaagatgg aactgcagga
541 gatgcagctg aaggaggcca agcacatcgc tgaggattca gaccgcaaat atgaagaggt
601 ggccaggaag ctggtgatcc tggaaggaga gctggagcgc tcggaggaga gggctgaggt
661 ggccgagagt aaatgtgggg acctagagga ggagctgaaa attgttacca acaacttgaa
721 atccctggag gcccaggcgg acaagtattc caccaaagaa gataaatatg aagaggagat
781 caaactgttg gaggagaagc tgaaggaggc tgagacccga gcagagtttg ccgagaggtc
841 tgtggcaaag ttggagaaaa ccatcgatga cctagaagat gaagtctatg cccagaagat
901 gaagtacaag gccattagcg aggaactgga caacgcactc aatgacatca cctccctctg
961 agccccacgc ccagcgtgcc acctcagctc tcttctctcc tctcctttcc attctctcta
1021 tggggagggg agagcaggca ggaggagcag aaattgccaa cattgcacag ccaggctggg
1081 agcagcctag ggagagcccc catcatgccc accacccact ctggcactgg cttcatcctt
1141 tacctatccc cttccaccct cctttgcttg cttaataaat tctgaacttg g
//
LOCUS HUMTRHA 1322 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human thyroid releasing hormone (TRH) mRNA, complete cds.
ACCESSION M36317
KEYWORDS thyroid releasing hormone.
SOURCE Human brain, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1322)
AUTHORS Mandel,G. and Goodman,R.H.
TITLE Using the brain to screen cloned genes
JOURNAL Trends Neurosci. 10, 101-104 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 103 870 thyroid releasing hormone precursor (TRH)
sigp 103 174 thyroid releasing hormone signal peptide
matp 175 867 thyroid releasing hormone
BASE COUNT 315 a 344 c 372 g 291 t
ORIGIN
1 tccttggatt cgggagtatt gcaaactcta cccagccagt ttgcactctt cagctcagca
61 tcttggaaag ctctgcagag tctccacttc gcagactcca ggatgccggg accttggttg
121 ctgctggctc tggctttgat cttcacccta actggtatcc ctgaatcctg cgccttgccg
181 gaggcagccc aggaggaagg tgcagtgact cctgaccttc ctggcctgga gaatgttcag
241 gtccggccag aacgtcgatt cttgtggaaa gacctccagc gggtgagagg ggacctcggt
301 gctgccttag actcctggat cacaaaacgc cagcatccag gcaaaaggga ggaggaggaa
361 aaagacattg aagctgaaga gaggggagac ttgggagaag ggggagcctg gagactccac
421 aaacgacagc accccggccg acgtgccaac caggacaagt attcatgggc agatgaggag
481 gacagtgact ggatgccacg gtcctggtta ccagatttct ttctggattc ctggttctca
541 gatgtccccc aagtcaagcg gcagcaccct ggcaggcgat ccttcccctg gatggagtct
601 gatgtcacca agaggcaaca tccaggccgg aggttcatag atcccgagct ccaaagaagc
661 tgggaagaaa aagagggaga gggtgtctta atgcctgaga aacgccagca tcctggcaaa
721 agggcattgg gtcatccctg tgggccccag gggacttgtg gtcaaacagg cctgctccag
781 cttctaggtg acctgagcag gggtcaggag accctggtga agcaaagccc acaagtggaa
841 ccctgggaca aggaacctct ggaggagtaa ggccagagtc aggctttagg tctaggatga
901 tgtaagccct gtattcccta tcctgttccc ttcactagct gtcctctctt agatgctaac
961 cctgggccct ctgtacatcg tccacccaaa ccccttcctt accgacttca gagactttag
1021 aaagccagtc aggaagttaa aaccctactt atcccttcaa gcaagggggt gggagtcaca
1081 ccccttcagc actggccaag atggttcttt cacaccttct aagggacctc ctcagaaagg
1141 aagggtagaa ttgaaatgtt ttggtgttaa aacttctgta atctgcccca tgtggtaaga
1201 ctgacctggt atagcttcag cgcatcctcc aaggttgggg tccctgagca gtttgggaga
1261 tgtttagata tgtccttggg gtggggggac gcatcttgta tataaccctg ctcttgggaa
1321 tc
//
LOCUS MAARNA1A 163 bp ss-RNA VRL 14-AUG-1990
DEFINITION Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-1.
ACCESSION M35975
KEYWORDS RNA-1 leader sequence.
SOURCE Alfalfa mosaic virus (strain AlMV-S) genomic RNA.
ORGANISM Alfalfa mosaic virus
Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE 1 (bases 1 to 163)
AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
TITLE Structure of the 5'-terminal untranslated region of the genomic
RNAs from two strains of alfalfa mosaic virus
JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
pept 103 > 163 115 kDa protein
ORF 92 > 163 ORF1
ORF 107 > 163 ORF2
BASE COUNT 39 a 38 c 24 g 62 t
ORIGIN 5' end of RNA-1.
1 ggtttttatc ttacacacgc ttgtgcaaga tagttaatcc atttattttt tccttgtatt
61 ttccacagta ttacgttcat tcaatactgt gatgaattca ctatgaatgc tgacgcccta
121 tccaccgatt cctgccttag tatgcgagaa cctttatccc atg
//
LOCUS MAARNA1B 115 bp ss-RNA VRL 14-AUG-1990
DEFINITION Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-1.
ACCESSION M35976
KEYWORDS RNA-1 leader sequence.
SOURCE Alfalfa mosaic virus (strain AlMV-B) genomic RNA.
ORGANISM Alfalfa mosaic virus
Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE 1 (bases 1 to 115)
AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
TITLE Structure of the 5'-terminal untranslated region of the genomic
RNAs from two strains of alfalfa mosaic virus
JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
pept 101 > 115 115 kDa protein
ORF 105 > 115 ORF1
BASE COUNT 29 a 25 c 18 g 43 t
ORIGIN 5' end of RNA-1.
1 ggtttttatc ttacacacgc ttgtgcaaga tagttaatcc atttattttc cttgcgcttt
61 ccacagcatt acgttcattc aatactgtga agaattcatt atgaatgctg acgcc
//
LOCUS MAARNA2A 108 bp ss-RNA VRL 14-AUG-1990
DEFINITION Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-2.
ACCESSION M36389
KEYWORDS RNA-2 leader sequence.
SOURCE Alfalfa mosaic virus (strain AlMV-S) genomic RNA.
ORGANISM Alfalfa mosaic virus
Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE 1 (bases 1 to 108)
AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
TITLE Structure of the 5'-terminal untranslated region of the genomic
RNAs from two strains of alfalfa mosaic virus
JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
pept 56 > 108 100 kDa protein
BASE COUNT 26 a 19 c 19 g 44 t
ORIGIN 5' end of RNA-2.
1 ggtttttatc ttttcgcgat tcaaaagata agtttcagag tctaatcttt tcaatatgtt
61 cactcttttg agatgtcttg gatttggcgt caacgacctt accgatat
//
LOCUS MAARNA2B 109 bp ss-RNA VRL 14-AUG-1990
DEFINITION Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-2.
ACCESSION M36390
KEYWORDS RNA-2 leader sequence.
SOURCE Alfalfa mosaic virus (strain AlMV-B) genomic RNA.
ORGANISM Alfalfa mosaic virus
Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE 1 (bases 1 to 109)
AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
TITLE Structure of the 5'-terminal untranslated region of the genomic
RNAs from two strains of alfalfa mosaic virus
JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
pept 57 > 109 100 kDa protein
BASE COUNT 25 a 19 c 20 g 45 t
ORIGIN 5' end of RNA-2.
1 ggtttttatc ttttcgcgat tgaaaagata agtttcctga gtctaatctt ttcaatatgt
61 tcactctttt gagatgtctt ggatttggcg tcaacgacct taccgatat
//
LOCUS MAARNA3B 305 bp ss-RNA VRL 14-AUG-1990
DEFINITION Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-3.
ACCESSION M36391
KEYWORDS RNA-3 leader sequence.
SOURCE Alfalfa mosaic virus (strain AlMV-S) genomic RNA.
ORGANISM Alfalfa mosaic virus
Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE 1 (bases 1 to 305)
AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
TITLE Structure of the 5'-terminal untranslated region of the genomic
RNAs from two strains of alfalfa mosaic virus
JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
pept 259 > 305 35 kDa protein
BASE COUNT 87 a 57 c 48 g 113 t
ORIGIN 5' end of RNA-3.
1 ggttttcatc ttacacacgc ttgtgcaaga tagttaatca ttccaattca actcaattaa
61 cgttttttac agtgtaattc gtacttttcg taagtaagtt tctgtaaaag cgtttcttgt
121 tttaatttgg cctaacacgt aattcgtact cttcgtgagt aagttgtgtt agccatacct
181 atcctttaaa ttcctgtcaa tttaaaaaga aaatcattcc cattcgcgta attcgtactc
241 ttcgtgagta agttgcaaat ggagaataca aaaacaaatg cctcgagttc tggaatgttt
301 tcttc
//
LOCUS MAARNA3C 290 bp ss-RNA VRL 14-AUG-1990
DEFINITION Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-3.
ACCESSION M36392
KEYWORDS RNA-3 leader sequence.
SOURCE Alfalfa mosaic virus (strain AlMV-B) genomic RNA.
ORGANISM Alfalfa mosaic virus
Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE 1 (bases 1 to 290)
AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
TITLE Structure of the 5'-terminal untranslated region of the genomic
RNAs from two strains of alfalfa mosaic virus
JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
pept 244 > 290 35 kDa protein
BASE COUNT 90 a 52 c 41 g 107 t
ORIGIN 5' end of RNA-3.
1 ggttttaata ccattttcaa aatattccaa ttcaactcaa ttaacccttt tacagtgtaa
61 ttcgtacttt tcgtaagtaa gtttctgtaa aagcgtttct tgttttaatt tggcctaaca
121 cgtaattcgt actcttcgtg agtaagttgt gttagccata tctatccttt aaatttttgt
181 caaatttaaa aaagaaaatc attcccattc gcgtaattcg tactcttcgt gagtaagttg
241 caaatggaga atacaaaaac aaatgcctcg agaactggaa tatcttcttc
//
LOCUS MBFRRDA 1476 bp ds-DNA RNA 14-AUG-1990
DEFINITION M.formicicum 16S rRNA.
ACCESSION M36508
KEYWORDS 16S ribosomal RNA.
SOURCE M.formicicum (strain DSM 1312) rRNA.
ORGANISM Methanobacterium formicicum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanobacteriaceae.
REFERENCE 1 (bases 1 to 1476)
AUTHORS Lechner,K., Wich,G. and Boeck,A.
TITLE The nucleotide sequence of the 16S rRNA gene and flanking regions
from Methanobacterium formicicum: The phylogenetic relationship
between methanogenic and halophilic archaebacteria
JOURNAL Syst. Appl. Microbiol. 6, 157-163 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 1476 16S rRNA
BASE COUNT 348 a 355 c 472 g 301 t
ORIGIN
1 agtccgtttg atcctggcgg aggccactgc tattgggttt cgattaagcc atgcaagtcg
61 aacgatcctt cggggtcgtg gcgtacggct cagtaacacg tggataacct aaccttagga
121 ctgggataac cctgggaaac tggggataat accggatatg tagggctgcc tggaatggtt
181 ccctattgaa atgttccgac gcctaaggat ggatctgcgg cagattaggt agttggcggg
241 gtaaatgccc accaagccag taatctgtac gggttgtgag agcaagagcc cggagatgga
301 acctgagaca aggttccagg ccctacgggg cgcagcaggc gcgaaacctc cgcaatgcac
361 gaaagtgcga cgggggaaac ccaagtgcca ctcttaacgg ggtggctttt cttaagtgta
421 aaaagctttt ggaataagag ctgggcaaga ccggtgccag ccgccgcggt aacaccggca
481 gctcaagtgg tggccgtttt tattgggcct aaagcgttcg tagccggctt gataagtctc
541 tggtgaaatc tcacggctta accgtgagaa ttgctggaga tactattagg cttgaggccg
601 ggagaggtta gcggtactcc cggggtaggg gtgaaatcct ataatcccgg gaggaccacc
661 tgtggcgaag gcggctaact ggaacgggcc tgacggtgag taacgaaagc caggggcgcg
721 aaccggatta gatacccggg tagtcctggc cgtaaacgat gtggacttgg tgttgggatg
781 gctccgagct gccccagtgc cgaagggaag ctgttaagtc caccgcctgg gaagtacggt
841 cgcaagactg aaacttaaag gaattggcgg gggagcacca caacgcgtgg agcctgcggt
901 ttaattggat tcaacgccgg acatctcacc aggggcgaca gcagaatgat agccaggttg
961 atgaccttgc ttgacaagct gagaggaggt gcatggccgc cgtcagctcg taccgtgagg
1021 cgtcctgtta agtcaggcaa cgagcgagac ccacgccctt agttaccagc ggatccttcg
1081 ggatgccggg cacactaagg ggaccgccag tgataaactg gaggaaggag tggacgacgg
1141 taggtccgta tgccccgaat cccctgggct acacgcgggc tacaatggtt aggacaatgg
1201 gttccgacac tgaaaggtgg aggtaatctc ctaaacctgg ccttagttcg gattgagggc
1261 tgtaactcgc cctcatgaag ctggaatgcg tagtaatcgc gtgtcataac cgcgcggtga
1321 atacgtccct gctccttgca cacaccgccc gtcacgccac ccaaaaaggg tttggatgag
1381 gccatagtct ttggttatgg tcgaatctag gttctttgag gagggcgaag tcgtaacaag
1441 gtagccgtag gggaacctgc ggctggatca cctcct
//
LOCUS MSHLTRERA 190 bp ss-RNA VRL 14-AUG-1990
DEFINITION Harvey murine sarcoma virus LTR 75-bp direct-repeat enhancer
region.
ACCESSION M26322
KEYWORDS enhancer region; long terminal repeat.
SOURCE Harvey murine sarcoma virus, cDNA to viral RNA, clone pM14-1.
ORGANISM Harvey murine sarcoma virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses; Murine sarcoma viruses.
REFERENCE 1 (bases 1 to 190)
AUTHORS Ostrowski,M.C., Huang,A.L., Kessel,M., Wolford,R.G. and Hager,G.L.
TITLE Modulation of enhancer activity by the hormone responsive
regulatory element from mouse mammary tumor virus
JOURNAL EMBO J. 3, 1891-1899 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
rpt 21 95 75 bp direct-repeat enchancer
rpt 96 170 75 bp direct-repeat enchancer
BASE COUNT 31 a 54 c 48 g 57 t
ORIGIN
1 ggctggaccg catctgggga ccatctgttc ttggccctga gccggggcag gaactgtcta
61 ccacagatat cctgtttggc ccatattcag ctgttctatc tgttcttggc cctggagccg
121 ggcaggaact gtctaccaca gatatcctgt ttggcccata ttcagctgtt tctctgttcc
181 tgaccttgat
//
LOCUS MUSKERDA 1381 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse keratin D mRNA, complete cds.
ACCESSION M36376
KEYWORDS keratin D.
SOURCE Mouse F9 teratocarcinoma stem cell, cDNA to mRNA, clone B53.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1381)
AUTHORS Alonso,A., Weber,T. and Jorcano,J.L.
TITLE Cloning and characterization of keratin D, a murine endodermal
cytoskeletal protein induced during in vitro differentiation of F9
teratocarcinoma cells
JOURNAL Roux's Arch. Dev. Biol. 196, 16-21 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 50 1321 keratin D
mRNA < 1 1381 keratin D mRNA
site 28 42 TCTCC motif
BASE COUNT 366 a 373 c 401 g 241 t
ORIGIN
1 aactcctgtt ctggtctctc gcttcgctct cctctcctct ccagacaaga tgagcttcac
61 aacccgctcc accaccttct ccaccaacta ccggtccctg ggctctgtgc ggactcccag
121 ccagcgggtc cggcctgcca gcagcgcagc cagcgtctat gcaggtgctg ggggctccgg
181 gtcacggata tccgtgtccc gctccgtctg gggtggctct gtggggtccg caggcctggc
241 gggaatgggt ggaatccaga ccgagaagga gaccatgcaa gacctgaacg atcgcctggc
301 cagctaccta gacaaggtga agagtctgga aactgagaac aggagactgg agagcaaaat
361 ccgggaacat ctggagaaga aggggcccca gggcgtcaga gactggggcc actacttcaa
421 gatcatcgaa gacctgaggg ctcagatcct tgcgaattct gtggacaatg cccgcatcgt
481 cttgcagatc gacaatgccc gccttgccgc cgatgacttt agagtcaagt atgagacaga
541 actagccatg cgccagtctg tggagagcga catccatgga ctccgcaagg tggtagatga
601 caccaacatc acaaggctgc agctggagac agaaatcgag gcactcaagg aagaacttct
661 gttcatgaag aagaatcatg aagaggaagt ccaaggtctg gaagcccaga ttgccagctc
721 tggattgact gtggaagtgg atgcccccaa atctcaggac ctcagcaaga tcatggcgaa
781 catccgcgcc cagtatgaag cgctgggcca gaagaaccgc gaggaactgg acaagtactg
841 gtctcagcag attgaggaga gtaccacagt tgtcaccacc aagtctgccg aaatcaggga
901 cgctgagacc acactcacgg agctgagacg caccctccag accttggaga ttgacttgga
961 ctccatgaaa aaccagaaca tcaacttgga gaacagcctc ggggatgtgg aggcccgata
1021 caaggcacag atggagcagc tcaatggggt ccttctgcat ctggagtcag agctggcaca
1081 aactcgggca gagggccagc gccaggccca ggaatatgaa gccctcttga acatcaaggt
1141 gaagcttgag gcagagattg ccacctaccg ccgcttgctg gaggatggag aagatttcag
1201 tctcaacgat gccctagact cgagcaactc catgcaaact gtgcagaaga caactacccg
1261 taagatcgtg gatggcagag tggtgtccga gactaatgac accagagttc tgaggcactg
1321 aggcagagaa ggagggaacc cctaggaact gagggaccaa taaaagttga gagctcactg
1381 g
//
LOCUS MUSMBPMLD1 792 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse myelin basic protein (shi-mld) gene, exon 1.
ACCESSION M36273 M36274
KEYWORDS myelin basic protein; shi-mld gene.
SEGMENT 1 of 2
SOURCE Mouse mld mutated DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 792)
AUTHORS Okano,H., Tamura,T., Miura,M., Aoyama,A., Ikenaka,K.,
Oshimura,M. and Mikoshiba,K.
TITLE Gene organization and transcription of duplicated MBP genes of
myelin deficient (shi-mld) mutant mouse
JOURNAL EMBO J. 7, 77-83 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
mRNA 713 > 792 shi-mld mRNA
signal 623 628 GC box
signal 630 634 CAAT box
signal 680 685 TATA box
BASE COUNT 194 a 234 c 206 g 158 t
ORIGIN Chromosome 18.
1 cctccagcgg aatccgggag agggattgct cagtgccctg cttccctgga ctgtaagctg
61 cagaaagatg tgggaagtcc tgttctccac tgagaacact aaaagcacct tttgtcaaac
121 gaccgcttca catctggggc ttgtgcactg gtggcctttt aaaccagaga caacccacaa
181 gatacctaac ctgcggggct ctctggtaca gtgagcaact caggaaatgc tttggcttga
241 ttgctgtggg ctctcaggcc atcgccctct ggagtggttc ttttaatgag aacctgaaga
301 ttggcccctg agccatgtat accaagcaag ctcaatccag gttagctccc tctggttggg
361 gcaagctaac gtgctccttg ggccccgcgc gtaactgtgc gttttatagg agacagctag
421 ttcaagaccc caggaagaaa gcggctttgt ccccctctag gcctcgtaca ggcccacatt
481 catatctcat tgttgttgca ggggaggcag atgcgatcca gaacaatggg acctcggctg
541 aggacacggc ggtgacagac tccaagcaca cagcagaccc aaagaataac tggcaaggcg
601 cccacccagc tgacccaggg aaccgccccc acttgatccg cctcttttcc cgagatgccc
661 cgggaaggag gacaacacct tcaaagacag gccctcagag tccgacgagc ttcagaccat
721 ccaagaagac cccacagcag cttccggagg cctggatgtc atggcatcac agaagagacc
781 ctcacagcga tc
//
LOCUS MUSMBPMLD2 200 bp ds-DNA ROD 14-AUG-1990
DEFINITION Figure 7. Nucleotide sequences near the inverted exon 3.
ACCESSION M36275
KEYWORDS myelin basic protein; shi-mld gene.
SEGMENT 2 of 2
SOURCE Mouse mld mutated DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 200)
AUTHORS Okano,H., Tamura,T., Miura,M., Aoyama,A., Ikenaka,K.,
Oshimura,M. and Mikoshiba,K.
TITLE Gene organization and transcription of duplicated MBP genes of
myelin deficient (shi-mld) mutant mouse
JOURNAL EMBO J. 7, 77-83 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept / 43 / 144 shi-mld myelin basic protein, exon 3 (AA at
43)
pre-msg < 1 > 200 shi-mld mRNA and introns
IVS < 1 42 shi-mld intron B
IVS 145 > 200 shi-mld intron C
BASE COUNT 54 a 62 c 40 g 44 t
ORIGIN Chromosome 18; about 11 kbp after segment 1.
1 ctacgccctc tccatcctca gctgctcgct tctctctttc aggactcaca cacgagaact
61 acccattatg gctccctgcc ccataattcg cagcacggcc ggacccaaga tgaaaaccca
121 gtagtccatt tcttcaagaa cattgtaagt gactgtgcac aggggaacca agaaatcatc
181 aagggaaagg ttgtggatcc
//
LOCUS MUSMIGP4A 1284 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse monokine induced by gamma interferon (MIG) mRNA, complete
cds.
ACCESSION M34815
KEYWORDS MIG protein; gamma interferon-induced;
monokine induced by gamma interferon;
platelet factor 4-related protein; secreted protein.
SOURCE Mouse (strain BAB/14) macrophage cell line RAW 264.7, cDNA to mRNA,
clone 119/24.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1284)
AUTHORS Farber,J.M.
TITLE A macrophage mRNA selectively induced by gamma-interferon encodes a
new member of the platelet factor 4 family of cytokines
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5238-5242 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.M.Farber, 04-JUN-1990, for release after publication.
FEATURES from to/span description
pept 95 475 gamma interferon induced monokine precursor
(MIG)
sigp 95 157 gamma interferon induced monokine signal
peptide
matp 158 472 gamma interferon induced monokine
signal 6 16 TATA region
BASE COUNT 401 a 316 c 247 g 320 t
ORIGIN
1 tttcctaaat aaatatgatc cccaagaaca tgctctctaa agacattctc ggacttcact
61 ccaacacagt gactcaatag aactcagctc tgccatgaag tccgctgttc ttttcctttt
121 gggcatcatc ttcctggagc agtgtggagt tcgaggaacc ctagtgataa ggaatgcacg
181 atgctcctgc atcagcacca gccgaggcac gatccactac aaatccctca aagacctcaa
241 acagtttgcc ccaagcccca attgcaacaa aactgaaatc attgctacac tgaagaacgg
301 agatcaaacc tgcctagatc cggactcggc aaatgtgaag aagctgatga aagaatggga
361 aaagaagatc aaccaaaaga aaaagcaaaa gagggggaaa aaacatcaaa agaacatgaa
421 aaacagaaaa cccaaaacac cccaaagtcg tcgtcgttca aggaagacta cataagagac
481 cattacttta ccaacaagca ccctgaatct taatgggttt tagattgtac tgaaaagcct
541 tccctggcag agcagccttt aatacatagg cttttaatac attaactcaa ctacaaaaca
601 taaagtgtta atttgaaatt ataactaact ttaggaagtt aattgcaaaa ctccaatagt
661 aacaattgct agaggcaaaa actctgtgtt ctacacagcc aacaaaattt catcacgccc
721 ttgagcctag tcgtgataac atcagatctg ggcaagtgtc cctttccttc atagctatcc
781 aatgcacaac agctgtctgg cttccagagc cacacatttg gcagcctccg gagacttctg
841 aggctcacgt caccaagtcc caggcctgtc tgtttgctgg tgagctagat agacctcacc
901 aagctggaga ggccctcggc agctgcattt gggtcagcct agagcccctg cacacattgt
961 gtctcagaga tggtgctaat ggttttgggg ttctacagtg gagaccacca gagttggcct
1021 tcagaacctc ccacgtagct ttcgagacca tgggatttca ttattaactt gatcccatct
1081 tcagagctta ttctaagttt gcctcttcaa taaaactctc ctagaaggtt gtggctgtag
1141 cttagtggca gaacacttgg tgttgcaggg accaggtcct tcactaacag tgcaaaaact
1201 taaccaattt aaagaacatt ttctggctac tcaaattctc ttaaatttat tcctgtttca
1261 caagtaaaca cttcgctgct atct
//
LOCUS MUSZFPB1 561 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse zinc finger (Zfp-35) gene, exon 1.
ACCESSION J04770
KEYWORDS zinc finger protein.
SEGMENT 1 of 3
SOURCE Mouse (strain 129) cosmid library pcos2EMBL DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 561)
AUTHORS Cunliffe,V., Williams,S. and Trowsdale,J.
TITLE Genomic analysis of a mouse zinc finger gene Zfp-35, which is
unregulated during spermatogenesis
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
Cunliffe,V., 06-JUN-1990.
FEATURES from to/span description
pre-msg 233 > 561 zinc finger protein (Zfp-35) mRNA and introns
IVS 286 > 561 Zfp-35 intron A
BASE COUNT 141 a 135 c 149 g 136 t
ORIGIN Chromosome 18 bands B3-C.
1 ccctagaagt tttgacatcg aaccacagca aagataagtt tgagtcaagc gggtccccct
61 atgctaaagc tgggagtcgc tttagggaac ttcctgttcc taattcacag agcgcgtact
121 tgggagggtg caccatagag ctgtgatcct cccacattcc tagagcggcc aagcgaggtc
181 gcattgcatc caaacggatg ttgaaggaat tgtgggtggc ggcgtttgac cggtagtttc
241 ttcatcgtcg gagagtaagg gaggcttctt gggcatggag agcaggtggg ttcttggcct
301 agtgttctgt ggacagcaag ttcgtgctcc ctgggccttt cattcaactg catgaacgtc
361 cgagatgttg gggctggtgg aataacgaag aggatgagcc cacactaatc agcctgcaga
421 gctggaaaag agatacctga cttgggttag aatcgaagga acgagagaga ataacatact
481 cttaattttt aaagccagta ttaccccccc cccccccaca cacacacacc gaattatcgt
541 tccatttacg cctctacttt a
//
LOCUS MUSZFPB2 340 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse zinc finger (Zfp-35) gene, exon 2.
ACCESSION M36145
KEYWORDS zinc finger protein.
SEGMENT 2 of 3
SOURCE Mouse (strain 129) cosmid library pcos2EMBL DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 340)
AUTHORS Cunliffe,V., Williams,S. and Trowsdale,J.
TITLE Genomic analysis of a mouse zinc finger gene Zfp-35, which is
unregulated during spermatogenesis
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
V,Cunliffe, 06-JUN-1990.
FEATURES from to/span description
pept 231 + 279 zinc finger protein (Zfp-35)
pre-msg < 1 > 340 Zfp-35 mRNA and introns
IVS < 1 66 Zfp-35 intron A
IVS 280 > 340 Zfp-35 intron B
BASE COUNT 90 a 79 c 66 g 105 t
ORIGIN Chromosome 18 bands B3-C; unknown number of bps after segment 1.
1 gtcaccaaga ggaaacgcac atgaaagtta aatatatttt ttctccttta cgtttttcat
61 atctagaaac tctagtctca gaaatgacat ctctcctctg tacaatggtt ctcatctgag
121 ctcagggcta ccaagttgcc tgctggcttc aaaccattat gtggacaaaa ggatcctaca
181 gaagaaatgg taattctcca gttgcttcct gaggtgtcca gttgtgccct atggagatcc
241 aattcagtta cgaatctcaa gaccatcact ttctgtcagg tgagcctcag tgcatttcag
301 gttacgtggg tgatagcagt attcttcact gtctacactc
//
LOCUS MUSZFPB3 2268 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse zinc finger (Zfp-35) gene, exon 3.
ACCESSION M36146
KEYWORDS zinc finger protein.
SEGMENT 3 of 3
SOURCE Mouse (strain 129) cosmid library pcos2EMBL DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2268)
AUTHORS Cunliffe,V., Williams,S. and Trowsdale,J.
TITLE Genomic analysis of a mouse zinc finger gene Zfp-35, which is
unregulated during spermatogenesis
JOURNAL Genomics (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
V,Cunliffe, 06-JUN-1990.
FEATURES from to/span description
pept + 87 1780 zinc finger protein (Zfp-35)
pre-msg < 1 2166 Zfp-35 mRNA and introns
IVS < 1 86 Zfp-35 intron B
signal 2125 2130 polyA signal
signal 2148 2153 polyA signal
BASE COUNT 685 a 488 c 526 g 569 t
ORIGIN Chromosome 18 bands B3-C; unknown number of bps after segment 2.
1 ctgcagggaa atactcattt gttctacctg gagctttctt acaacatgga gcgcattttc
61 catttagatt ttgtttatta ttccagatgg tgagactaag atcaagattg gagagccagc
121 tacagaggag gaaatgacag gaaagattgg aacagtgact gaagagtccg gcagccttga
181 ggaggatgtt ccccatgact ccagaggcaa agaattccgt gaatttggag aagaattgaa
241 cgatcagatg cttttcagaa gaagacagta taactgtgat gaatgtgacc aaagctttgc
301 ttggagtaca ggtctcatta ggcaccaaag aactcattgg aaaccttacg aatgtgagga
361 atgtggaaag gcctttcgga tgagctcagc cctggttctg catcagagaa ttcatactgg
421 agagaagccc tatccttgta gttggtgtat taaaagtttt agtcggagct cagaccttat
481 taaacatcaa agagtccaca ctggtgaaaa accttacaag tgtgatgagt gtggcaaggc
541 cttcagtcag agctcagatc ttatgataca tcagagaatc cacacaggag aaaaacccta
601 ccaatgtagt cattgtagta aaagctttag ccagcactca ggcatggtta aacatctgag
661 aatccacact ggagagaagc cttatatgtg taaccattgt tacaaacatt tcagtcagag
721 ctctgatctt ataaaacatc aaaggatcca cactggggag aaaccataca agtgtgatgt
781 gtgtgggaag gcctttagtc agagctctga tcgtattctc catcagagaa tccacactgg
841 ggagaagcca tatccgtgtg ctcagtgtaa caaaagtttt agtcaaaatt cagaccttat
901 taaacacaga aggatccaca ctggagagaa accgtataaa tgtagtgaat gtgggaaggc
961 ttttaaccag agctcagtcc ttattctgca tcaaagaatt cataccggag agaaacccta
1021 tccgtgtaac cagtgtacca aaagctttag taggctttca gatcttatta atcatcaacg
1081 aattcacacc ggagagaagc cttacccatg tagtcagtgc agtaaaatgt ttagtagaag
1141 gtcacacctt gttaaacatt acagaatcca cactggggag aagccctatg agtgtgataa
1201 gtgtggcaaa actttcagtc agagttccaa ccttattctc caccagagaa tccacactgg
1261 agagaaacca tacccgtgca acagctgttc taaaagcttc agtcgcggtt cagatctcat
1321 aaagcatcag agggtccaca ctggagaaaa accatacacg tgcaatctgt gcagtaagag
1381 tttcagtcaa agttcagacc tcactaaaca tcagagagtg cattctgggg agaagcccta
1441 ccactgtagt agttgtaaca aagcctttcg tcagagttct gaccttattc tccaccacag
1501 agttcacacg ggagaaagac catatgcgtg tacacagtgc cctagaagtt tcagtcaaaa
1561 gtcagacctg attaaacacc agagaattca cactggagag aagccatata aatgtatgtg
1621 tgggaaggct ttcagtcagt gctcagcctt tacccttcat cagagaatcc acactggaga
1681 gaaaccgtat ccctgtgctc agtgtggcaa aagcttcagt cagcgctctg atctggttaa
1741 ccatcaaaga gttcacgctg atcaaaaact acaaatgtga ctgatacctg tgtgaagcct
1801 tcagcactgt gaccaatctt actggatatc agaggatctg caggagccgt ttgcacctct
1861 ctgaaacgag aaaaaggaag agtgcctccc aggcggaatc aaagcaggtc agttggcaga
1921 gttcccggct tgctgatgca tggaacaatg agaagaggtg tgcttgaggc tcaagtcaag
1981 ggctgaatct tctgttgacc ttggtggaag gaacggggct tactggatgg gaacagatgt
2041 cttgtgtgta tattgcagca aattgttaat actggcttgt ggctttgctc cccatggtac
2101 tgtagtataa aaaaggctat gagaaataca ctcactgctg gcgtggtaat gacctgcagc
2161 cctcccatgt cagtctcttg catcgttttt ctgtctttcc tataatcatc ctcacttccc
2221 cctcaggggc tgttcgactt cacgccggcc ggactagttc aaggatcc
//
LOCUS PP1PAR1 285 bp ds-DNA PHG 14-AUG-1990
DEFINITION Bacteriophage miniplasmid P1 parA gene, 5' end.
ACCESSION M36424
KEYWORDS centromeric-like DNA.
SEGMENT 1 of 2
SOURCE Bacteriophage P1 plasmid prophage DNA, from E.coli.
ORGANISM Bacteriophage P1
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 285)
AUTHORS Austin,S. and Abeles,A.
TITLE The partition functions of P1, P7, and F miniplasmids
JOURNAL Basic Life Sci. 30, 215-226 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept 232 > 285 parA protein
binding 218 222 parA ribosomal binding site
signal 149 154 parA -35 region
signal 171 176 parA -10 region
BASE COUNT 98 a 57 c 60 g 70 t
ORIGIN
1 aagcttggct gcccattgaa gcaggaaatc accaaaatga ttcaggctac aacctgaacg
61 tagaagaaat ccgcgtcctt tatgcgtgga ggatgccaaa gcatgttgtg acacacttgg
121 caaaggagta agcatgcaga gaatgctatg tacaagcatc tacgcataca ttattatttt
181 atgcagcatt tttaattaaa ttcaaaaata cagcataaag gatgactttc gatgagtgat
241 tccagccagc ttcacaaggt tgctcaaaga gcaaacagaa tgctc
//
LOCUS PP1PAR2 178 bp ds-DNA PHG 14-AUG-1990
DEFINITION Bacteriophage miniplasmid P1 parB gene, 3' end.
ACCESSION M36425
KEYWORDS centromeric-like DNA.
SEGMENT 2 of 2
SOURCE Bacteriophage P1 plasmid prophage DNA, clone pALA161, from E.coli.
ORGANISM Bacteriophage P1
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 178)
AUTHORS Austin,S. and Abeles,A.
TITLE The partition functions of P1, P7, and F miniplasmids
JOURNAL Basic Life Sci. 30, 215-226 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 51 parB protein
BASE COUNT 62 a 42 c 21 g 53 t
ORIGIN 2150 bp after segment 1.
1 tcgataaaaa gccgaagcct taaactttcg ccattcaaat ttcactatta actgactgtt
61 tttaaagtaa attactctaa aatttcaagg tgaaatcgcc acgatttcac cttggatttt
121 accttcctcc cctcctcccg aaaaaaataa aaaaattgct tgtcacgaga aagtcaac
//
LOCUS PP7C4RNA 612 bp ds-DNA PHG 14-AUG-1990
DEFINITION Bacteriophage P7 c4-RNA (c4, orfx, ant1) gene, 5' end.
ACCESSION M35139
KEYWORDS antirepressor protein; antisense RNA; c4 repressor.
SOURCE Bacteriophage P7 (strain c1.48ts) DNA, from E.coli (strain C600).
ORGANISM Bacteriophage P7
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 612)
AUTHORS Citron,M. and Schuster,H.
TITLE The c4 repressors of bacteriophages P1 and P7 are antisense RNAs
JOURNAL Cell (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Schuster, 06-JUL-1990.
FEATURES from to/span description
pept 362 583 orfx
pept 580 > 612 ant1 protein
binding 346 354 orfx ribosomal binding site
mut 278 278 c in P7c1.48ts; t in P7c4.2ts
signal 46 74 promoter 51a
signal 86 114 promoter 51b
site 31 47 Op 51
site 146 152 presumptive c4 target site a1
site 160 167 presumptive c4 target site b1
site 271 278 c4 element b'
site 285 291 c4 element a'
site 315 321 c4 target site a2
site 344 351 c4 target site b2
BASE COUNT 147 a 158 c 128 g 179 t
ORIGIN
1 acaaataata tatctattgc acgacactaa tctgctctaa taaatttgta tttttaagtc
61 acgaatgcta tcttttcgca tcatattgac cttttaatcg ttcaggctta tagttccgcc
121 gtcgtagcaa attctgcgac cgggtttagc agcctgaatg attgtgcgga cagccgcaga
181 tatccgatat tgcggtattt ttgtgtccgt aaaaccgcgt tacgcccaaa ttatggtggg
241 gcgtgatggg gaggcttcgg cctgctggtt tcacaatcgc cagtctgcta accccgtcac
301 gtcctgccac ctgtttagca gcgggtagca ggttgttaaa cctgattgtg aggccgtaac
361 tatggttaat gccaatcctt gcgcacgcca agaattcatc tggcgtttct attcctgtaa
421 aaaacaccac tatcacttcg tcatcgcagc aacagaagac gaagcacgct ctcaattgcc
481 tgatggcccc tgcattttta ctgcccgttt ttcaactaac tcgcgcaatt cacttagtta
541 ctggaacctc cccttctctg ccgacgttca ggggggttta tgaaaaaacc tctcgtcacc
601 cgtaatgaca ta
//
LOCUS PSEPGLAA 1986 bp ds-DNA BCT 14-AUG-1990
DEFINITION P.solanacearum polygalacturonase A (PGA, pglA) gene, complete cds.
ACCESSION M33692
KEYWORDS alpha-1,4-endopolygalacturonase A.
SOURCE P.solanacearum Race 1 (strain AW) DNA, clone pJE8.
ORGANISM Pseudomonas solanacearum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1986)
AUTHORS Huang,J. and Schell,M.A.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 1715)
AUTHORS Huang,J. and Schell,M.A.
TITLE DNA sequence analysis of pglA and mechanism of export of its
polygalacturonase product from Pseudomonas solanacearum
JOURNAL J. Bacteriol. 172, 3879-3887 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.A.Schell, 13-APR-1990, for release after publication.
FEATURES from to/span description
pept 100 1689 polygalacturonase A precursor (pglA)
sigp 100 162 polygalacturonase A signal peptide
matp 163 1686 polygalacturonase A
mRNA 58 > 1986 pglA mRNA
signal 27 59 promoter region
signal 27 31 -30 Pseudomonas promoter consensus
binding 89 92 pglA ribosomal binding site (put.)
BASE COUNT 351 a 685 c 616 g 334 t
ORIGIN
1 ctgcagcacg cccttcgcgc agacgaaggc gtgccgctgc ttcgctgatc ccggcgcatt
61 gcgtcctcac aaaagacctc tcttttcgga gaattgacct tgaatcaccg ctacacactc
121 ctcgcactcg ccgccgcagc cctgtctgcc ggagcgcacg caaccggtac gtccgtcacg
181 gccccgtggg gcgaggtggc cgagcccagc ctgccggcgg attcggcggt ctgcaagacg
241 ctgtcggcct cgatcacgcc gatcaaaggc tccgtcgatt cggtggacgg caaccccgcc
301 aactcccagc cggacgcgag ccgcatccag tccgccatcg acaactgccc ggccggccag
361 gccgtcaagc tcgtgaaggg cagcgccggc gaatccggct tcctgagcgg ctcgctgaag
421 ctcaagtcgg gcgtgacgct gtggatcgac accggcgtga ccctgttcgc ctcgcgcaac
481 ccggccgact acgacaatgg cctgggcacc tgcggcaccg ccaccaccag caacgacaag
541 tcgtgcaacg cgctgatcgt ggcgcgcgat acggcgggca gcggcatcgt cggcgccggc
601 gccatcgatg gccgcggcgg cagcctggtc accagcggcc cgaacgcgaa ccggctcacc
661 tggtgggaca tcgcttacct gaacaagacc aaggggctga accagcagaa cccgcgcctg
721 atccagacgt ataacggcag cgcgttcacg ctgtacggcg tgacagtgca gaactcgccc
781 aacttccata tcgtcaccac cggcacgtcg ggcgtaacgg catggggcat caagatcgtg
841 acgccgagcc tggcctacgc cgtggcgggc tacaagtgcc cgtccggctc cacgcccgac
901 aaggtcacgc ccgcgacctg cttcacgccc gagacggtca agaacaccga cggcttcgac
961 ccggggcaat cgaccaacgt ggtgctggcc tattcctaca tcaacacggg tgacgaccac
1021 gtcgcggtca aggccagcag cggcccgacg cgcaacctgc tcttcgcgca caaccatttc
1081 tactatgggc acggcctgtc gatcggcagc gaaaccaata ccggcgtgag caacatgctg
1141 gtgaccgacc tgacgatgga cggcaacgac agctcggccg gcaacggcct gcgcatcaag
1201 tccgatgcct cgcgcggcgg caaggtcacc aacatcgtct acgacggcat ctgcatgcgc
1261 aacgtcaagg agccgctggt gttcgacccc ttctacagca gcgtgaaggg ttcgctctat
1321 ccgaacttca ccaacatcgt cgtcaagaac ttccatgacc tgggcagcgc caagtccatc
1381 aagcgcacga tgacgttcct gggctacaaa gccaacaagc agaagaaccc gctgacgatc
1441 acgctcgaca acgtggtctt cgacggcacg ctgccggcct tcgaggggtc ccactacggc
1501 gggccggctt cgcccaacgg ggtgcacttc accttcggcg gcaccggtcc ggtgagcttt
1561 gccgacgcca tcgtcacgtc gtcgaccacc gatgtgacgg tgaccggcac gccgggcacg
1621 gccgcggcgg tggattgcag caaggccttc gtgccgctga agtccgtggc accgacctcg
1681 ccgatctgat cgggcgactc cgctcagcgg tacccgttct gccgcttgta ctggcggatg
1741 gcttcgtcgt ggttggggtc ctcgtgtgca ttgtgcgcgc gctgatcttc gaactcacgg
1801 atggcattgg cctgagcctc cgtcagtttg ggcagcggcg gcttgcgttg gggtgacggg
1861 gtggtgctgt acgtcaccgg ctggctgtcg gcggagttcg tcttcgtagg cgatgtggtg
1921 tccgcccttt tcaccgtggt agtcgttgcg cgcttcggtc tcgcggtcca gttgggtccg
1981 ctgcag
//
LOCUS RATCATHHA 1362 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Rat cathepsin H mRNA, complete cds.
ACCESSION M36320 Y00708
KEYWORDS cathepsin H; cysteine protease.
SOURCE Rat (strain Wistar) kidney, cDNA to mRNA, clone lambda-H44.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1362)
AUTHORS Ishidoh,K.
JOURNAL Unpublished (1987) Tokyo Metro Inst of Med Sci,Tokyo 113, Japan
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 1362)
AUTHORS Ishidoh,K., Imajoh,S., Emori,Y., Ohno,D., Kawasaki,H., Minami,Y.,
Kominami,E., Katunuma,N. and Suzuki,K.
TITLE Molecular cloning and sequencing of cDNA for rat cathepsin H
JOURNAL FEBS Lett. 226, 33-37 (1987)
STANDARD simple staff_entry
COMMENT EMBL features not translated to GenBank features:
key from to description
SITE 873 874 processing site
SITE 211 213 pot. glycosylation site
SITE 298 301 pot. glycosylation site
SITE 685 688 pot. glycosylation site
FEATURES from to/span description
pept 4 1005 cathepsin H pre-pro-peptide
sigp 4 342 cathepsin H signal peptide
matp 343 1002 cathepsin H
mRNA < 1 1362 cathepsin H mRNA
signal 1320 1325 poly A signal
BASE COUNT 369 a 341 c 343 g 309 t
ORIGIN
1 gcgatgtgga ctgcgctgcc cctgctgtgc gctggggcct ggctgctgag tgctggggcc
61 accgctgagc tgaccgtgaa tgccatagaa aagtttcact ttacgtcatg gatgaaacag
121 catcaaaaga cgtacagctc gagggagtac agccacaggc tgcaggtgtt tgccaacaac
181 tggaggaaga ttcaagccca caaccagagg aaccacacat ttaaaatggg attgaaccag
241 ttttcagata tgagctttgc cgaaataaaa cacaaatacc tctggtcaga gcctcagaat
301 tgctcagcca ccaaaagtaa ctacctccgt ggtactggcc cctacccatc ctccatggac
361 tggaggaaga aaggaaatgt cgtttcacca gtgaagaacc agggggcctg tggcagctgc
421 tggactttct caaccactgg ggccctagag tcagctgtgg ctattgccag tgggaaaatg
481 atgaccttgg ctgagcagca gctggtggat tgtgcccaga acttcaacaa tcatggctgc
541 caaggaggtc tccccagcca ggccttcgag tacatcctgt acaacaaggg catcatggga
601 gaggacagct acccttacat aggcaagaat ggtcagtgca aattcaaccc agaaaaggcc
661 gtcgcgttcg tcaagaatgt tgtcaacatc acactcaatg atgaggctgc aatggtagag
721 gctgtggccc tatacaatcc tgtgagcttt gcctttgagg tgactgaaga ttttatgatg
781 tataaaagtg gtgtctactc cagtaactcc tgtcataaaa ctccagataa agtaaaccat
841 gcagtcctgg ctgttggcta tggagaacag aatggattac tctactggat tgtgaaaaac
901 tcttggggct ccaactgggg gaacaatggg tacttcctca ttgagcgtgg aaagaacatg
961 tgtggcctgg ctgcctgtgc ctcctacccc atccctcagg tataagccac ggctgcacag
1021 gccaactgct tggcagacaa agggaggaac tggtcctacg atgagaatgc cgccctggag
1081 aaagttgttc agaaatccac ccagaggccc tctcactcct gagtctagac gcctaaagac
1141 aagtaaggaa gaacttgacc agcaacaagc ccgcccatgt gacgacatca ccagccatac
1201 gctttgtttg aatatggttt ttaaatgacc caaaaccatg tggacctaga atcttctctt
1261 ttcagctctc ttcatatagg gaaagctccg atggcttacc ttttctatgt tgtatattca
1321 ataaatgcag tgaacgcctg cctgggctgg gcctagaccc ct
//
LOCUS RATDRTKCA 471 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Rat delayed-rectifier-type K+ channel mRNA, complete cds.
ACCESSION M36461
KEYWORDS delayed-rectifier-type K+ channel.
SOURCE Rat neonatal heart and ovariectomized, diethylstilbestrol-primed
uterus, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 471)
AUTHORS Folander,K., Smith,J.S., Antanavage,J., Bennett,C., Stein,R.B. and
Swanson,R.
TITLE Cloning and expression of the delayed-rectifier I-s-K channel from
neonatal rat heart and diethylstilbestrol-primed rat uterus
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2975-2979 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 41 433 delayed-rectifier-type K+ channel
BASE COUNT 110 a 132 c 122 g 107 t
ORIGIN
1 agagttttgc tccacatcag ggaaaccgtg aagccccagg atggccctgt ccaattccac
61 aactgttctg ccttttctgg cctcgctgtg gcaggaaaca gatgagccgg gtggcaatat
121 gtcggcggac ttggctcgta ggtcccagct ccgagatgac agcaagctgg aggctctcta
181 tatcctcatg gtgctgggtt tcttcggctt cttcaccctg ggcatcatgc tgagttacat
241 ccgatccaag aagctggaac actcgcacga ccctttcaac gtgtacatcg agtcggacgc
301 ctggcaggag aaaggcaagg ccctcttcca ggcccgcgtt ctggagagct tcagagcttg
361 ctatgtcatt gaaaaccagg cggccgtaga acaacctgcc acacaccttc ctgaactgaa
421 gccactgtca tgaaccccat agttaattaa tagacaagtg ataagtgggt c
//
LOCUS RATPAPA 1603 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Rat prostatic acid phosphatase (rPAP) mRNA, complete cds.
ACCESSION M32397
KEYWORDS prostatic acid phosphatase.
SOURCE Rat (strain Sprague-Dawley) prostate, cDNA to mRNA, clone
rPAP-[145,343A].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1603)
AUTHORS Roiko,K., Jaenne,O.A. and Vihko,P.
TITLE Primary structure of rat secretory acid phosphatase and comparison
to other acid phosphatases
JOURNAL Gene 89, 223-229 (1900)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Vihko, 02-MAR-1990, for release after publication.
FEATURES from to/span description
pept 41 1186 prostatic acid phosphatase precursor (rPAP; EC
3.1.3.2)
sigp 41 133 prostatic acid phosphatase signal peptide
matp 134 1183 prostatic acid phosphatase
mRNA < 1 1603 rPAP mRNA
signal 1583 1588 polyA signal
BASE COUNT 434 a 414 c 387 g 368 t
ORIGIN
1 cacctcctac agctgccaca agcagctctc ttcccccacc atgagagctg tccctctgca
61 cctcgtcggg acagcaagcc tcacccttgg cttcttgctc ctgctatctc tccgcctgga
121 cccaggccaa gccaaggagt tgaagtttgt gacattggtg ttccggcatg gagaccgagg
181 tcccatcgag acctttccta atgaccccat taaggaatcc tcgtggccac aaggatttgg
241 ccaactcacc aagtggggca tgggacagca ctacgaactc ggaagttata taaggagaag
301 atacgggaga ttcttgaaca actcctataa acatgaccag gtttatatcc gaagcacaga
361 tgttgacagg actctgatga gcgctatgac aaacctcgca gccctgtttc cccctgaggg
421 gatcagcatc tggaatccca gactgctctg gcagcccatc ccagtgcaca ccgtgtctct
481 ctctgaggat cggttgctat acctgccttt cagggactgt cctcgctttc aagaactcaa
541 gagtgagact ttaaaatctg aggagttcct gaagaggctt caaccatata aaagcttcat
601 agacaccttg ccatcgctgt cgggattcga ggaccaggat ctttttgaaa tctggagtag
661 gctttacgac cctttatatt gcgagagtgt tcacaatttc accttccgca cctgggccac
721 agaggacgcc atgactaagt tgaaggagtt gtcagaatta tctctgttat ctctttatgg
781 aattcacaag cagaaagaga aatctagact ccaggggggc gtcctggtca atgaaattct
841 caagaacatg aagcttgcaa ctcaaccaca gaaggccagg aagttgatca tgtattctgc
901 atatgacact actgtgagtg gcctgcagat ggcgctagag ctttataatg gacttctacc
961 tccctacgct tcctgccaca taatggaatt gtaccaggat aatgggggga ccttcgtgga
1021 gatgtactac cggaatgaga cccagaacga gccctaccca ctcacgctgc cgggctgtac
1081 ccacagctgc cctctggaga agtttgcaga gctactggac cccgtgatcc cccaggactg
1141 ggccacagag tgtatgggca caagcaacca ccaagcgtcg ctgtaaaaga acaaccagtt
1201 gccagtgtaa tcaggaaaag agttaggtca aacttgtggg tctgccgggg atccatgatt
1261 agcacaaaga actctattgc tgaaggattc gaagattttg cttgagcggg tgagtaaagc
1321 tgccctggga atgtctgatg taaaaacagg tttcagacga agcctaccaa gttagcgtgc
1381 aagcatgaaa aagaagacac ttatcccagg ctaatgggca aaggacctga ttcacctgac
1441 aacaacgctc acctgttccc ctgccttctc tcacgaggca aaaagatgag acaggaggac
1501 acctggagga gttttccata cccagcctag aacgatcaga ttttatatgt atgtcaaaag
1561 acgctgggta gcagaactga ataataaaag cctcttggtg tcc
//
LOCUS RHLNIFHA 524 bp ds-DNA BCT 14-AUG-1990
DEFINITION R.leguminosarum nitrogen fixation protein nifH gene, 5' end.
ACCESSION M36435
KEYWORDS nitrogen fixation protein.
SOURCE R.leguminosarum DNA, clone pGB5.
ORGANISM Rhizobium leguminosarum
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE 1 (bases 1 to 524)
AUTHORS Roelvink,P.W., Harmsen,M., van Kammen,A. and van den Bos,R.C.
TITLE The nifH promoter region of Rhizobium leguminosarum: Nucleotide
sequence and promoter elements controlling activation by NifA
protein
JOURNAL Gene 87, 31-36 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 435 > 524 nifH protein
mRNA 364 > 524 nifH mRNA
binding 420 426 nifH ribosomal binding site
BASE COUNT 124 a 139 c 137 g 124 t
ORIGIN
1 gttaacccta tgcgggcacc agacgcggca tggcgcttgt catcgccgga gacagcgctt
61 tgcggcaagt gatcagaatt tgttacggcc ggtctggtga tggctgctct aacaaagtcc
121 tagaagctac gcatggataa atgagacatc ctgccggttg gtcagatatt gcacaaacgg
181 ccaccattca ttggccctca gagggtgaca atctatttcg acattgtcat cacctttgtc
241 ggttacttga caagcgtttg tgttctcaat cccgtatttt acgcccacat agctggaatc
301 gcggcgaatt attgttcgtc agctcaatcg gccgcttggc acgaatcttg agagctattg
361 agaggcagcg gaacggccgc cgcattccgt tgcgggtaac caaattgctt cgaacacatg
421 aaggaacgcc aagcatggca gctctgcgtc agatcgcatt ctatggaaaa ggcggaattg
481 gcaagtccac tacgtcccaa aacacgctgg ccgcccttgt cgac
//
LOCUS STYAHPCFA 2608 bp ds-DNA BCT 14-AUG-1990
DEFINITION S.typhimurium alkyl hydroperoxide reductase (ahpCF) gene, complete
cds.
ACCESSION J05478
KEYWORDS alkyl hydroperoxide reductase; oxidoreductase.
SOURCE S.typhimurium (strain TN1379) DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2608)
AUTHORS Tartaglia,L.A., Storz,G., Brodsky,M.H., Lai,A. and Ames,B.N.
TITLE Alkyl hydroperoxide reductase from Salmonella typhimurium: Sequence
and homology to thioresoxin reductase and other flavoprotein
disulfide oxidoreductases
JOURNAL J. Biol. Chem. 265, 10535-10540 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.Tartaglia, 05-JUN-1990.
FEATURES from to/span description
pept 166 720 alkyl hydroperoxide reductase C22 protein
(aphC)
pept 969 2534 alkyl hydroperoxide reductase F52a protein
(aphF)
mRNA 142 > 2608 ahpCF mRNA
binding 155 160 aphC ribosomal binding site (put.)
binding 959 964 aphF ribosomal binding site (put.)
signal 105 110 ahpCF -35 region
signal 129 134 ahpCF -10 region
BASE COUNT 652 a 655 c 724 g 577 t
ORIGIN 13 minutes on S.typhimurium map.
1 tcaggcggcc tttttacttt agatggctga cagggcgcag tgttcacaat gtcgcccgaa
61 tcgggttgtt agttaacgct tattgatttg ataatggaaa cgcattagcc gaatcagcaa
121 aatttcgtta acttactcct caacgaaaac acggaggaag tatagatgtc cttaattaac
181 accaaaatca aacctttcaa aaaccaggcg ttcaaaaacg gtgaattcat tgaagtcacc
241 gagaaagata ccgaaggccg ctggagcgtc ttcttcttct acccggccga ttttaccttt
301 gtttgcccga ctgaactggg tgacgttgcc gaccattacg aagaactgca gaagctgggc
361 gtagacgttt attccgtctc aaccgatact cacttcacgc acaaagcatg gcacagcagc
421 tctgaaacta tcgcaaaaat caaatatgcg atgatcggcg acccgactgg cgccctgacc
481 cgtaacttcg acaacatgcg tgaagatgaa ggtctggcag acgtgcgacc tttgttgttg
541 acccgcaggg tatcatccag gcgatcgaag ttaccgctga aggtatcggc cgtgacgcgt
601 ctgacctgct gcgtaaaatt aaagcagcac agtacgttgc cgctcaccca ggcgaagtat
661 gcccggcgaa atggaaagaa ggcgaacgac tctggctcca tccttagacc tggtcggtaa
721 aatctaaatt tccgtcgtct ttcacgccat agctgcgttg gcgtcgcctg ctcaccccgg
781 tcacttactt cagtaagctc ccggggattc tcaggctagc cgccttactc tgacgtgaaa
841 tactttggaa attgtttcag tcatgggcgc acccgcgccc atttcatttc agcaccatga
901 cgcaagttgc attaatgcag tcccgcgaag tcggcttgca tgatgatgtt ttcagagagg
961 gaagaagaat gctcgacaca aatatgaaaa cccagctcag ggcttacctt gagaaactga
1021 ccaaacctgt tgagctgatt gccacgctgg atgacagcgc taaatcggcg gaaatcaagg
1081 aactgctggc tgaaatcgcc gagctgtccg acaaagtcac gtttaaagaa gacaatactc
1141 tgccggtgcg caagccgtct ttcctgatta cgaatccagg ttctcagcaa gggccgcgct
1201 ttgccggttc tccgctgggc cacgaattta cctcgctggt tttggcgctg ctgtggaccg
1261 gcggtcatcc gtcaaaagaa gcgcagtcgc tgctggagca gattcgcgat attgacggcg
1321 attttgaatt cgaaacttat tattcactct cctgccataa ctgcccggac gtagtgcagg
1381 cgctgaacct gatggcggtg ctgaacccgc gtattaaaca tacggcaatt gatggcggca
1441 ccttccagaa cgaaatcacc gaacgtaacg tgatgggcgt tccggcggta tttgttaacg
1501 gtaaagagtt cggccaggga cgtatgactc tgaccgagat tgtcgctaaa gtggatactg
1561 gcgcagaaaa acgcgcggcg gaagcgctga acaaacgcga tgcgtatgac gtactgattg
1621 tcggctccgg cccggcgggc gcggcggcgg cggtctactc ggcgcgcaaa ggcatccgta
1681 ccggtctgat gggcgagcgc ttcggcggtc aggtgctgga taccgtggat atcgaaaact
1741 atatctcggt gccgaaaacc gaaggccaga aactggcggg cgcgctgaaa gcgcatgtca
1801 gcgattacga cgttgacgtt atcgacagcc agagcgcaag taaactggtt ccggccgcaa
1861 cagaaggcgg cttgcatcag attgaaacgg cgtccggcgc agtattgaaa gcgcgtagta
1921 tcatcatcgc caccggcgcg aaatggcgca atatgaatgt gccgggcgag gatcagtatc
1981 gcaccaaagg cgtcacctat tgtccgcact gcgacggtcc gttatttaaa gggaagcgtg
2041 tggcggtcat cggcggcggc aactctggcg ttgaagcggc tatcgatctg gcaggtattg
2101 tagaacatgt tacgttgctg gagttcgcgc cggagatgaa ggcggatcag gtgctacagg
2161 acaaagtgcg tagtctgaaa aatgtcgata tcattttgaa tgcgcaaacg acagaagtga
2221 agggcgacgg ttcgaaagtg gtgggactgg agtatcgtga ccgtgtcagc ggcgatattc
2281 atagcgtcgc gctggcaggg atttttgtgc agattggtct gttgccgaat acccactggc
2341 tggaaggcgc gcttgagcgt aaccgcatgg gcgagatcat tattgacgcc aaatgtgaaa
2401 ccagcgttaa gggcgtcttc gccgcaggcg attgcaccac cgttccttac aaacagatta
2461 tcatcgccac gggcgaaggg gcgaaagcct cgttgagcgc ctttgattat ctgattcgca
2521 ccaaaatcgc ataaaaagaa gtaagcacac ctgcaaggcc agttaactgg ccatcgtaaa
2581 tggccgatag tgtaagctat tcgggcct
//
LOCUS TMGCG 6355 bp ss-RNA VRL 14-AUG-1990
DEFINITION Tobacco mild green mosaic virus complete genome.
ACCESSION M34077 M22483
KEYWORDS coat protein.
SOURCE Tobacco mild green mosaic virus (strain U2-TMV), cDNA to viral RNA,
from N.tabacum cv. Samsum.
ORGANISM Tobacco mild green mosaic virus
Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses;
Tobamovirus.
REFERENCE 1 (bases 6127 to 6355)
AUTHORS Garcia-Arenal,F.
TITLE Sequence and structure at the genome 3' end of the U2-strain of
Tobacco mosaic virus, a histidine-accepting tobamovirus
JOURNAL Virology 167, 201-206 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 6355)
AUTHORS Solis,I. and Garcia-Arenal,F.
TITLE The complete nucleotide sequence of the genomic RNA of the
tobamovirus tobacco mild green mosaic virus
JOURNAL Virology 177, 553-558 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Garcia-Arenal, 10-FEB-1989. The RNA appears to have a tRNA-
like, L-shaped structure at the 3' terminus, linked to a quasi-
continuous double-helical stalk, with five pseudoknots involved in
the formation of the whole structure. However, the structure of
U2-TMV RNA is less stringently conserved than the 3' termini of
"vulgare" and other histidine-accepting tobamoviruses.
Draft entry and computer-readable sequence for [1] kindly submitted
by F.Garcia-Arenal, 08-MAY-1990, for release after publication.
FEATURES from to/span description
pept 71 3406 126 kDa protein
pept 71 3403 183 kDa protein
3407 4900 183 kDa protein
pept 4890 5660 28.5 kDa protein
pept 5666 6145 coat protein
BASE COUNT 1933 a 1112 c 1489 g 1821 t
ORIGIN
1 gatgttttaa tagttttcga caacaacaat taaaacaaaa acaacatatt acaaacaaca
61 aacaacaaca atggcacaca tacaatctat aattagcaac gcccttcttg aaagcgtgag
121 tggtaaaaac actctcgtta atgaccttgc aagaaggcgc atgtacgata cggccgtgga
181 agaatttaac gcccgcgacc gtagaccaaa ggtcaacttt tccaaaacta ttagcgaaga
241 gcaaacgctt ctagtctcca acgcgtaccc ggagttccag attacctttt ataatactca
301 aaatgccgta cacagtttgg ctggaggttt gagagcatta gaattggaat atctgatgct
361 acaagttccc tatggatcgc cgacatatga tataggtggg aactttgcag cacatttgtt
421 caaaggcagg gattacgtgc attgctgtat gcccaatctg gacatacgag atataatgag
481 gcacgaagga caaaaggact caattgagat gtatttgtcc agattgtctc gttctaacaa
541 ggtaattcct gagtttcaaa gggaggcttt taacaggtat gcagaagctc ccaacgaagt
601 ctgctgctct aaaacttttc aggattgtcg aatacatccg ccagagaata gtggtagaag
661 atacgctgtt gctctgcaca gtttgtatga tattcctgtg catgagtttg gagctgcgtt
721 aatatctaag aatatacatg tatgttatgc agcttccatt ttggcagaag cattattact
781 agaccagacg gaggttacgc ttaatgaaat aggcgcaact ttcaaaagag aaggtgatga
841 tgtttctttt ttctttgctg atgaaagtac tttaaattat agtcataaat acaaaaatat
901 cttgcattat gtagttaaat cttactttcc tgcttctagt agaatagttt actttaagga
961 atttttagtc actagggtta atacttggtt ttgtaaattt accaaagtag atacctatat
1021 tctgtacaag agtgttagac aagtagggtg tgatagtgat cagttctatg aggcgatgga
1081 agacgccttt gcttacaaga aaaccttggc catgttcaac actgaaagag caatctttag
1141 agacacggct tcggttaact tttggttccc taagatgaag gacatggtga tagtaccgct
1201 gtttgagggt tctattacca gcaaaaagat gacaaggagt gaggtcattg ttaatcgtga
1261 cttcgtttac acagtgctta atcatatcag aacatatcaa gccaaagcgt taacttacca
1321 gaacgtatta tctttcgtgg agtctataag atcccgcgtg ataatcaatg gtgttactgc
1381 taggtctgaa tgggatgtag ataaagcaat tcttcaaccc ttgtcaatga ctttcttctt
1441 gcagactaag ctggctgcgc ttcaagacga tatagtaatg ggaaagtttc ggtgcttgga
1501 taagaccact tctgaactta tttgggatga ggtgggcaaa ttttttggaa acgttttccc
1561 cactatcaaa gagagattgg tgagcaggaa aattctggat gtaagtgaga atgctctgaa
1621 gatcaagatc ccagatctgt atgtcacatg gaaagacagg ttcgtagctg aatacaccaa
1681 gtctgaggag ttaccgcatc tagatatcaa gaaggactta gaagaagctg agcaaatgta
1741 cgacgcgtta tcagaattat ctatccttaa gggtgctgat aatttcgata tcgcgaagtt
1801 caaagacatg tgcaaggctt tagatgttag tcctgatgtg gcagcacgag taatcgttgc
1861 agtggccgag aatagaagcg gtttaactct tacttttgat aagccaaccg aggagaatgt
1921 ggctaaggct cttaaaagca cggcgtctga ggccgtggta tgtcttgaac cgacatccga
1981 agaggtgaac gtaaataaat tttctattgc tgagaaaggg agattgcctg tgtgtgcaga
2041 aagtcatggt ttgacgaatg ctaacttaga gcaccaggag ttggagtccc tcaacgattt
2101 ccataaggct tgcgtggata gtgtgattac aaagcaaatg gcatcggttg tctacactgg
2161 ctcactcaaa gttcaacaaa tgaagaacta tgtggacagt ttggcagctt cgttgtccgc
2221 cactgtatca aatctatgca agtcactaaa ggatgaagtc gggtatgatt ctgattccag
2281 ggagaaagtt ggtgtttggg atgtcacttt gaaaaagtgg ctcctcaaac ctgcggccaa
2341 aggtcattca tggggagttg tcctggatta caaggggaaa atgtttactg cacttctatc
2401 ttatgaagga gatagaatgg tgactgagag cgactggagg agggtggctg tatcatctga
2461 tacaatggta tattctgata ttgcaaagct ccaaaatctg aggaaaacaa tgagagacgg
2521 tgaaccccac gaacctactg caaagatggt acttgtggat ggggtgcctg gttgtggaaa
2581 gtacaaagga gattttgaaa gatttgatct tgatgaggat ttgatcttgg ttcctggaaa
2641 acaagctgct gctatgatca gaagaagggc taattcatct ggactgataa gagccacaat
2701 ggacaatgtg agaacggtag attcacttct aatgcatcca aaaccgcgat cacacaagag
2761 gctttttatt gatgaagggt tgatgctgca caccggttgt gttaacttcc tggtgcttat
2821 ctctggttgc gacatcgcat acatttacgg agatacacag cagattcctt tcattaacag
2881 agttcagaat ttcccgtatc ccaaacattt tgagaagctg caagtggatg aagttgagat
2941 gaggaggacc acactgagat gcccaggtga tgtgaatttt ttcctacaat cgaagtacga
3001 aggagcggtg acaaccactt caactgtaca acgatcggtc tcatctgaga tgataggcgg
3061 taagggagta ctaaacagtg tttccaaacc actaaaaggg aaaattgtaa ctttcactca
3121 ggctgataaa tttgagttag aggagaaggg ctataagaat gtgaacaccg ttcatgagat
3181 ccaaggagaa acctttgaag atgtgtcgct ggtcagattg acggcaactc cactgactct
3241 gatttccaag tcttccccgc atgttctagt cgctctgact agacacacaa agagcttcaa
3301 atattacacc gtagtgttag atcctttagt acagataatt agtgatttgt cttctttaag
3361 ctccttcctt ttagaaatgt atatggtaga agcaggtagt agatagcaat tacagatgga
3421 tgcagtgttc aaaggtcata atctctttgt ggcaacacct aaatcaggag actttccaga
3481 tctacagttc tattacgatg tatgcctccc tggtaatagt actatactta acaagtatga
3541 tgctgttacc atgaggttac gtgataatag tcttaatgtg aaggattgtg ttcttgattt
3601 ttccaaaagt attccgatgc caaaggaggt gaaaccatgt ctagagccag ttttgcgtac
3661 cgcggcggaa ccgccaaggg ctgcaggact actcgaaaat ctggttgcaa tgattaaaag
3721 aaatttcaac gcaccagacc tgacggggac gattgacatt gagagcaccg catctgttgt
3781 agtagataag ttttttgata gctattttat taaaaaagaa aaatacacaa aaaatattgc
3841 tggagtgatg acgaaggatt caatgatgag atggttggaa aacaggaaag aagtactatt
3901 ggacgacttg gctaactaca attttacaga tctgccggcc atcgatcagt acaagcacat
3961 gatcaaggct caaccaaaac agaaattgga cctttcaatt cagaatgaat accctgctct
4021 gcaaacaatt gtctaccatt cgaagcagat caacggtatt ttggccggtt tctcagagct
4081 tacaaggttg ctgctcgagg catttgattc taagaagttt cttttcttta ctaggaaaac
4141 tccagaacag attcaagaat ttttctcgga tctcgactcg cacgttccta tggatgtgtt
4201 agaactggat atttctaagt atgataagtc acagaacgag tttcattgtg ctgtagagta
4261 tgaaatatgg aaaagattgg gtctcaatga gtttttggcc gaagtgtgga aacaagggca
4321 caggaaaaca actttgaagg attacattgc tggaatcaag acatgtctgt ggtatcaaag
4381 gaaaagcggt gatgtgacta ctttcatcgg caatactgtt ataatagcag cttgcttggg
4441 ttcaatgtta ccgatggaaa aggtcataaa aggtgctttt tgtggagacg attccgtttt
4501 gtattttcca aagggtttgg atttccctga cattcagtca tgtgctaatc tcatgtggaa
4561 ttttgaggcc aaactgtata gaaagaggta cggttacttt tgtggtagat acatcataca
4621 ccatgataag ggagcaatag tgtattatga tcctttgaag ttgatctcca aacttggggc
4681 aaaacatatc aaggattatg atcacttaga agagttaagg gtgtctttgt gcgatgttgc
4741 ttgttcgctc ggaaactggt gcttaggctt tccgcagctg aacgcagcta tcaaggaggt
4801 tcataaaacc gcgattgatg gttcgtttgc ttttaattgt gttaacaaat ttttgtgtga
4861 taaattttta tttagaactt tgtttttaaa tggctgttag tctcagagat actgtcaaaa
4921 ttagcgagtt cattgatctt tcgaaacagg atgagatact tccggcattc atgactaagg
4981 tcaagagtgt tagaatatcg actgtggaca agattatggc tgttaagaat gatagtcttt
5041 ctgatgtaga tttacttaaa ggtgttaagt tagttaagaa agggtatgtg tgcttagctg
5101 atttggtagt gtctggggag tggaatctcc cggataactg ccgtggtggt gtcagtgttt
5161 gtattgtaga taagagaatg aaaaggagta aggaagcaac gctgggtgcg tatcacgccc
5221 ctgcttgcaa aaagaatttt tcttttaagc taatccctaa ttattcaata acatccgagg
5281 atgctgagaa gcacccgtgg caagtgttag tgaatatcaa aggagtggct atggaagaag
5341 gatactgtcc tttatctttg gagttcgttt caatttgtgt agtacataaa aataatgtaa
5401 gaaaaggttt gagggaacgt attttgagtg tgacagacgg ctcgccaatt gaactcactg
5461 aaaaggttgt tgaggagttc gtggatgaag taccaatggc tgtgaaactc gaaaaggttc
5521 cggaaaacaa aaaagaaatg gtaggtaata atgttaataa taagaaaata aataacagtg
5581 gtaagaaggg ttttaaaatt gaggaaattg aggataatgt aagtgatgac gagtctatcg
5641 cgtcatcgag tacgttttaa tcaatatgcc ttatacaatc aactctccga gccaatttgt
5701 ttacttatct tccgcttacg cagatcctgt gcagctgatc aatctgtgta caaatgcatt
5761 gggtaaccag tttcaaacgc aacaagctag gacaacagtc caacagcaat ttgcggatgc
5821 ctggaaacct gtgcctagta tgacagtgag atttcctgca tcggatttct atgtgtatag
5881 atataattcg acgcttgatc cgttgatcac ggcgttatta aatagctttg atactagaaa
5941 tagaataata gaggttgata atcaacccgc accgaatact actgaaatcg ttaacgcgac
6001 tcagagggta gacgatgcta ctgtagctat aagggcttca atcaataatt tggctaatga
6061 actggttcgt ggaactggca tgttcaatca agcaggcttt gagactgcta gtggacttgt
6121 ctggaccaca actccggcta cttagctatt gttgtgagat ttcctaaaat aaagtcgctg
6181 aagacttaaa attcagggtg gctgatacca aaatcagcag tggttgttcg tccacttaaa
6241 tataacgatt gtcatatctg gatccaacag ttaaaccatg tgatggtgta tactgtggta
6301 tggcgtaaaa catcggagag gttcgaatcc tcccctaacc gccggtagcg gccca
//
LOCUS TRNTNPIA 1333 bp ds-DNA BCT 14-AUG-1990
DEFINITION Transposon Tn21 integrase (tnpI) gene, complete cds, and
adenyltransferase (aadA), 5' end.
ACCESSION M33633
KEYWORDS integrase.
SOURCE Transposon Tn21 DNA, clone pMON90.
ORGANISM Transposon Tn21
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1333)
AUTHORS Mercier,J., Lachapelle,J., Couture,F., Lafond,M., Vezina,G.,
Boissinot,M. and Levesque,R.C.
TITLE Structural and functional characterization of tnpI a recombinase
locus in Tn21 and related transposons
JOURNAL J. Bacteriol. 172, 3745-3757 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.C.Levesque, 06-APR-1990, for release after publication.
FEATURES from to/span description
ORF 209 613 potential ORF1
ORF 665 1060 potential ORF2
ORF 1150 137 (c) potential ORF3
ORF 1298 > 1333 adenyltransferase (aadA)
site 1288 1291 integration site GTTA
BASE COUNT 286 a 396 c 408 g 243 t
ORIGIN
1 ggatccatca ggcaacgacg ggctgctgcc ggccatcagc ggacgcaggg aggactttcc
61 gcaaccggcc gttcgatgcg gcaccgatgg ccttcgcgca ggggtagtga atccgccagg
121 attgacttgc gctgccctac ctctcactag tgaggggcgg cagcgcatca agcggtgagc
181 gcactccggc accgccaact ttcagcacat gcgtgtaaat catcgtcgta gagacgtcgg
241 aatggccgag cagatcctgc acggttcgaa tgtcgtaacc gctgcggagc aaggccgtcg
301 cgaacgagtg gcggagggtg tgcggtgtgg cgggcttcgt gatgcctgct tgttctacgg
361 cacgtttgaa ggcgcgctga aaggtctggt catacatgtg atggcgacgc acgacaccgc
421 tccgtggatc ggtcgaatgc gtgtgctgcg caaaaaccca gaaccacggc caggaatgcc
481 cggcgcgcgg atacttccgc tcaagggcgt cgggaagcgc aacgccgctg cggccctcgg
541 cctggtcctt cagccaccat gcccgtgcac gcgacacgtg ctcgcgcagg ctgggtgcca
601 agctctcggg taacatcaag gcccgatcct tggagccctt gccctcccgc acgatgatcg
661 tgccgtgatc gaaatccaga tccttgaccc gcagttgcaa accctcactg atccgcatgc
721 ccgttccata cagaagctgg gcgaacaaac gatgctcgcc ttccagaaaa ccgaggatgc
781 gaaccacttc atccggggtc agcaccaccg gcaagcgccg cgacggccga ggtcttccga
841 tctcctgaag ccagggcaga tccgtgcaca gcaccttgcc gtagaagaac agcaaggccg
901 ccaatgcctg acgatgcgtg gagaccgaaa ccttgcgctc gttcgccagc caggacagaa
961 atgcctcgac ttcgctgctg cccaaggttg ccgggtgacg cacaccgtgg aaacggatga
1021 aggcacgaac ccagtggaca taagcctgtt cggttcgtaa gctgtaatgc aagtagcgta
1081 tgcgctcacg caactggtcc agaaccttga ccgaacgcag cggtggtaac ggcgcagtgg
1141 cggttttcat ggcttgttat gactgttttt tggggtacag tctatgcctc gggcatccaa
1201 gcagcaagcg cgttacgccg tgggtcgatg tttgatgtta tggagcagca acgatgttac
1261 gcagcagggc agtcgcccta aaacaaagtt aaacatcatg agggaagcgg tgatcgccga
1321 agtatcgact caa
//
LOCUS TTERRDA 1504 bp ss-rRNA RNA 14-AUG-1990
DEFINITION T.tenax 16S rRNA.
ACCESSION M35966
KEYWORDS 16S ribosomal RNA.
SOURCE T.tenax RNA.
ORGANISM Thermoproteus tenax
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Thermoproteales; Thermoproteaceae.
REFERENCE 1 (bases 1 to 1504)
AUTHORS Leinfelder,W., Jarsch,M. and Bock,A.
TITLE The phylogenetic position of the sulfur-dependent archaebacterium
Thermoproteus tenax: Sequence of the 16S rRNA gene
JOURNAL Syst. Appl. Microbiol. 6, 164-170 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
rRNA 1 1504 16S rRNA
BASE COUNT 297 a 448 c 563 g 196 t
ORIGIN
1 aaaccggttg atcctgccgg acctgaccgc tatcggggtg gggctaagcc atgcgagtcg
61 cgcgcccggg gcgccgggcg cggcgcacgg ctcagtaaca cgtacccaac ctaacctcgg
121 gagggggaca accccgggaa actggggctg atcccccata ggggaagggc gctggaaggc
181 cccttcctcc aaagggatcg cgggcgatct cccgcggtcc gcccgagggt gggggtacgg
241 cccatcaggt tgttggcggg gtaacggccc gccaagccga agacgggtag gggcggtgag
301 agccgtgagc cccgagatgg gcactgagac aagggcccag gccctacggg gtgcagcagg
361 cgcgaatact ccgcaatgcg ggcaaccgcg acggggccac cccgagtgcc gggcgaagag
421 cccggctttt gcccggtgta aggagccggg cgaataagcg gggggtaagt ctggtgtcag
481 ccgccgcggt aataccagcc ccgcgagtgg tcagggtgat tactgggctt aaagcgcccg
541 tagccggccc ggcaagtcgc tcctgaaatc cccaggctca acctgggggc agggggcgat
601 actgccgggc tagggggcgg gagaggccgc cggtactccg ggggtagggg cgaaatccta
661 taatccccgg aggaccacca gtggcgaaag cgggcggcca gaacgcgccc gacggtgagg
721 ggcgaaagcc gggggagcaa aggggattag atacccctgt agtcccggcc gtaaacgatg
781 cgggctagct gtcggccggg cttagggccc ggccggtggc gtagggaaac cgttaagccc
841 gccgcctggg gagtacggcc gcaaggctga aacttaaagg aattggcggg ggggcaccac
901 aaggggtgaa gcttgcggct taattggagt caacgccgga aaccttaccc ggggcgacag
961 caggatgaag gccaggctaa cgaccttgcc ggacgagctg agaggaggtg catggccgtc
1021 gtcagctcgt gccgtgaggt gtccggttaa gtccggcaac gagcgagacc cccaccccta
1081 gttgctaccc cgctcttcgg ggcggggggc acactagggg gactgccggc gtaagccgga
1141 ggaaggaggg ggcgacggca ggtcagtatg ccccgaaacc ccggggctgc acgcgagctg
1201 caatggcggg gacagcggga tccgaccccg aaagggggag gcaatcccgt aaaccccgcc
1261 ccagtaggga tcgagggctg caactcgccc tcgtgaacgt ggaatcccta gtaaccgcgt
1321 gtcaccaacg cgcggtgaat acgtccctgc cccttgcaca caccgcccgt cgcaccaccc
1381 gagggagttc tctgcgaggc ccctcgcttg gggcaaccca ggtgggggga cgagcagaga
1441 actcccgagg ggggtgaagt cgtaacaagg tagccgtagg ggaacctgcg gttggatcac
1501 ctcc
//
LOCUS VACPOLRDA 15212 bp ds-DNA VRL 14-AUG-1990
DEFINITION Vaccinia virus RNA polymerase (rpo30), polyA polymerase large
subunit and DNA polymerase genes, complete cds.
ACCESSION M36339
KEYWORDS DNA polymerase; RNA polymerase; polyA polymerase large subunit.
SOURCE Vaccinia virus (strain WR) DNA.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 15212)
AUTHORS Gershon,P., Jones,E.V., Moss,B. and Ahn,B.Y.
TITLE Eukaryotic transcriptional elongation factor homolog is an integral
subunit of the vaccinia virus encoded RNA polymerase
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 2041 to 8280)
AUTHORS Gershon,P., Jones,E.V., Moss,B. and Ahn,B.Y.
TITLE Eukaryotic transcriptional elongation factor homolog is an integral
subunit of the vaccinia virus encoded RNA polymerase
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Authorin copy for [1] kindly submitted by B.Y.Ahn, 10-JUL-1990.
National Institutes of Health
NIAID/LVD
9000 Rockville Pike
Bethesda, Maryland 20892
FEATURES from to/span description
pept 1621 182 (c) polyA polymerase large subunit
pept 3831 1618 (c) ORF1
pept 4530 3958 (c) ORF2
pept 5365 4586 (c) RNA polymerase subunit (rpo30)
pept 5414 6439 ORF3
pept 6576 8279 ORF4
pept 8361 8861 ORF5
pept 8986 9807 ORF6
pept 12834 9814 (c) DNA polymerase
pept 12866 13153 ORF7
pept 13537 13148 (c) ORF8
BASE COUNT 4992 a 2561 c 2400 g 5259 t
ORIGIN
1 aagcttcctg cgtactcaaa gtagataaac cctcatcgcc cgcgtgtgag agaagacctt
61 cgtccccgtc cagatgcgag agaatgaata accctggaaa acaagttccg tttatgagga
121 cggacatgct acaaaatatg ttcgcggcta atcgcgacaa cgtggcgtcg agacttttga
181 actaaaatac aattatatcc ttttcgatat taataaatcc gtgtcgtcca ggttttttat
241 ctctttcagt atgtgaatag ataggtattt tatctctatt catcatcgaa tttaagagat
301 ccgataaaca ttgtttgtat tctccagatg tcagcatctg atacaacaat atatgtgcac
361 ataaacctct ggcacttatt tcatgtacct tccccttatc actaaggaga atagtatttg
421 agaaatatgt atacatgata ttatcatgaa ttagatatac agaatttgta acactctcga
481 aatcacacga tgtgtcggcg ttaagatcta atatatcact cgataacaca ttttcatcta
541 gatacactag acatttttta aagctaaaat agtctttagt agtgacagta actatgcgat
601 tattttcatc gatgatacat ttcatcggca tattattacg cttaccatca aagactatac
661 catgtgtata tctaacgtat tctagcatgg ttgccatacg cgcattaaac ttttcaggat
721 ctttggatag atcttccaat ctatctattt gagaaaacat ttttatcatg ttcaatagtt
781 gaaacgtcgg atccactata tagatattat ctataaagat tttaggaact acgttcatgg
841 tatcctggcg aatattaaaa ctatcaatga tatgattatc gttttcatct tttatcacca
901 tatagtttct aagatatggg attttactta atataatatt atttcccgta ataaatttta
961 ttagaaatgc caaatctata agaaaagttc tagaattagt ctgaagaata tctatatcgc
1021 cgtatagtat atttggatta attagatata gagaatatga tccgtaacat atacaacttt
1081 tattatggcg tctaagatat tcttccatca acttattaac atttttgact agggaagata
1141 cattatgacg tcccattact tttgccttgt ctattactgc gacgttcata gaatttagca
1201 tatctcttgc caattcttcc attgatgtta cattataaga aattttagat gaaattacat
1261 ttggagcttt aatagtaaga actcctaata tgtccgtgta tgtggtcact aatacagatt
1321 gtagttctat aatcgtaaat aatttaccta tattatatgt ttgagtctgt ttagaaaagt
1381 agctaagtat acgatctttt atttctgatg cagatgtatc aacatcggaa aaaaatcttt
1441 ttttattctt ttttactaaa gatacaaata tgtctttgtt aaaaacagtt attttctgaa
1501 tatttctagc ttgtaatttt aacatatgat attcgttcac actaggtact ctgcctaaat
1561 aggtttctat aatctttaat gtaatattag gaaaagtatt ctgatcagga ttcctattca
1621 ttttgaggat ttaaaactct gattattgtc taatatggtc tctacgcaaa ctttttcaca
1681 gagcgataga gtttttgata actcgttttt cttaagaaat ataaaactac tgtctccaga
1741 gctcgctcta tcttttattt tatttaattc gatacaaact cctgatactg gttcagaaag
1801 taattcatta attttcagtc ctttatagaa gatatttaat atagataata caaaattttc
1861 agttcttgat atcgatctga ttgatcctag aactagatat attaataacg tgctcattag
1921 gcagtttatg gcagcttgat aattagatat agtatattcc agttcatatt tattagatac
1981 cgcattgccc agattttgat attctatgaa ttcctctgaa aataaatcca aaataactaa
2041 acattctatt ttttgtggat tagtgtactc tcttccctct atcatgttca ctactggtgt
2101 ccacgatgat aaatatctag agggaatata atatagtcca taggatgcca atctagcaat
2161 gtcgaataac tgtaatttga ttcttcgttc ttcattatga attgattctt gaggtataaa
2221 cctaacacaa attatattat tagacttttc gtatgtaatg tctttcatgt tataagtttt
2281 taatcctgga atagaatcta ttttaatgag gcttttaaac gcagagttct ccaacgagtc
2341 aaagcataat actctgttgt ttttcttata tacgatgtta cgattttctt ctttgaatgg
2401 aataggtttt tgaattagtt tataattaca acataataga taaggaagtg tgcaaatagt
2461 acgcggaaaa aacataatag ctcccctgtt ttcatccatg gttttaagta aatgatcact
2521 ggcttcttta gtcaatggat attcgaacat taaccgtttc atcatcattg gacagaatcc
2581 atatttctta atgtaaagag tgatcaaatc attgtgttta ttgtaccatc ttgttgtaaa
2641 tgtgtattcg gttatcggat ctgctccttt ttctattaaa gtatcgatgt cgatctcgtc
2701 taagaattca actatatcga catatttcat ttgtatacac ataaccatta ctaacgtaga
2761 atgtatagga agagatgtaa cgggaacagg gtttgttgat tcgcaaacta ttctaataca
2821 taattcttct gttaatacgt cttgcacgta atctattata gatgccaaga tatctatata
2881 attattttgt aagatgatgt taactatgtg atctatataa gtagtgtaat aattcatgta
2941 tttcgatata tgttccaact ctgtctttgt gatgtctagt ttcgtaatat ctatagcatc
3001 ctcaaaaaat atattcgcat atattcccaa gtcttcagtt ctatcttcta aaaaatcttc
3061 aacgtatgga atataataat ctattttacc tcttctgata tcattaatga tatagttttt
3121 gacactatct tctgtcaatt gattcttatt cactatatct aagaaacgga tagcgtccct
3181 aggacgaact actgccatta atatctctat tatagcttct ggacataatt catctattat
3241 accagaatta atgggaacta ttccgtatct atctaacata gttttaagaa agtcagaatc
3301 taagacctga tgttcatata ttggttcata catgaaatga tctctattga tgatagtgac
3361 tatttcattc tctgaaaatt ggtaactcat tctatatatg ctttccttgt tgatgaagga
3421 tagaatatac tcaatagaat ttgtaccaac aaactgttct cttatgaatc gtatatcatc
3481 atctgaaata atcatgtaag gcatacattt aacaattaga gacttgtctc ctgttatcaa
3541 tatactattc ttgtgataat ttatgtgtga ggcaaatttg tccacgttct ttaattttgt
3601 tatagtagat atcaaatcca atggagctac agttcttggc ttaaacagat atagtttttc
3661 tggaacgaat tctacaacat tattataaag gactttgggt agataagtgg gatgaaatcc
3721 tattttaatt aatgcgatag ccttgtcctc gtgcagatat ccaaacgctt ttgtgatagt
3781 atggcattca ttgtctagaa acgctctacg aatatctgtg acagatatca tctttagaga
3841 atatactagt cgcgttaata gtactacaat ttgtattttt taatctatct caataaaaaa
3901 attaatatgt atgattcaat gtataactaa actactaact gttattgata actagaatca
3961 gaatctaatg atgacgtaac caagaagttt atctactgcc aatttagctg cattattttt
4021 agcatctcgt ttagattttc catctgcctt atcgaatact cttccgtcga tgtctacaca
4081 ggcataaaat gtaggagagt tactaggccc cactgattca atacgaaaag accaatctct
4141 cctagtaatt tggcagtact cattaataac ggtgacaggg ttagcacctt tccaatcaat
4201 aattttttta gccggaataa catcatcaaa agacttatga tcctctctca ttgatttttc
4261 gcgggataca tcatctatta tgacgtcagc catagcatca gcatccggct tatccgcctc
4321 cgttgtcata aaccaacgag gaggaatatc gtcggagctg tacaccatag cactacgttg
4381 aagatcgtac agagctttat taacttctcg cttctccata ttaagttgtc tagttagttg
4441 tgcagcagta gctccttcga ttccaatggt tttaatagcc tcacacacaa tctctgcgtt
4501 agaacgctcg tcgatataga ttttagacat ttttagagag aactaacgca atcagtaata
4561 aaactaattt attttatcat tttttttatt catcatcctc tggtggttcg tcgtttctat
4621 cgaatgtagc tctgattaac ccgtcatcta taggtgatgc tggttctgga gattctggag
4681 gagatggatt attatctgga agaatctctg ttatttcctt gttttcatgt atcgattgcg
4741 ttgtaacatt aagattgcga aatgctctaa atttgggagg cttaaagtgt tgtttgcaat
4801 ctctacacgc gtgtctaact agtggaggtt cgtcagcggc tctagtttga atcatcatcg
4861 gcgtagtatt cctactttta cagttaggac acggtgtatt gtatttctcg tcgagaacgt
4921 taaaataatc gttgtaactc acatccttta ttttatctat attgtattct actcctttct
4981 taatgcattt tataccgaat aagagatagc gaaggaattc tttttcggtg ccgctagtac
5041 ccttaatcat atcacatagt gttttatatt ccaaatttgt ggcaatagac ggtttatttc
5101 tatacgatag tttgtttctg gaatcctttg agtattctat accaatatta ttctttgatt
5161 cgaatttagt ttcttcgata ttagattttg tattacctat attcttgatg tagtactttg
5221 atgatttttc catggcccat tctattaagt cttccaagtt ggcatcatcc acatattgtg
5281 atagtaattc tcggatatca gtagcggtta ccgccattga tgtttgttca ttggatgagt
5341 aactactaat gtatacattt tccatttata acacttatgt attaactttg ttcatttata
5401 ttttttcatt attatgttga tattaacaaa agtgaatata tatatgttaa taattgtatt
5461 gtggttatac ggctacaatt ttataattag tgaaagtcag tgtccgatga tcaatgacga
5521 tagctttact ctgaaaagaa agtatcaaat cgatagtgcg gagtcaacaa taaaaatgga
5581 taagaagagg acaaagtttc agaatagagc caaaatggta aaagaaataa atcagacaat
5641 aagagcagca caaactcatt acgagacatt gaaactagga tacataaaat ttaagagaat
5701 gattaggact actactctag aagatatagc accatctatt ccaaataatc agaaaactta
5761 taaactattc tcggacattt cagccatcgg caaagcatca cggaatccaa gtaagatggt
5821 atatgctctg ctgctttaca tgtttcccaa tttgtttgga gatgatcata gattcattcg
5881 ttatagaatg catccaatga gtaaaatcaa acacaagatc ttctctcctt tcaaacttaa
5941 tcttattaga atattagtgg aagaaagatt ctataataat gaatgcagat ctaataaatg
6001 gagaataatt ggaacacaag ttgataaaat gttgatagct gaatctgata aatatacaat
6061 agatgcaagg tataacctaa aacccatgta tagaatcaag ggaaaatctg aagaagatac
6121 cctctttatc aaacagatgg tagaacaatg tgtgacatcc caggaattgg tggaaaaagt
6181 gttgaagata ctgtttagag atttgttcaa gagtggagaa tacaaagcgt acagatacga
6241 tgatgatgta gaaaatggat ttattggatt ggatacacta aaattaaaca ttgttcatga
6301 tatagttgaa ccatgtatgc ctgttcgtag gccagtggct aagatactgt gtaaagaaat
6361 ggtaaataaa tactttgaga atccgctaca tattattggt aaaaatcttc aagagtgcat
6421 tgactttgtt agtgaatagg catttcatct ttctccaata ctaattcaaa ttgttaaatt
6481 aataatggat agtataaata gttattagtg ataaaatagt aaaaataatt attagaataa
6541 gagtgtagta tcatagataa ctctcttcta taaaaatgga ttttattcgt agaaagtatc
6601 ttatatacac agtagaaaat aatatagatt ttttaaagga tgatacatta agtaaagtaa
6661 acaattttac cctcaatcat gtactagctc tcaagtatct agttagcaat tttcctcaac
6721 atgttattac taaggatgta ttagctaata ccaatttttt tgttttcata catatggtac
6781 gatgttgtaa agtgtacgaa gcggttttac gacacgcatt tgatgcaccc acgttgtacg
6841 ttaaagcatt gactaagaat tatttatcgt ttagtaacac aatacaatcg tacaaggaaa
6901 ccgtgcataa actaacacaa gatgaaaaat ttttagaggt tgccaaatac atggacgaat
6961 taggagaact tataggcgta aattatgact tagttcttaa tccattattt cacggagggg
7021 aacccatcaa agatatggaa atcatttttt taaaactgtt taagaaaaca gacttcaaag
7081 ttgttaaaaa attaagtgtt ataagattac ttatttgggc ttacctaagc aagaaagata
7141 caggcataga gtttgcggat aatgatagac aagatatata tactctattt caacaaactg
7201 gtagaatcgt ccatagcaat ctaacagaaa cgtttagaga ttatatcttt cccggagata
7261 agactagcta ttgggtgtgg ttaaacgaaa gtatagctaa tgatgcggat attgttctta
7321 atagacacgc cattaccatg tatgataaaa ttcttagtta tatatactct gagataaaac
7381 agggacgcgt taataaaaac atgcttaagt tagtttatat ctttgagcct gaaaaagata
7441 tcagagaact tctgctagaa atcatatatg atattcctgg agatatccta tctattattg
7501 atgcaaaaaa cgacgattgg aaaaaatatt ttattagttt ttataaagct aattttatta
7561 acggtaatac atttattagt gatagaacgt ttaacgagga cttattcaga gttgttgttc
7621 aaatagatcc cgaatatttc gataatgaac gaattatgtc tttattctct acgagtgctg
7681 cggacattaa acgatttgat gagttagata ttaataacag ttatatatct aatataattt
7741 atgaggtgaa cgatatcaca ttagatacaa tggatgatat gaagaagtgt caaatcttta
7801 acgaggatac gtcgtattat gttaaggaat acaatacata cctgtttttg cacgagtcgg
7861 atcccatggt catagagaac ggaatactaa agaaactgtc atctataaaa tccaagagta
7921 gacggctgaa cttgtttagc aaaaacattt taaaatatta tttagacgga caattggctc
7981 gtctaggtct tgtgttagat gattataaag gagacttgtt agttaaaatg ataaaccatc
8041 ttaagtctgt ggaggatgta tccgcattcg ttcgattttc tacagataaa aaccctagta
8101 ttcttccatc gctaatcaaa actattttag ctagttataa tatttccatc atcgtcttat
8161 ttcaaaggtt tttaagagat aatctatatc atgtagaaga attcttggat aaaagcatcc
8221 atctaaccaa gacggataag aaatatatac ttcaattgat aagacacggt agatcataga
8281 acagaccaaa tatattatta ataatttgta tatacataga tataattatc acatattaaa
8341 aattcacaca tttttgataa atgggaactg ctgcaacaat tcagactccc accaaattaa
8401 tgaataaaga aaatgcagaa atgattttgg aaaaaattgt tgatcatata gttatgtata
8461 ttagtgacga atcaagtgat tcagaaaata atcctgaata tattgatttt cgtaacagat
8521 acgaagacta tagatctctc attataaaaa gtgatcacga gtttgtaaag ctatgtaaaa
8581 atcatgcaga gaaaagttct ccagaaacgc aacaaatgat tatcaaacac atatacgaac
8641 aatatcttat tccagtatct gaagtactat taaaacctat aatgtccatg ggtgacataa
8701 ttacatataa cggatgtaaa gacaatgaat ggatgctaga acaactctct accctaaact
8761 ttaacaatct ccgcacatgg aactcatgta gcataggcaa tgtaacgcgt ctgttttata
8821 cattttttag ttatctgatg aaagataaac taaatatata agtataatcc cattctaata
8881 ctttaacctg atgtattagc atcttattag aatattaacc taactaaaag acataacata
8941 aaaactcatt acatagttga taaaaagcgg taggatataa atattatggc tgccaccgtt
9001 ccgcgttttg acgacgtgta caaaaatgca caaagaagaa ttctagatca agaaacattt
9061 tttagtagag gtctaagtag accgttaatg aaaaacacat atctatttga taattacgcg
9121 tatggatgga taccagaaac tgcaatttgg agtagtagat acgcaaactt agatgcaagt
9181 gactattatc ccatttcgtt gggattactt aaaaagttcg agtttctcat gtctctatat
9241 aaaggtccta ttccagtata cgaagaaaaa gtaaatactg aattcattgc taatggatcg
9301 ttctctggta gatacgtatc atatcttcga aagttttctg ctcttccaac aaacgagttt
9361 attagttttt tgttactgac ttccattcca atctataata tcttgttctg gtttaaaaat
9421 actcagtttg atattactaa acacacatta ttcagatacg tctatacaga taatgccaaa
9481 cacctggcgt tggctaggta tatgcatcaa acaggagact ataagccttt gtttagtcgt
9541 ctcaaagaga attatatatt taccggtccc gttccaatag gtatcaaaga tataaatcac
9601 cctaatctta gtagagcaag aagtccatcc gattatgaga cattagctaa tattagtact
9661 atattgtact ttaccaagta tgatccggta ttaatgtttt tattgtttta cgtacctggg
9721 tattcaatta ctacaaaaat tactccagcc gtagaatatc taatggataa actgaatcta
9781 acaaagagcg acgtacaact gttgtaaatt attttatgct tcgtaaaatg taggttttga
9841 accaaacatt ctttcaaaga atgagatgca taaaacttta ttatccaata gattgactat
9901 ttcggacgtc aatcgtttaa agtaaacttc gtaaaatatt ctttgatcac tgccgagttt
9961 aaaacttcta tcgataattg tttcatatgt tttaatattt acaagttttt tggtccatgg
10021 tacattagcc ggacaaatat atgcaaaata atatcgttct ccaagttcta tagtttctgg
10081 attattttta ttatattcag taaccaaata catattaggg ttatctgcgg atttataatt
10141 tgagtgatgc attcgactca acataaataa ttctagagga gacgatctac tatcaaattc
10201 ggatcgtaaa tctgtttcta aagaacggag aatatctata catacctgat tagaattcat
10261 ccgtccttca gacaacatct cagacagtct ggtcttgtat gtcttaatca tattcttatg
10321 aaacttggaa acatctcttc tagtttcact agtaccttta ttaattctct caggtacaga
10381 ttttgaattc gacgatgccg agtatttcat cgttgtatat ttcttcttcg attgcataat
10441 cagattctta tataccgcct caaactctat tttaaaatta ttaaacaata ctctattatt
10501 aatcagtcgt tctaactcct ttgctatttc tatggactta tctacatctt gactgtctat
10561 ctctgtaaac acggagtcgg tatctccata cacgctacga aaacgaaatc tgtaatctat
10621 aggcaacgat gttttcacaa tcggattaat atctctatcg tccatataaa atggattact
10681 taatggattg gcaaaccgta acataccgtt agataactct gctccattta gtaccgattc
10741 tagatacaag atcattctac gtcctatgga tgtgcaactc ttagccgaag cgtatgagta
10801 tagagcacta tttctaaatc ccatcagacc atatactgag ttggctacta tcttgtacgt
10861 atattgcatg gaatcataga tggccttttc agttgaactg gtagcctgtt ttaacatctt
10921 tttatatctg gctctctctg ccaaaaatgt tcttaatagt ctaggaatgg ttccttctat
10981 cgatctatcg aaaattgcta tttcagagat gaggttcggt agtctaggtt cacaatgaac
11041 cgtaatatat ctaggaggtg gatatttctg aagcaagagc tgattattta tttcttcttc
11101 caatctattg gtactaacaa cgacaccgac taatgtttcc ggagatagat ttccaaagat
11161 acacacatta ggatacagac tgttataatc aaagattaat acattattac taaacatttt
11221 ttgttttgga gcaaatacct taccgccttc ataaggaaac ttttgttttg tttctgatct
11281 aactaagata gttttagttt ccaacaatag ctttaacagt ggacccttga tgactgtact
11341 cgctctatat tcgaatacca tggattgagg aagcacatat gttgacgcac ccgcgtctgt
11401 ttttgtttct actccataat actcccacaa atactgacac aaacaagcat catgaataca
11461 gtatctagcc atatctaaag ctatgtttag attataatcc ttatacatct gagctaaatc
11521 aacgtcatcc tttccgaaag ataatttata tgtatcatta ggtaaagtag gacataatag
11581 tacgacttta aatccatttt cccaaatatc tttacgaatt actttacata taatatcctc
11641 atcaacagtc acataattac ctgtggttaa aacctttgca aatgcagcgg ctttgccttt
11701 cgcgtccgta gtatcgtcac cgatgaacgt catttctcta actcctctat ttaatacttt
11761 acccatgcaa ctgaacgcgt tcttggatat agaatccaat ttgtacgaat ccaatttttc
11821 agatttttga atgaatgaat atagatcgaa aaatatagtt ccattattgt tattaacgtg
11881 aaacgtagta ttggccatgc cgcctactcc cttatgacta gactgatttc tctcataaat
11941 acagagatat acagcttcct ttttgtccgg agatctaaag ataatcttct ctcctgttaa
12001 taactctaga cgattagtaa tatatctcag atcaaagtta tgtccgttaa aggtaacgac
12061 atagtcgaac gttagttcca acaattgttt agctattcgt aacaaaacta tttcagaaca
12121 taaaactagt tctcgttcgt aatccatttc cattagtgac tgtatcctca aacatcctct
12181 atcgacggct tcttgtattt cctgttccgt taacatctct tcattaatga gcgtaaacaa
12241 taatcgttta ccacttaaat cgatataaca gtaacttgta tgcgagattg ggttaataaa
12301 tacagaagga aacttcttat cgaagtgaca ctctatatct agaaataagt acgatcttgg
12361 gatatcgaat ctaggtattt ttttagcgaa acagttacgt ggatcgtcac aatgataaca
12421 tccattgtta atctttgtca aatattgctc gtccaacgag taacatccgt ctggagatat
12481 cccgttagaa atataaaacc aactaatatt gagaaattca tccatggtgg cattttgtat
12541 gctgcgtttc tttggctctt ctatcaacca catatctgcg acggagcatt ttctatcttt
12601 aatatctaga ttataactta ttgtctcgtc aatgtctata gttctcatct ttcccaacgg
12661 cctcgcatta aatggaggag gagacaatga ctgatatatt tcgtccgtca ctacgtaata
12721 aaagtaatga ggaaatcgta taaatacggt ctcaccattt cgacatctgg atttcagata
12781 taaaaatctg ttttcaccgt gactttcaaa ccaattaatg caccgaacat ccatttatag
12841 aatttagaaa tatattttca tttaaatgaa tcccaaacat tggggaagag ccgtatggac
12901 cattattttt atagtacttt cgcaagcggg tttagacggc aacatagaag cgtgtaaacg
12961 aaaactatat actatagtta gcactcttcc atgtcctgca tgtagacggc acgcgactat
13021 cgctatagag gacaataatg tcatgtctag cgatgatctg aattatattt attatttttt
13081 catcagatta tttaacaatt tggcatctga tcccaaatac gcgatcgatg tgacaaaggt
13141 taacccttta taaacttaac ccattataaa acttatgatt agtcacgact gaaataaccg
13201 cgtgattatt ttttggtata attctacacg gcatggtttc tgtgactatg aattcaaccc
13261 ccgttacatt agtgaaatct ttaacaaaca gcaagggttc gtcaaagaca taaaactcat
13321 tgtttacaat cgaaatagac cccctatcac acttaaaata aaaaatatcc ttatccttta
13381 ccaccaaata aaattctgat tggtcaatgt gaatgtattc acttaacagt tccacaaatt
13441 tatttattaa ctccgaggca catacatcgt cggtattttt tatggcaaac tttactcttc
13501 cagcatccgt ttctaaaaaa atattaacga gttccattta tatcatccaa tattattgaa
13561 atgacgttga tggacaaatg atacaaataa gaaggtacgg tacctttgtc caccatctcc
13621 tccaattcat gctctatttt gtcattaact ttaatgtatg aaaacagtac gccacatgct
13681 tccatgacag tgtgtaacac tttggataca aaatgtttga cattagtata attgttcaag
13741 actgtcaatc tataatagat agtagctata atatattcta tgatggtatt gaagaagatg
13801 acaaccttgg catattgatc atttaacaca gacatggtat caacagatag cttgaatgaa
13861 agagaatcag taattggaat aagcgtcttc tcgatggagt gtccgtatac caacatgtct
13921 gatattttga tgtattccat taaattattt agttttttct ttttattctc gttaaacagc
13981 atttctgtca acggacccca acatcgttga ccgattaagt tttgattgat ttttccgtgt
14041 aaggcgtatc tagtcagatc gtatagccta tccaataatc catcgtctgt gtgtagatca
14101 catcgtacac tttttaattc tctatagaag agcgacagac atctggagca attacagaca
14161 gcaatttctt tattctctac agatgtaaga tacttgaaga cattcctatg atgatgcaga
14221 attttggata acacggtatt gatggtatct gttaccataa ttcctttgat ggctgatagt
14281 gtcagagcac aagatttcca atctttgaca atttttagca ccattatctt tgttttgata
14341 tctatatcag acagcatggt gcgtctgaca acacaaggat taagacggaa agatgaaatg
14401 attctctcaa catcttcaat ggataccttg ctattttttc tggcattatc tatatgtgcg
14461 agaatatcct ctagagaatc agtatccttt ttgatgatag tggatctcaa tgacatggga
14521 cgtctaaacc ttcttattct atcaccagat tgcatggtga tttgtcttct ttcttttatc
14581 ataatgtaat ctctaaattc atcggcaaat tgtctatatc taaaatcata atatgagatg
14641 tttacctcta caaatatctg ttcgtccaat gttagagtat ttacatcagt tttgtattcc
14701 aaattaaaca tggcaacgga tttaatttta tattcctcta ttaagtcctc gtcgataata
14761 acagaatgta gataatcatt taatccatcg tacatggttg gaagatgctt gttgacaaaa
14821 tctttaattg tcttgatgaa ggtgggacta tatctaacat cttgattaat aaaatttata
14881 acattgtcca taggatactt tgtaactagt tttatacaca tctcttcatc ggtaagccca
14941 gacagaatat cgtgaacagg tggtatatta tattcatcag atatacgaag aacaatgtcc
15001 aaatctatat tgtttaatat attatataga tgtagtgtag ctcctacagg aatatcttta
15061 actaagtcaa tgatttcatc aaccgttaga tctattttaa agttaatcat ataggcattg
15121 atttttaaaa ggtatgtagc cttgactaca ttctcattaa ttaaccattc caagtcactg
15181 tgtgtaagaa gattatattc tatcataagc tt
//
LOCUS YSCERD2A 3327 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae ER lumen protein retaining receptor (ERD2) gene,
complete cds.
ACCESSION M34777
KEYWORDS ER lumen protein retaining receptor; HDEL receptor.
SOURCE S.cerevisiae (strain SEY2102) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3327)
AUTHORS Semenza,J.C., Hardwick,K.G., Dean,N. and Pelham,H.R.B.
TITLE ERD2, a yeast gene required for the receptor-mediated retrieval of
luminal ER proteins from the secretory pathway
JOURNAL Cell 61, 1349-1357 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Semenza, 01-JUN-1990, for release after publication.
FEATURES from to/span description
pept 1337 1358 ER lumen protein retaining receptor, exon 1
(ERB2)
1456 2093 ER lumen protein retaining receptor, exon 2
ORF 2492 2960 ORF1
IVS 1359 1455 ERB2 intron A
site 1359 1364 consensus splice donor
site 1416 1422 branchpoint sequence
site 1453 1455 acceptor sequence
BASE COUNT 996 a 639 c 574 g 1118 t
ORIGIN
1 aagcttcctg acggaatgtt aattctcgtt gaccctgagc aggctgttga gcccggggat
61 ttctgcatag ccagacttgg gggtgatgag tttaccttca agaaactgat ccatccaaac
121 catttttggg tttagttgct gcttccgctg gtatccttca agatgtcatt gaaggtaagt
181 acgatcttga ggccggcgaa aacaaattca acttttaaat caaaatttta gaataaaagt
241 ctactgcatt aacaaagata aatttctaat ttttttatgt atattatatt ttttggaaga
301 gaaaatattt gtagtgcttc tctacactgt tttttttttt tcgttcatgt tgtctttcct
361 ctcctgaata tatttaattt aataaatcta tcttatctga catatagaag ctgtttcgat
421 agagttttcg actggccaac agttgttact gactggataa aaatttacga tttttcctcc
481 aatcttggct ttcaaaatga gaaataagag tcttgaaatt caaagaatgt atctatgtat
541 gaatatagaa tatgctcaaa tatttagttt tgtataagga agttctaaat aattttccat
601 ggcgtgcttt taccaaagta ctgaacaggg agattattta tcattttttt tcttactctt
661 cgagttgaag tcgattgtag gtgatgacat gaacattaaa taaaatgtac ttgaacaaat
721 cagaccagca aacacccaca tcgcaatggt actgggtttt ctcgttattc tgtggaagaa
781 ttcaattacc ggattattgt taaaaaagcc cattagaagt atcctaacga ttgattaaag
841 taaattgtta ttcacgctac tatcgcaatc gtccaactga accttattct ttattgtata
901 taatctctgc cgagaaattc ttatacttat gtatctggga attcaccagg ttagaatacc
961 ttcccctcct caatttctaa tttttttttc tacccttatt tttctctctt gcttcatcgc
1021 tctcaaagct aaaagctaat acagcaaaca aaagtgttat taggaattca attgagtcgt
1081 gaaagaaaaa taatattcgt cttacttgag gactcacttt ggtcaattga gattaaagtg
1141 tagtctgcac cactgaattc ctcgttttcc tttaaagatc cgggatggct tatctgcggt
1201 acccggactg aaaacgacgt tttagctaaa tattgaacac gttaaaaata agttcaaatt
1261 tcgaggcaga tgaatttgct gagggcactg ttcaagagtg agatatacgc gcaagttgag
1321 gcataatcag taagcaatga atccgtttag aatcttaggt atgttactat ttggagtttc
1381 atgaggcttt tcccgccgta gatcgaaccc aatcttacta acagagaaag ggctttttcc
1441 cgaccatcaa gacaggtgat ttatcacatc taaccagtat actgatcctg attcataata
1501 tcaagaccac aaggtacatt gaaggtattt ctttcaagac ccaaacgttg tacgctttgg
1561 ttttcataac acgatacttg gatctcttga cttttcactg ggtatcccta tacaatgctc
1621 taatgaaaat atttttcatt gtatctaccg cttacattgt agtgctatta caagggtcta
1681 aaagaaccaa caccattgcg tataatgaaa tgcttatgca tgataccttt aagatccagc
1741 atttactaat tgggagtgct ctaatgagtg tttttttcca tcacaagttc acttttcttg
1801 aattagcatg gagtttttct gtatggttgg agagtgtggc tattctacct caattgtaca
1861 tgctatctaa gggtgggaag actagaagtc taactgttca ttatattttt gccatgggat
1921 tatacagagc attgtatatt cctaactgga tttggaggta cagcacggaa gataaaaaat
1981 tggacaagat tgccttcttc gcgggacttt tgcaaactct gttgtactct gatttctttt
2041 acatttacta cactaaagtc atcagaggaa agggtttcaa actgccaaaa taaaaaaaaa
2101 acctgtttta aatagtgttc gcagcaatat atacaaattt caatatatca atgctatacc
2161 aaaagaaaaa aaaaatacat aaaattgatg ttgatattgg tctatttctt tttatactat
2221 gatatgtatg cattaatctc tttttagctc ataaaattct ttccttacac catccttggt
2281 gacaataagg atttccagcc catcacccac ttgtatatgt ctttctgtag cagaagtgaa
2341 cgagtctctc accagtttga tgacttcttc cacggacaag tatttcaaag gctttttgac
2401 tttaccgttt gtacctggct catattgaat ttttgaaatt aacctgattg tccaaaaatg
2461 gcatgatcaa tgatgccgca gcaccacctg ctctacactg ttctctttcg taggagccaa
2521 actgggtcga acgaatagac agcgccctta ccatcttcgt caagacccgc aatgatcgta
2581 tgaacgtagt aagggaaaaa cctcttcccg tacagaagat gttgaatgtt ccttgctgca
2641 gagtttatag atagtttttt gtcgttgtgg tcgaaatggt accattttac actatttttg
2701 aatcttttta ctaaagcgtc gccgtctgct gcaaatccat tcgccgacat gactatgtta
2761 tcaccacaat caaaaacctt gggttcataa cgagaattaa ttgagtaatc ggtgatattt
2821 cttgtatcgc ctgctaacac tgcgaaatct tcacctgcaa tgcccaggat tgtaccacca
2881 ttatcaccgt aaggattgaa ttgatgttca atgggtgtat ttgacgcctc cgaagagtat
2941 tctgatgcaa tagtggccat attttgttta actttatagt tcaatagtct tggctactct
3001 ctttccaact cagttcacct tgttattata ccgcttgttt ttgccaccct ttgagtttct
3061 cgatccttta agttggaaaa gataaaaaca attatcgctc cgtacgaaca ccaaatctct
3121 gcgacgcaga gacggcaata caaaatcaga aaaaaaaaac aatttatcct atgcttcact
3181 agtcacaggc tctacgaaac attctatcgc ctcactgcct ttacatcact taagcaagaa
3241 gttttaaata ttacggcaat tcctttgcag caataccttt actttatttc gagagaaaat
3301 agatcgtaaa atgcctactc caagctt
//
LOCUS YSCLYS2A 4980 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae alpha-aminoadipate reductase (LYS2) gene, complete
cds.
ACCESSION M36287 M14967 M14968
KEYWORDS LYS2 gene; alpha-aminoadipate reductase.
SOURCE S.cerevisiae (strain S288C) DNA, clone pDP6.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 746 and 3921 to 4980)
AUTHORS Fleig,U.N., Pridmore,R.D. and Philippsen,P.
TITLE Construction of LYS2 cartridges for use in genetic manipulations of
Saccharomyces cerevisiae
JOURNAL Gene 46, 237-245 (1986)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 4632)
AUTHORS Morris,M.E. and Jinks-Robertson,S.
TITLE Nucleotide sequence of the LYS2 gene of Saccharomyces cerevisiae
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.Jinks-Robertson, 10-JUL-1990.
Emory University
Dept. of Biology
1510 Clifton Road N.E.
Atlanta, GA 30322
FEATURES from to/span description
pept 297 4475 alpha-aminoadipate reductase (LYS2; EC
1.2.1.31)
BASE COUNT 1523 a 942 c 1026 g 1489 t
ORIGIN Chromosome II.
1 tctagaggca tcgcacagtt ttagcgagga aaactcttca atagttttgc cagcggaatt
61 ccacttgcaa ttacataaaa aattccggcg gtttttcgcg tgtgactcaa tgtcgaaata
121 cctgcctaat gaacatgaac atcgcccaaa tgtatttgaa gacccgctgg gagaagttca
181 agatatataa gtaacaagca gccaatagta taaaaaaaaa tctgagttta ttacctttcc
241 tggaatttca gtgaaaaact gctaattata gagagatatc acagagttac tcactaatga
301 ctaacgaaaa ggtctggata gagaagttgg ataatccaac tctttcagtg ttaccacatg
361 actttttacg cccacaacaa gaaccttata cgaaacaagc tacatattcg ttacagctac
421 ctcagctcga tgtgcctcat gatagttttt ctaacaaata cgctgtcgct ttgagtgtat
481 gggctgcatt gatatataga gtaaccggtg acgatgatat tgttctttat attgcgaata
541 acaaaatctt aagattcaat attcaaccaa cgtggtcatt taatgagctg tattctacaa
601 ttaacaatga gttgaacaag ctcaattcta ttgaggccaa tttttccttt gacgagctag
661 ctgaaaaaat tcaaagttgc caagatctgg aaaggacccc tcagttgttc cgtttggcct
721 ttttggaaaa ccaagatttc aaattagacg agttcaagca tcatttagtg gactttgctt
781 tgaatttgga taccagtaat aatgcgcatg ttttgaactt aatttataac agcttactgt
841 attcgaatga aagagtaacc attgttgcgg accaatttac tcaatatttg actgctgcgc
901 taagcgatcc atccaattgc ataactaaaa tctctctgat caccgcatca tccaaggata
961 gtttacctga tccaactaag aacttgggct ggtgcgattt cgtggggtgt attcacgaca
1021 ttttccagga caatgctgaa gccttcccag agagaacctg tgttgtggag actccaacac
1081 taaattccga caagtcccgt tctttcactt atcgcgacat caaccgcact tctaacatag
1141 ttgcccatta tttgattaaa acaggtatca aaagaggtga tgtagtgatg atctattctt
1201 ctaggggtgt ggatttgatg gtatgtgtga tgggtgtctt gaaagccggc gcaacctttt
1261 cagttatcga ccctgcatat cccccagcca gacaaaccat ttacttaggt gttgctaaac
1321 cacgtgggtt gattgttatt agagctgctg gacaattgga tcaactagta gaagattaca
1381 tcaatgatga attggagatt gtttcaagaa tcaattccat cgctattcaa gaaaatggta
1441 ccattgaagg tggcaaattg gacaatggcg aggatgtttt ggctccatat gatcactaca
1501 aagacaccag aacaggtgtt gtagttggac cagattccaa cccaacccta tctttcacat
1561 ctggttccga aggtattcct aagggtgttc ttggtagaca tttttccttg gcttattatt
1621 tcaattggat gtccaaaagg ttcaacttaa cagaaaatga taaattcaca atgctgagcg
1681 gtattgcaca tgatccaatt caaagagata tgtttacacc attattttta ggtgcccaat
1741 tgtatgtccc tactcaagat gatattggta caccgggccg tttagcggaa tggatgagta
1801 agtatggttg cacagttacc catttaacac ctgccatggg tcaattactt actgcccaag
1861 ctactacacc attccctaag ttacatcatg cgttctttgt gggtgacatt ttaacaaaac
1921 gtgattgtct gaggttacaa accttggcag aaaattgccg tattgttaat atgtacggta
1981 ccactgaaac acagcgtgca gtttcttatt tcgaagttaa atcaaaaaat gacgatccaa
2041 actttttgaa aaaattgaaa gatgtcatgc ctgctggtaa aggtatgttg aacgttcagc
2101 tactagttgt taacaggaac gatcgtactc aaatatgtgg tattggcgaa ataggtgaga
2161 tttatgttcg tgcaggtggt ttggccgaag gttatagagg attaccagaa ttgaataaag
2221 aaaaatttgt gaacaactgg tttgttgaaa aagatcactg gaattatttg gataaggata
2281 atggtgaacc ttggagacaa ttctggttag gtccaagaga tagattgtac agaacgggtg
2341 atttaggtcg ttatctacca aacggtgact gtgaatgttg cggtagggct gatgatcaag
2401 ttaaaattcg tgggttcaga atcgaattag gagaaataga tacgcacatt tcccaacatc
2461 cattggtaag agaaaacatt actttagttc gcaaaaatgc cgacaatgag ccaacattga
2521 tcacatttat ggtcccaaga tttgacaagc cagatgactt gtctaagttc caaagtgatg
2581 ttccaaagga ggttgaaact gaccctatag ttaagggctt aatcggttac catcttttat
2641 ccaaggacat caggactttc ttaaagaaaa gattggctag ctatgctatg ccttccttga
2701 ttgtggttat ggataaacta ccattgaatc caaatggtaa agttgataag cctaaacttc
2761 aattcccaac tcccaagcaa ttaaatttgg tagctgaaaa tacagtttct gaaactgacg
2821 actctcagtt taccaatgtt gagcgcgagg ttagagactt atggttaagt atattaccta
2881 ccaagccagc atctgtatca ccagatgatt cgtttttcga tttaggtggt cattctatct
2941 tggctaccaa aatgattttt accttaaaga aaaagctgca agttgattta ccattgggca
3001 caattttcaa gtatccaacg ataaaggcct ttgccgcgga aattgacaga attaaatcat
3061 cgggtggatc atctcaaggt gaggtcgtcg aaaatgtcac tgcaaattat gcggaagacg
3121 ccaagaaatt ggttgagacg ctaccaagtt cgtacccctc tcgagaatat tttgttgaac
3181 ctaatagtgc cgaaggaaaa acaacaatta atgtgtttgt taccggtgtc acaggatttc
3241 tgggctccta catccttgca gatttgttag gacgttctcc aaagaactac agtttcaaag
3301 tgtttgccca cgtcagggcc aaggatgaag aagctgcatt tgcaagatta caaaaggcag
3361 gtatcaccta tggtacttgg aacgaaaaat ttgcctcaaa tattaaagtt gtattaggcg
3421 atttatctaa aagccaattt ggtctttcag atgagaagtg gatggatttg gcaaacacag
3481 ttgatataat tatccataat ggtgcgttag ttcactgggt ttatccatat gccaaattga
3541 gggatccaaa tgttatttca actatcaatg ttatgagctt agccgccgtc ggcaagccaa
3601 agttctttga ctttgtttcc tccacttcta ctcttgacac tgaatactac tttaatttgt
3661 cagataaact tgttagcgaa gggaagccag gcattttaga atcagacgat ttaatgaact
3721 ctgcaagcgg gctcactggt ggatatggtc agtccaaatg ggctgctgag tacatcatta
3781 gacgtgcagg tgaaaggggc ctacgtgggt gtattgtcag accaggttac gtaacaggtg
3841 cctctgccaa tggttcttca aacacagatg atttcttatt gagatttttg aaaggttcag
3901 tccaattagg taagattcca gatatcgaaa attccgtgaa tatggttcca gtagatcatg
3961 ttgctcgtgt tgttgttgct acgtctttga atcctcccaa agaaaatgaa ttggccgttg
4021 ctcaagtaac gggtcaccca agaatattat tcaaagacta cttgtatact ttacacgatt
4081 atggttacga tgtcgaaatc gaaagctatt ctaaatggaa gaaatcattg gaggcgtctg
4141 ttattgacag gaatgaagaa aatgcgttgt atcctttgct acacatggtc ttagacaact
4201 tacctgaaag taccaaagct ccggaactag acgataggaa cgccgtggca tctttaaaga
4261 aagacaccgc atggacaggt gttgattggt ctaatggaat aggtgttact ccagaagagg
4321 ttggtatata tattgcattt ttaaacaagg ttggattttt acctccacca actcataatg
4381 acaaacttcc actgccaagt atagaactaa ctcaagcgca aataagtcta gttgcttcag
4441 gtgctggtgc tcgtggaagc tccgcagcag cttaaggttg agcattacgt atgatatgtc
4501 catgtacaat aattaaatat gaattaggag aaagacttag cttcttttcg ggtgatgtca
4561 cttaaaaact ccgagaataa tatataataa gagaataaaa tattagttat tgaataagaa
4621 ctgtaaatca gctggcgtta gtctgctaat ggcagcttca tcttggttta ttgtagcatg
4681 aatcatattt gccttttttt cctgtaattc aatgattctt gcttctatac tatcctcaat
4741 gcaaaacctt gtgatcttca caggtcgata ctgaccaatt ctatgaactc tatcaccact
4801 ttgccattca acactagggt tccaccatgg gtctaaaatg aatacttgcg aagcttcaca
4861 aagattcaaa gcaacaccgc ccgcctttaa actgaccaag aaaacctcgc attgaatgtt
4921 gttcatgaaa tacttgatgg tttcatctct ttgcgtcggt gacatactac cctgaagctt
//
LOCUS YSCMTATPSA 2820 bp ds-DNA ORG 14-AUG-1990
DEFINITION Yeast mitochondrial oxi3 gene, 3' end, ATPase subunit 3 (aap1) and
ATPase subunit 6 (oli2) gene, complete cds.
ACCESSION M36379
KEYWORDS oli2 region.
SOURCE Yeast (strain J69-1B) mitochondrial DNA, clone.
ORGANISM Mitochondrion Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE 1 (bases 1 to 2820)
AUTHORS Novitski,C.E., Macreadie,I.G., Maxwell,R.J., Lukins,H.B.,
Linnane,A.W. and Nagley,P.
TITLE Biogenesis of mitochondria: Genetic and molecular analysis of the
oli2 region of mitochondrial DNA in Saccharomyces cerevisiae
JOURNAL Curr. Genet. 8, 135-146 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 63 cytochrome c oxidase subunit I (oxi3)
pept 1019 1165 ATPase subunit 3 (aap1)
pept 1872 2651 ATPase subunit 6 (oli2)
binding 701 709 ribosomal binding site C (put.)
binding 1814 1821 ribosomal binding site A (put.)
binding 1835 1845 ribosomal binding site B (put.)
BASE COUNT 1159 a 200 c 183 g 1278 t
ORIGIN
1 gaattcttat taacttctcc accagctgta cactcattta atacaccagc tgtacaatct
61 taagttataa tatttaatta tttacttaat aattaaaaaa aaaagtaaat attatatcta
121 aaacttaata atataataat aatattctta taaaaaaaaa tatataaaaa aaaaatatat
181 aaaatttatt aaaatatctt aatatattat cctttttggg aactatataa ataaatacta
241 atataatcct attatatata tatatatata aaataatata tatatataat taatataaat
301 aatatttata attatttttt taataatata tataatttaa tatattaatt aatattatat
361 aattattaaa tatattataa tattattatt attttataat aaaaatattt ttaatactaa
421 ttattattta ttatttataa atatataaat agtatgttta atattattaa tactaaaaaa
481 aagtataatt ataattagga tctacaatac atttatctga taaatattaa tattaatatt
541 aatatttata ttaataaagg gattaaatta attgtatcca atttaattaa attatagata
601 tattatttat aatattaata tattgtttta ttaaaaaggt aaaaatagtt tttattttat
661 atataaatat aggatataaa taaatatatt atatatatat taagaatata tttatatttt
721 acatataatt atttataata taaatatctc cgcaaagccg gattaatgta attatttaat
781 aattttattt aataatttat taaaataaat atttacattt aataatattt atattatgtc
841 agttatttta tattaatgtt taatctatta taatattttt ttttataaat atattattta
901 tattaataaa tatatatatt atttttataa tatatatata tttttattaa atatttatta
961 aatatttatt aaattattat aatgttgtta ttaatcttat taaaaaatat atataaaaat
1021 gccacaatta gttccatttt attttatgaa tcaattaata tatggtttct tattaatgat
1081 tctattatta attttattct cacaattctt tttacctatg atcttaagat tatatgtatc
1141 tagattattt atttctaaat tataatatat attattaatt tatttattca tataaatatt
1201 attattatat ataaatatta ataatattta tacttattta ataataataa taaaataaaa
1261 aaaataatta taatttaata tatttaatat atttccttac agactatata tttatatata
1321 tatattaaat acaatttaat ttaatttaat tatgttattt attaaataaa gttatattat
1381 aatataataa caatattata tattattata taattataat atattttaat ataattatca
1441 aaagaaataa taaaaaatat taataagaat ataatttaat aattattaaa aaaattctta
1501 tttatagtcc ggtccgcccc cgcggggcgg accccaaagg aggagtaata aaaattatta
1561 aatacaaata ttatatatat ataattcatt atatatatat atatataata attaatctta
1621 ttttttatat atttatttat atatctattt atatttatat tttatatata tttatttata
1681 tatctaaggg gtttggtccc tcccccgtaa gtataagtat acgggggagg gtccctcact
1741 atttatattt atatttatat ttatatttat atttatattt tatatatttt atatatttat
1801 aaataaagta taataagata taattatgat taattattta taagttatag ttttataaat
1861 ttataattat tatgtttaat ttattaaata catatattac atcaccatta gatcaatttg
1921 agattagact attatttggt ttacaatcat catttattga tttaagttgt ttaaatttaa
1981 caacattttc attatatact attattgtat tattagttat tacaagttta tatctattaa
2041 ctaataataa taataaaatt attggttcaa gatgattaat ttcacaagaa gctatttatg
2101 atactattat aaatatgctt aaaggacaaa ttggaggtaa aaattgaggt ttatatttcc
2161 ctatgatctt tacattattt atgtttattt ttattgctaa tttaattagt atgattccat
2221 attcatttgc attatcagct catttagtat ttattatctc tttaagtatt gttatttgat
2281 taggtaatac tattttaggt ttatataaac atggttgagt attcttctca ttattcgtac
2341 ctgctggtac accattacca ttagtacctt tattagttat tattgaaact ttatcttata
2401 ttgctagagc tatttcatta ggtttaagat taggttctaa tatcttagct ggtcatttat
2461 taatggttat tttagctggt ttactattta attttatgtt aattaattta tttactttag
2521 tattcggttt tgtaccttta gctatgatct tagctattat gattttagaa ttcgctattg
2581 gtattatcca atcttatgtt tgacttatct taacagcatc atacttaaaa gatacattat
2641 acttacatta aattataaaa taaaaattat ataaaaaaaa tataataata ataataatat
2701 aaaataaaaa ataaaaaata aaacaatgaa aaaacaaaat ttaaattcta ttttattaat
2761 gtatattaat tatattatta attattttaa taatattcat aaaaatcaat taaaaaaaga
//
LOCUS YSCSPO7A 1846 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae SPO7 gene, complete cds, and FUN14 and FUN37 genes, 5'
end.
ACCESSION M36073
KEYWORDS antimutator phenotype; premeiotic DNA replication.
SOURCE S.cerevisiae (strain AB972) DNA, clone lambda-G12b and pLF31.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1846)
AUTHORS Whyte,W., Koepp,L.H., Lamb,J., Crowley,J.C. and Kaback,D.B.
TITLE Molecular cloning of chi I DNA from S.cerevisiae
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.B.Kaback, 05-JUL-1990.
FEATURES from to/span description
pept 564 1343 SPO7 protein
pept 1624 > 1832 FUN14 protein
pept 375 < 1 (c) FUN37 protein
BASE COUNT 556 a 308 c 415 g 567 t
ORIGIN Chromosome I.
1 gtcgactgtg gtgttgtcac tactcaacgt attcgcacta ctaacactga aattgaggtt
61 tggttgcaat tgtctgtatg tttcggtggc atcttgtaat gggatatcag tagagttgcg
121 catgaatttc tccaattgct gtgcatcgga gtataaataa ctcagagaac catttatcct
181 ggacctcgta gaaaaatcta aagaattgaa tgtattggga gtagatttgt tggaaatttg
241 caggtgtatt gctgagggaa ttcggaaatc taataatgtt ctcgatgtgg ccgttatatc
301 ctcgtagcta ttttgcgtac tccaatgggt gctctgataa aatgccctta gtacttggtc
361 catatagggt agcatcaaga tcggtcttct ctgttcgtgt ctttttccta acgtatattt
421 gctttgtttc ttcactcaac aataaagtca aagtaaaatt aaatactaat tattcttaaa
481 agggaagatg cgaaatttag cgaaaatcta ttgattatac acacaaagga agaaaggtag
541 tggaaagcta aataaaggag gtcatggagc cagagagcat aggcgatgtg gggaaccatg
601 cccaggatga tagtgccagt atagtgtccg ggcctcgcag gcgttctact agcaagacat
661 ccagtgcgaa gaatatacgg aactccagta atatctctcc agcatcgatg attttcagga
721 atttgttgat actggaggat gatttaagac gccaagctca cgaacaaaag atactgaagt
781 ggcaattcac tttgttctta gcgtctatgg ccggtgtagg cgcatttacc ttctacgaac
841 tttatttcac ttcagattat gtcaagggcc tccatagggt tattttgcaa ttcactcttt
901 ctttcatttc cattactgta gttctttttc atatcagtgg acaatataga agaactatcg
961 tcattccaag aagatttttt acctctacta ataaagggat taggcagttt aatgtgaagc
1021 tagttaaagt acagtctacg tgggacgaga aatacacaga ttcagtaaga tttgtgagtc
1081 gaacaattgc ttattgtaat atttattgtt tgaaaaaatt tctgtggctt aaagacgata
1141 atgccattgt gaaattttgg aaaagtgtca cgatacaatc ccaaccgagg atcggagctg
1201 tggatgtgaa attagtcctc aaccccagag catttagtgc agagattaga gaaggatggg
1261 agatttatag agacgagttt tgggccaggg aaggtgctag aagacgcaaa caagcgcacg
1321 aactccgacc taaatcagaa tgaaagagtt ggagggcttc ttccttcgaa taagaggtca
1381 tatttaccta tgtaaaattg taaccatcta tgttcacaca taaattatat tttatacatt
1441 attagaagtg aagctgttgt gtcgtgaaaa ttttacaaat ccgtcatttc atatttaagt
1501 tttccaacaa gtgctagaaa acctaggggt tgttgaaatt ggttaaacaa ggcatcttat
1561 tatacataca acagcataac gctagagggg caagaaggaa gaacttaaaa taataggtgt
1621 aaaatgactt tggcttttaa tatgcaacgg ttggtgtttc gtaatttgaa tgttgggaag
1681 cgcatgttca agaacgtccc cttatggagg tttaatgtcg ccaataaatt aggaaagccc
1741 ttaactcgct ctgtagggtt aggcggtgct ggcatagttg ctggtggctt ttacttgatg
1801 aatcgccagc cttctaagtt gatattcaat gattctttag gggcag
//
LOCUS YSKERD2A 1248 bp ds-DNA PLN 14-AUG-1990
DEFINITION K.lactis ER lumen protein retaining receptor (ERD2) gene, complete
cds.
ACCESSION M34844
KEYWORDS ER lumen protein retaining receptor; HDEL receptor.
SOURCE K.lactis (SEY2102) DNA.
ORGANISM Kluyveromyces lactis
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1248)
AUTHORS Lewis,M.J., Sweet,D.J. and Pelham,H.R.B.
TITLE The ERD2 gene determines the specificity of the luminal ER protein
retention system
JOURNAL Cell 61, 1359-1363 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Lewis, 06-JUN-1990, for release after publication.
FEATURES from to/span description
pept 390 414 ER lumen protein retaining receptor, exon 1
(ERD2)
572 1206 ER lumen protein retaining receptor, exon 2
IVS 415 571 ERD2 intron A
site 415 420 consensus splice donor
site 528 534 branchpoint sequence
site 569 571 acceptor sequence
BASE COUNT 368 a 235 c 202 g 443 t
ORIGIN
1 gaattcttca ctcttcaccc ctcctctcta acgagactcg gataaacaga ctgtaacgat
61 gacgacgacg acgatgatga taatgattat gatgatgatg attgaatgtg ccatttgacg
121 tttttgacct gtaaacccct tttgtaaata gcctctgtcc tttttaagta cttatgcggt
181 gtctgttaat acaatctttg tggctcacgt gatttattat ttttctcaat gtgggccaaa
241 attgcaatct cttaactctg aaaaataaaa caaggcagta ctaatcgctt cttaaacact
301 tgttctttaa agattgaaat aaactaccga catttgtcgt tctaagttaa attcaaatca
361 aaacagcttt caattgagtg tgtcacatta tgttgaacgt tttcagaata gcaggtatgt
421 gaaaagaact gattttatat cattgttgga cgtcaacctc gatcggaacg ttatatcttg
481 gttctgaatg atgatttcca tcggcttatc tctactggca gcatctttac taacattgtt
541 attaccaact attttgtaat cgagtgttta ggtgatttct ctcatttggc tagtatcatc
601 attttgatac aatcgatcac aacatctaac tcagttgatg gtatctcatt gaaaactcaa
661 ctgctataca ccttggtctt tatcacacgt tatttgaacc tatttaccaa atggacctcc
721 ttgtacaact tcttaatgaa aattgttttc atttcatctt cggtttacgt cattgtgtta
781 atgcgccaac aaaaatttaa aaaccctgtc gcatatcaag acatgatcac cagagatcaa
841 tttaaaatca agtttttaat agtaccatgc attctcctag gattaatttt caattatcgt
901 ttcagtttta tacaaatatg ctggtccttc tctctatggt tggaaagtgt tgcaatcctt
961 cctcaattgt ttatgttgac taaaacaggt aaagcaaaac aattgacatc tcattatatt
1021 tttgcattgg ggttataccg tgccttgtac attccaaatt ggatatggag gtattatacc
1081 gaagaaagat tcgataaatt atcagttttc accggtgtga ttcaaacttt agtgtactct
1141 gatttcttct acatctatta tcagaaggtt atcaaactcg gtggagattt agaattacca
1201 caatagatga caggatttgc atctccactt gcagtcagtt cgaagctt
//
LOCUS CELCKINII 3180 bp ds-DNA INV 14-AUG-1990
DEFINITION C.elegans casein kinase II-alpha gene, complete cds.
ACCESSION J05274
KEYWORDS casein kinase II-alpha.
SOURCE C.elegans DNA.
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 3180)
AUTHORS Hu,E. and Rubin,C.S.
TITLE Casein kinase II from Caenorhabditis elegans: Properties and
developmental regulation of the enzyme; Cloning and sequence
analyses of cDNA and the gene for the catalytic subunit
JOURNAL J. Biol. Chem. 265, 5072-5080 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 250 288 casein kinase II-alpha, exon 1
533 703 casein kinase II-alpha, exon 2
763 890 casein kinase II-alpha, exon 3
940 1024 casein kinase II-alpha, exon 4
1084 1380 casein kinase II-alpha, exon 5
1838 1962 casein kinase II-alpha, exon 6
2018 2099 casein kinase II-alpha, exon 7
2301 2456 casein kinase II-alpha, exon 8
pre-msg 211 3073 casein kinase II-alpha mRNA and introns
IVS 289 532 casein kinase II-alpha intron A
IVS 704 762 casein kinase II-alpha intron B
IVS 891 939 casein kinase II-alpha intron C
IVS 1025 1083 casein kinase II-alpha intron D
IVS 1381 1837 casein kinase II-alpha intron E
IVS 1963 2017 casein kinase II-alpha intron F
IVS 2100 2300 casein kinase II-alpha intron G
BASE COUNT 974 a 580 c 544 g 1082 t
ORIGIN
1 actttcgcag cgggaatatc tcaccacgca cgagttcaca tttggttttt tctctgcgct
61 ctctcgcatt tgcgtactct tttcatgttc tggcatgtat tttcaaaatt attcgggttt
121 tatcagtttt cccgagattt tactatttct actgaaaatt tcgttctaat aattcaatta
181 taacagccgt tttgtcaaaa cttgctcaaa aataacgtgt tacttgtttc agatcgttat
241 tcttgggaaa tgcctccgat tcctagccgg gcgagggtct acgccgaggt gagcaagttt
301 gtatagccag gagttaatta ttgaacagaa tatattgctc atacataact cacaatgacg
361 gaattaacgt tatccgtttt agttgtggaa tactgtgaaa cactgtacgc cacatttgtt
421 agattttata aatgttcata ggaatttata gtctcgtaaa gaaaaactat ttgtaaaagt
481 cactaaatta tgtttttatg agtttatata ttataaatca caattgtttc aggtgaatcc
541 gtcgagacca cgcgagtact gggactatga ggctcacatg atcgaatggg gacaaatcga
601 tgattatcag ctggtacgca agttgggacg tggaaaatat tcggaggttt ttgaaggatt
661 caagatgagc actgacgaga aagttgttgt caagattctc aaggtaagtt ttacaaaaat
721 aattgtaata aagctttcaa aatgtaattt gtttgttttc agccagtcaa aaagaagaag
781 atcaagcgtg agattaaaat tttggagaat cttcgtggcg ggacaaacat catcaccctc
841 ctcgacgtcg tcaaggatcc tatttcgaga actccggctc ttattttcga gttggtttac
901 tatttgaaga tttttagaat taataatttc tatttgaagg catgttaaca attcggactt
961 caagcaattg tatcaaaccc tcagtgacta tgatatccgt tactacctct acgagctctt
1021 gaaggtactt ttttcgtctg tccattaatt tttgaattca taacgtttca atttattctc
1081 caggctctcg acttctgcca ttctcaagga attatgcaca gagatgttaa gccgcacaat
1141 gtgatgattg acgccgaaaa acgggaactt cgattaatag attggggatt agccgaattc
1201 tatcatccaa gacaggacta taatgttcgt gttgcttcta gatatttcaa gggacccgag
1261 cttcttgtcg attatcaatg ttacgattat tcgttggata tgtggagttt gggatgcatg
1321 ttggctagta tgatcttcag gaaggaacca ttcttccacg gccatgataa ctacgatcag
1381 gtaagcactt taagctgaag tagcaaagaa ccgcaaatcc acaaatttca ttaaaatcca
1441 atattggaat atagaaaaca tttgatttat ctgattatgg cttttatttc agtgagaaca
1501 atggggaaaa acgcttaaaa aattaccgga aagtttaaaa tttccgaaaa aaaccgaaaa
1561 ttttccaatt ttgagcattt ttcaattgtt ccagctgcaa aaaactattt tcagataaag
1621 aaattatttt tttgaattcc atagctaaaa ttttactgat aagtttggct tattccgtag
1681 aatttcttgc cggcgattga ttcgcacccc tgagctgaaa tttaaatgaa taaataatat
1741 tgtcagggta tttattaaat attcatcaat aatttttatg atttttaata tttataaatt
1801 cacgaaatct gattttctta taaaaagttt tttgcagcta gtccgcattg caaaggtatt
1861 gggaactgat gagctttacg agtacattgc gagatatcac attgatctcg atccacgctt
1921 caacgatatt ctcgggcgac actcaagaaa acgctgggaa aggtatattt tgatcgaagc
1981 gaagtttcat gtttttctta aaattcattt gttgtagatt tatccacgct gaaaatcaac
2041 atcttgtcac cccagaggct cttgatttcc ttgataaact tcttcgctac gatcacgcgg
2101 taagaatcgg aaaaaagaga aattacacca aattagttac tgaaaaccag tttaacaaga
2161 acacagagaa ataaaaacaa ttttcacttg ttttattgaa aaaaaagaca tttgatactc
2221 gaaactaaac tataaaatat cttttatttg ttaaattaat ttatttccag aatcttttaa
2281 ttaaatttca ccatattcag gaacgcctaa ctgctcaaga ggccatgggt cacgaatact
2341 tccgcccagt tgttgaagca catgctcgtg ccaatggaac tgaacaagct gacggacagg
2401 gtgcatcgaa ttccgcttct tctcaatcct ccgatgccaa aattgacggc gcttaaaatt
2461 tctttctatt ttttttttaa ttttcctgtt gactatttcc ttttttatta atttccacac
2521 cactcctctc catcattatt cctttgctct aaaaattggc taagatatta aatcttcccc
2581 gcttctccga aaatttccct ctacatacca caaaagtgcc taatcttttc gtgagaaacg
2641 ttctttgcaa attctatttt taggattttt ttgtgtaatt catgcctctg aaattccgtt
2701 tcaattttaa atcttttttg tcccatttca tattgttctc cttcttctcc cctttcctct
2761 tcctcaaaca accacaatca tcatttttat aaatcgaatc acagttcgtg agaaaaaaaa
2821 ctgttacttg tatatcaata ttgtctttgg ttttgaaaaa cctactattg ttcctatgtt
2881 gagttgtgaa ctatgttcac ttttagagtt ttagatttaa aatgctcaac ggacgagtgc
2941 tccttcgatc gattgctcgc cggacaatgg catcgaaaat tgagccagga tctcgcagcc
3001 gatttatttg gatgtccagg ctaccgcgcc aatggtaatt aaatggaaat aataaagcat
3061 gattaaaaat aatataaaaa acttgtatct ggaaaaaatt aacgaaactt ttcagataca
3121 tttagatttc gaatgagttt ttttctaact aggcacagtt tagctagtag gcattctaaa
//
LOCUS DMORGX 1650 bp ds-DNA BCT 14-AUG-1990
DEFINITION D.mobilis ribosomal 16S RNA gene.
ACCESSION M36474
KEYWORDS ribosomal 16S RNA.
SOURCE D.mobilis DNA.
ORGANISM Desulfurococcus mobilis
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Thermoproteales; Desulfurococcaceae.
REFERENCE 1 (bases 1 to 1650)
AUTHORS Kjems,J., Garrett,R.A. and Ansorge,W.
TITLE The sequence of the 16S RNA gene and its flanking region from the
archaebacterium Desulfurococcus mobilis
JOURNAL Syst. Appl. Microbiol. 9, 22-28 (1987)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 71 1565 16S ribosomal RNA
BASE COUNT 323 a 492 c 602 g 233 t
ORIGIN
1 ccgacgaggg ggagggagcc acttaagccg agcgctccag aaaccccggc cccaacccca
61 agcggccgta actccggttg atcctgccgg tcccgaccgc tatcggggtg gggctaagcc
121 atgggagtcg cacgctccgc cgctgcgggg cgtggcggac ggctgagtaa cacgtggcta
181 acctaccctc gggaggggga taacaccggg aaactggtgc taatccccca taggggagga
241 ggcctggaag ggttcctccc cgaaagggtg tggcaggggt taacgctgct acaccgcccg
301 aggatggggc tacggcccat taggttgttg gcggggtaac ggcccgccaa gccgataatg
361 ggtaggggcc gtgagagcgg gagcccccag atgggcactg agacaagggc ccaggcccta
421 cggggcgcac caggcgcgaa acctccgcaa tgcgggaaac cgtgacgggg ccaccccgag
481 tgccccctta cgggggcttt tccccgctgt aggaaggcgg gggaataagc ggggggcaag
541 tctggtgtca gccgccgcgg taataccagc cccgcgagtg gtcgggacga ttattgggcc
601 taaagcgccc gtagccggcc cggcaagtcc cctcctaaat tcccgggctc aacccgggga
661 ctggagggga tactgccggg ctagggggtg ggagaggccg agggtactcc cggggtaggg
721 gcgaaatcct ataatcccgg gaggaccacc agtggcgaag gcgctcggct ggaacacgcc
781 cgacggtgag gggcgaaagc cgggggagcg aaccggatta gatacccggg tagtcccggc
841 tgtaaacgat gcgggctagg tgttgggtgg gcttagagcc cacccagtgc cgcagggaag
901 ccgttaagcc cgccgcctgg ggagtacggc cgcaaggctg aaactcaaag gaattggcgg
961 gggagcacca caaggggtgg agcctgcggt tcaattggag tcaacgccgg gaatctcacc
1021 gggggagaca gcaggatgac ggccaggtta aaggccttgc ctgacgcgct gagaggaggt
1081 gcatggccgt cgccagctcg tgctgtgaag tgtccggtta agtccggaaa cgagcgagac
1141 ccccacccct agttgctacc cggggctacg gctccggggc acactagggg gactgccgcc
1201 gtttaaggcg gaggaaggag ggggccacgg caggtcagca tgccccgaac cccccgggct
1261 acacgcgggc tacaatggcg gggacagcgg gatccgaccc cgaaaggggg aggcaatccc
1321 tcaaaccccg ccgtggttgg gatcgagggc tgcaactcgc cctcgtgaac gaggaatccc
1381 tagtaaccgc gcgtcaacat cgcgcggtga atacgtccct gctccttgca cacaccgccc
1441 gtcgctccac ccgaggggag ggggagtgag gcccggcccc ttgggtcggg tcgaactccc
1501 cctccctgag gggggagaag tcgtaacaag gtagccgtac cggaaggtgc ggctggatca
1561 cctcctgcct caggccgggg ctggggccgg ggctggaatg cgctaaggct taagtggctc
1621 ccgcctcctc tattcattga tgcaacacgg
//
LOCUS ECOHBB221 3703 bp ds-DNA BCT 14-AUG-1990
DEFINITION E.coli beta-globin gene, complete cds.
ACCESSION M34058
KEYWORDS beta-globin.
SEGMENT 1 of 2
SOURCE E.coli (strain HB101) DNA, isolated from patient GD.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 3703)
AUTHORS Thein,S.L., Hesketh,C., Taylor,P., Temperley,I.J., Hutchinson,R.M.,
Old,J.M., Wood,W.G., Clegg,J.B. and Weatherall,D.J.
TITLE Molecular basis for dominantly inherited inclusion body
beta-thalassemia
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.L.Thein, 07-MAY-1990.
Author address: S.L.Thien
Institute of Molecular Medicine,
MRC Molecular Haematology Unit
John Radcliffe Hospital
Oxford, OX3 9DU U.K.
FEATURES from to/span description
pept 1612 1703 beta-globin, exon 1
1834 2056 beta-globin, exon 2
2907 + 3015 beta-globin, exon 3
IVS 1704 1833 beta-globin intron A
IVS 2057 2906 beta-globin intron B
IVS 3016 > 3703 beta-globin intron C
signal 1484 1492 CAAT box
signal 1532 1538 TATA box
BASE COUNT 1055 a 707 c 736 g 1205 t
ORIGIN
1 tctatttatt tagcaataat agagaaagca tttaagagaa taaagcaatg gaaataagaa
61 atttgtaaat ttccttctga taactagaaa tagaggatcc agtttctttt ggttaaccta
121 aattttattt cattttattg ttttatttta ttttatttta ttttatttta ttttgtgtaa
181 tcgtagtttc agagtgttag agctgaaagg aagaagtagg agaaacatgc aaagtaaaag
241 tataacactt tccttactaa accgacatgg gtttccaggt aggggcagga ttcaggatga
301 ctgacagggc ccttagggaa cactgagacc ctacgctgac ctcataaatg cttgctacct
361 ttgctgtttt aattacatct tttaatagca ggaagcagaa ctctgcactt caaaagtttt
421 tcctcacctg aggagttaat ttagtacaag gggaaaaagt acagggggat gggagaaagg
481 cgatcacgtt gggaagctat agagaaagaa gagtaaattt tagtaaagga ggtttaaaca
541 aacaaaatat aaagagaaat aggaacttga atcaaggaaa tgattttaaa acgcagtatt
601 cttagtggac tagaggaaaa aaataatctg agccaagtag aagacctttt cccctcctac
661 ccctactttc taagtcacag aggctttttg ttcccccaga cactcttgca gattagtcca
721 ggcagaaaca gttagatgtc cccagttaac ctcctatttg acaccactga ttaccccatt
781 gatagtcaca ctttgggttg taagtgactt tttatttatt tgtatttttg actgcattaa
841 gaggtctcta gttttttatc tcttgtttcc caaaacctaa taagtaacta atgcacagag
901 cacattgatt tgtatttatt ctatttttag acataattta ttagcatgca tgagcaaatt
961 aagaaaaaca acaacaaatg aatgcatata tatgtatatg tatgtgtgta catatacaca
1021 tatatatata tatttttttt cttttcttac cagaaggttt taatccaaat aaggagaaga
1081 tatgcttaga actgaggtag agttttcatc cattctgtcc tgtaagtatt ttgcatattc
1141 tggagacgca ggaagagatc catctacata tcccaaagct gaattatggt agacaaaact
1201 cttccacttt tagtgcatca atttcttatt tgtgtaataa gaaaattggg aaaacgatct
1261 tcaatatgct taccaagctg tgattccaaa tattacgtaa atacacttgc aaaggaggat
1321 gtttttagta gcaatttgta ctgatggtat ggggccaaga gatatatctt agagggaggg
1381 ctgagggttt gaagtccaac tcctaagcca gtgccagaag agccaaggac aggtacggct
1441 gtcatcactt agacctcacc ctgtggagcc acaccctagg gttggccaat ctactcccag
1501 gagcagggag ggcaggagcc agggctgggc ataaaagtca gggcagagcc atctattgct
1561 tacatttgct tctgacacaa ctgtgttcac tagcaacctc aaacagacac catggtgcac
1621 ctgactcctg aggagaagtc tgccgttact gccctgtggg gcaaggtgaa cgtggatgaa
1681 gttggtggtg aggccctggg caggttggta tcaaggttac aagacaggtt taaggagacc
1741 aatagaaact gggcatgtgg agacagagaa gactcttggg tttctgatag gcactgactc
1801 tctctgccta ttggtctatt ttcccaccct taggctgctg gtggtctacc cttggaccca
1861 gaggttcttt gagtcctttg gggatctgtc cactcctgat gctgttatgg gcaaccctaa
1921 ggtgaaggct catggcaaga aagtgctcgg tgcctttagt gatggcctgg ctcacctgga
1981 caacctcaag ggcacctttg ccacactgag tgagctgcac tgtgacaagc tgcacgtgga
2041 tcctgagaac ttcagggtga gtctatggga cccttgatgt tttctttccc cttcttttct
2101 atggttaagt tcatgtcata ggaaggggag aagtaacagg gtacagttta gaatgggaaa
2161 cagacgaatg attgcatcag tgtggaagtc tcaggatcgt tttagtttct tttatttgct
2221 gttcataaca attgttttct tttgtttaat tcttgctttc tttttttttc ttctccgcaa
2281 tttttactat tatacttaat gccttaacat tgtgtataac aaaaggaaat atctctgaga
2341 tacattaagt aacttaaaaa aaaactttac acagtctgcc tagtacatta ctatttggaa
2401 tatatgtgtg cttatttgca tattcataat ctccctactt tattttcttt tatttttaat
2461 tgatacataa tcattataca tatttatggg ttaaagtgta atgttttaat atgtgtacac
2521 atattgacca aatcagggta attttgcatt tgtaatttta aaaaatgctt tcttctttta
2581 atatactttt ttgtttatct tatttctaat actttcccta atctctttct ttcagggcaa
2641 taatgataca atgtatcatg cctctttgca ccattctaaa gaataacagt gataatttct
2701 gggttaaggc aatagcaata tttctgcata taaatatttc tgcatataaa ttgtaactga
2761 tgtaagaggt ttcatattgc taatagcagc tacaatccag ctaccattct gcttttattt
2821 tatggttggg ataaggctgg attattctga gtccaagcta ggcccttttg ctaatcatgt
2881 tcatacctct tatcttcctc ccacagctcc tgggcaacgt gctggtctgt gtgctggccc
2941 atcactttgg caaagaattc accccaccag tgcagccaca cctatcagtg gtgtggctaa
3001 tgccctggcc cacaagtatc actaagctcg ctttcttgct gtccaatttc tattaaaggt
3061 tcctttgttc cctaagtcca actactaaac tgggggatat tatgaagggc cttgagcatc
3121 tggattctgc ctaataaaaa acatttattt tcattgcaat gatgtattta aattatttct
3181 gaatatttta ctaaaaaggg aatgtgggag gtcagtgcat ttaaaacata aagaaatgaa
3241 gagctagttc aaaccttggg aaaatacact atatcttaaa ctccatgaaa gaaggtgagg
3301 ctgcaaacag ctaatgcaca ttggcaacag ccctgatgcc tatgccttat tcatccctca
3361 gaaaaggatt caagtagagg cttgatttgg aggttaaagt tttctatgct gtattttaca
3421 ttacttattg ttttagctgt cctcatgaat gtcttttcac tacccatttg cttatcctgc
3481 atctctcagc cttgactcca ctcagttctc ttgcttagag ataccacctt tcccctgaag
3541 tgttccttcc atgttttacg gcgagatggt ttctcctcgc ctggccactc agccttagtt
3601 gtctctgttg tcttatagag gtctacttga agaaggaaaa acagggggca cggtttgact
3661 gtcctgtgag cccttcttcc ctgcctcccc cactcacagt gac
//
LOCUS ECOHBB222 552 bp ds-DNA BCT 14-AUG-1990
DEFINITION E.coli beta-globin gene, complete cds.
ACCESSION M34059
KEYWORDS beta-globin.
SEGMENT 2 of 2
SOURCE E.coli (strain HB101) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 552)
AUTHORS Thein,S.L., Hesketh,C., Taylor,P., Temperley,I.J., Hutchinson,R.M.,
Old,J.M., Wood,W.G., Clegg,J.B. and Weatherall,D.J.
TITLE Molecular basis for dominantly inherited inclusion body
beta-thalassemia
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by S.L.Thein, 07-MAY-1990.
Author address: S.L.Thein
Institute of Molecular Biology,
MRC Molecular Haematology Unit
John Radcliffe Hospital,
Oxford, OX3 9DU, U.K.
FEATURES from to/span description
pept + 169 272 beta-globin, exon 4
IVS < 1 168 beta-globin intron C
BASE COUNT 153 a 113 c 108 g 178 t
ORIGIN
1 tctgcatata aattgtaact gatgtaagag gtttcatatt gctaatagca gctacaatcc
61 agctaccatt ctgcttttat tttatggttg ggataaggct ggattattct gagtccaagc
121 taggcccttt tgctaatcat gttcatacct cttatcttcc tcccacagct cctgggcaac
181 gtgctggtct gtgtgctggc ccatcacttt ggcaaataat tcaccccacc agtgcaggct
241 gcctatcaga aagtggtggc tggtgtggct aatgccctgg cccacaagta tcactaagct
301 cgctttcttg ctgtccaatt tctattaaag gttcctttgt tccctaagtc caactactaa
361 actgggggat attatgaagg gccttgagca tctggattct gcctaataaa aaacatttat
421 tttcattgca atgatgtatt taaattattt ctgaatattt tactaaaaag ggaatgtggg
481 aggtcagtgc atttaaaaca taaagaaatg aagagctagt tcaaaccttg ggaaaataca
541 ctatatctta aa
//
LOCUS HUMETN1 682 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human endothelin (ETN) gene, complete cds.
ACCESSION M25377 J04819 J05489
KEYWORDS endothelin.
SEGMENT 1 of 4
SOURCE Human leukocyte DNA and human umbilical vein endothelial cell cDNA
to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 682)
AUTHORS Bloch,K.D., Friedrich,S.P., Lee,M.-E., Eddy,R.L., Shows,T.B. and
Quertermous,T.
TITLE Structural organization and chromosomal assignment of the gene
encoding endothelin
JOURNAL J. Biol. Chem. 264, 10851-10857 (1989)
STANDARD simple staff_review
REFERENCE 2 (bases 2 to 421)
AUTHORS Lee,M.-E., Bloch,K.D., Clifford,J.A. and Quertermous,T.
TITLE Functional analysis of the endothelin-1 gene promoter: Evidence for
an endothelial cell-specific cis-acting sequence
JOURNAL J. Biol. Chem. 265, 10446-10450 (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer readable copy of sequence [1] kindly
submitted by K.D.Bloch (06-AUG-89).
FEATURES from to/span description
pept 519 + 582 endothelin prepropeptide, exon 1
/nomgen="EDN1" /map="unassigned"
/hgml_locus_uid="LU0066V"
pre-msg 252 > 682 ETN mRNA and introns (5' end put.)
IVS 583 > 682 ETN intron A
BASE COUNT 121 a 178 c 169 g 213 t 1 others
ORIGIN Chromosome 6.
1 ggcaggttta gcaaaggtct ctaatgggna ttttattttt cttagccctg cccccgaatt
61 gtcagacggc ggcgtctgcc tctgaagtta gcagtgattt ctttcgggcc tggccttatc
121 tccggctgca cgttgcctgt tggtgactaa taacacaata acattgtctg gggctggaat
181 aaagtcggag ctgtttaccc ccactctaat aggggttcaa tataaaaagc cggcagagag
241 ctgtccaagt cagacgcgcc tctgcatctg cgccaggcga acgggtcctg cgcctcctgc
301 agtcccagct ctccaccgcc gcgtgcgcct gcagacgctc cgctcgctgc cttctctcct
361 ggcaggcgct gcttttctcc ccgttaaagg gcacttgggc tgaaggatcg ctttgagatc
421 tgaggaaccc gcagcgcttt gagggacctg aagctgtttt tcttcgtttt cctttgggtt
481 cagtttgaac gggaggtttt tgatcccttt ttttcagaat ggattatttg ctcatgattt
541 tctctctgct gtttgtggct tgccaaggag ctccagaaac aggtaggcac gctcgttgac
601 ttgtaagtct cggaattaca agttagtgtg ttcttatcca ccttcatgct tttcttgctt
661 ctatttttcc ccgttctttt ta
//
LOCUS MUSKER19 5422 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse keratin 19 gene, complete cds.
ACCESSION M36120
KEYWORDS keratin 19.
SOURCE Mouse (strain C34/HeNCrlBR) embryo fibroblast DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5422)
AUTHORS Lussier,M., Filion,M., Compton,J.G., Nadeau,J.H., Lapointe,L. and
Royal,A.
TITLE The mouse keratin 19-encoding gene: Sequence, structure and
chromosomal assignment
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by M.Lussier, 06-JUL-1990.
FEATURES from to/span description
pept 375 803 keratin 19, exon 1
3728 3810 keratin 19, exon 2
4029 4185 keratin 19, exon 3
4553 4714 keratin 19, exon 4
4832 4957 keratin 19, exon 5
5070 5324 keratin 19, exon 6
pre-msg 304 > 5422 keratin 19 mRNA and introns
IVS 804 3727 keratin 19 intron A
IVS 3811 4028 keratin 19 intron B
IVS 4186 4552 keratin 19 intron C
IVS 4715 4831 keratin 19 intron D
IVS 4958 5069 keratin 19 intron E
signal 184 188 CAAT box
signal 278 284 ATA box
signal 5402 5408 poly-A signal
BASE COUNT 1255 a 1429 c 1431 g 1307 t
ORIGIN
1 gatatcgtca ctccggaagt tagagttgtc taagcctgtg tagtaagggg ctgaagggcc
61 agaaaaggga cgtgacatgt tggcagtagc tttggagtgg gctggggcgg ggcagctctg
121 ggaaggactg agacctctgg ctcctgggag gggagaggta ggagcagaat cgccaggaat
181 tgaccaatgg ggaaagagcc catatttgca ctctgggagc ttggaaattt ctgatacccg
241 ccccttcaac atctccatcc cccttcccgc cccgggcata aaaagccaca ggtgagggcc
301 ttgtcactcc tcctgcggcc agcagttctc agacctgcgt ccctttttcc ttcgctctgg
361 tctccctcct catcatgact tcctatagct atcgccagac ctcagctatg tcttcctttg
421 ggggtacggg cgggggttca gtacgcattg ggtcaggggg tgttttccgc gcacccagca
481 tccacggggg ctcaggtggc cgcggcgtgt ccgtgtcctc cacccgcttc gtgacctcgt
541 cctccgggag ctatggcgga gtccgcggtg gaagttttag tgggaccctg gctgtgtctg
601 atgggctgct gtctggcaat gagaagatca ccatgcaaaa cctcaatgat cgtctcgcct
661 cctacttgga caaggtgcgc gccctagagc aggccaatgg cgagctggag gtgaagatcc
721 gcgactggta ccagaagcag ggacccggac cctcccgaga ttacaaccac tactttaaga
781 ccatcgagga cttgcgcgac aaggtgggta gcggccaggc cctgggaggt gcacctgttg
841 gggtggtgcg gcagggcggg gaggttgtgg aaactcagcc cgcgggtgga caagcccgtt
901 actacgggcc gggcaggaac ccgcagggtg gggcacaaag tctgggctgg accctgggat
961 tcgagtggag gcggggtgga agggaactga gacaggttaa gtgggtgggg ctttcggaag
1021 cggtggggac ctgcttggat agtcagggat atctatccaa agagagcatt taacacagct
1081 ttgggctgcc cagagatgaa gataccgagc cgcagcctca gctgggggtg gggtgttggg
1141 gagggctggg gagctgccgg atattagtct gtaggctcag aggtcacctg agcctgtggc
1201 cctcttctgg gactcttgag cattgctttc tccagttgtg tgggttgttc ctgtctggtt
1261 ttctttacag cagctgtcct ctcactctcc tcctctccag cccaggggac cacctaggct
1321 cacacggagc tacgagaaga atgtaaaacc tgggggctgc acagtaaaac cagatcttaa
1381 aaaaaaaaac ttccagtgag gtgtggtagc tcaggcctgt aatcccagga gcaggtggat
1441 gagttcatag ctatgcccag ctacagttaa cttgaatcta gtcagggcta catgagtctc
1501 taggctccca cccggccgtc aaacagcccg aacagacaga cagacagaca gacagacaga
1561 cagacagaca gacaacaact ctaacacgaa acaaaagaaa aataaaaaca attattttag
1621 taagtaagag ggaggtggaa gtcttgaccc tctgcttctg tggacctctc tcctggagtc
1681 ccccggtcca gggaagacaa cctgcaagtc tctgtgggct ccaacatcca acaggctggt
1741 tgggtggggc attgccctca ctcctaggga agagctatgt cctgaagaaa tgtaccagag
1801 accaggcagg cgtttaatag cgtcctctcg ctagtgctcc caacatgcag ttctcaagag
1861 cattttccag ggctgtgtga acttagcatg gagctagaca cacaacacag caagcagcta
1921 gagaggtgtt tttttgaccc agtgatggac agctagaatc ctaggcggta cacatctttc
1981 tttctcttgc tttgtccctt ccccttccac acactcccag cacgcatatc tagaagcaat
2041 tcttactccg ctgtgaatgc tctcctccgg gtttttatat tctgttgttt tcgtcaacaa
2101 ccccccaccc ccaaactatc tatgtagctg agcgtggtgg ctgatacgtg gtggcttatc
2161 cagcattgag aaactgaggc agaaagaagg acgatgaagt taatgccagt tgaaccatac
2221 agtgagactc tgtctcaaaa tgtcaaacca gagctggggc gcagtggcac acacctttaa
2281 tcccagcact tggggaggca gagaaggtgg atctctgtga attcaaggcc agtctacaga
2341 gtgagtccag gacagccagg gctacaaaga gaaatcctgt ctcgaacccc ctagtccttc
2401 acccccaaag tctaacaaac cagaggaaac caacccaacc caacccaacc caacccaacc
2461 caacccaacc caacccaacc aaaaacatta aaccaaaata atcctgcaga ttccaacctg
2521 gtgacaggat aagtggccat gaccttaatc tttgaaaaac actgaaccct ggtactgaat
2581 tggtggtaaa cttaatgatt tgagtgaacc cctcaaacca gagtcttcat attgggtaaa
2641 ctcaggattt ccagtcacgt acagtggcag tactaaccta ggagtgaggc tgagttacca
2701 tgaagagagt acaaggtgcc tggcacatca ccgaagcaga atgtgctcta agtgtgggac
2761 acagacaggg ataagtgttg cttttctttt ctttaggatt tatttattta cttatctatt
2821 aatttattta tatgtatatt ggtgttttgc ttgcatgtat gtctgtgtga ggaacttgaa
2881 ttgcaaacag ttgtgagctg ccacgagggt gctgcaaatt gaactctgtt cctttggaag
2941 aatatccagg gctcttaatc tccaagttac ctctccagtc cctgggataa atgatccatc
3001 catccttcct tccttccttc cttccttcct tccttccttc cttccttcct tccgtctttc
3061 ctctttcttt cttttttttt tttttttggt ttttggtttt tggttttttg agacagagtt
3121 tctctgtgta gccctggctg gctcaaactc agaaatcctt ctgcctctgc ctcccaagtg
3181 gtggggttaa aggcatgtgc caccactgcc tggagggata aatgattctt ttttttttaa
3241 gatttattta tttattatat ctaagtacac tgtagctgtc ttcagacact ctggaagaga
3301 gcgtcagatc tagtttcgga tggttgtgag ccaccatgtg gttgctggga tttgaactca
3361 ggaccttagg aagagcagtc agtgctctta accgctgagc catctctcca gcccaggata
3421 aatgattctt aaagcttcat gtttctaggt ccttttcaga tcacttagag tttcacctgc
3481 acaggtcttg gccatgccct tctggaattg cctctttgcc tgcctccctt cacccaacct
3541 cctttacagt gtgttgtgct ccagctctgg ggctgtggat cgatgccaaa gtctcccaga
3601 gagcagacta gtgggtagtg tcaagggagt gggggctatc cattcctctc tccttctctg
3661 ggaagaaagt agattgataa tactggcttg ccaaactggt gccaactgcc agcctcctcc
3721 cttctagatt cttggtgcca ccattgacaa ctccaagatt gtcctacaga ttgacaatgc
3781 tcgcctggct gcagatgact tcagaaccaa gtaagtgtcc ctgtattggg ggctgcggaa
3841 gccagggcaa gtgagaggag agcagcggca ggtcttatag tgcttgatga agtagaggcc
3901 tggactctgc ccttctccac ttgagctggc tcaaagcaat gccccccctc cccgactcct
3961 aaaccctggg aaagttggca gagacacaga tccagcacta gcctgactgc aactatttct
4021 tcggaaaggt ttgagacaga acacgccttg cgtctgagcg tggaggccga catcaacggc
4081 ctgcgccggg tgctggatga gctgactctg gccaggactg acctggagat gcagattgag
4141 agcctgaagg aggagctggc ctacctgaag aagaaccatg aggaggtggg tcaagctggg
4201 ccttctgtcc attttgtccc aggattggga gaccctatta tcttgggctt actgaccatg
4261 acctcatctc ttggccctgg ctttgatcat gcctctggca gtctcagcag ggccatgaca
4321 tcatcactat gatgacctta ctgaccatga catcatcact atggccctgg ctttgatcat
4381 gcctctcgca gtctcagcag ggcatggtaa ccatgacatc tactttccag tgctggctcc
4441 tagatttgta cctccaaacc agccgtcttc agctcagggc ttgggggagt gacctgatcc
4501 aagttcagca tgtgtaggtc ttgacatgcc cttgactgtg gtgtcttcac aggaaattac
4561 tgccctgagg agccaggtgg gtggccaggt cagtgtggag gtggattcca ctcccggtgt
4621 cgacctagcc aagatcctga gtgagatgag aagtcagtat gagatcatgg ccgagaagaa
4681 ccggaaggat gctgaagcca cctaccttgc tcgggtatgc agaggatgct ggtatcccgt
4741 gagggtggcg aggagagacc ctgaaccctc acccatccac ccgcagagag aagcttgttg
4801 cttattttct ccttaacttg tcttgttaca gattgaggag ctgaacaccc aggtcgccgt
4861 ccactctgag cagatccaga taagcaagac cgaagtcacg gaccttcgac ggaccctcca
4921 gggccttgag attgagctgc agtcccagct cagcatggta cgtgtctcca ccctacggcc
4981 tgcacacttg tgccctggca acctcggtgc ctgtgcccgt gttcaggaac ctgcctctgc
5041 tgagtcacgc cccctttccc tcccctcaga aagctgccct ggaaggcacg ctggcagaga
5101 cggaggcccg ttatggagtc cagctgtcac agatccagag cgtgatcagc ggttttgaag
5161 cccagctgag cgacgtgcgt gccgacatag agcgccagaa ccaggagtat aagcagctca
5221 tggacatcaa gtccaggctg gagcaggaga tcgccaccta ccgcagcctg ctggagggcc
5281 aggaagccca ctacaacaat ctgcccaccc ccaaggccat ctgagctacc agcgagactc
5341 ccctgggaag gggcctgact ggggtgataa aagtttactc taacccctcc ctcgacttgt
5401 caataaaact atcctccaag gg
//
LOCUS VIBHEMOX 1586 bp ds-DNA BCT 14-AUG-1990
DEFINITION V.parahaemolyticus thermolabile hemolysin (TL) gene, complete cds.
ACCESSION M36437
KEYWORDS thermolabile hemolysin.
SOURCE V.parahaemolyticus DNA.
ORGANISM Vibrio parahaemolyticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 1586)
AUTHORS Taniguchi,H., Hirano,H., Kubomura,S., Higashi,K. and Mizuguchi,Y.
TITLE Comparison of the nucleotide sequences of the genes for the
thermostable direct hemolysin and the thermolabile hemolysin from
Vibrio parahaemolyticus
JOURNAL Microb. Pathog. 1, 425-432 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 124 1380 thermolabile hemolysin precursor (TL)
sigp 124 183 thermolabile hemolysin signal peptide
matp 184 1377 thermolabile hemolysin
BASE COUNT 466 a 378 c 361 g 381 t
ORIGIN
1 tcgactgtct ggagtattta ctcagaatta gagggtttag tgagaaaaat tctcatctaa
61 acaatgttat agccaagtat tttttcaatg tgcttgggtc aataaccatt acaagaagag
121 tgaatgatga aaaaaacaat cacactatta actgcattac tcccgcttgc ttctgcagtt
181 gccgaagagc caaccttatc accagaaatg gtttcagcgt ctgaagtgat cagcacgcaa
241 gaaaaccaaa cctataccta tgttcgctgt tggtatcgca ccagctactc gaaagatgat
301 ccagcgaccg attgggaatg ggcaaaaaac gaagatggta gctacttcac cattgacggc
361 tactggtgga gctccgtttc atttaaaaac atgttctaca ccaacacgtc gcaaaacgtt
421 atccgtcagc gttgtgaagc aacattagat ttggcgaacg agaacgcaga cattacgttc
481 ttcgccgctg acaatcgctt ctcatacaac cacacgatct ggagcaacga cgcagcaatg
541 cagccagatc aaatcaacaa agtggttgca ctcggtgaca gcttgtctga tacaggcaac
601 atctttaacg catcacaatg gcgcttccct aacccgaaca gctggttctt aggtcacttc
661 tccaacggtt ttgtgtggac agaatacatt gccaaagcga agaaccttcc gctctacaac
721 tgggcagttg gcggcgcggc tggtgagaac caatacatcg cgctaacagg ggttggtgat
781 caagtttctt cgtacttaac ctacgcaaaa ctggcgaaga actacaaacc agcaaacacc
841 ttgtttacgc ttgagtttgg tttgaatgac ttcatgaact acaaccgtgg cgttccagaa
901 gtgaaagcgg attatgcaga agcactgatt cgtttgacgg acgcaggtgc gaagaacttc
961 atgttgatga cactgccaga tgcgacgaaa gcgcctcagt ttaagtactc aacacaagaa
1021 gagatcgaca aaattcgtgc gaaagtgctt gagatgaacg agttcatcaa ggcacaagcg
1081 atgtactaca aagcgcaagg ttacaacatc acgttgtttg atactcacgc cttgttcgag
1141 acgctaactt ctgcgcccga agagcacggt ttcgtgaacg cgagcgatcc ttgtttggac
1201 atcaaccgct catcgtctgt cgattacatg tacacccacg cattgcgctc tgagtgtgca
1261 gcgtctggtg ctgagaagtt tgtgttctgg aatgtcacgc atccaacaac agcaactcac
1321 cgctatgttg cagagaaaat gctagaaagt agcaacaact tagccgagta ccgtttctaa
1381 ccggacacgg cttctgagtt gaaaccttat cttcgtacac acgttgataa cgaacacatc
1441 gtggccattt ttatcgaagg aacgttgtgg tcacagcagt cacaacgcta aacaagttac
1501 agtggcgcga cgtcggttcc ccctaaagac tggtaaagcg taacctgagt gagaaactgg
1561 ttgtagcggt tctccaacaa agaagc
//
LOCUS CELCAPKCC1 195 bp ss-mRNA INV 14-AUG-1990
DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
exon 1.
ACCESSION M37114 J05289 M35424
KEYWORDS cAMP-dependent protein kinase catalytic subunit C.
SEGMENT 1 of 7
SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
lambda-CC[2,3,5].
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 195)
AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
TITLE Cloning, characterization, and expression of the gene for the
catalytic subunit of cAMP-dependent protein kinase in
Caenorhabditis elegans
JOURNAL J. Biol. Chem. 265, 6896-6907 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 121 + 190 cAMP-dependent protein kinase catalytic
subunit C, exon 1
pre-msg 1 > 195 cAMP-dependent protein kinase catalytic
subunit C mRNA and introns
IVS 191 > 195 intron A
BASE COUNT 63 a 47 c 48 g 37 t
ORIGIN
1 gtttgaggta acttaaacag aaagcagcga agaccacaca agagaagact gcaagtgtct
61 tctgaagtgt gacaacgtcc catccagtcc cgatcgtggt cttcgtctac cagcaaaaca
121 atgctcaagt ttctgaaacc aaaatcgtcg gacgagggaa gctcgaagga caacaagaac
181 tcggcgtcgc gtatg
//
LOCUS CELCAPKCC2 75 bp ss-mRNA INV 14-AUG-1990
DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
exon 2.
ACCESSION M37115 J05289 M35424
KEYWORDS cAMP-dependent protein kinase catalytic subunit C.
SEGMENT 2 of 7
SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
lambda-CC[2,3,5].
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 75)
AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
TITLE Cloning, characterization, and expression of the gene for the
catalytic subunit of cAMP-dependent protein kinase in
Caenorhabditis elegans
JOURNAL J. Biol. Chem. 265, 6896-6907 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 9 + 70 cAMP-dependent protein kinase catalytic
subunit C, exon 2
pre-msg < 1 > 75 cAMP-dependent protein kinase catalytic
subunit C mRNA and introns
IVS < 1 8 intron A
IVS 71 > 75 intron B
BASE COUNT 26 a 17 c 18 g 14 t
ORIGIN
1 attttcagtc aaagaattcc tggacaaggc acgcgaagac ttcaaacaac gatgggagaa
61 tccggctcag gtaca
//
LOCUS CELCAPKCC3 142 bp ss-mRNA INV 14-AUG-1990
DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
exon 3.
ACCESSION M37116 J05289 M35424
KEYWORDS cAMP-dependent protein kinase catalytic subunit C.
SEGMENT 3 of 7
SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
lambda-CC[2,3,5].
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 142)
AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
TITLE Cloning, characterization, and expression of the gene for the
catalytic subunit of cAMP-dependent protein kinase in
Caenorhabditis elegans
JOURNAL J. Biol. Chem. 265, 6896-6907 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 9 + 137 cAMP-dependent protein kinase catalytic
subunit C, exon 3
pre-msg < 1 > 142 cAMP-dependent protein kinase catalytic
subunit C mRNA and introns
IVS < 1 8 intron B
IVS 138 > 142 intron C
BASE COUNT 44 a 27 c 35 g 36 t
ORIGIN
1 attttcagaa cacggcgtgt cttgacgact ttgataggat aaaaaccctt ggaaccggat
61 cattcgggcg cgtgatgctt gtgaagcata agcagtcggg caactattat gccatgaaaa
121 ttctagacaa acaaaaggtt gt
//
LOCUS CELCAPKCC4 113 bp ss-mRNA INV 14-AUG-1990
DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
exon 4.
ACCESSION M37117 J05289 M35424
KEYWORDS cAMP-dependent protein kinase catalytic subunit C.
SEGMENT 4 of 7
SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
lambda-CC[2,3,5].
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 113)
AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
TITLE Cloning, characterization, and expression of the gene for the
catalytic subunit of cAMP-dependent protein kinase in
Caenorhabditis elegans
JOURNAL J. Biol. Chem. 265, 6896-6907 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 9 + 108 cAMP-dependent protein kinase catalytic
subunit C, exon 4
pre-msg < 1 > 113 cAMP-dependent protein kinase catalytic
subunit C mRNA and introns
IVS < 1 8 intron C
IVS 109 > 113 intron D
BASE COUNT 30 a 23 c 27 g 33 t
ORIGIN
1 attttcaggt tgtaaaactc aagcaagtgg agcacacgtt gaacgaaaag cgcattctcc
61 aggctatcga ttttccgttt ttggtgaaca tgacgttctc tctcaagggt gag
//
LOCUS CELCAPKCC5 222 bp ss-mRNA INV 14-AUG-1990
DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
exon 5.
ACCESSION M37118 J05289 M35424
KEYWORDS cAMP-dependent protein kinase catalytic subunit C.
SEGMENT 5 of 7
SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
lambda-CC[2,3,5].
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 222)
AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
TITLE Cloning, characterization, and expression of the gene for the
catalytic subunit of cAMP-dependent protein kinase in
Caenorhabditis elegans
JOURNAL J. Biol. Chem. 265, 6896-6907 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 9 + 217 cAMP-dependent protein kinase catalytic
subunit C, exon 5
pre-msg < 1 > 222 cAMP-dependent protein kinase catalytic
subunit C mRNA and introns
IVS < 1 8 intron D
IVS 218 > 222 intron E
BASE COUNT 50 a 60 c 44 g 68 t
ORIGIN
1 tatttcagac aactccaact tgtacatggt gctcgaattc atctcgggag gtgagatgtt
61 ctcacattta cgccgaatcg gacgtttctc cgagccccac tcgcgtttct acgccgctca
121 aattgtgctt gcctttgagt accttcactc cttagatctc atctatcgtg atctgaaacc
181 cgagaacttg cttatcgatt ccactggata tctaaaggta gg
//
LOCUS CELCAPKCC6 397 bp ss-mRNA INV 14-AUG-1990
DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
exon 6.
ACCESSION M37119 J05289 M35424
KEYWORDS cAMP-dependent protein kinase catalytic subunit C.
SEGMENT 6 of 7
SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
lambda-CC[2,3,5].
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 397)
AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
TITLE Cloning, characterization, and expression of the gene for the
catalytic subunit of cAMP-dependent protein kinase in
Caenorhabditis elegans
JOURNAL J. Biol. Chem. 265, 6896-6907 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 9 + 392 cAMP-dependent protein kinase catalytic
subunit C, exon 6
pre-msg < 1 > 397 cAMP-dependent protein kinase catalytic
subunit C mRNA and introns
IVS < 1 8 intron E
IVS 393 > 397 intron F
BASE COUNT 119 a 73 c 99 g 106 t
ORIGIN
1 attttaaggt aaccgatttc ggatttgcga aacgtgtcaa aggacgaacg tggacattgt
61 gcggcacacc agaatacttg gcacccgaga ttattttgtc aaaaggatac aataaggctg
121 tcgattggtg ggcacttggt gtgttgatct acgaaatggc cgccggttat ccgccgttct
181 ttgccgatca accgattcaa atttatgaga agattgtgtc aggaaaagtg aaattcccgt
241 cgcacttctc gaatgagttg aaagacttgc tgaagaattt gctccaagtt gatcttacaa
301 agcgatacgg taacttgaaa aatggagttg ccgatatcaa aaatcacaaa tggtttggaa
361 gtaccgactg gattgccatc tatcagaaga aggtgag
//
LOCUS CELCAPKCC7 729 bp ss-mRNA INV 14-AUG-1990
DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
exon 7.
ACCESSION M37120 J05289 M35424
KEYWORDS cAMP-dependent protein kinase catalytic subunit C.
SEGMENT 7 of 7
SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
lambda-CC[2,3,5].
ORGANISM Caenorhabditis elegans
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 729)
AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
TITLE Cloning, characterization, and expression of the gene for the
catalytic subunit of cAMP-dependent protein kinase in
Caenorhabditis elegans
JOURNAL J. Biol. Chem. 265, 6896-6907 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept + 9 134 cAMP-dependent protein kinase catalytic
subunit C, exon 7
pre-msg < 1 729 cAMP-dependent protein kinase catalytic
subunit C mRNA and introns
IVS < 1 8 intron F
BASE COUNT 197 a 197 c 86 g 249 t
ORIGIN
1 aatttcagat cgaagcaccg ttccttccaa aatgccgtgg ccctggcgac gcatctaact
61 ttgacgacta cgaggaagag ccgcttcgta tctcgggcac cgagaaatgc gccaaagagt
121 ttgccgagtt ttaattcatt atctttgtcg attccttcca taagcttttt gtttcaaatt
181 tcactctctc tctctctctt ttccatattc ctcaaaattg tactagtcca aaacaaaacc
241 ctgtccctat tgtccggtgg tttgtcttcc ctcacccaaa ccttctcatt ctcagccaaa
301 ttgaattcaa tattgttgcg acaattgaaa atgatgaaga tgcttactat ttcttccctt
361 ccttctgaac cccttatttt ttaatgttca tacattttct tgccccgatc ccgtccccct
421 aataataatt tgtaaaaata atagattgga atttgctcta atatatataa ttattatata
481 tcaaaatttt ctagtcaacc cctgactttc cctcctccaa ttcagcctcc cgcaaacttt
541 tttctagaaa agttgacttc tgtctcaaca aagaaatgga atcccacaaa accattaatt
601 gtatgttttt ttctcatttt ccttttcctc tctcatgtat acatatatat gtattttaaa
661 cacttttcac cccccccccc cccacacaca ctcaccgcaa aaatccccca gtcgtcataa
721 aaagtagtt
//
LOCUS DRODEC1A 3784 bp ds-DNA INV 14-AUG-1990
DEFINITION D.melanogaster defective chorion-1 fc125 (dec-1) gene, complete
cds.
ACCESSION M35887 M35688 X07697
KEYWORDS dec-1 protein; eggshell protein.
SOURCE D.melanogaster ovary, cDNA to mRNA and genomic DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Hawley,R.J. and Waring,G.L.
TITLE Cloning and analysis of the dec-1 female-sterile locus, a gene
required for proper assembly of the Drosophila eggshell
JOURNAL Genes Dev. 2, 341-349 (1988)
STANDARD simple automatic
REFERENCE 2 (bases 44 to 3784)
AUTHORS Waring,G.L., Hawley,R.J. and Schoenfeld,T.
TITLE Multiple proteins are produced from the dec-1 eggshell gene in
Drosophila by alternative RNA splicing and proteolytic cleavage
events
JOURNAL Dev. Biol. (1990) In press
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.L.Waring, 28-JUN-1990.
EMBL features not translated to GenBank features:
key from to description
PRM 14 18 put. TATA-box
SITE 44 44 transcription initiation site
FEATURES from to/span description
pept 119 3490 fc125 protein precursor (dec-1)
sigp 119 187 fc125 protein signal peptide
matp 188 3487 fc125 protein
BASE COUNT 1103 a 999 c 1004 g 678 t
ORIGIN Chromosome 7C-X.
1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga
61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat
121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag
181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga
241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc
301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag
361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa
421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc
481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg
541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc
601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc
661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc
721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc
781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg
841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca
901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc
961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa
1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca
1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga
1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt
1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag
1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc
1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg
1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc
1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt
1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa
1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag
1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc
1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca
1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa
1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg
1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat
1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat
1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca
2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag
2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca
2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt
2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca
2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag
2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca
2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca
2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga
2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc
2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga
2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg
2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa
2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga
2821 aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca
2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac
2941 aatggccagg acaataaacc cgaagcaacc aggcgaagtc ggtggatcgg agagccagaa
3001 gagcaactct aatcctccaa cgacgttgac accagctcca caagagcaac cgcaagagca
3061 tcgagtccac aaaagtccat catcagcacc atccgaaacc gaaatcgaaa acgcaccatc
3121 atctgacccc caagtgggtt ccattttcac ctacggcgag ggattgctgc atccgtttat
3181 gggtctgctg ccggtggaga ggcccgatga tccgtggaac cagaagccat acgatcctca
3241 ccatcctctc tacacgggcg gtgggagcta cgatgcctat ctgagggacg gtcgccatcg
3301 aagggacacg cacatcatgg gtcaaggtac gcagcatggc atcctgacac ccggcatgtt
3361 ggagagactt cttcggatca agatggattt ccagcgtaga tttcccccat ttatacaagg
3421 gtatgctgaa tcaccatacg aatcttaccc gtgtagaggt acagcctccg gttctgggca
3481 agatttctaa gccgaaaact aagaccaagc ccaagaatga ggacgaaccc gtcttcgaat
3541 tgggagccgc tgagcgtagc ctgttcgagg acgagaccaa cgattcttta gagaaagatc
3601 ccgaaccaga accggatgag gaggatgacc gggatgtgga agagcccagc gagagcagtg
3661 aacctagggg attcagcagc aaaaaaagcc gggatgagaa cgacatcgat tacttcaatt
3721 tcgatgacga ttgatgttga tgattgactt ttattataat tgccaataaa attaatatta
3781 taaa
//
LOCUS DRODEC1B 3798 bp ds-DNA INV 14-AUG-1990
DEFINITION Drosophila melanogaster defective chorion-1 fc106 (dec-1) gene,
complete cds.
ACCESSION M35888 M35688 X07697
KEYWORDS dec-1 protein; eggshell protein.
SOURCE D.melanogaster ovary, cDNA to mRNA and genomic DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Hawley,R.J. and Waring,G.L.
TITLE Cloning and analysis of the dec-1 female-sterile locus, a gene
required for proper assembly of the Drosophila eggshell
JOURNAL Genes Dev. 2, 341-349 (1988)
STANDARD simple automatic
REFERENCE 2 (bases 44 to 3798)
AUTHORS Waring,G.L., Hawley,R.J. and Schoenfeld,T.
TITLE Multiple proteins are produced from the dec-1 eggshell gene in
Drosophila by alternative RNA splicing and proteolytic cleavage
events
JOURNAL Dev. Biol. (1990) In press
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.L.Waring, 28-JUN-1990.
EMBL features not translated to GenBank features:
key from to description
PRM 14 18 put. TATA-box
SITE 44 44 transcription initiation site
FEATURES from to/span description
pept 119 2971 fc106 protein precursor (dec-1)
sigp 119 187 fc106 protein signal peptide
matp 188 2968 fc106 protein
BASE COUNT 1106 a 1003 c 1008 g 681 t
ORIGIN Chromosome 7C-X.
1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga
61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat
121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag
181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga
241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc
301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag
361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa
421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc
481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg
541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc
601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc
661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc
721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc
781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg
841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca
901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc
961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa
1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca
1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga
1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt
1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag
1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc
1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg
1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc
1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt
1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa
1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag
1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc
1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca
1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa
1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg
1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat
1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat
1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca
2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag
2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca
2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt
2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca
2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag
2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca
2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca
2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga
2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc
2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga
2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg
2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa
2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga
2821 aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca
2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac
2941 aatggccagg gcgacgcatt tcagacaata aacccgaagc aaccaggcga agtcggtgga
3001 tcggagagcc agaagagcaa ctctaatcct ccaacgacgt tgacaccagc tccacaagag
3061 caaccgcaag agcatcgagt ccacaaaagt ccatcatcag caccatccga aaccgaaatc
3121 gaaaacgcac catcatctga cccccaagtg ggttccattt tcacctacgg cgagggattg
3181 ctgcatccgt ttatgggtct gctgccggtg gagaggcccg atgatccgtg gaaccagaag
3241 ccatacgatc ctcaccatcc tctctacacg ggcggtggga gctacgatgc ctatctgagg
3301 gacggtcgcc atcgaaggga cacgcacatc atgggtcaag gtacgcagca tggcatcctg
3361 acacccggca tgttggagag acttcttcgg atcaagatgg atttccagcg tagatttccc
3421 ccatttatac aagggtatgc tgaatcacca tacgaatctt acccgtgtag aggtacagcc
3481 tccggttctg ggcaagattt ctaagccgaa aactaagacc aagcccaaga atgaggacga
3541 acccgtcttc gaattgggag ccgctgagcg tagcctgttc gaggacgaga ccaacgattc
3601 tttagagaaa gatcccgaac cagaaccgga tgaggaggat gaccgggatg tggaagagcc
3661 cagcgagagc agtgaaccta ggggattcag cagcaaaaaa agccgggatg agaacgacat
3721 cgattacttc aatttcgatg acgattgatg ttgatgattg acttttatta taattgccaa
3781 taaaattaat attataaa
//
LOCUS DRODEC1C 5740 bp ds-DNA INV 14-AUG-1990
DEFINITION Drosophila melanogaster defective chorion fc177 (dec-1) gene,
complete cds.
ACCESSION M35889 M35688 X07697
KEYWORDS dec-1 protein; eggshell protein.
SOURCE D.melanogaster ovary, cDNA to mRNA and genomic DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Hawley,R.J. and Waring,G.L.
TITLE Cloning and analysis of the dec-1 female-sterile locus, a gene
required for proper assembly of the Drosophila eggshell
JOURNAL Genes Dev. 2, 341-349 (1988)
STANDARD simple automatic
REFERENCE 2 (bases 44 to 5740)
AUTHORS Waring,G.L., Hawley,R.J. and Schoenfeld,T.
TITLE Multiple proteins are produced from the dec-1 eggshell gene in
Drosophila by alternative RNA splicing and proteolytic cleavage
events
JOURNAL Dev. Biol. (1990) In press
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.L.Waring, 28-JUN-1990.
EMBL features not translated to GenBank features:
key from to description
PRM 14 18 put. TATA-box
SITE 44 44 transcription initiation site
FEATURES from to/span description
pept 119 4888 fc177 protein precursor (dec-1)
sigp 119 187 fc177 protein signal peptide
matp 188 4885 fc177 protein
BASE COUNT 1716 a 1481 c 1496 g 1046 t 1 others
ORIGIN Chromosome 7C-X.
1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga
61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat
121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag
181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga
241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc
301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag
361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa
421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc
481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg
541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc
601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc
661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc
721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc
781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg
841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca
901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc
961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa
1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca
1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga
1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt
1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag
1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc
1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg
1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc
1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt
1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa
1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag
1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc
1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca
1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa
1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg
1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat
1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat
1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca
2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag
2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca
2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt
2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca
2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag
2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca
2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca
2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga
2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc
2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga
2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg
2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa
2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga
2821 aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca
2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac
2941 aatggccagg acaataaacc cgaagcaacc aggcgaagtc ggtggatcgg agagccagaa
3001 gagcaactct aatcctccaa cgacgttgac accagctcca caagagcaac cgcaagagca
3061 tcgagtccac aaaaggttag cacatttcca caggtttggg caagaggcag ggttaaatgc
3121 gaccacatcc aagggttgcg gatgtggcag gttggattgt ctgtgcggca ggagttgtcg
3181 atgcggaaga agaggcttgg aatctagagt ggtttctagt agaacttcgg gaacttgcca
3241 gtgcaaagcc agtcaccgga acaagcgttg tgaatatgga accttggaga ccatcgatga
3301 gggttctctt aacgaactta gaagagaata taaactggga ctaaaagaga tcacgctgag
3361 tccagatgaa gatcccgccg aggccctgat gcgctacaat gcggcctcca ttcgcgaagc
3421 tttggagcga gccagtatgg aaccgctgga gatcggtggg gatcagtatg aagaggatgc
3481 gcaacaggag cccatggagg aggagcaact acagcatgat cccaacacag agccccagta
3541 caatcataag gactttgtgc ggctaaccac ttccacagca tctccgatca caagtaccac
3601 agaagctgcc actccaacag gcagtgactc tacttcagaa gcaactgtga cacctgaagt
3661 caccaccacc actagcacat ccacaacaac aacaaccgaa agcaccaaag atgagggact
3721 ggacatgcag caagactcac aggctgaggc tgagagttct catgtcacca agtcgatatc
3781 gaagcaggag gccganattc atcagctgca cagtattgtg gaggagctga agaatgagat
3841 ccttaaactg aatttgcgtt gcagcacgat aatatcaaac aatgtggcca aagaaccggt
3901 cacagagaag aacccacccg tggtggagga accatccaag caggaggata agccaaaggt
3961 cgaggagaaa gtgattgccg aagagcaagc cccagtggag cccgaggaag atctggaaga
4021 ggacgaggat agtacctcta ttagtacgac cacagagact ccttcaccaa gtggtagtta
4081 ctcaacaaaa cctggacttt cccttggatc gcctagagtg gatgaacaaa gtggctcgtc
4141 caataaactg gactacgacg acgataacaa ttggcaacgc attctggcca atcgtggcta
4201 tgacacggac tacctaacca aatcccatga gcgtcagttt gcccagggtc aaaacctgga
4261 gatgcccaag aactgcaatt acgatggaaa tggcagccaa gagtatggcc cctatccgga
4321 gttccaagca gatgaaccca gcacggatac ggagggcaag gctaaaaggg ctttaagtgt
4381 gaaacaacag gcgcagctgc tcaatgcggc cctaaatgat agtggaagcg attcgagcga
4441 tggcaccacc accaccacca caccctctcc gtacgcgatg aggggaaagt ttgtgagacg
4501 gaggagcacg gcaagaaggg ttcccattcc aaagatagga aaggcgagcg acgaggtatg
4561 ggtacgatct cctcgtcaag cgaaaatgcc ccaacgaccc aagaagtcca tgtccaagcc
4621 gaaaaagcag agctcgcagg tgactacgca ggccaccgtg agcagcacta agctggacag
4681 tttggtggat gtgctcaagg atctggtgcg cctgcaaatt caaaaggaga aaaaatccag
4741 cctattaaga acccagagca ataacctttc gaaaaccaaa cccaaatcta ttaagcccgt
4801 caaagtcatc aagcgaaaaa ggttgcgcag gaggcaacac aagtctattg ccacgaccat
4861 caggagtccc attcaaacga aagcataaga tttcagccca tttaactaat tcacagatcc
4921 caactaatcc tatacataat ttagatatag tataagcata ttcactattc gttcgcaatt
4981 cattcatatc catccatctt tcatttgcat atccctcgtc catatcgcca gtccatcatc
5041 agcaccatcc gaaaccgaaa tcgaaaacgc accatcatct gacccccaag tgggttccat
5101 tttcacctac ggcgagggat tgctgcatcc gtttatgggt ctgctgccgg tggagaggcc
5161 cgatgatccg tggaaccaga agccatacga tcctcaccat cctctctaca cgggcggtgg
5221 gagctacgat gcctatctga gggacggtcg ccatcgaagg gacacgcaca tcatgggtca
5281 aggtacgcag catggcatcc tgacacccgg catgttggag agacttcttc ggatcaagat
5341 ggatttccag cgtagatttc ccccatttat acaagggtat gctgaatcac catacgaatc
5401 ttacccgtgt agaggtacag cctccggttc tgggcaagat ttctaagccg aaaactaaga
5461 ccaagcccaa gaatgaggac gaacccgtct tcgaattggg agccgctgag cgtagcctgt
5521 tcgaggacga gaccaacgat tctttagaga aagatcccga accagaaccg gatgaggagg
5581 atgaccggga tgtggaagag cccagcgaga gcagtgaacc taggggattc agcagcaaaa
5641 aaagccggga tgagaacgac atcgattact tcaatttcga tgacgattga tgttgatgat
5701 tgacttttat tataattgcc aataaaatta atattataaa
//
LOCUS DROGLDYYR 6458 bp ds-DNA INV 14-AUG-1990
DEFINITION D.melanogaster glucose dehydrogenase (GLD) gene, complete cds.
ACCESSION M29298 X07358 X13581 X13582
KEYWORDS YYRR box; glucose dehydrogenase.
SOURCE D.melanogaster (strain OregonR) DNA, clone pEG25D20.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1460 to 1531)
AUTHORS Cavener,D.R.
JOURNAL Unpublished (1988) Vanderbilt Univ,Dept Mol Bio,Nashville,TN 37235
STANDARD full staff_entry
REFERENCE 2 (bases 1460 to 1531)
AUTHORS Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M.,
Schonbaum,C. and Xiao,X.
TITLE The YYRR box: A conserved dipyrimidine-dipurine sequence element in
Drosophila and other eukaryotes
JOURNAL Nucleic Acids Res. 16, 3375-3390 (1988)
STANDARD full staff_entry
REFERENCE 3 (bases 78 to 1142 and 3792 and 4208)
AUTHORS Whetten,R., Organ,E., Krasney,P., Cox-Foster,D. and Cavener,D.
TITLE Molecular structure and transformation of the glucose dehydrogenase
gene in Drosophila melanogaster
JOURNAL Genetics 120, 475-484 (1988)
STANDARD full staff_entry
REFERENCE 4 (bases 1 to 6458)
AUTHORS Krasney,P.A., Carr,C.M. and Cavener,D.R.
TITLE Evolution of the glucose dehydrogenase gene in Drosophila
JOURNAL Mol. Biol. Evol. 7, 155-177 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.R.Cavener, 20-OCT-1990.
FEATURES from to/span description
pept 3838 4005 glucose dehydrogenase (GLD), exon 2 (EC
1.1.99.10)
4079 4199 glucose dehydrogenase, exon 3
4539 6088 glucose dehydrogenase, exon 4
pre-msg 502 6458 glucose dehydrogenase mRNA and introns
IVS 837 3828 GLD intron A
IVS 4006 4078 GLD intron B
IVS 4200 4538 GLD intron C
signal 394 400 GLD TATA box
site 1457 1532 YYRR box tetranucleotide repeat region
BASE COUNT 1782 a 1464 c 1418 g 1785 t 9 others
ORIGIN Chromosome 3-84C8.
1 tatctgtctg acattaatat ccaaaaatgt aataaattca catccacaaa ttgtgaaaaa
61 aaatcaacaa cgcagttctg cagccgttcg actttatttt gccagtgctt cttaacttgg
121 ctggaaatcg ttaaactcgc aggcgcacga gcaagcagct ttttgtgtgg gttgtttagc
181 cgaaagcggt ggttgaagaa acctgtgacg tcttagccga agtcaggggt gcttaaagaa
241 agttttacaa cactagacca tattcatgag taaagggttg agtaataaaa tacataaaac
301 gtaagaaata ataataatac agattctaaa agttattagg taaaatttag accaatttag
361 acctactcat tgcaaacact caaaagctcc cgattcagac caagtttcag agagcgcagc
421 tttgcggcca gctttaagct gtctttcgtt gagttcgagc ttttcgtcag tttaaaaaga
481 ctggcgcctg ctggtcagaa gctgagtcgg taacggtctg cgtctgcgcg cagttcgaac
541 aagttgagaa agagaccaac agaaagccca tccaagtgca gtgatcaata cggtaactga
601 caaaaaccct agaagtcagg gcttaaaaac gattttgcag cgctgccagt gggtttttgt
661 gtgataaaaa aaaagcggct cagaaaaact tgctgacagc agatagcaca cacgtttttg
721 tcgtcttcgg tccattgaaa aattttcccg aggcattttc tataaggaat aaacaattaa
781 ttcaatattt aaagcataga aggaaactag acaccacatc accggactct acgatcgtaa
841 gttgatgcaa tcgtctttat ttctattatt ttctgccttt tcggtttttg cacaacccca
901 aaatccaaaa attcgcgatg tccgtttctg gcattgagga agctcaaaag atttggacag
961 ctttttggcc cgaagtctgc tggaaatttg ccattgtcat aagcccaaga aacggaattt
1021 tgttcggtct tctgatggct cacggcgata aatttactgc actttgtttg caatagctcc
1081 aattcgattc ggtttgtttt tctggcacag agcacagtat gcctcacgat ttctgctcta
1141 gactaaaact ataatcccaa atatgaaaac cgaattagct aattagttaa ggctaaagtc
1201 tgaactggaa tcttctgact aagatcccac cccttccaaa cgtggacctt acacaattca
1261 ctcaaaattc gcatagcttt atctggcgcc tttgtggcac tcataaattg cattacttta
1321 tgattattat tctaatatat gcacggttcc acattaataa gcacccgaaa aactatgcct
1381 aaacatatgg cagaagcggc aacgagataa ctgtataaca aggcccaaaa tcaagccggt
1441 atctacggta ttgttcacac tgactggctg actggctgac tggctgactg actgcctgat
1501 tgagtgactg actgactgac tgactaactg acaggcagct caaagaaacc ttcaaacgca
1561 gcaacgaaga caatgccaag ctatatcatt gaagttatcg tgcgagtgca aaacaaacca
1621 ccagaagaca actgcagctc gcgtcggttt tattcgacac ccgtgtcgct tcagtgaaca
1681 agggattatg caccatgctt tacatccgat aaagttgtta attccaaaaa aaaaagtcta
1741 attatgacag cgccattgat aatcgtgtca gataagttac acaggaagtg cgagtaatca
1801 aatgataaca agcaaaaatg tcgttaggct gattaaataa ttgtcttcat gttctcacca
1861 ttaaacaaaa aaatatttta aaagtcattg gtttctctga attttttatt taaattaaaa
1921 attctccatc tgctataaca catattttta tttttttaac gaaacattat tttaacttaa
1981 ctggctaaac tgcagataaa atcagaagtc caagtcctcc aaaaatctgt atattagaga
2041 tcatttaacc ctaataaacc aggcggaaaa taattaagtc ggggaaaatg tattgcggct
2101 atcccctttt cgtatcccta ggattttccc accaatttcc acgagccgcc acgcagaagt
2161 ggacctaaat ctgttagcta ggcaaaactt gccccgaaca acttttgatt ttctgagtgc
2221 tgcacgactt ttgcacgcgt ttttctcggc gtttgaagtt cgtctgtggc cacatttgtt
2281 gcatggcttt tactaaactg ctggaaaacg ggctgcgagg tcaactgcct ctcttttcga
2341 ttttccacgt tcgcgtgagn tgcgtatgtt ctgcgacttt gttttgactg tttttttgtt
2401 tttctatttt tttgtgggct cgcgctctgc ttcactttcc tggcatcctt caactttaaa
2461 agcgaaatgc aattgaaatg aatgaaaaat cgtgttccaa atgtcagatg cgagccccat
2521 tgtggcatcc aaaacagacg aactgtagca attgcatggt cttctcgatc tccgcttagg
2581 tgattctctt gtttcactct cggtttcagc ttgcgtttct gtttcaatta cttttctgct
2641 ttggttttgg cttcggtttc tgtttctgtt tctgtttctg ttttcggatg gatacaaatt
2701 gcaatttgat tgcttcaagc tgttcgagtt aattgaggtc aaaagcgctg tcagcatatt
2761 ccaaaaatgc actgatttcg gggatcggtt cggtttgggt ttcggtttca attttgggcg
2821 cctaattaac cgcccaaaaa ggagtgtgaa gagcaatatt ggaggcaagc aattaagatg
2881 gtttcttcac ataaaaggag cttagaacgg atcccgtttc agagctgctc gggagtcaat
2941 ctgatcagac ccgggggtca taagtcgacc accaacgaga cgaacaaatg acataattac
3001 attgcacaat actcaaccgt ggttggctac cgtaattcca tgcagcctca ttgcttctgt
3061 tttaggcatt tgctcgtcaa tttcgaaatt ccaaatagga aagaaaaggg aagccagcca
3121 gacatttttg gccgctcggg gctaaggtca tgtgaaacta tgcagaagct gccgccgacg
3181 gaacaaaagc caagctcgga tgatgagcta tgacttctgt ccgtccggcg ttctttttag
3241 acctttcgta atcgccaatg cagggggcag ttgccgtttg tttgcaaatg gcaagatata
3301 gttgccgntt gtttgcaaat ggcnnagata taaaaaatat cttttttgnt ttcaatttca
3361 tttcgntttt ttcctttcgg ccgcttatgc cgctgtgtgc atgggacaga cgacgcccat
3421 tgtctctcgg tgagtactct tattaattac agtactgctg ctggagcgtt actatgataa
3481 tgtttagtct ttgtctcagt ggacaaggtt aaacagaagt tcatgttgcg gtcttcggaa
3541 aggtttatga ctttcccaat ggctttcaat taatcagctg gaaattaaat aattaaatga
3601 ccatatcatt ttgcgattgc ttttaataaa aagagtatat tgctgacaac gtctttatcg
3661 ttctaacttt gtaaaacttt aaacaagcta gaaagagact tagaacacga acttgtagct
3721 ttatttanaa aaaaagttcg tgctctaagn cntttgtatt ttgaaacttg atcatagttt
3781 tttacctatt taatacttaa aataagaaat tcccattgat ttttttagtc tatcaacatg
3841 tccgccagcg cctcagcctg cgattgtttg gtgggcgtac ccactgggcc caccctggcc
3901 tccacatgtg gtggtagcgc cttcatgctg ttcatgggcc tcctggaggt ctttatccgc
3961 tcccagtgtg atctcgagga tccctgcgga agggccagca gtcgggtaag ctgtaaatga
4021 cgaaggccac taagagtggc tacccatatc tcaatataga tttccaacat tattacagtt
4081 tcgatcggag ccggactacg agtacgattt cattgtcatt ggcggcggct cagcgggctc
4141 tgtggtggcc tctcgactgt ccgaggtgcc ccaatggaag gtgcttttga ttgaagccgg
4201 taagggatgt gccccagcaa ctcataccca tacacatacc tacatactta agttacactg
4261 ctccggcaat tagaggcacg ttatgtggca tttgtttaga agcaattgaa gcttaacagc
4321 ggcataaact gcaacaatgg caccaacact aatgccactt gggcaagact cggcggcaaa
4381 gaagcaaccg caactgtcac gagcattatg gagcgataat ttctagtcct aatatagaac
4441 cgacaaacta atcatacact tatataaact acaaaaaact ataacaacat gatatttttt
4501 aattatgtat ttaacagtat cttaaccgtc ttacacaggt ggcgatgaac ccgtgggagc
4561 ccagataccc tcgatgttcc tcaactttat tggcagcgac attgactacc gctacaacac
4621 ggaacctgag ccaatggcct gtctgtcctc catggagcag cgctgctact ggccacgtgg
4681 caaggtactt ggcggaacct cggtattgaa cggaatgatg tatgtccgcg gcaaccggga
4741 ggactatgat gattgggcag cggatggaaa tcctggttgg gcctacaacg acgtgcttcc
4801 gttcttcaaa aagtcagagg acaacctgga tctggatgaa gtgggcacgg aatatcacgc
4861 caagggtggc ctcctgccag tgggcaagtt cccatacaac ccacctctat cctacgctat
4921 tctgaaggcc ggcgaggagt tgggcttctc cgttcacgat ctaaatggtc agaactctac
4981 tggatttatg attgcccaga tgacggcccg aaatggcatc agatatagct cagcccgggc
5041 tttcctgcgt ccggctcgca tgcgcaacaa cctgcatatc ctgctgaaca ccacggccac
5101 caagatcctt attcatccgc acactaagaa tgttctggga gtggaagtca gtgaccaatt
5161 cggcagcacc cgaaaaatcc tggtcaagaa ggaggtggtt ttaagtgctg gcgctgtgaa
5221 ctctccgcat atcctactac taagtggagt gggtcccaag gacgaactgc agcaggtgaa
5281 tgtgaggacc gtgcacaacc taccaggtgt gggcaagaat ctacacaatc acgtgaccta
5341 ctttaccaac ttcttcattg acgatgcgga cacggcgcca cttaactggg ccacggctat
5401 ggagtacttg ctctttcggg atggtctcat gtccggcact ggcatttcgg atgtgaccgc
5461 aaagctggcc actcgctatg ccgacagtcc cgaacgtccc gatctccagc tgtactttgg
5521 cggctatctg gccagttgtg cccgtacagg acaggtgggc gagttgctct cgaataactc
5581 tcggtccatt cagatcttcc cagctgtttt gaaccctcga tcgcggggat ttattggcct
5641 gcgatctgca gatcctctgg agccgccgcg tattgtggcc aattacctaa cacacgagcg
5701 ggatgtgaag acgctggtgg agggcattaa gtttgtcatc cgattgtcgc aaacgacgcc
5761 gttgaagcag tacggcatgc ggctggacaa gacggtggtc aagggatgcg aggcccatgc
5821 ctttggtagc gatgcctact gggagtgcgc cgtgaggcag aacacgggtc cggagaatca
5881 ccaggctggc tcctgcaaaa tgggtcccag ccacgacccc atggcggtgg tcaaccacga
5941 gttgcgggtc cacggaattc gcggactccg agtcatggac accagcataa tgcccaaggt
6001 cagctcggga aacacccatg cccccgccgt gatgatcgct gagaagggcg cctacctgct
6061 gaagagggcc tggggcgcca aggtctgacg cgtggatgcg acgtggacgt tgcatagagt
6121 aatttaatca gaatccaagc gaaaattaca gcgagtaaaa ttttaagcga aattacactt
6181 atggcacctc agatccaaaa ctaaactatt ttaccctaat ccaatcttat ttaatccgat
6241 cctattatga aggtttctca ctctaggcac accgcgttgc gcaatttcca ttaaagttat
6301 ctataagtta acaggctccc ttaaaactaa ctactgttac ccactgaata tgccgtcaaa
6361 cttcttgaac agcacactca gctcaaatta aagacaaata aaacttatta ggtatctagt
6421 aaatttacta aaattccaaa ttaaatttat gtgaattc
//
LOCUS DRPGLDYYR 7382 bp ds-DNA INV 14-AUG-1990
DEFINITION D.pseudoobscura glucose dehydrogenase (GLD) gene, complete cds.
ACCESSION M29299 X07359
KEYWORDS YYRR box; glucose dehydrogenase.
SOURCE D.pseudoobscura DNA, clone pBy2.1.
ORGANISM Drosophila pseudoobscura
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 2131 to 2198)
AUTHORS Cavener,D.R.
JOURNAL Unpublished (1988) Vanderbilt Univ,Dept Mol Bio,Nashville,TN 37235
STANDARD full staff_entry
REFERENCE 2 (bases 2131 to 2198)
AUTHORS Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M.,
Schonbaum,C. and Xiao,X.
TITLE The YYRR box: A conserved dipyrimidine-dipurine sequence element in
Drosophila and other eukaryotes
JOURNAL Nucleic Acids Res. 16, 3375-3390 (1988)
STANDARD full staff_entry
REFERENCE 3 (bases 1 to 7382)
AUTHORS Krasney,P.A., Carr,C.M. and Cavener,D.R.
TITLE Evolution of the glucose dehydrogenase gene in Drosophila
JOURNAL Mol. Biol. Evol. 7, 155-177 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.R.Cavener, 20-OCT-1990.
FEATURES from to/span description
pept 3959 4126 glucose dehydrogenase (GLD), exon 2 (EC
1.1.99.10)
4203 4323 glucose dehydrogenase, exon 3
5530 7079 glucose dehydrogenase, exon 4
pre-msg 777 > 7382 GLD mRNA and introns
IVS 1136 3948 GLD intron A
IVS 4127 4202 GLD intron B
IVS 4324 5529 GLD intron C
site 2129 2200 YYRR box tetranucleotide repeat region
BASE COUNT 1983 a 1667 c 1611 g 2121 t
ORIGIN Chromosome 3-84C8.
1 gaattccaat catattaatt ttaatatcat gaagcacacc aggagttttc ttatctattt
61 gttttttttt ttcaaagcat acatgactct acttttcttt ggtttcatat cactttttgg
121 aagttctcaa ctctcaaata caaagtatat tctgaacaag tattattaca tattactatc
181 ttactttttt atcaaaacct ttacaaatat ttttccactt tcaatgccca ggcttggcga
241 aaacaacaat actagaacgt agttccactc agaggtactt tattttacct ctgttcggtc
301 cgcttggaaa tcattaaacc acgcagggcc tctccgcgga aagagtgcct ttttgtgtgg
361 gctgtgtgct gtctgagagc gggggttgaa gaaacctgtg acgtcttatt gaagcatagg
421 ggtggctggc tgccataatc tggagttttg cagcactgtc aaagctgtat gagtcatcaa
481 atattcagta aagttcttat agaactaaag cttttaatgc atagttttta ttttgattgt
541 tatttttaga agttttagaa tgaattgtat ttgtttttac aaaaagaaat ctctcttttg
601 attaacagca aagcccaact taacgaagcc aaacgacgca gcgcgctagc aagcttgcca
661 accagcttta agctgccgct tcgtcaaatg aaaccttcgt taagccaaaa gcttacctca
721 gtggcgcagc gcggcttttg ggaagcttaa aaagcctcgc gctcgccggc cagagactga
781 gtcggtaacg gtctgcgtct gcgcgcagtt caaaacaggt tgaaaaaaac gaggcccagt
841 gcttaaaaga agggaacact aaaaacccta gaaggcagcc caacaaatac cgttgtacgc
901 atgagtgtct ttgtgttgtg tccaagaaaa taagaagaag aagaaactgc aacaagaaac
961 gtttaaataa attgtttaca attgtgcatt ttttacgagt cgaacagttc gtttttatca
1021 tctccatgcc attgagcaat tggcctagag gcatttctat ttttcaataa ggaataaata
1081 attaaatttt caaaaattta acaaacagaa aaggaaacct gaaccgacat agaccgtaag
1141 ttgatgcaat cgcctgatgc ccgatacctg atggctgatg cctgatgcct gagtttttct
1201 tttatttcgc tattttctgc gtgtttcggc catgtccgtt ttggcattga ggaagctaaa
1261 cgaaaactgt tttggccagc ctctgcctct tggactgctg gaaattttac cattgtcata
1321 agcccaagac ccaagaaacg gaacttcaac agagtcttgt gttgtgtcta ttggtcttct
1381 gatgcctcat ggcgataaat tgagttgcac ttttattgta tttttgtttt ttgcaatagc
1441 tccaattcca attcgattcg ttttgtttgc cgtcagtgta catgtctcac gattttactg
1501 cctctagaag accaaaaccc gttccagttc atagttggag gtccattggc caaatgaatg
1561 actacaattt gccgcaatgt gtttgagcat taaatttgct atagttttta tccactgatt
1621 atgttccagt ttttaagcgg aggggaagtt gaaaattgct ggtctgcaaa ttgactccaa
1681 agtagatgtg aaatttcttg aaatgatgtg aatttttctc agtttaattg aatgatggat
1741 tgccaatgga tgccaatgaa gttcttcaaa ttattgggtg tgaaagtctc acaaattttg
1801 tgaatttgat ggggaagcta ttattatgac aggtccaaag atctcacccc atccaaagag
1861 ccaaagacag agcccttaca caatccccgc actataatca catggtttta tctggcgcct
1921 ttgtcatggc acgcataaat tgcattactt tatgattata attctaataa atgcacagtt
1981 cccacattaa taagcgccct cgccctcccc tactcagaaa ccatcgctaa acatatggca
2041 aggaggtaca actaaaactc caaccaaacc caaaaaaact ctatctctgt ctgcggtatt
2101 gttcacaccc ctgagcacaa gaccgacata ctgactgact gactgactgg ctgactgact
2161 gactcactaa ctgactgatt gactgactgg cagactgaca actcaaagaa accttcaaat
2221 gcaacggcag caatgaagac aatgccaagc gatatcaata caaaaaccac aagaagacaa
2281 ctgcaacgca actacgatta taatcgacat gcgggatcgg atcgggttgg atcgcttcag
2341 tgcacaagat attatgtaca tccgataaaa ttgttaatac aaaaaaatca agctgttcat
2401 aattatgaaa aaaaactcac agataacatt gttgtcgcaa aaggaagtgt gaataggcca
2461 aagaagaggc acaaaaagtg tcattaatta attgaataat tacagagaca tgacagagag
2521 cagcacttgc ttaggatttc ccttaagttt tccatagatt tccatgccac aatactggaa
2581 aagcaaaact tgtcccgaac aacttttaat tttctgattg ctgcacgact tttgcacgcg
2641 tttttctttg gcttttgtga agttcgcctg tacttgtacc cgtagccaaa gcctgcctgg
2701 aaaacttgtt gcgaggtcaa ctgcctctct taaagccaca gttgcgtatg cactctggct
2761 ttagtttggc tttgctttac tttcgtggaa tccttcaact ttaaaacaag aaaaggacac
2821 caaaatccca ttcaaattga atgaaaaatc gtgtttcaaa tgtcaaatgc gaacaccaaa
2881 gaaacaaaaa aaaaacagac gaactgtagc aattgtagca tgatcttctc tggtgattct
2941 ctggctctga ttttttgggt ttcattttca gtttcgggct ttggtggttg tttttgtttt
3001 tggcttccac gtttggtttg gttgcaaatt gcaatttgat tgctttaagc cgtttcgatt
3061 taattgaggt caaaaacgtt gtcggattct tcccaaaaat gtgctgattt gtgggttggg
3121 ttttgggctt taaaagtaca tatttgtcta attgtcatgc ctggatgtgg tatgtagcaa
3181 ctttatcggt ctccccaaaa ctctcactta aaaatcactc aaatctcacc cgcccagagg
3241 tcatcaaaac gtttgcaaat gacataatta taatggacat tactcaatct ctcattgctt
3301 cttgttttag gcagtaggcc ccagcagaga gtacgagtaa tttgaaattt caaataggaa
3361 aagagccaga gcccgagcct tggccgctcg tccggggcta aggtcatgtg aaactatgca
3421 gctgactgaa caaaagccaa gagctaagac ccaaccaaga tgatgagctc tgtctgtctg
3481 acttggcctt ttggtaattg ccaatgctgc tgctgctgct ggccagttgc cgtttgccaa
3541 atgcccaacg atatcaaaaa agtatctttt tctgatttta attccatttt attcagcttt
3601 tgtttttact tcttcgactg cgatgacgcc cattgtctgc tgcggttggt acttttatta
3661 attgcagtac tgctgctgtt gctggagaat tacttagatg attttttgtc tttgccatgc
3721 cccatggatc ggcaaggtta aacagaagtt tggaattgcg gaatttggaa ttatatttat
3781 gattttgccc tttggctttg gttttgtgtt taatcaacgg caaattatta aaacaaacaa
3841 aaatacgagt agatacattt aaatattgaa ttataatttc tgaagctttg tttctcaagt
3901 tcgtttctta aatctttcgt tttctgctct tttcattcca cagcacagcc ttatcataat
3961 ggccaccagt ccatcctcct gcgattgttt ggtgggcgta cccacggggc ccaccctcgc
4021 ctccacctgc ggcggcagcg ccttcatgct gttcatgggc cttttggagg tcttcatacg
4081 ctcccaatgc gacctcgagg atccttgcgg acgggccagc acgcgggtaa gccaaagatc
4141 cgtatatggt caagacgtgt ggctaaccag atcacattca atccaatcca ttttgttggc
4201 agtttcgttc ggagccggat tacgagtacg atttcattgt cattggcggc ggctcggccg
4261 gctctgtggt tgcctcacgt ctctccgagg tgccccagtg gaaggtgttg ctgatcgaag
4321 ccggtaagtg ttgttgtttc gtctctcgcg tgtgtgtggg tgtgtgtgag tgttgcatgc
4381 aacatgttgc cagccgcagt gtgctcacgg gcaattagat gcacgttatg tatgtgcatt
4441 gtttagaagt atattgaagc ctaacagcac cagcagcagc agcagcacca gcagcagcta
4501 agacttgggc aaccgcaact ctgccacgag cattatgtaa actaaacgag ctgctgccac
4561 tgccacctag agcagcggct ccattgctgc ccccaaggca gttccccatg caaatgtctt
4621 cttgctatat aggatttgct gcatgccccc gccaatgcct catggaggtt ttttacgttc
4681 cgcttgactt gacttttttg cattggaata attgttggtt ttttgtacct tacatttttt
4741 tatgaatgaa ttataatgca tgaacaatac ggggaacgtg gaacttcgga acgagtgtgt
4801 agggcattca cttttcctta acgagaagca aatatgcggt aaactctgct gaagaggctt
4861 tttagcactg aataatttta catttaattt gcgatatttg aaaactggat ctatagcacc
4921 aggttttaga ggaaagcatt agcaaatcac ttaaatcttt gatggaatat attacaatcg
4981 tttaaaaccc aaaaaacaaa acgaaaatcc ctctatttgg tcggagcacc aaaaaacacc
5041 tcattaaaca tcataatttt tcgccattca tcactaataa cttaattagt tattcaggtg
5101 tctagaaaat cacgttaata cctctctcta tcgctgggaa aacattctca tagatcccat
5161 tagacccgtc ttatcagcat tcttctttct ggttggctta ctttgtgggt taccgtctat
5221 tcgtcgtttt caattgtata cgtgacaatt tttatgccca gtcactaatt acactaattg
5281 cataatgaaa ccctcgagga aagagaaaca ttttcaaagt tgtcaatcgg gttggcggta
5341 cttatacaat gccccagtca ggctgcagtt aagtgcaatt agttgtggcc attacttggc
5401 ttattgtcta tgaaaactac actttcaaag actccaaact catgctcatc acaagtatgg
5461 tttaatcttg ggttgacatt agacatgcaa ttcatcaaat aaaaattcgt attgtttgta
5521 ccatttcagg tggcgatgag ccggtgggtg cacagatccc ctccatgttc ctgaatttca
5581 ttggcagtga cattgactat cgctacaata cggagcccga gaggatggcc tgcttgtcct
5641 ccatggagca gcgttgctac tggcctcgcg gcaaggtcct aggcggcact tcggtgatga
5701 atggcatgat gtacatacgc ggcaatcgcg aggactacga cgaatgggcg gcccagggaa
5761 atccgggctg gtcctaccag gatgtcctgc cgttcttcaa gaaatcagag gataatctcg
5821 aattggatgc cgtgggcacc gaataccatg ccaagggcgg ccttttgcca gtgggcaagt
5881 tcccctacaa tcctccactg tcttatgccc tgctgaaggc cggcgaggag atgggcttct
5941 cggtgcagga tctaaacgga cagaattcga caggcttcat gattgcccag atgacggcca
6001 ggaatggcat tcgctatagt tcggccaggg ctttcctaag acccgctcgc atgcgtaaca
6061 atctgcacat attgctgaac accactgtga ccaaggtgct catccatccg ggaaccaaaa
6121 acgttgtggg tgtggaggtt agcgatcagt ttggcagcat gcgcaagatc cttgtcaaga
6181 aggaggtgat cgtcagtggt ggtgcggtga actcgccaca gatcctgctg ctgagtggcg
6241 taggacccaa agaggatctg caaaaggtga atgtccgtcc agtccatcat ctcccgggcg
6301 tgggaaagaa tctccagaac catgtggcgt acttcacgaa cttcttcatc gatgatgcgg
6361 atacggcgcc cctcaactgg gccacggcca tggagtactt gctcttccgc gatggcctga
6421 tgtcgggcac tggcatctcc gatgtcaccg caaagatggc cacccgctgg gccgatcgcc
6481 ccaatctgcc cgatctgcag ctctattttg gcggctacct ggccagctgt gcccgcaccg
6541 gacaggtggg agaactcctc tcgaacaact ctcgagccat acagatattc ccggctgtac
6601 tcaaccccaa gtctcgtggt tacattaccc tgcgctcggc cgatcccctg gatccccccc
6661 gcatctttgc caactatctg acggatgagc gcgatgttaa gaccctggtg gagggcatca
6721 agttcgccat aaggctgtcg cagacgtcac ccttgaagca gtacggcatg cggctggaca
6781 agaccgtggt gaagggctgt gagtcgcacg ccttcgcgag cgatgcctac tgggagtgcg
6841 ctgtgcgcca gaacaccggg ccggagaatc atcaggcggg ctcctgtaaa atgggtccct
6901 cccacgatcc catggcggtg gttaatcacg agctgagggt ccatggcgtg cggggactgc
6961 gcgtgatgga caccagcatt atgcccaagg tgacggcggg caatacccat gcaccggcgg
7021 tgatgattgc ggagaagggc gcctatctgc tgaagcgggc ctggggtgcc aaggtctgac
7081 gcgtggatgc gacgtggacg ttgcatagag taatttaatc aaattccaac gaaaattact
7141 acaaattcaa atttaagcga aaattattat taaggaagct ctgaactttt tctataccta
7201 ttataaaacc tttttggccc gatccaatcc gatctgaacc catgccccac tctaagcact
7261 ctgcgttgcg caatctacga ctcgacaatc gctctcgata agtttaacca tctaacagta
7321 ctattattta aactaattct tgttaaaaac ccattgaaat atcgaaattt cgaatatcac
7381 tc
//
LOCUS DRVGLDYYR1 945 bp ds-DNA INV 14-AUG-1990
DEFINITION D.pseudoobscura glucose dehydrogenase (GLD) gene, 5' end.
ACCESSION M29300
KEYWORDS glucose dehydrogenase.
SEGMENT 1 of 2
SOURCE D.virilis DNA, clone pCG79.
ORGANISM Drosophila virilis
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 945)
AUTHORS Krasney,P.A., Carr,C.M. and Cavener,D.R.
TITLE Evolution of the glucose dehydrogenase gene in Drosophila
JOURNAL Mol. Biol. Evol. 7, 155-177 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.R.Cavener, 20-OCT-1990.
FEATURES from to/span description
pept 943 > 945 glucose dehydrogenase (GLD; EC 1.1.99.10)
mRNA 545 > 945 GLD mRNA
BASE COUNT 325 a 163 c 189 g 268 t
ORIGIN Chromosome 3-84C8.
1 gcatcctcgg taaaaagtgc acttcgagga tattttttgg tgatatttaa attttttcgg
61 atttgcttca aacgtattat ttaaagctat acataaaaac gttatgaata cgtgttttaa
121 cgatttggaa aatttaaacc ggcaacattg gtaaatttgg ggaaaacgtt tgtatgaaag
181 ttgtttactt accgtccgat cagttttaaa atcattttca aagctatacc cgttagttta
241 atatatatta ttgatcgttt tcattcaaag tatatatttc gttaacgctc gtaactttaa
301 tgtttgcatg ttaataaata gtcatttcaa tagttagaat atgtatttat attaaattaa
361 aaaaataaat attatgccag caagcaattg gagagcgaca agcgagcttt aagctacaca
421 acgtgaggcg cagctggaac cagttggctc ccgaaccaac tgaaaaacga accagccaac
481 agctcaagct tcgtattggc gtttttgggg ggagtttaaa aagacgcgcg ctgccggcct
541 ttcactgagt cgacgccggt ctgcgattgc gcgcagttca aaaccaagtt gctaaaatca
601 tagccaggcc taaactgcat gcagacttgt gagctgcccg ttaagccaaa gaaggcaaaa
661 ctaagtactg tgcgtgtgtt actgtgctac tgtgtgtgct aacgaatgaa aaagaaaaaa
721 aatcaccaaa aaataaaccg aaaaacattt aaaaaatagt ttttgtcaaa ttattgtgca
781 cgtcttcggt ggttgtgcaa tttgccaaaa aggcattacc gataaggaga ggcacaataa
841 gaaaataaaa gccaaaagag ctaattgttt gttttgcggc aaaccgaaaa ggaaactggc
901 caacaactgg acacgggcat agaaaacttg aattaataca acatg
//
LOCUS DRVGLDYYR2 50 bp ds-DNA INV 14-AUG-1990
DEFINITION D.virilis YYRR box located in intron-1 of Gld.
ACCESSION X07360
KEYWORDS YYRR box; glucose dehydrogenase.
SEGMENT 2 of 2
SOURCE D.virilis DNA, clone pCG79.
ORGANISM Drosophila virilis
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 50; enum. 1 to 50)
AUTHORS Cavener,D.R.
JOURNAL Unpublished (1988) Vanderbilt Univ,Nashville,TN 37235
STANDARD simple automatic
REFERENCE 2 (bases 1 to 50)
AUTHORS Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M.,
Schonbaum,C. and Xiao,X.
TITLE Title is missing from this entry
JOURNAL Nucleic Acids Res. (1988) In press
STANDARD simple automatic
COMMENT *source: library=Charon 30; clone=pCG79;
EMBL features not translated to GenBank features:
key from to description
RPT 3 50 tetranucleotide repeat
SITE 3 50 CTGA motif
BASE COUNT 15 a 14 c 11 g 10 t
ORIGIN About 2 kbp downstream of segment 1.
1 aactgactca ctgactcact gactgactga ctgacggact gacagactga
//
LOCUS HUMSYB1A1 727 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 1.
ACCESSION M36196 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 1 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 727)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept 721 + 722 synaptobrevin 1 (SYB1), exon 1
IVS 723 > 727 synaptobrevin 1 intron A
signal 544 549 TATA box
BASE COUNT 130 a 217 c 257 g 123 t
ORIGIN Chromosome 12.
1 aagcttcgtg cttgctttgg tgggacggaa ccccgcatcc ggcatctggc accgggagct
61 tgtggctgtg tctcgaggct cctgcaccct ctcagagcgt tagtcagagg cggggcggtg
121 gtgcggagcg agggccagag cagggcacca gggctgggcg cgcaccagga agccgccgcc
181 tccagctgaa tgcgcgagtc ccggcgcggg gtggggcagg ggcgggcacg gtcttcagta
241 actgcctagc gattggttgg gaggcaccac aaatgcctaa caaactgctg aagcgccctc
301 cgaagggaca gacctcatct ccaccccgcc ccgtccctcc gccagactgg acgcaaacta
361 tggtcgcatt gatcactgaa ggatcctcac agcaaccgct cctttccgga gtcggatgag
421 aggagagttg tgactggcaa ttggcagggg cggggcgggc taggcctgta gcgctgggcg
481 accgtcctgg gcatggattg ggccgcgggg ttgtcaccgt tatccgggag gcgtggtcag
541 cactaataaa ggcggaggcc ggcgcggcag ctgcagtaag ttccagcgca cgtagaccgc
601 ggggtggtcg gcgcgaggcg gagcttggca gttccgtcca cttcagccgc agcgtccctc
661 gccgggtgtc tcgccgcagc ctccggagag gaacagaccc tcactctctc tgtcagaaaa
721 atgtgag
//
LOCUS HUMSYB1A2 153 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 2.
ACCESSION M36197 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 2 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 153)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept + 22 + 148 synaptobrevin 1 (SYB1), exon 2
pre-msg < 1 > 153 SYB1 mRNA and introns
IVS < 1 21 SYB1 intron A
IVS 149 > 153 SYB1 intron B
BASE COUNT 37 a 49 c 42 g 25 t
ORIGIN Chromosome 12.
1 tctgcacatg tctttccaca ggtctgctcc agctcagcca cctgctgaag ggacagaagg
61 gactgcccca ggtgggggtc cccctggccc tcctcctaac atgaccagta acagacgact
121 acagcaaacc caggcacaag tggaggaggt agg
//
LOCUS HUMSYB1A3 186 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 3.
ACCESSION M36198 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 3 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 186)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept + 23 + 181 synaptobrevin 1 (SYB1), exon 3
pre-msg < 1 > 186 SYB1 mRNA and introns
IVS < 1 22 SYB1 intron B
IVS 182 > 186 SYB1 intron C
BASE COUNT 50 a 36 c 60 g 40 t
ORIGIN Chromosome 12.
1 tttgtttctg tgccctcctc aggtggtgga catcatacgt gtgaacgtgg acaaggtcct
61 ggagagggac cagaagctgt cagagctgga tgaccgagct gatgccttgc aggcaggagc
121 atcacaattt gagagcagtg ctgccaagct aaagaggaag tattggtgga aaaactgcaa
181 ggtgaa
//
LOCUS HUMSYB1A4 79 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 4.
ACCESSION M36199 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 4 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 79)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept + 23 + 74 synaptobrevin 1 (SYB1), exon 4
pre-msg < 1 > 79 SYB1 mRNA and introns
IVS < 1 22 SYB1 intron C
IVS 75 > 79 SYB1 intron D
BASE COUNT 16 a 14 c 19 g 30 t
ORIGIN Chromosome 12.
1 ttttttgtcc atgtttcctc agatgatgat catgctggga gccatctgtg ccatcatcgt
61 ggtagttatt gtaagtaag
//
LOCUS HUMSYB1A5 428 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 5.
ACCESSION M36200 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 5 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 428)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept + 22 38 synaptobrevin 1 (SYB1), exon 5
pre-msg < 1 > 428 SYB1 mRNA and introns
IVS < 1 21 SYB1 intron D
BASE COUNT 85 a 139 c 89 g 115 t
ORIGIN Chromosome 12.
1 cttcctgatt cctgtgtcca gtctactttt ttacttgaga atgtacccac cccttccctg
61 ttgtccattg ccatccacat tcatgtcctc tgccctctgt ttgctctctc aacacacttc
121 cccacccacc gtcctccatt ccagcccagg cttctccatc acccattcct cctttttcgt
181 tgcgttcatt tgcactctgt ccctcaacac tagaaatgct gctcgtggca caatctaagt
241 cattacccga agagcaacag tggcgcctcc tccctgcctg ctttttctgt actctcaagt
301 tcccccaaag ccccaaagag ttggaggcca agggaagggg cagggagggg agtggctgag
361 gcgaagtacc catgaagctg cccagacttg ggaggagaag agtatcggtg cccatggtga
421 cttctaga
//
LOCUS HUMSYB2A1 954 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 1.
ACCESSION M36201 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 1 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 954)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept 948 + 949 synaptobrevin 2 (SYB2), exon 1
IVS 950 > 954 SYB2 intron A
BASE COUNT 125 a 361 c 305 g 163 t
ORIGIN Chromosome 17.
1 ctgcagtcgg gtcgctgacc taagcctggg gcaaagctag gtcttggcca gtcagcagcc
61 accacgtgaa gcagcccctg cccccatcac ccccggcagc cttgtgccct cctcctcggg
121 gtcctcagct caccccacct cagtcgggcc tgctccagcg gttaagtacc cctagtctga
181 accttaatcg tgaggcgggg cagccccctc cccacaccct gtagacactc ccccgtcctc
241 cgccccctgc ttttcttcct accctcctac ccagtggcca gggaccccag gcttcctctg
301 agacatccac cagcggtgat gggttgggac caatctcccc cactcctctc caccctagca
361 gtaaggtccc cgctctcgat actgccccag cacctgcact ggccctctct gatttcccaa
421 ctccagccca ttccaagtgc tgaaggcggg tcgctaggtg ctgggacctg ggcgcggggc
481 aagaccttgg gctgtgtggg gtgcaggctg ggtttactgg gttcacgatt gagggagtgg
541 tctgggggtt aggtgtaggg ccaggctgcg ttgtgctcta aggggcaggc ggcaggcggt
601 ttgggggggc tgtggtcaga cagggtgggg tggcggtggg tttggggaag ccgcagcagc
661 ccgccttggg gtcccgcgag tgccgaggga gctgccgggg catggtggcg cggggcccgt
721 ggtcctgccc ggcacgaccc ccgccccgca gcgccccgca gcgccccgcc cccgcggccc
781 cgctcgcgac gcgtctcccg cggcgccccg cccccgcccg cagtgcccgg atgcgggtga
841 cgcgcggccg ccatctttcc gtcccgggca gccagcgcca gtcggagcca gcgcgagcgc
901 cgccgccatc actgccgctg ccaagtcctc cacccgctgc ccccgccatg tgag
//
LOCUS HUMSYB2A2 148 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 2.
ACCESSION M36202 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 2 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 148)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept + 23 + 143 synaptobrevin 2 (SYB2), exon 2
pre-msg < 1 > 148 SYB2 mRNA and introns
IVS < 1 22 SYB2 intron A
IVS 144 > 148 SYB2 intron B
BASE COUNT 25 a 58 c 41 g 24 t
ORIGIN Chromosome 17.
1 ttaatccgtc cctgctcctc aggtctgcta ccgctgccac ggccccccct gctgccccgg
61 ctggggaggg tggtccccct gcaccccctc caaacctcac cagtaacagg agactgcagc
121 agacccaggc ccaggtggat gaggtgag
//
LOCUS HUMSYB2A3 186 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 3.
ACCESSION M36203 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 3 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 186)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept + 23 + 181 synaptobrevin 2 (SYB2), exon 3
pre-msg < 1 > 186 SYB2 mRNA and introns
IVS < 1 22 SYB2 intron B
IVS 182 > 186 SYB2 intron C
BASE COUNT 47 a 49 c 61 g 29 t
ORIGIN Chromosome 17.
1 gcctcgtggg cccttctccc aggtggtgga catcatgagg gtgaacgtgg acaaggtcct
61 ggagcgagac cagaagctgt cggagctgga cgaccgtgca gatgcactcc aggcgggggc
121 ctcccagttt gaaacaagcg cagccaagct caagcgcaaa tactggtgga aaaacctcaa
181 ggtaag
//
LOCUS HUMSYB2A4 77 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 4.
ACCESSION M36204 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 4 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 77)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept + 21 + 72 synaptobrevin 2 (SYB2), exon 4
pre-msg < 1 > 77 SYB2 mRNA and introns
IVS < 1 20 SYB2 intron C
IVS 73 > 77 SYB2 intron D
BASE COUNT 16 a 24 c 15 g 22 t
ORIGIN Chromosome 17.
1 tcctcgcctc tcacccccag atgatgatca tcttgggagt gatttgcgcc atcatcctca
61 tcatcatcat aggtgag
//
LOCUS HUMSYB2A5 146 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 5.
ACCESSION M36205 J05611
KEYWORDS intrinsic membrane protein; synaptic vesicle protein;
synaptobrevin 1.
SEGMENT 5 of 5
SOURCE Human DNA, clone pcVP18-[1,5].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 146)
AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
TITLE structure and chromosomal localizations of two human genes encoding
synaptobrevins 1 and 2
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Sudhof, 06-JUL-1990.
FEATURES from to/span description
pept + 27 43 synaptobrevin 2 (SYB2), exon 5
pre-msg < 1 > 146 SYB2 mRNA and introns
IVS < 1 26 SYB2 intron D
BASE COUNT 24 a 65 c 19 g 38 t
ORIGIN Chromosome 17.
1 tcatcttctt ccccctctct ccacagttta cttcagctct taaatccccg aggagtctgc
61 cctgcctaga gaagggcctc tcccccaacc ctcagccgtt cctccacctc tcagccatat
121 ctttcagccc ccactcccct ggatcc
//
LOCUS SPICPATP 4066 bp ds-DNA ORG 14-AUG-1990
DEFINITION Spinach plastid genes atpI-H-F for ATP synthase CF(O) subunits
IV, III and I.
ACCESSION X03775 M35501
KEYWORDS ATP synthase proteolipid; ATP synthetase; ATP synthetase CFo genes;
atpA gene; atpF gene; atpH gene; atpI gene; synthetase.
SOURCE Spinach chloroplast DNA.
ORGANISM Chloroplast Spinacia oleracea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae;
Spinacia oleracea.
REFERENCE 1 (bases 1793 to 2048)
AUTHORS Alt,J., Winter,W., Sebald,W., Moser,J.G., Schedel,R.,
Westhoff,P. and Herrmann,R.G.
TITLE Localization and nucleotide sequence of the gene for the ATP
synthase proteolipid subunit on the spinach plastid chromosome
JOURNAL Curr. Genet. 7, 129-138 (1983)
STANDARD simple staff_entry
REFERENCE 2 (bases 1 to 4066)
AUTHORS Hennig,J. and Herrmann,R.G.
TITLE Chloroplast ATP synthase of spinach contains nine nonidentical
subunit species, six of which are encoded by plastid chromosomes in
two operons in a phylogenetically conserved arrangement
JOURNAL Mol. Gen. Genet. 203, 117-128 (1986)
STANDARD simple staff_entry
COMMENT EMBL features not translated to GenBank features:
key from to description
OPR 1369 1414 pot. stem-loop structure
OPR 2049 2079 pot. stem-loop structure
OPR 3775 3799 stem-loop structure pot.
transcription terminator
INVREP 2319 2326 inverted repeat A
INVREP 2334 2341 inverted repeat A'
INVREP 2522 2534 inverted repeat B
INVREP 3348 3359 inverted repeat B'
SITE 2602 2604 pot. translation stop codon
SITE 3312 3314 pot. translation stop codon
Between positions 2584 and 3340 six stem-loop structures I(1)-I(6)
are described. They designate secondary structures resembling those
of class II introns
FEATURES from to/span description
pept 357 1100 CFo subunit IV (atpI)
pept 1793 2038 CFo subunit III (atpH) proteolipid
pept 2434 2578 CFo subunit I (atpF)
3343 3752 CFo subunit I (atpF)
pept 3818 > 4066 CF1 subunit alpha (atpA)
pre-msg 2434 3749 primary transcript of atpF gene
IVS 2579 3342 put. intron (no splice consensus)
binding 348 351 atpI ribosome binding site
binding 1780 1783 atpH ribosome binding site
binding 2423 2426 atpF ribosome binding site
binding 3810 3813 atpA ribosome binding site
signal 206 211 atpI -35 region
signal 241 246 atpI -10 region
signal 1693 1698 atpH -35 region
signal 1710 1715 atpH -35 region
signal 1724 1728 atpH -10 region
signal 1744 1749 atpH -10 region
BASE COUNT 1362 a 591 c 777 g 1336 t
ORIGIN
1 ctatttgttt aattgataca aattgtaacc cggatctcgc agctctttcg attccggcaa
61 atgatgacgc tatagcttca attcgattaa ttcataccaa attagtattt gcaatttgtg
121 aaggccgctc tagctatata agaaatcctt gattcataat aaaatcaaaa attgacttcc
181 taaattctct atcggttact agattttgaa tctcaaaaac tagttcaaaa taacagggga
241 tattatgtaa ttaatcagta tccgaaatag aaaattttaa ttaaagtaga caagtcgaga
301 aagagctgat tgaatcaaaa taattttttt aagttatatt tctgtaagag gacaatatga
361 atgttctatc atattcaatc aacccgctaa aggggttata tgctatatcg ggtgtggaag
421 taggtcaaca tttttattgg caaataggag gtttccaaat ccatgacaag gcccttataa
481 cttcttgggt tgtaattgct atcttattag gttcagctgc tatagctgtt cggagtccgc
541 aaacaattcc gactggtggt caaaattttt ttgaatatgt ccttgaattc atccgagacg
601 tgagcaaaac tcaaattggc gaagaatatc gcccgtgggt tccctttatt gggactatgt
661 ttctatttat ttttgtttct aattggtcag gggctctttt accttggaaa atcatacagt
721 tacctcatgg ggagttagcc gcacccacga acgatataaa tactactgtt gctttagctt
781 tactcgcgtc agtagcctat ttctatgcag gtcttacaaa aaaaggatta ggttattttg
841 gtaaatacat tcaaccaact ccaattcttt tacccattaa catcttagaa gatttcacaa
901 aaccgctatc acttagtttt cgacttttcg gaaatatatt agcggatgaa ttagtagttg
961 ttgttcttgt ttctttagta cctttagtgg ttcctatacc tgtcatgttt cttggcttat
1021 ttacaagtgg tattcaggct cttatttttg caactttagc cgcagcttat ataggcgaat
1081 ccctggaagg tcatcattga tttgtcttag gaagagttta tctcctagtt tagatatatg
1141 tgtatgtgtg gctcaagata ctctataaag ataatctatt tagagcatat aaatatccaa
1201 atacatacag tctagtggta atagaaaaaa cgatattcga gaagtgtaaa aaaaaaagac
1261 gttggttagt cgagagggga tacccctgtt atatggaatc taatgactat aagctaattc
1321 ttgcagattc gatgtttcga agaatgttca aaaattcgat tgaatttcaa atataatagg
1381 cggtttacgt tatgtaagaa acctatgtat attttatatt agatattgac aagttatata
1441 tgaaagaata tttaatttgc cctacttgaa tttggataga gacaccaacc gacgaagtcc
1501 tttcctattc gtttatgact gcgaattgaa tggataaaca gacaaaatat aaaaaaaggt
1561 cgaagaatga ttagaaaaag aaaaaggaaa tggaaaaact caagttgtat tgattcagaa
1621 agactcaaca atataggaac taaaaaaatc aagtctttct aattatttaa taatattaat
1681 tattatttca actggataaa tattagcaat ggaataatta agtcataatg cattggttga
1741 ttgtatcatt aaccatttct tttttttttt tgtgtgtgtg aggaacttta tcatgaatcc
1801 actgattgct gccgcatccg ttattgctgc tggattggct gtagggttgg cttctattgg
1861 acctggagtt ggtcaaggta ctgctgcggg acaagctgta gaaggtattg cgagacagcc
1921 cgaagcagaa ggaaaaatac gaggtacttt attacttagt ttagcgttta tggaagcttt
1981 aacaatttat ggattggttg tagcattagc gcttttattt gcgaatcctt ttgtttaatc
2041 cgaaaaggaa aagaaatagg agaaatacac atttctttta tagtcttgaa cttgcaggtt
2101 gctttttcac atttataaga aaatatcgct cctacacaat tacttattcg ttgagaaaat
2161 aatacacgga aaggacttaa tttgaggatg aagaattcgt gttacccact cggtttcttc
2221 tttcctcccc ctttttagtc cgaaggagaa gtgttgcaac aaagagagta tttcgaaatt
2281 cacatgaaac ctagtaccta attagtaatt ctataaatcc aataagtatt attcttattg
2341 ggaatctcaa taaaaaaaat aaaattcatt tagaagtagc aaacaattga aataatacaa
2401 cgattttttt gtttatctat aagaggagat catatgaaaa atgtaaccga ttctttcgtt
2461 ttcttgggtc actggccatc cgccgggagt ttcgggttta ataccgatat tttagcaaca
2521 aatctaataa atctcagtgt agtgcttggt gtattgatct tttttggaaa gggagtgtgt
2581 gcgggttgtt tatttcaaaa ataggttgga ttcaaccaac tgtaccgctt tttttaatta
2641 gggcgaaagg tgcatgattt cacgaatgac ttctgaaatt aataataaag aaatcatatg
2701 taagaactag agcatttcgt gatttgttgg taaatatact ttgattctct atcaaccaat
2761 aatgtgggac cataacatgg ttaaaactta aattgtttga agtccaggca cagcagggta
2821 ttctttctac caccatgtta atacttaaat ataccgaata ccgaggctaa aattaaataa
2881 agtaaagaaa catagttaca aatttttcga tatataacac tcatgtcgat aaaattgttg
2941 gaatctctta tttattgtta taggaaaaat gtttcggctt tttttttatt aagtaaataa
3001 atgccaaagg ctgagtcgat gacctacgta taaaataaga aacatttttg gatttgaata
3061 aaaaaaaaac aactttgctg acaattactt atatattttt ttttgggggg tcagaagagt
3121 tctccgaata ttctagtctt gattagtgat gggtttccat tttttgtcga ctatgaacag
3181 agaagagagg ataagttcat tacattcaaa aaagatatgg aaatttgcca taaaaaattg
3241 aagtaattga gcgtgagagc caaatgaatt gaaaaattca cgtttggttc gggaagggat
3301 catgaatgtt ttgaaatgaa tggaaagata atctactttc attaagtgat ttattagata
3361 atcgaaaaca gaggatcttg aatactattc gaaattcaga agaactacgc ggaaaggcca
3421 ttgaacagct ggaaaaagcc cgggctcgct taaagaaagt cgaaatggac gcggatcagt
3481 ttcgagtgaa tggatattct gaaatagaac gagaaaagat gaatttgatt aattcaactt
3541 ataaaacttt agaacaattt gaaaattaca aaaacgaaac cattcagttt gaacaacaaa
3601 aagcgattaa tcaagtccga cagcgggttt tccaacaagc cttacaagga gctctaggaa
3661 ctctgaatag ttgtttgaac aatgagttac atttacggac catcaatgct aatattggca
3721 tgtttggtgc gatgaacgaa ataactgatt agtccttcta tttaattgta ggtattattt
3781 ttttgtttca aaaaaaaaaa gaatttaaga aagactcatg gcaaccattc gagcagatga
3841 aattagcaaa attatccgtg aacgtattga aggatataat cgagaagtaa aggttgtaaa
3901 taccggtaca gtgcttcaag tgggtgacgg cattgctcgt attcacggtc ttgatgaagt
3961 aatggcaggt gaattagtag aatttgaaga gggaacaata ggtattgctc tgaatttgga
4021 atcaaataat gttggtgttg tattaatggg tgacgggttg atgata
//
LOCUS DRVSEV1 4245 bp ds-DNA INV 14-AUG-1990
DEFINITION D.virilis sevenless gene, exon 2.
ACCESSION M34543
KEYWORDS sevenless protein.
SEGMENT 1 of 3
SOURCE D.virilis DNA.
ORGANISM Drosophila virilis
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (sites)
AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
TITLE Comparison of the sevenless genes of Drosophila virilis and
Drosophila melanogaster
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5351-5353 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 4245)
AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by G.M.Rubin, 24-MAY-1990, for release after publication.
FEATURES from to/span description
pept / 2804 + 3629 sevenless protein, exon 2 (AA at 2805) (5' end
is putative)
IVS 3630 > 4245 sevenless protein intron A
BASE COUNT 1103 a 957 c 922 g 1261 t 2 others
ORIGIN
1 tataagtgtg tcgtattgtt ggccaataat tagccgagaa tgtcaaatca cttgctgtta
61 attgatgttt atcgagagta gagtcgaact aatcgttgtt acagcttggc tctgcttgtg
121 ctttgaatag tgtactttga gctgcataaa ttcagctgat agagttggcg cattttcggt
181 ggcacttcaa gcattatgtg taactattcc agatagatac ctttgtgctc tggaatcgcc
241 catgacgctg ggtaaatggg gctagctggc tacgtcaacc gaagtttcgt cgcatctttg
301 tcgttgttga tcgtcttgct ctggcttgcg acttcggctg gggttgtttt tatactctgt
361 acacatggaa attggacaaa tggggtataa tgggcagacg gtagcaactc gtaaaccatg
421 aatttcccat atgtatttct gatcgggaca aagagctgcg tcggcctggc tatgtctgtc
481 cgtccgttaa gataatgaaa attcaatact taacaaacgc agttccagct tttcttggaa
541 gtttttaaga aatggaagaa acttttatat ttgtgtgcgg ctagagtctt acgcttgggc
601 ttacacacac ttcttatgct ttatattaag cggcgaaatt tcataacgat cggactataa
661 ataccgaagt tctgccaaaa cacgttgact ttgatgtggg tgctaggcaa gcaagcggca
721 tgctttgggg tgtttgtttt tttttgtgta caatctttgt aaacgacagc ttgtcgcttt
781 gtccagggta acgcttagtc ggtcaattct gatcagagca cttacttgtt tttggagctg
841 tcgcagtcgt tgtcgcagtc gcaatcgccg gttgccgctt gtcgtttgtc gtttgccgtt
901 tgccgtttgc cggttccatc agttaagcga cgatcgcggc acggttaagt acgcagcgct
961 cagatataaa tacatagcca caggctgcag atacatttgt ggatataaag atacggctgt
1021 gacatattag ccgcgcgact ctctgaagta gcttttgaag ttcaatcccg tcccgcactg
1081 tcccgcttcg cccaatcttg caaaaagggc tacaataaaa tgcaattaag ctgctcgaat
1141 caattctcag caaattgaat taaagcgcgt gaagagctcc tctctacaca cacacacccc
1201 ccctccaccc ctccacccac acccataagc aataacaaca agcccctgat atgtgggcga
1261 gtgttatccg aaactcgaag tccatagcct cgagctcgag aactttgaga actttgccca
1321 cggagtgctg tgaaaatata caaaatgcag atgggcataa acagaaatta acgccaaacg
1381 gaattgctcg ctcatagaat aatgcgaata attaattaat tacgggcaat tttttgcaca
1441 attttttttt ttttttttgt tttttgatgc aattaaaaat gttttgtgtg catattttct
1501 ctggcatttt cgcgcgctta attagtgatt tatttatttt atttatgcat gtttttcttt
1561 tttttttttt tggttttcat ttttctcttg tgctgtgtgt gtttttggag ttaattaata
1621 aagcggcatt ttaacatttt ttgttatttt atgtgcgaat cggcaaatca attgcatttc
1681 attaatatat tcctccgacg gaatttatag cgattccatg cggtttcttc ttaatctgca
1741 tttatgcatt tgtgcggaga aagtgatccc cgtcccaccc aaagacccgc gctctatcac
1801 tcgcacctaa atatagtttt tcccataagg cttaaattgt tgagggggga cttttgtagt
1861 ttttgtggtt tgttgtgatt atttgatgat tattttcata tttattgatt atacacccaa
1921 aaatgttgac aggaatctcc cgatttgccc gattgtattt cactttcccc ttattcagct
1981 tgcaatcgaa gctagaacct tgtgcttgga tatgtgcaca tatttgggtg tcatatttca
2041 gtctcgaaan tttcatatag atcgggcaat atgcagtgca gttaacagag aacggggacc
2101 gaagccaagc cgtggtgtgt ctctataggc agtagctaga gtctgatccg aactccaatc
2161 tgtgttctct atataaaaag ttctccgatt ctttgccaag tatctgaatc aattgcaagc
2221 tttttttttt tatgctcagc agttgcgagc aggagacaaa ccaaacttta agccgggctt
2281 ctatttcagg agattccaat tccaattgtt tgcgtagatc gtaacagatc aaaaataggt
2341 acattgaact gttttggggc aattggttta gtttattatt gttgcgtgtt tgtcgattaa
2401 cataatatgt taattcattt atatgcatgt gccgagacgt ttctgatttt tcgagttttt
2461 cgttttcaat ttcagcaaat tgtgcgctaa aatgtaaatg caaaaacgtc gagccgagct
2521 tcttcaaatc gaaatcgaac gacaacaaat tagaaaccag cccagagaca gtcgcacaca
2581 cacacacaca aacatacact cgagcacaca cacacactcg gacacgcaca accattcgca
2641 cacacacaca cacacacata ccgtggtcaa agtcagagtc ggacggacgg cagctccggc
2701 cagttccgac cgccggaacg cagagtccca cttgccactt ggcattgccg tggcgtaggt
2761 cgagcttaat ttttgtgctt cgatgaacat ttaaattgtg gactatgttt tggcgagaag
2821 atgcagcaca gcagcagcag caacagcaac aacaacaaca gcaacagcag cagcagcaac
2881 agccgccgca tccacccaaa cgcttgagct ttagtttcaa tgttaaaatc gctgtcaatg
2941 tgaacaccaa gatgtccacg actcacatca accaggagcg ctccaagcag caaacaacaa
3001 ccggatctcg atctcgatct cgttccaatt ccaattccag tgtcagttgc aaaggcgacg
3061 gcgatcgccg tgttcggcgg cacacaacgc ggcttgtggg cctgcgtcag cagctgttgc
3121 atctgggacg tcagctaaac ccgggccaat tcttggtcac gggtcacggc ggcatctcca
3181 cgatactaat tgccaatttg ctgctgctgc tgctgctctc gctctgctgc aatgtctgtt
3241 gccgctccca catcgagccg gatcaaaact tgacaccaac aacaacatca ccagctgctg
3301 ttgcagttgt tccgatgttg ctgcccctgg cccagaccca tatgcgaccc cagctggaca
3361 gcgatgtggt cgagaaggtg gccgtttgga caaagcatgt cggtgcggcg ccacccagca
3421 tcgccgaggg cattgccatc agcagcgtgg tcagaatgcc acccagcatc caaaccccaa
3481 ccgaaactgt gcgcaggcaa gaacaacaac gacaacagca gcaacagcaa caagaggcgg
3541 ctgcggctgc ggctgccgac gccgccatcg atgagcgcat cgtactggag cgcgtcaccc
3601 gcgactgtgt gcagcgctgc attgtggagg tgagtcgcta gttgattaac gggcggttta
3661 aactataact aaatggaatt gttattgtta ttgttattgt gaccttgagc aacgtgcttg
3721 tgcaccgtaa gtacagtttg gtatgtccgt tattccatac ttcctggcaa agcgcatgtt
3781 agttaacaga atccttcaac ttatcgtaac ttacttgctt tgggtgtccc aatgcacttg
3841 acttgccttg acagttcttt aatactctta aggactttct agttattgta gagctttgct
3901 tgcactaaac taacttgttc agatcgttct aaaatctttc tggcaaagct catgttagtg
3961 aactacagca cacttatcat aaagctgtag ctctggcaag caaggcttta agggcttaag
4021 acttaaagag tttctagtgc tggtatatca atgaagccat aaagcttttt tataaactta
4081 acttgttccg acatttacat tatacttcct ggcaaagcgt atgttagtaa aatgcagcat
4141 tttaaacgta tcgtaacttg ctctggatat caacatacgt ggctttaggc catagattgt
4201 gctagcaagc ttaagaattt aacattaacg ctccganaaa aagtt
//
LOCUS DRVSEV2 5687 bp ds-DNA INV 14-AUG-1990
DEFINITION D.virilis sevenless gene, exon 3,4,5,6 and 7.
ACCESSION M34544
KEYWORDS sevenless protein.
SEGMENT 2 of 3
SOURCE D.virilis DNA.
ORGANISM Drosophila virilis
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (sites)
AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
TITLE Comparison of the sevenless genes of Drosophila virilis and
Drosophila melanogaster
JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 5687)
AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by G.M.Rubin, 24-MAY-1990.
FEATURES from to/span description
pept + 1897 1971 sevenless protein (sev), exon 3
4050 4115 sevenless protein, exon 4
4177 4512 sevenless protein, exon 5
4574 4810 sevenless protein, exon 6
4867 + 5186 sevenless protein, exon 7
IVS < 1 1896 sev intron B
IVS 1972 4049 sev intron C
IVS 4116 4176 sev intron D
IVS 4513 4573 sev intron E
IVS 4811 4866 sev intron F
IVS 5187 + 5687 sev intron G
BASE COUNT 1665 a 1222 c 1168 g 1629 t 3 others
ORIGIN
1 ccaacagata taaccatgtc cctctgtcct tttatctgtc accaagcgta ctagtctctt
61 agttttaaag ctattgtctc gaaacctctc tgttgcaagc agtatatatg ttagtactgg
121 atcggattgg atcgaagttc ttgtaagaaa aacttctttt caagatatct tgaccaaaac
181 cgtcctctac ttgttaaact atgctccttc tatataagct caattatatg taatttccat
241 ccttaagata ggtaaatgat caaaatgcga tgcccaaatg tttataagct ataaacttat
301 ctaaagtctg gcattttgta gacatttgag ctgtttataa gttcagttta agctgagata
361 atatgaaaac ttgaatataa tttagtattc ggttggactt tgtattaagt tttgacataa
421 ataagggctt tttttatagc ttgttgtttc gcagctttaa tgctgagaca gaccataacc
481 tgcgcttgtc tgttagaaat ttgtctgtta aagacctttg atccattatt ttactatgta
541 ttactaggtt aaccaagatt cgagaagctc tcgagttgtt tgccttcgct ctgaattggc
601 atgtcgctat ttttggctgc ccctcgccgc aacaacaatg aatcgatcgt ttgcggaccg
661 agtcggagat ctctttacga tcgatcacgt gctaaaagcg aaaagaaaga gaaaaaaagg
721 cagaaactcg tttggcgtga cgccgttaaa tgtcgatttg gtcaatagat ttatgcattc
781 gtaatgcgcg aatatctcag aggttgttaa aatgcctcta aagtcaataa tcaggtctat
841 gtttgagttg gcggctgtca cttgagaaaa gtagctaaat caaatcatca aaatagctat
901 aaaaagctgc attacctaag cagagtgatc tacaaaagcg ctgctgcttt ttgcggagtt
961 ttcttttcaa aaaaaagtta gactaaataa attagctaca aaatagccag aactgcaggc
1021 aatggcaaat atgacaagtt tcgtgtgaaa aacagcccat tctagcttgt aaatgacagc
1081 acaaaggtcc cagctctatt tcggcagtaa gccagaaaaa gggcacttct ttgttttctc
1141 gctgaaaata gccaaaatct agttttgaaa tagccataac tgactataaa atagccaaac
1201 tagcagacac ttgcaaatgt cagctattat caagttgggt agttgggtaa aaaaactcta
1261 gctagaaaat ggcaaaattt gttcgttgcg gatagggaga atagccaagt ctcgctataa
1321 aatagctaaa ttgacaacac aactctagcc tttttttttc aataaagcta gaacaagggc
1381 taaatcagct aaaaaagaat ttccggtatg caaatagaaa attctagtta aataaagcag
1441 aaatagtaat atactagctc tagctctttg ctctatttag cttgaaaatg gccaagtata
1501 cccaaaaaaa aaaatagctg atttctttct ttgctgtaat cagttagaaa atagtctagt
1561 ctagctagaa aatagtgaat tgctaacata ctgatccaag ctcactctct ctgttttgct
1621 tatatttagc ttgaaaatag ctatcttggc aatattggct gtagccaatt ctcgccctgc
1681 ctgctaccga tctgcctttt cacattgctt ttatctataa atgttaaaga ctgtgaattg
1741 attcattgcc attgacattg aattgaaaat tcttcccacg atttggattt cccttactct
1801 ttctttcctt gcccctgctt gctgcggcat ttccgttgca aatcggggct cttttgtgct
1861 aattgatttt cacttggatt tctttctttt ttccaggagg atctgtttct ggacgagttt
1921 ggcattaaat gcgagaaggc cgacaacagt gacaagtgct ataaaacacg agtaagtctg
1981 ccccccaaaa accaacacgt ccccctcgcc gaaacggtgt caagtgtttt gacaataacc
2041 ccccctcaag catgccccat tatccccaac taagcaaaag gccacaaagt cgcgtaattg
2101 gaaaccacct ttcaacttta ccagacaacg tgacgacgga ccgagcgacc gaccgaccga
2161 atgacccata aaaaaaaaca agaacaaatt cccagaaacc agaaacgaaa cgtatttacg
2221 gctcaaaaac cctgacaaaa aggaatatac ttaagcaaac gcacgaaaag aatggcaatg
2281 tctgtcttca ttgagccgaa gattaaatac ccttttatac agctcgtaat tgatgagaat
2341 atcttgcaat ggattctacc ttaccatagg atctatatga tatagtagtc caatgataac
2401 agaaagcaag gatatctctc acnagtatag aaaaagattc tctagctgtg cgatataaat
2461 ataaatagaa ncccaacaca aataactttt gcctgtgaca tggctattca caaatgccat
2521 atgccctttt gatacacact tgcaacagtt acagggtatc aggagtggca agcgagggat
2581 agccagcggt aaggccatca ggcacttgtt aaggaaaaat cataaacaca ttcgatatcc
2641 aagccagttc tgcctaacat gtatttcgga ttaaccacag ccagttgagg ttgggagtag
2701 ggagtgggat cgacctttga tgtgattaag ttaccaagca acacagctga gcagcttttt
2761 actggctttt aattgtcgcc atgctcgtga gtttagctag ttgcatacga actcaaagtg
2821 catcaggcgc ggccaatcgt gctccagctc cagacagcaa aggaaaccgt cgaacaacaa
2881 ttgtaaacgc tgctgtcggc tctcaattag tttgcttgca aggcatcgga tccggcggtt
2941 aatttccgtt tgagttttac agcaatccat cagggtacaa tagtttatct agaaattagc
3001 tgtgagcttg gctagcgcta agagacttac agactcgtac ttacctaata ataacatgga
3061 attttgctat cacgaaatac aagagtagtt attgaacata aaatatattt tcagagcata
3121 gtctaacagc tcactagtta tctgaattgt gtacgcccaa aggtaaatag tgttgagtgt
3181 gaaatagttt aaatatgggg aacagttggc tctgcttttt atacccttgc agagggtatt
3241 ataattttgt cgtgaaatgt gtaacgcata gaaggagaca tctctgaccc cataaagtat
3301 atatattctt aatcagcatc aacagccgag tcgatatagc catgtccgtc tgttcgtctg
3361 tccgtctgtc tgtttccatg tcgaactagt ccctcagttt taaagctatc ttaatgaaac
3421 tttgcagaac tccctctttc tgttgcactc agcacatatg tgaaaaccag ctggatcgga
3481 ccactatatc atatagctgc cataggaacg atcggtcgaa aattaagttt ttgtatgaaa
3541 aacattttgt tatcaagata tcttgaccaa actcggcatt tattagtctt actatgctcc
3601 tcatatatat gcaaaatctt attaagatcg gaccactata tcatatagct gccataggaa
3661 cgatcggtcg aaaattaagt tgtatgaaaa aacattttgt ttatcaagat atcttgacca
3721 aactcggcat ttactatttt cccggtactt cttagatagg ggcaaagcac tatgagcatt
3781 atgaaaaggt tgggtctgca agggtattag atctttggcg tgccgaagat agcccttctt
3841 tctcgttttt actggataag caaacgagtg cgaggcaaac gagggtatct cctagactac
3901 ttttgtagtg cactgatatt aatagctttt taaggctgat taaactttca gctggcttta
3961 gataagcctc actttactat gcacctcctg tcttttccct gatgggccaa ctcaacctca
4021 actaatctga tcgtaatctc tctttgcagt gcaacaaggg ctgtgcgcaa tggtatcgcg
4081 cgctgaagga gatcgagccc tgccaggagg cctgtgtgag tgatatctga cagctgccgt
4141 cttaatgata gcttcttaac tgcattattt tgataggctt ccacacagtt ttatccgtac
4201 gatatgccct gcattggagc ctgcgagacg gcgcagcgcg attattggca tatgcagcgg
4261 ctggccatgg ctcggttggt ggagacaacg cagccgcagc tgttggagat gacggatgag
4321 tcgagcactt tgaccatcaa atgggccatg cagttcccgg agaactattt ggccagtcga
4381 ccctttaata tacaatacca gcaagtggac aatcaatcgg aacctgaatg gcacaatctg
4441 gcggactacg actgcgatga atactacgta tgcgaaattc tggaggcgct tgtgccctac
4501 acccgctaca aggtatgtcc atcatatatt aagtgaaata tctctggaga tttaaatttg
4561 aactcttttt cagtttcgct tcgagctgcc atttggggag agcagcgaag acgtgctcta
4621 ttcgccagca acgcctgttt acgaaacgcc gatggagggc gcccccatct ctgcgcccat
4681 tatcgtggca ctgctggcgc tggatgagca tcatgtgttc gtgcactggc gtcccggacg
4741 ctatagcaat gcgcccattg aaggctatcg cgtgctcttg accagcgcag gcaacaccag
4801 ccgcgaacag gtacgtaacg agctccaact gccaagctgc caaatctaaa cgctcatctc
4861 taccagctgc tgcctgctca acgtactagc tgcatctttg cccagctgca gccgctcacc
4921 aactacaccg tggcgctgac catgatcaac aaacagggcg agggaccgtc tacggtggtc
4981 agcattgtga ccaagtcgcc cctggagcca cagcagctgc agagcgtgct gcttgcgagc
5041 gagcacagca tcatctggca atcactggag ccagcgggcg agacacgcct cctctacacc
5101 tctgagccgg cggccattag cgattttacc ttttcgcagc gcgagcagcg gctctggctg
5161 ctggacgaat tgggtcagtt gcacaggtga gcaattggcc atttcagtcg ctcttcaatt
5221 atcttttcga gagtgaagat tatttttttt tgtaggattt cgaggtgaaa ttacgtaaag
5281 tttctacagt aggaaaagct ttctagctag atcatcagtc agtcagtcat atatggatgg
5341 ggagctgcag ttaagctcta gccaagacca gattttgttc agcctcttta gtcacagtta
5401 aatcagtcag tcagtcaatc actcagtcag tcgaatgaat atataattta gtacgatcaa
5461 ttgtgactgt tccttgatca tcagtcagtt agttattcaa tcngtttaat caaaaatcaa
5521 tcagtaaagt aaattggcta ctacagctac aggttgtcca aaaaaatgga tagtttagga
5581 tacattcaat cagccgtcaa tcaatcagtc agtcataata tgatatactt ctattatgcc
5641 tctctatgag ttcagctaca gctaagacca actggggtgt gtgttga
//
LOCUS DRVSEV3 8779 bp ds-DNA INV 14-AUG-1990
DEFINITION D.virilis sevenless gene, exons 8, 9, 10, 11 and 12.
ACCESSION M34545
KEYWORDS sevenless protein.
SEGMENT 3 of 3
SOURCE D.virilis DNA.
ORGANISM Drosophila virilis
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (sites)
AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
TITLE Comparison of the sevenless genes of Drosophila virilis and
Drosophila melanogaster
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5351-5353 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 8779)
AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [2] kindly submitted
by G.M.Rubin, 24-MAY-1990.
FEATURES from to/span description
pept + 340 4973 sevenless protein (sev), exon 8
5836 6113 sevenless protein (sev), exon 9
6174 6365 sevenless protein (sev), exon 10
7286 7722 sevenless protein (sev), exon 11
7791 8175 sevenless protein (sev), exon 12
IVS < 1 339 sev intron H
IVS 4974 5835 sev intron I
IVS 6114 6173 sev intron J
IVS 6366 7285 sev intrn K
IVS 7723 7790 sev intron L
BASE COUNT 1930 a 2456 c 2435 g 1958 t
ORIGIN
1 tctggtagat gagcgttaga ttggcagctt ggcagttgga gctctcaagc tctatatata
61 tataccattt gtcctgactg attgactgat tgacggactg attggggatc taaatttccc
121 actcccataa ctcggccaaa actcaaccga ttttcatgag gcttggcttt ttattcatgg
181 tttggcctgt atatcaattt ggcattttca aagatccttg agagaattca ttttataggc
241 gccacgggca agggcgggtc taccggctgg tccgtcataa gtacatcgcc caatctctaa
301 gctctacttc agtttagact tgcaattttc tgtttacagc cagctgctgg atgagacgac
361 gacaagcgct gcccggcgtc tgcggctgga gctgccctca aatgggagca gccagtggac
421 gccacgcaag ctgagcctgg actggttgca gcgacgcctc tacatagcag cgcaggcgaa
481 cagcagcgac ggggcagaag gaggatttga gctgtttagc agcaatctgg agggcggcga
541 tgtgcagatg gcgggcgtgc agctgggcct ggtcgtggag cagctggagc tggacgccct
601 gaacggctgg ctcttctggt gcgatgcaga cagcctctgg cgtctcgatc tgagcagcaa
661 gcagcagctg cgccttacac agcccgcagg agcacccgga cgtttcatgt tggagccgca
721 gcgctggctg ctgcacgtgc tgctgccgca ggagaatcag ctgctggagc tgagctacga
781 tggtgggcac aagcatgcgc tggcactgtc caacgacagc tggcgcggat ttgcctggtc
841 cagcgatcag gcacagctgc tactggcgaa tgagacacag ttgcagctgc tggacggaca
901 gacgctggtg ccgcttgcca actggtcgcc ggatggcggc tgctgcgctt tgctgcccct
961 ggaacgtcgc cgccagccgc tgtcactgga gccgccggcg ccgcgcgagt tgcgcgccct
1021 gctaggagcg cagggcgcgc acataacatg gcagccgccg gcagctaatc cctaccagac
1081 tgcaacggcg gccgcgcgca atttcagcta cgagctggag gtgctcgacg tggccagcca
1141 aagcgcctac aacatacgca acatacgcgt gccccacttt ggcctggagc gcctgcaggc
1201 ggacaatctc tatcagctgc gcgtgcgcgc aaacaatgcg gccggccgtg cgggcgtgtg
1261 gacagcgccg ctggcgacgc gcacctggcc cctgggcgat catcgtctgc gctgggccac
1321 gcagcgcggc tcgctgtaca ccacgaacga gctgggcggc cagctgcagc cgctgcccgt
1381 gcagctggca tcgtcgcccg ggccgctagc cctggtcaat gccagcgtgg cgtactatgt
1441 cagcgggcgg gagcaatcgc tgcactgcgt caatttgctg cagccgcagc tgagctgcac
1501 cgacgagcgg ctggagcatg taggcgccgt tgcctatgac tggcgcggcg gtctgctcta
1561 ctggacggat ctggcgcgcg attgtgtcca gcgtctggat ccgttcagcg gtgagcgcga
1621 gctgttgccc atctttggcg cccgccattt ggccctcgac tcggcccagg gacatctcta
1681 ctactcgagc tcggcgcatt tggcgcgccg ttcgctcagc gctctctcca cccatcagcc
1741 ggagctggag tactatcatg tcaatggact ggccggccag attagcggct tctgtctgga
1801 tctgccgcag cgacacatct actggcttgt cgccggcaac tcggcgctgc atctctatcg
1861 cacagcattg tcggcgggcg gcagccaggc cgccgtccca ctgcagctgc tgaccacgct
1921 gccggcggcg gacgcactgc cgcatacgct gcagcatctg gcgccgctgg gtgcgctgct
1981 ctggctggcg gcggatggcc ggggcgcgca tctgctgcgg ctcgcggccc aactggagac
2041 ggacacggat actatgcgtc tgctgccgga gggcctggtc gagccgctca gcgccgtgca
2101 gctgctggag cgttcggctg ggccgccgcc accgccgccg gacgagggcg tgcgtccgct
2161 cgccgtgccg ccggacagtg tgcacatcga cgagggcggc cactggaatg actttcgtgt
2221 gcgctggcag cctgctgcca gtggcggcaa tcacagcgtc tgctacaagc tgctgctgga
2281 gcacggcagc gagcgtctga tcacgctgga gctcctcacg ccctttgccc gcatcacaca
2341 actggcccag gcgccgctcg gtctgcgcat cagcattacg ccgcacaccg cctggcgcgc
2401 cggatccacc acccgggtgc agctggacac gcccgttgcg gcgcccacgc agccgcggcg
2461 gctgcgcgtc tttgtcgagc gtcaggcggc gccgctgcag ttggcgccga atgtaagcgc
2521 tctgctgcgc tgggatgtgc ccgaggagca tgcgggcagt cagtcgctgc agtatcgcat
2581 cagctgctgg cgcggcagcg agttgcattc ggagctgctg ctcaatcaga gcacattgga
2641 ggcgcgcgtc gagcatctgc agccggagga gacgtatcgt tttcaggtgc aggcccatgt
2701 ggccgccacg ggcctggccg ccggcgccac cagccatgcg ctgcacgttt cgcccgaggt
2761 gcaatcggtg ccgcgtctgc tctatgccaa tgccgagcac attggcgaac tggatctgga
2821 tacgggccat cgcaagcagc tggtgcacac cgccagtcct gtcgagcatt tggttgtgct
2881 gcagggcgag cagcgtttgc tttgggtcaa cgagcacgtc gagctgttga gccatgtgcc
2941 gggtaaggcg ccggccaagc tggcacgcat gcgtgccgag gtgctggccc tgaccgtcga
3001 ctgggtgcag cgcatcgtct actgggccga gctggatgcc gcggacggcg gctgtgtcat
3061 ctatagcctg gatctctgtc gcttcgatgg acgcattctg cagggcgaac gtctctggag
3121 cacgccgcgc ggccagctgc tgcgcgatct tgtcgccctg ccgcatgccc gccagcttgt
3181 ctggctgcag cacgatctgg actcgcgcaa tgccacgctg cagggtcgca gcctggccaa
3241 tggcagcgcg ctcacctttg agggcgtcac gttaccgctg tggcgcctct tcgagggcag
3301 ccaggagccg ctggccgaga cccttaatct ggtggatcat ttgggccgtc tgtgcgttta
3361 ccatgtggcc cggcaactgt gcacgtccag cgcactgcga gcccagctca atctgctgaa
3421 cgatgatatc ggccagctgg cccaggatcc gggctatctg tatgctctgc gcaatggtag
3481 tgtgcgcgca tatggccgcc gccggcagca actggaattt ctgctggaac tgcagccgga
3541 cgaggtacgt ctgctgcgcg cctacaacta tcaggcgtat cccagtcgtc gctgcctgct
3601 gctgcccacc acggctgccg ccctcgaatc aacgccgtcc agctgcgagg agacgcagtg
3661 cagcctccag ctgccggcgc tcagcgccgc accagactgc ccgctgccgg tgcccgggct
3721 taactatcag ctgaatctta gcagcagcag cagaagcgca cagctggagc tgcgctcgct
3781 gcactccgcc gccggcctga cgctcaacat aagtcaactg cagccctatc aggcctacga
3841 gctgcgcgct caggtcggca gctactatca gcaacaactg ggccaggagc cgctgcagct
3901 gcccgtcctg acgctgcaca cggctgcggc cacgcccagt gcgccacgaa atttcagcgg
3961 acgggcactg agtcccagcg aactggagct cagctggctg gcgccgctgg agctgcgcag
4021 cgcgagtgtc tattatacgc tgcactggca gctgcagctg gaggataccg aggagcagtc
4081 acaggagcag cccgcccagg agcagcgcgt ggagacggcg ggcgtgcagc ggctgacggg
4141 tctgcagccc gcacgactct atcaggtctg gctgcaggcg catgccacgc ccagcaaata
4201 caatagcagc gggcggctgc tgatccgcag ctatgcgccg ctgccgccgc tgcagctgat
4261 cgagctgaac gcgtacggaa tgacactcgc ctggccgggc acaccggatg cgctcagttc
4321 gctcaccctg gaatgtcagt cgctgcgcga acagctccag ttcaatgtgg ccggaaatca
4381 tacacaaatg cggctggcac cgctgcagcc gaagacgcgc tactcgtgcc gcctggccct
4441 ggcgtatgcc gccacgccgg gcgcacccat ttactttggt cccagtcatg agtacgaaac
4501 gctgggtgat gcgcccagcg cgccgggtcg gccgcagctg gagcatatag ccggcgaaat
4561 ctttcgcgtc agctggacgc cagctctgga caacggttca cccattttgc tatacaatct
4621 ggaggctctg caggcccggc gcaccaatcg tcgtcggcgg cggcgccgag agactacgct
4681 ttccctattg ccctgggccg aggagccgct ggtcatcgag gatcagtggc tggacttttg
4741 taataccacc gagctgagtt gcattgtgcg cgaattgcat acccgtcgcc tgctgctgtt
4801 tcgcgtccgt gcccgcaatc ggccgcacgg ctggggaccc tacagcgagg acagtgaacg
4861 catcgctgag ccctttgtct cgcccgagaa gcgcggctcc ctcgtcttgg ccatcattgc
4921 gcccgctgcc attgtttcca gctgtgtgct cgccttggtg ctcgtgcgca aatgtaagtt
4981 caaaatgaat atttttccgc ccgacaatca gtcaatcatt caactagtcg tcggtcagtt
5041 tgtcaatctt tcagtcgaag aatcaatcga taagaaaatg agtggctaaa taagtaagtt
5101 agtcaatcaa atagtctgtc tttttgtcaa tcagtcgatc aatcaatcgc tcaatccatc
5161 tggtattgcc attcagccaa ttaaccagtc aatccagcaa tcattcaatt agtagtccgt
5221 aggttatcag tcaactatga taaccaatca atccgtcagc ttgataattt gtcgtttaac
5281 tagtcaagaa atcaatgtta atcaagtaat tggccaatta atcagtcagt tcagcagact
5341 gtctagtata actgtgaatt gaaggttcat gaatctgtca gtcaaacggc caatcaaatc
5401 aatcatttgg aaaatttttg atcacccaat ctcaaagtaa ttaagtgagg caagcagtca
5461 attaggcaga taaacagtta atcaggatgt cagtcaatca atcaatcata aatagtctgc
5521 cacctaatcc atcatgattc aatcagtcaa tcagccaata ctcacgtagt caattgatat
5581 tccggacaat cagtcagcca gtctgaaatt aatcagtaaa tcaatcgaat atgagtcagt
5641 caatcaatca attaacacgt aatcaatcaa tcaatcagtc aatggctcat taaatcaata
5701 agcagatcag aaaaatgttt aatcattcac tcaatcagtc agcctaatca atcatgaccc
5761 tttcattcaa tcaatcaatc aatcagtcag tcacatacac aagactcgct aattagtaaa
5821 tctgcgattt tgcagtgcaa aagcgtcgtc atcgcgccaa aaagctgctg cagcagagcc
5881 gcccaagcat ctggagcaat ctttcggctc tgcagacgca gcagcagctg ctggccgcac
5941 ggagtcgcac cttctcgatg tcgctgagcg atgcggacat tgcgctgctg ccgcagatca
6001 actggaaccg gctgacgctg ttgcgtttcc tgggcagcgg cgcctttggc gaggtgtacg
6061 agggtcagct gcaggccgag gacgaggcgc agccgcagcg tgttgccatc aaggtgagtg
6121 gagcagcagt tttttttttt gccacaattc aaggattctc ctcgccttta cagagtctgc
6181 gcaagggcgc cagcgagttc gcggagctgc tgcaggaagc gcagctgatg agcaatttca
6241 agcacgaaaa catcgtttgc ctgattggca tctgctgtga cacggactcc atatcgctga
6301 ttatggagca catggaggcc ggcgatttgc tcagctattt gcgtgcggca aggcccagtt
6361 cacaggtgag cgagcaagca ttaatattta ttatacgctg tagccattgc aaatgggtaa
6421 agggtaaact gtttttgtgc atatgcatgt gacgcacagt atgtccatgc ctttaatctt
6481 gatcaggacg cactgcgttc gtctaggcag gttcctcggt ctctcctttc gttttccaaa
6541 tatcgtaaaa tatcgaaaca aggttttttt tatttacttt atttaattta tttgttaatg
6601 gtcgacaaac acgagtgtct ttctaattat ttaatacatc aaatcatgaa aattaaatac
6661 taaaaaccta aaagcatcac cgactagata tataaagcgt taagtagata ggataaagag
6721 ataccatatg gatatattta gagctcatag cgggaaatag tcagtgagtt cccgaaaata
6781 tatgtacatg ttgaagatta taaatgctag atacaccaca atatggatga ggattaagcg
6841 cctttttgga tgcctccaat caaaactatt ccaaaatcaa aaggatttag aggttccgca
6901 ttcagctttt gtatgtacca acattttgat gccatattta cacgctgaaa gttttacaaa
6961 gatcggacaa taaacaccaa agttcctcga gaatgccata actgaaaatg tggaaccaaa
7021 gaaaagtaga cagaagagag acaaatgtga gtctgttgtg gcagccggat acaggatatc
7081 tctagtcgag cacttgaatt gagcactctg atgccatatt tatacgctga aagtttcatt
7141 atgatcggac cataaacacc gaagatcctc gagaattcca tatctgaaaa tgtggaatca
7201 agcaatagta gagattcagc tagaatgtgt gtgtggctac agggaatatt ctagtcgagc
7261 actcttatgt atggttcgtt tgcaggaagc gctatccaag ctgcagctgc cggagctgtt
7321 gagcatgtgc ctggacgtgg ccaatggctg cagctatatg gaggatatgc actttgtgca
7381 tcgcgatctc gcctgccgca attgtctggt gtcggatgga gccgcaattg ggggacggcg
7441 cattgttaaa attggcgact ttgggctggc acgcgacatc tacaagagcg actattaccg
7501 gaaggagggc gagggactgc tgcccgtgcg ctggatggcg ctcgagagcc tggtggatgg
7561 cctatttagc acacagtcag atgtgtgggc cttcggtgtg ctctgctggg agatattcac
7621 gctcggccag caaccgtatg cggcgcgcaa caacttcgag gtgctcgccc atgtcaagga
7681 gggcggacgg ctgcagcagc cggagcgctg tcccgagaag ctgtaagtgg gccagctctt
7741 ccagttaact tctctctact caatcactgt cccatttatg ctgtccacag ttatgcgctg
7801 ctcttgcaat gctggcgctc tgagccctgg gagagaccca gctttaagcg ctgcctcagc
7861 acgctgcagg ccctcagctc ggatttgcgg cgcactgaaa tgctggccac ggatgagacg
7921 ccgttggttt ctgcattgtg cgctttcaag ccggacgcca aggtgcgttt cgatgatgcg
7981 ccgcagcgtc ttacgctgca cttggacgcc aaagatacgg ttagcacaac cgatgccgat
8041 acgacaggaa gcccaaccac acccaccgca ccaaccacgc ccaccacgac caccagcaca
8101 atcgcagtcg taagcacagc tccatcgtcc gaaaatggcc agctctatgc aaatgagggc
8161 atttcgggcc tttgaaatcc cctttcattt ggatctgccg tcgcaagttg cggcgccaca
8221 aattgatgaa cattcgcatt cgcatgtttg ggcgcactca accatgtaac gcccacttgc
8281 cgaaacaccg cccacacgcg catacacgat tgtgggctta gaattatggt ttctatttaa
8341 tcgatttact attctgtact gaatccaaaa gctaaacaag accaaaagat tgacctcaac
8401 aagtcctaca tgtaaatcaa atggttctgt ttagtttttc ttttttgttt ttgttgtttt
8461 tttttttttt ttggtatacc caacaagacg agtatgctta ttcgaatata caagtagagc
8521 ctaaagtcga atcaataaat atcatgatat ttttgtagaa atagccttta tcattaaatt
8581 attatatttt gcctcaatca ggctgattga cggttgatga tcaatcaaat ataaggtatg
8641 cgctactcga aaatattata aataaatgaa ctcaacagta aatgaagata ttcttgtaaa
8701 accaggctgc atgttttagt ctgcctgatt gattagctaa cggactgact gatcatcagt
8761 caatcttttc gagtatgcg
//
LOCUS ECOLAMBAA 186 bp ds-DNA BCT 14-AUG-1990
DEFINITION E.coli lambda receptor (lamB) gene, 5' end.
ACCESSION M24997
KEYWORDS lamB; lambda receptor.
SOURCE E.coli (K-12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 186)
AUTHORS Clement,J.-M., Hedgpeth,J., Hofnung,M. and Jacob,F.
TITLE Sequence de l'ADN correspondant a la region du peptide signal pour
le recepteur de lambda chez E. coli K12
JOURNAL C. R. Acad. Sci., D, Sci. Nat. 289, 1033-1036 (1979)
STANDARD simple staff_entry
FEATURES from to/span description
pept 82 > 186 lambda receptor precursor
sigp 82 156 lambda receptor signal peptide
matp 157 > 186 lambda receptor
BASE COUNT 51 a 46 c 48 g 41 t
ORIGIN
1 ccggcgttta agcaccccac aaaacacaca aagctgtcac aggtgatgtg aaaaaagaaa
61 agcaatgact caggagatag aatgatgatt actctgcgca aacttcctct ggcggttgcc
121 gtcgcagcgg gcgtaatgtc tgctcaggca atggctgttg atttccacgg ctatgcacgt
181 tccggt
//
LOCUS HUMCRPR 1797 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human cysteine-rich peptide mRNA, complete cds.
ACCESSION M33146
KEYWORDS cysteine-rich protein.
SOURCE Human normal term placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1797)
AUTHORS Liebhaber,S.A., Emery,J.G., Urbanek,M., Wang,X. and Cooke,N.E.
TITLE Characterization of a human cDNA encoding a widely expressed and
highly conserved cysteine-rich protein with an unusual zinc-finger
motif
JOURNAL Nucleic Acids Res. 18, 3871-3879 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.A.Liebhaber, 22-MAR-1990, for release after publication.
FEATURES from to/span description
pept 36 617 cysteine-rich protein
BASE COUNT 405 a 507 c 493 g 392 t
ORIGIN
1 cctgccgccc ctgcgccgcc gagccagctg ccagaatgcc gaactgggga ggaggcaaga
61 aatgtggggt gtgtcagaag acggtttact ttgccgaaga ggttcagtgc gaaggcaaca
121 gcttccataa atcctgcttc ctgtgcatgg tctgcaagaa gaatctggac agtaccactg
181 tggccgtgca tggtgaggag atttactgca agtcctgcta cggcaagaag tatgggccca
241 aaggctatgg ctacgggcag ggcgcaggca ccctcagcac tgacaagggg gagtcgctgg
301 gtatcaagca cgaggaagcc cctggccaca ggcccaccac caaccccaat gcatccaaat
361 ttgcccagaa gattggtggc tccgagcgct gcccccgatg cagccaggca gtctatgctg
421 cggagaaggt gattggtgct gggaagtcct ggcataaggc ctgctttcga tgtgccaagt
481 gtggcaaagg ccttgagtca accaccctgg cagacaagga tggcgagatt tactgcaaag
541 gatgttatgc taaaaacttc gggcccaagg gctttggttt tgggcaagga gctggggcct
601 tggtccactc tgagtgaggc caccatcacc caccacaccc tgcccactcc tgcgcttttc
661 atcgccattc cattcccagc agctttggag acctccagga ttatttctct gtcagccctg
721 ccacatatca ctaatgactt gaacttgggc atctggctcc ctttggtttg ggggtctgcc
781 tgaggtccca ccccactaaa gggctcccca ggcctgggat ctgacaccat caccagtagg
841 agacctcagt gttttgggtc taggtgagag caggcccctc tccccacacc tcgccccaca
901 gagctctgtt cttagcctcc tgtgctgcgt gtccatcatc agctgaccaa gacacctgag
961 gacacatctt ggcacccaga ggagcagcag caacaggctg gagggagagg gaagcaagac
1021 caagatgagg aggggggaag gctgggtttt ttggatctca gagattctcc tctgtgggaa
1081 agaggttgag cttcctggtg tccctcagag taagcctgag gagtcccagc ttagggagtc
1141 actattggag gcagagaggc atgcaggcgg ggtcctagga gcccctgctt ctccaggcct
1201 cttgcctttg agtctttgtg gaatggatag cctcccacta ggactgggag gagaataacc
1261 caggtcttaa ggaccccaaa gtcaggatgt tgtttgatct tctcaaacat ctagttccct
1321 gcttgatggg aggatcctaa tgaaatacct gaaacatata ttggcattta tcaatggctc
1381 aaatcttcat ttatctctgg ccttaaccct ggctcctgag gctgcggcca gcagagccca
1441 ggccagggct ctgttcttgc cacacctgct tgatcctcag atgtggaggg aggtaggcac
1501 tgcctcagtc ttcatccaaa cacctttccc tttgccctga gacctcagaa tcttcccttt
1561 aacccaagac cctgcctctt ccactccacc cttctccagg gacccttaga tcatcactcc
1621 acccctgcca ggccccaggt taggaatagt ggtgggagga aggggaaagg gctgggcctc
1681 accgctccca gcaactgaaa ggacaacact atctggagcc acccactgaa agggctgcag
1741 gcatgggctg tacccaagct gatttctcat ctggtcaata aagctgttta gaccaga
//
LOCUS HUMCSPBB 910 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human cytotoxin serine protease-C mRNA, complete cds.
ACCESSION M36118
KEYWORDS cytotoxin serine protease-C.
SOURCE Human lymphokine activated cell, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 910)
AUTHORS Klein,J.L., Selvakumar,A., Trapani,J.A. and Dupont,B.
TITLE Characterization of a novel human cytotoxic lymphocyte specific
serine protease cDNA clone (CSP-C)
JOURNAL Tissue Antigens (1900) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Klein, 06-JUL-1990.
FEATURES from to/span description
pept 53 793 cytotoxin serine protease-C precursor
sigp 53 157 cytotoxin serine protease-C signal peptide
matp 158 790 cytotoxin serine protease-C
BASE COUNT 231 a 245 c 250 g 184 t
ORIGIN
1 ttcgatatcg agctcctcca acagctctga cctgggcagc cttcctgaga aaatgcagcc
61 attcctcctc ctgttggcct ttcttctgac ccctggggct gggacagagg agatcatcgg
121 gggccatgag gccaagcccc actcccgccc ctacatggcc tttgttcagt ttctgcaaga
181 gaagagtcgg aagaggtgtg gcggcatcct agtgagaaag gactttgtgc tgacagctgc
241 tcactgccag ggaagctcca taaatgtcac cttgggggcc cacaatatca aggaacagga
301 gcggacccag cagtttatcc ctgtgaaaag acccatcccc catccagcct ataatcctaa
361 gaacttctcc aacgacatca tgctactgca gctggagaga aaggccaagt ggaccacagc
421 tgtgcggcct ctcaggctac ctagcagcaa ggcccaggtg aagccagggc agctgtgcag
481 tgtggctggc tggggttatg tctcaatgag cactttagca accacactgc aggaagtgtt
541 gctgacagtg cagaaggact gccagtgtga acgtctcttc catggcaatt acagcagagc
601 cactgagatt tgtgtggggg atccaaagaa gacacagacc ggtttcaagg gggactccgg
661 ggggcccctc gtgtgtaagg acgtagccca aggtattctc tcctatggaa ataaaaaagg
721 gacacctcca ggagtctaca tcaaggtctc acacttcctg ccctggataa agagaacaat
781 gaagcgcctc taacagcagg catgagacta accttcctct gggcctgacc atctctggga
841 cagaggcaag aatccccaag ggtgggagtc ggggttgcag gactgtaata aatggatcga
901 gcacgatatc
//
LOCUS HUMFKBP 1532 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human FK506-binding protein (FKBP) mRNA, complete cds.
ACCESSION M34539
KEYWORDS FK506-binding protein;
immunosuppressive agent FK506-binding protein;
peptidylprolyl cis-trans isomerase.
SOURCE Human peripheral blood T-cell, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1532)
AUTHORS Maki,N., Sekiguchi,F., Nishimaki,J., Miwa,K., Hayano,T.,
Takahashi,N. and Suzuki,M.
TITLE Complementary DNA encoding the human T-cell FK506-binding protein,
a peptidylprolyl cis-trans isomerase distinct from cyclophilin
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5440-5443 (1990)
STANDARD full staff_entry
COMMENT Draft entry and printed sequence for [1] kindly submitted by
N.Takahasi, 5-16-1990, for release after publication.
FEATURES from to/span description
pept 79 405 FK506-binding protein (FKBP)
signal 1507 1512 Poly-A signal
BASE COUNT 341 a 386 c 339 g 466 t
ORIGIN
1 gaattcgggc cgccgccagg tcgctgttgg tccacgccgc ccgtcgcgcc gcccgcccgc
61 tcagcgtccg ccgccgccat gggagtgcag gtggaaacca tctccccagg agacgggcgc
121 accttcccca agcgcggcca gacctgcgtg gtgcactaca ccgggatgct tgaagatgga
181 aagaaatttg attcctcccg ggacagaaac aagcccttta agtttatgct aggcaagcag
241 gaggtgatcc gaggctggga agaaggggtt gcccagatga gtgtgggtca gagagccaaa
301 ctgactatat ctccagatta tgcctatggt gccactgggc acccaggcat catcccacca
361 catgccactc tcgtcttcga tgtggagctt ctaaaactgg aatgacagga atggcctcct
421 cccttagctc cctgttcttg gatctgccat ggagggatct ggtgcctcca gacatgtgca
481 catgagtcca tatggagctt ttcctgatgt tccactccac tttgtataga catctgccct
541 gactgaatgt gttctgtcac tcagctttgc ttccgacacc tctgtttcct cttccccttt
601 ctcctcgtat gtgtgtttac ctaaactata tgccataaac ctcaagttat tcattttatt
661 ttgttttcat tttggggtga agattcagtt tcagtctttt ggatataggt ttccaattaa
721 gtacatggtc aagtattaac agcacaagtg gtaggttaac attagaatag gaattggtgt
781 tggggggggg gtttgcaaga atattttatt ttaatttttt ggatgaaatt tttatctatt
841 atatattaaa cattcttgct gctgcgctgc aaagccatag cagatttgag gcgctgttga
901 ggactgaatt actctccaag ttgagagatg tctttgggtt aaattaaaag ccctacctaa
961 aactgaggtg gggatgggga gagcctttgc ctccaccatt cccacccacc ctccccttaa
1021 accctctgcc tttgaaagta gatcatgttc actgcaatgc tggacactac aggtatctgt
1081 ccctgggcca gcagggacct ctgaagcctt ctttgtggcc tttttttttt ttcatcctgt
1141 ggtttttcta atggactttc aggaattttg taatctcata actttccaag ctccaccact
1201 tcctaaatct taagaacttt aattgacagt ttcaattgaa ggtgctgttt gtagacttaa
1261 cacccagtga aagcccagcc atcatgacaa atccttgaat gttctcttaa gaaaatgatg
1321 ctggtcatcg cagcttcagc atctcctgtt ttttgatgct tggctccctc tgctgatctc
1381 agtttcctgg cttttcctcc ctcagcccct tctcacccct ttgctgtcct gtgtagtgat
1441 ttggtgagaa atcgttgctg cacccttccc ccagcaccat ttatgagtct caagttttat
1501 tattgcaata aaagtgcttt atgcccgaat tc
//
LOCUS HUMINTA2 272 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human interferon alpha-2' gene, partial cds.
ACCESSION M28028
KEYWORDS interferon alpha-2'.
SOURCE Human leukocytes DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 272)
AUTHORS Ovchinnikov,Y.A., Sverdlov,E.D., Monastyrskaya,G.S., Tsarev,S.A.,
Zaitseva,E.M., Arsenyan,S.G., Chakhmakhcheva,O.G.,
Novokhatskii,A.S., Aspetov,R.D. and Kuznetsov,V.P.
TITLE Expression of mutant human interferon alpha-2 in Escherichia coli
cells
JOURNAL Mol. Biol. 18, 36-45 (1984)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 > 272 interferon alpha-2' (AA at 2)
BASE COUNT 63 a 76 c 66 g 67 t
ORIGIN
1 actggtggcc ctcctggtgc tcagctgcaa gtcaagctgc tctgtgggct gtgatctgcc
61 tcaaacccac aacctgggta gcaggaggac cttgatgctc ctggcacaga tgaggagaat
121 ctctcttttc tcctgcttga aggacagaca tgactttgga tttccccagg aggagtttgg
181 caaccagttc caaaaggctg aaaccatccc tgtcctccat gagatgatcc agcagatctt
241 caatctcttc agcacaaagg actcatctgc tg
//
LOCUS HUMMLC1AA 836 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human embryonic myosin alkali light chain (MLC1) mRNA, complete
cds.
ACCESSION M36172
KEYWORDS embryonic myosin alkali light chain.
SOURCE Human embryo, cDNA to mRNA, clone GT14.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 836)
AUTHORS Arnold,H.-H., Lohse,P., Seidel,U. and Bober,E.
TITLE A novel human myosin alkali light chain is developmentally
regulated: Expression in fetal cardiac and skeletal muscle and in
adult atria
JOURNAL Eur. J. Biochem. 178, 53-60 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 57 650 embryonic myosin alkali light chain (MLC1)
BASE COUNT 199 a 238 c 232 g 167 t
ORIGIN
1 cagtctctcg gtttcttctc agatcactcc tctgccaaag atcccaacaa gacaacatgg
61 ctcccaagaa gcctgagcct aagaaggagg cagccaagcc agctccagct ccagctccag
121 cccctgcacc agcccctgcc ccagctcctg aggctcccaa ggaacctgcc tttgacccca
181 agagtgtaaa gatagacttc actgccgacc agattgaaga gttcaaagag gccttttcat
241 tgtttgaccg gaccccgact ggagagatga agatcaccta cggccagtgc ggggatgtac
301 tgcgggccct gggccagaac cctaccaatg ccgaggtgct gcgtgtgctg ggcaagccca
361 agcctgaaga gatgaatgtc aagatgctgg actttgagac gttcttgccc atcctgcagc
421 acatttcccg caacaaggag cagggcacct atgaggactt cgtggagggc ctgcgtgtct
481 ttgacaagga gagcaatggc acggtcatgg gtgctgagct tcggcacgtc cttgccaccc
541 tgggagagaa gatgactgag gctgaagtgg agcagctgtt agctgggcaa gaggatgcca
601 atggctgcat caattatgaa gcctttgtca agcacatcat gtcagggtga agcagagtct
661 tccaggtgcc tggcccttgg ctttagccat accagggtga gttaaagaga ggccccggct
721 gggtgagctg agatggagtc ctcgacttat caccacacca ctgccccaag gaccttacag
781 gccctccctg ttaataaaca gctctaacac ggccaggctg ggctctggga ttctga
//
LOCUS MUSAIC2B 3113 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse interleukin 3 receptor-like protein (AIC2B) mRNA, complete
cds.
ACCESSION M34397
KEYWORDS IL-3 receptor; interleukin 3 receptor.
SOURCE Mouse, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3113)
AUTHORS Gorman,D.M., Itoh,N., Kitamura,T., Schreurs,J., Yonehara,S.,
Yahara,I., Arai K,-I. and Miyajima,A.
TITLE Cloning and expression of a novel interleukin 3 receptor-like gene:
Identification of another member of the cytokine receptor gene
family
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5459-5463 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.M.Gorman, 22-MAY-1990, for release after publication.
FEATURES from to/span description
pept 288 2978 interleukin 3 receptor-like protein (AIC2B)
precursor
sigp 288 353 interleukin 3 receptor-like protein signal
peptide
matp 354 2975 interleukin 3 receptor-like protein signal
peptide
BASE COUNT 727 a 955 c 806 g 625 t
ORIGIN
1 gagctgactg ggagatggtg gggagcgagc tactctggca gaactaaatg tcatggggca
61 cagataaata ggaagagcct gcaactcact ggcacttgga ggctcccgaa ggaggctgcc
121 tgtcgcccaa gcacagagcc acaaaggatg cagtctagga gggaagaatc acaagccctg
181 taagatgagt ggagccaaac ccccagccaa gcaccaatac agaaccccgg gacaatgagg
241 acacccccct gcccatagct tccagtgcag ccaccaaaag tgccaaaatg gaccagcaaa
301 tggcactcac atgggggctg tgctacatgg cactggtggc tctctgttgg ggacacgggg
361 tgacagaggc agaagaaacg gtccctctga agactctgca gtgctacaat gactacacca
421 accacatcat ctgcagctgg gcggacacag aggatgccca ggggctaatc aacatgaccc
481 tctatcacca gctagagaaa aaacagccag tgtcctgtga gctcagtgag aaactcatgt
541 ggtcagagtg cccgtcatcc caccgctgtg tgcccagaag atgtgtcatc ccctatacac
601 gattttccat cacaaacgaa gactactact ccttccggcc agatagtgat ctgggcatcc
661 agctcatggt gccacttgcc cagaatgtgc agccaccact tcccaagaac gtcagcatca
721 gctcctctga ggatcgtttc ctgctggagt ggagtgtgtc ccttggggat gcccaggtct
781 cctggctttc atcaaaggac atagagtttg aggtggctta taagcggctt caggactcct
841 gggaggatgc ctacagtctc cacactagca aatttcaggt gaatttcgag ccaaagctat
901 tcctacccaa cagcatctat gcgccccgtg tgcgcactcg gctgtacccg ggttcaagct
961 tgtctgggag acccagcaga tggagcccag aggctcactg ggactcccag ccaggggaca
1021 aggcccagcc acagaacctt caatgcttct ttgatgggat ccagtccctc cactgctcct
1081 gggaggtgtg gacccagacg actggctctg tttcctttgg gctcttctat cgccccagcc
1141 ctgtagctcc ggaggagaaa tgctctccgg tggtgaagga gccgccgggg gccagtgtct
1201 acacccgcta ccattgcagt ctacctgtgc ctgagcccag tgcacacagc cagtacacag
1261 tctctgttaa gcacctggaa caagggaagt tcatcatgag ctataaccac atccagatgg
1321 agcctccaac cctcaacctg accaagaaca gagacagcta cagcctgcat tgggaaactc
1381 agaagatggc ttactcattc attgagcaca cattccaggt ccagtacaag aagaaatcgg
1441 acagctggga ggacagcaag acagagaacc tagatcgagc ccatagcatg gacctctccc
1501 agctggagcc agacacctca tactgcgcca gggtgagggt caagcccatc tctaactacg
1561 atgggatctg gagcaagtgg agcgaagagt acacttggaa gactgactgg gtgatgccca
1621 cgctgtggat agtcctcatc ctggtctttc tcatcctcac cttgctcctg atccttcgct
1681 ttggctgtgt ctctgtatac aggacgtaca ggaagtggaa ggaaaagatc cccaacccca
1741 gcaagagcct cctgttccag gatggaggta aaggtctctg gcctcctggc agcatggcag
1801 ccttcgccac taagaacccc gctctccagg ggccacagag caggcttctt gctgagcaac
1861 agggggagtc atatgcacat ttggaagaca acaacgtgtc acctctcact atagaggacc
1921 ctaatataat tcgagttcca ccatccgggc ctgatacaac cccagctgcc tcatccgaat
1981 ccacagagca acttcccaat gttcaagtag agggaccaac tcctaacaga cctaggaagc
2041 aattacccag ctttgacttc aatgggccct acctggggcc tccccaatcc cactctctgc
2101 ctgatctccc agaccagctg ggttcccccc aggtgggtgg gagcctgaag ccagcactgc
2161 caggctcctt ggagtacatg tgtctggccc ctggaggtca agtgcaactg gttccattgt
2221 cccaggtgat ggggcagggc caggctatgg atgtgcagtg tgggtccagc ctggagacct
2281 cagggagccc ttctgtggag ccaaaggaga accctccagt tgagctgagc atggaggaac
2341 aggaggcacg ggacaaccca gtgactctgc ccataagctc tgggggccct gagggcagta
2401 tgatggcctc tgattatgtc actcctggag atccggtgct cactctgccc acagggcccc
2461 tgtctacctc tctgggcccc tctctagggt tgccctcagc ccaaagcccc agtctctgtc
2521 ttaagctgcc cagggtcccc tctggaagcc cagctctagg gccaccaggg tttgaggact
2581 atgtggagct gcctccaagt gtgagccagg ctgccaagtc ccctccaggc catcctgctc
2641 ctcctgtggc aagcagcccc acagtgatcc caggagagcc cagggaggaa gtgggcccag
2701 catccccaca tcccgaaggc ctccttgttc ttcagcaggt tggggactac tgcttcctcc
2761 ctggcctggg acctggctcc ctctcaccac acagtaagcc accctctcca agtctgtgtt
2821 ctgagactga ggacctagtc caggacttgt ctgtcaaaaa gtttccctat cagcccatgc
2881 cccaggcgcc agccattcag tttttcaagt ccctaaagca tcaggactac ctgtccctgc
2941 ccccttggga caatagccag tctgggaagg tgtgctgagt ctgtctcctc ccaatctcac
3001 cagcagcctg gcaccgcagc ctgtggtcct cagcctgagc atcaccacag aagcctctct
3061 gagttcacac tcctccttgc tcccagccct gacatggcaa tacccccacc tgt
//
LOCUS MUSIGHADP 363 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION M36209
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-6.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 363)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 363 Ig heavy-chain V region (AA at 1)
BASE COUNT 88 a 89 c 98 g 88 t
ORIGIN
1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagaaatcgc
301 tactatggta actactggta cttcgatgtc tggggcgcag ggaccacggt caccgtctcc
361 tca
//
LOCUS MUSIGHADQ 357 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION M36210
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H146-24B1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 357)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 357 Ig heavy-chain V region (AA at 1)
BASE COUNT 88 a 83 c 99 g 87 t
ORIGIN
1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagaggaaac
301 tataggtacg acgagtttgc ttactggggc caagggactc tggtcactgt ctctgca
//
LOCUS MUSIGHADR 336 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION M36211
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H147-25H1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 336)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 336 Ig heavy-chain V region (AA at 1)
BASE COUNT 81 a 77 c 94 g 84 t
ORIGIN
1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatggggc
301 tactatggta acctactggt acttcgatgt ctgggg
//
LOCUS MUSIGHADS 354 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H163-130F2.
ACCESSION M36212
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130F2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 354 Ig heavy-chain V region (AA at 1)
BASE COUNT 86 a 83 c 97 g 88 t
ORIGIN
1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatactat
301 aggtacgaca ggtttgctta ctggggccaa gggactctgg tcactgtctc tgca
//
LOCUS MUSIGHADT 357 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H163-130H9.
ACCESSION M36213
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130H9.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 357)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 357 Ig heavy-chain V region (AA at 1)
BASE COUNT 86 a 91 c 102 g 78 t
ORIGIN
1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg
61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg
121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac
241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aaggggctac
301 ggtagccact ggtacttcga tgtctggggc gcagggacca cggtcaccgt ctcctca
//
LOCUS MUSIGHADU 331 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H2-4C2.
ACCESSION M36214
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H2-4C2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 331)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 331 Ig heavy-chain V region (AA at 2)
BASE COUNT 94 a 79 c 83 g 75 t
ORIGIN
1 acctgagctg aagaagcctg gagagacagt caagatctcc tgcaaggctt ctggatatac
61 cttcacaaac tatggaatga attgggtgaa gcaggctcca ggaaagggtt taaagtggat
121 gggctggata aacacctaca ctggagagcc aacatatgct gatgacttca agggacggtt
181 tgccttccct ttggaaacct ctgccagcac tgcctatttg cagatcaaca acctcaaaaa
241 tgaggacacg gctacatatt tctgtgcaag atatgactac aactactggt acttcgatgt
301 ctggggcgca gggaccacgg tcaccgtctc c
//
LOCUS MUSIGHADV 354 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-1VH.
ACCESSION M36215
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 354 Ig heavy-chain V region (AA at 1)
BASE COUNT 88 a 86 c 99 g 81 t
ORIGIN
1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg
61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg
121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac
241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aagtcagggg
301 tatggttacg ttgctatgga ctactggggt caaggaacct cagtcaccgt ctcc
//
LOCUS MUSIGHADW 360 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-10VH.
ACCESSION M36216
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 360)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 360 Ig heavy-chain V region (AA at 1)
BASE COUNT 89 a 90 c 92 g 89 t
ORIGIN
1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatcttac
301 tacggtagta gctactactt tgactactgg ggccaaggca ccactctcac agtctcctca
//
LOCUS MUSIGHADX 360 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-22VH.
ACCESSION M36217
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-22.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 360)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 360 Ig heavy-chain V region (AA at 1)
BASE COUNT 91 a 89 c 96 g 84 t
ORIGIN
1 caggtgcagc tgaaggagtc aggacctggc ctggtggcgc cctcacagag cctgtccatc
61 acttgcactg tctctgggtt ttcattaacc agctatggtg tacactgggt tcgccagcct
121 ccaggaaagg gtctggagtg gctgggagta atatgggctg gtggaagcac aaattataat
181 tcggctctca tgtccagact gagcatcagc aaagacaact ccaagagcca agttttctta
241 aaaatgaaca gtctgcaaac tgatgacaca gccatgtact actgtgccag agggcactat
301 ggttacgacg gttactatgc tatggactac tggggtcaag gaacctcagt caccgtctcc
//
LOCUS MUSIGHADY 342 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-23VH.
ACCESSION M36218
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-23.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 342)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 342 Ig heavy-chain V region (AA at 1)
BASE COUNT 89 a 83 c 92 g 78 t
ORIGIN
1 gaggtccagc ttcagcagtc aggacctgag ctggtgaaac ctggggcctc agtgaagata
61 tcctgcaagg cttctggata cacattcact gactacaaca tgcactgggt gaagcagagc
121 catggaaaga gccttgagtg gattggatat atttatcctt acaatggtgg tactggctac
181 aaccagaagt tcaagagcaa ggcaacattg actgtagaca attcctccag cacagcctac
241 atggagctcc gcagcctgac atctgaggac tctgcagtct attactgtgc aagagggggg
301 ctacgccctt ttgcttactg gggccaaggg actctggtca ct
//
LOCUS MUSIGHADZ 357 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-5VH.
ACCESSION M36219
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 357)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 357 Ig heavy-chain V region (AA at 1)
BASE COUNT 87 a 88 c 100 g 82 t
ORIGIN
1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg
61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg
121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac
241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aagatgccct
301 tatggtaact acgtttggta cttcgatgtc tggggcgcag ggaccacggt caccgtc
//
LOCUS MUSIGHAEA 366 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-7VH.
ACCESSION M36220
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-7.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 366)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 366 Ig heavy-chain V region (AA at 1)
BASE COUNT 90 a 92 c 94 g 90 t
ORIGIN
1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagactc
61 tcctgtgcaa cttctgggtt caccttcact gattactaca tgagctgggt ccgccagcct
121 ccaggaaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca
181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc
241 ctctatcttc aaatgaacac cctgagagct gaggacagtg ccacttatta ctgtgcaaga
301 gacccccctt actacggcta cgctatggac tactggggtc aaggaacctc agtcaccgtc
361 tcctca
//
LOCUS MUSIGHAEB 328 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H230-2VH.
ACCESSION M36221
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 328)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 328 Ig heavy-chain V region (AA at 2)
BASE COUNT 87 a 86 c 80 g 75 t
ORIGIN
1 acctggcctg gtggcgccct cacagagcct gtccatcaca tgcactgtct cagggttctc
61 attaaccagc tatggtgtaa gctgggttcg ccagcctcca ggaaagggtc tggagtggct
121 gggagtaata tggggtgacg ggagcacaaa ttatcattca gctctcatat ccagactgag
181 catcagcaag gataactcca agagccaagt tttcttaaaa ctgaacagtc tgcaaactga
241 tgacacagcc acgtactact gtgccaaacc ttatgattac gaagggtact actttgacta
301 ctggggccaa ggcaccactc tcacagtc
//
LOCUS MUSIGHAEC 331 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H230-5VH.
ACCESSION M36222
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 331)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 331 Ig heavy-chain V region (AA at 2)
BASE COUNT 74 a 88 c 86 g 83 t
ORIGIN
1 ccctgggata ttgcagccct cccagaccct cagtctgact tgttctttct ctgggttttc
61 actgagcact tctggtatgg gtgtgagctg gattcgtcag ccttcaggaa agggtctgga
121 gtggctggca cacatttact gggatgatga caagcactat aacccatccc tgaagagccg
181 gctcacaatc tccaaggata cctccagcaa ccaggtattc ctcaagatca ccagtgtgga
241 cactgcagat actgccacat actactgtgc tcgaagagat ggttacgacg aggggtttgc
301 ttactggggc caagggactc tggtcactgt c
//
LOCUS MUSIGHAED 334 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H250-6VH.
ACCESSION M36223
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H250-6.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 334)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 334 Ig heavy-chain V region (AA at 2)
BASE COUNT 75 a 81 c 95 g 83 t
ORIGIN
1 gggaggctta gtgaagcctg gagggtccct gaaactctcc tgtgcagcct ctggattcac
61 tttcagtagc tatgccatgt cttgggttcg ccagtctcca gagaagaggc tggagtgggt
121 cgcagaaatt agtagtggtg gtagttacac ctactatcca gacactgtga cgggccgatt
181 caccatctcc agagacaatg ccaagaacac cctgtacctg gaaatgagca gtctgaggtc
241 tgaggacacg gccatgtatt actgtgcaag ggatctcgac tatggtaact cctggtttgc
301 ttactggggc caagggactc tggtcactgt ctct
//
LOCUS MUSIGHAEE 354 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H280-10VH.
ACCESSION M36224
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 354 Ig heavy-chain V region (AA at 1)
BASE COUNT 88 a 90 c 90 g 86 t
ORIGIN
1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatctaag
301 atcccgtact actttgacta ctggggccaa ggcaccactc tcacagtctc ctca
//
LOCUS MUSIGHAEF 354 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H280-15VH.
ACCESSION M36225
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 354 Ig heavy-chain V region (AA at 1)
BASE COUNT 95 a 88 c 89 g 82 t
ORIGIN
1 caggtccagc ttcagcagtc tggggctgaa ctggcaaaac ctggggcctc agtgaagatg
61 tcctgcaagg cttctggcta cacctttact agctactgga tgcactgggt aaaacagagg
121 cctggacagg gtctggaatg gattggatac attaatccta gcactggtta tactgagtac
181 aatcagaagt tcaaggacaa ggccacattg actgcagaca aatcctccag cacagcctac
241 atgcaactga gcagcctgac atctgaggac tctgcactgt attactgtgc aagatgggtc
301 tattactatg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctca
//
LOCUS MUSIGHAEG 360 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H35-C10VH.
ACCESSION M36226
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 360)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 360 Ig heavy-chain V region (AA at 1)
BASE COUNT 84 a 87 c 100 g 89 t
ORIGIN
1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagagggcga
301 actatgattc ccgcctggtt tgcttactgg ggccaaggga ctctggtcac tgtctctgca
//
LOCUS MUSIGHAEH 345 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H35-C7VH.
ACCESSION M36227
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C7.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 345)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 345 Ig heavy-chain V region (AA at 1)
BASE COUNT 85 a 78 c 96 g 83 t 3 others
ORIGIN
1 caggttcagc tgcagcagtc tggagctgaa ctggtaaagc ctggggcttc agtgaagttg
61 tcctgcaagg cttctggcta caccttcaca agctatgata taaactgggt gaggcagagg
121 cctgaacagg gacttgagtg gattggatgg atttttcctg gagatggtag tactaagtac
181 aatgagaagt tcaagggcaa ggccacactg actacagaca aatcctccag cacagcctac
241 atgcagctca gcaggctgac atctgaggac tctgctgtct atttctgtgc aaactggnac
301 gggtttgctt actggnncca agggactctg gtcactgtct ctgca
//
LOCUS MUSIGHAEI 354 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H61-15VH.
ACCESSION M36228
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H61-15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 354)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 354 Ig heavy-chain V region (AA at 1)
BASE COUNT 90 a 89 c 93 g 82 t
ORIGIN
1 caggtgcagc tgaaggagtc aggacctggc ctggtggcgc cctcacagag cctgtccatc
61 acttgcactg tctctgggtt ttcattaacc agctatggtg tacactgggt tcgccagcct
121 ccaggaaagg gtctggagtg gctgggagta atatgggctg gtggaagcac aaattataat
181 tcggctctca tgtccagact gagcatcagc aaagacaact ccaagagcca agttttctta
241 aaaatgaaca gtctgcaaac tgatgacaca gccatgtact actgtgccag agggcattac
301 tacggctact atgctatgga ctactggggt caaggaacct cagtcaccgt ctcc
//
LOCUS MUSIGHAEJ 351 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H9-A15VH.
ACCESSION M36229
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-A15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 351)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 351 Ig heavy-chain V region (AA at 1)
BASE COUNT 96 a 82 c 91 g 81 t 1 others
ORIGIN
1 gttcagctgc agcagtctgg agctgagctg atgaagcctg ggncctcagt gaagatatcc
61 tgcaaggcta ctggctacag attcagtagc tactggatgg agtgggtaaa gcagaggcct
121 ggacatggcc ttgagtggat tggagagctt ttacctggaa atagtaatac taattacaat
181 gagaagttca agggcaaggc cacattcgct gcagatacat cctccaacac agcctacatg
241 caactcagca gcctgacatc tgaggactct gccgtctatt actgtgcaag agatctgaaa
301 aatggtgact cgttctactt tgactactgg ggccaaggca ccactctcac a
//
LOCUS MUSIGHAEK 333 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H9-D3VH.
ACCESSION M36230
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-D3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 333)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 333 Ig heavy-chain V region (AA at 1)
BASE COUNT 95 a 73 c 84 g 80 t 1 others
ORIGIN
1 cagatccagt tggtccagtc tgnacctgag ctgaagaagc ctggagagac agtcaagatc
61 tcctgcaagg cttctgggta taccttcaca aactatggaa tgaactgggt gaagcaggct
121 ccaggaaagg atttaaagtg gatgggctgg ataaacacct atactggaga gccaacattt
181 gctgatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat
241 ttgcagatca acaacctcaa aaatgaggac acggctacat atttctgtgc aagaagagaa
301 aattcctact ggtatttcga tgtctggggc gca
//
LOCUS MUSIGHAEL 334 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
IC5-3A4HV.
ACCESSION M36231
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma IC5-3A4.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 334)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 334 Ig heavy-chain V region (AA at 2)
BASE COUNT 86 a 69 c 95 g 84 t
ORIGIN
1 gggagactta gtgaagcctg gagggtccct gaaactctcc tgtgcagcct ctggattcac
61 tttcagtagt tatggcatgt cttggtttcg ccagattccg gagaagaggc tggagtgggt
121 cgcaaccatt agtggtggtg gtagttatat ttactatgta gacagtgtga aggggcgatt
181 caccatctcc agagacaatg ccaagaacaa cctgtacctg caaatgagca gtctgaggtc
241 tgaggacacg gccttgtatt attgtgcaag agggaatata atataccacg aggactatgc
301 tatggactac tggggtcaag gaacctcagt cacc
//
LOCUS MUSIGHAEM 328 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
L11-2E2HV.
ACCESSION M36232
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L11-2E2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 328)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 328 Ig heavy-chain V region (AA at 2)
BASE COUNT 85 a 78 c 96 g 69 t
ORIGIN
1 ggcagagctt gtgaagccag gggcctcagt caagttgtcc tgcacagctt ctggcttcaa
61 cattaatgac acctatatgc actgggtgaa gcagaggcct gaacagggcc tggagtggat
121 tggaaggatt gatcctgcga atggtaatac taaatatgac ccgaagttcc agggcaaggc
181 cactataaca gcagacacat cctccaacac agcctacctg cagttcagca gcctgacatc
241 tgaggacact gccgtctatt actgtgatag agaggactac ggtagtaggt acggggggtg
301 gtacttcgat gtctggggcg cagggacc
//
LOCUS MUSIGHAEN 331 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
L2-10C1HV.
ACCESSION M36233
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-10C1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 331)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 331 Ig heavy-chain V region (AA at 2)
BASE COUNT 90 a 78 c 89 g 74 t
ORIGIN
1 acctgagctg aagaagcctg gagagacagt caagatctcc tgcaaggctt ctgggtatac
61 cttcacaaac tatggaatga actgggtgaa gcaggctcca ggaaagggtt taaagtggat
121 gggctggata aacacctaca ctggagagcc aacatatgct gatgacttca agggacggtt
181 tgccttctct ttggaaacct ctgccagcac tgcctatttg cagatcaaca acctcaaaaa
241 tgaggacacg gctacatatt tctgtgcaag atgggactac gggtactggt acttcgatgt
301 ctggggcgca gggaccacgg tcaccgtctc c
//
LOCUS MUSIGHAEO 328 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
L2-5D2HV.
ACCESSION M36234
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-5D2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 328)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 328 Ig heavy-chain V region (AA at 2)
BASE COUNT 76 a 85 c 82 g 83 t 2 others
ORIGIN
1 ccctgggata ttgcagccct cccagaccct cagtctgact tgttctttct ctgggttttc
61 actgagcact tctggtatgg gtgnnagctg gattcgtcag ccttcaggaa agggtctgga
121 gtggctggca cacatttact gggatgatga caagcgctat aacccatccc tgaagagccg
181 gctcacaatc tccaaggata cctccagaaa ccaggtattc ctcaagatca ccagtgtgga
241 cactgcagat actgccacat actactgtgc tcggtcgtat ggtaatgggg attactatgc
301 tatggactac tggggtcaag gaacctca
//
LOCUS MUSIGHAEP 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H120-2HV.
ACCESSION M36235
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H120-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 83 a 89 c 78 g 74 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgta cacgttcgga
301 ggggggacca agctggaaat aaaa
//
LOCUS MUSIGHAEQ 317 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H130-5VK.
ACCESSION M36236
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 317)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 317 Ig heavy-chain V region (AA at 1)
BASE COUNT 91 a 76 c 71 g 79 t
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttccgtacac gttcggaggg
301 gggaccaagc tggaaat
//
LOCUS MUSIGHAER 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H130-6VK.
ACCESSION M36237
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-6.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 78 a 92 c 78 g 76 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt
301 gctgggacca agctggagct gaaa
//
LOCUS MUSIGHAES 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H146-24B1VK.
ACCESSION M36238
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H146-24B1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 83 a 90 c 75 g 76 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc
301 tcggggacaa agttggaaat aaaa
//
LOCUS MUSIGHAET 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H147-25H1VK.
ACCESSION M36239
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H147-25H1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 84 a 89 c 77 g 74 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact aacgttcgga
301 ggggggacca agctggaaat aaaa
//
LOCUS MUSIGHAEU 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H163-130H9.
ACCESSION M36240
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130-F2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 78 a 92 c 78 g 76 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt
301 gctgggacca agctggagct gaaa
//
LOCUS MUSIGHAEV 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H163-130H9.
ACCESSION M36241
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130H9.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 78 a 92 c 78 g 76 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt
301 gctgggacca agctggagct gaaa
//
LOCUS MUSIGHAEW 321 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H2-4C2.
ACCESSION M36242
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H2-4C2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 321 Ig heavy-chain V region (AA at 1)
BASE COUNT 93 a 79 c 69 g 80 t
ORIGIN
1 aatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
121 gatggaactc ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
241 gaagatattg ccacttacta ttgtcagcac tatagtgagc ttccgtggac gttcggtgga
301 ggcaccaagc tggaaatcaa a
//
LOCUS MUSIGHAEX 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-1.
ACCESSION M36243
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 83 a 90 c 75 g 76 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc
301 tcggggacaa agttggaaat aaaa
//
LOCUS MUSIGHAEY 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-10.
ACCESSION M36244
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 83 a 90 c 75 g 76 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc
301 tcggggacaa agttggaaat aaaa
//
LOCUS MUSIGHAEZ 303 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-22.
ACCESSION M36245
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-22.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 303)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 303 Ig heavy-chain V region (AA at 1)
BASE COUNT 82 a 78 c 73 g 70 t
ORIGIN
1 tctcacaaat tcatgtccac atcagtagga gacagggtca gcatcacctg caaggccagt
61 caggatgtgg gtactgctgt agcctggtat caacagaaac cagggcaatc tcctaaacta
121 ctgatttact gggcatccac ccggcacact ggagtccctg atcgcttcac aggcagtgga
181 tctgggacag atttcactct caccattagc aatgtgcagt ctgaagactt ggcagattat
241 ttctgtcagc aatatagcag ctatcctcgg acgttcggtg gaggcaccaa gctggaaatc
301 aaa
//
LOCUS MUSIGHAFA 303 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-23.
ACCESSION M36246
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-23.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 303)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 303 Ig heavy-chain V region (AA at 1)
BASE COUNT 78 a 71 c 70 g 83 t 1 others
ORIGIN
1 tctccatcct ccttatctgc ctctctggga gaaagagtca gtctcacttg tcgggcaagt
61 caggacattg gtagtagctt aaactggctt cagcaggaac cagatggaac tattaaacgc
121 ctgatctacg ccacatccag tttagattct ggtgtcccca aaaggttcag tggcagtagg
181 tctgggtcag attattctct caccatcagc agccttgagt ctgaagattt tgtagactat
241 tactgtctac aatatgctag ttctccgtac acgttcggag gggggaccaa gctgnaaata
301 aaa
//
LOCUS MUSIGHAFB 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-5.
ACCESSION M36247
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 79 a 92 c 77 g 76 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt
301 gctgggacca agctggagct gaaa
//
LOCUS MUSIGHAFC 297 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H220-7.
ACCESSION M36248
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-7.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 297)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 297 Ig heavy-chain V region (AA at 1)
BASE COUNT 78 a 72 c 73 g 69 t 5 others
ORIGIN
1 aaattcatgt ccacatcagt aggaganagg gtcagcatca cctgcaaggc cagtcaggat
61 gtgagtactg ctgtagcctg gtatcaacag aaaccaggac aatctcctaa actactgatt
121 tactcggcat cctaccggta cactggagtc cctgatcgct tcactggcag tggatctggg
181 nnggatttca ctttcaccat cagcagtgtg caggctgaag acctggcagt ttattactgt
241 cagcnacatn atagtactcc gtggacgttc ggtggaggca ccaagctgga aatcaaa
//
LOCUS MUSIGHAFD 315 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H230-2.
ACCESSION M36249
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 315)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 315 Ig heavy-chain V region (AA at 1)
BASE COUNT 83 a 80 c 75 g 77 t
ORIGIN
1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc
61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat
181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccattcac gttcggctcg
301 gggacaaagt tggaa
//
LOCUS MUSIGHAFE 303 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H230-5.
ACCESSION M36250
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 303)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 303 Ig heavy-chain V region (AA at 1)
BASE COUNT 77 a 77 c 75 g 74 t
ORIGIN
1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc
61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat
181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccgtggac gttcggtgga
301 ggc
//
LOCUS MUSIGHAFF 318 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H250-6.
ACCESSION M36251
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H250-6.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 318 Ig heavy-chain V region (AA at 1)
BASE COUNT 83 a 81 c 77 g 76 t 1 others
ORIGIN
1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc
61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat
181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccgtgnac gttcggtgga
301 ggcaccaagc tggaaatc
//
LOCUS MUSIGHAFG 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H280-10.
ACCESSION M36252
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 79 a 92 c 77 g 76 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt
301 gctgggacca agctggagct gaaa
//
LOCUS MUSIGHAFH 312 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H280-15.
ACCESSION M36253
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 312 Ig heavy-chain V region (AA at 1)
BASE COUNT 72 a 88 c 79 g 73 t
ORIGIN
1 gaaaatgtgc tcacccagtc tccagcaatc atgtctgcat ctccagggga aaaggtcacc
61 atgacctgca gggccagctc aagtgtaagt tccagttact tgcactggta ccagcagaag
121 tcaggtgcct cccccaaact ctggatttat agcacatcca acttggcttc tggagtccct
181 gctcgcttca gtggcagtgg gtctggcacc tcttactctc tcacaatcag cagtgtggag
241 gctgaagatg ctgccactta ttactgccag cagtacagtg gttacccgta cacgttcgga
301 ggggggacca ag
//
LOCUS MUSIGHAFI 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H35-C10.
ACCESSION M36254
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 79 a 92 c 77 g 76 t
ORIGIN
1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt
301 gctgggacca agctggagct gaaa
//
LOCUS MUSIGHAFJ 302 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H35-C7.
ACCESSION M36255
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C7.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 302)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 302 Ig heavy-chain V region (AA at 1)
BASE COUNT 75 a 76 c 77 g 74 t
ORIGIN
1 gcttctttgg ctgtgtctct agggcagagg gccaccatat cctgcagagc cagtgaaagt
61 gttgatagtt atggcaatag ttttatgcac tggtaccagc agaaaccagg acagccaccc
121 aaactcctca tctatcgtgc atccaaccta gaatctggga tccctgccag gttcagtggc
181 agtgggtcta ggacagactt caccctcaca attaatcctg tggaggctga tgatgttgca
241 acctattact gtcagcaaag taatgaggat ccgctcacgt tcggtgctgg gaccaagctg
301 ga
//
LOCUS MUSIGHAFK 321 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H61-15.
ACCESSION M36256
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H61-15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
BASE COUNT 95 a 75 c 78 g 71 t 2 others
ORIGIN
1 gacatcaaga tgacccagtc tccatcctcc atgtatgcnt cgctgggaga gagagtcact
61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca
121 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca
181 agattcagtg gcagtggatc tgggcaanat tattctctaa ccatcagcag cctggagtct
241 gacgatacag caacttatta ctgtctacag catggtgaga gcccgtacac gttcggaggg
301 gggaccaagc tggaaataaa a
//
LOCUS MUSIGHAFL 324 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H9-A15.
ACCESSION M36257
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-A15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 324)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 324 Ig heavy-chain V region (AA at 1)
BASE COUNT 72 a 94 c 76 g 82 t
ORIGIN
1 caaattgttc tttcccagtc tccagcagtc atgtctgcat ctcctgggga gaaggtcacc
61 ttgacctgca gtgccagctc aagtgtaagt tccacctact tgtactggta ccagcagaag
121 ccaggatcct cccccaaact ctggatttat agcacatcca acctggcttc tggagtccct
181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaatcag cagcatggag
241 gctgaagatg ctgcctctta tttctgccat cagtggagta gttacccacc gacgttcggt
301 ggaggcacca agctggaaat caaa
//
LOCUS MUSIGHAFM 300 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
H9-D3.
ACCESSION M36258
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-D3.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 300)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 300 Ig heavy-chain V region (AA at 1)
BASE COUNT 81 a 69 c 63 g 87 t
ORIGIN
1 cagattacat cctccctgtc tgtctctctg ggagacagag tcaccatcag ttgcagtgca
61 agtcagggca ttaacagtta tttaaactgg tatcagcaga aaccggatgg aactgttaaa
121 ctcctgatct atttcacatc aattttatat tcaggagtcc catcaaggtt cagtggcagt
181 gggtctggga cagattattc tctcaccatc agcaacttgg aacctgaaga tattgccact
241 tactattgtc agcagtatac tattcttcca ttcacgttcg gctcggggac aaagttggaa
//
LOCUS MUSIGHAFN 318 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
IC5-3A4.
ACCESSION M36259
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma IC5-3A4.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 318)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 318 Ig heavy-chain V region (AA at 1)
BASE COUNT 87 a 80 c 75 g 76 t
ORIGIN
1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc
61 atcccctgca aggccagtca ggatgtaggt actgctgtag cctggtatca acagaaacca
121 ggacaatctc ctaaaatact gatttactgg gcatccaacc ggcaaattgg agtccctgat
181 cgcttcacag gcagtggatc tgggacagat ttcactctca ccattagcta tgtgcagtct
241 gaagacttgg cagattattt ctgtcagcaa tatagcacct atcccacgtt cggtgctggg
301 accaagctgg agctgaaa
//
LOCUS MUSIGHAFO 312 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
L11-2E2.
ACCESSION M36260
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L11-2E2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 312)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 312 Ig heavy-chain V region (AA at 1)
BASE COUNT 92 a 76 c 67 g 77 t
ORIGIN
1 gacatccaga tgacacagtc tccatcctca ctgtctgcat ctctgggagg caaagtcacc
61 atcacttgca aggcaagcca agacattaac aagtatatag cttggtacca acacaagcct
121 ggaaaaggtc ctaggctgtt catacattat acatctacat tacagccagg catcccatca
181 aggttcagtg gaagtgggtc tgggagacat tattccttca gcatcagcaa cctggagcct
241 gaagatattg caacttatta ttgtctacag tatgataatc ttctgacgtt cggtggaggc
301 accaagctgg aa
//
LOCUS MUSIGHAFP 321 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
L2-10C1.
ACCESSION M36261
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-10C1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 321)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 321 Ig heavy-chain V region (AA at 1)
BASE COUNT 93 a 77 c 71 g 80 t
ORIGIN
1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttccgtggac gttcggtgga
301 ggcaccaagc tggaaatcaa a
//
LOCUS MUSIGHAFQ 288 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
L2-5D2.
ACCESSION M36262
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-5D2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 288)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 288 Ig heavy-chain V region (AA at 1)
BASE COUNT 78 a 65 c 68 g 73 t 4 others
ORIGIN
1 cagtctccag cctccctgtc tgcatctgtg ggagaaactg tcaccatcac atgtcgagca
61 agtgggaata ttcacaatta tttagcatgg tatcagcaga nacagggaaa atctcctcag
121 ctcctggtct ataatgcaaa aaccttagca gatggtgtgc catcaaggtt cagtggcagt
181 ggatcaggaa cacaatattc tctcaagatc aacagcctgc agcctgaaga ttttgggagt
241 tattactgtc aacatttttg gnnnactccg tacacgttcg gagggggg
//
LOCUS MUSIGHAFR 363 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION M36207
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H120-2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 363)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 363 Ig heavy-chain V region (AA at 1)
BASE COUNT 85 a 88 c 101 g 89 t
ORIGIN
1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc gaggatttac
301 tacggtagta gctactggta cttcgatgtc tggggcgcag ggaccacggt caccgtctcc
361 tca
//
LOCUS MUSIGHAFS 348 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION M36208
KEYWORDS immunoglobulin heavy-chain; processed gene; variable region.
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-5.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
TITLE A set of closely related antibodies dominates the primary antibody
response to influenza virus hemagglutinin
JOURNAL J. Immunol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Kavaler, 06-JUL-1990.
FEATURES from to/span description
pept < 1 > 348 Ig heavy-chain V region (AA at 1)
BASE COUNT 96 a 80 c 93 g 78 t 1 others
ORIGIN
1 cagatccagt tggtccagtc tgnacctgag ctgaagaagc ctggagagac agtcaagatc
61 tcctgcaagg cttctgggta taccttcaca aactatggaa tgaactgggt gaagcaggct
121 ccaggaaagg gtttaaagtg gatgggctgg ataaacacct acactggaga gccaacatat
181 gctgatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat
241 ttgcagatca acaacctcaa aaatgaggac acggctacat atttctgtgc aagaggagat
301 ggtaactact ggtacttcga tgtctggggc gcagggacca cggtcacc
//
LOCUS MUSMHEA2A1 478 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse MHC class II A-beta 2 gene, exon 2.
ACCESSION M30158
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 1 of 2
SOURCE Mouse (haplotype SAX2) liver DNA.
ORGANISM Mus saxicola
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 478)
AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
TITLE Structure of class II genes in wild mouse Mus saxicola: Functional
and evolutionary implications
JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Cam, 27-NOV-1989, for release after publication.
FEATURES from to/span description
pept / 154 + 423 A-beta 2 chain, exon 2 (AA at 156)
IVS < 1 153 A-beta 2 chain intron A
IVS 424 > 478 A-beta 2 chain intron B
BASE COUNT 118 a 99 c 132 g 129 t
ORIGIN
1 aaataaatgc caataagcat taaatgacat tgactttgtc tcttaatgtt gggcttttga
61 gcacattgct atctgactta gtgtaccagg tacaggcttt ccctgaggtg cggctggttc
121 ctaactggtt cctaatcctt ttatggtttc cagagaattt tgtgattcag gcaaaggcag
181 actgttactt caccaatggg acagaaaagg ttcacttcct ggtcaggttc atcttcaacc
241 tggaggagta tttacatttc gacagtgacc tggggatgtt cgtggcgctg acggagctgg
301 gggaacctga cgctgaccag tggaacaaac gcctagatct cctggagatg agcagagccg
361 ctgtgaacat ggtctgcagg cagaagtaca agctgggggc ccccttcatc gtggagagaa
421 atggtgggta acagcaaggt ctaggggttc tccgtgagaa ccttgtaata ggcagttt
//
LOCUS MUSMHEA2A2 526 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse MHC class II A-beta 2 gene, exon 3.
ACCESSION M30159
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 2 of 2
SOURCE Mouse (haplotype SAX2) liver DNA.
ORGANISM Mus saxicola
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 526)
AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
TITLE Structure of class II genes in wild mouse Mus saxicola: Functional
and evolutionary implications
JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Cam, 27-NOV-1989, for release after publication.
FEATURES from to/span description
pept + 49 / 330 A-beta 2 chain, exon 3 (AA at 49)
IVS < 1 48 A-beta 2 chain intron B
IVS 331 > 526 A-beta 2 chain intron C
BASE COUNT 115 a 135 c 136 g 140 t
ORIGIN
1 tctagatgct cctgtccttg ctgtgccttt tgtttcttct cttgccagtg cctccagagg
61 tgacagtgta cccagagagg accccattgc tgcagcagca caacctgctg ctctgctctg
121 tgacaggctt ctaccccagg gacataagtg tcaagtggtt ccggaatgga caggaggagc
181 ggtctggggt catgtccact ggccttgtta ggaatggaga ctggaccttc cagacaacag
241 taatgctgga aatgatccca gagcttggtg acatctacag ctgcctcgtg gagcaccccg
301 gcctcctgag acctgtttct gtggcatgga gtgagcttca tttagttttc tgtatttctg
361 gaccttgacc tgtaggagct atcatttctg tgttcatctt gatgacaaga caagttcccc
421 tgatctaaga atcctagtgc cagggcagga gaaaacagga caggcatcca ttctctgaat
481 gtccaagaga ggctgttaac ccacaacttg cctgtaatcc tgggac
//
LOCUS MUSMHEAII1 491 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse MHC class II A-beta 2 gene, exon 2.
ACCESSION M30156
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 1 of 2
SOURCE Mouse (haplotype SAX1) liver DNA.
ORGANISM Mus saxicola
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 491)
AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
TITLE Structure of class II genes in wild mouse Mus saxicola: Functional
and evolutionary implications
JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Cam, 27-NOV-1989, for release after publication.
FEATURES from to/span description
pept / 158 + 427 A-beta 2 chain, exon 2 (AA at 160)
IVS < 1 157 A-beta 2 chain intron A
IVS 428 > 491 A-beta 2 chain intron B
BASE COUNT 123 a 97 c 135 g 136 t
ORIGIN
1 aaataaatgt caataagcat taaatgacat taagtgactt tgtctcttaa tgttgggctt
61 ttgagcacat tgctatctga cttagtgtac caggtacagg ctttcccaga ggtgcggctg
121 gttcctaact ggttcctaat ccttttatgg tttccagaga attttgtgat tcaggcaaag
181 gcggactgtt acttcaccaa tgggacagaa aaggttcact tcctggtcag gttcatcttc
241 aacctggagg aatatttaca tttcgacagt gacctgggga tgtttgtggc gctgacggag
301 ctgggggaac ctgacgctga ccagtggaac aaacgcctgg atctcctgga gacgagcaga
361 gccgctgtga acatggtctg caggcagaag tacaatctgg gggccccctt cattgtggag
421 agaaatggtg ggtaacagca aggtctaggg gttctccgtg agaaccttgt aataggcatt
481 tgaaaggttt t
//
LOCUS MUSMHEAII2 592 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse MHC class II A-beta 2 gene, exon 3.
ACCESSION M30157
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 2 of 2
SOURCE Mouse (haplotype SAX1) liver DNA.
ORGANISM Mus saxicola
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 592)
AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
TITLE Structure of class II genes in wild mouse Mus saxicola: Functional
and evolutionary implications
JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Cam, 27-NOV-1989, for release after publication.
FEATURES from to/span description
pept + 49 / 330 A-beta 2 chain, exon 3 (AA at 50)
IVS < 1 48 A-beta 2 chain intron B
IVS 331 > 592 A-beta 2 chain intron C
BASE COUNT 138 a 147 c 148 g 159 t
ORIGIN
1 tctagatgct cctgtccttg ctgtgccttt tgtttcttct cttgccagtg tctccagagg
61 tgacagtgta cccagagagg accccattgc tgcagcagca caacctgctg ctctgctctg
121 tgacaggctt ctatcccggg aacataagtg tcaagtggtt ccggaatgga caggaggaga
181 ggtctggggt catgtccact ggccttgtta ggaatggaga ctggaccttc cagacaacag
241 taatgctgga aataatccca gagcttggtg acatctacag ctgcctcgtg gagcaccccg
301 gcctcctgag acctgtttct gtggcatgga gtgagcttta ttagttttct gtaattctgg
361 accttgacct gtaggagctg tcatttctgt gtcttgatga caagacaagt tcccctgatc
421 taagaatcct aatgccaggg gcaggagaaa acaggacagg catccattct ctgaatgtcc
481 aagagataga taaggaggct gttaacccac aacttgcctc taaccctggg acctagtatc
541 ctttagcact ttaatcttta cagacactgc cactggctat aagtcaggga tc
//
LOCUS MUSMHEB2B1 518 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse MHC class II E-beta 2 gene, exon 2.
ACCESSION M30160
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 1 of 2
SOURCE Mouse (haplotype SAX) liver DNA.
ORGANISM Mus saxicola
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 518)
AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
TITLE Structure of class II genes in wild mouse Mus saxicola: Functional
and evolutionary implications
JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Cam, 27-NOV-1989, for release after publication.
FEATURES from to/span description
pept / 92 + 358 E-beta 2 chain, exon 2 (AA at 94)
IVS < 1 91 E-beta 2 chain intron A
IVS 359 > 518 E-beta 2 chain intron B
BASE COUNT 120 a 94 c 151 g 153 t
ORIGIN
1 ctgcaggtgt cacagttcct ttcactgact gccattctgg agcattgtct gtcctcacag
61 acatcctgta attgtgttca tgtacccaca gcacgttttc tggagcagtt gaaggctgag
121 tgtcactact tcaatgggaa ggagcgtgtg tggagtgtga ccagattcat ctataaccag
181 gaagagtttg tccgctttaa cactgacttt gggaagttcc tggcagtgac tgagctgggg
241 cggcccatag ctgagtactt gaacacccag aaggacatgc tggacaatta tcgtgcctca
301 gtggacaggt gcagaaataa ctatgacctt gtggatatct tcatgttgaa cttaaaaggt
361 aagcattaga tagagagtag atgggttgag gggtgtgtgt gtgtgtgtgt gtgtgtgtgt
421 gtgtgtgtgt gtgtgtaaga gaggatagtt ctgtgtcacg tgtgtggaga tcataagaca
481 ggttactgat tctcgccatc tatcctatgg tcctgaga
//
LOCUS MUSMHEB2B2 573 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse MHC class II E-beta 2 gene, exon 3.
ACCESSION M30161
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 2 of 2
SOURCE Mouse (haplotype SAX) liver DNA.
ORGANISM Mus saxicola
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 573)
AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
TITLE Structure of class II genes in wild mouse Mus saxicola: Functional
and evolutionary implications
JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Cam, 27-NOV-1989, for release after publication.
FEATURES from to/span description
pept + 97 / 378 E-beta 2 chain, exon 3 (AA at 97)
IVS < 1 96 E-beta 2 chain intron B
IVS 379 > 573 E-beta 2 chain intron C
BASE COUNT 138 a 155 c 143 g 137 t
ORIGIN
1 gagatcttta gcctgggtgg atggaggcag gtaggcagcc cacagggaga acagctctat
61 cctcagggct ctctctcacc tccctttctc ctctagctga gcccaaggtg actgtgtatc
121 ctgcaaagac acagcctctg gaacaccaca acctcctggt ctgctctgtg aatgacttct
181 accctggcag cattgaagtc agatggttcc ggaatggcga ggaggagaag actggagttg
241 tgtccaccgg actgatccaa aatggagatt ggacctacca gaccctgttg atgctggaga
301 tggttcctca gggtggagag gtttacacct gccaggtgga gcatcccagc ctgaccagcc
361 ctgtcacagt ggagtggagt gagtgggaaa cccttgactc tgcaaatgcc caaccaccat
421 ataataggag tgactttctc tgcctgttcc ctttctctga ccctgtaaat ccctgcctac
481 cagggaagca catgggtgac tccacaggaa aaatgaatac aatttccaag ttactagctt
541 ttattcctca cctagttcaa catctatgcc agt
//
LOCUS MUSMHEBI1 445 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse MHC class II E-beta 1 gene, exon 2.
ACCESSION M30154
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 1 of 2
SOURCE Mouse (haplotype SAX) liver DNA.
ORGANISM Mus saxicola
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 445)
AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
TITLE Structure of class II genes in wild mouse Mus saxicola: Functional
and evolutionary implications
JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Cam, 27-NOV-1989, for release after publication.
FEATURES from to/span description
pept / 11 + 280 E-beta 1 chain, exon 2 (AA at 13)
IVS < 1 10 E-beta 1 chain intron A
IVS 281 > 445 E-beta 1 chain intron B
BASE COUNT 101 a 108 c 161 g 75 t
ORIGIN
1 gtctccacag cacggttttt ggaacaagct aaacatgagt gtcatttcta caacggaacg
61 cagcgcgtgc ggtttctgtt gagacaaatc cacaacaggg aggagaacgt gcgcttcgac
121 agcgacctgg gcgagttccg cgcagtgaac gagctggggc ggccagacgc cgagtacttc
181 aacagccaga aggaggtcct ggaggatttg cgggccgctg tggacacgta ctgcagacac
241 aactacgaga tcttagataa cttccttgtg ccgcggagag gtgagacagg aggcggtggg
301 tggggcggaa ccacggtgag ggtggggctg tggggagcag caggaggcgg tgcgcacgtg
361 cgcaggagcc gcaggggatg ctgggttccc tgccgctgga gccacaggcg gctttaagca
421 gcctcatggc aggagaacag aattc
//
LOCUS MUSMHEBI2 525 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse MHC class II E-beta 1 gene, exon 3.
ACCESSION M30155
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SEGMENT 2 of 2
SOURCE Mouse (haplotype SAX) liver DNA.
ORGANISM Mus saxicola
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 525)
AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
TITLE Structure of class II genes in wild mouse Mus saxicola: Functional
and evolutionary implications
JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Cam, 27-NOV-1989, for release after publication.
FEATURES from to/span description
pept + 102 / 383 E-beta 1 chain, exon 3 (AA at 102)
IVS < 1 101 E-beta 1 chain intron C
IVS 384 > 525 E-beta 1 chain intron D
BASE COUNT 116 a 144 c 141 g 124 t
ORIGIN
1 ggatccagtc tggatggatg gaggcaggca ggcaggcagg cagcctacag gaaggacagc
61 tccaccctca tggcaccttc tcacctctct ttctcctcta gttgagccta cagtgactgt
121 gtaccctaca aagacacagc ccctggaaca ccacaacctc ctggtctgct ctgtgagtga
181 cttctaccca ggcagcattg aagtcagatg gttccggaat ggtgaggagg agaagacggg
241 aattgtgtcc actggcctga tccgaaatgg agactggact ttccagatgc tggtgatgct
301 ggagacggtt cctcaggtgg aagaggttta cacctgccag gtggagcatc ccagcctgac
361 cagccctgtc acagtccagt ggagtgagtg ataacttcca tactctgtga atgtcattcc
421 accagggagg gggcgtggct tatccctgcc tgtcagcttt ctctgcccac atactctttc
481 cactggtttt atgctgtcct gactttcacc acagcttaaa gggaa
//
LOCUS MUSMMTVA 448 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse mammary tumor virus long terminal repeat.
ACCESSION M35959
KEYWORDS long terminal repeat.
SOURCE Mouse lymphoma cell line MLA DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 448)
AUTHORS Yanagawa,S.-I., Murakami,A. and Tanaka,H.
TITLE Extra mouse mammary tumor proviruses in DBA/2 mouse lymphomas
acquire a selective advantage in lymphocytes by alteration in the
U3 region of the long terminal repeat
JOURNAL J. Virol. 64, 2474-2483 (1990)
STANDARD simple staff_entry
BASE COUNT 150 a 70 c 95 g 133 t
ORIGIN
1 aatccaatag gtagattatt aattactatg ttaagaaatg aatcattatc ttttagtact
61 atttttactc aaattcagaa gttagaaatg ggaatagaaa atagaaagag acgctcagcc
121 tcagttaaag aacaggtgca agagctaacg gccgcaggcc tagaagtaaa aaagggaaag
181 aagagtgtgc ttgtcaaaat aagacacagg tggtggcaac cagggactta taggggacct
241 tacatctaca gaccaacaga tgcccccttt cttaaaacaa ggatgtgaga caggtggttt
301 cctgagtcgg tttggtatca aatgttctaa tctgagctct tagtgttcta ttttcctgtg
361 ttcttttgga atctatccaa gtcttatgta aatgcttatg taaaccatga tataaaagag
421 tgctgatttt tttgagtaaa tttgcaac
//
LOCUS MUSMMTVB 527 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse mammary tumor virus long terminal repeat.
ACCESSION M35960
KEYWORDS long terminal repeat.
SOURCE Mouse lymphoma cell line DL8 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 527)
AUTHORS Yanagawa,S.-I., Murakami,A. and Tanaka,H.
TITLE Extra mouse mammary tumor proviruses in DBA/2 mouse lymphomas
acquire a selective advantage in lymphocytes by alteration in the
U3 region of the long terminal repeat
JOURNAL J. Virol. 64, 2474-2483 (1990)
STANDARD simple staff_entry
BASE COUNT 169 a 82 c 120 g 156 t
ORIGIN
1 aatccaatag gtagattatt aattactatg ttaagaaatg aatcattatc ttttagtact
61 atttttactc aaattcagaa gttagaaatg ggaatagaaa atagaaagag acgctcagcc
121 tcagttgaag aacaggtgca agagctaacg gccgcaggcc tagaagtaaa aaagggaaag
181 aagagtgtgc ttgtcaaaat aggagacagg tggtggcaac cagggactta taggggacct
241 tacatctaca gacggttagc ttggcttccc agggttcaaa ctgttcttaa aacaaggatg
301 tgagacaagt gcgcaacggt tagcttggct tcccagggtt caaactgttc ttaaaacaag
361 gatgtgagac aagtggtttc ctgagttggt ttggtatcaa atgttctaat ctgagctctt
421 agtgttctat tttcctatgt tcttttggaa tctatccaag tcttatgtaa atgcttatgt
481 aaaccatgat ataaaagagt gctgattttt ttgagtaaac ttgcaac
//
LOCUS PFDSYN 85 bp ds-DNA SYN 14-AUG-1990
DEFINITION Bacteriophage fd synthetic G2 promoter region DNA.
ACCESSION M24953
KEYWORDS promoter.
SOURCE Synthetic DNA.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 85)
AUTHORS Ovchinnikov,Y.A., Efimov,V.A. and Chakhmakcheva,O.G.
TITLE Synthesis of a polynucleotide corresponding to the promoter region
of bacteriophage fd DNA
JOURNAL FEBS Lett. 100, 341-346 (1979)
STANDARD simple staff_entry
FEATURES from to/span description
mRNA < 1 14 G2 mRNA
BASE COUNT 34 a 17 c 15 g 19 t
ORIGIN
1 aaatcagtct ttaccctgtc tattatcgtc agaagcaaag cgaattgcat caaaaagatt
61 aagaggaagc ccgaaagact tcaaa
//
LOCUS PRMSPCA 200 bp ds-DNA BCT 14-AUG-1990
DEFINITION P.vulgaris spc r-protein operon L24 gene, 3' end, and L5 gene, 5'
end.
ACCESSION M36264
KEYWORDS r-protein.
SOURCE P.vulgaris DNA.
ORGANISM Proteus vulgaris
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 200)
AUTHORS Mattheakis,L.C., Vu,L. and Nomura,M.
TITLE Nucleotide sequence of the L24/L5 intercistronic region of Proteus
vulgaris, Serratia marcescens and Salmonella typhimurium
JOURNAL J. Mol. Biol. 204, 326-329 (1988)
STANDARD simple staff_entry
REFERENCE 2 (sites)
AUTHORS Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M.
TITLE Translational regulation of the spc operon in Escherichia coli:
Identification and structural analysis of the target site for S8
repressor protein
JOURNAL J. Mol. Biol. 204, 309-329 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 57 r-protein L24
pept 71 > 200 r-protein L5
BASE COUNT 64 a 36 c 47 g 53 t
ORIGIN
1 ttcgaagacg gcaaaaaagt tcgtttcttc aagtctaata aagaaactat caagtaattt
61 ggagtatacg atggcgaaac tgcatgatta ctacaaagac gaggtagtcc aaaaactgat
121 gtctcagttt ggttaccatt ctgtcatgca agtccctcgg gtcgagaaga tcaccctgaa
181 tatgggtgtt ggtgaagcga
//
LOCUS PSCTETR 1110 bp ds-DNA BCT 14-AUG-1990
DEFINITION Plasmid pSC101 tetracycline resistance element repressor protein
(tetR) gene, complete cds.
ACCESSION M36272
KEYWORDS tetracycline resistance element repressor protein.
SOURCE Plasmid pSC101 DNA.
ORGANISM Plasmid pSC101
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 1110)
AUTHORS Brow,M.A.D., Pesin,R. and Sutcliffe,J.G.
TITLE The tetracycline repressor protein of pSC101
JOURNAL Mol. Biol. Evol. 2, 1-12 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept 42 701 tetracycline resistance element repressor
protein (tetR)
BASE COUNT 235 a 298 c 351 g 226 t
ORIGIN
1 aagcttatcg atgataagct gtcaaacatg agaattcgcg aatgaacaag ctccaacgcg
61 aggccgtgat ccgaaccgcg ctcgaactgc ttaacgacgt gggcatggaa ggtctaacga
121 cgcgccgact ggctgagcgc ctcggggtgc aacagccagc gctctactgg catttcaaga
181 acaagcgtgc gttgctcgac gcacttgccg aagccatgct gacgataaat cacacgcatt
241 cgacgccaag ggatgacgac gactggcgtt cgttcctgaa gggcaatgca tgcagttttc
301 gacgggcgtt gctcgcttat cgcgatggcg cgcgtattca tgccgggacg cggccagccg
361 cgccgcagat ggaaaaagcc gacgcgcagc ttcgcttcct ttgcgatgct ggcttttcgg
421 caggtgacgc gacctatgcg ttgatggcaa tcagctactt caccgtcggc gctgttcttg
481 agcagcaagc tagcgaggca gacgccgagg agcggggcga agatcagttg accacctcag
541 cgtctacgat gccggcgcgc ctacagagcg cgatgaaaat cgtctacgaa ggcggtccgg
601 acgcggcatt cgagcgaggc ctggctctca tcatcggcgg tcttgaaaaa atgaggctca
661 ctacgaacga cattgaggtg ctgaagaatg ttgacgaatg acagggggcg gcaggtgcgg
721 agggcgcggt tgcttcgtca tatgaagcaa agtcacctag ctgaattaat gggtgtggat
781 caggcaaccg tgtcgcgctg ggagcggggc acccttgcat tgtcggatgg gaggtggtca
841 gcggttcttc aattgcttac cgggccttcc gattcatcgt acgacgctgc gctgaagcgt
901 ctggtgcaat cctccgccca caaagtccat ctggtagcga ccggacacat tgtttgctcg
961 cggcatctcc ggccaggcaa agggaattgc ggattgacct agccgaactc cttggtaaat
1021 cgctgcgtgt ttatgcgtcc ccgagatagt tgcggccgac tctgcgctta atgggctcgg
1081 ttggcatgag gggcggctgg ggtcactcga
//
LOCUS PTVCOMPL 359 bp ss-RNA VRL 14-AUG-1990
DEFINITION Potato spindle tuber viroid complete sequence.
ACCESSION M36163
KEYWORDS .
SOURCE Potato spindle tuber viroid RNA.
ORGANISM Potato spindle tuber viroid
Viridae; Nonclassified viruses.
REFERENCE 1 (bases 1 to 359)
AUTHORS Gross,H.J. and Riesner,D.
TITLE Viroids: A class of subviral pathogens
JOURNAL Angew Chem. Int. Ed. Engl. 19, 231-243 (1980)
STANDARD simple staff_entry
BASE COUNT 73 a 108 c 102 g 76 t
ORIGIN
1 cggaactaaa ctcgtggttc ctgtggttca cacctgacct cctgagcaga aaagaaaaaa
61 gaaggcggct cggaggagcg cttcagggat ccccggggaa acctggagcg aactggcaaa
121 aaaggacggg ggggagtgcc cagcggccga caggagtaat tcccgccgaa acagggtttt
181 cacccttcct ttcttcgggt gtccttcctc gcgcccgcag gaccacccct cgcccccttt
241 gcgctgtcgc ttcggctact acccggtgga aacaactgaa gctcccgaga accgcttttt
301 ctctatctta cttgcttcgg ggcgagggtg tttagccctt ggaaccgcag ttggttcct
//
LOCUS RATCBFB 1415 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Rat CCAAT binding transcription factor-B subunit (CBF-B) mRNA,
complete cds.
ACCESSION M34238
KEYWORDS CCAAT binding transcription factor-B subunit;
heteromeric mammalian transcription factor; nuclear protein;
transcription activator.
SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
pCBF-B1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1415)
AUTHORS Maity,S.N., Vuorio,T. and de Crombrugghe,B.
TITLE The B subunit of a heteromeric rat CCAAT-binding transcription
factor shows a striking sequence identity with the yeast Hap2
transcription factor
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5378-5382 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.N.Maity, 11-MAY-1990, for release after publication.
FEATURES from to/span description
pept 171 1196 CCAAT binding transcription factor-B subunit
(CBF-B)
BASE COUNT 371 a 371 c 417 g 256 t
ORIGIN
1 cagcggcagt ggcggcggcg cggcggctgg agcctctgat tgggtttcgg ggtccggtac
61 tggagccaat cagcgcgggc agcgaaccgg gggagcgagg cacggagtcc ctacctggca
121 gctgcctggg atctgtagag tgaagcttca ggacacaacg tggccgggcc atggagcagt
181 atacagcaaa tagcaatagt tccacagagc agatcgtggt gcaggctggc cagattcagc
241 agcagcagca gggtggtgtc actgctgtcc agctgcagac tgaggcccag gtggcatccg
301 cctcaggcca gcaagtccag accctccagg tagttcaggg gcagccatta atggtgcaag
361 tcagtggagg ccagctgatt acatcaactg gccaacccat catggtacag gctgtgcctg
421 gtggacaagg ccaaactatc atgcaagtac ctgtgtctgg aacacaaggt ttacagcaga
481 tacagttggt accccctgga cagatccaga tccagggcgg gcaggctgtg caggtgcaag
541 gccagcaggg acagacccag cagatcatca ttcagcagcc acagaccgcg gtcaccgctg
601 gccagactca gacacaacaa cagattgctg tccagggaca gcaagtggcc cagactgctg
661 aagggcagac tattgtctac cagccagtta atgcagatgg cacaatcctc cagcaaggca
721 tgatcaccat cccagcagcc agtttggcag gggcacagat cgttcagaca ggagccaata
781 ccaacacaac cagtagtgga caagggactg tcactgtgac actgccagtg gcagggaatg
841 tggtcaactc aggaggaatg gtcatgatgg taccaggggc tggctctgtg cctgctatcc
901 aaagaatccc tttacctggg gcagagatgc tggaagaaga gccgttgtat gtgaatgcca
961 aacagtatca ccgcatcctt aagaggagac aagcgagggc caagctagag gcagaaggga
1021 agatcccaaa ggaaagaagg aaatacctcc atgagtctcg gcaccgacac gccatggcac
1081 ggaagcgtgg ggaagggggc cgcttcttct ctccaaaaga aaaggacagt cctcacatgc
1141 aggatccaaa ccaagctgat gaagaagcca tgacacagat catccgagtt tcctaaccac
1201 aggagggtgg cagagcggat gagtcacagt cctctccact gttcctggaa atggactgac
1261 ttccagttgg gcactgacag tctcactctg ccctttctca ggacagaaac tacttagctc
1321 agtattacag ctgcagtgat ggctgacaaa ctgaagaggc aagcttttgt ctctcccttt
1381 cattgaggac ctgttcagac tgttggtgac attga
//
LOCUS RATRTG2 902 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Rat thyroglobulin (rTg-2) mRNA, complete cds.
ACCESSION M35965
KEYWORDS thyroglobulin.
SOURCE Rat (strain Fisher) thyrocyte, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 902)
AUTHORS Graves,P.N. and Davies,T.F.
TITLE A second thyroglobulin messenger RNA species (rTg-2) in rat
thyrocytes
JOURNAL Mol. Endocrinol. 4, 155-161 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 732 thyroglobulin (rTg-2)
mRNA < 1 902 thyroglobulin mRNA
BASE COUNT 239 a 216 c 234 g 213 t
ORIGIN
1 atgatgacct tggtcttgtg ggtctcgact ttgttgagct cagtctgcct ggtagcagcc
61 aacatctttg agtaccaagt ggatgcacag ccactccgcc cctgtgagct gcaaagggag
121 aaggcctttc tgaagcagga tgaatatgtt ccacagtgct ctgaagatgg aagtttccag
181 acagttcagt gccaaaatga tggccaatct tgctggtgtg tggattctga tggcacggaa
241 gtacctggca gcaggcagct gggaaggcca acagcatgtc tgtccttctg ccagctacac
301 aagcaacgaa tcctgctgag cagctacatt aacagcacag atgctctcta ccttcctcag
361 tgccaggatt cagggaacta tgcaccagtg cagtgcgacc tgcagcaagt gcaatgctgg
421 tgtgtagaca cagaggggat ggaggtgtat ggcacccgcc agcaggggag gccaacacgg
481 tgtccaagga gctgtgagat aagaagccgc cgtctcctcc atggagtagg agacaaatcg
541 ccaccccagt gcgatgcaga tggggagttt atgcctgtcc agtgcaagtt tgtcaacacc
601 acagacatga tgatttttga tctgattcac aactacaaca gcaacaaacc tcacctcaaa
661 tgccctagta catctgttat actgtggcaa gagtcaagag cacctttaac atgtgctgaa
721 gaatcatcct aattggaata tagaagacta tggggaaatc ctgagaggca gatcgtctag
781 tcacctgcaa gaccacctat ggtggccaat ttgacttgtg tgtattgtta tgctgtgtat
841 agctttctgc caaagactgc ctgccctggc ccaggagaga ataaacacac tgttaattta
901 tt
//
LOCUS SMASPCA 201 bp ds-DNA BCT 14-AUG-1990
DEFINITION S.marcescens spc operon r-protein L24 gene , 3' end, and L5 gene,
5' end.
ACCESSION M36265
KEYWORDS r-protein.
SOURCE S.marcescens DNA.
ORGANISM Serratia marcescens
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 201)
AUTHORS Mattheakis,L.C., Vu,L. and Nomura,M.
TITLE Nucleotide sequence of the L24/L5 intercistronic region of Proteus
vulgaris, Serratia marcescens and Salmonella typhimurium
JOURNAL J. Mol. Biol. 204, 326-329 (1988)
STANDARD simple staff_entry
REFERENCE 2 (sites)
AUTHORS Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M.
TITLE Translational regulation of the spc operon in Escherichia coli:
Identification and structural analysis of the target site for S8
repressor protein
JOURNAL J. Mol. Biol. 204, 309-329 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 57 r-protein L24
pept 72 > 201 r-protein L5
BASE COUNT 65 a 40 c 46 g 50 t
ORIGIN
1 ttcgaagacg ctaaaaaagt ccgtttcttc aagtctaaca gcgaaactat caagtaattt
61 ggagtaatac gatggcgaaa ctgcatgatt actacaaaga cgaggtagtc aaacaactga
121 tgtctcagtt tgattacaac tctgtcatgc aagtccctcg ggtcgagaag atcaccctga
181 acatgggtgt tggtgaagcg a
//
LOCUS STYSPCA 201 bp ds-DNA BCT 14-AUG-1990
DEFINITION S.typhimurium spc operon r-protein L24 gene, 3' end, and L5 gene,
5' end.
ACCESSION M36266
KEYWORDS r-protein.
SOURCE S.typhimurium DNA.
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 201)
AUTHORS Mattheakis,L.C., Vu,L. and Nomura,M.
TITLE Nucleotide sequence of the L24/L5 intercistronic region of Proteus
vulgaris, Serratia marcescens and Salmonella typhimurium
JOURNAL J. Mol. Biol. 204, 326-329 (1988)
STANDARD simple staff_entry
REFERENCE 2 (sites)
AUTHORS Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M.
TITLE Translational regulation of the spc operon in Escherichia coli:
Identification and structural analysis of the target site for S8
repressor protein
JOURNAL J. Mol. Biol. 204, 309-329 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 57 r-protein L24
pept 72 > 201 r-protein L5
BASE COUNT 67 a 38 c 46 g 50 t
ORIGIN
1 ttcgaagacg gtaaaaaagt ccgtttcttc aagtctaaca gcgaaactat caagtaattt
61 ggagtagtac gatggcgaaa ctgcatgatt actacaaaga cgaagtagtt aacaaactca
121 tgactgagtt taactacaat tctgtcatgc aagtccctcg ggtcgagaag atcaccctga
181 acatgggtgt tggtgaagcg a
//
LOCUS GCOADDA 1771 bp ds-DNA PLN 14-AUG-1990
DEFINITION G.verrucosa 18S ribosomal RNA gene.
ACCESSION M33638
KEYWORDS 18S ribosomal RNA.
SOURCE G.verrucosa (isolate Norway) male tetrasporangial DNA.
ORGANISM Gracilaria verrucosa
Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae;
Florideophycideae; Gracilariaceae; Gracilariaceae.
REFERENCE 1 (bases 1 to 1771)
AUTHORS Bird,C.J., Rice,E.L., Murphy,C.A., Liu,Q.Y. and Ragan,M.A.
TITLE Nucleotide sequences of the 18S ribosomal RNA gene from the red
algae Gracilara tikvahiae McLachlan, Gracilaria verrucosa (Hudson)
Papenfuss and Gracilariopsis sp
JOURNAL Nucleic Acids Res. 18, 4023-4024 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.A.Ragan, 08-APR-1990. Atlantic Res. Lab., National Research
Council of Canada, 1411 Oxford St., Halifax, Nova Scotia, CANADA
B3H 3Z1
BASE COUNT 454 a 366 c 498 g 453 t
ORIGIN
1 caacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc
61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct
121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccaaa
181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt
241 tttttctgat cgcaattatt gcgacgcacc gttcaaattt ctgacctatc aactttggat
301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg
361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc
421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta
481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc
541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc
601 tcgtagtcgg attttggtgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg
661 cctttgtgga tgggagctag gtggtgctta attggatcac ctagctgccg ccaccgttta
721 ctgtgaaaaa aatagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata
781 atagaatagg acccggtcct attttgttgg tttgcttgaa tcgggtaatg attaagaggg
841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg
901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga
961 tcgaagacga tcagataccg tcgtagtctt tactataaac gatgaggact ggagatcgga
1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg
1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg
1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg
1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt
1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag
1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc
1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc
1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat
1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa
1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct
1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat
1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt
1741 ttccgtaggt gaacctgcag aaggatcaga a
//
LOCUS GRO18SAA 1782 bp ds-DNA PLN 14-AUG-1990
DEFINITION Gracilariopsis sp. 18S ribosomal RNA gene.
ACCESSION M33639
KEYWORDS 18S ribosomla RNA.
SOURCE Gracilariopsis sp. (isolate England-1) diploid tetrasporophyte DNA.
ORGANISM Gracilariopsis sp.
Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae;
Florideophycideae; Gracilariaceae; Gracilariaceae.
REFERENCE 1 (bases 1 to 1782)
AUTHORS Bird,C.J., Rice,E.L., Murphy,C.A., Liu,Q.Y. and Ragan,M.A.
TITLE Nucleotide sequences of the 18S ribosomal RNA gene from the red
algae Gracilara tikvahiae McLachlan, Gracilaria verrucosa (Hudson)
Papenfuss and Gracilariopsis sp
JOURNAL Nucleic Acids Res. 18, 4023-4024 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.A.Ragan, 08-APR-1990.
BASE COUNT 443 a 372 c 515 g 452 t
ORIGIN
1 caacctggtt gatcctgcca gtggtatatg cttgtctata ggactaagcc atgcaagtgc
61 aagtatgagt gtattataca acgaaactgc gaatggctct gtaaaacagc tataatttct
121 tcggtgctaa atactactcg gataaccgta gtaattctag agctaatacg tgcctacaca
181 gcgacgcaag tcgtggtaca aattagatac acaagccagc ttttttggtg attctagatt
241 ttttttctga tcgcacattg ttgcgacgca ccgttcaaat ttctgaccta tcaactttgg
301 atggtaaggt aatggcttac catggttgtg acgggtaacg gaccgtgggt gcgggactcc
361 ggagagggag cctgagagac ggctaccaca tccaaggaag gcagcaggcg cgcaacttac
421 ccaatccgga caccgggagg tagtgacaag aaatagcaat agagggcccg atgggttttc
481 taattggaat gagaacaaga taaacagctt atcgaggagc cagcagaggg caagtctggt
541 gccagcagcc gcggtaattc cagctctgta agcgtatacc aaagttgttg cagttaaaac
601 gctcgtagtc gaattttggc gtgtgatttg ggtcgttctc gcggacgcgc tcaggttgtg
661 cgcctttgtg gagagggggt gtggtggtgc ttgagtgcgc tgccatgctg ccgccaccgt
721 ttactgtgaa aaaattagag tgttcaaagc aggcgattgc cctgaataca ttagcatgga
781 ataatagaat aggacccggt tctattttgt tggtttgttt gaatcaggta atgattaaca
841 gggacggttg ggggcattcg tattccggcg ctagaggtga aattcttgga ttgtcggaag
901 acgaacagct gcgaaagcgt ctgccaagga cgttttcatt gatcaagaac gaaagtaagg
961 ggatcgaaga cgatcagata ccgtcgtagt ctttactata aacgatgagg actggagatc
1021 ggacaagact gatttatggc ttgttcggca tccttcgaga aatcaaagtg tttgctttct
1081 ggggggagta tggtcgcaag gctgaaactt aaaggaattg acggaagggc atcaccgggt
1141 gtggagcctg cggcttaatt tgactcaaca cgggaaaact taccaggtca ggacatagta
1201 aggattgaca gattgagagc tctttcttga ttctatggtt ggtggtgcat ggccgttctt
1261 agttggtgga gtgatctgtc tggttaattc cgttaacgag cgagacctgg gcgtgctagc
1321 taggcgccgt taccaattga gtttggtagc gaggcggtgc cttcctagac ggactgtggg
1381 cgtctagccc acggaagctc caggcaataa caggtctgag atgcccttag atgtcctggg
1441 ccgcacgcgt gctacactga acggcccaac gagtggagga tgtgcgagag catttccgaa
1501 tctccaaatc cgttcgtgat ggggatcgac ggttgcaatt ttccgtcgtc aacgaggaat
1561 accttgtaag cgcgggtcat catcccgcgc tgaatacgtc cctgcccttt gtacacaccg
1621 cccgtcgctc ctaccgattg agtggtccgg tgaggccttg ggagggctgg atgcgctgtt
1681 tttatgcaga cattttggcc tgaacctggt caaaccttat cacttagagg aaggagaagt
1741 cgtaacaagg tttccgtagg tgaacctgca gaaggatcag aa
//
LOCUS REO1SIG1 1463 bp ds-RNA VRL 14-AUG-1990
DEFINITION Reovirus type 1 sigma-1 protein (S1) gene, complete cds.
ACCESSION M35963
KEYWORDS cell-attachment protein; hemagglutinin; sigma-1 protein.
SOURCE Reovirus type 1 (strain Lang) RNA.
ORGANISM Reovirus type 1
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 1463)
AUTHORS Nibert,M.L., Dermody,T.S. and Fields,B.N.
TITLE Structure of the reovirus cell-attachment protein: A model for the
domain organization of sigma-1
JOURNAL J. Virol. 64, 2976-2989 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 14 1426 sigma-1 protein
BASE COUNT 426 a 291 c 370 g 376 t
ORIGIN
1 gctattcgcg cctatggatg catctctcat tacagagata cggaaaatag tactccaact
61 atctgtatca agcaatggct cccagtcaaa agaaatcgag gaaatcaaga aacaagtcca
121 ggtcaacgtt gatgatatca gggctgccaa tattaaactc gacggacttg gaagacagat
181 tgctgacatc agcaatagca tctcaaccat tgagtcaaga ttgggtgaga tggataatcg
241 acttgtgggt atctcgagtc aggtcacgca attatctaac tcagttagcc agaacactca
301 gagcatatcc tcattgggtg acagaatcaa tgctgtcgaa ccacgagttg acagtctgga
361 tacggtcacg tctaatctca ctggacgaac atccactttg gaggcagatg ttggaagctt
421 acggacagaa ctagcagcgc taacaacacg ggtgacaact gaggttacaa ggttagatgg
481 tctaatcaat agtggccaga attcgattgg tgagctatcc acaagactat ccaatgtgga
541 gacgtctatg gtgacgacgg ctggacgggg actgcagaaa aacggaaaca ccttgaacgt
601 cattgtaggt aatggaatgt ggtttaatag ttctaatcaa ttgcagctcg acctttcggg
661 gcaatcaaaa ggggtgggat ttgtcggcac aggaatggtg gttaagattg atactaatta
721 ttttgcttac aatagtaatg gagagattac attggtgagt caaatcaatg aattgccatc
781 gcgcgtatca acactggaat cagcgaaaat cgattcagtt ttacctccat taaccgtacg
841 cgaagcgagc ggcgtacgta ccctgagctt tggttatgat acgagcgatt ttacaatcat
901 caactccgta ctgtcgttac ggtcacgttt gactcttccg acatacaggt accctctgga
961 gctcgacaca gcaaataata gagtgcaggt ggcagatcgt tttggcatgc gcacgggtac
1021 ttggacggga caattgcaat atcagcaccc acaattgagt tggagagcaa atgtcacttt
1081 gaatttgatg aaggtggatg attggttggt gttgagcttt tctcagatga cgactaactc
1141 aataatggca gatgggaaat ttgtgattaa ttttgtgtct gggttatctt ctggatggca
1201 gacgggggat actgaaccat cgtcaactat tgatccatgg tctacgacat ttgccgcggt
1261 ccaatttcta aataacggtc aacgcattga tgcgtttagg atcatgggag tatcggaatg
1321 gacggatgga gaattagaga ttaagaatta tggtggcaca tacaccggtc atactcaagt
1381 atattgggct ccgtggacga tcatgtatcc atgcaatgtg aggtgaatct agcgcgaacc
1441 ctcggcacaa ggggtcaatc atc
//
LOCUS REO2SIG1 1440 bp ds-RNA VRL 14-AUG-1990
DEFINITION Reovirus type 2 sigma-1-s protein (S1) gene, complete cds.
ACCESSION M35964
KEYWORDS cell-attachment protein; hemagglutinin; sigma-1-s protein.
SOURCE Reovirus type 2 (strain Jones) RNA.
ORGANISM Reovirus type 2
Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE 1 (bases 1 to 1440)
AUTHORS Nibert,M.L., Dermody,T.S. and Fields,B.N.
TITLE Structure of the reovirus cell-attachment protein: A model for the
domain organization of sigma-1
JOURNAL J. Virol. 64, 2976-2989 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
pept 14 1402 sigma-1-s protein
BASE COUNT 384 a 314 c 383 g 359 t
ORIGIN
1 gctattcgca ctcatgtcgg atctagtgca gctcataaga agggagatct tactgttaac
61 tgggaatgga gaatcagcca actcgaaaca cgagatcgag gaaattaaga aacaaattaa
121 agacatctct gctgatgtca acaggatcag taacatcgtt gattcaatcc aaggacaact
181 gggtggatta tctgtacgcg tgtcagccat tgaatcggga gttagtgaga acggcaatcg
241 aattgataga ctcgagcgag atgtctccgg catatcggct agcgttagcg gaatcgattc
301 gcgtttatcc gagctgggtg accgagtcaa tgttgcagaa cagcgaattg gccagttgga
361 tacagtcacg gataatctcc ttgagcgagc atcaagactg gaaactgaag tatcagccat
421 tactaatgac cttggatcat tgaatacgag ggtgacgact gaattgaacg atgtccgcca
481 aactattgct gcgatagaca cgcgtctcac gacactggag accgatgccg tgacgtcggt
541 tggtcaaggg cttcagaaga ctgggaactc gattaaggtt attgtgggta cggggatgtg
601 gttcgaccgc aataatgttc tgcagttatt cgtatcgaac cagcagaaag ggttgggatt
661 catagacaat ggaatggtag tgaaaataga tacccagtat ttcagcttcg atagcaatgg
721 caacataact ctgaacaaca acataagtgg tctgccggcg cgaacaggtt ccctcgaggc
781 atctcgtatc gatgtggtag cgccaccgct tgtgatacag tctactggta gcactcggct
841 actgcgtctc atgtacgagg ctgtggactt cgtggttact aacaacgttc tcacactgag
901 aaatcgatcg gtcacgccaa cattcaagtt tcctctggag ttgaatagtg ctgataactc
961 agtgagcatt catagaaatt accgcattag acttgggcaa tggtcaggtc aattggaata
1021 tcacacgccg agtttgcgtt ggaatgctcc cgtcacggtt aatttgatgc gagtagacga
1081 ttggctcatt ttgagtttta ctcggttttc gacgagcggc atcttagcgt caggaaagtt
1141 tgtattgaac ttcgtaactg gtttgtctcc agggtgggcg actgggagta ccgagccctc
1201 gacaactact aacccactgt caacgacgtt tgctgcaatt cagttcatca atgggtcatc
1261 tcgcgtagac gcctttagaa tcttgggagt cgcagagtgg aatgccgggg aactagagat
1321 cacgaattat ggcggaacat atacagcgca taccaatgtc gactgggcgc cgatgaccat
1381 tatgtaccca tgtctgggct gaggatccgg gtgctccact cggcacagtg gcgactcatc
//
LOCUS MARCMYC 100 bp ds-DNA ROD 14-AUG-1990
DEFINITION Woodchuck mutant c-myc gene, intron A and breakpoint region.
ACCESSION M30385
KEYWORDS c-myc oncogene.
SOURCE Woodchuck (individual W64, chronically infected with woodchuck
hepatitis virus [WHV]) hepatocellularcarcinoma (HCC) DNA.
ORGANISM Marmota monax
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae;
Marmotini.
REFERENCE 1 (bases 1 to 100)
AUTHORS Moeroey,T., Marchio,A., Etiemble,J., Trepo,C., Tiollais,P. and
Buendia,M.-A.
TITLE Rearrangement and enhanced expression of c-myc in hepatocellular
carcinoma of hepatitis virus infected woodchucks
JOURNAL Nature 324, 276-279 (1986)
STANDARD simple staff_review
FEATURES from to/span description
IVS < 1 > 100 c-myc intron A
recomb 48 49 cellular DNA of unknown function end/c-myc DNA
start
BASE COUNT 24 a 26 c 29 g 21 t
ORIGIN 3 bp upstream of BGlII site.
1 ggagatctca ggaagtaagg cactcagaga gacagtgggc gattcgtagg attccccacg
61 gcgattccaa ctcgcccccg ctctttttaa gaagttggcg
//
LOCUS MARWHV1 120 bp ds-DNA ROD 14-AUG-1990
DEFINITION Woodchuck DNA/woodchuck hepatitis virus DNA, 5' junction.
ACCESSION M23563
KEYWORDS c-myc oncogene.
SEGMENT 1 of 2
SOURCE Woodchuck (individual W74) hepatocellular carcinoma DNA.
ORGANISM Marmota monax
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae;
Marmotini.
REFERENCE 1 (bases 1 to 120)
AUTHORS Hsu,T.-y., Moeroey,T., Etiemble,J., Louise,A., Trepo,C.,
Tiollais,P. and Buendia,M.-A.
TITLE Activation of c-myc by woodchuck hepatitis virus insertion in
hepatocellular carcinoma
JOURNAL Cell 55, 627-635 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 6 c-myc oncogene
recomb 92 95 woodchuck DNA end/virus DNA start
BASE COUNT 41 a 23 c 25 g 31 t
ORIGIN
1 gcctaagttg aactgttgga gggaggaact ggaatctcct aaattctcac tccttactaa
61 gggaaagcaa ggaaaaaggt tccttctgac tgaaggcata aaaaaccatt atcctaattt
//
LOCUS MARWHV2 40 bp ds-DNA ROD 14-AUG-1990
DEFINITION Woodchuck DNA/woodchuck hepatitis virus DNA, 3' junction.
ACCESSION M23564
KEYWORDS c-myc oncogene.
SEGMENT 2 of 2
SOURCE Woodchuck (individual W74) hepatocellular carcinoma DNA.
ORGANISM Marmota monax
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae;
Marmotini.
REFERENCE 1 (bases 1 to 40)
AUTHORS Hsu,T.-y., Moeroey,T., Etiemble,J., Louise,A., Trepo,C.,
Tiollais,P. and Buendia,M.-A.
TITLE Activation of c-myc by woodchuck hepatitis virus insertion in
hepatocellular carcinoma
JOURNAL Cell 55, 627-635 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
recomb 15 16 virus DNA end/woodchuck DNA start
BASE COUNT 5 a 14 c 7 g 14 t
ORIGIN About 900 bp after segment 1.
1 agggcagcat tgatctacgc ttcctgcttc cctctttctc
//
LOCUS BSUPAIA 1367 bp ds-DNA BCT 14-AUG-1990
DEFINITION B.subtilis negative regulator of protease synthase and sporulation
(pai) genes, complete cds.
ACCESSION M36471
KEYWORDS regulatory protein.
SOURCE B.subtilis (strain DB104) DNA, clone pPAI121.
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1367)
AUTHORS Honjo,M., Nakayama,A., Fukazawa,K., Kawamura,K., Ando,K.,
Hori,M. and Furutani Y,
TITLE A novel Bacillus subtilis gene involved in negative control of
sporulation and degradative-enzyme production
JOURNAL J. Bacteriol. 172, 1783-1790 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 100 618 negative regulator pai 1
pept 640 1263 negative regulator pai 2
BASE COUNT 504 a 205 c 283 g 375 t
ORIGIN
1 tttttggaat ggtaacaagg tcagtatgtg ttccgtgttt aaatatataa aagggacaca
61 agacaattga attttaaata aaggggctga atccgaacaa tgagtgtaaa aatgaaaaaa
121 tgcagccggg aagatttaca aacacttcaa caattgagta ttgaaacatt caatgacatt
181 tttaaagaac agaactcacc tgaaaatatg aaagcctatt tagaaagcgc atttaacact
241 gagcagctgg aaaaagagtt atctaatatg tcttcgcaat tcttttttat ttactttgat
301 catgaaatcg ctggatatgt aaaggtcaat atcgatgatg ctcagtctga agaaatgggt
361 gctgaatcac ttgaaatcga gagaatttat ataaagaaca gctttcaaaa acatgggctt
421 ggcaaacatc tgctgaataa agcgatagaa attgcgctgg aacgtaataa aaagaacatt
481 tggctaggtg tgtgggaaaa aaatgaaaat gccattgcct tttataagaa aatggggttt
541 gttcagaccg gcgcccactc attttatatg ggtgatgaag aacaaacgga tttaatcatg
601 gctaaaacac tcatataata tttttcgaag gggatgatga tgtatattcc aaaatatttt
661 aaagtcacaa atgctgaaga aatttggaat tttgtacaag aaaactcctt tggtacggtt
721 gtcacaacag aacaaggaaa accaattgcc acacatttgc cgttaggatt caataaaaaa
781 gacgatcatt actatattac ggggcatttc gcttatggaa atcctcagtg gagaacgttt
841 gaagcctgtg aagatgtgct tgtgatgttt cagggaccgc acgcttatat ttcttcttcc
901 tggtattcgc gcgaaaatgt tccgacgtgg aattatcagg ctgtccacat gtatggaaaa
961 gcaagtatgt tagagaaaga tgaattagca gaagaattaa caatcatgtt ggagaaatac
1021 gaaaaacatc gtgacaatcc tgtgttatgg gataagcttt ctccaaaact cttagaaagt
1081 gaattgaaag gtattgttgg atttaagatt aaggtggaag acattcaagc ggcatataaa
1141 ttaagccaga accgaaatga aacggattat atgaacgtca ttgaacagtt gcaaaatgaa
1201 gaaaatccaa atgcaaaaca aatggcagaa ctgatggaag ataaactcaa aaagcaaata
1261 taagcttact ttgctgaaga agtagcagaa aacgccaact caagcggaga accctggcac
1321 ttggcgtttt tttatgatta atatttatcc cattaaaaat ggaattc
//
LOCUS HUMA2MGL 2041 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human alpha-2-macroglobulin mRNA, 3' end.
ACCESSION M36501
KEYWORDS alpha-2-macroglobulin.
SOURCE Human liver, cDNA to mRNA, clone pha2ml.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2041)
AUTHORS Bell,G.I., Rall,L.B., Sanchez-Pescador,R., Merryweather,J.P.,
Scott,J., Eddy,R.L. and Shows,T.B.
TITLE Human alpha-2-macroglobulin gene is located on chromosome 12
JOURNAL Somat. Cell Mol. Genet. 11, 285-289 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 1932 alpha-2-macroglobulin (AA at 1)
/hgml_locus_uid="LH0012N" /nomgen="A2M"
/map="12p13.3-p12.3"
BASE COUNT 561 a 528 c 481 g 471 t
ORIGIN
1 cccgccttcc tagctgtccc agtggagaag gaacaagcgc ctcactgcat ctgtgcaaac
61 gggcggcaaa ctgtgtcctg ggcagtaacc ccaaagtcat taggaaatgt gaatttcact
121 gtgagcgcag aggcactaga gtctcaagag ctgtgtggga ctgaggtgcc ttcagttcct
181 gaacacggaa ggaaagacac agtcatcaag cctctgttgg ttgaacctga aggactagag
241 aaggaaacaa cattcaactc cctactttgt ccatcaggtg gtgaggtttc tgaagaatta
301 tccctgaaac tgccaccaaa tgtggtagaa gaatctgccc gagcttctgt ctcagttttg
361 ggagacatat taggctctgc catgcaaaac acacaaaatc ttctccagat gccctatggc
421 tgtggagagc agaatatggt cctctttgct cctaacatct atgtactgga ttatctaaat
481 gaaacacagc agcttactcc agagatcaag tccaaggcca ttggctatct caacactggt
541 taccagagac agttgaacta caaacactat gatggctcct acagcacctt tggggagcga
601 tatggcagga accagggcaa cacctggctc acagcctttg ttctgaagac ttttgcccaa
661 gctcgagcct acatcttcat cgatgaagca cacattaccc aagccctcat atggctctcc
721 cagaggcaga aggacaatgg ctgtttcagg agctctgggt cactgctcaa caatgccata
781 aagggaggag tagaagatga agtgaccctc tccgcctata tcaccatcgc ccttctggag
841 attcctctca cagtcactca ccctgttgtc cgcaatgccc tgttttgcct ggagtcagcc
901 tggaagacag cacaagaagg ggaccatggc agccatgtat ataccaaaga cctgctggcc
961 tatgcttttg ccctggcagg taaccaggac aagaggaagg aagtactcaa gtcacttaat
1021 gaggaagctg tgaagaaaga caactctgtc cattgggagc gccctcagaa acccaaggca
1081 ccagtggggg atttttacga accccaggct ccctctgctg aggtggagat gacatcctat
1141 gtgctcctcg cttatctcac ggcccagcca gccccaacct cggaggacct gacctctgca
1201 accaacatcg tgaagtggat cacgaagcag cagaatgccc agggcggttt ctcctccacc
1261 caggacacag tggtggctct ccatgctctg tccaaatatg gagcagccac atttaccagg
1321 actgggaagg ctgcacaggt gactatccag tcttcaggga cattttccag caaattccaa
1381 gtggacaaca acaaccgcct gttactgcag caggtctcat tgccagagct gcctggggaa
1441 tacagcatga aagtgacagg agaaggatgt gtctacctcc agacatcctt gaaatacaat
1501 attctcccag aaaaggaaga gttccccttt gctttaggag tgcagactct gcctcaaact
1561 tgtgatgaac ccaaagccca caccagcttc caaatctccc taagtgtcag ttacacaggg
1621 agccgctctg cctccaacat ggcgatcgtt gatgtgaaga tggtctctgg cttcattccc
1681 ctgaagccaa cagtgaaaat gcttgaaaga tctaaccatg tgagccggac agaagtcagc
1741 agcaaccatg tcttgattta ccttgataag gtgtcaaatc agacactgag cttgttcttc
1801 acggttctgc aagatgtccc agtaagagat ctgaaaccag ccatagtgaa agtctatgat
1861 tactacgaga cggatgagtt tgcaattgct gagtacaatg ctccttgcag caaagatctt
1921 ggaaatgctt gaagaccaca aggctgaaaa gtgctttgct ggagtcctgt tctcagagct
1981 ccacagaaga cacgtgtttt tgtatcttta aagacttgat gaataaacac tttttctggt
2041 c
//
LOCUS HUMPPA1CO 1500 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human alpha-1 collagen type I mRNA, 5' end.
ACCESSION M36546
KEYWORDS alpha-1 collagen type I.
SOURCE Human, cDNA to mRNA, clone pHUCI.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1500)
AUTHORS Tromp,G., Kuivaniemi,H., Stacey,A., Shikata,H., Baldwin,C.T.,
Jaenisch,R. and Prockop,D.J.
TITLE Structure of a full-length cDNA clone for the prepro-alpha1(I)
chain of human type I procollagen
JOURNAL Biochem. J. 253, 919-922 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 84 > 1500 alpha-1 collagen type I
/hgml_locus_uid="LG0047H" /nomgen="COL1A1"
/map="17q21.3-q22"
sigp 84 149 alpha-1 collagen type I signal peptide
matp 567 > 1500 alpha-1 collagen type I
matp 618 > 1500 alpha-1 collagen type I alpha chain
BASE COUNT 257 a 488 c 488 g 267 t
ORIGIN
1 ggcacgcgga gtgtgaggcc acgcatgagc ggacgctaac cccctcccca gccacaaaga
61 gtctacatgt ctagggtcta gacatgttca gctttgtgga cctccggctc ctgctcctct
121 tagcggccac cgccctcctg acgcacggcc aagaggaagg ccaagtcgag ggccaagacg
181 aagacatccc accaatcacc tgcgtacaga acggcctcag gtaccatgac cgagacgtgt
241 ggaaacccga gccctgccgg atctgcgtct gcgacaacgg caaggtgttg tgcgatgacg
301 tgatctgtga cgagaccaag aactgccccg gcgccgaagt ccccgagggc gagtgctgtc
361 ccgtctgccc cgacggctca gagtcaccca ccgaccaaga aaccaccggc gtcgagggac
421 ccaagggaga cactggcccc cgaggcccaa ggggacccgc aggcccccct ggccgagatg
481 gcatccctgg acagcctgga cttcccggac cccccggacc ccccggacct cccggacccc
541 ctggcctcgg aggaaacttt gctccccagc tgtcttatgg ctatgatgag aaatcaaccg
601 gaggaatttc cgtgcctggc cccatgggtc cctctggtcc tcgtggtctc cctggccccc
661 ctggtgcacc tggtccccaa ggcttccaag gtccccctgg tgagcctggc gagcctggag
721 cttcaggtcc catgggtccc cgaggtcccc caggtccccc tggaaagaat ggagatgatg
781 gggaagctgg aaaacctggt cgtcctggtg agcgtgggcc tcctgggcct cagggtgctc
841 gaggattgcc cggaacagct ggcctccctg gaatgaaggg acacagaggt ttcagtggtt
901 tggatggtgc caagggagat gctggtcctg ctggtcctaa gggtgagcct ggcagccctg
961 gtgaaaatgg agctcctggt cagatgggcc cccgtggcct gcctggtgag agaggtcgcc
1021 ctggagcccc tggccctgct ggtgctcgtg gaaatgatgg tgctactggt gctgccgggc
1081 cccctggtcc caccggcccc gctggtcctc ctggcttccc tggtgctgtt ggtgctaagg
1141 gtgaagctgg tccccaaggg ccccgaggct ctgaaggtcc ccagggtgtg cgtggtgagc
1201 ctggcccccc tggccctgct ggtgctgctg gccctgctgg aaaccctggt gctgatggac
1261 agcctggtgc taaaggtgcc aatggtgctc ctggtattgc tggtgctcct ggcttccctg
1321 gtgcccgagg cccctctgga ccccagggcc ccggcggccc tcctggtccc aagggtaaca
1381 gcggtgaacc tggtgctcct ggcagcaaag gagacactgg tgctaaggga gagcctggcc
1441 ctgttggtgt tcaaggaccc cctggccctg ctggagagga aggaaagcga ggagctcgag
//
LOCUS MACTRIIS1 280 bp ds-DNA PRI 14-AUG-1990
DEFINITION M.mulatta triosephosphate isomerase gene, exon 1.
ACCESSION M37561 M35724
KEYWORDS triosephosphate isomerase.
SEGMENT 1 of 7
SOURCE M.mulatta DNA.
ORGANISM Macaca mulatta
Unclassified.
REFERENCE 1 (bases 1 to 280)
AUTHORS Old,S.E. and Mohrenweiser,H.W.
TITLE Nucleotide sequence of the triosephosphate isomerase gene from
Macaca mulatta
JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 156 + 270 triosephosphate isomerase, exon 1 (E.C.
5.3.1.1)
IVS 271 > 280 triosephosphate isomerase intron A
signal 46 49 CAAT box
signal 93 98 TATA box
BASE COUNT 50 a 77 c 111 g 42 t
ORIGIN
1 gcaggagggc gggcgggggg cagggtctcc gggggaccgg tcgggccatg gcggaggacg
61 gcgaggaggc ggagttccac ttctcggcgc tctatataag cgggcagtgg cccggctgcg
121 cgcagacact gaccttcagc gtctcgctcc atgccatggc gccttccagg aagttcttcg
181 ttggggggaa ctggaagatg aacgggcgga agcagaatct gggggagctc atcggcactc
241 tgaacgcggc caaggtgccg gcagacaccg gtaagcctcg
//
LOCUS MACTRIIS2 144 bp ds-DNA PRI 14-AUG-1990
DEFINITION M.mulatta triosephosphate isomerase gene, exon 2.
ACCESSION M37562 M35724
KEYWORDS triosephosphate isomerase.
SEGMENT 2 of 7
SOURCE M.mulatta DNA.
ORGANISM Macaca mulatta
Unclassified.
REFERENCE 1 (bases 1 to 144)
AUTHORS Old,S.E. and Mohrenweiser,H.W.
TITLE Nucleotide sequence of the triosephosphate isomerase gene from
Macaca mulatta
JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 134 triosephosphate isomerase, exon 2 (E.C.
5.3.1.1)
IVS < 1 10 triosephosphate isomerase intron A
IVS 135 > 144 triosephosphate isomerase intron B
BASE COUNT 33 a 38 c 39 g 34 t
ORIGIN
1 ttgccctcag aagtggtttg tgctcccccc actgcctata tcgacttcgc ccggcagaag
61 ctagatccca agattgctgt ggctgcacag aactgctaca aagtgactaa tggggccttt
121 actggggaga tcaggtgaga tcga
//
LOCUS MACTRIIS3 105 bp ds-DNA PRI 14-AUG-1990
DEFINITION M.mulatta triosephosphate isomerase gene, exon 3.
ACCESSION M37563 M35724
KEYWORDS triosephosphate isomerase.
SEGMENT 3 of 7
SOURCE M.mulatta DNA.
ORGANISM Macaca mulatta
Unclassified.
REFERENCE 1 (bases 1 to 105)
AUTHORS Old,S.E. and Mohrenweiser,H.W.
TITLE Nucleotide sequence of the triosephosphate isomerase gene from
Macaca mulatta
JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 95 triosephosphate isomerase, exon 3 (E.C.
5.3.1.1)
IVS < 1 10 triosephosphate isomerase intron B
IVS 96 > 105 triosephosphate isomerase intron C
BASE COUNT 23 a 23 c 35 g 24 t
ORIGIN
1 cttcctttag ccctggcatg atcaaagact gcggagccac gtgggtggtc ctggggcact
61 cagagagaag acatgtcttt ggggagtcag atgaggttag tagcc
//
LOCUS MACTRIIS4 153 bp ds-DNA PRI 14-AUG-1990
DEFINITION M.mulatta triosephosphate isomerase gene, exon 4.
ACCESSION M37564 M35724
KEYWORDS triosephosphate isomerase.
SEGMENT 4 of 7
SOURCE M.mulatta DNA.
ORGANISM Macaca mulatta
Unclassified.
REFERENCE 1 (bases 1 to 153)
AUTHORS Old,S.E. and Mohrenweiser,H.W.
TITLE Nucleotide sequence of the triosephosphate isomerase gene from
Macaca mulatta
JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 143 triosephosphate isomerase, exon 4 (E.C.
5.3.1.1)
IVS < 1 10 triosephosphate isomerase intron C
IVS 144 > 153 triosephosphate isomerase intron D
BASE COUNT 39 a 29 c 49 g 36 t
ORIGIN
1 ttctcaacag ctgattgggc agaaagtggc ccatgctctg gctgagggac tcggagtaat
61 cgcctgtatt ggggagaagc tagatgaaag ggaagctggc atcactgaga aggttgtttt
121 cgagcagaca aaggtcatcg caggtatctc ttg
//
LOCUS MACTRIIS5 102 bp ds-DNA PRI 14-AUG-1990
DEFINITION M.mulatta triosephosphate isomerase gene, exon 5.
ACCESSION M37570 M35724
KEYWORDS triosephosphate isomerase.
SEGMENT 5 of 7
SOURCE M.mulatta DNA.
ORGANISM Macaca mulatta
Unclassified.
REFERENCE 1 (bases 1 to 102)
AUTHORS Old,S.E. and Mohrenweiser,H.W.
TITLE Nucleotide sequence of the triosephosphate isomerase gene from
Macaca mulatta
JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 7 + 92 triosephosphate isomerase, exon 5 (E.C.
5.3.1.1)
IVS < 1 6 triosephosphate isomerase intron D
IVS 93 > 102 triosephosphate isomerase intron E
BASE COUNT 26 a 25 c 31 g 20 t
ORIGIN
1 ctgcagataa tgtgaaagac tggagcaagg ttgtcctggc ctatgagcct gtgtgggcca
61 ttggtactgg caagactgca acgccccaac aggtaaccga cg
//
LOCUS MACTRIIS6 108 bp ds-DNA PRI 14-AUG-1990
DEFINITION M.mulatta triosephosphate isomerase gene, exon 6.
ACCESSION M37571 M35724
KEYWORDS triosephosphate isomerase.
SEGMENT 6 of 7
SOURCE M.mulatta DNA.
ORGANISM Macaca mulatta
Unclassified.
REFERENCE 1 (bases 1 to 108)
AUTHORS Old,S.E. and Mohrenweiser,H.W.
TITLE Nucleotide sequence of the triosephosphate isomerase gene from
Macaca mulatta
JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 + 98 triosephosphate isomerase, exon 6 (E.C.
5.3.1.1)
IVS < 1 10 triosephosphate isomerase intron E
IVS 99 > 108 triosephosphate isomerase intron F
BASE COUNT 26 a 24 c 32 g 26 t
ORIGIN
1 cttgttctag gcccaggaag tacatgagaa gctccgagga tggcttaagt ccaacgtctc
61 tgaagcagtg gctcagagca cccgtatcat ttatggaggt gagtggct
//
LOCUS MACTRIIS7 632 bp ds-DNA PRI 14-AUG-1990
DEFINITION M.mulatta triosephosphate isomerase gene, exon 7.
ACCESSION M37572 M35724
KEYWORDS triosephosphate isomerase.
SEGMENT 7 of 7
SOURCE M.mulatta DNA.
ORGANISM Macaca mulatta
Unclassified.
REFERENCE 1 (bases 1 to 632)
AUTHORS Old,S.E. and Mohrenweiser,H.W.
TITLE Nucleotide sequence of the triosephosphate isomerase gene from
Macaca mulatta
JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept + 11 129 triosephosphate isomerase, exon 7 (E.C.
5.3.1.1)
IVS < 1 10 triosephosphate isomerase intron F
BASE COUNT 122 a 201 c 154 g 155 t
ORIGIN
1 cccttcccag gctctgtgac tggggcaacc tgcaaggagc tggccagcca gcctgacgtg
61 gatggcttcc ttgtgggtgg tgcttccctc aagcccgaat tcgtggacat catcaatgcc
121 aaacaatgag ccccatccac cttccctacc cttcctgcca agccagggac taagcagccc
181 agaagcccag tcactgcccc ttccccgcac atgcttctga tggtgtcgcc tgctccttcc
241 tgtggcctca tccaaactgt accttccttt actatcttca gcctgtaatg gttgggacca
301 ggccaatccc ttctccactt actataatgg ttggaactaa atgtcaccaa ggtggcttct
361 cctcggctga gagatggaag gggtgggatt tgctcctggg ttccctaggc cctagtgagg
421 gcaggagaga agccatcctc tcccttctta cacagtgagg ccgagatcct cccctccgaa
481 ggcaggagtg ccgccctctc ccatggtgcc catgcctgtg tgctgtgtac gtgaaccacc
541 cacatgtgag ggaataaaca cctagcacta ggtcttgtgg tctgtctgcc ttcactggac
601 ttgcccagat aatcttcctt tttgaggcag ct
//
LOCUS MUS2NF3MK 2470 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse zinc finger protein (mkr3) mRNA, 3' end.
ACCESSION M36514
KEYWORDS mkr3 gene; zinc finger protein.
SOURCE Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 15.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2470)
AUTHORS Chowdhury,K., Rohdewohld,H. and Gruss,P.
TITLE Specific and ubiquitous expression of different Zn finger protein
genes in the mouse
JOURNAL Nucleic Acids Res. 16, 9995-10011 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 1287 zinc finger protein (mkr3; AA at 1)
BASE COUNT 695 a 501 c 488 g 786 t
ORIGIN
1 ataaaaactc acactggtga gaagcccttc cggtgtaaag tatgtgcaag gacttttaga
61 aattcctcat gccttaagac taactttcga attcacactg gaataaaacc atacaaatgt
121 aattactgtt ggaaagcctt cactgcacgc tcaggcctta ctaagcatgt actaattcac
181 aatggtgaga agccctatga gtgtaaggag tgtgggaaag ccttcagtac atcttccggc
241 cttgttgaac atataagaat tcatacagga gagaagccct ttgaatgtta ccagtgtggg
301 aaagccttgg tccattcctc atcccttgtt ggacatttaa gaactcacac tggagagaaa
361 ccctttgagt gtaatcagtg tgacaaaact tttacacgat cttcttatct tcgtattcat
421 atgcgaactc acaccggaga gaaaccatat gaatgtaaag agtgtgggaa aactttccct
481 gagcgctcat gccttactaa acacataagg acacacactg gtgaaaggcc ctatgaatgt
541 aaggagtgtg acaaaggctt tattagcttt gctcagctta ctgtacacat aaaaactcat
601 agttctgaga gaccctttca gtgtaaggta tgcacaaaat cttttagaaa ctcttcatcc
661 cttgagaccc acttccgaat tcacactgga gtaaaaccct ataaatgcag ttactgtggg
721 aaagccttca ctgctcgttc aggccctacc atccatttac gcaatcacac tggggagaaa
781 tcctatgcat gccaagagtg tggaaaggcc tttagcactt cctcaggcct tattgcacat
841 ataagaagtc acaaaggaga gaaacccttt gaatgtgacc actgtgggaa ggcctttcct
901 tcttcatctt atcttaatgt gcatttgaaa attcacactg gggaaaaacc ctttcggtgt
961 acggtatgtg ggaaaacatt tacgtgttct tcttaccttc cggttcacat gcgaactcac
1021 actggaggga ggccttttcg gtgtataata tgtggaaggt catttttgtg gtcctcgtac
1081 cttagagttc acatgcgaat tcacactgga gagaaaccct atgtatgtca gtactgtgga
1141 aaagctttta cagagcactc aggccttaat aaacatttac ggaaacacac aggagagaaa
1201 ccatatgaat ataaggaatg tggggaaaac ttcactactt ctgctgatgc taatgaacat
1261 gaaactcccc actggggaga cctttgaatg taagggatca gaaaattgat agccgttgac
1321 catcatttgc aatcaaaatt tatagtctat acaagtctta ttgataagca acgtataaaa
1381 aacattcggt tgctatgtgc tgtaaaaaca actcattctg aggatgctct gagtttaagg
1441 attgtgggag aaatcaaaat ttcatgatat atgtatgaac tgaaggtttg taatataggg
1501 aacgaaaacc tttagtgttt ctatgagtgt tgatgcaaga catggagaaa tgtttctatg
1561 ttcacgtgtg gtctcacagt tcgagaaatg tctttctgaa gtttgggaag taactttaca
1621 tttgaacaca agctacatgc gtgtaagagg ggatttgctt cagtgtcttg gatggcaaca
1681 cttggctttc ttgggtgctt ctctatgaac tttgaatatt tctccatttg tgtattttta
1741 tgacctcaat tccattgtta gttttccttt ttaaaatatt cctgtttgat gtgattgaaa
1801 cttgttttta tctagtatct gttatatgtc ccatgttggc attgtgaacc taatgacaca
1861 aactcccatc tcctcactac ttgttagttt atttgttgtt gtttttttcc cctcttttct
1921 ttggagacag tctcactttc tggcctagta taacatgaaa gttatgtagc ctagactggt
1981 ccctagctca tggcagtcta gacccagcct tctgagtgca tagattacag gtgcaagcca
2041 ccatacctgc tctttctact gttttaagta taaacttcac tagcactgaa aaaaccatca
2101 tgttttacaa ttgtgttcag gttcttaaca caagtaggtt tggatctgac caatacaatc
2161 ccttactctg agttctaagt ctttggtagc cattccattc tctgcttgga gttcagcttt
2221 tctttaggct ttgcttctta gatcatgtag tatctgtatt aactctactt agctttttta
2281 tttttgtgac agggtcttgc tgtagctcaa actgacctca aactcgcaaa cctttttgcc
2341 tcagacttac tgctacagtt ataggcatgt gccttcatgc ctagctggcg tatatcaatc
2401 tacataatgt tctcaaggtg gaactgtgtt atggtcagtt attttaagac atctaattat
2461 cctttggcct
//
LOCUS MUS2NF4MK 1258 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse zinc finger protein (mkr4) mRNA, partial cds.
ACCESSION M36515
KEYWORDS mkr3 gene; zinc finger protein.
SOURCE Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 14.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1258)
AUTHORS Chowdhury,K., Rohdewohld,H. and Gruss,P.
TITLE Specific and ubiquitous expression of different Zn finger protein
genes in the mouse
JOURNAL Nucleic Acids Res. 16, 9995-10011 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 1258 zinc finger protein (mkr4; AA at 1)
BASE COUNT 356 a 335 c 321 g 246 t
ORIGIN
1 aaagtccttt ttccaagtat cttctctgtt caggcaccgg agatccacac tggagaaaag
61 ctctacgact gtagccattg tgggaaaggc ttctcttaca actcagacct caggatacat
121 cagaagatcc atacaggaga gaaacgccat ggctgcgtcg actgtggcaa agccttcaca
181 caaaagtcca cactgaggat gcaccagaag atccacacgg gcgagagggc ctatgtctgt
241 attgagtgtg gacaggcttt tatccagaag acacacctgg tcgcacaccg aagaattcac
301 tcaagagtga ggccctgtgt atctctcgac cgtgcgaaac ccttcagcag cgctcccaac
361 ctccttccgc gtaagaaagt tcaaatgaga gagaaatcgt ccatctgcgc ggagtgtggg
421 aaggccttta cctacaggtc ggagctaatc attcatcaga gaactcacac gggagagaag
481 ccttatcagt gcggggactg tggcaaagcc ttcacccaga agtcagccct caccgtgcac
541 cgaagaatcc acacggggga gaagtcgtat gtgtgtgtga agtgcgggct agccttcgtg
601 caaagggcac acttggatgc acaccaggtg attcacactg gagaaaaacc ttaccagtgt
661 ggtcattgtg ggaaattctt cacttccaag tcgcaactcc acgtgcacaa gcggattcac
721 acgggggaaa aaccctatgt gtgcagtaac tgtgggaagg cctttgccaa caggtcaaat
781 ctcatcacac atcagaaaac tcatacaggg gagaaagcct atgtctgtgc aaggtgtggg
841 aaagcattca ctcagaggtc agaacctgtt acacaccaga ggatacatac cggagagaag
901 ccatatgggt gtagaccctg cgggaaagcc tttacccaga aatcacacct cagtatacac
961 gagaagattc acaccggaga gagacagtac ggatgccgag actgtgggaa agccttcaac
1021 cagaaatcca tactcatcgt gcaccagaaa atccacacgg gggagaaacc gcaggtgtgt
1081 gccgagtgcg ggcgagcttt catccgcaag tccaacttca tcacccatca gcgcatccat
1141 actggcgaga aaccttatgg atgcaccgac tgcggcaagt cctttacctc caagtcgcag
1201 ctcctagtgc accggccaat ccacaccggc gagaaacctt acgtgtgtgc cgagtgtg
//
LOCUS MUS2NF5MK 2433 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Mouse zinc finger protein (mkr5) mRNA, 3' end.
ACCESSION M36516
KEYWORDS mkr3 gene; zinc finger protein.
SOURCE Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 10.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2433)
AUTHORS Chowdhury,K., Rohdewohld,H. and Gruss,P.
TITLE Specific and ubiquitous expression of different Zn finger protein
genes in the mouse
JOURNAL Nucleic Acids Res. 16, 9995-10011 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 1845 zinc finger protein (mkr5; AA at 1)
BASE COUNT 752 a 516 c 536 g 629 t
ORIGIN
1 cagcagtctg tacaggaagc ccaggatctg cttccaaggc aggactcaca tgctgaacgg
61 gtaacaggca gaacttggag cactaaactt gagtgctcca ctttcagaga tcaggattct
121 gagtgtacgt ttgaaaggaa tgagcaagag acagtcactc cgaacagagc cttctcagag
181 gggagagatg gtatgtgtat cgaatctgga agatggtttc atttgaacag ttcagacgag
241 agaagtcata actgtgactc aggtaaaagt ttctcctcaa atccagtagt tgtaaaggaa
301 accggaatct gttcaggaaa aaaacttttc caatgtaatg aatgtaagaa aactttcacc
361 cagagctcct cccttaccgt tcaccagaga attcatactg gagagaagcc ctataaatgc
421 aatcagtgtg ggaaggcctt cagtgacggg tcctccttcg cacggcacca gaggtaccat
481 acaggcaaga agccgtacga gtgcccggag tgtggcaaag ctttcataca gaacacctcg
541 ctcgttcgtc actggaggta ctatcacact ggggagaaac ccttcgattg catcgactgt
601 gggaaagcct tcagtgacca catagggctt aatcaacaca ggaggattca cactggagag
661 aaaccataca cgtgtgaagt gtgtcacaaa tcctttaggt acggctcatc cctcactgtg
721 caccaaagga ttcatactgg agagaaaccg tacgagtgtg agatttgcag aaaagccttc
781 agccaccatg catccctcac tcagcatcaa cgagtgcatt ctggagaaaa gccttttaaa
841 tgtaaagagt gtgggaaagc ttttaggcag aatatacacc ttgctagtca ctggaggatc
901 cataccgggg agaagccctt tgagtgtggg gaatgtggga aatctttcag catcagctca
961 cagcttgcca cgcatcagag aattcataca ggagagaagc cctttgagtg taaggtttgt
1021 aggaaagctt ttaggcagaa tatacacctt gctagtcact ggagaatcca taccggggag
1081 aagccctttg agtgtgggga atgtgggaaa tctttcagca tcagctcaca gcttgccacg
1141 catcagagaa ttcatacagg agagaagcct tatgaatgta aggtttgtag gaaagcattt
1201 acgcagaagg ctcaccttgc acaacatcag aagactcata cgggggagaa gccatacgag
1261 tgcaaggaat gtggcaaggc cttcagccag accactcacc tcattcaaca ccagagggtt
1321 catactggag agaagcccta taaatgcctg gaatgtggga aggcctttgg tgataactca
1381 tcctgtactc aacatcagag acttcacact ggcctgaggc cttatgagtg tgtggagtgt
1441 ggaaagacat tcaagaccaa gtcatccctt atctgttatt gtagatgtca cacgggagag
1501 aaaccttatg agtgcagtgc atgtggtaaa gcctttagcc actggcagtc ccttagtgta
1561 catcagagaa ttcattcggg gaagaaaccg tatgaatgta aggaatgtcg gaaaaccttc
1621 atccagattg gacaccttaa ccaacataag agagtccata ccggggagag aacgtacaac
1681 tacaagaagg gcaggagggc cttcaggcag actgcacact ttgctcacca tcagcagatt
1741 cattccggaa agtcacctgc tcaccactct ttgccttcca catcgaatcc tgtggatctc
1801 ttctccaaat tcgtctggaa tccatcctca ctgccatcat cataacctca tattttcacc
1861 acacttgttt atcaacttgt ctctttgccc ctttgttcat tcttgtccct tatgagtcct
1921 tcatattaca accaggttgg attatttttg gagtgtaaat gtaattaatt tgctcattta
1981 aagcttatat tcactgggga aaaaaaaaga agttagctga aacataaaga aacctcatga
2041 atctaagaaa tgttccatat cttgctctgg gtagtatatc tgtaaatctt agcaagccct
2101 gtatgtagta ttttaagaac tttctgactt tgttatcctt taatttttaa tgctgttata
2161 ttaagaaaaa cacatttata ataaatatgg tattttaaat aaaaagtatt atatcttata
2221 gctatttttc agataaacaa aaacaaattt gatacaaagt aaaatatgta taaggttggt
2281 gttccagaac aaaatctgca ggaatattta cagcatggta taaagcaacg aaaccaggtc
2341 atccaccaac attttgcctt agattttcta atatttcagc atatgagtag cacagtatta
2401 aaacactata ggtagtagca tttcacagaa ttc
//
LOCUS ORCHRCOM 1579 bp ds-DNA MAM 14-AUG-1990
DEFINITION O.orca highly repetitive element DNA.
ACCESSION M36451
KEYWORDS repetitive element.
SOURCE O.orca DNA.
ORGANISM Orcinus orca
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Odontoceta; Delphinidae; Orcininae.
REFERENCE 1 (bases 1 to 1579)
AUTHORS Widegren,B., Arnason,U. and Akusjaervi,G.
TITLE Characteristics of a conserved 1,579-bp highly repetitive component
in the killer whale, Orcinus orca
JOURNAL Mol. Biol. Evol. 2, 411-419 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rpt 1 1579 highly repetitive element
BASE COUNT 368 a 358 c 333 g 520 t
ORIGIN
1 atcttatgat gtttcttttt ttgtgtgact tatttcatgt agaatcatcg tacctgaatc
61 cactcattat gctgctacgg gcctgatgac atagatttca ttgctgagtg atattgcatt
121 gtacgtaagt accacaaagt ctttatccat ttttcgcttt ctgcgatact gaacttgtac
181 cataaacgag gttcttgtaa acagagccgt cacaaacttt gggatggctg tgtctttttc
241 attttaattt caactaagct ataggaccat aagtggaagt gccctaggct ctgttgcttt
301 gttgtttaga tgtttcagga aacaccatac acttctccag agtggctgtt ggcaatttac
361 atcccgccca tcagcataac aaggctccca attctccatg gcctgtcccg cctttctgga
421 ttttacactt ttttcagatg gcccttttga ccgtggggca gtgagacttc atgtagtgca
481 gatttccttt gcaagcttgc ttggttggcc aaaaagttcg tatgcgtttc tttctgaata
541 tattcaggaa aaaacgcata agaccttttt ggccaagtgc atcattgtgg acgttctgcc
601 tgttttccta tgctttacat gcaattccag tctacctcct gaaatcggtg tctggcaatt
661 ttgccccgct ttcaagtcct cttggcagcc ttacttcaat atatttttgg acgatagctg
721 tcatttataa ctctgcaggt ttgtgaatca cagtgcccct gagctccttt cttcaactgg
781 ctttcttgtg agctggctgc aacaccgcag gattgcttga ggccctcgtg tggttccggc
841 agggcacgct gagccttttg ttaattcctc ttcctggtgg gaaatgaggg ttaaatttgc
901 ccgtccagac acctccagct agtctctcat tgattctccc tattcctgtt cattttccgc
961 agaaattgca aactgggcca aacaggaggt taaaggcact gactctccaa gtggggagag
1021 tgttagtaaa gcgtctggaa tcttgcaccc gagtaccagg ggacgaaaac tgagacacat
1081 ttgaacacgt ttcccgatca cacggtggat catactctgg gttccacatg catgttttag
1141 ctgaaggaag aatcccttaa acctggagag ttgagaccca tggaatgggt accatgcaat
1201 atgacttcaa agggtctgta tttgctaact gaaactcact aatcctatca ctgctgcgtt
1261 tatgccactg tacacacgct tgattccctt tcggagacat ataagtccac aggttttaag
1321 attcttacta gtcaggtata ttcttaggag tttaatatgg agtgttgagt ccactcgatc
1381 agcaaggagt agctcttgtc tattaaatat ttggcttatg gaacggtatc tgtgctgatt
1441 tcaatctctg gttttatgca gcacctcaac tcacctttcc ccttaagcaa gcataagttg
1501 ctgttctaca tttgagaccc tgttctgttt tgtaattcag ttcctgtgta gccaagttta
1561 cattccgtgt agtagtgat
//
LOCUS RICADH2A 3779 bp ds-DNA PLN 14-AUG-1990
DEFINITION Rice alcohol dehydrogenase (adh2) gene, complete cds.
ACCESSION M36469
KEYWORDS adh2 gene; alcohol dehydrogenase.
SOURCE Rice DNA.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 3779)
AUTHORS Xie,Y. and Wu,R.
TITLE Molecular analysis of an alcohol dehydrogenase-encoding genomic
clone (adh2) from rice
JOURNAL Gene 87, 185-191 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 639 669 alcohol dehydrogenase (adh2), exon 1
768 904 alcohol dehydrogenase, exon 2
1017 1063 alcohol dehydrogenase, exon 3
1672 1998 alcohol dehydrogenase, exon 4
2114 2195 alcohol dehydrogenase, exon 5
2301 2376 alcohol dehydrogenase, exon 6
2471 2529 alcohol dehydrogenase, exon 7
2628 2723 alcohol dehydrogenase, exon 8
2854 3012 alcohol dehydrogenase, exon 9
3149 3247 alcohol dehydrogenase, exon 10
IVS 670 767 adh2 intron A
IVS 905 1016 adh2 intron B
IVS 1064 1671 adh2 intron C
IVS 1999 2113 adh2 intron D
IVS 2196 2300 adh2 intron E
IVS 2377 2470 adh2 intron F
IVS 2530 2627 adh2 intron G
IVS 2724 2853 adh2 intron H
IVS 3013 3148 adh2 intron I
BASE COUNT 1002 a 799 c 867 g 1111 t
ORIGIN
1 taatcattac caacagatat cttagaaaaa aaagctatta ttcctcaaaa ggaaaaaaaa
61 ggctattatt ccttacaaga aaaaagaaga aggcaatgca ttatgctttt ctaaatgtct
121 agattaattc atagttggac agatatgacc gcatatcaac gttaattact gccgatttga
181 ctgttgatta gacatatctt agcactaaac ctcactgaaa attcggagaa tttactattg
241 gagagaagaa aaggcatcca tcctccttgt cttcactctg caatctgcat catcatccgc
301 ggaaaccgga aaaccaccga agctcgaaaa ccggcggagg caagagcagc tagcagcatg
361 cagcagacgc agagccttcc tcttcttcca agcaaaaaat attcagatga cgcctcctcc
421 tctttccccc ctcctcccac cttgcttggt tcctttcagg cgtacttgcc cgcgagactc
481 ctaccactct ctgatcccta tatatacacc ccctcctccc ctctcgtctc ctcaccagaa
541 ttccaagcaa cgaactgcga gtgattcaag aaaaaagaaa acctgagctt tcgatctcta
601 cggagtggtt tcttgttctt tgaaaaagag ggggattaat ggcgacaggg aaggtgatca
661 agtgcaaagg tccgttcttg ctgttcttgg agcggttcgt gagaaaatta gtttgatgtt
721 ttgtggggag gagaagctga ttagatcggg ttttggggtt tttgcagcgg cggtggcatg
781 ggaggccggg gaagcgctgt cgatcgagga ggtggaggtt gcgccgccgc agcgcatgga
841 ggtccgcgtc aagatcctct acaccgccct ctgccacacc gacgtctact tctgggaggc
901 caaggtaatt aatcagcatc tgccatggct gagctttctc accttctcaa gcttgcaaga
961 atgatcagtg actaactaac ttggtgtgtg tgtggtttct gctttgatct tcgcaggggc
1021 aaacacctgt tttccctagg atcttgggcc atgaagctgg agggtatgtg taatctgaac
1081 caaccctgtc caattcttgt tcttcttcta ctacactgtt tagtggttca gtctattctg
1141 ttcttgtatt gttttttttt tgttgaaaat ggtttttggt gtttagtgga atttagtgtg
1201 tacttgtacg tcataggttg tttggtttat ggtattcggt tactaactat agttagtgtg
1261 cacgttgggg agtcctgatt agtcagtggc attttcatta agttcagtaa agaaatctac
1321 caagtgtcca attgtgcatg atattttacc tggtttggat aaatcatatg aaggaatcaa
1381 agaacagcgt cattgatgac atatacaacg acaattggat tcctaggata atatactttt
1441 cttcatgtgc tcctctatgc tagtgcttta ttactttctc tcctttgatg ttttgaggta
1501 ctcaaaggaa aaagacacac tgaatcatta tccatcttat gaactgttta aacttgactt
1561 acttatcact tctctgctca aatggacccc tttttttttt catttccatg gaagatatat
1621 gtgtaatcct aacaatcaaa ttgatctaac aaaacaattt tttgctcaca gcattgtgga
1681 gagtgtggga gagggtgtga ccgaactcgc cgcgggcgac catgtcctcc cggtgttcac
1741 cggcgagtgc aaggagtgtg atcactgcaa atcggaggag agcaacatgt gtgacctcct
1801 caggatcaac gtcgaccgcg gcgtcatgat cggcgacggc aagtcccgat tcaccatcaa
1861 ggggaagccc atcttccact ttgttggcac ttccaccttc agcgagtaca ccgtcatcca
1921 tgtcggctgc ctcgcgaaga tcaacccgga ggcgcctctc gacaaggtct gcattctcag
1981 ctgcggtttc tcgtccaggt aagatgatgg atggcttcat gattatgatg acgtagtagt
2041 tttcacagtg aaatttggaa aggtcttttt ttttttggat gcattaagtt gatgggtttt
2101 gctgttcttg caggtttggt gcgacagtga atgtcgcgaa accgaaaaag ggccagaccg
2161 tagctatttt cggtcttgga gctgtaggtc ttgctgtaat gggcacacgc acaaagatat
2221 attcatttcc atttgatttg cagattgttg agatcttgaa actcaaactg atgaatgttt
2281 gtaaaattac atgattatag gctatggaag gtgccaggct gtctggagca tcgaggatca
2341 ttggtgtgga cctgaaccct gcgaaattcg aacaaggtac aatcttttat catctttgag
2401 tgaaaattac atgtttggta tatgttctgg attgaacact gaactctgca acttgttttc
2461 atctcttcag ctaagaagtt tggctgcact gacttcgtaa accccaagga ccacagcaag
2521 ccagtgcagg tacatcacat tgcctcacta atttctctta ttatcacaca ctgttcatga
2581 catggagtgt acatacttaa acatcaactc tgcaatggtg atctcaggtg ctgattgaga
2641 tgaccaacgg tggactcgac agggccgttg aatgcacggg caacatcaac gccatgatat
2701 cctgcttcga atgtgtccat gatgtcagtc tctttcttac aaacatgaaa catgatagta
2761 acaaattgaa gatgatcatt tcagcaatgc ctccagttct gctgtaaact gtatcctgaa
2821 aactaagagt ggcaatctga aacctgaatc cagggatggg gcgttgcggt gctggtcggc
2881 gtgaaggatg atgtgttcaa gacccacccg atgaacttcc tcaacgagaa gacgctgaag
2941 gggacgttga tcttcggtaa ctacaagccg cgcaccgacc tgcccaacgt cgtcgagctg
3001 tacatgaaga aggtaaaaat aattgcagaa cagaatcaga tttgcaacct gaatgtgcat
3061 tgttcagaaa tctatggtca gtgattccaa tgagccactg attgatcact attttatgct
3121 aatgatttct ggtgtttgat gtctgaagga gctggagctg gagaagttca tcacgagcgt
3181 gccgttctcg gagatcaaca cggcgttcga tctcatgctc aaggggagag tctccgctgc
3241 gtcatgagta tgtatgagta gagatgatga tgatctatct ctctctctct ctctctgcag
3301 tttcagtgca tacaagaaac caaaaatgaa agcaaaaaat ggatgagaag tttgaggttt
3361 tccatttaag tttaataatg gtcgtgagac agggcttgtg tggagtcagc tctgtttact
3421 ccctcctttc atggcttgat cttgtttatt gtcagcattg tggcttagca gattttcacc
3481 agttttaccc tgtaaattag taccattctg atcaatcgta ataaactact agcagtgtga
3541 ggtactattg ctggaactga attgattgtg cagtaaagag gtgaatagca gccagtatga
3601 aggttttaca atttgaacat cgaaacgaaa tccacacgaa attcataaat ttcatcgcat
3661 cttcgtgaag cttaccagat tctatcgatt cgatctagct ccaataggtc gatgggagca
3721 tctattgatt ttttcgacag acgaaaattt tgcaagggac tagctaggga gttgtcgac
//
LOCUS YSCCTA1A 2785 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae catalase A (CTA1) gene, complete cds.
ACCESSION M36510
KEYWORDS catalase A.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2785)
AUTHORS Cohen,G., Rapatz,W. and Ruis,H.
TITLE Sequence of the Saccharomyces cerevisiae CTA1 gene and amino acid
sequence of catalase A derived from it
JOURNAL Eur. J. Biochem. 176, 159-163 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 820 2367 catalase A (CTA1)
BASE COUNT 917 a 529 c 571 g 768 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcttag aaggtgaaga aatagtacta gattcatatg attgttataa taatggtgaa
61 ttgttgcctc aactaatact tttggtccaa atcttgacaa ttctttgcca aattccaggt
121 ttatgcaaac tggacataaa agcaatggaa aggcaagtgg agagaattgt aaagaagtgt
181 ttacaattga tagaaggtgc ccgcgccact acaaactgta gtgccacatg gaaacgttgt
241 attatgaagc gtctagccga ttaccccata aaaaagtgcg tttctatcga aaaaccttcg
301 aaaggaaact cattaacaag ggaagaacta agagatgtta tggctcggag agttttgaaa
361 agcgaaatag attcgctgca agtttgtgaa gaaaccatcg acaagaatta caaggttatt
421 cctgatgaaa agctgctaac taatatttta aagagaaagt tgacagagga agaaaaaagc
481 tctgtcaaac gtccttgcgt gaagaagtga gcggttgttc taaccactat ttaaagccgc
541 aattagtaat gcaaaaagtt ggccggaatt agccgcgcaa gttggtgggg tcccttaatc
601 cgaaaaagga cggctttaac aaatataaac tccgaaaatc cccacagtga cagaattgga
661 gaaacaacca gttttgatat cgccatacat ataaagagat gtagaaagca ttcttcactg
721 taatgtccaa atcgtacatt tgaatttctt gtaggtttat ttaaaaggta agttaaataa
781 atataatagt acttacaaat aaatttggaa ccctagaaga tgtcgaaatt gggacaagaa
841 aaaaatgaag taaattactc tgatgtaaga gaggatagag ttgtgacaaa ctccactggt
901 aatccaatca atgaaccatt tgtcacccaa cgtattgggg aacatggccc tttgcttttg
961 caagattata acttaattga ttctttggct catttcaaca gggaaaatat tcctcaaagg
1021 aatccacatg ctcatggttc tggtgccttc ggctattttg aagtaaccga tgacattact
1081 gatatctgcg ggtctgctat gtttagtaaa attgggaaaa gaacgaaatg tctaacaaga
1141 ttttcgactg tgggtggtga taaaggtagt gccgacacgg ttcgtgatcc aagggggttt
1201 gccaccaaat tctacactga agaaggtaat ttagattggg tctacaataa tacaccggta
1261 ttctttatca gagacccttc caagttccct cactttatcc acacacagaa gagaaaccca
1321 caaaccaacc taagggatgc tgacatgttt tgggatttcc tcaccactcc tgaaaatcag
1381 gtggccattc atcaagtaat gatccttttt tcagaccgtg gtacccctgc caactaccgt
1441 agtatgcatg gttattctgg tcatacctat aaatggtcca ataaaaacgg agattggcat
1501 tatgtgcaag ttcatatcaa aaccgatcaa ggaataaaga atttgaccat agaagaggct
1561 accaaaattg cgggatccaa tccagattac tgccagcagg atttatttga ggctattcag
1621 aatggaaact atccttcctg gacagtttat attcaaacaa tgaccgaacg cgatgccaaa
1681 aaattaccat tttcagtctt tgatttgact aaagtatggc ctcaggggca attcccttta
1741 cggcgtgtgg gtaagattgt tttgaacgag aatccactga acttcttcgc acaggtggaa
1801 caagctgcct tcgcccccag taccacggtt ccttaccaag aagcaagcgc tgatccagta
1861 ttacaggccc gtttgttttc atatgcggat gctcatagat acaggctagg tcctaacttc
1921 catcaaatac ccgtaaactg tccatatgca tctaaatttt tcaatcccgc tatcagagat
1981 ggaccgatga atgttaacgg caacttcggc tcagaaccta catatttggc caacgataaa
2041 tcgtacacgt atatccaaca ggacagaccc attcaacaac accaagaggt atggaatggg
2101 ccagctatcc cttatcattg ggcaacatcc ccaggtgatg tagatttcgt gcaagcaaga
2161 aatctctacc gcgttttggg taaacaacct ggacagcaaa agaacttggc atataacatc
2221 ggcattcatg tagaaggcgc ctgtcctcaa atacagcagc gcgtttatga tatgtttgct
2281 cgtgttgata agggactatc tgaggcaatt aaaaaagtag ctgaggcaaa acatgcttct
2341 gagctttcga gtaactccaa attttgaaac gctcaagtaa caaatgagtg gcgttgtttc
2401 cacgacaatt atttatgata gtgtgtattt ttaacacatt ttatttatta caatttatgt
2461 attttgttat gaattattta tttatacgac taataggtga tgctcatatt ctcgtgttag
2521 aaagttaaaa aaattatcat ttcacacata ggaaagctcg tcgcgccggg gaaaaagctg
2581 aggaatctct attattaggg gtaaagttca acacattcag tatgagataa gtgtgtcttc
2641 aagagagatg cagcactgag tagggaacca agaaacgatg tctgaattac tagatagctt
2701 tgagacagag tttgcgaaat tttataccga cagcaatctg gaagagacaa accttcaaaa
2761 atgtcttgat catactcatg aattc
//
LOCUS YSCILV1A 2766 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae threonine deaminase (ILV1) gene, complete cds.
ACCESSION M36383
KEYWORDS ILV1 gene; threonine deaminase.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2766)
AUTHORS Kielland-Brandt,M.C., Holmberg,S., Petersen,J.G.L. and
Nilsson-Tillgren,T.
TITLE Nucleotide sequence of the gene for threonine deaminase (ILV1) of
Saccharomyces cerevisiae
JOURNAL Carlsberg Res. Commun. 49, 567-575 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 661 756 ORF1
pept 778 2508 threonine deaminase (ILV1)
BASE COUNT 797 a 569 c 510 g 890 t
ORIGIN
1 gaattcgttc tcattacatg gccgcccacc atgcaccttt acatttcgta tttctttcct
61 ctctgcgcta tatctttggg aaagatttgc aatcaaggca aacacccgct tgtccccttt
121 caattcttgt gttatttcaa aaaccatcct ttcataatat atttgtattt ctttcgccct
181 ggactctacc agttcatatt tagaaccttc cacatatcgt ccaattctgc agcccacatc
241 tttttccacc acgatacggg aaacagaatg ggtccttgga ttctcgctaa acagtctctc
301 tattcccctg ttcaaaaccc tcaagatatt tgtttcccgc aacagctgca attgcaattg
361 atcaatccta tgcgaaaatg ccgagtttat gttattcaag acgcatttta aaaaattcac
421 tagcggctcc ttgaaattca ttatgtctga tgaatatgaa aaccttttcc tgactaccaa
481 gactctttaa ctcttctctc tttattgcat attatctctg ctattttgtg acgttcaatt
541 ttaattgacg cgaaaaagaa aaaataagaa gggcaaaaag aaaaagcgca gcgggtagca
601 aatttggaat cgcataaaaa gaaaaaaaaa atatcaaaga aaaagagtca tctcaaacat
661 atgtctgcag atacttcatt atcagctttg aaaacttttg ttgttgctgc tttgagttct
721 ttcttgtgtg agtgctacaa gccacattta aactaagtca attacacaaa gttagtgatg
781 tcagctactc tactaaagca accattatgt acggttgttc ggcaaggtaa acagtccaaa
841 gtgtctggat tgaacctttt gagactaaag gctcatttgc acagacaaca cctgtcacct
901 tccttgataa aactacactc tgaattgaaa ttggatgagc tgcaaactga taacacccct
961 gattacgtcc gtttagtttt aaggtcctct gtatacgatg ttattaatga atctccaatc
1021 tctcaaggtg taggtttgtc ttcccgtcta aacacgaatg tcatcttgaa aagagaagat
1081 ctattgcctg ttttctcttt caagcttcgt ggtgcctata acatgattgc caagttggac
1141 gattctcaaa gaaaccaggg tgttattgcc tgttcagctg ggaatcatgc ccaaggtgtg
1201 gcctttgctg ctaaacactt gaaaatacct gctactatcg ttatgcctgt ttgtacacca
1261 tctattaagt atcaaaatgt ctcgagatta gggtctcaag tcgtcctata tggtaacgat
1321 tttgacgagg ctaaggctga atgtgccaaa ttggctgaag agcgtggctt gacgaacatt
1381 cctcctttcg atcatcctta tgtcattgcc ggtcaaggta ctgtagctat ggaaatccta
1441 agacaagtac gtaccgctaa taagatcggt gctgtctttg ttcccgtcgg cggtggtggt
1501 ttaattgctg gtattggtgc ttatttgaaa agggttgctc ctcatatcaa aactattggt
1561 gttgaaactt acgatgcggc cactttacat aattccttgc aacgcaacca gagaactcct
1621 ttacctgtgg tgggtacttt tgccgatggt acgtctgtgc gtatgattgg tgaagaaaca
1681 tttagagtcg cccaacaagt ggttgatgaa gttgttcttg ttaacactga cgaaatctgt
1741 gctgcagtaa aggatatttt tgaagatact agaagtattg tagaaccatc tggtgccctt
1801 tcagtagccg gtatgaagaa atacatctct accgtacatc cagaaattga ccacactaaa
1861 aacacctatg ttcccatcct ttctggtgct aacatgaact ttgatagatt aagatttgtt
1921 tccgaacgtg ctgttcttgg tgaaggaaag gaagtcttca tgttagttac tttacccgac
1981 gtccctggtg cgttcaagaa aatgcaaaag atcatccacc caagatctgt cactgaattc
2041 tcttaccgtt acaatgaaca tcgtcatgag tcctctagtg aagtgcccaa ggcttacatt
2101 tacacttctt tcagcgtcgt tgacagagaa aaggaaatca agcaagttat gcaacagttg
2161 aatgctttag gttttgaagc tgtggatatc tccgataacg aattggctaa atctcatggt
2221 agatacttgg ttggtggtgc ttctaaggtt cctaatgaaa gaattatttc atttgaattc
2281 cctgaaagac caggtgcctt gactaggttc cttggaggcc taagcgattc ttggaatctt
2341 actttattcc attatagaaa ccatggtgcc gatatcggta aggttttagc tggtatttcc
2401 gttcctccaa gggaaaactt aaccttccaa aaattcttgg aagatttagg ctacacttat
2461 catgatgaaa ctgataacac tgtttatcaa aaattcttga aatattaaaa ccgacaattt
2521 actttataaa tttacgcaac aacttgttag gtaaaatgaa cgtcgtcata atttaaatac
2581 tagacagcgc gtgtatgtta ctcatattta tactcatatc atgtgaacct ataaaccaat
2641 tcaatacaag atttacagtt ctcttgtttt cttcttccag ttttcccctt ttatttcctg
2701 tagaataaac ttaaacagtg atcattaatt tataattttc tgctttgtaa atacggcggt
2761 aatgga
//
LOCUS YSPRPS6A 1940 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.pombe ribosomal protein S6 (rps6) gene, complete cds.
ACCESSION M36382
KEYWORDS ribosomal protein S6.
SOURCE S.pombe DNA.
ORGANISM Schizosaccharomyces pombe
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1940)
AUTHORS Gross,T., Nischt,R., Gatermann,K., Swida,U. and Kaeufer,N.F.
TITLE Primary structure of the ribosomal protein gene S6 from
Schizosaccharomyces pombe
JOURNAL Curr. Genet. 13, 57-63 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 495 1214 ribosomal protein S6 (rps6) precursor
sigp 495 614 ribosomal protein S6 signal peptide
matp 615 1211 ribosomal protein S6
mRNA 478 1240 rps6 mRNA (alt.)
mRNA 478 1260 rps6 mRNA (alt.)
BASE COUNT 592 a 371 c 355 g 622 t
ORIGIN
1 gatcatctct atacgctagc gttttgttga gtaaataatt ctgttaatga tgtacaaacc
61 attgagttca gctagttatt taaaaaatgc aacgagtata tcttataaaa ttaagtcata
121 caagacatca atttcctcgt aacgatgata aatcacatct atggaaacat agcagactat
181 gactttaata ttaattacct ttctccataa attaaacgaa aattaaagat gagatgaaaa
241 gtgataatta tagtaacttg taagttatat tcaacagtaa aatcctaatt tccttttcaa
301 ttttctgatt cacaatgtat atattagcta cccgcgtgtt tgtttactaa aaatagtgca
361 ttaatcatcc atcgcaaaat atttataacg agagatataa agacttgtaa atctatccca
421 aattgtgact gctcttagag gagaaactta gctagtcaaa aatacaacat tactgccggg
481 aacctcaacg caaaatgaag ttaaacattt cttatccagc aaatggtaca caaaaattga
541 ttgaaatcga cgatgatcgt cgtcttcgag tctttatgga gaagagaatg ggacaagagg
601 ttccaggtga ctctgtaggc ccggagtttg ctggatatgt cttcaaaatt accggtggta
661 acgacaaaca aggatttccc atgtttcaag gcgtcttgct tcctcaccgt gttcgtcttt
721 tgcttcgtgc tggccaccct tgctatcgcc cacgccgtga tggtgagcgc aagcgtaaat
781 ctgtccgtgg ttgcatcgtc ggccaagatt tggctgtcct tgctcttgcc attattaaac
841 aaggtgaaca agatatccct ggtttgaccg atgttaccgt ccccaagcgt cttggtccaa
901 agcgtgcctc taagatccgc cgcttcttta acttgtccaa ggaagatgat gtccgtcaat
961 tcgttattcg tcgtgaggtt gttcccaaaa aggaaggaaa gaagccttat accaaggccc
1021 ctaagattca acgtcttgtt actcctagaa ctcttcaaca caagcgccac cgctttgctc
1081 ttaagcgcag acaagctgaa aagaaccgtg aagaggctgc tgagtttgct caacttatgg
1141 ctaagcgtgt cgctgaggct aagcaaaaga gagaggtcgt caaggctcgc cgtgcctctt
1201 ctttgaaaaa gtaaataggc atttttaagt aactttggtc atctccatat tattattttg
1261 ctggaacttt tcttatgggg tttattgttg ttgaataaac gtagttagta acattctcta
1321 agcatcgttt gtattcactt atgagtagga atgttaactg taattaggat tatcttcacg
1381 aagctgcttt ttgtggtaat ccttcatgct tttaaataat tttaggatct atatagtaga
1441 ggtaattaat gatattctaa tcaatctagc ttcaacacac acatgatcaa tttttatacg
1501 ttttttaatc attcatatac actacgctaa aaccttttac agattgtttt actaaaataa
1561 ttattatctg cgaaacactc cttctatttt cttgaggata tgctaggaga aaatattata
1621 ctataagtat aagagggggt ttgataaaac tatttaatta cctctgctca ctagagtttt
1681 tatcctttat gttcctatct aagtcgccta aatctcaaca atttggcgca gacaaaagtt
1741 gtttgcgttg cgctagtttc tacaaccaca ccactaatac caactagcaa gaaccagaat
1801 ttctgagcta ctttaggtag taaaataaga ggtgaattgt gagccccggt ttgctttttt
1861 agacataagg cgaacacgag ttagtgctgt ctgttgccaa attattttaa ttcggcttta
1921 tcaaaacaat atagcatcga
//
LOCUS CNAGMDHA 1334 bp ss-mRNA PLN 14-AUG-1990
DEFINITION C.vulgaris glyoxysomal malate dehydrogenase mRNA, complete cds.
ACCESSION M33148
KEYWORDS glyoxysomal malate dehydrogenase.
SOURCE C.vulgaris (variety Sugar Baby) cotyledon, cDNA to mRNA.
ORGANISM Citrullus vulgaris
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Violales; Cucurbitaceae.
REFERENCE 1 (bases 1 to 1334)
AUTHORS Gietl,C.
TITLE Glyoxysomal malate dehydrogenase from watermelon is synthesized
with an amino-terminal transit peptide
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5773-5777 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Gietl, 22-MAR-1990.
FEATURES from to/span description
pept 70 1140 glyoxysomal malate dehydrogenase precursor (EC
1.1.1.37)
sigp 70 180 glyoxysomal malate dehydrogenase signal
peptide
matp 181 1137 glyoxysomal malate dehydrogenase
mRNA < 1 1334 glyoxysomal malate dehydrogenase mRNA
BASE COUNT 340 a 275 c 340 g 379 t
ORIGIN
1 caacgctaag ttcccaaagg tttctgatct tgaagcggtt ggtttgtttt tctgtttgtc
61 aaactaatta tgcagccgat tccggatgtt aaccagcgca ttgctcgaat ctctgcgcat
121 cttcatcctc ccaagtctca gatggaggag agttcagctt tgaggagggc gaattgccgg
181 gctaaaggcg gagctcccgg gttcaaagtc gcaatacttg gcgctgccgg tggcattggc
241 cagccccttg cgatgttgat gaagatgaat cctctggttt ctgttctaca tctatatgat
301 gtagtcaatg cccctggtgt caccgctgat attagccaca tggacacggg tgctgtggtg
361 cgtggattct tggggcagca gcagctggag gctgcgctta ctggcatgga tcttattata
421 gtccctgcag gtgttcctcg aaaaccagga atgacgaggg atgatctgtt caaaataaac
481 gcaggaattg tcaagactct gtgtgaaggg attgcaaagt gttgtccaag agccattgtc
541 aacctgatca gtaatcctgt gaactccacc gtgcccatcg cagctgaagt tttcaagaag
601 gctggaactt atgatccaaa gcgacttctg ggagttacaa tgctcgacgt agtcagagcc
661 aatacctttg tggcagaagt attgggtctt gatcctcggg atgttgatgt tccagttgtt
721 ggcggtcatg ctggtgtaac cattttgccc cttctatctc aggtgaagcc tccaagttct
781 ttcacacaag aagagattag ttacctgact gataggattc aaaatggtgg aacagaagtt
841 gtcgaggcca aagcaggagc tggctcagca actctctcaa tggcttatgc tgccgttaag
901 tttgcagatg catgcctcag gggcttaaga ggagatgctg gtgtcattga atgcgcgttt
961 gtgtcttctc aggtgactga acttccattc tttgcatcaa aagtacgact tggtcgcaat
1021 ggtatcgaag aagtatactc ccttggcccg ctaaatgagt atgagaggat tggattggag
1081 aaagcgaaga aagagttggc aggaagcatt gagaagggag tttccttcat cagaagctga
1141 agagatgcca attaccatta gttttaatag aaacattcca tctcttatag attacttgtg
1201 ctcaatgttt tcctggagat tgaagttgat tgaaatgata ccacaccacg tatttttata
1261 ctaataaaac tatatcgcca tcatgtcgat atttaatgca caaccaaaag ggttggatta
1321 gagtaccttt tatg
//
LOCUS HAMACAPT 1837 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA,
partial cds.
ACCESSION J05590 M22755
KEYWORDS N-acetylglucosamine-1-phosphate transferase; glycosyl transferase;
tunicamycin-resistance.
SOURCE Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone
TRG-[4,22].
ORGANISM Cricetulus longicaudatus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 889 to 960)
AUTHORS Lehrman,M.A., Zhu,X. and Khounlo,S.
TITLE Amplification and molecular cloning of the hamster tunicamycin-
sensitive N-acetylglucosamine-1-phosphate transferase gene: The
hamster and yeast enzymes share a common peptide sequence
JOURNAL J. Biol. Chem. 263, 19796-19803 (1988)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1837)
AUTHORS Zhu,X. and Lehrman,M.A.
TITLE Cloning, sequence and expression of a cDNA encoding hamster
UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate
transferase
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and sequence for [1] kindly submitted by M. Lehrman,
04-APR-1989. Draft entry and computer-readable sequence for [1]
kindly submitted by M.Lehrman, 12-JUN-1990.
FEATURES from to/span description
pept 148 1374 N-acetylglucosamine-1-phosphate transferase
(GPT)
binding 67 79 dolichol binding site
binding 222 234 dolichol binding site
BASE COUNT 363 a 515 c 418 g 541 t
ORIGIN
1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg
61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg
121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg
181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc
241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg
301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc
361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc
421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg
481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct
541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta
601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc
661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat
721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta
781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt
841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga
901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc
961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct
1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg
1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc
1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag
1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt
1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc
1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc
1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca
1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt
1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt
1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc
1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat
1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag
1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc
1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt
//
LOCUS HS4NA3ABC 10089 bp ds-DNA VRL 14-AUG-1990
DEFINITION Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C
genes, complete cds.
ACCESSION M34440
KEYWORDS nuclear antigen.
SOURCE Epstein-Barr virus type 2 (isolate AG876) DNA.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 10089)
AUTHORS Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E.,
Rickinson,A. and Kieff,E.
TITLE Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their
EBNA-3A, EBNA-3B and EBNA-3C genes
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.T.Sample, 23-MAY-1990.
Kieff Laboratory
Harvard Medical School
75 Francis Street
Boston, MA 02115
FEATURES from to/span description
pept 189 524 nuclear antigen EBNA-3A, exon 1
610 3051 nuclear antigen EBNA-3A, exon 2
pept 3237 3593 nuclear antigen EBNA-3B, exon 1
3672 6155 nuclear antigen EBNA-3B, exon 2
pept 6277 6636 nuclear antigen EBNA-3C, exon 1
6711 9560 nuclear antigen EBNA-3C, exon 2
IVS 525 609 EBNA-3A intron
IVS 3594 3671 EBNA-3B intron
IVS 6637 6710 EBNA-3C intron
BASE COUNT 2495 a 2837 c 2619 g 2138 t
ORIGIN
1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct
61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag
121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg
181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag
241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca
301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc
361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac
421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc
481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg
541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt
601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat
661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg
721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg
781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact
841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg
901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg
961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc
1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct
1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac
1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat
1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg
1261 tacgggcaag acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca
1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg
1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg
1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct
1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa
1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac
1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc
1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc
1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg
1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg
1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa
1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca
1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca
2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag
2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc
2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac
2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg
2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa
2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg
2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg
2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc
2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg
2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg
2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa
2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg
2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct
2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat
2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt
2941 ggccagaatg tcacagacca tgaacctaga agggtggtgg tgtcagctat tgttcacatg
3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg
3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat
3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc
3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga
3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg
3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac
3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag
3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca
3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc
3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag
3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc
3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga
3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac
3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct
3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata
3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa
3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag
4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt
4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga
4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata
4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag
4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct
4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg
4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa
4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg
4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca
4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag
4621 ccagaacaga gcaaccaaga gccaagcctg attcccctgc ccccacagtg gtccttcggc
4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg
4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca
4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca
4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga
4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt
4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac
5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc
5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac
5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct
5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc
5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc
5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc
5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc
5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc
5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc
5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag
5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta
5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg
5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac
5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac
5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc
5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct
6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt
6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt
6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg
6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta
6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg
6301 gactctatac agtcacctga caatgcgcgg ggagatgatg tacagaatac tggtgagcat
6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac
6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc
6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta
6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca
6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc
6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc
6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct
6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg
6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat
6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca
6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg
7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag
7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga
7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc
7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt
7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta
7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc
7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg
7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag
7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc
7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac
7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag
7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg
7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag
7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt
7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga
7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta
7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca agtaagccac
8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc
8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc
8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc
8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc
8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc
8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg
8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac
8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg
8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg
8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata
8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc
8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact
8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc
8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt
8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag
8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc
9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac
9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc
9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc
9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac
9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg
9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag
9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa
9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg
9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca
9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga
9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta
9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca
9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg
9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg
9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg
9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag
9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc
10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt
10081 ggaggatcc
//
LOCUS HUMPAPB 3061 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human prostatic acid phosphatase mRNA, complete cds.
ACCESSION M34840
KEYWORDS glycoprotein; prostatic acid phosphatase; secretory protein.
SOURCE Human prostate, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1199)
AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
Moore,D.L.
TITLE Covalent structure, disulfide bonding and identification of
reactive surface and active site residues of human prostatic acid
phosphatase
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3061)
AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
Moore,D.L.
TITLE Covalent structure, disulfide bonding and identification of
reactive surface and active site residues of human prostatic acid
phosphatase
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.L.Van Etten, 08-JUN-1990.
Chemistry Department
Purdue University
West Lafayette, IN 47907
FEATURES from to/span description
pept 15 1175 prostatic acid phosphatase precursor (EC
3.1.3.2) /hgml_locus_uid="LA0168F"
/nomgen="ACPP" /map="3q21-qter"
sigp 15 110 prostatic acid phosphatase signal peptide
matp 111 1172 prostatic acid phosphatase
BASE COUNT 917 a 642 c 681 g 821 t
ORIGIN Chromosome 3q21-qter.
1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc
61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga
121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg
181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg
241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt
301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg
361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc
421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc
481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg
541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag
601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg
661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga
721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat
781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc
841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc
901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga
961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc
1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt
1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa
1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag
1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc
1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca
1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt
1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa
1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt
1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt
1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga
1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc
1681 aaggagaggc aaagaaagga gatacagtgg agacatctgg aaagttttct ccactggaaa
1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt
1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat
1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata
1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga
1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag
2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag
2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt
2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta
2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca
2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa
2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc
2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac
2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa
2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat
2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac
2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc
2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc
2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat
2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc
2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat
2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta
3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa
3061 a
//
LOCUS LDV3ECPA 1064 bp ss-RNA VRL 14-AUG-1990
DEFINITION Lactate dehydrogenase-elevating virus capsid protein gene, complete
cds.
ACCESSION M34856
KEYWORDS viral capsid protein.
SOURCE Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones
A16, B24, B63, B90, B103, C44, dt4.
ORGANISM Lactate dehydrogenase-elevating virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Togaviridae.
REFERENCE 1 (bases 1 to 1064)
AUTHORS Godeny,E.K., Speicher,D.W. and Brinton,M.A.
TITLE Map location of lactate dehydrogenase-elevating virus (LDV) capsid
protein (Vp1) gene
JOURNAL Virology (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.K.Godeny, 06-JUN-1990.
FEATURES from to/span description
pept 637 984 capsid protein
pept 135 650 unidentified ORF
BASE COUNT 267 a 223 c 265 g 309 t
ORIGIN 3' terminus region.
1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc
61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc
121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt
181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg
241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt
301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg
361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc
421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg
481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta
541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga
601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc
661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat
721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc
781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga
841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt
901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat
961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg
1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt
//
LOCUS MBIFRHADGB 3903 bp ds-DNA BCT 14-AUG-1990
DEFINITION M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase
(frhADGB) gene, complete cds.
ACCESSION J02914
KEYWORDS 8-hydroxy-5-deazaflavin-reducing hydrogenase.
SOURCE M.thermoautotrophicum (strain delta-H) DNA.
ORGANISM Methanobacterium thermoautotrophicum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanobacteriaceae.
REFERENCE 1 (bases 1 to 3903)
AUTHORS Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T.
TITLE Cloning, sequence determination, and expression of the genes
encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin
reducing hydrogenase from Methanobacterium thermoautotrophicum
delta-H
JOURNAL Biochemistry (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.A.Alex, 25-MAY-1990.
FEATURES from to/span description
pept 298 1515 F420 hydrogenase alpha-subunit (frhA)
pept 1524 2000 F420 hydrogenase delta-subunit (frhD)
pept 2114 2824 F420 hydrogenase gamma-subuint (frhG)
pept 2837 3682 F420 hydrogenase beta-subunit (frhB)
binding 284 293 ribosomal binding site (put.)
binding 1514 1520 ribosomal binding site (put.)
binding 2100 2106 ribosomal binding site (put.)
binding 2823 2832 ribosomal binding site (put.)
BASE COUNT 1201 a 878 c 965 g 859 t
ORIGIN
1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta
61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac
121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt
181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt
241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg
301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg
361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg
421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg
481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc
541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac
601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac
661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt
721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac
781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa
841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc
901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa
961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata
1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca
1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt
1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg
1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta
1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag
1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca
1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc
1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa
1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg
1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt
1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg actggcggtc
1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg
1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg
1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa
1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa
1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg
1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc
2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga
2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag
2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact
2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga
2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga
2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac
2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt
2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc
2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc
2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt
2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct
2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg
2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct
2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac
2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata
2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg
3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa
3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga
3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta
3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat
3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat
3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga
3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg
3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg
3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga
3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga
3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat
3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt
3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa
3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt
3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat
3901 cga
//
LOCUS MUSLRCNA 1751 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse loricrin gene, complete cds.
ACCESSION M34398
KEYWORDS cell envelope protein; loricrin.
SOURCE Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to
mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1751)
AUTHORS Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D.,
Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M.,
Yuspa,S.H. and Roop,D.R.
TITLE Identification of a major keratinocyte cell envelope protein,
loricrin
JOURNAL Cell 61, 1103-1112 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Rothnagel, 22-MAY-1990.
FEATURES from to/span description
pept 71 1516 loricrin
signal 1740 1745 polyA signal
BASE COUNT 223 a 493 c 706 g 329 t
ORIGIN
1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc
61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg
121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg
181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg
241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg
301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg
361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg
421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg
481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg
541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg
601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg
661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc
721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg
781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg
841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg
901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg
961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg
1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg
1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc
1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg
1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag
1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg
1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg
1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg
1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac
1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc
1561 cgtgggcgcc gatgggctta actttctcat gaatttgcct gaggtttcca aacccttcac
1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct
1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca
1741 ataaatttgc t
//
LOCUS PFAGIPA 1906 bp ds-DNA INV 14-AUG-1990
DEFINITION P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete
cds.
ACCESSION J05544
KEYWORDS glucosephosphate isomerase.
SOURCE P.falciparum (strain 7G8) DNA.
ORGANISM Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae.
REFERENCE 1 (bases 1 to 1906)
AUTHORS Kaslow,D.C. and Hill,S.
TITLE Cloning metabolic pathway genes by complementation in Escherichia
coli: Isolation and expression of Plasmodium falciparum
glucosephosphate isomerase
JOURNAL J. Biol. Chem. 265, 12337-12341 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.C.Kaslow, 04-JUN-1990.
FEATURES from to/span description
pept 28 1803 glucosephosphate isomerase (EC 5.3.1.9)
BASE COUNT 782 a 174 c 289 g 661 t
ORIGIN Chromosome 14.
1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat
61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat
121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct
181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa
241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc
301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata
361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat
421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg
481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa
541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat
601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat
661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct
721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt
781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg
841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta
901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga
961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat
1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat
1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca
1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa
1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat
1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa
1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg
1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa
1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt
1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta
1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt
1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat
1681 gacacaagaa atcaaaagaa atcagataat acctataatt ttaatgaatc tacaaaaaat
1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag
1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt
1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt
//
LOCUS RATGADA 3198 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Rat glutamic acid decarboxylase (GAD) mRNA, complete cds.
ACCESSION M34445
KEYWORDS glutamic acid decarboxylase.
SOURCE Rat brain and hypothalamus, cDNA to mRNA, (libraries of
D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and
pCR22HP.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3198)
AUTHORS Wyborski,R.J., Bond,R.W. and Gottlieb,D.I.
TITLE Characterization of a cDNA coding for rat glutamic acid
decarboxylase
JOURNAL Mol. Brain Res. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.I.Gottlieb, 23-MAY-1990.
FEATURES from to/span description
pept 184 1965 glutamic acid decarboxylase
mRNA < 1 3198 glutamic acid decarboxylase mRNA
signal 3174 3179 polyA signal
BASE COUNT 869 a 742 c 767 g 820 t
ORIGIN
1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca
61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc
121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag
181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat
241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc
301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag
361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa
421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct
481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg
541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac
601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac
661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt
721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta
781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc
841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca
901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc
961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc
1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg
1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt
1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat
1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca
1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg
1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc
1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa
1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag
1501 cagtatgacg tctcctacga tacaggggac aaggcgattc agtgtggccg ccatgtggac
1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc
1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt
1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa
1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc
1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg
1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc
1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc
1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca
2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg
2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa
2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat
2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc
2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca
2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag
2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta
2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca
2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac
2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag
2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt
2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta
2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt
2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt
2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc
2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa
3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta
3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt
3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa
3181 taaaaaatat tctccatg
//
LOCUS RATTSHRA 5270 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Rat thyroid stimulating hormone receptor mRNA, complete cds.
ACCESSION M34842
KEYWORDS TSH receptor; thyroid stimulating hormone receptor;
transmembrane protein.
SOURCE Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to
mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 5270)
AUTHORS Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C.,
McBride,O.W. and Kohn,L.D.
TITLE Cloning, chromosomal assignment, and regulation of the rat
thyrotropin receptor by thyrotropin, agents which increase cAMP
levels and thyroid autoantibodies
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5677-5681 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.D.Kohn, 05-JUN-1990.
FEATURES from to/span description
pept 55 2349 thyroid stimulating hormone receptor precursor
sigp 55 117 thyroid stimulating hormone receptor signal
peptide
matp 118 2346 thyroid stimulating hormone receptor
signal 5256 5261 poly-A signal
BASE COUNT 1415 a 1343 c 1176 g 1336 t
ORIGIN
1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg
61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga
121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag
181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact
241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat
301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa
361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta
421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct
481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct
541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg
601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg
661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga
721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct
781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag
841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac
901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat
961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc
1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag
1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag
1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac
1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca
1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg
1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc
1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc
1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac
1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt
1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc
1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg
1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc
1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac
1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg
1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt
1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc
1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg
2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc
2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag
2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa
2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt
2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca
2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa
2401 catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct
2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc
2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac
2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg
2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag
2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga
2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat
2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc
2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat
2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt
3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta
3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat
3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac
3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga
3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt
3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa
3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct
3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt
3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt
3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata
3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt
3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac
3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat
3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga
3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat
3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag
3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc
4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc
4081 tgtgaagtag ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg
4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa
4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga
4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag
4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc
4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg
4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc
4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct
4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg
4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca
4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa
4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta
4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga
4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg
4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc
4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg
5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca
5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt
5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg
5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg
//
LOCUS STMBLIPA 840 bp ds-DNA BCT 14-AUG-1990
DEFINITION S.clavuligerus beta-lactamase inhibitory protein gene, complete
cds.
ACCESSION M34538
KEYWORDS beta-lactamase inhibitory protein.
SOURCE S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP.
ORGANISM Streptomyces clavuligerus
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 840)
AUTHORS Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E.
TITLE Isolation and characterization of a beta-lactamase inhibitory
protein from Streptomyces clavuligerus and cloning and analysis of
corresponding gene
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.E.Jensen, 25-MAY-1990.
Dept. of Microbiology
University of Alberta
Edmonton, Alberta Canada T6G 2E9
FEATURES from to/span description
pept 67 672 beta-lactamase inhibitory protein precursor
(bliP)
sigp 67 174 beta-lactamase inhibitory protein signal
peptide
matp 175 669 beta-lactamase inhibitory protein
binding 52 63 ribosomal binding site
signal 686 724 transcription termination signal
BASE COUNT 140 a 248 c 320 g 132 t
ORIGIN
1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga
61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc
121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg
181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc
241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc
301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc
361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg
421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg
481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg
541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc
601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg
661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc
721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg
781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc
//
LOCUS TRBPARPA 763 bp ss-mRNA INV 14-AUG-1990
DEFINITION T.brucei procyclin (pPS-1) mRNA, complete cds.
ACCESSION M34790
KEYWORDS procyclic acidic repetitive protein; procyclin.
SOURCE T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone
pPS-1.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 763)
AUTHORS Dorn,P.L., Aman,R.A. and Boothroyd,J.C.
TITLE Inhibition of protein synthesis results in super-induction of
procyclin RNA
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Boothroyd, 04-JUN-1990.
Stanford University School of Medicine
Dept of Microbiology and Immunology
Sherman Fairchild Science Building
Stanford, CA 94305-5402 USA
FEATURES from to/span description
pept 29 466 procyclin
mRNA < 1 763 procyclin mRNA
rpt 215 393 12 bp degenerate repeats
BASE COUNT 192 a 215 c 156 g 200 t
ORIGIN
1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct
61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc
121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg
181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga
241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga
301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga
361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc
421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt
481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc
541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt
601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc
661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag
721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt
//
LOCUS W10PPIBCA 4761 bp ds-DNA BCT 14-AUG-1990
DEFINITION Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes,
complete cds, and ippD gene, 3' end.
ACCESSION M34849
KEYWORDS antigen.
SOURCE Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA,
clones HB101/pHS[4108,5357].
ORGANISM Plasmid pWR100
Prokaryota; Bacteria.
REFERENCE 1 (bases 1 to 4761)
AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J.
TITLE Nucleotide sequence of the invasion plasmid antigen B and C genes
(ipaB and ipaC) of Shigella flexneri
JOURNAL Microb. Pathog. 4, 345-357 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 538 and 3987 to 4761)
AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J.
JOURNAL Unpublished (1988)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by B.Baudry, 05-JUN-1990.
Center for Vaccine Development
University of Maryland at Baltimore
10 South Pine Street
Baltimore, MD 21201
FEATURES from to/span description
pept 606 1073 polypeptide I (ippI)
pept 1079 2821 polypeptide B (ipaB)
pept 2841 3932 polypeptide C (ipaC)
pept 3983 > 4761 ipaD protein
pept < 1 549 unidentified ORF
binding 595 600 ippI ribosomal binding site
binding 1065 1070 ipaB ribosomal binding site
binding 2829 2834 ipaC ribosomal binding site
binding 3971 3977 ipaD ribosomal binding site
signal 556 561 ippI -10 region
signal 2723 2728 ipaC -35 region
signal 2745 2750 ipaC -10 region
BASE COUNT 1769 a 952 c 805 g 1235 t
ORIGIN
1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat
61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat
121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct
181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca
241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat
301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc
361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg
421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca
481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat
541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag
601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa
661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca
721 tttattcata tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct
781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta
841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat
901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag
961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat
1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat
1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga
1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac
1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa
1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct
1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa
1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga
1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat
1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat
1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga
1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca
1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat
1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa
1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg
1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc
1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga
1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt
2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt
2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca
2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc
2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc
2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac
2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac
2401 cctcacagac cttataccaa agtttctcaa gaatttttct tctcaactgg acgatttaat
2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc
2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc
2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa
2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga
2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc
2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg
2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata
2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc
2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat
3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg
3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag
3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag
3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga
3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc
3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa
3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc
3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa
3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc
3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt
3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc
3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca
3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca
3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc
3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag
3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg
3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc
4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa
4081 aacaacgacc agttctcatc ctgtaagttc ccttactatg ctcaacgaca cccttcataa
4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac
4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc
4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt
4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga
4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata
4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag
4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa
4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc
4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac
4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat
4741 agacaatatg ttaaaaagct t
//
LOCUS YEPOMPHA 824 bp ss-mRNA BCT 14-AUG-1990
DEFINITION Y.enterocolitica cationic outer membrane protein (ompH) gene,
complete cds.
ACCESSION M34854
KEYWORDS cationic outer membrane protein;
enterobacterial outer membrane protein;
lipopolysaccharide binding protein; secretory protein;
structural membrane protein.
SOURCE Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115.
ORGANISM Yersinia enterocolitica
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 824)
AUTHORS Hirvas,L., Koski,P. and Vaara,M.
TITLE The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and
comparison to known ompH sequences
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by L.H.Hirvas, 06-JUN-1990.
University of Helsinki
FEATURES from to/span description
pept 324 818 ompH protein precursor (gtg start codon)
sigp 324 392 ompH protein signal peptide
matp 393 815 ompH protein
binding 308 317 ompH ribosomal binding site
signal 120 125 ompH -35 region
signal 143 148 ompH -10 region
BASE COUNT 257 a 147 c 198 g 221 t 1 others
ORIGIN
1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg
61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg
121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt
181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt
241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa
301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag
361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa
421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag
481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc
541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga
601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc
661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta
721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta
781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac
//
LOCUS YSCEXG1A 2056 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase
(EXG1) gene, complete cds.
ACCESSION M34341
KEYWORDS 1,3-glucan glucanohydrolase; exo-1,3-B-glucanase.
SOURCE S.cerevisiae (strain AB320) DNA, clone YEp13.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2056)
AUTHORS Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A.,
Nebreda,A.R., Mendez,E. and del Rey,F.
TITLE Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1,
in the yeast Saccharomyces cerevisiae
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.del Rey, 17-MAY-1990.
Instituto de Microbiologia bioquimica
Facultad de Biologia
CSIC/Universidad de Salamanca
37008 Salamanca, SPAIN
FEATURES from to/span description
pept 363 1709 exo-1,3-B-glucanase/1,3-B-D-glucan
glucanohydrolase precursor (EXG1, EC 3.2.1.58)
sigp 363 482 exo-1,3-B-glucanase/1,3-B-D-glucan
glucanohydrolase signal peptide
matp 483 1706 exo-1,3-B-glucanase/1,3-B-D-glucan
glucanohydrolase
mRNA 265 > 1709 EXG1 mRNA
signal 153 159 TATA box
BASE COUNT 610 a 395 c 396 g 655 t
ORIGIN Chromosome XII-right arm.
1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg
61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac
121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa
181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat
241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt
301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta
361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta
421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa
481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg
541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg
601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg
661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg
721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc
781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca
841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt
901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca
961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat
1021 acttggacac tgttattggt atcgaattga ttaatgagcc attgggtcct gttctagaca
1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca
1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact
1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct
1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg
1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga
1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg
1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt
1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct
1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg
1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc
1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat
1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt
1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa
1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta
1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat
1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt
2041 gagcttttcc gccggc
//
LOCUS CORDTXRA 2529 bp ds-DNA BCT 14-AUG-1990
DEFINITION C.diphtheriae toxin repressor (dtxR) gene, complete cds.
ACCESSION M34239
KEYWORDS regulatory protein; toxin repressor.
SOURCE C.diphtheriae (strain C7) DNA, clone pHH2500..
ORGANISM Corynebacterium diphtheriae
Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE 1 (bases 1 to 1390)
AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R.
TITLE Molecular cloning and DNA sequence analysis of the diphtheria tox
iron-dependent regulatory element (dtxR) from Corynebacterium
diphtheriae
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5968-5972 (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 2529)
AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R.
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.R.Murphy, 11-MAY-1990.
University Hospital
Dept Medicine-Rm E613
88 East Newton Street
Boston, MA 02118-2393
FEATURES from to/span description
pept 681 1361 toxin repressor protein (dtxR)
pept 25 456 ORF1
pept 1384 2370 ORF3
signal 595 629 promoter sequence
binding 664 669 ribosomal binding site (put.)
binding 1200 1283 DNA-binding region (put.)
BASE COUNT 691 a 615 c 593 g 630 t
ORIGIN 1 bp upstream of HindIII site.
1 aagctttctc gaatcaaacg cgagatgtat caacagctcg gacgtgaagc aaccaacgaa
61 gagcttgcgg aagaatctgg cattgaagaa tccaaaattg aaatgttgct tcgccagtct
121 cgtgacccag taagcctaga tatgccagtt ggtgccgatg aagaggctcc acttggtgac
181 tttatcgaag attccgaggc gactgatgcc gaatcagctg tagttgcgtc gttgcgtcac
241 tctgatatca gaacagtctt ggacactttg gagcaacgtg aacaggacgt tattcgcctt
301 cgttatgggc tcgatgatgg agttccccgc actttagatc aaattggtcg tcaatttgga
361 ctgtctcgtg aaagagttcg ccaaattgaa cgcgaggtca tgtcgaaact gcgtgacggt
421 gcccgtgccg aaaaacttcg agcatacgca caataaagcg tttgcttaga tatgcctacc
481 aataaagaca taaacgccta ttaaaagcaa tctttagatt aggcgttata attaaagtct
541 catcgaaaaa cgcgctgcgg gactacaacg caacaagaaa actattccat atttttcacg
601 ctacaattcg ttgtagattg ataggaattg atcaccagca cacaacagtc tccatggcac
661 tataaggaaa gaggcttaca atgaaggact tagtcgatac cacagagatg tacttgcgta
721 ctatctatga gctggaagaa gagggagtca cccctcttcg cgctaggatc gctgagcgtc
781 tggaacaatc tggacctaca gttagccaaa ccgttgcccg tatggagcgc gatggacttg
841 tcgttgtcgc ctcagaccgc agtctacaaa tgacaccgac aggccgcact ttagcgactg
901 cagttatgcg taaacatcgc ttagctgagc gccttcttac cgatatcatt ggtctagata
961 tcaataaagt tcacgatgaa gcctgccgct gggaacacgt tatgagtgac gaagttgaac
1021 gcaggctcgt gaaagtattg aaagatgtca gtcggtcccc cttcggaaac ccaattccag
1081 gtctcgacga actcggcgta ggcaattctg acgcggcagc ccccggaact cgcgttattg
1141 acgctgccac cagcatgccc cgcaaagtac gcattgttca gattaacgaa atctttcaag
1201 ttgaaacgga tcagtttaca cagctcctcg atgctgacat ccgtgttgga tcagaagtcg
1261 aaattgtaga tagagacggc cacatcacgt tgagccacaa tggaaaagat gtcgaactcc
1321 tcgatgatct ggctcacact attcgtatcg aagaactcta aatactaaag gcggcgaaat
1381 tagatgaaac tcctcgttac cggtggcgcc ggatacgtag gaagtgtctg ttccactgtt
1441 ctgctcgaac aggggcacga agtaacaatt gtcgataatc ttacaaccgg caatcgcgat
1501 gctgttccgc taggagccac ttttgtcgag ggtgatatca aagacgttgc ggaaaacgtt
1561 ttatcatctg attcttttga tgcggttctt cacttcgcgg ctcgttcgct tgtaggcgaa
1621 tcagttgaaa agccagatga atattggcag cacaacatgg tgaccacact cgcacttctt
1681 gatgccatga aacgtaataa cgtgcgaaac attgtctttt cttctacagc tgctacatac
1741 ggcgaacccg aaacagtccc gattactgaa gacgctccta cccacccaac gaatccatac
1801 ggcgcaacca agctgtctat cgactacgcg atcacatcat acgcacatgc atatgggttt
1861 gctgcgacga gtctgcggta cttcaacgta gctggcgcat atggtttggt gggtgaaaat
1921 cgagaaatag aaacccatct cattccgctc gttttacaag tagcactggg acaccgtgac
1981 aagattttca tgtttggtga cgactggcca actgaagatg gaacccctat ccgcgattac
2041 attcatattc gtgatcttgc agatgcccat attttggcac ttcaaagtaa cgtcgaaggc
2101 agccaccgga tctttaatct cggatccggt gaaggatatt cggtcaaaca ggtcattgat
2161 acctgcagag aagttaccgg tcatcccatc cccgctgagg tagctcctcg acgggctggt
2221 gatcctgcag tgctcattgc atcatcggct aaggcgcaat ccgaactagg atggaagcca
2281 caacgcactg atctgcacac gattgtttct gatgcttggg cttttacgtc tcaactcgga
2341 gacaaagccc atagcgcttc tcgtggataa caacgagtag gtttatcttt tgcagtccgc
2401 ggtgtgatta cttaagtatt cacaccgcgg attttacatt tgttgactca ccaaaatgtt
2461 tctgagcttc ctgactccca ttgataacaa gatcaacaat ggatccagat agacctcgat
2521 aataagctt
//
LOCUS ECOANSBA 1643 bp ds-DNA BCT 14-AUG-1990
DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds.
ACCESSION M34234
KEYWORDS L-asparaginase II; ansB gene.
SOURCE E.coli (strain K12 isolate JM108) DNA, clones lambda-ansB and
pansB7.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1643)
AUTHORS Bonthron,D.T.
TITLE L-asparaginase II of Escherichia coli K-12: Cloning and sequencing
of the ansB gene and its position on the chromosomal map
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by D.T.Bonthron, 11-MAY-1990.
FEATURES from to/span description
pept 346 1392 L-asparaginase II (ansB) (EC 3.5.1.1)
sigp 346 411 L-asparaginase II signal peptide
matp 412 1389 L-asparaginase II
binding 335 339 ribosomal binding site (put.)
site 1400 1425 transcription termination signal (put.)
BASE COUNT 442 a 404 c 405 g 392 t
ORIGIN 63.8 min on K12 map.
1 gcggtaaccc attacagaat gtgctgggaa gcctgggggg gctgcaatcc tcaatccaaa
61 ccgagtggaa aaagcaggaa aaagatttcc agcagtttgg caaagatgtt tgtagccgcg
121 ttgtgactct ggaagatagc cgcaaagccc tggtcgggaa tttaaaataa tcctctattt
181 taagacggca taatactttt ttatgccgtt taattcttcg ttttgttacc tgcctctaac
241 tttgtagatc tccaaaatat attcacgttg taaattgttt aacgtcaaat ttcccataca
301 gagctaaggg ataatgcgta gcgttcacgt aactggagga atgaaatgga gtttttcaaa
361 aagacggcac ttgccgcact ggttatgggt tttagtggtg cagcattggc attacccaat
421 atcaccattt tagcaaccgg cgggaccatt gccggtggtg gtgactccgc aaccaaatct
481 aactacacag tgggtaaagt tggcgtagaa aatctggtta atgcggtgcc gcaactaaaa
541 gacattgcga acgttaaagg cgagcaggta gtgaatatcg gctcccagga catgaacgat
601 aatgtctggc tgacactggc gaaaaaaatt aacaccgact gcgataagac cgacggcttc
661 gtcattaccc acggtaccga cacgatggaa gaaactgctt acttcctcga cctgacggtg
721 aaatgcgaca aaccggtggt gatggtcggc gcaatgcgtc cgtccacgtc tatgagcgca
781 gacggtccat tcaacctgta taacgcggta gtgaccgcag ctgataaagc ctccgccaac
841 cgtggcgtgc tggtagtgat gaatgacacc gtgcttgatg gccgtgacgt caccaaaacc
901 aacaccaccg acgtagcgac cttcaagtct gttaactacg gtcctctggg ttacattcac
961 aacggtaaga ttgactacca gcgtaccccg gcacgtaagc ataccagcga cacgccattc
1021 gatgtctcta agctgaatga actgccgaaa gtcggcattg tttataacta cgctaacgca
1081 tccgatcttc cggctaaagc actggtagat gcgggctatg atggcatcgt tagcgctggt
1141 gtgggtaacg gcaacctgta taaatctgtg ttcgacacgc tggcgaccgc cgcgaaaacc
1201 ggtactgcag tcgtgcgttc ttcccgcgta ccgacgggcg ctaccactca ggatgccgaa
1261 gtggatgatg cgaaatacgg cttcgtcgcc tctggcacgc tgaacccgca aaaagcgcgc
1321 gttctgctgc aactggctct gacgcaaacc aaagatccgc agcagatcca gcagatcttc
1381 aatcagtact aatcgcctcg ccccggtatc gtgccggggc tttttcactt cagactcacg
1441 tccattgcca attttaatta ccctaatgat aatcaccgga ataaattatt ccgcgtgagg
1501 tttttcgggt gaaaaagcaa tggattgttg gcacggcact gcttatgttg atgactggta
1561 atgcctgggc agatggcgaa ccgccaactg aaaatatctt aaaagatcaa ttcaaaaagc
1621 agtatcacgg cattctcaag ctt
//
LOCUS HUMADE5RC 1946 bp ds-DNA SYN 14-AUG-1990
DEFINITION Synthetic human/adenovirus type 5 recombination junction.
ACCESSION M34061
KEYWORDS .
SOURCE Human fetal bone marrow fibroblasts transformed with Adeno5/SV40
recombinant virus (cell line H13.1) DNA, clone pH13.1SV1.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 1946)
AUTHORS Romani,M., De Ambrosis,A., Alhadeff,B., Purrello,M., Gluzman,Y. and
Siniscalco,M.
TITLE Preferential viral integration at the highly recombinogenic
chromosomal site 1p36 in human cells transformed with an
Adeno5/SV40 hybrid
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Romani 08-MAY-1990.
FEATURES from to/span description
recomb 429 430 adenovirus type 5 end/Human DNA start
rpt 430 1134 tandem Alu repeats
BASE COUNT 492 a 548 c 510 g 396 t
ORIGIN Chromosome 1p36.
1 tttgacccgg aacgcggaaa acacctgaga aaaacacctg ggcgagtctc cacgtaaacg
61 gtcaaagtcc ccgcggccct agacaaatat tacgcgctat gagtaacaca aaattattca
121 gatttcactt cctcttattc agttttcccg cgaaaatggc caaatcttac tcggttacgc
181 ccaaaattta ctacaacatc cgcctaaaac cgcgcgaaaa ttgtcacttc ctgtgtacac
241 cggcgcacac caaaaacgtc acttttgcca catccgtcgc ttacatgtgt tccgccacac
301 ttgcaacatc acacttccgc cacactacta cgtcacccgc cccgttccca cgccccgcgc
361 cacgtcacaa actccacccc ctcattatca tattggcttc aatccaaaat aaggtatatt
421 attgatgata tataatatag aggctgaggc aggagggttg cttgagcctg ggaggctgcg
481 gctgcagtga gccatgatca caccactgca cttcagcctg ggcaacagag cgagactccg
541 tctcaaaaaa tagtaatagg ccaggcgcgg tggctcatgc cagtaatccc agcgctttgg
601 gaggctgagg caggcagatc acctgaggtc aggagttcaa gaccagcctg gccaacatgg
661 tgaaaccctg tctctactaa aaatacaaaa attagttggg catggtggcg gcgcacacca
721 gtaatcccag ctactcaaga ggctgagggc acttgggcag cagaggaggt tgtagtgagc
781 agagatcacg ccactgcact ctagcctggg caacagaggg agactctgtc tcaaaaaata
841 ataggccggg cacggtggct catgcctgta atcccagcac tttgggaggc cgaggcgggt
901 ggatcacctg agatcaggag tttaagacca gcctggccaa catggtgaaa ccccatctct
961 actaaaaata caaaaaatta gccaggcgtg gtggcgggcg tctgtaatcc cagctacttg
1021 ggaggctgaa gcaggagaat cacttgagcc caggaggcag aggttgcagt gagctgagat
1081 catgccatta cactccagcc tgggcaacaa gagtgaaact ccgtctcaaa aaataataat
1141 aataatgata ataataataa taattgcatc tattcactga gcgcctcctg tacaccaggc
1201 accatgtggc attaggggcc catgctgtca tcatattctg ccagccaccc atggggtggt
1261 ggggtaattc cccaggccac acctgtccac acctctcttc aggtactgcg tgggcatgtg
1321 cggagacggc gccaatgact gtggggccct gaaggcggct gatgtcggca tctcgctgtc
1381 ccaggcagaa gcctcagtgg tctcaccctt cacctcgagc atggccagta ttgagtgcgt
1441 gcccatggtc atcaggtaag gcaggcaggg accggtgggt gagggctcag cagggctggt
1501 cagccaacct gggtccccta atgtccgtgc cctgccaccc agggaggggc gctgttccct
1561 tgacacttcg ttcagcgtct tcaagtacat ggctctgtac agcctgaccc agttcatctc
1621 cgtcctgatc ctctacacgg tgagtatctg cagagctccg tacccagagc cgtcccagga
1681 ctagggaggg gacacagccc tgtgccttgg agctggcagg tcccagatag atgacgtggg
1741 cccagatcct gaccctgcca ctctccggct gtgtggttgg agaggtggca aggcttgggg
1801 gagatgagaa aaacaccagg cacggagcct ggtacccact aggtcctcac aaatggcagc
1861 agctgtcatg ggctaacatc ttgaccgcaa ctctgggagc tccccaatct gcagggaaga
1921 ctgaacggag gcagggccaa aaccct
//
LOCUS MCLPCCA 1701 bp ds-DNA BCT 14-AUG-1990
DEFINITION M.laminosus phycoerythrocyanin (PCC) alpha and beta subunit genes,
complete cds, and linker protein, 5' end.
ACCESSION M34254
KEYWORDS light-harvesting biliprotein; phycoerythrocyanin.
SOURCE M.laminosus (strain Fischerella PCC 7603) DNA.
ORGANISM Mastigocladus laminosus
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria.
REFERENCE 1 (bases 1 to 1701)
AUTHORS Eberlein,M. and Kufer,W.
TITLE Genes encoding both subunits of phycoerythrocyanin, a light-
harvesting, biliprotein from the cyanobacterium Mastigocladus
laminosus
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.Kufer, 11-MAY-1990.
Botanisches Institut
Ludwig-Maximilians- Universitaet Muenchen
Menzingerstr. 67
D-8000 Muenchen 19
Federal Republic of Germany
FEATURES from to/span description
pept 279 797 phycoerythrocyanin beta-subunit
pept 884 1372 phycoerythrocyanin alpha-subunit
pept 1528 > 1701 phycoerythrocyanin linker protein precursor
matp 1531 > 1701 phycoerythrocyanin linker protein
BASE COUNT 526 a 312 c 383 g 480 t
ORIGIN
1 agctttacat ttaaggatat ggcgctcaaa gaatgggagt tttacagtta ttaaataaag
61 ttcataaaaa agaatacaag taatatataa aatacgaaac taaggtttga aagtaattgc
121 aaatatcaag gcaaacgaaa agataaaggc taattgttgc acgtaaaaat ggtatttttt
181 tgatttttag gtgtagttat tggcggattt tatctggtct cggtttgtct tgattaaaaa
241 tgcaatagct gccatgaaat cgaggagaaa atagaaaaat gcttgatgct ttttccagag
301 tagttgaaca agctgataaa aaaggtgctt acctaagcaa tgatgaaata aatgctttgc
361 aggctatagt tgcagatagc aacaaacgtt tggatgtagt gaatcggctt acaagcaacg
421 cctcctcaat tgtggcgaat gcttatcgtg ctttagtagc agaacgtccg caggtattta
481 atcctggtgg tccttgtttt caccatcgca accaagctgc ttgcatccgc gatttaggat
541 ttattctgcg ttatgttacg tactccgtac ttgctggtga cactagtgtc atggacgatc
601 gctgcttgaa tggtttgcgc gaaacctatc aagctcttgg tactcccggt gatgctgtag
661 cttctggaat taaaaaaatg aaagaagctg cacttaaaat tgctaacgat cccaatggca
721 ttaccaaagg tgattgcagt caattaatgt ctgaattagc aagttatttt gaccgagcag
781 cggctgctgt tgcttaatgc agtcaaaggc gcgactaagt gacaaaattt tcaaactctt
841 ttttagtaat taatttgaca agtttttgga gaagaagcga atcatgaaga ccccgttaac
901 cgaagcgatc gccgccgccg atcttcgcgg ttcttatttg agcaacacag aattgcaagc
961 agtatttggt cgtttcaatc gcgctcgtgc tggtttagaa gcagcaagag cattcgctaa
1021 caacggtaag aaatgggcgg aagcagcagc caatcacgta tatcaaaaat ttccctacac
1081 tacccaaatg caaggacctc agtatgcttc tactccagaa gggaaagcaa agtgcgtgcg
1141 ggatatcgac cattacttac gcacaattag ctactgctgt gtagttggcg gtactggtcc
1201 cttggatgac tatgtagttg caggcttaaa ggaatttaac agcgccctgg gtttatctcc
1261 cagctggtac atagctgctc tagaatttgt ccgcgacaat cacggtttga ccggtgatgt
1321 tgctggtgaa gccaacacct acatcaacta tgcaattaat gcattgagct aaatctaaac
1381 agcgaatgtc ctttgtattt gcttaatagc taatggctga tagctgatgg ctaatggcta
1441 atggaaaaat gcaattagca attagcagtg agttattggc cattaactgt gtttgagaaa
1501 aatgataaaa atggagcgat tgttaacatg agtacatcag tcgcagaacg cctcgcaatt
1561 aaagatgaag tagacaaaaa aattgaattg cgccctaatt ggagcgaaga cgagctgcaa
1621 attgttttta aaacagccta cgagcaagtt tttggacgtc agggattata tgccagtcaa
1681 agatttgcaa ctgccgaagc t
//
LOCUS YSCTSDNAA 238 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-ii.
ACCESSION M34310
KEYWORDS .
SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-ii.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 238)
AUTHORS Wang,S.-S. and Zakian,V.A.
TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA
polymerase reveals two domains
JOURNAL Mol. Cell. Biol. 10, 4415-4419 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-S.Wang, 16-MAY-1990.
BASE COUNT 0 a 0 c 146 g 92 t
ORIGIN
1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg
61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg
121 gtgtggtgtg tgggtgtggt gtgtgggtgt ggtgtgggtg tgggtgtggt gtgtgtgggt
181 gtggtgtgtg ggtgtgggtg tggtgtggtg tgtgtgggtg tgtgggtgtg gtgtgtgt
//
LOCUS YSCTSDNAB 297 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-i.
ACCESSION M34311
KEYWORDS .
SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-i.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 297)
AUTHORS Wang,S.-S. and Zakian,V.A.
TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA
polymerase reveals two domains
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-S.Wang, 16-MAY-1990.
BASE COUNT 0 a 0 c 184 g 113 t
ORIGIN
1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg
61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg
121 gtgtggtgtg ggtgtgggtg tgggtgtggt gtgtgtgtgg gtgtggtgtg tgggtgtggg
181 tgtgtgtggg tgtgggtgtg ggtgtggtgt ggtgtgtggg tgtggtgtgt gggtgtggtg
241 tgtgtgggtg tggtgtgtgt gggtgtgggt gtgtgtgggt gtgggtgtgg tgtgtgg
//
LOCUS YSCTSDNAC 325 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-ii.
ACCESSION M34312
KEYWORDS .
SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-4-ii.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 325)
AUTHORS Wang,S.-S. and Zakian,V.A.
TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA
polymerase reveals two domains
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-S.Wang, 16-MAY-1990.
BASE COUNT 0 a 0 c 199 g 126 t
ORIGIN
1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg
61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg
121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg gtgtgtgggt gtggtgtggg tgtggtgtgt
181 gggtgtgtgg gtgtgggtgt gtgtggtgtg tggtgtgtgt ggtgtgggtg tggtgtgtgg
241 gtgtgggtgt ggtgtgtggg tgtgggtgtg gtgtgtgtgt gtgggtgtgg tgtgtgggtg
301 tgggtgtggt gtgtgggtgt gtggg
//
LOCUS YSCTSDNAD 248 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-i.
ACCESSION M34313
KEYWORDS .
SOURCE S.cerevisiae (strain XS595-6C) extrachromosomal plasmid DNA, clone
YLP108CA-4-i.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 248)
AUTHORS Wang,S.-S. and Zakian,V.A.
TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA
polymerase reveals two domains
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.-S.Wang, 16-MAY-1990.
BASE COUNT 0 a 0 c 151 g 97 t
ORIGIN
1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg
61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg
121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg tgtgtgtggg tgtggtgtgg gtgtggtgtg
181 tgggtgtgtg tgggtgtggt gtgtgtgggt gtgtgggtgt ggtgtgtgtg ggtgtggtgt
241 gtgggtgg
//
LOCUS PLYORIA 375 bp ds-DNA VRL 14-AUG-1990
DEFINITION Human polyomavirus BK (strain WWT) origin of replication.
ACCESSION M34048
KEYWORDS origin of replication.
SOURCE Polyomavirus BK (strain WWT) DNA, from human urine.
ORGANISM Polyomavirus BK
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Polyomaviruses.
REFERENCE 1 (bases 1 to 375)
AUTHORS Sundsfjord,A., Johansen,T., Flaegstad,T., Moens,U., Villand,P.,
Subramani,S. and Traavik,T.
TITLE At least two type of control regions can be found among naturally
occurring BK virus strains
JOURNAL J. Virol. 64, 3864-3871 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Johansen, 07-MAY-1990.
University of Tromso
P.O. Box 977
N-9001, Tromso, NORWAY
e-mail: terje.johansen@esuit.uit.no
FEATURES from to/span description
orgrpl 97 119 polyomavirus origin of replication
site 1 375 regulatory region
site 143 210 P block
site 211 249 Q block
site 250 302 R block
BASE COUNT 123 a 86 c 95 g 71 t
ORIGIN
1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt
301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta
361 aactggacaa aggcc
//
LOCUS PLYORIB 424 bp ds-DNA VRL 14-AUG-1990
DEFINITION Human polyomavirus BK (strain UT) origin of replication.
ACCESSION M34049
KEYWORDS origin of replication.
SOURCE Human polyomavirus BK (strain UT) DNA, from human urine.
ORGANISM Polyomavirus BK
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Polyomaviruses.
REFERENCE 1 (bases 1 to 424)
AUTHORS Sundsfjord,A., Johansen,T., Flaegstad,T., Moens,U., Villand,P.,
Subramani,S. and Traavik,T.
TITLE At least two type of control regions can be found among naturally
occurring BK virus strains
JOURNAL J. Virol. 64, 3864-3871 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.Johansen, 07-MAY-1990.
University of Tromso
P.O. Box 977
N-9001, Tromso, NORWAY
e-mail: terje.johansen@esuit.uit.no
FEATURES from to/span description
orgrpl 97 119 polyomavirus origin of replication
site 1 424 regulatory region
site 143 210 P block
site 211 249 Q block
site 250 261 R block
site 263 315 P block
site 316 349 Q block
site 350 361 R block
BASE COUNT 134 a 103 c 105 g 82 t
ORIGIN
1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa
301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc
361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa
421 ggcc
//
LOCUS MXAFRZE 2493 bp ds-DNA BCT 14-AUG-1990
DEFINITION M.xanthus gliding motility regulatory protein (frzE) gene, complete
cds.
ACCESSION M35192
KEYWORDS gliding motility regulatory protein.
SOURCE M.xanthus (strain DZF1) DNA.
ORGANISM Myxococcus xanthus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria;
Myxococcaceae.
REFERENCE 1 (bases 1 to 2493)
AUTHORS McCleary,W.R. and Zusman,D.R.
TITLE FrzE of Myxococcus xanthus is homologous to both CheA and CheY of
Salmonella typhimurium
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5898-5902 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.R.McCleary, 05-JUN-1990.
FEATURES from to/span description
pept 64 2397 gliding motility regulatory protein (frzE)
BASE COUNT 375 a 848 c 926 g 344 t
ORIGIN
1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg
61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc
121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag
181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt
241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag
301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc
361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc
421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg
481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc
541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc
601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc
661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc
721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc
781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg
841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg
901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc
961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc
1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag
1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg
1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg
1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc
1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc
1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg
1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc
1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag
1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc
1561 accatcaccc tgcgcctgcc gcagtcgctg gcgttgatga aggtgctgct ggtgcgcctg
1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg
1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg
1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc
1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac
1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc
1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc
1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc
2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag
2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag
2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg
2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg
2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc
2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga
2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg
2461 caaggggctg cgtgcgctcg cggcccgggg cct
//
LOCUS RATPCBB 463 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Rat PCB-binding protein mRNA, complete cds..
ACCESSION J05536
KEYWORDS PCB binding protein; binding protein.
SOURCE Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone
pPCB6.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 463)
AUTHORS Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J.,
Gillner,M., Gustafsson,J.-A. and Lund,J.
TITLE Cloning, structure, and expression of a rat binding protein for
polychlorinated biphenyls: Homology to the hormonally regulated
progesterone-binding protein uteroglobin
JOURNAL J. Biol. Chem. 265, 12690-12693 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
J.Lund, 15-JUN-1990.
FEATURES from to/span description
pept 58 348 PCB binding protein precursor
sigp 58 114 PCB binding protein signal peptide
matp 115 345 PCB binding protein
BASE COUNT 117 a 156 c 84 g 106 t
ORIGIN
1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg
61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg
121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat
181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag
241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg
301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat
361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta
421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc
//
LOCUS YSBCPR 2990 bp ds-DNA PLN 14-AUG-1990
DEFINITION C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds.
ACCESSION M35199
KEYWORDS NADPH-cytochrome P450 reductase.
SOURCE C.tropicalis (ATCC 750) DNA, clone pTS1.
ORGANISM Candida tropicalis
Eukaryota; Plantae; Thallobionta; Basidiomycotina;
Deuteromycotina.
REFERENCE 1 (bases 1 to 2990)
AUTHORS Sutter,T.R., Sanglard,D. and Loper,J.C.
TITLE Isolation and characterization of the alkane-inducible NADPH-
cytochrome P450 oxidoreductase gene from Candida tropicalis:
Identification of invariant residues within similar amino acid
sequences of divergent flavoproteins
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
T.R.Sutter, 15-JUN-1990.
Author name and address
T.R.Sutter
CIIT, P.O. Box 12137
Research Triangle Park, NC 27709
FEATURES from to/span description
pept 554 2596 NADPH-cytochrome P450 reductase
signal 198 204 TATA box
BASE COUNT 988 a 442 c 558 g 1002 t
ORIGIN
1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa
61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg
121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag
181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag
241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag
301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa
361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata
421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta
481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc
541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt
601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt
661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa
721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa
781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc
841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt
901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac
961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac
1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg
1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt
1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa
1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga
1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt
1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga
1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa
1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat
1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt gatactgtta ttgaattgaa
1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat
1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg
1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga
1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc
1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt
1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt
1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac
1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca
2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag
2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg
2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg
2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt
2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa
2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa
2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc
2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga
2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca
2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta
2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga
2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat
2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc
2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta
2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt
2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta
//
LOCUS HLTLYS 642 bp ss-mRNA INV 14-AUG-1990
DEFINITION H.rufescens sperm lysin mRNA, complete cds.
ACCESSION M34388
KEYWORDS sperm lysin.
SOURCE H.rufescens testis, cDNA to mRNA, clones r13 and r14.
ORGANISM Haliotis rufescens
Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda;
Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae.
REFERENCE 1 (bases 1 to 642)
AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D.
TITLE Species-specific sequences of abalone sperm lysin, the sperm
protein that creates a hole in the egg envelope
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5792-5796 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by V.D.Vacquier, 22-MAY-1990.
FEATURES from to/span description
pept 16 480 sperm lysin precursor
sigp 16 69 sperm lysin signal peptide
matp 70 477 sperm lysin
mRNA < 1 642 sperm lysin mRNA
signal 625 630 poly-A signal
BASE COUNT 202 a 124 c 151 g 165 t
ORIGIN
1 cgaacagatt acaagatgaa gctgttggtg ctttgcattt ttgcgatgat ggcgacattg
61 gcaatgtcac gcagctggca ttatgttgaa cccaagtttt taaacaaggc ttttgaagtt
121 gcactgaagg tacagatcat tgctggattc gatagagggc tggtcaaatg gcttagagtt
181 catggcagaa cattgtctac cgttcagaag aaggcgctgt acttcgtcaa tagacgttac
241 atgcagaccc actgggcgaa ttacatgctc tggataaata agaagatcga tgcactgggc
301 agaactccag ttgttggtga ctacacacgg ttaggtgccg aaattggaag acgtattgat
361 atggcttact tttacgactt cctgaaagat aagaacatga tcccaaaata tctgccatac
421 atggaagaaa ttaatagaat gcgcccggca gatgtcccag ttaaatacat gggcaagtga
481 aacccatgaa gaagacagct tcatgcaccg tgcatgtcaa cagatacaga aaacatcgtc
541 aaacacgggg agctatgaca ccatcctaac atgggttggc tatttgttgc acgattttta
601 aaacggaaag attgtacatt ccagaataaa tagatttact ac
//
LOCUS HLTLYS2 650 bp ss-mRNA INV 14-AUG-1990
DEFINITION H.corrugata sperm lysin mRNA, complete cds.
ACCESSION M34389
KEYWORDS sperm lysin.
SOURCE H.corrugata testis, cDNA to mRNA, clones p1 and p3.
ORGANISM Haliotis corrugata
Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda;
Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae.
REFERENCE 1 (bases 1 to 650)
AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D.
TITLE Species-specific sequences of abalone sperm lysin, the sperm
protein that creates a hole in the egg envelope
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5792-5796 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by V.D.Vacquier, 22-MAY-1990.
FEATURES from to/span description
pept 16 483 sperm lysin precursor
sigp 16 69 sperm lysin signal peptide
matp 70 480 sperm lysin
mRNA 1 650 sperm lysin mRNA
signal 635 640 poly-A signal
BASE COUNT 194 a 134 c 162 g 160 t
ORIGIN
1 ggaacagatt acaagatgaa gctgttggtg ctttgccttt ttgcgatgat ggcgacattg
61 gctgtgtcgc gccaccgctt ccgttttatt ccacataagt atatacgcaa ggagtttgaa
121 gttgcactga aggtagagat cattgccgga ttcgatagaa cgctggtcaa atggcttaga
181 gttcatggcg gacgtttgag tacagttcag aagaaggcgc tgtacttcgt caatagacgt
241 tacatgcaga cccactggca gaattacatg ctctggatag tgaggaagac cgatgcactg
301 ggcagacctc cagttgttgc tgactacagt cggttaggtg ccgaaattgg aagacgtatt
361 gatatggcat atttttacaa cttcctgaac gggaggaaca tgatcccaaa atatctgcca
421 tacatggagg aaattaatag aatgcgcccg gcagatgtcc cagttgcaaa cagaggcaag
481 tgaaaccaac gaagaagaaa gctgcatgca ccctgcatgt caacagatac agaaaacatc
541 gtccgtcaaa cacgaggagc tgtgacagca tccatcccaa catgggttgg ctatttgttg
601 cacgatttta acacagaaag attgtacatt ccagaataaa tagatttact
//
LOCUS HUMROSSA 1843 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human 52 kD ribonucleoprotein Ro/SSA, complete cds.
ACCESSION M34551
KEYWORDS ribonucleoprotein.
SOURCE Human thymocyte, cDNA to mRNA, clone FI18.1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1843)
AUTHORS Itoh,K., Itoh,Y. and Frank,M.B.
TITLE Heterogeneity in human Ro/SSA ribonucleoproteins: The 52 and 60 kd
Ro/SSA proteins are encoded by separate genes
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable [or printed] sequence for [1]
kindly submitted by M.B.Frank, 18-MAY-1990.
Author address: M.B.Frank
Oklahoma Medical Res Foundation
Arthritis and Immunology program
825 WE 13th St, Oklahoma City, OK 73104
FEATURES from to/span description
pept 39 1466 52 kD Ro/SSA ribonucleoprotein
signal 1827 1832 poly-A signal
BASE COUNT 474 a 458 c 488 g 423 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcgggc acactgctgt ttaacggcac acttgacaat ggcttcagca gcacgcttga
61 caatgatgtg ggaggaggtc acatgcccta tctgcctgga ccccttcgtg gagcctgtga
121 gcatcgagtg tggccacagc ttctgccagg aatgcatctc tcaggttggg aaaggtgggg
181 gcagcgtctg tcctgtgtgc cggcagcgct ttctgctcaa gaatctccgg cccaatcgac
241 agctagccaa catggtgaac aaccttaaag aaatcagcca ggaggccaga gagggcacac
301 agggggaacg gtgtgcagtg catggagaga gacttcacct gttctgtgag aaagatggga
361 aggccctttg ctgggtatgt gcccagtctc ggaaacaccg tgaccacgcc atggtccctc
421 ttgaggaggc tgcacaggag taccaggaga agctccaggt ggcattaggg gaactgagaa
481 gaaagcagga gttggctgag aagttggaag tggaaattgc aataaagaga gcagactgga
541 agaaaacagt ggaaacacag aaatctagga ttcacgcaga gtttgtgcag caaaaaaact
601 tcctggttga agaagaacag aggcagctgc aggagctgga gaaggatgag agggagcagc
661 tgagaatcct gggggagaaa gaggccaagc tggcccagca gagccaggcc ctacaggagc
721 tcatctcaga gctagatcga aggtgccaca gctcagcact ggaactgctg caggaggtga
781 taattgtcct ggaaaggagt gagtcctgga acctgaagga cctggatatt acctctccag
841 aactcaggag tgtgtgccat gtgccagggc tgaagaagat gctgaggaca tgtgcagtcc
901 acatcactct ggatccagac acagccaatc cgtggctgat actttcagaa gatcggagac
961 aagtgaggct tggagacacc cagcagagca tacctggaaa tgaagagaga tttgatagtt
1021 atcctatggt cctgggtgcc cagcactttc actctggaaa acattactgg gaggtagatg
1081 tgacaggaaa ggaggcctgg gacctgggtg tctgcagaga ctctgtgcgc aggaaggggc
1141 actttttgct tagttccaag agtggcttct ggacaatttg gttgtggaac aaacaaaaat
1201 atgaggctgg cacctacccc cagactcccc tccaccttca ggtgcctcca tgccaagttg
1261 ggattttcct ggactatgag gctggcatgg tctccttcta caacatcact gaccatggct
1321 ccctcatcta ctccttctct gaatgtgcct ttacaggacc tctgcggccc ttcttcagtc
1381 ctggtttcaa tgatggagga aaaaacacag cccctctaac cctctgtcca ctgaatattg
1441 gatcacaagg atccactgac tattgatggc tttctctgga cactgccact ctccccattg
1501 gcaccgcttc tcagccacaa accctgcctc ttttccccat gaactctgaa ccacctttgt
1561 ctctgcagag gcatccggat cccagcaagc gagctttagc agggaagtca cttcaccatc
1621 aacattcctg ccccagatgg ctttgtgatt ccctccagtg aagcagcctc cttatatttg
1681 gcccaaactc atcttgatca accaaaaaca tgtttctgcc ttctttatgg gacttaagtt
1741 ttttttttct cctctccatc tctaggatgt cgtctttggt gagatctcta ttatatcttg
1801 tatggtttgc aaaagggctt cctaaaaata aaaacccgaa ttc
//
LOCUS TRBBS12 4004 bp ds-DNA INV 14-AUG-1990
DEFINITION T.brucei expression site-associated protein (BS1.1 and BS1.2)
genes, 5' end and complete cds.
ACCESSION M34520
KEYWORDS expression site-associated protein.
SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA,
clone BS1.1-2.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 4004)
AUTHORS Hobbs,M.R. and Boothroyd,J.C.
TITLE An expression-site-associated gene family of trypanosomes is
expressed in vivo and shows homology to a VSG gene
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Hobbs, 22-MAY-1990.
FEATURES from to/span description
pept < 1 129 expression site-associated protein (BS1.1)
pept 1318 2151 expression site-associated protein (BS1.2)
site 3083 3604 RIME sequence
BASE COUNT 1181 a 650 c 1052 g 1121 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattcaact atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg
61 gcagcagtac atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga
121 gtgctgtgaa ggaaggatgc gacagaacgt gcgctgctta gtgtgaaaga ttatggtaat
181 ggagggttgt gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg
241 ggcaaaagta acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa
301 ggaaagatgt gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg
361 agatgtaagg ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa
421 tttagttaca acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt
481 ctactgttta atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc
541 gaatttcact tctccagaat ctttatttta ggtgcgctat agatgatagt ccaaagagaa
601 ggaattacaa ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga
661 ttcttttcta tgtcacttgt gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg
721 aaatggggcg tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt
781 atgatgcata gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta
841 ggtggaagcg gctaatgatt gctagcagca ccaacagagg aagttaattg ttgacgtagc
901 cgtttgtgaa tgagatagaa ggttattggg tggaacatta tcttggaaag agtttttctt
961 ccactaaaaa tgtgtagagt ggtggtttcg agaaaatggt ttagatattg aggaagaaac
1021 aaggatttga ggtgcaagtt gatgcgctgt atgatttgtc ttgccgttgc tgttacttat
1081 tgttgagttg gtatttgagg aaaatgtttt ttcagtgtgc gttgttatat acggctgtga
1141 gttggtgatg tttatttgaa tgatagcttg ataggataag atgaaataat actaagcatg
1201 tattcttctg gctgttcgcc gaaatggtaa tctatgttac ccatagcacg aaaaaagttt
1261 ttttactcag taactgaagc ggaaagtttt aaaaagcata atttaaaaaa gttgataatg
1321 atgaagtttt ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa
1381 aataaaagga atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgaactat
1441 cttaaaggaa tagcgcacag agtaaacagc gagagtgccg tggttacgaa aaaactatca
1501 gatttgaaaa tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct
1561 ggcgagcagg attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt
1621 acattccagc aggagttgga ggaaatgagg aagccatccg cgttagcagc agctgcagct
1681 gggatagcag ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg
1741 tcatcacagt tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa
1801 gaatgttttg atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta
1861 aaagagtcgg cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat
1921 agcttgggtg cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga
1981 acagatactg acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt
2041 ggtggagtga tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa
2101 tacgtgatgc tacagagtac ggtgcagtac ggtggaccga ggatccagta aagtgagcat
2161 atttaaggat gtcatccgcc tgtttgcgcg cttccaagaa gcaaaaaatg aagtgatgaa
2221 taaaataaaa actactgtgg atgaattggc gaaatgcata gggcagaaag aggttgagct
2281 cactgatgat cagctttacg aggaatttat ctgggagacc atacacagat tggagctgtc
2341 aaagagagtg agtgaacaac tctctttagg tgaagaagag gagacgatcc tgaaatctaa
2401 ctatactgca gaaccggtcc gtggactttc acggtagcgg ggtccaacgc ggcagcagta
2461 catttgagtg tttctaccgc tgcactttgt ttttcagttt tattgttggg agtgctgtga
2521 aggaaggatg cgacagaagc tgcgctgctt agtgtgaaag attatggtaa tggagggttg
2581 tgaaagattg ggggaacaaa aacctatttc ttttatttgg gggaacaaat gggcaaaagt
2641 aacgtaagtt tccagtggga gtggtatgtg tgtgtgtatg gggctggcta aggaaagatg
2701 tgagttcggc atgtggtatg tacaagctac gaaaacgtgt gaaacaaaac gagatgtaag
2761 gggaaaatgt aacaaccaac tatgttaaat ttcaggagac tatttttcaa atttagttac
2821 aacaaagtaa atgtcaaata atgccaacta taaaggaaaa ctagactaat tctactgttt
2881 aatgtgacga gaattgtggc gtgaaaccta gtaactgctt atcatggcgt cgaatttcac
2941 ttctccagaa tctttatttt agtgcgctat agatgatagt ccaaagagaa ggaattacaa
3001 ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga ttcttttcta
3061 tgtctcttgt gaaggtcgat ctttttgcta ccccctggcg atgccggcca cctcaacgtg
3121 gtgccagggt ccagtgaccc cgtatcattg ggggaagcca agagccagca gcgttccttt
3181 catggggaac actgctgtag ctccggctac gcatcataca gcacagggat cagcaggcgt
3241 cttgctggga caccgttttt catttgtcgg tccctgggca cgtgccagcg tcgatcagca
3301 gtatcatccg cgctaagatg ctgctgtccg gtgatgtgga cctccaaaaa aggattgcca
3361 ttttggcatc ttttggagag tccagggtgg gaggcttctc gccccatctg ctgtattccg
3421 ttcaactgcg gacgtacaac aaaaaattat agagggtgtg ttaggatgaa tgaaaaaggg
3481 agactctgcc acagtcgcca gaccgatagc actctcaggg ctctacggtg ttggctgatg
3541 gccgcgcaga ggggggaaac tctcacgaag gcacgaagaa aattcaaaaa aaaatttttg
3601 ctaccacagg ctgtaaggat aacgctggag atggagccga ctgtggtttt tcttgtgaat
3661 gcgtatcttc gtaaccggtt gttaagaagg gatcggacga aggaatttgt ctgagtatga
3721 ggtaatgtcg aggttttctt tcattggatg tagtaataat ctccaaattt gttggaacat
3781 ataatattat tgcaataaat gcttgaagag tggaattatt gatggattga cttgagtatc
3841 atgattccaa tgaacactct tataatcctt cgaatagcaa acgtatgaac aaattaacgt
3901 agcgtcatgt agacatggac atttttgtaa tgatacgact gctattgcac atctgaaaat
3961 ttctatgacg gaccagtgcc gttaatcacc gtacaattga attc
//
LOCUS TRBBS134 2396 bp ds-DNA INV 14-AUG-1990
DEFINITION T.brucei expression site-associated protein (BS1.3 and BS1.4)
genes, 3' and 5' ends.
ACCESSION M34521
KEYWORDS expression site-associated protein.
SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA,
clone BS1.3-4.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 2396)
AUTHORS Hobbs,M.R. and Boothroyd,J.C.
TITLE An expression-site-associated gene family of trypanosomes is
expressed in vivo and shows homology to a VSG gene
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Hobbs, 22-MAY-1990.
FEATURES from to/span description
pept 1836 > 2396 expression site-associated protein (BS1.4)
ORF < 1 1206 expression site-associated protein (BS1.3)
BASE COUNT 744 a 357 c 642 g 653 t
ORIGIN 1 bp upstream of HindIII site.
1 aagcttgcaa ctccatgaag tgttatcatc catttccttc ggtagcttgg gtgtgaaaaa
61 tattcgagga ggaaatggga gggatggatg caatttggtg agaacagaca ctaacggtat
121 attaaatggg ggttcaccaa cacggcacaa cttaacgtgg ggtggtggag tgatgaattt
181 tggatcctat caaaacggaa gtatgtacgt tgagggtggt gaatacggtg atgctacaga
241 gtacggtgcg gtacggtgga ctgaagatcc cagtaaagtg agcatatttg aggatgtcat
301 tcgcctgttt gcgcgcttcc aagaagcaaa aaatgcagtg atgaggagaa taaaaactac
361 tgtggatgaa ttgacgaaat gtatagggca gaaggaggct gaactcacta atgatcagat
421 ttacgaggaa tttatctggg agaccataca cagattggag ctgtcaaaga gagtgagtga
481 acaaccctct ttaggtgaag aagaggagac aatcctgaaa tctaactata ctgcagaacc
541 tgtccgtgga cctttcacgg gagcggggtc caacacagta gcagtacatt tgagtgtttc
601 taccgctgca ctctgttgtt tagctttatt gttgggagtg ctgtgaggga aggatgcgac
661 agaaactgcg ctgcttagcg tgaaagatta tggtaatgga gggttgggaa agattggggg
721 aacaaaaacc tatttctttt atttggggga acaaatgggc aaaagtaacg tgagtttcca
781 gcgggagtgg tgtgtgtgtg catggggctg actaaggaaa gatgtaagtt cggaatgtgg
841 tatgtacaag ctacgaaaac gtgtgaaaca aaacgatatg taaggggaaa atgtaacaac
901 caactatgtt aaatttcaag agactatttt tcaaatttag ttacaacaaa gtaaatgtca
961 aataatgcca actataaagg aaaactagac taattctact gtttaatgtg acgagaattg
1021 tggcgtgaaa cctagtaacg accacgtcac gatgtggaat ttcacttctc cagaatcttt
1081 gttttagtgc gctatagatg atagtccaaa gagaaggaat tgcagttatt ataacccatc
1141 caaagataga tcaaagactc ctcttctccg agtgattctt ttctatgtca cttgtggaag
1201 gtcgatcttg gaacaatatc gaacgtgtga agcctgaaat gaggcgttat tattgcttct
1261 ctttttggtg gttgttagag ttatataccg ctatggtatg atgcacagag gtaacgctaa
1321 aaataagtga caaggcgttg tagtagggcg tatggtagtt ggaagcggct aatgattgct
1381 ggcctcacca acagaggaag ttaattgttg acgtagccgt ttgtgaatga gatagaaggt
1441 tatcgggtgg aacattatct tggaaagagt ttttcatcac ctaaaaatgt gtagagtggt
1501 ggtttggaga aaatggttta gatattgagg aggaaacaag gatttgaggt gcaagttgat
1561 gcgctgtaag atttgtatta ccgttggtgt tacttattgt tgagtgagta tttgagggaa
1621 atgttatggg aggtgcttcg ttgttacata cggctctgag tcagtggtgt tcatttgaat
1681 gaaatcttga taggataaga tgaaataata ctaagcatgt attcttctgt gctgccgccg
1741 aaatggtaat ctatcttacc tacagcacga aaaatttatt ttttttactc agtaacaaag
1801 cggaaagttt taaaagcata atttaaaaaa gtgatatgat gaggttttgg tttgtgttgt
1861 tggccctttt gggaaaaaaa acacatgcat attatgaaaa tgaaaggaat gcattaaacg
1921 caaccgccgc taataaagtt tgtgggctat cgaactatct taaaggagta gcgcacagag
1981 taaacagcga gagtgctgtg gttacggaaa aactatcaga tttgaaaatg agaagcatcc
2041 agttgcagct atcagtaatg cggaacagag tgccttctgg cgagaaggat tgtaaagaca
2101 taaggacact cttgaaaaca gtattgagga atgagtttac attccagcag gagttggagg
2161 aaatgaggaa cgcatccgcg ttagcagcag ctgcagctgg gttggcagct ggaagactgg
2221 aagagtggat ttttgtattt gcacaggcag cagacaggtc atcacagttt tgcataagtg
2281 tggggaagca tatcgcggct gagcacggca acttgcaaga atgttttgat ggaacaattg
2341 gacctgaaac cctttacaaa attgaggatt cacgtgtaaa agagtcggcg cagaaa
//
LOCUS TRBBS15 714 bp ss-mRNA INV 14-AUG-1990
DEFINITION T.brucei expression site-associated protein (BS1.5) gene, partial
cds.
ACCESSION M34522
KEYWORDS expression site-associated protein.
SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.4, cDNA to
mRNA, clone BS1.5.
ORGANISM Trypanosoma brucei
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 714)
AUTHORS Hobbs,M.R. and Boothroyd,J.C.
TITLE An expression-site-associated gene family of trypanosomes is
expressed in vivo and shows homology to a VSG gene
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Hobbs, 22-MAY-1990.
FEATURES from to/span description
pept < 1 > 714 expression site-associated protein (BS1.5) (AA
at 1)
BASE COUNT 226 a 116 c 203 g 169 t
ORIGIN
1 agcgagagtg ccgtggttac ggaaaaacta tcagatttga aaatgagaag catccagttg
61 cagttatcag taatgcggaa cagagtgcct tctggcgaga aggattgtaa agacatcagg
121 acactcttga aaacagtatt gaggaatgag tttacattcc agcaggagtt ggaggaaatg
181 aggaacgcat ccgcgttagc agcagctgca gctgggttgg cagctggaag actggaagag
241 tggatttttg tatttgcaca ggcagcagac aggtcatcac agttttgcat aagtgtgggg
301 aaaactatcc caccggaaca aaacaacttg caagaatgtt ttgatggaac aattggacct
361 gaaacccttt acaaaattga ggattcacgt gtaaaagagt ctgcaaagaa aagcttgcaa
421 ctccatgaag cgttatcatc catttccttc aatagtttgg gtgcggaaag tatccgagga
481 ggaaatggga aggatggatg taatttggtg agaacagaca ctgacggtat attaaatggg
541 ggttcaccaa cacggcacaa tttaacgtgg ggtggtggag tgatgaattt tggatcctat
601 caaaacggaa gtatgtacgt tgagggggga gaatacggtg atgctacaga gtacggtgca
661 gtacggtgga ctgaagatcc cagtaaagta agcatattta aggatgtcat tcgc
//
LOCUS ECOPOLBDA 4666 bp ds-DNA BCT 14-AUG-1990
DEFINITION E.coli DNA polymerase (polB) gene, 5' flank.
ACCESSION M35371
KEYWORDS DNA polymerase; polB gene.
SOURCE E.coli (strain W3110) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 4666)
AUTHORS Chen,H., Sun,Y., Stark,T., Beattie,W. and Moses,R.
TITLE Nucleotide sequence and deletion analysis of the polB gene of
E.coli
JOURNAL Unpublished (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Chen, 20-JUN-1990.
Author address: H.Chen
Baylor College of Medicine
Cell Biology and IMG
One Baylor Plaza
Houston, TX 77030
email: hchen@vulture.bcm.tmc.edu
FEATURES from to/span description
signal 1209 1214 -35 region
signal 1229 1234 -10 region
binding 1250 1255 ribosome binding site
site 1193 1208 lexA box
BASE COUNT 1100 a 1278 c 1257 g 1031 t
ORIGIN
1 aagcttgcat gcctgcaggt cgactctaga ggatcctcgc tggtggcgcg caccataccg
61 tcttcagcca tgcactgaac ctcaacgata tgcgccaatt cgccgagatg cacgacattg
121 aaatcacggt gattgataac gacacacgcc tgccagcgtt taaagacgcg ctgcgctgga
181 acgaagtgta ttacgggttt cgtcgctaag tagccgcatc cggtatgtaa cgcctgatgc
241 gacgctgacg cgtcttatct ggcctacacg ctgcgatttt gtaggccgga taagcaaagc
301 gcatccggca ttcaacgcct gatgcgacgc tggcgcgtct tatcaggcct acgcgctgcg
361 attttgtagg ccggataagc aaagcgcatc cggcattcaa cgcctgatgc gacgctggcg
421 cgtcttatca ggcctacacg ctgcgatttt gtaggccgga taagcaaagc gcatccggca
481 cgaaggagtc aacatgttag aagatctcaa acgccaggta ttagaagcca acctggcgct
541 gccaaaacac aacctggtca cgctcacatg gggcaacgtc agcgccgttg atcgcgagcg
601 cggcgtcttt gtgatcaaac cttccggcgt cgattacagc gtcatgaccg ctgacgatat
661 ggtcgtggtt agcatcgaaa ccggtgaagt ggttgaaggt acgaaaaagc cctcctccga
721 cacgccaact caccggctgc tctatcaggc attcccctcc attggcggca ttgtgcatac
781 gcactcgcgc cacgccacca tctgggcgca ggcgggtcag tcgattccag caaccggcac
841 cacccacgcc gactatttct acggcaccat tccctgcacc cgcaaaatga ccgacgcaga
901 aatcaacggc gaatatgagt gggaaaccgg taacgtcatc gtagaaacct ttgaaaaaca
961 gggtatcgat gcagcgcaaa tgcccggcgt tctggtccat tcccacggcc cgtttgcatg
1021 gggcaaaaat gccgaagatg cggtgcataa cgccatcgtg ctggaagagg tcgcttatat
1081 ggggatattc tgccgtcagt tagcgccgca gttaccggat atgcagcaaa cgctgctgga
1141 taaacactat ctgcgtaagc atggcgcgaa ggcatattac gggcagtaat gactgtataa
1201 aaccacagcc aatcaaacga aaccaggcta tactcaagcc tggttttttg atggattttc
1261 agcgtggcgc aggcaggttt tatcttaacc cgacactggc gggacacccc gcaagggaca
1321 gaagtctcct tctggctggc gacggacaac gggccgttgc aggttacgct tgcaccgcaa
1381 gagtccgtgg cgtttattcc cgccgatcag gttccccgcg ctcagcatat tttgcagggt
1441 gaacaaggct ttcgcctgac accgctggcg ttaaaggatt ttcaccgcca gccggtgtat
1501 ggcctttact gtcgcgccca tcgccaattg atgaattacg aaaagcgcct gcgtgaaggt
1561 ggcgttaccg tctacgaggc cgatgtgcgt ccgccagaac gctatctgat ggagcggttt
1621 atcacctcac cggtgtgggt cgagggtgat atgcacaatg gcactatcgt taatgcccgt
1681 ctgaaaccgc atcccgacta tcgtccgccg ctcaagtggg tttctataga tattgaaacc
1741 acccgccacg gtgagctgta ctgcatcggc ctggaagcgt gcgggcagcg catcgtttat
1801 atgctggggc cggagaatgg cgacgcctcc tcgcttgatt tcgaactgga atacgtcgcc
1861 agccgcccgc agttgctgga aaaactcaac gcctggtttg ccaactacga tcctgatgtg
1921 atcatcggtt ggaacgtggt gcagttcgat ctgcgaatgc tgcaaaaaca tgccgagcgt
1981 taccgtcttc cgctgcgtct tgggcgcgat aatagcgagc tggagtggcg cgagcacggc
2041 tttaaaaacg gcgtcttttt tgcccaggct aaaggtcggc taattatcga cggtatcgag
2101 gcgctgaaat ccgcgttctg gaatttctct tcattctcgc tggaaactgt cgctcaggag
2161 ctattaggcg aaggaaaatc tatcgataac ccgtgggatc gaatggacga aattgaccgc
2221 cgtttcgccg aagataaacc tgcgctggca acttataacc tgaaagattg cgagctggtg
2281 acgcagatct tccacaaaac tgaaatcatg ccatttttac tcgaacgggc aacggtgaac
2341 ggcctgccgg tggaccgaca cggcggttcg gtggcggcat ttggtcatct ctattttccg
2401 cgaatgcatc gcgctggtta tgtcgcgcct aatctcggcg aagtgccgcc gcacgccagc
2461 cctggcggct acgtgatgga ttcacggcca gggctttatg attcagtgct ggtgctggac
2521 tataaaagcc tgtacccgtc gatcatccgc acctttctga ttgatcccgt cgggctggtg
2581 gaaggcatgg cgcagcctga tccagagcac agtaccgaag gttttctcga tgcctggttc
2641 tcgcgagaaa aacattgcct gccggagatt gtgactaaca tctggcacgg gcgcgatgaa
2701 gccaaacgcc agggtaacaa accgctgtcg caggcgctga aaatcatcat gaatgccttt
2761 tatggcgtgc tcggcaccac cgcctgccgc ttcttcgatc cgcggctggc atcgtcgatc
2821 accatgcgtg gtcatcagat catgcggcaa accaaagcgt tgattgaagc acagggctac
2881 gacgttatct acggcgatac cgactcaacg tttgtctggc tgaaaggcgc acattcggaa
2941 gaagaagcgg cgaaaatcgg tcgtgcactg gtgcagcacg ttaacgcctg gtgggcggaa
3001 acgctgcaaa aacaacggct gaccagcgca ttagaactgg agtatgaaac ccatttctgc
3061 cgttttctga tgccaaccat tcgcggagcc gataccggca gtaaaaagcg ttatgccgga
3121 ctgattcagg agggcgacaa gcagcggatg gtgtttaaag ggctggaaac cgtgcgcacc
3181 gactggacgc cgctggccca gcagtttcag caggagctat acctgcgcat cttccgcaac
3241 gagccatatc aggaatatgt acgcgaaacc atcgacaaac tgatggcggg tgaactggat
3301 gcgcgactgg tttaccgtaa acgccttcgc cgtccgctga gcgagtatca gcgtaatgtg
3361 ccgcctcatg tacgcgccgc tcgccttgcc gatgaagaaa accaaaagcg tggtcgcccc
3421 ttgcaatatc agaatcgcgg caccattaag tacgtatgga ccaccacagg cccggagccg
3481 cctggactac caacgttcac cactggatta cgaacactat ctgacccgcc agctacaacc
3541 cgtggcggag ggaatactcc cttttattga ggataatttt gctacactta tgaccgggca
3601 acttgggcta ttttgagcaa aaaaaagagt tcgccagata ccattttgat gcgtgacgaa
3661 tgctttgcca tccagtacca tagcgccctt tccattcctg gacctgaata acaccactac
3721 ctcataagca cggtagcggg tggttattgc ctgcaattaa agatatagag ccgaacacat
3781 atgcctttta cacttggtca acgctggatc agcgatacag aaagcgaatt gggacttgga
3841 accgttgtcg cggtggatgc gcgaactgtc actttacttt tcccatctac tggtgaaaac
3901 cgtctgtacg cacgcagtga ttcccccgtg acccgcgtga tgttcaaccc tggtgatacc
3961 attaccagcc atgacggctg gcagatgcaa gtcgaagaag taaaagaaga aaatggcttg
4021 ctgacctata tcggtactcg cctggatact gaagaggtcc ggcgtagccc tgcgtgaagt
4081 tttccttgat agcaaactgg tgttcagcaa accgcaggca ccgtctgttt gccgggcaga
4141 ttgaccgtat ggaccgcttt gcgctgcgtt atcgcgcgcg taaatattcc agcgaacagt
4201 tccgtatgcc gtacagcggc ctgcgcggtc agcgtaccag cctgatccgc atcagctcaa
4261 catcgctcat gatgttggtc gccgccacgc gccgcgcgtc ctgctggctg acgaagtggg
4321 tttagggaaa accattgaag ccgggatgat cctgcatcag caactgctct ctggcgctgc
4381 tgaacgtgtg ctaattatcg tcccggaaac cttacagcat cagtggctgg tagaaatgct
4441 gcgccgtttc aacctgcgct ttgcgctatt tgatgatgag cgttatgccg aagctcagca
4501 cgatgcttac aacccgtttg acaccgtgaa gcggcgcacg aaaaacgcga aagcgtttca
4561 cgataaatgc gaaaacttta gctttcgcgc ttcaaatgaa acagatgtat taattactgc
4621 tttttattca ttacatgggg atccccgggt accgagctcg aattcc
//
LOCUS HUMBIGFII 1387 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human insulin-like growth factor binding protein 2 (IGFBP2) mRNA,
complete cds.
ACCESSION M35410
KEYWORDS insulin-like growth factor binding protein 2.
SOURCE Human 67-year old retina, cDNA to mRNA, clone AS200.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1387)
AUTHORS Agarwal,N., Hsieh,C.-L., Sills,D., Swaroop,M., Desai,B.,
Francke,U. and Swaroop,A.
TITLE Sequence analysis, expression and chromosomal localization of a
gene, isolated from a subtracted human retina cDNA library, that
encodes an insulin-like growth factor binding protein (IGFBP2)
JOURNAL Exp. Eye Res. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Swaroop, 20-JUN-1990.
FEATURES from to/span description
pept 64 1050 insulin-like growth factor binding protein 2
(IGFBP2) precursor /hgml_locus_uid="LN0220S"
/nomgen="IBP1" /map="7p13-p12"
sigp 64 180 insulin-like growth factor binding protein 2
signal peptide
matp 181 1047 insulin-like growth factor binding protein 2
mRNA < 1 1387 IGFBP2 mRNA
signal 1362 1367 poly-A signal
binding 175 197 ATP binding site
BASE COUNT 232 a 455 c 477 g 223 t
ORIGIN
1 gtgccacctg cccgcccgcc cgctcgctcg ctcgcccgcc gcgccgcgct gccgaccgcc
61 agcatgctgc cgagagtggg ctgccccgcg ctgccgctgc cgccgccgcc gctgctgccg
121 ctgctgccgc tgctgctgct gctactgggc gcgagtggcg gcggcggcgg ggcgcgcgcg
181 gaggtgctgt tccgctgccc gccctgcaca cccgagcgcc tggccgcctg cgggcccccg
241 ccggttgcgc cgcccgccgc ggtggccgca gtggccggag gcgcccgcat gccatgcgcg
301 gagctcgtcc gggagccggg ctgcggctgc tgctcggtgt gcgcccggct ggagggcgag
361 gcgtgcggcg tctacacccc gcgctgcggc caggggctgc gctgctatcc ccacccgggc
421 tccgagctgc ccctgcaggc gctggtcatg ggcgagggca cttgtgagaa gcgccgggac
481 gccgagtatg gcgccagccc ggagcaggtt gcagacaatg gcgatgacca ctcagaagga
541 ggcctggtgg agaaccacgt ggacagcacc atgaacatgt tgggcggggg aggcagtgct
601 ggccggaagc ccctcaagtc gggtatgaag gagctggccg tgttccggga gaaggtcact
661 gagcagcacc ggcagatggg caagggtggc aagcatcacc ttggcctgga ggagcccaag
721 aagctgcgac caccccctgc caggactccc tgccaacagg aactggacca ggtcctggag
781 cggatctcca ccatgcgcct tccggatgag cggggccctc tggagcacct ctactccctg
841 cacatcccca actgtgacaa gcatggcctg tacaacctca aacagtgcaa gatgtctctg
901 aacgggcagc gtggggagtg ctggtgtgtg aaccccaaca ccgggaagct gatccaggga
961 gcccccacca tccgggggga ccccgagtgt catctcttct acaatgagca gcaggaggct
1021 cgcggggtgc acacccagcg gatgcagtag accgcagcca gccggtgcct ggcgcccctg
1081 ccccccgccc ctctccaaac accggcagaa aacggagagt gcttgggtgg tgggtgctgg
1141 aggattttcc agttctgaca cacgtattta tatatggaaa gagaccagca ccgagctcgg
1201 cacctccccg gcctctctct tcccagctgc agatgccaca cctgctcctt cttgctttcc
1261 ccgggggagg aagggggttg tggtcgggga gctggggtac aggtttgggg agggggaaga
1321 gaaattttta tttttgaacc cctgtgtccc ttttgcataa gattaaagga aggaaaagta
1381 aagtgtg
//
LOCUS HUMLBPA 1431 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human lipopolysaccharide binding protein (LBP) mRNA,
complete cds.
ACCESSION M35533
KEYWORDS lipopolysaccharide binding protein.
SOURCE Human liver, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites; for [2])
AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
TITLE Structure and function of lipopolysaccharide binding protein
JOURNAL Science (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1431)
AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.S.Tobias, 21-JUN-1990.
Author address: P.S.Tobias
Department of Immunology, IMM-12
10466 N. Torrey Pines Rd
La Jolla, CA 92037
FEATURES from to/span description
pept 1 > 1431 lipopolysaccharide binding protein (LBP)
precursor
sigp 1 75 lipopolysaccharide binding protein signal
peptide
matp 76 1431 lipopolysaccharide binding protein
BASE COUNT 319 a 417 c 359 g 336 t
ORIGIN
1 atgggggcct tggcaagagc cctgccgtcc atactgctgg cattgctgct tacgtccacc
61 ccagaggctc tgggtgccaa ccccggcttg gtcgccagga tcaccgacaa gggactgcag
121 tatgcggccc aggaggggct attggctctg cagagtgagc tgctcaggat cacgctgcct
181 gacttcaccg gggacttgag gatcccccac gtcggccgtg ggcgctatga gttccacagc
241 ctgaacatcc acagctgtga gctgcttcac tctgcgctga ggcctgtccc cggccagggc
301 ctgagtctca gcatctccga ctcctccatc cgggtccagg gcaggtggaa ggtgcgcaag
361 tcattcttca aactacaggg ctcctttgat gtcagtgtca agggcatcag catttcggtc
421 aacctcctgt tgggcagcga gtcctccggg aggcccacag gttactgcct cagctgcagc
481 agtgacatcg ctgacgtgga ggtggacatg tcgggagatt cggggtggct cttgaacctc
541 ttccacaacc agattgagtc caagttccag aaagtactgg agagcaggat ttgcgaaatg
601 atccagaaat cagtgtcctc cgatctacag ccttatctcc aaactctgcc agttacaaca
661 gagattgaca gtttcgccga cattgattat agcttagtgg aagcccctcg ggcaacagcc
721 cagatgctgg aggtgatgtt taagggtgaa atctttcatc gtaaccaccg ttctccagtt
781 accctccttg ctgcagctga ggaacacaac aaaatggtct actttgccat ctcggattat
841 gtcttcaaca cggccagcct ggtttatcat gaggaaggat atctgaactt ctccatcaca
901 gatgacatga taccgcctga ctctaatatc cgactgacca ccaagtcctt ccgacccttc
961 gtcccacggt tagccaggct ctaccccaac atgaacctgg aactccaggg atcagtgccc
1021 tctgctccgc tcctgaactt cagccctggg aatctgtctg tggaccccta tatggagata
1081 gatgcctttg tgctcctgcc cagctccagc aaggagcctg tcttccggct cagtgtggcc
1141 actaatgtgt ccgccacctt gaccttcaat accagcaaga tcactgggtt cctgaagcca
1201 ggaaaggtaa aagtggaact gaaagaatcc aaagttggac tattcaatgc agagctgttg
1261 gaagcgctcc tcaactatta catccttaac accctctacc ccaagttcaa tgataagttg
1321 gccgaaggct tcccccttcc tctgctgaag cgtgttcagc tctacgacct tgggctgcag
1381 atccataagg acttcctgtt cttgggtgcc aatgtccaat acatgagagt t
//
LOCUS HUMPEC12L 2344 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human cell 12-lipoxygenase gene, complete cds.
ACCESSION M35418
KEYWORDS lipoxygenase.
SOURCE Human platelet/erythroleukemia cell DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2344)
AUTHORS Funk,C.D., Furci,L. and FitzGerald,G.A.
TITLE Molecular cloning, primary structure, and expression of the human
platelet/erythroleukemia cell 12-lipoxygenase
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5638-5642 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.D.Funk, 20-JUN-1990.
FEATURES from to/span description
pept 42 2033 12-lipoxygenase
signal 2324 2329 poly-A signal
mRNA < 1 2344 12-lipoxygenase mRNA
BASE COUNT 514 a 692 c 642 g 496 t
ORIGIN
1 ggaggacccg gctcccctcg cctaagctgc tggggggcgc catgggccgc taccgcatcc
61 gcgtggccac cggggcctgg ctcttctccg ggtcgtacaa ccgcgtgcag ctttggctgg
121 tcgggacgcg cggggaggcg gagctggagc tgcagctgcg gccggcgcgg ggcgaggagg
181 aggagtttga tcatgacgtt gcagaggact tggggctcct gcagttcgtg aggctgcgca
241 agcaccactg gctggtggac gacgcgtggt tctgcgaccg catcacggtg cagggccctg
301 gagcctgcgc ggaggtggcc ttcccgtgct accgctgggt gcagggcgag gacatcctga
361 gcctgcccga gggcaccgcc cgcctgccag gagacaatgc tttggacatg ttccagaagc
421 atcgagagaa ggaactgaaa gacagacagc agatctactg ctgggccacc tggaaggaag
481 ggttacccct gaccatcgct gcagaccgta aggatgatct acctccaaat atgagattcc
541 atgaggagaa gaggctggac tttgaatgga cactgaaggc aggggctctg gagatggccc
601 tcaaacgtgt ttacaccctc ctgagctcct ggaactgcct agaagacttt gatcagatct
661 tctggggcca gaagagtgcc ctggctgaga aggttcgcca gtgctggcag gatgatgagt
721 tgttcagcta ccagttcctc aatggtgcca accccatgct gttgagacgc tcgacctctc
781 tgccctccag gctagtgctg ccctcgggga tggaagagct tcaggctcaa ctggagaaag
841 aacttcagaa tggttccctg tttgaagctg acttcatcct tctggatgga attccagcca
901 acgtgatccg aggagagaag caatacctgg ctgcccccct cgttatgctg aagatggagc
961 ccaatgggaa gctgcagccc atggtcatcc agattcagcc tcccagcccc agctctccaa
1021 ccccaacact gttcctgccc tcagaccccc cacttgcctg gctcctggca aagtcctggg
1081 tccgaaattc agatttccaa ctgcacgaga tccagtatca cttgctgaac actcacctgg
1141 tggctgaggt catcgctgtc gccaccatgc ggtgcctccc aggactgcac cccatcttca
1201 agttcccgat cccccatatc cgctacacca tggaaatcaa cacccgggcc cggacccaac
1261 tcatctcaga tggaggaatt tttgataagg cagtgagcac aggtggaggg ggccatgtac
1321 agttgctccg tcgggcggca gctcagctga cctactgctc cctctgtcct cctgacgacc
1381 tggctgaccg gggcctgctg ggactcccag gtgctctcta tgcccatgat gctttacggc
1441 tctgggagat cattgccagg tatgtggagg ggatcgtcca cctcttctac caaagggatg
1501 acatagtgaa gggggaccct gagctgcagg cctggtgtcg ggagatcacg gaggtggggc
1561 tgtgccaggc ccaggaccga ggtttccctg tctccttcca gtcccagagt caactctgcc
1621 atttcctcac catgtgcgtc ttcacgtgca ctgcccagca tgccgccatc aaccagggcc
1681 agctggactg gtatgcctgg gtccctaatg ctccatgcac aatgcggatg cccccaccca
1741 ccaccaagga agatgtgacg atggccacag tgatggggtc actacctgat gtccggcagg
1801 cctgtcttca aatggccatc tcatggcatc tgagtcgccg ccagccagac atggtgcctc
1861 tggggcacca caaagaaaaa tatttctcag gccccaagcc caaagctgtg ctaaaccaat
1921 tccgaacaga tttggaaaag ctagaaaagg agattacagc ccggaatgag caacttgact
1981 ggccctatga atatctgaag cccagctgca tagagaacag tgtcaccatc tgagccctag
2041 agtgactcta cctgcaagat ttcacatcag ctttaggact gacatttcta tcttgaattt
2101 catgctttcc taaagtctct gctgctaagg ctctatttcc tcccccagtt aaacccctac
2161 attagtatcc cactagccca ggggagcagt aaactttctc tgcaaagact agatcctttt
2221 ttacgctttg cagaccgcat agtcactgtc tcaactactc agctctcctg ctgcagcatg
2281 aaggcagcca cagacaacat ggaaatgagt gtgactatgt tccaataaaa ctttatggac
2341 actg
//
LOCUS HUMRALBA 1327 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human GTP-binding protein (RALB) mRNA, complete cds.
ACCESSION M35416
KEYWORDS GTP-binding protein.
SOURCE Human retina, cDNA to mRNA, clone AS181.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1327)
AUTHORS Hsieh,C.-L., Swaroop,A. and Francke,U.
TITLE Chromosomal localization and cDNA sequence of human RALB, a GTP
binding protein
JOURNAL Somat. Cell Mol. Genet. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by A.Swaroop, 20-JUN-1990.
FEATURES from to/span description
pept 171 791 GTP-binding protein (RALB)
mRNA < 1 1327 RALB mRNA
signal 1303 1308 poly-A signal
BASE COUNT 368 a 273 c 373 g 313 t
ORIGIN Chromosome cen-q13.
1 gagcccggca gctcaatgac aaatcggtgg aggacggctg gggtccggcc ccgggagggc
61 ccggggcgcg tttaagagct gcgggccggg tgcggacggc ggaggcggcg ggactggtcc
121 ctgctcttca gtgggtcatc tgtgtgtcac agcctcagaa gaccagcgag atggctgcca
181 acaagagtaa gggccagagc tccttggccc tccacaaggt gatcatggtt ggcagcggag
241 gcgttggcaa gtcagccctg acgcttcagt tcatgtatga cgagtttgta gaagactatg
301 aacctaccaa agctgacagt tatagaaaga aagtggttct tgatggggaa gaagttcaga
361 tagatattct ggacaccgct gggcaagagg actacgcagc cattcgagat aactactttc
421 ggagtgggga agggtttctt cttgtgttct caatcacaga acatgaatcc tttacagcaa
481 ctgccgaatt cagggaacag attctccgtg tgaaggctga agaagataaa attccactgc
541 tcgtcgtggg aaacaagtct gacctagagg agcggaggca ggtgcctgtg gaggaggcca
601 ggagtaaagc cgaagagtgg ggcgtgcagt acgtggagac gtcagcgaag acccgggcca
661 acgtggacaa ggtgttcttt gacctaatga gagaaatcag aacaaagaag atgtcagaaa
721 acaaagacaa gaatggcaag aaaagcagca agaacaagaa aagttttaaa gaaagatgtt
781 gcttactatg agtgtcaagg tgacggatga agccagctgc tcctaaggac acagggctgg
841 gttggtaaag agaaggctat ggttgacttc ttgcttgtgc ttcccactct ccccgacttc
901 attcactcaa acttctttaa atggggaaaa atatttgtga ctctgtggct ggcagaagaa
961 ataagcccat gcaagtggaa gggctgcttt gtcaggaggt tgtggaattt ctttcttctc
1021 cccttcttcc ctcccaaaag cttagctatg tataaagtgc cacagatagg aaacagctgt
1081 taattacaaa gagaaagaat tgtcatagca tcttattttg ttcctagttt tataacatta
1141 ccatccttcg ttttgaacta cagatgttgt agtgggtttt ggaggaggga gtggagtaag
1201 atgccctccc acttttatca gtttagtagt agtactgaga aaaatccctt cagctctaag
1261 aacactgaaa aatccaccga ttttttgggt aagcttcttg gcaataccct gtggatctga
1321 aacagct
//
LOCUS LACLACR 1332 bp ds-DNA BCT 14-AUG-1990
DEFINITION L.lactis lactose phosphotransferase system repressor (lacR) gene,
complete cds.
ACCESSION M35375
KEYWORDS lactose phosphotransferase system repressor; lactose repressor.
SOURCE L.lactis (strain MG1820) DNA.
ORGANISM Lactococcus lactis
Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods;
Lactobacillaceae.
REFERENCE 1 (bases 1 to 1332)
AUTHORS Van Rooijen,R.J. and Devos,W.M.
TITLE Molecular cloning, transcriptional analysis, and nucleotide
sequence of LACR, a gene encoding the repressor of the lactose
phosphotransferase system of Lactococcus lactis
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.J.Van Rooijen, 20-JUN-1990.
Author address: R.J.Van Rooijen
Netherlands Inst for Dairy Research
Kernhemseweg 2
P.O. Box 20
6710 BA EDE
THE NETHERLANDS
FEATURES from to/span description
pept 370 1155 lactose repressor (lacR; alt.)
pept 388 1155 lactose repressor (lacR; alt.)
mRNA 79 1245 lactose repressor mRNA
signal 1215 > 1155 transcription termination signal
binding 353 357 ribosomal binding site
signal 45 51 -35 region
signal 68 74 -10 region
BASE COUNT 469 a 207 c 198 g 458 t
ORIGIN
1 gatatcaaac attcaaacaa aacgcaacta tttttgttaa ttttttgttt ttttttattt
61 gtttttttaa aaaatagata acaccgttaa attattgttc atttttgttc atttaatcca
121 tcacaaaatg gacgtgaaat atctattcag gtattacaaa agtcttttac tttctataac
181 ttactgatta agaggtccta ctttattttc gtcttataca aaatctgacc taagctaata
241 tacgtcaatc ctctgttctt atttcatcat ctaacgtttg tttttgtttg aaattgtttg
301 ttttaccttg aaaatattat cttttatgat acaattaaaa gagaattatc tttggaaaaa
361 aattacttta tgaaagaaag tcttcatatg aacaaaaaac gacgattaga aaaaatttta
421 gatatgttaa agattgatgg gaccataacc ataaaagaaa taatagatga actagatatt
481 tccgatatga cagcccgtag agaccttgat gctctagaag ctgatggact tttaacacgt
541 actcatggtg gtgcacaatt gctttcctct aaaaagccac ttgaaaagac acatatcgag
601 aagaaaagtc taaatacaaa agaaaaaatt gacattgcta aaaaagcctg ctctttaatc
661 aaagatggcg atactatttt tattggaccc ggaactacac ttgtacaact ggcattagaa
721 ttgaaaggtc gtaaaggtta taaaattcgt gtcattacaa atagtctccc tgtgttcttg
781 attctaaatg atagcgaaac cattgattta ttgcttcttg gcggtgaata tagagaaata
841 actggagctt ttgtaggttc aatggcttcg acaaatttaa aagcaatgag atttgccaaa
901 gcttttgttc gtgcaaatgc tgttacccat aattctattg ctacatatag tgacaaggaa
961 ggtgtgattc aacaacttgc cctaaacaat gctgtagaaa aattcttatt agtagacagt
1021 actaaattcg atcgatacga tttctttaac ttctacaatc tagatcaact cgataccatc
1081 attacagata accagattag ccctcaacac ttagaggaat ttagccagta cactactatt
1141 ttaaaagcgg actagaatta tgacttataa aaatattgga ctactcttaa ataattagac
1201 ataaaaaaag caccgtatga atcaaacaat tctacggtgt ttttttgtta tttctaatgt
1261 atggtttgtc gaaaatatgt acacattatt taactttcca aaaaattgga gttttcttga
1321 taattggata tc
//
LOCUS MARCMYCA 1391 bp ds-DNA ROD 14-AUG-1990
DEFINITION Woodchuck c-myc protein gene, exon 1.
ACCESSION M35498
KEYWORDS c-myc protein.
SOURCE Woodchuck (Marmota monax) DNA.
ORGANISM Marmota monax
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae;
Marmotini.
REFERENCE 1 (bases 1 to 1391)
AUTHORS Wei,Y., Hsu,T.Y., Tiollais,P., Buendia,M.A. and Etiemble,J.
TITLE Evolutionary conservation of target sequences for cis-acting
regulation in c-myc exon 1 and its upstream sequences
JOURNAL Gene (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.A.Buendia, 21-JUN-1990.
FEATURES from to/span description
pre-msg 818 > 1391 c-myc mRNA and introns
IVS 1378 > 1391 c-myc intron 1
site 159 220 P0 promoter
site 790 794 TATA box
site 964 968 TATA box
BASE COUNT 317 a 385 c 383 g 306 t
ORIGIN
1 ctcagcgatt agtgcgtctt gcgggaatag ccgcttccca cacccggccg ggtggaagtc
61 tgagcctgct gggcaaaacg agcgatatct gctgttttgg cagcaaacta ggggattcat
121 tctgggtggg aagtgcccaa tctagatagc tgtgcataca taatgcataa tgaattacac
181 tcacacaacc tcaagaaatg taataggtat gtattcataa cactctccaa gtatatgtgg
241 caaggcattg ctgcgttatt ttaattattc cagaaatcat tttcctccct acctcctctg
301 tcatttatcc ctaacactcc atatactgaa tgcgcactca taaatattcc ttctgcccgc
361 ctgtcttcat aagacttatt ttcaaaatgc tgctctttcc ccagccttag ggaggcgccc
421 ggccgcccgg gacgtgcgtg cgcggccgtg ggtacatggt gtattctcag tgttgagggt
481 gagggcagct gttccacctg tgttaattgg aacacgcagg acgagaatgc agtttgtcag
541 agtactgcgc cagaggagca gcagagaaag ggaaaggatt taaacaggag caaaagaaaa
601 tggtaggcgc gcgcagttaa ttcttgctgc gcccttatac tgtttacatc cgatagctgg
661 agtgccgggc tgcggggctg agtctcctcc ccttccctca ctcggcagtg cccctcccag
721 gttcccaaag ccgagggcgg ggagaaagaa aaaaaaaaga ttccgtggaa tccccgccca
781 ccagcccttt ataatacgag ggtctgcgcg cccgaggacc cctgagctgc gcttctcgtg
841 gccgccaaca tcgccgcgcc ccggcggccg ctcttggctc ccctcctgcc tagagaaggg
901 cagggcttct tagaggcttg gcgggaaaaa gaagcgaggg ggagggatcg cgcgtaacag
961 cagtataaaa gtcgttttcg gggctttatc tcactcgctg tagtaattcc agcgagagac
1021 agagggagtg agcgggcgaa cccgtgaggt ggaagaaccg agcagagctg ctccgggcgt
1081 cctgggaagg gaaacccgga gtgaaaggag acttagtctt ctgaccagcg cccccacccc
1141 agccctcccg cggagcccct ccagggtccg caaccgcgaa actttgccct ttgctgcggg
1201 cggacacttt gcactggaac ttaaaatacc cgatcgagga cgcgactctc cggagcgggg
1261 aggctatact gcctatttgg ggacactttt ccccgccttt acccaggacc cgctcctctg
1321 aaagcgctcc tggctgccgt ttgaaggctg gatttccttc gggtagttga aaacccggta
1381 agcaccagat c
//
LOCUS ONGOSTLE 214 bp ds-DNA INV 14-AUG-1990
DEFINITION O.volvulus recombinant antigen gene, 3' end.
ACCESSION M35370
KEYWORDS T-cell epitope; recombinant antigen.
SOURCE O.volvulus DNA.
ORGANISM Onchocerca volvulus
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Similiidae.
REFERENCE 1 (bases 1 to 214)
AUTHORS Colina,K.F., Perler,F.B., Matsumura,I., Meda,M. and Nutman,T.B.
TITLE The identification of an Onchocerca-specific recombinant antigen
containing a T cell epitope
JOURNAL Unpublished (19900
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.B.Nutman, 19-JUN-1990.
Author address: T.B.Nutman
Inst. Lab. of Parasitic Diseases
NIAID
National Institutes of Health
Bldg 4, Rm 126
9000 Rockville Pike
Bethesda, MD 20892
email: tbn@helix.nih.gov
FEATURES from to/span description
pept < 1 93 recombinant antigen (AA at 1)
site 91 93 nematode splice junction
BASE COUNT 81 a 37 c 31 g 65 t
ORIGIN
1 gaattcagtg taagaagcag cagaacattt caatcattac gaagatatat atacaacatt
61 tctttcttct tcattcttga gttgcatatg taaattcaaa aataattacg atttaatgaa
121 ttgagcaagc ataacttttc ccagcaagta taacaaagtt ttgcgaggaa cgaactcaga
181 aaactttcac ttatgtaaaa ttgcgcacaa gacc
//
LOCUS PSEIAAL 2766 bp ds-DNA BCT 14-AUG-1990
DEFINITION P.syringae IAA-lysine synthetase (iaaL) gene, complete cds.
ACCESSION M35373
KEYWORDS IAA-lysine synthetase.
SOURCE P.syringae savastanoi DNA.
ORGANISM Pseudomonas syringae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 2766)
AUTHORS Roberto,F.F., Klee,H., White,F., Nordeen,R. and Kosuge,T.
TITLE Expression and fine structure of the gene encoding
N-epsilon-(indole-3-acetyl)-L-lysine synthetase from Pseudomonas
savastanoi
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5797-5801 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.J.Klee, 20-JUN-1990.
FEATURES from to/span description
pept 1100 2287 IAA-lysine synthetase (iaaL)
pept 95 1003 ORF1
BASE COUNT 637 a 760 c 754 g 615 t
ORIGIN
1 gaattccata gcgtgcgggg cttggaggag cgccgcggcc tgagtatctg tggctaaccc
61 ttgcggcttc ggtgctggtc gctgtcgagc agctatgcgt gcagatcctg cgcagtgcag
121 gcttcggcaa acaggcgatg tggaccctgc tgacggggac ggccgccgtt gccatcgcag
181 atcccctgct tattgtggcg ttcgatctgg gcctggtggg tgccggcatc gctacctgat
241 atcgagcctg gtatcggcct gtctggggtt ttactacgtt caccgagtcg cccatctgac
301 ctgtcgggtc agcctgaaga acctgtcagg tgacatcaga aatatcgggc gaaccgcctt
361 gccagcggtg attggcaacc tggcaactcc agtgggcatg gcctacgtaa tggctgcgat
421 ggcgccgttc ggatctcagg cgctggcgac tatcggggtg atcgacaggg tcattcaggt
481 tgctttttgt gtcgtgttcg ccttgcccgg tgcgctgatc ccgatactgg ggcaaaacct
541 gggcgcaatg aacactgctc gcgtgtctca agccataaag atgacgtacg gattgttgat
601 cggctacggc tcagtgacct cgctgttact cattctgctc gctgagccat tagccagctt
661 gtttcatctc gccgctgaac gccaagtcgt gttcttcgcg ttctgccgat ggggcggcgc
721 tctggacgct catcgggctg caattcattg ccacctcagt cttcctcagt atggggcgac
781 cggcgtacgt cacactgttc ggctggttcc gcgccacctg ggaaccatgc cgttcgtgtg
841 gtatggggca cataaatttg gcagcgtcgg ggtaatgctc gggcagttgc tgggtaacac
901 catagtggcc ttttgtgcct gcgtggctcg cgcatctgct catgaaaaag atgttggaca
961 tcgagatcca ttcaataggg aaccgatccc tccacaggag taactgataa tccacgtttt
1021 gcccaccctt ggctgtcgtc aggtgggcag gatgtccagg atgtccagga aatcaaaaaa
1081 cggactatag aggactcgca tgactgccta cgatatggaa aaggaatgga gtagaatttc
1141 cattactgcc gctaaaatcc accagaacaa cgattttgaa ggattcactt atcaggactt
1201 cagaacccac gtaccgatca tggacaaaga cggcttcgcg gcacagactg aacgctgtct
1261 agagcgcaat gagcgaaact gcctgatcgg ctttaccagt ggcaccagcg gcaacatcaa
1321 acgctgttat tactactacg actgcgaagt cgatgaagac agctccctct ccaacgtctt
1381 ccgcagcaac ggctttattc tgcccggtga tcgctgcgcc aacctgttca cgatcaacct
1441 gttttctgct ctgaacaaca cgattaccat gatggccggt aactgcggtg ctcacgtcgt
1501 gtccgtaggt gacatcaccc tggtgaccaa gagccatttc gaagcgctta actcgatcaa
1561 gctcaacgta ctgctcggcg tgccatccac tatcttgcag ttcatcaatg ccatgcaaca
1621 taacggtgtg cacatcaata tcgagaaggt tgtcttcacc ggcgagagcc tgaaaacttt
1681 ccagaagaaa atcatcaggc aagcctttgg cgaacaagtc tccatcgtcg gtgtgtatgg
1741 cagttccgag ggcggcattc tcggtttcac caacagccct tgccacactg aatacgagtt
1801 tctgtccgac aagtatttca tcgaaaaaga aggcgacagc atcctcatca cctcgctgac
1861 ccgagaaaac tttacgccgc tgctgaggta tcgcctagga gacaccgcaa ccctttcgat
1921 gaaaggcgac aagctctacc tgacagacat ccagcgggag gacatgagct tcaacttcat
1981 gggcaacctc atcgggctgg gcatcattca gcaaacgatt aaacagacac tgggccgatc
2041 gctggaaatc caggttcacc tgtcagtgac cgaagagcgc aaggaactgg tgaccgtttt
2101 cgttcaggcc tctgaagtcg atgaagacga acgcgtcaga atcgaaacag ccatcgccga
2161 tatccccgac atcaaagagg cgtatcagaa aaaccaaggc accgtgtcgg tcctgcgcaa
2221 ggatgccaga gactacgcgg tctcggagcg aggcaaaatg ctctacatca tcgaccgccg
2281 aaactgaatg gctgatgtga acgagtgagt agctgcaccg acggggcctt tggcggtgtc
2341 ggtgcagttt tttagaggat tcggaagcgc cagaggtcag agtccacgaa actggaacga
2401 actgggcagc ctgcggctgc aaattgtggg attttgaaat cggttatcat agccgaaatc
2461 gagtcgatcc ctcctcagca caggcttaca catggcgtca gagaccaaaa aacgtaaacg
2521 ggcgagccgg gcaaaagcca aggcaaagca gacccgtctc caacgcgccg ggcatactac
2581 cttcgtgccc gataccgact tttccttcga tatcgatcct ttcggtgatg tcgatctttg
2641 tagttgctgc cagacaacgt atctgaacga catgtttccc gacgcttctt gcgtaaggct
2701 ttagatgaga gaagggccag gcggattcgc atcaccgccg tcattcacca cgatgaggag
2761 ccgcct
//
LOCUS RABLPBA 1446 bp ss-mRNA MAM 14-AUG-1990
DEFINITION Rabbit lipopolysaccharide binding protein (LBP) protein mRNA,
complete cds.
ACCESSION M35534
KEYWORDS lipopolysaccharide binding protein.
SOURCE Rabbit liver, cDNA to mRNA.
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (ases 1 to 1446ites; for [2]
AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
TITLE Structure and function of lipopolysaccharide binding protein
JOURNAL Science (1990) In press
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1446)
AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.S.Tobias, 21-JUN-1990.
Author address: P.S.Tobias
Department of Immunology, IMM-12
10466 N. Torrey Pines Rd
La Jolla, CA 92037
FEATURES from to/span description
pept 1 > 1446 lipopolysaccharide binding protein (LBP)
precursor
sigp 1 78 lipopolysaccharide binding protein signal
peptide
matp 79 1446 lipopolysaccharide binding protein
BASE COUNT 316 a 454 c 391 g 285 t
ORIGIN
1 atggggacct gggccagggc cctgctgggg tccaccctgc tgagcctgct gctcgcagct
61 gccccgggag ctctgggcac caaccccggc ctcatcacca ggatcaccga caaaggcctg
121 gagtacgcgg ccagggaggg gctgctggct ctgcagagaa agctcctgga agtcacgctg
181 ccggattccg atggggactt caggatcaaa catttcgggc gtgcacagta caagttctac
241 agtctgaaaa tccccagatt cgagctgctc cgtggcaccc tgaggcccct ccccggccag
301 ggcctgagtc tcgacatctc cgacgcctac atccacgtgc ggggcagctg gaaggtgcgc
361 aaggcgttcc tgagactgaa gaactccttt gacctgtatg tcaagggcct caccatttcc
421 gtccacctcg tgttgggcag cgagtcctcc gggaggccca cggtcaccac ctccagctgc
481 agcagcgaca tccagaacgt ggagttggac atagaggggg acctggagga gctgctgaac
541 ctcctccaaa gccagatcga tgccaggctg cgcgaagtgc tggagagcaa gatttgcagg
601 cagattgagg aagccgtgac ggcccacctg cagccttatc tacagacact gccagtcaca
661 acgcagatcg acagctttgc cggcattgac tacagcttga tggaggcccc ccgggcaaca
721 gctgggatgt tggatgtgat gtttaagggt gaaattttcc ctctggatca ccgcagccca
781 gtggacttcc ttgctccagc catgaacctc cccgaggctc acagccgaat ggtctacttt
841 tccatctccg attacgtctt caacaccgcc agcctggcct accacaagtc agggtactgg
901 aacttctcca tcacagacgc catggttccg gccgacctca acatccggcg gaccaccaag
961 tccttccgac ccttcgttcc cctgcttgcc aatctctacc ccaacatgaa cttggagctc
1021 caagggacag tgaactcgga acaactggtg aacctcagca ccgagaatct gttagaggaa
1081 cccgagatgg atattgaggc cttggtggtc ctgcccagct ctgccaggga gcctgtcttc
1141 cggctgggtg tggccactaa tgtgtctgcc acactgacct tgaacaccag gaagatcact
1201 gggttcctga agccgggaag gctacaggtg gaactgaaag aatccaaagt cggaggattc
1261 aatgtggagc tgttggaagc tctcctcaac tactacattc tcaacaacct ctaccccaag
1321 gtcaatgaga agttggccca ccgcttcccg ctccctctgc tgaggcacat tcagctctac
1381 gacctgcttc tccagaccca cgagaacttc ctgctcgtgg gcgccaacat ccagtacagg
1441 agagtt
//
LOCUS RATUKATPA 1484 bp ss-mRNA ROD 14-AUG-1990
DEFINITION R.norvegicus gastric (H+,K+)-ATPase beta-subunit mRNA, complete
cds.
ACCESSION M35535
KEYWORDS (H+,K+)-ATPase beta-subunit.
SOURCE R.norvegicus gastric mucosa oxyntic cell, cDNA to mRNA, clone RG4.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1484)
AUTHORS Canfield,V.A., Okamoto,C.T., Chow,D., Dorfman,J., Gros,P.,
Forte,J.G. and Levenson,R.
TITLE Cloning of the H,K-ATPase beta subunit: Tissue-specific expression,
chromosomal assignment, and relationship to Na,K-ATPase beta
subunits
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by V.A.Canfield, 21-JUN-1990.
Author address: V.A.Canfield
Yale University School of Medicine
Dept. of Cell Biology
P.O. Box 3333
New Haven, CT 06510
email: levenson@YALEMED
FEATURES from to/span description
pept 176 1060 gastric (H+,K+)-ATPase beta-subunit (E.C.
3.6.1.3)
mRNA 1 1484 gastric (H+,K+)-ATPase beta-subunit
BASE COUNT 363 a 457 c 362 g 302 t
ORIGIN
1 ctgacttctg ggacagtgga ggacagatag cacgcaagcc ccagccctcc cttatgttta
61 tagaggcgat agcggagaac tgatagctgg ttctgatgcc tttggcctca cacagaggag
121 actataagcc ccagaggacg ctccctgggc ccagtccagg caagcaggag aggacatggc
181 agccctgcag gagaagaagt catgcagcca gcgcatggcc gaattccggc aatactgttg
241 gaacccggac actgggcaga tgctgggccg caccccagcc cggtgggtgt ggatcagcct
301 gtactatgca gctttctacg tggtcatgac tgggctcttt gccttgtgca tctatgtgct
361 gatgcagacc attgatccct acacccccga ctaccaggac cagttaaagt caccgggggt
421 aaccttgaga ccggatgtgt atggggaaag agggctgcag atttcctaca acatctctga
481 aaacagctcc tgggctggcc tcacacacac cctccacagc ttcttagcgg gctacacccc
541 agcatcccag caggacagca tcaactgttc gtctgaaaag tacttcttcc aggagacctt
601 ttctgctccg aaccatacca agttctcctg caagttcacg gcggacatgc tacagaattg
661 ctcaggcctg gtggacccca gtttcggctt tgaggaggga aagccctgct tcattattaa
721 aatgaacagg attgtcaagt tcctgcccag caacaacacg gctccccgag tggactgcac
781 cttccaggat gacccccaaa agccccggaa ggacattgaa cccctgcagg tccagtacta
841 tccccccaat ggtaccttca gtctccacta cttcccctac tacggcaaga aagcacagcc
901 ccactacagc aaccctctgg tggcggcaaa gttcctcaac gtccccaaaa acacgcaagt
961 cctcatcgtg tgcaagatca tggcggacca cgtgaccttc gacaaccccc acgaccccta
1021 tgaagggaag gtggagttca agctcacaat acagaagtaa ggagtaggcg tggctgtcca
1081 ccccagagcc tggtggaccc tgagggacca ctcttcctga ctgacatcat cggctggcca
1141 gcatgcacgg ccacttcatg gttcagagct gacaccactg cccatctgcc gacagcagga
1201 agtgctcctt cccagcactc cctgagcacc accagctttg aactgaaacc cgacgtgcgc
1261 acgcacgttt gcaatcccgt gcggttaaca caggaaccca gagtccggct accactaagg
1321 gacaacccat ctgtagggca tttctatcct gtgaccattt gtctgtcctg cactttgata
1381 tgaactatgg gtccacatca gtgtaacact ggtcaccccg gcctccagtt tgtgcttctg
1441 gggccacagc ccctaggtca ttaaaacaaa ctatagtaaa gtta
//
LOCUS YSCMYO2A 5675 bp ds-DNA PLN 14-AUG-1990
DEFINITION S.cerevisiae myosin-1 isoform (MYO2) gene, complete cds.
ACCESSION M35532
KEYWORDS myosin-1.
SOURCE S.cerevisiae (strain GRF88) DNA, clone 10-2B.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 5675)
AUTHORS Johnston,G.C., Prendergast,J.A. and Singer,R.A.
TITLE The S.cerevisiae MYO2 gene encodes an essential myosin for
vectorial transport of vesicles
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.C.Johnston, 21-JUN-1990.
Author address: G.C.Johnston
Dalhousie University
7E Tupper Medical Bldg.
Halifax, N.S.
CANADA B3H 4H7
email: JOHNSTON@AC.DAL.CA
FEATURES from to/span description
pept 581 5305 myosin 1 isoform (MYO2)
binding 138 171 ATP-binding site
binding 443 523 actin-binding site
rpt 926 981 heptad repeat
rpt 1010 1086 heptad repeat
BASE COUNT 1994 a 986 c 1110 g 1585 t
ORIGIN Chromosome 15 right arm.
1 gatcaataaa taaataggct cgaagacgcc tcagaactcc ggtcactggt ttgtcttgtt
61 gatatacgat gtgccaagcg ccgtttctcg atgcttatct ggtttagttt acgctgttaa
121 aaccaaaacc ccaacagatt ttcgacccta acgtatgtag ggctaaaata gatattgagt
181 aggttacaat taattattgg caattgcacc tagtgacaca tttacgaaaa cgtagggcaa
241 aaactattac ccgacccagg gctattttgt gattttttcc ttttttttgt ttatgatcgc
301 gcttctcgaa aagccaaata tcagaaatcc caaacacgcc ttcatttgat acgattcgta
361 gcctgcgttt cagagatcta tcaactttgc aaggccaatc agagaacaaa aaagtctcgc
421 aaagtcattt cacttttctc gcttgaaatt attcgttcga tttctggctg cttgcttgtt
481 ttttgttttc taaggtacta ttcgacacca ttccattgga cagcgatact tataccattg
541 tacatatagg acataaaaac agcagatatt acagcgtata atgtcttttg aagtgggtac
601 acgatgctgg tatccccata aagaattggg ctggattggg gcggaagtaa tcaaaaatga
661 gttcaacgac ggcaagtacc acctggagtt acaattggaa gacgatgaaa tcgtgtccgt
721 ggacacaaaa gacttgaata acgataagga ccaatctcta ccgcttctta gaaaccctcc
781 cattttggaa gcaacggaag atttgacctc tttatcttac ttgaatgagc cagctgtttt
841 acatgccatc aaacagcgct attctcaatt gaatatctac acatactcgg gtattgttct
901 gattgctaca aacccttttg atcgtgtcga ccagctttat acacaagaca tgatccaagc
961 atatgcggga aagcgcagag gtgaactgga acctcacttg tttgccattg ccgaagaagc
1021 gtataggttg atgaaaaatg acaaacaaaa tcaaaccatt gtggtaagtg gtgaatctgg
1081 tgctggaaaa acggtttctg ccaagtatat tatgcgttat tttgcttctg tagaagagga
1141 aaattccgct actgtacaac atcaagtgga aatgtcggaa acagaacaaa agattctagc
1201 tacaaaccct atcatggaag catttggtaa tgctaagact accagaaatg acaattcttc
1261 cagatttggt aagtatctag aaattttatt cgataaggac acatctatta ttggagcaag
1321 gatccgcaca tacttgttgg aacggtccag attagtttac cagccgccaa ttgagagaaa
1381 ctaccacata ttttatcaat taatggctgg attaccagct caaaccaagg aggaattgca
1441 tcttaccgat gcctcagatt acttctacat gaaccaaggc ggtgacacca agatcaacgg
1501 tattgatgat gccaaagaat acaaaattac agtagatgca ttgacattag tcggaatcac
1561 caaggaaact caacaccaaa tatttaagat cttggccgca cttctgcata tcggtaacat
1621 agaaattaaa aaaactagaa atgatgcatc actatcagct gatgagccaa acctgaaact
1681 ggcgtgcgaa ttgctgggaa ttgatgccta caactttgcc aaatgggtca ccaaaaagca
1741 gatcattaca aggtcagaga aaattgtttc gaatctaaat tatagtcaag ctctggttgc
1801 caaagattcc gtggctaagt ttatttattc cgcccttttc gattggcttg tggaaaatat
1861 caacaccgtg ttatgcaacc cggctgtgaa cgaccaaatt agctcattta ttggtgttct
1921 ggatatttat gggtttgaac attttgaaaa aaattcattt gaacaatttt gtattaacta
1981 tgccaacgaa aaactacaac aagagttcaa ccaacatgtt ttcaaattag agcaagaaga
2041 atacgttaaa gaagaaattg aatggtcttt tatagagttt aatgataatc aaccttgtat
2101 tgatctgatt gaaaacaagt tgggtatttt atcactgctt gacgaagaaa gtaggttacc
2161 tgctggttcc gacgaatctt ggacccaaaa actttatcaa actttggata aatctcctac
2221 gaacaaagta ttttctaaac caagattcgg gcaaactaaa tttatcgtga gccattatgc
2281 tctagatgtc gcttatgatg tggaaggatt tattgaaaaa aatagagaca ccgtatctga
2341 cggacatttg gaagtgttga aggcttctac caacgagaca ctaataaata tcttagaggg
2401 attagaaaaa gctgccaaaa aactggaaga agcgaaaaag cttgaattag agcaggctgg
2461 cagtaaaaag ccaggtccga taagaacggt taacaggaaa cccactttag gttccatgtt
2521 taagcaatct ttgattgaac taatgaatac catcaactca actaatgttc attatattcg
2581 ttgtataaag cctaatgcag ataaagaagc ttggcaattt gataatttga tggtgttgtc
2641 tcaactcaga gcctgtggtg ttttggaaac tattagaata tcttgtgctg ggtttccttc
2701 taggtggact tttgaagaat ttgtattaag atattacatc ttgataccac atgagcagtg
2761 ggacctaatc ttcaaaaaaa aggaaactac agaagaagat atcatatcag tggttaaaat
2821 gatcctagat gctactgtaa aggacaaatc caagtaccag attggtaata caaaaatttt
2881 cttcaaagca ggtatgcttg catatctgga aaaacttaga agcaataaga tgcataattc
2941 aattgttatg atccagaaga aaattagagc taaatattac cgtaagcagt atttgcaaat
3001 atctcaggcc atcaagtatt tgcagaacaa catcaaaggt ttcatcattc gtcaacgcgt
3061 taatgatgaa atgaaagtta actgtgcaac tttattacag gccgcttaca ggggtcattc
3121 catccgtgcc aatgtgttca gcgtattgag aacaattaca aatttgcaaa agaaaattag
3181 aaaggaacta aaacaaagac aactgaaaca agaacatgaa tataatgctg cggtaactat
3241 tcaaagtaaa gttaggacct ttgagccgag atcgagattt ttacgcacta aaaaagacac
3301 tgttgttgtc caatctttga tcagaagaag agctgctcaa aggaaattga aacaattgaa
3361 ggcagacgct aaatcagtta atcatctgaa agaagtgagc tataaattag agaataaagt
3421 gattgaactg acgcagaatc tagcatccaa ggtcaaagaa aataaagaaa tgacagaaag
3481 aattaaagaa ctacaggttc aagtggaaga aagtgccaag ttacaagaga cattagaaaa
3541 tatgaaaaaa gagcacttaa tagatattga taatcagaaa tctaaggata tggaattaca
3601 aaaaactatt gagaacaatt tgcaatccac tgaacaaact ctaaaggacg ctcaattaga
3661 gttggaggac atggttaaac aacatgatga attgaaagaa gaatctaaaa agcaacttga
3721 agaattagag caaacaaaga aaacattggt tgaataccag acattaaacg gagacttgca
3781 aaacgaagtt aaatctttaa aggaagaaat tgctaggtta caaactgcca tgtcgctggg
3841 caccgttact actagtgtac tacctcaaac accattaaag gatgtaatgg gaggcggtgc
3901 ttcaaatttc aacaatatga tgcttgagaa ttccgactta tctcctaatg atttgaatct
3961 aaagtctaga tctactccat cgtccggaaa caaccacatt gattcattga gtgtcgatcg
4021 cgaaaatggt gtcaatgcta cacaaatcaa tgaagagtta tacaggttat tggaggacac
4081 tgaaattttg aatcaagaaa tcacggaagg cctgttaaag ggattcgaag taccggatgc
4141 tggtgtagct attcaactaa gtaaaagaga cgttgtttat ccggctagaa tactgattat
4201 agttttaagt gaaatgtgga gatttgggct gaccaagcaa agtgaaagct ttcttgccca
4261 agtattgact acaattcaaa aagttgtcac tcaattgaag ggtaacgatt taattccaag
4321 cggtgtattc tggttagcaa acgttagaga gttatactca tttgtggtgt ttgctctaaa
4381 ctctatttta accgaagaaa cgttcaaaaa cggcatgacc gatgaggagt ataaggagta
4441 tgtttcattg gtcacagaac taaaggatga tttcgaagct ctaagttata atatatataa
4501 catttggctg aagaaattgc agaagcaatt gcaaaaaaag gccatcaatg ctgtggtcat
4561 ctccgaatca ttaccaggtt tcagcgcggg agaaaccagc gggtttttga acaaaatttt
4621 tgctaacact gaagaatata caatggacga cattttgacc tttttcaaca gcatatactg
4681 gtgcatgaaa tcttttcata ttgagaatga agtgttccat gctgtagtca caaccttatt
4741 gaattatgtg gatgcaattt gttttaacga attaatcatg aaacgtaatt tcttgtcgtg
4801 gaaaaggggt cttcaattga actacaacgt tactagatta gaggaatggt gcaagacgca
4861 tggcttgaca gatggtactg agtgcttaca acatttgatt cagaccgcta agctactgca
4921 agtccgtaag tatactatcg aagacattga tatcttaaga ggaatttgtt attcgctaac
4981 acctgcacaa ttgcaaaaat tgatttcaca ataccaggtg gcagactatg agtctccaat
5041 tccacaggaa atcttaagat acgttgctga tatagttaag aaagaagctg cgttatcttc
5101 atcaggtaat gattctaagg gtcacgagca tagcagcagt atatttatca ctccagaaac
5161 aggtccattt actgacccat tcagtttgat aaagacaaga aaatttgacc aagtagaagc
5221 ctatatacca gcgtggttat ccttgccctc aactaagaga atagttgacc ttgttgccca
5281 acaagtcgtt caagacggcc actaaaactg atggcgcgag aaacaaaatt gtacatgaat
5341 gctaaaaaaa gaaatgacaa aaaaagagaa aaaaaaaaat gaaactacat agttaattaa
5401 taatagaagt atttgtcaat agtatgataa tgaaatcgat attatggaag atattaaccg
5461 cgcgccgtat tagtgtacac tatattaaac tacattttgc ttcttactga atttataaat
5521 tatgattata ttattattac tattatgact actgtatata tttttagaat tagatcggga
5581 accgatgagc gttagctgaa atggacgacg ataaggaacg ataattacca ctagtaaaat
5641 aataacaact aagaataaac acattctcat tttta
//
LOCUS BSURPOF 3078 bp ds-DNA BCT 14-AUG-1990
DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB),
complete cds.
ACCESSION M34995
KEYWORDS RNA polymerase; minor sigma-37 factor.
SOURCE B.subtilis (strains 168 and Marburg), clones
lambda-[21,gt11,1,11,21,31].
ORGANISM Bacillus subtilis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 368 to 2297)
AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W.
TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus
subtilis RNA polymerase: Isolation, nucleotide sequence,
chromosomal locus, and cryptic function
JOURNAL J. Bacteriol. 169, 771-778 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 3078)
AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W.
TITLE Similar organization of the sigB and spoIIA operons encoding
alternate sigma factors of Bacillus subtilis RNA polymerase
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.W.Price, 18-FEB-1987. Draft entry and computer-readable
sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990.
FEATURES from to/span description
pept 200 529 ORF V
pept 526 1008 ORF W
pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg
start codon)
pept 1762 2361 ORF X
mRNA 131 2368 sigB operon mRNA
signal 131 168 sigB operon promoter
signal 2368 2401 sigB transcription termination signal
binding 188 194 ORF V ribosomal binding site
binding 511 519 ORF W ribosomal binding site
binding 954 961 sigB ribosomal binding site
binding 1750 1753 ORF X ribosomal binding site
BASE COUNT 999 a 575 c 755 g 749 t
ORIGIN 40 degrees on B.subtilis map.
1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt
61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt
121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg
181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat
241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct
301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat
361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg
421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt
481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct
541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg
601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc
661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg
721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa
781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac
841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat
901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg
961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga
1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg
1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca
1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc
1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg
1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc
1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga
1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg
1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac
1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca
1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct
1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat
1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga
1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta
1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct
1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac
1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc
1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg
2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt
2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc
2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata
2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta
2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat
2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt
2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg
2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac
2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga
2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac
2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca
2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag
2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa
2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt
2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt
2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc
3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg
3061 caaactgctg cgaagctt
//
LOCUS HUMERPR 1624 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human erythropoietin receptor mRNA, complete cds.
ACCESSION M34986
KEYWORDS erythropoietin receptor.
SOURCE Human cell line OCI-M1, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1624)
AUTHORS Ehrenman,K. and St John,T.
TITLE Sequence of the human erythropoietin gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.St.John, 07-JUN-1990.
Author address: T.St.John
Hutchinson Cancer Res Center
1124 Columbia St.
Seattle, WA
Email: TSTJOHN@FHCRCVAX
FEATURES from to/span description
pept 1 1527 erythropoietin receptor precursor
sigp 1 72 erythropoietin receptor signal peptide
matp 73 1524 erythropoietin receptor
BASE COUNT 276 a 537 c 498 g 313 t
ORIGIN
1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct
61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc
121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg
181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc
241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt
301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta
361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat
421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc
481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac
541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag
601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc
661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg
721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg
781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag
841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac
901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc
961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg
1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc
1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg
1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc
1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct
1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca
1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct
1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta
1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg
1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata
1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc
1621 agga
//
LOCUS HUMGFP40H 4379 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete
cds.
ACCESSION M30135
KEYWORDS T-cell growth factor; cytokine; mast cell growth factor;
megakeryoblastic leukemia cell growth factor.
SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4379)
AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
Van Snick,J.
TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic
organization, and comparison with the mouse gene
JOURNAL J. Immunol. 144, 4235-4241 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Van Snick, 12-DEC-1989.
FEATURES from to/span description
pept 796 909 P40 T-cell and mast cell growth factor
precursor, exon 1
1022 1057 P40 T-cell and mast cell growth factor
precursor, exon 2
1140 1172 P40 T-cell and mast cell growth factor
precursor, exon 3
2461 2592 P40 T-cell and mast cell growth factor
precursor, exon 4
4096 4215 P40 T-cell and mast cell growth factor
precursor, exon 5
sigp 796 849 P40 T-cell and mast cell growth factor signal
peptide
matp 850 909 P40 T-cell and mast cell growth factor
1022 1057 P40 T-cell and mast cell growth factor
1140 1172 P40 T-cell and mast cell growth factor
2461 2592 P40 T-cell and mast cell growth factor
4096 4212 P40 T-cell and mast cell growth factor
IVS 910 1021 hP40 intron A
IVS 1058 1139 hP40 intron B
IVS 1173 2460 hP40 intron C
IVS 2593 4095 hP40 intron D
signal 742 747 TATA box
signal 4354 4359 polyA signal
rpt 2965 3265 Alu repeat
binding 370 376 AP-2 binding site
binding 624 629 IRF-1 binding site
binding 627 633 AP-1 binding site
BASE COUNT 1208 a 921 c 904 g 1346 t
ORIGIN
1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt
61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc
121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca
181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct
241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact
301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc
361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag
421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa
481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt
541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt
601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa
661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt
721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc
781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc
841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac
901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat
961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta
1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac
1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg
1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg
1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat
1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt
1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt
1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag
1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa
1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc
1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg
1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc
1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc
1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat
1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac
1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc
1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg
1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt
2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc
2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga
2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc
2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa
2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa
2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt
2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag
2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg
2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac
2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt
2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc
2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc
2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct
2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag
2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa
2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg
3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg
3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat
3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca
3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc
3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt
3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac
3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg
3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc
3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt
3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca
3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca
3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact
3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga
3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt
3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc
3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc
3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg
4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa
4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac
4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg
4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc
4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga
4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa
//
LOCUS MEFDNABP 1710 bp ds-DNA BCT 14-AUG-1990
DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds.
ACCESSION M34778
KEYWORDS DNA binding protein HMfB.
SOURCE M.fervidus DNA.
ORGANISM Methanothermus fervidus
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Methanobacteriales; Methanothermaceae.
REFERENCE 1 (sites; for [2])
AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
TITLE HMf, a DNA binding protein isolated from the hypothermophilic
archaeon Methanothermus fervidus, is most closely related to
histones
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5788-5791 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 1710)
AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by K.Sandman, 01-JUN-1990.
Author address: K.Sandman
Department of Microbiology
Ohio State University
484 W. 12th Ave.
Columbus, Ohio 43210
E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU
FEATURES from to/span description
pept 578 787 DNA binding protein HMfB
pept 1662 805 (c) ORF285
signal 450 457 promoter
signal 523 530 promoter
signal 790 814 transcription termination signal
BASE COUNT 619 a 263 c 265 g 561 t 2 others
ORIGIN 1 bp upstream of HindIII site.
1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa
61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag
121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag
181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg
241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat
301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat
361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa
421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt
481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt
541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat
601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct
661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag
721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa
781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat
841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc
901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt
961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat
1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa
1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg
1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata
1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc
1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc
1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc
1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat
1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc
1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt
1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt
1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa
1681 gttttgtttt acccattgct tctaaatatt
//
LOCUS MUSINT1A 5607 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds.
ACCESSION K02593 M34750
KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene.
SOURCE Mouse (BALB/c) DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1098 to 5606)
AUTHORS Van Ooyen,A. and Nusse,R.
TITLE Structure and nucleotide sequence of the putative mammary oncogene
int-1; Proviral insertions leave the protein-encoding domain intact
JOURNAL Cell 39, 233-240 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 5607)
AUTHORS Nusse,R., Theunissen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A.,
Otte,A., Schuuring,E. and van Ooyen,A.
TITLE The Wnt-1 (int-1) oncogene promoter and its mechanism of activation
by insertion of proviral DNA of the mouse mammary tumor virus
JOURNAL Mol. Cell. Biol. 10, 4170-4179 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer readable sequence for [2] kindly submitted
by R.Nusse, 31-MAY-1990.
FEATURES from to/span description
pept 1779 1882 Wnt-1 protein (exon 1)
2452 2705 Wnt-1 protein (exon 2)
3279 3544 Wnt-1 protein (exon 3)
4002 4490 Wnt-1 protein (exon 4)
IVS 1883 2451 Wnt-1 cds intron A
IVS 2706 3278 Wnt-1 cds intron B
IVS 3545 4001 Wnt-1 cds intron C
site 1133 1134 MMTV insertion site in tumor 17 [1]
site 1394 1395 MMTV insertion site in tumor 35 [1]
site 1631 1632 MMTV insertion site in tumor 102 [1]
site 4495 4496 MMTV insertion site in tumor 53 [1]
mRNA 1428 5398 Wnt-1 mRNA (alt.)
mRNA 1595 5398 Wnt-1 mRNA (alt.)
signal 1562 1566 TATA box
signal 1573 1577 TATA box
BASE COUNT 1188 a 1681 c 1519 g 1219 t
ORIGIN Chromosome 15.
1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc
61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac
121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac
181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa
241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg
301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg
361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac
421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct
481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact
541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac
601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag
661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg
721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga
781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca
841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact
901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc
961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca
1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg
1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga
1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg
1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg
1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc
1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc
1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg
1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt
1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa
1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag
1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact
1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt
1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc
1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg
1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg
1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag
1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct
2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc
2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct
2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt
2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg
2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg
2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg
2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt
2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga
2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat
2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg
2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa
2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc
2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag
2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg
2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg
2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc
3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag
3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag
3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact
3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac
3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat
3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc
3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg
3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc
3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg
3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg
3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag
3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag
3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt
3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca
3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac
3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt
3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc
4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc
4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc
4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg
4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat
4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt
4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc
4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg
4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc
4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg
4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc
4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga
4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc
4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac
4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac
4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc
4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa
4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc
5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc
5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac
5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga
5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac
5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct
5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat
5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca
5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga
5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat
5581 gaccctaaat gagaccaact agggatc
//
LOCUS MUSP40M 3808 bp ds-DNA ROD 14-AUG-1990
DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete
cds.
ACCESSION M30136
KEYWORDS T-cell growth factor; cytokine; mast cell growth factor.
SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210
DNA, clone lambda-S40.1a.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3808)
AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
Van Snick,J.
TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic
organization, and comparison with the mouse gene
JOURNAL J. Immunol. 144, 4235-4241 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.Van Snick, 21-NOV-1989.
FEATURES from to/span description
pept 644 757 P40 T-cell and mast cell growth factor
precursor, exon 1
846 881 P40 T-cell and mast cell growth factor
precursor, exon 2
965 997 P40 T-cell and mast cell growth factor
precursor, exon 3
2184 2315 P40 T-cell and mast cell growth factor
precursor, exon 4
3393 3512 P40 T-cell and mast cell growth factor
precursor, exon 5
sigp 644 697 P40 T-cell and mast cell growth factor signal
peptide
matp 698 757 P40 T-cell and mast cell growth factor
846 881 P40 T-cell and mast cell growth factor
965 997 P40 T-cell and mast cell growth factor
2184 2315 P40 T-cell and mast cell growth factor
3393 3509 P40 T-cell and mast cell growth factor
IVS 758 845 mP40 intron A
IVS 882 964 mP40 intron B
IVS 998 2183 mP40 intron C
IVS 2316 3392 mP40 intron D
signal 589 594 TATA box
binding 227 233 AP-2 binding site
binding 475 480 IRF-1 binding site
binding 478 484 AP-1 binding site
BASE COUNT 1012 a 903 c 869 g 1024 t
ORIGIN Chromosome 13.
1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca
61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt
121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag
181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc
241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga
301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt
361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt
421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga
481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg
541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta
601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat
661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg
721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga
781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac
841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag
901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt
961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc
1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct
1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat
1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga
1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa
1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac
1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac
1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc
1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca
1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta
1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac
1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc
1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct
1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta
1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat
1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg
1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc
1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg
2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa
2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt
2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt
2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa
2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca
2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc
2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag
2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg
2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct
2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag
2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg
2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag
2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat
2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc
2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct
2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc
3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga
3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg
3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga
3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa
3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt
3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc
3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc
3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga
3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg
3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag
3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga
3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac
3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat
3781 aggtaggatg gagcaaagac ttggtaat
//
LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 14-AUG-1990
DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete
cds.
ACCESSION M29938
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone
ZQ30-1.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1105)
AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
LeGuern,C.
TITLE Class II genes of miniature swine: III. Characterization of an
expressed pig class II gene homologous to HLA-DQA
JOURNAL Immunogenetics 31, 52-56 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept 13 777 MHC SLA-DQ alpha chain precursor
sigp 13 81 MHC SLA-DQ alpha chain signal peptide
matp 82 774 MHC SLA-DQ alpha chain
mRNA < 1 1105 MHC SLA-DQ mRNA
BASE COUNT 254 a 290 c 255 g 306 t
ORIGIN
1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc
61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta
121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag
181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa
241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg
301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact
361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac
421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt
481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc
541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat
601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag
661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc
721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc
781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa
841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc
901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg
961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc
1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
1081 aaacaataaa tcacttcatg aggtt
//
LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 14-AUG-1990
DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete
cds.
ACCESSION M29939
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1174)
AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
LeGuern,C.
TITLE Class II genes of miniature swine: III. Characterization of an
expressed pig class II gene homologous to HLA-DQA
JOURNAL Immunogenetics 31, 52-56 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept 69 836 MHC SLA-DQ alpha chain precursor
sigp 69 137 MHC SLA-DQ alpha chain signal peptide
matp 138 833 MHC SLA-DQ alpha chain
BASE COUNT 273 a 313 c 266 g 322 t
ORIGIN
1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg
61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga
121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg
181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat
241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta
301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca
361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt
421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca
481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt
541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca
601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata
661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga
721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct
781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca
841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac
901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct
961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga
1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc
1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
1141 aaacaataaa tcacttcatg aggtttttgt actg
//
LOCUS PIGMHDQBA 212 bp ds-DNA MAM 14-AUG-1990
DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end.
ACCESSION M29944
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 212)
AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F.,
Gustafsson,K. and LeGuern,C.
TITLE Class II genes of miniature swine: II. Molecular identification and
characterization of B (beta) genes from SLA-c haplotype
JOURNAL Immunogenetics 31, 1-6 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.LeGuern, 17-NOV-1989.
FEATURES from to/span description
pept < 1 12 MHC SLA-DO beta chain
BASE COUNT 38 a 58 c 52 g 64 t
ORIGIN
1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg
61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc
121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct
181 cttacccttc tgcctttctt cctcctactt gt
//
LOCUS PP2LEND 266 bp ds-DNA PHG 14-AUG-1990
DEFINITION Bacteriophage P2 left end DNA.
ACCESSION M34756
KEYWORDS .
SOURCE Bacteriophage P2 DNA.
ORGANISM Bacteriophage P2
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 266)
AUTHORS Ziermann,R. and Calendar,R.
TITLE Characterization of the cos sites of Bacteriophages P2 and P4
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by R.Ziermann, 05-JUN-1990.
Author address: R.Ziermann
University of California
Dept. Mol. Cell Biology
Berkeley, CA 94720
Email: calendar@garnet.berkeley.edu
FEATURES from to/span description
site 1 19 cohesive end of phage DNA
site 20 266 left end of phage DNA
BASE COUNT 41 a 58 c 99 g 68 t
ORIGIN Map position 0.0-0.8%.
1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct
61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt
121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg
181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc
241 gagccagccg tttatctcgc ggatcc
//
LOCUS RCABCHH 314 bp ds-DNA BCT 14-AUG-1990
DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and
F108) genes, 3' end, and 5' end respectively.
ACCESSION M34843
KEYWORDS .
SOURCE R.capsulata DNA.
ORGANISM Rhodobacter capsulatus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 314)
AUTHORS Yang,Z. and Bauer,C.E.
TITLE Rhodobacter capsulatus genes involved in early steps of the
bacteriochlorophyll biosynthetic pathway
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.E.Bauer, 04-JUN-1990.
FEATURES from to/span description
pept < 1 79 bacteriochlorophyll biosynthesis protein
(bchH) (AA at 2)
pept 108 > 314 bacteriochlorophyll biosynthesis protein
(F108)
BASE COUNT 59 a 87 c 119 g 49 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga
61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg
121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg
181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg
241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca
301 agcgcgtgct gcag
//
LOCUS TREANTAT 999 bp ds-DNA ORG 14-AUG-1990
DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle.
ACCESSION M34848
KEYWORDS .
SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA.
ORGANISM Kinetoplast Trypanosoma evansi
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae;
Trypanosoma evansi.
REFERENCE 1 (bases 1 to 999)
AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
Steinert,M. and Hamers,R.
TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
JOURNAL Mol. Biochem. Parasitol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by E.Bajyana Songa, 05-JUN-1990.
FEATURES from to/span description
site 367 516 kinetoplast minicircle common region (ORI)
BASE COUNT 391 a 78 c 203 g 327 t
ORIGIN
1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS VIBSCRRE 1524 bp ds-DNA BCT 14-AUG-1990
DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds,
and sucrose transport (scrA) protein, 5' end.
ACCESSION M35009
KEYWORDS Scr repressor protein; sucrose transport protein.
SOURCE V.alginolyticus DNA.
ORGANISM Vibrio alginolyticus
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Vibrionaceae.
REFERENCE 1 (bases 1 to 1524)
AUTHORS Blatch,G.L. and Woods,D.R.
TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr
repressor-encoding gene
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.L.Blatch, 10-JUN-1990.
Author address: G.L.Blatch
University of Cape Town
Dept. Microbiology
Private Bag
Rondebosch
7700 SOUTH AFRICA
Email: bltgre01.uctvax@f4.n494.z5.fidonet.org
FEATURES from to/span description
pept 36 < 1 (c) sucrose transport protein (scrA)
pept 271 555 Scr repressor protein (scrR)
pept 791 1090 ORF5
binding 264 267 scrR ribosomal binding site
binding 780 785 ORF5 ribosomal binding site
signal 178 183 scrA -10 region
signal 203 208 scrA -35 region
signal 190 195 scrR -10 region
signal 162 167 scrR -10 region
signal 762 767 ORF5 -35 region
signal 742 747 ORF5 -35 region
site 78 91 repressor operator site (put.)
site 229 242 repressor operator site (put.)
site 12 71 ScrR helix-turn-helix motif
BASE COUNT 466 a 344 c 321 g 393 t
ORIGIN
1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc
61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg
121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt
181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc
241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg
301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa
361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc
421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg
481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt
541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga
601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt
661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac
721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag
781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt
841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga
901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg
961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga
1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta
1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc
1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt
1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt
1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat
1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct
1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca
1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa
1501 gccaattgtg atattgaaga agct
//
LOCUS XELNRP1A 2431 bp ss-mRNA VRT 14-AUG-1990
DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA,
complete cds.
ACCESSION M34894
KEYWORDS RNA-binding protein; nervous system-specific protein.
SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2431)
AUTHORS Richter,K., Good,P.J. and Dawid,I.B.
TITLE A developmentally regulated, nervous system-specific gene in
Xenopus encodes a putative RNA-binding protein
JOURNAL New Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.Good, 05-JUN-1990.
FEATURES from to/span description
pept 74 1117 pot. RNA-binding protein (nrp-1B)
mRNA < 1 2431 nrp-1B mRNA
signal 2407 2418 poly-A signal
site 22 27 RNP2 consensus sequence
site 58 68 RNP1 consensus sequence
site 111 116 RNP2 consensus sequence
site 147 157 RNP1 consensus sequence
site 921 941 alternative exon (put.)
BASE COUNT 663 a 522 c 543 g 703 t
ORIGIN
1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt
61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc
121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact
181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt
241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa
301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt
361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt
421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga
481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac
541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa
601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt
661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat
721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc
781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc
841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc
901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag
961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag
1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc
1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct
1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct
1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga
1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat
1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg
1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat
1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta
1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa
1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat
1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac
1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg
1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc
1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat
1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc
1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga
1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct
2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg
2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc
2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat
2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg
2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca
2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata
2401 gttcaaaata aactatgaat tttattcatt t
//
LOCUS XELNRP1B 2324 bp ss-mRNA VRT 14-AUG-1990
DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA,
complete cds.
ACCESSION M34895
KEYWORDS RNA-binding protein; nervous system-specific protein.
SOURCE X.laevis stage 24, cDNA to mRNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 2324)
AUTHORS Richter,K., Good,P.J. and Dawid,I.B.
TITLE A developmentally regulated, nervous system-specific gene in
Xenopus encodes a putative RNA-binding protein
JOURNAL New Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.J.Good, 05-JUN-1990.
FEATURES from to/span description
pept 29 1051 pot. RNA-binding protein (nrp-1B)
mRNA < 1 2324 nrp-1B mRNA
signal 2306 2311 poly-A signal
site 22 27 RNP2 consensus sequence
site 58 68 RNP1 consensus sequence
site 111 116 RNP2 consensus sequence
site 147 157 RNP1 consensus sequence
BASE COUNT 647 a 467 c 520 g 690 t
ORIGIN
1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc
61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac
121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt
181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca
241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga
301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat
361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca
421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg
481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt
541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc
601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat
661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag
721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac
781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg
841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc
901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat
961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat
1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg
1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct
1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat
1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta
1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa
1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa
1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct
1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa
1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc
1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact
1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg
1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt
1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct
1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag
1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag
1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt
1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta
2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac
2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt
2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat
2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa
2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc
//
LOCUS YSJSACDHY 2569 bp ds-DNA PLN 14-AUG-1990
DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds.
ACCESSION M34929
KEYWORDS saccharopine dehydrogenase.
SOURCE Y.lipolytica (strain W29) DNA.
ORGANISM Yarrowia lipolytica
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2569)
AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and
Gaillardin,C.
TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia
lipolytica
JOURNAL Mol. Cell. Biol. (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.M.Fournier, 06-JUN-1990.
ORF1 is probably not translated, since a lacZ fusion/expression
experiment did not produce results.
FEATURES from to/span description
pept 649 1929 ORF1
pept 1764 655 (c) saccharopine dehydrogenase
mRNA 582 2060 ORF1 mRNA (alt.)
mRNA 598 2060 ORF1 mRNA (alt.)
mRNA 599 2060 ORF1 mRNA (alt.)
mRNA 602 2060 ORF1 mRNA (alt.)
mRNA 606 2060 ORF1 mRNA (alt.)
mRNA 609 2060 ORF1 mRNA (alt.)
mRNA 1807 639 (c) ORF2 mRNA (alt.)
mRNA 1809 639 (c) ORF2 mRNA (alt.)
mRNA 1811 639 (c) ORF2 mRNA (alt.)
mRNA 1812 639 (c) ORF2 mRNA (alt.)
mRNA 1814 639 (c) ORF2 mRNA (alt.)
signal 521 528 TATA box
signal 565 575 TATA box
BASE COUNT 668 a 573 c 666 g 662 t
ORIGIN
1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag
61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt
121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca
181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa
241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag
301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc
361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc
421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt
481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca
541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt
601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc
661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt
721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc
781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc
841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac
901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt
961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca
1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt
1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc
1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt
1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata
1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc
1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc
1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct
1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg
1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg
1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc
1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt
1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag
1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg
1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag
1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg
1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga
1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa
2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca
2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg
2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc
2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct
2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag
2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga
2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat
2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat
2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc
//
LOCUS HUMVINC 5102 bp ss-mRNA PRI 14-AUG-1990
DEFINITION Human vinculin mRNA, complete cds.
ACCESSION M33308
KEYWORDS cytoskeletal protein; vinculin.
SOURCE Human endothelial cells, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 5102)
AUTHORS Weller,P.A., Ogryzko,E.P., Corben,E.B., Zhidkova,N.I., Patel,B.,
Price,G.J., Spurr,N.K., Koteliansky,V.E. and Critchley,D.R.
TITLE Complete sequence of human vinculin and assignment of the gene to
chromosome 10
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5667-5671 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.A.Weller, 28-MAR-1990.
FEATURES from to/span description
pept 51 3251 vinculin
BASE COUNT 1379 a 1248 c 1273 g 1202 t
ORIGIN Chromosome 10.
1 gaattccact tctctgtcgc ccgcggttcg ccgccccgct cgccgccgcg atgccagtgt
61 ttcatacgcg cacgatcgag agcatcctgg agccggtggc acagcagatc tcccacctgg
121 tgataatgca cgaggagggc gaggtggacg gcaaagccat tcctgacctc accgcgcccg
181 tggccgccgt gcaggcggcc gtcagcaacc tcgtccgggt tggaaaagag actgttcaaa
241 ccactgagga tcagattttg aagagagata tgccaccagc atttattaag gttgagaatg
301 cttgcaccaa gcttgtccag gcagctcaga tgcttcagtc agacccttac tcagtgcctg
361 ctcgagatta tctaattgat gggtcaaggg gcatcctctc tggaacatca gacctgctcc
421 ttaccttcga tgaggctgag gtccgtaaaa ttattagagt ttgcaaagga attttggaat
481 atcttacagt ggcagaggtg gtggagacta tggaagattt ggtcacttac acaaagaatc
541 ttgggccagg aatgactaag atggccaaga tgattgacga gagacagcag gagctcactc
601 accaggagca ccgagtgatg ttggtgaact cgatgaacac cgtgaaagag ttgctgccag
661 ttctcatttc agctatgaag atttttgtaa caactaaaaa ctcaaaaaac caaggcatag
721 aggaagcttt aaaaaatcgc aattttactg tagaaaaaat gagtgctgaa attaatgaga
781 taattcgtgt gttacaactc acctcttggg atgaagatgc ctgggccagc aaggacactg
841 aagccatgaa gagagcattg gcctccatag actccaaact gaaccaggcc aaaggttggc
901 tccgtgaccc tagtgcctcc ccaggggatg ctggtgagca ggccatcaga cagatcttag
961 atgaagctgg aaaagttggt gaactctgtg caggcaaaga acgcagggag attctgggaa
1021 cttgcaaaat gctagggcag atgactgatc aagtggctga cctccgtgcc agaggacaag
1081 gatcctcacc ggtggccatg cagaaagctc agcaggtatc tcagggtctg gatgtgctca
1141 cagcaaaagt ggaaaatgca gctcgcaagc tggaagccat gaccaactca aagcagagca
1201 ttgcaaagaa gatcgatgct gctcagaact ggcttgcaga tccaaatggt ggaccggaag
1261 gagaagagca gattcgaggt gctttggctg aagctcggaa aatagcagaa ttatgtgatg
1321 atcctaaaga aagagatgac attctacgtt cccttgggga aatatctgct ctgacttcta
1381 aattagcaga tctacgaaga caggggaaag gagattctcc agaggctcga gccttggcca
1441 aacaggtggc cacggccctg cagaacctgc agaccaaaac caaccgggct gtggccaaca
1501 gcagaccggc caaagcagct gtacaccttg agggcaagat tgagcaagca cagcggtgga
1561 ttgataatcc cacagtggat gaccgtggag tcggtcaggc tgccatccgg gggcttgtgg
1621 ccgaagggca tcgtctggct aatgttatga tggggcctta tcggcaagat cttctcgcca
1681 agtgtgaccg agtggaccag ctgacagccc agctggctga cctggctgcc agaggggaag
1741 gggagagtcc tcaggcacga gcacttgcat ctcagctcca agactcctta aaggatctaa
1801 aagctcggat gcaggaggcc atgactcagg aagtgtcaga tgttttcagc gataccacaa
1861 ctcccatcaa gctgttggca gtggcagcca cggcgcctcc tgatgcgcct aacagggaag
1921 aggtatttga tgagagggca gctaactttg aaaaccattc aggaaagctt ggtgctacgg
1981 ccgagaaggc ggctgcggtt ggtactgcta ataaatcaac agtggaaggc attcaggcct
2041 cagtgaagac ggcccgagaa ctcacacccc aggtggtctc ggctgctcgt atcttactta
2101 ggaaccctgg aaatcaagct gcttatgaac attttgagac catgaagaac cagtggatcg
2161 ataatgttga aaaaatgaca gggctggtgg acgaagccat tgataccaaa tctctgttgg
2221 atgcttcaga agaagcaatt aaaaaagacc tggacaagtg caaggtagct atggccaaca
2281 ttcagcctca gatgctggtt gctggggcaa ccagtattgc tcgtcgggcc aaccggatcc
2341 tgctggtggc taagagggag gtggagaatt ccgaggatcc caagttccgt gaggctgtga
2401 aagctgcctc tgatgaattg agcaaaacca tctccccaat ggtgatggat gcaaaagctg
2461 tggctggaaa catttccgac cctggactgc aaaagagctt cctggactca ggatatcgga
2521 tcctgggagc tgtggccaag gtcagagaag ccttccaacc tcaggagcct gacttcccgc
2581 cgcctccacc agaccttgaa caactccgac taacagatga gcttgctcct cccaaaccac
2641 ctctgcctga aggtgaggtc cctccaccta ggcctccacc accagaggaa aaggatgaag
2701 agttccctga gcagaaggcc ggggaggtga ttaaccagcc aatgatgatg gctgccagac
2761 agctccatga tgaagctcgc aaatggtcca gcaagggcaa tgacatcatt gcagcagcca
2821 agcgcatggc tctgctgatg gctgagatgt ctcggctggt aagagggggc agtggtacca
2881 agcgggcact cattcagtgt gccaaggaca tcgccaaggc ctcagatgag gtgactcggt
2941 tggccaagga ggttgccaag cagtgcacag ataaacggat tagaaccaac ctcttacagg
3001 tatgtgagcg aatcccaacc ataagcaccc agctcaaaat cctgtccaca gtgaaggcca
3061 ccatgctggg ccggaccaac atcagtgatg aggagtctga gcaggccaca gagatgctgg
3121 ttcacaatgc ccagaacctc atgcagtctg tgaaggagac tgtgcgggaa gctgaagctg
3181 cttcaatcaa aattcgaaca gatgctggat ttacactgcg ctgggttaga aagactccct
3241 ggtaccagta ggcacctggc tgagcctggc tggcacagaa acctctacta aaaagaagga
3301 aaatgatctg agtcccagga gctgcccaga gttgctggga gctgaaaaat cacatcctgg
3361 cctggcacat cagaaaggaa tgggggcctc ttcaaattag aagacattta tactcttttt
3421 tcatggacac tttgaaatgt gtttctgtat aaagcctgta ttctcaaaca cagttacact
3481 tgtgcaccct ctatcccaat aggcagactg ggtttctagc ccatggactt cacataagct
3541 cagaatccaa gtgaacacta gccagacact ctgctctgcc cttgttccct aggggacact
3601 tccctctgtt tctctttcct tggctcccat tcactcttcc agaatcccaa gacccagggc
3661 ccaggcaaat cagttactaa gaagaaaatt gctgtgcctc ccaaaattgt tttgagcttt
3721 ccatgttgct gccaaccata ccttccttcc ctgggctgtg ctacctgggt ccttttcaga
3781 agtgagcttt gctgctacag gggaaggtgg cctctgtgga gccccagcat atgggggcct
3841 ggattcattt cctgcccttc ctcagtttaa tccttctagt ttcccacaat ataaaactgt
3901 acttcactgt caggaagaaa tcacagaatc atatgattct gcttttacca tgcccctgag
3961 caatgtctgt gctagggaaa ctccccgtcc catatcctgc ctcagcccgc caaggtagcc
4021 atcccatgaa cacactgtgt cctggtgctc tctgccactg gaagggcaga gtagccaggg
4081 tgtggccctg ccatcttccc agcagggcca ctcccggcac tccatgctta gtcactgcct
4141 gcagaggtct gtgctgaggc cttatcattc attcttagct cttaattgtt cattttgagc
4201 tgaaatgctg cattttaatt ttaaccaaaa catgtctcct atatcctggt ttttgtagcc
4261 ttcctccaca tcctttctaa acaagatttt aaagacatgt aggtgtttgt tcatctgtaa
4321 ctctaaaaga tcctttttaa attcagtcct aagaaagagg agtgcttgtc ccctaagagt
4381 gtttaatggc aaggcagccc tgtctgaagg acacttcctg cctaagggag agtggtattt
4441 gcagactaga attctagtgc tgctgaagat gaatcaatgg gaaatactac tcctgtaatt
4501 cctacctccc tgcaaccaac tacaaccaag ctctctgcat ctactcccaa gtatggggtt
4561 caagagagta atgggtttca tatttcttat caccacagta agttcctact aggcaaaatg
4621 agagggcagt gtttcctttt tggtacttat tactgctaag tatttcccag cacatgaaac
4681 cttatttttt ccaaagccag aaccagatga gtaaaggagt aagaaccttg cctgaacatc
4741 cttccttccc acccatcgct gtgtgttagt tcccaacatc gaatgtgtac aacttaagtt
4801 ggtcctttac actcaggctt tcactatttc ctttaaaatg aggatgatta ttttcaaggc
4861 cctcagcata tttgtatagt tgcttgcctg atataaatgc aatattaatg cctttaaagt
4921 atgaatctat gccaaagatc acttgttgtt ttactaaaga aagattactt agaggaaata
4981 agaaaaatca tgtttgctct cccggttctt ccagtggttt gagacactgg tttacacttt
5041 atgccggatg tgcttttctc caatatcagt gctcgagaca cagtgaagca aattaaaaaa
5101 aa
//
LOCUS HUMPSBGA01 134 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 1.
ACCESSION M32622
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 1 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 134; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept 50 + 113 pregnancy-specific beta-1-glycoprotein 4
(PSG4) precursor, exon 1
sigp 50 + 113 pregnancy-specific beta-1-glycoprotein 4
signal peptide
IVS 114 > 134 PSG4 intron A
BASE COUNT 33 a 44 c 36 g 21 t
ORIGIN Chromosome 19q13.2-13.3.
1 ggatcctagg cttatctcca cagaggagaa cacacaagca gcagagacca tggggcccct
61 ctcagcccct ccctgcacac acctcatcac ctggaagggg gtcctgctca caggtgagga
121 gagaacttcc tggg
//
LOCUS HUMPSBGA02 1010 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 2.
ACCESSION M32623 M22436 D51537
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 2 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2] and hsCGM4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1010)
AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
Zimmermann,W.
TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene
family: Isolation and sequencing of N-terminal domain exons
JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 3 (bases 1 to 1010; for [2])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Thompson, 07-FEB-1989; for [2],[3], 06-MAR-1990.
FEATURES from to/span description
pept + 468 + 833 pregnancy-specific beta-1-glycoprotein 4
(PSG4) precursor, exon 2
sigp + 468 505 pregnancy-specific beta-1-glycoprotein 4
signal peptide
matp 506 + 833 pregnancy-specific beta-1-glycoprotein 4
IVS < 1 467 PSG4 intron A
IVS 834 > 1010 PSG4 intron B
BASE COUNT 281 a 225 c 267 g 237 t
ORIGIN Chromosome 19q13.2-13.3, about 650 bp after segment 1.
1 gagctcacac agtcatcggg gggggaagac agacatgcta agtgatctag aatgtgagat
61 caggtgttga caagaaccct ggagggagga gagcagggaa aggtcagaaa gggaagaccc
121 agggtctctg aaggaggtat caggaaagaa gtctaaggat gccctgatgt gagcaggacc
181 tgagggcagt gtggaggggg ccgtgcggac cctggggaag aggaatccaa aaagaaaaat
241 gccaaggtca gaagtgttga aggaatgggg gtcatgctgc tgatcttgac ctagtgggac
301 agtaggacac acacacatac actcacgccc ctttagtgtg tgtatgtgtt tgtatgtgtg
361 tgtttgtgtg tcttcaaggc tgaggattga agagaccttc tcaggaccca tcttttcacc
421 ccaatacata ggtctcaata ttaactgatg ctctctgtac ctcctagcat cacttttaaa
481 cttctggaat ccgcccacaa ctgcccaagt cacgattgaa gcccagccac ccaaagtttc
541 tgaggggaag gatgttcttc tacttgtcca caatttgccc cagaatcttg ctggctacat
601 ttggtacaaa gggcaaatga catacctcta ccattacatt acatcatatg tagtagacgg
661 tcaaagaatt atatatgggc ctgcatacag tggaagagaa agagtatatt ccaatgcatc
721 cctgctgatc cagaatgtca cgcaggagga tgcaggatcc tacaccttac acatcataaa
781 gcgacgcgat gggactggag gagtaactgg acatttcacc ttcaccttac accgtgagtg
841 attccacatg atccctgggt gttgggggac aggggtcact tctacttcac acacacagga
901 ttctcaggcc tggactctgc ctgtgtccct ctctgcatta agtccatgct ggggtttggg
961 catttagtgc aggacacaca gaggagacaa atttcaacag atcagaattc
//
LOCUS HUMPSBGA03 430 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 3.
ACCESSION M32624
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 3 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 430; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 124 + 402 pregnancy-specific beta-1-glycoprotein 4
(PSG4) precursor, exon 3
matp + 124 + 402 pregnancy-specific beta-1-glycoprotein 4
IVS < 1 123 PSG4 intron B
IVS 403 > 430 PSG4 intron C
BASE COUNT 108 a 117 c 103 g 102 t
ORIGIN Chromosome 19q13.2-13.3, about 5500 bp after segment 2.
1 ttccatgggc ttttaaggac ttgggtgggc tgagaggtgg gaaatgccaa ctctgattga
61 aaaatgcctt tggaagaatc aaagatgcca cacagggcaa acctctctct gttttctgca
121 cagtggagac tcccaagccc tccatctcca gcagcaactt aaatcccagg gaggccatgg
181 aggctgtgat cttaacctgt gatcctgcga ctccacccgc aagctaccag tggtggatga
241 atggtcagag cctccctatg actcacaggt tgcagctgtc caaaaccaac aggaccctct
301 ttatatttgg tgtcacaaag tatattgcag gaccctatga atgtgaaata cggaacccag
361 tgagtgccag ccgcagtgac ccagtcaccc tgaatctcct ccgtgagtat cctctgttcc
421 tctgtgagcc
//
LOCUS HUMPSBGA04 573 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 4.
ACCESSION M32625
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 4 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 573; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 181 365 pregnancy-specific beta-1-glycoprotein 4
(PSG4) precursor, exon 4
369 + 435 pregnancy-specific beta-1-glycoprotein 4
(PSG4) precursor, exon 4
matp + 181 + 435 pregnancy-specific beta-1-glycoprotein 4
IVS < 1 180 PSG4 intron C
IVS 436 > 573 PSG4 intron D
site 366 368 internal stop codon
BASE COUNT 146 a 149 c 132 g 146 t
ORIGIN Chromosome 19q13.2-13.3, about 270 bp after segment 3.
1 agcttgtgat gggagaaaca ggtgtaaggt ctcaggctcc agatcagtga acacagcggg
61 gatttggctg ggacttcagt gttgcgactt ggctcacagg gtcactgtgg cccttccaca
121 gaccaggatt ttcccttccc tctgacaatg tcacctgtga ctttattctc tttgctccag
181 atggcctgga tgcccccacc atttcttcct catacaccta ttaccataca ggggaattcc
241 ccaatctctc ctgcctcaca gactctcacc cactggcaga gcattcttgg ctgattgatg
301 ggaagttcca gcaatcagca caagtgtgct ttatccccca aatcactaaa acatatagag
361 gggtttaggt ctgtttcatc cataactcag ccactgatgg aacaaatctc ataatcaaga
421 ggatcatagt gcctggtaag tggatccctg gagcattggc aatatgtttt ccagtgaagc
481 tatctatgct atcagggaaa agccaccgtc cctcgtcaaa gggagaggga aaatcaaaaa
541 cccagcacag ggaatatgtt tcgtctccgg ggg
//
LOCUS HUMPSBGA05 647 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 5.
ACCESSION M32626
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 5 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 647; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 158 + 436 pregnancy-specific beta-1-glycoprotein 4
(PSG4) precursor, exon 5
matp + 158 + 436 pregnancy-specific beta-1-glycoprotein 4
IVS < 1 157 PSG4 intron D
IVS 437 > 647 PSG4 intron E
BASE COUNT 183 a 168 c 147 g 149 t
ORIGIN Chromosome 19q13.2-13.3, about 1800 bp after segment 4.
1 gatcatcatg catctatctt gtgacagaag caccagctat tggctttcaa ggactcgggt
61 gggctgagag gtgggagatg ccaactctga ttgaagtatg cctgtggagg aatcaaaggt
121 gccacacagg acaatcttct ctctgttatc cacacagcaa agctgtccaa gccctacatc
181 acaatcaaca acttaaaccc cagagagaat aaggatgtct taaccttcac ctgtgaacct
241 aagagtaaga actacaccta catttggtgg ctaaatggtc agagcctccc tgtcagtccc
301 agggtaaagc gacccattga aaacaggatc ctcattctac ccaatgtcac gagaaatgaa
361 acaggacctt atcaatgtga aatacgggac cgatatggtg gcatccgcag tgacccagtc
421 accctgaatg tcctctgtga gtatcttttt gttcctctgt gggccaggac accagcttaa
481 atccaaacca ccagaggcca ggtctctcag tctctctcca gtccaagtat agacaccttt
541 acttctggac atccgaggtg gccatgactc cctgccctgg gaaaacctag gtaggcacag
601 cttaaccaag aatataaggg gaggggacgc tcttgtcatg gagactt
//
LOCUS HUMPSBGA06 558 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 6.
ACCESSION M32627
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 6 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 558; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 45 / 299 pregnancy-specific beta-1-glycoprotein 4
(PSG4) precursor, exon 6
matp + 45 / 299 pregnancy-specific beta-1-glycoprotein 4
IVS < 1 44 PSG4 intron E
IVS 300 > 558 PSG4 intron F
BASE COUNT 161 a 144 c 103 g 150 t
ORIGIN Chromosome 19q13.2-13.3, about 200 bp after segment 5.
1 ttcctcggat gacatcacct atggctttat tctctttgcg ccagatggtc cagacctccc
61 cagcatttac ccttcattca cctattaccg ttcaggagaa aacctctact tgtcctgctt
121 cgccgagtct aacccacggg cacaatattc ttggacaatt aatgggaagt ttcagctatc
181 aggacaaaag ctctctatcc cccaaataac tacaaagcat agtgggctct atgcttgctc
241 tgttcgtaac tcagccactg gcaaggaaag ctccaaatcc atcacagtca aagtctctgg
301 taagtggatc ccagcatcct tggcaatagg gatttaggtg gagtctatct ggcattcagg
361 gaagagtgag gaaaacattt ttattcccag cctgtgtccc atgggcacaa gcaaatccca
421 aattctactc ctgaacactc ccaatttgtc taagaacttc caaaacttta acaaacaggc
481 tgatatcttc ttaaaattcc cagcctagac caagcaggga gaacattgat ttcattgaaa
541 taattgacaa tattgagg
//
LOCUS HUMPSBGA07 621 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 1.
ACCESSION M32628
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 7 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 621; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept 360 + 423 pregnancy-specific beta-1-glycoprotein 5
(PSG5) precursor, exon 1
sigp 360 + 423 pregnancy-specific beta-1-glycoprotein 5
signal peptide
IVS 424 > 621 PSG5 intron A
BASE COUNT 176 a 135 c 193 g 113 t 4 others
ORIGIN Chromosome 19q13.2-13.3, about 7000 bp after segment 6.
1 tatgtgtgtg actgtgtggg tctgtgtgtg tctcttctgt gtgtgtgtgt ctgcacaaag
61 tgtggttgag gtttggtgaa agaatcactg ctgaaaaatg cagaggcctc cacaattccc
121 agggacctga aacacagaca aaaggaaaaa cagaaggagg acaaggaggc aggactgaga
181 gaggagggga cagagaggtg tcctgggcct gaccccaccc atgagcctga gaagtgctcc
241 tgccctggag agaggctcag cacagaagga ggaaggacag cacagcctac agccgtgctc
301 aggaagtttc tggatcctag gctcagctcc acagaggaga acacgcaggc gcagagacca
361 tggggcccct ctcagcccct ccctgcacac agcacatcac ctggaagggg ctcctgctca
421 caggtgagga gagaacttcc tgggagagga caggaggagg aagaagagtg actggattgg
481 gtctcctgga gaggatnnnn ttctaaaaaa taaaagaagt ccagcacttt gggaggatga
541 ggtgggtgga ttatgagatc aggagttcaa ggtcagtcct gccaacacag tgaagccctg
601 tctctactaa aaatacaaaa a
//
LOCUS HUMPSBGA08 173 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, intron
A.
ACCESSION M32629
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 8 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 173; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
IVS < 1 > 173 pregnancy-specific beta-1-glycoprotein 5
intron A
BASE COUNT 56 a 29 c 62 g 26 t
ORIGIN Chromosome 19q13.2-13.3, about 560 bp after segment 7.
1 gagctcacgc tgtcatgggg aggaagacag acatgcaaag agatctagaa tgtgaggtca
61 ggtgttgaca agaaccctgg agggagcaga gcagggaaag gtcagaaagg gaagacccag
121 ggtctctgaa ggacgtgtca ggaaagaagt ctagagatgc cctgatgtga gca
//
LOCUS HUMPSBGA09 547 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 2.
ACCESSION M32630
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 9 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 547; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 9 + 374 pregnancy-specific beta-1-glycoprotein 5
(PSG5) precursor, exon 2
sigp + 9 47 pregnancy-specific beta-1-glycoprotein 5
signal peptide
matp 48 + 374 pregnancy-specific beta-1-glycoprotein 5
IVS < 1 8 PSG5 intron A
IVS 375 > 547 PSG5 intron B
BASE COUNT 149 a 139 c 117 g 142 t
ORIGIN Chromosome 19q13.2-13.3, about 300 bp after segment 8.
1 cctcctagca tcacttttaa acttctggaa cctgcctatc actgctcaag tcacgattga
61 agccctgcca cccaaagttt ccgaggggaa ggatgttctt ctacttgtcc acaatttgcc
121 tcagaatctt gctggctaca tctggtacaa aggacaactg atggacctct accattacat
181 tacatcatat gtagtagacg gtcaaataaa tatatatggg cctgcataca ctggacgaga
241 aacagtatat tccaatgcat ccctgctgat ccagaatgtc acccgggaag acgcaggatc
301 ctacacctta cacatcataa agcgaggtga taggactaga ggagtaactg gatatttcac
361 cttcaactta taccgtgagt gattccacat gatccctggg tgttggggga caggggtcat
421 ttctacttca cacacacagg attgtcaggc ctgcactgtg cctgtgtccc tctctgcatt
481 atgtcccatg ctgcggtttg ggcatttagt gcaggacaca cacagaggag acaaatttca
541 acagatc
//
LOCUS HUMPSBGA10 458 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 3.
ACCESSION M32631
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 10 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 458; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 167 + 445 pregnancy-specific beta-1-glycoprotein 5
(PSG5) precursor, exon 3
matp + 167 + 445 pregnancy-specific beta-1-glycoprotein 5
IVS < 1 166 PSG5 intron B
IVS 446 > 458 PSG5 intron C
BASE COUNT 117 a 122 c 108 g 111 t
ORIGIN Chromosome 19q13.2-13.3, about 5200 bp after segment 9.
1 gagctctcag attgccatgc atctgtcttg taacacacac acctgccatg ggcttttaag
61 gacttgggtg ggctgacagg tgggaaatgc caactctgat tgaaaagtgc ctttggagga
121 atcaaaggtg ccacacaggg caatcttctc tctgttttct gcaaagtgga aactcccaag
181 ccctccatct ccagtagcaa cttaaacccc agggaggcca tggaggctgt gatcttaacc
241 tgtgatcctg cggcttcaga cacaagctac ctgtggtgga tgaatggtca gagcctccct
301 atgactcaca ggttgcagct gtccaaaacc aacaggaccc tctttatatt tggtgtcaca
361 aagtatattg caggacccta tgaatgtgaa atacggaacc cagtgagtgc cagccacagt
421 gacccagtca ccctgaatct cctccgtgag tatcctct
//
LOCUS HUMPSBGA11 359 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 4.
ACCESSION M32632
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 11 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 359; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 95 267 pregnancy-specific beta-1-glycoprotein 5
(PSG5) precursor, exon 4
272 + 347 pregnancy-specific beta-1-glycoprotein 5
(PSG5) precursor, exon 4
matp + 95 + 347 pregnancy-specific beta-1-glycoprotein 5
IVS < 1 94 PSG5 intron C
IVS 348 > 359 PSG5 intron D
site 268 270 internal stop codon
BASE COUNT 89 a 108 c 65 g 97 t
ORIGIN Chromosome 19q13.2-13.3, about 300 bp after segment 10.
1 acttggctca cagggtcact gtggcccttc cacagaccag gattttccct tccctctgac
61 aatgtcacct gtgactttat tctctttgct ccagatggcc tggatgcccc caccatttct
121 tcctcataca cctattacca tacaggggaa gtccccaatc tctcctgcct cacagactct
181 cacccactgg cagagcattc ttggctgatt gatgggaagt tccagcaatc agcacaagtg
241 tgctttattc cccaaatcac caaaacatag agggatctat gtctgtttca tccataactc
301 agccactgat ggaacaaatc tcataatcaa gaggatcata gtccctggta agtggatcc
//
LOCUS HUMPSBGA12 381 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 5.
ACCESSION M32633
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 12 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 381; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 33 + 311 pregnancy-specific beta-1-glycoprotein 5
(PSG5) precursor, exon 5
matp + 33 + 311 pregnancy-specific beta-1-glycoprotein 5
IVS < 1 32 PSG5 intron D
IVS 312 > 381 PSG5 intron E
BASE COUNT 111 a 105 c 81 g 84 t
ORIGIN Chromosome 19q13.2-13.3, about 2000 bp after segment 11.
1 acaggacaat cttctctctg ttatccacac agtgaagctg cccaagccct acatcaccat
61 caacaactca aaacccaggg agaataagga tgtcttagcc ttcacctgtg aacctaagag
121 tgagaactac acctacattt ggtggctaaa tggtcagagc ctcccggtca gtcccagggt
181 aaagcgaccc attgaaaaca ggatcctcat tctacccagt gtcacgagaa atgaaacagg
241 accctatgaa tgtgaaatac gggaccgaga tggtggcatg cgcagtgacc cagtcaccct
301 gaatgtcctc tgtgagtatc ttttgttcct ctgtgggcca ggacaccagc ttaaatctaa
361 atgaccacaa gccagtcctc t
//
LOCUS HUMPSBGA13 540 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 6.
ACCESSION M32634
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 13 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 540; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 40 + 294 pregnancy-specific beta-1-glycoprotein 5
(PSG5) precursor, exon 6
matp + 40 + 294 pregnancy-specific beta-1-glycoprotein 5
IVS < 1 39 PSG5 intron E
IVS 295 > 540 PSG5 intron F
BASE COUNT 132 a 142 c 108 g 158 t
ORIGIN Chromosome 19q13.2-13.3, about 300 bp after segment 12.
1 ggatgacatc acctgtggct tcattcttct ttgctccaga tggtccagac ctccccagca
61 tttacccttc attcacctat taccgttcag gagaaaacct ctacttgtcc tgcttcgcgg
121 aatctaaccc accggcagag tatttttgga caattaatgg gaagtttcag caatcaggac
181 aaaagctctc tatcccccaa attactacaa agcatagagg gctctatact tgctctgttc
241 gtaactcagc tactggcaag gaaagctcca aatccatgac agtcgaagtc tctggtaagt
301 ggatcccagc atccttggca atagggtttt aggtggagtc tacctggctt tcagagaaga
361 gttaggaaaa catttttatt cccagcctgt gtcccatggg cacaagcaaa tcccaaattc
421 tcctcctgaa cccttccaat ttgtctctac aaactctctt ctccttgttt ttctgttttc
481 tcatggctga ccttgtgtct ggcctaagaa agttagggag ggggctttat cagccctgag
//
LOCUS HUMPSBGA14 320 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 7.
ACCESSION M32635
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 14 of 14
SOURCE Human placenta DNA, clones cosPSG4/5-[1,2].
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 320; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 202 245 pregnancy-specific beta-1-glycoprotein 5
(PSG5) precursor, exon 7
matp + 202 245 pregnancy-specific beta-1-glycoprotein 5
IVS < 1 201 PSG5 intron F
IVS 286 > 320 PSG5 intron G
BASE COUNT 94 a 78 c 54 g 94 t
ORIGIN Chromosome 19q13.2-13.3, about 4300 bp after segment 13.
1 ggatcaccat cctagaaaca tggtgagagg gaaaaaaaaa actgcaagaa tataatcatc
61 tcccatcaat ttcccaagag gaataatgta gtacttgaag aaacaattat agagtacctc
121 atgctacatg cctcttcctg aggctccccc atgtaaaata acatcacctt cattccttct
181 ttacttttct ttccatgaca gctccttcag gaataggacg tcttcctctc cttaatccaa
241 tatagcagcc gtgaagtcat ttctgtattt caggaagact ggcaggtatg atggcctttt
301 ctcttatcct ggtcctgcag
//
LOCUS HUMPSBGB1 237 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 4.
ACCESSION M32617
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 1 of 5
SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 237; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept < 1 + 14 pregnancy-specific beta-1-glycoprotein (PSG7),
exon 4 (AA at 2)
IVS 15 > 237 PSG7 intron D
BASE COUNT 76 a 52 c 50 g 59 t
ORIGIN Chromosome 19q13.2-13.3.
1 gatcatagtc cctggtaagt ggatcctgga gcattggcaa tatgttttcc agggaagtct
61 atctggctat cagggaagag ccacctgccc tctgcagagg agagggaaaa tcaaaactca
121 ggaaagggaa tatatttctg ctccaaaacc accaggtttt cctgtacctc actctttcta
181 gatcattctt cagactatgc gcactaacaa tgaacaatct gaaaagaaat taagaaa
//
LOCUS HUMPSBGB2 435 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 5.
ACCESSION M32618
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 2 of 5
SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 435; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 25 + 303 pregnancy-specific beta-1-glycoprotein 7
(PSG7), exon 5
IVS < 1 24 PSG7 intron D
IVS 304 > 435 PSG7 intron E
BASE COUNT 120 a 123 c 87 g 105 t
ORIGIN Chromosome 19q13.2-13.3; about 1800 bp after segment 1.
1 atcttctctc tgttattcac acagtgaagc tgcctaagcc ctacatcacc atcaataact
61 taaaccccag ggagaataag gatgtctcaa ccttcacctg tgaacctaag agtgagaact
121 acacctacat ttggtggcta aatggtcaga gcctcccggt cagtcccagg gtaaagcgac
181 gcattgaaaa caggatcctc attctaccca gtgtcacgag aaatgaaaca ggaccctatc
241 aatgtgaaat acgggaccga tatggtggca tccgcagtga cccagtcacc ctgaatgtcc
301 tctgtgagta tccttgagta tcctttcttt cctctgtggg ccaggacaca ccagcttaac
361 tccaaacgac cagaggccag tcctctcagt ctctctccgg tccaagtata gacatattta
421 cttctggaca tccga
//
LOCUS HUMPSBGB3 274 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 6.
ACCESSION M32619
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 3 of 5
SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 274; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept + 8 / 262 pregnancy-specific beta-1-glycoprotein 7
(PSG7), exon 6
IVS < 1 7 PSG7 intron E
IVS 263 > 274 PSG7 intron F
BASE COUNT 72 a 78 c 52 g 72 t
ORIGIN Chromosome 19q13.2-13.3; about 300 bp after segment 2.
1 cctccagatg gtccagacct ccccagaatt tacccttcat tcacctatta ccattcagga
61 gaaaacctct acttgtcctg ttttgcggac tctaacccac cggcacagta ttcttggaca
121 attaatggga agtttcagct atcaggacaa aagctttcta tcccccagat tactacaaag
181 catagcgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga aagctccaaa
241 tccgtgacag tcagagtctc tggtaagtgg atcc
//
LOCUS HUMPSBGB4 206 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 6 (PSG6) gene, exon 1.
ACCESSION M32620
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 4 of 5
SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 206; for [1])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1],[2] kindly
submitted by J.Thompson, 06-MAR-1990.
FEATURES from to/span description
pept 50 + 113 pregnancy-specific beta-1-glycoprotein 6
(PSG6) precursor, exon 1
sigp 50 + 113 pregnancy-specific beta-1-glycoprotein 6
signal peptide
IVS 114 > 206 PSG6 intron A
BASE COUNT 59 a 50 c 66 g 31 t
ORIGIN Chromosome 19q13.2-13.3; about 7800 bp after segment 3.
1 ggatcctagg ctcatctcca caggggagaa cacacagaca gcagagacca tgggacccct
61 ctcagcccct ccctgcactc agcacatcac ctggaagggg ctcctgctca caggtgagga
121 gaggacttcc tgggagagga caggaggagg aagcagagtg actggatggg gtatctggag
181 aggatggggt tctaaaaaat aaaaga
//
LOCUS HUMPSBGB5 1026 bp ds-DNA PRI 14-AUG-1990
DEFINITION Human pregnancy-specific beta-1-glycoprotein 6 (PSG6) gene, exon 2.
ACCESSION M32621 M22435 D51537
KEYWORDS pregnancy-specific beta-1-glycoprotein.
SEGMENT 5 of 5
SOURCE Human leukocyte DNA, clones lambda-PSG6/7-1 and hsCGM3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1026)
AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
Zimmermann,W.
TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene
family: Isolation and sequencing of N-terminal domain exons
JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
TITLE The human pregnancy-specific glycoprotein genes are tightly linked
on the long arm of chromosome 19 and are coordinately expressed
JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
STANDARD full staff_review
REFERENCE 3 (bases 439 to 846; for [2])
AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
Ammaturo,D. and Hardman,N.
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.A.Thompson, 07-FEB-1989; for [2],[3], 06-MAR-1990.
This gene was named human carcinoembryonic antigen (hsCGM3) in [1],
but [2] says it is actually one of the pregnancy-specific
beta-1-glycoproteins.
FEATURES from to/span description
pept + 484 / 846 pregnancy-specific beta-1-glycoprotein 6
(PSG6) precursor, exon 2
sigp + 484 521 pregnancy-specific beta-1-glycoprotein 6
signal peptide
matp 522 / 846 pregnancy-specific beta-1-glycoprotein 6
IVS < 1 483 PSG6 intron A
IVS 847 > 1026 PSG6 intron B
BASE COUNT 291 a 235 c 264 g 236 t
ORIGIN Chromosome 19q13.2-13.3; about 1200 bp after segment 4.
1 gagctcacac tctcatgggg aggaagacag acatgcaaag agatatagaa tgtgaggtca
61 ggtgttgaca agaaccctag agggagcaga gcagggaaag gtcagaaagg gaagacccag
121 ggtctctgaa gcaggcatca ggaaagaagt ctaaggatgc cctgatgtga gcaggacctg
181 agggcagtgt ggagggggcc gtgcggaccc ctggggaaga ggattgcaaa cagaaaaatg
241 ccaaggtcag gagtgttgaa ggaatggggg tcatgctgct gaccttgacc tagtaggaca
301 gtaggacaca cacacataca cacacacaaa cacacatgcc cttttgtgtg tgtgtgtttg
361 tatgtgtgtg tgtgcatatc ttcaaggctg atgattgaag agaccttctc aggacacagg
421 gccccatctt ttcaccccaa tacataggtc caaatattaa ctgatgctgt ctctacctcc
481 tagcatcact tttaaacttc tggaacctgc ccaccactgc ccaagtaata attgaagcca
541 agccacccaa agtttccgag gggaaggatg ttcttctact tgtccacaat ttgccccaga
601 atcttactgg ctacatctgg tacaaagggc aaatgacgga cctctaccat tacattacat
661 catatgtagt acacggtcaa attatatatg ggcctgccta cagtggacga gaaacagtat
721 attccaatgc atccctgctg atccagaatg tcacacagga ggatgcagga tcctacacct
781 tacacatcat aaagcgaggc gatgggactg gaggagtaac tggatatttc actgtcacct
841 tatactgtga gtgattccgc atgatccctg ggtgttgggg ggcaggggtc atttctactt
901 cacacacaca gaattgtcag gcctggactc tgcctgtgtc actctctgca ttatgtccca
961 tgctggggtt tgggcattta gtgcaggaca cacacagagg agacacattt caacagatca
1021 gaattc
//
LOCUS MSQRTR1A 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 16bs.
ACCESSION M35448
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 16bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 76 a 78 c 64 g 131 t
ORIGIN
1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
61 agcttgatga ggttctacta aaagctaatc gatctcttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttactca atcttcactc tttaagctt
//
LOCUS MSQRTR1B 348 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 916.
ACCESSION M35449
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 916.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 348 T1 retroposon
BASE COUNT 79 a 75 c 63 g 131 t
ORIGIN
1 gtgttttgtc catccgtgac cttggtatat actcgacagt cgtcttaact ttaaactgca
61 gcttgatgag gttctactaa aagctaatcg aactcttggg tttattttac gttttacctc
121 tatttttaga gattaaagct tcttaagaaa cctttattat gctctggtaa ggcctcttct
181 tgaatatgct agcatcatct ggaatcctcc tactattgat ggctgttcga gaattgaaag
241 cattcagcgc ctttttacca gggttgcttt tcgtcgtttg ttcggtgctg cctcactacc
301 tccctatgaa acgagattgc agttattcaa tcttcactct ttaagctt
//
LOCUS MSQRTR1C 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 52a.
ACCESSION M35450
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 52a.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 78 a 76 c 64 g 131 t
ORIGIN
1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcccattc
181 ttgaatatgc tagtatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gccctactac
301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS MSQRTR1D 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone L34.
ACCESSION M35451
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone L34.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 77 a 77 c 64 g 131 t
ORIGIN
1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gccctactac
301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS MSQRTR1E 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 914a.
ACCESSION M35452
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 914a.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 77 a 76 c 65 g 131 t
ORIGIN
1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctgtggta aggcctcttc
181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS MSQRTR1F 343 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 13bs.
ACCESSION M35453
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 13bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 343)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 343 T1 retroposon
BASE COUNT 78 a 73 c 60 g 132 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta taatttacag tcgtcttaac tttaaactgc
61 agcttgatga gattctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
181 ttgaatatgc tagtatcatc tggaattgtc ctactattga tggctgttcg agaattgaat
241 gccttcagcg cctctttacc aggattgctt ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcagtta ttcaatcttc actctttaag ctt
//
LOCUS MSQRTR1G 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 12bs.
ACCESSION M35454
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 12bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 78 a 79 c 62 g 130 t
ORIGIN
1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcttaagaa acccttattg tgctctggta aggcctcttc
181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
241 gcattcagcg cctttttacc agggttgctt ttcatcgttt tttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttactca atcttcactc tttaagctt
//
LOCUS MSQRTR1H 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 9bs.
ACCESSION M35456
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 9bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 77 a 77 c 64 g 131 t
ORIGIN
1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS MSQRTR1I 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 6bs.
ACCESSION M35457
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 6bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 78 a 76 c 64 g 131 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
181 ttgaatatgc tagtatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS MSQRTR1J 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 4bs.
ACCESSION M35461
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 4bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 76 a 77 c 64 g 132 t
ORIGIN
1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
61 agcttgatga ggttctacta aaagctaatc gatctcttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS MSQRTR1K 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 1bs.
ACCESSION M35462
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 1bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 78 a 80 c 64 g 127 t
ORIGIN
1 gtgttgtgtc catccgtgac cttggtataa tactcgacag tcgtcttaac tttaaactgc
61 agcttgatga ggtgctacta aaagctaatc gaacccttgg gtttatttta cgttttacct
121 ctatttttag agatcaaagc ttcctaagca acctttattg tgctttggta aggcctattc
181 ttgaatatgc tagtatcctc tggaatcctc ctactattga ttgctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgctt ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1L 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 5bs.
ACCESSION M35463
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 5bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 81 a 80 c 64 g 124 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1M 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 3bs.
ACCESSION M35464
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 3bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 80 a 80 c 65 g 124 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc aagtatcatc tggagtcctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1N 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 11bs.
ACCESSION M35465
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 11bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 82 a 80 c 64 g 123 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacca
121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1O 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 14bs.
ACCESSION M35466
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 14bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 81 a 82 c 64 g 122 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
61 agcttgatga ggtgctacta aaagcaaacc gaacccttgg gtttatctta cgttttacct
121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgtct gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1P 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 31b.
ACCESSION M35467
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 31b.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 80 a 80 c 65 g 124 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
61 agcttgatga ggtgctgcta aaagcaaatc gaacccttgg gtttatctta cgttttacct
121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1Q 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 15bs.
ACCESSION M35468
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 15bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 81 a 80 c 64 g 124 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1R 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 17bs.
ACCESSION M35469
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 17bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 82 a 79 c 64 g 124 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgacttaac tttaaactgc
61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
121 ctatttttag agattaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1S 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 18bs.
ACCESSION M35470
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 18bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 82 a 81 c 64 g 122 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc aagtatcatc tggaaccctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgatag cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1T 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 19bs.
ACCESSION M35471
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 19bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 81 a 82 c 64 g 122 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
61 agcttgatga ggtgctacta aaagcaaacc gaacccttgg gtttatctta cgttttacct
121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgtct gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS MSQRTR1U 349 bp ds-DNA INV 14-AUG-1990
DEFINITION A.gambiae T1 retroposon, clone 28bs.
ACCESSION M35472
KEYWORDS retroposon; retrotransposon.
SOURCE A.gambiae (colony G3) DNA, clone 28bs.
ORGANISM Anopheles gambiae
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Nematocera;
Culicoidea; Culicidae; Anophelelinae.
REFERENCE 1 (bases 1 to 349)
AUTHORS Besansky,N.J.
TITLE Evolution of the T1 retroposon family in the Anopheles gambiae
complex
JOURNAL Mol. Biol. Evol. 7, 229-246 (1990)
STANDARD simple staff_review
FEATURES from to/span description
trns 1 349 T1 retroposon
BASE COUNT 79 a 80 c 64 g 126 t
ORIGIN
1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
61 agcttgatgt ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
181 ttgaatatgc tagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS SHRMTRRDA 1136 bp ss-RNA RNA 14-AUG-1990
DEFINITION Brine shrimp mitochondrion 16S ribosomal RNA precursor.
ACCESSION X12965 M21833
KEYWORDS 16S ribosomal RNA; ribosomal RNA.
SOURCE Brine shrimp mitochondrion, cDNA to RNA, clones pArLSU-[1,2].
ORGANISM Mitochondrion Artemia salina
Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea;
Branchiopoda; Sarsostraca; Anostraca; Artimiidae; Artemia salina.
REFERENCE 1 (sites)
AUTHORS Sastre,L.
TITLE ;
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 2 (bases 1 to 1136; enum. 1 to 1136)
AUTHORS Sastre,L.
TITLE ;
JOURNAL Unpublished (1988) see COMMENT for author address
STANDARD simple automatic
REFERENCE 3 (bases 518 to 1129)
AUTHORS Palmero,I., Renart,J. and Sastre,L.
TITLE Isolation of cDNA clones coding for mitochondrial 16S ribosomal RNA
from the crustacean Artemia
JOURNAL Gene 68, 239-248 (1988)
STANDARD full staff_review
REFERENCE 4 (bases 1 to 1136; revises [1],[2],[3])
AUTHORS Sastre,L.
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT Author address: Sastre L., Instituto de Investigaciones Biomedicas
del CSIC, Av Arzobispo Morcillo 4, 28029 Madrid, Spain.
Submitted (22-SEP-1988) on tape to the EMBL data library. [1]
Submitted (17-NOV-1988) to the EMBL data library. [2]
Draft entry and computer-readable sequence for [3] kindly provided
by L.Sastre, 01-MAR-1989; for [4], 12-MAY-1989.
FEATURES from to/span description
rRNA 1 1136 16S ribosomal RNA
revision 548 550 cct in [2]; ct in [1]
revision 555 557 act in [2]; at in [1]
revision 778 780 gtc in [2]; gc in [1]
revision 906 908 tta in [2]; ta in [1]
revision 981 982 tc in [2]; ttc in [1]
revision 1003 1007 cagcc in [2]; cc in [1]
revision 1111 1112 ta in [4]; tca in [2]
BASE COUNT 382 a 196 c 218 g 340 t
ORIGIN
1 atttaaaaca atattatatt agtaaaagaa tcgaatttat tcaaggtcta agtactgaaa
61 aggaaagtct actaattaca aagaagttat tgtttgtacc ttttgtatca gggtttatta
121 aatttcaatc tttattgctg aatcccgaaa caaggtgatc tatattggtt aaagatgatt
181 gtttcataaa tctcttcaag accatatcgt tgcgaaaata tttcgtacct ttatatctgg
241 ttggcagaat actcctttag ggtactccgc aaggaagatt ttctagggga cgagctctag
301 aaaaaattat attataatta tcaagaaaaa catgactagg cttaaaagtt gcctcggttt
361 aataaccatt gttagtaaaa tgataatttc aaaataatta attaagactg gccgatgcta
421 attacataaa tttagcaatc acaatgatag aattagtaga actataaatg cacttaacac
481 aatattcctc acaaaatccc aaaggaattc ggcaaaactc atctccgcct gtttaacaaa
541 gacatcgcct cttaactttg ggaggtcgag cctgcccact gatgaattaa agggccgtgg
601 tatactgacc atgcgaaggt agcataatca ttagcctttt gatttgaggc tggaatgaat
661 ggtttgacga gagatggtct gtctcttcga ttaaattgaa gttaatcttt aagtgaaaaa
721 gcttaaatgt acttggaggg cgataagacc ctatagatct ttacatttaa ttcttttgtc
781 ttgcggtagg taattagaca gagtaaaaca atgttcggtt ggggcgacgg taagaacaga
841 ataaacactt acaacataaa cacatcaata aatgaccatt gatccttaga tgaataaaga
901 ccaagttacc ttagggataa cagcgtaatt ctttttgaga gttcaaatcg acaaaagagt
961 ttgcgagcct cgatgttggt tcagggaccc tactcggtgc agcagcctag agaggcagtc
1021 tgttcgactg ttaaacccct acgtgatctg agttcagacc gacgtaagcc aggttagttt
1081 ctacccccaa ggcaaatcca ctcatgatag tacgaaagga cctcttgaga ttttat
//
LOCUS RATI145TR 9852 bp ss-mRNA ROD 14-AUG-1990
DEFINITION Rat inositol-1,4,5-triphosphate receptor mRNA, complete cds.
ACCESSION J05510
KEYWORDS inositol-1,4,5-triphosphate receptor.
SOURCE Rat adult brain, cDNA to mRNA, clones pI16, pI17, pI19, and pI25.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 9852)
AUTHORS Mignery,G.A., Newton,C.L., Archer,B.T.III. and Suedhof,T.C.
TITLE Structure and expression of the rat inositol-1,4,5-triphosphate
receptor
JOURNAL J. Biol. Chem. 265, 12679-12685 (1900)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.C.Suedhof, 19-APR-1990.
FEATURES from to/span description
pept 330 8579 inositol-1,4,5-triphosphate receptor (alt.,
clone pI16)
pept 330 1280 inositol-1,4,5-triphosphate receptor, exon 1
(alt., clones pI17, pI19, and pI25)
1326 8579 inositol-1,4,5-triphosphate receptor, exon 2
(alt., clones pI17, pI19, and pI25)
IVS 1281 1325 inositol-1,4,5-triphosphate receptor intron A
(no splice consensus)
signal 9033 9083 poly-A signal
signal 9832 9837 poly-A signal
variant 752 752 a in pI16; g in pI25
variant 802 802 g in pI16; a in pI17
variant 1923 1923 g in pI16; a in pI17
variant 2181 2181 g in pI16; a in pI17
variant 3340 3340 g in pI16; a in pI17
BASE COUNT 2688 a 2324 c 2498 g 2342 t
ORIGIN
1 tggggtcact tccttcacac aggactttgg cggaaatctc tacacccgtg tctcgtcatc
61 cccttcgaga gcattaacta tttaaaacca ggagagaaag cgcacgccga gaggaggtgt
121 gggtgttccg ctcccatcct aacggaacga gctccctctt cgcggacatg ggattgccca
181 gcggctgcta acccctgtcc tggtcctgat ccctcaaacc ggcgtggctc cccggtcacc
241 aaggagctga ctacaaagga tttgcatcct tggctgggag tccattggct acagagtgcc
301 tgacctgggt caggcttttc aacaaggaca tgtctgacaa aatgtctagt ttcctacata
361 tcggagacat ttgttctctg tatgcagagg gatctacgaa tggatttatc agcaccttgg
421 gcttggttga tgaccgttgc gttgtacagc cagaagctgg ggaccttaac aatccaccca
481 agaaattcag agactgcctc tttaagctat gtcctatgaa tcgatattct gcacagaagc
541 agttctggaa agctgctaag cccggggcca acagtacaac agatgcggtg ctgctcaaca
601 aattgcatca tgctgcagac ttggaaaaga agcagaatga gacagaaaac aggaaattgc
661 tggggacggt catccagtat ggcaacgtga tccagctcct acatttgaaa agcaataaat
721 acttaactgt gaataagagg cttcctgccc tactggagaa gaacgccatg agagtgacgt
781 tggatgaggc tggaaatgaa gggtcttggt tttacattca gccgttctac aagctccgct
841 ccatcggaga cagtgtggtc ataggtgaca aggtagtttt gaaccctgtc aatgctgggc
901 agccgctaca tgccagcagc catcagctgg tggataaccc gggctgcaat gaggtcaact
961 ccgtcaactg taacacaagc tggaaaatag tgcttttcat gaaatggagt gataacaaag
1021 atgacattct aaaaggaggt gatgtggtaa gactcttcca tgctgagcaa gagaagtttc
1081 tcacgtgcga tgagcacagg aagaagcagc acgtcttcct gaggacaacc ggcagacagt
1141 cagccacgtc ggccaccagc tccaaagccc tgtgggaagt ggaggtagtc cagcatgacc
1201 catgtcgggg tggagctggg tactggaata gcctcttccg gttcaagcac ctggctacag
1261 ggcattactt ggcagcagag gtagaccctg actttgagga agaatgcctg gagtttcagc
1321 cctcagtgga ccctgatcag gacgcatctc ggagcaggtt gagaaatgcc caagaaaaaa
1381 tggtgtactc tctggtctct gtgcctgagg gcaacgacat ctcctccatc tttgagctag
1441 accccacaac tctacgtgga ggtgacagcc ttgtcccaag gaactcctat gtccggctca
1501 gacacctgtg tacaaacacc tgggttcaca gcacaaacat ccccattgac aaggaagagg
1561 aaaaacctgt catgctgaag attggtacct ctcccctgaa ggaggacaag gaagcttttg
1621 ccatagtccc cgtttctcct gctgaggttc gggacctgga ctttgccaat gatgccagca
1681 aggtgctggg ctccatcgct gggaagttgg aaaagggcac catcacccag aatgagagaa
1741 ggtctgtcac caagctgctg gaagacttgg tttactttgt cacgggtgga acgaactctg
1801 gccaagacgt gcttgaagta gtcttctcta agcccaaccg agagcggcag aagctgatga
1861 gggaacagaa tattctcaag cagatcttca agcttttgca ggcccccttc acggactgcg
1921 gggatggccc aatgcttcgg ctggaggagc tgggggacca gcggcacgct cctttcagac
1981 acatttgccg actctgctac agggtcctga gacactcaca gcaagactac aggaagaacc
2041 aggagtacat agccaagcag tttggcttca tgcagaagca gattggctat gatgtgctgg
2101 ccgaagacac catcactgcc ctgctccaca acaaccgaaa gctcctggaa aagcacatca
2161 ccgcagcaga gattgacacg tttgtcagcc tggtgcgaaa gaacagggag cccaggttct
2221 tggattacct ctctgacctc tgtgtatcca tgaataagtc aatccctgtg acccaggagc
2281 tcatttgtaa agctgtgctg aatcccacca atgctgacat cctgattgag accaagttgg
2341 ttctttctcg ttttgagttt gaaggcgttt ccaccggaga gaatgctctg gaagctgggg
2401 aagatgagga agaggtgtgg ctgttctgga gggacagcaa caaagagatc cgtagcaaga
2461 gtgttaggga attggcacaa gatgctaaag agggacagaa ggaagacagg gatgtcctca
2521 gctactacag gtatcagctg aacctctttg caaggatgtg tctggaccgc cagtacctgg
2581 ccatcaatga aatatccggg cagctggatg ttgacctcat cctccgctgc atgtccgatg
2641 agaacctccc ctatgacctc agagcatcct tttgccgcct catgcttcac atgcatgtgg
2701 accgagatcc ccaggagcag gtgacccctg tgaaatacgc ccgtctgtgg tcagaaattc
2761 cctccgagat cgccattgac gactatgaca gcagtggagc atcgaaggat gaaatcaagg
2821 agaggtttgc tcagacgatg gagtttgtgg aggagtacct gagagatgtg gtttgtcaaa
2881 gattcccctt ttctgataag gagaaaaata agctcacatt tgaggttgtg aacttagcca
2941 ggaacctcat atactttggc ttctacaact tttctgacct tctccgatta accaagatcc
3001 tcttggcaat cttagactgt gttcatgtga caaccatctt ccccattagc aagatgacga
3061 aaggcgaaga gaataaaggc agcaacgtga tgagatctat ccatggagtt ggggagctca
3121 tgacccaggt ggtgcttcgg ggaggaggct ttttgcccat gactcctatg gctgctgccc
3181 ctgaaggaaa tgtgaagcag gcagagccag agaaagagga tatcatggtc atggacacca
3241 agttaaagat cattgaaata ctccagttta ttttgaatgt gagattggat tataggatct
3301 cctgcctcct gtgtatattt aagcgggagt ttgatgaaag caattcccag tcatcagaaa
3361 catcctccgg aaacagcagc caagaagggc caagtaatgt accaggtgct cttgactttg
3421 aacacattga agagcaagca gaaggcatct ttgggggaag tgaggagaac acacctctgg
3481 acttggatga tcatggtggc agaaccttcc tcagggtcct gctccacttg acaatgcatg
3541 attacccacc cctggtgtct ggggccctgc agctcctctt ccggcatttc agccagaggc
3601 aggaggtcct tcaggccttc aaacaggttc aactgctggt tactagccaa gatgtggaca
3661 actacaaaca gatcaagcaa gacttggacc agctaagatc catcgtggag aagtctgagc
3721 tctgggtata caaaggccag ggtcccgatg agcctatgga cggagcctct ggtgaaaacg
3781 aacataagaa aactgaggag gggaccagta agccactgaa gcacgaaagc accagcagct
3841 acaactaccg agtggtgaaa gagattctga ttcgacttag caagctctgt gttcaagaga
3901 gcgcgtcggt gaggaagagc aggaagcagc agcagcgact gctgaggaac atgggtgccc
3961 atgctgtggt gctggagctg ctgcagatcc cctatgagaa ggccgaagat accaagatgc
4021 aggagatcat gaggctggct cacgaatttc tgcagaattt ctgcgcaggc aaccagcaga
4081 accaagcttt gctacataaa cacataaacc tgtttctcaa cccagggatc ctggaagcag
4141 tgaccatgca gcacatcttc atgaacaact tccagctgtg cagtgagatc aacgagagag
4201 tggtccagca tttcgttcac tgcatagaga cccacggtcg aaatgtccag tatatcaagt
4261 ttctccagac tattgtcaag gccgaaggga aattcattaa aaagtgccaa gacatggtca
4321 tggctgagct tgtcaattct ggagaggacg tcctcgtgtt ctacaatgac agagcctctt
4381 tccagactct gatccagatg atgcggtccg agcgtgaccg gatggatgag aacagccctc
4441 tcttcatgta ccacatccat ctggtggagc tcttggctgt gtgcacagag ggcaagaatg
4501 tgtacacaga gatcaagtgc aactccctgc tcccgctcga tgacatcgtt cgtgtggtca
4561 ctcatgaaga ctgtatcccc gaggttaaga ttgcttacat caacttcctg aatcactgct
4621 atgtggacac tgaggtggag atgaaggaga tttacacaag taatcacatg tggaagttgt
4681 ttgagaattt cctcgtggac atctgcaggg cctgtaacaa cacaagtgac aggaagcatg
4741 cagactcggt tctggagaag tacgtcactg aaatcgtgat gagcattgtc accaccttct
4801 tcagctctcc cttctcagac cagagtacca ctctgcagac tcgccagcct gtctttgtgc
4861 agctcctgca aggcgtgttc cgagtttacc actgcaactg gttgatgccg agccagaaag
4921 cctcggtgga gagctgtatc cgcgtgctct ctgacgtagc caagagccgg gccatagcca
4981 ttcctgttga cctagacagc caagtcaaca acctcttcct gaagtcccac aacattgtgc
5041 agaaaacagc catgaactgg cggttatcag cccgcaacgc tgctcgtaga gactctgtcc
5101 tggcagcttc cagagactac cgaaatatta ttgagaggtt acaggacatc gtgtctgccc
5161 tggaggaccg actcaggccc ctggtccagg ccgagctgtc tgtgctcgtg gatgttctac
5221 acagaccgga actgcttttc cctgagaaca cggatgccag gaggaaatgc gaaagtggag
5281 gtttcatctg caagctcata aaacacacca agcagctgct agaggagaat gaagagaaac
5341 tgtgcattaa agtcttacag accctcaggg aaatgatgac caaagataga ggctatggag
5401 aaaagcaaat ttccattgat gaattggaaa atgccgagct gccacagcca ccggaagctg
5461 agaactccac agaggagctt gaaccaagtc cacccctgag gcaactggaa gaccataaaa
5521 ggggtgaggc actccggcaa attttggtca accgttacta tggaaacatc agaccttcag
5581 gaagaagaga gagccttacc agctttggca atggtccact atcaccagga ggacccagca
5641 agcctggggg aggaggggga ggtcctgggt ccggctccac aagccggggt gagatgagcc
5701 tggctgaggt tcaatgtcat ctcgacaagg aaggcgcctc caatctggtc atcgacctca
5761 taatgaatgc atccagcgac cgagtattcc acgaaagcat tctcctggcc atcgcacttc
5821 tggaaggagg caacaccact atccagcact catttttctg ccggctgacg gaagataaga
5881 aatcagagaa gttcttcaag gtcttttacg atcgaatgaa ggtggcccag caggaaatca
5941 aggcgacagt gaccgtgaac accagcgatt tgggaaacaa aaagaaagat gatgaagtgg
6001 accgggatgc cccatcccga aagaaagcca aagagcccac aacacagata acagaagagg
6061 tccgggatca gctcttggaa gcatctgctg ccaccagaaa agcctttacc accttccgga
6121 gggaggccga ccctgatgac cattaccaat ctggggaggg cacccaggct acaacggaca
6181 aagccaagga tgacctagag atgagtgcag tcatcaccat catgcagcct atcctgcgct
6241 tcctgcagct cctgtgtgaa aaccacaacc gcgatctgca gaatttcctt cgttgccaaa
6301 ataataagac caactacaac ttggtgtgtg agacactgca gtttctggac tgtatctgtg
6361 ggagcacaac tggaggcctt ggtcttctcg gactgtacat aaatgaaaag aatgtagctc
6421 ttatcaacca aaccctggag agtctgactg agtactgtca agggccttgc catgaaaacc
6481 agaactgcat cgccacccac gagtccaatg gcatcgatat catcacagcc ctcatcctta
6541 acgatatcaa ccctctgggg aagaaacgga tggacctggt gttagaactg aagaacaatg
6601 cttcaaagct actcctggcc atcatggaaa gcagacacga tagtgaaaat gcagagagga
6661 tcctgtacaa catgaggccc aaggagctgg tggaagtgat caagaaggcc tacatgcaag
6721 gcgaagtgga gtttgaggat ggggagaacg gtgaggatgg agcagcctcc cccaggaacg
6781 tgggccataa catctacatc ctcgctcacc agttggctcg gcataacaaa gaacttcaaa
6841 ccatgctgaa acctggaggc caggtggatg gcgatgaagc tctggagttc tatgcgaagc
6901 acacagcaca aattgagatt gtcagactgg accgaactat ggaacagatt gtcttccctg
6961 tgcccagcat ctgtgaattt ctgactaagg aatcgaaact tcgaatatat tacaccacag
7021 agagggatga gcaaggcagc aagatcaatg acttcttcct gcgctccgag gacctcttta
7081 atgagatgaa ctggcagaag aaacttcgag cccagcctgt cttgtactgg tgcgcccgaa
7141 acatgtcttt ctggagcagc atctccttca acctggccgt cctgatgaac ctgctggtgg
7201 cgtttttcta cccatttaaa ggagtgagag gaggaacact agagccgcac tggtccggcc
7261 tcctgtggac agccatgctc atctctctgg caattgtcat tgcgctgccc aagccccatg
7321 gcatccgggc cttaattgct tctacaatcc tacgactgat attttcagtt ggattacagc
7381 ccacactgtt tctgctggga gctttcaatg tctgcaataa aatcatcttc ctgatgagct
7441 ttgtgggcaa ctgtgggacc ttcaccagag gctaccgagc catggttctg gacgttgagt
7501 tcctctatca tttgctgtat ctactgatct gtgccatggg cctcttcgta catgagtttt
7561 tctatagttt gctgcttttt gatttagtgt acagagagga gactttgctt aatgtcatta
7621 aaagtgtcac ccgcaatgga cggcccatca tcttgacagc ggccctggct ctgattctgg
7681 tttacctgtt ctcaattgtg ggctacctgt tcttcaagga tgactttatc ttggaagtag
7741 acaggttgcc caatgagaca gctggtccag aaactggtga gagtttggcc aacgatttcc
7801 tgtactctga tgtgtgccgg gtagagacgg gggagaactg cacctctccg gcacccaaag
7861 aagagctgct ccctgtggaa gaaacggaac aggataagga gcacacgtgt gagacactgc
7921 tcatgtgcat cgtcaccgtt ctgagtcacg ggctgcggag tgggggcgga gtaggagatg
7981 tgctcaggaa gccatccaaa gaggagcctc tgtttgctgc aagggtgatc tatgacctcc
8041 tcttcttctt catggtcatc atcattgtcc tgaacctcat ttttggggtc atcatcgaca
8101 cctttgctga cctgaggagt gagaagcaga agaaggaaga gatcttaaaa accacgtgct
8161 tcatttgtgg cttggaaagg gacaagtttg acaataagac cgtcaccttt gaagagcaca
8221 ttaaggaaga acacaacatg tggcactatc tgtgcttcat cgtgctggtg aaagtgaagg
8281 actccaccga gtacaccggg cccgagagct acgtggcaga gatgatcagg gaaagaaacc
8341 ttgattggtt ccccaggatg agagccatgt ccctggtcag cagtgactct gaaggggaac
8401 agaacgagct gaggaacctg caggagaagc tggagtccac catgaagctg gtcaccaatc
8461 tttctggcca gctgtcagaa ctaaaggacc agatgacaga acaaaggaag cagaaacaaa
8521 gaattggcct tctaggacat cctcctcaca tgaatgtcaa cccacagcag ccagcctaag
8581 caaatgagac ggagggactg tactcaaccc tttttaaatt attgtcagcg tgggtctggc
8641 tcttggttct gatctgccct ctaaggttac atgtgcgctt agtacatttg taaatactcg
8701 gttttgtatt gtatgtatat gattgcaatt ctcagaggtt tggactttcg tattgtaatt
8761 agctctgttg gcatggtgac ttgtcactcc tgccaaaaat attaaaaatg ccttttttgg
8821 aaggactaca gaaagtacct gatttgcact tgaaccagat tatagattta aaaagtattt
8881 gacatgtatt ttgttattta aaactagaat agccagtatt tatgtttttt ataaaactgt
8941 gcaatacaaa ttatgcaatc accataactc tgtaactcct gagtgtctta agggagtgcg
9001 catctttgaa gctgatctgt taatacttat gtaataaatg gttaaatatc aaatgatgat
9061 gctgctgcca aaattatatt aatagcaagt ttctggcccc tgggcaattt tgtaccttgt
9121 aattatccta tggtgatgct gtttctcgtt gctaatggca ttagtgcctc gtatcctagt
9181 gataactcca ggtctgtgaa ccattcaaac aacattcatt tttgagacaa agcaacttta
9241 gtttcaaggg taattttaag cttcaaaatt aatcatttaa agtatttctt taagagagcc
9301 atgttagagg ctcacacgtt agcttgaaag gagtcgatga aataattttt ttaaagggaa
9361 ctttttacac gacgtttgga ataacagcat attgctgacc agtcagtgtc atctcccagg
9421 tgaattttga tgtcacgtta tagtcaaatg agctagctta tggtttctag attctcttca
9481 tctgaaccaa catgatgcag taggtgagaa gttattatgc atatatacac atatacatcc
9541 atataaggca aagtaggagc tgtcccctta ggatgcatag ctgccccctt agggtgcaga
9601 gctcaacact gacagtgctg ttctgagaga gccacgtttg ggttttattt ctttgtcaca
9661 tgatttcttt tctggatgtg tgcaagtata acaggaagtg ttttctacct gttgcttgtt
9721 ttgtacctgg gtctcgtttt actagaccat ctctgcacaa aagtttaaaa aatgaaccgt
9781 aagcagagtt ctgaagcaag tcaagtttgt aagtgcatac ctgaaattat ttaataaatg
9841 acgcagaata ct
//GenBank-Updates@genbank.bio.net (08/15/90)
LOCUS RHPNIFDK 3500 bp ds-DNA PLN 15-AUG-1990
DEFINITION Parasponia rhizobium nifD and nifK genes coding for the alpha- and
beta-subunits of the Mo-Fe protein of nitrogenase, complete cds.
ACCESSION X01139
KEYWORDS nitrogenase.
SOURCE Parasponia rhizobium (strain ANU289) DNA, clones pR289nif-[3,4,5].
ORGANISM Parasponia rhizobium
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Hamamelidae; Urticales; Ulmaceae.
REFERENCE 1 (bases 1 to 3500)
AUTHORS Weinman,J.J., Fellows,F.F., Gresshoff,P.M., Shine,J. and Scott,K.F.
TITLE Structural analysis of the genes encoding the molybdenum-iron
protein of nitrogenase in the Parasponia rhizobium strain ANU289
JOURNAL Nucleic Acids Res. 12, 8329-8344 (1984)
STANDARD simple staff_review
COMMENT EMBL features not translated to GenBank features:
key from to description
PRM 108 124 consensus promotor sequence
SITE 135 135 transcription start
RBS 163 168 pot. ribosome binding site
RBS 1753 1758 pot. ribosome binding site
SITE 3322 3355 pot. stem-loop structure
FEATURES from to/span description
pept 176 1678 Mo-Fe protein alpha-subunit
pept 1767 3308 Mo-Fe protein beta-subunit
BASE COUNT 826 a 957 c 992 g 725 t
ORIGIN 1 bp upstream of EcoRI site.
1 gaattctccg tgcaaagcgc gatgtcgcct tcgcaacaac aaccagcccc atcggacgaa
61 acgcgctaac tgtttttatt tattctgctt tttgtgctcg cgccgcgctg gcatgctcgt
121 tgcagtcttg ttcaagaagc tgctcccgca cagttaattc ttgaaggaca tcagcatgag
181 tctcgccacg acccagagca tcgcagaaat cagggctcgc aataaagagc tgatcgagga
241 ggtgctgaaa gtctatccgg agaagaccgc gaaacggcgt gccaagcacc tcaacgttca
301 ccaagccggc aagtcggact gcggggtcaa gtccaacatc aaatcaatac ctggtgtgat
361 gacaatcaga ggctgcgcct atgcaggatc caaaggggtg gtctggggac cgatcaagga
421 catggtccat atcagccatg gcccggtcgg ctgtggtcag tattcgtggg gctcgcgtcg
481 caactattat gttggcacga cgggcgtcga tagtttcgtg accctgcagt tcacctccga
541 cttccaggaa aaggacatcg tatttggcgg cgacaagaag ctgatcaaag tccttgacga
601 aatccaggag ctgttcccgc tcaacaacgg catcaccatc caatcggaat gcccgatcgg
661 actgatcggg gacgacatcg aggctgtgtc aagatcgaaa tccaaagaat acggcggcaa
721 gaccatcgtg cctgttcgct gtgagggctt tcgcggcgtg tcgcaatcgc ttggccacca
781 cattgccaat gacgcggtgc gcgattggat cttcgacaag ctagagcccg agggcgaacc
841 aaagttccag ccgacgccct acgacgttgc gatcatcgga gactacaata ttggcggcga
901 tgcctggtca tcgcgcattc tgctggaaga aatgggcttg cgggtgattg cgcagtggtc
961 cggcgacggt tccctcgccg aactcgaagc aacgccgaag gcaaagctca atattctgca
1021 ttgctaccgt tccatgaact acatctcccg ccacatggag gagaagtttg gcatcccctg
1081 gtgcgagtac aacttcttcg gaccgtcgaa gatcgcagaa tcgctgcgca agattgcggg
1141 ctatttcgac gacaagatca aggaaggcgc cgagcgagta attgaaaaat accagccact
1201 ggtggacgcc gtaatcgcaa aatatcgccc ccgcctggag ggcaagactg tgatgctgta
1261 cgtcggcggg cttcgtccac gtcatgtgat tggcgcgtac gaggatctcg gcatggaagt
1321 cgtgggcacc ggatacgagt tcggccacaa cgacgattat cagcgcaccg cccagcacta
1381 cgttaaggac agcacgctca tctacgacga cgtcaatggc tatgaattcg agcgcttcgt
1441 cgaaaaggtc caaccagatc tggttggctc gggcatcaag gagaaatacg ttttccaaaa
1501 gatgggtgtg ccgttcccgg agatgcattc ctgggactat tccggcccat atcacggcta
1561 tgacggcttt gcgatcttcg cgcgggacat ggacatggct gtcaactcgc cgatctggaa
1621 gaagacgaag gccccctgga aggaagctgc gaagccgaag ctcttggctg cagaataaca
1681 agcacttggt tccacaatag agcgatcaat cccgctctct gcggagagct ggggcgacat
1741 catttcgata gtgaaggatc ttaacaatgg cgcagagtgc agaccatgtg ctcgatcatc
1801 tcgaactgtt ccgcggtcca gaataccaac aaatgctggc cgacaagaag atgttcgaga
1861 atccccgcga tcctgccgag gtcgaacgta tccgagcagt gacgaaaacg cccgaatatc
1921 gcgagaagaa ttttgcggag gcgcttgcgg taaatccggc caaggcttgc cagccgcttg
1981 gcgccgtatt cgtctcggtt ggttttgaag gcacgctgcc cttcgtccat ggctcgcagg
2041 gctgcgtggc ctattaccgc agccatctgt cgcggcactt caaggagccg agctcctgcg
2101 tgtcttcgtc gatgacggaa gacgccgctg tattcggggg gctgaacaat atgatcgatg
2161 gcctcgccaa cagctacaac atgtacaaac ccaagatgat ttgctcgacg acctgcatgg
2221 ccgaggtgat cggcgatgac ctgaacgcct tcatcaagac atcaaaagaa aaaggctcgg
2281 ttcggcggag ttcgactcct ttcgcgcaca ctccagcgtt cgtcggcagc cacgtcaccg
2341 gctatgacaa cgcactcaag ggcattctcg agcacttttg gaacggcaag gccggaacgg
2401 cgccgaagct ggagcgcaaa ccaaacgagg caatcaacat catcggcggt ttcgatggca
2461 ataccgttgg aaaccttcgt gagatcaagc gaatcttagc gttgatgggc atcaaacaca
2521 cgattctcgc cgataactct gaagtcttcg ataccccgac tgatggcgag ttccggatgt
2581 atgacggcgg tacccacgtg gaggacacgg ccaacgcgat tcacgccaag gcgacaatct
2641 ccatgcagca atggtgtacg gaaaaaacgc tgccgttcgt gtccgagcat ggacaggacg
2701 ttgtgtcttt caattacccg gtaggtgtat ccgcgacgga tgatcttctc gtggccttgt
2761 cacgcatcag cggcaaggag attccggagc aactcgcgcg agagcgtggc cgcttggttg
2821 atgccatcgc ggattccagc gcgcatatcc atggcaagaa gttcgcgatc tacggcgatc
2881 cggatctctg ctatgggttg gctgcctttc tgctcgaact cggcgccgag cctactcatg
2941 tgctgtccac caacggcaac aacgtggcag gagaaaatgc gacgctgttt gcaggctcgc
3001 catttggaga acttccagcc tatccgggac gagacctctg gcacatgcgc tcgctcttgt
3061 tcacagagcc ggttgacttt ctgattggca acacccatgg caagtacctg gagcgtgaca
3121 ctggaacgcc attgatccgc atcggctttc caatttttga tcggcatcac catcaccgct
3181 tccctgtatg gggctatcag ggcggcctga atgtgctggt gaagatcctc gacaagatct
3241 tcgacgaaat cgacaagaag accagcgttc ttggcaaaac tgactacagt ttcgacatca
3301 ttcgttgatg acgggcagtg cgcgtgggct cgccgaaaca gcggcgagcc cacgctgggc
3361 actggttgac attgaaattt tcttccgctg agaggaaaat gctgatgagt tcgtctagtc
3421 ggccacggtc cagggtattt tcaggcgaac cgggctgccg aagaatggaa gtaagtcgga
3481 ggctgagcgc aagaagggct
//
LOCUS RHPNIFH 2030 bp ds-DNA PLN 15-AUG-1990
DEFINITION Parasponia rhizobium nitrogenase (nifH) gene, iron protein
component.
ACCESSION K00487
KEYWORDS nifH gene; nitrogenase; unidentified reading frame.
SOURCE Parasponia rhizobium (strain ANU289) DNA.
ORGANISM Parasponia rhizobium
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Hamamelidae; Urticales; Ulmaceae.
REFERENCE 1 (bases 1 to 2030)
AUTHORS Scott,K.F., Rolfe,B.G. and Shine,J.
TITLE Nitrogenase structural genes are unlinked in the nonlegume symbiont
Parasponia rhizobium
JOURNAL DNA 2, 141-148 (1983)
STANDARD full staff_review
COMMENT [1] states the iron protein subunit is encoded on a separate operon
from other components of the nitrogenase enzyme complex, unlike
previously studied nitrogen-fixing prokaryotes.
FEATURES from to/span description
pept 576 1460 nifH (nitrogenase iron protein)
mRNA 421 > 1460 nifH mRNA
BASE COUNT 430 a 572 c 618 g 410 t
ORIGIN 5 bp upstream of PstI site
1 ctgcagggcc cttgtaaggc gcttcttgct gcctttaagc tcatgcgcac cgatctgatc
61 agctggatca atcgggaggt cagccgcaca attgatctcg tcatcctcga ccacgaaccc
121 catcgccggc cacttgcctt gaggttctga cctcgacctg catattgctc tccgcggatt
181 gccgccactg gcttgcaaga agaggagcaa gtcccgttcc agttgaggaa atcgaaccag
241 atcatgccaa accggcgttt tccggttgat gggtgtggcc gttgttcgtt ttctgacagc
301 cgcgcagatc ctgtccggtg caaacctccc tggggtagct cagcggctcg ttggcttttt
361 agagcgtaat caagaagctt aataagcgcg gacagtgttg gcatggcgat tgctgttgag
421 ttgcagcaac actgagtgag ggctgggtgc acgccgacgc gtaagacgag cgatgcgctc
481 cttcccttga acccgtgtgc cccgtttctg agagagaaac aagctcgcgt gtcggaagca
541 cgcaactttt ggcaaatcgg ttgatggaga acaacatgtc ttcactgaga caaatcgcgt
601 tctacggaaa gggcggcatc ggcaagtcga ccacgtccca gaatacgttg gcggcactgg
661 ccgagatggg ccagaaaatc ctgatcgtgg gatgcgatcc taaggcggac tcgacgcgcc
721 tcatcctgca cgcgaaggcg caggacacga ttttgagcct tgcagcgagc gctggcagcg
781 tggaagacct cgaactcgag gacgtgatga aggtcggcta caaggacatc cgatgcgtgg
841 agtccggtgg tcccgagccg ggtgtcggct gcgcgggccg cggcgtcatc acctcgatca
901 atttcctgga ggagaacggc gcctatgaga acattgacta tgtctcatat gacgtgctcg
961 gcgacgtcgt ttgcggtggc tttgcgatgc cgatccggga aaacaaggcg caggagatct
1021 atatcgtgat gtctggagaa atgatggcaa tgtatgccgc aaacaatatc tccaaaggta
1081 tcctgaaata cgccaactct ggcggcgtgc ggctgggcgg cctgatctgc aacgagcggc
1141 agaccgataa ggagctggag ctggcggagg cgctggccaa gaagttaggt actcagctga
1201 tctacttcgt gccgcgcgac aatgtggtgc agcatgccga gctacggcgc atgacggtgc
1261 tggagtatgc ccctgagtcg cagcaggccg atcactatcg caatcttgcg accaaggttc
1321 acaacaatgg cggcaaaggc atcattccga ctccgatctc catggatgag ctcgaggaca
1381 tgctgatgga gcatggcatt atgaagcccg tcgacgaatc catcgtcggc aagaccgccg
1441 ccgaactcgc ggcctcgtaa aggtcgcggg tcgcggcctt gtgaaggcgc gcgacggatg
1501 ccggtctccc tcacccccca tccggggaga ccggcattct gacgattatc tgaccagcca
1561 gagtggagct ggcaaccgtg accgctatgg gaacccaaaa catcatgaca ggagcgcact
1621 tccttccgct tatggcttct tgcgccgtcg aggcgagcag caaggtgcaa agaggaattg
1681 cgacctaccg agcgctcact ggcgtcctcc tgaagaggcc gacattgcga ccgacagcaa
1741 tttcgattgc catgtcctgg cgtcaatcct ggcggccgct cgatggatgg tggcccgctt
1801 cccgagcgcc ctgtccgcca ccagctggcg accctgctcg cagcaatttc catcggttga
1861 ggtcgatatc tcggagcagc tcctggcgtc taagtgcgat gagaatgacg agatcgcgat
1921 ggtgcgcgat cttttgctca agcaacgctc gacggacggg catattcggg ctggctagcc
1981 gcgatgattg cgcgccgcgc catagagcca gatcacctgt gggaagatct
//
LOCUS RHPHBEM 1520 bp ds-DNA PLN 15-AUG-1990
DEFINITION P.andersonii haemoglobin gene, complete cds.
ACCESSION M36509
KEYWORDS haemoglobin.
SOURCE P.andersonii DNA.
ORGANISM Parasponia andersonii
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Hamamelidae; Urticales; Ulmaceae.
REFERENCE 1 (bases 1 to 1520)
AUTHORS Landsmann,J., Dennis,E.S., Higgins,T.J.V., Appleby,C.A.,
Kortt,A.A. and Peacock,W.J.
TITLE Common evolutionary origin of legume and non-legume plant
haemoglobins
JOURNAL Nature 324, 166-168 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept 198 313 haemoglobin, exon 1
436 550 haemoglobin, exon 2
877 993 haemoglobin, exon 3
1153 1293 haemoglobin, exon 4
IVS 314 435 haemoglobin intron A
IVS 551 876 haemoglobin intron B
IVS 994 1152 haemoglobin intron C
BASE COUNT 470 a 281 c 276 g 493 t
ORIGIN
1 ttatcttact aaaaagaaaa cgaaaataaa aaacccaaag atatggctcc ccaataccct
61 gaagagttac acacgatccc cattttttct actatatata cagagtgcct tcaccagatt
121 ttccaaacac actccaacat atcccattgc ccaaataaaa atttctcagc ttttagtccc
181 ctcaacccac agaagccatg agcagctcag aagttaacaa agttttcaca gaggagcagg
241 aagctctggt ggtgaaagca tgggctgtaa tgaagaagaa ctctgctgaa ctgggtcttc
301 aattcttcct caagtaagtc aaaattatat atagtacact ttttatttac tttgcttctt
361 ttatagacca agtttttgaa taaaagggta ctattttttt ttcctgaaaa aaattggttg
421 attgaaactt tgcaggatat ttgagattgc accgtctgcc aagaacttgt tctcttattt
481 gaaggactct ccggttcctt tggagcagaa cccaaagctc aagccccatg ctacgactgt
541 cttcgttatg gtaaagccaa cttttgttct cctattccct tatcctaatt ttacaagaat
601 ctaatgttaa taaaatagta ttttgcctat ttaaacaacc aaaaatttag acacaactat
661 ataaaacatt taaattcttg tggtttatga taccttgatc tacaatgatt ccaacttccc
721 gtgttgcatt tatgagttgt gctagcaaca gtcgcatcac agtcgtctat tccagaaagg
781 acgactgtga ctcttgagac atatcaaagc aaagctcagc aatttttatg tttctcactt
841 gctctgttct ttttctctgg tacttgtcct ggaaagacat gtgagtctgc ggttcaactt
901 cggaaagccg gaaaagtgac agtgaaagaa tcagacttga aaagaattgg ggctatccac
961 ttcaaaactg gcgtagttaa tgaacatttt gaggtactac cctggccact tagtagatat
1021 aattccctaa gtgtaatcca aacatttgtt gtttagagtc aaattattat tattctgtat
1081 ggtggttctt gaataatcga tcttattatg gtatttacta attatattat gcatgggaaa
1141 aacgatttgt aggtcacaag gtttgcactt ttggagacca taaaggaagc agtaccagaa
1201 atgtggtcac ctgagatgaa gaacgcatgg ggagtagctt atgatcagtt ggttgctgcc
1261 atcaagttcg aaatgaaacc ctccagtact tgagaatttt tatagttctt ggaacaattg
1321 ggtttgaata atgtgacaaa acttatactt aattacgttt gcatgagaga gaggtaataa
1381 ttgcatagtg tataacttgc atatgtatca tagtgtgacg caatctctcc acttgtgttg
1441 ttcatcttgt tcaaaaggaa ttagtctttc actttacatt ttgggtggaa gtatggaatg
1501 aaatcagagt ttcattgatt
//
LOCUS PT7RNAA 266 bp ds-DNA PHG 15-AUG-1990
DEFINITION Bacteriophage T7 RNA polymerase gene 1, 3' end.
ACCESSION M24964 M24965
ORGANISM Bacteriophage T7
Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE 1 (bases 1 to 266)
AUTHORS Osterman,H.L. and Coleman,J.E.
TITLE T7 ribonucleic acid polymerase-promoter interactions
JOURNAL Biochemistry 20, 4884-4892 (1981)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 201 RNA polymerase (gene 1; AA at 1)
mRNA < 1 266 gene 1 mRNA
BASE COUNT 72 a 66 c 63 g 65 t
ORIGIN
1 ccggctgacg ctgcgaacct gttcaaagca gtgcgcgaaa ctatggttga cacatatgag
61 tcttgtgatg tactggctga tttctacgac cagttcgctg accagttgca cgagtctcaa
121 ttggacaaaa tgccagcact tccggataaa ggtaacttga acctccgtga catcttagag
181 tcggacttcg cgttcgcgta acgccaaatc aatacgactc actatagagg gacaaactca
241 aggtcattcg caagagtggc ctttat
//
LOCUS PT7RNAB 139 bp ds-DNA PHG 15-AUG-1990
DEFINITION Bacteriophage T7 class III RNA polymerase promoter L1 fragment.
ACCESSION M24966
ORGANISM Bacteriophage T7
Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE 1 (bases 1 to 139)
AUTHORS Osterman,H.L. and Coleman,J.E.
TITLE T7 ribonucleic acid polymerase-promoter interactions
JOURNAL Biochemistry 20, 4884-4892 (1981)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 58 > 139 L1 mRNA
BASE COUNT 44 a 25 c 28 g 42 t
ORIGIN
1 cggtatttaa ttaaatattc tccctgtggt ggctcgaaat taatacgact cactataggg
61 agaacaatac gactacggga gggttttctt atgatgacta taagacctac taaaagtaca
121 gactttgagg tattcactc
//
LOCUS PT7RNAC 141 bp ds-DNA PHG 15-AUG-1990
DEFINITION Bacteriophage T7 L2 nonpromoter fragment.
ACCESSION M24967
ORGANISM Bacteriophage T7
Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE 1 (bases 1 to 141)
AUTHORS Osterman,H.L. and Coleman,J.E.
TITLE T7 ribonucleic acid polymerase-promoter interactions
JOURNAL Biochemistry 20, 4884-4892 (1981)
STANDARD simple staff_review
BASE COUNT 36 a 35 c 35 g 35 t
ORIGIN
1 cggaagtgct ggcattttgt ccaattgaga ctcgtgcaac tggtcagcga actggtcgta
61 gaaatcagcc agtacatcac aagactcata tgtgtcaacc atagtttcgc gcactgcttt
121 gaacaggttc gcagcgtcag c
//
LOCUS SIVSMMM7 1210 bp ss-RNA VRL 15-AUG-1990
DEFINITION Simian immunodeficiency virus (SIV) pol region.
ACCESSION M27256
KEYWORDS .
SOURCE Simian immunodeficiency virus (isolate SMM-M7).
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 1210)
AUTHORS Li,Y.
JOURNAL Unpublished (1989)
STANDARD full staff_entry
COMMENT This sequence corresponds to the 3' third of the pol gene. Kindly
provided in computer-readable form by Yen Li.
Author address:Y.Li
New England Regional Primate Research Center
Southborough, Massachusetts 01772
(508-481-0400).
BASE COUNT 478 a 207 c 269 g 256 t
ORIGIN
1 gcccggccag taatccgccc accattgctc ccgaatttcg acccctcctc tagtcagatt
61 agtgttcaat ttggtaaagg atcccatcga agaaatagga acattttatg tggatggctc
121 ttgcaataaa cagtcaaaag agggaaaagc aggatacata acagacagaa ggaggagcaa
181 aataaagttc ttagaacaga ctaccaatca gcgagcagaa ttagaagcct ttctcatggc
241 agtaacagat tcaggagcag aggcaaatat tatagtagat tctcaatatg tgatggggat
301 agtgacaagg caacccactg aatcagaaag taaaatagta aatcagataa tagaagaaat
361 gatcaaaaag acagcagtat atgtgacata ggtaccagct cataaaggtc taggaagaaa
421 tcaagaaata gaccatttag ttagtcaaag gattaggcaa gtcttgttcc tagaaaagat
481 agaaccagcc caagaagagc acgaaaaata tcacagcaat gtaaaagaat tggtctttaa
541 atttaggata ccaagattag tagcaaaaca gatagtagat acctgtgata aatgccagca
601 gaaaggagaa gctatacata gacaggtaaa cacagagtta agaatttggc aaatagactg
661 cacacaccta gagggcaaag ttgttatagt agcagtacat gtggctagtg gattcataga
721 ggcagaagta atcccacaag aaacaggaag acagacagca ttgttcctgt taaaattagc
781 tagcaggtgg cccatcacac acctgcacac agataatggt gctaactttg cttcgcaaga
841 agtaaagatg gtagcctagt gggcagatat agaacacacc tttaaggtac catataatcc
901 acaaagtcaa agagtagtag aagcaatgaa tcatcaccta aagaatcaga tagagagaat
961 tagagagcag gcaaattcag tagaaacaat agtgctcatg gcagttcatt gcatgaattt
1021 taaaagaagg ggaggaatag gggatatgac cccagcagaa agattaatta atatgatcac
1081 cacagaacaa gaaatacaat tccaacaatc aaaaaattca aaatttaaaa attttcgggt
1141 ctatttcaga gaaggcagag accaactgtg gaaaggaccc ggtgaattac tgtggaaagg
1201 ggaaggagca
//
LOCUS ADEAD5A 180 bp ds-DNA VRL 15-AUG-1990
DEFINITION Adenovirus type 5 packaging domain region.
ACCESSION M36423
KEYWORDS .
SOURCE Adenovirus type 5 (strain dl309) DNA.
ORGANISM Mastadenovirus h5
Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE 1 (bases 1 to 180)
AUTHORS Graeble,M. and Hearing,P.
TITLE Adenovirus type 5 packaging domain is composed of a repeated
element that is functionally redundant
JOURNAL J. Virol. 64, 2047-2056 (1990)
STANDARD simple staff_review
FEATURES from to/span description
site 1 156 packaging domain
BASE COUNT 48 a 27 c 52 g 53 t
ORIGIN
1 gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag taaatttggg
61 cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga agtgaaatct
121 gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg gcatttgacc
//
LOCUS CAJFJAAB 1932 bp ds-DNA BCT 15-AUG-1990
DEFINITION C.coli flagellin (flaB) gene, complete cds.
ACCESSION M35141
KEYWORDS flaB gene; flagellin.
SOURCE C.coli (strain VC167, serogroup LIO 8) DNA.
ORGANISM Campylobacter coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 1 to 1932)
AUTHORS Guerry,P., Logan,S.M., Thornton,S. and Trust,T.J.
TITLE Genomic organization and expression of Campylobacter flagellin
genes
JOURNAL J. Bacteriol. 172, 1853-1860 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 211 1932 flagellin (flaB)
mRNA 185 > 1932 flagellin mRNA
BASE COUNT 638 a 325 c 387 g 582 t
ORIGIN
1 taacaaatcc aagcctagta gtaatactag gcttttttat ttctaaataa aacttggaac
61 attctttagc gtttactgta atttatacaa atccaagcct agtagtaata ctaggctttt
121 tttatttcta aataaaattt caatttgaat caaaacttgg aacacttctt gctttaatct
181 tttcgatgca atattttgaa aggatttaaa atgggtttta gaataaacac caacatcggt
241 gcattgaacg cacatgcaaa ttcagttgtt aatgctaggg agcttgacaa gtctttaagt
301 agacttagct caggtcttag aatcaactcc gcagcagatg atgcttcagg gatggcgata
361 gcagattctt tgcgttcaca agcagcaact ttaggtcaag ctataaacaa tggtaatgat
421 gctataggta tcttgcaaac tgcagataag gctatggatg agcaacttaa aatcttagat
481 accatcaaga ctaaagcgac tcaagctgct caagatggtc aaagcttaaa aacaagaact
541 atgcttcaag cagacatcaa ccgtttgatg gaagaacttg ataatatcgc aaataccact
601 tcatttaatg gcaaacaact tttaagtggt ggttttacca atcaagaatt ccaaatcggt
661 tcaagttcaa atcaaactat taaagcaagt ataggagcaa ctcagtcttc taaaatcggt
721 gtaacaagat ttgaaacagg ttcacaaagt ttttcttcag gcactgtagg acttactatt
781 aaaaactaca acggtatcga agattttaaa tttgatagtg tagtgatttc tacttcagta
841 ggaacaggtc ttggagcttt ggctgaagag atcaacagaa atgcagataa aacaggaatt
901 cgtgcaactt ttgatgtaaa atctgtagga gcctatgcaa taaaagcagg aaatacttct
961 caggattttg ctatcaatgg ggttgttatc ggacaaataa attataatga cggtgataac
1021 aatggtcaac ttatctcagc tatcaatgct gtaaaagata caactggtgt tcaagcctct
1081 aaagatgaaa atggtaaact tgttcttact tcggccgatg gtagagggat taaaatcaca
1141 ggtagcatag gtgtaggagc tggtatattg cacactgaaa attatggaag gttatcttta
1201 gttaaaaatg atggtagaaa tatcaatata agtggaacag gtctttcagc tataggtatg
1261 ggtgctacag acatgatttc tcaatcttca gtatctctaa gagagtcaaa agggcaaatt
1321 tcagcagcca atgctgatgc tatgggcttt aatgcttata atggcggcgg cgctaagcaa
1381 attattttcg cttctagtat tgcaggattt atgtctcagg ctggttcagg cttctctgct
1441 ggttcgggat tttcagtagg tagtggtaaa aattattcag ccattttatc agcttctata
1501 cagatagtat ctagcgcagc ttctatcagt agcacctatg ttgtttctac tggttcaggt
1561 ttctctgctg gttcaggtaa ttctcaattt gcagctttaa gaataagtac agtaagtgct
1621 catgatgaaa ctgcaggtgt aactacactt aagggtgcaa tggctgtgat ggatatagca
1681 gaaactgcta ttaccaattc tgatcaaatc agagcggata taggtgctgt gcaaaatcag
1741 ctccaagtaa cgataaataa tattaccgta acccaggtaa atgttaaagc agcagaatca
1801 accataagag atgtggattt cgctgcagaa agtgcaaatt tttctaagta caatatcctt
1861 gcgcagtcgg gttcatatgc tatgagccaa cgtaacgctg tgcaacaaaa tgtcttaaaa
1921 cttttacaat aa
//
LOCUS CAJFLA 1719 bp ds-DNA BCT 15-AUG-1990
DEFINITION C.coli flagellin gene, complete cds.
ACCESSION M26945
KEYWORDS flagellin.
SOURCE C.coli (strain VC167) DNA.
ORGANISM Campylobacter coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE 1 (bases 1 to 1719)
AUTHORS Logan,S.M., Trust,T.J. and Guerry,P.
TITLE Evidence for posttranslational modification and gene duplication of
Campylobacter flagellin
JOURNAL J. Bacteriol. 171, 3031-3038 (1989)
STANDARD simple staff_entry
FEATURES from to/span description
pept 1 1719 flagellin
BASE COUNT 563 a 284 c 365 g 507 t
ORIGIN
1 atgggatttc gtattaacac aaatgttgca gcattaaatg ctaaagcaaa ttcggatcta
61 aacagcagag cattagatca atcactttca agactcagtt caggtcttag aatcaactcc
121 gcagcagatg tagcttcagg gatggcgata gcagatagtt taagatctca ggcaaatact
181 ttgggtcagg ctatatctaa tggtaatgat gctttaggta tcttgcaaac tgcagataag
241 gctatggatg agcaacttaa aatcttagat accatcaaga ctaaagcgac tcaagctgct
301 gaagatggtc aaagcttaaa aacaagaact atgcttcaag cagacatcaa ccgtttgatg
361 gaagaacttg ataatatcgc aaataccact tcatttaatg gcaaacaact tttaagtggt
421 ggttttacca atcaagaatt ccaaatcggt tcaagttcaa atcaaactat taaagcaagt
481 ataggagcaa ctcagtcttc taaaatcggt gtaacaagat tgaacaggtt cacaaagttt
541 tcttcaggca ctgtagggct tactatcaaa aactacaacg gtatcgaaga ttttaaattt
601 gatagtgtag tgatttctac ttcagtagga acaggtcttg gagctttggc tgaagagatc
661 aacagaaatg cagataaaac aggaattcgt gcaacttttg atctaaaatc tgtaggagcc
721 tatgcaataa aagcaggaaa tacttctcag gattttgcta tcaatggggt tgttataggt
781 aaggttgatt attcagatgg tgatgagaat ggttctttaa tttcagctat caatgctgta
841 aaagatacaa ctggtgttca agcctctaaa gatgaaaatg gtaaacttgt tcttacttcg
901 gccgatggta gagggattaa aatcacaggt agcataggtg taggagctgg tatattgcac
961 actgaaaatt atggaaggtt atctttagtt aaaaatgatg gtagagatat caatataagt
1021 ggaacaggtt tttcagctat aggtatgggt gctacagaca tgatttctca atcttcagta
1081 tctctaagag agtcaaaagg gcaaatttca gcagccaatg ctgatgctat gggctttaat
1141 gcttataatg gcggcggcgc taagcaaatt attttcgctt ctagtattgc agggtttatg
1201 tctcaggctg gttcaggctt ctctgctggt tcgggatttt cagtaggtag tggtaaaaat
1261 tattcagcca ttttatcagc ttctatacag atagtatcta gcgcagcttc tatcagtagc
1321 acctatgttg tttctactgg ttcaggtttc tctgctggtt caggtaattc tcaatttgca
1381 gctttaagaa taagtacagt aagtgctcat gatgaaactg caggtgtaac tacacttaag
1441 ggtgcaatgg ctgtgatgga tatagcagaa actgctatta ccaatcttga tcaaatcaga
1501 gcggatatag gttctgtgca aaatcaaatc acatcgacta taaacaacat tactgtaacc
1561 caggtaaatg ttaaatcagc agaatcacaa atcagagatg tagattttgc aagcgagagt
1621 gcaaattact ctaaagcaaa tatattggct caaagtggtt cttatgctat ggctcaagca
1681 aattcaagcc agcaaaatgt tttaagatta ctacagtag
//
LOCUS CHKLNKPA1 215 bp ds-DNA VRT 15-AUG-1990
DEFINITION Chicken cartilage link protein gene, exon 2.
ACCESSION M35035
KEYWORDS cartilage link protein.
SEGMENT 1 of 5
SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
lambda gLP532 and lambda gLP12.1.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 51 to 65 and 182 to 195)
AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
Argraves,W.S., Sparks,K.J. and Goetinck,P.
TITLE Structure of the chicken link protein gene: Exons correlate with
the protein domains
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 215)
AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
Argraves,W.S., Sparks,K.J. and Goetinck,P.
TITLE Structure of the chicken link protein gene: Exons correlate with
the protein domains
JOURNAL Unpublished (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1,2] kindly
submitted by I.Kiss, 04-JUN-1990.
FEATURES from to/span description
pept 87 + 186 cartilage link protein, exon 2 (first
expressed exon
pre-msg < 1 > 215 cartilage link protein mRNA and introns
IVS < 1 60 cartilage link protein intron A
IVS 187 > 215 cartilage link protein intron B
BASE COUNT 63 a 43 c 47 g 62 t
ORIGIN
1 gaattccata aagggttcca aaaaattgat gagcctttct gttatgtgat gcccttacag
61 tgaagaagat tcttgtgact gtgaagatga caagtctact ctttctggtg ctgatttctg
121 tctgctgggc agaacctcat cctgacaact caagcctgga gcatgagagg attattcaca
181 tccaaggtaa ggaaatacat cagaaaacgc ctttt
//
LOCUS CHKLNKPA2 460 bp ds-DNA VRT 15-AUG-1990
DEFINITION Chicken cartilage link protein gene, exon 3.
ACCESSION M35036
KEYWORDS cartilage link protein.
SEGMENT 2 of 5
SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
lambda gLP39.13 and lambda gLP33.7.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 49 to 63 and 429 to 442)
AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
Argraves,W.S., Sparks,K.J. and Goetinck,P.
TITLE Structure of the chicken link protein gene: Exons correlate with
the protein domains
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 460)
AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
Argraves,W.S., Sparks,K.J. and Goetinck,P.
TITLE Structure of the chicken link protein gene: Exons correlate with
the protein domains
JOURNAL Unpublished (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.Kiss, 04-JUN-1990.
FEATURES from to/span description
pept + 59 + 433 cartilage link protein, exon 3
pre-msg < 1 > 460 cartilage link protein mRNA and introns
IVS < 1 58 cartilage link protein intron B
IVS 434 > 460 cartilage link protein intron C
BASE COUNT 145 a 90 c 113 g 112 t
ORIGIN
1 tctgtaaaag gtggagtgca gactaattct cctttttgtt tttctccttg aattgtagaa
61 gaaaatggac cccgcctact tgtggtagca gaacaagcta agatcttctc tcagcgaggt
121 ggcaacgtca cactgccttg taaattttac catgaacaca catcaacagc tggctcagga
181 acccacaaaa tccgggtcaa gtggaccaaa ctcacctcag attacctcaa agaagtggat
241 gtctttgtcg caatgggaca ccacagaaag agctacggaa agtatcaggg cagagtgttt
301 ctgagggaaa gcagtgagaa cgatgcctct cttataatca cgaatataat gctggaggat
361 tatgggagat acaagtgcga agtgattgaa ggattagagg acgacacagc agtggtagct
421 ctgaatttgg aaggtaggta acatctaatg tagacttaaa
//
LOCUS CHKLNKPA3 427 bp ds-DNA VRT 15-AUG-1990
DEFINITION Chicken cartilage link protein gene, exon 4.
ACCESSION M35037
KEYWORDS cartilage link protein.
SEGMENT 3 of 5
SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
lambda gLP33.7 and lambda gLP10.1.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 44 to 58 and 352 to 365)
AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
Argraves,W.S., Sparks,K.J. and Goetinck,P.
TITLE Structure of the chicken link protein gene: Exons correlate with
the protein domains
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 427)
AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
Argraves,W.S., Sparks,K.J. and Goetinck,P.
TITLE Structure of the chicken link protein gene: Exons correlate with
the protein domains
JOURNAL Unpublished (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.Kiss, 04-JUN-1990.
FEATURES from to/span description
pept + 54 + 356 cartilage link protein, exon 4
pre-msg < 1 > 427 cartilage link protein mRNA and introns
IVS < 1 53 cartilage link protein intron C
IVS 357 > 427 cartilage link protein intron D
BASE COUNT 99 a 108 c 105 g 115 t
ORIGIN
1 aaaaaccctt ctagtgggga ttacccccag ctcacctctt tttgccattt caggtgttgt
61 tttcccctat tctccacgtc tgggtcgtta caacctaaac ttccatgagg ctcagcaagc
121 ttgcctggac caggactcca tcattgcctc cttcgaccag ctctacgagg cctggaggtc
181 agggctggac tggtgcaatg ctggctggct cagtgatggt tcagtgcagt accctatcac
241 caagcccaga gagccctgtg gagggaagaa tacggtgccc ggtgtcagaa actatggctt
301 ctgggataaa gagaggagcc gatatgatgt tttctgcttt acttcaaact tcaatggtaa
361 gaacctggtt tacatttacc ttgcaagggt ctttttccat gctttaaaaa gaaagagatg
421 ccagcgg
//
LOCUS CHKLNKPA4 826 bp ds-DNA VRT 15-AUG-1990
DEFINITION Chicken cartilage link protein gene, exon 5.
ACCESSION M35038
KEYWORDS cartilage link protein.
SEGMENT 4 of 5
SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
lambda gLP10.1 and lambda gLP39.23.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 15 to 29)
AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
Argraves,W.S., Sparks,K.J. and Goetinck,P.
TITLE Structure of the chicken link protein gene: Exons correlate with
the protein domains
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1 to 826)
AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
Argraves,W.S., Sparks,K.J. and Goetinck,P.
TITLE Structure of the chicken link protein gene: Exons correlate with
the protein domains
JOURNAL Unpublished (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.Kiss, 04-JUN-1990.
FEATURES from to/span description
pept + 25 314 cartilage link protein, exon 5
pre-msg < 1 > 826 cartilage link protein mRNA and introns
IVS < 1 24 cartilage link protein intron D
signal 786 792 AATAAA sequence
BASE COUNT 262 a 166 c 162 g 236 t
ORIGIN
1 atggctccct ccgtctctcc ccaggtcgtt tttactacct aatacaccca accaagctga
61 cctatgatga agccgtgcag gcctgcctga aggatggcgc tcagattgcc aaggttgggc
121 agatattcgc tgcctggaag ctccttggtt atgaccgctg tgatgccggc tggctggcag
181 acggcagcgt ccgctacccc atctccagac ccagaaagcg ctgcagcccc aacgaggctg
241 ccgtccgctt tgtaggcttt cctgataaaa agcacaagct gtatggtgtc tactgtttca
301 gagcttacaa ctgaaaatac ctagagctgc aacagtcttt aattcattaa gaacatgtga
361 aatatttcga tatgaactcg tgcaagttac caaaactgtg ataaaccttt cttacttact
421 gtagagtcat tttcataaac caaaaccatt aatttgtttt tgtttctgtt taaatatttt
481 tgtaaaagta tcattccata gatatttaaa aataatataa gtttaatgga agctctaggt
541 aagaagagcc aaattcttta agctacgtca tcccaacaaa atataatttt catgaatggg
601 gcatgcaata gagcttgaca attgctagga cacaattatg gaatgtaagg ctactcaaag
661 cagaagcttt taaaagcaca aattttacat gtttgtaccc gtttgagata cacagcaaat
721 tgattgtatc tggagttttg aattaagatg tttttgttta taggggtcag tgaggttttg
781 caaaaaataa aaattaaaaa aaaaaaaaaa aaaaaaaaag gccgcc
//
LOCUS CHKLNKPA5 217 bp ds-DNA VRT 15-AUG-1990
DEFINITION Chicken cartilage link protein gene, exon 6.
ACCESSION M35039
KEYWORDS cartilage link protein.
SEGMENT 5 of 5
SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
lambda gLP10.1 and lambda gLP39.23.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 217)
AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
Argraves,W.S., Sparks,K.J. and Goetinck,P.
TITLE Structure of the chicken link protein gene: Exons correlate with
the protein domains
JOURNAL Unpublished (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by I.Kiss, 04-JUN-1990.
FEATURES from to/span description
pre-msg < 1 217 cartilege link protein mRNA and intron
signal 44 49 poly-A signal
signal 98 103 poly-A signal
BASE COUNT 102 a 24 c 22 g 69 t
ORIGIN
1 tataatattt aatatttctt aagctattta cacatcacaa gaaaataaaa aattggaaaa
61 aaaaatcaaa tgatcaagtc ttagaagaag attattgaat aaaatctgaa accagctatt
121 aaggtttaga agagaagaag tactttattt ccttacatct tatctgtatc taaatataca
181 tctgtttttt aaactatcaa tgaaaaaaaa aaaaaaa
//
LOCUS CHTCRPA 3012 bp ds-DNA BCT 15-AUG-1990
DEFINITION C.trachomatis 9-kD and 60-kD cysteine-rich and 15 kD serine-rich
outer membrane protein genes, complete cds.
ACCESSION M35148 M23180 M35161
KEYWORDS cysteine-rich outer membrane protein;
serine-rich outer membrane protein.
SOURCE C.trachomatis (serovar L1) DNA.
ORGANISM Chlamydia trachomatis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE 1 (bases 1 to 753 and 1715 to 2577)
AUTHORS Lambden,P.R., Everson,J.S., Ward,M.E. and Clarke,I.N.
TITLE Sulfur-rich proteins of Chlamydia trachomatis: Developmentally
regulated transcription of polycistronic mRNA from tandem promoters
JOURNAL Gene 87, 105-112 (1990)
STANDARD simple staff_review
REFERENCE 2 (bases 483 to 3012)
AUTHORS Clarke,I.N., Ward,M.E. and Lambden,P.R.
TITLE Molecular cloning and sequence analysis of a developmentally
regulated cysteine-rich outer membrane protein from Chlamydia
trachomatis
JOURNAL Gene 71, 307-314 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 185 451 9-kDa cysteine-rich outer membrane protein
pept 703 2259 60-kDa cysteine-rich outer membrane protein
precursor
sigp 703 735 60-kD serine-rich outer membrane protein
signal peptide
matp 736 2256 60-kDa cysteine-rich outer membrane protein
pept 2437 2889 15-kDa serine-rich outer membrane protein
mRNA 93 2296 CrP operon mRNA (alt.)
mRNA 159 2296 CrP operon mRNA (alt.)
mRNA 160 2296 CrP operon mRNA (minor alt.)
mRNA 2406 2965 SrP mRNA
BASE COUNT 898 a 537 c 678 g 899 t
ORIGIN
1 tttgtttgct ttgatttgct aattacctgt tattagacga tttgttttaa aaaacaattg
61 atataatttt tattttataa tgtaatattg tctatgaggg ctagtttctt ttattattaa
121 aagaattgct tttatcgata aaagaaactt caagagccct tttctagaaa ggagtctgga
181 agttatgaaa aaaactgctt tactcgctgc tttatgtagt gttgtttctt taagtagttg
241 ttgtcgtatc gttgactgtt gcttcgaaga tccatgcgca cctatccaat gttcaccttg
301 tgaatctaag aagaaagacg tagacggtgg ttgcaactct tgtaacgggt atgtcccagc
361 ttgcaaacct tgcggagggg atacgcacca agatgctgaa catggccctc aagctagaga
421 aattccagtt gacggcaaat gcagacaata ggtagcgcaa gttaagagcc tacccacaac
481 agatgtagtt agtaaggaag ttggcttcct tactaactat ttcggctaac aagaaaatgt
541 tgagggtaaa agttagttaa taacaatttc tacccgatgg cagacaaaaa ataatctatg
601 cgaataggag atcctatgaa caaactcatc agacgagcag tgacgatctt cgcggtgact
661 agtgtggcga gtttatttgc tagcggggtg ttagagacct ctatggcaga gtttatctct
721 acaaacgtta ttagcttagc tgacaccaaa gcgaaagaca acacttctca taaaagcaaa
781 aaagcaagaa aaaaccacag caaagagact cccgtaaacc gtaaaaaggt tgctccggtt
841 catgagtcta aagctacagg acctaaacag gattcttgct ttggcagaat gtatacagtc
901 aaagttaatg atgatcgtaa tgttgaaatc acacaagctg ttcctaaata tgctacggta
961 ggatctccct atcctgttga aattactgct acaggtaaaa gggattgtgt tgatgttatc
1021 attactcagc aattaccatg tgaagcagag ttcgtacgca gtgatccagc gacaactcct
1081 actgctgatg gtaagctagt ttggaaaatt gaccgcttag gacaaggcga aaagagtaaa
1141 attactgtat gggtaaaacc tcttaaagaa ggttgctgct ttacagctgc aacagtatgc
1201 gcttgtccag agatccgttc ggttacaaaa tgtggacaac ctgctatctg tgttaaacaa
1261 gaaggcccag agaatgcttg tttgcgttgc ccagtagttt acaaaattaa tgtagtgaac
1321 caaggaacag caacagctcg taacgttgtt gttgaaaatc ctgttccgga tagttacgct
1381 cattcttctg gacagcgtgt actaacgttt actcttggag atatgcaacc tggagagcac
1441 agaacaatta ctgtagagtt ttgtccgctt aaacgtggtc gtgctaccaa tatagcaatg
1501 gtttcttact gtggaggaca taaaaataca gcaagcgtaa caactgtgat caacgagcct
1561 tgcgtacaag taagtattgc aggagcagat tggtcttatg tttgtaagcc tgtagaatat
1621 gtgatctccg tttccaatcc tggagatctt gtgttgcgag atgtcgtcgt taaagacact
1681 ctttctcccg gagtcacagt tcttgaagct gcaggagctc aaatttcttg taataaagta
1741 gtttggactg tgaaagaact gaatcctgga gagtctctac agtataaagt tctagtaaga
1801 gcacaaactc ctggacaatt cacaaataat gttgttgtga agagctgctc tgactgtggt
1861 acttgtactt cttgcgcaga agcgacaact tactggaaag gagttgctgc tactcatatg
1921 tgcgtagtag atacttgtga ccctgtttgt gtaggagaaa atactgttta ccgtatttgt
1981 gtcaccaaca gaggttctgc agaagataca aatgtttctt taatgcttaa attctctaaa
2041 gaactgcaac ctgtatcctt ctctggacca actaaaggaa cgattacagg caatacagta
2101 gtattcgatt cgttacctag attaggttct aaagaaactg tagagttttc tgtaacattg
2161 aaagcagtat cagctggaga tgctcgtggg gaagcgattc tttcttccga tacattgact
2221 gttccagttt ctgatacaga gaatacacac atctattaat ctttgatttt atcgatgtgt
2281 aggtgccgtc cagggattcc tgggcggctt tttttgttat ctatatgaaa ataaaagagt
2341 tcattttcgt tctcagagca tattctagat gggtttttga aaaaaataag tgtttgtgta
2401 gactccctgc tcacaaccaa aaaaggaatg taaaatatga gcactgtacc cgttgttcaa
2461 ggagctggat cttccaattc ggcacaggat atttccacta gttctgtacc attaacactg
2521 caagggcgta tatcgaatct tctatcttcc actgcattta aggtgggatt agtggtgatg
2581 ggactacttt tagtgatggc tacgatattc ctagtttcgg cagcttcgtt tgtaaatccc
2641 atctatctag ctattcctgc tattgtggga tgcgtgaata tctgcgtagg aattttatcc
2701 atggaaggat actgttctcc ggagagatgg agcttatgta agaaggtatt aaaggcttca
2761 gaagatatca tcgatgatgg gcagataaac aactctaata aagtgtttac tgatgagagg
2821 ttgaatgcca taggtggggt agtggaatct ctatctagaa gaaatagtct ggtggatcag
2881 acccaatgat aagagattgc tctataggca aaagatgata gcggcagttt ttatggatga
2941 tctgctgaca gatgatgtat ggaaagggag gaggaaagag tcctcctccc agattttatt
3001 gagctggagt tt
//
LOCUS DDIGP80A 1545 bp ss-mRNA INV 15-AUG-1990
DEFINITION D.discoideum membrane-associated glycoprotein (gp80) mRNA,
complete cds.
ACCESSION M36545
KEYWORDS gp80 gene; membrane-associated glycoprotein.
SOURCE D.discoideum, cDNA to mRNA.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 1545)
AUTHORS Siu,C.-H., Wong,L.M., Lam,T.Y., Kamboj,R.K., Choi,A. and Cho,A.
TITLE Molecular mechanisms of cell-cell interaction in Dictyostelium
discoideum
JOURNAL Biochem. Cell Biol. 66, 1089-1099 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 1 1545 membrane-associated glycoprotein (gp80)
precursor
sigp 1 48 membrane-associated glycoprotein (gp80) signal
peptide
matp 49 1542 membrane-associated glycoprotein (gp80)
BASE COUNT 502 a 332 c 209 g 502 t
ORIGIN
1 atgaaatttt tattagtatt gataatatta tataatattt taaatagtgc acattcagct
61 ccaacaataa cagctgtttc aaatggaaaa tttggtgttc caacatatat taccattaca
121 ggtactggat ttacaggaac tccagttgta actattggtg gccagacctg tgatccagtt
181 attgtagcca ataccgcatc gttacaatgc caattttctg ctcaattagc tccaggaaat
241 tcaaattttg atgttattgt aaaggttggt ggtgtaccat ctacaggtgg taatggtctt
301 tttaaatata cacctccaac tctttcaaca atatttccaa ataatggaag aattggtatg
361 attttagttg atggaccatc caatatatct ggatacaaat taaatgtgaa cgactctatt
421 aactctgcta tgttatctgt tactgctgat tcagtatccc caacaattta tttcctcgtg
481 ccaaatacaa tcgctggtgg tctacttaat cttgaactca ttcaaccatt tggcttttca
541 acaattgtaa cttccaaatc agtgttttct ccaaccatta catcaatcac cccattagct
601 tttgatctca caccaaccaa tgtaaccgtc actggtaaat actttgttac tacagctagt
661 gttacaatgg gaagtcatat ctatacagga ttgactgttc aagatgatgg aacaaattgt
721 catgttattt ttactactcg ttcagtttat gaatcatcaa atactataac tgctaaagct
781 tcaacaggtg tcgatatgat ttatttagac aatcaaggta atcaacaacc aataactttt
841 acatataacc caccaaccat tacttcaaca aaacaagtca atgactctgt tgagatctca
901 acaaccaata ctggtactga tttcactcaa atttctttaa ccatgggaac ctcaagccca
961 acaaaccttg taatcactgg tacaaatgaa aagattgtta taactcttcc acatgctctt
1021 ccagaaggtg aaattcaatt caatttgaaa gctggtatct caaatgttgt cacatcaact
1081 ttattagtta ctccggttat aaatagtgtc actcaagcac ctcacaatgg tggaagtatt
1141 acaatttcag gtatcttttt aaacaatgcc catgtttcga ttgttgttga ccaaaatact
1201 actgatatag tttgtgctcc agattcaaat ggtgaatcaa tcatttgtcc agttgaagct
1261 ggtagtggta ctattaattt agtcgttaca aactataaaa actttgcttc agatccaact
1321 attaaaactg aagccacaac ctctacaacc tatacaattc cagacactcc aactccaact
1381 gatacagcca ccccatctcc aactccaact gaaacagcca ccccatctcc aactccaaaa
1441 ccaaccagca caccagaaga aactgaagca ccttcatcag caacaactct tatttcacca
1501 ttatctttaa ttgttatttt catttctttt gttttattaa tttaa
//
LOCUS ECOMANXF 1474 bp ds-DNA BCT 15-AUG-1990
DEFINITION E.coli enzyme III-Man function protein (manX (ptsL)) gene,
complete cds, and manY (pel) gene, 5' end.
ACCESSION M36404
KEYWORDS enzyme III-Man function protein; manX gene; manY gene; pel gene;
ptsL gene.
SOURCE E.coli (strain K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1474)
AUTHORS Saris,P.E.J., Liljestroem,P. and Palva,E.T.
TITLE Nucleotide sequence of manX (ptsL) encoding the enzyme III-Man
(II-a-Man) function in the phosphotransferase system of Escherichia
coli K-12
JOURNAL FEMS Microbiol. Lett. 49, 69-73 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 258 1205 enzyme III-Man function protein (manX (ptsL))
pept 1268 > 1474 manY (pel) gene product
mRNA 120 > 1474 manXYZ operon mRNA (5' end put.)
BASE COUNT 411 a 319 c 376 g 368 t
ORIGIN
1 cctttgcaaa cgaatgtgac aaggatattt tacctttcga aatttctgct aatcgaaagt
61 taaattacgg atcttcatca cataaaataa ttttttcgat atctaaaata aatcgcgaaa
121 cgcaggggtt tttggttgta gcccttatct gaatcgattc gattgtggac gacgattcaa
181 aaatacatct ggcacgttga ggtgttaacg ataataaagg aggtagcaag tgaccattgc
241 tattgttata ggcacacatg ggttggggct gcagagcagg ttgcttaaaa cggcagaaag
301 tgctgttagg cgagcaggaa aacgtcggct ggatcaattt cgttccaggt gaaaatgccg
361 aaacgctgat tgaaaagtac aacgctcagt tggcaaaact cgacaccact aaaggcgtgc
421 tgtttctcgt tgatacatgg ggaggcagcc cgttcaatgc tgccagccgc attgtcgtcg
481 acaaagagca ttatgaagtc attgcaggcg ttaacattcc aatgctcgtg gaaaggttaa
541 tggcccgtga tgatgaccca agctttgatg aactggtggc actggcagta gaaacaggcc
601 gtgaaggcgt gaaagcactg aaagccaaac cggttgaaaa agccgcgcca gcacccggtg
661 ccgcagcacc aaaagcggct ccaactccgg caaaaccaat ggggccaaac gactacatgg
721 ttattggcct tgcgcgtatc gacgaccgtc tgattcacgg tcaggtcgcc acccgctgga
781 ccaaagaaac caatgtctcc cgtattattg ttgttagtga tgaagtggct gcggataccg
841 ttcgtaagac actgctcacc caggttgcac ctccgggcgt aacagcacac gtagttgatg
901 ttgccaaaat gattcgcgtc tacaacaacc cgaaatatgc tggcgaacgc gtaatgctgt
961 tatttaccaa cccaacagat gtagagcgtc tcgttgaagg cggcgtgaaa atcacctctg
1021 ttaacgtcgg tggtatggca ttccgtcagg gtaaaaccca ggtgaataac gcggtttcgg
1081 ttgatgaaaa agatatcgag gcgttcaaga aactgaatgc gcgcggtatt gagctggaag
1141 tccgtaaggt ttccaccgat ccgaaactga aaatgatgga tctgatcagc aaaatcgata
1201 agtaacgtat tgtgttgatt atcactcagt tttcacactt aagtcttacg taaacaggag
1261 aagtacaatg gagattacca ctcttcaaat tgtgctggta tttatcgtag cctgtatcgc
1321 aggtatggga tcaatcctcg atgaatttca gtttcaccgt cctctaatcg cgtgtaccct
1381 ggtgggctat cgttcttggg gatatgaaaa ccggtattat tatcggtggt acgctggaaa
1441 tgatcgcgct gggctggatg aacatcggtg ctgc
//
LOCUS FSCCKPA 1428 bp ss-mRNA VRT 15-AUG-1990
DEFINITION T.californica creatine kinase mRNA, complete cds.
ACCESSION M36427
KEYWORDS creatine kinase.
SOURCE T.californica electric organ, cDNA to mRNA, clone CK52g8.
ORGANISM Torpedo californica
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata;
Chondrichthyes; Elasmobranchii; Euselachii; Neoselachii;
Squalomorphii; Torpediniformes; Torpedinoidea; Torpedinidae.
REFERENCE 1 (bases 1 to 1428)
AUTHORS West,B.L., Babbitt,P.C., Mendez,B. and Baxter,J.D.
TITLE Creatine kinase protein sequence encoded by a cDNA made from
Torpedo californica electric organ mRNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 7007-7011 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 90 1235 creatine kinase (E.C. 2.7.3.2)
BASE COUNT 348 a 398 c 394 g 288 t
ORIGIN
1 ggtcacccac accagcggta gttccagcac caagcaggac aaggtccaga gtggttcacc
61 gtgcgccagg agtcagccaa cctccaacca tgcctttcgg aaacactcac aataaatgga
121 agctgaacta ttcggcggcg gaagaattcc ccgacctcag caagcacaac aaccacatgg
181 ccaaggcttt aaccctggac atctacaaga aacttcggga caaggagact ccaagtggct
241 tcaccctcga tgatatcatc cagacaggag tggacaaccc aggtcacccc ttcatcatga
301 ccgtgggctg cgtggctggc gatgaggaat gctacgaggt tttcaaggac ctgttcgatc
361 ccgtcattga ggaccgccac ggtggctaca aaccaactga caagcacaag actgacctga
421 accaggagaa cctgaagggc ggcgatgacc tcgacccgaa ttacgtcctg agcagccggg
481 tgcgcactgg ccgcagcatc aagggcatcg ccctgcctcc tcactgcagc cgcggggagc
541 gccgtctggt tgagaagctc tgcatagacg gtctcgccac cttgacgggc gagttccagg
601 gcaagtacta ccccctctcc tccatgtctg atgcagagca gcagcagctg atcgatgacc
661 acttcctgtt tgacaaaccc atctctcctc tgcttctcgc ctctggcatg gctcgggact
721 ggcccgatgg ccggggcatt tggcataaca acgacaagac cttcctggtc tgggtcaacg
781 aggaggacca cctccgagtc atctcgatgc agaaaggtgg caacatgaag gaggtcttca
841 ggcgcttctg cgttggtctg aagaagatcg aggacatttt cgtgaaggct ggccgtggct
901 tcatgtggaa cgagcacctg ggctacgtcc tgacctgccc gtccaacctg ggcactggcc
961 tccgtggtgg tgtccacgtg aaaatccctc acctctgcaa gcacgagaag ttcagcgagg
1021 tcctcaagag aacgaggctg cagaaacgtg ggacaggtgg agtggatacc gcagcggttg
1081 gcagcatcta tgacatctcc aacgccgacc gtctgggctt ctccgaggtg gaacaggtcc
1141 agatggtggt ggacggtgtg aagctgatgg tcgagatgga gaagaggctg gaaaatggga
1201 aaagcatcga tgacctgatg ccggctcaga agtagacctt gggttggctg ggtgcctgcc
1261 actctgagat gccttgaaat atcacaggtc gcgaactttg aactttccca ctccaatctt
1321 tcttggccac agatctcgtg tctcaaatga ggaagcagaa ggtttggttt catcacattc
1381 agatttgcta gacacaattt taaccttgat gacacattaa taaaatat
//
LOCUS HUMLBPP2A 1541 bp ss-mRNA PRI 15-AUG-1990
DEFINITION Human phosphatase 2A-beta catalytic subunit mRNA, complete cds.
ACCESSION M36511
KEYWORDS phosphatase 2A-beta catalytic subunit.
SOURCE Human lung fibroblast, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1541)
AUTHORS Hemmings,B.A., Wernet,W., Mayer,R., Maurer,F., Hofsteenge,J. and
Stone,S.R.
TITLE The nucleotide sequence of the cDNA encoding the human lung protein
phosphatase 2A-beta catalytic subunit
JOURNAL Nucleic Acids Res. 16, 11366-11366 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 22 951 phosphatase 2A-beta catalytic subunit
BASE COUNT 436 a 296 c 327 g 482 t
ORIGIN
1 ccgagcccca gcccggccgc catggacgac aaggcgttca ccaaggagct ggaccagtgg
61 gtcgagcagc tgaacgagtg taagcagctg aacgagaacc aagtgcggac gctgtgcgag
121 aaggcaaagg aaattttaac aaaagaatca aatgtgcaag aggttcgttg ccctgttact
181 gtctgtggag atgtgcatgg tcaatttcat gatcttatgg aactctttag aattggtgga
241 aaatcaccgg atacaaacta cttattcatg ggtgactatg tagacagagg atattattca
301 gtggagactg tgactcttct tgtagcatta aaggtgcgtt atccagaacg cattacaata
361 ttgagaggaa atcacgaaag ccgacaaatt acccaagtat atggctttta tgatgaatgt
421 ctgcgaaagt atgggaatgc caacgtttgg aaatatttta cagatctctt tgattatctt
481 ccacttacag ctttagtaga tggacagata ttctgcctcc atggtggcct ctctccatcc
541 atagacacac tggatcatat aagagccctg gatcgtttac aggaagttcc acatgagggc
601 ccaatgtgtg atctgttatg gtcagatcca gatgatcgtg gtggatgggg tatttcacca
661 cgtggtgctg gctacacatt tggacaagac atttctgaaa cctttaacca tgccaatggt
721 ctcacactgg tttctcgtgc ccaccagctt gtaatggagg gatacaattg gtgtcatgat
781 cggaatgtgg ttaccatttt cagtgcaccc aattactgtt atcgttgtgg gaaccaggct
841 gctatcatgg aattagatga cactttaaaa tattccttcc ttcaatttga cccggcgcct
901 cgtcgtggtg agcctcatgt tacacggcgc accccagact acttcctata aatttctcct
961 gggaaacctg cctttgtatg tggaagtata cctggctttt taaaatatat gtatttaaaa
1021 acaaaaagca acagtaatct atgtgtttct gtaacaaatt gggatctgtc ttggcattaa
1081 accacatcat ggaccaaatg tgccatacta atgatgagca tttagcacaa tttgagactg
1141 aaatttagta cactatgttc tagataggtc agtctaacag tttgcctgct gtatttatag
1201 taaccatttt cctttggact gttcaagcaa aaaaggtaac taactgcttc atctcctttt
1261 gcgcttattt ggaaatttta gttatagtgt ttaactggca tggattaata gagttggagt
1321 tttattttta agaaaaattc acaagctaac ttccactaat ccattatcct ttattttatt
1381 gaaatgtata attaacttaa ctgaagaaaa ggttcttctt gggagtatgt tgtcataaca
1441 tttaaagaga tttcccttca tttaaactaa attactgttt tatgttgatc tgcatatttc
1501 tgtatatttg tcatgacagt gcttgcatcc tatttggtgt g
//
LOCUS HUMPDEGA 978 bp ss-mRNA PRI 15-AUG-1990
DEFINITION Human cGMP phosphodiesterase gamma-subunit (PDEG) mRNA,
complete cds.
ACCESSION M36476
KEYWORDS cGMP phosphodiesterase gamma-subunit.
SOURCE Human retina, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 978)
AUTHORS Tuteja,N., Danciger,M., Klisak,I., Tuteja,R., Inana,G.,
Mohandas,T., Sparkes,R.S. and Farber,D.B.
TITLE Isolation and characterization of cDNA encoding the gamma-subunit
of cGMP phosphodiesterase in human retina
JOURNAL Gene 88, 227-232 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 102 365 cGMP phosphodiesterase gamma-subunit (PDEG)
mRNA < 1 978 PDEG mRNA
BASE COUNT 213 a 341 c 257 g 167 t
ORIGIN
1 ccgcactcac agcacagccc cctgagaccc gccctgcact tgaccgcagc aggagggagt
61 ccaggagcca aggttgccgc ggtgtctccg tcagcctcac catgaacctg gaaccgccca
121 aggctgagtt ccggtcagcc accagggtgg ccgggggacc tgtcaccccc aggaaagggc
181 cccctaaatt taagcagcga cagaccaggc agttcaagag caagccccca aagaaaggcg
241 ttcaagggtt tggggacgac atccctggaa tggaaggcct gggaacagac atcacagtca
301 tctgcccttg ggaggccttc aaccacctgg agctgcacga gctggcccaa tatggcatca
361 tctagcacga ggcccctgct gaagtccaga ccctccccct cctgcccact atgctaaacc
421 ctgctcagga ttcctgttga ggagatgacc tccctagccc cagatggcac ctggacacca
481 ggatgggact gcaacctcag gtctccccct acatattaat accagtcacc aggagcccac
541 cacctccctc taggatgccc cctcagggtg gccaggccct gctcaacatc tggagacaca
601 ggcccacccc tcagtcctgc ccacagagag gcttggtcgg tctccactcc cagggagaac
661 gggaagtgga ccccagcccg ggagcctgct ggaccccaga tcgtcccctc ctcccagctg
721 gaaagctagg gcaggtctcc ccagagtgct tctgcacccc agccccctgt cctgcctgta
781 aggggataca gagaagctcc ccgtctctgc atcccttccc aggggggtgc ccttagtttg
841 gacatgctgg gtagcaggac tccagggcgt gcacggtgag cagatgaggc cccaagctca
901 tcacaccagg gggccatcct tctcaataca gcccgccctt gcagtcccta tttcaaaata
961 aaattagtgt gtccttgc
//
LOCUS HUMSON3A 1449 bp ds-DNA PRI 15-AUG-1990
DEFINITION Human son3 protein gene, partial cds.
ACCESSION M36428
KEYWORDS son3 protein.
SOURCE Human placenta DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1449)
AUTHORS Berdichevskii,F.B., Chumakov,I.M. and Kiselev,L.L.
TITLE Determination of the nucleotide sequence of the son3 fragment of
the human genome: Identification of a new protein with an unusual
structure and homology with DNA-binding proteins
JOURNAL Mol. Biol. 22, 639-646 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 1449 son3 protein (AA at 1)
BASE COUNT 487 a 348 c 329 g 285 t
ORIGIN
1 cgggctctgc tcagccctaa agaaagtagt ggaggagaaa aagaagtacc tccccctcct
61 aaagagacac tgcctgattc aggattttct gccaatattg aggatattaa tgaagcagat
121 ttagtgagac cgttacttcc taaggacatg gaacgtctta caagccttag agctggcatt
181 gaaggacctt tacttgcaag tgatgttgga cgtgacagat ctgctgccag cccggttgta
241 agtagtatgc cagaaagagc ttcagagtct tcttcagagg aaaaagatga ttatgaaatt
301 tttgtaaaag ttaaggacac tcacgaaaaa agcaagaaaa ataagaaccg tgataagggg
361 gagaaagaga agaaaagaga tcctcattta agatctcgaa gtaagcgttc caaatcttct
421 gaacacaaat cacgcaagcg taccagtgaa tctcgttcta gggcaagaaa gagatcatct
481 aagtccaagt ctcatcgctc tcagacacgt tcacggtcac gttcaagacg caggaggaga
541 agcagcagat caagatcaaa gtctagagga agaagatctg tatcaaaaga gaagcgcaaa
601 agatctccaa agcacagatc caagtctagg gaaagaaaaa gaaaaagatc aagctccagg
661 gataaccgaa agacagttag agctcgaagt cgaaccccaa gtcgtcggag tcggagtcat
721 actccaagtc gtcgacgaag gtctagatct gtgggtagaa gaaggagctt tagcatttcc
781 ccaagccgcc gcagccgcac ccccagccgc cgcagccgca cccccagccg ccgcagccgc
841 acccccagcc gccgcagccg cacccccagc cgccggagcc gcacccctag ccgtcggagc
901 cgcaccccaa gccgccggag aagatcaagg tctgtggtaa gaagacgaag cttcagtatc
961 tcaccagtca gattaaggcg atcaagaaca cccttaagaa gaaggtttag cagatctccc
1021 atccgtcgta aaagatccag gtcttctgaa cgaggcagat cacccaaacg tctgacagat
1081 ttggataagg ctcaattact tgaaatagcc aaagctaatg cagctgccat gtgtgctaag
1141 gctggtgtcc ctttaccacc aaacctaaag cctgcacctc cacctactat agaagagaaa
1201 gttgctaaaa agtcaggagg agctactata gaagaactaa ctgagaaatg taaacagatc
1261 gcacagagta aagaagatga tgatgtaata gtgaataaac ctcatgtttc ggatgaagag
1321 gaagaagaac ctccttttta tcatcatccc tttaaactca gtgaacccaa acctattttt
1381 ttcaatctga atattgctgc agcaaaacca actccaccaa aaagccaggt aacattaaca
1441 aaagaattc
//
LOCUS MYXGFA 2269 bp ds-DNA VRL 15-AUG-1990
DEFINITION Myxoma virus growth factor and M-T9 genes, complete cds.
ACCESSION M15806 M35234
KEYWORDS M-T9 gene product; growth factor.
SOURCE Myxoma virus (strain Lausanne) DNA, clone pMYH-1.
ORGANISM Myxoma virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Leporipoxvirus.
REFERENCE 1 (bases 1 to 1421)
AUTHORS Upton,C., Macen,J.L. and McFadden,G.
TITLE Mapping and sequencing of a gene form myxoma virus that is related
to those encoding epidermal growth factor and transforming growth
factor alpha
JOURNAL J. Virol. 61, 1271-1275 (1987)
STANDARD full staff_review
REFERENCE 2 (bases 584 to 2269)
AUTHORS Upton,C., Macen,J.L., Wishart,D.S. and McFadden,G.
TITLE Myxoma virus and malignant rabbit fibroma virus encode a serpi
protein important for virus virulence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Computer-readable sequence for [1] kindly provided by C.Upton,
09-MAY-1987. Draft entry and computer-readable sequence for [2]
kindly submitted by C.Upton, 14-JUN-1990.
Author address: C.Upton
University of Alberta
Dept of Biochemistry
471 Med Sci Bldg
Edmonton
Alberta, CANADA T6G 2H7
email: USERCU11@ualtamts
FEATURES from to/span description
pept 204 461 growth factor
pept 717 2246 M9-R gene product
BASE COUNT 685 a 441 c 540 g 603 t
ORIGIN 1239 bp upstram of DdeI site; about 13 kb from 3' viral end.
1 ttaaacaaga tacaacatac ggacgcggct atgttctcgg aagtcataga cggtattgtc
61 gcggaagaac agcaggtgat tggatttatt cagaaaaaat gtaaatataa cacgacatac
121 tacaatgtac gtagcggcgg gtgtaaaata tccgtctatc taaccgcggc agttgttggc
181 tttgtcgcat acggaatact aaaatggtac cgagggacct agtcgcaact ctcttatgtg
241 cgatgtgtat tgtacaggca acgatgcctt cgttggataa ttatctgtat attattaaac
301 gtattaaact atgtaacgac gactataaaa actattgtct aaataacgga acctgtttca
361 ccgtagcatt aaacaatgtt tcacttaacc cgttttgtgc gtgtcatatt aactacgtgg
421 gaagccgatg tcagtttatt aatctaatta ccattaagta acccgtttta catgtataat
481 aatacatacg tatttttaga taactttaat aaataacatt gtataaactt acttatcata
541 tacggtacac ataacgaata acactacatg tttttatata tacataggtt tggaaaaaac
601 ttaatcacga acgtatcatt agacaatgac tccatctagg aggggttttg ggaactacgt
661 acacgatata ttcacatcgc gaaaacataa ataataattt tttacaacga ttcacgatgt
721 cgcgcacttt attgagattt ctggaagatg gtgcaatgag cgacgtaaca gtcgtcgccg
781 gggactcgac gtttctcggg cataaagtta ttttatctct tcactcggat tacttctatc
841 gtctgtttaa tggagacttt acctcgcccg atacggttac gctggacgcg acggacgatg
901 ccgttcgtac ggtgtttacg tatatgtacg cgggatgtga cgggttaaac gatcgtacga
961 tagacgattt acaatccatt atcgtattgg cggactacct gggtataacg aaactggtgg
1021 acgaatgcgt acgtcgtatc gtatctaaag tggacgtatt aaactgcgta ggggtatata
1081 cgtttgcgga gacgtatcat ataacggact tgcagcgggc ggccaaaacg tttttaacag
1141 aactactggg gtctaaagaa gcgttcgaag aactatccca agacgatgcg gttatcgcgt
1201 taagggaaac gcgtaacatt gtcgatagac gatccattct tagagcgatc ctgttatggg
1261 ttcgaaaatg tccagatcgt atcgaacaac taaaggtgtt agtcgccgcc gtagacgacg
1321 tagacgacga tgacaacgta tatacgatct acgagagata cgctgaagaa ctaaaggata
1381 tgatcgcgtg tccattatcc tataattgcg tcgttgtggt cgacagagat agatacgttc
1441 gcctcattaa cccagacacc ctatggagta aacgcgtgac gtacatacgt aaacgcgcca
1501 taggcgatcg attcaccgtc gtttgtatga acaacgttct atactgttta gggggtacgt
1561 tagacggggc acccacgtgt gacgtgttgg cctacgatct actgacgaac gaatacagtt
1621 taatgccgga gatgggacac tatagacgta atgcgtcggc gtgtatcgta aatggatata
1681 tatacgtcgt aggaggcgta gacgaagaaa acagattaat cggttccgta gagtactggc
1741 aacccggaat ggaggaatgg cacgacgctc cttatctaca ggcgaacgta gaaacggcta
1801 cggtgtgtta caggaacgag ttgtggatcg taggaggcac cgtggactta tatcatccca
1861 cgtttataag cgcagttaag aaattaacag acaatcgatg gatgtcgatg gaacctcttc
1921 ccgaaccacg atcgggtgct acgaccgtcg tgtataataa tcgattatac tgcataggcg
1981 gaaggataca cggtggcgcg tacacaaatc acgtctacaa ctatttagac gagtcacgta
2041 cgtgggaacg ggtaggggat atggcgaacg tacgcagaaa tcccagttgt tgtgtgtaca
2101 ataaggcgat ttacgtattg ggagggaata caaacgccgt agagaaatac aacgggtgga
2161 agtggcaaga ggtaggtaat atatccacgt atcccgcgtg taataatacc gcgtatccat
2221 ttttttatac caacgacgag atataaaacg agtatgatat acaagtcgt
//
LOCUS MYXMAP1A 2204 bp ds-DNA VRL 15-AUG-1990
DEFINITION Myxoma virus MAP1 gene, complete cds, and M-T8 gene, 5' end.
ACCESSION M35233
KEYWORDS M-T8 gene product; MAP1 gene product serpi protein.
SOURCE Myxoma virus (strain Lausanne) DNA, clone pBU-3.
ORGANISM Myxoma virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Leporipoxvirus.
REFERENCE 1 (bases 1 to 2204)
AUTHORS Upton,C., Macen,J.L., Wishart,D.S. and McFadden,G.
TITLE Myxoma virus and malignant rabbit fibroma virus encode a serpi
protein important for virus virulence
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.Upton, 14-JUN-1990.
Author address: C.Upton
University of Alberta
Dept of Biochemistry
471 Med Sci Bldg
Edmonton
Alberta, CANADA T6G 2H7
email: USERCU11@ualtamts
FEATURES from to/span description
pept 363 1472 MAP1 gene product
pept 1450 > 2204 M-T8 gene product
BASE COUNT 592 a 520 c 569 g 523 t
ORIGIN
1 ggatccgtaa caacacgtgt gtcgtagcgt atacataatg ccgtaaatga cagtcataaa
61 accatcgagt cgtcccaggc cgaggaaaaa caaaaatata aaagtaaata catacagaac
121 gagcgccatg gatctctctc cgggaagtgt ccacgagggt atcgtatatt ttaaagacgg
181 aatattcaaa gtccgcctac tcggatacga gggacacgag tgtattcttt tggactatct
241 gaactacagg caagacacgt tggatcggtt gaaggaacga ctcgtgggac gcgtgattaa
301 aacgcgagtc gttcgcgcgg acggtttata cgtggacctg cgacgttttt tttgagggtt
361 aaatgaagta tctggtcctc gtcttatgtt taacgtcgtg cgcgtgtcga gatatcggac
421 tatggacgtt ccgatacgtc tacaacgaaa gcgacaacgt cgtgttctca ccgtacggct
481 tgacctccgc gttgtccgtg ttacggatcg cggcgggcgg taacacgaaa cgagaaatag
541 acgtccccga atccgtcgtg gaggactccg acgcctttct cgcgttacgg gagttgttcg
601 tagacgcatc cgttccgtta cgtcccgagt ttacggcgga gttctcctcg cgattcaata
661 cctccgtgca acgcgtgacg tttaactcgg agaacgtcaa agacgtcatt aactcgtacg
721 ttaaggataa gacgggagga gacgtcccac gcgtattgga cgcctcccta gaccgagata
781 ctaaaatgct gctattgagc tccgttcgta tgaagacgag ctggagacac gtattcgacc
841 cttcgttcac gacggatcaa cctttttatt ccggaaacgt cacatacaag gtacgtatga
901 tgaataaaat agatacgttg aaaacggaga cgtttacgct tagaaacgtg ggatactccg
961 taacggaact gccgtataaa cggcgtcaaa cggccatgtt gctcgtcgtt ccggacgact
1021 tgggagagat cgtgcgggcc ctcgatcttt ctctagtacg cttctggata cgcaacatga
1081 ggaaagacgt gtgtcaggtg gtaatgccca agttctccgt cgaatcggtc ctggatctga
1141 gggacgccct ccagagactg ggggtgcgag acgcgttcga tccatcccgg gcggacttcg
1201 gtcaggcgtc cccgtcgaac gatctatacg tcacgaaggt gttacagacg tccaagatag
1261 aggcggacga acggggaacg acggcgtcga gcgacacagc catcaccctc atccccagga
1321 acgccctcac ggcgatcgtg gcgaacaaac cgtttatgtt tctcatctat cacaagccta
1381 caacgaccgt gttgtttatg ggaacgataa caaagggtga aaaagtaata tacgatacgg
1441 agggtcgaga tgatgtcgta tcctctgtat aaactctttt tgaagggtaa actatgcgac
1501 gtcgaaatcg tcgcggaagg caaaagcatc cgagcgcatc ggttggtgct ttccgcgtat
1561 tctaaatact tttacaactt gtttaatggg aatttcttag aaaaaaacgt agacgtaatc
1621 gacttagaag cggattataa aaccgtattt gacgtgattt attacatgta tacagaatcg
1681 atagaattac acaaagggaa taccgaatcc attttctcat tggttcatta cctacagatt
1741 aaacccctga ttaaaaaatg tatctacgag tttaacagca tcgtgaacga agaaaactgt
1801 atacgtctgt ttaagttcgc cgaattatac gacctgtccg agttgaaacg cagggcgcga
1861 tggcttatgc ccagtctcgt tatgaatgag aaagatcgcc tgcgggagat gtccttggac
1921 gacctatccc tgatgttagt ccagatacgg aacacggtcg atcgaagtat cgctttgtcg
1981 gcgatcacgg aatggataca gacaaacgtt cgcgaacgta ggagacacgc cgtccatctg
2041 gcgacgtgtt taggggatgt cccaggaacc gcatcctcca gagccgtata caaacactac
2101 atgtcggaac tacgtattcg ggttacggaa tttcaaccgg cgtatcacaa ctgcgtcgtg
2161 tacctgggag gatcgatgaa aggtcgcgtc accgccctgg atcc
//
LOCUS MZEMT2BATP 2054 bp ss-mRNA ORG 15-AUG-1990
DEFINITION Maize mitochondrial F-1-ATPase subunit-2 mRNA, complete cds.
ACCESSION M36087
KEYWORDS ATPase subunit-2.
SOURCE Maize (inbred line A188) embryo kernel mitochondrion, cDNA to mRNA.
ORGANISM Mitochondrion Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE 1 (bases 1 to 2054)
AUTHORS Ehrenshaft,M. and Brambl,R.
TITLE Respiration and mitochondrial biogenesis in germinating embryos of
maize
JOURNAL Plant Physiol. 93, 295-304 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 6 1667 F-1-ATPase subunit-2
BASE COUNT 421 a 539 c 558 g 536 t
ORIGIN
1 cggccatggc gtcccgccgg gtcgtctcct cgctcctccg ctccgcgtcc cgcctgcggg
61 ccgcctcgcc cgctgctcca cgaccgcgcg cgccaccgca ccgcccgtcc ccggccgggt
121 acctcttcaa ccgcgctgcc gcctacgcct cttccgccgc ggcccaggcg gcacctgcca
181 ccccgccgcc ggccaccggg aagaccgggg ggggcaagat caccgacgag ttcaccggcg
241 ctggcgccat cggccaggtg tgccaggtga tcggcgccgt cgttgacgtg cgcttcgatg
301 agggcctccc gcccatcctc acggcgctcg aggtgctcga caacaacatc cgcctcgtgc
361 tcgaggtggc gcagcacctt ggcgagaaca tggtgcgcac catcgctatg gacggcacgg
421 aggggctcgt ccgcggccag cgcgtcctca acactggctc ccccatcacc gtgcctgttg
481 gcagggctac ccttggacgc atcataaatg ttattggtga accgattgat gagaagggtg
541 acataaagac aaaccacttc ctccctattc atcgtgaagc ccctgctttt gttgagcagg
601 ccactgagca gcaaattctt gttactggaa tcaaggtcgt ggatcttctt gcaccctacc
661 aaaggggtgg aaagattggt ctcttcggtg gtgcaggagt gggtaaaact gtgctcatta
721 tggagttgat caacaatgtt gctaaggccc atggtggttt ctctgtgttt gctggtgttg
781 gagaacgtac ccgtgaaggt aatgatctgt acagggaaat gattgaaagt ggtgtcatta
841 agctagatga caagcagagc gaaagcaagt gtgctcttgt ttacgggcag atgaatgagc
901 ccccgggtgc tcgtgctcgt gttgggttga ctggtttgac tgttgctgaa catttccgtg
961 atgctgaagg acaagatgtg cttctgttta ttgacaacat tttccgtttt actcaggcaa
1021 actctgaggt gtctgctctt cttggacgta tcccatctgc tgtgggatac cagccaaccc
1081 ttgccactga tcttggagga ctgcaagagc gtattacgac aacaaagaag ggttctatta
1141 catctgtgca ggccatctac gtgcctgccg atgacttgac ggatcctgct cctgctacta
1201 cctttgccca tcttgatgct acaactgtgt tgtcacgaca gatctctgag cttggtattt
1261 atcctgctgt tgatccactg gattccacat caagaatgct ttctccccac gtgctgggtg
1321 aggatcacta caacactgct cgtggtgtgc agaaggttct tcagaactac aaaaatcttc
1381 aggatattat tgctatcttg ggtatggatg agctcagtga ggatgacaag ctgacagtcg
1441 cccgtgcaag aaagattcag cgtttcctga gccagccttt ccatgtcgct gaagttttca
1501 cgggtgctcc aggaaagtat gtggagctga aggaaagcgt gaagagtttc cagggtgttt
1561 tggatgggaa gtatgatgac ctccctgagc agtcattcta catggttggt ggcattgagg
1621 aagtcattgc taaggctgag aaaattgcca aggagtctgc ttcataagga ggcttcttgc
1681 ttgttcaacc ctgtacaagt tccatttttg gattttaagc gtttatttat gcttttccca
1741 gttaggcatg acgagctgga gagtccatct cctgctgaga gatgtttgtt ttacccttct
1801 ttgcttcctc caccttacac ccaaataagc aactgcagtg ccgttggttt tggctgcacc
1861 caaactacat gactgaagaa acttgtggcc tgtgtaacgc gaatccatca gaacgccaaa
1921 gttatggctt ctggttgtgg caaattatgg ttcctccctg ttcggttgag tggttgcatt
1981 ctggaggtat tgttctggac tcaggctaat gattgtgcgt gcaactgttt cggagtcatt
2041 tcaaagggtt atcc
//
LOCUS PFAMTSSU 935 bp ds-DNA ORG 15-AUG-1990
DEFINITION P.falciparum mitochondrial small subunit rRNA gene.
ACCESSION M23443
KEYWORDS small subunit ribosomal RNA.
SOURCE P.falciparum (strain C10) mitochondrial DNA.
ORGANISM Mitochondrion Plasmodium falciparum
Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
Microsporida; Haemosporina; Plasmodiidae; Plasmodium falciparum.
REFERENCE 1 (bases 1 to 935)
AUTHORS Gardner,M.J., Bates,P.A., Ling,I.T., Moore,D.J., McCready,S.,
Gunasekera,M.B.R., Wilson,R.J.M. and Williamson,D.H.
TITLE Mitochondrial DNA of the human malarial parasite Plasmodium
falciparum
JOURNAL Mol. Biochem. Parasitol. 31, 11-18 (1988)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 > 935 small subunit ribosomal RNA
BASE COUNT 376 a 91 c 132 g 336 t
ORIGIN
1 aagcttgata aagtaatatt tcttttagga agacagtatt attaaaatat tgtaaacttt
61 ttattttatt tttaaatatt gataaaaata aaaaatagta tttgctattt tctgtgccag
121 cagcagcggt aatacagaaa tgcaagcgtt attcatttta ttaggcgtaa agcgttttaa
181 ggttttatat taattttatg tttaaatatt taaattaaat ttaaaataaa ttaataaata
241 ataatataat agagtattat aaaagtatta agaatttttt gagaagtagt gaaatacaat
301 gatacaaaaa agaatatcaa aggcggaagc ataatactat ataattactg acacttaaaa
361 acgaaagcta aggtagcaaa taggattaga taccctagta gtcttagctg taaactatga
421 atattttata tttatatttt ataaatataa taactaacgt gataaatatt ccgcctgagt
481 agtatattcg caagaatgaa attcaaagga attgacggga gcttatacaa gtggtggaac
541 atgtggctta attcgatgca acacgataaa ccttaccaaa atttaacaat atttttaata
601 ttaagaaatt aatattttaa taaaatatat aggtagtgca tggctgtcgt cagttcgtgc
661 tgtgaagtgt taattttagt attataacga acgtaacctt ttataaaaaa aatttttata
721 ataaataata ataaagatta cgtcaagtca ttatgctcct tatattttgg gctgctcacg
781 tgttacataa aatattacaa tattttatta tatgttaaat ataataatta aaatatattt
841 atagttcaga ttataaattg aaactcattt atataaagat ggaatcacta gtaatcgcta
901 atcagaatta tagcggtgaa taagttctta agctt
//
LOCUS PSEALGR3A 120 bp ds-DNA BCT 15-AUG-1990
DEFINITION P.aeruginosa alginate synthesis regulatory protein (algR3) gene,
5' end.
ACCESSION M35259
KEYWORDS alginate synthesis regulatory protein.
SOURCE P.aeruginosa (strain 8882) DNA.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 120)
AUTHORS Kato,J., Misra,T.K. and Chakrabarty,A.M.
TITLE AlgR3, a protein resembling eukaryotic histone H1, regulates
alginate synthesis in Pseudomonas aeruginosa
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2887-2891 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 70 > 120 alginate synthesis regulatory protein (algR3)
mRNA 45 > 120 algR3 mRNA
BASE COUNT 24 a 39 c 37 g 20 t
ORIGIN
1 cgaacccgtt ggcgagaggg ggtttgcggg tctagtatgg gcgcaaccac gtccgcctgg
61 aggcacgtca tgtcggccaa caagaagccc gtcaccaccc ccttgcacct gttgcagcaa
//
LOCUS STYOMPH 992 bp ds-DNA BCT 15-AUG-1990
DEFINITION S.typhimurium cationic 16 kD outer membrane protein (ompH) gene,
complete cds.
ACCESSION J05101 M36486
KEYWORDS ompH gene; outer membrane protein.
SOURCE S.typhimurium (strain LT2 subline, isolate SH5014) DNA, clones
pUCHS[14,16].
ORGANISM Salmonella typhimurium
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 992)
AUTHORS Koski,P., Rhen,M., Kantele,J. and Vaara,M.
TITLE Isolation, cloning, and primary structure of a cationic 16 kDa
outer membrane protein of Salmonella typhimurium
JOURNAL J. Biol. Chem. 264, 18973-18980 (1989)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 992)
AUTHORS Koski,P., Hirvas,L. and Vaara,M.
TITLE Complete sequence of the ompH gene encoding the 16-kDa cationic
outer membrane protein of Salmonella typhimurium
JOURNAL Gene 88, 117-120 (1989)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.S.Vaara 02-SEP-1989.
FEATURES from to/span description
pept 311 796 cationic outer membrane protein precursor (gtg
start codon)
sigp 311 370 cationic outer membrane protein signal peptide
matp 371 793 cationic outer membrane protein
signal 142 147 -35 region
signal 165 170 -10 region
signal 854 879 transcription termination signal
binding 293 305 ribosome binding site
BASE COUNT 281 a 224 c 260 g 227 t
ORIGIN 334 bp upstream of PstI site.
1 gatccgtcat ctgcgccgtc agatgtaccg gattacagcg atccaggcaa catccgtatg
61 tccgcgggta tcgcattaca atggatgtcc cattggggcc gttggtcttc tcctacgccc
121 agccgtttaa aaagtacgat ggagacaaag ccgagcagtt ccagtttaac attggtaaaa
181 cctggtaatt gttcactgca aaggaatgca ttggtagtgt agcgatgact tttggcgatg
241 cccccaggga tcgccaggcc acgcaaagag ctgtaccttc gggtgcaaat gggatggtaa
301 ggagtttatt gtgaaaaagt ggttattagc tgcaggtctt ggtttggcga tggtaacgtc
361 cgcacaggct gctgacaaaa ttgcaatcgt caacatgggt aatctgttcc aacaggttgc
421 gcagaagacg ggtgtatcca atacactgga aaacgaattt aaaggccgtg cggctgaact
481 gcaaaaaatg gaaaccgatc tgcaatctaa aatgcagcgt ctgcaatcca tgaaagcagg
541 tagcgatcgt actaagctgg aaaaagacgt gatgtctcag cgccagactt tcgcacaaaa
601 agcgcaggct tttgagaaag atcgcgctcg tcgttccaac gaagaacgca acaaactggt
661 gactcgtatc cagactgcgg tgaaaaaagt ggctaacgac cagagtatcg atctggtggt
721 agacgcaaac accgttgctt acaacagcag cgatgtgaaa gacatcaccg ctgacgtact
781 gaaacaggtt aaataagtaa tgcccttcaa ttcgactggc tgacttagca gaacagttgg
841 atgcagaatt acacggtgat ggcgatatcg tcatcaccgg cgttgcgtcc atgcaatgtg
901 caacaacagg ccacattacg tttatggtga atcctaagta ccgtgaacac ttaggtttat
961 gccaggcttc tgcggttgtc atgacgcagg ac
//
LOCUS SIVAGM155 9794 bp ds-DNA VRL 15-AUG-1990
DEFINITION Simian immunodeficiency virus (SIV), complete genome.
ACCESSION M29975
KEYWORDS .
SOURCE Simian immunodeficiency virus (isolate 155) proviral DNA, clone 4.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9794)
AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
Olmstead,R.A. and Hirsch,V.M.
TITLE Simian immunodeficiency viruses from African green monkeys display
unusual genetic diversity
JOURNAL J. Virol. 64, 1086-92 (1990)
STANDARD full staff_entry
COMMENT Kindly submitted prior to publication and in a computer-readable
form by Phillip Johnson, Georgetown University, Rockville MD (301-
496-2976). The 155 isolate is from a monkey imported from Kenya.
FEATURES from to/span description
pept 931 2493 gag polyprotein
pept 2199 5342 pol polyprotein
pept 5260 5958 vif protein
pept 5741 6100 vpx protein
pept 6051 6268 tat protein, exon 2 (first expressed exon)
8492 8633 tat protein, exon 3 (AA at 8493)
pept 6208 6268 rev protein, exon 2 (first expressed exon)
8492 8700 rev protein, exon 3 (AA at 8494)
pept 6275 8581 env polyprotein
pept 8724 9416 nef protein
LTR 1 726 5' LTR
LTR 9070 9794 3' LTR
rpt 1 625 R repeat 5' copy
rpt 9578 9794 R repeat 3' copy
binding 727 744 primer (Lys-tRNA) binding site
signal 9675 9680 poly-A signal
BASE COUNT 3321 a 1905 c 2450 g 2118 t
ORIGIN
1 tggatgggat ttattactcc gataggagaa ataagatcct taatctgtat gccctcaatg
61 aatggggaat cattgatgat tggaacgcat ggtcaaaagg acctgggata agatacccga
121 ggtgctttgg cttctgcttc aagctagtac cggttgccct gcatgaggaa gcagaaacat
181 gtgaaaggca ttgcttggta cacccagcac aactgcatga agaccctgat ggtataaatc
241 atggagaaat attggcatgg aagtttgatc caatgttggc tgttcagtac gacccctcaa
301 gggagtactt tacagactta tattcaacag ttggtacagg aaactagccg accacaggct
361 tgcggtttcc tggttgccta ggagatgaca ttaagaactg ctgacgggac tttccagcac
421 gggactttcc aaggcgggac atgggcggta cggggagtgg ctttaccctc agagctgcat
481 aaaagcagat gctcgctggc ttgtaactca gtctcttact aggagaccag cttgagcctg
541 ggtgttcgct ggttagccta acctggttgg ccaccagggg taaggactcc ttggcttaga
601 aagctaataa acttgcctgc attagagctt atctgagtca agtgccctca ttaatgcctc
661 actcttgaac gggagaagtt ccttactggg ttctctctca aacccaggcg agagaaactc
721 cagcatggcg cccgaacagg gacttgagtg aaggcacgta cagctgagaa gacgtcggac
781 gcgaaggaac cgcggggtgc gacgtgaccg agaagggctc ggtgagtagg cttctcgagt
841 gccgggaaaa agctcgagcc tagttagagg actaggaagg gccgtagccg taactactct
901 gggcaagtag ggcaggcgga cgggtacgta atgggggcgg ctacctcagc actgaatagg
961 agacaattag atgaatttga gcatatacga cttcgcccga acggaaagaa aaagtatcaa
1021 attaaacatt taatatgggc aggcaagaag atggaccgct tcggcctcca tgagaagtta
1081 ttggagacag aggaaggttg taaaaagatc atagaagttc tctctcccct agaaccaaca
1141 gggtcggaag gaatgaaaag tctgtataat ctggtgtgcg tattgctttg cgtccaccaa
1201 gaaaagaaag tgaaagacac agaggaagct ttagcaatag taagacaatg ctgccaccta
1261 gtggacaaag aaaaaactgc agttacgcca cctggtggac agcagaaaaa taacacagga
1321 ggaacagcga cacctggtgg cagccaaaat tttcccgcac aacagcaagg gaatgcatgg
1381 gtgcatgtac cactttcacc tcgcacccta aatgcatggg taaaagcagt agaagagaaa
1441 aaatttgggg cagaaatagt acccatgttc caagccctct cagaaggctg caccccatat
1501 gacatcaatc agatgcttaa tgtcttagga gatcatcagg gggccttgca aatagtgaaa
1561 gaaataatta atgaggaagc agcccagtgg gatgtaaccc acccaccgcc ggcaggcccc
1621 ttgccagcgg gacagctcag ggatccgggg ggatcagata tagcagggac cactagtaca
1681 gtgcaagagc agctagagtg gatctatact gctaacccaa gggtagatgt aggggccatc
1741 tatcgaagat ggatcatcct agggttacaa aaatgtgtaa aaatgtacaa tccagtgtct
1801 gttttagata tcagacaagg gcccaaagaa ccattcaaag attatgtaga cagattctat
1861 aaagcaataa gagcagaaca agcttcagga gaagtcaaac aatggatgac agaatctttg
1921 ctcattcaga atgccaaccc agattgcaaa gtaattttga agggcctagg gatgcacccc
1981 actcttgaag aaatgctgac agcctgtcaa ggggtgggag gcccaagtta caaagccaaa
2041 gtcatggcag aaatgatgca gaacctgcag agtcagaaca tggtacagca gggaggtgga
2101 aggggaagac caagaccccc gccaaagtgt tacaactgtg gaaaatttgg ccacatgcag
2161 aggcagtgtc ctgagccaag aaaaataaaa tgtcttaaat gtggaaagcc agggcactta
2221 gcaaaagact gcaggggaca ggtgaatttt ttagggtatg gccggtggat ggggacaaaa
2281 ccaagaaatt ttcccgcagc cactcttggg gcggaaccaa gtgcgccccc tccaccgaac
2341 aactctacac cttacgaccc agcaaagaag ctcctgcagc agtatgcaga gaaagggaaa
2401 caaatgagaa atcagaacag aaacccccca gcgaacaatc cagattggaa cgagggatat
2461 tctttgaact ccctctttgg agaagaccaa taaggacctg tataatagga ggaactgccg
2521 ttaaggcatt attagataca ggggcagatg acactataat aaaggataca gatttacaat
2581 taaggggatc atggagacca aaaatagtag gaggaattgg gggagggtta aacgtaaaag
2641 aatatgataa tgtagaagta caattggaag acaagatatt aagaggaaca gtcctcatag
2701 gagcaactcc catcaatatc ataggaagaa actttttagc ccaggcagga gccaaattag
2761 tgatggggca attgtcgcag acaataccaa tcaccccggt acgcttaaag gaaggggcca
2821 gaggaccacg attgaagcaa tggccactct ctaaagaaaa aataatagcc ctgcaagaaa
2881 tttgcaaaac attagaggaa gaaggaaaat taagcagggt agggggagac aatgcataca
2941 atacaccagt attctgtata aggaaaaaag acaaatcaca gtggagaatg ctggtagatt
3001 tcagggaact caacaaagct acacaagact tctttgaagt ccaattaggt ataccccatc
3061 cagcagggtt aaagaaaatg aagcaaataa ccattataga tgtgggggat gcatattata
3121 gcataccact ggatcctgag tttagaaaat acacagcttt caccatccct acggtaaaca
3181 atgagggacc aggcataaga tatcaattta attgcctacc gcagggctgg aagggatccc
3241 cgacaatttt ccaaaacaca gcatcaaaaa ttctagaaga aataaagaaa gaattaaaac
3301 agctgacgat tgtccagtac atggatgacc tctgggtagg atcacaagaa gagggtccaa
3361 agcatgatca gctagtacaa acacttagga atagattgca agaatgggga ttagaaacac
3421 cagagaaaaa ggtgcaaaga gaacctccct ttgagtggat gggatataaa ttatggcctc
3481 ataaatggaa gttacaaagt atagaattag agaagaaaga acaatggaca gtgaatgatc
3541 ttcagaaatt ggtagggaaa ttaaattggg cagcacaatt atatccagga ttgagaacaa
3601 aaaatatctg taagctactt agaggaaaga aaaatttatt agacgtggta gaatggaccc
3661 cagaggcaga agcagagtac gaagaaaaca aggagatcct aaaaacagag caagaaggta
3721 cttattatgc accagaaaaa ccccttaggg cagcagtaca gaaattagga gatgggcaat
3781 ggtcatacca attcaagcag gaaggaaaaa tcttaaaggt agggaagttc gccaaacaga
3841 aagctactca caccaatgag ttgcgtgtac tagcaggagt agtacagaaa atagggaaag
3901 aggccctagt aatttgggga caattaccca cttttgaact cccagtggag agggacacat
3961 gggaacaatg gtgggcagac tattggcaag tcagttggat acccgaatgg gactttgtca
4021 gtgttccgcc cttagtaact ttgtggtata cactgactaa ggaacccatc ccgggagagg
4081 atgtctacta tgtagatgga gcctgtaata gacagtcgaa agagggaaaa gcaggctaca
4141 taacccaaca aggcaaacaa agagtacaac agctagaaaa cacaacaaat caacaagctg
4201 aactgacagc cataaaaatg gccttggagg atagcggccc taaagtcaat atagtaacag
4261 attcacaata tgcgatgggc atattgacag cacagcccac acagagtgac tccccactag
4321 tagaacaaat aatagcacag atggtacaga aagaagccat ctatctgcaa tgggtacctg
4381 ctcataaagg tatagggggc aatgaagaaa tagacaaatt agtaagcaag ggagttagaa
4441 gaatattgtt cattggcagg atagaagaag cacaagaaga acatgatagg tatcacagta
4501 actggagaaa tctagcagac acatttggat tgccacaaat agtagctaaa gaaattgtag
4561 caatgtgccc aaaatgtcaa gtaaaagggg aaccaataca tggacaagta gatgcttcac
4621 caggagtgtg gcagatggac tgcacacata tagaaggaaa aatagtgata gtagcggtcc
4681 atgtagccag tgggtttata gaagcagagg ttatccctag ggaaacagga aaagagacag
4741 caaagttctt gttaaaaata ataggaagat ggcccatcac tcacctccat acagataatg
4801 gaccaaattt cacttctcag gaagtagctg ctatgtgctg gtggggaaag gtagaacaca
4861 caacgggggt accatataat ccacagtccc agggatctat agaaagtatg aacaaacaat
4921 tgaaagagat aattggaaaa ataagagatg actgtcaata tacagaaaca gcagtactta
4981 tggcctgcca cattcacaat tttaaaagaa agggaggaat aggggggcta acagctgcag
5041 agagactaat aaatatgata acaacacaat tagaaatcaa cactctacaa accaaaatcc
5101 aaaaaatttt gaattttaga gtctactaca gagaaggcag agatccagtg tggaagggac
5161 ctgctcgcct gatctggaaa ggagaaggcg cggtagttct caaggaaggt gaagaactga
5221 aggtagttcc gagaaggaaa gcaaaaatca taaaagacta tgagccaaga aaaacattgg
5281 gtgatgagac tcacctggaa ggtgcaggag gaagtgatca ccaaatggca ggggatagtt
5341 agatattgga tgaataaaag gaatctgaaa tgggaataca aaatgcatta tcaaatcact
5401 tgggcatggt acactatgag cagatatgta atacccctcc caggaagtgg agaaatccat
5461 gtggatatct attggcattt agctccaaaa caaggatggc tctcaactta tgcagtagga
5521 atacaatatg ttagcctagt aaatgataaa tatagaacag aattagatcc caatacagca
5581 gactccatga tacattgtca ttattttacc tgttttacag atagagccat ccaacaggca
5641 ctaaggggaa acaggttcat cttctgtcaa tttccaggag gacataaact aacaggtcag
5701 gtaccctcct tgcaatattt agcattacta gcccatcaaa atggcctcag gaagagatcc
5761 cagagaggag agaccaggag gactagaaat ttgggatctc agcagggagc cgtgggacga
5821 atggctcaga gatatggtag aagaaatcaa caacgaagcc aaactgcatt ttggccgaga
5881 actcctatac caagtatgga attattgtca ggaggaaggg gagagacagg gaagacccat
5941 agcggaaagg gcatataagt attatcgctt agttcagaaa gctctctttg tgcatttccg
6001 gtgtggatgt cgcaggagac aaccctttga gccatacgag gagaggagaa atggacaagg
6061 gggaggaaga ccaggacgtg tcccaccagg acttgattaa acaatacagg aaaccccttg
6121 agacatgtac aaataaatgc ttttgcaaaa aatgctgtta tcattgccaa ttctgcttct
6181 tacggaaagg actaggtatt acctatcatg cctttaggac cagaagaaag aagattgctt
6241 cggctgatcg cattcctgta ccgcagcagt aagtatgaca aagttcttag gaatttttat
6301 agtattagga atagggatag gaatagggat aagtacaaaa cagcagtgga taacagtgtt
6361 ctatggagta ccagtatgga aaaacagctc agtccaagct ttttgcatga cacctactac
6421 taggttgtgg gcaactacta attgcatacc agatgatcat gactatacag aagtaccact
6481 gaatataaca gagccatttg aagcatgggc agacagaaat cccttagtag cacaagcagg
6541 aagtaacatt cacctgctgt ttgaacagac attaaagccc tgtgtaaagc tatcacctct
6601 atgtatcaaa atgaattgtg tagagttaaa aggctccgca acctctaccc cagcaacctc
6661 tactacggca ggaaccaaac taccctgtgt tagaaataaa acagactcca acctacagtc
6721 atgcaacgac accatcatag aaaaggagat gaatgacgag gcagcgtcaa actgcacctt
6781 tgctatggct gggtacatta gggaccaaaa gaagaattac tcagtagtat ggaatgatgc
6841 agaaatcttt tgtaagcgta gtacatcgca taatgggaca aaagagtgct atatgatcca
6901 ctgtaatgat tcagttataa aggaagcttg tgataagaca tattgggatg aattaagact
6961 aagatattgt gctccagcag gatacgcttt gcttaaatgt aatgattggg attatgcagg
7021 atttaagcca gaatgttcta atgtttcagt agtgcattgc acaactttaa tgaatacaac
7081 agtaaccact ggtctgttat tgaatggaag ctattcagaa aatcgaaccc agatctggca
7141 aaaacatgga gtgagcaatg actcagtgtt aatcttgctc aataagcatt ataacctgac
7201 agttacatgc aaaaggccag ggaataagac agtcttgcca gtaacgataa tggcaggatt
7261 agtcttccac tcacagaagt ataatacaag actaaggcag gcctggtgcc acttccaggg
7321 caattggaaa ggagcttgga aggaagtaca agaggaaata gtaaaattac caaaagaacg
7381 gtaccaaggc accaatgata caaacaaaat ctttttgcaa agacaatttg gagacccaga
7441 agcagcaaat ctatggttca actgtcaagg ggaattcttc tactgtaaaa tggactggtt
7501 tttaaattat ctgaataatt taacagtgga tgctgatcat aatcattgta aaaacaacgc
7561 agggaaaggt cgaagtccag gtccctgtgt acagagaact tatgttgcct gccatatccg
7621 atctgtcata aatgattggt atactatatc aaagaaaaca tatgctccac caagagaagg
7681 acatttgcag tgcacgtcca cagttactgg gatgacagta gagctaaact ataataacca
7741 gaacaggaca aatgtaacat tgagtcccca gatagaaacc atctgggcgg cagaattggg
7801 cagatacaaa ttggtagaga ttacaccaat tggatttgca cccacagaag tcaggcgata
7861 cacgggaggc caagagaggc aaaaacgagt cccgttcgtg ctagggttcc taggcttctt
7921 gggagctgct gggactgcaa tgggagcagc ggcgacagcc ctgacggtcc agtctcagca
7981 tttacttgct gggatattgc agcagcagaa gaatctgctg gcggctgtgg gagctcaaca
8041 gcagatgttg aagctgacca tttggggtgt gaaaaacctc aatgcccgcg tcacagctct
8101 tgagaagtac ctggcggatc aggcacggtt aaacgcttgg gggtgcgcgt ggaaacaagt
8161 atgtcataca acagtaccct ggacgtggaa taatacacca gagtggaata atatgacctg
8221 gttggagtgg gaaaaacaga tagaaggatt ggagggcaac ataacaaaac aattggaaca
8281 ggcaagggaa caagaggaaa agaatttgga tgcttatcaa aagttgtcag actggtcgag
8341 tttttggtct tggttcgatt tttcaaaatg gctgaacatt ttaaagatag gctttttggc
8401 agtaataggc gttatagggt taagattgct ttacacatta tatacttgca tagctagggt
8461 taggcagggt tactctcctt tatctcctca gatccatatc catccgtgga agggacagcc
8521 agacaacgca ggagagccag aagaaggtgg aagaacaggc aaaagcaaat ctacgcatta
8581 gcagaaagaa tttgggggac gagacaagag gaccagttgg tgcaggcaat tgaccaattg
8641 gttcttgaca ctcagcatct ggttacacaa cagctgcctg accctccttc tcaagcttag
8701 aagcgcctgg cagtacttac aatatgggct tggggagctc aaagccgcag cacaagaagc
8761 agttaaccat ctggcgagct ttgcacgcaa cgcggcacac cagatatggc ttgcttgcag
8821 atccgcttat cgggcaatca tcaactctcc aagaagagtg cgacaagggc ttgaggaagt
8881 ccttaattag gaagagaaat ggcaacatga ctccagaagg aagacgtcta caggacgggg
8941 accaatggga tgaatggtca gatgaagaag atgaagtggg atttccagta agaccaagag
9001 tgccactaag acaaataaca tacaaacttg cagtagattt ttcgcacttt ttaaaagaaa
9061 agggaggact ggatgggatt tattactccg ataggagaaa taagatcctt aatctgtatg
9121 ccctcaatga atggggaatc attgatgatt ggaacgcatg gtcaaaagga cctgggataa
9181 gatacccgag gtgctttggc ttctgcttca agctagtacc ggttgccctg catgaggaag
9241 cagaaacatg tgaaaggcat tgcttggtac acccagcaca actgcatgaa gaccctgatg
9301 gtataaatca tggagaaata ttggcatgga agtttgatcc aatgttggct gttcagtacg
9361 acccctcaag ggagtacttt acagacttat attcaacagt tggtacagga aactagccga
9421 ccacaggctt gcggtttcct ggttgcctag gagatgacat taagaactgc tgacgggact
9481 ttccagcacg ggactttcca aggcgggaca tgggcggtac ggggagtggc tttaccctca
9541 gagctgcata aaagcagatg ctcgctggct tgtaactcag tctcttacta ggagaccagc
9601 ttgagcctgg gtgttcgctg gttagcctaa cctggttggc caccaggggt aaggactcct
9661 tggcttagaa agctaataaa cttgcctgca ttagagctta tctgagtcaa gtgccctcat
9721 taatgcctca ctcttgaacg ggagaagttc cttactgggt tctctctcaa acccaggcga
9781 gagaaactcc agca
//
LOCUS SIVAGM3 9625 bp ds-RNA VRL 15-AUG-1990
DEFINITION Simian immunodeficiency virus (SIV) proviral, complete genome.
ACCESSION M30931
KEYWORDS complete genome.
SOURCE Simian immunodeficiency virus (isolate AGM3) from African Green
monkey proviral genomic DNA.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9625)
AUTHORS Baier,M., Garber,C., Mueller,C., Cichutek,K. and Kurth,R.
TITLE Complete nucleotide sequence of a simian immunodeficiency virus
from African green monkeys: A novel type of intragroup divergence
JOURNAL Unpublished (1990); Paul-Ehrlich-Institute, 6070 Langen 1, Germany
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.Baier 20-DEC-1989.
This sequence was taken from an infectious molecular clone (used
for heterologous infection of the pigtail macaque). The 3' LTR
sequence does not appear to match the 5' LTR sequence.
FEATURES from to/span description
pept 431 1996 gag polyprotein
pept > 1687 4827 pol polyprotein (NH2 terminus uncertain)
pept 4763 5461 vif protein
pept 5244 5603 vpX protein
pept 5554 5771 tat protein, exon 2 (first expressed exon)
8013 8154 tat protein, exon 3 (AA at 8014)
pept 5711 5771 rev protein, exon 2 (first expressed exon)
8013 8221 rev protein, exon 3 (AA at 8015)
pept 5778 8411 env polyprotein
pept 8245 8934 nef protein
BASE COUNT 3324 a 1827 c 2383 g 2091 t
ORIGIN
1 cagtctctta ctaggagacc agcttgagcc tgggtgttcg ctggttagcc taacctggtt
61 ggccaccagg ggtaaggact ccttggctta gaaagctaat aaatcttcgc tgcattagag
121 cttctctgag tcaagtgccc tcattgacgc ctcactcttg aacgggtaaa acttccttac
181 tgggttctct ctcaacccag gcgagagaaa ctccagcagt ggcgcccgaa cagggacttg
241 acttgagtga aggcacgtac agctgagaag acgtcggacg cgaaggaagg cgcggggtgc
301 gacgtgacca agaagggctt ggtgagtagg cttctcgagt gccgggaaaa agctcgagcc
361 tagttagagg actaggaagg gccgtagcca taactactct gggcaagtag ggcaggcgga
421 cgggtacgca atgggggcgg ctacctcagc actaaatagg agacaattag acaaatttga
481 gcatatacga cttcgcccga ccggaaagaa aaagtaccaa attaaacatt taatatgggc
541 aggcaaggaa atggagcgct tcggcctcca tgagagatta ctagaatcag aagaaggatg
601 taagaagatc atagaagtac tctacccgct agaaccaaca gggtcggagg gcttaaaaag
661 tctgtttaac cttgtgtgcg tattgttttg cgtacacaaa gataaggaag tgaaagacac
721 agaagaagca gtagcaatag taagacaatg ctgccatcta gtggagaaag aaagaaatgc
781 agaaagaaat acaacagaga catctagtgg acaaaagaaa aatgacaagg gagtaacagt
841 gccacctggt ggcagtcaaa atttcccagc acaacaacag ggaaatgcat ggatacatgt
901 gcccttgtca ccacgcacct taaatgcgtg ggtaaaagca gtagaggaga aaaaattcgg
961 agcagaaata gtgcccatgt tccaggcttt atcagaaggg tgcacaccct atgacatcaa
1021 tcaaatgctt aatgtcctgg gagaccatca aggggcgcta caaatagtaa aagaaatcat
1081 caatgaggaa gcagcccagt gggatatagc tcacccacca ccagcaggac cattaccagc
1141 aggacaactc agagacccta gaggctctga catagcagga accaccagca cagtgcaaga
1201 acagctggaa tggatataca cagccaatcc cagagtagat gtgggtgcca tctatagaag
1261 gtggattatc ctggggttgc aaaaatgtgt aaaaatgtac aacccagtgt ctgtcttaga
1321 cataagacag gggcccaaag aagcattcaa agactacgta gataggttct acaaagcaat
1381 aagagctgag caggcctcag gagaagtaaa acagtggatg acagaatcat tactcattca
1441 gaatgctaat ccagactgta aagtcatcct aaagggcctg ggaatgcatc ccactctaga
1501 agaaatgtta actgcctgtc aaggagtggg aggaccaagt tacaaagcaa aagtgatggc
1561 agaaatgatg caaaatatgc aaagccagaa catgatgcaa cagggcggtc agagaggaag
1621 accaagaccc ccagtaaagt gttacaattg tggaaaattt ggccatatgc aaagacaatg
1681 ccctgaacca agaaagatga gatgcttgaa atgtgggaaa ccagggcatt tagcaaaaga
1741 ttgcagagga caggtaaatt ttttagggta tggccggtgg atgggagcga aacccagaaa
1801 ttttcccgcc gctactcttg gggtggagcc aactgcgccc cctccaccga gtccatacga
1861 ccctgcaaag aagctcctgc agcaatatgc agacaagggg aagcagttga gggaacaaag
1921 gaaaaaacca ccagcagtga atcccgattg gacagaggga tattctttga actccctctt
1981 tggagaagac caataaaaac agtttacata gaaggggtcc ccatcagagc attattagat
2041 acgggggcag atgataccat tataaaagaa gcagatttac aattatcagg aacatggaaa
2101 ccaaaaataa tagggggcat tggaggggga ctcaatgtaa aagagtatag tgatagggaa
2161 gtaagattgg aagacaaaat tttgagaggg accatattga taggaagcac tcccataaac
2221 ataattggaa gaaatatatt agcaccagca ggagccaaat tagtaatggg tcaactgtca
2281 gaacaaattc ccattacccc tgtgaaatta aaagaagggg ctagaggacc tttcttaaaa
2341 caatggcccc tctccaaaga aaaaataaaa gccttacagg aaatatgtga ccaattagag
2401 aaagaaggaa aaattagcaa gataggagga gagaatgcat acaacactcc agtgttttgc
2461 ataaagaaaa aagacaagtc acaatggaga atgttagtag attttaggga actaaacaaa
2521 gcaacacaag attttttcga agtacagtta ggcatacctc atccatcagg gttcgaaaag
2581 atgacggaaa taacagtatt agacataggg gatgcctatt attcaatacc attagaccca
2641 gagtttagaa agtataccgc ttttaccatt ccatcagtaa ataatcaagg gccaggtact
2701 agatatcagt tcaactgtct tccacaagga tggaagggat ccccaactat ttttcagaac
2761 acagcagctt ccattctaga agaaataaaa aaggagttaa aacccctaac cattgtgcaa
2821 tacatggatg acctatgggt agggtctcag gaagatgaat acacgcatga tcggttggta
2881 gaacaactaa gaatgaaatt aagtgcctgg ggattagaaa caccagacaa gaaagtacag
2941 aaaaaaccac cttatgagtg gatgggatac aaattgtggc cacacaagtg gcagataagc
3001 agcatagaat tagaagacaa agaagaatgg actgtaaatg atatacaaag actagtgggg
3061 aaactaaatt gggcagcaca gctttaccca ggactcagaa ctaaaaactt gtgtaaatta
3121 atcagaggaa aaaagaactt actagaaaca gtaacctgga cagaggaagc agaagcagaa
3181 tatgcagaaa acaaagagat cttaaaaacg gaacaggaag ggacctacta caaaccagga
3241 agacccatca gagcagcagt gcaaaaacta gaaggaggtc aatggagtta ccaattcaag
3301 caagagggac aagtattaaa agtaggtaaa tacacaaagc agaaaaacac tcataccaat
3361 gagttccgtg tattggcagg attagtacaa aaactttgta aagaatcttt agttatatgg
3421 ggagagttgc cagtccttga actcccaata gagagggaag tatgggaaca atggtgggct
3481 gattactggc aggtaagttg gattccagac tgggaatttg tcagtacccc acccctagta
3541 aaattatggt ataccctgac aaaagaaccc ataccaaagg aagatgtcta ctatgtggat
3601 ggagcttgta atagaaattc aagggaagga aaagcaggat atatcacaca atatgggaaa
3661 caaagggtgg aaaaattaga aaatacaaca aaccagcaag cagaattaat ggccataaaa
3721 atggcactag aagatagtgg gcctaatgta aacatagtaa cagattcaca atatgcaatg
3781 ggaatattaa ctgcccaacc cacacagagt gactcaccct taatagaaca aattatagca
3841 ctaatggtac aaaaacatca gatatacttg caatgggtac cagcagacaa agggatagga
3901 ggcaatgaag agatagataa actagtaagt caagggatga ggaaaatttt atttttagaa
3961 aaaatagaag aagcccagga ggaacatgaa aggtaccata ataattggag gaacttagca
4021 gacacttatg ggctaccaca aattgtggca aaagaaatag tagccatgtg tccaaaatgt
4081 cagataaaag gggaaccagt ccatgggcaa gtagatgcct cgccaggggt atggcaaatg
4141 gactgtacac atttagaagg caaggtaatc atagtagcag tccatgtagc cagtggattc
4201 atagaagcag aagttatacc tagagaaaca gggaaagaaa cagcaaaatt tttattaaag
4261 atactaagta gatggcccat aacccaactg catacagaca atggacccaa ttttacgtct
4321 caagaagtag cagcaatgtg ttggtgggga aaaatagaac acaccacagg tgtaccctat
4381 aaccctcaat cacaaggctc tatagagagt atgaataaac agttaaaaga aataattggg
4441 aaaataagag atgactgtca atacacagaa acagcagtac ttatggcatg ccacatccac
4501 aattttaaaa gaaagggagg aatagggggg ttaacaccgg cagagagatt aatcaatatg
4561 attactacac aattagaatt acaacaccta caaaccaaaa ttcaaaaaat tttaaatttt
4621 agagtctact acagagaagg gagagatcct gtctggaaag gaccaggaca gttaatttgg
4681 aaaggggaag gtgcagtggt catcaaagga ggtgtggaat taaaagaata cccaagaagg
4741 aaagcaaaaa ttataaagga ttatgaacca agaaaaagaa tgggtgatga gagtaacttg
4801 gaaggtgccg gaggagctga taactaaatg gcaagggata gtgaggtact ggatgaggac
4861 tagaaaatta gactggaaat atcgaatgca ctaccaaatt acatgggcat ggtacacaat
4921 gagtagatat gagatacccc tagggcaaca tggaagtata catgtagatc tatattggca
4981 tctgacacca gaaaagggat ggctatcaac atatgctgag gggatacagt atctaagcaa
5041 tagggatcct tggtatagga cagaattgga tcctgcaaca gcagatagcc tgatacatac
5101 ccattatttt acttgtttta cagaaagggc catcaggaaa gccctattgg gacagaggtt
5161 caccttctgt cagttccccg agggacacaa gaaaacagga caggtaccct ctttgcaata
5221 cttagctctc cttgcacacc aaaatggcct caggcagaga tcccagagaa gcaagaccgg
5281 gggaactaga aatatgggat ttgagcaggg agccgtggga cgaatggcta agagacatgc
5341 tagaagatat caatcaggaa gccaagatgc attttgggcg cgagctcctg ttccaagtat
5401 ggaactattg tcaggaggag ggagaaagga atcgcactcc catgctagaa agggcttata
5461 aatattataa attggtgcaa aaagctctct ttgtgcattt ccggtgtgga tgccgcagaa
5521 gacaaccctt tgaaccatac gaagaaagga gggatggaca agggggagga cgagcagggc
5581 gcgtaccacc aggacttgat tgaacaactc aaagcacccc tgaagcggtg tacaaacaag
5641 tgctattgta aatgttgctg ttatcactgt cagctttgct ttttacaaaa gggattaggt
5701 gttacctatc atgcccctag gatcagaaga aagaagattg ctccgcttga tcgctttcct
5761 gaacaaaaac agtgagtatg aagctgacat tactgatagg gatactatta atagggatag
5821 gagtagtgct taatacaagg caacaatggg tcacagtatt ttatggagta ccagtatgga
5881 aaaacagctc agtacaggct ttctgcatga cacccaccac cagactatgg gcaactacta
5941 actcgatacc agatgatcat gactacacag aggtaccatt aaacatcact gaaccatttg
6001 aagcatgggc tgacagaaac cccttagtag cacaagcagg aagtaatata cacctgctat
6061 ttgagcagac tctgaagcca tgtgtaaaat tatcaccttt gtgcattaaa atgtcctgtg
6121 tagaattgaa ctcctctgag cctaccacca ctcctaaaag taccacggcc tcaacaacca
6181 atatcacagc ctcaacaacc actttgccgt gtgtccagaa caagacaagt actgtgttag
6241 aatcatgtaa tgaaacaatc atagaaaagg aattaaatga agagcctgct tctaattgta
6301 catttgcaat ggcagggtat gtaagagatc agaaaaagaa gtattcagtg gtgtggaatg
6361 atgcagaaat catgtgtaag aagggtaaca attctaacag agaatgttat atgattcatt
6421 gtaatgattc agttataaaa gaagcctgtg ataaaacata ttgggatgag ttaagattaa
6481 ggtactgtgc cccggcaggg tttgctttat taaaatgcaa cgattatgat tatgcagggt
6541 ttaagacaaa ctgttctaat gtttcagtgg tgcattgtac taacttgata aatacaacag
6601 tgactactgg actgttgttg aatgggagct actcagagaa tcgaacccag atatggcaga
6661 aacatagagt aagcaatgac tcagtgttag tgttatttaa taaacattac aatctaacag
6721 ttacttgcaa aagaccagga aacaaaacag tcttaccagt aacaatcatg gcagggctag
6781 tgtttcattc tcagaggtac aatacaaggc tgagacaagc ttggtgtcac ttccagggca
6841 actggagagg agcctggaaa gaagtaaaaa atgaaatagt aaaattacca aaagatagat
6901 accaaggaac caatgatact gaagagattt atctgcagag actatttgga gatccagaag
6961 cagcaaattt atggtttaat tgtcaggggg aattcttcta ttgtaaaatg gattggtttc
7021 taaattacct gaataatcgt acagtagatc cggaccataa tccgtgtaat ggtacgaagg
7081 gaaaaggtaa ggcaccagga ccctgtgcac aaagaacata tgttgcttgc catatacgat
7141 ctgtcattaa tgattggtac acactatcaa ggaaaaccta tgcaccgcca agagaagggc
7201 acttgcaatg cacatccacg gtaacgggta tgtcagtgga gctaaattac aatagtaaga
7261 acaggactaa tgtaacatta agtccccaga tagaaaccat ctgggcagca gaattgggca
7321 ggtacaaatt agtagaaatt acaccaattg gcttcgcacc cacagaagta agaaggtata
7381 cgggaggtca tgacagaaca aagcgagtcc cgttcgtgct agggttccta ggcttcttag
7441 gagctgctgg gactgcaatg ggagcagcgg cgacagccct gacggtccag tctcagcatt
7501 tacttgctgg gatactgcag cagcagaaga atctgctggc ggctgtggag gctcaacagc
7561 agatgttgaa gctgaccatt tggggtgtga aaaacctcaa tgcccgcgtc acagctcttg
7621 agaagtacct agaggaccag gcgcggttga atgcttgggg gtgcgcatgg aagcaagtct
7681 gtcatacaac cgtaccgtgg cagtggaata ataggacccc tgattggaat aatatgactt
7741 ggctggaatg ggaaagacag atatcgtatt tggaaggtaa cataacaaca caattagagg
7801 aagccagagc acaggaggag aagaatttgg atgcatacca aaaattaagt agttggtcag
7861 atttctggtc ttggttcgat ttctcaaagt ggctgaacat tctaaaaata ggatttttgg
7921 atgtactagg tattatagga ttaagattgc tttatacagt atattcttgc atagctaggg
7981 ttaggcaggg ttactctcct ctttctccac agatccatat ccacccgtgg aagggacagc
8041 cagacaacgc agaagggcca ggagaaggtg gagacaagcg caagaacagc tccgagcctt
8101 ggcagaaaga atctggcaca gcagagtgga agagcaactg gtgcaagcga ttgaccaatt
8161 ggtgctcgat cagcagcatc tggctataca acagttgcct gaccctccta gttcatctta
8221 ggagcgcttt ccagtacata caatatgggc ttggggaact caaagccgca gcacaagaag
8281 cagttgtcgc tttggcacgc cttgcacaaa acgcgggcta ccagatatgg cttgcttgca
8341 gatccgctta tagggcaatc atcaactctc caagaagagt gcgacaaggc cttgaaggaa
8401 tccttaatta ggaagagaaa tggtaaaatg actccagaag gaagaaaatt acaagaagga
8461 gataaatggg atgaatggtc tgatgaagaa gatgaagtag gatttccagt aagaccaaga
8521 gtgccgctaa gacaaatgac ctataaatta gcggtggact tttcgcactt tttaaaagaa
8581 aaggggggac tggatgggat ttattactcc gacaggagga atcagatcct aaacctgtac
8641 gccctcaatg agtggggaat cattgatgat tggaatgctt ggtcagaagg accaggaatc
8701 agatacccaa gatgcttcgg cttctgcttt aaattggtac cagtagacct gcatgaggaa
8761 gcagagactt gtgagagaca ttgcctggtg catccagcac aagtgaggga agaccctgat
8821 ggaatcaacc atggagaagt cttggtctgg aagtttgatc ccatgttagc agtccaatat
8881 gaccctaaca gaaaatatct cactgacatg catgatcttg gcaagaggaa gtagctaacc
8941 gcaggcttgt ggttaagcac atcaccatgg tgatgacatt aagaactgct gacgggactt
9001 tccagcaagg gactttccag ggcgggtcat gggcggtacg gggagtggct ttaccctcag
9061 agctgcataa aagcagatgc tcgctggctt gtaactcagt ctcttactag gagaccagct
9121 tgagcctggg tgttcgctgg ttagcctaac ctggttggcc accaggggta aggactcctt
9181 ggcttagaaa gctaataaat cttcgctgca ttaggcagag acttgtgaga gacattgcct
9241 ggtgcatcca gcacaagtga gggaagaccc tgatggaatc aaccatggag aagtcttggt
9301 ctggaagttt gatcccatgt tagcagtcca atatgaccct aacagaaaat atctcactga
9361 catgcatgat cttggcaaga ggaagtagct aaccgcaggc ttgtggttaa gcacatcacc
9421 atggtgatga cattaagaac tgctgacggg actttccagc aagggacttt ccagggcggg
9481 tcatgggcgg tacggggagt ggctttaccc tcagagctgc ataaaagcag atgctcgctg
9541 gcttgtaact cagtctctta ctaggagacc agcttgagcc tgggtgttcg ctggttagcc
9601 taacctggtt ggccaccagg ggtaa
//
LOCUS SIVAGM691 683 bp ss-RNA VRL 15-AUG-1990
DEFINITION Simian immunodeficiency virus (SIV) long terminal repeat.
ACCESSION M33719
KEYWORDS .
SEGMENT 1 of 2
SOURCE Simian immunodeficiency virus (isolate ver-1 (692)) from African
green monkey proviral DNA.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 683)
AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
Olmstead,R.A. and Hirsch,V.M.
TITLE Simian immunodeficiency viruses from African green monkeys display
unusual genetic diversity
JOURNAL J. Virol. 64, 1086-92 (1990)
STANDARD full staff_entry
COMMENT Kindly submitted prior to publication in computer-readable form by
Phillip Johnson. The ver-1 isolate is from a monkey imported from
Ethiopia.
Author address:Phillip Johnson
Georgetown University
Rockville, MD
(301-496-2976)
FEATURES from to/span description
LTR 1 683 long terminal repeat
BASE COUNT 174 a 149 c 187 g 173 t
ORIGIN
1 tggatgggat ttattactcc gaaagaaggg aaaagatttt gaacctgtat gcattaaatg
61 aatggggaat catagatgat tggcaagctt atactccagg tccaggcatc agatatccaa
121 gatgctttgg gttctgtttt gaattagtgc cagtggacct tagtgaggaa gcgcaaggat
181 gtgaaaggca ctgtctggtc catcctgctc aattacagga ggatccagat ggtatctggc
241 atggagaaac attggtctgg agattcaatc ccatgctagc atgcaaggcc atgccaggag
301 tgttcaatga catgcatgca acagtgggga agtagcttgc ggttagcgcg tccgggacct
361 gtgtaccaac cagcatagca accatgctaa tgagctaggg actttccaga aggggagtgg
421 tttaaccctc agatattgta tataagcaga tgctcttggg cttgtaactc agtgctctta
481 ctaggagcca gctagagcct gggtgttcgc tggtagccta acctggactg gccctccagg
541 ggtaagagcc tccacggctt gaatgcttaa taaaccttgc ctgcattaga agtacttcga
601 gtcgtgtggt cccattgccg cctccgttca cgggaatcct caatactggg ttctctcttg
661 cccaggggag agaaactcca gca
//
LOCUS SIVAGM692 1542 bp ss-DNA VRL 15-AUG-1990
DEFINITION Simian immunodeficiency virus (SIV) gag gene, complete cds.
ACCESSION M29974
KEYWORDS .
SEGMENT 2 of 2
SOURCE Simian immunodeficiency virus (isolate ver-1 (692)) from African
green monkey proviral DNA, clone ver-1(692).
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 1542)
AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
Olmstead,R.A. and Hirsch,V.M.
TITLE Simian immunodeficiency viruses from African green monkeys display
unusual genetic diversity
JOURNAL J. Virol. 64, 1086-92 (1990)
STANDARD full staff_entry
COMMENT Kindly submitted prior to publication in computer-readable form by
Phillip Johnson. The ver-1 isolate is from a monkey imported from
Ethiopia.
Author address:Phillip Johnson
Georgetown University
Rockville, MD
(301-496-2976)
FEATURES from to/span description
pept 1 1542 gag polyprotein
BASE COUNT 532 a 299 c 405 g 306 t
ORIGIN
1 atgggttcgg gttcctcagc actgtcaggg agaaaattag accaatttga acatatacgt
61 cttcgcccga acggaaagaa aaagtaccaa ttgaaacatt taatatgggc aggcaaggaa
121 atggagcgct ttggcctcca tgaaaagttg ttagaaacag aagaggggtg taaaaagatc
181 atagaagtat tgcttccctt agaaccaacc gggtcggaag gtttaaaaag cctgttcaat
241 ttgacctgcg tcatttgctg cattcatcag gaagcgaaag tgaaagacac agaggaagca
301 gtaataagaa taaagcaaca gtgccatcta gtggacaaag gtgagaatgc agccaaagga
361 atagataaga caacaccgac acctagtggt aggagtcaaa attacccggc acaacagcag
421 aataatgtat gggtacatgt gccacttagc cccagaacat taaatgcttg ggtaaaagta
481 attgaagaaa agaaatttgg agcagagata gttcccatgt ttcaggccct gtcagaagga
541 tgtaccccat atgatgtgaa ccaaatgttg aatgttctag gagaccatca gggggccctg
601 cagatagtga aagaggtcat caatgaagaa gctgcccagt gggacattac acatccccca
661 ccagcagggc cgctcccagc agggcaattg agagatccaa gggggtcaga catagcaggg
721 actactagta ccattcaaga acaactagaa tggatttaca cagccaaccc aagaatagac
781 gtgggagcta tctataggag atgggtaata gcagggctgc aaaaatgtgt cagaatgtat
841 aatccaacag gggttctgga tataagacaa ggaccaagag aatcttttag cgattatgta
901 gatagattct acaaggccct gagagcagaa caagcctctc aggatgttaa gaattggatg
961 acagacactc tgttgattca aaatgctaac ccagagtgta aggtcattct gaaagggcta
1021 ggcatgcacc ctaccttgga agaaatgctt acggcatgcc agggagtagg gggaccccaa
1081 tacaaagcca aattgatggt agaaatgatg aatcaaatgc agggggtcaa catggtacag
1141 caagcaggaa taggaggtag agggagagga agaccagtta aatgctacaa atgtggaaaa
1201 tttgggcatg tgcagaaaaa ttgcactcaa aaagggccag tagtatgcct gaaatgtgga
1261 aaacctggcc attttgctcg agattgcaga ggagcagtaa attttttagg gtatggcagg
1321 tggatgggag caaaaccaaa aaatttttta gaacacagag cagcagtccc ctccgcccct
1381 ccaccgccgc acaacccagg ggcgtacgac gaagccactc ggcttctgga gaaatatacc
1441 caagagggag cccaacaaag gagaaaagta gagaagagct cccaagcggg gagggaggaa
1501 gaggattatt ccttgaaatc cctctttgga gaagaccaat aa
//
LOCUS SIVAGM90 723 bp ss-RNA VRL 15-AUG-1990
DEFINITION Simian immunodeficiency virus (SIV) long terminal repeat.
ACCESSION M33718
KEYWORDS .
SOURCE Simian immunodeficiency virus (isolate 90) from African green
monkey proviral DNA, PCR clone 03F.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 723)
AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
Olmstead,R.A. and Hirsch,V.M.
TITLE Simian immunodeficiency viruses from African green monkeys display
unusual genetic diversity
JOURNAL J. Virol. 64, 1086-92 (1990)
STANDARD full staff_entry
COMMENT Kindly submitted in computer-readable form prior to publication by
Phillip Johnson, Georgetown University, Rockville, MD (301-496-
2976).
Author address:Phillip Johnson
Georgetown University
Rockville, MD
(301-496-2976)
FEATURES from to/span description
LTR 1 723 long terminal repeat
BASE COUNT 187 a 160 c 202 g 174 t
ORIGIN
1 tggatgggat ttattactcc gaaaggagga atagaatcct caacctatat gctcttaatg
61 aatggggaat cattgatgat tggaatgcat ggtcagcagg accaggcata agatatcccc
121 gctgctttgg cttttgcttc aagttagtac cggtagagat gcatgaagag gcagaaacct
181 gtgagagaca ttgcttggtg catcctgcac aagtaaaaga ggaccccgat ggcatcagtc
241 atggagagac cttggtctgg aagtttgacc cctatgttag cagtgcagta tgacccaaac
301 agacagtatt tagaagacat gcatgcactg gtgaagagga agtagctaac cgcaggcttg
361 tggttaagcc gttgccgggg agatgacatt tgaaactgct gacaagggac tttccaaggg
421 actttccagg gcgggccatg ggcggtacgg ggagtggttt taccctcaga gctgcataaa
481 agcagatgct cgctggcttg taactcagtc tcttactagg agaccagctt gagcctgggt
541 gttcgctggt tagcctaacc tggttggcca ccaggggtaa ggactccttg gcttggaaag
601 ctaataaaca ttgcctgcat tagagcttat ccgagtcaag tgccctcatt gacgcctcac
661 tcaagcaggg gaaccgttcc ttactgggtt ctctctctga cccaggcgag agaaactcca
721 gca
//
LOCUS SIVMNDGB1 9215 bp ss-RNA VRL 15-AUG-1990
DEFINITION Simian immunodeficiency virus (SIV) gag, pol, vif, vpR, tat,
rev, env and nef genes.
ACCESSION M27470 X15781
KEYWORDS .
SOURCE Simian immunodeficiency virus (isolate GB1) from African mandrill.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9215)
AUTHORS Tsujimoto,H., Hasegawa,A., Maki,N., Fukasawa,M., Miura,T.,
Speidel,S., Cooper,R.W., Moriyama,E.N., Gojobori,T. and Hayami,M.
TITLE Sequence of a novel simian immunodeficiency virus from a
wild-caught African mandrill
JOURNAL Nature 341, 539-541 (1989)
STANDARD full staff_entry
COMMENT The mandrill virus is distinct from all other primate immuno-
deficiency viruses, thus it can be regarded as a type 4 virus.
There is neither a vpX nor a vpU coding sequence. The splice sites
and coding regions for tat and rev are tentative.
FEATURES from to/span description
pept 450 1958 gag polyprotein
pept < 1745 4774 pol (NH2-terminus uncertain; AA at 1745)
pept 4728 5246 vif
pept 5227 5541 vpR protein
pept 5471 5730 tat protein, exon 2 (first expressed exon)
7950 8037 tat protein, exon 3 (AA at 7951)
pept 5590 5677 rev protein, exon 2 (first expressed exon)
7950 8167 rev protein, exon 3 (AA at 7952)
pept 5661 8126 env polyprotein
pept 8170 8814 nef
binding 275 291 primer (Lys-tRNA) binding site
signal 9191 9196 poly-A signal
BASE COUNT 3323 a 1478 c 2196 g 2218 t
ORIGIN
1 ggagtctcta ctacagaggc taagggttgt atctctgagc agatcccctt agagcaagga
61 ccagagtcct gagtgactgg gtctgagcac ctcactcggg gctgatcacc tcgaggtagt
121 ggaactcctt gcttgcttgc tattgtcttc aataaagtaa cttagaatta gagcaagtga
181 gtaagtgtta tccattgtgc gcctctcttc taaacctgtt gtgttctcat ttagagaaca
241 gaaggacttc tagttaaccc tagaagcctt tcagtggcgc ccgaacagga cttgaagaga
301 ggcactgaca cttgaggcag agcactccgc ctggaagaag caggttgaag gagagtggac
361 tggtctgaag acgccaggag gtgagtcagt gggactgact ttacaagaat tagttgtacc
421 ctagtgtaag gggcagcata gtcagagcaa tgggtaatgg gaactctgcc ttgttaggga
481 ctgatttgga taaatttgag aaaataagat taaagagagg tggtaaaaaa tgttatagat
541 tgaaacacct ctgttggtgt aaaggtgaat tagatagatt tggcttatcg gataaactcc
601 ttgaaacaca gcaaggatgt gaaaaaatcc tctcagtatg ttggccatta tatgaccaag
661 gatcagataa tctaaaagct ttggtaggga cagtctgtgt tgtagcctgc atacacgcag
721 gtatagaaat taagagcaca caagatgctt taaaaaaatt aaaagtcata acaagaaagg
781 aagaaaagca ggaggatgaa agtaagaatt tccctgtaca aagggatgca gcaggacagt
841 atcagtatac tccaataagt cctaggatta tacagacatg ggtaaaaaca gtggaagaaa
901 agaagtggaa accggaggtc atccctctat tctcagcatt gacagaagga gcaatcagtc
961 atgatttgaa tatcatgctg aatgcagtag gagatcatca gggagcaatg caagtcttaa
1021 aagatgtaat taatgagcaa gcagcagaat gggatctaac acatcctcaa caacaaccag
1081 cacaaccagg aggaggatta aggacccctt caggctctga tatagcagga actacttcta
1141 cagtggaaga acaattggca tggatgaata tgcaacaaaa tgcaatcaat gtaggaacaa
1201 tctataagag ttggattata ctgggcatga atagattggt aaaaagtcat tgtccaataa
1261 gtataacaga tgtaagacag ggaccaaagg aagcttttaa agactatgta gatagattct
1321 acaatgtaat gagagcagaa caagcttcag gagaagtaaa gatgtggatg cagcagcatc
1381 tgcttataga aaatgcaaac ccagaatgca agcagatttt gagaagctta gggaaaggag
1441 caactttaga ggaaatgttg gaagcatgtc agggagtagg tgggccacaa cataaagcca
1501 gattaatggc agaaatgatg agaacagtgg taggacaatc acaaaatttt gtgcagcaga
1561 gagggcctca aagaggacca gttagacaac ctactggaag gaaacctatc tgcttcaact
1621 gtaataaaga agggcatgta gcaaggttct tcaaggcccc tagaaggaaa gggtgctgga
1681 attgtggagc aatggatcat cagaaagctc aatgccctaa gccagctcag cagcagaggg
1741 ttaatttttt agggtatggc ccttggggtc cctccaaacc ggggaattat ccggcacaag
1801 aggtgactcc aacagctcca ccattagagg agaaacctct gcagaaaact ctgagcactt
1861 atcagaaatt agggagaggg ctcaggcaga agatgaagga ggagaagaga gaggaggatt
1921 ttcattccct gagtactctc tttcaagaag accaatagaa gaggtctcag tggatggtgt
1981 cactataaga gctctactag atacaggagc tgatgatacc atctttaatg aaagaaatat
2041 aaaattaaaa ggaaattggc agccaaaaat tataggggga ataggtggaa acttaagagt
2101 aaaacagtat gataatgtat atgtagaaat aagagggaag ggaacatttg ggacagtatt
2161 gataggacct actccaatag atataatagg gagaaacata atggaaaaat taggaggaaa
2221 attaatattg gcacaattgt ctgataaaat accaataaca aaagtgaaat taaaaccagg
2281 agtagatgga cccagaataa aacaatggcc tttaagtaaa gagaaaatag ttggtcttca
2341 gaaaatatgt gatagattag aggaggaagg aaaaattagt agggtagatc caggaaataa
2401 ttacaataca cctatctttg ccataaagaa gaaggataaa aatgaatgga gaaaattaat
2461 agactttaga gaattaaaca agttaacaca ggattttcat gaattacagt taggtatacc
2521 tcacccagca ggaataaaaa agtgtaaaag aataacagtc ctagatatag gggatgccta
2581 ttttagtata cctctggatc cagattatag accctatact gcctttacgg taccatcagt
2641 taataatcaa gcaccaggaa aaagatacat gtataatgtt cttcctcaag ggtggaaggg
2701 aagtccatgt atctttcaag ggacagtagc atcactgctg gaggtattta gaaagaacca
2761 tccaacagta cagttatatc aatacatgga tgatttgttt gtagggtcag actatacagc
2821 agaagagcat gagaaagcta tagtagaatt aagggcttta ttaatgacat ggaacttaga
2881 aacacctgaa aagaaatatc agaaagaacc tccctttcat tggatggggt atgagttaca
2941 cccagataag tggaagatag aaaaggttca actaccagaa ttagcagaac agccaacagt
3001 aaatgaaata cagaaattgg taggtaaatt aaattgggct gcacagttat atcctgggat
3061 caaaacaaaa caactgtgca agctaataag aggaggacta aacataacag agaaagtcac
3121 aatgacagaa gaagcaagac tggaatatga acaaaataaa gagatcttgg ctgaagaaca
3181 agaagggtct tattatgatc ctaataagga attatatgta agatttcaga aaacaacagg
3241 aggagatata tcatttcaat ggaagcaagg aaataaggtt ttaagagcag ggaaatatgg
3301 gaaacagaaa acagcacata gtaatgacct catgaaattg gcaggtgcta cgcagaaggt
3361 aggaagagaa agtatagtaa tctggggttt tgtaccaaaa atgcagatac ccactacaag
3421 ggagatatgg gaagattggt ggcatgagta ttggcagtgt acatggatac cagaagtaga
3481 atttatcagc acacctatgt tagaaaggga atggtatagc ttgtccccag aacctctaga
3541 gggggtagaa acatattatg ttgatggagc agctaacagg gacagtaaaa tgggaaaagc
3601 aggatatatt acagatagag gttttcaaag ggtagaagaa tatctaaata ccaccaatca
3661 gcagacagaa ttacatgcag taaaactagc tctagaagat agtggaagtt atgttaacat
3721 agtaacagat tcacaatatg tagtaggtat actagcaagc agacctactg aaacagatca
3781 ccccatagta aaggaaataa tagaattaat gaaaggaaaa gaaaaaattt atttaagttg
3841 gctaccagca cacaaaggga taggagggaa tgagcaaata gataagctag taagttcagg
3901 aatcagaaaa gtcttattcc tacaaaatat agaaccagca caggaagaac atgagaaata
3961 tcatagcaat gaagcacaat taagagagaa attccactta ccagctctag tagccaaaca
4021 gattgtgcaa agttgcagta agtgctgtca tcatggagag cccataaagg gacagacaga
4081 tgcttcactt ggagtctggc agatagattg cacacatctg gaaaatcaaa ttattatagt
4141 agcagtgcat gtagcttcag gcttcatgaa ggcagaagtt ataacagcag aaactggaaa
4201 aaagacagca gagtttctgt taaagttagc agcacaatgg cctattagta aactacacac
4261 agataatggg cctaacttta ctagtcagga agtagaaacc atgtgttggt ggttagggat
4321 agaacacaca tttggaatac cctataaccc acaaagtcag ggggtagtgg aaaataaaaa
4381 taagtatcta aaagaattga ttgagaaaat aagagaagat tgcaaagaat taaaaacagc
4441 agtagccatg gccacattca ttcataattt taaacaaagg ggaggactag gggggatgac
4501 agcaggagag agaatagtaa atatgatcaa tacagaatta gaatatcaat atcaacaaaa
4561 tcaaatttca aaaaatttaa attttaaggt ttacttcaga gaaggaagag atcagctgtg
4621 gaaaggacct ggtatccttt tgtggaaagg agaaggggca gtagttttaa aatatcaaga
4681 agagataaag atagtaccta gaagaaagtg taaaataata aaagattatg gagagagtgg
4741 aaagaatagt caggttaact tggaaagtgt ctagtcagag aatagaaaag tggcactggt
4801 tagtaagaag acagatggca tgggccactg caaataatga ggaaggatgt tggtggctgt
4861 atcctcattt tatggcttat aatgaatggt atacttgcag taaagtagtg attataataa
4921 atagggacat aagattaata gttagaagct attggcattt gcaaatagag gtaggatgct
4981 taagtactta tgcagtaagc atagaagcag tagttagacc gccacccttt gagaaagagt
5041 ggtgtacaga gataactcca gaggtagcag atcatctaat acatttacat ttttatgact
5101 gcttcatgga cagtgcagtt atgaaagcca tcaggggaga agaagtgtta aaagtttgta
5161 gatttccagc tggccataaa gcacaaggtg ttctctcttt gcagtttctc tgcttgagag
5221 tcatctatgg gccagaagag agatgagcaa gtatcagaag atcaaggacc tcccagagag
5281 ccatacaatc agtggctagc agatactatg gaggaaataa aggaagaagc aagaaagcac
5341 ttccctctca ttatcctaaa tgcagtatca gaatattgtg tgcaaaacac agggagtgag
5401 gaagaggcct gtgagaaatt tattacctta atgaatagag ccatttgggt ccacctagct
5461 caagggtgtg atggaacctt cagggaaaga agaccacaac tgcccccctc aggattcagg
5521 ccaagaggag atagattata agcaactgct agaagagtat tatcagcctt tgcaagcttg
5581 tgagaataaa tgctggtgca agaaatgctg ctttcattgt atgctttgct ttcaaaagaa
5641 gggtttagga ataaggtacc atgtctacag gaaacgtgta ccaggaacta ataagaagat
5701 acctggtagt ggtgaagaag ctatacgaag gtaagtatga agtgtccagg tctttttctt
5761 atactatgtt tagcctacta gtaggtatta taggaaaaca atatgtgaca gtcttctatg
5821 gagtaccagt atggaaggaa gctaaaacac atttgatttg tgctacagat aattcaagtc
5881 tctgggtaac cactaattgc ataccttcat tgccagatta tgatgaggta gaaattcctg
5941 atataaagga aaattttaca ggacttataa gggaaaatca gatagtttat caagcatggc
6001 atgctatggg aagtatgtta gataccatac ttaagccatg tgtaaagatt aacccatatt
6061 gtgttaagat gcaatgtcag gaaacagaaa atgtatcagc aacaacagct aagcctataa
6121 ctacacctac tactacatct acagttgcaa gtagtacaga gatttactta gatgtagata
6181 aaaataatac agaagaaaag gtagagagga atcatgtatg taggtataac ataacaggac
6241 tatgcaggga ttcgaaggaa gaaatagtaa caaattttag aggggatgat gtgaaatgtg
6301 aaaataatac ttgctatatg aatcattgta atgagtcagt taatacagaa gactgtcaga
6361 agggactttt gataagatgt attttaggtt gtgtgcctcc aggatatgtc atgttaagat
6421 ataatgagaa gttaaataat aataaattgt gtagcaatat atcagcagtg cagtgtactc
6481 agcacttagt agccacagta agtagctttt ttggctttaa tggaactatg cataaggaag
6541 gagaattgat acccatagat gataaatata ggggcccaga ggaatttcat caaaggaagt
6601 ttgtctataa ggtgccagga aaatatggct taaagataga atgtcacaga aaaggaaata
6661 ggtcagtagt gagtactcca tcagctacag gattattatt ttatcatggg ttagaacctg
6721 gaaagaattt aaagaaaggc atgtgcacct tcaaaggacg ttgggggtta gcactttgga
6781 gtctagctaa agaactaaat aaattaaatg actccatcaa agtgaaccag acctgtaaaa
6841 attttactag cactggagag gagaacaaac aaaacacgga caagcaaaag gagtttgcca
6901 aatgcataaa gactcttaag atagataatt atactacatc aggagataga gcagcagaaa
6961 tgatgatgat gacatgtcaa ggtgaaatgt tcttctgtaa tgtaacaaga atcatgaggg
7021 catggaatga tcctaatgag aagaagtggt atccttatgc ctcatgtcaa attaggcaaa
7081 tagtagatga ctggatgcaa gtaggaagaa agatatattt accacctaca tcaggattta
7141 ataatcacat aaggtgtaca catagggtaa cagaaatgta ctttgaaatg caaaagatag
7201 atagtaatga aacaaaaatg caaattaaat tcttgcctcc cagtgaaacc tccaatcaat
7261 ttgttgctta tggagctcat tataaattag tcaaaataat gccaattggc atagcaccta
7321 cagatgtgaa aagacacact ttacctgaac atcataaaga gaagagagga gcagtaatac
7381 ttggtatcct tggtctgctc tcgctggcag gatccgcgat gggctcagtg tcggtggcac
7441 tgactgtcca atctcagtct ttggtgactg ggatagtgga acaacaaaaa cagttgttga
7501 agctcataga gcaacagtct gaactcttaa aactcaccat atggggagta aagaatttac
7561 agactcgcct gaccagtttg gagaattata tcaaggacca agctttgctg tctcaatggg
7621 ggtgttcatg ggcacaggtg tgtcatactt ctgtagagtg gactaataca agcatcactc
7681 caaattggac atcagaaact tggaaggaat gggagacaag aactgattat ctgcaacaaa
7741 acattacaga aatgttaaaa caggcatatg atcgagagca aagaaacaca tatgaattac
7801 agaagttagg agaccttaca tcttgggcaa gttggtttga ctttacttgg tgggttcaat
7861 acttaaaatg gggagttttc ttagtgttag gaattatagg attaagaatt ttgttagcct
7921 tatggaatac aataagtagg tttaggcagg gctatcgacc tgtcttttca caggactgcc
7981 agcagaacct ataccgcaaa cggccagaca acggagaaga agaaagcaac agcttagaac
8041 taggagagca caactccgag aacttgaagg aagaatcctt aaacagatcc ttgatagagg
8101 acctgaccag ctttgccagg gagtgaccaa tttggctttg gctgaaaaat ctgagagcag
8161 caattgaata tgggttcctc gcagtccaag aagcgatcag aagcttgggt tcgctactcg
8221 tcagctttgc ggcaattagt tggagggccg gttacaccgg atggctacaa gcaaatagaa
8281 tcttcacagg gtgcagagaa gcaatcattg ctgcggggac gtgcatatgg cacatactca
8341 gaaggattag acaaagtgca gaacgacccc ttaactaaag atgagaaact tgacttaaca
8401 cagcaggatc cagaagagga ggaagaagtt ggatttcctg tgtgtcgcca agtttcctta
8461 agagtgccat catacaaaga tctgatagac ttctctcatt ttataaaaga aaagggggga
8521 ctgggaggga tatattatag caggagaaga gaagaaatcc tagatctcta tgcagagaat
8581 gagtggggat ttgaacctgg atggcaacag tatacgacag gtccaggaac cagatatcct
8641 aagacatttg gattcctgtt taagctggaa ccagtgagca gagctatagg agatgagtat
8701 gcagctaaca atcatctgtt acactcctcc cagttatgtc ctcaggaaga tccagaagga
8761 gagaccctca tgtggtctgg gaccctcatc ttgcctatga ctttgcagca ttaacatatc
8821 accctgagtg tttcaataag gctaagagta ttgaacatct gccattttgg aagaggaagt
8881 agcctaaccg caaaaccaca tcctactgca gaactgtagt tgcttggcaa cctgcttagc
8941 aacctggact ggcgcttgcg cgctaggaag ggactttcca aacagggagg gggaggctcg
9001 ccccatgctg ctatataagc agctgcattt cgcttgttcg ggagtctcta ctacagaggc
9061 taagggttgt atctctgagc agatcccctt agagcaagga ccagagtcct gagtgactgg
9121 gtctgagcac ctcactcggg gctgatcacc tcgaggtagt ggaactcctt gcttgcttgc
9181 tattgtcttc aataaagtaa cttagaatta gagca
//
LOCUS SIVMNE 9628 bp ss-RNA VRL 15-AUG-1990
DEFINITION Simian immunodeficiency virus (SIV) complete proviral genome.
ACCESSION M32741
KEYWORDS complete genome.
SOURCE Simian immunodeficiency virus from captive Macaque nemestrina
proviral DNA, clone 8.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9628)
AUTHORS Benveniste,R.E., Heidecker,G., Greenwood,J. and Gonda,M.A.
TITLE ;
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Kindly submitted in computer-readable form by R. Benveniste. The
gag protein sequence was reported in J. Virol. 62, 2587-2595, 1988.
This molecular clone, after transfection into T-cell lines,
produces infectious viral particles. In particular, clone 8 has
been inoculated intravenously into two pig-tailed macaques causing
CD4 lymphocyte depletion; see J. Virol. 62, 2091-2101, 1988. The
env cds is truncated as is the case with MM251 and MM142.
Author address:R.Benveniste
National Cancer Institute
Frederick, MD
(301-698-5836)
FEATURES from to/span description
pept 533 2053 gag polyprotein
pept < 1708 4878 pol polyprotein (NH2-terminus uncertain)
pept 4808 5452 vif protein
pept 5280 5618 vpX protein
pept 5619 5924 vpR protein
pept 5770 6065 tat protein, exon 2 (first expressed exon)
8280 8376 tat protein, exon 3 (AA at 8281)
pept 5996 6065 rev protein, exon 2 (first expressed exon)
8280 8533 rev protein, exon 3 (AA at 8282)
pept.ps 6072 8779 env protein (premature stop codon)
pept 6072 8276 env protein
8280 8717 env protein
pept 8551 9342 nef protein
site 8277 8279 env protein in-frame stop codon
BASE COUNT 3294 a 1807 c 2379 g 2148 t
ORIGIN 5' end of 5' LTR R region (putative mRNA start).
1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
61 gtagagcctg ggtgttccct gctagactct caccagcact tggccggtgc tgggcagagt
121 ggctccacgc ttgcttgctt aaagacctct tcaataaagc tgccttttag aagtaagcca
181 gtgtgtgctc ccatctctcc tagtcgccgc ctggtcaact cggtactcga taataagaag
241 accctggtct gttaggaccc tttctgcttt gggaaaccga agcaggaaaa tccctagcag
301 attggcgccc gaacagggac ttgaaggaga gtgagagact cctgagtacg gctgagtgaa
361 ggcagtaagg gcggcaggaa ccaaccacgg cggagtgctc ctagaaaggc gcgggtcggt
421 accagacggc gtgaggagcg ggagagaaga ggcctccggt tgcaggtaag tgcaacacaa
481 aaaagagata gctgtctttt atccaggaag ggataataag atagagtggg agatgggcgc
541 gagaaactcc gtcttgtcag ggaagaaagc agatgaatta gaaaaaatta ggctacgacc
601 cggcgggaag aaaaagtaca tgttgaagca tgtagtatgg gcagcaaatg aattagatag
661 atttggatta gcagaaagcc tgttggagaa caaagaagga tgtcaaaaaa tactttcggt
721 cttagctcca ttagtgccaa caggctcaga aaatttaaag agcctttata atactgtctg
781 cgtcatctgg tgcattcacg cagaagagaa agtgaaacac actgaggaag caaaacagat
841 agtgcagaga cacctagtgg tggaaacagg aacagcagaa actatgccaa aaacaagtag
901 accaacagca ccatctagtg gcagaggagg aaattaccca gtacaacaag taggtggtaa
961 ctatacccac ctaccattaa gcccgagaac attaaatgcc tgggtaaaat tgatagagga
1021 gaagaaattt ggagcagaag tagtgccagg atttcaggca ctgtcagaag gctgcacccc
1081 ctatgacatt aatcagatgt taaattgtgt gggagaacat caagcagcta tgcagattat
1141 cagagaaatt ataaacgagg aggctgcaga ttgggacttg cagcacccac aacaagctcc
1201 acaacaagga cagcttaggg agccgtcagg atcagacatt gcaggaacaa ctagtacagt
1261 agatgaacaa atccagtgga tgtacagaca acagaacccc ataccagtag gcaacattta
1321 caggagatgg atccaactgg ggttgcaaaa atgtgtcaga atgtataacc caacaagcat
1381 tctagatgta aaacaagggc caaaagagcc atttcagagc tatgtagaca ggttctacaa
1441 aagcttaaga gcagaacaaa cagatccagc agtaaagaat tggatgactc aaacactgct
1501 gattcaaaat gctaacccag attgcaagct agtgctgaag gggctgggta tgaatcccac
1561 cctagaagaa atgctgacgg cttgtcaagg agtaggagga ccaggacaaa aggcaagatt
1621 aatggcagaa gccctgaaag aggcccttgc accagggcca ctcccttttg cagcagccca
1681 acagaaggga ccaagaaagc caattaagtg ttggaattgt gggaaagagg gacactctgc
1741 aaggcaatgc agaaccccaa gaagacaggg ctgctggaaa tgtggacaaa tgggccatgt
1801 tatggccaaa tgcccagaca gacaggcagg ttttttaggc tttggcccat ggggaaagaa
1861 gccccgcaat ttccccatgg cccaaatgca tcaggggctg acgccaactg ctcccccaga
1921 ggacccagct gtggatctgc taaaaaacta catgcagttg ggcaaacagc agagagaaag
1981 caaaaggaag ccttacaagg aggtgacaga ggatttgctg cacctcaatt ctctctttgg
2041 agaagaccag tagtcactgc tcatattgag ggacagcctg cagaagtatt attagataca
2101 ggggctgatg attctattgt agcaggaata gagttaggtc cacattatac cccaaaaata
2161 gtaggaggaa taggaggttt tattaatact aaagaataca aaaatgtaaa aatagaagtt
2221 ttaggcaaaa ggattaaagg gacaatcatg acaggggaca ccccgattaa catttttggt
2281 agaaatttgc taacagctct ggagatgtct ctaaatttcc ccatagctaa ggtagagcct
2341 gtaaaagtca ccttaaagcc aggaaaagat ggaccaaaat tgaggcagtg gccattatca
2401 aaagaaaaga tagttgcatt aagagaaatc tgtgaaaaga tggaaaagga tggtcagttg
2461 gaggaagctc ccccgaccaa tccatacaac acccccacat ttgccataaa gaaaaaggac
2521 aagaacaaat ggagaatact gatagatttt agggaactaa ataaggtcac tcaggacttt
2581 acagaagtcc aattgggaat accacaccct gcaggactag caaaaaggaa gaggatcaca
2641 gtactggatg taggtgacgc atatttctcc atacctctag atgaagaatt taggcagtac
2701 actgctttta ctttaccatc agtaaataat gcagaaccag gaaaacgata catttataag
2761 gttctgcctc aggggtggaa ggggtcacca gccatcttcc aacacactat gagaaatgtg
2821 ctggaaccct tcaggaaggc aaatccagat gtgaccttag tccagtatat ggatgacatc
2881 ttagtagcta gtgacaggac agacctggaa catgacaggg tagttttaca gttaaaggaa
2941 ctcttaaata gcatagggtt ttctacccca gaagagaagt tccaaaaaga tcccccattt
3001 caatggatgg ggtatgaatt gtggccaaca aaatggaagt tgcaaaagat agagttgcca
3061 caaaaagaga cctggacagt gaatgatata cagaagttag taggagtatt aaattgggca
3121 gctcaaattt atccaggtat aaaaaccaaa catctctgta ggttaattag aggaaaaatg
3181 actctaacag aggaagttca gtggactgag atggcagagg cagaatatga ggaaaataaa
3241 ataattctca gtcaggaaca agaaggatgt tattaccaag aaggcaagcc attagaggcc
3301 acggtaataa agaatcagga caatcagtgg tcttataaga ttcaccaaga agacaaaata
3361 ctaaaagtag gaaaatttgc aaagataaaa aatacacata ccaatggagt tagactatta
3421 gcacatgtaa tacagaaaat aggaaaggaa gcaatagtga tctggggaca ggtcccaaaa
3481 ttccacttac cagttgagaa agatgtatgg gaacagtggt ggacagacta ttggcaggta
3541 acctggatac cgaaatggga ttttatctca acaccaccac tagtaagatt agtcttcaat
3601 ctggtaaagg accctataaa gggagaagaa acctattatg tagatggatc atgtaataaa
3661 cagtcaaaag aagggaaagc aggatatatc acagataggg gcaaagacaa agtaaaagtc
3721 ttagaacaga ctactaatca acaagcagaa ttggaagcat ttctcatggc attggcagac
3781 tcagggccaa aggcaaatat tatagtagat tcacaatatg ttatgggaat aataacagga
3841 tgccctacag aatcagagag caggctagtt aaccaaataa tagaagaaat gattaaaaag
3901 acagaaattt atgtagcatg ggtgccagca cacaaaggta taggaggaaa ccaagaaata
3961 gaccacctag ttagtcaagg gattagacaa gttctcttct tggaaaagat agagccagca
4021 caagaagaac atgataaata ccatagtaat gtaaaagaat tggtattcaa atttggatta
4081 cccagactag tggccaaaca gatagtagac acatgtgata aatgtcatca gaaaggagaa
4141 gctatacatg ggcaggtaaa ttcagatcta gggacttggc aaatggattg tacccatcta
4201 gagggaaaaa taatcatagt tgcagtacat gtagctagtg gattcataga agcagaagta
4261 attccacaag agacaggaag acagacagca ctatttctgt taaaattggc aagcagatgg
4321 cctattacgc atctacacac agataatggt gccaactttg cttcgcaaga agtaaagatg
4381 gttgcatggt gggcagggat agagcacacc tttggggtac catacaatcc acagagtcag
4441 ggagtagtgg aagcaatgaa tcaccatcta aaaaatcaaa tagatagaat cagggaacaa
4501 gcaaattcaa tggaaaccat agtattaatg gcagttcatt gcatgaattt taaaagaagg
4561 ggaggaatag gggatatgac tccagcagaa agattactta acatgatcac tacagaacaa
4621 gaaatacaat tccaacaatc aaaaaactca aaatttaaaa attttcgggt ctattacaga
4681 gaaggcagag atcagctgtg gaaaggacct ggtgagctat tgtggaaagg ggaaggagca
4741 gtcgtcttaa aggtagggac agacattaag gtagtaccca gaagaaaggc taagattatc
4801 aaagattatg gaggaggaaa agaggtggat agcagttccc acatggagga taccggagag
4861 gctagagagg tggcatagcc tcataaaata tctgaaatat aaaactaaag atctacaaaa
4921 ggtttgctat gtgccccatc ataaggtcgg atgggcatgg tggacctgca gcagagtaat
4981 cttcccacta caagaaaaaa gccaattaga agtacaaggg tattggaatt tgacaccaga
5041 aagagggtgg ctcagtactc atgcagtgag aataacctgg tactcaagga acttttggac
5101 agatgtaaca ccagactgtg cagacatttt actgcatagc acttatttcc cttgctttac
5161 agcgggagaa gtgagaaggg ccatcagggg agaacaactg ctgtcttgct gcaggttccc
5221 gagagctcat aagacccagg taccaagtct acagtactta gcactgagag tagtaagtta
5281 tgtcagatcc cagagagaga atcccacctg gaaacagtgg agaagagaca ataggagaag
5341 ccttcgaatg gctaaacaga acagtagagg agataaacag agaggcagta aaccacctac
5401 caagggagtt gattttccag gtttggcaaa ggtcttggga atactggcat gatgaacaag
5461 ggatgtcgca aagctatgta aagtacagat acttgtgttt aatacaaaag gctttattta
5521 tgcattgcaa gaaaggctgt agatgtctag gggaaggaca tggggcaggg ggatggagac
5581 caggacctcc tcctcctccc cctccaggac tagcataaat ggaagaaaga cctccagaag
5641 atgaaggccc acaaagggaa ccatgggatg aatgggtagt ggaggttctg gaggaactga
5701 aagaagaagc tttaaaacat tttgatcctc gcttgctaac tgcgcttggt aatcatatct
5761 ataatagaca tggagacacc cttgagggag caggagaact cattaaaatc ctccaacggg
5821 cgctcttcat gcacttcaga ggcggctgca cccactctag aatcggccaa tctggaggag
5881 gaaatcctct ctcaactata ccgccctcta gaagaatgct ataacacatg ctattgcaaa
5941 aagtgttgct accattgcca gttttgtttt cttaaaaagg gcttggggat atgttatgag
6001 cagtcacgca gaagaagaag aactccgaag aaggctaagg ctaatacatc ttctgcatca
6061 aacaagtaag tatgggatgt cttgggaatc agctgcttat cgccatcttg tttctaagtg
6121 cctatgggat ctattgcatt caatatgtca cagtctttta tggtgtacca gcttggagga
6181 atgcgacaat tcccctcttc tgtgtaacca ggaataggga tacttgggga acaactcagt
6241 gcctaccaga taatgatgat tattcagaat tggcccttaa tattacagaa agctttgatg
6301 cttgggagaa tacagtcaca gaacaggcaa tagaggatgt atggcatctc tttgagacct
6361 caataaagcc ttgtgtaaaa ttaaccccat tatgcattac tatgaaatgc aacaaaagtg
6421 agacagataa atggggattg acaaaatcat caacaacaac agcaccaaca gcaataccaa
6481 caaaagcaga ggcaataaaa gtggtcaatg agaatagtcc ttgtataaat catgataatt
6541 gcacaggctt ggaacaagag ccaatgataa gctgtaaatt caacatgaca gggttaaaaa
6601 gagacaagag aagagagtac aatgaaactt ggtactctgc agatttggtt tgtgaacaag
6661 gtaatagcac tgaaaatgaa agtagatgtt acatgaatca ctgtaacact tctgttattc
6721 aagaatcttg tgacaaacat tattgggatg ctattagatt taggtattgt gcacctccag
6781 gttatgcttt gcttagatgt aatgacacaa attattcagg ctttatgcct aactgttcta
6841 aggtggtggt ctcttcatgc acaagaatga tggagacaca gacttctact tggtttggct
6901 ttaatggaac tagagcagaa aatagaactt atatttactg gcatagcaaa gataatagga
6961 ctataattag tttgaataag tattataatc taacaatgaa atgtagaaga ccaggaaata
7021 agacagtttt accagtcacc atcatgtctg gattggtttt ccactcacaa ccaatcaatg
7081 ataggccaaa acaggcatgg tgtaggtttg aaggaaattg gaaggaggca ataaaagagg
7141 taaagcagac cattgtcaaa catcccaggt atactggaac taacaatact gataaaatca
7201 atttgacggc tcctggagga ggagatccgg aagttacctt catgtggaca aattgcagag
7261 gagagtttct ctactgtaaa atgaattggt ttctaaattg ggtagaagat aagaatctga
7321 ctggaactac ccagaagcca caggaacggc ataaaaggaa ttacgtgcca tgtcatatta
7381 gacaaataat caacacttgg cataaagtag gcagaaatgt ttatttgcct ccaagagagg
7441 gagacctcac gtgtaattcc acagtgacca gtctcatagc aaacatagat tggattgatg
7501 gaaaccaaac taatatcacc atgagtgcag aggtggcaga actgtatcga ttggaattgg
7561 gagattataa attagtagag atcactccaa ttggcttggc ccccacaaat gtgaagaggt
7621 acactactgg tggcacctca agaaataaaa gaggggtctt tgtgctaggg ttcttaggtt
7681 ttctcgcaac ggcaggttct gcaatgggcg cggcgtcgtt gacgctgacc gctcagtccc
7741 ggactttatt ggctgggata gtgcagcaac agcaacagct gttggacgtg gtcaagagac
7801 aacaagaatt gttgcgactg accgtctggg gaacaaagaa cctccagact agagtcactg
7861 ccatcgagaa gtacttaaag gaccaggcgc agctaaatgc ttggggatgt gcatttagac
7921 aagtctgcca tactactgta ccatggccaa atgcaaatct aacaccaaat tggaacaatg
7981 agacttggca agagtgggag cgaaaggttg acttcttgga ggaaaatata acggcccttt
8041 tagaagaggc acaaattcaa caagaaaaga acatgtatga attacaaaag ttgaatagct
8101 gggatgtgtt tggcaattgg tttgaccttg cttcttggat aaggtatata caatacggag
8161 tttatatagt tgtaggagta atactgttaa gaatagtgat ctatatagta caaatgctag
8221 ctaagttaag gcaagggtat aggccagtgt tctcttcccc accttcttat ttccagtaga
8281 cccatatccg acaggaccag gcactgccaa ccaaagaagg aacagaagga gacggtggag
8341 gcagcggtgg caacagctcc tggccttggc agatagaata tattcatttc ctgatccgcc
8401 aactaatacg cctcttgact tggttattca gcaactgcag aaccttgcta tcgagagcat
8461 accagatcct ccaaccaata ttccagagat tctccacgac cctacagaga atccgagaag
8521 tcctcaggac tgaactaacc tacctacaat atgggtggag ctacttccaa gaggcggtcc
8581 aagtcgcctg gagatctgcg acagagactc ttgcgggcgc gtggggagac ttatgggaga
8641 ctctgggaag agttggaaga tggatactcg caatccctag gaggatcaga caagggctcg
8701 agcttactct cttgtgaggg acagaaatac aatcagggac agtttatgaa tactccatgg
8761 aaaaacccag ctggagagag ggaaaaatta gcatacagaa aacaaaatat agatgatata
8821 gatgaagaag ataatgactt ggtaggggta ccagtgaggc cacgagttcc cttaagaata
8881 ataagttaca aattggcagt agatatgtct cattttataa aagaaaaggg gggactggaa
8941 gggatttatt acagtgaaag aagacataaa atcttagaca tgtacttaga aaaggaagaa
9001 ggcatcatgc cagattggca gaattacacc tcgggaccag gacctagata cccaaagaca
9061 tttggctggc tatggaaatt agtccctgta aatgtatcag atgaggcaca ggagggtgag
9121 gagaattatt tactgcatcc agctcaaact tcccagtggg atgacccttg gggagaggtt
9181 ctagtatgga agtttgatcc aactctagcc tacacttatg aggcatatat tagataccca
9241 gaagagtttg gaagcaagtc aggcctgtca gaggaagagg ttagaagaag gctaaccgca
9301 agaggcctct taaaaatggc tgacaagagg gaaactagct gagacagcag ggactttcca
9361 taaggggatg tcatggggag gtactgggga ggagccggtc gggaacaccc actttcttga
9421 tgtataaata tcactgcatt tcgctctgta ttcagtcgct ctgcggagag gctggcagat
9481 tgagccctgg gaggttctct ccagcactag caggtagagc ctgggtgttc cctgctagac
9541 tctcaccagc acttggccgg tgctgggcag agtggctcca cgcttgcttg cttaaagacc
9601 tcttcaataa agctgccttt tagaagta
//
LOCUS SIVAGM677 2438 bp ss-RNA VRL 15-AUG-1990
DEFINITION Simian immunodeficiency virus LTR and gag gene, complete cds.
ACCESSION M29973
KEYWORDS .
SOURCE Simian immunodeficiency virus (isolate 677,(gri-1)) from African
green monkey.
ORGANISM Simian immunodeficiency virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 2438)
AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
Olmstead,R.A. and Hirsch,V.M.
TITLE Simian immunodeficiency viruses from African green monkeys display
unusual genetic diversity
JOURNAL J. Virol. 64, 1086-92 (1990)
STANDARD full staff_entry
COMMENT Kindly submitted prior to publication by P. Johnson, Georgetown
University, Rockville MD (301-496-2976). The remainder of this
complete genomic sequence will become available later in 1990. The
gri-1 isolate is from a monkey imported from Ethiopia.
Author address:P.Johnson
Georgetown University
Rockville, MD
(301-496-2976)
FEATURES from to/span description
pept 897 2438 gag polyprotein
LTR 1 688 5' LTR
rpt 461 588 R repeat 5' copy
binding 689 706 primer (Lys-tRNA) binding site
BASE COUNT 752 a 503 c 689 g 494 t
ORIGIN
1 tggatgggat atattactct gaaagaagag aaaagatcct gaatttgtat gccttgaacg
61 agtggggaat aatagatgat tggcaagctt actcaccagg cccggggata aggtacccga
121 gagtctttgg cttctgcttt aagctagtcc cagtggacct gcatgaggag gcacgcaact
181 gtgagagaca ctgtctgatg catccagcac agatggggga agatcctgat ggaatagatc
241 atggagaagt cttggtctgg aagtttgacc cgaagttggc ggtggagtac cgcccggaca
301 tgtttaagga catgcacgaa catgcaaagc gctagtgtca gcactttgcg gttgggactt
361 tccgccaggg actttccaca gtgggtggat cggaggcggt acaggggcgg tactgggagt
421 ggctttcccc tcagagctgc ataaaagcag atgctcgctg gcttgtaact cagtctctta
481 ctaggagacc agctagagcc tgggtgttcg ctggttagcc taacccggtt ggccaccggg
541 ggtaaggact ccttggcttc atatagctca ataaacctgc tcgcttagtc gctatattgg
601 agtcaagtgc tcattgctgc gccgagcctc tagaggtgaa cctctcttac tgggttctcc
661 tgtacccagg tgggagaaac tccagcagtg gcgcccgaac agggacttga gaagaggcat
721 cggcaccgac cgctgagttg ctgagcgtcg gagagggacg actcaggtag ggtgagagcc
781 tacgagtttt ttgctaccta gtcagcgaga aaggctaggc cgcgacaggg gcgcgggtcc
841 cattagtggc aaccaaccca gttggacgaa gggttggtag gggacgggtc ggagcaatgg
901 gcgggggtca ctcagcactg tcagggagaa gcctcgacac gttcgagaag attaggctac
961 gtccgaacgg gaaaaagaag taccaaatta aacatttaat atgggcagga aaagaaatgg
1021 aacgatttgg gttacatgag aaacttttag aaacaaaaga aggctgtcaa aaaatcatag
1081 aagttttaac cccgttggaa ccgacaggct ccgaggggct aaaagctctg tttaatttgt
1141 gctgcgtcat ttggtgcatt cacgcagaac agaaagtgaa agacacagag gaagctgtag
1201 taacagttaa gcaacactac catctagtgg acaaaaatga gaaagcagct aaaaagaaaa
1261 atgagacaac agcgccacct ggtggcgaat caagaaatta cccagtagta aatcagaata
1321 atgcctgggt acaccagcct ttgtctccgc gcacgttaaa tgcgtgggtc aaatgcgtgg
1381 aggaaaaaag gtggggagca gaagtagtcc ccatgttcca agcactctca gagggatgtc
1441 tctcctatga tgtaaatcag atgctcaatg taataggaga ccatcagggg gcattacaaa
1501 ttcttaagga agtcattaat gaagaagcag cagagtggga caggacacac agaccaccag
1561 ctggcccgtt accagcaggg cagctaagag acccgacagg gtcagatata gcaggaacta
1621 ccagctcaat tcaggaacaa atagagtgga ccttcaatgc caatccaaga atagacgtag
1681 gggcacaata cagaaaatgg gttattttgg gcttacaaaa ggtagtgcag atgtacaatc
1741 cccaaaaggt cctagacatt cgacagggac ctaaagaacc cttccaggac tatgtagaca
1801 gattctataa agccctgaga gcagaacaag caccacagga tgttaaaaat tggatgacac
1861 aaactttgct tatccagaat gccaatccgg attgtaaatt gattctgaaa ggattgggaa
1921 tgaatccaac cttggaggaa atgctaatag cttgccaggg agtaggaggg ccacaacata
1981 aggctaagct aatggtagaa atgatgagta atggacagaa tatggtccaa gtgggacctc
2041 agaaaaaggg cccccgaggg ccgctaaaat gctttaattg tggcaaattt ggacatatgc
2101 aaagggaatg caaggcacca agacagatca aatgctttaa gtgcggcaaa attggccata
2161 tggcaaaaga ctgcaagaat ggacaggcaa attttttagg gtatggccat tggggaggag
2221 cgaaaccaag aaattttgtg caatacagag gagacacagt tggtctggaa ccaacagccc
2281 ccccaatgga aacagcttac gatccagcaa agaagctcct ccagcagtat gcagagaagg
2341 gacagcgcct gagagaggag agagaacaga caaggaaaca gaaggagaaa gaagtggagg
2401 atgtttcctt gagctccctc tttggaggag accaatga
//
LOCUS BOVMHDQBQ1 624 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DQ-beta gene, exon 2.
ACCESSION M30008
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein.
SEGMENT 1 of 2
SOURCE Bovine (Holstein individual 2042) DNA, clone Q1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 624)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept / 172 + 438 MHC DQ-beta cell surface glycoprotein, exon 2
(AA at 174)
pre-msg < 1 > 624 MHC DQ-beta mRNA and introns
IVS < 1 171 MHC DQ-beta intron A
IVS 439 > 624 MHC DQ-beta intron B
BASE COUNT 103 a 192 c 243 g 86 t
ORIGIN Chromosome 23.
1 cccgggttca cagcgggagg cgcagggccg ggctggagcg caacaggggt tgagaggcgg
61 cgggtttcag gtttagggac cctctggcgg cggcggcacc tccccatctg gccgagcggc
121 gccgcgtggg gctgtggggc tgagcctgac cgagcggctg tctccccgca gaggatttcg
181 tggtccagtt taagggcctg tgttacttca ccaacgggac ggagcgagtg cggctcgtgg
241 tcagacacat ctacaaccgg gaggagtacg cgcggtttga cagcgacgtg aacgagtacc
301 gggcggtgac ctctggggcg ccgcacgccg agtactggaa cagccagaag gacctcctgg
361 agcagaggcg ggccgaggtg gacagggtgt gcagacacaa ctaccaggtg gctgccccct
421 tcacctggca gcggctaggt gagtacgggc tgccctccgc gggcccgccc tccacccgag
481 actcagcgcg ggagggggcc gggtctccag ggcggggttc ccaggcccgc atagggacag
541 ggaggccggg gcttcgcgga ggggcaggga ccgacgctcc gcggaaatgg acactcgcag
601 ccctggacct ctccccgcag aggc
//
LOCUS BOVMHDQBQ2 1151 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DQ-beta gene, exons 3 and 4.
ACCESSION M30007
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein.
SEGMENT 2 of 2
SOURCE Bovine (Holstein individual 2042) DNA, clone Q1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1151)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept + 113 394 MHC DQ-beta cell surface glycoprotein, exon 3
870 / 980 MHC DQ-beta cell surface glycoprotein, exon 4
pre-msg < 1 > 980 MHC DQ-beta mRNA and introns
IVS < 1 112 MHC DQ-beta intron B
IVS 395 869 MHC DQ-beta intron C
BASE COUNT 243 a 310 c 324 g 274 t
ORIGIN Chromosome 23, about 3.7 kb after segment 1.
1 tggaatccgg ggatcttcct actctggaac cgaggaagga ctcttctcca tgggagacgt
61 gctgtgcggt ctcatgtctc actgtgtctt ttcctgtctg ttcctccctc agtggaacct
121 acagtgacca tctccccgtc caggactgag gctctaaacc accacaacct gctggtctgc
181 tcggtgacag atttctatcc gggccagatc aaggttcggt ggttccggaa tgaccgggag
241 gagacagctg gtgttgtgtc cacccctctt attaggaacg gggactggac cttccagatc
301 ctcgtgatgc tggaaatgac cccccagcga ggagatgtct acacctgccg cgtggagcac
361 cccagcctcc agagtcccat ctcagtggag tggcgtaagg gcacttggtc tcctttcact
421 gtgggcccta caggataggg cagacagagc ttcccgggtt catcccatct cacctctagt
481 ccccagcatc cctactgaaa tcagaggaca caagagtgct catacctcat agcaggggca
541 ttggaagagc ctagttacat tgtctttcca gatacgggag ctcactcaca caccatggcc
601 ccagagcccc acccagggag ctctgcagga gtgacaggtc caaggttatg catgtgtcct
661 tgaggggcag ggattggctt tctctgctta ttcaccttcc cagtctgtcc aaggatcttt
721 tgctgggtcc ctcacctggg ggtggttaga atgaagaact gagttcccct ggtacttcca
781 cttcctgtac ctcagactgg acttcaggat tctcaaggga cactgtggga tgtggagaca
841 aatgctgaca ctcaggctct gctccccagg ggcgcagtct gaatctgccc agagcaagat
901 gctgagtggt gttgggggct tcgtgctggg gctgatcttc ctcgggctgg gcctcattat
961 ccgtcacagg agccagaagg gtaaggagct ctggggacat ggggaagact ttgactggga
1021 ccttcttctc agggaggctc tagatgtagc tcttttccct gaccctgaca taaaggaggt
1081 taaggtggtg gcaggaagaa acaagcaacc tagggagaga ctgaagtctt actttactga
1141 ttgaaaggta g
//
LOCUS BOVMHDQBY1 779 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DQ-beta gene, exon 1.
ACCESSION M30006
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein.
SEGMENT 1 of 4
SOURCE Bovine (Holstein individual 2042) DNA, clone Y1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 779)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept 467 + 575 MHC DQ-beta cell surface glycoprotein, exon 1
pre-msg 467 > 779 MHC DQ-beta mRNA and introns
IVS 576 > 779 MHC DQ-beta intron A
signal 351 357 CAAT box
signal 384 391 TATA box
site 292 304 X box
site 324 333 Y box
BASE COUNT 204 a 179 c 164 g 232 t
ORIGIN Chromosome 23.
1 ggatcctgaa gggctacagt ccatggggtc gtaaagagta gaacacaact cattaattaa
61 cactttcact tttattttcc catacctcaa attctaagaa caacaggttt taaataaata
121 tcacagaaat atctactctt gaatcatttt ttttcattat ttaaactcct aaggcattca
181 atattcagat attttataac tgagagaaca ttttcatctc tatccagtgt aatttgatta
241 ggacacagtg ccaggcatta gattaagaac cttcaaaaaa aaaatgtcta cccagaaaca
301 gatgaagttt ttccgctcca ctgctgattg gtcccttttc tagggactct ccaatcttgc
361 catacatgga agctctcata ggctttttat tctgtgaagt aggctcacca gatccactgt
421 gtttgagctg tgttgactac cattagttct tcctttgttc tcaattatgt ttgggatggt
481 ggctctgcgg atccccagag ccctctggac agcagttgtg atggtgaccc tggtgatgct
541 gagcacccca ggggctgagg gcagagactc accaagtaag tgcagggcag ctgctccctg
601 gagccaccac actggggagc aggctctgag ggacccttgg gctggggtgt gatcttggga
661 tactgtcttt tatcacacat ttcctcccat tgggaatgag ggctatgtta cattctcatt
721 tccaccctct aaggacaagg tgaggacaat tcccctccca caggtttaac cctgggaat
//
LOCUS BOVMHDQBY2 977 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DQ-beta gene, exon 2.
ACCESSION M30005
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein.
SEGMENT 2 of 4
SOURCE Bovine (Holstein individual 2042) DNA, clone Y1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 977)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept + 559 + 825 MHC DQ-beta cell surface glycoprotein, exon 2
pre-msg < 1 > 977 MHC DQ-beta mRNA and introns
IVS < 1 558 MHC DQ-beta intron A
IVS 826 > 977 MHC DQ-beta intron B
BASE COUNT 191 a 264 c 338 g 182 t 2 others
ORIGIN Chromosome 23, about 0.9 kb after segment 1.
1 actggcgcaa ctgttggaag gcgatcggtg cgggcctctt cgctattagc cagctggacg
61 aaagggggat gtgctgcaag gcgattaagt tgggtaacgc cagggttttc ccagtcacga
121 cgttgtaaaa cgacgccagt gccaagctta attctacagg tcctttctca tcccttgaac
181 tctcctgttg tcgtttgtct ctgaggttcc caggagttca gggtaaaatg ggatttaatg
241 tgagaatctt ttaagtatag agatggatgc aaaatcaacc tgccgccctg tttacttgat
301 tctgagcctc tagggatcac aggtcctagg gctctctcag cgtcaggcct cctcacatcc
361 tgggagccct cagagggggc ggnaagcccg ggttcacagc gggaggcgca gggccgggct
421 ggagcggaac agggtttgag aggcggctgg tttcaggttt aaagaccccg tggcggcggc
481 ggcacctccc catctggccg agcggcgccg cgtggggctg tggggctgag cctgacagag
541 cggctgtctc ccccgcagag gatttcgtgg tccagtttat gggccagtgt tatttcacca
601 acgggacgga gcgggtgcgg tacgtgacca gatacatcta caaccaggag gagtacgcgc
661 gcttcgacag cgactggggc gagtaccggg cgctgacccg ctggcggccg gccgccgagt
721 actggaacag ccagaaggac atcctggagc agacgtgggc cgaggtggac agggtgtgca
781 gaaacaacta ccaggtggaa gcccccttca cctggcagcg gcaaggtgag tgccggnctc
841 tccgcggggc cgccctccac ccgccaggac ttcgcgcagg gagggactga gtcctccgag
901 gcggtcccca gaccctcgaa tgggacagag gggcgctgag ggacagggga ccgagggcac
961 agcgtatggg gcggggg
//
LOCUS BOVMHDQBY3 1199 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DQ-beta gene, exons 3 and 4.
ACCESSION M30004
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein.
SEGMENT 3 of 4
SOURCE Bovine (Holstein individual 2042) DNA, clone Y1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1199)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept + 129 + 410 MHC DQ-beta cell surface glycoprotein, exon 3
pept + 907 + 1017 MHC DQ-beta cell surface glycoprotein, exon 4
pre-msg 467 > 1199 MHC DQ-beta mRNA and introns
IVS < 1 128 MHC DQ-beta intron B
IVS 411 906 MHC DQ-beta intron C
IVS 1018 > 1199 MHC DQ-beta intron D
BASE COUNT 263 a 338 c 321 g 277 t
ORIGIN Chromosome 23, about 3.7 kb after segment 2.
1 atctaaatcc aagccttgga atccaacgat ctttccactc tggtatcaag gaatgactcc
61 tgcccatggg agacatgctg tgcggtctca tgtctcactg tgtcttttcc tgtctgttcc
121 tccctcagtg gaacctacag tgaccatctc cccgtccagg acagaggctc taaaccacca
181 caacctgctg gtctgctcgg tgacggattt ctatccgggc cagatcaagg ttcggtggtt
241 ccggaatgac cgggaggaga cagccggcgt tgtgtccacc cctcttatag ggaatgggga
301 ctggaccttc cagatcctcg tgatgctgga aatgaccccc cagcgaggag atgtctacac
361 ctgccgcgtg gagcacccca gcctccagag ccccatcatg gtggagtggc gtaagggcac
421 ttggtttcct ttcactgtgg gcctaccgga cagggcagac agagcttccc ctgtccatgc
481 cctctcatcc cttgtcccca gcatcactac tgaactggaa atcacaggac acaagagtgc
541 tcatgcctcc tagcacaggc atcagaagag ccaaatcaca ttgtcttttc acatacaggg
601 agctcactgt acacatcatg gccccagagc ccagcctggt agctctgtag aactgactgg
661 tgaccatagt cttaaggtct aaggttatgg aagtgtccct gagagcaggg atccactttc
721 accttctctc acctgcccac tgtgtccaaa gatctgttgg tgggtccctc ccctggggtg
781 gtcagaatgg agagccacgt tcccctgaca cctccacctc ctgtacctca gactagacct
841 caagcttcct aaaggaatac catgagatgt ggggacaaac gctgacactc gggctctgct
901 ccccaggggc acagtctgaa tctgcccaga gcaagatgct gagtggtgtt gggggcttcg
961 tgctggggct gatcttcctc gggctgggcc tcattatccg tcacaggagc cagaagggta
1021 aggaactctg gggaaatggg aagatgggct gtgattcaga ccctctgttc agatcagcct
1081 ctgcctctga atgtagctct ttcctcctga tcctgaaacg gggaggcggg gctggggatg
1141 ggaggaaatg aacaacctag ggagacattg gagtttgact ttactagttt gaaagggta
//
LOCUS BOVMHDQBY4 883 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DQ-beta gene, exon 5.
ACCESSION M30003
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein.
SEGMENT 4 of 4
SOURCE Bovine (Holstein individual 2042) DNA, clone Y1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 883)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept + 521 534 MHC DQ-beta cell surface glycoprotein, exon 5
pre-msg < 1 841 MHC DQ-beta mRNA and introns
IVS < 1 520 MHC DQ-beta intron D
site 263 276 MHC DQ-beta g/t cluster implicated to
contribute additional information to
polyadenylation
BASE COUNT 200 a 201 c 226 g 256 t
ORIGIN Chromosome 23, about 0.3 kb after segment 3.
1 tttgtgtcat gagatctttt gtagacattg tgacccctag cagaaggtgc tctatttctg
61 ttctgtgtca gtgggattgt gggacaggta aaggagggaa gggtgtgaga tgagtgtgcc
121 tgggcgcagt gtctcattca tgacctgttc cctgctatgg aatcaagagt tagggaagaa
181 gtttctgtag gaggttctgt aggaagctcc tgaggttgtt ccccagaacc aggccataac
241 tttgatggca cctttctgtg aaacttggag ccagagctct ggtttgaaag atagacacca
301 ggatatcacc tactttgtgc cacatgttgg tgcctactgc ctgtgggcat ttataagtga
361 ttgaatgtgg tagaaagaag gtgaactatc actgcaattt actaaaaaat tgaaatcttc
421 atatccctca gaaggacaac agctgcttcc tggcttccca tgcctccttg ttaggttgaa
481 tgtgcgtgcc tgtgtgctga tcactctctc tcttctacag ggctcatgcg ctgactcctg
541 aggatatttt gggattggtg tttgctcttc tataatgtgt gcctgatctt gcccggaatt
601 cccagattcc tgtcagcctg tcccactctg agatcagagt caggtcacca ggtcatttcc
661 cgtggccatc ccccaaccac ggatctggct gtgatgctgc ttcctccact gaccctggaa
721 tctctgcctg tgcgttgtca gctgaatcta ctcagatccc aaaagcttct gacatagaca
781 tcagaagggg gacggagagt gtccccgcta gtctttagcc cagtgtttag aagctattaa
841 tcagataaga gagacacctc aaggttgatg gagtttcacc agg
//
LOCUS BOVMHDRB1 459 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DR-beta gene, exon 2.
ACCESSION M30012
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein.
SEGMENT 1 of 3
SOURCE Bovine (Holstein individual 2042) DNA, clone A1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 459)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept / 21 + 290 MHC DR-beta cell surface glycoprotein, exon 2
(AA at 23)
pre-msg 21 290 MHC DR-beta mRNA and introns
IVS < 1 20 MHC DR-beta intron A
IVS 291 > 459 MHC DR-beta intron B
BASE COUNT 108 a 92 c 169 g 90 t
ORIGIN Chromosome 23.
1 gatctatcct ctctctgcag cacatttcct ggagtattct aagagcgagt gtcatttctt
61 caacgggacc gagcgggtgc ggttcctgga cagatactac actaatggag aagagaccgt
121 gcgcttcgac agcgactggg gcgagttccg ggcggtgacc gagctggggc cgcaggaccg
181 cgagtactgg aacagccaga aggacttcct ggaggagaag cgggccgagg tggacagggt
241 gtgcagacac aactacgggg gtatggagag tttcactgtg cagcggcgag gtgagcgcgg
301 gggtggactg gccagtgtgg agcagtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt
361 gtgtgagaga gagagagaga gacagagaca gagacagaga cagagataga cagacagaaa
421 cagagatact tcactcactc tggtcgagtg tgtaccgac
//
LOCUS BOVMHDRB2 427 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DR-beta gene, exon 3.
ACCESSION M30013
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein.
SEGMENT 2 of 3
SOURCE Bovine (Holstein individual 2042) DNA, clone A1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 427)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept + 71 + 352 MHC DR-beta cell surface glycoprotein, exon 3
pre-msg < 1 > 427 MHC DR-beta mRNA and introns
IVS < 1 70 MHC DR-beta intron B
IVS 353 > 427 MHC DR-beta intron C
BASE COUNT 95 a 124 c 107 g 101 t
ORIGIN Chromosome 23, about 2.7 kb after segment 1.
1 ctgaaaggca gctaaccaag gagacttact ctgttgtcct cactgattcc ctccaccttt
61 tctctcctag tggagcctac agtgactgtg tatcctgcaa agactcagcc cctgcagcac
121 cacaacctcc tggtctgctc tgtgaacggt ttctacccag gccacattga agtcaggtgg
181 ttccggaacg cccatgaaga ggaggctggg gtgatctcca caggcctgat ccagaatgga
241 gactggacct tccagaccat ggtgatgctt gaaacagttc ctcagagtgg agaggtctac
301 acctgccaag tggatcaccc cagccggacg agccctatca cagtagaatg gagtgagctt
361 tctgatctca taaatccctc acccactgtg gagggggctt gctttcctct gagtgtcccc
421 tgagtgt
//
LOCUS BOVMHDRB3 276 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DR-beta gene, exon 4.
ACCESSION M30014
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein.
SEGMENT 3 of 3
SOURCE Bovine (Holstein individual 2042) DNA, clone A1.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 276)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept + 116 / 226 MHC DR-beta cell surface glycoprotein, exon 4
pre-msg < 1 > 226 MHC DR-beta mRNA and introns
IVS < 1 115 MHC DR-beta intron C
BASE COUNT 61 a 63 c 68 g 84 t
ORIGIN Chromosome 23, about 0.35 kb after segment 2.
1 attctgattc ttccgggtag ccttctttcc tcattcccat agttcacaat ttcagcatca
61 caattagaga agagaatttg ggataaaaat gactaaaact ggcttctttt ctcaggggca
121 cggtctgact ctgctcagag caagatgatg agtggagtcg ggggcttcgt tctgggtctg
181 ctcttccttg ccgtggggct cttcatctac ttcaggaatc agaaaggtaa ggagcttgtt
241 ctttggacag ctgagcctcc ccactgactt ttggag
//
LOCUS BOVMHDRBE1 483 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DR-beta pseudogene, exon 1.
ACCESSION M30011
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein;
pseudogene.
SEGMENT 1 of 4
SOURCE Bovine (Holstein individual 2042) DNA, clone E4.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 483)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept.ps 167 + 236 pseudo-MHC DR-beta, exon 1
pre-msg < 167 > 483 pseudo-MHC DR-beta mRNA and introns
IVS 237 > 483 pseudo-MHC DR-beta intron A
signal 9 14 CAAT box
BASE COUNT 114 a 100 c 124 g 145 t
ORIGIN Chromosome 23.
1 gagctcaccc aatccaggaa caaagatatg agccatttgt tggtatcact tggaatgtgg
61 gtggaggagg gctcatgtct ttactgagtg agacttccct gctcccccac accttgtctt
121 ttcctgttct ccagcatggt gtgactgttt ccccagaggc tcctggatgg cagctctgac
181 agtgatactg atggtgatga accctcccct ggcttgggcc agggacaccc acataagtgc
241 gtacctttcc ggcgggggtg aggggggtga gctatcatgg gatgggggga aggaagggag
301 ctagctttgt cactgtattc aggccatgtc ccttaaaatt gtgacatatt cttcatacta
361 tatatagtgg ctaagctgag tctgaataat tggtaacatt ttctgatgtt catatgtaac
421 atcagtgtac cttatggtat atttcaatat ataggggaat ttattcattc acattatatt
481 gaa
//
LOCUS BOVMHDRBE2 929 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DR-beta pseudogene, exon 2.
ACCESSION M30010
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein;
pseudogene.
SEGMENT 2 of 4
SOURCE Bovine (Holstein individual 2042) DNA, clone E4.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 929)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept.ps + 228 + 500 pseudo-MHC DR-beta, exon 2
pre-msg < 1 > 929 pseudo-MHC DR-beta mRNA and introns
IVS < 1 227 pseudo-MHC DR-beta intron A
IVS 501 > 929 pseudo-MHC DR-beta intron B
BASE COUNT 203 a 209 c 302 g 215 t
ORIGIN Chromosome 23, about 5.4 kb after segment 1.
1 gtcgaccact gaagccactt ggagacctga ggggtctcct ctgcccacct tcgcctccct
61 gcactgtagg cagatgaaag aagggcccgt ggtagttcag gggtgcctgt ggagccaatg
121 agggagccct agtggccttc ctgtgcttgg gcagccctca ttggtggccg tcacatcagt
181 tccttcctgg gagcccacca ggtgaccgaa tcctggtgtg cccacagcac atttgatggt
241 gcagggcaag tccgagtgtc atttctccat ccggactgag caggtacgat tcttggccag
301 atacttctat aaccagaagg agttggtgca ttttgtcagc aacgatgtgg gtgagttcag
361 ggcagtgacc gagcggggca ggctcttcgc tgagagttgg aatcatcaga aggacttagt
421 ggagtgaacg caggctgtgg tggacacgtt ctgcagatac aactactgga ttggggagag
481 cttcatcctg cagcagcaag gtgagcacag gggtgggcgg ccaggggact ggggacagtg
541 tgtgtgtgtg tgtgtgtgtg tgagagagag agagagagac aaagagatag agagactgag
601 tcccggtgaa tgtgttgtat tatgagcaag tatgcttaag gagagttcct gtgagagcat
661 gttgcctgga gaaatgacac ttggacttgc cctgcaccat gaaatttgct gtgggaacag
721 caggattcgg tcaccctggt gggctcccag gaaggaactg atgtgacggc caccaatgac
781 gggctgccca agcacaggag ggccactagt gctccctcat tggctttaca ggcacccctc
841 aactaccatg ggttcttctt tcatctgcct gtatgacttt gtcagttatt gtgaaggaag
901 agacagtgtg tgtggtgggg ggagtacct
//
LOCUS BOVMHDRBE3 548 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DR-beta pseudogene, exon 3.
ACCESSION M30002
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein;
pseudogene.
SEGMENT 3 of 4
SOURCE Bovine (Holstein individual 2042) DNA, clone E4.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 548)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept.ps + 12 + 293 pseudo-MHC DR-beta, exon 3
pre-msg < 1 > 548 pseudo-MHC DR-beta mRNA and introns
IVS < 1 11 pseudo-MHC DR-beta intron B
IVS 294 > 548 pseudo-MHC DR-beta intron C
site 425 427 in-frame stop codon
BASE COUNT 122 a 139 c 134 g 153 t
ORIGIN Chromosome 23, about 5.8 kb after segment 2.
1 tttcctccta gtggaggatc ctacagtgac tgtgtatcct gcaaagaccc agcctctgca
61 gcaccacaac ctcctggtct gctctgtgaa tggtttctat ccaggacacg ttgaagtcag
121 gtggttccag aacggccatg aagaggctgg agtgatctcc acaggcctga tccagaatgg
181 agactggacc ttccagaccg tggtgatgct tgaaacagtt cctcagagtg gagaggtcta
241 cgcctgccaa gtggagcacc ccagccggac gagccctctc acagtggaat ggagtgagaa
301 gctttctgat ctcgtaagtt cctcacccac caagaagggg gcttgctcac ctctgagtgt
361 caggtttctc ctctctccat accatatttt ttatttgctt catgctcttt ctttcttagc
421 acaaattgtt ggggagtagc tctgtgatag cctgtgttag aaatcctctg atagtttaca
481 gatatcgttt gatagtttct atcaatacct atacctgctg gtgagacagt tcttcctggc
541 aggcagag
//
LOCUS BOVMHDRBE4 206 bp ds-DNA MAM 15-AUG-1990
DEFINITION Bovine MHC class II DR-beta pseudogene, exon 4.
ACCESSION M30009
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility protein;
pseudogene.
SEGMENT 4 of 4
SOURCE Bovine (Holstein individual 2042) DNA, clone E4.
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 206)
AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes
JOURNAL Immunogenetics 31, 37-44 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
M.A.M.Groenen, 20-NOV-1989.
FEATURES from to/span description
pept.ps + 86 / 196 pseudo-MHC DR-beta, exon 4
pre-msg < 1 > 196 pseudo-MHC DR-beta mRNA and introns
IVS < 1 85 pseudo-MHC DR-beta intron C
BASE COUNT 47 a 42 c 54 g 63 t
ORIGIN Chromosome 23, about 0.35 kb after segment 3.
1 cttccaggca accttcttct cccatcctca aaagcttagg gaagttggat tgggataaga
61 tcactgaaac ttacttcttt tctaggggca tgatctgact ctgctcagag caggatgatg
121 agtggagtca ggggctttgt tgtgggtctg ctcttccttg ggatcaggtt gttcatctac
181 tttaggaatc agaaaggtaa ggatcc
//
LOCUS VECPCE30 143 bp ds-DNA SYN 15-AUG-1990
DEFINITION Expression vector pCE30, partial sequence.
ACCESSION M36426
KEYWORDS expression vector.
SOURCE Synthetic DNA.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 143)
AUTHORS Elvin,C.M., Thompson,P.R., Argall,M.E., Hendry,P., Stamford,N.P.J.,
Lilley,P.E. and Dixon,N.E.
TITLE Modified bacteriophage lambda promoter vectors for overproduction
of proteins in Escherichia coli
JOURNAL Gene 87, 123-126 (1990)
STANDARD simple staff_entry
BASE COUNT 37 a 35 c 40 g 31 t
ORIGIN
1 agggcagcat tcaaagcaga aggctttggg gtgtgtgata cgaaacgaag cattgggatc
61 cccgggaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc
121 caacttaatc gccttgcagc aca
//
LOCUS CHKPPPTH 1723 bp ss-mRNA VRT 15-AUG-1990
DEFINITION Chicken parathyroid hormone mRNA, complete cds.
ACCESSION M36522
KEYWORDS parathyroid hormone.
SOURCE Chicken parathyroid gland, cDNA to mRNA, clones cPTH-[11,12,3].
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 1723)
AUTHORS Khosla,S., Demay,M., Pines,M., Hurwitz,S., Potts,J.T.Jr. and
Kronenberg,H.M.
TITLE Nucleotide sequence of cloned cDNAs encoding chicken
preproparathyroid hormone
JOURNAL J. Bone Miner. Res. 3, 689-698 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 128 487 parathyroid hormone precursor
sigp 128 202 parathyroid hormone signal peptide
matp 221 484 parathyroid hormone
BASE COUNT 626 a 311 c 331 g 455 t
ORIGIN
1 ttttaaagtt agatttaagg gatccactaa accaattcag tagtctttaa atatacttga
61 catcaagaca cagccatctg ctgacatacc ccaaccagaa aactgttaag gacaatatct
121 gataaaaatg acttctacaa aaaatctggc caaggccata gtgattttat atgctatatg
181 tttttttaca aactctgatg gaagaccaat gatgaagaga tcggtgagtg agatgcaatt
241 aatgcataac cttggagagc atcgacacac tgtggagaga caggactggc ttcagatgaa
301 gctgcaggat gtgcacagtg cccttgagga tgccaggacc cagaggcctc gaaacaagga
361 ggatattgtc ctgggggaga taagaaaccg gaggctgctc cctgagcatt tgcgggcagc
421 agtgcagaag aaatccattg acctggacaa agcttacatg aatgtactct ttaaaactaa
481 gccatgatga aaagaccaag agcattataa ctgtccaagt aagcacatgt ctgtagatca
541 ctgaccagtt agggcatttt atttattatt ttttttttaa ctcaaactat gataaggatt
601 aaaggctcca tgccagactg tagccccact gagatgggta tttcacaact aaatagtaaa
661 gtgtatttat aggccaccca tggccattgc tgctaactcc caggtatctt ttaaatggct
721 aatgtaactc attaacttcc aggagaatta aaaacaaatg gcaaaacaaa aaacaacaaa
781 gaccacctgc aatagaataa gaaagttgaa aaacatttaa gaccagttct accactccta
841 tatggagagc atttgtctgt aatctttaga cctactagta ctgtaaacta acaacgtaat
901 ataggcataa ctgcattatg cctagggtta aacttcaagt ttgtcctaat gaaaggaacg
961 caaacttaaa tccactctta ctttcccaag aaggcctaaa gccagaccaa tgtcagtaac
1021 atagacaaag ctgcatgata ataacttagg attaaagagt gcgaacatga aaaatagaag
1081 gaacccaaag cttaagatta aagtagaatg aaataaattg tgcatgaaaa agaagaacga
1141 agttttacaa gatactgaaa tgaaagggag gtttattaac tttccctctt aattatgagc
1201 tgtcaccttt tggaactgca ggaacagtga gagcagagat tgtagcatat atgtatgcaa
1261 agccctaact atagaactgg gaaatggttc aacacgagat aaaaacaaga cttgtttcaa
1321 ttgttatcat ctctccttca gtcaataatc tatgagtttc tgtatattgt gcttaggcca
1381 catgggtaag tggctcacat aaaattactc atcttcacat gtgcacttat acagaattgg
1441 gatttcagtt tgttaaaacc ctgaaattac aaccattaaa atatagaaat caaaacctgg
1501 gaaccatcag ttaaaatata agcaggattc agaaagaatt tgacaggaac atggatggga
1561 gaaaatgatg ataataatat agaaaagaaa gcagcaaata taaaatgatt ttgaattgta
1621 tagacaagta tgtgcttatg acctcgacca cttctgaata ataagaatat ttcccctgta
1681 gaagtgacag cagtttcctc ccaatgttcc actgtgagaa ttc
//
LOCUS CUC11SGB 1684 bp ss-mRNA PLN 15-AUG-1990
DEFINITION Pumpkin 11-S globulin beta-subunit mRNA, complete cds.
ACCESSION M36407
KEYWORDS 11-S globulin beta-subunit.
SOURCE Pumpkin (cv. Kurokawa Amakuri Nankin) cotyledon mRNA, clone
pPG-beta-2.
ORGANISM Cucurbita pepo
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Violales; Cucurbitaceae.
REFERENCE 1 (bases 1 to 1684)
AUTHORS Hayashi,M., Mori,H., Nishimura,M., Akazawa,T. and Hara-Nishimura,I.
TITLE Nucleotide sequence of cloned cDNA for pumpkin 11-S globulin beta-
subunit
JOURNAL Eur. J. Biochem. 172, 627-632 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 31 1473 11-S globulin beta-subunit precursor
sigp 31 93 11-S globulin beta-subunit signal peptide
matp 94 918 11-S globulin beta-subunit gamma-chain
matp 919 1470 11-S globulin beta-subunit delta-chain
BASE COUNT 457 a 406 c 463 g 358 t
ORIGIN
1 ctaatagccc ttctcttctc cataccagca atggctcgct cttctctttt taccttttta
61 tgtttagcag ttttcatcaa tggctgcctc tctcagattg agcagcagag cccctgggaa
121 ttccaaggca gcgaagtatg gcaacagcac cgctaccaat ctcctagagc ctgtcgtctt
181 gagaatcttc gagctcaaga ccccgttcgc cgggctgagg cggaggcgat cttcactgaa
241 gtctgggacc aggacaacga tgagttccag tgcgccggcg tcaatatgat ccgccataca
301 atccggccca aaggtctgct tcttcctggt ttctctaatg ctcctaaact catcttcgtc
361 gcccaaggct tcggtattcg cggcattgca atccccggct gtgcagagac ttaccagact
421 gatttacgaa gatcgcaatc ggccggatct gcgttcaaag accagcatca gaagatccgc
481 cccttcagag agggagatct cctcgtcgtc ccggccggag tttctcactg gatgtataat
541 cgaggacagt ccgatctcgt tttgatcgta ttcgctgaca ctcgcaacgt cgcaaaccaa
601 atcgatccct acctcagaaa attctacctt gccggaaggc cagagcaggt agaaagaggc
661 gtagaggaat gggaaagaag tagccgaaag ggatcttccg gcgagaaatc aggcaatata
721 ttcagcggat ttgcagacga atttctagag gaagctttcc agatcgacgg tggactggtt
781 aggaagctaa agggagaaga cgacgagaga gacagaatcg tgcaggtcga cgaagatttc
841 gaggtgcttc taccggagaa agatgaagaa gagagatcga gaggaagata catcgaatca
901 gaatcagaat cggagaatgg cttagaagaa accatttgca cactccgatt aaagcaaaac
961 atcggccgat ctgttcgcgc cgacgtgttc aacccacgcg gcggccgaat ctccacggcc
1021 aactaccata ccctccccat tctccgccaa gtccgcctta gcgccgaacg aggagtcctc
1081 tacagcaacg cgatggtggc gccgcactac acagtgaaca gtcactcagt gatgtacgcg
1141 acgagaggca acgcgagagt gcaggtggtg gacaacttcg ggcagtcagt gttcgacggc
1201 gaggtccggg aaggacaggt actgatgatt ccgcagaact tcgtggtgat taaacgagca
1261 agcgacagag gattcgagtg gatcgcattc aagacgaacg acaacgcaat cacgaatctg
1321 ctggcggggc gagtgtcgca gatgaggatg ttgccgctgg gagtgctgtc gaacatgtac
1381 cggatctcga gagaggaggc gcagaggctg aagtacgggc agcaggagat gagggtgctc
1441 agccccggaa ggtcgcaggg aagaagagag tgaaaatgaa gaagtgggta gtgggtaatg
1501 ggtaatggga aatatatata tatggtagta gtaatctaat gtaatttagt gaataaagag
1561 cgagctttca ggtgatgccg ccgacgagcc ctgcttgtta ccggccggaa aaaatggaga
1621 aatctctcag aaagacaccg agttttaata ataaaagtaa taatattcgc ctcttttttc
1681 cttc
//
LOCUS DROKINLA 2175 bp ds-DNA INV 15-AUG-1990
DEFINITION D.melanogaster kinesin-like protein (nod) gene, complete cds.
ACCESSION M36195
KEYWORDS kinesin-like protein; nod gene.
SOURCE D.melanogaster DNA.
ORGANISM Drosophila melanogaster
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE 1 (bases 1 to 2175)
AUTHORS Zhang,P., Brodeur,B.A., Goldstein,L.S.B. and Hawley,R.
TITLE A kinesin-like protein required for the distributive chromosome
segregation in Drosophila
JOURNAL Unpublished (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P.Zhang, 06-JUL-1990.
Author address: P.Zhang
Albert Einstein College of Medicine
Molecular Genetics Dept.
1300 Morris Park Avenue
Bronx, NY 10461
FEATURES from to/span description
pept 72 2072 kinesin-like protein (nod)
mRNA 1 2175 nod mRNA
BASE COUNT 557 a 594 c 561 g 463 t
ORIGIN
1 caaagtaaaa taattacggt gaatgcaagc caattgtgca ttattcaaac aacttcaatt
61 cttcaatctg catggagggc gccaaattaa gcgcagttcg gattgcggtc cgcgaggcgc
121 cgtaccgcca gttcttgggg cgtcgggagc ccagcgtcgt ccagtttccg ccatggagcg
181 acggaaagtc gttaatagtg gatcagaatg aattccactt cgatcacgcc tttcccgcga
241 ccatcagcca ggatgagatg taccaggcgc tgatcttgcc gctggtggac aagctgctcg
301 agggattcca gtgcactgca ctcgcctacg gccagacggg aacgggcaag agctactcaa
361 tgggcatgac acctccggga gagatactgc ccgagcacct gggtattctg cctcgcgccc
421 tgggcgacat ttttgagcgc gtgaccgccc ggcaggagaa caacaaggat gcgattcagg
481 tgtacgcctc cttcatagag atctacaatg agaaaccctt cgatctgctg ggctccacgc
541 cacatatgcc catggtggcg gcgcgttgcc agcgatgcac ctgccttcct ttgcacagcc
601 aggcggatct gcatcacatc ttggagctag gcactcgcaa tcgacgcgtt cgtcccacca
661 atatgaattc caatagttcg cgatcccatg ccatagtcac cattcacgtg aagagtaaaa
721 cccatcactc gcggatgaat attgtggatc tggccggttc agaaggcgtg cggcgaactg
781 ggcacgaggg cgtggccagg caggagggcg tcaacatcaa tctgggcctg ttgagcatca
841 acaaggtggt gatgtccatg gcggcgggcc acacagtgat accataccgc gacagcgtcc
901 ttaccacagt tctgcaggcc tcgctaaccg cgcagtcgta tctgaccttt ctggcctgca
961 tcagtccgca tcaatgcgat ctcagcgaga cgttgtccac cctgcgtttt ggcaccagtg
1021 ccaagaagct tcggctgaat ccgatgcaag tggcgcgcca gaagcaatcg ctggccgcac
1081 ggacaacaca cgtcttccgc caagcgctat gcacctcgac ggccatcaag tcaaacgcag
1141 ccaatcataa tagcatagtg gttccaaaat ccaaatatag cacaaccaag ccgctgagcg
1201 ccgtgctcca tcgaactcgc tccgaacttg gcatgacgcc caaagctaag aaaagggctc
1261 gcgagctatt ggagctggag gagaccacgc tggagctctc gtctatacac attcaggaca
1321 gcagtctgag tctgttgggt ttccatagcg atagcgataa ggataggcat ttaatgcctc
1381 ccccaacagg gcaagagcca aggcaagcca gcagccagaa ctctacgcta atgggcattg
1441 tcgaagagac cgagcccaag gaatcgtcaa aggtgcaaca gtcaatggtt gcccccacgg
1501 tgcccacaac tgtacgctgc cagctgttca acaccaccat cagtcccatc agtctacggg
1561 catccagctc tcagcgagaa cttagcggca tccagccaat ggaggagaca gtagtggctt
1621 cgccacagca gccatgcctt cgtcgttccg tgcgtctagc gagtagcatg cgttcgcaga
1681 actatggagc cattcccaag gttatgaatt tgcggcgcag cacgcggctg gcgggaatcc
1741 gggaacatgc cacctccgtt gttgtgaaaa acgagacgga tgcgataccg caccttcgaa
1801 gtacagtgca aaaaaaacgt acgcgaaacg tgaaacctgc gcccaaggcc tggatggcca
1861 ataatacaaa atgttttctg gacctgctta acaatggaaa cgttaagcaa ttgcaggaga
1921 ttccagggat cggtccaaag tccgccttta gtttggcctt gcacagatcc cgcctgggtt
1981 gcttcgagaa tctttttcaa gtcaaatccc tgcccatttg gtcgggaaat aaatgggaac
2041 gattttgtca aattaactgt ctcgacactt gatacaatta ctaattaaat agcattttaa
2101 ttcgaatata gtatagtgat tgttatttat gtggcatata ctttgatttt acaactatag
2161 taggagtaaa aaaag
//
LOCUS HAMCADCA 3902 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Hamster carbamoyl-phosphate synthetase mRNA, partial cds.
ACCESSION J05503
KEYWORDS carbamoyl-phosphate synthetase.
SOURCE Hamster cell line 165-28, cDNA to mRNA, clone pCAD142.
ORGANISM Mesocricetus auratus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
Cricetini.
REFERENCE 1 (bases 1 to 3902)
AUTHORS Simmer,J.P., Kelly,R.E., Rinker,A.G.Jr., Scully,J.L. and Evans,D.R.
TITLE Mammalian carbamyl phosphate synthetase (CPS): cDNA sequence and
evolution of the CPS domain of the Syrian hamster multifunctional
protein CAD
JOURNAL J. Biol. Chem. 265, 10395-10402 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 3902 carbamoyl-phosphate synthetase (E.C.6.3.5.5;
AA at 3)
BASE COUNT 829 a 1056 c 1135 g 882 t
ORIGIN
1 tcaggcccct ggcaccagag gtttctatta agaccccacg ggtattcaat gcagggggtg
61 cccctcggat ctgtgccttg gactgcggcc tcaagtataa tcagatcaga tgtctctgcc
121 agcttggggc tgaggttact gtggtgccct ggaaccacga attagacagt cagaagtatg
181 atggcctttt tctgagtaat ggacctggcg atcctgcctc ttatcctggt gtggtagcca
241 cactgaaccg cgtcttgtct gagcccaatc cccgacctgt gtttggaatc tgccttggac
301 accagctgtt ggctttagcc attggggcca aaacttacaa aatgaggtat ggaaaccgag
361 gccacaacca gccctgttta ctggtgggca ccgggcgctg ctttctgacg tctcagaatc
421 acgggtttgc cgtggatgca gactcgctgc cagcaggctg gactccgctc ttcaccaatg
481 ccaacgactg ttccaacgaa ggcattgtac atgacagcct gccctttttc agtgtccagt
541 ttcacccaga gcaccgagct ggcccttcag atatggaact gctttttgat gtatttctgg
601 agactgtgag agaggctgta gctgggaacc ccgggggcca gacagttaaa gagcggttgg
661 tgcagcgcct ctgtccccct gggcttctca ttcctggttc tgggcttcca ccaccacgga
721 aggttctgat cctaggctct gggggcctct ccattggcca ggctggagaa tttgactact
781 caggctctca ggccattaaa gccctgaagg aggagaacat ccagacgctg ctgatcaacc
841 ccaacattgc tacagtgcag acctcgcagg ggctggcaga caaggtctac ttccttccca
901 ttacacctca ctacgtaacc caggtgattc ggaatgaacg cccagatggt gtgttactga
961 cttttggggg ccaaacagcc cttaactgcg gtgtagaact gaccaaagcc ggagtgctag
1021 ctcggtatgg ggttcgggtc ttgggtacac ctgtggagac cattgaactg actgaggacc
1081 gacgagcctt cgcggccagg atggctgaga tcggagagca tgtagccccc agcgaagcgg
1141 caaattctct tgaacaggct caggcagctg ctgagcgact gggctaccct gtgctggtgc
1201 gtgcagcctt tgccctgggt ggtcttggtt ctggctttgc ttccaccaaa gaggaactct
1261 cagctcttgt ggctccagct ttcgcccata ccagccaggt gctgatagac aagtctctga
1321 agggctggaa ggagattgaa tatgaggtgg tgagagacgc ctatggcaac tgtgtgacgg
1381 tatgtaacat ggagaactta gacccactgg gcatccacac tggtgagtcc atagtggtgg
1441 cgcccagcca gacgctgaat gacagagagt accaacttct gcgacggaca gctatcaaag
1501 tcacccagca cctggggatc gtcggggagt gcaacgtgca gtatgccttg aacccggagt
1561 ctgagcagta ttacatcatt gaagtaaatg ccaggctgtc tcgaagctct gccctggcca
1621 gtaaggccac aggctatcct ctagcctatg tggcagccaa gctggcgttg ggcattcccc
1681 tgccggagct caggaactct gtcactgggg gaacagcagc ctttgagcct agcctggact
1741 actgtgtggt aaagattcct cgatgggacc tcagcaagtt cttgcgtgtc agtacgaaga
1801 ttgggagctg tatgaagagt gttggtgaag tcatgggcat tggacgctca tttgaagagg
1861 ccttccaaaa ggccctgcgc atggtggatg agaactgtgt gggcttcgac catacagtga
1921 agccagtcag tgatgtggag ttggagacac caacagataa gcggatcttt gtggtggctg
1981 ctgctctgtg ggctggctac tcggtggagc gcctgtatga gctcacacgc atcgactgct
2041 ggttcctgca tcgaatgaag cgtatcgtga cccacgccca gttgctggaa caacaccgag
2101 gacagccgtt gtctcaagac ctgctgcacc aggccaagtg cctcggcttc tcagacaaac
2161 aaattgccct tgcagtcctg agcacagagc tggcggttcg aaagctacgt caggaactgg
2221 gaatctgccc tgcagtgaaa cagattgaca cagttgcggc tgagtggcca gcacagacca
2281 attacctgta cctgacatac tggggcaaca cccatgacct cgactttcga actcctcacg
2341 tcctggtcct tggctctggt gtctaccgca tcggctccag tgttgagttt gactggtgtg
2401 ccgtcggctg catccagcag ctccggaaga tgggttataa gaccatcatg gtgaactaca
2461 acccagagac agtcagcaca gactatgaca tgtgcgaccg actctacttt gatgagatct
2521 cctttgaggt ggtgatggac atctatgagc tggagaaccc cgacggcgtg atcctgtcca
2581 tgggtggaca gctgcccaac aacatggcca tggctctgca tcggcagcag tgccgagtgc
2641 tgggcacctc cccggaagcg atcgattcag ctgagaaccg gttcaagttc tcccggcttc
2701 tagataccat cggcatcagc cagcctcagt ggcgtgaact cagtgacctc gagtctgctc
2761 gccagttctg ccagactgtg gggtacccct gtgtggtgcg cccctcctat gtgctcagcg
2821 gtgccgctat gaatgtggcc tacactgatg gggacctgga gcgcttcctg agcagtgcgg
2881 ccgctgtctc caaggagcac cccgtggtca tctccaaatt catccaggaa gcaaaggaga
2941 ttgatgtgga cgctgtggcc tgcgatggcg tcgtgtcagc cattgccatc tccgagcacg
3001 tggagaatgc aggtgtgcat tcaggggatg ctacgctggt caccccccca caagacatca
3061 cccccaaaac tctggagcgg atcaaagcca ttgtgcatgc cgtggggcag gaactacagg
3121 tcacagggcc cttcaatctg cagctcattg ccaaggatga ccagctgaaa gttattgagt
3181 gcaatgtgcg tgtctctcgc tccttcccct tcgtgtctaa gacgctgggt gttgacctag
3241 tggccttggc cacgaggatc atcatgggag agaaggtaga acccatcgga ctcatgacgg
3301 gctctggagt cgtgggagta aaggtgcctc agttctcctt ctcgcgcttg gcgggtgctg
3361 atgtggtgct gggcgtggag atgaccagta ctggagaagt agctggcttt ggagagagcc
3421 gttgtgaggc ctacctcaaa gccatgctta gcactggctt taagatcccc aagaagaaca
3481 tcctgctgac catcggcagc tacaagaaca aaagtgagct gctcccgact gtgcggttgc
3541 tggagagcct gggctatagc ctctacgcca gcctgggtac ggcggacttc tacactgagc
3601 acggggtcaa ggtgacagct gtggactggc actttgaaga ggctgtggat ggcgagtgcc
3661 cgccacagcg gagcatcttg gatcagctgg ctgagaatca ctttgagtta gtgattaacc
3721 tgtcaatgcg tggggccggg ggtcgacggc tttcctcctt cgtcaccaag ggctaccgca
3781 cgcggcgcct ggctgctgac ttctctgtgc ctctcatcat cgacatcaag tgcaccaaac
3841 tcttcgtgga ggccctgggt cagattggcc ccgccccgcc tttgaaggtt catgtagact
3901 gc
//
LOCUS LEIKPDNP 376 bp ds-DNA ORG 15-AUG-1990
DEFINITION L.aethiopica kinetoplast DNA.
ACCESSION M36194
KEYWORDS .
SOURCE Kinetoplast L.aethiopica (strain 1467/85) promastigote, clone R3,.
ORGANISM Kinetoplast Leishmania aethiopica
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE 1 (bases 1 to 376)
AUTHORS Laskay,T., Kiessing,R., Rinke de Wit,T.F. and Wirth,D.F.
TITLE Generation of species-specific DNA probes for Leishmania aethiopica
JOURNAL Unpublished (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by T.F.Rinke de Wit, 06-JUL-1990.
Author address: T.F.Rinke de Wit
Leiden University Hospital
Rijnsbugerweg 10
2300 RC Leiden
THE NETHERLANDS
email:WBLGIPHAR@HLERUL52.BITNET
BASE COUNT 113 a 99 c 70 g 94 t
ORIGIN
1 ctctaatagc ccaggaccta tcgtcgccac tctccgaact atagaaagac ccgcgctgta
61 ggcacaatag gaccaactgt actacctgca gtggctagac cactactggc aaatcaatag
121 aactattacc tttaactata agtgatttaa ctttaaccta taatagaaca ttattcgtcg
181 ctcattcccg ggccccacgt agcctttccc atgaagttcg tataccgact ctacggttca
241 agtttatata ccggttcact ccgttgcacc atggtgacct tacgtcacta gatacaattg
301 atattaataa ttaaatacag ccaagatagg cggcatgtgc cacagagtag cggcaggaag
361 ccagccaatg agcata
//
LOCUS LMIB19KP 938 bp ss-mRNA INV 15-AUG-1990
DEFINITION L.migratoria basic 19kD hemolymph protein mRNA, complete cds.
ACCESSION M36206
KEYWORDS basic 19k protein.
SOURCE L.migratoria adult female fat body, cDNA to mRNA, clone
lambda-LmF2.
ORGANISM Locusta migratoria
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Orthoptera; Caelifera;
Acrididea; Acridoidea; Acrididae.
REFERENCE 1 (bases 1 to 938)
AUTHORS Kanost,M.R., Bradfield,J.Y., Cook,K.E., Locke,J., Wells,M.A. and
Wyatt,G.R.
TITLE Gene structure, cDNA sequence, and developmental regulation of a
low molecular weight hemolymph protein from Locusta migratoria
JOURNAL Arch. Insect Biochem. Physiol. 8, 203-217 (1988)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by M.R.Kanost, 06-JUL-1990.
FEATURES from to/span description
pept 54 572 basic 19k protein precursor
sigp 54 95 basic 19k protein signal peptide
matp 108 569 basic 19k protein
BASE COUNT 231 a 267 c 222 g 218 t
ORIGIN
1 agctctgctg tctcctgtcc actccacacc acaggctcag taccaggatc aggatgaagc
61 tggtggtggc tgcagttctc gcgatggccg cgtcgcggtg gcggcgcctg tcggcccacg
121 gccaggtgcc gtccagcacg tgcgccgaca tgctgcccgt gcacggcaac gcaatgccca
181 gcacagccct gccctacacc atcaccgtgt cgcccacctc cgtcaacggc ggcgacaccg
241 tcagagtgca catctcgggc acggaggagt tccgcggcgt ctacctgcag cgaggagggg
301 ccaagagcag taggagagtt cctgctgccc gccggagaga acaacaagat cgccctgtcc
361 gactgcccgc cggacacaac aacgccttct catacatttc gcgcacaccc ctggacacac
421 tggacatcga ctggaaggca ccatacacca gcgatgaaat cgttttcagg gctactttcg
481 tcaagagctt ctccgagttc tgggtcggcg ttgagtcacc gaagatcaca ttgggaccgc
541 tacgtcaact tgacaacgca gttgctgctt agtgactgaa gtcgccatat tcatatacga
601 gcacatccag tactgatgtc ctagtttatc acaacatcgc cgcaccacca ctttcacgtt
661 ctctactact aaaatggtag ataaatcgct tattacagct gttagctgca tataagagaa
721 gcgtttcaaa acgagaaact ctttttgatt ttgtactgag ggaattcaag taaagatttg
781 acaggcagac gtcaccatct tgttcaagac ttggcatcca gtttgcctgt ctgctgtgtg
841 tttgtagatg ctcacacttc ttgtgatatt tactaccaca aattttgtac tcaagacttg
901 aagaattgaa atatattctc taattaatat aaaaaaaa
//
LOCUS MUSALDAA 8190 bp ds-DNA ROD 15-AUG-1990
DEFINITION Mouse aldolase A gene, complete cds.
ACCESSION J05517
KEYWORDS aldolase A.
SOURCE Mouse (strain RIII S/J and Blue Spruce (outbred Swiss Webster))
adult DNA, clone lambda 16.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 8190)
AUTHORS Stauffer,J.K., Colbert,M.C. and Ciejek-Baez,E.
TITLE Nonconservative utilization of aldolase A alternative promoters
JOURNAL J. Biol. Chem. 265, 11773-11782 (1990)
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.K.Stauffer, 15-JUN-1990.
FEATURES from to/span description
pept 4301 4412 aldolase A, exon 4 (E.C. 4.1.2.13) (first
expressed exon)
4493 4704 aldolase A, exon 5
5125 5179 aldolase A, exon 6
5278 5438 aldolase A, exon 7
5727 5810 aldolase A, exon 8
5908 6082 aldolase A, exon 9
6222 6421 aldolase A, exon 10
6519 6614 aldolase A, exon 11
pre-msg 1700 > 6614 aldolase A mRNA and introns (alt.)
pre-msg 1740 > 6614 aldolase A mRNA and introns (alt.)
IVS 1804 1951 aldolase A intron A (put.)
IVS 2023 4278 aldolase A intron B (put.)
IVS 2205 4278 aldolase A intron C (alt.)
IVS 3256 4278 aldolase A intron C (alt.)
IVS 4413 4492 aldolase A intron D
IVS 4705 5124 aldolase A intron E
IVS 5180 5277 aldolase A intron F
IVS 5439 5726 aldolase A intron G
IVS 5811 5907 aldolase A intron H
IVS 6083 6221 aldolase A intron I (no splice consensus)
IVS 6422 6518 aldolase A intron J
signal 3032 3036 CAAT box
signal 1673 1676 TATA box
signal 2235 2240 TATA box
signal 3089 3094 TATA box
signal 3132 3137 TATA box
BASE COUNT 1676 a 1884 c 2061 g 1876 t 693 others
ORIGIN
1 gatccttgct ttttgaagcc ttagaatgaa gccagcattc ctggccttgg gagggcaggc
61 acgggagact ccaaggcctg gggaaagcaa ctctagtcca aaccagtttc tcttgctggt
121 tgtagtcttt tgggcaaacc actgagtttc tatctcatta ttttgtgatg agccccccac
181 gagtgtgacc cccattcaag gtggctcaga agcagagtgc ttgccttgtg tttgtgacat
241 cccaagttca attcatcact gaggaaaccc ctccctttaa gatttatctt atctctgaac
301 gttttcccga ttgtatgact cgtatgtatc tgaggaagtc agaagaaatg tcagatcccc
361 caggatcttg ggatctggag tcgtgatggc tgtgagtcac tgtatatatg tgctggagct
421 gaactcaggt cctctggaac agccattgct cttaaccact gagccatggt ccggacacct
481 ggcttagaca gggtcccttt ctgtcagtgg ttctcaacct gtgggttatg gccctttgtg
541 ggggtggagg tgggtattaa cttatacagg gctgacctaa ggttataaaa acccagatat
601 ttatgattca taacagcaaa attacaggtg taaagtagca acaaaaattc ttttttggtt
661 gggagtacca caacatgggg aactgtatta aaaggtagca ttaggaaggt tgggcaccac
721 tgctctcgta gccctggcta tcctagaact caaatagtag atcaggctgg tccaaactga
781 cagagatcta tctctgccag cgtcagcact aggaagtgag taaattccat gatagccagg
841 ccatacagtg aaaccctgtc tcaaaacagg acaagaggaa ccccagtact tagtaggttg
901 aagtaaggat tgtcattttt tttgaggcca gcttgggttt catggctctt gactagtctg
961 agctgtagag ggagagcctg tctcacgagg aagcttagga gggagatatt atagtttggt
1021 ttatgccagc aagaaagtcc aaagtcccag aaattatctt catgaggatt gaaacatgtt
1081 ttctggtcct gacttcctct aggttgcata gggctttgag agtatagtat acctactatg
1141 tgcgcataca cacacgcgcg cgcgcgcgca cgcgacacac acaggaccca gtgggacaga
1201 tactttatca ctgctgctgt tcagcatgga gggagcttct ttccagtgct ttgtctctcc
1261 gtccactggg cctggtgggt gggtgctcct cagccctctg cttacccacc tctctcttct
1321 cctttagggt tgggcccctc gatgccctgg cctgctgccc actgtgtgac tgtgcctgtg
1381 cctgccagct cccagactgc cagagcctca actgcctctg tttcgagatc aagctcagat
1441 gaaagatggg gctggggacg ttgttctttg gggagtggcc agtccccagg gccccctcta
1501 tgatcctcag gacatcatta tactggagct atggatggca ggcccagcct aattacctgg
1561 gttccttgag ttctctgaaa ggcaggattc tgagagccct tggaccgctg aaaagggcct
1621 gatgctctgg ccagtgcccc tgcctttctt cctctccctt ccctgataaa ctattgtatg
1681 tgaggtagga tcgagacatt gctcacccag gcaacagtgt gggaggtttc tgccaacctg
1741 gactatcagg ataaagggat ggccagccac accctgcctt tagactcctg gttattttaa
1801 gaggtgagta tcctgcctga ctctgctctc ctttggaaaa aaaaaaaaag ttcaaccacc
1861 agcaggcacc agagtcaagg gaggagggaa ccagaggagg gcagtgggag gcaatatcta
1921 gatgttttcc cttcttgttc tgccttaaca gatcctggac ctgagactga tttcttgact
1981 aatttcactg tatttccaag gaagaggttc ctctaaagac cggtgagtga gcagtggcac
2041 ctcctcctct caaggcaaac caaagctgcc tcttcttcac cccccacgca gggatgaatg
2101 tcaggagcct caggtttccc taaatatagg tcccggccgc gggattcgtg gtggggaaag
2161 ggcaggggtt accgagaagg tctgggacac tggtgcgggg gtgtgtaggg gaggggtggg
2221 gagtaggagc tgccttaaaa cccagccctg gactgccggg ctcactctct gctgaccggg
2281 ctctgcggct tctgtcactg cgccacaggt gggccgctat ccggattgca ggatgggaat
2341 gggggttgcg gattgggacc tgaggaaact gactgctctg agagttacag ggtgacaaga
2401 gagctccgag acggattttt ttattttgga gaaggaaatc aggttcggga aagacctgtc
2461 tggcttgggc cagtccttgt cggtcatttc ctcaaactgg gtgtgtttag ctcgcgggtg
2521 gtgcctcccg ccaatctgct aggcaacgcc aggcctggat acgccactca gttccgatgt
2581 ggccggcaca ctagttctgg gaggttttgc ctgcgtacca tgtcactcgc cgtgctctgg
2641 ccagggagag atggaatgng ccctgcattt tagtcaagcg acgaagcagg caggcaggga
2701 ggctccgaag ctctgcgttc ttagcagtga cgtcaggctg caactacaca gccggaagcc
2761 tgggtcttgg aggagaggcc agccaccatc tcactctgac cccctcccta ctcttcgcca
2821 acccacattc cggctgagtc acatgttccg cgcgcgccag gcaggggttg gggggggggg
2881 tgttgggggg ggggggtggt gacctgcggg atgtggctcg agtcacgtcc tagcggggcg
2941 gaggagggat cgtgttctag ccgcttgtct cctccccagt gccgcctcct atcggagcat
3001 cttggggcgg tctgcgcaca gtgcccacct tcaattgacg gttcccgtcc ctgcaaggga
3061 aaaaacctgc agagggcgga gcggcgcctt taaatgtccg gggccccgcc tccggtcccc
3121 cccaacccag ctgaataggc tgggttctct tggaacgcgc agcagaacca ggttctggtg
3181 accctagccg ttcgctcctt agtcctttcg cctacccacc ggcgtaccag gcagacccac
3241 cccgtcctgt gccaggtgag cgccatttac acgtgctcgg ggaagggtct atggggttag
3301 gatcttgggc cggtggcggg cagtgcagag ccgtcttccc cacggcccct cacttctcct
3361 ttttctaccc ccacgcttgc ccccagcccn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
3421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
3481 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
3541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
3601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
3661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
3721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
3781 nnnnnnnnnc tggttctctc ttaactctcg cctttgggtt gctatgtggc tttgagcaca
3841 gatcatttct ttcttgggct ctttcagatg agggtattag gctcctgccc tattcgtgat
3901 ccttaaattc taaaatatcc cggttcaatt ttgtttctag gcaaggtgac ccatggcaac
3961 gcgcaggcca gatgggtcag cttcaacatg accgctgtcc tggctctggc ttcttcttcc
4021 ccagttggcc agtgagcgaa cccactctga gctgggcaac acccagcaac agacagagtt
4081 aggaaaggta caggaagagg caggtctagt atagggaagt cgggagtagg ggagagctct
4141 gggacaggaa gtatcccagg accctcaggg agtggggcag gggaggtggg ggctagtgcc
4201 ctggcctcca ggaagctttg taccggggag accatgggat ggtccaacta agcgctggtc
4261 tctgcctccc tcacccagga aagcaactgc caccggcacc atgccccacc catacccagc
4321 actgaccccg gagcagaaga aggagctgtc tgacatcgct caccgcattg tggctccggg
4381 caagggcatc ctggctgcag atgagtccac cggtgcggta caggagaaga aagggaggag
4441 gacccaggtt ggagctagca ggctgatccc ttatctccat catgactttt aggaagcatt
4501 gccaagcgcc tgcagtccat tggcaccgag aacaccgagg agaacaggcg cttctaccgc
4561 cagctgctgc tgactgcaga cgaccgtgtg aatccctgca ttgggggggt gatcctcttc
4621 cacgagacac tgtaccagaa ggcagatgat ggacgtccct tcccccaagt tatcaagtcc
4681 aagggtggtg ttgtgggcat taaggtaaga gggcagactc tggggggggg gtaagattag
4741 aggaggatct cggagaaagg gattaatagg tagggagggg gtaatatggc tagcaggcct
4801 agagactcag gtggatgtat cagcataatt ttttttcagt gtttggggtg aacttaggtc
4861 cttgtgcatg tcggcaagcg cgctgttgcc aacttaatgg ttccctgtga tacaagaagg
4921 tgatttcatg gtgaagaagt gaaaaggttt tctcagtgtg cagtagcacc aggtccctct
4981 agtccagtta acattctctc aaatatacac atcttttctc ataaatatgt gcaagccatg
5041 agaggctaca gtgaaaggtg aagtttgggc ctgggtagag gagacagggg ccataaagct
5101 gactgctggt ctcctccctg gcaggtagat aagggtgtgg tgcccctggc aggaaccaat
5161 ggcgagacaa ctacccaggg taagaatgat ctgcctgcct ccttcccttc tccaccagct
5221 catcagagtt ccagagtgag tctgatcaaa agccttctct ttattcttcc ccttcagggc
5281 tggatgggct gtctgaacgc tgtgcccagt ataagaagga tggagccgac tttgccaagt
5341 ggcgctgtgt gctaaagatt ggggaacata ctccctcggc cctggccatc atggaaaatg
5401 ccaatgttct ggcccgttat gccagcatct gccagcaggt gggattggac tacttcctaa
5461 cacattgatg cagcgcgggc tagctttctg tctatctgcc aggatatctg cctcctcaga
5521 gcagctgctc tcaatacccg ctgtggccag gtcttgagtg gaggtctgca atgtagaggt
5581 ggcaacaggt gtacaggcag attgatagga ttgcttgtcc cctgtaaact gctgaggcct
5641 ttgaagcctg ggtctctgtc atcaagttaa tggtgaggag gctcctagtc aggaggcctt
5701 gcctcattac cctgtccctc ccacagaatg gcattgtacc cattgtggag cctgaaattc
5761 tccctgatgg ggaccatgac ttgaagcgct gccagtatgt tactgagaag gtagtgccat
5821 ctgctgtaga tagtgtgtgc tgcgcgtagt atcgtttcac ttctcgtctg cnnnnnnnnn
5881 nnnnnnnnnn nccctgctgt cttccaggtc ctggcggctg tctacaaggc tctgagcgac
5941 caccatgtct atctggaagg cacattgctg aagcccaaca tggtcacccc tggccatgct
6001 tgcacccaga aattttccaa tgaggagatt gccatggcaa cggtcacagc acttcgtcgc
6061 acagtgcccc ctgctgtcac tggtgaggcc actcctcatc ttggtggtga ggtggatgca
6121 ccatcacatt tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
6181 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
6241 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
6301 nnngccatgg gccttgactt tctcctatgg tcgagccctg caggcctctg ctctaaaggc
6361 ctggggtggg aagaaggaga acctgaaggc agcccaggag gagtacatca agcgcgccct
6421 ggtaaggcag gcaggcaggc gtggaagtgt gaacaggtgc ctgggcgggg tggggaggga
6481 ctcaagaaga gaattcctct gattcctctt ccttttaggc caacagcctc gcttgtcaag
6541 gaaagtatac cccaagtggc cagtctggag ccgcagccag tgaatctctc ttcatctcta
6601 accatgccta ctaaccagag ctgaactaag gctgctccat caacactcca ggcccctgcc
6661 tacccacttg ctattgaaga ggggtcttca ggctctttcc catcactctt gctgccctcg
6721 tgtgcggtgt tgtctgtgaa tgctaaatct gccatccctt ccagcccact gccaataaac
6781 aactatttaa gggggagtct gttgttcatg tcttgtaggg tataggggag ggctgaggaa
6841 agagctactt gggttcttct tcttggacag taaaaggaag gggttttttg accagagctt
6901 tgagaaaggc atagtattat gggatgttct ttgcctacat ctaattgaag gtaactttta
6961 cactaattaa tattcagttt aagccaacca agggcttatg aatacttggc aaggattgta
7021 tcagggctaa cacatttatg cgttttgggg actatggagc tttggagacg agatctctct
7081 gcagtgacat aggtatacag ctcactgcag aactcttggg ttccaggttg agaatggagc
7141 ctcagagctg ctgatgttcc ctggtgatag aataagaagc acatcaaacc atgggccact
7201 gtatcttgcc acattatatt gagtgtagtc ggtgtgctag tgcacacttt aatccagcac
7261 tcaggaggca gaggcaggca ggaggcaact ggaactcaca aagtgagttc caggacagcc
7321 agggctatac agagaaaccc tgtcttgaaa aaaaaaaatt ctggcctaaa tgaatggata
7381 cagtgtatct gcctttggag gccaaaaggc gtgtatcaag tgctagcttc tggcaagata
7441 agaaacctta aggagtaggg cttcgactat actcagtagc agagtcttgc atggtactca
7501 tggttgtgag cacatgtggt gctaactgct gagtctctct cagtccatca tactctagta
7561 tatagtcaga gactctagat actgacgact agactagact cgtcgtctnn nnnnnnnnnn
7621 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
7681 nnnnnnnnnn nnnnnnnnnn nnnnnnnntt ccttcccaag catctttttc tttgacactt
7741 tcgttttcag tgatctgcgt agaattgtct tactaggagt atcaaagcat agtctccact
7801 gtcctaatat tcccatgtat tggccaatag tcaaagctat gcgcaggctg tggatagagc
7861 ccagtggctg agtacccaaa gctctggttc cttccccagt gctgcaaggg aaaactcaaa
7921 tccctatgct tccccaaact tcagcctccc attttactgc tcatcacgta cttgtagcct
7981 tgctctctag aattctgtag cccacactgg ccttgaactc tcaagatctg ctttccaagt
8041 actgggatga aaggcatgtg ctattctcct agcttctatg aggcgatcct ttttatttta
8101 tatacattgg tattaactga atgtgtgtat gtgtgtagtg tgatccggta cgagctcgag
8161 cgtatagtga gtcgatacat catgcgcgct
//
LOCUS MUSCR2AA 2102 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Mouse complement receptor (Cr2) gene, 5' end.
ACCESSION M36470
KEYWORDS Cr2 gene; complement receptor.
SOURCE Mouse (strain Balb/c) spleen, cDNA to mRNA, clone 31-1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2102)
AUTHORS Kurtz,C.B., O'Toole,E., Christensen,S.M. and Weis,J.H.
TITLE The murine complement receptor gene family. IV. Alternative
splicing of Cr2 gene transcripts predicts two distinct gene
products that share homologous domains with both human Cr2 and Cr1
JOURNAL J. Immunol. 144, 3581-3591 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 67 > 2102 complement receptor (Cr2)
BASE COUNT 590 a 472 c 452 g 588 t
ORIGIN
1 ctcttcctct ccttgctaca ggctcacaac tcacagagcg caacctgcca ttggactgct
61 gcacacatgg gatccttggg ttcgctctgg gttttcttca ctctcatcac tccaggagtt
121 cttggtcagt gtaagttgct gccaaagtat tcttttgcta aaccttctat tgtgagtgat
181 aaatctgagt ttgccattgg aacaacttgg gaatacaaat gtcgccctgg gtattttagg
241 aagtcattta ttatcacctg cttagaaacc tccaagtggt cagatgctca gcagttctgt
301 aaacgtaaac catgtatgaa tcctcaagaa cccctccatg gttctgtgca tataaacacg
361 ggtatcgagt ttgggtcaac aattacgtat tcttgtaatc aaggatatcg actcattggt
421 gactcgtctg ctacatgtat tgtatcagac aatactgtaa tgtgggataa tgatatgcct
481 ctttgtgaat ctattccttg tgagtcacct ccagccatct ccaatggaga cttctacagc
541 agcagcagag acagcttttt ctatgggatg gtagtaactt attattgcca taccggaaag
601 aatagggaaa aactgtttga tctggtgggt gagaagtcaa tatattgtac cagcaaagac
661 aatcaagttg gcatctggaa tagtccacct cctcagtgta ttcctagagt caagtgccca
721 atgccagaaa ttgaaaatgg actagtggag tctggattta aacactcctt cttcttaaat
781 gatacagtaa tatttaagtg caaatctggc tttaccatga aaggcagcag aatagcatgg
841 tgccagccaa acagcaaatg gagccctcca ttgccaacat gcttcatggg atgtctacca
901 cctcaaaata tcctccatgg tgattataac aaaaaggatg agttcttctc tgttggccag
961 aaagtgtcat atacgtgtaa ccctggctat actctcattg gaactaacct cgtggagtgt
1021 acatccttgg gaacctggag caatacagtc ccgacatgtg aagtgaaatc atgtgatgca
1081 attccaaacc atcttctcca tggccgtgtg tttcttcccc ctaatctcca gcttggggca
1141 gaggtttcct ttgtttgtga cttagggttc cagttaaaag gcaaaccttc tagtcagtgt
1201 atcccagaag gagagacagt aatctggaat aataagtttc ctgtctgtga acagatttct
1261 tgtgaccctc ctcctgaagt caaaaatgct cggaaaccct attattctct tcccatagtt
1321 cctggaactg ttctgaggta cacttgttca cctagctacc gcctcattgg agaaaaggct
1381 atcttttgta taagtgaaaa tcaagtgcat gccacctggg ataaagctcc tcctatatgt
1441 gaatctgtga ataaaaccat ttcttgctca gatcccatag taccaggggg attcatgaat
1501 aaaggatcta aggcaccatt cagacatggt gattctgtga catttacctg taaagccaac
1561 ttcaccatga aaggaagcaa aactgtctgg tgccaggcaa atgaaatgtg gggaccaaca
1621 gctctgccag tctgtgagag tgatttccct ctggagtgcc catcacttcc aacgattcat
1681 aatggacacc acacaggaca gcatgttgac cagtttgttg cggggttgtc tgtgacatac
1741 agttgtgaac ctggctattt gctcactgga aaaaagacaa ttaagtgctt atcttcagga
1801 gactgggatg gtgtcatccc gacatgcaaa gaggcccagt gtgaacatcc aggaaagttt
1861 cccaatgggc aggtaaagga acctctgagc cttcaggttg gcacaactgt gtacttctcc
1921 tgtaatgaag ggtaccaatt acaaggacaa ccctctagtc agtgtgtaat tgttgaacag
1981 aaagccatct ggactaagaa gccagtatgt aaagaaattc tctgcccacc acctccacct
2041 gttcgtaatg gaagtcatac aggcagcttt tcagaaaatg taccatatgg aagcacagtt
2101 ac
//
LOCUS NEUALCA 1639 bp ds-DNA PLN 15-AUG-1990
DEFINITION N.crassa allantoicase (alc) gene, complete cds.
ACCESSION J02927
KEYWORDS allantoicase.
SOURCE N.crassa (strain Oak Ridge), clone pALC-1.
ORGANISM Neurospora crassa
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE 1 (bases 1 to 1639)
AUTHORS Lee,H., Fu,Y.-H. and Marzluf,G.A.
TITLE Nucleotide sequence and DNA recognition elements of alc, the
structural gene which encodes allantoicase, a pirine catabolic
enzyme of Neurospora crassa
JOURNAL Biochemistry (1990) In press
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.A.Marzluf, 12-JUL-1990.
FEATURES from to/span description
pept 250 340 allantoicase (alc), exon 1
413 1386 allantoicase (alc), exon 2
IVS 341 412 alc intron A
signal 129 135 TATA box
BASE COUNT 383 a 441 c 443 g 372 t
ORIGIN
1 cgttgcagat cgaatacgac ggttaggtac gacgaagaag gaccacgatt gtcgttgctg
61 ttacgtactt tgacctcctc aacgcactat cttgcttaag ctatcgctct tgtctgtcgc
121 tgtggtgata taaattctgc gcctgctctt ggtttattcc gaggacgctc gttccatctc
181 tgtttttttt ttctctctgt gacatcgagg actgaagtct cacttattca aatacacatt
241 tccctcacca tgaccgacat cgattacaag ctcgaggctg ttccggccac tcggattgcc
301 gccgatgata tcgacaagac tttccgttcc agcaccatcg gtccgtagca tccatctcac
361 caaacatggc aacccaaacc tttcaactaa cggaagtcga gctgggatac agatcttatc
421 tcaggggctc tcggtggcaa ggtttccggt ttctcggacg aatggttcgc cgaagcagcc
481 aacctcctca ctcctacagc cccaatccgc cagccgggaa agatggttta caccggcgcc
541 tggtatgacg gatgggagac aaggagacac aaccctgccg agttcgactg ggttgtgatc
601 cgtctgggcg tcgcctcggg taccgtcgag ggtgtcgaga ttgacacggc tttcttcaac
661 ggcaaccatg cgcccgccat ctcggtcgag ggttgcttca gccaaaacga cgatgaggtt
721 ctgtcatgga agggcgagct gggtggatgg gagactattc ttggcgttca agagtgcggc
781 ccttcgcaga gattctgctg gaaactcgag aaccctacca agaagcagta cacccatgtg
841 cgactaaaca tgtaccccga cggcggcatt gccaggttcc gtctgtttgg acacgccgta
901 ccggtcttcc ccgacaatac ggatgccatc tttgacttgg cggctgccca gaacggcgga
961 gttgcgatct cctgcagtga ccagcacttt ggtaccaagg acaaccttat ccttccgggc
1021 cgcggcaagg acatgggcga cggttgggag acagcacgct cgcgcaccaa gggccacgtc
1081 gactggacca tcatcagact cggcgcgccc ggctacattc agaatttcat ggtcgacacg
1141 gctcacttcc gcggtaacta cccccagcag gtcaagctgc aacgtatcga gtggaagagc
1201 gagggcaggc cgggagcgga ttctgagggc tggacagagg ttgttgagcc catcaagtgc
1261 ggtcccgatc aggaacaccc tgtcgagagc ttggtgaagg acaagccgtt cacccacgtc
1321 aagctcatca ttgtgcctga cggcggagtg aaaagactgc gggtgtttgc gaagagggct
1381 gtttaagaaa ttaccaagct atatatctga aggcaattat tcggtgagag cagcatttac
1441 ggggagccat caacagcgag cgatccacat aaaaaggggg aggacctcat ttagtatgat
1501 gggcaacgag tgcagtcatt tagccgcgaa gaatcgaaat ctctcagatc tttgattgtc
1561 tgcgcttaag taacaaagtc taattctcaa tcagctttcg tcgtagagta aaattagaag
1621 gatgcacggc tgcccacga
//
LOCUS RATINHA 1561 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Rat inhibin alpha-subunit mRNA, complete cds.
ACCESSION M36453
KEYWORDS inhibin.
SOURCE Rat female (strain Sprague-Dawley) ovary, cDNA to mRNA, clone
rINA-13.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1561)
AUTHORS Woodruff,T.K., Meunier,H., Jones,P.B.C., Hsueh,A.J.W. and Mayo,K.E.
TITLE Rat inhibin: Molecular cloning of alpha- and beta-subunit
complementary deoxyribonucleic acids and expression in the ovary
JOURNAL Mol. Endocrinol. 1, 561-568 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 256 1356 inhibin alpha-subunit precursor
sigp 256 954 inhibin alpha-subunit signal peptide
matp 955 1353 inhibin alpha-subunit
mRNA < 1 1561 inhibin alpha-subunit mRNA
BASE COUNT 308 a 465 c 440 g 348 t
ORIGIN
1 ggacactaga atgctgtgtt gttagaggag tggagagagg aagatgtgct aagtgtagca
61 gtacacacct ataatcctag cacttgagag gttgaaggca ggaggatgag acattcaggt
121 cattcttagc tacatgaaga gtttaaggcc agcacggatt acaggatatc tgtttctggg
181 gaaaaaggag gggaagagag agaggaaagg gcaaagggca gagtgtgggc tccctgtcgt
241 cagggcaaga gaactatggt gatccagccg tctctgctgc tccttttgct gttgactcta
301 caggatgtgg acagctgcca ggggccagaa cttgtccggg agcttgtcct ggccaaagtg
361 aaggcactat tcctagatgc cttggggccc ccagcaatgg atggggaagg tgggggtcct
421 ggaataaggc ggctgcctcg aagacatgcc cttgggggct tcatgcacag gacctctgaa
481 ccagaggagg aggatgtctc ccaggccatc cttttcccag ccacaggtgc cacctgtgag
541 gatcaggcag ctgctggagg gcttgcccag gagcctgagg aaggtctctt cacttatgta
601 ttccggccat cccaacacat acgcagccac caggtgactt cagcccagct gtggttccac
661 acggggctcg acaggaagag cacagcagcc tccaatagct ctaggcccct gctagatctt
721 ctggtgctgt catctggggg gcccatggct gtgcctgtgt ccttgggaca gagcccccca
781 cgctgggctg tcctgcacct ggcggcctcc gctttccctc tgttgaccca ccccatcctc
841 gtgttgctgc tgcggtgccc actctgttct tgctcaggcc ggcctgagac cactcctttc
901 ctggtggccc acactagggc tcgagccccc agtgcggggg agagggctcg acgttcagct
961 ccctcgatgc cttggccttg gtctcctgca gccttgcgtt tgctgcagag gcctccagag
1021 gaaccctctg cccatgcctt ctgccatcga gctgccctca acatctcctt ccaggagctg
1081 ggctgggacc gctggatcgt acaccctccc agcttcattt tccactactg ccatggtagc
1141 tgcgggatgc ccacatctga tctgcccctg ccagtccctg gggctccccc taccccggct
1201 cagcccctgt ttttggtgcc aggggccaag ccctgctgtg cagctctacc agggagcatg
1261 aggtccctac gcgtccgaac cacctcagat ggaggctact ctttcaagta tgagatggta
1321 ccgaacctca ttacacaaca ctgtgcttgt atctaaaagc acctcgtctc ctcctccaca
1381 gccactggcc accatcacct caccatccca cggtcggtcg gtcggtcggt cgtcagctag
1441 gaggaaggtg ggtgtggaaa gtagacagtt tccacttcct tttcccttca tctttctgtc
1501 tgaggcttcc acaccccact ccacccaggt cctgtggata acaataaaga aggaagtgtg
1561 t
//
LOCUS RATINHB 1543 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Rat inhibin beta-A-subunit mRNA, complete cds.
ACCESSION M37482
KEYWORDS inhibin.
SOURCE Rat female (strain Sprague-Dawley) granulosa cell, cDNA to mRNA,
clone rINB-5.
ORGANISM Rattus rattus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1543)
AUTHORS Woodruff,T.K., Meunier,H., Jones,P.B.C., Hsueh,A.J.W. and Mayo,K.E.
TITLE Rat inhibin: Molecular cloning of alpha- and beta-subunit
complementary deoxyribonucleic acids and expression in the ovary
JOURNAL Mol. Endocrinol. 1, 561-568 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 163 1437 inhibin beta-A-subunit precursor
sigp 163 1086 inhibin beta-A-subunit signal peptide
matp 1087 1434 inhibin beta-A-subunit
mRNA < 1 1543 inhibin beta-A subunit mRNA
BASE COUNT 435 a 356 c 454 g 298 t
ORIGIN
1 ctctgacctc atgagacaag agccggctgg caaaacagaa gggacccgaa agagaatttg
61 ctgaagagga gaaggaaaaa agtccaaaaa acctgtacgt gaggggtggg gaggaaaagc
121 agggccttta aagaaggcaa ccacacgact tttgctgcca ggatgccctt gctttggctg
181 agaggatttc tgttggcaag ttgctggatt atagtgagga gttcccccac cccaggatcc
241 gaggggcacg gcgcagcccc ggactgcccg tcctgtgcgc tggccaccct tccgaaggat
301 ggacctaact ctcagccaga gatggtagag gctgtcaaga agcacatctt aaacatgctg
361 cacttgaaga agagacccga tgtcacccag ccggtaccca aggcggcgct tctcaacgcg
421 atcagaaagc ttcatgtggg taaagtgggg gaaaacgggt atgtggagat agaggacgac
481 attggcagga gggccgaaat gaatgaactc atggagcaga cctcggagat catcaccttt
541 gccgagtcag gcacagccag gaagacactg cattttgaga tttccaagga aggcagtgac
601 ctgtcagtcg tggagcgtgc agaagtctgg ctcttcctga aagtccccaa ggccaacagg
661 accaggacca aagtcaccat ccgtctgttt cagcagcaga agcatccaca gggcagcttg
721 gacatggggg atgaggccga ggaaatgggc ttgaaggggg agaggagtga actgttgcta
781 tcagagaaag tggtagatgc tcggaagagc acttggcaca tcttcccagt gtctagcagc
841 atccagcgcc tgctggacca ggggaagagt tccctggatg tgcggattgc ttgtgaacag
901 tgccaggaga gcggtgccag cctagtgctc ctgggcaaga agaagaagaa agaggtggat
961 ggagacggga agaagaaaga cggaagtgac ggagggctgg aagaggaaaa agaacagtca
1021 cacagacctt tcctcatgct gcaggctagg cagtctgaag accatcctca ccgcaggcgt
1081 aggcggggct tggagtgtga tggcaaggtc aacatttgct gtaagaaaca gttctttgtc
1141 agcttcaagg atattggctg gaatgactgg atcattgctc cctctggcta tcatgccaac
1201 tattgtgagg gtgagtgccc aagccacata gcaggcacct ctgggtcctc actctccttc
1261 cactcaacag tcattaacca ctaccgcatg aggggtcaca gcccctttgc caaccttaag
1321 tcatgctgtg tgcccaccaa gctgagaccc atgtccatgc tgtattatga tgatggtcaa
1381 aacattatca aaaaggacat tcagaacatg attgtggagg agtgtggctg ctcctagagt
1441 tgccaggtcc cagagcaaat ggatctaggg tgtccaggaa aagacagtgg caaatgaaga
1501 aaaatatata agatttctgc ctaaacaaga caaccagaaa aat
//
LOCUS RSBMNP 1201 bp ss-RNA VRL 15-AUG-1990
DEFINITION Bovine syncytial virus major nucleocapsid protein (N) mRNA,
complete cds.
ACCESSION M35076
KEYWORDS major nucleocapsid protein.
SOURCE Bovine syncytial virus (strain A51908) MDBK cell, cDNA to mRNA.
ORGANISM Bovine syncytial virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Spumavirinae.
REFERENCE 1 (bases 1 to 1201)
AUTHORS Samal,S.K., Zamora,M., McPhillips,T.H. and Mohanty,S.B.
TITLE Molecular cloning and sequence analysis of bovine respiratory
syncytial virus mRNA encoding the major nucleocapsid protein
JOURNAL Unpublished (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by S.K.Samal, 12-JUL-1990.
Author address: S.K.Samal
Univ Maryland at College Park
Dept. Veterinary Medicine
College Park, MD 20742
FEATURES from to/span description
pept 16 1191 major nucleocapsid protein
mRNA 1 1200 major nucleocapsid protein mRNA
BASE COUNT 434 a 196 c 270 g 301 t
ORIGIN
1 ggggcaaata caaaaatggc tcttagcaag gtcaaactaa atgacacttt caacaaggat
61 caactgttat caaccagcaa atatactatt caacgtagta caggtgacaa cattgatata
121 cccaattatg atgtacaaaa acatctcaat aagttgtgtg gtatgctact aataacagaa
181 gatgccaatc ataaatttac aggattgata ggtatattat atgctatgtc ccgattgggg
241 agagaagata cccttaaaat actcaaagat gcaggctacc aagtaagggc caatggggtt
301 gatgtgataa cacatcgaca ggatgtgaat ggaaaagaaa tgaaatttga agtgctaaca
361 ttagtcagct taacatcaga agttcaaggc aatatagaaa tagagtcaag gaagtcttac
421 aaaaagatgc taaaagagat gggagaggta gccccagaat acagacatga ctctcctgat
481 tgtggtatga tagtgctatg tgttgctgct ttggttataa caaaattagc agcaggtgat
541 agatcaggcc tcactgcagt cattaggaga gccaacaatg tactaaggaa tgaaatgaaa
601 cgatacaaag gacttatccc gaaagatata gctaacagct tctatgaagt gattgaaaag
661 taccctcatt acatagatgt attcgtacat tttggcattg ctcaatcctc aactagagga
721 ggtagtaggg tagaaggaat ctttgcaggg ttattcatga atgcatatgg agcaggtcaa
781 gtgatgttaa gatggggtgt attagccaaa tcagtcaaga acattatgct tggtcatgcc
841 agcgtgcaag cagaaatgga acaggttgta gaggtctatg aatatgcaca aaagttaggt
901 ggagaagctg gtttttatca catattgaac aaccctaaag catcactgtt atccttaaca
961 caattcccca acttctctag tgtagtccta ggcaatgctg caggactagg tataatgggt
1021 gagtatagag gtacaccaag aaaccaagac ttgtatgatg ctgccaaagc atatgcggaa
1081 caattaaaag agaatggggt catcaattac agtgtattag atctgactac agaggaacta
1141 gaggcaatca agaaccaatt gaatcccaaa gacaatgatg tggaactgtg agttaataaa
1201 a
//
LOCUS URELOCAB 558 bp ds-DNA BCT 15-AUG-1990
DEFINITION U.urealyticum urease locus proteins A and B, complete cds.
ACCESSION M36190
KEYWORDS urease locus-encoded protein.
SOURCE U.urealyticum (serotype 8) DNA.
ORGANISM Ureaplasma urealyticum
Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas;
Mycoplasmatales; Mycoplasmataceae.
REFERENCE 1 (bases 1 to 558)
AUTHORS Willoughby,J.J., Russell,W.C., Thirkell,D. and Burdon,M.G.
TITLE PCR primers that detect Ureaplasma species and a study of the
urease locus by 'PCR walking'
JOURNAL Unpublished (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.J.Willoughby, 27-JUN-1990.
Author address: J.J.Willoughby
University of St. Andrews
Biochemistry and Microbiology
North Street
St. Andrews, Fife KY16 9AL
SCOTLAND
FEATURES from to/span description
pept 23 349 urease locus protein A
pept 436 522 urease locus protein B
BASE COUNT 209 a 78 c 111 g 160 t
ORIGIN
1 tttataagga gataatgatt atatgtcagg atcatcaaat caattcactc caggtaaatt
61 agtaccagga gcaattaact tcgctgaagg cgaaaatgtg atgaacgaag gtagagaagc
121 aaaagtaatc agcattaaaa atactggtga ccgtcctatc caagttggat cacatttgca
181 cttatttgaa acaaatagtg cattagtatt ctttgatgaa aaaggaaacg aagacaaaga
241 acgtaaagtt gcttatggac gtcgtttcga tattctcagt actgctattc gttttgaacc
301 aggagacaaa aaagaagttt cagttattga tttagtcgga acacgttgaa gtttgaggtg
361 taaacggctt agttaacggc aaaaccttaa aaaataatct atttacaagt ttctatatag
421 acgaagggga acattatgtt taaaatttca agaaaaaatt actcagatct atatggtatc
481 acaactggtg atagcgttag attaggagac acaaatcttt gagttaaagt tgaaaaagac
541 ttaactactt atggcgaa
//
LOCUS YSCFUR1A 2123 bp ds-DNA PLN 15-AUG-1990
DEFINITION S.cerevisiae uracil phosphoribosyltransferase (FUR1) gene,
complete cds.
ACCESSION M36485
KEYWORDS uracil phosphoribosyltransferase.
SOURCE S.cerevisiae (strain FL100, ATCC 28383) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 2123)
AUTHORS Kern,L., de Montigny,J., Jund,R. and Lacroute,F.
TITLE The FUR1 gene of Saccharomyces cerevisiae: Cloning, structure and
expression of wild-type and mutant alleles
JOURNAL Gene 88, 149-157 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept 895 1650 uracil phosphoribosyltransferase (FUR1)
mRNA 886 1791 FUR1 mRNA (alt.)
mRNA 888 1791 FUR1 mRNA (alt.)
signal 841 848 TATA box
signal 1896 1901 poly-A signal
BASE COUNT 659 a 427 c 392 g 645 t
ORIGIN
1 atcgataaaa gaactaatgt ttcccaaaga aataggaaaa agggaataaa gaataatagg
61 ccccacaaag acataaacag cagtcctgac tggggcaact gcacagagga accgattggc
121 agagcgaaaa agcaaacggc atgaacaggg ccaagaactc tcggaatttt accactaata
181 ttaaattgca gcgacaacat tttggcgaag aaatacaagg tggccagcca gccttgtgat
241 atctacaaat tcagatgctt cagataaatt gttaatgcta ttcaacctaa ctttgggagt
301 aaaccaagaa aacttgaaaa atgttctgga aaacatttct caggtgcaga tagctcaaat
361 tagggttaga gacctgcctt caggatctgc caccgctaag gtccgtctgg catatcctac
421 aacacagtct ttggagaagg taagaaaact gttccatggc gctctagttg atggaaggcg
481 catccaagtg gtgattgcat ctgatgaatc gtcccacttg tcgtattaga gtttgtcaac
541 gacactcaca aggtatttaa tcagcaaaat ccccgccaca aactattttt ttgaagacat
601 gctttctcat gactgcctaa taacaatacc tcattctact agtaatcgac ctatgtaatt
661 atttcataaa ctataaagca ggtcattgca ataacagaaa ggccggtttt tctataagct
721 tatctcatcg cataaaaaat cgacagttgt aattatctcc ggcggacttt tccctttccg
781 tctttttttt caaaattttt ttttttttca cttcttcttt caaagctgcc tcaaaagaga
841 tatatatatt ggtaagaatc ctcttccaat actagcttca tttcttcttg aaccatgaac
901 ccgttattct ttttggcttc tccattcttg taccttacat atcttatata ttatccaaac
961 aaagggtctt tcgttagcaa acctagaaat ctgcaaaaaa tgtcttcgga accatttaag
1021 aacgtctact tgctacctca aacaaaccaa ttgctgggtt tgtacaccat catcagaaat
1081 aagaatacaa ctagacctga tttcattttc tactccgata gaatcatcag attgttggtt
1141 gaagaaggtt tgaaccatct acctgtgcaa aagcaaattg tggaaactga caccaacgaa
1201 aacttcgaag gtgtctcatt catgggtaaa atctgtggtg tttccattgt cagagctggt
1261 gaatcgatgg agcaaggatt aagagactgt tgtaggtctg tgcgtatcgg taaaatttta
1321 attcaaaggg acgaggagac tgctttacca aagttattct acgaaaaatt accagaggat
1381 atatctgaaa ggtatgtctt cctattagac ccaatgctgg ccaccggtgg tagtgctatc
1441 atggctacag aagtcttgat taagagaggt gttaagccag agagaattta cttcttaaac
1501 ctaatctgta gtaaggaagg gattgaaaaa taccatgccg ccttcccaga ggtcagaatt
1561 gttactggtg ccctcgacag aggtctagat gaaaacaagt atctagttcc agggttgggt
1621 gactttggtg acagatacta ctgtgtttaa ataaatcaca cccgaacacc atcttgaagg
1681 ttcagaacgg ctgaagccat atcaactttg ggtttctact gttttaaatt tcctttctcg
1741 ttttaaactt ttgttgccgt ctcttctact atcaattttt gttgttcatg catgtttaat
1801 tacctttttt gtaaaaataa tataaacgta ccaatggtca tttataacaa atatgcttga
1861 aaaatctaac gactctgttt cttacattag gttcgaataa acacggtaca tgtcctctag
1921 ccaatctgac atttttggtc caaagtcttt gaaaggtaga taaccccgtt aaaatagaac
1981 caccaatcca tgtagtatat tttctttctg aaggggctat aatctttatc taggatgttc
2041 ctttggttaa tgcctccaaa tcccatagca ttcggtctcc aaagccttta agcgttgtag
2101 ttccgccact taggattatc gat
//
LOCUS YSCMET16A 1986 bp ds-DNA PLN 15-AUG-1990
DEFINITION S.cerevisiae 3'-phosphoadenylyl sulfate reductase (MET16) gene,
complete cds.
ACCESSION J05591
KEYWORDS 3'-phosphoadenylyl sulfate reductase.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1986)
AUTHORS Thomas,D., Barbey,R. and Surdin-Kerjan,Y.
TITLE Gene-enzyme relationship in the sulfate assimilation pathway of
Saccharomyces cerevisiae: Study of the 3'-phosphoadenylylsulfate
(PAPS) reductase structural gene
JOURNAL J. Biol. Chem. (1990) In press
STANDARD simple staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Surdin-Kerjan, 28-JUN-1990.
FEATURES from to/span description
pept 792 1562 3'-phosphoadenylyl sulfate reductase (MET16)
signal 686 692 TATA-box
site 643 648 cis-acting element in general control of AA
synthesis
site 612 618 UAS (methionine metabolism)
BASE COUNT 670 a 368 c 379 g 569 t
ORIGIN
1 atgcatcttg cctctttgat attggttgga tcttcttatg gcttccacga actctcttgt
61 gtaaatatct ggatttctac cgtcctcaat gtattgaaca acttccaagg gaatgtccac
121 cttagacaag ctggattgag gatcgttgct tctcacgttc agcttgtaca agcgatccac
181 atttctttgc aagttggtga tcattccctt ggtggcttct ggagtaccag gaaaatcata
241 tatcgagaca cctaattcaa cgaaggactc aataatcgaa gccacttggt cttgagtagt
301 ggccagttct tgctgcaatt gttcattgtt agtgctgttt ccattcatct tatcggttta
361 tttttctata tatttgcctc tttctcaaac aggagttagt agttaaaagt acgaagttct
421 tgttctttaa tgcgcgctga caaaagaatt ggataaaaga gaatggtggg gggacaagaa
481 ggaaatttgt cctagtttaa catgaatggc atcttgttac cgggtggaca tcacctattg
541 attctaaata tctttacggt ttatcatact gttctttatt ccgtcgttat tctttttatt
601 tttatcatca tttcacgtgg ctagtaaaag aaaagccaca acatgactca gcaaatctcg
661 acaaagtaaa agctcataga gatagtatta tattgatata aaaaaagtat actgtactgt
721 ttgtaacctt ttcaatgctt taagatcaaa actaaggcca gcaaaggtat caacccatag
781 caactcataa aatgaagacc tatcatttga ataatgatat aattgtcaca caagaacagt
841 tggatcattg gaatgaacaa ctaatcaagc tggaaacgcc acaggagatt attgcatggt
901 ctatcgtaac gtttcctcac cttttccaaa ccactgcatt tggtttgact ggcttggtta
961 ctatcgatat gttgtcaaag ctatctgaaa aatactacat gccagaacta ttatttatag
1021 acactttgca ccatttccca caaactttaa cactaaaaaa cgagattgag aaaaaatact
1081 accagcctaa aaatcaaacc attcacgtat ataagccgga tggatgtgaa tcggaggcag
1141 attttgcctc gaaatacggg gatttcttat gggagaaaga tgatgacaag tacgattatc
1201 tggccaaagt ggaacctgca catcgtgcct acaaagagct acatataagt gctgtgttta
1261 ctggtagaag aaaatcacaa ggttctgccc gctcccaact gtcgattatt gaaatagacg
1321 aacttaatgg aatcttaaaa ataaatccat tgatcaattg gacgttcgag caggttaaac
1381 agtatataga tgcaaacaat gtaccataca acgaactttt ggaccttgga tatagatcca
1441 ttggtgatta ccattccaca caacccgtca aggaaggtga agatgagaga gcaggaagat
1501 ggaagggcaa ggcaagaccg agtgtggaat tcatgaagcc agccgattcg cgcaattttt
1561 aaagcaagat gcctagatag atagagtacg atatataacc atatgtatgt gactaattat
1621 ttattcctta ataacaccaa tgattacaac tttctaaagc tggcggagaa ttcgcgctgt
1681 acgagaaaag agcgaaaaca gaggaatatt caaactaaga accaaactgc gataaagagg
1741 attgaaagga aaaacgaaag aaaaggtaaa ctgacaaata tatacattaa ccgatgggta
1801 atttcagatt tcctataaaa accaagctac caccagggtt tatcaatgct cgcatactta
1861 gggataactt caaaagacaa caatttaaag agaatgaaat ccttgttaaa tctttgaaat
1921 tcatcgctag aaatatgaac cttccaacaa aactgaggtt ggaggctcag ttaaaactaa
1981 atgcat
//
LOCUS CHT59KD 2429 bp ds-DNA BCT 15-AUG-1990
DEFINITION C.trachomatis 59-kDa immunogenic protein (SK59) gene, complete cds.
ACCESSION M31119
KEYWORDS antigen; immunogenic protein.
SOURCE C.trachomatis L2 (strain LGV-2 434BU) elementary body DNA, clone
beta-1.
ORGANISM Chlamydia trachomatis
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE 1 (bases 1 to 2429)
AUTHORS Kahane,S., Weinstein,Y. and Sarov,I.
TITLE Cloning, characterization and sequence of a novel 59-kDa protein of
Chlamydia trachomatis
JOURNAL Gene 90, 61-67 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by Y.Weinstein, 05-JAN-1990, for release after publication.
FEATURES from to/span description
pept 466 2043 59-kDa immunogenic protein
BASE COUNT 700 a 626 c 417 g 686 t
ORIGIN
1 ggatcccgaa ttgggtaact ctcagaccca cacataaggc catatgctcg agtacgtgag
61 ccactccact agaatcttgc gggcaagtcc gaaaagaaat attgaacaca ttttcatcat
121 catcattcac gatcatcatg atcgttgccc cggtcggagt atgttccact tcaattagct
181 tgctctcgat ctcgggaaga tcctgactca acttgactac aaaatttcta taggtatccc
241 agttttcata tcccactcaa tcttctataa tagagaagct tgttgcatct ccctattttc
301 gattcaccta acatagaaga cagctactgt gagctcttat atccacacaa atattctttc
361 tgaaggcttc tcttattaaa aaaaaagacg ggactcgatt gagtccccat actagactag
421 cttcctaaaa tataaggcca ggactactcg tctgatttca agacgatgaa tcgcaccaca
481 tctcccttga gaaaccataa ggagaacatt ctctcccttt cgagtttttc aaaacctgat
541 ttaactcttc aacggaagcg acctcgctgc ctattcaccg ctaagataag ctgtccagga
601 gcgacgcctg cagaagctgc aggcgagctg cctccacagc aactaccaga atcctcgggt
661 atctgctgcc aatccgagtt tcttacaaat ttctggagta atgtaagtct cacggactcc
721 catcttctgg caacgctgaa acgccatcct ctgttgggtc tgtgtaaccg tcacaggtat
781 ctcgcttgtt ttcccttcac gacgattttt aaaataacac gagtccctgg catcattagg
841 gaaatggcat tacgcaacgc actcaaagac tctacttctt tttccattgt aagccacaat
901 gacatcttct tccagccccg ctttttctgc tggagaacct ttaacaacat ccgtcaccaa
961 acgttccgta cacttttcca atttgtaaca agtagccaat tcagaatcta tcggttgcaa
1021 ggtaactccc aaaaagcctc ttgttacctg cccatcacta atcaattgat caatgactcg
1081 tttagccatc aagctaggaa tagcaaaccc tattccaata tatcccccgc taccactgac
1141 aatggcagta ttaaccccga taacttgacc attgattgtt taacaatgga ccgccctgaa
1201 ttcccaggat taatggcagc atctgttgta acaaatcttc gaaatctaca atatgtagat
1261 gatttcttcc tttcagcact aacgaccccg atagtgaccg ttgcttgcaa tccaaaagga
1321 tttccaatag caatagccca gtcacctatc tgcagtcgat cagaattccc aaaagtcaaa
1381 aatggtaatt tctctgctgt aattttgatc acagcaagat ctgtttttgg atctaacccc
1441 acgatcttag gctgtgtatt tttgtccatc gtggagagta acatgaattt ttcctgcatc
1501 ctcgactaca tgatggttag taacaacata accattcgaa ttcagaaaca tagaacccag
1561 ttcctcttac agcatcacgc cgctgctgcg gacgctgctg ctctctatcg aaggcaaccc
1621 aaaaaatcga ttaaaaaatt cgtcattaaa ataatcaaaa acaaaagggt tctcttgaag
1681 cctcttttgt ttcctggaga agcaatagcc tggttccctg ttttaggaaa attttcatat
1741 atatcaactc caggacgttg ccttagacgc gcgacccgag taaaacctcg ggatacttct
1801 ttaggagatc ttcttgtgaa acctcttgat ctccgtgagg atactgcaag acaaatatca
1861 gccattaaga atctttcttt gacgcactat agcctagcat tggcgaagag aaaacgtgat
1921 gtcgatagca acacacataa taataatctt ttcaatcatc ttttccttga taagcgatct
1981 gcgtctagcc cggtttttca tttatgcacc ataacaagca gatatgcagc atacaaaatc
2041 taatgatgca aatcaaggag actactctga tgattctcca atctaaaaaa ctaacgtggt
2101 tttagaacgg atgcaaccgg cctctccaat cagtgcagga gattctacaa cggtaacccc
2161 tgcctgtctc aaagcttctt gtttgctaaa agcatcccca cttttccctg aaataatagc
2221 tcctgcatgt cccatacgtt tccctttggg gagccgtagc tcctgcaata aatgcaatca
2281 caggcttact actatgttga cgtatccaat ctgcagcttc ctcttcagcg cttccaccaa
2341 tctccaatca taagaacagc ttctgtttgg ctatcctttt caaactcttg gagaggcatc
2401 gataaaagat gtgccacttt aaaggatcc
//
LOCUS AFAAZU 810 bp ds-DNA BCT 15-AUG-1990
DEFINITION A.denitrificans azurin (azu) gene, complete cds.
ACCESSION M30388
KEYWORDS azurin.
SOURCE A.denitrificans (strain NCTC8582) DNA.
ORGANISM Alcaligenes denitrificans
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Aerobic rods and cocci; Alcaligenaceae.
REFERENCE 1 (bases 1 to 810)
AUTHORS Hoitink,C.W.G., Woudt,L.P., Turenhout,J.C.M., van De Kamp,M. and
Canters,G.W.
TITLE Isolation and sequencing of the Alcaligenes denitrificans
azurin-encoding gene: Comparison with the genes encoding blue
copper proteins from Pseudomonas aeruginosa and Alcaligenes
denitrificans
JOURNAL Gene 90, 15-20 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.W.Canters, 01-DEC-1989, for release after publication.
FEATURES from to/span description
pept 307 756 azurin (azu) precursor
sigp 307 366 azurin signal peptide
matp 367 753 azurin
site 190 203 fnr-box
site 251 267 ntrA-box
binding 296 299 ribosome binding site
signal 778 800 terminator
BASE COUNT 160 a 238 c 245 g 167 t
ORIGIN
1 cccgccgctg tgctgccttg catgctcgaa ctctacttgt ttgcaattgt ttgcaggcat
61 cctacgaaga tggaagaccc ttcgtattgc ggtttgtcaa tgggcacggt ttcggtgcgc
121 cggatgggcc aataccccta tgcggcatgg ggatttcccc tgtttttggg catctgaacg
181 gggtgggatt gatgtccgtc aatagcgcgc ttttttcgcc gtcttagact tgtgcgtggc
241 ggcagcgacg caggcatgtg cctggcgcga gtcgaagaat ggccgccctg tttacggaga
301 gtctccatgc tggcaaaagc caccctagct atcgttctgt ccgcagccag cctgcccgtg
361 ctggctgctc aatgcgaagc aaccatcgaa agcaacgacg ccatgcagta caacctgaag
421 gaaatggtcg ttgacaaaag ctgcaagcag ttcacggtgc acctcaagca cgtcggcaag
481 atggccaagg tcgccatggg ccacaactgg gtgctgacca aggaagccga caagcagggc
541 gtcgccactg acggcatgaa cgccggcctg gcgcaggact acgtgaaggc gggcgatacc
601 cgtgtcatcg cgcacaccaa ggtcatcggc ggcggcgaat cggattcggt aacgttcgac
661 gtgtccaagc tgaccccggg cgaagcctat gcctacttct gctcgttccc cggccactgg
721 gccatgatga agggcacgct caagctgagc aactgacccc gccctagcgc gcagataccg
781 gcccagggcc ggtttttttt gtcttggggc
//
LOCUS PSEAZU 1287 bp ds-DNA BCT 15-AUG-1990
DEFINITION P.aeruginosa azurin (azu) gene, complete cds.
ACCESSION M30389
KEYWORDS azurin.
SOURCE P.aeruginosa (strain CIT135) DNA.
ORGANISM Pseudomonas aeruginosa
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 1287)
AUTHORS Hoitink,C.W.G., Woudt,L.P., Turenhout,J.C.M., Van De Kamp,M. and
Canters,G.W.
TITLE Isolation and sequencing of the Alcaligenes denitrificans
azurin-encoding gene: Comparison with the genes encoding blue
copper proteins from Pseudomonas aeruginosa and Alcaligenes
denitrificans
JOURNAL Gene 90, 15-20 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by G.W.Canters, 01-DEC-1989, for release after publication.
FEATURES from to/span description
pept 213 < 1 (c) ORF1
pept 489 935 azurin (azu) precursor
sigp 489 548 azurin signal peptide
matp 549 932 azurin
pept > 1287 985 (c) ORF2 (AA at 1287)
site 318 333 ntrA-box
site 403 416 fnr-box
signal 958 985 terminator (bidirectional azu and ORF2)
binding 476 481 ribosome binding site
binding 224 220 (c) ribosome binding site (ORF1)
BASE COUNT 229 a 423 c 428 g 207 t
ORIGIN
1 ctgcaggctc tgcgggatga tcccgatcac ttcgctgccg gcggccaatg cggcgtccgc
61 cacggtgccc atcagaccga ccgcgccgcc accgtagacc agggtcaggc cgcgctcggc
121 caggtgccgg ccgagggcca cggcggcttc ctggtagacc ggggaagcgc cggggctggc
181 gccacagaat acgcagacgg aacgcaaggt catgatcgac tcctgtcggg ggtggaaaaa
241 ggcgcacagg gtagcggctg ggagcgcttc gaccaagccg tgcgaagcgt tgccggacgt
301 tgcgtcgcag gcgcgaagcg gcacatctgt gctaaaacag gagttccccg tagtaaacgc
361 cgggcagatc ccgctcgatg ccccgccacg tccggttcgg gtttgacctg aatcagtgga
421 actcggtgcc cgatcgggca gtctgctctt tcaggattca tcgcccaacc tgcctaggag
481 gctgctccat gctacgtaaa ctcgctgccg tatccctgct gtccctgctc agtgcgccgc
541 tgctggctgc cgagtgctcg gtggacatcc agggtaacga ccagatgcag ttcaacacca
601 atgccatcac cgtcgacaag agctgcaagc agttcaccgt caacctgtcc caccccggca
661 acctgccgaa gaacgtcatg ggccacaact gggtactgag caccgccgcc gacatgcagg
721 gcgtggtcac cgacggcatg gcttccggcc tggacaagga ttacctgaag cccgacgaca
781 gccgcgtcat cgcccacacc aagctgatcg gctcgggcga gaaggactcg gtgaccttcg
841 acgtctccaa gctgaaggaa ggcgagcagt acatgttctt ctgcaccttc ccgggccact
901 ccgcgctgat gaagggcacc ctgaccctga agtgatgcgc gagcgatccg ctgcatgaaa
961 aagcccggcc gctgccgggc tttttcatgg gcgcgcgccg ggctcagcgc gcgtagctgc
1021 cgccatcgcc tcgccggcca gttggtgcac gcgccgggtc ggatgccact cgtcccagaa
1081 gtagtactgg tccgggttgg cgcaggccgg gcggacgctg ggctgggtcg gctggcaggg
1141 cgcgtccagc tccaccaggc catagcgcgc cgggttgcgc cgcaagtggc ggctgaaggt
1201 gagatggtcg aaccagctca gctccaggcc gcgggtcttg cgcagggcgg cgagctggat
1261 cggcaggctg gcgttgactg cctgcag
//
LOCUS MZEADH1CM 6167 bp ds-DNA PLN 15-AUG-1990
DEFINITION Z.mays alcohol dehydrogenase (ADH-1 C-m allele) gene, complete cds.
ACCESSION M32984
KEYWORDS alcohol dehydrogenase.
SOURCE Z.mays DNA.
ORGANISM Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 6167)
AUTHORS Osterman,J.C. and Dennis,E.S.
TITLE Molecular analysis of the ADH1-Cm allele of maize
JOURNAL Plant Mol. Biol. 13, 203-212 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by J.C.Osterman, 18-MAR-1990.
FEATURES from to/span description
pept 1217 1250 alcohol dehydrogenase, exon 1 (ADH-1) (EC
1.1.1.1)
1785 1921 alcohol dehydrogenase, exon 2
2019 2065 alcohol dehydrogenase, exon 3
2482 2807 alcohol dehydrogenase, exon 4
2894 2976 alcohol dehydrogenase, exon 5
3070 3145 alcohol dehydrogenase, exon 6
3487 3548 alcohol dehydrogenase, exon 7
3636 3731 alcohol dehydrogenase, exon 8
3823 3984 alcohol dehydrogenase, exon 9
4085 4201 alcohol dehydrogenase, exon 10
pre-msg 1110 > 4201 ADH-1 mRNA and introns
IVS 1251 1784 ADH-1 intron A
IVS 1922 2018 ADH-1 intron B
IVS 2066 2481 ADH-1 intron C
IVS 2808 2893 ADH-1 intron D
IVS 2977 3069 ADH-1 intron E
IVS 3146 3486 ADH-1 intron F
IVS 3549 3635 ADH-1 intron G
IVS 3732 3822 ADH-1 intron H
IVS 3985 4084 ADH-1 intron I
BASE COUNT 1574 a 1335 c 1378 g 1880 t
ORIGIN 1 bp upstream of BamHI site.
1 ggatccaata ggctagtcac ttttacttta gcttctgaga tccaaacagt cacttaggac
61 atgtttggaa gcacaccagt ttttaaaaaa ctttttccta tcctcaattt ctagaaaatg
121 gtttatgaaa aaaaatttgg gtgggatgtt tgtaacccag tttctagttt tttttataaa
181 gagagtagct tcttggtttt agttagagga gagtagcttc ttggttttta agaaactggg
241 aatccagttt ctataaactg gaacataaat aagtatattt ggaatcactt tagtttgtac
301 aaaccgattt cttagaaatt ggatgcttat aaataggccc tcaatgtcct tgttgggttt
361 atgaaattta catctattac cacattttta aaaatagagg aagagtatgc tagtagttat
421 gtataaaaaa actagaaact gtttttttta aaaaaaaact gagttccagt ttcctttatc
481 taattctttt ataagctatt ttttagaaaa ggatagaaac tgtttttaaa aaaactggtg
541 tgcttctgtt taactcttcg taagaacagt gttacgtccc gtgtctatat tttgcttttg
601 ttgaaagcca tcgtaagtac atgcttgcgt gggtgaaatg ccatcgcaat gctacaactt
661 ttcggctccc tcctgcttcg gtgcttccac atgccctgca cggcgtctag aaaccctaat
721 gattcagcag cacacctgtc cgcctagccg cctacgcgta cacagaaaac aaattttttg
781 tccacacacg cgcgcgctcc gagccgcaga tccgagctag cgcggcgcat ccgacggcca
841 cgacagcgcg gtgccgtcct ccgccgccac cgcttggcgc ttgtccgcac cccccaccag
901 tccaccacct cccccacgag cgaaaaccac ggtccacgga ccacggctat gttccactcc
961 aggtggaggc tgcagccccg gtttcgcaag ccgcgccgtg gtttgcttgc ccacaggcgg
1021 ccaaaccgca ccctccttcc cgtcgtttcc catctcttcc tcctttagag ctaccactat
1081 ataaatcagg gctcattttc tcgctcctca caggctcgtc tcgctttgga tcgattggtt
1141 tcgtaagtgg tgagggactg agggtctcgg agtggattga tttgggattc tgttcgaaga
1201 tttgcggagg ggggcaatgg cgaccgcggg gaaggtgatc aagtgcaaag gtccgccttg
1261 tttctcctct gtctcttgat ctgactaatc ttggtttatg attcgttgag taattttggg
1321 gaaagcttcg tccacagttt tttttttcga tgaacagtgc cgcagtggcg ctgatcttgt
1381 atgctatcct gcaatcgtgg tgaacttatt tcttttatat ccttcactcc catgaaaggc
1441 tagtaatctt tctcgatgta acatcgtcca gcactgctat taccgtgtgg tccatccgac
1501 agtctggctg aacacatcat acgatattga gcaaagatct atcttccctg ttctttaatg
1561 aaagacgtca ttttcatcag tatgatctaa gaatgttgca acttgcaagg aggcgtttct
1621 ttctttgaat ttaactaact cgttgagtgg ccctgtttct cggacgtaag gcctttgctg
1681 ctccacacat gtccattcga attttaccgt gtttagcaag ggcgaaaagt ttgcatcttg
1741 atgatttagc ttgactatgc gattgctttc ctggacccgt gcagctgcgg tggcatggga
1801 ggccggcaag ccactgtcga tcgaggaggt ggaggtagcg cctccgcagg ccatggaggt
1861 gcgcgtcaag atcctcttca cctcgctctg ccacaccgac gtcgacttct gggaggccaa
1921 ggtatctaat cagccatccc atttgtgatc tttgtcagta gatatgatac aacaactcgc
1981 ggttgacttg cgccttcttg gcggcttatc tgtcttaggg gcagactccc gtgttccctc
2041 ggatctttgg ccatgaggct ggagggtatg ttctattccc cgatttactt cactatgttg
2101 ctgactatat atgtgctgtg tttatatttt gcatatttat tatgtttttg cgtctgaatt
2161 tatgggtatg gttggtggtc tttgtttact gttttactag atgcatgtgg aagagtcaga
2221 agaaatagtt tttgtttgaa atggtatacc aacggttgga tattatctgt gtggacatca
2281 gatgttctgg gttactggca gtggactttt gacagattta tctatgattc tttcattagc
2341 agtttcttcg gctaatttac tcttactatt ttttcagtat acaaaggcac gtacagcttg
2401 gattgtgtag aatcatttta gatctgttat ctgaggcaaa tttgcttatt ctagccgcct
2461 gaaaattctt gattttgcca gtatcataga gagtgttgga gagggtgtga ctgacgtagc
2521 tccgggcgac catgtccttc ctgtgttcac tggggagtgc aaggagtgcg ctcactgcaa
2581 gtcggcagag agcaacatgt gtgatctgct caggatcaac accgaccgcg gtgtgatgat
2641 tgccgatggc aagtcgcggt tttcaatcaa tgggaagcct atctaccact ttgttgggac
2701 ttccaccttc agcgagtaca ctgtcatgca tgtcggttgt gttgcaaaga tcaatcctca
2761 ggctcccctt gataaagttt gcgtccttag ctgtggtatt tctaccggta agttcattta
2821 ctacattttg gtgtggatgc tggagtacat ttatcttgag atgctgagtt acacaaattc
2881 tttatctgtt taggtcttgg tgcatcaatt aatgttgcaa aacctccgaa gggttcgaca
2941 gtggctgttt tcggtttagg agccgttggt cttgccgtaa gtgttgaaac gatttgcttg
3001 ttctatgacc tttcaattgc aatgagaacg tgtgttgggt ttgcatctga ttaccctgcg
3061 catggttagg ctgcagaagg tgcaaggatt gctggagcgt caaggatcat tggtgtcgac
3121 ctgaacccca gcagattcga agaaggtaca gtacacacac atgtatatat gtatgatgta
3181 tcccttcgat cgaaggcatg ccttggtata atcactgagt agtcatttta ttactttgtt
3241 ttgacaagtc agtagttcat ccatttgtcc cattttttca gcttggaagt ttggttgcac
3301 tggccttggt ctaataactg agtagtcatt ttattacgtt gtttcgacaa gtcagtagct
3361 catccatctg tcccattttt tcagctagga agtttggttg cactggcctt ggactaataa
3421 ctgattagtc attttattac attgtttcga caagtcagta gctcatccat ctgtcccatt
3481 tttcagctag gaagttcggt tgcactgaat ttgtgaaccc aaaagaccac aacaagccgg
3541 tgcaggaggt ctgtttcttt acccaaggca acaaaaggtt atcacagctt atgctgaact
3601 tggccataac attcaataat tcctttatgg tctaggtact tgctgagatg accaacggag
3661 gggtcgaccg cagcgtggaa tgcactggca acatcaatgc tatgatccaa gctttcgaat
3721 gtgttcatga tgtaagtata tgtatacact ctcagctact ttcattctcc aggttccctt
3781 catccagaca tgcatgttct aaccgccgcc ctcgtgatcc agggctgggg tgttgccgtg
3841 ctggtgggtg tgccgcataa ggacgctgag ttcaagaccc acccgatgaa cttcctgaac
3901 gaaaggaccc tgaaggggac cttctttggc aactataagc cacgcactga tctgccaaat
3961 gtggtggagc tgtacatgaa aaaggtaaat tgcaaagtgc tgttccttcg gtttccttac
4021 cagccgagct tttgctgaaa aactgttaag aatcgttcct gcaattctgc ttggctctgc
4081 acaggagctg gaggtggaga agttcatcac gcacagcgtc ccgttcgccg agatcaacaa
4141 ggcgttcgac ctgatggcca agggggaggg catccgctgc atcatccgca tggagaacta
4201 gatttcgctg tctagtttgt gatctggctg ggcttggggt taataaagga ggcaatgcta
4261 gcctgccctt tcgatgagga ggtacataca cgctggcgat ggaccgcgct tgtgtgtcgc
4321 gttcagtttg gcttttgcca agcagtaggg tagcttcccg tgtcggtaat tatatggtat
4381 gaaccatcac cttttggcgc aatacatggt atgaacgtaa gatacaaatt ccaactacct
4441 ctagctcgct tgtgtgctat atgtatctct ctcgacggat gacacaagat cgcttctata
4501 tccgaagtga aactaaaagg agaaggaaaa gaaggtaaca gaataggaac cggtttggtg
4561 agaattggag aggattcatg aaagagaaaa tcccttttca ttaaatttta aatagcaagt
4621 gatttactct ctcatgatct cctccagttt ccatttcatc aaaacaaacc ttattcattt
4681 tcccctctaa tctctttctt gtcaccaccg gtggagcaag gtgattaaag agactaaatt
4741 attattcaat gaatagtagg ggttttagcc cctcaattcc tccaatacct ttgctcccaa
4801 ataagggggt gtttggtttc tagggactaa ttcctccaat acctttgctc ccaaataagg
4861 gggtgtttgg tttctaggga ctaatgttta gtcccatcat ttttttttct attttagtct
4921 ataaattgct aaatatagaa actaaaataa attaaaatat agttttagtt tctatatttg
4981 acaattttag aactaaaatg gaataaaatg tagggactaa aaattagtct agaaatcaaa
5041 caccccctaa atccctaaga gccgaggaag gggattaaaa aggataaaat cttctttgtg
5101 ttcaatttta aataggactc gccgtatcgg taaggccttg ttcgtttaca ttggattgca
5161 cctggaatcg ttccggctaa tcaaagttta tataaattag agaagcaatc cggatcggaa
5221 tcgttccgac ccaccaatcc gacgcaaacg aacaaggcct aaggcttcgc ggcggggctc
5281 gcagtccgga cgccggagag ggggagtgga gatggagaat gacaaggggg tgttctggaa
5341 agtttccttt ccaagagtaa gggtggttgg tttcgtacac taatttttaa gagcgtttgg
5401 ttaagaaaca gagaaaaatg gagtaactct attcttattt tttatgttta gttttcatta
5461 aaaaaggagc agaataccac ttgaagttct tatatagaaa tttatcataa atagttaaaa
5521 tgctctcact ccataaaaac aatcggatgc tagcgctctt cttcctatcc taccctctat
5581 attcatatga ctctttaacc aaacagagaa cggagcggct ccgctctatt ttactcttca
5641 accaaataaa aaaggagcaa ctctgtttgt catacgcgga atagaacgga tttatcctca
5701 aaaactagaa tggagcccct ctattttagt cgattctcca accaaacgca tagtgtctcc
5761 atttcattct attttagtct ctaaattgac aaatacataa actaaattat attttaagtt
5821 ttcgtattta atcaatccct accaaccaaa cactccctaa tttcgcatat cagccccaaa
5881 tcaagagtgg ttgacccatc gagacgttat cggcggatca aaggcatgcc ccgctaagca
5941 ataagtgtct aaactaacgt gccgtcgatc tcattaaaca gcaccacgag ctaaacagaa
6001 tgccaacctc aaaatcaaac atcacctgga tgctggatct gacatccgac ctaggtgcta
6061 ggcaacgatt gtgcgtagtg ctgaccatat ttgagatttt cactttattt attaaaaaaa
6121 agaggccagc agggtgggcc gctacccggc ctggtggccg agctaga
//
LOCUS CFICMCASE 1828 bp ds-DNA BCT 15-AUG-1990
DEFINITION C.uda endoglucanase gene, complete cds.
ACCESSION M36503
KEYWORDS endoglucanase.
SOURCE C.uda CB4 DNA.
ORGANISM Cellulomonas uda
Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE 1 (bases 1 to 1828)
AUTHORS Nakamura,K., Misawa,N. and Kitamura,K.
TITLE Sequence of a cellulase gene of Cellulomonas uda CB4
JOURNAL J. Biotechnol. 4, 247-254 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 105 1184 endoglucanase
BASE COUNT 348 a 542 c 557 g 381 t
ORIGIN
1 ctgcagagtc agggaggcag cgctcacgta atattgcagc gtgaccgcgt gttctctgtc
61 tctgacgttc agtttcttta ctaccatcca taatgagtga atttatgccc ctgcgtgctt
121 tagtggcggt gatagtgaca acggcagtaa tgctggtgcc ccgggcgtgg gcgcagacgg
181 cctgggagcg ttataaggcc cgttttatga tgccggacgc gcgtatcatt gataccgcca
241 atggcaatgt gtcgcatacg gaaggccagg gcttcgccat gctcctggcg gtggcgaata
301 acgatcgccc ggcgttcgac aagctgtggc agtggacgga cagcaccctg cgcgacaagt
361 ctaacgggct gttttactgg cgctataacc cggtggcgcc ggacccgatc gccgataaaa
421 acaacgccac cgatggcgat accctgatcg cctgggcgct gctgcgcgcg caaaagcagt
481 ggcaggacaa gcgctacgcc acggcctccg atgccatcac cgcctccctg ctgaaatata
541 cggtggtgac tttcgccggt cgccaggtga tgctcccggg cgtgaagggg tttaaccgca
601 acgaccacct gaaccttaac ccctcctatt tcatcttccc ggcctggcgg gcctttgcgg
661 agcggacgca cctgaccgcc tggcggacat tgcagagtga cgggcaggcg ctgctggggc
721 aaatgggctg ggggaaatcg catctgccca gcgactgggt ggcgctgcgg gcggatggca
781 agatgctgcc ggccaaagag tggccgccgc ggatgagttt cgatgcgatc cgtatcccgc
841 tgtatatctc gtgggtcgat ccgcacagcg ccttgctcgc accgtggaaa gcctggatgc
901 agagttaccc gcgcctgcaa actccggcgt ggatcaacgt tagcaccaac gaggtcgccc
961 cgtggaatat ggccggcggc ctgctggcgg tgcgtgattt aacgcttggc gaaccgctgg
1021 aacgccgcag attgacgaca aggatgatta ttactccgcc agcctcaagc tgctggtctg
1081 gctggcgaaa caggatcagc gctagcgctg tgatggcttt gcaggtttct cagcccgtat
1141 gcctgcgggc tgagagaaaa gagcaggaac gtctcacgat gtaaggccgc cagaataggc
1201 ggccttgtcg cttattgcgg ataaggcacc caactgccgc cattcagctg gacataaggc
1261 ttgccctgat actggataac gatggcgttg gcgttttcgg acaccgccgc gctctgcggc
1321 aggttggcga catactgctg ccagttgacg ctgtcttcgc tgaacatttt gccgtcgagg
1381 gcgcgcgcac caccagctcc gacaccgcca ggtagctgct gggctgatcg atgataattg
1441 gcgcgccttc atgtggcgcc ttcatgccga agaatttcac cgccgtcggg acgttagtga
1501 tcgacgggct cgggatatcc cgcaggccag acacctgcat cttatcgccc ttcagcgcgc
1561 cgccgtgttc cggcaccacc accaccatca ccttacgccc cgatttttcc agttcggtga
1621 agaagttatc caggtcgtca aacagcttct gcgcccgcac tttgtagtcc gcggttttgc
1681 tttgccccgg gaagtgattg ccgtcatgca gcggcagggt gttatagaac gtggcgctcc
1741 gcggattgct gctggcctct tcggtcttca gccacgggtt gagaaccgcg agatcctcat
1801 acactggcga accatcaaat gcctgcag
//
LOCUS HUMTAPA1 1496 bp ss-mRNA PRI 15-AUG-1990
DEFINITION Human 26-kDa cell surface protein TAPA-1 mRNA, complete cds.
ACCESSION M33680
KEYWORDS 26-kDa cell surface protein TAPA-1;
target of antiproliferative antibody.
SOURCE Human cell line OCI-LY8, cDNA to mRNA, clones 7-3 and 8-1.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1496)
AUTHORS Oren,R., Takahashi,S., Doss,C., Levy,R. and Levy,S.
TITLE TAPA-1, the target of an anti-proliferative antibody, defines a new
family of transmembrane proteins
JOURNAL Mol. Cell. Biol. 10, 4007-4015 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by S.Levy, 10-APR-1990, for release after publication.
FEATURES from to/span description
pept 239 949 26-kDa cell surface protein TAPA-1
signal 1455 1460 Poly-A signal
BASE COUNT 257 a 504 c 413 g 322 t
ORIGIN
1 ccattgtgct ggaaaggcgc gcaacggcgg cgacggcggc gaccccaccg cgcatcctgc
61 caggcctccg cgcccagccg cccacgcgcc cccgcgcccc gcgccccgac cctttcttcg
121 cgcccccgcc cctcggcccg ccaggccccc ttgccggcca cccgccaggc cccgcgccgg
181 cccgcccgcc gcccaggacc ggcccgcgcc ccgcaggccg cccgccgccc gcgccgccat
241 gggagtggag ggctgcacca agtgcatcaa gtacctgctc ttcgtcttca atttcgtctt
301 ctggctggct ggaggcgtga tcctgggtgt ggccctgtgg ctccgccatg acccgcagac
361 caccaacctc ctgtatctgg agctgggaga caagcccgcg cccaacacct tctatgtagg
421 catctacatc ctcatcgctg tgggcgctgt catgatgttc gttggcttcc tgggctgcta
481 cggggccatc caggaatccc agtgcctgct ggggacgttc ttcacctgcc tggtcatcct
541 gtttgcctgt gaggtggccg ccggcatctg gggctttgtc aacaaggacc agatcgccaa
601 ggatgtgaag cagttctatg accaggccct acagcaggcc gtggtggatg atgacgccaa
661 caacgccaag gctgtggtga agaccttcca cgagacgctt gactgctgtg gctccagcac
721 actgactgct ttgaccacct cagtgctcaa gaacaatttg tgtccctcgg gcagcaacat
781 catcagcaac ctcttcaagg aggactgcca ccagaagatc gatgacctct tctccgggaa
841 gctgtacctc atcggcattg ctgccatcgt ggtcgctgtg atcatgatct tcgagatgat
901 cctgagcatg gtgctgtgct gtggcatccg gaacagctcc gtgtactgag gccccgcagc
961 tctggccaca gggacctctg cagtgccccc taagtgaccc ggacacttcc gagggggcca
1021 tcaccgcctg tgtatataac gtttccggta ttactctgct acacgtagcc tttttacttt
1081 tggggttttg tttttgttct gaactttcct gttacctttt cagggctgat gtcacatgta
1141 ggtggcgtgt atgagtggag acgggcctgg gtcttgggga ctggagggca ggggtccttc
1201 tgcccctggg gtcccagggt gctctgcctg ctcagccagg cctctcctgg gagccactcg
1261 cccagagact cagcttggcc aacttggggg gctgtgtcca cccagcccgc ccgtcctgtg
1321 ggctgcacag ctcaccttgt tccctcctgc cccggttcga gagccgagtc tgtgggcact
1381 ctctgccttc atgcacctgt cctttctaac acgtcgcctt caactgtaat cacaacatcc
1441 tgactccgtc atttaataaa gaaggaacat caggcatgct aaaaaaaaaa aaaaaa
//
LOCUS DROSYNCL 3727 bp ds-DNA SYN 15-AUG-1990
DEFINITION Synthetic cloning vector encoding heat-shock protein 82/neomycin
phosphotransferase fusion protein (hsp82-neo) gene, complete cds.
ACCESSION M32616
KEYWORDS heat-shock protein 82; neomycin phosphotransferase.
SOURCE Synthetic, D.pseudoobscura, D.melanogaster and bacterial DNA, clone
pHS85.
ORGANISM Cloning vector
Artificial sequences; Cloning vehicles.
REFERENCE 1 (bases 1 to 3727)
AUTHORS Sass,H.
TITLE P-transposable vectors expressing a constitutive and
thermoinducible hsp82-neo fusion gene for Drosophila germline
transformation tissue-culture transfection
JOURNAL Gene 89, 179-186 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.Saas 06-MAR-1990, for release after publication.
FEATURES from to/span description
pept 2068 2925 heat-shock protein 82/neomcyn
phosphotransferase fusion protein (hsp82-neo)
IVS 1005 2067 hsp82 intron A
pre-msg 6 3610 hsp82-neo fusion protein mRNA and intron
site 6 868 D.pseudoobscura hsp82 gene 5' flank
site 869 1004 D.pseudoobscura heat-shock protein 82, exon 1
site 2068 2126 D.pseudoobscura hsp82 truncated exon 2
site 2127 2142 coding linker
site 2143 3269 neomycin phosphotransferase coding sequence
site 3270 3610 non-coding 3'flank of D.melanogaster hsp82
gene with Poly-A signal
site 3620 3726 multiple cloning site (MCS)
BASE COUNT 950 a 866 c 882 g 1029 t
ORIGIN 23 on XR.
1 ggatccgatg gatttttacc atattattat tatttctagc cacgttgcaa ctctatgtca
61 gtaccggaaa tagcagccct ggagtctctt agcctctaga aacggctaga acattctacg
121 cttgtggttg gttttcattg aaagcaggcg tcttttatat actttacggt atatagctac
181 atgtatataa tggtatactt catcaatatc atcaatctat gaattttaat ttttaagagt
241 acatatataa attaacatgg gggatatagt tctcaatacc caagtatttg aattttccat
301 ctctcatcgg gggtaattca tgaaccggtt ccagccgaaa aatgaacgaa attcatgaga
361 gattattttt tcgggattgc ttgccaatac atttcggaaa aacaaaatgt actacatttt
421 tgtcatctca gggtgctcca attaattatg aatgctacga cactacaaag cagcttggaa
481 atccgaattt taacaataat taaaggaaat agggtatagc gtatataggg tatcatagct
541 gaaacgggta taccaacaat aatgacgcag cacttacgtt tcactccgta ctcacttacg
601 atttatgctt ataatttttg ttcacctctt ttacttaaac ctcactttaa aaacaatcaa
661 ataaatggga gtatttatgt atatttctaa gattacggcg gtattgttct gctgtctgcg
721 gtcacactgg ttttcagcct cggtgcaact ctgtttcagt accggaaata gcagccctgg
781 attctcgtag cctctagaaa cgtctagaaa attctacgct tggggttggt ttgctataaa
841 agcaggcggg ccgactgttg ccggctcgag tcttgaaaaa tttttgtcca gtgaaggtgc
901 gtttgcttag agcgcagtgc aacaaagtga atttattcta cacaaatcga agtgaaaata
961 tatatatatt tttatctctg ctgttaaatt aaaacacata caaggtaagc gttaacaatg
1021 aaagtgcatt tatttaacaa aatgtaaaga tctgctgtgg tgcaatgctt gctgcgcgtc
1081 tgctgatgaa aagttcttga cccaaatgca gaaaatcaat agaatctgtg aaatcttcta
1141 taatcttaaa attagattaa agttctattt ttttgcccga gtttgtaacc acgggcgata
1201 aaaagtagct ttacgcctcg cacaccaata cacgaacaga aaaattatgc cggctgtaat
1261 atgagctcgg cgcgaaattt ctagatgacc ggttcttaga acatcaacct tgcatgtcca
1321 acaaatgctg gttaattaaa gacgtgcctt aacttaattt tcttggcaca cgtgcttatt
1381 tgaattcagt cttttgcact tgccatgcac acagccacac atatgtgaat ttgcgaattt
1441 gccactcatg catacactca tgtatgttcc atcatcgaga aaattcgaaa atcgtgaatc
1501 aaacttcggc atgaatcaaa tttcaaagag gtctttgttt ccacctggtt ctagaagttt
1561 cctttcgcgt gcttggatac ctatcttatg cataaacggt ttctgcacat gtaacttgaa
1621 cacatacaca cttgcaaaca tatgtatgta catatgcata ccctgaccac aaaattttca
1681 gcaaacttta gccgtacatc aaaccaccaa agagctgtgc tgttgtcaag gagaattttc
1741 ttccagaaag cttcaattag attgtttatc tgggggtgat gtacgcattg gacaacccta
1801 tgcgctctag aaacttccag taaatgttaa ctggatgtac aatgggtaca tccctaagcg
1861 tgcgagtgta tgcgtgttcg ctaactgtaa tgtatgtgtg ttcgtgtgcg aaagagaaaa
1921 ggatgagaag tctgccattt tgaaataaaa agattttgtg ctaggggggt ggggaaatat
1981 gattatcgaa aatgggcagt gaacaatgca gctgcatatt taatgagttg tgactaattc
2041 tcgtgtggta ttttcttgct cttccagatg cccgaagaag ctgagacttt cgcattccag
2101 gctgagattg ctcagcttat gtcgttgatc cggccaagct tggatggatt gcacgcaggt
2161 tctccggccg cttgggtgga gaggctattc ggctatgact gggcacaaca gacaatcggc
2221 tgctctgatg ccgccgtgtt ccggctgtca gcgcaggggc gcccggttct ttttgtcaag
2281 accgacctgt ccggtgccct gaatgaactg caggacgagg cagcgcggct atcgtggctg
2341 gccacgacgg gcgttccttg cgcagctgtg ctcgacgttg tcactgaagc gggaagggac
2401 tggctgctat tgggcgaagt gccggggcag gatctcctgt catctcacct tgctcctgcc
2461 gagaaagtat ccatcatggc tgatgcaatg cggcggctgc atacgcttga tccggctacc
2521 tgcccattcg accaccaagc gaaacatcgc atcgagcgag cacgtactcg gatggaagcc
2581 ggtcttgtcg atcaggatga tctggacgaa gagcatcagg ggctcgcgcc agccgaactg
2641 ttcgccaggc tcaaggcgcg catgcccgac ggcgaggatc tcgtcgtgac ccatggcgat
2701 gcctgcttgc cgaatatcat ggtggaaaat ggccgctttt ctggattcat cgactgtggc
2761 cggctgggtg tggcggaccg ctatcaggac atagcgttgg ctacccgtga tattgctgaa
2821 gagcttggcg gcgaatgggc tgaccgcttc ctcgtgcttt acggtatcgc cgctcccgat
2881 tcgcagcgca tcgccttcta tcgccttctt gacgagttct tctgagcggg actctggggt
2941 tcgaaatgac cgaccaagcg acgcccaacc tgccatcacg agatttcgat tccaccgccg
3001 ccttctatga aaggttgggc ttcggaatcg ttttccggga cgccggctgg atgatcctcc
3061 agcgcgggga tctcatgctg gagttcttcg cccaccccgg gctcgatccc ctcgcgagtt
3121 ggttcagctg ctgcctgagg ctggacgacc tcgcggagtt ctaccggcag tgcaaatccg
3181 tcggcatcca ggaaaccagc agcggctatc cgcgcatcca tgcccccgaa ctgcaggagt
3241 ggggaggcac gatggccgct ttggtcgatc gatgataaac ataaaaccaa ataaacaaca
3301 agcaaatgtg ttttaaaaat ctaacttctg agcgagtatt tattgggggg aataaacaat
3361 ctatgaatcg gattctttgc gcagcagctg ctcaatggcc tccaccgtgg acactccgtt
3421 ggttatcatt attatcttgt ttcgcgatcg agatcccttg tccaaagaaa cgtcgctctt
3481 tcgaagacct agaactttcg acagaaactt gaccagttcg gcgttagctt ctccctcgct
3541 gggcggagcg gcgatttgga cgcccactcc ttcaaagcca attcctgtga ttccgttctg
3601 cttagccccc ccggaattgg gtacccccac cgcggtggcg gccgctctag aactagtgga
3661 tcccccgggc tgcaggaatt cgatatcaag cttatcgata ccgtcgacct cgaggggggg
3721 cccggta
//
LOCUS ECOARGD 1221 bp ds-DNA BCT 15-AUG-1990
DEFINITION E.coli acetylornithine aminotransferase (argD) gene, complete cds.
ACCESSION M32796
KEYWORDS acetylornithine aminotransferase.
SOURCE E.coli (K12) DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1221)
AUTHORS Heimberg,H., Boyen,A., Crabeel,M. and Glansdorff,N.
TITLE Escherichia coli and Saccharomyces cerevisiae acetylornithine
aminotransferases: Evolutionary relationship with ornithine
aminotransferases
JOURNAL Gene 90, 69-78 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by A.H.T.Boyen 13-MAR-1990, for release after publication.
FEATURES from to/span description
pept 1 1221 acetylornithine aminotransferase (argD) (EC
2.6.1.11)
BASE COUNT 261 a 290 c 376 g 294 t
ORIGIN 73 minutes.
1 atggcaattg aacaaacagc aattacacgc gcgactttcg atgaagtgat cctgccgatt
61 tatgctccgg cagagtttat tccggtaaaa ggtcagggca gccgaatctg ggatcagcaa
121 ggcaaggagt atgtcgattt cgcgggtggc attgcagtta cggcgttggg ccattgccat
181 cctgcgctgg tgaacgcgtt aaaaacccag ggcgaaactc tgtggcatat cagtaacgtt
241 ttcaccaatg aaccggcgct gcgtcttggg cgtaaactga ttgaggcaac gtttgccgaa
301 cgcgtggtgt ttatgaactc cggcacggaa gctaacgaaa ccgcctttaa actggcacgc
361 cattacgcct gtgtgcgtca tagcccgttc aaaaccaaaa ttattgcctt ccataacgct
421 tttcatggtc gctcgctgtt taccgtttcg gtgggtgggc agccaaaata ttccgacggc
481 tttgggccga aaccggcaga catcatccac gttcccttta acgatctcca tgcagtgaaa
541 gcggtgatgg atgatcacac ctgtgcggtg gtggttgagc cgatccaggg cgagggcggt
601 gtgacggcag cgacgccaga gtttttgcag ggcttgcgcg agctgtgcga tcaacatcag
661 gcattattgg tgtttgatga agtgcagtgc gggatggggc ggaccggcga tttgtttgct
721 tacatgcact acgcgttagc gccggatatt ctgacctctg cgaaagcgtt aggcggcggc
781 ttcccgatta gcgccatgct gaccacggcg gaaattgctt ctgcgtttca tcctggttct
841 cacggttcca cctacggcgg taatcctctg gcctgtgcag tagcgggggc ggcgtttgat
901 atcatcaata cccctgaagt gctggaaggc attcaggcga aacgccagcg ttttgttgac
961 catctgcaga agatcgatca gcagtacgat gtatttagcg atattcgcgg tatggggctg
1021 ttgattggcg cagagctgaa accacagtac aaaggtcggg cgcgtgattt cctgtatgcg
1081 ggcgcagagg ctggcgtaat ggtgctgaat gccggaccgg atgtgatgcg ttttgcaccg
1141 tcgctggtgg tggaagatgc ggatatcgat gaagggatgc aacgtttcgc ccacgcggtg
1201 gcgaaggtgg ttggggcgta a
//
LOCUS YSCARG8 1272 bp ds-DNA PLN 15-AUG-1990
DEFINITION S.cerevisiae acetylornithine aminotransferase (ARG8) gene, complete
cds.
ACCESSION M32795
KEYWORDS acetylornithine aminotransferase.
SOURCE S.cerevisiae FL100 DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1272)
AUTHORS Heimberg,H., Boyen,A., Crabeel,M. and Glansdorff,N.
TITLE Escherichia coli and Saccharomyces cerevisiae acetylornithine
aminotransferases: Evolutionary relationship with ornithine
aminotransferases
JOURNAL Gene 90, 69-78 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer readable sequence for [1] kindly submitted
by A.H.T.Boyen 13-MAR-1990, for release after publication.
FEATURES from to/span description
pept 1 1272 acetylornithine aminotransferase (ARG8) (EC
2.6.1.11)
BASE COUNT 404 a 230 c 283 g 355 t
ORIGIN
1 atgtttaaaa gatatttatc cagtacgtca tcaagaagat ttacaagcat tttagaggaa
61 aaggcctttc aagtgaccac ttactctaga cctgaagatc tatgtataac tagaggtaaa
121 aatgcaaagc tgtatgatga cgtgaatggt aaagaatata tcgatttcac cgcaggtatt
181 gcggtgaccg cattaggcca tgcaaatcct aaagtggcag aaattctgca ccatcaggct
241 aacaaactgg ttcattcctc caacctttac ttcactaagg aatgtttgga tttaagtgaa
301 aagattgttg aaaagaccaa gcaattcggt ggtcaacacg acgcctcaag agtattttta
361 tgtaattctg gtacggaagc aaatgaagct gctttgaagt ttgcaaagaa acatggtata
421 atgaaaaatc ctagcaagca aggcattgtt gcatttgaga actcttttca tggccgtact
481 atgggcgctt tatctgtcac ttggaatagt aaatatagaa ctccttttgg ggatttggtt
541 ccccatgtct cattcttaaa tttgaatgac gaaatgacca aactacaaag ttatatcgag
601 accaaaaagg acgagattgc tggtttaatt gtcgagccca tacaaggtga aggtggggtt
661 tttcccgtag aagttgaaaa gctaaccgga ttgaagaaaa tatgtcaaga taatgatgtg
721 attgtcattc atgatgaaat tcaatgcggt ttgggccgtt caggtaaact atgggctcat
781 gcttatttac caagtgaggc tcatccggat atttttacat ctgccaaagc attgggaaat
841 ggcttcccca tcgctgccac catcgtcaat gaaaaagtta ataatgcttt gagagttggt
901 gaccacggca ccacgtatgg tggtaatccg ctggcctgtt ctgtaagcaa ctatgttttg
961 gataccatag cagacgaagc ttttttgaaa caagtctcta agaagagtga tatcttacaa
1021 aagcgcttgc gcgaaattca agccaaatat ccaaatcaaa taaagactat cagaggaaaa
1081 ggtttgatgc ttggtgctga gttcgtcgaa ccacccaccg aggtcatcaa aaaggccaga
1141 gaattgggac ttttgatcat taccgctggt aagagtaccg ttagatttgt tcccgcatta
1201 acgattgaag acgaactaat cgaagaaggg atggatgctt ttgaaaaggc tattgaagcg
1261 gtttacgctt aa
//
LOCUS MZEMTMINI 1445 bp ds-DNA ORG 15-AUG-1990
DEFINITION Maize mitochondrion 1.4 kb minicircle DNA open reading frame.
ACCESSION M36398
KEYWORDS .
SOURCE Maize mitochondrion 1.4 kb minicircle DNA.
ORGANISM Mitochondrion Zea mays
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE 1 (bases 1 to 1445)
AUTHORS Smith,A.G. and Pring,D.R.
TITLE Nucleotide sequence and molecular characterization of a maize
mitochondrial plasmid-like DNA
JOURNAL Curr. Genet. 12, 617-623 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 120 353 ORF 1
pept 1240 1356 ORF 2
pept 1245 1403 ORF 3
BASE COUNT 375 a 327 c 320 g 423 t
ORIGIN
1 gaattccttc ctttggtcgg actactcttt ttaggttatt gccttcggtc aaccctaaat
61 aagttgattg tcaaattgcg ctgtaactgc attcagttga atatgcggat attttatcaa
121 tgaatctcga tatcctgttg ataaagattg gatttcttgc gattctgatc gttttatcaa
181 tccaaatcat cgatgaatat ttccataaag tgatctgtga tcctttagtc tcaatatcag
241 ttgtttcctg ccgggataac ttgggttatg ctagccacct acttctacaa acaggtgaga
301 tccacctggg tgggttcgaa tcccatctgc tagatgcgtg gtcatggaat tgaaacctct
361 atggctggcc caagggaacc ggtcttgtcg attgacctag cttaggaaga gcccagtgaa
421 cctatccaca agtcaacccc cagggataat ggaaaacctc attcgcccat tggcaaacac
481 ttaaatatga ggacattcct ctggcaagac aggttagaga cttgagagac taaagacaag
541 aaggcacagg ttgtagtttt cttccaaggc caaaagcccc gcatggtgga agaagctact
601 ggtaagtccg agggggggct taactgcgat agttgaccga cgcgacgcta taccggaaag
661 gccttcgggg tgttgaaagt atggaacttt tattctcgca tagcttggga aagggtatcc
721 ggtgaaactc cccttaaaag ggtttttccc ccgtaccccc ttttcccaaa aaatttttta
781 aaaaaagtgg atcagtgaac ctatctttat ctgattaaat cagtggttag gttcactact
841 atttatagat aacaacccta gccttggggg gacaccccct ccccccaatc ccccctgtct
901 ggttttgttt taaaccaagt ttgcagggcg agcttgtttt gttatttata attagttatt
961 tcatgtttga tccgagcttc gggataggga acctctcttg tcagaaaggc ttccctctcc
1021 cttggtctct tgaaacagga cttttattca ctcagctatg cttcccggaa atccggatta
1081 aagaataaag acttctatac ctttccggga agcagagcag agggaaacgg agccctcgcc
1141 ccggagggga atcaattctc tggtttatcg ttcttatgct gttgcggtta taacgatagg
1201 aattactaga taacatcctc taggaattac tagataacaa tggaatggtt gagcctacta
1261 tctcaagtgt tggaaggctc aacctacttg cttgtccctc tccactatcg ttccggtctt
1321 accttccctc gagtccgatc tcgggaaggc gcttaggcag gggccccaag actaagcagg
1381 taatacaata cctatattta tagagggctt ttacctcgat aaatgagggc gcttcctata
1441 atgtg
//
LOCUS NGOTEM1A 1199 bp ds-DNA BCT 15-AUG-1990
DEFINITION N.gonorrhoeae plasmid pFA7 beta-lactamase (TEM-1) gene, 3' end.
ACCESSION M36543
KEYWORDS beta-lactamase.
SOURCE N.gonorrhoeae plasmid pFA7 DNA.
ORGANISM Neisseria gonorrhoeae
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE 1 (bases 1 to 1199)
AUTHORS Sanchez-Pescador,R., Stempien,M.S. and Urdea,M.S.
TITLE Rapid chemiluminescent nucleic acid assays for detection of TEM-1
beta-lactamase-mediated penicillin resistance in Neisseria
gonorrhoeae and other bacteria
JOURNAL J. Clin. Microbiol. 26, 1934-1938 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 21 beta-lactamase (TEM-1) (AA at 1)
BASE COUNT 344 a 237 c 178 g 440 t
ORIGIN
1 tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat
61 ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg
121 accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccta tctataaact
181 cttggcttgg ttctaatccc tctaaacgat tattatcaat agccgctcta accgcttttt
241 ctcggcttaa tttttctgtc tctgttataa aattgcttat tcattcttgt tcttctttca
301 aaaaaaagtt aagtaaaata cctacctaaa tttttactag ttcgcaatct acgagcttat
361 aacctcgttt tttcaattca tttaaaaaat cagattttga gcctaatttg atctattgct
421 atcgttaccc gctagaaata cccagtaatt acgcaaatct tcattggtaa ctttcgtaat
481 atctgtgtaa tgatcttcga gtatttttaa gcaatctcta gcccataaac cgtactcgtg
541 attgctcatc ttagggtttt gcttatcgag tttgacgaac ttcccatact tgtttttatg
601 tggaaatact ggccgtttgc aacttcttca attttttgag ctgttcgttt tttactacca
661 atcacaaaat ttaaagagtg aatagtacgc ccacgcttga tttgttcaac ctcaacgact
721 aaatcagatt tctcgttaat ctcagttatt gcaggttcca aaacacgttg atttaatgaa
781 ttaaatctag gtattattca acctgaagcc attctttagt tttctactgt aatttcacga
841 ctaccaacag agcgatattg tgtaattagc tcataaattc gaattgaatg tacactgttg
901 aaataagcga tatgtttgag ttgatattgc gtgaattgcc ctttaagttg cgttaggtat
961 ggcataactt catcagtcat tgcaattcta aaacgcccct ctttctgaaa tatgttctag
1021 aggaaaccca acgaaattca gttacacggt ctttatcttc agttttaaca cttcggtcat
1081 aaatccgttt tatagccgcc tgaatttgct tataggcgtt atcttggctt atttctggaa
1141 actcacggac aaaatcagcc accgtaaaat caaaaatttt ttgattagat ttcggatcc
//
LOCUS FLANAX 1461 bp ss-RNA VRL 15-AUG-1990
DEFINITION Influenza A/Chile/1/83 (H1N1), neuraminidase (seg 6), cDNA to mRNA.
ACCESSION M24783 M33023
KEYWORDS neuraminidase.
SOURCE Influenza virus type A, cDNA to viral RNA.
ORGANISM Influenza virus type A
Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses;
Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE 1 (bases 1 to 1461)
AUTHORS Schreier,E., Roeske,H., Driesel,G., Kuenkel,U., Petzold,D.R.,
Berlinghoff,R. and Michel,S.
TITLE Complete nucleotide sequence of the neuraminidase gene of the human
influenza virus A/Chile/1/83 (H1N1)
JOURNAL Arch. Virol. 99, 271-276 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 21 1433 neuraminidase
BASE COUNT 466 a 263 c 343 g 388 t 1 others
ORIGIN
1 agcaaaagca ggagtttaaa atgaatccaa atcagaaaat aataaccatt ggatcaatct
61 gtatgacaat cggaataatt agtctaatat tgcaaatagg aaatattatt tcaatatggg
121 ttagccactc aatccaaact ggaagtcaaa accacactgg aatatgcaac caaagaatca
181 ttacttatga aaatagcacc tgggtaaatc aaacatatgt caatattaac aacactaacg
241 ttgttgctgg aaaggacaca acttcagtga cattagccgg caattcatct ctttgtccta
301 tccgtgggtg ggctatatac agcaaagaca acagcataag aattggttcc aaaggagatg
361 tttttgtcat aagagaacct tttatatcat gttctcactt ggaatgcaga accttttttc
421 tgacccaagg tgctctatta aatgacaagc attcaaatgg gaccgttaag gacagaagcc
481 cttatagggc cttaatgagc tgtcctatag gtgaagctcc gtctccatac aattcaaggt
541 ttgaatcagt tgcttggtca gcaagcgcat gtcatgatgg catgggctgg ctaacaatcg
601 gaatttctgg tccagatgat ggagcagtgg ctgtactaaa atacaacggc ataataactg
661 aaaccataaa aagttggagg aagcgaatat taagaacaca agagtctgaa tgtgtctgtg
721 taaacggttc atgttttacc ataatgaccg atggcccgag taatggacct gcctcgtaca
781 gaatcttcaa aatcgagaag gggaagatta ctaaatcaat adagttggat gcacccaatt
841 ctcattacga ggaatgttcc tgttacccag acaccggcac agtgatgtgt gtgtgcagag
901 acaattggca tggttcgaat cgaccttggg tgtcttttaa tcaaaacctg gattatcaaa
961 taggatacat ctgcagtggg gttttcggtg acaatccgcg tcccaaagat ggaaaaggca
1021 gctgtgatcc agtaactgtt gatggagcag acggagtaaa ggggttttca tacaggtatg
1081 gtaatggtgt ttggatagga aggactaaaa gtaacagctc cagaaaggga tttgagatga
1141 tttgggatcc taatggatgg acagataccg atagtaattt cttagtgaaa caggatgtag
1201 tggcaatgac tgattggtca gggtacagcg gaagtttcgt tcaacatcct gagctaacag
1261 gattggactg tatgaggcct tgcttctggg ttgaattaat cagaggacga cctagagaaa
1321 agacaacaat ctggactagt gggagcagca tttctttttg tggcgtgaat agtgatactg
1381 caaattggtc ttggccagac ggtgccgagt tgccattcac cattgacaag tagtccgttg
1441 aaaaaactcc ttgtttctac t
//
LOCUS YSPURA4 1764 bp ds-DNA PLN 15-AUG-1990
DEFINITION S.pombe orotidine-5'-phosphate decarboxylase (ura4) gene.
ACCESSION M36504
KEYWORDS orotidine-5'-phosphate decarboxylase.
SOURCE S.pombe DNA.
ORGANISM Schizosaccharomyces pombe
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1764)
AUTHORS Grimm,C., Kohli,J., Murray,J. and Maundrell,K.
TITLE Genetic engineering of Schizosaccharomyces pombe: A system for gene
disruption and replacement using the ura4 gene as a selectable
marker
JOURNAL Mol. Gen. Genet. 215, 81-86 (1988)
STANDARD simple staff_entry
FEATURES from to/span description
pept 534 1328 orotidine-5'-phosphate decarboxylase (ura4)
mRNA 484 > 1328 ura4 mRNA
BASE COUNT 550 a 274 c 349 g 591 t
ORIGIN
1 aagcttagct acaaatccca ctggctatat gtatgcattt gtgttaaaaa agtttgtata
61 gattatttaa tctactcagc attctttctc taaataggaa tttgttactt aatggagaaa
121 aaaatgtttc gatttaccta gtgtatttgt ttgtatactc acgtttaatt tcaaacatcc
181 attctatctt gtgtaatttt tggcatggtg aaaaagataa tcagccttat aatctttaca
241 aaagtaagaa attctgtaaa taagccttaa tgcccttgct ttaaattaaa atggttcttt
301 ttcatgataa tgtttgcact ttgtgaatat attttagata gttctgtgag gtataattaa
361 gatgttttag agacttatac aattttgtct ttataaattc ttaattgatt ttaccatccc
421 agtttaacta tgcttcgtcg gcatctctgc acatgtcgtg ttttcttacc gtattgtcct
481 accaagaacc tcttttttgc ttggatcgaa attaaaggtt taaaagcaaa gttatggatg
541 ctagagtatt tcaaagctat tcagctagag ctgaggggat gaaaaatccc attgccaagg
601 aattgttggc tttgatggaa gaaaagcaaa gcaacttgtc agtcgcggtc gatttgacga
661 agaaatccga aatcttagaa ttggtagata aaattggacc ctatgtctgt gttatcaaga
721 cacatattga cgttgtcgag gatttcgacc aggatatggt agaaaaactg gtggccttag
781 gtaaaaagca tcgttttctt atctttgagg atcgcaaatt cgcagacatt ggaaataccg
841 tcaagctaca atatgcatct ggtgtgtaca aaattgcttc ttgggctcat atcacaaatt
901 gccatacagt gccaggcgag ggtattatac aaggcctcaa agaagttggt ttacctttgg
961 gacgtggtct cttgcttttg gctgaaatgt cttccaaagg ctctttggct actggttcct
1021 acacagagaa aaccttagaa tggtttgaga agcataccga tttttgcttt ggctttatag
1081 ctggtcgtcg atttcctaac cttcaaagcg actacataac tatgtcccct ggtatcggct
1141 tggatgttaa aggagacggg ctgggacagc aatatcgtac tcctgaagaa gtgattgtaa
1201 actgcggtag cgatatcatc attgttggtc gtggagtcta tggagctggt cgtaatcctg
1261 ttgtcgaagc caagagatat agagaagctg gttggaaggc atatcagcaa agactttctc
1321 agcattaaaa aaagactaat gtaaaatttt tttggttggt tattgaaaaa gtcgatgcct
1381 tgtttgcgtt tgttttccta ggcgttttat gtcagaaggc atttagaatt agtatacaag
1441 tactctttgg taaaatttta tgtagcgact aaaatattaa ctattataga taaacacctt
1501 gggaataaaa agtaatttgc tatagtaatt tattaaacat gctcctacaa cattaccaca
1561 atcttttctc ttggattgac attgaataag aaaagagtga atttttttag acttgtaatg
1621 ataactatgt acaaagccaa tgaaagatgt atgtagatga atgtaaaata ccatgtagac
1681 aaacaagata aaacttggtt ataaacattg gtgttggaac agaataaatt agatgtcaaa
1741 aagtttcgtc aatatcacaa gctt
//
LOCUS BMEGDH1 2834 bp ds-DNA BCT 15-AUG-1990
DEFINITION B. megaterium glucose dehydrogenase gene and ORFs.
ACCESSION D90043
KEYWORDS glucose dehydrogenase.
SOURCE Bacillus megaterium (strain IAM1030) DNA.
ORGANISM Bacillus megaterium
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2834)
AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
Urabe,I. and Okada,H.
TITLE Active and silent isozyme genes of glucose dehydrogenase from
Bacillus megaterium IAM1030
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Toshihide
Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan.
FEATURES from to/span description
pept 1964 2749 glucose dehydrogenase (EC 1.1.1.47)
ORF 181 867 ORF1
ORF 1086 1946 ORF2
signal 89 94 put. -35 region for ORF1
signal 116 121 put. -10 region for ORF1
signal 167 172 ORF2 ribosome binding site
signal 893 928 termination signal
signal 972 977 put. -35 region for ORF2
signal 997 1003 put. -10 region for ORF2
signal 1070 1076 ORF2 ribosome binding site
signal 1949 1954 glucose dehydrogenase ribosome binding site
signal 2751 2783 termination signal for glucose dehydrogenase
BASE COUNT 896 a 442 c 644 g 852 t
ORIGIN
1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga
61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag
121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa
181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg
241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt
301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat
361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca
421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat
481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag
541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta
601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat
661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct
721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt
781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt
841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg
901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt
961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt
1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt
1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt
1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt
1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg
1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta
1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca
1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg
1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag
1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa
1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat
1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca
1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg
1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc
1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga
1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag
1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga
1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg
2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa
2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt
2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt
2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt
2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga
2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt
2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa
2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat
2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg
2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc
2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc
2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca
2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga
2821 ctggtgttga attc
//
LOCUS BMEGDH2 1202 bp ds-DNA BCT 15-AUG-1990
DEFINITION B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene.
ACCESSION D90044
KEYWORDS glucose dehydrogenase; isozyme.
SOURCE Bacillus megaterium (strain IAM1030) DNA.
ORGANISM Bacillus megaterium
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 1202)
AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
Urabe,I. and Okada,H.
TITLE Active and silent isozyme genes of glucose dehydrogenase from
Bacillus megaterium IAM1030
JOURNAL J. Bacteriol. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Toshihide
Mitamura Department of Fermentation Technology Osaka University 2-1
Yamada-oka Suita, Osaka 565 Japan Phone: 06-877-5111 x4373 Fax:
06-876-9036
FEATURES from to/span description
pept 125 910 glucose dehydrogenase
signal 27 32 put. -35 region
signal 49 55 put. -10 region
signal 111 116 SD sequence
signal 928 959 termination signal
BASE COUNT 427 a 190 c 268 g 317 t
ORIGIN
1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa
61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt
121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt
181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg
241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc
301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc
361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt
421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg
481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa
541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc
601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa
661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa
721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg
781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt
841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg
901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt
961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg
1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa
1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg
1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg
1201 ag
//
LOCUS BMOPTTHP1 1023 bp ss-mRNA INV 15-AUG-1990
DEFINITION B.mori PTTH mRNA.
ACCESSION D90082
KEYWORDS PTTH; preproPTTH.
SOURCE B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to
mRNA, clones P1, P2, C2, C9 and C19.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 1023)
AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone
JOURNAL Science 247, 1333-1335 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hironori
Ishizaki Department of Biology, School of Science, Nagoya
University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111
x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323
FEATURES from to/span description
pept 34 708 preproPTTH
matp 379 708 PTTH subunit
signal 768 773 polyadenylation signal
signal 826 831 polyadenylation signal
signal 997 1002 polyadenylation signal
signal 1004 1009 polyadenylation signal
BASE COUNT 358 a 179 c 186 g 300 t
ORIGIN
1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc
61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg
121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac
181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg
241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta
301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa
361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct
421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc
481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc
541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct
601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg
661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt
721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata
781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc
841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt
901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca
961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt
1021 aaa
//
LOCUS BMOPTTHP4 944 bp ss-mRNA INV 15-AUG-1990
DEFINITION B.mori preproPTTH mRNA.
ACCESSION D90083
KEYWORDS PTTH; preproPTTH.
SOURCE B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to
mRNA, clone P4 and C21.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 944)
AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone
JOURNAL Science 247, 1333-1335 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Hironori
Ishizaki Department of Biology, School of Science, Nagoya
University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111
x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323
FEATURES from to/span description
pept < 1 631 preproPTTH
matp 302 631 PTTH subunit
signal 691 696 polyadenylation signal
signal 749 754 polyadenylation signal
signal 918 923 polyadenylation signal
signal 925 930 polyadenylation signal
BASE COUNT 337 a 163 c 177 g 267 t
ORIGIN
1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt
61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag
121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga
181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa
241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag
301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa
361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg
421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag
481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga
541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag
601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt
661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa
721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag
781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt
841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata
901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa
//
LOCUS HUMMTSDHB 958 bp ss-mRNA ORG 15-AUG-1990
DEFINITION Human mitochondrial succinate-ubiquinone oxidoreductase
(EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA.
ACCESSION D90047
KEYWORDS Ip; complex II; iron sulfur subunit; sdh B;
succinate-ubiquinone oxidoreductase.
SOURCE Human liver mitochondrion, cDNA to mRNA.
ORGANISM Mitochondrion Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae;
Homo sapiens.
REFERENCE 1 (bases 1 to 958)
AUTHORS Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M.
TITLE Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning
of iron sulfur(Ip) subunit of liver mitochondria
JOURNAL Biochem. Biophys. Res. Commun. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Kiyoshi
Kita Department of Parasitology Juntendo University 2-1-1,Hongo
Bunkyo-ku, Tokyo 113 Japan Phone: 03-813-3111 x3542 Fax:
03-814-9300
FEATURES from to/span description
pept < 1 789 succinate-ubiquinone oxidoreductase (sdh B)
(AA at 1)
site 74 95 iron-sulfur binding site I
site 167 179 iron-sulfur binding site II
site 224 236 iron-sulfur binding site III
BASE COUNT 319 a 212 c 204 g 223 t
ORIGIN
1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt
61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg
121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag
181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt
241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac
301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat
361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag
421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag
481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc
541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg
601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc
661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat
721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct
781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt
841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata
901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS HUMNCAW 2287 bp ss-mRNA PRI 15-AUG-1990
DEFINITION Human nonspecific cross-reacting antigen (NCA-W272) mRNA.
ACCESSION D90064
KEYWORDS CEA; CEA gene family; PI-anchored membrane protein.
SOURCE Human white blood cells, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2287)
AUTHORS Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and
Matsuoka,Y.
TITLE Characterization of a cDNA clone encoding a new species of the
nonspecific cross-reacting antigen (NCA), a member of the CEA gene
family
JOURNAL Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Fumiko
Arakawa First Department of Biochemistry School of Medicine Fukuoka
University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone:
092-801-1011 x2892 Fax: 092-801-3600
FEATURES from to/span description
ORF 87 1136 nonspecific cross-reacting antigen ORF
BASE COUNT 618 a 593 c 453 g 623 t
ORIGIN
1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag
61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc
121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca
181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc
241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa
301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag
361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg
421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg
481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca
541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc
601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc
661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac
721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga
781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag
841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt
901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa
961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca
1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag
1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc
1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa
1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact
1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc
1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa
1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt
1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc
1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga
1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga
1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc
1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt
1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt
1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa
1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca
1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat
1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga
2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt
2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac
2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc
2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg
2281 gctgatc
//
LOCUS PIGDESTN 1666 bp ss-mRNA MAM 15-AUG-1990
DEFINITION Porcine destrin mRNA.
ACCESSION D90053 J05290
KEYWORDS actin-binding protein; cofilin; destrin.
SOURCE Pig adult brain, cDNA to mRNA, clone PD2.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 1666)
AUTHORS Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S.,
Iida,K. and Yahara,I.
TITLE Destrin, a mammalian actin-depolymerizing protein, is closely
related to cofilin: Cloning and expression of porcine brain destrin
cDNA
JOURNAL J. Biol. Chem. 265, 5768-5773 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Kenji
Moriyama Department of Biophysics and Biochemistry Faculty of
Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan
Phone: 03-821-2111 x4408
FEATURES from to/span description
pept 54 551 destrin
mRNA < 1 1666 destrin mRNA
signal 1645 1660 polyadenylation signal
BASE COUNT 492 a 322 c 372 g 480 t
ORIGIN
1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt
61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt
121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag
181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa
241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg
301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc
361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg
421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc
481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat
541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc
601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg
661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct
721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga
781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt
841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt
901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt
961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac
1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca
1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca
1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt
1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta
1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag
1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc
1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg
1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga
1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt
1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta
1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taactt
//
LOCUS RATPMP70X 3324 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA.
ACCESSION D90038 J05256
KEYWORDS PMP70; peroxisomal membrane protein.
SOURCE Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181,
189,201].
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3324)
AUTHORS Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T.
TITLE The 70-kDa Peroxisomal Membrane Protein Is a Member of the
Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily
JOURNAL J. Biol. Chem. (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Keiju
Kamijo Department of Biochemistry Shinshu University School of
Medicine Matsumoto 390 Japan Phone: 263-35-4600 x5182 Fax:
263-33-6458
FEATURES from to/span description
pept 36 2015 peroxisomal membrane protein (PMP70)
signal 3271 3276 polyadenylation signal (put.)
BASE COUNT 921 a 658 c 773 g 972 t
ORIGIN
1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga
61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc
121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg
181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac
241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta
301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac
361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact
421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg
481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag
541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc
601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta
661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg
721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa
781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta
841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa
901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct
961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg
1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca
1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc
1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg
1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta
1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg
1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa
1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg
1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat
1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc
1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa
1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg
1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg
1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac
1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca
1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc
1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa
1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt
2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta
2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca
2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat
2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg
2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga
2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct
2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt
2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat
2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt
2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt
2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg
2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac
2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg
2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg
2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca
2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc
3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg
3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa
3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt
3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat
3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag
3301 attaaaaaaa aaaaaaagga attc
//
LOCUS VACSANT 1525 bp ds-DNA VRL 15-AUG-1990
DEFINITION Vaccinia virus surface (S) antigen gene.
ACCESSION D90076
KEYWORDS S gene; surface antigen.
SOURCE Vaccinia virus DNA.
ORGANISM Vaccinia virus
Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE 1 (bases 1 to 1525)
AUTHORS Ueda,Y., Morikawa,S. and Matsuura,Y.
TITLE Identification and nucleotide sequence of the gene encoding a
surface antigen induced by Vaccinia virus
JOURNAL Virology 177, 588-594 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Yoshiaki
Ueda National Institute of Health Gakuen, Musashimurayama Tokyo
190-12 Japan Phone: 0425-61-0771 Fax: 0425-65-3315
FEATURES from to/span description
pept 382 1437 surface antigen S
BASE COUNT 568 a 229 c 258 g 470 t
ORIGIN
1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata
61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat
121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa
181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat
241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac
301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata
361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca
421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc
481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt
541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt
601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa
661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta
721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca
781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc
841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga
901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat
961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta
1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat
1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa
1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact
1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga
1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc
1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt
1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata
1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt
1501 gctatagaat gaatgaggat acgcg
//
LOCUS YSCA1 881 bp ds-DNA PLN 15-AUG-1990
DEFINITION S. cerevisiae acidic ribosomal protein A1 (YSCA1).
ACCESSION D90072 X13682
KEYWORDS acidic ribosomal protein; ribosomal protein.
SOURCE S. cerevisiae (strain IFO-40028) DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 881)
AUTHORS Mitsui,K. and Tsurugi,K.
TITLE Identification of A1 protein as the fourth member of 13 kDa-type
acidic ribosomal protein family in yeast Saccharomyces cerevisiae
JOURNAL Unpublished (1990)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 315; 631 to 881)
AUTHORS Mitsui,K. and Tsurugi,K.
TITLE Identification of A1 protein as the fourth member of 13 kDa-type
acidic ribosomal protein family in yeast Saccharomyces cerevisiae
JOURNAL Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989)
STANDARD full staff_entry
REFERENCE 3 (bases 277 to 742)
AUTHORS Tsurugi,K. and Mitsui,K.
TITLE cDNA and deduced amino acid sequence of acidic ribosomal protein A1
from Saccharomyces cerevisiae
JOURNAL Nucleic Acids Res. 16, 3574-3574 (1988)
STANDARD simple automatic
COMMENT These data kindly submitted in computer readable form by: Kazuhiro
Mitsui Department of Biochemistry Yamanashi Medical college Tamaho,
Nakakoma-gun Yamanashi 409-38 Japan Phone: 0552-73-1111 x2257
FEATURES from to/span description
pept 313 633 acidic ribosomal protein A1
signal 125 135 UASrpg box1
signal 182 192 UASrpg box2
signal 716 721 poly(A) signal
variant 303 303 a in [1]; g in [3]
variant 684 685 tt in [1]; t in [3]
variant 719 719 a in [1]; t in [3]
BASE COUNT 270 a 165 c 157 g 289 t
ORIGIN
1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt
61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca
121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa
181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca
241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa
301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac
361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca
421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac
481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct
541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc
601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg
661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa
721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg
781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc
841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c
//
LOCUS YSCCDC23X 3107 bp ds-DNA PLN 15-AUG-1990
DEFINITION S. cerevisiae CDC23 protein gene.
ACCESSION D90081
KEYWORDS CDC23 protein.
SOURCE Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line
D22) DNA, clone YX34.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 3107)
AUTHORS Doi,A. and Doi,K.
TITLE Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces
cerevisiae
JOURNAL Gene (1990) In press
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Kenji Doi
The Institute of Scientific and Industrial Research Osaka
University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111
Fax: 06-877-4977
FEATURES from to/span description
ORF 765 2645 ORF for CDC23
site 153 161 calcium-binding site
BASE COUNT 1010 a 595 c 629 g 873 t
ORIGIN
1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact
61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt
121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc
181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt
241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg
301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct
361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt
421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga
481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag
541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat
601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact
661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc
721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc
781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac
841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag
901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag
961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt
1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt
1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc
1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta
1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc
1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga
1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag
1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa
1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact
1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat
1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta
1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg
1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc
1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg
1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt
1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg
1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact
1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt
2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca
2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca
2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt
2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg
2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca
2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat
2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg
2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa
2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta
2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata
2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat
2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc
2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact
2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc
2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc
2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga
3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga
3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag
//
LOCUS BSPRSDA 2996 bp ds-DNA BCT 15-AUG-1990
DEFINITION Bacillus sp. raw-starch-digesting amylase gene.
ACCESSION D90112
KEYWORDS alpha amylase; raw-starch-digesting amylase.
SOURCE Bacillus sp.(strain B1018) DNA.
ORGANISM Bacillus sp.
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 2996)
AUTHORS Itokor,P., Tsukagoshi,N. and Udaka,S.
TITLE Nucleotide sequence of the raw-starch-digesting amylase gene from
Bacillus sp. B1018 and its strong homology to the cyclodextrin
glucanotransferase genes
JOURNAL Biochem. Biophys. Res. Commun. 166, 630-636 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Shigezo
Udaka Department of Food Science and Technology, Faculty of
Agriculture Nagoya University Furo-cho, Chikusa-ku Nagoya 464 Japan
Phone: 052-782-5111 x6356 Fax: 052-781-4447
FEATURES from to/span description
pept 313 2454 raw-starch-digesting amylase precursor (EC
3.2.1.1)
sigp 313 393 raw-starch-digesting amylase signal peptide
matp 394 2454 raw-starch-digesting amylase mature peptide
binding 302 306 ribosome binding site
signal 101 106 -35 region
signal 125 130 -10 region
rpt 2567 2580 inverted repeat
rpt 2585 2598 inverted repeat
BASE COUNT 764 a 852 c 774 g 606 t
ORIGIN 10 bp upstream of RsaI site.
1 ttatttgagt acattttatg tattcccaca ttgcgcccga tatctacgct tagaaaaaaa
61 tcgtcggaaa agcgccccaa aaaattttta ttgttattta ttgacagttg tattcgcttt
121 catctacaat gatggaggaa cgcaatactc gatataattt aagggccatg cattccgtga
181 ccgcacaccc ggtatggaac aaccccggta tctcgatgga gaagccgggg ttttttgtcg
241 ccctttttta ggaggtgatc cggcgacagc ggatcaagcc tggaattcaa ataattacat
301 aggaggtata acatgaagaa atttctgaaa atgacagccg cgttttccct gggattatcc
361 ctggcgttcg ggcttttcag ccccgcccag gccgcgccgg atacctcggt atccaacaag
421 caaaatttca gcaccgacgt catctatcaa attttcaccg acaggttttc ggacggcaat
481 cccgccaaca atccgaccgg cgcggcgttt gacggaacct gcacgaacct ccggctgtat
541 tgcggcggcg actggcaggg catcatcaac aaaatcaacg acggttacct gaccgggatg
601 ggcgttaccg ccatctggat ctcccagccg gtcgaaaaca tctacagcat catcaattat
661 tccggcgtca acaacacggc ctatcacggc tactgggccc gggacttcaa gaagacgaat
721 ccggcctacg gcacgattgc ggacttccag aacctgatcg ccgccgcgca tgccaaaaac
781 atcaaagtca ttatcgactt cgccccgaac catacgtcgc ccgcctcgtc cgaccagcct
841 tcctttgcgg aaaacggccg gctgtacgat aacggcacgc tgctcggggg atacacgaac
901 gatacgcaga acctgttcca ccataacggc ggcacggact tttccacgac cgaaaacggc
961 atctacaaaa acctgtacga tctcgccgac ctgaaccata acaacagcac gtcggacgtc
1021 tacttgaagg acgcgatcaa aatgtggctg gatctcggca tcgacggcat ccgcatggat
1081 gcggtgaagc atatgccgtt cggctggcag aagagcttta tggctgccgt caacaactat
1141 aagccggtct ttaccttcgg cgaatggttc ctgggcgtaa acgaagtagg cccggaaaac
1201 cataagtttg ccaacgaatc cggcatgagc ctgcttgatt tccgttttgc ccaaaaggtg
1261 cggcaggtgt tccgggacaa caccgacaat atgtacggcc tgaaggcgat gctggagggc
1321 tccgcagccg attacgccca ggtggatgac caggtgacgt tcatcgacaa ccatgacatg
1381 gagcgtttcc acgcaagcaa tgcaaaccgc cggaagctgg agcaagcgct ggcgttcacg
1441 ctgatcctcg cgcgcgtccc cgccatttat tacggcaccg agcagtacat gtcgggtggg
1501 accgatccgg acaaccgggc gcggatccct tccttctcca cgtcgacgac cgcctatcaa
1561 gtcattcaaa agctggcgcc gctgcgcaag tccaacccgg ccatcgccta cggatcgacg
1621 caggagcgct ggatcaacaa cgacgtgctc atttatgagc gcaaattcgg cagcaacgtt
1681 gccgtcgttg ccgtcaaccg caatttgaac gcgccggctt ccatttcggg acttgtcact
1741 tccctgccgc aaggcagcta caatgacgtc cttggcggcc ttctgaacgg caacacgtta
1801 acggtaggct ccggcggagc cgcctccaat ttcacgcttg cggccggcgg cacggcggtg
1861 tggcagtaca ccgcggcaac ggcgacgccg accatcgggc atgtcgggcc gatgatggcc
1921 aagccgggcg tgacgatcac gatcgacggc cgcggcttcg gctctagcaa aggcaccgtc
1981 tacttcggca cgacggcggt gagcggcgcc aacatcacgt cttgggaaga cacgcagatc
2041 aaagtgaaaa ttccggccgt cgcaggcggc atctacaaca ttaaagtcgc aaacgccgcc
2101 ggaacggcaa gcaacgtgta cgacaacttc gaggtattgt ccggagacca ggtcagcgtc
2161 cgcttcgtgg tcaacaacgc gacaacggcc cttgggcaaa atctctacct gacgggcaat
2221 gtcagcgagc tggggaactg ggacccggca aaagcgatcg ggccgatgta caaccaggtc
2281 gtttaccaat atccgaactg gtattatgac gtcagcgttc cggccggcaa aacgatcgag
2341 ttcaagtttt tgaaaaaaca aggctccacc gtcacgtggg aaggcggcag caaccacacc
2401 ttcaccgcgc cgtccagcgg caccgcgacc attaacgtga attggcagcc ataaggcgtg
2461 agggataggc ggctggcatt cattggaaaa ggcggactat atgacgtccg ttccgtgagc
2521 aacgctcatc gctccgttca aaccgccaca aggctgatct tcagccaaaa aaagagggga
2581 cctttcccct ctttttttat ttccgttgac taacggtatt cccaaaaatt acattggggg
2641 ataagctccc tcccctctaa tagcaataac aagagcgtaa acccaaccag gtgatccata
2701 gcgtgcggtc gcctttaatc ccggtatcaa aatgtatcct accttacaaa aatgatcgga
2761 tcatacaaaa tagtgcgtac tactcaacga aatagaacct acatacagaa cgatcgatcc
2821 agatttcaac gaacggcacg gtcgtttaaa aaaatggtgt gcggggtgcg agaatatgca
2881 agaatatcaa ctgactttga aagataagcg gatcgtatgg gggaaggcga tcgaccttga
2941 gcctctcatt ggcaaatatc ctggcgactc gattagacag ggcatgaacg aagctt
//
LOCUS HUMALPL 3101 bp ds-DNA PRI 15-AUG-1990
DEFINITION Human alkaline phosphatase (EC 3.1.3.1) gene.
ACCESSION D90054
KEYWORDS alkaline phosphatase.
SOURCE Human liver DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3101)
AUTHORS Matsuura,S., Kishi,F. and Kajii,T.
TITLE Characterization of a 5'-flanking region of the human liver/bone/
kidney alkaline phosphatase gene: Two kinds of mRNA from a single
gene
JOURNAL Biochem. Biophys. Res. Commun. 168, 993-1000 (1990)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Fumio
Kishi Department of Pediatrics Yamaguchi University School of
Medicine Ube, Yamaguchi 755 Japan Phone: 0836-22-2258 Fax:
0836-22-2696
FEATURES from to/span description
pre-msg 2130 3101 alkaline phosphatase mRNA and intron
IVS 2341 3101 alkaline phosphatase intron
rpt 375 664 Alu sequence
rpt 2631 2926 Alu sequence
BASE COUNT 775 a 752 c 692 g 882 t
ORIGIN chromosome 1; map position p34-36.1.
1 aagctttctc cagcgagtat gatggtttct gcaggttctt ggcataaagc ctttatcaga
61 ttaaggaaat tcttttcaat acctggtttg ctgagggctt ctgtcacatc gttttctgtg
121 accccattcc ctctccctag gtgagcacgt caagtttgat cagggtgtta aactgccacc
181 cctgtgccta tgattcccaa atttatactc taacccagac ttctttttca aatgccagag
241 ccaaatattc agctgcctcc ttagtgtctc cacttctaaa agacatctcc aactcaacat
301 atccaaaaac aagttcctga ttgtctccac ctcatgcctc aaaagaccac cccaaacgcc
361 gaaaggctga atgctttttt ctttttcttt tttttttttt tctgagatgg agtctcactc
421 tgttgcccag gctggactgc agtgatgcga tctcagctca ctgcaaactc tgcttcctgg
481 gttcaagtga ttctcctacc tcagcctctc aggtagctgg gactacaggt gcacaccacc
541 atgcccagct aatttttgta gagagagttt caccatgttg gccaggctgg tctcaaacac
601 ctgaccttaa gggatccacc cgcctcagcc tctcaaagtg ctgggattac aggtgtgagc
661 catcgcactt ggctcggtag tatatggctc agaaacattg ccatttacaa tagttcccca
721 aaaagcaaaa ttcttaggta taaatctgga ttcagagtcc agaatgctaa ccattacacg
781 atggaacccg taggtataaa tctaagaaaa catatccaag atctacaggc tgaagactac
841 agagtgctga taaaaccgaa gaactctgac tgaatgagtg gagagacgtg gtgtcttcat
901 gactgggcaa ctccatgtgg tatagacgta aaccctccca cattgatctg tggatttaat
961 accataccta tcaaaaacac agtggtggag gacagatcag ggatcgccag gtttagggat
1021 ggggggattg tgtaactata aagaacgcaa gagagatttt tggggtggca gagctgttct
1081 gggtcctgac ggtggcggtg gtggttacat aaatctatcc atgtgtcaaa cgtcagaaca
1141 ctcattttac acttgggggc aacagaaatc cctccctctg gagggggtga ctgatggtaa
1201 cctgattgct aattctggaa tcaggagccc tgtggtcagg tttctgctct gcaacttcct
1261 gttggtaacc ttgggcaagt ctccgtccag agccttggtt ttctcatctg taaaaggaga
1321 tgataggtcc ttttctgtcc actgcatagc tgattagtga aacatcatgg tgaaattctt
1381 tatgaactat ggagtgcagc acatagactt gctttcattt tgtcagtatc ctttatagat
1441 tgttcatgta agctcccaaa gagtagtatt tattttattg aaataaaatg cacgtagaga
1501 aaaatgtgtg tatcatacat tgacagctga acccaccgtg taaccagcac ccacccaccc
1561 agatcaatca taaaccgaac cgcaccagca ccccagcagc ccgttcccgt ttccgtaccc
1621 tccacgtgga gcctccgttc tgtctcccaa cgccctgggt tagtttttat actttctgtc
1681 atcggaatca cactgtaagt gctcttgggt ttagcttcct ttgctcaagc ttaccttgtg
1741 cgattcattc atgttgttgt gaggagctgt ggatcatcca ttctccttgc tgtctgtggt
1801 ggtttctgtg ttgtgaacac acacaatgta ttatccagcc tgccgtagat ggaggcagtt
1861 ttgaagccat tataaacagg gctgatgtgc acattctgct ggagagaaac gggtcccagg
1921 gtacaggtag gatgatcagc ttcggtagat cctgccggtt ttcccatgcg ctgtgcctgt
1981 ctgcactcca ccaacggcga gcggaccttc cggtagttaa acatcttcac gaactcttgg
2041 actttcctgc acacacagag aagataattt tggatggctc ttcccttccc cccacaacct
2101 tccttagggc actggctttc aactgatgta aatatttact atgccaagca ctaggagggc
2161 agagacaaac aagacaaagt cctcacactt agaaactccc ggtgtggcag ctgagatggc
2221 ccaggaaaga actatattac cttcaaaaag agaggtacat gcgatgtttg aggtggcatg
2281 aagctcagtg gtgttatatt ggaatgagtg agtgaccatc ctggagcctt cctgaaagag
2341 gtgacttcat ttttaagtga ttttaaataa tagtttaatg aattagtatt tcgtattcag
2401 ttaataacat ttttctgatt ttaggatttg ctatagaaat atttggaaac cgtaaagtag
2461 aacaaaaaaa aaatgtagga atcatctgaa attccaaatt ctaccactca cagttaagtg
2521 ttgttagatg ttagatgtgg gatattgcct tttaatttcc actctgcgcc gctaccccca
2581 gcccctaccc cagagccgtc acttctggca ctggagcgca gcttgcgtgg tttttttttt
2641 tttttttttt tttttgagac agagtcctgc ctgtcgccca ggctggagtg cagtggcgcg
2701 atctcggctc actgcaactc cccctcccgg gttcacgcca ttctcctgcc tcagcctcct
2761 gagtagctgg gactacaggc gcccgctacc tctcccggct aattttttgt atttttagta
2821 gagacggggt ttcactgtgt tagccaggat ggtctcgatc tcctgacttc gtgatccgcc
2881 cgcctcggcc tcccaaagtg ggcagatcac ctgaggtaga gagttcgaga ccagacctga
2941 ccaacatgga ccccatctct actaaaaata caaaattggc cagggcatgg tggcgcaagc
3001 tgctaatccc agccactcag ggaggctgag gctggaaaat tgcttgaacc cgacctgcag
3061 gcatgcaagc ttggcgtaat catggtcata gctgttttcc t
//
LOCUS RATCNRAA 2337 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Rat calcineurin A alpha mRNA, complete cds.
ACCESSION D90035
KEYWORDS calcineurin; calcineurin A alpha; calmodulin binding protein;
calmodulin-dependent protein phosphatase; isoform.
SOURCE Rat brain, cDNA to mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2337)
AUTHORS Ito,A., Hashimoto,T., Hirai,M., Takeda,T., Shuntoh,H., Kuno,T. and
Tanaka,C.
TITLE The Complete Primary Structure of Calcineurin A, a Calmodulin
Binding Protein Homologous with Protein Phosphatases 1 and 2A
JOURNAL Biochem. Biophys. Res. Commun. 163, 1492-1497 (1989)
STANDARD full staff_entry
COMMENT These data kindly submitted in computer readable form by: Takayoshi
Kuno Department of Pharmacology Kobe University School of Medicine
7-5-1 Kusunoki-cho, Chuo-ku Kobe 650 Japan Phone: 078-341-7451
x3273 Fax: 078-351-6531 Peptides, 78-329 and 391-414, seem to be
putative catalytic domain and calmodulin binding domain,
respectively.
FEATURES from to/span description
ORF 208 1773 calcineurin A alpha
signal 1944 1950 polyadenylation signal
BASE COUNT 649 a 523 c 596 g 569 t
ORIGIN
1 cgggaggagg agtgaaggcg gcggcggcgg aggagggacg cgcggagccg gcagtaactt
61 tcgagccagc ccagagcccg gagctccagc cgagcggttt gcagcgcggc ggcgcggcgc
121 tgagtgtctg gcccgccggt gcggtcgggg tgtgcagtcg gacgggacca gcagcgcgtc
181 gctgtccccc cctcccggtg actggagatg tccgagccca aggcgattga tcccaagttg
241 tcgactacgg acagggtggt gaaagccgtt ccatttccgc caagtcaccg gctgacagca
301 aaggaagtgt ttgataacga tgggaagcct cgtgtggata tcttaaaagc acatctcatg
361 aaggaaggca ggctggaaga aagtgtcgcg ttgagaataa taacagaggg tgcttcgatt
421 ctccgacagg aaaaaaactt gctggatatt gatgccccag tcacagtttg cggggacatc
481 catggacaat tctttgactt gatgaagctc tttgaagtgg gaggatctcc tgccaacact
541 cgctacctct tcttagggga ctatgttgac agagggtact tcagtatcga atgtgtgctg
601 tatttgtggg ccttgaaaat tctttacccc aaaacactgt ttttacttcg tggaaaccat
661 gaatgtaggc acctaacaga gtatttcacg tttaaacaag aatgtaaaat aaagtattca
721 gaacgcgttt atgacgcctg tatggatgcc ttcgactgcc ttcccctggc tgcgctgatg
781 aaccaacaat tcctgtgtgt acacggtggt ttgtctccag agattaacac tctagatgac
841 atcagaaaat tagaccgatt caaagaacca cctgcttatg ggcctatgtg tgacatcttg
901 tggtcagacc ccctggagga ctttggaaat gagaagactc aggaacattt cactcacaac
961 acagtcaggg gttgttcgta cttctacagt tacccggctg tatgtgactt cctgcagcac
1021 aataatttgt tgtccatact ccgagcccac gaagcccagg acgcagggta ccgcatgtac
1081 aggaaaagcc aaacaactgg cttcccgtct ctaattacga tcttctcggc accaaattac
1141 ttagatgtgt acaataataa agctgcagtg ttgaagtacg agaacaacgt gatgaacatc
1201 aggcagttca actgctcccc ccatccgtac tggctcccaa atttcatgga tgttttcacc
1261 tggtcgctgc catttgttgg ggagaaagtg actgagatgc tggtaaacgt cctgaacatc
1321 tgctcagatg atgaactggg gtcagaagaa gatggatttg acggagccac ggctgcagcc
1381 cggaaggagg tcatcaggaa caagatccga gcaataggca aaatggccag agtattctca
1441 gttctcagag aagagagtga gagcgttcta actctgaagg gcctgacccc gactggcatg
1501 ctccccagcg gagtgctctc tggcgggaaa caaactctgc aaagcgctac tgttgaggcc
1561 attgaggctg atgaagccat caaaggattc tcaccacaac ataagattac cagcttcgag
1621 gaggccaagg gcttagaccg aattaacgag aggatgccgc ctcgcagaga cgccatgcct
1681 tccgacgcca accttaactc catcaacaag gctctcgcct cagagactaa cggcacagac
1741 agcaacggca gtaatagcag caatattcag tgaccacttc ctgttcactt tttttttttg
1801 agctgcaggg catgatgggt ttgctgcatc tcagcagttg gatgttcttg cctctgacgg
1861 tagcttgttt gctctggggg ggccaggaat tggattcagt ttacactatc atgaaaaaaa
1921 aaaagaggga gagagagaga gataataaaa ctatattttg gtgagggtgg tgattaaaca
1981 cctcttttgg gtatgccttt aaaaatgctt ctaggaaaaa aaaagtttta aaaagaaagc
2041 taatgctagt ctatacttca atgttagggg aatgaacacg ttttcctagc gcactgggga
2101 cttttagata ggttaatgaa aggcctttta ttctgttact ggacacgaaa actttgtcta
2161 atttcttata ctctattgta cgtttacagt cgcagcacta aaaatggatg acatcaaaca
2221 tttttaaaca gaaaaaaaag atgtacaaac taaataagga ctatttattg ataatgtttt
2281 gctactcttg tcagacaatg gctataaact gaattaggca gtcttaaaaa aaaaccg
//
LOCUS PHALPO 5710 bp ds-DNA PLN 15-AUG-1990
DEFINITION P.chrysosporium lignin peroxidase genes, complete cds.
ACCESSION M37701 M22720
KEYWORDS lignin peroxidase.
SOURCE P.chrysosporium (strain BKM-F-1767 (ATCC 24725)) DNA.
ORGANISM Phanerochaete chrysosporium
Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes;
Agaricales; Corticiaceae.
REFERENCE 1 (bases 3402 to 5365)
AUTHORS Walther,I., Kaelin,M., Reiser,J., Suter,F., Fritsche,B.,
Saloheimo,M., Leisola,M., Teeri,T., Knowles,J.K.C. and Fiechter,A.
TITLE Molecular analysis of a Phanerochaete chrysosporium lignin
peroxidase gene
JOURNAL Gene 70, 127-137 (1988)
STANDARD full staff_entry
REFERENCE 2 (bases 1 to 3543; 5096 to 5710)
AUTHORS Huoponen,K., Ollikka,P., Kaelin,M., Walther,I., Maentsaelae,P. and
Reiser,J.
TITLE Characterization of lignin peroxidase-encoding genes from
lignin-degrading basidiomycetes
JOURNAL Gene 89, 145-150 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly provided
by J.Reiser, 22-FEB-1989.
FEATURES from to/span description
pept 652 712 lignin peroxidase lpoB, exon 1
770 923 lignin peroxidase lpoB, exon 2
976 1032 lignin peroxidase lpoB, exon 3
1087 1301 lignin peroxidase lpoB, exon 4
1354 1395 lignin peroxidase lpoB, exon 5
1474 1552 lignin peroxidase lpoB, exon 6
1603 2026 lignin peroxidase lpoB, exon 7
2079 2143 lignin peroxidase lpoB, exon 8
2197 2218 lignin peroxidase lpoB, exon 9
pept 5098 5038 (c) lignin peroxidase lpoA, exon 1
4977 4824 (c) lignin peroxidase lpoA, exon 2
4770 4714 (c) lignin peroxidase lpoA, exon 3
4659 4445 (c) lignin peroxidase lpoA, exon 4
4391 4350 (c) lignin peroxidase lpoA, exon 5
4296 4218 (c) lignin peroxidase lpoA, exon 6
4167 3744 (c) lignin peroxidase lpoA, exon 7
3689 3625 (c) lignin peroxidase lpoA, exon 8
3562 3541 (c) lignin peroxidase lpoA, exon 9
IVS 713 769 lpoB intron A
IVS 924 975 lpoB intron B
IVS 1033 1086 lpoB intron C
IVS 1302 1353 lpoB intron D
IVS 1396 1473 lpoB intron E
IVS 1553 1602 lpoB intron F
IVS 2027 2078 lpoB intron G
IVS 2144 2196 lpoB intron H
IVS 2219 769 lpoB intron I
IVS 5037 4978 (c) lpoA intron A
IVS 4823 4771 (c) lpoA intron B
IVS 4713 4660 (c) lpoA intron C
IVS 4444 4392 (c) lpoA intron D
IVS 4349 4297 (c) lpoA intron E
IVS 4217 4168 (c) lpoA intron F
IVS 3743 3690 (c) lpoA intron G
IVS 3624 3563 (c) lpoA intron H
BASE COUNT 1204 a 1613 c 1594 g 1299 t
ORIGIN
1 agctcacttt acctatacac atctgcattc agtccttcca gttctctgac cctaacatcc
61 ggtaaatgta ccttcagtga tcgggacgga aggtatgggc ctttcgcata ggtgggtaat
121 ctgcgactgt atgttttgta tggtaccctg agacagtcac ttactgtttc tgctcgctcc
181 aggtaccatt gtcccgcctc tgcgtgattt ccgaggctgg actggcccat ctctgcccac
241 cctgtcctca tctgccaaga gccatcggaa tgccaagccg tgaccactcc aaccggtccc
301 gttctctcag ccactgcgca agtttcttac aggagggctg cttcgccgtt cattcgcggc
361 ctccggatag ctagcgagct tcgatgctcg tggccaatta tggaagcagt cgttgatcgc
421 accggtcccg tactgccttc gctcacaagc cgtgttgttg cgagactctc attcgctggc
481 tcagggtatt gtgcctgttt gctgaggcac agtgcagtca atacacactt gtctcgtcag
541 gacgcggttt gacattccgt ggtgcgtgaa acggtataaa agggatacgc gatttgcagc
601 atatcctcag gccattcgtc ttctacagcc caagttccaa gtcaaacggt catggccttc
661 aagcagctcg tcgcagcgat ttccctcgca ctctcgctca ccactgccaa tggtacgcac
721 cgcttctgca tgctgtgata acgggccccg actaacgcct ccgctgcagc cgccgtggtc
781 aaggagaagc gcgccacctg ctccaacggc gccaccgttg gcgacgcgtc ctgctgtgct
841 tggttcgatg tcctcgacga catacagcag aacctgttcc aaggaggcca gtgcggcgct
901 gaggcccacg agtctatccg tctgtaagtc aatacgctgg tgttgcgcca aggtcataga
961 ttcactttgc tgcagcgtgt tccacgatgc tattgccatc tctcctgcta tggaggccca
1021 gggcaagttc gggtatgtct ttccggcatg gcaatatttt acagcagaca ctgagatatt
1081 gcgcagcggt ggtggtgctg acggctccat catgatcttc gacgacatcg agcccaactt
1141 ccaccctaac attggcctcg acgagattat caacctccag aagccgttcg tccagaagca
1201 cggtgtcacc cctggtgact tcatcgcctt cgccggtgct gtcgcgctca gcaactgccc
1261 gggtgcccca cagatgaact tcttcactgg tcgtcgtcct ggtacgtctc ctctacgaat
1321 cgatctcgac acctcattca tatcgcctta tagctaccca gcccgcaccc gatggtctcg
1381 ttcccgagcc tttccgtgag tttgcagacc acttcatcgc atagttctta gctgacctct
1441 tcatcgcata gttcttagct gacttcagca cagacaccgt cgaccagatc atcgctcgtg
1501 ttaacgatgc cggcgagttc gacgagctcg agcttgtctg gatgctttcc gcgtaagtga
1561 ctgccgcctc gaatttccat cccgacttac accccgattc agccactccg ttgctgcagt
1621 caacgacgtg gacccgaccg tccagggcct gcccttcgac tccacccccg gaatcttcga
1681 ctcgcagttc ttcgtcgaga ctcagttccg tggtatcctc ttccccggct ccggtggcaa
1741 ccagggtgag gtcgagtccg gtatggctgg cgagatccgc atccagaccg accacactct
1801 cgcccgcgac tcccgcaccg cttgcgagtg gcagtcgttc gtcaacaacc agtccaagct
1861 cgtctccgac ttccagttca tcttccacgc cctcacccag ctcggccagg acccgaacgc
1921 gatgaccgac tgctcggatg tcatcccgat ctcgaagccc atccccggca accttccgtt
1981 ctcgttcttc ccccctggca agagcatgaa ggatgttgag caggctgtag tatccgattc
2041 agtccttgtc gcagagctta tgctgacggc ttctgcagtg cgccgagacc cccttcccca
2101 gcctcgtcac tctccccggc cccgcgacct ctgtcgctcg catgtgagta tctccgacgg
2161 tctatgaagc ccccagctga catattcctc ttccagcccc ccgccgccgg gtgcttaagt
2221 cattctatcg gtcatctttg gctgaaacgg agtatttgga atacggctca ctcgtaacgg
2281 taacttgcgc tcaagtgttt agaaatgtct cctttgtatc tacgcgattg gtccgctttt
2341 gacgatagat cgttactgtg ttcattgaaa ttctcgtccg cgcgccctgg agcgaaccgg
2401 ttagcattgc cacacgagag ctcttccgtt gctccaactc gagctgtaat ggtccaacgc
2461 tccacgctac atcaatttaa cctctcatgg gtacggtgta ttcggcaagt ttatctcaca
2521 taataagagg cacgctatca ttcgacgata caagaacatg agccttcgct tcgtttatga
2581 tattggttca ctgtcgagct aatttctgag ggttagcgct ctgacatgat cagctacagg
2641 aacggaggcc gtaccttgaa tgtgcccata aacccgctgt cttattcttc tcaaattgat
2701 tcttcatgtt tgaatcacgt ttgcaggtgc attcgtgtac ctgcggcgcg tacacgcggt
2761 atgtattggt cgcaaatcgc atcatggtga gatcttgctc ttcactcttg aagttgctac
2821 cgtataccac catgtgcagg aattctcgta catccctgtt tctcctcgaa tgtatgtgga
2881 gccagggaaa ccctaacccc ggattctgct gagatgcgtc gatgcacgca gccgtagcgg
2941 aggtccgtga ggtccgctcc ggccacgaag caggggccgt cctgaccggt cgaaggtcat
3001 gtcgtgcgac atagtcggct tccaggagga cgatatcgac caatacgtcg aaaggaggag
3061 actgcgggtc taggctggac gctgtttgcg agggcccggg ggagaacgag gccattggga
3121 gtcagcgaga ttattgaata gtcgaagggt attcattgag tcactaaggg aaacacttct
3181 gagccgctgg tagtacttgt gtatgcccgg gttctgcgcc tgataattag cctcgctcct
3241 ccgttgacgt tgggttttgg caataggaca tcaccacttt caccacgcgg acgcaatgcg
3301 aagggcacga gtggtatctc aatagctagt taccttccaa gaccctcaat catgatcgga
3361 agaagaggat gtgcaccgat atttcataag cccacggcag atatcgtaag agagtagacg
3421 aatgagattc gtagttaggt gcagagatac gatgaatgaa atctagtaaa gccgaagttc
3481 cgtcacgagt tagccggcca ccgttacagt cggtttgagg agtattctgt atggcatcat
3541 ttaagcaccc ggaggcggag ggctggagaa ggagcatgtc agcccagatt gcatttcctg
3601 aaagatctca tggattgtac tcacatgcgc tggacggacg tctcggggcc cgggagagtg
3661 gtgagagtcg ggaagggggt ctccgcacac tgtcatgcga tgttcagcag ccactctact
3721 gcatggtggg gtgaaatacg caccgcctgc tcaacgtcct tgatggtctt gccagcgggg
3781 aagaacgaga atgggaggtt gccagggatg ggcttggact gcgggataac atccgagcag
3841 tcggtcatcg cgttcgggtc ctggccgagc tgggtgaggg cgaggaagat gaactggaag
3901 tcatcgacga gcttggactg gttgttgacg aaggactgcc attcacacgc cgtgcgcgag
3961 tcgcgggcga tagtgtggtc ggactggatg cgaatttcgc cagggagcgg cgactcgacc
4021 tcgccttggt tgccaccaga gccggggaag gcggtaccac gaagctgagt ctcgacgaag
4081 aactgggagt cgaagattcc gggggtcgag tcaaagggca gaccctggac ggtcgggtcg
4141 acgtcgttca ccgctgcgac ggagtgcctg tcgaggtctc aggaagggag tgtcgaagtc
4201 aacagtgagt gacttacgcg gagagcatcc agacaagctc gagctcatcg aactcgcctg
4261 cgtcgttgac acggttgatg atttggtcga cagtgtctgc atgctagtca gtatagaccg
4321 cacctaactg cttggataag accacttacg gaagggctcg gggacaaggc catcaggagc
4381 gggctgggta gctaaagcag acagttagtt cgtaccatcc gcaaagcgag ttttgcaggt
4441 ataccaggtg cacgaccagt gaagaagttc atctgcgggg caccagggca gttgctgagc
4501 gcgacacgac cagcgaaggc gatgaagtca ccaggggtga caccgtgctt ctgaacgaat
4561 ggcttctgga gcttgacgat ctcgtcgaga ccgatgttag ggtggaacgc agtctcgata
4621 tcgtcgaaga tcatgatgga gccgtcagca ccaccgccgc tgcaaggagg gatcagcaaa
4681 cgactaggtg gcgcaacgcg ggtggcaact tacccgaact tgccctgtgc ctccatggcg
4741 ggcgaaattg cgatggagtc gtggaagacg ctgggcgggg tgttcaaaca tgcatagcag
4801 gagatcgcga cgggatcact cacagacgaa tcgactcgtg cgcctcagcg ccgcactggc
4861 cgccgtggaa caggttctgc tggatatcat ccaggacgtc gaaccaagcg cagcacgacg
4921 catcgccgac ggtcttgccg ttggaacagg tggcgcgctt ctcgatcacc gcagccgctg
4981 cacaagacga cgttcagcat gcagtccact ggtcaacgct aactgcgatg ggcataccgt
5041 tcgcagccga gagcaagaga gcgagagaga tagctgcgaa gagctgcttg aaggccatgt
5101 ccgctgtgtt gctggtgctg agtgggactg aagagactgg atgtctgagg gactgcggtg
5161 gtcctgtcgc ccttttatac cctaggcgtg gtcgacgtcc tggtattgtt cgccgtagaa
5221 cagtgtcgaa tcgacgtgac gcggtgcgcg gacatgcacg acactgcgcc agccaatgag
5281 gacgctgcca aaacgcagcc tgtgagcgag ttggtgcggt gccggcaacc atcaccgact
5341 cgtctcacat ttgggccact gcgtcgagcg cagttcgcgc cggcaccgct gttgaatagc
5401 acgcgagctc tgcaagaaag aatagggcgg cccatgagaa cagaaatccg agtcagagga
5461 attaactgcg cgtgccgatg agtcttgaca tgaggatgat ctaacgaaga gaccttgcat
5521 tgagccgttt ccagtgctgc caggggtaat cagtcggcat tactgccaag tccggggatg
5581 tactgctagc tcactcccat cgcaatatgt caccgagtat tgcctttgtg aacataccat
5641 tgattcggtc ccgatcatgc acgaacgact cccgcaaagt ggggccagtg actatcacgt
5701 ccgtgctcag
//
LOCUS XANXCAA 2333 bp ds-DNA BCT 15-AUG-1990
DEFINITION X.campestris major extracellular endoglucanase (engXCA) gene,
complete cds.
ACCESSION M32700
KEYWORDS major extracellular endoglucanase.
SOURCE X.campestris DNA.
ORGANISM Xanthomonas campestris
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Pseudomonadaceae.
REFERENCE 1 (bases 1 to 2333)
AUTHORS Gough,C.L., Dow,J.M., Keen,J., Henrissat,B. and Daniels,M.J.
TITLE Nucleotide sequence of the engXCA gene encoding the major
endoglucanase of Xanthomonas campestris pv. campestris
JOURNAL Gene 89, 53-59 (1990)
STANDARD simple staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by C.L.Gough, 09-MAR-1990.
FEATURES from to/span description
pept 383 1864 major extracellular endoglucanase (engXCA)
precursor
sigp 383 457 major extracellular endoglucanase signal
peptide (put.)
matp 458 1861 major extracellular endoglucanase (put.)
BASE COUNT 444 a 800 c 734 g 355 t
ORIGIN
1 gaattcccgg ggatcacaaa cgacgcgaac aagccgacct gcgggtccac gcctgcgacg
61 aacgcaaagg cgatgacttc gggaatcagg gcgaacgtgg caacggcgcc agccatcagt
121 tcgcgcgcag gcgaggcgcc attgcgccag ttcggtgcgc aggaaggaca tgggggacac
181 tccagggaca agaacgacat gcctgcggac agcgcgcagg gggcactagt gtgcgggaaa
241 cggccgctcc cgcagccgcg atgtgatcgg tgcggcaatg gtgttttctg tggggacgat
301 cacaccacgc gacgcgcgca cagaccaaga tgcccgcctt accgcgctcg ggtgtcgagc
361 ccggttctct agggagatca ccatgtccat attcaggacc gcaagcacgc tcgctttggc
421 caccgccctc gcactggccg ccgggccggc cttcagctat tccatcaaca acagcaggca
481 gatcgtcgac gacagcggca aggtcgtgca gctcaagggt gtgaacgtgt tcggcttcga
541 aaccggcaac cacgtgatgc atggcctgtg ggcacgcaac tggaaggaca tgatcgtgca
601 gatgcagggc ctgggcttca acgccgtgcg cctgccgttc tgcccggcca cgctgcgtag
661 cgacaccatg ccggccagca tcgactacag ccgcaacgcc gacctgcagg gcctgacctc
721 gctgcagatc ctcgacaagg tgatcgccga attcaatgcg cgcggcatgt atgtgctgct
781 ggatcaccac acccccgatt gcgccggcat ttccgagctc tggtacaccg gctcctatac
841 cgaggcacag tggctggccg acctgcgctt tgtggccaac cgctacaaga acgtgccgta
901 tgtactcggc ctggatctga agaacgaacc gcacggcgcc gccacctggg gtaccggcaa
961 cgccgccacc gattggaaca aggctgccga gcgcggctcg gccgcggtgt tggcggtcgc
1021 gccgaagtgg ctgatcgcgg tggaaggcat caccgacaac ccggtgtgct ccaccaacgg
1081 cggcatcttc tggggcggca acctgcagcc gctggcctgc accccgctca acatcccggc
1141 caaccgcctg ctgctggccc cgcacgtgta cggcccggac gtgttcgtgc agtcgtactt
1201 caacgacagc aacttcccca acaacatgcc cgccatctgg gaacgccatt tcggtcagtt
1261 cgccggcacg catgcgctgt tgctgggcga gttcggtggc aagtacggcg aaggcgacgc
1321 acgcgacaag acctggcagg acgcgctggt gaagtacctg cgcagcaagg gcatcaacca
1381 gggcttctac tggtcgtgga atcccaacag cggcgacacc ggcggcatcc tgcgcgatga
1441 ctggaccagc gtgcgccagg acaagatgac cctgctgcgc acgctgtggg gcaccgccgg
1501 caataccacg ccgacgccga ctcccacacc tacgcccaca ccgacaccga cgcctacccc
1561 cacgccgacg cccaccccgg gcaccagcac cttcagcacc aaggtgatcg cctcgccggt
1621 ggtggggtcg gcagcgcgaa aactgccggc ggcatcgcgg ctggcttgcc attggccggc
1681 cagcagcacg ggttggagag tctgggtcat cgcggcacct tcggttacgt ggaagcgccc
1741 gcacgcagca cgggcgatcg aacggcggat gagggtaacg cgcctgcgac gtgccacccg
1801 tttgaatcgt ggaccactac cggcaccggc ccatacaacg cagcacgcac cgcggctgcg
1861 ctaaacaagg ccgcgcgacg gcggtggcgc gtgctcagtg caggctgggc gcggtggcga
1921 tggcgtggtc gatcaccttc agcgctgcct cgcgctcggc accgtccacc accaggcgtg
1981 gcgcacggac acgctcgctg cccaggccca ccttttcctg caccagtttg atcagctgca
2041 cgaacttggg cacggtatcc aggcgcagca gcggcaggaa ccagtcgtac agttccttgg
2101 cggcggggta accgccgtcg cgtgccagtt cgaacaggcg taccgactcc ttcggactac
2161 tgtacttgac cagcccggcg atccacccct tggcgcccat gctcaggcct tcgacgatgg
2221 cgtcgtccat gccgaccagc agcgccagac gatcgcccag caattcctgc agcgcggcga
2281 agcggcgcac atcgccggaa gattccttta ctgcctgcag gattggggaa ttc
//
LOCUS FLAHANENJ8 1458 bp ss-RNA VRL 15-AUG-1990
DEFINITION Influenza virus A/NJ/8/76 (H1N1) hemagglutinin/neuraminidase (seg
4) gene, complete cds.
ACCESSION M27970
KEYWORDS hemagglutinin/neuraminidase.
SOURCE Influenza virus A/NJ/8/76, cDNA to viral RNA, clones pNA[6,28],
passed in embryonated eggs.
REFERENCE 1 (bases 1 to 1458)
AUTHORS Miki,T., Nishida,Y., Hisajima,H., Miyata,T., Kumahara,Y.,
Nerome,K., Oya,A., Fukui,T., Ohtsuka,E., Ikehara,M. and Honjo,T.
TITLE The complete nucleotide sequence of the influenza virus
neuraminidase gene of A/NJ/8/76 strain and its evolution by
segmental duplication and deletion
JOURNAL Mol. Biol. Med. 1, 401-413 (1983)
STANDARD simple staff_entry
FEATURES from to/span description
pept 21 1430 hemagglutinin/neuraminidase precursor
sigp 21 125 hemagglutinin/neuraminidase signal peptide
matp 126 1427 hemagglutinin/neuraminidase
BASE COUNT 462 a 257 c 343 g 396 t
ORIGIN
1 agcaaaagca ggagtttaaa atgaatacaa atcaaagaat aataaccatt gggacaatct
61 gtctaatagt tggaataatt agtctattat tgcagatagg aaatataatc ttgttatgga
121 tgagccattc aattcagact ggagaaaaaa gccatcctaa ggtatgcaac caaagtgtca
181 ttacctatga aaacaacaca tgggtgaacc agacttatgt aaacattagc aataccaata
241 ttgctgctgg acagggtgtg actccaataa tactagccgg caattcctct ctttgcccaa
301 tcagtgggtg ggctatatac agcaaagaca atagcataag gattggttcc aaaggagaca
361 tttttgtcat gagagagcca ttcatttcat gctctcactt ggaatgcaga accttttttc
421 tgacccaagg cgctttgctg aatgacaggc attctaatgg aaccgtcaag gacaggagtc
481 cttatagaac cttaatgagc tgccccatcg gtgaagctcc atctccgtac aattcaaggt
541 tcgaatcagt tgcttggtca gcaagtgcat gccatgatgg aatgggatgg ctaacaatcg
601 ggatttccgg tccagataat ggagcagtgg ctgttttaaa atacaatggt ataataacag
661 atacaataaa aagttggaga aacaaaatat taagaacaca agagtctgaa tgtgtttgta
721 taaacggttc gtgttttact ataatgactg acggcccaag caatgggcaa gcctcgtaca
781 aattattcaa aatggagaaa gggaagatta ttaagtcaat tgagctggat gcacctaatt
841 accactatga ggaatgctcc tgttaccctg atacaggcaa agtggtgtgt gtgtgcagag
901 acaattggca tgcttcgaat cgaccatggg tctctttcga tcagaatctt gattatcaaa
961 tagggtacat atgcagtggg gttttcggtg ataatccgcg ttctaatgat gggaaaggca
1021 attgtggccc agtactttct aatggagcaa atggagtgaa ggggttttca tttagatatg
1081 gcaatggtgt ttggatagga agaactaaaa gtatcagctc tagacgtgga tttgagatga
1141 tttgggatcc taatggatgg acagaaactg atagtagttt ctctatgaag caagatatta
1201 tagcattaac tgattggtcg ggatacagtg gaagttttgt ccaacatcct gaattaacag
1261 gaatgaactg cataaggcct tgtttctggg tagagttaat cagagggcaa cccaaggaga
1321 gcacaatctg gactagtgga agcagcattt ctttctgtgg cgtgaacagt ggcactgcaa
1381 gctggtcatg gccagacgga gctgatctgc cattcaccat tgacaagtag tttatccaaa
1441 aaactccttg tttctact
//
LOCUS HUMHIS3PRM 1125 bp ds-DNA PRI 15-AUG-1990
DEFINITION Human histone H3 gene, complete cds.
ACCESSION M26150
KEYWORDS histone.
SOURCE Human HeLa cell DNA, clone pST519.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1125)
AUTHORS Marashi,F., Helms,S., Shiels,A., Silverstein,S., Greenspan,D.S.,
Stein,G. and Stein,J.
TITLE Enhancer-facilitated expression of prokaryotic and eukaryotic genes
using human histone gene 5' regulatory sequences
JOURNAL Biochem. Cell Biol. 64, 277-289 (1986)
STANDARD simple staff_entry
FEATURES from to/span description
pept 557 964 histone H3 /hgml_locus_uid="LV0006C"
/nomgen="H3F2" /map="1q21"
mRNA 520 > 964 histone H3 mRNA (5' end + / - 4 bp)
signal 422 425 CAAT box
signal 463 468 CAAT box
signal 485 492 TATA box
BASE COUNT 298 a 283 c 267 g 277 t
ORIGIN
1 gcagcggcgt gataacagct cactgtaacc tcgaactcgg gctcaagcga tcctcatcga
61 cagccttctg agtagctggg attacaggcg agagcgccac gcccgactaa gagcattttc
121 taattgccca cacttcttat gcgacaccca gaaaaataca attttaaata aagcgcatat
181 gcaaataacc ctaatcgtct ccaatattca ctgatttctt ttttatattt taactagaaa
241 caattggagg tttccgcgtt gctttgtgtg gttgtaaatt ttaagacttc aggaaacttt
301 tccagtacaa gacttgtcca acagtggata tagcagctaa ggggttaaca aaatgacgtc
361 agagtagcta cggtaatggg caggagcctc tcttaatctg caaccaagca cagagatgga
421 ccaatccagg aagggcgcgg ggatttttga atttacttgg gtccaatggt tggtggtctg
481 actctataaa agaagagtag ctctttcctt tcctccacag acgtctctgc aggcaaagct
541 tttctgtggt tttgccatgg ctcgtactaa acagacagct cggaaatcca ccggcggtaa
601 agcgccacgc aagcagctgg ctaccaaggc tgctcgcaag agcgcgccgg ctaccggggg
661 cgtgaaaaag cctcaccgtt accgcccggg cactgtggct ctgcgcgaga tccgccgcta
721 ccaaaagtcg accgagttgc tgattcggaa gctgccgttc cagcgcttgg tgcgagaaat
781 cgcccaagac ttcaagaccg atcttcgatt ccagagctcg gcggtgatgg cgctgcagga
841 ggcttgtgag gcctacttgg tagggctctt tgaggacaca aacctttgcg ccatccatgc
901 taagcgagtg actattatgc ccaaagacat ccagctcgct cgccgcattc gcggagaagc
961 gtaaatgtaa agtcactttt tcatcagtct taaaacccaa aggctctttt cagagccacc
1021 cacttattcc aacgaaagta gctgtgataa ttttttgttg tcttaacaga acaaatttct
1081 aaggaccccc ccggaaagca ttagactatg gcttaaagtt gatac
//
LOCUS MUSTUBMA1 786 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Mouse alpha-tubulin gene M-alpha-1, 3' end.
ACCESSION M28729
KEYWORDS alpha-tubulin.
SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 786)
AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
TITLE Five mouse tubulin isotypes and their regulated expression during
development
JOURNAL J. Cell Biol. 101, 852-861 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 597 alpha-tubulin (AA at 1)
signal 773 778 poly-A signal
BASE COUNT 186 a 187 c 210 g 203 t
ORIGIN
1 gaattccaga ccaacctggt accctaccct cgtatccact tccctctggc cacttatgcc
61 cctgtcatct ctgctgagaa agcctaccac gagcagcttt ctgtagcaga gatcaccaat
121 gcctgctttg agccagccaa ccagatggtg aaatgtgacc ctcgccatgg taaatacatg
181 gcttgctgcc tgctgtaccg tggtgatgtg gttcccaaag atgtcaatgc tgccattgcc
241 accatcaaga ccaagcgtac catccagttt gtggactggt gccccactgg cttcaaggtt
301 ggcattaact accagcctcc cactgtggta cccggtggtg acctggccaa ggtgcagaga
361 gctgtgtgca tgctgagcaa caccacagcc attgctgagg cctgggctcg cctagatcac
421 aagtttgatc tgatgtatgc caagcgtgcc tttgtgcact ggtatgtggg tgagggcatg
481 gaggagggtg agttctctga ggcccgtgag gacatggctg ccctagagaa ggattatgag
541 gaggttggtg tggattctgt ggaaggcgag ggggaggaag aaggagagga atactaaatt
601 aaatgtcaca aggtgctgct tccacaggga tgtttattgt gttccaacac agaaagttgt
661 ggtctgatca gttaatttct atgtggcaat gtgtgctttc atacagttac tgacttatga
721 atgattgatt ttgacagaga ccccaagctg cccatttcac ttatgggttt taaataaaat
781 actccc
//
LOCUS MUSTUBMA2 1198 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Mouse alpha-tubulin gene M-alpha-2, 3' end.
ACCESSION M28727
KEYWORDS alpha-tubulin.
SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1198)
AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
TITLE Five mouse tubulin isotypes and their regulated expression during
development
JOURNAL J. Cell Biol. 101, 852-861 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 1059 alpha-tubulin (AA at 1)
BASE COUNT 259 a 329 c 303 g 307 t
ORIGIN
1 gcaaataact atgcccgtgg ccactacacc attggcaagg agatcattga ccttgtcctg
61 gacaggattc gcaagctggc tgaccagtgc acgggtctcc agggcttgtt cgttttccac
121 agctttggcg ggggaactgg ctctggcttc acctccctgc tgatggagcg gctctctgtg
181 gattacggaa agaagtccaa gctggagttc tccatttacc cagcccccca ggtttccact
241 gctgtggttg agccctacaa ttccatcctc accacccaca ccaccctgga gcactctgat
301 tgtgccttca tggtagacaa tgaggccatc tatgacatct gtcgtagaaa cctcgacatt
361 gagcgcccaa cctacaccaa ccttaaccgc cttattagcc agattgtgtc ttccatcact
421 gcttccctca gatttgatgg ggccctcaat gttgatctga cagaattcca gaccaacctg
481 gtaccctacc ctcgcatcca cttccctctg gccacttatg cccctgtcat ctctgctgag
541 aaagcctacc atgagcagct ttctgtagca gagatcacca atgcctgctt tgagccagcc
601 aaccagatgg tgaaatgtga ccctcgccat ggtaaataca tggcttgctg cctgctatac
661 cgtggtgatg tggttcccaa agatgtcaat gctgccattg ccaccatcaa gaccaagcgc
721 acgatccagt ttgtagactg gtgccccact ggcttcaagg ttggcattaa ttaccagcct
781 cccactgtgg tacccggtgg tgacctggcc aaggtgcaga gagctgtgtg catgctgagc
841 aacaccacag ccattgctga ggcctgggct cgcctagatc acaagtttga tctgatgtat
901 gccaagcgtg cctttgtgca ctggtatgtg ggtgagggca tggaggaggg tgagttctct
961 gaggcccgtg aggacatggc tgccctagag aaggattatg aggaggttgg tgtggattct
1021 gtggaaggcg agggggagga agaaggagag gagtactaag tccattcctt gagccccctg
1081 tgtcgtcaaa tgctccagta ttagttgcag gcacctgatg cttctgtgct gtttccattc
1141 tgtgatcatg tcttctccat gttgtacctc ttaagttttc catgatgtct caaactaa
//
LOCUS MUSTUBMB2 488 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Mouse beta-tubulin gene M-beta-2, 3' end.
ACCESSION M28739
KEYWORDS alpha-tubulin.
SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 488)
AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
TITLE Five mouse tubulin isotypes and their regulated expression during
development
JOURNAL J. Cell Biol. 101, 852-861 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 303 beta-tubulin (AA at 1)
signal 472 477 poly-A signal
BASE COUNT 121 a 116 c 131 g 120 t
ORIGIN
1 cccaacaacg tcaagacggc cgtgtgtgac atccctcctc gtggcctcaa gatgtcagcc
61 accttcattg gcaacagcac tgccatccag gagctgttca agcgcatctc ggagcagttc
121 actgccatgt tccggcgcaa ggctttcctg cactggtaca cggctgaggg catggacgag
181 atggagttca ccgaggcgga gagcaacatg aatgacctgg tgtctgagta ccagcagtac
241 caggatgcca cggccgatga gcagggcgag ttcgaggagg aggagggtga agatgaggct
301 tgagaacttc tcagatacag tgtgcaccct tagtgaactt ctgttgtcct ccagcattgg
361 tctttctatt tgtaaattat ggtgctcagt ttgcctctgt cagaaattca ctgttgatgt
421 aatagtgtga acctctttca agatcacagt attgtctcag aaatctatat gaataaaaaa
481 gcatgtgg
//
LOCUS MUSTUBMB4 1454 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Mouse beta-tubulin gene M-beta-4, 3' end.
ACCESSION M28730
KEYWORDS alpha-tubulin.
SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1454)
AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
TITLE Five mouse tubulin isotypes and their regulated expression during
development
JOURNAL J. Cell Biol. 101, 852-861 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 1140 beta-tubulin (AA at 1)
BASE COUNT 327 a 456 c 376 g 295 t
ORIGIN
1 gtcgacctgg aacccggcac catcgactct gtccgctccg gcccttttgg ccagatcttt
61 cggccagaca actttgtatt tggtcaatcc ggagcaggca acaactgggc caagggtcac
121 tacaccgagg gcgcgcagtt agtggatgcc gtcctggacg tggtgcgcaa agaggcggaa
181 agctgcgact gtctccaggg cttccagctc acccactcgc tcggaggtgg caccggctca
241 ggcatgggga ccttgctcat cagcaagatc cgagaggagt ttccagacag gatcatgaat
301 acgttcagcg tggtgccatc acccaaggtg tctgacacgg tggtggagcc ctacaatgcc
361 acactgtctg tgcatcagct ggtggagaac actgatgaga cctactgcat cgacaacgag
421 gccctgtacg acatctgctt ccgtacgctc aagctgacca cgcccacgta cggggacctc
481 aaccacctcg tgtcagccac catgagtgga gtcaccacct gcctacgttt cccgggccag
541 ctcaatgcag acctacgcaa gctggctgtg aacatggtgc cattcccccg tctccacttc
601 ttcatgccag gattagcacc cttgaccagc aggggcagcc agcagtaccg ggccctcacc
661 gtccctgagc tgacccaaca ggtgttcgat gctaagaaca tgatggctgc gtctgacccg
721 agacacggtc gctacctgac tgtggctgct gtcttccggg gacggatgtc catgaaggag
781 gtagacgagc agatgttaag tgtgcagagc aagaacagca gttacttcgt tgagtggatc
841 cccaacaatg tgaaggcagc cgtatgtgac atcccgcccc gcggcctgaa gatggcagcc
901 accttcatcg gcaacagcac tgccatccag gagctgttca agcgcatctc ggagcagttc
961 accgccatgt tcagacgcaa ggccttcctg cactggtaca cggccgaagg catggacgag
1021 atggagttta cggaagcaga gagcaatatg aacgacctgg tgtccgagta ccagcagtac
1081 caggatgcca ctgctgaaga gggcgagttc gaagaggagg ctgaagagga ggtggcttaa
1141 gtctcctgcc atcactctgt ccctggggcc caccagcaaa gctttgaccc taagcatcac
1201 acccctgcac ctagttgcct cattccctag gaccccatga gcatcttcac catgaggcca
1261 agcccaggtt gcttctattt gcttcacctt taactcctaa accccactgt ctctccaacc
1321 tgccagggaa gggctcttct agttcccatg agcgcccctc aacacatgta cacacgcaca
1381 cacactccac cttcttagat cttgaaaatc ctttccttta tgccctgtcc cttccccagc
1441 actcctgaac cgat
//
LOCUS MUSTUBMB5 542 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Mouse beta-tubulin gene M-beta-5, 3' end.
ACCESSION M28732
KEYWORDS alpha-tubulin.
SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 542)
AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
TITLE Five mouse tubulin isotypes and their regulated expression during
development
JOURNAL J. Cell Biol. 101, 852-861 (1985)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 363 beta-tubulin (AA at 1)
signal 524 529 poly-A signal
BASE COUNT 126 a 136 c 149 g 131 t
ORIGIN
1 gaggtggatg agcagatgct caatgtgcag aacaagaata gcagctactt cgtggaatgg
61 atccccaaca atgtcaagac agctgtctgt gacatcccac cgcgtggcct caagatggca
121 gtcaccttca ttggaaacag cacagccatc caggagctgt tcaagcgcat ctctgagcag
181 tttacggcta tgttccgccg gaaggctttc ctccactggt acacggctga gggcatggac
241 gagatggagt tcaccgaggc tgagagcaac atgaacgacc tggtgtctga gtaccagcag
301 taccaggatg ccaccgctga agaggaagag gatttcggag aggaggcaga agaggaggcc
361 taacggcaga gagccctgca tcagctcagg ctgcttagac tccctcagcc tttctccaac
421 tgccctttgt cctccagttt ctttctgctg cctctgtctt gtatttgttt tgcttctgtt
481 ttctcattgg gggtaaatgg tgcctggcac atggcaggca ctcaataaat atttgtttgt
541 gg
//
LOCUS XELPAL 353 bp ss-mRNA VRT 15-AUG-1990
DEFINITION X.laevis parvalbumin mRNA, 3' end.
protein.
ACCESSION M28644
KEYWORDS parvalbumin.
SOURCE X.laevis tadpole, cDNA to mRNA, clone lambda-PV1.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 353)
AUTHORS Kay,B.K., Shah,A.J. and Halstead,W.E.
TITLE Expression of the Ca2+ -binding protein, parvalbumin, during
embryonic development of the frog, Xenopus laevis
JOURNAL J. Cell Biol. 104, 841-847 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept < 1 339 parvalbumin (AA at 1)
BASE COUNT 92 a 86 c 90 g 85 t
ORIGIN
1 agatttacta tggcattcgg tggtatcctg agtgaggctg acatctctgc tgccctgcag
61 aactgccaag ctgctgactc cttcaacttc aaaactttct ttgcccagtc tggtctgagc
121 agcaagtccg cagatgatgt gaaaaacgtc tttgccatcc tcgaccagga caggagcggc
181 ttcattgagg aagaggaact gaagttgttc ctccagaact tcagcgcaag tgccagggca
241 ctgactgatg ctgaaaccaa ggccttcctg gcagctggtg actctgatgg tgatggcaaa
301 attggagttg aagaattcca gtccctagtc aaaccttgaa gaagtaagac caa
//
LOCUS RATMLVI4 100 bp ss-mRNA ROD 15-AUG-1990
DEFINITION Rat Moloney murine leukemia provirus Mlvi-4 mRNA, partial sequence.
ACCESSION M36432
KEYWORDS provirus.
SOURCE Rat Moloney murine leukemia virus-induced T-cell lymphoma, cDNA to
mRNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Tsichlis,P.N., Lee,J.S., Bear,S.E., Lazo,P.A., Patriotis,C.,
Gustafson,E., Shinton,S., Jenkins,N.A., Copeland,N.G., Huebner,K.,
Croce,C., Levan,G. and Hanson,C.
TITLE Activation of multiple genes by provirus integration in the Mlvi-4
locus in T-cell lymphomas induced by Moloney murine leukemia virus
JOURNAL J. Virol. 64, 2236-2244 (1990)
STANDARD simple staff_entry
FEATURES from to/span description
mRNA < 1 > 100 Mlvi-4 mRNA
recomb 73 74 Rat DNA end/provirus DNA start
BASE COUNT 24 a 29 c 19 g 28 t
ORIGIN
1 ttactggaag ccctcctcat catgggattt catcacagta aacaacaatc tcacctctga
61 ccaggctgtc caggattctc ctcatggttt gtcgaaggtc
//GenBank-Updates@genbank.bio.net (08/16/90)
LOCUS THARGA 635 bp ds-DNA BCT 11-AUG-1990
DEFINITION T.acidophilum 5S rRNA gene.
ACCESSION M32297
KEYWORDS 5S ribosomal RNA.
SOURCE T.acidophilum (strain 122-1B2) DNA, clone pT5PH-23.
ORGANISM Thermoplasma acidophilum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Thermoplasmales.
REFERENCE 1 (bases 1 to 635)
AUTHORS Ree,H.K. and Zimmerman,R.A.
TITLE Organization and expression of the 16S, 23S and 5S ribosomal RNA
genes from the archaebacterium Thermoplasma acidophilum
JOURNAL Nucleic Acids Res. 18, 4471-4478 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.K.Ree 23-FEB-1990.
FEATURES from to/span description
rRNA 213 335 5S rRNA
RNA 211 346 5S RNA ( 3' end +/- 4 bp)
site 184 191 put. promoter
site 333 354 put. terminator
BASE COUNT 175 a 140 c 140 g 180 t
ORIGIN
1 ctgcagcctt atcatgggaa aatgggcatc tgctggagac gactatgtac ttcatgcatc
61 catattatct tatgatatta aaatgctaaa gcgcatgatc gcttcggaag gaatatcgac
121 agggcgagat tcgctgtctc ataatcgccc ggatctgcgc tgaagccgta aaatcacgaa
181 aatcttatat agatgtgttc tatatagtgt tcggcaacgg tcatagcagc agggaaacac
241 cagatcccat tccgaactcg acggttaagc ctgctgcgta ttgcgttgta ctgtatgccg
301 cgagggtacg ggaagcgcaa tatgctgtta ccacttttga aatgaaagtt ttttacatct
361 atctttaatt cagtggcaat gattaatcct tggtcatcat ctgatttctt cgactatgag
421 aggctaaaga gaggaattcg gcataggcga tatgaacgtt cccttcgatc attttctctt
481 caggacttat acttggccag cgatccatag actatatcga ctacgcttta aagcatcaca
541 tgaaattcaa cgttatgact ggcctcatgc cttcaggaga gatgcatctt ggcaacaaga
601 gcgccataga ccaggtcatc ttcttccaga agctt
//
LOCUS THARGG 4154 bp ds-DNA BCT 11-AUG-1990
DEFINITION T.acidophilum 23S ribosomal RNA gene.
ACCESSION M32298
KEYWORDS 23S ribosomal RNA.
SOURCE T.acidophilum (strain 122-1B2) DNA, clones pTH1-1, pL8 and pTH3-7.
ORGANISM Thermoplasma acidophilum
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
Thermoplasmales.
REFERENCE 1 (bases 1 to 4154)
AUTHORS Ree,H.K. and Zimmermann,R.A.
TITLE Organization and expression of the 16S, 23S and 5S ribosomal RNA
genes from the archaebacterium Thermoplasma acidophilum
JOURNAL Nucleic Acids Res. 18, 4471-4478 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by H.K.Ree 23-FEB-1990.
FEATURES from to/span description
rRNA 521 3426 23S RNA
RNA 429 3473 23S RNA gene (3' end +/- 5 bp)
BASE COUNT 1097 a 903 c 1177 g 977 t
ORIGIN
1 tggtgttaag gtggatgcac tatatctatg agagctaccg tgagatggcc ataagggaga
61 tacgcgatat gatagagtag ctgcataagg caattcggaa taacagacgc ggcggttgtt
121 catcgcatag gtagagtaaa tctcacagag gactccgttg ttatagcggt atctgcgcct
181 catagaggac aggcgtttga cgcatgcaga tacataatcg accagataaa ggagagagcg
241 cccatatgga aacgtgacat cgtgcaggga tctcctgcga gctggcactg aacccatttt
301 ttacggtatc agcaagaaaa agttaattaa aatatatttc aaattctgaa aagtgagtat
361 tataaaatta ttggataatt gggcttccgg atcaaaatgc ttatatccct cttaatgata
421 tagtccatac acgcttacaa tgcacctttt aatgcatcac attaaaaggg aagatgtcga
481 attttgatga gactagatga gagatggcac aatggattgt gctctgctaa tctgcctaga
541 ggatggcttg gttcgggcgc cgaagaagga cgtgccaagc tgcgataagc cttggggagg
601 cgcatggagc cttagatcca aggatctccg aatgggactt cctgccgtaa ggcactccga
661 aaggagaggg aacccgggga attgaaacat cttagtaccc ggaggaaaag aaatcaattg
721 agataccgtt agtaaaggcg atcgaaagcg gtagaaggca aaccgaatag cccttcgaaa
781 gaagggaaag atgtggagtt tggtcttcct ctaatgcctc ctgaagcgag atgaatcttc
841 tggaaagaag agccttagaa ggtgatagcc ctgtaatcga agcttcagaa gctacaaggg
901 gaagtaacca gagtaccatg cgtcgttttt cgcgtgggaa tttgggtggc actaacatcc
961 aaccttaaat acgtcccgag tccgatagcg aacaaagtac cgtgagggaa agctgaaaag
1021 aaacccggaa gggtggtgaa aagagcctga aactaggcag agataaactt atagggcagt
1081 taagaggtga agtcgttaac tacgatggat cgctgtccta ttgtccgtgt tgaagaacgg
1141 gccagggagt tctgacgagt ggcaaggtta atcctgaaag gagtagccgt agcgaaagca
1201 actacccgca cagcaatggg ggggtggcgt ggtaaacgcg tttagtcact cgtgagagac
1261 ccgaagccgg tcgatctaca cctgagtagg ttgaagctca gtgaaagctg ggtggaggac
1321 cgaacctatt ctgatgtgca aatcgtttgg atgacttggg tgtaggggtt aaaggccaat
1381 ctaggccggc aatagcgggt tccccccgat actacccgca ggtagacctc gatggagatt
1441 ctcggcgagg tagagcgacc gattggttgg taagcagtcg aaaggctgcg ccgacttgtc
1501 aaactccgaa cttgtcgaga tcgtagaagt cgggtgctag ggggcaggga taagctttgc
1561 ttccgtgatg ggaacaacca agacgagggt taaggtccct aagttctagt taagtgcact
1621 aaataggttt gtggccaaag acagtgggga ggtaggctca gaagcagcca tccttcaaag
1681 agtgcgtaac agctcactca ccgaggtcac atgcctagaa gatggaaggg gctaaaacta
1741 gacaccgaga ccttcgagca ccgaaaggtg atctggtagg ggggcgtgcc atgtggatag
1801 aagtctcccc gagaggaggg atggaccgca tggtatcgcg gatcctggtg aaagtagcag
1861 agaagaaccg tgagaatcgg ttccgccgaa agggctaggg ttccttggca atgttcgtca
1921 gccgagggtt agtcgatcct aaggccatac ctaacaggat atggtcgaag gggaagccgg
1981 ttaatattcc ggcacactga acgttttgcc ctgtatgaga aggttcaggg taggggcggt
2041 acgggtgcca acgtatttat gctcataagc ggatggagag tcgtaatgac gagaagttcg
2101 cgaaagagcg tatgttcccg tttgggaatc gcctcgatcc ccggatccca tgaaaatcat
2161 gcaggggtca ggttcagtat cgtaccaaga accgacactg gtgcccctag gtgagaagcc
2221 taaggcgttt tgggataatg gacgcgaggg aaatcggcaa aatagctccg tatcttcggt
2281 ataaggagtg cctattccgt gagggaatag gtcgcagtga cgaagggact ccgactgttt
2341 accacaaaca cagatcgctg ctagtccgta aggatgtgta tagcggttga aacctgccca
2401 gtgctggtac ctgaaagccc cgtacaaggg gaagaagggc cagtaaacgg cgggggtaac
2461 tatgaccctc ttaaggtagc gtaataccta gccgcttaat tggcggcttg catgaaggtt
2521 caacgtgggt cccactgtcc ccgcgttcag cccagtgaaa ttgatgtact ggtgcacaat
2581 ccagtctctc ccacgtgaaa gcgaagtccc cgtggagctt tactgcagcc tgtagctgtg
2641 gtgcgatcct gactgcgtag tgtaggaagg agccgtcgaa gctctggttt cggccggagt
2701 ggaggcgcca atgaaacact tccctctcgg gattgcgtca ctaacctctt cggagggaca
2761 actattggtg ggcagtttgg gtggggcgcc acgcccctaa caacgtaaca ggggccccca
2821 aaggtcagct caggagggtc agaaatcctc cgtagagtgt aaaagcaaaa gctggcttga
2881 ctgtgttgca gacaactagc aacgcagatg cgaaagcagg gtttagcgaa ccacccagtt
2941 cctccttagt gggggcgggt gataagagag aagttacccc agggataact gagtcgtcct
3001 cggcaagagt acacatcgac ccgagggttt gctacttcga tgtcgtctgt tcctatcctg
3061 gtgctgcata aggtgccaag ggtggggctg ttcgcccatt aaaagggatc ctgagatggg
3121 ttcactacgt cgcgagacag tagggttgct tctccgtggg agtgctcgat gtctgagggg
3181 aaggggcctt tagtacgaga ggaacaaggg ctcgtgacct ctagtttacc ggttgtctgg
3241 caaggcatcg ccgggtagcc acgtcatacg cggataagag ctgaaagcat ctaagctcga
3301 agccgccccc gaaaatagac atcgtcatca gatcgcctct agaagagagg tttgatagag
3361 ccgggatgta aggatcgagc ttcggcgaga ttttaagtcc acggctacta aagatcgaag
3421 gcacaatcca tgctaaattc tcgtctagtt gaagtcataa gcgtgtttga tttttgtttt
3481 ttattttttt gccgttgtag tgtgaactat gatgaaaaat tatattgata taaaagagaa
3541 atagaaaaaa tgatcactgc tttatatttt ctatctgagc ctctttcagt atctgatcgg
3601 ttatgctgtc gaaaatctca cgtatcttat catttataac gacggcaggt attccgttgt
3661 caccgttttc tgctatttct ggtattagag gtatttttcc caggaacggg aggttgtatt
3721 cctttgcgga tttctccccg ccgccgttct tgaatatgta tgtgagctta ccgcagtgcg
3781 cacacaaaac cgctcatgtt ctctattatg ccaaggacag ggagcttcag ctgttttgca
3841 aagtttattg ccttctttgc atcgagaagg gcgacatcct gcggcgttac gactatgaca
3901 acgccatttg attctggtat cagctgggcc acgctcagtg caacgtctcc ggtgccaggc
3961 ggcatgtcaa gcacaacaac atccgtatcc ttccagttta cgtcttccag aaactgctgt
4021 attgccttgt gcatcagcga accacgccat attatcggcg tatcctctgt cggtatgagg
4081 aaacccatgg acaccacgcg aacaccgtat cttgtctctg ccggtattat tccctcctcg
4141 tcagcgtaaa gctt
//
LOCUS HIVANT70 583 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral LTR region.
ACCESSION M31171
KEYWORDS .
SOURCE Human immunodeficiency virus type 1 (isolate ANT70) proviral DNA.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 583)
AUTHORS De Leys,R., Vanderborght,B., Haesevelde,M.V., Heyndrickx,L.,
van Geel,A., Wauters,C., Bernaerts,R., Saman,E., Nijs,P.,
Willems,B., Taelman,H., van der Groen,G., Piot,P., Tersmette,T.,
Huisman,J.G. and van Heuverswyn,H.
TITLE Isolation and partial characterization of an unusual human
immunodeficiency retrovirus from two persons of west-central
African origin
JOURNAL J. Virol. 64, 1207-16 (1990)
STANDARD full staff_review
COMMENT Kindly provided prior to publication in computer-readable form by
R. De Leys, Innogenetics, Antwerp.
BASE COUNT 162 a 133 c 153 g 135 t
ORIGIN 5'-terminus of 5'LTR.
1 tggaagggtt aatttactcc cataaaagag cagaaatcct ggatctttgg gtgtataaca
61 ctcagggatt cttccctgat tggcagaact acacaccagg accaggaacc aggttcccac
121 tgacatttgg atggttgttt aaactagtac cagtgtcaga agaagaggca gaaagactag
181 gaaatacatg tgagagggct aatctcctgc atccagcatg tgcccatgga tttgaagata
241 cacataaaga aatactgatg tggaagtttg atagatctct aggcaacacc catgttgcta
301 tgataactca cccagagctc ttccagaagg actaaaaact gctgacctga agattgctga
361 cactgtggaa ctttccagca aagactgctg acactgcggg gactttccag tgggagggac
421 agggggcggt tcggggagtg gctaaccctc agaagctgca tataagcagc cgctttctgc
481 ttgtaccggg tctcggttag aggaccaggt ctgagcccgg gagctccctg gcctctagct
541 gaacccgctc gttaacgctc aataaagctt gccttgagtg aga
//
LOCUS HIVBAN1 59 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral gag region.
ACCESSION M27953
KEYWORDS .
SEGMENT 1 of 2
SOURCE Human immunodeficiency virus type 1 (isolate BAN) proviral DNA.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 59)
AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
Wain-Hobson,S.
TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for
viral mixtures and preferred nucleotide substitutions
JOURNAL j acquired immuno defic syndr 2, 344-52 (1989)
STANDARD full staff_review
COMMENT [1] kindly submitted in computer readable form by M. Goodenow.
Sibling sequences are available (see 1989: I-A-172).
BASE COUNT 26 a 7 c 10 g 16 t
ORIGIN
1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacta
//
LOCUS HIVBAN2 250 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral gag region.
ACCESSION M27944
KEYWORDS .
SEGMENT 2 of 2
SOURCE Human immunodeficiency virus type 1 (isolate BAN) proviral DNA.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 250)
AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
Wain-Hobson,S.
TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for
viral mixtures and preferred nucleotide substitutions
JOURNAL j acquired immuno defic syndr 2, 344-52 (1989)
STANDARD full staff_review
COMMENT Sibling sequences are available (see 1989: I-A-173).
BASE COUNT 96 a 40 c 42 g 72 t
ORIGIN
1 aaattaaccc cactctgtgt tactttaaat tgcactaagt tgaagaatgt tacttatgcc
61 aataatagta gtagggaaaa agaagaaatg aaaaactgct ctttcaccac aagaataggg
121 aataaggtgc agaaagaata tgcacttttt tataaacttg atgtagtacc aatagataag
181 gataatacaa gctttatatt gatacattgt aatacctcga ccattacaca ggcttgtcca
241 aaggtatcct
//
LOCUS HIVBRU 9229 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete
genome.
ACCESSION K02013
KEYWORDS complete genome.
SOURCE Human immunodeficiency virus type 1 (HIV-1), isolate BRU proviral
DNA clone lambda-J19.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9229)
AUTHORS Wain-Hobson,S., Sonigo,P., Danos,O., Cole,S. and Alizon,M.
TITLE Nucleotide sequence of the AIDS Virus, LAV
JOURNAL Cell 40, 9-17 (1985)
STANDARD full staff_review
REFERENCE 2 (bases 1713 to 1748; revision of [1])
AUTHORS Alizon,M., Wain-Hobson,S., Montagnier,L. and Sonigo,P.
TITLE Genetic variability of the AIDS virus: Nucleotide sequence analysis
of two isolates from African patients
JOURNAL Cell 46, 63-74 (1986)
STANDARD full staff_review
COMMENT The original LAV, sometimes called LAV-1 to distinguish it from
HIV2 (LAV-2), is now referred to as HIV-1bru. An infectious clone
of this virus has been constructed by Keith Peden, Molecular
Biology and Genetics, Johns Hopkins University School of Medicine,
Baltimore, MD 21205 (301) 955-3652. HIVNL43 is also an infectious
clone having for its 3' half a clone of the BRU isolate. The tat
sequence for the L20 clone is available (see 1989: I-A-54).
FEATURES from to/span description
pept 336 1874 gag polyprotein
pept < 1631 4678 pol polyprotein (NH2-terminus uncertain; AA at
1631)
pept 4623 5201 vif protein
pept 5141 5431 vpr protein
pept 5412 5626 tat protein, exon 2 (first expressed exon)
7972 8017 tat protein, exon 3 (AA at 7973)
pept 5551 5626 rev protein, exon 2 (first expressed exon)
7972 8246 rev protein, exon 3 (AA at 7974)
pept 5643 5888 vpu protein
pept 5803 8388 envelope polyprotein
pept 8390 9010 nef protein
IVS 290 5358 tat, rev, nef subgenomic mRNA intron 1
IVS 5627 7971 tat cds intron 2
IVS 5627 7971 rev cds intron 2
LTR < 1 180 5' LTR
LTR 8679 > 9229 3' LTR
rpt < 1 97 R repeat 5' copy
rpt 9133 9229 R repeat 3' copy
binding 182 199 primer (Lys-tRNA) binding site
revision 1712 1749 atttcttcagagcagaccagagccaacagccccaccag in [2];
ag in [1]
signal 9205 9210 poly-A signal
BASE COUNT 3289 a 1656 c 2232 g 2052 t
ORIGIN Cap site of genomic RNA.
1 ggtctctctg gttagaccag atttgagcct gggagctctc tggctaacta gggaacccac
61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt
121 gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca
181 gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag
241 gactcggctt gctgaagcgc gcacggcaag aggcgagggg aggcgactgg tgagtacgcc
301 aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa
361 gcgggggaga attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat
421 ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg
481 gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc
541 agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc
601 atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa
661 acaaaagtaa gaaaaaagca cagcaagcag cagctgacac aggacacagc agccaggtca
721 gccaaaatta ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac
781 ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga
841 tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa
901 acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag
961 ctgcagaatg ggatagagtg catccagtgc atgcagggcc tattgcacca ggccagatga
1021 gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat
1081 ggatgacaaa taatccacct atcccagtag gagaaattta taaaagatgg ataatcctgg
1141 gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac
1201 caaaagaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag
1261 cttcacagga ggtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcgaacccag
1321 attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag
1381 catgtcaggg agtgggagga cccggccata aggcaagagt tttggctgaa gcaatgagcc
1441 aagtaacaaa ttcagctacc ataatgatgc aaagaggcaa ttttaggaac caaagaaaga
1501 ttgttaagtg tttcaattgt ggcaaagaag ggcacatagc cagaaattgc agggccccta
1561 ggaaaaaggg ctgttggaaa tgtggaaagg aaggacacca aatgaaagat tgtactgaga
1621 gacaggctaa ttttttaggg aagatctggc cttcctacaa gggaaggcca gggaattttc
1681 ttcagagcag accagagcca acagccccac catttcttca gagcagacca gagccaacag
1741 ccccaccaga agagagcttc aggtctgggg tagagacaac aactccctct cagaagcagg
1801 agccgataga caaggaactg tatcctttaa cttccctcag atcactcttt ggcaacgacc
1861 cctcgtcaca ataaagatag gggggcaact aaaggaagct ctattagata caggagcaga
1921 tgatacagta ttagaagaaa tgagtttgcc aggaagatgg aaaccaaaaa tgataggggg
1981 aattggaggt tttatcaaag taagacagta tgatcagata ctcatagaaa tctgtggaca
2041 taaagctata ggtacagtat tagtaggacc tacacctgtc aacataattg gaagaaatct
2101 gttgactcag attggttgca ctttaaattt tcccattagt cctattgaaa ctgtaccagt
2161 aaaattaaag ccaggaatgg atggcccaaa agttaaacaa tggccattga cagaagaaaa
2221 aataaaagca ttagtagaaa tttgtacaga aatggaaaag gaagggaaaa tttcaaaaat
2281 tgggcctgaa aatccataca atactccagt atttgccata aagaaaaaag acagtactaa
2341 atggagaaaa ttagtagatt tcagagaact taataagaga actcaagact tctgggaagt
2401 tcaattagga ataccacatc ccgcagggtt aaaaaagaaa aaatcagtaa cagtactgga
2461 tgtgggtgat gcatattttt cagttccctt agatgaagac ttcaggaagt atactgcatt
2521 taccatacct agtataaaca atgagacacc agggattaga tatcagtaca atgtgcttcc
2581 acagggatgg aaaggatcac cagcaatatt ccaaagtagc atgacaaaaa tcttagagcc
2641 ttttagaaaa caaaatccag acatagttat ctatcaatac atggatgatt tgtatgtagg
2701 atctgactta gaaatagggc agcatagaac aaaaatagag gagctgagac aacatctgtt
2761 gaggtgggga cttaccacac cagacaaaaa acatcagaaa gaacctccat tcctttggat
2821 gggttatgaa ctccatcctg ataaatggac agtacagcct atagtgctgc cagaaaaaga
2881 cagctggact gtcaatgaca tacagaagtt agtgggaaaa ttgaattggg caagtcagat
2941 ttacccaggg attaaagtaa ggcaattatg taaactcctt agaggaacca aagcactaac
3001 agaagtaata ccactaacag aagaagcaga gctagaactg gcagaaaaca gagagattct
3061 aaaagaacca gtacatggag tgtattatga cccatcaaaa gacttaatag cagaaataca
3121 gaagcagggg caaggccaat ggacatatca aatttatcaa gagccattta aaaatctgaa
3181 aacaggaaaa tatgcaagaa cgaggggtgc ccacactaat gatgtaaaac aattaacaga
3241 ggcagtgcaa aaaataacca cagaaagcat agtaatatgg ggaaagactc ctaaatttaa
3301 actacccata caaaaggaaa catgggaaac atggtggaca gagtattggc aagccacctg
3361 gattcctgag tgggagtttg tcaatacccc tcctttagtg aaattatggt accagttaga
3421 gaaagaaccc atagtaggag cagaaacgtt ctatgtagat ggggcagcta gcagggagac
3481 taaattagga aaagcaggat atgttactaa tagaggaaga caaaaagttg tcaccctaac
3541 tgacacaaca aatcagaaga ctgagttaca agcaattcat ctagctttgc aggattcggg
3601 attagaagta aatatagtaa cagactcaca atatgcatta ggaatcattc aagcacaacc
3661 agataaaagt gaatcagagt tagtcaatca aataatagag cagttaataa aaaaggaaaa
3721 ggtctatctg gcatgggtac cagcacacaa aggaattgga ggaaatgaac aagtagataa
3781 attagtcagt gctggaatca ggaaagtact atttttagat ggaatagata aggcccaaga
3841 tgaacatgag aaatatcaca gtaattggag agcaatggct agtgatttta acctgccacc
3901 tgtagtagca aaagaaatag tagccagctg tgataaatgt cagctaaaag gagaagccat
3961 gcatggacaa gtagactgta gtccaggaat atggcaacta gattgtacac atttagaagg
4021 aaaagttatc ctggtagcag ttcatgtagc cagtggatat atagaagcag aagttattcc
4081 agcagaaaca gggcaggaaa cagcatactt tcttttaaaa ttagcaggaa gatggccagt
4141 aaaaacaata catacagaca atggcagcaa tttcaccagt actacggtta aggccgcctg
4201 ttggtgggcg ggaatcaagc aggaatttgg aattccctac aatccccaaa gtcaaggagt
4261 agtagaatct atgaataaag aattaaagaa aattataggc caggtaagag atcaggctga
4321 acatcttaag acagcagtac aaatggcagt attcatccac aattttaaaa gaaaaggggg
4381 gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag acatacaaac
4441 taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt acagggacag
4501 cagagatcca ctttggaaag gaccagcaaa gctcctctgg aaaggtgaag gggcagtagt
4561 aatacaagat aatagtgaca taaaagtagt gccaagaaga aaagcaaaga tcattaggga
4621 ttatggaaaa cagatggcag gtgatgattg tgtggcaagt agacaggatg aggattagaa
4681 catggaaaag tttagtaaaa caccatatgt atgtttcagg gaaagctagg ggatggtttt
4741 atagacatca ctatgaaagc cctcatccaa gaataagttc agaagtacac atcccactag
4801 gggatgctag attggtaata acaacatatt ggggtctgca tacaggagaa agagactggc
4861 atctgggtca gggagtctcc atagaatgga ggaaaaagag atatagcaca caagtagacc
4921 ctgaactagc agaccaacta attcatctgt attactttga ctgtttttca gactctgcta
4981 taagaaaggc cttattagga catatagtta gccctaggtg tgaatatcaa gcaggacata
5041 acaaggtagg atctctacaa tacttggcac tagcagcatt aataacacca aaaaagataa
5101 agccaccttt gcctagtgtt acgaaactga cagaggatag atggaacaag ccccagaaga
5161 ccaagggcca cagagggagc cacacaatga atggacacta gagcttttag aggagcttaa
5221 gaatgaagct gttagacatt ttcctaggat ttggctccat ggcttagggc aacatatcta
5281 tgaaacttat ggggatactt gggcaggagt ggaagccata ataagaattc tgcaacaact
5341 gctgtttatc catttcagaa ttgggtgtcg acatagcaga ataggcgtta ctcaacagag
5401 gagagcaaga aatggagcca gtagatccta gactagagcc ctggaagcat ccaggaagtc
5461 agcctaaaac tgcttgtacc acttgctatt gtaaaaagtg ttgctttcat tgccaagttt
5521 gtttcacaac aaaagcctta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa
5581 gacctcctca aggcagtcag actcatcaag tttctctatc aaagcagtaa gtagtacatg
5641 taatgcaacc tatacaaata gcaatagcag cattagtagt agcaataata atagcaatag
5701 ttgtgtggtc catagtaatc atagaatata ggaaaatatt aagacaaaga aaaatagaca
5761 ggttaattga tagactaata gaaagagcag aagacagtgg caatgagagt gaaggagaaa
5821 tatcagcact tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg
5881 atctgtagtg ctacagaaaa attgtgggtc acagtctatt atggggtacc tgtgtggaag
5941 gaagcaacca ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat
6001 aatgtttggg ccacacatgc ctgtgtaccc acagacccca acccacaaga agtagtattg
6061 gtaaatgtga cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcatgag
6121 gatataatca gtttatggga tcaaagccta aagccatgtg taaaattaac cccactctgt
6181 gttagtttaa agtgcactga tttggggaat gctactaata ccaatagtag taataccaat
6241 agtagtagcg gggaaatgat gatggagaaa ggagagataa aaaactgctc tttcaatatc
6301 agcacaagca taagaggtaa ggtgcagaaa gaatatgcat ttttttataa acttgatata
6361 ataccaatag ataatgatac taccagctat acgttgacaa gttgtaacac ctcagtcatt
6421 acacaggcct gtccaaaggt atcctttgag ccaattccca tacattattg tgccccggct
6481 ggttttgcga ttctaaaatg taataataag acgttcaatg gaacaggacc atgtacaaat
6541 gtcagcacag tacaatgtac acatggaatt aggccagtag tatcaactca actgctgttg
6601 aatggcagtc tagcagaaga agaggtagta attagatctg ccaatttcac agacaatgct
6661 aaaaccataa tagtacagct gaaccaatct gtagaaatta attgtacaag acccaacaac
6721 aatacaagaa aaagtatccg tatccagagg ggaccaggga gagcatttgt tacaatagga
6781 aaaataggaa atatgagaca agcacattgt aacattagta gagcaaaatg gaatgccact
6841 ttaaaacaga tagctagcaa attaagagaa caatttggaa ataataaaac aataatcttt
6901 aagcaatcct caggagggga cccagaaatt gtaacgcaca gttttaattg tggaggggaa
6961 tttttctact gtaattcaac acaactgttt aatagtactt ggtttaatag tacttggagt
7021 actgaagggt caaataacac tgaaggaagt gacacaatca cactcccatg cagaataaaa
7081 caatttataa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagcgga
7141 caaattagat gttcatcaaa tattacaggg ctgctattaa caagagatgg tggtaataac
7201 aacaatgggt ccgagatctt cagacctgga ggaggagata tgagggacaa ttggagaagt
7261 gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca
7321 aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg
7381 ttcttgggag cagcaggaag cactatgggc gcacggtcaa tgacgctgac ggtacaggcc
7441 agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg
7501 caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg
7561 gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa
7621 ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag
7681 atttggaata acatgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta
7741 atacattcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg
7801 gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat
7861 ataaaaatat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta
7921 ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc
7981 ccaaccccga ggggacccga caggcccgaa ggaatagaag aagaaggtgg agagagagac
8041 agagacagat ccattcgatt agtgaacgga tccttagcac ttatctggga cgatctgcgg
8101 agcctgtgcc tcttcagcta ccaccgcttg agagacttac tcttgattgt aacgaggatt
8161 gtggaacttc tgggacgcag ggggtgggaa gccctcaaat attggtggaa tctcctacag
8221 tattggagtc aggaactaaa gaatagtgct gttagcttgc tcaatgccac agccatagca
8281 gtagctgagg ggacagatag ggttatagaa gtagtacaag gagcttgtag agctattcgc
8341 cacataccta gaagaataag acagggcttg gaaaggattt tgctataaga tgggtggcaa
8401 gtggtcaaaa agtagtgtgg ttggatggcc tactgtaagg gaaagaatga gacgagctga
8461 gccagcagca gatggggtgg gagcagcatc tcgagacctg gaaaaacatg gagcaatcac
8521 aagtagcaat acagcagcta ccaatgctgc ttgtgcctgg ctagaagcac aagaggagga
8581 ggaggtgggt tttccagtca cacctcaggt acctttaaga ccaatgactt acaaggcagc
8641 tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca
8701 acgaagacaa gatatccttg atctgtggat ctaccacaca caaggctact tccctgattg
8761 gcagaactac acaccagggc caggggtcag atatccactg acctttggat ggtgctacaa
8821 gctagtacca gttgagccag ataaggtaga agaggccaat aaaggagaga acaccagctt
8881 gttacaccct gtgagcctgc atggaatgga tgaccctgag agagaagtgt tagagtggag
8941 gtttgacagc cgcctagcat ttcatcacgt ggcccgagag ctgcatccgg agtacttcaa
9001 gaactgctga catcgagctt gctacaaggg actttccgct ggggactttc cagggaggcg
9061 tggcctgggc gggactgggg agtggcgagc cctcagatgc tgcatataag cagctgcttt
9121 ttgcctgtac tgggtctctc tggttagacc agatttgagc ctgggagctc tctggctaac
9181 tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttca
//
LOCUS HIVELI 9176 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete
genome.
ACCESSION K03454
KEYWORDS complete genome.
SOURCE Human immunodeficiency virus type 1, isolate ELI, proviral DNA,
clone E-H12.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9176)
AUTHORS Alizon,M., Wain-Hobson,S., Montagnier,L. and Sonigo,P.
TITLE Genetic variability of the AIDS virus: Nucleotide sequence analysis
of two isolates from African patients
JOURNAL Cell 46, 63-74 (1986)
STANDARD full staff_review
COMMENT Sibling sequences of the ELI isolate are available (see 1989:
I-179)
FEATURES from to/span description
pept 336 1838 gag polyprotein
pept < 1634 4642 pol polyprotein (NH2-terminus uncertain; AA at
1634)
pept 4587 5165 vif protein
pept 5105 5395 vpr protein
pept 5376 5590 tat protein, exon 2 (first expressed exon)
7915 7999 tat protein, exon 3 (AA at 7916)
pept 5515 5590 rev protein, exon 2 (first expressed exon)
7915 8195 rev protein, exon 3 (AA at 7917)
pept 5607 5852 vpu protein
pept 5770 8331 envelope polyprotein precursor
pept 8333 8953 nef protein
IVS 5591 7914 tat cds intron 2
IVS 5591 7914 rev cds intron 2
LTR < 1 180 5' LTR
LTR 8625 > 9176 3' LTR
rpt < 1 97 R repeat 5' copy
rpt 9078 9175 R repeat 3' copy
binding 182 199 primer (Lys-tRNA) binding site
signal 9151 9156 mRNA polyadenylation signal
BASE COUNT 3333 a 1632 c 2179 g 2032 t
ORIGIN Cap site of genomic RNA.
1 ggtctctctg gttagaccag atttgagcct gggagctctc tggctagcta gggaacccac
61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt
121 gtgactctgg taactagaga tccctcagac ccctttagtc agagtggaaa atctctagca
181 gtggcgcccg aacagggacc tgaaagcgaa agtagaacca gaggagctct ctcgacgcag
241 gactcggctt gctgaagcgc gcacggcaag aggcgagggg cagcgactgg tgagtacgct
301 aaaatttttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa
361 gcgggggaaa attagataaa tgggaaaaaa ttcggttacg gccaggagga aagaaaaaat
421 atagactaaa acatatagta tgggcaagca gggagctaga acgatatgca cttaatcctg
481 gccttttaga aacatcagaa ggctgtaaac aaataatagg gcagctacaa ccagctattc
541 agacaggaac agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtac
601 ataaaggaat agatgtaaaa gacaccaagg aagctttaga aaagatggag gaagagcaaa
661 acaaaagtaa gaaaaaggca cagcaagcag cagctgacac aggaaacaac agccaggtca
721 gccaaaatta tcctatagtg cagaacctac aggggcaaat ggtacatcag gccatatcac
781 ctagaacttt gaacgcatgg gtaaaagtaa tagaagaaaa ggctttcagc ccagaagtaa
841 tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa
901 acacagtggg gggacatcaa gcagccatgc aaatgctaaa agagaccatc aatgaagaag
961 ctgcagaatg ggataggtta catccagtgc atgcagggcc tattgcacca ggccagatga
1021 gagaaccaag gggaagtgat atagcaggaa ctactagtac ccttcaggaa caaatagcat
1081 ggatgacaag taacccacct atcccagtag gagaaatcta taaaagatgg ataattgtgg
1141 gattaaataa aatagtaaga atgtatagcc ctgtcagcat tttggacata agacagggac
1201 caaaggaacc ttttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag
1261 cttcacagga tgtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcaaacccag
1321 attgcaagac tatcttaaaa gcattgggac cacaggctac actagaagaa atgatgacag
1381 catgtcaggg agtggggggg cccagccata aagcaagagt tctggctgag gcaatgagcc
1441 aagcaacaaa ttcagttact acagcaatga tgcagagagg caattttaag ggcccaagaa
1501 aaattattaa gtgtttcaat tgtggcaaag aagggcacat agcaaaaaat tgcagggccc
1561 ctaggaaaaa gggctgttgg agatgtggaa aggaaggaca ccaactaaaa gattgcactg
1621 agagacaggc taatttttta gggagaattt ggccttccca caagggaagg ccggggaact
1681 ttctccaaag cagaccagag ccaacagccc caccagcaga gagcttcggg tttggggaag
1741 agataacccc ctctcaaaaa caggagcaga aagacaagga actgtatcct ttaacttccc
1801 tcaaatcact ctttggcaac gaccccttgt cgcaataaaa atagggggac agctaaagga
1861 agctctatta gatacaggag cagatgatac agtattagaa gaaatgaatt tgccaggaaa
1921 atggaaacca aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca
1981 aatacccata gaaatctgtg gacagaaagc tataggtaca gtattagtag gacctacgcc
2041 tgtcaacata atcggaagaa atttgttgac ccagattggc tgcactttaa attttccaat
2101 tagtcctatt gaaactgtac cagtaaaatt aaagccagga atggatggcc caaaagttaa
2161 acaatggcca ttgacagaag aaaaaataaa agcattaaca gaaatttgta cagatatgga
2221 aaaggaagga aaaatttcaa gaattgggcc tgaaaatcca tacaatactc caatatttgc
2281 cataaagaaa aaagacagta ccaagtggag aaaattagta gatttcagag aacttaataa
2341 gagaactcaa gatttctggg aagttcaatt aggaataccg catcctgcag ggctgaaaaa
2401 gaaaaaatca gtaacagtac tggatgtggg tgatgcatat ttttcagttc ccttagatga
2461 agattttagg aaatataccg cctttaccat atctagtata aacaatgaga caccagggat
2521 tagatatcag tacaatgtgc ttccacaggg atggaaagga tcaccggcaa tattccaaag
2581 tagcatgaca aaaatcttag agccctttag aaaacaaaat ccagaaatgg ttatctatca
2641 atacatggat gatttgtatg taggatctga cttagaaata gggcagcata ggacaaaaat
2701 agagaaatta agagaacatc tattgaggtg gggatttacc agaccagata aaaaacatca
2761 gaaagaaccc ccatttcttt ggatgggtta tgaactccat cctgataaat ggacagtaca
2821 gtctataaaa ctgccagaaa aggagagctg gactgtcaat gatatacaga acttagtgga
2881 gagattaaac tgggcaagcc agatttatcc aggaattaaa gtaagacaat tatgtaaact
2941 ccttagggga accaaagcac taacagaagt aataccacta acagaagaag cagaattaga
3001 actggcagaa aacagggaaa ttttaaaaga accagtacat ggagtgtatt atgacccatc
3061 aaaagactta atagcagaaa tacagaaaca agggcacggc caatggacat accaaattta
3121 tcaagaacca tttaaaaatc tgaaaacagg aaagtatgca agaatgaggg gtgcccacac
3181 taatgatgta aagcaattag cagaggcagt gcaaagaata tccacagaaa gcatagtgat
3241 atggggaagg actcctaaat ttagactacc catacaaaag gaaacatggg aaacatggtg
3301 ggcagagtat tggcaagcca cttggattcc tgagtgggaa tttgtcaata cccctccttt
3361 agtaaaatta tggtaccagt tagagaagga acccataata ggagcagaaa ctttctatgt
3421 agatggggca gctaatagag agactaaatt aggaaaagca ggatatgtta ctgacagagg
3481 aagacagaaa gttgtccctt tgactgacac gacaaatcag aagactgagt tacaagcaat
3541 taatctagcc ttgcaggatt cgggattaga agtaaacata gtaacagatt cacaatatgc
3601 attaggaatc attcaagcac aaccagataa gagtgaatca gagttagtca atcaaataat
3661 agagcagtta ataaaaaagg aaaaggttta cctggcatgg gtaccagcac acaaaggaat
3721 tggaggaaat gaacaagtag ataaattagt cagtcaagga atcaggaaag tactattttt
3781 ggatggaata gataaggctc aagaagaaca tgagaaatat cacaacaatt ggagagcaat
3841 ggctagtgat tttaacctac cacccgtggt agcaaaagaa atagtagcta gctgtgataa
3901 atgtcagcta aaaggagaag ccatgcatgg acaagtagac tgtagtccag gaatatggca
3961 attagattgt acacacttag aaggaaaagt tatcctggta gcagttcatg tagccagtgg
4021 ctatatagaa gcagaagtta ttccagcaga aacagggcag gaaacagcat attttctttt
4081 aaaattagca ggaagatggc cagtaaaagt agtacataca gacaatggca gcaatttcac
4141 cagtgctgca gttaaggccg cctgttggtg ggcaggtatc aaacaggaat ttggaattcc
4201 ctacaatccc caaagtcaag gagtagtaga atctatgaat aaagaattaa agaaaattat
4261 aggacaggta agagatcaag ctgaacatct taagacagca gtacaaatgg cagtattcat
4321 ccacaatttt aaaagaagaa gggggattgg gggatacagt gcaggggaaa gaataataga
4381 cataatagca acagacatac aaactaaaga attacaaaaa caaattataa aaattcaaaa
4441 ttttcgggtt tattacagag acagcagaga tccaatttgg aaaggaccag caaagctcct
4501 ctggaaaggt gaaggggcag tagtaataca agacaagagt gacataaagg tagtaccaag
4561 aagaaaagta aagattatta gggattatgg aaaacagatg gcaggtgatg attgtgtggc
4621 aagtagacag gatgaggatt aaaacatgga aaagtttagt aaaacaccat atgtatgttt
4681 caaagaaagc taacagatgg ttttatagac atcactatga aagcccccac ccaaaaataa
4741 gttcagaagt acacatccca ctaggagaag ctagactggt aataaaaaca tattggggtc
4801 tgcatacagg agaaagagaa tggcatctgg gtcagggagt ctccatagaa tggaggaaaa
4861 ggagatatag cacacaagta gaccctggcc tggcagacca actaattcat atgtattatt
4921 ttgattgttt ttcagaatct gctataagaa aagccatatt aggagatata gttagtccta
4981 ggtgtgagta tcaagcagga cataacaagg taggatccct acagtatttg gcactaacag
5041 cattaatagc accaaaacag ataaagccac ctttgcctag tgttaggaag ctaacagaag
5101 atagatggaa caagccccag cagaccaggg gccacagagg gagccataca atgaatgggc
5161 attagagctt ttagaggagc ttaagagtga agctgttaga cattttccta ggatatggct
5221 ccatagctta ggacaacata tttatgaaac ttatggggat acctgggtag gagttgaagc
5281 tataataaga atactgcaac aattactgtt tattcatttc agaattgggt gtcaacatag
5341 cagaataggc attattcgac agagaagagc aagaaatgga tccagtagat cctaacctag
5401 agccctggaa ccatccagga agtcagccta ggactccttg taacaagtgt cattgtaaaa
5461 agtgttgcta tcattgccca gtttgcttct taaacaaagg cttaggcatc tcctatggca
5521 ggaagaagcg gagacagcga cgaggacctc ctcaaggcgg tcaggctcat caagttccta
5581 taccaaagca gtaagtagta catgtaatgc aacctttagg gataatagca atagcagcat
5641 tagtagtagc aataatacta gcaatagttg tgtggaccat agtattcata gaatatagaa
5701 ggataaaaaa gcaaaggaga atagactgtt tacttgatag aataacagaa agagcagaag
5761 acagtggcaa tgagagcgag ggggatagag agaaattgtc aaaactggtg gaaatggggc
5821 atcatgctcc ttgggatatt gatgacctgt agtgctgcag acaatctgtg ggtcacagtt
5881 tattatgggg tgcctgtatg gaaggaagca accaccactc tattttgtgc atcagatgct
5941 aaatcatatg aaacagaggc acataatatc tgggccacac atgcctgtgt acccacggac
6001 cccaacccac aagaaatagc actggaaaat gtgacagaaa actttaacat gtggaaaaat
6061 aacatggtgg aacagatgca tgaggatata atcagtttat gggatcaaag cctaaaacca
6121 tgtgtaaaat taaccccact ctgtgtcact ttaaactgta gtgatgaatt gaggaacaat
6181 ggcactatgg ggaacaatgt cactacagag gagaaaggaa tgaaaaactg ctctttcaat
6241 gtaaccacag tactaaaaga taagaagcag caagtatatg cactttttta tagacttgat
6301 atagtaccaa tagacaatga tagtagtacc aatagtacca attataggtt aataaattgt
6361 aatacctcag ccattacaca ggcttgtcca aaggtatcct ttgagccaat tcccatacat
6421 tattgtgccc cagctggttt tgcgattcta aagtgtagag ataagaagtt caatggaaca
6481 ggcccatgca caaatgtcag cacagtacaa tgtacacatg gaattaggcc agtggtgtca
6541 actcaactgc tgttgaatgg cagtctagca gaagaagagg tcataattag atccgaaaat
6601 ctcacaaaca atgctaaaaa cataatagca catcttaatg aatctgtaaa aattacctgt
6661 gcaaggccct atcaaaatac aagacaaaga acacctatag gactagggca atcactctat
6721 actacaagat caagatcaat aataggacaa gcacattgta atattagtag agcacaatgg
6781 agtaaaactt tacaacaagt agctagaaaa ttaggaaccc ttcttaacaa aacaataata
6841 aagtttaaac catcctcagg aggggaccca gaaattacaa cacacagttt taattgtgga
6901 ggggaattct tctactgtaa tacatcagga ctgtttaata gtacatggaa tattagtgca
6961 tggaataata ttacagagtc aaataatagc acaaacacaa acatcacact ccaatgcaga
7021 ataaaacaaa ttataaagat ggtggcaggc aggaaagcaa tatatgcccc tcctatcgaa
7081 agaaacattc tatgttcatc aaatattaca gggctactat tgacaagaga tggtggtata
7141 aataatagta ctaacgagac ctttagacct ggaggaggag atatgaggga caattggaga
7201 agtgaattat ataaatataa ggtagtacaa attgaaccac taggagtagc acccaccagg
7261 gcaaagagaa gagtggtgga aagagaaaaa agagcaatag gattaggagc tatgttcctt
7321 gggttcttgg gagcagcagg aagcacgatg ggcgcacggt cagtgacgct gacggtacag
7381 gccagacaat taatgtctgg tatagtgcaa cagcaaaaca atttgctgag ggctatagag
7441 gcgcaacagc atctgttgca actcacggtc tggggcatta aacagctcca ggcaagaatc
7501 ctggctgtgg aaagatacct aaaggatcaa cagctcctag gaatttgggg ttgctctgga
7561 aaacacattt gcaccactaa tgtgccctgg aactctagtt ggagtaatag atctctaaat
7621 gagatttggc agaacatgac ctggatggag tgggaaagag aaattgacaa ttacacaggc
7681 ttaatatata gcttaattga ggaatcgcag acccagcaag aaaagaatga aaaagaattg
7741 ttggaattgg acaagtgggc aagtttgtgg aattggttta gcataacaca atggctgtgg
7801 tatataaaaa tattcataat gataatagga ggcttgatag gtttaagaat agtttttgct
7861 gtgctttctt tagtaaatag agttaggcag ggatactcac ctctgtcgtt tcagaccctc
7921 ctcccagccc cgaggggacc cgacaggccc gaaggaacag aagaagaagg tggagagcga
7981 ggcagagaca gatccgtgag attgctgaac ggattctcgg cacttatctg ggacgacctg
8041 cggagcctgt gcctcttcag ctaccaccgc ttgagagact taatcttaat tgcagtgagg
8101 attgtagaac ttctgggacg cagggggtgg gacatcctca aatatctgtg gaatctccta
8161 cagtattgga gtcaggaact gaggaacagt gctagtagct tgtttgatgc catagcaata
8221 gcagtagctg aggggacaga tagagttata gaaataatac aaagagcttg cagagctgtt
8281 cttaacatac ccagaagaat aagacagggc ttagaaaggt ctttacttta aaatgggtgg
8341 caaatggtca aaaagtagta tagtgggatg gcctgctata agggaaagaa taagaagaac
8401 taatccagca gcagatgggg taggagcagt atctcgagac ctggaaaaac atggggcaat
8461 cacaagtagc aatacagcaa gtactaatgc tgactgtgcc tggctagaag cacaagaaga
8521 gagcgacgag gtgggctttc cagtcagacc ccaggtacct ttaagaccaa tgacttacaa
8581 agaagctcta gatctcagcc actttttaaa agaaaagggg ggactggaag ggctaatttg
8641 gtccaaaaag agacaagaga tccttgatct ttgggtctac aacacacaag gcatcttccc
8701 tgattggcaa aactacacac cagggccagg gatcagatat ccactaacct ttggatggtg
8761 ctacgagcta gtaccagttg atccacagga ggtagaagaa gacactgaag gagagaccaa
8821 cagcttgtta caccctatat gccagcatgg aatggaggac ccggagagac aagtgttaaa
8881 atggagattt aacagcagac tagcatttga gcacaaggcc cgagagatgc atccggagtt
8941 ctacaaaaac tgatgacacc gagctttcta caagggactt tccgctgggg actttccagg
9001 gaggcgtgga ctgggcggga ctggggagtg gctaaccctc agatgctgca tataagcagc
9061 tgctttttgc ctgtactggg tctctctggt tagaccagat ttgagcctgg gagctctctg
9121 gctagctagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaa
//
LOCUS HIVHXB2 9718 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete
genome.
ACCESSION K03455
KEYWORDS .
SOURCE Human immunodeficiency virus type 1 (isolate HXB2) proviral DNA.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 653)
AUTHORS Starcich,B., Ratner,L., Josephs,S.F., Okamato,T., Gallo,R.C. and
Wong-Staal,F.
TITLE Characterization of long terminal repeat sequences of HTLV-III
JOURNAL Science 227, 538-540 (1985)
STANDARD full staff_review
REFERENCE 2 (bases 493 to 674; 9577 to 9718)
AUTHORS Ratner,L., Haseltine,W., Patarca,R., Livak,K.J., Starcich,B.,
Josephs,S.J., Doran,E.R., Rafalski,J.A., Whitehorn,E.A.,
Baumeister,K., Ivanoff,L., Petteway,S.R.Jr., Pearson,M.L.,
Lautenberger,J.A., Papas,T.S., Ghrayeb,J., Chang,N.T.,
Gallo,R.C. and Wong-Staal,F.
TITLE Complete nucleotide sequence of the AIDS virus, HTLV-III
JOURNAL Nature 313, 277-284 (1985)
STANDARD full staff_review
REFERENCE 3 (bases 1 to 9718)
AUTHORS Ratner,L., Fisher,A., Jagodzinski,L.L., Mitsuya,H., Liou,R.-S.,
Gallo,R.C. and Wong-Staal,F.
TITLE Complete nucleotide sequences of functional clones of the AIDS
virus
JOURNAL AIDS Res. Hum. Retroviruses 3, 57-69 (1987)
STANDARD full staff_review
REFERENCE 4 (bases 5611 to 5611; revises [3])
AUTHORS Ratner,L.
JOURNAL Unpublished (1987) Washington U. Med School, St. Louis, MO
STANDARD full staff_review
REFERENCE 5 (bases 6224 to 8794)
AUTHORS Reitz,M.S.
JOURNAL Unpublished (1988)
STANDARD full staff_review
COMMENT Submitted in computer-readable form by L. Ratner. A second
envelope gene sequence for the HXB2D clone has been independently
determined by Reitz and co-workers ([5]); the seven or so
differences from the Ratner sequence ([3] and [4]) have not been
resolved nor annotated herein.
The vpU cds annotated below does not possess a start codon in the
normal position (bases 6061 to 6063; "ACG"). As there are no
nearby in-frame upstream or downstream start codons, HXB2 may not
possess a functional vpU cds. This clone is infectious [3].
Sibling sequences are available (see 1989: I-A-47).
FEATURES from to/span description
pept 789 2291 gag polyprotein
pept < 2084 5095 pol polyprotein (NH2-terminus uncertain; AA at
2084)
pept 5040 5618 vif protein
pept 5558 5794 vpr protein
pept 5830 6044 tat protein, exon 2 (first expressed exon)
8378 8423 tat protein, exon 3 (AA at 8379)
pept 5969 6044 rev protein, exon 2 (first expressed exon)
8378 8652 rev protein, exon 3 (AA at 8380)
pept < 6061 6309 vpu protein (AA at 6061, put.)
pept 6224 8794 envelope polyprotein
pept 8796 9167 nef protein (premature termination)
IVS 743 5776 tat, rev, nef mRNA intron 1
IVS 6045 8377 tat cds intron 2
IVS 6045 8377 rev cds intron 2
IVS 6045 8377 tat, rev, nef mRNA intron 2
LTR 1 634 5' LTR
LTR 9085 9718 3' LTR
rpt 454 551 R repeat 5' copy
rpt 9538 9635 R repeat 3' copy
binding 636 653 primer (Lys-tRNA) binding site
signal 9611 9616 poly-A signal
BASE COUNT 3411 a 1773 c 2370 g 2164 t
ORIGIN 5' terminus of 5' LTR-U3.
1 tggaagggct aattcactcc caacgaagac aagatatcct tgatctgtgg atctaccaca
61 cacaaggcta cttccctgat tagcagaact acacaccagg gccagggatc agatatccac
121 tgacctttgg atggtgctac aagctagtac cagttgagcc agagaagtta gaagaagcca
181 acaaaggaga gaacaccagc ttgttacacc ctgtgagcct gcatggaatg gatgacccgg
241 agagagaagt gttagagtgg aggtttgaca gccgcctagc atttcatcac atggcccgag
301 agctgcatcc ggagtacttc aagaactgct gacatcgagc ttgctacaag ggactttccg
361 ctggggactt tccagggagg cgtggcctgg gcgggactgg ggagtggcga gccctcagat
421 cctgcatata agcagctgct ttttgcctgt actgggtctc tctggttaga ccagatctga
481 gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct
541 tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc
601 agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacctgaaag
661 cgaaagggaa accagagctc tctcgacgca ggactcggct tgctgaagcg cccgcacggc
721 aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactagcg gaggctagaa
781 ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat cgatgggaaa
841 aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa
901 gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta
961 gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat
1021 tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca
1081 aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaaaaaa gcacagcaag
1141 cagcagctga cacaggacac agcaatcagg tcagccaaaa ttaccctata gtgcagaaca
1201 tccaggggca aatggtacat caggccatat cacctagaac tttaaatgca tgggtaaaag
1261 tagtagaaga gaaggctttc agcccagaag tgatacccat gttttcagca ttatcagaag
1321 gagccacccc acaagattta aacaccatgc taaacacagt ggggggacat caagcagcca
1381 tgcaaatgtt aaaagagacc atcaatgagg aagctgcaga atgggataga gtgcatccag
1441 tgcatgcagg gcctattgca ccaggccaga tgagagaacc aaggggaagt gacatagcag
1501 gaactactag tacccttcag gaacaaatag gatggatgac aaataatcca cctatcccag
1561 taggagaaat ttataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata
1621 gccctaccag cattctggac ataagacaag gaccaaagga accctttaga gactatgtag
1681 accggttcta taaaactcta agagccgagc aagcttcaca ggaggtaaaa aattggatga
1741 cagaaacctt gttggtccaa aatgcgaacc cagattgtaa gactatttta aaagcattgg
1801 gaccagcggc tacactagaa gaaatgatga cagcatgtca gggagtagga ggacccggcc
1861 ataaggcaag agttttggct gaagcaatga gccaagtaac aaattcagct accataatga
1921 tgcagagagg caattttagg aaccaaagaa agattgttaa gtgtttcaat tgtggcaaag
1981 aagggcacac agccagaaat tgcagggccc ctaggaaaaa gggctgttgg aaatgtggaa
2041 aggaaggaca ccaaatgaaa gattgtactg agagacaggc taatttttta gggaagatct
2101 ggccttccta caagggaagg ccagggaatt ttcttcagag cagaccagag ccaacagccc
2161 caccagaaga gagcttcagg tctggggtag agacaacaac tccccctcag aagcaggagc
2221 cgatagacaa ggaactgtat cctttaactt ccctcaggtc actctttggc aacgacccct
2281 cgtcacaata aagatagggg ggcaactaaa ggaagctcta ttagatacag gagcagatga
2341 tacagtatta gaagaaatga gtttgccagg aagatggaaa ccaaaaatga tagggggaat
2401 tggaggtttt atcaaagtaa gacagtatga tcagatactc atagaaatct gtggacataa
2461 agctataggt acagtattag taggacctac acctgtcaac ataattggaa gaaatctgtt
2521 gactcagatt ggttgcactt taaattttcc cattagccct attgagactg taccagtaaa
2581 attaaagcca ggaatggatg gcccaaaagt taaacaatgg ccattgacag aagaaaaaat
2641 aaaagcatta gtagaaattt gtacagagat ggaaaaggaa gggaaaattt caaaaattgg
2701 gcctgaaaat ccatacaata ctccagtatt tgccataaag aaaaaagaca gtactaaatg
2761 gagaaaatta gtagatttca gagaacttaa taagagaact caagacttct gggaagttca
2821 attaggaata ccacatcccg cagggttaaa aaagaaaaaa tcagtaacag tactggatgt
2881 gggtgatgca tatttttcag ttcccttaga tgaagacttc aggaagtata ctgcatttac
2941 catacctagt ataaacaatg agacaccagg gattagatat cagtacaatg tgcttccaca
3001 gggatggaaa ggatcaccag caatattcca aagtagcatg acaaaaatct tagagccttt
3061 tagaaaacaa aatccagaca tagttatcta tcaatacatg gatgatttgt atgtaggatc
3121 tgacttagaa atagggcagc atagaacaaa aatagaggag ctgagacaac atctgttgag
3181 gtggggactt accacaccag acaaaaaaca tcagaaagaa cctccattcc tttggatggg
3241 ttatgaactc catcctgata aatggacagt acagcctata gtgctgccag aaaaagacag
3301 ctggactgtc aatgacatac agaagttagt ggggaaattg aattgggcaa gtcagattta
3361 cccagggatt aaagtaaggc aattatgtaa actccttaga ggaaccaaag cactaacaga
3421 agtaatacca ctaacagaag aagcagagct agaactggca gaaaacagag agattctaaa
3481 agaaccagta catggagtgt attatgaccc atcaaaagac ttaatagcag aaatacagaa
3541 gcaggggcaa ggccaatgga catatcaaat ttatcaagag ccatttaaaa atctgaaaac
3601 aggaaaatat gcaagaatga ggggtgccca cactaatgat gtaaaacaat taacagaggc
3661 agtgcaaaaa ataaccacag aaagcatagt aatatgggga aagactccta aatttaaact
3721 gcccatacaa aaggaaacat gggaaacatg gtggacagag tattggcaag ccacctggat
3781 tcctgagtgg gagtttgtta atacccctcc cttagtgaaa ttatggtacc agttagagaa
3841 agaacccata gtaggagcag aaaccttcta tgtagatggg gcagctaaca gggagactaa
3901 attaggaaaa gcaggatatg ttactaatag aggaagacaa aaagttgtca ccctaactga
3961 cacaacaaat cagaagactg agttacaagc aatttatcta gctttgcagg attcgggatt
4021 agaagtaaac atagtaacag actcacaata tgcattagga atcattcaag cacaaccaga
4081 tcaaagtgaa tcagagttag tcaatcaaat aatagagcag ttaataaaaa aggaaaaggt
4141 ctatctggca tgggtaccag cacacaaagg aattggagga aatgaacaag tagataaatt
4201 agtcagtgct ggaatcagga aagtactatt tttagatgga atagataagg cccaagatga
4261 acatgagaaa tatcacagta attggagagc aatggctagt gattttaacc tgccacctgt
4321 agtagcaaaa gaaatagtag ccagctgtga taaatgtcag ctaaaaggag aagccatgca
4381 tggacaagta gactgtagtc caggaatatg gcaactagat tgtacacatt tagaaggaaa
4441 agttatcctg gtagcagttc atgtagccag tggatatata gaagcagaag ttattccagc
4501 agaaacaggg caggaaacag catattttct tttaaaatta gcaggaagat ggccagtaaa
4561 aacaatacat actgacaatg gcagcaattt caccggtgct acggttaggg ccgcctgttg
4621 gtgggcggga atcaagcagg aatttggaat tccctacaat ccccaaagtc aaggagtagt
4681 agaatctatg aataaagaat taaagaaaat tataggacag gtaagagatc aggctgaaca
4741 tcttaagaca gcagtacaaa tggcagtatt catccacaat tttaaaagaa aaggggggat
4801 tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca tacaaactaa
4861 agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca gggacagcag
4921 aaattcactt tggaaaggac cagcaaagct cctctggaaa ggtgaagggg cagtagtaat
4981 acaagataat agtgacataa aagtagtgcc aagaagaaaa gcaaagatca ttagggatta
5041 tggaaaacag atggcaggtg atgattgtgt ggcaagtaga caggatgagg attagaacat
5101 ggaaaagttt agtaaaacac catatgtatg tttcagggaa agctagggga tggttttata
5161 gacatcacta tgaaagccct catccaagaa taagttcaga agtacacatc ccactagggg
5221 atgctagatt ggtaataaca acatattggg gtctgcatac aggagaaaga gactggcatt
5281 tgggtcaggg agtctccata gaatggagga aaaagagata tagcacacaa gtagaccctg
5341 aactagcaga ccaactaatt catctgtatt actttgactg tttttcagac tctgctataa
5401 gaaaggcctt attaggacac atagttagcc ctaggtgtga atatcaagca ggacataaca
5461 aggtaggatc tctacaatac ttggcactag cagcattaat aacaccaaaa aagataaagc
5521 cacctttgcc tagtgttacg aaactgacag aggatagatg gaacaagccc cagaagacca
5581 agggccacag agggagccac acaatgaatg gacactagag cttttagagg agcttaagaa
5641 tgaagctgtt agacattttc ctaggatttg gctccatggc ttagggcaac atatctatga
5701 aacttatggg gatacttggg caggagtgga agccataata agaattctgc aacaactgct
5761 gtttatccat tttcagaatt gggtgtcgac atagcagaat aggcgttact cgacagagga
5821 gagcaagaaa tggagccagt agatcctaga ctagagccct ggaagcatcc aggaagtcag
5881 cctaaaactg cttgtaccaa ttgctattgt aaaaagtgtt gctttcattg ccaagtttgt
5941 ttcataacaa aagccttagg catctcctat ggcaggaaga agcggagaca gcgacgaaga
6001 gctcatcaga acagtcagac tcatcaagct tctctatcaa agcagtaagt agtacatgta
6061 acgcaaccta taccaatagt agcaatagta gcattagtag tagcaataat aatagcaata
6121 gttgtgtggt ccatagtaat catagaatat aggaaaatat taagacaaag aaaaatagac
6181 aggttaattg atagactaat agaaagagca gaagacagtg gcaatgagag tgaaggagaa
6241 atatcagcac ttgtggagat gggggtggag atggggcacc atgctccttg ggatgttgat
6301 gatctgtagt gctacagaaa aattgtgggt cacagtctat tatggggtac ctgtgtggaa
6361 ggaagcaacc accactctat tttgtgcatc agatgctaaa gcatatgata cagaggtaca
6421 taatgtttgg gccacacatg cctgtgtacc cacagacccc aacccacaag aagtagtatt
6481 ggtaaatgtg acagaaaatt ttgacatgtg gaaaaatgac atggtagaac agatgcatga
6541 ggatataatc agtttatggg atcaaagcct aaagccatgt gtaaaattaa ccccactctg
6601 tgttagttta aagtgcactg atttgaagaa tgatactaat accaatagta gtagcgggag
6661 aatgataatg gagaaaggag agataaaaaa ctgctctttc aatatcagca caagcataag
6721 aggtaaggtg cagaaagaat atgcattttt ttataaactt gatataatac caatagataa
6781 tgatactacc agctatagct tgacaagttg taacacctca gtcattacac aggcctgtcc
6841 aaaggtatcc tttgagccaa ttcccataca ttattgtgcc ccggctggtt ttgcgattct
6901 aaaatgtaat aataagacgt tcaatggaac aggaccatgt acaaatgtca gcacagtaca
6961 atgtacacat ggaattaggc cagtagtatc aactcaactg ctgttaaatg gcagtctagc
7021 agaagaagag gtagtaatta gatctgtcaa tttcacggac aatgctaaaa ccataatagt
7081 acagctgaac acatctgtag aaattaattg tacaagaccc aacaacaata caagaaaaag
7141 aatccgtatc cagagaggac cagggagagc atttgttaca ataggaaaaa taggaaatat
7201 gagacaagca cattgtaaca ttagtagagc aaaatggaat aacactttaa aacagataga
7261 tagcaaatta agagaacaat tcggaaataa taaaacaata atctttaagc aatcctcagg
7321 aggggaccca gaaattgtaa cgcacagttt taattgtgga ggggaatttt tctactgtaa
7381 ttcaacacaa ctgtttaata gtacttggtt taatagtact tggagtactg aagggtcaaa
7441 taacactgaa ggaagtgaca caatcaccct cccatgcaga ataaaacaaa ttataaacat
7501 gtggcagaaa gtaggaaaag caatgtatgc ccctcccatc agtggacaaa ttagatgttc
7561 atcaaatatt acagggctgc tattaacaag agatggtggt aatagcaaca atgagtccga
7621 gatcttcaga cttggaggag gagatatgag ggacaattgg agaagtgaat tatataaata
7681 taaagtagta aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt
7741 gcagagagaa aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc
7801 aggaagcact atgggcgcag cctcaatgac gctgacggta caggccagac aattattgtc
7861 tggtatagtg cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt
7921 gcaactcaca gtctggggca tcaagcagct ccaagcaaga atcctagctg tggaaagata
7981 cctaaaggat caacagctcc tagggatttg gggttgctct ggaaaactca tttgcaccac
8041 tgctgtgcct tggaatgcta gttggagtaa taaatctctg gaacagatct ggaatcacac
8101 gacctggatg gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat
8161 tgaagaatcg caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg
8221 ggcaagtttg tggaattggt ttaacataac aaattggctg tggtatataa aattattcat
8281 aatgatagta ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa
8341 tagagttagg cagggatatt caccattatc gtttcagacc cacctcccaa tcccgagggg
8401 acccgacagg cccgaaggaa tagaagaaga aggtggagag agagacagag acagatccat
8461 tcgattagtg aacggatcct tggcacttat ctgggacgat ctgcggagcc tgtgcctctt
8521 cagctaccac cgcttgagag acttactctt gattgtaacg aggattgtgg aacttctggg
8581 acgcaggggg tgggaagccc tcaaatattg gtggaatctc ctacagtatt ggagtcagga
8641 actaaagaat agtgctgtta gcttgctcaa tgccacagcc atagcagtag ctgaggggac
8701 agatagggtt atagaagtag tacaaggagc ttgtagagct attcgccaca tacctagaag
8761 aataagacag ggcttggaaa ggattttgct ataagatggg tggcaagtgg tcaaaaagta
8821 gtgtgattgg atggcttact gtaagggaaa gaatgagacg agctgagcca gcagcagatg
8881 gggtgggagc agcatctcga gacctggaaa aacatggagc aatcacaagt agcaacacag
8941 cagctaccaa tgctgcttgt gcctggctag aagcacaaga ggaggaggag gtgggttttc
9001 cagtcacacc tcaggtacct ttaagaccaa tgacttacaa ggcagctgta gatcttagcc
9061 actttttaaa agaaaagggg ggactggaag ggctaattca ctcccaaaga agacaagata
9121 tccttgatct gtggatctac cacacacaag gctacttccc tgattgacag aactacacac
9181 cagggccagg ggtcagatat ccactgacct ttggatggtg ctacaagcta gtaccagttg
9241 agccagataa gatagaagag gccaataaag gagagaacac cagcttgtta caccctgtga
9301 gcctgcatgg gatggatgac ccggagagag aagtgttaga gtggaggttt gacagccgcc
9361 tagcatttca tcacgtggcc cgagagctgc atccggagta cttcaagaac tgctgacatc
9421 gagcttgcta caagggactt tccgctgggg actttccagg gaggcgtggc ctgggcggga
9481 ctggggagtg gcgagccctc agatcctgca tataagcagc tgctttttgc ctgtactggg
9541 tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg
9601 cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt
9661 gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagca
//
LOCUS HIVMAB 59 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) gag region.
ACCESSION M27955
KEYWORDS .
SOURCE Human immunodeficiency virus type 1 (isolate MAB), clone 01.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 59)
AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
Wain-Hobson,S.
TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for
viral mixtures and preferred nucleotide substitutions
JOURNAL j acquired immuno defic syndr 2, 344-52 (1989)
STANDARD full staff_review
COMMENT [1] kindly submitted in computer readable form by S. Wain-Hobson.
Sibling sequences are available (see 1989: I-A-184).
FEATURES from to/span description
pept < 1 > 59 gag protein (AA at 2)
BASE COUNT 24 a 7 c 12 g 16 t
ORIGIN Within gag cds.
1 ctataaaaga tggataatcc tggggttaaa taaaatagta agaatgtata gccctgtca
//
LOCUS HIVMN 9738 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete
genome.
ACCESSION M17449
KEYWORDS complete genome.
SOURCE Human immunodeficiency virus type 1 (isolate MN), proviral DNA
(non-infectious).
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9738)
AUTHORS Gurgo,C., Guo,H.-G., Franchini,G., Aldovini,A., Collalti,E.,
Farrell,K., Wong-Staal,F., Gallo,R.C. and Reitz,M.S.Jr.
TITLE Envelope sequences of two new United States HIV-1 isolates
JOURNAL Virology 164, 531-536 (1988)
STANDARD full staff_review
COMMENT Sequence kindly provided in computer-readable form by M. Reitz,
N.C.I., Bethesda, MD. 20892 U.S.A.
The MN isolate was taken from a pediatric AIDS patient in 1984.
The pol coding sequence shows an in-frame stop codon at 3783; the
nef protein is prematurely truncated at position 9357. The vpU
protein is prematurely truncated at position 6142.
FEATURES from to/span description
pept 787 2307 gag polyprotein
pept < 2091 3782 pol polyprotein (NH2-terminus uncertain; AA at
2091; in-frame stop codon at 3783)
3786 5111 pol polyprotein
pept 5056 5634 vif protein
pept 5574 5864 vpr protein
pept 5845 6059 tat protein, exon 2 (first expressed exon)
8396 8486 tat protein, exon 3 (AA at 8397)
pept 5984 6059 rev protein, exon 2 (first expressed exon)
8396 8667 rev protein, exon 3 (AA at 8398)
pept 6076 6144 vpu protein (premature termination)
pept 6239 8809 envelope polyprotein
pept 8811 9359 nef protein (premature termination at 9357
relative to other HIV-1 sequences)
LTR 1 633 5' LTR
LTR 9106 9738 3' LTR
rpt 453 550 R repeat 5' copy
rpt 9558 9655 R repeat 3' copy
binding 635 652 primer (Lys-tRNA) binding site
site 3783 3785 pol cds in-frame stop codon
signal 9631 9636 poly-A signal
BASE COUNT 3463 a 1769 c 2344 g 2162 t
ORIGIN Left end of viral genome.
1 tggatgggtt aatttactcc caaaagagac aagacatcct tgatctgtgg gtctaccaca
61 cacaaggcta cttccctgat tggcagaact acacaccagg gccagggatc agatatccac
121 tgacctttgg atggtgcttc aagctagtac cagttgagcc agagaagata gaagaggcca
181 ataaaggaga gaacaactgc ttgttacacc ctatgagcca gcatggatgg atgacccgga
241 gagagaagtg ttagtgtgga agtctgacag ccacctagca tttcagcatt atgcccgaga
301 gctgcatccg gagtactaca agaactgctg acatcgagct atctacaagg gactttccgc
361 tggggacttt ccagggaggt gtggcctggg cgggaccggg gagtggcgag ccctcagatc
421 gtgcatataa gcagctgctt tctgcctgta ctgggtctct ctggttagac cagatctgag
481 cctgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa agcttgcctt
541 gagtgcttca agtagtgtgt gcccgtctgt tatgtgactc tggtagctag agatccctca
601 gatcctttta ggcagtgtgg aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc
661 gaaagaaaaa ccagagctct ctcgacgcag gactcggctt gctgaagcgc gcacggcaag
721 aggcgagggg cggcgactgg tgagtacgcc aaaaattctt gactagcgga ggctagaagg
781 agagagatgg gtgcgagagc gtcggtatta agcgggggag aattagatcg atgggaaaac
841 attcggttaa ggccaggggg aaagaaaaaa tataaattaa aacatgtagt atgggcaagc
901 agggagctag aacgattcgc agtcaatcct ggcctgttag aaacatcaga aggctgtaga
961 caaatactgg gacagctaca accatccctt cagacaggat cagaagaact taaatcatta
1021 tataatacag tagcaaccct ctattgtgtg catcaaaaga tagagataaa agacaccaag
1081 gaagctttag agaaaataga ggaagagcaa aacaaaagta agaaaaaagc acagcaagca
1141 gcagctgaca caggaaacag aggaaacagc agccaagtca gccaaaatta ccccatagtg
1201 cagaacatcg aggggcaaat ggtacatcag gccatatcac ctagaacttt aaatgcatgg
1261 gtaaaagtag tagaagagaa ggctttcagc ccagaagtaa tacccatgtt ttcagcatta
1321 tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg gggacatcaa
1381 gcagccatgc aaatgttaaa agagaccatc aatgaggaag ctgcagaatg ggatagattg
1441 catccagtgc atgcagggcc tattacacca ggccagatga gagaaccaag gggaagtgac
1501 atagcaggaa ctactagtac ccttcaggaa caaataggat ggatgacaaa taatccacct
1561 atcccagtag gagaaatcta taaaagatgg ataatcctgg gattaaataa aatagtaagg
1621 atgtatagcc cttccagcat tctggacata agacaaggac caaaggaacc ctttagagac
1681 tatgtagacc ggttctataa aactctaaga gccgagcaag cttcacagga ggtaaaaaac
1741 cggacgacag aaaccttgtt ggtccaaaat gcgaacccag attgtaagac tattttaaaa
1801 gcattgggac cagcagctac actagaagaa atgatgacag catgtcaggg agtgggagga
1861 cctggtcata aagcaagagt tttggcggaa gcgatgagcc aagtaacaaa ttcagctacc
1921 ataatgatgc agagaggcaa ttttaggaat caaagaaaga ttatcaagtg cttcaattgt
1981 ggcaaagaag ggcacatagc caaaaattgc agggccccta ggaaaagggg ctgttggaaa
2041 tgtggaaagg aaggacacca aatgaaagat tgtactgaga gacaggctaa ttttttaggg
2101 aagatctggc cttcctgcaa gggaaggcgg aattttcctc agagcagaac agagccaaca
2161 gccccaccag aagagagctt caggtttggg gaagagacaa caactcccta tcagaagcag
2221 gagaagaagc aggagacgat agacaaggac ctgtatcctt tagcttccct caaatcactc
2281 tttggcaacg acccattgtc acaataaaga taggggggca actaaaggaa gctctattag
2341 atacaggagc agatgataca gtattaggag aaatgaattt gccaagaaga tggaaaccaa
2401 aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcag ataaccatag
2461 gaatctgtgg acataaagct ataggtacag tattagtagg acctacacct gtcaacataa
2521 ttggaagaaa tctgttgact cagcttgggt gcactttaaa ttttcccatt agtcctattg
2581 aaactgtacc agtaaaatta aagccaggaa tggatggccc aaaagttaaa caatggccat
2641 tgacagaaga aaaaataaaa gcattaatag aaatttgtac agaaatggaa aaggaaggga
2701 aaatttcaaa aattgggcct gaaaatccat acaatactcc agtatttgcc ataaagaaaa
2761 aagacagtac taaatggaga aaattagtag atttcagaga acttaataag aaaactcaag
2821 acttctggga agttcaatta ggaataccac atcctgcagg gttaaaaaag aaaaaatcag
2881 taacagtact ggatgtgggt gatgcatatt tttcagttcc cttagataaa gacttcagga
2941 agtatactgc atttaccata cctagtataa acaatgaaac accagggatt agatatcagt
3001 acaatgtgct tccacaggga tggaaaggat caccagcaat attccaaagt agcatgacaa
3061 aaatcttaga gccttttaga aaacaaaatc cagacatagt tatctatcaa tacatggatg
3121 atttgtatgt aggatctgac ttagaaatag ggcagcatag agcaaaaata gaggaactga
3181 gacgacatct gttgaggtgg ggatttacca caccagacaa aaaacatcag aaagaacctc
3241 cattcctttg gatgggttat gaactccatc ctgataaatg gacagtacag cctatagtgc
3301 taccagaaaa agacagctgg actgtcaatg acatacagaa gttagtggga aaattgaatt
3361 gggcaagtca gatttacgca gggattaaag taaagcaatt atgtaaactc cttagaggaa
3421 ccaaagcact aacagaagta ataccactaa cagaagaagc agagctagaa ctggcagaaa
3481 acagggaaat tctaaaagaa ccagtacatg gagtgtatta tgacccatca aaagacttaa
3541 tagcagaagt acagaagcag gggcaaggcc aatggacata tcaaatttat caagagccat
3601 ttaaaaatct gaaaacaggc aaatatgcaa gaatgagggg tgcccacact aatgatgtaa
3661 aacaattaac agaggcagtg caaaaaatag ccacagaaag catagtaata tggggaaaga
3721 ctcctaaatt tagactaccc atacaaaaag aaacatggga aacatggtgg acagagtata
3781 cgtaagccac ctggattcct gagtgggagg ttgtcaatac ccctccctta gtgaaattat
3841 ggtaccagtt agagaaagaa cccatagtag gtgcagaaac tttctatgta gatggggcag
3901 ctaacaggga gactaaaaaa ggaaaagcag gatatgttac taacagagga agacaaaagg
3961 ttgtctccct aactgacaca acaaatcaga agactgagtt acaagcaatt catctagctt
4021 tgcaagattc agggttagaa gtaaacatag taacagactc acaatatgca ttaggaatca
4081 ttcaagcaca accagataaa agtgaatcag agttagtcag tcaaataata gagcagttaa
4141 taaaaaagga aaaggtctat ctggcatggg taccagcaca caaaggaatt ggaggaaatg
4201 aacaagtaga taaattagtc agtgctggaa tcaggaaagt actattttta gatggaatag
4261 ataaggccca agaagaccat gagaaatatc acagtaattg gagagcaatg gctagtgact
4321 ttaacctacc acctatagta gcaaaagaaa tagtagccag ctgtgataaa tgtcagctaa
4381 aaggagaagc catgcatgga caagtagact gtagtccagg aatatggcaa ctagattgta
4441 cacatttaga aggaaaagtt atcctggtag cagttcatgt agccagtgga tacatagaag
4501 cagaagttat tccagcagag acagggcagg agacagcata ctttctctta aaattagcag
4561 gaagatggcc agtaaaaaca atacatacag acaatggccc caatttcacc agtactacgg
4621 ttaaggccgc ctgttggtgg acgggaatca agcaggaatt tggcattccc tacaatcccc
4681 aaagtcaagg agtaatagaa tctatgaata aagaattaaa gaaaattata ggacaggtaa
4741 gagatcaggc tgaacatctt aagagagcag tacaaatggc agtattcatc cacaatttta
4801 aaagaaaagg ggggattggg gggtacagtg caggggaaag aatagtaggc ataatagcaa
4861 cagacataca aactaaagaa ctacaaaaac aaattacaaa aattcaaaat tttcgggttt
4921 attacaggga cagcagagat ccactttgga aaggaccagc aaagcttctc tggaaaggtg
4981 aaggggcagt agtaatacaa gataataatg acataaaagt agtgccaaga agaaaagcaa
5041 aggtcattag ggattatgga aaacagacgg caggtgatga ttgtgtggca agcagacagg
5101 atgaggatta gaacatggaa aagtttagta aaacaccata tgtatatttc aaagaaagct
5161 aaaggacggt tttatagaca tcactatgaa agcactcatc caagaataag ttcagaagta
5221 cacatcccac taggggatgc tagattggta ataacaacat attggggtct gcatacagga
5281 gaaagagact ggcatttagg tcagggagtc tccatagaat ggaggaaaaa gagatatagc
5341 acacaagtag accctgacct agcagaccac ctaattcatc tgcattactt tgattgtttt
5401 tcagactctg ccataagaaa ggccatatta ggacatagag ttagtcctat ttgtgaattt
5461 caagcaggac ataacaaggt aggacctcta cagtacttgg cactaacagc attaataaca
5521 ccaaaaaaga taaagccacc tttgcctagt gttaagaaac tgacagagga tagatggaac
5581 aagccccaga agaccaaggg ccacagaggg agccatacaa tcaatgggca ctagagcttt
5641 tagaggagct taagaatgaa gctgttagac attttcctag gatatggctc catggcttag
5701 ggcaacatat ctatgaaact tatggggata cttgggcagg agtggaagcc ataataagaa
5761 ttctacaaca actgctgttt attcatttca gaattgggtg tcgacatagc agaataggca
5821 ttattcgaca gaggagagca agaaatggag ccagtagatc ctagactaga gccctggaag
5881 catccaggaa gtcagcctaa gactgcttgt accacttgct attgtaaaaa gtgttgcttt
5941 cattgccaag tttgtttcac aaaaaaagcc ttaggcatct cctatggcag gaagaagcgg
6001 agacagcgac gaagagctcc tgaagacagt cagactcatc aagtttctct accaaagcag
6061 taagtagtac atgtaatgca acctttagta atagcagcaa tagtagcatt agtagtagca
6121 ggaataatag caatagttgt gtgatccata gtattcatag aatataggaa aataagaaga
6181 caaagaaaaa tagacaggtt aattgataga ataagcgaaa gagcagaaga cagtggcaat
6241 gagagtgaag gggatcagga ggaattatca gcactggtgg ggatggggca cgatgctcct
6301 tgggttatta atgatctgta gtgctacaga aaaattgtgg gtcacagtct attatggggt
6361 acctgtgtgg aaagaagcaa ccaccactct attttgtgca tcagatgcta aagcatatga
6421 tacagaggta cataatgttt gggccacaca agcctgtgta cccacagacc ccaacccaca
6481 agaagtagaa ttggtaaatg tgacagaaaa ttttaacatg tggaaaaata acatggtaga
6541 acagatgcat gaggatataa tcagtttatg ggatcaaagc ctaaagccat gtgtaaaatt
6601 aaccccactc tgtgttactt taaattgcac tgatttgagg aatactacta ataccaataa
6661 tagtactgct aataacaata gtaatagcga gggaacaata aagggaggag aaatgaaaaa
6721 ctgctctttc aatatcacca caagcataag agataagatg cagaaagaat atgcacttct
6781 ttataaactt gatatagtat caatagataa tgatagtacc agctataggt tgataagttg
6841 taatacctca gtcattacac aagcttgtcc aaagatatcc tttgagccaa ttcccataca
6901 ctattgtgcc ccggctggtt ttgcgattct aaaatgtaac gataaaaagt tcagtggaaa
6961 aggatcatgt aaaaatgtca gcacagtaca atgtacacat ggaattaggc cagtagtatc
7021 aactcaactg ctgttaaatg gcagtctagc agaagaagag gtagtaatta gatctgagaa
7081 tttcactgat aatgctaaaa ccatcatagt acatctgaat gaatctgtac aaattaattg
7141 tacaagaccc aactacaata aaagaaaaag gatacatata ggaccaggga gagcatttta
7201 tacaacaaaa aatataatag gaactataag acaagcacat tgtaacatta gtagagcaaa
7261 atggaatgac actttaagac agatagttag caaattaaaa gaacaattta agaataaaac
7321 aatagtcttt aatcaatcct caggagggga cccagaaatt gtaatgcaca gttttaattg
7381 tggaggggaa tttttctact gtaatacatc accactgttt aatagtactt ggaatggtaa
7441 taatacttgg aataatacta cagggtcaaa taacaatatc acacttcaat gcaaaataaa
7501 acaaattata aacatgtggc aggaagtagg aaaagcaatg tatgcccctc ccattgaagg
7561 acaaattaga tgttcatcaa atattacagg gctactatta acaagagatg gtggtaagga
7621 cacggacacg aacgacaccg agatcttcag acctggagga ggagatatga gggacaattg
7681 gagaagtgaa ttatataaat ataaagtagt aacaattgaa ccattaggag tagcacccac
7741 caaggcaaag agaagagtgg tgcagagaga aaaaagagca gcgataggag ctctgttcct
7801 tgggttctta ggagcagcag gaagcactat gggcgcagcg tcagtgacgc tgacggtaca
7861 ggccagacta ttattgtctg gtatagtgca acagcagaac aatttgctga gggccattga
7921 ggcgcaacag catatgttgc aactcacagt ctggggcatc aagcagctcc aggcaagagt
7981 cctggctgtg gaaagatacc taaaggatca acagctcctg gggttttggg gttgctctgg
8041 aaaactcatt tgcaccacta ctgtgccttg gaatgctagt tggagtaata aatctctgga
8101 tgatatttgg aataacatga cctggatgca gtgggaaaga gaaattgaca attacacaag
8161 cttaatatac tcattactag aaaaatcgca aacccaacaa gaaaagaatg aacaagaatt
8221 attggaattg gataaatggg caagtttgtg gaattggttt gacataacaa attggctgtg
8281 gtatataaaa atattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc
8341 tgtactttct atagtgaata gagttaggca gggatactca ccattgtcgt tgcagacccg
8401 ccccccagtt ccgaggggac ccgacaggcc cgaaggaatc gaagaagaag gtggagagag
8461 agacagagac acatccggtc gattagtgca tggattctta gcaattatct gggtcgacct
8521 gcggagcctg ttcctcttca gctaccacca cagagactta ctcttgattg cagcgaggat
8581 tgtggaactt ctgggacgca gggggtggga agtcctcaaa tattggtgga atctcctaca
8641 gtattggagt caggaactaa agagtagtgc tgttagcttg cttaatgcca cagctatagc
8701 agtagctgag gggacagata gggttataga agtactgcaa agagctggta gagctattct
8761 ccacatacct acaagaataa gacagggctt ggaaagggct ttgctataag atgggtggca
8821 aatggtcaaa acgtgtgact ggatggccta ctgtaaggga aagaatgaga cgagctgaac
8881 cagctgagct agcagcagat ggggtgggag cagcatcccg agacctggaa aaacatggag
8941 cactcacaag tagcaataca gcagctacca atgctgattg tgcctggcta gaagcacaag
9001 aggaggagga agtgggtttt ccagtcaaac ctcaggtacc tttaagacca atgacttaca
9061 aagcagcttt agatcttagc cactttttaa aagaaaaggg gggactggat gggttaattt
9121 actcccaaaa gagacaagac atccttgatc tgtgggtcta ccacacacaa ggctacttcc
9181 ctgattggca gaactacaca ccagggccag ggatcagata tccactgacc tttggatggt
9241 gcttcaagct agtaccagtt gagccagaga agatagaaga ggccaataaa ggagagaaca
9301 actgcttgtt acaccctatg agccagcatg gatggatgac ccggagagag aagtgttagt
9361 gtggaagtct gacagccacc tagcatttca gcattatgcc cgagagctgc atccggagta
9421 ctacaagaac tgctgacatc gagctatcta caagggactt tccgctgggg actttccagg
9481 gaggtgtggc ctgggcggga ccggggagtg gcgagccctc agatcgtgca tataagcagc
9541 tgctttctgc ctgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg
9601 gctaactagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag
9661 tgtgtgcccg tctgttatgt gactctggta gctagagatc cctcagatcc ttttaggcag
9721 tgtggaaaat ctctagca
//
LOCUS HIVNDK 9143 bp ds-DNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete
genome.
ACCESSION M27323
KEYWORDS complete genome.
SOURCE Human immunodeficiency virus type 1 (isolate NDK), proviral DNA
infectious clone.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9143)
AUTHORS Spire,B., Sire,J., Zachar,V., Rey,F., Barre-Sinoussi,F.,
Galibert,F., Hampe,A. and Chermann,J.-C.
TITLE Nucleotide sequence of HIV1-NDK a highly cytopathic strain of the
human immunodeficiency virus HIV1
JOURNAL Gene 81, 275-84 (1989)
STANDARD full staff_review
COMMENT Kindly provided prior to publication by J.-C. Chermann, Pasteur
Institute, Marseille.
This is an infectious molecular clone of an isolate estimated to be
10,000 times more cytopathic in vitro than a prototypical HIV-1.
[1] reports that only minor sequence differences appear to be
responsible for the "acute biological effect".
FEATURES from to/span description
pept 334 1827 gag polyprotein
pept < 1623 4631 pol polyprotein (NH2-terminus uncertain)
pept 4576 5154 vif protein
pept 5094 5384 vpR protein
pept 5365 5579 tat protein, exon 2 (first expressed exon)
7883 7928 tat protein, exon 3 (AA at 7884)
pept 5504 5579 rev protein, exon 2 (first expressed exon)
7883 8157 rev protein, exon 3 (AA at 7885)
pept 5596 5841 vpU protein
pept 5759 8299 env polyprotein
pept 8301 8924 nef protein
pre-msg 1 9143 genomic mRNA
BASE COUNT 3307 a 1617 c 2193 g 2026 t
ORIGIN 5'-terminus of 5' R repeat (putative mRNA start).
1 ggtctctctg gttagaccag atttgagcct gggagctctg gctaattagg gaacccactg
61 cttaagcctc aataaagctt gccttgagcg cttcaagtag tgtgtgcccg tctgttgtgt
121 gactctggta actagagatc cctcagatcc gtttagtccg agtgaaaaat ctctagcagt
181 ggcgcccgaa cagggacctg aaagcgaaag tagaaccaga gaagatctct cgacgcagga
241 ctcggcttgc tgaagcgcgc acagcaagag gcgaggggca gcgactggtg agtacgctga
301 aatttttgac tagcggaggc tagaaggaga gagatgggtg cgagagcgtc agtattaagc
361 gggggaaaat tagatacatg ggaaagaatt cggttacggc caggaggaaa gaaaaaatat
421 gcactaaaac atttgatatg ggcaagcagg gagctagaac gatttacact taatcctggc
481 cttttagaga catcagaagg ctgtaaacaa ataataggac agctacaacc atctattcaa
541 acaggatcag aagaaattag atcattatat aatacagtag caaccctcta ttgtgtacat
601 gaaaggatag aggtaaaaga caccaaagaa gctgtagaaa agatggagga agaacaaaac
661 aaaagtaaga aaaagacaca gcaagcagca gctgatagca gccaggtcag ccaaaattac
721 cctatagtgc agaacctaca ggggcaaatg gtacatcagg ccatatcacc tagaactttg
781 aacgcatggg taaaagtaat agaagaaaag gccttcagcc cggaagtaat acccatgttt
841 tcagcattat cagaaggagc caccccacaa gatttaaaca ccatgctaaa cacagtgggg
901 ggacatcaag cagctatgca aatgctaaaa gagaccatca atgacgaagc tgcagaatgg
961 gacagattac atccagtgca tgcagggcct gttgcaccag gccaaatgag agaaccaagg
1021 ggaagtgata tagcaggaac tactagtacc cttcaggaac aaatagcatg gatgacaagc
1081 aacccaccta tcccagtagg agaaatctat aaaagatgga taatcctggg attaaataaa
1141 atagtaagaa tgtatagccc tgtcagcatt ttggacataa gacagggacc aaaggaacct
1201 tttagagact atgtagaccg gttctataaa actctaagag ccgagcaagc ttcacaggat
1261 gtaaaaaact ggatgacaga aaccttgttg gtccaaaatg caaacccaga ttgtaaaact
1321 atcttaaaag cattgggacc acaggctaca ctagaagaaa tgatgacagc atgccaggga
1381 gtgggggggc ccggccataa agcaagagtt ttggctgagg caatgagcca agtaacaggt
1441 tcagctactg cagtaatgat gcagagaggc aattttaagg gcccaagaaa aagtattaag
1501 tgtttcaact gtggcaagga agggcacaca gcaaaaaatt gcagggcccc tagaaaaaag
1561 ggctgttgga aatgcggaag ggaaggacac caaatgaaag attgcactga aagacaggct
1621 aattttttag ggaagatttg gccttcccac aagggaaggc cggggaattt tcttcagagc
1681 agaccagagc caacagcccc accagcagag agcttcgggt ttggggagga gataaccccc
1741 tctcagaaac aggagcagaa agacaaggaa ctgtatcctt tagcttccct caaatcactc
1801 tttggcaacg acccctcgtc acaataaaga tagggggaca gctaaaggaa gctctattag
1861 atacaggagc agatgataca gtattagaag aaataaattt gccaggaaaa tggaagccaa
1921 aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcaa atactcatag
1981 aaatctgtgg atataaagct atgggtacag tattagtagg acctacacct gtcaacataa
2041 ttggaagaaa tttgttgacc cagattggct gcactttaaa ttttccaatt agtcctattg
2101 aaactgtacc agtaaaatta aagccaggaa tggatggccc aaaagttaaa caatggccat
2161 tgacagaaga aaaaataaaa gcattaacag aaatttgtac agaaatggaa aaggaaggaa
2221 aaatttcaag aattgggcct gaaaatccat ataatactcc aatatttgcc ataaagaaaa
2281 aagacagtac caagtggaga aaattagtag atttcagaga acttaataag agaactcaag
2341 atttctggga ggttcaatta ggaataccgc atcctgcagg gctgaaaaag aaaaaatcag
2401 taacagtact ggatgtgggt gatgcatatt tctcagttcc cttagatgaa gattttagga
2461 aatataccgc atttaccata cctagtataa acaatgagac accagggatt agatatcagt
2521 acaatgtgct cccacaggga tggaaaggat caccggcaat attccaaagt agcatgacaa
2581 aaatcttaga gccctttaga aaacaaaatc cagaaatagt tatctatcaa tacatggatg
2641 atttgtatgt aggatctgac ttagaaatag ggcagcatag aacaaaaata gaggaattaa
2701 gagaacatct attgaggtgg ggatttacca caccagataa aaaacatcag aaagaacctc
2761 catttctttg gatgggttat gaactccatc ctgataaatg gacagtacag cctataaacc
2821 tgccagaaaa agaaagctgg actgtcaatg atatacagaa gttagtgggg aaattaaact
2881 gggcaagcca gatttatgca ggaattaaag taaagcaatt atgtaaactc cttaggggaa
2941 ccaaagcact aacagaagta gtaccactaa cagaagaagc agaattagaa ctggcagaaa
3001 acagggaaat tctaaaagaa ccagtacatg gagtgtatta tgacccatca aaagacttaa
3061 tagcagaact acagaaacaa ggggacggcc aatggacata ccaaatttat caagaaccat
3121 ttaaaaatct aaaaacagga aagtatgcaa gaacgagggg tgcccacact aatgatgtaa
3181 aacaattaac agaggcagtg caaaaaatag ccacagaaag catagtgata tggggaaaga
3241 ctcctaaatt taaactaccc atacaaaagg aaacatggga aacatggtgg atagagtatt
3301 ggcaagccac ctggattcct gagtgggaat ttgtcaatac ccctccttta gtaaaattat
3361 ggtaccagtt agagaaggaa cccataatag gagcagaaac tttctatgta gatggggcag
3421 ctaatagaga gactaaatta ggaaaagcag gatatgttac tgacagagga agacagaaag
3481 ttgtcccttt cactgacacg acaaatcaga agactgagtt acaagcaatt aatctagctt
3541 tacaggattc gggattagaa gtaaacatag taacagattc acaatatgca ctaggaatca
3601 ttcaagcaca accagataag agtgaatcag agttagtcag tcaaataata gagcagctaa
3661 taaaaaagga aaaggtttac ctggcatggg taccagcaca caaaggaatt ggaggaaatg
3721 aacaagtaga taaattagtc agtcagggaa tcaggaaagt actatttttg gatggaatag
3781 ataaggctca ggaagaacat gagaaatatc acaacaattg gagagcaatg gctagtgatt
3841 ttaacctacc acctgtggta gcgaaagaaa tagtagctag ctgtgataaa tgtcagctaa
3901 aaggagaagc catgcatgga caagtagact gtagtccagg aatatggcaa ttagattgta
3961 cacatctgga aggaaaagtt atcctggtag cagttcatgt agccagtggc tatatagaag
4021 cagaagttat tccagcagaa acggggcaag aaacagcata ctttctctta aaattagcag
4081 gaagatggcc agtaaaagta gtacatacag ataatggcag caatttcacc agtgctacag
4141 ttaaggccgc ctgttggtgg gcagggatca aacaggaatt tggaattccc tacaatcccc
4201 aaagtcaagg agtagtagaa tctatgaata aagaattaaa gaaaattata ggacaggtaa
4261 gagatcaagc tgaacatctt aagacagcag tacaaatggc agtatttatc cacaatttta
4321 aaagaaaagg ggggattggg ggatacagtg caggggaaag aataatagac ataatagcaa
4381 cagacataca aactagagaa ttacaaaaac aaatcataaa aattcaaaat tttcgggttt
4441 attacaggga cagcagagat ccaatttgga aaggaccagc aaagcttctc tggaaaggtg
4501 aaggggcagt agtaatacaa gacaatagtg acataaaggt agtaccaaga agaaaagtaa
4561 agatcattag ggattatgga aaacagatgg caggtgatga ttgtgtggca agtagacagg
4621 atgaggatta acacatggaa aagtttagta aaataccata tgtatgtttc aaagaaagct
4681 aacagatggt tttatagaca tcactatgac agccaccacc caaaaataag ttcagaagta
4741 cacatcccac taggagaagc tagactggta gtaacaacat attggggtct gcatacagga
4801 gaaaaagaat ggcatctggg tcagggagtc tccatagaat ggaggaaaag gagatatagc
4861 acacaagtag accctggcct ggcagaccaa ctaattcata tgtattattt tgattgtttt
4921 gcagaatctg ctataagaaa agccatatta ggacatatag ttagtcctag ttgtgagtat
4981 caagcaggac ataacaaggt aggatcctta cagtatttgg cactagcagc attaatagca
5041 ccaaaaaaga taaagccacc tttgcctagt gttaggaagc taacagaaga tagatggaac
5101 aagccccaga agaccaaggg ccgcagaggg agccatacaa tgaatggaca ttagagcttt
5161 tagaggagct taagagtgaa gctgtcagac attttcctag gatatggctc catagcttag
5221 gacaacatat ctatgaaact tatggggata cctgggcagg tgttgaagct ataataagaa
5281 ttctgcaaca actactgttt attcatttca gaattgggtg tcaacatagc agaataagta
5341 ttactcgaca gagaagagca agaaatggat ccagtagatc ctaatctaga gtcctggaac
5401 catccaggaa gtcagcctag gactgcttgt aataagtgtc attgtaaaaa gtgttgctat
5461 cattgccaag tttgcttcat aacgaaaggc ttaggcatct cctatggcag gaagaagcgg
5521 agacagcgac gaaaacctcc tcaaggcgat caggctcatc aagttcctat accagagcag
5581 taagtattcc atgtaatgca acctttagtg ataatagcaa tagcagcatt agtagtagca
5641 ataataatag caatagttgt gtggaccata gtatacatag aatataggag gataaaaagg
5701 caaagaaaaa tagactgttt aattgataga ataagagaaa gagcagaaga cagtggcaat
5761 gagagcgagg gagaaagaga ggaattgtca aaacttgtgg aaatggggca tcatgctcct
5821 tgggatgttg atgacctgta gtgctgcaga agatttgtgg gtcacagttt attatggggt
5881 gcctatatgg aaggaagcaa ctaccactct attttgtgca tcagatgcta aagcatataa
5941 aaaagaggca cataatatct gggccacaca tgcctgtgta cccacggacc ccaacccaca
6001 agaaatagaa ttggaaaatg tgacagaaaa ctttaacatg tggaaaaata acatggtgga
6061 acaaatgcat gaggatataa tcagtttatg ggatcaaagc ctaaaaccat gtgtaaaatt
6121 aactccactc tgtgtcactt taaactgcac tgatgaattg aggaacagca agggcaatgg
6181 gaaggtagaa gaggaggaaa aaaggaaaaa ctgctctttc aatgtaaggg ataagaggga
6241 gcaagtatat gcactttttt ataaacttga tatagtgcca atagacaata ataataggac
6301 caatagtact aattataggt taataaattg tgatacctca accattacac aggcttgtcc
6361 aaagatatcc tttgaaccaa ttcccataca tttttgtgcc ccagctggtt ttgcaattct
6421 aaagtgtaga gataagaagt tcaatgggac aggcccatgc tcaaatgtca gcacagtaca
6481 atgtacacat ggaattagac cagtggtgtc aactcaactg ctgttgaatg gcagtctagc
6541 agaagaagag atcataatta gatctgaaaa tctcacaaac aatgttaaaa ccataatagt
6601 acagcttaat gcatctatag taattaattg tacaaggccc tacaaatata caaggcaaag
6661 gacatcgata ggactaaggc aatcactcta tacaataaca ggaaaaaaga agaaaacagg
6721 atacatagga caagcacatt gtaaaattag cagagcagaa tggaataaag ctttacaaca
6781 ggtagctaca aaactaggaa accttcttaa caaaacaaca ataactttta agccatcctc
6841 aggaggggac ccagaaatta catcacacat gcttaattgt ggaggggact tcttctactg
6901 taatacatca agactgttta atagtacatg gaatcagact aatagtacag ggttcaataa
6961 tggcacagtc acactcccat gcagaataaa acaaattgta aacttgtggc agagagtagg
7021 aaaagcaatg tatgcccctc ccatcgaagg actaattaaa tgttcatcaa atattacagg
7081 gctactatta acaagagatg gtggtgcaaa taatagttct catgagacca tcagacctgg
7141 aggaggagat atgagggaca attggagaag tgaattatat aaatataaag tagtaaaaat
7201 tgaaccaata ggagtagcac ccaccaaggc aaggagaaga gtggtggaaa gagaaaaaag
7261 agcaatagga ctaggagctg tgttccttgg gttcttggga gcagcaggaa gcacgatggg
7321 cgcagcgtca gtgacgctga cggtacaggc cagacagtta atgtctggta tagtgcacca
7381 gcaaaacaat ttgctgaggg ctatagaggc gcaacaacat ctgttgcaac tcacggtctg
7441 gggcattaaa cagctccagg caagagtcct ggctgtggaa agatacctaa gggatcaaca
7501 gctcctagga atttggggtt gctctggaag gcacatttgc accactaatg taccctggaa
7561 ctctagttgg agtaatagat ctctagatga gatttggcag aacatgacct ggatggagtg
7621 ggaaagagaa attgacaatt acacaggctt aatatacagc ttaattgaag aatcgcagat
7681 ccagcaagaa aagaatgaaa aagaattatt ggaattggac aaatgggcaa gtttgtggaa
7741 ttggtttagc ataacaaaat ggctgtggta tataaaatta ttcataatga tagtaggagg
7801 cctgataggt ttaagaatag tttttgctgt gctttctgta gtaaatagag ttaggcaggg
7861 atactcacct ctgtcatttc agaccctcct cccagtcccg aggggacccg acaggcccga
7921 agaaatagaa gaagaaggtg gagagcgagg cagagacaga tccattcgat tggtgaacgg
7981 attatttgca cttttctggg acgacctgag gaacctgtgc ctcttcagct accaccgctt
8041 gagagactca atcttaattg cagcgaggat tgtggaactt ctgggacgca gggggtggga
8101 agccctcaaa tacctgtgga acctcctgca gtattggagt caggaactga ggaatagtgc
8161 tagtagcttg cttgatacca tagcaatagc agtagctgag aggacagata gggttataga
8221 agtagtacaa agagcttgca gagctattct taacgtaccc agaagaataa gacagggctt
8281 ggaaaggctt ttgctataaa atgggtggca aatggtcaaa aagtagttta gttggatggc
8341 ctgctataag ggaaagaata agaaaaactg atccagcagc agatggggta ggagcagtat
8401 ctcgagacct ggaaaaacat ggggcaatca caagtagcaa tacagcaagt actaatgata
8461 cctgtgcatg gctggaagca caagaggaga gcgaggaggt gggctttcca gttagacctc
8521 aggtaccttt aagaccaatg acttacaaag aagctgttga tcttagccac tttttaaaag
8581 aaaagggggg actggaaggg ctaatttggt ccaagaaaag acaagagatc cttgatcttt
8641 gggtctacaa cacacaaggc atcttccccg attggcagaa ctacacacca gggccaggga
8701 tcagataccc actgaccttt ggatggtgct tccagctagt accagttgat ccacaggagg
8761 tagaagaggc cactgaaaga gaggacaact gcttgttaca ccctatgtgc cagcaaggaa
8821 tggaggaccc ggagagacaa gtgttaatgt ggagatttaa cagcagacta gcacttgagc
8881 acaaggcccg agagctgcat ccggagttct acaaagactg ctgacatcga gtttcctaca
8941 agggactttc cgctggggac tttccaggga ggcgtggcct gggcgggact tgggagtggc
9001 taaccctcag atgctgcata taagcagctg cttttcgcct gtactgggtc tctctggtta
9061 gaccagattt gagcctggga gctctctggc taattaggga acccactgct taagcctcaa
9121 taaagcttgc cttgagtgct tca
//
LOCUS HIVOYI 9190 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral DNA, complete
genome.
ACCESSION M26727
KEYWORDS complete genome.
SOURCE Human immunodeficiency virus type 1 (isolate OYI) proviral DNA.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 9190)
AUTHORS Wain-Hobson,S. and Pasteur Institute,Paris.
TITLE ;
JOURNAL AIDS Res. Hum. Retroviruses (1990) In press
STANDARD full staff_review
COMMENT Kindly provided in computer-readable form by Simon Wain-Hobson,
Pasteur Institute, Paris.
OYI, designated elsewhere as isolate 397, was taken from a healthy
Gabonese presenting an atypical Western Blot. Tree analysis (see
Part III) reveals that the sequence is closely related to the North
American isolate SF2 and the European virus HAN(across the genome).
This is the first report of a virus from Africa that clusters with
North American rather than Zairean viruses: OYI and SF2 differ by
approximately 7% in envelope. The single C -> S substitution at
residue 22 of the OYI tat protein renders it inactive, but may not
account for the avirulence of the virus. The pattern of env
termination and nef start is not what is typically encountered.
Sibling sequences for OYI(397) are available (see 1989:I-A-181).
FEATURES from to/span description
pept 336 1835 gag polyprotein
pept < 1628 4639 pol polyprotein
pept 4584 5162 vif protein
pept 5102 5392 vpR protein
ORF 5599 5721 vpu ORF
pept 5373 5587 tat protein, exon 2 (first expressed exon)
7918 8008 tat protein, exon 3 (AA at 7919)
pept 5512 5587 rev protein, exon 2 (first expressed exon)
7918 8192 rev protein, exon 3 (AA at 7920)
pept 5767 8334 env polyprotein
pept 8336 8971 nef protein
binding 182 199 primer (Lys-tRNA) binding site
pre-msg 1 9190 genomic mRNA
signal 9166 9171 poly-A signal
BASE COUNT 3304 a 1630 c 2202 g 2054 t
ORIGIN
1 ggtctctcta gctagaccag atctgagccc gggagctctc tggctaacta gggaacccac
61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt
121 gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca
181 gtggcgcccg aacagggacc ttaaagtgaa agtggaacca gaggagctct ctcgacgcag
241 gactcggctt gcttagcgcg cacggcaaga ggcgaggggc ggcgactggt gagtacgcca
301 aaaatttttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa
361 gcgggggaga attagataaa tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat
421 atcaactaaa acatatagta tgggcaagca gggagctaga acgattcgca attaatcctg
481 gcctgttaga aacatcagaa ggttgtagac aaatactggg acagctacaa ccatccctta
541 agacaggatc agaagaaatt agatcattat ataatacagt agcaactctt tattgtgtgc
601 atcaaaagat agaggtaaaa gacaccaagg aagctttaga taagatagag gaagagcaaa
661 acaaaagtaa gaaaaaagca cagcaaacag cagctgacac aggaaacagc agccaggtca
721 gccaaaatta ccctatagta cagaaccttc aggggcaaat ggtacatcag cccatatcac
781 ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtaa
841 tacccatgtt ttcagcatta gcagaaggag ccaccccaca agatctaaac accatgctaa
901 acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag
961 ctgcagaatg ggatagattg catccagtac atgcagggcc tattgcacca ggccagatga
1021 gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat
1081 ggatgacaaa taatccacct atcccagtag gagaaatcta taaaagatgg ataatcctgg
1141 gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac
1201 caaaggaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag
1261 cttcacagga tgtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcaaacccag
1321 attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag
1381 catgtcaggg agtgggggga cccggccata aagcaagagt tttggctgaa gcaatgagcc
1441 aagtaaattc agtcaccgta atgatgcaga aaggcaattt taagaaccaa agaaagactg
1501 ttaagtgttt caattgtggc aaagaagggc acatagccaa aaattgcagg gctcctagga
1561 aaaagggctg ttggaaatgt ggaagggaag gacaccaaat gaaagattgt actgagagac
1621 aggctaattt tttagggaag atctggcctt cccacaaggg aaggccaggg aattttcttc
1681 agaacagacc agagccaaca gccccaccag cagagagctt cgggtttggg gaagagacaa
1741 caactccccc tcagaagcag gagccgatag acaagggact gtatccttta acctccctca
1801 gatcactctt tggcaacgac ccatcgtcac aataaagata ggggggcaac taaaagaagc
1861 tctattagat acaggagcag atgatacagt attagaagaa atgaatttgc caggaagatg
1921 gaaaccaaaa atgatagggg gaattggagg ttttatcaaa gtaagacagt atgatcagat
1981 actcatagaa atctgtggac ataaagctat aggtacagta ttagtaggac ctacacctgt
2041 caacataatt ggaagaaatc tgttgactca gcttggttgt actttaaatt ttcccattag
2101 tcctattgaa actgtaccag taaaattaaa gccaggaatg gatggcccaa aagttaaaca
2161 atggccattg acagaagaga aaataaaagt attaatagaa atttgtacag aaatggaaaa
2221 ggaagggaaa atttcaaaag ttgggcctga aaatccatac aatactccag tatttgccat
2281 aaagaaaaaa gacagtacta aatggagaaa attagtagat ttcagagaac ttaataagag
2341 aactcaggac ttctgggaag tccaattagg aataccacat ccagcagggt taaaaaagaa
2401 aaaatcagta acagtactgg atgtgggtga tgcatacttt tcagttccct tagataaaga
2461 cttcagaaag tatactgcat ttaccatacc tagtataaac aatgagacac cagggattag
2521 atatcagtac aatgtgcttc cacagggatg gaaaggatca ccagcgatat tccaaagtag
2581 tatgacaaaa atcttagagc cttttagaaa acaaaatcca gacatagtta tctatcaata
2641 catggatgat ttgtatgtag gatctgactt agaaataggg cagcatagaa caaaaataga
2701 ggaactgaga caacatctgt tgaggtgggg atttaccaca ccagacaaaa aacatcagaa
2761 agaacctcca ttcctttgga tgggttatga actccatcct gataaatgga cagtacagcc
2821 tataatgctg ccagaaaaag acagctggac tgtcaatgac atacagaagt tagtgggaaa
2881 attaaattgg gcaagtcaga tttatgcagg gattaaagta aagaacttat gtaaactcct
2941 taggggaacc aaagcactaa cagaagtaat accactaaca gaagaagcag agctagaact
3001 ggcagaaaac agggagattc taaaagaacc agtacatgga gtgtattatg acccatcaaa
3061 agacttagta gcagaattac agaaacaggg acaaggccaa tggacatatc aaatttatca
3121 agagccattt aaaaatctga aaacaggaaa gtatgcaaga atgaggggtg cccacactaa
3181 tgatgtaaaa cagttaacag aggcagtgca aaaaataacc caagaaagca tagtaatatg
3241 gggaaagact cctaaattta aactacccat acaaaaagaa acatgggaag catggtggac
3301 ggagtattgg caagccacct ggattcctga atgggagttt gtcaataccc ctcccttagt
3361 aaaattatgg taccagttag agaaagaccc catagtagga gcagaaactt tctatgtaga
3421 tggggcagct aatagggaga ctaaattagg aaaagcagga tatgttactg acagaggaag
3481 acagaaagtt gtctccctaa ctgacacaac aaatcagaag actgaattac aagcaattca
3541 tctagctctg caggattcgg gattagaagt aaacatagta acagactcac aatatgcatt
3601 aggaatcatt caagcacaac cagataaaag tgaatcagag ttagtcagtc aaataataga
3661 gcagttaata aaaaaggaaa aggtctacct ggcatgggta ccagcacaca aaggaattgg
3721 aggaaatgaa caagtagata aattagtcag tgctggaatc aggaaagtac tatttttgga
3781 tggaatagat aaggcccaag aggaacatga gaaatatcac agtaactgga gagcaatggc
3841 tagtgatttt aacctgccac ctgtagtagc aaaagaaata gtagccagct gtgataaatg
3901 ccagctaaaa ggagaagcca tgcatggaca agtagactgt agtccaggaa tatggcaact
3961 agattgtaca catttagaag gaaaaattat cctggtagca gttcatgtag ccagtggata
4021 tatagaagca gaagttattc cagcagagac agggcaggaa acagcatact ttatcttaaa
4081 attagcagga agatggccag taaaaacaat acatacagac aatggcagca atttcaccag
4141 tactacggtt aaggccgcct gttggtgggc agggatcaag caggaatttg gcattcccta
4201 caatccccaa agtcaaggag tagtagaatc tatgaataat gaattaaaga aaattatagg
4261 acaagtaaga gatcaggctg aacatcttaa gacagcagta caaatggcag tatttatcca
4321 caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa tagtagatat
4381 aatagctaca gacatacaaa ctaaagaact acaaaaacaa attacaaaaa ttcaaaattt
4441 tcgggtttat tacagggaca gcagagaacc actttggaaa ggaccagcaa agcttctttg
4501 gaaaggtgaa ggggcagtag taatacaaga taatagtgac ataaaagtag tgccaagaag
4561 gaaagcaaag atcattaggg attatggaaa acagatggca ggtgatgatt gtgtggcaag
4621 tagacaggat gaggattaga acatggaaaa gtttagtaaa acaccatatg tatgtttcaa
4681 agaaagctaa gggatggttt tatagacatc actatgaaag cactcatcca agaataagtt
4741 cagaagtaca catcccacta ggggatgcta ccttggtagt aacaacatat tggggtctgc
4801 atacaggaga aagagaatgg catttgggcc agggagcctc tatagaatgg aggaaaaaga
4861 gatatagcac acaagtagac cctggcctag cagaccaact aattcataca tattattttg
4921 attgtttttc agaatctgct ataagaaatg ccatactagg aaatatagtt agtcctaggt
4981 gtgaatatcc agcaggacat aacaaggtag gatctctaca atacttggca ctagcagcat
5041 tgataaaacc aaaaaagata aagccacctt tgcctagtgt tacgaaacta acagaggata
5101 gatggaacaa gccccagaag accaagggcc acagagggag ccatacaatg aatggacact
5161 agagctttta gaggagctta agagtgaagc tgttagacat tttcctagga tatggctcca
5221 tagcttagaa caatatatct atgaaactta tggggatact tgggaaggag tggaagccat
5281 aataagaatt ctgcaacaac tgctgtttat tcatttcaga attgggtgtc aacatagcag
5341 aataggcatt actcgacaga gaagagcaag aaatggagcc agtagatcct agactagagc
5401 cctggaagca tccaggaagt cagcctaaga ctgctagtaa caattgctat tgtaaaaggt
5461 gttgccttca ctgtcaagtt tgtttcacaa aaaaaggctt aggcatttcc tatggcagga
5521 agaagcggag acagcgacga agagctcctc aagacagtaa gactcatcaa gtttctctat
5581 caaagcagta agtaatacat gtaatatact ctttacaaat attagcaata gtagcattag
5641 tagtagtaac aataatagca atagttgtgt ggaccatagt actcttagaa tataggaaaa
5701 tattaagaca aagaaaaata gacaggttaa ttgatagaat aagagaaaga gcagaagaca
5761 gtggcaatga cagcgagggg gaccaggaag aattatcagc gcttgtggag atggggcacc
5821 atgctccttg ggatgttgat gatatgtagt gctgcagaaa atttgtgggt cacagtctat
5881 tatggggtac ctgtgtggaa agaagcaacc accactctat tctgtgcatc agatgctaga
5941 gcatatgcta cagaggtaca taatgtttgg gccacacatg cctgtgtacc cacagacccc
6001 aacccacaag aagtagtatt gggaaatgtg acagaaaatt ttgacatgtg gaaaaataat
6061 atggtagaac aaatgcaaga agatataatc agtttatggg atcaaagcct aaagccatgt
6121 gtaaaattaa ccccactctg tgttacttta gattgcactg atgttaatac cactagtagt
6181 agtttgagga atgctactaa taccacaagt agtagttggg aaacgatgga gaaaggagaa
6241 ttaaaaaact gctctttcaa taccaccaca agcataagag ataagatgca ggaacaatat
6301 gcactttttt ataaacttga tgtattacca atagataaga atgatactaa atttaggtta
6361 atacattgta acacctcaac cattacacag gcctgtccaa agatatcctt tgagccaatt
6421 cccatgcatt attgtactcc ggctggtttt gcgattctaa agtgtaatga taagaagttc
6481 aatggaacag gaccatgtac aaatgtcagc acagtacaat gtacacatgg aattaagcca
6541 gtagtgtcaa ctcaactgct gttaaatggc agcctagcag aagaagaggt aataattaga
6601 tctagcaatt tcacaaacaa tgctaaaatc ataatagtac agctgaataa atctgtagaa
6661 attaattgta caagacccaa caacaataca agaaacagga tatcaatagg accagggaga
6721 gcatttcata caacaaaaca aataatagga gatataagac aagcacattg taaccttagt
6781 agagcaacat gggagaaaac tttagaacag atagctacaa aattaagaaa acaatttagg
6841 aataaaacaa tagcctttga tcgatcctca ggaggggatc cagaaattgt aatgcacagt
6901 tttaattgtg gaggggaatt tttctactgt aatacatcac aactgtttaa tagtacttgg
6961 aatgatacta caagggcaaa tagcactgaa gtaactatca cactcccatg tagaataaaa
7021 caaattgtaa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagtgga
7081 caaattagat gttcatcaaa gattacaggg ctgctattaa caagagatgg tggtaagaac
7141 accacgaacg ggatcgaaat cttcagacct gcaggaggag acatgaggga caattggaga
7201 agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag
7261 gcaaggagaa gagtggtgca gagagaaaaa agagcagtgg gaatgctagg agctatgttc
7321 cttgggttct tgggagcagc aggaagcact atgggcgcac ggtcgatgac gctgacggta
7381 caggccagac aattattgtc tggtatagtg cagcagcaga acaatctgct gagggctatt
7441 gaggcgcaac agcacctgtt gcaactcaca gtctggggca tcaagcagct ccaggcaaga
7501 gtcctggctg tggaaagata cctaaaggat caacagctcc tagggatttg gggttgctct
7561 ggaaagctca tttgcaccac tactgtgcct tggaatgcta gttggagtaa taaatctctg
7621 aatgagattt gggataacat gacctggatg cagtgggaaa gagaaattga caattacaca
7681 cacttaatat acaccttaat tgaagaatcg cagaaccaac aggaaaagaa tgaacaagaa
7741 ttattggaat tggataagtg ggcaggtttg tggagttggt ttagcataac aaactggctg
7801 tggtatataa gaatattcat aataatagta ggaggcttgg taggtttaag aatagttttt
7861 gctgtacttt ctatagtgaa tagagttagg cagggatact caccattatc gtttcagacc
7921 cgcctcccaa cccagagggg acccgacagg cccgaaggaa tcgaagaaga aggtggagag
7981 agagacagag acagatccgg aagattagtg gatggattct tagcacttat ctgggacgat
8041 ctgcggagcc tgtgcctctt cagctaccac cgcttgagag acttaatctt gattgtagcg
8101 aggattgtgg aacttctggg acgcaggggg tgggaagtcc tgaaatattg gtggaatctc
8161 ctgcagtatt ggagccagga actaaagaat agtgtcatta gcttgctcaa cgccacagct
8221 atagcagtag ctgaggggac agatagggtt atagaaatag tacaaagagc ttatagagct
8281 tttctcaata tacctagaag aataagacag ggcttggaaa gggctttgct ataagatggg
8341 tggcaagtgg tcaaaatgta gtatgaaggg atggcctact ataagggaaa gaatgaagcg
8401 agctgagcta cagccacctg agccagcagc agaaggggtg ggagcagcat ctcgagacct
8461 ggaaaaacat ggagcaatca ctagtagcaa tacagcagct actaatgctg attgtgcctg
8521 gctagaagca caagaggatg aagaggtggg ttttccagtc agacctcagg tacctttaag
8581 accaatgact tacaagggag ctttagatct tagccacttt ttaaaagaaa aggggggact
8641 ggaagggcta atttactccc agaaaagaca agatattctt gatctgtggg tctatcacac
8701 acaaggctat ttccctgatt ggcagaacta cacaccaggg ccagggatca gatatccact
8761 gtgctttgga tggtgcttca agttagtacc aatggatcca gaccaggtag aagaggccaa
8821 cgaaggagag aacaacagct tgttacatcc tataagcctg catgggatgg atgacccaga
8881 gaaagaagtg ttagtgtgga agtttgacag ccgcctagca tttcgtcaca tggcccgaga
8941 ggtgcatccg gagtactaca aagactgctg acatcgagct ttctacaggg gactttccgc
9001 tggggacttt ccagggaggc gtggcctggg cgggaccggg gagtggcgag ccctcagatg
9061 ctgcatataa gcagctgctt tttgcctgta ctgggtctct ctagctagac cagatctgag
9121 cccgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa agcttgcctt
9181 gagtgcttca
//
LOCUS HIVPHIL1 259 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1), envelope region.
ACCESSION M27911
KEYWORDS .
SOURCE Human immunodeficiency virus type 1 (isolate 799), clone B7.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 259)
AUTHORS Ou,C.-Y. and colleagues CDC Atlanta GA,
JOURNAL Unpublished (1989)
STANDARD full staff_review
COMMENT [1] kindly submitted in computer readable form by C.-Y. Ou, CDC,
Atlanta. The sequence below, designated 799-B7, appears to be a
major form of the sibling sequences 799, taken from a hemophilia
patient in Philadelphia.
FEATURES from to/span description
pept < 1 > 259 env region (AA at 2)
BASE COUNT 99 a 45 c 56 g 59 t
ORIGIN Approximately 1172 nt from env start (BRU coordinates).
1 ttcaacacaa ctgtttaata gtacttggaa tgttactgaa gggttaaata acactgcaaa
61 taacagtgaa ggaactatca cactcccatg cagaataaaa caaattataa acatgtggca
121 ggaagtagga aaagcaatgt atgcccctcc catcagagga caaattagat gttcatcaaa
181 tattacaggg ctgctattaa caagagatgg tggtaatgaa actaacggga ccgagatctt
241 cagacctgga ggaggagat
//
LOCUS HIVSWB881 59 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1), gag region.
ACCESSION M27952
KEYWORDS .
SEGMENT 1 of 3
SOURCE Human immunodeficiency virus type 1 (isolate SWB88), clone 01.
ORGANISM Human immunodeficiency virus type 1
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Lentivirinae.
REFERENCE 1 (bases 1 to 59)
AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
Wain-Hobson,S.
TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for
viral mixtures and preferred nucleotide substitutions
JOURNAL j acquired immuno defic syndr 2, 344-52 (1989)
STANDARD full staff_review
COMMENT [1] kindly submitted in computer readable form by S. Wain-Hobson.
B88 is a so-called "slow virus" isolated in Sweden. Sibling
sequences available (see 1989: I-A-174).
FEATURES from to/span description
pept < 1 > 59 gag region (AA at 2)
BASE COUNT 26 a 8 c 10 g 15 t
ORIGIN
1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacca
//
LOCUS HIVSWB882 314 bp ss-DNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1), tat first expressed
exon of isolate L1.01.
ACCESSION M31462
KEYWORDS .
SEGMENT 2 of 3
SOURCE Human immunodeficiency virus type 1 (isolate B88).
REFERENCE 1 (bases 1 to 314)
AUTHORS Meyerhans,A., Cheynier,R., Albert,J., Seth,M., Kwok,S., Sninsky,J.,
Morfeldt-Manson,L., Asjo,B. and Wain-Hobson,S.
TITLE Temporal fluctuations in HIV quasispecies in vivo are not reflected
by sequential HIV isolations
JOURNAL Cell 58, 901-910 (1989)
STANDARD full staff_review
COMMENT Kindly submitted in computer-readable form by S. Wain-Hobson,
Pasteur Institute, Paris.
The B88 isolate is considered to be a "slow-low" viral form.
Sibling sequences from pbl's and viruses propagated on pbl's are
available. This sequence corresponds to the earliest sample major
form, L1.01.
FEATURES from to/span description
pept 30 / 244 tat exon 2 (first expressed exon)
BASE COUNT 111 a 57 c 70 g 76 t
ORIGIN
1 cattattcaa cagaggagag caagaagaaa tggagccagt agatcctaga ctagagccct
61 ggaagcatcc aggaagtcag cctaagactg cttgtaacaa ttgctattgt aaaaagtgtt
121 gctttcattg ccaagtttgt ttcacaaaaa aaggcttagg catctcctat ggcaggaaga
181 agcggagaca gcgacgaaga gctcctcagg acagtgagac tcatcaagtt tctctatcaa
241 agcagtaagt agtacatgta atgcaatctt tagaaatatt agcaatagta gcattagtag
301 tagcaataat acta
//
LOCUS HIVSWB883 250 bp ss-RNA VRL 11-AUG-1990
DEFINITION Human immunodeficiency virus type 1 (HIV-1), envelope region.
ACCESSION M27947
KEYWORDS .
SEGMENT 3 of 3
SOURCE Human immunodeficiency virus type 1 (isolate SWB88), clone 01.
REFERENCE 1 (bases 1 to 250)
AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
Wain-Hobson,S.
TITLE HIV1 isolates are rapidly evolving quasispecies: Evidence for viral
mixtures and preferred nucleotide substitutions
JOURNAL J. Acq. Immun. Defic. Syndrome. 2, 344-8 (1989)
STANDARD full staff_review
COMMENT [1] kindly submitted in computer readable form by S. Wain-Hobson.
This stretch comes from the first hypervariable region of env.
Sibling sequences are available (see 1989: I-A-175).
FEATURES from to/span description
ORF < 3 245 env ORF (AA at 3)
BASE COUNT 96 a 40 c 42 g 72 t
ORIGIN
1 aaattaaccc cactctgtgt tactttaaat tgcactaagt tgaagaatgt tacttatgcc
61 aataatagta gtagggaaaa agaagaaatg aaaaactgct ctttcaccac aagaataggg
121 aataaggtgc agaaagaata tgcacttttt tataaacttg atgtagtacc aatagataag
181 gataatacaa gctttatatt gatacattgt aatacctcga ccattacaca ggcttgtcca
241 aaggtatcct
//
LOCUS ALFLBA 578 bp ss-mRNA PLN 11-AUG-1990
DEFINITION Alfalfa leghemoglobin mRNA, complete cds.
ACCESSION M36100
KEYWORDS leghemoglobin.
SOURCE Alfalfa 15 day old nodule, cDNA to mRNA, clone pNL549.
ORGANISM Medicago sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 578)
AUTHORS Barker,D.G., Gallusci,P., Lullien,V., Khan,H., Gherardi,M. and
Huguet,T.
TITLE Identification of two groups of leghemoglobin genes in alfalfa
(Medicago sativa) and a study of their expression during root
nodule development
JOURNAL Plant Mol. Biol. 11, 761-772 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 7 450 leghemoglobin
BASE COUNT 189 a 91 c 126 g 172 t
ORIGIN
1 agaaatatga gtttcaccga taagcaggag gctttagtca atagctcatg ggaggcattc
61 aaacaaaacc ttcctcgtta tagtgttttt ttctacactg tcatattaga gaaagcacct
121 gcagcaaaag gtttgttctc ttttcttaag aactcagctg aagtacaaga tagtcctcaa
181 ctccaagctc acgctgaaaa ggtttttgga ctggtgcgtg actcagctgt tcaacttcga
241 gcaacagggg gggtagtttt gggagatgct acgttgggtg ctatccacgt tcgaaaggga
301 gttgttgatc ctcattttgt ggtggttaaa gaggctttgc tcaaaacaat aaaagaagca
361 gctggagaca aatggagtga agagcttaac actgcttggg aagtagctta tgatgcactg
421 gcaactgcaa ttaaaaaagc aatgagttaa acctgtgatg atctattatc ataaaaagat
481 taataaataa aatatgtatt actaaaactt gttaaacaag ttcctataat aaattatgat
541 aaaaatattt atgaaatatt gttagtgttt atgccggt
//
LOCUS ALFLBB 521 bp ss-mRNA PLN 11-AUG-1990
DEFINITION Alfalfa leghemoglobin mRNA, complete cds.
ACCESSION M36101
KEYWORDS leghemoglobin.
SOURCE Alfalfa 15 day old nodule, cDNA to mRNA, clone pNL154.
ORGANISM Medicago sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 521)
AUTHORS Barker,D.G., Gallusci,P., Lullien,V., Khan,H., Gherardi,M. and
Huguet,T.
TITLE Identification of two groups of leghemoglobin genes in alfalfa
(Medicago sativa) and a study of their expression during root
nodule development
JOURNAL Plant Mol. Biol. 11, 761-772 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept > 1 396 leghemoglobin
BASE COUNT 175 a 77 c 112 g 157 t
ORIGIN
1 aaatcattta aacaaaaccc tggtaatagt gttttgttct acactataat attggagaaa
61 gcccctgcag caaagggcat gttctctttt cttaaggact cagctggagt acaggatagc
121 cctaaactcc aaagccatgc tgaaaaagtt tttggaatgg tgcgcgattc agctgctcaa
181 ctccgagcaa caggaggagt agttttggga gatgctacat tgggtgcaat ccacattcag
241 aaaggagttg ttgatcctca ttttgcggtg gttaaagaag ctctgctgaa aacaataaag
301 gaagtatcag gagataaatg gagcgaagaa ttgaacactg cttgggaagt agcctatgat
361 gcattggcaa ctgcaattaa gaaggcaatg gtttaaattt gggatgatct atagttatta
421 ctcaatttaa taaataaaaa atgtattact aaaacttgtt taacaagttt ctataataaa
481 tgttgttgaa atgactatta aatattattt gagtttatgt t
//
LOCUS CLTPHYC 1077 bp ds-DNA BCT 11-AUG-1990
DEFINITION Calothrix sp. phycocyanin alpha (PC2-alpha) and phycocyanin beta
(PC2-beta) genes, complete cds.
ACCESSION M36276
KEYWORDS phycocyanin.
SOURCE Calothrix sp. (strain PCC 7601 (Fremyella diplosiphon UTEX 481))
DNA.
ORGANISM Calothrix sp.
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria;
Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE 1 (bases 1 to 1077)
AUTHORS Capuano,V., Mazel,D., Tandeau de Marsac,N. and Houmard,J.
TITLE Complete nucleotide sequence of the red-light specific set of
phycocyanin genes from the cyanobacterium Calothrix PCC 7601
JOURNAL Nucleic Acids Res. 16, 1626-1626 (1988)
STANDARD full staff_review
FEATURES from to/span description
pept 1 519 phycocyanin beta (PC2-beta)
pept 589 1077 phycocyanin alpha (PC2-alpha)
BASE COUNT 276 a 259 c 242 g 300 t
ORIGIN
1 atgcttgacg cttttactaa agtagtttcc caggctgata ctagaggcgc atacattagc
61 gatgcagaaa tcgatgcatt gaagacaatg gttgctgctg gttctaagcg catggatgtt
121 gttaaccgca ttaccggcaa tgcctccaca atcgttgcta atgcagctcg cgctttattt
181 gaagaacaac cccagttgat tgcacctggt ggtaatgctt acaccaaccg tcgtatggct
241 gcttgcttac gcgacatgga aattatctta cgctatgtaa cctacgcagt atttgcaggt
301 gatgctagcg ttttagacga tcgctgcttg aacggtcttc gtgaaacata tcaagcatta
361 ggcgttcctg gcgcttccgt atctactggc gttcaaaaga tgaaagaagc tgcgatcgct
421 attgctaacg accccagcgg tgttacccgt ggtgactgta gctccttaat gtctgaattg
481 ggtagctact tcgatcgcgc tgctgctgca gttggctagt aaatcgctag ttaattattt
541 ctactttctg acgcataact ttagatattt gttgagagaa taatcagcat gaaaacacct
601 ttaaccgaag ccgtagctac cgcagattcc caaggtcggt ttttgagctc cacagaactt
661 caagttgctt ttggtcgttt ccgtcaagca tctgctagct tggatgcagc taaagcatta
721 agcagcaaag ctaacagctt ggctcaaggt gccgttaacg cagtttacca aaagttcccc
781 tacaccaccc aaatgcaagg taagaacttc gcttctgacc aacgcggtaa agacaagtgt
841 gcacgcgaca tcggttacta cattcgcatc gttacctact gtctagttgc tggtggtaca
901 ggccctctag atgactacct aattggtggt ttggcagaaa tcaaccgcac ctttgatctg
961 tctcccagct ggtacgttga agctctcaaa tacatcaaag ctaatcatgg tttgagtggc
1021 gatcctgctg ttgaagcaaa ctcctacatt gactacgcaa tcaacgccct aagctag
//
LOCUS PINACT 745 bp ds-DNA PLN 11-AUG-1990
DEFINITION P.contorta (lodgepole pine) actin gene, last two exons.
ACCESSION M36171
KEYWORDS actin.
SOURCE P.contorta (var. latifolia) seed DNA.
ORGANISM Pinus contorta
Eukaryota; Plantae; Embryobionta; Pinophyta; Pinicae; Pinatae;
Pinaceae.
REFERENCE 1 (bases 1 to 745)
AUTHORS Kenny,J.R., Dancik,B.P., Florence,L.Z. and Nargang,F.E.
TITLE Nucleotide sequence of the carboxyl-terminal portion of a lodgepole
pine actin gene
JOURNAL Can. J. For. Res. 18, 1595-1602 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 420 actin, exon X (AA at 1)
555 620 actin, exon X + 1
IVS 421 554 actin intron A
BASE COUNT 196 a 124 c 172 g 253 t
ORIGIN
1 aagcttgctt atgtagccct tgactatgag caagaacttg aaacatcgaa gagtagttct
61 tcacaagaaa agaattatga gcttcctgat ggacaggtca ttaccgttgg tgctgagcgt
121 ttcagatgtg ctgaggtttt gttccagcca tctctcattg gaatggaagc agcgggtatc
181 catgagacta catacaattc cattatgaaa tgtgatgtgg atatcagaaa agatctctat
241 ggaaacattg ttcttagtgg tggttctacc atgtttcctg gtatagctga tcgtatgagc
301 aaagagatca ctgcacttgc tcccagtagc atgaaaatca aagttgtagc accccctgag
361 aggaagtaca gtgtttggat cggaggttct atcttggcat ctcttagcac atttcagcag
421 gttattctca actttctaca tgccggaact tgaggttctt gtttggtttg gtggtattta
481 atgtgctttg caatatataa atattatttc tttgattgta tatttaacca tgccttgatt
541 gtttttgtat gcagatgtgg attgccaaga ccgagtatga ggagtcaggt ccctcaattg
601 tccacagaaa gtgtttttaa acagagtgct ctattgatta tgctttatga ttggccttta
661 ggtatattgt gagggtggaa gacacgatta ccgagatttt gatttagtat catgctttag
721 gatttgcatc tgtatagttg aattc
//
LOCUS RATMHCI 1355 bp ss-mRNA ROD 11-AUG-1990
DEFINITION Rat MHC class I RT1.C/E mRNA, 3' end.
ACCESSION X16979
KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene;
integral membrane protein; major histocompatibility complex.
SOURCE Rat liver, cDNA to mRNA, clone RT.BM1.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1355)
AUTHORS Parker,K.E., Carter,C.A. and Fabre,J.W.
TITLE A rat class I cDNA clone with an Alu-like sequence and mapping to
two genes in RT1.C/E
JOURNAL Immunogenetics 31, 211-214 (1990)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 857 MHC RT1.C/E (AA at 3)
mRNA < 1 1355 MHC RT1.C/E mRNA
rpt 1210 1349 type 2 Alu-like repeat
BASE COUNT 322 a 334 c 387 g 312 t
ORIGIN
1 attgggagcg ggagactcgg aaagccaggg acaccgggag gaacttcaaa gtgaatctga
61 ggaccctgct ccgctactac aatcagagtg acgacgaatc tcacacgctg cagtggatgt
121 acggctgcga cgtggggccc gatgggcacc tgctccgcgg ttattgtcag gaggcttatg
181 acggccggga ttatatctcc ttgaacgagg acctgcgctc ctggaccgcg actgacatgg
241 cctcacaggc ctctaagatc aagtcagagg aagttggtga ggcccatcac cagagggcct
301 acctgcaagg tccttgcgtg gagtggctcc acacatacct acatctggga aaggagacgc
361 tgctgcgctc agacccccca aaggcacatg tgacccttca ccccagacct gaaggtgatg
421 tcaccctgag gtgctgggcc ctgggcttct accctgctga catcaccttg acctggcaat
481 tgaatgggga ggacctgacc caggacatgg agcttgtgga gaccaggcct gcaggggatg
541 gaaccttcca gaagtgggca gctgtggtag tgccttctgg ggaggagctg aagtacacat
601 gccatgtgga gcatgaggga ctgcctgaac ctctcgccct gagatgggag ccttctccat
661 tcaccgactc cagcatgcca gtgattgttg tccttggagc tgtggccatc attggagctg
721 tggccatcat tggagctgtg gccatcattg gagctgttgt gaggaggagg aagagaaaca
781 caggtgaaaa aggcagctat gctcatgttc taggcagcaa ggctttccag atctctgact
841 ggcctcagaa ggcatgaaaa gacagttgcc tcgtggggct gagcgacaag ttctgactaa
901 acatctgagg gagctgcttc ccctgaggac aatgtgaaga actggcaatc ccagaccttc
961 ttgcctccct ggacacctga cttcacaaca gaaaataatc tgactctggg ctgggtgttt
1021 cctattgctt ttatctgcac tgagtggaaa tgcggtttcc aacagtgtaa cgcagtaccg
1081 ttgtgtggtg gtagtgacat tgcacacagg tttgtgagct gggggtgtga tgtgcttttg
1141 ttactgtagg cagaacaagc tagaagtcca ccattcctct tatggtcatt gttttaaaac
1201 aaaacaaatg ctggagagat ggctcagtgt taagagcact gacttctctt acagaggtcc
1261 tggttcaaat cccagcaacc acatggtggc tcacaaccat atgtaatgag atctgatgct
1321 ctcttctgat gtgtctgaac acagctatat atgat
//
LOCUS SCOMTCOIII 1120 bp ds-DNA ORG 11-AUG-1990
DEFINITION S.commune mitochondrial cytochrome oxidase III (COIII) gene,
complete cds.
ACCESSION M36270
KEYWORDS cytochrome oxidase III.
SOURCE S.commune mitochondrial DNA.
ORGANISM Mitochondrion Schizophyllum commune
Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes;
Agaricales; Schizophyllaceae; Schizophyllum commune.
REFERENCE 1 (bases 1 to 1120)
AUTHORS Phelps,L.G., Burke,J.M., Ullrich,R.C. and Novotny,C.P.
TITLE Nucleotide base sequence of the mitochondrial COIII gene of
Schizophyllum commune
JOURNAL Curr. Genet. 14, 401-403 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 147 953 cytochrome oxidase III (COIII)
BASE COUNT 387 a 155 c 158 g 420 t
ORIGIN
1 ggatcttaca aagaatatta tctttcaact ttcttttaaa taataaaatt acttgttaat
61 taattcaaat tacaagtgta aagccatagt ataaaacaaa atttataaat aaataaataa
121 aaaataacaa ataataaata acaaaaatga caagtattaa attctatcaa tcatttagtg
181 cacacttagt acaacattca ccttggccaa ttttagttag tttcagtctt tttaatttag
241 ctattggtac agtattaact atgcatggat acagtcacag ttcaacaact tttgatttag
301 gtttagcagt aactgttgga tcaattttat tatggactag agatattgta attgaaggat
361 ctttcttagg agatcacaca aaacaagtac aagaaggatt aatcatagga tttatcttat
421 ttattataag tgaagtattc gcttttattt cagtattctg ggcttatttc catagtgctt
481 tatcaccagc tgtagaatta ggatcaacat ggccaccagt aggaataatt ccattagata
541 ccttttcatt accattattt aatacaataa ttttattatc tagtggagct tttgttacct
601 atggacacca tgctattttc tcaggaaaaa gattagattc aatcatagga ttatttttaa
661 cagtagcctt agcattaatc tttagttact ttcaagcatt tgaatatatt catgctggat
721 tttccatgtc cgatagtgta tttggaacag tattttttgc aagtacagga ctacatggga
781 tacatgttat gcttgggact ctttttcttt tcgtatcttt tcttagacaa gttaactatc
841 aaactacaaa agaacacaat attggacttg aaacaagtat tctatattgg cactttgttg
901 atcttgtgtg gttattctta tttcttgttg tttacttttg gggtggtgca taattatttc
961 ttttctttta ttgaaagtaa taactattca ttagaactta taaactatgt gcaatttcca
1021 attttattcg gatttatcaa tacaggggga aaaacttctc ccttagaaaa tacaagaaat
1081 tgaagacaaa gaattaaatg taataaataa aataattaca
//
LOCUS STYHSQ 135 bp ds-DNA SYN 11-AUG-1990
DEFINITION S.potsdam and S.typhimurium hsdS recombinant DNA.
ACCESSION M15940
KEYWORDS DNA binding protein; type I restriction and modification system.
SOURCE S.potsdam and S.typhimurium DNA, clone lambda-hsd SQ.
ORGANISM Artificial gene
Artificial sequences; Genes.
REFERENCE 1 (bases 1 to 135)
AUTHORS Fuller-Pace,F.V. and Murray,N.E.
TITLE Two DNA recognition domains of the specificity polypeptides of a
family of type I restriction enzymes
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9368-9372 (1986)
STANDARD full staff_review
COMMENT The hsdS genes of the SB (S.typhimurium) and SP (S.potsdam) have a
conserved sequence of around 100 bp flanked by two nonhomologous
variable regions of around 500 bp. Recombination between the hsdS
genes of SB and SP generates a system (SQ) with a different
recognition specificity. The crossover point lies in the central
conserved region. Concomitant with the generation of a new
combination of flanking variable regions is the recombination of
minor differences in the central conserved region. A polypeptide
domain encoded on the 5' side of the crossover dictates recognition
of the trinucleotide component "aac" (positions 25-27) of the
target sequence, and a second domain, encoded on the 3' side of the
crossover (positions 50-54), governs recognition of the tetra- or
penta-nucleotide component. Parts of the variable regions are
implicated in the determination of the specificity of interaction
between protein and DNA.
FEATURES from to/span description
pept < 1 > 135 hsdS specificity protein (AA at 1)
recomb 32 101 SP DNA end/SB DNA start (uncertain within
span)
BASE COUNT 46 a 39 c 24 g 26 t
ORIGIN Unreported.
1 ataccaatcc cgtcacttgc tgaacaaaaa atcatcgccg aaaaactcga tacgctgctg
61 gcgcaggtag acagcaccaa agcacgtctt gagcaaatcc cacaaatcct gaaacgtttt
121 cgccaatcag tgata
//
LOCUS RATCYP2A3A 14315 bp ds-DNA ROD 11-AUG-1990
DEFINITION Rat cytochrome P450 II A3 (CYP2A3) gene, complete cds.
ACCESSION M33190
KEYWORDS coumarin 7-hydroxylase; cytochrome P450 II A3.
SOURCE Rat (Sprague-Dawley) DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 14315)
AUTHORS Ueno,T. and Gonzalez,F.
TITLE Complete sequence of the rat CYP2A3 gene specifically transcribed
in lung
JOURNAL Nucleic Acids Res. 18, 4623-4623 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by F.Gonzalez 23-MAR-1990, for release after publication.
FEATURES from to/span description
pept 3446 3625 cytochrome P450 II A3, exon 1
3892 4054 cytochrome P450 II A3, exon 2
4736 4885 cytochrome P450 II A3, exon 3
5184 5344 cytochrome P450 II A3, exon 4
6879 7055 cytochrome P450 II A3, exon 5
8457 8598 cytochrome P450 II A3, exon 6
9049 9236 cytochrome P450 II A3, exon 7
9806 9947 cytochrome P450 II A3, exon 8
11123 11304 cytochrome P450 II A3, exon 9
pre-msg 3422 11497 cytochrome P450 II A3 mRNA and introns
IVS 3626 3891 intron A
IVS 4055 4735 intron B
IVS 4886 5183 intron C
IVS 5345 6878 intron D
IVS 7056 8456 intron E
IVS 8599 9048 intron F
IVS 9237 9805 intron G
IVS 9948 11122 intron H
signal 11466 11471 poly-A signal
BASE COUNT 3819 a 3801 c 2926 g 3769 t
ORIGIN Chromosome 7.
1 gaattccaac agagacccca cttgtgcacc cgaaatgata cacccagaag aaggtacaag
61 cagagctaag cgaggcagag gctcccatca taacctacta atcaaaacga tagcagcgtg
121 aacatgttca caagcatgga ggacttgggg acaacagtgg ctcgctaagg aggagaaagc
181 aaacatgagt cgatagtcta cagtgagaga tgatagacaa caaggcaaag tcctgtgcag
241 gcttcagctc catccagagc aatctgaaca tctaacgctg atcagaaatt gcccaacagg
301 gcaccaagcc tccaggataa acttcctact cacagatcct tgagtatggt ggaagagggc
361 actgatgctt ggctttgatg aagaagcact ggggtgggag gcttctcaca caccaccagc
421 ttctttaggg ttgtcctgga ggcatttcaa aatgagacag gccttgcttg cccaccgtaa
481 ctgaatgtta ccaaagacac caggagcttt ggctacagga catagaaata tcaatctcaa
541 actgaccagg aaattgtctc cttagtggtt ctgggagtat cattggtctg accaagtgtg
601 gaccctctcg ctacaataaa aacaagaggc tcccaaaggc agattgaagt tcagaggaac
661 atcaatccat atcctttttt cttgtcaagt aataagatgt gataataaca gatgaataaa
721 atagctattt attgacttta gcattaactc aaccattact accaagaaga agattcacct
781 ataaagttca gtttgttatt gaagtcagct cttcaaggac acacaacaga caagatctca
841 gggacactga ttggattgcc caggagttgg tatctgccca ggttagggtt ccattctatg
901 ctgtgccatt gctctcaatc acggaaccag aaagggagac tggaaccatc agagggagac
961 ttcccaaggt tgctgcttat tcactctgga aacctgtctg agatgcttgc tgctgttccc
1021 acatgtctct ggcatgtgac cgctgtcacc gatccttcat tcacgcccct gtgtctctcc
1081 tgcataactc ctgttcctgc tctgtcattc atgtgcccgg aaaaaaaaat caataaggac
1141 aaatacactt gatctctcct gaactctcca agctctgttc ccattctggc ccccgtgccg
1201 aacatgagaa catgcctgac cactcagggt ccttctgggt gcctcctcca ctgagaaaca
1261 gattctctaa attacccctt cctcctattc aagtggaaac aaggggagag gagaggtccg
1321 tgctgggtat agtgcaggtg caggagagat tgatgcctgc caagcctagg ctaatcccat
1381 tccccccccc gccccttcct cctccaaaat caacaaacaa cctttctctc tgctttcttc
1441 tcggcccttc gtttcctccc caaccggtct cagagtccaa cagcctaaaa tgagatgagc
1501 taattgccac tcatgagctc caacctcacc caatggtcta atgctgggat accatcatgg
1561 gagggaccca tccatcaaag cctgccagca tccacccaag gtccctccca gggtctcaag
1621 gaagccaggg tcagcaagca cctagtgttt atatcagagc ctgctggaac caaatccata
1681 cccagcatgg tgccacactt atgccttcac tcaaacaatt ccagtgacca cctgccgccc
1741 ctctcaaggt ctctctcagg gtaagaatat caccaatatc cttttggtgt caagctccgt
1801 gctaactcct ttttcacctt agtgcttgcc tgggggctat ctcaatgcca accccatcct
1861 cctccctcct gggagcccat cctgcagcta agcttaattc caggtgtctc cagggtgagc
1921 tgagctggat aatgaggagg aaagagaatc aaatccctgc tcctttatga cccggtcatc
1981 tcaatgaggt tggtgatgca ccaactgcac aatgactaag attccacaga tgggatacac
2041 atcatcttcc tgggctacgg gttcatcaaa actgtccaga ttattaaaga agacttacaa
2101 tgtgagagaa cagtagggtc attttccctt ctccatagcc accctcagca tcccagggtc
2161 ctcttctgtt ccccatgctg gacgctcatc taaatctctc cacctcagcc cactgataac
2221 atccaggcat cagcacacat acccaccccc caccaaatat gacctttatt tccccaatgc
2281 tctcttctca ctggtgggaa agatgtctag acagggaaag ctaaaagtat gagcaagagt
2341 agatgttaat gcctctgact ttcgggcttt ccatgtatgt gccccccatg ccacccctat
2401 gttttgttca cgtgcaccca tagaaaacat gtgtaaatta cagcatcatt gagccagtga
2461 ctctgatggc ctcctctaca aatctcgata ggaatccagt ggggctttcg taaattagaa
2521 ctttgctacc atattggagt tgtgattatc atttctcaag acctagacag acccccagat
2581 attggctcta gtactgtgcc aaattccagt gaaatatgac tctggtcttc cctacctttc
2641 tcagatctta tgttccagca atccatccta gactcagaga cttctctggg ttctgagcat
2701 taagagacag catgaagcca taaccaaagt gcatccacgc cggggcacag gtgttgactc
2761 caagatcctc ctgcagtttg tgtccttcca ggcactcccc atccacagcc tttcttgact
2821 caagatccat aacccatagc cctggagtcg ccaattctcg gttccttaca ttcaggaccc
2881 acttcattga aagtctccca gggttggctg tggaagtgta ggtctgtaat cccagcactc
2941 agagcagaca ggtggatctc tgtagtctat gtacagactc cggactaccc agaactacat
3001 agtgaaaact tgtctcaaac aagaaagaaa agtgatcttt caacttcata gccattcatc
3061 ctatttcgaa caaatgaatt atagggttct tctctcctaa gaccctaata tggctatcac
3121 ttgtgccctt tctcaaagga agggtctcta gtatacatgt catgaaggat ccagttctgg
3181 cttctctcta ctgtcctctg taatgcatag ttttggagca cttcctcctt agacaccaca
3241 aactcagtct actccctgtt aatctgtcat ggagccctcc tcctccttga gtgttggcta
3301 tgtcccaaac taggtgcatg acgggggtag tctggaggta aaattagaca gttgcataat
3361 caagaccaaa ggccagcctt ctatctcaga atgtataaaa acaagctaat tcagccatca
3421 ctgtgtctgt catcccatcg ccaccatgct ggcctcagga ctccttctgg tggcctcagt
3481 ggccttcctc agtgtcctgg tcttgatgtc tgtctggaag cagaggaagc tctcagggaa
3541 gctgcctcct ggacccaccc cattgccctt catcgggaac tacctccagc tgaacacaga
3601 gaaaatgtac agctctctca tgaaggtgcc actgggcatg gagatgggtc gaatgggggc
3661 gtggagggat gcccaatttt gattgaggtt ttgtggtagg tattgaccag atgggagtgt
3721 aattttaggc caggggaatt gtgaagaaat cagcactttc taggccagtg gtatcacatc
3781 ttgggacacc cagtctgata actattagaa ctctagagag atgaatccca gaccggatat
3841 caaagaatga accatcccct cttaccttct ctgatcattc tcctccatca gatcagccaa
3901 cgttacggtc ctgtattcac catccacctg ggacctcgcc gagttgtggt gctgtgcgga
3961 caggaggcag tcaaggaggc tctggtggac caagctgagg aattcagtgg tcggggagag
4021 caggccacct tcgactggct tttcaaaggc tatggtgagg ggaagaccca ggggaatgtg
4081 gcagaggaca tttgagggca tcagtttccc cagatttctc cctgactgtc tatcccctcc
4141 aagcaatctt ctccccacat ccctgtctca gactggtctc tctgccagtg tttcctgact
4201 gccccacctc cacccatgcc tccttttcca cctagcaatt gctgtcggaa tcctgccgcc
4261 tctaagggga tccgaagctt ccctgcaacc cacaccatca ttttcagctt ctgcgattgc
4321 tttttctctt cctctcctct gctttccagt gtcagacatc acagaacgtt tgttcctagg
4381 ttctatctac cagcaacgat ttcttctctt tccagactct ctttgtgtct ctctctttat
4441 ttccctctct ttttttgctc aattacactt ctttcacaat tgtatccacc ctcagctctc
4501 tgcagtctcc tgtctctcac ctctgtctct gcacctctgt ctccctcttc ttactgtcca
4561 gtaacttgct cttttacctg tactttcgtg gtgccttctt gtgttttgca tttgcatttc
4621 caaggatctg tgtgcatctt tttccttctt gaccgcctga aggggaacag cgcaagggtg
4681 gtcaatggcc agccacgccc cctgacttct cctggcttct cattcacctt cccaggcgta
4741 gccttcagca gcggggagcg agccaaacag ctaaggcgct tctccatcgc cacgctgcgg
4801 gacttcggcg tgggcaagcg tggcatcgag gagcgtatcc aagaggaggc gggctttctc
4861 atcgagtcat ttcgaaagac gaacggtaag cggagcaccg ttctccggac ctaccaaggg
4921 aaagcagcac catgtcccag gatagaaact aggctaggga agtgtatgct ggtcccctac
4981 tgtgggatct ggtctcagcg tttttggttc agcgctgcag ccagtgccct tcttgagttc
5041 tcagattcaa agtatctgat gctaacaccc aggcgatgct ttttgcctag ttccaatgtc
5101 tccctttctt tcctcctcct cctccccttc ctccccctcc tccccctcct cctcctccct
5161 ctcctcctct tccccatcca caggtgccct cattgacccc accttctatc tgagccggac
5221 agtctccaat gtcattagct caatagtctt cggggaccgc ttcgactatg aggacaaaga
5281 gttcctgtca ctgcttcgaa tgatgctggg aagcttccag ttcacagcta cctccacggg
5341 gcaggtaacc tgtttcaacc ttgtcctgtc atacctcctg accttgactg ccagctcact
5401 cccaaactgc ttccagcagg caccttcaca ctagcccttg cagttttgca agaacaaata
5461 aatgtcaata tgtagacatc ctcaacagct gcccctgaaa tcaaaagctg gcagtctatc
5521 tccctttccc cctccctttt ccccttcccc tttccccctc cccttccccc tcccctctcc
5581 ctcccctccc cttccctctc atgtacacac acacacacac acacacacac acaccacaca
5641 cacacacaca gccactcagc tccttcagct gggcacatgt taccaccaca atttgtttta
5701 atttctaaca caagattgcc ataggaaggg tccgctgaaa acaccacccc tcttcagttc
5761 ctgaacatct ggcactgcag aaaacacaga gctagcccac tggagtatgt agatgagtgc
5821 cccttactca gccacggaac taagcaaacg tctgaggctc caagcgaaat ttctgcacag
5881 atgtccccca aacatccaag taagggtctt tcaacacagc tcattcacac tcctaagcac
5941 gaccatctgt gaagctgttt gataacaaag caggtgaaga tatggatatc agcactactt
6001 aaaatagaaa taataagaca attttggaca agcatacatt gtgtattgag ccttttcagc
6061 cccatacccc ccaccttccc tgttctcaca cttcctcctc ttcctcctcc tcctcttcct
6121 cttcctcctc ttcctcctcc tccttctgtt agtcctgttt gtccctgaga cagttttatt
6181 tctacttcta agtgatacac atgtacatat atatgcatat gtatgtacaa cgtgatttga
6241 attaatacca cagattgaaa gaacacatac acgggttctc taatactggt taaatttatt
6301 tgatatgatc tcccattgtg tctgctttcc tgaaaatttc cataccattg tttttaatgg
6361 ctgaaaaatt ttacacacac acacacacac acatacacat atacatatat acatatacat
6421 atatatctca cattttctct ttcattacct tgtctttgga cagctagttt gagtgcataa
6481 ttcagttact gtgactaccc aggactggtt ggtttaatac tggaaaacct gcgaaccttc
6541 accatccaaa ccctgctagt atttgaacag ctgcctctag ctaatgtcac ccaaatacct
6601 aagcatctac agatgaaact tcagcccaac aactgaatgt cattcaagcc agccatgcgc
6661 aagaactcca aacctatata tgcgaccaaa tccacctcac ttaaaatctt ggacaatagc
6721 cctctcactt aaactataag aaaactatat tgcctccttg ccctaccagg ttacctgaac
6781 ctcaggacag gtgacagaaa agtcccttgc ctgaaaacag ccaccacacc tgttggattt
6841 tatgcagtct tggtttctcc tcctttttcc accctcagct ctatgagatg ttctcttctg
6901 tgatgaaaca cctgccaggc ccccagcaac aggcctttaa ggagctgcag gggctggagg
6961 acttcataac caagaaggtg gaacagaatc agcgcacgct ggatcccaat tccccaaggg
7021 acttcatcga ctctttcctc atccgaatgc tggaggtaaa cccagaagtt agggagcatg
7081 gcgggaggtc caaagccaag caggatgaaa tcacgcttgg ggaatggggg taatggtgga
7141 ccacttaaat cagtcctctc aatataaacc tcacatatta ccactcagca ccactgttcc
7201 tttcacttct cagtctcagg aggatggaca gactgagatc acagccagga tgtggtgctg
7261 ccatatgcca ggcaccattt taaaccattt taaaggctta gtgatatcca cttatttagt
7321 attcacacct tggacgaggt tgtgttatga ctcccatttt acaaatgagt aggctgcagc
7381 tctgagaata taggcaactc gtcctgcatc acagctgtct caggctgaca cacaacttaa
7441 cgttgcctgt cctcattgtc attcacgtct gtggcacagg tgacctagct ttattgtgac
7501 cttctctcac ctatatggaa cagaggcagg ggtgttttgt gataggctgt agccctagtt
7561 ggcctgaaac ttgctctgca caccaggcaa gtctcaaacc tgccgtggtc ctcctgaggc
7621 tgctttctga tggcttggat tatcaacatg tggtatccgg ctgagatgga acttcatgta
7681 gtctgaatgc aaaggattga gggatgagat tcccctagaa aatattaagg ttcgtaatga
7741 agagatggga tttattgcct agcattcatt gtctgagggt tggaaagatg ggtcagtggt
7801 taagagcact ggctgctctt ccagagtgcc agtgggttca atcccagcac atgcatggca
7861 gctcacaacc atgtcatttc agtcccagta gatccaatgc cctctttggg cctccatggt
7921 tactgaacgc atgtagtgct ctgacacaca tgcaggcgaa acactcatac actcaaaata
7981 atttttaaaa agctctgtat caggatgggg atttagctca gtggtagagc acttgcctag
8041 gaagcgcaag gccctgggtt tggtccccag ctccgaaaaa aagaacaaaa aaaaagctct
8101 gtatctacct ctgaaaaacc atctcaagtt ttccctgact tcctccaaca tgacgtcagt
8161 accattgttt cttgggctct cttcagcctt cctttcctaa tttcgaccat taaataactg
8221 acgactaagt tagtaacaat cagacaagca ctaagtcacg aatgacagaa gagcatttat
8281 ggtcccaggt cacttccaaa tgtctgggcc ccggagagaa ctcagcatag ccagggttcc
8341 tcttctaagc cttaggtctg gcccaaagag aagtgagttc aagttgggaa aaaaaagcta
8401 gaagggtacc aagagtgtaa ggaatggatt tgacgtgaat atttccctcc ctgcaggaaa
8461 agaagaaccc caatactgag ttctacatga agaacttggt gctgactacc ctaaatctct
8521 tctttgccgg cacagagacc gtcagcacca ccctgcgtta cggctttctg ttgctcatga
8581 agcacccgga tattgagggt aacgaaggct tcaagggaca ggaaagcagg ggttgagggg
8641 agcataacct ggatatagat tctaaatttg catcttttat ctgccacaat gtcccctacc
8701 tttctcaaga aaccggactc tcaaacagct agctaccccc tgctttgcct agacagagcc
8761 acagctggtg ggcaccagtc tctagttgct taaccaactg gtgtgctgct cgccacactt
8821 cagaggagtt acttccccgt ctcccgggtc tcctcgcaac tcccgtacaa ttatggtatg
8881 atccatccag ctcacgggaa acttcttcac caatgcagga gttgcacata ttgaaaatga
8941 tctacaatgc catacactgt tccatggcct ccatcccgtc ttagtttctg aactatcatc
9001 tctactgtgg cgttcttgga ttcctaaacc ggtccctgtt cttctcagct aaggttcacg
9061 aggagattga ccgggtgatt ggcaggaacc ggcaggccaa gtatgaggac cgaatgaaga
9121 tgccctacac ggaggctgtg atccacgaga tccagagatt tgcagacatg atccccatgg
9181 gcctggctcg cagggtcacc aaggacacca agtttcgaga gttcctcctc cccaaggtgc
9241 tgattcacct agcctagagt gacctctaga cctcttcatg ggcctccagg acctcctgct
9301 cattagaaac cacccggccc ctctcttggt ttcttagtga caggctttct ctgtcaccac
9361 atcccactgc cctcctactt agagactcct atatcttgtc ttcaccccag actttcccac
9421 tcaggaattt taactccatg cctcttaaga agtataaact taatttatct agatcttgac
9481 tgtgttggag ataggagtct catgtctgct atatatcctg tttggggcta cacactttca
9541 atacccccaa atcttgactt gagaggcaca atttgcccca gaagacaaag cctgtattac
9601 tcatctgtcc tggctactag aacatgagtc ccacatcccc catatcttaa ggataacctc
9661 acattcggca aagctcctcc ctcaaaggtc ccctaactca ctatctttac cttcacttgg
9721 ataccctagc tcaccctttt aacccctgtg ggccagcagg acattcatcc aaacatacct
9781 catggagtct cttcttcatc accagggtac tgaagtattt cctatgctgg gctctgtact
9841 gaaagaccct aagttcttct ccaaccccaa cgacttcaac ccaaagcact tcctagatga
9901 caagggacag tttaagaaga gtgatgcctt tgtgcccttt tccattggta agaacccact
9961 gccgagtgcc agaccactgc tcacaccaac agataagctt ctccattgct tctgtctgat
10021 gcagccaaat atttctctct attttggaat attctcttgc atctaactct cggctagtta
10081 gctgtagctc ccataattcc caagcaccat gtatctgaac cctggagacg tgatggtcaa
10141 actttagtga taggtgaagg gaatggaacc tcaaactaag ccagaggtct ttctcaggtc
10201 acacaaccag ttcttaaaaa ggatgtggcc acatggcagc cataccattc cctatagctg
10261 tagggaaagg tgtaaaagta ttttttaatc tcctgatgta aatgtctccg aggcttactg
10321 cctccctctg ctaacctagg cctagccctg gaagcttcta gcctctgaac aatcttatct
10381 agcctagaat gttttcagcc tctgagactt gctgctaaat aagctcacac cttcctagtt
10441 ccctctgatc tctggatggc tggttcaact cagctgtttt ggctcaaact cctctaacag
10501 attcagtctc tctctttgcc tctcctgagc tgctctactt ggctccatgc taactttggc
10561 aatctgttct gatcttctgt ctccttctca ttctctggct tgttctgtct tcacctgggc
10621 ctagctcgct ctctcagtaa cctgcctcta tataactgtc ccagttaaac tgcccccttt
10681 ctccatgttt ctctctcttc actgttctct ttgtctgcca ctcaattaga cgtcattttc
10741 aaacatgggt gctttcttct acaaactaac tttatcttca ttgtttggaa tttaaagtgt
10801 gtaccaaggg tttgcattcc agccagaggg attaaaggtt gtgtgctaag gctgagccac
10861 tccacaccta gaaacaggtt tttcagtaaa taacacaatc tcggggttca cagtgtgatc
10921 gaatatcctg caacagaagg gaatccacat ttctcattgt tacagcctgg tatggatcaa
10981 cccactacaa gtacaaatac aactgaggct caggagttag agcgttcctt aaacgtctca
11041 gcacttcctc ccatgaagaa tgcagccagg gatttgctat cggcaaaaac tggcttcacc
11101 ttcccctgtc ttgtctcttt aggaaaacgg tattgtttcg gggaaggact ggcaaggatg
11161 gaactctttc tcttcctcac aaacatcatg cagaacttct gcttcaaatc cccacaggca
11221 ccccaggaca tcgatgtgtc tcctagactt gtgggctttg ccacaatccc accaaactac
11281 actatgagtt tcttgtcccg ttgagccagc attgcgtgag agtaaaggga agaatggggg
11341 ttgaaactta gaaagggtcg gatgtactga agaaacagag gaaaaagagt agacctgctg
11401 aagatatata tttcaaaggt agagccagag aagggggaaa tatctactat actgtgaata
11461 ggaataataa aataataaaa gtagatatta tttatggcat aagtagtctt tgtcaagttt
11521 atgctaaggg gtcttacaat ctcacttaat actcagaagt gcatgggaag gggaagagag
11581 ttcatgctca ttttattagg gtcacattct tgatcttatg tttccctagt ctaagtttac
11641 caaagtctca cagaatccaa gagctcacta atatttggac aaagatctgt gttcattggc
11701 cttgggcaga ttttcaaata tcaccacctc tcagataaaa catgaaagct caagtctatt
11761 gccacatcat cctgggcaga tctttgggtg ggcaacctgc caggctctac cacttgcctg
11821 gccacactct gatctactta cagagaaaac tcagggcctg caccatgaca tgaatgtgtg
11881 ctgatttatg gatgcccacg taagtgtggt gcagggtttc ctcaagttat gcacgagtat
11941 attttctgtc atgagttggc ttccaaaggc caagtgtttg ctcacagata agagtcagtt
12001 aaagcagttg aaaagatgtg ctgtggtaca ctggtgtaaa tctggtgagt agatgcctgt
12061 gcgatgcagt ctcaacagca gtggacacag cagccagcac tttctaggct tcttactttt
12121 atgatgtcag tattagtcat acccaggagg aagggacaga tataaggata tcaaggtgac
12181 atctttgaga actgtattgc aagccctccc tggtagtaaa tttcaaccct gcaagctgca
12241 taagcttcct gggataatga agagaagaag gagggaacag gggtgaggat ggctggggag
12301 gggatggggt agagtaggga gaatgactca attagaaaag tgcttgccaa gcaataatga
12361 ggacccgagt ttggatcctc agcacccctc aaaagcccag cgtaatggca caagcctgtc
12421 accctgatgc tgtgaggtgc gtgaaaattg cctggtccct gaagatcact gggcagccca
12481 gtccagtcaa atggtgaact cgggatttag agaccctgtc tcaaagaaca aagtggagac
12541 aggaagatca acagagtcaa ctaacctggg ggctagctca ctagaccctt ggggctctca
12601 aagactgagc caccaaccaa ggtgcacacg tggcctggac agaggccccc agcacatacg
12661 tagcatatgt acatctcaat ctttatgtgg gttccccaac aactggagca ggagctatca
12721 taaagctatt gcctgtcagt ggaatccatt cccctaactg ggctgccttg tctcacctcc
12781 gtgggagagg gtgcacttag ccctacagag acttgatgca caaggaaggg aatataccca
12841 ggtgggcccc atcctctcag aagagaaggg gatgggagat gggggagagg tttgtaagga
12901 gggaacagga gttgggcatt gttcaggatg gaaaatgaat aaataaatta attaatgggg
12961 aaaactggag agaaactaaa gaagacatgt actgttgacc tgtggcctct acacacccat
13021 gtacacatgc acctgcatgc acatacaaaa atacacacac acacacacac acacacacac
13081 acacacacac acagactagc tcatttttag caactggcaa gaaagccttt gaaagtgagg
13141 gagggaaaaa aaactaacac attctttaat gctgaaaaac attaaaataa caaacaaatg
13201 aaaagctcct aaagacgtga cctgggctgc tgagacaggc cacattcaac caacgatgtt
13261 gcattacata gcagcagagc ttgcagcctt ctgaagggca tccgtgacct tcttcaggct
13321 aagctgtatg ttagggttca aagtttccca tcactttatg aatattagcc taaccagagt
13381 aactttgatg aacgagatgg ccagaaacca acagcaaagt caaaaccaca aaagccatgg
13441 gtatgtggaa attaagagac gtgccatcaa aattgtttct tttcaaataa aacttaggca
13501 tataccgagg tagtagagag acagttcaat ggttaaaagc acagaatgct catgcagagg
13561 acttgagttg ggtccctggc actcatgtct ggcagcctaa agtcagctgt aactaactaa
13621 ctaactaact aactccaggg gatccaatgc catcctctgg attccatggg gcctgcatgt
13681 atatatgcat atgcacacac acacacacac acacacacac acacaaagac acacacaaag
13741 atacacacaa agacacatga ctttaaaaat aagtcttttt aaaaatataa ctattaatga
13801 aattaatgaa gtgcaactaa tgttttccat acatgtataa attatgcaat gactgaatca
13861 tagcaacaaa aaatgtagac ttatacaaca gtggttcaaa gaaatattac atggtaaatc
13921 acaaaacact taaataatgg aacaaaaata aatacacaca gacttacagg atgagaacag
13981 ccacagcagg agatgcaggt aaactcatag catggttttt aaagaaatgt caaattcata
14041 aacggtgttt taaattttta cccactagat tgaaagactc actaaactga tgtttttttt
14101 tctctttttt gtgattatac atacttattt tactgggtat tttttattta catttcaaat
14161 gtgattccct ttcgaggttt cccggccata tgcactctat cacaactccc tccccttctt
14221 ctataagggt gttccctctc cccaaccacc ccaccttcct gccccccacc gacattcccc
14281 tgcactgggg gtccagcctc agcagggatc aaggg
//
LOCUS ACARRDX 119 bp ss-rRNA RNA 11-AUG-1990
DEFINITION A.castellani 5S ribosomal RNA.
ACCESSION M28984
KEYWORDS 5S ribosomal RNA.
SOURCE A.castellani ribosomal RNA from Viluy encephalomyelitis patient
KPN..
ORGANISM Acanthamoeba castellanii
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina;
Acanthamoebidae.
REFERENCE 1 (bases 1 to 119)
AUTHORS Chumakov,K.M. and Karavanov,A.S.
TITLE Preliminary characterization of an organism isolated from a case of
Viluy encephalomyelitis indicates a protozoal, rather than viral,
aetiology
JOURNAL J. Gen. Microbiol. 132, 1127-1133 (1986)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 1 119 5S ribosomal RNA
BASE COUNT 26 a 36 c 37 g 20 t
ORIGIN
1 ggatacggcc atactgcgca gaaagcaccg cttcccatcc gaacagcgaa gttaagctgc
61 gcgaggcggt gttagtactg gggtgggcga ccacccggga atccaccgtg ccgtatcct
//
LOCUS BMOCHORB 538 bp ss-mRNA INV 11-AUG-1990
DEFINITION B.mori early chorion protein mRNA, 3' end.
ACCESSION M36291
KEYWORDS early chorion protein; eggshell protein.
SOURCE B.mori, cDNA to mRNA, clone m2G12.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 538)
AUTHORS Eickbush,T.H., Rodakis,G.C., Lecanidou,R. and Kafatos,F.C.
TITLE A complex set of early chorion DNA sequences from Bombyx mori
JOURNAL Dev. Biol. 112, 368-376 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 482 early chorion protein precursor (AA at 3)
sigp < 1 44 early chorion protein (AA at 1)
matp 45 479 early chorion protein
signal 518 523 poly-A signal
BASE COUNT 104 a 128 c 159 g 147 t
ORIGIN
1 tcctcatctg tgcttcggca ttattcgtcc agtctgctgt cagtcaatat ctgagtcgcg
61 tcggctgtgg ttgtggaggg gttggcaatg gcttgggcta cggtggtatc ggctataacg
121 gctatgtcgg taacgatatc ggcgctgctg gagctctagg agcttccaat ggagggtgtc
181 tcaacgtcgt cagctcctct gccgctccta ctagtttagg tgtagcctct gagaactctt
241 acgagggaac tgtcggtgtc tgcggtaatc tgcccctctt gggtactgca atcgtcaccg
301 gcgagttctc tactggcggc cttggtggta ttaactacgg ctgcggtaat ggcgctgttg
361 ggatcaccgc tgaagatcgc gccggtatcg gctacgccgg cggcctcggc tacggatcag
421 gctacggact cggctacgga ggctacgccg gcaatggatg tggatgcggt ggcgcctatt
481 gaatgctaga atatatataa aaaaacacgt tttttttaat aaattccttt aaaaaaat
//
LOCUS BMODNA 586 bp ss-mRNA INV 11-AUG-1990
DEFINITION B.mori early chorion protein mRNA, 3' end.
ACCESSION M36290
KEYWORDS early chorion protein; eggshell protein.
SOURCE B.mori, cDNA to mRNA, clone m6F6.
ORGANISM Bombyx mori
Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia;
Bombycoidea; Bombycidae.
REFERENCE 1 (bases 1 to 586)
AUTHORS Eickbush,T.H., Rodakis,G.C., Lecanidou,R. and Kafatos,F.C.
TITLE A complex set of early chorion DNA sequences from Bombyx mori
JOURNAL Dev. Biol. 112, 368-376 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 483 early chorion protein precursor (AA at 1)
sigp < 1 39 early chorion protein (AA at 1)
matp 40 480 early chorion protein
signal 575 580 poly-A signal
BASE COUNT 117 a 133 c 166 g 170 t
ORIGIN
1 ttgtctggat gtgctctcgt ggctcagttc acagttggcc agtacatcgg ccgcgtgaac
61 aatggttgtg gatgcgggag tttcgacaac cgtggcctcg gttacaccgc tggctgtggt
121 ctcaccgctg ctagttctct tgcagcctcc catggaggag ggttattcgt cgtcacctcc
181 tctgccgcgc ctgctggtct cggcatagct tccgagaaca gatacgaagg cgctgtcgat
241 gtgtgcggca acattccatt cttgggcacc gctgatgtcg caggcgagtt ccccactgcg
301 ggcattggtg agatcaacta cggctgcggc gatggagcag tcgccattac cgctgaaggt
361 ggtctcggct acgctggagg acttgactac actggtggac tcggctatgc tagtggactt
421 ggctacggct taggctatgg agaatacgtt ggatgcagtt gtggttgtgg tgacatctac
481 tagaatggat attgttatat taatcttgta caagtatgta cattcaggtt gtaattttta
541 caagaattaa attttatttt tccgaataaa ctctggtata taattt
//
LOCUS BTHINSECT 977 bp ds-DNA BCT 11-AUG-1990
DEFINITION B.thuringiensis 27.3 Kd cytolytic insecticidal protein gene,
complete cds.
ACCESSION M35968
KEYWORDS 27.3 Kd cytolytic insecticidal protein.
SOURCE B.thuringiensis morrisoni (mosquitocidal isolate PG-14) DNA.
ORGANISM Bacillus thuringiensis
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 977)
AUTHORS Galjart,N.J., Sivasubramanian,N. and Federici,B.A.
TITLE Plasmid location, cloning, and sequence analysis of the gene
encoding a 27.3-kilodalton cytolytic protein from Bacillus
thuringiensis subsp. morrisoni (PG-14)
JOURNAL Curr. Microbiol. 16, 171-177 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept 67 816 27.3 Kd cytolytic insecticidal protein
BASE COUNT 349 a 161 c 149 g 318 t
ORIGIN
1 aagtttaaaa tcataattta atgttgaaag gccactattc taattaactt aaggagttgt
61 ttatttatgg aaaatttaaa tcattgtcca ttagaagata taaaggtaaa tccatggaaa
121 acccctcaat caacagcaag ggttattaca ttacgtgttg aggatccaaa tgaaatcaat
181 aatcttcttt ctattaacga aattgataat ccgaattata tattgcaagc aattatgtta
241 gcaaatgcat ttcaaaatgc attagttccc acttctacag attttggtga tgccctacgc
301 tttagtatgg caaaaggttt agaaatcgca aacacaatta caccgatggg tgctgtagtg
361 agttatgttg atcaaaatgt aactcaaacg aataaccaag taagtgttat gattaataaa
421 gtcttagaag tgttaaaaac tgtattagga gttgcattaa gtggatctgt aatagatcaa
481 ttaactgcag cagttacaaa tacgtttaca aatttaaata ctcaaaaaaa tgaagcatgg
541 attttctggg gcaaggaaac tgctaatcaa acaaattaca catacaatgt cctgtttgca
601 atccaaaatg cccaaactgg tggcgttatg tattgtgtac cagttggttt tgaaattaaa
661 gtatcagcag taaaggaaca agttttattt ttcacaattc aagattctgc gagctacaat
721 gttaacatcc aatctttgaa atttgcacaa ccattagtta gctcaagtca gtatccaatt
781 gcagatctta ctagcgctat taatggaacc ctctaatctt agtagctata tttattaaag
841 atggtaatat cacaagtata aatacttgtg gtattaccta ccattcttaa attatatcca
901 aaatcatgcg ttaatctaca ttcccctttc tctaaaattt gttcttcaca catccacatt
961 tttcgataaa acaatct
//
LOCUS ECOPYRI 133 bp ds-DNA BCT 11-AUG-1990
DEFINITION E.coli ATCase regulatory protein (pyrI) gene, 3' end.
ACCESSION M28578
KEYWORDS ATCase regulatory protein.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 133)
AUTHORS Cunin,R., Jacobs,A., Charlier,D., Crabeel,M., Herve,G.,
Glansdorff,N. and Pierard,A.
TITLE Structure-function relationship in allosteric aspartate
carbamoyltransferase from Escherichia coli: I. Primary structure of
a pyrI gene encoding a modified regulatory subunit
JOURNAL J. Mol. Biol. 186, 707-713 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 81 ATCase regulatory protein (pyrI) (EC 2.1.3.2)
signal 100 129 transcription termination signal
BASE COUNT 41 a 27 c 29 g 36 t
ORIGIN
1 cgaaaacgcg ccaatgatat cgcgctcaaa tgcaaatact gtgaaaaaga gttttcccat
61 aatgtggtgc tggccaatta attgcggttg gtaataaaag ttggctccct ataatgagcc
121 agacttttta ccg
//
LOCUS HUMBHD 1269 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human placental 17-beta-hydroxysteroid dehydrogenase mRNA, complete
cds.
ACCESSION M36263
KEYWORDS placental 17-beta-hydroxysteroid dehydrogenase.
SOURCE Human placenta, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 1269)
AUTHORS Peltoketo,H., Isomaa,V., Maeentausta,O. and Vihko,R.
TITLE Complete amino acid sequence of human placental 17-beta-
hydroxysteroid dehydrogenase deduced from cDNA
JOURNAL FEBS Lett. 239, 73-77 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 10 996 placental 17-beta-hydroxysteroid dehydrogenase
precursor (EC 1.1.1.62)
matp 13 993 placental 17-beta-hydroxysteroid dehydrogenase
mRNA < 1 1269 placental 17-beta-hydroxysteroid dehydrogenase
mRNA
BASE COUNT 220 a 378 c 437 g 234 t
ORIGIN
1 agtctcacca tggcccgcac cgtggtgctc atcaccggct gttcctcggg catcggcctg
61 cacttggccg tacgtctggc ttcagatcca tcccagagct tcaaagtgta tgccacgttg
121 agggacctga aaacacaggg ccggctgtgg gaggcggccc gggccctggc atgccctccg
181 ggatccctgg agacgttgca gctggacgta agggactcaa aatccgtggc cgctgcccgg
241 gaacgcgtga ctgagggccg cgtggacgtg ctggtgtgta acgcaggcct gggcctgctg
301 gggccgctgg aggcgctggg ggaggacgcc gtggcctctg tgctggacgt gaatgtagta
361 gggactgtgc ggatgctgca ggccttcctg ccagacatga agaggcgcgg ttcgggacgc
421 gtgttggtga ccgggagcgt gggaggattg atggggctgc ctttcaatga cgtttattgc
481 gccagcaagt tcgcgctcga aggcttatgc gagagtctgg cggttctgct gctgcccttt
541 ggggtccact tgagcctgat cgagtgcggc ccagtgcaca ccgccttcat ggagaaggtg
601 ttgggcagcc cagaggaggt gctggaccgc acggacatcc acaccttcca ccgcttctac
661 caatacctcg cccacagcaa gcaagtcttt cgcgaggcgg cgcagaaccc tgaggaggtg
721 gcggaggtct tcctcaccgc tttgcgcgcc ccgaagccga ccctgcgcta cttcaccacc
781 gagcgcttcc tgcccctgct gcggatgcgc ctggacgacc ccagcggctc caactacgtc
841 accgccatgc accgggaagt gttcggcgac gttccggcaa aggccgaggc tggggccgag
901 gctgggggcg gggccgggcc tggggcagag gacgaggccg ggcgcagtgc ggtgggggac
961 cctgagctcg gcgatcctcc ggccgccccg cagtaaaggc ttcctcagcc gctgtctccc
1021 gcgcccttct ttgtcccctg ggtctgtgtg gtccctgggg atggggcggc ggtagcagct
1081 gtgggtggct aattaagata gatcgcgtta gccagtttta ccagcgcagc taggcgcgat
1141 ggcgtcgcct gtaatgccag cgctttggga ggcggaggca ggaggatcgc tcaagccccg
1201 gagttggaga ccagccagag caacacagtg agacccccat ctctacaaaa ataaagaaaa
1261 tttaaaaat
//
LOCUS HUMC1Q 84 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human complement component 1, beta-chain (C1QB) mRNA, partial cds.
ACCESSION M36278
KEYWORDS complement component 1.
SOURCE Human liver, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 84)
AUTHORS Reid,K.B.M., Bentley,D.R. and Wood,K.J.
TITLE Cloning and characterization of the complementary DNA for the B
chain of normal human serum C1q
JOURNAL Philos. Trans. R. Soc. Lond., B, Biol. Sci. 306, 345-354 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 84 complement 1-q subcomponent beta-chain (AA at
1) /hgml_locus_uid="LR0054Y" /nomgen="C1QB"
/map="1p"
BASE COUNT 18 a 25 c 22 g 19 t
ORIGIN
1 gccaccgaca agaactcact actgggcatg gagggtgcca acagcatctt ttccgggttc
61 ctgctctttc cagatatgga ggcc
//
LOCUS HUMGAPDA 197 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase mRNA, 3' flank.
ACCESSION M36164
KEYWORDS glyceraldehyde-3-phosphate dehydrogenase.
SOURCE Human pancreatic adenocarcinoma cell line HPAF, cDNA to mRNA, clone
pII5B.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 197)
AUTHORS Schek,N., Hall,B.L. and Finn,O.J.
TITLE Increased glyceraldehyde-3-phosphate dehydrogenase gene expression
in human pancreatic adenocarcinoma
JOURNAL Cancer Res. 48, 6354-6359 (1988)
STANDARD simple staff_review
FEATURES from to/span description
signal 175 180 poly-A signal /hgml_locus_uid="LM0055R"
/nomgen="GAPD" /map="12p13"
BASE COUNT 50 a 69 c 46 g 31 t 1 others
ORIGIN
1 ccctggacca ccagccccag caagagcaca agaggaagag agagaccctc actgctgggg
61 agtccctgcc acactcagtc ccccaccaca ctgaatctcc cctcctcaca gttgccatgt
121 agaccccttg aagaggggag gggcctaggg agccgcacct tgtcatgtac catcaataaa
181 gtaccctgtg ctnaacc
//
LOCUS HUMGYPAA 445 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human glycophorin A mRNA, partial cds.
ACCESSION M36281
KEYWORDS glycophorin A.
SOURCE Human, cDNA to mRNA, clone pHGpA-3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 445)
AUTHORS Siebert,P.-D. and Fukuda,M.
TITLE Molecular biological study of the structure and expression of human
glycophorin A
JOURNAL Rev. Fr. Transfus. Immunohematol. 24, 251-266 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 445 glycophorin A precursor (AA at 2)
/hgml_locus_uid="LW0071X" /nomgen="GYPA"
/map="4q28-q31"
sigp < 1 67 glycophorin A signal peptide (AA at 1)
matp 68 > 445 glycophorin A
BASE COUNT 135 a 103 c 84 g 123 t
ORIGIN
1 tgatctcagg atgtatggaa aaataatctt tgtattacta ttgtcagcaa ttgtgagcat
61 atcagcatca agtaccactg gtgtggcaat gcacacttca acctcttctt cagtcacaaa
121 gagttacatc tcatcacaga caaatgatac gcacaaacgg gacacatatg cagccactcc
181 tagagctcat gaagtttcag aaatttctgt tagaactgtt taccctccag aagaggaaac
241 cggagaaagg gtacaacttg cccatcattt ctctgaacca gagataacac tcattatttt
301 tggggtgatg gctggtgtta ttggaacgat cctcttaatt tcttacggta ttcgccgact
361 gataaagaaa agcccatctg atgtaaaacc tctcccctca cctgacacag acgtgccttt
421 aagttctgtt gaaattgaga acccc
//
LOCUS HUMGYPCAA 915 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human glycophorin C mRNA, complete cds.
ACCESSION M36284
KEYWORDS glycophorin C; syaloglycoprotein.
SOURCE Human reticulocytes and fetal liver, cDNA to mRNA, clones pGCW5 and
pGCF22.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 915)
AUTHORS Cartron,J.-P., Colin,Y., Le Van Kim,C., Rahuel,C., Blanchard,D.,
Bloy,C. and London,J.
TITLE Structure of human erythrocyte glycophorin C deduced from cDNA
analysis
JOURNAL Rev. Fr. Transfus. Immunohematol. 24, 267-285 (1986)
STANDARD simple staff_review
COMMENT The first 6 nucleotides at the 5' end were not present in the cDNA
clones but where deduced from the known protein sequence.
FEATURES from to/span description
pept 1 387 glycophorin C /hgml_locus_uid="LM0084R"
/nomgen="GYPC" /map="2q14-q21"
mRNA < 1 915 glycophorin C mRNA
BASE COUNT 228 a 249 c 262 g 176 t
ORIGIN
1 atgtggtcga cgagaagccc caacagcacg gcgtggcctc tcagcctcga gcctgatccg
61 gggatggcct ctgcctccac cacaatgcat actaccacca ttgcagagcc tgatccaggg
121 atgtctggat ggccggatgg cagaatggag acctccaccc ccaccataat ggacattgtc
181 gtcattgcag gtgtgattgc tgctgtggcc atcgtcctag tctccctcct cttcgtcatg
241 ctgcgctaca tgtaccggca caagggcacg taccacacca atgaggccaa gggcacggag
301 tttgctgaga gtgcagatgc agccctgcag ggagaccctg ccctccaaga tgctggtgat
361 agcagcagaa aggagtactt tatttgaggg acaacagact tcacttccct gaatgcctcc
421 cccatctcca tcaggaaaaa tacaccccat cgcccagtat ccccgtcgat accaccagac
481 agagagagag agtacacttg atttcttccc gagatagtta cctagaaaca ctaggtgcct
541 gcccaaggag gaacggagga ggactcgcgc tacaagaggc cactcccagg gacccaggga
601 ggcgatggcc accccagagg ccaccttttg ctccacggag gtgggagaga atctgggcac
661 atggggcccc ctagggcagt gcaggacaac atcagctcac tggcaggaaa gtccttgttg
721 agggtgaggg ggtgctgggg tacccggggg ctggggaagc aaggaaataa gtcatctgta
781 tgctgactgg ggataatggc atcaatgtca gtccttgact ttggggggaa cagcaggtgc
841 cagagctaaa aggtaccttt gtctgccatt gatccagcta agaacgattg gaaataaatt
901 ggaaatgtaa ccgag
//
LOCUS HUMIGDD 102 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION M36090
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
AW.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 102)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 102 Ig H-chain VDJ-region (AA at 2)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 26 a 23 c 36 g 17 t
ORIGIN
1 gagagctgag gacacggctg tgtattactg tgcgaaagat cgagcagcaa cggacgggct
61 actactacgg tatggacgtc tggggccaag ggaccacggt ca
//
LOCUS HUMIGDE 100 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION M36091
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
AW.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 100 Ig H-chain V-D-J region (AA at 1)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 22 a 24 c 30 g 24 t
ORIGIN
1 cctgcagctg aactctgtaa ctcccgagga cacggctgtg tattactgtg caagaggact
61 gggtggcttt tgatatctgg ggccaaggga caatcgtcac
//
LOCUS HUMIGDF 101 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION M36092
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
AW.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 101)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 101 Ig heavy-chain V-D-J region (AA at 3)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 17 a 27 c 34 g 23 t
ORIGIN
1 ccgccgcgga cacggccgtg tattactgtg ccagggtagg atattgtact aatggtgttt
61 ccgggatggt tcgacccctg gggccaggga accctggtca c
//
LOCUS HUMIGDG 100 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION M36093
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
TD.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 22 a 34 c 25 g 19 t
ORIGIN
1 ccatcacact cctacctgca tggagcacct aaggcctcgg acaccgccat gtattactgt
61 gcgaccattg actactgggg ccagggaacc ctggtcacgg
//
LOCUS HUMIGDH 100 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION M36094
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
TD.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 23 a 30 c 24 g 23 t
ORIGIN
1 ccatcacact cctacctgca tggagcacct aaggcctcgg acaccgccat gtattactgt
61 gcgagggggg atggttacta ctactactac atggacgtct
//
LOCUS HUMIGDI 100 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION M36095
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
TD.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept.ps < 1 > 100 Ig heavy-chain V-D-J region (AA at 3)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 25 a 22 c 30 g 23 t
ORIGIN
1 aaatgaacag cctgagagct gaggacacgg ctgtgtattg aacccacggt ggtagctgct
61 actctgcttt tgatatctgg ggccaaggga caatggtcac
//
LOCUS HUMIGDJ 100 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION M36096
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
TD.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 24 a 23 c 30 g 23 t
ORIGIN
1 tctgagagcc gaggacacgg ctgtgtatta ctgtgcaaca gggccgtcac tggttattat
61 aaactactac tacggtatgg acgtctgggg ccaagggcca
//
LOCUS HUMIGDK 100 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION M36097
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
TD.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 2)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 26 a 24 c 33 g 17 t
ORIGIN
1 ggcgtatctg caaatgaaca gcctgaaaac cgaggacacg gccgtgtatt actgtactag
61 acgggacggt atggacgtct ggggccaagg gaccacggtc
//
LOCUS HUMIGDL 100 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION M36098
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene; variable region.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
TD.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 21 a 18 c 33 g 28 t
ORIGIN
1 acggccgtgt attactgtgc gagagatagg ggatttctac gagtagtggt ggtagctgct
61 actctgcttt tgatatctgg ggccaaggga caatggtcac
//
LOCUS HUMIGDM 100 bp ds-DNA PRI 11-AUG-1990
DEFINITION Human Ig rearranged heavy-chain gene D-J region, partial cds.
ACCESSION M36099
KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon;
processed gene.
SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient
TD.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 100)
AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
TITLE Continuing rearrangement but absence of somatic hypermutation in
immunoglobulin genes of human B cell precursor leukemia
JOURNAL J. Exp. Med. 168, 229-245 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 100 Ig heavy-chain D-J region (AA at 1)
/hgml_locus_uid="LR0220V" /nomgen="IGH@"
/map="14q32.33"
BASE COUNT 24 a 15 c 29 g 32 t
ORIGIN
1 ggtaggttta gaatgaggtc tgtgtcactg tggtattacg atattttgac tggttattat
61 aaactactac tacggtatgg acgtctgggg ccaagggcca
//
LOCUS LAMPYRIB 135 bp ds-DNA PHG 11-AUG-1990
DEFINITION Bacteriophage lambda mutant truncated regulatory protein (pyrI)
gene from E.coli, 3' end.
ACCESSION M28579
KEYWORDS ATCase regulatory protein.
SOURCE Bacteriophage lambda (strain lambda OTC3) DNA.
ORGANISM Bacteriophage lambda
Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
REFERENCE 1 (bases 1 to 135)
AUTHORS Cunin,R., Jacobs,A., Charlier,D., Crabeel,M., Herve,G.,
Glansdorff,N. and Pierard,A.
TITLE Struture-function relationship in allosteric aspartate
carbamoyltransferase from Escherichia coli: I. Primary structure of
a pyrI gene encoding a modified regulatory subunit
JOURNAL J. Mol. Biol. 186, 707-713 (1985)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 75 ATCase regulatory protein
BASE COUNT 54 a 24 c 22 g 35 t
ORIGIN
1 cgaaaacgcg ccaatgatat cgcgctcaaa tgcaaatact gtgaaaaaga gttttatact
61 aagttggcat tataaaaaag cattgcttat caatttgttg caagcaacag gtcactatca
121 gtcaaaataa aatca
//
LOCUS MUSBGALA 240 bp ss-mRNA ROD 11-AUG-1990
DEFINITION Mouse beta-1,4-galactosyltransferase mRNA, 5' end.
ACCESSION M36289
KEYWORDS beta-1,4-galactosyltransferase.
SOURCE Mouse, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 240)
AUTHORS Shaper,J.H., Hollis,G.F. and Shaper,N.L.
TITLE Evidence for two forms of murine beta-1,4-galactosyltransferase
based on cloning studies
JOURNAL Biochimie 70, 1683-1688 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 52 > 240 beta-1,4-galactosyltransferase (long form)
precursor
sigp 52 111 beta-1,4-galactosyltransferase (long form)
signal peptide (put.)
matp 112 > 240 beta-1,4-galactosyltransferase (long form)
pept 91 > 240 beta-1,4-galactosyltransferase (short form)
BASE COUNT 27 a 91 c 77 g 45 t
ORIGIN Chromosome 4.
1 gggccgtcct ctcagccgta gcccaccccc tcttaaagcc gcggcgggaa gatgaggttt
61 cgtgagcagt tcctgggcgg cagcgccgcg atgccgggcg cgaccctgca gcgggcctgc
121 cgcctgctcg tggccgtctg cgcgctgcac ctcggcgtca ccctcgtcta ttacctctct
181 ggccgcgatc tgagccgcct gccccagttg gtcggagtct cctctacact gcagggcggc
//
LOCUS MUSIGHADN 348 bp ds-DNA ROD 11-AUG-1990
DEFINITION Mouse rearranged Ig heavy-chain gene V-D-J3 region, partial cds.
ACCESSION M36103
KEYWORDS immunoglobulin heavy-chain; joining exon; processed gene;
variable region.
SOURCE Mouse hybridoma GA733 DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Caton,A.J.
TITLE Comparative sequences analysis of CO17-1A antigen-specific mono-
clonal antibodies
JOURNAL Hybridoma 5, 11-16 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 348 Ig heavy-chain V-J region (AA at 1)
recomb 294 303 V-region end/D-region start
recomb 303 304 D-region end/J3-region start
BASE COUNT 91 a 83 c 91 g 83 t
ORIGIN
1 caggttcagc tacagcagtc tggagctgag gtgatgaggc ctggggcctc agtgaagata
61 tcctgcaagg ctactggcta cacattcact aggtactaca tacaatgggg taaaaacaga
121 gggcctggac atggccttga gtggattgga gagattttac ctggaactct tactaattac
181 aatgagaaat tcaagggcaa ggccgcattc actgcagata gatcctccaa cacagcctac
241 atgcaactca gcagccttac atctgaggac tctgccgtct attactgtgc aagagatggt
301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS MUSIGHADO 348 bp ds-DNA ROD 11-AUG-1990
DEFINITION Mouse rearranged Ig heavy-chain gene V-D-J3 region, partial cds.
ACCESSION M36104
KEYWORDS immunoglobulin heavy-chain; joining exon; processed gene;
variable region.
SOURCE Mouse hybridoma CO17A-1A DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 348)
AUTHORS Caton,A.J.
TITLE Comparative sequences analysis of CO17-1A antigen-specific mono-
clonal antibodies
JOURNAL Hybridoma 5, 11-16 (1986)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 348 Ig heavy-chain V-J region (AA at 1)
recomb 294 303 V-region end/D-region start
recomb 303 304 D-region end/J3-region start
BASE COUNT 81 a 79 c 101 g 87 t
ORIGIN
1 caggtccagt tgcagcagtc tttagctgag ctggtaaggc ctgggacttc agtgaaggtg
61 tcctgcaagg cttctggata cgccttcact aattacttga tagagtgggt aaagcagagg
121 cctggacagg gccttgagtg gattggggtg attaatcctg gaagtggtgg tactaactac
181 aatgagaagt tcaagggcaa ggcaacactg actgcagaca aatcctccag cactgcctac
241 atgcagctca gcagcctgac atctgatgac tctgcggtct atttctgtgc aagagatggt
301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS RICGICR 524 bp ds-DNA PLN 11-AUG-1990
DEFINITION O.sativa Gly-tRNA gene.
ACCESSION M35993
KEYWORDS transfer RNA-Gly.
SOURCE O.sativa DNA.
ORGANISM Oryza sativa
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
Commelinidae; Cyperales; Poaceae.
REFERENCE 1 (bases 1 to 524)
AUTHORS Reddy,P.S. and Padayatty,J.D.
TITLE Effects of 5' flanking sequences and changes in the 5' internal
control region on the transcription of rice Gly-tRNA-GCC gene
JOURNAL Plant Mol. Biol. 11, 575-583 (1988)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 381 451 Gly-tRNA
anticdn 413 415 Gly-tRNA anticodon cgg
BASE COUNT 166 a 72 c 98 g 187 t 1 others
ORIGIN
1 gcgcatgttt gtttcagctt aagattattg taatntagat tatttaaatc aagattactt
61 taagtttgga ttataataag ctgatataaa ataagctaca agttgtttgt ttttctagat
121 tattaaaggc atctaagggt agtgggtttt tagccacttc aattaatctg gaaaaagctc
181 atatagaaaa gattattgga ttatagtaat ctggcttata gttataataa tctagcatca
241 ataatctact tgtttgtttc agcttacttc taataattta gtttataata attctgaaac
301 tgaattaaac agggtcttag acatatgagt tggatttgtg tagagctgaa attagtttac
361 tcgtagaatc aagaacgaaa gcaccagtgg tctagtggta gaatagtacc ctgccacggt
421 acagacccgg gttcgattcc cggctggtgc attattaatt tttggctctt ccattttaga
481 taaagacaat tggcaacttg atgcatttat gaacaaatct gcgc
//
LOCUS M27244 1413 bp ds-DNA UNA 11-AUG-1990
DEFINITION Figure 2. The sequence and predicted coding capacity of the
HOX-2.5 h14 cDNA.
ACCESSION M34857 M27244
REFERENCE 1 (bases 1 to 1413)
AUTHORS Bogarad,L.D., Utset,M.F., Awgulewitsch,A., Miki,T., Hart,C.P. and
Ruddle,F.H.
TITLE the developmental expression pattern of a new murine homeo box
gene: hox-2.5
JOURNAL Dev. Biol. 133, 537-549 (1989)
STANDARD unannotated staff_entry
COMMENT
FEATURES from to/span description
BASE COUNT 366 a 386 c 311 g 350 t
ORIGIN
1 gcgccggggg agctgctcaa acagggcacg cccgagtaca gtttggaaac ttcggcgggc
61 agggaggctg tcctgtctaa tcaaagagct ggctacgggg acaataaaat ttgcgaagga
121 agcgaggaca aagagaggcc gggtcaaacc aacccctctg ccaactggct gcacgcccgc
181 tcttcccgga aaaagcgctg tccctacacc aaataccaga cgctggagct agagaaggag
241 tttctgttca atatgtacct caccagggac cgcaggcatg aagtggccag actcctcaat
301 ctgagtgaga gacaagtcaa aatctggttt cagaaccggc ggatgaaaat gaagaagatg
361 aataacggag caggcaaaga gtaaagatga ccaccccccc cttccccagc tcactcttat
421 tatttatgtg atggtcaaaa agccactgct gtctgggtgt acccaacgtg agtggggaag
481 agtatctcct ctttaaaatc cctcatctgc acctggagcc tgtctcctct cctttgcgcc
541 tctctctctc tctctctctc tctctctctc tctctctctc tctctctcca ctctcctggt
601 gttaagagga aattttatta agttagaagc tagctatact tggttcctag aaaagtgatg
661 agcctcgctc cgaggaaaga gaccctagtc gagctcctag catccctgct aatttttctg
721 gaaaacccca gcccctctct tccagtctgc ctgttccctc cctgtaacct agccagacgc
781 cacccaggta cagtttaact gcacacagcc cagtggatgc ccacagacat agagtttgga
841 gtctatgcta atcatcgtgg agaaaggaca tcaagacgtt gttcatcaaa ccaagcaggg
901 cagctctcag atcaggttca ttttggttgt gccctgatcc aaacacacac taggaagatg
961 tctagagatg ccctcaatga ggcactccag tcccaaagtc tcaacatccc aatatgtatc
1021 ctatggaatt gccagctaga cccatatctg gggaggcctg tgggagatgt ggccattcct
1081 gtccccaagc tgttgtgagc tagaaagggc aagaaaaagc agggagtggt tttatgaagg
1141 taccattagt ggagtcccct cgtattttcc ctgcaggaag tgtttaattc tgcatgtttc
1201 ctttccttcc aactaaagga ggtcaaagtg ggttgtagag ccctgacagt gctgtttccc
1261 cattcataca ttcctatcct catttgacag aatgtaactt ttcttgctga ccacagtctc
1321 agatttctct ctatcctcca actttgaaat caactctcct ttccaccaaa cccatcttgc
1381 accagcaaag acaagtctcc cattgtaaaa aaa
//
LOCUS CHKBGLOA 168 bp ds-DNA VRT 11-AUG-1990
DEFINITION Chicken beta-3-globin gene, exon N.
ACCESSION M33204
KEYWORDS beta-3-globin.
SOURCE Chicken DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 168)
AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D.
TITLE Chromosomal arrangement of the chicken beta-type globin genes
JOURNAL Cell 24, 669-677 (1981)
STANDARD simple staff_review
FEATURES from to/span description
pept / 7 / 168 beta-3-globin, exon N (AA at 7)
IVS < 1 6 beta-3-globin intron N (no splice consensus)
BASE COUNT 37 a 58 c 38 g 35 t
ORIGIN
1 tacaggctgc tgatcgtcta cccctggacc cagaggttct ttgcgtcctt tgggaacctc
61 tccagcccca ctgccatcct tggcaacccc atggtccgcg cccacggcaa gaaagtgctc
121 acctcctttg gggatgctgt gaagaaccta gacaacatca agaacacc
//
LOCUS CHKBGLOB 90 bp ds-DNA VRT 11-AUG-1990
DEFINITION Chicken beta-1-globin gene, exon N.
ACCESSION M33205
KEYWORDS beta-1-globin.
SOURCE Chicken DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 90)
AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D.
TITLE Chromosomal arrangement of the chicken beta-type globin genes
JOURNAL Cell 24, 669-677 (1981)
STANDARD simple staff_review
FEATURES from to/span description
pept / 1 / 90 beta-1-globin, exon N (AA at 1)
BASE COUNT 22 a 29 c 20 g 19 t
ORIGIN
1 aggttctttg ataacttcgg gaacctctcc agccccaccg ccatcattgg taaccccaag
61 gtccgtgctc acggcaaaaa agtgctgagc
//
LOCUS CHKBGLOC 234 bp ds-DNA VRT 11-AUG-1990
DEFINITION Chicken beta-2-globin gene, exon N.
ACCESSION M33206
KEYWORDS beta-2-globin.
SOURCE Chicken DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 234)
AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D.
TITLE Chromosomal arrangement of the chicken beta-type globin genes
JOURNAL Cell 24, 669-677 (1981)
STANDARD simple staff_review
FEATURES from to/span description
pept / 7 / 228 beta-2-globin, exon N (AA at 7)
IVS < 1 6 beta-2-globin intron N (no splice consensus)
IVS 229 > 234 beta-2-globin intron N+1
BASE COUNT 53 a 68 c 65 g 48 t
ORIGIN
1 tgcaggctgc tgatcgtcta cccctggacc cagaggttct ttgcgtcctt tgggaacctg
61 tccagtgcca ccgccatcat tggcaacccc atggttcgcg cccacggcaa gaaagtgctg
121 agctcctttg gggaggctgt aaagaacctg gacaacatca agaagagctt tgcccagctg
181 agcaaactgc actgtgacaa gctgcacgtg gaccccgaga acttcagggt gagt
//
LOCUS CHKBGLOD 121 bp ds-DNA VRT 11-AUG-1990
DEFINITION Chicken beta-4-globin gene, exon N.
ACCESSION M33207
KEYWORDS beta-4-globin.
SOURCE Chicken DNA.
ORGANISM Gallus gallus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves;
Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE 1 (bases 1 to 121)
AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D.
TITLE Chromosomal arrangement of the chicken beta-type globin genes
JOURNAL Cell 24, 669-677 (1981)
STANDARD simple staff_review
FEATURES from to/span description
pept / 1 / 121 beta-4-globin, exon N (AA at 1)
BASE COUNT 19 a 43 c 29 g 30 t
ORIGIN
1 ctccttgggg acatcctgat catcgtcctg gcctcccact ttcccaggga tttcactcct
61 gcctgccaat ttgcctggca gaagctggtc aacgttgtgg cccatgccca ggcccgcaag
121 t
//
LOCUS ECOTRYA 87 bp ss-RNA RNA 11-AUG-1990
DEFINITION E.coli Tyr-tRNA.
ACCESSION M35994
KEYWORDS transfer RNA-Tyr.
SOURCE E.coli.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 87)
AUTHORS Doctor,B.P., Loebel,J.E., Sodd,M.A. and Winter,D.B.
TITLE Nucleotide sequence of Escherichia coli tyrosine transfer
ribonucleic acid
JOURNAL Science 163, 693-695 (1969)
STANDARD simple staff_review
FEATURES from to/span description
tRNA 1 87 Tyr-tRNA
anticdn 36 38 Tyr-tRNA anticodon rta
modified 8 8 p
modified 18 18 gm
modified 20 20 q
modified 41 41 p
modified 66 66 p
BASE COUNT 19 a 27 c 24 g 16 t 1 others
ORIGIN
1 gggtgggttc cccgagcggg caaagggagc agactrtata tctgccggtc acagacttcg
61 aaggttcgaa tccttcccca cccacca
//
LOCUS HUMMHDCBA 142 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human MHC class II HLA-DC-beta chain mRNA, 5' end.
ACCESSION M35982
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human, cDNA to mRNA, clone pII-beta-5.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 142)
AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A.
TITLE Signal sequences distinguish class II histocompatibility antigen
beta chains of different loci
JOURNAL Scand. J. Immunol. 19, 91-97 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 142 MHC HLA-DC-beta chain precursor (AA at 2)
sigp < 1 79 MHC HLA-DC-beta signal peptide
matp 80 > 142 MHC HLA-DC-beta
BASE COUNT 27 a 40 c 44 g 31 t
ORIGIN
1 aggggggatc cccggagacc ttcgggtagc aactgtcacc ttgatgctgg cgatcctgag
61 ctcctcactg gctgagggca gagactctcc cgaggatttc gtgtaccagt ttaagggcct
121 gtgctacttc accaacggga cg
//
LOCUS HUMMHDRAA 157 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human MHC class II HLA-DR-alpha chain mRNA, 5' end.
ACCESSION M35979
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human, cDNA to mRNA, clone pII-alpha-4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 157)
AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A.
TITLE Signal sequences distinguish class II histocompatibility antigen
beta chains of different loci
JOURNAL Scand. J. Immunol. 19, 91-97 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 8 > 157 HLA-DR-alpha chain precursor
sigp 8 82 HLA-DR-alpha chain signal peptide
matp 83 > 157 HLA-DR-alpha chain
BASE COUNT 43 a 31 c 40 g 43 t
ORIGIN
1 gaagaaaatg gccattagtg gagtccctgt gctaggattt ttcatcatag ctgtgctgat
61 gagcgctcag gaatcatggg ctatcaaaga agaacatgtg atcatccagg ccgagttcta
121 tctgaatcct gaccaatcag gcgagtttat gtttgac
//
LOCUS HUMMHDRBB 157 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human MHC class II HLA-DR-beta chain mRNA, 5' end.
ACCESSION M35980
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human, cDNA to mRNA, clone pII-beta-3.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 157)
AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A.
TITLE Signal sequences distinguish class II histocompatibility antigen
beta chains of different loci
JOURNAL Scand. J. Immunol. 19, 91-97 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 8 > 157 MHC HLA-DR-beta chain precursor
sigp 8 94 MHC HLA-DR-beta chain signal peptide
matp 95 > 157 MHC HLA-DR-beta chain
BASE COUNT 26 a 42 c 46 g 43 t
ORIGIN
1 ctccagcatg gtgtgtctga ggctccctgg aggctcctgc atggcagttc tgacagtgac
61 actgatggtg ctgagctccc cactggcttt ggctggggac accagaccac gtttcttgga
121 gtactctacg tctgagtgtc atttcttcaa tgggacg
//
LOCUS HUMMHDRBC 154 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human MHC class II HLA-DR-beta chain mRNA, 5' end.
ACCESSION M35981
KEYWORDS cell surface glycoprotein; class II gene;
integral membrane protein; major histocompatibility complex.
SOURCE Human, cDNA to mRNA, clone pII-beta-4.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 154)
AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A.
TITLE Signal sequences distinguish class II histocompatibility antigen
beta chains of different loci
JOURNAL Scand. J. Immunol. 19, 91-97 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 5 > 154 MHC HLA-DR-beta chain precursor
sigp 5 91 MHC HLA-DR-beta chain signal peptide
matp 92 > 154 MHC HLA-DR-beta chain
BASE COUNT 27 a 37 c 49 g 41 t
ORIGIN
1 cagcatggtg tgtctgaggc tccctggagg ctcctgcatg gcagttctga cagtgacact
61 gatggtgctg agctccccac tggctttggc tggggacacc agaccacgtt tcttggagga
121 ggttaagttt gagtgtcatt tcttcaacgg gacg
//
LOCUS MUSIAPA 494 bp ss-mRNA ROD 11-AUG-1990
DEFINITION Mouse intracisternal A particle mRNA, 5 'end.
ACCESSION M36322
KEYWORDS intracisternal A particle.
SOURCE Mouse endogenous A type retrovirus embryonal carcinoma cell line
PCC3, cDNA to mRNA, clone EC26.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 494)
AUTHORS Morgan,R.A., Christy,R.J. and Huang,R.C.C.
TITLE Murine A type retroviruses promote high levels of gene expression
in embryonal carcinoma cells
JOURNAL Development 102, 23-30 (1988)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 477 > 494 intracisternal A particle mRNA
BASE COUNT 111 a 124 c 125 g 134 t
ORIGIN
1 ctgcagctag aaaggctctg cggcatctct gagcctattc tagggagaca tgtcatcttt
61 caagaaggtt gagtgtccaa gtgtccttct ctgcaggcaa aacgacaagg gagcaggtca
121 gggttcctct gggtaaaagc ctgtgagcct aagagcctaa tcctgtacat ggctccttaa
181 cctacacatt ggggatttga cctctatctc cactctcatt aatatgggtg gcctatttgc
241 ctcttattaa aaggaaaggg ggagatgttg ggagccgccc ccacattcgc cgttacaaga
301 tggcgctgac agctgtgttc taagtggtaa acaaataatc tgcgcatgtg ccgagggtgg
361 ttctccactc catgtctctg ccttccccgt gacgtcaact cggccatggc tgcagccaat
421 cagggagtga cacgtcctag gcgaaggata attctcctta atagggacgg gttttcgttt
481 tttctctctc tctt
//
LOCUS MUSIAPB 492 bp ss-mRNA ROD 11-AUG-1990
DEFINITION Mouse intracisternal A particle mRNA, 5 'end.
ACCESSION M36323
KEYWORDS intracisternal A particle.
SOURCE Mouse endogenous A type retrovirus embryonal carcinoma cell line
PCC3, cDNA to mRNA, clone EC32.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 492)
AUTHORS Morgan,R.A., Christy,R.J. and Huang,R.C.C.
TITLE Murine A type retroviruses promote high levels of gene expression
in embryonal carcinoma cells
JOURNAL Development 102, 23-30 (1988)
STANDARD simple staff_review
FEATURES from to/span description
mRNA 475 > 492 intracisternal A particle mRNA
BASE COUNT 111 a 118 c 129 g 134 t
ORIGIN
1 ctgcagtaga aaggctctgc ggcatattga gcctattcta gggagacatg tcatctttca
61 agaaggttga gtgtccaagt gtccttctct gcaggcaaaa cgacacggga gcaggtcagg
121 gttgcctctg ggtaaaagcc tgtgagccta agagctaatc ctgtaatggc tccttaacct
181 acacattggg gatttgacct cttctccact ctcattaata tgggtggcct atttgcctct
241 tattaaaagg aaagggggag atgttgggag ccgcccccac attcgccgtt acaagatggc
301 gtgacagctg tgttctaagt ggtaaacaaa taatctgcgc atgtgccgag ggtggttctc
361 cactccatgt gctctgcctt cccggtgacg tcaactcggc catggctgca gccaatcagg
421 gagtgacacg tcctaggcga aggataattc tccttaatag ggacgggttt tatcgttttt
481 tctcgctctc tt
//
LOCUS PT4SB 122 bp ds-DNA PHG 11-AUG-1990
DEFINITION Bacteriophage T4 mutant Ser-tRNA suppressor gene.
ACCESSION M35988
KEYWORDS serine transfer-RNA.
SOURCE Bacteriophage T4 (strain R24) DNA.
ORGANISM Bacteriophage T4
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 122)
AUTHORS McClain,W.H.
TITLE Specific duplications fostered by a DNA structure containing
adjacent inverted repeat sequences
JOURNAL J. Mol. Biol. 204, 27-40 (1988)
STANDARD simple staff_review
FEATURES from to/span description
tRNA < 1 > 122 Ser-tRNA
BASE COUNT 25 a 35 c 36 g 26 t
ORIGIN
1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ccggagcgac
61 tgccggtctt taaaaccggc agtcgctccg gcgactcata ggttcaaatc ctatcgcctc
121 cg
//
LOCUS PT4SC 121 bp ds-DNA PHG 11-AUG-1990
DEFINITION Bacteriophage T4 mutant Ser-tRNA suppressor gene.
ACCESSION M35989
KEYWORDS serine transfer-RNA.
SOURCE Bacteriophage T4 (strain R66) DNA.
ORGANISM Bacteriophage T4
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 121)
AUTHORS McClain,W.H.
TITLE Specific duplications fostered by a DNA structure containing
adjacent inverted repeat sequences
JOURNAL J. Mol. Biol. 204, 27-40 (1988)
STANDARD simple staff_review
FEATURES from to/span description
tRNA < 1 > 121 Ser-tRNA
BASE COUNT 24 a 36 c 34 g 27 t
ORIGIN
1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ctccggcgac
61 tccggtcttt aaaaccggca gtcgctccgg cgactcatag gttcaaatcc tatcgcctcc
121 g
//
LOCUS PT4SEA 122 bp ds-DNA PHG 11-AUG-1990
DEFINITION Bacteriophage T4 mutant Ser-tRNA suppressor gene.
ACCESSION M35987
KEYWORDS serine transfer-RNA.
SOURCE Bacteriophage T4 (strain R51) DNA.
ORGANISM Bacteriophage T4
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 122)
AUTHORS McClain,W.H.
TITLE Specific duplications fostered by a DNA structure containing
adjacent inverted repeat sequences
JOURNAL J. Mol. Biol. 204, 27-40 (1988)
STANDARD simple staff_review
FEATURES from to/span description
tRNA < 1 > 122 tRNA-Ser
BASE COUNT 24 a 35 c 36 g 27 t
ORIGIN
1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ctccggcgac
61 tgccggtttt aaagaccggc agtcgctccg gcgactcata ggttcaaatc ctatcgcctc
121 cg
//
LOCUS VFARDNAA 1234 bp ds-DNA PLN 11-AUG-1990
DEFINITION V.faba ribosomal spacer DNA repeat.
ACCESSION M36318
KEYWORDS direct repeat; spacer DNA.
SOURCE V.faba DNA, clone VER 1.
ORGANISM Vicia faba
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 1234)
AUTHORS Kato,A., Yakura,K. and Tanifuji,S.
TITLE Repeated DNA sequences found in the large spacer of Vicia faba rDNA
JOURNAL Biochim. Biophys. Acta 825, 411-415 (1985)
STANDARD simple staff_review
FEATURES from to/span description
rpt 78 228 direct repeat copy A
rpt 1064 1123 direct repeat copy A
BASE COUNT 320 a 329 c 304 g 281 t
ORIGIN
1 agacaagtcc atgcaaagga ctaggcaaga acgcattgcg taggccatgg tttgcgttgc
61 ttccgcctca tggcccaccg acacgcgaaa agccgaaaaa catgaaggga tcgcccgaat
121 ggtacgaaac cttgcattcc atctcaattt ggtgtcagat taatggatga aaaaattcag
181 accggaactc attgcgtagc ccgacgttgg cttgctttcc gccccttgga gaaaccgcac
241 acgcacgcgt tccggtcgga aattgaaagt agaccgaggg cttgtcgaaa cgctacgaaa
301 ctccttggac actctcaatg tgttatttgg gaagtgcatg caaaaatcca tcgggagatt
361 attcccctaa ggcgcgtttg cctccgctcc cgtcggaaaa accgcacggg ggcccgttcg
421 ggccggaaat tcaaaattgt cgaaacgcta cgaaactttg cagatccctt ctatgtgtta
481 tttgggactt ccatgcaaaa attcatcggg agattattcc cctaaggcgc gtttgcctcc
541 gctcccgttc atggagaaac cgcacacgca cgcgttccgg tcggaaattg aaagtagacc
601 gagggcttgt cgaaacgcta cgaaactcct tggacactct caatgtgtta tttgggaagt
661 gcatgcaaaa atccatcggg agattattcc cctaaggcgc gtttgcctcc gctcccgtcg
721 gaaaaaccgc acgggggccc gttcgggccg gaaattcaaa attgtcgaaa cgctacgaaa
781 ctttgcagat cccttctatg tgttatttgg gacttccatg caaaaattga tcgggagatt
841 attcccctaa ggcgcgtttg cctccgctcc cgttcatgga gaaaccgcac acgcacgcgt
901 tccggtcgga aattgaaagt agaccgaggg cttgtcgaaa cgctacgaaa ctccttggac
961 actctcaatg tgttatttgg gaagtgcatg caaaaatcca tcgggagatt attcccctaa
1021 ggcgcgtttg cctccgctcc cgtcggaaaa accgcacggg ggcccgacac gcgaaaagcc
1081 gaaatatatc gagggatcgc ccgaatggca tgaaacttgg catgccatct caatttggtg
1141 gcagattaat gcatgaaaaa ttccagaccg gaactcattc cgtagcccga cgtttgtttc
1201 cttccgcccc ttgaaaaaaa ctcaattgca cgcg
//
LOCUS VFARDNAB 149 bp ds-DNA PLN 11-AUG-1990
DEFINITION V.faba ribosomal spacer DNA repeat.
ACCESSION M36319
KEYWORDS .
SOURCE V.faba DNA, clone VER 6.
ORGANISM Vicia faba
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceaea.
REFERENCE 1 (bases 1 to 149)
AUTHORS Kato,A., Yakura,K. and Tanifuji,S.
TITLE Repeated DNA sequences found in the large spacer of Vicia faba rDNA
JOURNAL Biochim. Biophys. Acta 825, 411-415 (1985)
STANDARD simple staff_review
BASE COUNT 39 a 40 c 35 g 35 t
ORIGIN
1 ccgacacgcg aaaagccgaa atatatcgag ggatcgcccg aatggcatga aacttggcat
61 gccatctcaa tttggtggca gattaatgca tgaaaaattc cgaccggaac tcattccgta
121 gcccgacgtt tgtttgcttc cgccccttg
//
LOCUS XEBEBNA1 185 bp ds-DNA VRT 11-AUG-1990
DEFINITION X.borealis external transcribed spacer and 18S rRNA gene, 5' end.
ACCESSION M36006
KEYWORDS 18S ribosomal RNA.
SEGMENT 1 of 2
SOURCE X.borealis DNA, clone pXbr101.
ORGANISM Xenopus borealis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 185)
AUTHORS Furlong,J.C. and Maden,B.E.H.
TITLE Patterns of major divergence between the internal transcribed
spacers of ribosomal DNA in Xenopus borealis and Xenopus laevis,
and of minimal divergence within ribosomal coding regions
JOURNAL EMBO J. 2, 443-448 (1983)
STANDARD simple staff_review
FEATURES from to/span description
rRNA 23 > 185 18S ribosomal RNA
site 1 22 external transcribed spacer
BASE COUNT 43 a 44 c 46 g 52 t
ORIGIN
1 ggttcccccc ggagccgagg gctacctggt tgatcctgcc agtagcatat gcttgtctca
61 aagattaagc catgcacgtg taagtacgca cggccggtac agtgaaactg cgaatggctc
121 attaaatcag ttatggttcc tttgatcgct ccatctgtta cttggataac tgtggtaatt
181 ctaga
//
LOCUS XEBEBNA2 1589 bp ds-DNA VRT 11-AUG-1990
DEFINITION X.borealis 18S rRNA gene, 3' end, 5.8S rRNA gene, complete,
ITS2, and 28S ribosomal RNA gene, 5 ' end.
ACCESSION M36008 M36010 M36012 M36014 M36016
KEYWORDS 18S ribosomal RNA; 28S ribosomal RNA; 5.8S ribosomal RNA.
SEGMENT 2 of 2
SOURCE X.borealis DNA, clone pXbr101.
ORGANISM Xenopus borealis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 1589)
AUTHORS Furlong,J.C. and Maden,B.E.H.
TITLE Patterns of major divergence between the internal transcribed
spacers of ribosomal DNA in Xenopus borealis and Xenopus laevis,
and of minimal divergence within ribosomal coding regions
JOURNAL EMBO J. 2, 443-448 (1983)
STANDARD simple staff_review
FEATURES from to/span description
rRNA < 1 1 18S ribosomal RNA
site 401 954 internal transcribed spacer 1 (ITS1)
rRNA 955 1116 5.8S ribosomal RNA
site 1117 1452 internal transcribed spacer 2 (ITS2)
rRNA 1453 > 1589 28S ribosomal RNA
BASE COUNT 287 a 560 c 546 g 196 t
ORIGIN About 1500 bp after segment 1.
1 gaattcccag taagtgcggg tcataagctc gcgttgatta agtccctgcc ctttgtacac
61 accgcccgtc gctactaccg attggatggt ttagtgaggt cctcggatcg gccccgccgg
121 ggtcggcaac ggccctggcg gagcgccgag aagacgatca aacttgacta tctagaggaa
181 gtaaaagtcg taacaaggtt tccgtaggtg aacctgcgga aggatcatta accgcccgtc
241 gctactaccg attggatggt ttagtgaggt cctcggatcg gccccgccgg ggtcggcaac
301 ggccctggcg gagcgccgag aagacgatca aacttgacta tctagaggaa gtaaaagtcg
361 taacaaggtt tccgtaggtg aacctgcgga aggatcatta acgagagagg gcgagaggct
421 cacggtgcga gagcgcggct ctgccgcacc cccgccaccg tcccggctcc ccccgcggac
481 gacgacaggt ccgggcgcgg gctcccgccc gccccgacac gaggagagag cggaaaagga
541 acaacaacaa aaaaaacgag cccgggtacc tagccggggg ggggcgaggg cggggacggg
601 gggcagaggg ccctccgacc ccccctccct cctcccgggc ggttcgaaga ccccggccca
661 cggccggcgg cggcgacgag gggcgagccg tccggaggcg gggggcgggc gcggaggggg
721 gacggggggg aggcggaggt cacggccgga cgcggccgcg tccctccgcc cgcccccccc
781 ccgtctgtcc gcgccccgcc ccgcgcgccg gtcgcgtcag tccccgccgc cccgtcggcg
841 gcgccccggg gcgcccgggg cccgggggcc cgccccgtcc agcttccacg gcgcccgcgc
901 gcgccggtac ccctgccgcg aggcaacgag agagaaaaac aaacgaaaaa aatctcgcga
961 ctcttagcgg tggatcactc ggctcgtgcg tcgatgaaga acgcagctag ctgcgagaat
1021 tagtgtgaat tgcaggacac attgatcatc gacacttcga acgcaccttg cggccccggg
1081 ttcctcccgg ggctacgcct gtctgagggt cgctccgacg tccatcgccc ccgccgggtc
1141 cccgacccgg ggaaaacggg cggcgcggct ggggccgtcg caggggtcga gggacgcccc
1201 cccaccgccc ttccggggga ggtgaggggg ccccgcgacc ccttcgtccc cccaaggcca
1261 gaccccgacc atcccggcgg ccccccccag ggccccggcg cggctgtctg tgggaaaccg
1321 atttcacggc tgccggcgcc gcgggcgctg gggacccgcc ggggcgggaa gagcccccgg
1381 gcgggccggg gccggggaag gcgcgcttga acccgcgccc ccccgcgtcc cccgcccgcc
1441 cccccgctcg actcagacct cagatcagac gtggcgaccc gctgaattta agcatattac
1501 taagcggagg aaaagaaact aaccaggatt cccccagtaa cggcgagtga agagggaaga
1561 gcccagcgcc gaatccccgc ccgcccggc
//
LOCUS BOVTGFB 1117 bp ss-mRNA MAM 11-AUG-1990
DEFINITION Bovine transforming growth factor-beta-1 (TGF beta-1) mRNA,
3' end.
ACCESSION M36271
KEYWORDS transforming growth factor-beta-1.
SOURCE Bovine fibropapilloma, cDNA to mRNA, (library of Okayama and Berg).
ORGANISM Bos taurus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE 1 (bases 1 to 1117)
AUTHORS Van Obberghen-Schilling,E., Kondaiah,P., Ludwig,R.L.,
Sporn,M.B. and Baker,C.C.
TITLE Complementary deoxyribonucleic acid cloning of bovine transforming
growth factor-beta-1
JOURNAL Mol. Endocrinol. 1, 693-698 (1987)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 950 transforming growth factor-beta-1 precursor
(AA 75 at 3)
matp 612 947 transforming growth factor-beta-1
BASE COUNT 244 a 379 c 301 g 193 t
ORIGIN
1 aggccatact ggccctttac aacagtaccc gcgaccgggt ggccggggaa agtgccgaaa
61 cggagcctga gccagaggcg gactactacg ccaaggaggt cacccgcgtg ctaatggtgg
121 aatacggcaa caaaatctat gacaaaatga agtctagctc gcacagcata tatatgttct
181 tcaacacgtc cgagctccgg gaagcggtgc ccgaacctgt gttgctctct cgggcagacg
241 tgcgcctgct gaggctcaag ttaaaagtgg agcagcacgt ggagctgtac cagaaatata
301 gcaacaattc ctggcgctac ctcagcaacc ggctgctcgc ccccagcgac tcaccggagt
361 ggctgtcctt tgacgtcact ggagttgtgc ggcagtggct gacccgcaga gaggaaatag
421 agggctttcg cctcagtgcc cactgttcct gtgacagtaa agataacacg cttcaagtgg
481 acattaacgg gttcagttcc ggccgccggg gtgacctcgc caccattcac ggcatgaacc
541 ggcccttcct gctcctcatg gccacccctc tggagagggc ccagcacctg cacagctccc
601 gccaccgccg agccctggac accaactact gcttcagctc cacagaaaag aactgctgtg
661 ttcgtcagct ctacattgac ttccggaagg acctgggctg gaagtggatt catgaaccca
721 aggggtacca cgccaatttc tgcctggggc cctgccctta catctggagc ctggatacac
781 agtacagcaa ggtcctggcc ctgtacaacc agcacaaccc gggcgcttcg gcggcgccgt
841 gctgcgtgcc tcaggcgctg gagcccctgc ccatcgtgta ctacgtgggc cgcaagccca
901 aggtggagca gttgtccaac atgatcgtgc gctcctgcaa gtgcagctga ggccccgtcc
961 caccccaaca gcccccgccc cgtagccccg ccacccggca gcccggcccc gcccccgccc
1021 gcctcaccag gactgtattt aaggacacag caccaccccc ccccccccac tcccatcaag
1081 cccacctggg gtccattaaa ggtggcgaga ggaaaaa
//
LOCUS FSBPROLA 1148 bp ss-mRNA VRT 11-AUG-1990
DEFINITION O.keta (chum salmon) prolactin mRNA, complete cds.
ACCESSION M36267
KEYWORDS prolactin.
SOURCE O.keta pituitary gland, cDNA to mRNA.
ORGANISM Oncorhynchus keta
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes;
Actinopterygii; Salmoniformes; Salmonoidei; Salminidae.
REFERENCE 1 (bases 1 to 1148)
AUTHORS Song,S., Trinh,K.-Y., Hew,C.L., Hwang,S.-J., Belkhode,S. and
Idler,D.R.
TITLE Molecular cloning and expression of salmon prolactin cDNA
JOURNAL Eur. J. Biochem. 172, 279-285 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 19 654 prolactin precursor
sigp 19 87 prolactin signal peptide
matp 88 651 prolactin
mRNA 1 1148 prolactin mRNA
BASE COUNT 319 a 284 c 239 g 306 t
ORIGIN
1 gtccgaaaag aaggaaagat ggctcgccga tcccagggta ccaaactcca cttagcagtt
61 ctgtgtctag ttgtgtcctg tcatgccatt ggccttagtg acctaatgga gagagcttcc
121 cagcgatcag acaagcttca ctcactcagc acttccctca ccaaggacct ggactctcac
181 ttcccaccaa tgggacgagt gatgatgcca cgcccgtcta tgtgtcacac ctcctcgctc
241 cagacaccca aggacaagga gcaagcactc aaggtatcgg agaatgagct gatctccctg
301 gctcgctacc tcctcctggc ctggaacgat cccctgctgc tgctctcctc agaggcgccg
361 actctgcccc acacgccgtc caatggcgac atcagcagta agatcaggga actgcaggac
421 tactccaaga gcctgggaga tggactggac ataatggtca acaagatggg accctcctcc
481 cagtacattt cttcaatccc cttcaagggt ggagacctcg gcaatgacaa gacctcccgc
541 ctcatcaact tccacttcct catgtcttgc ttccgcaggg actcccacaa aatcgacagt
601 ttcctcaagg tccttaggtg ccgggctaca aatatgcgac cagaaacatg ttaggagaaa
661 atggcaggat tttggttctg gattgttcca ttttcaaact gatagtcaaa atggggtagg
721 aattcaagag agtgattgtc ttgttgtagt ttgattttgt gaaataacta atgctgccac
781 ctacatcaca tttggactat tcatagatct atacagttag tagtcatcct ggttatctga
841 taccgacatg tttacctata catatcttta taggtagttt atgttcggta atgcattatg
901 catcagaaaa atgatgcaac ctaagtattg gtcacttata tgatcatatt caatagattc
961 atatatagat aacaaaggat cctcctagtc attcttagta gatatacagg atggctgata
1021 ggtagatgcc atgaaccgtc atgattaaca atctagaact tgaatatagg taactcagcc
1081 tttgtaaact attgataatt gcttccaaga gttctgttta cacaataaat caagtaagat
1141 tttgcaat
//
LOCUS PETIRAB 183 bp ds-DNA ORG 11-AUG-1990
DEFINITION P.hybrida chloroplast rps19' and rp12 genes, partial cds.
ACCESSION M35955
KEYWORDS inverted repeat; rp12; rps19'.
SOURCE Petunia hybrida chloroplast DNA.
ORGANISM Chloroplast Petunia hybrida
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae; Petunia hybrida.
REFERENCE 1 (bases 1 to 183)
AUTHORS Aldrich,J., Cherney,B.W., Williams,C. and Merlin,E.
TITLE Sequence analysis of the junction of the large single copy region
and the large inverted repeat in the petunia chloroplast genome
JOURNAL Curr. Genet. 14, 487-492 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept > 183 143 (c) rp12 gene product (AA at 181)
pept 91 < 1 (c) rps19' gene product (gtg start codon)
BASE COUNT 58 a 28 c 26 g 71 t
ORIGIN
1 cgcccatacc ataattcaga attcatcaaa tattagataa atttaataaa tgattggcta
61 caaaggggtt tttttttagt gaacgtgtca caagcttact cctatttttt ttttttttgt
121 aaagtttttg taaaacgaag aattaaattc gatttcctct cctatttact atggcgacga
181 aga
//
LOCUS PETIRABX 162 bp ds-DNA ORG 11-AUG-1990
DEFINITION P.hybrida chloroplast rps19 and rp12 genes, partial cds.
ACCESSION M37322
KEYWORDS inverted repeat; rp12; rps19.
SOURCE Petunia hybrida chloroplast DNA.
ORGANISM Chloroplast Petunia hybrida
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Asteridae; Solanales; Solanaceae; Petunia hybrida.
REFERENCE 1 (bases 1 to 162)
AUTHORS Aldrich,J., Cherney,B.W., Williams,C. and Merlin,E.
TITLE Sequence analysis of the junction of the large single copy region
and the large inverted repeat in the petunia chloroplast genome
JOURNAL Curr. Genet. 14, 487-492 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 41 rp12 gene product (AA at 3)
pept 92 > 162 rps 19 gene product (gtg start codon)
BASE COUNT 70 a 26 c 21 g 45 t
ORIGIN
1 tcttcgtcgc catagtaaat aggagaggaa atcgaattta attcttcgtt ttacaaaaac
61 ttacaaaaaa aaaaaaaata ggagtaagct tgtgacacgt tcactaaaaa aaaacccctt
121 tgtagccaat catttattaa ataaaattga taagcttaac ac
//
LOCUS RATATROPO 1633 bp ds-DNA ROD 11-AUG-1990
DEFINITION Rat alpha-tropomyosin and N5 genes, 5' flank.
ACCESSION J05467
KEYWORDS alpha-tropomyosin.
SOURCE Rat (Sprague-Dawley) adult liver DNA.
ORGANISM Rattus norvegicus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1633)
AUTHORS Herrera,V.L.M. and Ruiz-Opazo,N.
TITLE Regulation of alpha-tropomyosin and N5 genes by a shared enhancer
JOURNAL J. Biol. Chem. 265, 9555-9562 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
N.Ruiz-Opazo, 04-JUN-1990.
FEATURES from to/span description
signal 569 772 enhancer
BASE COUNT 424 a 509 c 405 g 294 t 1 others
ORIGIN 1 bp upstream of HindIII site.
1 aagcttaaag agatgggggg ggggggtgtc ctcctgcctt cagtgcagac ctggctggcc
61 cctttccctc tgcagagttc accctacgga ttgaaccatt tccgtagcta atttgtagca
121 tgttttcagc accagcaccc aagtcgctaa cagagaaaag aaggacnccc accccccacc
181 cccaatagat ccgagcaagg aaaaaaatga aaagctagat ctgaagggat tttaggaatc
241 cactaccctc tgaatttaga tatgaggaaa cctgaactaa caagatggat ggctcagcag
301 gtaacggagc ttgctatcaa gctcaaagac ctgtgcacgg atcgaacacc cacatggtag
361 aaacaactga ttcttaaaga gttgctctct gaccgccaca gcaccaccac gccacgccca
421 cgcccctata agtaaacgac taaagtaagg gaagctacag catggaagct gtttctaatc
481 acagctacac aatgtagctg ttagctagct aatgactgaa gaccagaccc ctcccactcc
541 ccacgatgtt aactgagtct aaactcaaca cacacacaca cacacacaca cacacacaca
601 cacacacctt ggcacactcc ttctaccctt tccaaagcac agcacaggca cataactcag
661 caaaccacac acacacacac acacacacac acacacacac acacacacac acaccttggc
721 acactccttc taccctttcc aaagcacagc acaggcacat aactcagcaa accgggccct
781 aagttcctaa tgtattttac tgcccacctc cagcttgggt gaaacgattg ggagacttcc
841 tgtgtgtatc attatgccca ccatcacggg cagagagaca gggcctcagg aatggccccc
901 acaccttcct ccacctctat ttctctgcgt gggttgaaat ttccagactg tgagagccca
961 ggactatgac tggtgggtgg actcagagct gctagaagga cgcagctagg tactgagaac
1021 tcccaggccc agatcgaagg tgcggtggct gtgactggca gcccccggac cttacagtga
1081 gcaccccagg acccggaggc caagtgagat ggtgcgcaca caccacctgc cctgttcccg
1141 ggattccctg tgactcccga cagactcttc tggggaccct gaccgccagg cgacggctct
1201 gggaggtggc tagagactcc tcgccagtac cgggctcctc tccagcgagg tccgggagct
1261 ctttcccagg cgtgcaggtt tagaaattaa ggggagcctc gcccggccgc ggtctcggag
1321 gctcgggtcc ccatatatag tcatatccac cgtcgactgg gagcggcagg cggcagcgaa
1381 tggcgtgagg ccccgcggga ggagcgagga gggggcatgg ggcggaggca ggagactggg
1441 aaagggggaa ggaggaacaa gcttttccaa aaaagtattg gctgacctaa ggaatgcggt
1501 cgccccctgg gaaagtacat atcagggagc agcaggcacg tccgcgctcg caccttccgg
1561 cctctgccac cctaccgcgc tcgcgtctcc tctggctcct ctcgctaagg gcccctcgcc
1621 accgccacca tgg
//
LOCUS STMXYLI 1201 bp ds-DNA BCT 11-AUG-1990
DEFINITION S.violaceoniger xylose isomerase gene, complete cds.
ACCESSION M36269
KEYWORDS glucose isomerase; xylose isomerase.
SOURCE S.violaceoniger DNA.
ORGANISM Streptomyces violaceoniger
Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE 1 (bases 1 to 1201)
AUTHORS Drocourt,D., Bejar,S., Calmels,T., Reynes,J.P. and Tiraby,G.
TITLE Nucleotide sequence of the xylose isomerase gene from Streptomyces
violaceoniger
JOURNAL Nucleic Acids Res. 16, 9337-9337 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 24 1193 xylose isomerase
BASE COUNT 187 a 455 c 402 g 157 t
ORIGIN
1 cgtacggaac aaggagcagc gccatgagct tccagcccac ccccgaggac aagttcacct
61 tcggtctgtg gaccgtcggc tggcagggaa gggacccgtt cggcgacgcc acccgccctg
121 ccctcgaccc ggtcgagacc gtgcagcgcc tggccgagct gggcgcctac ggagtgacct
181 tccacgacga cgacctgatc cccttcgggt cgtccgacac cgagcgcgag tcgcacatca
241 agcggttccg ccaggccctg gacgccactg gcatgacggt gccgatggcc accacgaacc
301 tcttcaccca ccccgtcttc aaggacggcg gtttcaccgc caacgaccgc gacgtgcgcc
361 gctacgcgct gcgcaagacg atccgcaaca tcgacctggc ggccgagctg ggcgccaaga
421 cgtacgtcgc ctggggcggc cgtgagggcg ccgagtccgg tggcgccaag gacgtgcgcg
481 acgccctcga ccgcatgaag gaggcgttcg acctcctcgg cgagtacgtc accgcccagg
541 gctacgacct ccgcttcgcc atcgagccca agcccaacga gccccgcggc gacatcctcc
601 tgcccaccgt cggccacgcc ctggccttca tcgagcgcct ggagcgcccg gagctgtacg
661 gcgtcaaccc ggaggtcggc cacgagcaga tggccggcct gaacttcccg cacggcatcg
721 cgcaggccct gtgggcgggc aagctcttcc acatcgacct caacggccag tccggcatca
781 agtacgacca ggacctgcgg ttcggcgccg gcgacctgcg ggcggcgttc tggctggtcg
841 acctcctgga gagcgccggt tacgagggcc cgcggcactt cgacttcaag ccgccgcgga
901 ccgaggactt cgacggcgtg tgggcctcgg ccgagggctg catgcgcaac tacctgatcc
961 tcaaggagcg cgcggccgcc ttccgcgccg acccggaggt gcaggaggcc ctgcgcgccg
1021 cgcgtctgga ccagctggcc cagccgaccg cggcggacgg cctggaggcc ctgctcgccg
1081 accgcaccgc gttcgaggac ttcgacgtgg aggcggccgc cgcgcgcgcg gcatggccgt
1141 tcgaacgcct cgaccagctg gcgatggacc acctgctggg cgcgcgcggc tgacggtcgc
1201 g
//
LOCUS BNAALS 3315 bp ds-DNA PLN 11-AUG-1990
DEFINITION B.napus chloroplast acetolactate synthase (ALS) gene, complete cds.
ACCESSION M20498
KEYWORDS acetolactate synthase.
SOURCE B.napus (cv. Westar) 3 week old leaf DNA.
ORGANISM Brassica napus
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 3315)
AUTHORS Wiersma,P.A., Schmiemann,M.G., Condie,J.A., Crosby,W.L. and
Moloney,M.M.
TITLE Isolation, expression and phylogenetic inheritance of an
acetolactate synthase gene from Brassica napus
JOURNAL Mol. Gen. Genet. 219, 413-420 (1989)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by W.L.Crosby, 06-DEC-1988, for release after publication.
FEATURES from to/span description
pept 784 2697 acetolactate synthase (ALS) precursor (EC
4.1.3.18)
sigp 784 1011 acetolactate synthase (ALS) signal peptide
matp 1012 2694 acetolactate synthase (ALS)
signal 2111 2116 transcriptional stop sequence (put.)
signal 446 459 promoter element (put.)
signal 665 668 CAAT box
signal 705 709 TATA box
signal 2894 2899 polyA signal
BASE COUNT 954 a 712 c 702 g 947 t
ORIGIN
1 tctagatata aaagtatttt attacttaaa aagtatattt ttttgttaat taaagatatc
61 ttttagatgg actattacta ttttgtgaac tttctttttt ttctgaaatc atattatata
121 tacatatttt tttgttttta aattattttt taaaacttta taaactttcc ttttttattt
181 atattgttat ttgaaaaatt ttaagaaagt aaagaataat atttaaatgt acttttttta
241 tttcattagg gatatcatag taatcaacca ccgtgagagt taacgtgagt gcgacaccta
301 aaaaagtgac ttctcaaata atattatgga gatgtatcta caagaaaaat aaacattgtt
361 gtgtaaattt tgaataaatg actgagggca tccacatggt aaaatttact aagagtatct
421 gaacaaaaat attaaaataa aaataaaaag aaagaaagag aaatcagaaa acaagagaag
481 ccattggtta agaagagact cacacattct tttgacagag tttttctacg gatgacattt
541 attgaatgag acaacaaatt ttacagcatt aagttttatt catctaatta acctattaat
601 aattttatgt taagagacat ggataaagaa acttattgct gcagacgctc tgagatgggg
661 ttggcaatat tagttacgag ccaacttgaa gtgtcacatg catgtataaa ttttagttct
721 ctctcacaga gtttatttac tcattcaaat taagcaattt ctcgcaacac tccattttgc
781 accatggctt cgttttcgtt cttcggcacc attccgtcgt ctcccacaaa agcttccgtc
841 ttctccctgc cggtgtcggt aactacgctc ccgtccttcc cgcgccgccg tgctactcgt
901 gtctccgttt ccgccaactc gaagaaagac caagaccgca cagcttcacg tcgagagaat
961 ccgagcacat tcagctccaa atacgctccc aacgtgcccc gcagtggcgc agacatcctg
1021 gtcgaagccc tggagcgtca aggagtggac gtagtcttcg cttacccagg aggcgcatca
1081 atggagatcc atcaagccct aactcgctcc aacacaatcc gaaacgtcct tccccgtcac
1141 gaacaaggag gtatcttcgc cgccgagggt tacgctcgtt cctccggtaa acccggaatc
1201 tgcatcgcca cttccggtcc aggagctatg aatctcgtca gcggattagc cgacgccctg
1261 tttgacagcg tacccctcat cgcaatcaca ggacaggtcc ctcgccggat gattggtacc
1321 atggcgttcc aggagacacc cgttgttgag gtaacgagga ctataacgaa acataactat
1381 cttgttatgg aagttgatga tatacctagg atcgttcgag aagctttctt tctagctact
1441 tcggttagac cgggaccggt tcttatagac gtccccaaag atgttcagca acagtttgcg
1501 attcctaact gggaacagcc tatgcgctta cctctttaca tgtctacgat gcctaaaccc
1561 cccaaagttt ctcacttaga gcagattctt aggttggttt cggagtctaa gaggcctgtc
1621 ttgtacgttg gaggtggttg tctgaactcg agtgaggaac tgcgcagatt tgtggaactt
1681 actggcatcc ctgttgctag tacgttcatg ggacttggat cgtatccttg tgacgatgaa
1741 gagttctctc tgcaaatgct aggaatgcat ggaacagtgt acgctaatta cgctgtcgag
1801 tatagcgatc ttctgcttgc ttttggggtt aggtttgacg accgtgtgac cggaaagctt
1861 gaggcctttg ctagccgggc caagatcgtg catattgata ttgattctac cgaaatcggg
1921 aagaacaaga cacctcatgt gtcggtgtgt tgtgatgttc agctagcctt gcaagggatg
1981 aacgaggttc ttgagaaccg acgagatgtg cttgacttcg gggaatggag atgtgaattg
2041 aacgaacaga gactaaagtt ccctctccgc tacaagacgt ttggggaaga gattcctcca
2101 cagtacgcca ttcaactact tgacgagcta accgacggga aggcaattat cactactggt
2161 gtcgggcaac accagatgtg ggccgcccaa ttctacagat tcaagaaacc ccgccaatgg
2221 ctgtcttcag gaggcctagg agccatgggg ttcggtcttc ctgcagccat gggagccgct
2281 atagccaacc cgggagcagt ggttgtcgac attgatgggg atggtagctt catcatgaac
2341 attcaagaac tggcaaccat cagggttgag aatcttccag tcaaggtttt gctgattaat
2401 aatcagcacc tcggaatggt ccttcagtgg gaagaccact tctacgcagc taacagagcc
2461 gattcttttc tgggagaccc ggcgaaccct gaggcggtat tcccggatat gctgttgttc
2521 gccgcatcgt gcggtatacc agccgccagg gtcaccagaa gggaggacct ccgagaggca
2581 atccagacaa tgctggacac acctggacca ttcttgttgg atgtggtctg tcctcaccag
2641 gaccatgtgt taccactcat ccctagtggc ggcaccttca aggacattat tgtgtagtac
2701 tactttaatg tttgattatt ctatgctctc ttctgttttg taataacaaa gtaaaaccag
2761 tttgcaatat cagaagatct tctgattatc aatatcatgc tactctcata atgaaaaacg
2821 caaacttcac tctttattta tatgagaata ataactgtag tgtttacata acaatttata
2881 agtatgaaag actaataaaa aaaaagttat atacttgata aaaaggcaga agtaagttta
2941 caaatgtagt aggccatcat aacgatctga taaggaattt ttgtataggg actaatacct
3001 ctatgaagga gaacatgtat cgcaacgagt tgacatatgt gtgaacctca gatggactcg
3061 atgttaagtt ctacgttcct gcatgaaaaa aaaaaaacat acaatttttt tcaagcatgg
3121 gtaaccttat tgaatacgaa caatccagac gaaaatgggg caaaactacc ttgagatctg
3181 cacctgaaca gaaaactcca ggaaccagac ctgtgatcat cacaactcta aaagctgaac
3241 tatcctgctg tatggtctca aatgtgttct gcatctcctt atcaatagca tttttcggct
3301 actggcctgt ctaga
//
LOCUS HUM5AR 2102 bp ss-mRNA PRI 11-AUG-1990
DEFINITION Human steroid 5-alpha-reductase mRNA, complete cds.
ACCESSION M32313
KEYWORDS dihydrotestosterone; steroid 5-alpha-reductase.
SOURCE Human adult prostate, cDNA to mRNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2102)
AUTHORS Andersson,S. and Russell,D.W.
TITLE Structural and biochemical properties of cloned and expressed human
and rat steroid 5-alpha-reductases
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3640-3644 (1990)
STANDARD full staff_review
COMMENT Draft entry and computer-readable sequence [1] kindly submitted by
D.W. Russell, 23-FEB-1990, for release after publication.
FEATURES from to/span description
pept 31 810 steroid 5-alpha-reductase (EC 1.3.99.5)
mRNA < 1 2102 steroid 5-alpha-reductase mRNA
BASE COUNT 470 a 482 c 486 g 664 t
ORIGIN
1 gggcatggag cacgctgccc agccctggcg atggcaacgg cgacgggggt ggcggaggag
61 cgcctgctgg ccgcgctcgc ctacctgcag tgcgccgtgg gctgcgcggt cttcgcgcgg
121 aatcgtcaga cgaactcagt gtacggccgc cacgcgctgc ccagccacag gctccgagtg
181 ccggcgcggg ccgcctgggt ggtgcaggag ctgccctcgc tggccctgcc gctctaccag
241 tacgccagcg agtccgcccc gcgtctccgc agcgcgccca actgcatcct cctggccatg
301 ttcctcgtcc actacgggca tcggtgctta atttacccgt ttctgatgcg aggaggaaag
361 cctatgccac tgttggcatg tacaatggcg attatgttct gtacctgtaa cggctatttg
421 caaagcagat acttgagcca ttgtgcagtg tatgctgatg actgggtaac agatccccgt
481 tttctaatag gttttggctt gtggttaaca ggcatgttga taaacatcca ttcagatcat
541 atcctaagga atctcagaaa accaggagat actggataca aaataccaag gggaggctta
601 tttgaatacg taactgcagc caactatttt ggagaaatca tggagtggtg tggctatgcc
661 ctggccagct ggtctgtcca aggcgcggct tttgctttct tcacgttttg ttttttatct
721 ggtagagcaa aagagcatca tgagtggtac ctccggaaat ttgaagagta tccaaagttc
781 agaaaaatta taattccatt tttgttttaa gtgcgttttt catgaaatta tcttcaactt
841 gaagctttcc aatggcgctt ctctatggac tttgtaaata agttatatct ttgtaatttt
901 cctgctactt tatcattttc aagatgtcct ctaggaattt tttttctagt aattttgcaa
961 tctacctaat aagtacctaa atacgctgaa atggaggttg aatatcctac tgtgtaacag
1021 gtcagaattt caagctctgg gtaataactg ctgatatttt ttctaatttc aaatttacct
1081 cttttggcta tgtcttgcca agtgtgtatg agactagact ttacaactgt ctttgatggc
1141 attttcagaa caataaatgt cacaatccct tctatagccc cctacagtga tctcttcaag
1201 gtcaactgca gtgttgcttc cctcccccta tagggctgga atctgtctag gagccctctc
1261 tcggaggcca cagaggctgg gggtagccat tgtgcagtca tggcccgggg gaaacttgcc
1321 aaccttcgtg tcaggtgctg tgtgtaagtg gagaacttgg ggatagagga ggaagctcct
1381 cgtggccctt ccaaggtgag gcaaaggcat ctggacttgt tccagcccag cccaccgggt
1441 gacatcaccg ggcagggagg ggtgctggtg gtggttcata cggagtaagc tgctctgcct
1501 gtgtgagtgg ctcctgggcc ctaaacaggc acctttaggc catgggtcac tcaccgtgag
1561 ccatcaatgt gctctggtct gacatggttt ctctctgtct tctagtctag acctagtttt
1621 tttgttctgt tccccacgta tggatatagt agagattgtt gtctgtgaaa tttctctttt
1681 gtagattttg agttttccct tgtagtgtaa agaatgatca ctttctgtaa caataacaag
1741 accacttttt aagatttatc ctgtttgttc tttgttgatt gaaacataat aattgttaaa
1801 attctctaca gccttctttt tcttccatag ctaatcttcc ttctaatagt ttttgctttc
1861 tgttttgctg ttgttgcttt gcaaagcttt cccctcatag cctgtacctg ttatcaatat
1921 aaaataatct tcctgttgaa tgcttcatga cttgaattct actttgataa aaacattgcc
1981 atactgcttt ttatcttgat gaattcatct ggcattgctt tgccttatca tctcatctgg
2041 agtttttaaa tgccatttgt ttcagttgtc tttaacaaca taataaatag actttgccat
2101 tt
//
LOCUS SPIBADH 1797 bp ss-mRNA PLN 11-AUG-1990
DEFINITION Spinach betaine-aldehyde dehydrogenase (BADH) mRNA, complete cds.
ACCESSION M31480
KEYWORDS betaine-aldehyde dehydrogenase.
SOURCE Spinach salt-stressed leaf, cDNA to mRNA.
ORGANISM Spinacia oleracea
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE 1 (bases 1 to 1797)
AUTHORS Weretilnyk,E.A. and Hanson,A.D.
TITLE Molecular cloning of a plant betaine-aldehyde dehydrogenase, an
enzyme implicated in adaptation to salinity and drought
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2745-2749 (1990)
STANDARD full staff_review
COMMENT Draft entry and printed sequence for [1] kindly submitted by
A.D.Hanson, 27-MAR-1990,, for release after publication.
FEATURES from to/span description
pept 68 1561 betaine-aldehyde dehydrogenase (BADH) (EC
1.2.1.8)
BASE COUNT 504 a 324 c 443 g 526 t
ORIGIN
1 cgttgcgtgc tcgccttacc ctctcaactc aatttcttca acccaatttc ttcgcattta
61 accaagaatg gcgttcccaa ttcctgctcg tcagctattc atcgacggag agtggagaga
121 acccattaaa aaaaatcgca tacccgtcat caatccgtcc actgaagaaa tcatcggtga
181 tattccggca gccacggctg aagatgtgga ggttgcggtg gtggcagctc gaagagcctt
241 taggaggaac aattggtcag caacatctgg ggctcatcgt gccacatact tgcgtgctat
301 tgctgctaag ataacagaaa aaaaagatca tttcgttaaa ctggaaacca ttgattctgg
361 gaaacctttt gatgaagcag tgctggacat tgatgacgtt gcttcatgtt ttgaatattt
421 tgccggacaa gcagaagctc ttgatggtaa acaaaaggct ccagtcaccc tgcctatgga
481 aaggttcaaa agtcatgttc tcaggcagcc ccttggtgtt gttggattaa tatccccatg
541 gaattaccca cttctaatgg ctacatggaa aattgctcca gcacttgctg ctgggtgtac
601 agctgtactt aagccatccg agttggcatc tgtgacttgt ctagaattcg gtgaagtttg
661 caacgaagtg ggacttcctc caggcgtgtt gaatatcttg acaggattag gtccagatgc
721 tggtgcacca ttagtatcac accccgatgt tgacaagatt gcctttactg ggagtagtgc
781 cactggaagc aaggttatgg cttctgctgc ccaattggtt aagcctgtta cattagaact
841 tgggggtaaa agtcctattg tagtgtttga agatgttgat attgataaag ttgtggaatg
901 gactattttt ggctgtttct ggacaaatgg tcaaatatgt agtgcaacgt ctagactgct
961 tgtgcatgaa agtattgcag ctgagtttgt tgataagctt gtaaaatgga cgaaaaacat
1021 taaaatttct gacccatttg aagaaggatg ccggcttggc cctgttatta gtaaaggaca
1081 gtacgacaaa attatgaagt tcatatcaac agcaaagagt gagggggcaa ctattttgta
1141 tggaggttcc cgtcctgagc atttgaagaa aggttattac attgaaccca ccattgtaac
1201 tgatatctcc acatccatgc aaatatggaa agaggaagtt tttggccctg tcttgtgtgt
1261 taaaacattt agttccgaag atgaagccat tgcattggca aatgatacag agtacggttt
1321 agctgctgct gtgttttcta atgatcttga aagatgtgag aggataacga aggctctaga
1381 agttggagct gtttgggtta attgctcaca accatgcttt gttcaagctc cttggggagg
1441 catcaagcgt agtggttttg gacgtgaact tggagaatgg ggtatccaga attacttgaa
1501 tatcaagcag gtgactcaag atatttctga tgaaccatgg ggatggtaca agtctccttg
1561 aagctatgat caaatttgaa tgacggtgtt gtttttgtta agtgagcagc ggttggactg
1621 taccttgaaa tggttcgcag agaaggtcga gtttacagta aaaatggatt gaataaaggg
1681 ttggttgatg cagaagtcca acaagcatag cttaattttg ttgtatcatg taatagtgtg
1741 tattatttca gacaattgag ttggatcaga aataagatga tacagagttt gacaatg
//
LOCUS ECORTSA 1407 bp ds-DNA BCT 11-AUG-1990
DEFINITION E.coli rts gene, complete cds.
ACCESSION M36321
KEYWORDS rts gene.
SOURCE E.coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1407)
AUTHORS Flamm,J.A., Friesen,J.D. and Otsuka,A.J.
TITLE The nucleotide sequence of the Escherichia coli rts gene
JOURNAL Gene 74, 555-558 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 294 1304 rts gene product (gtg start codon)
BASE COUNT 383 a 316 c 298 g 410 t
ORIGIN
1 gcgcgcattc tatggagaca tgcgagttca tgcaactaaa aaattgcata atttgtttta
61 ttggtcacat tttatgcgac acgatgaaga aacagccgaa cggagggcag attagcacac
121 tttttcaaca tcattgtgct caacaatgcg ctcctgctaa accataattc tttttatcag
181 atggaatatc tgtcacattg cttttcaacg atagcttcct ggcagagatt ttttcttatt
241 attcctcccc atctggtgtt accctcctgc ccattaaccc attcaacaga actgtgacgc
301 gccatggcaa atatcgcttt gccgatagag ctatgaccgc cagaaacatg cttatgagta
361 taaaagagca aacgttaatg acgccttacc tacagtttga ccgcaaccag tgggcagctc
421 tgcgtgattc cgtacctatg acgttatcgg aagatgagat cgcccgtctc aaaggtatta
481 atgaagatct ctcgttagaa gaagttgccg agatctattt acctttgtca cgtttgctga
541 acttctatat aagctcgaat ctgcgccgtc aggcagttct ggaacagttt cttggtacca
601 acgggcaacg cattccttac attatcagta ttgctggcag tgtcgcggtg gggaaaagta
661 caaccgcccg tgtattgcag gcgctattaa gccgttggcc ggaacatcgt cgtgttgaac
721 tgatcactac agatggcttc cttcacccta atcaggttct gaaagaacgt ggtctgatga
781 agaagaaagg cttcccggaa tcgtatgata tgcatcgcct ggtgaagttt gtttccgatc
841 tcaaatccgg cgtgccaaac gttacagcac ctgtttactc acatcttatt tatgatgtga
901 tcccggatgg agataaaacg gttgttcagc ctgatatttt aattcttgaa gggttaaatg
961 tcttacagag cgggatggat tatccacacg atccacatca tgtatttgtt tctgattttg
1021 tcgatttttc gatatatgtt gatgcaccgg aagacttact tcagacatgg tatatcaacc
1081 gttttctgaa attccgcgaa ggggctttta ccgacccgga ttcctatttt cataactacg
1141 cgaaattaac taaagaagaa gcgattaaga ctgccatgac attgtggaaa gagatcaact
1201 ggctgaactt aaagcaaaat attctaccta ctcgtgagcg cgccagttta atcctgacga
1261 aaagtgctaa tcatgcggta gaagaggtca gactacgcaa ataatttgca ggggagcgaa
1321 tactcccctt tcttattttt ctgcactacg cagggatatt tcaccgccca tccagggttt
1381 tattattcca tcctgctcaa gtaataa
//
LOCUS MUSBCLQ 1056 bp ss-mRNA ROD 11-AUG-1990
DEFINITION Mouse complement component subunit Clq B-chain mRNA, complete cds.
ACCESSION M36293
KEYWORDS complement component C1q.
SOURCE Mouse Englebreth-Holm-Swarm tumor, cDNA to mRNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 1056)
AUTHORS Wood,L., Pulaski,S. and Vogeli,G.
TITLE cDNA clones coding for the complete murine B chain of complement
Clq: Nucleotide and derived amino acid sequences
JOURNAL Immunol. Lett. 17, 59-62 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 157 918 complement component subunit Clq B-chain
precursor
sigp 157 231 complement component subunit Clq B-chain
signal peptide
matp 232 915 complement component subunit Clq B-chain
BASE COUNT 236 a 320 c 285 g 215 t
ORIGIN
1 acagcaaggg aaggttcttc cttcctctag ggacccagac ttccgctttc tgaggacagc
61 agcaggctct gggctctggg aatccactgc tgtccggcct agaagcatca cagaacacca
121 ggattccata cacaggaagc ccctgaggct gagctgatga agacacagtg gggtgaggtc
181 tggacacacc tgttactgct gcttctaggt tttctccatg tgtcctgggc ccaaagcagc
241 tgcaccgggc cccctggcat ccctggcatc cctggggtcc ctggggttcc tggctctgat
301 ggccaacctg gcactccagg gattaaaggg gagaaagggc tccctggact ggctggagac
361 cttggtgagt ttggagagaa aggggaccca gggatccctg ggactccagg caaagttggc
421 cctaagggtc ccgtcggccc taagggtact ccaggcccct ctggaccccg cggtcccaaa
481 ggcgattctg gggactacgg ggctacacag aaagtcgcct tctctgccct gaggaccatc
541 aacagcccct tgcgaccgaa ccaggtcatt cgcttcgaaa aggtgatcac caacgcgaac
601 gagaactatg agccacgcaa cggcaagttc acctgcaagg tgcctggcct ctactacttc
661 acctatcatg ccagctcccg gggcaacctg tgtgtgaatc tcgttcgtgg ccgcgatcgg
721 gacagcatgc agaaagtagt caccttctgt gactatgccc agaacacctt ccaggtgacc
781 acaggtgggg tagtcttgaa gctagagcaa gaggaggttg ttcacctgca ggccacagac
841 aagaactccc tcctgggcat tgagggtgcc aacagcatct tcactggctt tctgcttttc
901 cctgacatgg atgcgtaatc acggggtcaa attacaccta tccaacacca tcttcctgct
961 cctgcagcaa tcctccctgg acccctgaca tcaccccctt gactgcctga aacccagacc
1021 agagccctgt agatgttaca gaacgaatgg gtcaat
//
LOCUS MUSHOX23R 3542 bp ss-mRNA ROD 11-AUG-1990
DEFINITION Mouse Hox2.3 protein mRNA, complete cds.
ACCESSION X06762 M36183
KEYWORDS Hox2.3 protein; homeo box.
SOURCE Mouse plasmacytoma cell line, cDNA to mRNA, clones pRO.8, pR1.2,
and genomic clones L13 and L2.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 3542)
AUTHORS Meijlink,F., Verrijzer,P., De Laaf,L., De Graaff,W. and
Deschamps,J.
TITLE A mouse homeobox containing gene on chromosome 11: Sequence and
tissue-specific expression
JOURNAL Nucleic Acids Res. 15, 6773-6786 (1987)
STANDARD simple staff_review
REFERENCE 2 (bases 1184 to 2206; revises [1])
AUTHORS Verrijzer,P., De Graaff,W., Deschamps,J. and Meijlink,F.
TITLE Nucleotide sequence of the Hox2.3 gene region
JOURNAL Nucleic Acids Res. 16, 2729-2729 (1988)
STANDARD simple staff_review
FEATURES from to/span description
pept 2187 2840 Hox2.3 polypeptide
mRNA 2100 3373 Hox2.3 mRNA
site 2595 2774 homeo box
site 3371 3371 hox2.2 mRNA
BASE COUNT 738 a 924 c 951 g 890 t 39 others
ORIGIN Chromosome 11.
1 gagctcggcc ccgcgagcga cgcaggagct gggcctccca cagcagcgtc ccccgccgcg
61 ccagtccccg ctagtggtag tatctcgtaa tagcttctgt gtgtgagcta ccgtggatct
121 ccttcccttc tcttgggggt ccggggggaa aaaaagaaaa ggattttaag caaggactcc
181 ctcgtcctgc gagggtgatc gactgcggcc tggcagaacc ccctcgcccc cgccccatgt
241 aaaaaagcct ccttgtgcaa tggtctgttt cctttgaacg tgcttctttg taatgaccga
301 ggtaccgatt tctgctaagt tttcccaaca acatgaaact gcctattcac gccgtaattc
361 tttctgtctc ccgctcactt tctctctttc tctcgctctc tttctctcac cgcgtcccca
421 tctttcctcg caaccccctc tccccgctgc cctccctagc tggctttctc tcttgcttct
481 ctcttttcct cctgtccccc cacccccacc ccctttggtt tgacaatttt gtcttaagtg
541 tttctcaaaa gagattactt tagttagcat gcgcgctgtg agcattgtta aaagtgttct
601 taggtttact gtgaagagaa tgtatcctgt atctgtgaat tgctttatgg gggggaggga
661 gggctaatta tatattttgt tgttcctcta tactttgttc tgttgtctgc gcctgaaaag
721 ggcggaagag ttacaataaa gtttacaagc agagaacccg agnnnnnnnn nnnnnnnnnn
781 nncccgggcc gccgctcctc attcgctcct aggcgccttg cagggctggg ggtggggggg
841 agctggtcag caggctcctg ggctggccta ggctaggtcg ctgagaggag ggggcggggg
901 cgggggctgg aagcaggtgg tgcgagtccc tgggcccagg ggcgcagggg gtgagggagg
961 cggctgaacg tgattggagg agagaggatc gagggagggg agccaagaga aaccccctcc
1021 ccttgcattc tgaggctgaa ggaccaggga gactccagcg cccaggccgc tcttgggaag
1081 agatctaccc aggctggtgg ctagtgtccc ccgccgcttt tctctttgtt tccgtgtgtg
1141 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tttggggnnn nnnnnnnnnn
1201 nnnnnngggg tgagaaagat gcagcgcaag acttctgagt ttgtatttcc attttttcct
1261 tggactcagg ttggggaaac aggagcagag ggaagcggtt cctccctacc tcccctctct
1321 gggacgtcgt cactttctcc gacgtttcta ggcctcggct tgccgcagcc ttgcttcctt
1381 cgttgcttct gccttcctgg cagccacgct ccagtgagtg aggcatccgc cttccggaac
1441 cgggaaagca aagagccgga cccaagcctc cttcctccct tcctttttct cccagccccc
1501 attccattct ttttaaattt tgtatatctt tttattgtca tcagaaatct gagcgtccaa
1561 cgccttattg ggttggatct ctgccttagg gacgccttgg tcacatctag ttactacaac
1621 tggggcacta agacaaccgg gaggccaggc tcgcctcctc ttgggagaag agcagcagct
1681 cggatgaatt aacccaaatt aataaatatt cggccagcac ccaccgacca agttgcgaac
1741 attcaatccc tgcgtctctc tggctctgta accggctggg ggaaatgggt gggggatgac
1801 aacacggttc cctcagaggt tatttatttt ctcttccact caattccttc ttccccaaat
1861 ctcgcctgca agctgcctcc agcccgcggg ggtcgacagc ggcccttaag cccccagccc
1921 caatccgcag agctcggcct tcccattcat tattgatcat attttataaa tcgaacgcga
1981 cacaattttt tccacattac tgggagcctc cgggaggccg tcatagcatt ggccgagggg
2041 atatcacgtg ggccggggtc acgtggtcag aagaggaaaa agggggtcct tttggtgtaa
2101 atctggactc taattctgta atatatcaag gaatctcgta aaaccgacac taaaacgtcc
2161 ccgactacaa atcatccggc caaattatga gttcattgta ttatgcgaat gctttatttt
2221 ctaaatatcc agccgcaagt tcggttttcg ctccaggagc cttccccgaa caaacttctt
2281 gcgcctttgc ttccaacccc cagcgcccgg gctatggagc aggtccgggc gctcctttct
2341 ccgcctcggt gcagggtctg tactccggcg ggggggccat ggcgggccag agcgcggctg
2401 gcgtctatgc ggccggctac gggctcgaac cgagttcctt caacatgcac tgcgcgccct
2461 ttgagcagaa cctctccggg gtgtgtccgg gcgacgccgc caaggccggt ggcgccaagg
2521 agcagaggga ctcggacttg gcggccgaga gtaacttccg gatctacccc tggatgcgaa
2581 gctcagggcc tgaccgaaag cgaggccgcc agacctacac gcgctaccag accctggagc
2641 tggagaaaga atttcactac aatcgctacc tgactcggcg gaggcgcatc gagatcgcgc
2701 acacgctctg cctcaccgaa agacagatca agatctggtt tcagaaccgg cgcatgaagt
2761 ggaaaaagga gaacaaaacc tcaggaccgg gaaccaccgg ccaggacaag gcggaaggcg
2821 aggaggagga ggaagagtga gggacagaga aagccaagag gaggaaagcg aagagaggga
2881 gaacccaatt atgggaactg aagcatgaaa ctcaaataaa ggggcaaact atttaaatga
2941 agaggtctaa aaataataag gaaagagaag ggtgaaattt gggtttctta acactgtaaa
3001 aaaatatact acctatggga aagtgtgttg tctgtttttg tacagtatcg gaaggacatt
3061 atctacctgt tctgtggctt tctggaatgt gcctcccctt ttctatgttg ctagtaaggt
3121 ctttgtaaaa tcttgctgtt ttgtaagccc tctttgaagc tgtctttgtg aactgtggtt
3181 ccagatgaac agattagtgc ggttccttac ctaccccata cttcccagta gcagtactag
3241 aggggttgta tggagccctg aggatccact agcttctgcg tctggtgcat tttggctgct
3301 gtttctaggt cctattcacg agcctctttc tgtatatctg aaggatggaa aataaaacag
3361 gattaaatat caacagacac gctgtgtttt ctgtctccct ctggctgggg gaggtgtaag
3421 ggttggtagt gggcgagtgt cccagagccc ctctgctggt ctttttctgt cggtgctggg
3481 tttgctttta ttttccagat agcctctgct atagagactg ccagctagtg aatccccaac
3541 at
//
LOCUS POLVP1A 900 bp ss-RNA VRL 11-AUG-1990
DEFINITION Poliovirus type 3 VP1 gene, partial cds.
ACCESSION M37321 M36161 M36162
KEYWORDS .
SOURCE Poliovirus type 3 (P3/Leon/1947) RNA.
ORGANISM Poliovirus type 3
Unclassified.
REFERENCE 1 (bases 1 to 900)
AUTHORS Minor,P.D., Evans,D.M.A., Schild,G.C., Ferguson,M. and Almond,J.W.
TITLE Identification of an antigenic site in the neutralization of type 3
poliovirus
JOURNAL Rev Infect Dis 6, 516-518 (1984)
STANDARD simple staff_review
REFERENCE 2 (bases 34 to 900)
AUTHORS Almond,J.W., Cann,A.J., Minor,P.D., Reeve,P., Schild,G.C.,
Hauptmann,R. and Stanway,G.
TITLE Nucleotide sequence from neurovirulent and attenuated strains of
type 3 poliovirus
JOURNAL Rev Infect Dis 6, 487-493 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept < 1 > 900 VP1 (AA at 1)
BASE COUNT 238 a 250 c 206 g 206 t
ORIGIN
1 ggtattgaag atttgatttc tgaagttgca cagggcgccc taactttgtc actcccgaag
61 caacaggata gcttacctga tactaaggcc agtggcccgg cgcattccaa ggaggtacct
121 gcactcactg cagtcgagac tggagccacc aatcctctgg caccatccga cacagttcaa
181 acgcgccacg tagtccaacg acgcagcagg tcagagtcca caatagaatc attcttcgca
241 cgcggggcgt gcgtcgctat tattgaggtg gacaatgaac aaccaaccac ccgggcacag
301 aaactatttg ccatgtggcg cattacatac aaagatacag tgcagttgcg ccgtaagttg
361 gagtttttca catactctcg ttttgacatg gaattcacct tcgtggtaac cgccaacttc
421 accaacgcta ataatgggca tgcactcaac caggtgtacc agataatgta catcccccca
481 ggggcaccca caccaaagtc atgggacgac tacacttggc aaacatcttc caacccgtcc
541 atattttaca cctatggggc tgccccggcg cgaatctcag tgccatacgt ggggttagcc
601 aatgcttact cgcactttta cgacggcttc gccaaggtgc cattgaagac agatgccaat
661 gaccagattg gtgattcctt gtacagcgcc atgacagttg atgactttgg tgtattggca
721 gttcgtgttg tcaatgatca caaccccact aaagtaacct ccaaagtccg catttacatg
781 aaacccaaac acgtacgtgt ctggtgccct agaccgccgc gcgcggtacc ttattatgga
841 ccaggggtgg actataagaa caacttggac cccttatctg agaaaggttt gaccacatat
//
LOCUS YSCATRP3A 1240 bp ds-DNA PLN 11-AUG-1990
DEFINITION S.cerevisiae glutamine amidotransferase (TRP3) gene, 3' end.
ACCESSION M36300
KEYWORDS TRP3 gene; glutamine amidotransferase.
SOURCE S.cerevisiae DNA.
ORGANISM Saccharomyces cerevisiae
Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
Endomycetales; Saccharomycetaceae.
REFERENCE 1 (bases 1 to 1240)
AUTHORS Aebi,M., Furter,R., Prantl,F., Niederberger,P. and Huetter,R.
TITLE Structure and function of the TRP3 gene of Saccharomyces
cerevisiae: Analysis of transcription, promoter sequence, and
sequence coding for a glutamine amidotransferase
JOURNAL Curr. Genet. 8, 165-172 (1984)
STANDARD simple staff_review
FEATURES from to/span description
pept 401 > 1240 glutamine amidotransferase (TRP3)
mRNA 309 > 1240 TRP3 mRNA (alt.)
mRNA 314 > 1240 TRP3 mRNA (alt.)
mRNA 320 > 1240 TRP3 mRNA (alt.)
BASE COUNT 352 a 267 c 259 g 362 t
ORIGIN
1 ttcacttttt ttgcttctta taggcctttt tgaactattt tctgttaggg gcttaaagaa
61 agcgcagcaa atggctgatt ttaaatgctg ttttctcgaa agcaatgctg atgatgtacc
121 acaaagtagt atttgcgctt agcttgtcga tataagtatg cctttgagcc agtattaaac
181 ttcctactat ctcttctgga cctcaccgtg ctattcatat gactgacgac ggaaaatgcc
241 tcttgtagta atcattgaat gatcatcaat ggctcgttga ctcattctaa tctatagcag
301 atattaaggt acgcaaaaag ttcgacaagg agctggtcag tattaacaca ttaccaaatc
361 tgtttggttc tacatagaac gccataaaag taagaaaaaa atgtctgtgc acgctgcaac
421 aaacccaatc aataagcatg tggttctaat tgacaactac gattccttta cctggaacgt
481 ttacgagtac ttgtcccagg agggcgccaa agtgagcgtc taccgtaacg atgcaattac
541 agttccagaa attgccgcct tgaatcccga cacattgctt atctcgcttg gactaggcca
601 cccaaagaca gattctggca tttcaagaga ctgtatccgg tattttactg ggaaaattcc
661 tgtatttgga atctgtatgg gccagcaatg catgtttgac gtatttggtg gtgaagttgc
721 ctacgctggt gagattgtcc acggtaaaac gtccccaatc tctcacgaca actgtggaat
781 tttcaagaac gtgccgcaag gtattgctgt gacaagatac cattcattgg ccgggacaga
841 atcgtcccta ccatcctgct tgaaggttac tgcgagtacc gaaaatggaa ttatcatggg
901 tgtaagatac aagaagtaca ctgtagaagg tgtgcaattt catccggaat ccatcttgac
961 cgaggaaggt catctgatga tcaggaacat tttaaatgtc agtggaggca cttgggagga
1021 aaacaaatca tctccttcaa attctatttt ggaccgtatc tatgctcggc gtaaaataga
1081 cgtcaatgag cagtctaaaa tcccaagttt cacctttcaa gacttacaat ctaactatga
1141 tttaggtctt gccccaccgt tacaggattt ctacacggtg ttgtcatcat cccataaaag
1201 agccgttgtt cttgctgaag tcaagcgtgc ctctccatcg
//